BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (273 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF...   553   e-156
UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E...   389   e-107
UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID...   257   3e-67
UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo...   230   5e-59
UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba...   228   2e-58
UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona...   206   6e-52
UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o...   205   1e-51
UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote...   196   7e-49
UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione...   185   2e-45
UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo...   154   2e-36
UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide...   152   2e-35
UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu...   145   2e-33
UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ...    80   9e-14
UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or...    58   4e-07
UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ...    49   1e-04
UniRef50_A9V0Z1 Predicted protein n=1 Tax=Monosiga brevicollis R...    42   0.015

>UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI
          Length = 273

 Score = 553 bits (1425), Expect = e-156, Method: Compositional matrix adjust.
 Identities = 267/273 (97%), Positives = 270/273 (98%)

Query: 1   MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60
           MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE
Sbjct: 1   MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60

Query: 61  GFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLF 120
           GFQPFFACQTRVRDP RREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPG+F
Sbjct: 61  GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120

Query: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180
           RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIF+RVEEKRDAMQSLLLPPP Q
Sbjct: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180

Query: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240
           QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG
Sbjct: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240

Query: 241 GRSHTRAVRGIDGDVKLNRALWVMAEALLTQLQ 273
           GR+HTRAVRGIDGDVKLNRALWVMAE LLTQLQ
Sbjct: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273

>UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI
          Length = 273

 Score = 389 bits (1000), Expect = e-107, Method: Compositional matrix adjust.
Identities = 178/267 (66%), Positives = 221/267 (82%), Gaps = 1/267 (0%) Query: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 Query: 63 QPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 QPFFACQ+RVRD GRRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+FD+V + +AM+ + L Q Sbjct: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 Query: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 Query: 242 RSHTRAVRGIDGDVKLNRALWVMAEAL 268 + TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 257 bits (656), Expect = 3e-67, Method: Compositional matrix adjust. 
Identities = 130/270 (48%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Query: 4 LASRFG-AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 LA+RFG ++ I PL E L+R VPS+F+ + H+SRSERY Y+PTI +++ L+REG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 63 QPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 PFFA Q+ RD R H KHMLRLRRE + + E I++NSHDGTS++Q+ G+ R Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 VC N ++ GE F EVRVPHKG++ +IEG Y V F R+ + + M+ + L Q+ Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 183 LAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG--RNAKG 240 L + +L R+GED P+T QI+ PRR++D + LWTT+ IQEN+I+GGL G RNA+G Sbjct: 186 LGEVSLVARYGEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAEG 245 Query: 241 --GRSHTRAVRGIDGDVKLNRALWVMAEAL 268 RS +R + GID +V LNRALW +AE + Sbjct: 246 RIRRSRSRPINGIDQNVTLNRALWTLAEGM 275 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 230 bits (586), Expect = 5e-59, Method: Compositional matrix adjust. 
Identities = 125/276 (45%), Positives = 171/276 (61%), Gaps = 13/276 (4%) Query: 6 SRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP 64 +RFG+ A ++R + L L P+VF+EDKH SRS++YTYIPT+ +L L REGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 65 FFACQTRVRDPGRREHTKHMLRLRREGQI---TGKQVPEIILLNSHDGTSSYQMLPGLFR 121 RD +R +TKH+LRLRR G G E++LLNSHDGTSSYQ++ GLFR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 122 AVCQNGLVCGESFGEV-RVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180 +C NGLVC + ++ ++PHKGD+V QVI+GAY ++ + V+ M+ + L P Q Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 181 QALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+AA R+ + Q PV QI +PRR +D N LW + R QE LI+GG+ R Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 237 NAKGG----RSHTRAVRGIDGDVKLNRALWVMAEAL 268 N + G R TR V+G+DG+ LNRALWV+A + Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRM 289 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 228 bits (580), Expect = 2e-58, Method: Compositional matrix adjust. 
Identities = 120/266 (45%), Positives = 167/266 (62%), Gaps = 9/266 (3%) Query: 12 NLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTR 71 ++ R PLT EL VPS+F+ + HESRS R+ +PTI++LD L+ EGF+PFFA Q R Sbjct: 44 SIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQAR 103 Query: 72 VRDPGRREHTKHMLRLRREGQIT-GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVC 130 R G+ E TKHMLRLR G + + EI+L+N++DGTS+YQM+PG FR VC NGL+ Sbjct: 104 TRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMA 163 Query: 131 GESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTY 190 GE+F EV+V H G+ + +VIEGAY VL RV ++ +S+ L ++ LA+AA + Sbjct: 164 GETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSL 223 Query: 191 RFGEDHQ----PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR-NAKGG---R 242 RF + P+ +L PRR +D + DLWT + +QEN ++GG+ GR G R Sbjct: 224 RFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRR 283 Query: 243 SHTRAVRGIDGDVKLNRALWVMAEAL 268 R V GID LNRALW++ E + Sbjct: 284 QTVREVTGIDQSRALNRALWMLTERM 309 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 206 bits (525), Expect = 6e-52, Method: Compositional matrix adjust. 
Identities = 113/269 (42%), Positives = 159/269 (59%), Gaps = 17/269 (6%) Query: 15 RRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRD 74 R +PLT E+L R+ PS+F+E KHESRS+RYTYIPTI ++ L+ EGF P A Q R Sbjct: 11 RHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPVMARQGNSRI 70 Query: 75 PGRREHTKHMLRLRREG-----QITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLV 129 PG+ E+TKH++R R + G PE+ LLNSHDGTS+Y+++ + R C+NG+V Sbjct: 71 PGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMMRLACENGMV 130 Query: 130 CGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 ++ E+ VPHKG V +VIEG+Y VL + E L Q+ A+A Sbjct: 131 VQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQQKGFAEAVH 190 Query: 189 TYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS-------GRNAK 239 ++G+D + P T L RR D+ DLW R+QE+ I+GG++ GRN K Sbjct: 191 IAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFRWDEDGRNRK 250 Query: 240 GGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 R R V+ IDGD+KLN+A+W +A+ L Sbjct: 251 --RVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 205 bits (522), Expect = 1e-51, Method: Compositional matrix adjust. 
Identities = 99/220 (45%), Positives = 148/220 (67%), Gaps = 4/220 (1%) Query: 53 LLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRR---EGQITGKQVPEIILLNSHDG 109 ++++L+REG+ P A ++RVR P R+ +KH+LR RR E + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 110 TSSYQMLPGLFRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRD 168 + +YQ+ GLFR VC NG++ +S G+V+ H GDVV +VIEG YE++ R+ + + Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 169 AMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENL 228 ++L L Q+ A++AL R+ E P +L PRR +D+ NDLW TYQR+QEN+ Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWREGEAPCMPQALLRPRRHEDQGNDLWATYQRVQENM 180 Query: 229 IKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 +KGG+ GR+A G + TRAV+ +DG+VKLN+ALW + E + Sbjct: 181 LKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQM 220 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 196 bits (498), Expect = 7e-49, Method: Compositional matrix adjust. 
Identities = 112/264 (42%), Positives = 157/264 (59%), Gaps = 13/264 (4%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGR 77 R +T E+++V PS+F+ HESRS+R+ IPTI +L L EGF P A Q+ R G+ Sbjct: 121 RTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEGFVPVGAKQSASRTEGK 180 Query: 78 REHTKHMLRLRR--EGQI--TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGE- 132 + TKH++RLRR +G+ G V EI+L N++DGTS+Y++L GLFR C N LV Sbjct: 181 ADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLAGLFRIRCMNSLVTQTG 240 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRF 192 + ++V H GDV ++VIEG Y VL +R + L QQ +A+AA RF Sbjct: 241 TIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLNRDEQQIMAEAAHVLRF 300 Query: 193 ----GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRN----AKGGRSH 244 GE P+ Q+L PRR D ++DLWT + QEN+I+GGL G + R Sbjct: 301 GDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGGLRGIGREDLGRPRRVK 360 Query: 245 TRAVRGIDGDVKLNRALWVMAEAL 268 +RAV GID D+KLN+ALW++ E + Sbjct: 361 SRAVNGIDQDIKLNKALWLIGEKM 384 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 185 bits (469), Expect = 2e-45, Method: Compositional matrix adjust. 
Identities = 103/257 (40%), Positives = 151/257 (58%), Gaps = 7/257 (2%) Query: 20 LTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRRE 79 LT E+L++ PS+F+ SERY I T ++D L +EGF P A Q+ R ++ Sbjct: 16 LTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKATQSASRSEEKKV 75 Query: 80 HTKHMLRLR-REGQITGKQV-PEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEV 137 +KH++R R R+ G + PE++L+NSHDG SSY+++ GL+R VC NGLV G+S+ EV Sbjct: 76 FSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTNGLVAGKSYDEV 135 Query: 138 RVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQ 197 RV H+GDV+ VIEG Y V+ ++ + + M LP + A RF ED Sbjct: 136 RVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQAHALRFSEDAN 195 Query: 198 PVTE-SQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG----RNAKGGRSHTRAVRGID 252 V E +L PRR +D DL++ + +QENLIKGG+ G + + R+ +R + ID Sbjct: 196 LVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWRRARSRKITSID 255 Query: 253 GDVKLNRALWVMAEALL 269 +VK+NR LW +AE L Sbjct: 256 QNVKINRDLWTIAENTL 272 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 154 bits (390), Expect = 2e-36, Method: Compositional matrix adjust. 
Identities = 86/263 (32%), Positives = 142/263 (53%), Gaps = 19/263 (7%) Query: 17 DRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPG 76 + PLT E+L ++ PS+F+++ + S++Y +I TI +++ ++ + P + VRD Sbjct: 5 NEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDEK 64 Query: 77 RREHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-ES 133 + KH +R R G+ V E++L NSHD + + + G+FR VC NGLV E Sbjct: 65 KEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDEV 124 Query: 134 FGEVRVPHKGD-------VVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKA 186 F ++ H GD ++++ + Y++L K + L + + AKA Sbjct: 125 FESYQIKHLGDKENDVSIAINKIAKAKYDILN-------KIKLFSKIPLTQDDKASFAKA 177 Query: 187 ALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHT- 245 A+ RF E H V +L P R +DE +DL+TT+ IQE+LI+G +SG NA+ R T Sbjct: 178 AIPLRF-EKHLKVDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTS 236 Query: 246 RAVRGIDGDVKLNRALWVMAEAL 268 R ++ I D +N+ LW MAE++ Sbjct: 237 RIIKSISTDTDINKKLWNMAESI 259 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 83/112 (74%), Positives = 98/112 (87%) Query: 157 LGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESND 216 + FD V EKR+ MQSLLLPPPAQQALA+AALTYRFGE+HQP+TE Q+L PRRW+D+ +D Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQPITEEQVLQPRRWEDKKDD 60 Query: 217 LWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 LWT YQR+QENLIKGGLSGRNAKG R+ TR+V GIDGD+KLN+ALWVM E + Sbjct: 61 LWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 99/271 (36%), Positives = 145/271 (53%), Gaps = 23/271 (8%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGR 77 R L+ ++L RV PSVF+E S RYT++ T ++D L+ EG++P A Q RVR R Sbjct: 14 RALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQQRVRLENR 73 Query: 78 REHTKHMLRLRREGQI------TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG 131 + H LR R + G PE+IL N+HDGT +Y++ GL+R VC+NGL Sbjct: 74 QGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLVCRNGLTVA 133 Query: 132 ES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTY 190 ++ F V + H + A V RV E Q++ L P A+ + A A+ Sbjct: 134 DADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHSFAARAMAL 193 Query: 191 RFGEDHQPVTE----SQILSPRRWQDESNDLWTTYQRIQENLIKGGL--SGR--NAKGG- 241 R+ + QPVT Q+L+P R+ D++ DLWTT+ +QE L +GGL +G A+G Sbjct: 194 RW-DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGHIPAAEGAV 252 Query: 242 ------RSHTRAVRGIDGDVKLNRALWVMAE 266 R+ TR V G+ +LN+ALW +AE Sbjct: 253 FPTHYLRNTTRPVGGLTEGQRLNKALWNLAE 283 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 79.7 bits (195), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 42/161 (26%), Positives = 81/161 (50%), Gaps = 3/161 (1%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGR 77 +PL+ EL R+ PS+F+ + + S++Y +I TI +++ ++ + P + VR+ + Sbjct: 6 QPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNEDK 65 Query: 78 REHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-ESF 134 + +H +R R + E++L NSHD + + + G+FR VC NGLV E F Sbjct: 66 EGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADEVF 125 Query: 135 GEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLL 175 ++ H G+ + V ++ D++ +K + L Sbjct: 126 ESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITL 166 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 57.8 bits (138), Expect = 4e-07, Method: Compositional matrix adjust. 
Identities = 27/52 (51%), Positives = 40/52 (76%), Gaps = 1/52 (1%) Query: 4 LASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLL 54 LASRF + + +R D PL+ +++ RV PS+F++ HESRSERY+YIPT ++L Sbjct: 36 LASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 19/48 (39%), Positives = 32/48 (66%) Query: 96 KQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKG 143 K + ++++NS+DG+ ++Q+ G FR VC NG++ GE F + V H G Sbjct: 139 KVILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDVRHTG 186 >UniRef50_A9V0Z1 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V0Z1_MONBE Length = 981 Score = 42.4 bits (98), Expect = 0.015, Method: Compositional matrix adjust. Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 12/83 (14%) Query: 27 RVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHML- 85 RV+P +FSE + + R + + I QPFFA R D RE KH+L Sbjct: 181 RVLPKLFSESREQDRHDVAKHKAQI-----------QPFFAMYERHFDESVREEPKHILA 229 Query: 86 RLRREGQITGKQVPEIILLNSHD 108 RL ++ + + V I +L+ HD Sbjct: 230 RLEQQPAVENRSVTHIFILHCHD 252 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 424 e-117 UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 406 e-112 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 353 5e-96 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 331 2e-89 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 330 3e-89 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 328 2e-88 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 327 3e-88 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 
  325   1e-87
UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo...   323   4e-87
UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o...   299   7e-80
UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu...   277   2e-73
UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ...   214   2e-54
UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide...   147   5e-34
UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ...    78   2e-13
UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or...    73   1e-11

Sequences not found previously or not previously below threshold:

UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostri...    55   3e-06
UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfo...    55   3e-06
UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophu...    53   1e-05
UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavoba...    52   2e-05
UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=...    48   4e-04
UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victiva...    47   9e-04
UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=...    46   0.001
UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfo...    45   0.003
UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax...    44   0.007
UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwen...    43   0.010
UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=...    42   0.016
UniRef50_C2LEJ7 Putative uncharacterized protein n=1 Tax=Proteus...    42   0.017
UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax...    42   0.027
UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=...    41   0.040
UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betapro...    41   0.046

>UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI
          Length = 273

 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats.
 Identities = 267/273 (97%), Positives = 270/273 (98%)

Query: 1   MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60
           MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE
Sbjct: 1   MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60

Query: 61  GFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLF 120
           GFQPFFACQTRVRDP RREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPG+F
Sbjct: 61  GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120

Query: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180
           RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIF+RVEEKRDAMQSLLLPPP Q
Sbjct: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180

Query: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240
           QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG
Sbjct: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240

Query: 241 GRSHTRAVRGIDGDVKLNRALWVMAEALLTQLQ 273
           GR+HTRAVRGIDGDVKLNRALWVMAE LLTQLQ
Sbjct: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273

>UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI
          Length = 273

 Score = 406 bits (1044), Expect = e-112, Method: Composition-based stats.
Identities = 178/267 (66%), Positives = 221/267 (82%), Gaps = 1/267 (0%) Query: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 Query: 63 QPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 QPFFACQ+RVRD GRRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+FD+V + +AM+ + L Q Sbjct: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 Query: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 Query: 242 RSHTRAVRGIDGDVKLNRALWVMAEAL 268 + TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 353 bits (905), Expect = 5e-96, Method: Composition-based stats. 
Identities = 130/270 (48%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Query: 4 LASRFG-AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 LA+RFG ++ I PL E L+R VPS+F+ + H+SRSERY Y+PTI +++ L+REG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 63 QPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 PFFA Q+ RD R H KHMLRLRRE + + E I++NSHDGTS++Q+ G+ R Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 VC N ++ GE F EVRVPHKG++ +IEG Y V F R+ + + M+ + L Q+ Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 183 LAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG--RNAKG 240 L + +L R+GED P+T QI+ PRR++D + LWTT+ IQEN+I+GGL G RNA+G Sbjct: 186 LGEVSLVARYGEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAEG 245 Query: 241 --GRSHTRAVRGIDGDVKLNRALWVMAEAL 268 RS +R + GID +V LNRALW +AE + Sbjct: 246 RIRRSRSRPINGIDQNVTLNRALWTLAEGM 275 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 331 bits (848), Expect = 2e-89, Method: Composition-based stats. 
Identities = 111/268 (41%), Positives = 154/268 (57%), Gaps = 13/268 (4%) Query: 14 IRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVR 73 R +T E+++V PS+F+ HESRS+R+ IPTI +L L EGF P A Q+ R Sbjct: 117 FDTARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEGFVPVGAKQSASR 176 Query: 74 DPGRREHTKHMLRLRREGQ----ITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLV 129 G+ + TKH++RLRR G V EI+L N++DGTS+Y++L GLFR C N LV Sbjct: 177 TEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLAGLFRIRCMNSLV 236 Query: 130 CGE-SFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 + ++V H GDV ++VIEG Y VL +R + L QQ +A+AA Sbjct: 237 TQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLNRDEQQIMAEAAH 296 Query: 189 TYRF----GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRN----AKG 240 RF GE P+ Q+L PRR D ++DLWT + QEN+I+GGL G + Sbjct: 297 VLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGGLRGIGREDLGRP 356 Query: 241 GRSHTRAVRGIDGDVKLNRALWVMAEAL 268 R +RAV GID D+KLN+ALW++ E + Sbjct: 357 RRVKSRAVNGIDQDIKLNKALWLIGEKM 384 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 330 bits (846), Expect = 3e-89, Method: Composition-based stats. 
Identities = 119/266 (44%), Positives = 167/266 (62%), Gaps = 9/266 (3%) Query: 12 NLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTR 71 ++ R PLT EL VPS+F+ + HESRS R+ +PTI++LD L+ EGF+PFFA Q R Sbjct: 44 SIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQAR 103 Query: 72 VRDPGRREHTKHMLRLRREGQIT-GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVC 130 R G+ E TKHMLRLR G + + EI+L+N++DGTS+YQM+PG FR VC NGL+ Sbjct: 104 TRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMA 163 Query: 131 GESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTY 190 GE+F EV+V H G+ + +VIEGAY VL RV ++ +S+ L ++ LA+AA + Sbjct: 164 GETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSL 223 Query: 191 RFGEDHQ----PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR----NAKGGR 242 RF + P+ +L PRR +D + DLWT + +QEN ++GG+ GR + R Sbjct: 224 RFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRR 283 Query: 243 SHTRAVRGIDGDVKLNRALWVMAEAL 268 R V GID LNRALW++ E + Sbjct: 284 QTVREVTGIDQSRALNRALWMLTERM 309 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 328 bits (840), Expect = 2e-88, Method: Composition-based stats. 
Identities = 111/276 (40%), Positives = 157/276 (56%), Gaps = 13/276 (4%) Query: 6 SRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPF 65 S + R +PLT E+L R+ PS+F+E KHESRS+RYTYIPTI ++ L+ EGF P Sbjct: 2 SFLSRVSAHRHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPV 61 Query: 66 FACQTRVRDPGRREHTKHMLRLRREGQ-----ITGKQVPEIILLNSHDGTSSYQMLPGLF 120 A Q R PG+ E+TKH++R R G PE+ LLNSHDGTS+Y+++ + Sbjct: 62 MARQGNSRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMM 121 Query: 121 RAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPA 179 R C+NG+V ++ E+ VPHKG V +VIEG+Y VL + E L Sbjct: 122 RLACENGMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQ 181 Query: 180 QQALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG-- 235 Q+ A+A ++G+D + P T L RR D+ DLW R+QE+ I+GG++G Sbjct: 182 QKGFAEAVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFR 241 Query: 236 ---RNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 R R V+ IDGD+KLN+A+W +A+ L Sbjct: 242 WDEDGRNRKRVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 327 bits (838), Expect = 3e-88, Method: Composition-based stats. 
Identities = 125/276 (45%), Positives = 171/276 (61%), Gaps = 13/276 (4%) Query: 6 SRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP 64 +RFG+ A ++R + L L P+VF+EDKH SRS++YTYIPT+ +L L REGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 65 FFACQTRVRDPGRREHTKHMLRLRREGQI---TGKQVPEIILLNSHDGTSSYQMLPGLFR 121 RD +R +TKH+LRLRR G G E++LLNSHDGTSSYQ++ GLFR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 122 AVCQNGLVCGESFGEV-RVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180 +C NGLVC + ++ ++PHKGD+V QVI+GAY ++ + V+ M+ + L P Q Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 181 QALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+AA R+ + Q PV QI +PRR +D N LW + R QE LI+GG+ R Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 237 NAKGG----RSHTRAVRGIDGDVKLNRALWVMAEAL 268 N + G R TR V+G+DG+ LNRALWV+A + Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRM 289 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 325 bits (832), Expect = 1e-87, Method: Composition-based stats. 
Identities = 104/268 (38%), Positives = 150/268 (55%), Gaps = 11/268 (4%) Query: 13 LIRRDRP----LTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFAC 68 LI P LT E+L++ PS+F+ SERY I T ++D L +EGF P A Sbjct: 5 LIESGEPAMNVLTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKAT 64 Query: 69 QTRVRDPGRREHTKHMLRLRREG-QITGKQ-VPEIILLNSHDGTSSYQMLPGLFRAVCQN 126 Q+ R ++ +KH++R R G PE++L+NSHDG SSY+++ GL+R VC N Sbjct: 65 QSASRSEEKKVFSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTN 124 Query: 127 GLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKA 186 GLV G+S+ EVRV H+GDV+ VIEG Y V+ ++ + + M LP + Sbjct: 125 GLVAGKSYDEVRVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQ 184 Query: 187 ALTYRFGEDHQPV-TESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG----RNAKGG 241 A RF ED V +L PRR +D DL++ + +QENLIKGG+ G + + Sbjct: 185 AHALRFSEDANLVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWR 244 Query: 242 RSHTRAVRGIDGDVKLNRALWVMAEALL 269 R+ +R + ID +VK+NR LW +AE L Sbjct: 245 RARSRKITSIDQNVKINRDLWTIAENTL 272 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 323 bits (828), Expect = 4e-87, Method: Composition-based stats. 
Identities = 85/257 (33%), Positives = 138/257 (53%), Gaps = 5/257 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 + PLT E+L ++ PS+F+++ + S++Y +I TI +++ ++ + P + VRD Sbjct: 4 SNEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDE 63 Query: 76 GRREHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-E 132 + KH +R R G+ V E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 KKEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRF 192 F ++ H GD + V ++ + K + L + + AKAA+ RF Sbjct: 124 VFESYQIKHLGDKENDVSIAINKIAKAKYDILNKIKLFSKIPLTQDDKASFAKAAIPLRF 183 Query: 193 GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHT-RAVRGI 251 E H V +L P R +DE +DL+TT+ IQE+LI+G +SG NA+ R T R ++ I Sbjct: 184 -EKHLKVDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSRIIKSI 242 Query: 252 DGDVKLNRALWVMAEAL 268 D +N+ LW MAE++ Sbjct: 243 STDTDINKKLWNMAESI 259 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 299 bits (765), Expect = 7e-80, Method: Composition-based stats. 
Identities = 98/220 (44%), Positives = 147/220 (66%), Gaps = 4/220 (1%) Query: 53 LLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREG---QITGKQVPEIILLNSHDG 109 ++++L+REG+ P A ++RVR P R+ +KH+LR RR + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 110 TSSYQMLPGLFRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRD 168 + +YQ+ GLFR VC NG++ +S G+V+ H GDVV +VIEG YE++ R+ + + Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 169 AMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENL 228 ++L L Q+ A++AL R+ E P +L PRR +D+ NDLW TYQR+QEN+ Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWREGEAPCMPQALLRPRRHEDQGNDLWATYQRVQENM 180 Query: 229 IKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 +KGG+ GR+A G + TRAV+ +DG+VKLN+ALW + E + Sbjct: 181 LKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQM 220 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 277 bits (709), Expect = 2e-73, Method: Composition-based stats. 
Identities = 101/281 (35%), Positives = 146/281 (51%), Gaps = 23/281 (8%) Query: 10 AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQ 69 A R R L+ ++L RV PSVF+E S RYT++ T ++D L+ EG++P A Q Sbjct: 6 TAPSSRVFRALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQ 65 Query: 70 TRVRDPGRREHTKHMLRLRREGQI------TGKQVPEIILLNSHDGTSSYQMLPGLFRAV 123 RVR R+ H LR R + G PE+IL N+HDGT +Y++ GL+R V Sbjct: 66 QRVRLENRQGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLV 125 Query: 124 CQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 C+NGL ++ F V + H + A V RV E Q++ L P A+ + Sbjct: 126 CRNGLTVADADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHS 185 Query: 183 LAKAALTYRFGEDHQPVT----ESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+ R+ + QPVT Q+L+P R+ D++ DLWTT+ +QE L +GGL G Sbjct: 186 FAARAMALRW-DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGH 244 Query: 237 --NAKGG-------RSHTRAVRGIDGDVKLNRALWVMAEAL 268 A+G R+ TR V G+ +LN+ALW +AE Sbjct: 245 IPAAEGAVFPTHYLRNTTRPVGGLTEGQRLNKALWNLAEEF 285 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based stats. 
Identities = 43/170 (25%), Positives = 84/170 (49%), Gaps = 3/170 (1%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGR 77 +PL+ EL R+ PS+F+ + + S++Y +I TI +++ ++ + P + VR+ + Sbjct: 6 QPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNEDK 65 Query: 78 REHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-ESF 134 + +H +R R + E++L NSHD + + + G+FR VC NGLV E F Sbjct: 66 EGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADEVF 125 Query: 135 GEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALA 184 ++ H G+ + V ++ D++ +K + L + + A Sbjct: 126 ESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 147 bits (370), Expect = 5e-34, Method: Composition-based stats. Identities = 83/112 (74%), Positives = 98/112 (87%) Query: 157 LGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESND 216 + FD V EKR+ MQSLLLPPPAQQALA+AALTYRFGE+HQP+TE Q+L PRRW+D+ +D Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQPITEEQVLQPRRWEDKKDD 60 Query: 217 LWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 LWT YQR+QENLIKGGLSGRNAKG R+ TR+V GIDGD+KLN+ALWVM E + Sbjct: 61 LWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 78.5 bits (192), Expect = 2e-13, Method: Composition-based stats. 
Identities = 26/101 (25%), Positives = 49/101 (48%), Gaps = 5/101 (4%) Query: 80 HTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRV 139 HM+++ K + ++++NS+DG+ ++Q+ G FR VC NG++ GE F + V Sbjct: 127 FPAHMVQI----GSGDKVILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDV 182 Query: 140 PHKGDV-VSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPA 179 H G + QV + F+ + + D + + L Sbjct: 183 RHTGTMNFGQVTRQVTTAVSSFENMGQYWDTLINSPLNRKD 223 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 72.7 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 27/53 (50%), Positives = 41/53 (77%), Gaps = 1/53 (1%) Query: 3 RLASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLL 54 +LASRF + + +R D PL+ +++ RV PS+F++ HESRSERY+YIPT ++L Sbjct: 35 QLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RIH4_9CLOT Length = 312 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 77/218 (35%), Gaps = 21/218 (9%) Query: 42 SERYTYIPTISLL---DSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV 98 ++RY + D L EG GRR + +L + I+G ++ Sbjct: 76 TDRYKVVQNEDAFAFTDQLLGEG---VTYETAGSLQNGRRTWL--LAKLPQRYIISGDEI 130 Query: 99 -PEIILLNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 P ++ +N+HDGT + ++ R VC N L + H GD+ ++ + Y Sbjct: 131 TPYMVFMNTHDGTGAIRVAMTPVRVVCMNTLNLALSTAKRSWSTNHTGDIAGKMEDARYT 190 Query: 156 VLGIFDRVEE---KRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQD 212 +L + E D M+ L L A P + Q R +D Sbjct: 191 LLYADRYMSELGKAIDHMKRLRLSERQVMEYIDALFPLY----DNPTPQQQKNLNRMKED 246 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGG-RSHTRAVR 249 + +++ K G NA +H R +R Sbjct: 247 MKTRYFDAPDL--KHVGKNGYRFINAVSDFATHARPLR 282 >UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZS75_DESOH Length = 318 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. 
Identities = 36/157 (22%), Positives = 59/157 (37%), Gaps = 18/157 (11%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRL----RREGQITG-- 95 +ERY + + +L L R GF P Q + D ++R+ R G G Sbjct: 108 TERYKPLDNMDVLSQLLRHGFDPDTQVQYAIDDG------MFLVRIPEYARAFGVNPGYG 161 Query: 96 ---KQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDV-VSQVIE 151 + VP + NS G ++ + +R VC NGL+ S R H + + E Sbjct: 162 KLDEIVPGVSFANSEVGLLAFSIEAFFYRLVCTNGLISKTSSTFSRFKHISNRGLENFPE 221 Query: 152 GAYEVLGIFDRVEE--KRDAMQSLLLPPPAQQALAKA 186 V+ R +E K + P + + A+ Sbjct: 222 TIAGVIEDSVRKQEQFKLSRQSPVENPIRSIETFARQ 258 >UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LV02_SYNAS Length = 264 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 51/271 (18%), Positives = 93/271 (34%), Gaps = 57/271 (21%) Query: 15 RRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDS--------LQREGFQPFF 66 R +T+++L + ++ Y + L D L+ Sbjct: 8 RGGELVTKDQLDLI--------PLPEPTDSYMPVSHYDLADKFLMISQDILRDYKL--VG 57 Query: 67 ACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQN 126 R G + +L+ +RE G I NS+D + + + G VC N Sbjct: 58 ENYGIAR-QGNQFFA--VLKFQRERSEIG---LSIAFRNSYDRSMAIGLAIGASVFVCDN 111 Query: 127 GLVCGESFGEVRVPHKGDV----VSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 + GE V H +V + I Y+ +D++ DA +S LP A Sbjct: 112 LALSGEIV--VMKKHTKNVWSELEEKAIATIYKSQNNYDQLIGDVDAFKS--LPVDDNGA 167 Query: 183 LAKAALTYRFGEDHQPVTESQI-------LSPRRWQDESNDLWTTYQRIQENLIKGGLSG 235 A+ FG + ++ Q+ L P + E +LW+ Y E+L Sbjct: 168 F--QAMGLLFGNN--IISPRQLTVLKEEWLKPSHEEFEPRNLWSFYNAATESL------- 216 Query: 236 RNAKGGRSHTRAVRGIDGDVKLNRALWVMAE 266 + V ++ ++L+ AL + + Sbjct: 217 -------KSSPPVTIMEKHIRLHEALTYLGK 240 >UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXR9_FLAPJ Length = 285 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. 
Identities = 35/187 (18%), Positives = 65/187 (34%), Gaps = 32/187 (17%) Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEG- 152 K P + NS+DG+ G FR VC NGL + + H+G++ V+ Sbjct: 109 LDKIRPMLRFTNSYDGSCKTSGTFGFFREVCSNGLHTASTDIGFSLKHRGNINELVLPAI 168 Query: 153 ---AYEVLG----IFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRF-GEDHQP---VTE 201 Y L R E + + P Q +A+ ++F D P + Sbjct: 169 GKTIYNFLDNEFYELRRKFEVLADFK-IADPSEIVQHIAQQTKLFKFESSDKNPAPSLNA 227 Query: 202 SQILSPRRWQ----DESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKL 257 ++ + E ++W Y E L+ G + + D K+ Sbjct: 228 RLVIETIENETLILKEDANMWMVYNAFNE-LLHGKIK--------------KTFDQQKKI 272 Query: 258 NRALWVM 264 ++ ++ + Sbjct: 273 DKEIFNL 279 >UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Z8_9PLAN Length = 327 Score = 47.7 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 78/209 (37%), Gaps = 38/209 (18%) Query: 85 LRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGE---VRVPH 141 +R++ + K ++L N+HDG+S+ ++ R VCQN L ++ + + H Sbjct: 128 IRVKNSDDLVDKF---LLLSNAHDGSSALRVYFTPIRVVCQNTLNLADNRSTGQGISILH 184 Query: 142 KGDVVSQVIEGAYEVLGIFDRVEE----KRDAMQSLLLPPPAQQALAKAALTYRFGEDHQ 197 KG++ +++ E A VLG+ + + D + S +A ++ + G D+ Sbjct: 185 KGNLHTKIRE-AQRVLGLAEEFYDEAEGIIDILASHHPSSVQVEAFFQSVIPDPIGADNA 243 Query: 198 PVTESQILSPRRWQDE---------SNDL-------WTTYQRIQENLIKGGLSGRNAKGG 241 R+ +D D+ W Y + E + R+ Sbjct: 244 --------RARKVRDRLTCLFETGIGQDMPEIKGTSWAAYNAVTE-FVDHHRPTRSTDPL 294 Query: 242 RSHTRAVRG--IDGDVKLNRALWVMAEAL 268 +R + +L W +A + Sbjct: 295 ERASRRLDSSWFGSGARLKAKAWNLAFDM 323 >UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N225_9BACT Length = 241 Score = 46.5 bits (109), Expect = 9e-04, Method: Composition-based stats. 
Identities = 35/202 (17%), Positives = 69/202 (34%), Gaps = 16/202 (7%) Query: 37 KHESRSERYTYIPTISLLDSL----QREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQ 92 + + + +P ++D++ + +Q R+ R M + R + Sbjct: 20 PTPAATASWKPVPHSEVIDAVTDVVRAHNWQILDEQYGLARNGQR------MFGVIRINR 73 Query: 93 ITGKQVPEII-LLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVV--SQV 149 + + I + NSHD T + + GL VC N + G + ++ H + V Sbjct: 74 TSSSEWSRCIGICNSHDRTIAVGLAAGLNVQVCANLMFGGSTV--LKRRHTSRIELNGLV 131 Query: 150 IEGAYEVLGIFDRVEEKRDAMQSLLLPPP-AQQALAKAALTYRFGEDHQPVTESQILSPR 208 +E + F +E + ++ + A+ A+ KAA + PR Sbjct: 132 VEAIDALEDDFLTLETVAEDLKIQFVRDDTARAAIVKAAEAGAVNSCDIVPIFREFKEPR 191 Query: 209 RWQDESNDLWTTYQRIQENLIK 230 + W EN K Sbjct: 192 YEEFAEPTRWALLNAFTENAKK 213 >UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q5L2_CATAD Length = 329 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 42/103 (40%), Gaps = 12/103 (11%) Query: 75 PGRREHTKHMLRLRREGQITGKQVPEIIL--LNSHDGTSSYQMLPGLFRAVCQN--GLVC 130 GR+ +RL + G ++ + LNSHDGT +Y+++ R VC N L Sbjct: 127 EGRQVFVT--MRLPETMTVAGTDRLDLYISGLNSHDGTGAYKLIVTPIRIVCANTQSLAL 184 Query: 131 GESFGEVRVPHKGDVVSQVIEG------AYEVLGIFDRVEEKR 167 + + H ++ E ++ + F++ E+ Sbjct: 185 DRARSSFSIRHTESAKKKIAEARKALGLMFKYVEEFEKAAERM 227 >UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZYJ5_DESOH Length = 308 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. 
Identities = 26/126 (20%), Positives = 43/126 (34%), Gaps = 15/126 (11%) Query: 44 RYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV-PEII 102 +YT + +L+ L G+ P Q + + + R+ I G + P I Sbjct: 107 KYTPVDNFEILERLDSLGYGPDTKVQCSL---DAEFLSLSIPDGRKAFDINGDRFKPGIS 163 Query: 103 LLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDR 162 + NS G +S + + R VC NGL+ H +L F + Sbjct: 164 ISNSEVGLASLTISAFVLRLVCTNGLIARTGI-SASYRHVST----------RILKEFPQ 212 Query: 163 VEEKRD 168 E Sbjct: 213 TIETVS 218 >UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax=Bacteroidetes RepID=C6W397_DYAFD Length = 350 Score = 43.8 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 41/95 (43%), Gaps = 7/95 (7%) Query: 101 IILLNSHDGTSSYQMLPGLFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIEGAYEVLG 158 + L SHDG+ S R VC N L V++ H + V + + A++V+G Sbjct: 152 LFLTTSHDGSGSITAAFTPVRIVCANTLNAAMKNITNVVKIRHTSNAVER-LRTAHKVMG 210 Query: 159 IFDR----VEEKRDAMQSLLLPPPAQQALAKAALT 189 I ++ VEE + + P + L + A+ Sbjct: 211 IANKFSHEVEEIFNHWAKKPITDPQLKKLIEIAMA 245 >UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKH6_9FLAO Length = 312 Score = 43.1 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 33/86 (38%), Gaps = 5/86 (5%) Query: 98 VPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVL 157 +P + NS+DG+ G +R VC NGL + E + H + ++ + Sbjct: 141 LPMLRFKNSYDGSEKTSGHFGFYREVCSNGLHVSLAEIEFSIKHSKNNTHLIM---PRLN 197 Query: 158 GIFDRVEEKRDAMQSLLLPPPAQQAL 183 +FD+ + ++ + Sbjct: 198 NLFDKFLDN--EFYTITKKFDKMKEF 221 >UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=Cyanobacteria RepID=B4VVD2_9CYAN Length = 336 Score = 42.3 bits (98), Expect = 0.016, Method: Composition-based stats. 
Identities = 30/144 (20%), Positives = 54/144 (37%), Gaps = 17/144 (11%) Query: 97 QVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESF--------GEVRVPHKGDVVSQ 148 P ++L NSHDG+++ + R VC N L F + +PH + Q Sbjct: 133 VRPYLLLHNSHDGSTAVWLQFTPVRVVCWNTLNGAARFRFGDLWQKKAICIPHSLSLTEQ 192 Query: 149 VIEGAYEVLGIFDRVEE-KRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTES---QI 204 +E + +L + + + + Q++ + LA R QP Q+ Sbjct: 193 -LEHIHNILDLTQKEFQYSVEEYQAMAHKELTTELLAD--YIGRVLGTTQPTLHPAWSQL 249 Query: 205 LS--PRRWQDESNDLWTTYQRIQE 226 ++ ++ LW Y I E Sbjct: 250 VANFESGRGNQGQTLWDAYNSITE 273 >UniRef50_C2LEJ7 Putative uncharacterized protein n=1 Tax=Proteus mirabilis ATCC 29906 RepID=C2LEJ7_PROMI Length = 39 Score = 42.3 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 17/35 (48%), Positives = 21/35 (60%) Query: 239 KGGRSHTRAVRGIDGDVKLNRALWVMAEALLTQLQ 273 K + T +V GID D KLN+ALWVM E +Q Sbjct: 2 KSKHTRTCSVNGIDSDSKLNKALWVMTEKCTNIIQ 36 >UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF46A9 Length = 348 Score = 41.5 bits (96), Expect = 0.027, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 59/178 (33%), Gaps = 31/178 (17%) Query: 101 IILLNSHDGTSSYQMLPGLFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIEGAYEVLG 158 + LNSHDG+++++ L R VC N + + H G + + E + Sbjct: 163 LAALNSHDGSAAFRFLLSPIRIVCANTQSAAIRSAKSSFSIRHTGGARASIAEARNALKL 222 Query: 159 IFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRW-QDESNDL 217 + +E +L P + + A T V + + RR ++ +N + Sbjct: 223 SWRYIEAFEAEAAALYAAPMDTEEMRSFANTL------LEVDSAGTTATRRHRRERANSI 276 Query: 218 -----------------WTTYQRIQENL-----IKGGLSGRNAKGGRSHTRAVRGIDG 253 W Y + E L ++G + +A R+ G Sbjct: 277 VKLWTSSETIAPIAGTRWAAYNAVTEYLDHVVPVRGAKTATDASAARALRNITTAASG 334 >UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=Actinomycetales RepID=C4DCZ5_9ACTO Length = 395 Score = 41.1 bits (95), Expect = 0.040, Method: Composition-based stats. 
Identities = 25/121 (20%), Positives = 46/121 (38%), Gaps = 16/121 (13%) Query: 43 ERYTYIPTISLLDSLQR--EGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV-- 98 +Y + + L+ E + + VR GRR + +R +T Sbjct: 151 SKYHTVQNRECFEFLRNLVESYDVVWESAGAVR-GGRRTF----VSMRLPDTVTVDAAGI 205 Query: 99 -----PEIILLNSHDGTSSYQMLPGLFRAVCQNG--LVCGESFGEVRVPHKGDVVSQVIE 151 P +++ NSHDG+SS + +R VC N L ++ + H + Q+ + Sbjct: 206 NDTITPFVVVFNSHDGSSSITAVVTPYRPVCANTERLALDNAYTSWSIRHTESAMHQMRQ 265 Query: 152 G 152 Sbjct: 266 A 266 >UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betaproteobacteria RepID=Q47CX4_DECAR Length = 354 Score = 40.7 bits (94), Expect = 0.046, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 46/132 (34%), Gaps = 13/132 (9%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRR-EHTKHML-----RLRREGQITG 95 S+RY + L E P VR TK L RL+ E Sbjct: 115 SDRYRRLDNFDL-----AESVLPILQQLPEVRFESVELTETKMYLKCITPRLKYEMAPGD 169 Query: 96 KQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 +++ NS G + + P LFR VC NGL+ + +R H G + E Sbjct: 170 VVQAGVVISNSEVGQGTLSVQPLLFRLVCSNGLIVPDR--SLRKMHVGRALGGEDERIQV 227 Query: 156 VLGIFDRVEEKR 167 R ++K Sbjct: 228 YQDDTLRADDKA 239 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 382 e-105 UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 367 e-100 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 322 6e-87 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 303 6e-81 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 302 1e-80 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 301 2e-80 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 300 3e-80 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 
300 3e-80
UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo...   289   8e-77
UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o...   274   2e-72
UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu...   256   4e-67
UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ...   197   4e-49
UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophu...   167   3e-40
UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostri...   158   1e-37
UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victiva...   152   1e-35
UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide...   129   7e-29
UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=...   127   3e-28
UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavoba...   122   8e-27
UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfo...   115   2e-24
UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ...   113   8e-24
UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=...   80   9e-14
UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or...   68   4e-10

Sequences not found previously or not previously below threshold:

UniRef50_B9E574 Putative uncharacterized protein n=5 Tax=Clostri...   100   3e-20
UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwen...   80   8e-14
UniRef50_Q024R3 Putative uncharacterized protein n=1 Tax=Candida...   75   3e-12
UniRef50_A1SIX8 Putative uncharacterized protein n=2 Tax=Nocardi...   75   3e-12
UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=...   74   7e-12
UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax...   73   9e-12
UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=...   72   1e-11
UniRef50_UPI00017465AE hypothetical protein VspiD_04485 n=2 Tax=...   69   1e-10
UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobac...   69   2e-10
UniRef50_B8F9V3 Putative uncharacterized protein n=4 Tax=Deltapr...   65   3e-09
UniRef50_Q0RM54 Putative uncharacterized protein n=1 Tax=Frankia...   63   1e-08
UniRef50_Q5LU35 Putative uncharacterized protein n=1 Tax=Ruegeri...   62   3e-08
UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfo...   62   3e-08
UniRef50_B4CXI2 Putative uncharacterized protein n=1 Tax=Chthoni...   61   4e-08
UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax...   61   4e-08
UniRef50_A1WP45 Putative uncharacterized protein n=2 Tax=Comamon...   60   8e-08
UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betapro...   57   5e-07
UniRef50_C4ZMQ9 Phage/plasmid-related protein TIGR03299 n=1 Tax=...   54   6e-06
UniRef50_Q19YQ9 Gp96 n=7 Tax=unclassified Siphoviridae RepID=Q19...   54   6e-06
UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synecho...   54   8e-06
UniRef50_C5CKG6 Phage/plasmid-related protein TIGR03299 n=10 Tax...   53   8e-06
UniRef50_UPI00016C3597 hypothetical protein GobsU_16407 n=1 Tax=...   53   9e-06
UniRef50_B7I5L8 Phage/plasmid-related protein n=5 Tax=Moraxellac...   53   1e-05
UniRef50_A8ZKZ6 Putative uncharacterized protein n=3 Tax=Cyanoba...   51   5e-05
UniRef50_Q5Y1B4 Putative uncharacterized protein n=1 Tax=uncultu...   51   6e-05
UniRef50_C6RKU8 Phage/plasmid-related protein n=12 Tax=Acinetoba...   51   6e-05
UniRef50_C0VFU1 Putative uncharacterized protein n=4 Tax=Acineto...   49   1e-04
UniRef50_Q2IFF9 Putative uncharacterized protein n=3 Tax=Anaerom...   47   7e-04
UniRef50_C8X3A3 Putative uncharacterized protein n=1 Tax=Desulfo...   46   0.001
UniRef50_A6SWN5 Uncharacterized conserved protein n=39 Tax=Prote...   46   0.001
UniRef50_B3VM79 Gp52 n=2 Tax=unclassified Siphoviridae RepID=B3V...   46   0.001
UniRef50_A6WZ56 Putative uncharacterized protein n=1 Tax=Ochroba...   46   0.002
UniRef50_Q18F79 Putative uncharacterized protein n=1 Tax=Haloqua...   45   0.002
UniRef50_C4V5A4 Putative uncharacterized protein n=1 Tax=Selenom...   45   0.002
UniRef50_A8ZPY1 Putative uncharacterized protein n=5 Tax=Bacteri...
44 0.004 UniRef50_B5LJ78 Gp67 n=1 Tax=Mycobacterium phage Myrna RepID=B5L... 42 0.024 UniRef50_B8KMK8 Putative uncharacterized protein n=1 Tax=gamma p... 42 0.031 UniRef50_C0GUY0 Putative uncharacterized protein n=2 Tax=Desulfo... 41 0.032 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 382 bits (982), Expect = e-105, Method: Composition-based stats. Identities = 267/273 (97%), Positives = 270/273 (98%) Query: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE Sbjct: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 Query: 61 GFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLF 120 GFQPFFACQTRVRDP RREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPG+F Sbjct: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 Query: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIF+RVEEKRDAMQSLLLPPP Q Sbjct: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 Query: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG Sbjct: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 Query: 241 GRSHTRAVRGIDGDVKLNRALWVMAEALLTQLQ 273 GR+HTRAVRGIDGDVKLNRALWVMAE LLTQLQ Sbjct: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. 
Identities = 178/267 (66%), Positives = 221/267 (82%), Gaps = 1/267 (0%) Query: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 Query: 63 QPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 QPFFACQ+RVRD GRRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+FD+V + +AM+ + L Q Sbjct: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 Query: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 Query: 242 RSHTRAVRGIDGDVKLNRALWVMAEAL 268 + TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 322 bits (826), Expect = 6e-87, Method: Composition-based stats. 
Identities = 130/270 (48%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Query: 4 LASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 LA+RFG ++ I PL E L+R VPS+F+ + H+SRSERY Y+PTI +++ L+REG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 63 QPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 PFFA Q+ RD R H KHMLRLRRE + + E I++NSHDGTS++Q+ G+ R Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 VC N ++ GE F EVRVPHKG++ +IEG Y V F R+ + + M+ + L Q+ Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 183 LAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG--RNAKG 240 L + +L R+GED P+T QI+ PRR++D + LWTT+ IQEN+I+GGL G RNA+G Sbjct: 186 LGEVSLVARYGEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAEG 245 Query: 241 --GRSHTRAVRGIDGDVKLNRALWVMAEAL 268 RS +R + GID +V LNRALW +AE + Sbjct: 246 RIRRSRSRPINGIDQNVTLNRALWTLAEGM 275 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 303 bits (775), Expect = 6e-81, Method: Composition-based stats. 
Identities = 110/268 (41%), Positives = 155/268 (57%), Gaps = 13/268 (4%) Query: 14 IRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVR 73 R +T E+++V PS+F+ HESRS+R+ IPTI +L L EGF P A Q+ R Sbjct: 117 FDTARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEGFVPVGAKQSASR 176 Query: 74 DPGRREHTKHMLRLRREGQ----ITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLV 129 G+ + TKH++RLRR G V EI+L N++DGTS+Y++L GLFR C N LV Sbjct: 177 TEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLAGLFRIRCMNSLV 236 Query: 130 CGE-SFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 + ++V H GDV ++VIEG Y VL +R + L QQ +A+AA Sbjct: 237 TQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLNRDEQQIMAEAAH 296 Query: 189 TYRFGEDH----QPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRN----AKG 240 RFG++ P+ Q+L PRR D ++DLWT + QEN+I+GGL G + Sbjct: 297 VLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGGLRGIGREDLGRP 356 Query: 241 GRSHTRAVRGIDGDVKLNRALWVMAEAL 268 R +RAV GID D+KLN+ALW++ E + Sbjct: 357 RRVKSRAVNGIDQDIKLNKALWLIGEKM 384 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 302 bits (772), Expect = 1e-80, Method: Composition-based stats. 
Identities = 103/268 (38%), Positives = 150/268 (55%), Gaps = 11/268 (4%) Query: 13 LIRRDRP----LTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFAC 68 LI P LT E+L++ PS+F+ SERY I T ++D L +EGF P A Sbjct: 5 LIESGEPAMNVLTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKAT 64 Query: 69 QTRVRDPGRREHTKHMLRLRREG-QITGK-QVPEIILLNSHDGTSSYQMLPGLFRAVCQN 126 Q+ R ++ +KH++R R G PE++L+NSHDG SSY+++ GL+R VC N Sbjct: 65 QSASRSEEKKVFSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTN 124 Query: 127 GLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKA 186 GLV G+S+ EVRV H+GDV+ VIEG Y V+ ++ + + M LP + Sbjct: 125 GLVAGKSYDEVRVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQ 184 Query: 187 ALTYRFGEDHQ-PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG----RNAKGG 241 A RF ED + +L PRR +D DL++ + +QENLIKGG+ G + + Sbjct: 185 AHALRFSEDANLVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWR 244 Query: 242 RSHTRAVRGIDGDVKLNRALWVMAEALL 269 R+ +R + ID +VK+NR LW +AE L Sbjct: 245 RARSRKITSIDQNVKINRDLWTIAENTL 272 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 301 bits (770), Expect = 2e-80, Method: Composition-based stats. 
Identities = 84/257 (32%), Positives = 138/257 (53%), Gaps = 5/257 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 + PLT E+L ++ PS+F+++ + S++Y +I TI +++ ++ + P + VRD Sbjct: 4 SNEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDE 63 Query: 76 GRREHTKHMLRLRREGQ--ITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-E 132 + KH +R R G+ V E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 KKEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRF 192 F ++ H GD + V ++ + K + L + + AKAA+ RF Sbjct: 124 VFESYQIKHLGDKENDVSIAINKIAKAKYDILNKIKLFSKIPLTQDDKASFAKAAIPLRF 183 Query: 193 GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRS-HTRAVRGI 251 E H V +L P R +DE +DL+TT+ IQE+LI+G +SG NA+ R +R ++ I Sbjct: 184 -EKHLKVDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSRIIKSI 242 Query: 252 DGDVKLNRALWVMAEAL 268 D +N+ LW MAE++ Sbjct: 243 STDTDINKKLWNMAESI 259 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 300 bits (769), Expect = 3e-80, Method: Composition-based stats. 
Identities = 111/276 (40%), Positives = 157/276 (56%), Gaps = 13/276 (4%) Query: 6 SRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPF 65 S + R +PLT E+L R+ PS+F+E KHESRS+RYTYIPTI ++ L+ EGF P Sbjct: 2 SFLSRVSAHRHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPV 61 Query: 66 FACQTRVRDPGRREHTKHMLRLRREGQ-----ITGKQVPEIILLNSHDGTSSYQMLPGLF 120 A Q R PG+ E+TKH++R R G PE+ LLNSHDGTS+Y+++ + Sbjct: 62 MARQGNSRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMM 121 Query: 121 RAVCQNGLVCGESF-GEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPA 179 R C+NG+V ++ E+ VPHKG V +VIEG+Y VL + E L Sbjct: 122 RLACENGMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQ 181 Query: 180 QQALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG-- 235 Q+ A+A ++G+D + P T L RR D+ DLW R+QE+ I+GG++G Sbjct: 182 QKGFAEAVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFR 241 Query: 236 ---RNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 R R V+ IDGD+KLN+A+W +A+ L Sbjct: 242 WDEDGRNRKRVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 300 bits (769), Expect = 3e-80, Method: Composition-based stats. 
Identities = 119/266 (44%), Positives = 167/266 (62%), Gaps = 9/266 (3%) Query: 12 NLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTR 71 ++ R PLT EL VPS+F+ + HESRS R+ +PTI++LD L+ EGF+PFFA Q R Sbjct: 44 SIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQAR 103 Query: 72 VRDPGRREHTKHMLRLRREGQIT-GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVC 130 R G+ E TKHMLRLR G + + EI+L+N++DGTS+YQM+PG FR VC NGL+ Sbjct: 104 TRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMA 163 Query: 131 GESFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTY 190 GE+F EV+V H G+ + +VIEGAY VL RV ++ +S+ L ++ LA+AA + Sbjct: 164 GETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSL 223 Query: 191 RFGEDHQ----PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR----NAKGGR 242 RF + P+ +L PRR +D + DLWT + +QEN ++GG+ GR + R Sbjct: 224 RFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRR 283 Query: 243 SHTRAVRGIDGDVKLNRALWVMAEAL 268 R V GID LNRALW++ E + Sbjct: 284 QTVREVTGIDQSRALNRALWMLTERM 309 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 289 bits (739), Expect = 8e-77, Method: Composition-based stats. 
Identities = 125/276 (45%), Positives = 171/276 (61%), Gaps = 13/276 (4%) Query: 6 SRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP 64 +RFG+ A ++R + L L P+VF+EDKH SRS++YTYIPT+ +L L REGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 65 FFACQTRVRDPGRREHTKHMLRLRREGQIT---GKQVPEIILLNSHDGTSSYQMLPGLFR 121 RD +R +TKH+LRLRR G G E++LLNSHDGTSSYQ++ GLFR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 122 AVCQNGLVCGESFGEV-RVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180 +C NGLVC + ++ ++PHKGD+V QVI+GAY ++ + V+ M+ + L P Q Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 181 QALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+AA R+ + Q PV QI +PRR +D N LW + R QE LI+GG+ R Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 237 NAKGG----RSHTRAVRGIDGDVKLNRALWVMAEAL 268 N + G R TR V+G+DG+ LNRALWV+A + Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRM 289 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 274 bits (701), Expect = 2e-72, Method: Composition-based stats. 
Identities = 98/220 (44%), Positives = 147/220 (66%), Gaps = 4/220 (1%) Query: 53 LLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREG---QITGKQVPEIILLNSHDG 109 ++++L+REG+ P A ++RVR P R+ +KH+LR RR + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 110 TSSYQMLPGLFRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRD 168 + +YQ+ GLFR VC NG++ +S G+V+ H GDVV +VIEG YE++ R+ + + Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 169 AMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENL 228 ++L L Q+ A++AL R+ E P +L PRR +D+ NDLW TYQR+QEN+ Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWREGEAPCMPQALLRPRRHEDQGNDLWATYQRVQENM 180 Query: 229 IKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 +KGG+ GR+A G + TRAV+ +DG+VKLN+ALW + E + Sbjct: 181 LKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQM 220 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 256 bits (655), Expect = 4e-67, Method: Composition-based stats. 
Identities = 98/281 (34%), Positives = 142/281 (50%), Gaps = 23/281 (8%) Query: 10 AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQ 69 A R R L+ ++L RV PSVF+E S RYT++ T ++D L+ EG++P A Q Sbjct: 6 TAPSSRVFRALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQ 65 Query: 70 TRVRDPGRREHTKHMLRLRREGQI------TGKQVPEIILLNSHDGTSSYQMLPGLFRAV 123 RVR R+ H LR R + G PE+IL N+HDGT +Y++ GL+R V Sbjct: 66 QRVRLENRQGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLV 125 Query: 124 CQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQA 182 C+NGL ++ F V + H + A V RV E Q++ L P A+ + Sbjct: 126 CRNGLTVADADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHS 185 Query: 183 LAKAALTYRFGEDHQPVT----ESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNA 238 A A+ R+ + QPVT Q+L+P R+ D++ DLWTT+ +QE L +GGL Sbjct: 186 FAARAMALRW-DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGH 244 Query: 239 KGG-----------RSHTRAVRGIDGDVKLNRALWVMAEAL 268 R+ TR V G+ +LN+ALW +AE Sbjct: 245 IPAAEGAVFPTHYLRNTTRPVGGLTEGQRLNKALWNLAEEF 285 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 197 bits (500), Expect = 4e-49, Method: Composition-based stats. 
Identities = 43/172 (25%), Positives = 84/172 (48%), Gaps = 3/172 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 +PL+ EL R+ PS+F+ + + S++Y +I TI +++ ++ + P + VR+ Sbjct: 4 STQPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNE 63 Query: 76 GRREHTKHMLRLRREGQI--TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-E 132 + + +H +R R + E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 DKEGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALA 184 F ++ H G+ + V ++ D++ +K + L + + A Sbjct: 124 VFESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LV02_SYNAS Length = 264 Score = 167 bits (423), Expect = 3e-40, Method: Composition-based stats. Identities = 51/274 (18%), Positives = 94/274 (34%), Gaps = 57/274 (20%) Query: 13 LIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDS--------LQREGFQP 64 + R +T+++L + ++ Y + L D L+ Sbjct: 6 MHRGGELVTKDQLDLI--------PLPEPTDSYMPVSHYDLADKFLMISQDILRDYKL-- 55 Query: 65 FFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVC 124 R G + +L+ +RE G I NS+D + + + G VC Sbjct: 56 VGENYGIAR-QGNQFFA--VLKFQRERSEIG---LSIAFRNSYDRSMAIGLAIGASVFVC 109 Query: 125 QNGLVCGESFGEVRVPHKGDV----VSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQ 180 N + GE V H +V + I Y+ +D++ DA +S LP Sbjct: 110 DNLALSGEIV--VMKKHTKNVWSELEEKAIATIYKSQNNYDQLIGDVDAFKS--LPVDDN 165 Query: 181 QALAKAALTYRFGEDHQPVTESQI-------LSPRRWQDESNDLWTTYQRIQENLIKGGL 233 A A+ FG + ++ Q+ L P + E +LW+ Y E+L Sbjct: 166 GAF--QAMGLLFG--NNIISPRQLTVLKEEWLKPSHEEFEPRNLWSFYNAATESL----- 216 Query: 234 SGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEA 267 + V ++ ++L+ AL + + Sbjct: 217 ---------KSSPPVTIMEKHIRLHEALTYLGKE 241 >UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RIH4_9CLOT Length = 312 Score = 158 bits (400), Expect = 1e-37, Method: Composition-based stats. 
Identities = 47/245 (19%), Positives = 87/245 (35%), Gaps = 33/245 (13%) Query: 42 SERYTYIPTISLL---DSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV 98 ++RY + D L EG GRR + +L + I+G ++ Sbjct: 76 TDRYKVVQNEDAFAFTDQLLGEG---VTYETAGSLQNGRRTWL--LAKLPQRYIISGDEI 130 Query: 99 -PEIILLNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 P ++ +N+HDGT + ++ R VC N L + H GD+ ++ + Y Sbjct: 131 TPYMVFMNTHDGTGAIRVAMTPVRVVCMNTLNLALSTAKRSWSTNHTGDIAGKMEDARYT 190 Query: 156 VLGIFDRVEE---KRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQD 212 +L + E D M+ L L A P + Q R +D Sbjct: 191 LLYADRYMSELGKAIDHMKRLRLSERQVMEYIDALFPLY----DNPTPQQQKNLNRMKED 246 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGG-RSHTRAV------------RGIDGDVKLNR 259 + +++ K G NA +H R + + ++G+ ++R Sbjct: 247 MKTRYFDAPDL--KHVGKNGYRFINAVSDFATHARPLRESANHKENLFAKTVEGNALIDR 304 Query: 260 ALWVM 264 A ++ Sbjct: 305 AFAML 309 >UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N225_9BACT Length = 241 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. 
Identities = 42/229 (18%), Positives = 80/229 (34%), Gaps = 18/229 (7%) Query: 34 SEDKHESRSERYTYIPTISLLDSL----QREGFQPFFACQTRVRDPGRREHTKHMLRLRR 89 + + + + +P ++D++ + +Q R+ R M + R Sbjct: 17 AMVPTPAATASWKPVPHSEVIDAVTDVVRAHNWQILDEQYGLARNGQR------MFGVIR 70 Query: 90 EGQITGKQVPEII-LLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVV-- 146 + + + I + NSHD T + + GL VC N + G + ++ H + Sbjct: 71 INRTSSSEWSRCIGICNSHDRTIAVGLAAGLNVQVCANLMFGGSTV--LKRRHTSRIELN 128 Query: 147 SQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPP-AQQALAKAALTYRFGEDHQPVTESQIL 205 V+E + F +E + ++ + A+ A+ KAA + Sbjct: 129 GLVVEAIDALEDDFLTLETVAEDLKIQFVRDDTARAAIVKAAEAGAVNSCDIVPIFREFK 188 Query: 206 SPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGD 254 PR + W EN K R + R TR + G+DG Sbjct: 189 EPRYEEFAEPTRWALLNAFTENAKKYS-PARADQCYRGLTR-LFGLDGQ 235 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 129 bits (325), Expect = 7e-29, Method: Composition-based stats. Identities = 83/112 (74%), Positives = 98/112 (87%) Query: 157 LGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESND 216 + FD V EKR+ MQSLLLPPPAQQALA+AALTYRFGE+HQP+TE Q+L PRRW+D+ +D Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQPITEEQVLQPRRWEDKKDD 60 Query: 217 LWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEAL 268 LWT YQR+QENLIKGGLSGRNAKG R+ TR+V GIDGD+KLN+ALWVM E + Sbjct: 61 LWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Z8_9PLAN Length = 327 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. 
Identities = 42/256 (16%), Positives = 86/256 (33%), Gaps = 47/256 (18%) Query: 45 YTYIPTISLL---DSLQREGFQPFFACQTRVRDPGRREHTK----HMLRLRREGQITGKQ 97 Y + D++ +G G R +R++ + K Sbjct: 83 YVPVQNRQAFGFLDAVVADG--SLRYHTAGALGKGERIWLLAKLPSQIRVKNSDDLVDKF 140 Query: 98 VPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGE---VRVPHKGDVVSQVIEGAY 154 ++L N+HDG+S+ ++ R VCQN L ++ + + HKG++ +++ E Sbjct: 141 ---LLLSNAHDGSSALRVYFTPIRVVCQNTLNLADNRSTGQGISILHKGNLHTKIREA-Q 196 Query: 155 EVLGIFDRVEE----KRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRW 210 VLG+ + + D + S +A ++ + G D+ R+ Sbjct: 197 RVLGLAEEFYDEAEGIIDILASHHPSSVQVEAFFQSVIPDPIGADNA--------RARKV 248 Query: 211 QDE---------SNDL-------WTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRG--ID 252 +D D+ W Y + E + R+ +R + Sbjct: 249 RDRLTCLFETGIGQDMPEIKGTSWAAYNAVTE-FVDHHRPTRSTDPLERASRRLDSSWFG 307 Query: 253 GDVKLNRALWVMAEAL 268 +L W +A + Sbjct: 308 SGARLKAKAWNLAFDM 323 >UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXR9_FLAPJ Length = 285 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 66/187 (35%), Gaps = 32/187 (17%) Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGA 153 K P + NS+DG+ G FR VC NGL + + H+G++ V+ Sbjct: 109 LDKIRPMLRFTNSYDGSCKTSGTFGFFREVCSNGLHTASTDIGFSLKHRGNINELVLPAI 168 Query: 154 YEVLGIF--------DRVEEKRDAMQSLLLPPPAQQALAKAALTYRF-GEDHQP---VTE 201 + + F R E + + P Q +A+ ++F D P + Sbjct: 169 GKTIYNFLDNEFYELRRKFEVLADFK-IADPSEIVQHIAQQTKLFKFESSDKNPAPSLNA 227 Query: 202 SQILSPRRWQ----DESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKL 257 ++ + E ++W Y E L+ G + + D K+ Sbjct: 228 RLVIETIENETLILKEDANMWMVYNAFNE-LLHGKIK--------------KTFDQQKKI 272 Query: 258 NRALWVM 264 ++ ++ + Sbjct: 273 DKEIFNL 279 >UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZS75_DESOH Length = 318 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. 
Identities = 38/206 (18%), Positives = 66/206 (32%), Gaps = 26/206 (12%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRL----RREGQITG-- 95 +ERY + + +L L R GF P Q + D ++R+ R G G Sbjct: 108 TERYKPLDNMDVLSQLLRHGFDPDTQVQYAIDDG------MFLVRIPEYARAFGVNPGYG 161 Query: 96 ---KQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDV-VSQVIE 151 + VP + NS G ++ + +R VC NGL+ S R H + + E Sbjct: 162 KLDEIVPGVSFANSEVGLLAFSIEAFFYRLVCTNGLISKTSSTFSRFKHISNRGLENFPE 221 Query: 152 GAYEVLGIFDRVEE--KRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRR 209 V+ R +E K + P + + A+ E + Sbjct: 222 TIAGVIEDSVRKQEQFKLSRQSPVENPIRSIETFARQ-FGLAHLETEVVCKAYLL----- 275 Query: 210 WQDESNDLWTTYQRIQENLIKGGLSG 235 ++ ++ L Sbjct: 276 --EQGATMFHIINAFTRAAQDKHLDT 299 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 113 bits (282), Expect = 8e-24, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 78/237 (32%), Gaps = 21/237 (8%) Query: 43 ERYTYIPTISLLDSL--------------QREGFQPFFACQTRVRDPGRREHTKHMLRLR 88 RYT + DS+ F + + HM+++ Sbjct: 76 SRYTLLKNSDAFDSVNAAVNTLAENGVLNMDGAFIKDAVVNKGGKVIRQYFFPAHMVQI- 134 Query: 89 REGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVV-S 147 K + ++++NS+DG+ ++Q+ G FR VC NG++ GE F + V H G + Sbjct: 135 ---GSGDKVILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDVRHTGTMNFG 191 Query: 148 QVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSP 207 QV + F+ + + D + + L + T + L Sbjct: 192 QVTRQVTTAVSSFENMGQYWDTLINSPLNRKDADKIITDMSTVGRELNMNKFDMFDRLYT 251 Query: 208 RRWQDESNDLWTTYQRIQENLIKGGLSGRN-AKGGRSHTRAVRGIDGDVKLNRALWV 263 + + W Y + ++ N + + I + A+W Sbjct: 252 DHKKTLGENHWAMYNSLTAWATHYKVNESNISNIDNVRLEREKSI-QHLMRKPAIWN 307 >UniRef50_B9E574 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B9E574_CLOK1 Length = 325 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. 
Identities = 45/241 (18%), Positives = 80/241 (33%), Gaps = 33/241 (13%) Query: 42 SERYTYIPTISLL---DSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV 98 ++RY + DSL EG GR+ + +L + +I +V Sbjct: 88 TDRYKIVQNKEAFSFTDSLIGEG---CKYETAGSLQNGRKVWL--LAKLPDKYKILDDEV 142 Query: 99 -PEIILLNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 P ++ NSHDGT + ++ R VC N L + H G++ S++ E Sbjct: 143 TPYMVFSNSHDGTGAIKVAMTPIRVVCNNTLNLALSNAKRIWSTIHTGNISSKLNEAMKT 202 Query: 156 VL--GIFDRVEEKRDA-MQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQD 212 +L + + + + + L ++ E I R D Sbjct: 203 LLLAESYMENLDYEAHYLSRKTISDEKVLEFIELLLPLP--DNASKTQEKNINLLR--DD 258 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGG-RSHTRAV------------RGIDGDVKLNR 259 + I +L K NA +H + + IDG+ ++R Sbjct: 259 MKLRYFDAPDLI--DLPKTSWRFVNAVSDFATHINPLRKTKNYKENLFSKTIDGNPLIDR 316 Query: 260 A 260 A Sbjct: 317 A 317 >UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKH6_9FLAO Length = 312 Score = 80.2 bits (196), Expect = 8e-14, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 63/181 (34%), Gaps = 30/181 (16%) Query: 97 QVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEV 156 +P + NS+DG+ G +R VC NGL + E + H + ++ + Sbjct: 140 ILPMLRFKNSYDGSEKTSGHFGFYREVCSNGLHVSLAEIEFSIKHSKNNTHLIMPRLNNL 199 Query: 157 LG-----IFDRVEEKRDAMQ--SLLLPPPAQQALAKAALTYRFGEDHQPVTE----SQIL 205 F + +K D M+ ++ +A+ +R+ + +++ Sbjct: 200 FDKFLDNEFYTITKKFDKMKEFKIIDTQEFVKAILDRTKLFRYECSDKNSDPSKKSREVI 259 Query: 206 SPRRWQ----DESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRAL 261 ++ +E +LW Y +++ L + +L++ L Sbjct: 260 EILNYEALLLNEEPNLWLGYNAFN-SVLHNVLK--------------KSFGQQERLDKKL 304 Query: 262 W 262 + Sbjct: 305 F 305 >UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q5L2_CATAD Length = 329 Score = 79.8 bits (195), Expect = 9e-14, Method: Composition-based stats. 
Identities = 26/140 (18%), Positives = 49/140 (35%), Gaps = 25/140 (17%) Query: 45 YTYIPTIS-------LLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQ 97 YT + L+D+ GR+ +RL + G Sbjct: 96 YTPVQNEENCQIMNTLVDASGAH------FETAGSLREGRQVFVT--MRLPETMTVAGTD 147 Query: 98 VPEIIL--LNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEG- 152 ++ + LNSHDGT +Y+++ R VC N L + + H ++ E Sbjct: 148 RLDLYISGLNSHDGTGAYKLIVTPIRIVCANTQSLALDRARSSFSIRHTESAKKKIAEAR 207 Query: 153 -----AYEVLGIFDRVEEKR 167 ++ + F++ E+ Sbjct: 208 KALGLMFKYVEEFEKAAERM 227 >UniRef50_Q024R3 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q024R3_SOLUE Length = 237 Score = 74.8 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 71/227 (31%), Gaps = 24/227 (10%) Query: 8 FGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSL----QREGFQ 63 A LI LTR +L ++ + + +P + ++++L Sbjct: 1 MSEATLIASTAKLTRLQL--------ADVPTPLGTATHRPVPHVEVVEALVETLSFRHIG 52 Query: 64 PFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAV 123 +D M + I + NSHD + + G+ V Sbjct: 53 VVTEEYAVSKDG------MKMFGVLDLDTGMPGCRFSIGIRNSHDRSMRLAAVVGVRVLV 106 Query: 124 CQNGLVCGESFGEVRVPHKGD--VVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQ 181 C+N G+ F V H + + + + G ++ FD + ++ DA + L + Sbjct: 107 CENMAFSGD-FQPVLAKHSKNFSLQNALSIGVDQMQRNFDGMRKQVDAWRESQLSDTVAK 165 Query: 182 ALAKAALT---YRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQ 225 + A + SP+ + + +W+ Sbjct: 166 MIIYRAFIESDLEVPKHLARPVHDLYFSPKHEEFQPRTMWSLSNAFT 212 >UniRef50_A1SIX8 Putative uncharacterized protein n=2 Tax=Nocardioides sp. JS614 RepID=A1SIX8_NOCSJ Length = 334 Score = 74.8 bits (182), Expect = 3e-12, Method: Composition-based stats. 
Identities = 48/257 (18%), Positives = 84/257 (32%), Gaps = 24/257 (9%) Query: 27 RVVPSVFSEDKHESRSERYTYIPTISLLD--SLQREGFQPFFACQTRVRDPGRREHTKHM 84 R P + + YT + + +L E F +R GR+ Sbjct: 77 RTNPFTGAPEALGVVGGGYTPLQNEDHAEFLNLLAEESGAIFDTAGSLR-GGRQVFIT-- 133 Query: 85 LRLRREGQITGKQVPEIIL--LNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVP 140 ++L + G ++ + LNSHDG+S++++L R VC N + Sbjct: 134 MQLPDSLTVGGTDRVDLNIAALNSHDGSSAFRILVTPVRVVCANTQSAALRNHESSFSIR 193 Query: 141 HKGDVVSQVIEGAYEVLGIF---DRVEEKRDAMQSLLLPPPAQQALAKAAL--TYRFGED 195 H + + V + F D + + + + + A AL A G Sbjct: 194 HTRNAKAAVQAARDALGLTFTYVDAFQVEAERLIQQTMTDAAFDALIDATFGKAEANGTK 253 Query: 196 HQPVTESQILSPRRWQDESNDL--------WTTYQRIQENLIK-GGLSGRNAKGGRSHTR 246 TE + S W D W YQ + E + + + + TR Sbjct: 254 RVRETERRRRSRLHWLFADADTQAGIRATAWAGYQAVAEYVDHYAPVRTKGDEHAARATR 313 Query: 247 AVRGIDGDVKLNRALWV 263 + D D ++ R W Sbjct: 314 VLTSDDPD-RIKRRAWT 329 >UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=Actinomycetales RepID=C4DCZ5_9ACTO Length = 395 Score = 73.6 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 62/193 (32%), Gaps = 22/193 (11%) Query: 22 REELFRVVPSVF------SEDKHESRSERYTYIPTISLLDSLQR--EGFQPFFACQTRVR 73 ++L P F + +Y + + L+ E + + VR Sbjct: 125 DDQLHTH-PDKFHTLRSDTAAPLGVVGSKYHTVQNRECFEFLRNLVESYDVVWESAGAVR 183 Query: 74 DPGRREHTKHMLRLRRE-----GQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNG- 127 GRR +RL I P +++ NSHDG+SS + +R VC N Sbjct: 184 -GGRRTFVS--MRLPDTVTVDAAGINDTITPFVVVFNSHDGSSSITAVVTPYRPVCANTE 240 Query: 128 -LVCGESFGEVRVPHKGDVVSQVIEGAYEV---LGIFDRVEEKRDAMQSLLLPPPAQQAL 183 L ++ + H + Q+ + + + +D ++ + + +AL Sbjct: 241 RLALDNAYTSWSIRHTESAMHQMRQARRTLKMSVKYYDEFAAQQTTLAHHDMVIDEFRAL 300 Query: 184 AKAALTYRFGEDH 196 Sbjct: 301 IDELWPLEPNATK 313 >UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF46A9 Length = 348 Score = 73.2 bits (178), Expect = 9e-12, Method: Composition-based stats. 
Identities = 47/259 (18%), Positives = 80/259 (30%), Gaps = 38/259 (14%) Query: 43 ERYTYIPTI---SLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVP 99 +Y + LLD+L + F +R GR ++L GK Sbjct: 99 SKYEPLQNEASCDLLDALVDQSGGAHFETAGALR-GGRETFVT--MKLPSSMVFDGKDGS 155 Query: 100 ------EIILLNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIE 151 + LNSHDG+++++ L R VC N + + H G + + E Sbjct: 156 KDRTDFYLAALNSHDGSAAFRFLLSPIRIVCANTQSAAIRSAKSSFSIRHTGGARASIAE 215 Query: 152 GAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQ 211 + + +E +L P + + A T + R + Sbjct: 216 ARNALKLSWRYIEAFEAEAAALYAAPMDTEEMRSFANTLLEVDSAGTTATR-----RHRR 270 Query: 212 DESND-----------------LWTTYQRIQENLIK-GGLSGRNAKGGRSHTRAVRGIDG 253 + +N W Y + E L + G S RA+R I Sbjct: 271 ERANSIVKLWTSSETIAPIAGTRWAAYNAVTEYLDHVVPVRGAKTATDASAARALRNITT 330 Query: 254 DVKLNRALWVMAEALLTQL 272 ++L A +L L Sbjct: 331 AAS-GQSLKAQAFRMLQTL 348 >UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=Cyanobacteria RepID=B4VVD2_9CYAN Length = 336 Score = 72.5 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 60/200 (30%), Gaps = 19/200 (9%) Query: 45 YTYIPTISLL---DSLQREGFQPFFACQTRVRDPGRREH-TKHMLRLRREGQITGKQVPE 100 YT + D L G G+R ++ E P Sbjct: 79 YTPLQNEEAFRWFDPLLSRG--GVQLEAAGSLKGGKRIWILAKLINTEAEIISGDIVRPY 136 Query: 101 IILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGE--------VRVPHKGDVVSQVIEG 152 ++L NSHDG+++ + R VC N L F + +PH + Q +E Sbjct: 137 LLLHNSHDGSTAVWLQFTPVRVVCWNTLNGAARFRFGDLWQKKAICIPHSLSLTEQ-LEH 195 Query: 153 AYEVLG----IFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPR 208 + +L F E+ AM L L H ++ Sbjct: 196 IHNILDLTQKEFQYSVEEYQAMAHKELTTELLADYIGRVLGTTQPTLHPAWSQLVANFES 255 Query: 209 RWQDESNDLWTTYQRIQENL 228 ++ LW Y I E L Sbjct: 256 GRGNQGQTLWDAYNSITEWL 275 >UniRef50_UPI00017465AE hypothetical protein VspiD_04485 n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465AE Length = 256 Score = 69.0 bits (167), Expect = 1e-10, Method: Composition-based stats. 
Identities = 33/206 (16%), Positives = 64/206 (31%), Gaps = 15/206 (7%) Query: 38 HESRSERYTYIPTISLLDS----LQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQI 93 + + IP L+++ L+ + + R +L + + Sbjct: 35 TPRSTSSWCPIPHNRLIETVQKTLKSTNLRIGTQAHSLSHKGHRYFGLMEILGPKNDDDY 94 Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVI--- 150 + L NSHD T ++ G VC N GE H +V + Sbjct: 95 ----CWVLGLRNSHDKTFPAGIVAGASVFVCDNLSFSGEVK--FARKHTRFIVRDLPGIT 148 Query: 151 -EGAYEVLGIFDRVEEKRDAMQSLLLPPP-AQQALAKAALTYRFGEDHQPVTESQILSPR 208 +++ + +++ A + + A + +A P + PR Sbjct: 149 ERAIGQLMSKWHHQDKRIGAYKEADIEDSIAHDLIIRATDVGVCSNRLIPSVLKEWREPR 208 Query: 209 RWQDESNDLWTTYQRIQENLIKGGLS 234 E +W+ + E L G LS Sbjct: 209 YQVFEDRSVWSLFNAFTEALKDGSLS 234 >UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPG4_MYCSK Length = 344 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 75/264 (28%), Gaps = 52/264 (19%) Query: 43 ERYTYIPTI---SLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRR------EGQI 93 +Y + LLD+L E GR +RL Sbjct: 99 NKYEPMQNEASCDLLDALTGE--SGAVYETAGALRGGRETFVT--MRLPESMVFDGIDGT 154 Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIE 151 + + LNSHDG+S ++ L R VC N + + H G + E Sbjct: 155 KDRTDFYLAALNSHDGSSKFRFLVTPVRIVCANTQSAAIARAAASFGISHTGGAAVALQE 214 Query: 152 GA------YEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQIL 205 + + F+ ++ A+ + + + A GE Sbjct: 215 ARRALKLSWRYVEAFE---QEAAALYAAPMDLDQMRRFA--------GELVDVDGAESKT 263 Query: 206 SPRRWQDESN-----------------DLWTTYQRIQENLIKGGLSGRNAKGGRSHTRAV 248 + R +D +N W Y + E + S A G RA+ Sbjct: 264 TARNRRDTANAIVKLWVSSPTVAPIAGTRWAAYNAVTEYV--DHYSKVRAAGDPQSVRAL 321 Query: 249 RGIDGDVKLNRALWVMAEALLTQL 272 R + G A +L L Sbjct: 322 RAVTGGSTAQTLKTN-AFRMLQTL 344 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 67.8 bits (164), Expect = 4e-10, Method: Composition-based stats. 
Identities = 27/53 (50%), Positives = 41/53 (77%), Gaps = 1/53 (1%) Query: 3 RLASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLL 54 +LASRF + + +R D PL+ +++ RV PS+F++ HESRSERY+YIPT ++L Sbjct: 35 QLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_B8F9V3 Putative uncharacterized protein n=4 Tax=Deltaproteobacteria RepID=B8F9V3_DESAA Length = 311 Score = 64.8 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 50/151 (33%), Gaps = 7/151 (4%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPG-RREHTKHMLRLRREGQITGKQVPE 100 + RY + I +++ L++ GF Q + H K P Sbjct: 105 TPRYQPVDNIRVMERLEQMGFGHDMEIQLALDAEFFSLSIPDHEKTFAVGND--DKLTPG 162 Query: 101 IILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKG-DVVSQVIEGAYEVLGI 159 I + NS G ++ + + R VC NGL+ + H V+ E +V G Sbjct: 163 ITVCNSEVGRAALSIAAFVLRLVCTNGLIAKTAVSA-SYRHISAKVMEVFPETLQQVAGE 221 Query: 160 FD--RVEEKRDAMQSLLLPPPAQQALAKAAL 188 D + + + P + + + Sbjct: 222 LDVQQTRFRLSMESQVENPSNTIHSFNRQFM 252 >UniRef50_Q0RM54 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RM54_FRAAA Length = 360 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 53/168 (31%), Gaps = 14/168 (8%) Query: 40 SRSERYTYIPTISLLDSLQRE-GFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQ- 97 + +T I + + ++ G + V + GR +++L + G Sbjct: 116 HPRDTWTLIDHAEMGEIVEAFLGMENVQYETGGVLEKGRAVWA--LIKLDEPIALPGDNS 173 Query: 98 --VPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGE-----VRVPHKG---DVVS 147 +P +L N HDG S + R VC N E E H D + Sbjct: 174 LTLPYFLLRNRHDGNGSCSVSHTPVRVVCANTWKVSEMTDEANGTVFSFRHNEKWRDRLE 233 Query: 148 QVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGED 195 + + V F +E + + + + QQ + G Sbjct: 234 EAKQAIKGVRKQFTLYQEIAERLLDMTVTEKQQQMFVNDFIPTPTGAT 281 >UniRef50_Q5LU35 Putative uncharacterized protein n=1 Tax=Ruegeria pomeroyi RepID=Q5LU35_SILPO Length = 275 Score = 61.7 bits (148), Expect = 3e-08, Method: Composition-based stats. 
Identities = 53/266 (19%), Positives = 82/266 (30%), Gaps = 38/266 (14%) Query: 13 LIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQ-REGFQ--PFFACQ 69 L PL + L ++ + + + IP L+D ++ GF Sbjct: 32 LHAGASPLDYDGLRQL--------ETPEATSTHVPIPHHRLVDVVRLTLGFYGHTVEEEH 83 Query: 70 TRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLV 129 V G R LR G + L NSHD T + G VC N Sbjct: 84 HGVTPDGMRYFGVLSLR-----STYGDYTDTVGLRNSHDKTFPIGISFGSRVFVCDNLAF 138 Query: 130 CGESFGEVRVPHKG----DVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAK 185 + VR H D+ V + + + ++ L Q+L Sbjct: 139 IADHV--VRRKHTAQAKRDLPGLVGDLIEPLADQREAQHRVISRYRAANLS----QSLVD 192 Query: 186 AALTYRFGEDHQPVTESQILSPRRWQDESNDL-----WTTYQRIQENLIKGGLSGRNAKG 240 A+ + + VT RW++ +D W + + L GR A+ Sbjct: 193 HAVLELYRAEVITVT-RIAAVMERWENPPHDWGVKTAWRLFNCVT-----HALEGRIAEQ 246 Query: 241 GRSHTRAVRGIDGDVKLNRALWVMAE 266 +R ID LN V AE Sbjct: 247 PALTSRLHDVIDA-TCLNGNATVSAE 271 >UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZYJ5_DESOH Length = 308 Score = 61.7 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 22/156 (14%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDP--------GRREHTKHMLRLRREGQI 93 + +YT + +L+ L G+ P Q + GR+ Sbjct: 105 TPKYTPVDNFEILERLDSLGYGPDTKVQCSLDAEFLSLSIPDGRKAF----------DIN 154 Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGD-VVSQVIEG 152 + P I + NS G +S + + R VC NGL+ H ++ + + Sbjct: 155 GDRFKPGISISNSEVGLASLTISAFVLRLVCTNGLIARTGISA-SYRHVSTRILKEFPQT 213 Query: 153 AYEVLGI--FDRVEEKRDAMQSLLLPPPAQQALAKA 186 V + + + + P + + Sbjct: 214 IETVSKELGAQQRQFRISMEAPVDNPMQTMDSFNRQ 249 >UniRef50_B4CXI2 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CXI2_9BACT Length = 320 Score = 61.3 bits (147), Expect = 4e-08, Method: Composition-based stats. 
Identities = 38/212 (17%), Positives = 63/212 (29%), Gaps = 41/212 (19%) Query: 42 SERYTYIPTISLLDSLQREGFQPF------FACQTRVRDPGRREHTKHMLRLRREGQIT- 94 S RY + F P + G R M R+ ++ Sbjct: 76 SRRYRPLQNSEAFKF-----FDPIVGDRKAYFETAGALGEGERIWV--MARMPEVMEVVR 128 Query: 95 -GKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIE 151 ++L N+H+G S + R VCQN L+ + RV H + ++ E Sbjct: 129 GDDCFKYLLLSNTHNGEGSVIVKFTTVRVVCQNTLMLAMEDGQKAYRVRHSKQMQFKLDE 188 Query: 152 GAYEVL---GIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPR 208 A + +F E+ + ++ + + A V + + P Sbjct: 189 LADFLAITQQVFQEAEQTFRRLAAVKMTSERLEQYFDAVFP------RTDVQKKRHEKPP 242 Query: 209 RWQ------DESND---------LWTTYQRIQ 225 RW D D LW Y I Sbjct: 243 RWGFLQEMFDSQPDLQLPGVQGTLWGAYNAIT 274 >UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax=Bacteroidetes RepID=C6W397_DYAFD Length = 350 Score = 61.3 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 55/156 (35%), Gaps = 16/156 (10%) Query: 45 YTYIPTISLL---DSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPE- 100 Y + DS+ G G R +L Q+ + E Sbjct: 95 YQIVQNRDAFTFFDSIV--GNDGILYETAGALGKGERIFIT--AKLPGYIQVGSNDLIEK 150 Query: 101 -IILLNSHDGTSSYQMLPGLFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIEGAYEVL 157 + L SHDG+ S R VC N L V++ H + V + + A++V+ Sbjct: 151 YLFLTTSHDGSGSITAAFTPVRIVCANTLNAAMKNITNVVKIRHTSNAVER-LRTAHKVM 209 Query: 158 GIFDR----VEEKRDAMQSLLLPPPAQQALAKAALT 189 GI ++ VEE + + P + L + A+ Sbjct: 210 GIANKFSHEVEEIFNHWAKKPITDPQLKKLIEIAMA 245 >UniRef50_A1WP45 Putative uncharacterized protein n=2 Tax=Comamonadaceae RepID=A1WP45_VEREI Length = 312 Score = 60.1 bits (144), Expect = 8e-08, Method: Composition-based stats. 
Identities = 42/230 (18%), Positives = 72/230 (31%), Gaps = 31/230 (13%) Query: 37 KHESRSERYTYIPTISLLDSLQ----REGFQPFFACQTRVRDPGRREHTKHMLRLRREGQ 92 S RY + +L+ + REGF GRR + R E Sbjct: 64 PLSVVSPRYKIVQPKKMLEFYRSLVEREGFAI---ETIGSLKGGRRIWA--LARTHIEND 118 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGLFRAVCQNG--LVCGESFGEVRVPHKGDVVSQ 148 + G ++L+ S DG+ + R VC N + ES +V+V H Sbjct: 119 VLGSDRLKAYVLLITSCDGSLATTAKFTCVRVVCWNTQAIALNESGKQVKVRHNTAFNPD 178 Query: 149 VIEGAYEVLG--IFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGE----DHQPVTES 202 ++G ++G FD K ++ + L P Q + L E +H+ + ++ Sbjct: 179 AVKGEMGLMGAKAFDAFLGKMRSLTRVKLTEPDAQGIVACLLASPMDERKGVEHKGIEQT 238 Query: 203 QILSPRRWQDESN-----------DLWTTYQRIQENLIKGGLSGRNAKGG 241 + W + E RN + Sbjct: 239 KGFQKIMALFNGAAQGAHLPGVQGTAWGLLNAVTEYA-DHHARARNPENR 287 >UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betaproteobacteria RepID=Q47CX4_DECAR Length = 354 Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 54/153 (35%), Gaps = 16/153 (10%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHML------RLRREGQITG 95 S+RY + L +S+ P VR M RL+ E Sbjct: 115 SDRYRRLDNFDLAESVL-----PILQQLPEVRFESVELTETKMYLKCITPRLKYEMAPGD 169 Query: 96 KQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 +++ NS G + + P LFR VC NGL+ + +R H G + E Sbjct: 170 VVQAGVVISNSEVGQGTLSVQPLLFRLVCSNGLIVPD--RSLRKMHVGRALGGEDERIQV 227 Query: 156 VLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 R ++K ++ + Q A++ A Sbjct: 228 YQDDTLRADDKAFFLK---VRDVVQAAVSDATF 257 >UniRef50_C4ZMQ9 Phage/plasmid-related protein TIGR03299 n=1 Tax=Thauera sp. MZ1T RepID=C4ZMQ9_THASP Length = 334 Score = 54.0 bits (128), Expect = 6e-06, Method: Composition-based stats. 
Identities = 38/185 (20%), Positives = 62/185 (33%), Gaps = 28/185 (15%) Query: 51 ISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPE--IILLNSHD 108 + D+L +G +P G + RL + Q+ K V E ++ NSHD Sbjct: 93 AEMFDALLGQG-RPI-YHTGGYLKNGEVVWL--LARLPGDIQVQEKDVIETYLLFSNSHD 148 Query: 109 GTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGI--------- 159 G+S+ + R VCQN L V G V + +G Y VL Sbjct: 149 GSSAIDIRLTTVRVVCQNTLSLALDNTSV-----GKVFRRAHDGRYRVLKEEARAFFEFS 203 Query: 160 ---FDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTE-SQILSPRRWQDESN 215 + + + + A + L +PVT + R W+ Sbjct: 204 VKRSEEAQALFGRLANAECDDRAFEDFLAQLLPDP----KRPVTAGQNLQVQRAWETRLA 259 Query: 216 DLWTT 220 ++ T Sbjct: 260 NVRAT 264 >UniRef50_Q19YQ9 Gp96 n=7 Tax=unclassified Siphoviridae RepID=Q19YQ9_9CAUD Length = 400 Score = 53.6 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 69/211 (32%), Gaps = 23/211 (10%) Query: 68 CQTRVRDPGRREHTK-----HMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRA 122 D GRR HM + + + N HDG S R Sbjct: 175 ETGGSLDGGRRTFVTMKMPDHMELVSPITGKRDVTDLYLSIFNHHDGGGSLVANISPVRV 234 Query: 123 VCQNG--LVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIF----DRVEEKRDAMQSLLLP 176 VC N + + V + H G+ ++ E +LG+ D + + M + + Sbjct: 235 VCANTQRMAERAAVSRVSIRHTGEAQVRLEE-VRRILGLTWKYQDTYVAEVEEMAKIEMS 293 Query: 177 PPAQQALAKAAL-TYRFGEDHQPVTESQILSPRRWQ---------DESNDLWTTYQRIQE 226 A+ ++ + + + ++ ++ ++ D + Y + E Sbjct: 294 NVETFAIMRSVFEVDKVDPESRSASQRTQMATEAFEIYRSSATVDDFRGVAFGGYNAVTE 353 Query: 227 NLIKG-GLSGRNAKGGRSHTRAVRGIDGDVK 256 + + G++ + R + G G++K Sbjct: 354 WVDHYMPVRGKDNVDVKRALRTINGGGGEIK 384 >UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WVT0_9SYNE Length = 352 Score = 53.6 bits (127), Expect = 8e-06, Method: Composition-based stats. 
Identities = 28/161 (17%), Positives = 52/161 (32%), Gaps = 1/161 (0%) Query: 22 REELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHT 81 +E+ + + D S RY + L D++ + A R + Sbjct: 91 QEQPEQRMIRTMGTDARAFLSRRYRRLDNFDLADAVLPTLLEMQGARVVSCELTETRMYL 150 Query: 82 KHMLRLRREGQITGKQV-PEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVP 140 K + + G V + + NS G S ++ P ++R VC NG+V + R Sbjct: 151 KVVTDRIQADVKVGDAVQAGVCISNSEIGMGSLRVEPLIYRLVCTNGMVSPDRSARNRFT 210 Query: 141 HKGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQ 181 H G + + + ++ L A Sbjct: 211 HLGRAAADTPDAYELFSDKTLEADNTAFFLKVQDLVRDAVD 251 >UniRef50_C5CKG6 Phage/plasmid-related protein TIGR03299 n=10 Tax=Proteobacteria RepID=C5CKG6_VARPS Length = 342 Score = 53.2 bits (126), Expect = 8e-06, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 67/238 (28%), Gaps = 33/238 (13%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQR-EGFQPFFACQTRVRDPGRREHTKHMLRLRREGQ 92 + S RY + +L+ + + V GR+ + R ++ Sbjct: 83 TRAPLSVVSSRYQVVQPREVLEFYRDLTEIGGYEMETAGVLKGGRKVWA--LARTGQQAV 140 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG--ESFGEVRVPHK------ 142 + G + ++L S DGT + + P R VC N L + +RVPH Sbjct: 141 LKGNDIVNGYLLLATSCDGTLATSVTPTTVRVVCSNTLAVALDATSNVIRVPHSTSFDPD 200 Query: 143 ------GDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLL---------LPPPAQQALAKAA 187 G + Q E Y + + R + ++A+Q + P A Sbjct: 201 AVKRQLGIAIGQWDEFMYRMKTLSQRKVKTKEALQYIERVLYGPSELNPADDVSTQAAQT 260 Query: 188 LTYR----FGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 + W + E + RN + Sbjct: 261 EASPAPRGWAARKVLELYEGRGRGAELAAAKGTTWGLLSAMTE-FVDHERRARNREYR 317 >UniRef50_UPI00016C3597 hypothetical protein GobsU_16407 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3597 Length = 235 Score = 53.2 bits (126), Expect = 9e-06, Method: Composition-based stats. 
Identities = 31/217 (14%), Positives = 53/217 (24%), Gaps = 28/217 (12%) Query: 70 TRVRDPGRREHTKHMLRLRREGQITGK-QVPEIILLNSHDGTSSYQMLPGLFRAVCQNGL 128 G+R + + G +L N+HD + + + R VC N L Sbjct: 24 AGSLKEGKRIWVLARINGAEAEVVDGDPVRGYFLLSNAHDASQAVRAQFTSIRVVCANTL 83 Query: 129 VCGESFGE------VRVPHKGDVVS---QVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPA 179 + E VRV H + + V F M S LP Sbjct: 84 NAADRRAERGFEDCVRVRHTTGLETSLVLVQHTIDMAAKTFSASLADYQRMVSRRLPVDG 143 Query: 180 QQALAKAALTYRFGEDHQPVTES--QILSPRRWQDESND-------LWTTYQRIQENLIK 230 + L L W Y I + + Sbjct: 144 FRKYVIDVLEVPESVQRMGKMPKAWDTLQWAYHAAPGARINGVFGTYWGAYNAITDWV-- 201 Query: 231 GGLSGRNAKGGRSHTRAVRG--IDGDVKLNRALWVMA 265 + +G + + +L + + +A Sbjct: 202 -----DHTRGVKDADSRLDSAWFGSGARLKQRAFELA 233 >UniRef50_B7I5L8 Phage/plasmid-related protein n=5 Tax=Moraxellaceae RepID=B7I5L8_ACIB5 Length = 342 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 59/183 (32%), Gaps = 11/183 (6%) Query: 15 RRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVR 73 R L E RV+ + S+RY + +L+ + Q F V Sbjct: 48 RGQNILMPYEEQRVLYRSDTHAPLSVVSQRYQEVQPKEILEFYRDLTEQSGFELETAGVL 107 Query: 74 DPGRREHTKHMLRLRREGQITGKQVP--EIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG 131 GR+ + R + + K V I+L + DGT + R VC N L Sbjct: 108 KGGRKFWA--LARTGQSAALKSKDVSNGYILLATACDGTLATTAQFTSIRVVCSNTLAIA 165 Query: 132 -----ESFGEVRVPH-KGDVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAK 185 S G V+VPH ++ + + +D + + + A Sbjct: 166 LRGQNSSVGVVKVPHSTKFDAEKIKQQLGISVRAWDEHMYEMKQLSQRKVTQQEAAAYFD 225 Query: 186 AAL 188 A Sbjct: 226 AVF 228 >UniRef50_A8ZKZ6 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=A8ZKZ6_ACAM1 Length = 351 Score = 50.9 bits (120), Expect = 5e-05, Method: Composition-based stats. 
Identities = 24/137 (17%), Positives = 49/137 (35%), Gaps = 14/137 (10%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQT------RVRDPGRREHTKHMLRLRREGQITG 95 S+RY + + +++ P A V R + K + + G Sbjct: 111 SDRYRRVDNFEIAETVL-----PVLAEFGQGLKIMSVGLTDSRLYIKAVNERVQLDVRKG 165 Query: 96 KQV-PEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAY 154 V +++ NS G S ++ P ++R VC NGL+ + + H G V + Sbjct: 166 DAVQAGVVISNSEIGLGSIRIEPLVYRLVCLNGLISQD--HSFKKYHVGRQVGESDAAVE 223 Query: 155 EVLGIFDRVEEKRDAMQ 171 +++ ++ Sbjct: 224 LFSDETREADDRALLLK 240 >UniRef50_Q5Y1B4 Putative uncharacterized protein n=1 Tax=uncultured organism BAC21E04 RepID=Q5Y1B4_9ZZZZ Length = 315 Score = 50.5 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 71/238 (29%), Gaps = 13/238 (5%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV-PE 100 S++Y + SLL + + C + D + T + R + G V Sbjct: 81 SKQYEIVQNDSLLRMAEFIREEVDMDCVIVLSDGAKVCFTATL-RGAETDIVPGDTVKRR 139 Query: 101 IILLNSHDGTSSYQMLPGLFRAVCQNGLVCG---ESFGEVRVPHKGDVVSQVIEGAYEVL 157 I+ HDG + R VCQN L + HK + + Sbjct: 140 IVGYLGHDGKTGCGAKFTNIRVVCQNTLTAALGEAGGAHSSITHKNGANNNFDTLINSID 199 Query: 158 GIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILS---PRRWQDE- 213 + + M+ + ++ Q + + L R + Sbjct: 200 VARQDFVTECELMREFSRASMGVSQFNEFVDEVYNIDEGQVFRKREKLERAFTRGFGFRF 259 Query: 214 -SNDLWTTYQRIQENLIKGGLSGRNAKGGRSHTRAVRGIDGDVKLNRALWVMAEALLT 270 +W+ I E + + AKG R ++++ + +A L+T Sbjct: 260 APASVWSAVNAITE-VETSTRNTTAAKGRAQFAR--GTFGVGAQISKRAFALARDLVT 314 >UniRef50_C6RKU8 Phage/plasmid-related protein n=12 Tax=Acinetobacter RepID=C6RKU8_ACIRA Length = 347 Score = 50.5 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 31/163 (19%), Positives = 54/163 (33%), Gaps = 10/163 (6%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVRDPGRREHTKHMLRLRREGQ 92 + S+RY + +L+ + Q F V G++ + R + Sbjct: 74 THAPLSVVSQRYQEVQPKQILEFYRDLTEQSGFELETAGVLKGGKKFWA--LARTGQSAA 131 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG----ESFGEVRVPHKGDV- 145 + GK V I+L + DGT + R VC N L S G V+VPH Sbjct: 132 LKGKDVSNAYILLATACDGTLATTAQFTSIRVVCNNTLAIALKGQSSAGVVKVPHSTRFD 191 Query: 146 VSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 ++ + + +D + + + A A Sbjct: 192 AGKIKQQLGISVRQWDEHMYEMKQLSQRKVTQTEAAAYFDAVF 234 >UniRef50_C0VFU1 Putative uncharacterized protein n=4 Tax=Acinetobacter RepID=C0VFU1_9GAMM Length = 357 Score = 49.4 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 54/167 (32%), Gaps = 14/167 (8%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVRDPGRREHTKHMLRLRREGQ 92 + + S+R+ + +L+ + Q F V G++ + + + Sbjct: 74 THEPLSVVSQRFQEVQPKEILEFYRDLTEQSGFELETAGVLKGGKKFWA--LAKTGQTSA 131 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG--------ESFGEVRVPHK 142 + GK V I+L + DGT + R VC N L + G V+VPH Sbjct: 132 LKGKDVSNGYILLATACDGTLATTAQFTSIRVVCNNTLAIALKAQNAGSNNTGVVKVPHS 191 Query: 143 GDV-VSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 +V + +D + + + A A Sbjct: 192 TRFDAEKVKHQLGISVRAWDEHMYEMKQLSQRKVTQQEAAAYFDAVF 238 >UniRef50_Q2IFF9 Putative uncharacterized protein n=3 Tax=Anaeromyxobacter RepID=Q2IFF9_ANADE Length = 325 Score = 47.0 bits (110), Expect = 7e-04, Method: Composition-based stats. 
Identities = 31/222 (13%), Positives = 61/222 (27%), Gaps = 18/222 (8%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLR-LRREGQITGKQVP- 99 S+ Y + + +L E A T G +L + ++ G P Sbjct: 74 SKSYEVVQFSEVARTLV-EAAGDVKAVFTTAGTLGPVGIKGWLLGEIPNPIKVKGDPSPI 132 Query: 100 --EIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG---ESFGEVRVPHKGDVVSQVIEGAY 154 ++ HDG ++ + R VC N L R+ H + ++ E Sbjct: 133 RKYVLGTTGHDGVTAVVLKNVATRVVCANTLGVALGERGGATWRIQHTANAKMRLDEAGK 192 Query: 155 E---VLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYRFGE-DHQPVTESQILSPR-- 208 ++ ++R+ E + + +A + + DH + R Sbjct: 193 AFRQLVESYERLGELANVLAVTPFTTRQMKATIDRLMPVPKDDRDHTKPEAERGKVIRLF 252 Query: 209 ----RWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRSHTR 246 + W Q E + R Sbjct: 253 DTAAAIERVRGTAWAALQGWTEYADHHRQVRDTGREDPRRAR 294 >UniRef50_C8X3A3 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X3A3_DESRD Length = 243 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 52/161 (32%), Gaps = 17/161 (10%) Query: 37 KHESRSERYTYIPTISLLD----SLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQ 92 + + +P ++D ++ R+G +D + R Q Sbjct: 15 PVVPGTATWNPVPHNQVIDTVETAISRQGLGIVRKRFELTQDGAN-VFASY-----RLDQ 68 Query: 93 ITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEG 152 EI NS + + G F VC N + G+ F E R H + + Sbjct: 69 SRNGSSWEIGFRNSVAKKFAVGITAGTFTIVCSNLVFTGD-FLEFR-RHTKGLDLDELRA 126 Query: 153 AYE-----VLGIFDRVEEKRDAMQSLLLPPPAQQALAKAAL 188 + +E+ ++ +++ LP Q L AL Sbjct: 127 IANRALLGTISRLQSLEQWQEGLKAKPLPRRDMQCLTYEAL 167 >UniRef50_A6SWN5 Uncharacterized conserved protein n=39 Tax=Proteobacteria RepID=A6SWN5_JANMA Length = 318 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. 
Identities = 32/172 (18%), Positives = 58/172 (33%), Gaps = 24/172 (13%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQ----REGFQPFFACQTRVRDPGRREH----TKHML 85 ++ S RY + +L+ + R GF+ V GR+ T Sbjct: 74 TKAALSVVSNRYQVVQPDEILEFYRDLTTRSGFE---LETAGVMKGGRKLWALAKTGQSF 130 Query: 86 RLRREGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGES--FGEVRVPHKG 143 ++ + +I G ++L + DG+ + R VC N L S V+VPH Sbjct: 131 SIKDKDRING----YLLLATACDGSLATTAQFTSVRVVCNNTLAIALSGGKDVVKVPHST 186 Query: 144 ----DVVSQVIEGAYEVLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALTYR 191 D+V + + ++ F K + L A + + Sbjct: 187 TFEPDLVKKELGISFSAWDNFRYRMTKLAERK---LKDQEADAFLRTLFSIP 235 >UniRef50_B3VM79 Gp52 n=2 Tax=unclassified Siphoviridae RepID=B3VM79_9CAUD Length = 403 Score = 45.9 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 55/197 (27%), Gaps = 27/197 (13%) Query: 89 REGQITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFG-----EVRVPHKG 143 + P +++ S DG+ + + VC N L S + + H Sbjct: 184 HNDRAGFDYRPNLLIYTSFDGSLKTTLARTITATVCDNTLQIAASEAKRAGTALTIGHTR 243 Query: 144 DVVSQVIEGAYE---VLGIFDRVEEKRDAMQSLLLPPPAQQALAKAALT---YRFGEDHQ 197 ++ E + D D + + +A L + + Sbjct: 244 LSSDRMPEARQVLGIIEQESDDFNTLLDEWAATPVSTKQFEAWLDEVLPVPEVKVIDGKA 303 Query: 198 PVTESQILSPRRWQ-------DESNDLWT--------TYQRIQENLIKGGLSGRNAKGGR 242 I+ +R DE W + + + G + + G + Sbjct: 304 KTNSQTIVLNKREAIGDLYYTDERAATWVGTKLGVRQAWNTAHHHKFRSG-NAKQFDGNK 362 Query: 243 SHTRAVRGIDGDVKLNR 259 + R + +K+++ Sbjct: 363 TLARVESNMMRSLKMDK 379 >UniRef50_A6WZ56 Putative uncharacterized protein n=1 Tax=Ochrobactrum anthropi ATCC 49188 RepID=A6WZ56_OCHA4 Length = 402 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 21/68 (30%), Positives = 30/68 (44%), Gaps = 1/68 (1%) Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCG-ESFGEVRVPHKGDVVSQVIEG 152 + NS G+S+ ++ RAVC N L+ G E F E+ + H S+ IE Sbjct: 241 PDLVFRGFYITNSEVGSSALKVAAFYLRAVCCNRLMWGVEGFQEISMRHSKYAPSRFIEE 300 Query: 153 AYEVLGIF 160 A L F Sbjct: 301 ARPALEGF 308 >UniRef50_Q18F79 Putative uncharacterized protein n=1 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18F79_HALWD Length = 351 Score = 45.1 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 47/223 (21%), Positives = 82/223 (36%), Gaps = 13/223 (5%) Query: 43 ERYTYIPTISLLDSLQRE----GFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQV 98 + Y+ I +L+++ RE G +P+ + R E +M + +I Sbjct: 84 DFYSVIQYGDVLEAVHREMGDQGVEPYGTV-SLSGSAHRMEAPVYMSGDQARVEIGEGDR 142 Query: 99 PEIILLNS--HDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVV-SQVIEGAYE 155 + + S H G G R VC+NG+ S + H E Sbjct: 143 LNMGVKVSAGHSGHMGVHYNLGAERLVCRNGMTRFVSDLHLDQSHGERFQPGLAYEAVRG 202 Query: 156 VLGIFDRVEEKRDAM-QSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDES 214 VLG DRVEE+ + + LL + L R E+ + + + + ES Sbjct: 203 VLGSTDRVEERLERARKRELLNLDEARLLLHDIGVDRVAENSEADIMNALFEEVESR-ES 261 Query: 215 NDLWTTYQ---RIQENLIKGGLSGRNAKGGRSHTRAVRGIDGD 254 L+ YQ R+ ++ G G + R + + +DG+ Sbjct: 262 PSLYEVYQAGTRVVDHYADSGSPGHFQETVRDNVARLLDVDGE 304 >UniRef50_C4V5A4 Putative uncharacterized protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V5A4_9FIRM Length = 365 Score = 45.1 bits (105), Expect = 0.002, Method: Composition-based stats. 
Identities = 27/135 (20%), Positives = 52/135 (38%), Gaps = 14/135 (10%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPGRREHTKHML------RLRREGQITG 95 S+RY + + L ++ P H+ +L+ E + Sbjct: 114 SDRYRRLDNLELCTAVL-----PVIQEMKDAAIMSCEVTESHLYLKVVNKKLKAEVGVGD 168 Query: 96 KQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 ++ NS G S ++ P ++R VC+NGL+ + F + + H G V+ + AYE Sbjct: 169 VVQAGFVVSNSEVGLGSLKVEPLIYRLVCKNGLIVKD-FAQ-KKYHVGRQVAAEDDTAYE 226 Query: 156 VLGIFDRVEEKRDAM 170 L + + + Sbjct: 227 -LYSDETLAQDDKTF 240 >UniRef50_A8ZPY1 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A8ZPY1_ACAM1 Length = 209 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 39/98 (39%), Gaps = 12/98 (12%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQT------RVRDPGRREHTKHMLRLRREGQITG 95 S+RY + + +++ P A V R + K + + G Sbjct: 111 SDRYRRVDNFEIAETVL-----PVLAEFGPGLKIMSVGLTDSRLYIKAVNERVQLDVRKG 165 Query: 96 KQV-PEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGE 132 V +++ NS G S ++ P ++R VC NG++ + Sbjct: 166 DAVQAGVVISNSEIGLGSIRIEPLVYRLVCLNGMISQD 203 >UniRef50_B5LJ78 Gp67 n=1 Tax=Mycobacterium phage Myrna RepID=B5LJ78_9CAUD Length = 418 Score = 41.7 bits (96), Expect = 0.024, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 6/84 (7%) Query: 69 QTRVRDPGRREHTKHMLRLRREGQITGKQVP----EIILLNSHDGTSSYQMLPGLFRAVC 124 V P + H + +++ K P ++ NS G ++Q+LP VC Sbjct: 211 YLSVDVPEIKIHAQDLVKNYHFYDQDSKDNPFMSAGLVFTNSEVGRGAFQILPRAVVQVC 270 Query: 125 QNGLVCGESFGEVRVPHKGDVVSQ 148 +NG+ R H G + + Sbjct: 271 KNGM--RRDVDGFRKVHLGGRLQE 292 >UniRef50_B8KMK8 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KMK8_9GAMM Length = 348 Score = 41.7 bits (96), Expect = 0.031, Method: Composition-based stats. 
Identities = 42/204 (20%), Positives = 69/204 (33%), Gaps = 29/204 (14%) Query: 51 ISLLDSLQREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQITGKQVPEIILLNSHDGT 110 L+ SL+R G P E + +G G + + LN +G Sbjct: 134 EQLVKSLRRLGILPRSKVFKTPFGEVVEEFST-----PGQGGQVGLRCRAVYGLN--NGY 186 Query: 111 SSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGD------VVSQVIEGAYEVLGIFDRVE 164 SSY+++ G +C NGL ES G R H D V V E + ++ Sbjct: 187 SSYRIIWGRVVLICSNGLTAFESVGRDRWIHNSDVDVDVFVEESVTEAYSRLAVTEKQIA 246 Query: 165 EKRDAMQSLLLPPPAQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRI 224 + R + L LA A + V + I + D ++ W+ Q + Sbjct: 247 DARSRAINYSLLDQFMTRLALA------NASKERVRKRLIHE---FSDTGHNEWSVSQAL 297 Query: 225 QENLIKGGLSGRNAKGGRSHTRAV 248 +G + K +R + Sbjct: 298 T-------YAGEHEKPIPIGSREI 314 >UniRef50_C0GUY0 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GUY0_9DELT Length = 236 Score = 41.3 bits (95), Expect = 0.032, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 66/215 (30%), Gaps = 22/215 (10%) Query: 37 KHESRSERYTYIPTISLLDSL----QREGFQPFFACQTRVRDPGRREHTKHMLRLRREGQ 92 +E + + ++D++ + +G G + R Q Sbjct: 15 PEVQGTETWNPVHHSLVIDAVENAVRDKGLGIQDKRFELTTGGGN------LFASYRLDQ 68 Query: 93 ITGKQVPEIILLNSHDGTSSYQMLPGLFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEG 152 +I NS + + G + VC N + G+ F E R KG ++ Sbjct: 69 GRDGVNWQIGFRNSIAKRFAVGITAGTYTMVCSNLVFAGD-FVEFRKHTKGLDTDELFSM 127 Query: 153 AYEVLGIFDRVEEKRDAM----QSLLLPPPAQQALA-KAALTYRFGEDHQPVTESQILSP 207 + + E +A +++ L + L+ +A F + L Sbjct: 128 SGRAIETTVNRLESLEAWQLDLKNIPLSQRHMRILSFEAMRKQAFPASR----FHRFLEA 183 Query: 208 RRWQDE--SNDLWTTYQRIQENLIKGGLSGRNAKG 240 R + L++ Y I + LS + + Sbjct: 184 YREEIALNGLTLYSFYHSITRTIRDQSLSRISTRS 218 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.128 0.330 Lambda K H 0.267 0.0395 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,456,190,872 Number of Sequences: 3077464 
Number of extensions: 54531646
Number of successful extensions: 146145
Number of sequences better than 1.0e-01: 61
Number of HSP's better than 0.1 without gapping: 67
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 145957
Number of HSP's gapped (non-prelim): 109
length of query: 273
length of database: 1,040,396,356
effective HSP length: 127
effective length of query: 146
effective length of database: 649,558,428
effective search space: 94835530488
effective search space used: 94835530488
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 92 (40.1 bits)
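
The footer statistics above are related by simple arithmetic, and the following is a minimal sketch of how they can be cross-checked. It is not BLAST's own code. It assumes the usual Karlin-Altschul relations (bit score S' = (lambda * S - ln K) / ln 2 and E = search space * 2^-S') and assumes that the second "Lambda K H" row (0.267, 0.0395) holds the gapped parameters. The function names are hypothetical. Because the report prints rounded bit scores, recomputed E-values are only approximate.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 273
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 127                            # "effective HSP length"
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0395  # assumed to be the gapped Lambda and K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Approximate E-value for a bit score in the given search space."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_query, eff_db, space)  # 146 649558428 94835530488
print(round(bit_score(92, LAMBDA_GAPPED, K_GAPPED), 1))  # compare with "S2: 92 (40.1 bits)"
print(f"{expect(50.5, space):.0e}")  # compare with the hits reported at 50.5 bits
```

Under these assumptions the recomputed effective lengths and search space reproduce the footer values exactly, and the bit score for S2 and the E-value for the 50.5-bit hits agree with the report to the printed precision.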