BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (273 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 570 e-161 UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 385 e-105 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 265 1e-69 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 245 1e-63 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 221 3e-56 UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o... 212 9e-54 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 212 1e-53 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 207 4e-52 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 192 1e-47 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 162 9e-39 UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu... 141 2e-32 UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide... 117 5e-25 UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ... 95 2e-18 UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or... 59 1e-07 UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ... 50 6e-05 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 570 bits (1468), Expect = e-161, Method: Compositional matrix adjust. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG Sbjct: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR 120 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR Sbjct: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR 120 Query: 121 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH 180 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH Sbjct: 121 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH 180 Query: 181 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG 240 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG Sbjct: 181 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG 240 Query: 241 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS Sbjct: 241 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 385 bits (988), Expect = e-105, Method: Compositional matrix adjust. Identities = 177/267 (66%), Positives = 220/267 (82%), Gaps = 1/267 (0%) Query: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 Query: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 QPFFACQ+RVRD RRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 Query: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+F++V + +AM+ + L Q Sbjct: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 Query: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 Query: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 T TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 RTHTRAVRGIDGDVKLNRALWVMAETL 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 265 bits (677), Expect = 1e-69, Method: Compositional matrix adjust. Identities = 133/276 (48%), Positives = 190/276 (68%), Gaps = 6/276 (2%) Query: 3 LASRFGR-YNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 LA+RFGR + I PL ++ L + VPS+F+ + H+SRSERY Y+PTI+I+ LR EG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 PFFA QS RD R ++KHMLRLRRE + E E I++NSHDG+S++Q+ G+ RF Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 VCTN ++ G F E+RVPHKG+I +IEG Y V F ++ D E MK + L+ DEQ L Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 G +L+ RY E+++P+TPEQII PRR+ED+ + LWTT+ +QEN+I+GGL GR + + Sbjct: 186 LGEVSLVARY-GEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAE 244 Query: 242 N----TRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 +R+R I GID ++ +N+ALW +AE ++ K+ Sbjct: 245 GRIRRSRSRPINGIDQNVTLNRALWTLAEGMQRLKT 280 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 245 bits (625), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 121/267 (45%), Positives = 173/267 (64%), Gaps = 8/267 (2%) Query: 15 RERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDL 74 R PLT+ EL VPS+F+ + HESRS R+ +PTI +++ LR EGF+PFFA Q+R R Sbjct: 48 RGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQARTRIE 107 Query: 75 GRREYSKHMLRLRREGHIN-GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNF 133 G+ E++KHMLRLR G +N E EI+L+N++DG+S+YQMIPG FRFVC NGL+ G F Sbjct: 108 GKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMAGETF 167 Query: 134 GEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRY-- 191 E++V H G+ +G+VIEGAY VL +V D ++ K I L E+ + AA +R+ Sbjct: 168 EEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSLRFPA 227 Query: 192 -EDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGR--SASG--KNTRTR 246 + P+ P ++ PRR ED+ DLWT + VQEN ++GG+ GR + SG + R Sbjct: 228 TAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRRQTVR 287 Query: 247 AITGIDGDIRINKALWVIAEQFRKWKS 273 +TGID +N+ALW++ E+ + KS Sbjct: 288 EVTGIDQSRALNRALWMLTERMAELKS 314 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 221 bits (562), Expect = 3e-56, Method: Compositional matrix adjust. Identities = 118/281 (41%), Positives = 168/281 (59%), Gaps = 12/281 (4%) Query: 5 SRFGRYNSIHRERP-LTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQP 63 +RFG + R L + L P+VF+ DKH SRS++YTYIPT+ ++ L EGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 64 FFACQSRVRDLGRREYSKHMLRLRREGHIN---GQEVPEIILLNSHDGSSSYQMIPGIFR 120 RD +R Y+KH+LRLRR G G E++LLNSHDG+SSYQ++ G+FR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 121 FVCTNGLVCGNNFGEI-RVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQ 179 +C+NGLVC + +I ++PHKGDIV QVI+GAY ++ ++V MK+I L EQ Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 180 HLFGRAALMVRYEDE-NKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSA 238 F AA +R+ E + PV P QI PRR ED N LW + R QE +I+GG+ + Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 239 SGKNTR------TRAITGIDGDIRINKALWVIAEQFRKWKS 273 + + R TR + G+DG+ +N+ALWV+A + + K+ Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRMAELKA 294 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 212 bits (540), Expect = 9e-54, Method: Compositional matrix adjust. Identities = 102/225 (45%), Positives = 150/225 (66%), Gaps = 5/225 (2%) Query: 52 IINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRR---EGHINGQEVPEIILLNSHDG 108 +I L EG+ P A +SRVR R+ +SKH+LR RR E + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 109 SSSYQMIPGIFRFVCTNGLVCGN-NFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNME 167 S +YQ+ G+FR VC+NG++ + N G+++ H GD+V +VIEG YE++ ++ +E Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 168 AMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQEN 227 K + L+ EQ +F +AL VR+ E + P P+ ++ PRR ED+ NDLW T+QRVQEN Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWR-EGEAPCMPQALLRPRRHEDQGNDLWATYQRVQEN 179 Query: 228 MIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQFRKWK 272 M+KGG+ GRSA G+ TRA+ +DG++++NKALW + EQ + K Sbjct: 180 MLKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQMAELK 224 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 212 bits (539), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 114/285 (40%), Positives = 173/285 (60%), Gaps = 15/285 (5%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 M + + R+++ R +T+ E+ + PS+F+ HESRS+R+ IPTI ++ L EG Sbjct: 108 MTIYTETARFDT---ARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEG 164 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRR----EGHINGQEVPEIILLNSHDGSSSYQMIP 116 F P A QS R G+ +++KH++RLRR + + G V EI+L N++DG+S+Y+++ Sbjct: 165 FVPVGAKQSASRTEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLA 224 Query: 117 GIFRFVCTNGLVC-GNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLN 175 G+FR C N LV I+V H GD+ +VIEG Y VL ++ + LN Sbjct: 225 GLFRIRCMNSLVTQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLN 284 Query: 176 SDEQHLFGRAALMVRYED---ENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGG 232 DEQ + AA ++R+ D E KTP+ PEQ++ PRR +D+ +DLWT W QEN+I+GG Sbjct: 285 RDEQQIMAEAAHVLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGG 344 Query: 233 LS--GRSASGKNTR--TRAITGIDGDIRINKALWVIAEQFRKWKS 273 L GR G+ R +RA+ GID DI++NKALW+I E+ + K+ Sbjct: 345 LRGIGREDLGRPRRVKSRAVNGIDQDIKLNKALWLIGEKMAELKA 389 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 207 bits (526), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 112/270 (41%), Positives = 156/270 (57%), Gaps = 13/270 (4%) Query: 12 SIHRE-RPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSR 70 S HR +PLTD++L + PS+F+ KHESRS+RYTYIPTI ++ LR EGF P A Q Sbjct: 8 SAHRHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPVMARQGN 67 Query: 71 VRDLGRREYSKHMLRLRREGHIN-----GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTN 125 R G+ EY+KH++R R H G PE+ LLNSHDG+S+Y++I + R C N Sbjct: 68 SRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMMRLACEN 127 Query: 126 GLVCGN-NFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGR 184 G+V + EI VPHKG + +VIEG+Y VL K + L +Q F Sbjct: 128 GMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQQKGFAE 187 Query: 185 AALMVRY-EDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGR--SASGK 241 A + +Y +D + P TPE + RR D+ DLW RVQE+ I+GG++G G+ Sbjct: 188 AVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFRWDEDGR 247 Query: 242 NTR---TRAITGIDGDIRINKALWVIAEQF 268 N + R + IDGDI++NKA+W +A+ Sbjct: 248 NRKRVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 192 bits (488), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 101/254 (39%), Positives = 158/254 (62%), Gaps = 6/254 (2%) Query: 19 LTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRRE 78 LT ++L + PS+F+ SERY I T ++I++L EGF P A QS R ++ Sbjct: 16 LTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKATQSASRSEEKKV 75 Query: 79 YSKHMLRLR-REGHINGQEV-PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEI 136 +SKH++R R R+ H G + PE++L+NSHDG SSY+++ G++R VCTNGLV G ++ E+ Sbjct: 76 FSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTNGLVAGKSYDEV 135 Query: 137 RVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENK 196 RV H+GD++G VIEG Y V+ K+ +E M + L ++ F A +R+ ++ Sbjct: 136 RVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQAHALRFSEDAN 195 Query: 197 TPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG----KNTRTRAITGID 252 + P+ ++ PRR ED + DL++ + VQEN+IKGG+ G + + R+R IT ID Sbjct: 196 LVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWRRARSRKITSID 255 Query: 253 GDIRINKALWVIAE 266 +++IN+ LW IAE Sbjct: 256 QNVKINRDLWTIAE 269 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 162 bits (411), Expect = 9e-39, Method: Compositional matrix adjust. Identities = 89/267 (33%), Positives = 154/267 (57%), Gaps = 20/267 (7%) Query: 18 PLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRR 77 PLT+++L Q PS+F+ + + S++Y +I TI++IN++RD + P ++ VRD + Sbjct: 7 PLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDEKKE 66 Query: 78 EYSKHMLRLRR-EGHIN-GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN-FG 134 + KH +R R + +N G+ V E++L NSHD S + + G+FRFVC NGLV + F Sbjct: 67 GFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDEVFE 126 Query: 135 EIRVPHKGD-------IVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAAL 187 ++ H GD + ++ + Y++L + ++ +I L D++ F +AA+ Sbjct: 127 SYQIKHLGDKENDVSIAINKIAKAKYDIL-------NKIKLFSKIPLTQDDKASFAKAAI 179 Query: 188 MVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSA-SGKNTRTR 246 +R+E K V ++ P R ED+++DL+TT+ +QE++I+G +SG +A + + +R Sbjct: 180 PLRFEKHLK--VDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSR 237 Query: 247 AITGIDGDIRINKALWVIAEQFRKWKS 273 I I D INK LW +AE K K+ Sbjct: 238 IIKSISTDTDINKKLWNMAESIAKIKA 264 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 141 bits (355), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 92/277 (33%), Positives = 146/277 (52%), Gaps = 26/277 (9%) Query: 17 RPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGR 76 R L+ D+L + PSVF+ S RYT++ T +++ LR EG++P A Q RVR R Sbjct: 14 RALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQQRVRLENR 73 Query: 77 REYSKHMLRLRREGHIN------GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGL-VC 129 + + H LR R + G PE+IL N+HDG+ +Y++ G++R VC NGL V Sbjct: 74 QGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLVCRNGLTVA 133 Query: 130 GNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMV 189 +F + + H + A V +V + + + + L +H F A+ + Sbjct: 134 DADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHSFAARAMAL 193 Query: 190 RYEDENKTPVT----PEQIITPRRWEDKQNDLWTTWQRVQENMIKGGL--SGR--SASG- 240 R+ ++ PVT P+Q++ P R+ D+ DLWTT+ VQE + +GGL +G +A G Sbjct: 194 RW--DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGHIPAAEGA 251 Query: 241 -------KNTRTRAITGIDGDIRINKALWVIAEQFRK 270 +NT TR + G+ R+NKALW +AE+F + Sbjct: 252 VFPTHYLRNT-TRPVGGLTEGQRLNKALWNLAEEFSR 287 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 117 bits (292), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 59/113 (52%), Positives = 83/113 (73%), Gaps = 1/113 (0%) Query: 156 LGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQN 215 + FD V + E M+ + L Q +AAL R+ +E++ P+T EQ++ PRRWEDK++ Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQ-PITEEQVLQPRRWEDKKD 59 Query: 216 DLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQF 268 DLWT +QR+QEN+IKGGLSGR+A GK RTR++ GIDGDI++NKALWV+ E+ Sbjct: 60 DLWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 95.1 bits (235), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 49/170 (28%), Positives = 88/170 (51%), Gaps = 3/170 (1%) Query: 17 RPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGR 76 +PL+++EL + PS+F+ + + S++Y +I TI+II ++R + P ++ VR+ + Sbjct: 6 QPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNEDK 65 Query: 77 REYSKHMLRLRREGHI--NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN-F 133 Y +H +R R + E++L NSHD S + + G+FRFVC NGLV + F Sbjct: 66 EGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADEVF 125 Query: 134 GEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFG 183 ++ H G+ V ++ DK+ D + +I L ++ F Sbjct: 126 ESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 25/54 (46%), Positives = 40/54 (74%), Gaps = 1/54 (1%) Query: 1 MRLASRFGRYN-SIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINII 53 M+LASRF ++ ++ + PL+DD++ + PS+F+ HESRSERY+YIPT ++ Sbjct: 34 MQLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 50.4 bits (119), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 25/85 (29%), Positives = 46/85 (54%), Gaps = 5/85 (5%) Query: 79 YSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRV 138 + HM+++ + + ++++NS+DGS ++Q+ G FR VCTNG++ G F + V Sbjct: 127 FPAHMVQIGSGDKV----ILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDV 182 Query: 139 PHKGDI-VGQVIEGAYEVLGVFDKV 162 H G + GQV + F+ + Sbjct: 183 RHTGTMNFGQVTRQVTTAVSSFENM 207 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 421 e-116 UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 384 e-105 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 355 1e-96 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 343 3e-93 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 333 4e-90 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 333 5e-90 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 331 1e-89 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 328 1e-88 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 328 2e-88 UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o... 298 1e-79 UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu... 275 1e-72 UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ... 220 4e-56 UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide... 138 2e-31 UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ... 102 2e-20 UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or... 73 8e-12 Sequences not found previously or not previously below threshold: UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavoba... 55 2e-06 UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfo... 54 5e-06 UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostri... 53 9e-06 UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophu... 50 6e-05 UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victiva... 50 6e-05 UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 47 6e-04 UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 46 0.001 UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfo... 45 0.002 UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobac... 43 0.007 UniRef50_B9NX09 Putative uncharacterized protein n=1 Tax=Rhodoba... 43 0.011 UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=... 43 0.012 UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax... 43 0.014 UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=... 43 0.015 UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwen... 42 0.017 UniRef50_C2LEJ7 Putative uncharacterized protein n=1 Tax=Proteus... 42 0.020 UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax... 42 0.027 UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betapro... 42 0.031 UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synecho... 40 0.068 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG Sbjct: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR 120 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR Sbjct: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR 120 Query: 121 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH 180 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH Sbjct: 121 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH 180 Query: 181 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG 240 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG Sbjct: 181 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG 240 Query: 241 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS Sbjct: 241 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. Identities = 177/267 (66%), Positives = 220/267 (82%), Gaps = 1/267 (0%) Query: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 Query: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 QPFFACQ+RVRD RRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 Query: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+F++V + +AM+ + L Q Sbjct: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 Query: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 Query: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 T TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 RTHTRAVRGIDGDVKLNRALWVMAETL 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 355 bits (911), Expect = 1e-96, Method: Composition-based stats. Identities = 133/276 (48%), Positives = 190/276 (68%), Gaps = 6/276 (2%) Query: 3 LASRFGR-YNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 LA+RFGR + I PL ++ L + VPS+F+ + H+SRSERY Y+PTI+I+ LR EG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 PFFA QS RD R ++KHMLRLRRE + E E I++NSHDG+S++Q+ G+ RF Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 VCTN ++ G F E+RVPHKG+I +IEG Y V F ++ D E MK + L+ DEQ L Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 G +L+ RY E+++P+TPEQII PRR+ED+ + LWTT+ +QEN+I+GGL GR + + Sbjct: 186 LGEVSLVARY-GEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAE 244 Query: 242 N----TRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 +R+R I GID ++ +N+ALW +AE ++ K+ Sbjct: 245 GRIRRSRSRPINGIDQNVTLNRALWTLAEGMQRLKT 280 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 343 bits (881), Expect = 3e-93, Method: Composition-based stats. Identities = 111/285 (38%), Positives = 169/285 (59%), Gaps = 15/285 (5%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 M + + R+++ R +T+ E+ + PS+F+ HESRS+R+ IPTI ++ L EG Sbjct: 108 MTIYTETARFDT---ARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEG 164 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGH----INGQEVPEIILLNSHDGSSSYQMIP 116 F P A QS R G+ +++KH++RLRR G V EI+L N++DG+S+Y+++ Sbjct: 165 FVPVGAKQSASRTEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLA 224 Query: 117 GIFRFVCTNGLVCG-NNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLN 175 G+FR C N LV I+V H GD+ +VIEG Y VL ++ + LN Sbjct: 225 GLFRIRCMNSLVTQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLN 284 Query: 176 SDEQHLFGRAALMVRYED---ENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGG 232 DEQ + AA ++R+ D E KTP+ PEQ++ PRR +D+ +DLWT W QEN+I+GG Sbjct: 285 RDEQQIMAEAAHVLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGG 344 Query: 233 LSGRSAS----GKNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 L G + ++RA+ GID DI++NKALW+I E+ + K+ Sbjct: 345 LRGIGREDLGRPRRVKSRAVNGIDQDIKLNKALWLIGEKMAELKA 389 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 333 bits (854), Expect = 4e-90, Method: Composition-based stats. Identities = 119/272 (43%), Positives = 172/272 (63%), Gaps = 8/272 (2%) Query: 10 YNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQS 69 + R PLT+ EL VPS+F+ + HESRS R+ +PTI +++ LR EGF+PFFA Q+ Sbjct: 43 GSIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQA 102 Query: 70 RVRDLGRREYSKHMLRLRREGHIN-GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLV 128 R R G+ E++KHMLRLR G +N E EI+L+N++DG+S+YQMIPG FRFVC NGL+ Sbjct: 103 RTRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLM 162 Query: 129 CGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALM 188 G F E++V H G+ +G+VIEGAY VL +V D ++ K I L E+ + AA Sbjct: 163 AGETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHS 222 Query: 189 VRY---EDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG----K 241 +R+ + P+ P ++ PRR ED+ DLWT + VQEN ++GG+ GR + + Sbjct: 223 LRFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIR 282 Query: 242 NTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 R +TGID +N+ALW++ E+ + KS Sbjct: 283 RQTVREVTGIDQSRALNRALWMLTERMAELKS 314 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 333 bits (853), Expect = 5e-90, Method: Composition-based stats. Identities = 117/281 (41%), Positives = 168/281 (59%), Gaps = 12/281 (4%) Query: 5 SRFGRYNSIHRERP-LTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQP 63 +RFG + R L + L P+VF+ DKH SRS++YTYIPT+ ++ L EGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 64 FFACQSRVRDLGRREYSKHMLRLRREGHIN---GQEVPEIILLNSHDGSSSYQMIPGIFR 120 RD +R Y+KH+LRLRR G G E++LLNSHDG+SSYQ++ G+FR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 121 FVCTNGLVCGNNFGEI-RVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQ 179 +C+NGLVC + +I ++PHKGDIV QVI+GAY ++ ++V MK+I L EQ Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 180 HLFGRAALMVRYEDE-NKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSA 238 F AA +R+ E + PV P QI PRR ED N LW + R QE +I+GG+ + Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 239 SGK------NTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 + + +TR + G+DG+ +N+ALWV+A + + K+ Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRMAELKA 294 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 331 bits (850), Expect = 1e-89, Method: Composition-based stats. Identities = 87/263 (33%), Positives = 147/263 (55%), Gaps = 6/263 (2%) Query: 15 RERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDL 74 PLT+++L Q PS+F+ + + S++Y +I TI++IN++RD + P ++ VRD Sbjct: 4 SNEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDE 63 Query: 75 GRREYSKHMLRLRREGHI--NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN 132 + + KH +R R G+ V E++L NSHD S + + G+FRFVC NGLV + Sbjct: 64 KKEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDE 123 Query: 133 -FGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRY 191 F ++ H GD V ++ + + ++ +I L D++ F +AA+ +R+ Sbjct: 124 VFESYQIKHLGDKENDVSIAINKIAKAKYDILNKIKLFSKIPLTQDDKASFAKAAIPLRF 183 Query: 192 EDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSA-SGKNTRTRAITG 250 E K V ++ P R ED+++DL+TT+ +QE++I+G +SG +A + + +R I Sbjct: 184 EKHLK--VDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSRIIKS 241 Query: 251 IDGDIRINKALWVIAEQFRKWKS 273 I D INK LW +AE K K+ Sbjct: 242 ISTDTDINKKLWNMAESIAKIKA 264 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 328 bits (841), Expect = 1e-88, Method: Composition-based stats. Identities = 111/282 (39%), Positives = 155/282 (54%), Gaps = 15/282 (5%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 M SR + +PLTD++L + PS+F+ KHESRS+RYTYIPTI ++ LR EG Sbjct: 1 MSFLSRVSAH---RHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEG 57 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHIN-----GQEVPEIILLNSHDGSSSYQMI 115 F P A Q R G+ EY+KH++R R H G PE+ LLNSHDG+S+Y++I Sbjct: 58 FFPVMARQGNSRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKII 117 Query: 116 PGIFRFVCTNGLVCGN-NFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHL 174 + R C NG+V + EI VPHKG + +VIEG+Y VL K + L Sbjct: 118 AAMMRLACENGMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTL 177 Query: 175 NSDEQHLFGRAALMVRYEDE-NKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGL 233 +Q F A + +Y D+ + P TPE + RR D+ DLW RVQE+ I+GG+ Sbjct: 178 TERQQKGFAEAVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGM 237 Query: 234 SGRS-----ASGKNTRTRAITGIDGDIRINKALWVIAEQFRK 270 +G + K R + IDGDI++NKA+W +A+ Sbjct: 238 TGFRWDEDGRNRKRVTARPVKSIDGDIKLNKAVWHLAQMLSD 279 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 328 bits (840), Expect = 2e-88, Method: Composition-based stats. Identities = 100/254 (39%), Positives = 155/254 (61%), Gaps = 6/254 (2%) Query: 19 LTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRRE 78 LT ++L + PS+F+ SERY I T ++I++L EGF P A QS R ++ Sbjct: 16 LTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKATQSASRSEEKKV 75 Query: 79 YSKHMLRLRREG-HINGQE-VPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEI 136 +SKH++R R H G PE++L+NSHDG SSY+++ G++R VCTNGLV G ++ E+ Sbjct: 76 FSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTNGLVAGKSYDEV 135 Query: 137 RVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENK 196 RV H+GD++G VIEG Y V+ K+ +E M + L ++ F A +R+ ++ Sbjct: 136 RVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQAHALRFSEDAN 195 Query: 197 TPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG----KNTRTRAITGID 252 + P+ ++ PRR ED + DL++ + VQEN+IKGG+ G + + R+R IT ID Sbjct: 196 LVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWRRARSRKITSID 255 Query: 253 GDIRINKALWVIAE 266 +++IN+ LW IAE Sbjct: 256 QNVKINRDLWTIAE 269 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 298 bits (764), Expect = 1e-79, Method: Composition-based stats. Identities = 101/225 (44%), Positives = 149/225 (66%), Gaps = 5/225 (2%) Query: 52 IINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREG---HINGQEVPEIILLNSHDG 108 +I L EG+ P A +SRVR R+ +SKH+LR RR + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 109 SSSYQMIPGIFRFVCTNGLVCGN-NFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNME 167 S +YQ+ G+FR VC+NG++ + N G+++ H GD+V +VIEG YE++ ++ +E Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 168 AMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQEN 227 K + L+ EQ +F +AL VR+ E + P P+ ++ PRR ED+ NDLW T+QRVQEN Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWR-EGEAPCMPQALLRPRRHEDQGNDLWATYQRVQEN 179 Query: 228 MIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQFRKWK 272 M+KGG+ GRSA G+ TRA+ +DG++++NKALW + EQ + K Sbjct: 180 MLKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQMAELK 224 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 275 bits (703), Expect = 1e-72, Method: Composition-based stats. Identities = 89/276 (32%), Positives = 141/276 (51%), Gaps = 24/276 (8%) Query: 17 RPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGR 76 R L+ D+L + PSVF+ S RYT++ T +++ LR EG++P A Q RVR R Sbjct: 14 RALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQQRVRLENR 73 Query: 77 REYSKHMLRLRREGHIN------GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGL-VC 129 + + H LR R + G PE+IL N+HDG+ +Y++ G++R VC NGL V Sbjct: 74 QGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLVCRNGLTVA 133 Query: 130 GNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMV 189 +F + + H + A V +V + + + + L +H F A+ + Sbjct: 134 DADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHSFAARAMAL 193 Query: 190 RYEDENKTPVT----PEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGR----SASGK 241 R+ ++ PVT P+Q++ P R+ D+ DLWTT+ VQE + +GGL +A G Sbjct: 194 RW--DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGHIPAAEGA 251 Query: 242 -------NTRTRAITGIDGDIRINKALWVIAEQFRK 270 TR + G+ R+NKALW +AE+F + Sbjct: 252 VFPTHYLRNTTRPVGGLTEGQRLNKALWNLAEEFSR 287 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 220 bits (561), Expect = 4e-56, Method: Composition-based stats. Identities = 49/170 (28%), Positives = 88/170 (51%), Gaps = 3/170 (1%) Query: 17 RPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGR 76 +PL+++EL + PS+F+ + + S++Y +I TI+II ++R + P ++ VR+ + Sbjct: 6 QPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNEDK 65 Query: 77 REYSKHMLRLRREGHI--NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN-F 133 Y +H +R R + E++L NSHD S + + G+FRFVC NGLV + F Sbjct: 66 EGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADEVF 125 Query: 134 GEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFG 183 ++ H G+ V ++ DK+ D + +I L ++ F Sbjct: 126 ESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. Identities = 59/115 (51%), Positives = 82/115 (71%), Gaps = 1/115 (0%) Query: 156 LGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQN 215 + FD V + E M+ + L Q +AAL R+ +E P+T EQ++ PRRWEDK++ Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEE-HQPITEEQVLQPRRWEDKKD 59 Query: 216 DLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQFRK 270 DLWT +QR+QEN+IKGGLSGR+A GK RTR++ GIDGDI++NKALWV+ E+ + Sbjct: 60 DLWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKMYE 114 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 51/101 (50%), Gaps = 5/101 (4%) Query: 79 YSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRV 138 + HM+++ + + ++++NS+DGS ++Q+ G FR VCTNG++ G F + V Sbjct: 127 FPAHMVQIGSGDKV----ILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDV 182 Query: 139 PHKGDI-VGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDE 178 H G + GQV + F+ + + + LN + Sbjct: 183 RHTGTMNFGQVTRQVTTAVSSFENMGQYWDTLINSPLNRKD 223 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 73.1 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 25/54 (46%), Positives = 40/54 (74%), Gaps = 1/54 (1%) Query: 1 MRLASRFGRYN-SIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINII 53 M+LASRF ++ ++ + PL+DD++ + PS+F+ HESRSERY+YIPT ++ Sbjct: 34 MQLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXR9_FLAPJ Length = 285 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 67/188 (35%), Gaps = 41/188 (21%) Query: 97 VPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEG----A 152 P + NS+DGS G FR VC+NGL + + H+G+I V+ Sbjct: 113 RPMLRFTNSYDGSCKTSGTFGFFREVCSNGLHTASTDIGFSLKHRGNINELVLPAIGKTI 172 Query: 153 YEVLG----VFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTP---------- 198 Y L + + + K + QH+ + + ++E +K P Sbjct: 173 YNFLDNEFYELRRKFEVLADFKIADPSEIVQHI-AQQTKLFKFESSDKNPAPSLNARLVI 231 Query: 199 --VTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIR 256 + E +I ED ++W + E ++ G + D + Sbjct: 232 ETIENETLIL---KED--ANMWMVYNAFNE-LLHGKIK--------------KTFDQQKK 271 Query: 257 INKALWVI 264 I+K ++ + Sbjct: 272 IDKEIFNL 279 >UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZS75_DESOH Length = 318 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 58/157 (36%), Gaps = 18/157 (11%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRL----RREGHING-- 94 +ERY + ++++++L GF P Q + D ++R+ R G G Sbjct: 108 TERYKPLDNMDVLSQLLRHGFDPDTQVQYAIDDG------MFLVRIPEYARAFGVNPGYG 161 Query: 95 ---QEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDI-VGQVIE 150 + VP + NS G ++ + +R VCTNGL+ + R H + + E Sbjct: 162 KLDEIVPGVSFANSEVGLLAFSIEAFFYRLVCTNGLISKTSSTFSRFKHISNRGLENFPE 221 Query: 151 GAYEVLGVFDKVTD--NMEAMKEIHLNSDEQHLFGRA 185 V+ + + + + F R Sbjct: 222 TIAGVIEDSVRKQEQFKLSRQSPVENPIRSIETFARQ 258 >UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RIH4_9CLOT Length = 312 Score = 53.1 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 76/184 (41%), Gaps = 19/184 (10%) Query: 41 SERYTYIPTINII---NKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV 97 ++RY + + ++L EG + GRR + + +L + I+G E+ Sbjct: 76 TDRYKVVQNEDAFAFTDQLLGEG---VTYETAGSLQNGRRTWL--LAKLPQRYIISGDEI 130 Query: 98 -PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFG--EIRVPHKGDIVGQVIEGAYE 154 P ++ +N+HDG+ + ++ R VC N L + H GDI G++ + Y Sbjct: 131 TPYMVFMNTHDGTGAIRVAMTPVRVVCMNTLNLALSTAKRSWSTNHTGDIAGKMEDARYT 190 Query: 155 VLGVFDKVTD---NMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWE 211 +L +++ ++ MK + L+ + + A + P +Q R E Sbjct: 191 LLYADRYMSELGKAIDHMKRLRLSERQVMEYIDALFPL-----YDNPTPQQQKNLNRMKE 245 Query: 212 DKQN 215 D + Sbjct: 246 DMKT 249 >UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LV02_SYNAS Length = 264 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 97/253 (38%), Gaps = 44/253 (17%) Query: 36 KHESRSERYTYIPTINIINK--------LRDEGFQPFFACQSRVRDLGRREYSKHMLRLR 87 ++ Y + ++ +K LRD R G + ++ +L+ + Sbjct: 22 PLPEPTDSYMPVSHYDLADKFLMISQDILRDYKL--VGENYGIAR-QGNQFFA--VLKFQ 76 Query: 88 REGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDI--- 144 RE G I NS+D S + + G FVC N + G + H ++ Sbjct: 77 RERSEIG---LSIAFRNSYDRSMAIGLAIGASVFVCDNLALSGEIV--VMKKHTKNVWSE 131 Query: 145 -VGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTP----V 199 + I Y+ +D++ +++A K L D+ F A+ + + + +P V Sbjct: 132 LEEKAIATIYKSQNNYDQLIGDVDAFKS--LPVDDNGAF--QAMGLLFGNNIISPRQLTV 187 Query: 200 TPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINK 259 E+ + P E + +LW+ + E++ + +T ++ IR+++ Sbjct: 188 LKEEWLKPSHEEFEPRNLWSFYNAATESL--------------KSSPPVTIMEKHIRLHE 233 Query: 260 ALWVIAEQFRKWK 272 AL + ++ + Sbjct: 234 ALTYLGKEASNVQ 246 >UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N225_9BACT Length = 241 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 72/202 (35%), Gaps = 15/202 (7%) Query: 36 KHESRSERYTYIPTINIINKL----RDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGH 91 + + + +P +I+ + R +Q R+ G+R + ++R+ R Sbjct: 20 PTPAATASWKPVPHSEVIDAVTDVVRAHNWQILDEQYGLARN-GQRMFG--VIRINRTSS 76 Query: 92 INGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIV--GQVI 149 I + NSHD + + + G+ VC N + G+ ++ H I G V+ Sbjct: 77 SEWSRC--IGICNSHDRTIAVGLAAGLNVQVCANLMFGGSTV--LKRRHTSRIELNGLVV 132 Query: 150 EGAYEVLGVFDKVTDNMEAMKEIHLNSD-EQHLFGRAALMVRYEDENKTPVTPEQIITPR 208 E + F + E +K + D + +AA + P+ + PR Sbjct: 133 EAIDALEDDFLTLETVAEDLKIQFVRDDTARAAIVKAAEAGAVNSCDIVPI-FREFKEPR 191 Query: 209 RWEDKQNDLWTTWQRVQENMIK 230 E + W EN K Sbjct: 192 YEEFAEPTRWALLNAFTENAKK 213 >UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Z8_9PLAN Length = 327 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 75/202 (37%), Gaps = 21/202 (10%) Query: 84 LRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE---IRVPH 140 +R++ + + ++L N+HDGSS+ ++ R VC N L +N I + H Sbjct: 128 IRVKNSDDLVDKF---LLLSNAHDGSSALRVYFTPIRVVCQNTLNLADNRSTGQGISILH 184 Query: 141 KGDIVGQVIEGAYEVLGVFDKVTD----NMEAMKEIHLNSDEQHLFGRAALMVRYEDENK 196 KG++ + I A VLG+ ++ D ++ + H +S + F ++ + +N Sbjct: 185 KGNLHTK-IREAQRVLGLAEEFYDEAEGIIDILASHHPSSVQVEAFFQSVIPDPIGADNA 243 Query: 197 TPVTPEQIITPRRWEDKQNDL-------WTTWQRVQENMIKGGLSGRSASGKNTRTRAIT 249 +T D+ W + V E + RS +R + Sbjct: 244 RARKVRDRLTCLFETGIGQDMPEIKGTSWAAYNAVTE-FVDHHRPTRSTDPLERASRRLD 302 Query: 250 G--IDGDIRINKALWVIAEQFR 269 R+ W +A Sbjct: 303 SSWFGSGARLKAKAWNLAFDMA 324 >UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q5L2_CATAD Length = 329 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 43/103 (41%), Gaps = 12/103 (11%) Query: 74 LGRREYSKHMLRLRREGHINGQEVPEIIL--LNSHDGSSSYQMIPGIFRFVCTN--GLVC 129 GR+ + +RL + G + ++ + LNSHDG+ +Y++I R VC N L Sbjct: 127 EGRQVFVT--MRLPETMTVAGTDRLDLYISGLNSHDGTGAYKLIVTPIRIVCANTQSLAL 184 Query: 130 GNNFGEIRVPHKGDIVGQVIEG------AYEVLGVFDKVTDNM 166 + H ++ E ++ + F+K + M Sbjct: 185 DRARSSFSIRHTESAKKKIAEARKALGLMFKYVEEFEKAAERM 227 >UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZYJ5_DESOH Length = 308 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 28/127 (22%), Positives = 46/127 (36%), Gaps = 15/127 (11%) Query: 43 RYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV-PEII 101 +YT + I+ +L G+ P Q + S + R+ ING P I Sbjct: 107 KYTPVDNFEILERLDSLGYGPDTKVQCSL---DAEFLSLSIPDGRKAFDINGDRFKPGIS 163 Query: 102 LLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDK 161 + NS G +S + + R VCTNGL+ H + +L F + Sbjct: 164 ISNSEVGLASLTISAFVLRLVCTNGLIARTGI-SASYRHV----------STRILKEFPQ 212 Query: 162 VTDNMEA 168 + + Sbjct: 213 TIETVSK 219 >UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPG4_MYCSK Length = 344 Score = 43.5 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 52/178 (29%), Gaps = 35/178 (19%) Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTN--GLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLG 157 + LNSHDGSS ++ + R VC N + H G GA L Sbjct: 162 LAALNSHDGSSKFRFLVTPVRIVCANTQSAAIARAAASFGISHTG--------GAAVALQ 213 Query: 158 VFDKVTDNMEAMKEIHLNSDEQHLFGRAALMV----RYEDENKTPVTPEQIITPRRWEDK 213 + + + E A + + R+ E E T R D Sbjct: 214 EARRALKLS--WRYVEAFEQEAAALYAAPMDLDQMRRFAGELVDVDGAESKTTARNRRDT 271 Query: 214 QNDL-----------------WTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGD 254 N + W + V E + S A+G RA+ + G Sbjct: 272 ANAIVKLWVSSPTVAPIAGTRWAAYNAVTEYV--DHYSKVRAAGDPQSVRALRAVTGG 327 >UniRef50_B9NX09 Putative uncharacterized protein n=1 Tax=Rhodobacteraceae bacterium KLH11 RepID=B9NX09_9RHOB Length = 39 Score = 43.1 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 8/24 (33%), Positives = 15/24 (62%) Query: 249 TGIDGDIRINKALWVIAEQFRKWK 272 GID + +N+A+W + E+ + K Sbjct: 10 KGIDQNKALNRAIWSLTEKMAELK 33 >UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=Cyanobacteria RepID=B4VVD2_9CYAN Length = 336 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 53/143 (37%), Gaps = 16/143 (11%) Query: 97 VPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNF--------GEIRVPHKGDIVGQV 148 P ++L NSHDGS++ + R VC N L F I +PH + Q+ Sbjct: 134 RPYLLLHNSHDGSTAVWLQFTPVRVVCWNTLNGAARFRFGDLWQKKAICIPHSLSLTEQL 193 Query: 149 IEGAYEVLGVFDKVTD-NMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTP--EQII 205 E + +L + K ++E + + L + R + + P Q++ Sbjct: 194 -EHIHNILDLTQKEFQYSVEEYQAMAHKELTTELLAD--YIGRVLGTTQPTLHPAWSQLV 250 Query: 206 T--PRRWEDKQNDLWTTWQRVQE 226 ++ LW + + E Sbjct: 251 ANFESGRGNQGQTLWDAYNSITE 273 >UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax=Bacteroidetes RepID=C6W397_DYAFD Length = 350 Score = 42.7 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 7/95 (7%) Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE--IRVPHKGDIVGQVIEGAYEVLG 157 + L SHDGS S R VC N L +++ H + V ++ A++V+G Sbjct: 152 LFLTTSHDGSGSITAAFTPVRIVCANTLNAAMKNITNVVKIRHTSNAVERL-RTAHKVMG 210 Query: 158 VFDKVTDNMEA----MKEIHLNSDEQHLFGRAALM 188 + +K + +E + + + A+ Sbjct: 211 IANKFSHEVEEIFNHWAKKPITDPQLKKLIEIAMA 245 >UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=Actinomycetales RepID=C4DCZ5_9ACTO Length = 395 Score = 42.7 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 46/121 (38%), Gaps = 16/121 (13%) Query: 42 ERYTYIPTINIINKLRD--EGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV-- 97 +Y + LR+ E + + VR GRR + + +R + Sbjct: 151 SKYHTVQNRECFEFLRNLVESYDVVWESAGAVR-GGRRTF----VSMRLPDTVTVDAAGI 205 Query: 98 -----PEIILLNSHDGSSSYQMIPGIFRFVCTNG--LVCGNNFGEIRVPHKGDIVGQVIE 150 P +++ NSHDGSSS + +R VC N L N + + H + Q+ + Sbjct: 206 NDTITPFVVVFNSHDGSSSITAVVTPYRPVCANTERLALDNAYTSWSIRHTESAMHQMRQ 265 Query: 151 G 151 Sbjct: 266 A 266 >UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKH6_9FLAO Length = 312 Score = 42.3 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 59/153 (38%), Gaps = 21/153 (13%) Query: 97 VPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVL 156 +P + NS+DGS G +R VC+NGL E + H + ++ + Sbjct: 141 LPMLRFKNSYDGSEKTSGHFGFYREVCSNGLHVSLAEIEFSIKHSKNNTHLIMPRLNNLF 200 Query: 157 GVF--------DKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTP----EQI 204 F K D M+ K I + + R L RYE +K P ++ Sbjct: 201 DKFLDNEFYTITKKFDKMKEFKIIDTQEFVKAILDRTKL-FRYECSDKNS-DPSKKSREV 258 Query: 205 ITPRRWE----DKQNDLWT---TWQRVQENMIK 230 I +E +++ +LW + V N++K Sbjct: 259 IEILNYEALLLNEEPNLWLGYNAFNSVLHNVLK 291 >UniRef50_C2LEJ7 Putative uncharacterized protein n=1 Tax=Proteus mirabilis ATCC 29906 RepID=C2LEJ7_PROMI Length = 39 Score = 41.9 bits (97), Expect = 0.020, Method: Composition-based stats. Identities = 15/29 (51%), Positives = 22/29 (75%) Query: 239 SGKNTRTRAITGIDGDIRINKALWVIAEQ 267 K+TRT ++ GID D ++NKALWV+ E+ Sbjct: 2 KSKHTRTCSVNGIDSDSKLNKALWVMTEK 30 >UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF46A9 Length = 348 Score = 41.5 bits (96), Expect = 0.027, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 42/131 (32%), Gaps = 13/131 (9%) Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFG--EIRVPHKGDIVGQVIEGAYEVLG 157 + LNSHDGS++++ + R VC N + H G + E + Sbjct: 163 LAALNSHDGSAAFRFLLSPIRIVCANTQSAAIRSAKSSFSIRHTGGARASIAEARNALKL 222 Query: 158 VFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQND- 216 + + ++ + A + V RR ++ + Sbjct: 223 SWRYIEAFEAEAAALYAAPMDTEEMRSFANTL-------LEVDSAGTTATRRHRRERANS 275 Query: 217 ---LWTTWQRV 224 LWT+ + + Sbjct: 276 IVKLWTSSETI 286 >UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betaproteobacteria RepID=Q47CX4_DECAR Length = 354 Score = 41.5 bits (96), Expect = 0.031, Method: Composition-based stats. Identities = 31/133 (23%), Positives = 49/133 (36%), Gaps = 15/133 (11%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRR-EYSKHML-----RLRREGHING 94 S+RY + ++ E P VR +K L RL+ E G Sbjct: 115 SDRYRRLDNFDL-----AESVLPILQQLPEVRFESVELTETKMYLKCITPRLKYE-MAPG 168 Query: 95 QEV-PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAY 153 V +++ NS G + + P +FR VC+NGL+ + +R H G +G E Sbjct: 169 DVVQAGVVISNSEVGQGTLSVQPLLFRLVCSNGLIVPDR--SLRKMHVGRALGGEDERIQ 226 Query: 154 EVLGVFDKVTDNM 166 + D Sbjct: 227 VYQDDTLRADDKA 239 >UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WVT0_9SYNE Length = 352 Score = 40.4 bits (93), Expect = 0.068, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 50/137 (36%), Gaps = 2/137 (1%) Query: 34 GDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHML-RLRREGHI 92 D S RY + ++ + + + A R Y K + R++ + + Sbjct: 104 TDARAFLSRRYRRLDNFDLADAVLPTLLEMQGARVVSCELTETRMYLKVVTDRIQADVKV 163 Query: 93 NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGA 152 + + NS G S ++ P I+R VCTNG+V + R H G + A Sbjct: 164 GDAVQAGVCISNSEIGMGSLRVEPLIYRLVCTNGMVSPDRSARNRFTHLGRAAADTPD-A 222 Query: 153 YEVLGVFDKVTDNMEAM 169 YE+ DN Sbjct: 223 YELFSDKTLEADNTAFF 239 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 387 e-106 UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 357 3e-97 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 329 6e-89 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 326 4e-88 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 313 3e-84 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 313 4e-84 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 312 1e-83 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 311 2e-83 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 308 1e-82 UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o... 280 4e-74 UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu... 262 9e-69 UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ... 203 4e-51 UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophu... 172 8e-42 UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victiva... 156 9e-37 UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 149 9e-35 UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostri... 139 1e-31 UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide... 128 2e-28 UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavoba... 122 1e-26 UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfo... 117 3e-25 UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ... 114 5e-24 UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 82 3e-14 UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or... 71 5e-11 Sequences not found previously or not previously below threshold: UniRef50_B9E574 Putative uncharacterized protein n=5 Tax=Clostri... 98 3e-19 UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwen... 81 5e-14 UniRef50_Q024R3 Putative uncharacterized protein n=1 Tax=Candida... 78 4e-13 UniRef50_UPI00017465AE hypothetical protein VspiD_04485 n=2 Tax=... 75 2e-12 UniRef50_A1SIX8 Putative uncharacterized protein n=2 Tax=Nocardi... 74 5e-12 UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax... 72 1e-11 UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=... 72 2e-11 UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=... 72 2e-11 UniRef50_Q5LU35 Putative uncharacterized protein n=1 Tax=Ruegeri... 70 7e-11 UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobac... 69 1e-10 UniRef50_B8F9V3 Putative uncharacterized protein n=4 Tax=Deltapr... 67 9e-10 UniRef50_Q0RM54 Putative uncharacterized protein n=1 Tax=Frankia... 64 4e-09 UniRef50_B4CXI2 Putative uncharacterized protein n=1 Tax=Chthoni... 64 6e-09 UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax... 63 1e-08 UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfo... 62 2e-08 UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synecho... 58 3e-07 UniRef50_A1WP45 Putative uncharacterized protein n=2 Tax=Comamon... 58 3e-07 UniRef50_UPI00016C3597 hypothetical protein GobsU_16407 n=1 Tax=... 58 4e-07 UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betapro... 57 5e-07 UniRef50_C4ZMQ9 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 57 5e-07 UniRef50_C6RKU8 Phage/plasmid-related protein n=12 Tax=Acinetoba... 56 2e-06 UniRef50_C5CKG6 Phage/plasmid-related protein TIGR03299 n=10 Tax... 55 2e-06 UniRef50_A8ZKZ6 Putative uncharacterized protein n=3 Tax=Cyanoba... 54 5e-06 UniRef50_Q19YQ9 Gp96 n=7 Tax=unclassified Siphoviridae RepID=Q19... 54 7e-06 UniRef50_B7I5L8 Phage/plasmid-related protein n=5 Tax=Moraxellac... 53 9e-06 UniRef50_C0VFU1 Putative uncharacterized protein n=4 Tax=Acineto... 53 1e-05 UniRef50_C8X3A3 Putative uncharacterized protein n=1 Tax=Desulfo... 52 3e-05 UniRef50_Q5Y1B4 Putative uncharacterized protein n=1 Tax=uncultu... 51 3e-05 UniRef50_Q2IFF9 Putative uncharacterized protein n=3 Tax=Anaerom... 48 3e-04 UniRef50_A6SWN5 Uncharacterized conserved protein n=39 Tax=Prote... 48 4e-04 UniRef50_A8ZPY1 Putative uncharacterized protein n=5 Tax=Bacteri... 46 0.001 UniRef50_A6WZ56 Putative uncharacterized protein n=1 Tax=Ochroba... 46 0.001 UniRef50_C1D7A8 Putative uncharacterized protein n=1 Tax=Laribac... 46 0.002 UniRef50_B3VM79 Gp52 n=2 Tax=unclassified Siphoviridae RepID=B3V... 45 0.002 UniRef50_Q2JC95 Putative uncharacterized protein n=2 Tax=Frankia... 45 0.003 UniRef50_C4V5A4 Putative uncharacterized protein n=1 Tax=Selenom... 44 0.006 UniRef50_C0GUY0 Putative uncharacterized protein n=2 Tax=Desulfo... 43 0.010 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 387 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG Sbjct: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR 120 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR Sbjct: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFR 120 Query: 121 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH 180 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH Sbjct: 121 FVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQH 180 Query: 181 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG 240 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG Sbjct: 181 LFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG 240 Query: 241 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS Sbjct: 241 KNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 357 bits (915), Expect = 3e-97, Method: Composition-based stats. Identities = 177/267 (66%), Positives = 220/267 (82%), Gaps = 1/267 (0%) Query: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 Query: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 QPFFACQ+RVRD RRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 Query: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+F++V + +AM+ + L Q Sbjct: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 Query: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 183 LAKAALTYRFG-EDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 Query: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 T TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 RTHTRAVRGIDGDVKLNRALWVMAETL 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 329 bits (843), Expect = 6e-89, Method: Composition-based stats. Identities = 133/276 (48%), Positives = 190/276 (68%), Gaps = 6/276 (2%) Query: 3 LASRFGR-YNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 LA+RFGR + I PL ++ L + VPS+F+ + H+SRSERY Y+PTI+I+ LR EG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 PFFA QS RD R ++KHMLRLRRE + E E I++NSHDG+S++Q+ G+ RF Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 VCTN ++ G F E+RVPHKG+I +IEG Y V F ++ D E MK + L+ DEQ L Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 G +L+ RY E+++P+TPEQII PRR+ED+ + LWTT+ +QEN+I+GGL GR + + Sbjct: 186 LGEVSLVARYG-EDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAE 244 Query: 242 N----TRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 +R+R I GID ++ +N+ALW +AE ++ K+ Sbjct: 245 GRIRRSRSRPINGIDQNVTLNRALWTLAEGMQRLKT 280 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 326 bits (836), Expect = 4e-88, Method: Composition-based stats. Identities = 110/285 (38%), Positives = 169/285 (59%), Gaps = 15/285 (5%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 M + + R+++ R +T+ E+ + PS+F+ HESRS+R+ IPTI ++ L EG Sbjct: 108 MTIYTETARFDT---ARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEG 164 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGH----INGQEVPEIILLNSHDGSSSYQMIP 116 F P A QS R G+ +++KH++RLRR G V EI+L N++DG+S+Y+++ Sbjct: 165 FVPVGAKQSASRTEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLA 224 Query: 117 GIFRFVCTNGLVCGN-NFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLN 175 G+FR C N LV I+V H GD+ +VIEG Y VL ++ + LN Sbjct: 225 GLFRIRCMNSLVTQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLN 284 Query: 176 SDEQHLFGRAALMVRYEDEN---KTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGG 232 DEQ + AA ++R+ D + KTP+ PEQ++ PRR +D+ +DLWT W QEN+I+GG Sbjct: 285 RDEQQIMAEAAHVLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGG 344 Query: 233 LSGRSAS----GKNTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 L G + ++RA+ GID DI++NKALW+I E+ + K+ Sbjct: 345 LRGIGREDLGRPRRVKSRAVNGIDQDIKLNKALWLIGEKMAELKA 389 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 313 bits (803), Expect = 3e-84, Method: Composition-based stats. Identities = 87/263 (33%), Positives = 146/263 (55%), Gaps = 6/263 (2%) Query: 15 RERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDL 74 PLT+++L Q PS+F+ + + S++Y +I TI++IN++RD + P ++ VRD Sbjct: 4 SNEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDE 63 Query: 75 GRREYSKHMLRLRREGHI--NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN 132 + + KH +R R G+ V E++L NSHD S + + G+FRFVC NGLV + Sbjct: 64 KKEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDE 123 Query: 133 -FGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRY 191 F ++ H GD V ++ + + ++ +I L D++ F +AA+ +R+ Sbjct: 124 VFESYQIKHLGDKENDVSIAINKIAKAKYDILNKIKLFSKIPLTQDDKASFAKAAIPLRF 183 Query: 192 EDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSAS-GKNTRTRAITG 250 E K V ++ P R ED+++DL+TT+ +QE++I+G +SG +A + +R I Sbjct: 184 EKHLK--VDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSRIIKS 241 Query: 251 IDGDIRINKALWVIAEQFRKWKS 273 I D INK LW +AE K K+ Sbjct: 242 ISTDTDINKKLWNMAESIAKIKA 264 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 313 bits (802), Expect = 4e-84, Method: Composition-based stats. Identities = 118/272 (43%), Positives = 171/272 (62%), Gaps = 8/272 (2%) Query: 10 YNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQS 69 + R PLT+ EL VPS+F+ + HESRS R+ +PTI +++ LR EGF+PFFA Q+ Sbjct: 43 GSIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQA 102 Query: 70 RVRDLGRREYSKHMLRLRREGHI-NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLV 128 R R G+ E++KHMLRLR G + E EI+L+N++DG+S+YQMIPG FRFVC NGL+ Sbjct: 103 RTRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLM 162 Query: 129 CGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALM 188 G F E++V H G+ +G+VIEGAY VL +V D ++ K I L E+ + AA Sbjct: 163 AGETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHS 222 Query: 189 VRY---EDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG----K 241 +R+ + P+ P ++ PRR ED+ DLWT + VQEN ++GG+ GR + + Sbjct: 223 LRFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIR 282 Query: 242 NTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 R +TGID +N+ALW++ E+ + KS Sbjct: 283 RQTVREVTGIDQSRALNRALWMLTERMAELKS 314 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 312 bits (798), Expect = 1e-83, Method: Composition-based stats. Identities = 117/281 (41%), Positives = 168/281 (59%), Gaps = 12/281 (4%) Query: 5 SRFGRYNSIHRERP-LTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQP 63 +RFG + R L + L P+VF+ DKH SRS++YTYIPT+ ++ L EGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 64 FFACQSRVRDLGRREYSKHMLRLRREGHIN---GQEVPEIILLNSHDGSSSYQMIPGIFR 120 RD +R Y+KH+LRLRR G G E++LLNSHDG+SSYQ++ G+FR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 121 FVCTNGLVCGNNFGEI-RVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQ 179 +C+NGLVC + +I ++PHKGDIV QVI+GAY ++ ++V MK+I L EQ Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 180 HLFGRAALMVRYEDE-NKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSA 238 F AA +R+ E + PV P QI PRR ED N LW + R QE +I+GG+ + Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 239 SGK------NTRTRAITGIDGDIRINKALWVIAEQFRKWKS 273 + + +TR + G+DG+ +N+ALWV+A + + K+ Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRMAELKA 294 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 311 bits (797), Expect = 2e-83, Method: Composition-based stats. Identities = 111/282 (39%), Positives = 155/282 (54%), Gaps = 15/282 (5%) Query: 1 MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEG 60 M SR + +PLTD++L + PS+F+ KHESRS+RYTYIPTI ++ LR EG Sbjct: 1 MSFLSRVSAH---RHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEG 57 Query: 61 FQPFFACQSRVRDLGRREYSKHMLRLRREGHIN-----GQEVPEIILLNSHDGSSSYQMI 115 F P A Q R G+ EY+KH++R R H G PE+ LLNSHDG+S+Y++I Sbjct: 58 FFPVMARQGNSRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKII 117 Query: 116 PGIFRFVCTNGLVCGNNF-GEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHL 174 + R C NG+V + EI VPHKG + +VIEG+Y VL K + L Sbjct: 118 AAMMRLACENGMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTL 177 Query: 175 NSDEQHLFGRAALMVRYEDE-NKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGL 233 +Q F A + +Y D+ + P TPE + RR D+ DLW RVQE+ I+GG+ Sbjct: 178 TERQQKGFAEAVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGM 237 Query: 234 SGRS-----ASGKNTRTRAITGIDGDIRINKALWVIAEQFRK 270 +G + K R + IDGDI++NKA+W +A+ Sbjct: 238 TGFRWDEDGRNRKRVTARPVKSIDGDIKLNKAVWHLAQMLSD 279 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 308 bits (789), Expect = 1e-82, Method: Composition-based stats. Identities = 102/265 (38%), Positives = 157/265 (59%), Gaps = 10/265 (3%) Query: 12 SIHRERP----LTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFAC 67 I P LT ++L + PS+F+ SERY I T ++I++L EGF P A Sbjct: 5 LIESGEPAMNVLTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKAT 64 Query: 68 QSRVRDLGRREYSKHMLRLRREG-HINGQE-VPEIILLNSHDGSSSYQMIPGIFRFVCTN 125 QS R ++ +SKH++R R H G PE++L+NSHDG SSY+++ G++R VCTN Sbjct: 65 QSASRSEEKKVFSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTN 124 Query: 126 GLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRA 185 GLV G ++ E+RV H+GD++G VIEG Y V+ K+ +E M + L ++ F Sbjct: 125 GLVAGKSYDEVRVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQ 184 Query: 186 ALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG----K 241 A +R+ ++ + P+ ++ PRR ED + DL++ + VQEN+IKGG+ G + + Sbjct: 185 AHALRFSEDANLVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWR 244 Query: 242 NTRTRAITGIDGDIRINKALWVIAE 266 R+R IT ID +++IN+ LW IAE Sbjct: 245 RARSRKITSIDQNVKINRDLWTIAE 269 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 280 bits (715), Expect = 4e-74, Method: Composition-based stats. Identities = 100/225 (44%), Positives = 149/225 (66%), Gaps = 5/225 (2%) Query: 52 IINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREG---HINGQEVPEIILLNSHDG 108 +I L EG+ P A +SRVR R+ +SKH+LR RR + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 109 SSSYQMIPGIFRFVCTNGLVCGNN-FGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNME 167 S +YQ+ G+FR VC+NG++ ++ G+++ H GD+V +VIEG YE++ ++ +E Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 168 AMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQEN 227 K + L+ EQ +F +AL VR+ E + P P+ ++ PRR ED+ NDLW T+QRVQEN Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWR-EGEAPCMPQALLRPRRHEDQGNDLWATYQRVQEN 179 Query: 228 MIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQFRKWK 272 M+KGG+ GRSA G+ TRA+ +DG++++NKALW + EQ + K Sbjct: 180 MLKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQMAELK 224 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 262 bits (669), Expect = 9e-69, Method: Composition-based stats. Identities = 86/276 (31%), Positives = 138/276 (50%), Gaps = 24/276 (8%) Query: 17 RPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGR 76 R L+ D+L + PSVF+ S RYT++ T +++ LR EG++P A Q RVR R Sbjct: 14 RALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQQRVRLENR 73 Query: 77 REYSKHMLRLRREGHIN------GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG 130 + + H LR R + G PE+IL N+HDG+ +Y++ G++R VC NGL Sbjct: 74 QGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLVCRNGLTVA 133 Query: 131 N-NFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMV 189 + +F + + H + A V +V + + + + L +H F A+ + Sbjct: 134 DADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHSFAARAMAL 193 Query: 190 RYEDENKTPVT----PEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASG----- 240 R+ ++ PVT P+Q++ P R+ D+ DLWTT+ VQE + +GGL Sbjct: 194 RW--DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGHIPAAEGA 251 Query: 241 ------KNTRTRAITGIDGDIRINKALWVIAEQFRK 270 TR + G+ R+NKALW +AE+F + Sbjct: 252 VFPTHYLRNTTRPVGGLTEGQRLNKALWNLAEEFSR 287 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 203 bits (517), Expect = 4e-51, Method: Composition-based stats. Identities = 49/172 (28%), Positives = 88/172 (51%), Gaps = 3/172 (1%) Query: 15 RERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDL 74 +PL+++EL + PS+F+ + + S++Y +I TI+II ++R + P ++ VR+ Sbjct: 4 STQPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNE 63 Query: 75 GRREYSKHMLRLRREGHI--NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN 132 + Y +H +R R + E++L NSHD S + + G+FRFVC NGLV + Sbjct: 64 DKEGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADE 123 Query: 133 -FGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFG 183 F ++ H G+ V ++ DK+ D + +I L ++ F Sbjct: 124 VFESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LV02_SYNAS Length = 264 Score = 172 bits (437), Expect = 8e-42, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 97/253 (38%), Gaps = 44/253 (17%) Query: 36 KHESRSERYTYIPTINIINK--------LRDEGFQPFFACQSRVRDLGRREYSKHMLRLR 87 ++ Y + ++ +K LRD R G + ++ +L+ + Sbjct: 22 PLPEPTDSYMPVSHYDLADKFLMISQDILRDYKL--VGENYGIAR-QGNQFFA--VLKFQ 76 Query: 88 REGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDI--- 144 RE G I NS+D S + + G FVC N + G + H ++ Sbjct: 77 RERSEIG---LSIAFRNSYDRSMAIGLAIGASVFVCDNLALSGEIV--VMKKHTKNVWSE 131 Query: 145 -VGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTP----V 199 + I Y+ +D++ +++A K L D+ F A+ + + + +P V Sbjct: 132 LEEKAIATIYKSQNNYDQLIGDVDAFKS--LPVDDNGAF--QAMGLLFGNNIISPRQLTV 187 Query: 200 TPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINK 259 E+ + P E + +LW+ + E++ + +T ++ IR+++ Sbjct: 188 LKEEWLKPSHEEFEPRNLWSFYNAATESL--------------KSSPPVTIMEKHIRLHE 233 Query: 260 ALWVIAEQFRKWK 272 AL + ++ + Sbjct: 234 ALTYLGKEASNVQ 246 >UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N225_9BACT Length = 241 Score = 156 bits (393), Expect = 9e-37, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 72/202 (35%), Gaps = 15/202 (7%) Query: 36 KHESRSERYTYIPTINIINKL----RDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGH 91 + + + +P +I+ + R +Q R+ G+R + ++R+ R Sbjct: 20 PTPAATASWKPVPHSEVIDAVTDVVRAHNWQILDEQYGLARN-GQRMFG--VIRINRTSS 76 Query: 92 INGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIV--GQVI 149 I + NSHD + + + G+ VC N + G+ ++ H I G V+ Sbjct: 77 SEWSRC--IGICNSHDRTIAVGLAAGLNVQVCANLMFGGSTV--LKRRHTSRIELNGLVV 132 Query: 150 EGAYEVLGVFDKVTDNMEAMKEIHLNSD-EQHLFGRAALMVRYEDENKTPVTPEQIITPR 208 E + F + E +K + D + +AA + P+ + PR Sbjct: 133 EAIDALEDDFLTLETVAEDLKIQFVRDDTARAAIVKAAEAGAVNSCDIVPI-FREFKEPR 191 Query: 209 RWEDKQNDLWTTWQRVQENMIK 230 E + W EN K Sbjct: 192 YEEFAEPTRWALLNAFTENAKK 213 >UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Z8_9PLAN Length = 327 Score = 149 bits (376), Expect = 9e-35, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 85/249 (34%), Gaps = 30/249 (12%) Query: 44 YTYIPTINII---NKLRDEGFQPFFACQSRVRDLGRREYSK----HMLRLRREGHINGQE 96 Y + + + +G + G R + +R++ + + Sbjct: 83 YVPVQNRQAFGFLDAVVADG--SLRYHTAGALGKGERIWLLAKLPSQIRVKNSDDLVDKF 140 Query: 97 VPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE---IRVPHKGDIVGQVIEGAY 153 ++L N+HDGSS+ ++ R VC N L +N I + HKG++ + I A Sbjct: 141 ---LLLSNAHDGSSALRVYFTPIRVVCQNTLNLADNRSTGQGISILHKGNLHTK-IREAQ 196 Query: 154 EVLGVFDKVTD----NMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRR 209 VLG+ ++ D ++ + H +S + F ++ + +N +T Sbjct: 197 RVLGLAEEFYDEAEGIIDILASHHPSSVQVEAFFQSVIPDPIGADNARARKVRDRLTCLF 256 Query: 210 WEDKQNDL-------WTTWQRVQENMIKGGLSGRSASGKNTRTRAITG--IDGDIRINKA 260 D+ W + V E + RS +R + R+ Sbjct: 257 ETGIGQDMPEIKGTSWAAYNAVTE-FVDHHRPTRSTDPLERASRRLDSSWFGSGARLKAK 315 Query: 261 LWVIAEQFR 269 W +A Sbjct: 316 AWNLAFDMA 324 >UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RIH4_9CLOT Length = 312 Score = 139 bits (349), Expect = 1e-31, Method: Composition-based stats. Identities = 47/246 (19%), Positives = 95/246 (38%), Gaps = 34/246 (13%) Query: 41 SERYTYIPTINII---NKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV 97 ++RY + + ++L EG + GRR + + +L + I+G E+ Sbjct: 76 TDRYKVVQNEDAFAFTDQLLGEG---VTYETAGSLQNGRRTWL--LAKLPQRYIISGDEI 130 Query: 98 -PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFG--EIRVPHKGDIVGQVIEGAYE 154 P ++ +N+HDG+ + ++ R VC N L + H GDI G++ + Y Sbjct: 131 TPYMVFMNTHDGTGAIRVAMTPVRVVCMNTLNLALSTAKRSWSTNHTGDIAGKMEDARYT 190 Query: 155 VLGVFDKVTD---NMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWE 211 +L +++ ++ MK + L+ + + A + P +Q R E Sbjct: 191 LLYADRYMSELGKAIDHMKRLRLSERQVMEYIDALFPL-----YDNPTPQQQKNLNRMKE 245 Query: 212 DKQN------DL-------WTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRIN 258 D + DL + V + SA+ K ++G+ I+ Sbjct: 246 DMKTRYFDAPDLKHVGKNGYRFINAVSDFATHARPLRESANHKENLF--AKTVEGNALID 303 Query: 259 KALWVI 264 +A ++ Sbjct: 304 RAFAML 309 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 59/115 (51%), Positives = 82/115 (71%), Gaps = 1/115 (0%) Query: 156 LGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQN 215 + FD V + E M+ + L Q +AAL R+ +E P+T EQ++ PRRWEDK++ Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEE-HQPITEEQVLQPRRWEDKKD 59 Query: 216 DLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQFRK 270 DLWT +QR+QEN+IKGGLSGR+A GK RTR++ GIDGDI++NKALWV+ E+ + Sbjct: 60 DLWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKMYE 114 >UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXR9_FLAPJ Length = 285 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 69/191 (36%), Gaps = 41/191 (21%) Query: 94 GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAY 153 + P + NS+DGS G FR VC+NGL + + H+G+I V+ Sbjct: 110 DKIRPMLRFTNSYDGSCKTSGTFGFFREVCSNGLHTASTDIGFSLKHRGNINELVLPAIG 169 Query: 154 EVLGVF--------DKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTP------- 198 + + F + + + K + QH+ + + ++E +K P Sbjct: 170 KTIYNFLDNEFYELRRKFEVLADFKIADPSEIVQHI-AQQTKLFKFESSDKNPAPSLNAR 228 Query: 199 -----VTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDG 253 + E +I ED ++W + E ++ G + D Sbjct: 229 LVIETIENETLIL---KED--ANMWMVYNAFNE-LLHGKIK--------------KTFDQ 268 Query: 254 DIRINKALWVI 264 +I+K ++ + Sbjct: 269 QKKIDKEIFNL 279 >UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZS75_DESOH Length = 318 Score = 117 bits (294), Expect = 3e-25, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 73/224 (32%), Gaps = 29/224 (12%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRL----RREGHING-- 94 +ERY + ++++++L GF P Q + D ++R+ R G G Sbjct: 108 TERYKPLDNMDVLSQLLRHGFDPDTQVQYAIDDG------MFLVRIPEYARAFGVNPGYG 161 Query: 95 ---QEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDI-VGQVIE 150 + VP + NS G ++ + +R VCTNGL+ + R H + + E Sbjct: 162 KLDEIVPGVSFANSEVGLLAFSIEAFFYRLVCTNGLISKTSSTFSRFKHISNRGLENFPE 221 Query: 151 GAYEVLGVFDKVTD--NMEAMKEIHLNSDEQHLFGRAALMVRYE-DENKTPVTPEQIITP 207 V+ + + + + F R + +T V + + Sbjct: 222 TIAGVIEDSVRKQEQFKLSRQSPVENPIRSIETFARQ-----FGLAHLETEVVCKAYLL- 275 Query: 208 RRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGI 251 ++ ++ L + + I + Sbjct: 276 ----EQGATMFHIINAFTRAAQDKHLDTLQSYRLESAGGQILSL 315 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 87/239 (36%), Gaps = 24/239 (10%) Query: 42 ERYTYIPTINIINKL--------------RDEGFQPFFACQSRVRDLGRREYSKHMLRLR 87 RYT + + + + D F + + + + HM+++ Sbjct: 76 SRYTLLKNSDAFDSVNAAVNTLAENGVLNMDGAFIKDAVVNKGGKVIRQYFFPAHMVQIG 135 Query: 88 REGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIV-G 146 + + ++++NS+DGS ++Q+ G FR VCTNG++ G F + V H G + G Sbjct: 136 SGDKV----ILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDVRHTGTMNFG 191 Query: 147 QVIEGAYEVLGVFDKVTDNMEAMKEIHLNS-DEQHLFGRAALMVRYEDENKTPVTPEQII 205 QV + F+ + + + LN D + + + R + N + Sbjct: 192 QVTRQVTTAVSSFENMGQYWDTLINSPLNRKDADKIITDMSTVGR--ELNMNKFDMFDRL 249 Query: 206 TPRRWEDKQNDLWTTWQRVQENMIKGGLSGRS-ASGKNTRTRAITGIDGDIRINKALWV 263 + + W + + ++ + ++ N R I + A+W Sbjct: 250 YTDHKKTLGENHWAMYNSLTAWATHYKVNESNISNIDNVRLEREKSI-QHLMRKPAIWN 307 >UniRef50_B9E574 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B9E574_CLOK1 Length = 325 Score = 97.9 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 86/240 (35%), Gaps = 30/240 (12%) Query: 41 SERYTYIPTINII---NKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV 97 ++RY + + L EG + GR+ + + +L + I EV Sbjct: 88 TDRYKIVQNKEAFSFTDSLIGEG---CKYETAGSLQNGRKVWL--LAKLPDKYKILDDEV 142 Query: 98 -PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE--IRVPHKGDIVGQVIEGAYE 154 P ++ NSHDG+ + ++ R VC N L + + H G+I ++ E Sbjct: 143 TPYMVFSNSHDGTGAIKVAMTPIRVVCNNTLNLALSNAKRIWSTIHTGNISSKLNEAMKT 202 Query: 155 VL--GVFDKVTDNMEA-MKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPR--- 208 +L + + D + ++ ++ F L + +N + + I R Sbjct: 203 LLLAESYMENLDYEAHYLSRKTISDEKVLEFIELLLPLP---DNASKTQEKNINLLRDDM 259 Query: 209 --RWEDKQN--DL----WTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKA 260 R+ D + DL W V + ++ + IDG+ I++A Sbjct: 260 KLRYFDAPDLIDLPKTSWRFVNAVSDFAT--HINPLRKTKNYKENLFSKTIDGNPLIDRA 317 >UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q5L2_CATAD Length = 329 Score = 81.7 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 58/177 (32%), Gaps = 31/177 (17%) Query: 44 YTYIPTIN-------IINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQE 96 YT + +++ + GR+ + +RL + G + Sbjct: 96 YTPVQNEENCQIMNTLVDASGAH------FETAGSLREGRQVFVT--MRLPETMTVAGTD 147 Query: 97 VPEIIL--LNSHDGSSSYQMIPGIFRFVCTN--GLVCGNNFGEIRVPHKGDIVGQVIEG- 151 ++ + LNSHDG+ +Y++I R VC N L + H ++ E Sbjct: 148 RLDLYISGLNSHDGTGAYKLIVTPIRIVCANTQSLALDRARSSFSIRHTESAKKKIAEAR 207 Query: 152 -----AYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQ 203 ++ + F+K E M L F + + + N P T Sbjct: 208 KALGLMFKYVEEFEK---AAERMINETLT---LAEFEKVCHELWPLEPNAGPRTKSN 258 >UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKH6_9FLAO Length = 312 Score = 80.6 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 61/187 (32%), Gaps = 31/187 (16%) Query: 98 PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLG 157 P + NS+DGS G +R VC+NGL E + H + ++ + Sbjct: 142 PMLRFKNSYDGSEKTSGHFGFYREVCSNGLHVSLAEIEFSIKHSKNNTHLIMPRLNNLFD 201 Query: 158 VFD--------KVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKT--PVTP-EQIIT 206 F K D M+ K I + + RYE +K P ++I Sbjct: 202 KFLDNEFYTITKKFDKMKEFKIID-TQEFVKAILDRTKLFRYECSDKNSDPSKKSREVIE 260 Query: 207 PRRWE----DKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKALW 262 +E +++ +LW + +++ L R++K L+ Sbjct: 261 ILNYEALLLNEEPNLWLGYNAFN-SVLHNVLK--------------KSFGQQERLDKKLF 305 Query: 263 VIAEQFR 269 Sbjct: 306 DEVYAMA 312 >UniRef50_Q024R3 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q024R3_SOLUE Length = 237 Score = 77.9 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 64/201 (31%), Gaps = 15/201 (7%) Query: 33 SGDKHESRSERYTYIPTINIINKL---RDEGFQPFFACQSRVRDLGRREYSKHMLRLRRE 89 + + + +P + ++ L + V G + + L Sbjct: 19 ADVPTPLGTATHRPVPHVEVVEALVETLSFRHIGVVTEEYAVSKDGMKMFGVLDL----- 73 Query: 90 GHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGD--IVGQ 147 I + NSHD S + G+ VC N G+ F + H + + Sbjct: 74 DTGMPGCRFSIGIRNSHDRSMRLAAVVGVRVLVCENMAFSGD-FQPVLAKHSKNFSLQNA 132 Query: 148 VIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALM---VRYEDENKTPVTPEQI 204 + G ++ FD + ++A +E L+ + A + + PV + Sbjct: 133 LSIGVDQMQRNFDGMRKQVDAWRESQLSDTVAKMIIYRAFIESDLEVPKHLARPVH-DLY 191 Query: 205 ITPRRWEDKQNDLWTTWQRVQ 225 +P+ E + +W+ Sbjct: 192 FSPKHEEFQPRTMWSLSNAFT 212 >UniRef50_UPI00017465AE hypothetical protein VspiD_04485 n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465AE Length = 256 Score = 75.2 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 60/205 (29%), Gaps = 12/205 (5%) Query: 37 HESRSERYTYIPT---INIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHIN 93 + + IP I + K + G R + + + Sbjct: 35 TPRSTSSWCPIPHNRLIETVQKTLKSTNLRIGTQAHSLSHKGHRYFGLMEILGPKNDD-- 92 Query: 94 GQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVI---- 149 + L NSHD + ++ G FVC N G + H IV + Sbjct: 93 -DYCWVLGLRNSHDKTFPAGIVAGASVFVCDNLSFSGE--VKFARKHTRFIVRDLPGITE 149 Query: 150 EGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRR 209 +++ + + A KE + H A V P ++ PR Sbjct: 150 RAIGQLMSKWHHQDKRIGAYKEADIEDSIAHDLIIRATDVGVCSNRLIPSVLKEWREPRY 209 Query: 210 WEDKQNDLWTTWQRVQENMIKGGLS 234 + +W+ + E + G LS Sbjct: 210 QVFEDRSVWSLFNAFTEALKDGSLS 234 >UniRef50_A1SIX8 Putative uncharacterized protein n=2 Tax=Nocardioides sp. JS614 RepID=A1SIX8_NOCSJ Length = 334 Score = 74.0 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 43/239 (17%), Positives = 75/239 (31%), Gaps = 23/239 (9%) Query: 44 YTYIPTINIIN--KLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEII 101 YT + + L E F +R GR+ + ++L + G + ++ Sbjct: 95 YTPLQNEDHAEFLNLLAEESGAIFDTAGSLR-GGRQVFIT--MQLPDSLTVGGTDRVDLN 151 Query: 102 L--LNSHDGSSSYQMIPGIFRFVCTNGLVCGNNF--GEIRVPHKGDIVGQVIEGAYEVLG 157 + LNSHDGSS+++++ R VC N + H + V + Sbjct: 152 IAALNSHDGSSAFRILVTPVRVVCANTQSAALRNHESSFSIRHTRNAKAAVQAARDALGL 211 Query: 158 VF---DKVTDNMEAMKEIHLNSDEQHLFGRAALM-VRYEDENKTPVTPEQIITPRRWEDK 213 F D E + + + A + T + + W Sbjct: 212 TFTYVDAFQVEAERLIQQTMTDAAFDALIDATFGKAEANGTKRVRETERRRRSRLHWLFA 271 Query: 214 QNDL--------WTTWQRVQENMIK-GGLSGRSASGKNTRTRAITGIDGDIRINKALWV 263 D W +Q V E + + + TR +T D D RI + W Sbjct: 272 DADTQAGIRATAWAGYQAVAEYVDHYAPVRTKGDEHAARATRVLTSDDPD-RIKRRAWT 329 >UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF46A9 Length = 348 Score = 72.5 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 78/259 (30%), Gaps = 41/259 (15%) Query: 42 ERYTYIPTI---NIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV- 97 +Y + ++++ L D+ F +R GR + ++L +G++ Sbjct: 99 SKYEPLQNEASCDLLDALVDQSGGAHFETAGALR-GGRETFVT--MKLPSSMVFDGKDGS 155 Query: 98 -----PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG--NNFGEIRVPHKGDIVGQVIE 150 + LNSHDGS++++ + R VC N + + H G + E Sbjct: 156 KDRTDFYLAALNSHDGSAAFRFLLSPIRIVCANTQSAAIRSAKSSFSIRHTGGARASIAE 215 Query: 151 GAYEVLGVFDKV----------------TDNMEAMKEIHLNSDEQHLFGRAALMVRYEDE 194 + + + T+ M + L D + Sbjct: 216 ARNALKLSWRYIEAFEAEAAALYAAPMDTEEMRSFANTLLEVDSAGTTATRRHRRERANS 275 Query: 195 NKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENM-----IKGGLSGRSASGKNTRTRAIT 249 T + I P W + V E + ++G + AS R IT Sbjct: 276 IVKLWTSSETIAP-----IAGTRWAAYNAVTEYLDHVVPVRGAKTATDASAAR-ALRNIT 329 Query: 250 GIDGDIRINKALWVIAEQF 268 + + + + Sbjct: 330 TAASGQSLKAQAFRMLQTL 348 >UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=Actinomycetales RepID=C4DCZ5_9ACTO Length = 395 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 22/187 (11%) Query: 21 DDELMQFVPSVF------SGDKHESRSERYTYIPTINIINKLRD--EGFQPFFACQSRVR 72 DD+L P F + +Y + LR+ E + + VR Sbjct: 125 DDQLHTH-PDKFHTLRSDTAAPLGVVGSKYHTVQNRECFEFLRNLVESYDVVWESAGAVR 183 Query: 73 DLGRREYSKHMLRLRRE-----GHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNG- 126 GRR + +RL IN P +++ NSHDGSSS + +R VC N Sbjct: 184 -GGRRTFVS--MRLPDTVTVDAAGINDTITPFVVVFNSHDGSSSITAVVTPYRPVCANTE 240 Query: 127 -LVCGNNFGEIRVPHKGDIVGQVIEGAYEV---LGVFDKVTDNMEAMKEIHLNSDEQHLF 182 L N + + H + Q+ + + + +D+ + + DE Sbjct: 241 RLALDNAYTSWSIRHTESAMHQMRQARRTLKMSVKYYDEFAAQQTTLAHHDMVIDEFRAL 300 Query: 183 GRAALMV 189 + Sbjct: 301 IDELWPL 307 >UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=Cyanobacteria RepID=B4VVD2_9CYAN Length = 336 Score = 71.7 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 66/206 (32%), Gaps = 30/206 (14%) Query: 44 YTYIPTINII---NKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV-PE 99 YT + + L G + G+R + L I+G V P Sbjct: 79 YTPLQNEEAFRWFDPLLSRG--GVQLEAAGSLKGGKRIWILAKLINTEAEIISGDIVRPY 136 Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNF--------GEIRVPHKGDIVGQVIEG 151 ++L NSHDGS++ + R VC N L F I +PH + Q +E Sbjct: 137 LLLHNSHDGSTAVWLQFTPVRVVCWNTLNGAARFRFGDLWQKKAICIPHSLSLTEQ-LEH 195 Query: 152 AYEVLG----VFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQI--I 205 + +L F + +AM L ++ + L P + Sbjct: 196 IHNILDLTQKEFQYSVEEYQAMAHKELTTELLADYIGRVLG------TTQPTLHPAWSQL 249 Query: 206 TPRRWEDKQN---DLWTTWQRVQENM 228 + N LW + + E + Sbjct: 250 VANFESGRGNQGQTLWDAYNSITEWL 275 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 70.5 bits (171), Expect = 5e-11, Method: Composition-based stats. Identities = 25/54 (46%), Positives = 40/54 (74%), Gaps = 1/54 (1%) Query: 1 MRLASRFGRYN-SIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINII 53 M+LASRF ++ ++ + PL+DD++ + PS+F+ HESRSERY+YIPT ++ Sbjct: 34 MQLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_Q5LU35 Putative uncharacterized protein n=1 Tax=Ruegeria pomeroyi RepID=Q5LU35_SILPO Length = 275 Score = 70.2 bits (170), Expect = 7e-11, Method: Composition-based stats. Identities = 54/268 (20%), Positives = 87/268 (32%), Gaps = 39/268 (14%) Query: 15 RERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLR-DEGFQ--PFFACQSRV 71 PL D L Q + + + IP +++ +R GF V Sbjct: 35 GASPLDYDGLRQL--------ETPEATSTHVPIPHHRLVDVVRLTLGFYGHTVEEEHHGV 86 Query: 72 RDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGN 131 G R + +L LR G + L NSHD + + G FVC N + Sbjct: 87 TPDGMRYFG--VLSLRSTY---GDYTDTVGLRNSHDKTFPIGISFGSRVFVCDNLAFIAD 141 Query: 132 NFGEIRVPHKG----DIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAAL 187 + +R H D+ G V + + + + + +L+ A+ Sbjct: 142 HV--VRRKHTAQAKRDLPGLVGDLIEPLADQREAQHRVISRYRAANLS----QSLVDHAV 195 Query: 188 MVRYEDENKTPVTPEQIITPRRWEDKQND-----LWTTWQRVQENMIKGGLSGRSASGKN 242 + Y E T ++ RWE+ +D W + V L GR A Sbjct: 196 LELYRAEVITVTRIAAVME--RWENPPHDWGVKTAWRLFNCVT-----HALEGRIAEQPA 248 Query: 243 TRTRAITGIDGDIRINKALWVIAEQFRK 270 +R ID +N V AE + Sbjct: 249 LTSRLHDVIDA-TCLNGNATVSAELAAQ 275 >UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPG4_MYCSK Length = 344 Score = 69.4 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 76/254 (29%), Gaps = 35/254 (13%) Query: 42 ERYTYIPTI---NIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHING---- 94 +Y + ++++ L E + GR + +RL +G Sbjct: 99 NKYEPMQNEASCDLLDALTGE--SGAVYETAGALRGGRETFVT--MRLPESMVFDGIDGT 154 Query: 95 --QEVPEIILLNSHDGSSSYQMIPGIFRFVCTN--GLVCGNNFGEIRVPHKGDIVGQVIE 150 + + LNSHDGSS ++ + R VC N + H G + E Sbjct: 155 KDRTDFYLAALNSHDGSSKFRFLVTPVRIVCANTQSAAIARAAASFGISHTGGAAVALQE 214 Query: 151 GA------YEVLGVFDKVTDNMEAMKEIHLNSDEQHLF---------GRAALMVRYEDEN 195 + + F+ A+ ++ D+ F + R + Sbjct: 215 ARRALKLSWRYVEAFE---QEAAALYAAPMDLDQMRRFAGELVDVDGAESKTTARNRRDT 271 Query: 196 KTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRT-RAITGIDGD 254 + + +P W + V E + + ++ R RA+TG Sbjct: 272 ANAIVKLWVSSPTVAPIAGT-RWAAYNAVTEYVDHYSKVRAAGDPQSVRALRAVTGGSTA 330 Query: 255 IRINKALWVIAEQF 268 + + + + Sbjct: 331 QTLKTNAFRMLQTL 344 >UniRef50_B8F9V3 Putative uncharacterized protein n=4 Tax=Deltaproteobacteria RepID=B8F9V3_DESAA Length = 311 Score = 66.7 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 70/220 (31%), Gaps = 17/220 (7%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLG-RREYSKHMLRLRREGHINGQEVPE 99 + RY + I ++ +L GF Q + H + P Sbjct: 105 TPRYQPVDNIRVMERLEQMGFGHDMEIQLALDAEFFSLSIPDHEKTFAVGND--DKLTPG 162 Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKG-DIVGQVIEGAYEVLGV 158 I + NS G ++ + + R VCTNGL+ H ++ E +V G Sbjct: 163 ITVCNSEVGRAALSIAAFVLRLVCTNGLIAKTAVSA-SYRHISAKVMEVFPETLQQVAGE 221 Query: 159 FD--KVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRWEDKQND 216 D + + ++ S+ H F R ++ E + P+++ P Sbjct: 222 LDVQQTRFRLSMESQVENPSNTIHSFNRQFMLAEPEVQAVDWAYPQEMELPA-------- 273 Query: 217 LWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIR 256 T + V G A + R I G ++ Sbjct: 274 --TMFNVVNTYTKASQAPGLPAESCHRLGRVGGAILGMVK 311 >UniRef50_Q0RM54 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RM54_FRAAA Length = 360 Score = 64.4 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 55/171 (32%), Gaps = 14/171 (8%) Query: 39 SRSERYTYIPTINIINKLRDE-GFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQE- 96 + +T I + + G + V + GR ++ +++L + G Sbjct: 116 HPRDTWTLIDHAEMGEIVEAFLGMENVQYETGGVLEKGRAVWA--LIKLDEPIALPGDNS 173 Query: 97 --VPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE-----IRVPHKG---DIVG 146 +P +L N HDG+ S + R VC N E H D + Sbjct: 174 LTLPYFLLRNRHDGNGSCSVSHTPVRVVCANTWKVSEMTDEANGTVFSFRHNEKWRDRLE 233 Query: 147 QVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKT 197 + + V F + E + ++ + +Q +F + + Sbjct: 234 EAKQAIKGVRKQFTLYQEIAERLLDMTVTEKQQQMFVNDFIPTPTGATSDR 284 >UniRef50_B4CXI2 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CXI2_9BACT Length = 320 Score = 64.0 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 43/257 (16%), Positives = 77/257 (29%), Gaps = 38/257 (14%) Query: 41 SERYTYIPTINIINKLRDEGFQPF------FACQSRVRDLGRREYSKHMLRLRREGHIN- 93 S RY + F P + + G R + M R+ + Sbjct: 76 SRRYRPLQNSEAFKF-----FDPIVGDRKAYFETAGALGEGERIWV--MARMPEVMEVVR 128 Query: 94 GQEVP-EIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE--IRVPHKGDIVGQVIE 150 G + ++L N+H+G S + R VC N L+ G+ RV H + ++ E Sbjct: 129 GDDCFKYLLLSNTHNGEGSVIVKFTTVRVVCQNTLMLAMEDGQKAYRVRHSKQMQFKLDE 188 Query: 151 GAYEVL---GVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMV------------RYEDEN 195 A + VF + + + + S+ + A R+ Sbjct: 189 LADFLAITQQVFQEAEQTFRRLAAVKMTSERLEQYFDAVFPRTDVQKKRHEKPPRWGFLQ 248 Query: 196 KTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDI 255 + + + P Q LW + + + R G D Sbjct: 249 EMFDSQPDLQLP----GVQGTLWGAYNAIT-RFEDYKEPKQDELPDQRLERTWFGAGADN 303 Query: 256 RINKALWVIAEQFRKWK 272 ++ AL E +WK Sbjct: 304 KLT-ALVKADELAVRWK 319 >UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax=Bacteroidetes RepID=C6W397_DYAFD Length = 350 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 44 YTYIPTINII---NKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPE- 99 Y + + + + G + G R + +L + ++ E Sbjct: 95 YQIVQNRDAFTFFDSIV--GNDGILYETAGALGKGERIFIT--AKLPGYIQVGSNDLIEK 150 Query: 100 -IILLNSHDGSSSYQMIPGIFRFVCTNGLVCG--NNFGEIRVPHKGDIVGQVIEGAYEVL 156 + L SHDGS S R VC N L N +++ H + V + + A++V+ Sbjct: 151 YLFLTTSHDGSGSITAAFTPVRIVCANTLNAAMKNITNVVKIRHTSNAVER-LRTAHKVM 209 Query: 157 GVFDK----VTDNMEAMKEIHLNSDEQHLFGRAALM 188 G+ +K V + + + + A+ Sbjct: 210 GIANKFSHEVEEIFNHWAKKPITDPQLKKLIEIAMA 245 >UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZYJ5_DESOH Length = 308 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 50/149 (33%), Gaps = 8/149 (5%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV-PE 99 + +YT + I+ +L G+ P Q + S + R+ ING P Sbjct: 105 TPKYTPVDNFEILERLDSLGYGPDTKVQCSLDAE---FLSLSIPDGRKAFDINGDRFKPG 161 Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGD-IVGQVIEGAYEVLGV 158 I + NS G +S + + R VCTNGL+ H I+ + + V Sbjct: 162 ISISNSEVGLASLTISAFVLRLVCTNGLI-ARTGISASYRHVSTRILKEFPQTIETVSKE 220 Query: 159 --FDKVTDNMEAMKEIHLNSDEQHLFGRA 185 + + + F R Sbjct: 221 LGAQQRQFRISMEAPVDNPMQTMDSFNRQ 249 >UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WVT0_9SYNE Length = 352 Score = 58.2 bits (139), Expect = 3e-07, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 73/205 (35%), Gaps = 11/205 (5%) Query: 34 GDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHML-RLRREGHI 92 D S RY + ++ + + + A R Y K + R++ + + Sbjct: 104 TDARAFLSRRYRRLDNFDLADAVLPTLLEMQGARVVSCELTETRMYLKVVTDRIQADVKV 163 Query: 93 NGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGA 152 + + NS G S ++ P I+R VCTNG+V + R H G + Sbjct: 164 GDAVQAGVCISNSEIGMGSLRVEPLIYRLVCTNGMVSPDRSARNRFTHLGRAAADTPDAY 223 Query: 153 YEVLGVFDKVTDNMEAMKEIHLNSD--EQHLFGRAALMVRYEDENK---TPVTPEQIITP 207 + + +K L D ++ F +R E + PV +++T Sbjct: 224 ELFSDKTLEADNTAFFLKVQDLVRDAVDRTKFEHLVAQMRDTTERRIEGNPVKTVEVLTN 283 Query: 208 RRWEDKQNDLWTTWQRVQENMIKGG 232 + + V +++I+GG Sbjct: 284 KFKLQQNES-----SGVLQHLIRGG 303 >UniRef50_A1WP45 Putative uncharacterized protein n=2 Tax=Comamonadaceae RepID=A1WP45_VEREI Length = 312 Score = 57.8 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 77/259 (29%), Gaps = 34/259 (13%) Query: 36 KHESRSERYTYIPTINIINKLR----DEGFQPFFACQSRVRDLGRREYSKHMLRLRREGH 91 S RY + ++ R EGF GRR ++ + R E Sbjct: 64 PLSVVSPRYKIVQPKKMLEFYRSLVEREGFAI---ETIGSLKGGRRIWA--LARTHIEND 118 Query: 92 INGQEVP--EIILLNSHDGSSSYQMIPGIFRFVCTNG--LVCGNNFGEIRVPHKGDIVGQ 147 + G + ++L+ S DGS + R VC N + + +++V H Sbjct: 119 VLGSDRLKAYVLLITSCDGSLATTAKFTCVRVVCWNTQAIALNESGKQVKVRHNTAFNPD 178 Query: 148 VIEGAYEVLG--VFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYED-------ENKTP 198 ++G ++G FD M ++ + L + L ++ + Sbjct: 179 AVKGEMGLMGAKAFDAFLGKMRSLTRVKLTEPDAQGIVACLLASPMDERKGVEHKGIEQT 238 Query: 199 VTPEQIITPRRWEDKQNDL-------WTTWQRVQENMIKGGLSGRSASGKNTRTRAITGI 251 ++I+ + L W V E R+ + Sbjct: 239 KGFQKIMALFNGAAQGAHLPGVQGTAWGLLNAVTEYA-DHHARARNPENRLYS----QWF 293 Query: 252 DGDIRINKALWVIAEQFRK 270 + + +A + + Sbjct: 294 GVNANLKEAAQSLLCEMAD 312 >UniRef50_UPI00016C3597 hypothetical protein GobsU_16407 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3597 Length = 235 Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 61/216 (28%), Gaps = 25/216 (11%) Query: 69 SRVRDLGRREYSKHMLRLRREGHINGQEVP-EIILLNSHDGSSSYQMIPGIFRFVCTNGL 127 + G+R + + ++G V +L N+HD S + + R VC N L Sbjct: 24 AGSLKEGKRIWVLARINGAEAEVVDGDPVRGYFLLSNAHDASQAVRAQFTSIRVVCANTL 83 Query: 128 VCGNNFGE------IRVPHKGDIVG---QVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDE 178 + E +RV H + V F + + M L D Sbjct: 84 NAADRRAERGFEDCVRVRHTTGLETSLVLVQHTIDMAAKTFSASLADYQRMVSRRLPVDG 143 Query: 179 QHLFGRAALMVRYEDENKTPVTPEQIIT---------PRRWEDKQNDLWTTWQRVQENMI 229 + L V E + P+ T R W + + + + Sbjct: 144 FRKYVIDVLEVP-ESVQRMGKMPKAWDTLQWAYHAAPGARINGVFGTYWGAYNAITDWV- 201 Query: 230 KGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIA 265 G + + R+ + + +A Sbjct: 202 -DHTRGVKDADSRLDS---AWFGSGARLKQRAFELA 233 >UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betaproteobacteria RepID=Q47CX4_DECAR Length = 354 Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 52/148 (35%), Gaps = 6/148 (4%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHML-RLRREGHINGQEVPE 99 S+RY + ++ + Q V + Y K + RL+ E Sbjct: 115 SDRYRRLDNFDLAESVLPILQQLPEVRFESVELTETKMYLKCITPRLKYEMAPGDVVQAG 174 Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVF 159 +++ NS G + + P +FR VC+NGL+ + +R H G +G E Sbjct: 175 VVISNSEVGQGTLSVQPLLFRLVCSNGLIVPDR--SLRKMHVGRALGGEDERIQVYQDDT 232 Query: 160 DKVTDNMEAMKEIHLNSDEQHLFGRAAL 187 + D +K + Q A Sbjct: 233 LRADDKAFFLK---VRDVVQAAVSDATF 257 >UniRef50_C4ZMQ9 Phage/plasmid-related protein TIGR03299 n=1 Tax=Thauera sp. MZ1T RepID=C4ZMQ9_THASP Length = 334 Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 71/210 (33%), Gaps = 30/210 (14%) Query: 50 INIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPE--IILLNSHD 107 + + L +G +P G + + RL + + ++V E ++ NSHD Sbjct: 93 AEMFDALLGQG-RPI-YHTGGYLKNGEVVWL--LARLPGDIQVQEKDVIETYLLFSNSHD 148 Query: 108 GSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGV--------- 158 GSS+ + R VC N L + + G + + +G Y VL Sbjct: 149 GSSAIDIRLTTVRVVCQNTLSLALDNTSV-----GKVFRRAHDGRYRVLKEEARAFFEFS 203 Query: 159 ---FDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTP-EQIITPRRWEDKQ 214 ++ + + F L K PVT + + R WE + Sbjct: 204 VKRSEEAQALFGRLANAECDDRAFEDFLAQLLPDP-----KRPVTAGQNLQVQRAWETRL 258 Query: 215 NDLWTTWQRVQENMIKGGLSGRSASGKNTR 244 ++ T +V + + G+ + + Sbjct: 259 ANVRATRAQVM-GVRREGIPAQCVPPEGKT 287 >UniRef50_C6RKU8 Phage/plasmid-related protein n=12 Tax=Acinetobacter RepID=C6RKU8_ACIRA Length = 347 Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 71/179 (39%), Gaps = 14/179 (7%) Query: 33 SGDKHESRSERYTYIPTINIINKLRDEGFQP-FFACQSRVRDLGRREYSKHMLRLRREGH 91 + S+RY + I+ RD Q F + V G++ ++ + R + Sbjct: 74 THAPLSVVSQRYQEVQPKQILEFYRDLTEQSGFELETAGVLKGGKKFWA--LARTGQSAA 131 Query: 92 INGQEV--PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG----NNFGEIRVPHKGDI- 144 + G++V I+L + DG+ + R VC N L ++ G ++VPH Sbjct: 132 LKGKDVSNAYILLATACDGTLATTAQFTSIRVVCNNTLAIALKGQSSAGVVKVPHSTRFD 191 Query: 145 VGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQ 203 G++ + + +D+ M+ + + + E + A + + +P+ ++ Sbjct: 192 AGKIKQQLGISVRQWDEHMYEMKQLSQRKVTQTEAAAYFDAV----FNNTGLSPIEQDE 246 >UniRef50_C5CKG6 Phage/plasmid-related protein TIGR03299 n=10 Tax=Proteobacteria RepID=C5CKG6_VARPS Length = 342 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 75/244 (30%), Gaps = 44/244 (18%) Query: 33 SGDKHESRSERYTYIPTINIINKLRD-EGFQPFFACQSRVRDLGRREYSKHMLRLRREGH 91 + S RY + ++ RD + + V GR+ ++ + R ++ Sbjct: 83 TRAPLSVVSSRYQVVQPREVLEFYRDLTEIGGYEMETAGVLKGGRKVWA--LARTGQQAV 140 Query: 92 INGQEVP--EIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGE--IRVPHK------ 141 + G ++ ++L S DG+ + + P R VC+N L + IRVPH Sbjct: 141 LKGNDIVNGYLLLATSCDGTLATSVTPTTVRVVCSNTLAVALDATSNVIRVPHSTSFDPD 200 Query: 142 ------GDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIH---------LNSDEQHLFGRAA 186 G +GQ E Y + + + EA++ I +D+ Sbjct: 201 AVKRQLGIAIGQWDEFMYRMKTLSQRKVKTKEALQYIERVLYGPSELNPADDVSTQAAQT 260 Query: 187 LMVRYEDENKTPVTPEQIITPRRWEDKQN---------DLWTTWQRVQENMIKGGLSGRS 237 P +E + W + E + R+ Sbjct: 261 EASP------APRGWAARKVLELYEGRGRGAELAAAKGTTWGLLSAMTE-FVDHERRARN 313 Query: 238 ASGK 241 + Sbjct: 314 REYR 317 >UniRef50_A8ZKZ6 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=A8ZKZ6_ACAM1 Length = 351 Score = 54.4 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 80/238 (33%), Gaps = 40/238 (16%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQS------RVRDLGRREYSKHMLRLRREGHING 94 S+RY + I + P A V R Y K + + G Sbjct: 111 SDRYRRVDNFEIAETVL-----PVLAEFGQGLKIMSVGLTDSRLYIKAVNERVQLDVRKG 165 Query: 95 QEV-PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAY 153 V +++ NS G S ++ P ++R VC NGL+ ++ + H G VG+ Sbjct: 166 DAVQAGVVISNSEIGLGSIRIEPLVYRLVCLNGLISQDH--SFKKYHVGRQVGESDAAVE 223 Query: 154 EVLGVFDKVTDNMEAMKEIHL--NSDEQHLFGRAALMVRYEDENK---TPVTPEQIITPR 208 + D +K + + + F + +R E K PVT +++ Sbjct: 224 LFSDETREADDRALLLKVRDMVCGAADIAKFTQIVEQMRDATERKIEGNPVTAVEVL--- 280 Query: 209 RWEDKQNDLWTTWQRVQE-------NMIKGG--LSGRSASGKNTRTRAITGIDGDIRI 257 + L QE ++I+GG + + ++ + D + Sbjct: 281 -----GDKL----NVNQEERSGILTHLIQGGDLTAYGMMNAVTRTSQDVESYDRATEL 329 >UniRef50_Q19YQ9 Gp96 n=7 Tax=unclassified Siphoviridae RepID=Q19YQ9_9CAUD Length = 400 Score = 53.6 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 72/211 (34%), Gaps = 22/211 (10%) Query: 67 CQSRVRDLGRREYSK-----HMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 D GRR + HM + + + N HDG S R Sbjct: 175 ETGGSLDGGRRTFVTMKMPDHMELVSPITGKRDVTDLYLSIFNHHDGGGSLVANISPVRV 234 Query: 122 VCTNG--LVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVF----DKVTDNMEAMKEIHLN 175 VC N + + + H G+ + +E +LG+ D +E M +I ++ Sbjct: 235 VCANTQRMAERAAVSRVSIRHTGEAQVR-LEEVRRILGLTWKYQDTYVAEVEEMAKIEMS 293 Query: 176 SDEQHLFGRAAL-MVRYEDENKTPVTPEQIITPRRW--------EDKQNDLWTTWQRVQE 226 + E R+ + + + E+++ Q+ T +D + + + V E Sbjct: 294 NVETFAIMRSVFEVDKVDPESRSASQRTQMATEAFEIYRSSATVDDFRGVAFGGYNAVTE 353 Query: 227 NMIKG-GLSGRSASGKNTRTRAITGIDGDIR 256 + + G+ R I G G+I+ Sbjct: 354 WVDHYMPVRGKDNVDVKRALRTINGGGGEIK 384 >UniRef50_B7I5L8 Phage/plasmid-related protein n=5 Tax=Moraxellaceae RepID=B7I5L8_ACIB5 Length = 342 Score = 53.2 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 63/164 (38%), Gaps = 11/164 (6%) Query: 33 SGDKHESRSERYTYIPTINIINKLRDEGFQP-FFACQSRVRDLGRREYSKHMLRLRREGH 91 + S+RY + I+ RD Q F + V GR+ ++ + R + Sbjct: 67 THAPLSVVSQRYQEVQPKEILEFYRDLTEQSGFELETAGVLKGGRKFWA--LARTGQSAA 124 Query: 92 INGQEVP--EIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG-----NNFGEIRVPH-KGD 143 + ++V I+L + DG+ + R VC+N L ++ G ++VPH Sbjct: 125 LKSKDVSNGYILLATACDGTLATTAQFTSIRVVCSNTLAIALRGQNSSVGVVKVPHSTKF 184 Query: 144 IVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAAL 187 ++ + + +D+ M+ + + + E + A Sbjct: 185 DAEKIKQQLGISVRAWDEHMYEMKQLSQRKVTQQEAAAYFDAVF 228 >UniRef50_C0VFU1 Putative uncharacterized protein n=4 Tax=Acinetobacter RepID=C0VFU1_9GAMM Length = 357 Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 68/183 (37%), Gaps = 18/183 (9%) Query: 33 SGDKHESRSERYTYIPTINIINKLRDEGFQP-FFACQSRVRDLGRREYSKHMLRLRREGH 91 + + S+R+ + I+ RD Q F + V G++ ++ + + + Sbjct: 74 THEPLSVVSQRFQEVQPKEILEFYRDLTEQSGFELETAGVLKGGKKFWA--LAKTGQTSA 131 Query: 92 INGQEVP--EIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG--------NNFGEIRVPHK 141 + G++V I+L + DG+ + R VC N L NN G ++VPH Sbjct: 132 LKGKDVSNGYILLATACDGTLATTAQFTSIRVVCNNTLAIALKAQNAGSNNTGVVKVPHS 191 Query: 142 GDI-VGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVT 200 +V + +D+ M+ + + + E + A + + N + Sbjct: 192 TRFDAEKVKHQLGISVRAWDEHMYEMKQLSQRKVTQQEAAAYFDAV----FNNSNLSVAD 247 Query: 201 PEQ 203 E Sbjct: 248 QED 250 >UniRef50_C8X3A3 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X3A3_DESRD Length = 243 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 72/253 (28%), Gaps = 34/253 (13%) Query: 15 RERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIIN----KLRDEGFQPFFACQSR 70 R +T+ E+ + + +P +I+ + +G Sbjct: 2 RANQVTEAEVRAV--------PVVPGTATWNPVPHNQVIDTVETAISRQGLGIVRKRFEL 53 Query: 71 VRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG 130 +D G ++ + R EI NS + + G F VC+N + G Sbjct: 54 TQD-GANVFASY-----RLDQSRNGSSWEIGFRNSVAKKFAVGITAGTFTIVCSNLVFTG 107 Query: 131 NNFGEIRVPHKGDIVGQVIEGAYE-----VLGVFDKVTDNMEAMKEIHLNSDEQHLFG-R 184 + F E R H + + + + E +K L + Sbjct: 108 D-FLEFR-RHTKGLDLDELRAIANRALLGTISRLQSLEQWQEGLKAKPLPRRDMQCLTYE 165 Query: 185 AALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTR 244 A + + R L++ + + + +S + + R Sbjct: 166 ALQRGAFPGGRFSRFVEAYEDEASRH---GQSLYSFHGALTQT-----IRDQSLNQISHR 217 Query: 245 TRAITGIDGDIRI 257 +R I + R+ Sbjct: 218 SRIINDLVETYRL 230 >UniRef50_Q5Y1B4 Putative uncharacterized protein n=1 Tax=uncultured organism BAC21E04 RepID=Q5Y1B4_9ZZZZ Length = 315 Score = 51.3 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 32/240 (13%), Positives = 66/240 (27%), Gaps = 20/240 (8%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEV-PE 99 S++Y + +++ + C + D + ++ LR + G V Sbjct: 81 SKQYEIVQNDSLLRMAEFIREEVDMDCVIVLSDGAKVCFTA-TLRGAETDIVPGDTVKRR 139 Query: 100 IILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNF---GEIRVPHKGDIVGQV---IEGAY 153 I+ HDG + R VC N L + HK I Sbjct: 140 IVGYLGHDGKTGCGAKFTNIRVVCQNTLTAALGEAGGAHSSITHKNGANNNFDTLINSID 199 Query: 154 EVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRR---- 209 F + M + + + F Y + + + Sbjct: 200 VARQDFVTECELMREFSRASMGVSQFNEFVDEV----YNIDEGQVFRKREKLERAFTRGF 255 Query: 210 -WEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQF 268 + +W+ + E + + +A G+ R +I+K + +A Sbjct: 256 GFRFAPASVWSAVNAITE-VETSTRNTTAAKGRAQFAR--GTFGVGAQISKRAFALARDL 312 >UniRef50_Q2IFF9 Putative uncharacterized protein n=3 Tax=Anaeromyxobacter RepID=Q2IFF9_ANADE Length = 325 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 65/226 (28%), Gaps = 19/226 (8%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHML-RLRREGHINGQEVP- 98 S+ Y + + L E A + LG +L + + G P Sbjct: 74 SKSYEVVQFSEVARTLV-EAAGDVKAVFTTAGTLGPVGIKGWLLGEIPNPIKVKGDPSPI 132 Query: 99 --EIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG---NNFGEIRVPHKGDIVGQVIEGAY 153 ++ HDG ++ + R VC N L R+ H + ++ E Sbjct: 133 RKYVLGTTGHDGVTAVVLKNVATRVVCANTLGVALGERGGATWRIQHTANAKMRLDEAGK 192 Query: 154 E---VLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRR- 209 ++ ++++ + + + + + V +D + T E+ R Sbjct: 193 AFRQLVESYERLGELANVLAVTPFTTRQMKATIDRLMPVPKDDRDHTKPEAERGKVIRLF 252 Query: 210 -----WEDKQNDLWTTWQRVQENMIKGG--LSGRSASGKNTRTRAI 248 E + W Q E + R ++ Sbjct: 253 DTAAAIERVRGTAWAALQGWTEYADHHRQVRDTGREDPRRARLASV 298 >UniRef50_A6SWN5 Uncharacterized conserved protein n=39 Tax=Proteobacteria RepID=A6SWN5_JANMA Length = 318 Score = 47.8 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 64/195 (32%), Gaps = 20/195 (10%) Query: 33 SGDKHESRSERYTYIPTINIINKLR----DEGFQPFFACQSRVRDLGRREYSKHMLRLRR 88 + S RY + I+ R GF+ + V GR+ ++ + + + Sbjct: 74 TKAALSVVSNRYQVVQPDEILEFYRDLTTRSGFE---LETAGVMKGGRKLWA--LAKTGQ 128 Query: 89 EGHINGQEVP--EIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNN--FGEIRVPHKG-- 142 I ++ ++L + DGS + R VC N L + ++VPH Sbjct: 129 SFSIKDKDRINGYLLLATACDGSLATTAQFTSVRVVCNNTLAIALSGGKDVVKVPHSTTF 188 Query: 143 --DIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVT 200 D+V + + ++ F + K L E F R + + + + Sbjct: 189 EPDLVKKELGISFSAWDNFRYRMTKLAERK---LKDQEADAFLRTLFSIPTDHNGRQYMD 245 Query: 201 PEQIITPRRWEDKQN 215 +E K Sbjct: 246 RTMRQVKNIYEGKGR 260 >UniRef50_A8ZPY1 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A8ZPY1_ACAM1 Length = 209 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 22/98 (22%), Positives = 37/98 (37%), Gaps = 12/98 (12%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQS------RVRDLGRREYSKHMLRLRREGHING 94 S+RY + I + P A V R Y K + + G Sbjct: 111 SDRYRRVDNFEIAETVL-----PVLAEFGPGLKIMSVGLTDSRLYIKAVNERVQLDVRKG 165 Query: 95 QEV-PEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGN 131 V +++ NS G S ++ P ++R VC NG++ + Sbjct: 166 DAVQAGVVISNSEIGLGSIRIEPLVYRLVCLNGMISQD 203 >UniRef50_A6WZ56 Putative uncharacterized protein n=1 Tax=Ochrobactrum anthropi ATCC 49188 RepID=A6WZ56_OCHA4 Length = 402 Score = 45.9 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 29/76 (38%), Gaps = 3/76 (3%) Query: 85 RLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG-NNFGEIRVPHKGD 143 +L + NS GSS+ ++ R VC N L+ G F EI + H Sbjct: 235 KLPSGD--PDLVFRGFYITNSEVGSSALKVAAFYLRAVCCNRLMWGVEGFQEISMRHSKY 292 Query: 144 IVGQVIEGAYEVLGVF 159 + IE A L F Sbjct: 293 APSRFIEEARPALEGF 308 >UniRef50_C1D7A8 Putative uncharacterized protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D7A8_LARHH Length = 192 Score = 45.9 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 47/150 (31%), Gaps = 15/150 (10%) Query: 105 SHDGSSSYQMIPGIFRFVCTNG--LVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVF--D 160 S DGS R VC N + G +RVPH + V LG+ D Sbjct: 20 SCDGSLCTTAQFTSVRVVCNNTLQMAVAGRSGAVRVPH-STVFDPVAVKTELGLGLSGWD 78 Query: 161 KVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTP--EQIITPRRWEDKQN--- 215 +++A+ + ++ +E F L D+ PV+ +Q+ Sbjct: 79 AFIGHIKALSQRPVSPEEARQFFAGVLDEPVADDPDAPVSKALQQLSALYGGLGMGALLG 138 Query: 216 ----DLWTTWQRVQENMIKGGLSGRSASGK 241 W E + RS + Sbjct: 139 SSRGTAWGLVNAATE-FVDHHRRARSQDYR 167 >UniRef50_B3VM79 Gp52 n=2 Tax=unclassified Siphoviridae RepID=B3VM79_9CAUD Length = 403 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 58/196 (29%), Gaps = 14/196 (7%) Query: 88 REGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFG-----EIRVPHKG 142 P +++ S DGS + I VC N L + + + H Sbjct: 184 HNDRAGFDYRPNLLIYTSFDGSLKTTLARTITATVCDNTLQIAASEAKRAGTALTIGHTR 243 Query: 143 DIVGQVIEGAYE---VLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVR-YEDENKTP 198 ++ E + D ++ +++ + + L V + + Sbjct: 244 LSSDRMPEARQVLGIIEQESDDFNTLLDEWAATPVSTKQFEAWLDEVLPVPEVKVIDGKA 303 Query: 199 VTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGR--SASGKNTRTRAITGIDGDIR 256 T Q I + E + +T + K G+ +A R+ DG+ Sbjct: 304 KTNSQTIVLNKREAIGDLYYTDERAATWVGTKLGVRQAWNTAHHHKFRSGNAKQFDGNKT 363 Query: 257 INKALWVIAEQFRKWK 272 + + + R K Sbjct: 364 LARVESNM---MRSLK 376 >UniRef50_Q2JC95 Putative uncharacterized protein n=2 Tax=Frankia RepID=Q2JC95_FRASC Length = 335 Score = 44.7 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 50/177 (28%), Gaps = 18/177 (10%) Query: 96 EVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCG--NNFGEIRVPHKGDIVGQVIEGAY 153 P ++ S DGS S + VC N + G + +V H ++ E Sbjct: 150 FRPNLLATTSFDGSLSTTYKRIVTNVVCDNTMAAGLREQGQQTKVKHSAKSHLRLGEARQ 209 Query: 154 EVL---GVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMV-------RYEDENKTPVTPEQ 203 + + D + + + ++ + F A + R E Sbjct: 210 ALAIVHTIADDFAAEVAELCALEVSDRQWAAFLDAHAPMPDEKGRARTSAEKHRDTLTRL 269 Query: 204 IITPRRWEDKQNDLWTTWQRVQ-----ENMIKGGLSGRSASGKNTRTRAITGIDGDI 255 R +N W Q V E ++G + T +DG Sbjct: 270 WNHDERVSPWRNTGWGVIQAVNTYTHHEQTVRGASR-AERNMLRAVTGGADSLDGST 325 >UniRef50_C4V5A4 Putative uncharacterized protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V5A4_9FIRM Length = 365 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 51/135 (37%), Gaps = 14/135 (10%) Query: 41 SERYTYIPTINIINKLRDEGFQPFFACQSRVRDLGRREYSKHML------RLRREGHING 94 S+RY + + + + P + H+ +L+ E + Sbjct: 114 SDRYRRLDNLELCTAVL-----PVIQEMKDAAIMSCEVTESHLYLKVVNKKLKAEVGVGD 168 Query: 95 QEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYE 154 ++ NS G S ++ P I+R VC NGL+ + F + + H G V + AYE Sbjct: 169 VVQAGFVVSNSEVGLGSLKVEPLIYRLVCKNGLIVKD-FAQ-KKYHVGRQVAAEDDTAYE 226 Query: 155 VLGVFDKVTDNMEAM 169 L + + + + Sbjct: 227 -LYSDETLAQDDKTF 240 >UniRef50_C0GUY0 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GUY0_9DELT Length = 236 Score = 43.2 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 61/217 (28%), Gaps = 23/217 (10%) Query: 36 KHESRSERYTYIPTINIINKL----RDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGH 91 +E + + +I+ + RD+G G ++ + R Sbjct: 15 PEVQGTETWNPVHHSLVIDAVENAVRDKGLGIQDKRFELTTGGGN-LFASY-----RLDQ 68 Query: 92 INGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVI-- 149 +I NS + + G + VC+N + G+ H + + Sbjct: 69 GRDGVNWQIGFRNSIAKRFAVGITAGTYTMVCSNLVFAGDFVEF--RKHTKGLDTDELFS 126 Query: 150 ---EGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIIT 206 + + + +K I L+ + A+ + + + Sbjct: 127 MSGRAIETTVNRLESLEAWQLDLKNIPLSQRHMRILSFEAM----RKQAFPASRFHRFLE 182 Query: 207 PRRWEDK--QNDLWTTWQRVQENMIKGGLSGRSASGK 241 R E L++ + + + LS S + Sbjct: 183 AYREEIALNGLTLYSFYHSITRTIRDQSLSRISTRSE 219 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.129 0.332 Lambda K H 0.267 0.0395 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,499,979,901 Number of Sequences: 3077464 Number of extensions: 57129960 Number of successful extensions: 150538 Number of sequences better than 1.0e-01: 59 Number of HSP's better than 0.1 without gapping: 67 Number of HSP's successfully gapped in prelim test: 42 Number of HSP's that attempted gapping in prelim test: 150340 Number of HSP's gapped (non-prelim): 110 length of query: 273 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 146 effective length of database: 649,558,428 effective search space: 94835530488 effective search space used: 94835530488 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 92 (40.1 bits)