BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (207 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_C8UDI8 Conserved predicted protein n=3 Tax=Enterobacter... 381 e-105 UniRef50_Q9HVT1 Uncharacterized protein PA4490 n=70 Tax=Proteoba... 315 5e-85 UniRef50_A1K1H3 Conserved hypothetical secreted protein n=3 Tax=... 207 2e-52 UniRef50_C7RN59 Putative uncharacterized protein n=1 Tax=Candida... 206 5e-52 UniRef50_C5CEH1 Putative uncharacterized protein n=1 Tax=Kosmoto... 114 2e-24 UniRef50_Q1IT70 Putative uncharacterized protein n=1 Tax=Candida... 107 2e-22 UniRef50_A6LJB4 Putative uncharacterized protein n=2 Tax=Thermos... 100 3e-20 UniRef50_Q9X081 Uncharacterized protein TM_0986 n=5 Tax=Thermoto... 95 2e-18 UniRef50_Q025M9 Putative uncharacterized protein n=1 Tax=Candida... 95 2e-18 UniRef50_A0LH84 Putative uncharacterized protein n=1 Tax=Syntrop... 77 2e-13 UniRef50_B9K9Y4 Uncharacterized protein n=2 Tax=Thermotoga RepID... 63 7e-09 UniRef50_Q1DES2 Putative uncharacterized protein n=1 Tax=Myxococ... 54 3e-06 >UniRef50_C8UDI8 Conserved predicted protein n=3 Tax=Enterobacteriaceae RepID=C8UDI8_ECO1A Length = 208 Score = 381 bits (978), Expect = e-105, Method: Compositional matrix adjust. Identities = 184/199 (92%), Positives = 188/199 (94%), Gaps = 2/199 (1%) Query: 1 MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVR 60 MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVR Sbjct: 1 MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVR 60 Query: 61 FAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQ 120 FAANETLKVHDSKWLKSNG SSQYLPPEMTLTP QRQLAQNWNQGNGKTGPYVTAINLIQ Sbjct: 61 FAANETLKVHDSKWLKSNGFSSQYLPPEMTLTPGQRQLAQNWNQGNGKTGPYVTAINLIQ 120 Query: 121 YNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLM 180 YNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLM Sbjct: 121 YNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLM 180 Query: 181 TWKDTRWIPNDSNPNFIGI 199 TWK +I N N ++ + Sbjct: 181 TWK--LYIKNFINYQYVNV 197 >UniRef50_Q9HVT1 Uncharacterized protein PA4490 n=70 Tax=Proteobacteria RepID=Y4490_PSEAE Length = 212 Score = 315 bits (808), Expect = 5e-85, Method: Compositional matrix adjust. Identities = 140/186 (75%), Positives = 169/186 (90%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+ +QS +FRAWFVRIAQEQLRQGPSPRW+QQDCAGLVRFAANE LKVHD KWL++NGLS Sbjct: 27 LDPQQSQVFRAWFVRIAQEQLRQGPSPRWHQQDCAGLVRFAANEALKVHDGKWLRANGLS 86 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 ++YLPPE+ L+PEQR+LAQNW QG G+ GPYV AI L+Q+NS+ +G+D+NQA PGD++F+ Sbjct: 87 NRYLPPELALSPEQRRLAQNWQQGGGQVGPYVNAIKLVQFNSRLVGRDLNQARPGDLMFY 146 Query: 142 DQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYR 201 DQGD QHLM+WMGR + YHTGS+T TDNGMR+VSLQQLMTWKDTRWIP++SNPNFIGIYR Sbjct: 147 DQGDDQHLMIWMGRSIAYHTGSSTPTDNGMRSVSLQQLMTWKDTRWIPDESNPNFIGIYR 206 Query: 202 LNFLAR 207 L FL++ Sbjct: 207 LAFLSQ 212 >UniRef50_A1K1H3 Conserved hypothetical secreted protein n=3 Tax=Proteobacteria RepID=A1K1H3_AZOSB Length = 227 Score = 207 bits (527), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 92/186 (49%), Positives = 131/186 (70%), Gaps = 2/186 (1%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+ EQS RAW VRIA+ Q+R+GP+PRW +DCAGLVRFA E L HD++W ++ G+S Sbjct: 43 LDAEQSRALRAWVVRIAEAQIRRGPTPRWTHRDCAGLVRFAVAEALAEHDARWRQAMGIS 102 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 + LPP+ + +L W + +G TG YV AI L+Q NS+ +G+D+ A P D++FF Sbjct: 103 RR-LPPDTVPAALRAELRHRWKRPDGSTGAYVNAIGLVQENSRPVGRDLRLARPADLLFF 161 Query: 142 DQGDAQHLMVWMGRYVIYHTGSATKT-DNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIY 200 DQGDAQHLM+W G ++YH G+ + D+G+RAV+ ++L+ W DTRW P+ +NPNF G++ Sbjct: 162 DQGDAQHLMLWTGYRIVYHNGAEPRPGDDGLRAVTPRELLQWTDTRWRPDATNPNFAGVH 221 Query: 201 RLNFLA 206 L FLA Sbjct: 222 TLAFLA 227 >UniRef50_C7RN59 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RN59_9PROT Length = 238 Score = 206 bits (523), Expect = 5e-52, Method: Compositional matrix adjust. Identities = 96/202 (47%), Positives = 126/202 (62%), Gaps = 17/202 (8%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+ QS FRAW I +Q+R GPSPRW+ +DCAGL RFA E L+ HD++W ++NG + Sbjct: 38 LDGTQSDRFRAWMALIVADQIRHGPSPRWFHRDCAGLARFAVGEALRPHDTRWRRANGFT 97 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 + LPPE+ L PEQ L + W G +VTA+ L+Q NS +G++ + PGD++FF Sbjct: 98 GR-LPPEVDLRPEQASLLKGWTTLAGDQQAFVTALALVQKNSHRLGRESAMSKPGDLLFF 156 Query: 142 DQGDAQHLMVWMGRYVIYHTGSATKT----------------DNGMRAVSLQQLMTWKDT 185 D GD QHLM+WMG ++ YHTG DNG+RAV+ QL+ WKDT Sbjct: 157 DHGDDQHLMIWMGSWIAYHTGQPPAAAEQGAGRRRPSPEMGEDNGLRAVTPAQLLQWKDT 216 Query: 186 RWIPNDSNPNFIGIYRLNFLAR 207 RW P D NPNF G YRL FL+R Sbjct: 217 RWRPRDDNPNFAGFYRLAFLSR 238 >UniRef50_C5CEH1 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CEH1_KOSOT Length = 321 Score = 114 bits (285), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 72/209 (34%), Positives = 106/209 (50%), Gaps = 27/209 (12%) Query: 21 MLNVEQSGLFRAWFVRIAQEQ-LRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNG 79 +L E S + R+WFV IA Q L+ P+ R Q+DCAGLVRFA E LK H+ KW + G Sbjct: 114 VLKGEDSYILRSWFVNIALCQSLQLSPNWRENQRDCAGLVRFAMKEALKHHNKKWFQETG 173 Query: 80 LSSQYLPPEMTLTPEQRQL-------AQNWNQ-----------GNGKTGPYVTAINLIQY 121 + PE+ L L A N+ + G + + TA +++Y Sbjct: 174 ID-----PEIWLEKTGVDLRSIPDVSAYNYPEAPVGGTNIFVVGKDRLSSFATAYAILKY 228 Query: 122 NSQFIGQDINQALPGDMIFFDQGDAQ--HLMVWMGRYVIYHTGSATKTDNG-MRAVSLQQ 178 N F+G D++ ALPGD++FF + H M++ G ++YHTG ++ D G ++ L Sbjct: 229 NVNFVGFDLSAALPGDLLFFHNPNPVTFHTMIYTGEGLVYHTGPVSEEDPGKIKLWRLDD 288 Query: 179 LMTWKDTRWIPNDSNPNFIGIYRLNFLAR 207 M +W+P N F+GIYRL L Sbjct: 289 YMRIMPIQWLPIRENTCFLGIYRLKILGE 317 >UniRef50_Q1IT70 Putative uncharacterized protein n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IT70_ACIBL Length = 283 Score = 107 bits (268), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 67/198 (33%), Positives = 98/198 (49%), Gaps = 23/198 (11%) Query: 30 FRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS-------- 81 FR WF IA+ Q P DCA L+R++ E L+ HD+ W K +G Sbjct: 83 FRRWFTAIAEHQAEVSKLPAEIN-DCAALLRYSYREALRSHDAAWSKDSGFDQHPAVADI 141 Query: 82 SQYLPPEMTLTPEQRQLAQN----WNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGD 137 +Y P L P+ + +Q + +G + A LI N+ F+ +D++ A PGD Sbjct: 142 DKYRYPHTPLGPQLFRTSQGRFAAADLQDGTFAEFADAKTLIVNNAHFLSRDVHLAQPGD 201 Query: 138 MIF---FDQGDAQHLMVWMGR-------YVIYHTGSATKTDNGMRAVSLQQLMTWKDTRW 187 +IF F+Q H M+++GR +V+YHTG K +R V+L L+ D RW Sbjct: 202 LIFYRQFEQHSPFHSMIFLGRSGGDAGEWVVYHTGPDGKWPGEIRRVTLVSLLNHPDARW 261 Query: 188 IPNDSNPNFIGIYRLNFL 205 P SN NF+G+YR N L Sbjct: 262 RPVPSNRNFLGVYRWNIL 279 >UniRef50_A6LJB4 Putative uncharacterized protein n=2 Tax=Thermosipho RepID=A6LJB4_THEM4 Length = 207 Score = 100 bits (249), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 62/185 (33%), Positives = 85/185 (45%), Gaps = 10/185 (5%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+VE S FR WFV IA + + P + +DC+G V + E LK HD W + G Sbjct: 24 LDVEDSMNFRLWFVAIALDNV-DNEHPTFKTKDCSGFVFYCIKEALKKHDKDWFEKTGYK 82 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 L N K YV A NL+ YN+ FI + A PGD+IF+ Sbjct: 83 GPVFEDVKKYNYPNTPLGVNIFFNRKKYVSYVDAYNLLHYNTYFISYEKRYAKPGDLIFY 142 Query: 142 ----DQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFI 197 D H+M++ G +YHTG + +R VS + +M D W P + NP F+ Sbjct: 143 FHPEDPEFPYHVMIYTGNGFVYHTGPEGE----VRYVSYENMML-GDMAWKPIELNPAFL 197 Query: 198 GIYRL 202 G Y L Sbjct: 198 GYYGL 202 >UniRef50_Q9X081 Uncharacterized protein TM_0986 n=5 Tax=Thermotogaceae RepID=Y986_THEMA Length = 318 Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 61/206 (29%), Positives = 102/206 (49%), Gaps = 24/206 (11%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPS--PRWYQQDCAGLVRFAANETLKVHDSKWLKSNG 79 L+ E S FR WFV IA + P P+ ++DC+G VR+ A E LK H W +G Sbjct: 114 LDSEDSERFRNWFVWIALSAFKNDPLLWPK-EERDCSGFVRYCAREALKKHTGSWFSLSG 172 Query: 80 LS-------SQYLPPEMTLT-PEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDIN 131 + +Y P + L + ++ + +G + A L++ + +F+ + ++ Sbjct: 173 YNGPVWEDVEKYNYPNLPLVGTKMFRIEKGAYRGVEDFSNFAVARILVECSMEFVTKSVS 232 Query: 132 QALPGDM-IFFDQGDAQ---HLMVWMGR--------YVIYHTGSATKTDNGMRAVSLQQL 179 +ALPGD+ +FF D + HLM+++G + +YHTG + +R V +L Sbjct: 233 EALPGDIAVFFHPEDVEMPYHLMIFVGNLNLADHEGWFVYHTGPIGENPGELRFVRYSEL 292 Query: 180 MTWKDTRWIPNDSNPNFIGIYRLNFL 205 + + D W P + NP F+G YR FL Sbjct: 293 VNY-DPSWAPLEINPYFLGFYRFRFL 317 >UniRef50_Q025M9 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q025M9_SOLUE Length = 323 Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 63/208 (30%), Positives = 104/208 (50%), Gaps = 28/208 (13%) Query: 23 NVEQSGLFRAWFVRIAQEQLRQGPSPRWYQ-QDCAGLVRFAANETLKVHDSKWLKS---- 77 + + G FR WF +A+ Q Q + R + DCA L+R+A E L+ HD W + Sbjct: 115 DADDQGAFRRWFTFLAETQFYQPRAGRAAEIVDCAALIRYAYREALRAHDGAWATAAHLP 174 Query: 78 --NGLSS--QYLPPEMTLTPEQRQLA----QNWNQGNGKTGPYVTAINLIQYNSQFIGQD 129 G+SS +Y P L ++A Q + +G + A L N+ F+ +D Sbjct: 175 LAPGISSLAKYQYPFTPLAAGLFRVAPGRFQPADLTSGAFAQFADAKTLQLRNTHFVTRD 234 Query: 130 INQALPGDMIFFDQ--GDAQ-HLMVWMG---------RYVIYHTGSATKTDNGMRAVSLQ 177 + +A PGD++F+ Q GD H M+++G RY++YHTG + +R +++ Sbjct: 235 LARAQPGDLLFYRQESGDMPFHSMIYLGESQIEKSAARYLVYHTGPGP---DEIRRPTVE 291 Query: 178 QLMTWKDTRWIPNDSNPNFIGIYRLNFL 205 +L+ + + W P NP F+G+YR N L Sbjct: 292 ELLHFPEPEWRPLPDNPRFLGVYRWNIL 319 >UniRef50_A0LH84 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LH84_SYNFM Length = 583 Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 58/193 (30%), Positives = 96/193 (49%), Gaps = 25/193 (12%) Query: 31 RAWFVRIAQEQLRQGPSPRW--YQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQ-YLPP 87 R R+A Q R+ S RW Q+DC+GLVRFA E ++V K + G+ + +LPP Sbjct: 386 RESVTRLALFQARKL-SRRWEERQRDCSGLVRFAYREAIEVRSPKQQQKLGIPATLHLPP 444 Query: 88 EMTLTPEQRQLAQNW---------NQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDM 138 ++P R+L + + G + GP+ A L+ +N + + ++ A GD+ Sbjct: 445 ---VSPLSRRLFPRYPFIWQTGYESDGKPRFGPFADAETLVGFNFRKKARSLDAASSGDL 501 Query: 139 IFFDQG--DAQ--HLMVWMG----RYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPN 190 + F + D Q HLM+++ +YH G + + +R V + LM D W+P Sbjct: 502 LVFRKSFEDDQPYHLMIFVEDRPENLAVYHNGERGE-EAQVRVVRMSDLMRSPDPVWLPG 560 Query: 191 DSNPNFIGIYRLN 203 NP+F+G+Y N Sbjct: 561 ADNPHFLGVYEWN 573 >UniRef50_B9K9Y4 Uncharacterized protein n=2 Tax=Thermotoga RepID=B9K9Y4_THENN Length = 138 Score = 62.8 bits (151), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 33/106 (31%), Positives = 58/106 (54%), Gaps = 13/106 (12%) Query: 112 YVTAINLIQYNSQFIGQDINQALPGDM-IFFDQGDAQ---HLMVWMGR--------YVIY 159 + A L++ + +F+ + +++ALPGD+ +FF D + HLM+++G + +Y Sbjct: 33 FAVARILVECSMEFVTKSVSEALPGDIAVFFHPEDVEMPYHLMIFVGNLNLADHEGWFVY 92 Query: 160 HTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFL 205 HTG + +R V +L+ + D W P + NP F+G YR FL Sbjct: 93 HTGPIGENPGELRFVRYSELVNY-DPSWAPLEINPYFLGFYRFRFL 137 >UniRef50_Q1DES2 Putative uncharacterized protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DES2_MYXXD Length = 222 Score = 53.9 bits (128), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 51/192 (26%), Positives = 80/192 (41%), Gaps = 43/192 (22%) Query: 29 LFRAWFVRIAQEQLRQGPSPRWY--QQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLP 86 L R R+A Q+R+ + W+ Q+DCAGL+RFA K + + LS+ Sbjct: 53 LLRRLVARVALAQVRKQDAA-WHPDQRDCAGLIRFAFRSAY-----KQVAAERLSTPL-- 104 Query: 87 PEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQD--INQAL-PGDMIFF-- 141 W G G+ + A L+Q + Q +G+D ++L GD++ F Sbjct: 105 ---------------WRDGRGRPSDFADAEVLLQQSFQLLGRDDATRESLRTGDVLAFRQ 149 Query: 142 --DQGDAQHLMVWM--------GRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPND 191 D G HLM+ + V+YH G + +R L L+T W P Sbjct: 150 EQDAGPIFHLMLVVRPEDRAHAPARVVYHPG---EKGAAVRTGVLHHLVTEAPLEWRPVP 206 Query: 192 SNPNFIGIYRLN 203 N +F+G +R Sbjct: 207 QNASFLGFFRFK 218 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C8UDI8 Conserved predicted protein n=3 Tax=Enterobacter... 322 5e-87 UniRef50_Q9HVT1 Uncharacterized protein PA4490 n=70 Tax=Proteoba... 293 4e-78 UniRef50_C7RN59 Putative uncharacterized protein n=1 Tax=Candida... 246 5e-64 UniRef50_A1K1H3 Conserved hypothetical secreted protein n=3 Tax=... 244 1e-63 UniRef50_C5CEH1 Putative uncharacterized protein n=1 Tax=Kosmoto... 234 2e-60 UniRef50_Q9X081 Uncharacterized protein TM_0986 n=5 Tax=Thermoto... 232 5e-60 UniRef50_A6LJB4 Putative uncharacterized protein n=2 Tax=Thermos... 227 2e-58 UniRef50_Q1IT70 Putative uncharacterized protein n=1 Tax=Candida... 218 9e-56 UniRef50_Q025M9 Putative uncharacterized protein n=1 Tax=Candida... 208 8e-53 UniRef50_A0LH84 Putative uncharacterized protein n=1 Tax=Syntrop... 186 4e-46 UniRef50_B9K9Y4 Uncharacterized protein n=2 Tax=Thermotoga RepID... 152 7e-36 UniRef50_Q1DES2 Putative uncharacterized protein n=1 Tax=Myxococ... 140 3e-32 Sequences not found previously or not previously below threshold: UniRef50_C0ZCP2 Putative uncharacterized protein n=1 Tax=Breviba... 42 0.018 UniRef50_C8WIX2 NLP/P60 protein n=1 Tax=Eggerthella lenta DSM 22... 42 0.019 UniRef50_B1L2P1 Cell wall-associated hydrolase n=4 Tax=Clostridi... 39 0.085 CONVERGED! >UniRef50_C8UDI8 Conserved predicted protein n=3 Tax=Enterobacteriaceae RepID=C8UDI8_ECO1A Length = 208 Score = 322 bits (825), Expect = 5e-87, Method: Composition-based stats. Identities = 186/205 (90%), Positives = 191/205 (93%), Gaps = 3/205 (1%) Query: 1 MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVR 60 MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVR Sbjct: 1 MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVR 60 Query: 61 FAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQ 120 FAANETLKVHDSKWLKSNG SSQYLPPEMTLTP QRQLAQNWNQGNGKTGPYVTAINLIQ Sbjct: 61 FAANETLKVHDSKWLKSNGFSSQYLPPEMTLTPGQRQLAQNWNQGNGKTGPYVTAINLIQ 120 Query: 121 YNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLM 180 YNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLM Sbjct: 121 YNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLM 180 Query: 181 TWKDTRWIPNDSNPNFIGIYRLNFL 205 TWK +I N N ++ + R + L Sbjct: 181 TWK--LYIKNFINYQYVNV-RWSLL 202 >UniRef50_Q9HVT1 Uncharacterized protein PA4490 n=70 Tax=Proteobacteria RepID=Y4490_PSEAE Length = 212 Score = 293 bits (749), Expect = 4e-78, Method: Composition-based stats. Identities = 140/186 (75%), Positives = 169/186 (90%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+ +QS +FRAWFVRIAQEQLRQGPSPRW+QQDCAGLVRFAANE LKVHD KWL++NGLS Sbjct: 27 LDPQQSQVFRAWFVRIAQEQLRQGPSPRWHQQDCAGLVRFAANEALKVHDGKWLRANGLS 86 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 ++YLPPE+ L+PEQR+LAQNW QG G+ GPYV AI L+Q+NS+ +G+D+NQA PGD++F+ Sbjct: 87 NRYLPPELALSPEQRRLAQNWQQGGGQVGPYVNAIKLVQFNSRLVGRDLNQARPGDLMFY 146 Query: 142 DQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYR 201 DQGD QHLM+WMGR + YHTGS+T TDNGMR+VSLQQLMTWKDTRWIP++SNPNFIGIYR Sbjct: 147 DQGDDQHLMIWMGRSIAYHTGSSTPTDNGMRSVSLQQLMTWKDTRWIPDESNPNFIGIYR 206 Query: 202 LNFLAR 207 L FL++ Sbjct: 207 LAFLSQ 212 >UniRef50_C7RN59 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RN59_9PROT Length = 238 Score = 246 bits (627), Expect = 5e-64, Method: Composition-based stats. Identities = 96/202 (47%), Positives = 126/202 (62%), Gaps = 17/202 (8%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+ QS FRAW I +Q+R GPSPRW+ +DCAGL RFA E L+ HD++W ++NG + Sbjct: 38 LDGTQSDRFRAWMALIVADQIRHGPSPRWFHRDCAGLARFAVGEALRPHDTRWRRANGFT 97 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 + LPPE+ L PEQ L + W G +VTA+ L+Q NS +G++ + PGD++FF Sbjct: 98 GR-LPPEVDLRPEQASLLKGWTTLAGDQQAFVTALALVQKNSHRLGRESAMSKPGDLLFF 156 Query: 142 DQGDAQHLMVWMGRYVIYHTGSATK----------------TDNGMRAVSLQQLMTWKDT 185 D GD QHLM+WMG ++ YHTG DNG+RAV+ QL+ WKDT Sbjct: 157 DHGDDQHLMIWMGSWIAYHTGQPPAAAEQGAGRRRPSPEMGEDNGLRAVTPAQLLQWKDT 216 Query: 186 RWIPNDSNPNFIGIYRLNFLAR 207 RW P D NPNF G YRL FL+R Sbjct: 217 RWRPRDDNPNFAGFYRLAFLSR 238 >UniRef50_A1K1H3 Conserved hypothetical secreted protein n=3 Tax=Proteobacteria RepID=A1K1H3_AZOSB Length = 227 Score = 244 bits (623), Expect = 1e-63, Method: Composition-based stats. Identities = 92/186 (49%), Positives = 131/186 (70%), Gaps = 2/186 (1%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+ EQS RAW VRIA+ Q+R+GP+PRW +DCAGLVRFA E L HD++W ++ G+S Sbjct: 43 LDAEQSRALRAWVVRIAEAQIRRGPTPRWTHRDCAGLVRFAVAEALAEHDARWRQAMGIS 102 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 + LPP+ + +L W + +G TG YV AI L+Q NS+ +G+D+ A P D++FF Sbjct: 103 RR-LPPDTVPAALRAELRHRWKRPDGSTGAYVNAIGLVQENSRPVGRDLRLARPADLLFF 161 Query: 142 DQGDAQHLMVWMGRYVIYHTGSATKT-DNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIY 200 DQGDAQHLM+W G ++YH G+ + D+G+RAV+ ++L+ W DTRW P+ +NPNF G++ Sbjct: 162 DQGDAQHLMLWTGYRIVYHNGAEPRPGDDGLRAVTPRELLQWTDTRWRPDATNPNFAGVH 221 Query: 201 RLNFLA 206 L FLA Sbjct: 222 TLAFLA 227 >UniRef50_C5CEH1 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CEH1_KOSOT Length = 321 Score = 234 bits (596), Expect = 2e-60, Method: Composition-based stats. Identities = 72/209 (34%), Positives = 106/209 (50%), Gaps = 27/209 (12%) Query: 21 MLNVEQSGLFRAWFVRIAQEQ-LRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNG 79 +L E S + R+WFV IA Q L+ P+ R Q+DCAGLVRFA E LK H+ KW + G Sbjct: 114 VLKGEDSYILRSWFVNIALCQSLQLSPNWRENQRDCAGLVRFAMKEALKHHNKKWFQETG 173 Query: 80 LSSQYLPPEMTLTPEQRQL-------AQNWNQ-----------GNGKTGPYVTAINLIQY 121 + PE+ L L A N+ + G + + TA +++Y Sbjct: 174 ID-----PEIWLEKTGVDLRSIPDVSAYNYPEAPVGGTNIFVVGKDRLSSFATAYAILKY 228 Query: 122 NSQFIGQDINQALPGDMIFFDQGDA--QHLMVWMGRYVIYHTGSATKTDNG-MRAVSLQQ 178 N F+G D++ ALPGD++FF + H M++ G ++YHTG ++ D G ++ L Sbjct: 229 NVNFVGFDLSAALPGDLLFFHNPNPVTFHTMIYTGEGLVYHTGPVSEEDPGKIKLWRLDD 288 Query: 179 LMTWKDTRWIPNDSNPNFIGIYRLNFLAR 207 M +W+P N F+GIYRL L Sbjct: 289 YMRIMPIQWLPIRENTCFLGIYRLKILGE 317 >UniRef50_Q9X081 Uncharacterized protein TM_0986 n=5 Tax=Thermotogaceae RepID=Y986_THEMA Length = 318 Score = 232 bits (592), Expect = 5e-60, Method: Composition-based stats. Identities = 60/205 (29%), Positives = 98/205 (47%), Gaps = 22/205 (10%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPS-PRWYQQDCAGLVRFAANETLKVHDSKWLKSNGL 80 L+ E S FR WFV IA + P ++DC+G VR+ A E LK H W +G Sbjct: 114 LDSEDSERFRNWFVWIALSAFKNDPLLWPKEERDCSGFVRYCAREALKKHTGSWFSLSGY 173 Query: 81 SSQ-------YLPPEMTL-TPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQ 132 + Y P + L + ++ + +G + A L++ + +F+ + +++ Sbjct: 174 NGPVWEDVEKYNYPNLPLVGTKMFRIEKGAYRGVEDFSNFAVARILVECSMEFVTKSVSE 233 Query: 133 ALPGDM-IFFDQGD---AQHLMVWMGR--------YVIYHTGSATKTDNGMRAVSLQQLM 180 ALPGD+ +FF D HLM+++G + +YHTG + +R V +L+ Sbjct: 234 ALPGDIAVFFHPEDVEMPYHLMIFVGNLNLADHEGWFVYHTGPIGENPGELRFVRYSELV 293 Query: 181 TWKDTRWIPNDSNPNFIGIYRLNFL 205 + D W P + NP F+G YR FL Sbjct: 294 NY-DPSWAPLEINPYFLGFYRFRFL 317 >UniRef50_A6LJB4 Putative uncharacterized protein n=2 Tax=Thermosipho RepID=A6LJB4_THEM4 Length = 207 Score = 227 bits (579), Expect = 2e-58, Method: Composition-based stats. Identities = 62/187 (33%), Positives = 85/187 (45%), Gaps = 10/187 (5%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS 81 L+VE S FR WFV IA + + P + +DC+G V + E LK HD W + G Sbjct: 24 LDVEDSMNFRLWFVAIALDNV-DNEHPTFKTKDCSGFVFYCIKEALKKHDKDWFEKTGYK 82 Query: 82 SQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFF 141 L N K YV A NL+ YN+ FI + A PGD+IF+ Sbjct: 83 GPVFEDVKKYNYPNTPLGVNIFFNRKKYVSYVDAYNLLHYNTYFISYEKRYAKPGDLIFY 142 Query: 142 ----DQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFI 197 D H+M++ G +YHTG + +R VS + +M D W P + NP F+ Sbjct: 143 FHPEDPEFPYHVMIYTGNGFVYHTGPEGE----VRYVSYENMML-GDMAWKPIELNPAFL 197 Query: 198 GIYRLNF 204 G Y L Sbjct: 198 GYYGLKI 204 >UniRef50_Q1IT70 Putative uncharacterized protein n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IT70_ACIBL Length = 283 Score = 218 bits (555), Expect = 9e-56, Method: Composition-based stats. Identities = 67/206 (32%), Positives = 98/206 (47%), Gaps = 23/206 (11%) Query: 24 VEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLS-- 81 FR WF IA+ Q P DCA L+R++ E L+ HD+ W K +G Sbjct: 77 PSDRQSFRRWFTAIAEHQAEVSKLPAEIN-DCAALLRYSYREALRSHDAAWSKDSGFDQH 135 Query: 82 ------SQYLPPEMTLTPEQRQLAQN----WNQGNGKTGPYVTAINLIQYNSQFIGQDIN 131 +Y P L P+ + +Q + +G + A LI N+ F+ +D++ Sbjct: 136 PAVADIDKYRYPHTPLGPQLFRTSQGRFAAADLQDGTFAEFADAKTLIVNNAHFLSRDVH 195 Query: 132 QALPGDMIF---FDQGDAQHLMVWMGR-------YVIYHTGSATKTDNGMRAVSLQQLMT 181 A PGD+IF F+Q H M+++GR +V+YHTG K +R V+L L+ Sbjct: 196 LAQPGDLIFYRQFEQHSPFHSMIFLGRSGGDAGEWVVYHTGPDGKWPGEIRRVTLVSLLN 255 Query: 182 WKDTRWIPNDSNPNFIGIYRLNFLAR 207 D RW P SN NF+G+YR N L Sbjct: 256 HPDARWRPVPSNRNFLGVYRWNILRE 281 >UniRef50_Q025M9 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q025M9_SOLUE Length = 323 Score = 208 bits (530), Expect = 8e-53, Method: Composition-based stats. Identities = 61/208 (29%), Positives = 102/208 (49%), Gaps = 28/208 (13%) Query: 23 NVEQSGLFRAWFVRIAQEQLRQGPSPRWYQ-QDCAGLVRFAANETLKVHDSKWLKS---- 77 + + G FR WF +A+ Q Q + R + DCA L+R+A E L+ HD W + Sbjct: 115 DADDQGAFRRWFTFLAETQFYQPRAGRAAEIVDCAALIRYAYREALRAHDGAWATAAHLP 174 Query: 78 --NGLSS--QYLPPEMTLTPEQRQLA----QNWNQGNGKTGPYVTAINLIQYNSQFIGQD 129 G+SS +Y P L ++A Q + +G + A L N+ F+ +D Sbjct: 175 LAPGISSLAKYQYPFTPLAAGLFRVAPGRFQPADLTSGAFAQFADAKTLQLRNTHFVTRD 234 Query: 130 INQALPGDMIFFDQGD---AQHLMVWMG---------RYVIYHTGSATKTDNGMRAVSLQ 177 + +A PGD++F+ Q H M+++G RY++YHTG + +R +++ Sbjct: 235 LARAQPGDLLFYRQESGDMPFHSMIYLGESQIEKSAARYLVYHTGPGP---DEIRRPTVE 291 Query: 178 QLMTWKDTRWIPNDSNPNFIGIYRLNFL 205 +L+ + + W P NP F+G+YR N L Sbjct: 292 ELLHFPEPEWRPLPDNPRFLGVYRWNIL 319 >UniRef50_A0LH84 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LH84_SYNFM Length = 583 Score = 186 bits (473), Expect = 4e-46, Method: Composition-based stats. Identities = 52/200 (26%), Positives = 88/200 (44%), Gaps = 21/200 (10%) Query: 22 LNVEQSGLFRAWFVRIAQEQLRQ-GPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGL 80 L R R+A Q R+ Q+DC+GLVRFA E ++V K + G+ Sbjct: 377 LAPSDERAVRESVTRLALFQARKLSRRWEERQRDCSGLVRFAYREAIEVRSPKQQQKLGI 436 Query: 81 SSQYLPPEMTLTPEQRQLAQNWN---------QGNGKTGPYVTAINLIQYNSQFIGQDIN 131 + P ++P R+L + G + GP+ A L+ +N + + ++ Sbjct: 437 PATLHLP--PVSPLSRRLFPRYPFIWQTGYESDGKPRFGPFADAETLVGFNFRKKARSLD 494 Query: 132 QALPGDMIFFDQGD----AQHLMVWMG----RYVIYHTGSATKTDNGMRAVSLQQLMTWK 183 A GD++ F + HLM+++ +YH G + + +R V + LM Sbjct: 495 AASSGDLLVFRKSFEDDQPYHLMIFVEDRPENLAVYHNGERGE-EAQVRVVRMSDLMRSP 553 Query: 184 DTRWIPNDSNPNFIGIYRLN 203 D W+P NP+F+G+Y N Sbjct: 554 DPVWLPGADNPHFLGVYEWN 573 >UniRef50_B9K9Y4 Uncharacterized protein n=2 Tax=Thermotoga RepID=B9K9Y4_THENN Length = 138 Score = 152 bits (384), Expect = 7e-36, Method: Composition-based stats. Identities = 37/137 (27%), Positives = 68/137 (49%), Gaps = 14/137 (10%) Query: 82 SQYLPPEMTL-TPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDM-I 139 +Y P + L + ++ + +G + A L++ + +F+ + +++ALPGD+ + Sbjct: 2 EKYNYPNLPLVGTKMFRIEKGAYRGVEDFSNFAVARILVECSMEFVTKSVSEALPGDIAV 61 Query: 140 FFDQGD---AQHLMVWMGR--------YVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWI 188 FF D HLM+++G + +YHTG + +R V +L+ + D W Sbjct: 62 FFHPEDVEMPYHLMIFVGNLNLADHEGWFVYHTGPIGENPGELRFVRYSELVNY-DPSWA 120 Query: 189 PNDSNPNFIGIYRLNFL 205 P + NP F+G YR FL Sbjct: 121 PLEINPYFLGFYRFRFL 137 >UniRef50_Q1DES2 Putative uncharacterized protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DES2_MYXXD Length = 222 Score = 140 bits (352), Expect = 3e-32, Method: Composition-based stats. Identities = 51/192 (26%), Positives = 80/192 (41%), Gaps = 43/192 (22%) Query: 29 LFRAWFVRIAQEQLRQGPSPRWY--QQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLP 86 L R R+A Q+R+ + W+ Q+DCAGL+RFA K + + LS+ Sbjct: 53 LLRRLVARVALAQVRKQDAA-WHPDQRDCAGLIRFAFRSAYK-----QVAAERLSTPL-- 104 Query: 87 PEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQD--INQAL-PGDMIFFDQ 143 W G G+ + A L+Q + Q +G+D ++L GD++ F Q Sbjct: 105 ---------------WRDGRGRPSDFADAEVLLQQSFQLLGRDDATRESLRTGDVLAFRQ 149 Query: 144 ----GDAQHLMVWM--------GRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPND 191 G HLM+ + V+YH G + +R L L+T W P Sbjct: 150 EQDAGPIFHLMLVVRPEDRAHAPARVVYHPG---EKGAAVRTGVLHHLVTEAPLEWRPVP 206 Query: 192 SNPNFIGIYRLN 203 N +F+G +R Sbjct: 207 QNASFLGFFRFK 218 >UniRef50_C0ZCP2 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZCP2_BREBN Length = 302 Score = 41.6 bits (96), Expect = 0.018, Method: Composition-based stats. Identities = 13/54 (24%), Positives = 27/54 (50%), Gaps = 8/54 (14%) Query: 129 DINQALPGDMIFFD---QGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQL 179 ++QA PGD++F+D +G H+ +++G I H +R ++ + Sbjct: 238 SLDQAQPGDLLFYDALGKGRVSHVAIYLGNGAIVH-----ANGEDVRYGKVEYM 286 >UniRef50_C8WIX2 NLP/P60 protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WIX2_EGGLE Length = 696 Score = 41.6 bits (96), Expect = 0.019, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 49/157 (31%), Gaps = 25/157 (15%) Query: 48 PRWYQQDCAGLVRFAANETLKVHDS--KWLKSNG-------LSSQYLPPEMTLTPEQRQL 98 RW D G + F AN ++ D +W S+G +S L L Sbjct: 534 GRWCNADNDGTLTFIANRAFRLSDGTYQWTDSSGRVNQNEAMSRLMSAAHSVLGVPYVWL 593 Query: 99 AQNWNQGNGKTGPYVT-AINLIQYNSQFIGQ---------DINQALPGDMI---FFDQGD 145 G + I F +++A PGD+I F G Sbjct: 594 GNYPEDGGMDCASFTYWCYKQIGITIDFETYGQIHEGRAVSLSEARPGDLILMYFSSPGV 653 Query: 146 AQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTW 182 +H++++ G ++Y + V L W Sbjct: 654 PEHVVMYAGNGMVY---EEPTFGGHCQYVPLSSKNAW 687 >UniRef50_B1L2P1 Cell wall-associated hydrolase n=4 Tax=Clostridium botulinum RepID=B1L2P1_CLOBM Length = 342 Score = 39.3 bits (90), Expect = 0.085, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 33/77 (42%), Gaps = 5/77 (6%) Query: 109 TGPYVTAINLIQYNSQFIGQD--INQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATK 166 Y ++ I Q +G D ++QA GD++FF A H+ ++MG H A Sbjct: 265 YNAYGISLPRISQEQQQVGIDVPLSQAKAGDLVFFHGYPATHVGIYMGNGNYIH---APH 321 Query: 167 TDNGMRAVSLQQLMTWK 183 T + ++ L K Sbjct: 322 TGDVVKISPLGDYTNIK 338 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.133 0.399 Lambda K H 0.267 0.0412 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 861,416,404 Number of Sequences: 3077464 Number of extensions: 32363860 Number of successful extensions: 78033 Number of sequences better than 1.0e-01: 16 Number of HSP's better than 0.1 without gapping: 18 Number of HSP's successfully gapped in prelim test: 10 Number of HSP's that attempted gapping in prelim test: 77950 Number of HSP's gapped (non-prelim): 29 length of query: 207 length of database: 1,040,396,356 effective HSP length: 123 effective length of query: 84 effective length of database: 661,868,284 effective search space: 55596935856 effective search space used: 55596935856 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 90 (39.3 bits)