BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (225 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_P76220 Uncharacterized protein ydjY n=54 Tax=Enterobact...   468   e-131
UniRef50_C5W4W7 YdjY protein (Fragment) n=4 Tax=Bacteria RepID=C...   411   e-114
UniRef50_B0S0B9 Putative uncharacterized protein n=2 Tax=Finegol...   236   6e-61
UniRef50_C1QBU3 Putative uncharacterized protein n=1 Tax=Brachys...   167   3e-40
UniRef50_B0S3S5 Putative uncharacterized protein n=2 Tax=Finegol...   149   9e-35
UniRef50_A8S409 Putative uncharacterized protein n=2 Tax=Clostri...   145   1e-33
UniRef50_C8P4B5 Putative uncharacterized protein n=1 Tax=Lactoba...   131   1e-29
UniRef50_Q2LXG8 Hypothetical cytosolic protein n=1 Tax=Syntrophu...   115   1e-24

>UniRef50_P76220 Uncharacterized protein ydjY n=54 Tax=Enterobacteriaceae RepID=YDJY_ECOLI
          Length = 225

 Score = 468 bits (1204), Expect = e-131, Method: Compositional matrix adjust.
 Identities = 225/225 (100%), Positives = 225/225 (100%)

Query: 1   MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML 60
           MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML
Sbjct: 1   MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML 60

Query: 61  VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET 120
           VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET
Sbjct: 61  VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET 120

Query: 121 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC 180
           THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC
Sbjct: 121 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC 180

Query: 181 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225
           PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE
Sbjct: 181 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225

>UniRef50_C5W4W7 YdjY protein (Fragment) n=4 Tax=Bacteria RepID=C5W4W7_ECOBB
          Length = 200

 Score = 411 bits (1057), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/200 (98%), Positives = 198/200 (99%) Query: 26 GCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH 85 GCDQ+ENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH Sbjct: 1 GCDQKENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH 60 Query: 86 KSLFMGYATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE 145 KSLFM YATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE Sbjct: 61 KSLFMAYATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE 120 Query: 146 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 205 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG Sbjct: 121 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 180 Query: 206 NASVLPADNTLATVTFKIAE 225 NASVLPADNTLATVTFKI E Sbjct: 181 NASVLPADNTLATVTFKITE 200 >UniRef50_B0S0B9 Putative uncharacterized protein n=2 Tax=Finegoldia magna RepID=B0S0B9_FINM2 Length = 243 Score = 236 bits (601), Expect = 6e-61, Method: Compositional matrix adjust. Identities = 119/239 (49%), Positives = 147/239 (61%), Gaps = 23/239 (9%) Query: 10 KKGLAALCLLAVAGLSGCDQQEN-----------------------AAAKVEYDGLSNSQ 46 +K LA LA+ GL+GC Q+ + A E +G+S Sbjct: 5 RKLLAGFLCLAIVGLAGCSQKADDKKEANNTATTEQSKTEENKKEEKKADDEVNGVSLKN 64 Query: 47 PLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAG 106 P++VD VT+L +NG++ T++TRH V DGSNG KS+ Y TP+ FY AL E G Sbjct: 65 PIKVDKEAKKVTVLSSVNGKYFTENTRHASVNTDGSNGAKSVLTAYGTPEDFYNALIEIG 124 Query: 107 GTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAA 166 PGENM DN THV GSK+ +V W+GA K Y +EVI DSNGKK++ RFGGNL A Sbjct: 125 AKPGENMNPDNATKTHVEGSKIGATVTWEGAGKDYDINEVIKDSNGKKIEFRFGGNLERA 184 Query: 167 EEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 + KKTGCL CLDSCPVGI+SN TYTYGAVEKR EVKF GNA VLP D T VT+ + + Sbjct: 185 KTKKTGCLTCLDSCPVGIISNTTYTYGAVEKRNEVKFTGNADVLPEDGTYVAVTYTLED 243 >UniRef50_C1QBU3 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBU3_9SPIR Length = 230 Score = 167 bits (422), Expect = 3e-40, Method: 
Compositional matrix adjust. Identities = 90/227 (39%), Positives = 128/227 (56%), Gaps = 14/227 (6%) Query: 11 KGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNS-----------QPLRVDANNHTVTM 59 K + LCLL+ LS C ++ A++ V L+NS +P+ +D V + Sbjct: 3 KKIIMLCLLSSFMLSSCGNKQ-ASSSVSSAELTNSMGSVILRDEGAEPVVIDIEKKEVII 61 Query: 60 LVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTM-DNK 118 ++N ++ TRHGIVF G+NG KS+ G + + FY+AL + G G N+TM D K Sbjct: 62 PAEVNAKYFNSPTRHGIVFDRGANGDKSVLRGLSDEREFYQALIDIGAVAGNNLTMEDMK 121 Query: 119 ETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLD 178 V G KLD+ V W G K F ++I + +D+RFGGN AA+ K+TGC++CLD Sbjct: 122 LEKTVDGQKLDVFVTWDGLGKEIPFSDIIRSDEERPMDIRFGGNFEAAKAKRTGCILCLD 181 Query: 179 SCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 SC VGI S+A Y GAVE + ++ G VLP D T +V F+IAE Sbjct: 182 SCAVGITSDAAYETGAVEVKKIGRY-GREDVLPPDGTRVSVIFRIAE 227 >UniRef50_B0S3S5 Putative uncharacterized protein n=2 Tax=Finegoldia magna RepID=B0S3S5_FINM2 Length = 241 Score = 149 bits (375), Expect = 9e-35, Method: Compositional matrix adjust. Identities = 74/178 (41%), Positives = 109/178 (61%), Gaps = 2/178 (1%) Query: 47 PLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAG 106 P+ VD + + ++N +F + T H IV K G N +S+F+ YA Y+AL++ G Sbjct: 65 PMIVDEAKKQIKVYAEVNDKFKKESTMHAIVAKSGKNNEQSMFVSYANQNDLYDALEKIG 124 Query: 107 GTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAA 166 PG N+TM+N V G K+D++ +QG+ ++VI DS+GK++D+RFGGN A Sbjct: 125 AKPGNNVTMENMGKEAVKGDKIDLTFKFQGSDNELGINDVIKDSSGKEIDIRFGGNQKPA 184 Query: 167 EEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIA 224 ++ TGC+ CL SCP+GI SNA+ GA EK G VK+ SV PAD T +T+K+A Sbjct: 185 KDMNTGCMTCLQSCPLGITSNASQLIGADEKDG-VKYTLADSV-PADKTPVVITYKLA 240 >UniRef50_A8S409 Putative uncharacterized protein n=2 Tax=Clostridium RepID=A8S409_9CLOT Length = 222 Score = 145 bits (365), Expect = 1e-33, Method: Compositional matrix adjust. 
Identities = 73/185 (39%), Positives = 111/185 (60%), Gaps = 7/185 (3%) Query: 46 QPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEA 105 + + VD + + ML ++NG + T+ TRHGIV+K GSNG K++ G A K FY+AL + Sbjct: 40 ETMTVDKDKKEIIMLCEVNGTYFTEPTRHGIVYKGGSNGEKAVLRGLADEKEFYQALLDI 99 Query: 106 GGTPGENMTMDNKET-----THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFG 160 G G+N+T + + V G KLD+ V W+G + + ++I + +D+RFG Sbjct: 100 GAKAGDNLTAADMKAGPDNGKAVEGDKLDVFVKWEG-QEEIPYQDIIKCTEDYTMDLRFG 158 Query: 161 GNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVT 220 GN+ +A+E TGC++CLDSC GIVS+A + G + KF G+ VLP D T TV Sbjct: 159 GNIESAKENNTGCVLCLDSCATGIVSDAAWPTGTTQ-NDIAKFYGDKDVLPEDGTQVTVI 217 Query: 221 FKIAE 225 F++A+ Sbjct: 218 FRLAK 222 >UniRef50_C8P4B5 Putative uncharacterized protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P4B5_9LACO Length = 218 Score = 131 bits (330), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 83/219 (37%), Positives = 109/219 (49%), Gaps = 15/219 (6%) Query: 12 GLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDD 71 GLAA L G S Q N A + Q + V+ V +NG + T Sbjct: 9 GLAAAFSLVAVGTS--VSQSNTAYAAD-------QKIVVNKAKKQVEYPAVVNGTYFTQP 59 Query: 72 TRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKETT-----HVTGS 126 TRH V+K+GSNG K++ G A+ FY LK+ G PG N+ + + V GS Sbjct: 60 TRHLCVYKNGSNGDKAVLRGEASEITFYNDLKKIGAKPGNNLRPADMKAVKGQGKRVEGS 119 Query: 127 KLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVS 186 KL+I + W+G KA + I + K D RFGGNL A+ TGC+ C DSC GIVS Sbjct: 120 KLNIYIKWKGH-KAVPIQDCIKSTKKYKTDFRFGGNLGRAQTMNTGCVTCFDSCSTGIVS 178 Query: 187 NATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 + + G+ E VKF GN VLP D T T+ K+AE Sbjct: 179 DHAWPTGSTEPNHVVKFYGNQKVLPKDGTHVTMIVKLAE 217 >UniRef50_Q2LXG8 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LXG8_SYNAS Length = 241 Score = 115 bits (287), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 65/186 (34%), Positives = 91/186 (48%), Gaps = 5/186 (2%) Query: 40 DGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRH-GIVFKDGSNGHKSLFMGYATPKAF 98 D ++ P+ VD V + ++ R LT+ T H GI + G K + + A P A Sbjct: 52 DWPTSGNPVMVDTARRIVKLYTKLQLRHLTETTPHWGIGYSGGKLADKFILVSPAGPVAL 111 Query: 99 YEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMR 158 ++AL G N+ +D V G +L +S W G +E+ DS GK D+R Sbjct: 112 HDALVRIRARAGNNLPLDG-YGKFVDGDRLILSAQWPGLPTPVGLNEIFYDSAGKGFDIR 170 Query: 159 FGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEK---RGEVKFKGNASVLPADNT 215 FGGN A EKKTGCL CL+SCP+GI SNA Y + + + R F+G LP Sbjct: 171 FGGNRAIAAEKKTGCLTCLESCPIGISSNAVYPHLSTLQRMLRPTSSFRGKPERLPNKEA 230 Query: 216 LATVTF 221 + V F Sbjct: 231 VPIVVF 236 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76220 Uncharacterized protein ydjY n=54 Tax=Enterobact... 346 4e-94 UniRef50_C5W4W7 YdjY protein (Fragment) n=4 Tax=Bacteria RepID=C... 297 2e-79 UniRef50_C1QBU3 Putative uncharacterized protein n=1 Tax=Brachys... 281 1e-74 UniRef50_B0S0B9 Putative uncharacterized protein n=2 Tax=Finegol... 279 5e-74 UniRef50_C8P4B5 Putative uncharacterized protein n=1 Tax=Lactoba... 265 9e-70 UniRef50_A8S409 Putative uncharacterized protein n=2 Tax=Clostri... 256 4e-67 UniRef50_Q2LXG8 Hypothetical cytosolic protein n=1 Tax=Syntrophu... 243 2e-63 UniRef50_B0S3S5 Putative uncharacterized protein n=2 Tax=Finegol... 239 5e-62 Sequences not found previously or not previously below threshold: UniRef50_B9M6P1 Putative uncharacterized protein n=1 Tax=Geobact... 49 2e-04 UniRef50_B4DBK3 Putative uncharacterized protein n=1 Tax=Chthoni... 47 7e-04 >UniRef50_P76220 Uncharacterized protein ydjY n=54 Tax=Enterobacteriaceae RepID=YDJY_ECOLI Length = 225 Score = 346 bits (887), Expect = 4e-94, Method: Composition-based stats. 
Identities = 225/225 (100%), Positives = 225/225 (100%) Query: 1 MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML 60 MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML Sbjct: 1 MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML 60 Query: 61 VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET 120 VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET Sbjct: 61 VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET 120 Query: 121 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC 180 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC Sbjct: 121 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC 180 Query: 181 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE Sbjct: 181 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 >UniRef50_C5W4W7 YdjY protein (Fragment) n=4 Tax=Bacteria RepID=C5W4W7_ECOBB Length = 200 Score = 297 bits (761), Expect = 2e-79, Method: Composition-based stats. Identities = 197/200 (98%), Positives = 198/200 (99%) Query: 26 GCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH 85 GCDQ+ENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH Sbjct: 1 GCDQKENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH 60 Query: 86 KSLFMGYATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE 145 KSLFM YATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE Sbjct: 61 KSLFMAYATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE 120 Query: 146 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 205 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG Sbjct: 121 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 180 Query: 206 NASVLPADNTLATVTFKIAE 225 NASVLPADNTLATVTFKI E Sbjct: 181 NASVLPADNTLATVTFKITE 200 >UniRef50_C1QBU3 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBU3_9SPIR Length = 230 Score = 281 bits (719), Expect = 1e-74, Method: Composition-based stats. 
Identities = 88/227 (38%), Positives = 129/227 (56%), Gaps = 14/227 (6%) Query: 11 KGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNS-----------QPLRVDANNHTVTM 59 K + LCLL+ LS C ++ A++ V L+NS +P+ +D V + Sbjct: 3 KKIIMLCLLSSFMLSSCGNKQ-ASSSVSSAELTNSMGSVILRDEGAEPVVIDIEKKEVII 61 Query: 60 LVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKE 119 ++N ++ TRHGIVF G+NG KS+ G + + FY+AL + G G N+TM++ + Sbjct: 62 PAEVNAKYFNSPTRHGIVFDRGANGDKSVLRGLSDEREFYQALIDIGAVAGNNLTMEDMK 121 Query: 120 -TTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLD 178 V G KLD+ V W G K F ++I + +D+RFGGN AA+ K+TGC++CLD Sbjct: 122 LEKTVDGQKLDVFVTWDGLGKEIPFSDIIRSDEERPMDIRFGGNFEAAKAKRTGCILCLD 181 Query: 179 SCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 SC VGI S+A Y GAVE + ++ G VLP D T +V F+IAE Sbjct: 182 SCAVGITSDAAYETGAVEVKKIGRY-GREDVLPPDGTRVSVIFRIAE 227 >UniRef50_B0S0B9 Putative uncharacterized protein n=2 Tax=Finegoldia magna RepID=B0S0B9_FINM2 Length = 243 Score = 279 bits (713), Expect = 5e-74, Method: Composition-based stats. 
Identities = 119/239 (49%), Positives = 147/239 (61%), Gaps = 23/239 (9%) Query: 10 KKGLAALCLLAVAGLSGCDQQENAA-----------------------AKVEYDGLSNSQ 46 +K LA LA+ GL+GC Q+ + A E +G+S Sbjct: 5 RKLLAGFLCLAIVGLAGCSQKADDKKEANNTATTEQSKTEENKKEEKKADDEVNGVSLKN 64 Query: 47 PLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAG 106 P++VD VT+L +NG++ T++TRH V DGSNG KS+ Y TP+ FY AL E G Sbjct: 65 PIKVDKEAKKVTVLSSVNGKYFTENTRHASVNTDGSNGAKSVLTAYGTPEDFYNALIEIG 124 Query: 107 GTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAA 166 PGENM DN THV GSK+ +V W+GA K Y +EVI DSNGKK++ RFGGNL A Sbjct: 125 AKPGENMNPDNATKTHVEGSKIGATVTWEGAGKDYDINEVIKDSNGKKIEFRFGGNLERA 184 Query: 167 EEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 + KKTGCL CLDSCPVGI+SN TYTYGAVEKR EVKF GNA VLP D T VT+ + + Sbjct: 185 KTKKTGCLTCLDSCPVGIISNTTYTYGAVEKRNEVKFTGNADVLPEDGTYVAVTYTLED 243 >UniRef50_C8P4B5 Putative uncharacterized protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P4B5_9LACO Length = 218 Score = 265 bits (677), Expect = 9e-70, Method: Composition-based stats. 
Identities = 83/219 (37%), Positives = 109/219 (49%), Gaps = 15/219 (6%) Query: 12 GLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDD 71 GLAA L G S Q N A + Q + V+ V +NG + T Sbjct: 9 GLAAAFSLVAVGTS--VSQSNTAYAAD-------QKIVVNKAKKQVEYPAVVNGTYFTQP 59 Query: 72 TRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKE-----TTHVTGS 126 TRH V+K+GSNG K++ G A+ FY LK+ G PG N+ + + V GS Sbjct: 60 TRHLCVYKNGSNGDKAVLRGEASEITFYNDLKKIGAKPGNNLRPADMKAVKGQGKRVEGS 119 Query: 127 KLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVS 186 KL+I + W+G KA + I + K D RFGGNL A+ TGC+ C DSC GIVS Sbjct: 120 KLNIYIKWKGH-KAVPIQDCIKSTKKYKTDFRFGGNLGRAQTMNTGCVTCFDSCSTGIVS 178 Query: 187 NATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 + + G+ E VKF GN VLP D T T+ K+AE Sbjct: 179 DHAWPTGSTEPNHVVKFYGNQKVLPKDGTHVTMIVKLAE 217 >UniRef50_A8S409 Putative uncharacterized protein n=2 Tax=Clostridium RepID=A8S409_9CLOT Length = 222 Score = 256 bits (654), Expect = 4e-67, Method: Composition-based stats. Identities = 77/217 (35%), Positives = 121/217 (55%), Gaps = 8/217 (3%) Query: 14 AALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTR 73 + V L+ C + N + + + + + VD + + ML ++NG + T+ TR Sbjct: 9 IMVATAMVFSLAACGGKSNTVES-KAEEKAAVETMTVDKDKKEIIMLCEVNGTYFTEPTR 67 Query: 74 HGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKE-----TTHVTGSKL 128 HGIV+K GSNG K++ G A K FY+AL + G G+N+T + + V G KL Sbjct: 68 HGIVYKGGSNGEKAVLRGLADEKEFYQALLDIGAKAGDNLTAADMKAGPDNGKAVEGDKL 127 Query: 129 DISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNA 188 D+ V W+G + + ++I + +D+RFGGN+ +A+E TGC++CLDSC GIVS+A Sbjct: 128 DVFVKWEG-QEEIPYQDIIKCTEDYTMDLRFGGNIESAKENNTGCVLCLDSCATGIVSDA 186 Query: 189 TYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 + G + KF G+ VLP D T TV F++A+ Sbjct: 187 AWPTGTTQ-NDIAKFYGDKDVLPEDGTQVTVIFRLAK 222 >UniRef50_Q2LXG8 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LXG8_SYNAS Length = 241 Score = 243 bits (621), Expect = 2e-63, Method: Composition-based stats. 
Identities = 65/186 (34%), Positives = 91/186 (48%), Gaps = 5/186 (2%) Query: 40 DGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRH-GIVFKDGSNGHKSLFMGYATPKAF 98 D ++ P+ VD V + ++ R LT+ T H GI + G K + + A P A Sbjct: 52 DWPTSGNPVMVDTARRIVKLYTKLQLRHLTETTPHWGIGYSGGKLADKFILVSPAGPVAL 111 Query: 99 YEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMR 158 ++AL G N+ +D V G +L +S W G +E+ DS GK D+R Sbjct: 112 HDALVRIRARAGNNLPLDG-YGKFVDGDRLILSAQWPGLPTPVGLNEIFYDSAGKGFDIR 170 Query: 159 FGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEK---RGEVKFKGNASVLPADNT 215 FGGN A EKKTGCL CL+SCP+GI SNA Y + + + R F+G LP Sbjct: 171 FGGNRAIAAEKKTGCLTCLESCPIGISSNAVYPHLSTLQRMLRPTSSFRGKPERLPNKEA 230 Query: 216 LATVTF 221 + V F Sbjct: 231 VPIVVF 236 >UniRef50_B0S3S5 Putative uncharacterized protein n=2 Tax=Finegoldia magna RepID=B0S3S5_FINM2 Length = 241 Score = 239 bits (610), Expect = 5e-62, Method: Composition-based stats. Identities = 84/237 (35%), Positives = 126/237 (53%), Gaps = 24/237 (10%) Query: 10 KKGLAALCLLAVAGLSGCDQQENAAAK----------------------VEYDGLSNSQP 47 K + AL L+A +GC +++N A E + P Sbjct: 6 KFKILALLLMASLVFAGCSKEDNKQASSTEQKTEEKKEEKTEQKTEEKKEEAQEPTADNP 65 Query: 48 LRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGG 107 + VD + + ++N +F + T H IV K G N +S+F+ YA Y+AL++ G Sbjct: 66 MIVDEAKKQIKVYAEVNDKFKKESTMHAIVAKSGKNNEQSMFVSYANQNDLYDALEKIGA 125 Query: 108 TPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAE 167 PG N+TM+N V G K+D++ +QG+ ++VI DS+GK++D+RFGGN A+ Sbjct: 126 KPGNNVTMENMGKEAVKGDKIDLTFKFQGSDNELGINDVIKDSSGKEIDIRFGGNQKPAK 185 Query: 168 EKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIA 224 + TGC+ CL SCP+GI SNA+ GA EK G VK+ SV PAD T +T+K+A Sbjct: 186 DMNTGCMTCLQSCPLGITSNASQLIGADEKDG-VKYTLADSV-PADKTPVVITYKLA 240 >UniRef50_B9M6P1 Putative uncharacterized protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M6P1_GEOSF Length = 242 Score = 48.6 bits (114), Expect = 2e-04, Method: Composition-based stats. 
Identities = 44/248 (17%), Positives = 90/248 (36%), Gaps = 39/248 (15%) Query: 5 YSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQIN 64 Y ++ + L A +L+ A + +Q+ AA K + G S L + V + + Sbjct: 2 YRLALRITLLASLILSFALSASAAKQKPAAVKTDKYG---SDTLTTNGRTREVRVTATV- 57 Query: 65 GRFLTDDT--------RHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMD 116 + + + + KDG +F A +A+K G + M Sbjct: 58 VKDCSQPSVCDWGRRFQGFFGSKDGKMAPFFIFSTEVHRAALDKAIKSVGIKSRRQIPMT 117 Query: 117 NKETT-----------HVTGSKLDISVNWQGAAK--AYSFDEVIVDS---NGKKL----- 155 + ++ G + +SV W+ K + +E+I + +GK++ Sbjct: 118 EVKQRSGLKSTTQMDDYLDGDPILVSVRWKQDGKMVERAMEELIEEKILVDGKEVIKPYT 177 Query: 156 -DMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADN 214 + G A +GC+VC C G++++ + K + ++ N +P Sbjct: 178 PHFVYHGTAE-AINFASGCIVCPSGCNGGVIADNSVP----LKETKNYYRFNWKKMPHPG 232 Query: 215 TLATVTFK 222 T + K Sbjct: 233 TKVEIVLK 240 >UniRef50_B4DBK3 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBK3_9BACT Length = 243 Score = 46.6 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 51/119 (42%), Gaps = 15/119 (12%) Query: 48 LRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGG 107 +R+D TVT +N + D ++ + N H+SL + TP + A+ G Sbjct: 46 IRLDQKARTVTFPGVLN---MNDGNLEYLIVTEQGNTHESLLVSDVTPSDLHFAMLLLGA 102 Query: 108 TPGE----NMTMDNKETTHVT------GSKLDISVNWQ--GAAKAYSFDEVIVDSNGKK 154 ++ ++ ++ G +DI+V+W+ G K+ ++ + ++ KK Sbjct: 103 KGSGSQSGDLPPSQIDSKYLKTAPPLKGDDIDITVHWKAGGTEKSAPVEDWLFNTETKK 161 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76220 Uncharacterized protein ydjY n=54 Tax=Enterobact... 331 1e-89 UniRef50_C5W4W7 YdjY protein (Fragment) n=4 Tax=Bacteria RepID=C... 287 3e-76 UniRef50_C1QBU3 Putative uncharacterized protein n=1 Tax=Brachys... 
270 2e-71 UniRef50_B0S0B9 Putative uncharacterized protein n=2 Tax=Finegol... 269 6e-71 UniRef50_A8S409 Putative uncharacterized protein n=2 Tax=Clostri... 260 3e-68 UniRef50_C8P4B5 Putative uncharacterized protein n=1 Tax=Lactoba... 255 6e-67 UniRef50_B0S3S5 Putative uncharacterized protein n=2 Tax=Finegol... 243 2e-63 UniRef50_Q2LXG8 Hypothetical cytosolic protein n=1 Tax=Syntrophu... 229 6e-59 UniRef50_B9M6P1 Putative uncharacterized protein n=1 Tax=Geobact... 210 4e-53 UniRef50_B4DBK3 Putative uncharacterized protein n=1 Tax=Chthoni... 111 2e-23 Sequences not found previously or not previously below threshold: UniRef50_D2R883 Putative uncharacterized protein n=1 Tax=Pirellu... 61 2e-08 UniRef50_A4A001 Putative uncharacterized protein n=1 Tax=Blastop... 58 2e-07 >UniRef50_P76220 Uncharacterized protein ydjY n=54 Tax=Enterobacteriaceae RepID=YDJY_ECOLI Length = 225 Score = 331 bits (849), Expect = 1e-89, Method: Composition-based stats. Identities = 225/225 (100%), Positives = 225/225 (100%) Query: 1 MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML 60 MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML Sbjct: 1 MLQHYSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTML 60 Query: 61 VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET 120 VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET Sbjct: 61 VQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKET 120 Query: 121 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC 180 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC Sbjct: 121 THVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSC 180 Query: 181 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE Sbjct: 181 PVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 >UniRef50_C5W4W7 YdjY protein (Fragment) n=4 Tax=Bacteria RepID=C5W4W7_ECOBB Length = 200 Score = 287 bits (733), Expect = 3e-76, Method: Composition-based stats. 
Identities = 197/200 (98%), Positives = 198/200 (99%) Query: 26 GCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH 85 GCDQ+ENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH Sbjct: 1 GCDQKENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGH 60 Query: 86 KSLFMGYATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE 145 KSLFM YATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE Sbjct: 61 KSLFMAYATPKAFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDE 120 Query: 146 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 205 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG Sbjct: 121 VIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 180 Query: 206 NASVLPADNTLATVTFKIAE 225 NASVLPADNTLATVTFKI E Sbjct: 181 NASVLPADNTLATVTFKITE 200 >UniRef50_C1QBU3 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBU3_9SPIR Length = 230 Score = 270 bits (690), Expect = 2e-71, Method: Composition-based stats. Identities = 88/229 (38%), Positives = 129/229 (56%), Gaps = 14/229 (6%) Query: 9 WKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNS-----------QPLRVDANNHTV 57 K + LCLL+ LS C ++ A++ V L+NS +P+ +D V Sbjct: 1 MNKKIIMLCLLSSFMLSSCGNKQ-ASSSVSSAELTNSMGSVILRDEGAEPVVIDIEKKEV 59 Query: 58 TMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDN 117 + ++N ++ TRHGIVF G+NG KS+ G + + FY+AL + G G N+TM++ Sbjct: 60 IIPAEVNAKYFNSPTRHGIVFDRGANGDKSVLRGLSDEREFYQALIDIGAVAGNNLTMED 119 Query: 118 KE-TTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVC 176 + V G KLD+ V W G K F ++I + +D+RFGGN AA+ K+TGC++C Sbjct: 120 MKLEKTVDGQKLDVFVTWDGLGKEIPFSDIIRSDEERPMDIRFGGNFEAAKAKRTGCILC 179 Query: 177 LDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 LDSC VGI S+A Y GAVE + ++ G VLP D T +V F+IAE Sbjct: 180 LDSCAVGITSDAAYETGAVEVKKIGRY-GREDVLPPDGTRVSVIFRIAE 227 >UniRef50_B0S0B9 Putative uncharacterized protein n=2 Tax=Finegoldia magna RepID=B0S0B9_FINM2 Length = 243 Score = 269 bits (687), Expect = 6e-71, Method: Composition-based stats. 
Identities = 119/239 (49%), Positives = 147/239 (61%), Gaps = 23/239 (9%) Query: 10 KKGLAALCLLAVAGLSGCDQQENAA-----------------------AKVEYDGLSNSQ 46 +K LA LA+ GL+GC Q+ + A E +G+S Sbjct: 5 RKLLAGFLCLAIVGLAGCSQKADDKKEANNTATTEQSKTEENKKEEKKADDEVNGVSLKN 64 Query: 47 PLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAG 106 P++VD VT+L +NG++ T++TRH V DGSNG KS+ Y TP+ FY AL E G Sbjct: 65 PIKVDKEAKKVTVLSSVNGKYFTENTRHASVNTDGSNGAKSVLTAYGTPEDFYNALIEIG 124 Query: 107 GTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAA 166 PGENM DN THV GSK+ +V W+GA K Y +EVI DSNGKK++ RFGGNL A Sbjct: 125 AKPGENMNPDNATKTHVEGSKIGATVTWEGAGKDYDINEVIKDSNGKKIEFRFGGNLERA 184 Query: 167 EEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 + KKTGCL CLDSCPVGI+SN TYTYGAVEKR EVKF GNA VLP D T VT+ + + Sbjct: 185 KTKKTGCLTCLDSCPVGIISNTTYTYGAVEKRNEVKFTGNADVLPEDGTYVAVTYTLED 243 >UniRef50_A8S409 Putative uncharacterized protein n=2 Tax=Clostridium RepID=A8S409_9CLOT Length = 222 Score = 260 bits (663), Expect = 3e-68, Method: Composition-based stats. 
Identities = 77/221 (34%), Positives = 122/221 (55%), Gaps = 8/221 (3%) Query: 10 KKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLT 69 + + V L+ C + N + + + + + VD + + ML ++NG + T Sbjct: 5 RLTAIMVATAMVFSLAACGGKSNTVES-KAEEKAAVETMTVDKDKKEIIMLCEVNGTYFT 63 Query: 70 DDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKE-----TTHVT 124 + TRHGIV+K GSNG K++ G A K FY+AL + G G+N+T + + V Sbjct: 64 EPTRHGIVYKGGSNGEKAVLRGLADEKEFYQALLDIGAKAGDNLTAADMKAGPDNGKAVE 123 Query: 125 GSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGI 184 G KLD+ V W+G + + ++I + +D+RFGGN+ +A+E TGC++CLDSC GI Sbjct: 124 GDKLDVFVKWEG-QEEIPYQDIIKCTEDYTMDLRFGGNIESAKENNTGCVLCLDSCATGI 182 Query: 185 VSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 VS+A + G + KF G+ VLP D T TV F++A+ Sbjct: 183 VSDAAWPTGTT-QNDIAKFYGDKDVLPEDGTQVTVIFRLAK 222 >UniRef50_C8P4B5 Putative uncharacterized protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P4B5_9LACO Length = 218 Score = 255 bits (652), Expect = 6e-67, Method: Composition-based stats. Identities = 83/220 (37%), Positives = 109/220 (49%), Gaps = 15/220 (6%) Query: 11 KGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQINGRFLTD 70 GLAA L G S Q N A + Q + V+ V +NG + T Sbjct: 8 IGLAAAFSLVAVGTS--VSQSNTAYAAD-------QKIVVNKAKKQVEYPAVVNGTYFTQ 58 Query: 71 DTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKE-----TTHVTG 125 TRH V+K+GSNG K++ G A+ FY LK+ G PG N+ + + V G Sbjct: 59 PTRHLCVYKNGSNGDKAVLRGEASEITFYNDLKKIGAKPGNNLRPADMKAVKGQGKRVEG 118 Query: 126 SKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIV 185 SKL+I + W+G KA + I + K D RFGGNL A+ TGC+ C DSC GIV Sbjct: 119 SKLNIYIKWKGH-KAVPIQDCIKSTKKYKTDFRFGGNLGRAQTMNTGCVTCFDSCSTGIV 177 Query: 186 SNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIAE 225 S+ + G+ E VKF GN VLP D T T+ K+AE Sbjct: 178 SDHAWPTGSTEPNHVVKFYGNQKVLPKDGTHVTMIVKLAE 217 >UniRef50_B0S3S5 Putative uncharacterized protein n=2 Tax=Finegoldia magna RepID=B0S3S5_FINM2 Length = 241 Score = 243 bits (621), Expect = 2e-63, Method: Composition-based stats. 
Identities = 83/237 (35%), Positives = 125/237 (52%), Gaps = 24/237 (10%) Query: 10 KKGLAALCLLAVAGLSGCDQQENAAAKVEYDG----------------------LSNSQP 47 K + AL L+A +GC +++N A + P Sbjct: 6 KFKILALLLMASLVFAGCSKEDNKQASSTEQKTEEKKEEKTEQKTEEKKEEAQEPTADNP 65 Query: 48 LRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGG 107 + VD + + ++N +F + T H IV K G N +S+F+ YA Y+AL++ G Sbjct: 66 MIVDEAKKQIKVYAEVNDKFKKESTMHAIVAKSGKNNEQSMFVSYANQNDLYDALEKIGA 125 Query: 108 TPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAE 167 PG N+TM+N V G K+D++ +QG+ ++VI DS+GK++D+RFGGN A+ Sbjct: 126 KPGNNVTMENMGKEAVKGDKIDLTFKFQGSDNELGINDVIKDSSGKEIDIRFGGNQKPAK 185 Query: 168 EKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTFKIA 224 + TGC+ CL SCP+GI SNA+ GA EK G VK+ SV PAD T +T+K+A Sbjct: 186 DMNTGCMTCLQSCPLGITSNASQLIGADEKDG-VKYTLADSV-PADKTPVVITYKLA 240 >UniRef50_Q2LXG8 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LXG8_SYNAS Length = 241 Score = 229 bits (583), Expect = 6e-59, Method: Composition-based stats. Identities = 65/186 (34%), Positives = 91/186 (48%), Gaps = 5/186 (2%) Query: 40 DGLSNSQPLRVDANNHTVTMLVQINGRFLTDDTRH-GIVFKDGSNGHKSLFMGYATPKAF 98 D ++ P+ VD V + ++ R LT+ T H GI + G K + + A P A Sbjct: 52 DWPTSGNPVMVDTARRIVKLYTKLQLRHLTETTPHWGIGYSGGKLADKFILVSPAGPVAL 111 Query: 99 YEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEVIVDSNGKKLDMR 158 ++AL G N+ +D V G +L +S W G +E+ DS GK D+R Sbjct: 112 HDALVRIRARAGNNLPLDG-YGKFVDGDRLILSAQWPGLPTPVGLNEIFYDSAGKGFDIR 170 Query: 159 FGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEK---RGEVKFKGNASVLPADNT 215 FGGN A EKKTGCL CL+SCP+GI SNA Y + + + R F+G LP Sbjct: 171 FGGNRAIAAEKKTGCLTCLESCPIGISSNAVYPHLSTLQRMLRPTSSFRGKPERLPNKEA 230 Query: 216 LATVTF 221 + V F Sbjct: 231 VPIVVF 236 >UniRef50_B9M6P1 Putative uncharacterized protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M6P1_GEOSF Length = 242 Score = 210 bits (533), Expect = 4e-53, Method: Composition-based stats. 
Identities = 44/248 (17%), Positives = 90/248 (36%), Gaps = 39/248 (15%) Query: 5 YSVSWKKGLAALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQPLRVDANNHTVTMLVQIN 64 Y ++ + L A +L+ A + +Q+ AA K + G S L + V + + Sbjct: 2 YRLALRITLLASLILSFALSASAAKQKPAAVKTDKYG---SDTLTTNGRTREVRVTATV- 57 Query: 65 GRFLTDDT--------RHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMD 116 + + + + KDG +F A +A+K G + M Sbjct: 58 VKDCSQPSVCDWGRRFQGFFGSKDGKMAPFFIFSTEVHRAALDKAIKSVGIKSRRQIPMT 117 Query: 117 NKETT-----------HVTGSKLDISVNWQGAAK--AYSFDEVIVDS---NGKKL----- 155 + ++ G + +SV W+ K + +E+I + +GK++ Sbjct: 118 EVKQRSGLKSTTQMDDYLDGDPILVSVRWKQDGKMVERAMEELIEEKILVDGKEVIKPYT 177 Query: 156 -DMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADN 214 + G A +GC+VC C G++++ + K + ++ N +P Sbjct: 178 PHFVYHGTAE-AINFASGCIVCPSGCNGGVIADNSVP----LKETKNYYRFNWKKMPHPG 232 Query: 215 TLATVTFK 222 T + K Sbjct: 233 TKVEIVLK 240 >UniRef50_B4DBK3 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBK3_9BACT Length = 243 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 71/191 (37%), Gaps = 19/191 (9%) Query: 48 LRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGG 107 +R+D TVT +N + D ++ + N H+SL + TP + A+ G Sbjct: 46 IRLDQKARTVTFPGVLN---MNDGNLEYLIVTEQGNTHESLLVSDVTPSDLHFAMLLLGA 102 Query: 108 TP----GENMTMDNKETTH------VTGSKLDISVNWQ--GAAKAYSFDEVIVDSNGKKL 155 ++ ++ + + G +DI+V+W+ G K+ ++ + ++ KK Sbjct: 103 KGSGSQSGDLPPSQIDSKYLKTAPPLKGDDIDITVHWKAGGTEKSAPVEDWLFNTETKKQ 162 Query: 156 DMRFGGNLTAAEEKKTGCLVC-LDSCPVGIVSNATYTYGAVEKRGEVK--FKGNASVLPA 212 R G + G + ++ +V+ K + + N +P Sbjct: 163 VTR-GPWIYNGSTFNEGHFLAQIEGAHAALVTYPAALINNPRKGNDNDQIWAVNTKAVPP 221 Query: 213 DNTLATVTFKI 223 T +T + Sbjct: 222 VKTPVEITLTL 232 >UniRef50_D2R883 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R883_9PLAN Length = 289 Score = 61.3 bits (147), Expect = 2e-08, Method: Composition-based stats. 
Identities = 31/196 (15%), Positives = 62/196 (31%), Gaps = 25/196 (12%) Query: 42 LSNSQPLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEA 101 LS + + +D V + ++ R + G+ H+S+ P+ + Sbjct: 72 LSKTHDVWLDKKRRAVIIDGEVCLR---EGQLEMFACPKGTKEHESVISLNCIPETVHAG 128 Query: 102 LKEAGGTPGENMTMDNKETTHV--TGSKLDISVNWQ---GAAKAYSFDEVIVDSNGKK-- 154 L AG G T + +V G +DI + W+ G + I + +K Sbjct: 129 LLAAGAKSG---TPVRFDPEYVAAKGDIIDIYILWKDAQGERHQVKAQQWIKHAKTQKAM 185 Query: 155 -LDMRFGG--------NLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKG 205 D F G + G +C+ + P + + + ++ F Sbjct: 186 AFDWVFAGSGFWKDEETGKQHYQANGGDFICVSNFPTATLD---LPVESSQANTDLLFTA 242 Query: 206 NASVLPADNTLATVTF 221 +P T + Sbjct: 243 FTENIPPKGTKVRLVL 258 >UniRef50_A4A001 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A001_9PLAN Length = 299 Score = 58.2 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 71/230 (30%), Gaps = 29/230 (12%) Query: 14 AALCLLAVAGLSGCDQQENAAAKVEYDGLSNSQ--------PLRVDANNHTVTMLVQING 65 A L +G+ + A E SQ P+ D V +I Sbjct: 6 IAAICLVFSGIVSVAGAQTDAPPAEKKTPEKSQVRKLSDQHPIWFDRGRKMVIADGEICL 65 Query: 66 RFLTDDTRHGIVFKDGSNGHKSLFMGYATPKAFYEALKEAGGTPGENMTMDNKETTHVTG 125 R T G+ H+S+ + L G G + E TG Sbjct: 66 R---KGTLEMFACLQGTKEHESIVSLPVKAMMVHAGLIAIGAKQGTPVKFS-PEFKPATG 121 Query: 126 SKLDISVNWQ---GAAKAYSFDEVIVDS---NGKKLDMRFGGNL--------TAAEEKKT 171 ++ I V W+ G + + + + S + + F G+ + Sbjct: 122 EEIAIYVQWKDDQGKTQIANARDWVRKSGTDDTLDTNWVFAGSYLWTDERTGEKVYTAEG 181 Query: 172 GCLVCLDSCPVGIVSNATYTYGAVEKRGEVKFKGNASVLPADNTLATVTF 221 G L+C+ + + + + ++ + F+ +PA+ T V F Sbjct: 182 GDLICVSNF---MTATLDLPIESSQENATLNFEAFTDRIPAEGTKVRVFF 228 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.126 0.305 Lambda K H 0.267 0.0388 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,035,077,112 Number of Sequences: 3077464 
Number of extensions: 34851980
Number of successful extensions: 76026
Number of sequences better than 1.0e-01: 14
Number of HSP's better than 0.1 without gapping: 26
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 75957
Number of HSP's gapped (non-prelim): 33
length of query: 225
length of database: 1,040,396,356
effective HSP length: 124
effective length of query: 101
effective length of database: 658,790,820
effective search space: 66537872820
effective search space used: 66537872820
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 90 (39.4 bits)
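
The figures in the statistics footer are related by simple arithmetic, and the sketch below works through it in Python. It is an illustration under stated assumptions, not the program's own code: the function names (effective_search_space, bit_score, expect) are hypothetical, and treating the first "Lambda K H" row as the ungapped parameters and the second as the gapped ones is an assumption that happens to agree with the S1 and S2 lines. Lambda and K are printed to only three significant figures, so recomputed bit scores and E-values are approximate.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 225
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 124

# (Lambda, K) from the two "Lambda K H" rows.
# Assumption: first row is ungapped, second row is gapped.
UNGAPPED = (0.307, 0.126)
GAPPED = (0.267, 0.0388)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Shorten the query and every database sequence by the effective
    HSP length, then multiply the two effective lengths."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective search space:", space)
    print("S2 = 90 raw, in bits:", round(bit_score(90, *GAPPED), 1))
    print("S1 = 42 raw, in bits:", round(bit_score(42, *UNGAPPED), 1))
    top_bits = bit_score(1204, *GAPPED)  # raw score of the top round-1 hit
    print("top hit bits:", round(top_bits), "E about", expect(top_bits, space))
```

With the footer values, the search space comes out to 66537872820, matching the "effective search space" line, and the S1 and S2 raw scores convert to roughly 21.6 and 39.4 bits as printed. The E-value recomputed for the top hit lands near the reported order of magnitude but should not be expected to match exactly, since the report applies compositional adjustments and the parameters here are rounded.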