BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (114 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AD22 Uncharacterized protein yejG n=123 Tax=Enterobac... 237 7e-62 UniRef50_Q7N356 Similar to unknown protein yejG of Escherichia c... 134 8e-31 UniRef50_Q2NUF3 Putative uncharacterized protein n=1 Tax=Sodalis... 131 6e-30 UniRef50_C8QDK7 Putative uncharacterized protein n=1 Tax=Pantoea... 110 1e-23 UniRef50_B4ET97 Putative uncharacterized protein n=4 Tax=Enterob... 109 3e-23 UniRef50_B2PZQ3 Putative uncharacterized protein n=3 Tax=Provide... 99 3e-20 UniRef50_D2TYP2 Putative uncharacterized protein n=1 Tax=Arsenop... 97 1e-19 >UniRef50_P0AD22 Uncharacterized protein yejG n=123 Tax=Enterobacteriaceae RepID=YEJG_ECOL6 Length = 114 Score = 237 bits (605), Expect = 7e-62, Method: Compositional matrix adjust. Identities = 114/114 (100%), Positives = 114/114 (100%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK Sbjct: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF Sbjct: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 >UniRef50_Q7N356 Similar to unknown protein yejG of Escherichia coli n=27 Tax=Enterobacteriaceae RepID=Q7N356_PHOLL Length = 114 Score = 134 bits (337), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 60/114 (52%), Positives = 83/114 (72%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 M ++QLS+VHRLPQ+Y+WS GF G KVEP+P + ++SL+ LKLLS + AW+VM K Sbjct: 1 MNNIQLSVVHRLPQSYKWSPGFVGIKVEPLPVDEVNAESSLMGLKLLSHEDGAAWNVMQK 60 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 L Q+L+D+++ +V+E EG+PCLF+N +DE A CRLK G AIAE + PF Sbjct: 61 LQQSLADMQIGSTVIELEGQPCLFINSEDESAVMCRLKTLGAAIAEKITALYPF 114 >UniRef50_Q2NUF3 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NUF3_SODGM Length = 113 Score = 131 bits (330), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 58/107 (54%), Positives = 79/107 (73%), Gaps = 1/107 (0%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 MT+L LS+VHRLP YRW G+ G KVEP+ + DN L+ L LLS D AW VM + Sbjct: 1 MTNLHLSVVHRLPHQYRWLQGYTGVKVEPVIE-ARHDDNVLIGLTLLSHGDDTAWQVMRE 59 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEP 107 L++ L++I+V CS++EC+G+PCLF++R+DE A CR+KN GVA+AEP Sbjct: 60 LARTLAEIDVSCSIVECDGKPCLFLHREDECAGLCRIKNIGVAVAEP 106 >UniRef50_C8QDK7 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QDK7_9ENTR Length = 108 Score = 110 bits (275), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 53/108 (49%), Positives = 67/108 (62%), Gaps = 1/108 (0%) Query: 4 LQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYKLSQ 63 LQLS+VHRLPQ+YRWS G P + D LVAL+LLS DG A ++ +L Sbjct: 2 LQLSVVHRLPQHYRWSVDHVGEIERQEPADAA-NDQDLVALRLLSHDGHPAMEILRQLQD 60 Query: 64 ALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNY 111 L+ I+V C + +CEG+PCL + R DE A C LKN GVAIAE F + Sbjct: 61 TLAQIQVECKIAQCEGQPCLLIQRGDESATNCCLKNQGVAIAETFDGH 108 >UniRef50_B4ET97 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B4ET97_PROMH Length = 115 Score = 109 bits (273), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 52/108 (48%), Positives = 74/108 (68%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 M ++QLS+VH+LP +YRW AGF G++VE +P+N N+L+ LKLLS DG + Sbjct: 1 MENVQLSVVHKLPLSYRWLAGFTGTRVEILPENDASRQNTLIGLKLLSHDGMTLDEAIQN 60 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPF 108 L + L+++ + ++E +G PCLF++ DE AA C LKN GVAIAEPF Sbjct: 61 LQKYLNNLNIENVIIEWDGVPCLFLHSDDECAALCCLKNAGVAIAEPF 108 >UniRef50_B2PZQ3 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2PZQ3_PROST Length = 113 Score = 99.4 bits (246), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 51/106 (48%), Positives = 70/106 (66%), Gaps = 1/106 (0%) Query: 6 LSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYKLSQAL 65 LS+VHRLPQ+YRW +G G+ VE +P + D LV LKLLS D M +LS++L Sbjct: 6 LSVVHRLPQSYRWLSGKIGTDVEVVPTDDFDSDK-LVGLKLLSHDCATDCIAMKQLSESL 64 Query: 66 SDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNY 111 S++++ ++LE + E CLF++ DE AA CRLK GVAIAE S + Sbjct: 65 SELQIESAILEWQKEICLFIHASDELAAMCRLKAAGVAIAELESGF 110 >UniRef50_D2TYP2 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TYP2_9ENTR Length = 117 Score = 97.4 bits (241), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 1/110 (0%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQN-GPCGDNSLVALKLLSPDGDNAWSVMY 59 M + +LS++HRLPQNYRW G AG K+E +P N N L+ K+L D + +V+ Sbjct: 1 MGNFKLSVIHRLPQNYRWLTGHAGKKIEVMPDNISSSKSNILIGFKVLCYDISDNLAVLK 60 Query: 60 KLSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFS 109 LS AL ++++ +LE +G+ CLF+ +DE ATC K GVAIAE + Sbjct: 61 YLSSALDEMQIANDLLEWQGQICLFIRYEDEGLATCHFKTAGVAIAESIT 110 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AD22 Uncharacterized protein yejG n=123 Tax=Enterobac... 196 2e-49 UniRef50_Q7N356 Similar to unknown protein yejG of Escherichia c... 184 9e-46 UniRef50_B4ET97 Putative uncharacterized protein n=4 Tax=Enterob... 177 7e-44 UniRef50_Q2NUF3 Putative uncharacterized protein n=1 Tax=Sodalis... 174 7e-43 UniRef50_D2TYP2 Putative uncharacterized protein n=1 Tax=Arsenop... 166 2e-40 UniRef50_B2PZQ3 Putative uncharacterized protein n=3 Tax=Provide... 163 2e-39 UniRef50_C8QDK7 Putative uncharacterized protein n=1 Tax=Pantoea... 160 1e-38 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P0AD22 Uncharacterized protein yejG n=123 Tax=Enterobacteriaceae RepID=YEJG_ECOL6 Length = 114 Score = 196 bits (499), Expect = 2e-49, Method: Composition-based stats. Identities = 114/114 (100%), Positives = 114/114 (100%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK Sbjct: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF Sbjct: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 >UniRef50_Q7N356 Similar to unknown protein yejG of Escherichia coli n=27 Tax=Enterobacteriaceae RepID=Q7N356_PHOLL Length = 114 Score = 184 bits (466), Expect = 9e-46, Method: Composition-based stats. Identities = 60/114 (52%), Positives = 83/114 (72%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 M ++QLS+VHRLPQ+Y+WS GF G KVEP+P + ++SL+ LKLLS + AW+VM K Sbjct: 1 MNNIQLSVVHRLPQSYKWSPGFVGIKVEPLPVDEVNAESSLMGLKLLSHEDGAAWNVMQK 60 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 L Q+L+D+++ +V+E EG+PCLF+N +DE A CRLK G AIAE + PF Sbjct: 61 LQQSLADMQIGSTVIELEGQPCLFINSEDESAVMCRLKTLGAAIAEKITALYPF 114 >UniRef50_B4ET97 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B4ET97_PROMH Length = 115 Score = 177 bits (450), Expect = 7e-44, Method: Composition-based stats. Identities = 52/114 (45%), Positives = 75/114 (65%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 M ++QLS+VH+LP +YRW AGF G++VE +P+N N+L+ LKLLS DG + Sbjct: 1 MENVQLSVVHKLPLSYRWLAGFTGTRVEILPENDASRQNTLIGLKLLSHDGMTLDEAIQN 60 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF 114 L + L+++ + ++E +G PCLF++ DE AA C LKN GVAIAEPF + Sbjct: 61 LQKYLNNLNIENVIIEWDGVPCLFLHSDDECAALCCLKNAGVAIAEPFPTDYSY 114 >UniRef50_Q2NUF3 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NUF3_SODGM Length = 113 Score = 174 bits (442), Expect = 7e-43, Method: Composition-based stats. Identities = 58/107 (54%), Positives = 79/107 (73%), Gaps = 1/107 (0%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 MT+L LS+VHRLP YRW G+ G KVEP+ + DN L+ L LLS D AW VM + Sbjct: 1 MTNLHLSVVHRLPHQYRWLQGYTGVKVEPVIE-ARHDDNVLIGLTLLSHGDDTAWQVMRE 59 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEP 107 L++ L++I+V CS++EC+G+PCLF++R+DE A CR+KN GVA+AEP Sbjct: 60 LARTLAEIDVSCSIVECDGKPCLFLHREDECAGLCRIKNIGVAVAEP 106 >UniRef50_D2TYP2 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TYP2_9ENTR Length = 117 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 1/110 (0%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQN-GPCGDNSLVALKLLSPDGDNAWSVMY 59 M + +LS++HRLPQNYRW G AG K+E +P N N L+ K+L D + +V+ Sbjct: 1 MGNFKLSVIHRLPQNYRWLTGHAGKKIEVMPDNISSSKSNILIGFKVLCYDISDNLAVLK 60 Query: 60 KLSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFS 109 LS AL ++++ +LE +G+ CLF+ +DE ATC K GVAIAE + Sbjct: 61 YLSSALDEMQIANDLLEWQGQICLFIRYEDEGLATCHFKTAGVAIAESIT 110 >UniRef50_B2PZQ3 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2PZQ3_PROST Length = 113 Score = 163 bits (412), Expect = 2e-39, Method: Composition-based stats. Identities = 52/112 (46%), Positives = 71/112 (63%), Gaps = 1/112 (0%) Query: 1 MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYK 60 M LS+VHRLPQ+YRW +G G+ VE +P + D LV LKLLS D M + Sbjct: 1 MEKSGLSVVHRLPQSYRWLSGKIGTDVEVVPTDDFDSDK-LVGLKLLSHDCATDCIAMKQ 59 Query: 61 LSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNYN 112 LS++LS++++ ++LE + E CLF++ DE AA CRLK GVAIAE S + Sbjct: 60 LSESLSELQIESAILEWQKEICLFIHASDELAAMCRLKAAGVAIAELESGFY 111 >UniRef50_C8QDK7 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QDK7_9ENTR Length = 108 Score = 160 bits (406), Expect = 1e-38, Method: Composition-based stats. Identities = 53/108 (49%), Positives = 67/108 (62%), Gaps = 1/108 (0%) Query: 4 LQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYKLSQ 63 LQLS+VHRLPQ+YRWS G P + D LVAL+LLS DG A ++ +L Sbjct: 2 LQLSVVHRLPQHYRWSVDHVGEIERQEPADAAN-DQDLVALRLLSHDGHPAMEILRQLQD 60 Query: 64 ALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNFGVAIAEPFSNY 111 L+ I+V C + +CEG+PCL + R DE A C LKN GVAIAE F + Sbjct: 61 TLAQIQVECKIAQCEGQPCLLIQRGDESATNCCLKNQGVAIAETFDGH 108 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.138 0.398 Lambda K H 0.267 0.0417 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 468,626,718 Number of Sequences: 3077464 Number of extensions: 16171439 Number of successful extensions: 33736 Number of sequences better than 1.0e-01: 7 Number of HSP's better than 0.1 without gapping: 14 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 33714 Number of HSP's gapped (non-prelim): 14 length of query: 114 length of database: 1,040,396,356 effective HSP length: 81 effective length of query: 33 effective length of database: 791,121,772 effective search space: 26107018476 effective search space used: 26107018476 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 87 (38.1 bits)