BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (109 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AB47 Uncharacterized protein ymgD n=47 Tax=Enterobact... 224 9e-58 UniRef50_C1DCL9 Putative uncharacterized protein n=1 Tax=Laribac... 86 5e-16 >UniRef50_P0AB47 Uncharacterized protein ymgD n=47 Tax=Enterobacteriaceae RepID=YMGD_ECOL6 Length = 109 Score = 224 bits (570), Expect = 9e-58, Method: Compositional matrix adjust. Identities = 109/109 (100%), Positives = 109/109 (100%) Query: 1 MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNA 60 MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNA Sbjct: 1 MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNA 60 Query: 61 SVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITAIDDTMRAIGKK 109 SVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITAIDDTMRAIGKK Sbjct: 61 SVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITAIDDTMRAIGKK 109 >UniRef50_C1DCL9 Putative uncharacterized protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCL9_LARHH Length = 113 Score = 85.5 bits (210), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 38/76 (50%), Positives = 55/76 (72%) Query: 31 ETYITCAEANEMAKTDSAQVAEIVAVMGNASVASRDLKIEQSPELSAKVVEKLNQVCAKD 90 E + TC EANEMAK D+A V++++A +GNA+V +RDLKI+ P+L V++ LN C++D Sbjct: 31 EFFTTCTEANEMAKNDTAAVSDLIARLGNAAVEARDLKIKPRPDLDKDVIKHLNAYCSQD 90 Query: 91 PQMLLITAIDDTMRAI 106 P LLIT +D MR + Sbjct: 91 PNGLLITGMDIAMRQV 106 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AB47 Uncharacterized protein ymgD n=47 Tax=Enterobact... 187 8e-47 UniRef50_C1DCL9 Putative uncharacterized protein n=1 Tax=Laribac... 131 5e-30 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P0AB47 Uncharacterized protein ymgD n=47 Tax=Enterobacteriaceae RepID=YMGD_ECOL6 Length = 109 Score = 187 bits (475), Expect = 8e-47, Method: Composition-based stats. Identities = 109/109 (100%), Positives = 109/109 (100%) Query: 1 MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNA 60 MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNA Sbjct: 1 MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNA 60 Query: 61 SVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITAIDDTMRAIGKK 109 SVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITAIDDTMRAIGKK Sbjct: 61 SVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITAIDDTMRAIGKK 109 >UniRef50_C1DCL9 Putative uncharacterized protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCL9_LARHH Length = 113 Score = 131 bits (330), Expect = 5e-30, Method: Composition-based stats. Identities = 38/77 (49%), Positives = 55/77 (71%) Query: 31 ETYITCAEANEMAKTDSAQVAEIVAVMGNASVASRDLKIEQSPELSAKVVEKLNQVCAKD 90 E + TC EANEMAK D+A V++++A +GNA+V +RDLKI+ P+L V++ LN C++D Sbjct: 31 EFFTTCTEANEMAKNDTAAVSDLIARLGNAAVEARDLKIKPRPDLDKDVIKHLNAYCSQD 90 Query: 91 PQMLLITAIDDTMRAIG 107 P LLIT +D MR + Sbjct: 91 PNGLLITGMDIAMRQVA 107 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.317 0.135 0.375 Lambda K H 0.267 0.0412 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 400,615,385 Number of Sequences: 3077464 Number of extensions: 12909937 Number of successful extensions: 37896 Number of sequences better than 1.0e-01: 2 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 37892 Number of HSP's gapped (non-prelim): 4 length of query: 109 length of database: 1,040,396,356 effective HSP length: 77 effective length of query: 32 effective length of database: 803,431,628 effective search space: 25709812096 effective search space used: 25709812096 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 87 (38.1 bits)