BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (124 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobac... 260 1e-68 UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterob... 137 9e-32 UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwards... 125 5e-28 UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterob... 110 2e-23 UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Provide... 102 3e-21 UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus... 80 2e-14 >UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobacteriaceae RepID=YBAJ_ECO57 Length = 124 Score = 260 bits (664), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 124/124 (100%), Positives = 124/124 (100%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL Sbjct: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA Sbjct: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120 Query: 121 SLSC 124 SLSC Sbjct: 121 SLSC 124 >UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=Q2NV68_SODGM Length = 119 Score = 137 bits (345), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 70/111 (63%), Positives = 90/111 (81%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPKR+DIAQLK+LCE LY + +A+L S HGWVNDP+SA+NLQLNELIEHIA + Sbjct: 1 MDEYSPKRYDIAQLKYLCENLYDEGIASLGNSYHGWVNDPSSAVNLQLNELIEHIAANIV 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111 +K+KY+ +++L +Q + +LDDTF LFSSYGIN D+Q+WR+S RLF F Sbjct: 61 IFKLKYHNESELTDQAETFLDDTFTLFSSYGINNYDIQRWRRSRRRLFGTF 111 >UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BCY7_EDWI9 Length = 122 Score = 125 bits (313), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 59/120 (49%), Positives = 84/120 (70%), Gaps = 1/120 (0%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDE + +HDI++LK+LC+ LYH + L ESNHGWV+DPT+ +NLQLNELIEHIA+ A Sbjct: 1 MDENTSYQHDISELKYLCDYLYHQGIDVLGESNHGWVSDPTAEVNLQLNELIEHIASIAQ 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120 ++KIKY + L E +D YLD+T+ LF +Y I+ L++W ++ R+ C + K N A Sbjct: 61 SFKIKYPRHSDLAEMLDYYLDETYALFGTYSISETALRQWLRTKRRMAYCLAH-EKRNAA 119 >UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B6XA98_9ENTR Length = 126 Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 49/111 (44%), Positives = 75/111 (67%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPK+HDIA+LK+LC +L D +++L+++N W+ND +SA ++ LNEL+EHIA F Sbjct: 5 MDEYSPKKHDIAELKYLCNSLNRDAISSLQKTNTHWINDLSSAQSISLNELVEHIAAFVW 64 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111 +KIKY ++N +I ++EYLD+T+ LF S + ++ W L Sbjct: 65 RFKIKYPKENLVISLVEEYLDETYNLFGSPVVTFSEIIDWESMNQNLVAVL 115 >UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q721_PROST Length = 122 Score = 102 bits (255), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 48/111 (43%), Positives = 74/111 (66%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPK +DI++LK+LC +L + + +L+++N WVND +S + +LNELIEHIA F Sbjct: 1 MDEYSPKNYDISELKYLCNSLNREAMLSLQKTNTHWVNDLSSPQSARLNELIEHIAAFVW 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111 +KIKY ++N +I ++EYLD+T+ LF S I + ++ W+ L Sbjct: 61 QFKIKYPKENLVISLVEEYLDETYDLFGSPVITLSEIIDWQSMNQNLVSVL 111 >UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EU68_PROMH Length = 120 Score = 80.5 bits (197), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 39/102 (38%), Positives = 66/102 (64%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSP++ D+A+L FLCE L L++L++ W ND +S ++ LN LI+HI F+ Sbjct: 1 MDEYSPEKIDLAELSFLCEELLQQALSSLDKGTAVWNNDLSSTKSVDLNALIDHIMGFSW 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRK 102 +KIKY + + + ++E +++T+ LF S I+ +L W++ Sbjct: 61 LFKIKYPDKHAINTLMEECIEETYRLFGSDSISYSELNNWKE 102 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobac... 192 2e-48 UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwards... 172 4e-42 UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterob... 169 3e-41 UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Provide... 166 2e-40 UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterob... 152 4e-36 UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus... 145 3e-34 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobacteriaceae RepID=YBAJ_ECO57 Length = 124 Score = 192 bits (488), Expect = 2e-48, Method: Composition-based stats. Identities = 124/124 (100%), Positives = 124/124 (100%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL Sbjct: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA Sbjct: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120 Query: 121 SLSC 124 SLSC Sbjct: 121 SLSC 124 >UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BCY7_EDWI9 Length = 122 Score = 172 bits (435), Expect = 4e-42, Method: Composition-based stats. Identities = 59/120 (49%), Positives = 84/120 (70%), Gaps = 1/120 (0%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDE + +HDI++LK+LC+ LYH + L ESNHGWV+DPT+ +NLQLNELIEHIA+ A Sbjct: 1 MDENTSYQHDISELKYLCDYLYHQGIDVLGESNHGWVSDPTAEVNLQLNELIEHIASIAQ 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120 ++KIKY + L E +D YLD+T+ LF +Y I+ L++W ++ R+ C + K N A Sbjct: 61 SFKIKYPRHSDLAEMLDYYLDETYALFGTYSISETALRQWLRTKRRMAYCLAH-EKRNAA 119 >UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B6XA98_9ENTR Length = 126 Score = 169 bits (427), Expect = 3e-41, Method: Composition-based stats. Identities = 49/111 (44%), Positives = 75/111 (67%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPK+HDIA+LK+LC +L D +++L+++N W+ND +SA ++ LNEL+EHIA F Sbjct: 5 MDEYSPKKHDIAELKYLCNSLNRDAISSLQKTNTHWINDLSSAQSISLNELVEHIAAFVW 64 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111 +KIKY ++N +I ++EYLD+T+ LF S + ++ W L Sbjct: 65 RFKIKYPKENLVISLVEEYLDETYNLFGSPVVTFSEIIDWESMNQNLVAVL 115 >UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q721_PROST Length = 122 Score = 166 bits (420), Expect = 2e-40, Method: Composition-based stats. Identities = 48/111 (43%), Positives = 74/111 (66%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPK +DI++LK+LC +L + + +L+++N WVND +S + +LNELIEHIA F Sbjct: 1 MDEYSPKNYDISELKYLCNSLNREAMLSLQKTNTHWVNDLSSPQSARLNELIEHIAAFVW 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111 +KIKY ++N +I ++EYLD+T+ LF S I + ++ W+ L Sbjct: 61 QFKIKYPKENLVISLVEEYLDETYDLFGSPVITLSEIIDWQSMNQNLVSVL 111 >UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=Q2NV68_SODGM Length = 119 Score = 152 bits (383), Expect = 4e-36, Method: Composition-based stats. Identities = 70/115 (60%), Positives = 90/115 (78%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSPKR+DIAQLK+LCE LY + +A+L S HGWVNDP+SA+NLQLNELIEHIA + Sbjct: 1 MDEYSPKRYDIAQLKYLCENLYDEGIASLGNSYHGWVNDPSSAVNLQLNELIEHIAANIV 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNAT 115 +K+KY+ +++L +Q + +LDDTF LFSSYGIN D+Q+WR+S RLF F Sbjct: 61 IFKLKYHNESELTDQAETFLDDTFTLFSSYGINNYDIQRWRRSRRRLFGTFSETE 115 >UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EU68_PROMH Length = 120 Score = 145 bits (367), Expect = 3e-34, Method: Composition-based stats. Identities = 40/108 (37%), Positives = 67/108 (62%) Query: 1 MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60 MDEYSP++ D+A+L FLCE L L++L++ W ND +S ++ LN LI+HI F+ Sbjct: 1 MDEYSPEKIDLAELSFLCEELLQQALSSLDKGTAVWNNDLSSTKSVDLNALIDHIMGFSW 60 Query: 61 NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLF 108 +KIKY + + + ++E +++T+ LF S I+ +L W++ L Sbjct: 61 LFKIKYPDKHAINTLMEECIEETYRLFGSDSISYSELNNWKELNYSLL 108 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.132 0.375 Lambda K H 0.267 0.0407 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 458,529,112 Number of Sequences: 3077464 Number of extensions: 14934314 Number of successful extensions: 49652 Number of sequences better than 1.0e-01: 6 Number of HSP's better than 0.1 without gapping: 12 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 49640 Number of HSP's gapped (non-prelim): 12 length of query: 124 length of database: 1,040,396,356 effective HSP length: 90 effective length of query: 34 effective length of database: 763,424,596 effective search space: 25956436264 effective search space used: 25956436264 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 87 (38.1 bits)