BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements",
Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (124 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobac...   192   2e-48
UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwards...   172   4e-42
UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterob...   169   3e-41
UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Provide...   166   2e-40
UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterob...   152   4e-36
UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus...   145   3e-34

>UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobacteriaceae RepID=YBAJ_ECO57
          Length = 124

 Score = 192 bits (488), Expect = 2e-48, Method: Composition-based stats.
 Identities = 124/124 (100%), Positives = 124/124 (100%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL
Sbjct: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120
           NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA
Sbjct: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120

Query: 121 SLSC 124
           SLSC
Sbjct: 121 SLSC 124

>UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BCY7_EDWI9
          Length = 122

 Score = 172 bits (435), Expect = 4e-42, Method: Composition-based stats.
 Identities = 59/120 (49%), Positives = 84/120 (70%), Gaps = 1/120 (0%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDE + +HDI++LK+LC+ LYH + L ESNHGWV+DPT+ +NLQLNELIEHIA+ A
Sbjct: 1   MDENTSYQHDISELKYLCDYLYHQGIDVLGESNHGWVSDPTAEVNLQLNELIEHIASIAQ 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120
           ++KIKY + L E +D YLD+T+ LF +Y I+ L++W ++ R+ C + K N A
Sbjct: 61  SFKIKYPRHSDLAEMLDYYLDETYALFGTYSISETALRQWLRTKRRMAYCLAH-EKRNAA 119

>UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B6XA98_9ENTR
          Length = 126

 Score = 169 bits (427), Expect = 3e-41, Method: Composition-based stats.
 Identities = 49/111 (44%), Positives = 75/111 (67%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPK+HDIA+LK+LC +L D +++L+++N W+ND +SA ++ LNEL+EHIA F
Sbjct: 5   MDEYSPKKHDIAELKYLCNSLNRDAISSLQKTNTHWINDLSSAQSISLNELVEHIAAFVW 64

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111
           +KIKY ++N +I ++EYLD+T+ LF S + ++ W L
Sbjct: 65  RFKIKYPKENLVISLVEEYLDETYNLFGSPVVTFSEIIDWESMNQNLVAVL 115

>UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q721_PROST
          Length = 122

 Score = 166 bits (420), Expect = 2e-40, Method: Composition-based stats.
 Identities = 48/111 (43%), Positives = 74/111 (66%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPK +DI++LK+LC +L + + +L+++N WVND +S + +LNELIEHIA F
Sbjct: 1   MDEYSPKNYDISELKYLCNSLNREAMLSLQKTNTHWVNDLSSPQSARLNELIEHIAAFVW 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCF 111
           +KIKY ++N +I ++EYLD+T+ LF S I + ++ W+ L
Sbjct: 61  QFKIKYPKENLVISLVEEYLDETYDLFGSPVITLSEIIDWQSMNQNLVSVL 111

>UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=Q2NV68_SODGM
          Length = 119

 Score = 152 bits (383), Expect = 4e-36, Method: Composition-based stats.
 Identities = 70/115 (60%), Positives = 90/115 (78%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPKR+DIAQLK+LCE LY + +A+L S HGWVNDP+SA+NLQLNELIEHIA +
Sbjct: 1   MDEYSPKRYDIAQLKYLCENLYDEGIASLGNSYHGWVNDPSSAVNLQLNELIEHIAANIV 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNAT 115
           +K+KY+ +++L +Q + +LDDTF LFSSYGIN D+Q+WR+S RLF F
Sbjct: 61  IFKLKYHNESELTDQAETFLDDTFTLFSSYGINNYDIQRWRRSRRRLFGTFSETE 115

>UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EU68_PROMH
          Length = 120

 Score = 145 bits (367), Expect = 3e-34, Method: Composition-based stats.
 Identities = 40/108 (37%), Positives = 67/108 (62%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSP++ D+A+L FLCE L L++L++ W ND +S ++ LN LI+HI F+
Sbjct: 1   MDEYSPEKIDLAELSFLCEELLQQALSSLDKGTAVWNNDLSSTKSVDLNALIDHIMGFSW 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLF 108
           +KIKY + + + ++E +++T+ LF S I+ +L W++ L
Sbjct: 61  LFKIKYPDKHAINTLMEECIEETYRLFGSDSISYSELNNWKELNYSLL 108

Searching..................................................done

Results from round 2

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

Sequences used in model and found again:

UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobac...   195   4e-49
UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwards...   175   3e-43
UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterob...   172   3e-42
UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Provide...   169   2e-41
UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterob...   157   7e-38
UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus...   152   3e-36

Sequences not found previously or not previously below threshold:

CONVERGED!

>UniRef50_P0AAR2 Uncharacterized protein ybaJ n=144 Tax=Enterobacteriaceae RepID=YBAJ_ECO57
          Length = 124

 Score = 195 bits (495), Expect = 4e-49, Method: Composition-based stats.
 Identities = 124/124 (100%), Positives = 124/124 (100%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL
Sbjct: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120
           NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA
Sbjct: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120

Query: 121 SLSC 124
           SLSC
Sbjct: 121 SLSC 124

>UniRef50_C5BCY7 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BCY7_EDWI9
          Length = 122

 Score = 175 bits (444), Expect = 3e-43, Method: Composition-based stats.
 Identities = 59/120 (49%), Positives = 84/120 (70%), Gaps = 1/120 (0%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDE + +HDI++LK+LC+ LYH + L ESNHGWV+DPT+ +NLQLNELIEHIA+ A
Sbjct: 1   MDENTSYQHDISELKYLCDYLYHQGIDVLGESNHGWVSDPTAEVNLQLNELIEHIASIAQ 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPA 120
           ++KIKY + L E +D YLD+T+ LF +Y I+ L++W ++ R+ C + K N A
Sbjct: 61  SFKIKYPRHSDLAEMLDYYLDETYALFGTYSISETALRQWLRTKRRMAYCLAH-EKRNAA 119

>UniRef50_B6XA98 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B6XA98_9ENTR
          Length = 126

 Score = 172 bits (436), Expect = 3e-42, Method: Composition-based stats.
 Identities = 50/116 (43%), Positives = 77/116 (66%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPK+HDIA+LK+LC +L D +++L+++N W+ND +SA ++ LNEL+EHIA F
Sbjct: 5   MDEYSPKKHDIAELKYLCNSLNRDAISSLQKTNTHWINDLSSAQSISLNELVEHIAAFVW 64

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATK 116
           +KIKY ++N +I ++EYLD+T+ LF S + ++ W L + K
Sbjct: 65  RFKIKYPKENLVISLVEEYLDETYNLFGSPVVTFSEIIDWESMNQNLVAVLDDDLK 120

>UniRef50_B2Q721 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q721_PROST
          Length = 122

 Score = 169 bits (428), Expect = 2e-41, Method: Composition-based stats.
 Identities = 49/116 (42%), Positives = 76/116 (65%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPK +DI++LK+LC +L + + +L+++N WVND +S + +LNELIEHIA F
Sbjct: 1   MDEYSPKNYDISELKYLCNSLNREAMLSLQKTNTHWVNDLSSPQSARLNELIEHIAAFVW 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATK 116
           +KIKY ++N +I ++EYLD+T+ LF S I + ++ W+ L + K
Sbjct: 61  QFKIKYPKENLVISLVEEYLDETYDLFGSPVITLSEIIDWQSMNQNLVSVLDDDLK 116

>UniRef50_Q2NV68 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=Q2NV68_SODGM
          Length = 119

 Score = 157 bits (398), Expect = 7e-38, Method: Composition-based stats.
 Identities = 70/115 (60%), Positives = 90/115 (78%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSPKR+DIAQLK+LCE LY + +A+L S HGWVNDP+SA+NLQLNELIEHIA +
Sbjct: 1   MDEYSPKRYDIAQLKYLCENLYDEGIASLGNSYHGWVNDPSSAVNLQLNELIEHIAANIV 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNAT 115
           +K+KY+ +++L +Q + +LDDTF LFSSYGIN D+Q+WR+S RLF F
Sbjct: 61  IFKLKYHNESELTDQAETFLDDTFTLFSSYGINNYDIQRWRRSRRRLFGTFSETE 115

>UniRef50_B4EU68 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EU68_PROMH
          Length = 120

 Score = 152 bits (385), Expect = 3e-36, Method: Composition-based stats.
 Identities = 41/116 (35%), Positives = 68/116 (58%)

Query: 1   MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFAL 60
           MDEYSP++ D+A+L FLCE L L++L++ W ND +S ++ LN LI+HI F+
Sbjct: 1   MDEYSPEKIDLAELSFLCEELLQQALSSLDKGTAVWNNDLSSTKSVDLNALIDHIMGFSW 60

Query: 61  NYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATK 116
           +KIKY + + + ++E +++T+ LF S I+ +L W++ L K
Sbjct: 61  LFKIKYPDKHAINTLMEECIEETYRLFGSDSISYSELNNWKELNYSLLTLIDKNPK 116

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K        H
   0.308    0.131    0.368

Lambda     K        H
   0.267    0.0403   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 413,029,849
Number of Sequences: 3077464
Number of extensions: 11183875
Number of successful extensions: 39831
Number of sequences better than 1.0e-01: 6
Number of HSP's better than 0.1 without gapping: 12
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 39819
Number of HSP's gapped (non-prelim): 12
length of query: 124
length of database: 1,040,396,356
effective HSP length: 90
effective length of query: 34
effective length of database: 763,424,596
effective search space: 25956436264
effective search space used: 25956436264
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 87 (38.1 bits)
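
The length corrections in the statistics footer can be checked by hand. The following is a minimal sketch, assuming the usual edge-effect correction in which the effective HSP length is subtracted once from the query and once per database sequence. The function name `effective_search_space` is illustrative and not part of BLAST. All input values are copied from the footer above.

```python
# Sketch: reproduce the "effective" lengths reported in the footer.
# Assumption: effective lengths are obtained by subtracting the
# effective HSP length from the query once and from the database
# once per sequence. The function name is hypothetical.

def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


# Values copied from the report footer.
eff_query, eff_db, space = effective_search_space(
    query_len=124,
    db_letters=1_040_396_356,
    db_seqs=3_077_464,
    hsp_len=90,
)

print("effective length of query:   ", eff_query)
print("effective length of database:", f"{eff_db:,}")
print("effective search space:      ", space)
```

With these inputs the three computed values agree with the `effective length of query`, `effective length of database`, and `effective search space` lines in the footer.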