BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements",
Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (101 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P59193 Uncharacterized protein yfdT n=13 Tax=root RepID...   165   4e-40
UniRef50_B7NVE5 Putative uncharacterized protein yfdT n=2 Tax=Es...   113   2e-24
UniRef50_C8UKV5 Putative uncharacterized protein n=1 Tax=Escheri...   107   1e-22
UniRef50_B5YSP3 Putative uncharacterized protein n=14 Tax=Escher...   100   1e-20
UniRef50_B0FEA4 Similar to EaA protein of bacteriophage P22 n=9 ...    82   7e-15
UniRef50_Q1R2C5 Putative uncharacterized protein n=2 Tax=root Re...    81   8e-15
UniRef50_C8TW12 Putative uncharacterized protein n=1 Tax=Escheri...    78   9e-14
UniRef50_B3HZ02 Valyl-tRNA synthetase n=16 Tax=root RepID=B3HZ02...    77   2e-13
UniRef50_C9XY18 Putative uncharacterized protein n=1 Tax=Cronoba...    72   6e-12
UniRef50_D0SJQ9 Predicted protein n=1 Tax=Acinetobacter junii SH...    70   2e-11
UniRef50_Q9F558 YcgB protein n=2 Tax=Escherichia coli RepID=Q9F5...    67   1e-10
UniRef50_Q7WLS9 Putative uncharacterized protein n=1 Tax=Bordete...    40   0.020

>UniRef50_P59193 Uncharacterized protein yfdT n=13 Tax=root RepID=YFDT_SHIFL
          Length = 101

 Score = 165 bits (417), Expect = 4e-40, Method: Composition-based stats.
 Identities = 97/101 (96%), Positives = 101/101 (100%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLV 60
           MTTFT+KELIKEIKERISSL+VRDDIERRAYEIAL+SLEVEPDEREAYELFMEKRFGDLV
Sbjct: 1   MTTFTDKELIKEIKERISSLDVRDDIERRAYEIALLSLEVEPDEREAYELFMEKRFGDLV 60

Query: 61  DRRRAKNGDNEYMAWDMTLGWIIWQQRAGIHFSTMSQQEVK 101
           DRRRAKNGDNEYMAWDMTLGWI+WQQRAGIHFSTMSQQEVK
Sbjct: 61  DRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMSQQEVK 101

>UniRef50_B7NVE5 Putative uncharacterized protein yfdT n=2 Tax=Escherichia coli RepID=B7NVE5_ECO7I
          Length = 115

 Score = 113 bits (282), Expect = 2e-24, Method: Composition-based stats.
 Identities = 58/79 (73%), Positives = 63/79 (79%), Gaps = 2/79 (2%)

Query: 21  EVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLG 80
           E+ DD+E     IAL SLEVEPDER AYELFMEKRFG  VDRRRAKNGDNEYMAWDM LG
Sbjct: 36  ELEDDLE--LARIALASLEVEPDERAAYELFMEKRFGKTVDRRRAKNGDNEYMAWDMALG 93

Query: 81  WIIWQQRAGIHFSTMSQQE 99
           W++WQQRAG+  ST   QE
Sbjct: 94  WVVWQQRAGMSLSTAQPQE 112

>UniRef50_C8UKV5 Putative uncharacterized protein n=1 Tax=Escherichia coli O111:H- str. 11128 RepID=C8UKV5_ECO1A
          Length = 244

 Score = 107 bits (266), Expect = 1e-22, Method: Composition-based stats.
 Identities = 45/71 (63%), Positives = 50/71 (70%)

Query: 20  LEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTL 79
           L+   +    A +I L SL    DER AYELFMEKRFG+ VDRRRAKNGD EYMAWDM L
Sbjct: 27  LDEDQNNMLTALKITLASLAAVSDERAAYELFMEKRFGESVDRRRAKNGDREYMAWDMAL 86

Query: 80  GWIIWQQRAGI 90
           GWIIW  RA +
Sbjct: 87  GWIIWCHRAAM 97

>UniRef50_B5YSP3 Putative uncharacterized protein n=14 Tax=Escherichia coli RepID=B5YSP3_ECO5E
          Length = 187

 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats.
 Identities = 46/71 (64%), Positives = 51/71 (71%)

Query: 20  LEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTL 79
           L+   +    A +IAL SL    DER AYELFMEKRFG+ VDRRRAKNGD EYMAWDM L
Sbjct: 27  LDEDQNNMLTALKIALASLASVSDERAAYELFMEKRFGESVDRRRAKNGDREYMAWDMAL 86

Query: 80  GWIIWQQRAGI 90
           GWIIW  RA +
Sbjct: 87  GWIIWCHRAAM 97

>UniRef50_B0FEA4 Similar to EaA protein of bacteriophage P22 n=9 Tax=root RepID=B0FEA4_9CAUD
          Length = 207

 Score = 81.7 bits (200), Expect = 7e-15, Method: Composition-based stats.
 Identities = 33/42 (78%), Positives = 38/42 (90%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+KE+IKEIKERI SL+VRD+IERRAYEIAL +L  EP
Sbjct: 1   MTTFTDKEMIKEIKERIGSLDVRDNIERRAYEIALTALTTEP 42

>UniRef50_Q1R2C5 Putative uncharacterized protein n=2 Tax=root RepID=Q1R2C5_ECOUT
          Length = 118

 Score = 81.3 bits (199), Expect = 8e-15, Method: Composition-based stats.
 Identities = 36/42 (85%), Positives = 39/42 (92%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+KELIKEIKERI SL+VRD+IERRAYEIAL SLE EP
Sbjct: 1   MTTFTDKELIKEIKERIGSLDVRDNIERRAYEIALASLEAEP 42

>UniRef50_C8TW12 Putative uncharacterized protein n=1 Tax=Escherichia coli O26:H11 str. 11368 RepID=C8TW12_ECO26
          Length = 287

 Score = 77.8 bits (190), Expect = 9e-14, Method: Composition-based stats.
 Identities = 34/42 (80%), Positives = 39/42 (92%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+K+LIKEIKERISSL+VRD+IERRAYEIAL +L  EP
Sbjct: 1   MTTFTDKKLIKEIKERISSLDVRDNIERRAYEIALTALTAEP 42

>UniRef50_B3HZ02 Valyl-tRNA synthetase n=16 Tax=root RepID=B3HZ02_ECOLX
          Length = 206

 Score = 77.1 bits (188), Expect = 2e-13, Method: Composition-based stats.
 Identities = 35/42 (83%), Positives = 38/42 (90%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+KELIKEIKERI SL VRD++ERRAYEIAL SLE EP
Sbjct: 1   MTTFTDKELIKEIKERIGSLHVRDNVERRAYEIALASLEAEP 42

>UniRef50_C9XY18 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9XY18_CROTZ
          Length = 90

 Score = 71.7 bits (174), Expect = 6e-12, Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 1/49 (2%)

Query: 45  REAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIWQQ-RAGIHF 92
           RE +E  ++++FGDL+D+R  KN D ++MAWDM + W  WQ+ RA I
Sbjct: 5   REQFEAAIKQKFGDLIDQRVCKNSDGDHMAWDMQVAWWAWQESRAAIEI 53

>UniRef50_D0SJQ9 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJQ9_ACIJU
          Length = 126

 Score = 70.1 bits (170), Expect = 2e-11, Method: Composition-based stats.
 Identities = 20/42 (47%), Positives = 30/42 (71%)

Query: 44  EREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIWQ 85
           EREA+E +M +++ +L+DRR+  N    YMAWDM + W +WQ
Sbjct: 6   EREAFEAYMSEKYKNLMDRRQCLNNGGGYMAWDMNVAWRVWQ 47

>UniRef50_Q9F558 YcgB protein n=2 Tax=Escherichia coli RepID=Q9F558_ECOLX
          Length = 151

 Score = 67.4 bits (163), Expect = 1e-10, Method: Composition-based stats.
 Identities = 34/43 (79%), Positives = 38/43 (88%), Gaps = 2/43 (4%)

Query: 1   MTTFTN--KELIKEIKERISSLEVRDDIERRAYEIALVSLEVE 41
           MTTFT+  KELIKEI+ERI SL+VRD+IERRAYEIAL SLE E
Sbjct: 1   MTTFTDEDKELIKEIRERIGSLDVRDNIERRAYEIALASLEAE 43

>UniRef50_Q7WLS9 Putative uncharacterized protein n=1 Tax=Bordetella bronchiseptica RepID=Q7WLS9_BORBR
          Length = 503

 Score = 40.1 bits (92), Expect = 0.020, Method: Composition-based stats.
 Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 6/66 (9%)

Query: 29  RAYEIALVSLEVEP--DEREAYELF--MEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIW 84
           RA E AL+S    P  DER A+E +  M  ++       R   GD  Y+ W +  GW +W
Sbjct: 35  RAIESALLSKLRAPVADERAAFETWNSMHGQYRRSDAYERLDTGD--YVKWPVEHGWRVW 92

Query: 85  QQRAGI 90
           Q RA +
Sbjct: 93  QARAAL 98

Searching..................................................done

Results from round 2

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value
Sequences used in model and found again:

UniRef50_P59193 Uncharacterized protein yfdT n=13 Tax=root RepID...   161   6e-39
UniRef50_C8UKV5 Putative uncharacterized protein n=1 Tax=Escheri...   120   2e-26
UniRef50_B7NVE5 Putative uncharacterized protein yfdT n=2 Tax=Es...   111   8e-24
UniRef50_B5YSP3 Putative uncharacterized protein n=14 Tax=Escher...   110   2e-23
UniRef50_Q1R2C5 Putative uncharacterized protein n=2 Tax=root Re...    79   3e-14
UniRef50_B0FEA4 Similar to EaA protein of bacteriophage P22 n=9 ...    78   7e-14
UniRef50_C8TW12 Putative uncharacterized protein n=1 Tax=Escheri...    75   5e-13
UniRef50_B3HZ02 Valyl-tRNA synthetase n=16 Tax=root RepID=B3HZ02...    75   8e-13
UniRef50_C9XY18 Putative uncharacterized protein n=1 Tax=Cronoba...    73   2e-12
UniRef50_D0SJQ9 Predicted protein n=1 Tax=Acinetobacter junii SH...    70   2e-11
UniRef50_Q9F558 YcgB protein n=2 Tax=Escherichia coli RepID=Q9F5...    66   5e-10

Sequences not found previously or not previously below threshold:

UniRef50_Q7WLS9 Putative uncharacterized protein n=1 Tax=Bordete...    39   0.034

CONVERGED!

>UniRef50_P59193 Uncharacterized protein yfdT n=13 Tax=root RepID=YFDT_SHIFL
          Length = 101

 Score = 161 bits (407), Expect = 6e-39, Method: Composition-based stats.
 Identities = 97/101 (96%), Positives = 101/101 (100%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLV 60
           MTTFT+KELIKEIKERISSL+VRDDIERRAYEIAL+SLEVEPDEREAYELFMEKRFGDLV
Sbjct: 1   MTTFTDKELIKEIKERISSLDVRDDIERRAYEIALLSLEVEPDEREAYELFMEKRFGDLV 60

Query: 61  DRRRAKNGDNEYMAWDMTLGWIIWQQRAGIHFSTMSQQEVK 101
           DRRRAKNGDNEYMAWDMTLGWI+WQQRAGIHFSTMSQQEVK
Sbjct: 61  DRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMSQQEVK 101

>UniRef50_C8UKV5 Putative uncharacterized protein n=1 Tax=Escherichia coli O111:H- str. 11128 RepID=C8UKV5_ECO1A
          Length = 244

 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats.
 Identities = 45/71 (63%), Positives = 50/71 (70%)

Query: 20  LEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTL 79
           L+   +    A +I L SL    DER AYELFMEKRFG+ VDRRRAKNGD EYMAWDM L
Sbjct: 27  LDEDQNNMLTALKITLASLAAVSDERAAYELFMEKRFGESVDRRRAKNGDREYMAWDMAL 86

Query: 80  GWIIWQQRAGI 90
           GWIIW  RA +
Sbjct: 87  GWIIWCHRAAM 97

>UniRef50_B7NVE5 Putative uncharacterized protein yfdT n=2 Tax=Escherichia coli RepID=B7NVE5_ECO7I
          Length = 115

 Score = 111 bits (277), Expect = 8e-24, Method: Composition-based stats.
 Identities = 58/79 (73%), Positives = 63/79 (79%), Gaps = 2/79 (2%)

Query: 21  EVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLG 80
           E+ DD+E     IAL SLEVEPDER AYELFMEKRFG  VDRRRAKNGDNEYMAWDM LG
Sbjct: 36  ELEDDLEL--ARIALASLEVEPDERAAYELFMEKRFGKTVDRRRAKNGDNEYMAWDMALG 93

Query: 81  WIIWQQRAGIHFSTMSQQE 99
           W++WQQRAG+  ST   QE
Sbjct: 94  WVVWQQRAGMSLSTAQPQE 112

>UniRef50_B5YSP3 Putative uncharacterized protein n=14 Tax=Escherichia coli RepID=B5YSP3_ECO5E
          Length = 187

 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats.
 Identities = 46/71 (64%), Positives = 51/71 (71%)

Query: 20  LEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTL 79
           L+   +    A +IAL SL    DER AYELFMEKRFG+ VDRRRAKNGD EYMAWDM L
Sbjct: 27  LDEDQNNMLTALKIALASLASVSDERAAYELFMEKRFGESVDRRRAKNGDREYMAWDMAL 86

Query: 80  GWIIWQQRAGI 90
           GWIIW  RA +
Sbjct: 87  GWIIWCHRAAM 97

>UniRef50_Q1R2C5 Putative uncharacterized protein n=2 Tax=root RepID=Q1R2C5_ECOUT
          Length = 118

 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats.
 Identities = 36/42 (85%), Positives = 39/42 (92%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+KELIKEIKERI SL+VRD+IERRAYEIAL SLE EP
Sbjct: 1   MTTFTDKELIKEIKERIGSLDVRDNIERRAYEIALASLEAEP 42

>UniRef50_B0FEA4 Similar to EaA protein of bacteriophage P22 n=9 Tax=root RepID=B0FEA4_9CAUD
          Length = 207

 Score = 78.2 bits (191), Expect = 7e-14, Method: Composition-based stats.
 Identities = 33/42 (78%), Positives = 38/42 (90%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+KE+IKEIKERI SL+VRD+IERRAYEIAL +L  EP
Sbjct: 1   MTTFTDKEMIKEIKERIGSLDVRDNIERRAYEIALTALTTEP 42

>UniRef50_C8TW12 Putative uncharacterized protein n=1 Tax=Escherichia coli O26:H11 str. 11368 RepID=C8TW12_ECO26
          Length = 287

 Score = 75.2 bits (183), Expect = 5e-13, Method: Composition-based stats.
 Identities = 34/42 (80%), Positives = 39/42 (92%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+K+LIKEIKERISSL+VRD+IERRAYEIAL +L  EP
Sbjct: 1   MTTFTDKKLIKEIKERISSLDVRDNIERRAYEIALTALTAEP 42

>UniRef50_B3HZ02 Valyl-tRNA synthetase n=16 Tax=root RepID=B3HZ02_ECOLX
          Length = 206

 Score = 74.8 bits (182), Expect = 8e-13, Method: Composition-based stats.
 Identities = 35/42 (83%), Positives = 38/42 (90%)

Query: 1   MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEP 42
           MTTFT+KELIKEIKERI SL VRD++ERRAYEIAL SLE EP
Sbjct: 1   MTTFTDKELIKEIKERIGSLHVRDNVERRAYEIALASLEAEP 42

>UniRef50_C9XY18 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9XY18_CROTZ
          Length = 90

 Score = 73.2 bits (178), Expect = 2e-12, Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 34/49 (69%), Gaps = 1/49 (2%)

Query: 45  REAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIWQQ-RAGIHF 92
           RE +E  ++++FGDL+D+R  KN D ++MAWDM + W  WQ+ RA I
Sbjct: 5   REQFEAAIKQKFGDLIDQRVCKNSDGDHMAWDMQVAWWAWQESRAAIEI 53

>UniRef50_D0SJQ9 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJQ9_ACIJU
          Length = 126

 Score = 70.2 bits (170), Expect = 2e-11, Method: Composition-based stats.
 Identities = 20/48 (41%), Positives = 30/48 (62%)

Query: 44  EREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIWQQRAGIH 91
           EREA+E +M +++ +L+DRR+  N    YMAWDM + W +WQ
Sbjct: 6   EREAFEAYMSEKYKNLMDRRQCLNNGGGYMAWDMNVAWRVWQAAKAQE 53

>UniRef50_Q9F558 YcgB protein n=2 Tax=Escherichia coli RepID=Q9F558_ECOLX
          Length = 151

 Score = 65.5 bits (158), Expect = 5e-10, Method: Composition-based stats.
 Identities = 34/43 (79%), Positives = 38/43 (88%), Gaps = 2/43 (4%)

Query: 1   MTTFTN--KELIKEIKERISSLEVRDDIERRAYEIALVSLEVE 41
           MTTFT+  KELIKEI+ERI SL+VRD+IERRAYEIAL SLE E
Sbjct: 1   MTTFTDEDKELIKEIRERIGSLDVRDNIERRAYEIALASLEAE 43

>UniRef50_Q7WLS9 Putative uncharacterized protein n=1 Tax=Bordetella bronchiseptica RepID=Q7WLS9_BORBR
          Length = 503

 Score = 39.3 bits (90), Expect = 0.034, Method: Composition-based stats.
 Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 6/66 (9%)

Query: 29  RAYEIALVSLEVEP--DEREAYELF--MEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIW 84
           RA E AL+S    P  DER A+E +  M  ++       R   GD  Y+ W +  GW +W
Sbjct: 35  RAIESALLSKLRAPVADERAAFETWNSMHGQYRRSDAYERLDTGD--YVKWPVEHGWRVW 92

Query: 85  QQRAGI 90
           Q RA +
Sbjct: 93  QARAAL 98

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K        H
   0.307    0.128    0.359

Lambda     K        H
   0.267   0.0408    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 371,376,194
Number of Sequences: 3077464
Number of extensions: 11933939
Number of successful extensions: 47764
Number of sequences better than 1.0e-01: 12
Number of HSP's better than 0.1 without gapping: 22
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 47742
Number of HSP's gapped (non-prelim): 24
length of query: 101
length of database: 1,040,396,356
effective HSP length: 70
effective length of query: 31
effective length of database: 824,973,876
effective search space: 25574190156
effective search space used: 25574190156
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 87 (38.1 bits)
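
The footer figures can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the BLAST report. It assumes that the second Lambda/K row holds the gapped Karlin-Altschul parameters, and that the usual relations apply: effective length = length minus effective HSP length (applied once per database sequence), bit score = (lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score). The function names are made up for this example. Because the report uses composition-based statistics and prints lambda and K to only three digits, the E-value is expected to match only approximately.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 101
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 70                      # "effective HSP length"
LAMBDA_GAPPED = 0.267             # second Lambda/K row; assumed to be the gapped parameters
K_GAPPED = 0.0408


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_query, eff_db, space)   # 31 824973876 25574190156

# Raw score 87 is the S2 cutoff; raw score 417 is the top hit in round 1.
s2_bits = bit_score(87, LAMBDA_GAPPED, K_GAPPED)
top_bits = bit_score(417, LAMBDA_GAPPED, K_GAPPED)
top_e = expect_value(top_bits, space)
print(round(s2_bits, 1), round(top_bits), f"{top_e:.0e}")
```

Under these assumptions the three effective-length figures reproduce the footer exactly, the S2 cutoff comes out at 38.1 bits, the top hit at 165 bits, and the top-hit E-value lands in the same order of magnitude as the reported 4e-40.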