BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (102 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75973 Uncharacterized protein ymfJ n=3 Tax=Escherichia... 210 1e-53 UniRef50_Q0T6K0 Hypothetical bacteriophage protein n=5 Tax=Enter... 71 1e-11 UniRef50_B7UQF4 Predicted protein n=3 Tax=Enterobacteriaceae Rep... 66 3e-10 UniRef50_B7MPU9 Putative uncharacterized protein n=1 Tax=Escheri... 51 1e-05 UniRef50_C1MEG4 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 47 2e-04 >UniRef50_P75973 Uncharacterized protein ymfJ n=3 Tax=Escherichia coli RepID=YMFJ_ECOLI Length = 102 Score = 210 bits (535), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 102/102 (100%), Positives = 102/102 (100%) Query: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANT 60 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANT Sbjct: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANT 60 Query: 61 LTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN 102 LTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN Sbjct: 61 LTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN 102 >UniRef50_Q0T6K0 Hypothetical bacteriophage protein n=5 Tax=Enterobacteriaceae RepID=Q0T6K0_SHIF8 Length = 67 Score = 70.9 bits (172), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 34/42 (80%), Positives = 38/42 (90%) Query: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKT 42 MNE LKHVIALLLEDAKRLQQ+EPNAGT ARI LAK+AL++ Sbjct: 1 MNETELKHVIALLLEDAKRLQQLEPNAGTGARIWLAKEALES 42 >UniRef50_B7UQF4 Predicted protein n=3 Tax=Enterobacteriaceae RepID=B7UQF4_ECO27 Length = 133 Score = 66.2 bits (160), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 30/40 (75%), Positives = 37/40 (92%) Query: 2 NEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALK 41 N++ LKH+IALLLEDAKRLQQ+EPNAGTEARI +A ++LK Sbjct: 69 NDEKLKHIIALLLEDAKRLQQLEPNAGTEARIWIAMKSLK 108 >UniRef50_B7MPU9 Putative uncharacterized protein n=1 Tax=Escherichia coli ED1a RepID=B7MPU9_ECO81 Length = 126 Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 23/41 (56%), Positives = 33/41 (80%) Query: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALK 41 MNE LKH+IALLLEDA+++ ++ PN+ T RI +A++ALK Sbjct: 8 MNETELKHIIALLLEDARQVYRLSPNSATLTRIQMAEKALK 48 >UniRef50_C1MEG4 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MEG4_9ENTR Length = 131 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 22/35 (62%), Positives = 29/35 (82%) Query: 7 KHVIALLLEDAKRLQQIEPNAGTEARILLAKQALK 41 KH+IA LLED++R+Q IEPNAGT +RI A+Q L+ Sbjct: 97 KHLIAGLLEDSRRIQMIEPNAGTLSRIAQAEQYLQ 131 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75973 Uncharacterized protein ymfJ n=3 Tax=Escherichia... 187 1e-46 UniRef50_Q0T6K0 Hypothetical bacteriophage protein n=5 Tax=Enter... 77 1e-13 UniRef50_B7UQF4 Predicted protein n=3 Tax=Enterobacteriaceae Rep... 70 3e-11 UniRef50_B7MPU9 Putative uncharacterized protein n=1 Tax=Escheri... 65 5e-10 UniRef50_C1MEG4 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 61 9e-09 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P75973 Uncharacterized protein ymfJ n=3 Tax=Escherichia coli RepID=YMFJ_ECOLI Length = 102 Score = 187 bits (475), Expect = 1e-46, Method: Composition-based stats. Identities = 102/102 (100%), Positives = 102/102 (100%) Query: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANT 60 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANT Sbjct: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANT 60 Query: 61 LTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN 102 LTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN Sbjct: 61 LTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN 102 >UniRef50_Q0T6K0 Hypothetical bacteriophage protein n=5 Tax=Enterobacteriaceae RepID=Q0T6K0_SHIF8 Length = 67 Score = 77.4 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 34/42 (80%), Positives = 38/42 (90%) Query: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKT 42 MNE LKHVIALLLEDAKRLQQ+EPNAGT ARI LAK+AL++ Sbjct: 1 MNETELKHVIALLLEDAKRLQQLEPNAGTGARIWLAKEALES 42 >UniRef50_B7UQF4 Predicted protein n=3 Tax=Enterobacteriaceae RepID=B7UQF4_ECO27 Length = 133 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 30/40 (75%), Positives = 37/40 (92%) Query: 2 NEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALK 41 N++ LKH+IALLLEDAKRLQQ+EPNAGTEARI +A ++LK Sbjct: 69 NDEKLKHIIALLLEDAKRLQQLEPNAGTEARIWIAMKSLK 108 >UniRef50_B7MPU9 Putative uncharacterized protein n=1 Tax=Escherichia coli ED1a RepID=B7MPU9_ECO81 Length = 126 Score = 65.4 bits (158), Expect = 5e-10, Method: Composition-based stats. Identities = 23/41 (56%), Positives = 33/41 (80%) Query: 1 MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALK 41 MNE LKH+IALLLEDA+++ ++ PN+ T RI +A++ALK Sbjct: 8 MNETELKHIIALLLEDARQVYRLSPNSATLTRIQMAEKALK 48 >UniRef50_C1MEG4 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MEG4_9ENTR Length = 131 Score = 61.2 bits (147), Expect = 9e-09, Method: Composition-based stats. Identities = 22/35 (62%), Positives = 29/35 (82%) Query: 7 KHVIALLLEDAKRLQQIEPNAGTEARILLAKQALK 41 KH+IA LLED++R+Q IEPNAGT +RI A+Q L+ Sbjct: 97 KHLIAGLLEDSRRIQMIEPNAGTLSRIAQAEQYLQ 131 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.320 0.136 0.380 Lambda K H 0.267 0.0417 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 345,536,202 Number of Sequences: 3077464 Number of extensions: 10583045 Number of successful extensions: 26422 Number of sequences better than 1.0e-01: 5 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 26412 Number of HSP's gapped (non-prelim): 10 length of query: 102 length of database: 1,040,396,356 effective HSP length: 71 effective length of query: 31 effective length of database: 821,896,412 effective search space: 25478788772 effective search space used: 25478788772 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 87 (38.1 bits)