BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (63 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P29010 Uncharacterized protein ydfD n=98 Tax=Escherichi... 132 5e-30 UniRef50_A4W6Y3 Putative uncharacterized protein n=2 Tax=Enterob... 54 1e-06 UniRef50_B5XSM4 Putative uncharacterized protein n=1 Tax=Klebsie... 43 0.004 UniRef50_P76275 Uncharacterized protein yebW n=114 Tax=Bacteria ... 41 0.009 >UniRef50_P29010 Uncharacterized protein ydfD n=98 Tax=Escherichia RepID=YDFD_ECOLI Length = 63 Score = 132 bits (331), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP Sbjct: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 Query: 61 AGL 63 AGL Sbjct: 61 AGL 63 >UniRef50_A4W6Y3 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A4W6Y3_ENT38 Length = 65 Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 2/62 (3%) Query: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 M + LVLT+ + +G+ D V ++ Q+C AA+EQ NCYPVD +IH D E P Sbjct: 1 MGALNALVLTITMTNGDYQDAVVGIYDNQQQCEAAASEQMGVTNCYPVDGIIHAD--ETP 58 Query: 61 AG 62 AG Sbjct: 59 AG 60 >UniRef50_B5XSM4 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae 342 RepID=B5XSM4_KLEP3 Length = 64 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 20/61 (32%), Positives = 34/61 (55%) Query: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 MN+ F LV++V ++GE D+ + V+ + C + A EQK+ G C P + + + P Sbjct: 1 MNTLFALVISVCALTGECSDVLIGVYPSEASCNSNADEQKVQGQCLPYRNTQNMADDQQP 60 Query: 61 A 61 A Sbjct: 61 A 61 >UniRef50_P76275 Uncharacterized protein yebW n=114 Tax=Bacteria RepID=YEBW_ECOLI Length = 63 Score = 41.2 bits (95), Expect = 0.009, Method: Compositional matrix adjust. Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%) Query: 5 FVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKI-PGNCYPVDKVIHQDNIEIPA 61 F LVL V + G DI V V+ T Q+C+ + ++Q+I G C+P++ I D PA Sbjct: 2 FALVLFVCYLDGGCEDIVVDVYNTEQQCLYSMSDQRIRQGGCFPIEDFI--DGFWRPA 57 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P29010 Uncharacterized protein ydfD n=98 Tax=Escherichi... 103 1e-21 UniRef50_A4W6Y3 Putative uncharacterized protein n=2 Tax=Enterob... 93 3e-18 Sequences not found previously or not previously below threshold: UniRef50_P76275 Uncharacterized protein yebW n=114 Tax=Bacteria ... 43 0.002 UniRef50_B5XSM4 Putative uncharacterized protein n=1 Tax=Klebsie... 43 0.003 CONVERGED! >UniRef50_P29010 Uncharacterized protein ydfD n=98 Tax=Escherichia RepID=YDFD_ECOLI Length = 63 Score = 103 bits (257), Expect = 1e-21, Method: Composition-based stats. Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP Sbjct: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 Query: 61 AGL 63 AGL Sbjct: 61 AGL 63 >UniRef50_A4W6Y3 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A4W6Y3_ENT38 Length = 65 Score = 92.9 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 2/62 (3%) Query: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 M + LVLT+ + +G+ D V ++ Q+C AA+EQ NCYPVD +IH D E P Sbjct: 1 MGALNALVLTITMTNGDYQDAVVGIYDNQQQCEAAASEQMGVTNCYPVDGIIHAD--ETP 58 Query: 61 AG 62 AG Sbjct: 59 AG 60 >UniRef50_P76275 Uncharacterized protein yebW n=114 Tax=Bacteria RepID=YEBW_ECOLI Length = 63 Score = 43.2 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 3/59 (5%) Query: 4 AFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKI-PGNCYPVDKVIHQDNIEIPA 61 F LVL V + G DI V V+ T Q+C+ + ++Q+I G C+P++ I D PA Sbjct: 1 MFALVLFVCYLDGGCEDIVVDVYNTEQQCLYSMSDQRIRQGGCFPIEDFI--DGFWRPA 57 >UniRef50_B5XSM4 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae 342 RepID=B5XSM4_KLEP3 Length = 64 Score = 42.8 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 34/61 (55%) Query: 1 MNSAFVLVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIP 60 MN+ F LV++V ++GE D+ + V+ + C + A EQK+ G C P + + + P Sbjct: 1 MNTLFALVISVCALTGECSDVLIGVYPSEASCNSNADEQKVQGQCLPYRNTQNMADDQQP 60 Query: 61 A 61 A Sbjct: 61 A 61 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.316 0.131 0.363 Lambda K H 0.267 0.0387 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 210,021,835 Number of Sequences: 3077464 Number of extensions: 5486342 Number of successful extensions: 10683 Number of sequences better than 1.0e-01: 4 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 10677 Number of HSP's gapped (non-prelim): 8 length of query: 63 length of database: 1,040,396,356 effective HSP length: 35 effective length of query: 28 effective length of database: 932,685,116 effective search space: 26115183248 effective search space used: 26115183248 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 87 (38.2 bits)