BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (64 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n... 95 9e-19 UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae Re... 83 3e-15 UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterob... 61 1e-08 UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterob... 49 4e-05 UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia... 48 1e-04 UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escheri... 45 8e-04 UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea... 44 0.002 >UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n=129 Tax=root RepID=YNFO_ECOLI Length = 77 Score = 94.7 bits (234), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 53/57 (92%), Positives = 54/57 (94%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYT 57 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL SEKR + Sbjct: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSS 57 >UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae RepID=B7UR36_ECO27 Length = 66 Score = 83.2 bits (204), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 45/52 (86%), Positives = 49/52 (94%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSE 52 MS+KNRTRRTTTRNIRFPN +IEQINIALE KGSGNFSAWVIEACRRRL ++ Sbjct: 1 MSSKNRTRRTTTRNIRFPNHIIEQINIALEHKGSGNFSAWVIEACRRRLATD 52 >UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C6CCG6_DICDC Length = 64 Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 25/58 (43%), Positives = 42/58 (72%), Gaps = 4/58 (6%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQK----GSGNFSAWVIEACRRRLTSEKR 54 M T ++ +R+TT+ IRFP+++IE+IN ++ Q+ NFSAWV++AC ++L + KR Sbjct: 1 MPTGSKNKRSTTKGIRFPHELIEEINASVAQEKTEDPDANFSAWVLDACNQKLAAPKR 58 >UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8AFL0_CITK8 Length = 63 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 22/48 (45%), Positives = 32/48 (66%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRR 48 MS KN T +T +NIRF +++QI + ++ + NFSAWV EACR + Sbjct: 1 MSIKNFTGKTVIKNIRFSYSLLKQIEQVMAKEKTSNFSAWVKEACREK 48 >UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBY0_ERWPY Length = 60 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 30/43 (69%), Gaps = 3/43 (6%) Query: 10 TTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSE 52 +T ++IRF + ++EQIN+ G GNFS+WV ++CR RL E Sbjct: 15 STMKHIRFEDSLLEQINVVA---GQGNFSSWVKDSCRERLRKE 54 >UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escherichia coli O157:H7 str. TW14588 RepID=B6ZUB1_ECO57 Length = 60 Score = 45.1 bits (105), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 3/43 (6%) Query: 10 TTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSE 52 +T +NIRF + +++QIN E G GNFS+WV ACR +L E Sbjct: 15 STKKNIRFEDTLLQQIN---EVAGPGNFSSWVKNACRDKLRKE 54 >UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QDC1_9ENTR Length = 57 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 19/47 (40%), Positives = 34/47 (72%), Gaps = 3/47 (6%) Query: 3 TKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL 49 T ++TRR ++IRF + +++QI ++ + S NFSAWV++ACR ++ Sbjct: 5 TNSKTRR---KHIRFSHALLDQIEESMNSENSQNFSAWVVDACRLKV 48 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterob... 75 5e-13 UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n... 74 1e-12 UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae Re... 71 8e-12 UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterob... 62 5e-09 UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia... 58 8e-08 UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escheri... 58 1e-07 UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea... 56 3e-07 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C6CCG6_DICDC Length = 64 Score = 75.3 bits (183), Expect = 5e-13, Method: Composition-based stats. Identities = 25/58 (43%), Positives = 42/58 (72%), Gaps = 4/58 (6%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQK----GSGNFSAWVIEACRRRLTSEKR 54 M T ++ +R+TT+ IRFP+++IE+IN ++ Q+ NFSAWV++AC ++L + KR Sbjct: 1 MPTGSKNKRSTTKGIRFPHELIEEINASVAQEKTEDPDANFSAWVLDACNQKLAAPKR 58 >UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n=129 Tax=root RepID=YNFO_ECOLI Length = 77 Score = 74.5 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 53/63 (84%), Positives = 56/63 (88%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIK 60 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL SEKR + Sbjct: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN 60 Query: 61 SDE 63 ++ Sbjct: 61 KEK 63 >UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae RepID=B7UR36_ECO27 Length = 66 Score = 71.4 bits (173), Expect = 8e-12, Method: Composition-based stats. Identities = 45/52 (86%), Positives = 49/52 (94%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSE 52 MS+KNRTRRTTTRNIRFPN +IEQINIALE KGSGNFSAWVIEACRRRL ++ Sbjct: 1 MSSKNRTRRTTTRNIRFPNHIIEQINIALEHKGSGNFSAWVIEACRRRLATD 52 >UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8AFL0_CITK8 Length = 63 Score = 62.2 bits (149), Expect = 5e-09, Method: Composition-based stats. Identities = 22/48 (45%), Positives = 32/48 (66%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRR 48 MS KN T +T +NIRF +++QI + ++ + NFSAWV EACR + Sbjct: 1 MSIKNFTGKTVIKNIRFSYSLLKQIEQVMAKEKTSNFSAWVKEACREK 48 >UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBY0_ERWPY Length = 60 Score = 58.3 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 20/45 (44%), Positives = 31/45 (68%), Gaps = 3/45 (6%) Query: 8 RRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSE 52 + +T ++IRF + ++EQIN+ G GNFS+WV ++CR RL E Sbjct: 13 KGSTMKHIRFEDSLLEQINVV---AGQGNFSSWVKDSCRERLRKE 54 >UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escherichia coli O157:H7 str. TW14588 RepID=B6ZUB1_ECO57 Length = 60 Score = 57.5 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 3/43 (6%) Query: 10 TTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSE 52 +T +NIRF + +++QIN E G GNFS+WV ACR +L E Sbjct: 15 STKKNIRFEDTLLQQIN---EVAGPGNFSSWVKNACRDKLRKE 54 >UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QDC1_9ENTR Length = 57 Score = 56.0 bits (133), Expect = 3e-07, Method: Composition-based stats. Identities = 19/52 (36%), Positives = 36/52 (69%), Gaps = 3/52 (5%) Query: 3 TKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKR 54 T ++TRR ++IRF + +++QI ++ + S NFSAWV++ACR ++ ++ Sbjct: 5 TNSKTRR---KHIRFSHALLDQIEESMNSENSQNFSAWVVDACRLKVREIQK 53 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.297 0.120 0.308 Lambda K H 0.267 0.0372 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 190,395,381 Number of Sequences: 3077464 Number of extensions: 4495023 Number of successful extensions: 11950 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 11 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 11932 Number of HSP's gapped (non-prelim): 16 length of query: 64 length of database: 1,040,396,356 effective HSP length: 36 effective length of query: 28 effective length of database: 929,607,652 effective search space: 26029014256 effective search space used: 26029014256 T: 11 A: 40 X1: 16 ( 6.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.7 bits) S2: 87 (38.3 bits)