BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (77 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n... 157 1e-37 UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae Re... 84 1e-15 UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterob... 60 1e-08 UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterob... 50 3e-05 UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia... 47 2e-04 UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea... 44 0.002 UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escheri... 44 0.002 >UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n=129 Tax=root RepID=YNFO_ECOLI Length = 77 Score = 157 bits (396), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN 60 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN Sbjct: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN 60 Query: 61 KEKSDITELLRKQVRPD 77 KEKSDITELLRKQVRPD Sbjct: 61 KEKSDITELLRKQVRPD 77 >UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae RepID=B7UR36_ECO27 Length = 66 Score = 84.3 bits (207), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 50/66 (75%), Positives = 55/66 (83%), Gaps = 3/66 (4%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEK---RVSS 57 MS+KNRTRRTTTRNIRFPN +IEQINIALE KGSGNFSAWVIEACRRRL ++ R +S Sbjct: 1 MSSKNRTRRTTTRNIRFPNHIIEQINIALEHKGSGNFSAWVIEACRRRLATDATHLRPAS 60 Query: 58 EANKEK 63 N EK Sbjct: 61 MTNNEK 66 >UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C6CCG6_DICDC Length = 64 Score = 60.5 bits (145), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 25/58 (43%), Positives = 43/58 (74%), Gaps = 4/58 (6%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGS----GNFSAWVIEACRRRLCSEKR 54 M T ++ +R+TT+ IRFP+++IE+IN ++ Q+ + NFSAWV++AC ++L + KR Sbjct: 1 MPTGSKNKRSTTKGIRFPHELIEEINASVAQEKTEDPDANFSAWVLDACNQKLAAPKR 58 >UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8AFL0_CITK8 Length = 63 Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 22/48 (45%), Positives = 32/48 (66%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRR 48 MS KN T +T +NIRF +++QI + ++ + NFSAWV EACR + Sbjct: 1 MSIKNFTGKTVIKNIRFSYSLLKQIEQVMAKEKTSNFSAWVKEACREK 48 >UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBY0_ERWPY Length = 60 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 19/40 (47%), Positives = 29/40 (72%), Gaps = 3/40 (7%) Query: 10 TTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL 49 +T ++IRF + ++EQIN+ G GNFS+WV ++CR RL Sbjct: 15 STMKHIRFEDSLLEQINVVA---GQGNFSSWVKDSCRERL 51 >UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QDC1_9ENTR Length = 57 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 19/47 (40%), Positives = 34/47 (72%), Gaps = 3/47 (6%) Query: 3 TKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL 49 T ++TRR ++IRF + +++QI ++ + S NFSAWV++ACR ++ Sbjct: 5 TNSKTRR---KHIRFSHALLDQIEESMNSENSQNFSAWVVDACRLKV 48 >UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escherichia coli O157:H7 str. TW14588 RepID=B6ZUB1_ECO57 Length = 60 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 20/40 (50%), Positives = 28/40 (70%), Gaps = 3/40 (7%) Query: 10 TTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL 49 +T +NIRF + +++QIN E G GNFS+WV ACR +L Sbjct: 15 STKKNIRFEDTLLQQIN---EVAGPGNFSSWVKNACRDKL 51 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n... 111 9e-24 UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae Re... 84 1e-15 UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterob... 74 9e-13 UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterob... 66 5e-10 UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea... 56 2e-07 UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia... 56 3e-07 UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escheri... 54 1e-06 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q2EES0 Uncharacterized protein ynfO from qin prophage n=129 Tax=root RepID=YNFO_ECOLI Length = 77 Score = 111 bits (276), Expect = 9e-24, Method: Composition-based stats. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN 60 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN Sbjct: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN 60 Query: 61 KEKSDITELLRKQVRPD 77 KEKSDITELLRKQVRPD Sbjct: 61 KEKSDITELLRKQVRPD 77 >UniRef50_B7UR36 Predicted protein n=11 Tax=Enterobacteriaceae RepID=B7UR36_ECO27 Length = 66 Score = 84.1 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 50/66 (75%), Positives = 55/66 (83%), Gaps = 3/66 (4%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEK---RVSS 57 MS+KNRTRRTTTRNIRFPN +IEQINIALE KGSGNFSAWVIEACRRRL ++ R +S Sbjct: 1 MSSKNRTRRTTTRNIRFPNHIIEQINIALEHKGSGNFSAWVIEACRRRLATDATHLRPAS 60 Query: 58 EANKEK 63 N EK Sbjct: 61 MTNNEK 66 >UniRef50_C6CCG6 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C6CCG6_DICDC Length = 64 Score = 74.5 bits (181), Expect = 9e-13, Method: Composition-based stats. Identities = 25/58 (43%), Positives = 43/58 (74%), Gaps = 4/58 (6%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGS----GNFSAWVIEACRRRLCSEKR 54 M T ++ +R+TT+ IRFP+++IE+IN ++ Q+ + NFSAWV++AC ++L + KR Sbjct: 1 MPTGSKNKRSTTKGIRFPHELIEEINASVAQEKTEDPDANFSAWVLDACNQKLAAPKR 58 >UniRef50_A8AFL0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8AFL0_CITK8 Length = 63 Score = 65.6 bits (158), Expect = 5e-10, Method: Composition-based stats. Identities = 22/62 (35%), Positives = 34/62 (54%) Query: 1 MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEAN 60 MS KN T +T +NIRF +++QI + ++ + NFSAWV EACR + + Sbjct: 1 MSIKNFTGKTVIKNIRFSYSLLKQIEQVMAKEKTSNFSAWVKEACREKALNSSTPKDVNR 60 Query: 61 KE 62 + Sbjct: 61 NK 62 >UniRef50_C8QDC1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QDC1_9ENTR Length = 57 Score = 56.4 bits (134), Expect = 2e-07, Method: Composition-based stats. Identities = 19/47 (40%), Positives = 34/47 (72%), Gaps = 3/47 (6%) Query: 3 TKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRL 49 T ++TRR ++IRF + +++QI ++ + S NFSAWV++ACR ++ Sbjct: 5 TNSKTRR---KHIRFSHALLDQIEESMNSENSQNFSAWVVDACRLKV 48 >UniRef50_D2TBY0 Putative uncharacterized protein n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBY0_ERWPY Length = 60 Score = 56.0 bits (133), Expect = 3e-07, Method: Composition-based stats. Identities = 20/45 (44%), Positives = 31/45 (68%), Gaps = 3/45 (6%) Query: 8 RRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSE 52 + +T ++IRF + ++EQIN+ G GNFS+WV ++CR RL E Sbjct: 13 KGSTMKHIRFEDSLLEQINVV---AGQGNFSSWVKDSCRERLRKE 54 >UniRef50_B6ZUB1 Putative uncharacterized protein n=1 Tax=Escherichia coli O157:H7 str. TW14588 RepID=B6ZUB1_ECO57 Length = 60 Score = 54.1 bits (128), Expect = 1e-06, Method: Composition-based stats. Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 3/43 (6%) Query: 10 TTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSE 52 +T +NIRF + +++QIN E G GNFS+WV ACR +L E Sbjct: 15 STKKNIRFEDTLLQQIN---EVAGPGNFSSWVKNACRDKLRKE 54 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.300 0.121 0.313 Lambda K H 0.267 0.0377 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 232,014,924 Number of Sequences: 3077464 Number of extensions: 6104224 Number of successful extensions: 18991 Number of sequences better than 1.0e-01: 8 Number of HSP's better than 0.1 without gapping: 9 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 18975 Number of HSP's gapped (non-prelim): 15 length of query: 77 length of database: 1,040,396,356 effective HSP length: 48 effective length of query: 29 effective length of database: 892,678,084 effective search space: 25887664436 effective search space used: 25887664436 T: 11 A: 40 X1: 16 ( 6.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 87 (38.2 bits)