BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (105 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q9JMR2 Uncharacterized protein yubN n=29 Tax=Enterobact... 213 2e-54 UniRef50_Q6SJ33 YehA n=4 Tax=root RepID=Q6SJ33_ECOLX 113 2e-24 UniRef50_Q849U8 Putative uncharacterized protein orf63 n=1 Tax=E... 79 5e-14 UniRef50_Q0E843 W0062 n=10 Tax=root RepID=Q0E843_ECOLX 75 8e-13 UniRef50_B7UTF8 Conserved predicted protein n=1 Tax=Escherichia ... 59 4e-08 UniRef50_Q849V7 Putative uncharacterized protein orf57 n=2 Tax=E... 57 2e-07 UniRef50_Q3ZU55 Putative uncharacterized protein n=11 Tax=Entero... 43 0.003 >UniRef50_Q9JMR2 Uncharacterized protein yubN n=29 Tax=Enterobacteriaceae RepID=YUBN_ECOLI Length = 105 Score = 213 bits (542), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 105/105 (100%), Positives = 105/105 (100%) Query: 1 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSV 60 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSV Sbjct: 1 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSV 60 Query: 61 HDSRPARGDARKRHCQENNKTDRNQGSSNQSQNRRKKRIIKGKVM 105 HDSRPARGDARKRHCQENNKTDRNQGSSNQSQNRRKKRIIKGKVM Sbjct: 61 HDSRPARGDARKRHCQENNKTDRNQGSSNQSQNRRKKRIIKGKVM 105 >UniRef50_Q6SJ33 YehA n=4 Tax=root RepID=Q6SJ33_ECOLX Length = 125 Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 57/69 (82%), Positives = 59/69 (85%) Query: 14 PVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSVHDSRPARGDARKR 73 P FCRDAGSAGHGAT ARP RSLQ R FASGMEARQGGD+FAGSVHDS P +GDARKR Sbjct: 28 PGVFCRDAGSAGHGATRVARPFRSLQTRVFASGMEARQGGDSFAGSVHDSPPRQGDARKR 87 Query: 74 HCQENNKTD 82 H QENNKTD Sbjct: 88 HKQENNKTD 96 >UniRef50_Q849U8 Putative uncharacterized protein orf63 n=1 Tax=Escherichia coli RepID=Q849U8_ECOLX Length = 74 Score = 78.6 bits (192), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 38/43 (88%), Positives = 38/43 (88%) Query: 1 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGF 43 MKRGLRN VAP IP AFC DA SAGHGATEAARPERSLQARGF Sbjct: 1 MKRGLRNCVAPYIPAAFCWDAWSAGHGATEAARPERSLQARGF 43 >UniRef50_Q0E843 W0062 n=10 Tax=root RepID=Q0E843_ECOLX Length = 147 Score = 74.7 bits (182), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 37/44 (84%), Positives = 39/44 (88%) Query: 42 GFASGMEARQGGDNFAGSVHDSRPARGDARKRHCQENNKTDRNQ 85 GFASGMEARQGGD+ AGSVHDS P +GDARKRH QENNKTD NQ Sbjct: 56 GFASGMEARQGGDSSAGSVHDSPPRQGDARKRHKQENNKTDSNQ 99 >UniRef50_B7UTF8 Conserved predicted protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UTF8_ECO27 Length = 140 Score = 59.3 bits (142), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 29/39 (74%), Positives = 33/39 (84%) Query: 41 RGFASGMEARQGGDNFAGSVHDSRPARGDARKRHCQENN 79 +GFASG+EARQGGD+ AGSVHDS P +GDARK H QE N Sbjct: 82 QGFASGIEARQGGDSSAGSVHDSPPRQGDARKSHKQERN 120 >UniRef50_Q849V7 Putative uncharacterized protein orf57 n=2 Tax=Escherichia coli RepID=Q849V7_ECOLX Length = 114 Score = 56.6 bits (135), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 31/53 (58%), Positives = 34/53 (64%) Query: 16 AFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSVHDSRPARG 68 AFCRDAGSAGHGATEAARPERSLQA G + + +RPARG Sbjct: 31 AFCRDAGSAGHGATEAARPERSLQAMVSRQGWKPARAETAPLAPFTTARPARG 83 >UniRef50_Q3ZU55 Putative uncharacterized protein n=11 Tax=Enterobacteriaceae RepID=Q3ZU55_ECOLX Length = 165 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 22/25 (88%), Positives = 22/25 (88%) Query: 19 RDAGSAGHGATEAARPERSLQARGF 43 R AGSAGHGATEAARPERSLQA F Sbjct: 77 RCAGSAGHGATEAARPERSLQAWFF 101 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9JMR2 Uncharacterized protein yubN n=29 Tax=Enterobact... 153 1e-36 UniRef50_Q6SJ33 YehA n=4 Tax=root RepID=Q6SJ33_ECOLX 100 1e-20 UniRef50_Q849V7 Putative uncharacterized protein orf57 n=2 Tax=E... 79 4e-14 UniRef50_Q849U8 Putative uncharacterized protein orf63 n=1 Tax=E... 73 4e-12 UniRef50_Q0E843 W0062 n=10 Tax=root RepID=Q0E843_ECOLX 64 2e-09 UniRef50_B7UTF8 Conserved predicted protein n=1 Tax=Escherichia ... 62 5e-09 Sequences not found previously or not previously below threshold: UniRef50_Q3ZU55 Putative uncharacterized protein n=11 Tax=Entero... 40 0.028 CONVERGED! >UniRef50_Q9JMR2 Uncharacterized protein yubN n=29 Tax=Enterobacteriaceae RepID=YUBN_ECOLI Length = 105 Score = 153 bits (387), Expect = 1e-36, Method: Composition-based stats. Identities = 105/105 (100%), Positives = 105/105 (100%) Query: 1 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSV 60 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSV Sbjct: 1 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSV 60 Query: 61 HDSRPARGDARKRHCQENNKTDRNQGSSNQSQNRRKKRIIKGKVM 105 HDSRPARGDARKRHCQENNKTDRNQGSSNQSQNRRKKRIIKGKVM Sbjct: 61 HDSRPARGDARKRHCQENNKTDRNQGSSNQSQNRRKKRIIKGKVM 105 >UniRef50_Q6SJ33 YehA n=4 Tax=root RepID=Q6SJ33_ECOLX Length = 125 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 57/69 (82%), Positives = 59/69 (85%) Query: 14 PVAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSVHDSRPARGDARKR 73 P FCRDAGSAGHGAT ARP RSLQ R FASGMEARQGGD+FAGSVHDS P +GDARKR Sbjct: 28 PGVFCRDAGSAGHGATRVARPFRSLQTRVFASGMEARQGGDSFAGSVHDSPPRQGDARKR 87 Query: 74 HCQENNKTD 82 H QENNKTD Sbjct: 88 HKQENNKTD 96 >UniRef50_Q849V7 Putative uncharacterized protein orf57 n=2 Tax=Escherichia coli RepID=Q849V7_ECOLX Length = 114 Score = 79.4 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 32/58 (55%), Positives = 35/58 (60%) Query: 15 VAFCRDAGSAGHGATEAARPERSLQARGFASGMEARQGGDNFAGSVHDSRPARGDARK 72 AFCRDAGSAGHGATEAARPERSLQA G + + +RPARG K Sbjct: 30 GAFCRDAGSAGHGATEAARPERSLQAMVSRQGWKPARAETAPLAPFTTARPARGTPGK 87 >UniRef50_Q849U8 Putative uncharacterized protein orf63 n=1 Tax=Escherichia coli RepID=Q849U8_ECOLX Length = 74 Score = 72.5 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 38/43 (88%), Positives = 38/43 (88%) Query: 1 MKRGLRNGVAPCIPVAFCRDAGSAGHGATEAARPERSLQARGF 43 MKRGLRN VAP IP AFC DA SAGHGATEAARPERSLQARGF Sbjct: 1 MKRGLRNCVAPYIPAAFCWDAWSAGHGATEAARPERSLQARGF 43 >UniRef50_Q0E843 W0062 n=10 Tax=root RepID=Q0E843_ECOLX Length = 147 Score = 63.6 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 37/44 (84%), Positives = 39/44 (88%) Query: 42 GFASGMEARQGGDNFAGSVHDSRPARGDARKRHCQENNKTDRNQ 85 GFASGMEARQGGD+ AGSVHDS P +GDARKRH QENNKTD NQ Sbjct: 56 GFASGMEARQGGDSSAGSVHDSPPRQGDARKRHKQENNKTDSNQ 99 >UniRef50_B7UTF8 Conserved predicted protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UTF8_ECO27 Length = 140 Score = 62.1 bits (149), Expect = 5e-09, Method: Composition-based stats. Identities = 29/39 (74%), Positives = 33/39 (84%) Query: 41 RGFASGMEARQGGDNFAGSVHDSRPARGDARKRHCQENN 79 +GFASG+EARQGGD+ AGSVHDS P +GDARK H QE N Sbjct: 82 QGFASGIEARQGGDSSAGSVHDSPPRQGDARKSHKQERN 120 >UniRef50_Q3ZU55 Putative uncharacterized protein n=11 Tax=Enterobacteriaceae RepID=Q3ZU55_ECOLX Length = 165 Score = 39.8 bits (91), Expect = 0.028, Method: Composition-based stats. Identities = 22/26 (84%), Positives = 22/26 (84%) Query: 19 RDAGSAGHGATEAARPERSLQARGFA 44 R AGSAGHGATEAARPERSLQA F Sbjct: 77 RCAGSAGHGATEAARPERSLQAWFFR 102 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.299 0.125 0.344 Lambda K H 0.267 0.0377 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 379,755,323 Number of Sequences: 3077464 Number of extensions: 13288812 Number of successful extensions: 74965 Number of sequences better than 1.0e-01: 7 Number of HSP's better than 0.1 without gapping: 15 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 74946 Number of HSP's gapped (non-prelim): 16 length of query: 105 length of database: 1,040,396,356 effective HSP length: 73 effective length of query: 32 effective length of database: 815,741,484 effective search space: 26103727488 effective search space used: 26103727488 T: 11 A: 40 X1: 16 ( 6.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.7 bits) S2: 87 (38.2 bits)