BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res.
29:2994-3005.

Query= batch____
         (213 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_P76543 Uncharacterized protein yffL n=3 Tax=Escherichia...   444   e-124

>UniRef50_P76543 Uncharacterized protein yffL n=3 Tax=Escherichia coli K-12 RepID=YFFL_ECOLI
          Length = 213

 Score =  444 bits (1143), Expect = e-124, Method: Compositional matrix adjust.
 Identities = 213/213 (100%), Positives = 213/213 (100%)

Query: 1   MFATKDPEFENRINTNKSPRNAATCRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKI 60
           MFATKDPEFENRINTNKSPRNAATCRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKI
Sbjct: 1   MFATKDPEFENRINTNKSPRNAATCRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKI 60

Query: 61  NELRNKAPKNEETKHEEHTPDNHEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLY 120
           NELRNKAPKNEETKHEEHTPDNHEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLY
Sbjct: 61  NELRNKAPKNEETKHEEHTPDNHEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLY 120

Query: 121 ERWEKEHTVNDEAVLRALCYFAGTGKNSQLGWCRVGRGTIDKRARLSKNTVKKCLDRLVN 180
           ERWEKEHTVNDEAVLRALCYFAGTGKNSQLGWCRVGRGTIDKRARLSKNTVKKCLDRLVN
Sbjct: 121 ERWEKEHTVNDEAVLRALCYFAGTGKNSQLGWCRVGRGTIDKRARLSKNTVKKCLDRLVN 180

Query: 181 HFKLVERTEGYIPGSAERECNEYQLLFKPYNMK 213
           HFKLVERTEGYIPGSAERECNEYQLLFKPYNMK
Sbjct: 181 HFKLVERTEGYIPGSAERECNEYQLLFKPYNMK 213


Searching..................................................done

Results from round 2

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
Sequences used in model and found again:

UniRef50_P76543 Uncharacterized protein yffL n=3 Tax=Escherichia...   429   e-119

Sequences not found previously or not previously below threshold:

UniRef50_Q96JM2 Zinc finger protein 462 n=34 Tax=Eukaryota RepID...    41   0.025
UniRef50_Q5T0T3 Zinc finger protein 462 (Fragment) n=12 Tax=Euth...    40   0.048

CONVERGED!
>UniRef50_P76543 Uncharacterized protein yffL n=3 Tax=Escherichia coli K-12 RepID=YFFL_ECOLI
          Length = 213

 Score =  429 bits (1104), Expect = e-119, Method: Composition-based stats.
 Identities = 213/213 (100%), Positives = 213/213 (100%)

Query: 1   MFATKDPEFENRINTNKSPRNAATCRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKI 60
           MFATKDPEFENRINTNKSPRNAATCRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKI
Sbjct: 1   MFATKDPEFENRINTNKSPRNAATCRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKI 60

Query: 61  NELRNKAPKNEETKHEEHTPDNHEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLY 120
           NELRNKAPKNEETKHEEHTPDNHEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLY
Sbjct: 61  NELRNKAPKNEETKHEEHTPDNHEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLY 120

Query: 121 ERWEKEHTVNDEAVLRALCYFAGTGKNSQLGWCRVGRGTIDKRARLSKNTVKKCLDRLVN 180
           ERWEKEHTVNDEAVLRALCYFAGTGKNSQLGWCRVGRGTIDKRARLSKNTVKKCLDRLVN
Sbjct: 121 ERWEKEHTVNDEAVLRALCYFAGTGKNSQLGWCRVGRGTIDKRARLSKNTVKKCLDRLVN 180

Query: 181 HFKLVERTEGYIPGSAERECNEYQLLFKPYNMK 213
           HFKLVERTEGYIPGSAERECNEYQLLFKPYNMK
Sbjct: 181 HFKLVERTEGYIPGSAERECNEYQLLFKPYNMK 213

>UniRef50_Q96JM2 Zinc finger protein 462 n=34 Tax=Eukaryota RepID=ZN462_HUMAN
          Length = 2506

 Score = 41.2 bits (95), Expect = 0.025, Method: Composition-based stats.
 Identities = 33/123 (26%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 25   CRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKINELRNKAPK--NEETKHEEHTPDN 82
            CR +  K  +G     D       ++  ++Q + K NEL+    +    ETKH E   D+
Sbjct: 2287 CRSKLSKYLQGVVFRCDKCTFTCSSDESLQQHIEKHNELKPYKCQLCYYETKHTEEL-DS 2345

Query: 83   HEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLYERWEKEHTVNDEAVLRALCYFA 142
            H   +H  ++  E   R N   LD L ++ E +     +  +KE  +N +A  R L  F+
Sbjct: 2346 HLRDEHKVSRNFELVGRVN---LDQLEQMKEKMESSSSDDEDKEEEMNSKAEDRELMRFS 2402

Query: 143  GTG 145
              G
Sbjct: 2403 DHG 2405

>UniRef50_Q5T0T3 Zinc finger protein 462 (Fragment) n=12 Tax=Eutheria RepID=Q5T0T3_HUMAN
          Length = 931

 Score = 40.1 bits (92), Expect = 0.048, Method: Composition-based stats.
 Identities = 33/123 (26%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 25  CRGRYEKQAKGEFLMSDMLAVEQETNNDVRQFLNKINELRNKAPK--NEETKHEEHTPDN 82
           CR +  K  +G     D       ++  ++Q + K NEL+    +    ETKH E   D+
Sbjct: 712 CRSKLSKYLQGVVFRCDKCTFTCSSDESLQQHIEKHNELKPYKCQLCYYETKHTEEL-DS 770

Query: 83  HEETDHHEAKQQEQAWRGNLRYLDTLNRLDEVLPRKLYERWEKEHTVNDEAVLRALCYFA 142
           H   +H  ++  E   R N   LD L ++ E +     +  +KE  +N +A  R L  F+
Sbjct: 771 HLRDEHKVSRNFELVGRVN---LDQLEQMKEKMESSSSDDEDKEEEMNSKAEDRELMRFS 827

Query: 143 GTG 145
             G
Sbjct: 828 DHG 830

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.315    0.131    0.393

Lambda     K      H
   0.267   0.0409    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 846,799,102
Number of Sequences: 3077464
Number of extensions: 33312697
Number of successful extensions: 145481
Number of sequences better than 1.0e-01: 4
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 145387
Number of HSP's gapped (non-prelim): 66
length of query: 213
length of database: 1,040,396,356
effective HSP length: 123
effective length of query: 90
effective length of database: 661,868,284
effective search space: 59568145560
effective search space used: 59568145560
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 90 (39.3 bits)