BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (71 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76057 Uncharacterized protein ydaQ n=18 Tax=Enterobact... 151 5e-36 UniRef50_C8TPU6 Excisionase n=14 Tax=root RepID=C8TPU6_ECO26 40 0.033 UniRef50_Q6LSL1 Putative uncharacterized protein XIS n=2 Tax=Pho... 39 0.065 >UniRef50_P76057 Uncharacterized protein ydaQ n=18 Tax=Enterobacteriaceae RepID=YDAQ_ECOLI Length = 71 Score = 151 bits (382), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 71/71 (100%), Positives = 71/71 (100%) Query: 1 MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYN 60 MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYN Sbjct: 1 MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYN 60 Query: 61 YPKINQFIKDS 71 YPKINQFIKDS Sbjct: 61 YPKINQFIKDS 71 >UniRef50_C8TPU6 Excisionase n=14 Tax=root RepID=C8TPU6_ECO26 Length = 94 Score = 39.7 bits (91), Expect = 0.033, Method: Compositional matrix adjust. Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 1/66 (1%) Query: 5 IFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKI 64 I EW+ LM TGL I R + W+ G +K+++P G P K Y+ P I Sbjct: 22 INANEWVTLELLMAVTGLRKGTILRARDSAWMNGREYKQIAPDGTP-KKNSECLYHLPTI 80 Query: 65 NQFIKD 70 N +IK+ Sbjct: 81 NTWIKN 86 >UniRef50_Q6LSL1 Putative uncharacterized protein XIS n=2 Tax=Photobacterium profundum RepID=Q6LSL1_PHOPR Length = 85 Score = 38.5 bits (88), Expect = 0.065, Method: Compositional matrix adjust. Identities = 19/61 (31%), Positives = 29/61 (47%) Query: 8 EEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQF 67 +W+ E + GL I YR W +G H+++V G P + I YN IN++ Sbjct: 19 SKWVGENIIQAVYGLTGAAIMNYRLKAWQQGVHYRKVGITGVPSGSKAKILYNIHSINEW 78 Query: 68 I 68 I Sbjct: 79 I 79 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76057 Uncharacterized protein ydaQ n=18 Tax=Enterobact... 152 2e-36 Sequences not found previously or not previously below threshold: UniRef50_C8TPU6 Excisionase n=14 Tax=root RepID=C8TPU6_ECO26 40 0.035 UniRef50_Q6LSL1 Putative uncharacterized protein XIS n=2 Tax=Pho... 39 0.049 CONVERGED! >UniRef50_P76057 Uncharacterized protein ydaQ n=18 Tax=Enterobacteriaceae RepID=YDAQ_ECOLI Length = 71 Score = 152 bits (385), Expect = 2e-36, Method: Composition-based stats. Identities = 71/71 (100%), Positives = 71/71 (100%) Query: 1 MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYN 60 MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYN Sbjct: 1 MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYN 60 Query: 61 YPKINQFIKDS 71 YPKINQFIKDS Sbjct: 61 YPKINQFIKDS 71 >UniRef50_C8TPU6 Excisionase n=14 Tax=root RepID=C8TPU6_ECO26 Length = 94 Score = 39.6 bits (91), Expect = 0.035, Method: Composition-based stats. Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 1/62 (1%) Query: 9 EWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQFI 68 EW+ LM TGL I R + W+ G +K+++P G P K Y+ P IN +I Sbjct: 26 EWVTLELLMAVTGLRKGTILRARDSAWMNGREYKQIAPDGTP-KKNSECLYHLPTINTWI 84 Query: 69 KD 70 K+ Sbjct: 85 KN 86 >UniRef50_Q6LSL1 Putative uncharacterized protein XIS n=2 Tax=Photobacterium profundum RepID=Q6LSL1_PHOPR Length = 85 Score = 38.8 bits (89), Expect = 0.049, Method: Composition-based stats. Identities = 19/60 (31%), Positives = 29/60 (48%) Query: 9 EWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQFI 68 +W+ E + GL I YR W +G H+++V G P + I YN IN++I Sbjct: 20 KWVGENIIQAVYGLTGAAIMNYRLKAWQQGVHYRKVGITGVPSGSKAKILYNIHSINEWI 79 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.324 0.141 0.464 Lambda K H 0.267 0.0439 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 346,113,838 Number of Sequences: 3077464 Number of extensions: 12789457 Number of successful extensions: 23873 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 23869 Number of HSP's gapped (non-prelim): 6 length of query: 71 length of database: 1,040,396,356 effective HSP length: 43 effective length of query: 28 effective length of database: 908,065,404 effective search space: 25425831312 effective search space used: 25425831312 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.5 bits) S2: 87 (38.0 bits)