BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (104 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P18032 Protein artA n=10 Tax=Enterobacteriaceae RepID=A... 201 5e-51 UniRef50_B7LIS5 Putative uncharacterized protein artA n=1 Tax=Es... 50 2e-05 UniRef50_Q7WUF1 Putative uncharacterized protein artA n=1 Tax=Es... 50 3e-05 >UniRef50_P18032 Protein artA n=10 Tax=Enterobacteriaceae RepID=ARTA_ECOLI Length = 104 Score = 201 bits (512), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 104/104 (100%), Positives = 104/104 (100%) Query: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAAGIVV 60 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAAGIVV Sbjct: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAAGIVV 60 Query: 61 LWLFSIIYIYFCELFRSHWIAVWFIIWSSVINLIILYGFYDRFI 104 LWLFSIIYIYFCELFRSHWIAVWFIIWSSVINLIILYGFYDRFI Sbjct: 61 LWLFSIIYIYFCELFRSHWIAVWFIIWSSVINLIILYGFYDRFI 104 >UniRef50_B7LIS5 Putative uncharacterized protein artA n=1 Tax=Escherichia coli ED1a RepID=B7LIS5_ECO81 Length = 112 Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 36/86 (41%), Positives = 47/86 (54%), Gaps = 9/86 (10%) Query: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAA---- 56 M+ + KEKLEIIRNIIRE+LLGNAAI+ L+ + V ++ IS+L A Sbjct: 1 MKIKQHKEKLEIIRNIIRETLLGNAAIVVLV----SMIIVLCRQEWTFISMLFTGALCLS 56 Query: 57 -GIVVLWLFSIIYIYFCELFRSHWIA 81 V W +II +Y EL S I Sbjct: 57 FVFVFFWFIAIIQVYINELTESKRIK 82 >UniRef50_Q7WUF1 Putative uncharacterized protein artA n=1 Tax=Escherichia coli RepID=Q7WUF1_ECOLX Length = 106 Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 32/91 (35%), Positives = 57/91 (62%), Gaps = 4/91 (4%) Query: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSL--PVNAFPDYLVISLLSIAAGI 58 M+ + KEKLEIIR+IIRE+LLGNAA+I L L P ++ ++ + ++ ++ + Sbjct: 1 MKIKQHKEKLEIIRDIIRETLLGNAALIVLAAMVPVVLVKPWDSM-TFIFMGVVCVSFAV 59 Query: 59 VVLWLFSIIYIYFCELFRSHWIAVW-FIIWS 88 ++ WL I+ +Y L ++ +I ++ F IW+ Sbjct: 60 ILFWLVYIVCLYIGTLVKNIFIRIFAFSIWA 90 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P18032 Protein artA n=10 Tax=Enterobacteriaceae RepID=A... 107 1e-22 UniRef50_Q7WUF1 Putative uncharacterized protein artA n=1 Tax=Es... 73 2e-12 UniRef50_B7LIS5 Putative uncharacterized protein artA n=1 Tax=Es... 68 1e-10 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P18032 Protein artA n=10 Tax=Enterobacteriaceae RepID=ARTA_ECOLI Length = 104 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 104/104 (100%), Positives = 104/104 (100%) Query: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAAGIVV 60 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAAGIVV Sbjct: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIAAGIVV 60 Query: 61 LWLFSIIYIYFCELFRSHWIAVWFIIWSSVINLIILYGFYDRFI 104 LWLFSIIYIYFCELFRSHWIAVWFIIWSSVINLIILYGFYDRFI Sbjct: 61 LWLFSIIYIYFCELFRSHWIAVWFIIWSSVINLIILYGFYDRFI 104 >UniRef50_Q7WUF1 Putative uncharacterized protein artA n=1 Tax=Escherichia coli RepID=Q7WUF1_ECOLX Length = 106 Score = 73.4 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 34/100 (34%), Positives = 60/100 (60%), Gaps = 4/100 (4%) Query: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSL--PVNAFPDYLVISLLSIAAGI 58 M+ + KEKLEIIR+IIRE+LLGNAA+I L L P ++ ++ + ++ ++ + Sbjct: 1 MKIKQHKEKLEIIRDIIRETLLGNAALIVLAAMVPVVLVKPWDSM-TFIFMGVVCVSFAV 59 Query: 59 VVLWLFSIIYIYFCELFRSHWIAVW-FIIWSSVINLIILY 97 ++ WL I+ +Y L ++ +I ++ F IW+ I+Y Sbjct: 60 ILFWLVYIVCLYIGTLVKNIFIRIFAFSIWALSAEGAIIY 99 >UniRef50_B7LIS5 Putative uncharacterized protein artA n=1 Tax=Escherichia coli ED1a RepID=B7LIS5_ECO81 Length = 112 Score = 67.6 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 36/91 (39%), Positives = 49/91 (53%), Gaps = 9/91 (9%) Query: 1 MEKRSFKEKLEIIRNIIRESLLGNAAIIALIYAASHSLPVNAFPDYLVISLLSIA----- 55 M+ + KEKLEIIRNIIRE+LLGNAAI+ L+ + V ++ IS+L Sbjct: 1 MKIKQHKEKLEIIRNIIRETLLGNAAIVVLV----SMIIVLCRQEWTFISMLFTGALCLS 56 Query: 56 AGIVVLWLFSIIYIYFCELFRSHWIAVWFII 86 V W +II +Y EL S I F++ Sbjct: 57 FVFVFFWFIAIIQVYINELTESKRIKWPFVL 87 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.338 0.151 0.448 Lambda K H 0.267 0.0457 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 450,308,120 Number of Sequences: 3077464 Number of extensions: 16793207 Number of successful extensions: 139866 Number of sequences better than 1.0e-01: 7 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 139854 Number of HSP's gapped (non-prelim): 10 length of query: 104 length of database: 1,040,396,356 effective HSP length: 72 effective length of query: 32 effective length of database: 818,818,948 effective search space: 26202206336 effective search space used: 26202206336 T: 11 A: 40 X1: 16 ( 7.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.7 bits) S2: 87 (38.0 bits)