BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (153 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q9JMS1 Uncharacterized protein yuaT n=8 Tax=root RepID=... 323 9e-88 UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escheri... 222 4e-57 UniRef50_Q08JM6 Putative uncharacterized protein orf42 n=1 Tax=E... 123 2e-27 >UniRef50_Q9JMS1 Uncharacterized protein yuaT n=8 Tax=root RepID=YUAT_ECOLI Length = 153 Score = 323 bits (829), Expect = 9e-88, Method: Compositional matrix adjust. Identities = 153/153 (100%), Positives = 153/153 (100%) Query: 1 MFEDPEIRIHKLQPCTQCPRLEDLWFSWYFRYIHDRCLRSSACHQDIHGASLYAPLALKQ 60 MFEDPEIRIHKLQPCTQCPRLEDLWFSWYFRYIHDRCLRSSACHQDIHGASLYAPLALKQ Sbjct: 1 MFEDPEIRIHKLQPCTQCPRLEDLWFSWYFRYIHDRCLRSSACHQDIHGASLYAPLALKQ 60 Query: 61 TYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWICNHSEQVR 120 TYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWICNHSEQVR Sbjct: 61 TYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWICNHSEQVR 120 Query: 121 PEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 153 PEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ Sbjct: 121 PEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 153 >UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C8CGL3_ECOLX Length = 162 Score = 222 bits (565), Expect = 4e-57, Method: Compositional matrix adjust. Identities = 105/114 (92%), Positives = 110/114 (96%) Query: 40 SSACHQDIHGASLYAPLALKQTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVT 99 S A QDIHGASL+APLAL+QTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKN ARRRPVT Sbjct: 45 SDASCQDIHGASLHAPLALRQTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNSARRRPVT 104 Query: 100 GRSPRTGEEAQWICNHSEQVRPEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 153 GRSPRTGEEAQ+IC++SEQVRPEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ Sbjct: 105 GRSPRTGEEAQYICSYSEQVRPEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 158 >UniRef50_Q08JM6 Putative uncharacterized protein orf42 n=1 Tax=Escherichia coli RepID=Q08JM6_ECOLX Length = 100 Score = 123 bits (308), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 60/72 (83%), Positives = 63/72 (87%) Query: 53 YAPLALKQTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWI 112 Y LALK TYEVYFGYD TLRKY RFRQKLPVKTDF+KN ARRRPVTGRSPRTGEEAQ I Sbjct: 28 YQSLALKHTYEVYFGYDRTLRKYCRFRQKLPVKTDFKKNSARRRPVTGRSPRTGEEAQCI 87 Query: 113 CNHSEQVRPEAF 124 C++ EQVRPEA Sbjct: 88 CSYREQVRPEAL 99 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9JMS1 Uncharacterized protein yuaT n=8 Tax=root RepID=... 309 2e-83 UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escheri... 232 3e-60 UniRef50_Q08JM6 Putative uncharacterized protein orf42 n=1 Tax=E... 138 4e-32 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q9JMS1 Uncharacterized protein yuaT n=8 Tax=root RepID=YUAT_ECOLI Length = 153 Score = 309 bits (792), Expect = 2e-83, Method: Composition-based stats. Identities = 153/153 (100%), Positives = 153/153 (100%) Query: 1 MFEDPEIRIHKLQPCTQCPRLEDLWFSWYFRYIHDRCLRSSACHQDIHGASLYAPLALKQ 60 MFEDPEIRIHKLQPCTQCPRLEDLWFSWYFRYIHDRCLRSSACHQDIHGASLYAPLALKQ Sbjct: 1 MFEDPEIRIHKLQPCTQCPRLEDLWFSWYFRYIHDRCLRSSACHQDIHGASLYAPLALKQ 60 Query: 61 TYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWICNHSEQVR 120 TYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWICNHSEQVR Sbjct: 61 TYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWICNHSEQVR 120 Query: 121 PEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 153 PEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ Sbjct: 121 PEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 153 >UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C8CGL3_ECOLX Length = 162 Score = 232 bits (591), Expect = 3e-60, Method: Composition-based stats. Identities = 105/114 (92%), Positives = 110/114 (96%) Query: 40 SSACHQDIHGASLYAPLALKQTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVT 99 S A QDIHGASL+APLAL+QTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKN ARRRPVT Sbjct: 45 SDASCQDIHGASLHAPLALRQTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNSARRRPVT 104 Query: 100 GRSPRTGEEAQWICNHSEQVRPEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 153 GRSPRTGEEAQ+IC++SEQVRPEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ Sbjct: 105 GRSPRTGEEAQYICSYSEQVRPEAFIIGSVCCFSGMIYPHIQRPDPVNRSMNDQ 158 >UniRef50_Q08JM6 Putative uncharacterized protein orf42 n=1 Tax=Escherichia coli RepID=Q08JM6_ECOLX Length = 100 Score = 138 bits (349), Expect = 4e-32, Method: Composition-based stats. Identities = 60/72 (83%), Positives = 63/72 (87%) Query: 53 YAPLALKQTYEVYFGYDCTLRKYGRFRQKLPVKTDFRKNGARRRPVTGRSPRTGEEAQWI 112 Y LALK TYEVYFGYD TLRKY RFRQKLPVKTDF+KN ARRRPVTGRSPRTGEEAQ I Sbjct: 28 YQSLALKHTYEVYFGYDRTLRKYCRFRQKLPVKTDFKKNSARRRPVTGRSPRTGEEAQCI 87 Query: 113 CNHSEQVRPEAF 124 C++ EQVRPEA Sbjct: 88 CSYREQVRPEAL 99 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.318 0.143 0.478 Lambda K H 0.267 0.0441 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 760,627,020 Number of Sequences: 3077464 Number of extensions: 32911184 Number of successful extensions: 74997 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 74991 Number of HSP's gapped (non-prelim): 6 length of query: 153 length of database: 1,040,396,356 effective HSP length: 115 effective length of query: 38 effective length of database: 686,487,996 effective search space: 26086543848 effective search space used: 26086543848 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.2 bits) S2: 87 (38.0 bits)