BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (101 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q9S4X3 Uncharacterized protein yubC n=4 Tax=Escherichia... 199 2e-50 UniRef50_B7JCM2 Putative uncharacterized protein n=3 Tax=Enterob... 130 1e-29 UniRef50_D0Z6S0 Putative uncharacterized protein n=1 Tax=Escheri... 103 1e-21 UniRef50_Q1R1W4 Uncharacterized protein yubC n=19 Tax=root RepID... 102 4e-21 >UniRef50_Q9S4X3 Uncharacterized protein yubC n=4 Tax=Escherichia coli RepID=YUBC_ECOLI Length = 101 Score = 199 bits (506), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTV 60 MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTV Sbjct: 1 MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTV 60 Query: 61 YASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 YASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR Sbjct: 61 YASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 >UniRef50_B7JCM2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B7JCM2_ECOLX Length = 112 Score = 130 bits (327), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 84/87 (96%), Positives = 84/87 (96%) Query: 15 KDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTVYASPSPFSDERPSS 74 KDVAGLLFLLRRLTRRGRNAACGQCPASG ARDGRFYRSRLA VTVYASPSPFSDERPSS Sbjct: 26 KDVAGLLFLLRRLTRRGRNAACGQCPASGRARDGRFYRSRLALVTVYASPSPFSDERPSS 85 Query: 75 RFRGIFSPSKRRRLRYSTVGLTRYRTR 101 RFRGI SPSKRRRLRYSTVGLTRYRTR Sbjct: 86 RFRGISSPSKRRRLRYSTVGLTRYRTR 112 >UniRef50_D0Z6S0 Putative uncharacterized protein n=1 Tax=Escherichia coli ETEC H10407 RepID=D0Z6S0_ECOLX Length = 135 Score = 103 bits (258), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 55/57 (96%), Positives = 55/57 (96%) Query: 45 ARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 ARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGI PSKRRRLRYSTVGLTRYRTR Sbjct: 79 ARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGIALPSKRRRLRYSTVGLTRYRTR 135 >UniRef50_Q1R1W4 Uncharacterized protein yubC n=19 Tax=root RepID=YUBC_ECOUT Length = 103 Score = 102 bits (254), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 54/58 (93%), Positives = 55/58 (94%) Query: 44 WARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 WARDGRFYRSRLASVTVYAS SPFSDERPSSRFRGI PS+RRRLRYSTVGLTRYRTR Sbjct: 46 WARDGRFYRSRLASVTVYASLSPFSDERPSSRFRGIALPSERRRLRYSTVGLTRYRTR 103 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9S4X3 Uncharacterized protein yubC n=4 Tax=Escherichia... 162 3e-39 UniRef50_B7JCM2 Putative uncharacterized protein n=3 Tax=Enterob... 127 2e-28 UniRef50_D0Z6S0 Putative uncharacterized protein n=1 Tax=Escheri... 103 2e-21 UniRef50_Q1R1W4 Uncharacterized protein yubC n=19 Tax=root RepID... 88 1e-16 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q9S4X3 Uncharacterized protein yubC n=4 Tax=Escherichia coli RepID=YUBC_ECOLI Length = 101 Score = 162 bits (410), Expect = 3e-39, Method: Composition-based stats. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTV 60 MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTV Sbjct: 1 MTETGGQPPVSFPVKDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTV 60 Query: 61 YASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 YASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR Sbjct: 61 YASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 >UniRef50_B7JCM2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B7JCM2_ECOLX Length = 112 Score = 127 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 84/87 (96%), Positives = 84/87 (96%) Query: 15 KDVAGLLFLLRRLTRRGRNAACGQCPASGWARDGRFYRSRLASVTVYASPSPFSDERPSS 74 KDVAGLLFLLRRLTRRGRNAACGQCPASG ARDGRFYRSRLA VTVYASPSPFSDERPSS Sbjct: 26 KDVAGLLFLLRRLTRRGRNAACGQCPASGRARDGRFYRSRLALVTVYASPSPFSDERPSS 85 Query: 75 RFRGIFSPSKRRRLRYSTVGLTRYRTR 101 RFRGI SPSKRRRLRYSTVGLTRYRTR Sbjct: 86 RFRGISSPSKRRRLRYSTVGLTRYRTR 112 >UniRef50_D0Z6S0 Putative uncharacterized protein n=1 Tax=Escherichia coli ETEC H10407 RepID=D0Z6S0_ECOLX Length = 135 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 55/58 (94%), Positives = 55/58 (94%) Query: 44 WARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 ARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGI PSKRRRLRYSTVGLTRYRTR Sbjct: 78 RARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGIALPSKRRRLRYSTVGLTRYRTR 135 >UniRef50_Q1R1W4 Uncharacterized protein yubC n=19 Tax=root RepID=YUBC_ECOUT Length = 103 Score = 87.8 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 54/58 (93%), Positives = 55/58 (94%) Query: 44 WARDGRFYRSRLASVTVYASPSPFSDERPSSRFRGIFSPSKRRRLRYSTVGLTRYRTR 101 WARDGRFYRSRLASVTVYAS SPFSDERPSSRFRGI PS+RRRLRYSTVGLTRYRTR Sbjct: 46 WARDGRFYRSRLASVTVYASLSPFSDERPSSRFRGIALPSERRRLRYSTVGLTRYRTR 103 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.139 0.438 Lambda K H 0.267 0.0418 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 414,379,675 Number of Sequences: 3077464 Number of extensions: 15255460 Number of successful extensions: 41436 Number of sequences better than 1.0e-01: 4 Number of HSP's better than 0.1 without gapping: 8 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 41428 Number of HSP's gapped (non-prelim): 8 length of query: 101 length of database: 1,040,396,356 effective HSP length: 70 effective length of query: 31 effective length of database: 824,973,876 effective search space: 25574190156 effective search space used: 25574190156 T: 11 A: 40 X1: 15 ( 6.7 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.8 bits) S2: 87 (38.1 bits)