BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (174 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_O82900 Uncharacterized protein yubH n=10 Tax=root RepID... 303 1e-81 UniRef50_C8TWA5 Conserved predicted protein n=7 Tax=root RepID=C... 211 7e-54 UniRef50_C7SA25 YgbA-ard n=5 Tax=Enterobacteriaceae RepID=C7SA25... 149 3e-35 UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escheri... 131 9e-30 UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escheri... 129 3e-29 UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escheri... 46 7e-04 >UniRef50_O82900 Uncharacterized protein yubH n=10 Tax=root RepID=YUBH_ECO57 Length = 190 Score = 303 bits (776), Expect = 1e-81, Method: Compositional matrix adjust. Identities = 160/176 (90%), Positives = 161/176 (91%), Gaps = 2/176 (1%) Query: 1 MCCVYRMNRPASGLTVVFCGKLSGKPGPKSAAWRMPWQKSGADD-GGENPRFFSAGPRTE 59 MCCVYRMNRPA GLTVVFCG+ SGKPG KSAAWRMPWQKSG DD GG NPR F AGPRTE Sbjct: 1 MCCVYRMNRPAGGLTVVFCGRWSGKPGTKSAAWRMPWQKSGNDDVGGVNPRLFPAGPRTE 60 Query: 60 HKGSRRRLRFTRPC-AWPCGFSVMWPPCRVRAVPCLHLSRAGGDARVRFAAAVTRSLLPV 118 HKGSRR LRFTRPC AWPCGFSVMWP RVRAVPCLHLSRAG DARVRFAAAVTRSLLPV Sbjct: 61 HKGSRRWLRFTRPCRAWPCGFSVMWPQRRVRAVPCLHLSRAGVDARVRFAAAVTRSLLPV 120 Query: 119 CRDFPVVHPLRFRGLTLQLPSAVCVRLRLPLRPVHPRLIARLLWRHGTARCRGICE 174 CRDFPVV PLRFRGLTLQLPSAVCVRLRLPLRPVHPRLIARL WRHGTARCRGICE Sbjct: 121 CRDFPVVRPLRFRGLTLQLPSAVCVRLRLPLRPVHPRLIARLFWRHGTARCRGICE 176 >UniRef50_C8TWA5 Conserved predicted protein n=7 Tax=root RepID=C8TWA5_ECO26 Length = 158 Score = 211 bits (538), Expect = 7e-54, Method: Compositional matrix adjust. Identities = 113/124 (91%), Positives = 113/124 (91%), Gaps = 2/124 (1%) Query: 35 MPWQKSGADD-GGENPRFFSAGPRTEHKGSRRRLRFTRPC-AWPCGFSVMWPPCRVRAVP 92 MPWQKSG DD GG NPR F AGPRTEHKGSRRRLRFTRPC AWPCGFSVMWPPCRVRAVP Sbjct: 1 MPWQKSGNDDVGGVNPRLFPAGPRTEHKGSRRRLRFTRPCGAWPCGFSVMWPPCRVRAVP 60 Query: 93 CLHLSRAGGDARVRFAAAVTRSLLPVCRDFPVVHPLRFRGLTLQLPSAVCVRLRLPLRPV 152 CLHLSRAG DARVRFAAAVTRSLLPVCRDFPVVHPLRFRGLTLQLP AVCVRLRLPLRP Sbjct: 61 CLHLSRAGVDARVRFAAAVTRSLLPVCRDFPVVHPLRFRGLTLQLPPAVCVRLRLPLRPC 120 Query: 153 HPRL 156 P L Sbjct: 121 IPAL 124 >UniRef50_C7SA25 YgbA-ard n=5 Tax=Enterobacteriaceae RepID=C7SA25_ECOLX Length = 296 Score = 149 bits (377), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 79/89 (88%), Positives = 80/89 (89%), Gaps = 1/89 (1%) Query: 69 FTRPC-AWPCGFSVMWPPCRVRAVPCLHLSRAGGDARVRFAAAVTRSLLPVCRDFPVVHP 127 FTRPC AWPCGFS +WP RVRAVPCLHLSRAG DARVRFAAAVTRSLLPVCRDFPVV P Sbjct: 19 FTRPCGAWPCGFSALWPQRRVRAVPCLHLSRAGRDARVRFAAAVTRSLLPVCRDFPVVRP 78 Query: 128 LRFRGLTLQLPSAVCVRLRLPLRPVHPRL 156 LRFRGLTLQLPSAVCVRLRLPLRP P L Sbjct: 79 LRFRGLTLQLPSAVCVRLRLPLRPCIPAL 107 >UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=B9K6E8_ECOLX Length = 184 Score = 131 bits (330), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 74/100 (74%), Positives = 79/100 (79%), Gaps = 5/100 (5%) Query: 21 KLSGKPGPKSAAWRMPWQKSGADD-GGENPRFFSAGPRTEHKGSRRRLRFTRPC-AWPCG 78 + S + G KSAAWRMPWQKSG DD GGENPR F AGPRTEHKGSRRRLRFTRPC AWPCG Sbjct: 53 EFSWRHGMKSAAWRMPWQKSGNDDDGGENPRLFPAGPRTEHKGSRRRLRFTRPCGAWPCG 112 Query: 79 FSVMWPPCRVRAVPCLHLSRAGGDARVRFAAAVTRSLLPV 118 SVMW RVRAVPCLHLSRAG DAR AV+++ L V Sbjct: 113 GSVMWTSSRVRAVPCLHLSRAGVDAR---GNAVSQTALHV 149 >UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRQ2_ECOSM Length = 137 Score = 129 bits (325), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 69/85 (81%), Positives = 71/85 (83%), Gaps = 2/85 (2%) Query: 21 KLSGKPGPKSAAWRMPWQKSGADD-GGENPRFFSAGPRTEHKGSRRRLRFTRPC-AWPCG 78 + S + G KSAAWRMPWQKSG DD GGENPR F AGPRTEHKGSRRRLRFTRPC AWPCG Sbjct: 38 EFSWRHGMKSAAWRMPWQKSGNDDDGGENPRLFPAGPRTEHKGSRRRLRFTRPCGAWPCG 97 Query: 79 FSVMWPPCRVRAVPCLHLSRAGGDA 103 SVMW RVRAVPCLHLSRAG DA Sbjct: 98 GSVMWTSSRVRAVPCLHLSRAGVDA 122 >UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escherichia coli APEC O1 RepID=Q27TL7_ECOK1 Length = 86 Score = 45.8 bits (107), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 26/40 (65%), Positives = 28/40 (70%), Gaps = 3/40 (7%) Query: 21 KLSGKPGPKSAAWRMPWQKSG-ADDGGENPRFFSAGPRTE 59 + S + G KSAAWRMPWQKSG DDGGENP SA R E Sbjct: 38 EFSWRHGMKSAAWRMPWQKSGNDDDGGENP--ISANLRVE 75 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_O82900 Uncharacterized protein yubH n=10 Tax=root RepID... 282 4e-75 UniRef50_C8TWA5 Conserved predicted protein n=7 Tax=root RepID=C... 202 4e-51 UniRef50_C7SA25 YgbA-ard n=5 Tax=Enterobacteriaceae RepID=C7SA25... 163 2e-39 UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escheri... 149 3e-35 UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escheri... 144 1e-33 UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escheri... 61 2e-08 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_O82900 Uncharacterized protein yubH n=10 Tax=root RepID=YUBH_ECO57 Length = 190 Score = 282 bits (721), Expect = 4e-75, Method: Composition-based stats. Identities = 160/176 (90%), Positives = 161/176 (91%), Gaps = 2/176 (1%) Query: 1 MCCVYRMNRPASGLTVVFCGKLSGKPGPKSAAWRMPWQKSGADD-GGENPRFFSAGPRTE 59 MCCVYRMNRPA GLTVVFCG+ SGKPG KSAAWRMPWQKSG DD GG NPR F AGPRTE Sbjct: 1 MCCVYRMNRPAGGLTVVFCGRWSGKPGTKSAAWRMPWQKSGNDDVGGVNPRLFPAGPRTE 60 Query: 60 HKGSRRRLRFTRPC-AWPCGFSVMWPPCRVRAVPCLHLSRAGGDARVRFAAAVTRSLLPV 118 HKGSRR LRFTRPC AWPCGFSVMWP RVRAVPCLHLSRAG DARVRFAAAVTRSLLPV Sbjct: 61 HKGSRRWLRFTRPCRAWPCGFSVMWPQRRVRAVPCLHLSRAGVDARVRFAAAVTRSLLPV 120 Query: 119 CRDFPVVHPLRFRGLTLQLPSAVCVRLRLPLRPVHPRLIARLLWRHGTARCRGICE 174 CRDFPVV PLRFRGLTLQLPSAVCVRLRLPLRPVHPRLIARL WRHGTARCRGICE Sbjct: 121 CRDFPVVRPLRFRGLTLQLPSAVCVRLRLPLRPVHPRLIARLFWRHGTARCRGICE 176 >UniRef50_C8TWA5 Conserved predicted protein n=7 Tax=root RepID=C8TWA5_ECO26 Length = 158 Score = 202 bits (514), Expect = 4e-51, Method: Composition-based stats. Identities = 113/124 (91%), Positives = 113/124 (91%), Gaps = 2/124 (1%) Query: 35 MPWQKSGADD-GGENPRFFSAGPRTEHKGSRRRLRFTRPC-AWPCGFSVMWPPCRVRAVP 92 MPWQKSG DD GG NPR F AGPRTEHKGSRRRLRFTRPC AWPCGFSVMWPPCRVRAVP Sbjct: 1 MPWQKSGNDDVGGVNPRLFPAGPRTEHKGSRRRLRFTRPCGAWPCGFSVMWPPCRVRAVP 60 Query: 93 CLHLSRAGGDARVRFAAAVTRSLLPVCRDFPVVHPLRFRGLTLQLPSAVCVRLRLPLRPV 152 CLHLSRAG DARVRFAAAVTRSLLPVCRDFPVVHPLRFRGLTLQLP AVCVRLRLPLRP Sbjct: 61 CLHLSRAGVDARVRFAAAVTRSLLPVCRDFPVVHPLRFRGLTLQLPPAVCVRLRLPLRPC 120 Query: 153 HPRL 156 P L Sbjct: 121 IPAL 124 >UniRef50_C7SA25 YgbA-ard n=5 Tax=Enterobacteriaceae RepID=C7SA25_ECOLX Length = 296 Score = 163 bits (414), Expect = 2e-39, Method: Composition-based stats. Identities = 79/89 (88%), Positives = 80/89 (89%), Gaps = 1/89 (1%) Query: 69 FTRPC-AWPCGFSVMWPPCRVRAVPCLHLSRAGGDARVRFAAAVTRSLLPVCRDFPVVHP 127 FTRPC AWPCGFS +WP RVRAVPCLHLSRAG DARVRFAAAVTRSLLPVCRDFPVV P Sbjct: 19 FTRPCGAWPCGFSALWPQRRVRAVPCLHLSRAGRDARVRFAAAVTRSLLPVCRDFPVVRP 78 Query: 128 LRFRGLTLQLPSAVCVRLRLPLRPVHPRL 156 LRFRGLTLQLPSAVCVRLRLPLRP P L Sbjct: 79 LRFRGLTLQLPSAVCVRLRLPLRPCIPAL 107 >UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=B9K6E8_ECOLX Length = 184 Score = 149 bits (377), Expect = 3e-35, Method: Composition-based stats. Identities = 75/105 (71%), Positives = 80/105 (76%), Gaps = 5/105 (4%) Query: 21 KLSGKPGPKSAAWRMPWQKSGADD-GGENPRFFSAGPRTEHKGSRRRLRFTRPC-AWPCG 78 + S + G KSAAWRMPWQKSG DD GGENPR F AGPRTEHKGSRRRLRFTRPC AWPCG Sbjct: 53 EFSWRHGMKSAAWRMPWQKSGNDDDGGENPRLFPAGPRTEHKGSRRRLRFTRPCGAWPCG 112 Query: 79 FSVMWPPCRVRAVPCLHLSRAGGDARVRFAAAVTRSLLPVCRDFP 123 SVMW RVRAVPCLHLSRAG DAR AV+++ L V P Sbjct: 113 GSVMWTSSRVRAVPCLHLSRAGVDAR---GNAVSQTALHVVAAGP 154 >UniRef50_B1LRQ2 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRQ2_ECOSM Length = 137 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 69/85 (81%), Positives = 71/85 (83%), Gaps = 2/85 (2%) Query: 21 KLSGKPGPKSAAWRMPWQKSGADD-GGENPRFFSAGPRTEHKGSRRRLRFTRPC-AWPCG 78 + S + G KSAAWRMPWQKSG DD GGENPR F AGPRTEHKGSRRRLRFTRPC AWPCG Sbjct: 38 EFSWRHGMKSAAWRMPWQKSGNDDDGGENPRLFPAGPRTEHKGSRRRLRFTRPCGAWPCG 97 Query: 79 FSVMWPPCRVRAVPCLHLSRAGGDA 103 SVMW RVRAVPCLHLSRAG DA Sbjct: 98 GSVMWTSSRVRAVPCLHLSRAGVDA 122 >UniRef50_Q27TL7 Putative uncharacterized protein n=1 Tax=Escherichia coli APEC O1 RepID=Q27TL7_ECOK1 Length = 86 Score = 61.1 bits (147), Expect = 2e-08, Method: Composition-based stats. Identities = 26/40 (65%), Positives = 28/40 (70%), Gaps = 3/40 (7%) Query: 21 KLSGKPGPKSAAWRMPWQKSGA-DDGGENPRFFSAGPRTE 59 + S + G KSAAWRMPWQKSG DDGGENP SA R E Sbjct: 38 EFSWRHGMKSAAWRMPWQKSGNDDDGGENP--ISANLRVE 75 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.318 0.144 0.514 Lambda K H 0.267 0.0440 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 883,894,717 Number of Sequences: 3077464 Number of extensions: 42033312 Number of successful extensions: 105450 Number of sequences better than 1.0e-01: 10 Number of HSP's better than 0.1 without gapping: 14 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 105417 Number of HSP's gapped (non-prelim): 17 length of query: 174 length of database: 1,040,396,356 effective HSP length: 120 effective length of query: 54 effective length of database: 671,100,676 effective search space: 36239436504 effective search space used: 36239436504 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.1 bits) S2: 88 (38.4 bits)