BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (150 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

UniRef50_A4W8W8 UPF0268 protein Ent638_1468 n=156 Tax=Gammaprote...     276   2e-73
UniRef50_Q7N603 UPF0268 protein plu1774 n=9 Tax=Enterobacteriace...     245   3e-64
UniRef50_B2JYT3 UPF0268 protein YPTS_1558 n=46 Tax=Gammaproteoba...     245   3e-64
UniRef50_A5UEY9 UPF0268 protein CGSHiGG_01275 n=27 Tax=Pasteurel...     150   1e-35
UniRef50_A4MZW6 Putative uncharacterized protein n=1 Tax=Haemoph...     100   1e-20
UniRef50_C4NV48 Putative uncharacterized protein n=7 Tax=Gammapr...      43   0.004

>UniRef50_A4W8W8 UPF0268 protein Ent638_1468 n=156 Tax=Gammaproteobacteria RepID=Y1468_ENT38
          Length = 151

 Score = 276 bits (705), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 132/149 (88%), Positives = 144/149 (96%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQQLENLESGWKWKYLVKKHREGELIT YIEASAA+EAVD+LL+LENEPV VN WI+K Sbjct: 1 MKYQQLENLESGWKWKYLVKKHREGELITCYIEASAAKEAVDLLLTLENEPVHVNSWIEK 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 H+NP L+NRMKQTIRARRKRHFNAEHQHTRKKSIDLEF+VWQRLAGLAQRRGKTLSET+V Sbjct: 61 HINPALLNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFMVWQRLAGLAQRRGKTLSETVV 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALLGK 149 QLIEDAE+KEKYAN+MS+LK DLQA+LGK Sbjct: 121 QLIEDAEHKEKYANQMSTLKNDLQAMLGK 149 >UniRef50_Q7N603 UPF0268 protein plu1774 n=9 Tax=Enterobacteriaceae RepID=Y1774_PHOLL Length = 151 Score = 245 bits (626), Expect = 3e-64, Method: Compositional matrix adjust. Identities = 117/149 (78%), Positives = 132/149 (88%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQQLENLE GWKW YL+KKH+EGELIT+YIE SAA AVD L+ LE+EPV V WI++ Sbjct: 1 MKYQQLENLECGWKWTYLMKKHQEGELITKYIENSAAHAAVDKLIELESEPVRVLKWIEQ 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 HMNP+L NRMKQTIRARRKRHFNAEHQHTRKKSIDL+F VW RL+ L+QRRG TLSETI+ Sbjct: 61 HMNPDLSNRMKQTIRARRKRHFNAEHQHTRKKSIDLDFPVWHRLSALSQRRGNTLSETII 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALLGK 149 QLIEDAE KEKYAN+MSSLK DL+A+LGK Sbjct: 121 QLIEDAERKEKYANQMSSLKHDLEAILGK 149 >UniRef50_B2JYT3 UPF0268 protein YPTS_1558 n=46 Tax=Gammaproteobacteria RepID=Y1558_YERPB Length = 151 Score = 245 bits (625), Expect = 3e-64, Method: Compositional matrix adjust. 
Identities = 120/150 (80%), Positives = 132/150 (88%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQQLENLESGWKW YLVKKHREGE ITR+IE SAAQ+AV+ L+ LENEPV V WID Sbjct: 1 MKYQQLENLESGWKWAYLVKKHREGEAITRHIENSAAQDAVEQLMKLENEPVKVQEWIDA 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 HMN L RMKQTIRARRKRHFNAEHQHTRKKSIDLEF+VWQRLA LA+RRG TLS+T+V Sbjct: 61 HMNVNLATRMKQTIRARRKRHFNAEHQHTRKKSIDLEFLVWQRLAVLARRRGNTLSDTVV 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALLGKE 150 QLIEDAE KEKYA++MSSLKQDL+ +L KE Sbjct: 121 QLIEDAERKEKYASQMSSLKQDLKDILDKE 150 >UniRef50_A5UEY9 UPF0268 protein CGSHiGG_01275 n=27 Tax=Pasteurellaceae RepID=Y1275_HAEIG Length = 148 Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 74/147 (50%), Positives = 102/147 (69%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQ+LEN E+ WKW YL++KHREGE ITRY E S + LL +N P + WI Sbjct: 1 MKYQKLENQEANWKWIYLIRKHREGENITRYEERSLQEAKAQELLESQNYPSQIEEWIKN 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 H++P L ++ Q IRARRKR FN E QHT+KKSIDLE+ VW RL+ +++ TLSETI Sbjct: 61 HLSPALPIKLDQAIRARRKRFFNGEKQHTKKKSIDLEYAVWLRLSKYSRKMKMTLSETIT 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALL 147 +I++ E+K ++ N+M+++K L+ LL Sbjct: 121 YMIDERESKAQFENQMAAMKTSLKNLL 147 >UniRef50_A4MZW6 Putative uncharacterized protein n=1 Tax=Haemophilus influenzae 22.1-21 RepID=A4MZW6_HAEIN Length = 110 Score = 100 bits (250), Expect = 1e-20, Method: Compositional matrix adjust. 
Identities = 49/94 (52%), Positives = 63/94 (67%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQ+LEN E+ WKW YL++KHREGE ITRY E S + LL+ +N P + WI Sbjct: 1 MKYQKLENQEANWKWIYLIRKHREGENITRYEERSLQEAKAQELLNAQNYPEKIEEWIKN 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSI 94 H++P L ++ Q IRARRKR FN E QHT+K + Sbjct: 61 HLSPALPIKLDQAIRARRKRFFNGEKQHTKKNPL 94 >UniRef50_C4NV48 Putative uncharacterized protein n=7 Tax=Gammaproteobacteria RepID=C4NV48_ECOLX Length = 133 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 6/81 (7%) Query: 39 EAVDVLLSLENEPVLVNGWIDKHMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEF 98 EAVD L S+ P + WI+K++ E +N++ +IR RR+R + KSI + Sbjct: 44 EAVDSLNSIGRSPSELTEWINKYLTAEQINKLGTSIRQRRRRGYGV------GKSITISD 97 Query: 99 IVWQRLAGLAQRRGKTLSETI 119 + L L++ G +LSE I Sbjct: 98 KAHRILKRLSEVDGCSLSEVI 118 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A4W8W8 UPF0268 protein Ent638_1468 n=156 Tax=Gammaprote... 226 1e-58 UniRef50_Q7N603 UPF0268 protein plu1774 n=9 Tax=Enterobacteriace... 224 6e-58 UniRef50_B2JYT3 UPF0268 protein YPTS_1558 n=46 Tax=Gammaproteoba... 216 2e-55 UniRef50_A5UEY9 UPF0268 protein CGSHiGG_01275 n=27 Tax=Pasteurel... 195 4e-49 UniRef50_A4MZW6 Putative uncharacterized protein n=1 Tax=Haemoph... 140 1e-32 Sequences not found previously or not previously below threshold: UniRef50_C4NV48 Putative uncharacterized protein n=7 Tax=Gammapr... 41 0.010 UniRef50_A0D1C0 Chromosome undetermined scaffold_34, whole genom... 38 0.082 CONVERGED! >UniRef50_A4W8W8 UPF0268 protein Ent638_1468 n=156 Tax=Gammaproteobacteria RepID=Y1468_ENT38 Length = 151 Score = 226 bits (576), Expect = 1e-58, Method: Composition-based stats. 
Identities = 132/150 (88%), Positives = 145/150 (96%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQQLENLESGWKWKYLVKKHREGELIT YIEASAA+EAVD+LL+LENEPV VN WI+K Sbjct: 1 MKYQQLENLESGWKWKYLVKKHREGELITCYIEASAAKEAVDLLLTLENEPVHVNSWIEK 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 H+NP L+NRMKQTIRARRKRHFNAEHQHTRKKSIDLEF+VWQRLAGLAQRRGKTLSET+V Sbjct: 61 HINPALLNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFMVWQRLAGLAQRRGKTLSETVV 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALLGKE 150 QLIEDAE+KEKYAN+MS+LK DLQA+LGK+ Sbjct: 121 QLIEDAEHKEKYANQMSTLKNDLQAMLGKK 150 >UniRef50_Q7N603 UPF0268 protein plu1774 n=9 Tax=Enterobacteriaceae RepID=Y1774_PHOLL Length = 151 Score = 224 bits (571), Expect = 6e-58, Method: Composition-based stats. Identities = 117/150 (78%), Positives = 133/150 (88%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQQLENLE GWKW YL+KKH+EGELIT+YIE SAA AVD L+ LE+EPV V WI++ Sbjct: 1 MKYQQLENLECGWKWTYLMKKHQEGELITKYIENSAAHAAVDKLIELESEPVRVLKWIEQ 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 HMNP+L NRMKQTIRARRKRHFNAEHQHTRKKSIDL+F VW RL+ L+QRRG TLSETI+ Sbjct: 61 HMNPDLSNRMKQTIRARRKRHFNAEHQHTRKKSIDLDFPVWHRLSALSQRRGNTLSETII 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALLGKE 150 QLIEDAE KEKYAN+MSSLK DL+A+LGK+ Sbjct: 121 QLIEDAERKEKYANQMSSLKHDLEAILGKK 150 >UniRef50_B2JYT3 UPF0268 protein YPTS_1558 n=46 Tax=Gammaproteobacteria RepID=Y1558_YERPB Length = 151 Score = 216 bits (550), Expect = 2e-55, Method: Composition-based stats. 
Identities = 120/150 (80%), Positives = 132/150 (88%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQQLENLESGWKW YLVKKHREGE ITR+IE SAAQ+AV+ L+ LENEPV V WID Sbjct: 1 MKYQQLENLESGWKWAYLVKKHREGEAITRHIENSAAQDAVEQLMKLENEPVKVQEWIDA 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 HMN L RMKQTIRARRKRHFNAEHQHTRKKSIDLEF+VWQRLA LA+RRG TLS+T+V Sbjct: 61 HMNVNLATRMKQTIRARRKRHFNAEHQHTRKKSIDLEFLVWQRLAVLARRRGNTLSDTVV 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALLGKE 150 QLIEDAE KEKYA++MSSLKQDL+ +L KE Sbjct: 121 QLIEDAERKEKYASQMSSLKQDLKDILDKE 150 >UniRef50_A5UEY9 UPF0268 protein CGSHiGG_01275 n=27 Tax=Pasteurellaceae RepID=Y1275_HAEIG Length = 148 Score = 195 bits (495), Expect = 4e-49, Method: Composition-based stats. Identities = 74/147 (50%), Positives = 102/147 (69%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQ+LEN E+ WKW YL++KHREGE ITRY E S + LL +N P + WI Sbjct: 1 MKYQKLENQEANWKWIYLIRKHREGENITRYEERSLQEAKAQELLESQNYPSQIEEWIKN 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIV 120 H++P L ++ Q IRARRKR FN E QHT+KKSIDLE+ VW RL+ +++ TLSETI Sbjct: 61 HLSPALPIKLDQAIRARRKRFFNGEKQHTKKKSIDLEYAVWLRLSKYSRKMKMTLSETIT 120 Query: 121 QLIEDAENKEKYANKMSSLKQDLQALL 147 +I++ E+K ++ N+M+++K L+ LL Sbjct: 121 YMIDERESKAQFENQMAAMKTSLKNLL 147 >UniRef50_A4MZW6 Putative uncharacterized protein n=1 Tax=Haemophilus influenzae 22.1-21 RepID=A4MZW6_HAEIN Length = 110 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. 
Identities = 49/94 (52%), Positives = 63/94 (67%) Query: 1 MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDK 60 MKYQ+LEN E+ WKW YL++KHREGE ITRY E S + LL+ +N P + WI Sbjct: 1 MKYQKLENQEANWKWIYLIRKHREGENITRYEERSLQEAKAQELLNAQNYPEKIEEWIKN 60 Query: 61 HMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSI 94 H++P L ++ Q IRARRKR FN E QHT+K + Sbjct: 61 HLSPALPIKLDQAIRARRKRFFNGEKQHTKKNPL 94 >UniRef50_C4NV48 Putative uncharacterized protein n=7 Tax=Gammaproteobacteria RepID=C4NV48_ECOLX Length = 133 Score = 41.3 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 55/117 (47%), Gaps = 7/117 (5%) Query: 3 YQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDKHM 62 + ++ N ++ YL++ R + EAVD L S+ P + WI+K++ Sbjct: 9 FLKVSNEDAQATAIYLLRAASR-PAFWRDVPFDKKLEAVDSLNSIGRSPSELTEWINKYL 67 Query: 63 NPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETI 119 E +N++ +IR RR+R + KSI + + L L++ G +LSE I Sbjct: 68 TAEQINKLGTSIRQRRRRGYGV------GKSITISDKAHRILKRLSEVDGCSLSEVI 118 >UniRef50_A0D1C0 Chromosome undetermined scaffold_34, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0D1C0_PARTE Length = 1131 Score = 38.2 bits (87), Expect = 0.082, Method: Composition-based stats. 
Identities = 29/126 (23%), Positives = 60/126 (47%), Gaps = 3/126 (2%) Query: 26 ELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDKHMNPELVNRMKQTIRARRKR-HFNA 84 ++I I+ + + + LL NE + + ++ L ++ Q + ++KR Sbjct: 398 QIILNAIKKANSLPEIQELLVKLNEESSGVSVLSRLIDL-LTRKLDQIMADQKKRDQMKK 456 Query: 85 EHQHTRKKSID-LEFIVWQRLAGLAQRRGKTLSETIVQLIEDAENKEKYANKMSSLKQDL 143 E + K+ I L+ + QR + +QR K L+E + QLI++ + + + + S LK Sbjct: 457 EREEGYKQQIAYLQLEIKQRTSQESQRDLKNLNEKLNQLIDENKKLQDQSQQYSQLKNKF 516 Query: 144 QALLGK 149 Q + G+ Sbjct: 517 QDMSGQ 522 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.304 0.127 0.353 Lambda K H 0.267 0.0392 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 519,951,656 Number of Sequences: 3077464 Number of extensions: 17059163 Number of successful extensions: 66380 Number of sequences better than 1.0e-01: 10 Number of HSP's better than 0.1 without gapping: 12 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 66366 Number of HSP's gapped (non-prelim): 19 length of query: 150 length of database: 1,040,396,356 effective HSP length: 113 effective length of query: 37 effective length of database: 692,642,924 effective search space: 25627788188 effective search space used: 25627788188 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.9 bits) S2: 87 (38.2 bits)
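
The effective-length figures in the statistics block above follow from the reported lengths and the effective HSP length, and can be checked with a short calculation. The sketch below is illustrative and not part of the BLAST output; the variable names and the helper `expect_from_bits` are hypothetical. It assumes the usual length adjustment (the effective HSP length is subtracted once from the query and once per database sequence) and the standard relation E = search space x 2^(-bit score). Because the report rounds bit scores to one decimal place, recomputed E-values only approximate the printed ones, and the comparison is made against a round 2 hit only.

```python
# Figures copied from the statistics block of the report.
query_length = 150
db_letters = 1_040_396_356
db_sequences = 3_077_464
effective_hsp_length = 113

# Length adjustment: subtract the effective HSP length from the query
# and from every database sequence.
effective_query = query_length - effective_hsp_length
effective_db = db_letters - db_sequences * effective_hsp_length
search_space = effective_query * effective_db


def expect_from_bits(bits: float, space: int) -> float:
    """Approximate E-value for a bit score in a given search space."""
    return space * 2.0 ** (-bits)


print(effective_query)   # reported: 37
print(effective_db)      # reported: 692,642,924
print(search_space)      # reported: 25627788188

# Round 2 hit UniRef50_A0D1C0: 38.2 bits, reported Expect = 0.082.
print(expect_from_bits(38.2, search_space))
```

The three integer results reproduce the reported "effective length of query", "effective length of database", and "effective search space" lines exactly; the recomputed E-value for the 38.2-bit hit lands close to, but not exactly on, the printed 0.082 because of the rounded bit score.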