BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (127 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 261 4e-69 UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 102 3e-21 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 60 2e-08 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 46 3e-04 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 43 0.004 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 43 0.004 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 42 0.004 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 40 0.021 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 39 0.075 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 261 bits (668), Expect = 4e-69, Method: Compositional matrix adjust. Identities = 127/127 (100%), Positives = 127/127 (100%) Query: 1 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI 60 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI Sbjct: 1 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI 60 Query: 61 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI 120 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI Sbjct: 61 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI 120 Query: 121 LNFGHGK 127 LNFGHGK Sbjct: 121 LNFGHGK 127 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 102 bits (254), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 Query: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%) Query: 50 TVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVF 109 V LV ++ +ALEL+ +Y+K YG++ AE +KPY IT+ W +EG + GG F Sbjct: 16 PVEILVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGK--QPKTLGGNF 73 Query: 110 IIEIGKNDGRILNFGHGK 127 I I K DG++L+ H K Sbjct: 74 TILIAKKDGQVLHVIHTK 91 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 46.2 bits (108), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 32/115 (27%), Positives = 61/115 (53%), Gaps = 5/115 (4%) Query: 15 RNCVIVSLFVFTYNTWAQ-CNNNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRY 73 N ++V + +N+ +Q + K+++ + ++LV + A+++AE + Y Sbjct: 4 HNVLLVVFLLIFFNSCSQEKTKSQKVVKSVVDKPNLVFKDLVPDNETAIKIAEAILVPIY 63 Query: 74 GQDAAEEEKPYEITELTTS-WVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 G+ ++ +P+ T + + W VEGT+H+ + GGV IEI K D +IL H K Sbjct: 64 GKKIYKQ-RPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKILKVYHEK 115 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 1/74 (1%) Query: 54 LVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEI 113 L+K + A+++AE + YG+ E EKPYE + WV+ GT+ GG F I I Sbjct: 52 LIKEENTAIKVAEPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRYSF-GGAFSIII 110 Query: 114 GKNDGRILNFGHGK 127 + +++N H K Sbjct: 111 DARNSKVINVIHYK 124 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 3/66 (4%) Query: 62 LELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRIL 121 L L EIYV YG+ A+ +KPY + + WV+ G + GG F IG N G++ Sbjct: 34 LRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIGAN-GQLE 90 Query: 122 NFGHGK 127 H K Sbjct: 91 EITHSK 96 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 42.4 bits (98), Expect = 0.004, Method: Compositional matrix adjust. Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 5/86 (5%) Query: 44 ESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEIT-ELTTSWVVEGTIHSD 102 ++EG + N +A E+A +K YG ++EKPY++ + W++ G+ D Sbjct: 60 DNEGANPDSGFIPNAKVAYEVAIAVLKPIYGH-YVDKEKPYKVVLDSKRYWIITGS--KD 116 Query: 103 QIA-GGVFIIEIGKNDGRILNFGHGK 127 I+ GGV + + K+DGR++ HGK Sbjct: 117 SISKGGVAEVTLRKSDGRVIMVTHGK 142 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 40.0 bits (92), Expect = 0.021, Method: Compositional matrix adjust. Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 3/94 (3%) Query: 35 NNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTS-W 93 +I I K + K + + V ++ A ++AE YG+ +EKPY ++ + S W Sbjct: 14 TSICISAKENTNDKTSASDYVPDEETAKKIAEAIWLPIYGE-RIYDEKPYVVSLVGDSVW 72 Query: 94 VVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 V+G++ + GGV IEI KND +IL H K Sbjct: 73 AVDGSLPKKK-RGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 38.5 bits (88), Expect = 0.075, Method: Compositional matrix adjust. Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%) Query: 53 NLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIE 112 NL +K A+++AE + YG++ + +P+ + E T + + GT+ + GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLRQ-RPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 113 IGKNDGRILNFGHGK 127 I K+D + + HGK Sbjct: 322 IRKSDAGVARYTHGK 336 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 195 3e-49 UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 130 1e-29 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 128 4e-29 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 107 2e-22 Sequences not found previously or not previously below threshold: UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 78 9e-14 UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 62 8e-09 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 58 8e-08 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 55 5e-07 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 50 2e-05 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 50 3e-05 UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 47 2e-04 UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 43 0.004 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 195 bits (496), Expect = 3e-49, Method: Composition-based stats. Identities = 127/127 (100%), Positives = 127/127 (100%) Query: 1 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI 60 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI Sbjct: 1 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI 60 Query: 61 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI 120 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI Sbjct: 61 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI 120 Query: 121 LNFGHGK 127 LNFGHGK Sbjct: 121 LNFGHGK 127 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 57/114 (50%), Positives = 73/114 (64%), Gaps = 4/114 (3%) Query: 18 VIVSLFVFTYNTWA--QCNNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRY 73 ++ +F N A + NI MR E KY+V+N VK + +A++LAEIYV+ RY Sbjct: 9 MLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRY 68 Query: 74 GQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 G+ AEEEKPY ITEL SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 69 GERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 128 bits (322), Expect = 4e-29, Method: Composition-based stats. Identities = 32/115 (27%), Positives = 61/115 (53%), Gaps = 5/115 (4%) Query: 15 RNCVIVSLFVFTYNTWAQ-CNNNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRY 73 N ++V + +N+ +Q + K+++ + ++LV + A+++AE + Y Sbjct: 4 HNVLLVVFLLIFFNSCSQEKTKSQKVVKSVVDKPNLVFKDLVPDNETAIKIAEAILVPIY 63 Query: 74 GQDAAEEEKPYEITELTTS-WVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 G+ + ++P+ T + + W VEGT+H+ + GGV IEI K D +IL H K Sbjct: 64 GKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKILKVYHEK 115 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 107 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 2/78 (2%) Query: 50 TVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVF 109 V LV ++ +ALEL+ +Y+K YG++ AE +KPY IT+ W +EG + GG F Sbjct: 16 PVEILVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGK--QPKTLGGNF 73 Query: 110 IIEIGKNDGRILNFGHGK 127 I I K DG++L+ H K Sbjct: 74 TILIAKKDGQVLHVIHTK 91 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 77.8 bits (190), Expect = 9e-14, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 53/114 (46%), Gaps = 11/114 (9%) Query: 15 RNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYG 74 +N +++ L + N + K + + V ++ A ++AE YG Sbjct: 2 KNVMLLILVALITSICISAKEN--------TNDKTSASDYVPDEETAKKIAEAIWLPIYG 53 Query: 75 QDAAEEEKPYEITELTTS-WVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 + + EKPY ++ + S W V+G++ + GGV IEI KND +IL H K Sbjct: 54 ERIYD-EKPYVVSLVGDSVWAVDGSL-PKKKRGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 61.6 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 6/93 (6%) Query: 40 MRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTI 99 + + V ++ A+ +AE YG ++KP+ W+V+GT Sbjct: 48 IENIKDLDFVPENGYVPDEETAIRIAEAVWLPIYGPQIY-QDKPFVAKLYGDEWLVKGTY 106 Query: 100 HSDQIA-----GGVFIIEIGKNDGRILNFGHGK 127 GGV I K DG+IL H + Sbjct: 107 VIPDDLNEIMRGGVPYAVIRKIDGKILAVTHTR 139 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 58.2 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 30/122 (24%), Positives = 57/122 (46%), Gaps = 13/122 (10%) Query: 18 VIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRN------------LVKNKAIALELA 65 +++ LFV + + ++N + + E + +++ L+K + A+++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 66 EIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGH 125 E + YG+ E EKPYE + WV+ GT+ GG F I I + +++N H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRY-SFGGAFSIIIDARNSKVINVIH 122 Query: 126 GK 127 K Sbjct: 123 YK 124 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 55.5 bits (132), Expect = 5e-07, Method: Composition-based stats. Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 3/71 (4%) Query: 57 NKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKN 116 + L L EIYV YG+ A+ +KPY + + WV+ G + GG F IG Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIGA- 85 Query: 117 DGRILNFGHGK 127 +G++ H K Sbjct: 86 NGQLEEITHSK 96 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 50.1 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%) Query: 53 NLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIE 112 NL +K A+++AE + YG++ ++P+ + E T + + GT+ + GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 113 IGKNDGRILNFGHGK 127 I K+D + + HGK Sbjct: 322 IRKSDAGVARYTHGK 336 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 3/94 (3%) Query: 35 NNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTS-W 93 + ++EG + N +A E+A +K YG ++EKPY++ + W Sbjct: 51 EDTITFENLDNEGANPDSGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYW 109 Query: 94 VVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 ++ G+ S GGV + + K+DGR++ HGK Sbjct: 110 IITGSKDSI-SKGGVAEVTLRKSDGRVIMVTHGK 142 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 23/71 (32%), Positives = 35/71 (49%), Gaps = 1/71 (1%) Query: 57 NKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKN 116 N+ A+ AE + GQ+ +KP++ WVV GT ++G V II + K Sbjct: 78 NEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAVHII-LRKQ 136 Query: 117 DGRILNFGHGK 127 DG+IL + K Sbjct: 137 DGKILQVFYEK 147 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 42.8 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 6/83 (7%) Query: 47 GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELT--TSWVVEGTIHSDQI 104 ++ L A+A + Y+ Y Q EE P+ I+ +W++EGT+ Sbjct: 83 DEFAFGELTDEHAVATAI-FSYLAEHYSQ--FLEETPFAISYNPIAEAWIIEGTL-PPGW 138 Query: 105 AGGVFIIEIGKNDGRILNFGHGK 127 GGV I + K +G++L K Sbjct: 139 LGGVIYIALAKENGKLLMMYGTK 161 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 163 2e-39 UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 120 1e-26 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 109 4e-23 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 105 5e-22 UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 105 6e-22 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 104 7e-22 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 94 1e-18 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 93 3e-18 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 83 2e-15 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 83 2e-15 UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 82 5e-15 Sequences not found previously or not previously below threshold: UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 44 0.002 UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrh... 41 0.013 CONVERGED! >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 163 bits (412), Expect = 2e-39, Method: Composition-based stats. Identities = 127/127 (100%), Positives = 127/127 (100%) Query: 1 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI 60 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI Sbjct: 1 MSAEFMVICKKILFRNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAI 60 Query: 61 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI 120 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI Sbjct: 61 ALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRI 120 Query: 121 LNFGHGK 127 LNFGHGK Sbjct: 121 LNFGHGK 127 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. Identities = 57/114 (50%), Positives = 73/114 (64%), Gaps = 4/114 (3%) Query: 18 VIVSLFVFTYNTWA--QCNNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRY 73 ++ +F N A + NI MR E KY+V+N VK + +A++LAEIYV+ RY Sbjct: 9 MLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRY 68 Query: 74 GQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 G+ AEEEKPY ITEL SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 69 GERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 109 bits (271), Expect = 4e-23, Method: Composition-based stats. Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 5/115 (4%) Query: 15 RNCVIVSLFVFTYNTWAQ-CNNNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRY 73 N ++V + +N+ +Q + K+++ + ++LV + A+++AE + Y Sbjct: 4 HNVLLVVFLLIFFNSCSQEKTKSQKVVKSVVDKPNLVFKDLVPDNETAIKIAEAILVPIY 63 Query: 74 GQDAAEEEKPYEITE-LTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 G+ + ++P+ T W VEGT+H+ + GGV IEI K D +IL H K Sbjct: 64 GKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKILKVYHEK 115 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 52/114 (45%), Gaps = 11/114 (9%) Query: 15 RNCVIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYG 74 +N +++ L + N K + + V ++ A ++AE YG Sbjct: 2 KNVMLLILVALITSICISAKENT--------NDKTSASDYVPDEETAKKIAEAIWLPIYG 53 Query: 75 QDAAEEEKPYEITELTTS-WVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 + + EKPY ++ + S W V+G++ + GGV IEI KND +IL H K Sbjct: 54 ERIYD-EKPYVVSLVGDSVWAVDGSL-PKKKRGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 105 bits (261), Expect = 6e-22, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 6/93 (6%) Query: 40 MRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTI 99 + + V ++ A+ +AE YG + +KP+ W+V+GT Sbjct: 48 IENIKDLDFVPENGYVPDEETAIRIAEAVWLPIYGPQIYQ-DKPFVAKLYGDEWLVKGTY 106 Query: 100 HSDQIA-----GGVFIIEIGKNDGRILNFGHGK 127 GGV I K DG+IL H + Sbjct: 107 VIPDDLNEIMRGGVPYAVIRKIDGKILAVTHTR 139 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 104 bits (260), Expect = 7e-22, Method: Composition-based stats. Identities = 30/122 (24%), Positives = 57/122 (46%), Gaps = 13/122 (10%) Query: 18 VIVSLFVFTYNTWAQCNNNIKIMRKYESEGKYTVRN------------LVKNKAIALELA 65 +++ LFV + + ++N + + E + +++ L+K + A+++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 66 EIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGH 125 E + YG+ E EKPYE + WV+ GT+ GG F I I + +++N H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRY-SFGGAFSIIIDARNSKVINVIH 122 Query: 126 GK 127 K Sbjct: 123 YK 124 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 94.4 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 3/94 (3%) Query: 35 NNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT-SW 93 + ++EG + N +A E+A +K YG ++EKPY++ + W Sbjct: 51 EDTITFENLDNEGANPDSGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYW 109 Query: 94 VVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 ++ G+ S GGV + + K+DGR++ HGK Sbjct: 110 IITGSKDSI-SKGGVAEVTLRKSDGRVIMVTHGK 142 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 92.9 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 2/80 (2%) Query: 48 KYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGG 107 + V LV ++ +ALEL+ +Y+K YG++ AE +KPY IT+ W +EG + GG Sbjct: 14 ENPVEILVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGK--QPKTLGG 71 Query: 108 VFIIEIGKNDGRILNFGHGK 127 F I I K DG++L+ H K Sbjct: 72 NFTILIAKKDGQVLHVIHTK 91 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 83.2 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 3/71 (4%) Query: 57 NKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKN 116 + L L EIYV YG+ A+ +KPY + + WV+ G + GG F IG Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIGA- 85 Query: 117 DGRILNFGHGK 127 +G++ H K Sbjct: 86 NGQLEEITHSK 96 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 83.2 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 46/93 (49%), Gaps = 7/93 (7%) Query: 35 NNIKIMRKYESEGKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWV 94 N ++ +E NL +K A+++AE + YG++ ++P+ + E T + Sbjct: 251 ENSITIKTAPAEV-----NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQ 304 Query: 95 VEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 + GT+ + GGV I I K+D + + HGK Sbjct: 305 ISGTLAPSSV-GGVAEISIRKSDAGVARYTHGK 336 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 82.1 bits (201), Expect = 5e-15, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 37/78 (47%), Gaps = 1/78 (1%) Query: 50 TVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVF 109 ++ N+ A+ AE + GQ+ +KP++ WVV GT ++G V Sbjct: 71 PEIIVLFNEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAVH 130 Query: 110 IIEIGKNDGRILNFGHGK 127 II + K DG+IL + K Sbjct: 131 II-LRKQDGKILQVFYEK 147 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 43.6 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 33/74 (44%), Gaps = 6/74 (8%) Query: 57 NKAIALELAE-IYVKNRYGQDAAEEEKPYEITELT--TSWVVEGTIHSDQIAGGVFIIEI 113 A+ A Y+ Y Q EE P+ I+ +W++EGT+ GGV I + Sbjct: 91 TDEHAVATAIFSYLAEHYSQFL--EETPFAISYNPIAEAWIIEGTL-PPGWLGGVIYIAL 147 Query: 114 GKNDGRILNFGHGK 127 K +G++L K Sbjct: 148 AKENGKLLMMYGTK 161 >UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMX6_BRASO Length = 110 Score = 40.9 bits (94), Expect = 0.013, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 2/68 (2%) Query: 60 IALELAEIYVKNRYGQDAAEEEKPYEITELTTSWVVEGTIHSDQIAGGVFIIEIGKNDGR 119 A +AE Y+ Y P + + W V + +AGG +I + K + Sbjct: 45 TAARIAERYLAVHY-PAFDTIAMPPIVDDEGDVWKVSYEL-PPNMAGGNPVIVVEKTSWK 102 Query: 120 ILNFGHGK 127 +L H + Sbjct: 103 VLRVYHEQ 110 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.317 0.129 0.332 Lambda K H 0.267 0.0397 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 611,584,586 Number of Sequences: 3077464 Number of extensions: 18312709 Number of successful extensions: 62775 Number of sequences better than 1.0e-01: 13 Number of HSP's better than 0.1 without gapping: 23 Number of HSP's successfully gapped in prelim test: 12 Number of HSP's that attempted gapping in prelim test: 62721 Number of HSP's gapped (non-prelim): 35 length of query: 127 length of database: 1,040,396,356 effective HSP length: 93 effective length of query: 34 effective length of database: 754,192,204 effective search space: 25642534936 effective search space used: 25642534936 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 87 (38.2 bits)