BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (122 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 165 3e-40 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 123 2e-27 UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 116 2e-25 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 96 2e-19 UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 95 8e-19 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 89 4e-17 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 88 9e-17 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 85 7e-16 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 85 8e-16 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 79 6e-14 UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 51 1e-05 UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrh... 43 0.003 UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 42 0.006 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 165 bits (418), Expect = 3e-40, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA Sbjct: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH Sbjct: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Query: 121 SK 122 SK Sbjct: 121 SK 122 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 6 IFSMLSFFILFACNETA--VYGSDENIIFMRYVEKLHLDKYSVKN---TVKTETMAIQLA 60 I ML + F+CN+ + G D + K + N +K E AI++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 E + YG E EKPY + + WV+ G Y GG F I I+ +N V+N +H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRYS-FGGAFSIIIDARNSKVINVIH 122 Query: 121 SK 122 K Sbjct: 123 YK 124 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 Query: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 96.4 bits (238), Expect = 2e-19, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 35 YVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDS-WVVEGA 93 ++ DK S + V E A ++AE YGERI + EKPY+++ + DS W V+G+ Sbjct: 19 SAKENTNDKTSASDYVPDEETAKKIAEAIWLPIYGERIYD-EKPYVVSLVGDSVWAVDGS 77 Query: 94 KLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 P + GGV IEI K + +L +HSK Sbjct: 78 L-PKKKRGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 94.8 bits (234), Expect = 8e-19, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYE-----VAG 101 V E AI++AE YG +I + +KP++ D W+V+G + + + G Sbjct: 60 NGYVPDEETAIRIAEAVWLPIYGPQIYQ-DKPFVAKLYGDEWLVKGTYVIPDDLNEIMRG 118 Query: 102 GVFIIEINKKNGCVLNFLHSK 122 GV I K +G +L H++ Sbjct: 119 GVPYAVIRKIDGKILAVTHTR 139 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 30/74 (40%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Query: 49 TVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEI 108 V + MA++L+ +Y++Y YG+ AE +KPY IT+ + W +EG + + GG F I I Sbjct: 20 LVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGKQ--PKTLGGNFTILI 77 Query: 109 NKKNGCVLNFLHSK 122 KK+G VL+ +H+K Sbjct: 78 AKKDGQVLHVIHTK 91 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 87.9 bits (216), Expect = 9e-17, Method: Composition-based stats. Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 15/126 (11%) Query: 1 MKYSSIFSMLSFFILF-ACNETAVYGSDENIIFMRYVEKLHLDKYSV--KNTVKTETMAI 57 MK ++ ++ I F +C++ + V K +DK ++ K+ V AI Sbjct: 1 MKKHNVLLVVFLLIFFNSCSQ--------EKTKSQKVVKSVVDKPNLVFKDLVPDNETAI 52 Query: 58 QLAEIYVRYRYGERIAEEEKPYLITE-LPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVL 116 ++AE + YG++I + ++P++ T P+ W VEG + GGV IEI KK+ +L Sbjct: 53 KIAEAILVPIYGKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKIL 109 Query: 117 NFLHSK 122 H K Sbjct: 110 KVYHEK 115 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 84.8 bits (208), Expect = 7e-16, Method: Composition-based stats. Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 TE ++L EIYV YGE+ A+ +KPY + + + WV+ G P + GG F I Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIG-A 85 Query: 112 NGCVLNFLHSK 122 NG + HSK Sbjct: 86 NGQLEEITHSK 96 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 84.8 bits (208), Expect = 8e-16, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 2/75 (2%) Query: 48 NTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIE 107 N ++ A+++AE + YG+ + ++P+ + E + + G P V GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 108 INKKNGCVLNFLHSK 122 I K + V + H K Sbjct: 322 IRKSDAGVARYTHGK 336 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 78.7 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 3/77 (3%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD-SWVVEGAKLPYEVAGGVFI 105 + +A ++A ++ YG + ++EKPY + W++ G+K GGV Sbjct: 68 SGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYWIITGSKDSISK-GGVAE 125 Query: 106 IEINKKNGCVLNFLHSK 122 + + K +G V+ H K Sbjct: 126 VTLRKSDGRVIMVTHGK 142 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 51.3 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 1/71 (1%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 E AI AE + Y G+ + +KP+ + WVV G ++G V I + K+ Sbjct: 78 NEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAV-HIILRKQ 136 Query: 112 NGCVLNFLHSK 122 +G +L + K Sbjct: 137 DGKILQVFYEK 147 >UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMX6_BRASO Length = 110 Score = 43.2 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 2/68 (2%) Query: 55 MAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGC 114 A ++AE Y+ Y P ++ + D W V +LP +AGG +I + K + Sbjct: 45 TAARIAERYLAVHY-PAFDTIAMPPIVDDEGDVWKVS-YELPPNMAGGNPVIVVEKTSWK 102 Query: 115 VLNFLHSK 122 VL H + Sbjct: 103 VLRVYHEQ 110 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 6/74 (8%) Query: 52 TETMAIQLAE-IYVRYRYGERIAEEEKPYLITELP--DSWVVEGAKLPYEVAGGVFIIEI 108 T+ A+ A Y+ Y + + EE P+ I+ P ++W++EG P GGV I + Sbjct: 91 TDEHAVATAIFSYLAEHYSQFL--EETPFAISYNPIAEAWIIEGTL-PPGWLGGVIYIAL 147 Query: 109 NKKNGCVLNFLHSK 122 K+NG +L +K Sbjct: 148 AKENGKLLMMYGTK 161 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 157 1e-37 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 119 2e-26 UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 113 2e-24 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 111 9e-24 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 99 4e-20 UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 98 9e-20 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 89 4e-17 UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 86 5e-16 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 85 8e-16 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 84 2e-15 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 80 3e-14 Sequences not found previously or not previously below threshold: UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 44 0.001 UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrh... 43 0.004 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 157 bits (396), Expect = 1e-37, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA Sbjct: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH Sbjct: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Query: 121 SK 122 SK Sbjct: 121 SK 122 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 119 bits (299), Expect = 2e-26, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 6 IFSMLSFFILFACNETA--VYGSDENIIFMRYVEKLHLDKYSVKN---TVKTETMAIQLA 60 I ML + F+CN+ + G D + K + N +K E AI++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 E + YG E EKPY + + WV+ G Y GG F I I+ +N V+N +H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRYS-FGGAFSIIIDARNSKVINVIH 122 Query: 121 SK 122 K Sbjct: 123 YK 124 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 Query: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 111 bits (276), Expect = 9e-24, Method: Composition-based stats. Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 15/126 (11%) Query: 1 MKYSSIFSMLSFFILF-ACNETAVYGSDENIIFMRYVEKLHLDKYSV--KNTVKTETMAI 57 MK ++ ++ I F +C++ + V K +DK ++ K+ V AI Sbjct: 1 MKKHNVLLVVFLLIFFNSCSQ--------EKTKSQKVVKSVVDKPNLVFKDLVPDNETAI 52 Query: 58 QLAEIYVRYRYGERIAEEEKPYLITE-LPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVL 116 ++AE + YG++I + ++P++ T P+ W VEG + GGV IEI KK+ +L Sbjct: 53 KIAEAILVPIYGKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKIL 109 Query: 117 NFLHSK 122 H K Sbjct: 110 KVYHEK 115 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 99.1 bits (245), Expect = 4e-20, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 35 YVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDS-WVVEGA 93 ++ DK S + V E A ++AE YGERI + EKPY+++ + DS W V+G+ Sbjct: 19 SAKENTNDKTSASDYVPDEETAKKIAEAIWLPIYGERIYD-EKPYVVSLVGDSVWAVDGS 77 Query: 94 KLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 P + GGV IEI K + +L +HSK Sbjct: 78 L-PKKKRGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 98.0 bits (242), Expect = 9e-20, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYE-----VAG 101 V E AI++AE YG +I + +KP++ D W+V+G + + + G Sbjct: 60 NGYVPDEETAIRIAEAVWLPIYGPQIYQ-DKPFVAKLYGDEWLVKGTYVIPDDLNEIMRG 118 Query: 102 GVFIIEINKKNGCVLNFLHSK 122 GV I K +G +L H++ Sbjct: 119 GVPYAVIRKIDGKILAVTHTR 139 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 89.1 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 30/74 (40%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Query: 49 TVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEI 108 V + MA++L+ +Y++Y YG+ AE +KPY IT+ + W +EG + + GG F I I Sbjct: 20 LVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGKQ--PKTLGGNFTILI 77 Query: 109 NKKNGCVLNFLHSK 122 KK+G VL+ +H+K Sbjct: 78 AKKDGQVLHVIHTK 91 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 85.6 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 1/71 (1%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 E AI AE + Y G+ + +KP+ + WVV G ++G V I + K+ Sbjct: 78 NEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAV-HIILRKQ 136 Query: 112 NGCVLNFLHSK 122 +G +L + K Sbjct: 137 DGKILQVFYEK 147 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 84.9 bits (208), Expect = 8e-16, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 2/75 (2%) Query: 48 NTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIE 107 N ++ A+++AE + YG+ + ++P+ + E + + G P V GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 108 INKKNGCVLNFLHSK 122 I K + V + H K Sbjct: 322 IRKSDAGVARYTHGK 336 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 83.7 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 TE ++L EIYV YGE+ A+ +KPY + + + WV+ G P + GG F I Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIG-A 85 Query: 112 NGCVLNFLHSK 122 NG + HSK Sbjct: 86 NGQLEEITHSK 96 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 79.8 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 3/77 (3%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD-SWVVEGAKLPYEVAGGVFI 105 + +A ++A ++ YG + ++EKPY + W++ G+K GGV Sbjct: 68 SGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYWIITGSKDSISK-GGVAE 125 Query: 106 IEINKKNGCVLNFLHSK 122 + + K +G V+ H K Sbjct: 126 VTLRKSDGRVIMVTHGK 142 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 6/74 (8%) Query: 52 TETMAIQLAE-IYVRYRYGERIAEEEKPYLITELP--DSWVVEGAKLPYEVAGGVFIIEI 108 T+ A+ A Y+ Y + + EE P+ I+ P ++W++EG P GGV I + Sbjct: 91 TDEHAVATAIFSYLAEHYSQFL--EETPFAISYNPIAEAWIIEGTL-PPGWLGGVIYIAL 147 Query: 109 NKKNGCVLNFLHSK 122 K+NG +L +K Sbjct: 148 AKENGKLLMMYGTK 161 >UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMX6_BRASO Length = 110 Score = 42.9 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 2/68 (2%) Query: 55 MAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGC 114 A ++AE Y+ Y P ++ + D W V +LP +AGG +I + K + Sbjct: 45 TAARIAERYLAVHY-PAFDTIAMPPIVDDEGDVWKVS-YELPPNMAGGNPVIVVEKTSWK 102 Query: 115 VLNFLHSK 122 VL H + Sbjct: 103 VLRVYHEQ 110 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 154 7e-37 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 117 1e-25 UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 111 1e-23 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 110 2e-23 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 97 2e-19 UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 95 7e-19 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 87 1e-16 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 83 3e-15 UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 83 4e-15 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 82 7e-15 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 78 1e-13 UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 71 2e-11 Sequences not found previously or not previously below threshold: UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrh... 45 0.001 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 154 bits (389), Expect = 7e-37, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA Sbjct: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH Sbjct: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Query: 121 SK 122 SK Sbjct: 121 SK 122 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 117 bits (292), Expect = 1e-25, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 6 IFSMLSFFILFACNETA--VYGSDENIIFMRYVEKLHLDKYSVKN---TVKTETMAIQLA 60 I ML + F+CN+ + G D + K + N +K E AI++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 E + YG E EKPY + + WV+ G Y GG F I I+ +N V+N +H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRYS-FGGAFSIIIDARNSKVINVIH 122 Query: 121 SK 122 K Sbjct: 123 YK 124 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 111 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 Query: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 15/126 (11%) Query: 1 MKYSSIFSMLSFFILF-ACNETAVYGSDENIIFMRYVEKLHLDKYSV--KNTVKTETMAI 57 MK ++ ++ I F +C++ + V K +DK ++ K+ V AI Sbjct: 1 MKKHNVLLVVFLLIFFNSCSQ--------EKTKSQKVVKSVVDKPNLVFKDLVPDNETAI 52 Query: 58 QLAEIYVRYRYGERIAEEEKPYLITE-LPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVL 116 ++AE + YG++I + ++P++ T P+ W VEG + GGV IEI KK+ +L Sbjct: 53 KIAEAILVPIYGKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKIL 109 Query: 117 NFLHSK 122 H K Sbjct: 110 KVYHEK 115 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 96.8 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 35 YVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDS-WVVEGA 93 ++ DK S + V E A ++AE YGERI + EKPY+++ + DS W V+G+ Sbjct: 19 SAKENTNDKTSASDYVPDEETAKKIAEAIWLPIYGERIYD-EKPYVVSLVGDSVWAVDGS 77 Query: 94 KLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 P + GGV IEI K + +L +HSK Sbjct: 78 L-PKKKRGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 94.9 bits (234), Expect = 7e-19, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYE-----VAG 101 V E AI++AE YG +I + +KP++ D W+V+G + + + G Sbjct: 60 NGYVPDEETAIRIAEAVWLPIYGPQIYQ-DKPFVAKLYGDEWLVKGTYVIPDDLNEIMRG 118 Query: 102 GVFIIEINKKNGCVLNFLHSK 122 GV I K +G +L H++ Sbjct: 119 GVPYAVIRKIDGKILAVTHTR 139 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 87.2 bits (214), Expect = 1e-16, Method: Composition-based stats. Identities = 30/74 (40%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Query: 49 TVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEI 108 V + MA++L+ +Y++Y YG+ AE +KPY IT+ + W +EG + + GG F I I Sbjct: 20 LVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGKQ--PKTLGGNFTILI 77 Query: 109 NKKNGCVLNFLHSK 122 KK+G VL+ +H+K Sbjct: 78 AKKDGQVLHVIHTK 91 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 82.5 bits (202), Expect = 3e-15, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 2/75 (2%) Query: 48 NTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIE 107 N ++ A+++AE + YG+ + ++P+ + E + + G P V GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 108 INKKNGCVLNFLHSK 122 I K + V + H K Sbjct: 322 IRKSDAGVARYTHGK 336 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 82.5 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 1/71 (1%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 E AI AE + Y G+ + +KP+ + WVV G ++G V I + K+ Sbjct: 78 NEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAV-HIILRKQ 136 Query: 112 NGCVLNFLHSK 122 +G +L + K Sbjct: 137 DGKILQVFYEK 147 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 81.8 bits (200), Expect = 7e-15, Method: Composition-based stats. Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 TE ++L EIYV YGE+ A+ +KPY + + + WV+ G P + GG F I Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIG-A 85 Query: 112 NGCVLNFLHSK 122 NG + HSK Sbjct: 86 NGQLEEITHSK 96 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 77.5 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 3/77 (3%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD-SWVVEGAKLPYEVAGGVFI 105 + +A ++A ++ YG + ++EKPY + W++ G+K GGV Sbjct: 68 SGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYWIITGSKDSISK-GGVAE 125 Query: 106 IEINKKNGCVLNFLHSK 122 + + K +G V+ H K Sbjct: 126 VTLRKSDGRVIMVTHGK 142 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 70.6 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 6/74 (8%) Query: 52 TETMAIQLAE-IYVRYRYGERIAEEEKPYLITELP--DSWVVEGAKLPYEVAGGVFIIEI 108 T+ A+ A Y+ Y + + EE P+ I+ P ++W++EG P GGV I + Sbjct: 91 TDEHAVATAIFSYLAEHYSQFL--EETPFAISYNPIAEAWIIEGTL-PPGWLGGVIYIAL 147 Query: 109 NKKNGCVLNFLHSK 122 K+NG +L +K Sbjct: 148 AKENGKLLMMYGTK 161 >UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMX6_BRASO Length = 110 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 2/68 (2%) Query: 55 MAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGC 114 A ++AE Y+ Y P ++ + D W V P +AGG +I + K + Sbjct: 45 TAARIAERYLAVHYPAFDTIAMPP-IVDDEGDVWKVSYEL-PPNMAGGNPVIVVEKTSWK 102 Query: 115 VLNFLHSK 122 VL H + Sbjct: 103 VLRVYHEQ 110 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.124 0.319 Lambda K H 0.267 0.0382 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 448,924,451 Number of Sequences: 3077464 Number of extensions: 9766498 Number of successful extensions: 47148 Number of sequences better than 1.0e-01: 13 Number of HSP's better than 0.1 without gapping: 31 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 47072 Number of HSP's gapped (non-prelim): 39 length of query: 122 length of database: 1,040,396,356 effective HSP length: 88 effective length of query: 34 effective length of database: 769,579,524 effective search space: 26165703816 effective search space used: 26165703816 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 87 (38.2 bits)