BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (122 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 251 5e-66 UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 102 3e-21 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 60 3e-08 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 54 2e-06 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 48 1e-04 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 43 0.004 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 42 0.007 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 251 bits (641), Expect = 5e-66, Method: Compositional matrix adjust. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA Sbjct: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH Sbjct: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Query: 121 SK 122 SK Sbjct: 121 SK 122 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 102 bits (254), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 Query: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 32/73 (43%), Positives = 49/73 (67%), Gaps = 2/73 (2%) Query: 50 VKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEIN 109 V + MA++L+ +Y++Y YG+ AE +KPY IT+ + W +EG K P + GG F I I Sbjct: 21 VNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEG-KQP-KTLGGNFTILIA 78 Query: 110 KKNGCVLNFLHSK 122 KK+G VL+ +H+K Sbjct: 79 KKDGQVLHVIHTK 91 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 3/82 (3%) Query: 42 DKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDS-WVVEGAKLPYEVA 100 DK S + V E A ++AE YGERI +E KPY+++ + DS W V+G+ LP + Sbjct: 26 DKTSASDYVPDEETAKKIAEAIWLPIYGERIYDE-KPYVVSLVGDSVWAVDGS-LPKKKR 83 Query: 101 GGVFIIEINKKNGCVLNFLHSK 122 GGV IEI K + +L +HSK Sbjct: 84 GGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 TE ++L EIYV YGE+ A+ +KPY + + + WV+ G P + GG F I Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIG-A 85 Query: 112 NGCVLNFLHSK 122 NG + HSK Sbjct: 86 NGQLEEITHSK 96 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 1/73 (1%) Query: 50 VKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEIN 109 +K E AI++AE + YG E EKPY + + WV+ G Y GG F I I+ Sbjct: 53 IKEENTAIKVAEPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRYSF-GGAFSIIID 111 Query: 110 KKNGCVLNFLHSK 122 +N V+N +H K Sbjct: 112 ARNSKVINVIHYK 124 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 41.6 bits (96), Expect = 0.007, Method: Compositional matrix adjust. Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 7/115 (6%) Query: 9 MLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRY 68 +L F+L N + + + V+K +L K+ V AI++AE + Y Sbjct: 7 LLVVFLLIFFNSCSQEKTKSQKVVKSVVDKPNL---VFKDLVPDNETAIKIAEAILVPIY 63 Query: 69 GERIAEEEKPYLIT-ELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 G++I ++ +P++ T + P+ W VEG + GGV IEI KK+ +L H K Sbjct: 64 GKKIYKQ-RPFVATLKSPNVWAVEGTL--HTTKGGVAYIEIQKKDCKILKVYHEK 115 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 194 1e-48 UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 132 5e-30 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 100 2e-20 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 98 7e-20 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 95 8e-19 Sequences not found previously or not previously below threshold: UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 69 5e-11 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 69 6e-11 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 56 4e-07 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 50 3e-05 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 47 2e-04 UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 43 0.004 UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 39 0.045 UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrh... 39 0.068 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 194 bits (492), Expect = 1e-48, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA Sbjct: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH Sbjct: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Query: 121 SK 122 SK Sbjct: 121 SK 122 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 132 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 Query: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 99.8 bits (247), Expect = 2e-20, Method: Composition-based stats. Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 3/82 (3%) Query: 42 DKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDS-WVVEGAKLPYEVA 100 DK S + V E A ++AE YGERI +E KPY+++ + DS W V+G+ LP + Sbjct: 26 DKTSASDYVPDEETAKKIAEAIWLPIYGERIYDE-KPYVVSLVGDSVWAVDGS-LPKKKR 83 Query: 101 GGVFIIEINKKNGCVLNFLHSK 122 GGV IEI K + +L +HSK Sbjct: 84 GGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 98.3 bits (243), Expect = 7e-20, Method: Composition-based stats. Identities = 30/74 (40%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Query: 49 TVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEI 108 V + MA++L+ +Y++Y YG+ AE +KPY IT+ + W +EG + + GG F I I Sbjct: 20 LVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGKQ--PKTLGGNFTILI 77 Query: 109 NKKNGCVLNFLHSK 122 KK+G VL+ +H+K Sbjct: 78 AKKDGQVLHVIHTK 91 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 94.8 bits (234), Expect = 8e-19, Method: Composition-based stats. Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 TE ++L EIYV YGE+ A+ +KPY + + + WV+ G P + GG F I Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIG-A 85 Query: 112 NGCVLNFLHSK 122 NG + HSK Sbjct: 86 NGQLEEITHSK 96 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 68.6 bits (166), Expect = 5e-11, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 6/79 (7%) Query: 49 TVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYE-----VAGGV 103 V E AI++AE YG +I + +KP++ D W+V+G + + + GGV Sbjct: 62 YVPDEETAIRIAEAVWLPIYGPQIYQ-DKPFVAKLYGDEWLVKGTYVIPDDLNEIMRGGV 120 Query: 104 FIIEINKKNGCVLNFLHSK 122 I K +G +L H++ Sbjct: 121 PYAVIRKIDGKILAVTHTR 139 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 68.6 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 4/77 (5%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLIT-ELPDSWVVEGAKLPYEVAGGVFI 105 K+ V AI++AE + YG++I + ++P++ T + P+ W VEG + GGV Sbjct: 42 KDLVPDNETAIKIAEAILVPIYGKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAY 98 Query: 106 IEINKKNGCVLNFLHSK 122 IEI KK+ +L H K Sbjct: 99 IEIQKKDCKILKVYHEK 115 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 55.9 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 6 IFSMLSFFILFACNETA--VYGSDENIIFMRYVEKLHLDKYSVKN---TVKTETMAIQLA 60 I ML + F+CN+ + G D + K + N +K E AI++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 E + YG E EKPY + + WV+ G Y GG F I I+ +N V+N +H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRY-SFGGAFSIIIDARNSKVINVIH 122 Query: 121 SK 122 K Sbjct: 123 YK 124 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 3/77 (3%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD-SWVVEGAKLPYEVAGGVFI 105 + +A ++A ++ YG + ++EKPY + W++ G+K GGV Sbjct: 68 SGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYWIITGSKDSISK-GGVAE 125 Query: 106 IEINKKNGCVLNFLHSK 122 + + K +G V+ H K Sbjct: 126 VTLRKSDGRVIMVTHGK 142 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 46.7 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 2/75 (2%) Query: 48 NTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIE 107 N ++ A+++AE + YG+ + ++P+ + E + + G P V GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 108 INKKNGCVLNFLHSK 122 I K + V + H K Sbjct: 322 IRKSDAGVARYTHGK 336 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 42.8 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 1/71 (1%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 E AI AE + Y G+ + +KP+ + WVV G ++G V II + K+ Sbjct: 78 NEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAVHII-LRKQ 136 Query: 112 NGCVLNFLHSK 122 +G +L + K Sbjct: 137 DGKILQVFYEK 147 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 39.0 bits (89), Expect = 0.045, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 33/62 (53%), Gaps = 5/62 (8%) Query: 63 YVRYRYGERIAEEEKPYLITELP--DSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Y+ Y + EE P+ I+ P ++W++EG LP GGV I + K+NG +L Sbjct: 103 YLAEHYSQ--FLEETPFAISYNPIAEAWIIEGT-LPPGWLGGVIYIALAKENGKLLMMYG 159 Query: 121 SK 122 +K Sbjct: 160 TK 161 >UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMX6_BRASO Length = 110 Score = 38.6 bits (88), Expect = 0.068, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 2/68 (2%) Query: 55 MAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGC 114 A ++AE Y+ Y P ++ + D W V +LP +AGG +I + K + Sbjct: 45 TAARIAERYLAVHYPAFDTIAMPP-IVDDEGDVWKVS-YELPPNMAGGNPVIVVEKTSWK 102 Query: 115 VLNFLHSK 122 VL H + Sbjct: 103 VLRVYHEQ 110 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichi... 165 3e-40 UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavoba... 123 2e-27 UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichi... 116 2e-25 UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitino... 96 2e-19 UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfa... 95 8e-19 UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escheri... 89 4e-17 UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium ... 88 9e-17 UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M... 85 7e-16 UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victiva... 85 8e-16 UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevote... 79 6e-14 Sequences not found previously or not previously below threshold: UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimi... 51 1e-05 UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrh... 43 0.003 UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Breviba... 42 0.006 >UniRef50_P33668 Uncharacterized protein ybbC n=13 Tax=Escherichia coli RepID=YBBC_ECOLI Length = 122 Score = 165 bits (418), Expect = 3e-40, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA Sbjct: 1 MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLA 60 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH Sbjct: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 Query: 121 SK 122 SK Sbjct: 121 SK 122 >UniRef50_C6X3U0 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X3U0_FLAB3 Length = 124 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 6/122 (4%) Query: 6 IFSMLSFFILFACNETA--VYGSDENIIFMRYVEKLHLDKYSVKN---TVKTETMAIQLA 60 I ML + F+CN+ + G D + K + N +K E AI++A Sbjct: 4 ILLMLFVILQFSCNKVSHNKLGIDNAKKELESALKDTTKIAILDNNELLIKEENTAIKVA 63 Query: 61 EIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLH 120 E + YG E EKPY + + WV+ G Y GG F I I+ +N V+N +H Sbjct: 64 EPILFEIYGRSKIEGEKPYEAYLIKNYWVINGTVDRYS-FGGAFSIIIDARNSKVINVIH 122 Query: 121 SK 122 K Sbjct: 123 YK 124 >UniRef50_P28911 Uncharacterized protein yhhH n=15 Tax=Escherichia RepID=YHHH_ECOLI Length = 127 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 54/96 (56%), Positives = 67/96 (69%), Gaps = 2/96 (2%) Query: 27 DENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD 86 + NI MR E KY+V+N VK + +A++LAEIYV+ RYG+ AEEEKPY ITEL Sbjct: 34 NNNIKIMRKYESE--GKYTVRNLVKNKAIALELAEIYVKNRYGQDAAEEEKPYEITELTT 91 Query: 87 SWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 SWVVEG ++AGGVFIIEI K +G +LNF H K Sbjct: 92 SWVVEGTIHSDQIAGGVFIIEIGKNDGRILNFGHGK 127 >UniRef50_C7PKA4 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PKA4_CHIPD Length = 105 Score = 96.4 bits (238), Expect = 2e-19, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 35 YVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDS-WVVEGA 93 ++ DK S + V E A ++AE YGERI + EKPY+++ + DS W V+G+ Sbjct: 19 SAKENTNDKTSASDYVPDEETAKKIAEAIWLPIYGERIYD-EKPYVVSLVGDSVWAVDGS 77 Query: 94 KLPYEVAGGVFIIEINKKNGCVLNFLHSK 122 P + GGV IEI K + +L +HSK Sbjct: 78 L-PKKKRGGVAYIEIQKNDCKILKVIHSK 105 >UniRef50_B8F9G3 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F9G3_DESAA Length = 139 Score = 94.8 bits (234), Expect = 8e-19, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYE-----VAG 101 V E AI++AE YG +I + +KP++ D W+V+G + + + G Sbjct: 60 NGYVPDEETAIRIAEAVWLPIYGPQIYQ-DKPFVAKLYGDEWLVKGTYVIPDDLNEIMRG 118 Query: 102 GVFIIEINKKNGCVLNFLHSK 122 GV I K +G +L H++ Sbjct: 119 GVPYAVIRKIDGKILAVTHTR 139 >UniRef50_C6V8M1 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C6V8M1_ECOBD Length = 91 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 30/74 (40%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Query: 49 TVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEI 108 V + MA++L+ +Y++Y YG+ AE +KPY IT+ + W +EG + + GG F I I Sbjct: 20 LVNSREMALELSYVYIKYVYGKEKAEFQKPYSITDDNNCWKIEGKQ--PKTLGGNFTILI 77 Query: 109 NKKNGCVLNFLHSK 122 KK+G VL+ +H+K Sbjct: 78 AKKDGQVLHVIHTK 91 >UniRef50_A5FGG2 Hypothetical lipoprotein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FGG2_FLAJ1 Length = 115 Score = 87.9 bits (216), Expect = 9e-17, Method: Composition-based stats. Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 15/126 (11%) Query: 1 MKYSSIFSMLSFFILF-ACNETAVYGSDENIIFMRYVEKLHLDKYSV--KNTVKTETMAI 57 MK ++ ++ I F +C++ + V K +DK ++ K+ V AI Sbjct: 1 MKKHNVLLVVFLLIFFNSCSQ--------EKTKSQKVVKSVVDKPNLVFKDLVPDNETAI 52 Query: 58 QLAEIYVRYRYGERIAEEEKPYLITE-LPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVL 116 ++AE + YG++I + ++P++ T P+ W VEG + GGV IEI KK+ +L Sbjct: 53 KIAEAILVPIYGKKIYK-QRPFVATLKSPNVWAVEGTLHTTK--GGVAYIEIQKKDCKIL 109 Query: 117 NFLHSK 122 H K Sbjct: 110 KVYHEK 115 >UniRef50_C6M7N1 Putative lipoprotein n=2 Tax=Neisseria RepID=C6M7N1_NEISI Length = 96 Score = 84.8 bits (208), Expect = 7e-16, Method: Composition-based stats. Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 TE ++L EIYV YGE+ A+ +KPY + + + WV+ G P + GG F I Sbjct: 29 TEQQVLRLTEIYVTQHYGEQTAQAQKPYRVKKDGEHWVISGK--PPKALGGNFRAVIG-A 85 Query: 112 NGCVLNFLHSK 122 NG + HSK Sbjct: 86 NGQLEEITHSK 96 >UniRef50_D1N0G5 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0G5_9BACT Length = 336 Score = 84.8 bits (208), Expect = 8e-16, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 2/75 (2%) Query: 48 NTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIE 107 N ++ A+++AE + YG+ + ++P+ + E + + G P V GGV I Sbjct: 264 NLNLSKEDAVKVAETVLVGIYGKEVLR-QRPWRVVESETEFQISGTLAPSSV-GGVAEIS 321 Query: 108 INKKNGCVLNFLHSK 122 I K + V + H K Sbjct: 322 IRKSDAGVARYTHGK 336 >UniRef50_D1PST3 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PST3_9BACT Length = 142 Score = 78.7 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 3/77 (3%) Query: 47 KNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYLITELPD-SWVVEGAKLPYEVAGGVFI 105 + +A ++A ++ YG + ++EKPY + W++ G+K GGV Sbjct: 68 SGFIPNAKVAYEVAIAVLKPIYGHYV-DKEKPYKVVLDSKRYWIITGSKDSISK-GGVAE 125 Query: 106 IEINKKNGCVLNFLHSK 122 + + K +G V+ H K Sbjct: 126 VTLRKSDGRVIMVTHGK 142 >UniRef50_B2KDJ6 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDJ6_ELUMP Length = 147 Score = 51.3 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 1/71 (1%) Query: 52 TETMAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKK 111 E AI AE + Y G+ + +KP+ + WVV G ++G V I + K+ Sbjct: 78 NEQSAIAAAEEELGYILGQELMSAQKPFKAAGCDNMWVVYGTNEVGTLSGAV-HIILRKQ 136 Query: 112 NGCVLNFLHSK 122 +G +L + K Sbjct: 137 DGKILQVFYEK 147 >UniRef50_A4YMX6 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YMX6_BRASO Length = 110 Score = 43.2 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 2/68 (2%) Query: 55 MAIQLAEIYVRYRYGERIAEEEKPYLITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGC 114 A ++AE Y+ Y P ++ + D W V +LP +AGG +I + K + Sbjct: 45 TAARIAERYLAVHY-PAFDTIAMPPIVDDEGDVWKVS-YELPPNMAGGNPVIVVEKTSWK 102 Query: 115 VLNFLHSK 122 VL H + Sbjct: 103 VLRVYHEQ 110 >UniRef50_C0ZK58 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZK58_BREBN Length = 161 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 6/74 (8%) Query: 52 TETMAIQLAE-IYVRYRYGERIAEEEKPYLITELP--DSWVVEGAKLPYEVAGGVFIIEI 108 T+ A+ A Y+ Y + + EE P+ I+ P ++W++EG P GGV I + Sbjct: 91 TDEHAVATAIFSYLAEHYSQFL--EETPFAISYNPIAEAWIIEGTL-PPGWLGGVIYIAL 147 Query: 109 NKKNGCVLNFLHSK 122 K+NG +L +K Sbjct: 148 AKENGKLLMMYGTK 161 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.126 0.331 Lambda K H 0.267 0.0395 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 559,085,247 Number of Sequences: 3077464 Number of extensions: 16109321 Number of successful extensions: 50326 Number of sequences better than 1.0e-01: 13 Number of HSP's better than 0.1 without gapping: 21 Number of HSP's successfully gapped in prelim test: 12 Number of HSP's that attempted gapping in prelim test: 50274 Number of HSP's gapped (non-prelim): 33 length of query: 122 length of database: 1,040,396,356 effective HSP length: 88 effective length of query: 34 effective length of database: 769,579,524 effective search space: 26165703816 effective search space used: 26165703816 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 87 (38.2 bits)