BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (300 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 621 e-177 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 596 e-169 UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 312 8e-84 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 306 4e-82 UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 305 1e-81 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 299 7e-80 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 299 7e-80 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 288 1e-76 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 282 1e-74 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 261 2e-68 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 243 6e-63 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 242 9e-63 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 237 5e-61 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 227 4e-58 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 226 6e-58 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 211 3e-53 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 204 4e-51 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 203 5e-51 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 190 4e-47 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 184 4e-45 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 181 3e-44 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 179 9e-44 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 177 4e-43 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 164 3e-39 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 164 4e-39 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 151 2e-35 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 143 8e-33 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 135 2e-30 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 129 1e-28 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 128 3e-28 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 127 5e-28 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 121 2e-26 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 115 2e-24 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 115 2e-24 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 112 1e-23 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 111 3e-23 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 111 4e-23 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 104 5e-21 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 102 1e-20 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 102 2e-20 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 98 4e-19 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 96 1e-18 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 96 1e-18 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 93 1e-17 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 91 5e-17 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 89 1e-16 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 88 3e-16 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 86 1e-15 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 86 2e-15 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 85 2e-15 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 85 3e-15 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 84 4e-15 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 83 1e-14 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 76 1e-12 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 75 3e-12 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 75 4e-12 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 75 4e-12 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 74 5e-12 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 74 7e-12 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 72 2e-11 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 70 1e-10 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 69 2e-10 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 69 2e-10 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 69 2e-10 UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=... 69 3e-10 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 68 4e-10 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 68 4e-10 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 67 8e-10 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 67 8e-10 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 67 8e-10 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 65 2e-09 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 65 2e-09 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 65 2e-09 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 65 3e-09 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 65 4e-09 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 64 5e-09 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 64 6e-09 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 64 9e-09 UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 63 1e-08 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 63 1e-08 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 63 1e-08 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 61 5e-08 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 60 7e-08 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 60 7e-08 UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ... 59 2e-07 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 59 3e-07 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 53 1e-05 UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 52 3e-05 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 52 3e-05 UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 52 3e-05 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 50 1e-04 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 50 1e-04 UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 50 1e-04 UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobac... 49 2e-04 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 49 2e-04 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 49 2e-04 UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF... 48 4e-04 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 47 6e-04 UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 47 8e-04 UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 47 9e-04 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 45 0.004 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 45 0.004 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 44 0.005 UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orienti... 44 0.006 UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 44 0.006 UniRef50_C5UQX3 Putative uncharacterized protein n=1 Tax=Clostri... 44 0.009 UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 43 0.015 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 43 0.017 UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 42 0.020 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 42 0.021 UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobac... 42 0.022 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 42 0.026 UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 42 0.037 UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 41 0.054 UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 41 0.056 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 621 bits (1602), Expect = e-177, Method: Compositional matrix adjust. Identities = 300/300 (100%), Positives = 300/300 (100%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS Sbjct: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL Sbjct: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK Sbjct: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES Sbjct: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI Sbjct: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 596 bits (1536), Expect = e-169, Method: Compositional matrix adjust. Identities = 291/314 (92%), Positives = 295/314 (93%), Gaps = 16/314 (5%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MDAPSTTPHDAVFKQFLMHAETARDFL+IHLP ELRELCDL+TLHLESGSFIEESLKGHS Sbjct: 1 MDAPSTTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHS 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL Sbjct: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK Sbjct: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG+S Sbjct: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKS 240 Query: 241 MMTLAQWFE----------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 MMTLAQWFE EKGIEKGIQQGRQEVSQEFA RLLSKGM REDVAE Sbjct: 241 MMTLAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPREDVAE 300 Query: 285 MANLPLAEIDKVIN 298 MANLPLAEIDK+IN Sbjct: 301 MANLPLAEIDKLIN 314 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 312 bits (800), Expect = 8e-84, Method: Compositional matrix adjust. Identities = 152/285 (53%), Positives = 207/285 (72%), Gaps = 5/285 (1%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 ++TPHDAVFK FL H +TARDF++IHLP LR+LCDL TL LE SFI+E L+ + +D+L Sbjct: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 +SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY G Sbjct: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHIRQ Sbjct: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG-GESMMT 243 RDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R E +MT Sbjct: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 +A E+G QG+ E + AQ +L +G+ RE V + L Sbjct: 246 IADRLR----EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 306 bits (785), Expect = 4e-82, Method: Compositional matrix adjust. Identities = 154/297 (51%), Positives = 202/297 (68%), Gaps = 13/297 (4%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + TPHDA F+QFL E ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + +DVL Sbjct: 7 TPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDVL 66 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 YS+ GY+HV+IEHQS PDK MAFR++RY+IAAM RHLEA H KLPLV+P+LFY G Sbjct: 67 YSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAKLPLVIPVLFYVG 126 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + +PYP S W D F PELA ++Y+ FPLVD+T+ PDD+IM+HR +A L LLQKHI Q Sbjct: 127 KRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQKHIHQ 186 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMT 243 RD+ L ++L TL+ Y S Q++A+ +Y+LQ G + ++ F L R G+++MT Sbjct: 187 RDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMT 246 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQ------------EFAQRLLSKGMSREDVAEMANL 288 +AQ E+KGIEKG +GR E Q E A+ LL GM E V E L Sbjct: 247 IAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATGL 303 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 305 bits (782), Expect = 1e-81, Method: Compositional matrix adjust. Identities = 148/295 (50%), Positives = 202/295 (68%), Gaps = 13/295 (4%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 TPHDA F+QFL + ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + +DVLYS Sbjct: 9 TPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDVLYS 68 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 ++ GY+HV++EHQS PDK MAFR++RY++AAM RHLEA H KLPLV+P+LFY G+ Sbjct: 69 LKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVLFYTGKR 128 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 +PYP S W D F LA ++Y+S FPLVD+T+ PDDEI HR +A L LLQKHI QRD Sbjct: 129 SPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHIHQRD 188 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMTLA 245 L L+++L ++ GY S SQ++++ +Y++Q G T A+ F L R G+++MT+A Sbjct: 189 LAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDALMTIA 248 Query: 246 QWFEEKGIEKGIQQGRQ------------EVSQEFAQRLLSKGMSREDVAEMANL 288 Q E+KGIEKGIQ G Q E + + A+ +L + R V +M L Sbjct: 249 QQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTGL 303 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 299 bits (766), Expect = 7e-80, Method: Compositional matrix adjust. Identities = 143/293 (48%), Positives = 201/293 (68%), Gaps = 1/293 (0%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 TPHDAVFKQFL ETA+DF +I LP E++ LCDL++L +ESGSFI+ +K + +D+LYS Sbjct: 14 TPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNYQSDILYS 73 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 V GY++V+IEHQS PDK +A+R+MRYS+AAM +HLE + +LPLV PILFY GE Sbjct: 74 VSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQLPLVFPILFYCGEQ 133 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 +P+P S W D F +LA +YN+PF L D+T D EIMQH+RIA+LELLQKHIR+RD Sbjct: 134 SPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELLQKHIRRRD 193 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-DLFYGVLRDRETGGESMMTLA 245 + LL+ +V L+ Y + +Q++ M NY++Q G+ ++ + + + E ++MT+A Sbjct: 194 MTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKHEGALMTIA 253 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 Q EE GI+KGIQQG Q+ E A++ L+ G+ R V L E++K N Sbjct: 254 QQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNKFEN 306 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 299 bits (766), Expect = 7e-80, Method: Compositional matrix adjust. Identities = 153/309 (49%), Positives = 209/309 (67%), Gaps = 17/309 (5%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 ++ +++PHDAVFK F+ ETARDFLEIHLP LR+LC+L TL LE SFIE+SL+ + + Sbjct: 3 ESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYS 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DVL+SV+ GY++ VIEHQS +K MAFR+MRY+ AAM RHL+ +D++PLVVP+LF Sbjct: 63 DVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLF 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y GEA+PYP S+ W D F P+LAR++Y FPLVDITI PDDEIMQHRRIA+LEL+QKH Sbjct: 123 YHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKH 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET-GGES 240 IR RDL+ +++++ TL+ G+T+ SQL + NY+LQ G T + F + +R E Sbjct: 183 IRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQKEI 242 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQE----------------FAQRLLSKGMSREDVAE 284 +MT+A+ ++G + G Q+G+ E QE A R+L +G RE V Sbjct: 243 LMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREIVLA 302 Query: 285 MANLPLAEI 293 L A+I Sbjct: 303 ATQLTDADI 311 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 288 bits (738), Expect = 1e-76, Method: Compositional matrix adjust. Identities = 139/302 (46%), Positives = 203/302 (67%), Gaps = 9/302 (2%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + TPHDAVF+QFL TA+DF +I LP +++ LCD TL ESGSFI+ +K + +D+L Sbjct: 6 TPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPYQSDIL 65 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 YSV G GY++ +IEHQS PDK MA+R+MRYS+AAM RHLEA HDKLPLV P+LFY G Sbjct: 66 YSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDKLPLVFPVLFYCG 125 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 E +P+P S W D F P++A ++Y+ PF L+D+T DD IMQHRR+A+LEL+QKHIR+ Sbjct: 126 EKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELIQKHIRR 185 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMT 243 RD+ LL+ +V L+ Y + +Q+V M NY++Q G+ F + R E E++MT Sbjct: 186 RDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKHEEALMT 245 Query: 244 LAQWFEEKGI--------EKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 +A+ +++G ++GIQQG + + A+++LS+G++R+ V L +D Sbjct: 246 IAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLSDNALDN 305 Query: 296 VI 297 ++ Sbjct: 306 LM 307 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 282 bits (722), Expect = 1e-74, Method: Compositional matrix adjust. Identities = 139/274 (50%), Positives = 192/274 (70%), Gaps = 2/274 (0%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + TPHDA+FK+FL H +TARDFLEIHLP LR +CDL+TL LESGSFIE++L+ H +D+L Sbjct: 6 TPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVHYSDIL 65 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 YS++ Y++ VIEHQS PDK MAFR+MRYSI+AM HLE H KLPLV+P+LFY G Sbjct: 66 YSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKKLPLVIPVLFYHG 125 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + PYP S WFD F + LA +Y+S FPLVD+T+ PDDEI+ H+R+A+LE++QKHIRQ Sbjct: 126 KIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIVQKHIRQ 185 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMT 243 RD+ L ++L L Y + L +M NY+L G T + F L ++ E +MT Sbjct: 186 RDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKYEEVLMT 245 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 +AQ + KG ++G+++G Q+ Q+ + L +G+ Sbjct: 246 IAQKLQHKGHQEGLKEGLQKC-QDAREEGLQEGL 278 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 261 bits (667), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 132/292 (45%), Positives = 187/292 (64%), Gaps = 13/292 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA+FKQFL ARDFL IHLP +RE CD NTL LES SFI+E L+ +DVLYS+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 GY++ VIEHQS+P+K+MAFR++RY +AAM +HL+ HD+LPLVVP+LFY G + P Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDRLPLVVPLLFYHGRSRP 123 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM 188 YP S+ W D F +P LA+ +Y PFPLVD+T+ PDDEI HRR+A+LEL+QKHIR RD++ Sbjct: 124 YPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRDML 183 Query: 189 LLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWF 248 L ++ L + S + ++ M ++ G+ R + G LAQ Sbjct: 184 ELAREIGLLFERWAAPLS--IGQEDIMTIAEQLKKMGFDEGIQRGIQQG------LAQ-- 233 Query: 249 EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 G+E+GI+QG + +++ A+ LL GM + V + L E+++++ I Sbjct: 234 ---GLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLETEELEQLVTAI 282 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 243 bits (620), Expect = 6e-63, Method: Compositional matrix adjust. Identities = 127/313 (40%), Positives = 185/313 (59%), Gaps = 25/313 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + + TPHDA F+ FL + + ARDFLE+HLP E R+LCDL+TL LE +F+E L +++ Sbjct: 6 NTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYAS 65 Query: 62 DVLYSVQMQGNP-GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D+L+SV+ G GY++ +IEHQS + M FRM+RYS+AAM RHLE H LPLV+P+L Sbjct: 66 DILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLE-QHKTLPLVIPVL 124 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FY GE +PYP SM W D F +P LA ++Y PFPLVDIT+ D+EIM HRR+A L LL K Sbjct: 125 FYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLMK 184 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRD+++ L+ LV + + Q+ + NY+L + + + +S Sbjct: 185 HIRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNYLLNGSEHVTVEFLQTLAQRLPQHEDS 243 Query: 241 MMTLAQWFE-----------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVA 283 +MTLA+ + + + +Q+ R E A+ L + GM + Sbjct: 244 IMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAR-----EIARELRNAGMPAAQIC 298 Query: 284 EMANLPLAEIDKV 296 ++ L AE+ + Sbjct: 299 QLTGLSEAELKNI 311 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 242 bits (618), Expect = 9e-63, Method: Compositional matrix adjust. Identities = 124/274 (45%), Positives = 182/274 (66%), Gaps = 4/274 (1%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA+FKQFL H E ARDF +HLP + LCDL+TL LE SF+E L+ +DVLYSVQ Sbjct: 10 HDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQLHSDVLYSVQ 69 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-EADHDK-LPLVVPILFYQGEA 126 M GY++ +IEHQSKPD+ M FR+M Y+++A+ HL ++ DK LPLVVP LFYQG Sbjct: 70 MTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVVPFLFYQGSV 129 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 PYP SM W D F P LA+++Y FPLVD+++ D+EI+ H+ IA+LEL+QKHIR RD Sbjct: 130 CPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLELVQKHIRTRD 189 Query: 187 -LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM-TL 244 LM +L + +I+ + + Q+ ++ Y+ +G+ F+ L ++M+ T+ Sbjct: 190 GLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSPEYKTMLTTI 249 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 A+ E+KGIEKGI++G ++ ++ ++ + KG+ Sbjct: 250 AEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIG 283 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 237 bits (604), Expect = 5e-61, Method: Compositional matrix adjust. Identities = 122/293 (41%), Positives = 182/293 (62%), Gaps = 6/293 (2%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + PHDA+FK+FL H AR FLEIHLP +RE CDL+ L + +FIE L +DVL Sbjct: 5 SAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYSDVLL 64 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 S++ GY++ +IEHQS PDK M RMMRY++AA+ RHL+ H +PLV+PILFYQG+ Sbjct: 65 SMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHDVPLVIPILFYQGK 124 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 +PYP SM W + F +P LA++++ FPLVD+T+ PD+EIM HR +A LE+ K IR R Sbjct: 125 TSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKIIRLR 184 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL--RDRETGGESMMT 243 D++ ++ + TL+ Y + + Y+L+ G+T+ + +L + G+ M Sbjct: 185 DILENIDPMATLLALDYNDDLS-IDVVFYLLRYGNTDDREKIVKILIQAKPQLEGKIMTI 243 Query: 244 LAQWFEE---KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 QW +E +G ++G ++GRQEV E AQR+L + + ++ L E+ Sbjct: 244 EEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGEL 296 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 227 bits (578), Expect = 4e-58, Method: Compositional matrix adjust. Identities = 111/262 (42%), Positives = 167/262 (63%), Gaps = 2/262 (0%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD+ FK F+ + ARDF EI+LP ++ LC+L+TL L S SFI+++L+ +D+LYSV Sbjct: 13 PHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSRFSDMLYSV 72 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 Q GY ++++EHQS PDK M +R+M Y+ AM++HL+ ++ LPLVVPILFY G+ + Sbjct: 73 QTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNALPLVVPILFYHGKQS 132 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR-D 186 PYP S W D F +LA +Y +P PLVD+T+ DDEI+ HR++A +EL+ KH R D Sbjct: 133 PYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELVLKHSTLRDD 192 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG-GESMMTLA 245 L++L E+L +I E ++ + NY+ T L ++ G E++MT+A Sbjct: 193 LIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEGYQETVMTIA 252 Query: 246 QWFEEKGIEKGIQQGRQEVSQE 267 +G+EKG+ +GR+E E Sbjct: 253 DRLRNEGLEKGLIKGREEGKAE 274 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 226 bits (577), Expect = 6e-58, Method: Compositional matrix adjust. Identities = 107/260 (41%), Positives = 161/260 (61%), Gaps = 2/260 (0%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 PHD+ FK F+ + ARDF E+HLP ++ LC+ +TL L S SF++++L+ +D+LY Sbjct: 7 VAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSRFSDMLY 66 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 SVQ GY + ++EHQS PDK M +R+M Y+ AM++HL+ H LPLVVPILFY G Sbjct: 67 SVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQGHQSLPLVVPILFYHGN 126 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 +PYP S W D F +LA +Y +P PLVD+T+ DDE+M HR++A +EL+ KH R Sbjct: 127 QSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELVFKHASLR 186 Query: 186 -DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMT 243 D+ L E+L +++ ++ + NY+ T L D+ E E++M Sbjct: 187 GDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEKHQETVMN 246 Query: 244 LAQWFEEKGIEKGIQQGRQE 263 +AQ +G+EKG+++GR+E Sbjct: 247 IAQRLRNEGMEKGMEKGRKE 266 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 211 bits (536), Expect = 3e-53, Method: Compositional matrix adjust. Identities = 102/217 (47%), Positives = 147/217 (67%), Gaps = 1/217 (0%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK FL +TARDFL +HLP ++R L+TL LE GSF+++ L+ +DVLYSV+ Sbjct: 12 HDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLRELHSDVLYSVE 71 Query: 69 M-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 +G+ GY++ ++EHQS D+ MA+RMMRYS+A M HL+ + LP+VVP+LFYQG Sbjct: 72 TAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGNGTLPVVVPLLFYQGMVR 131 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 PYP S W D F P LAR VY+ P+PLVD+++ D ++ HRR+A+LEL+Q+ IR RD Sbjct: 132 PYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLELVQRDIRHRDA 191 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA 224 LL +V LI + +Q+ A+ Y++ G T ++ Sbjct: 192 ASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSES 228 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 204 bits (518), Expect = 4e-51, Method: Compositional matrix adjust. Identities = 109/248 (43%), Positives = 156/248 (62%), Gaps = 14/248 (5%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 +TPHD +FK+F AR+F EIHLP + ++ +L + GSFI++SLK +D++Y Sbjct: 4 STPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDMVY 63 Query: 66 SVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 S + G GYL+ V+EHQS DK MAFRM +YS+A M +HL+ HD LPLV+P+LFY G Sbjct: 64 SFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDTLPLVLPVLFYHG 123 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + +PYP SM W D F ELAR + + PFPLVD+T+ P++EIM+H I+ LE+ QK + Sbjct: 124 QKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMVHT 183 Query: 185 RDLM------LLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 RD+M + L++L L DE + S + Y+ Q G T LF+ L T Sbjct: 184 RDMMEIAPYLIRLDKLFPLNDELFKS------LLYYLFQEGETADRMLFFDALSST-TQR 236 Query: 239 ESMMTLAQ 246 E++MT+A+ Sbjct: 237 ENVMTIAE 244 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 203 bits (517), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 105/275 (38%), Positives = 167/275 (60%), Gaps = 17/275 (6%) Query: 41 LNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIA 100 L+TL + SGSFIE+ L +D+LYS++ Y++ +IEHQS P+ MAFR++RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 101 AMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITI 160 AMHRHLE ++ +LP+V+PILFY G +PYP + W D F +LA VY FPLVD+T Sbjct: 63 AMHRHLEQENKQLPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVTA 122 Query: 161 TPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 D+EI++HRR+A++E++QKHIR R+++ L +L L+++ S Q + Y++ G+ Sbjct: 123 MEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAGN 182 Query: 221 TEQADLFYGVL-RDRETGGESMMTLAQWFEEKGIEK----------------GIQQGRQE 263 T + F L + + E MMT+A+ E KG++K GIQ G+++ Sbjct: 183 TTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKKQ 242 Query: 264 VSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + + A++ L G+ R+ V L +I+ V+N Sbjct: 243 ATLKIARQFLVNGVERDIVKMSTGLTDRDINDVLN 277 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 190 bits (483), Expect = 4e-47, Method: Compositional matrix adjust. Identities = 102/296 (34%), Positives = 171/296 (57%), Gaps = 11/296 (3%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HDA+FK F E A F+ I+LP +++ CD +TL +E GSF++ LK H +D+LYS Sbjct: 7 NAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDILYS 66 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ G GY+++ +EHQS ++ M FRM RY +A M +HL + KLPLV+ +LFY G+ Sbjct: 67 LKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKKLPLVISMLFYHGKG 126 Query: 127 TPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 YP + D A+ ++ P L+D+ + PD+EI +H+++A LE++QKHI R Sbjct: 127 Q-YPYCLKLIDCVEDTPFAKAHFFDDPL-LIDLNVLPDEEIYRHKQLAFLEIVQKHIFTR 184 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 DL + + +V L+ + + YML +G T + L+ E E +M A Sbjct: 185 DLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKLKTIEDYEEDIMNAA 244 Query: 246 QWFEEKGIEKGIQQGRQEVSQE--------FAQRLLSKGMSREDVAEMANLPLAEI 293 Q +++G ++G+ +GRQE Q+ A++L+++G S + + ++ NL E+ Sbjct: 245 QQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEV 300 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 184 bits (466), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 95/265 (35%), Positives = 155/265 (58%), Gaps = 10/265 (3%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 + HDA FK+F+M+ A+DF IHL EL+ CD +TL L++ SFI+ L+ +D+LYS Sbjct: 8 SSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSRMSDILYS 67 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 V+ + ++ +IEHQS+PDK +A+RMM Y+ M++HL+ + LPLVVPILFY G+ Sbjct: 68 VKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQGYTSLPLVVPILFYHGKR 127 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 PYP S+ W D F LA ++Y + F L+D+ D+ ++ HR+ A++E+ KH+ D Sbjct: 128 KPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAMKHVNSCD 187 Query: 187 LMLLLEQLVT-LIDEGYTSGSQLVAMQNYMLQRGHTEQAD---LFYGVLRDRETGGESMM 242 + L L++ I++ S +A+ Y+ + AD + + + E++M Sbjct: 188 DLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSI--MDAADFESIINKIAEQVDNHRETIM 245 Query: 243 TLAQWFEEKGIE----KGIQQGRQE 263 +A E KG + +GI+ G+ E Sbjct: 246 NIAWRLENKGFKLGKMEGIEIGKNE 270 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 181 bits (458), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 105/287 (36%), Positives = 158/287 (55%), Gaps = 9/287 (3%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA F++ L ARDFLE L + C+L+T+ LE +F+ ESL+ + DVL S++ Sbjct: 12 HDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSACDVLLSMK 71 Query: 69 MQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 G GY++ +IEHQS PDK + RMMRY +A M +H+E +H P+V+P+LFY G Sbjct: 72 TNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIE-EHKCAPVVIPVLFYHGAKR 130 Query: 128 PYPLSMCWFDMFYSPELARRVYN--SPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 PYP M W D P R +Y PF LVD++ DDEI + R+A L K Sbjct: 131 PYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMFTMKSGTSG 190 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 D++ L+ + +TL D+ Y S L + Y+L+ + A+L V + +MT+A Sbjct: 191 DVIELIGKSITLTDK-YGSSVHLNTVLTYLLELYQMDFAELSEAVSTHYPSHKGVIMTIA 249 Query: 246 QWFEE----KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 + EE KG+EKG+++GR E + +G S E++ + +L Sbjct: 250 EQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDL 296 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 179 bits (454), Expect = 9e-44, Method: Compositional matrix adjust. Identities = 89/189 (47%), Positives = 122/189 (64%), Gaps = 9/189 (4%) Query: 95 MRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP 154 MRY+IAAM HL+A + LP+VVP+LFY G +PYP S+CW D F P LAR++Y S FP Sbjct: 1 MRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAFP 60 Query: 155 LVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 L+D+T+ PDDEIM HRR+A+LEL+QKHIRQRDLM L+EQ+ L+ GY +G Q+ + NY Sbjct: 61 LIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFNY 120 Query: 215 MLQRGHTEQ-ADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 +LQ G + D GV + S+MT+A E+ Q+G Q + A+ +L Sbjct: 121 ILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIA--------ERLRQEGEQSKALHIAKIML 172 Query: 274 SKGMSREDV 282 G+ D+ Sbjct: 173 ESGVPLADI 181 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 177 bits (449), Expect = 4e-43, Method: Compositional matrix adjust. Identities = 112/289 (38%), Positives = 154/289 (53%), Gaps = 18/289 (6%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 TPHDA F+QFL + ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + +DVLYS Sbjct: 9 TPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDVLYS 68 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSI-AAMHRHLEADHDKLPLVVPILFYQGE 125 ++ + + S+ + F + AAM RHLEA H KLPLV+P+LFY G+ Sbjct: 69 LKTTAGDDIFMSWL-NTSQHLTNICFPPDTLCVGAAMQRHLEAGHKKLPLVIPVLFYTGK 127 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFP-LVDITITPDDEIMQHRRIAILELLQKHIRQ 184 +PYP S W D F R+ LVD+T+ PDDEI HR +A L LL ++I Sbjct: 128 RSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLLPENIF- 186 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQN----YMLQRGHTEQADLFYGVLRDRETGGES 240 + + +T Y S +A Y R + + L G++ Sbjct: 187 --ISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNIRRRSLCTRTGTACAQHGDA 244 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 +MT+AQ E+KGIEKGIQ G QR + KG S + + N P Sbjct: 245 LMTIAQQLEQKGIEKGIQLGE--------QRGIEKGRSEGEREDSENSP 285 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 164 bits (415), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 83/239 (34%), Positives = 136/239 (56%), Gaps = 2/239 (0%) Query: 24 RDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQ 83 + F IHLP EL+ CD +TL L++ SFI+ L+ +D+LY V+ + ++++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 84 SKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPE 143 S+PDK +A+RMM Y+ M++HL+ + LPLVVPILFY G+ PYP + W + F Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKSLPLVVPILFYHGKKKPYPFPVNWMECFPLSS 125 Query: 144 LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ-RDLMLLLEQLVTLIDEGY 202 LA +Y++ F L+D+T DD ++ H++ A++E+ KH+ DL + L I++ Sbjct: 126 LANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQKN 185 Query: 203 TSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQG 260 VA+ Y+ + + +R + E++M +A E KG + GI +G Sbjct: 186 CRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEG 244 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 88/207 (42%), Positives = 124/207 (59%), Gaps = 7/207 (3%) Query: 90 MAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVY 149 M FRM+RYS+AAM RHLE H LPLV+P+LFY GE +PYP SM W D F P LA ++Y Sbjct: 1 MPFRMLRYSVAAMQRHLE-QHKTLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKIY 59 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV 209 PFPLVDIT+ D+EIM HRR+A L LL KHIR RD+M LL++L ++ E S Q+ Sbjct: 60 TKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMVE--ISDEQVR 117 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 + +Y++ G + + + + +MT+A+ E +KG Q+G E + A Sbjct: 118 VLIHYIVNAGDSVSPEFMRALAERLPQHEDKLMTIAERLE----QKGRQEGALEKALAIA 173 Query: 270 QRLLSKGMSREDVAEMANLPLAEIDKV 296 +L GM+ E + + L AE+ + Sbjct: 174 CQLQKMGMTPEQIKQATGLSEAELKNI 200 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 151 bits (382), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 88/297 (29%), Positives = 157/297 (52%), Gaps = 7/297 (2%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK L A FL+ L E+ +L ++ TL L SF+ + +D++Y Q Sbjct: 9 HDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIHSDIVYQCQ 68 Query: 69 MQGNPGYLHVVIEHQSKPDKK-MAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 + GY+ ++EH+S + MAFR ++Y+I+AM ++ + KLP+V+PI Y G + Sbjct: 69 INEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKKLPIVLPICVYHGIKS 128 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 PYP S +D F + ++AR++ PF L+D+T+ D+E+ + ++E+L KH R ++ Sbjct: 129 PYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLLKHSRAKNF 188 Query: 188 MLLLEQLVTLIDEGYTSGSQLVA--MQNYMLQRGHTEQADLFYGVLRDRETG----GESM 241 + +L + + I + + YM+ E + +++ T +M Sbjct: 189 LSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLSTAFPEEKNTM 248 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 MT AQ ++G+E+G++QGR E + A+ LL GMS + V + L E+ ++N Sbjct: 249 MTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEVMNLVN 305 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 143 bits (360), Expect = 8e-33, Method: Compositional matrix adjust. Identities = 78/175 (44%), Positives = 107/175 (61%), Gaps = 19/175 (10%) Query: 133 MCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLE 192 MCW F P++ARR+Y FPL+DIT TPDDEIM+HRR+A+LELLQKHIRQRDLM L E Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 193 QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD---RETGGESMMTLAQWFE 249 QLV L+ GYTS QL + +Y+LQ G+ F L R E++M +AQ+ E Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 250 EK----------------GIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 ++ GIE+GI+QG Q+ ++ A+ +L+ G+ VA++ L Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGL 175 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 91/302 (30%), Positives = 160/302 (52%), Gaps = 17/302 (5%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + PHD + K L H E ++F + + P ++ + DL +L L + S++ E L+ D+++ Sbjct: 10 SNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHNDLVF 69 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPI-LFY 122 S + PGY V+EHQS PD MA R ++Y+IA + +++ +K P++V I L++ Sbjct: 70 SFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNICLYH 129 Query: 123 QGEATPYPLSMCWFDMFYSPELARRV-YNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 PYP S +D+F P A+ + + F L D+ TP++ + QH I ++E L K+ Sbjct: 130 NANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEKLLKY 189 Query: 182 IRQRDLMLLLEQLVT-----LIDEGYTSGSQLVAMQNYMLQRGHTEQ--ADLFYGVLRDR 234 R RD+ ++E+ + LI G + L+ + Q +E+ LF VL Sbjct: 190 SRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEVLSKN 249 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 E E M+T+AQ EE+G +G + R++++ A+ +L KG + E+ L +I+ Sbjct: 250 EE--EIMITIAQTIEERGEMRG--KRREKIA--IAKNMLKKGCEISFIEEITGLSRKDIE 303 Query: 295 KV 296 K+ Sbjct: 304 KL 305 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 129 bits (324), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 82/291 (28%), Positives = 148/291 (50%), Gaps = 43/291 (14%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK+ + AR+FLE +LPV + +LN++ +E SF+ E L+ +DV+YSV Sbjct: 41 HDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRLSDVVYSVS 100 Query: 69 MQ--------------GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH---- 110 ++ + Y++V+IEHQS D +AFR+ +Y + RH +A++ Sbjct: 101 LKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHKDANNNKSS 160 Query: 111 ------DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDD 164 +KLPL+ PI+ Y + PY ++++F + A+ + + LVD+ DD Sbjct: 161 VTKEKDNKLPLICPIVVYANDK-PYNAPRSFWELFEDSKTAKDMMGDEYLLVDLQKQSDD 219 Query: 165 EIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYML-------- 216 EI + + + ++E + KHI+ RD++ L + L+ E + S ++ Y+ Sbjct: 220 EIEKKKHLGMMEYMLKHIKARDILNLWQSLL----EKFESSIEIDKENGYIYIKWLLWYS 275 Query: 217 -----QRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQ 262 + E A + L+ +E E M T+A + ++G++KG+ QG Q Sbjct: 276 DAKVSEDKQVELASIIAKHLK-KEDQEELMRTIADKYIDEGVQKGMVQGMQ 325 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 128 bits (321), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 84/261 (32%), Positives = 143/261 (54%), Gaps = 13/261 (4%) Query: 12 VFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQG 71 +F++ L + A +F HLP ++ L D +L +E+ +F+E SLK +DVL+S + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEATPY 129 GYL +++EHQSK D MAFR+ +Y I R+L LPL+ P++F+ G+ Y Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQ-EKY 141 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 ++ +D+F + +LA+ ++ + + LV++ PD+E Q ILE KHI +R+L+ Sbjct: 142 NVARNLWDLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 190 LLEQLVTLIDE--GYTSGSQLVAM-QNYMLQRGHTEQAD--LFYGVLRDR---ETGGESM 241 +++ ++ E T G + M Y L + EQAD +L + E G M Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTK--IEQADKIKLKNLLSTKLNPEIGTRLM 259 Query: 242 MTLAQWFEEKGIEKGIQQGRQ 262 +LA+ ++++G E GI +G Q Sbjct: 260 RSLAEHWQQEGKEIGILEGLQ 280 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 127 bits (319), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 66/189 (34%), Positives = 113/189 (59%), Gaps = 3/189 (1%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HDA+ K+ L A++FLE +LP + +EL DL + +E SF+E+ LK +D++YSV Sbjct: 6 KHDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSV 65 Query: 68 QMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + +++V+IE QS D +A R+ +Y + RH E + +KLPL+ P+L Y G + Sbjct: 66 KTRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH-ENNKNKLPLICPLLIYNG-S 123 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y ++++F PE A+++ + LVD+ DDEI Q + + ++E KHI QRD Sbjct: 124 EVYNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRD 183 Query: 187 LMLLLEQLV 195 ++ L ++ + Sbjct: 184 MLKLWDEFL 192 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 121 bits (304), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 78/299 (26%), Positives = 154/299 (51%), Gaps = 11/299 (3%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HD++ K + A++FLE +LP + ++L DL+ + +E S+IEESL +D++Y + Sbjct: 6 KHDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGI 65 Query: 68 QMQG-NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + G+++++IE QS D A R+ +Y++ RH E +KLPLV ++ Y G+ Sbjct: 66 ETKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERHKEK-RNKLPLVYNLVIYNGKQ 124 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 W D+F + +A+++ + LVD+ D+EI++ + I +L+ + KHI +RD Sbjct: 125 VYNAPRNLW-DLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERD 183 Query: 187 LMLLLEQL------VTLID--EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 ++ L EQ V ++D +GY + + + + + + + Sbjct: 184 MIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKD 243 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 M T+A + ++G ++G ++G + A+++ S+G +AE+ L I +I Sbjct: 244 NIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSII 302 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 57/192 (29%), Positives = 104/192 (54%), Gaps = 4/192 (2%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD + + + +++F E+HLP ++ L L +E SF+++ LK D+L+S + Sbjct: 7 HDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILFSAK 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRY--SIAAMHRHLEADHDKLPLVVPILFYQGEA 126 GYL++++EHQS P+ KMA R+ RY IA H+ K P + P++FY G Sbjct: 67 FGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKK-STKSKKFPFIYPLIFYNG-V 124 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y +++F + EL + ++ + L+++ PD+++ + IL+ KHI +RD Sbjct: 125 QKYNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHERD 184 Query: 187 LMLLLEQLVTLI 198 L+ E++ L+ Sbjct: 185 LLKRWEEVADLL 196 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 115 bits (287), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 53/130 (40%), Positives = 80/130 (61%), Gaps = 2/130 (1%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD F+ + A++F E HLP + + DLN+L L+ SFI+E LK DVLYS Sbjct: 6 NPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMADVLYS 65 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-EADHDKLPLVVPILFYQGE 125 V++ PGY ++++EHQ PDK M +R++RY + + HL + D+ LP+VVP++FY G+ Sbjct: 66 VKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLVFYNGK 125 Query: 126 ATPYPLSMCW 135 YP + Sbjct: 126 KR-YPFQRIF 134 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 65/180 (36%), Positives = 103/180 (57%), Gaps = 13/180 (7%) Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 T P + PE A+ +Y PF L+D+T+ PDD+++QHRR+A+LEL+QKHIRQRD Sbjct: 8 TSTPHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRD 67 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMTLA 245 L + E L ++ GYT+ QL + +YMLQ G+T + +F L R E++M++A Sbjct: 68 LSSITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIA 127 Query: 246 QWFEEKGIEKGIQQGRQEVSQE------------FAQRLLSKGMSREDVAEMANLPLAEI 293 Q +++G ++G +GR+E QE A +L G+ +E V ++ L E+ Sbjct: 128 QKLKQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADEL 187 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 111 bits (277), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 82/310 (26%), Positives = 159/310 (51%), Gaps = 23/310 (7%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD FK+ + A+DF+ +LP+EL ++ D+ TL E +IE+ LK +D+L+ Sbjct: 7 PHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLLFKA 66 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM-HRHLEADHDKLPLVVPILFYQGEA 126 + G GYL+ + EH+S P K++A +++ Y + + L+ +K+P+++P+ Y G+ Sbjct: 67 NINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYHGKE 126 Query: 127 TPYPLSMCWFDMFYS----PELARR-VYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + +++ D+ PE R+ + + + D++ DDE+ ++ I+ + + Sbjct: 127 N-WNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKILRS 185 Query: 182 IRQRD--LMLLLEQLVTLID--EGYTSGSQLVAMQNYMLQRGH-----TEQADLFYGVLR 232 I + D + ++ V ++D E G + Y + TE DL V Sbjct: 186 IFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLVKEVSV 245 Query: 233 DRETGGESMMTLAQWF----EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 +R + +MT+A+ EKG+EKG+++G+ E +E A+ L+ G+ + V + L Sbjct: 246 ER---SDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGL 302 Query: 289 PLAEIDKVIN 298 EI+K++N Sbjct: 303 SEEEINKLLN 312 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 59/182 (32%), Positives = 103/182 (56%), Gaps = 3/182 (1%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD F++ L + AR+F E +LP E++ L TL LE+ SFI+ +LK TDVLYS + Sbjct: 56 HDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSAR 115 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEA 126 + Y++++ EHQS D MAFR+ +Y + +HL D K P + P L Y + Sbjct: 116 INNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYP-LVYSNDH 174 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y + +D+F + EL + +++ + L+ + DD++ ++ +A L++L K+I + + Sbjct: 175 KKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKPN 234 Query: 187 LM 188 + Sbjct: 235 VF 236 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 58/199 (29%), Positives = 109/199 (54%), Gaps = 8/199 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK + + A DF+ LP E++ + DLNT+ +E SF+E +L+ DVL+SV+ Sbjct: 7 HDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDVLFSVK 66 Query: 69 MQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIA------AMHRHLEADHDKLPLVVPILF 121 + N +++V+IE + + D +AF++ +Y+++ + + + KLP+VVPI+ Sbjct: 67 TKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIVVPIVV 126 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G A + +++F P+LA+ + S + L+D PD EI + A++ ++ Sbjct: 127 YHG-ADRFNAPRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVHFMKYI 185 Query: 182 IRQRDLMLLLEQLVTLIDE 200 Q D++ L + + E Sbjct: 186 HNQPDIIELWAKFFNTLQE 204 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 102 bits (254), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 87/328 (26%), Positives = 156/328 (47%), Gaps = 36/328 (10%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FK+ AR FL+ +LP E+ L DL T+ + S+I++ L+ +D+L+ Sbjct: 6 NPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFSDLLFQ 65 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-EADHDKLPLVVPILFYQGE 125 V++ N GYL+ + EH+S P + +A ++++Y + L E+ DKLPL++P++ Y G+ Sbjct: 66 VKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMVVYHGQ 125 Query: 126 A---TPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + LS + P + + + + L D++ D E++ + + I+ + Sbjct: 126 EKWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIILRTMRD 185 Query: 182 IRQRDL----MLLLEQLVTLID-EGYTSGSQLV-AMQNYMLQRGHTEQADLFYGVLRDRE 235 I +D +L E L++ E G Q + Y+L + + Y + ++ Sbjct: 186 IFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEIAKEVS 245 Query: 236 -TGGESMMTLAQWF------------------------EEKGIEKGIQQGRQEVSQEFAQ 270 GE MMT+A+ EKG E+G+++GR+E E A+ Sbjct: 246 LERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETKLEVAR 305 Query: 271 RLLSKGMSREDVAEMANLPLAEIDKVIN 298 LL G+ + VA+ L EI K++N Sbjct: 306 NLLGLGIEMDKVAKATGLSEEEIRKLMN 333 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 66/199 (33%), Positives = 113/199 (56%), Gaps = 2/199 (1%) Query: 92 FRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMF-YSPELARRVYN 150 F++ RY A M +HL+ H LP+VV +L+Y+G+ TPYP + FD F + +A ++Y Sbjct: 4 FKIARYVHAIMDQHLKQGHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEKIYL 63 Query: 151 SPFPLVDITITPDDEIMQHRRIAILELLQKHIR-QRDLMLLLEQLVTLIDEGYTSGSQLV 209 P+P++DIT DD I H IAIL+ QK+ RD+ +E ++ + +GY + Q Sbjct: 64 RPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTREQCQ 123 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 + Y + T+ + L+ E +M++A E++G+++G+QQGR E + A Sbjct: 124 TLLYYTFRETDTDNVKMLLEQLQTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEEDLKIA 183 Query: 270 QRLLSKGMSREDVAEMANL 288 +R+L+KG R + ++ L Sbjct: 184 KRMLAKGTDRGYIKDVTGL 202 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 97.8 bits (242), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 85/294 (28%), Positives = 143/294 (48%), Gaps = 30/294 (10%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSF-------IEESLKGHST 61 HD+VFK + + + A FL +LP EL EL D T+ LES + + + + Sbjct: 5 HDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQKEQS 64 Query: 62 DVLYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM------HRHLEADHDKLP 114 D+ + + + G G + V IE Q+ D + R Y + + H+ ++ LP Sbjct: 65 DLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKG----LP 120 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI 174 LVV I++Y + P+ S+ D F + ELA++ Y +D+ D+EI++H IA Sbjct: 121 LVVSIIYYANQK-PFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAG 178 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 EL+ K IR++++ L+ + I E Y ++ V ++ YM Q E D ++ + Sbjct: 179 YELILKAIREKNIDGKLDIAINQI-EAYDHIARQVLIR-YMSQYSDMETKDFHDKLIYSK 236 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 +MT+A+ +E+KGI+KGIQ A+ L G+S E V + L Sbjct: 237 PDLRGDVMTVAEQWEQKGIQKGIQTT--------ARNFLLMGLSAEQVVKGTGL 282 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 80/310 (25%), Positives = 145/310 (46%), Gaps = 18/310 (5%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FK E ARDFL+ +LP E E+ DL+ L E+ S ++E+L+ +D+LY Sbjct: 7 NPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLSDMLYK 66 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++G GY+++++EH+S + K+ F+++RY + + K+P+++P++ Y G Sbjct: 67 TKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYDPKTKKVPIIIPMVIYHGRE 126 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR--Q 184 + + +M E + P I D I + +RI L ++ I + Sbjct: 127 I-WNVETNLLNMVQGIEDLPNELKTYLPTYRYEIC-DFSIKRKKRIIGLTAMKVAIEAMR 184 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQN---------YMLQRGHTEQADLFYGVLRDRE 235 + E+ + + QL Q Y+L + V ++ Sbjct: 185 AGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEILKVQKEIM 244 Query: 236 TG-GESMMTLAQWFEEKGIEKGI----QQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 G GE +MT+A+ +G+EKG ++G+ E +EFA R+LSK + E+ + Sbjct: 245 PGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEGEREFAIRILSKRFGNQLTEEIKDRIR 304 Query: 291 AEIDKVINLI 300 +K I+ I Sbjct: 305 EADEKTIDYI 314 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 81/298 (27%), Positives = 146/298 (48%), Gaps = 20/298 (6%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD K L + TA L LP E+ E + L GSFI+E+L+ H TD LY V Sbjct: 7 PHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLTDRLYRV 66 Query: 68 Q-MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPILFYQG 124 + + G L+V+IEH+S PD ++ +++++Y + A+ + + ++LP +VP +FY G Sbjct: 67 RTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVPFVFYHG 126 Query: 125 EATPYPLSMCWFDMFYSPELARR-VYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 A + + + + + E R + N F ++D+ D ++ + + L K+ Sbjct: 127 -AAAWKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLAAKYAT 185 Query: 184 QRDLM-----LLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD-RETG 237 + D LL++ LV++ DE + + Y+++ + + ++R R Sbjct: 186 RDDRQLEVKELLIQTLVSVADEEFR------FLMRYVVETYRSYDEPMVREIIRRVRPEE 239 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 E+MM++ F + + KG Q+GRQE QE Q + G R E A + L ++ + Sbjct: 240 EETMMSM---FAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEEAAYMLLKQMRR 294 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 92.8 bits (229), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 60/198 (30%), Positives = 103/198 (52%), Gaps = 11/198 (5%) Query: 75 YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMC 134 Y++ +IE+QS +K MAF M+ Y++A M +HL + +LP++V I Y G+ +PYP S Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEGYQELPIIVNICIYTGKKSPYPYSQD 95 Query: 135 WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQL 194 D F ELAR F L+D+++ +E+++ +E L + R+RD + + Sbjct: 96 ICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINNN 155 Query: 195 VTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGGESMMTLAQWFEE--- 250 LI E ++ + + Y+L AD L ++ E ++T AQ + Sbjct: 156 QVLIWELVSNYGLSIVI--YILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVDI 213 Query: 251 -----KGIEKGIQQGRQE 263 KGI++GI+QG++E Sbjct: 214 QTGLIKGIKEGIEQGKEE 231 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 90.9 bits (224), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 76/280 (27%), Positives = 137/280 (48%), Gaps = 21/280 (7%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M +P + PHDAVF++ L A L LP L DL+ L + GS ++ +L+ Sbjct: 1 MSSPPS-PHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRH 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVV 117 TD+L++ + G+ +++V++EHQS D MAFRM+RY + R+L ADH K LP VV Sbjct: 60 TDLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYL-ADHHKAARLPAVV 118 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP----LVDITITPDDEIMQHRRIA 173 P++ + E + + +P+LA + P L+D + D+ ++ R + Sbjct: 119 PLVVHHNEHAWVAPTQVLDLVDLAPDLA-GAWREHLPRFQFLLDDLVRVDERELRERPLT 177 Query: 174 --------ILELLQKHIR-QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA 224 +L+++ + R +DL +++L ++D G + + Y+ G + Sbjct: 178 HSVRLTLLLLKIVPGNPRLAQDLRPWVDELRAVLD-GPDGREEFATLLRYIELVGEADAR 236 Query: 225 DLFYGVLRDRETGGE-SMMTLAQWFEEKGIEKGIQQGRQE 263 D + ++ E + MT+A+ +G +G +GR E Sbjct: 237 DELHDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVE 276 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 51/127 (40%), Positives = 83/127 (65%), Gaps = 2/127 (1%) Query: 162 PDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT 221 PDD+IMQHRR+A+LEL+QKHIR+RDLM L+E+L L+ +G+ + +QL A+ NY++Q G+T Sbjct: 2 PDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGNT 61 Query: 222 EQ-ADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRE 280 + + V + +MT+A+ ++G G+Q+G ++ QE Q L +G RE Sbjct: 62 THFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQG-KRE 120 Query: 281 DVAEMAN 287 + +A+ Sbjct: 121 EALRIAS 127 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 77/275 (28%), Positives = 126/275 (45%), Gaps = 26/275 (9%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD++ K + A D LP + E DL+ L L GSF+ + L+ TD+L+ Sbjct: 6 HDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLLFRAP 65 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD---KLPLVVPILFY--- 122 + G P +L++++EHQS ++ M R++RY + RHL +H LP ++P++ + Sbjct: 66 LDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHL-GEHPGAATLPPILPVVLHHSE 124 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFP-----LVDITITPDDEIMQHRRIAILEL 177 QG P L +F + AR P L D++ PD+ ++ A +L Sbjct: 125 QGWTAPTSLGQ----LFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKL 180 Query: 178 ----LQKHIRQRDLMLLLEQLVTLIDEGYTSGS---QLVAMQNYMLQRGHTEQADLFYGV 230 L+ +DL+ LL +I E T+ L A+ Y LQ T+ D Sbjct: 181 ALWALKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTD-PDALMRF 239 Query: 231 LRDR--ETGGESMMTLAQWFEEKGIEKGIQQGRQE 263 L D + E+ MT A+ + E+ ++QGR E Sbjct: 240 LIDSAGDPAKEAFMTGAEKLTQAVREQSLRQGRVE 274 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 77/272 (28%), Positives = 132/272 (48%), Gaps = 19/272 (6%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA+FK E A L LP L D L L GSF++E+LK +D+L+S Sbjct: 14 HDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFSAS 73 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH---DKLPLVVPILFYQGE 125 M L+++ EHQS + MAFR++RY + HL A+H +LP ++P++ + E Sbjct: 74 MGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHL-AEHPGSKRLPAILPVVLHHSE 132 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFP----LVDITITPDDEIMQHRRIAILELLQ-- 179 T + + + D+ E AR V P ++D DE ++ R ++ L Sbjct: 133 -TGWTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVLW 191 Query: 180 --KHIRQRD-LMLLLEQLVTLIDEGYTSGS---QLVAMQNYMLQRGHTEQAD--LFYGVL 231 +H R+ D L+ L + + L++E + + L A+ Y+L ++AD L + Sbjct: 192 CLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLLA 251 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQE 263 E E +++ A E+G ++G+++G +E Sbjct: 252 AAGEPWKEEIVSAADQLMERGRQQGLREGLRE 283 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 64/259 (24%), Positives = 128/259 (49%), Gaps = 12/259 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD F+ L ARDF+ HLP E+ +L+T+ + S S++ ++LK TD++ +++ Sbjct: 14 HDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMTDIVITLE 73 Query: 69 M-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 + G P +++++EH+S D ++ +Y ++ LP++VP++FY G A Sbjct: 74 LITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQKKTGTLPIIVPLVFYHGTAR 133 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFP--LVDITITPDDEIMQHRRIAILELLQKHI--- 182 + S+ + D+F P R Y F L ++ + ++ + + L+ ++I Sbjct: 134 -WNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLVLEYIFYP 192 Query: 183 RQRDLML-LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT-EQADLFYGVLRDRETGGES 240 +RD + LE L +D + ++ ++ T E+A+ ++ GGE+ Sbjct: 193 EKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATDETPEEAE---EKVKHLPKGGET 249 Query: 241 MMTLAQWFEEKGIEKGIQQ 259 + T A+ EE+G K I++ Sbjct: 250 VRTTAEVLEERGYNKAIKE 268 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 86/334 (25%), Positives = 144/334 (43%), Gaps = 39/334 (11%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M A PHD FK+ E DFL P +RE D TL E +F +E L H Sbjct: 1 MAAQPDNPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHF 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D+++SVQ G P L +++EH+S ++ F++ RY + ++ P V+P+L Sbjct: 61 ADLVFSVQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQKQPLTP-VLPVL 119 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI----MQHRRIAILE 176 Y G S+ + L + + L+D++ D+ + + R+ + Sbjct: 120 VYHGNRRWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAI- 178 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDE--GYTSGSQLVAMQN-YMLQRGHTEQADLFYGVLRD 233 LLQ R+R+L LL+ ++ T+G + V+ Y+ + + +LF R Sbjct: 179 LLQNSRRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVELFGIFSRI 238 Query: 234 RETGGESMMTLAQWFEEKG-----------IEKGIQQGRQ-EVSQ--------------- 266 S MT+A+ ++G E+ IQQGR+ E Q Sbjct: 239 SSKIESSTMTVAEELIQEGRELERRQTRMVAEELIQQGRELERRQAMMAAEELLKQQERQ 298 Query: 267 ---EFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +F + +L+ + +A A LPLAE+D +I Sbjct: 299 NKIKFIKAMLNLNLDAATIATAAELPLAEVDAII 332 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 84.7 bits (208), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 74/305 (24%), Positives = 147/305 (48%), Gaps = 40/305 (13%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + S PH+ FKQ +++ +DFL I L +L + L++L L + K H Sbjct: 3 NKESIQPHNWFFKQVFSNSKNVQDFLSIFLS-DLSQKIQLSSLELVPSEKFSNNQKKHFL 61 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+LY ++ Y+ ++ EH+S DKK+ ++M+Y+ L+ + D P ++ I+F Sbjct: 62 DLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALK-EKDYYPPIINIVF 120 Query: 122 YQGEAT-PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA---ILE- 176 Y G+A +P ++ + EL + + + L+D+ D+ + ++ + I+E Sbjct: 121 YHGQAKWNFPTTIPDIE---DEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLIMEM 177 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ--LVAMQNY--MLQRGHTEQADLFYGVLR 232 L+ KHI R LE++ TL+ + S+ V + NY ++++ + + ++F ++ Sbjct: 178 LIMKHIHDR-----LERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKEII- 231 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 GGE M L F +K +G +G+ E+ RE++ ++ ++ Sbjct: 232 ----GGEEKMML---FTDKLKMEGKMEGKIEI-------------LRENIIDLIDVKFGV 271 Query: 293 IDKVI 297 +DK I Sbjct: 272 VDKSI 276 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 84.3 bits (207), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 75/289 (25%), Positives = 133/289 (46%), Gaps = 25/289 (8%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PH+A FK F E + F++ H+P E+ L DL+TL ++ F+ E + + DV+ + Sbjct: 7 NPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYADVMVT 66 Query: 67 VQMQGNPG--YLHVVIEHQSKPDKKMAFRMMRYSIAA---MHRHLEADHDKLPLVVPILF 121 VQ++G+ +++++EH+S P+ +++ Y + + R + LP+++P++ Sbjct: 67 VQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQL-QGYLPVIIPVVI 125 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFP--LVDITITPDDEIMQHRRIAILELLQ 179 Y G+ + S + D+F P R + F + DI+ DDE + I LL Sbjct: 126 YHGKGR-WNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIFHLLF 184 Query: 180 KHIRQRDLMLLLEQLVTLID---EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 K+I +L L+++ L++ + L A+ Y+ +G L G R Sbjct: 185 KYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISLERL--GEYTRRLP 242 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 GG+ M A QQ RQE EF Q + RE A++ Sbjct: 243 GGDEAMQTA-----------AQQIRQEAYNEFIQEQEKMLVEREKHAKL 280 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 82.8 bits (203), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 85/304 (27%), Positives = 138/304 (45%), Gaps = 21/304 (6%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 T PHD K L + L LP E+ EL L G+FI+ + H TD L+ Sbjct: 5 TQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREHLTDRLF 64 Query: 66 SVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 V+ Q G Y++ +IEH+S D+ +AF+++RY + R L+ KLP +VP++ Y G Sbjct: 65 KVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQQKLPPIVPLVVYHG 124 Query: 125 E---ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ--HRRIAILELLQ 179 P S + L + + F + D+ DD++ Q H R A++ + Sbjct: 125 AREWTVPNQFSAL---LEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALMAM-- 179 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQ--RGHTEQADLFYGVLRDRETG 237 K+ Q +++ + +G ++LV Y++Q RG T AD+ + Sbjct: 180 KYAFQGAEGVVVIPQIGKGAQGDPEFAKLVL--RYLIQTYRGMT-MADV--QAYAEEAFP 234 Query: 238 GESMMTLAQWFEE---KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 GE+ +Q+ E KG ++G Q+GR+E QE Q S + R ++P Sbjct: 235 GEAEHYASQFAREMMSKGRQEGRQEGRREGRQEGRQEGESSLLLRLLHRRFGDVPSWAEL 294 Query: 295 KVIN 298 KV N Sbjct: 295 KVAN 298 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 87/308 (28%), Positives = 144/308 (46%), Gaps = 28/308 (9%) Query: 13 FKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGN 72 +K H E RD L + E D +TL SGS+I E L+ DV++ V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 73 PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL---EA--DHDKLPLVVPILFYQGEAT 127 Y+++++E QS D+ MA R+M Y + +++ L EA + KLP V+PI+ Y GE Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTY-LGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKR 131 Query: 128 PYPLSMCWFDMF--YSPELARRVYNSPFPLVD-ITITPDDEIMQHRR--IAILELLQKHI 182 + + D+ L R N + L+D + D E H R A L L+ + Sbjct: 132 -WTAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNR 190 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQ---LVAMQNYML-QRGHTEQADLFYGVLRDRETGG 238 ++D++ +L LV + +G + +V ++ +L R + F + E Sbjct: 191 DEQDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHD 250 Query: 239 ESMMTLAQW---FEEKGIEKGIQQGRQEVSQEFAQRLLSKG---------MSREDVAEMA 286 + QW +EEKG ++G Q+GR+E QE QR + K +S E +AE Sbjct: 251 MLAERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEAT 310 Query: 287 NLPLAEID 294 L +AE++ Sbjct: 311 GLTVAEVE 318 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 68/258 (26%), Positives = 122/258 (47%), Gaps = 19/258 (7%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK E RDFL LP E+ + D ++L + I S + D++ + Sbjct: 8 HDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHMDLVVECR 67 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 + P +++IEH+S PD ++ +M+RY +A R+ + D+ L V+P++F+QG P Sbjct: 68 ISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRNRQ-DNKPLVPVLPLVFHQG-GRP 125 Query: 129 YPLSMCWFDMFYSPEL--ARRVYNSP--FPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + L + + + F PE A V +P F L ++ T E H ++ L K+ Sbjct: 126 WTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLTLLKYAFS 185 Query: 185 ---RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 D++ L++ DE + L + NY ++ + + + R GGE + Sbjct: 186 GSVEDVLRALKETGGSFDETF-----LFGVLNYAIRAFEVKDPVVVDAI--SRSFGGEKI 238 Query: 242 M--TLAQWFEEKGIEKGI 257 M + +W EE G+++G+ Sbjct: 239 MPSIIDEWVEE-GLKEGL 255 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 76/305 (24%), Positives = 147/305 (48%), Gaps = 18/305 (5%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 T HDA + + + + A D+ +P +++L D +TL +++ + L+ +D++Y Sbjct: 5 TPKHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSISDIVY 64 Query: 66 SVQMQGNPGYLHV--VIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 Q G + + ++EH+S DK ++ Y + + + + + + L++PIL Y Sbjct: 65 VCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQI-GNKESPSLIIPILLYH 123 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLV--DITITPDDEI--MQHRRIAILELLQ 179 G A + D+F +PE A + + + + D+ D+EI + ++ +A L Sbjct: 124 G-ADRWEYKTVA-DLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASLLAM 181 Query: 180 KHIRQRD-LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETG 237 K+ +D L LL ++TL E + ++ Y L G+ + F +++ Sbjct: 182 KYSALKDQLNTLLPTILTLASE--VDRNLHKSLLFYTL-VGNPLTEEQFLNLIKSVPNQK 238 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQ--EFAQRLLSKG--MSREDVAEMANLPLAEI 293 E++M + + FEEKG +KGI++GR E Q E A R L K ++ E +A N+ + Sbjct: 239 KEAIMDIFEIFEEKGWKKGIEEGRAEAEQKIETAVRNLIKQSVLTDEQIASAMNVTTDYV 298 Query: 294 DKVIN 298 +V N Sbjct: 299 AEVRN 303 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 73/276 (26%), Positives = 124/276 (44%), Gaps = 13/276 (4%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M S +PHDA+FK + A L+ L + D +TL E GS+I+E+L Sbjct: 1 MHGTSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPI 119 +D+L+S + G Y++++IEHQS D+ M RM+ Y RH A + LP ++P+ Sbjct: 61 SDLLFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPV 120 Query: 120 LFYQ---GEATPYPL-SMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 + G P S+ PEL + + D+T D ++ + Sbjct: 121 VVSHAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFA 180 Query: 176 ELLQKHIRQR-DLMLLLEQLVTLID---EGYTSGSQLVAMQ---NYMLQRGHTEQADLFY 228 L+ +R R ++ L++ + T D E + + + AM +Y+ Q F+ Sbjct: 181 TLVLWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFH 240 Query: 229 GVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGRQE 263 L + E M T + E+G+ KG+ +GR+E Sbjct: 241 AKLDEHVPQTREVMKTYYEELMEEGMAKGLAKGREE 276 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 43/135 (31%), Positives = 71/135 (52%), Gaps = 8/135 (5%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FK+ TAR FLE +LP E+R L DL T+ + S+I++ L+ +D+L+ Sbjct: 6 NPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFSDLLFQ 65 Query: 67 VQMQGNPGYLHVVIEHQSKP----DKKMAFRMMRYSIAAMHRHL---EADHDKLPLVVPI 119 V+++ N GY + + EH+ +P KKM+ R+ S+ + R + +H K P + Sbjct: 66 VKIRENEGYFYFLFEHKVRPYADRRKKMSTRLADDSVLSKQREMFMQSVNHGKPPYISRF 125 Query: 120 LFYQGEATPYPLSMC 134 + +G T C Sbjct: 126 I-RKGNRTGSAACRC 139 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 73.6 bits (179), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 78/338 (23%), Positives = 140/338 (41%), Gaps = 69/338 (20%) Query: 7 TPH---DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 PH D FK+ E +FL ++ E D +L SFI++ DV Sbjct: 5 VPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEKEADV 64 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPILFY 122 +Y +++ Y +V+IE QS D+ M R+ Y RH+E D+ LP +VPI+ Y Sbjct: 65 IYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVPIVLY 124 Query: 123 QGEATPYPLSMCW--FDMFYSPELARRVYNSPFPLVDITITPDDEIMQHR--RIAILELL 178 G + + + FD+F ++N + LVD+ DDE ++ R ++I+ L Sbjct: 125 NGRSGWNIPTQIFKGFDIFKDD-----MFN--YILVDVN-RLDDEKLKSRLDLLSIILYL 176 Query: 179 QKHIRQRDLML--------------------------------LLEQLVTLID------- 199 +K R + + + E++ + ID Sbjct: 177 EKSRRNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDELLKKIE 236 Query: 200 -EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQ 258 EG + + ++Q + E + + +E G E + ++GI++GI+ Sbjct: 237 AEGVEDVGEFIFNVQQLIQEYYREAEE------KGKEKGYEEGI-------QEGIKEGIK 283 Query: 259 QGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +G Q +E +RL+ KG + +AE + + I K+ Sbjct: 284 EGIQRKEEEIVRRLIQKGFNDNFIAEATGVEIERIKKI 321 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 39/123 (31%), Positives = 68/123 (55%), Gaps = 3/123 (2%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 +D + + E A D LP L + DL+ L L SG+++ + L+ + TDVLYSV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEA 126 + G +++++++HQS D R+ R ++ R+L D LP+++PI+F+ EA Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHH-EA 142 Query: 127 TPY 129 T + Sbjct: 143 TGW 145 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 75/287 (26%), Positives = 127/287 (44%), Gaps = 31/287 (10%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 ++ PHDA+F+ H A L LP EL L D + L + + SL TD+L Sbjct: 15 TSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRTDLL 74 Query: 65 YSVQMQGNPG--------YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 +S ++G PG YLH IEHQS+ D M R++ Y + RH + LP V Sbjct: 75 FSTALEG-PGAGDGARVVYLH--IEHQSRVDTTMPLRVLGYRVRIWERHRKRHGGALPPV 131 Query: 117 -VPILFYQGEATPYPLSMCWFDMFYSP-----ELARRVYNSPFPLVDITITPDDEIM--- 167 +L + + P S+ ++F P +A + P + D+ D E+ Sbjct: 132 FCVVLSHAAKGWTGPRSL--VELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARH 189 Query: 168 QHRRIAILELLQKHIRQRD-----LMLLLEQLVTLIDEGYTSGSQ-LVAMQNYMLQRGHT 221 H A+ L + R + L+ +Q++ L+D Y G + L + Y+ G Sbjct: 190 AHPLPALTLWLLRDARSPERLVHRLLDWRDQIIALLD--YDHGERDLAQLLRYVALVGSE 247 Query: 222 EQADLFYGVLRDRETGGESM-MTLAQWFEEKGIEKGIQQGRQEVSQE 267 + F+ + E+M MT+A+ + +++G +QG++E +E Sbjct: 248 MDFEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQRE 294 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 70/291 (24%), Positives = 132/291 (45%), Gaps = 33/291 (11%) Query: 8 PHDAVFKQFLMHAETARDFLEI---HLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 PHD+ FKQ + L+I + + + +NT S S + D+L Sbjct: 5 PHDSFFKQIFSDPRRVKTLLDIFAKDVAKSIHSITPVNTEKFSSKS------QKFMLDLL 58 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 +S +++ Y+ +V+EH+S DK++ ++ Y+ AA+ + + P ++ I+FY G Sbjct: 59 FSCKVKDQDAYIRIVLEHKSYLDKELPIQLSYYN-AAIWEEAIKEKEYYPPIINIVFYHG 117 Query: 125 EAT-PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI------AILEL 177 + P S+ + L + V + L+D+ DDE++ I A++ + Sbjct: 118 KGEWNIPTSL---PVLEDQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAM 174 Query: 178 LQKHIRQRDLMLLLEQLVTLI-----DEGYTSGSQLVAMQNYM-LQRGHTEQADLFYGVL 231 H + + LV + +EGY L NY+ +G T++A+ L Sbjct: 175 KHVHENIEKIKAVFRPLVEYVQIHEDEEGYHC---LFFSFNYISYVKGDTKEAE---NAL 228 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 ++ G + MTL + + +G+EKG Q+G QE ++ Q L K ++D+ Sbjct: 229 KELIGGDKKAMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGLIKA-KKDDI 278 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 70/292 (23%), Positives = 127/292 (43%), Gaps = 21/292 (7%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HD FK F E RDF++ +LP E+++ DL + ++ ++ E K Sbjct: 1 MSKKIPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFY 60 Query: 61 TDVLYSVQMQG--NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLV 116 +DV+ V + L+ + EH+SKP + + + Y + R L + LP++ Sbjct: 61 SDVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPII 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP--LVDITITPDDEIMQHRRIAI 174 VP++ Y G + + S+ + D+F P + + F L DI + + I Sbjct: 121 VPVVIYNGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEI 179 Query: 175 LELLQKHIRQRDLMLLLEQLVTLID---EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL 231 LL K+I +L + ++ L++ + L + Y++ G + L Sbjct: 180 FHLLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLLEHA- 238 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQ-----EFAQRLLSKGMS 278 R +GGE M+ LA + IE+ ++Q R+ Q E +Q +L K + Sbjct: 239 -KRFSGGEEMIGLAA----REIEERVEQTRKPYWQKQAKVENSQEMLIKSLK 285 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 81/320 (25%), Positives = 142/320 (44%), Gaps = 30/320 (9%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +T HD +K+ H E + +E P E+ L D NTL SG++I + DV+ Sbjct: 2 ATNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVV 61 Query: 65 YSVQMQ----GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-----LPL 115 +SV++ +L++++E QSK D M R+M Y +A + HL + LP Sbjct: 62 WSVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHY-VACFYDHLLKTRETTVRQGLPP 120 Query: 116 VVPILFYQGEATPYPLSMCWFDMFY-SPELARRVYNS--PFPLVDITITPDDEIMQHRR- 171 + P++ Y G + + +DM +P RVY + L+D D+E++ R Sbjct: 121 IFPMVLYNG-SQRWSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTP 179 Query: 172 -IAILELLQKHIRQRDLMLLLEQLVTLI--DEGYTSGSQLVAMQ-NYMLQRGHTE---QA 224 I + L ++++V ++ D ++V LQR + Sbjct: 180 LSGIFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARLNL 239 Query: 225 DLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQ-------EVSQEFAQRLLSKG- 276 D ++ DR E++ L + +G ++G Q+GRQ E ++ + LLS G Sbjct: 240 DRMSSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGV 299 Query: 277 MSREDVAEMANLPLAEIDKV 296 +S + +A L + EIDK+ Sbjct: 300 LSNDQIAVATGLSVDEIDKL 319 >UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=Q3C0L0_SODGL Length = 131 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 32/53 (60%), Positives = 38/53 (71%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 HD VFK+FL ARDFLEIHLP LR+ CD +TL + SGSFIE+ LKG + Sbjct: 8 HDHVFKKFLGDIAVARDFLEIHLPPHLRKHCDFSTLAMASGSFIEDDLKGQCS 60 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 42/116 (36%), Positives = 63/116 (54%), Gaps = 9/116 (7%) Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQ-ADL 226 +H +A+LEL+QKHIRQRDLM L+EQ+ L+ GY + Q+ + NY+LQ G + D Sbjct: 13 RHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFNDF 72 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 GV ES+MT+A+ Q+G Q + A+ +L G+ D+ Sbjct: 73 IDGVAERSPKHKESLMTIAERLR--------QEGEQSKALHIAKIMLESGVPLADI 120 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 67.8 bits (164), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 81/320 (25%), Positives = 129/320 (40%), Gaps = 38/320 (11%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK L + L+ LP + D +L + E L D+ +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 +H+++EH+S PD + F++ RY R L+ PL +PILFY G P Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPRPL-LPILFYHG-VVP 124 Query: 129 YPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIMQHRRI--AILELLQ-KHIRQ 184 + L ++ P EL + PL+D+ D+EI H A+L LL KHI Sbjct: 125 WTLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHHVDDLEAVLALLSLKHIFD 184 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQ----RGHTEQADLFYGVLRDRETGGES 240 + L+ L+ I E + L NYM E + + R+ G + Sbjct: 185 -GVETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREV---GMA 240 Query: 241 MMTLAQWFEE-----------------------KGIEKGIQQGRQEVSQEFAQRLLSKG- 276 + W +E KG+EKG QQG + ++ + LL K Sbjct: 241 QDIVETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGFQQGARLKEEQVIRTLLKKKT 300 Query: 277 MSREDVAEMANLPLAEIDKV 296 S E++A + + L+ + +V Sbjct: 301 FSFEEIASLVGVELSRVREV 320 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 67.0 bits (162), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 69/317 (21%), Positives = 131/317 (41%), Gaps = 38/317 (11%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +K H ET +FL E L + + L L S+I + +D+LY Sbjct: 10 HDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEEESDILYKAN 69 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------KLPLVVPIL 120 + +V++E QSK D +M R++ Y L+ KLP +VPI+ Sbjct: 70 IDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNFKLPSIVPIV 129 Query: 121 FYQGEATPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDITITPDDEIMQ-HRRIAILELL 178 Y G+ + + + +M EL + + + L DI D E++ I+ + LL Sbjct: 130 LYNGK-NKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNISNMISAVFLL 188 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYM-------------------LQRG 219 + I +++LM + S Q + ++ L++ Sbjct: 189 DQEIDEQELM--RRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDNLQGEIDDVLEKS 246 Query: 220 HTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 + E+ D L G+++ + E+G++KGI+QG ++ ++ A++ + GM Sbjct: 247 NQEEVDFMVSNL------GKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAIEMGMDN 300 Query: 280 EDVAEMANLPLAEIDKV 296 E + + L +I+ + Sbjct: 301 EIIMNLTGLSEEQINTI 317 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 67.0 bits (162), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 37/130 (28%), Positives = 66/130 (50%), Gaps = 8/130 (6%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 S D++FK+ DFL+ LP E + L E I + +D+L Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 65 YSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD-------KLPLV 116 Y ++ + G Y+++++EHQSK D+ MAFRM+ Y + +++ + KLP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 117 VPILFYQGEA 126 + ++FY G+A Sbjct: 122 IGMVFYDGKA 131 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 67.0 bits (162), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 37/122 (30%), Positives = 63/122 (51%), Gaps = 5/122 (4%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD+ +KQF + E L +P + E D +TL SGS++ + L+ D+++ + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 68 QM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA----DHDKLPLVVPILFY 122 +G Y+ +V+E QS PD MA R + Y+ + ++ + + LP V PI+ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 123 QG 124 G Sbjct: 127 NG 128 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 44/327 (13%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHST 61 A + TPHD FK+ E L LP ++ D ++L G + E L + Sbjct: 2 AKNLTPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTRA 61 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+++SV G L V++EH+S PD ++ F++++ + ++L + LP ++PILF Sbjct: 62 DLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREGREPLP-ILPILF 120 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI--MQHRRIAILELLQ 179 Y G+ + M E+AR + + +D+ + D I +Q+ L Sbjct: 121 YHGQGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAALLSM 180 Query: 180 KHI---RQRDLMLLLE------------QLVTLIDEGYTS--GSQLVAMQNYMLQRGHTE 222 KH+ +R LL+E + + L+ Y + + Y + TE Sbjct: 181 KHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMAAITE 240 Query: 223 QADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQG---------RQEVSQEFAQRLL 273 +A + ET E + + W EE GI+KG+Q G +Q V Q + +L Sbjct: 241 EAGM--------ETTTERLKKI--WIEE-GIQKGVQLGIQQGVQQGVQQGVRQNQIKTIL 289 Query: 274 S---KGMSREDVAEMANLPLAEIDKVI 297 S + + +A++ +L L E+++V+ Sbjct: 290 SLSKHNFTPQQIADLLSLELPEVERVL 316 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 72/315 (22%), Positives = 137/315 (43%), Gaps = 42/315 (13%) Query: 9 HDAVFKQFLMHAET----ARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 HD+ FK H + +D + E++E +++ L F++E+ DV+ Sbjct: 10 HDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKE----DSIELADKEFVDETFHQKRADVI 65 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 +++ Y +++IE+QS + M R++RY I + + KLP ++PI+ Y G Sbjct: 66 AKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKKLPAIIPIVTYNG 125 Query: 125 EATPYPLS---MCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + +S + FD+F + +V+I+ ++Q IL + + Sbjct: 126 LEKDWDVSQEIISEFDIFKDDIFK-------YAVVNISKLDAKTLLQEEE-DILSPVVFY 177 Query: 182 IRQ-RDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQRGHTEQADLFYGVLRDR 234 + Q RD L + + I+ T SQ L+ N + R E + + + + Sbjct: 178 LEQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDELAQRV 237 Query: 235 ETGGESMM-----TLAQW--------FEEKGIEKGIQ---QGRQEVSQEFAQRLLSKGMS 278 E GG M +A+ F E IE I+ +G+ E E A++++ +G S Sbjct: 238 EQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRGFS 297 Query: 279 REDVAEMANLPLAEI 293 ED+AE+ L + ++ Sbjct: 298 DEDIAELTELDIEKV 312 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 56/261 (21%), Positives = 123/261 (47%), Gaps = 23/261 (8%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD FKQ + + L+I P EL + DL ++ L + + + ++LY Sbjct: 6 PHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNLLYEC 64 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE-- 125 +++ +L ++ EH+S DK + +++ Y+ + ++++ P ++ I+ Y G+ Sbjct: 65 KIENEKSFLRIIFEHKSYIDKNLPSQLLYYN--GILWEETGEYEEYPPIINIVLYHGKRK 122 Query: 126 -ATPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRIAILE----LLQ 179 P L + E+ R N + L+D++ D+E++ + L Sbjct: 123 WNIPATLPKT------NSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLTM 176 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 KHI + + + ++ + E Y G + + +Y+ + ++ + VL++ G + Sbjct: 177 KHIFED--LRKYKHILKKVFEHYQDGCVFIIL-DYISVVNNPQEVE---NVLKEILGGEK 230 Query: 240 SMMTLAQWFEEKGIEKGIQQG 260 MMTL + ++ +G+++G+QQG Sbjct: 231 DMMTLTEKWKMEGLQQGLQQG 251 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 64.7 bits (156), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 72/278 (25%), Positives = 123/278 (44%), Gaps = 23/278 (8%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FK+ E A DFL P E+ + DL+TL ++ S+I+E LK H +D++Y+ Sbjct: 5 NPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDIVYT 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG-E 125 + + ++ EH+S ++M+Y + + + +P V+P++ Y G E Sbjct: 65 CFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQAQRLIP-VIPVILYHGKE 123 Query: 126 ATPYPLSMCWF----DMFYSPELARRVYNSPFPLVDITITPDDEIMQH--RRIA--ILEL 177 A +F ++FY R + + L DI+ ++EI RR++ I L Sbjct: 124 AWKVRRFREYFEGIDEVFY-----RFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITML 178 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQRGHTEQADLFYGVL 231 L ++I D L ++L + G + L + Y+ + + + Sbjct: 179 LMRNI--FDEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAEKRVIDTLK 236 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 E GG+ MT+A EKG G +GR E E A Sbjct: 237 EISEEGGKLSMTIAAKLIEKGKIAGRVEGRAEGRAEGA 274 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 6/121 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +K E RD + +P + D +TL GS++ E + D+++ V+ Sbjct: 5 HDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVWRVK 64 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL----EADHD-KLPLVVPILFYQ 123 + G YL+++IE QS DK MA RMM Y +++ L E D +LP V+PI+ Y Sbjct: 65 VGGEWVYLYLLIEFQSSVDKYMALRMMVYG-GLLYQDLIKRGEVLADGRLPPVLPIVLYN 123 Query: 124 G 124 G Sbjct: 124 G 124 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 36/121 (29%), Positives = 65/121 (53%), Gaps = 6/121 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA +K+ H E RD L+ + + D +TL SGS++ + L+ D+++ ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-----EADHDKLPLVVPILFYQ 123 Q Y+++++E QS D MA R++ Y + +++ L A + KLP V P++ Y Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAY-VGLLYQDLIKARYIAPNQKLPPVFPLVLYN 122 Query: 124 G 124 G Sbjct: 123 G 123 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 63.9 bits (154), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 46/171 (26%), Positives = 78/171 (45%), Gaps = 13/171 (7%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 P +D FK+ E +FL+ + + DL +L SF+++ DV Sbjct: 5 PPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEKEADV 64 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPILFY 122 +Y +++ Y +V++E QS DK M R+ Y RH+E D L +VPI+ Y Sbjct: 65 IYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVPIVLY 124 Query: 123 QGEAT---PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHR 170 G + P + W ++F ++N + LVD+ DDE +++R Sbjct: 125 NGRSNWNVPTLIFKGW-EIFKDD-----MFN--YFLVDVN-NIDDETLKNR 166 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 29/94 (30%), Positives = 59/94 (62%), Gaps = 3/94 (3%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PH+ +F + + + AR FL+ H+ E+++ DL+TL LE ++++E LK H +D+++S Sbjct: 8 APHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKHYSDLVFS 67 Query: 67 VQMQGNP---GYLHVVIEHQSKPDKKMAFRMMRY 97 V++ G ++++ EH+S PD ++++Y Sbjct: 68 VRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKY 101 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 73/283 (25%), Positives = 128/283 (45%), Gaps = 42/283 (14%) Query: 44 LHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH 103 L L +++ +D+LY ++Q + +++ EHQS D MA R++ Y I + Sbjct: 45 LELVDKNYVLPDFSEQESDLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFY-ITDIW 103 Query: 104 RHLEADHD---------KLPLVVPILFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPF 153 R D K P VVPI+ Y G+ P+ S+ + + E+ + + + + Sbjct: 104 RDWLKQFDKNQFKNKSFKFPPVVPIVLYDGD-NPWTASVNLKERIMNFEVFGKYIVDFEY 162 Query: 154 PLVDITITPDDEIMQHRRIAILEL-LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ 212 L+D+ PD+ I +++ I L L L K +++L L L Y G++ + Sbjct: 163 ILIDLN-DPDEMIFKYKDILSLILKLNKVKTEKELERLFLDLYE-----YLQGAKEKEIN 216 Query: 213 N------YMLQRGHTEQADLFYGVLRDRETGGESMMTLAQ--------WFEEKGIEKGIQ 258 +L+ ++ +L + GGE +M L Q W+ E GI+KGIQ Sbjct: 217 TLKICLPVVLKELGEDKVQEAKDMLECIDVGGEGIMPLFQNLRKIREEWYHE-GIQKGIQ 275 Query: 259 QGRQEVSQ--------EFAQRLLSKGMSREDVAEMANLPLAEI 293 G Q+ Q E A+R++ KG S E++ E+ L + +I Sbjct: 276 DGLQQGLQQGLQKKELEIAERMIVKGYSDEEIHEITGLDIEKI 318 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 40/152 (26%), Positives = 75/152 (49%), Gaps = 5/152 (3%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +K+ H E D + L +L CDL+TL +GS++ + L+ D+++ + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD---HDKLPLVVPILFYQGEA 126 L+++IE QSKPD M R+M Y + + ++P ++PI+ Y GE Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGE- 123 Query: 127 TPYPLSMCWFDMFYSPE-LARRVYNSPFPLVD 157 P+ + + P+ ++R + + P+ L+D Sbjct: 124 IPWKVPHDIRETIQMPKPVSRFIPSVPYLLID 155 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 79/299 (26%), Positives = 140/299 (46%), Gaps = 26/299 (8%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 PHD + ++ + A F + LP E+ EL DL L L SF+ E LK TD+L+ Sbjct: 5 NNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQTDLLF 64 Query: 66 SVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 + ++ GN ++++ EH+S + + +++ Y + ++R+ + + +V+P +FY G Sbjct: 65 QIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGY-LTEIYRNQQRSGESFSVVIPFVFYHG 123 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI------AILELL 178 E + L + D F + V+ P I + + I +++ L ++ Sbjct: 124 EK-EWKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVTLGVV 182 Query: 179 QKHIRQRDLMLL--LEQLVTLIDEGYTSGSQLVAMQNYMLQRGH-------TEQADLFYG 229 Q+ IR+RDL + L L +L+ G S+ VA+ +L + TE + Sbjct: 183 QR-IRERDLEFVSHLPGLFSLL-LGIEEESKRVAILRKLLLYIYWARDLKPTELKRVL-- 238 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 + E E MT A E+ I +GIQQG+ E E A+ +LS+ + E V + L Sbjct: 239 AISKLEQYEELTMTTA----ERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGL 293 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 60.8 bits (146), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 44/153 (28%), Positives = 65/153 (42%), Gaps = 23/153 (15%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 A HD +F+ AR FL LP EL D +TL + S I ++L D Sbjct: 30 AAGNGDHDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERRED 89 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMM-------------RYSIAAMHRHLEAD 109 V+Y + + G + +V++EHQ+K +K MA R+M R A +AD Sbjct: 90 VVYRINVNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKAD 149 Query: 110 H-------DKLPLVVPILFYQGEATPYPLSMCW 135 DK PLV+ ++ + G P W Sbjct: 150 RQSRRRETDKFPLVISMVLHPG---PRKWGKIW 179 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 33/127 (25%), Positives = 61/127 (48%), Gaps = 8/127 (6%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 ++ D+++KQ H E RD + L + + + S+ + DV++ Sbjct: 40 SSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDDVVW 99 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRY------SIAAMHRHLEADHDKLPLVVPI 119 ++ G Y+++++E Q++PDK MA RM Y + A H+ + H KLP V+P+ Sbjct: 100 RARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKL--SKHGKLPPVLPV 157 Query: 120 LFYQGEA 126 + Y G Sbjct: 158 VLYHGRG 164 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 72/270 (26%), Positives = 113/270 (41%), Gaps = 27/270 (10%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK L + L+ LP L L +L + +SL D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 +HV++EH+S PD F+++ Y R + +P VP+LFY G P Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLRDKKESRSPIPF-VPVLFYHG-LRP 125 Query: 129 YPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIMQHRR---IAILELLQKHI-- 182 + L +M P EL V + P++D+ D +I + R + LL KHI Sbjct: 126 WNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLLLKHIFE 185 Query: 183 -RQRDLMLLLEQLVTLIDEGYTSGSQL-----VAMQNYMLQRGHTEQ-ADLFYGVLRDRE 235 + L L++ T+G L ++ +Y++ H E A+L V + Sbjct: 186 GARGSLRAFLQE---------TNGKNLSRDIIISGMSYVIGVHHLESTAELSRLVNTILK 236 Query: 236 TGGESMMTLAQWFEE---KGIEKGIQQGRQ 262 G S + W EE +G++KGIQQG Q Sbjct: 237 EEGMSQNVVELWMEELIQQGVQKGIQQGVQ 266 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 25/35 (71%), Positives = 30/35 (85%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRE 37 + ++TPHDAVFK FL H ETARDF+EIHLPV LR+ Sbjct: 4 STTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQ 38 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 42/142 (29%), Positives = 69/142 (48%), Gaps = 32/142 (22%) Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY-MLQRGHTEQADLFYGVLRDRETGG-- 238 +RQRDL+ L+E++ +L+ G + QL A+ NY M+Q GHT + F R+ G Sbjct: 36 LRQRDLLGLVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFI-----RDVVGHV 90 Query: 239 ----ESMMTLAQ-----------------WFEE---KGIEKGIQQGRQEVSQEFAQRLLS 274 E +MTL + EE +G+EKG+++G+ + A+++L+ Sbjct: 91 PHTKERLMTLIERIRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLA 150 Query: 275 KGMSREDVAEMANLPLAEIDKV 296 G+ RE V L E+ V Sbjct: 151 DGLDRETVQRFTGLTAEELQDV 172 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 73/298 (24%), Positives = 131/298 (43%), Gaps = 10/298 (3%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +T HD+ K FL A L+ LP E+ + D N ++ E S++ +SL+G+ +D++ Sbjct: 3 TTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDLV 62 Query: 65 YSVQMQGNPGYLHV--VIEHQSKPDKKMAFRMMRYSIAAMHRHLE-ADHDKLPLVVPILF 121 SV + V ++EH+S K + +RY + ++ + +LP+++PIL Sbjct: 63 VSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPILI 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVD-ITITPDDEIMQHRRIAILELLQK 180 E P + S + V + F L D + P+D A+ L + Sbjct: 123 AHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTLW-R 181 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ---NYM-LQRGHTEQADLFYGVLRDRET 236 + R + M +++ LI + L +Q +Y+ + R E D+ + + Sbjct: 182 YSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEIDE 241 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVS-QEFAQRLLSKGMSREDVAEMANLPLAEI 293 G E M T+A+ F +G E+ Q+ QE E L + + D+A A PL +I Sbjct: 242 GEEYMGTIAEMFRREGDERTEQRFLQEKPIWEKQSELKATQETLIDIATEAYGPLPDI 299 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 38/126 (30%), Positives = 57/126 (45%), Gaps = 7/126 (5%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 +PHD FK + FLEI LP +L E N+L L + K D+ + Sbjct: 6 SPHDWFFKMIFSQKQNVESFLEIFLP-QLYECIIPNSLKLSDTEKFSKKYKKFFLDLAFD 64 Query: 67 VQM---QGNP--GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 ++ +GN G +++V EH+S PDK ++ Y M P V+PI+F Sbjct: 65 CKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEEDERLSRPYRP-VIPIVF 123 Query: 122 YQGEAT 127 Y GE + Sbjct: 124 YHGEKS 129 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 44/205 (21%), Positives = 94/205 (45%), Gaps = 11/205 (5%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 P P+D ++Q L + L+ + E D + L L + S++ + DV Sbjct: 10 PPHHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADV 69 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKLPL 115 +Y ++ + +V++E QS D M FR++ Y + E+ H +LP Sbjct: 70 VYRLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPP 129 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYS-PELARRVYNSPFPLVDITITPDDEIMQHRR-IA 173 ++P + Y G A + ++ + +M S + + + + + L D+ ++E+++ IA Sbjct: 130 IIPAVLYNG-AGSWTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIA 188 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLI 198 + LL + ++ DL L++L ++ Sbjct: 189 GIFLLDQKMQPEDLAGRLQKLAGVL 213 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 28/109 (25%), Positives = 57/109 (52%), Gaps = 7/109 (6%) Query: 34 ELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFR 93 E+RE +++ ++ ++I + DV+ +++ Y +++IE+QS K M R Sbjct: 43 EIRE----SSIEIKKTNYITKEFSQVEADVVAKARLKDRDVYFYILIENQSTVAKDMPER 98 Query: 94 MMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLS---MCWFDMF 139 ++RY I+ + +KLP ++PI+ Y G + +S + FD+F Sbjct: 99 LLRYMISIWAEEIRNGVEKLPAIIPIVVYNGLDRRWEVSTDIIGAFDIF 147 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 84/334 (25%), Positives = 142/334 (42%), Gaps = 70/334 (20%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL------KGHST 61 PHD FK+ L+H A +F+ + P EL D + ++E L + + Sbjct: 27 PHDEAFKK-LLHTFFA-EFIALFFP-ELESQLDFSQTRF----LMQEQLVDVVGEEARTL 79 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+L + G ++ + +E QS RM Y RH +H L++PI Sbjct: 80 DLLLETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERH-RKEHQ---LIIPIAI 135 Query: 122 Y-----QGEATPYPLSMCWFDM----FYSPELA----RRVYNSPFP-----LVDITITPD 163 + + E +S+ D+ F EL RR +S P L + Sbjct: 136 FTSAESKNERNSLNMSILGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYNKG 195 Query: 164 DEIMQHRRIAILELL---QKHIRQRDLMLLL--------------EQLVTLIDEGYTSGS 206 +E + R+A L +L + + Q L L++ E+++ + + Y S Sbjct: 196 EE--RELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKES 253 Query: 207 QLVA--MQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV 264 +++ M +M Q G+ + G+ E G E + EKG EKGI+QG Sbjct: 254 EVIMELMPAWMRQ-GYEK------GLEEGLEKGIEQGI-------EKGFEKGIEQGTLIE 299 Query: 265 SQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 ++ A+RLLSKG + E++A+M L + EI K++N Sbjct: 300 RRQIARRLLSKGFTLEEIADMTQLSIEEIKKIMN 333 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 61/260 (23%), Positives = 123/260 (47%), Gaps = 23/260 (8%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHL---ESGSFIEESLKGHSTDV 63 PHD K+ L E A+ L+ HLP E+ + TL + E+ + E+S + D+ Sbjct: 15 NPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKS--KYFADI 72 Query: 64 LYSVQ-MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 +YS++ + G ++V+IEH+S DK + ++++ A + E K+ + PI+ Y Sbjct: 73 IYSLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSK--EILEGKITPIYPIVIY 130 Query: 123 QGEATPYPLSM-CWFDMFYSPELARRVYNSPFPLVDITITPDDEIM---QHRRIAILELL 178 A+ LS+ F +Y + + F + + + DE +++ I L + Sbjct: 131 ---ASKEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMT 187 Query: 179 QKHIRQ---RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 + I++ +++ L++ + TL + Y + V +Y+ ++ Y ++ + Sbjct: 188 LRIIQEPTPENILNLIKSIETLYN--YKPKAVYVIALSYIFTIAKKDKNT--YIKVKKQL 243 Query: 236 TGGESMMTLAQWFEEKGIEK 255 GG +M +L F E+G+EK Sbjct: 244 EGG-NMGSLLDMFIEEGLEK 262 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 48/170 (28%), Positives = 77/170 (45%), Gaps = 11/170 (6%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK-GHSTDV 63 + TPHD+ FK + L L D ++L SG I E L +D+ Sbjct: 4 TPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFRSDL 63 Query: 64 LYSV-----QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 + S+ + G P ++EH+S P + + F++ A R L LP VVP Sbjct: 64 VGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGKPPLP-VVP 122 Query: 119 ILFYQGEATPY--PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI 166 IL + G+ +P+ PL + + + PELA + + ++D+T DDEI Sbjct: 123 ILIHHGK-SPWNQPLRL-YETLGLRPELATGMLDYALHVIDLTRIEDDEI 170 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 63/303 (20%), Positives = 127/303 (41%), Gaps = 46/303 (15%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +K + E D ++ + + + + L + S+I + +D++Y Sbjct: 11 HDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELESDIVYKAT 70 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR---------HLEADHDKLPLVVPI 119 + G ++++E QS D M R+ Y ++ + R +++ +LP +VP+ Sbjct: 71 IDGREVIFYILLEFQSYVDYSMPIRLFLY-MSEIWREVLKNTKQAEVKSKEFRLPAIVPL 129 Query: 120 LFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRR-IAILEL 177 + Y GE + + + ++ EL + + + L+DI +E+M+ + ++ + L Sbjct: 130 VLYNGEY-KWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKNLVSAVFL 188 Query: 178 LQKHI-------RQRDLMLLLEQL-------------VTLIDE-----GYTSGSQLVAMQ 212 L + + R +D+ + L VTL DE G L+A + Sbjct: 189 LDQKVDIEEFISRVKDIAIDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEKIEDILIAKK 248 Query: 213 NYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 + + + + RE G E + E+GIEKGI++ RQ+ E +L Sbjct: 249 EEVNRMTSNISKTIKETFAKTREEGMEKGI-------EEGIEKGIEKARQK-DVEIVLKL 300 Query: 273 LSK 275 L+K Sbjct: 301 LTK 303 >UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EA97_9SPHI Length = 293 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 70/285 (24%), Positives = 118/285 (41%), Gaps = 41/285 (14%) Query: 24 RDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV-QMQGNPGYLHVVIEH 82 R+ +E+ LP +RE+ L L E + + D L V +QGN LH IE Sbjct: 19 RENMEVTLPEVIREVLGLEILLSEELPDDVQHTRERKPDALKKVTDIQGNTFVLH--IEF 76 Query: 83 QSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSP 142 Q + +K+M +RM YSI M R+ +LP+ ++F + P + + YS Sbjct: 77 QVEDEKEMVYRMAEYSIMLMRRY------QLPVKQYVIFLKDTKPRMPTGLKTPKLVYSF 130 Query: 143 ELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGY 202 +L R+ + L + P+ +++ A+L + R+ L ++ L++ + Sbjct: 131 DLI-RIAEISYKLFIKSDNPEVKML-----AVLANFDEADREGALTSIITGLLSHSKGDF 184 Query: 203 TSGSQLVAMQNYMLQRGHTEQ----------------ADLFYGVLRDRETGGESMMTLAQ 246 ++ +M R EQ D FY R E GE Sbjct: 185 AERRHFKQLRIFMQLRSSIEQHFDKVMDSVSTFFKEENDYFY---RKGEARGEIKG---- 237 Query: 247 WFEEKGIEKGIQQGRQEVSQEFAQRLLSK-GMSREDVAEMANLPL 290 E KG KG +G + S+ + L++K G S E AE+A + + Sbjct: 238 --EAKGEAKGEAKGEAKKSRAVVENLIAKLGFSDEQAAEIAEVTV 280 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 69/310 (22%), Positives = 129/310 (41%), Gaps = 25/310 (8%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHSTDV 63 STTPHD+ FK + L + L +L++L G I E L + +D+ Sbjct: 21 STTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARSTRSDL 80 Query: 64 -----LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 + + ++ G + + EH+S + ++ A + R L P V+P Sbjct: 81 SASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREGRKPCP-VIP 139 Query: 119 ILFYQGEATPYPLSMCWFDMF-YSPELARRVYNSPFPLVDITITPDD---EIMQHRRIAI 174 ++ Y G A P+ L + SPELA R+ + L+D++ D+ E + H + Sbjct: 140 VVLYHGRA-PWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAHPEPLV 198 Query: 175 LELLQKHIRQRDLMLL--LEQLVTLIDEGYTSGSQLVAMQ----NYMLQRGHTEQADLFY 228 + KHI + +L +L+ + ++V +Y+ + H ++ + Sbjct: 199 SLSVMKHIFEPPESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQEIRTIF 258 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 E M T+ +E+GI++GIQ GR E Q +S + +A + N+ Sbjct: 259 TTF----LAEEKMTTVLDLIKEEGIQEGIQMGRDEAITRLLQH---SSLSPQQIASILNV 311 Query: 289 PLAEIDKVIN 298 L+ + + N Sbjct: 312 DLSRVLSLAN 321 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 34/162 (20%), Positives = 74/162 (45%), Gaps = 7/162 (4%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD FKQ + + L+I EL + DL ++ L + + + D+LY Sbjct: 6 PHDQFFKQIFSEPKRVKSLLDIFYS-ELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYEC 64 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 +++ +L ++ EH+S DK + +++ Y+ + + LP++ +L++ Sbjct: 65 KIENEKSFLRIIFEHKSYIDKNLPSQLLYYN-GILWEETGEYKEYLPIINIVLYHGKRKW 123 Query: 128 PYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQ 168 P ++ + E+ R N + L+D++ D+E++ Sbjct: 124 NIPTTLPKTN----SEIIERFSNKLNYHLIDLSKVADEEMIN 161 >UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF6_YERPN Length = 105 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 26/62 (41%), Positives = 41/62 (66%), Gaps = 1/62 (1%) Query: 202 YTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQG 260 Y S Q++A+ +Y+LQ G + ++ F L R G+++MT+AQ E+KGIEKGI++G Sbjct: 4 YLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGIEKG 63 Query: 261 RQ 262 Q Sbjct: 64 IQ 65 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 66/278 (23%), Positives = 123/278 (44%), Gaps = 38/278 (13%) Query: 31 LPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ-MQGNPGYLHVVIEHQSKPDKK 89 LP ++ + LN +E + + K TD+L V+ +GN LHV E+Q+ + Sbjct: 26 LPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYVLHV--EYQTDNYPE 83 Query: 90 MAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVY 149 MAFRM YSI +H KLP+ +++ S+ D + L Sbjct: 84 MAFRMAEYSIMLQRKH------KLPVKQFVIYIGPAKANMATSITTKDFRFRYNL----- 132 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVT---LIDEG--YTS 204 + V+ + ++++ + +AIL L + L +++++ T +++G + Sbjct: 133 -TELSAVNYKLFLKSDLVEEKMLAILSNLASESTESVLAQVVQEIETHTSTLEQGRYFRQ 191 Query: 205 GSQLVAMQNY---------MLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEK 255 L+ ++N ++ + E+ D+ Y R E GE + KG K Sbjct: 192 LRILLQLRNLNKKAIKDMALVGKIFKEEKDILY---RRGEIKGEIKGEI------KGEIK 242 Query: 256 GIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 GI++GR E + E A L +G++ E +A++ L + EI Sbjct: 243 GIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEI 280 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 47.0 bits (110), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 32/136 (23%), Positives = 57/136 (41%), Gaps = 8/136 (5%) Query: 46 LESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH-- 103 L + S+I + +D++Y GN + +V++E QS D +M R++ Y I Sbjct: 3 LVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWRDI 62 Query: 104 -RHLEADHDK-----LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVD 157 R+ E K LP +VPI+ Y G+ + S + N + +D Sbjct: 63 LRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEFID 122 Query: 158 ITITPDDEIMQHRRIA 173 I +E+ + I+ Sbjct: 123 INSYEKEELYNKQNIS 138 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 34/165 (20%), Positives = 85/165 (51%), Gaps = 15/165 (9%) Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSI---------AAMHRHLEADHDKL 113 ++Y V+++ + ++++E QSK D +M +R++ Y I ++++ D+ KL Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDY-KL 59 Query: 114 PLVVPILFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQ-HRR 171 P ++PI+ Y G + S+ + + S +L + + + L+D+ ++E++Q Sbjct: 60 PAIIPIVLYNG-VNRWTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNL 118 Query: 172 IAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYML 216 I+ + LL + I + +L +L ++ + S + + ++N++ Sbjct: 119 ISSIFLLDRKIDKEELTEKWGKLADVLKD--ISEEEFIILRNWLF 161 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 34/164 (20%), Positives = 78/164 (47%), Gaps = 19/164 (11%) Query: 50 SFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE-- 107 SF+ + D++Y V+++ ++++E QS D +M +R++ Y + L+ Sbjct: 55 SFVLQDFADKEADLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDT 114 Query: 108 ------ADHDKLPLVVPILFYQGE-----ATPYPLSMCWFDMFYSPELARRVYNSPFPLV 156 KLP++VPI+ Y G+ T Y ++ ++ F + + L+ Sbjct: 115 PRKESRRKDFKLPVIVPIVLYNGDHKWTAKTSYKETLNSYETF-----GEYAVDFKYILI 169 Query: 157 DITITPDDEIMQ-HRRIAILELLQKHIRQRDLMLLLEQLVTLID 199 D+ +E+++ IA + LL++ + ++M L++L +++ Sbjct: 170 DVNRYTKEELLKLENLIASVFLLEQKVEFEEIMKRLKELSEILN 213 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 22/49 (44%), Positives = 31/49 (63%) Query: 250 EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 EKG+EKG +G+ E + E AQRLL+ G+ E V++ LPL I + N Sbjct: 258 EKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQVSKATQLPLEIIKNLSN 306 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 22/54 (40%), Positives = 34/54 (62%) Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 M + QW E+G ++G +GR++ +E AQ +L+ GMS E +A+ LPL EI Sbjct: 220 KMTQIEQWIREEGRQEGELKGRRDEKRETAQTMLNLGMSPELIAKATKLPLEEI 273 >UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CVG1_ORITI Length = 96 Score = 44.3 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 33/125 (26%) Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 +LE + KHI QRD++ L E+ ++++ H G++ D Sbjct: 1 MLEYMLKHIHQRDMLKLWEE--------------------FLIKFKH--------GLILD 32 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 +E G SM T+A K I++GI +GR E +QE + LL G E ++E L E+ Sbjct: 33 KEKG-NSMRTIAA----KYIDEGIAKGRAEAAQELTRNLLKAGFLVEFISETTGLSKEEV 87 Query: 294 DKVIN 298 V N Sbjct: 88 VNVKN 92 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 34/174 (19%), Positives = 76/174 (43%), Gaps = 10/174 (5%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +K + ET ++ + L L S++ + +D++Y + Sbjct: 23 HDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESDIVYKAR 82 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--------LPLVVPIL 120 + + + ++++E QS D +M R++ Y I L+ +K LP VVPI+ Sbjct: 83 IGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLPAVVPIV 142 Query: 121 FYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRRIA 173 Y GE + ++ ++ + ++ + + + +D+ DE+ +++ IA Sbjct: 143 VYNGEKN-WTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIA 195 >UniRef50_C5UQX3 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UQX3_CLOBO Length = 66 Score = 43.5 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 21/47 (44%), Positives = 32/47 (68%) Query: 250 EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 EKG E GI QGR+ S E ++ + KGMS + + E+ LP+AEI+++ Sbjct: 14 EKGRELGILQGRKNKSIEVTKKAIKKGMSNKLINELTELPIAEIEEI 60 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 42.7 bits (99), Expect = 0.015, Method: Compositional matrix adjust. Identities = 25/91 (27%), Positives = 46/91 (50%), Gaps = 11/91 (12%) Query: 48 SGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSI-------- 99 S F+ + +D + + + + YL +IEHQS+ D M+FR++RY + Sbjct: 57 SSHFLPLFQESRDSDTVNKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAA 116 Query: 100 --AAMHRHLEADHDKL-PLVVPILFYQGEAT 127 +H+ D L P ++PI++Y+G +T Sbjct: 117 QQEKLHKGTTKSKDFLYPPILPIVYYEGSST 147 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 81/203 (39%), Gaps = 23/203 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV-Q 68 DA++ + H A + +P + D + + F + K DV++ + Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH-----DKLPLVVPILFYQ 123 G LH++ E QS D MA R Y + +HL A+ D+LP V+ ++ Y Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYE-GLLWQHLIAERKLKSGDRLPPVLTLVLYN 123 Query: 124 GE------ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 GE PL P R Y+ L+D+ P++E+ +A L Sbjct: 124 GEQRWHAPTDTIPLIALPAGSPLWPWQPRACYH----LLDMGAVPEEELAIRDSLAALLF 179 Query: 178 LQKHIRQRDLMLLLEQLVTLIDE 200 +H R+ E+L LID+ Sbjct: 180 RLEHPREP------EELAGLIDD 196 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 42.4 bits (98), Expect = 0.020, Method: Compositional matrix adjust. Identities = 54/242 (22%), Positives = 103/242 (42%), Gaps = 41/242 (16%) Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ-----GEA 126 N L ++E Q K ++++RY+ M H +A LV+P + + +A Sbjct: 66 NQQLLLWLVEFQEDKSKFSIYKLLRYTTDLMETHPDA------LVIPTVLFTDRKKWSKA 119 Query: 127 TPYPLSMCWFD-MFYSPEL---------ARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 L D MF E AR YN P+V I + P + RI ++ Sbjct: 120 VLQQLHAQLHDRMFLHFEYVFHKLFDLNARDYYNVDNPVVKILL-PKMHYKKEDRIEVIR 178 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 + Q L ++ V ID Q EQ +L+ +++ +ET Sbjct: 179 QAYAGLFQLVSSGLFDKYVDFIDTYAEIEDQ--------------EQLNLYNEIVQHKET 224 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 LAQ+ E+G+++G ++ R++ F ++ +G+S +A++ +L ++ ++K+ Sbjct: 225 A-----MLAQYIRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDLDVSMVNKI 279 Query: 297 IN 298 +N Sbjct: 280 LN 281 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 42.4 bits (98), Expect = 0.021, Method: Compositional matrix adjust. Identities = 64/320 (20%), Positives = 137/320 (42%), Gaps = 60/320 (18%) Query: 10 DAVFKQFLMHAETAR----DFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D FK+ L + + + LE+ LP+++ L D+ + ES I + +D++Y Sbjct: 9 DEGFKKVLTNRTNIKWLLTELLEV-LPIQIG-LEDIEVIATES---INRQWRARRSDMVY 63 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 ++ + Y+ V++E QS ++ + R++ Y + ++ + LP+V+P++ Y GE Sbjct: 64 KIKYKD--AYICVLLEFQSSKEELIHLRVLEYMLLIQKKY--TTKNLLPVVIPVVLYTGE 119 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH---------------- 169 P + ++ Y + + V VD+ + D+++++ Sbjct: 120 EKWTPATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDKVSDN 179 Query: 170 -RRIA-ILELLQKHIR--QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD 225 ++A LE L KH++ + E L ++ +GY + V ++ + Sbjct: 180 PEKVAERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEV--DEFLFKSDF----- 232 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQ------------EFAQRLL 273 L GV E + A+ KG+EK +++ R++ Q E AQ+++ Sbjct: 233 LRLGV-------NEMFLNTAEKIR-KGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMI 284 Query: 274 SKGMSREDVAEMANLPLAEI 293 +G +A++ L + I Sbjct: 285 EEGAEDSFIAKVTGLDMERI 304 >UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobacterium RepID=Q6D6X6_ERWCT Length = 135 Score = 42.4 bits (98), Expect = 0.022, Method: Compositional matrix adjust. Identities = 29/98 (29%), Positives = 51/98 (52%), Gaps = 13/98 (13%) Query: 214 YMLQRGHTEQ-ADLFYGVLRDRETGGESMMTLAQWFEEKGIEK------------GIQQG 260 Y+ + G+T + A+ V + T E++MT+AQ E+ G EK G++QG Sbjct: 34 YIARSGNTSKPAEFIEAVAQSLSTDREAIMTIAQQLEKIGFEKGIKHGMQQGMQRGMEQG 93 Query: 261 RQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + +++ A++LL GM V +M L AE+ ++ N Sbjct: 94 IKTSARQIARQLLLSGMEPAQVCQMTQLSAAELAQLSN 131 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 42.0 bits (97), Expect = 0.026, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 31/48 (64%) Query: 249 EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +E+GIE GI++G+ E + A LL +G S E VA+M L + E+ K+ Sbjct: 292 KEEGIEIGIEKGKMEEKRNLAAELLREGFSVEKVAKMVKLSIDEVKKI 339 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 41.6 bits (96), Expect = 0.037, Method: Compositional matrix adjust. Identities = 22/68 (32%), Positives = 43/68 (63%) Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 +R +E +M T E+G+E+G+++GR E E +++L+KG+S E V+++ L L Sbjct: 232 IRIQENYDMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSL 291 Query: 291 AEIDKVIN 298 E+D +++ Sbjct: 292 EELDGLLS 299 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 40.8 bits (94), Expect = 0.054, Method: Compositional matrix adjust. Identities = 33/165 (20%), Positives = 75/165 (45%), Gaps = 8/165 (4%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + +D +K+ + E FL+ L E + + + + + + I + + +D++ Sbjct: 3 TYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDIV 62 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH--RHLEADHDKLPLVVPILFY 122 Y ++ + + + IE QS+ DKK+ R+ Y MH + + ++P+VVPI+ Y Sbjct: 63 YKIKYKD--SFFCLTIEFQSREDKKILHRLYEY----MHLIQLKNKVNGEIPVVVPIVLY 116 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM 167 G + P + ++ + N +DI P+++++ Sbjct: 117 NGISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLI 161 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 40.8 bits (94), Expect = 0.056, Method: Compositional matrix adjust. Identities = 17/47 (36%), Positives = 34/47 (72%) Query: 250 EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +KGIEKGI++GRQE + A+++ +G+ E +A+ + L + +I+++ Sbjct: 249 QKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIERL 295 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 395 e-108 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 377 e-103 UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 353 3e-96 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 343 6e-93 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 342 1e-92 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 333 4e-90 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 329 6e-89 UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 325 1e-87 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 320 5e-86 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 305 1e-81 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 304 3e-81 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 301 2e-80 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 298 2e-79 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 293 5e-78 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 291 2e-77 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 287 4e-76 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 273 6e-72 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 271 2e-71 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 271 3e-71 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 266 1e-69 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 265 1e-69 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 262 1e-68 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 256 6e-67 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 245 2e-63 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 239 1e-61 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 230 5e-59 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 228 2e-58 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 227 3e-58 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 224 4e-57 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 219 8e-56 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 219 9e-56 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 219 9e-56 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 217 3e-55 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 217 4e-55 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 216 6e-55 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 215 1e-54 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 215 1e-54 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 213 6e-54 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 212 1e-53 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 211 2e-53 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 211 3e-53 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 208 2e-52 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 206 9e-52 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 203 6e-51 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 202 1e-50 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 200 4e-50 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 199 1e-49 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 198 2e-49 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 197 4e-49 UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 196 7e-49 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 195 1e-48 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 195 2e-48 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 195 2e-48 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 193 6e-48 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 192 1e-47 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 187 3e-46 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 187 4e-46 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 186 6e-46 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 186 6e-46 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 186 7e-46 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 186 1e-45 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 184 5e-45 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 179 1e-43 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 178 2e-43 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 178 3e-43 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 177 3e-43 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 177 5e-43 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 177 5e-43 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 176 9e-43 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 175 2e-42 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 173 6e-42 UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 172 2e-41 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 169 1e-40 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 160 4e-38 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 159 8e-38 UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 155 1e-36 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 154 3e-36 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 153 6e-36 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 151 2e-35 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 150 4e-35 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 149 1e-34 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 148 2e-34 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 148 2e-34 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 147 4e-34 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 145 2e-33 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 143 5e-33 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 142 2e-32 UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 130 5e-29 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 128 3e-28 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 126 8e-28 UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 123 6e-27 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 118 2e-25 UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 118 3e-25 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 115 1e-24 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 110 7e-23 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 105 2e-21 UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobac... 105 2e-21 UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ... 88 5e-16 UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=... 76 2e-12 UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF... 74 7e-12 Sequences not found previously or not previously below threshold: UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 162 2e-38 UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 158 2e-37 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 141 3e-32 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 141 3e-32 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 128 2e-28 UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostri... 120 6e-26 UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 118 2e-25 UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostri... 115 3e-24 UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 110 8e-23 UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseifl... 106 8e-22 UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermo... 101 2e-20 UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorell... 100 5e-20 UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NI... 100 7e-20 UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synecho... 99 3e-19 UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candida... 98 4e-19 UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfuri... 96 1e-18 UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostr... 96 2e-18 UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermo... 94 4e-18 UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotro... 93 1e-17 UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillu... 88 4e-16 UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersini... 87 5e-16 UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW 86 2e-15 UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuok... 85 3e-15 UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 83 1e-14 UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorell... 83 2e-14 UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 82 2e-14 UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Trepone... 81 4e-14 UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobac... 81 4e-14 UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubact... 80 7e-14 UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 80 7e-14 UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID... 78 3e-13 UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldice... 77 6e-13 UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibroba... 77 1e-12 UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax... 76 1e-12 UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenom... 75 3e-12 UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacter... 75 3e-12 UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax... 74 5e-12 UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=... 72 2e-11 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 72 2e-11 UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotro... 72 2e-11 UniRef50_B3CQQ1 Putative transposase n=3 Tax=Orientia tsutsugamu... 72 3e-11 UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostri... 72 3e-11 UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatex... 71 4e-11 UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiat... 71 4e-11 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 71 4e-11 UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobac... 71 5e-11 UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscil... 71 5e-11 UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenom... 71 6e-11 UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryante... 70 8e-11 UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfo... 70 8e-11 UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax... 70 8e-11 UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfi... 70 1e-10 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 70 1e-10 UniRef50_A8VV66 ATPase associated with various cellular activiti... 68 3e-10 UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xen... 68 4e-10 UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultu... 68 5e-10 UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfo... 68 5e-10 UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanoba... 66 1e-09 UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiat... 66 1e-09 UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanoth... 66 2e-09 UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostri... 66 2e-09 UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacte... 65 3e-09 UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria ... 64 7e-09 UniRef50_C1J8G9 YdgA n=11 Tax=Enterobacteriaceae RepID=C1J8G9_ECOLX 64 8e-09 UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotro... 63 1e-08 UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacte... 63 1e-08 UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabac... 63 2e-08 UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cya... 62 2e-08 UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecali... 62 2e-08 UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingoba... 62 2e-08 UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8Y... 62 3e-08 UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabac... 62 3e-08 UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Trepone... 62 3e-08 UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachys... 62 3e-08 UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococca... 62 4e-08 UniRef50_Q1NK38 Putative uncharacterized protein n=2 Tax=delta p... 61 4e-08 UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevote... 61 4e-08 UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostri... 61 5e-08 UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Rumino... 61 6e-08 UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax... 60 7e-08 UniRef50_B4VZ11 Putative uncharacterized protein n=1 Tax=Microco... 60 9e-08 UniRef50_Q6ZEK6 Slr5124 protein n=11 Tax=Chroococcales RepID=Q6Z... 60 9e-08 UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteri... 60 1e-07 UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coproco... 60 1e-07 UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiat... 60 1e-07 UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microco... 59 1e-07 UniRef50_Q2FSG0 Putative uncharacterized protein n=1 Tax=Methano... 59 1e-07 UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolba... 59 2e-07 UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostr... 59 2e-07 UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Ricket... 59 2e-07 UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacter... 59 2e-07 UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bactero... 59 2e-07 UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax... 59 2e-07 UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacter... 59 2e-07 UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachys... 58 3e-07 UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacte... 58 3e-07 UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255... 58 3e-07 UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadoba... 58 3e-07 UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorob... 58 3e-07 UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostri... 58 4e-07 UniRef50_Q8YQI6 All3837 protein n=4 Tax=Cyanobacteria RepID=Q8YQ... 58 4e-07 UniRef50_Q899X1 Putative uncharacterized protein n=2 Tax=Clostri... 58 4e-07 UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotro... 58 4e-07 UniRef50_B4B4Q2 Putative uncharacterized protein n=1 Tax=Cyanoth... 58 5e-07 UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bactero... 58 5e-07 UniRef50_A7M2M6 Putative uncharacterized protein n=2 Tax=Bactero... 58 5e-07 UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Trepon... 58 5e-07 UniRef50_C4FIG5 Putative uncharacterized protein n=1 Tax=Sulfuri... 57 6e-07 UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax... 57 6e-07 UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfi... 57 6e-07 UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostri... 57 6e-07 UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orienti... 57 6e-07 UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax... 57 7e-07 UniRef50_A7BN25 Putative uncharacterized protein n=3 Tax=Beggiat... 57 7e-07 UniRef50_A1WV23 Putative uncharacterized protein n=1 Tax=Halorho... 57 8e-07 UniRef50_C9LBM4 Putative uncharacterized protein n=1 Tax=Blautia... 56 1e-06 UniRef50_Q6D2V6 Putative uncharacterized protein (Fragment) n=1 ... 56 1e-06 UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulic... 56 2e-06 UniRef50_A8YH27 Similar to tr|Q8YMI8|Q8YMI8 n=19 Tax=Cyanobacter... 56 2e-06 UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=... 56 2e-06 UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacteriu... 56 2e-06 UniRef50_B7UFQ6 Predicted protein n=11 Tax=Escherichia RepID=B7U... 56 2e-06 UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacte... 56 2e-06 UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostri... 55 2e-06 UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=... 55 3e-06 UniRef50_UPI00016C0F09 hypothetical protein Epulo_07618 n=2 Tax=... 55 3e-06 UniRef50_C1DU30 Putative uncharacterized protein n=7 Tax=Sulfuri... 55 3e-06 UniRef50_Q5L374 Transposase n=26 Tax=Bacillaceae RepID=Q5L374_GEOKA 55 4e-06 UniRef50_B0JHW4 Transposase n=31 Tax=Cyanobacteria RepID=B0JHW4_... 55 4e-06 UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Rosebur... 55 4e-06 UniRef50_C1TQY0 Putative transposase, YhgA n=1 Tax=Dethiosulfovi... 55 4e-06 UniRef50_C1J8S3 YdgA n=6 Tax=Escherichia coli RepID=C1J8S3_ECOLX 55 4e-06 UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia... 55 4e-06 UniRef50_B0JU44 Putative uncharacterized protein n=4 Tax=Microcy... 55 5e-06 UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobac... 55 5e-06 UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobac... 54 5e-06 UniRef50_C3PNP1 Transposase and inactivated derivative n=1 Tax=R... 54 6e-06 UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostr... 54 6e-06 UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostri... 54 6e-06 UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacte... 54 7e-06 UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bactero... 54 8e-06 UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia... 53 8e-06 UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachys... 53 9e-06 UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfi... 53 1e-05 UniRef50_Q9L0J0 Putative uncharacterized protein SCO4675 n=4 Tax... 53 1e-05 UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microco... 53 1e-05 UniRef50_C1QAK6 Putative uncharacterized protein n=1 Tax=Brachys... 53 1e-05 UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YM... 53 1e-05 UniRef50_C9LT45 Putative uncharacterized protein n=2 Tax=Selenom... 53 1e-05 UniRef50_D1PGQ2 Transposase, ISNCY family n=2 Tax=Prevotella cop... 53 1e-05 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. Identities = 300/300 (100%), Positives = 300/300 (100%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS Sbjct: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL Sbjct: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK Sbjct: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES Sbjct: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI Sbjct: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 291/314 (92%), Positives = 295/314 (93%), Gaps = 16/314 (5%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MDAPSTTPHDAVFKQFLMHAETARDFL+IHLP ELRELCDL+TLHLESGSFIEESLKGHS Sbjct: 1 MDAPSTTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHS 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL Sbjct: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK Sbjct: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG+S Sbjct: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKS 240 Query: 241 MMTLAQWFE----------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 MMTLAQWFE EKGIEKGIQQGRQEVSQEFA RLLSKGM REDVAE Sbjct: 241 MMTLAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPREDVAE 300 Query: 285 MANLPLAEIDKVIN 298 MANLPLAEIDK+IN Sbjct: 301 MANLPLAEIDKLIN 314 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 353 bits (907), Expect = 3e-96, Method: Composition-based stats. Identities = 145/306 (47%), Positives = 208/306 (67%), Gaps = 13/306 (4%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 + TPHDA F+QFL + ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + +DV Sbjct: 6 TTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDV 65 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 LYS++ GY+HV++EHQS PDK MAFR++RY++AAM RHLEA H KLPLV+P+LFY Sbjct: 66 LYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVLFYT 125 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 G+ +PYP S W D F LA ++Y+S FPLVD+T+ PDDEI HR +A L LLQKHI Sbjct: 126 GKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHIH 185 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF-YGVLRDRETGGESMM 242 QRDL L+++L ++ GY S SQ++++ +Y++Q G T A+ F + + G+++M Sbjct: 186 QRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDALM 245 Query: 243 TLAQWFEEKGIEKGIQ------------QGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 T+AQ E+KGIEKGIQ +G +E + + A+ +L + R V +M L Sbjct: 246 TIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTGLTE 305 Query: 291 AEIDKV 296 ++ ++ Sbjct: 306 DDLAQI 311 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 343 bits (879), Expect = 6e-93, Method: Composition-based stats. Identities = 152/305 (49%), Positives = 206/305 (67%), Gaps = 13/305 (4%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + TPHDA F+QFL E ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + +DVL Sbjct: 7 TPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDVL 66 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 YS+ GY+HV+IEHQS PDK MAFR++RY+IAAM RHLEA H KLPLV+P+LFY G Sbjct: 67 YSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAKLPLVIPVLFYVG 126 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + +PYP S W D F PELA ++Y+ FPLVD+T+ PDD+IM+HR +A L LLQKHI Q Sbjct: 127 KRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQKHIHQ 186 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF-YGVLRDRETGGESMMT 243 RD+ L ++L TL+ Y S Q++A+ +Y+LQ G + ++ F + + G+++MT Sbjct: 187 RDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMT 246 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQ------------EFAQRLLSKGMSREDVAEMANLPLA 291 +AQ E+KGIEKG +GR E Q E A+ LL GM E V E L Sbjct: 247 IAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATGLSED 306 Query: 292 EIDKV 296 ++ ++ Sbjct: 307 DLAQI 311 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 342 bits (877), Expect = 1e-92, Method: Composition-based stats. Identities = 138/307 (44%), Positives = 203/307 (66%), Gaps = 10/307 (3%) Query: 1 MDAP-STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M + TPHDAVF+QFL TA+DF +I LP +++ LCD TL ESGSFI+ +K + Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D+LYSV G GY++ +IEHQS PDK MA+R+MRYS+AAM RHLEA HDKLPLV P+ Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDKLPLVFPV 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 LFY GE +P+P S W D F P++A ++Y+ PF L+D+T DD IMQHRR+A+LEL+Q Sbjct: 121 LFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELIQ 180 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGG 238 KHIR+RD+ LL+ +V L+ Y + +Q+V M NY++Q G+ + + E Sbjct: 181 KHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKHE 240 Query: 239 ESMMTLAQWFEEK--------GIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 E++MT+A+ +++ G ++GIQQG + + A+++LS+G++R+ V L Sbjct: 241 EALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLSD 300 Query: 291 AEIDKVI 297 +D ++ Sbjct: 301 NALDNLM 307 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 333 bits (854), Expect = 4e-90, Method: Composition-based stats. Identities = 143/295 (48%), Positives = 202/295 (68%), Gaps = 1/295 (0%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + TPHDAVFKQFL ETA+DF +I LP E++ LCDL++L +ESGSFI+ +K + +D+L Sbjct: 12 TPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNYQSDIL 71 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 YSV GY++V+IEHQS PDK +A+R+MRYS+AAM +HLE + +LPLV PILFY G Sbjct: 72 YSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQLPLVFPILFYCG 131 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 E +P+P S W D F +LA +YN+PF L D+T D EIMQH+RIA+LELLQKHIR+ Sbjct: 132 EQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELLQKHIRR 191 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-DLFYGVLRDRETGGESMMT 243 RD+ LL+ +V L+ Y + +Q++ M NY++Q G+ ++ + + + E ++MT Sbjct: 192 RDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKHEGALMT 251 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 +AQ EE GI+KGIQQG Q+ E A++ L+ G+ R V L E++K N Sbjct: 252 IAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNKFEN 306 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 329 bits (844), Expect = 6e-89, Method: Composition-based stats. Identities = 153/309 (49%), Positives = 209/309 (67%), Gaps = 17/309 (5%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 ++ +++PHDAVFK F+ ETARDFLEIHLP LR+LC+L TL LE SFIE+SL+ + + Sbjct: 3 ESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYS 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DVL+SV+ GY++ VIEHQS +K MAFR+MRY+ AAM RHL+ +D++PLVVP+LF Sbjct: 63 DVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLF 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y GEA+PYP S+ W D F P+LAR++Y FPLVDITI PDDEIMQHRRIA+LEL+QKH Sbjct: 123 YHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKH 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET-GGES 240 IR RDL+ +++++ TL+ G+T+ SQL + NY+LQ G T + F + +R E Sbjct: 183 IRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQKEI 242 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQE----------------FAQRLLSKGMSREDVAE 284 +MT+A+ ++G + G Q+G+ E QE A R+L +G RE V Sbjct: 243 LMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREIVLA 302 Query: 285 MANLPLAEI 293 L A+I Sbjct: 303 ATQLTDADI 311 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 325 bits (833), Expect = 1e-87, Method: Composition-based stats. Identities = 152/292 (52%), Positives = 210/292 (71%), Gaps = 5/292 (1%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + ++TPHDAVFK FL H +TARDF++IHLP LR+LCDL TL LE SFI+E L+ + +D Sbjct: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 +L+SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY Sbjct: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 G +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHI Sbjct: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESM 241 RQRDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R E + Sbjct: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 MT+A E+G QG+ E + AQ +L +G+ RE V + L ++ Sbjct: 244 MTIADRLREEGAM----QGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 320 bits (819), Expect = 5e-86, Method: Composition-based stats. Identities = 146/330 (44%), Positives = 204/330 (61%), Gaps = 36/330 (10%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + TPHDA+FK+FL H +TARDFLEIHLP LR +CDL+TL LESGSFIE++L+ H +D Sbjct: 4 KNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVHYSD 63 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 +LYS++ Y++ VIEHQS PDK MAFR+MRYSI+AM HLE H KLPLV+P+LFY Sbjct: 64 ILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKKLPLVIPVLFY 123 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 G+ PYP S WFD F + LA +Y+S FPLVD+T+ PDDEI+ H+R+A+LE++QKHI Sbjct: 124 HGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIVQKHI 183 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESM 241 RQRD+ L ++L L Y + L +M NY+L G T + F L ++ E + Sbjct: 184 RQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKYEEVL 243 Query: 242 MTLAQWFEEKGIEKGIQQG-------RQEVSQE--------------------------- 267 MT+AQ + KG ++G+++G R+E QE Sbjct: 244 MTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEKRASL 303 Query: 268 -FAQRLLSKGMSREDVAEMANLPLAEIDKV 296 A+ L+ G+ RE + + L E++++ Sbjct: 304 KIARALMDNGIDRETIMKSTGLSQNELEQI 333 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 305 bits (781), Expect = 1e-81, Method: Composition-based stats. Identities = 115/321 (35%), Positives = 181/321 (56%), Gaps = 25/321 (7%) Query: 1 MDAPST-TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M S PHD+ FK F+ + ARDF E+HLP ++ LC+ +TL L S SF++++L+ Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D+LYSVQ GY + ++EHQS PDK M +R+M Y+ AM++HL+ H LPLVVPI Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQGHQSLPLVVPI 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 LFY G +PYP S W D F +LA +Y +P PLVD+T+ DDE+M HR++A +EL+ Sbjct: 121 LFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELVF 180 Query: 180 KHIRQR-DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETG 237 KH R D+ L E+L +++ ++ + NY+ T + ++ E Sbjct: 181 KHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEKH 240 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVS---------------QEFAQRL-------LSK 275 E++M +AQ +G+EKG+++GR+E Q+ A L L Sbjct: 241 QETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLKL 300 Query: 276 GMSREDVAEMANLPLAEIDKV 296 G+S + ++++ L ++I + Sbjct: 301 GLSVDIISQITGLSPSDIHAL 321 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 304 bits (778), Expect = 3e-81, Method: Composition-based stats. Identities = 133/308 (43%), Positives = 194/308 (62%), Gaps = 15/308 (4%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + + TPHDA F+ FL + + ARDFLE+HLP E R+LCDL+TL LE +F+E L +++ Sbjct: 6 NTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYAS 65 Query: 62 DVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D+L+SV+ G GY++ +IEHQS + M FRM+RYS+AAM RHLE H LPLV+P+L Sbjct: 66 DILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVL 124 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FY GE +PYP SM W D F +P LA ++Y PFPLVDIT+ D+EIM HRR+A L LL K Sbjct: 125 FYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLMK 184 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRD+++ L+ LV + + Q+ + NY+L + + + +S Sbjct: 185 HIRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNYLLNGSEHVTVEFLQTLAQRLPQHEDS 243 Query: 241 MMTLAQWFEEKGIEKGIQQGRQ------------EVSQEFAQRLLSKGMSREDVAEMANL 288 +MTLA+ +++GI++GIQQG Q + ++E A+ L + GM + ++ L Sbjct: 244 IMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAREIARELRNAGMPAAQICQLTGL 303 Query: 289 PLAEIDKV 296 AE+ + Sbjct: 304 SEAELKNI 311 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 301 bits (770), Expect = 2e-80, Method: Composition-based stats. Identities = 118/296 (39%), Positives = 185/296 (62%), Gaps = 6/296 (2%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + PHDA+FK+FL H AR FLEIHLP +RE CDL+ L + +FIE L +DVL Sbjct: 5 SAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYSDVLL 64 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 S++ GY++ +IEHQS PDK M RMMRY++AA+ RHL+ H +PLV+PILFYQG+ Sbjct: 65 SMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHDVPLVIPILFYQGK 124 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 +PYP SM W + F +P LA++++ FPLVD+T+ PD+EIM HR +A LE+ K IR R Sbjct: 125 TSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKIIRLR 184 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL-RDRETGGESMMTL 244 D++ ++ + TL+ Y + + + Y+L+ G+T+ + +L + + +MT+ Sbjct: 185 DILENIDPMATLLALDY-NDDLSIDVVFYLLRYGNTDDREKIVKILIQAKPQLEGKIMTI 243 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEF----AQRLLSKGMSREDVAEMANLPLAEIDKV 296 + + ++ ++G Q+GR+E QE AQR+L + + ++ L E+ ++ Sbjct: 244 EEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQL 299 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 117/312 (37%), Positives = 181/312 (58%), Gaps = 25/312 (8%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD+ FK F+ + ARDF EI+LP ++ LC+L+TL L S SFI+++L+ +D+LYSV Sbjct: 13 PHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSRFSDMLYSV 72 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 Q GY ++++EHQS PDK M +R+M Y+ AM++HL+ ++ LPLVVPILFY G+ + Sbjct: 73 QTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNALPLVVPILFYHGKQS 132 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 PYP S W D F +LA +Y +P PLVD+T+ DDEI+ HR++A +EL+ KH RD Sbjct: 133 PYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELVLKHSTLRDD 192 Query: 188 MLLL-EQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG-GESMMTLA 245 +++L E+L +I E ++ + NY+ T L ++ G E++MT+A Sbjct: 193 LIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEGYQETVMTIA 252 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRL-----------------------LSKGMSREDV 282 +G+EKG+ +GR+E E L G+S + + Sbjct: 253 DRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSLDLGLSIDII 312 Query: 283 AEMANLPLAEID 294 +++ LP +EI Sbjct: 313 SKITGLPHSEIQ 324 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 293 bits (750), Expect = 5e-78, Method: Composition-based stats. Identities = 128/317 (40%), Positives = 189/317 (59%), Gaps = 21/317 (6%) Query: 1 MDAPSTTP-HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M + P HDA+FKQFL H E ARDF +HLP + LCDL+TL LE SF+E L+ Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--DKLPLVV 117 +DVLYSVQM GY++ +IEHQSKPD+ M FR+M Y+++A+ HL+ LPLVV Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 P LFYQG PYP SM W D F P LA+++Y FPLVD+++ D+EI+ H+ IA+LEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 178 LQKHIRQRD-LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 +QKHIR RD LM +L + +I+ + + Q+ ++ Y+ +G+ F+ L Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSP 240 Query: 237 GGESM-MTLAQWFEEKGIEKGIQQGRQEVS----------------QEFAQRLLSKGMSR 279 ++M T+A+ E+KGIEKGI++G ++ ++ A+ LL +G+ Sbjct: 241 EYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLLQQGVDL 300 Query: 280 EDVAEMANLPLAEIDKV 296 + + L +I+ + Sbjct: 301 NIIMQCTGLTREKIESL 317 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 291 bits (744), Expect = 2e-77, Method: Composition-based stats. Identities = 102/305 (33%), Positives = 172/305 (56%), Gaps = 9/305 (2%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HDA+FK F E A F+ I+LP +++ CD +TL +E GSF++ LK H Sbjct: 1 MSIQIHNAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +D+LYS+++ G GY+++ +EHQS ++ M FRM RY +A M +HL + KLPLV+ +L Sbjct: 61 SDILYSLKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKKLPLVISML 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FY G+ YP + D A+ + L+D+ + PD+EI +H+++A LE++QK Sbjct: 121 FYHGKGQ-YPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQK 179 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HI RDL + + +V L+ + + YML +G T + L+ E E Sbjct: 180 HIFTRDLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKLKTIEDYEED 239 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQE--------FAQRLLSKGMSREDVAEMANLPLAE 292 +M AQ +++G ++G+ +GRQE Q+ A++L+++G S + + ++ NL E Sbjct: 240 IMNAAQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENE 299 Query: 293 IDKVI 297 + ++ Sbjct: 300 VLSLV 304 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 287 bits (734), Expect = 4e-76, Method: Composition-based stats. Identities = 132/292 (45%), Positives = 187/292 (64%), Gaps = 13/292 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA+FKQFL ARDFL IHLP +RE CD NTL LES SFI+E L+ +DVLYS+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 GY++ VIEHQS+P+K+MAFR++RY +AAM +HL+ HD+LPLVVP+LFY G + P Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDRLPLVVPLLFYHGRSRP 123 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM 188 YP S+ W D F +P LA+ +Y PFPLVD+T+ PDDEI HRR+A+LEL+QKHIR RD++ Sbjct: 124 YPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRDML 183 Query: 189 LLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWF 248 L ++ L + S + ++ M ++ G+ R + G LAQ Sbjct: 184 ELAREIGLLFERWAAPLS--IGQEDIMTIAEQLKKMGFDEGIQRGIQQG------LAQ-- 233 Query: 249 EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 G+E+GI+QG + +++ A+ LL GM + V + L E+++++ I Sbjct: 234 ---GLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLETEELEQLVTAI 282 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 273 bits (698), Expect = 6e-72, Method: Composition-based stats. Identities = 105/304 (34%), Positives = 164/304 (53%), Gaps = 9/304 (2%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 ++ HDA F++ L ARDFLE L + C+L+T+ LE +F+ ESL+ + DV Sbjct: 7 KTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSACDV 66 Query: 64 LYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L S++ G GY++ +IEHQS PDK + RMMRY +A M +H+E H P+V+P+LFY Sbjct: 67 LLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEE-HKCAPVVIPVLFY 125 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVY--NSPFPLVDITITPDDEIMQHRRIAILELLQK 180 G PYP M W D P R +Y PF LVD++ DDEI + R+A L K Sbjct: 126 HGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMFTMK 185 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 D++ L+ + +TL D Y S L + Y+L+ + A+L V + Sbjct: 186 SGTSGDVIELIGKSITLTD-KYGSSVHLNTVLTYLLELYQMDFAELSEAVSTHYPSHKGV 244 Query: 241 MMTLAQWFEE----KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +MT+A+ EE KG+EKG+++GR E + +G S E++ + +L ++ + Sbjct: 245 IMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDEQLLQA 304 Query: 297 INLI 300 ++ + Sbjct: 305 LDYV 308 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 105/275 (38%), Positives = 165/275 (60%), Gaps = 17/275 (6%) Query: 41 LNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIA 100 L+TL + SGSFIE+ L +D+LYS++ Y++ +IEHQS P+ MAFR++RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 101 AMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITI 160 AMHRHLE ++ +LP+V+PILFY G +PYP + W D F +LA VY FPLVD+T Sbjct: 63 AMHRHLEQENKQLPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVTA 122 Query: 161 TPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 D+EI++HRR+A++E++QKHIR R+++ L +L L+++ S Q + Y++ G+ Sbjct: 123 MEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAGN 182 Query: 221 TEQADLFYGVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQ------------- 266 T + F L + E MMT+A+ E KG++KGIQ G ++ + Sbjct: 183 TTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKKQ 242 Query: 267 ---EFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + A++ L G+ R+ V L +I+ V+N Sbjct: 243 ATLKIARQFLVNGVERDIVKMSTGLTDRDINDVLN 277 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 271 bits (692), Expect = 3e-71, Method: Composition-based stats. Identities = 101/328 (30%), Positives = 172/328 (52%), Gaps = 31/328 (9%) Query: 1 MDAPS-TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M+ P + HDA FK+F+M+ A+DF IHL EL+ CD +TL L++ SFI+ L+ Sbjct: 1 MNKPLLISSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSR 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D+LYSV+ + ++ +IEHQS+PDK +A+RMM Y+ M++HL+ + LPLVVPI Sbjct: 61 MSDILYSVKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQGYTSLPLVVPI 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 LFY G+ PYP S+ W D F LA ++Y + F L+D+ D+ ++ HR+ A++E+ Sbjct: 121 LFYHGKRKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAM 180 Query: 180 KHIRQRDLMLLLEQLV-TLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETG 237 KH+ D + L L+ I++ S +A+ Y+ + + + + Sbjct: 181 KHVNSCDDLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDNH 240 Query: 238 GESMMTLAQWFEEKGIE----------------------------KGIQQGRQEVSQEFA 269 E++M +A E KG + +GI+ G++ V + A Sbjct: 241 RETIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQLA 300 Query: 270 QRLLSKGMSREDVAEMANLPLAEIDKVI 297 + LL + + E + + L + E+ ++ Sbjct: 301 KNLLKENVELEFIERITGLSIQELKILL 328 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 118/303 (38%), Positives = 180/303 (59%), Gaps = 17/303 (5%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK FL +TARDFL +HLP ++R L+TL LE GSF+++ L+ +DVLYSV+ Sbjct: 12 HDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLRELHSDVLYSVE 71 Query: 69 M-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 +G+ GY++ ++EHQS D+ MA+RMMRYS+A M HL+ + LP+VVP+LFYQG Sbjct: 72 TAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGNGTLPVVVPLLFYQGMVR 131 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 PYP S W D F P LAR VY+ P+PLVD+++ D ++ HRR+A+LEL+Q+ IR RD Sbjct: 132 PYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLELVQRDIRHRDA 191 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA--DLFYGVLRDRETGGESMM-TL 244 LL +V LI + +Q+ A+ Y++ G T ++ Y + + E +M T+ Sbjct: 192 ASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGEIPEYKELIMGTI 251 Query: 245 AQWFE------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 AQ + ++ + +++ Q+ E A LL G+S E V + L Sbjct: 252 AQQLKEEGIQQGIQQGIQQERQASLER-EQKTLLETAYALLDNGVSLEVVIKSTGLNRET 310 Query: 293 IDK 295 +++ Sbjct: 311 LEQ 313 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 265 bits (678), Expect = 1e-69, Method: Composition-based stats. Identities = 106/278 (38%), Positives = 149/278 (53%), Gaps = 10/278 (3%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 + TPHDA F+QFL + ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + +DV Sbjct: 6 TTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDV 65 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSI-AAMHRHLEADHDKLPLVVPILFY 122 LYS++ + + S+ + F + AAM RHLEA H KLPLV+P+LFY Sbjct: 66 LYSLKTTAGDDIFMSWL-NTSQHLTNICFPPDTLCVGAAMQRHLEAGHKKLPLVIPVLFY 124 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFP-LVDITITPDDEIMQHRRIAILELLQKH 181 G+ +PYP S W D F R+ LVD+T+ PDDEI HR +A L LL ++ Sbjct: 125 TGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLLPEN 184 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQN----YMLQRGHTEQADLFYGVLRDRETG 237 I + + +T Y S +A Y R + + L Sbjct: 185 IF---ISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNIRRRSLCTRTGTACAQH 241 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK 275 G+++MT+AQ E+KGIEKGIQ G Q ++ + Sbjct: 242 GDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGERE 279 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 262 bits (669), Expect = 1e-68, Method: Composition-based stats. Identities = 88/305 (28%), Positives = 157/305 (51%), Gaps = 7/305 (2%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HD +FK L A FL+ L E+ +L ++ TL L SF+ + Sbjct: 1 MAMTIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPD-KKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D++Y Q+ GY+ ++EH+S + MAFR ++Y+I+AM ++ + KLP+V+PI Sbjct: 61 SDIVYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKKLPIVLPI 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 Y G +PYP S +D F + ++AR++ PF L+D+T+ D+E+ + ++E+L Sbjct: 121 CVYHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLL 180 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRDR--- 234 KH R ++ + +L + + I + + YM+ E + +++ Sbjct: 181 KHSRAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLSTA 240 Query: 235 -ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 +MMT AQ ++G+E+G++QGR E + A+ LL GMS + V + L E+ Sbjct: 241 FPEEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEV 300 Query: 294 DKVIN 298 ++N Sbjct: 301 MNLVN 305 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 256 bits (655), Expect = 6e-67, Method: Composition-based stats. Identities = 120/308 (38%), Positives = 182/308 (59%), Gaps = 14/308 (4%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 +TPHD +FK+F AR+F EIHLP + ++ +L + GSFI++SLK +D+ Sbjct: 2 KISTPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDM 61 Query: 64 LYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 +YS + G GYL+ V+EHQS DK MAFRM +YS+A M +HL+ HD LPLV+P+LFY Sbjct: 62 VYSFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDTLPLVLPVLFY 121 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 G+ +PYP SM W D F ELAR + + PFPLVD+T+ P++EIM+H I+ LE+ QK + Sbjct: 122 HGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMV 181 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 RD+M + L+ L + ++ Y+ Q G T LF+ L E++M Sbjct: 182 HTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSSTTQ-RENVM 240 Query: 243 TLAQWFE------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 T+A+ + E+G E+G ++GR+E +E A+ LL+ G S + V L Sbjct: 241 TIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLSE 300 Query: 291 AEIDKVIN 298 ++K+++ Sbjct: 301 DSLNKLLD 308 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 83/247 (33%), Positives = 138/247 (55%), Gaps = 2/247 (0%) Query: 24 RDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQ 83 + F IHLP EL+ CD +TL L++ SFI+ L+ +D+LY V+ + ++++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 84 SKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPE 143 S+PDK +A+RMM Y+ M++HL+ + LPLVVPILFY G+ PYP + W + F Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKSLPLVVPILFYHGKKKPYPFPVNWMECFPLSS 125 Query: 144 LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ-RDLMLLLEQLVTLIDEGY 202 LA +Y++ F L+D+T DD ++ H++ A++E+ KH+ DL + L I++ Sbjct: 126 LANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQKN 185 Query: 203 TSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGR 261 VA+ Y+ + + +R + E++M +A E KG + GI +G Sbjct: 186 CRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEGF 245 Query: 262 QEVSQEF 268 + + Sbjct: 246 EIGKLKV 252 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 82/330 (24%), Positives = 151/330 (45%), Gaps = 38/330 (11%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 PHD FK+ AR FL+ +LP E+ L DL T+ + S+I++ L+ +D+L+ Sbjct: 5 HNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFSDLLF 64 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH-DKLPLVVPILFYQG 124 V++ N GYL+ + EH+S P + +A ++++Y + L+ DKLPL++P++ Y G Sbjct: 65 QVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMVVYHG 124 Query: 125 EATPYPLSMCWFDMFYSPE-----LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 + + S+ + + E + + + + L D++ D E++ + + I+ Sbjct: 125 QE-KWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIILRTM 183 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQRGHTEQADLFYGVLRD 233 + I +D L L+ Q + Y+L + + Y + ++ Sbjct: 184 RDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEIAKE 243 Query: 234 RE-TGGESMMTLAQWFE------------------------EKGIEKGIQQGRQEVSQEF 268 GE MMT+A+ EKG E+G+++GR+E E Sbjct: 244 VSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETKLEV 303 Query: 269 AQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 A+ LL G+ + VA+ L EI K++N Sbjct: 304 ARNLLGLGIEMDKVAKATGLSEEEIRKLMN 333 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 230 bits (587), Expect = 5e-59, Method: Composition-based stats. Identities = 74/309 (23%), Positives = 154/309 (49%), Gaps = 17/309 (5%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 PHD FK+ + A+DF+ +LP+EL ++ D+ TL E +IE+ LK +D+L+ Sbjct: 5 HQPHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLLF 64 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM-HRHLEADHDKLPLVVPILFYQG 124 + G GYL+ + EH+S P K++A +++ Y + + L+ +K+P+++P+ Y G Sbjct: 65 KANINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYHG 124 Query: 125 EATPYPLSMCWFDMFYSPELA-----RRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 + + +++ D+ E + + + + D++ DDE+ ++ I+ + Sbjct: 125 KE-NWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKIL 183 Query: 180 KHIRQRD--LMLLLEQLVTLIDEGYTSG---SQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 + I + D + ++ V ++D+ Y+L Y ++++ Sbjct: 184 RSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLVKEV 243 Query: 235 ETGG-ESMMTLAQWF----EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 + +MT+A+ EKG+EKG+++G+ E +E A+ L+ G+ + V + L Sbjct: 244 SVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGLS 303 Query: 290 LAEIDKVIN 298 EI+K++N Sbjct: 304 EEEINKLLN 312 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 71/312 (22%), Positives = 131/312 (41%), Gaps = 13/312 (4%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M S +PHDA+FK + A L+ L + D +TL E GS+I+E+L Sbjct: 1 MHGTSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPI 119 +D+L+S + G Y++++IEHQS D+ M RM+ Y RH A + LP ++P+ Sbjct: 61 SDLLFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPV 120 Query: 120 LFYQGEATPYPL----SMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 + S+ PEL + + D+T D ++ + Sbjct: 121 VVSHAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFA 180 Query: 176 ELLQKHIRQR----DLMLLLEQLVTLIDEGYTSGSQLVA---MQNYMLQRGHTEQADLFY 228 L+ +R R +L+ + + E + + + A + +Y+ Q F+ Sbjct: 181 TLVLWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFH 240 Query: 229 GVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 L + E M T + E+G+ KG+ +GR+E ++ L + + A+ Sbjct: 241 AKLDEHVPQTREVMKTYYEELMEEGMAKGLAKGREEGREQSRIETLQETLIDLLSAKFDL 300 Query: 288 LPLAEIDKVINL 299 L +++ + Sbjct: 301 RELEHAERIRSA 312 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 75/298 (25%), Positives = 153/298 (51%), Gaps = 11/298 (3%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD++ K + A++FLE +LP + ++L DL+ + +E S+IEESL +D++Y ++ Sbjct: 7 HDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGIE 66 Query: 69 MQG-NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 + G+++++IE QS D A R+ +Y++ RH + +KLPLV ++ Y G+ Sbjct: 67 TKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERH-KEKRNKLPLVYNLVIYNGKQV 125 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 Y +D+F + +A+++ + LVD+ D+EI++ + I +L+ + KHI +RD+ Sbjct: 126 -YNAPRNLWDLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERDM 184 Query: 188 MLLLEQLVTLI--------DEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 + L EQ + ++GY + + + + + + + Sbjct: 185 IQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKDN 244 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 M T+A + ++G ++G ++G + A+++ S+G +AE+ L I +I Sbjct: 245 IMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSII 302 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 224 bits (570), Expect = 4e-57, Method: Composition-based stats. Identities = 90/302 (29%), Positives = 154/302 (50%), Gaps = 17/302 (5%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + PHD + K L H E ++F + + P ++ + DL +L L + S++ E L+ D+++ Sbjct: 10 SNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHNDLVF 69 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQ 123 S + PGY V+EHQS PD MA R ++Y+IA + +++ +K P++V I Y Sbjct: 70 SFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNICLYH 129 Query: 124 G-EATPYPLSMCWFDMFYSPELARRV-YNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 PYP S +D+F P A+ + + F L D+ TP++ + QH I ++E L K+ Sbjct: 130 NANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEKLLKY 189 Query: 182 IRQRDLMLLLEQLVT-----LIDEGYTSGSQLVAMQNYMLQRGHTEQ--ADLFYGVLRDR 234 R RD+ ++E+ + LI G + L+ + Q +E+ LF VL Sbjct: 190 SRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEVLSKN 249 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 E E M+T+AQ EE+G + G++ A+ +L KG + E+ L +I+ Sbjct: 250 E--EEIMITIAQTIEERGEMR----GKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIE 303 Query: 295 KV 296 K+ Sbjct: 304 KL 305 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 219 bits (559), Expect = 8e-56, Method: Composition-based stats. Identities = 72/283 (25%), Positives = 132/283 (46%), Gaps = 17/283 (6%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M +P + PHDAVF++ L A L LP L DL+ L + GS ++ +L+ Sbjct: 1 MSSPPS-PHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRH 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--KLPLVVP 118 TD+L++ + G+ +++V++EHQS D MAFRM+RY + R+L H +LP VVP Sbjct: 60 TDLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVP 119 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELA----RRVYNSPFPLVDITITPDDEIMQHRR--- 171 ++ + E + + +P+LA + F L D+ + E+ + Sbjct: 120 LVVHHNEHAWVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHS 179 Query: 172 IAILELLQKHIR-----QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL 226 + + LL K + +DL +++L ++D G + + Y+ G + D Sbjct: 180 VRLTLLLLKIVPGNPRLAQDLRPWVDELRAVLD-GPDGREEFATLLRYIELVGEADARDE 238 Query: 227 FYGVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF 268 + ++ ++ MT+A+ +G +G +GR E + Sbjct: 239 LHDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVESLLQL 281 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 219 bits (558), Expect = 9e-56, Method: Composition-based stats. Identities = 81/278 (29%), Positives = 143/278 (51%), Gaps = 9/278 (3%) Query: 12 VFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQG 71 +F++ L + A +F HLP ++ L D +L +E+ +F+E SLK +DVL+S + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEATPY 129 GYL +++EHQSK D MAFR+ +Y I R+L LPL+ P++F+ G+ Y Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQE-KY 141 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 ++ +D+F + +LA+ ++ + + LV++ PD+E Q ILE KHI +R+L+ Sbjct: 142 NVARNLWDLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 190 LLEQLVTLIDEGYTS---GSQLVAMQNYMLQRGHTEQADLFYGVLRDR---ETGGESMMT 243 +++ ++ E L + Y L + +L + E G M + Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTKIEQADKIKLKNLLSTKLNPEIGTRLMRS 261 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 LA+ ++++G E GI +G Q + Q +KG+ Sbjct: 262 LAEHWQQEGKEIGILEGLQVGEAKGIQIGEAKGIQIGK 299 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 219 bits (558), Expect = 9e-56, Method: Composition-based stats. Identities = 74/302 (24%), Positives = 130/302 (43%), Gaps = 9/302 (2%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 T PHD K L + L LP E+ EL L G+FI+ + H TD Sbjct: 2 TKITQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREHLTD 61 Query: 63 VLYSVQMQGNPG-YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 L+ V+ Q Y++ +IEH+S D+ +AF+++RY + R L+ KLP +VP++ Sbjct: 62 RLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQQKLPPIVPLVV 121 Query: 122 YQGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G + + + + + + L + + F + D+ DD++ Q + + K Sbjct: 122 YHGARE-WTVPNQFSALLEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALMAMK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 + Q +++ + +G ++LV Y++Q + GE+ Sbjct: 181 YAFQGAEGVVVIPQIGKGAQGDPEFAKLV--LRYLIQTYRGMTM-ADVQAYAEEAFPGEA 237 Query: 241 MMTLAQWFEE---KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +Q+ E KG ++G Q+GR+E QE Q S + R ++P KV Sbjct: 238 EHYASQFAREMMSKGRQEGRQEGRREGRQEGRQEGESSLLLRLLHRRFGDVPSWAELKVA 297 Query: 298 NL 299 N Sbjct: 298 NA 299 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 217 bits (553), Expect = 3e-55, Method: Composition-based stats. Identities = 86/337 (25%), Positives = 154/337 (45%), Gaps = 51/337 (15%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA+ K+ L A++FLE +LP + +EL DL + +E SF+E+ LK +D++YSV+ Sbjct: 7 HDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSVK 66 Query: 69 MQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 + +++V+IE QS D +A R+ +Y + RH E + +KLPL+ P+L Y G Sbjct: 67 TRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH-ENNKNKLPLICPLLIYNGSEV 125 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 Y ++++F PE A+++ + LVD+ DDEI Q + + ++E KHI QRD+ Sbjct: 126 -YNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRDM 184 Query: 188 MLLLEQLVTLIDE--------GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 + L ++ + GY V + + ++ + E Sbjct: 185 LKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQIIVKHLSTEEKDN 244 Query: 240 SMMTLAQWFEEKGIE----------------------------------------KGIQQ 259 M T+AQ + ++G++ +G + Sbjct: 245 IMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGKAEGKAEGKAE 304 Query: 260 GRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 G+ E E A+++LS+G ++ + L A I + Sbjct: 305 GKAEERVEIARKMLSQGCDFSFISSVTGLEEAFIRSL 341 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 217 bits (553), Expect = 4e-55, Method: Composition-based stats. Identities = 80/306 (26%), Positives = 135/306 (44%), Gaps = 25/306 (8%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HDA+FK E A L LP L D L L GSF++E+LK +D+L+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQG 124 M L+++ EHQS + MAFR++RY + HL +LP ++P++ + Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHHS 131 Query: 125 EATPYPLSMCWFDMFYSPELAR-----RVYNSPFPLVDITITPDDEIMQHRRIAILELL- 178 E + + + D+ E AR V F L DI+ D+ + A L+ Sbjct: 132 ETG-WTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVL 190 Query: 179 --QKHIRQRD-LMLLLEQLVTLIDEGYTSG---SQLVAMQNYMLQRGHTEQADLFYG--V 230 +H R+ D L+ L + + L++E + L A+ Y+L ++AD + Sbjct: 191 WCLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLL 250 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 E E +++ A E+G ++G+++G +E +L K + A LP Sbjct: 251 AAAGEPWKEEIVSAADQLMERGRQQGLREGLREGR----CHMLLKVLG----ARFGALPN 302 Query: 291 AEIDKV 296 + +V Sbjct: 303 DAVARV 308 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 216 bits (551), Expect = 6e-55, Method: Composition-based stats. Identities = 81/343 (23%), Positives = 150/343 (43%), Gaps = 56/343 (16%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK + + A DF+ LP E++ + DLNT+ +E SF+E +L+ DVL+SV+ Sbjct: 7 HDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDVLFSVK 66 Query: 69 MQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAA------MHRHLEADHDKLPLVVPILF 121 + N +++V+IE + + D +AF++ +Y+++ + + + KLP+VVPI+ Sbjct: 67 TKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIVVPIVV 126 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G A + +++F P+LA+ + S + L+D PD EI + A++ ++ Sbjct: 127 YHG-ADRFNAPRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVHFMKYI 185 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQ-----LVAMQNYMLQRGHTEQADLFYGVLRD--- 233 Q D++ L + + E + + ++ Y + + + +L + Sbjct: 186 HNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQLLDENLS 245 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF------------------------- 268 E M T+A + ++G KG +GR E E Sbjct: 246 IEDRDRIMGTIAAQYIDEGKAKGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEGI 305 Query: 269 ---------------AQRLLSKGMSREDVAEMANLPLAEIDKV 296 A+ LL G S E +AE L E+ + Sbjct: 306 EIGETKGRAEAAQGLARNLLKAGFSVEFIAENTGLSNEEVVNL 348 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 79/294 (26%), Positives = 144/294 (48%), Gaps = 33/294 (11%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD +FK+ + AR+FLE +LPV + +LN++ +E SF+ E L+ +DV+YSV Sbjct: 41 HDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRLSDVVYSVS 100 Query: 69 MQ--------------GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH--------- 105 ++ + Y++V+IEHQS D +AFR+ +Y + RH Sbjct: 101 LKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHKDANNNKSS 160 Query: 106 -LEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDD 164 + +KLPL+ PI+ Y + PY ++++F + A+ + + LVD+ DD Sbjct: 161 VTKEKDNKLPLICPIVVYANDK-PYNAPRSFWELFEDSKTAKDMMGDEYLLVDLQKQSDD 219 Query: 165 EIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ-----LVAMQNYM-LQR 218 EI + + + ++E + KHI+ RD++ L + L+ + + + + Y + Sbjct: 220 EIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWLLWYSDAKV 279 Query: 219 GHTEQADLFYGVLRDR--ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQ 270 +Q +L + + E E M T+A + ++G++KG+ QG Q Q Sbjct: 280 SEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKGMVQGMQIGEARGMQ 333 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 77/320 (24%), Positives = 132/320 (41%), Gaps = 30/320 (9%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 HD++ K + A D LP + E DL+ L L GSF+ + L+ TD+L Sbjct: 2 PHDSHDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLL 61 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--KLPLVVPILFY 122 + + G P +L++++EHQS ++ M R++RY + RHL LP ++P++ + Sbjct: 62 FRAPLDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLH 121 Query: 123 QGEATPYPLSMCWFDMFYSPELAR-----RVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 E + +F + AR + F L D++ PD+ ++ A +L Sbjct: 122 HSEQG-WTAPTSLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKL 180 Query: 178 LQKHI----RQRDLMLLLEQLVTLIDEGYTSG---SQLVAMQNYMLQRGHTEQADLFYGV 230 + +DL+ LL +I E T+ L A+ Y LQ T D Sbjct: 181 ALWALKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADT-DPDALMRF 239 Query: 231 LRDR--ETGGESMMTLAQWFEE------------KGIEKGIQQGRQEVSQEFAQRLLSKG 276 L D + E+ MT A+ + +G +G +GR E E L Sbjct: 240 LIDSAGDPAKEAFMTGAEKLTQAVREQSLRQGRVEGRVEGRVEGRVEGRVEGRTEALRTV 299 Query: 277 MSREDVAEMANLPLAEIDKV 296 +S++ LP +++ Sbjct: 300 LSKQLRQRFGTLPSEVTERL 319 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 213 bits (543), Expect = 6e-54, Method: Composition-based stats. Identities = 67/294 (22%), Positives = 124/294 (42%), Gaps = 21/294 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 ++ ++ PHDA+F+ H A L LP EL L D + L + + SL T Sbjct: 12 ESVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRT 71 Query: 62 DVLYSVQMQG---NPG--YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D+L+S ++G G +++ IEHQS+ D M R++ Y + RH + LP V Sbjct: 72 DLLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHGGALPPV 131 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSP-----ELARRVYNSPFPLVDITITPDDEIM---Q 168 ++ + ++F P +A + P + D+ D E+ Sbjct: 132 FCVVLSHAAKG-WTGPRSLVELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARHA 190 Query: 169 HRRIAILELLQKHIRQRD-----LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQ 223 H A+ L + R + L+ +Q++ L+D + L + Y+ G Sbjct: 191 HPLPALTLWLLRDARSPERLVHRLLDWRDQIIALLDYDHGERD-LAQLLRYVALVGSEMD 249 Query: 224 ADLFYGVLRDRETGGESM-MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 + F+ + E+M MT+A+ + +++G +QG++E +E +G Sbjct: 250 FEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREGRLEGQREG 303 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. Identities = 83/308 (26%), Positives = 136/308 (44%), Gaps = 24/308 (7%) Query: 13 FKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGN 72 +K H E RD L + E D +TL SGS+I E L+ DV++ V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 73 PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD----HDKLPLVVPILFYQGEATP 128 Y+++++E QS D+ MA R+M Y + + + KLP V+PI+ Y GE Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 129 YPLSMCWFDMFYSP-ELARRVYNSPFPLVDIT-ITPDDEIMQHRR--IAILELLQKHIRQ 184 + P L R N + L+D + D E H R A L L+ + + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 185 RDLMLLLEQLVTLIDEGYT---SGSQLVAMQNYML-QRGHTEQADLFYGVLRDRETGGES 240 +D++ +L LV + + +V ++ +L R + F + E Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 241 MMTLAQW---FEEKGIEKGIQQGRQEVSQEFAQRLLSKG---------MSREDVAEMANL 288 + QW +EEKG ++G Q+GR+E QE QR + K +S E +AE L Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 289 PLAEIDKV 296 +AE++ + Sbjct: 313 TVAEVEGL 320 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 88/207 (42%), Positives = 124/207 (59%), Gaps = 7/207 (3%) Query: 90 MAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVY 149 M FRM+RYS+AAM RHLE H LPLV+P+LFY GE +PYP SM W D F P LA ++Y Sbjct: 1 MPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKIY 59 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV 209 PFPLVDIT+ D+EIM HRR+A L LL KHIR RD+M LL++L ++ E S Q+ Sbjct: 60 TKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMVE--ISDEQVR 117 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 + +Y++ G + + + + +MT+A+ E +KG Q+G E + A Sbjct: 118 VLIHYIVNAGDSVSPEFMRALAERLPQHEDKLMTIAERLE----QKGRQEGALEKALAIA 173 Query: 270 QRLLSKGMSREDVAEMANLPLAEIDKV 296 +L GM+ E + + L AE+ + Sbjct: 174 CQLQKMGMTPEQIKQATGLSEAELKNI 200 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 84/334 (25%), Positives = 143/334 (42%), Gaps = 39/334 (11%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M A PHD FK+ E DFL P +RE D TL E +F +E L H Sbjct: 1 MAAQPDNPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHF 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D+++SVQ G P L +++EH+S ++ F++ RY + ++ L V+P+L Sbjct: 61 ADLVFSVQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQK-QPLTPVLPVL 119 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI----MQHRRIAILE 176 Y G S+ + L + + L+D++ D+ + + R+ + Sbjct: 120 VYHGNRRWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAI- 178 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDE--GYTSGSQLVAM-QNYMLQRGHTEQADLFYGVLRD 233 LLQ R+R+L LL+ ++ T+G + V+ Y+ + + +LF R Sbjct: 179 LLQNSRRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVELFGIFSRI 238 Query: 234 RETGGESMMTLAQWFEEKGI-----------EKGIQQGRQEVSQ---------------- 266 S MT+A+ ++G E+ IQQGR+ + Sbjct: 239 SSKIESSTMTVAEELIQEGRELERRQTRMVAEELIQQGRELERRQAMMAAEELLKQQERQ 298 Query: 267 ---EFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +F + +L+ + +A A LPLAE+D +I Sbjct: 299 NKIKFIKAMLNLNLDAATIATAAELPLAEVDAII 332 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 79/316 (25%), Positives = 145/316 (45%), Gaps = 18/316 (5%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M+ PHD FK E ARDFL+ +LP E E+ DL+ L E+ S ++E+L+ Sbjct: 1 MNELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESL 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +D+LY +++G GY+++++EH+S + K+ F+++RY + + K+P+++P++ Sbjct: 61 SDMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYDPKTKKVPIIIPMV 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G + + +M E + P D I + +RI L ++ Sbjct: 121 IYHGREI-WNVETNLLNMVQGIEDLPNELKTYLPTYRY-EICDFSIKRKKRIIGLTAMKV 178 Query: 181 HIR--QRDLMLLLEQLVTLIDEGYTSGSQLVAM---------QNYMLQRGHTEQADLFYG 229 I + + E+ + + QL Y+L + Sbjct: 179 AIEAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEILK 238 Query: 230 VLRDRETG-GESMMTLAQWFEEKGIEKGI----QQGRQEVSQEFAQRLLSKGMSREDVAE 284 V ++ G GE +MT+A+ +G+EKG ++G+ E +EFA R+LSK + E Sbjct: 239 VQKEIMPGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEGEREFAIRILSKRFGNQLTEE 298 Query: 285 MANLPLAEIDKVINLI 300 + + +K I+ I Sbjct: 299 IKDRIREADEKTIDYI 314 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 206 bits (524), Expect = 9e-52, Method: Composition-based stats. Identities = 77/294 (26%), Positives = 140/294 (47%), Gaps = 10/294 (3%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD K L + TA L LP E+ E + L GSFI+E+L+ H TD LY Sbjct: 6 HPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLTDRLYR 65 Query: 67 VQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPILFYQ 123 V+ G L+V+IEH+S PD ++ +++++Y + A+ + + ++LP +VP +FY Sbjct: 66 VRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVPFVFYH 125 Query: 124 GEATPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 G A + + + + + E R + N F ++D+ D ++ + + L K+ Sbjct: 126 GAAA-WKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLAAKYA 184 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD-RETGGESM 241 + D L +++L+ + + Y+++ + + ++R R E+M Sbjct: 185 TRDDRQLEVKELLIQTLVSVAD-EEFRFLMRYVVETYRSYDEPMVREIIRRVRPEEEETM 243 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 M+ F + + KG Q+GRQE QE Q + G R E A + L ++ + Sbjct: 244 MS---MFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEEAAYMLLKQMRR 294 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 75/310 (24%), Positives = 149/310 (48%), Gaps = 22/310 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HD + + + +++F E+HLP ++ L L +E SF+++ LK D+L+S Sbjct: 5 PKHDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILFS 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA-DHDKLPLVVPILFYQGE 125 + GYL++++EHQS P+ KMA R+ RY H ++ K P + P++FY G Sbjct: 65 AKFGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFYNG- 123 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 Y +++F + EL + ++ + L+++ PD+++ + IL+ KHI +R Sbjct: 124 VQKYNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHER 183 Query: 186 DLMLLLEQLVTLI---DEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR--ETGGES 240 DL+ E++ L+ + + + Y L R + +L+ + E+ Sbjct: 184 DLLKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKKREN 243 Query: 241 MM-TLAQWFEEKGIE-------KGIQQGR-------QEVSQEFAQRLLSKGMSREDVAEM 285 +M ++A + ++G E K +Q+ + QE A+ ++ +G S E V ++ Sbjct: 244 VMKSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESVIKI 303 Query: 286 ANLPLAEIDK 295 L +++K Sbjct: 304 TKLSKEDLEK 313 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 89/200 (44%), Positives = 124/200 (62%), Gaps = 9/200 (4%) Query: 95 MRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP 154 MRY+IAAM HL+A + LP+VVP+LFY G +PYP S+CW D F P LAR++Y S FP Sbjct: 1 MRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAFP 60 Query: 155 LVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 L+D+T+ PDDEIM HRR+A+LEL+QKHIRQRDLM L+EQ+ L+ GY +G Q+ + NY Sbjct: 61 LIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFNY 120 Query: 215 MLQRGHTEQA-DLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 +LQ G + D GV + S+MT+A+ Q+G Q + A+ +L Sbjct: 121 ILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLR--------QEGEQSKALHIAKIML 172 Query: 274 SKGMSREDVAEMANLPLAEI 293 G+ D+ + E+ Sbjct: 173 ESGVPLADIMRFTGVSEEEL 192 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 200 bits (509), Expect = 4e-50, Method: Composition-based stats. Identities = 75/320 (23%), Positives = 137/320 (42%), Gaps = 30/320 (9%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +T HD +K+ H E + +E P E+ L D NTL SG++I + DV+ Sbjct: 2 ATNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVV 61 Query: 65 YSVQMQ----GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-----LPL 115 +SV++ +L++++E QSK D M R+M Y +A + HL + LP Sbjct: 62 WSVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHY-VACFYDHLLKTRETTVRQGLPP 120 Query: 116 VVPILFYQGEATPYPLSMCWFDMFY--SPELAR-RVYNSPFPLVDITITPDDEIMQHRRI 172 + P++ Y G + + +DM PE R + + L+D D+E++ R Sbjct: 121 IFPMVLYNG-SQRWSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTP 179 Query: 173 --AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA------ 224 I + L ++++V ++ + ++ + Sbjct: 180 LSGIFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARLNL 239 Query: 225 DLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV-------SQEFAQRLLSKG- 276 D ++ DR E++ L + +G ++G Q+GRQE ++ + LLS G Sbjct: 240 DRMSSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGV 299 Query: 277 MSREDVAEMANLPLAEIDKV 296 +S + +A L + EIDK+ Sbjct: 300 LSNDQIAVATGLSVDEIDKL 319 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 199 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 72/310 (23%), Positives = 148/310 (47%), Gaps = 19/310 (6%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MD + HDA + + + + A D+ +P +++L D +TL +++ + L+ Sbjct: 1 MDKHTP-KHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSI 59 Query: 61 TDVLYSVQ--MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 +D++Y Q + +++EH+S DK ++ Y + + + + + + L++P Sbjct: 60 SDIVYVCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQI-GNKESPSLIIP 118 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELA--RRVYNSPFPLVDITITPDDEI--MQHRRIAI 174 IL Y G A + D+F +PE A + + + + D+ D+EI + ++ +A Sbjct: 119 ILLYHG-ADRWEYKTVA-DLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAA 176 Query: 175 LELLQKHIRQRD-LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 L K+ +D L LL ++TL E + ++ Y L G+ + F +++ Sbjct: 177 SLLAMKYSALKDQLNTLLPTILTLASEVD--RNLHKSLLFYTL-VGNPLTEEQFLNLIKS 233 Query: 234 RE-TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF---AQRLLSKG-MSREDVAEMANL 288 E++M + + FEEKG +KGI++GR E Q+ + L+ + ++ E +A N+ Sbjct: 234 VPNQKKEAIMDIFEIFEEKGWKKGIEEGRAEAEQKIETAVRNLIKQSVLTDEQIASAMNV 293 Query: 289 PLAEIDKVIN 298 + +V N Sbjct: 294 TTDYVAEVRN 303 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 74/317 (23%), Positives = 120/317 (37%), Gaps = 30/317 (9%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK L + L+ LP + D +L + E L D+ +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 +H+++EH+S PD + F++ RY R L+ PL +PILFY G Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPRPL-LPILFYHGVVPW 125 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHR---RIAILELLQKHIRQR 185 S + EL + PL+D+ D+EI H + L KHI Sbjct: 126 TLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHHVDDLEAVLALLSLKHIFDG 185 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE-TGGESMMTL 244 + L+ L+ I E + L NYM + ++ G + + Sbjct: 186 -VETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREVGMAQDIV 244 Query: 245 AQWFEE-----------------------KGIEKGIQQGRQEVSQEFAQRLLSKG-MSRE 280 W +E KG+EKG QQG + ++ + LL K S E Sbjct: 245 ETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGFQQGARLKEEQVIRTLLKKKTFSFE 304 Query: 281 DVAEMANLPLAEIDKVI 297 ++A + + L+ + +V Sbjct: 305 EIASLVGVELSRVREVA 321 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 197 bits (501), Expect = 4e-49, Method: Composition-based stats. Identities = 67/280 (23%), Positives = 117/280 (41%), Gaps = 13/280 (4%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FK+ E A DFL P E+ + DL+TL ++ S+I+E LK H +D++Y+ Sbjct: 5 NPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDIVYT 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + ++ EH+S ++M+Y + + + +P V+P++ Y G+ Sbjct: 65 CFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQAQRLIP-VIPVILYHGKE 123 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM----QHRRIAILELLQKHI 182 + R + + L DI+ ++EI + + I LL ++I Sbjct: 124 AWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLLMRNI 183 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQRGHTEQADLFYGVLRDRET 236 D L ++L + G + L + Y+ + + + E Sbjct: 184 F--DEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAEKRVIDTLKEISEE 241 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 GG+ MT+A EKG G +GR E E A KG Sbjct: 242 GGKLSMTIAAKLIEKGKIAGRVEGRAEGRAEGAIEGERKG 281 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 196 bits (499), Expect = 7e-49, Method: Composition-based stats. Identities = 66/330 (20%), Positives = 138/330 (41%), Gaps = 38/330 (11%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 P P+D ++Q L + L+ + E D + L L + S++ + DV Sbjct: 10 PPHHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADV 69 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKLPL 115 +Y ++ + +V++E QS D M FR++ Y + E+ H +LP Sbjct: 70 VYRLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPP 129 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIMQHRR-IA 173 ++P + Y G A + ++ + +M S + + + + + L D+ ++E+++ IA Sbjct: 130 IIPAVLYNG-AGSWTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIA 188 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVL 231 + LL + ++ DL L++L ++ + ++N + R + ++ G+L Sbjct: 189 GIFLLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGIL 248 Query: 232 RDRETGGESMM------TLAQWFEE---KGIEKGIQ----------------QGRQEVSQ 266 M TL + + KG+++G Q +G+ E + Sbjct: 249 NASNPWEVERMIYNLELTLEEMQRQALLKGLKEGEQKGKLEGKLEGKLEGKLEGKLEGKR 308 Query: 267 EFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 E A+ LL + E + + L L EI+ + Sbjct: 309 EVARNLLLLNVDIETIIKATGLALEEINAL 338 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 195 bits (496), Expect = 1e-48, Method: Composition-based stats. Identities = 58/193 (30%), Positives = 108/193 (55%), Gaps = 3/193 (1%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD F++ L + AR+F E +LP E++ L TL LE+ SFI+ +LK TDVLYS + Sbjct: 56 HDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSAR 115 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEA 126 + Y++++ EHQS D MAFR+ +Y + +HL D K P + P++ Y + Sbjct: 116 INNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSNDH 174 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y + +D+F + EL + +++ + L+ + DD++ ++ +A L++L K+I + + Sbjct: 175 KKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKPN 234 Query: 187 LMLLLEQLVTLID 199 + +++ + Sbjct: 235 VFDKWQEISGCLA 247 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 75/303 (24%), Positives = 132/303 (43%), Gaps = 23/303 (7%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M PH+A FK F E + F++ H+P E+ L DL+TL ++ F+ E + + Sbjct: 1 MSFEIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYY 60 Query: 61 TDVLYSVQMQGN--PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLV 116 DV+ +VQ++G+ +++++EH+S P+ +++ Y + LP++ Sbjct: 61 ADVMVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVI 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP--LVDITITPDDEIMQHRRIAI 174 +P++ Y G+ + S + D+F P R + F + DI+ DDE + I Sbjct: 121 IPVVIYHGKG-RWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEI 179 Query: 175 LELLQKHIRQRDLMLLLEQLVTLID---EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL 231 LL K+I +L L+++ L++ + L A+ Y+ +G + Sbjct: 180 FHLLFKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPI-SLERLGEYT 238 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLA 291 R G E+M T AQ RQE EF Q + RE A++ Sbjct: 239 RRLPGGDEAMQTAAQQI------------RQEAYNEFIQEQEKMLVEREKHAKLEATQEN 286 Query: 292 EID 294 ID Sbjct: 287 LID 289 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 67/314 (21%), Positives = 126/314 (40%), Gaps = 26/314 (8%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 HD +K H ET +FL E L + + L L S+I + +D+LY Sbjct: 7 HHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEEESDILY 66 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKLPLVV 117 + +V++E QSK D +M R++ Y + + KLP +V Sbjct: 67 KANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNFKLPSIV 126 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDITITPDDEIMQ-HRRIAIL 175 PI+ Y G+ + + + +M EL + + + L DI D E++ I+ + Sbjct: 127 PIVLYNGK-NKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNISNMISAV 185 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD----LFYGVL 231 LL + I +++LM + S Q + ++ D VL Sbjct: 186 FLLDQEIDEQELMR--RLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDNLQGEIDDVL 243 Query: 232 RDRETG---------GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 G+++ + E+G++KGI+QG ++ ++ A++ + GM E + Sbjct: 244 EKSNQEEVDFMVSNLGKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAIEMGMDNEII 303 Query: 283 AEMANLPLAEIDKV 296 + L +I+ + Sbjct: 304 MNLTGLSEEQINTI 317 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 193 bits (491), Expect = 6e-48, Method: Composition-based stats. Identities = 71/316 (22%), Positives = 135/316 (42%), Gaps = 22/316 (6%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHST 61 A + TPHD FK+ E L LP ++ D ++L G + E L + Sbjct: 2 AKNLTPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTRA 61 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+++SV G L V++EH+S PD ++ F++++ + ++L + LP ++PILF Sbjct: 62 DLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREGREPLP-ILPILF 120 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI--MQHRRIAILELLQ 179 Y G+ + M E+AR + + +D+ + D I +Q+ L Sbjct: 121 YHGQGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAALLSM 180 Query: 180 KHIRQ---RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRG-HTEQADLFYGVLRDRE 235 KH+ + R LL+E ++ +Y + +L+ + E Sbjct: 181 KHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMAAITE 240 Query: 236 TGGESMMT--LAQWFEEKGIEKGIQQGR------------QEVSQEFAQRLLSKGMSRED 281 G T L + + E+GI+KG+Q G ++ + L + + Sbjct: 241 EAGMETTTERLKKIWIEEGIQKGVQLGIQQGVQQGVQQGVRQNQIKTILSLSKHNFTPQQ 300 Query: 282 VAEMANLPLAEIDKVI 297 +A++ +L L E+++V+ Sbjct: 301 IADLLSLELPEVERVL 316 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 70/323 (21%), Positives = 128/323 (39%), Gaps = 38/323 (11%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +D FK+ E +FL ++ E D +L SFI++ DV+ Sbjct: 6 PHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEKEADVI 65 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPILFYQ 123 Y +++ Y +V+IE QS D+ M R+ Y RH+E D+ LP +VPI+ Y Sbjct: 66 YRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVPIVLYN 125 Query: 124 GEATPYPLSMCW--FDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 G + + + FD+F + LVD+ D+++ + + L + Sbjct: 126 GRSGWNIPTQIFKGFDIFKDD-------MFNYILVDVNRLDDEKLKSRLDLLSIILYLEK 178 Query: 182 IRQ--RDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRDRETG 237 R+ + + L ++ I + ++ + + + E +L+ E Sbjct: 179 SRRNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDELLKKIEAE 238 Query: 238 G----------------ESMMTLAQWFEEKGIEKGIQQGRQEV--------SQEFAQRLL 273 G E + +EKG E+GIQ+G +E +E +RL+ Sbjct: 239 GVEDVGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKEEEIVRRLI 298 Query: 274 SKGMSREDVAEMANLPLAEIDKV 296 KG + +AE + + I K+ Sbjct: 299 QKGFNDNFIAEATGVEIERIKKI 321 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 71/326 (21%), Positives = 142/326 (43%), Gaps = 44/326 (13%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK E RDFL LP E+ + D ++L + I S + D++ + Sbjct: 8 HDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHMDLVVECR 67 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 + P +++IEH+S PD ++ +M+RY +A R+ D+ L V+P++F+QG P Sbjct: 68 ISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRN-RQDNKPLVPVLPLVFHQG-GRP 125 Query: 129 YPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPD---DEIMQHRRIAILELLQKHIRQ 184 + L + + + F PE L + L D++ E H ++ L K+ Sbjct: 126 WTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLTLLKYAFS 185 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM-T 243 + +L L G + L + NY ++ + + + R GGE +M + Sbjct: 186 GSVEDVLRALKET--GGSFDETFLFGVLNYAIRAFEVKDPVVVDAI--SRSFGGEKIMPS 241 Query: 244 LAQWFEEKGIEKGI--------------------------------QQGRQEVSQEFAQR 271 + + E+G+++G+ ++G++E ++ ++ Sbjct: 242 IIDEWVEEGLKEGLKKGREEGREEGREEGKEEGRKEGREEGKEEGRKEGQKEGQRKTIEK 301 Query: 272 LLSKG-MSREDVAEMANLPLAEIDKV 296 LL+KG +S ++A ++ L ++++ Sbjct: 302 LLAKGVLSVSEIASALDVDLQWVEQI 327 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 83/300 (27%), Positives = 142/300 (47%), Gaps = 22/300 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGS-------FIEESLKGH 59 HD+VFK + + + A FL +LP EL EL D T+ LES + + + Sbjct: 3 NVHDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQKE 62 Query: 60 STDVLYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--DKLPLV 116 +D+ + + + G G + V IE Q+ D + R Y + + +++ LPLV Sbjct: 63 QSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLPLV 122 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 V I++Y + P+ S+ D F + ELA++ Y +D+ D+EI++H IA E Sbjct: 123 VSIIYYANQK-PFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAGYE 180 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 L+ K IR++++ L+ + I+ Q+ + YM Q E D ++ + Sbjct: 181 LILKAIREKNIDGKLDIAINQIEAYDHIARQV--LIRYMSQYSDMETKDFHDKLIYSKPD 238 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +MT+A+ +E+KGI+KGIQ A+ L G+S E V + L + K+ Sbjct: 239 LRGDVMTVAEQWEQKGIQKGIQT--------TARNFLLMGLSAEQVVKGTGLDQDTVLKL 290 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 186 bits (473), Expect = 6e-46, Method: Composition-based stats. Identities = 65/306 (21%), Positives = 122/306 (39%), Gaps = 17/306 (5%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHSTDV 63 STTPHD+ FK + L + L +L++L G I E L + +D+ Sbjct: 21 STTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARSTRSDL 80 Query: 64 -----LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 + + ++ G + + EH+S + ++ A + R L P V+P Sbjct: 81 SASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREGRKPCP-VIP 139 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ---HRRIAIL 175 ++ Y G A + + SPELA R+ + L+D++ D+ + + H + Sbjct: 140 VVLYHGRAPWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAHPEPLVS 199 Query: 176 ELLQKHIRQRDLMLL---LEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 + KHI + +L + + TL + +Y+ + + Sbjct: 200 LSVMKHIFEPPESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQEIRTIFT 259 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 E M T+ +E+GI++GIQ GR E Q +S + +A + N+ L+ Sbjct: 260 T-FLAEEKMTTVLDLIKEEGIQEGIQMGRDEAITRLLQH---SSLSPQQIASILNVDLSR 315 Query: 293 IDKVIN 298 + + N Sbjct: 316 VLSLAN 321 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 186 bits (473), Expect = 6e-46, Method: Composition-based stats. Identities = 64/295 (21%), Positives = 137/295 (46%), Gaps = 11/295 (3%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + ++ HD F+ L ARDF+ HLP E+ +L+T+ + S S++ ++LK TD Sbjct: 8 SDTSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMTD 67 Query: 63 VLYSVQ-MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 ++ +++ + G P +++++EH+S D ++ +Y ++ LP++VP++F Sbjct: 68 IVITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQKKTGTLPIIVPLVF 127 Query: 122 YQGEATPYPLSMCWFDMFYSPELA--RRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 Y G A + S+ + D+F P + + L ++ + ++ + + L+ Sbjct: 128 YHGTA-RWNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLVL 186 Query: 180 KHIRQRDLMLLLEQLVTLIDEG---YTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 ++I + + + + L+ +G + + Y+L E + ++ Sbjct: 187 EYIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATD-ETPEEAEEKVKHLPK 245 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLA 291 GGE++ T A+ EE+G K I++ + E L + + D+A A PL Sbjct: 246 GGETVRTTAEVLEERGYNKAIKE---KPVWEKQAELKNAHETLIDIATEAYGPLP 297 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. Identities = 59/283 (20%), Positives = 120/283 (42%), Gaps = 18/283 (6%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD+ FKQ + L+I + + +++ + + D+L+S Sbjct: 4 QPHDSFFKQIFSDPRRVKTLLDIFAKDVAKSI---HSITPVNTEKFSSKSQKFMLDLLFS 60 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ Y+ +V+EH+S DK++ ++ Y+ A ++ + P ++ I+FY G+ Sbjct: 61 CKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKEK-EYYPPIINIVFYHGKG 119 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI------AILELLQK 180 + + L + V + L+D+ DDE++ I A++ + Sbjct: 120 EWNIPTS--LPVLEDQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKHV 177 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQ--LVAMQNYM-LQRGHTEQADLFYGVLRDRETG 237 H + + LV + L NY+ +G T++A+ L++ G Sbjct: 178 HENIEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAE---NALKELIGG 234 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRE 280 + MTL + + +G+EKG Q+G QE ++ Q L K + Sbjct: 235 DKKAMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGLIKAKKDD 277 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 71/326 (21%), Positives = 126/326 (38%), Gaps = 40/326 (12%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK L + L+ LP L L +L + +SL D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 +HV++EH+S PD F+++ Y R + +P V P+LFY G P Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLRDKKESRSPIPFV-PVLFYHGLR-P 125 Query: 129 YPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIMQHRR---IAILELLQKHIRQ 184 + L +M P EL V + P++D+ D +I + R + LL KHI + Sbjct: 126 WNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLLLKHIFE 185 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 L + + S +++ +Y++ H E ++ + Sbjct: 186 GA-RGSLRAFLQETNGKNLSRDIIISGMSYVIGVHHLESTAELSRLVNTILKEEGMSQNV 244 Query: 245 AQWFEEKGIEKGIQQGRQEVSQ--------------------------------EFAQRL 272 + + E+ I++G+Q+G Q+ Q + ++L Sbjct: 245 VELWMEELIQQGVQKGIQQGVQLGIEQGIQQGIQQGVQQGVRQGVQQGIRITQDDTIRKL 304 Query: 273 LSKG-MSREDVAEMANLPLAEIDKVI 297 L+KG +S E +A +LP I +V+ Sbjct: 305 LNKGQLSVEQIAFFLDLPTDRIREVL 330 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 184 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 71/317 (22%), Positives = 129/317 (40%), Gaps = 27/317 (8%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HD +K E RD + +P + D +TL GS++ E + D+++ Sbjct: 3 NTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVWR 62 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH----DKLPLVVPILFY 122 V++ G YL+++IE QS DK MA RMM Y ++ +LP V+PI+ Y Sbjct: 63 VKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIVLY 122 Query: 123 QGEATPYPLSMCWFDMFYSPELARRV-YNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 G ++ + + P L + + L+D D E+ + + +H Sbjct: 123 NGSQRWSAVTDVFELIPPVPGLVEQFKPRLKYLLIDENAWSDSELASLKNLVAAVFRIEH 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + L++L+DE L M ++ +A+ + VL + E Sbjct: 183 PASP---AAIGDLLSLLDEWLAERPDLRRMFALWIRATLMRKAE-YRIVLPRIDDLQELN 238 Query: 242 MTLAQWFEE-------KGIEKGIQQGRQEVSQEF--------AQRLLSKGM---SREDVA 283 + LA+ EE +G +G +G+ E E Q+LL K + +A Sbjct: 239 VMLAERLEEWAQAYKAEGKAEGKAEGKAEGKAEGKAEGEALALQKLLKKRFGAVPPDVLA 298 Query: 284 EMANLPLAEIDKVINLI 300 +++ L +ID ++ + Sbjct: 299 QISRASLEQIDAWLDQV 315 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 179 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 69/292 (23%), Positives = 125/292 (42%), Gaps = 21/292 (7%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HD FK F E RDF++ +LP E+++ DL + ++ ++ E K Sbjct: 1 MSKKIPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFY 60 Query: 61 TDVLYSVQMQGN--PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLV 116 +DV+ V L+ + EH+SKP + + + Y + R L + LP++ Sbjct: 61 SDVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPII 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARR--VYNSPFPLVDITITPDDEIMQHRRIAI 174 VP++ Y G + + S+ + D+F P + + L DI + + I Sbjct: 121 VPVVIYNGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEI 179 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSG---SQLVAMQNYMLQRGHTEQADLFYGVL 231 LL K+I +L + ++ L+++ + L + Y++ G + L Sbjct: 180 FHLLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLLEH-- 237 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA-----QRLLSKGMS 278 R +GGE M+ LA IE+ ++Q R+ Q+ A Q +L K + Sbjct: 238 AKRFSGGEEMIGLAARE----IEERVEQTRKPYWQKQAKVENSQEMLIKSLK 285 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 132/286 (46%), Gaps = 19/286 (6%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FKQ + + L+I P EL + DL ++ L + + + ++LY Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNLLYE 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ +L ++ EH+S DK + +++ Y+ ++++ P ++ I+ Y G+ Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYEEYPPIINIVLYHGKR 121 Query: 127 TPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRIAILE----LLQKH 181 + + E+ R N + L+D++ D+E++ + L KH Sbjct: 122 KWNIPATL---PKTNSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLTMKH 178 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 I + + + ++ + E Y G + + +Y+ + ++ + VL++ G + M Sbjct: 179 IFED--LRKYKHILKKVFEHYQDGCVFI-ILDYISVVNNPQEVE---NVLKEILGGEKDM 232 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQR--LLSKGMSREDVAEM 285 MTL + ++ +G+++G+QQG E ++ + L G E++ ++ Sbjct: 233 MTLTEKWKMEGLQQGLQQGMIEGQKKAILKSIQLKFGRVPENIEKL 278 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 64/268 (23%), Positives = 124/268 (46%), Gaps = 17/268 (6%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + S PH+ FKQ +++ +DFL I L +L + L++L L + K H Sbjct: 3 NKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKHFL 61 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+LY ++ Y+ ++ EH+S DKK+ ++M+Y+ L+ D P ++ I+F Sbjct: 62 DLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKEK-DYYPPIINIVF 120 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR----IAILEL 177 Y G+A + D+ EL + + + L+D+ D+ + ++ + + + L Sbjct: 121 YHGQA-KWNFPTTIPDI-EDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLIMEML 178 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ--LVAMQNYMLQRGHTEQADLFYGVLRDRE 235 + KHI R LE++ TL+ + S+ V + NY++ + + V ++ Sbjct: 179 IMKHIHDR-----LERIKTLLKDVIDECSEDCFVIILNYLVLV--KKDYEKVKEVFKEII 231 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQE 263 G E MM + +G +G + +E Sbjct: 232 GGEEKMMLFTDKLKMEGKMEGKIEILRE 259 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 71/298 (23%), Positives = 130/298 (43%), Gaps = 10/298 (3%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +T HD+ K FL A L+ LP E+ + D N ++ E S++ +SL+G+ +D++ Sbjct: 3 TTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDLV 62 Query: 65 YSVQMQGNP--GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE-ADHDKLPLVVPILF 121 SV + + ++EH+S K + +RY + ++ + +LP+++PIL Sbjct: 63 VSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPILI 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVD-ITITPDDEIMQHRRIAILELLQK 180 E P + S + V + F L D + P+D A+ L + Sbjct: 123 AHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTLW-R 181 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQL---VAMQNYM-LQRGHTEQADLFYGVLRDRET 236 + R + M +++ LI + L + +Y+ + R E D+ + + Sbjct: 182 YSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEIDE 241 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEV-SQEFAQRLLSKGMSREDVAEMANLPLAEI 293 G E M T+A+ F +G E+ Q+ QE E L + + D+A A PL +I Sbjct: 242 GEEYMGTIAEMFRREGDERTEQRFLQEKPIWEKQSELKATQETLIDIATEAYGPLPDI 299 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 177 bits (448), Expect = 5e-43, Method: Composition-based stats. Identities = 62/317 (19%), Positives = 132/317 (41%), Gaps = 26/317 (8%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + HD+ FK H + ++ + + +++ L F++E+ Sbjct: 3 SSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQKRA 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DV+ +++ Y +++IE+QS + M R++RY I + + KLP ++PI+ Sbjct: 63 DVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKKLPAIIPIVT 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G + +S F + + +V+I+ ++Q + ++ Sbjct: 123 YNGLEKDWDVSQEIISEFD----IFKDDIFKYAVVNISKLDAKTLLQEEEDILSPVVFYL 178 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQRGHTEQADLFYGVLRDRE 235 + RD L + + I+ T SQ L+ N + R E + + + + E Sbjct: 179 EQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDELAQRVE 238 Query: 236 TGGESMM-----TLAQWFEE-----------KGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 GG M +A+ +E +G +G +G+ E E A++++ +G S Sbjct: 239 QGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRGFSD 298 Query: 280 EDVAEMANLPLAEIDKV 296 ED+AE+ L + ++ ++ Sbjct: 299 EDIAELTELDIEKVKEL 315 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 177 bits (448), Expect = 5e-43, Method: Composition-based stats. Identities = 58/325 (17%), Positives = 125/325 (38%), Gaps = 38/325 (11%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 P +D FK+ E +FL+ + + DL +L SF+++ D Sbjct: 4 KPPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEKEAD 63 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPILF 121 V+Y +++ Y +V++E QS DK M R+ Y RH+E D L +VPI+ Sbjct: 64 VIYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVPIVL 123 Query: 122 YQGEATPYPLSMCW--FDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 Y G + ++ + +++F + LVD+ D+ + + + L Sbjct: 124 YNGRSNWNVPTLIFKGWEIFKDD-------MFNYFLVDVNNIDDETLKNRLDLLSVILYL 176 Query: 180 KHIRQ--RDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRDRE 235 R+ ++ + L+++ I T ++ + + + E +L+ E Sbjct: 177 DRSRKTAKEFIEKLKEVTEYISCLPTEQVKVFAMWLLRVIRPQMMEEVQGEIDELLKRIE 236 Query: 236 TGG-----------ESMM-------------TLAQWFEEKGIEKGIQQGRQEVSQEFAQR 271 G + +M + + +G +G +G E + A+ Sbjct: 237 QEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRIARN 296 Query: 272 LLSKGMSREDVAEMANLPLAEIDKV 296 ++ G ++++ L + +I ++ Sbjct: 297 MILAGAEDSFISKVTGLDIEKIKEL 321 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 176 bits (446), Expect = 9e-43, Method: Composition-based stats. Identities = 65/239 (27%), Positives = 120/239 (50%), Gaps = 19/239 (7%) Query: 75 YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMC 134 Y++ +IE+QS +K MAF M+ Y++A M +HL + +LP++V I Y G+ +PYP S Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEGYQELPIIVNICIYTGKKSPYPYSQD 95 Query: 135 WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQL 194 D F ELAR F L+D+++ +E+++ +E L + R+RD + + Sbjct: 96 ICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINNN 155 Query: 195 VTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGGESMMTLAQWFEE--- 250 LI E ++ +++ Y+L AD L ++ E ++T AQ + Sbjct: 156 QVLIWELVSNYG--LSIVIYILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVDI 213 Query: 251 -----KGIEKGIQQGRQEV--------SQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 KGI++GI+QG++E +Q + +L +G+ + ++ + I+K+ Sbjct: 214 QTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSMLKEGLEISLIQKVTGISREAIEKL 272 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 67/312 (21%), Positives = 131/312 (41%), Gaps = 28/312 (8%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +K+ H E D + L +L CDL+TL +GS++ + L+ D+++ + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD---HDKLPLVVPILFYQGEA 126 L+++IE QSKPD M R+M Y + + ++P ++PI+ Y GE Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGE- 123 Query: 127 TPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 P+ + + P+ ++R + + P+ L+D +M+ R +A + Sbjct: 124 IPWKVPHDIRETIQMPKPVSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSSGP 183 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQN-YMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 + L +L ++ + L +M+ + L +T + D + + G + Sbjct: 184 --LELF-ELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAERV 240 Query: 245 AQWFEE-------------------KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 +W + +G +G +G+ E +R+ KGMS ++A + Sbjct: 241 NKWIAQYKAEGRKEGKEEGKKEGLLEGRVEGKLEGKLEGMATILKRMKEKGMSVTEIATI 300 Query: 286 ANLPLAEIDKVI 297 LP EI +I Sbjct: 301 TGLPEDEIQHLI 312 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 173 bits (439), Expect = 6e-42, Method: Composition-based stats. Identities = 59/299 (19%), Positives = 111/299 (37%), Gaps = 28/299 (9%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 ++ D+++KQ H E RD + L + + + S+ + DV++ Sbjct: 40 SSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDDVVW 99 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH----RHLEADHDKLPLVVPILF 121 ++ G Y+++++E Q++PDK MA RM Y +H + H KLP V+P++ Sbjct: 100 RARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLPVVL 159 Query: 122 YQGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVD------------ITITPDDEIMQ 168 Y G + M +P L R + + L+D + D Sbjct: 160 YHGRGPWRAATALASLMLPAPSGLERYQPSQRYLLIDQHHGTARADVVSLLFRLLDAATD 219 Query: 169 HRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTS---------GSQLVAMQNYMLQRG 219 + L+LL + IR RD+ + + L I Sbjct: 220 LQLREALDLLAERIRARDMDPVRDSLTRWIQLTLQDAAVETSMDLEEAFTMKMRRKFSYD 279 Query: 220 HTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 +F L ++++ Q E+G+E+G +G + E +R +G+ Sbjct: 280 EMFDPGMFERPLAKA--REKAIVEGLQQGREEGLERGRVEGLERGRVEGLERGREEGLK 336 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 68/318 (21%), Positives = 129/318 (40%), Gaps = 28/318 (8%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +D +K + L+ + + L L +++ +D+L Sbjct: 6 PHNVNDLEYKYIFSNKSLFLRLLKRIDRINIFNKLTEEDLELVDKNYVLPDFSEQESDLL 65 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA--------DHDKLPLV 116 Y ++Q + +++ EHQS D MA R++ Y L+ K P V Sbjct: 66 YKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDIWRDWLKQFDKNQFKNKSFKFPPV 125 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 VPI+ Y G+ P+ S+ + + E + + + + L+D+ PD+ I +++ I L Sbjct: 126 VPIVLYDGD-NPWTASVNLKERIMNFEVFGKYIVDFEYILIDLND-PDEMIFKYKDILSL 183 Query: 176 EL-LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ-LVAMQNYMLQRGHTEQADLFYGVLRD 233 L L K +++L L L + L +L+ ++ +L Sbjct: 184 ILKLNKVKTEKELERLFLDLYEYLQGAKEKEINTLKICLPVVLKELGEDKVQEAKDMLEC 243 Query: 234 RETGGESMMTLAQ---WFEEKGIEKGIQQGRQ------------EVSQEFAQRLLSKGMS 278 + GGE +M L Q E+ +GIQ+G Q + E A+R++ KG S Sbjct: 244 IDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDGLQQGLQQGLQKKELEIAERMIVKGYS 303 Query: 279 REDVAEMANLPLAEIDKV 296 E++ E+ L + +I ++ Sbjct: 304 DEEIHEITGLDIEKIKEL 321 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 70/305 (22%), Positives = 132/305 (43%), Gaps = 18/305 (5%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 PHD + ++ + A F + LP E+ EL DL L L SF+ E LK TD Sbjct: 2 TEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQTD 61 Query: 63 VLYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 +L+ + ++ GN ++++ EH+S + + +++ Y +R+ + + +V+P +F Sbjct: 62 LLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEI-YRNQQRSGESFSVVIPFVF 120 Query: 122 YQGEATPYPLSMCWFDMFYSPELAR-----RVYNSPFPLVDITITPDDEIMQHRRIAILE 176 Y GE + L + D F + + + L D+ + ++ + Sbjct: 121 YHGEKE-WKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVTL 179 Query: 177 LLQKHIRQRDLMLLLE-----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGV- 230 + + IR+RDL + L+ I+E + L + Y+ + +L + Sbjct: 180 GVVQRIRERDLEFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRVLA 239 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 + E E MT A+ I +GIQQG+ E E A+ +LS+ + E V + L Sbjct: 240 ISKLEQYEELTMTTAERL----ISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSK 295 Query: 291 AEIDK 295 ++ Sbjct: 296 QDLKD 300 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 113/309 (36%), Gaps = 12/309 (3%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 HD +K + ET ++ + L L S++ + +D+ Sbjct: 18 KKNNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESDI 77 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM--------HRHLEADHDKLPL 115 +Y ++ + + ++++E QS D +M R++ Y I + + +LP Sbjct: 78 VYKARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLPA 137 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA-I 174 VVPI+ Y GE + S + + + +D+ DE+ +++ IA Sbjct: 138 VVPIVVYNGEKNWTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIASA 197 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 + LL + I + + L+ +V ++ + + + + Sbjct: 198 IFLLDQSISRIEFYNRLKDIVIEFNKLTVEEKAQLKHWLVNVNSEENNYKENIEKIFSSN 257 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR---EDVAEMANLPLA 291 + E M + EK E+G +G+ E E + L+K E ++ LP Sbjct: 258 KREVEIMTSNISKGLEKLKEEGKIEGKAEGKAELLIKQLNKKFKLLPMEYEKKIKALPEK 317 Query: 292 EIDKVINLI 300 +D + I Sbjct: 318 ILDDIATDI 326 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 160 bits (406), Expect = 4e-38, Method: Composition-based stats. Identities = 62/278 (22%), Positives = 116/278 (41%), Gaps = 17/278 (6%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD+ +KQF + E L +P + E D +TL SGS++ + L+ D+++ + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 68 QM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH----DKLPLVVPILFY 122 +G Y+ +V+E QS PD MA R + Y+ + ++ + LP V PI+ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 123 QGEATPYPLSMCWFDMFY--SPELARRVYNSPFPLVDITITPDDEI-MQHRRIAILELLQ 179 G + +F L L+D + DE+ +A L L+ Sbjct: 127 NGGKA-WKAPQEVATLFAPMPDSLKHYCPQHRHFLLDESRVSGDELDKSQGLVAQLLKLE 185 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYT---SGSQLVAMQNYMLQR-GHTEQADLFYGVLRDRE 235 + + ++++L+T + E + V + +L+R G TE+ F + Sbjct: 186 RAQEPEQVRQIVKELITRLHEPKYLLLRRAFTVWLSRVVLKRSGITEEIPEFQDLREVDA 245 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 E A ++++ I++G +G + L Sbjct: 246 MLEER----AAQWKDEYIKQGKTEGISIGEARGIRSAL 279 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 159 bits (403), Expect = 8e-38, Method: Composition-based stats. Identities = 49/198 (24%), Positives = 90/198 (45%), Gaps = 8/198 (4%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA +K+ H E RD L+ + + D +TL SGS++ + L+ D+++ ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE----ADHDKLPLVVPILFYQG 124 Q Y+++++E QS D MA R++ Y ++ A + KLP V P++ Y G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 125 EATPYPLSMCWFDMFYSPE--LARRVYNSPFPLVDITITPDDEIM-QHRRIAILELLQKH 181 + + D+ E L R + + LVD D+ + +A L L+ Sbjct: 124 -GPRWRAATEVGDLITPLEGGLERYRPSLRYLLVDEGDYQDEALAPLKNLVASLFRLENS 182 Query: 182 IRQRDLMLLLEQLVTLID 199 +L+ +L L+ + Sbjct: 183 RTPEELLQVLRNLLQWLQ 200 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 158 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 60/291 (20%), Positives = 99/291 (34%), Gaps = 17/291 (5%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 P HDA+FK A LP L + D E + ++ L DV Sbjct: 2 PVLHAHDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDV 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-HDKLPLVVPILFY 122 L+ + G ++V++EHQS ++ M R+ Y H D H LP ++PI+ Sbjct: 62 LWRTRFVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVS 120 Query: 123 QGEATPYPLSMCWFDMFYSPE----LARRVYNSPFPLVDITITPDDEIMQHRRI---AIL 175 E W SP+ LA V N + D+T D + + Sbjct: 121 HAEHGWRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQVDDASLRGRSLPLFQTLA 180 Query: 176 ELLQKHIRQR-------DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQR-GHTEQADLF 227 L + R D + + G + + Y G E ++ Sbjct: 181 LWLLRDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFH 240 Query: 228 YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 + E +T Q +G ++G+++GR + E + L S Sbjct: 241 RKLAAFHPPSAEMSLTFEQQAINRGHKRGLEEGRIKGRLELLEAQLHAKFS 291 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 155 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 50/293 (17%), Positives = 123/293 (41%), Gaps = 25/293 (8%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 + HD +K + E D ++ + + + + L + S+I + Sbjct: 3 IKKEMHHIHDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELE 62 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDK 112 +D++Y + G ++++E QS D M R+ Y +++ + Sbjct: 63 SDIVYKATIDGREVIFYILLEFQSYVDYSMPIRLFLYMSEIWREVLKNTKQAEVKSKEFR 122 Query: 113 LPLVVPILFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRR 171 LP +VP++ Y GE + + + ++ EL + + + L+DI +E+M+ + Sbjct: 123 LPAIVPLVLYNGEY-KWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKN 181 Query: 172 I-AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA----DL 226 + + + LL + + + + ++ + ID + Q + +++++ E + Sbjct: 182 LVSAVFLLDQKVDIEEFISRVKDIA--IDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEK 239 Query: 227 FYGVLRDRETGGESM-----MTLAQWF---EEKGIEKGIQQGRQEVSQEFAQR 271 +L ++ M T+ + F E+G+EKGI++G ++ ++ Q+ Sbjct: 240 IEDILIAKKEEVNRMTSNISKTIKETFAKTREEGMEKGIEEGIEKGIEKARQK 292 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. Identities = 53/217 (24%), Positives = 87/217 (40%), Gaps = 11/217 (5%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK-GHST 61 + TPHD+ FK + L L D ++L SG I E L + Sbjct: 2 TTTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFRS 61 Query: 62 DVLYSV-----QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D++ S+ + G P ++EH+S P + + F++ A R L LP V Sbjct: 62 DLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGKPPLP-V 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH---RRIA 173 VPIL + G++ + + PELA + + ++D+T DDEI + Sbjct: 121 VPILIHHGKSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEIRRKIPDPEPQ 180 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVA 210 + KHI L L + L+ E + L++ Sbjct: 181 MSLAAMKHIHDP-LPAFLRVMADLLKEIEENRDILLS 216 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 153 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 66/212 (31%), Positives = 117/212 (55%), Gaps = 2/212 (0%) Query: 89 KMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMF-YSPELARR 147 F++ RY A M +HL+ H LP+VV +L+Y+G+ TPYP + FD F + +A + Sbjct: 1 MTPFKIARYVHAIMDQHLKQGHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEK 60 Query: 148 VYNSPFPLVDITITPDDEIMQHRRIAILELLQKH-IRQRDLMLLLEQLVTLIDEGYTSGS 206 +Y P+P++DIT DD I H IAIL+ QK+ RD+ +E ++ + +GY + Sbjct: 61 IYLRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTRE 120 Query: 207 QLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQ 266 Q + Y + T+ + L+ E +M++A E++G+++G+QQGR E Sbjct: 121 QCQTLLYYTFRETDTDNVKMLLEQLQTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEEDL 180 Query: 267 EFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + A+R+L+KG R + ++ L ++ + + Sbjct: 181 KIAKRMLAKGTDRGYIKDVTGLSDQDLLNLED 212 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 78/183 (42%), Positives = 110/183 (60%), Gaps = 19/183 (10%) Query: 133 MCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLE 192 MCW F P++ARR+Y FPL+DIT TPDDEIM+HRR+A+LELLQKHIRQRDLM L E Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 193 QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD---RETGGESMMTLAQWFE 249 QLV L+ GYTS QL + +Y+LQ G+ F L R E++M +AQ+ E Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 250 EK----------------GIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 ++ GIE+GI+QG Q+ ++ A+ +L+ G+ VA++ L + Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 294 DKV 296 ++ Sbjct: 181 ARL 183 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 150 bits (380), Expect = 4e-35, Method: Composition-based stats. Identities = 63/180 (35%), Positives = 103/180 (57%), Gaps = 13/180 (7%) Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 P + PE A+ +Y PF L+D+T+ PDD+++QHRR+A+LEL+QKHIRQRDL Sbjct: 11 PHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSS 70 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF-YGVLRDRETGGESMMTLAQWF 248 + E L ++ GYT+ QL + +YMLQ G+T + +F + R E++M++AQ Sbjct: 71 ITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKL 130 Query: 249 EEKGIEKGIQQGRQEVSQEFAQR------------LLSKGMSREDVAEMANLPLAEIDKV 296 +++G ++G +GR+E QE Q +L G+ +E V ++ L E+ + Sbjct: 131 KQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQPL 190 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 61/310 (19%), Positives = 127/310 (40%), Gaps = 25/310 (8%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD+ FK + + L + +++ ++ ++I + DV+ + Sbjct: 14 HDSTFKLLFENPKDIYLLLSKIINYSWANEIRESSIEIKKTNYITKEFSQVEADVVAKAR 73 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 ++ Y +++IE+QS K M R++RY I+ + +KLP ++PI+ Y G Sbjct: 74 LKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNGVEKLPAIIPIVVYNGLDRR 133 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA---ILELLQKHIRQR 185 + +S F + + +VDI +Q + I L Q Sbjct: 134 WEVSTDIIGAFD----IFKNDIFKYKVVDIAQIDIKNYLQEEDVLTPIIFYLEQVRNDSN 189 Query: 186 DLMLLLEQLVTLIDEG--YTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM- 242 +L+ L+++ + + L+ Q+ + R EQ + ++ G +M Sbjct: 190 ELVRRLQEIEQSLKKLSFNNIERFLLWSQHVIRPRLGNEQKKEYDKLVMKVRQEGVELMG 249 Query: 243 ----TLAQWFEEKGIEKGI-----------QQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 +A+ +E ++ + QQG Q+ E A+R++ G+S E +++ N Sbjct: 250 EFVSNVARLLDETKTKEFLAGVQQGIQQGIQQGIQQERIETAKRMIQLGISYEVISKATN 309 Query: 288 LPLAEIDKVI 297 L + EI+K+ Sbjct: 310 LSIEEIEKIA 319 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 109/250 (43%), Gaps = 17/250 (6%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FKQ + + L+I EL + DL ++ L + + + D+LY Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFY-SELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYE 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ +L ++ EH+S DK + +++ Y+ + LP ++ I+ Y G+ Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWE-ETGEYKEYLP-IINIVLYHGKR 121 Query: 127 TPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRI----AILELLQKH 181 + + E+ R N + L+D++ D+E++ + A L KH Sbjct: 122 KWNIPTTL---PKTNSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKH 178 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 I + + + ++ + E Y G + + +Y+ + ++ + VL++ G + M Sbjct: 179 IFED--LKKYKHILKKVFEHYQDGCVFI-ILDYISVVNNPQEVE---NVLKEILGGEKEM 232 Query: 242 MTLAQWFEEK 251 TL + ++ + Sbjct: 233 TTLTEKWKME 242 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 59/280 (21%), Positives = 115/280 (41%), Gaps = 31/280 (11%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 A HD +F+ AR FL LP EL D +TL + S I ++L D Sbjct: 30 AAGNGDHDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERRED 89 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH----------------- 105 V+Y + + G + +V++EHQ+K +K MA R+M + + Sbjct: 90 VVYRINVNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKAD 149 Query: 106 ---LEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELAR----RVYNSPFPLVDI 158 + DK PLV+ ++ + G + + P + + + + F +V++ Sbjct: 150 RQSRRRETDKFPLVISMVLHPGPRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVEL 209 Query: 159 TITPDDEIMQ-HRRIAILELLQKH----IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQN 213 P +++ H AIL LQ + I R + LL+++ + D + G+ + + + Sbjct: 210 AGLPLEKLADGHLARAILGALQGNRLGLIDIRKIKRLLDEMFSDPDRA-SVGAVVKQLWH 268 Query: 214 YMLQRGHTEQADLFYGVLRDR-ETGGESMMTLAQWFEEKG 252 Y++ ++ V+ E ++M + ++ G Sbjct: 269 YLISSSDLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAG 308 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 147 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 59/262 (22%), Positives = 110/262 (41%), Gaps = 17/262 (6%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 +PHD FK + FLEI LP L E N+L L + K D+ + Sbjct: 6 SPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFLDLAFD 64 Query: 67 VQMQGN-----PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 +++ G +++V EH+S PDK ++ Y M P V+PI+F Sbjct: 65 CKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEEDERLSRPYRP-VIPIVF 123 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL---- 177 Y GE + + L + +++ + L D++ + +++ + + Sbjct: 124 YHGEKSWNIPTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLIEKIYLNACLISGVF 183 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 K+I +DL L L LI + L + +Y + + + +L + G Sbjct: 184 TLKNIF-KDLKYLRPVLEKLILDDVKDC--LYIIIDYTVIV--KKDLETIEKILEEI-GG 237 Query: 238 GESMMTLAQWFEEKGIEKGIQQ 259 E MMTL + ++ +G++KG+++ Sbjct: 238 EEKMMTLTEKWKMEGLKKGMEE 259 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 52/135 (38%), Positives = 79/135 (58%), Gaps = 2/135 (1%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 PHD F+ + A++F E HLP + + DLN+L L+ SFI+E LK D Sbjct: 2 KKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMAD 61 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-HDKLPLVVPILF 121 VLYSV++ PGY ++++EHQ PDK M +R++RY + + HL+ + LP+VVP++F Sbjct: 62 VLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLVF 121 Query: 122 YQGEATPYPLSMCWF 136 Y G+ YP + Sbjct: 122 YNGKK-RYPFQRIFL 135 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 58/309 (18%), Positives = 132/309 (42%), Gaps = 18/309 (5%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 S D++FK+ DFL+ LP E + L E I + +D+L Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 65 YSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD-------KLPLV 116 Y + + G Y+++++EHQSK D+ MAFRM+ Y + +++ + KLP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM-QHRRIAIL 175 + ++FY G+A + + + + + L++++ ++ I+ + + ++ Sbjct: 122 IGMVFYDGKAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGVI 181 Query: 176 ELLQK-HIRQRDLMLLL-----EQLVTLIDEGYTSGSQLVAMQNYML--QRGHTEQADLF 227 L K ++R ++ LL + L+ L +E ++ + + + E + F Sbjct: 182 LLTDKPNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKERF 241 Query: 228 YGVLR-DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 + + ++ +A+ EK +G +G+ E E + L+ + +++ + Sbjct: 242 EELKEMEVPKMFNTLEEIAKRDREKAKLEGKAEGKVEGKLEERRELIIEILNQRFGEDFD 301 Query: 287 NLPLAEIDK 295 +I Sbjct: 302 KSLEEKIRN 310 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 107/266 (40%), Gaps = 21/266 (7%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 +D + + E A D LP L + DL+ L L SG+++ + L+ + TDVLYSV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEA 126 + G +++++++HQS D R+ R ++ R+L D LP+++PI+F+ EA Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHH-EA 142 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITP--------DDEIMQHRRIAILELL 178 T + ++ R ++ D+ L LL Sbjct: 143 TGWSDAVGLNGSLALGADVRTALSANRRDFRRLRYLLLVLCFQFDEASRAQNLNEALGLL 202 Query: 179 QK----HIRQRDLMLLLEQLVTLIDEGYTS---GSQLVAMQNYMLQRGHTEQADLFYGVL 231 + +RDL+ L+ +I E + L + ++L+ T D L Sbjct: 203 MRTFGVARPKRDLVASLKGWEDVIREVVATQRGREMLATVVQFILENSET-DPDELKSFL 261 Query: 232 R--DRETGGESMMTLAQWFEEKGIEK 255 E + MT A + E+ Sbjct: 262 EFTAGEPARTAFMTGADRLTQGVREE 287 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 46/262 (17%), Positives = 106/262 (40%), Gaps = 13/262 (4%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 HD +K L + L + E D + SF+ + D++Y Sbjct: 11 HNQHDKGYKFLLSSKRVFIELLRSFVKQEWVNDIDEANVVKVDKSFVLQDFADKEADLVY 70 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM--------HRHLEADHDKLPLVV 117 V+++ ++++E QS D +M +R++ Y + + KLP++V Sbjct: 71 RVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDTPRKESRRKDFKLPVIV 130 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPDDEIMQ-HRRIAIL 175 PI+ Y G+ + + + S E + + L+D+ +E+++ IA + Sbjct: 131 PIVLYNGDH-KWTAKTSYKETLNSYETFGEYAVDFKYILIDVNRYTKEELLKLENLIASV 189 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRD 233 LL++ + ++M L++L +++ L + +L R E+ + ++ + Sbjct: 190 FLLEQKVEFEEIMKRLKELSEILNNLDKDEILLFKAWFKKILLARLPEEERENIERIIDE 249 Query: 234 RETGGESMMTLAQWFEEKGIEK 255 + E + L + ++ E+ Sbjct: 250 NKEVEEMISNLEKTILQEMKER 271 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 56/279 (20%), Positives = 104/279 (37%), Gaps = 12/279 (4%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 DA++ + H A + +P + D + + F + K DV++ + Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 70 -QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM-----HRHLEADHDKLPLVVPILFYQ 123 G LH++ E QS D MA R Y R L++ D+LP V+ ++ Y Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSG-DRLPPVLTLVLYN 123 Query: 124 GEATPYPLSMCWF--DMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 GE + + + L + + L+D+ P++E+ +A L +H Sbjct: 124 GEQRWHAPTDTIPLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAALLFRLEH 183 Query: 182 IRQR-DLMLLLEQLVTLID--EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 R+ +L L++ +V GY +L G+ + ++ R Sbjct: 184 PREPEELAGLIDDVVGWFRRHPGYDELRRLFTELVRQAIEGYETSVAVPGDMMEMRSMLA 243 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 T + + +GI +G +G + RLL K Sbjct: 244 NLGETWKKRWLAEGIAEGEARGEARGEAKALIRLLEKRF 282 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 130 bits (327), Expect = 5e-29, Method: Composition-based stats. Identities = 63/319 (19%), Positives = 119/319 (37%), Gaps = 40/319 (12%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG--HSTDVLY 65 PHD FK+ L +F+ + P EL D + + + + + D+L Sbjct: 27 PHDEAFKKLLH--TFFAEFIALFFP-ELESQLDFSQTRFLMQEQLVDVVGEEARTLDLLL 83 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + G ++ + +E QS RM Y RH + L++PI + Sbjct: 84 ETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERHRKEHQ----LIIPIAIFTSA 139 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK----H 181 S + L + F V++ P + LL K Sbjct: 140 E-----SKNERNSLNMSILGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYNK 194 Query: 182 IRQRDLMLLLEQLVTLIDE--GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 +R+L L +++ + + + ++++ + + + ++ + + E Sbjct: 195 GEERELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKESE 254 Query: 240 SMMTL--------------------AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 +M L + EKG EKGI+QG ++ A+RLLSKG + Sbjct: 255 VIMELMPAWMRQGYEKGLEEGLEKGIEQGIEKGFEKGIEQGTLIERRQIARRLLSKGFTL 314 Query: 280 EDVAEMANLPLAEIDKVIN 298 E++A+M L + EI K++N Sbjct: 315 EEIADMTQLSIEEIKKIMN 333 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 52/305 (17%), Positives = 124/305 (40%), Gaps = 24/305 (7%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ L + + L L L L + + + I + +D++Y ++ Sbjct: 9 DEGFKKVLTNRTNIKWLLTELL-EVLPIQIGLEDIEVIATESINRQWRARRSDMVYKIKY 67 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 + Y+ V++E QS ++ + R++ Y + ++ + LP+V+P++ Y GE Sbjct: 68 K--DAYICVLLEFQSSKEELIHLRVLEYMLLIQKKYTTKN--LLPVVIPVVLYTGEEKWT 123 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR-IAILELLQKHIRQRD-L 187 P + ++ Y + + V VD+ + D+++++ +A + K + + Sbjct: 124 PATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDKVSDNPEKV 183 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG----ESMMT 243 LE L + + +++ +G+ + L + E + Sbjct: 184 AERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRLGVNEMFLN 243 Query: 244 LAQWFEEKGIEKGIQQGR------------QEVSQEFAQRLLSKGMSREDVAEMANLPLA 291 A+ KG+EK +++ R ++ E AQ+++ +G +A++ L + Sbjct: 244 TAEKIR-KGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMIEEGAEDSFIAKVTGLDME 302 Query: 292 EIDKV 296 I ++ Sbjct: 303 RIRQL 307 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 43/136 (31%), Positives = 71/136 (52%), Gaps = 8/136 (5%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 PHD FK+ TAR FLE +LP E+R L DL T+ + S+I++ L+ +D+L+ Sbjct: 5 HNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFSDLLF 64 Query: 66 SVQMQGNPGYLHVVIEHQSKPD----KKMAFRMMRYSIAAMHRHL---EADHDKLPLVVP 118 V+++ N GY + + EH+ +P KKM+ R+ S+ + R + +H K P + Sbjct: 65 QVKIRENEGYFYFLFEHKVRPYADRRKKMSTRLADDSVLSKQREMFMQSVNHGKPPYISR 124 Query: 119 ILFYQGEATPYPLSMC 134 + +G T C Sbjct: 125 FI-RKGNRTGSAACRC 139 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 126 bits (317), Expect = 8e-28, Method: Composition-based stats. Identities = 50/255 (19%), Positives = 112/255 (43%), Gaps = 13/255 (5%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIE-ESLKGHSTDVLY 65 PHD K+ L E A+ L+ HLP E+ + TL + + ++ + + D++Y Sbjct: 15 NPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADIIY 74 Query: 66 SVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 S++ G ++V+IEH+S DK + ++++ A + + K+ + PI+ Y Sbjct: 75 SLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEG--KITPIYPIVIYAS 132 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-HRRIAILELLQKHIR 183 + S S + + + +++ + I + ++ I L + + I+ Sbjct: 133 KEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMTLRIIQ 192 Query: 184 QR---DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 + +++ L++ + TL + Y + V +Y+ ++ ++ G + Sbjct: 193 EPTPENILNLIKSIETLYN--YKPKAVYVIALSYIFTIAKKDKNTYIK---VKKQLEGGN 247 Query: 241 MMTLAQWFEEKGIEK 255 M +L F E+G+EK Sbjct: 248 MGSLLDMFIEEGLEK 262 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 123 bits (310), Expect = 6e-27, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 85/225 (37%), Gaps = 13/225 (5%) Query: 46 LESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH-- 103 L + S+I + +D++Y GN + +V++E QS D +M R++ Y I Sbjct: 3 LVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWRDI 62 Query: 104 ------RHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVD 157 + + +LP +VPI+ Y G+ + S + N + +D Sbjct: 63 LRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEFID 122 Query: 158 ITITPDDEIMQHRRI-AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYML 216 I +E+ + I + + LL ++I + + L+ ++ + + + Sbjct: 123 INSYEKEELYNKQNISSAIFLLDQNINRIEFYNRLKDIIIGFNNLSIEEKMHLKHWLVNI 182 Query: 217 QRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGR 261 D + + +M + KG+EK + G+ Sbjct: 183 NTEENNFKDNIEKIFNADKQEVLNMTS----NISKGLEKLKEDGK 223 >UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostridia RepID=B9MPV5_ANATD Length = 331 Score = 120 bits (301), Expect = 6e-26, Method: Composition-based stats. Identities = 51/325 (15%), Positives = 123/325 (37%), Gaps = 39/325 (12%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 +D FK+ F+ +P + + + + + I K +D++Y + Sbjct: 7 YDVGFKKLFSDKINVCWFITEIIPEPRLKNYTQSDIEIVATESINAQWKARRSDMVYRLP 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 + +++++E QS+P+K+M R+ Y ++ +VVP++ Y G Sbjct: 67 YSSSW--IYLLVEFQSRPNKQMHCRIYEYVFLIQRKYQIDKRLP--VVVPVVLYNGVEKW 122 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-HRRIAILELLQKHIRQRD- 186 P++ ++ Y+ + V + +D+ P+D+++ + +A + + D Sbjct: 123 QPVTQFADNVEYAEDFPEYVQRLNYIFIDVRDIPEDKLLNGNNVLAAALYVDQVATNPDS 182 Query: 187 -LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM---- 241 + LLE + + + +L+ + ++ + + G E M Sbjct: 183 VVERLLELGKNIRIPDEQREELAEWLYHAVLKSYKIPREEINELFAKSKILGVEEMFQST 242 Query: 242 -MTLAQWFEE---------------------------KGIEKGIQQGRQEVSQEFAQRLL 273 M + + E +G +G +GR E E A+ L+ Sbjct: 243 AMKIKKGLAEEKKKIRLESKIEGKIEGKIEGKIEGKIEGKIEGKIEGRMEAQLEIARNLI 302 Query: 274 SKGMSREDVAEMANLPLAEIDKVIN 298 +G +A++ L + ++ ++ N Sbjct: 303 LEGAEDSFIAKVTGLDIEKVKELRN 327 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 118 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 133/315 (42%), Gaps = 26/315 (8%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 + +D +K+ + E FL+ L E + + + + + + I + + +D+ Sbjct: 2 KTYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDI 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 +Y ++ + + + IE QS+ DKK+ R+ Y ++ + ++P+VVPI+ Y Sbjct: 62 VYKIKYK--DSFFCLTIEFQSREDKKILHRLYEYMHLIQLKN--KVNGEIPVVVPIVLYN 117 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR-IAILELLQKHI 182 G + P + ++ + N +DI P+++++ +AI + + Sbjct: 118 GISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLISAANVLAIAVYIDQVS 177 Query: 183 RQRD-LMLLLEQLVTLIDEGYTSGSQL------VAMQNYMLQRGHTEQA---------DL 226 + ++ + L I + +L V +++Y + E+ +L Sbjct: 178 NNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGLEVDEL 237 Query: 227 FYGVLRDRETGGE-SMMTLAQWFEEKGIEKGIQQGRQEVSQE----FAQRLLSKGMSRED 281 F + G E +A+ ++G+++G++QG ++ + A+++L E Sbjct: 238 FSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKDNQPIEL 297 Query: 282 VAEMANLPLAEIDKV 296 +++ L EI K+ Sbjct: 298 ISKYTGLTPEEIKKL 312 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 60/287 (20%), Positives = 119/287 (41%), Gaps = 30/287 (10%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + R+ + LP ++ + LN +E + + K TD+L V+ Y+ + + Sbjct: 16 KIFRENMHNTLPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYV-LHV 74 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E+Q+ +MAFRM YSI +H KLP+ +++ S+ D + Sbjct: 75 EYQTDNYPEMAFRMAEYSIMLQRKH------KLPVKQFVIYIGPAKANMATSITTKDFRF 128 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTL--- 197 L V+ + ++++ + +AIL L + L +++++ T Sbjct: 129 RYNLT------ELSAVNYKLFLKSDLVEEKMLAILSNLASESTESVLAQVVQEIETHTST 182 Query: 198 IDEGYTSGSQLVAMQNYMLQRGHTEQ--------ADLFYGVLRDRETGGESMMTLAQWFE 249 +++G + +Q L + + + + R E GE + Sbjct: 183 LEQGRYFRQLRILLQLRNLNKKAIKDMALVGKIFKEEKDILYRRGEIKGEIKGEI----- 237 Query: 250 EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 KG KGI++GR E + E A L +G++ E +A++ L + EI + Sbjct: 238 -KGEIKGIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEIQAL 283 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 109/250 (43%), Gaps = 20/250 (8%) Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKLP 114 ++Y V+++ + ++++E QSK D +M +R++ Y I + KLP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQ-HRRI 172 ++PI+ Y G + S+ + + S +L + + + L+D+ ++E++Q I Sbjct: 61 AIIPIVLYNG-VNRWTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNLI 119 Query: 173 AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH----TEQADLFY 228 + + LL + I + +L +L ++ S + + ++N++ ++ Sbjct: 120 SSIFLLDRKIDKEELTEKWGKLADVL--KDISEEEFIILRNWLFSVVSRFLPEDKEKEIK 177 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIE---KGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 +L E E + L + E+ + +G+++G ++ E + +G + + Sbjct: 178 EILVQSEGVEEMISNLERSLREEFRKTRREGLKEGLKKGKLEGLKIGKMEGRMEGKIEGI 237 Query: 286 ANLPLAEIDK 295 + ++ + Sbjct: 238 RMVVFEQLKE 247 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 48/150 (32%), Positives = 85/150 (56%), Gaps = 17/150 (11%) Query: 161 TPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 PDD+IMQHRR+A+LEL+QKHIR+RDLM L+E+L L+ +G+ + +QL A+ NY++Q G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 221 TEQA-DLFYGVLRDRETGGESMMTLAQWFE----------------EKGIEKGIQQGRQE 263 T + + V + +MT+A+ ++G++ G+QQG++E Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 264 VSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 + A + + G+ + + L ++ Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDL 150 >UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostridium kluyveri RepID=B9E303_CLOK1 Length = 304 Score = 115 bits (287), Expect = 3e-24, Method: Composition-based stats. Identities = 46/250 (18%), Positives = 96/250 (38%), Gaps = 34/250 (13%) Query: 78 VVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------KLPLVVPILFYQGEATPY 129 +E QS+ D +M R++ Y + L+ KLP ++P++ Y G+ + Sbjct: 28 CFLEFQSRVDYRMPMRLLFYMVEIWREILKNTSKNDRSKKDFKLPSIIPMVLYNGK-NTW 86 Query: 130 PLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRRI-AILELLQKHIRQRDL 187 + D+ +L V + + L DI ++++ + + + LL K I + DL Sbjct: 87 TACKNFKDVLSGSKLFGENVIDFRYMLFDIYRYNEEQLEDMANMVSTVFLLDKEISKEDL 146 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQ----RGHTEQADLFYGVLRDRETGGESMM- 242 + L +T + Q ++ ++ R +E +L G M Sbjct: 147 VKRLR--LTAYVLKKITPEQFDILKAWLKSIIKPRLDSESKIKIEEILEKSSQGEVDSMV 204 Query: 243 ----------------TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 T + +G ++G ++GR+E +E + S+ +++ V + Sbjct: 205 SNLGKTIDNIIREGRETGLEEGRREGRKEGRKEGRKEGRKEGRKEGKSELITKMLVKKFT 264 Query: 287 NLPLAEIDKV 296 LP K+ Sbjct: 265 KLPDGYTHKI 274 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 43/128 (33%), Positives = 66/128 (51%), Gaps = 9/128 (7%) Query: 167 MQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-D 225 +H +A+LEL+QKHIRQRDLM L+EQ+ L+ GY + Q+ + NY+LQ G + D Sbjct: 12 RRHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFND 71 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 GV ES+MT+A+ Q+G Q + A+ +L G+ D+ Sbjct: 72 FIDGVAERSPKHKESLMTIAERLR--------QEGEQSKALHIAKIMLESGVPLADIMRF 123 Query: 286 ANLPLAEI 293 + E+ Sbjct: 124 TGVSEEEL 131 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 110 bits (274), Expect = 8e-23, Method: Composition-based stats. Identities = 50/308 (16%), Positives = 104/308 (33%), Gaps = 28/308 (9%) Query: 6 TTPH--DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 H D K FL+ + + L + + S F+ + +D Sbjct: 13 HNTHTKDNAAKIVFGDPVLCAQFLKGYTDIPLFKEIKPEDIENVSSHFLPLFQESRDSDT 72 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM------HRHLEADHDK----- 112 + + + + YL +IEHQS+ D M+FR++RY + L K Sbjct: 73 VNKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAAQQEKLHKGTTKSKDFL 132 Query: 113 LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI 172 P ++PI++Y+G +T +F S + + + +V + ++++ Sbjct: 133 YPPILPIVYYEGSSTWSAPLNFKNRVFLSDVFGDYIPSFNYLVVPLNKYSKQDLIEKNDE 192 Query: 173 AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGS-----QLVAMQNYMLQRGHTEQADLF 227 L L ++ L+ + E T + +++ +L + Sbjct: 193 LSLIFLINQLQSSSEFHALKDIPKKYTEHLTEDTPDYLLKIIGKVIAVLLHKLNVPDEEV 252 Query: 228 YGVLRDRETGGESMM----------TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 Y V SMM + E+G +G +G + E + +G Sbjct: 253 YEVTDQITRRKFSMMFDNFQAYDVQETRRVSREEGRLEGRIEGERAGRIEGERAGRIEGE 312 Query: 278 SREDVAEM 285 + ++ Sbjct: 313 RLHLIKQV 320 >UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseiflexus sp. RS-1 RepID=A5USQ0_ROSS1 Length = 330 Score = 106 bits (265), Expect = 8e-22, Method: Composition-based stats. Identities = 57/281 (20%), Positives = 98/281 (34%), Gaps = 24/281 (8%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK--GHSTDVLYS 66 HDA+FK L R+F+++ P +L D + D++ Sbjct: 7 HDALFKLVLT--AFFREFIDLVAP-DLAAALDPAPPVFLDKESFADLFDPDRREADLVAQ 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 V+++ +P L + +EHQ++ D + RM RY R+ + + PI Sbjct: 64 VRLRQHPATLLIHLEHQAQADAALDRRMFRYFARLYDRYDQ-------PIYPIAL----- 111 Query: 127 TPYPLSMC-WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI--- 182 YP D R V + +V + + A + L+ + Sbjct: 112 CSYPRPRRPAADRHEVRAAQRTVLTFQYQVVQLNRMDWRAYLTTTNPAAMALMARMRVAP 171 Query: 183 --RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 R R L L G + Y+ EQA L V R E Sbjct: 172 EDRWRVKAACLRLLAGAPLTGAQRRLIGQFVDIYLPLNAREEQA-LAAEVARLPGAAKEV 230 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 +M L +E KG +G+++G +E E + ++G+ Sbjct: 231 VMELITSWERKGRAEGLREGLREGRAEGLREGRAEGLREGQ 271 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 29/101 (28%), Positives = 60/101 (59%), Gaps = 3/101 (2%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + PH+ +F + + + AR FL+ H+ E+++ DL+TL LE ++++E LK H +D Sbjct: 4 KRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKHYSD 63 Query: 63 VLYSVQMQGNP---GYLHVVIEHQSKPDKKMAFRMMRYSIA 100 +++SV++ G ++++ EH+S PD ++++Y Sbjct: 64 LVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKYMAL 104 >UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EA97_9SPHI Length = 293 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 115/290 (39%), Gaps = 33/290 (11%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + R+ +E+ LP +RE+ L L E + + D L V ++ + I Sbjct: 16 KIIRENMEVTLPEVIREVLGLEILLSEELPDDVQHTRERKPDALKKVTDIQGNTFV-LHI 74 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E Q + +K+M +RM YSI M R+ +LP+ ++F + P + + Y Sbjct: 75 EFQVEDEKEMVYRMAEYSIMLMRRY------QLPVKQYVIFLKDTKPRMPTGLKTPKLVY 128 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE 200 S +L R + + L + P+ + +A+L + R+ L ++ L++ Sbjct: 129 SFDLIR-IAEISYKLFIKSDNPEV-----KMLAVLANFDEADREGALTSIITGLLSHSKG 182 Query: 201 GYTSGSQLVAMQNYMLQRGHTEQ-------------ADLFYGVLRDRETGGESMMTLAQW 247 + ++ +M R EQ + R E GE Sbjct: 183 DFAERRHFKQLRIFMQLRSSIEQHFDKVMDSVSTFFKEENDYFYRKGEARGEI------K 236 Query: 248 FEEKGIEKGIQQGRQEVSQEFAQRLLSK-GMSREDVAEMANLPLAEIDKV 296 E KG KG +G + S+ + L++K G S E AE+A + + + + Sbjct: 237 GEAKGEAKGEAKGEAKKSRAVVENLIAKLGFSDEQAAEIAEVTVDFVKDI 286 >UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermoanaerobacterales RepID=B0K813_THEP3 Length = 267 Score = 101 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 116/293 (39%), Gaps = 33/293 (11%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 +D K + A D L L + F ++ +D+++ Sbjct: 5 YDITAKNIFSN--LADDIASYFL------GLKFTKLDELNIEFTT--IESRESDMVFKCT 54 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 + + + IE Q+ D KM +RM+RY+ M +H + ++ Y + Sbjct: 55 TENRD--IALHIEFQTYNDSKMPYRMLRYATEIMEKHNLLPYQ-------VVVYCSKNE- 104 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL---QKHIRQR 185 + + + N + ++D+ ++I++ + + L K RQ+ Sbjct: 105 ----LKMENNLNYHLGEENLLNFRYRIIDVGKIKFEDIVKTKYYDLYTFLPVADKDKRQK 160 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + L + +I + ++ ++Y++ ++ + ++ M++ Sbjct: 161 EKEAYLRKCAEVIRDMPVDKAK----KSYIVTTAEILAGIIYDEEVIEKIFSEVIGMSIL 216 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + E K + +++G++E S E A+ LL +GM +A++ L + EI K++N Sbjct: 217 E--ESKVYKNILEKGKKEKSIEIARELLKEGMDINKIAQITKLSVEEIKKLLN 267 >UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKN5_MOOTA Length = 304 Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 62/303 (20%), Positives = 112/303 (36%), Gaps = 25/303 (8%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDV 63 HD +FK+ L R+F+E+ P L D + I + H D+ Sbjct: 2 PVDHDRLFKELLT--TFFREFMELFFPAA-HTLIDYTDTKFLTQEVITDITAGDKHYVDI 58 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI-LFY 122 L V+++G G + V IE Q+ A RM Y +H + V+PI +F Sbjct: 59 LAEVKIKGEDGCVLVHIEPQAYRQADFARRMFIYFSRLYEKHQKR-------VLPIAVFA 111 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK-- 180 F +V F + + P + + LL K Sbjct: 112 HDSKVEETNRHEVEFPFL------KVLQFEFYKIQLKRLPWRQYLNSNNPVAAALLSKMD 165 Query: 181 HIRQRDLMLLLE--QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 + + + + +E +L+T + + A + L E+ L + + + Sbjct: 166 YSPRERVQVKIEFLRLLTRMQLDPARMELITAFFDSYLVLNAEEEKSLQEKLSEELQPEE 225 Query: 239 -ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + +M L + KG ++G Q+GRQE+ ++ L S E A++ L ++D + Sbjct: 226 VQRVMELTTSWHLKGWQQGRQEGRQEILLRQLRKRLGT-TSPEVEAKIKTLSAEQLDDLA 284 Query: 298 NLI 300 I Sbjct: 285 EKI 287 >UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NIZ1_GLOVI Length = 311 Score = 100 bits (248), Expect = 7e-20, Method: Composition-based stats. Identities = 46/308 (14%), Positives = 117/308 (37%), Gaps = 29/308 (9%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVL 64 T HD +FK+ L +F+++ ++ + ++ + + + D++ Sbjct: 2 TDHDRLFKELLS--TFFVEFIDLFF-ADVGNYLERGSIVFLEKELFSDITAGERYEADLV 58 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 + + + + V IE+Q++ ++RM RY ++ +LP + PI + Sbjct: 59 VKARFRDHQSFFLVHIENQTEAQSIFSYRMFRYFARLYEKY------QLP-IYPIAVFSF 111 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK-HIR 183 F V + +V + + ++ L+ + I Sbjct: 112 TEPLRAEPTAHRVAFPDFT----VLEFHYRVVQLNRLDWRDFLRQPNPVASALMARMRIA 167 Query: 184 QRDLMLLLEQLVTLID--EGYTSGSQLVAMQ--NYMLQRGHTEQADLFYGVLRDRETGGE 239 D + + + L+ + +QL++ Y+ E+ + + E Sbjct: 168 PADRPRVKLECLRLLATLRLDPARTQLISGFVDTYLKLTAQEERL-FAAELATIGASEQE 226 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR-------EDVAEMANLPLAE 292 +++ + + ++G+E+G Q GRQE QE A ++ + +SR ++ ++ L Sbjct: 227 AVVQIVTSWMQQGLEQGRQVGRQEGRQEEALAIVLRQLSRRLGTLPAQNAERVSGLSTTA 286 Query: 293 IDKVINLI 300 ++ + + Sbjct: 287 LEALSEAL 294 >UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XMU9_SYNP2 Length = 316 Score = 98.5 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 47/279 (16%), Positives = 105/279 (37%), Gaps = 22/279 (7%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEE--SLKGHSTDVLYS 66 HD +FK+ L DFL + P E+ E + N+L + + + D++ Sbjct: 7 HDLLFKELLT--TFFWDFLALFAP-EILETAEQNSLTFLTQEVFNDLPGQTRRNVDIVAK 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + +G V +E+Q+ A RM Y ++ +LP + PI + + Sbjct: 64 LHFRGQETCFLVHVENQATSQADFAERMFLYFARLYEKY------RLP-IYPIALFSYRS 116 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 + F S E + + F + + P + ++ L+ K + Sbjct: 117 PQRLEPETFSVAFPSKE----ILSFSFQTIQLNRLPWRDFLRQPNPVAAALMAKMNFSSE 172 Query: 187 LMLLLE----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES-M 241 ++ +++ + L + R + + +F L + E+ + Sbjct: 173 ERPKVKLECLRMIVTLRLDSARIHLLSGFVD-TYLRLNMAEQQVFEQELHRIQPQEEAQV 231 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRE 280 + + + E+G+++G Q+GRQE + + R + + + Sbjct: 232 LRIVTSWMEEGLQQGRQEGRQEEACKLILRFVQQRFPEQ 270 >UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PZ06_9BACT Length = 238 Score = 97.7 bits (242), Expect = 4e-19, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 73/195 (37%), Gaps = 13/195 (6%) Query: 95 MRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP 154 M+Y + + + +P V+P++ Y G+ T + R + + Sbjct: 1 MKYLLKIWAANSKQMQRLIP-VIPVILYHGKETWKVRRFRDYFEGIDEVFFRFIPEFEYL 59 Query: 155 LVDITITPDDEIMQHR----RIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ--- 207 L D++ ++EI + I LL ++I ++ ++L + G + Sbjct: 60 LTDLSFYSNEEIKDKVFRRVSLQITMLLMRNIYNDKILG--DKLKAFFEIGKQYFEEGEG 117 Query: 208 ---LVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV 264 L ++ Y+ E+ + + E GG MT+A EKG G +GR E Sbjct: 118 LKFLESVIRYLYYASDIEEERVIDTLKEISEEGGRLSMTIAARLIEKGKIAGRMEGRAEG 177 Query: 265 SQEFAQRLLSKGMSR 279 ++ L + + Sbjct: 178 ERKGRMEGLIEAIEI 192 >UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FHW2_9AQUI Length = 211 Score = 96.2 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 72/173 (41%), Gaps = 8/173 (4%) Query: 109 DHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ 168 + P ++ I+FY GE + +L + L+D+ PD+E+ Sbjct: 6 KKEYYPPIINIVFYHGEREWNIPTN--LPTVKDKDLQEYTQKLNYILIDLNKIPDEELKN 63 Query: 169 H--RRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL 226 + + ++ + R D + L ++ LI + L + +Y++ + A+ Sbjct: 64 RISKNMDVILAILVMKRIFDDIQNLRPILELIIKH--KSDSLFIILDYIVLI--KKDAEK 119 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 +L++ G E MMTL + ++ +G KG +GR E ++ +L+ Sbjct: 120 VEKILKEISGGDEKMMTLTEKWKMEGWMKGKLEGRLEAQRKAIIKLIQLKFGN 172 >UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostridia RepID=A5D0D4_PELTS Length = 332 Score = 95.8 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 100/274 (36%), Gaps = 21/274 (7%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVLYS 66 HD +FKQ L +F+E+ P E + DL + + + H D++ Sbjct: 8 HDRLFKQLLE--TFFAEFMELFFP-EAAQATDLEYVKFLQQELFTDITAGEKHRADIIVE 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ PG + V +E QS K+ RM Y ++ ++P+ + Sbjct: 65 TRLKDEPGLILVHVEPQSYIQKEFNERMFIYFSRLYEKYRRK-------ILPVAVF---- 113 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y D F V F +++ + ++ LL K + + Sbjct: 114 -TYDHIRNEPDSFEIGFSFLDVLRFHFYKLELKKLHWRDYIRSDNPVAAALLSKMGFRPE 172 Query: 187 LMLLLEQLVTLI---DEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG-ESMM 242 + ++ + + + ++L+ + + ++ + FY L + E +M Sbjct: 173 ERVQVKLEFMRMLARMKLDPARTELIGGFFETYLKLNRQEEEEFYRELGKIDKKEVELIM 232 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 + + EKG +G +GR E E ++G Sbjct: 233 QITTSWHEKGRMEGRLEGRLEGRLEGRLEGEARG 266 >UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermoanaerobacterales RepID=B0KCX4_THEP3 Length = 267 Score = 94.3 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 49/294 (16%), Positives = 104/294 (35%), Gaps = 29/294 (9%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + +D K + A D L + F ++ +D++ Sbjct: 2 SQKYDITIKDIFSN--MADDITAYFL------GLTYTKTDELNIEFT--KVEKRQSDIVL 51 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + + V +E QS D KM +RM+RYS+ M ++ + ++ Y G+ Sbjct: 52 KCTTEKGD--IAVHLEFQSDNDDKMPYRMLRYSLEIMEKYNLTPYQ-------LVIYMGK 102 Query: 126 ATPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + L + + + ++D+ +I + + LL R+ Sbjct: 103 ND------LRMENKLDYNLGEENILDYRYKIIDVGTIKFLDITKTDYYDLYALLPIMDRE 156 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 R + L ++ + ++ + V+ T M+ + Sbjct: 157 RRKTEGEKYLKECVEAIKNIPIDINKKKDITFKAEILSGLVYSREVIERVFTEVMEMLRI 216 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + K I +++G +E S A+ LL +GM +A++ L + EI K++N Sbjct: 217 EESEAYKMI---LEKGAKEKSLRIAKELLKEGMDINKIAKITELSIEEIKKLMN 267 >UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1D5_ABIDE Length = 297 Score = 92.7 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 91/235 (38%), Gaps = 9/235 (3%) Query: 69 MQGNPGYLHVV-IEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 + N + IE+QS PDK M R++ Y A + ++ + V+ I+ Y G+ Sbjct: 65 WKKNEVIFSFIGIENQSAPDKDMILRIISYDGATYKSQM--GNESIYPVLTIVIYWGKYE 122 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM----QHRRIAILELLQKHIR 183 + ELA + + F L+DI E++ R +A QK + Sbjct: 123 WKAPVSLQERINCPRELADIIPDYRFKLIDIGRLSGKELIKFKSDFRLVAEFIARQKEYK 182 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG--ESM 241 + + + + + + ++ + + +L + E G + + Sbjct: 183 PGKEEIKHPEELLDLLDLLAGDKRFKELKGKVKNIRKEGRIINMCELLDEIENRGIEKGI 242 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + EKGIEKG +G + + A++ +S + + + L EI+++ Sbjct: 243 EQGIEQGIEKGIEKGRSEGEETATLRIAKKFKDSNVSIDIIMKATGLTKEEIEEL 297 >UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillus coagulans 36D1 RepID=C1PBU4_BACCO Length = 329 Score = 88.1 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 51/324 (15%), Positives = 117/324 (36%), Gaps = 51/324 (15%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEE--SLKGHSTDVLYS 66 HD +FK+ + + ++F++ P +L D + S + + D+L Sbjct: 14 HDRLFKELIQN--FFQEFMDAFFP-DLSADLDYRRVRFLSQEQFTDFPGGEQKRVDILAE 70 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++G + + +E QS +K RM RY + RH + V+PI + Sbjct: 71 TKVKGKDTVILIHVEPQSYYEKPFPERMFRYYMMISLRHRK-------PVLPIAVFS-YE 122 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK--HIRQ 184 + F++ E + + + + ++ LL K + Sbjct: 123 EKTETPDTYTFAFHNIE----ILRFHYLSIHLMKQNWRNYIRSNNPVAAALLSKMGYTET 178 Query: 185 RDLMLLLE--QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 + + LE +++ ++ L +Y L+ E+A++ + E ++ Sbjct: 179 ERVQVKLEFLRMLARMELDPAKMRLLHGFFDYYLKLNEKEEAEVMENIKMLDPDEAEQVL 238 Query: 243 TLAQWFEEKG----------------IEKGIQQGRQEVSQE--------------FAQRL 272 L + ++G +EKG ++G + ++ A ++ Sbjct: 239 KLPNSYFDRGYKKGKEEGREEGIEIGVEKGREEGIEIGVEKGREEERKEMLQTIPIAIKM 298 Query: 273 LSKGMSREDVAEMANLPLAEIDKV 296 L +G + + E L E++K+ Sbjct: 299 LQEGRELQLIVEKTGLSQREVEKI 322 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 87.7 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 44/189 (23%), Positives = 73/189 (38%), Gaps = 47/189 (24%) Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 P + PE AR P+ +RQRDL+ Sbjct: 9 PHDAVFKTFLRHPETARDFMEIHLPV-------------------------SLRQRDLLG 43 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNY-MLQRGHTEQAD-LFYGVLRDRETGGESMMTLAQW 247 L+E++ +L+ G + QL A+ NY M+Q GHT + V+ E +MTL + Sbjct: 44 LVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKERLMTLIER 103 Query: 248 FEE--------------------KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 +G+EKG+++G+ + A+++L+ G+ RE V Sbjct: 104 IRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGLDRETVQRFTG 163 Query: 288 LPLAEIDKV 296 L E+ V Sbjct: 164 LTAEELQDV 172 Score = 53.8 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 25/36 (69%), Positives = 30/36 (83%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELREL 38 + ++TPHDAVFK FL H ETARDF+EIHLPV LR+ Sbjct: 4 STTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQR 39 >UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UAM6_YERAL Length = 105 Score = 87.3 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 47/101 (46%), Gaps = 13/101 (12%) Query: 209 VAMQNYMLQRGHTEQ-ADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQ- 266 ++ NYMLQ G + + R E +MT+AQ +++G ++G Q+GR E Q Sbjct: 3 KSLINYMLQDGDAATPKTFIWELARRSPQHKELLMTIAQKLKQEGRQEGRQEGRVEGIQI 62 Query: 267 -----------EFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 E A+ +L G+ R V +M L ++ ++ Sbjct: 63 GEANGLKKGKLEVARTMLVNGLDRATVMKMTGLSDKDLTQI 103 >UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW Length = 286 Score = 85.8 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 53/295 (17%), Positives = 101/295 (34%), Gaps = 25/295 (8%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVLYS 66 HD +FK+ L L + E D L S + + + D+L Sbjct: 7 HDRLFKELLTTFFEEFILLFF---PHVHEHIDFRHLSFLSEELFTDVTAGEKYRVDLLIQ 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++G G + + +E+QS RM Y ++ ++PI + Sbjct: 64 TKLKGEAGIIIIHVENQSYMQSSFPERMFIYFSRLFEKYRTN-------ILPIAIF---- 112 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y F V F V++ ++ LL K + Sbjct: 113 -SYDFIRDEPSSFTLQFPFLHVLQFQFLAVELRKQNWRHYIRSENPIATALLSKMGYNEN 171 Query: 187 LMLLLEQ----LVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 + L++ ++ + L+ ++ E+ V + GE +M Sbjct: 172 ERVELKKQFFRMLIRQNIDEAKRRLLIGFFETYVKLTEQEEEQFQNEVKKMGGKEGEQVM 231 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 L +E+KG G +E +E Q+++ KGMS +A + + E+ KV+ Sbjct: 232 ELIISYEQKGKIA----GAKEKEREMIQKMVEKGMSITQIAHLLDRSEEEVRKVV 282 >UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KKN3_9FIRM Length = 297 Score = 84.6 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 58/307 (18%), Positives = 104/307 (33%), Gaps = 46/307 (14%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELREL-CDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D++F+ E ++ + TL + I+ + S + Sbjct: 11 DSLFRHIFNDKRRLASLYESLTGRKVAPRDIAITTLRGVFFNDIKNDI---------SFR 61 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE-ADHDKLPLVVPI------LF 121 + +++EHQS + M RM+ Y R L+ + ++PI +F Sbjct: 62 IGDRDI---ILMEHQSSWNPNMPLRMLWYVAKLYSRQLDSQEVVYRSRLIPIPAPEFYVF 118 Query: 122 YQGEA-----TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 Y G LS + + ELA YN + + E Sbjct: 119 YNGSQDEPDYQKLRLSDAFAHATDTLELAVDCYNINY-----------STQNKLLDSCYE 167 Query: 177 LLQKHIRQRDLMLLLE---QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY---GV 230 L I + + ++ +L T I + T M +Y Q+ +E D+ Sbjct: 168 LRCYSIFVQKVREGIQNGLELRTAIRQAITYCKTHDLMGDY-FQKNESEVFDMVNFKWDQ 226 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 R E E + + + +G +G G + + A LL KG+ + E NL L Sbjct: 227 KRALEVAKEDGVAIGE---ARGEARGKLLGERNAMMKVALSLLKKGLPVGVITESTNLSL 283 Query: 291 AEIDKVI 297 E+ K+ Sbjct: 284 EEVRKIA 290 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 82.7 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 107/292 (36%), Gaps = 16/292 (5%) Query: 9 HDAVFKQFLMH-AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HD FK + + D+ ++ D+ L E D+ Sbjct: 4 HDHNFKNLFLDFPKETLDWFFPQAGQSWGKVLDVEFLRQEPKKHNLSD-SSLELDMPILF 62 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 + L ++E Q K ++++RY+ M H +A LV+P + + Sbjct: 63 NFENQQLLLW-LVEFQEDKSKFSIYKLLRYTTDLMETHPDA------LVIPTVLFTDRKK 115 Query: 128 PYPLS-MCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 + Y L D+ D + + + IL + ++ Sbjct: 116 WSKAVLQQLHAQLHDRMFLHFEYVFH-KLFDLNAR-DYYNVDNPVVKILLPKMHYKKEDR 173 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQ 246 + ++ + L S +++ E + + + E+ M LAQ Sbjct: 174 IEVIRQAYAGLFQLV--SSGLFDKYVDFIDTYAEIEDQEQL-NLYNEIVQHKETAM-LAQ 229 Query: 247 WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + E+G+++G ++ R++ F ++ +G+S +A++ +L ++ ++K++N Sbjct: 230 YIRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDLDVSMVNKILN 281 >UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RGS0_MOOTA Length = 310 Score = 82.7 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 60/320 (18%), Positives = 117/320 (36%), Gaps = 44/320 (13%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M S +D K ++ + + L +E ++ Sbjct: 1 MQPKSGNRYDITIKDLFADET--QELINYF--GHFEARVTGD-LKIEF-----PQVETRV 50 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +D++ + Q P +H +E QS+ D +M +RM+RY++ V I+ Sbjct: 51 SDLVMKAESQQGPLAIH--LEFQSRNDDEMPYRMLRYALEI-------HKTYHLPVYQIV 101 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G+ S + + + + + + L+D+ +E+ +L LL Sbjct: 102 IYFGQWQMNMTSQLEYRLGD-----QNLLDYRYHLIDVGNITYEELKNSPHQRLLSLLPV 156 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRG-------HTEQADLFYGVLRD 233 R++ E L ++ S L + +L+ + DL + + Sbjct: 157 VDREKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGLVFDKKAIDLVFREVEQ 216 Query: 234 RETGGES----------MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM---SRE 280 + ES M + EKG+EKGI++G+QE + RLL K RE Sbjct: 217 MLSIEESAGYQRIFEKGMEKGIEKGMEKGMEKGIEKGQQESLLDVTIRLLRKKFRKIPRE 276 Query: 281 DVAEMANLPLAEIDKVINLI 300 +A + + + ++I+ I Sbjct: 277 YLARIKEQDVYVLQQIIDSI 296 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 81.9 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 57/300 (19%), Positives = 111/300 (37%), Gaps = 26/300 (8%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D + K+ E F+ L +++ + L L F E+ L S DV ++ Sbjct: 13 DIMAKKIFSLPEVTVAFIRDILDLDVVDAQILEGTQLHKKDFDEDELFSTSVDV--RAKL 70 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD---------HDKLPLVVPIL 120 V+IE Q + R Y + +++ ++++ V I Sbjct: 71 NDGTE---VIIEIQVRKQHYFLNRFHYYLANQLVENVQQLRQQGQTHKMYEQMEPVYGIA 127 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL--L 178 + P S + + +Y+ D + +IA LEL Sbjct: 128 ILEKTLLPDEESPINTYWMANSRTGKPLYSF---------YKDGKQQNLLQIAFLELDKY 178 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT-EQADLFYGVLRDRETG 237 K RD + + + + + T E+ + +R +E Sbjct: 179 NKDKHIRDEGRQWLEFFGNLPFSKAPSRAVTHADSLLDSSSWTQEEKAMIDERIRIQENY 238 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +M T E+G+E+G+++GR E E +++L+KG+S E V+++ L L E+D ++ Sbjct: 239 DMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSLEELDGLL 298 >UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Treponema vincentii ATCC 35580 RepID=C8PTN1_9SPIO Length = 303 Score = 81.2 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 53/315 (16%), Positives = 109/315 (34%), Gaps = 28/315 (8%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPV----ELRELCDLNTLHLESGSFIEESL 56 M + D+VF E A++ L+ C + + L++ ++ Sbjct: 1 MSTANRKYKDSVFVDLFSEDEKAKENFLSLYNALHGTNLQLSCPVENIKLDNVMYM---- 56 Query: 57 KGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKL--- 113 DV S + + V+ EHQS ++ M R ++Y + + L Sbjct: 57 -NIVNDV--SCLVDNK---IIVLAEHQSTINENMPLRFLQYIARLYEKLQKPTDRYLRTL 110 Query: 114 ---PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH- 169 P +FY G ++ + + R + + +I + E++ Sbjct: 111 SKIPTPEFYVFYNGLNDYPETTVLKLSDAFITKPERIPLDLEVKVYNINKSKGAEVLSRC 170 Query: 170 RRIAILELLQKHIRQR---DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL 226 + + L + +R + D V + E L ++ + D Sbjct: 171 KTLDEYSLFIEEVRLQTQLDPENGFTNAVKICIEKGILKEYLQRKSREVINML-IAEYDY 229 Query: 227 FYGVLRDRETGGESM--MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 + RE G+ ++Q + GI +G+ QG + + E A+ + +A+ Sbjct: 230 DTDIAVQREEAGKIAFAKGISQGLSQ-GISQGLSQGSHQKALETARLMKQANCEIPFIAK 288 Query: 285 MANLPLAEIDKVINL 299 M L AE++ + NL Sbjct: 289 MTGLTQAEVESIGNL 303 >UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobacterium RepID=Q6D6X6_ERWCT Length = 135 Score = 81.2 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 32/128 (25%), Positives = 64/128 (50%), Gaps = 13/128 (10%) Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQ-ADLFYGVLRDRETGGESMM 242 D++ L + + L + Q A+ Y+ + G+T + A+ V + T E++M Sbjct: 4 HHDMLELAQDIGILFERWQIPLPQKRAILFYIARSGNTSKPAEFIEAVAQSLSTDREAIM 63 Query: 243 TLAQWFE------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 T+AQ E ++G+++G++QG + +++ A++LL GM V +M L Sbjct: 64 TIAQQLEKIGFEKGIKHGMQQGMQRGMEQGIKTSARQIARQLLLSGMEPAQVCQMTQLSA 123 Query: 291 AEIDKVIN 298 AE+ ++ N Sbjct: 124 AELAQLSN 131 >UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubacterium biforme DSM 3989 RepID=B7CC32_9FIRM Length = 301 Score = 80.4 bits (197), Expect = 7e-14, Method: Composition-based stats. Identities = 56/296 (18%), Positives = 107/296 (36%), Gaps = 11/296 (3%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D K+FL + DF + R L N + L+S DV+ + Sbjct: 6 DKTMKEFLENNAYFVDFFNAYFFDGERVLKPENCMELDSEMNDSNMDLEKHVDVI--RKY 63 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR-----HLEADHDKLPLVVPILFYQG 124 Y +IE+QS D M R Y A R +KLP+V ++FY G Sbjct: 64 NDGNLYSAFIIENQSYVDASMVVRAAAYEFVAYDRMLKKLKKNKAKEKLPMVHILVFYTG 123 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 E + + ++ L++IT + L + + I Sbjct: 124 EKLWNAANKLSQLVEVDERFESYFHDYQMNLIEITGNT-SYNFNEEDVYNLFYICRSIYD 182 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 + + L+ + + ++ E+ ++ R +S Sbjct: 183 QSIYEEKSNGFGLVKSSVLKVVKTLTDVEWLDLEELEEKEEIEMCEAEKRWLEVKSKEWE 242 Query: 245 AQWF---EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 A+ E+GIE+GI+QG ++ E ++++ KG + +A + ++ I+K++ Sbjct: 243 AKGIKKGIEQGIEQGIEQGSEKKELEMYRKMMDKGFGIKAIASIFSVSEESIEKLL 298 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 80.4 bits (197), Expect = 7e-14, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 109/298 (36%), Gaps = 23/298 (7%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK + FL L E R++ D+ L E I+ + T +++ + Sbjct: 10 DVGFKAVFQDKQVTIKFLNAALAGE-RQIKDITYLDKE----IKPETVENRT-IIFDLLC 63 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD---KLPLVVPILFYQGEA 126 + G ++E Q+ P R Y + R + +L + + F + Sbjct: 64 EDVSG-AKFILEMQNCPQHYFFNRGFYYLCRMVARQGQIGKQWQYRLLPIYGVYFLNFKL 122 Query: 127 TPYPLSMCWFDMFYSP------ELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 + + E+ + FPL ++ + K Sbjct: 123 PEFTDFRTDVVLANERTGKVFNEIKMKQIYISFPLFSLS-----KEECKSSFERWIYTLK 177 Query: 181 HIRQRDLMLLLEQLVTLIDE-GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 ++ + E+ T + + + L + + + D + + + G E Sbjct: 178 NMNLFEQSPFKEEQETFLRLLDVANVNSLSEKERAIYEENLKNYRDWYATIDYAQTEGIE 237 Query: 240 SMMTLA-QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 M Q +KGIEKGI++GRQE + A+++ +G+ E +A+ + L + +I+++ Sbjct: 238 KGMQEGMQKGMQKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIERL 295 >UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID=Q73P51_TREDE Length = 292 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 57/313 (18%), Positives = 124/313 (39%), Gaps = 38/313 (12%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARD-FLEIHLPVELREL---CDLNTLHLESGSFIEESL 56 M + D+VF E A++ FL ++ + L C + + L++ ++ Sbjct: 1 MSTSNRKYKDSVFVDLFSEDERAKENFLSLYNALHGTNLPMSCPVENIRLDNVMYM---- 56 Query: 57 KGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH-------RHLEAD 109 DV S + G + ++ EHQS ++ M R + Y R+L+ Sbjct: 57 -NIINDV--SCLVDGK---IIILAEHQSTINENMPLRFLEYIARLYEKLQAPTDRYLKKL 110 Query: 110 HDKLPLVVPILFYQGEAT-PYPLSMCWFDMF-YSPELARRVYNSPFPLVDITITPDDEIM 167 K+P +FY G+ P ++ D F P+ A P L + + + Sbjct: 111 S-KIPTPEFYVFYNGKEDYPETTALKLSDAFITKPKQA------PLEL-TVQVLNINTDK 162 Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 ++ + + L+++ + + QL + G+T+ ++ + + + + ++ Sbjct: 163 ANKILTACKPLEEYSLFVEEVRKQTQLDP--ENGFTNAIKICIEKGILKEYLMRKSREVI 220 Query: 228 YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQG----RQEVSQEFAQRLLSKGMSREDVA 283 ++ + + + + + GIE+GI+QG + + E A+ G + +A Sbjct: 221 NMLVAEYDYDTDIAVQREESLR-IGIEQGIRQGFSDGAYQKAIEIAKAFKQFGFDIDKIA 279 Query: 284 EMANLPLAEIDKV 296 E L EI+K+ Sbjct: 280 EGTGLSREEIEKL 292 >UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XJH0_CALS8 Length = 134 Score = 77.3 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 55/135 (40%), Gaps = 1/135 (0%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M+ + +A+F+ ++ L+ + +++ + + E++ + Sbjct: 1 MNNNFSQDENAIFRLIFSDSKEILFLLKNVAKFSWVDRIQKDSIEVILVDYDNENVLKYK 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 DV+ V ++ N Y+ V + P+ M ++ + + ++ DK+P ++P++ Sbjct: 61 PDVIAKVTIENNTAYIFVFFVSKV-PECGMRNIILNNMLLFWEKKIKEGTDKIPPIIPLV 119 Query: 121 FYQGEATPYPLSMCW 135 Y G+ + Sbjct: 120 LYNGKEIWTEPREIY 134 >UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RQ02_FIBSS Length = 360 Score = 76.6 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 56/301 (18%), Positives = 108/301 (35%), Gaps = 24/301 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLE-----IHLPVELRELCDLNTLHLESGSFIE-ES 55 + + HDA F+ AR LE H +L+TL S+ E + Sbjct: 5 NKVTKRKHDAYFRWLFADTTHARCLLELAGKINHEIDAFLTQINLDTLMRIPDSYSEVDD 64 Query: 56 LKGHSTDVLYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLP 114 D+ + V + G P + +++EH+S D + ++ +Y + M + Sbjct: 65 TGE--ADLAFRVNVSTGAPILVGILLEHKSGRDPIIFDQISKYIHSVMKIQDKNRIFSGI 122 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM--QHRRI 172 + I+FY G PL + Y +V V++ PD + + ++ Sbjct: 123 PTMAIIFYNGRDNWNPLK--ILEKSYPDYFRGKVLPFQCTFVNMADIPDSDCLACENTAT 180 Query: 173 AILELLQKHIRQRD-LMLLLEQLVTLIDE---GYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + + KH +D L+ LL Q +D+ S M G +L Sbjct: 181 GMGIIALKHAFNKDKLLELLPQFCKFLDKMPRNEASCLLEKTSIYLMEYLGKDFLKELNM 240 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQ-----EVSQEFAQRLLSKGMSREDVA 283 + + G +++ +F ++ E+ Q + E Q+ + L R+ + Sbjct: 241 AFVSIGQKYG--FVSIGDYFRQQLAEERQQMTEERLQMAEERQQITEERLQMAEERQQIT 298 Query: 284 E 284 E Sbjct: 299 E 299 >UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3A9D Length = 324 Score = 76.2 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 108/317 (34%), Gaps = 49/317 (15%) Query: 10 DAVFKQFLMHAETARDFL-------EIHLPVELRELCDLNTLHLESGSF-IEESLKGHST 61 D + K + + D + + + E D+ T +E + + + Sbjct: 20 DILLKDYFT-PDIFADAINAILYDGKSVVTPERMRTIDIETQRVEDENGNVTADTRLRD- 77 Query: 62 DVLYSVQMQGNPGYLHVV--IEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD---KLPLV 116 S ++ ++ + IEHQS D M R+M Y + R ++++ ++ + Sbjct: 78 ----SAKVVEVDDAIYCLFAIEHQSVEDYTMPLRIMEYDVREYLRQVKSNKGVQVRIKPI 133 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPE--------LARRVYNSPFPLVDITITPDD--EI 166 + I+ Y +A + + DMF L + + L + ++ E Sbjct: 134 ITIVMYW-KADKWNQPVSVKDMFDKNTVRWLEYNGLGGYIQDYRMHLFEPGTVKEEDLEK 192 Query: 167 MQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL 226 + ++ ++ L E+ + + + + Y+ G E+ D+ Sbjct: 193 FKTELKDVIAYVKYSKSTEALKDYNEKYKPDLTKSTVTLINELTNSKYVFIEG-KERLDM 251 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV---SQEFAQRLLSKGMSREDVA 283 + E+G KG + +E + L +GMS ++A Sbjct: 252 CEAF---------------EGLIEEGRAKGKAEELKEKYKSWVTLSNNLKKRGMSNPEIA 296 Query: 284 EMANLPLAEIDKVINLI 300 + +P E+ K +I Sbjct: 297 SLLGVPETELQKAFKMI 313 >UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=Q3C0L0_SODGL Length = 131 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 32/59 (54%), Positives = 40/59 (67%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + + HD VFK+FL ARDFLEIHLP LR+ CD +TL + SGSFIE+ LKG + Sbjct: 2 TSTLSHHDHVFKKFLGDIAVARDFLEIHLPPHLRKHCDFSTLAMASGSFIEDDLKGQCS 60 >UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWJ8_9FIRM Length = 292 Score = 75.4 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 57/301 (18%), Positives = 113/301 (37%), Gaps = 39/301 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELR-ELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D++F + +D ++ + L TL G+F + DV + Sbjct: 10 DSLFCDIFRRKDYLQDVYRGLFGRDVSLQEIQLMTLQ---GTFFNDE----KNDVSF--- 59 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD------KLPLVVPILFY 122 + G V++EHQS ++ M RM Y + + D +LP +FY Sbjct: 60 LAGKRQI--VLMEHQSTLNENMPLRMFWYMAKLYRKQVPKDAPYRTRRLRLPAPCFYVFY 117 Query: 123 QG-EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR-IAILELLQK 180 G + P M + F + + + +I +++ R + + Sbjct: 118 NGLDPAPDEWEMRLSEAFEGECSSLELCVKAY---NINEMSGSRLLEKSRALKGYSVFVA 174 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF----YGVLRDRET 236 IR++ + L + + + + Y L+R E D+ L R Sbjct: 175 QIRRKTAAGVC--LEEAVKQAIRYCIEQDLLAEYFLEREMEEVFDMVSFKWDPELAKRVQ 232 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM-SREDVAEMANLPLAEIDK 295 E+ +E G+EKG+++G ++ E +L K S +D++E++ PL +I+ Sbjct: 233 LQEA--------QEIGMEKGMEKGMEKGVTEIVLNMLKKKKWSLQDISEVSQWPLDKIES 284 Query: 296 V 296 + Sbjct: 285 L 285 >UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacteroidales RepID=A6LFA9_PARD8 Length = 305 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 52/307 (16%), Positives = 118/307 (38%), Gaps = 31/307 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK E +D L L L + L + + + E+ +G V++ + Sbjct: 10 DFGFKHIFG-REMDKDILIEFLNDLLEGEYTIMDLRIMNNERLPETEQGRK--VIFDIHC 66 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRY-SIAAMHRHLEADHDK-LPLVVPILFYQ---- 123 + + G ++IE Q++ R + Y S + + + ++ D L V + F Sbjct: 67 ETDKGE-RIIIEMQNREQPHFKDRALYYLSHSVVEQGIKGTWDYELAAVYGVFFLNFTLD 125 Query: 124 ---GEATPYPLSMCWFDMFYSPELARRVYNSPF--PLVDITITPDDEIMQHRRIAILELL 178 G D+ + +V+N F +++ +E + Sbjct: 126 EENGPDKNGKEGKFRRDIILADRENGQVFNPKFRQIYIELPRFNKEEEECETDFERWIYV 185 Query: 179 QKHIR---------QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG 229 KH+ ++ + LE++ ++ + +Q A + L + Sbjct: 186 LKHMDTLDRMPFKARKAIFERLERIGSMANLTPKQRAQYEAEWK----MYNDYYNTLDFA 241 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 V + + G E M + +KG+++G+Q+G Q+ + A+ + ++G++ + + L Sbjct: 242 VEKGMKKGMEEGM---EKGLQKGLQEGLQEGLQKGKESTARNMKAEGITPLIIQKCTGLS 298 Query: 290 LAEIDKV 296 L EI+++ Sbjct: 299 LEEIERL 305 >UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C351D8 Length = 313 Score = 74.2 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 100/303 (33%), Gaps = 41/303 (13%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS--TDVLYSV 67 D +F+ + D + DL L ++ +DVL Sbjct: 12 DRLFRLAFQEKKDLLDLYNAVSGRQYTNPDDLIITTLADAIYLGMKNDISFLVSDVL--- 68 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------KLPLVVPI 119 + EHQS + M R + Y +++ + +LP+ I Sbjct: 69 ----------NLYEHQSSFNPNMPVRGLNYFADTYREYIDRNGFDIYGEKLIRLPMPQYI 118 Query: 120 LFYQG-EATPYPLSMCWFDMF--YSPELARRVYNSPFPLVDITITPDDEIMQH--RRIAI 174 +FY G + P + + D F +PE + +++I + E+M R Sbjct: 119 VFYNGTKEEPDRIELRLSDAFLCQNPEE-KGCLECRATMININYGHNKELMDRCRRLKDY 177 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 + + + L++ V L + +A++ +L + Sbjct: 178 AVFVSRIRNNEKRGMALDEAVKQAVHSCIEEGILADILK-------KNRAEVCNLILYEY 230 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 + + + E ++ G ++GR + + KG++ +A+M L + Sbjct: 231 DEQRQLAI-----AREGAMKAGREEGRAAEQVTIIRNMAGKGLNPSAIADMLGLEEGYVK 285 Query: 295 KVI 297 KV+ Sbjct: 286 KVL 288 >UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF6_YERPN Length = 105 Score = 73.9 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 25/72 (34%), Positives = 43/72 (59%), Gaps = 1/72 (1%) Query: 201 GYTSGSQLVAMQNYMLQRGHTEQADLF-YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQ 259 Y S Q++A+ +Y+LQ G + ++ F + + G+++MT+AQ E+KGIEKGI++ Sbjct: 3 DYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGIEK 62 Query: 260 GRQEVSQEFAQR 271 G Q Q+ Sbjct: 63 GIQLGEQKGKLE 74 >UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E7F Length = 324 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 52/302 (17%), Positives = 101/302 (33%), Gaps = 39/302 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 DA+F+ E + L + LE+ ++ D+ + + M Sbjct: 28 DALFRMIFNDKEALLSLYNAVGNTSYTDASQLQIVTLENAVYM-----NIKNDLAFLLNM 82 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL------EADHDKLPLVVPILFYQ 123 + N + EHQS + M R + Y L + KLP ++F+ Sbjct: 83 ELN------LYEHQSTWNPNMPLRDLFYVSREYEMLLANQSIYSSSLLKLPAPRFVVFFN 136 Query: 124 G-----EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI--AILE 176 G E LS + P+L +V +++I +DE+M R+ Sbjct: 137 GSYDMGEQCVLKLSDAYEKKVEDPDLELKV-----TVLNINAGWNDELMNTCRLLKEYSL 191 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 + + M L E + +DE G + Y + + ++ Sbjct: 192 YVARVRAYAKEMELAEAVSRAVDECIKEGILRDFLMKYRAEAISVSIFEYDEEREKELLR 251 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS-REDVAEMANLPLAEIDK 295 E E G ++G+ QGR+E + + +++G+S A + Sbjct: 252 KTEY---------EFGRQEGLSQGREEGLSQGIKEGMAQGVSAMIRHCRKAGASREDTLS 302 Query: 296 VI 297 ++ Sbjct: 303 IL 304 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 71.9 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 56/303 (18%), Positives = 101/303 (33%), Gaps = 21/303 (6%) Query: 10 DAVFKQFLM-HAETARDFLEIHLPVELRELCDLNTLHLESGSFIE--ESLKGHSTDVLYS 66 D FK+ H + L LP+ E + + E K DVL Sbjct: 11 DLTFKKIFGNHPKRLISLLNALLPLSDEEQI--REIKYLPTELVPQLEGGKNTIVDVL-C 67 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQG 124 ++G +E Q + R++ + + L V + Sbjct: 68 TDVRGRK----FCVEMQMEWSDAFQQRVLFNASKLYVSQAKKGGKYSELQPVYSLNLIND 123 Query: 125 E-ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 A P + + + + + + + F +++ I R + + I Sbjct: 124 IFAHDTPDFIHNYRIVHDKDSNKVIEGLHFTFIELPKFTPHSIADKRMMVLWLRFLTEIN 183 Query: 184 Q--RDLMLLL---EQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 +D+ L ++ ++E SG ++ Y + Sbjct: 184 SNTKDIPADLLNDPEIGKAVEELEISGFSDAELRAYDKFWDSVSVERTLIDDSYQKGKEK 243 Query: 239 ESMMTLAQWFE---EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 LA+ E EKG+EKG +G+ E + E AQRLL+ G+ E V++ LPL I Sbjct: 244 GKQEGLAEGMEKGMEKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQVSKATQLPLEIIKN 303 Query: 296 VIN 298 + N Sbjct: 304 LSN 306 >UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3R2_ABIDE Length = 336 Score = 71.9 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 50/303 (16%), Positives = 101/303 (33%), Gaps = 49/303 (16%) Query: 10 DAVFKQFLMHAETARDFLEI-HLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D+VF + R + H + D + LE+ + D+ ++V+ Sbjct: 67 DSVFTLLFSDIKNIRKLYQSLHDDSDSYSDEDFKIITLENV-----FINAPYNDLGFTVK 121 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------KLPLVVPIL 120 + + ++ E QS + M R++ Y + H ++ +LP I+ Sbjct: 122 NK-----VIILAEAQSTFNPNMGLRLLIYIAQSYHDYISEYKFNIFSEKLIRLPNPEFIV 176 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G + D F S N + I E + + E+ + Sbjct: 177 IYSGSKKTDITEIRLSDCFESGT----APNIELVVKVIGGNNVKEGIIQEYLKFCEMYDE 232 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 +R V +E S +++ + L + E Sbjct: 233 KVRS----------VKPSEEKAYSLKKVIK---------DCIDNGILKDFLTLHQKEVED 273 Query: 241 MMTLA-------QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 MM ++ + + KGI+QG+ + S FA+ +L S + + E+ L +I Sbjct: 274 MMMTVIPPEQALEYIKLEEYNKGIEQGKLDTSLNFARNMLKNNYSIDSIIEITGLSREQI 333 Query: 294 DKV 296 ++ Sbjct: 334 KRL 336 >UniRef50_B3CQQ1 Putative transposase n=3 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CQQ1_ORITI Length = 153 Score = 71.5 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 35/151 (23%), Positives = 59/151 (39%), Gaps = 24/151 (15%) Query: 174 ILELLQKHIRQRDLMLLLEQLVTLI--------DEGYTSGSQLVAMQNYMLQRGHTEQAD 225 +LE + KHI QRD++ L E+ + ++GY + + L + + Sbjct: 1 MLEYMLKHIHQRDMLKLWEEFLIKFKHVLILDKEKGYIYLRSFLWYTDTKLLESQQPELE 60 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIE----------------KGIQQGRQEVSQEFA 269 E M T+A + ++GIE +GI+ G + Q A Sbjct: 61 QVLAKYLSEEEKSNIMRTIAAKYIDEGIEIGETKGIAKGIAKGIAEGIEIGEVKAKQGLA 120 Query: 270 QRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 + LL G S E ++E L E+ + N I Sbjct: 121 RNLLKAGFSVEFISENTGLSKEEVINLKNNI 151 >UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UZR7_CLOBO Length = 334 Score = 71.5 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 60/328 (18%), Positives = 107/328 (32%), Gaps = 43/328 (13%) Query: 10 DAVFKQFLMH-AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D + K + + L D L + + FI ++ DV + V Sbjct: 11 DEILKFLFSTSKKVLVNLLNGIFEENFSS--DEVELSVSNNEFIMDTFDTLRGDVFFEVL 68 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRY-----------------------SIAAMHRH 105 + +E Q+K D M RM Y + + R+ Sbjct: 69 NNEVSNKVTYHLEFQTKNDSTMIIRMFEYGFRKGKEQTGNRDDFKTIYFPKQKVIFIERN 128 Query: 106 LEADHD-KLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDD 164 D KL +V+P ++ Y + + + + EL PL + D Sbjct: 129 NNIKEDIKLKIVLP----DEQSFIYSVPVMKYWEYTDNELIENKMYPLLPLQLFNLRKDL 184 Query: 165 EIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ------------ 212 E + H + + + + L D+ G M Sbjct: 185 EYARRSNNIDKINDLSHEAKEIALKIANESKKLFDDNEIIGEDFHKMLLAIQNLIEYLNR 244 Query: 213 NYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 NY E+ L D E + + EKGIEKG+++G ++ + E A Sbjct: 245 NYFNDDRLEEEVSTMTKTLYDPEVEKRGIEKGIEKGIEKGIEKGMEKGIEKKAIEDAIGF 304 Query: 273 LSKGMSREDVAEMANLPLAEIDKVINLI 300 L G+S E V++ LP+ ++ ++ + I Sbjct: 305 LRLGVSEEIVSKGTGLPIEKVRELKDKI 332 >UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LJP2_9FIRM Length = 326 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 88/235 (37%), Gaps = 23/235 (9%) Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMR-----YSI--AAMHRHLEADHDKLPLVV 117 ++ ++ G + V +++Q+ D M R+M Y + +KL V+ Sbjct: 78 FNKKIVAPDGEIIVALQNQTTVDFGMPLRVMTEDALEYDVQRRMCKDEKLHKGEKLAPVI 137 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELAR--RVYNSPF-PLVDITITPDDEIMQHRRIAI 174 I+FY G + DM PE + + Y P+ L+ D + Sbjct: 138 TIVFYYGAQI-WSGPTDLADMVKIPEEFKWLKKYIRPYAMLLITPENVDAAWFSGGWREV 196 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ-----NYMLQRGHTEQADLFYG 229 E+LQ+ ++++ L++ ++ ++ ++L+ Y + E+A + Sbjct: 197 FEILQRRNDEKEMQRYLQKKRSVYEKLPEDTNRLIFALTGHLDYYNALKRKGERAVMCKA 256 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 ++G E + GI +GI QG + +G + E + + Sbjct: 257 FEDHYKSGVEEGKNI-------GIHQGISQGLGRGIGAMIRENQEEGKTTESIID 304 >UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7BWQ7_9GAMM Length = 290 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 58/308 (18%), Positives = 116/308 (37%), Gaps = 39/308 (12%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIE--ESLKGHSTD 62 + HD++FK + +F + P + FI E+LK Sbjct: 3 NPKSHDSLFKWLIT--AFTTEFFGHYFPD-----IRIGEYTFIDKEFISKYENLKESLKG 55 Query: 63 VLY---SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 L+ V++ G + + IEHQS+ + ++ R+ YS A + V I Sbjct: 56 DLFLGMEVEIDGLLREIIIQIEHQSERE-DVSERVYEYSCYAWLLKKK-------PVWSI 107 Query: 120 LFYQGEA-TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDE----IMQHRRIAI 174 + Y EA P++ ++ F S + + D+ ++ I +H + Sbjct: 108 VIYTDEAVWRKPVTEQFWYAFDSQK------GKQYHHFDVIKVKAEKSSDLIQKHSLMCK 161 Query: 175 LELLQKHIRQRDLMLLLEQLVTL--IDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 L L+ RQ D L+ ++ + + + QL+ + ++ + L Sbjct: 162 LLALKADDRQTDPEKLVYEIYRAAALMKEQLTNEQLLLIDQWVSFYKKVSEKRLDKIKKE 221 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEV----SQEFAQRLLSKGMSREDVAEMANL 288 + E+ T+++ +G KG +G+ E ++ A LL G+ E + + Sbjct: 222 IKMDFIET--TISEHVYNQGWIKGEAEGKAEGEAKGRKKTAINLLKMGIDVEIIQKATGF 279 Query: 289 PLAEIDKV 296 AEI ++ Sbjct: 280 SDAEIKQM 287 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 70/338 (20%), Positives = 116/338 (34%), Gaps = 59/338 (17%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT----LHLESGSFIEESLK-GH 59 T +D ++K+ + E +F+ + +L E D L E I + K Sbjct: 15 PGTDYDGLWKKIIS--ELFEEFI-LFFAPDLYETIDFGKGIVFLEQELHKVIIKHKKGKR 71 Query: 60 STDVLYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 D + V ++ G Y+ + IE Q K D + RM Y R E + Sbjct: 72 IADKIVKVSLKNGEEKYVFIHIEIQEKQDPDFSKRMFTYFYRLFDRFQEN-------IYS 124 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 I + S + FY EL R F DI IA+L + Sbjct: 125 IAILTDLSKSNN-SEPFQYSFYGTELTYRFNTYKFNEADIPSLKKST--NPFAIAVLAGI 181 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGY-----------TSGSQLVAMQNY------------- 214 H+ +++ E L+ E + + Y Sbjct: 182 YLHLTEKNYQKRYEVKKKLLKEFILSNQNLSSNYAEALCYFIDYLLYLPGELTKQLTKEL 241 Query: 215 ----------MLQRGHTEQADLFYGVLRDRETGG------ESMMTLAQWFEEKGIEKGIQ 258 ML ++A F L+ + G + + + +E+GIE GI+ Sbjct: 242 FIHIEKEANHMLYSEELKEAPTFAEYLKTVKEEGIEIGIEKGIEKGIEKGKEEGIEIGIE 301 Query: 259 QGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +G+ E + A LL +G S E VA+M L + E+ K+ Sbjct: 302 KGKMEEKRNLAAELLREGFSVEKVAKMVKLSIDEVKKI 339 >UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVT6_PEDHD Length = 317 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 53/308 (17%), Positives = 113/308 (36%), Gaps = 37/308 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI------EESLKGHSTD 62 D K DFL + + E+ D N + + E G D Sbjct: 26 DEFLKGAFED--NFPDFLR-FVFSDADEILDFNREIEFLNNELFTIIPDRERKGGGRRAD 82 Query: 63 VLYSVQMQGNPGYLHVV-IEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 +L + ++ ++ +E + D+K R+ Y+ ++ + V I Sbjct: 83 LLAKLYLKDGTEKWVLLNVEIEGGNDRKFGQRVFEYNYRIRDKYKVS-------VASIAV 135 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL------ 175 + G+ T + Y EL V + + + +DE+++ L Sbjct: 136 FTGKKTQLRPTE------YLDELLGTVLSFKYTAYHVFDHQEDELLKSDNPFSLIALACQ 189 Query: 176 -ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL---FYGVL 231 LL+ I +L +V + ++++ ++ E ++ F + Sbjct: 190 KALLEGKIPDEELADERLVIVKALLRHGYDRQRIISFILFLKNFIFIESEEINRKFDQQI 249 Query: 232 RDRETGGESMMTL---AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 + M + +W ++ +G +GR+E + E A+ L +G++ E +A+ L Sbjct: 250 EELTKDKNPMGVIDVFKKWERQEAKIEGKLEGRREEALEIARELKKEGLTIEFIAKTTKL 309 Query: 289 PLAEIDKV 296 P+AEI+K+ Sbjct: 310 PIAEIEKL 317 >UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZPJ4_9SPHI Length = 302 Score = 70.8 bits (172), Expect = 5e-11, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 115/309 (37%), Gaps = 44/309 (14%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + +D +FK+ + E + +L +E+ E ++ D L Sbjct: 19 SNQYDKIFKENIG--EHFLSLSKTYLGIEVASS--------EELKDKLQTTLEREADFLR 68 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + + + +E QS ++ MA RM Y ++ KLP + + Y G Sbjct: 69 KITTPKGEQMI-IQLEFQSTDEQGMAERMQLYFAILRQKY------KLP-IRQFVIYVGS 120 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA-ILELLQKHIRQ 184 P + + + F L+D+ + ++ +L + +Q Sbjct: 121 KPPKMRTRLKPEEV----------FTGFELLDLRQVSYTQWLESDIPEEVLLAVLGDFQQ 170 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 + + +L+Q+++ I + L + +L + E G + Sbjct: 171 KKVSTVLKQIISKIVKLIDDPGTLQKYIRQLATFARL--RNLVIETEQTLEYMGLTYDIE 228 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-------------MSREDVAEMANLPLA 291 F ++G++KG Q+G ++ QE ++ +++G M E+VA +A L + Sbjct: 229 KDVFYQRGVKKGQQEGIEKGHQEGIEKGITQGVVKMVIALLKSGKMPLEEVARIAELSVI 288 Query: 292 EIDKVINLI 300 ++ K+ + I Sbjct: 289 DVQKMADQI 297 >UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXX0_9FIRM Length = 301 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 54/310 (17%), Positives = 107/310 (34%), Gaps = 39/310 (12%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTL--HLESGSFIEESLKG 58 M T D++F+ +AE LP L D T + + E G Sbjct: 1 MRNTKRTYKDSLFRDIFNNAER--------LPEIYEALLDHKTTPDDITLATIDETLFTG 52 Query: 59 HSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA----DHDKLP 114 D+ + V Q +++EHQS + M R++ Y + R+++ + +P Sbjct: 53 VKNDIGFIVGNQH-----VLLVEHQSTINANMPLRLLMYLVEIYRRYVDKDAIYKKELIP 107 Query: 115 LVVP--ILFYQGEAT-PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR 171 L P +FY G A P ++ D F + +I P+ I++ Sbjct: 108 LPAPKFYVFYNGLAEMPDIWALHLSDAF---GGHDSDLELEVKVFNINDKPNRPILEKCH 164 Query: 172 IAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL 231 L + + ++ + ++ + Q +Y+ + +QA + +L Sbjct: 165 A----LKSYSVFVAKVRECIKN-GSSLEIAVGNAVQYCVAHDYLGEYFRQKQAKEVFDML 219 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQR---------LLSKGMSREDV 282 ++ A+ EKG+ G Q+G + + + S E Sbjct: 220 NFVWNQERALEVRAEEAMEKGLRLGRQEGLSQGLSQGVLETTTASIRNVMKSMDFPIEKA 279 Query: 283 AEMANLPLAE 292 ++ +P E Sbjct: 280 MDILQIPEEE 289 >UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LE73_9FIRM Length = 326 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 58/299 (19%), Positives = 116/299 (38%), Gaps = 36/299 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI-EESLKG---------- 58 D + K++ + DF+ L R L L + + + Sbjct: 5 DIILKEYQRDSRHFCDFVNGALAQG-RPLLKRGQLVPVPTELVLVKDTEEDDENAVVKTV 63 Query: 59 -HSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMM-----RYSIAAMHRHLEADHD- 111 D+ + N G + V I++Q+ D M R+M Y + + H Sbjct: 64 QRFRDITGKAEADKNAGCIIVAIQNQTTVDYGMPLRVMLEDALEYDVQRRTKKNRKLHKG 123 Query: 112 -KLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRV--YNSPFPLVDIT-ITPDDEIM 167 KL LV+ ++FY G TP+ +M P R++ Y +P+V +T D Sbjct: 124 EKLCLVITLVFYYG-TTPWRAPSDLAEMISVPREFRQLREYIQSYPIVVVTPENVDTACF 182 Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 + ILE+L++ ++++ LE+ + ++ ++++ T+ D + Sbjct: 183 RGGWQEILEILRRQNDEKEMGRYLEKNRAIYEKLPEDTNRVIFAL--------TDHLDYY 234 Query: 228 YGVLRDRETGGESMMTLAQWFEEK-GIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 + +E G + M A K G+E+G +QG + ++ ++ +GM A + Sbjct: 235 REL---KEKGEKITMCKAFTDHYKSGVEEGKKQGMKRGRRQGIKQGKRQGMDMGIRAMI 290 >UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2V6_DESAS Length = 300 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 47/259 (18%), Positives = 100/259 (38%), Gaps = 43/259 (16%) Query: 34 ELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVV-IEHQSKPDKKMAF 92 E+ + + I +D+L+ V GY +++ IE Q +PD++M Sbjct: 22 EMVRGITVEDVQRVEKEAIA---VKRESDMLFRVS---EDGYEYLMAIEMQIRPDREMPR 75 Query: 93 RMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSP 152 R++ Y+ +H E P++V + ++ + Y F + V Sbjct: 76 RLLEYTAM---QHREFKKPVYPVIVNLTGHKKKDESY--------CFDCLDFT--VVTFN 122 Query: 153 FPLVDITITPDDEIMQHRRIAILEL--LQKH--IRQRDLMLLLEQLVTLIDEGYTSGSQL 208 + ++++ P + ++ + ++ L L +H + ++++ + DEG + L Sbjct: 123 YRQINLSDLPGQDFLRSGPVGLIPLVVLMRHDEAPEEVFAKCVQRVDEVQDEGLRADLYL 182 Query: 209 ------------VAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKG 256 + Y+ + E + LF G+ GE + +KGI+KG Sbjct: 183 GLAVLSTIKFTREIILKYI-EVNKMENSPLFDGIREKWIDQGEQIG------FQKGIQKG 235 Query: 257 IQQGRQEVSQEFAQRLLSK 275 IQQ Q+ E + + Sbjct: 236 IQQAMQQSILEALEENIGM 254 >UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C353CE Length = 319 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 46/298 (15%), Positives = 101/298 (33%), Gaps = 31/298 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F+ E E DL L+ ++ D+ + + Sbjct: 29 DRLFRMVFNRKEELLSLYNAVSHSEYTNPDDLEINTLDDVIYM-----KMKNDLAFLI-- 81 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--------HDKLPLVVPILF 121 L++ EHQS + M R Y + ++++ + LP+ +F Sbjct: 82 ---DDVLNLW-EHQSTWNPNMPVRGTFYIVEEYRKYIDQNGLNLYGSSRITLPVPQFYVF 137 Query: 122 YQG-EATPYPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIMQH--RRIAILEL 177 Y G P + + D F +++I ++E+M+ E Sbjct: 138 YNGLREEPDYIELKLSDAFSRVHSEVEPCMEFKAVMLNINRGHNEELMRQCTTLREYAEF 197 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 + + + + LE+ + + L + +A++F +L + + Sbjct: 198 VARIRDETEDGTALEEAAMNVMDSCIRDGILAEFLSVH-------RAEVFEVLLTEYDEQ 250 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 + + +G +G +G E ++E A L+ KG + ED A + + + + Sbjct: 251 RHIA-SEKEISRREGHMEGRTEGILEKAKEVAVNLIKKGFTVEDAASICGEDICRVKE 307 >UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfitobacterium hafniense RepID=Q24MW9_DESHY Length = 295 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 54/311 (17%), Positives = 107/311 (34%), Gaps = 39/311 (12%) Query: 1 MDAPSTTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK 57 M +D +FK + + FL L + +L + L E LK Sbjct: 3 MAERLNRINDYLFKYIFGRQENKDILLSFLNAVLSPAGED--ELTDITLSDRELDPEHLK 60 Query: 58 GHST--DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPL 115 + D+L N G L + IE Q +K + R + Y L++ Sbjct: 61 DKMSRLDILGVA----NDGSL-INIEVQIASEKNIDKRTLYYWAKIYQSQLQSG------ 109 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 + Y+ A +++ F + +++ + + I L Sbjct: 110 ----MLYKDLARTVTVNVLNFSFLPDAQRYHSMFSL------YEAHSGLRLNRDLEIHFL 159 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAM----------QNYMLQRGHTEQAD 225 EL + L++ + + + +AM + + E+ Sbjct: 160 ELEKWKALSTKPRTRLDKWLMYLSNTDPKELEEIAMSEPAIGKALTVEEIFLKNDKERY- 218 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 L+ + +M E+G+ +GI QG + E A +L KG+S +AE+ Sbjct: 219 LYEMREKGIRDHLSAMDNAKTEGIEQGLAQGIAQGIERGKTEIALSMLKKGLSLNMIAEI 278 Query: 286 ANLPLAEIDKV 296 +LP+ +I+++ Sbjct: 279 TDLPIEQIEEI 289 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 81/224 (36%), Gaps = 14/224 (6%) Query: 78 VVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE-ATPYPLSMCWF 136 + +E Q+ ++ R + Y + R H I+ Y G C Sbjct: 68 LHLEFQTTAGEQDLKRFLYYDARLVRRQERKVHT-------IVIYSGRIEQARERLECGS 120 Query: 137 DMFYSPELARRVYNSPFPLVDI-TITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLV 195 ++ + + YN + + +++ L L ++ L Q Sbjct: 121 ILYQVENIYMKHYNGDQEYNRLKHKIDNHQLLSETDTLKLIFLPLMKSEQKEEELAIQAA 180 Query: 196 TLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEK 255 L ++L A+ ++ +L M + QW E+G ++ Sbjct: 181 ELAKAAPDEKTKLFAIAALIVITDKIMSESNKRKLLEVL-----KMTQIEQWIREEGRQE 235 Query: 256 GIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 G +GR++ +E AQ +L+ GMS E +A+ LPL EI ++ Sbjct: 236 GELKGRRDEKRETAQTMLNLGMSPELIAKATKLPLEEILEMAKA 279 >UniRef50_A8VV66 ATPase associated with various cellular activities, AAA_3 n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VV66_9BACI Length = 214 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 26/211 (12%) Query: 102 MHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELA------RRVYNSPFPL 155 M + + L ++PIL QG + D F A + N + L Sbjct: 1 MRKEGRGNPRTL--IIPILIAQGRRRWSRSTTLMADFFSHYSEALRDDCEPFIPNFRYLL 58 Query: 156 VDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLI------DEGYTSGSQLV 209 DI ++++H + I L + + D L ++ L+ + + Q++ Sbjct: 59 YDIQEQDAADMIRHTLLKITIELMALVFEEDESKLEARMTELLTMSEIGEISDSYAEQVL 118 Query: 210 AMQNYMLQR----GHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVS 265 + Y+++ + V + G E +M A E Q+G+ + Sbjct: 119 RLLEYVMRGNRHFDQAMFETIRQNVTTEAHEGSELIMNFADQLE--------QKGKHKKE 170 Query: 266 QEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +L +G S+E + ++ +L + + Sbjct: 171 LAIFLKLTRRGESKESIMDLLDLDDKSFEAL 201 >UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2D99 Length = 308 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 93/276 (33%), Gaps = 21/276 (7%) Query: 6 TTPHDAVFKQFLMH-AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS--TD 62 T HD FK ++ A F P E + + D + ++ L D Sbjct: 1 PTSHDQNFKNLILDYPRQALQF---FAPDEAKNIDDSAVITPIRQEQLKNRLGDRFYELD 57 Query: 63 VLYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 V V+ G + ++E ++ P + R++ Y + + VVPI+ Sbjct: 58 VPLKVEWPDGRHAAMLFLLEEETDPARFSIHRLVSYCANLAE-LMGTNR-----VVPIVI 111 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI-AILELLQK 180 + S + + + + P ++ I A + L Sbjct: 112 F------LRSSPDIRRDLHLGVDGVNFLSFHYIACVLPDIPAEQYKDSTNIVARIALPTM 165 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 H + ++ ++ + +D +G + + +++ E + + ++ Sbjct: 166 HYAREQVIDVMAWALRGLDTLEANGDKRIKYLDFIDTYSQLEDNERQL-FKQRYPQEEKT 224 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 + ++ Q +GI +GI QG QE Q +G Sbjct: 225 VTSIVQRAIHQGIHQGIHQGIQEGMLMGRQEGRQEG 260 >UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultured bacterium RepID=B5U1X5_9BACT Length = 304 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 59/303 (19%), Positives = 114/303 (37%), Gaps = 37/303 (12%) Query: 10 DAVFKQFLM-HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D++F + + + FL ++ + L +TL LE + + K + D+ V Sbjct: 15 DSLFVDYFSKDRDWKQHFLSLYNALHGTNLQVADTL-LERVNIDQVLYKSYYNDIAVLV- 72 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI------LFY 122 G ++IEHQS + M R++ Y +++ +VP+ +FY Sbjct: 73 ----NGQFILMIEHQSTINPNMPLRLLEYVARIYGNLVDSKAKFSRHLVPLARPEFYVFY 128 Query: 123 QGEATPYPLS-MCWFDMFYS-PELARRVYNSPFPLVDI-TITPDDEIMQHRRIAILELLQ 179 G+ P S + D F + P A + I + P + + + Sbjct: 129 TGDQKLPPESYLHLSDSFPNQPPKADLTLELKVKVCTIKSDHPSPVVHRCPDLEQYAQFL 188 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH------TEQADLFYGVLRD 233 K + + E L I E +++Y+ +RG + D Sbjct: 189 KLVEEAKAAGQAEPLTWAIQEAVRR----NILRDYLERRGGETLSILMAEYDYATDFAVQ 244 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 +E E G+ G+++G + E A+ LLS+G++ + VA +LPL + Sbjct: 245 KEEAYED-----------GLFAGLERGAYQNKLETARSLLSEGLAPQMVARCTSLPLETV 293 Query: 294 DKV 296 ++ Sbjct: 294 QQL 296 >UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W1F3_DESAS Length = 303 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 97/261 (37%), Gaps = 36/261 (13%) Query: 55 SLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLP 114 +K D ++ ++ + +E Q+ K + RM+ Y + ++ + Sbjct: 55 EVKEKRIDFVFLLKDNS-----ILHLEFQTTIPKDILIRMVTYGSRLVEKYDQD------ 103 Query: 115 LVVPILFYQGEATP-----------YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPD 163 V ++ Y G+ Y + + F +R+Y P Sbjct: 104 -VNTVVIYSGKIESAPRLLRKGSLTYKVKNIYMKKFDGDAEYKRIYEK-----IKNKKPL 157 Query: 164 DEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLID----EGYTSGSQLVAMQNYMLQRG 219 DEI R I + + K + ++ + +L I +T G+ + N++ + Sbjct: 158 DEIDIQRLIFLPLMKSKEKSEDEMAIQAAELAKEIPNEPIRAFTIGAIVAISDNFLTEEY 217 Query: 220 HTEQADLFYGVLRDR----ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK 275 ++ ++ E E + + E+G+++G+++G +E ++ A L + Sbjct: 218 KKRLLEVLRMTQIEQWIREEGREEGLKEGLKEGREEGLKEGLKEGLREGLEKTAIAALRE 277 Query: 276 GMSREDVAEMANLPLAEIDKV 296 G E + ++ NL EI + Sbjct: 278 GFDIETIVKITNLSKEEILSL 298 >UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B8HNA0_CYAP4 Length = 315 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 79/218 (36%), Gaps = 32/218 (14%) Query: 78 VVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFD 137 + +E Q+ PD M RM+ Y + + R + +V + Y L Sbjct: 57 LHVEFQTGPDADMPLRMLDYRVRLLRRSPQK------VVRQFVIY--------LRQTTSV 102 Query: 138 MFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD-LMLLLEQLVT 196 + Y EL F +V + D ++ R + +L + L + ++L T Sbjct: 103 LVYQTELQLESTWHEFNVVRLWECSTDPLLASRGLLPFAVLGQTSNPEATLAQVAQRLST 162 Query: 197 LIDEGYTS----------GSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQ 246 + + S G L L R + LFY + + +AQ Sbjct: 163 IENRTEQSNLTAASAILAGLVLDQQTIQRLLRREIMRESLFYQGILEEGMQKGVERGIAQ 222 Query: 247 -------WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 ++G+E+G Q+GRQE QE Q + +G+ Sbjct: 223 GIQLGLEQGRQEGLEQGRQEGRQEGRQEGRQEGIQQGV 260 >UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BTR0_9GAMM Length = 309 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 52/316 (16%), Positives = 102/316 (32%), Gaps = 46/316 (14%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST--DVLYSV 67 D K L D LE L L+E D++ L + + D+L Sbjct: 11 DWALKNILRDKANF-DVLEGFLTALLQE--DISVLEILESESNQSDFAKKFNRVDILVKD 67 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPI------ 119 Q ++IE Q+ + R++ + + LE D + V+ I Sbjct: 68 SHQRK-----MIIEVQNHRETGYLERILWGTSKLIVETLELGEDYRNISKVISISIVYFD 122 Query: 120 ---------LFY-----QGEATPYPL-SMCWFDMFYSPELARRVYNSPFPLVDITITPDD 164 ++Y G P L + F L+ + D Sbjct: 123 LGLSDDNEYVYYGVANLHGLQHNQPFRFRRLMADKTFKSLQTKDIFPEFYLLRVEHFQD- 181 Query: 165 EIMQHRRIAILELLQKH--IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTE 222 + + + KH IR + + + + + + YM+ T Sbjct: 182 --IIKTDLDEWIYMLKHSTIRTDFKSKNINKAQEKLTLLQMNPQKRKDYEKYMV--DMTV 237 Query: 223 QADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 + D+ + G Q ++G ++GIQ+G ++ + + L +G+ + Sbjct: 238 ERDVLEAAQEEGIQKGR------QEGIQEGRQEGIQKGMEKKTVVIVKNALQQGLELTLI 291 Query: 283 AEMANLPLAEIDKVIN 298 + + L + EI K+ N Sbjct: 292 SSLTGLSIEEIQKIQN 307 >UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B8HL58_CYAP4 Length = 334 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 45/298 (15%), Positives = 102/298 (34%), Gaps = 32/298 (10%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTL-HLESGSFI----EESLK 57 + D+ +K+ L + ++ + P E D F + Sbjct: 2 TQPRSDKDSAWKEIL--RQYFQEAIVFFFPQT-AEQVDWTRPYEFLDKEFQQIAPDAETG 58 Query: 58 GHSTDVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D L V ++ +L + +E Q+ + + A RM Y++ R Sbjct: 59 KRYADQLVKVWLKDGAELWLLIHVEVQAARESEFAQRMFTYNLRIFDRFNH-------PA 111 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 + + E+ + FD F L+ R I+ ++ I ++ Sbjct: 112 ISLAILCDESVRWRPESFSFD-FPDTSLSFRFGRVKLLDYRERISELEQSPNPFSIVVMA 170 Query: 177 LLQKHIRQRDLML---LLEQLVTLIDEGYTSGSQLVAMQN---YMLQRGHTEQADLFYGV 230 L+ ++D L+ + EG +++ + +++ + + + + Sbjct: 171 HLRAQATRKDDQQRKFWKLTLIRRLYEGGYGRQEVINLFRFIDWVMILPEGLKEEFWQEL 230 Query: 231 LRDRETGGESMMTLAQWF---------EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 E +T + ++G ++G Q+GRQE QE A+ L+ + ++R Sbjct: 231 KIYEEERRMPFITSVEEIGFERGLEQGRQEGRQEGRQEGRQEGRQEEARALILRPLTR 288 >UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostridium difficile RepID=C9XMT1_CLODC Length = 158 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 56/132 (42%), Gaps = 10/132 (7%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M+ + +++ E+ ++ V + + L + LE SFI E KG Sbjct: 1 MNLKRSNEKREEYRRMYSDKESFLSLIQNFTSVSIAKELTLKNIELE-TSFICE-YKGKE 58 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH--------RHLEADHDK 112 D++Y V + ++V+E Q++ D ++ R+ Y +E + K Sbjct: 59 VDIIYKVFSKSGKVSHYIVLEFQTEMDTEIVPRLKSYREQIWKSFIMKKSLEEIEDKNFK 118 Query: 113 LPLVVPILFYQG 124 LP V+P++ Y G Sbjct: 119 LPKVIPVVLYSG 130 >UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0J0_9FIRM Length = 316 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 60/327 (18%), Positives = 111/327 (33%), Gaps = 60/327 (18%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH--------ST 61 DA+ K++L + E D +L + ++ L S I L Sbjct: 5 DALTKEYLSNNEIFADVF-NYLIYDGQQRILPENLIERDTSEITLPLGKRGELATIQKFR 63 Query: 62 DVLYSVQMQGNPGYLHVVI--EHQSKPDKKMAFRMMRYSI----------AAMHRHLEAD 109 D+L + L+V+ E+QS M R M Y +R + Sbjct: 64 DILKGCIAKEYKNTLYVLFGVENQSHIHYAMPVRNMLYDAINYSAQVNEKTKKYRKIRKQ 123 Query: 110 H-----------------DKLPLVVPILFYQGEATPYPLSMCWFDMF--YSPELARRVYN 150 + D+L V+ + Y G + + +MF L + + Sbjct: 124 NPNFKETTEEFLSGWHPDDRLVPVITVTIYFGN-DGWDAAKSLQEMFSETDESLKEFLPD 182 Query: 151 SPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVA 210 L+ + H L + K I + M +L L D GY++ Sbjct: 183 YKLHLISCNNISNFTKF-HTEFGRLMHILKVISDEEQMDIL-----LSDPGYSA------ 230 Query: 211 MQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQ 270 L + + F G+ +++ W + K E G ++G E + + Q Sbjct: 231 -----LSVTAAQIINTFTGLHFSIPEKEDTINMRNAWTDHK--ESGRREGFNEATTSYTQ 283 Query: 271 RLLSKGMSREDVAEMANLPLAEIDKVI 297 R+ G+ E +AE+ P+ E++K++ Sbjct: 284 RMYKAGIPLEVIAEVIEKPVTEVEKIL 310 >UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria RepID=C0QZ87_BRAHW Length = 309 Score = 63.8 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 99/263 (37%), Gaps = 29/263 (11%) Query: 54 ESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKL 113 E+LK DV + + ++IE Q + R++ Y + L+ + + + Sbjct: 56 ENLKESILDV--KAKTKDGK---KILIEIQLIGNNNFIKRILYYIAKNISSELKENENYI 110 Query: 114 PL--VVPILFYQ-----GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPD--- 163 + ++ I F G + F + + ++ + ++I + Sbjct: 111 NISQMISISFLNFNLKIGSESDIKREHKCFQLSDINNSSLKLDDFQIHFIEIKRFAEILK 170 Query: 164 ----DEIMQHRRIAILELLQKHIRQRDLMLLLEQLV---TLIDEGYTSGSQLVAMQNYML 216 D+ +++ ++ ++ +DL + +L+ ++ + + VA + M Sbjct: 171 NASIDDYNKNKLLSWIDFF----TAKDLEKSINKLIGGNDIMSKVMDKYKRFVADEKEMS 226 Query: 217 QRGHTEQA---DLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 + E E + Q ++GIE+GI+QG + + A+ L Sbjct: 227 AYNERDTFLYGQAAMLQYEREEGKKEGIEIGIQQGIKEGIEQGIEQGEKNKALSIARSLK 286 Query: 274 SKGMSREDVAEMANLPLAEIDKV 296 G+ + ++E L + EI+K+ Sbjct: 287 KSGLDDKFISENTGLTIEEIEKL 309 >UniRef50_C1J8G9 YdgA n=11 Tax=Enterobacteriaceae RepID=C1J8G9_ECOLX Length = 81 Score = 63.8 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 32/55 (58%) Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 MT+A+ +KG ++G ++G EV++E A RL G + E + E L E+ K+ Sbjct: 22 MTIAERLIQKGFDEGFKKGALEVAREAACRLRDMGWTPERIQEATGLSGEELKKL 76 >UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G7H9_ABIDE Length = 305 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 51/263 (19%), Positives = 97/263 (36%), Gaps = 23/263 (8%) Query: 36 RELCDLNTLH--LESGSFIEES--LKGHSTDVLYSVQMQGNPGYLHVV-IEHQSKPDKKM 90 +E ++L + ++ E L DV S + L VV IE+Q+K +K M Sbjct: 30 KERVKEDSLEDSKINSAYKAEDGKLHEQERDV--SKYWKEGNTNLLVVGIENQTKAEKLM 87 Query: 91 AFRMMRYSIAAMHRHLEADHDKLP-----LVVPILFYQGEATPYPLSMCWFDMFYSPELA 145 R++ Y A+ L +LP VV I+ Y G + L Sbjct: 88 PARIIGYDGASYRSQLLKSTGRLPKNKLTPVVTIVLYFGLTRWNQPKNLKGILDIPTGLE 147 Query: 146 RRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSG 205 V + + +I P++++ ++ + L+ K+ + + + Sbjct: 148 DFVSDYKINVFEIAFLPEEKV--NKFKSDFRLVAKY------FTNIRKNPYYLPADENEI 199 Query: 206 SQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEE---KGIEKGIQQGRQ 262 + A+ ++ +E E + L+Q + + +G E+G+ QG Sbjct: 200 KHVDAVLKFLSIMSGSEDIIEKLTANNGSEVKNMTGGPLSQLYYKGVSEGREEGLLQGIN 259 Query: 263 EVSQEFAQRLLSKGMSREDVAEM 285 E + SKGMS E+ E+ Sbjct: 260 ETLLKVYLNCRSKGMSVEESEEI 282 >UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z376_9FIRM Length = 316 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 48/309 (15%), Positives = 103/309 (33%), Gaps = 42/309 (13%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D VF++ + + + LE++ + + + L + DV Y Sbjct: 9 DRVFRKLFGYEKNKGNLLELYNALNDSNYTNPDDLEI-----------NTLDDVFYMNMK 57 Query: 70 QGNPGYL---HVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--------EADHDKLPLVVP 118 + + EHQS M R RYS + ++ K+P Sbjct: 58 NDVSCIIDWNMAIYEHQSTWSYNMPLRGYRYSAELYNDYIVRNNLDVFRRKLIKIPTPQY 117 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 +FY G + + + +++I ++E+M I Sbjct: 118 YVFYNGNEKRPDREVLKLSDAFMVPCKDGEFEWTATVLNINAGHNEELMSKCSIL----- 172 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY-----MLQRGHTEQADLFYGVLRD 233 R+ +++ ++ + E + +Y +L+ + +L Sbjct: 173 ------REYAIMVSKIKEFLAESLELKDAIKKAIDYCLDNNVLKEFLQDHRSEVEDMLWR 226 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVS----QEFAQRLLSKGMSREDVAEMANLP 289 E+M + F E+G + G++ GR + + L K S E++A+ Sbjct: 227 EYNEEETMAHWKEDFYEEGEQHGLEVGRANGEKIKLIKLVCKKLVKNKSIEEIADDLEED 286 Query: 290 LAEIDKVIN 298 ++ I+K+ N Sbjct: 287 VSTIEKICN 295 >UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BFV9_9PORP Length = 293 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 41/301 (13%), Positives = 101/301 (33%), Gaps = 31/301 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK E +++ L L + L + ES + ++ ++ Sbjct: 10 DRGFKHLFGQ-EDSKELLVDLLNGLFEGERVITELSFLNVEMPAESTDSRAA--VFDLKC 66 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD---KLPLVVPILFYQ--- 123 + G + + +E Q+ P R + Y + +D +L V I Sbjct: 67 KDKEGRIFI-VEVQNAPQTYFYERGLYYLCRIISDQDRRGNDWKFELYPVYGIFLLNFKS 125 Query: 124 GEATPYPLSMCWFDMF---YSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 G+ + D + R++Y +++ +E + K Sbjct: 126 GKTDKVRTDIVLADRETGKQMSDTMRQIY------LEMPFFNKEEAECETSLDYWLYTLK 179 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE----- 235 ++ + + + Q + + +L + N + + + + + RD + Sbjct: 180 YMEKLETLPFKGQ-----KQLFEKLERLAKIVN--MNKKERMEYEESLKIYRDNQGVLDY 232 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 + M + ++GIEKG+++G ++ A ++ +G+ + + L I Sbjct: 233 AIEKGYMEGVEKGLKEGIEKGLEKGMEKGIYLVAAKMKMQGIDFATITSVTGLNAETIAT 292 Query: 296 V 296 + Sbjct: 293 L 293 >UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cyanobacteria RepID=A8YL21_MICAE Length = 149 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 45/116 (38%), Gaps = 12/116 (10%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEE--SLKGHSTDVLYS 66 HD +FK+ + +F+E+ P E+ D ++ + + H +D++ Sbjct: 7 HDRLFKELIS--TFFVEFIELFFP-EVMNYLDTESITFLDKEVFTDVTEGERHKSDLVAQ 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 V+ +G + + +E Q K RM Y + + + PI+ + Sbjct: 64 VRFRGKESFFLIYVEAQESSRKWFNRRMFTYFARFHEKFVL-------PIYPIVIF 112 >UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDU3_9FIRM Length = 295 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 86/251 (34%), Gaps = 29/251 (11%) Query: 53 EESLKGHSTDVLYSVQMQGNPGYLHVV-IEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD 111 + L DV + + + + E+Q+ D M R++ Y A L D+D Sbjct: 50 DGRLHEIERDVAK--RWKNGNIRVACIGFENQTASDPDMPLRVIGYDGAEYRAQLLGDND 107 Query: 112 ---KLPLVVPILFYQGEATPYPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIM 167 + P V ++ Y G P+ + + P E V + L Sbjct: 108 TGSRYPAVT-LVLYFGHEKPWSGPLSLKERLNVPKEFEPYVNDYKINLF----------- 155 Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGY-----TSGSQLVAMQNYMLQRGHTE 222 +IA L Q + Q D ++ + V + G + + + + Sbjct: 156 ---QIAYLTREQVELFQSDFKVVADYFVQKRENGDYVPSSQDLTHVQETLQLLSIMTNDH 212 Query: 223 QADLFYGVLRDRETGGESMM-TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 + + Y D GG M + E +GIEKGI +G + A L+ K + + Sbjct: 213 RFEDAYNTSTDDRKGGPRNMCDVLDKVENRGIEKGIVKGESRGENKMAL-LVKKLLDQNR 271 Query: 282 VAEMANLPLAE 292 + ++ E Sbjct: 272 IDDVKRASEDE 282 >UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G1H3_9SPHI Length = 294 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 50/301 (16%), Positives = 101/301 (33%), Gaps = 38/301 (12%) Query: 10 DAVFKQFLMHAETARDFLEI--HLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 D + A++ D L EL +L + + L T Sbjct: 18 DDFLRFLYPDADSVFDLSRGITFLDKELEQLFPPEGNEFAPK--VVDKLAQVYT------ 69 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 G ++ + +E Q K A RM Y + ++ H ++ IL EA+ Sbjct: 70 -HDGMEEWVLIHVEVQGTCRKDFASRMFTYYYRILDKY----HKRITA-FAILT---EAS 120 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL--------- 178 P + + F + F I D ++ L +L Sbjct: 121 KKPRPNVYEEEFMGTSI-----QYRFNTYKIAEQDTDRLLASDNPFALVVLTAKAAFVGK 175 Query: 179 ---QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 K + L+ QL + E S ++ + N++ + +++ ++ E Sbjct: 176 NLNDKDESDKALLQTKIQLARELLERNMSKEKIRGLMNFLRYYVRFDNSEVNTIFEQEVE 235 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 E T+ EE + + ++G++E A+ + G+ E + + L + EI+K Sbjct: 236 KLTERSHTM--GIEELLLNRAKKEGKRESLISVAREMKKDGIPVEQIVKFTKLSIKEIEK 293 Query: 296 V 296 + Sbjct: 294 L 294 >UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8YTL4_ANASP Length = 270 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 102/269 (37%), Gaps = 23/269 (8%) Query: 31 LPVELRELCDLNTLHLESGSFIEESLKG--HSTDVLYSVQMQGNPGYLHVVIEHQSKPDK 88 P EL + + F +K D L+ ++ + ++ +E Q +PD Sbjct: 14 FPHIFFELINQSPQEASIYEFTSREVKQLAFRLDGLFLPKINDSTKPFYI-VEVQFQPDD 72 Query: 89 KMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRV 148 +R+ A + +L+ P V ++ Y ++ + ++ +R+ Sbjct: 73 DFYYRLF----AELFLYLKQYKPPYPWQV-VVIYPSRGIERQQTIHFDEILVL-NRVKRI 126 Query: 149 YNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM--LLLEQLVTLIDEGYTSGS 206 Y + + +++L+ + ++ L+ Q + + Sbjct: 127 YLDEL---------GEVAETSLGVGVVKLVIETEETAPVLARQLIAQAKQQLTDVTAKRD 177 Query: 207 QLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQ 266 + ++ ++ + + + +L E + Q E+G ++G Q+G+QE Sbjct: 178 LINLIETIIVYKLPQKSREEIEAMLGLNELKQSR---VYQEALEEGKQEGKQEGKQEAKL 234 Query: 267 EFAQRLLSKGMSREDVAEMANLPLAEIDK 295 E R++ G+S E +A++ +LPL + + Sbjct: 235 ETIPRMVQFGLSVEAIAQLLDLPLEVVQQ 263 >UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7AK04_9PORP Length = 299 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 51/304 (16%), Positives = 112/304 (36%), Gaps = 33/304 (10%) Query: 10 DAVFKQFL---MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+F + E FL L +++++ + + ++ Sbjct: 12 DYAFKRFFGTVSNKELTIGFLNSLLNKDIKDII------FHNVEMQGNNTDSRKA--VFD 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + +G+ G L + +E Q K K + R++ Y+ + + + +K L + E Sbjct: 64 LFCEGSDGELFI-VEIQKKRQKYFSDRVLYYASFVIQMQADIESEKF-----RLAKEEER 117 Query: 127 TPYPLSMC--WFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 + + + F L R + +V + + LEL + ++ Sbjct: 118 RRWNYHINKVYVVCFLDFRLDTRYTDKYRWDVVRMDRELKIPFSETLNEIYLELPKFNLN 177 Query: 184 QRD----------LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 + M ++ + L E + L +++ + + + + L Y + Sbjct: 178 FEECDTFYKKFLYTMNNIDIMGQLSKETIQNDKLLRKLKSAIELQRMSAKERLAYELSIA 237 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 E + +A F E+G EKGI +G E ++ + GM +A+ A LP E+ Sbjct: 238 AE--RDLAACMATSF-EEGEEKGIAKGITEGMRKIILNMKQAGMDLATIAKTAGLPEKEV 294 Query: 294 DKVI 297 + ++ Sbjct: 295 EALL 298 >UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Treponema vincentii ATCC 35580 RepID=C8PLW8_9SPIO Length = 264 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 53/287 (18%), Positives = 110/287 (38%), Gaps = 37/287 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F + + H R FLE+ ++ ++ L++ ++ + + ++++ DVL V+ Sbjct: 14 DFMFCKVMEHESLCRPFLEMLFSTQIEKITYLSSQNIITTNSEAKTVR---LDVL--VKD 68 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 Y IE Q + + RM Y L+ + Y+ Sbjct: 69 DIGTSY---DIEMQVGNEYNIPKRMRYYQAVLDVAFLDKGYS----------YKALNNSV 115 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 + +C FD + R VY D I+ H + L K ++ Sbjct: 116 IIFVCLFDPIGND---RAVYTFENI-----CIEDKTILLHDGTKKIILNAK-AFKKTDNQ 166 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFE 249 L + + G + + ++ + E A Y +L ++M Sbjct: 167 ELRGFLQYVTTGKATTAYTGRIEQMIQTVKQNELARREYHILPA------ALMDA----M 216 Query: 250 EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 ++G +G+ +G ++ + E A+ LL G+S E++A+ L AE++ + Sbjct: 217 DEGEARGLAKGSRQKALETAKNLLHFGLSVENIAQATGLSQAEVEAL 263 >UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0R0H3_BRAHW Length = 292 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 51/295 (17%), Positives = 96/295 (32%), Gaps = 24/295 (8%) Query: 10 DAVFKQFLMHA---ETARDFLE-IHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D + DF+ I L ++ + L + E K TDV Sbjct: 14 DYFVRYLFSDKGSEAILLDFINSIMLDSGMKTFRSVEILTPFNYKENYED-KETITDV-- 70 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQ 123 Q V+IE Q + + + R++ Y + + L+ L V+ I Sbjct: 71 KCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLKQGEKYDALTPVISINLLN 127 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 S+ M Y +R+ I I + + L K Sbjct: 128 FNLDDND-SIHSCYMIYDTN-NKRLLTDHLQ---IHIIELKKFKYNSLEYDLNCWLKFFT 182 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMT 243 +D + L+ E N++ R + D L G +M Sbjct: 183 MKDKDNKEVIMSELVKEKPIMEEVQRRYNNFIKDRLMMNEYDKRQAYL-----YGNQIML 237 Query: 244 LAQWF--EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + +G E+GI++G ++ A+ + +K M ++E+ L + +I+K+ Sbjct: 238 EEERRLGRVEGKEEGIKEGIEQEKYSLARNMKNKNMDLNLISELTGLSIEKIEKL 292 >UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococcaceae RepID=A5D5U3_PELTS Length = 292 Score = 61.5 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 77/217 (35%), Gaps = 19/217 (8%) Query: 58 GHSTDVLYSVQMQGNPGYLHV-VIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 ++D L V+ GY ++ ++E Q++PD+KMA R++ Y+ H + P++ Sbjct: 44 QRTSDALVKVR---EDGYEYLMLVEFQARPDRKMARRLLEYTAM---HHCRHEKPVYPVI 97 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 + + + Y F+ V N + +++ E++ + +L Sbjct: 98 INLTGGSLQDGWYT-----FECLDLT-----VVNFNYRQINLQDIAGRELLYRGPVGLLP 147 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 L +L++ + + + + + +LR E Sbjct: 148 LAPLMSHDEPPEKVLDKCARRLQSEVEAEDDRALLYLALAALASLKYPK--DLILRVLEV 205 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 + L E+ KG +G+ E E +L Sbjct: 206 SRLENIPLFDGIREEWEAKGRIEGKNEGKIEGMVEML 242 >UniRef50_Q1NK38 Putative uncharacterized protein n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NK38_9DELT Length = 115 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 16/56 (28%), Positives = 22/56 (39%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL 56 M S + HD +K H D L + + D TL SGSF+ + L Sbjct: 3 MKPDSKSDHDNSYKLLFSHPRMVEDLLRGFVREDWISEVDFTTLETVSGSFVSDDL 58 >UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P8S5_9BACT Length = 303 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 94/301 (31%), Gaps = 35/301 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ +D L L + + + + + ++ V Sbjct: 16 DFGFKRIFG-TAMNKDLLICFLNSLFNGRQVVKDVSYLNPEHVGDVYTDRRA--IFDVYC 72 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD---KLPLVVPILFYQGEA 126 +G G ++E Q+ R + YS + ++ KL + + Sbjct: 73 EGENGE-KFIVEMQNAYQTYFKDRALFYSTFPIREQAPKGNEWDFKLNNIYTVALLN--- 128 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 D F ++ V + + I +E+ + + + Sbjct: 129 -----FNMNEDAFDKEKIRHHVQLCDTATHKVFYDKLEYI-------YVEISKFNKTLEE 176 Query: 187 LMLLLEQLVTLIDEGYT----SGSQLVAMQNYMLQRGHTEQ--ADLFYGVLRDRETGGES 240 L L E+ + + Y + + + + + + + Sbjct: 177 LDTLYEKWLYALKNLYKLTQRPKELCDKVFDRLFEEAEIAKFTPQEMREYETSKMAYRDI 236 Query: 241 MMTLAQWFEE---KGIEKGIQQGRQEV----SQEFAQRLLSKGMSREDVAEMANLPLAEI 293 ++ E +GIE G+++GR E S E A+++L+KGM + +M L EI Sbjct: 237 KNSVDTAKREGIAEGIEIGMEKGRAEGMNLRSLEIARKMLAKGMDEASIMDMTGLTSEEI 296 Query: 294 D 294 Sbjct: 297 K 297 >UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CSV6_9CLOT Length = 317 Score = 61.1 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 47/290 (16%), Positives = 91/290 (31%), Gaps = 38/290 (13%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F+ D D + L + + + D+ + V Sbjct: 10 DRLFRLVFGDRRRLLDLYNALNGSHYE---DPDALEI--TTLDDAVYLSMKNDLSFLV-- 62 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--------DKLPLVVPILF 121 L+ EHQS + M R Y +++ KLP ++F Sbjct: 63 -NGVLNLY---EHQSTYNPNMPVRGFFYLADVYRKYVVEHKLNLYGSRLAKLPSPKYLVF 118 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRV-YNSPFPLVDITITPDD-EIMQHRRIAILELLQ 179 Y G + + + R V + I +++ R + E Q Sbjct: 119 YNGRKE--EPDRKILRLSDAFQGGRNAEPCLELCAVMLNINLGRNQVLMERCRTLKEYAQ 176 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 R R ++ L + +D + ++N++ +A++ +L D Sbjct: 177 FVDRVRRMIAETGALESAVDCAVEDCIRDGILENFLSSH----RAEVLDVILTDYNEQEY 232 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM--SREDVAEMAN 287 M E+ ++GR E E LS+G+ SRE + ++ Sbjct: 233 IAME---------REEAWEEGRAEGLTEGLSEGLSEGLSVSREAILDLLG 273 >UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Ruminococcus torques ATCC 27756 RepID=A5KR99_9FIRM Length = 317 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 79/230 (34%), Gaps = 16/230 (6%) Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI------LFYQGEATPYPLSMC 134 EHQS ++ M R + Y A R + +VP+ FY G+ Sbjct: 83 EHQSTINENMPLRSLLYIGRAYERLVPPRSRYKKKIVPLPTPEFYTFYNGKEKWEKEKEL 142 Query: 135 WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH-----RRIAILELLQKHIRQRDLML 189 Y + +++I EI++ +E++Q + + Sbjct: 143 RLSDAYIVKDGEPSLELKVKVINIRPEEHHEILEKCQVLKEYSQFMEIVQNYQISGEEEP 202 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD--RETGGESMMTLAQW 247 + + I++G + + + + V R+ RE G E + Sbjct: 203 YKKAIKECIEKGILADYLMRKGSEVVNMLLDEYDYETDIEVQREEAREQGREEGR---KQ 259 Query: 248 FEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 E+G ++G ++GR+ Q+ L KG + +A+ I +I Sbjct: 260 GREEGRKQGREEGRKAERSTLIQKKLEKGKTISQIADELEDTEENIACLI 309 >UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3131 Length = 247 Score = 60.4 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 37/270 (13%), Positives = 87/270 (32%), Gaps = 38/270 (14%) Query: 1 MDAPSTTPH--DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG 58 M+ + D VF+ + + D+ LE+ ++ Sbjct: 1 MNNETVNRKYKDTVFRLLFKDKSNLLSLFNAVNDTDFSDENDIKITTLENAIYMT----- 55 Query: 59 HSTDV--LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL------EADH 110 D+ + +++ + EHQS + M +R + Y R++ Sbjct: 56 SKNDISCIIDMKLN--------LFEHQSTVNPNMPYRNLEYVTKCFKRYVGNFDVYTGKA 107 Query: 111 DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHR 170 LP ++FY G + + + N ++ I ++ Sbjct: 108 LTLPNPKFVVFYNG--VNEQPPIRVMRLSDLYAHKDEIPNLELVVIQYNINN---LVNCT 162 Query: 171 RIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVA-MQNYMLQ---RGHTEQADL 226 + E L+++ + + + + +D+G S + + N +L+ + + Sbjct: 163 LMDRCEPLKEYS---EFIGCIRSNLKTMDKGEAVDSAIDYCIGNGILKDFLTNNRNEVRS 219 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKG 256 D E +++ +A E G +KG Sbjct: 220 MSLFEFDAEEHEKAIKQIA---YEDGYDKG 246 >UniRef50_B4VZ11 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZ11_9CYAN Length = 333 Score = 60.0 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 57/296 (19%), Positives = 104/296 (35%), Gaps = 37/296 (12%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI----EESLKGHSTDV 63 +D+ +K+ + R+FL P + E D E D Sbjct: 4 YDSPWKESIS--LYFREFLSFFYPR-IEEDIDWERGFEFLDTELQQIKRETETGRRDADK 60 Query: 64 LYSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L V + G ++ V +E QS+ + + RM Y R+ + VV + Sbjct: 61 LVKVWRRSGEEEWVLVHVEVQSQRQSEFSERMYLYHSRIFDRYRRS-------VVSLGIL 113 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 E + + Y EL FP+V + DE+ + + + ++Q H+ Sbjct: 114 GDEQPGWRPNR------YERELWGCRAILEFPMVKLLDYSMDELARSQNP-LAAIVQAHL 166 Query: 183 RQR--------DLMLLLEQLVTLIDEGYTSGS--QLVAMQNYMLQRGHTEQADLFYGVLR 232 + L + +L + GY QL + ++ + E+ L+ + Sbjct: 167 SAQVAGKDVGVGYESKLSLIKSLYERGYGREDIVQLFRLIDWFIALPKREEERLWQEIQT 226 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 E +T + GI KG++QG Q+ QE R+L E ++ L Sbjct: 227 LEEERKMPYITSIERI---GIRKGLEQGLQQARQEDIVRILELRFE-EIPQKLRGL 278 >UniRef50_Q6ZEK6 Slr5124 protein n=11 Tax=Chroococcales RepID=Q6ZEK6_SYNY3 Length = 276 Score = 60.0 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 51/280 (18%), Positives = 105/280 (37%), Gaps = 24/280 (8%) Query: 26 FLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSK 85 FL + + L S E SL+ D L + L + +E Q++ Sbjct: 8 FLAESFSEDYAAWLLGRPIKLTKLSPTELSLEPIRADSL----ILEQSEDLVLHLEFQTE 63 Query: 86 PDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI------LFYQGEATPYPLSMCWFDMF 139 PD M FRM+ Y + R + + V+ + L YQ + + Sbjct: 64 PDPTMGFRMLDYRVRVYRRFPQKTMHQF--VIYLKRSSNDLVYQDSFQVGETLHRYQAIR 121 Query: 140 YSPELARRVYNSP--FPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTL 197 + + SP PL +T T D + LE ++ + + +LM T Sbjct: 122 LWEQPSEAFLQSPGLLPLAVLTQTSDPTLKLREVATALEQIEDNRVKANLMA-----ATS 176 Query: 198 IDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGI 257 + G +L+ +L+ +++ ++ +L + + G+ L +G +G Sbjct: 177 VFGGILLAPELIKT---ILRSEIMKESAVYQEILEEGKIAGKLEGRLEGKL--EGKLEGK 231 Query: 258 QQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +GR E E L G++ ++A+ ++ + +++ + Sbjct: 232 LEGRLEAKLETIPLLKKLGLTITEIAKELDIDVELVNRFV 271 >UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteria RepID=B1WSK8_CYAA5 Length = 260 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 41/276 (14%), Positives = 94/276 (34%), Gaps = 28/276 (10%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + FL + + L E SL D L +Q + + I Sbjct: 5 DNVCKFLAERFSRDFANWLLNEPIELTELKPTELSLNPIRADSLIFLQSDD----IVLHI 60 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E Q+ PD+ + FRM Y + R+ + + ++ Y P + + + F Sbjct: 61 EFQTSPDEDIPFRMTDYRLRVYRRYPNKE------MYQVVIY---LKPSNSELVYQNTFE 111 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE 200 L + F ++ + D + + + +L R+ + + ++ + Sbjct: 112 LTNLRHQ-----FNVIRLWEENTDSFLNNSGLLPFAVLTCTDNPRETLTQIAAIIDSMPN 166 Query: 201 GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQG 260 + + G D +LR + ++ +G +G Sbjct: 167 QQRQSDISASTA---ILSGLKLDQDSIKRILRSDIMKESVI-------YQEIFHEGEVKG 216 Query: 261 RQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +++ + A +L M+ E ++++ L L EI+++ Sbjct: 217 QKQAIKNIALNMLRNHMNLEVISQLTGLNLQEIEQL 252 >UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF92_9FIRM Length = 307 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 47/273 (17%), Positives = 92/273 (33%), Gaps = 32/273 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +++ + E + + DL LE ++ DV + V Sbjct: 21 DRLWRMIFNNKEDLLQLYNAINHTDYQNPDDLEVNTLEDVLYLS-----MKNDVSFLV-- 73 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--------EADHDKLPLVVPILF 121 G L+ EH S + M R + Y ++ +LP I+F Sbjct: 74 -GGTMNLY---EHLSTFNPNMPLRGVFYFSRLYEGYVADNNLMIYHEKRVRLPKPKYIVF 129 Query: 122 YQGEA-TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH-RRIAILELLQ 179 Y G P + + D F + + +++I + E+M+H RR+ + Sbjct: 130 YNGTKNQPDSMELRLSDCFENTDNDAPCLECTATMLNINYGHNQELMKHCRRLEEYSIFV 189 Query: 180 KHIRQ--RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 + +R+ + + + L ID ++ + + + + Sbjct: 190 QCVREYIQSEPSVEDALEKAIDTCINQDVLADFLKKHRAEVTNMILTTYDKDLYEK---- 245 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQ 270 TL + E+G E+G+ +GR E E Q Sbjct: 246 -----TLKEDAREEGREEGLMEGRAETRAELNQ 273 >UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C3K1_9GAMM Length = 272 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 49/294 (16%), Positives = 108/294 (36%), Gaps = 34/294 (11%) Query: 12 VFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQG 71 K+ F++ L +E+ D I D + + Q Sbjct: 4 FLKKVFSKPHIFTAFVKDMLGIEIE--IDKVETEKSFSPIIGN------VDSRFDLFAQD 55 Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLP---LVVPILFYQGEATP 128 L V I+H+ D R + Y A+ + + + P + ++ G+ Sbjct: 56 TKNRLIVDIQHKRYKDHY--DRFLHYHCVALLEQITSSANYKPDMQVYTIVVLTSGDKHK 113 Query: 129 YPLSMCWFDMFYSP------ELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 L + F + VY P + D T P E ++ ++ + +++ Sbjct: 114 TDLLITDFSPKKLDGSSIAETQHKIVYVCPKYVTDETPKPYQEWLKAINDSLDKQVEESH 173 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 +++++ +LI + S + M++ ++++ L + R+ G E M Sbjct: 174 YH---NEVIQEIFSLIKKDKISPEEYARMKD-----EYSDEEYLQEQTQKARKEGMEKGM 225 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 EKGI KGI++G ++ A+ + ++ E + E+ L + +I+ + Sbjct: 226 -------EKGIGKGIEKGIEKGVLMMAKNMKEAKVAIETIIEVTGLSIEQIEDL 272 >UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKW0_9CYAN Length = 296 Score = 59.2 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 50/316 (15%), Positives = 108/316 (34%), Gaps = 45/316 (14%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 P+ D K+ L + L L LR+ + ++ + E K + D+ Sbjct: 3 PTHIRFDWAIKKLLRNKAN-YGVLAGFLSELLRKPITIQSILEGESNQQAEDDKLNRVDI 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD----KLPLVVPI 119 L + ++IE Q+ ++ RM+ + + LE K V I Sbjct: 62 L--AENDRGEL---ILIEVQNSTEQDYFHRMLYGTSRLITDFLEKGEPYGNVKKVYSVNI 116 Query: 120 LF----------YQGE--------ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITIT 161 ++ Y G LS+ +F S ++ + ++ + Sbjct: 117 VYFSLGQGDDYIYHGTLEFRGLHLDDKLGLSINQRKLFNSQDV--YEIFPEYYVIKVNNF 174 Query: 162 PDDEIMQHRRIAILELLQKH-IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 E+ + L+K I++ L + + S ++ +M R + Sbjct: 175 N--EVASDTLDEWIYFLKKSQIKEEFTAQGLAEAKENLLVDSLSEAERANYLRFMENRRY 232 Query: 221 TEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRE 280 ++ E E + E+G+++G++QG+Q+ A+ L +G + Sbjct: 233 ----EISLIESSRSEGRLEGL--------EEGLKEGMEQGKQQEKVNIARLLKQQGTDLD 280 Query: 281 DVAEMANLPLAEIDKV 296 + L EI+++ Sbjct: 281 TITAATGLTREEIEEL 296 >UniRef50_Q2FSG0 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSG0_METHJ Length = 291 Score = 59.2 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 12/119 (10%) Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 L+K + ++DL + +L ++ + L + E+ +F V E G Sbjct: 176 LEKELPEKDLRNKVRELTLILADKIVDQKILDELW---------EELRMFKVVKYAEEKG 226 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 E + + EKGI+KG+++G+++ + A+ +LS G+ E + + L + IDK+ Sbjct: 227 MEKGL---EKGLEKGIKKGMEKGKKQERETVAKNMLSLGIEDELIIKATGLDQSIIDKL 282 >UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolbachia RepID=Q5GSR2_WOLTR Length = 317 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 57/309 (18%), Positives = 119/309 (38%), Gaps = 31/309 (10%) Query: 10 DAVFKQFLM---HAETARDFLEIHLP-VELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D +FK+ + + FL L E+ + ++ L + I+ + + ++ Sbjct: 12 DLIFKKIFGTEKNKKIIICFLNNILGFAEINAIQEVEFL----SAIIDPEIASNKQSIIV 67 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVVPILFY 122 V + G V IE Q +K R+ Y++ A R L+ + L V I Sbjct: 68 DVFCKDATGTRRV-IEVQLAINKGFEKRVQPYAVKAYSRQLDKSGNYIVDLKKVFFIAIS 126 Query: 123 Q----GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA----I 174 E Y + D + + + F +++ ++ Q I Sbjct: 127 NCNLLSEKVDYISTHNIHDTKTN---GHYLKDFQFIFIELPKFSKSKVEQLINIVEHWCF 183 Query: 175 LELLQKHIRQRDLMLLLEQLV------TLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + + DL + ++++ +DE + + ++A + ++ E+A L Y Sbjct: 184 FFKNAEDTTETDLKRVAKKVLIIKLAYDGLDEFHWNEEDIIAYEERVMNL-QKEKAILEY 242 Query: 229 GVLRDRETGGESMMTLA-QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 + E G E + ++ + + G EKG ++G ++ A+ L GMS +AE+ Sbjct: 243 RLDLATEKGREEGVKISKERGIKVGAEKGREEGVKKAKIAVAKNSLKAGMSIGAIAEIIG 302 Query: 288 LPLAEIDKV 296 L + +I K+ Sbjct: 303 LSVGKIKKL 311 >UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostridium RepID=B1V1L4_CLOPE Length = 300 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 56/312 (17%), Positives = 126/312 (40%), Gaps = 45/312 (14%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D VFK+ AE ++D L L ++ + + L+S ++ + + + Sbjct: 8 DFVFKRLFG-AEESKDSLISLLNAIIKSDNPIKDIELKSPDLEKQHIGDKFCRLDIKAKT 66 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 + +E Q + + M R + Y L A + Y+ A Sbjct: 67 DKGEI---INVEIQVRDEYNMVQRTLYYWSKIYSDQLGASEN----------YKNLARTV 113 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL-LQKHIRQRDLM 188 +++ F + + Y++ + L +IT ++E+ I +EL K I+ ++ Sbjct: 114 CINILNFKLLDND-----RYHNTYRLKEIT--TNEELTDIEEIHFIELPKSKEIKSEEVN 166 Query: 189 LL--LEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF-----YGVLRDRETGGESM 241 + L + + I E + +++ + + +++ T+ L R RE Sbjct: 167 NIDSLLKWIEFIKEPESETVRILELTDESIRKAKTQLYKLSLDKKTIEQYRIREKAMYDE 226 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQE----------------FAQRLLSKGMSREDVAEM 285 ++ + EKG+++G++ GR+E +E A+ LLSKG+ +++A++ Sbjct: 227 ISALENSREKGLQEGVKIGRKEGKEEGLKEGEVRGKLKANRKIAKNLLSKGLELKEIAKI 286 Query: 286 ANLPLAEIDKVI 297 L ++++I Sbjct: 287 LELDENLVEEII 298 >UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Rickettsia RepID=A8GY36_RICB8 Length = 279 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 50/297 (16%), Positives = 99/297 (33%), Gaps = 43/297 (14%) Query: 10 DAVFKQFLMHAETARDFLEI--HLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 D FK+ FL LP ELR + DL + E + ++ + + V Sbjct: 10 DVAFKKLFTDKARLISFLNNIMRLPEELR-IIDLKYISNEQVPDLGQNKRS-----IVDV 63 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQGE 125 ++ N G +++V D +A R+ Y A L+ + L VV ++ G Sbjct: 64 KVTDNSGNIYIVEMQNGYADAFLA-RVQFYGCVAFSSQLKRGKEYADLAPVVMVIITSGF 122 Query: 126 A--TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDD----EIMQHRRIAILELLQ 179 + + ++ + V++ + E ++ + ++ Sbjct: 123 QALPEEKECISYHQTINVGNGKNQLKCLSYVFVELDKFTKEANELETIEDDWLYMMAKFD 182 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 K E V + Y + + F + + + Sbjct: 183 KA-----------------KEPPKHTQDEVVLSAY-------KTIEQFNWSEAEYDNYIK 218 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +M LA EE + ++G+ E S E A+ +L E + + L EI+K+ Sbjct: 219 AM--LAAQTEELNQKSKFKEGKAERSIEMAKEMLQDNEPIEKIIKYTKLSKEEIEKL 273 >UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacteria RepID=A8F2U7_RICM5 Length = 281 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 45/291 (15%), Positives = 99/291 (34%), Gaps = 30/291 (10%) Query: 10 DAVFKQFLMHA-ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D FK+ + + + + DL+ + E E + L+ ++ Sbjct: 10 DIAFKKLFSDKVKLINLLNSLLRLSKGDRIIDLSYITTEQLPLFLEGRRS-----LFDLK 64 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMM---RYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 ++ G ++ IE Q K +K R Y+ + + D LP+V+ + Sbjct: 65 VKDETGRWYI-IEMQRKMEKDYLNRTQLYGCYTYVSQIKKGMKHKDLLPVVIISIIRAKA 123 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + + + S + +++ + +++ +++ L K+ Q Sbjct: 124 LPDELPYISYHHIKESNIHKQYLFSLTYVFIELGKFKKNDLKDDTDE--WLYLLKYASQE 181 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + ++ Y S Q E D F + ++ Sbjct: 182 QEPPKEIKNEIVLS-AYASLEQYKW--------TEQEHDDYFRAEMAIQQE--------I 224 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 FEEK G+++G ++ E A+ +L + E +A L + EI K+ Sbjct: 225 DKFEEK-FNAGMEKGIEKEKIETAKEMLIENGPIEQIARYTKLTIEEIKKL 274 >UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=C3R531_9BACE Length = 325 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 57/326 (17%), Positives = 105/326 (32%), Gaps = 48/326 (14%) Query: 7 TPH-DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFI-EESLKGHST 61 P+ D FK + E FL + +E +++ E L T Sbjct: 12 NPYTDFAFKLLFGTDLNKEILIGFLNALFDGKQV---------IEDVTYLNTEHLGSKET 62 Query: 62 D--VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH---DKLPLV 116 D ++ V + G ++IE Q + R + Y+ + +L V Sbjct: 63 DRRAVFDVYCENEKGE-KILIEMQRGEQQFFKDRSIYYATYPIREQAIKGEIWDYELKAV 121 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLV------------DITITPDD 164 I S ++ V+ V ++ D Sbjct: 122 YVIGILNFALDDVSSSSFRHEVKLMDTTTHEVFFDKLTFVYLEMPKFHKTEQELDTLFDK 181 Query: 165 EIMQHRRIAILELLQKHIRQRDLMLLLE--QLVTLIDEG-YTSGSQLVAMQNYMLQRGHT 221 + + +A L +++R L E ++ E Y L +++ Sbjct: 182 WMFVLKNLARLMERPTALQERVFNRLFEAAEIAQFSKENLYAYEESLKVYRDWNNVIDTA 241 Query: 222 EQADLFYGVLRDRETGGESMMTLAQWFEE-----------KGIEKGIQQGRQEVSQEFAQ 270 Q + G+ G E +A+ EE KG+EKGI +G +Q A Sbjct: 242 IQKGIARGMEEGLVKGMEEG--IAKGMEEGIVKGMEEGIAKGMEKGIAEGEWMKAQTIAG 299 Query: 271 RLLSKGMSREDVAEMANLPLAEIDKV 296 L + G+S ++A++ L EI+ + Sbjct: 300 NLKNAGLSIAEIAKVTGLSEDEINSL 325 >UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax=Proteobacteria RepID=C4ZLA7_THASP Length = 339 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 103/303 (33%), Gaps = 30/303 (9%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI----EESLKGHSTDV 63 +D+ +K+ + H +F++ + P + D + +L D Sbjct: 10 YDSPWKEAVEH--AFPEFIDFYFP-DAGRQIDWARGHRFLDKELQQIVRDAALGRRHVDK 66 Query: 64 LYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L SV G +L V IE Q D A RM Y+ + V + Sbjct: 67 LASVTTHAGEEDWLCVHIEVQGSMDPDFARRMFVYNYRIYDSYDR-------PVASLAVL 119 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPF-PLVDITITPDDEIMQHRRIAILELLQKH 181 + + ++ R P LVD + A++ + Sbjct: 120 ADDDPAWRPDRFGYERL----GCRHNLQFPVAKLVDHAADEAALLCNPNPFALVTAAHLY 175 Query: 182 --------IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 I + D L +L+ D ++ ++M++ + L+ + Sbjct: 176 TRRTRRSPIARFDAKRRLVRLLYERDWTRQRILDFFSVLDWMMRLPREFEQRLWQDIENI 235 Query: 234 RETGGESMMTLAQWF-EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 +T + E+G++KG++QG + ++ ++ + KG+ + A++ L Sbjct: 236 EGERKVKYVTSVERLAIERGLQKGMEQGLEIGIEKGIEQGIEKGIEKGRAQGSASVLLRL 295 Query: 293 IDK 295 +++ Sbjct: 296 LNR 298 >UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacteroidetes/Chlorobi group RepID=Q3ARM2_CHLCH Length = 322 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 45/321 (14%), Positives = 110/321 (34%), Gaps = 53/321 (16%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+ + + FL LP+E + D L + S ++ Sbjct: 13 DFGFKKLFGSEMNKDLLIAFLNTLLPIEAGTIAD---LTFLPNDRVGRSEFDRRA--IFD 67 Query: 67 VQMQGNPGYLHVVIE-HQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPI----- 119 + + G + ++E Q+K D + S + + + L + + Sbjct: 68 LHCKNEKGE-YFIVEMQQAKQDYFKDRSVFYASFPIQEQAQKGKWNYCLQPIYMVGILDF 126 Query: 120 ----------LFYQ---------GEATPYPLSMCWFDMFY----SPELARRVYNSPFPLV 156 + + G+ L+ + ++ EL + L Sbjct: 127 IFDENKADDTIVHHEIKLVNLSTGKVFYEKLTFIYLELPKFTKSVDELESDFDKWCYLLS 186 Query: 157 DITITPDD--EIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 ++ D + + + + EL + + E+ + + + ++N Sbjct: 187 NLPDLTDRPARLQEKVFLKVFELAEIAKYTPEEAREYEKSLKVYRD----------LKNV 236 Query: 215 MLQRGHTEQADLFYGVLRDRETGG--ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 + +A+ + + G E M+ + ++G++KG++ G + E A++L Sbjct: 237 IDCAYDEGKAEGIEEGIEKGKEIGVLEGMVKGKELGLQEGLQKGMEAGLLKGKLEIARKL 296 Query: 273 LSKGMSREDVAEMANLPLAEI 293 + KGMS ++ A +A + + + Sbjct: 297 MVKGMSADEAAGIAGVDVERL 317 >UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAJ2_9SPIR Length = 312 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 46/314 (14%), Positives = 102/314 (32%), Gaps = 37/314 (11%) Query: 10 DAVFKQFLMHAE---TARDFLEI-HLPVELRELCDLNTLHLESGSFIEESLKGHSTD--- 62 D + + DF+ L ++ + L + + K + D Sbjct: 9 DYFVRYLFSSKDSNFILLDFINSTMLDANMKTFRSVEILTPSPKAGSRLNYKENYDDKES 68 Query: 63 ---------------VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE 107 + Q V+IE Q + + + R++ Y + + L+ Sbjct: 69 IAPKVARKVDRCRRRLDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLK 125 Query: 108 ADHDK--LPLVVPILFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDD 164 L V+ I + C + + ++R+ I I Sbjct: 126 QGEKYDALTPVISINL---LNFNLDNNDCIHSCYMIYDTKSKRLLTDHLQ---IHIIEIK 179 Query: 165 EIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA 224 + + L+ K ++ + L+ E N++ R + Sbjct: 180 KFKDNLLDKDLDCWLKFFTIKEKDNREVIMSELVKEKPIMEEVQKRYNNFIKDRLMMNEY 239 Query: 225 DLFYGVLRDRET--GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 D L + E + + + F+ KGIEKGI++G +E A+ + +K + + Sbjct: 240 DKREAYLYGNQIMLEEERRLGIEEGFK-KGIEKGIEKGIKENQILTAKNMKNKNIDIALI 298 Query: 283 AEMANLPLAEIDKV 296 +++ L + EI+++ Sbjct: 299 SDITGLSIKEIEEL 312 >UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXQ3_9FIRM Length = 290 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 111/304 (36%), Gaps = 49/304 (16%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F+ E L + V + D++ + + + S + D+ + + Sbjct: 15 DRLFRFVFGAEENKAYLLSLCNAVSGTDYTDVDDIEITTLS--DAIYIKMKNDISFLIDS 72 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMR-----YSIAAMHRHLEADHDKLPLVV---PILF 121 Q + EHQS + M R M Y I + +L+ L ++ + Sbjct: 73 Q------MNLFEHQSTFNPNMPLRGMECFAELYGIYIIENNLDIYVSSLQKILTPRYYVI 126 Query: 122 YQG-EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-----HRRIAIL 175 Y G E P + + D F P+ + + L +I + ++++ + + Sbjct: 127 YNGTEKQPDVVKLKLSDAFQVPDDSGEFEWTATML-NINYGHNRKLLEQCQPLYEYAHFI 185 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGS---QLVAMQNYMLQRGHTEQADLFYGVLR 232 +L++++ +L +++ V E G+ Q + + ML E+ Sbjct: 186 KLVREYSEAMELKKAIDKAVEKAREWKCIGTFLYQCKSEVSVMLLTEFDEKK-------- 237 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 ++++ L G ++GR++ + +L+ +S E +A+ + + Sbjct: 238 ----HEDNLIKL-----------GEKEGREKERMKNICSMLALSLSPEIIAKACEVSVDY 282 Query: 293 IDKV 296 + + Sbjct: 283 VLNL 286 >UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255_PLEBO Length = 295 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 53/305 (17%), Positives = 108/305 (35%), Gaps = 40/305 (13%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI----EESLK 57 + T +D +K F+ R+FL P + D + + + + Sbjct: 5 SSENTDYDNPWKTFIE--LYFREFLAFFFPT-IEADVDWSKPVRFLDKELQKIVRDAEIP 61 Query: 58 GHSTDVLYSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D L V +++G + IE QS+ ++ RM Y+ R+ V Sbjct: 62 KRYADKLVEVHRLRGERTLVICHIEVQSQEERDFVARMYSYNYRLRDRYNC-------PV 114 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITIT----PDDEIMQHRRI 172 V + + + S + EL + FP+V ++ + E +Q+ Sbjct: 115 VSLAILGDDRPNWRPSRFY------DELWGCATHFEFPIVKLSDYQSQWTELEAIQNPFA 168 Query: 173 AILELLQK----HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + K H + + L T++ + S ++ + N++ + + +L Sbjct: 169 VVAMAHLKTKETHNQPLERKRWRYHLTTMLYDRGYSEQDILELHNFLDWLMNLPE-ELER 227 Query: 229 GVLRDRETGGESM-MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 + + ET E+ M E + + E Q A +L + + E +AE+ Sbjct: 228 QLQAELETFEEARRMKYVSSLERR--------AKLEEKQAIALNMLRRNLDMELIAEVTG 279 Query: 288 LPLAE 292 L +AE Sbjct: 280 LTIAE 284 >UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTD5_DYAFD Length = 308 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 45/319 (14%), Positives = 103/319 (32%), Gaps = 54/319 (16%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+ + + DFL + E R + DL E+ I + ++ Sbjct: 10 DFGFKRIFGSEANKDILIDFLNVLFAGE-RLVADLTFASNENNGRIPILRRA-----IFD 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + G G +IE Q + R + YS + + +EA G Sbjct: 64 LCCTGADGE-QFIIEVQRVRQEYFKDRCLYYSASLIRDQVEAG--------------GTN 108 Query: 127 TPYPLSMCWF-----DMFYSPELARRV---------------YNSPFPLVDITITPDDEI 166 Y L + F + + +++ E Sbjct: 109 WRYDLKPVYLIGLMDFCFEDSDDGHYLHEIRLIKRSNGQVFYDKFGLTFIEMPAFQKKES 168 Query: 167 MQHRRIAILELLQKHIRQRDL------MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 + L K++ + ++ + +++ + + + + +A Y+ + Sbjct: 169 DLSTELDRWLYLLKNLSKLNIVPPVLTNPVYQKVFRVAEVCNLNKEEKMAWDAYLKAKWD 228 Query: 221 TEQADLFYGVLRDR----ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 E + + R E E + ++GI+KG + G + ++ + +L+KG Sbjct: 229 NENSMDYAKKEAMRVGHEEGHKEGHKEGHKEGMKEGIKKGRETGIELGKRQVVKNMLAKG 288 Query: 277 MSREDVAEMANLPLAEIDK 295 + ++++ L +I Sbjct: 289 FDMQTISDITGLTFEQIRN 307 >UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ATN4_CHLCH Length = 287 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 105/303 (34%), Gaps = 50/303 (16%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D V K L A D I L ++ L + ++ DV+ V + Sbjct: 5 DVVSKDIL--KRIALDIARILL------HLKVDHAELLETEH--QRVEERRADVV--VLV 52 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 QG G + +E Q+ +A+R++RY H D + L Y G+A Sbjct: 53 QGESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAHKGYDIKQY------LIYIGKAP-- 104 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL-----LQKHIRQ 184 + + + + + ++D+ ++ L L + + Sbjct: 105 --------LSMPTGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDFKGRSER 156 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 + ++++L L E + + M + L E+M+++ Sbjct: 157 EVVRYIIQRLQELTAENESRYHDYMRMLEILSA----------NRSLEKIIEEEEAMLSV 206 Query: 245 AQWFE----EKGIEKGIQQGRQEVSQEFAQRLLSKGM---SREDVAEMANLPLAEIDKVI 297 G+ GI+QG Q+ + +R L++ S VA + L + +++++ Sbjct: 207 VDQTRLPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHHVARLDKLNIEQLEELS 266 Query: 298 NLI 300 + + Sbjct: 267 DAL 269 >UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA1_9CLOT Length = 302 Score = 58.1 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 52/291 (17%), Positives = 112/291 (38%), Gaps = 32/291 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D++F+ + + LE++ + + + L + + + G D+ + + Sbjct: 18 DSLFRVIFSEKK---ELLELYNAINGSHYENPDDLIITTIGDVL--YLGMKNDISF---L 69 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD----HDKLPLVVP----ILF 121 G L+ E QS + M R + Y +L+ + + PL +P I+F Sbjct: 70 IGQHLSLY---EAQSTWNPNMPLRGLFYFSRLYQGYLKEHQLDLYSRRPLSLPFPEFIVF 126 Query: 122 YQGE-ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH-RRIAILELLQ 179 Y G P + D+FY E + + +++I ++E+M+ R++ L Sbjct: 127 YNGTMEQPDRTQLRLSDLFYQAEGVPCLECTA-TMININYGHNEEMMKSCRKLYEYAFLI 185 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 +R R L L +D+ Q ++N++L+ + + + E Sbjct: 186 NAVRSRLNEGL--HLEAAVDQAVEDCIQHDVLKNFLLKHREEVREMILSEYDEELHINSE 243 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEF---AQRLLSKGMSREDVAEMAN 287 ++ E+G+E G+ QG Q + RL + G + + + + Sbjct: 244 KKISY-----EEGLEAGVVQGTQHGQERVNALITRLAAAGRADDIIRSAED 289 >UniRef50_Q8YQI6 All3837 protein n=4 Tax=Cyanobacteria RepID=Q8YQI6_ANASP Length = 276 Score = 58.1 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 41/283 (14%), Positives = 97/283 (34%), Gaps = 19/283 (6%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG--HSTDVLYSVQMQGNPGYLHV 78 + + P + ++ + F+ LK D ++ + L+ Sbjct: 5 KIFYSLFQAF-PSIFFAIIGETDINPSTYEFVSVELKETAFRIDGVFKPVNESTEEPLYF 63 Query: 79 VIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDM 138 +E Q + D K R + ++ + + ++ P + P + + Sbjct: 64 -VEVQFQLDPKFYRRFFAEIFLYLRQNPSVNFWRAVVIYP------QRIIEPDDQQPYRL 116 Query: 139 FYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ-RDLMLLLEQLVTL 197 +R+Y L ++ ++ + I+ I Q R L++ Q +T Sbjct: 117 ILDSSQIQRIY-----LDELGTASENSLQLAIVQLIIASEATAIDQGRQLIIQARQELTD 171 Query: 198 IDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEE---KGIE 254 + + Y E+ G+ + + EE +G + Sbjct: 172 EANKKQIVELIETILLYKFTNLSREEVAAMLGIDDEFKKTRMYQSIKEDGLEEGRQEGRQ 231 Query: 255 KGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +G Q+G+ + E RLL+ G++ E +A +L + ++ +V+ Sbjct: 232 EGRQEGKLQAKLEAIPRLLALGLNVEQIAGALDLTIEQVQEVV 274 >UniRef50_Q899X1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=Q899X1_CLOTE Length = 306 Score = 57.7 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 44/300 (14%), Positives = 103/300 (34%), Gaps = 36/300 (12%) Query: 10 DAVFKQFLM-HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D++ K + ++A DF + D + + +K + TD Y+ Sbjct: 8 DSIMKNAMDIFKQSAVDFFK----------LDTKIIAPANTELKTIDIKTNFTD--YTFY 55 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 + + ++ E Q+ ++ R + Y + +++ + V ++ Y + Sbjct: 56 TENDD---YLHFEFQTTNKEEDINRFLFYDASLFYKYGKK-------VNTLVVYSSDIKK 105 Query: 129 YPLSMCWFDM------FYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 + + FY L + T E + I L + Sbjct: 106 SKTKVDAGSLKYEIKAFYMSSLNGDEEYNNLK----TKIDKGEDLTKEEILSLTFIPLMD 161 Query: 183 RQRD---LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 + D + +L ++E T + + ++ + G + F V E G Sbjct: 162 SKEDKSTRTIKSIELAEKMEENNTKLQCITLLYAFLEKFGDAKSKKKFKEVFSMTEIGRM 221 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 + + +GI+KGI++G ++ E S+ + ++ + + +P I K+ L Sbjct: 222 IVEESIEKGRAEGIKKGIEEGIKKGRTEGKTEGKSEILIKQLIKKFKKVPEEYIQKIKTL 281 >UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FYK3_ABIDE Length = 365 Score = 57.7 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 55/309 (17%), Positives = 109/309 (35%), Gaps = 35/309 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESG--SFIEESLKGHSTDVLYSV 67 D + K+ M + DFL + R++ + + L SG + + + D Sbjct: 5 DILEKKLFMFNDVFADFLNGII-FNGRQIVEESELFDLSGWSHYKADDSRHRYQDRDVVK 63 Query: 68 QMQGNPGYLHVV-IEHQSKPDKKMAFRMMRYSIAA-----------MHRHLE-------- 107 + + ++ IE+Q PDK M FR++ Y A+ +HL+ Sbjct: 64 LWKKKNVVISLIGIENQDVPDKDMVFRVLSYDGASYKTQLAKKDEDKRKHLKDKKNTEIV 123 Query: 108 ----ADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPD 163 D + V+ + Y GE + + L V + L+D+ + Sbjct: 124 EIGKEDEKDIFPVITFVVYYGEEEWKYETTLKKRLKIGDGLDEFVSDYKINLIDLKKFTE 183 Query: 164 DEI--MQHRRIAILELLQK--HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRG 219 D+I + ++ + K + + L + V+ + T + +N + Sbjct: 184 DDINKFKKDFKLLVNYMVKGSNHDAGSIELNHPEEVSELVLRLTGEELPIPRENDGGKTM 243 Query: 220 HTEQADLFYGVLRDRETGGESM-MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 +F + E G + MT KG+ +G+ +G E + L+KGM+ Sbjct: 244 EKFFEPMFARMAEKAEARGMAKGMT---EGMAKGMTEGMAKGLAEGKAKGMTEGLAKGMT 300 Query: 279 REDVAEMAN 287 +A Sbjct: 301 EGMAKGLAE 309 >UniRef50_B4B4Q2 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B4Q2_9CHRO Length = 295 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 78/186 (41%), Gaps = 18/186 (9%) Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP---LVDITI------TPDDEIMQH 169 ++ Y G P + ++ + RR+Y P + + ++ Sbjct: 111 VVIY-GSRRYEPRQVLPYEDLLRSDRVRRIYLDELPQECFSSLNLRLVEFIISGEDAAVQ 169 Query: 170 RRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG 229 + ++E + + + + + +L L TLI + + +Q M +++ L+ Sbjct: 170 KGRQLVEEVDRIEDEDERLEILGLLETLIAYKFDNLTQ--EEIRQMFSLDEFKKSRLYQD 227 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 + R+ + G + E+G+E+G+++GR+E + L +G S ED+A++ L Sbjct: 228 IYREAQEKG------LERGLERGLERGLERGREEAKLQTIDALFRRGFSVEDIADIVQLD 281 Query: 290 LAEIDK 295 + + + Sbjct: 282 VERVRQ 287 >UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYF1_9BACE Length = 349 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 111/356 (31%), Gaps = 67/356 (18%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D FK +++ L L L + L +++ Sbjct: 1 MSKYVNPFTDIGFKIIFGQP-ASKNLLITLLNELLAGEHHITELTFLDKEDHADNVSDKG 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-------- 112 ++Y + + G ++++E Q++ R + Y A+ R +E+ K Sbjct: 60 --IIYDLYCRTASGE-YIIVEMQNRWHSNFLDRTLYYVCRAVSRQIESPSSKEVPVPEDP 116 Query: 113 ---------------LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRV--------Y 149 LP + I + + + + V Sbjct: 117 MTAREPLVSYGKQYRLPTIYGIFLTNFKEENLEAKFRTDTVLSDRDTGKIVNPHLRQIYL 176 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM--LLLEQLVTLIDEGYTSGSQ 207 P+ D++ D + + I L+ + R D + + E L L S Sbjct: 177 QFPYFTKDLS---DCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLAAVADLSEEN 233 Query: 208 LVAM--------QNYMLQRGHTEQADLFYGVLRD-------------------RETGGES 240 +A N +++ + + + E E Sbjct: 234 RIAYDKALDRYRVNQIVEEDERRKNEEMRRKAAEEGLKEGMKAGLEKGVKKGRLEGIKEG 293 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 M + ++G+EKG+++G Q+ E A+++ G+S + + + L ++I+ + Sbjct: 294 MKEGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSSDIENL 349 >UniRef50_A7M2M6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M2M6_BACOV Length = 182 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 62/158 (39%), Gaps = 14/158 (8%) Query: 149 YNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM--LLLEQLVTLIDEGYTSGS 206 P+ D++ D + + I L+ + R D + + E L L+ S Sbjct: 29 LQFPYFTKDLS---DCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLVAVADLSEE 85 Query: 207 QLVAM--------QNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQ 258 +A N +++ + + + + + +E G+EKG++ Sbjct: 86 NRIAYDKALDRYRVNQIVEEDERRKNEEMRRKAAEEGMKEGLKEGIREGIKE-GMEKGME 144 Query: 259 QGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +G Q+ E A+++ G+S + + + L ++I+ + Sbjct: 145 KGEQKKQIEIARKMREDGISIDTIIKYTGLQSSDIENL 182 >UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Treponema RepID=Q8GBS6_TREMA Length = 262 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 54/297 (18%), Positives = 102/297 (34%), Gaps = 57/297 (19%) Query: 10 DAVFKQFLMHAETARDFLEIHLPV---ELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D +F Q + + + FLE+ L + + +T+ ES + D+L Sbjct: 13 DFMFCQVMKNKNLCKTFLEMLLADKIGNITHIASQSTVAPESEAKFV------RLDIL-- 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +Q + IE Q + +A RM Y A L+ +Y Sbjct: 65 --VQDEKNNFYD-IEMQVVNEHNVAKRMRYYQSALDVSFLDKGE----------YYTNLK 111 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y + +C FD + VY I +DE IR RD Sbjct: 112 DSYIIFVCLFDFIGK---NKAVYFFE------NICLEDEP---------------IRLRD 147 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYM----LQRGHTEQADLFYGVLRDRETGGESMM 242 + ++ + L Y+ + +E+ + ++ E + Sbjct: 148 GTKKI--IINVDAFKNIKDKALSGFLEYIKTGCITTKFSERIEKMIRTIKQNEQARQEYR 205 Query: 243 TLAQWFE---EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 ++ E+G +G G + ++ A L + G+++ +A+ L LAEI+K+ Sbjct: 206 FISAVVMDAKEEGRSQGFTDGVNQTKRKTAAALKAMGLAKSKIAKATGLSLAEIEKL 262 >UniRef50_C4FIG5 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIG5_9AQUI Length = 346 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 51/341 (14%), Positives = 110/341 (32%), Gaps = 85/341 (24%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D + L +E + ++ D++ S+K D+L Sbjct: 7 DITVRDVLKEP--ILKLIERLVGKKIVRSLDIS----------LPSIKERKVDILLEA-- 52 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 + IE QS K M +RM+ Y + + + ++ + Y GE Sbjct: 53 ---EDRSLIHIELQSTNHKYMHYRMLEYRLEITRKFKANN------IIQFVIYLGEKK-- 101 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL------------ 177 E++ + + L+DI P +I++ I + L Sbjct: 102 --------CTMKSEISEKDLIYRYNLIDIKQIPCQDIIKEDDIDSIVLGFLCNLKNKEKL 153 Query: 178 ---------LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 +++ D + + ++ L + +L+ + + + DL Sbjct: 154 LEKLKIKLSTLDDVKRADWIRGILLILGLRPKLKKEFQRLIDREEIKMPITIRIRRDLIE 213 Query: 229 GVLRDRETGGESMMTLAQW----------------------------FEEKGIEKGIQQG 260 + + E+ + +KG++KGI++G Sbjct: 214 DLPLIGDLLREAEREAIEKGIEKGLKKGLKKGLKKGLKKGLKKGLKEGLQKGMQKGIKEG 273 Query: 261 RQEVSQEFAQRLLSKGM---SREDVAEMANLPLAEIDKVIN 298 Q+ QE Q+ L +G+ +ED+ ++ ++ K IN Sbjct: 274 LQKGLQEGLQKGLQEGLVKAKQEDIIKVLEAKFGKLSKTIN 314 >UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369BC Length = 310 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 86/292 (29%), Gaps = 48/292 (16%) Query: 10 DAVFKQFLMHAETARDFL-------EIHLPVELRELCDLNT-LHLESGSFIEESLKGHST 61 D K+ L D + L EL + + + + S +++++ Sbjct: 5 DFYIKKLLQDPARFADLYNAEIFHGKQILKAELLSPVSTESGIAITNRSGRKQTIQRRR- 63 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRY--------SIAAMHRHLEAD---- 109 D+ + ++ E Q + M R + Y H + Sbjct: 64 DIAMKASI--GACFIVAGCEAQGEIHYGMPIRSLTYDALDYTEQLTEIQKEHRKKKDLAK 121 Query: 110 ----------HDKLPLVVPILFYQGEATPYPLSMCWFDMFYS-------PELARRVYNSP 152 DKL V+ ++ Y G+ P+ +DM P+L + + Sbjct: 122 SPEFLSGITRRDKLQPVLTLVLYCGK-DPWDGPKSLYDMLDLRGPTECIPDLLAALPDYR 180 Query: 153 FPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ 212 LVDI + + + + +L+ + + I + A+ Sbjct: 181 INLVDIRKIENLSLYKTGLQQVFGMLKYSTDKSKFYNYITSNHDQISMLDDN-----ALT 235 Query: 213 NYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV 264 M G E L + G +M G +G ++G++ Sbjct: 236 AVMGLLG--ENRRLMKYLAAPGREEGYTMCQAIDDLIADGKLEGKREGKRRG 285 >UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=B8FTH9_DESHD Length = 325 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 46/318 (14%), Positives = 87/318 (27%), Gaps = 41/318 (12%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK + FL L + + T+ + K DV Sbjct: 10 DYAFKLIFGKEGNEAILIAFLNAALKLPQERRIEEITIINPELNKEYPEDKKSILDV--R 67 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLP--LVVPILFY 122 + + IE Q M R + Y R + + +L + + I+ + Sbjct: 68 AITSQG---MQINIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDF 124 Query: 123 QGEATPYPLSMCWFDMFYSPE------LARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 + + L P L + + Sbjct: 125 NYLKQTSSYHNVFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRKREIS--LWENELVRWL 182 Query: 177 LLQKHIRQRDLMLLLEQLV-------------------TLIDEGYTSGSQLVAMQNYMLQ 217 LL + ++++ +LE++ I E Y + + + ++ Sbjct: 183 LLLEGADNQEILQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIR 242 Query: 218 RGHTEQADLFYGVLRDRETGGESMMTLAQWFE--EKGIEKGIQQGRQEVSQEFAQRLLSK 275 + + G + E +G +G +GR E E A++LL Sbjct: 243 EAELRLQEALEEGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEGRAEVAKKLLVL 302 Query: 276 GMSREDVAEMANLPLAEI 293 G +AE L EI Sbjct: 303 GFEITKIAEATGLSEEEI 320 >UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostridium RepID=C0CTJ7_9CLOT Length = 327 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 46/320 (14%), Positives = 103/320 (32%), Gaps = 41/320 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHL--PVELRELCDLNTLHLESGSFIEESLK----GHSTDV 63 D V ++ E D + + ++ D+ L + D Sbjct: 5 DMVLNRYFEDGERYADLINGYAFNGDQVVRKEDVQELDPRETGVAGRLGRRPGVQKYRDS 64 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-------------- 109 + V + ++ + +EHQ + M R M A R L Sbjct: 65 IRRVVLGAR--FVLIGLEHQDQVHYAMPVRAMLQDAAEYDRQLRRIRRVNRRVGGLTGAE 122 Query: 110 -------HDKLPLVVPILFYQGEATPYPLSMCWFDMFY----SPELARRVYNSPFPLVDI 158 D++ V+ ++ Y G+ P+ +M + + R V N ++++ Sbjct: 123 FLGGFTRKDRVCPVITLVLYYGKK-PWDGAMDLHGLMDCAGYPEPMLRLVNNYRLHVLEV 181 Query: 159 TITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQR 218 + + + +Q+ + E+ + ++ + Sbjct: 182 RRFVNIRRFRTDLYQVFGFIQRSGDKEAERRFTEENRVYFEGMDEEAFDVITA---ITGS 238 Query: 219 GHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQE----FAQRLLS 274 E+ Y R E++ + + +G +G +G+ E + E A+ + Sbjct: 239 RELERVKEQYREEGGRINMCEAIRGMIEDGRIEGRLEGKIEGKYEGALEKTRTVARNMYL 298 Query: 275 KGMSREDVAEMANLPLAEID 294 +GMS ED A + + A+I+ Sbjct: 299 RGMSAEDAAAICEMDTAQIE 318 >UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CVG1_ORITI Length = 96 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 33/125 (26%), Positives = 48/125 (38%), Gaps = 33/125 (26%) Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 +LE + KHI QRD++ L E+ + G Sbjct: 1 MLEYMLKHIHQRDMLKLWEEFLIKFKHGLILDK--------------------------- 33 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 G SM T+A + ++GI K GR E +QE + LL G E ++E L E+ Sbjct: 34 --EKGNSMRTIAAKYIDEGIAK----GRAEAAQELTRNLLKAGFLVEFISETTGLSKEEV 87 Query: 294 DKVIN 298 V N Sbjct: 88 VNVKN 92 >UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366FA Length = 342 Score = 56.9 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 54/327 (16%), Positives = 101/327 (30%), Gaps = 45/327 (13%) Query: 10 DAVFKQFLMHAETARDFL-------EIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 D K L DF+ + L + L + + + D Sbjct: 8 DYYMKILLEDRARFADFINVNVFHGKQVLAADKLSLLPNEAGIVVVDADGVKRTIQRRRD 67 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRY-------SIAAMHRHLEADHDKL-- 113 V+ + + V E+Q K MA R M Y I + A+ DKL Sbjct: 68 VVMKAEF--GAYFCVVASENQGKVHYGMAVREMMYDALDYTEQIRKIEEKHRAEGDKLEG 125 Query: 114 --------------PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDIT 159 P+V L+Y EA P S+ + P I Sbjct: 126 ADFLSHVTKADRLIPVVTLTLYYGNEAWDGPRSLYEMMGIDEEWEETALVKKCLPDYKIN 185 Query: 160 ITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRG 219 + E ++ + +H+ ++ + + E + + + + Sbjct: 186 LIDIRE---GEKLDQYKTSLQHVFG---LVKYNKNKQKLYEYTRVHREEINRMDRESKAA 239 Query: 220 HTEQADLFYGVLRDRETGGESMMTLAQWFEE-------KGIEKGIQQGRQEVSQEFAQRL 272 + + E+ E M + Q +E +G +GI G ++ F ++ Sbjct: 240 ALALIGEQKRLQKILESKREEEMDMCQAIDELIADGEVRGEVRGILMGMEKTKINFIRKQ 299 Query: 273 LSKGMSREDVAEMANLPLAEIDKVINL 299 K +S +A + +L ++KVI L Sbjct: 300 YKKQLSSSQIANILDLDERYVEKVIKL 326 >UniRef50_A7BN25 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. SS RepID=A7BN25_9GAMM Length = 219 Score = 56.9 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 58/179 (32%), Gaps = 20/179 (11%) Query: 108 ADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEI 166 KLP V P++ Y G + P L + + + L+D D ++ Sbjct: 2 KKKIKLPPVCPVVIYNGNKAWNAAQEISELIEEVPGGLEKYRPHLRYFLIDEAKFADADL 61 Query: 167 MQHRRIAILELLQKHIRQRD--------LMLLLEQLVTLID--EGYTSGSQLVAMQNYML 216 + + ++ R D + +L LV + E +V +L Sbjct: 62 APLHNLVAAIIRLENTRSFDDEKALAEAISQVLNLLVDWLKDSEFIQLRRDIVTWLRRVL 121 Query: 217 QRGHTEQADL--FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 + ++ + E+M Q E+KG +G QG AQ LL Sbjct: 122 LPKNLPDVEIPEVIELQEMNAMLRENMQLWYQTAEKKGEARGKAQG-------IAQTLL 173 >UniRef50_A1WV23 Putative uncharacterized protein n=1 Tax=Halorhodospira halophila SL1 RepID=A1WV23_HALHL Length = 226 Score = 56.9 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 56/122 (45%), Gaps = 4/122 (3%) Query: 162 PDDEIMQHRRIAILELLQKHIRQRDLMLLLEQ-LVTLIDEGYTSGSQLVAMQNYMLQR-G 219 PD+ + + + + + + + L + LV L+ + + Y+ + G Sbjct: 64 PDEPDASYSQDPAVRAVLRALAWSCVQELSREDLVHLLRDLPPGHPLEKPLLVYIARTYG 123 Query: 220 HTEQADLFYGVLRDR--ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 +AD+ Y + + R E E MT+A+ + ++G ++G Q+GRQE QE R L + + Sbjct: 124 SIAEADVRYALEQTRPIEQAEELTMTVAEEWIQRGRQQGWQEGRQEGLQEAETRALLQQI 183 Query: 278 SR 279 Sbjct: 184 EL 185 >UniRef50_C9LBM4 Putative uncharacterized protein n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LBM4_RUMHA Length = 247 Score = 56.1 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 1/73 (1%) Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 L++ E MTL + +++ ++G+++G + + ++LSKG+S E++ E+ Sbjct: 174 FVKADLKESREMEERFMTLEEMLKDE-RKEGLKEGTVKAQKRIVSKMLSKGLSDEEIMEL 232 Query: 286 ANLPLAEIDKVIN 298 ++ L E++ + N Sbjct: 233 CDISLEELENLKN 245 >UniRef50_Q6D2V6 Putative uncharacterized protein (Fragment) n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2V6_ERWCT Length = 77 Score = 56.1 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 18/71 (25%), Positives = 39/71 (54%), Gaps = 12/71 (16%) Query: 242 MTLAQWF------------EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 MT+A+ ++G+E+GI+QG + +++ A+ LL GM +E V ++ L Sbjct: 1 MTIAEQLKKMGFDEGIQRGIQQGLEQGIEQGMKNSARQIARELLLTGMDKEKVRQITRLD 60 Query: 290 LAEIDKVINLI 300 E+++++ + Sbjct: 61 DEELEQLVTAV 71 >UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BNN6_9LACT Length = 302 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 51/314 (16%), Positives = 106/314 (33%), Gaps = 39/314 (12%) Query: 1 MDAPSTTPHDAVFKQFL---MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK 57 M T D +FK+ + DF+E ++L+ + N +E+ E+L Sbjct: 1 MKIKPTN--DLLFKKMMTTAGKEYILEDFIEAVTGMKLKNVRPANPYQIETYQKTIENLN 58 Query: 58 GHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVV 117 + V G + ++IE Q K R+ Y A ++ +A+ K ++ Sbjct: 59 PVMYSTIVDVAATTEDG-MEIMIEMQLYQHKDFFERIFNYMATAYTQNYKAETAK--PII 115 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 I+ T + + + + L Y + + + RI ++ L Sbjct: 116 SIVV-----TNFTVFPEFQEARIEIGLTNFAY--------YQEIRNRKQQPYWRIYLVNL 162 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 K I + + + G ++ + A + E Sbjct: 163 TDKAIVNGE-SRDFSEWRDFLKNGTIKPKSSRGLKEAQKIVNFSNLAGEERRLAELMEKY 221 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-----------------MSRE 280 + + + E+G+E+GI+ GRQ+ +R + KG + E Sbjct: 222 EDVYYQVMKHQLEEGLEQGIEIGRQQGVALGEKRGMEKGVALGERKGQVMICFKMNLPIE 281 Query: 281 DVAEMANLPLAEID 294 ++ + L + EI+ Sbjct: 282 EIQKHTGLSIEEIE 295 >UniRef50_A8YH27 Similar to tr|Q8YMI8|Q8YMI8 n=19 Tax=Cyanobacteria RepID=A8YH27_MICAE Length = 298 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 55/124 (44%), Gaps = 14/124 (11%) Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNY-----MLQRGHTEQADLFYGVLRDRETGGES-- 240 + +L+ L + L + ++ + TE ++F + + + E+ Sbjct: 171 NQAVRELLELPQGNAFRENVLELLISWRVNMEINNILETEDREVFMTLSQTYQEWKEATK 230 Query: 241 -------MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 + + E+G+E+G++QG+ E E RLL+ G+S E +A+ +L L ++ Sbjct: 231 REGRLEGLERGLEQGLERGLERGLEQGKLEAKLESIPRLLALGLSVEQIAQALDLDLEQV 290 Query: 294 DKVI 297 + I Sbjct: 291 RRAI 294 >UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C371D2 Length = 317 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 104/319 (32%), Gaps = 45/319 (14%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI----EESLK----GHST 61 DAV K ++ +E D L R++ L + I + + Sbjct: 5 DAVTKDYMQDSEHFADAF-NFLLYGGRQVIKPEQLKPLDTTSIALPYGDESRFVPIQKYR 63 Query: 62 DVLYSVQMQGNPGYLHVV--IEHQSKPDKKMAFRMMRYSI--------AAMHRHLEADH- 110 DVL V + +++ IE+QS M R M Y H ++ Sbjct: 64 DVLKMVTAMEDENATYLILGIENQSDIHYAMPIRNMLYDAIQYVNQADTIAKEHRKSKKM 123 Query: 111 --------------DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLV 156 D++ ++ + Y G + + ++ + V N L+ Sbjct: 124 PETRAEYLSGFYKTDRILPIITLTLYFGADEWDAPRDLHSMLTANEDILKFVDNYHLHLI 183 Query: 157 DITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYML 216 D++ + L L K+++ L +V + + M N + Sbjct: 184 APAEIEDEDFAKFHTE--LSLALKYVKYSKDKKKLRDIVNEDTAFRSVSRKTADMVNVVT 241 Query: 217 QRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 L Y +R E++ + + +G +GI++G + + Sbjct: 242 SSN------LHYNDGEERVDMCEAIEEIRKDALAEGKAEGIEEGIIRTLIGLVKDGI--- 292 Query: 277 MSREDVAEMANLPLAEIDK 295 ++ D A+ A++ + E ++ Sbjct: 293 LTIADAAKRADMTVPEFEE 311 >UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKZ7_9FIRM Length = 329 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 39/313 (12%), Positives = 80/313 (25%), Gaps = 55/313 (17%) Query: 14 KQFLMHAETARDFL-------EIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 ++ L H DF L E + ++ + D++ Sbjct: 9 RKLLNHPARFADFYNGTVFGGRQVLRPEQLSDVPNEQGIVILDKDGKKRVVERRRDIIKK 68 Query: 67 VQMQGNPGYLHVVI---EHQSKPDKKMAFRMMRY--------SIAAMHRHLEAD------ 109 + ++ E+Q M R M Y H Sbjct: 69 ASFGA-----YFILAAEENQDTIHYGMPVRNMMYDALDYTEQMECLKQAHKSRGDVLDGG 123 Query: 110 --------HDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARR-------VYNSPFP 154 D+L VV ++ Y G P+ +DM A+ + + Sbjct: 124 GFLSGITREDRLMPVVSLILYHGSK-PWDGPRSLYDMLGLDASAKETLALKQVLPDYRIN 182 Query: 155 LVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 L+D + E+ + +L+ + + ++ + + Sbjct: 183 LIDASNIEHPELFCTSLQHVFSMLKYNTDK-------QKFYGYAKQHQKDLLDMDDDSML 235 Query: 215 MLQRGHTEQADLFYGVLRDRETGGE--SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 + EQ L + E + G +G +G+ E A L Sbjct: 236 AMLTLLGEQKRLLKILETSSNDTKEGTDVCIAIDELINDGKIEGKIEGKIEGEHRLA-TL 294 Query: 273 LSKGMSREDVAEM 285 + + V + Sbjct: 295 MDRLFKDGRVEDA 307 >UniRef50_B7UFQ6 Predicted protein n=11 Tax=Escherichia RepID=B7UFQ6_ECO27 Length = 73 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 16/68 (23%) Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQE----------------FAQRLLSKGMSREDVAEM 285 MT+A+ ++G + G Q+G+ E QE A R+L +G+ R+ V Sbjct: 1 MTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQESMHEQAIKIALRMLEQGIDRDQVLAA 60 Query: 286 ANLPLAEI 293 L ++ Sbjct: 61 TQLSETDL 68 >UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZGR2_EUBR3 Length = 370 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 86/240 (35%), Gaps = 35/240 (14%) Query: 79 VIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------------KLPLVVPILFYQG 124 + EHQS M R + Y + L K+P ++FY G Sbjct: 134 IYEHQSTVCPNMPVRSLIYFSVILSDMLSDKKKGTKSGKNIYGRRLVKIPTPHFVVFYNG 193 Query: 125 -EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-----HRRIAILELL 178 E P + D F P + + +I + IM+ + + + + Sbjct: 194 EEEQPEVQELKLSDAFEKPTDEPNL-ELKCKVYNINDGKNKAIMESCGWLNDYMTFVNKV 252 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 +++ L + ID Y + ++ + T+ L Y R E Sbjct: 253 REYHADGAFDDLAIDIEKAID--YCIDNDILKEFLKTYRSEVTKSMQLNYEFDRQLELER 310 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-MSREDVAEMANLPLAEIDKVI 297 + E+G+E GI++G + + L++KG + + AE A + ++E +K++ Sbjct: 311 ADAI-------EEGMEIGIEKG----ANKMLFTLVTKGKLDIDTAAEEAGVSVSEFEKLM 359 >UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D7Q8_9CLOT Length = 351 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 52/280 (18%), Positives = 85/280 (30%), Gaps = 43/280 (15%) Query: 32 PVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIE-HQSKPDKKM 90 P +L ++ D N + + ++ DV+ Y ++ E +Q K M Sbjct: 47 PEDLSDVPDENGIAIVGLDGKRRLIRRSR-DVIKKASFG---AYFVLLAEENQDKVHYAM 102 Query: 91 AFRMMRYSI--------AAMHRHLEADH--------------DKLPLVVPILFYQGEATP 128 R M Y A RH E D++ VV + Y G P Sbjct: 103 PVRSMLYDALEYTEQVEALKRRHRECGDRLEGDAFLSGITRDDRIMPVVTLTVYHGAK-P 161 Query: 129 YPLSMCWFDMFYSP-------ELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + +DM L + + LV++ E + + + + K+ Sbjct: 162 WDGPRSLYDMLEMDRDSKEWEALKEVLPDYRLNLVELNNMQHLERFRSS-LQPIFTVLKY 220 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 ++D E L +E V +L EQ L + G E M Sbjct: 221 -NRKDKRKFYEYLENHREELRKMDDDSVRAMLALLG----EQKRLLRMLELPGGEGKERM 275 Query: 242 M--TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 G E+G +G+ E E L G R Sbjct: 276 DVYNAIDELIADGREEGKAEGKAEGRVEGKAIGLELGQKR 315 >UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=cellular organisms RepID=A5CBY6_ORITB Length = 324 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 54/318 (16%), Positives = 106/318 (33%), Gaps = 33/318 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEE----SLKGHSTDVLY 65 D FK+ + +D L L L + +E I + S K DVL Sbjct: 12 DVAFKKIFGSEKN-KDILIHFLNDILLFEGNREITEVEFLGTILDADIASKKESIVDVL- 69 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD----KLPLVVPILF 121 + + Y+ IE Q P + R Y+ A R + L V+ I Sbjct: 70 -CKDKNGAQYI---IEMQVDPTQGFEKRAQYYAAKAYGRQPNRGKEGKYSDLKEVIFIAI 125 Query: 122 YQGEATPYPLS-MCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI-AILELLQ 179 + P + + + + F +++ + + + I Sbjct: 126 ADYKLFPNKEDYISRHVILDKKTYEHDLKDFSFTFIELPKFKKNRVEELSDITEKWCYFF 185 Query: 180 KHIRQRDLMLLLEQLVT--LIDEGYTSGSQLVAMQNYMLQRGHT----------EQADLF 227 KH ++ L + + +I Y + Q ++ ++ E L Sbjct: 186 KHAKETTLDGYHKIIGEDLIIKRAYEALDQFNWSEDELITYEQELKRIWDNKAVEDYKLE 245 Query: 228 YGVLRDRETGGESMMTLAQ-----WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 + G + L + + +G +G +G+ E ++FA +LL +S E + Sbjct: 246 RAKAEGIKLGEAKGIKLGEAKGKAEGKAEGKAEGKAEGKAEAKKDFAIKLLKSELSVETI 305 Query: 283 AEMANLPLAEIDKVINLI 300 AE +L + E+ + N + Sbjct: 306 AEYTDLSIQEVLNLKNSV 323 >UniRef50_UPI00016C0F09 hypothetical protein Epulo_07618 n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0F09 Length = 328 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 57/320 (17%), Positives = 114/320 (35%), Gaps = 38/320 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D K L D LE L L E + + + + LK + D+L + Sbjct: 11 DYAMKYILREKSNF-DILEGFLYALLNEEVKILEILESENNKNDIDLKSNRVDLLTRDEQ 69 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV----VPILFYQ-G 124 ++IE Q + R++ + + +L ++ + IL++ G Sbjct: 70 NRR-----MIIEIQYAAESDYLQRLLYETSRTIIDNLPDGARYEEVIKVISISILYFDIG 124 Query: 125 EATPYPLSMCWFDMFYSPELARRVYN----------------SPFPLVDITITPDD-EIM 167 Y M + D+ L + + L++ + D+ E Sbjct: 125 TKCVYKGEMSFRDIKTKENLIPEKADKYIGIKSKKKNYKEIFPEYYLINTRLFNDNVETD 184 Query: 168 QHRRIAILEL-------LQKHIRQRDLMLLLEQLVTLIDEGYTS--GSQLVAMQNYMLQR 218 I + + K I + L + ++ Y + + + Y Sbjct: 185 LDDWIYMFKNSEVREGATAKSIDKAKERLDVLKMSKKERSQYNNFLDERRKSASQYHTAY 244 Query: 219 GHTEQADLFYGVLRDRETGGES-MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 + + G+ + E G E + + ++GIE+GI+QG ++ +EF +R L K + Sbjct: 245 TEGIEQGIKQGIKQGIEQGIEQGIEQGIEQGIKQGIEQGIEQGIEQNKKEFVKRGLDKKI 304 Query: 278 SREDVAEMANLPLAEIDKVI 297 S +AE+A L + E+ +I Sbjct: 305 SIAMIAELAELSIEEVKAII 324 >UniRef50_C1DU30 Putative uncharacterized protein n=7 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DU30_SULAA Length = 313 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 51/292 (17%), Positives = 103/292 (35%), Gaps = 62/292 (21%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELR-ELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D + K + A +EI L ++ +L + L + D++ V+ Sbjct: 7 DLLLKHLFKNP--ATKLIEIILGKKVNWQLLQDSDLKIVKT---------READLV--VK 53 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 ++ N + IE QS D M +RM Y ++ D ++ + Y G+ Sbjct: 54 LEDNTI---LHIEIQSTNDPSMPYRMFEYFYLITDKYKPKD------LIQVCIYIGKEP- 103 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA--------------- 173 + S ++ + + L+DI P E++ + I Sbjct: 104 ---------LKMSDKIQFSDWTYRYRLIDIKDIPCKELITSQNITDKLLAGLCKIEDPKF 154 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQ------------RGHT 221 +E + K I+ + E ++ + +++Y+ Q R Sbjct: 155 YVENVIKEIKNANPKDRKELFTLFLEISKIRNNIEEEIRSYIRQEDFEMPITIEWTREEI 214 Query: 222 EQADLFYGVLR--DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQR 271 E + VL+ E E + Q ++G+E+G+QQG Q+ E ++ Sbjct: 215 ESYPVLRDVLKIGKEEGYKEGLQQGLQQGLKEGLEQGVQQGLQKGLIEGLRQ 266 >UniRef50_Q5L374 Transposase n=26 Tax=Bacillaceae RepID=Q5L374_GEOKA Length = 125 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 24/126 (19%), Positives = 53/126 (42%), Gaps = 7/126 (5%) Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 ++ L +QR L E + L +E V+ + ++ Sbjct: 2 LVRLELDEAKQRLLFGFFETYLRLSEEEEAKPRHEVSQM----ETKEAKRVMELIVSYEQ 57 Query: 234 RETGGESMMTLAQWFEE---KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 R + Q ++ +G+++G Q+G +E + +R+L+KG + + E+ LP+ Sbjct: 58 RGLEKGIQQGIEQGIKQGMKQGMKQGRQEGIEEGKLDVVKRMLAKGYDVDTIHELTGLPV 117 Query: 291 AEIDKV 296 +I++V Sbjct: 118 EKIERV 123 >UniRef50_B0JHW4 Transposase n=31 Tax=Cyanobacteria RepID=B0JHW4_MICAN Length = 288 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 47/111 (42%) Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQW 247 LL+ + T++ ++ + L Q +E E + + Sbjct: 176 QELLQLIETILVYKLPLLNRREIETMFSLDELKQTQYFQDVREEARQEGREEGIEQGIEQ 235 Query: 248 FEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 E+GIE+GI+QGR + E RLL+ G+S E VA L + ++ + N Sbjct: 236 GIEQGIEQGIEQGRLNKALEAVPRLLALGLSVEQVASALELEVKQVRAIQN 286 >UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Roseburia inulinivorans DSM 16841 RepID=C0G0A4_9FIRM Length = 319 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 41/121 (33%), Gaps = 17/121 (14%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D VF+ + + DL + LE+ ++ D+ + + Sbjct: 58 DTVFRMLFSDRKNLLSLYNAVNQSNYKNPEDLEIVTLENAIYMGIK-----NDLAF---I 109 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH------DKLPLVVPILFYQ 123 YL+ EHQS + M R + Y + + ++ K+P I FY Sbjct: 110 MDTNLYLY---EHQSTYNPNMPLRDLFYICSEYQKLVDKKSLFSSTLQKIPAPNFIEFYN 166 Query: 124 G 124 G Sbjct: 167 G 167 >UniRef50_C1TQY0 Putative transposase, YhgA n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQY0_9BACT Length = 133 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 15/60 (25%), Positives = 30/60 (50%) Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 E + +G+EKG Q+G ++ A R++ KGM E ++E+ L + ++ + Sbjct: 61 EKEGLRKGIHRGRREGMEKGRQEGLRKALARTAMRMIEKGMDLETISELTGLDIDKVRDM 120 >UniRef50_C1J8S3 YdgA n=6 Tax=Escherichia coli RepID=C1J8S3_ECOLX Length = 68 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 8/63 (12%) Query: 242 MTLAQWFEEKGI--------EKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 MT+A+ +KG ++G ++G EV++E A RL G E + E+ L E+ Sbjct: 1 MTIAERLIQKGFDEGFKESFKEGFKEGALEVAREIACRLRDMGWPPERIQEVTGLSGEEL 60 Query: 294 DKV 296 K+ Sbjct: 61 KKL 63 >UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia RepID=A6MYW5_9RICK Length = 296 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 102/292 (34%), Gaps = 11/292 (3%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ E +D L + + + + + L + + ++ + +L ++ Sbjct: 9 DLAFKKIFGVEEN-KDLLISLINSIVSKEDQIVDVTLLN-PYNPQNFRNDKLSIL-DIKA 65 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 G G IE Q + R + Y L+A D L I + T Sbjct: 66 LGESGK-RFNIEIQITDEADYDKRALYYWAKLYTEALQASQDYSSLNKAIGIHILNFTSI 124 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 P + + ++F+ E + F +++ ++ + + ++L+K D+ Sbjct: 125 PETNKYHNIFHITEKDSGLL--YFKDLELHTIELNKFSNNPNEELADILKKVGNSLDIWS 182 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH-----TEQADLFYGVLRDRETGGESMMTL 244 L++ A L +E+ D + L+ ++ Sbjct: 183 AFLTRHDLLNSNNLPKKLDNASLKKALTVLDVMNFTSEERDAYEDHLKWLRIEANTLKKY 242 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +G +GIQ G+ E A+ L G++ ++E L +I+++ Sbjct: 243 EAQARVRGKVEGIQIGKTEEKIAIARNLKRSGVAITIISESTGLTKKQIEEL 294 >UniRef50_B0JU44 Putative uncharacterized protein n=4 Tax=Microcystis aeruginosa RepID=B0JU44_MICAN Length = 72 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 36/65 (55%) Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLA 291 R+ + + + +KG ++G+Q+G++E +++ A ++LS G ++A +L Sbjct: 2 RESVIYQDILEEGEEKGLQKGRQEGLQEGKEEKARQIALKMLSAGFPIPEIARFTDLSPD 61 Query: 292 EIDKV 296 I+++ Sbjct: 62 AIEEL 66 >UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAN2_9SPHI Length = 317 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 43/312 (13%), Positives = 89/312 (28%), Gaps = 32/312 (10%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+ + + D L + + DL E H ++ Sbjct: 13 DFAFKKIFGGDPNKDLLIDLLNALFKGR-KIIIDLTYNKNEHPGD-----SEHEGAAVFD 66 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD-----KLPLVVPILF 121 + G G +IE Q + R + Y+ + + +L V I Sbjct: 67 LLCTGQNGE-QFIIEIQRAKQENFKERALFYTSRLISSQAPKGNRASWGYRLTEVYLIAL 125 Query: 122 YQGEATPYPLSMCWFD--MFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRIAILELL 178 + + + + Y + +++ + L Sbjct: 126 MEDTTLNDESEHEFLHDICLCKRDTGKVFYEKLGYLYIELRKFVKSSTELQTDLDRWLFL 185 Query: 179 QKHIRQRD---------LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG 229 K++ D + L + + + Y + + G Sbjct: 186 LKNLSSMDKIPVYLRKPIFEKLFSIAEYSNLSKEEKMSYDSRMKYKWDNENVREYARKEG 245 Query: 230 VLRDRETGGESMM---TLAQWFE--EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 + + E G E L E +G +G +GR+E + + A + S + + +A Sbjct: 246 LEKGLEEGREKGRLEGKLEGKLEGKLEGKLEGKLEGRKEAAIKIAGEMKSANLPLDQIAR 305 Query: 285 MANLPLAEIDKV 296 L L EI+ + Sbjct: 306 FTKLSLEEIEGI 317 >UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV81_PEDHD Length = 318 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 48/312 (15%), Positives = 94/312 (30%), Gaps = 48/312 (15%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ E ++ L L + + + F E + ++ V Sbjct: 28 DFSFKRLFATEE-SKPILIGLLNHLFKGRKYITEIEYGKNEFPGEIAQEGGA--VFDVYC 84 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 G +IE Q + R + Y A+ G+ + Sbjct: 85 TDVNGS-KFIIEVQRGNQEYFKERALFYVSRAISEQAPK---------------GDRKGW 128 Query: 130 PLSMC------WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 + + + F P+ + Y L +E+L Sbjct: 129 AYKLTEVYLLAFLEDFNLPDSPKSEYVQDICLA--NRHTGIIFYDKVGFIFIEMLNFVKG 186 Query: 184 QRDLMLLLEQLVTLIDEGYT-----------SGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 +L L++ + + QL + NY E+ D++ L+ Sbjct: 187 SDELYTELDKWLYALKHLTEFKQRPEYLSGPEFDQLFTLANYASLT--PEERDMYNSSLK 244 Query: 233 DRETGGESM-----MTLAQWFE---EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 + + +L Q E E+G E+G +QG + + E A +L E++ + Sbjct: 245 RKWDNKNVLDYAVKKSLEQGLEQGLEQGREQGREQGIHKKAIEIALEMLVNKYPIEEIIK 304 Query: 285 MANLPLAEIDKV 296 + L EI + Sbjct: 305 LTKLSKEEIQSL 316 >UniRef50_C3PNP1 Transposase and inactivated derivative n=1 Tax=Rickettsia africae ESF-5 RepID=C3PNP1_RICAE Length = 114 Score = 54.2 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 45/111 (40%), Gaps = 19/111 (17%) Query: 205 GSQLVAMQNYMLQRGHTEQADLFYGVLRDR---ETGGESMMTLAQWFEEKGIE------- 254 + + Y L + +L + E G M +LAQ ++++G E Sbjct: 3 DCIIYRLLYYTLTKIEQADRIKLEDLLSTKLNPEIGTRLMRSLAQHWQQEGKELGVLEGL 62 Query: 255 -----KGIQ----QGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 KGIQ +G+ E E A+ +LS+G + ++ + L A I + Sbjct: 63 QVGEAKGIQIGEAKGKAEERVEIAKEMLSQGCNISLISSVTGLDEAFISSL 113 >UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostridiales RepID=A6BF26_9FIRM Length = 366 Score = 54.2 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 54/318 (16%), Positives = 112/318 (35%), Gaps = 46/318 (14%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F+ E E + L + LE+ ++ D+ + + Sbjct: 58 DTIFRMLYHDKENLLSLYNAVNGREYTDPEKLQVVTLENAIYMG-----MKNDLAF---I 109 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH------DKLPLVVPILFYQ 123 YL+ EHQS + + R + Y R + K+P ++FY Sbjct: 110 MDMNLYLY---EHQSTYNPNIPLRNLFYIADEYQRLVVRKSLYSTVIQKIPTPRFLVFYN 166 Query: 124 GEAT-----PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH-----RRIA 173 G + LS + + +P+L RV ++++ ++M+H Sbjct: 167 GTKEVEDRSEFRLSSAYENPTENPDLELRV-----TMLNVNDGHSSDLMEHCRTLKEYAQ 221 Query: 174 ILELLQKHIRQRDL---MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGV 230 + ++K+ ++D+ + + I+EG + L + + + Sbjct: 222 YVARVRKYAAKQDVSLEEAVTRAVDECIEEGILAEFLLKNKTEVIRVSIYEYDKEFEEKK 281 Query: 231 LRDRETGGESMMTLA-----------QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 LR E + Q E G + GI+ G++ + ++ ++ L KG S Sbjct: 282 LRKAEYEAGRQDGIEIGRQDGIEIGRQDGIEIGRQDGIEIGKRILLEKIIKKKLKKGKST 341 Query: 280 EDVAEMANLPLAEIDKVI 297 E +A+ + I KV+ Sbjct: 342 EQIADELEEDINIIQKVV 359 >UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A7T9_9CLOT Length = 271 Score = 53.8 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 52/295 (17%), Positives = 101/295 (34%), Gaps = 42/295 (14%) Query: 10 DAVFKQFLM---HAETARDFLEIHL-PVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D VFK + + FL L P +L ++ + + ++IE+ DV Sbjct: 10 DFVFKNIFGSEKNPKILISFLNATLKPKDLITSVEIKNTDI-NKNYIEDKF--SRLDV-- 64 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD----KLPLVVPILF 121 + + + IE Q K + M R + Y L D K + + IL Sbjct: 65 KAKTSNDEI---INIEIQLKNEYNMIKRSLYYWSKLYSEQLGEGQDYSVLKRTICINILN 121 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 ++ T S YS E V F ++I D + + +E L+ Sbjct: 122 FKYLKTRKFHSGYRLKEIYSNEELTNVAEIHF--IEIPKLDDGADEKDMLVNWIEFLK-- 177 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + + + ++ ++ +++ + + RE Sbjct: 178 ----------DPESETVRSLEMNIEEIRQAKDELIRMSNDD---------TQREIYEMRA 218 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 TL + + ++G Q+ +E A+ LL + E +A L + EI+K+ Sbjct: 219 KTLRDKISA--LNEAERKGIQQGKREIAKALLDV-LDIETIALKTGLSIDEINKL 270 >UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1Q2_EUBE2 Length = 321 Score = 53.8 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 53/332 (15%), Positives = 104/332 (31%), Gaps = 51/332 (15%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFL-------EIHLPVELRELCDLN-TLHLESGSFIE 53 +T D K F E D + L + D + + + S S+ E Sbjct: 4 SNRTTHQKDVSLKTFWRDNEHFADLFNATVFNGKQVLKPDKLTEMDTDVSATIHSKSYNE 63 Query: 54 ESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH---LEADH 110 + DV+ +M + + +E Q K M R M Y + ++ H Sbjct: 64 SITRNR--DVVK--KMSDGVEFNILGLEIQDKTHYAMPLRTMTYDALGYIKEYNDIKKHH 119 Query: 111 D--------------------KLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYN 150 + ++ ++ Y GE+ + C DM S + Y Sbjct: 120 KLNKDSFSSHEEFLSGINKSDRFHPIITLVLYYGESL-WDGPTCLSDMMISMPDNIKAYF 178 Query: 151 SPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLV-TLIDEGYTSGSQLV 209 S + L + I D+ + RD+ ++ + D Y Sbjct: 179 SDYKLNLVQILDSDK-----------YTFYNEDVRDVFNIIRNIYNDDFDSIYREYESRN 227 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA-QWFEEKGIEKGIQQGRQEVSQEF 268 + M + + D E GG M A + F+ + KG+++G Sbjct: 228 VDIDVMELICNITSVPKLMDLCTDTEQGGTVNMCEAMKRFQAECESKGMKEGIDSEKVNS 287 Query: 269 AQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 +L G+++E + + ++++ I Sbjct: 288 IISMLEFGITKEQI--LTRYTKEDLERAEAAI 317 >UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bacteroidales RepID=A6LF36_PARD8 Length = 273 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 41/292 (14%), Positives = 91/292 (31%), Gaps = 33/292 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D F + E ++ L L D+ + + E+L +++ V Sbjct: 10 DFGFHRIFGQ-EVHKELLIDFLNQLFFGEHDIEDITFLNPIQTPETLDDRG--IVFDVHC 66 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI--LFYQGEAT 127 + + G L V +E Q+ R + Y A+ + D + P+ +F Sbjct: 67 KDSNGNLFV-VEMQTGAQPYFHDRGLYYLARAISNQGQKGKDWKFALQPVYGVFLLNYKM 125 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLV-DITITPDDEIMQHRRIAILELLQKHIRQRD 186 + E R + + ++ + L KH+ Sbjct: 126 DVNSKFRTDVILADRETGRMFSDRIRQVYLELPYFQKEPDECENDFERWIYLLKHMD--- 182 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF--YGVLRDRETGGESMMTL 244 TL + + + + + + + L+ ++ Sbjct: 183 ---------TLERMPFKAKKAVFDKLLEVADVANLSKEERIQYDEALKRYRDYKNTI--- 230 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + EEKGI KG + A+ + ++G++ + + L L +I+K+ Sbjct: 231 -DYAEEKGILKGKE--------STARNMKAEGIAPLIIQKCTGLSLEDIEKL 273 >UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia intestinalis ATCC 50581 RepID=C6LTE0_GIALA Length = 353 Score = 53.4 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 93/297 (31%), Gaps = 32/297 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI--EESLKGHSTDVLYSV 67 D VF Q E + L L L+ + + ++ K D+ Sbjct: 73 DFVFYQIFG-VEKHKSVLISLLNSILKGNPHVKDVRIDPTEHKRTTPDGKSVRLDI--KA 129 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH---LEADHDKLPLVVPILFYQG 124 + V +E Q + R + Y + + + +P V+ I Sbjct: 130 TINDGTI---VDVEMQCINTGDIYHRSIYYQSLILRDYTIKQGQSYKSIPDVIIIWIMN- 185 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 C ++ P+ EI + + L K Sbjct: 186 -QDITNRKGCMHEIV--------------PMYKANGIDQIEIASEKMRQFIIELTKLGNT 230 Query: 185 RDL--MLLLEQLVTLIDEGYTSGSQLVAMQNY---MLQRGHTEQADLFYGVLRDRETGGE 239 + +T I + + +L+ ++ M + + + + R Sbjct: 231 SNFCYNKAFTAWMTFIKDPSSISGELLEVEGVQTAMKELTYLSENKETRAIYDARRIALL 290 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + + + EKG +G+ +GR + + A+++LS G+ E + + L + EI+ V Sbjct: 291 DLNSAIEHGIEKGKAEGLVEGRDKERERMAEQMLSDGLDIEFIVRYSGLSMQEIENV 347 >UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q938_9SPIR Length = 326 Score = 53.4 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 42/299 (14%), Positives = 99/299 (33%), Gaps = 34/299 (11%) Query: 10 DAVFKQFLMH---AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D + H A +F+ N + + + I E+ + V Sbjct: 50 DYFVRYLFSHDGNENIALNFINAVFKD--LNFETFNKIEILNPFNISENYDEKESIVDIK 107 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + V+IE QS+ ++ R + Y L FY G Sbjct: 108 ATTETG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGS----------FYDGLK 154 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL----LQKHI 182 +S+ + + E S + L ++ + + H ++ LEL L+ Sbjct: 155 P--TVSINITNFILTDEDKVH---SCYVLKELN--NNKILTDHCQLHFLELPKFNLKDIS 207 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 L + ++ ++ I ++ +N + + + + + Sbjct: 208 AIESLDNIHKEFISWIKFFKGEDMSILMKENTIFEEVEKKCLTFVNDSPVIDKYKKREVD 267 Query: 243 TLA-----QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 T + +K E+GI++G +E A+ + + + ++++ L + EI+ + Sbjct: 268 TYFFNKSMELDIKKAKEEGIKEGIKENQILTAKNMKKENIDINIISKITGLSIQEIENL 326 >UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=Q24Y19_DESHY Length = 248 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 74/245 (30%), Gaps = 33/245 (13%) Query: 78 VVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLP--LVVPILFYQGEATPYPLSM 133 + IE Q M R + Y R + + +L + + I+ + Sbjct: 3 INIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDFNYLKQTSNYHN 62 Query: 134 CWFDMFYSPE------LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 + + L P L + + LL + +++ Sbjct: 63 VFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRRREIS--LWENELVRWLLLLEGADNQEI 120 Query: 188 MLLLEQLV-------------------TLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + +LE++ I E Y + + + ++ + Sbjct: 121 LQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIREAELRLQEALE 180 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 + G + +A+ + +G +G +GR E E A++LL G +AE L Sbjct: 181 EGMAKGIAEGRAKG-IAEG-KAEGKAEGRAEGRAEGRAEVAKKLLVLGFEITKIAEATGL 238 Query: 289 PLAEI 293 EI Sbjct: 239 SEEEI 243 >UniRef50_Q9L0J0 Putative uncharacterized protein SCO4675 n=4 Tax=Streptomyces RepID=Q9L0J0_STRCO Length = 302 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 45/304 (14%), Positives = 93/304 (30%), Gaps = 37/304 (12%) Query: 6 TTPHDAVFKQFLMHA---ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 ++ H+A+ + F FL I LP + E S D Sbjct: 3 SSSHEAMHRIFQHDPGLFSRVTHFLGIDLPRPIGA-------TALPTDLTEASPVERRVD 55 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L + G + +E Q K D Y ++ +LP + ++ Sbjct: 56 TLLRFETAER-GPFLLAVEAQGKKDPDKPASWAYYVSYLWTKY------RLPTALLVVCQ 108 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLV----DITITPDDEIMQHRRIAILELL 178 + P L R P+V ++ + D + + + Sbjct: 109 DHATAKWAQRAVTSGPPELPTLTLR------PVVAGPHNMPVITDPDEARADLVLASLAA 162 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 H + + +L+ L T + + + + + V D Sbjct: 163 ITHAAEPVVNAILKALSTALSDAPEDIAAPIVEFTAHGLGNRPARHLWRNLVAVDLSFYK 222 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVS-QEFAQRLL----SKGMSRED--VAEMANLPLA 291 +++ ++G E+G +QGR++ Q+ AQ +L +G+ D + Sbjct: 223 S---YISEEIRDEGREQGREQGREQGRAQQGAQDVLLVLEQRGLDIPDGVRTRITECGDP 279 Query: 292 EIDK 295 E+ + Sbjct: 280 EVLR 283 >UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKU9_9CYAN Length = 323 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 92/304 (30%), Gaps = 23/304 (7%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVEL---RELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+ + + D L L + + + T+ E+LK D+ Sbjct: 12 DYAFKKIFGS-DQSEDILISFLNAIVYNGKSVISSLTIVNPYNPGQVETLKDSYLDI--R 68 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKL--PLVVPILFYQG 124 + V+IE Q R+ A L + L V+ + Sbjct: 69 AVLNSGEI---VLIEMQVARIAAFYKRVTYNLCKAYANQLTSGDYYLEITPVIAVTITDF 125 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 +F E + L+ + + + L +K I Sbjct: 126 ILFKENPKCIHHFVFKDKESSSEYPEHELQLI----FVELPRFVKKLPELQTLAEKWIYF 181 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-----DLFYGVLRDRETGGE 239 LE++ + E L L E+ L + R + E Sbjct: 182 MTQAQDLEEIPESLAEVTAIEKALTIANQANLTPAEAEEVSRRAMQLRDEIGRIKYATEE 241 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVA---EMANLPLAEIDKV 296 + + ++G ++G Q+GR ++ RLL+K + + L L+ ++ + Sbjct: 242 ASKEAREEGRQEGRQEGRQEGRITEARALVLRLLNKRFPDQTAELNSLVEGLSLSALEGL 301 Query: 297 INLI 300 + + Sbjct: 302 SDAM 305 >UniRef50_C1QAK6 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAK6_9SPIR Length = 290 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 63/153 (41%), Gaps = 13/153 (8%) Query: 144 LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYT 203 L P L + D+ + L + +K + ++M +L + ++ E Y Sbjct: 151 LEMHFLELPKYLFSSSRLTDE---LYAWFYFLTIKEKREKMEEIMEMLVKKNPIMKEVYD 207 Query: 204 SGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQE 263 ++ V ++ L +TE ++ +L E + E+G+++GI++G + Sbjct: 208 EYNKFVNTKD--LFDNYTEYEKNYFDMLALNEERIKG--------REEGLKEGIEKGEKN 257 Query: 264 VSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + A+ + + +++ L + EI+ + Sbjct: 258 KAISMAKNMKKDKVDFNTISKYTGLSIEEIENL 290 >UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YMI0_ANASP Length = 314 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 46/312 (14%), Positives = 108/312 (34%), Gaps = 45/312 (14%) Query: 10 DAVFKQFLM--HAETARDFLEIHLPVELRELCDLNTL-HLESGSFI----EESLKGHSTD 62 D+ +K+ L + + F E L + + F E D Sbjct: 11 DSPWKEILEAYFPQAVQFFF-----PETAALINWERPYEFLNTEFQQIAREAEQGKPYAD 65 Query: 63 VLYSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 L V Q+QG +L + +E Q++ + + RM Y+ R + + + Sbjct: 66 QLVKVWQIQGEEIWLLIHVEIQAQKEDDFSKRMFTYNFRIFDRFEK-------PAISLAI 118 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPD--DEIMQHRRIAILELLQ 179 + S ++ + N F +V + + DE+ + ++ Sbjct: 119 LCDTNRQWRPSNYSYNYPQTR------LNFEFGIVKLLDYENRFDELENNTNP-FATVVM 171 Query: 180 KHIRQRDL-----------MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 H++ + L+ +L L + + + ++++ + L Sbjct: 172 AHLKTQQTRSSPQERKIWKFSLIRRLYDLGLQEQDIRNLYRFI-DWVMILPKALENQLCS 230 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 V + + +T A+ G E+GIQ+G + + +R L + +S E + +L Sbjct: 231 EVQQLEQERTMRYVTSAERI---GYERGIQEGELGIILKLLKRRLGE-LSPEIQQRIQSL 286 Query: 289 PLAEIDKVINLI 300 + +++ + + Sbjct: 287 SVNQLENLSEAL 298 >UniRef50_C9LT45 Putative uncharacterized protein n=2 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LT45_9FIRM Length = 374 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 53/135 (39%), Gaps = 6/135 (4%) Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGY----TSGSQLVAMQNYMLQRGHTEQ 223 + ++I+ + + RQ+D L+ L L + L + + E+ Sbjct: 233 DYDLMSIITIYLGNERQQDEDWLIRFLQILFKDMEISPAAKKQLLKNEFDMDISADIEEE 292 Query: 224 ADLFYGVLRDRETGGES--MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 + G M + E+G+E+G+++GR+E + +L + E Sbjct: 293 MRTMCNLSTGIYEQGMERGMERGMERGMERGMERGMERGREEGKVDIVLEMLRNKLPLEM 352 Query: 282 VAEMANLPLAEIDKV 296 +A M+ L ++ ++ Sbjct: 353 IASMSKFSLEKVKEL 367 >UniRef50_D1PGQ2 Transposase, ISNCY family n=2 Tax=Prevotella copri DSM 18205 RepID=D1PGQ2_9BACT Length = 118 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 40/77 (51%), Gaps = 5/77 (6%) Query: 223 QADLFYGVLRDRETGGESMMTLAQWFEE---KGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 + L G+ + G E LA+ E KG+ +G+++G + S E A+++L+ GM Sbjct: 42 EKGLAEGMEKGLAEGMEKG--LAEGMEMGLVKGLAEGMEKGMNKRSLEIARKMLANGMDA 99 Query: 280 EDVAEMANLPLAEIDKV 296 V E+ L +++ ++ Sbjct: 100 ATVMEITGLSESQLQQL 116 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 269 6e-71 UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 255 1e-66 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 254 3e-66 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 252 1e-65 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 251 3e-65 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 236 8e-61 UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 235 1e-60 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 233 5e-60 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 230 4e-59 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 228 1e-58 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 219 8e-56 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 216 7e-55 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 214 3e-54 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 214 3e-54 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 213 4e-54 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 212 1e-53 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 205 2e-51 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 196 9e-49 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 195 2e-48 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 192 2e-47 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 188 2e-46 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 185 2e-45 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 177 3e-43 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 175 3e-42 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 174 3e-42 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 174 4e-42 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 170 5e-41 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 166 6e-40 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 166 1e-39 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 164 3e-39 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 164 3e-39 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 163 8e-39 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 161 2e-38 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 161 3e-38 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 160 8e-38 UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 159 1e-37 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 159 1e-37 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 155 2e-36 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 155 2e-36 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 153 5e-36 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 153 8e-36 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 153 9e-36 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 148 2e-34 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 148 2e-34 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 148 2e-34 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 148 3e-34 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 147 4e-34 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 146 8e-34 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 146 1e-33 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 146 1e-33 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 146 1e-33 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 145 1e-33 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 145 1e-33 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 144 4e-33 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 144 4e-33 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 143 7e-33 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 142 1e-32 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 141 2e-32 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 140 6e-32 UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 138 2e-31 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 137 4e-31 UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 136 6e-31 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 136 7e-31 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 134 2e-30 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 134 4e-30 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 134 4e-30 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 134 4e-30 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 132 2e-29 UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 132 2e-29 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 131 3e-29 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 129 1e-28 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 128 2e-28 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 128 3e-28 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 127 5e-28 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 125 2e-27 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 125 2e-27 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 124 3e-27 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 124 3e-27 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 124 5e-27 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 122 1e-26 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 122 2e-26 UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NI... 121 3e-26 UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 119 9e-26 UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseifl... 119 2e-25 UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 119 2e-25 UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostri... 118 2e-25 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 117 4e-25 UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synecho... 117 5e-25 UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 117 5e-25 UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 117 6e-25 UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Trepone... 113 8e-24 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 112 1e-23 UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorell... 111 3e-23 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 110 5e-23 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 110 7e-23 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 107 6e-22 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 106 7e-22 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 106 7e-22 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 106 7e-22 UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostr... 106 1e-21 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 106 1e-21 UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuok... 105 2e-21 UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotro... 105 2e-21 UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadoba... 105 2e-21 UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bactero... 105 2e-21 UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID... 104 3e-21 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 104 4e-21 UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 102 2e-20 UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultu... 101 3e-20 UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacter... 101 3e-20 UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfi... 101 4e-20 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 99 1e-19 UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax... 99 1e-19 UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenom... 99 2e-19 UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax... 99 2e-19 UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillu... 98 3e-19 UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabac... 98 3e-19 UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubact... 98 3e-19 UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=... 98 3e-19 UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 98 4e-19 UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW 97 6e-19 UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 97 8e-19 UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia... 96 9e-19 UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bactero... 96 9e-19 UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenom... 96 1e-18 UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermo... 96 1e-18 UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacter... 95 3e-18 UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobac... 95 4e-18 UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanoth... 94 5e-18 UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostri... 94 5e-18 UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostri... 93 9e-18 UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolba... 93 1e-17 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 93 1e-17 UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=... 93 1e-17 UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostri... 92 2e-17 UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Ricket... 92 2e-17 UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobac... 91 4e-17 UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfi... 91 4e-17 UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermo... 91 4e-17 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 90 7e-17 UniRef50_C8PT67 Putative uncharacterized protein n=1 Tax=Trepone... 90 1e-16 UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingoba... 89 2e-16 UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevote... 89 2e-16 UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachys... 89 2e-16 UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 88 3e-16 UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteri... 88 3e-16 UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax... 88 4e-16 UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacte... 88 4e-16 UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostri... 88 4e-16 UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microco... 88 5e-16 UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobac... 87 6e-16 UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachys... 87 8e-16 UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bactero... 87 8e-16 UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=P... 86 1e-15 UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax... 86 2e-15 UniRef50_B3QUJ9 Putative uncharacterized protein n=8 Tax=Bacteri... 85 2e-15 UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia... 85 2e-15 UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacte... 85 3e-15 UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostri... 85 4e-15 UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 85 5e-15 UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Trepone... 84 5e-15 UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax... 84 5e-15 UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotro... 84 6e-15 UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methano... 84 6e-15 UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorell... 84 7e-15 UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacter... 83 8e-15 UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostri... 83 8e-15 UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabac... 83 8e-15 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 83 8e-15 UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryante... 83 9e-15 UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255... 83 1e-14 UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadoba... 83 1e-14 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 83 2e-14 UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiat... 82 3e-14 UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostr... 82 3e-14 UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminoc... 81 3e-14 UniRef50_Q9L0J0 Putative uncharacterized protein SCO4675 n=4 Tax... 81 3e-14 UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostr... 81 3e-14 UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostri... 81 4e-14 UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microco... 81 4e-14 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 81 5e-14 UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax... 81 5e-14 UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptosp... 81 5e-14 UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscil... 81 5e-14 UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachys... 81 5e-14 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 81 5e-14 UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacter... 81 6e-14 UniRef50_B4VZ11 Putative uncharacterized protein n=1 Tax=Microco... 81 6e-14 UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminoc... 81 6e-14 UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Rumino... 80 9e-14 UniRef50_C8NHS0 Putative uncharacterized protein n=1 Tax=Granuli... 80 1e-13 UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xen... 80 1e-13 UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulic... 80 1e-13 UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=... 80 1e-13 UniRef50_C9RP54 Putative uncharacterized protein n=1 Tax=Fibroba... 80 1e-13 UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanoba... 79 2e-13 UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacte... 79 2e-13 UniRef50_B0MQP0 Putative uncharacterized protein n=2 Tax=Eubacte... 78 3e-13 UniRef50_C9LUC8 Putative uncharacterized protein n=5 Tax=Selenom... 78 3e-13 UniRef50_B4VQ19 Putative uncharacterized protein n=3 Tax=Microco... 78 3e-13 UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachys... 78 3e-13 UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria ... 78 3e-13 UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Rosebur... 78 3e-13 UniRef50_B4VTF8 Putative uncharacterized protein n=7 Tax=Oscilla... 78 3e-13 UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coproco... 78 3e-13 UniRef50_Q3ARU8 Putative uncharacterized protein n=12 Tax=Chloro... 78 4e-13 UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacte... 78 4e-13 UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostri... 78 4e-13 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 78 4e-13 UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostri... 78 5e-13 UniRef50_C9RMD5 Putative uncharacterized protein n=1 Tax=Fibroba... 77 6e-13 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 77 7e-13 UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacte... 77 7e-13 UniRef50_Q8YK35 All8083 protein n=6 Tax=Cyanobacteria RepID=Q8YK... 77 7e-13 UniRef50_C9LT45 Putative uncharacterized protein n=2 Tax=Selenom... 77 8e-13 UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfi... 77 8e-13 UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YM... 76 1e-12 UniRef50_Q8ZS56 Alr7656 protein n=6 Tax=Nostocaceae RepID=Q8ZS56... 76 1e-12 UniRef50_C0R2N1 Putative uncharacterized protein n=4 Tax=Wolbach... 76 1e-12 UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotro... 76 2e-12 UniRef50_C0DB21 Putative uncharacterized protein n=2 Tax=Clostri... 76 2e-12 UniRef50_Q8YQI6 All3837 protein n=4 Tax=Cyanobacteria RepID=Q8YQ... 76 2e-12 UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorob... 76 2e-12 UniRef50_A7BPH0 Putative uncharacterized protein n=5 Tax=Beggiat... 75 2e-12 UniRef50_C9LXS5 Transposase n=3 Tax=Selenomonas sputigena ATCC 3... 75 3e-12 UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatex... 75 3e-12 UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiat... 75 3e-12 UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobac... 75 3e-12 UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibroba... 75 4e-12 UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8Y... 75 4e-12 UniRef50_A7C3X3 Putative uncharacterized protein n=7 Tax=Beggiat... 75 4e-12 UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiat... 75 4e-12 UniRef50_Q1NU37 Putative uncharacterized protein n=1 Tax=delta p... 75 4e-12 UniRef50_C4Z2A6 Putative uncharacterized protein n=2 Tax=Eubacte... 75 5e-12 UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldice... 74 5e-12 UniRef50_Q2FSM2 Putative uncharacterized protein n=3 Tax=Methano... 74 5e-12 UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotro... 74 6e-12 UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax... 74 7e-12 UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacteriu... 74 8e-12 UniRef50_A6FZY9 Putative uncharacterized protein n=2 Tax=Plesioc... 73 8e-12 UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 ... 73 9e-12 UniRef50_Q6ZEK6 Slr5124 protein n=11 Tax=Chroococcales RepID=Q6Z... 73 1e-11 UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfo... 72 2e-11 UniRef50_D2RKL8 Tetracycline resistance leader peptide n=3 Tax=A... 72 2e-11 Sequences not found previously or not previously below threshold: UniRef50_C6Y2C7 Putative uncharacterized protein n=2 Tax=Pedobac... 77 8e-13 UniRef50_B7K6I4 Putative uncharacterized protein n=2 Tax=Cyanoth... 76 2e-12 UniRef50_C6XVH2 Putative uncharacterized protein n=1 Tax=Pedobac... 75 3e-12 UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachys... 75 3e-12 UniRef50_B7I1C8 Putative uncharacterized protein n=16 Tax=Bacill... 74 6e-12 UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=... 73 1e-11 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 269 bits (688), Expect = 6e-71, Method: Composition-based stats. Identities = 300/300 (100%), Positives = 300/300 (100%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS Sbjct: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL Sbjct: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK Sbjct: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES Sbjct: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI Sbjct: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 255 bits (651), Expect = 1e-66, Method: Composition-based stats. Identities = 144/309 (46%), Positives = 207/309 (66%), Gaps = 13/309 (4%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + TPHDA F+QFL + ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + + Sbjct: 4 KNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFS 63 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DVLYS++ GY+HV++EHQS PDK MAFR++RY++AAM RHLEA H KLPLV+P+LF Sbjct: 64 DVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVLF 123 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G+ +PYP S W D F LA ++Y+S FPLVD+T+ PDDEI HR +A L LLQKH Sbjct: 124 YTGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKH 183 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGGES 240 I QRDL L+++L ++ GY S SQ++++ +Y++Q G T A+ + + G++ Sbjct: 184 IHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDA 243 Query: 241 MMTLAQWFEEKGIEKGIQ------------QGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 +MT+AQ E+KGIEKGIQ +G +E + + A+ +L + R V +M L Sbjct: 244 LMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTGL 303 Query: 289 PLAEIDKVI 297 ++ ++ Sbjct: 304 TEDDLAQIR 312 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 254 bits (647), Expect = 3e-66, Method: Composition-based stats. Identities = 291/314 (92%), Positives = 295/314 (93%), Gaps = 16/314 (5%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MDAPSTTPHDAVFKQFLMHAETARDFL+IHLP ELRELCDL+TLHLESGSFIEESLKGHS Sbjct: 1 MDAPSTTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHS 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL Sbjct: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK Sbjct: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG+S Sbjct: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKS 240 Query: 241 MMTLAQWFE----------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 MMTLAQWFE EKGIEKGIQQGRQEVSQEFA RLLSKGM REDVAE Sbjct: 241 MMTLAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPREDVAE 300 Query: 285 MANLPLAEIDKVIN 298 MANLPLAEIDK+IN Sbjct: 301 MANLPLAEIDKLIN 314 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 252 bits (642), Expect = 1e-65, Method: Composition-based stats. Identities = 149/309 (48%), Positives = 205/309 (66%), Gaps = 13/309 (4%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + TPHDA F+QFL E ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + + Sbjct: 4 KNSTPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFS 63 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DVLYS+ GY+HV+IEHQS PDK MAFR++RY+IAAM RHLEA H KLPLV+P+LF Sbjct: 64 DVLYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAKLPLVIPVLF 123 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G+ +PYP S W D F PELA ++Y+ FPLVD+T+ PDD+IM+HR +A L LLQKH Sbjct: 124 YVGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQKH 183 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGGES 240 I QRD+ L ++L TL+ Y S Q++A+ +Y+LQ G + ++ + + G++ Sbjct: 184 IHQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDA 243 Query: 241 MMTLAQWFEEKGIEKGIQQGR------------QEVSQEFAQRLLSKGMSREDVAEMANL 288 +MT+AQ E+KGIEKG +GR ++ E A+ LL GM E V E L Sbjct: 244 LMTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATGL 303 Query: 289 PLAEIDKVI 297 ++ ++ Sbjct: 304 SEDDLAQIR 312 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 251 bits (640), Expect = 3e-65, Method: Composition-based stats. Identities = 139/307 (45%), Positives = 202/307 (65%), Gaps = 10/307 (3%) Query: 1 MDAP-STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M + TPHDAVF+QFL TA+DF +I LP +++ LCD TL ESGSFI+ +K + Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D+LYSV G GY++ +IEHQS PDK MA+R+MRYS+AAM RHLEA HDKLPLV P+ Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDKLPLVFPV 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 LFY GE +P+P S W D F P++A ++Y+ PF L+D+T DD IMQHRR+A+LEL+Q Sbjct: 121 LFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELIQ 180 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGG 238 KHIR+RD+ LL+ +V L+ Y + +Q+V M NY++Q G+ + + E Sbjct: 181 KHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKHE 240 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQ--------EFAQRLLSKGMSREDVAEMANLPL 290 E++MT+A+ +++G + G GRQE Q + A+++LS+G++R+ V L Sbjct: 241 EALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLSD 300 Query: 291 AEIDKVI 297 +D ++ Sbjct: 301 NALDNLM 307 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 236 bits (601), Expect = 8e-61, Method: Composition-based stats. Identities = 143/299 (47%), Positives = 202/299 (67%), Gaps = 1/299 (0%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + TPHDAVFKQFL ETA+DF +I LP E++ LCDL++L +ESGSFI+ +K + + Sbjct: 9 KKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNYQS 68 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+LYSV GY++V+IEHQS PDK +A+R+MRYS+AAM +HLE + +LPLV PILF Sbjct: 69 DILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQLPLVFPILF 128 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y GE +P+P S W D F +LA +YN+PF L D+T D EIMQH+RIA+LELLQKH Sbjct: 129 YCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELLQKH 188 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-DLFYGVLRDRETGGES 240 IR+RD+ LL+ +V L+ Y + +Q++ M NY++Q G+ ++ + + + E + Sbjct: 189 IRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKHEGA 248 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 +MT+AQ EE GI+KGIQQG Q+ E A++ L+ G+ R V L E++K N Sbjct: 249 LMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNKFENQ 307 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 150/292 (51%), Positives = 207/292 (70%), Gaps = 5/292 (1%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + ++TPHDAVFK FL H +TARDF++IHLP LR+LCDL TL LE SFI+E L+ + +D Sbjct: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 +L+SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY Sbjct: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 G +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHI Sbjct: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY-GVLRDRETGGESM 241 RQRDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F + E + Sbjct: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 MT+A E +G QG+ E + AQ +L +G+ RE V + L ++ Sbjct: 244 MTIADRLRE----EGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 233 bits (594), Expect = 5e-60, Method: Composition-based stats. Identities = 151/309 (48%), Positives = 206/309 (66%), Gaps = 17/309 (5%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 ++ +++PHDAVFK F+ ETARDFLEIHLP LR+LC+L TL LE SFIE+SL+ + + Sbjct: 3 ESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYS 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DVL+SV+ GY++ VIEHQS +K MAFR+MRY+ AAM RHL+ +D++PLVVP+LF Sbjct: 63 DVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLF 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y GEA+PYP S+ W D F P+LAR++Y FPLVDITI PDDEIMQHRRIA+LEL+QKH Sbjct: 123 YHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKH 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-DLFYGVLRDRETGGES 240 IR RDL+ +++++ TL+ G+T+ SQL + NY+LQ G T + + E Sbjct: 183 IRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQKEI 242 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQE----------------FAQRLLSKGMSREDVAE 284 +MT+A+ ++G + G Q+G+ E QE A R+L +G RE V Sbjct: 243 LMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREIVLA 302 Query: 285 MANLPLAEI 293 L A+I Sbjct: 303 ATQLTDADI 311 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 230 bits (586), Expect = 4e-59, Method: Composition-based stats. Identities = 133/308 (43%), Positives = 194/308 (62%), Gaps = 15/308 (4%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + + TPHDA F+ FL + + ARDFLE+HLP E R+LCDL+TL LE +F+E L +++ Sbjct: 6 NTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYAS 65 Query: 62 DVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D+L+SV+ G GY++ +IEHQS + M FRM+RYS+AAM RHLE H LPLV+P+L Sbjct: 66 DILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVL 124 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FY GE +PYP SM W D F +P LA ++Y PFPLVDIT+ D+EIM HRR+A L LL K Sbjct: 125 FYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLMK 184 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HIRQRD+++ L+ LV + + Q+ + NY+L + + + +S Sbjct: 185 HIRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNYLLNGSEHVTVEFLQTLAQRLPQHEDS 243 Query: 241 MMTLAQWFEEKGIEKGIQQGRQ------------EVSQEFAQRLLSKGMSREDVAEMANL 288 +MTLA+ +++GI++GIQQG Q + ++E A+ L + GM + ++ L Sbjct: 244 IMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAREIARELRNAGMPAAQICQLTGL 303 Query: 289 PLAEIDKV 296 AE+ + Sbjct: 304 SEAELKNI 311 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 228 bits (582), Expect = 1e-58, Method: Composition-based stats. Identities = 143/331 (43%), Positives = 199/331 (60%), Gaps = 36/331 (10%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + TPHDA+FK+FL H +TARDFLEIHLP LR +CDL+TL LESGSFIE++L+ H + Sbjct: 3 RKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVHYS 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+LYS++ Y++ VIEHQS PDK MAFR+MRYSI+AM HLE H KLPLV+P+LF Sbjct: 63 DILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKKLPLVIPVLF 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G+ PYP S WFD F + LA +Y+S FPLVD+T+ PDDEI+ H+R+A+LE++QKH Sbjct: 123 YHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIVQKH 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETGGES 240 IRQRD+ L ++L L Y + L +M NY+L G T + + E Sbjct: 183 IRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKYEEV 242 Query: 241 MMTLAQWFEEKGIEKG-----------IQQGRQEVSQ----------------------- 266 +MT+AQ + KG ++G ++G QE Q Sbjct: 243 LMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEKRAS 302 Query: 267 -EFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + A+ L+ G+ RE + + L E++++ Sbjct: 303 LKIARALMDNGIDRETIMKSTGLSQNELEQI 333 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 219 bits (558), Expect = 8e-56, Method: Composition-based stats. Identities = 115/322 (35%), Positives = 181/322 (56%), Gaps = 25/322 (7%) Query: 1 MDAPST-TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M S PHD+ FK F+ + ARDF E+HLP ++ LC+ +TL L S SF++++L+ Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D+LYSVQ GY + ++EHQS PDK M +R+M Y+ AM++HL+ H LPLVVPI Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQGHQSLPLVVPI 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 LFY G +PYP S W D F +LA +Y +P PLVD+T+ DDE+M HR++A +EL+ Sbjct: 121 LFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELVF 180 Query: 180 KHIRQR-DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETG 237 KH R D+ L E+L +++ ++ + NY+ T + ++ E Sbjct: 181 KHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEKH 240 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVS---------------QEFAQRL-------LSK 275 E++M +AQ +G+EKG+++GR+E Q+ A L L Sbjct: 241 QETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLKL 300 Query: 276 GMSREDVAEMANLPLAEIDKVI 297 G+S + ++++ L ++I + Sbjct: 301 GLSVDIISQITGLSPSDIHALR 322 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 216 bits (550), Expect = 7e-55, Method: Composition-based stats. Identities = 113/298 (37%), Positives = 180/298 (60%), Gaps = 4/298 (1%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + + PHDA+FK+FL H AR FLEIHLP +RE CDL+ L + +FIE L +D Sbjct: 2 SVVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYSD 61 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 VL S++ GY++ +IEHQS PDK M RMMRY++AA+ RHL+ H +PLV+PILFY Sbjct: 62 VLLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHDVPLVIPILFY 121 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 QG+ +PYP SM W + F +P LA++++ FPLVD+T+ PD+EIM HR +A LE+ K I Sbjct: 122 QGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKII 181 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 R RD++ ++ + TL+ Y + + + ++ + +++ + +M Sbjct: 182 RLRDILENIDPMATLLALDYNDDLSIDVVFYLLRYGNTDDREKIVKILIQAKPQLEGKIM 241 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEF----AQRLLSKGMSREDVAEMANLPLAEIDKV 296 T+ + + ++ ++G Q+GR+E QE AQR+L + + ++ L E+ ++ Sbjct: 242 TIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQL 299 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. Identities = 102/306 (33%), Positives = 172/306 (56%), Gaps = 9/306 (2%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HDA+FK F E A F+ I+LP +++ CD +TL +E GSF++ LK H Sbjct: 1 MSIQIHNAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +D+LYS+++ G GY+++ +EHQS ++ M FRM RY +A M +HL + KLPLV+ +L Sbjct: 61 SDILYSLKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKKLPLVISML 120 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FY G+ YP + D A+ + L+D+ + PD+EI +H+++A LE++QK Sbjct: 121 FYHGKGQ-YPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQK 179 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 HI RDL + + +V L+ + + YML +G T + L+ E E Sbjct: 180 HIFTRDLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKLKTIEDYEED 239 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQE--------FAQRLLSKGMSREDVAEMANLPLAE 292 +M AQ +++G ++G+ +GRQE Q+ A++L+++G S + + ++ NL E Sbjct: 240 IMNAAQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENE 299 Query: 293 IDKVIN 298 + ++ Sbjct: 300 VLSLVE 305 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 214 bits (544), Expect = 3e-54, Method: Composition-based stats. Identities = 126/294 (42%), Positives = 181/294 (61%), Gaps = 13/294 (4%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HDA+FKQFL ARDFL IHLP +RE CD NTL LES SFI+E L+ +DVLYS Sbjct: 2 PSHDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYS 61 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + GY++ VIEHQS+P+K+MAFR++RY +AAM +HL+ HD+LPLVVP+LFY G + Sbjct: 62 LHTSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDRLPLVVPLLFYHGRS 121 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 PYP S+ W D F +P LA+ +Y PFPLVD+T+ PDDEI HRR+A+LEL+QKHIR RD Sbjct: 122 RPYPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRD 181 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQ 246 ++ L ++ L + + ++ + + E + Q Sbjct: 182 MLELAREIGLLFE-------------RWAAPLSIGQEDIMTIAEQLKKMGFDEGIQRGIQ 228 Query: 247 WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 +G+E+GI+QG + +++ A+ LL GM + V + L E+++++ I Sbjct: 229 QGLAQGLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLETEELEQLVTAI 282 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 213 bits (543), Expect = 4e-54, Method: Composition-based stats. Identities = 126/318 (39%), Positives = 190/318 (59%), Gaps = 21/318 (6%) Query: 1 MDAPSTTP-HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M + P HDA+FKQFL H E ARDF +HLP + LCDL+TL LE SF+E L+ Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVV 117 +DVLYSVQM GY++ +IEHQSKPD+ M FR+M Y+++A+ HL+ LPLVV Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 P LFYQG PYP SM W D F P LA+++Y FPLVD+++ D+EI+ H+ IA+LEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 178 LQKHIRQRD-LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT-EQADLFYGVLRDRE 235 +QKHIR RD LM +L + +I+ + + Q+ ++ Y+ +G+ +++ F ++ Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSP 240 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVS----------------QEFAQRLLSKGMSR 279 + T+A+ E+KGIEKGI++G ++ ++ A+ LL +G+ Sbjct: 241 EYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLLQQGVDL 300 Query: 280 EDVAEMANLPLAEIDKVI 297 + + L +I+ + Sbjct: 301 NIIMQCTGLTREKIESLK 318 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 212 bits (539), Expect = 1e-53, Method: Composition-based stats. Identities = 115/315 (36%), Positives = 182/315 (57%), Gaps = 25/315 (7%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD+ FK F+ + ARDF EI+LP ++ LC+L+TL L S SFI+++L+ +D+LYSV Sbjct: 13 PHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSRFSDMLYSV 72 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEAT 127 Q GY ++++EHQS PDK M +R+M Y+ AM++HL+ ++ LPLVVPILFY G+ + Sbjct: 73 QTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNALPLVVPILFYHGKQS 132 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 PYP S W D F +LA +Y +P PLVD+T+ DDEI+ HR++A +EL+ KH RD Sbjct: 133 PYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELVLKHSTLRDD 192 Query: 188 MLLL-EQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-DLFYGVLRDRETGGESMMTLA 245 +++L E+L +I E ++ + NY+ T + ++ E E++MT+A Sbjct: 193 LIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEGYQETVMTIA 252 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSK-----------------------GMSREDV 282 +G+EKG+ +GR+E E + G+S + + Sbjct: 253 DRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSLDLGLSIDII 312 Query: 283 AEMANLPLAEIDKVI 297 +++ LP +EI + Sbjct: 313 SKITGLPHSEIQAMR 327 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 205 bits (520), Expect = 2e-51, Method: Composition-based stats. Identities = 115/311 (36%), Positives = 173/311 (55%), Gaps = 17/311 (5%) Query: 2 DAPSTTP--HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 + HD +FK FL +TARDFL +HLP ++R L+TL LE GSF+++ L+ Sbjct: 3 NEKGHNRPGHDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLREL 62 Query: 60 STDVLYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 +DVLYSV+ +G+ GY++ ++EHQS D+ MA+RMMRYS+A M HL+ + LP+VVP Sbjct: 63 HSDVLYSVETAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGNGTLPVVVP 122 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 +LFYQG PYP S W D F P LAR VY+ P+PLVD+++ D ++ HRR+A+LEL+ Sbjct: 123 LLFYQGMVRPYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLELV 182 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA--DLFYGVLRDRET 236 Q+ IR RD LL +V LI + +Q+ A+ Y++ G T ++ Y + + Sbjct: 183 QRDIRHRDAASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGEIPE 242 Query: 237 GGESMMTLAQW------------FEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 E +M + + + Q+ E A LL G+S E V + Sbjct: 243 YKELIMGTIAQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEVVIK 302 Query: 285 MANLPLAEIDK 295 L +++ Sbjct: 303 STGLNRETLEQ 313 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 196 bits (497), Expect = 9e-49, Method: Composition-based stats. Identities = 100/328 (30%), Positives = 168/328 (51%), Gaps = 31/328 (9%) Query: 1 MDAPS-TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M+ P + HDA FK+F+M+ A+DF IHL EL+ CD +TL L++ SFI+ L+ Sbjct: 1 MNKPLLISSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSR 60 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D+LYSV+ + ++ +IEHQS+PDK +A+RMM Y+ M++HL+ + LPLVVPI Sbjct: 61 MSDILYSVKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQGYTSLPLVVPI 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 LFY G+ PYP S+ W D F LA ++Y + F L+D+ D+ ++ HR+ A++E+ Sbjct: 121 LFYHGKRKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAM 180 Query: 180 KHIRQRDLMLLLEQLV-TLIDEGYTSGSQLVAMQNYMLQRGHTEQAD-LFYGVLRDRETG 237 KH+ D + L L+ I++ S +A+ Y+ + + + + Sbjct: 181 KHVNSCDDLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDNH 240 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQE----------------------------FA 269 E++M +A E KG + G +G + E A Sbjct: 241 RETIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQLA 300 Query: 270 QRLLSKGMSREDVAEMANLPLAEIDKVI 297 + LL + + E + + L + E+ ++ Sbjct: 301 KNLLKENVELEFIERITGLSIQELKILL 328 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 195 bits (494), Expect = 2e-48, Method: Composition-based stats. Identities = 99/305 (32%), Positives = 164/305 (53%), Gaps = 9/305 (2%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 ++ HDA F++ L ARDFLE L + C+L+T+ LE +F+ ESL+ + D Sbjct: 6 NKTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSACD 65 Query: 63 VLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 VL S++ GY++ +IEHQS PDK + RMMRY +A M +H+E H P+V+P+LF Sbjct: 66 VLLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEE-HKCAPVVIPVLF 124 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVY--NSPFPLVDITITPDDEIMQHRRIAILELLQ 179 Y G PYP M W D P R +Y PF LVD++ DDEI + R+A L Sbjct: 125 YHGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMFTM 184 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 K D++ L+ + +T + + Y S L + Y+L+ + A+L V + Sbjct: 185 KSGTSGDVIELIGKSIT-LTDKYGSSVHLNTVLTYLLELYQMDFAELSEAVSTHYPSHKG 243 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQE----FAQRLLSKGMSREDVAEMANLPLAEIDK 295 +MT+A+ EE+G++KG+++G ++ E + +G S E++ + +L ++ + Sbjct: 244 VIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDEQLLQ 303 Query: 296 VINLI 300 ++ + Sbjct: 304 ALDYV 308 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 120/308 (38%), Positives = 182/308 (59%), Gaps = 14/308 (4%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 +TPHD +FK+F AR+F EIHLP + ++ +L + GSFI++SLK +D+ Sbjct: 2 KISTPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDM 61 Query: 64 LYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 +YS + G GYL+ V+EHQS DK MAFRM +YS+A M +HL+ HD LPLV+P+LFY Sbjct: 62 VYSFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDTLPLVLPVLFY 121 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 G+ +PYP SM W D F ELAR + + PFPLVD+T+ P++EIM+H I+ LE+ QK + Sbjct: 122 HGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMV 181 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 RD+M + L+ L + ++ Y+ Q G T LF+ L E++M Sbjct: 182 HTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSSTTQ-RENVM 240 Query: 243 TLAQWFE------------EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 T+A+ + E+G E+G ++GR+E +E A+ LL+ G S + V L Sbjct: 241 TIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLSE 300 Query: 291 AEIDKVIN 298 ++K+++ Sbjct: 301 DSLNKLLD 308 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 188 bits (477), Expect = 2e-46, Method: Composition-based stats. Identities = 88/306 (28%), Positives = 156/306 (50%), Gaps = 7/306 (2%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HD +FK L A FL+ L E+ +L ++ TL L SF+ + Sbjct: 1 MAMTIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPD-KKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 +D++Y Q+ GY+ ++EH+S + MAFR ++Y+I+AM ++ + KLP+V+PI Sbjct: 61 SDIVYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKKLPIVLPI 120 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 Y G +PYP S +D F + ++AR++ PF L+D+T+ D+E+ + ++E+L Sbjct: 121 CVYHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLL 180 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ--NYMLQRGHTEQADLFYGVLRDR--- 234 KH R ++ + +L + + I + YM+ E + +++ Sbjct: 181 KHSRAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLSTA 240 Query: 235 -ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 +MMT AQ ++G+E+G++QGR E + A+ LL GMS + V + L E+ Sbjct: 241 FPEEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEV 300 Query: 294 DKVINL 299 ++N Sbjct: 301 MNLVNK 306 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 185 bits (468), Expect = 2e-45, Method: Composition-based stats. Identities = 102/275 (37%), Positives = 167/275 (60%), Gaps = 17/275 (6%) Query: 41 LNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIA 100 L+TL + SGSFIE+ L +D+LYS++ Y++ +IEHQS P+ MAFR++RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 101 AMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITI 160 AMHRHLE ++ +LP+V+PILFY G +PYP + W D F +LA VY FPLVD+T Sbjct: 63 AMHRHLEQENKQLPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVTA 122 Query: 161 TPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 D+EI++HRR+A++E++QKHIR R+++ L +L L+++ S Q + Y++ G+ Sbjct: 123 MEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAGN 182 Query: 221 TEQAD-LFYGVLRDRETGGESMMTLAQWFE----------------EKGIEKGIQQGRQE 263 T + + + + E MMT+A+ E E+G+++GIQ G+++ Sbjct: 183 TTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKKQ 242 Query: 264 VSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 + + A++ L G+ R+ V L +I+ V+N Sbjct: 243 ATLKIARQFLVNGVERDIVKMSTGLTDRDINDVLN 277 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 177 bits (449), Expect = 3e-43, Method: Composition-based stats. Identities = 70/312 (22%), Positives = 127/312 (40%), Gaps = 13/312 (4%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M S +PHDA+FK + A L+ L + D +TL E GS+I+E+L Sbjct: 1 MHGTSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERH 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPI 119 +D+L+S + G Y++++IEHQS D+ M RM+ Y RH A + LP ++P+ Sbjct: 61 SDLLFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPV 120 Query: 120 LFYQGEATPYPL----SMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 + S+ PEL + + D+T D ++ + Sbjct: 121 VVSHAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFA 180 Query: 176 ELLQKHIRQR-------DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 L+ +R R D + + + E + + +Y+ Q F+ Sbjct: 181 TLVLWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFH 240 Query: 229 GVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 L + E M T + E+G+ KG+ +GR+E ++ L + + A+ Sbjct: 241 AKLDEHVPQTREVMKTYYEELMEEGMAKGLAKGREEGREQSRIETLQETLIDLLSAKFDL 300 Query: 288 LPLAEIDKVINL 299 L +++ + Sbjct: 301 RELEHAERIRSA 312 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 175 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 79/314 (25%), Positives = 140/314 (44%), Gaps = 29/314 (9%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M +P + PHDAVF++ L A L LP L DL+ L + GS ++ +L+ Sbjct: 1 MSSPPS-PHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRH 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--KLPLVVP 118 TD+L++ + G+ +++V++EHQS D MAFRM+RY + R+L H +LP VVP Sbjct: 60 TDLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVP 119 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELA----RRVYNSPFPLVDITITPDDEIMQHRR--- 171 ++ + E + + +P+LA + F L D+ + E+ + Sbjct: 120 LVVHHNEHAWVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHS 179 Query: 172 IAILELLQKHIR-----QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL 226 + + LL K + +DL +++L ++D G + + Y+ G + D Sbjct: 180 VRLTLLLLKIVPGNPRLAQDLRPWVDELRAVLD-GPDGREEFATLLRYIELVGEADARDE 238 Query: 227 FYGVLRDR-ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 + ++ ++ MT+A E +G +GR E E +LL+ Sbjct: 239 LHDLIAGLGPEAEDAYMTIA----EMLRAEGRVEGRVEGRVESLLQLLTLKFGP------ 288 Query: 286 ANLPLAEIDKVINL 299 LP A + V + Sbjct: 289 --LPEAALAAVHDA 300 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 174 bits (441), Expect = 3e-42, Method: Composition-based stats. Identities = 82/247 (33%), Positives = 136/247 (55%), Gaps = 2/247 (0%) Query: 24 RDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQ 83 + F IHLP EL+ CD +TL L++ SFI+ L+ +D+LY V+ + ++++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 84 SKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPE 143 S+PDK +A+RMM Y+ M++HL+ + LPLVVPILFY G+ PYP + W + F Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKSLPLVVPILFYHGKKKPYPFPVNWMECFPLSS 125 Query: 144 LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ-RDLMLLLEQLVTLIDEGY 202 LA +Y++ F L+D+T DD ++ H++ A++E+ KH+ DL + L I++ Sbjct: 126 LANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQKN 185 Query: 203 TSGSQLVAMQNYMLQRGHTEQADL-FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGR 261 VA+ Y+ + + + E++M +A E KG + GI +G Sbjct: 186 CRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEGF 245 Query: 262 QEVSQEF 268 + + Sbjct: 246 EIGKLKV 252 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 174 bits (440), Expect = 4e-42, Method: Composition-based stats. Identities = 65/294 (22%), Positives = 122/294 (41%), Gaps = 21/294 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 ++ ++ PHDA+F+ H A L LP EL L D + L + + SL T Sbjct: 12 ESVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRT 71 Query: 62 DVLYSVQMQ-----GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D+L+S ++ +++ IEHQS+ D M R++ Y + RH + LP V Sbjct: 72 DLLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHGGALPPV 131 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSP-----ELARRVYNSPFPLVDITITPDDEI---MQ 168 ++ + ++F P +A + P + D+ D E+ Sbjct: 132 FCVVLSHAAKG-WTGPRSLVELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARHA 190 Query: 169 HRRIAILELLQKHIRQRD-----LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQ 223 H A+ L + R + L+ +Q++ L+D + L + Y+ G Sbjct: 191 HPLPALTLWLLRDARSPERLVHRLLDWRDQIIALLDYDHG-ERDLAQLLRYVALVGSEMD 249 Query: 224 ADLFYGVLRDRETGGESM-MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 + F+ + E+M MT+A+ + +++G +QG++E +E +G Sbjct: 250 FEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREGRLEGQREG 303 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 170 bits (430), Expect = 5e-41, Method: Composition-based stats. Identities = 106/280 (37%), Positives = 149/280 (53%), Gaps = 10/280 (3%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + TPHDA F+QFL + ARDF+E+HLP ELR +CDL+TL LESGSF+E+ L+ + + Sbjct: 4 KNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFS 63 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSI-AAMHRHLEADHDKLPLVVPIL 120 DVLYS++ + + S+ + F + AAM RHLEA H KLPLV+P+L Sbjct: 64 DVLYSLKTTAGDDIFMSWL-NTSQHLTNICFPPDTLCVGAAMQRHLEAGHKKLPLVIPVL 122 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPF-PLVDITITPDDEIMQHRRIAILELLQ 179 FY G+ +PYP S W D F R+ LVD+T+ PDDEI HR +A L LL Sbjct: 123 FYTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLLP 182 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQL----VAMQNYMLQRGHTEQADLFYGVLRDRE 235 ++I + + +T Y S +A Y R + + L Sbjct: 183 ENIF---ISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNIRRRSLCTRTGTACA 239 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK 275 G+++MT+AQ E+KGIEKGIQ G Q ++ + Sbjct: 240 QHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGERE 279 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 166 bits (421), Expect = 6e-40, Method: Composition-based stats. Identities = 67/316 (21%), Positives = 125/316 (39%), Gaps = 25/316 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HD +K E RD + +P + D +TL GS++ E + D+++ Sbjct: 3 NTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVWR 62 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH----DKLPLVVPILFY 122 V++ G YL+++IE QS DK MA RMM Y ++ +LP V+PI+ Y Sbjct: 63 VKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIVLY 122 Query: 123 QGEATPYPLSMCWFDMFYSPELARRV-YNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 G ++ + + P L + + L+D D E+ + + +H Sbjct: 123 NGSQRWSAVTDVFELIPPVPGLVEQFKPRLKYLLIDENAWSDSELASLKNLVAAVFRIEH 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL------FYGVLRDRE 235 + L++L+DE L M ++ +A+ + Sbjct: 183 PASP---AAIGDLLSLLDEWLAERPDLRRMFALWIRATLMRKAEYRIVLPRIDDLQELNV 239 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF--------AQRLLSKGM---SREDVAE 284 E + AQ ++ +G +G +G+ E E Q+LL K + +A+ Sbjct: 240 MLAERLEEWAQAYKAEGKAEGKAEGKAEGKAEGKAEGEALALQKLLKKRFGAVPPDVLAQ 299 Query: 285 MANLPLAEIDKVINLI 300 ++ L +ID ++ + Sbjct: 300 ISRASLEQIDAWLDQV 315 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 166 bits (419), Expect = 1e-39, Method: Composition-based stats. Identities = 77/313 (24%), Positives = 132/313 (42%), Gaps = 20/313 (6%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HDA+FK E A L LP L D L L GSF++E+LK +D+L+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA--DHDKLPLVVPILFYQG 124 M L+++ EHQS + MAFR++RY + HL +LP ++P++ + Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHHS 131 Query: 125 EATPYPLSMCWFDMFYSPELAR-----RVYNSPFPLVDITITPDDEIMQHRRIAILELL- 178 E + + D+ E AR V F L DI+ D+ + A L+ Sbjct: 132 ETGWTAAT-SFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVL 190 Query: 179 --QKHIRQRDL----MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG--V 230 +H R+ D + LV + L A+ Y+L ++AD + Sbjct: 191 WCLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLL 250 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG---MSREDVAEMAN 287 E E +++ A E+G ++G+++G +E ++L + + VA + Sbjct: 251 AAAGEPWKEEIVSAADQLMERGRQQGLREGLREGRCHMLLKVLGARFGALPNDAVARVNA 310 Query: 288 LPLAEIDKVINLI 300 +A +D+ + Sbjct: 311 ADIAVLDRWSERV 323 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 79/310 (25%), Positives = 135/310 (43%), Gaps = 24/310 (7%) Query: 13 FKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGN 72 +K H E RD L + E D +TL SGS+I E L+ DV++ V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 73 PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD----HDKLPLVVPILFYQGEATP 128 Y+++++E QS D+ MA R+M Y + + + KLP V+PI+ Y GE Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 129 YPLSMCWFDMFYSP-ELARRVYNSPFPLVDITI-TPDDEIMQH-RRIAILELLQKHIR-Q 184 + P L R N + L+D D E H R +A +H R + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 185 RDLMLLLEQLVTLIDEGYT---SGSQLVAMQNYMLQR----GHTEQADLFYGVLRDRETG 237 +D++ +L LV + + +V ++ +L + + + + Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQE--------FAQRLLSKG-MSREDVAEMANL 288 E + + +EEKG ++G Q+GR+E QE A+ L+ G +S E +AE L Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 289 PLAEIDKVIN 298 +AE++ + Sbjct: 313 TVAEVEGLRE 322 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 65/277 (23%), Positives = 111/277 (40%), Gaps = 5/277 (1%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 T PHD K L + L LP E+ EL L G+FI+ + H TD Sbjct: 2 TKITQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREHLTD 61 Query: 63 VLYSVQMQGNPG-YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 L+ V+ Q Y++ +IEH+S D+ +AF+++RY + R L+ KLP +VP++ Sbjct: 62 RLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQQKLPPIVPLVV 121 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G + + L + + F + D+ DD++ Q + + K+ Sbjct: 122 YHGAREWTVPNQFSALLEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALMAMKY 181 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 Q +++ + + + Y++Q + G Sbjct: 182 AFQGAEGVVV--IPQIGKGAQGDPEFAKLVLRYLIQTYRGMTMADVQAYAEEAFPGE--A 237 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 A F + + KG Q+GRQE +E Q +G S Sbjct: 238 EHYASQFAREMMSKGRQEGRQEGRREGRQEGRQEGES 274 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 163 bits (411), Expect = 8e-39, Method: Composition-based stats. Identities = 75/319 (23%), Positives = 133/319 (41%), Gaps = 28/319 (8%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 HD++ K + A D LP + E DL+ L L GSF+ + L+ TD+L Sbjct: 2 PHDSHDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLL 61 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA--DHDKLPLVVPILFY 122 + + G P +L++++EHQS ++ M R++RY + RHL LP ++P++ + Sbjct: 62 FRAPLDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLH 121 Query: 123 QGEATPYPLSMCWFDMFYSPE-----LARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 E + +F + L + F L D++ PD+ ++ A +L Sbjct: 122 HSEQGWTAPT-SLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKL 180 Query: 178 ----LQKHIRQRDLMLLLEQLVTLIDEGYTSG---SQLVAMQNYMLQRGHTEQADLFY-G 229 L+ +DL+ LL +I E T+ L A+ Y LQ T+ L Sbjct: 181 ALWALKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTDPDALMRFL 240 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG------------M 277 + + E+ MT A+ + E+ ++QGR E E +G + Sbjct: 241 IDSAGDPAKEAFMTGAEKLTQAVREQSLRQGRVEGRVEGRVEGRVEGRVEGRTEALRTVL 300 Query: 278 SREDVAEMANLPLAEIDKV 296 S++ LP +++ Sbjct: 301 SKQLRQRFGTLPSEVTERL 319 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 161 bits (407), Expect = 2e-38, Method: Composition-based stats. Identities = 82/331 (24%), Positives = 150/331 (45%), Gaps = 38/331 (11%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 PHD FK+ AR FL+ +LP E+ L DL T+ + S+I++ L+ +D+L Sbjct: 4 IHNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFSDLL 63 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH-DKLPLVVPILFYQ 123 + V++ N GYL+ + EH+S P + +A ++++Y + L+ DKLPL++P++ Y Sbjct: 64 FQVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMVVYH 123 Query: 124 GEATPYPLSMCWFDMFYSPEL-----ARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 G+ + S+ + + E + + + L D++ D E++ + + I+ Sbjct: 124 GQEK-WNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIILRT 182 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQRGHTEQADLFYGVLR 232 + I +D L L+ Q + Y+L + + Y + + Sbjct: 183 MRDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEIAK 242 Query: 233 DRE-TGGESMMTLAQWFE------------------------EKGIEKGIQQGRQEVSQE 267 + GE MMT+A+ EKG E+G+++GR+E E Sbjct: 243 EVSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETKLE 302 Query: 268 FAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 A+ LL G+ + VA+ L EI K++N Sbjct: 303 VARNLLGLGIEMDKVAKATGLSEEEIRKLMN 333 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats. Identities = 76/301 (25%), Positives = 141/301 (46%), Gaps = 5/301 (1%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + PHD + K L H E ++F + + P ++ + DL +L L + S++ E L+ Sbjct: 6 KNDLSNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHN 65 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI-- 119 D+++S + PGY V+EHQS PD MA R ++Y+IA + +++ +K P + + Sbjct: 66 DLVFSFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNI 125 Query: 120 -LFYQGEATPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRIAILEL 177 L++ PYP S +D+F P A+ + F L D+ TP++ + QH I ++E Sbjct: 126 CLYHNANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEK 185 Query: 178 LQKHIRQRDLMLLLEQLVTLID-EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 L K+ R RD+ ++E+ + G + Y E+ V +E Sbjct: 186 LLKYSRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEV 245 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 ++ + + E+G +G++ A+ +L KG + E+ L +I+K+ Sbjct: 246 LSKNEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIEKL 305 Query: 297 I 297 Sbjct: 306 K 306 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 160 bits (403), Expect = 8e-38, Method: Composition-based stats. Identities = 65/313 (20%), Positives = 123/313 (39%), Gaps = 24/313 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D +K+ H E D + L +L CDL+TL +GS++ + L+ D+++ Sbjct: 2 NDSDHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWR 61 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH---DKLPLVVPILFYQ 123 + L+++IE QSKPD M R+M Y + + ++P ++PI+ Y Sbjct: 62 LAYGDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYN 121 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 GE + ++R + + P+ L+D +M+ R +A + Sbjct: 122 GEIPWKVPHDIRETIQMPKPVSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSS 181 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMT 243 + L E L T + +++ L +T + D + + G Sbjct: 182 GP--LELFELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAER 239 Query: 244 LAQWFEE-------------------KGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAE 284 + +W + +G +G +G+ E +R+ KGMS ++A Sbjct: 240 VNKWIAQYKAEGRKEGKEEGKKEGLLEGRVEGKLEGKLEGMATILKRMKEKGMSVTEIAT 299 Query: 285 MANLPLAEIDKVI 297 + LP EI +I Sbjct: 300 ITGLPEDEIQHLI 312 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 56/333 (16%), Positives = 126/333 (37%), Gaps = 36/333 (10%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 P P+D ++Q L + L+ + E D + L L + S++ + DV Sbjct: 10 PPHHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADV 69 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKLPL 115 +Y ++ + +V++E QS D M FR++ Y + E+ H +LP Sbjct: 70 VYRLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPP 129 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI- 174 ++P + Y G + + + + + + + L D+ ++E+++ + Sbjct: 130 IIPAVLYNGAGSWTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIAG 189 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLR 232 + LL + ++ DL L++L ++ + ++N + R + ++ G+L Sbjct: 190 IFLLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGILN 249 Query: 233 DRETGGESMMTLAQWFEEKGIEK-------------------------GIQQGRQEVSQE 267 M + +++ G +G+ E +E Sbjct: 250 ASNPWEVERMIYNLELTLEEMQRQALLKGLKEGEQKGKLEGKLEGKLEGKLEGKLEGKRE 309 Query: 268 FAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 A+ LL + E + + L L EI+ + + Sbjct: 310 VARNLLLLNVDIETIIKATGLALEEINALKKQM 342 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 81/282 (28%), Positives = 144/282 (51%), Gaps = 9/282 (3%) Query: 12 VFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQG 71 +F++ L + A +F HLP ++ L D +L +E+ +F+E SLK +DVL+S + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEATPY 129 GYL +++EHQSK D MAFR+ +Y I R+L LPL+ P++F+ G+ Y Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQEK-Y 141 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 ++ +D+F + +LA+ ++ + + LV++ PD+E Q ILE KHI +R+L+ Sbjct: 142 NVARNLWDLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 190 LLEQLVTLIDEGYTS---GSQLVAMQNYMLQRGHTEQADLFYGVLRDR---ETGGESMMT 243 +++ ++ E L + Y L + +L + E G M + Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTKIEQADKIKLKNLLSTKLNPEIGTRLMRS 261 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 LA+ ++++G E GI +G Q + Q +KG+ + Sbjct: 262 LAEHWQQEGKEIGILEGLQVGEAKGIQIGEAKGIQIGKAEGI 303 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 81/345 (23%), Positives = 150/345 (43%), Gaps = 56/345 (16%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HD +FK + + A DF+ LP E++ + DLNT+ +E SF+E +L+ DVL+SV Sbjct: 6 KHDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDVLFSV 65 Query: 68 QMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAA------MHRHLEADHDKLPLVVPIL 120 + + N +++V+IE + + D +AF++ +Y+++ + + + KLP+VVPI+ Sbjct: 66 KTKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIVVPIV 125 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G A + +++F P+LA+ + S + L+D PD EI + A++ ++ Sbjct: 126 VYHG-ADRFNAPRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVHFMKY 184 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQ-----LVAMQNYMLQRGHTEQADLFYGVLRD-- 233 Q D++ L + + E + + ++ Y + + + +L + Sbjct: 185 IHNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQLLDENL 244 Query: 234 -RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQE------------------------- 267 E M T+A + ++G KG +GR E E Sbjct: 245 SIEDRDRIMGTIAAQYIDEGKAKGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEG 304 Query: 268 ---------------FAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 A+ LL G S E +AE L E+ + Sbjct: 305 IEIGETKGRAEAAQGLARNLLKAGFSVEFIAENTGLSNEEVVNLK 349 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 155 bits (390), Expect = 2e-36, Method: Composition-based stats. Identities = 77/333 (23%), Positives = 133/333 (39%), Gaps = 37/333 (11%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M A PHD FK+ E DFL P +RE D TL E +F +E L H Sbjct: 1 MAAQPDNPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHF 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D+++SVQ G P L +++EH+S ++ F++ RY + ++ L V+P+L Sbjct: 61 ADLVFSVQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQKQP-LTPVLPVL 119 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI---LEL 177 Y G S+ + L + + L+D++ D+ + + L Sbjct: 120 VYHGNRRWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAIL 179 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTS---GSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 LQ R+R+L LL+ ++ + + Y+ + + +LF R Sbjct: 180 LQNSRRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVELFGIFSRIS 239 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGR------------------------------QEV 264 S MT+A+ ++G E +Q R ++ Sbjct: 240 SKIESSTMTVAEELIQEGRELERRQTRMVAEELIQQGRELERRQAMMAAEELLKQQERQN 299 Query: 265 SQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +F + +L+ + +A A LPLAE+D +I Sbjct: 300 KIKFIKAMLNLNLDAATIATAAELPLAEVDAII 332 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 153 bits (387), Expect = 5e-36, Method: Composition-based stats. Identities = 75/301 (24%), Positives = 152/301 (50%), Gaps = 11/301 (3%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HD++ K + A++FLE +LP + ++L DL+ + +E S+IEESL +D++Y + Sbjct: 6 KHDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGI 65 Query: 68 QMQG-NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + G+++++IE QS D A R+ +Y++ RH + +KLPLV ++ Y G Sbjct: 66 ETKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERH-KEKRNKLPLVYNLVIYNG-K 123 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y +D+F + +A+++ + LVD+ D+EI++ + I +L+ + KHI +RD Sbjct: 124 QVYNAPRNLWDLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERD 183 Query: 187 LMLLLEQLVTLI--------DEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 ++ L EQ + ++GY + + + + + + + Sbjct: 184 MIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKD 243 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 M T+A + ++G ++G ++G + A+++ S+G +AE+ L I +I Sbjct: 244 NIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSIIE 303 Query: 299 L 299 Sbjct: 304 S 304 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 153 bits (386), Expect = 8e-36, Method: Composition-based stats. Identities = 88/207 (42%), Positives = 124/207 (59%), Gaps = 7/207 (3%) Query: 90 MAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVY 149 M FRM+RYS+AAM RHLE H LPLV+P+LFY GE +PYP SM W D F P LA ++Y Sbjct: 1 MPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKIY 59 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV 209 PFPLVDIT+ D+EIM HRR+A L LL KHIR RD+M LL++L ++ E S Q+ Sbjct: 60 TKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMVE--ISDEQVR 117 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 + +Y++ G + + + + +MT+A+ E +KG Q+G E + A Sbjct: 118 VLIHYIVNAGDSVSPEFMRALAERLPQHEDKLMTIAERLE----QKGRQEGALEKALAIA 173 Query: 270 QRLLSKGMSREDVAEMANLPLAEIDKV 296 +L GM+ E + + L AE+ + Sbjct: 174 CQLQKMGMTPEQIKQATGLSEAELKNI 200 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 153 bits (385), Expect = 9e-36, Method: Composition-based stats. Identities = 71/309 (22%), Positives = 147/309 (47%), Gaps = 15/309 (4%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 PHD FK+ + A+DF+ +LP+EL ++ D+ TL E +IE+ LK +D+L Sbjct: 4 IHQPHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLL 63 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH-LEADHDKLPLVVPILFYQ 123 + + G GYL+ + EH+S P K++A +++ Y + L+ +K+P+++P+ Y Sbjct: 64 FKANINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYH 123 Query: 124 GEATPYPL----SMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 G+ + E+ + + + + D++ DDE+ ++ I+ + Sbjct: 124 GKENWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKIL 183 Query: 180 KHIRQRDLM-----LLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 + I + D +++ +++ Y+L Y ++++ Sbjct: 184 RSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLVKEV 243 Query: 235 ETGG-ESMMTLAQWF----EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 + +MT+A+ EKG+EKG+++G+ E +E A+ L+ G+ + V + L Sbjct: 244 SVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGLS 303 Query: 290 LAEIDKVIN 298 EI+K++N Sbjct: 304 EEEINKLLN 312 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 57/194 (29%), Positives = 108/194 (55%), Gaps = 3/194 (1%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HD F++ L + AR+F E +LP E++ L TL LE+ SFI+ +LK TDVLYS Sbjct: 55 KHDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSA 114 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPILFYQGE 125 ++ Y++++ EHQS D MAFR+ +Y + +HL + K P + P++ Y + Sbjct: 115 RINNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSND 173 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 Y + +D+F + EL + +++ + L+ + DD++ ++ +A L++L K+I + Sbjct: 174 HKKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKP 233 Query: 186 DLMLLLEQLVTLID 199 ++ +++ + Sbjct: 234 NVFDKWQEISGCLA 247 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 61/313 (19%), Positives = 111/313 (35%), Gaps = 40/313 (12%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M + + D+++KQ H E RD + L + + + S+ + Sbjct: 39 MSSRT----DSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRH 94 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH----RHLEADHDKLPLV 116 DV++ ++ G Y+++++E Q++PDK MA RM Y +H + H KLP V Sbjct: 95 DDVVWRARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPV 154 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVD------------ITITPD 163 +P++ Y G + M +P L R + + L+D + Sbjct: 155 LPVVLYHGRGPWRAATALASLMLPAPSGLERYQPSQRYLLIDQHHGTARADVVSLLFRLL 214 Query: 164 DEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTS---------GSQLVAMQNY 214 D + L+LL + IR RD+ + + L I Sbjct: 215 DAATDLQLREALDLLAERIRARDMDPVRDSLTRWIQLTLQDAAVETSMDLEEAFTMKMRR 274 Query: 215 MLQRGHTEQADLFYGVLRDRETG----------GESMMTLAQWFEEKGIEKGIQQGRQEV 264 +F L E + E+G +G+++GR+E Sbjct: 275 KFSYDEMFDPGMFERPLAKAREKAIVEGLQQGREEGLERGRVEGLERGRVEGLERGREEG 334 Query: 265 SQEFAQRLLSKGM 277 + Q L +G+ Sbjct: 335 LKAGLQEGLQEGL 347 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 75/297 (25%), Positives = 137/297 (46%), Gaps = 8/297 (2%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 PHD K L + TA L LP E+ E + L GSFI+E+L+ H TD Sbjct: 2 TEIAHPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLTD 61 Query: 63 VLYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPI 119 LY V+ G L+V+IEH+S PD ++ +++++Y + A+ + + ++LP +VP Sbjct: 62 RLYRVRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVPF 121 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 +FY G A + + + + + E R + N F ++D+ D ++ + + L Sbjct: 122 VFYHGAAA-WKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLA 180 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 K+ + D L +++L+ + + + Y+++ + + ++R Sbjct: 181 AKYATRDDRQLEVKELLIQ-TLVSVADEEFRFLMRYVVETYRSYDEPMVREIIRRVRPEE 239 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 E M F + + KG Q+GRQE QE Q + G R E A + L ++ + Sbjct: 240 EETMM--SMFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEEAAYMLLKQMRR 294 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 148 bits (372), Expect = 3e-34, Method: Composition-based stats. Identities = 67/319 (21%), Positives = 118/319 (36%), Gaps = 30/319 (9%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK L + L+ LP + D +L + E L D+ +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 +H+++EH+S PD + F++ RY R L+ PL +PILFY G Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPRPL-LPILFYHGVVPW 125 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR---IAILELLQKHIRQR 185 S + EL + PL+D+ D+EI H + L KHI Sbjct: 126 TLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHHVDDLEAVLALLSLKHIFDG 185 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + L+ L+ I E + L NYM + ++ + Sbjct: 186 -VETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREVGMAQDIV 244 Query: 246 QWFEEKGIEKGI------------------------QQGRQEVSQEFAQRLLSKG-MSRE 280 + + ++ +++G+ QQG + ++ + LL K S E Sbjct: 245 ETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGFQQGARLKEEQVIRTLLKKKTFSFE 304 Query: 281 DVAEMANLPLAEIDKVINL 299 ++A + + L+ + +V Sbjct: 305 EIASLVGVELSRVREVAES 323 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 71/318 (22%), Positives = 130/318 (40%), Gaps = 26/318 (8%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 T HD +K+ H E + +E P E+ L D NTL SG++I + DV++ Sbjct: 3 TNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVW 62 Query: 66 SVQMQ----GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK----LPLVV 117 SV++ +L++++E QSK D M R+M Y L+ LP + Sbjct: 63 SVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHYVACFYDHLLKTRETTVRQGLPPIF 122 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYN--SPFPLVDITITPDDEIMQHRRIAIL 175 P++ Y G + + +P RVY + L+D D+E++ R Sbjct: 123 PMVLYNGSQRWSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTPLSG 182 Query: 176 ELLQKHI--RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH------TEQADLF 227 ++ L ++++V ++ + ++ + D Sbjct: 183 IFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARLNLDRM 242 Query: 228 YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV-------SQEFAQRLLSKG-MSR 279 ++ DR E++ L + +G ++G Q+GRQE ++ + LLS G +S Sbjct: 243 SSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGVLSN 302 Query: 280 EDVAEMANLPLAEIDKVI 297 + +A L + EIDK+ Sbjct: 303 DQIAVATGLSVDEIDKLR 320 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 146 bits (368), Expect = 8e-34, Method: Composition-based stats. Identities = 69/317 (21%), Positives = 131/317 (41%), Gaps = 24/317 (7%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHST 61 A + TPHD FK+ E L LP ++ D ++L G + E L + Sbjct: 2 AKNLTPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTRA 61 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+++SV G L V++EH+S PD ++ F++++ + ++L + LP ++PILF Sbjct: 62 DLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREGREPLP-ILPILF 120 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI--MQHRRIAILELLQ 179 Y G+ + M E+AR + + +D+ + D I +Q+ L Sbjct: 121 YHGQGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAALLSM 180 Query: 180 KHIRQ---RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 KH+ + R LL+E ++ +Y + Y ++ Sbjct: 181 KHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMAAITE 240 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQ----------------EVSQEFAQRLLSKGMSRE 280 M T + ++ IE+GIQ+G Q + + L + + Sbjct: 241 EA-GMETTTERLKKIWIEEGIQKGVQLGIQQGVQQGVQQGVRQNQIKTILSLSKHNFTPQ 299 Query: 281 DVAEMANLPLAEIDKVI 297 +A++ +L L E+++V+ Sbjct: 300 QIADLLSLELPEVERVL 316 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 146 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 67/325 (20%), Positives = 122/325 (37%), Gaps = 38/325 (11%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HD FK L + L+ LP L L +L + +SL D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 +HV++EH+S PD F+++ Y R + +P V P+LFY G Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLRDKKESRSPIPFV-PVLFYHGLRPW 126 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI---AILELLQKHIRQR 185 + + EL V + P++D+ D +I + R + LL KHI + Sbjct: 127 NLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLLLKHIFEG 186 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 L + + S +++ +Y++ H E ++ + Sbjct: 187 A-RGSLRAFLQETNGKNLSRDIIISGMSYVIGVHHLESTAELSRLVNTILKEEGMSQNVV 245 Query: 246 QWFEEKGIEKGIQQGRQEVSQ--------------------------------EFAQRLL 273 + + E+ I++G+Q+G Q+ Q + ++LL Sbjct: 246 ELWMEELIQQGVQKGIQQGVQLGIEQGIQQGIQQGVQQGVRQGVQQGIRITQDDTIRKLL 305 Query: 274 SKG-MSREDVAEMANLPLAEIDKVI 297 +KG +S E +A +LP I +V+ Sbjct: 306 NKGQLSVEQIAFFLDLPTDRIREVL 330 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 146 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 77/301 (25%), Positives = 141/301 (46%), Gaps = 33/301 (10%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + HD +FK+ + AR+FLE +LPV + +LN++ +E SF+ E L+ + Sbjct: 34 NTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRLS 93 Query: 62 DVLYSVQMQGNP--------------GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL- 106 DV+YSV ++ + Y++V+IEHQS D +AFR+ +Y + RH Sbjct: 94 DVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHKD 153 Query: 107 ---------EADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVD 157 + +KLPL+ PI+ Y + PY ++++F + A+ + + LVD Sbjct: 154 ANNNKSSVTKEKDNKLPLICPIVVYANDK-PYNAPRSFWELFEDSKTAKDMMGDEYLLVD 212 Query: 158 ITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ--------LV 209 + DDEI + + + ++E + KHI+ RD++ L + L+ + + L+ Sbjct: 213 LQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWLL 272 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 + + + +E E M T+A + ++G++KG+ QG Q Sbjct: 273 WYSDAKVSEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKGMVQGMQIGEARGM 332 Query: 270 Q 270 Q Sbjct: 333 Q 333 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 146 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 89/200 (44%), Positives = 124/200 (62%), Gaps = 9/200 (4%) Query: 95 MRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP 154 MRY+IAAM HL+A + LP+VVP+LFY G +PYP S+CW D F P LAR++Y S FP Sbjct: 1 MRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAFP 60 Query: 155 LVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 L+D+T+ PDDEIM HRR+A+LEL+QKHIRQRDLM L+EQ+ L+ GY +G Q+ + NY Sbjct: 61 LIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFNY 120 Query: 215 MLQRGHTEQA-DLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 +LQ G + D GV + S+MT+A+ Q+G Q + A+ +L Sbjct: 121 ILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLR--------QEGEQSKALHIAKIML 172 Query: 274 SKGMSREDVAEMANLPLAEI 293 G+ D+ + E+ Sbjct: 173 ESGVPLADIMRFTGVSEEEL 192 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 145 bits (366), Expect = 1e-33, Method: Composition-based stats. Identities = 86/338 (25%), Positives = 153/338 (45%), Gaps = 51/338 (15%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HDA+ K+ L A++FLE +LP + +EL DL + +E SF+E+ LK +D++YSV Sbjct: 6 KHDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSV 65 Query: 68 QMQG-NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + +++V+IE QS D +A R+ +Y + RH E + +KLPL+ P+L Y G Sbjct: 66 KTRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH-ENNKNKLPLICPLLIYNGSE 124 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 Y ++++F PE A+++ + LVD+ DDEI Q + + ++E KHI QRD Sbjct: 125 V-YNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRD 183 Query: 187 LMLLLEQLVTLID--------EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 ++ L ++ + GY V + + ++ + E Sbjct: 184 MLKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQIIVKHLSTEEKD 243 Query: 239 ESMMTLAQWFEEKGIEK----------------------------------------GIQ 258 M T+AQ + ++G++ G Sbjct: 244 NIMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGKAEGKAEGKA 303 Query: 259 QGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +G+ E E A+++LS+G ++ + L A I + Sbjct: 304 EGKAEERVEIARKMLSQGCDFSFISSVTGLEEAFIRSL 341 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 145 bits (366), Expect = 1e-33, Method: Composition-based stats. Identities = 72/310 (23%), Positives = 145/310 (46%), Gaps = 22/310 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HD + + + +++F E+HLP ++ L L +E SF+++ LK D+L+S Sbjct: 5 PKHDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILFS 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA-DHDKLPLVVPILFYQGE 125 + GYL++++EHQS P+ KMA R+ RY H ++ K P + P++FY G Sbjct: 65 AKFGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFYNG- 123 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 Y +++F + EL + ++ + L+++ PD+++ + IL+ KHI +R Sbjct: 124 VQKYNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHER 183 Query: 186 DLMLLLEQLVTLI---DEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR---ETGGE 239 DL+ E++ L+ + + + Y L R + +L+ + + Sbjct: 184 DLLKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKKREN 243 Query: 240 SMMTLAQWFEEKGIEK--------------GIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 M ++A + ++G E+ + + QE A+ ++ +G S E V ++ Sbjct: 244 VMKSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESVIKI 303 Query: 286 ANLPLAEIDK 295 L +++K Sbjct: 304 TKLSKEDLEK 313 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 62/316 (19%), Positives = 126/316 (39%), Gaps = 20/316 (6%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + HD +K H ET +FL E L + + L L S+I + + Sbjct: 3 NNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEEES 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKL 113 D+LY + +V++E QSK D +M R++ Y + + KL Sbjct: 63 DILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNFKL 122 Query: 114 PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA 173 P +VPI+ Y G+ + + + + L DI D E++ + Sbjct: 123 PSIVPIVLYNGKNKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNISNMI 182 Query: 174 -ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGV 230 + LL + I +++LM L++++ ++ + + ++N + R V Sbjct: 183 SAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDNLQGEIDDV 242 Query: 231 LRDRETGG---------ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 L +++ + E+G++KGI+QG ++ ++ A++ + GM E Sbjct: 243 LEKSNQEEVDFMVSNLGKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAIEMGMDNEI 302 Query: 282 VAEMANLPLAEIDKVI 297 + + L +I+ + Sbjct: 303 IMNLTGLSEEQINTIR 318 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 144 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 58/321 (18%), Positives = 129/321 (40%), Gaps = 26/321 (8%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + HD+ FK H + ++ + + +++ L F++E+ Sbjct: 3 SSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQKRA 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DV+ +++ Y +++IE+QS + M R++RY I + + KLP ++PI+ Sbjct: 63 DVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKKLPAIIPIVT 122 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR----IAILEL 177 Y G + +S F + + +V+I+ ++Q + L Sbjct: 123 YNGLEKDWDVSQEIISEFDIFKD----DIFKYAVVNISKLDAKTLLQEEEDILSPVVFYL 178 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTS--GSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 Q +L+ L+++ + + + L+ N + R E + + + + E Sbjct: 179 EQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDELAQRVE 238 Query: 236 TGG----------------ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 GG E M + +G +G +G+ E E A++++ +G S Sbjct: 239 QGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRGFSD 298 Query: 280 EDVAEMANLPLAEIDKVINLI 300 ED+AE+ L + ++ ++ + Sbjct: 299 EDIAELTELDIEKVKELRKEL 319 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 143 bits (360), Expect = 7e-33, Method: Composition-based stats. Identities = 62/327 (18%), Positives = 126/327 (38%), Gaps = 38/327 (11%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 P +D FK+ E +FL+ + + DL +L SF+++ Sbjct: 3 QKPPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEKEA 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPIL 120 DV+Y +++ Y +V++E QS DK M R+ Y RH+E D L +VPI+ Sbjct: 63 DVIYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVPIV 122 Query: 121 FYQGEATPYPLSMCW--FDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 Y G + ++ + +++F + LVD+ D+ + + + L Sbjct: 123 LYNGRSNWNVPTLIFKGWEIFKDD-------MFNYFLVDVNNIDDETLKNRLDLLSVILY 175 Query: 179 QKHIRQ--RDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRDR 234 R+ ++ + L+++ I T ++ + + + E +L+ Sbjct: 176 LDRSRKTAKEFIEKLKEVTEYISCLPTEQVKVFAMWLLRVIRPQMMEEVQGEIDELLKRI 235 Query: 235 ETGG----------------ESMMTLAQWFEEKGIEKGIQQGRQEVSQE--------FAQ 270 E G E + +EKG E+G +G+ E E A+ Sbjct: 236 EQEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRIAR 295 Query: 271 RLLSKGMSREDVAEMANLPLAEIDKVI 297 ++ G ++++ L + +I ++ Sbjct: 296 NMILAGAEDSFISKVTGLDIEKIKELR 322 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 66/328 (20%), Positives = 127/328 (38%), Gaps = 38/328 (11%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 +D FK+ E +FL ++ E D +L SFI++ Sbjct: 3 QKVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEKEA 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPIL 120 DV+Y +++ Y +V+IE QS D+ M R+ Y RH+E D+ LP +VPI+ Sbjct: 63 DVIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVPIV 122 Query: 121 FYQGEATPYPLSMCW--FDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 Y G + + + FD+F + LVD+ D+++ + + L Sbjct: 123 LYNGRSGWNIPTQIFKGFDIFKDD-------MFNYILVDVNRLDDEKLKSRLDLLSIILY 175 Query: 179 QKHIRQ--RDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRD- 233 + R+ + + L ++ I + ++ + + + E +L+ Sbjct: 176 LEKSRRNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDELLKKI 235 Query: 234 -----------------------RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQ 270 RE + + ++GI++GI++G Q +E + Sbjct: 236 EAEGVEDVGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKEEEIVR 295 Query: 271 RLLSKGMSREDVAEMANLPLAEIDKVIN 298 RL+ KG + +AE + + I K+ Sbjct: 296 RLIQKGFNDNFIAEATGVEIERIKKIRE 323 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 69/335 (20%), Positives = 142/335 (42%), Gaps = 42/335 (12%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + HD FK E RDFL LP E+ + D ++L + I S + D Sbjct: 2 NEISGLHDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHMD 61 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 ++ ++ P +++IEH+S PD ++ +M+RY +A R+ D+ L V+P++F+ Sbjct: 62 LVVECRISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRN-RQDNKPLVPVLPLVFH 120 Query: 123 QGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITP---DDEIMQHRRIAILELL 178 QG P+ L + + + F PE L + L D++ E H ++ L Sbjct: 121 QG-GRPWTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLTL 179 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 K+ + +L L G + L + NY ++ + + + R G Sbjct: 180 LKYAFSGSVEDVLRALKE--TGGSFDETFLFGVLNYAIRAFEVKDPVVVDAISRSF-GGE 236 Query: 239 ESMMTLAQWFEEKGIEKGI--------------------------------QQGRQEVSQ 266 + M ++ + E+G+++G+ ++G++E + Sbjct: 237 KIMPSIIDEWVEEGLKEGLKKGREEGREEGREEGKEEGRKEGREEGKEEGRKEGQKEGQR 296 Query: 267 EFAQRLLSKG-MSREDVAEMANLPLAEIDKVINLI 300 + ++LL+KG +S ++A ++ L ++++ + Sbjct: 297 KTIEKLLAKGVLSVSEIASALDVDLQWVEQIRKDL 331 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 140 bits (352), Expect = 6e-32, Method: Composition-based stats. Identities = 79/304 (25%), Positives = 139/304 (45%), Gaps = 22/304 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGS-------FIEESLKGH 59 HD+VFK + + + A FL +LP EL EL D T+ LES + + + Sbjct: 3 NVHDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQKE 62 Query: 60 STDVLYSVQMQGNP-GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--DKLPLV 116 +D+ + + + G + V IE Q+ D + R Y + + +++ LPLV Sbjct: 63 QSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLPLV 122 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 V I++Y + P+ S+ D F + ELA++ Y +D+ D+EI++H IA E Sbjct: 123 VSIIYYANQK-PFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAGYE 180 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 L+ K IR++++ L+ + I+ Q+ + YM Q E D ++ + Sbjct: 181 LILKAIREKNIDGKLDIAINQIEAYDHIARQV--LIRYMSQYSDMETKDFHDKLIYSKPD 238 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +MT+A+ +E Q+G Q+ Q A+ L G+S E V + L + K+ Sbjct: 239 LRGDVMTVAEQWE--------QKGIQKGIQTTARNFLLMGLSAEQVVKGTGLDQDTVLKL 290 Query: 297 INLI 300 + Sbjct: 291 KKEV 294 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 58/320 (18%), Positives = 119/320 (37%), Gaps = 24/320 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 +D +K + L+ + + L L +++ + Sbjct: 3 SNLPHNVNDLEYKYIFSNKSLFLRLLKRIDRINIFNKLTEEDLELVDKNYVLPDFSEQES 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA--------DHDKL 113 D+LY ++Q + +++ EHQS D MA R++ Y L+ K Sbjct: 63 DLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDIWRDWLKQFDKNQFKNKSFKF 122 Query: 114 PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA 173 P VVPI+ Y G+ + + + + + L+D+ + ++ Sbjct: 123 PPVVPIVLYDGDNPWTASVNLKERIMNFEVFGKYIVDFEYILIDLNDPDEMIFKYKDILS 182 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAM-QNYMLQRGHTEQADLFYGVLR 232 ++ L K +++L L L + + + +L+ ++ +L Sbjct: 183 LILKLNKVKTEKELERLFLDLYEYLQGAKEKEINTLKICLPVVLKELGEDKVQEAKDMLE 242 Query: 233 DRETGGESMMTLAQ---WFEEKGIEKGIQQGRQ------------EVSQEFAQRLLSKGM 277 + GGE +M L Q E+ +GIQ+G Q + E A+R++ KG Sbjct: 243 CIDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDGLQQGLQQGLQKKELEIAERMIVKGY 302 Query: 278 SREDVAEMANLPLAEIDKVI 297 S E++ E+ L + +I ++ Sbjct: 303 SDEEIHEITGLDIEKIKELR 322 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 49/252 (19%), Positives = 99/252 (39%), Gaps = 8/252 (3%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 HDA +K+ H E RD L+ + + D +TL SGS++ + L+ D+++ ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH----DKLPLVVPILFYQG 124 Q Y+++++E QS D MA R++ Y ++A + KLP V P++ Y G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 125 EATPYPLSMCWFDMFYSPE--LARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 + + D+ E L R + + LVD D+ + + + ++ Sbjct: 124 -GPRWRAATEVGDLITPLEGGLERYRPSLRYLLVDEGDYQDEALAPLKNLVASLFRLENS 182 Query: 183 RQ-RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 R +L+ +L L+ + G + + + E + Sbjct: 183 RTPEELLQVLRNLLQWLQSPAQKGLERDFTLWLKRVLLPARLPGVEIPSVASLEEMNSML 242 Query: 242 MTLAQWFEEKGI 253 + ++ Sbjct: 243 AERVVEWTQQWK 254 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 136 bits (343), Expect = 6e-31, Method: Composition-based stats. Identities = 58/288 (20%), Positives = 96/288 (33%), Gaps = 17/288 (5%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 HDA+FK A LP L + D E + ++ L DVL+ Sbjct: 5 HAHDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWR 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE-ADHDKLPLVVPILFYQGE 125 + G ++V++EHQS ++ M R+ Y H H LP ++PI+ E Sbjct: 65 TRFVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAE 123 Query: 126 ATPYPLSMCWFDMFYSPE----LARRVYNSPFPLVDITITPDDEIMQHRRIAI---LELL 178 W SP+ LA V N + D+T D + L Sbjct: 124 HGWRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQVDDASLRGRSLPLFQTLALWL 183 Query: 179 QKHIRQR-------DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQ-RGHTEQADLFYGV 230 + R D + + G + + Y G E ++ + Sbjct: 184 LRDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFHRKL 243 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 E +T Q +G ++G+++GR + E + L S Sbjct: 244 AAFHPPSAEMSLTFEQQAINRGHKRGLEEGRIKGRLELLEAQLHAKFS 291 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 136 bits (343), Expect = 7e-31, Method: Composition-based stats. Identities = 65/278 (23%), Positives = 114/278 (41%), Gaps = 9/278 (3%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FK+ E A DFL P E+ + DL+TL ++ S+I+E LK H +D++Y+ Sbjct: 5 NPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDIVYT 64 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + ++ EH+S ++M+Y + + + +L V+P++ Y G+ Sbjct: 65 CFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQ-AQRLIPVIPVILYHGKE 123 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI----MQHRRIAILELLQKHI 182 + R + + L DI+ ++EI + + I LL ++I Sbjct: 124 AWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLLMRNI 183 Query: 183 RQ----RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 D + ++ E L + Y+ + + + E GG Sbjct: 184 FDEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAEKRVIDTLKEISEEGG 243 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 + MT+A EKG G +GR E E A KG Sbjct: 244 KLSMTIAAKLIEKGKIAGRVEGRAEGRAEGAIEGERKG 281 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 65/309 (21%), Positives = 139/309 (44%), Gaps = 13/309 (4%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 MD + HDA + + + + A D+ +P +++L D +TL +++ + L+ Sbjct: 1 MDKHTP-KHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSI 59 Query: 61 TDVLYSVQ--MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 +D++Y Q + +++EH+S DK ++ Y + + + + + + L++P Sbjct: 60 SDIVYVCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQIG-NKESPSLIIP 118 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEI--MQHRRIAILE 176 IL Y G ++ P L + + + + D+ D+EI + ++ +A Sbjct: 119 ILLYHGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASL 178 Query: 177 LLQKHIRQRDLML-LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 L K+ +D + LL ++TL E + ++ Y L + + Sbjct: 179 LAMKYSALKDQLNTLLPTILTLASEV--DRNLHKSLLFYTLVGNPLTEEQFLNLIKSVPN 236 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF---AQRLLSKG-MSREDVAEMANLPLA 291 E++M + + FEEKG +KGI++GR E Q+ + L+ + ++ E +A N+ Sbjct: 237 QKKEAIMDIFEIFEEKGWKKGIEEGRAEAEQKIETAVRNLIKQSVLTDEQIASAMNVTTD 296 Query: 292 EIDKVINLI 300 + +V N + Sbjct: 297 YVAEVRNNL 305 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 134 bits (337), Expect = 4e-30, Method: Composition-based stats. Identities = 61/281 (21%), Positives = 111/281 (39%), Gaps = 11/281 (3%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 PHD+ +KQF + E L +P + E D +TL SGS++ + L+ D+++ + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 68 QMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH----DKLPLVVPILFY 122 + Y+ +V+E QS PD MA R + Y+ + ++ + LP V PI+ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 123 QGEATPYPLSMCWFDMFYSPE-LARRVYNSPFPLVDITITPDDEI-MQHRRIAILELLQK 180 G P+ L L+D + DE+ +A L L++ Sbjct: 127 NGGKAWKAPQEVATLFAPMPDSLKHYCPQHRHFLLDESRVSGDELDKSQGLVAQLLKLER 186 Query: 181 HIRQRDLMLLLEQLVTLIDEGYT---SGSQLVAMQNYMLQR-GHTEQADLFYGVLRDRET 236 + ++++L+T + E + V + +L+R G TE+ F + Sbjct: 187 AQEPEQVRQIVKELITRLHEPKYLLLRRAFTVWLSRVVLKRSGITEEIPEFQDLREVDAM 246 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 E + ++G +GI G + L Sbjct: 247 LEERAAQWKDEYIKQGKTEGISIGEARGIRSALHAFLESRF 287 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 134 bits (337), Expect = 4e-30, Method: Composition-based stats. Identities = 66/271 (24%), Positives = 122/271 (45%), Gaps = 11/271 (4%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M PH+A FK F E + F++ H+P E+ L DL+TL ++ F+ E + + Sbjct: 1 MSFEIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYY 60 Query: 61 TDVLYSVQMQG--NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLV 116 DV+ +VQ++G +++++EH+S P+ +++ Y + LP++ Sbjct: 61 ADVMVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVI 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPE--LARRVYNSPFPLVDITITPDDEIMQHRRIAI 174 +P++ Y G+ + S + D+F P L V + DI+ DDE + I Sbjct: 121 IPVVIYHGKG-RWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEI 179 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGS---QLVAMQNYMLQRGHTEQADLFYGVL 231 LL K+I +L L+++ L++ L A+ Y+ +G + Sbjct: 180 FHLLFKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPI-SLERLGEYT 238 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQ 262 R G E+M T AQ ++ + IQ+ + Sbjct: 239 RRLPGGDEAMQTAAQQIRQEAYNEFIQEQEK 269 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 134 bits (336), Expect = 4e-30, Method: Composition-based stats. Identities = 64/305 (20%), Positives = 133/305 (43%), Gaps = 10/305 (3%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M+ PHD FK E ARDFL+ +LP E E+ DL+ L E+ S ++E+L+ Sbjct: 1 MNELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESL 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +D+LY +++G GY+++++EH+S + K+ F+++RY + + K+P+++P++ Sbjct: 61 SDMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYDPKTKKVPIIIPMV 120 Query: 121 FYQGEATPYPLSMCWFDM----FYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 Y G + + EL + + + D +I I+ + + Sbjct: 121 IYHGREIWNVETNLLNMVQGIEDLPNELKTYLPTYRYEICDFSIKRKKRIIGLTAMKVAI 180 Query: 177 LLQK---HIRQRDLMLLLEQLVTLIDEGYTS--GSQLVAMQNYMLQRGHTEQADLFYGVL 231 + + + + L ++ I + Y+L + V Sbjct: 181 EAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEILKVQ 240 Query: 232 RDRETGG-ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 ++ G E +MT+A+ +G+EKG +G ++ E + + +S+ ++ Sbjct: 241 KEIMPGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEGEREFAIRILSKRFGNQLTEEIK 300 Query: 291 AEIDK 295 I + Sbjct: 301 DRIRE 305 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 64/295 (21%), Positives = 137/295 (46%), Gaps = 11/295 (3%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + ++ HD F+ L ARDF+ HLP E+ +L+T+ + S S++ ++LK TD Sbjct: 8 SDTSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMTD 67 Query: 63 VLYSVQ-MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 ++ +++ + G P +++++EH+S D ++ +Y ++ LP++VP++F Sbjct: 68 IVITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQKKTGTLPIIVPLVF 127 Query: 122 YQGEATPYPLSMCWFDMFYSPELA--RRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 Y G A + S+ + D+F P + + L ++ + ++ + + L+ Sbjct: 128 YHGTA-RWNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLVL 186 Query: 180 KHIRQRDLMLLLEQLVTLIDEG---YTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 ++I + + + + L+ +G + + Y+L E + ++ Sbjct: 187 EYIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATD-ETPEEAEEKVKHLPK 245 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLA 291 GGE++ T A+ EE+G K I++ + E L + + D+A A PL Sbjct: 246 GGETVRTTAEVLEERGYNKAIKE---KPVWEKQAELKNAHETLIDIATEAYGPLP 297 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 54/310 (17%), Positives = 114/310 (36%), Gaps = 12/310 (3%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 HD +K + ET ++ + L L S++ + +D Sbjct: 17 NKKNNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESD 76 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH--------RHLEADHDKLP 114 ++Y ++ + + ++++E QS D +M R++ Y I + + +LP Sbjct: 77 IVYKARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLP 136 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA- 173 VVPI+ Y GE + S + + + +D+ DE+ +++ IA Sbjct: 137 AVVPIVVYNGEKNWTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIAS 196 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 + LL + I + + L+ +V ++ + + + + Sbjct: 197 AIFLLDQSISRIEFYNRLKDIVIEFNKLTVEEKAQLKHWLVNVNSEENNYKENIEKIFSS 256 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG---MSREDVAEMANLPL 290 + E M + EK E+G +G+ E E + L+K + E ++ LP Sbjct: 257 NKREVEIMTSNISKGLEKLKEEGKIEGKAEGKAELLIKQLNKKFKLLPMEYEKKIKALPE 316 Query: 291 AEIDKVINLI 300 +D + I Sbjct: 317 KILDDIATDI 326 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 131 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 53/275 (19%), Positives = 110/275 (40%), Gaps = 12/275 (4%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD+ FKQ + L+I + ++++ + + D+L+S Sbjct: 4 QPHDSFFKQIFSDPRRVKTLLDIFAKDVAKS---IHSITPVNTEKFSSKSQKFMLDLLFS 60 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ Y+ +V+EH+S DK++ ++ Y+ A ++ + P ++ I+FY G+ Sbjct: 61 CKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKE-KEYYPPIINIVFYHGKG 119 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE----LLQKHI 182 + + L + V + L+D+ DDE++ I + KH+ Sbjct: 120 EWNIPTSLP--VLEDQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKHV 177 Query: 183 RQ--RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 + + + LV + ++ L++ G + Sbjct: 178 HENIEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAENALKELIGGDKK 237 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK 275 MTL + + +G+EKG Q+G QE ++ Q L K Sbjct: 238 AMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGLIK 272 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 67/307 (21%), Positives = 122/307 (39%), Gaps = 17/307 (5%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHSTDV 63 STTPHD+ FK + L + L +L++L G I E L + +D+ Sbjct: 21 STTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARSTRSDL 80 Query: 64 LYS-----VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 S ++ G + + EH+S + ++ A + R L P V+P Sbjct: 81 SASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREGRKPCP-VIP 139 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDD---EIMQHRRIAIL 175 ++ Y G A + + SPELA R+ + L+D++ D+ E + H + Sbjct: 140 VVLYHGRAPWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAHPEPLVS 199 Query: 176 ELLQKHIRQRDLMLL---LEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 + KHI + +L + + TL + +Y+ + + Sbjct: 200 LSVMKHIFEPPESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQEIRTIFT 259 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 E M T+ +E+GI++GIQ GR E + L +S + +A + N+ L+ Sbjct: 260 TF-LAEEKMTTVLDLIKEEGIQEGIQMGRDEA---ITRLLQHSSLSPQQIASILNVDLSR 315 Query: 293 IDKVINL 299 + + N Sbjct: 316 VLSLANS 322 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 66/299 (22%), Positives = 126/299 (42%), Gaps = 8/299 (2%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 +T HD+ K FL A L+ LP E+ + D N ++ E S++ +SL+G+ +D+ Sbjct: 2 STTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDL 61 Query: 64 LYSVQMQGNP--GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA-DHDKLPLVVPIL 120 + SV + + ++EH+S K + +RY + ++ + +LP+++PIL Sbjct: 62 VVSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPIL 121 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 E P + S + V + F L D ++ + L L + Sbjct: 122 IAHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTLWR 181 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQL---VAMQNYMLQRGHTEQADLFYGVLR-DRET 236 + R + M +++ LI + L + +Y+ ++ + + + Sbjct: 182 YSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEIDE 241 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEV-SQEFAQRLLSKGMSREDVAEMANLPLAEID 294 G E M T+A+ F +G E+ Q+ QE E L + + D+A A PL +I Sbjct: 242 GEEYMGTIAEMFRREGDERTEQRFLQEKPIWEKQSELKATQETLIDIATEAYGPLPDIL 300 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 128 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 56/317 (17%), Positives = 125/317 (39%), Gaps = 25/317 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + HD+ FK + + L + +++ ++ ++I + Sbjct: 7 EKLPAKEHDSTFKLLFENPKDIYLLLSKIINYSWANEIRESSIEIKKTNYITKEFSQVEA 66 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DV+ +++ Y +++IE+QS K M R++RY I+ + +KLP ++PI+ Sbjct: 67 DVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNGVEKLPAIIPIVV 126 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI---AILELL 178 Y G + +S F + + +VDI +Q + I L Sbjct: 127 YNGLDRRWEVSTDIIGAFDIF----KNDIFKYKVVDIAQIDIKNYLQEEDVLTPIIFYLE 182 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYML---QRGHTEQADLFYGVLRDRE 235 Q +L+ L+++ + + + + + + + + G+ ++ + V++ R+ Sbjct: 183 QVRNDSNELVRRLQEIEQSLKKLSFNNIERFLLWSQHVIRPRLGNEQKKEYDKLVMKVRQ 242 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGR---------------QEVSQEFAQRLLSKGMSRE 280 G E M E ++ Q+ E A+R++ G+S E Sbjct: 243 EGVELMGEFVSNVARLLDETKTKEFLAGVQQGIQQGIQQGIQQERIETAKRMIQLGISYE 302 Query: 281 DVAEMANLPLAEIDKVI 297 +++ NL + EI+K+ Sbjct: 303 VISKATNLSIEEIEKIA 319 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 127 bits (318), Expect = 5e-28, Method: Composition-based stats. Identities = 55/304 (18%), Positives = 110/304 (36%), Gaps = 13/304 (4%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 DA++ + H A + +P + D + + F + K DV++ + Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 70 -QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH----DKLPLVVPILFYQG 124 G LH++ E QS D MA R Y + D+LP V+ ++ Y G Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSGDRLPPVLTLVLYNG 124 Query: 125 EATPYPLSMC--WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 E + + + L + + L+D+ P++E+ +A L +H Sbjct: 125 EQRWHAPTDTIPLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAALLFRLEHP 184 Query: 183 RQ-RDLMLLLEQLVTLIDE--GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 R+ +L L++ +V GY +L G+ + ++ R Sbjct: 185 REPEELAGLIDDVVGWFRRHPGYDELRRLFTELVRQAIEGYETSVAVPGDMMEMRSMLAN 244 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK---GMSREDVAEMANLPLAEIDKV 296 T + + +GI +G +G + RLL K + + + + I+ Sbjct: 245 LGETWKKRWLAEGIAEGEARGEARGEAKALIRLLEKRFGQLPTDTRERVLAADTSSIEMW 304 Query: 297 INLI 300 ++ + Sbjct: 305 LDRL 308 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 59/239 (24%), Positives = 113/239 (47%), Gaps = 19/239 (7%) Query: 75 YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMC 134 Y++ +IE+QS +K MAF M+ Y++A M +HL + +LP++V I Y G+ +PYP S Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEGYQELPIIVNICIYTGKKSPYPYSQD 95 Query: 135 WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQL 194 D F ELAR F L+D+++ +E+++ +E L + R+RD + + Sbjct: 96 ICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINNN 155 Query: 195 VTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL-FYGVLRDRETGGESMMTLAQWFEEKGI 253 LI E ++ ++ Y+L AD ++ E ++T AQ + I Sbjct: 156 QVLIWELVSNYGL--SIVIYILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVDI 213 Query: 254 EKGIQQGRQEVSQE----------------FAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + G+ +G +E ++ + +L +G+ + ++ + I+K+ Sbjct: 214 QTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSMLKEGLEISLIQKVTGISREAIEKL 272 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 66/304 (21%), Positives = 123/304 (40%), Gaps = 16/304 (5%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 PHD + ++ + A F + LP E+ EL DL L L SF+ E LK TD Sbjct: 2 TEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQTD 61 Query: 63 VLYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 +L+ + ++ GN ++++ EH+S + + +++ Y + + +V+P +F Sbjct: 62 LLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRNQQRSG-ESFSVVIPFVF 120 Query: 122 YQGEATPYPLSMCWFDMF----YSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 Y GE + + + L D+ + ++ + Sbjct: 121 YHGEKEWKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVTLG 180 Query: 178 LQKHIRQRDLMLL-----LEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL- 231 + + IR+RDL + L L+ I+E + L + Y+ + +L + Sbjct: 181 VVQRIRERDLEFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRVLAI 240 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLA 291 E E MT A+ +GIQQG+ E E A+ +LS+ + E V + L Sbjct: 241 SKLEQYEELTMTTAERLI----SEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSKQ 296 Query: 292 EIDK 295 ++ Sbjct: 297 DLKD 300 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 62/288 (21%), Positives = 118/288 (40%), Gaps = 11/288 (3%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M HD FK F E RDF++ +LP E+++ DL + ++ ++ E K Sbjct: 1 MSKKIPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFY 60 Query: 61 TDVLYSVQMQGN--PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--DKLPLV 116 +DV+ V L+ + EH+SKP + + + Y + R L LP++ Sbjct: 61 SDVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPII 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPEL--ARRVYNSPFPLVDITITPDDEIMQHRRIAI 174 VP++ Y G + + S+ + D+F P + L DI + + I Sbjct: 121 VPVVIYNGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEI 179 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ---LVAMQNYMLQRGHTEQADLFYGVL 231 LL K+I +L + ++ L+++ + L + Y++ G + L Sbjct: 180 FHLLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLLEH-A 238 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR 279 + G E + A+ EE+ + +++ E +Q +L K + Sbjct: 239 KRFSGGEEMIGLAAREIEERVEQTRKPYWQKQAKVENSQEMLIKSLKM 286 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 57/285 (20%), Positives = 130/285 (45%), Gaps = 17/285 (5%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FKQ + + L+I P EL + DL ++ L + + + ++LY Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNLLYE 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ +L ++ EH+S DK + +++ Y+ ++++ P ++ I+ Y G+ Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYEEYPPIINIVLYHGKR 121 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL----ELLQKHI 182 + S + R + L+D++ D+E++ + L KHI Sbjct: 122 KWNIPATLPKT--NSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLTMKHI 179 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 + + + ++ + E Y G + + +Y+ + ++ + VL++ G + MM Sbjct: 180 F--EDLRKYKHILKKVFEHYQDGC-VFIILDYISVVNNPQEVE---NVLKEILGGEKDMM 233 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQR--LLSKGMSREDVAEM 285 TL + ++ +G+++G+QQG E ++ + L G E++ ++ Sbjct: 234 TLTEKWKMEGLQQGLQQGMIEGQKKAILKSIQLKFGRVPENIEKL 278 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 124 bits (310), Expect = 5e-27, Method: Composition-based stats. Identities = 52/215 (24%), Positives = 83/215 (38%), Gaps = 11/215 (5%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL-KGHST 61 + TPHD+ FK + L L D ++L SG I E L + Sbjct: 2 TTTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFRS 61 Query: 62 DVL-----YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D++ + G P ++EH+S P + + F++ A R L LP V Sbjct: 62 DLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGKPPLP-V 120 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH---RRIA 173 VPIL + G++ + + PELA + + ++D+T DDEI + Sbjct: 121 VPILIHHGKSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEIRRKIPDPEPQ 180 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQL 208 + KHI L L + L+ E + L Sbjct: 181 MSLAAMKHIHDP-LPAFLRVMADLLKEIEENRDIL 214 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 52/307 (16%), Positives = 125/307 (40%), Gaps = 22/307 (7%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ L + + L L L L + + + I + +D++Y ++ Sbjct: 9 DEGFKKVLTNRTNIKWLLTELL-EVLPIQIGLEDIEVIATESINRQWRARRSDMVYKIKY 67 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 + Y+ V++E QS ++ + R++ Y + ++ + LP+V+P++ Y GE Sbjct: 68 KD--AYICVLLEFQSSKEELIHLRVLEYMLLIQKKYTTKN--LLPVVIPVVLYTGEEKWT 123 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL--QKHIRQRDL 187 P + ++ Y + + V VD+ + D+++++ + L + + Sbjct: 124 PATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDKVSDNPEKV 183 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR------------- 234 LE L + + +++ +G+ + L Sbjct: 184 AERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRLGVNEMFLN 243 Query: 235 --ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 E + + + ++GI++GIQQG+++ E AQ+++ +G +A++ L + Sbjct: 244 TAEKIRKGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMIEEGAEDSFIAKVTGLDMER 303 Query: 293 IDKVINL 299 I ++ + Sbjct: 304 IRQLRSK 310 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 56/268 (20%), Positives = 117/268 (43%), Gaps = 5/268 (1%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + S PH+ FKQ +++ +DFL I L +L + L++L L + K H Sbjct: 3 NKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKHFL 61 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 D+LY ++ Y+ ++ EH+S DKK+ ++M+Y+ L+ D P ++ I+F Sbjct: 62 DLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKE-KDYYPPIINIVF 120 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y G+A + EL + + + L+D+ D+ + ++ ++L+ + Sbjct: 121 YHGQAKWNFPTTIP--DIEDEELDKYIQKLNYILIDLNEIEDENLKRY-LKKNVDLIMEM 177 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + + + LE++ TL+ + S+ + + + V ++ G E M Sbjct: 178 LIMKHIHDRLERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKEIIGGEEKM 237 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFA 269 M + +G +G + +E + Sbjct: 238 MLFTDKLKMEGKMEGKIEILRENIIDLI 265 >UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NIZ1_GLOVI Length = 311 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 40/307 (13%), Positives = 109/307 (35%), Gaps = 27/307 (8%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVL 64 T HD +FK+ L +F+++ ++ + ++ + + + D++ Sbjct: 2 TDHDRLFKELLS--TFFVEFIDLFF-ADVGNYLERGSIVFLEKELFSDITAGERYEADLV 58 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 + + + + V IE+Q++ ++RM RY ++ + PI + Sbjct: 59 VKARFRDHQSFFLVHIENQTEAQSIFSYRMFRYFARLYEKYQL-------PIYPIAVFSF 111 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 F V + +V + + ++ L+ + Sbjct: 112 TEPLRAEPTAHRVAFPDFT----VLEFHYRVVQLNRLDWRDFLRQPNPVASALMARMRIA 167 Query: 185 RDLMLLLE----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 ++ +L+ + + + L+ E+ + + E+ Sbjct: 168 PADRPRVKLECLRLLATLRLDPARTQLISGFVDTYLKLTAQEERLFAAELATIGASEQEA 227 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR-------EDVAEMANLPLAEI 293 ++ + + ++G+E+G Q GRQE QE A ++ + +SR ++ ++ L + Sbjct: 228 VVQIVTSWMQQGLEQGRQVGRQEGRQEEALAIVLRQLSRRLGTLPAQNAERVSGLSTTAL 287 Query: 294 DKVINLI 300 + + + Sbjct: 288 EALSEAL 294 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 119 bits (299), Expect = 9e-26, Method: Composition-based stats. Identities = 56/319 (17%), Positives = 114/319 (35%), Gaps = 40/319 (12%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG--HSTDVLY 65 PHD FK+ L +F+ + P EL D + + + + + D+L Sbjct: 27 PHDEAFKKLLH--TFFAEFIALFFP-ELESQLDFSQTRFLMQEQLVDVVGEEARTLDLLL 83 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + G ++ + +E QS RM Y RH + L++PI + Sbjct: 84 ETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERHRKEHQ----LIIPIAIFTSA 139 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + + + L + F V++ P + LL K + Sbjct: 140 ESKNERNSLNMSI-----LGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYNK 194 Query: 186 DLMLLLEQLVTLI------DEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 L + + ++++ + + + ++ + + E Sbjct: 195 GEERELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKESE 254 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEV--------------------SQEFAQRLLSKGMSR 279 +M L + +G EKG+++G ++ ++ A+RLLSKG + Sbjct: 255 VIMELMPAWMRQGYEKGLEEGLEKGIEQGIEKGFEKGIEQGTLIERRQIARRLLSKGFTL 314 Query: 280 EDVAEMANLPLAEIDKVIN 298 E++A+M L + EI K++N Sbjct: 315 EEIADMTQLSIEEIKKIMN 333 >UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseiflexus sp. RS-1 RepID=A5USQ0_ROSS1 Length = 330 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 53/318 (16%), Positives = 105/318 (33%), Gaps = 39/318 (12%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEE--SLKGHSTDVLY 65 HDA+FK L R+F+++ P +L D + D++ Sbjct: 6 DHDALFKLVLT--AFFREFIDLVAP-DLAAALDPAPPVFLDKESFADLFDPDRREADLVA 62 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 V+++ +P L + +EHQ++ D + RM RY R+ + + PI Sbjct: 63 QVRLRQHPATLLIHLEHQAQADAALDRRMFRYFARLYDRYDQ-------PIYPIALCSYP 115 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH-IRQ 184 P + R V + +V + + A + L+ + + Sbjct: 116 RPRRPAADRH----EVRAAQRTVLTFQYQVVQLNRMDWRAYLTTTNPAAMALMARMRVAP 171 Query: 185 RDLMLLLEQLVTLIDEGY---TSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 D + + L+ + + L E+ L V R E + Sbjct: 172 EDRWRVKAACLRLLAGAPLTGAQRRLIGQFVDIYLPLNAREEQALAAEVARLPGAAKEVV 231 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-------------------MSREDV 282 M L +E KG +G+++G +E E + ++G + Sbjct: 232 MELITSWERKGRAEGLREGLREGRAEGLREGRAEGLREGQRLVVERMLTRRFGALPSGVR 291 Query: 283 AEMANLPLAEIDKVINLI 300 +A L E+ + + + Sbjct: 292 ERLATLTADELTALADAL 309 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 39/286 (13%), Positives = 106/286 (37%), Gaps = 11/286 (3%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 + HD +K + E D ++ + + + + L + S+I + Sbjct: 3 IKKEMHHIHDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELE 62 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDK 112 +D++Y + G ++++E QS D M R+ Y +++ + Sbjct: 63 SDIVYKATIDGREVIFYILLEFQSYVDYSMPIRLFLYMSEIWREVLKNTKQAEVKSKEFR 122 Query: 113 LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI 172 LP +VP++ Y GE + S + + + L+DI +E+M+ + + Sbjct: 123 LPAIVPLVLYNGEYKWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKNL 182 Query: 173 -AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTE--QADLFYG 229 + + LL + + + + ++ + + ++ + + + Sbjct: 183 VSAVFLLDQKVDIEEFISRVKDIAIDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEKIED 242 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK 275 +L ++ M + ++ K ++G ++ +E ++ + K Sbjct: 243 ILIAKKEEVNRMTSNISKTIKETFAKTREEGMEKGIEEGIEKGIEK 288 >UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostridia RepID=B9MPV5_ANATD Length = 331 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 54/331 (16%), Positives = 123/331 (37%), Gaps = 39/331 (11%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 + +D FK+ F+ +P + + + + + I K +D+ Sbjct: 2 KLSRSYDVGFKKLFSDKINVCWFITEIIPEPRLKNYTQSDIEIVATESINAQWKARRSDM 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 +Y + ++++++E QS+P+K+M R+ Y ++ +LP+VVP++ Y Sbjct: 62 VYRLPYS--SSWIYLLVEFQSRPNKQMHCRIYEYVFLIQRKY--QIDKRLPVVVPVVLYN 117 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL------ 177 G P++ ++ Y+ + V + +D+ P+D+++ + L Sbjct: 118 GVEKWQPVTQFADNVEYAEDFPEYVQRLNYIFIDVRDIPEDKLLNGNNVLAAALYVDQVA 177 Query: 178 ------------LQKHIRQRDLMLLL-----------------EQLVTLIDEGYTSGSQL 208 L K+IR D E++ L + G + Sbjct: 178 TNPDSVVERLLELGKNIRIPDEQREELAEWLYHAVLKSYKIPREEINELFAKSKILGVEE 237 Query: 209 VAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF 268 + M + + + E E + + +G +G +GR E E Sbjct: 238 MFQSTAMKIKKGLAEEKKKIRLESKIEGKIEGKIEGKIEGKIEGKIEGKIEGRMEAQLEI 297 Query: 269 AQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 A+ L+ +G +A++ L + ++ ++ N Sbjct: 298 ARNLILEGAEDSFIAKVTGLDIEKVKELRNQ 328 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. Identities = 66/210 (31%), Positives = 117/210 (55%), Gaps = 2/210 (0%) Query: 91 AFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMF-YSPELARRVY 149 F++ RY A M +HL+ H LP+VV +L+Y+G+ TPYP + FD F + +A ++Y Sbjct: 3 PFKIARYVHAIMDQHLKQGHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEKIY 62 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHI-RQRDLMLLLEQLVTLIDEGYTSGSQL 208 P+P++DIT DD I H IAIL+ QK+ RD+ +E ++ + +GY + Q Sbjct: 63 LRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTREQC 122 Query: 209 VAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF 268 + Y + T+ + L+ E +M++A E++G+++G+QQGR E + Sbjct: 123 QTLLYYTFRETDTDNVKMLLEQLQTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEEDLKI 182 Query: 269 AQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 A+R+L+KG R + ++ L ++ + + Sbjct: 183 AKRMLAKGTDRGYIKDVTGLSDQDLLNLED 212 >UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XMU9_SYNP2 Length = 316 Score = 117 bits (293), Expect = 5e-25, Method: Composition-based stats. Identities = 47/302 (15%), Positives = 108/302 (35%), Gaps = 23/302 (7%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG--HSTDVLY 65 HD +FK+ L DFL E+ E + N+L + + + D++ Sbjct: 6 DHDLLFKELLT--TFFWDFL-ALFAPEILETAEQNSLTFLTQEVFNDLPGQTRRNVDIVA 62 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + +G V +E+Q+ A RM Y ++ + PI + Sbjct: 63 KLHFRGQETCFLVHVENQATSQADFAERMFLYFARLYEKYRL-------PIYPIALFSYR 115 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + + F S E + + F + + P + ++ L+ K Sbjct: 116 SPQRLEPETFSVAFPSKE----ILSFSFQTIQLNRLPWRDFLRQPNPVAAALMAKMNFSS 171 Query: 186 DLMLLLE----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + ++ +++ + L + L+ EQ + R + + Sbjct: 172 EERPKVKLECLRMIVTLRLDSARIHLLSGFVDTYLRLNMAEQQVFEQELHRIQPQEEAQV 231 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRE---DVAEMANLPLAEIDKVIN 298 + + + E+G+++G Q+GRQE + + R + + + ++ L L ++D + + Sbjct: 232 LRIVTSWMEEGLQQGRQEGRQEEACKLILRFVQQRFPEQVSGFAPQIQALNLTQLDALSD 291 Query: 299 LI 300 + Sbjct: 292 RL 293 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 117 bits (293), Expect = 5e-25, Method: Composition-based stats. Identities = 47/316 (14%), Positives = 121/316 (38%), Gaps = 26/316 (8%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 + +D +K+ + E FL+ L E + + + + + + I + + +D+ Sbjct: 2 KTYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDI 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 +Y ++ + + + IE QS+ DKK+ R+ Y + + ++P+VVPI+ Y Sbjct: 62 VYKIKYKD--SFFCLTIEFQSREDKKILHRLYEYMHLI--QLKNKVNGEIPVVVPIVLYN 117 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL--QKH 181 G + P + ++ + N +DI P+++++ + + + Q Sbjct: 118 GISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLISAANVLAIAVYIDQVS 177 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD-------- 233 ++ + L I + +L ++ R + + + + Sbjct: 178 NNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGLEVDEL 237 Query: 234 ------------RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 + + ++G+++G++QG + + A+++L E Sbjct: 238 FSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKDNQPIEL 297 Query: 282 VAEMANLPLAEIDKVI 297 +++ L EI K+ Sbjct: 298 ISKYTGLTPEEIKKLK 313 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 117 bits (292), Expect = 6e-25, Method: Composition-based stats. Identities = 43/297 (14%), Positives = 102/297 (34%), Gaps = 18/297 (6%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +T D K FL+ + + L + + S F+ + +D + Sbjct: 14 NTHTKDNAAKIVFGDPVLCAQFLKGYTDIPLFKEIKPEDIENVSSHFLPLFQESRDSDTV 73 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-----------EADHDKL 113 + + + YL +IEHQS+ D M+FR++RY + + ++ Sbjct: 74 NKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAAQQEKLHKGTTKSKDFLY 133 Query: 114 PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA 173 P ++PI++Y+G +T +F S + + + +V + ++++ Sbjct: 134 PPILPIVYYEGSSTWSAPLNFKNRVFLSDVFGDYIPSFNYLVVPLNKYSKQDLIEKNDEL 193 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSG------SQLVAMQNYMLQRGHTEQADLF 227 L L ++ L+ + E T + + +L + + +++ Sbjct: 194 SLIFLINQLQSSSEFHALKDIPKKYTEHLTEDTPDYLLKIIGKVIAVLLHKLNVPDEEVY 253 Query: 228 YGVLRD-RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVA 283 + R + ++ ++GR E E + +G + Sbjct: 254 EVTDQITRRKFSMMFDNFQAYDVQETRRVSREEGRLEGRIEGERAGRIEGERAGRIE 310 >UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Treponema vincentii ATCC 35580 RepID=C8PTN1_9SPIO Length = 303 Score = 113 bits (282), Expect = 8e-24, Method: Composition-based stats. Identities = 51/313 (16%), Positives = 101/313 (32%), Gaps = 24/313 (7%) Query: 1 MDAPSTTPHDAVFKQFLMH----AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL 56 M + D+VF E L+ C + + L++ ++ Sbjct: 1 MSTANRKYKDSVFVDLFSEDEKAKENFLSLYNALHGTNLQLSCPVENIKLDNVMYM---- 56 Query: 57 KGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKL--- 113 DV V + V+ EHQS ++ M R ++Y + + L Sbjct: 57 -NIVNDVSCLV-----DNKIIVLAEHQSTINENMPLRFLQYIARLYEKLQKPTDRYLRTL 110 Query: 114 ---PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDE-IMQH 169 P +FY G ++ + + R + + +I + E + + Sbjct: 111 SKIPTPEFYVFYNGLNDYPETTVLKLSDAFITKPERIPLDLEVKVYNINKSKGAEVLSRC 170 Query: 170 RRIAILELLQKHIR---QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADL 226 + + L + +R Q D V + E L ++ E Sbjct: 171 KTLDEYSLFIEEVRLQTQLDPENGFTNAVKICIEKGILKEYLQRKSREVINMLIAEYDYD 230 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 ++ E G + +GI +G+ QG + + E A+ + +A+M Sbjct: 231 TDIAVQREEAGKIAFAKGISQGLSQGISQGLSQGSHQKALETARLMKQANCEIPFIAKMT 290 Query: 287 NLPLAEIDKVINL 299 L AE++ + NL Sbjct: 291 GLTQAEVESIGNL 303 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 53/315 (16%), Positives = 115/315 (36%), Gaps = 36/315 (11%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 HD +F+ AR FL LP EL D +TL + S I ++L DV+Y + Sbjct: 35 DHDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRI 94 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM--------------------HRHLE 107 + G + +V++EHQ+K +K MA R+M + + Sbjct: 95 NVNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKADRQSRR 154 Query: 108 ADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELAR----RVYNSPFPLVDITITPD 163 + DK PLV+ ++ + G + + P + + + + F +V++ P Sbjct: 155 RETDKFPLVISMVLHPGPRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVELAGLPL 214 Query: 164 DEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQ------LVAMQNYMLQ 217 +++ + + R + + ++ L+DE ++ + + + +Y++ Sbjct: 215 EKLADGHLARAILGALQGNRLGLID--IRKIKRLLDEMFSDPDRASVGAVVKQLWHYLIS 272 Query: 218 RGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 ++ V+ E + E+ + G + + E A + + Sbjct: 273 SSDLKEEQTKDIVIAHIP---EEYRSNIMNTVERLKQAGALKAQHNAVIE-ALEVRFDRV 328 Query: 278 SREDVAEMANLPLAE 292 + + E Sbjct: 329 PEGLREAIQGINDPE 343 >UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKN5_MOOTA Length = 304 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 61/304 (20%), Positives = 107/304 (35%), Gaps = 27/304 (8%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDV 63 HD +FK+ L R+F+E+ P L D + I + H D+ Sbjct: 2 PVDHDRLFKELLT--TFFREFMELFFPAA-HTLIDYTDTKFLTQEVITDITAGDKHYVDI 58 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 L V+++G G + V IE Q+ A RM Y +H + V+PI + Sbjct: 59 LAEVKIKGEDGCVLVHIEPQAYRQADFARRMFIYFSRLYEKHQKR-------VLPIAVFA 111 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 ++ + + + +V F + + P + + LL K Sbjct: 112 HDSKVEETNRHEVEFPFL-----KVLQFEFYKIQLKRLPWRQYLNSNNPVAAALLSKMDY 166 Query: 184 QRDLMLLLE----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 + ++ +L+T + + A + L E+ L + + + Sbjct: 167 SPRERVQVKIEFLRLLTRMQLDPARMELITAFFDSYLVLNAEEEKSLQEKLSEELQPEE- 225 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM---SREDVAEMANLPLAEIDKV 296 + KG QQGRQE QE R L K + S E A++ L ++D + Sbjct: 226 --VQRVMELTTSWHLKGWQQGRQEGRQEILLRQLRKRLGTTSPEVEAKIKTLSAEQLDDL 283 Query: 297 INLI 300 I Sbjct: 284 AEKI 287 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats. Identities = 44/265 (16%), Positives = 103/265 (38%), Gaps = 11/265 (4%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 HD +K L + L + E D + SF+ + Sbjct: 7 KEAIHNQHDKGYKFLLSSKRVFIELLRSFVKQEWVNDIDEANVVKVDKSFVLQDFADKEA 66 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM--------HRHLEADHDKL 113 D++Y V+++ ++++E QS D +M +R++ Y + + KL Sbjct: 67 DLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDTPRKESRRKDFKL 126 Query: 114 PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-HRRI 172 P++VPI+ Y G+ + + + + L+D+ +E+++ I Sbjct: 127 PVIVPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFKYILIDVNRYTKEELLKLENLI 186 Query: 173 AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGV 230 A + LL++ + ++M L++L +++ L + +L R E+ + + Sbjct: 187 ASVFLLEQKVEFEEIMKRLKELSEILNNLDKDEILLFKAWFKKILLARLPEEERENIERI 246 Query: 231 LRDRETGGESMMTLAQWFEEKGIEK 255 + + + E + L + ++ E+ Sbjct: 247 IDENKEVEEMISNLEKTILQEMKER 271 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 110 bits (274), Expect = 7e-23, Method: Composition-based stats. Identities = 50/126 (39%), Positives = 76/126 (60%), Gaps = 1/126 (0%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 PHD F+ + A++F E HLP + + DLN+L L+ SFI+E LK D Sbjct: 2 KKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMAD 61 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-HDKLPLVVPILF 121 VLYSV++ PGY ++++EHQ PDK M +R++RY + + HL+ + LP+VVP++F Sbjct: 62 VLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLVF 121 Query: 122 YQGEAT 127 Y G+ Sbjct: 122 YNGKKR 127 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 107 bits (266), Expect = 6e-22, Method: Composition-based stats. Identities = 52/258 (20%), Positives = 106/258 (41%), Gaps = 9/258 (3%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 +PHD FK + FLEI LP L E N+L L + K D+ + Sbjct: 6 SPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFLDLAFD 64 Query: 67 VQMQGNPGY-----LHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 +++ G +++V EH+S PDK ++ Y M P V+PI+F Sbjct: 65 CKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEEDERLSRPYRP-VIPIVF 123 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Y GE + + L + +++ + L D++ + +++ + L+ Sbjct: 124 YHGEKSWNIPTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLIEKIYLNAC-LISGV 182 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 +++ L+ L ++++ + + + +L + G E M Sbjct: 183 FTLKNIFKDLKYLRPVLEKLILDDVKDCLYIIIDYTVIVKKDLETIEKILEEI-GGEEKM 241 Query: 242 MTLAQWFEEKGIEKGIQQ 259 MTL + ++ +G++KG+++ Sbjct: 242 MTLTEKWKMEGLKKGMEE 259 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 63/180 (35%), Positives = 103/180 (57%), Gaps = 13/180 (7%) Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 P + PE A+ +Y PF L+D+T+ PDD+++QHRR+A+LEL+QKHIRQRDL Sbjct: 11 PHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSS 70 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF-YGVLRDRETGGESMMTLAQWF 248 + E L ++ GYT+ QL + +YMLQ G+T + +F + R E++M++AQ Sbjct: 71 ITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKL 130 Query: 249 EEKGIEKGIQQGRQEVSQE------------FAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +++G ++G +GR+E QE A +L G+ +E V ++ L E+ + Sbjct: 131 KQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQPL 190 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 77/183 (42%), Positives = 108/183 (59%), Gaps = 19/183 (10%) Query: 133 MCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLE 192 MCW F P++ARR+Y FPL+DIT TPDDEIM+HRR+A+LELLQKHIRQRDLM L E Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 193 QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD---RETGGESMMTLAQ--- 246 QLV L+ GYTS QL + +Y+LQ G+ F L R E++M +AQ Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 247 -------------WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 ++GIE+GI+QG Q+ ++ A+ +L+ G+ VA++ L + Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 294 DKV 296 ++ Sbjct: 181 ARL 183 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 46/270 (17%), Positives = 107/270 (39%), Gaps = 7/270 (2%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 PHD FKQ + + L+I EL + DL ++ L + + + D+LY Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFY-SELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYE 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +++ +L ++ EH+S DK + +++ Y+ ++ + ++ I+ Y G+ Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYKEYLPIINIVLYHGKR 121 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE--LLQKHIRQ 184 + S + R + L+D++ D+E++ + L Sbjct: 122 KWNIPTTLPKT--NSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKHI 179 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 + + + ++ + E Y G + + + E ++ +L + Sbjct: 180 FEDLKKYKHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEILGGEKEMTTLTEKW 239 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLS 274 ++G+++G+QQG + QE +L+ Sbjct: 240 KMEGLQQGLQQGLQQGLIKAKQEDIIKLIK 269 >UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostridia RepID=A5D0D4_PELTS Length = 332 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 45/275 (16%), Positives = 98/275 (35%), Gaps = 21/275 (7%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVLY 65 HD +FKQ L +F+E+ P E + DL + + + H D++ Sbjct: 7 DHDRLFKQLL--ETFFAEFMELFFP-EAAQATDLEYVKFLQQELFTDITAGEKHRADIIV 63 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 +++ PG + V +E QS K+ RM Y ++ ++P+ + + Sbjct: 64 ETRLKDEPGLILVHVEPQSYIQKEFNERMFIYFSRLYEKYRRK-------ILPVAVFTYD 116 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 D F V F +++ + ++ LL K + Sbjct: 117 HIRNEP-----DSFEIGFSFLDVLRFHFYKLELKKLHWRDYIRSDNPVAAALLSKMGFRP 171 Query: 186 DLMLLLE----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + + ++ +++ + + L+ E+ + + + + + E + Sbjct: 172 EERVQVKLEFMRMLARMKLDPARTELIGGFFETYLKLNRQEEEEFYRELGKIDKKEVELI 231 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 M + + EKG +G +GR E E ++G Sbjct: 232 MQITTSWHEKGRMEGRLEGRLEGRLEGRLEGEARG 266 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 53/309 (17%), Positives = 123/309 (39%), Gaps = 18/309 (5%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 S D++FK+ DFL+ LP E + L E I + +D+L Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 65 YSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD-------KLPLV 116 Y + + G Y+++++EHQSK D+ MAFRM+ Y + +++ + KLP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 + ++FY G+A + + + + + L++++ ++ I+ ++ + Sbjct: 122 IGMVFYDGKAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGVI 181 Query: 177 LLQ-----KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL 231 LL + +L+ ++ + + L ++ G + Sbjct: 182 LLTDKPNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKERF 241 Query: 232 RDRETGG-----ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 + + ++ +A+ EK +G +G+ E E + L+ + +++ + Sbjct: 242 EELKEMEVPKMFNTLEEIAKRDREKAKLEGKAEGKVEGKLEERRELIIEILNQRFGEDFD 301 Query: 287 NLPLAEIDK 295 +I Sbjct: 302 KSLEEKIRN 310 >UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KKN3_9FIRM Length = 297 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 53/304 (17%), Positives = 99/304 (32%), Gaps = 26/304 (8%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELR-ELCDLNTLHLESGSFIEESLKGHST 61 P T D++F+ E ++ + TL + I Sbjct: 4 KPKRTYKDSLFRHIFNDKRRLASLYESLTGRKVAPRDIAITTLRGVFFNDI-------KN 56 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD-------KLP 114 D+ + + + +++EHQS + M RM+ Y R L++ +P Sbjct: 57 DISFRIGDRD-----IILMEHQSSWNPNMPLRMLWYVAKLYSRQLDSQEVVYRSRLIPIP 111 Query: 115 LVVPILFYQGEAT-PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA 173 +FY G P + D F + + ++ + + Sbjct: 112 APEFYVFYNGSQDEPDYQKLRLSDAFAHATDTLELAVDCYN-INYSTQNKLLDSCYELRC 170 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 +QK R+ + +L T I + T M +Y Q+ +E D+ Sbjct: 171 YSIFVQKV---REGIQNGLELRTAIRQAITYCKTHDLMGDY-FQKNESEVFDMVNFKWDQ 226 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 + + E +G +G G + + A LL KG+ + E NL L E+ Sbjct: 227 KRALEVAKEDGVAIGEARGEARGKLLGERNAMMKVALSLLKKGLPVGVITESTNLSLEEV 286 Query: 294 DKVI 297 K+ Sbjct: 287 RKIA 290 >UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1D5_ABIDE Length = 297 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 109/296 (36%), Gaps = 10/296 (3%) Query: 9 HDAVFKQFLMHAETARDFLEI--HLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D K L + + D + E+ + +L + + ++++ + Sbjct: 4 KDIAEKYLLSYNDVFADIVNGAVFGGEEIVKSNELADANGITQFKDDQNIHHEQVRDIAK 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + + + IE+QS PDK M R++ Y A + ++ + V+ I+ Y G+ Sbjct: 64 FWKKNEVIFSFIGIENQSAPDKDMILRIISYDGATYKSQM--GNESIYPVLTIVIYWGKY 121 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM----QHRRIAILELLQKHI 182 + ELA + + F L+DI E++ R +A QK Sbjct: 122 EWKAPVSLQERINCPRELADIIPDYRFKLIDIGRLSGKELIKFKSDFRLVAEFIARQKEY 181 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG--GES 240 + + + + + + + ++ + + +L + E + Sbjct: 182 KPGKEEIKHPEELLDLLDLLAGDKRFKELKGKVKNIRKEGRIINMCELLDEIENRGIEKG 241 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + + EKGIEKG +G + + A++ +S + + + L EI+++ Sbjct: 242 IEQGIEQGIEKGIEKGRSEGEETATLRIAKKFKDSNVSIDIIMKATGLTKEEIEEL 297 >UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTD5_DYAFD Length = 308 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 99/302 (32%), Gaps = 20/302 (6%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ +D L L V + L S ++ + Sbjct: 10 DFGFKRIFGSEAN-KDILIDFLNVLFAGERLVADLTFASNENNGRIPILRRA--IFDLCC 66 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD------KLPLVVPILFYQ 123 G G +IE Q + R + YS + + +EA K ++ ++ + Sbjct: 67 TGADGE-QFIIEVQRVRQEYFKDRCLYYSASLIRDQVEAGGTNWRYDLKPVYLIGLMDFC 125 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 E + + + +++ E + L K++ Sbjct: 126 FEDSDDGHYLHEIRLIKRSNGQVFYDKFGLTFIEMPAFQKKESDLSTELDRWLYLLKNLS 185 Query: 184 QRDL------MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR--- 234 + ++ + +++ + + + + +A Y+ + E + + R Sbjct: 186 KLNIVPPVLTNPVYQKVFRVAEVCNLNKEEKMAWDAYLKAKWDNENSMDYAKKEAMRVGH 245 Query: 235 -ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 E E + ++GI+KG + G + ++ + +L+KG + ++++ L +I Sbjct: 246 EEGHKEGHKEGHKEGMKEGIKKGRETGIELGKRQVVKNMLAKGFDMQTISDITGLTFEQI 305 Query: 294 DK 295 Sbjct: 306 RN 307 >UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bacteroidales RepID=A6LF36_PARD8 Length = 273 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 36/299 (12%), Positives = 87/299 (29%), Gaps = 29/299 (9%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D F + E ++ L L D+ + + E+L Sbjct: 1 MGKFINPFTDFGFHRIFGQ-EVHKELLIDFLNQLFFGEHDIEDITFLNPIQTPETLDDRG 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI- 119 +++ V + + G L V +E Q+ R + Y A+ + D + P+ Sbjct: 60 --IVFDVHCKDSNGNLFV-VEMQTGAQPYFHDRGLYYLARAISNQGQKGKDWKFALQPVY 116 Query: 120 -LFYQGEATPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRIAILEL 177 +F + E R + +++ + Sbjct: 117 GVFLLNYKMDVNSKFRTDVILADRETGRMFSDRIRQVYLELPYFQKEPDECENDFERWIY 176 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 L KH+ + M + + + E+ + L+ Sbjct: 177 LLKHMDTLERMPFKAKKAVFDKLLEVAD----------VANLSKEERIQYDEALKRYRDY 226 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 ++ ++G + + A+ + ++G++ + + L L +I+K+ Sbjct: 227 KNTIDYAE------------EKGILKGKESTARNMKAEGIAPLIIQKCTGLSLEDIEKL 273 >UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID=Q73P51_TREDE Length = 292 Score = 104 bits (260), Expect = 3e-21, Method: Composition-based stats. Identities = 45/307 (14%), Positives = 94/307 (30%), Gaps = 26/307 (8%) Query: 1 MDAPSTTPHDAVFKQFLMH----AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL 56 M + D+VF E L C + + L++ ++ Sbjct: 1 MSTSNRKYKDSVFVDLFSEDERAKENFLSLYNALHGTNLPMSCPVENIRLDNVMYM---- 56 Query: 57 KGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL------EADH 110 DV V G + ++ EHQS ++ M R + Y + Sbjct: 57 -NIINDVSCLV-----DGKIIILAEHQSTINENMPLRFLEYIARLYEKLQAPTDRYLKKL 110 Query: 111 DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITP-DDEIMQH 169 K+P +FY G+ + + + + +++I + + Sbjct: 111 SKIPTPEFYVFYNGKEDYPETTALKLSDAFITKPKQAPLELTVQVLNINTDKANKILTAC 170 Query: 170 RRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG 229 + + L + +R++ + I G + + + A+ Y Sbjct: 171 KPLEEYSLFVEEVRKQTQLDPENGFTNAIKICIEKGILKEYLMRKSREVINMLVAEYDYD 230 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 + + E+GI +G G + + E A+ G + +AE L Sbjct: 231 TDIAVQREESL-----RIGIEQGIRQGFSDGAYQKAIEIAKAFKQFGFDIDKIAEGTGLS 285 Query: 290 LAEIDKV 296 EI+K+ Sbjct: 286 REEIEKL 292 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 104 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 53/302 (17%), Positives = 104/302 (34%), Gaps = 17/302 (5%) Query: 10 DAVFKQFLMH-AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D FK+ + + L LP+ E + + + G +T + V Sbjct: 11 DLTFKKIFGNHPKRLISLLNALLPLSDEEQI--REIKYLPTELVPQLEGGKNT--IVDVL 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQGE- 125 G +E Q + R++ + + L V + Sbjct: 67 CTDVRGR-KFCVEMQMEWSDAFQQRVLFNASKLYVSQAKKGGKYSELQPVYSLNLINDIF 125 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ- 184 A P + + + + + + + F +++ I R + + I Sbjct: 126 AHDTPDFIHNYRIVHDKDSNKVIEGLHFTFIELPKFTPHSIADKRMMVLWLRFLTEINSN 185 Query: 185 -RDLMLLL---EQLVTLIDEGYTSGSQLVAMQNY-MLQRGHTEQADLFYGVLRDRETGG- 238 +D+ L ++ ++E SG ++ Y + + L + + G Sbjct: 186 TKDIPADLLNDPEIGKAVEELEISGFSDAELRAYDKFWDSVSVERTLIDDSYQKGKEKGK 245 Query: 239 -ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 E + + EKG+EKG +G+ E + E AQRLL+ G+ E V++ LPL I + Sbjct: 246 QEGLAEGMEKGMEKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQVSKATQLPLEIIKNLS 305 Query: 298 NL 299 N Sbjct: 306 NS 307 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 45/302 (14%), Positives = 100/302 (33%), Gaps = 13/302 (4%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D FK + FL L E + + + E+++ + Sbjct: 1 MAQYVDIMTDVGFKAVFQDKQVTIKFLNAALAGERQ----IKDITYLDKEIKPETVENRT 56 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +++ + + G ++E Q+ P R Y + R + ++PI Sbjct: 57 --IIFDLLCEDVSGA-KFILEMQNCPQHYFFNRGFYYLCRMVARQGQIGKQWQYRLLPIY 113 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDI----TITPDDEIMQHRRIAILE 176 P + + + I + + Sbjct: 114 GVYFLNFKLPEFTDFRTDVVLANERTGKVFNEIKMKQIYISFPLFSLSKEECKSSFERWI 173 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGY-TSGSQLVAMQNYMLQRGHTEQADLFYGV-LRDR 234 K++ + E+ T + + + L + + + D + + Sbjct: 174 YTLKNMNLFEQSPFKEEQETFLRLLDVANVNSLSEKERAIYEENLKNYRDWYATIDYAQT 233 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 E + M Q +KGIEKGI++GRQE + A+++ +G+ E +A+ + L + +I+ Sbjct: 234 EGIEKGMQEGMQKGMQKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIE 293 Query: 295 KV 296 ++ Sbjct: 294 RL 295 >UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultured bacterium RepID=B5U1X5_9BACT Length = 304 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 52/304 (17%), Positives = 105/304 (34%), Gaps = 25/304 (8%) Query: 3 APSTTPHDAVFKQFLM-HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + + D++F + + + FL ++ + L +TL LE + + K + Sbjct: 8 NENRSHKDSLFVDYFSKDRDWKQHFLSLYNALHGTNLQVADTL-LERVNIDQVLYKSYYN 66 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD------KLPL 115 D+ V G ++IEHQS + M R++ Y +++ L Sbjct: 67 DIAVLV-----NGQFILMIEHQSTINPNMPLRLLEYVARIYGNLVDSKAKFSRHLVPLAR 121 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDE---IMQHRRI 172 +FY G+ P S + + + + TI D + + + Sbjct: 122 PEFYVFYTGDQKLPPESYLHLSDSFPNQPPKADLTLELKVKVCTIKSDHPSPVVHRCPDL 181 Query: 173 AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 K + + E L I E ++ + A+ Y Sbjct: 182 EQYAQFLKLVEEAKAAGQAEPLTWAIQEAVRRNILRDYLERRGGETLSILMAEYDYATDF 241 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 + E G+ G+++G + E A+ LLS+G++ + VA +LPL Sbjct: 242 AVQKEE---------AYEDGLFAGLERGAYQNKLETARSLLSEGLAPQMVARCTSLPLET 292 Query: 293 IDKV 296 + ++ Sbjct: 293 VQQL 296 >UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacteroidales RepID=A6LFA9_PARD8 Length = 305 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 102/309 (33%), Gaps = 17/309 (5%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D FK E +D L L L + L + + + E+ +G Sbjct: 1 MGKFINPFTDFGFKHIFG-REMDKDILIEFLNDLLEGEYTIMDLRIMNNERLPETEQGRK 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV---- 116 V++ + + + G ++IE Q++ R + Y ++ L Sbjct: 60 --VIFDIHCETDKGE-RIIIEMQNREQPHFKDRALYYLSHSVVEQGIKGTWDYELAAVYG 116 Query: 117 VPILFYQGEATPYPLSMCWFDMF-------YSPELARRVYNSPFPLVDITITPDDEIMQH 169 V L + + P F +++ +E Sbjct: 117 VFFLNFTLDEENGPDKNGKEGKFRRDIILADRENGQVFNPKFRQIYIELPRFNKEEEECE 176 Query: 170 RRIAILELLQKHIRQRDLMLLLEQLVTLID-EGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + KH+ D M + E S + L Q + D + Sbjct: 177 TDFERWIYVLKHMDTLDRMPFKARKAIFERLERIGSMANLTPKQRAQYEAEWKMYNDYYN 236 Query: 229 GVLRDRETG-GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 + E G + M + +KG+++G+Q+G Q+ + A+ + ++G++ + + Sbjct: 237 TLDFAVEKGMKKGMEEGMEKGLQKGLQEGLQEGLQKGKESTARNMKAEGITPLIIQKCTG 296 Query: 288 LPLAEIDKV 296 L L EI+++ Sbjct: 297 LSLEEIERL 305 >UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfitobacterium hafniense RepID=Q24MW9_DESHY Length = 295 Score = 101 bits (250), Expect = 4e-20, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 96/303 (31%), Gaps = 21/303 (6%) Query: 1 MDAPSTTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK 57 M +D +FK + + FL L +L + L E LK Sbjct: 3 MAERLNRINDYLFKYIFGRQENKDILLSFLNAVLSP--AGEDELTDITLSDRELDPEHLK 60 Query: 58 GHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVV 117 + + + IE Q +K + R + Y L++ L Sbjct: 61 DKMSRLDILGVANDGS---LINIEVQIASEKNIDKRTLYYWAKIYQSQLQSGMLYKDLAR 117 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVY---NSPFPLVDITITPDDEIMQHRRIAI 174 + + P + + MF E + + +++ R+ Sbjct: 118 TVTVNVLNFSFLPDAQRYHSMFSLYEAHSGLRLNRDLEIHFLELEKWKALSTKPRTRLDK 177 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 + + ++L + + + + ++ L+ + Sbjct: 178 WLMYLSNTDPKELEEIAMSEPAIGKALTVEE----------IFLKNDKERYLYEMREKGI 227 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 +M E+G+ +GI QG + E A +L KG+S +AE+ +LP+ +I+ Sbjct: 228 RDHLSAMDNAKTEGIEQGLAQGIAQGIERGKTEIALSMLKKGLSLNMIAEITDLPIEQIE 287 Query: 295 KVI 297 ++ Sbjct: 288 EIR 290 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 99.2 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 57/265 (21%), Positives = 103/265 (38%), Gaps = 19/265 (7%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 +D + + E A D LP L + DL+ L L SG+++ + L+ + TDVLYSV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEA 126 + G +++++++HQS D R+ R ++ R+L D LP+++PI+F+ EA Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHH-EA 142 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITP--------DDEIMQHRRIAILELL 178 T + ++ R ++ D+ L LL Sbjct: 143 TGWSDAVGLNGSLALGADVRTALSANRRDFRRLRYLLLVLCFQFDEASRAQNLNEALGLL 202 Query: 179 QK----HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYM--LQRGHTEQADLFYGVLR 232 + +RDL+ L+ +I E + + + + D L Sbjct: 203 MRTFGVARPKRDLVASLKGWEDVIREVVATQRGREMLATVVQFILENSETDPDELKSFLE 262 Query: 233 --DRETGGESMMTLAQWFEEKGIEK 255 E + MT A + E+ Sbjct: 263 FTAGEPARTAFMTGADRLTQGVREE 287 >UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C351D8 Length = 313 Score = 99.2 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 44/307 (14%), Positives = 99/307 (32%), Gaps = 35/307 (11%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + D +F+ + D + DL L ++ D Sbjct: 5 KLNRNYKDRLFRLAFQEKKDLLDLYNAVSGRQYTNPDDLIITTLADAIYLGM-----KND 59 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------KLP 114 + + V + + EHQS + M R + Y +++ + +LP Sbjct: 60 ISFLV------SDVLNLYEHQSSFNPNMPVRGLNYFADTYREYIDRNGFDIYGEKLIRLP 113 Query: 115 LVVPILFYQG-EATPYPLSMCWFDMF-YSPELARRVYNSPFPLVDITITPDDEIMQH--R 170 + I+FY G + P + + D F + +++I + E+M R Sbjct: 114 MPQYIVFYNGTKEEPDRIELRLSDAFLCQNPEEKGCLECRATMININYGHNKELMDRCRR 173 Query: 171 RIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGV 230 + + + L++ V L + +A++ + Sbjct: 174 LKDYAVFVSRIRNNEKRGMALDEAVKQAVHSCIEEGILADIL-------KKNRAEVCNLI 226 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 L + + + + E ++ G ++GR + + KG++ +A+M L Sbjct: 227 LYEYDEQRQLAI-----AREGAMKAGREEGRAAEQVTIIRNMAGKGLNPSAIADMLGLEE 281 Query: 291 AEIDKVI 297 + KV+ Sbjct: 282 GYVKKVL 288 >UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWJ8_9FIRM Length = 292 Score = 99.2 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 101/303 (33%), Gaps = 25/303 (8%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M T D++F + +D D++ ++ + Sbjct: 1 MARVKRTYKDSLFCDIFRRKDYLQDVYRGLFGR------DVSLQEIQLMTLQGTFFNDEK 54 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK------LP 114 DV + + V++EHQS ++ M RM Y + + D LP Sbjct: 55 NDVSFLAGKRQ-----IVLMEHQSTLNENMPLRMFWYMAKLYRKQVPKDAPYRTRRLRLP 109 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI 174 +FY G + + E +I +++ R Sbjct: 110 APCFYVFYNGLDP--APDEWEMRLSEAFEGECSSLELCVKAYNINEMSGSRLLEKSR--- 164 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 L+ + + ++E + Q+ + + + + + ++ + Sbjct: 165 --ALKGYSVFVAQIRRKTAAGVCLEEAVKQAIRYCIEQDLLAEYFLEREMEEVFDMVSFK 222 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL-SKGMSREDVAEMANLPLAEI 293 + Q +E G+EKG+++G ++ E +L K S +D++E++ PL +I Sbjct: 223 WDPELAKRVQLQEAQEIGMEKGMEKGMEKGVTEIVLNMLKKKKWSLQDISEVSQWPLDKI 282 Query: 294 DKV 296 + + Sbjct: 283 ESL 285 >UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C353CE Length = 319 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 47/307 (15%), Positives = 100/307 (32%), Gaps = 31/307 (10%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M + D +F+ E E DL L+ ++ Sbjct: 20 MVKVNKKYKDRLFRMVFNRKEELLSLYNAVSHSEYTNPDDLEINTLDDVIYM-----KMK 74 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--------HDK 112 D+ + + + + EHQS + M R Y + ++++ + Sbjct: 75 NDLAFLI------DDVLNLWEHQSTWNPNMPVRGTFYIVEEYRKYIDQNGLNLYGSSRIT 128 Query: 113 LPLVVPILFYQG-EATPYPLSMCWFDMFYSP-ELARRVYNSPFPLVDITITPDDEIMQH- 169 LP+ +FY G P + + D F +++I ++E+M+ Sbjct: 129 LPVPQFYVFYNGLREEPDYIELKLSDAFSRVHSEVEPCMEFKAVMLNINRGHNEELMRQC 188 Query: 170 -RRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 E + + + + LE+ + + L + +A++F Sbjct: 189 TTLREYAEFVARIRDETEDGTALEEAAMNVMDSCIRDGILAEFLSVH-------RAEVFE 241 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 +L + + E +E G +G E ++E A L+ KG + ED A + Sbjct: 242 VLLTEYDEQRHIASEKEISRREGHME-GRTEGILEKAKEVAVNLIKKGFTVEDAASICGE 300 Query: 289 PLAEIDK 295 + + + Sbjct: 301 DICRVKE 307 >UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillus coagulans 36D1 RepID=C1PBU4_BACCO Length = 329 Score = 98.4 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 49/330 (14%), Positives = 112/330 (33%), Gaps = 51/330 (15%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVL 64 HD +FK+ + + ++F++ P +L D + S + + D+L Sbjct: 12 HVHDRLFKELIQN--FFQEFMDAFFP-DLSADLDYRRVRFLSQEQFTDFPGGEQKRVDIL 68 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 +++G + + +E QS +K RM RY + RH + V+PI + Sbjct: 69 AETKVKGKDTVILIHVEPQSYYEKPFPERMFRYYMMISLRHRK-------PVLPIAVFSY 121 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 E F + + + + ++ LL K Sbjct: 122 EEKTETPDTYTFAFHNIE-----ILRFHYLSIHLMKQNWRNYIRSNNPVAAALLSKMGYT 176 Query: 185 RDLMLLLE----QLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 + ++ +++ ++ L +Y L+ E+A++ + E Sbjct: 177 ETERVQVKLEFLRMLARMELDPAKMRLLHGFFDYYLKLNEKEEAEVMENIKMLDPDEAEQ 236 Query: 241 MMTLAQWFEEKGI----------------EKGIQQGRQEVSQE--------------FAQ 270 ++ L + ++G EKG ++G + ++ A Sbjct: 237 VLKLPNSYFDRGYKKGKEEGREEGIEIGVEKGREEGIEIGVEKGREEERKEMLQTIPIAI 296 Query: 271 RLLSKGMSREDVAEMANLPLAEIDKVINLI 300 ++L +G + + E L E++K+ + Sbjct: 297 KMLQEGRELQLIVEKTGLSQREVEKIKQQL 326 >UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BFV9_9PORP Length = 293 Score = 98.4 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 37/304 (12%), Positives = 96/304 (31%), Gaps = 19/304 (6%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D FK + +++ L L + L + ES + Sbjct: 1 MATFINPFVDRGFKHLFGQED-SKELLVDLLNGLFEGERVITELSFLNVEMPAESTDSRA 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVV 117 ++ ++ + G + + +E Q+ P R + Y + +D L V Sbjct: 60 A--VFDLKCKDKEGRIFI-VEVQNAPQTYFYERGLYYLCRIISDQDRRGNDWKFELYPVY 116 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 I ++ + + +++ +E + Sbjct: 117 GIFLLNFKSGKTDKVRTDIVLADRETGKQMSDTMRQIYLEMPFFNKEEAECETSLDYWLY 176 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET- 236 K++ + + + Q + + +L + N + + + + + RD + Sbjct: 177 TLKYMEKLETLPFKGQ-----KQLFEKLERLAKIVN--MNKKERMEYEESLKIYRDNQGV 229 Query: 237 ----GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 + M + ++GIEKG+++G ++ A ++ +G+ + + L Sbjct: 230 LDYAIEKGYMEGVEKGLKEGIEKGLEKGMEKGIYLVAAKMKMQGIDFATITSVTGLNAET 289 Query: 293 IDKV 296 I + Sbjct: 290 IATL 293 >UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubacterium biforme DSM 3989 RepID=B7CC32_9FIRM Length = 301 Score = 98.4 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 55/300 (18%), Positives = 105/300 (35%), Gaps = 11/300 (3%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D K+FL + DF + R L N + L+S DV+ Sbjct: 2 NKIKDKTMKEFLENNAYFVDFFNAYFFDGERVLKPENCMELDSEMNDSNMDLEKHVDVIR 61 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR-----HLEADHDKLPLVVPIL 120 + Y +IE+QS D M R Y A R +KLP+V ++ Sbjct: 62 --KYNDGNLYSAFIIENQSYVDASMVVRAAAYEFVAYDRMLKKLKKNKAKEKLPMVHILV 119 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 FY GE + + ++ L++IT + L + + Sbjct: 120 FYTGEKLWNAANKLSQLVEVDERFESYFHDYQMNLIEITGNT-SYNFNEEDVYNLFYICR 178 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG--- 237 I + + L+ + + ++ E+ ++ R Sbjct: 179 SIYDQSIYEEKSNGFGLVKSSVLKVVKTLTDVEWLDLEELEEKEEIEMCEAEKRWLEVKS 238 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 E + E+GIE+GI+QG ++ E ++++ KG + +A + ++ I+K++ Sbjct: 239 KEWEAKGIKKGIEQGIEQGIEQGSEKKELEMYRKMMDKGFGIKAIASIFSVSEESIEKLL 298 >UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=cellular organisms RepID=A5CBY6_ORITB Length = 324 Score = 98.0 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 49/317 (15%), Positives = 101/317 (31%), Gaps = 25/317 (7%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 +D FK+ + +D L L L + +E I ++ + + Sbjct: 9 PKNDVAFKKIFGSEKN-KDILIHFLNDILLFEGNREITEVEFLGTILDADIASKKESIVD 67 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD----KLPLVVPILFY 122 V + G ++ IE Q P + R Y+ A R + L V+ I Sbjct: 68 VLCKDKNGAQYI-IEMQVDPTQGFEKRAQYYAAKAYGRQPNRGKEGKYSDLKEVIFIAIA 126 Query: 123 QGEATPYPLSMC-WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI-AILELLQK 180 + P + + + F +++ + + + I K Sbjct: 127 DYKLFPNKEDYISRHVILDKKTYEHDLKDFSFTFIELPKFKKNRVEELSDITEKWCYFFK 186 Query: 181 HIRQRDLMLLLEQLVT--LIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 H ++ L + + +I Y + Q ++ ++ + + D + Sbjct: 187 HAKETTLDGYHKIIGEDLIIKRAYEALDQFNWSEDELITYEQELKRIWDNKAVEDYKLER 246 Query: 239 ---ESMMTLAQWFEEKGIEKGIQQGRQEVSQEF------------AQRLLSKGMSREDVA 283 E + + G KG +G+ E E A +LL +S E +A Sbjct: 247 AKAEGIKLGEAKGIKLGEAKGKAEGKAEGKAEGKAEGKAEAKKDFAIKLLKSELSVETIA 306 Query: 284 EMANLPLAEIDKVINLI 300 E +L + E+ + N + Sbjct: 307 EYTDLSIQEVLNLKNSV 323 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 97.6 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 57/300 (19%), Positives = 111/300 (37%), Gaps = 26/300 (8%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D + K+ E F+ L +++ + L L F E+ L S DV ++ Sbjct: 13 DIMAKKIFSLPEVTVAFIRDILDLDVVDAQILEGTQLHKKDFDEDELFSTSVDV--RAKL 70 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE---------ADHDKLPLVVPIL 120 V+IE Q + R Y + +++ ++++ V I Sbjct: 71 NDGTE---VIIEIQVRKQHYFLNRFHYYLANQLVENVQQLRQQGQTHKMYEQMEPVYGIA 127 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL--L 178 + P S + + +Y+ D + +IA LEL Sbjct: 128 ILEKTLLPDEESPINTYWMANSRTGKPLYSF---------YKDGKQQNLLQIAFLELDKY 178 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT-EQADLFYGVLRDRETG 237 K RD + + + + + T E+ + +R +E Sbjct: 179 NKDKHIRDEGRQWLEFFGNLPFSKAPSRAVTHADSLLDSSSWTQEEKAMIDERIRIQENY 238 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +M T E+G+E+G+++GR E E +++L+KG+S E V+++ L L E+D ++ Sbjct: 239 DMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSLEELDGLL 298 >UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW Length = 286 Score = 97.2 bits (240), Expect = 6e-19, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 91/294 (30%), Gaps = 17/294 (5%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESL--KGHSTDVLY 65 HD +FK+ L + + E D L S + + + D+L Sbjct: 6 DHDRLFKELLTTFFEEF---ILLFFPHVHEHIDFRHLSFLSEELFTDVTAGEKYRVDLLI 62 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 +++G G + + +E+QS RM Y ++ ++PI + + Sbjct: 63 QTKLKGEAGIIIIHVENQSYMQSSFPERMFIYFSRLFEKYRTN-------ILPIAIFSYD 115 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 S + V F V++ ++ LL K Sbjct: 116 FIRDEPSSFTLQFPFL-----HVLQFQFLAVELRKQNWRHYIRSENPIATALLSKMGYNE 170 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + + L++ + + + + + G Sbjct: 171 NERVELKKQFFRMLIRQNIDEAKRRLLIGFFETYVKLTEQEEEQFQNEVKKMGGKEGEQV 230 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 +KG G +E +E Q+++ KGMS +A + + E+ KV+ + Sbjct: 231 MELIISYEQKGKIAGAKEKEREMIQKMVEKGMSITQIAHLLDRSEEEVRKVVEM 284 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 96.9 bits (239), Expect = 8e-19, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 82/223 (36%), Gaps = 9/223 (4%) Query: 44 LHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMH 103 + L + S+I + +D++Y GN + +V++E QS D +M R++ Y I Sbjct: 1 MILVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWR 60 Query: 104 --------RHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPL 155 + + +LP +VPI+ Y G+ + S + N + Sbjct: 61 DILRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEF 120 Query: 156 VDITITPDDEIMQHRRI-AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 +DI +E+ + I + + LL ++I + + L+ ++ + + Sbjct: 121 IDINSYEKEELYNKQNISSAIFLLDQNINRIEFYNRLKDIIIGFNNLSIEEKMHLKHWLV 180 Query: 215 MLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGI 257 + D + + +M + EK E G Sbjct: 181 NINTEENNFKDNIEKIFNADKQEVLNMTSNISKGLEKLKEDGK 223 >UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia RepID=A6MYW5_9RICK Length = 296 Score = 96.5 bits (238), Expect = 9e-19, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 102/294 (34%), Gaps = 11/294 (3%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ E +D L + + + + + L + + ++ + +L ++ Sbjct: 9 DLAFKKIFGVEEN-KDLLISLINSIVSKEDQIVDVTLLN-PYNPQNFRNDKLSIL-DIKA 65 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 G G IE Q + R + Y L+A D L I + T Sbjct: 66 LGESGK-RFNIEIQITDEADYDKRALYYWAKLYTEALQASQDYSSLNKAIGIHILNFTSI 124 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 P + + ++F+ E + +++ ++ + + ++L+K D+ Sbjct: 125 PETNKYHNIFHITEKDSGLLYFK--DLELHTIELNKFSNNPNEELADILKKVGNSLDIWS 182 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH-----TEQADLFYGVLRDRETGGESMMTL 244 L++ A L +E+ D + L+ ++ Sbjct: 183 AFLTRHDLLNSNNLPKKLDNASLKKALTVLDVMNFTSEERDAYEDHLKWLRIEANTLKKY 242 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 +G +GIQ G+ E A+ L G++ ++E L +I+++ + Sbjct: 243 EAQARVRGKVEGIQIGKTEEKIAIARNLKRSGVAITIISESTGLTKKQIEELDD 296 >UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=C3R531_9BACE Length = 325 Score = 96.5 bits (238), Expect = 9e-19, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 88/318 (27%), Gaps = 31/318 (9%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D FK + ++ L L + + + + ++ Sbjct: 12 NPYTDFAFKLLFGT-DLNKEILIGFLNALFDGKQVIEDVTYLNTEHLGSKETDRRA--VF 68 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVVPILFY 122 V + G ++IE Q + R + Y+ + L V I Sbjct: 69 DVYCENEKGE-KILIEMQRGEQQFFKDRSIYYATYPIREQAIKGEIWDYELKAVYVIGIL 127 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLV--DITITPDDEIMQHRRIAILELLQK 180 S ++ V+ V ++ E + K Sbjct: 128 NFALDDVSSSSFRHEVKLMDTTTHEVFFDKLTFVYLEMPKFHKTEQELDTLFDKWMFVLK 187 Query: 181 HIR---------QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL 231 ++ Q + L + + + + + G+ Sbjct: 188 NLARLMERPTALQERVFNRLFEAAEIAQFSKENLYAYEESLKVYRDWNNVIDTAIQKGIA 247 Query: 232 RDRETG-------------GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS 278 R E G E ++ + KG+EKGI +G +Q A L + G+S Sbjct: 248 RGMEEGLVKGMEEGIAKGMEEGIVKGMEEGIAKGMEKGIAEGEWMKAQTIAGNLKNAGLS 307 Query: 279 REDVAEMANLPLAEIDKV 296 ++A++ L EI+ + Sbjct: 308 IAEIAKVTGLSEDEINSL 325 >UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXX0_9FIRM Length = 301 Score = 96.5 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 49/308 (15%), Positives = 102/308 (33%), Gaps = 35/308 (11%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M T D++F+ +AE + E L D+ ++ F Sbjct: 1 MRNTKRTYKDSLFRDIFNNAERLPEIYEALL-DHKTTPDDITLATIDETLFTG-----VK 54 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK------LP 114 D+ + V +++EHQS + M R++ Y + R+++ D LP Sbjct: 55 NDIGFIV-----GNQHVLLVEHQSTINANMPLRLLMYLVEIYRRYVDKDAIYKKELIPLP 109 Query: 115 LVVPILFYQGE-ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA 173 +FY G P ++ D F + +I P+ I++ Sbjct: 110 APKFYVFYNGLAEMPDIWALHLSDAF---GGHDSDLELEVKVFNINDKPNRPILEKCH-- 164 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 L + + + + + ++ + Q +Y+ + +QA + +L Sbjct: 165 --ALKSYSVFVAKVRECI-KNGSSLEIAVGNAVQYCVAHDYLGEYFRQKQAKEVFDMLNF 221 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQR---------LLSKGMSREDVAE 284 ++ A+ EKG+ G Q+G + + + S E + Sbjct: 222 VWNQERALEVRAEEAMEKGLRLGRQEGLSQGLSQGVLETTTASIRNVMKSMDFPIEKAMD 281 Query: 285 MANLPLAE 292 + +P E Sbjct: 282 ILQIPEEE 289 >UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermoanaerobacterales RepID=B0K813_THEP3 Length = 267 Score = 96.1 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 115/296 (38%), Gaps = 33/296 (11%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + +D K + D + + D + + +D+++ Sbjct: 2 SQEYDITAKNIFSN---LADDIASYFLGLKFTKLDELNIEFTTIE-------SRESDMVF 51 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + + + IE Q+ D KM +RM+RY+ M +H L ++ Y + Sbjct: 52 KCTTENRD--IALHIEFQTYNDSKMPYRMLRYATEIMEKHNL-------LPYQVVVYCSK 102 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL---QKHI 182 + + + + N + ++D+ ++I++ + + L K Sbjct: 103 NELKMENNLNYHL-----GEENLLNFRYRIIDVGKIKFEDIVKTKYYDLYTFLPVADKDK 157 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 RQ++ L + +I + ++ ++Y++ ++ + ++ M Sbjct: 158 RQKEKEAYLRKCAEVIRDMPVDKAK----KSYIVTTAEILAGIIYDEEVIEKIFSEVIGM 213 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 ++ + + + +++G++E S E A+ LL +GM +A++ L + EI K++N Sbjct: 214 SILEESK--VYKNILEKGKKEKSIEIARELLKEGMDINKIAQITKLSVEEIKKLLN 267 >UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacteria RepID=B4SC57_PELPB Length = 299 Score = 94.9 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 44/299 (14%), Positives = 90/299 (30%), Gaps = 15/299 (5%) Query: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 D FK+ E +D L + + E + + L++ + + G + + Sbjct: 3 KINPRVDFAFKKLFGSEEN-KDLLISLINAIVSEEDQVVEIELKNPYNLADYRAGKISIL 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 + + +E Q D R + Y + L L I Sbjct: 62 DIKAKAENGR---WFNVEMQISEDYNFDKRAIFYWAKLVTEQLSEGMMYKELKKTISINI 118 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYN-----SPFPLVDITITPDDEIMQHRRIAILELL 178 + P + + A + +++ + Sbjct: 119 LDYNFVPDTTEVHSCYKIINTATGKDDRLHDVFELHYIELKKFNKLHHEISSTLDRWTTF 178 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG- 237 Q D ++L + + +A + M + ++ L D E+ Sbjct: 179 LTTAHQLDREHTPKELA-----LDKNIVKAIAAIDRMFNEEERQVYEVRKQSLVDAESKI 233 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 ++ + E G+EKG +G S+ A LL KG++ +AE L + EI + Sbjct: 234 ASALEKGMEKGMEMGLEKGRDEGINAASKTIALNLLGKGIAIATIAEATGLSVLEITSL 292 >UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAN2_9SPHI Length = 317 Score = 94.5 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 40/309 (12%), Positives = 88/309 (28%), Gaps = 26/309 (8%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ + +D L L + + L +S + ++ + Sbjct: 13 DFAFKKIFG-GDPNKDLLIDLLNALFKGRKIIIDLTYNKNEHPGDSEHEGAA--VFDLLC 69 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD-----KLPLVVPILFYQG 124 G G +IE Q + R + Y+ + + +L V I + Sbjct: 70 TGQNGE-QFIIEIQRAKQENFKERALFYTSRLISSQAPKGNRASWGYRLTEVYLIALMED 128 Query: 125 --EATPYPLSMCWFDMFYSPELARRVYN-SPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + + Y + +++ + L K+ Sbjct: 129 TTLNDESEHEFLHDICLCKRDTGKVFYEKLGYLYIELRKFVKSSTELQTDLDRWLFLLKN 188 Query: 182 IRQRD---------LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 + D + L + + + Y + + G+ + Sbjct: 189 LSSMDKIPVYLRKPIFEKLFSIAEYSNLSKEEKMSYDSRMKYKWDNENVREYARKEGLEK 248 Query: 233 DRETGGESMM-----TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 E G E + +G +G +GR+E + + A + S + + +A Sbjct: 249 GLEEGREKGRLEGKLEGKLEGKLEGKLEGKLEGRKEAAIKIAGEMKSANLPLDQIARFTK 308 Query: 288 LPLAEIDKV 296 L L EI+ + Sbjct: 309 LSLEEIEGI 317 >UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B8HL58_CYAP4 Length = 334 Score = 94.2 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 49/295 (16%), Positives = 98/295 (33%), Gaps = 24/295 (8%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI----EESLKG 58 + D+ +K+ L + ++ + P ++ F + Sbjct: 2 TQPRSDKDSAWKEIL--RQYFQEAIVFFFPQTAEQVDWTRPYEFLDKEFQQIAPDAETGK 59 Query: 59 HSTDVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVV 117 D L V ++ +L + +E Q+ + + A RM Y++ R + Sbjct: 60 RYADQLVKVWLKDGAELWLLIHVEVQAARESEFAQRMFTYNLRIFDRFNH-------PAI 112 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 + E+ + FD F L+ R I+ ++ I ++ Sbjct: 113 SLAILCDESVRWRPESFSFD-FPDTSLSFRFGRVKLLDYRERISELEQSPNPFSIVVMAH 171 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYT------SGSQLVAMQNYMLQRGHTEQADLFYGVL 231 L+ ++D +TLI Y L ++++ + + + + Sbjct: 172 LRAQATRKDDQQRKFWKLTLIRRLYEGGYGRQEVINLFRFIDWVMILPEGLKEEFWQEL- 230 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 + E M EE G E+G++QGRQE QE Q +G E A + Sbjct: 231 --KIYEEERRMPFITSVEEIGFERGLEQGRQEGRQEGRQEGRQEGRQEEARALIL 283 >UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostridium RepID=C0CTJ7_9CLOT Length = 327 Score = 94.2 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 46/319 (14%), Positives = 99/319 (31%), Gaps = 39/319 (12%) Query: 10 DAVFKQFLMHAETARDFLE--IHLPVELRELCDLNTLHLESGSFIEESLK----GHSTDV 63 D V ++ E D + ++ D+ L + D Sbjct: 5 DMVLNRYFEDGERYADLINGYAFNGDQVVRKEDVQELDPRETGVAGRLGRRPGVQKYRDS 64 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-------------- 109 + V + ++ + +EHQ + M R M A R L Sbjct: 65 IRRVVL--GARFVLIGLEHQDQVHYAMPVRAMLQDAAEYDRQLRRIRRVNRRVGGLTGAE 122 Query: 110 -------HDKLPLVVPILFYQGEATPYPLSMCWFDMF---YSPELARRVYNSPFPLVDIT 159 D++ V+ ++ Y G+ M Y + R V N ++++ Sbjct: 123 FLGGFTRKDRVCPVITLVLYYGKKPWDGAMDLHGLMDCAGYPEPMLRLVNNYRLHVLEVR 182 Query: 160 ITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRG 219 + + + +Q+ + E+ + + + + Sbjct: 183 RFVNIRRFRTDLYQVFGFIQRSGDKEAERRFTEENRVYFEGM---DEEAFDVITAITGSR 239 Query: 220 HTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQE----FAQRLLSK 275 E+ Y R E++ + + +G +G +G+ E + E A+ + + Sbjct: 240 ELERVKEQYREEGGRINMCEAIRGMIEDGRIEGRLEGKIEGKYEGALEKTRTVARNMYLR 299 Query: 276 GMSREDVAEMANLPLAEID 294 GMS ED A + + A+I+ Sbjct: 300 GMSAEDAAAICEMDTAQIE 318 >UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CSV6_9CLOT Length = 317 Score = 93.4 bits (230), Expect = 9e-18, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 101/311 (32%), Gaps = 36/311 (11%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M + D +F+ D + L L+ ++ Sbjct: 1 MTKVNKKYKDRLFRLVFGDRRRLLDLYNALNGSHYEDPDALEITTLDDAVYLSM-----K 55 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--------DK 112 D+ + V + + EHQS + M R Y +++ K Sbjct: 56 NDLSFLV------NGVLNLYEHQSTYNPNMPVRGFFYLADVYRKYVVEHKLNLYGSRLAK 109 Query: 113 LPLVVPILFYQG-EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRR 171 LP ++FY G + P + D F A +++I + + +M+ R Sbjct: 110 LPSPKYLVFYNGRKEEPDRKILRLSDAFQGGRNAEPCLELCAVMLNINLGRNQVLMERCR 169 Query: 172 IAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL 231 + E Q R R ++ L + +D + ++N++ +A++ +L Sbjct: 170 -TLKEYAQFVDRVRRMIAETGALESAVDCAVEDCIRDGILENFLSSH----RAEVLDVIL 224 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR--EDVAEMANLP 289 D M E+ ++GR E E LS+G+S E + ++ Sbjct: 225 TDYNEQEYIAME---------REEAWEEGRAEGLTEGLSEGLSEGLSVSREAILDLLGEF 275 Query: 290 LAEIDKVINLI 300 +++ I Sbjct: 276 GEVPEELRARI 286 >UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolbachia RepID=Q5GSR2_WOLTR Length = 317 Score = 92.6 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 106/304 (34%), Gaps = 21/304 (6%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D +FK+ + + FL L + I+ + + ++ Sbjct: 12 DLIFKKIFGTEKNKKIIICFLNNILGFAEINAIQEVE---FLSAIIDPEIASNKQSIIVD 68 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVVPILFYQ 123 V + G V IE Q +K R+ Y++ A R L+ + L V I Sbjct: 69 VFCKDATGTRRV-IEVQLAINKGFEKRVQPYAVKAYSRQLDKSGNYIVDLKKVFFIAISN 127 Query: 124 GEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRRIA----ILELL 178 + + + + + F +++ ++ Q I Sbjct: 128 CNLLSEKVDYISTHNIHDTKTNGHYLKDFQFIFIELPKFSKSKVEQLINIVEHWCFFFKN 187 Query: 179 QKHIRQRDLMLLLEQLVTL------IDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 + + DL + ++++ + +DE + + ++A + ++ + + L Sbjct: 188 AEDTTETDLKRVAKKVLIIKLAYDGLDEFHWNEEDIIAYEERVMNLQKEKAILEYRLDLA 247 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 + E + + + G EKG ++G ++ A+ L GMS +AE+ L + + Sbjct: 248 TEKGREEGVKISKERGIKVGAEKGREEGVKKAKIAVAKNSLKAGMSIGAIAEIIGLSVGK 307 Query: 293 IDKV 296 I K+ Sbjct: 308 IKKL 311 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 92.6 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 58/281 (20%), Positives = 119/281 (42%), Gaps = 18/281 (6%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + R+ + LP ++ + LN +E + + K TD+L V+ Y+ + + Sbjct: 16 KIFRENMHNTLPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYV-LHV 74 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E+Q+ +MAFRM YSI +H KLP+ +++ S+ D + Sbjct: 75 EYQTDNYPEMAFRMAEYSIMLQRKH------KLPVKQFVIYIGPAKANMATSITTKDFRF 128 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTL--- 197 L V+ + ++++ + +AIL L + L +++++ T Sbjct: 129 RYNLT------ELSAVNYKLFLKSDLVEEKMLAILSNLASESTESVLAQVVQEIETHTST 182 Query: 198 IDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET--GGESMMTLAQWFEEKGIEK 255 +++G + +Q L + + L + ++ + + E KG K Sbjct: 183 LEQGRYFRQLRILLQLRNLNKKAIKDMALVGKIFKEEKDILYRRGEIKGEIKGEIKGEIK 242 Query: 256 GIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 GI++GR E + E A L +G++ E +A++ L + EI + Sbjct: 243 GIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEIQAL 283 >UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E7F Length = 324 Score = 92.6 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 90/290 (31%), Gaps = 22/290 (7%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 P DA+F+ E + L + LE+ ++ D Sbjct: 21 PPRRDYKDALFRMIFNDKEALLSLYNAVGNTSYTDASQLQIVTLENAVYM-----NIKND 75 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL------EADHDKLPLV 116 + + + M+ + EHQS + M R + Y L + KLP Sbjct: 76 LAFLLNME------LNLYEHQSTWNPNMPLRDLFYVSREYEMLLANQSIYSSSLLKLPAP 129 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI--AI 174 ++F+ G + Y ++ +++I +DE+M R+ Sbjct: 130 RFVVFFNGSYDMGEQCVLKLSDAYEKKVEDPDLELKVTVLNINAGWNDELMNTCRLLKEY 189 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY---MLQRGHTEQADLFYGVL 231 + + M L E + +DE G + Y + E + L Sbjct: 190 SLYVARVRAYAKEMELAEAVSRAVDECIKEGILRDFLMKYRAEAISVSIFEYDEEREKEL 249 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 + E+G+ +GI++G + + G SRED Sbjct: 250 LRKTEYEFGRQEGLSQGREEGLSQGIKEGMAQGVSAMIRHCRKAGASRED 299 >UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostridium kluyveri RepID=B9E303_CLOK1 Length = 304 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 94/241 (39%), Gaps = 20/241 (8%) Query: 78 VVIEHQSKPDKKMAFRMMRYSIAAMHRHLE--------ADHDKLPLVVPILFYQGEATPY 129 +E QS+ D +M R++ Y + L+ KLP ++P++ Y G+ T Sbjct: 28 CFLEFQSRVDYRMPMRLLFYMVEIWREILKNTSKNDRSKKDFKLPSIIPMVLYNGKNTWT 87 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI-LELLQKHIRQRDLM 188 + S V + + L DI ++++ + + LL K I + DL+ Sbjct: 88 ACKNFKDVLSGSKLFGENVIDFRYMLFDIYRYNEEQLEDMANMVSTVFLLDKEISKEDLV 147 Query: 189 LLLEQLVTLIDEGYTSGSQLV--AMQNYMLQRGHTEQADLFYGVLRDRETGG-------- 238 L ++ + ++ +++ + R +E +L G Sbjct: 148 KRLRLTAYVLKKITPEQFDILKAWLKSIIKPRLDSESKIKIEEILEKSSQGEVDSMVSNL 207 Query: 239 -ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +++ + + E G+E+G ++GR+E +E + +G + + + + K+ Sbjct: 208 GKTIDNIIREGRETGLEEGRREGRKEGRKEGRKEGRKEGRKEGKSELITKMLVKKFTKLP 267 Query: 298 N 298 + Sbjct: 268 D 268 >UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Rickettsia RepID=A8GY36_RICB8 Length = 279 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 43/302 (14%), Positives = 89/302 (29%), Gaps = 33/302 (10%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLP-VELRELCDLNTLHLESGSFIEESLKGH 59 M +D FK+ FL + E + DL + E + ++ + Sbjct: 1 MQRYLDPTNDVAFKKLFTDKARLISFLNNIMRLPEELRIIDLKYISNEQVPDLGQNKRS- 59 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVV 117 + V++ N G +++ +E Q+ R+ Y A L+ + L VV Sbjct: 60 ----IVDVKVTDNSGNIYI-VEMQNGYADAFLARVQFYGCVAFSSQLKRGKEYADLAPVV 114 Query: 118 PILFYQGEA--TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 ++ G + + ++ + V++ + Sbjct: 115 MVIITSGFQALPEEKECISYHQTINVGNGKNQLKCLSYVFVELDKFTKEANELETIEDDW 174 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 + + Q ++ Y + Q + E Sbjct: 175 LYMMAKFDKAKEPPKHTQ-DEVVLSAYKTIEQFNW---------------------SEAE 212 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 LA EE + ++G+ E S E A+ +L E + + L EI+K Sbjct: 213 YDNYIKAMLAAQTEELNQKSKFKEGKAERSIEMAKEMLQDNEPIEKIIKYTKLSKEEIEK 272 Query: 296 VI 297 + Sbjct: 273 LK 274 >UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV81_PEDHD Length = 318 Score = 91.5 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 82/301 (27%), Gaps = 26/301 (8%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ E ++ L L + + + F E + ++ V Sbjct: 28 DFSFKRLFATEE-SKPILIGLLNHLFKGRKYITEIEYGKNEFPGEIAQEGGA--VFDVYC 84 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-----LPLVVPILFYQG 124 G +IE Q + R + Y A+ K L V + F + Sbjct: 85 TDVNGS-KFIIEVQRGNQEYFKERALFYVSRAISEQAPKGDRKGWAYKLTEVYLLAFLED 143 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSP---FPLVDITITPDDEIMQHRRIAILELLQKH 181 P + + F +++ + + KH Sbjct: 144 FNLPDSPKSEYVQDICLANRHTGIIFYDKVGFIFIEMLNFVKGSDELYTELDKWLYALKH 203 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQ------ADLFYGVLRDRE 235 + + QL + NY + Sbjct: 204 LTEFKQRPEYLSGPEF--------DQLFTLANYASLTPEERDMYNSSLKRKWDNKNVLDY 255 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 +S+ + E+G E+G +QG + + E A +L E++ ++ L EI Sbjct: 256 AVKKSLEQGLEQGLEQGREQGREQGIHKKAIEIALEMLVNKYPIEEIIKLTKLSKEEIQS 315 Query: 296 V 296 + Sbjct: 316 L 316 >UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=B8FTH9_DESHD Length = 325 Score = 91.1 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 41/330 (12%), Positives = 94/330 (28%), Gaps = 37/330 (11%) Query: 1 MDAPSTTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK 57 M + D FK + FL L + + + + +E + Sbjct: 1 MKEFISLKIDYAFKLIFGKEGNEAILIAFLNAALKLPQERRI--EEITIINPELNKEYPE 58 Query: 58 GHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVV 117 + + + + IE Q M R + Y R + L Sbjct: 59 DKKSILDVRAITSQG---MQINIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTK 115 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRV---YNSPFPLVDITITPDDEI-----MQH 169 + + + + ++F+ E + +++ + Sbjct: 116 TVSINIVDFNYLKQTSSYHNVFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRKREISLWE 175 Query: 170 RRIAILELLQKHIRQRDLMLLLEQLV-------------------TLIDEGYTSGSQLVA 210 + LL + ++++ +LE++ I E Y + + Sbjct: 176 NELVRWLLLLEGADNQEILQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAIL 235 Query: 211 MQNYMLQRGHTEQADLFYGVLRDR--ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEF 268 + ++ + + E + + + +G +G +GR E E Sbjct: 236 DEKAAIREAELRLQEALEEGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEGRAEV 295 Query: 269 AQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 A++LL G +AE L EI + + Sbjct: 296 AKKLLVLGFEITKIAEATGLSEEEISGLKD 325 >UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermoanaerobacterales RepID=B0KCX4_THEP3 Length = 267 Score = 91.1 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 45/293 (15%), Positives = 103/293 (35%), Gaps = 27/293 (9%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + +D K + D + + D + +D++ Sbjct: 2 SQKYDITIKDIFSN---MADDITAYFLGLTYTKTDELNIEFTKVE-------KRQSDIVL 51 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + + V +E QS D KM +RM+RYS+ M ++ ++ Y G+ Sbjct: 52 KCTTEKGD--IAVHLEFQSDNDDKMPYRMLRYSLEIMEKYNL-------TPYQLVIYMGK 102 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + +++ + + + ++D+ +I + + LL R+R Sbjct: 103 NDLRMENKLDYNL-----GEENILDYRYKIIDVGTIKFLDITKTDYYDLYALLPIMDRER 157 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + L ++ + ++ + V+ T M+ + Sbjct: 158 RKTEGEKYLKECVEAIKNIPIDINKKKDITFKAEILSGLVYSREVIERVFTEVMEMLRIE 217 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 E + + +++G +E S A+ LL +GM +A++ L + EI K++N Sbjct: 218 ---ESEAYKMILEKGAKEKSLRIAKELLKEGMDINKIAKITELSIEEIKKLMN 267 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 90.3 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 39/136 (28%), Positives = 68/136 (50%), Gaps = 6/136 (4%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 PHD FK+ TAR FLE +LP E+R L DL T+ + S+I++ L+ +D+L Sbjct: 4 IHNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFSDLL 63 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKK----MAFRMMRYSIAAMHR--HLEADHDKLPLVVP 118 + V+++ N GY + + EH+ +P M+ R+ S+ + R +++ + P + Sbjct: 64 FQVKIRENEGYFYFLFEHKVRPYADRRKKMSTRLADDSVLSKQREMFMQSVNHGKPPYIS 123 Query: 119 ILFYQGEATPYPLSMC 134 +G T C Sbjct: 124 RFIRKGNRTGSAACRC 139 >UniRef50_C8PT67 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PT67_9SPIO Length = 285 Score = 89.5 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 43/289 (14%), Positives = 92/289 (31%), Gaps = 19/289 (6%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F + + E + L L ++ +E I+++ + V + V Sbjct: 13 DYMFYRVMEDPEICKMLLNRVLQGKVD-----TITEIELQKTIDDAGRAKG--VRFDVWA 65 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 + G ++ IE Q+ K +A R+ Y A L L + + Sbjct: 66 KDCNGRIY-DIEMQAIDKKDLAKRIRYYQAAIDVSILGKSKPYESLPDTFILFFCTFDYL 124 Query: 130 PLSMCWFDMFYS-PELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM 188 ++ + E +R + I + + LE + + + Sbjct: 125 EKTLPVYTFKTMCSEDSRIELGDGVTKIIINSKAAEHEKNEKLKVFLEYMNGKVSNDEF- 183 Query: 189 LLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWF 248 +++L I E + G+ + G Sbjct: 184 --IQRLEQRIKEVKANEELRREYMLVNTIERDARNDGWKAGIAQGIAQG-------IAQG 234 Query: 249 EEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + G+ +G +G + E A+ L S G+S E +A+ L + E++ + Sbjct: 235 KSLGLAEGEARGSHHKALETARNLRSMGLSIEKIAQATGLTVQEVETIA 283 >UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G1H3_9SPHI Length = 294 Score = 89.2 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 47/306 (15%), Positives = 95/306 (31%), Gaps = 36/306 (11%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI------EESLKGHSTDV 63 D ++K L DFL P + D Sbjct: 6 DYLWKGVLED--VFDDFLRFLYPDADSVFDLSRGITFLDKELEQLFPPEGNEFAPKVVDK 63 Query: 64 LYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L V ++ + +E Q K A RM Y + ++ + + Sbjct: 64 LAQVYTHDGMEEWVLIHVEVQGTCRKDFASRMFTYYYRILDKYHKR--------ITAFAI 115 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL---- 178 EA+ P + + F + F I D ++ L +L Sbjct: 116 LTEASKKPRPNVYEEEFMGTSI-----QYRFNTYKIAEQDTDRLLASDNPFALVVLTAKA 170 Query: 179 --------QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGV 230 K + L+ QL + E S ++ + N++ + +++ Sbjct: 171 AFVGKNLNDKDESDKALLQTKIQLARELLERNMSKEKIRGLMNFLRYYVRFDNSEVNTIF 230 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPL 290 ++ E E T+ EE + + ++G++E A+ + G+ E + + L + Sbjct: 231 EQEVEKLTERSHTM--GIEELLLNRAKKEGKRESLISVAREMKKDGIPVEQIVKFTKLSI 288 Query: 291 AEIDKV 296 EI+K+ Sbjct: 289 KEIEKL 294 >UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P8S5_9BACT Length = 303 Score = 89.2 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 44/298 (14%), Positives = 90/298 (30%), Gaps = 17/298 (5%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ +D L L + + + + + ++ V Sbjct: 16 DFGFKRIFGT-AMNKDLLICFLNSLFNGRQVVKDVSYLNPEHVGDVYTDRRA--IFDVYC 72 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK------LPLVVPILFYQ 123 +G G ++E Q+ R + YS + ++ + V + F Sbjct: 73 EGENGE-KFIVEMQNAYQTYFKDRALFYSTFPIREQAPKGNEWDFKLNNIYTVALLNFNM 131 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYN-SPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 E + + Y+ + V+I+ K+ Sbjct: 132 NEDAFDKEKIRHHVQLCDTATHKVFYDKLEYIYVEISKFNKTLEELDTLYEKWLYALKN- 190 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 L L ++ L D+ + + + + Q + + + + Sbjct: 191 ----LYKLTQRPKELCDKVFDRLFEEAEIAKFTPQEMREYETSKM-AYRDIKNSVDTAKR 245 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 E G+EKG +G S E A+++L+KGM + +M L EI + I Sbjct: 246 EGIAEGIEIGMEKGRAEGMNLRSLEIARKMLAKGMDEASIMDMTGLTSEEIKLLKAEI 303 >UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0R0H3_BRAHW Length = 292 Score = 88.8 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 98/301 (32%), Gaps = 20/301 (6%) Query: 2 DAPSTTPHDAVFKQFLMHA---ETARDFLEIH-LPVELRELCDLNTLHLESGSFIEESLK 57 + +D + DF+ L ++ + L + E K Sbjct: 6 NNNFNVLNDYFVRYLFSDKGSEAILLDFINSIMLDSGMKTFRSVEILTPFNYKENYED-K 64 Query: 58 GHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPL 115 TDV Q V+IE Q + + + R++ Y + + L+ L Sbjct: 65 ETITDV--KCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLKQGEKYDALTP 119 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 V+ I + ++ + + ++++ + + + Sbjct: 120 VISINLLNFNLDDNDSIHSCYMIYDTNNKRLLTDHLQIHIIELKKFKYNSLEYDLNCWLK 179 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 K ++++ + L+ E N++ R + D L + Sbjct: 180 FFTMKDKDNKEVI-----MSELVKEKPIMEEVQRRYNNFIKDRLMMNEYDKRQAYLYGNQ 234 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 E L +G E+GI++G ++ A+ + +K M ++E+ L + +I+K Sbjct: 235 IMLEEERRL---GRVEGKEEGIKEGIEQEKYSLARNMKNKNMDLNLISELTGLSIEKIEK 291 Query: 296 V 296 + Sbjct: 292 L 292 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 88.4 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 99/247 (40%), Gaps = 14/247 (5%) Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR--------HLEADHDKLP 114 ++Y V+++ + ++++E QSK D +M +R++ Y I + KLP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA- 173 ++PI+ Y G + + + + L+D+ ++E++Q + Sbjct: 61 AIIPIVLYNGVNRWTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNLIS 120 Query: 174 ILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQL--VAMQNYMLQRGHTEQADLFYGVL 231 + LL + I + +L +L ++ + + + + + + ++ +L Sbjct: 121 SIFLLDRKIDKEELTEKWGKLADVLKDISEEEFIILRNWLFSVVSRFLPEDKEKEIKEIL 180 Query: 232 RDRETGGESMMTLAQWFEEKGIE---KGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 E E + L + E+ + +G+++G ++ E + +G + + + Sbjct: 181 VQSEGVEEMISNLERSLREEFRKTRREGLKEGLKKGKLEGLKIGKMEGRMEGKIEGIRMV 240 Query: 289 PLAEIDK 295 ++ + Sbjct: 241 VFEQLKE 247 >UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteria RepID=B1WSK8_CYAA5 Length = 260 Score = 88.4 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 95/276 (34%), Gaps = 28/276 (10%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + FL + + L E SL D L +Q + + I Sbjct: 5 DNVCKFLAERFSRDFANWLLNEPIELTELKPTELSLNPIRADSLIFLQSDD----IVLHI 60 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E Q+ PD+ + FRM Y + R+ + + ++ Y P + + + F Sbjct: 61 EFQTSPDEDIPFRMTDYRLRVYRRYPNKE------MYQVVIY---LKPSNSELVYQNTFE 111 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE 200 L F ++ + D + + + +L R+ L Q+ +ID Sbjct: 112 LTNL-----RHQFNVIRLWEENTDSFLNNSGLLPFAVLTCTDNPRE---TLTQIAAIIDS 163 Query: 201 GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQG 260 Q + + G D +LR + ++ +G +G Sbjct: 164 MPNQQRQSDISASTAILSGLKLDQDSIKRILRSDIMKESVI-------YQEIFHEGEVKG 216 Query: 261 RQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +++ + A +L M+ E ++++ L L EI+++ Sbjct: 217 QKQAIKNIALNMLRNHMNLEVISQLTGLNLQEIEQL 252 >UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax=Proteobacteria RepID=C4ZLA7_THASP Length = 339 Score = 88.0 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 47/305 (15%), Positives = 102/305 (33%), Gaps = 30/305 (9%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI----EESLKGHSTDV 63 +D+ +K+ + H +F++ + P R++ + +L D Sbjct: 9 DYDSPWKEAVEH--AFPEFIDFYFPDAGRQIDWARGHRFLDKELQQIVRDAALGRRHVDK 66 Query: 64 LYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L SV G +L V IE Q D A RM Y+ + V + Sbjct: 67 LASVTTHAGEEDWLCVHIEVQGSMDPDFARRMFVYNYRIYDSYDR-------PVASLAVL 119 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPF-PLVDITITPDDEIMQHRRIAILELLQKH 181 + + ++ R P LVD + A++ + Sbjct: 120 ADDDPAWRPDRFGYERL----GCRHNLQFPVAKLVDHAADEAALLCNPNPFALVTAAHLY 175 Query: 182 IRQR--------DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 R+ D L +L+ D ++ ++M++ + L+ + Sbjct: 176 TRRTRRSPIARFDAKRRLVRLLYERDWTRQRILDFFSVLDWMMRLPREFEQRLWQDIENI 235 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 GE + E IE+G+Q+G ++ + ++ + +G+ + A + + Sbjct: 236 ---EGERKVKYVTSVERLAIERGLQKGMEQGLEIGIEKGIEQGIEKGIEKGRAQGSASVL 292 Query: 294 DKVIN 298 +++N Sbjct: 293 LRLLN 297 >UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0J0_9FIRM Length = 316 Score = 87.6 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 48/325 (14%), Positives = 91/325 (28%), Gaps = 56/325 (17%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK-------GHSTD 62 DA+ K++L + E D + + + N + ++ K D Sbjct: 5 DALTKEYLSNNEIFADVFNYLIYDGQQRILPENLIERDTSEITLPLGKRGELATIQKFRD 64 Query: 63 VLYSVQMQGNPGYLHVV--IEHQSKPDKKMAFRMMRYSIAAMHRHLEAD----------- 109 +L + L+V+ +E+QS M R M Y + Sbjct: 65 ILKGCIAKEYKNTLYVLFGVENQSHIHYAMPVRNMLYDAINYSAQVNEKTKKYRKIRKQN 124 Query: 110 ----------------HDKLPLVVPILFYQGEATPYPL-SMCWFDMFYSPELARRVYNSP 152 D+L V+ + Y G S+ L + + Sbjct: 125 PNFKETTEEFLSGWHPDDRLVPVITVTIYFGNDGWDAAKSLQEMFSETDESLKEFLPDYK 184 Query: 153 FPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQ 212 L+ + H L + K I + M +L T+ + Sbjct: 185 LHLISCNNISNFTKF-HTEFGRLMHILKVISDEEQMDILLSDPGYSALSVTAAQIINTFT 243 Query: 213 NYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 E E G ++G E + + QR+ Sbjct: 244 GLHFSIPEKEDTINMRN------------------AWTDHKESGRREGFNEATTSYTQRM 285 Query: 273 LSKGMSREDVAEMANLPLAEIDKVI 297 G+ E +AE+ P+ E++K++ Sbjct: 286 YKAGIPLEVIAEVIEKPVTEVEKIL 310 >UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1J9_CLOB8 Length = 278 Score = 87.6 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 43/299 (14%), Positives = 93/299 (31%), Gaps = 33/299 (11%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCD-LNTLHLESGSFIEESLKGHSTDV 63 + +D VFK + +D + L L+ D L + L + + E + + Sbjct: 3 ISPKNDFVFKLLFGDEKN-KDLIIELLNSILKMPHDELEDIELINTELLREFAEDRKGIL 61 Query: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH-----DKLPLVVP 118 + + H+ IE Q MA R + Y + +++ + K + Sbjct: 62 DVRAKTKSGE---HIDIEIQVLYTYYMAERTLFYWSKMYNGQIKSGYTYDKLKKCITINI 118 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 + F E S + + +L + L + + + + L Sbjct: 119 VDFNCIEINKLHTSFHITEDETNKKLTDVLEIHYLELPKLFDNNIPKDESEPLVQWMMFL 178 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 Q R ++ +L + I + Y + N Sbjct: 179 Q--SRNKEAFEMLAEKNEKIKKAYNILEVISKDDN---------------------ARAA 215 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + ++ ++G +E + + A+ L G+ E VA+ L + E+ K+ Sbjct: 216 YEAREAELHDQMTRLKSAREEGIKEATIKNAKNFLVMGLDVEMVAKGTGLSVDEVLKIK 274 >UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKU9_9CYAN Length = 323 Score = 87.6 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 45/312 (14%), Positives = 91/312 (29%), Gaps = 23/312 (7%) Query: 2 DAPSTTPHDAVFKQFLMH---AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG 58 + D FK+ + FL + + + T+ E+LK Sbjct: 4 KKFISPKIDYAFKKIFGSDQSEDILISFLNAIV-YNGKSVISSLTIVNPYNPGQVETLKD 62 Query: 59 HSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKL--PLV 116 D+ + V+IE Q R+ A L + L V Sbjct: 63 SYLDI--RAVLNSGE---IVLIEMQVARIAAFYKRVTYNLCKAYANQLTSGDYYLEITPV 117 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 + + +F E + L+ + + + Sbjct: 118 IAVTITDFILFKENPKCIHHFVFKDKESSSEYPEHELQLI----FVELPRFVKKLPELQT 173 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-----DLFYGVL 231 L +K I LE++ + E L L E+ L + Sbjct: 174 LAEKWIYFMTQAQDLEEIPESLAEVTAIEKALTIANQANLTPAEAEEVSRRAMQLRDEIG 233 Query: 232 RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVA---EMANL 288 R + E+ + ++G ++G Q+GR ++ RLL+K + + L Sbjct: 234 RIKYATEEASKEAREEGRQEGRQEGRQEGRITEARALVLRLLNKRFPDQTAELNSLVEGL 293 Query: 289 PLAEIDKVINLI 300 L+ ++ + + + Sbjct: 294 SLSALEGLSDAM 305 >UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVT6_PEDHD Length = 317 Score = 87.2 bits (214), Expect = 6e-16, Method: Composition-based stats. Identities = 50/315 (15%), Positives = 109/315 (34%), Gaps = 35/315 (11%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI------EES 55 + D K DFL L + + E Sbjct: 18 EERPRRKDDEFLKGAFED--NFPDFLRFVFSDADEILDFNREIEFLNNELFTIIPDRERK 75 Query: 56 LKGHSTDVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLP 114 G D+L + ++ ++ + +E + D+K R+ Y+ ++ + Sbjct: 76 GGGRRADLLAKLYLKDGTEKWVLLNVEIEGGNDRKFGQRVFEYNYRIRDKYKVS------ 129 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI 174 V I + G+ T + Y EL V + + + +DE+++ Sbjct: 130 -VASIAVFTGKKTQLRPTE------YLDELLGTVLSFKYTAYHVFDHQEDELLKSDNPFS 182 Query: 175 LE-------LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 L LL+ I +L +V + ++++ ++ E ++ Sbjct: 183 LIALACQKALLEGKIPDEELADERLVIVKALLRHGYDRQRIISFILFLKNFIFIESEEIN 242 Query: 228 YGVLRDRETGGESMMTL------AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 + E + + +W ++ +G +GR+E + E A+ L +G++ E Sbjct: 243 RKFDQQIEELTKDKNPMGVIDVFKKWERQEAKIEGKLEGRREEALEIARELKKEGLTIEF 302 Query: 282 VAEMANLPLAEIDKV 296 +A+ LP+AEI+K+ Sbjct: 303 IAKTTKLPIAEIEKL 317 >UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAJ2_9SPIR Length = 312 Score = 86.8 bits (213), Expect = 8e-16, Method: Composition-based stats. Identities = 44/310 (14%), Positives = 103/310 (33%), Gaps = 29/310 (9%) Query: 10 DAVFKQFLMHAE---TARDFLEI-HLPVELRELCDLNTLHLESGSFIEESLKGHSTDV-- 63 D + + DF+ L ++ + L + + K + D Sbjct: 9 DYFVRYLFSSKDSNFILLDFINSTMLDANMKTFRSVEILTPSPKAGSRLNYKENYDDKES 68 Query: 64 --------------LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD 109 V+ G + V+IE Q + + + R++ Y + + L+ Sbjct: 69 IAPKVARKVDRCRRRLDVKCITQNGTV-VIIEIQLQGNSRFPERILYYWASNYSKLLKQG 127 Query: 110 HDK--LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM 167 L V+ I + ++ + + +++I D+ + Sbjct: 128 EKYDALTPVISINLLNFNLDNNDCIHSCYMIYDTKSKRLLTDHLQIHIIEIKKFKDNLLD 187 Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 + + K R+++ + L+ E N++ R + D Sbjct: 188 KDLDCWLKFFTIKEKDNREVI-----MSELVKEKPIMEEVQKRYNNFIKDRLMMNEYDKR 242 Query: 228 YGVLRDRETG-GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 L + E + +KGIEKGI++G +E A+ + +K + ++++ Sbjct: 243 EAYLYGNQIMLEEERRLGIEEGFKKGIEKGIEKGIKENQILTAKNMKNKNIDIALISDIT 302 Query: 287 NLPLAEIDKV 296 L + EI+++ Sbjct: 303 GLSIKEIEEL 312 >UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYF1_9BACE Length = 349 Score = 86.8 bits (213), Expect = 8e-16, Method: Composition-based stats. Identities = 49/356 (13%), Positives = 109/356 (30%), Gaps = 67/356 (18%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D FK +++ L L L + L +++ Sbjct: 1 MSKYVNPFTDIGFKIIFGQPA-SKNLLITLLNELLAGEHHITELTFLDKEDHADNVSDKG 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA------------ 108 ++Y + + G ++++E Q++ R + Y A+ R +E+ Sbjct: 60 --IIYDLYCRTASGE-YIIVEMQNRWHSNFLDRTLYYVCRAVSRQIESPSSKEVPVPEDP 116 Query: 109 -----------DHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRV--------Y 149 +LP + I + + + + V Sbjct: 117 MTAREPLVSYGKQYRLPTIYGIFLTNFKEENLEAKFRTDTVLSDRDTGKIVNPHLRQIYL 176 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD--LMLLLEQLVTLIDEGYTSGSQ 207 P+ D+ D + + I L+ + R D + E L L S Sbjct: 177 QFPYFTKDL---SDCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLAAVADLSEEN 233 Query: 208 LVAM--------QNYMLQRGHTEQADLFYGVLRDR-------------------ETGGES 240 +A N +++ + + + E E Sbjct: 234 RIAYDKALDRYRVNQIVEEDERRKNEEMRRKAAEEGLKEGMKAGLEKGVKKGRLEGIKEG 293 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 M + ++G+EKG+++G Q+ E A+++ G+S + + + L ++I+ + Sbjct: 294 MKEGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSSDIENL 349 >UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2B5_PEDHD Length = 310 Score = 86.5 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 37/295 (12%), Positives = 88/295 (29%), Gaps = 26/295 (8%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ + +D L L+ ++ +L + E+ + V+ + Sbjct: 34 DLGFKRLFSAEQN-KDITITFLNHVLKGKREVVSLEFLKNEYPGETQEEGG--VIIDIVC 90 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-----LPLVVPILF--- 121 + G ++E Q ++ R + Y+ + + K L V I Sbjct: 91 KDQIGAF-FLVEMQKSWNQNFKERSLFYASRLITEQAPHGNRKEWAYSLKDVYVIALLEK 149 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + A + + + F +++ E + K+ Sbjct: 150 FTINAGNKGKWLHDIALVNTDTGKVFNERLRFTYIELLSFKKTENQLETDLEKWIYALKN 209 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 ++ L+ Q Y+ ++ + Sbjct: 210 LKHLKQAPAAFTEPQLL--------QFCQAARYINLTKEE------KNMISAKTKARWDY 255 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + G E+G +G + + + A +L +KG+ ++ E+ L + EI + Sbjct: 256 YYAIDGAKIMGREEGETRGAHQKAAQIAIKLKNKGVPFTEIQELTELSITEIKNL 310 >UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3A9D Length = 324 Score = 85.7 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 48/313 (15%), Positives = 107/313 (34%), Gaps = 37/313 (11%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLP-------VELRELCDLNTLHLESGSFIEESLKGHS 60 D + K + + D + L E D+ T +E + Sbjct: 18 QKDILLKDYFT-PDIFADAINAILYDGKSVVTPERMRTIDIETQRVED-ENGNVTADTRL 75 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD---KLPLVV 117 D V++ Y IEHQS D M R+M Y + R ++++ ++ ++ Sbjct: 76 RDSAKVVEVDD-AIYCLFAIEHQSVEDYTMPLRIMEYDVREYLRQVKSNKGVQVRIKPII 134 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPE--------LARRVYNSPFPLVDITITPDDEI--M 167 I+ Y +A + + DMF L + + L + ++++ Sbjct: 135 TIVMY-WKADKWNQPVSVKDMFDKNTVRWLEYNGLGGYIQDYRMHLFEPGTVKEEDLEKF 193 Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 + ++ ++ L E+ + + + + Y+ G Sbjct: 194 KTELKDVIAYVKYSKSTEALKDYNEKYKPDLTKSTVTLINELTNSKYVFIEG-------- 245 Query: 228 YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 ++R E+ L + KG + +++ + + + L +GMS ++A + Sbjct: 246 ----KERLDMCEAFEGLIEEGRAKGKAEELKE-KYKSWVTLSNNLKKRGMSNPEIASLLG 300 Query: 288 LPLAEIDKVINLI 300 +P E+ K +I Sbjct: 301 VPETELQKAFKMI 313 >UniRef50_B3QUJ9 Putative uncharacterized protein n=8 Tax=Bacteria RepID=B3QUJ9_CHLT3 Length = 286 Score = 85.3 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 43/299 (14%), Positives = 97/299 (32%), Gaps = 22/299 (7%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + D FK+ L L L E + L + + Sbjct: 6 EKYINPLTDFGFKKLFGTEPNKI-LLMDFLNQILPEKHQIQELSYSKNEHVGQQELDRKA 64 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--DKLPLVVPI 119 ++ V G G ++E Q R + Y+ + + + DKL V + Sbjct: 65 --IFDVYCVGQSGE-RFIVEVQKAKQNYFKDRSIYYASFPIQEQAKRGNWDDKLEPVYTV 121 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYN--SPFPLVDITITPDDEIMQHRRIAILEL 177 + L + + A+R ++ F +++ E + Sbjct: 122 GILDFIFDDHKLDAELIHVVALKKSAQRSFSDKLKFIYIELPKFKKTEAELETQFDKWLY 181 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 + +H+ Q Q + ++ + + +N ++ + Sbjct: 182 VFRHLSQLQKRPTKFQ-EKIFEKLFEAAEIAKFSKNELVA-------------YEESLKY 227 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 M + +E+G +G + G ++ + E A+ L +KGM ++++E+ L EI+ + Sbjct: 228 YRDMKNVVDTSKEEGWLEGQKAGCEQRNYEIARVLKAKGMPIQEISEITGLTAQEIEHL 286 >UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia intestinalis ATCC 50581 RepID=C6LTE0_GIALA Length = 353 Score = 85.3 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 43/298 (14%), Positives = 95/298 (31%), Gaps = 28/298 (9%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D VF Q + + L L L+ + + ++ + G S + + Sbjct: 73 DFVFYQIFGVEKH-KSVLISLLNSILKGNPHVKDVRIDPTEHKRTTPDGKSVRLDIKATI 131 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH---LEADHDKLPLVVPILFYQGEA 126 V +E Q + R + Y + + + +P V+ I Sbjct: 132 NDGT---IVDVEMQCINTGDIYHRSIYYQSLILRDYTIKQGQSYKSIPDVIIIWIMN--Q 186 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 C ++ P+ EI + + L K + Sbjct: 187 DITNRKGCMHEIV--------------PMYKANGIDQIEIASEKMRQFIIELTKLGNTSN 232 Query: 187 L--MLLLEQLVTLIDEGYTSGSQLVAMQ---NYMLQRGHTEQADLFYGVLRDRETGGESM 241 +T I + + +L+ ++ M + + + + R + Sbjct: 233 FCYNKAFTAWMTFIKDPSSISGELLEVEGVQTAMKELTYLSENKETRAIYDARRIALLDL 292 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 + + EKG +G+ +GR + + A+++LS G+ E + + L + EI+ V + Sbjct: 293 NSAIEHGIEKGKAEGLVEGRDKERERMAEQMLSDGLDIEFIVRYSGLSMQEIENVKKM 350 >UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z376_9FIRM Length = 316 Score = 84.9 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 94/301 (31%), Gaps = 24/301 (7%) Query: 9 HDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 D VF++ + + DL L+ ++ DV Sbjct: 8 KDRVFRKLFGYEKNKGNLLELYNALNDSNYTNPDDLEINTLDDVFYMNM-----KNDVSC 62 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--------EADHDKLPLVV 117 + + EHQS M R RYS + ++ K+P Sbjct: 63 IIDWN------MAIYEHQSTWSYNMPLRGYRYSAELYNDYIVRNNLDVFRRKLIKIPTPQ 116 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 +FY G + + + +++I ++E+M I E Sbjct: 117 YYVFYNGNEKRPDREVLKLSDAFMVPCKDGEFEWTATVLNINAGHNEELMSKCSILR-EY 175 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 + ++ + +L I + ++ ++ + L+ + ET Sbjct: 176 AIMVSKIKEFLAESLELKDAIKKAIDYCLDNNVLKEFLQDHRSEVEDMLWRE-YNEEETM 234 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + E+ G+E G G + + + L K S E++A+ ++ I+K+ Sbjct: 235 AHWKEDFYEEGEQHGLEVGRANGEKIKLIKLVCKKLVKNKSIEEIADDLEEDVSTIEKIC 294 Query: 298 N 298 N Sbjct: 295 N 295 >UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA1_9CLOT Length = 302 Score = 84.5 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 47/288 (16%), Positives = 96/288 (33%), Gaps = 26/288 (9%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + D++F+ + + DL + ++ D+ Sbjct: 13 NRKFKDSLFRVIFSEKKELLELYNAINGSHYENPDDLIITTIGDVLYLGM-----KNDIS 67 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--------DKLPLV 116 + + G + E QS + M R + Y +L+ LP Sbjct: 68 FLI------GQHLSLYEAQSTWNPNMPLRGLFYFSRLYQGYLKEHQLDLYSRRPLSLPFP 121 Query: 117 VPILFYQGE-ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-HRRIAI 174 I+FY G P + D+FY E +++I ++E+M+ R++ Sbjct: 122 EFIVFYNGTMEQPDRTQLRLSDLFYQAEGVPC-LECTATMININYGHNEEMMKSCRKLYE 180 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 L +R R L L +D+ Q ++N++L+ + + + Sbjct: 181 YAFLINAVRSRLNEGL--HLEAAVDQAVEDCIQHDVLKNFLLKHREEVREMILSEYDEEL 238 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 E ++ + E ++ G Q G QE RL + G + + + Sbjct: 239 HINSEKKISYEEGLEAGVVQ-GTQHG-QERVNALITRLAAAGRADDII 284 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 84.5 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 43/293 (14%), Positives = 104/293 (35%), Gaps = 14/293 (4%) Query: 7 TPHDAVFKQFLMH-AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 HD FK + + D+ ++ D+ L E D+ Sbjct: 2 KSHDHNFKNLFLDFPKETLDWFFPQAGQSWGKVLDVEFLRQEPKKHNLSD-SSLELDMPI 60 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + L ++E Q K ++++RY+ M H +A LV+P + + Sbjct: 61 LFNFENQQLLLW-LVEFQEDKSKFSIYKLLRYTTDLMETHPDA------LVIPTVLFTDR 113 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + R + + + + + LL K ++ Sbjct: 114 KKWSKAVLQQLHAQLHD---RMFLHFEYVFHKLFDLNARDYYNVDNPVVKILLPKMHYKK 170 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + + + + S +++ E + + + E+ M LA Sbjct: 171 EDRIEVIRQAYAGLFQLVSSGLFDKYVDFIDTYAEIEDQEQL-NLYNEIVQHKETAM-LA 228 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 Q+ E+G+++G ++ R++ F ++ +G+S +A++ +L ++ ++K++N Sbjct: 229 QYIRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDLDVSMVNKILN 281 >UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Treponema vincentii ATCC 35580 RepID=C8PLW8_9SPIO Length = 264 Score = 84.1 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 52/288 (18%), Positives = 102/288 (35%), Gaps = 37/288 (12%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D +F + + H R FLE+ ++ ++ L++ ++ + + K DVL V+ Sbjct: 14 DFMFCKVMEHESLCRPFLEMLFSTQIEKITYLSSQNIITTN---SEAKTVRLDVL--VKD 68 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 Y IE Q + + RM Y L+ + L ++ + Sbjct: 69 DIGTSY---DIEMQVGNEYNIPKRMRYYQAVLDVAFLDKGYSYKALNNSVIIFVC----- 120 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 +F R VY D I+ H + L K ++ D Sbjct: 121 --------LFDPIGNDRAVYTFENI-----CIEDKTILLHDGTKKIILNAKAFKKTDNQE 167 Query: 190 LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFE 249 L + + G + + ++ + E A Y +L Sbjct: 168 -LRGFLQYVTTGKATTAYTGRIEQMIQTVKQNELARREYHILPAALMDAMD--------- 217 Query: 250 EKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +G +G+ +G ++ + E A+ LL G+S E++A+ L AE++ + Sbjct: 218 -EGEARGLAKGSRQKALETAKNLLHFGLSVENIAQATGLSQAEVEALK 264 >UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369BC Length = 310 Score = 84.1 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 76/291 (26%), Gaps = 46/291 (15%) Query: 10 DAVFKQFLMHAETARDFLEI-------HLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 D K+ L D L EL + + + D Sbjct: 5 DFYIKKLLQDPARFADLYNAEIFHGKQILKAELLSPVSTESGIAITNRSGRKQTIQRRRD 64 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA-------------- 108 + + ++ E Q + M R + Y L Sbjct: 65 IAMKASI--GACFIVAGCEAQGEIHYGMPIRSLTYDALDYTEQLTEIQKEHRKKKDLAKS 122 Query: 109 --------DHDKLPLVVPILFYQGEATPYPLSMCWFDMFYS-------PELARRVYNSPF 153 DKL V+ ++ Y G P+ +DM P+L + + Sbjct: 123 PEFLSGITRRDKLQPVLTLVLYCG-KDPWDGPKSLYDMLDLRGPTECIPDLLAALPDYRI 181 Query: 154 PLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQN 213 LVDI + + + + +L+ + + I + A+ Sbjct: 182 NLVDIRKIENLSLYKTGLQQVFGMLKYSTDKSKFYNYITSNHDQISMLDDN-----ALTA 236 Query: 214 YMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEV 264 M G E L + G +M G +G ++G++ Sbjct: 237 VMGLLG--ENRRLMKYLAAPGREEGYTMCQAIDDLIADGKLEGKREGKRRG 285 >UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3R2_ABIDE Length = 336 Score = 84.1 bits (206), Expect = 6e-15, Method: Composition-based stats. Identities = 42/296 (14%), Positives = 97/296 (32%), Gaps = 33/296 (11%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D+VF + R + + + FI D+ ++V+ Sbjct: 66 KDSVFTLLFSDIKNIRKLYQSLHDDSDSYSDEDFKIITLENVFINAP----YNDLGFTVK 121 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------KLPLVVPIL 120 + + ++ E QS + M R++ Y + H ++ +LP I+ Sbjct: 122 NK-----VIILAEAQSTFNPNMGLRLLIYIAQSYHDYISEYKFNIFSEKLIRLPNPEFIV 176 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G + D F S N + I E + + E+ + Sbjct: 177 IYSGSKKTDITEIRLSDCFES----GTAPNIELVVKVIGGNNVKEGIIQEYLKFCEMYDE 232 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 +R E+ + + +++++ + + + ++ Sbjct: 233 KVRSVKPS---EEKAYSLKKVIKDCIDNGILKDFLTLHQKEVEDMMMTVIPPEQALE--- 286 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + + + KGI+QG+ + S FA+ +L S + + E+ L +I ++ Sbjct: 287 ------YIKLEEYNKGIEQGKLDTSLNFARNMLKNNYSIDSIIEITGLSREQIKRL 336 >UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methanospirillum hungatei JF-1 RepID=Q2FTW8_METHJ Length = 306 Score = 84.1 bits (206), Expect = 6e-15, Method: Composition-based stats. Identities = 44/303 (14%), Positives = 86/303 (28%), Gaps = 30/303 (9%) Query: 6 TTPHDAVFKQFLMHA---ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + +D F+ + D L LP + + + I ++ K D Sbjct: 21 SPRNDFAFRLLFGDPNNSDILLDLLNAILPDHFQSVVCTD-----PHLLIPDTKKECILD 75 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 + V ++V IE Q K M R + Y L H L I+ Sbjct: 76 I--KVLSDSG---VYVDIEMQVLDLKSMEKRSLFYWAKMYLDQLNRGHSYHELKRTIVIN 130 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYN---SPFPLVDITITPDDEIMQH--RRIAILEL 177 + P+ F + + + +++ + ++ L Sbjct: 131 ILDYMLMPVE-DLHTCFQAYDKTHDILMSDVFEIHFLELPKVHRCRVPYKGTDLLSWLTF 189 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 L + + +M + L + + D + +E G Sbjct: 190 LNAYTEEEIIMAAEGKPAIQKAYNNLQIMSLDEETRRLYEAREMFLHDQATRMYEAKEEG 249 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 E M + + +G + LLS GM E + + L + IDK+ Sbjct: 250 LEEGMKKGREEGREEEREGF-----------VKNLLSLGMDDEFIKKATGLDQSIIDKLK 298 Query: 298 NLI 300 + Sbjct: 299 KSL 301 >UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RGS0_MOOTA Length = 310 Score = 83.8 bits (205), Expect = 7e-15, Method: Composition-based stats. Identities = 58/320 (18%), Positives = 110/320 (34%), Gaps = 44/320 (13%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M S +D K H + + +E+ Sbjct: 1 MQPKSGNRYDITIKDLFADETQELINYFGHFEARVTGDLKIEFPQVET----------RV 50 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 +D++ + Q P L + +E QS+ D +M +RM+RY++ V I+ Sbjct: 51 SDLVMKAESQQGP--LAIHLEFQSRNDDEMPYRMLRYALEI-------HKTYHLPVYQIV 101 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 Y G+ S + + + + + + L+D+ +E+ +L LL Sbjct: 102 IYFGQWQMNMTSQLEYRLGD-----QNLLDYRYHLIDVGNITYEELKNSPHQRLLSLLPV 156 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRG-----------------HTEQ 223 R++ E L ++ S L + +L+ EQ Sbjct: 157 VDREKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGLVFDKKAIDLVFREVEQ 216 Query: 224 ADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM---SRE 280 + + M + EKG+EKGI++G+QE + RLL K RE Sbjct: 217 MLSIEESAGYQRIFEKGMEKGIEKGMEKGMEKGIEKGQQESLLDVTIRLLRKKFRKIPRE 276 Query: 281 DVAEMANLPLAEIDKVINLI 300 +A + + + ++I+ I Sbjct: 277 YLARIKEQDVYVLQQIIDSI 296 >UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacteroidetes/Chlorobi group RepID=Q3ARM2_CHLCH Length = 322 Score = 83.4 bits (204), Expect = 8e-15, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 95/311 (30%), Gaps = 33/311 (10%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+ + + FL LP+E + D L + S ++ Sbjct: 13 DFGFKKLFGSEMNKDLLIAFLNTLLPIEAGTIAD---LTFLPNDRVGRSEFDRRA--IFD 67 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPI----L 120 + + G + ++E Q R + Y+ + + L + + Sbjct: 68 LHCKNEKGE-YFIVEMQQAKQDYFKDRSVFYASFPIQEQAQKGKWNYCLQPIYMVGILDF 126 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 + + + F +++ L Sbjct: 127 IFDENKADDTIVHHEIKLVNLSTGKVFYEKLTFIYLELPKFTKSVDELESDFDKWCYLLS 186 Query: 181 HIRQ----------------RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA 224 ++ +L + + E S ++N + +A Sbjct: 187 NLPDLTDRPARLQEKVFLKVFELAEIAKYTPEEAREYEKSLKVYRDLKNVIDCAYDEGKA 246 Query: 225 DLFYGVLRDRETGG--ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 + + + G E M+ + ++G++KG++ G + E A++L+ KGMS ++ Sbjct: 247 EGIEEGIEKGKEIGVLEGMVKGKELGLQEGLQKGMEAGLLKGKLEIARKLMVKGMSADEA 306 Query: 283 AEMANLPLAEI 293 A +A + + + Sbjct: 307 AGIAGVDVERL 317 >UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UZR7_CLOBO Length = 334 Score = 83.4 bits (204), Expect = 8e-15, Method: Composition-based stats. Identities = 55/324 (16%), Positives = 100/324 (30%), Gaps = 35/324 (10%) Query: 10 DAVFKQFLMH-AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D + K + + L D L + + FI ++ DV + V Sbjct: 11 DEILKFLFSTSKKVLVNLLNGIFEENFSS--DEVELSVSNNEFIMDTFDTLRGDVFFEVL 68 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP---------- 118 + +E Q+K D M RM Y D + P Sbjct: 69 NNEVSNKVTYHLEFQTKNDSTMIIRMFEYGFRKGKEQTGNRDDFKTIYFPKQKVIFIERN 128 Query: 119 ----------ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ 168 I+ ++ Y + + + + EL PL + D E + Sbjct: 129 NNIKEDIKLKIVLPDEQSFIYSVPVMKYWEYTDNELIENKMYPLLPLQLFNLRKDLEYAR 188 Query: 169 HRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH-------- 220 H + + + + L D+ G M + Sbjct: 189 RSNNIDKINDLSHEAKEIALKIANESKKLFDDNEIIGEDFHKMLLAIQNLIEYLNRNYFN 248 Query: 221 ----TEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 E+ L D E + + EKGIEKG+++G ++ + E A L G Sbjct: 249 DDRLEEEVSTMTKTLYDPEVEKRGIEKGIEKGIEKGIEKGMEKGIEKKAIEDAIGFLRLG 308 Query: 277 MSREDVAEMANLPLAEIDKVINLI 300 +S E V++ LP+ ++ ++ + I Sbjct: 309 VSEEIVSKGTGLPIEKVRELKDKI 332 >UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7AK04_9PORP Length = 299 Score = 83.4 bits (204), Expect = 8e-15, Method: Composition-based stats. Identities = 51/304 (16%), Positives = 108/304 (35%), Gaps = 33/304 (10%) Query: 10 DAVFKQFL---MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+F + E FL L D+ + + + ++ Sbjct: 12 DYAFKRFFGTVSNKELTIGFLNSLLNK------DIKDIIFHNVEMQGNNTDSRKA--VFD 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 + +G+ G L + +E Q K K + R++ Y+ + + + +K L + E Sbjct: 64 LFCEGSDGELFI-VEIQKKRQKYFSDRVLYYASFVIQMQADIESEK-----FRLAKEEER 117 Query: 127 TPYPLSMC--WFDMFYSPEL-ARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 + + + F L R + +V + + LEL + ++ Sbjct: 118 RRWNYHINKVYVVCFLDFRLDTRYTDKYRWDVVRMDRELKIPFSETLNEIYLELPKFNLN 177 Query: 184 QRDLMLLLEQLV----------TLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 + ++ + L E + L +++ + + + + L Y + Sbjct: 178 FEECDTFYKKFLYTMNNIDIMGQLSKETIQNDKLLRKLKSAIELQRMSAKERLAYELSIA 237 Query: 234 RETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 E + +A FEE EKGI +G E ++ + GM +A+ A LP E+ Sbjct: 238 AE--RDLAACMATSFEEG-EEKGIAKGITEGMRKIILNMKQAGMDLATIAKTAGLPEKEV 294 Query: 294 DKVI 297 + ++ Sbjct: 295 EALL 298 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 83.4 bits (204), Expect = 8e-15, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 94/236 (39%), Gaps = 5/236 (2%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS-TDVLY 65 PHD K+ L E A+ L+ HLP E+ + TL + + ++ K D++Y Sbjct: 15 NPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADIIY 74 Query: 66 SVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 S++ G ++V+IEH+S DK + ++++ A + + K+ + PI+ Y Sbjct: 75 SLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEG--KITPIYPIVIYAS 132 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-HRRIAILELLQKHIR 183 + S S + + + +++ + I + ++ I L + + I+ Sbjct: 133 KEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMTLRIIQ 192 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 + +L + ++ + + + + + + V + E G Sbjct: 193 EPTPENILNLIKSIETLYNYKPKAVYVIALSYIFTIAKKDKNTYIKVKKQLEGGNM 248 >UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LE73_9FIRM Length = 326 Score = 83.4 bits (204), Expect = 9e-15, Method: Composition-based stats. Identities = 50/297 (16%), Positives = 103/297 (34%), Gaps = 30/297 (10%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPV-----------ELRELCDLNTLHLESGSFIEESLK 57 D + K++ + DF+ L + L E Sbjct: 4 KDIILKEYQRDSRHFCDFVNGALAQGRPLLKRGQLVPVPTELVLVKDTEEDDENAVVKTV 63 Query: 58 GHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH------- 110 D+ + N G + V I++Q+ D M R+M Sbjct: 64 QRFRDITGKAEADKNAGCIIVAIQNQTTVDYGMPLRVMLEDALEYDVQRRTKKNRKLHKG 123 Query: 111 DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELAR-RVYNSPFPLVDIT-ITPDDEIMQ 168 +KL LV+ ++FY G S + E + R Y +P+V +T D + Sbjct: 124 EKLCLVITLVFYYGTTPWRAPSDLAEMISVPREFRQLREYIQSYPIVVVTPENVDTACFR 183 Query: 169 HRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 ILE+L++ ++++ LE+ + ++ ++++ T+ D + Sbjct: 184 GGWQEILEILRRQNDEKEMGRYLEKNRAIYEKLPEDTNRVIFAL--------TDHLDYYR 235 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 + E +M + G+E+G +QG + ++ ++ +GM A + Sbjct: 236 ELKEKGEKI--TMCKAFTDHYKSGVEEGKKQGMKRGRRQGIKQGKRQGMDMGIRAMI 290 >UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255_PLEBO Length = 295 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 51/304 (16%), Positives = 107/304 (35%), Gaps = 36/304 (11%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI----EESLK 57 + T +D +K F+ R+FL P ++ + + + Sbjct: 4 QSSENTDYDNPWKTFI--ELYFREFLAFFFPTIEADVDWSKPVRFLDKELQKIVRDAEIP 61 Query: 58 GHSTDVLYSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D L V +++G + IE QS+ ++ RM Y+ R+ V Sbjct: 62 KRYADKLVEVHRLRGERTLVICHIEVQSQEERDFVARMYSYNYRLRDRYN-------CPV 114 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITIT----PDDEIMQHRRI 172 V + + + S + EL + FP+V ++ + E +Q+ Sbjct: 115 VSLAILGDDRPNWRPSRFY------DELWGCATHFEFPIVKLSDYQSQWTELEAIQNPFA 168 Query: 173 AILELLQK----HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + K H + + L T++ + S ++ + N++ + + +L Sbjct: 169 VVAMAHLKTKETHNQPLERKRWRYHLTTMLYDRGYSEQDILELHNFLDWLMNLPE-ELER 227 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 + + ET E+ E+ + + E Q A +L + + E +AE+ L Sbjct: 228 QLQAELETFEEARRMKYVSSLER-------RAKLEEKQAIALNMLRRNLDMELIAEVTGL 280 Query: 289 PLAE 292 +AE Sbjct: 281 TIAE 284 >UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W4R9_DYAFD Length = 293 Score = 83.0 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 43/306 (14%), Positives = 91/306 (29%), Gaps = 31/306 (10%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVE---LRELCDLNTLHLESGSFIEESLKG---HS 60 +D ++K L E DFL+ P L E Sbjct: 2 KRNDMLWKSIL--EEIFDDFLKFFFPNAEALFDMDRGFEYLDQELEQLFPPEGNAIATRY 59 Query: 61 TDVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 D L V + +L V IE Q D+ RM Y ++ + + I Sbjct: 60 VDKLVKVYCRSGAEAWLLVHIEVQGYRDETFPDRMFTYYYRICDKYRK-------PITAI 112 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 + + + F + ++E+ +L Sbjct: 113 AILTDDCRHFLPGQFEQACLGTS------VCFRFNSYKVLEQSEEELAASDNPFAQVILA 166 Query: 180 KHI-------RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 + +L L L + + S ++ + ++ E DL L+ Sbjct: 167 TKLAIKGSRFSSDELYRLKIDLAKRLLKRNFSKRKVGRLMEFLKFYVSLEDDDLDREYLK 226 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK-GMSREDVAEMANLPLA 291 + + + +EE + ++G + Q L+ + + E++A +A++ + Sbjct: 227 EVQRLFNP-EPIPMTWEETILYIVEEKGAEAAKTTVVQNLIRETNFTSEEIARLADVSVE 285 Query: 292 EIDKVI 297 + K+ Sbjct: 286 FVQKIK 291 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 48/150 (32%), Positives = 85/150 (56%), Gaps = 17/150 (11%) Query: 161 TPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGH 220 PDD+IMQHRR+A+LEL+QKHIR+RDLM L+E+L L+ +G+ + +QL A+ NY++Q G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 221 TEQA-DLFYGVLRDRETGGESMMTLAQWFE----------------EKGIEKGIQQGRQE 263 T + + V + +MT+A+ ++G++ G+QQG++E Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 264 VSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 + A + + G+ + + L ++ Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDL 150 >UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7BWQ7_9GAMM Length = 290 Score = 81.8 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 105/296 (35%), Gaps = 15/296 (5%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + HD++FK + +F + P S + + Sbjct: 3 NPKSHDSLFKWLIT--AFTTEFFGHYFPDIRIGEYTFIDKEFISKYENLKESLKGDLFLG 60 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 V++ G + + IEHQS+ + ++ R+ YS A + V I+ Y Sbjct: 61 MEVEIDGLLREIIIQIEHQSERE-DVSERVYEYSCYAWLLKKK-------PVWSIVIYTD 112 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 EA F + + ++ ++ V + D I +H + L L+ RQ Sbjct: 113 EAVWRKPVTEQFWYAFDSQKGKQYHHFDVIKVKAEKSSDL-IQKHSLMCKLLALKADDRQ 171 Query: 185 RDLMLLLEQLVTL--IDEGYTSGSQLVAMQNYM--LQRGHTEQADLFYGVLRDRETGGES 240 D L+ ++ + + + QL+ + ++ ++ ++ D ++ Sbjct: 172 TDPEKLVYEIYRAAALMKEQLTNEQLLLIDQWVSFYKKVSEKRLDKIKKEIKMDFIETTI 231 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + KG +G +G + ++ A LL G+ E + + AEI ++ Sbjct: 232 SEHVYNQGWIKGEAEGKAEGEAKGRKKTAINLLKMGIDVEIIQKATGFSDAEIKQM 287 >UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostridium RepID=B1V1L4_CLOPE Length = 300 Score = 81.8 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 39/295 (13%), Positives = 103/295 (34%), Gaps = 11/295 (3%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D VFK+ E ++D L L ++ + + L+S ++ + + + Sbjct: 8 DFVFKRLFGAEE-SKDSLISLLNAIIKSDNPIKDIELKSPDLEKQHIGDKFCRLDIKAKT 66 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--EADHDKLPLVVPILFYQGEAT 127 + +E Q + + M R + Y L ++ L V I + Sbjct: 67 DKGE---IINVEIQVRDEYNMVQRTLYYWSKIYSDQLGASENYKNLARTVCINILNFKLL 123 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 + + +++ + + + + I L + I++ + Sbjct: 124 DNDRYHNTYRLKEITTNEELTDIEEIHFIELPKSKEIKSEEVNNIDSLLKWIEFIKEPES 183 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES-----MM 242 + +T + + + Y + E E + Sbjct: 184 ETVRILELTDESIRKAKTQLYKLSLDKKTIEQYRIREKAMYDEISALENSREKGLQEGVK 243 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + +E+G+++G +G+ + +++ A+ LLSKG+ +++A++ L ++++I Sbjct: 244 IGRKEGKEEGLKEGEVRGKLKANRKIAKNLLSKGLELKEIAKILELDENLVEEII 298 >UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B1D1_RUMGN Length = 323 Score = 81.5 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 40/292 (13%), Positives = 98/292 (33%), Gaps = 23/292 (7%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D F++ + E R F+ L + E+ D + L ++ + + V++ Sbjct: 53 DFCFQELMEDEEVRRGFIGAFLRIPPEEILD---MELLPKKLRKKYKEEKYGILDVRVRL 109 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPILFYQGEAT 127 + + IE QS R + Y + +DKL + + Sbjct: 110 REGEQ---LNIEMQSIAYDYWQERSLFYLGKMYVDQIHEGEDYDKLKKCIHVGILDFTLF 166 Query: 128 PYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDL 187 + F ++ ++++ E Q + + R R+ Sbjct: 167 EHERYYSCFHIWEDTIRDMYSDKFEIHVLELPKLAKYEYPQTELLRWAQFF--GARSREE 224 Query: 188 MLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQW 247 + +L + I + Y ++ A + +R+ + Sbjct: 225 IEVLAEKDEYIHKAYDKLEEISA-------------DEEKRLEYEERQKAIRDHRHMLAS 271 Query: 248 FEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINL 299 +G+ +G+++G+ E + E A+++L + E +AE + L ++ ++ Sbjct: 272 GRREGLREGLREGKHEHAVEMARKMLEDKLPIEKIAEYSGLSPEDVHRLEEQ 323 >UniRef50_Q9L0J0 Putative uncharacterized protein SCO4675 n=4 Tax=Streptomyces RepID=Q9L0J0_STRCO Length = 302 Score = 81.5 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 45/298 (15%), Positives = 89/298 (29%), Gaps = 25/298 (8%) Query: 6 TTPHDAVFKQFLMHA---ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 ++ H+A+ + F FL I LP + E S D Sbjct: 3 SSSHEAMHRIFQHDPGLFSRVTHFLGIDLPRPIGA-------TALPTDLTEASPVERRVD 55 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 L + G + +E Q K D Y ++ +LP + ++ Sbjct: 56 TLLRFETAER-GPFLLAVEAQGKKDPDKPASWAYYVSYLWTKY------RLPTALLVVCQ 108 Query: 123 QGEATPYPLSMCWFDMFYSPELA-RRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + P L R V P ++ + D + + + H Sbjct: 109 DHATAKWAQRAVTSGPPELPTLTLRPVVAGP---HNMPVITDPDEARADLVLASLAAITH 165 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + + +L+ L T + + + + + V D + Sbjct: 166 AAEPVVNAILKALSTALSDAPEDIAAPIVEFTAHGLGNRPARHLWRNLVAVDLSFYKSYI 225 Query: 242 -MTLAQWFEEKGIEKGIQQGR-QEVSQEFAQRLLSKGMSRED--VAEMANLPLAEIDK 295 + E+G E+G +QGR Q+ +Q+ L +G+ D + E+ + Sbjct: 226 SEEIRDEGREQGREQGREQGRAQQGAQDVLLVLEQRGLDIPDGVRTRITECGDPEVLR 283 >UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostridiales RepID=A6BF26_9FIRM Length = 366 Score = 81.5 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 50/318 (15%), Positives = 104/318 (32%), Gaps = 36/318 (11%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSV 67 D +F+ E E + L + LE+ ++ D+ + Sbjct: 56 YKDTIFRMLYHDKENLLSLYNAVNGREYTDPEKLQVVTLENAIYMGM-----KNDLAF-- 108 Query: 68 QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH------DKLPLVVPILF 121 + YL+ EHQS + + R + Y R + K+P ++F Sbjct: 109 -IMDMNLYLY---EHQSTYNPNIPLRNLFYIADEYQRLVVRKSLYSTVIQKIPTPRFLVF 164 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQH-----RRIAILE 176 Y G S Y ++++ ++M+H + Sbjct: 165 YNGTKEVEDRSEFRLSSAYENPTENPDLELRVTMLNVNDGHSSDLMEHCRTLKEYAQYVA 224 Query: 177 LLQKHIRQRD---LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRD 233 ++K+ ++D + + I+EG + L + + + LR Sbjct: 225 RVRKYAAKQDVSLEEAVTRAVDECIEEGILAEFLLKNKTEVIRVSIYEYDKEFEEKKLRK 284 Query: 234 RETG-----------GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 E + + Q E G + GI+ G++ + ++ ++ L KG S E + Sbjct: 285 AEYEAGRQDGIEIGRQDGIEIGRQDGIEIGRQDGIEIGKRILLEKIIKKKLKKGKSTEQI 344 Query: 283 AEMANLPLAEIDKVINLI 300 A+ + I KV+ + Sbjct: 345 ADELEEDINIIQKVVEKL 362 >UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQ96_CLOCL Length = 288 Score = 81.5 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 42/294 (14%), Positives = 94/294 (31%), Gaps = 27/294 (9%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVEL-RELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D VFK + +D L L L + + + + E + + V+ Sbjct: 17 DFVFKLLFGDEKN-KDLLIAFLSAVLNLPEREFVGIEILNTELFREFKEDKKGILDVRVK 75 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPILFYQGEA 126 + IE Q P + M R + Y ++ +DKL + I + Sbjct: 76 TVNGKQ---IDIEIQVLPTEFMPERTLFYWSKMYTTQVKPGDTYDKLKKCITINIVDFKC 132 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRD 186 P + + + ++I D +I + I++ + Sbjct: 133 IPLNKLHTSYHLIEDETGHKLTDILEVHFLEIPKLFDKQIEINEDDPIIQWM------EF 186 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQ 246 L + ++ ++ E S + + + + R E+ Sbjct: 187 LDGKSKGVMEMLAEKNESIKKAYNLLKIISK----------DEKARMIYEAREA----EL 232 Query: 247 WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 + I ++G E + A++++ +G S D+ E+ L +I ++ N + Sbjct: 233 RDQLTRIRSAEEKGANEKALRVAEKMIKRGDSINDIIELTELSKEKILELKNKL 286 >UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKW0_9CYAN Length = 296 Score = 81.1 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 42/314 (13%), Positives = 99/314 (31%), Gaps = 39/314 (12%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 P+ D K+ L + L L LR+ + ++ + E K + D Sbjct: 2 PPTHIRFDWAIKKLLRNKAN-YGVLAGFLSELLRKPITIQSILEGESNQQAEDDKLNRVD 60 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPL------- 115 +L N ++IE Q+ ++ RM+ + + LE + Sbjct: 61 ILA-----ENDRGELILIEVQNSTEQDYFHRMLYGTSRLITDFLEKGEPYGNVKKVYSVN 115 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELAR-------------RVYNSPFPLVDITITP 162 +V QG+ Y ++ + + +L + ++ + Sbjct: 116 IVYFSLGQGDDYIYHGTLEFRGLHLDDKLGLSINQRKLFNSQDVYEIFPEYYVIKVNNFN 175 Query: 163 DDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTE 222 E+ + L+K Q+ + ++ + + + + Sbjct: 176 --EVASDTLDEWIYFLKKS-----------QIKEEFTAQGLAEAKENLLVDSLSEAERAN 222 Query: 223 QADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 + S E+G+++G++QG+Q+ A+ L +G + + Sbjct: 223 YLRFMENRRYEISLIESSRSEGRLEGLEEGLKEGMEQGKQQEKVNIARLLKQQGTDLDTI 282 Query: 283 AEMANLPLAEIDKV 296 L EI+++ Sbjct: 283 TAATGLTREEIEEL 296 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 81.1 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 43/127 (33%), Positives = 66/127 (51%), Gaps = 9/127 (7%) Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQA-DL 226 +H +A+LEL+QKHIRQRDLM L+EQ+ L+ GY + Q+ + NY+LQ G + D Sbjct: 13 RHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFNDF 72 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 GV ES+MT+A+ Q+G Q + A+ +L G+ D+ Sbjct: 73 IDGVAERSPKHKESLMTIAERLR--------QEGEQSKALHIAKIMLESGVPLADIMRFT 124 Query: 287 NLPLAEI 293 + E+ Sbjct: 125 GVSEEEL 131 >UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366FA Length = 342 Score = 81.1 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 50/326 (15%), Positives = 96/326 (29%), Gaps = 43/326 (13%) Query: 10 DAVFKQFLMHAETARDFLE-------IHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 D K L DF+ L + L + + + D Sbjct: 8 DYYMKILLEDRARFADFINVNVFHGKQVLAADKLSLLPNEAGIVVVDADGVKRTIQRRRD 67 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEA-------------- 108 V+ + + V E+Q K MA R M Y + Sbjct: 68 VVMKAEF--GAYFCVVASENQGKVHYGMAVREMMYDALDYTEQIRKIEEKHRAEGDKLEG 125 Query: 109 --------DHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPE------LARRVYNSPFP 154 D+L VV + Y G + M E + + + + Sbjct: 126 ADFLSHVTKADRLIPVVTLTLYYGNEAWDGPRSLYEMMGIDEEWEETALVKKCLPDYKIN 185 Query: 155 LVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 L+DI + + + L++ + ++ L I+ Sbjct: 186 LIDIREGEKLDQYKTSLQHVFGLVKYNKNKQKLYEYTRVHREEINRMDRESKA-----AA 240 Query: 215 MLQRGHTEQADLFYGVLRDRET-GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLL 273 + G ++ R+ E +++ L E +G +GI G ++ F ++ Sbjct: 241 LALIGEQKRLQKILESKREEEMDMCQAIDELIADGEVRGEVRGILMGMEKTKINFIRKQY 300 Query: 274 SKGMSREDVAEMANLPLAEIDKVINL 299 K +S +A + +L ++KVI L Sbjct: 301 KKQLSSSQIANILDLDERYVEKVIKL 326 >UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptospira interrogans RepID=Q8F560_LEPIN Length = 278 Score = 81.1 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 35/289 (12%), Positives = 89/289 (30%), Gaps = 21/289 (7%) Query: 12 VFKQFL-MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQ 70 +FK + L L + + + + + S + + Q + Sbjct: 2 MFKILFVKEPDLLISILNSVLFTDGEHTI--RNIKILNPELVGSSPNDKRSYLDIRAQDE 59 Query: 71 GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPILFYQGEATP 128 +E Q R + Y + L + L V I + P Sbjct: 60 DGK---IFHVEIQVAHQSSFVKRSLYYLSGLIRDQLNRGSMYSDLKPVYQINIVDFDLIP 116 Query: 129 YPLSMCWFDMFYSPELARRV-YNSPFPLVDITITPDDEIMQ-HRRIAILELLQKHIRQRD 186 F + + +++ ++ + + I + KH + + Sbjct: 117 SENFHSKFKFREESNPDIILTDDVEIHFLELCKFVKRDVRELRNNLEIWLYVLKHTSELE 176 Query: 187 LMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQ 246 E++ L+D+ ++ Y + + ++ + LA Sbjct: 177 E----EEMRILVDKTPDLSKAFTILEQY------SNDPQKRNELEAKLKSDRDYAYDLAA 226 Query: 247 WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 FE ++ GI++G ++ + A+++L +GM + + + L ++ Sbjct: 227 RFEAGELQ-GIEKGAEKEKLKSARKMLEEGMRLDVILRITGLSKKDLKD 274 >UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZPJ4_9SPHI Length = 302 Score = 81.1 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 54/302 (17%), Positives = 114/302 (37%), Gaps = 30/302 (9%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + +D +FK+ + E + +L +E+ E ++ D L Sbjct: 19 SNQYDKIFKENIG--EHFLSLSKTYLGIEVASS--------EELKDKLQTTLEREADFLR 68 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGE 125 + + + +E QS ++ MA RM Y ++ KLP+ +++ + Sbjct: 69 KITTPKGE-QMIIQLEFQSTDEQGMAERMQLYFAILRQKY------KLPIRQFVIYVGSK 121 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + ++F EL V T + +I + +A+L Q+ Sbjct: 122 PPKMRTRLKPEEVFTGFEL------LDLRQVSYTQWLESDIPEEVLLAVLGDFQQKKVST 175 Query: 186 DLMLLLEQLVTLID------EGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 L ++ ++V LID + + ++N +++ T + + Sbjct: 176 VLKQIISKIVKLIDDPGTLQKYIRQLATFARLRNLVIETEQTLEYMGLTYDIEKDVFYQR 235 Query: 240 SMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-MSREDVAEMANLPLAEIDKVIN 298 + Q EKG ++GI++G + + LL G M E+VA +A L + ++ K+ + Sbjct: 236 GVKKGQQEGIEKGHQEGIEKGITQGVVKMVIALLKSGKMPLEEVARIAELSVIDVQKMAD 295 Query: 299 LI 300 I Sbjct: 296 QI 297 >UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QZQ8_BRAHW Length = 309 Score = 80.7 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 40/301 (13%), Positives = 95/301 (31%), Gaps = 22/301 (7%) Query: 6 TTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 +D + + A +F+ + + + I E+ + Sbjct: 21 NRINDYFIRYLFSHTGNENIALNFINAVFKD--LNFETFQKIEILNPFNIAENYDEKESI 78 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVVPIL 120 V + + V+IE QS+ ++ R + Y L +D+L V I Sbjct: 79 VDIKATTESG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDELKPTVSIN 135 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITI-----TPDDEIMQHRRIAIL 175 T + + + V++ + E + + + Sbjct: 136 ITNFILTDEDKVHSCYILKELNNNKILTDHCQLHFVELPKSNLKNISEIESLDNTHKEFI 195 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 ++ + + M +L + T+ +E V M + E F + Sbjct: 196 SWVKF--FKGEDMSILMKENTIFEEVERKCRTFVNDSPVMDKYKKREVDTYFLNKSMEL- 252 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 + + ++GI++GI++G +E A+ + + +++ L + EI K Sbjct: 253 ----DIRKAKEEGIKEGIKEGIKEGIKENQISMAKNMKKDKVDFNIISKYTGLSIEEIKK 308 Query: 296 V 296 + Sbjct: 309 L 309 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 80.7 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 64/342 (18%), Positives = 109/342 (31%), Gaps = 59/342 (17%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSF----IEESLKGH 59 T +D ++K+ + E +F+ + +L E D + I+ Sbjct: 15 PGTDYDGLWKKIIS--ELFEEFI-LFFAPDLYETIDFGKGIVFLEQELHKVIIKHKKGKR 71 Query: 60 STDVLYSVQMQGNPG-YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVP 118 D + V ++ Y+ + IE Q K D + RM Y R E + Sbjct: 72 IADKIVKVSLKNGEEKYVFIHIEIQEKQDPDFSKRMFTYFYRLFDRFQEN-------IYS 124 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 I + S + FY EL R F DI IA+L + Sbjct: 125 IAILTDLSKSNN-SEPFQYSFYGTELTYRFNTYKFNEADIPSLKKS--TNPFAIAVLAGI 181 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGY--------TSGSQLVAMQNYMLQRGHTEQADLFYGV 230 H+ +++ E L+ E L +Y+L L + Sbjct: 182 YLHLTEKNYQKRYEVKKKLLKEFILSNQNLSSNYAEALCYFIDYLLYLPGELTKQLTKEL 241 Query: 231 LRDRETG--------------------------------GESMMTLAQWFEEKGIEKGIQ 258 E + + + +E+GIE GI+ Sbjct: 242 FIHIEKEANHMLYSEELKEAPTFAEYLKTVKEEGIEIGIEKGIEKGIEKGKEEGIEIGIE 301 Query: 259 QGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 +G+ E + A LL +G S E VA+M L + E+ K+ + Sbjct: 302 KGKMEEKRNLAAELLREGFSVEKVAKMVKLSIDEVKKIKKCV 343 >UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacteria RepID=A8F2U7_RICM5 Length = 281 Score = 80.7 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 43/301 (14%), Positives = 99/301 (32%), Gaps = 30/301 (9%) Query: 1 MDAPSTTPHDAVFKQFLMHA-ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M +D FK+ + + + + DL+ + E E + Sbjct: 1 MQRYLDGTNDIAFKKLFSDKVKLINLLNSLLRLSKGDRIIDLSYITTEQLPLFLEGRRS- 59 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD---HDKLPLV 116 L+ ++++ G ++ IE Q K +K R Y ++ D LP+V Sbjct: 60 ----LFDLKVKDETGRWYI-IEMQRKMEKDYLNRTQLYGCYTYVSQIKKGMKHKDLLPVV 114 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILE 176 + + + + + S + +++ + +++ +++ Sbjct: 115 IISIIRAKALPDELPYISYHHIKESNIHKQYLFSLTYVFIELGKFKKNDLKDDTDE--WL 172 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 L K+ Q + ++ Y S Q E D F + ++ Sbjct: 173 YLLKYASQEQEPPKEIKN-EIVLSAYASLEQYKW--------TEQEHDDYFRAEMAIQQE 223 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + E+ G+++G ++ E A+ +L + E +A L + EI K+ Sbjct: 224 IDK---------FEEKFNAGMEKGIEKEKIETAKEMLIENGPIEQIARYTKLTIEEIKKL 274 Query: 297 I 297 Sbjct: 275 K 275 >UniRef50_B4VZ11 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZ11_9CYAN Length = 333 Score = 80.7 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 55/302 (18%), Positives = 104/302 (34%), Gaps = 30/302 (9%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI----EESLKGHST 61 + +D+ +K+ + R+FL P + E D E Sbjct: 2 SEYDSPWKESIS--LYFREFLSFFYPR-IEEDIDWERGFEFLDTELQQIKRETETGRRDA 58 Query: 62 DVLYSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 D L V + G ++ V +E QS+ + + RM Y R+ + VV + Sbjct: 59 DKLVKVWRRSGEEEWVLVHVEVQSQRQSEFSERMYLYHSRIFDRYRRS-------VVSLG 111 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL--- 177 E + + Y EL FP+V + DE+ + + + Sbjct: 112 ILGDEQPGWRPNR------YERELWGCRAILEFPMVKLLDYSMDELARSQNPLAAIVQAH 165 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTS---GSQLVAMQNYMLQRGHTEQADLFYGVLRDR 234 L + +D+ + E ++LI Y +V + + + + + Sbjct: 166 LSAQVAGKDVGVGYESKLSLIKSLYERGYGREDIVQLFRLIDWFIALPKREEERLWQEIQ 225 Query: 235 ETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEID 294 E M E GI KG++QG Q+ QE R+L E++ + + +I+ Sbjct: 226 TLEEERKMPYITSIERIGIRKGLEQGLQQARQEDIVRILELRF--EEIPQKLRGLIGKIE 283 Query: 295 KV 296 + Sbjct: 284 AL 285 >UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CRG1_9FIRM Length = 356 Score = 80.7 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 46/318 (14%), Positives = 94/318 (29%), Gaps = 38/318 (11%) Query: 17 LMHAETARDFLEIHL--------PVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 L + D L P L + + L+ + +K D++ + Sbjct: 42 LKDTKRFADLFNAILFQGKAVILPENLYPSPETTAVSLQDTQG-KNVVKKQYRDII--MN 98 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE-------------------AD 109 Q + + +E Q+ ++M Y + Sbjct: 99 WQDQALLMLLAVESQTAIHYAAPLKVMLYDSMEYAEQVRVKWKERPPRLSSAEFLSRFQK 158 Query: 110 HDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPE-------LARRVYNSPFPLVDITITP 162 +DKL V+ ++FY G + + MF + + + N LVD+ Sbjct: 159 NDKLIPVITLIFYYGTEE-WDGPLELHQMFDLGTEKKHAELMKKYLPNYHINLVDVRRLK 217 Query: 163 DDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTE 222 + E Q I +LQ + L + + + Q Sbjct: 218 NLESFQSDLQIIFGMLQYSQDKYALRTYVANHKDYFQKLDLETYHALGAFLNSRQLMEIN 277 Query: 223 QADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV 282 L + + Q E+G GI +G +E A ++ G S + + Sbjct: 278 VEKNEREELDMCKALEDIYNDGVQDGMEQGRRSGIAEGEASHKKEVAFQMQKLGYSLDAI 337 Query: 283 AEMANLPLAEIDKVINLI 300 A + + I +++ ++ Sbjct: 338 AAVLRESVDGISQILAVV 355 >UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Ruminococcus torques ATCC 27756 RepID=A5KR99_9FIRM Length = 317 Score = 79.9 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 44/312 (14%), Positives = 97/312 (31%), Gaps = 26/312 (8%) Query: 3 APSTTPHDAVFKQFL----MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG 58 + ++VF L E + +++ ++ Sbjct: 11 KENREIKNSVFVDLFYEDESAEANEIALFNAIHDEPLPEGTKIRRFRVDNTIYM-----N 65 Query: 59 HSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL------EADHDK 112 D+ + G + V EHQS ++ M R + Y A R + + Sbjct: 66 FQNDISFDAG-----GKVIVFGEHQSTINENMPLRSLLYIGRAYERLVPPRSRYKKKIVP 120 Query: 113 LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI 172 LP FY G+ Y + +++I EI++ ++ Sbjct: 121 LPTPEFYTFYNGKEKWEKEKELRLSDAYIVKDGEPSLELKVKVINIRPEEHHEILEKCQV 180 Query: 173 -----AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 +E++Q + + + + I++G + + + + Sbjct: 181 LKEYSQFMEIVQNYQISGEEEPYKKAIKECIEKGILADYLMRKGSEVVNMLLDEYDYETD 240 Query: 228 YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 V R+ E + + E+G ++G ++GR+ Q+ L KG + +A+ Sbjct: 241 IEVQRE-EAREQGREEGRKQGREEGRKQGREEGRKAERSTLIQKKLEKGKTISQIADELE 299 Query: 288 LPLAEIDKVINL 299 I +I Sbjct: 300 DTEENIACLIEQ 311 >UniRef50_C8NHS0 Putative uncharacterized protein n=1 Tax=Granulicatella adiacens ATCC 49175 RepID=C8NHS0_9LACT Length = 278 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 34/300 (11%), Positives = 86/300 (28%), Gaps = 27/300 (9%) Query: 4 PSTTPHDAVFKQFL---MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 +D +FK+ + ++F+++ ++L + N ++ + Sbjct: 3 KILPTNDLMFKKMMTSEGKEYILQNFIQVVTGMKLSNVKPTNPYQIQKYRENLAGVNLEM 62 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 + + G + ++IE Q + R+ Y + A H V+ I+ Sbjct: 63 YQTIVDIAATTEEG-IDIIIEMQLYKHRGFFERIRYYMASTYMDSYSAGHQTYKPVISIV 121 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQK 180 LV++ + + + + L+ Sbjct: 122 VTDFSVFKEDPE----------------PRVEIGLVNLEKNREVLNEKGQPFERVYLVNL 165 Query: 181 HIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGES 240 + + + G + +Q+ + E E Sbjct: 166 ATTLPNQDEAFNEWRNFLKNGTITAKASKEIQDAYAVVDFYNLDSEEMKMAEQMEKYEEV 225 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 ++ +E E G+++G+ L + ++ + L EI K+IN + Sbjct: 226 YWKTIEYAKETAREAGLKEGQ-------VLAFLKMNLPITEIIKHTGLSEEEIQKIINTL 278 >UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2D99 Length = 308 Score = 79.5 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 41/275 (14%), Positives = 93/275 (33%), Gaps = 19/275 (6%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS--TDV 63 T HD FK ++ R L+ P E + + D + ++ L DV Sbjct: 1 PTSHDQNFKNLILD--YPRQALQFFAPDEAKNIDDSAVITPIRQEQLKNRLGDRFYELDV 58 Query: 64 LYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 V+ + ++E ++ P + R++ Y L + +P+V+ + Sbjct: 59 PLKVEWPDGRHAAMLFLLEEETDPARFSIHRLVSYCANLAE--LMGTNRVVPIVIFL--- 113 Query: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI-AILELLQKH 181 S + + + + P ++ I A + L H Sbjct: 114 -------RSSPDIRRDLHLGVDGVNFLSFHYIACVLPDIPAEQYKDSTNIVARIALPTMH 166 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 + ++ ++ + +D +G + + +++ E + + +++ Sbjct: 167 YAREQVIDVMAWALRGLDTLEANGDKRIKYLDFIDTYSQLEDNERQL-FKQRYPQEEKTV 225 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG 276 ++ Q +GI +GI QG QE Q +G Sbjct: 226 TSIVQRAIHQGIHQGIHQGIQEGMLMGRQEGRQEG 260 >UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BNN6_9LACT Length = 302 Score = 79.5 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 51/320 (15%), Positives = 106/320 (33%), Gaps = 39/320 (12%) Query: 1 MDAPSTTPHDAVFKQFL---MHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK 57 M T D +FK+ + DF+E ++L+ + N +E+ E+L Sbjct: 1 MKIKPTN--DLLFKKMMTTAGKEYILEDFIEAVTGMKLKNVRPANPYQIETYQKTIENLN 58 Query: 58 GHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVV 117 + V G + ++IE Q K R+ Y A ++ +A+ K ++ Sbjct: 59 PVMYSTIVDVAATTEDG-MEIMIEMQLYQHKDFFERIFNYMATAYTQNYKAETAK--PII 115 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 I+ P F + + N + + + + RI ++ L Sbjct: 116 SIVVTNFTVFPE---------FQEARIEIGLTNFAYY----QEIRNRKQQPYWRIYLVNL 162 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 K I + + + G ++ + A + E Sbjct: 163 TDKAIVNGE-SRDFSEWRDFLKNGTIKPKSSRGLKEAQKIVNFSNLAGEERRLAELMEKY 221 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-----------------MSRE 280 + + + E+G+E+GI+ GRQ+ +R + KG + E Sbjct: 222 EDVYYQVMKHQLEEGLEQGIEIGRQQGVALGEKRGMEKGVALGERKGQVMICFKMNLPIE 281 Query: 281 DVAEMANLPLAEIDKVINLI 300 ++ + L + EI+ + Sbjct: 282 EIQKHTGLSIEEIEAFRKEM 301 >UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C371D2 Length = 317 Score = 79.5 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 47/322 (14%), Positives = 98/322 (30%), Gaps = 47/322 (14%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLK--------GH 59 DAV K ++ +E D L R++ L + I Sbjct: 3 DKDAVTKDYMQDSEHFADAFN-FLLYGGRQVIKPEQLKPLDTTSIALPYGDESRFVPIQK 61 Query: 60 STDVLYSVQMQGNPGYLHVV--IEHQSKPDKKMAFRMMRYSIAAM--------HRHLEAD 109 DVL V + +++ IE+QS M R M Y H ++ Sbjct: 62 YRDVLKMVTAMEDENATYLILGIENQSDIHYAMPIRNMLYDAIQYVNQADTIAKEHRKSK 121 Query: 110 H---------------DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFP 154 D++ ++ + Y G + + ++ + V N Sbjct: 122 KMPETRAEYLSGFYKTDRILPIITLTLYFGADEWDAPRDLHSMLTANEDILKFVDNYHLH 181 Query: 155 LVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY 214 L+ D++ + L L K+++ L +V + + M N Sbjct: 182 LIAPAEIEDEDFAKFHTE--LSLALKYVKYSKDKKKLRDIVNEDTAFRSVSRKTADMVNV 239 Query: 215 MLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLS 274 + + + M + + + +G +G +E L+ Sbjct: 240 VTSS----------NLHYNDGEERVDMCEAIEEIRKDALAEGKAEGIEEGIIRTLIGLVK 289 Query: 275 KG-MSREDVAEMANLPLAEIDK 295 G ++ D A+ A++ + E ++ Sbjct: 290 DGILTIADAAKRADMTVPEFEE 311 >UniRef50_C9RP54 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RP54_FIBSS Length = 312 Score = 79.5 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 89/291 (30%), Gaps = 28/291 (9%) Query: 10 DAVFKQFLMH---AETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D +FK + + E FL L + + TL + + ++ Sbjct: 35 DGIFKMLIANEAKPERTVKFLNAMLGLTGDKAIKTYTLGVPENPGVLNDKTA-----IFD 89 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK-LPLVVPILFYQGE 125 + G V+IE Q + R++ Y+ + R ++ D LP + + Sbjct: 90 IYGTTQAGEP-VLIEVQQNFNTLFVDRLIYYTARVISRTVKKAQDYNLPHIYVLSILTEN 148 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 P LV++ E L ++ Q Sbjct: 149 QFPRERDTYLHHAQLVRNRHLFYSKLDIYLVELEKFFAIEDRT---------LPENREQS 199 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 D +L ++++ +L + + + + E G + M Sbjct: 200 DRAEMLRIFRDVLEDKDIPEEKLKRLLD--KDFANDVSFKGYTDETLLNEVDGMTDMLYE 257 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + QG+ + E A +L++G S E +A + L ++ K+ Sbjct: 258 -------KQGSYLQGKDDERNEIAIAMLAEGDSIEKIARVTKLSENDVRKL 301 >UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B8HNA0_CYAP4 Length = 315 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 89/262 (33%), Gaps = 24/262 (9%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + FL E+ + L E S++ D L ++ + L + + Sbjct: 4 DNICKFLAESFSTEVATWLLGERISLFKLEPTELSVEPIRADSLILLEAED----LILHV 59 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E Q+ PD M RM+ Y + + R + +V + Y L + Y Sbjct: 60 EFQTGPDADMPLRMLDYRVRLLRRSPQK------VVRQFVIY--------LRQTTSVLVY 105 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE 200 EL F +V + D ++ R + +L + + + Q ++ I+ Sbjct: 106 QTELQLESTWHEFNVVRLWECSTDPLLASRGLLPFAVLGQTSNPEATLAQVAQRLSTIEN 165 Query: 201 GYTSGSQLVAMQNYMLQRGHTE------QADLFYGVLRDRETGGESMMTLAQWFEEKGIE 254 + A + + ++ L + E M + +GI+ Sbjct: 166 RTEQSNLTAASAILAGLVLDQQTIQRLLRREIMRESLFYQGILEEGMQKGVERGIAQGIQ 225 Query: 255 KGIQQGRQEVSQEFAQRLLSKG 276 G++QGRQE ++ Q +G Sbjct: 226 LGLEQGRQEGLEQGRQEGRQEG 247 >UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXQ3_9FIRM Length = 290 Score = 78.8 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 39/312 (12%), Positives = 107/312 (34%), Gaps = 43/312 (13%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTD 62 + D +F+ E L + V + D++ + + + S + D Sbjct: 8 NANREYKDRLFRFVFGAEENKAYLLSLCNAVSGTDYTDVDDIEITTLS--DAIYIKMKND 65 Query: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM---------HRHLEADHDKL 113 + + + Q + EHQS + M R M ++ + L Sbjct: 66 ISFLIDSQ------MNLFEHQSTFNPNMPLRGMECFAELYGIYIIENNLDIYVSSLQKIL 119 Query: 114 PLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ----- 168 +++ E P + + D F P+ + +++I + ++++ Sbjct: 120 TPRYYVIYNGTEKQPDVVKLKLSDAFQVPDDSGEF-EWTATMLNINYGHNRKLLEQCQPL 178 Query: 169 HRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY 228 + ++L++++ +L +++ V E G+ L ++ + TE + + Sbjct: 179 YEYAHFIKLVREYSEAMELKKAIDKAVEKAREWKCIGTFLYQCKSEVSVMLLTEFDEKKH 238 Query: 229 GVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 + I+ G ++GR++ + +L+ +S E +A+ + Sbjct: 239 E--------------------DNLIKLGEKEGREKERMKNICSMLALSLSPEIIAKACEV 278 Query: 289 PLAEIDKVINLI 300 + + + + Sbjct: 279 SVDYVLNLKKEL 290 >UniRef50_B0MQP0 Putative uncharacterized protein n=2 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQP0_9FIRM Length = 289 Score = 78.4 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 43/284 (15%), Positives = 95/284 (33%), Gaps = 28/284 (9%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCD-LNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D +FK+ E + L+ +L L D + L + + + +S+ + + ++ Sbjct: 27 DIIFKKLFTD-EGNQHLLQAYLSDTLGIPYDSIENLVVLNSEIMPDSITEKYSRMDIRMK 85 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPL-----VVPILFYQ 123 G + +E Q K + R + Y L++ L + I F Sbjct: 86 ANGR----LINVEMQIKDEGDYKDRSLYYLSKLYSGQLKSGEVYGSLNQCISINIINFNL 141 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 + Y S + + +L + F L I D Q + ++ + Sbjct: 142 FDCEKYHSSFSMREDSRNEQLTDKFTAHYFELKKIGKNIDKNNKQELWLRLI-----NAE 196 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMT 243 D + +L+Q + + +Q+ ++ + + RE T Sbjct: 197 TEDELDMLQQ------------TGVKQIQDAVVVLHKMSADEKTRELAEMREKALHIEAT 244 Query: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 +G G+++G + E ++ G+S E + + N Sbjct: 245 EKAHARAEGEAVGLKKGEKRKEAEMISKMRKSGLSEEQIKAILN 288 >UniRef50_C9LUC8 Putative uncharacterized protein n=5 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LUC8_9FIRM Length = 325 Score = 78.4 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 95/304 (31%), Gaps = 37/304 (12%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + D VF+ + E ++ E L + + L Sbjct: 43 EKKGRQYQDTVFRMYFNEEERLKEVAGALHGRSYEE----EPLKIVTLE--GTFLSQIKN 96 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH------DKLPL 115 D+ + + G + +EHQS ++ M R + Y + +++ A KLP Sbjct: 97 DISFLL-----AGRHLIFMEHQSTANQNMPLRCLYYVCEQLRQYIPAKKLYQNTPIKLPA 151 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 +FY G Y +I Sbjct: 152 PEFHVFYTGNNDMPETCQMKLSDAYVKTDEEIHLELKANFHNIAYDNAK----------- 200 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 LLQ+ D + ++ + G + Y E++D+ L+ E Sbjct: 201 ILLQRSRSIHDYSFFIARIKRNMAAGMERAQAIREAMRY------CEESDIMKEFLQQHE 254 Query: 236 TGGESMMTLAQ---WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 M+ FEE +E+G+++GR+E + +L + E +A ++ + Sbjct: 255 REVVDMVNFEWNQKDFEEAILEEGMERGREEGKVDMVLEMLRDKLPLETIARISKFSMER 314 Query: 293 IDKV 296 + ++ Sbjct: 315 VQEL 318 >UniRef50_B4VQ19 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VQ19_9CYAN Length = 318 Score = 78.4 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 42/307 (13%), Positives = 82/307 (26%), Gaps = 23/307 (7%) Query: 5 STTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + D FK+ + FL + + DL + + + + LK Sbjct: 4 ISPKTDFAFKKIFGAKDSKDILISFLNALIYNANPVIQDLEIIDPYNPGDVVD-LKDSYL 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 DV + V+IE Q R++ L+ L I Sbjct: 63 DV--RAVLDNGST---VLIEMQVLNVASFEKRVIYNLTKTYANQLKYGEGYSHLKPAIAL 117 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRV----YNSPFPLVDITITPDDEIMQHRRIAILEL 177 + + + + F E V++T Sbjct: 118 TITDFQLFDQTQRFLTRFGLKEKQELFDYTDPEIELIFVELTKFNKKLEQLDNLTDKWIY 177 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 K LE + + + L E+ L D+ Sbjct: 178 FIKDAPS------LEVIPPTFRQVPELEKAMNIANQANLSVEELEKIRKREVFLEDQRGF 231 Query: 238 -GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM---SREDVAEMANLPLAEI 293 ++ +G +G +GR E + RLL + + ++ NL + E+ Sbjct: 232 IVKAKQEGRVEGRVEGRVEGRVEGRVEEGIRWTLRLLERQFGSIPPAIINQIQNLSVEEL 291 Query: 294 DKVINLI 300 + + + I Sbjct: 292 EDLGDAI 298 >UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q938_9SPIR Length = 326 Score = 78.4 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 35/304 (11%), Positives = 88/304 (28%), Gaps = 30/304 (9%) Query: 3 APSTTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 +D + + A +F+ N + + + I E+ Sbjct: 43 NNLNRINDYFVRYLFSHDGNENIALNFINAVFKD--LNFETFNKIEILNPFNISENYDEK 100 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVV 117 + V + + V+IE QS+ ++ R + Y L L V Sbjct: 101 ESIVDIKATTETG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDGLKPTV 157 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 I T + + + +++ +I Sbjct: 158 SINITNFILTDEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFNLKDI----------- 206 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG-----VLR 232 L + ++ ++ I ++ +N + + + + Sbjct: 207 ----SAIESLDNIHKEFISWIKFFKGEDMSILMKENTIFEEVEKKCLTFVNDSPVIDKYK 262 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 RE + +K E+GI++G +E A+ + + + ++++ L + E Sbjct: 263 KREVDTYFFNKSMELDIKKAKEEGIKEGIKENQILTAKNMKKENIDINIISKITGLSIQE 322 Query: 293 IDKV 296 I+ + Sbjct: 323 IENL 326 >UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria RepID=C0QZ87_BRAHW Length = 309 Score = 78.4 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 42/313 (13%), Positives = 94/313 (30%), Gaps = 24/313 (7%) Query: 3 APSTTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 +D + + + + + L E + E+LK Sbjct: 2 KEINRLNDLFVRYLIGTEGDEDILENIVNAVLNDVGFESVSNLEIINPYNLAENENLKES 61 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPI 119 DV + + ++IE Q + R++ Y + L+ + + + + I Sbjct: 62 ILDV--KAKTKDGKK---ILIEIQLIGNNNFIKRILYYIAKNISSELKENENYINISQMI 116 Query: 120 LF-------YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI 172 G + F + + ++ + ++I + Sbjct: 117 SISFLNFNLKIGSESDIKREHKCFQLSDINNSSLKLDDFQIHFIEIKRFAEILKNASIDD 176 Query: 173 AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVL- 231 L I L + + LI + + ++ + L Sbjct: 177 YNKNKLLSWIDFFTAKDLEKSINKLIGGNDIMSKVMDKYKRFVADEKEMSAYNERDTFLY 236 Query: 232 --------RDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVA 283 E E + Q ++GIE+GI+QG + + A+ L G+ + ++ Sbjct: 237 GQAAMLQYEREEGKKEGIEIGIQQGIKEGIEQGIEQGEKNKALSIARSLKKSGLDDKFIS 296 Query: 284 EMANLPLAEIDKV 296 E L + EI+K+ Sbjct: 297 ENTGLTIEEIEKL 309 >UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Roseburia inulinivorans DSM 16841 RepID=C0G0A4_9FIRM Length = 319 Score = 78.0 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 70/242 (28%), Gaps = 19/242 (7%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 + D VF+ + + DL + LE+ ++ D+ Sbjct: 53 NRNYKDTVFRMLFSDRKNLLSLYNAVNQSNYKNPEDLEIVTLENAIYMG-----IKNDLA 107 Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH------DKLPLVVP 118 + + YL+ EHQS + M R + Y + + ++ K+P Sbjct: 108 F---IMDTNLYLY---EHQSTYNPNMPLRDLFYICSEYQKLVDKKSLFSSTLQKIPAPNF 161 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI--AILE 176 I FY G + + ++++ + ++MQH + + Sbjct: 162 IEFYNGSTVISDCTELRLSSAFECLTGEPKLELIVTVLNVNEGHNADLMQHCSMLKEYAQ 221 Query: 177 LLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 + + M L E + +DE G + + + + Sbjct: 222 YVARVRHYASDMPLNEAVKHAVDECIREGILAEFLTQNRNEVISMSIFEYDKELEEKNYE 281 Query: 237 GG 238 Sbjct: 282 KQ 283 >UniRef50_B4VTF8 Putative uncharacterized protein n=7 Tax=Oscillatoriales RepID=B4VTF8_9CYAN Length = 306 Score = 78.0 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 45/284 (15%), Positives = 86/284 (30%), Gaps = 22/284 (7%) Query: 5 STTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 D FK+ + E FL L + +L ++ I + +K Sbjct: 4 INPKTDFAFKKIFGSEQNPEILISFLNSLLYGGHPRITELEIINPYLAPKI-QGIKDTFL 62 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPI 119 DV ++ V+IE Q R++ + A LE D L V+ + Sbjct: 63 DV--KAKLTDETT---VIIEMQVLNLSGFEKRILYNAAKAYSIQLEPGDDYTLLNPVIAL 117 Query: 120 LFYQGE-ATPYPLSMCWFDMFYSPELARR-VYNSPFPLVDITITPDDEIMQHRRIAILEL 177 E P + F + L + + V++ + Sbjct: 118 TLTDFEMFEDLPQVISNFVLKEKKVLTDYPINDLELVFVELPKFTKELDELETLADKWIY 177 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 K R LE + + + + R E+ + Sbjct: 178 FIKCAR------GLETIPETMAQVPEIRKAFEVANQANMTR---EELEALEQREIYIHDQ 228 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSRED 281 ++ + + G E+GIQ GR++ QE ++ + +G +E Sbjct: 229 RNAIKLALRQGIQLGREQGIQVGREQGIQEGREQGIQEGREQEK 272 >UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF92_9FIRM Length = 307 Score = 78.0 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 43/280 (15%), Positives = 90/280 (32%), Gaps = 30/280 (10%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + D +++ + E + + DL LE ++ Sbjct: 13 QTHNRQYKDRLWRMIFNNKEDLLQLYNAINHTDYQNPDDLEVNTLEDVLYLSM-----KN 67 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL--------EADHDKL 113 DV + V G L+ EH S + M R + Y ++ +L Sbjct: 68 DVSFLV---GGTMNLY---EHLSTFNPNMPLRGVFYFSRLYEGYVADNNLMIYHEKRVRL 121 Query: 114 PLVVPILFYQGEA-TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRI 172 P I+FY G P + + D F + + +++I + E+M+H R Sbjct: 122 PKPKYIVFYNGTKNQPDSMELRLSDCFENTDNDAPCLECTATMLNINYGHNQELMKHCRR 181 Query: 173 AILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR 232 L + I + + ++ +++ Q+ + +A++ +L Sbjct: 182 ----LEEYSIFVQCVREYIQS-EPSVEDALEKAIDTCINQDVLADFLKKHRAEVTNMILT 236 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 + + +E E+G ++G E E L Sbjct: 237 TYDK-----DLYEKTLKEDAREEGREEGLMEGRAETRAEL 271 >UniRef50_Q3ARU8 Putative uncharacterized protein n=12 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ARU8_CHLCH Length = 324 Score = 78.0 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 96/301 (31%), Gaps = 32/301 (10%) Query: 8 PHDAVFKQFLM--HAETARDFL-EIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 +D+ +K+ + E F L ++ + L S E D L Sbjct: 13 DYDSPWKEAIELYFPEFMAFFYPNAFLAIDWSKPYHFLDQELRSI-LPEAENGKRIVDKL 71 Query: 65 YSVQMQGNPGY-LHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 V + G L++ IE Q + R+ + ++ + P+ ++ Sbjct: 72 VQVHLLGGKERCLYIQIEVQGNREADFPRRIFICNYRIFDKYGK------PVASFVILTD 125 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITP--DDEIMQHRR----IAILEL 177 +++ P + YS E A F +V + E++ + L Sbjct: 126 SDSSWRPTT-------YSYEFAGSKMTLEFDMVKLLDFEPRIKELLASDNAFALVTAAHL 178 Query: 178 LQKHIRQRDLMLL--LEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 L + R++ L QL+ L+ + ++ + + + Sbjct: 179 LTQKTREKSFERLDAKSQLIRLLYNKQWTKERVKELFRVIDWFMELPKELEQQLQTEIYN 238 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSR------EDVAEMANLP 289 E M E +EKG +G + E ++ L +GM R E AE L Sbjct: 239 IEEEQKMKYISSIERYAMEKGWSEGMERGILEGMEKGLMEGMERGMAKGKEIGAEQTKLD 298 Query: 290 L 290 + Sbjct: 299 I 299 >UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZGR2_EUBR3 Length = 370 Score = 78.0 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 46/303 (15%), Positives = 91/303 (30%), Gaps = 38/303 (12%) Query: 12 VFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQG 71 VF + E A D+ + D + V + Sbjct: 78 VFSMLMQDKERALQLYNAMNGSSYDNPEDVEMVI-----HDGGISLSVRNDASFIVDAR- 131 Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------------KLPLVV 117 + EHQS M R + Y + L K+P Sbjct: 132 -----LSIYEHQSTVCPNMPVRSLIYFSVILSDMLSDKKKGTKSGKNIYGRRLVKIPTPH 186 Query: 118 PILFYQG-EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQ-HRRIAIL 175 ++FY G E P + D F P + +I + IM+ + Sbjct: 187 FVVFYNGEEEQPEVQELKLSDAFEKPTDEPN-LELKCKVYNINDGKNKAIMESCGWLNDY 245 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 +R+ + L I++ ++ ++ + DR+ Sbjct: 246 MTFVNKVREYHADGAFDDLAIDIEKAIDYCIDNDILKEFLKTYRSEVTKSMQLNYEFDRQ 305 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-MSREDVAEMANLPLAEID 294 E IE+G++ G ++ + + L++KG + + AE A + ++E + Sbjct: 306 LELE---------RADAIEEGMEIGIEKGANKMLFTLVTKGKLDIDTAAEEAGVSVSEFE 356 Query: 295 KVI 297 K++ Sbjct: 357 KLM 359 >UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A7T9_9CLOT Length = 271 Score = 78.0 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 36/299 (12%), Positives = 92/299 (30%), Gaps = 30/299 (10%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M D VFK + + L L L+ + ++ +++ + ++ Sbjct: 1 MKGLLDPKMDFVFKNIFGSEKNPK-ILISFLNATLKPKDLITSVEIKNTDINKNYIEDKF 59 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVP 118 + + + + + IE Q K + M R + Y L D L + Sbjct: 60 SRLDVKAKTSNDE---IINIEIQLKNEYNMIKRSLYYWSKLYSEQLGEGQDYSVLKRTIC 116 Query: 119 ILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELL 178 I + + + + ++I D + + +E L Sbjct: 117 INILNFKYLKTRKFHSGYRLKEIYSNEELTNVAEIHFIEIPKLDDGADEKDMLVNWIEFL 176 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 + + + + ++ ++ +++ + + Y + Sbjct: 177 K------------DPESETVRSLEMNIEEIRQAKDELIRMSNDDTQREIYEMRAKT---- 220 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + + + ++G Q+ +E A+ LL + E +A L + EI+K+ Sbjct: 221 -------LRDKISALNEAERKGIQQGKREIAKALLDV-LDIETIALKTGLSIDEINKLK 271 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 78.0 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 30/104 (28%), Positives = 60/104 (57%), Gaps = 4/104 (3%) Query: 1 MDAPSTT-PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 M PH+ +F + + + AR FL+ H+ E+++ DL+TL LE ++++E LK H Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 60 STDVLYSVQMQGNP---GYLHVVIEHQSKPDKKMAFRMMRYSIA 100 +D+++SV++ G ++++ EH+S PD ++++Y Sbjct: 61 YSDLVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKYMAL 104 >UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostridium RepID=B6FJ15_9CLOT Length = 310 Score = 77.6 bits (189), Expect = 5e-13, Method: Composition-based stats. Identities = 41/313 (13%), Positives = 97/313 (30%), Gaps = 41/313 (13%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 + D +F+ + + DL +E ++ Sbjct: 13 HKINKKYKDRIFRMIFHEKKELLELYNAVNNSNYTNPDDLTITTIEDVVYMGM-----KN 67 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKL-------- 113 D+ + + G + + EHQS + R + Y + ++E +L Sbjct: 68 DLSFLI------GDVMNLYEHQSSFSPNLPLRGLFYFSSLYKEYIEPVKHRLYTASPLHI 121 Query: 114 PLVVPILFYQG-EATPYPLSMCWFDMF-YSPELARRVYNSPFPLVDITITPDDEIMQHRR 171 P ++FY G + P + D+F + E +++I + + E+M+ R Sbjct: 122 PFPKYVVFYNGTKKEPERQELKLSDLFLENKEETTPSLECTAVVLNINLGKNRELMEKCR 181 Query: 172 IAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNY-----MLQRGHTEQADL 226 ++ + + + E G+ + ++ +L + Sbjct: 182 PL-----------KEYAEFISIIRKYLSEQMDFGNAVNKAVDFCIHNGILADILQKNRSE 230 Query: 227 FYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 ++ E + +G KG+ G + + + + +EDV + Sbjct: 231 VVDMILTEYDEEEFRRAWREDLLNEGFRKGLNNGLSKGIKGTIHACMKFNVPKEDVMQNL 290 Query: 287 ----NLPLAEIDK 295 +L E +K Sbjct: 291 MEEFSLSQEEAEK 303 >UniRef50_C9RMD5 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RMD5_FIBSS Length = 344 Score = 77.2 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 41/313 (13%), Positives = 93/313 (29%), Gaps = 34/313 (10%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 DA FK FL E +FL + + + + I S K D+ + Sbjct: 38 DAAFKAFLSDEEALVNFLNGVFHLNEDNKIESVVIKNSEINIIFPSAKQFRLDI--RAKT 95 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAM----------------------HRHLE 107 + + IE Q R++ A M R Sbjct: 96 SKG---ICINIEMQKARPDYFVDRVLLQQSAFMLQSKYEWDKLNFGDLPSCLTKEERAER 152 Query: 108 ADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIM 167 H ++ + S + + + L D+T Sbjct: 153 EIHRYEVPPTYAIWICDFSIGKQKSFRGDWAVRNKKGLTLTDKMMYILYDLTKFNKPYKK 212 Query: 168 QHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF 227 L K+ + + + + + + N ++ EQA+ Sbjct: 213 ITTTEDRWLYLLKYA-------GKAENLPDFNNSIIAKAINRILVNRASEKLIREQANDM 265 Query: 228 YGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMAN 287 + + + + ++G+++G++QG ++ E A +L+ + + ++ Sbjct: 266 VWTEEELDHLALLEVRAEKKGLKQGLKQGLEQGLEQGRVEMALAMLADNEPIGKIVKYSH 325 Query: 288 LPLAEIDKVINLI 300 LP ++I ++ + Sbjct: 326 LPESKILELKASL 338 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 77.2 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 86/254 (33%), Gaps = 15/254 (5%) Query: 46 LESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRH 105 L D+++ ++ + +E Q+ ++ R + Y + R Sbjct: 41 LIPSVHPAVEANETRNDIIFLLEDD-----TLLHLEFQTTAGEQDLKRFLYYDARLVRRQ 95 Query: 106 LEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDE 165 H +V +A + + + + + + + Sbjct: 96 ERKVH----TIVIYSGRIEQARERLECGSILYQVENIYMKHYNGDQEYNRLK-HKIDNHQ 150 Query: 166 IMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQAD 225 ++ L L ++ L Q L ++L A+ ++ Sbjct: 151 LLSETDTLKLIFLPLMKSEQKEEELAIQAAELAKAAPDEKTKLFAIAALIVITDKIMSES 210 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 +L + M + QW E+G ++G +GR++ +E AQ +L+ GMS E +A+ Sbjct: 211 NKRKLLEVLK-----MTQIEQWIREEGRQEGELKGRRDEKRETAQTMLNLGMSPELIAKA 265 Query: 286 ANLPLAEIDKVINL 299 LPL EI ++ Sbjct: 266 TKLPLEEILEMAKA 279 >UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1Q2_EUBE2 Length = 321 Score = 77.2 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 46/332 (13%), Positives = 97/332 (29%), Gaps = 51/332 (15%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEI-------HLPVELRELCDLN-TLHLESGSFIE 53 +T D K F E D L + D + + + S S+ E Sbjct: 4 SNRTTHQKDVSLKTFWRDNEHFADLFNATVFNGKQVLKPDKLTEMDTDVSATIHSKSYNE 63 Query: 54 ESLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE------ 107 + DV+ + + + +E Q K M R M Y + Sbjct: 64 SITRNR--DVVKKM--SDGVEFNILGLEIQDKTHYAMPLRTMTYDALGYIKEYNDIKKHH 119 Query: 108 -----------------ADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPE-LARRVY 149 D+ ++ ++ Y GE+ + M P+ + Sbjct: 120 KLNKDSFSSHEEFLSGINKSDRFHPIITLVLYYGESLWDGPTCLSDMMISMPDNIKAYFS 179 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLV 209 + LV I D + + + + ++I D + + Sbjct: 180 DYKLNLVQILD-SDKYTFYNEDVRDVFNIIRNIYNDDFDSIYRE-----------YESRN 227 Query: 210 AMQNYMLQRGHTEQADLFYGVLRDRETGG-ESMMTLAQWFEEKGIEKGIQQGRQEVSQEF 268 + M + + D E GG +M + F+ + KG+++G Sbjct: 228 VDIDVMELICNITSVPKLMDLCTDTEQGGTVNMCEAMKRFQAECESKGMKEGIDSEKVNS 287 Query: 269 AQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 +L G+++E + + ++++ I Sbjct: 288 IISMLEFGITKEQI--LTRYTKEDLERAEAAI 317 >UniRef50_Q8YK35 All8083 protein n=6 Tax=Cyanobacteria RepID=Q8YK35_ANASP Length = 313 Score = 76.8 bits (187), Expect = 7e-13, Method: Composition-based stats. Identities = 50/315 (15%), Positives = 98/315 (31%), Gaps = 42/315 (13%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT----LHLESGSFIEE-SLK 57 + +D +K+ + FL P E++ D L E I+E + Sbjct: 2 SEVRADYDGAWKE--GVEQYFEAFLAFFFP-EIQAEIDWERGYEFLEQELQQLIKESEVG 58 Query: 58 GHSTDVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D L V ++ +L + +E QS+ D RM Y R+ + V Sbjct: 59 KQFVDKLIKVWLKDGKETWLLIHLEIQSQVDPNFTKRMFSYHYRIFDRYNQE-------V 111 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDE-IMQHRRIAIL 175 V + + YS + FP+V + ++ Sbjct: 112 VSLAILGDNQANWRPQE------YSYGRWGCRLSLQFPIVKLLDYESRWSELEQSDSPFA 165 Query: 176 ELLQKHIR----QRDLMLLLEQLVTLIDEGYT------SGSQLVAMQNYMLQRGHTEQAD 225 L+ H+R +DL L+ ++LI Y Q+ + + ++ + Sbjct: 166 VLVMAHLRTQATTQDLTGRLQWKLSLIKRMYEVGYSRDKIQQIFRLLDRLMTLPPELDLN 225 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 + R + MT + G Q QE E + E + ++ Sbjct: 226 FQAELERFEAEQEMTYMTSIERI-------GRAQTLQESITEV-LETRFNNVPPELIEQL 277 Query: 286 ANL-PLAEIDKVINL 299 + L + +++ Sbjct: 278 KKIYELDRLKQLLKQ 292 >UniRef50_C9LT45 Putative uncharacterized protein n=2 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LT45_9FIRM Length = 374 Score = 76.8 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 84/261 (32%), Gaps = 24/261 (9%) Query: 49 GSFIEESLKGHSTDVLY--SVQMQGNPGYLHVVIEHQSKPD-----KKMAFRMMRYSIAA 101 I + D+L+ V G L + +E Q + R + Y+ Sbjct: 118 TENIGITEGWVRFDILFHARVPQSGERITLIINVEAQRTQKRAKLGYALLRRAVYYASRL 177 Query: 102 MHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITIT 161 + E + Y Y + + SP+ + I Sbjct: 178 ISSQKETE-------FTGSSYDEIKKVYSIWL----CMDSPDGRSAINRYDLAEHHILHH 226 Query: 162 PDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYT----SGSQLVAMQNYMLQ 217 + + ++I+ + + RQ+D L+ L L + L + + Sbjct: 227 HKGKRADYDLMSIITIYLGNERQQDEDWLIRFLQILFKDMEISPAAKKQLLKNEFDMDIS 286 Query: 218 RGHTEQADLFYGVLRDRETGGES--MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSK 275 E+ + G M + E+G+E+G+++GR+E + +L Sbjct: 287 ADIEEEMRTMCNLSTGIYEQGMERGMERGMERGMERGMERGMERGREEGKVDIVLEMLRN 346 Query: 276 GMSREDVAEMANLPLAEIDKV 296 + E +A M+ L ++ ++ Sbjct: 347 KLPLEMIASMSKFSLEKVKEL 367 >UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=Q24Y19_DESHY Length = 248 Score = 76.8 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 71/246 (28%), Gaps = 25/246 (10%) Query: 78 VVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFD 137 + IE Q M R + Y R + L + + + + + Sbjct: 3 INIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDFNYLKQTSNYHN 62 Query: 138 MFYSPELARRV---YNSPFPLVDITITPDDEI-----MQHRRIAILELLQKHIRQRDLML 189 +F+ E + +++ + + LL + ++++ Sbjct: 63 VFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRRREISLWENELVRWLLLLEGADNQEILQ 122 Query: 190 LLEQLV-------TLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG---- 238 +LE++ ++ + + Y +R R Sbjct: 123 ILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIREAELRLQEALEEG 182 Query: 239 ------ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 E + +G +G +GR E E A++LL G +AE L E Sbjct: 183 MAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEVAKKLLVLGFEITKIAEATGLSEEE 242 Query: 293 IDKVIN 298 I + + Sbjct: 243 ISGLKD 248 >UniRef50_C6Y2C7 Putative uncharacterized protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2C7_PEDHD Length = 283 Score = 76.8 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 88/292 (30%), Gaps = 27/292 (9%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D FK+ + +D + L + + E +K VL+ + Sbjct: 14 DYGFKRLFGNEPD-KDIMIEFLNALFEGEKIVIDIRYSPTEHAGEDVKEKK--VLFDLTC 70 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD----KLPLVVPILFYQGE 125 G G + IE Q + R + Y + L L V I + + Sbjct: 71 TGADGETFI-IEMQRADQEFFRDRCVFYMSRLISAQLPRGTSNWDVPLKEVYLIGIMEFQ 129 Query: 126 ATPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 + + + + Y + +++ E + L K++ Sbjct: 130 FNNINSNYLHNIALMNRDTGKVFYKGMGYKFLELPNFDKKESDLVTELDKWFYLLKNLSH 189 Query: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTL 244 L+++ +D+ + + R+ + Sbjct: 190 ------LDKIPDFLDK------------RVFQKIFKIAEMSKMTKEERELYDSDVKAKSD 231 Query: 245 AQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +K ++G+ E E A+ L SK ++ E +AE L + EI+K+ Sbjct: 232 WNAGIRYAEKKAKEEGKLEEKLEIARNLKSKAIAFEIIAETTGLSIDEIEKL 283 >UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YMI0_ANASP Length = 314 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 99/304 (32%), Gaps = 29/304 (9%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFI----EESLKGHSTDVLY 65 D+ +K+ L ++ P + + F E D L Sbjct: 11 DSPWKEIL--EAYFPQAVQFFFPETAALINWERPYEFLNTEFQQIAREAEQGKPYADQLV 68 Query: 66 SVQ-MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 V +QG +L + +E Q++ + + RM Y+ R + + + Sbjct: 69 KVWQIQGEEIWLLIHVEIQAQKEDDFSKRMFTYNFRIFDRFEK-------PAISLAILCD 121 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPD--DEIMQHRRIAILELL---- 178 + S ++ + N F +V + + DE+ + ++ Sbjct: 122 TNRQWRPSNYSYNYPQTR------LNFEFGIVKLLDYENRFDELENNTNPFATVVMAHLK 175 Query: 179 --QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET 236 Q ++ + L+ + + + + ++ +A ++ Sbjct: 176 TQQTRSSPQERKIWKFSLIRRLYDLGLQEQDIRNLYRFIDWVMILPKALENQLCSEVQQL 235 Query: 237 GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 E M E G E+GIQ+G + + +R L + +S E + +L + +++ + Sbjct: 236 EQERTMRYVTSAERIGYERGIQEGELGIILKLLKRRLGE-LSPEIQQRIQSLSVNQLENL 294 Query: 297 INLI 300 + Sbjct: 295 SEAL 298 >UniRef50_Q8ZS56 Alr7656 protein n=6 Tax=Nostocaceae RepID=Q8ZS56_ANASP Length = 319 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 48/310 (15%), Positives = 94/310 (30%), Gaps = 41/310 (13%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI----EESLK 57 + +D +K+ + FL P E++ D E + Sbjct: 2 SEVRADYDGAWKE--GVEQYFEAFLAFFFP-EIQAEIDWERGYDFLDQELQQLIRESEIG 58 Query: 58 GHSTDVLYSVQMQGN-PGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D L V ++ +L + +E QS+ D RM Y R+ + V Sbjct: 59 KQFVDKLIKVWLKDGKETWLLIHLEIQSQVDTNFPKRMFSYHYRIFDRYNQE-------V 111 Query: 117 VPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDE-IMQHRRIAIL 175 V + + YS + FP+V + ++ Sbjct: 112 VSLAILGDNQANWRPQE------YSYGRWGCHLSLQFPIVKLLDYESRWSELEQSDSPFA 165 Query: 176 ELLQKHIR----QRDLMLLLEQLVTLIDEGYT------SGSQLVAMQNYMLQRGHTEQAD 225 L+ H+R +DL L+ ++LI Y Q+ + + ++ + Sbjct: 166 VLVMAHLRTQATTQDLAGRLQWKLSLIKRMYELGYSRDKIQQIFRLLDRLMTLPPELDLN 225 Query: 226 LFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 + R + MT + GI + Q+ + + K + E V ++ Sbjct: 226 FKAELERFEAEQEMTYMTSIERI-------GIAEATQKYIAQI-LTIRFKDIPTELVEKL 277 Query: 286 ANLPLAEIDK 295 L E+ Sbjct: 278 NKLYDIELLN 287 >UniRef50_C0R2N1 Putative uncharacterized protein n=4 Tax=Wolbachia RepID=C0R2N1_WOLWR Length = 277 Score = 76.1 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 72/236 (30%), Gaps = 21/236 (8%) Query: 82 HQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQGEATPYPLSMCWFDMF 139 +Q K R Y+ A R + L ++ I P Sbjct: 41 NQVAKTKGFEKRAQYYAAKAYSRQADKGDQYHNLKEIIFIAIADCVLFPNKSEYKSKHTI 100 Query: 140 YSPELARRVY-NSPFPLVDITITP-DDEIMQHRRIAILELLQKHIRQRDLMLL------- 190 + + F +++ P + E + ++ + L Sbjct: 101 RDEDTNEHDLKDFYFIFIELPKFPKNKEDQLENIVEKWVYFFRYADETSEEELEKIIGSD 160 Query: 191 --LEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT--------EQADLFYGVLRDRETGGES 240 +++ ++ S + +A + + + + A E E Sbjct: 161 VIIKKAYEELNRFNWSEKEFIAYEQEIKRILDEQAVLAQKLDDATEKGREEGKEEGKEEG 220 Query: 241 MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + + +GI+ G ++G ++ A+ LL G+S + +A+ L + E+ + Sbjct: 221 IQIGHEKGRAEGIQIGAEKGEKQAKITVAKNLLKAGVSIDIIAQTTGLTVDEVKDL 276 >UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G7H9_ABIDE Length = 305 Score = 75.7 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 53/289 (18%), Positives = 103/289 (35%), Gaps = 20/289 (6%) Query: 8 PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLH--LESGSFIEESLKGHSTDVLY 65 +D K + + D + L + +E ++L + ++ E K H + Sbjct: 3 DYDVTEKLLEDYNDVFADIVNTLLF-DGKERVKEDSLEDSKINSAYKAEDGKLHEQERDV 61 Query: 66 SVQMQGNPGYLHVV-IEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLP-----LVVPI 119 S + L VV IE+Q+K +K M R++ Y A+ L +LP VV I Sbjct: 62 SKYWKEGNTNLLVVGIENQTKAEKLMPARIIGYDGASYRSQLLKSTGRLPKNKLTPVVTI 121 Query: 120 LFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQ 179 + Y G + L V + + +I P++++ ++ + L+ Sbjct: 122 VLYFGLTRWNQPKNLKGILDIPTGLEDFVSDYKINVFEIAFLPEEKV--NKFKSDFRLVA 179 Query: 180 KHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGE 239 K+ + + + + A+ ++ +E E Sbjct: 180 KYFTN------IRKNPYYLPADENEIKHVDAVLKFLSIMSGSEDIIEKLTANNGSEVKNM 233 Query: 240 S---MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEM 285 + + L +G E+G+ QG E + SKGMS E+ E+ Sbjct: 234 TGGPLSQLYYKGVSEGREEGLLQGINETLLKVYLNCRSKGMSVEESEEI 282 >UniRef50_C0DB21 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DB21_9CLOT Length = 328 Score = 75.7 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 41/295 (13%), Positives = 81/295 (27%), Gaps = 48/295 (16%) Query: 15 QFLMHAETARDFLEIHLPVELRELCDLNTLHLESGS--------FIEESLKGHSTDVLYS 66 + L D L L GS + + DV Sbjct: 10 KLLSDPVYFSDLCNGVLFRG-EMYLKPEDLMPVKGSQGVLYADRKGVKKVLERRRDVAMR 68 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD--------------- 111 ++ Y + +E+Q+ M R + Y ++ Sbjct: 69 LK--SGTRYAVIAVENQANIHYAMVIRSLLYDALDYTDQVQIQEKELRQAGRRPSGDGFL 126 Query: 112 -------KLPLVVPILFYQGEATPYPLSMCWFDMF-------YSPELARRVYNSPFPLVD 157 +L VV ++ Y G + S ++ +PELA + + LV+ Sbjct: 127 SGVGPKLRLEPVVTLVLYWGSGH-WDGSTSLHELLGLKDGKGEAPELAGYIPDYRLNLVN 185 Query: 158 ITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQ 217 D I + I +L+ + L ++ T + + + L Sbjct: 186 AANMDDPSIFRTHLQQIFSMLKYKSDKAALYRYAQENRTELRDMDGTAKLA-------LL 238 Query: 218 RGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 EQ L + M G +G ++G ++ + ++L Sbjct: 239 SMMGEQKRLQKIMEEAEGEEEFDMCKAIDDLIADGESRGFERGDRQGFERGERQL 293 >UniRef50_Q8YQI6 All3837 protein n=4 Tax=Cyanobacteria RepID=Q8YQI6_ANASP Length = 276 Score = 75.7 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 35/287 (12%), Positives = 90/287 (31%), Gaps = 17/287 (5%) Query: 17 LMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH--STDVLYSVQMQGNPG 74 + + + P + ++ + F+ LK D ++ + Sbjct: 1 MQTDKIFYSLFQAF-PSIFFAIIGETDINPSTYEFVSVELKETAFRIDGVFKPVNESTEE 59 Query: 75 YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMC 134 L+ +E Q + D K R + ++ + + ++ P + P Sbjct: 60 PLYF-VEVQFQLDPKFYRRFFAEIFLYLRQNPSVNFWRAVVIYP------QRIIEPDDQQ 112 Query: 135 WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQL 194 + + +R+Y + ++ I + R L++ Q Sbjct: 113 PYRLILDSSQIQRIYLDELGTASENSLQ----LAIVQLIIASEATAIDQGRQLIIQARQE 168 Query: 195 VTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLF---YGVLRDRETGGESMMTLAQWFEEK 251 +T + + Y E+ + + ++ Sbjct: 169 LTDEANKKQIVELIETILLYKFTNLSREEVAAMLGIDDEFKKTRMYQSIKEDGLEEGRQE 228 Query: 252 GIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 G ++G Q+G+ + E RLL+ G++ E +A +L + ++ +V+ Sbjct: 229 GRQEGRQEGKLQAKLEAIPRLLALGLNVEQIAGALDLTIEQVQEVVE 275 >UniRef50_B7K6I4 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B7K6I4_CYAP8 Length = 319 Score = 75.7 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 46/314 (14%), Positives = 105/314 (33%), Gaps = 35/314 (11%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSF----IEESLKG 58 +T D+ +K + DF+ P ++ + L Sbjct: 2 KDPSTSFDSPWKDIV--EAYLPDFMAFFFPDAYEQINWEQGFEFLDKELGQVVRDAQLGK 59 Query: 59 HSTDVLYSV-QMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVV 117 D L V + G ++ + +E QS+ + A R+ Y R+ V Sbjct: 60 RFVDKLVKVYRRSGEETWVLIHLEIQSQYEAGFAERIYVYQYRIYDRYRRK-------VA 112 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITIT--PDDEIMQHRRIAIL 175 ++ E+ + S +++F N + +V + + + + Sbjct: 113 SLVVLGDESPTWKPSEFGYEIFGVE------INYRYRVVKLLDLGQDWEALSANENPFAT 166 Query: 176 ELL-------QKHIRQRDLMLLLEQLVTLIDEGYTSGSQ--LVAMQNYMLQRGHTEQADL 226 ++ K RQ L L L +GY L +++L +++ Sbjct: 167 VVMAHLKAGQTKKNRQERLQWKLSLTRQLYQKGYLRQDVINLFRFIDWILSLPDNLESEF 226 Query: 227 FYGVLRDRETGGESMMT-LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM---SREDV 282 + + + E +T + + E+G E+G +G Q + R L++ + S Sbjct: 227 WSELRQYEEEQRMPYITSVERLGRERGREEGRLEGMQREAANMVLRQLNRRLGQVSPSVE 286 Query: 283 AEMANLPLAEIDKV 296 ++ L + +++ + Sbjct: 287 EQIRQLRVEQLEDL 300 >UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ATN4_CHLCH Length = 287 Score = 75.7 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 47/298 (15%), Positives = 101/298 (33%), Gaps = 34/298 (11%) Query: 7 TPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D V K L A D I L ++ L + ++ DV+ Sbjct: 2 HAKDVVSKDIL--KRIALDIARILL------HLKVDHAELLETEH--QRVEERRADVVVL 51 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 V QG G + +E Q+ +A+R++RY H +D + L Y G+A Sbjct: 52 V--QGESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAH--KGYD----IKQYLIYIGKA 103 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ-R 185 + + + + + ++D+ ++ L L + R Sbjct: 104 P----------LSMPTGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDFKGR 153 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 ++ ++ + E +YM ++ + E + Sbjct: 154 SEREVVRYIIQRLQELTAENE--SRYHDYMRMLEILSANRSLEKIIEEEEAMLSVVDQTR 211 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG---MSREDVAEMANLPLAEIDKVINLI 300 G+ GI+QG Q+ + +R L++ +S VA + L + +++++ + + Sbjct: 212 LPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHHVARLDKLNIEQLEELSDAL 269 >UniRef50_A7BPH0 Putative uncharacterized protein n=5 Tax=Beggiatoa RepID=A7BPH0_9GAMM Length = 289 Score = 75.3 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 106/301 (35%), Gaps = 35/301 (11%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M + +D +FK+ H ++ L L ++ S+ Sbjct: 19 MKQVAPLRYDVIFKKAFSHPTIFTALVKDFLG------IQLEIDEVKYNKGFVPSVNSLV 72 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHR---HLEADHDKLPLVV 117 ++ + + ++ L V ++H + + R + Y ++M + +D+D ++ Sbjct: 73 SE--FDLFVEDKKNQLIVEMKH-AYCSRSDYERFVYYQCSSMVEAVINSNSDYDFPMTII 129 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 I+F+ + TP P S F S +LA R+ ++ + Sbjct: 130 TIVFFTWKKTPSPDSSIIVHDFESRDLATGQLLDKIY--------------QRKHQLIFV 175 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 + + + ID+ N ++Q +LF + +D+ T Sbjct: 176 FTNDSTHENTPSTYREWMQAIDDSLDGEVDEEKYTNPLIQ-------ELFGVIEKDKITP 228 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVS-QEFAQRLLSK-GMSREDVAEMANLPLAEIDK 295 E Q+ +E+ K G ++ ++ A+ L + ++ +++A L L + Sbjct: 229 EERACMKDQYSQEEACIKAFNDGMKQGQSKKTARNLKANSKLTEKEIARATGLSLEMVKA 288 Query: 296 V 296 + Sbjct: 289 L 289 >UniRef50_C9LXS5 Transposase n=3 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXS5_9FIRM Length = 347 Score = 75.3 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 41/330 (12%), Positives = 98/330 (29%), Gaps = 55/330 (16%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPV-------------------ELRELCDLN---- 42 +D K+ L + L+ +P D++ Sbjct: 29 NPAYDQHAKRLLAQKDVVARILKGVVPEFRQMDLATIIGRCIEGEPEIGAIPIDMDKTNA 88 Query: 43 ------TLHLESGSFIEESLKGHSTDVLYSVQM--QGNPGYLHVVIEHQ-----SKPDKK 89 + ++ + D+L+ ++ G L V IE Q S+ Sbjct: 89 ARRIPKEIRGDNTESASPTEGWIRFDILFRAKVPQTGARITLIVNIEAQKTQSNSRLGYA 148 Query: 90 MAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVY 149 + R + Y+ + E + K Y Y + +C + Sbjct: 149 LLRRAIYYACRLISSQKETEFAK-------SNYNDIKKVYSVWICMDAPDDKSAINFYDM 201 Query: 150 NSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLV--TLIDEGYTSGSQ 207 L E + + I+ + + ++ +L+ + Sbjct: 202 QERHFLHR----TKAEKSDYDLLNIIMIYLGADDSGNELVRFLKLLFRDTVKSAAEKKKI 257 Query: 208 LVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQE 267 L + + + ++ + + G + E+GIE+GI+QG ++ Sbjct: 258 LESEFDLDISGDMEKEMNTMCNLSEGIFERG------IEQGIEQGIEQGIEQGIEQGESG 311 Query: 268 FAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 +L KG +A+++ + +I+++ Sbjct: 312 MILSMLKKGYDLTSIADISQWSIKKIEQLA 341 >UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LJP2_9FIRM Length = 326 Score = 74.9 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 87/241 (36%), Gaps = 15/241 (6%) Query: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLE-------ADHDKLPLVV 117 ++ ++ G + V +++Q+ D M R+M +KL V+ Sbjct: 78 FNKKIVAPDGEIIVALQNQTTVDFGMPLRVMTEDALEYDVQRRMCKDEKLHKGEKLAPVI 137 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPEL-ARRVYNSPFPLVDIT-ITPDDEIMQHRRIAIL 175 I+FY G + + E + Y P+ ++ IT D + Sbjct: 138 TIVFYYGAQIWSGPTDLADMVKIPEEFKWLKKYIRPYAMLLITPENVDAAWFSGGWREVF 197 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 E+LQ+ ++++ L++ ++ ++ ++L+ L + + V+ + Sbjct: 198 EILQRRNDEKEMQRYLQKKRSVYEKLPEDTNRLIFALTGHLDYYNALKRKGERAVM--CK 255 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDV----AEMANLPLA 291 + + + + GI +GI QG + +G + E + + +L Sbjct: 256 AFEDHYKSGVEEGKNIGIHQGISQGLGRGIGAMIRENQEEGKTTESIIDKLQKYFSLSRE 315 Query: 292 E 292 E Sbjct: 316 E 316 >UniRef50_C6XVH2 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVH2_PEDHD Length = 290 Score = 74.9 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 87/301 (28%), Gaps = 35/301 (11%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 D FK F E+ +D L L + + L + S + Sbjct: 19 SRYINPKTDFAFKHFFG-KESHKDLLIGFLNGIFKGRKIIVDLEYNQVTHQGISKEDRKN 77 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHD----KLPLVV 117 ++ + GN G ++E Q RM+ Y+ +++ + + +LP V Sbjct: 78 --IFDLNCTGNKGE-RFIVEMQQAKRSFFKDRMIYYTSNLIYQQGISVNSDWNYELPEVY 134 Query: 118 PILFYQGEATPYPLSMCWFD--MFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 + D + A + +++ E Sbjct: 135 LVAIMDFSFDDTHPDQYEHDVRLMDVHTHAEFYKKLGYIFIEMPKFKKVETELVTNEDGW 194 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRE 235 ++ M L+++ + + + L + + RD Sbjct: 195 LFSLRY------MNTLKEIPLSLRDKEEFIKLFNIAEVSNLDPDEMKAYQASLKIARDNY 248 Query: 236 TGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 + E++ ++E + E A L +S E +A L L EID+ Sbjct: 249 SHDETI-------------------KREKAFEIAAELKKNEVSFEIIAAATGLTLNEIDE 289 Query: 296 V 296 + Sbjct: 290 L 290 >UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BTR0_9GAMM Length = 309 Score = 74.9 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 100/303 (33%), Gaps = 20/303 (6%) Query: 10 DAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQM 69 D K L D LE L L+E + + + + + K + D+L Sbjct: 11 DWALKNILRDKANF-DVLEGFLTALLQEDISVLEILESESNQSDFAKKFNRVDILVKDSH 69 Query: 70 QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPY 129 Q ++IE Q+ + R++ + + LE D + I Sbjct: 70 QRK-----MIIEVQNHRETGYLERILWGTSKLIVETLELGEDYRNISKVISISIVYFDLG 124 Query: 130 PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLML 189 + + L +N PF + + +Q + I L + +D++ Sbjct: 125 LSDDNEYVYYGVANLHGLQHNQPFRFRRLMADKTFKSLQTKDIFPEFYLLRVEHFQDIIK 184 Query: 190 ----------LLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRET--- 236 + T + +Q M + + + +R+ Sbjct: 185 TDLDEWIYMLKHSTIRTDFKSKNINKAQEKLTLLQMNPQKRKDYEKYMVDMTVERDVLEA 244 Query: 237 -GGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 E + Q ++G ++GIQ+G ++ + + L +G+ ++ + L + EI K Sbjct: 245 AQEEGIQKGRQEGIQEGRQEGIQKGMEKKTVVIVKNALQQGLELTLISSLTGLSIEEIQK 304 Query: 296 VIN 298 + N Sbjct: 305 IQN 307 >UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EA97_9SPHI Length = 293 Score = 74.9 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 114/285 (40%), Gaps = 21/285 (7%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + R+ +E+ LP +RE+ L L E + + D L V ++ + I Sbjct: 16 KIIRENMEVTLPEVIREVLGLEILLSEELPDDVQHTRERKPDALKKVTDIQGNTFV-LHI 74 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E Q + +K+M +RM YSI M R+ +LP+ ++F + P + + Y Sbjct: 75 EFQVEDEKEMVYRMAEYSIMLMRRY------QLPVKQYVIFLKDTKPRMPTGLKTPKLVY 128 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE 200 S +L R + + L + P+ ++ +A+L + R+ L ++ L++ Sbjct: 129 SFDLIR-IAEISYKLFIKSDNPEVKM-----LAVLANFDEADREGALTSIITGLLSHSKG 182 Query: 201 GYTSGSQLVAMQNYMLQRGHTEQ-ADLFYG------VLRDRETGGESMMTLAQWFEEKGI 253 + ++ +M R EQ D + + E KG Sbjct: 183 DFAERRHFKQLRIFMQLRSSIEQHFDKVMDSVSTFFKEENDYFYRKGEARGEIKGEAKGE 242 Query: 254 EKGIQQGRQEVSQEFAQRLLSK-GMSREDVAEMANLPLAEIDKVI 297 KG +G + S+ + L++K G S E AE+A + + + + Sbjct: 243 AKGEAKGEAKKSRAVVENLIAKLGFSDEQAAEIAEVTVDFVKDIR 287 >UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0QWG9_BRAHW Length = 301 Score = 74.9 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 36/304 (11%), Positives = 92/304 (30%), Gaps = 30/304 (9%) Query: 3 APSTTPHDAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGH 59 +D + + A +F+ E + + + I E+ Sbjct: 18 NNLNRINDYFIRYLFSHEGNENIALNFINAVFKDLGFE--TFKKIEILNPFNIAENYDEK 75 Query: 60 STDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD--HDKLPLVV 117 + V + + V+IE Q++ ++ R + Y L +D+L V Sbjct: 76 ESIVDIKAITESG---ITVLIEIQARGNEDFIKRALYYWAYNYSSSLNRGSFYDELKPTV 132 Query: 118 PILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILEL 177 I T + + + +++ Sbjct: 133 SINITNFILTNEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFN--------------- 177 Query: 178 LQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFY-----GVLR 232 L+ L + ++ ++ + ++ +N + + + + Sbjct: 178 LKNISAIESLDNIHKEFISWVKFFKGEDMSILMKENTIFEEVEKKCRTFVNNTPVMDKYK 237 Query: 233 DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAE 292 RE + +K E+GI+QG + + A+ + G+ + ++E L + E Sbjct: 238 KREVDAYFFDKSIELDLKKAKEEGIEQGEKNKAISIAKSFKNAGIDIKIISENTGLSIEE 297 Query: 293 IDKV 296 ++K+ Sbjct: 298 VEKL 301 >UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RQ02_FIBSS Length = 360 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 53/299 (17%), Positives = 106/299 (35%), Gaps = 20/299 (6%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLE-----IHLPVELRELCDLNTLHLESGSFIEESL 56 + + HDA F+ AR LE H +L+TL S+ E Sbjct: 5 NKVTKRKHDAYFRWLFADTTHARCLLELAGKINHEIDAFLTQINLDTLMRIPDSY-SEVD 63 Query: 57 KGHSTDVLYSVQM-QGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPL 115 D+ + V + G P + +++EH+S D + ++ +Y + M + Sbjct: 64 DTGEADLAFRVNVSTGAPILVGILLEHKSGRDPIIFDQISKYIHSVMKIQDKNRIFSGIP 123 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIA-- 173 + I+FY G P + + Y +V V++ PD + + A Sbjct: 124 TMAIIFYNGRDNWNP--LKILEKSYPDYFRGKVLPFQCTFVNMADIPDSDCLACENTATG 181 Query: 174 ILELLQKHIRQRD-LMLLLEQLVTLIDEGYTSG--SQLVAMQNYMLQRGHTEQADLFYGV 230 + + KH +D L+ LL Q +D+ + L Y+++ + + Sbjct: 182 MGIIALKHAFNKDKLLELLPQFCKFLDKMPRNEASCLLEKTSIYLMEYLGKDFLKEL-NM 240 Query: 231 LRDRETGGESMMTLAQWFEEKGIEKGIQQGRQ-----EVSQEFAQRLLSKGMSREDVAE 284 +++ +F ++ E+ Q + E Q+ + L R+ + E Sbjct: 241 AFVSIGQKYGFVSIGDYFRQQLAEERQQMTEERLQMAEERQQITEERLQMAEERQQITE 299 >UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8YTL4_ANASP Length = 270 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 41/281 (14%), Positives = 93/281 (33%), Gaps = 20/281 (7%) Query: 17 LMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG--HSTDVLYSVQMQGNPG 74 + + P EL + + F +K D L+ ++ + Sbjct: 1 MKTDTIFYSLFQEF-PHIFFELINQSPQEASIYEFTSREVKQLAFRLDGLFLPKINDSTK 59 Query: 75 YLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMC 134 ++ +E Q +PD +R+ + ++ K P ++ Sbjct: 60 PFYI-VEVQFQPDDDFYYRLFAELFLYLKQY------KPPYPWQVVVIYPSRGIERQQTI 112 Query: 135 WFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQL 194 FD +R+Y E + L + + L+ Q Sbjct: 113 HFDEILVLNRVKRIYLDEL-------GEVAETSLGVGVVKLVIETEETAPVLARQLIAQA 165 Query: 195 VTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIE 254 + + + ++ ++ + + + +L E + Q E+G + Sbjct: 166 KQQLTDVTAKRDLINLIETIIVYKLPQKSREEIEAMLGLNELKQSR---VYQEALEEGKQ 222 Query: 255 KGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 +G Q+G+QE E R++ G+S E +A++ +LPL + + Sbjct: 223 EGKQEGKQEAKLETIPRMVQFGLSVEAIAQLLDLPLEVVQQ 263 >UniRef50_A7C3X3 Putative uncharacterized protein n=7 Tax=Beggiatoa RepID=A7C3X3_9GAMM Length = 308 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 38/318 (11%), Positives = 98/318 (30%), Gaps = 30/318 (9%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M + +D +FK+ + F+ L DL +E + + + Sbjct: 1 MKEVAPLRYDVIFKKAFGVPKIFTAFVHDFLN------IDLEIDTVEKDKVYDPPIGNVA 54 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 + + + + V ++H PD R + Y AA+ + D P + Sbjct: 55 AK--FDLYAEDKKNRVIVDMQHVRFPDHY--DRFLHYHCAALLEQVVYSKDYRPNLKVFT 110 Query: 121 FYQGEATPYPLSMCWFDMFYSPELARR-VYNSPFPLVDI-TITPDDEIMQHRRIAILELL 178 + F +L + + ++ I + + +E + Sbjct: 111 LVILTSGDRHKKDITITDFDPKDLEGNPIGETEHKIIHICPKYLNKAHTPPQYHEWMEAI 170 Query: 179 QKHIRQR-DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG 237 + + ++ D + I + + M + ++ Sbjct: 171 EDSLDEQVDESKYTHPEIQQIFKLIEKDKVTPQERAKMFDEYSMDAVKQEKIQKIQKKAK 230 Query: 238 GESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKG-----------------MSRE 280 E + + ++G+++G ++G +E +E + L +G ++ E Sbjct: 231 EEGLKEGLKEGLKEGLKEGKEEGLKEGLKEGKEEGLKEGEHKAKEELVRNLWSIGMLTEE 290 Query: 281 DVAEMANLPLAEIDKVIN 298 +A+ L L ++ + Sbjct: 291 QIAQTTGLTLEKVKALKE 308 >UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C3K1_9GAMM Length = 272 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 101/294 (34%), Gaps = 34/294 (11%) Query: 12 VFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQG 71 K+ F++ L +E+ D I D + + Q Sbjct: 4 FLKKVFSKPHIFTAFVKDMLGIEI--EIDKVETEKSFSPIIGN------VDSRFDLFAQD 55 Query: 72 NPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPL 131 L V I+H+ D R + Y A+ + + + P + + Sbjct: 56 TKNRLIVDIQHKRYKDHY--DRFLHYHCVALLEQITSSANYKPDMQVYTIVVLTSGDKHK 113 Query: 132 SMCWFDMFYSPEL---------ARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 + F +L + VY P + D T P E ++ ++ + +++ Sbjct: 114 TDLLITDFSPKKLDGSSIAETQHKIVYVCPKYVTDETPKPYQEWLKAINDSLDKQVEESH 173 Query: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMM 242 + +++++ +LI + S + M D + +E ++ Sbjct: 174 YHNE---VIQEIFSLIKKDKISPEEYARM------------KDEYSDEEYLQEQTQKARK 218 Query: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 + EKGI KGI++G ++ A+ + ++ E + E+ L + +I+ + Sbjct: 219 EGMEKGMEKGIGKGIEKGIEKGVLMMAKNMKEAKVAIETIIEVTGLSIEQIEDL 272 >UniRef50_Q1NU37 Putative uncharacterized protein n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NU37_9DELT Length = 309 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 99/311 (31%), Gaps = 27/311 (8%) Query: 3 APSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNT-LHLESGSFI----EESLK 57 HD+ +K+ L + +FLE L E+ D + + L Sbjct: 2 TEPLQDHDSPWKEALEN--RFAEFLE-LLFSEVHREIDWSRERTFLDKELQKLTQDAELG 58 Query: 58 GHSTDVLYSVQMQ-GNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLV 116 D L V G ++ + +E Q + + A RM Y R+ L Sbjct: 59 RRYADKLVKVWSNEGRETWVLIHVEVQGEAQQDFARRMYIYHYRISDRYSVDVVSLGVLA 118 Query: 117 VPILFYQGEATPYPLSMCWFD-MFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAIL 175 ++ ++ + + C D F + +L + + + E ++ ++ Sbjct: 119 DTVVSFRPDGYRWQRWGCTLDFCFPTVKLLDWLAAERWQQL--------ERSENIFALVV 170 Query: 176 ELLQKHIRQRDLMLLLEQLVTLIDEGYTSGS------QLVAMQNYMLQRGHTEQADLFYG 229 + DL +L + LI Y G +L + ++M++ + Sbjct: 171 MAQLAAKTEADLDILESKKFRLIKLLYDRGYSKEIILELFRVIDWMIRLPDNLEDRFLDA 230 Query: 230 VLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLP 289 V + E M E +GI+ G+QQG + R + ++ Sbjct: 231 VHKIEEDKK---MPYVTSAERRGIKIGMQQGEASLLLRQMGRKYGSSAADSYRQKIEQAD 287 Query: 290 LAEIDKVINLI 300 + K I Sbjct: 288 PESLLKWSERI 298 >UniRef50_C4Z2A6 Putative uncharacterized protein n=2 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z2A6_EUBE2 Length = 336 Score = 74.5 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 46/329 (13%), Positives = 89/329 (27%), Gaps = 47/329 (14%) Query: 2 DAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHST 61 D ++ D VF + + DL L GS++ + Sbjct: 6 DNVTSKFKDNVFCMLYRDKRNLLELYNALNNSAYTNVDDLQVTTLNGGSYM-----KYKN 60 Query: 62 DVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL------EADHDKLPL 115 D + + M + E QS + M R + Y K+P+ Sbjct: 61 DASFLLCMS------LYMFEQQSSKNPNMPLRFLHYVSDVFRELFSNSMLHRRSTIKIPV 114 Query: 116 VVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHR----- 170 + FY G + E+ + I I D I+ Sbjct: 115 PHFVTFYNGLEKWIEDEDEI-RLSDMYEIPTDNPELELKVRVININKDVHILNKCKTLRD 173 Query: 171 -------------------RIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAM 211 RIA+ E + + I + L+ EQ + E + Sbjct: 174 YMTFVNKVRFKMGVEGDDVRIAVTEAMDECIDEDILVDFFEQHREEVVEVSIYDYDEEDV 233 Query: 212 QNYMLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQR 271 + + + + + E + ++ +G Q G Q + Sbjct: 234 RRTLFEEAKEMAKEELKETVI-----EELKREAKEELTQEVFAEGEQSGEQLKIVNQIIK 288 Query: 272 LLSKGMSREDVAEMANLPLAEIDKVINLI 300 + K + E +A A+I + + + Sbjct: 289 KVKKSKTLETIASELEEEEADIKPIYDAV 317 >UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XJH0_CALS8 Length = 134 Score = 74.1 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 55/135 (40%), Gaps = 1/135 (0%) Query: 1 MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHS 60 M+ + +A+F+ ++ L+ + +++ + + E++ + Sbjct: 1 MNNNFSQDENAIFRLIFSDSKEILFLLKNVAKFSWVDRIQKDSIEVILVDYDNENVLKYK 60 Query: 61 TDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPIL 120 DV+ V ++ N Y+ V + P+ M ++ + + ++ DK+P ++P++ Sbjct: 61 PDVIAKVTIENNTAYIFVFFVSKV-PECGMRNIILNNMLLFWEKKIKEGTDKIPPIIPLV 119 Query: 121 FYQGEATPYPLSMCW 135 Y G+ + Sbjct: 120 LYNGKEIWTEPREIY 134 >UniRef50_Q2FSM2 Putative uncharacterized protein n=3 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSM2_METHJ Length = 304 Score = 74.1 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 102/296 (34%), Gaps = 19/296 (6%) Query: 6 TTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 + +D +F+ + + L L L E + + + K D+ Sbjct: 23 SPKNDFLFRLLFGD-DGNEELLASLLSSILHEEIEHVVIKNPYILKLFSEDKETILDIKA 81 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQ 123 ++ + V IE Q + R++ Y ++ +D L IL Sbjct: 82 AINSKK-----LVDIEIQLWNSPCLMSRILFYWARLYASQIKQGNDYTVLQKTTSILILD 136 Query: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 + + ++++ + + + + +L + + Sbjct: 137 DLNHSSEDYHACSHLHDWKQHITLTDMIEVHVLELPKLHNLKQLDKSNTLLQWMLFFNAQ 196 Query: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMT 243 R+ ++++ + +I + + ++ E+ Y + G + + Sbjct: 197 TREELIMVSEANPVIKKA----------TDLLITMSRDEETRQMYEAREEYLLGRQIEIQ 246 Query: 244 LAQ-WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVIN 298 A+ E+G +G +GR+ ++ A RL+ +GM+ E + ++ L +A I + N Sbjct: 247 GAKGEGREEGRIEGRIEGRETERKDIAMRLIEEGMNDEFIKKITGLDIAVIRSLHN 302 >UniRef50_B7I1C8 Putative uncharacterized protein n=16 Tax=Bacillus cereus group RepID=B7I1C8_BACC7 Length = 307 Score = 73.7 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 42/299 (14%), Positives = 94/299 (31%), Gaps = 21/299 (7%) Query: 10 DAVFKQFLM---HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D FK+ + + FL L + E +LHL+ E + + Sbjct: 15 DYAFKRLFGVEGNEDILIGFLNAVLQSSIDEEI--TSLHLDDPHLPREQKDDKLSILDLR 72 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQG 124 + + + IE Q + K M R + Y + + L + I Sbjct: 73 ATLNSG---IKINIEIQVRDKKDMIERSLFYWSGMYYSQMTQGMKYTELRPTICINIVDF 129 Query: 125 EATPYPLSMCWFDMFYSPELARRVYNS-PFPLVDITITPDDE-----IMQHRRIAILELL 178 P + + + R + + ++I + +A LL Sbjct: 130 ILFPEEQEFHSINTVMNKKSKRIITENMQLHFLEIPKVIQEWQGKRMDPWEDSLARWLLL 189 Query: 179 QKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGG 238 L +LE + A++++ + L+ + + Sbjct: 190 FPAHEDERLTTILEAIA-----MEKDPVLKKAIEDWERLSSDKDFLRLYEAREKAIKDRI 244 Query: 239 ESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 + T + +K E ++ + E + + G+ E +A++A L + E++++I Sbjct: 245 SEIETAEERAAKKAAEIATEETKIATKIEMIENMFKIGLPIEKIAKVAELSVEEVNEII 303 >UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FYK3_ABIDE Length = 365 Score = 73.7 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 48/318 (15%), Positives = 103/318 (32%), Gaps = 32/318 (10%) Query: 9 HDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLES-GSFIEESLKGHSTDV-LYS 66 D + K+ M + DFL + + + + L + + + D + Sbjct: 4 KDILEKKLFMFNDVFADFLNGIIFNGRQIVEESELFDLSGWSHYKADDSRHRYQDRDVVK 63 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-------------------- 106 + + N + IE+Q PDK M FR++ Y A+ L Sbjct: 64 LWKKKNVVISLIGIENQDVPDKDMVFRVLSYDGASYKTQLAKKDEDKRKHLKDKKNTEIV 123 Query: 107 ---EADHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPD 163 + D + V+ + Y GE + + L V + L+D+ + Sbjct: 124 EIGKEDEKDIFPVITFVVYYGEEEWKYETTLKKRLKIGDGLDEFVSDYKINLIDLKKFTE 183 Query: 164 DEIMQ-HRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTS---GSQLVAMQNYMLQRG 219 D+I + + +L D + + E + +N + Sbjct: 184 DDINKFKKDFKLLVNYMVKGSNHDAGSIELNHPEEVSELVLRLTGEELPIPRENDGGKTM 243 Query: 220 HTEQADLFYGVLRDRETGGES--MMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGM 277 +F + E G + M +G+ KG+ +G+ + E + +++GM Sbjct: 244 EKFFEPMFARMAEKAEARGMAKGMTEGMAKGMTEGMAKGLAEGKAKGMTEGLAKGMTEGM 303 Query: 278 SREDVA-EMANLPLAEID 294 ++ + L ++ Sbjct: 304 AKGLAEGKARGLAEGLVE 321 >UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3131 Length = 247 Score = 73.7 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 29/258 (11%), Positives = 72/258 (27%), Gaps = 29/258 (11%) Query: 1 MDAPSTT--PHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKG 58 M+ + D VF+ + + D+ LE+ ++ Sbjct: 1 MNNETVNRKYKDTVFRLLFKDKSNLLSLFNAVNDTDFSDENDIKITTLENAIYMT----- 55 Query: 59 HSTDV--LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---- 112 D+ + +++ + EHQS + M +R + Y R++ Sbjct: 56 SKNDISCIIDMKLN--------LFEHQSTVNPNMPYRNLEYVTKCFKRYVGNFDVYTGKA 107 Query: 113 --LPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHR 170 LP ++FY G + + + N ++ I ++ Sbjct: 108 LTLPNPKFVVFYNG--VNEQPPIRVMRLSDLYAHKDEIPNLELVVIQYNINN---LVNCT 162 Query: 171 RIAILELLQKHI-RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYG 229 + E L+++ + L+ + + + + Sbjct: 163 LMDRCEPLKEYSEFIGCIRSNLKTMDKGEAVDSAIDYCIGNGILKDFLTNNRNEVRSMSL 222 Query: 230 VLRDRETGGESMMTLAQW 247 D E +++ +A Sbjct: 223 FEFDAEEHEKAIKQIAYE 240 >UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKZ7_9FIRM Length = 329 Score = 73.7 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 37/296 (12%), Positives = 78/296 (26%), Gaps = 44/296 (14%) Query: 13 FKQFLMHAETARDFLEI-------HLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLY 65 ++ L H DF L E + ++ + D++ Sbjct: 8 MRKLLNHPARFADFYNGTVFGGRQVLRPEQLSDVPNEQGIVILDKDGKKRVVERRRDIIK 67 Query: 66 SVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADH--------------- 110 ++ E+Q M R M Y +E Sbjct: 68 KASF--GAYFILAAEENQDTIHYGMPVRNMMYDALDYTEQMECLKQAHKSRGDVLDGGGF 125 Query: 111 -------DKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARR-------VYNSPFPLV 156 D+L VV ++ Y G P+ +DM A+ + + L+ Sbjct: 126 LSGITREDRLMPVVSLILYHGSK-PWDGPRSLYDMLGLDASAKETLALKQVLPDYRINLI 184 Query: 157 DITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYML 216 D + E+ + +L+ + ++ +Q + +M + Sbjct: 185 DASNIEHPELFCTSLQHVFSMLKYNTDKQKFYGYAKQ-----HQKDLLDMDDDSMLAMLT 239 Query: 217 QRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRL 272 G ++ + G + G +G +G+ E A + Sbjct: 240 LLGEQKRLLKILETSSNDTKEGTDVCIAIDELINDGKIEGKIEGKIEGEHRLATLM 295 >UniRef50_A6FZY9 Putative uncharacterized protein n=2 Tax=Plesiocystis pacifica SIR-1 RepID=A6FZY9_9DELT Length = 320 Score = 73.4 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 50/295 (16%), Positives = 92/295 (31%), Gaps = 29/295 (9%) Query: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEE--SLKGHSTD 62 TT H + + E A L +EL E + T H + D Sbjct: 2 PTTLHQGAARLLVDEPEQAFSCLRSVFGLELPEFVRVQTRHAVLDRHLPMAGDTGELRPD 61 Query: 63 VLYSVQMQGNP-GYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILF 121 L S + G+P G L ++IE Q +PD R+ Y A + V ++ Sbjct: 62 ALLSAESPGDPLGGLGLIIEAQRRPDPIKHRRLWVYWAHASEELRRS------TAVLMIA 115 Query: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 + + S+ +++ L V + ++ + D + H Sbjct: 116 LSDAVSRWARSLGQYELPPREGLL--VLDRH----NMPVVRDPATARRLPAWSTLSAMIH 169 Query: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESM 241 DL L L ++ + + Y+L + + G + + + Sbjct: 170 GVHGDLDALKVVLPVVLSLEDERRWRYAS---YLLCAVDPQSRAILEGAMSTQRIPISDI 226 Query: 242 MTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMS------REDVAEMANLPL 290 G ++GR E E ++G + E V E+ LP+ Sbjct: 227 ER-----RSIAFHDGREEGRTEGRAEGRTEGRAEGRTEVLRELVETVVELRGLPV 276 >UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAA90 Length = 345 Score = 73.4 bits (178), Expect = 9e-12, Method: Composition-based stats. Identities = 38/292 (13%), Positives = 96/292 (32%), Gaps = 14/292 (4%) Query: 10 DAVFKQFLMHAETARDFLEIHL---PVELRELCDLNTLHLESGSFIEESLKGHSTDVLYS 66 D VF++ + E + FLE L L E + + + + ++ Sbjct: 64 DFVFEKIFSNHERMKSFLESVLVGKNKILHEEINEVIYLNNNLLQNSLTQEYIPKKSMFD 123 Query: 67 VQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEA 126 +Q++ + G V I ++ R+ YS ++ + H L ++ I Sbjct: 124 LQIKTSQGTFIVEI-YKRSFQP-FLKRIQYYSAQSLSQQQNQTHTSLKPIISIAIVDDIL 181 Query: 127 TPYPLSMCWFDMFYSPELARRVYNSP-FPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 + F + + N + +++ + + Q + ++ Sbjct: 182 FEDDVPCISFHKTIEQKTQKVFLNYSTYVFIELGKYDNKKYDQSCVHGV--------NEK 233 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 + + LL++ ++ ++ E + E + + Sbjct: 234 EWLDLLKKSDIHRQYKTKEVLNAAQYAQFIQEKLFDEYVKHKLYEDQFIEEIKNAKVEGI 293 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVI 297 Q +E+ I+ + +E +++L G+S + + L EID++ Sbjct: 294 QQGQEETIKLSKHYSIKAGKEEVVKQMLKDGLSLQKIITYTGLSKEEIDEIK 345 >UniRef50_Q6ZEK6 Slr5124 protein n=11 Tax=Chroococcales RepID=Q6ZEK6_SYNY3 Length = 276 Score = 73.4 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 41/285 (14%), Positives = 97/285 (34%), Gaps = 28/285 (9%) Query: 21 ETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQGNPGYLHVVI 80 + FL + + L S E SL+ D L Q + L + + Sbjct: 3 DNLCKFLAESFSEDYAAWLLGRPIKLTKLSPTELSLEPIRADSLILEQSED----LVLHL 58 Query: 81 EHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATPYPLSMCWFDMFY 140 E Q++PD M FRM+ Y + R + + + + + + D F Sbjct: 59 EFQTEPDPTMGFRMLDYRVRVYRRFPQKTMHQFVIYL---------KRSSNDLVYQDSFQ 109 Query: 141 SPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE 200 E + + + P + +Q + L +L + + + + I++ Sbjct: 110 VGETLH-----RYQAIRLWEQPSEAFLQSPGLLPLAVLTQTSDPTLKLREVATALEQIED 164 Query: 201 GYTSGSQLVAMQNY--MLQRGHTEQADLFYGVLRDRETGGESMMTLAQWFEEKGIEKGIQ 258 + + A + +L + L ++++ E + + +G +G Sbjct: 165 NRVKANLMAATSVFGGILLAPELIKTILRSEIMKESAVYQEILEEGKIAGKLEGRLEGKL 224 Query: 259 QGRQEVSQEFAQR--------LLSKGMSREDVAEMANLPLAEIDK 295 +G+ E E L G++ ++A+ ++ + +++ Sbjct: 225 EGKLEGKLEGRLEAKLETIPLLKKLGLTITEIAKELDIDVELVNR 269 >UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=Bacteroides fragilis 3_1_12 RepID=UPI0001B4A8CA Length = 282 Score = 73.0 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 92/291 (31%), Gaps = 21/291 (7%) Query: 10 DAVFKQFLM-HAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQ 68 D FK+ H + FL LP+ L E + + E+ ++ V + Sbjct: 9 DLTFKRVFGEHPDLVMSFLNALLPLRLEESI--TDIEYLPSGMVPENSLPKNSIVYVRCR 66 Query: 69 MQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK--LPLVVPILFYQGEA 126 ++ +E Q + +M + A R +++ L V + Sbjct: 67 DSKGRSFI---VEMQMIWSPEFKQCVMFNASKAYVRQMDSGEQYDLLQPVYSLNLVNDIF 123 Query: 127 TPY-PLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQR 185 P ++ + + R + V++ + + + I ++ Sbjct: 124 EPDIKEYYHYYRLVHVEHTERVINGLHLVFVELPKFTPHTYSEKKMHILWLRYLTEIDEK 183 Query: 186 DLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMTLA 245 ++ + E + ++ + F+ ++ TL Sbjct: 184 T-----HEVPEELLENPEIKKAVTVLEESAFTPEQLLGYEKFWDIIS-------VEKTLI 231 Query: 246 QWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 E K E+G ++G + A +G+S + ++ + L EI+++ Sbjct: 232 SSAERKEKEEGRKEGELQEKLLVASNAKKQGLSLDIISSLTGLSAEEIERL 282 >UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W1F3_DESAS Length = 303 Score = 72.2 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 41/254 (16%), Positives = 89/254 (35%), Gaps = 14/254 (5%) Query: 55 SLKGHSTDVLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLP 114 +K D ++ ++ + +E Q+ K + RM+ Y + ++ + + + Sbjct: 55 EVKEKRIDFVFLLKDNS-----ILHLEFQTTIPKDILIRMVTYGSRLVEKYDQDVNTVVI 109 Query: 115 LVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAI 174 I L+ +++ Y + + P DEI R I + Sbjct: 110 YSGKIESAPRLLRKGSLTYKVKNIYMKKFDGDAEYKRIYEKIK-NKKPLDEIDIQRLIFL 168 Query: 175 LELLQKHIRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLR-- 232 + K + ++ + +L I + A+ E VLR Sbjct: 169 PLMKSKEKSEDEMAIQAAELAKEIPNEPIRAFTIGAIVAISDNFLTEEYKKRLLEVLRMT 228 Query: 233 ------DRETGGESMMTLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMA 286 E E + + E+G+++G+++G +E ++ A L +G E + ++ Sbjct: 229 QIEQWIREEGREEGLKEGLKEGREEGLKEGLKEGLREGLEKTAIAALREGFDIETIVKIT 288 Query: 287 NLPLAEIDKVINLI 300 NL EI + I Sbjct: 289 NLSKEEILSLKKKI 302 >UniRef50_D2RKL8 Tetracycline resistance leader peptide n=3 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RKL8_ACIFE Length = 285 Score = 72.2 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 94/290 (32%), Gaps = 31/290 (10%) Query: 11 AVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVLYSVQMQ 70 +F+ + + + +E L +++R L E + K D V ++ Sbjct: 16 FLFQHVMRNERLCKHLIEKFLNIQIRS---LQYQSFEKTIDLRLEGKSIRLD----VFVE 68 Query: 71 GNPGYLHVVIEHQSKPDK--KMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQGEATP 128 N G ++ IE Q +A R Y L+ L + + P Sbjct: 69 DNEGRVY-DIEMQCSNSPRNDLAKRSRFYQSLIDGELLDKGKPYEELNPSYVIFICTFDP 127 Query: 129 YPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLM 188 R + F +D +Q + L + Sbjct: 128 ---------------FHRGLPIYTFT----HCCKEDNRVQLKDEETRMFLNSKGSENAAD 168 Query: 189 LLLEQLVTLIDEGYTSGSQLVAMQNY--MLQRGHTEQADLFYGVLRDRETGGESMMTLAQ 246 + + +D G + ++ +++ + + R E+ Q Sbjct: 169 PDIAAFLRYVDGKAAEGRFVESLDQEVHLVKSMDKVRREYMILSDEIRRRQKEAAEEGWQ 228 Query: 247 WFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 +KG++KG+++GR++ + +L + + E ++ + + L +I K+ Sbjct: 229 EGMQKGMQKGMEKGREKEREANILGMLKEKIPVETISRITHYSLDQIQKL 278 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.118 0.283 Lambda K H 0.267 0.0363 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,444,459,736 Number of Sequences: 3077464 Number of extensions: 53636779 Number of successful extensions: 238486 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 998 Number of HSP's successfully gapped in prelim test: 839 Number of HSP's that attempted gapping in prelim test: 232335 Number of HSP's gapped (non-prelim): 3431 length of query: 300 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 172 effective length of database: 646,480,964 effective search space: 111194725808 effective search space used: 111194725808 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 92 (40.2 bits)