BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (306 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot...   382   e-105
UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re...   341   2e-92
UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap...   318   2e-85
UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q...   313   4e-84
UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7...   297   3e-79
UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX       291   2e-77
UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ...   285   2e-75
UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact...   284   2e-75
UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea...   284   3e-75
UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4...   268   2e-70
UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2...   268   2e-70
UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax...   243   6e-63
UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax...   242   1e-62
UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC    237   5e-61
UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri...   234   3e-60
UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide...   230   4e-59
UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ...   229   9e-59
UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L...   212   2e-53
UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q...   206   9e-52
UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton...   202   8e-51
UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett...   196   9e-49
UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R...   188   2e-46
UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri...   186   1e-45
UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus...   184   4e-45
UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ...   178   2e-43
UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri...   164   3e-39
UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett...   145   2e-33
UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK   138   2e-31
UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=...   137   4e-31
UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax...   136   8e-31
UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob...   134   5e-30
UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie...   132   2e-29
UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=...   127   4e-28
UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=...   125   1e-27
UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri...   124   6e-27
UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O...   121   3e-26
UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=...   121   4e-26
UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ...   117   6e-25
UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios...   113   7e-24
UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=...   112   1e-23
UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto...   104   3e-21
UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia...   101   4e-20
UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri...   100   6e-20
UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto...    97   5e-19
UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida...    97   7e-19
UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo...    97   9e-19
UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo...    96   1e-18
UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip...    94   4e-18
UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc...    93   1e-17
UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C...    89   2e-16
UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ...    89   2e-16
UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A...    89   3e-16
UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri...    86   2e-15
UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu...    85   4e-15
UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc...    83   9e-15
UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace...    82   2e-14
UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso...    82   3e-14
UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc...    81   5e-14
UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto...    77   7e-13
UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep...    76   2e-12
UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba...    75   3e-12
UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo...    74   8e-12
UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri...    73   1e-11
UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi...    72   2e-11
UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu...    72   3e-11
UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp...    70   8e-11
UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo...    70   1e-10
UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep...    69   2e-10
UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp...    69   3e-10
UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ...    69   3e-10
UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus...    67   9e-10
UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida...    67   9e-10
UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli...    67   1e-09
UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific...    67   1e-09
UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri...    66   2e-09
UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=...    66   2e-09
UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo...    65   2e-09
UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo...    65   4e-09
UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta...    64   5e-09
UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono...    64   6e-09
UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano...    64   6e-09
UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri...    64   8e-09
UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc...    63   1e-08
UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri...    63   1e-08
UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri...    62   2e-08
UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri...    62   2e-08
UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp...    60   1e-07
UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri...    60   1e-07
UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersini...    57   6e-07
UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto...    56   1e-06
UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri...    56   1e-06
UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=...    55   2e-06
UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF...    53   1e-05
UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri...    52   2e-05
UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri...    49   3e-04
UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr...    48   4e-04
UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice...    47   6e-04
UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp...    46   0.001
UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri...    46   0.002
UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice...    46   0.002
UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo...    45   0.003
UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc...    45   0.003
UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfuri...    45   0.003
UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostri...    44   0.007
UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo...    44   0.010
UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobac...    43   0.014
UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri...    41   0.041
UniRef50_B1EI63 Putative uncharacterized protein n=1 Tax=Escheri...    40   0.070
UniRef50_A7N2B6 Putative uncharacterized protein n=1 Tax=Vibrio ...
40 0.080 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 382 bits (982), Expect = e-105, Method: Compositional matrix adjust. Identities = 183/294 (62%), Positives = 232/294 (78%), Gaps = 5/294 (1%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT TTSTPHDA+FK+FL HPDTARDF++IHLP LR+LCDL +LKLE SF+DE LR Sbjct: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SD+LWSVKT+EG GYIYVVIEHQS+ + MAFR+MRYS+A MQ H++ ++ LPLV+P Sbjct: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 MLFYHG RSPYP+SLCWLDEFA+P ARK+Y++AFPLVD+TVVPDDEI+QHR++ALLELI Sbjct: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIRQRDL+GL+DQ+V LLVT ND Q+ AL NY+L TGD RF FI E+ R PQ Sbjct: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 Query: 241 RERIMTIAERIHNDGYIKGEQ----RILRLLLQNGADPEWIQKITGLSAEQMQA 290 +E++MTIA+R+ +G ++G+ RI + +L G D E + +T LS + + A Sbjct: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 341 bits (874), Expect = 2e-92, Method: Compositional matrix adjust. 
Identities = 159/261 (60%), Positives = 204/261 (78%), Gaps = 1/261 (0%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT TTS+PHDA+FKTF+ P+TARDF+EIHLP+ LR+LC+L +L+LE SF+++ LRA Sbjct: 1 MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SD+LWSV+T EGDGYIY VIEHQS + +MAFRLMRY+ A MQRH++ + +PLV+P Sbjct: 61 YSDVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDR-VPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG SPYP+SL WLDEF DP AR+LY AFPLVD+T+VPDDEI+QHRR+ALLELI Sbjct: 120 LLFYHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR RDL+G++D++ LLV NDSQ+ L NY+L GD +RF FI E+ R P Sbjct: 180 QKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQ 239 Query: 241 RERIMTIAERIHNDGYIKGEQ 261 +E +MTIAER+ +G+ G Q Sbjct: 240 KEILMTIAERLRQEGHQIGWQ 260 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 318 bits (814), Expect = 2e-85, Method: Compositional matrix adjust. 
Identities = 159/308 (51%), Positives = 212/308 (68%), Gaps = 21/308 (6%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 TT TPHDA F+ FLT PD ARDFME+HLP +LR +CDL +LKLES SFV++ LR SD+ Sbjct: 6 TTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDV 65 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S+KT GDGYI+V++EHQS D HMAFRL+RY++A MQRH+E ++ LPLVIP+LFY Sbjct: 66 LYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKK-LPLVIPVLFY 124 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 G RSPYP+S WLDEF D A KLY++AFPLVDVTV+PDDEI HR +A L L+QKHI Sbjct: 125 TGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHI 184 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 QRDL L+D+L +L+ + SQ+ +L++YI+ G+ + F+ EL +R+PQH + + Sbjct: 185 HQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDAL 244 Query: 245 MTIAERIHNDGYIK----GEQR----------------ILRLLLQNGADPEWIQKITGLS 284 MTIA+++ G K GEQR I R +LQN D + K+TGL+ Sbjct: 245 MTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTGLT 304 Query: 285 AEQMQALR 292 + + +R Sbjct: 305 EDDLAQIR 312 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 313 bits (803), Expect = 4e-84, Method: Compositional matrix adjust. 
Identities = 157/308 (50%), Positives = 212/308 (68%), Gaps = 21/308 (6%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 +T TPHDA F+ FLT P+ ARDFME+HLP +LR +CDL +LKLES SFV++ LR SD+ Sbjct: 6 STPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDV 65 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S+ T EG+GY++V+IEHQS D HMAFRL+RY++A MQRH+E + LPLVIP+LFY Sbjct: 66 LYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAK-LPLVIPVLFY 124 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 G RSPYP+S WLDEF DP A KLY+ AFPLVDVTV+PDD+I++HR +A L L+QKHI Sbjct: 125 VGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQKHI 184 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 QRD+ L D+L LL+ + + Q+ AL++Y+L G+ A F+ EL +R+PQH + + Sbjct: 185 HQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDAL 244 Query: 245 MTIAERIHNDGYIK------------GEQR--------ILRLLLQNGADPEWIQKITGLS 284 MTIA+++ G K GEQR + R LL+ G E +Q+ TGLS Sbjct: 245 MTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATGLS 304 Query: 285 AEQMQALR 292 + + +R Sbjct: 305 EDDLAQIR 312 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 297 bits (761), Expect = 3e-79, Method: Compositional matrix adjust. 
Identities = 153/334 (45%), Positives = 208/334 (62%), Gaps = 44/334 (13%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T TPHDA+FK FL+H DTARDF+EIHLP LR +CDLD+L+LES SF+++ LR Sbjct: 1 MKRKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVH 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SDIL+S+KT +G+ Y+Y VIEHQS D MAFRLMRYS++ MQ H+E ++ LPLVIP Sbjct: 61 YSDILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKK-LPLVIP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG PYPWS W D F A ++Y++AFPLVDVTV+PDDEI+ H+RVALLE++ Sbjct: 120 VLFYHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIV 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIRQRD+ L +L +L + + ++LNYILL GD A FI +L + P++ Sbjct: 180 QKHIRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKY 239 Query: 241 RERIMTIAERIHNDGYIKGEQ--------------------------------------- 261 E +MTIA+++ + G+ +G + Sbjct: 240 EEVLMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEK 299 Query: 262 ----RILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +I R L+ NG D E I K TGLS +++ + Sbjct: 300 RASLKIARALMDNGIDRETIMKSTGLSQNELEQI 333 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 291 bits (745), Expect = 2e-77, Method: Compositional matrix adjust. 
Identities = 143/250 (57%), Positives = 184/250 (73%), Gaps = 5/250 (2%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 N TT TPHDA F++FL +PD ARDF+E+HLP + R+LCDL +LKLE A+FV+ L S Sbjct: 6 NTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYAS 65 Query: 63 DILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 DILWSVKT G DGY+Y +IEHQS E+++M FR++RYS+A MQRH+E K LPLVIP+ Sbjct: 66 DILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQHKT--LPLVIPV 123 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 LFYHG RSPYP+S+ WLD F +P A K+Y FPLVD+TVV D+EI+ HRR+A L L+ Sbjct: 124 LFYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLM 183 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 KHIRQRD++ +D LV L + ++ QIT L NY LL G E EF+ L +R+PQH Sbjct: 184 KHIRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNY-LLNGSEHVTVEFLQTLAQRLPQHE 241 Query: 242 ERIMTIAERI 251 + IMT+AER+ Sbjct: 242 DSIMTLAERL 251 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 285 bits (728), Expect = 2e-75, Method: Compositional matrix adjust. 
Identities = 138/268 (51%), Positives = 191/268 (71%), Gaps = 6/268 (2%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++TPHDA+FK FL H +TARDF++IHLP +LRELCDLD+L LES SF++E L+ +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE--HDKRQPLPLVIPMLF 123 +SV+ + GY++VVIEHQS+ D MAFR+MRYS+A M RH+E HDK LPLV+P+LF Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVVPILF 121 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 Y G +PYP S+CW D F P AR++YN+ FPLVD+T+ PDDEI+QHRR+A+LEL+QKH Sbjct: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 IRQRDLM L++QLV L+ + SQ+ A+ NY+L G + + F L R + Sbjct: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKS- 240 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNG 271 +MT+A+ G KG ++ + ++ G Sbjct: 241 MMTLAQWFEEKGIEKGIEKGIEKGMEKG 268 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 284 bits (727), Expect = 2e-75, Method: Compositional matrix adjust. 
Identities = 139/259 (53%), Positives = 187/259 (72%), Gaps = 6/259 (2%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++TPHDA+FK FL H +TARDF+EIHLP +LRELCDL++L LES SF++E L+ +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE--HDKRQPLPLVIPMLF 123 +SV+ + GY++VVIEHQS+ D MAFR+MRYS+A M RH+E HDK LPLV+P+LF Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDK---LPLVVPILF 121 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 Y G +PYP S+CW D F P AR++YN+ FPLVD+T+ PDDEI+QHRR+A+LEL+QKH Sbjct: 122 YQGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKH 181 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 IRQRDLM L++QLV L+ + SQ+ A+ NY+L G + + F L R E Sbjct: 182 IRQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGES 240 Query: 244 IMTIAERIHNDGYIKGEQR 262 +MT+A+ G KG Q+ Sbjct: 241 MMTLAQWFEEKGIEKGIQQ 259 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 284 bits (727), Expect = 3e-75, Method: Compositional matrix adjust. 
Identities = 141/299 (47%), Positives = 198/299 (66%), Gaps = 9/299 (3%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T TPHDA+FK FL+ +TA+DF +I LP +++ LCDLDSLK+ES SF+D +++ Sbjct: 7 MTKKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNY 66 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SV T +G GYIYV+IEHQS D +A+RLMRYS+A MQ+H+E +Q LPLV P Sbjct: 67 QSDILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQ-LPLVFP 125 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFY G +SP+P+S WLD F D A +YN F L DVT + D EI+QH+R+ALLEL+ Sbjct: 126 ILFYCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELL 185 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR+RD+ L+D +V LL D+Q+ + NY++ G+ R EFI+ + ++ +H Sbjct: 186 QKHIRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKH 245 Query: 241 RERIMTIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +MTIA++I G KG Q+ + + L NG D ++ TGLS E++ Sbjct: 246 EGALMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNKF 304 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 268 bits (685), Expect = 2e-70, Method: Compositional matrix adjust. 
Identities = 142/309 (45%), Positives = 193/309 (62%), Gaps = 21/309 (6%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M+ T TPHDA+F+ FL TA+DF +I LP D++ LCD ++LK ES SF+D ++ Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE--HDKRQPLPLV 118 SDIL+SV DGY+Y +IEHQS D MA+RLMRYSMA MQRH+E HDK LPLV Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDK---LPLV 117 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 P+LFY G +SP+P+S WLD F P A K+Y+ F L+DVT + DD I+QHRR+ALLE Sbjct: 118 FPVLFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLE 177 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 LIQKHIR+RD+ L+D +V LL D+Q+ ++NY++ G+ A FI+E+ +R Sbjct: 178 LIQKHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAE 237 Query: 239 QHRERIMTIAERIHNDGYIKGEQ----------------RILRLLLQNGADPEWIQKITG 282 +H E +MTIAE + +GY G +I R +L G + ++ TG Sbjct: 238 KHEEALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTG 297 Query: 283 LSAEQMQAL 291 LS + L Sbjct: 298 LSDNALDNL 306 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 268 bits (685), Expect = 2e-70, Method: Compositional matrix adjust. 
Identities = 141/324 (43%), Positives = 203/324 (62%), Gaps = 38/324 (11%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T HDALFK FLTHP+ ARDF +HLP ++ LCDL +L+LE ASFV+ +LR L Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP----LP 116 HSD+L+SV+ EG+GYIY +IEHQS+ D M FRLM Y+M+ + H+ K+ P LP Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHL---KKSPADKTLP 117 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 LV+P LFY GS PYP+S+ WLD FADP A++LY +FPLVD++V+ D+EI+ H+ +AL Sbjct: 118 LVVPFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIAL 177 Query: 177 LELIQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTG---DEARFNEFISE 232 LEL+QKHIR RD LM ++ + ++ ++ Q+ +++ YI G DE+R F S+ Sbjct: 178 LELVQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESR---FFSQ 234 Query: 233 LTRRMPQHRERIMTIAERIHNDGYIK------------------------GEQRILRLLL 268 L P+++ + TIAE++ G K G +++ R LL Sbjct: 235 LIALSPEYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLL 294 Query: 269 QNGADPEWIQKITGLSAEQMQALR 292 Q G D I + TGL+ E++++L+ Sbjct: 295 QQGVDLNIIMQCTGLTREKIESLK 318 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 243 bits (620), Expect = 6e-63, Method: Compositional matrix adjust. 
Identities = 139/308 (45%), Positives = 183/308 (59%), Gaps = 59/308 (19%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDA+FK FL+ ARDF+ IHLP +RE CD ++L+LESASF+DEKLRA SD+L+S+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE--HDKRQPLPLVIPMLFYHGS 127 T G GYIY VIEHQSR + MAFRL+RY +A MQ+H++ HD+ LPLV+P+LFYHG Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDR---LPLVVPLLFYHGR 120 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 PYP+SL WLD FA P A+ LY FPLVD+TV+PDDEI HRR+ALLEL+QKHIR R Sbjct: 121 SRPYPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTR 180 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 D++ L ++ +L F + + L+ +E IMTI Sbjct: 181 DMLELAREIGLL--------------------------FERWAAPLSI----GQEDIMTI 210 Query: 248 AERIHNDGYIKGEQR------------------------ILRLLLQNGADPEWIQKITGL 283 AE++ G+ +G QR I R LL G D +Q+ T L Sbjct: 211 AEQLKKMGFDEGIQRGIQQGLAQGLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQL 270 Query: 284 SAEQMQAL 291 E+++ L Sbjct: 271 ETEELEQL 278 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 242 bits (618), Expect = 1e-62, Method: Compositional matrix adjust. 
Identities = 128/299 (42%), Positives = 189/299 (63%), Gaps = 14/299 (4%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 S PHDALFK FL+H AR F+EIHLP+ +RE CDLD L++ +F++ L AL+SD+ Sbjct: 3 VVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYSDV 62 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L S+KT +G+GYIY +IEHQS D HM R+MRY++A +QRH++ + +PLVIP+LFY Sbjct: 63 LLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLD-EGHHDVPLVIPILFY 121 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 G SPYP+S+ WL+ F +P A++++ +FPLVDVTV+PD+EI+ HR VA LE+ K I Sbjct: 122 QGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKII 181 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 R RD++ ID + LL + +D I + Y+L G+ + + L + PQ +I Sbjct: 182 RLRDILENIDPMATLLALDYNDDLSIDVVF-YLLRYGNTDDREKIVKILIQAKPQLEGKI 240 Query: 245 MTIAER--------IHNDGYIKGEQRIL----RLLLQNGADPEWIQKITGLSAEQMQAL 291 MTI E+ +G +G Q ++ + +L+ D I K+TGLS +++ L Sbjct: 241 MTIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQL 299 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 237 bits (604), Expect = 5e-61, Method: Compositional matrix adjust. 
Identities = 135/305 (44%), Positives = 187/305 (61%), Gaps = 23/305 (7%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD LFK FL PDTARDF+ +HLP D+R LD+LKLE SFVD+KLR LHSD+L+SV+ Sbjct: 12 HDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLRELHSDVLYSVE 71 Query: 70 TREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSR 128 T EG GYIY ++EHQS D MA+R+MRYSMAVM H++ LP+V+P+LFY G Sbjct: 72 TAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGN-GTLPVVVPLLFYQGMV 130 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 PYP+S W+D F P AR++Y+ +PLVDV+V+ D ++ HRR+ALLEL+Q+ IR RD Sbjct: 131 RPYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLELVQRDIRHRD 190 Query: 189 LMGLIDQLVVLLVTECANDSQITALLNYILLTG-DEARFNEFISELTRRMPQHRERIM-T 246 L+ +V L+ +Q+ A+L YI+ G F+ EL +P+++E IM T Sbjct: 191 AASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGEIPEYKELIMGT 250 Query: 247 IAERIH---------------NDGYIKGEQRIL----RLLLQNGADPEWIQKITGLSAEQ 287 IA+++ ++ EQ+ L LL NG E + K TGL+ E Sbjct: 251 IAQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEVVIKSTGLNRET 310 Query: 288 MQALR 292 ++ R Sbjct: 311 LEQPR 315 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 234 bits (597), Expect = 3e-60, Method: Compositional matrix adjust. 
Identities = 125/323 (38%), Positives = 183/323 (56%), Gaps = 32/323 (9%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + PHD+ FK F++ D ARDF E+HLP ++ LC+ D+LKL SASFVD+ LR+ Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SD+L+SV+T +G GY Y ++EHQS D M +RLM Y+ M +H++ Q LPLV+P Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQG-HQSLPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG++SPYP+S W D F A LY PLVDVTV DDE++ HR+VA +EL+ Sbjct: 120 ILFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELV 179 Query: 181 QKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH R D+ GL ++L +L + + ++NY+ D + + L + + Sbjct: 180 FKHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEK 239 Query: 240 HRERIMTIAERIHNDGYIKG------EQRILR------------------------LLLQ 269 H+E +M IA+R+ N+G KG E+R++ + L+ Sbjct: 240 HQETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLK 299 Query: 270 NGADPEWIQKITGLSAEQMQALR 292 G + I +ITGLS + ALR Sbjct: 300 LGLSVDIISQITGLSPSDIHALR 322 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 230 bits (587), Expect = 4e-59, Method: Compositional matrix adjust. 
Identities = 127/324 (39%), Positives = 184/324 (56%), Gaps = 33/324 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT + PHD+ FK F++ D ARDF EI+LP ++ LC+LD+LKL SASF+D+ LR+ Sbjct: 5 MTMQLIARPHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSR 64 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SD+L+SV+T +G GY Y+++EHQS D M +RLM Y+ M +H++ LPLV+P Sbjct: 65 FSDMLYSVQTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGN-NALPLVVP 123 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG +SPYP+S W D F A LY PLVDVTV DDEIV HR+VA +EL+ Sbjct: 124 ILFYHGKQSPYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELV 183 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECAND-SQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH RD + ++ + + +++E N + ++NY+ D + + + L + Sbjct: 184 LKHSTLRDDLIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEG 243 Query: 240 HRERIMTIAERIHNDGYIKG-----------------------EQRILR--------LLL 268 ++E +MTIA+R+ N+G KG EQ I R L Sbjct: 244 YQETVMTIADRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSL 303 Query: 269 QNGADPEWIQKITGLSAEQMQALR 292 G + I KITGL ++QA+R Sbjct: 304 DLGLSIDIISKITGLPHSEIQAMR 327 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 229 bits (584), Expect = 9e-59, Method: Compositional matrix adjust. 
Identities = 107/198 (54%), Positives = 144/198 (72%), Gaps = 1/198 (0%) Query: 96 MRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF 155 MRY++A MQ H++ + LP+V+P+LFYHG SPYP+SLCWLD FADP AR+LY +AF Sbjct: 1 MRYAIAAMQNHLDAGYKT-LPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAF 59 Query: 156 PLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLN 215 PL+DVT++PDDEI+ HRR+ALLELIQKHIRQRDLMGL++Q+ LL + AN QI L N Sbjct: 60 PLIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFN 119 Query: 216 YILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPE 275 YIL TGD RFN+FI + +R P+H+ +MTIAER+ +G I +++L++G Sbjct: 120 YILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLRQEGEQSKALHIAKIMLESGVPLA 179 Query: 276 WIQKITGLSAEQMQALRQ 293 I + TG+S E++ A Q Sbjct: 180 DIMRFTGVSEEELAAASQ 197 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 212 bits (539), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 116/325 (35%), Positives = 187/325 (57%), Gaps = 46/325 (14%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 + HDA FK F+ + A+DF IHL +L+ CD +LKL+++SF+D KLR+ SDIL+S Sbjct: 8 SSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSRMSDILYS 67 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 VKT++G+ IY +IEHQSR D +A+R+M Y+ M +H++ LPLV+P+LFYHG Sbjct: 68 VKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQG-YTSLPLVVPILFYHGK 126 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 R PYP+S+ WLD F T A +LY F L+D+ + D+ ++ HR+ A++E+ KH+ Sbjct: 127 RKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAMKHVNSC 186 Query: 188 DLMGLIDQLVVLLV-----TECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 D + D+L +LL C+++ I A++ Y+ D A F I+++ ++ HRE Sbjct: 187 DDL---DKLAMLLSKAINQKNCSDEDTI-AVVQYLFSIMDAADFESIINKIAEQVDNHRE 242 Query: 243 RIMTIAERIHNDGYIKGE-------------------------------QRILRL----- 266 IM IA R+ N G+ G+ ++I+++ Sbjct: 243 TIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQLAKN 302 Query: 267 
LLQNGADPEWIQKITGLSAEQMQAL 291 LL+ + E+I++ITGLS ++++ L Sbjct: 303 LLKENVELEFIERITGLSIQELKIL 327 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 206 bits (524), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 110/267 (41%), Positives = 162/267 (60%), Gaps = 25/267 (9%) Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 L +L + S SF+++ L + SD+L+S+K+ GD YIY +IEHQS + MAFRL+RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 102 VMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT 161 M RH+E + +Q LP+VIP+LFYHGS SPYP++ WLD FAD A +Y AFPLVDVT Sbjct: 63 AMHRHLEQENKQ-LPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVT 121 Query: 162 VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG 221 + D+EI++HRR+AL+E++QKHIR R+++ L +L LL + Q L+ Y++L G Sbjct: 122 AMEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAG 181 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ-------------------- 261 + F+ L + P +RE +MTIAE++ G KG Q Sbjct: 182 NTTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKK 241 Query: 262 ----RILRLLLQNGADPEWIQKITGLS 284 +I R L NG + + ++ TGL+ Sbjct: 242 QATLKIARQFLVNGVERDIVKMSTGLT 268 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 202 bits (515), Expect = 8e-51, Method: Compositional matrix adjust. 
Identities = 121/308 (39%), Positives = 177/308 (57%), Gaps = 28/308 (9%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 STPHD LFK F AR+F EIHLP + ++ SLK+ SF+D+ L+ HSD+++ Sbjct: 4 STPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDMVY 63 Query: 67 SVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE--HDKRQPLPLVIPMLF 123 S +T G +GY+Y V+EHQS +D MAFR+ +YS+AVMQ+H++ HD LPLV+P+LF Sbjct: 64 SFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDT---LPLVLPVLF 120 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 YHG +SPYP S+ W D F + AR L + FPLVDVT++P++EI++H ++ LE+ QK Sbjct: 121 YHGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKM 180 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 + RD+M + L+ L ND +LL Y+ G+ A F L+ RE Sbjct: 181 VHTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSST--TQREN 238 Query: 244 IMTIAERIH--------------------NDGYIKGEQRILRLLLQNGADPEWIQKITGL 283 +MTIAE + +G +G + I + LL NG + ++ TGL Sbjct: 239 VMTIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGL 298 Query: 284 SAEQMQAL 291 S + + L Sbjct: 299 SEDSLNKL 306 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 196 bits (498), Expect = 9e-49, Method: Compositional matrix adjust. 
Identities = 111/300 (37%), Positives = 174/300 (58%), Gaps = 19/300 (6%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDA+FKTF T + A F+ I+LPK +++ CD +LK+E SFVD L+ HSDIL+S Sbjct: 7 NAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDILYS 66 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 +K GY+Y+ +EHQS + M FR+ RY +A+MQ+H+ ++ LPLVI MLFYHG Sbjct: 67 LKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKK-LPLVISMLFYHG- 124 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 + YP+ L +D D A+ + L+D+ V+PD+EI +H+++A LE++QKHI R Sbjct: 125 KGQYPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQKHIFTR 184 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 DL + D +V L+ + L+ Y+L+ G+ A N+ I +L + + + E IM Sbjct: 185 DLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKL-KTIEDYEEDIMNA 243 Query: 248 AERIH------------NDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 A+++ +G KGE R I + L+ G ++IQ +T LS ++ +L Sbjct: 244 AQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEVLSL 303 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 188 bits (477), Expect = 2e-46, Method: Compositional matrix adjust. 
Identities = 98/205 (47%), Positives = 138/205 (67%), Gaps = 9/205 (4%) Query: 91 MAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL 150 M FR++RYS+A MQRH+E K LPLVIP+LFYHG RSPYP+S+ WLD F +P A K+ Sbjct: 1 MPFRMLRYSVAAMQRHLEQHKT--LPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKI 58 Query: 151 YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI 210 Y FPLVD+TVV D+EI+ HRR+A L L+ KHIR RD+M L+D+L ++V +D Q+ Sbjct: 59 YTKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMVE--ISDEQV 116 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE-QRILRLLLQ 269 L++YI+ GD EF+ L R+PQH +++MTIAER+ G +G ++ L + Q Sbjct: 117 RVLIHYIVNAGDSVS-PEFMRALAERLPQHEDKLMTIAERLEQKGRQEGALEKALAIACQ 175 Query: 270 ---NGADPEWIQKITGLSAEQMQAL 291 G PE I++ TGLS +++ + Sbjct: 176 LQKMGMTPEQIKQATGLSEAELKNI 200 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 186 bits (471), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 115/302 (38%), Positives = 166/302 (54%), Gaps = 20/302 (6%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 TS HDA F+ L P ARDF+E L + C+LD+++LE +FV E LR D+L Sbjct: 8 TSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSACDVL 67 Query: 66 WSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 S+KT +G DGYIY +IEHQS D + R+MRY +AVM++HIE K P+VIP+LFY Sbjct: 68 LSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEEHKCA--PVVIPVLFY 125 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYN--AAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 HG++ PYP+ + W+D DP R++Y F LVDV+ + DDEI + R+A L K Sbjct: 126 HGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMFTMK 185 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 D++ LI + + L + + + +L Y LL + F E ++ P H+ Sbjct: 186 SGTSGDVIELIGKSIT-LTDKYGSSVHLNTVLTY-LLELYQMDFAELSEAVSTHYPSHKG 243 Query: 243 RIMTIAERIHNDGYIKG------------EQRILRLLLQNGADPEWIQKITGLSAEQ-MQ 289 IMTIAE++ G KG R++ ++ Q G E I+ L+ EQ +Q Sbjct: 244 
VIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDEQLLQ 303 Query: 290 AL 291 AL Sbjct: 304 AL 305 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 184 bits (466), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 90/236 (38%), Positives = 147/236 (62%), Gaps = 2/236 (0%) Query: 25 RDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQ 84 + F IHLP++L+ CD +L+L+++SF+D KLR+ SDIL+ VKT+EGD IY++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 85 SREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADP 144 SR D +A+R+M Y+ M +H++ + LPLV+P+LFYHG + PYP+ + W++ F Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKS-LPLVVPILFYHGKKKPYPFPVNWMECFPLS 124 Query: 145 TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ-RDLMGLIDQLVVLLVTE 203 + A +Y+ F L+D+T + DD ++ H++ A++E+ KH+ DL + L + + Sbjct: 125 SLANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQK 184 Query: 204 CANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG 259 D A++ Y+ D + F I+++ R+ HRE IM IA R+ N G+ G Sbjct: 185 NCRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLG 240 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 178 bits (452), Expect = 2e-43, Method: Compositional matrix adjust. 
Identities = 111/267 (41%), Positives = 148/267 (55%), Gaps = 20/267 (7%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 TT TPHDA F+ FLT PD ARDFME+HLP +LR +CDL +LKLES SFV++ LR SD+ Sbjct: 6 TTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDV 65 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIPMLF 123 L+S+KT GD I++ + S+ ++ F + A MQRH+E ++ LPLVIP+LF Sbjct: 66 LYSLKTTAGDD-IFMSWLNTSQHLTNICFPPDTLCVGAAMQRHLEAGHKK-LPLVIPVLF 123 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP-LVDVTVVPDDEIVQHRRVALLELIQK 182 Y G RSPYP+S WLDEF D R+ LVDVTV+PDDEI HR +A L L+ + Sbjct: 124 YTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLLPE 183 Query: 183 HIR-----QRDLMG---LIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 +I Q L G ++ V + A N R + Sbjct: 184 NIFISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNI--------RRRSLCTRTG 235 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQ 261 QH + +MTIA+++ G KG Q Sbjct: 236 TACAQHGDALMTIAQQLEQKGIEKGIQ 262 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 164 bits (416), Expect = 3e-39, Method: Compositional matrix adjust. 
Identities = 98/301 (32%), Positives = 171/301 (56%), Gaps = 17/301 (5%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 S PHD L K L+HP+ ++F + + P D+ + DL SLKL + S+V E+LR H+D+++ Sbjct: 10 SNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHNDLVF 69 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYH 125 S + GY + V+EHQS D MA R ++Y++A+++ +I E ++ P P+++ + YH Sbjct: 70 SFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNICLYH 129 Query: 126 GSR-SPYPWSLCWLDEFADPTTARKL-YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 + PYP+S D F DP TA+ L F L D+ P++ + QH + L+E + K+ Sbjct: 130 NANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEKLLKY 189 Query: 184 IRQRDLMGLIDQLV-----VLLVTECANDSQITALLNYILLTGDEARF-NEFISELTRRM 237 R RD+ +I++ + L+V D T L+ + G E + + +S + Sbjct: 190 SRHRDIFNVIEKELKRSKGYLIVR---GDYWKTILIYSSYVIGQEEKSEKDLVSLFKEVL 246 Query: 238 PQHRERIM-TIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQALR 292 ++ E IM TIA+ I G ++G++R I + +L+ G + +I++ITGLS + ++ L+ Sbjct: 247 SKNEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIEKLK 306 Query: 293 Q 293 Q Sbjct: 307 Q 307 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 145 bits (366), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 84/301 (27%), Positives = 164/301 (54%), Gaps = 15/301 (4%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T HD LFK L+ A F++ L ++ +L ++++L+L SFV + R +HSDI Sbjct: 4 TIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIHSDI 63 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIH-MAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 ++ + E GYI+ ++EH+S + MAFR ++Y+++ M ++ ++ LP+V+P+ Sbjct: 64 VYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKK-LPIVLPICV 122 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 YHG +SPYP S D F + AR++ F L+D+TV+ D+E+ + L+E++ KH Sbjct: 123 YHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLLKH 182 Query: 184 IRQRDLMGLID---QLVVLLVTECANDSQITALLNYILLTGDEA--RFNEFISELTRRMP 238 R ++ + ++ + + L+ + + + I T DE+ + + L+ P Sbjct: 183 SRAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLSTAFP 242 Query: 239 QHRERIMTIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQA 290 + + +MT A+++ +G +G ++ I + LL +G + +Q++TGLS +++ Sbjct: 243 EEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEVMN 302 Query: 291 L 291 L Sbjct: 303 L 303 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 138 bits (348), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 75/191 (39%), Positives = 117/191 (61%), Gaps = 4/191 (2%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 HDAL K LT A++F+E +LP D +EL DL +K+E SFV++ L+ +SDI++SV Sbjct: 6 KHDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSV 65 Query: 69 KTR-EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 KTR + + ++YV+IE QS D +A RL +Y + + +RH + + LPL+ P+L Y+GS Sbjct: 66 KTRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH--ENNKNKLPLICPLLIYNGS 123 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 Y + + F P A+KL + LVD+ DDEI Q + + ++E KHI QR Sbjct: 124 -EVYNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQR 182 Query: 188 DLMGLIDQLVV 198 D++ L D+ ++ Sbjct: 183 DMLKLWDEFLI 193 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 90/298 (30%), Positives = 163/298 (54%), Gaps = 31/298 (10%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 HD+L K +T A++F+E +LP+D ++L DL + +E S+++E L +SDI++ + Sbjct: 6 KHDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGI 65 Query: 69 KTRE-GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 +T+E G G++Y++IE QS D A RL +Y++ + +RH E KR LPLV ++ Y+G Sbjct: 66 ETKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERHKE--KRNKLPLVYNLVIYNGK 123 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 + W D F + A+KL + LVD+ + D+EIV+ + + +L+ I KHI +R Sbjct: 124 QVYNAPRNLW-DLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHER 182 Query: 188 DLMGLIDQLV-----VLLVTECANDSQITALLNYI---LLTGDEAR----FNEFISELTR 235 D++ L +Q + V+++ + + + L Y + + R F++++S Sbjct: 183 DMIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLS---- 238 Query: 236 RMPQHRERIM-TIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLS 284 PQH++ IM TIA+ ++G +G++ I + + G I ++TGL Sbjct: 239 --PQHKDNIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLK 294 >UniRef50_D0YJF1 
Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 136 bits (343), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 74/168 (44%), Positives = 106/168 (63%), Gaps = 20/168 (11%) Query: 144 PTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTE 203 P TA+ LY F L+DVTV+PDD++VQHRRVALLEL+QKHIRQRDL + + L +++ Sbjct: 23 PETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSSITESLAAVVMLG 82 Query: 204 CANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERI------------ 251 N Q+ L +Y+L G+ A F+ L RR+PQ+ E +M+IA+++ Sbjct: 83 YTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKLKQEGRQEGRLEG 142 Query: 252 ----HNDGYIKGEQR-ILRL---LLQNGADPEWIQKITGLSAEQMQAL 291 H +G +G +R LR+ +LQNG D E +QKITGLSA+++Q L Sbjct: 143 REEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQPL 190 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 134 bits (336), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 67/150 (44%), Positives = 97/150 (64%), Gaps = 24/150 (16%) Query: 163 VPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGD 222 +PDD+I+QHRR+ALLELIQKHIR+RDLMGL+++L +LLV AND+Q+ AL NY++ G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 223 EARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ--------------------- 261 F EF+ E+ R+PQH++++MTIAER+ +G++ G Q Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 262 ---RILRLLLQNGADPEWIQKITGLSAEQM 288 RI + +G DP I +ITGL+AE + Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDL 150 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 75/178 (42%), Positives = 103/178 (57%), Gaps = 26/178 (14%) Query: 135 LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLID 194 +CWL FADP AR++Y FPL+D+T PDDEI++HRRVA+LEL+QKHIRQRDLM L + Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 195 QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ--HRERIMTIA---- 248 QLV LL + Q+ LL+Y+L G+ A F+ L + +P+ H+E +M IA Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 249 ERIHNDGYIKG--------------------EQRILRLLLQNGADPEWIQKITGLSAE 286 +R H G +G +RI R +L NG D + K+TGL+ E Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPE 178 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 127 bits (320), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 64/179 (35%), Positives = 109/179 (60%), Gaps = 2/179 (1%) Query: 13 LFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTRE 72 +F+ L +P A +F HLP +++ L D SL +E+ +FV+ L+ SD+L+S K + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 73 GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSRSPY 131 DGY+++++EHQS+ D MAFRL +Y + + +R+ I++ K + LPL+ PM+F++G Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQEKYN 142 Query: 132 PWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLM 190 W D F + A++L+ + LV+V +PD+E Q +LE KHI +R+L+ Sbjct: 143 VARNLW-DLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELL 200 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 125 bits (315), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 86/288 (29%), Positives = 146/288 (50%), Gaps = 31/288 (10%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 +N + HD LFK ++ P AR+F+E +LP + +L+S+K+E SFV E LR Sbjct: 33 SNTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRL 92 Query: 62 SDILWSVKTREGD--------------GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH- 106 SD+++SV + + Y+YV+IEHQS D +AFRL +Y + + +RH Sbjct: 93 SDVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHK 152 Query: 107 --------IEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLV 158 + +K LPL+ P++ Y + PY + + F D TA+ + + LV Sbjct: 153 DANNNKSSVTKEKDNKLPLICPIVVYANDK-PYNAPRSFWELFEDSKTAKDMMGDEYLLV 211 Query: 159 DVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYIL 218 D+ DDEI + + + ++E + KHI+ RD++ L L+ + D + + L Sbjct: 212 DLQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWL 271 Query: 219 LTGDEARFNEFIS-ELTRRMPQH-----RERIM-TIAERIHNDGYIKG 259 L +A+ +E EL + +H +E +M TIA++ ++G KG Sbjct: 272 LWYSDAKVSEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKG 319 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 124 bits (310), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 61/124 (49%), Positives = 83/124 (66%) Query: 170 QHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEF 229 +H +ALLELIQKHIRQRDLMGL++Q+ LL + AND QI L NYIL TGD RFN+F Sbjct: 13 RHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFNDF 72 Query: 230 ISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 I + R P+H+E +MTIAER+ +G I +++L++G I + TG+S E++ Sbjct: 73 IDGVAERSPKHKESLMTIAERLRQEGEQSKALHIAKIMLESGVPLADIMRFTGVSEEELA 132 Query: 290 ALRQ 293 A Q Sbjct: 133 AASQ 136 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 121 bits (303), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 85/271 (31%), Positives = 142/271 (52%), Gaps = 20/271 (7%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 + + HD LFK + P A DF+ LP +++ + DL+++K+E SFV+ LR D+ Sbjct: 2 SENLKHDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDV 61 Query: 65 LWSVKTR-EGDGYIYVVIEHQSREDIHMAFRLMRYSMAV-----MQRHIEHDKRQPLPLV 118 L+SVKT+ D +IYV+IE + R D +AF+L +Y++++ +R LP+V Sbjct: 62 LFSVKTKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIV 121 Query: 119 IPMLFYHGS-RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 +P++ YHG+ R P SL L F DP A++L + + L+D +PD EI + AL+ Sbjct: 122 VPIVVYHGADRFNAPRSLWEL--FDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALV 179 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQ-----ITALLNYILLT---GDEARFNEF 229 ++ Q D++ L + L D + I +LL Y + ++ R + Sbjct: 180 HFMKYIHNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQL 239 Query: 230 ISELTRRMPQHRERIM-TIAERIHNDGYIKG 259 + E + R+RIM TIA + ++G KG Sbjct: 240 LDE--NLSIEDRDRIMGTIAAQYIDEGKAKG 268 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. 
Identities = 81/307 (26%), Positives = 151/307 (49%), Gaps = 28/307 (9%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD + ++ +P +++F E+HLP ++ L + LK+E SFVD++L+ DIL+S K Sbjct: 7 HDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILFSAK 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 E GY+Y+++EHQS + MA RL RY + + H + K + P + P++FY+G + Sbjct: 67 FGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFYNGVQK 126 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDL 189 W + F + + ++ + L++V +PD+++ + +L+ KHI +RDL Sbjct: 127 YNAPRNLW-ELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHERDL 185 Query: 190 MGLIDQLVVLLVTECAND---SQITALLNYILLTGDEARFNEFISELTRRM-PQHRERIM 245 + +++ LL D I +L Y L + E L ++ P+ RE +M Sbjct: 186 LKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKKRENVM 245 Query: 246 -TIA----------------ERIHNDGYIKGEQ------RILRLLLQNGADPEWIQKITG 282 +IA +++ + I E+ + + +++ G E + KIT Sbjct: 246 KSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESVIKITK 305 Query: 283 LSAEQMQ 289 LS E ++ Sbjct: 306 LSKEDLE 312 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 117 bits (292), Expect = 6e-25, Method: Compositional matrix adjust. 
Identities = 51/131 (38%), Positives = 86/131 (65%), Gaps = 1/131 (0%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD F+T ++ A++F E HLP ++ + DL+SL+L+ +SF+DE L+A +D+L+S Sbjct: 6 NPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMADVLYS 65 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 VK GY Y+++EHQ D M +RL+RY + ++ H++ PLP+V+P++FY+G Sbjct: 66 VKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLVFYNGK 125 Query: 128 RSPYPWSLCWL 138 + YP+ +L Sbjct: 126 KR-YPFQRIFL 135 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 113 bits (283), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 88/301 (29%), Positives = 152/301 (50%), Gaps = 18/301 (5%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR---------AL 60 HD++FK + + D A F+ +LPK+L EL D ++KLESA+ E +R Sbjct: 5 HDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANV--EHVRQQQKDNQKQKE 62 Query: 61 HSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE-HDKRQPLPLV 118 SD+ + K ++G +G ++V IE Q+ +D + R Y + + +I+ H + LPLV Sbjct: 63 QSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLPLV 122 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 + +++Y ++ P+ SL D FA+ A+K Y +D+ D+EI++H +A E Sbjct: 123 VSIIYY-ANQKPFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAGYE 180 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 LI K IR++++ G +D + + E + L+ Y+ D +F +L P Sbjct: 181 LILKAIREKNIDGKLD--IAINQIEAYDHIARQVLIRYMSQYSD-METKDFHDKLIYSKP 237 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPER 298 R +MT+AE+ G KG Q R L G E + K TGL + + L++ + + Sbjct: 238 DLRGDVMTVAEQWEQKGIQKGIQTTARNFLLMGLSAEQVVKGTGLDQDTVLKLKKEVEQT 297 Query: 299 E 299 + Sbjct: 298 Q 298 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 112 bits (280), Expect = 1e-23, Method: Compositional matrix adjust. 
Identities = 60/202 (29%), Positives = 110/202 (54%), Gaps = 2/202 (0%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD F+ L++P AR+F E +LP +++ L +L LE+ SF+D L+ +D+L+S + Sbjct: 56 HDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSAR 115 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSR 128 D YIY++ EHQS D HMAFRL +Y + + ++H I H + P + P++ Y Sbjct: 116 INNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSNDH 174 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 Y L D F + + ++ + L+ + + DD++ ++ +A L+++ K+I + + Sbjct: 175 KKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKPN 234 Query: 189 LMGLIDQLVVLLVTECANDSQI 210 + ++ L T A+ S I Sbjct: 235 VFDKWQEISGCLATIAASSSGI 256 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 104 bits (260), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 81/279 (29%), Positives = 130/279 (46%), Gaps = 34/279 (12%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 N PHD FK + + ARDF++ +LP++ E+ DLD L E+ S VDE LR S Sbjct: 2 NELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLS 61 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D+L+ K + DGYIY+++EH+S + + F+L+RY ++ + + K + +P++IPM+ Sbjct: 62 DMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYD-PKTKKVPIIIPMV 120 Query: 123 FYHGSRSPYPWS-----LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 YHG W+ L + D K Y + + D I + +R+ L Sbjct: 121 IYHGREI---WNVETNLLNMVQGIEDLPNELKTYLPTYRY----EICDFSIKRKKRIIGL 173 Query: 178 ELIQKHI----------------RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG 221 ++ I R R + I QL V E + I Y+L Sbjct: 174 TAMKVAIEAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMI-----YLLNVR 228 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE 260 ++ E + MP E +MTIAE++ N+G KG+ Sbjct: 229 EDVTIEEILKVQKEIMPGRGEIVMTIAEKLRNEGMEKGK 267 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia 
sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 92/308 (29%), Positives = 149/308 (48%), Gaps = 27/308 (8%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 + +PHDA+F+ L P A + LP L DLD L + S VD LR H+D+ Sbjct: 3 SPPSPHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRHTDL 62 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLF 123 L++ + +IYV++EHQS D MAFR++RY + V R++ +H K LP V+P++ Sbjct: 63 LFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVPLVV 122 Query: 124 YHGSRSPYPW-----SLCWLDEFADPTTARKLYNAAFP-LVDVTVVPDDEIVQHRRVA-- 175 +H + W L +D D A + + F L+D V D+ ++ R + Sbjct: 123 HHNEHA---WVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHS 179 Query: 176 ------LLELIQKHIR-QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 LL+++ + R +DL +D+L +L + + LL YI L G+ +E Sbjct: 180 VRLTLLLLKIVPGNPRLAQDLRPWVDELRAVLDGPDGRE-EFATLLRYIELVGEADARDE 238 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKG--EQRILRLL----LQNGADPE-WIQKIT 281 + P+ + MTIAE + +G ++G E R+ LL L+ G PE + + Sbjct: 239 LHDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVESLLQLLTLKFGPLPEAALAAVH 298 Query: 282 GLSAEQMQ 289 SA Q+Q Sbjct: 299 DASAGQLQ 306 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 100 bits (249), Expect = 6e-20, Method: Compositional matrix adjust. 
Identities = 85/330 (25%), Positives = 153/330 (46%), Gaps = 50/330 (15%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK AR F++ +LP+++ L DL+++ + S++D++L+ SD+L+ Sbjct: 6 NPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFSDLLFQ 65 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 VK + +GY+Y + EH+S +A +L++Y + + + ++ K LPL+IPM+ YHG Sbjct: 66 VKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMVVYHGQ 125 Query: 128 RSPYPWSLCW-----LDEFADPTTARKLY--NAAFPLVDVTVVPDDEIVQHRRVALLELI 180 W+ +D + A Y + L D++ D E+V + + ++ Sbjct: 126 EK---WNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIILRT 182 Query: 181 QKHIRQRDLMGLIDQLVVLLVT-ECANDSQ-----ITALLNYILLTGDEARFNEFISELT 234 + I +D + L LL++ E D + L+ YIL T + E I E+ Sbjct: 183 MRDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLEL-ERIYEIA 241 Query: 235 RRMPQHR-ERIMTIAERIHNDGYIKGEQR------------------------------- 262 + + R E +MTIAE++ +G KG ++ Sbjct: 242 KEVSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETKL 301 Query: 263 -ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + R LL G + + + K TGLS E+++ L Sbjct: 302 EVARNLLGLGIEMDKVAKATGLSEEEIRKL 331 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 97.4 bits (241), Expect = 5e-19, Method: Compositional matrix adjust. 
Identities = 63/183 (34%), Positives = 100/183 (54%), Gaps = 8/183 (4%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T PHD K L+ PD + LPK++ EL + L +F+D + R Sbjct: 1 MTKITQ--PHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREH 58 Query: 61 HSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 +D L+ VKT+EG YIY +IEH+S D +AF+L+RY + + +R ++ + +Q LP ++ Sbjct: 59 LTDRLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLK-EGQQKLPPIV 117 Query: 120 PMLFYHGSRS-PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ--HRRVAL 176 P++ YHG+R P L E AD L + +F + D+ + DD++ Q H R AL Sbjct: 118 PLVVYHGAREWTVPNQFSALLE-ADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAAL 176 Query: 177 LEL 179 + + Sbjct: 177 MAM 179 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 97.1 bits (240), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 65/240 (27%), Positives = 124/240 (51%), Gaps = 27/240 (11%) Query: 76 YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSL 135 Y+Y +IE+QS + MAF ++ Y++A+M++H+ ++ Q LP+++ + Y G +SPYP+S Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHL-NEGYQELPIIVNICIYTGKKSPYPYSQ 94 Query: 136 CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQ 195 D F AR+ F L+D++V+ +E+++ +E + + R+RD + I+ Sbjct: 95 DICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINN 154 Query: 196 LVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIH--- 252 VL+ +N +++ YIL T D+ + + + + + +E I+T A+++ Sbjct: 155 NQVLIWELVSNYG--LSIVIYILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVD 212 Query: 253 -NDGYIKG--------------------EQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 G IKG Q I + +L+ G + IQK+TG+S E ++ L Sbjct: 213 IQTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSMLKEGLEISLIQKVTGISREAIEKL 272 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 96.7 bits (239), Expect = 9e-19, Method: Compositional matrix adjust. 
Identities = 70/275 (25%), Positives = 143/275 (52%), Gaps = 11/275 (4%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +F PH+A FK F P+ + F++ H+P+++ L DLD+L+++ + FV E+ R ++ Sbjct: 2 SFEIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYA 61 Query: 63 DILWSV--KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAV-MQRHIEHDKRQPLPLVI 119 D++ +V K + IY+++EH+S + +++ Y + M + + LP++I Sbjct: 62 DVMVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVII 121 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHRRVALL 177 P++ YHG + + +S + D F P+ + + F + D++ + DDE + + Sbjct: 122 PVVIYHG-KGRWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIF 180 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDS---QITALLNYILLTGDEARFNEFISELT 234 L+ K+I +L + ++ LL T D + A++ Y+ + G + E + E T Sbjct: 181 HLLFKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISL--ERLGEYT 238 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQ 269 RR+P E + T A++I + Y + Q ++L++ Sbjct: 239 RRLPGGDEAMQTAAQQIRQEAYNEFIQEQEKMLVE 273 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 95.9 bits (237), Expect = 1e-18, Method: Compositional matrix adjust. 
Identities = 74/262 (28%), Positives = 134/262 (51%), Gaps = 14/262 (5%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 TS HD F+ L ARDF+ HLP+++ +LD++K+ S S+V + L+ +DI+ Sbjct: 10 TSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMTDIV 69 Query: 66 WSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +++ G+ IY+++EH+S D +L +Y V Q I+ K LP+++P++FY Sbjct: 70 ITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQ-KKTGTLPIIVPLVFY 128 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHRRVALLELIQK 182 HG+ + + +SL + D F P+ + Y F L +V V+ ++ + + L+ + Sbjct: 129 HGT-ARWNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLVLE 187 Query: 183 HI---RQRDLMGLIDQLVVLLVTECANDSQ--ITALLNYILLTGDEARFNEFISELTRRM 237 +I +RD + + L +L A ++ L+ Y+L+ DE E E + + Sbjct: 188 YIFYPEKRD--QIYEALELLFKGLDAKEAHEIFAILIKYLLIATDET--PEEAEEKVKHL 243 Query: 238 PQHRERIMTIAERIHNDGYIKG 259 P+ E + T AE + GY K Sbjct: 244 PKGGETVRTTAEVLEERGYNKA 265 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 94.4 bits (233), Expect = 4e-18, Method: Compositional matrix adjust. 
Identities = 76/308 (24%), Positives = 146/308 (47%), Gaps = 29/308 (9%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD FK + A+DFM +LP +L ++ D+++L E ++++ L+ SD+L+ Sbjct: 7 PHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLLFKA 66 Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSR 128 +GY+Y + EH+S +A +L+ Y + + +K++ +P++IPM YHG Sbjct: 67 NINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYHGKE 126 Query: 129 SPYPWSLC-----WLDEFAD-PTTARK-LYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 + W++ ++ + + P RK + + + D++ DDE+ ++ ++ I Sbjct: 127 N---WNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKIL 183 Query: 182 KHIRQRD--LMGLIDQLVVLLVTECANDSQI---TALLNYILLTGDEARFNEFISELTRR 236 + I + D + + V +L + I + YIL E I +L + Sbjct: 184 RSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTE-IYDLVKE 242 Query: 237 MPQHR-ERIMTIAERIHNDGYIKGEQR------------ILRLLLQNGADPEWIQKITGL 283 + R + IMTIAE + +G KG ++ + R L+ G + + + K TGL Sbjct: 243 VSVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGL 302 Query: 284 SAEQMQAL 291 S E++ L Sbjct: 303 SEEEINKL 310 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 78/269 (28%), Positives = 126/269 (46%), Gaps = 17/269 (6%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T+ +PHDALFK+ P A ++ L + + D +L+ E S++DE L HSD+ Sbjct: 4 TSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERHSDL 63 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S D Y+Y++IEHQS D M R++ Y V RH + LP ++P++ Sbjct: 64 LFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPVVVS 123 Query: 125 HGSRSPYPWSLCWLDEF---ADPTTARKL--YNAAFPLV--DVTVVPDDEIVQHRRVALL 177 H +P W+ E PT +L + F LV D+T + D ++ + Sbjct: 124 H---APGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFA 180 Query: 178 ELIQKHIRQR-DLMGLIDQLVVLL-----VTECANDSQ-ITALLNYILLTGDEARFNEFI 230 L+ +R R ++ LID + V E + Q +T + +YI + EF Sbjct: 181 TLVLWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFH 240 Query: 231 SELTRRMPQHRERIMTIAERIHNDGYIKG 259 ++L +PQ RE + T E + +G KG Sbjct: 241 AKLDEHVPQTREVMKTYYEELMEEGMAKG 269 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 72/263 (27%), Positives = 134/263 (50%), Gaps = 18/263 (6%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD K L++P TA + LP+++ E D +L SF+DE LR +D L+ V Sbjct: 7 PHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLTDRLYRV 66 Query: 69 KTREG-DGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIPMLFYHG 126 +T G +YV+IEH+S D+ + ++L++Y + A+ Q E+ + LP ++P +FYHG Sbjct: 67 RTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVPFVFYHG 126 Query: 127 SRSPYPWSLCWLDEF-----ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 + + W + D F A+ L N F ++D+ + D ++ + + L Sbjct: 127 AAA---WKVP--DAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLAA 181 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE-FISELTRRM-PQ 239 K+ + D + +L++ + A D + L+ Y++ T ++E + E+ RR+ P+ Sbjct: 182 KYATRDDRQLEVKELLIQTLVSVA-DEEFRFLMRYVVETYRS--YDEPMVREIIRRVRPE 238 Query: 240 HRERIMTI-AERIHNDGYIKGEQ 261 E +M++ A+ + G +G Q Sbjct: 239 EEETMMSMFAQDMMAKGRQEGRQ 261 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 50/135 (37%), Positives = 74/135 (54%), Gaps = 29/135 (21%) Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLT-GDEARFNEFISELTRRMPQHRE 242 +RQRDL+GL++++ LLVT CAND Q+ AL NY+++ G RF FI ++ +P +E Sbjct: 36 LRQRDLLGLVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKE 95 Query: 243 RIMTIAERIHNDGYIKGEQ----------------------------RILRLLLQNGADP 274 R+MT+ ERI KGE+ RI R +L +G D Sbjct: 96 RLMTLIERIRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGLDR 155 Query: 275 EWIQKITGLSAEQMQ 289 E +Q+ TGL+AE++Q Sbjct: 156 ETVQRFTGLTAEELQ 170 Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 38/66 (57%), Positives = 44/66 (66%), Gaps = 7/66 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV------D 54 M TTSTPHDA+FKTFL HP+TARDFMEIHLP LR+ DL L AS + D Sbjct: 1 MKKSTTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQR-DLLGLVERIASLLVTGCAND 59 Query: 55 EKLRAL 60 +L+AL Sbjct: 60 RQLKAL 65 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 58/206 (28%), Positives = 103/206 (50%), Gaps = 12/206 (5%) Query: 93 FRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFA-DPTTARKLY 151 F++ RY A+M +H++ LP+V+ ML+Y G +PYP++ D F + T A K+Y Sbjct: 4 FKIARYVHAIMDQHLKQ-GHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEKIY 62 Query: 152 NAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR-QRDLMGLIDQLVVLLVTECANDSQI 210 +P++D+T + DD I H +A+L+ QK+ RD+ I+ ++ L Q Sbjct: 63 LRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTREQC 122 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYI--------KGEQR 262 LL Y D + +L + + + E IM++A +I G + + + Sbjct: 123 QTLLYYTFRETDTDNVKMLLEQL-QTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEEDLK 181 Query: 263 ILRLLLQNGADPEWIQKITGLSAEQM 288 I + +L G D +I+ +TGLS + + Sbjct: 182 IAKRMLAKGTDRGYIKDVTGLSDQDL 207 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 65/270 (24%), Positives = 134/270 (49%), Gaps = 17/270 (6%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N + PH+ FK ++ +DF+ I L DL + L SL+L + + Sbjct: 1 MKNKESIQPHNWFFKQVFSNSKNVQDFLSIFLS-DLSQKIQLSSLELVPSEKFSNNQKKH 59 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D+L+ K + + YI ++ EH+S D + +LM+Y+ + + ++ + P +I Sbjct: 60 FLDLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKE--KDYYPPIIN 117 Query: 121 MLFYHG-SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH--RRVALL 177 ++FYHG ++ +P + + + D + + + L+D+ + D+ + ++ + V L+ Sbjct: 118 IVFYHGQAKWNFPTT---IPDIEDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLI 174 Query: 178 --ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 LI KHI R + I L+ ++ EC+ D +LNY++L + E + E+ + Sbjct: 175 MEMLIMKHIHDR--LERIKTLLKDVIDECSEDC-FVIILNYLVLVKKDY---EKVKEVFK 228 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILR 265 + E++M +++ +G ++G+ ILR Sbjct: 229 EIIGGEEKMMLFTDKLKMEGKMEGKIEILR 258 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust. 
Identities = 79/276 (28%), Positives = 128/276 (46%), Gaps = 23/276 (8%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDALFK + + A + LP L D +L+L SFVDE L+ SD+L+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYHG 126 E +Y++ EHQS + MAFRL+RY + + + H+ EH + LP ++P++ +H Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHH- 130 Query: 127 SRSPYPWS-------LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 S W+ L LDE A + F L D++ D+ + A L Sbjct: 131 --SETGWTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRL 188 Query: 180 I---QKHIRQRDLMGLIDQL-----VVLLVTECANDSQ-ITALLNYILLTGDEARFNEFI 230 + +H R+ D L+ QL +V V N + + A+ YIL T + +E + Sbjct: 189 VLWCLRHGREPD--ELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVL 246 Query: 231 SELTRRMPQ-HRERIMTIAERIHNDGYIKGEQRILR 265 L + +E I++ A+++ G +G + LR Sbjct: 247 QRLLAAAGEPWKEEIVSAADQLMERGRQQGLREGLR 282 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 83.2 bits (204), Expect = 9e-15, Method: Compositional matrix adjust. 
Identities = 79/280 (28%), Positives = 123/280 (43%), Gaps = 23/280 (8%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 TS PHDALF+ HP A + LP++L L D L+ + V L +D+ Sbjct: 14 VTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRTDL 73 Query: 65 LWSVKTR---EGDG--YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 L+S GDG +Y+ IEHQSR D M R++ Y + + +RH + LP V Sbjct: 74 LFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHG-GALPPVF 132 Query: 120 PMLFYHGSRS-PYPWSLCWLDEFADPTTARKLYNAAFP-----LVDVTVVPDDEIV---Q 170 ++ H ++ P SL L F +P A P + D+ D E+ Sbjct: 133 CVVLSHAAKGWTGPRSLVEL--FPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARHA 190 Query: 171 HRRVALLELIQKHIRQRD-----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR 225 H AL + + R + L+ DQ++ LL + + + LL Y+ L G E Sbjct: 191 HPLPALTLWLLRDARSPERLVHRLLDWRDQIIALLDYD-HGERDLAQLLRYVALVGSEMD 249 Query: 226 FNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILR 265 F EF + +P+ MTIAE++ + +G ++ R Sbjct: 250 FEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQR 289 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 69/229 (30%), Positives = 110/229 (48%), Gaps = 20/229 (8%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD+L K D A D LP + E DLD L L SFV ++LR H+D+L+ Sbjct: 6 HDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLLFRAP 65 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYHGSR 128 ++Y+++EHQS + M RL+RY ++ +RH+ EH LP ++P++ +H + Sbjct: 66 LDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLHHSEQ 125 Query: 129 S-PYPWSLCWLDEFADPTTARKLYNAAFP-----LVDVTVVPDD-----EIVQHRRVALL 177 P SL L FA AR+ P L D++ PD+ E+ ++AL Sbjct: 126 GWTAPTSLGQL--FALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKLALW 183 Query: 178 ELIQKHIRQ-RDLMGLI---DQLVVLLVTECANDSQITALLNYILLTGD 222 L K+ R +DL+ L+ +++ VT + A++ Y L D Sbjct: 184 AL--KNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHAD 230 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 79/308 (25%), Positives = 149/308 (48%), Gaps = 24/308 (7%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD FK + P+ DF+ P+ +RE D +L E +F DE+L +D+++SV Sbjct: 8 PHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHFADLVFSV 67 Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSR 128 + + +++EH+S + + F++ RY + + + I+ ++QPL V+P+L YHG+R Sbjct: 68 QYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIK--QKQPLTPVLPVLVYHGNR 125 Query: 129 SPYPWSLCWLDEFADP---TTARKLYNAAFPLVDVTVVPDDEI----VQHRRVALLELIQ 181 W + ++ P T L + L+D++ + D+ + + R+ + L+Q Sbjct: 126 R---WKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAI-LLQ 181 Query: 182 KHIRQRDLMGLIDQ---LVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 R+R+L L+D +V L A ++ Y+ T + + E +R Sbjct: 182 NSRRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKV-ELFGIFSRISS 240 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ-MQALRQPLPE 297 + MT+AE + +G + E+R R++ + E IQ+ L Q M A + L + Sbjct: 241 KIESSTMTVAEELIQEGR-ELERRQTRMVAE-----ELIQQGRELERRQAMMAAEELLKQ 294 Query: 298 RERYSWLK 305 +ER + +K Sbjct: 295 QERQNKIK 302 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 83/329 (25%), Positives = 145/329 (44%), Gaps = 48/329 (14%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 +D FK + + +F++ + + + DL SL+ SFV ++ +D+++ K Sbjct: 10 YDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEKEADVIYRAK 69 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 + D Y YV++E QS D M RL Y + QRHIE K L ++P++ Y+G + Sbjct: 70 IEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVPIVLYNGRSN 129 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDL 189 W++ L ++N + LVDV + DDE +++ R+ LL +I R R Sbjct: 130 ---WNVPTLIFKGWEIFKDDMFN--YFLVDVNNI-DDETLKN-RLDLLSVILYLDRSRKT 182 Query: 190 MG-LIDQLV-VLLVTECANDSQITALLNYILLTGDEARFNEF---ISELTRRMPQHRERI 244 I++L V C Q+ ++L E I EL +R+ Q E + Sbjct: 183 AKEFIEKLKEVTEYISCLPTEQVKVFAMWLLRVIRPQMMEEVQGEIDELLKRIEQ--EGV 240 Query: 245 MTIAERIHN------------------------------DGYIKGEQ----RILRLLLQN 270 + + + N +G ++GE RI R ++ Sbjct: 241 TDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRIARNMILA 300 Query: 271 GADPEWIQKITGLSAEQMQALRQPLPERE 299 GA+ +I K+TGL E+++ LRQ + ++E Sbjct: 301 GAEDSFISKVTGLDIEKIKELRQNMTDKE 329 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 77.0 bits (188), Expect = 7e-13, Method: Compositional matrix adjust. 
Identities = 52/211 (24%), Positives = 114/211 (54%), Gaps = 15/211 (7%) Query: 7 STP-HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 S P D++FK DF++ LPK+ + LK E + + SDIL Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 66 WSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------PLPLV 118 + ++ R G D YIY+++EHQS+ D MAFR++ Y + + ++++ K++ LP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARK-LYNAAFPLVDVTVVPDDEIVQHRR-VAL 176 I M+FY G ++ + + ++ + + L A + L++++ + ++ I+ ++ + + Sbjct: 122 IGMVFYDG-KAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGV 180 Query: 177 LELIQK-HIRQRD---LMGLIDQLVVLLVTE 203 + L K ++R ++ L+ +I++ ++L ++E Sbjct: 181 ILLTDKPNVRVKNAEELLKIINKDILLKLSE 211 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 40/121 (33%), Positives = 67/121 (55%), Gaps = 3/121 (2%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDA +K +HP+ RD ++ + + + D +L+ S S+V + LR DI+W ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH---DKRQPLPLVIPMLFYHG 126 +EG YIY+++E QS D +MA R++ Y + Q I+ Q LP V P++ Y+G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 127 S 127 Sbjct: 124 G 124 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 62/258 (24%), Positives = 129/258 (50%), Gaps = 11/258 (4%) Query: 8 TP-HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 TP HDA + + + A D+ +P+++++L D +L+ ++V ++L+ SDI++ Sbjct: 5 TPKHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSISDIVY 64 Query: 67 SVKTREGDGYIYV--VIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 + G+G + + ++EH+S D + ++ Y + + + I +K P L+IP+L Y Sbjct: 65 VCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQI-GNKESP-SLIIPILLY 122 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI--VQHRRVALLELIQK 182 HG+ ++ L E +P + + + + D+ + D+EI + ++ +A L K Sbjct: 123 HGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASLLAMK 182 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP-QHR 241 + +D + + ++ L +E D + L + L G+ +F++ L + +P Q + Sbjct: 183 YSALKDQLNTLLPTILTLASEV--DRNLHKSLLFYTLVGNPLTEEQFLN-LIKSVPNQKK 239 Query: 242 ERIMTIAERIHNDGYIKG 259 E IM I E G+ KG Sbjct: 240 EAIMDIFEIFEEKGWKKG 257 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 38/125 (30%), Positives = 68/125 (54%), Gaps = 4/125 (3%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD+ +K F ++P+ + +P D E D +L+ S S+V + LR H DI+W + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 69 KTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLFY 124 ++G Y+ +V+E QS D MA R + Y+ ++ ++ K + LP V P++ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 125 HGSRS 129 +G ++ Sbjct: 127 NGGKA 131 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 78/337 (23%), Positives = 146/337 (43%), Gaps = 59/337 (17%) Query: 7 STPH---DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 PH D FK + +F+ ++ ++ D +SL+ SF+ ++ +D Sbjct: 4 KVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEKEAD 63 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 +++ + + D Y YV+IE QS D +M RL Y + +RH+E + LP ++P++ Sbjct: 64 VIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVPIVL 123 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNA------AFPLVDVTVVPDDEIVQHRRVALL 177 Y+G W++ PT K ++ + LVDV + DDE ++ R L Sbjct: 124 YNGRSG---WNI--------PTQIFKGFDIFKDDMFNYILVDVNRL-DDEKLKSRLDLLS 171 Query: 178 ELIQKHIRQRDLMGLIDQLV-----------VLLVTECA-------------NDSQITAL 213 ++ +R+ +++L V L C+ +S+I L Sbjct: 172 IILYLEKSRRNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDEL 231 Query: 214 LNYILLTGDEARFNEFISELTRRMPQ-HRERIMTIAERIHNDGYIKG------------E 260 L I G E EFI + + + + +RE E+ + +G +G E Sbjct: 232 LKKIEAEGVED-VGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKE 290 Query: 261 QRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 + I+R L+Q G + +I + TG+ E+++ +R+ E Sbjct: 291 EEIVRRLIQKGFNDNFIAEATGVEIERIKKIREEYTE 327 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 45/138 (32%), Positives = 68/138 (49%), Gaps = 13/138 (9%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK TAR F+E +LP+++R L DL ++ + S++D++L+ SD+L+ Sbjct: 6 NPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFSDLLFQ 65 Query: 68 VKTREGDGYIYVVIEHQSR----EDIHMAFRLMRYSMAVMQRH-----IEHDKRQPLPLV 118 VK RE +GY Y + EH+ R M+ RL S+ QR + H K P Sbjct: 66 VKIRENEGYFYFLFEHKVRPYADRRKKMSTRLADDSVLSKQREMFMQSVNHGK----PPY 121 Query: 119 IPMLFYHGSRSPYPWSLC 136 I G+R+ C Sbjct: 122 ISRFIRKGNRTGSAACRC 139 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 71.6 bits (174), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 38/117 (32%), Positives = 67/117 (57%), Gaps = 1/117 (0%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 +D L +T + A D LP L + DLD+L L S ++V ++LR ++D+L+SV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYH 125 +IY++++HQS D RL R +++ +R+ IE LP+++P++F+H Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHH 140 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 70.1 bits (170), Expect = 8e-11, Method: Compositional matrix adjust. 
Identities = 59/198 (29%), Positives = 95/198 (47%), Gaps = 10/198 (5%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FK+ L PD ++ LP ++ D SL V E L + D+ +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 + + I++++EH+S D F++ RY + R ++ + QP PL +P+LFYHG Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELK-EGLQPRPL-LPILFYHGV-- 122 Query: 130 PYPWSL-CWLDEFADPTTARKLYNAAF--PLVDVTVVPDDEIVQHRRVALLELIQKHIRQ 186 PW+L L E P + F PL+D+ V D+EI H V LE + + Sbjct: 123 -VPWTLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHH--VDDLEAVLALLSL 179 Query: 187 RDLMGLIDQLVVLLVTEC 204 + + ++ LV LL+ E Sbjct: 180 KHIFDGVETLVRLLLREI 197 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 79/310 (25%), Positives = 138/310 (44%), Gaps = 30/310 (9%) Query: 14 FKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREG 73 +K +HP+ RD + + + E D +L+ S S++ E LR D++W V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 74 DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP---LPLVIPMLFYHG-SRS 129 Y+Y+++E QS D MA R+M Y + Q I + P LP V+P++ Y+G R Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVD-VTVVPDDEIVQHRR--VALLELIQKHIRQ 186 ++ L E R N A+ L+D V+ D E H R A L ++ + + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 187 RDLMGLIDQLVVLLVT--ECANDSQITALLNYILLTG-----DEARFNEF---------I 230 +D++ ++ LV L + + +LL + FNE + Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 231 SELTRRMPQHRERIMTIAERIHN--DGYIKGEQRIL----RLLLQNGA-DPEWIQKITGL 283 +E ++ P+ E R +G +GEQR + R L++ G E I + TGL Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 284 SAEQMQALRQ 293 + +++ LR+ Sbjct: 313 
TVAEVEGLRE 322 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 43/133 (32%), Positives = 70/133 (52%), Gaps = 12/133 (9%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD +K +HP+ + +E P ++ L D ++LK S +++ D++WSV+ Sbjct: 6 HDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVWSVE 65 Query: 70 -TREGDG---YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI----EHDKRQPLPLVIPM 121 T EG ++Y+++E QS+ D M RLM Y +A H+ E RQ LP + PM Sbjct: 66 VTWEGITQRVFLYILLEFQSKIDSTMPLRLMHY-VACFYDHLLKTRETTVRQGLPPIFPM 124 Query: 122 LFYHGSRSPYPWS 134 + Y+GS+ WS Sbjct: 125 VLYNGSQR---WS 134 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 53/168 (31%), Positives = 85/168 (50%), Gaps = 15/168 (8%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RALHSDILW 66 TPHD FK + + + LP+D+ D DSL V E L R+ +D+++ Sbjct: 6 TPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTRADLVF 65 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMR-YSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 SV E +G + V++EH+S D + F++++ M MQ E R+PLP ++P+LFYH Sbjct: 66 SVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLRE--GREPLP-ILPILFYH 122 Query: 126 GSRSPYPWSLCWLDEFADPTT-----ARKLYNAAFPLVDVTVVPDDEI 168 G S WS+ D F++ AR L + +D+ ++ D I Sbjct: 123 GQGS---WSIP--DRFSERMKIPREIARYLPDFELLRIDLGLIDDTRI 165 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 38/126 (30%), Positives = 66/126 (52%), Gaps = 5/126 (3%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FKT + RDF+ LP ++ + D DSL+ + + H D++ + Sbjct: 8 HDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHMDLVVECR 67 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 E Y++IEH+S D + +++RY +A+ R+ + +K PL V+P++F+ G R Sbjct: 68 ISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRNRQDNK--PLVPVLPLVFHQGGR- 124 Query: 130 PYPWSL 135 PW+L Sbjct: 125 --PWTL 128 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 37/131 (28%), Positives = 64/131 (48%), Gaps = 4/131 (3%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 F S+ D+L+K HP+ RD + L D +++ + +AS+ + H D Sbjct: 37 FFMSSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDD 96 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH---DKRQPLPLVIP 120 ++W + Y+Y+++E Q+R D MA R+ Y + Q + K LP V+P Sbjct: 97 VVWRARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLP 156 Query: 121 MLFYHGSRSPY 131 ++ YHG R P+ Sbjct: 157 VVLYHG-RGPW 166 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 66.6 bits (161), Expect = 9e-10, Method: Compositional matrix adjust. 
Identities = 46/174 (26%), Positives = 89/174 (51%), Gaps = 10/174 (5%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK + + A DF+ P ++ + DL +L +++S++DE+L+ SDI+++ Sbjct: 5 NPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDIVYT 64 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 ++ + I ++ EH+S +LM+Y + + + + + +R L VIP++ YHG Sbjct: 65 CFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQAQR--LIPVIPVILYHGK 122 Query: 128 RSPYPWSLCWLDEF---ADPTTARKLYNAAFPLVDVTVVPDDEIVQH--RRVAL 176 + W + E+ D R + + L D++ ++EI RRV+L Sbjct: 123 EA---WKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSL 173 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 7/127 (5%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + HD +K + P+ RD + +P D D +L+ S+V E DI+W Sbjct: 2 ANTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVW 61 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-----DKRQPLPLVIPM 121 VK Y+Y++IE QS D +MA R+M Y + Q I+ D R LP V+P+ Sbjct: 62 RVKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGR--LPPVLPI 119 Query: 122 LFYHGSR 128 + Y+GS+ Sbjct: 120 VLYNGSQ 126 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 66/276 (23%), Positives = 126/276 (45%), Gaps = 28/276 (10%) Query: 9 PHDALFKTFLTHPDTARDFMEIH---LPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 PHD+ FK + P + ++I + K + + +++ K S S + D+L Sbjct: 5 PHDSFFKQIFSDPRRVKTLLDIFAKDVAKSIHSITPVNTEKFSSKS------QKFMLDLL 58 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +S K ++ D YI +V+EH+S D + +L Y+ A+ + I+ +++ P +I ++FYH Sbjct: 59 FSCKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIK--EKEYYPPIINIVFYH 116 Query: 126 GSRS-PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL----ELI 180 G P SL L+ D + + + L+D+ V DDE++ + + Sbjct: 117 GKGEWNIPTSLPVLE---DQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIA 173 Query: 181 QKHIRQR-DLMGLIDQLVVLLVTECANDSQITAL---LNYI-LLTGDEARFNEFISELTR 235 KH+ + + + + + +V V ++ L NYI + GD + EL Sbjct: 174 MKHVHENIEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAENALKELIG 233 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 ++ MT+ E+ +G KG+Q L+ L+ G Sbjct: 234 ----GDKKAMTLIEKWIMEGLEKGKQEGLQEGLEKG 265 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 66/325 (20%), Positives = 142/325 (43%), Gaps = 38/325 (11%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N HD +K +H +T +F+ K+ L + D L L S++ Sbjct: 1 MKNNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEE 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ---RHIEHDKRQ---- 113 SDIL+ + + YV++E QS+ D M RL+ Y + + ++ E ++R+ Sbjct: 61 ESDILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNF 120 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLY-----NAAFPLVDVTVVPDDEI 168 LP ++P++ Y+G WS F + + +L+ + + L D+ D E+ Sbjct: 121 KLPSIVPIVLYNGKN---KWSAKI--SFKEMLSGYELFEDNILDFNYMLFDINRYSDHEL 175 Query: 169 VQ-HRRVALLELIQKHIRQRDLM------------------GLIDQLVVLLVTECANDSQ 209 + ++ + L+ + I +++LM + + + +V D+ Sbjct: 176 LNISNMISAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDN- 234 Query: 210 ITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIA-ERIHNDGYIKGEQRILRLLL 268 + ++ +L ++ + +S L + + + +++ + ++ G +G ++ + + Sbjct: 235 LQGEIDDVLEKSNQEEVDFMVSNLGKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAI 294 Query: 269 QNGADPEWIQKITGLSAEQMQALRQ 293 + G D E I +TGLS EQ+ +RQ Sbjct: 295 EMGMDNEIIMNLTGLSEEQINTIRQ 319 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 79/304 (25%), Positives = 148/304 (48%), Gaps = 28/304 (9%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 + PHD L + A F + LP ++ EL DL++L+L +SFV E+L+ +D+L Sbjct: 4 VNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQTDLL 63 Query: 66 WSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 + + + G+ +Y++ EH+S + + +L+ Y + + + + +VIP +FY Sbjct: 64 FQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRN--QQRSGESFSVVIPFVFY 121 Query: 125 HGSRSPYPWSLC--WLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV------AL 176 HG + W L + D+F ++ P + + + I +++ Sbjct: 122 HGEKE---WKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVT 178 Query: 177 LELIQKHIRQRDL--MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 L ++Q+ IR+RDL + + L LL+ +S+ A+L +LL AR + +EL Sbjct: 179 LGVVQR-IRERDLEFVSHLPGLFSLLLG-IEEESKRVAILRKLLLYIYWAR-DLKPTELK 235 Query: 235 R-----RMPQHRERIMTIAERIHNDGY----IKGEQRILRLLLQNGADPEWIQKITGLSA 285 R ++ Q+ E MT AER+ ++G I+G+ R +L E + +ITGLS Sbjct: 236 RVLAISKLEQYEELTMTTAERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSK 295 Query: 286 EQMQ 289 + ++ Sbjct: 296 QDLK 299 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 60/271 (22%), Positives = 125/271 (46%), Gaps = 15/271 (5%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 +T+ HD+ K FL+ A ++ LP+++ + D + + E S++ + L+ +SD+ Sbjct: 2 STTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDL 61 Query: 65 LWSVKTREGD--GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 + SV T+ G ++ ++EH+S + + +RY + +++ ++ LP++IP+L Sbjct: 62 VVSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPIL 121 Query: 123 FYHGSRSPYPWSLCWLDEFAD-PTTARKLYNAAFPLVDVTVV---PDDEIVQHRRVALLE 178 H W + + D P+ K++ F + V P+D AL Sbjct: 122 IAHPEEG---WKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFT 178 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQ----ITALLNYILLTGDEARFNEFISELT 234 L ++ R + M + Q L+ + ++ + +L+Y+ +T DE + + Sbjct: 179 L-WRYSRSPEFMQGV-QKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAE 236 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQRILR 265 + + E + TIAE +G + EQR L+ Sbjct: 237 TEIDEGEEYMGTIAEMFRREGDERTEQRFLQ 267 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 50/222 (22%), Positives = 111/222 (50%), Gaps = 9/222 (4%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HD FK+F + + RDF++ +LP+++++ DL ++++ ++ E+ + +SD++ Sbjct: 7 NAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFYSDVVAK 66 Query: 68 V--KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAV-MQRHIEHDKRQPLPLVIPMLFY 124 V R + +Y + EH+S+ + + Y + M+ +E Q LP+++P++ Y Sbjct: 67 VYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPIIVPVVIY 126 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHRRVALLELIQK 182 +G +S + +S+ + D F P+ K + F L D+ + + + + L+ K Sbjct: 127 NGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEIFHLLLK 185 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDS---QITALLNYILLTG 221 +I +L I ++ LL ND + ++ Y++ +G Sbjct: 186 YIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASG 227 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 66/287 (22%), Positives = 126/287 (43%), Gaps = 45/287 (15%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD +F+ + P AR F+ LP +L D +L + S + + L D+++ + Sbjct: 36 HDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRIN 95 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--------------- 114 + + YV++EHQ++ + HMA R+M + ++ R EHD+ + Sbjct: 96 VNGRNVHFYVLMEHQTKTEKHMARRIMEETF-LIWRQDEHDRAEAAKKEAPGKADRQSRR 154 Query: 115 -----LPLVIPMLFYHGSRSPYPWSLCW-LDEFAD-PTTARK-----LYNAAFPLVDVTV 162 PLVI M+ + G P W W L + D P K + + F +V++ Sbjct: 155 RETDKFPLVISMVLHPG---PRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVELAG 211 Query: 163 VPDDEIVQ-HRRVALLELIQKHIRQRDLMGLID-QLVVLLVTECAND------SQITALL 214 +P +++ H A+L +Q + +GLID + + L+ E +D + L Sbjct: 212 LPLEKLADGHLARAILGALQG-----NRLGLIDIRKIKRLLDEMFSDPDRASVGAVVKQL 266 Query: 215 NYILLTGDEARFNEFISELTRRMP-QHRERIMTIAERIHNDGYIKGE 260 + L++ + + + + +P ++R IM ER+ G +K + Sbjct: 267 WHYLISSSDLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAGALKAQ 313 
>UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 63.9 bits (154), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 32/101 (31%), Positives = 60/101 (59%), Gaps = 3/101 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M PH+ LF + D AR F++ H+ +++++ DLD+L+LE ++VDEKL+ Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 61 HSDILWSVK---TREGDGYIYVVIEHQSREDIHMAFRLMRY 98 +SD+++SV+ + IY++ EH+S D ++++Y Sbjct: 61 YSDLVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKY 101 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 63.9 bits (154), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 38/127 (29%), Positives = 62/127 (48%), Gaps = 5/127 (3%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +K +HP+ D + L L CDL +L+ + S+V + LR DI+W + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--LPLVIPMLFYHGSR 128 + +Y++IE QS+ D M R+M Y + Q I P +P +IP++ Y+G Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGE- 123 Query: 129 SPYPWSL 135 PW + Sbjct: 124 --IPWKV 128 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 63.5 bits (153), Expect = 8e-09, Method: Compositional matrix adjust. 
Identities = 71/279 (25%), Positives = 128/279 (45%), Gaps = 55/279 (19%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW- 66 +PHD FK + F+EI LP+ L E +SLKL +K + D+ + Sbjct: 6 SPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFLDLAFD 64 Query: 67 -SVKTREG---DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR--QPLPLVIP 120 +K +EG DG IY+V EH+S D H ++ Y +M E D+R +P VIP Sbjct: 65 CKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMM----EEDERLSRPYRPVIP 120 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNA-----------AFPLVDVTVVPDDEIV 169 ++FYHG +S W++ PT + +N ++ L DV+ V + ++ Sbjct: 121 IVFYHGEKS---WNI--------PTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLI 169 Query: 170 QH--------RRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNY-ILLT 220 + V L+ I K ++ L ++++L++ V +C + +++Y +++ Sbjct: 170 EKIYLNACLISGVFTLKNIFKDLKY--LRPVLEKLILDDVKDC-----LYIIIDYTVIVK 222 Query: 221 GDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG 259 D + + E+ E++MT+ E+ +G KG Sbjct: 223 KDLETIEKILEEIG-----GEEKMMTLTEKWKMEGLKKG 256 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 73/271 (26%), Positives = 115/271 (42%), Gaps = 19/271 (7%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDALFK P A LP L + D + E + +D +L D+LW + Sbjct: 7 HDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWRTR 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY---HG 126 +G G IYV++EHQS + M R+ Y + H D+ PLP +IP++ HG Sbjct: 67 FVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAEHG 125 Query: 127 SRSPYP-WSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 R+P W P A + N + D+T V DD ++ R + L + + + Sbjct: 126 WRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQV-DDASLRGRSLPLFQTLALWLL 184 Query: 186 Q--RDLMGLIDQL------VVLLVTECAND---SQITALLNYILLTGDEARFNEFISELT 234 + RD +++ + + L E ++ I LL Y E +EF +L Sbjct: 185 RDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFHRKLA 244 Query: 235 RRMPQHRERIMTIAERIHNDGYIKG--EQRI 263 P E +T ++ N G+ +G E RI Sbjct: 245 AFHPPSAEMSLTFEQQAINRGHKRGLEEGRI 275 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 73/326 (22%), Positives = 149/326 (45%), Gaps = 43/326 (13%) Query: 5 TTSTP---HDALFKTFLTHPDT----ARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL 57 ++S P HD+ FK HP +D + K+++E DS++L FVDE Sbjct: 2 SSSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKE----DSIELADKEFVDETF 57 Query: 58 RALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 +D++ + ++ + Y Y++IE+QS M RL+RY + + + I ++ LP Sbjct: 58 HQKRADVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKK-LPA 116 Query: 118 VIPMLFYHGSRSPYPWS---LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV 174 +IP++ Y+G + S + D F D + N + L T++ ++E + V Sbjct: 117 IIPIVTYNGLEKDWDVSQEIISEFDIFKDDIFKYAVVNIS-KLDAKTLLQEEEDILSPVV 175 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQ--ITALLNYI---LLTGDEARFN-- 227 LE ++ + L+ + ++ L N+++ + N I L+ D+ +++ Sbjct: 176 FYLEQVRDDTEE--LVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDEL 233 Query: 228 -------------EFISELTRRMPQHRERIMT---IAERIHN--DGYIKGEQRILRLLLQ 269 EF+S + + + + + R I +I +G I+G+ + + +++ Sbjct: 234 AQRVEQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIR 293 Query: 270 NGADPEWIQKITGLSAEQMQALRQPL 295 G E I ++T L E+++ LR+ L Sbjct: 294 RGFSDEDIAELTELDIEKVKELRKEL 319 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 65/291 (22%), Positives = 133/291 (45%), Gaps = 29/291 (9%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T PHD FK + P + ++I P +L + DL+S++L ++ +K+ ++ Sbjct: 2 TDLQPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNL 60 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+ K ++ ++ EH+S D ++ +L+ Y+ + + E+++ P +I ++ Y Sbjct: 61 LYECKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEETGEYEEYPP---IINIVLY 117 Query: 125 HGSRSPYPWSL-CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE----L 179 HG R W++ L + R + L+D++ V D+E++ + L Sbjct: 118 HGKRK---WNIPATLPKTNSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALL 174 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KHI + DL + ++ V E D + +L+YI + + + E+ Sbjct: 175 TMKHIFE-DLRKY--KHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEILG---- 227 Query: 240 HRERIMTIAERIHNDGYIKGEQR---------ILR-LLLQNGADPEWIQKI 280 + +MT+ E+ +G +G Q+ IL+ + L+ G PE I+K+ Sbjct: 228 GEKDMMTLTEKWKMEGLQQGLQQGMIEGQKKAILKSIQLKFGRVPENIEKL 278 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 62.0 bits (149), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 37/115 (32%), Positives = 68/115 (59%), Gaps = 3/115 (2%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH-SDILW 66 PHD K L + A+ ++ HLP+++ + ++L++ + +D K ++ + +DI++ Sbjct: 15 NPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADIIY 74 Query: 67 SVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL-PLVI 119 S+KT G D IYV+IEH+S +D H+ +L++ AV + I K P+ P+VI Sbjct: 75 SLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEGKITPIYPIVI 129 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 53/181 (29%), Positives = 85/181 (46%), Gaps = 11/181 (6%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FK+ L P ++ LP L L SL + V + L A D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 E I+V++EH+S D F+++ Y +A + + + R P+P V P+LFYHG R Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHY-LAELWLRDKKESRSPIPFV-PVLFYHGLR- 124 Query: 130 PYPWSL-CWLDEFADPTTARKLY--NAAFPLVDVTVVPDDEIVQHRR---VALLELIQKH 183 PW+L L E DP + + + P++D+ + D +I + R + L+ KH Sbjct: 125 --PWNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLLLKH 182 Query: 184 I 184 I Sbjct: 183 I 183 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 72/336 (21%), Positives = 146/336 (43%), Gaps = 56/336 (16%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 P+D ++ L + ++ + + E D D L L + S+V + +D+++ + Sbjct: 14 PYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADVVYRL 73 Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--------LPLVIP 120 KTR + YV++E QS D M FRL+ Y M + R I ++ Q LP +IP Sbjct: 74 KTRNRNVIFYVLLELQSTVDYLMPFRLLLY-MVEIWREIYNNTPQGERESKHFRLPPIIP 132 Query: 121 MLFYHGSRSPYPWSLC-----WLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR-V 174 + Y+G+ S W+ L+ + D + L + + L DV ++E+++ + Sbjct: 133 AVLYNGAGS---WTAALSFKEMLNSYQD--FSGHLLDFRYLLFDVNRYSEEELIRAANLI 187 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLL---------------------------------V 201 A + L+ + ++ DL G + +L +L + Sbjct: 188 AGIFLLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGI 247 Query: 202 TECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHN--DGYIKG 259 +N ++ ++ + LT +E + + L + Q + + ++ +G ++G Sbjct: 248 LNASNPWEVERMIYNLELTLEEMQRQALLKGL-KEGEQKGKLEGKLEGKLEGKLEGKLEG 306 Query: 260 EQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 ++ + R LL D E I K TGL+ E++ AL++ + Sbjct: 307 
KREVARNLLLLNVDIETIIKATGLALEEINALKKQM 342 >UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UAM6_YERAL Length = 105 Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 34/93 (36%), Positives = 48/93 (51%), Gaps = 20/93 (21%) Query: 212 ALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ---------- 261 +L+NY+L GD A FI EL RR PQH+E +MTIA+++ +G +G Q Sbjct: 4 SLINYMLQDGDAATPKTFIWELARRSPQHKELLMTIAQKLKQEGRQEGRQEGRVEGIQIG 63 Query: 262 ----------RILRLLLQNGADPEWIQKITGLS 284 + R +L NG D + K+TGLS Sbjct: 64 EANGLKKGKLEVARTMLVNGLDRATVMKMTGLS 96 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 52/210 (24%), Positives = 89/210 (42%), Gaps = 39/210 (18%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 DAL+ +HP A + +P+ + D ++ +A F D + D++W + T Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 71 REG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP---LPLVIPMLFYHG 126 +G D ++++ E QS D MA R Y + Q I K + LP V+ ++ Y+G Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSGDRLPPVLTLVLYNG 124 Query: 127 SR-----------------SP-YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI 168 + SP +PW A + L+D+ VP++E+ Sbjct: 125 EQRWHAPTDTIPLIALPAGSPLWPWQ----------------PRACYHLLDMGAVPEEEL 168 Query: 169 VQHRRVALLELIQKHIRQ-RDLMGLIDQLV 197 +A L +H R+ +L GLID +V Sbjct: 169 AIRDSLAALLFRLEHPREPEELAGLIDDVV 198 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 52/230 (22%), Positives = 105/230 (45%), Gaps = 15/230 (6%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD FK + P + ++I +L + DL+S++L ++ +K+ D+L+ Sbjct: 6 PHDQFFKQIFSEPKRVKSLLDIFYS-ELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYEC 64 Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSR 128 K ++ ++ EH+S D ++ +L+ Y+ + + E+ + P +I ++ YHG R Sbjct: 65 KIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEETGEYKEYLP---IINIVLYHGKR 121 Query: 129 SPYPWSL-CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV----ALLELIQKH 183 W++ L + R + L+D++ V D+E++ V A L KH Sbjct: 122 K---WNIPTTLPKTNSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKH 178 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 I + DL + ++ V E D + +L+YI + + + E+ Sbjct: 179 IFE-DLKKY--KHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEI 225 >UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=Q3C0L0_SODGL Length = 131 Score = 55.5 bits (132), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 25/55 (45%), Positives = 36/55 (65%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 +T + HD +FK FL ARDF+EIHLP LR+ CD +L + S SF+++ L+ Sbjct: 3 STLSHHDHVFKKFLGDIAVARDFLEIHLPPHLRKHCDFSTLAMASGSFIEDDLKG 57 >UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF6_YERPN Length = 105 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 24/63 (38%), Positives = 41/63 (65%) Query: 209 QITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLL 268 Q+ AL++Y+L G+ A F+ EL +R+PQH + +MTIA+++ G KG ++ ++L Sbjct: 9 QVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGIEKGIQLGE 68 Query: 269 QNG 271 Q G Sbjct: 69 QKG 71 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 28/124 (22%), Positives = 63/124 (50%), Gaps = 9/124 (7%) Query: 7 STPHDALFKTFLTHPDTA----RDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 + HD+ FK +P + ++RE S++++ +++ ++ + + Sbjct: 11 AKEHDSTFKLLFENPKDIYLLLSKIINYSWANEIRE----SSIEIKKTNYITKEFSQVEA 66 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D++ + ++ D Y Y++IE+QS M RL+RY +++ I + + LP +IP++ Sbjct: 67 DVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEI-RNGVEKLPAIIPIV 125 Query: 123 FYHG 126 Y+G Sbjct: 126 VYNG 129 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 37/176 (21%), Positives = 82/176 (46%), Gaps = 13/176 (7%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD +K ++ +T ++ + ++L L S+V L SDI++ + Sbjct: 23 HDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESDIVYKAR 82 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-----KRQP--LPLVIPML 122 + + + Y+++E QS D M RL+ Y + + + +++ KR+ LP V+P++ Sbjct: 83 IGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLPAVVPIV 142 Query: 123 FYHGSRSPYPWSLC-WLDEFADPTT--ARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 Y+G ++ W++ L E + + + + +DV DE+ +++ +A Sbjct: 143 VYNGEKN---WTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIA 195 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 49/171 (28%), Positives = 77/171 (45%), Gaps = 10/171 (5%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RALHSD 63 TT TPHD+ FK + L D SL S + E L + SD Sbjct: 3 TTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFRSD 62 Query: 64 ILWSV----KTREGDGYIYV-VIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 ++ S+ T +G +V ++EH+S + F+L A+ R + K PLP V Sbjct: 63 LVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGK-PPLP-V 120 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFA-DPTTARKLYNAAFPLVDVTVVPDDEI 168 +P+L +HG +SP+ L + P A + + A ++D+T + DDEI Sbjct: 121 VPILIHHG-KSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEI 170 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 66/285 (23%), Positives = 126/285 (44%), Gaps = 40/285 (14%) Query: 43 DSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAV 102 + L+L ++V SD+L+ + +E + + Y++ EHQS D +MA RL+ Y + Sbjct: 43 EDLELVDKNYVLPDFSEQESDLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDI 102 Query: 103 MQRHIEH-DKRQ------PLPLVIPMLFYHGSRSPYPWSLCWLDEFAD-PTTARKLYNAA 154 + ++ DK Q P V+P++ Y G +P+ S+ + + + + + Sbjct: 103 WRDWLKQFDKNQFKNKSFKFPPVVPIVLYDGD-NPWTASVNLKERIMNFEVFGKYIVDFE 161 Query: 155 FPLVDVTVVPDDEIVQHRRV-ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 + L+D+ PD+ I +++ + +L+ + K +++L L L L + A + +I L Sbjct: 162 YILIDLN-DPDEMIFKYKDILSLILKLNKVKTEKELERLFLDLYEYL--QGAKEKEINTL 218 Query: 214 ---LNYILLTGDEARFNEFISELTRRMPQHRERIM-------TIAERIHNDGYIKG---- 259 L +L E + E ++ + E IM I E +++G KG Sbjct: 219 KICLPVVLKELGEDKVQE-AKDMLECIDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDG 277 Query: 260 ------------EQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 E I ++ G E I +ITGL E+++ LR Sbjct: 278 LQQGLQQGLQKKELEIAERMIVKGYSDEEIHEITGLDIEKIKELR 322 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 46.2 bits (108), Expect = 0.001, 
Method: Compositional matrix adjust. Identities = 78/313 (24%), Positives = 134/313 (42%), Gaps = 41/313 (13%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPK-------DLRELCDLDSLK-LESASFVDEKL 57 ++TPHD+ FK P HLP L +L SL+ L S ++ Sbjct: 21 STTPHDSFFKDVFG-PGKG------HLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLA 73 Query: 58 RALHSDILWSV-----KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR 112 R+ SD+ S+ + GD I + EH+S H+ L+ A++ R + + R Sbjct: 74 RSTRSDLSASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLR-EGR 132 Query: 113 QPLPLVIPMLFYHGSRSPYPWSL-CWLDEFAD--PTTARKLYNAAFPLVDVTVVPDD--- 166 +P P VIP++ YHG PW+L L E D P A +L + L+D++ D+ Sbjct: 133 KPCP-VIPVVLYHGR---APWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLK 188 Query: 167 EIVQHRRVALLELIQKHIRQ--RDLMGLIDQLVVLLVTECANDSQIT-ALLNYILLTGDE 223 E + H + + KHI + ++G +L+ L +I L+YI Sbjct: 189 EKIAHPEPLVSLSVMKHIFEPPESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKS 248 Query: 224 ARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG-----EQRILRLLLQNGADPEWIQ 278 E + T + + E++ T+ + I +G +G ++ I RLL + P+ I Sbjct: 249 HHPQEIRTIFTTFLAE--EKMTTVLDLIKEEGIQEGIQMGRDEAITRLLQHSSLSPQQIA 306 Query: 279 KITGLSAEQMQAL 291 I + ++ +L Sbjct: 307 SILNVDLSRVLSL 319 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 37/183 (20%), Positives = 86/183 (46%), Gaps = 27/183 (14%) Query: 10 HDALFKTFLTHP----DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 HD +K ++ D ++F++ K++++ D+++L + S++ L SDI+ Sbjct: 11 HDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKK----DNIELVNKSYILSDYEELESDIV 66 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--------LPL 117 + + Y+++E QS D M RL Y M+ + R + + +Q LP Sbjct: 67 YKATIDGREVIFYILLEFQSYVDYSMPIRLFLY-MSEIWREVLKNTKQAEVKSKEFRLPA 125 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLY-----NAAFPLVDVTVVPDDEIVQHR 172 ++P++ Y+G Y W++ +F + +L+ + + L+D+ +E+++ + Sbjct: 126 IVPLVLYNG---EYKWTVE--KKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELK 180 Query: 173 RVA 175 + Sbjct: 181 NLV 183 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 68/317 (21%), Positives = 146/317 (46%), Gaps = 52/317 (16%) Query: 11 DALFKTFLTHPDTAR----DFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 D FK LT+ + + +E+ LP + L D++ + ES ++ + RA SD+++ Sbjct: 9 DEGFKKVLTNRTNIKWLLTELLEV-LPIQIG-LEDIEVIATES---INRQWRARRSDMVY 63 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 +K + D YI V++E QS ++ + R++ Y + + +++ + LP+VIP++ Y G Sbjct: 64 KIKYK--DAYICVLLEFQSSKEELIHLRVLEYMLLIQKKYT---TKNLLPVVIPVVLYTG 118 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNA-AFPLVDVTVVPDDEIVQH-------------- 171 P + C+ ++ + VDV ++ D+++++ Sbjct: 119 EEKWTP-ATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDKVS 177 Query: 172 ---RRVA-LLELIQKHIR--QRDLMGLIDQLV-VLLVTECANDSQITALL---NYILLTG 221 +VA LE + KH++ + + L V+L +D ++ L +++ L Sbjct: 178 DNPEKVAERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRLGV 237 Query: 222 DEARFN---EFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRL---LLQNGADPE 275 +E N + L + + + R++ + + EQ +L + +++ GA+ Sbjct: 238 NEMFLNTAEKIRKGLEKELEKERKQGIQQGIQQGK------EQALLEVAQKMIEEGAEDS 291 Query: 276 WIQKITGLSAEQMQALR 292 +I K+TGL E+++ LR Sbjct: 292 
FIAKVTGLDMERIRQLR 308 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 34/163 (20%), Positives = 82/163 (50%), Gaps = 18/163 (11%) Query: 51 SFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD 110 SFV + +D+++ VK ++ + Y+++E QS D M +RL+ Y + + + ++ Sbjct: 55 SFVLQDFADKEADLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDT 114 Query: 111 KRQ-------PLPLVIPMLFYHG-----SRSPYPWSLCWLDEFADPTTARKLYNAAFPLV 158 R+ LP+++P++ Y+G +++ Y +L + F + K + L+ Sbjct: 115 PRKESRRKDFKLPVIVPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFK-----YILI 169 Query: 159 DVTVVPDDEIVQ-HRRVALLELIQKHIRQRDLMGLIDQLVVLL 200 DV +E+++ +A + L+++ + ++M + +L +L Sbjct: 170 DVNRYTKEELLKLENLIASVFLLEQKVEFEEIMKRLKELSEIL 212 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 62/323 (19%), Positives = 147/323 (45%), Gaps = 46/323 (14%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 T +D +K ++ + F++ L ++ + + +++ + +++K + SDI+ Sbjct: 3 TYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDIV 62 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + +K + D + + IE QSRED + RL Y M ++Q +++ +P+V+P++ Y+ Sbjct: 63 YKIKYK--DSFFCLTIEFQSREDKKILHRLYEY-MHLIQ--LKNKVNGEIPVVVPIVLYN 117 Query: 126 G-----SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 G Y + + +F P A+ N +D+ +P+++++ V + + Sbjct: 118 GISHWKPNEQYNEIILFAKDF--PEYAQ---NFKIIFLDIKSIPEEKLISAANVLAIAVY 172 Query: 181 QKHI---------RQRDLMGLI-------DQLV-----VLLVTECANDSQITALLNYILL 219 + R +L G I ++L V+L + ++ + + L Sbjct: 173 IDQVSNNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGL 232 Query: 220 TGDEARFNEFISELTRRMPQHRERIMTIA-----ERIHNDGYIKGEQRILRL----LLQN 270 DE F+ ++ + + + +++I A ++ G +G +R ++L +L++ Sbjct: 233 EVDEL-FSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKD 291 Query: 271 GADPEWIQKITGLSAEQMQALRQ 293 E I K TGL+ E+++ L++ Sbjct: 292 NQPIELISKYTGLTPEEIKKLKK 314 >UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FHW2_9AQUI Length = 211 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 40/155 (25%), Positives = 80/155 (51%), Gaps = 15/155 (9%) Query: 111 KRQPLPLVIPMLFYHGSRSPYPWSL-CWLDEFADPTTARKLYNAAFPLVDVTVVPDDE-- 167 K++ P +I ++FYHG R W++ L D + L+D+ +PD+E Sbjct: 6 KKEYYPPIINIVFYHGERE---WNIPTNLPTVKDKDLQEYTQKLNYILIDLNKIPDEELK 62 Query: 168 --IVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR 225 I ++ V L L+ K I D+ L + ++ L+ + +DS + +L+YI+L +A Sbjct: 63 NRISKNMDVILAILVMKRIFD-DIQNL--RPILELIIKHKSDS-LFIILDYIVLIKKDA- 117 Query: 226 FNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE 260 E + ++ + + E++MT+ E+ +G++KG+ Sbjct: 118 --EKVEKILKEISGGDEKMMTLTEKWKMEGWMKGK 150 >UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostridium difficile RepID=C9XMT1_CLODC Length = 158 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 32/109 (29%), Positives = 57/109 (52%), Gaps = 13/109 (11%) Query: 25 RDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQ 84 ++F + + K+L L +++LE+ SF+ E + DI++ V ++ G Y+V+E Q Sbjct: 28 QNFTSVSIAKEL----TLKNIELET-SFICE-YKGKEVDIIYKVFSKSGKVSHYIVLEFQ 81 Query: 85 SREDIHMAFRLMRY-----SMAVMQRHIE--HDKRQPLPLVIPMLFYHG 126 + D + RL Y +M++ +E DK LP VIP++ Y G Sbjct: 82 TEMDTEIVPRLKSYREQIWKSFIMKKSLEEIEDKNFKLPKVIPVVLYSG 130 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust. 
Identities = 29/119 (24%), Positives = 62/119 (52%), Gaps = 17/119 (14%) Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQR-----HIEHDKRQ--PLP 116 +++ VK ++ + + Y+++E QS+ D M +RL+ Y + V + + KR+ LP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNA-----AFPLVDVTVVPDDEIVQ 170 +IP++ Y+G + + SL F + + +L+ + L+DV ++E++Q Sbjct: 61 AIIPIVLYNGV-NRWTASLS----FKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQ 114 >UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobacterium RepID=Q6D6X6_ERWCT Length = 135 Score = 42.7 bits (99), Expect = 0.014, Method: Compositional matrix adjust. Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 20/103 (19%) Query: 209 QITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG--------- 259 Q A+L YI +G+ ++ EFI + + + RE IMTIA+++ G+ KG Sbjct: 27 QKRAILFYIARSGNTSKPAEFIEAVAQSLSTDREAIMTIAQQLEKIGFEKGIKHGMQQGM 86 Query: 260 ----EQ-------RILRLLLQNGADPEWIQKITGLSAEQMQAL 291 EQ +I R LL +G +P + ++T LSA ++ L Sbjct: 87 QRGMEQGIKTSARQIARQLLLSGMEPAQVCQMTQLSAAELAQL 129 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 41.2 bits (95), Expect = 0.041, Method: Compositional matrix adjust. Identities = 30/144 (20%), Positives = 61/144 (42%), Gaps = 23/144 (15%) Query: 47 LESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM--- 103 L + S++ SDI++ D + YV++E QS D M RL+ Y + + Sbjct: 3 LVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWRDI 62 Query: 104 --QRHIEHDKRQP--LPLVIPMLFYHGSRSPYPWS--------LCWLDEFADPTTARKLY 151 ++ KR+ LP ++P++ Y+G + W+ + D F D + Sbjct: 63 LRNTELKEFKRKTFRLPSIVPIVLYNGKK---KWTAAKELKHAISNSDVFGD-----NIL 114 Query: 152 NAAFPLVDVTVVPDDEIVQHRRVA 175 N + +D+ +E+ + ++ Sbjct: 115 NFKYEFIDINSYEKEELYNKQNIS 138 >UniRef50_B1EI63 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1EI63_9ESCH Length = 78 Score = 40.4 bits (93), Expect = 0.070, Method: Compositional matrix adjust. 
 Identities = 22/56 (39%), Positives = 37/56 (66%), Gaps = 2/56 (3%)

Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMR 97
          + +LKLES+SF+D+ LR  +SD+LWSVK     GY+   +  +  E + +A R+++
Sbjct: 1  MKTLKLESSSFIDDDLRESYSDVLWSVKYLI--GYLISYLLDRRMETLDIAKRMLQ 54

>UniRef50_A7N2B6 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N2B6_VIBHB
          Length = 86

 Score = 40.4 bits (93), Expect = 0.080, Method: Compositional matrix adjust.
 Identities = 17/52 (32%), Positives = 31/52 (59%)

Query: 208 SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG 259
           S   +L+ Y+L  G+ +   + +  L R++P+H ER MT+AE++   G  +G
Sbjct: 6   SAYDSLVEYLLRVGETSNLEDLMRTLARQVPEHEERFMTVAEQLEARGREQG 57

Searching..................................................done

Results from round 2

                                            Score    E
Sequences producing significant alignments: (bits) Value

Sequences used in model and found again:

UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 360 4e-98
UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 352 8e-96
UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 346 4e-94
UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 341 1e-92
UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 337 3e-91
UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 334 2e-90
UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 332 9e-90
UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 315 1e-84
UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 313 5e-84
UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 312 1e-83
UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 310 3e-83
UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 302 9e-81
UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 298 2e-79
UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 293 4e-78
UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 283 6e-75
UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 283 7e-75
UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 279 8e-74
UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 274 2e-72
UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 273 5e-72
UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 264 3e-69
UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 261 2e-68
UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 253 7e-66
UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 250 4e-65
UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 249 7e-65
UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 245 2e-63
UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 232 2e-59
UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 226 8e-58
UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 223 6e-57
UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 221 2e-56
UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 221 2e-56
UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 220 3e-56
UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 220 5e-56
UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 218 3e-55
UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 217 3e-55
UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 217 6e-55
UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 216 8e-55
UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 215 1e-54
UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 214 4e-54
UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 214 4e-54
UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 211 3e-53
UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 209 1e-52
UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 209 1e-52
UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 203 8e-51
UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 203 8e-51
UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 202 2e-50
UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 199 1e-49
UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 199 1e-49
UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 199 1e-49
UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 197 4e-49
UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 197 5e-49
UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 192 1e-47
UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 192 2e-47
UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 192 2e-47
UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 189 9e-47
UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 186 7e-46
UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 185 2e-45
UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 182 1e-44
UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 181 3e-44
UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 180 7e-44
UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 178 2e-43
UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 178 2e-43
UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 177 3e-43
UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 173 7e-42
UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 172 2e-41
UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 172 2e-41
UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 171 3e-41
UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 169 1e-40
UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 168 2e-40
UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 168 3e-40
UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 166 8e-40
UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 166 8e-40
UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 166 1e-39
UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 163 6e-39
UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 162 1e-38
UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 162 2e-38
UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 161 3e-38
UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 161 3e-38
UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 156 8e-37
UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 156 1e-36
UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 154 3e-36
UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 153 5e-36
UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 153 9e-36
UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 152 2e-35
UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 148 2e-34
UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 146 1e-33
UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 145 2e-33
UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 144 3e-33
UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 144 4e-33
UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 137 5e-31
UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 136 9e-31
UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 135 1e-30
UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 129 1e-28
UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 126 8e-28
UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 120 6e-26
UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 110 6e-23
UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ... 92 2e-17
UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersini... 89 2e-16
UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF... 76 1e-12
UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=... 76 2e-12

Sequences not found previously or not previously below threshold:

UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 142 1e-32
UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 132 2e-29
UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 121 4e-26
UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 116 1e-24
UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostri... 113 1e-23
UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 106 1e-21
UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 105 1e-21
UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 100 5e-20
UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermo... 100 1e-19
UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostri... 98 4e-19
UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseifl... 93 9e-18
UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfuri... 91 4e-17
UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermo... 87 7e-16
UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotro... 87 8e-16
UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candida... 85 4e-15
UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuok... 83 9e-15
UniRef50_A8VV66 ATPase associated with various cellular activiti... 83 1e-14
UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibroba... 82 3e-14
UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorell... 80 6e-14
UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NI... 80 8e-14
UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillu... 80 1e-13
UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostr... 80 1e-13
UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID... 77 8e-13
UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobac... 76 1e-12
UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorell... 76 2e-12
UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synecho... 76 2e-12
UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW 75 4e-12
UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubact... 74 7e-12
UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 71 5e-11
UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultu... 70 7e-11
UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenom... 70 9e-11
UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax... 70 1e-10
UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenom... 70 1e-10
UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryante... 69 2e-10
UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacter... 69 2e-10
UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldice... 68 3e-10
UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteri... 67 1e-09
UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotro... 67 1e-09
UniRef50_B3CQQ1 Putative transposase n=3 Tax=Orientia tsutsugamu... 67 1e-09
UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax... 66 1e-09
UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiat... 66 2e-09
UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 66 2e-09
UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatex... 65 3e-09
UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 65 4e-09
UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax... 65 4e-09
UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostri... 65 4e-09
UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Trepone... 65 4e-09
UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Ricket... 65 4e-09
UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bactero... 64 5e-09
UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostri... 63 1e-08
UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfo... 63 1e-08
UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfo... 63 2e-08
UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadoba... 62 2e-08
UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicycl... 62 3e-08
UniRef50_Q1NK38 Putative uncharacterized protein n=2 Tax=delta p... 62 4e-08
UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolba... 61 4e-08
UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255... 61 5e-08
UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 61 6e-08
UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscil... 61 7e-08
UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostri... 60 1e-07
UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabac... 60 1e-07
UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostri... 59 2e-07
UniRef50_UPI000190BD13 hypothetical protein SentesTyph_06309 n=2... 59 2e-07
UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostri...
59 2e-07 UniRef50_A7BN25 Putative uncharacterized protein n=3 Tax=Beggiat... 58 3e-07 UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cya... 58 3e-07 UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=... 58 3e-07 UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingoba... 58 3e-07 UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiat... 58 3e-07 UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Trepone... 58 5e-07 UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococca... 57 6e-07 UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfi... 57 8e-07 UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacte... 57 9e-07 UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YM... 57 1e-06 UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacte... 57 1e-06 UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiat... 57 1e-06 UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacter... 56 1e-06 UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachys... 56 2e-06 UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachys... 56 2e-06 UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulic... 56 2e-06 UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orienti... 55 2e-06 UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xen... 55 2e-06 UniRef50_C1J8G9 YdgA n=11 Tax=Enterobacteriaceae RepID=C1J8G9_ECOLX 55 3e-06 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 55 3e-06 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 55 3e-06 UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax... 55 3e-06 UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria ... 55 4e-06 UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8Y... 55 4e-06 UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobac... 
55 5e-06 UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microco... 55 5e-06 UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coproco... 54 5e-06 UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachys... 54 8e-06 UniRef50_A7BPH0 Putative uncharacterized protein n=5 Tax=Beggiat... 54 8e-06 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 53 9e-06 UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methano... 53 1e-05 UniRef50_B7CCB3 Putative uncharacterized protein n=1 Tax=Eubacte... 53 1e-05 UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostr... 53 1e-05 UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Rosebur... 53 2e-05 UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostri... 53 2e-05 UniRef50_A7N2B6 Putative uncharacterized protein n=1 Tax=Vibrio ... 53 2e-05 UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotro... 53 2e-05 UniRef50_B2JB68 Putative uncharacterized protein n=1 Tax=Nostoc ... 53 2e-05 UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=... 52 2e-05 UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax... 52 3e-05 UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabac... 52 3e-05 UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptosp... 51 3e-05 UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacte... 51 4e-05 UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevote... 51 4e-05 UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecali... 51 6e-05 UniRef50_D1PGQ2 Transposase, ISNCY family n=2 Tax=Prevotella cop... 51 6e-05 UniRef50_C9LBM4 Putative uncharacterized protein n=1 Tax=Blautia... 51 7e-05 UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminoc... 50 8e-05 UniRef50_A7M2M6 Putative uncharacterized protein n=2 Tax=Bactero... 50 8e-05 UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Rumino... 
50 9e-05 UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorob... 50 1e-04 UniRef50_Q2FSG0 Putative uncharacterized protein n=1 Tax=Methano... 50 1e-04 UniRef50_A7C2W6 Putative uncharacterized protein n=1 Tax=Beggiat... 50 1e-04 UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacteriu... 50 2e-04 UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachys... 49 2e-04 UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostri... 49 2e-04 UniRef50_C1DU78 Putative uncharacterized protein n=1 Tax=Sulfuri... 49 2e-04 UniRef50_C9KZM2 Transposase n=8 Tax=cellular organisms RepID=C9K... 49 2e-04 UniRef50_C1TQY0 Putative transposase, YhgA n=1 Tax=Dethiosulfovi... 49 2e-04 UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacter... 49 3e-04 UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea f... 48 3e-04 UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 ... 48 3e-04 UniRef50_Q8YKL8 Alr7276 protein n=12 Tax=Bacteria RepID=Q8YKL8_A... 48 3e-04 UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Trepon... 48 3e-04 UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacte... 48 3e-04 UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostri... 48 3e-04 UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostri... 48 3e-04 UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotro... 48 4e-04 UniRef50_C1J8S3 YdgA n=6 Tax=Escherichia coli RepID=C1J8S3_ECOLX 48 4e-04 UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostr... 48 4e-04 UniRef50_B6FTF1 Putative uncharacterized protein n=1 Tax=Clostri... 48 4e-04 UniRef50_Q73KA7 Putative uncharacterized protein n=2 Tax=Trepone... 48 4e-04 UniRef50_C9LXS5 Transposase n=3 Tax=Selenomonas sputigena ATCC 3... 48 4e-04 UniRef50_Q6D2V6 Putative uncharacterized protein (Fragment) n=1 ... 48 5e-04 UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfi... 48 5e-04 UniRef50_UPI0001C3858D hypothetical protein AplaP_08641 n=1 Tax=... 
48 5e-04 UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax... 48 5e-04 UniRef50_C1DU30 Putative uncharacterized protein n=7 Tax=Sulfuri... 48 5e-04 UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadoba... 48 5e-04 UniRef50_C3QLI8 Putative uncharacterized protein n=1 Tax=Bactero... 48 6e-04 UniRef50_A7BL62 Putative uncharacterized protein n=2 Tax=Beggiat... 47 6e-04 UniRef50_A8V3I7 Putative uncharacterized protein (Fragment) n=2 ... 47 7e-04 UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacte... 47 7e-04 UniRef50_UPI00019735B3 hypothetical protein ClM62_08045 n=1 Tax=... 47 9e-04 UniRef50_C4Z592 Putative uncharacterized protein n=2 Tax=Clostri... 47 0.001 UniRef50_B0BV37 Putative uncharacterized protein n=7 Tax=Rickett... 47 0.001 UniRef50_A3JHY3 Putative uncharacterized protein n=3 Tax=Gammapr... 46 0.001 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 360 bits (923), Expect = 4e-98, Method: Composition-based stats. 
Identities = 157/311 (50%), Positives = 210/311 (67%), Gaps = 21/311 (6%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 TT TPHDA F+ FLT PD ARDFME+HLP +LR +CDL +LKLES SFV++ LR Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 SD+L+S+KT GDGYI+V++EHQS D HMAFRL+RY++A MQRH+E + LPLVIP+ Sbjct: 63 SDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAG-HKKLPLVIPV 121 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 LFY G RSPYP+S WLDEF D A KLY++AFPLVDVTV+PDDEI HR +A L L+Q Sbjct: 122 LFYTGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQ 181 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 KHI QRDL L+D+L +L+ + SQ+ +L++YI+ G+ + F+ EL +R+PQH Sbjct: 182 KHIHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHG 241 Query: 242 ERIMTIAERIHNDGYIKGEQ--------------------RILRLLLQNGADPEWIQKIT 281 + +MTIA+++ G KG Q +I R +LQN D + K+T Sbjct: 242 DALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMT 301 Query: 282 GLSAEQMQALR 292 GL+ + + +R Sbjct: 302 GLTEDDLAQIR 312 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 352 bits (904), Expect = 8e-96, Method: Composition-based stats. 
Identities = 183/294 (62%), Positives = 232/294 (78%), Gaps = 5/294 (1%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT TTSTPHDA+FK+FL HPDTARDF++IHLP LR+LCDL +LKLE SF+DE LR Sbjct: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SD+LWSVKT+EG GYIYVVIEHQS+ + MAFR+MRYS+A MQ H++ ++ LPLV+P Sbjct: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 MLFYHG RSPYP+SLCWLDEFA+P ARK+Y++AFPLVD+TVVPDDEI+QHR++ALLELI Sbjct: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIRQRDL+GL+DQ+V LLVT ND Q+ AL NY+L TGD RF FI E+ R PQ Sbjct: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 Query: 241 RERIMTIAERIHNDGYIKGE----QRILRLLLQNGADPEWIQKITGLSAEQMQA 290 +E++MTIA+R+ +G ++G+ RI + +L G D E + +T LS + + A Sbjct: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 346 bits (889), Expect = 4e-94, Method: Composition-based stats. 
Identities = 154/311 (49%), Positives = 209/311 (67%), Gaps = 21/311 (6%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 +T TPHDA F+ FLT P+ ARDFME+HLP +LR +CDL +LKLES SFV++ LR Sbjct: 3 KKNSTPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 SD+L+S+ T EG+GY++V+IEHQS D HMAFRL+RY++A MQRH+E LPLVIP+ Sbjct: 63 SDVLYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAG-HAKLPLVIPV 121 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 LFY G RSPYP+S WLDEF DP A KLY+ AFPLVDVTV+PDD+I++HR +A L L+Q Sbjct: 122 LFYVGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQ 181 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 KHI QRD+ L D+L LL+ + + Q+ AL++Y+L G+ A F+ EL +R+PQH Sbjct: 182 KHIHQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHG 241 Query: 242 ERIMTIAERIHND--------------------GYIKGEQRILRLLLQNGADPEWIQKIT 281 + +MTIA+++ G KG+ + R LL+ G E +Q+ T Sbjct: 242 DALMTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEAT 301 Query: 282 GLSAEQMQALR 292 GLS + + +R Sbjct: 302 GLSEDDLAQIR 312 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 341 bits (876), Expect = 1e-92, Method: Composition-based stats. 
Identities = 139/307 (45%), Positives = 191/307 (62%), Gaps = 17/307 (5%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M+ T TPHDA+F+ FL TA+DF +I LP D++ LCD ++LK ES SF+D ++ Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SV DGY+Y +IEHQS D MA+RLMRYSMA MQRH+E LPLV P Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAG-HDKLPLVFP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFY G +SP+P+S WLD F P A K+Y+ F L+DVT + DD I+QHRR+ALLELI Sbjct: 120 VLFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR+RD+ L+D +V LL D+Q+ ++NY++ G+ A FI+E+ +R +H Sbjct: 180 QKHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKH 239 Query: 241 RERIMTIAERIHND--------GYIKGEQ--------RILRLLLQNGADPEWIQKITGLS 284 E +MTIAE + + G +G Q +I R +L G + ++ TGLS Sbjct: 240 EEALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLS 299 Query: 285 AEQMQAL 291 + L Sbjct: 300 DNALDNL 306 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 337 bits (864), Expect = 3e-91, Method: Composition-based stats. 
Identities = 141/302 (46%), Positives = 197/302 (65%), Gaps = 9/302 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T TPHDA+FK FL+ +TA+DF +I LP +++ LCDLDSLK+ES SF+D +++ Sbjct: 7 MTKKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNY 66 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SV T +G GYIYV+IEHQS D +A+RLMRYS+A MQ+H+E +Q LPLV P Sbjct: 67 QSDILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQ-LPLVFP 125 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFY G +SP+P+S WLD F D A +YN F L DVT + D EI+QH+R+ALLEL+ Sbjct: 126 ILFYCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELL 185 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR+RD+ L+D +V LL D+Q+ + NY++ G+ R EFI+ + ++ +H Sbjct: 186 QKHIRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKH 245 Query: 241 RERIMTIAERIHNDGYIKGEQRI--------LRLLLQNGADPEWIQKITGLSAEQMQALR 292 +MTIA++I G KG Q+ + L NG D ++ TGLS E++ Sbjct: 246 EGALMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNKFE 305 Query: 293 QP 294 Sbjct: 306 NQ 307 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 334 bits (857), Expect = 2e-90, Method: Composition-based stats. 
Identities = 153/334 (45%), Positives = 206/334 (61%), Gaps = 44/334 (13%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T TPHDA+FK FL+H DTARDF+EIHLP LR +CDLD+L+LES SF+++ LR Sbjct: 1 MKRKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVH 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SDIL+S+KT +G+ Y+Y VIEHQS D MAFRLMRYS++ MQ H+E + LPLVIP Sbjct: 61 YSDILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQG-HKKLPLVIP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG PYPWS W D F A ++Y++AFPLVDVTV+PDDEI+ H+RVALLE++ Sbjct: 120 VLFYHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIV 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIRQRD+ L +L +L + + ++LNYILL GD A FI +L + P++ Sbjct: 180 QKHIRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKY 239 Query: 241 RERIMTIAERIHNDGYIKGEQR-------------------------------------- 262 E +MTIA+++ + G+ +G + Sbjct: 240 EEVLMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEK 299 Query: 263 -----ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 I R L+ NG D E I K TGLS +++ + Sbjct: 300 RASLKIARALMDNGIDRETIMKSTGLSQNELEQI 333 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 332 bits (852), Expect = 9e-90, Method: Composition-based stats. 
Identities = 165/312 (52%), Positives = 215/312 (68%), Gaps = 25/312 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT TTS+PHDA+FKTF+ P+TARDF+EIHLP+ LR+LC+L +L+LE SF+++ LRA Sbjct: 1 MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SD+LWSV+T EGDGYIY VIEHQS + +MAFRLMRY+ A MQRH++ +PLV+P Sbjct: 61 YSDVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKG-YDRVPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG SPYP+SL WLDEF DP AR+LY AFPLVD+T+VPDDEI+QHRR+ALLELI Sbjct: 120 LLFYHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR RDL+G++D++ LLV NDSQ+ L NY+L GD +RF FI E+ R P Sbjct: 180 QKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQ 239 Query: 241 RERIMTIAERIHNDGYIKGEQR------------------------ILRLLLQNGADPEW 276 +E +MTIAER+ +G+ G Q I +L+ G + E Sbjct: 240 KEILMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREI 299 Query: 277 IQKITGLSAEQM 288 + T L+ + Sbjct: 300 VLAATQLTDADI 311 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 315 bits (807), Expect = 1e-84, Method: Composition-based stats. 
Identities = 123/323 (38%), Positives = 181/323 (56%), Gaps = 32/323 (9%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + PHD+ FK F++ D ARDF E+HLP ++ LC+ D+LKL SASFVD+ LR+ Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SD+L+SV+T +G GY Y ++EHQS D M +RLM Y+ M +H++ Q LPLV+P Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQG-HQSLPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG++SPYP+S W D F A LY PLVDVTV DDE++ HR+VA +EL+ Sbjct: 120 ILFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELV 179 Query: 181 QKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH R D+ GL ++L +L + + ++NY+ D + + L + + Sbjct: 180 FKHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEK 239 Query: 240 HRERIMTIAERIHNDGYIKGEQRILR------------------------------LLLQ 269 H+E +M IA+R+ N+G KG ++ + + L+ Sbjct: 240 HQETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLK 299 Query: 270 NGADPEWIQKITGLSAEQMQALR 292 G + I +ITGLS + ALR Sbjct: 300 LGLSVDIISQITGLSPSDIHALR 322 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 313 bits (802), Expect = 5e-84, Method: Composition-based stats. 
Identities = 152/312 (48%), Positives = 201/312 (64%), Gaps = 27/312 (8%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 N TT TPHDA F++FL +PD ARDF+E+HLP + R+LCDL +LKLE A+FV+ L Sbjct: 5 KNTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYA 64 Query: 62 SDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDILWSVKT G DGY+Y +IEHQS E+++M FR++RYS+A MQRH+E K LPLVIP Sbjct: 65 SDILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQHK--TLPLVIP 122 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG RSPYP+S+ WLD F +P A K+Y FPLVD+TVV D+EI+ HRR+A L L+ Sbjct: 123 VLFYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLL 182 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 KHIRQRD++ +D LV L + ++ QIT L NY+L G E EF+ L +R+PQH Sbjct: 183 MKHIRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNYLLN-GSEHVTVEFLQTLAQRLPQH 240 Query: 241 RERIMTIAERIH---------------------NDGYIKGEQRILRLLLQNGADPEWIQK 279 + IMT+AER+ K + I R L G I + Sbjct: 241 EDSIMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKA-REIARELRNAGMPAAQICQ 299 Query: 280 ITGLSAEQMQAL 291 +TGLS +++ + Sbjct: 300 LTGLSEAELKNI 311 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 312 bits (800), Expect = 1e-83, Method: Composition-based stats. 
Identities = 136/318 (42%), Positives = 199/318 (62%), Gaps = 26/318 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T HDALFK FLTHP+ ARDF +HLP ++ LCDL +L+LE ASFV+ +LR L Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-DKRQPLPLVI 119 HSD+L+SV+ EG+GYIY +IEHQS+ D M FRLM Y+M+ + H++ + LPLV+ Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P LFY GS PYP+S+ WLD FADP A++LY +FPLVD++V+ D+EI+ H+ +ALLEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 180 IQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 +QKHIR RD LM ++ + ++ ++ Q+ +++ YI G + F S+L P Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSP 240 Query: 239 QHRERIMTIAERIHNDGYIKGE------------------------QRILRLLLQNGADP 274 +++ + TIAE++ G KG +++ R LLQ G D Sbjct: 241 EYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLLQQGVDL 300 Query: 275 EWIQKITGLSAEQMQALR 292 I + TGL+ E++++L+ Sbjct: 301 NIIMQCTGLTREKIESLK 318 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 310 bits (795), Expect = 3e-83, Method: Composition-based stats. 
Identities = 126/324 (38%), Positives = 183/324 (56%), Gaps = 33/324 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT + PHD+ FK F++ D ARDF EI+LP ++ LC+LD+LKL SASF+D+ LR+ Sbjct: 5 MTMQLIARPHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSR 64 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SD+L+SV+T +G GY Y+++EHQS D M +RLM Y+ M +H++ LPLV+P Sbjct: 65 FSDMLYSVQTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGN-NALPLVVP 123 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG +SPYP+S W D F A LY PLVDVTV DDEIV HR+VA +EL+ Sbjct: 124 ILFYHGKQSPYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELV 183 Query: 181 QKHIRQRDLMGLI-DQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH RD + ++ ++L ++ + + ++NY+ D + + + L + Sbjct: 184 LKHSTLRDDLIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEG 243 Query: 240 HRERIMTIAERIHNDGYIKG-----------------------EQRILRL--------LL 268 ++E +MTIA+R+ N+G KG EQ I R L Sbjct: 244 YQETVMTIADRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSL 303 Query: 269 QNGADPEWIQKITGLSAEQMQALR 292 G + I KITGL ++QA+R Sbjct: 304 DLGLSIDIISKITGLPHSEIQAMR 327 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 302 bits (774), Expect = 9e-81, Method: Composition-based stats. 
Identities = 142/293 (48%), Positives = 196/293 (66%), Gaps = 10/293 (3%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++TPHDA+FK FL H +TARDF+EIHLP +LRELCDL++L LES SF++E L+ +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +SV+ + GY++VVIEHQS+ D MAFR+MRYS+A M RH+E D LPLV+P+LFY Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-HDKLPLVVPILFYQ 123 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G +PYP S+CW D F P AR++YN+ FPLVD+T+ PDDEI+QHRR+A+LEL+QKHIR Sbjct: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 QRDLM L++QLV L+ + SQ+ A+ NY+L G + + F L R E +M Sbjct: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGESMM 242 Query: 246 TIAERIHNDGYIKGEQRI--------LRLLLQNGADPEWIQKITGLSAEQMQA 290 T+A+ G KG Q+ + LL G E + ++ L ++ Sbjct: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDK 295 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 298 bits (762), Expect = 2e-79, Method: Composition-based stats. 
Identities = 127/300 (42%), Positives = 185/300 (61%), Gaps = 14/300 (4%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 S PHDALFK FL+H AR F+EIHLP+ +RE CDLD L++ +F++ L AL+SD Sbjct: 2 SVVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYSD 61 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 +L S+KT +G+GYIY +IEHQS D HM R+MRY++A +QRH++ +PLVIP+LF Sbjct: 62 VLLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHD-VPLVIPILF 120 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 Y G SPYP+S+ WL+ F +P A++++ +FPLVDVTV+PD+EI+ HR VA LE+ K Sbjct: 121 YQGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKI 180 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 IR RD++ ID + LL + +D I + Y+L G+ + + L + PQ + Sbjct: 181 IRLRDILENIDPMATLLALDYNDDLSIDVVF-YLLRYGNTDDREKIVKILIQAKPQLEGK 239 Query: 244 IMTIAERIHNDGYIKGEQRI------------LRLLLQNGADPEWIQKITGLSAEQMQAL 291 IMTI E+ + +G Q + +L+ D I K+TGLS +++ L Sbjct: 240 IMTIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQL 299 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 293 bits (751), Expect = 4e-78, Method: Composition-based stats. 
Identities = 141/295 (47%), Positives = 201/295 (68%), Gaps = 3/295 (1%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++TPHDA+FK FL H +TARDF++IHLP +LRELCDLD+L LES SF++E L+ +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +SV+ + GY++VVIEHQS+ D MAFR+MRYS+A M RH+E D LPLV+P+LFY Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEAD-HDKLPLVVPILFYQ 123 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G +PYP S+CW D F P AR++YN+ FPLVD+T+ PDDEI+QHRR+A+LEL+QKHIR Sbjct: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 QRDLM L++QLV L+ + SQ+ A+ NY+L G + + F L R + +M Sbjct: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDR-ETGGKSMM 242 Query: 246 TIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAE-QMQALRQPLPERE 299 T+A+ G KG ++ + ++ G + Q +S E ++ L + +P + Sbjct: 243 TLAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPRED 297 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 283 bits (724), Expect = 6e-75, Method: Composition-based stats. 
Identities = 135/316 (42%), Positives = 186/316 (58%), Gaps = 25/316 (7%) Query: 1 MTNFTTSTP--HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 M N HD LFK FL PDTARDF+ +HLP D+R LD+LKLE SFVD+KLR Sbjct: 1 MDNEKGHNRPGHDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLR 60 Query: 59 ALHSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 LHSD+L+SV+T EG GYIY ++EHQS D MA+R+MRYSMAVM H++ LP+ Sbjct: 61 ELHSDVLYSVETAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGN-GTLPV 119 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 V+P+LFY G PYP+S W+D F P AR++Y+ +PLVDV+V+ D ++ HRR+ALL Sbjct: 120 VVPLLFYQGMVRPYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALL 179 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR-FNEFISELTRR 236 EL+Q+ IR RD L+ +V L+ +Q+ A+L YI+ G + F+ EL Sbjct: 180 ELVQRDIRHRDAASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGE 239 Query: 237 MPQHRERIM-TIAERIHN-------------------DGYIKGEQRILRLLLQNGADPEW 276 +P+++E IM TIA+++ + K LL NG E Sbjct: 240 IPEYKELIMGTIAQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEV 299 Query: 277 IQKITGLSAEQMQALR 292 + K TGL+ E ++ R Sbjct: 300 VIKSTGLNRETLEQPR 315 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 283 bits (723), Expect = 7e-75, Method: Composition-based stats. 
Identities = 114/328 (34%), Positives = 180/328 (54%), Gaps = 38/328 (11%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + HDA FK F+ + A+DF IHL +L+ CD +LKL+++SF+D KLR+ Sbjct: 1 MNKPLLISSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSR 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SVKT++G+ IY +IEHQSR D +A+R+M Y+ M +H++ LPLV+P Sbjct: 61 MSDILYSVKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQG-YTSLPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG R PYP+S+ WLD F T A +LY F L+D+ + D+ ++ HR+ A++E+ Sbjct: 120 ILFYHGKRKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIA 179 Query: 181 QKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH+ DL L L + + +D A++ Y+ D A F I+++ ++ Sbjct: 180 MKHVNSCDDLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDN 239 Query: 240 HRERIMTIAERIHNDGYIKGE------------------------------------QRI 263 HRE IM IA R+ N G+ G+ ++ Sbjct: 240 HRETIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQL 299 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQAL 291 + LL+ + E+I++ITGLS ++++ L Sbjct: 300 AKNLLKENVELEFIERITGLSIQELKIL 327 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 279 bits (714), Expect = 8e-74, Method: Composition-based stats. 
Identities = 108/305 (35%), Positives = 171/305 (56%), Gaps = 19/305 (6%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 HDA+FKTF T + A F+ I+LPK +++ CD +LK+E SFVD L+ HSDIL Sbjct: 5 IHNAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDIL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +S+K GY+Y+ +EHQS + M FR+ RY +A+MQ+H+ + LPLVI MLFYH Sbjct: 65 YSLKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGN-KKLPLVISMLFYH 123 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G + YP+ L +D D A+ + L+D+ V+PD+EI +H+++A LE++QKHI Sbjct: 124 G-KGQYPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQKHIF 182 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 RDL + D +V L+ + L+ Y+L+ G+ A N+ I +L + + + E IM Sbjct: 183 TRDLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKL-KTIEDYEEDIM 241 Query: 246 TIAERIHNDGYIKGEQR----------------ILRLLLQNGADPEWIQKITGLSAEQMQ 289 A+++ G +G I + L+ G ++IQ +T LS ++ Sbjct: 242 NAAQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEVL 301 Query: 290 ALRQP 294 +L + Sbjct: 302 SLVEE 306 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 274 bits (702), Expect = 2e-72, Method: Composition-based stats. 
Identities = 136/306 (44%), Positives = 177/306 (57%), Gaps = 55/306 (17%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDA+FK FL+ ARDF+ IHLP +RE CD ++L+LESASF+DEKLRA SD+L+S+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 T G GYIY VIEHQSR + MAFRL+RY +A MQ+H++ LPLV+P+LFYHG Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQG-HDRLPLVVPLLFYHGRSR 122 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDL 189 PYP+SL WLD FA P A+ LY FPLVD+TV+PDDEI HRR+ALLEL+QKHIR RD+ Sbjct: 123 PYPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRDM 182 Query: 190 MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAE 249 + L ++ +L A + +E IMTIAE Sbjct: 183 LELAREIGLLFERWAAP------------------------------LSIGQEDIMTIAE 212 Query: 250 RIHNDGYIKGEQR------------------------ILRLLLQNGADPEWIQKITGLSA 285 ++ G+ +G QR I R LL G D +Q+ T L Sbjct: 213 QLKKMGFDEGIQRGIQQGLAQGLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLET 272 Query: 286 EQMQAL 291 E+++ L Sbjct: 273 EELEQL 278 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 273 bits (699), Expect = 5e-72, Method: Composition-based stats. 
Identities = 110/272 (40%), Positives = 163/272 (59%), Gaps = 25/272 (9%) Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 L +L + S SF+++ L + SD+L+S+K+ GD YIY +IEHQS + MAFRL+RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 102 VMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT 161 M RH+E + +Q LP+VIP+LFYHGS SPYP++ WLD FAD A +Y AFPLVDVT Sbjct: 63 AMHRHLEQENKQ-LPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVT 121 Query: 162 VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG 221 + D+EI++HRR+AL+E++QKHIR R+++ L +L LL + Q L+ Y++L G Sbjct: 122 AMEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAG 181 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ-------------------- 261 + F+ L + P +RE +MTIAE++ G KG Q Sbjct: 182 NTTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKK 241 Query: 262 ----RILRLLLQNGADPEWIQKITGLSAEQMQ 289 +I R L NG + + ++ TGL+ + Sbjct: 242 QATLKIARQFLVNGVERDIVKMSTGLTDRDIN 273 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 264 bits (675), Expect = 3e-69, Method: Composition-based stats. 
Identities = 119/307 (38%), Positives = 175/307 (57%), Gaps = 24/307 (7%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 STPHD LFK F AR+F EIHLP + ++ SLK+ SF+D+ L+ HSD++ Sbjct: 3 ISTPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDMV 62 Query: 66 WSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +S +T G +GY+Y V+EHQS +D MAFR+ +YS+AVMQ+H++ LPLV+P+LFY Sbjct: 63 YSFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQG-HDTLPLVLPVLFY 121 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 HG +SPYP S+ W D F + AR L + FPLVDVT++P++EI++H ++ LE+ QK + Sbjct: 122 HGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMV 181 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 RD+M + L+ L ND +LL Y+ G+ A F L+ RE + Sbjct: 182 HTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSSTT--QRENV 239 Query: 245 MTIAERIH--------------------NDGYIKGEQRILRLLLQNGADPEWIQKITGLS 284 MTIAE + +G +G + I + LL NG + ++ TGLS Sbjct: 240 MTIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLS 299 Query: 285 AEQMQAL 291 + + L Sbjct: 300 EDSLNKL 306 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. 
Identities = 111/304 (36%), Positives = 163/304 (53%), Gaps = 19/304 (6%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 TS HDA F+ L P ARDF+E L + C+LD+++LE +FV E LR Sbjct: 4 KVNKTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSA 63 Query: 62 SDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D+L S+KT +G DGYIY +IEHQS D + R+MRY +AVM++HIE K P+VIP Sbjct: 64 CDVLLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEEHKCA--PVVIP 121 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLY--NAAFPLVDVTVVPDDEIVQHRRVALLE 178 +LFYHG++ PYP+ + W+D DP R++Y F LVDV+ + DDEI + R+A L Sbjct: 122 VLFYHGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALM 181 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 K D++ LI + + L + + + +L Y+L + F E ++ P Sbjct: 182 FTMKSGTSGDVIELIGKSI-TLTDKYGSSVHLNTVLTYLLELY-QMDFAELSEAVSTHYP 239 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILR------------LLLQNGADPEWIQKITGLSAE 286 H+ IMTIAE++ G KG ++ L ++ Q G E I+ L+ E Sbjct: 240 SHKGVIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDE 299 Query: 287 QMQA 290 Q+ Sbjct: 300 QLLQ 303 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. 
Identities = 84/301 (27%), Positives = 162/301 (53%), Gaps = 15/301 (4%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T HD LFK L+ A F++ L ++ +L ++++L+L SFV + R +HSDI Sbjct: 4 TIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIHSDI 63 Query: 65 LWSVKTREGDGYIYVVIEHQSR-EDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 ++ + E GYI+ ++EH+S MAFR ++Y+++ M ++ + LP+V+P+ Sbjct: 64 VYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGN-KKLPIVLPICV 122 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 YHG +SPYP S D F + AR++ F L+D+TV+ D+E+ + L+E++ KH Sbjct: 123 YHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLLKH 182 Query: 184 IRQRDLMGLID---QLVVLLVTECANDSQITALLNYILLTGDEA--RFNEFISELTRRMP 238 R ++ + ++ + + L+ + + + I T DE+ + + L+ P Sbjct: 183 SRAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLSTAFP 242 Query: 239 QHRERIMTIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQA 290 + + +MT A+++ +G +G ++ I + LL +G + +Q++TGLS +++ Sbjct: 243 EEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEVMN 302 Query: 291 L 291 L Sbjct: 303 L 303 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 250 bits (639), Expect = 4e-65, Method: Composition-based stats. 
Identities = 112/282 (39%), Positives = 151/282 (53%), Gaps = 20/282 (7%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 TT TPHDA F+ FLT PD ARDFME+HLP +LR +CDL +LKLES SFV++ LR Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIP 120 SD+L+S+KT GD I++ + S+ ++ F + A MQRH+E + LPLVIP Sbjct: 63 SDVLYSLKTTAGDD-IFMSWLNTSQHLTNICFPPDTLCVGAAMQRHLEAG-HKKLPLVIP 120 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF-PLVDVTVVPDDEIVQHRRVALLEL 179 +LFY G RSPYP+S WLDEF D R+ LVDVTV+PDDEI HR +A L L Sbjct: 121 VLFYTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTL 180 Query: 180 IQKHIR-----QRDLMGL---IDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFIS 231 + ++I Q L G ++ V + A N R + Sbjct: 181 LPENIFISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNI--------RRRSLCT 232 Query: 232 ELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 QH + +MTIA+++ G KG Q + ++ G Sbjct: 233 RTGTACAQHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRS 274 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 249 bits (637), Expect = 7e-65, Method: Composition-based stats. 
Identities = 90/240 (37%), Positives = 147/240 (61%), Gaps = 2/240 (0%) Query: 25 RDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQ 84 + F IHLP++L+ CD +L+L+++SF+D KLR+ SDIL+ VKT+EGD IY++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 85 SREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADP 144 SR D +A+R+M Y+ M +H++ + LPLV+P+LFYHG + PYP+ + W++ F Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQG-YKSLPLVVPILFYHGKKKPYPFPVNWMECFPLS 124 Query: 145 TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ-RDLMGLIDQLVVLLVTE 203 + A +Y+ F L+D+T + DD ++ H++ A++E+ KH+ DL + L + + Sbjct: 125 SLANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQK 184 Query: 204 CANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRI 263 D A++ Y+ D + F I+++ R+ HRE IM IA R+ N G+ G Sbjct: 185 NCRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEG 244 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. 
Identities = 95/303 (31%), Positives = 168/303 (55%), Gaps = 11/303 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 S PHD L K L+HP+ ++F + + P D+ + DL SLKL + S+V E+LR H+ Sbjct: 6 KNDLSNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHN 65 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-DKRQPLPLVIPM 121 D+++S + GY + V+EHQS D MA R ++Y++A+++ +I+ ++ P P+++ + Sbjct: 66 DLVFSFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNI 125 Query: 122 LFYHG-SRSPYPWSLCWLDEFADPTTARKL-YNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 YH + PYP+S D F DP TA+ L F L D+ P++ + QH + L+E Sbjct: 126 CLYHNANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEK 185 Query: 180 IQKHIRQRDLMGLIDQLVVLLVT--ECANDSQITALLNYILLTGDEARFN-EFISELTRR 236 + K+ R RD+ +I++ + D T L+ + G E + + +S Sbjct: 186 LLKYSRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEV 245 Query: 237 MPQHRERIM-TIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + ++ E IM TIA+ I G ++G++R I + +L+ G + +I++ITGLS + ++ L Sbjct: 246 LSKNEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIEKL 305 Query: 292 RQP 294 +Q Sbjct: 306 KQE 308 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. 
Identities = 78/328 (23%), Positives = 144/328 (43%), Gaps = 42/328 (12%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 PHD FK AR F++ +LP+++ L DL+++ + S++D++L+ SD+L Sbjct: 4 IHNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFSDLL 63 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + VK + +GY+Y + EH+S +A +L++Y + + + ++ K LPL+IPM+ YH Sbjct: 64 FQVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMVVYH 123 Query: 126 GSRSPYP--WSLCWLDEFADPTTA--RKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 G +D + A + + + L D++ D E+V + + ++ Sbjct: 124 GQEKWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIILRTM 183 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQ------ITALLNYILLTGDEARFNEFISELTR 235 + I +D + L LL++ + Q L+ YIL T + Sbjct: 184 RDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEIAKE 243 Query: 236 RMPQHRERIMTIAERIHNDGYIKG--------------------------------EQRI 263 + E +MTIAE++ +G KG + + Sbjct: 244 VSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETKLEV 303 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQAL 291 R LL G + + + K TGLS E+++ L Sbjct: 304 ARNLLGLGIEMDKVAKATGLSEEEIRKL 331 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 226 bits (576), Expect = 8e-58, Method: Composition-based stats. 
Identities = 107/198 (54%), Positives = 144/198 (72%), Gaps = 1/198 (0%) Query: 96 MRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF 155 MRY++A MQ H++ + LP+V+P+LFYHG SPYP+SLCWLD FADP AR+LY +AF Sbjct: 1 MRYAIAAMQNHLDAG-YKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAF 59 Query: 156 PLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLN 215 PL+DVT++PDDEI+ HRR+ALLELIQKHIRQRDLMGL++Q+ LL + AN QI L N Sbjct: 60 PLIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFN 119 Query: 216 YILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPE 275 YIL TGD RFN+FI + +R P+H+ +MTIAER+ +G I +++L++G Sbjct: 120 YILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLRQEGEQSKALHIAKIMLESGVPLA 179 Query: 276 WIQKITGLSAEQMQALRQ 293 I + TG+S E++ A Q Sbjct: 180 DIMRFTGVSEEELAAASQ 197 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 67/307 (21%), Positives = 136/307 (44%), Gaps = 21/307 (6%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 PHD FK + A+DFM +LP +L ++ D+++L E ++++ L+ SD+L Sbjct: 4 IHQPHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLL 63 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + +GY+Y + EH+S +A +L+ Y + + +K++ +P++IPM YH Sbjct: 64 FKANINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYH 123 Query: 126 GSRSPYPWSL--CWLDEFADPTTA--RKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 G + ++ + + + + + + D++ DDE+ ++ ++ I Sbjct: 124 GKENWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKIL 183 Query: 182 KHIRQRD-----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 + I + D + +++ L + + YIL E + Sbjct: 184 RSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLVKEV 243 Query: 237 MPQHRERIMTIAERIHNDGYIKG------------EQRILRLLLQNGADPEWIQKITGLS 284 + + IMTIAE + +G KG ++ + R L+ G + + + K TGLS Sbjct: 244 SVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGLS 303 Query: 285 
AEQMQAL 291 E++ L Sbjct: 304 EEEINKL 310 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 86/303 (28%), Positives = 149/303 (49%), Gaps = 31/303 (10%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 +N + HD LFK ++ P AR+F+E +LP + +L+S+K+E SFV E LR Sbjct: 33 SNTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRL 92 Query: 62 SDILWSVKTREGD--------------GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI 107 SD+++SV + + Y+YV+IEHQS D +AFRL +Y + + +RH Sbjct: 93 SDVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHK 152 Query: 108 E---------HDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLV 158 + +K LPL+ P++ Y + PY + + F D TA+ + + LV Sbjct: 153 DANNNKSSVTKEKDNKLPLICPIVVY-ANDKPYNAPRSFWELFEDSKTAKDMMGDEYLLV 211 Query: 159 DVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYIL 218 D+ DDEI + + + ++E + KHI+ RD++ L L+ + D + + L Sbjct: 212 DLQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWL 271 Query: 219 LTGDEARFNEFIS-ELTRRMPQH------RERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 L +A+ +E EL + +H E + TIA++ ++G KG + +++ G Sbjct: 272 LWYSDAKVSEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKGMVQGMQIGEARG 331 Query: 272 ADP 274 Sbjct: 332 MQI 334 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. 
Identities = 76/287 (26%), Positives = 124/287 (43%), Gaps = 21/287 (7%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 TS PHDALF+ HP A + LP++L L D L+ + V L + Sbjct: 12 ESVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRT 71 Query: 63 DILWSVKTRE---GDG--YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 D+L+S GDG +Y+ IEHQSR D M R++ Y + + +RH + LP Sbjct: 72 DLLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHG-GALPP 130 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADP-----TTARKLYNAAFPLVDVTVVPDDEIV--- 169 V ++ H ++ + ++ F +P A L + D+ D E+ Sbjct: 131 VFCVVLSHAAKG-WTGPRSLVELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARH 189 Query: 170 QHRRVALLELIQKHIRQRD-----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEA 224 H AL + + R + L+ DQ++ LL + + + LL Y+ L G E Sbjct: 190 AHPLPALTLWLLRDARSPERLVHRLLDWRDQIIALLDYDH-GERDLAQLLRYVALVGSEM 248 Query: 225 RFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 F EF + +P+ MTIAE++ + +G ++ R + G Sbjct: 249 DFEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREG 295 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 220 bits (562), Expect = 3e-56, Method: Composition-based stats. 
Identities = 86/298 (28%), Positives = 163/298 (54%), Gaps = 19/298 (6%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD+L K +T A++F+E +LP+D ++L DL + +E S+++E L +SDI++ ++ Sbjct: 7 HDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGIE 66 Query: 70 TRE-GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSR 128 T+E G G++Y++IE QS D A RL +Y++ + +RH +KR LPLV ++ Y+G + Sbjct: 67 TKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERH--KEKRNKLPLVYNLVIYNGKQ 124 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 Y D F + A+KL + LVD+ + D+EIV+ + + +L+ I KHI +RD Sbjct: 125 -VYNAPRNLWDLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERD 183 Query: 189 LMGLIDQLV-----VLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM-PQHRE 242 ++ L +Q + V+++ + + + L Y + + + + + PQH++ Sbjct: 184 MIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKD 243 Query: 243 RIM-TIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 IM TIA+ ++G +G++ I + + G I ++TGL ++++ Sbjct: 244 NIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSI 301 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 220 bits (560), Expect = 5e-56, Method: Composition-based stats. 
Identities = 90/306 (29%), Positives = 145/306 (47%), Gaps = 21/306 (6%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 + +PHDA+F+ L P A + LP L DLD L + S VD LR H+D Sbjct: 2 SSPPSPHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRHTD 61 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPML 122 +L++ + +IYV++EHQS D MAFR++RY + V R++ +H K LP V+P++ Sbjct: 62 LLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVPLV 121 Query: 123 FYHGSRSPYPWSL--CWLDEFADPTTAR--KLYNAAFPLVDVTVVPDDEIVQHR------ 172 +H + + +D D A L F L D+ V + E+ + Sbjct: 122 VHHNEHAWVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHSVR 181 Query: 173 -RVALLELIQKHIRQ-RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 + LL+++ + R +DL +D+L +L + LL YI L G+ +E Sbjct: 182 LTLLLLKIVPGNPRLAQDLRPWVDELRAVL-DGPDGREEFATLLRYIELVGEADARDELH 240 Query: 231 SELTRRMPQHRERIMTIAERIHNDGYIKG--EQRILRLL----LQNGADPE-WIQKITGL 283 + P+ + MTIAE + +G ++G E R+ LL L+ G PE + + Sbjct: 241 DLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVESLLQLLTLKFGPLPEAALAAVHDA 300 Query: 284 SAEQMQ 289 SA Q+Q Sbjct: 301 SAGQLQ 306 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. 
Identities = 73/273 (26%), Positives = 123/273 (45%), Gaps = 15/273 (5%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDALFK + + A + LP L D +L+L SFVDE L+ SD+L+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYHG 126 E +Y++ EHQS + MAFRL+RY + + + H+ EH + LP ++P++ +H Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHH- 130 Query: 127 SRSPYPWSLCWLDEFADPTTAR-----KLYNAAFPLVDVTVVPDDEIVQHRRVALLELI- 180 S + + + + D AR + F L D++ D+ + A L+ Sbjct: 131 SETGWTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVL 190 Query: 181 --QKHIRQRD----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE-L 233 +H R+ D +G LV + + A+ YIL T + +E + L Sbjct: 191 WCLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLL 250 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRL 266 +E I++ A+++ G +G + LR Sbjct: 251 AAAGEPWKEEIVSAADQLMERGRQQGLREGLRE 283 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 217 bits (554), Expect = 3e-55, Method: Composition-based stats. 
Identities = 98/205 (47%), Positives = 136/205 (66%), Gaps = 9/205 (4%) Query: 91 MAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL 150 M FR++RYS+A MQRH+E K LPLVIP+LFYHG RSPYP+S+ WLD F +P A K+ Sbjct: 1 MPFRMLRYSVAAMQRHLEQHK--TLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKI 58 Query: 151 YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI 210 Y FPLVD+TVV D+EI+ HRR+A L L+ KHIR RD+M L+D+L ++V +D Q+ Sbjct: 59 YTKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMVEI--SDEQV 116 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRL 266 L++YI+ GD EF+ L R+PQH +++MTIAER+ G +G I Sbjct: 117 RVLIHYIVNAGDSVSP-EFMRALAERLPQHEDKLMTIAERLEQKGRQEGALEKALAIACQ 175 Query: 267 LLQNGADPEWIQKITGLSAEQMQAL 291 L + G PE I++ TGLS +++ + Sbjct: 176 LQKMGMTPEQIKQATGLSEAELKNI 200 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 217 bits (552), Expect = 6e-55, Method: Composition-based stats. Identities = 74/316 (23%), Positives = 132/316 (41%), Gaps = 22/316 (6%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T+ +PHDALFK+ P A ++ L + + D +L+ E S++DE L HSD+ Sbjct: 4 TSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERHSDL 63 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S D Y+Y++IEHQS D M R++ Y V RH + LP ++P++ Sbjct: 64 LFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPVVVS 123 Query: 125 HGSRSPYPWSLCWLDE-------FADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 H +P W+ E P + + D+T + D ++ + Sbjct: 124 H---APGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFA 180 Query: 178 ELIQKHIRQR-------DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 L+ +R R D + + + +T + +YI + EF Sbjct: 181 TLVLWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFH 240 Query: 231 SELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQA 290 ++L +PQ RE + T E + +G KG + + G + I+ + + + + Sbjct: 241 AKLDEHVPQTREVMKTYYEELMEEGMAKGLAKG----REEGREQSRIETLQE-TLIDLLS 295 Query: 
291 LRQPLPERERYSWLKS 306 + L E E ++S Sbjct: 296 AKFDLRELEHAERIRS 311 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 216 bits (550), Expect = 8e-55, Method: Composition-based stats. Identities = 80/313 (25%), Positives = 148/313 (47%), Gaps = 28/313 (8%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 + HD + ++ +P +++F E+HLP ++ L + LK+E SFVD++L+ DI Sbjct: 2 SQKPKHDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDI 61 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S K E GY+Y+++EHQS + MA RL RY + + H + K + P + P++FY Sbjct: 62 LFSAKFGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFY 121 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 +G + Y + F + + ++ + L++V +PD+++ + +L+ KHI Sbjct: 122 NGVQK-YNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHI 180 Query: 185 RQRDLMGLIDQLVVLLVTECAND---SQITALLNYILLTGDEARFNEFISELTRRM-PQH 240 +RDL+ +++ LL D I +L Y L + E L ++ P+ Sbjct: 181 HERDLLKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKK 240 Query: 241 RERIM-TIAERIHNDGY----------------------IKGEQRILRLLLQNGADPEWI 277 RE +M +IA G + + + + +++ G E + Sbjct: 241 RENVMKSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESV 300 Query: 278 QKITGLSAEQMQA 290 KIT LS E ++ Sbjct: 301 IKITKLSKEDLEK 313 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. 
Identities = 79/268 (29%), Positives = 143/268 (53%), Gaps = 7/268 (2%) Query: 13 LFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTRE 72 +F+ L +P A +F HLP +++ L D SL +E+ +FV+ L+ SD+L+S K + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 73 GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSRSPY 131 DGY+++++EHQS+ D MAFRL +Y + + +R+ I++ K + LPL+ PM+F++G Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQEKYN 142 Query: 132 PWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMG 191 W D F + A++L+ + LV+V +PD+E Q +LE KHI +R+L+ Sbjct: 143 VARNLW-DLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 192 LIDQ---LVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM-PQHRERIM-T 246 + ++ L + +L Y L ++A + + L+ ++ P+ R+M + Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTKIEQADKIKLKNLLSTKLNPEIGTRLMRS 261 Query: 247 IAERIHNDGYIKGEQRILRLLLQNGADP 274 +AE +G G L++ G Sbjct: 262 LAEHWQQEGKEIGILEGLQVGEAKGIQI 289 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 214 bits (545), Expect = 4e-54, Method: Composition-based stats. 
Identities = 87/303 (28%), Positives = 152/303 (50%), Gaps = 14/303 (4%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASF-------VDEKLR 58 HD++FK + + D A F+ +LPK+L EL D ++KLESA+ D + + Sbjct: 1 MKNVHDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQ 60 Query: 59 ALHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR-QPLP 116 SD+ + K ++G +G ++V IE Q+ +D + R Y + + +I+ K + LP Sbjct: 61 KEQSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLP 120 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 LV+ +++Y ++ P+ SL D FA+ A+K Y +D+ D+EI++H +A Sbjct: 121 LVVSIIYY-ANQKPFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAG 178 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 ELI K IR++++ G +D + + E + L+ Y+ D +F +L Sbjct: 179 YELILKAIREKNIDGKLDIAINQI--EAYDHIARQVLIRYMSQYSD-METKDFHDKLIYS 235 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 P R +MT+AE+ G KG Q R L G E + K TGL + + L++ + Sbjct: 236 KPDLRGDVMTVAEQWEQKGIQKGIQTTARNFLLMGLSAEQVVKGTGLDQDTVLKLKKEVE 295 Query: 297 ERE 299 + + Sbjct: 296 QTQ 298 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 214 bits (544), Expect = 4e-54, Method: Composition-based stats. 
Identities = 91/352 (25%), Positives = 154/352 (43%), Gaps = 62/352 (17%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 + + HD LFK + P A DF+ LP +++ + DL+++K+E SFV+ LR D+ Sbjct: 2 SENLKHDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDV 61 Query: 65 LWSVKTR-EGDGYIYVVIEHQSREDIHMAFRLMRYSMAV-----MQRHIEHDKRQPLPLV 118 L+SVKT+ D +IYV+IE + R D +AF+L +Y++++ +R LP+V Sbjct: 62 LFSVKTKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIV 121 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 +P++ YHG+ + + F DP A++L + + L+D +PD EI + AL+ Sbjct: 122 VPIVVYHGADR-FNAPRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVH 180 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDS-----QITALLNYILLTGDEARFNEFISEL 233 ++ Q D++ L + L D I +LL Y + + L Sbjct: 181 FMKYIHNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQLL 240 Query: 234 TRRMP-QHRERIM-TIAERIHNDGYIKGEQRI---------------------------- 263 + + R+RIM TIA + ++G KG Sbjct: 241 DENLSIEDRDRIMGTIAAQYIDEGKAKGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEG 300 Query: 264 --------------------LRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 R LL+ G E+I + TGLS E++ L+ + Sbjct: 301 RAEGIEIGETKGRAEAAQGLARNLLKAGFSVEFIAENTGLSNEEVVNLKVSM 352 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. 
Identities = 98/338 (28%), Positives = 161/338 (47%), Gaps = 59/338 (17%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDAL K LT A++F+E +LP D +EL DL +K+E SFV++ L+ +SDI++SVK Sbjct: 7 HDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSVK 66 Query: 70 TREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSR 128 TR+ + ++YV+IE QS D +A RL +Y + + +RH + + LPL+ P+L Y+GS Sbjct: 67 TRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH--ENNKNKLPLICPLLIYNGSE 124 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 Y + + F P A+KL + LVD+ DDEI Q + + ++E KHI QRD Sbjct: 125 -VYNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRD 183 Query: 189 LMGLIDQLVVLLVTECANDS-----QITALLNYILLTGDEARFNEFISELTRRMP-QHRE 242 ++ L D+ ++ D + + + Y E + E + + + + ++ Sbjct: 184 MLKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQIIVKHLSTEEKD 243 Query: 243 RIM-TIAERIHNDGYI-------------------------------------------- 257 IM TIA++ ++G Sbjct: 244 NIMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGKAEGKAEGKA 303 Query: 258 --KGEQR--ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 K E+R I R +L G D +I +TGL +++L Sbjct: 304 EGKAEERVEIARKMLSQGCDFSFISSVTGLEEAFIRSL 341 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. 
Identities = 78/306 (25%), Positives = 145/306 (47%), Gaps = 18/306 (5%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK + P+ DF+ P+ +RE D +L E +F DE+L +D+++S Sbjct: 7 NPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHFADLVFS 66 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 V+ + +++EH+S + + F++ RY + + + I+ +QPL V+P+L YHG+ Sbjct: 67 VQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQ--KQPLTPVLPVLVYHGN 124 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI----VQHRRVALLELIQKH 183 R S+ T L + L+D++ + D+ + + R+ + L+Q Sbjct: 125 RRWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAI-LLQNS 183 Query: 184 IRQRDLMGLID---QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 R+R+L L+D +V L A ++ Y+ T + + F +R + Sbjct: 184 RRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVELF-GIFSRISSKI 242 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ-MQALRQPLPERE 299 MT+AE + +G + E+R R++ + E IQ+ L Q M A + L ++E Sbjct: 243 ESSTMTVAEELIQEGR-ELERRQTRMVAE-----ELIQQGRELERRQAMMAAEELLKQQE 296 Query: 300 RYSWLK 305 R + +K Sbjct: 297 RQNKIK 302 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 209 bits (531), Expect = 1e-52, Method: Composition-based stats. 
Identities = 73/314 (23%), Positives = 135/314 (42%), Gaps = 30/314 (9%) Query: 14 FKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREG 73 +K +HP+ RD + + + E D +L+ S S++ E LR D++W V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 74 DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP---LPLVIPMLFYHGSRSP 130 Y+Y+++E QS D MA R+M Y + Q I + P LP V+P++ Y+G + Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 131 YPW-SLCWLDEFADPTTARKLYNAAFPLVDVT-VVPDDEIVQHRR--VALLELIQKHIRQ 186 ++ L E R N A+ L+D V+ D E H R A L ++ + + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 187 RDLMGLIDQLVVLLVTECAN--DSQITALLNYILLTG--------------DEARFNEFI 230 +D++ ++ LV L + +LL D ++ + Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 231 SELTRRMPQHRERIMTIAER------IHNDGYIKGEQRILRLLLQNGA-DPEWIQKITGL 283 +E ++ P+ E R +G +G ++ R L++ G E I + TGL Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 284 SAEQMQALRQPLPE 297 + +++ LR+ + Sbjct: 313 TVAEVEGLREEDTQ 326 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. 
Identities = 79/336 (23%), Positives = 139/336 (41%), Gaps = 44/336 (13%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M +D FK + + +F++ + + + DL SL+ SFV ++ Sbjct: 1 MEQKPPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEK 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D+++ K + D Y YV++E QS D M RL Y + QRHIE K L ++P Sbjct: 61 EADVIYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVP 120 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 ++ Y+G W++ L ++N + LVDV + D+ + R+ LL +I Sbjct: 121 IVLYNGR---SNWNVPTLIFKGWEIFKDDMFN--YFLVDVNNIDDETLKN--RLDLLSVI 173 Query: 181 QKHIRQRDLMGLIDQLVVLLVT--ECANDSQITALLNYILLTGDEARFNEF---ISELTR 235 R R + + + C Q+ ++L E I EL + Sbjct: 174 LYLDRSRKTAKEFIEKLKEVTEYISCLPTEQVKVFAMWLLRVIRPQMMEEVQGEIDELLK 233 Query: 236 RMPQHR-----------ERIM-------------TIAERIHNDGYIKGEQ--------RI 263 R+ Q +R+M E +G ++G+ RI Sbjct: 234 RIEQEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRI 293 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERE 299 R ++ GA+ +I K+TGL E+++ LRQ + ++E Sbjct: 294 ARNMILAGAEDSFISKVTGLDIEKIKELRQNMTDKE 329 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. 
Identities = 80/317 (25%), Positives = 143/317 (45%), Gaps = 21/317 (6%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 N PHD FK + + ARDF++ +LP++ E+ DLD L E+ S VDE LR S Sbjct: 2 NELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLS 61 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D+L+ K + DGYIY+++EH+S + + F+L+RY ++ + + K + +P++IPM+ Sbjct: 62 DMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYD-PKTKKVPIIIPMV 120 Query: 123 FYHGSRSPYPWSL--CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 YHG + + D K Y + + D I + +R+ L + Sbjct: 121 IYHGREIWNVETNLLNMVQGIEDLPNELKTYLPTYRY----EICDFSIKRKKRIIGLTAM 176 Query: 181 QK---------HIRQRDLMGLIDQLVVLLVTECAN--DSQITALLNYILLTGDEARFNEF 229 + + + + + ++ + + Y+L ++ E Sbjct: 177 KVAIEAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEI 236 Query: 230 ISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 + MP E +MTIAE++ N+G KG+ R G + E+ +I LS Sbjct: 237 LKVQKEIMPGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEG-EREFAIRI--LSKRFGN 293 Query: 290 ALRQPLPERERYSWLKS 306 L + + +R R + K+ Sbjct: 294 QLTEEIKDRIREADEKT 310 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. 
Identities = 69/276 (25%), Positives = 143/276 (51%), Gaps = 11/276 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +F PH+A FK F P+ + F++ H+P+++ L DLD+L+++ + FV E+ R ++ Sbjct: 2 SFEIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYA 61 Query: 63 DILWSVKTRE--GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-PLPLVI 119 D++ +V+ + + IY+++EH+S + +++ Y + + Q LP++I Sbjct: 62 DVMVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVII 121 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHRRVALL 177 P++ YHG + + +S + D F P+ + + F + D++ + DDE + + Sbjct: 122 PVVIYHG-KGRWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIF 180 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDS---QITALLNYILLTGDEARFNEFISELT 234 L+ K+I +L + ++ LL T D + A++ Y+ + G + E + E T Sbjct: 181 HLLFKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISL--ERLGEYT 238 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN 270 RR+P E + T A++I + Y + Q ++L++ Sbjct: 239 RRLPGGDEAMQTAAQQIRQEAYNEFIQEQEKMLVER 274 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. 
Identities = 72/336 (21%), Positives = 140/336 (41%), Gaps = 48/336 (14%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M +D FK + +F+ ++ ++ D +SL+ SF+ ++ Sbjct: 1 MQQKVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEK 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D+++ + + D Y YV+IE QS D +M RL Y + +RH+E + LP ++P Sbjct: 61 EADVIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVP 120 Query: 121 MLFYHGSRSPYPWSLCW--LDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 ++ Y+G + + D F D + LVDV + D+++ + + Sbjct: 121 IVLYNGRSGWNIPTQIFKGFDIFKDDM-------FNYILVDVNRLDDEKLKSRLDLLSII 173 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTEC------------------------ANDSQITALL 214 L + R R+ +++L + C +S+I LL Sbjct: 174 LYLEKSR-RNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDELL 232 Query: 215 NYILLTGDEARFNEFISELTRRMPQ-HRERIMTIAERIHNDGYIKG------------EQ 261 I G E EFI + + + + +RE E+ + +G +G E+ Sbjct: 233 KKIEAEGVE-DVGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKEE 291 Query: 262 RILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 I+R L+Q G + +I + TG+ E+++ +R+ E Sbjct: 292 EIVRRLIQKGFNDNFIAEATGVEIERIKKIREEYTE 327 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 199 bits (505), Expect = 1e-49, Method: Composition-based stats. 
Identities = 61/321 (19%), Positives = 132/321 (41%), Gaps = 28/321 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N HD +K +H +T +F+ K+ L + D L L S++ Sbjct: 1 MKNNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEE 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------- 113 SDIL+ + + YV++E QS+ D M RL+ Y + + +++ ++ Sbjct: 61 ESDILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNF 120 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HR 172 LP ++P++ Y+G + + + L D+ D E++ Sbjct: 121 KLPSIVPIVLYNGKNKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNISN 180 Query: 173 RVALLELIQKHIRQRDLMG------------------LIDQLVVLLVTECANDSQITALL 214 ++ + L+ + I +++LM + + + +V D + + Sbjct: 181 MISAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRD-NLQGEI 239 Query: 215 NYILLTGDEARFNEFISELTRRMPQHRER-IMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 + +L ++ + +S L + + + +++ I ++ G +G ++ + ++ G D Sbjct: 240 DDVLEKSNQEEVDFMVSNLGKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAIEMGMD 299 Query: 274 PEWIQKITGLSAEQMQALRQP 294 E I +TGLS EQ+ +RQ Sbjct: 300 NEIIMNLTGLSEEQINTIRQE 320 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 199 bits (505), Expect = 1e-49, Method: Composition-based stats. 
Identities = 60/205 (29%), Positives = 111/205 (54%), Gaps = 2/205 (0%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD F+ L++P AR+F E +LP +++ L +L LE+ SF+D L+ +D+L+S + Sbjct: 56 HDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSAR 115 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSR 128 D YIY++ EHQS D HMAFRL +Y + + ++H I H + P + P++ Y Sbjct: 116 INNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSNDH 174 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 Y L D F + + ++ + L+ + + DD++ ++ +A L+++ K+I + + Sbjct: 175 KKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKPN 234 Query: 189 LMGLIDQLVVLLVTECANDSQITAL 213 + ++ L T A+ S I + Sbjct: 235 VFDKWQEISGCLATIAASSSGIEYI 259 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 197 bits (501), Expect = 4e-49, Method: Composition-based stats. Identities = 64/303 (21%), Positives = 136/303 (44%), Gaps = 18/303 (5%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N + PH+ FK ++ +DF+ I L DL + L SL+L + + Sbjct: 1 MKNKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKH 59 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D+L+ K + + YI ++ EH+S D + +LM+Y+ + + ++ + P +I Sbjct: 60 FLDLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKE--KDYYPPIIN 117 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ--HRRVALLE 178 ++FYHG ++ + + D D + + + L+D+ + D+ + + + V L+ Sbjct: 118 IVFYHG-QAKWNFPTTIPD-IEDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLIM 175 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 + D + I L+ ++ EC+ D + L +L+ D + E E+ Sbjct: 176 EMLIMKHIHDRLERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKEII---- 231 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLL------QNGADPEWIQKITGLSAEQMQALR 292 E++M +++ +G ++G+ ILR + + G + I + + ++ L+ Sbjct: 232 GGEEKMMLFTDKLKMEGKMEGKIEILRENIIDLIDVKFGVVDKSITEKVN-QIDNIETLK 290 Query: 
293 QPL 295 Q L Sbjct: 291 QIL 293 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 65/334 (19%), Positives = 138/334 (41%), Gaps = 44/334 (13%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 P+D ++ L + ++ + + E D D L L + S+V + +D+++ Sbjct: 12 HHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADVVY 71 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-------EHDKRQPLPLVI 119 +KTR + YV++E QS D M FRL+ Y + + + K LP +I Sbjct: 72 RLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPPII 131 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV-ALLE 178 P + Y+G+ S + + L + + L DV ++E+++ + A + Sbjct: 132 PAVLYNGAGSWTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIAGIF 191 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECAND------------------------------- 207 L+ + ++ DL G + +L +L ++ Sbjct: 192 LLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGILNAS 251 Query: 208 --SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHN--DGYIKGEQRI 263 ++ ++ + LT +E + + L + Q + + ++ +G ++G++ + Sbjct: 252 NPWEVERMIYNLELTLEEMQRQALLKGL-KEGEQKGKLEGKLEGKLEGKLEGKLEGKREV 310 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 R LL D E I K TGL+ E++ AL++ + + Sbjct: 311 ARNLLLLNVDIETIIKATGLALEEINALKKQMEQ 344 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. 
Identities = 62/305 (20%), Positives = 139/305 (45%), Gaps = 20/305 (6%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + HDA + + + A D+ +P+++++L D +L+ ++V ++L+ SDI++ Sbjct: 5 TPKHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSISDIVY 64 Query: 67 SVK--TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 + + G+ I +++EH+S D + ++ Y + + + I ++ L+IP+L Y Sbjct: 65 VCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQI--GNKESPSLIIPILLY 122 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI--VQHRRVALLELIQK 182 HG+ ++ L E +P + + + + D+ + D+EI + ++ +A L K Sbjct: 123 HGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASLLAMK 182 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 + +D + + ++ L +E D + L + L G+ +F++ + Q +E Sbjct: 183 YSALKDQLNTLLPTILTLASEV--DRNLHKSLLFYTLVGNPLTEEQFLNLIKSVPNQKKE 240 Query: 243 RIMTIAERIHNDGYIKGEQRI-----------LRLLLQNGA-DPEWIQKITGLSAEQMQA 290 IM I E G+ KG + +R L++ E I ++ + + Sbjct: 241 AIMDIFEIFEEKGWKKGIEEGRAEAEQKIETAVRNLIKQSVLTDEQIASAMNVTTDYVAE 300 Query: 291 LRQPL 295 +R L Sbjct: 301 VRNNL 305 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 76/305 (24%), Positives = 128/305 (41%), Gaps = 17/305 (5%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD+L K D A D LP + E DLD L L SFV ++LR H+D+L+ Sbjct: 6 HDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLLFRAP 65 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYHGSR 128 ++Y+++EHQS + M RL+RY ++ +RH+ EH LP ++P++ +H + Sbjct: 66 LDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLHHSEQ 125 Query: 129 SPYPWSLCWLDEFADPTTAR-----KLYNAAFPLVDVTVVPDDEIVQHRRVA---LLELI 180 + FA AR L F L D++ PD+ ++ A L Sbjct: 126 G-WTAPTSLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKLALWA 184 Query: 181 QKHIRQ-RDLMGL---IDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 K+ R +DL+ L +++ VT + A++ Y L D + Sbjct: 185 LKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTDPDALMRFLIDSA 244 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 +E MT AE++ + ++ G ++ G + +ALR L Sbjct: 245 GDPAKEAFMTGAEKLTQAVREQSLRQGRVEGRVEGRVEGRVE---GRVEGRTEALRTVLS 301 Query: 297 ERERY 301 ++ R Sbjct: 302 KQLRQ 306 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 68/282 (24%), Positives = 122/282 (43%), Gaps = 17/282 (6%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T PHD K L+ PD + LPK++ EL + L +F+D + R Sbjct: 1 MTKIT--QPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREH 58 Query: 61 HSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 +D L+ VKT+EG YIY +IEH+S D +AF+L+RY + + +R ++ + Q LP ++ Sbjct: 59 LTDRLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQ-QKLPPIV 117 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P++ YHG+R + AD L + +F + D+ + DD++ Q + + Sbjct: 118 PLVVYHGAREWTVPNQFSALLEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALM 177 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 K+ Q + + + + +L Y++ T + + P Sbjct: 178 AMKYAFQG--AEGVVVIPQIGKGAQGDPEFAKLVLRYLIQTYRGMTMADVQAYAEEAFPG 235 Query: 240 HRE--------RIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 E +M+ + +G +G + + Q G Sbjct: 236 EAEHYASQFAREMMS---KGRQEGRQEGRREGRQEGRQEGES 274 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 189 bits (481), Expect = 9e-47, Method: Composition-based stats. 
Identities = 68/284 (23%), Positives = 128/284 (45%), Gaps = 12/284 (4%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 + PHD K L++P TA + LP+++ E D +L SF+DE LR +D Sbjct: 2 TEIAHPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLTD 61 Query: 64 ILWSVKT-REGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIPM 121 L+ V+T +YV+IEH+S D+ + ++L++Y + A+ Q E+ + LP ++P Sbjct: 62 RLYRVRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVPF 121 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 +FYHG+ + A+ L N F ++D+ + D ++ + + L Sbjct: 122 VFYHGAAAWKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLAA 181 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 K+ + D + +L++ + A D + L+ Y++ T + R P+ Sbjct: 182 KYATRDDRQLEVKELLIQTLVSVA-DEEFRFLMRYVVETYRSYDEPMVREIIRRVRPEEE 240 Query: 242 ERIMTIA---------ERIHNDGYIKGEQRILRLLLQNGADPEW 276 E +M++ + +G +G Q ++L Q G E Sbjct: 241 ETMMSMFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEEA 284 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. 
Identities = 63/304 (20%), Positives = 129/304 (42%), Gaps = 18/304 (5%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T PHD FK + P + ++I P +L + DL+S++L ++ +K+ ++ Sbjct: 2 TDLQPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNL 60 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+ K ++ ++ EH+S D ++ +L+ Y+ + + E + + P +I ++ Y Sbjct: 61 LYECKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWE---ETGEYEEYPPIINIVLY 117 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL----ELI 180 HG R L + R + L+D++ V D+E++ + L Sbjct: 118 HGKRKWNI--PATLPKTNSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLT 175 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 KHI + DL + ++ V E D + +L+YI + + + E+ Sbjct: 176 MKHIFE-DLRKY--KHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEIL----GG 228 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERER 300 + +MT+ E+ +G +G Q+ + + + IQ G E ++ L + E Sbjct: 229 EKDMMTLTEKWKMEGLQQGLQQGMIEGQKKAI-LKSIQLKFGRVPENIEKLISNINNLEE 287 Query: 301 YSWL 304 L Sbjct: 288 LDKL 291 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 64/280 (22%), Positives = 122/280 (43%), Gaps = 20/280 (7%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD+ FK + P + ++I KD+ + S+ + K + D+L+S Sbjct: 4 QPHDSFFKQIFSDPRRVKTLLDIF-AKDVAKSI--HSITPVNTEKFSSKSQKFMLDLLFS 60 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K ++ D YI +V+EH+S D + +L Y+ A+ + I+ ++ P +I ++FYHG Sbjct: 61 CKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKE--KEYYPPIINIVFYHGK 118 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE----LIQKH 183 + L D + + + L+D+ V DDE++ + + KH Sbjct: 119 GEWNIPTS--LPVLEDQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKH 176 Query: 184 IRQR-DLMGLIDQLVVLLVTECANDSQITAL---LNYI-LLTGDEARFNEFISELTRRMP 238 + + + + + + +V V ++ L NYI + GD + EL Sbjct: 177 VHENIEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAENALKELI---- 232 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQ 278 ++ MT+ E+ +G KG+Q L+ L+ G I+ Sbjct: 233 GGDKKAMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGLIK 272 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. 
Identities = 64/328 (19%), Positives = 134/328 (40%), Gaps = 39/328 (11%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 PHD FK + + A DF+ P ++ + DL +L +++S++DE+L+ SDI Sbjct: 2 EILNPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDI 61 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +++ ++ + I ++ EH+S +LM+Y + + + + + Q L VIP++ Y Sbjct: 62 VYTCFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQ--AQRLIPVIPVILY 119 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIV----QHRRVALLELI 180 HG + E D R + + L D++ ++EI + + + L+ Sbjct: 120 HGKEAWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLL 179 Query: 181 QKHIR----QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 ++I D + ++ + E + + + Y+ D A I L Sbjct: 180 MRNIFDEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAE-KRVIDTLKEI 238 Query: 237 MPQHRERIMTIAERIHNDGYI--------------------KGEQRILRLLLQNGADPEW 276 + + MTIA ++ G I KG L+ ++ G + ++ Sbjct: 239 SEEGGKLSMTIAAKLIEKGKIAGRVEGRAEGRAEGAIEGERKGRIEGLKEAIEIGLELKY 298 Query: 277 IQKITGLSA--------EQMQALRQPLP 296 K L E+++A+++ + Sbjct: 299 GVKGLKLLERIRKIVVIEKLEAIKEAVK 326 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. 
Identities = 71/262 (27%), Positives = 127/262 (48%), Gaps = 8/262 (3%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 TS HD F+ L ARDF+ HLP+++ +LD++K+ S S+V + L+ +DI+ Sbjct: 10 TSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMTDIV 69 Query: 66 WSVK-TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +++ IY+++EH+S D +L +Y V Q I K LP+++P++FY Sbjct: 70 ITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFI-QKKTGTLPIIVPLVFY 128 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAF--PLVDVTVVPDDEIVQHRRVALLELIQK 182 HG+ + + +SL + D F P+ + Y F L +V V+ ++ + + L+ + Sbjct: 129 HGT-ARWNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLVLE 187 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDS--QITALLNYILLTGDEARFNEFISELTRRMPQH 240 +I + I + + LL +I A+L LL + E E + +P+ Sbjct: 188 YIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATDETPEE-AEEKVKHLPKG 246 Query: 241 RERIMTIAERIHNDGYIKGEQR 262 E + T AE + GY K + Sbjct: 247 GETVRTTAEVLEERGYNKAIKE 268 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 180 bits (456), Expect = 7e-44, Method: Composition-based stats. 
Identities = 62/243 (25%), Positives = 124/243 (51%), Gaps = 27/243 (11%) Query: 76 YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSL 135 Y+Y +IE+QS + MAF ++ Y++A+M++H+ Q LP+++ + Y G +SPYP+S Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEG-YQELPIIVNICIYTGKKSPYPYSQ 94 Query: 136 CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQ 195 D F AR+ F L+D++V+ +E+++ +E + + R+RD + I+ Sbjct: 95 DICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINN 154 Query: 196 LVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHN-- 253 VL+ +N +++ YIL T D+ + + + + + +E I+T A+++ Sbjct: 155 NQVLIWELVSNYG--LSIVIYILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVD 212 Query: 254 ----------DGYIKGEQR------------ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +G +G++ I + +L+ G + IQK+TG+S E ++ L Sbjct: 213 IQTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSMLKEGLEISLIQKVTGISREAIEKL 272 Query: 292 RQP 294 + Sbjct: 273 TKE 275 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. 
Identities = 66/271 (24%), Positives = 107/271 (39%), Gaps = 17/271 (6%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDALFK P A LP L + D + E + +D +L D+LW Sbjct: 5 HAHDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWR 64 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + +G G IYV++EHQS + M R+ Y + H D+ PLP +IP++ H Sbjct: 65 TRFVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAE 123 Query: 128 RSPYPWSLCWLDEF-----ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 + + ++F P A + N + D+T V D + L Sbjct: 124 HG-WRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQVDDASLRGRSLPLFQTLALW 182 Query: 183 HIRQ-RDLMGLIDQL------VVLLVTECAND---SQITALLNYILLTGDEARFNEFISE 232 +R RD +++ + + L E ++ I LL Y E +EF + Sbjct: 183 LLRDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFHRK 242 Query: 233 LTRRMPQHRERIMTIAERIHNDGYIKGEQRI 263 L P E +T ++ N G+ +G + Sbjct: 243 LAAFHPPSAEMSLTFEQQAINRGHKRGLEEG 273 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. 
Identities = 59/268 (22%), Positives = 122/268 (45%), Gaps = 7/268 (2%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 +T+ HD+ K FL+ A ++ LP+++ + D + + E S++ + L+ +SD+ Sbjct: 2 STTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDL 61 Query: 65 LWSVKTREGD--GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 + SV T+ G ++ ++EH+S + + +RY + +++ ++ LP++IP+L Sbjct: 62 VVSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPIL 121 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVD-VTVVPDDEIVQHRRVALLELIQ 181 H P + L + + + F L D V P+D AL L Sbjct: 122 IAHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTL-W 180 Query: 182 KHIRQRDLMGLIDQ---LVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 ++ R + M + + L+ + + + +L+Y+ +T DE + + + Sbjct: 181 RYSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEID 240 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRL 266 + E + TIAE +G + EQR L+ Sbjct: 241 EGEEYMGTIAEMFRREGDERTEQRFLQE 268 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. 
Identities = 64/325 (19%), Positives = 133/325 (40%), Gaps = 36/325 (11%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + +D +K ++ ++ ++ + L+L ++V Sbjct: 1 MCSNLPHNVNDLEYKYIFSNKSLFLRLLKRIDRINIFNKLTEEDLELVDKNYVLPDFSEQ 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-------DKRQ 113 SD+L+ + +E + + Y++ EHQS D +MA RL+ Y + + ++ +K Sbjct: 61 ESDLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDIWRDWLKQFDKNQFKNKSF 120 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFAD-PTTARKLYNAAFPLVDVTVVPDDEIVQHR 172 P V+P++ Y G +P+ S+ + + + + + + L+D+ PD+ I +++ Sbjct: 121 KFPPVVPIVLYDGD-NPWTASVNLKERIMNFEVFGKYIVDFEYILIDLND-PDEMIFKYK 178 Query: 173 -RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL-LNYILLTGDEARFNEFI 230 ++L+ + K +++L L L L + + L +L E + E Sbjct: 179 DILSLILKLNKVKTEKELERLFLDLYEYLQGAKEKEINTLKICLPVVLKELGEDKVQE-A 237 Query: 231 SELTRRMPQHRERIM-------TIAERIHNDGYIKGEQ----------------RILRLL 267 ++ + E IM I E +++G KG Q I + Sbjct: 238 KDMLECIDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDGLQQGLQQGLQKKELEIAERM 297 Query: 268 LQNGADPEWIQKITGLSAEQMQALR 292 + G E I +ITGL E+++ LR Sbjct: 298 IVKGYSDEEIHEITGLDIEKIKELR 322 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 173 bits (438), Expect = 7e-42, Method: Composition-based stats. 
Identities = 68/323 (21%), Positives = 134/323 (41%), Gaps = 32/323 (9%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M++ HD+ FK HP ++ + + DS++L FVDE Sbjct: 1 MSSSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQK 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D++ + ++ + Y Y++IE+QS M RL+RY + + + I + LP +IP Sbjct: 61 RADVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREG-VKKLPAIIP 119 Query: 121 MLFYHGSRSPYPWSLCW---LDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 ++ Y+G + S D F D + N + L T++ ++E + V L Sbjct: 120 IVTYNGLEKDWDVSQEIISEFDIFKDDIFKYAVVNIS-KLDAKTLLQEEEDILSPVVFYL 178 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLN-YILLTGDEARFNEFISELTRR 236 E ++ +L+ + ++ L N+++ + ++ E EL +R Sbjct: 179 EQVRDDT--EELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDELAQR 236 Query: 237 MPQHRERIM----------------------TIAERIHN--DGYIKGEQRILRLLLQNGA 272 + Q R M I +I +G I+G+ + + +++ G Sbjct: 237 VEQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRGF 296 Query: 273 DPEWIQKITGLSAEQMQALRQPL 295 E I ++T L E+++ LR+ L Sbjct: 297 SDEDIAELTELDIEKVKELRKEL 319 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. 
Identities = 69/319 (21%), Positives = 121/319 (37%), Gaps = 43/319 (13%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 + HD +K + P+ RD + +P D D +L+ S+V E DI+ Sbjct: 1 MANTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIV 60 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPML 122 W VK Y+Y++IE QS D +MA R+M Y + Q I+ + LP V+P++ Sbjct: 61 WRVKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIV 120 Query: 123 FYHGSRSPYPWSLCWLDEFADPTT-ARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 Y+GS+ + + P + + L+D D E+ + + Sbjct: 121 LYNGSQRWSAVTDVFELIPPVPGLVEQFKPRLKYLLIDENAWSDSELASLKNLVAAVFRI 180 Query: 182 KHIRQR----DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 +H DL+ L+D+ L + L+ E + R+ Sbjct: 181 EHPASPAAIGDLLSLLDE---WLAERPDLRRMFALWIRATLMRKAE------YRIVLPRI 231 Query: 238 PQHRERIMTIAERIHN-----------------------DGYIKGEQRILRLLLQN---G 271 +E + +AER+ +G +GE L+ LL+ Sbjct: 232 DDLQELNVMLAERLEEWAQAYKAEGKAEGKAEGKAEGKAEGKAEGEALALQKLLKKRFGA 291 Query: 272 ADPEWIQKITGLSAEQMQA 290 P+ + +I+ S EQ+ A Sbjct: 292 VPPDVLAQISRASLEQIDA 310 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. 
Identities = 71/302 (23%), Positives = 139/302 (46%), Gaps = 20/302 (6%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 + PHD L + A F + LP ++ EL DL++L+L +SFV E+L+ +D Sbjct: 2 TEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQTD 61 Query: 64 ILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 +L+ + + G+ +Y++ EH+S + + +L+ Y + + + +VIP + Sbjct: 62 LLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRNQQRSG--ESFSVVIPFV 119 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL--- 179 FYHG + + + D+F ++ P + + + I +++ + Sbjct: 120 FYHGEKE-WKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVT 178 Query: 180 --IQKHIRQRDLMGLIDQLVVL--LVTECANDSQITALLNYILL---TGDEARFNEFISE 232 + + IR+RDL + L L L+ +S+ A+L +LL + + E Sbjct: 179 LGVVQRIRERDL-EFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRV 237 Query: 233 LT-RRMPQHRERIMTIAERIHNDGYI----KGEQRILRLLLQNGADPEWIQKITGLSAEQ 287 L ++ Q+ E MT AER+ ++G +G+ R +L E + +ITGLS + Sbjct: 238 LAISKLEQYEELTMTTAERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSKQD 297 Query: 288 MQ 289 ++ Sbjct: 298 LK 299 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. 
Identities = 65/320 (20%), Positives = 129/320 (40%), Gaps = 36/320 (11%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + HD +K +HP+ + +E P ++ L D ++LK S +++ D++W Sbjct: 3 TNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVW 62 Query: 67 SVKTR----EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVI 119 SV+ ++Y+++E QS+ D M RLM Y ++ + RQ LP + Sbjct: 63 SVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHYVACFYDHLLKTRETTVRQGLPPIF 122 Query: 120 PMLFYHGSRSPYPWSLCWLDEF-ADPTTARKLYNA--AFPLVDVTVVPDDEIVQHR-RVA 175 PM+ Y+GS+ + D P ++Y + L+D D+E++ R ++ Sbjct: 123 PMVLYNGSQR-WSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTPLS 181 Query: 176 LLELIQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 + ++ + L +D++V ++ + D + +I + L Sbjct: 182 GIFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARL-NLD 240 Query: 235 RRMPQHRERIMTIA--------------ERIHNDGYIKG-------EQRILRLLLQNGA- 272 R +R M + +G +G +++ +R LL G Sbjct: 241 RMSSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGVL 300 Query: 273 DPEWIQKITGLSAEQMQALR 292 + I TGLS +++ LR Sbjct: 301 SNDQIAVATGLSVDEIDKLR 320 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. 
Identities = 59/310 (19%), Positives = 121/310 (39%), Gaps = 32/310 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +K +HP+ D + L L CDL +L+ + S+V + LR DI+W + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--LPLVIPMLFYHGSR 128 + +Y++IE QS+ D M R+M Y + Q I P +P +IP++ Y+G Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGE- 123 Query: 129 SPYPWSLCWLDEFADP-TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 P+ + P +R + + + L+D + +++ R +A + Sbjct: 124 IPWKVPHDIRETIQMPKPVSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSSGP 183 Query: 188 ----DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE-------FISELTRR 236 +L +++ + + + L D+ + + + Sbjct: 184 LELFELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAERVNKW 243 Query: 237 MPQHR---------------ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT 281 + Q++ + ++ +G ++G IL+ + + G I IT Sbjct: 244 IAQYKAEGRKEGKEEGKKEGLLEGRVEGKL--EGKLEGMATILKRMKEKGMSVTEIATIT 301 Query: 282 GLSAEQMQAL 291 GL +++Q L Sbjct: 302 GLPEDEIQHL 311 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. 
Identities = 57/262 (21%), Positives = 122/262 (46%), Gaps = 12/262 (4%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M+ HD FK+F + + RDF++ +LP+++++ DL ++++ ++ E+ + Sbjct: 1 MSKK-IPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEF 59 Query: 61 HSDILWSVKTREG--DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPL 117 +SD++ V + + +Y + EH+S+ + + Y + R + K Q LP+ Sbjct: 60 YSDVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPI 119 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF--PLVDVTVVPDDEIVQHRRVA 175 ++P++ Y+G +S + +S+ + D F P+ K + F L D+ + + + Sbjct: 120 IVPVVIYNGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIME 178 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDS---QITALLNYILLTGDEARFNEFISE 232 + L+ K+I +L I ++ LL ND + ++ Y++ +G A + + E Sbjct: 179 IFHLLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASG--AIPEKRLLE 236 Query: 233 LTRRMPQHRERIMTIAERIHND 254 +R E I A I Sbjct: 237 HAKRFSGGEEMIGLAAREIEER 258 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. 
Identities = 62/334 (18%), Positives = 130/334 (38%), Gaps = 54/334 (16%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FKT + RDF+ LP ++ + D DSL+ + + H D++ + Sbjct: 8 HDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHMDLVVECR 67 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 E Y++IEH+S D + +++RY +A+ R+ + +K PL V+P++F+ G R Sbjct: 68 ISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRNRQDNK--PLVPVLPLVFHQGGR- 124 Query: 130 PYPWSLCWLDEFADPT-TARKLYNAAFPLVDVTVVPDDEIVQ---HRRVALLELIQKHIR 185 P+ + + + F P + A L D++ V I + H ++ + K+ Sbjct: 125 PWTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLTLLKYAF 184 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 + ++ L +++ + +LNY + + + ++R E+IM Sbjct: 185 SGSVEDVLRALKET--GGSFDETFLFGVLNYAIRAFEVKDPV-VVDAISRSF--GGEKIM 239 Query: 246 -TIAERIHNDGY----------------------------------------IKGEQRIL 264 +I + +G +G+++ + Sbjct: 240 PSIIDEWVEEGLKEGLKKGREEGREEGREEGKEEGRKEGREEGKEEGRKEGQKEGQRKTI 299 Query: 265 RLLLQNGA-DPEWIQKITGLSAEQMQALRQPLPE 297 LL G I + + ++ +R+ L + Sbjct: 300 EKLLAKGVLSVSEIASALDVDLQWVEQIRKDLEK 333 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 166 bits (421), Expect = 8e-40, Method: Composition-based stats. 
Identities = 62/326 (19%), Positives = 119/326 (36%), Gaps = 41/326 (12%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 F S+ D+L+K HP+ RD + L D +++ + +AS+ + H D Sbjct: 37 FFMSSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDD 96 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLVIP 120 ++W + Y+Y+++E Q+R D MA R+ Y + Q + K LP V+P Sbjct: 97 VVWRARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLP 156 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPT-TARKLYNAAFPLVD-----------------VTV 162 ++ YHG + P+ R + + L+D + Sbjct: 157 VVLYHGRGPWRAATALASLMLPAPSGLERYQPSQRYLLIDQHHGTARADVVSLLFRLLDA 216 Query: 163 VPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAN---------DSQITAL 213 D ++ + L+L+ + IR RD+ + D L + + + T Sbjct: 217 ATDLQLRE-----ALDLLAERIRARDMDPVRDSLTRWIQLTLQDAAVETSMDLEEAFTMK 271 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 + + F L + + I+ ++ +G +G L G + Sbjct: 272 MRRKFSYDEMFDPGMFERPLAKA---REKAIVEGLQQGREEGLERGRVEGLERGRVEGLE 328 Query: 274 --PEWIQKITGLSAEQMQALRQPLPE 297 E K GL + L++ L + Sbjct: 329 RGREEGLKA-GLQEGLQEGLKEGLQQ 353 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 166 bits (421), Expect = 8e-40, Method: Composition-based stats. 
Identities = 54/225 (24%), Positives = 100/225 (44%), Gaps = 7/225 (3%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDA +K +HP+ RD ++ + + + D +L+ S S+V + LR DI+W ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLFYHG 126 +EG YIY+++E QS D +MA R++ Y + Q I+ Q LP V P++ Y+G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 127 SRSPYPWS-LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 + + L + R + + LVD D+ + + + ++ R Sbjct: 124 GPRWRAATEVGDLITPLEGGLERYRPSLRYLLVDEGDYQDEALAPLKNLVASLFRLENSR 183 Query: 186 QR-DLMGLIDQLVVLLVTECAN--DSQITALLNYILLTGDEARFN 227 +L+ ++ L+ L + + T L +LL Sbjct: 184 TPEELLQVLRNLLQWLQSPAQKGLERDFTLWLKRVLLPARLPGVE 228 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 72/329 (21%), Positives = 128/329 (38%), Gaps = 43/329 (13%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FK+ L PD ++ LP ++ D SL V E L + D+ +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 + + I++++EH+S D F++ RY + R ++ QP PL +P+LFYHG Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEG-LQPRPL-LPILFYHGVVP 124 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDL 189 S + PL+D+ V D+EI H V LE + + + + Sbjct: 125 WTLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHH--VDDLEAVLALLSLKHI 182 Query: 190 MGLIDQLVVLLVTECANDSQITALL----NYI---LLTGDEARFNEFISELTRRMPQHRE 242 ++ LV LL+ E A+L NY+ + + + + R + ++ Sbjct: 183 FDGVETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREVGMAQD 242 Query: 243 RIMTIAERIHNDGYIK----------------------------GEQRILRLLL-QNGAD 273 + T + G K E++++R LL + Sbjct: 243 IVETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGFQQGARLKEEQVIRTLLKKKTFS 302 
Query: 274 PEWIQKITGLSAEQMQALRQPLPERERYS 302 E I + G+ ++ +R+ ER S Sbjct: 303 FEEIASLVGV---ELSRVREVAESPERGS 328 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 163 bits (414), Expect = 6e-39, Method: Composition-based stats. Identities = 51/251 (20%), Positives = 108/251 (43%), Gaps = 17/251 (6%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK + P + ++I +L + DL+S++L ++ +K+ D+L+ Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFY-SELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYE 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K ++ ++ EH+S D ++ +L+ Y+ + + E + + +I ++ YHG Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWE---ETGEYKEYLPIINIVLYHGK 120 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL----ELIQKH 183 R + L + R + L+D++ V D+E++ V L KH Sbjct: 121 RKWNIPTT--LPKTNSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKH 178 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 I + DL + ++ V E D + +L+YI + + + E+ + Sbjct: 179 IFE-DLKKY--KHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEIL----GGEKE 231 Query: 244 IMTIAERIHND 254 + T+ E+ + Sbjct: 232 MTTLTEKWKME 242 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 63/270 (23%), Positives = 114/270 (42%), Gaps = 13/270 (4%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FK+ L P ++ LP L L SL + V + L A D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 E I+V++EH+S D F+++ Y + R + + R P+P V P+LFYHG R Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLR-DKKESRSPIPFV-PVLFYHGLR- 124 Query: 130 PYPWS-LCWLDEFADPTTARKLYNAAFPL--VDVTVVPDDEIVQHRR---VALLELIQKH 183 PW+ L E DP + + + L +D+ + D +I + R + L+ KH Sbjct: 125 --PWNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLLLKH 182 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 I + G + + + + I + ++Y++ E +S L + + Sbjct: 183 IFEG-ARGSLRAFLQETNGKNLSRDIIISGMSYVIGVHHLESTAE-LSRLVNTILKEEGM 240 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 + E + +G Q+ ++ +Q G + Sbjct: 241 SQNVVELWMEELIQQGVQKGIQQGVQLGIE 270 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. 
Identities = 70/323 (21%), Positives = 132/323 (40%), Gaps = 51/323 (15%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD +F+ + P AR F+ LP +L D +L + S + + L D+++ + Sbjct: 36 HDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRIN 95 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-------------------QRHIEHD 110 + + YV++EHQ++ + HMA R+M + + R Sbjct: 96 VNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKADRQSRRR 155 Query: 111 KRQPLPLVIPMLFYHGSRSPYPWSLCWL--DEFADPTTARK-----LYNAAFPLVDVTVV 163 + PLVI M+ + G P W W D P K + + F +V++ + Sbjct: 156 ETDKFPLVISMVLHPG---PRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVELAGL 212 Query: 164 PDDEIVQ-HRRVALLELIQKHIRQRDLMGLID-QLVVLLVTECAND------SQITALLN 215 P +++ H A+L +Q + +GLID + + L+ E +D + L Sbjct: 213 PLEKLADGHLARAILGALQG-----NRLGLIDIRKIKRLLDEMFSDPDRASVGAVVKQLW 267 Query: 216 YILLTGDEARFNEFISELTRRMP-QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD- 273 + L++ + + + + +P ++R IM ER+ G +K + + L+ D Sbjct: 268 HYLISSSDLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAGALKAQHNAVIEALEVRFDR 327 Query: 274 -----PEWIQKITGLSAEQMQAL 291 E IQ I E+++ L Sbjct: 328 VPEGLREAIQGIN--DPERLRNL 348 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 161 bits (408), Expect = 3e-38, Method: Composition-based stats. 
Identities = 59/274 (21%), Positives = 111/274 (40%), Gaps = 15/274 (5%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD+ +K F ++P+ + +P D E D +L+ S S+V + LR H DI+W + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 69 KTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLFY 124 ++G Y+ +V+E QS D MA R + Y+ ++ ++ K + LP V P++ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNA---AFPLVDVTVVPDDEIVQHRRVALLELIQ 181 +G ++ + FA + K Y F L + V D+ VA L ++ Sbjct: 127 NGGKA-WKAPQEVATLFAPMPDSLKHYCPQHRHFLLDESRVSGDELDKSQGLVAQLLKLE 185 Query: 182 KHIRQRDLMGLIDQLVVLLVTECA---NDSQITALLNYILLTGDEARFNEFISELTRRMP 238 + + ++ +L+ L + L +L +L Sbjct: 186 RAQEPEQVRQIVKELITRLHEPKYLLLRRAFTVWLSRVVLKRSGITEEIPEFQDLREVDA 245 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 ER A + ++ +G+ + + G Sbjct: 246 MLEER----AAQWKDEYIKQGKTEGISIGEARGI 275 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats. 
Identities = 70/308 (22%), Positives = 127/308 (41%), Gaps = 23/308 (7%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RAL 60 ++TPHD+ FK + + L +L SL+ + E L R+ Sbjct: 17 KTSISTTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARST 76 Query: 61 HSDILWSV-----KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL 115 SD+ S+ + GD I + EH+S H+ L+ A++ R + R+P Sbjct: 77 RSDLSASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREG-RKPC 135 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFAD-PTTARKLYNAAFPLVDVTVVPDDEIVQ---H 171 P VIP++ YHG R+P+ + P A +L + L+D++ D+ + + H Sbjct: 136 P-VIPVVLYHG-RAPWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAH 193 Query: 172 RRVALLELIQKHIRQR--DLMGLIDQLVVLLVTECANDSQIT-ALLNYILLTGDEARFNE 228 + + KHI + ++G +L+ L +I L+YI E Sbjct: 194 PEPLVSLSVMKHIFEPPESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQE 253 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQR-----ILRLLLQNGADPEWIQKITGL 283 + T + + E++ T+ + I +G +G Q I RLL + P+ I I + Sbjct: 254 IRTIFTTFLAE--EKMTTVLDLIKEEGIQEGIQMGRDEAITRLLQHSSLSPQQIASILNV 311 Query: 284 SAEQMQAL 291 ++ +L Sbjct: 312 DLSRVLSL 319 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 156 bits (395), Expect = 8e-37, Method: Composition-based stats. 
Identities = 61/270 (22%), Positives = 110/270 (40%), Gaps = 14/270 (5%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RA 59 M T PHD FK + + + LP+D+ D DSL V E L R+ Sbjct: 1 MAKNLT--PHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRS 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 +D+++SV E +G + V++EH+S D + F++++ + +++ R+PLP ++ Sbjct: 59 TRADLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREG-REPLP-IL 116 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI--VQHRRVALL 177 P+LFYHG S AR L + +D+ ++ D I +Q+ Sbjct: 117 PILFYHGQGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAA 176 Query: 178 ELIQKHIRQ--RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 L KH+ + R L+ + +I + + +E + + Sbjct: 177 LLSMKHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMA 236 Query: 236 RMPQHRERIMTIAER----IHNDGYIKGEQ 261 + + + T ER +G KG Q Sbjct: 237 AITE-EAGMETTTERLKKIWIEEGIQKGVQ 265 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. 
Identities = 58/264 (21%), Positives = 113/264 (42%), Gaps = 19/264 (7%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 +PHD FK + F+EI LP+ L E +SLKL +K + D+ + Sbjct: 6 SPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFLDLAFD 64 Query: 68 VKTREG-----DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 K ++ DG IY+V EH+S D H ++ Y +M+ + +P VIP++ Sbjct: 65 CKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEE--DERLSRPYRPVIPIV 122 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE---- 178 FYHG +S + + L++ ++ L DV+ V + +++ + Sbjct: 123 FYHGEKSWNIPTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLIEKIYLNACLISGV 182 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 K+I + + + ++ L+ + D + +++ D + + E+ Sbjct: 183 FTLKNIFKD--LKYLRPVLEKLILDDVKDCLYIIIDYTVIVKKDLETIEKILEEI----- 235 Query: 239 QHRERIMTIAERIHNDGYIKGEQR 262 E++MT+ E+ +G KG + Sbjct: 236 GGEEKMMTLTEKWKMEGLKKGMEE 259 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. 
Identities = 49/283 (17%), Positives = 112/283 (39%), Gaps = 15/283 (5%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + HD +K ++ +T ++ + ++L L S+V L SDI++ Sbjct: 20 NNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESDIVY 79 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PLPLVI 119 + + + + Y+++E QS D M RL+ Y + + + +++ + LP V+ Sbjct: 80 KARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLPAVV 139 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P++ Y+G ++ + + + +DV DE+ +++ +A Sbjct: 140 PIVVYNGEKNWTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIASAIF 199 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 + R + ++L +++ + A L + L+ + N + Sbjct: 200 LLDQSISR--IEFYNRLKDIVIEFNKLTVEEKAQLKHWLVNVNSEENNYKENIEKIFSSN 257 Query: 240 HRE-RIMT-----IAERIHNDGYIKGEQRILRLLLQNGADPEW 276 RE IMT E++ +G I+G+ LL + ++ Sbjct: 258 KREVEIMTSNISKGLEKLKEEGKIEGKAEGKAELLIKQLNKKF 300 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 153 bits (388), Expect = 5e-36, Method: Composition-based stats. 
Identities = 60/267 (22%), Positives = 105/267 (39%), Gaps = 20/267 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 DAL+ +HP A + +P+ + D ++ +A F D + D++W + T Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 71 REG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLPLVIPMLFYHG 126 +G D ++++ E QS D MA R Y + Q I K + LP V+ ++ Y+G Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSGDRLPPVLTLVLYNG 124 Query: 127 SRSPYPWSLCWLDEFADP---TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 + + + A P A + L+D+ VP++E+ +A L +H Sbjct: 125 EQR-WHAPTDTIPLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAALLFRLEH 183 Query: 184 IRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 R+ +L GLID +V D L + + + + Sbjct: 184 PREPEELAGLIDDVVGWFRRHPGYDE-----LRRLFTELVRQAIEGYETSVAVPGDMMEM 238 Query: 243 RIM------TIAERIHNDGYIKGEQRI 263 R M T +R +G +GE R Sbjct: 239 RSMLANLGETWKKRWLAEGIAEGEARG 265 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 51/135 (37%), Positives = 86/135 (63%), Gaps = 1/135 (0%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 PHD F+T ++ A++F E HLP ++ + DL+SL+L+ +SF+DE L+A +D Sbjct: 2 KKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMAD 61 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 +L+SVK GY Y+++EHQ D M +RL+RY + ++ H++ PLP+V+P++F Sbjct: 62 VLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLVF 121 Query: 124 YHGSRSPYPWSLCWL 138 Y+G + YP+ +L Sbjct: 122 YNGKKR-YPFQRIFL 135 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. 
Identities = 59/213 (27%), Positives = 104/213 (48%), Gaps = 12/213 (5%) Query: 90 HMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF-ADPTTAR 148 F++ RY A+M +H++ LP+V+ ML+Y G +PYP++ D F + T A Sbjct: 1 MTPFKIARYVHAIMDQHLKQG-HAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAE 59 Query: 149 KLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI-RQRDLMGLIDQLVVLLVTECAND 207 K+Y +P++D+T + DD I H +A+L+ QK+ RD+ I+ ++ L Sbjct: 60 KIYLRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTR 119 Query: 208 SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYI--------KG 259 Q LL Y D + +L + + + E IM++A +I G + Sbjct: 120 EQCQTLLYYTFRETDTDNVKMLLEQL-QTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEE 178 Query: 260 EQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + +I + +L G D +I+ +TGLS + + L Sbjct: 179 DLKIAKRMLAKGTDRGYIKDVTGLSDQDLLNLE 211 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 74/184 (40%), Positives = 106/184 (57%), Gaps = 26/184 (14%) Query: 135 LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLID 194 +CWL FADP AR++Y FPL+D+T PDDEI++HRRVA+LEL+QKHIRQRDLM L + Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 195 QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ--HRERIMTIAERIH 252 QLV LL + Q+ LL+Y+L G+ A F+ L + +P+ H+E +M IA+ + Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 253 NDGYIKGE------------------------QRILRLLLQNGADPEWIQKITGLSAEQM 288 G+ +G +RI R +L NG D + K+TGL+ E + Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 289 QALR 292 L+ Sbjct: 181 ARLQ 184 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. 
Identities = 52/321 (16%), Positives = 121/321 (37%), Gaps = 33/321 (10%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 + HD+ FK +P + + S++++ +++ ++ + Sbjct: 6 KEKLPAKEHDSTFKLLFENPKDIYLLLSKIINYSWANEIRESSIEIKKTNYITKEFSQVE 65 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 +D++ + ++ D Y Y++IE+QS M RL+RY +++ I + + LP +IP+ Sbjct: 66 ADVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNG-VEKLPAIIPI 124 Query: 122 LFYHGSRSPYPWSLCW---LDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 + Y+G + S D F + K+ + A +D+ +E V + LE Sbjct: 125 VVYNGLDRRWEVSTDIIGAFDIFKNDIFKYKVVDIA--QIDIKNYLQEEDVLTPIIFYLE 182 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECAN--DSQITALLNYILLTGDEARFNEFISELTRR 236 ++ +L+ + ++ L N + + + I + E+ + + Sbjct: 183 QVRND--SNELVRRLQEIEQSLKKLSFNNIERFLLWSQHVIRPRLGNEQKKEYDKLVMKV 240 Query: 237 MPQHRERIM----TIAERIHNDGYIKG-------------------EQRILRLLLQNGAD 273 + E + +A + + + ++Q G Sbjct: 241 RQEGVELMGEFVSNVARLLDETKTKEFLAGVQQGIQQGIQQGIQQERIETAKRMIQLGIS 300 Query: 274 PEWIQKITGLSAEQMQALRQP 294 E I K T LS E+++ + + Sbjct: 301 YEVISKATNLSIEEIEKIARE 321 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. 
Identities = 43/299 (14%), Positives = 114/299 (38%), Gaps = 23/299 (7%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 HD +K ++ + D ++ + + D+++L + S++ L S Sbjct: 4 KKEMHHIHDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELES 63 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PL 115 DI++ + Y+++E QS D M RL Y + + +++ K+ L Sbjct: 64 DIVYKATIDGREVIFYILLEFQSYVDYSMPIRLFLYMSEIWREVLKNTKQAEVKSKEFRL 123 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV- 174 P ++P++ Y+G + + + L+D+ +E+++ + + Sbjct: 124 PAIVPLVLYNGEYKWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKNLV 183 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 + + L+ + + + + + + + + Q L +++ +T + ++ Sbjct: 184 SAVFLLDQKVDIEEFISRVKDIA--IDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEKIE 241 Query: 235 RRMPQHRERI--------MTIAE---RIHNDGYIKGEQRILRLLLQNGA--DPEWIQKI 280 + +E + TI E + +G KG + + ++ D E + K+ Sbjct: 242 DILIAKKEEVNRMTSNISKTIKETFAKTREEGMEKGIEEGIEKGIEKARQKDVEIVLKL 300 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. 
Identities = 63/317 (19%), Positives = 129/317 (40%), Gaps = 32/317 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV-K 69 D++FK DF++ LPK+ + LK E + + SDIL+ + K Sbjct: 7 DSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDILYKIEK 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------PLPLVIPMLF 123 D YIY+++EHQS+ D MAFR++ Y + + ++++ K++ LP++I M+F Sbjct: 67 RNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVIIGMVF 126 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 Y G L A + L++++ + ++ I+ ++ + L+ Sbjct: 127 YDGKAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGVILLTDK 186 Query: 184 IRQR-----DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR--- 235 R +L+ +I++ ++L ++E + +I L G + E Sbjct: 187 PNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKERFEELKE 246 Query: 236 -RMPQHRERIMTIAERIHNDGYIKGE----------------QRILRLLLQNGADPEWIQ 278 +P+ + IA+R ++G+ IL D + Sbjct: 247 MEVPKMFNTLEEIAKRDREKAKLEGKAEGKVEGKLEERRELIIEILNQRFGEDFDKSLEE 306 Query: 279 KITGLSAEQMQALRQPL 295 KI + E + +++ + Sbjct: 307 KIRNANEETINQIKKNI 323 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 72/168 (42%), Positives = 102/168 (60%), Gaps = 20/168 (11%) Query: 144 PTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTE 203 P TA+ LY F L+DVTV+PDD++VQHRRVALLEL+QKHIRQRDL + + L +++ Sbjct: 23 PETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSSITESLAAVVMLG 82 Query: 204 CANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE--- 260 N Q+ L +Y+L G+ A F+ L RR+PQ+ E +M+IA+++ +G +G Sbjct: 83 YTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKLKQEGRQEGRLEG 142 Query: 261 -----------------QRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 RI +LQNG D E +QKITGLSA+++Q L Sbjct: 143 REEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQPL 190 Score = 41.1 bits (95), Expect = 0.056, Method: Composition-based stats. 
Identities = 16/27 (59%), Positives = 19/27 (70%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDF 27 M TSTPHDA+FK FL HP+TA+ Sbjct: 3 MKKRMTSTPHDAVFKRFLRHPETAKTL 29 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 45/268 (16%), Positives = 109/268 (40%), Gaps = 9/268 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 HD +K L+ + + + ++ D ++ SFV + + Sbjct: 7 KEAIHNQHDKGYKFLLSSKRVFIELLRSFVKQEWVNDIDEANVVKVDKSFVLQDFADKEA 66 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PL 115 D+++ VK ++ + Y+++E QS D M +RL+ Y + + + ++ R+ L Sbjct: 67 DLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDTPRKESRRKDFKL 126 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRV 174 P+++P++ Y+G + + T + + L+DV +E+++ + Sbjct: 127 PVIVPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFKYILIDVNRYTKEELLKLENLI 186 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDS-QITALLNYILLTGDEARFNEFISEL 233 A + L+++ + ++M + +L +L ++ A ILL E I + Sbjct: 187 ASVFLLEQKVEFEEIMKRLKELSEILNNLDKDEILLFKAWFKKILLARLPEEERENIERI 246 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQ 261 + E I + + I + + ++ Sbjct: 247 IDENKEVEEMISNLEKTILQEMKEREKR 274 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. 
Identities = 53/217 (24%), Positives = 87/217 (40%), Gaps = 15/217 (6%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RALHS 62 TT TPHD+ FK + L D SL S + E L + S Sbjct: 2 TTTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFRS 61 Query: 63 DILWSV-----KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 D++ S+ ++EH+S + F+L A+ R + K PLP Sbjct: 62 DLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGK-PPLP- 119 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFAD-PTTARKLYNAAFPLVDVTVVPDDEIVQH---RR 173 V+P+L +HG +SP+ L + P A + + A ++D+T + DDEI + Sbjct: 120 VVPILIHHG-KSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEIRRKIPDPE 178 Query: 174 VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI 210 + KHI D + +++ L+ E + I Sbjct: 179 PQMSLAAMKHIH--DPLPAFLRVMADLLKEIEENRDI 213 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 136 bits (343), Expect = 9e-31, Method: Composition-based stats. Identities = 58/267 (21%), Positives = 106/267 (39%), Gaps = 31/267 (11%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 +D L +T + A D LP L + DLD+L L S ++V ++LR ++D+L+SV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSR 128 +IY++++HQS D RL R +++ +R+ IE LP+++P++F+H + Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHHEAT 143 Query: 129 SPYPW--------------SLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV 174 + + L F D+ Sbjct: 144 GWSDAVGLNGSLALGADVRTALSANRRDFRRLRYLLLVLCFQF-------DEASRAQNLN 196 Query: 175 ALLELIQK----HIRQRDL---MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFN 227 L L+ + +RDL + + ++ +V + ++ +IL E + Sbjct: 197 EALGLLMRTFGVARPKRDLVASLKGWEDVIREVVATQRGREMLATVVQFIL-ENSETDPD 255 Query: 228 EFISELT-RRMPQHRERIMTIAERIHN 253 E S L R MT A+R+ Sbjct: 256 ELKSFLEFTAGEPARTAFMTGADRLTQ 282 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 135 bits (341), Expect 
= 1e-30, Method: Composition-based stats. Identities = 61/125 (48%), Positives = 83/125 (66%) Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 +H +ALLELIQKHIRQRDLMGL++Q+ LL + AND QI L NYIL TGD RFN+ Sbjct: 12 RRHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFND 71 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 FI + R P+H+E +MTIAER+ +G I +++L++G I + TG+S E++ Sbjct: 72 FIDGVAERSPKHKESLMTIAERLRQEGEQSKALHIAKIMLESGVPLADIMRFTGVSEEEL 131 Query: 289 QALRQ 293 A Q Sbjct: 132 AAASQ 136 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 58/307 (18%), Positives = 130/307 (42%), Gaps = 32/307 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK LT+ + + L + L L+ +++ + ++ + RA SD+++ +K Sbjct: 9 DEGFKKVLTNRTNIKWLLTELL-EVLPIQIGLEDIEVIATESINRQWRARRSDMVYKIKY 67 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 + D YI V++E QS ++ + R++ Y + + ++ + LP+VIP++ Y G Sbjct: 68 K--DAYICVLLEFQSSKEELIHLRVLEYMLLI---QKKYTTKNLLPVVIPVVLYTGEEKW 122 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV-ALLELIQK-HIRQRD 188 P + + + + + VDV ++ D+++++ + A + K Sbjct: 123 TPATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDKVSDNPEK 182 Query: 189 LMGLIDQLVVLLVTECANDSQITALLNYILLTG---DEARFNEFISELTRRMPQHRERIM 245 + ++ L + + L +++L G + +EF+ + E + Sbjct: 183 VAERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRLGVNEMFL 242 Query: 246 TIAERIHNDGYIKGEQ--------------------RILRLLLQNGADPEWIQKITGLSA 285 AE+I G K + + + +++ GA+ +I K+TGL Sbjct: 243 NTAEKIR-KGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMIEEGAEDSFIAKVTGLDM 301 Query: 286 EQMQALR 292 E+++ LR Sbjct: 302 ERIRQLR 308 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 129 bits (324), Expect = 1e-28, Method: 
Composition-based stats. Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 5/136 (3%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 PHD FK TAR F+E +LP+++R L DL ++ + S++D++L+ SD+L Sbjct: 4 IHNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFSDLL 63 Query: 66 WSVKTREGDGYIYVVIEHQSRE----DIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIP 120 + VK RE +GY Y + EH+ R M+ RL S+ QR + P I Sbjct: 64 FQVKIRENEGYFYFLFEHKVRPYADRRKKMSTRLADDSVLSKQREMFMQSVNHGKPPYIS 123 Query: 121 MLFYHGSRSPYPWSLC 136 G+R+ C Sbjct: 124 RFIRKGNRTGSAACRC 139 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 126 bits (317), Expect = 8e-28, Method: Composition-based stats. Identities = 67/150 (44%), Positives = 97/150 (64%), Gaps = 24/150 (16%) Query: 163 VPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGD 222 +PDD+I+QHRR+ALLELIQKHIR+RDLMGL+++L +LLV AND+Q+ AL NY++ G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 223 EARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ--------------------- 261 F EF+ E+ R+PQH++++MTIAER+ +G++ G Q Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 262 ---RILRLLLQNGADPEWIQKITGLSAEQM 288 RI + +G DP I +ITGL+AE + Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDL 150 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. 
Identities = 52/318 (16%), Positives = 127/318 (39%), Gaps = 36/318 (11%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 T +D +K ++ + F++ L ++ + + +++ + +++K + SDI+ Sbjct: 3 TYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDIV 62 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + +K + D + + IE QSRED + RL Y + +++ +P+V+P++ Y+ Sbjct: 63 YKIKYK--DSFFCLTIEFQSREDKKILHRLYEYMHLI---QLKNKVNGEIPVVVPIVLYN 117 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G P N +D+ +P+++++ V + + Sbjct: 118 GISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLISAANVLAI-AVYIDQV 176 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLT-----------------GDEARFNE 228 + ++++++ L N Q L +++ +E Sbjct: 177 SNNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGLEVDE 236 Query: 229 FISELTRRMPQH---------RERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPE 275 S ++ Q +E + ++ G +G +R I + +L++ E Sbjct: 237 LFSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKDNQPIE 296 Query: 276 WIQKITGLSAEQMQALRQ 293 I K TGL+ E+++ L++ Sbjct: 297 LISKYTGLTPEEIKKLKK 314 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 120 bits (301), Expect = 6e-26, Method: Composition-based stats. 
Identities = 46/258 (17%), Positives = 106/258 (41%), Gaps = 15/258 (5%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR-ALHSDIL 65 PHD K L + A+ ++ HLP+++ + ++L++ + +D K + +DI+ Sbjct: 14 QNPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADII 73 Query: 66 WSVKTR-EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +S+KT D IYV+IEH+S +D H+ +L++ AV + I K + P++ Y Sbjct: 74 YSLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEGKIT---PIYPIVIY 130 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH-RRVALLELIQKH 183 S + + +++ + + I + + + L + + Sbjct: 131 ASKEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMTLRI 190 Query: 184 IRQR---DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 I++ +++ LI + L + I + + D+ + + +L Sbjct: 191 IQEPTPENILNLIKSIETLYNYKPKAVYVIALSYIFTIAKKDKNTYIKVKKQL------E 244 Query: 241 RERIMTIAERIHNDGYIK 258 + ++ + +G K Sbjct: 245 GGNMGSLLDMFIEEGLEK 262 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. 
Identities = 32/226 (14%), Positives = 86/226 (38%), Gaps = 17/226 (7%) Query: 45 LKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ 104 + L + S++ SDI++ D + YV++E QS D M RL+ Y + + + Sbjct: 1 MILVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWR 60 Query: 105 RHIEHDKRQ-------PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPL 157 + + + + LP ++P++ Y+G + + N + Sbjct: 61 DILRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEF 120 Query: 158 VDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 +D+ +E+ + ++ + + + + ++L +++ + L + Sbjct: 121 IDINSYEKEELYNKQNISSAIFLL--DQNINRIEFYNRLKDIIIGFNNLSIEEKMHLKHW 178 Query: 218 LLTGDEARFNEFISELTRRMPQHRERIMTIA-------ERIHNDGY 256 L+ + N F + + ++ ++ + E++ DG Sbjct: 179 LVNINTEE-NNFKDNIEKIFNADKQEVLNMTSNISKGLEKLKEDGK 223 >UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostridia RepID=B9MPV5_ANATD Length = 331 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 54/330 (16%), Positives = 122/330 (36%), Gaps = 47/330 (14%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 S +D FK + F+ +P+ + +++ + ++ + +A SD+++ Sbjct: 4 SRSYDVGFKKLFSDKINVCWFITEIIPEPRLKNYTQSDIEIVATESINAQWKARRSDMVY 63 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 + IY+++E QSR + M R+ Y + QR + DKR P+ + ++ Y+G Sbjct: 64 RLPYSSSW--IYLLVEFQSRPNKQMHCRIYEYVFLI-QRKYQIDKRLPVVVP--VVLYNG 118 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRVALLELIQKHIR 185 P + + + + +DV +P+D+++ + +A + + Sbjct: 119 VEKWQPVTQFADNVEYAEDFPEYVQRLNYIFIDVRDIPEDKLLNGNNVLAAALYVDQVAT 178 Query: 186 QRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 D ++ + +L + ++ L + +L + E + E + Sbjct: 179 NPDSVVERLLELGKNIRIPDEQREELAEWLYHAVLKSYKIPREEINELFAKSKILGVEEM 238 Query: 245 --------------------------------------MTIAERIHN--DGYIKGEQRIL 264 I +I +G ++ + I Sbjct: 239 FQSTAMKIKKGLAEEKKKIRLESKIEGKIEGKIEGKIEGKIEGKIEGKIEGRMEAQLEIA 298 Query: 265 RLLLQNGADPEWIQKITGLSAEQMQALRQP 294 R L+ 
GA+ +I K+TGL E+++ LR Sbjct: 299 RNLILEGAEDSFIAKVTGLDIEKVKELRNQ 328 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 110 bits (275), Expect = 6e-23, Method: Composition-based stats. Identities = 32/104 (30%), Positives = 60/104 (57%), Gaps = 3/104 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M PH+ LF + D AR F++ H+ +++++ DLD+L+LE ++VDEKL+ Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 61 HSDILWSVKT---REGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 +SD+++SV+ + IY++ EH+S D ++++Y Sbjct: 61 YSDLVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKYMAL 104 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 47/326 (14%), Positives = 115/326 (35%), Gaps = 32/326 (9%) Query: 2 TNFTTSTPH--DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 T+ H D K P F++ + L + + ++ S+ F+ + Sbjct: 8 TSNGVHNTHTKDNAAKIVFGDPVLCAQFLKGYTDIPLFKEIKPEDIENVSSHFLPLFQES 67 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP----- 114 SD + + + Y+ +IEHQS D M+FR++RY + + + ++ Sbjct: 68 RDSDTVNKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAAQQEKLHKGTTK 127 Query: 115 -----LPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIV 169 P ++P+++Y GS + F + + + +V + +++ Sbjct: 128 SKDFLYPPILPIVYYEGSSTWSAPLNFKNRVFLSDVFGDYIPSFNYLVVPLNKYSKQDLI 187 Query: 170 QHR---RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARF 226 + + L + + + I + +TE D + + I + + Sbjct: 188 EKNDELSLIFLINQLQSSSEFHALKDIPKKYTEHLTEDTPDYLLKIIGKVIAVLLHKLNV 247 Query: 227 -NEFISELTRRMPQHRERIM----------TIAERIHNDGYIKGEQRILRLLLQNGADPE 275 +E + E+T ++ + + +M +G ++G R G Sbjct: 248 PDEEVYEVTDQITRRKFSMMFDNFQAYDVQETRRVSREEGRLEGRIEGERAGRIEG---- 303 Query: 276 WIQKITGLSAEQMQALRQPLPERERY 301 ++ + E++ ++Q + E Sbjct: 304 --ERAGRIEGERLHLIKQVIKRIELQ 327 >UniRef50_B0K519 Putative 
uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 46/248 (18%), Positives = 110/248 (44%), Gaps = 17/248 (6%) Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE-------HDKRQPLP 116 +++ VK ++ + + Y+++E QS+ D M +RL+ Y + V + ++ K LP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFAD-PTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRV 174 +IP++ Y+G + SL + + + + + L+DV ++E++Q + Sbjct: 61 AIIPIVLYNGVNR-WTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNLI 119 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGD---EARFNEFIS 231 + + L+ + I + +L +L +L ++ + L N++ + I Sbjct: 120 SSIFLLDRKIDKEELTEKWGKLADVLKDI--SEEEFIILRNWLFSVVSRFLPEDKEKEIK 177 Query: 232 ELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGLSAEQMQA 290 E+ + E I + + + + K + L+ L+ G + I K+ G +++ Sbjct: 178 EILVQSEGVEEMISNLERSLREE-FRKTRREGLKEGLKKGKLEGLKIGKMEGRMEGKIEG 236 Query: 291 LRQPLPER 298 +R + E+ Sbjct: 237 IRMVVFEQ 244 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats. 
Identities = 57/318 (17%), Positives = 107/318 (33%), Gaps = 48/318 (15%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA--LHSDILW 66 PHD FK L +F+ + P +L D + + + + D+L Sbjct: 27 PHDEAFKKLLH--TFFAEFIALFFP-ELESQLDFSQTRFLMQEQLVDVVGEEARTLDLLL 83 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 K D +I + +E QS R+ Y + +RH + + L+IP+ + Sbjct: 84 ETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERHRKEHQ-----LIIPIAIFTS 138 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI-- 184 + S + + F V++ P + L+ K Sbjct: 139 AE-----SKNERNSLNMSILGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYN 193 Query: 185 --RQRDL-MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 +R+L + + L+ L + + + D + E + EL ++ + Sbjct: 194 KGEERELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKES 253 Query: 242 ERIMTI--------------------AERIHNDGYIKGEQR--------ILRLLLQNGAD 273 E IM + E+ G+ KG ++ I R LL G Sbjct: 254 EVIMELMPAWMRQGYEKGLEEGLEKGIEQGIEKGFEKGIEQGTLIERRQIARRLLSKGFT 313 Query: 274 PEWIQKITGLSAEQMQAL 291 E I +T LS E+++ + Sbjct: 314 LEEIADMTQLSIEEIKKI 331 >UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermoanaerobacterales RepID=B0K813_THEP3 Length = 267 Score = 99.6 bits (247), Expect = 1e-19, Method: Composition-based stats. 
Identities = 57/295 (19%), Positives = 110/295 (37%), Gaps = 39/295 (13%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 S +D K ++ A D L L + F + + SD++ Sbjct: 1 MSQEYDITAKNIFSN--LADDIASYFL------GLKFTKLDELNIEFTT--IESRESDMV 50 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + T D I + IE Q+ D M +R++RY+ +M++H LP ++ Y Sbjct: 51 FKCTTENRD--IALHIEFQTYNDSKMPYRMLRYATEIMEKH------NLLP--YQVVVYC 100 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI---QK 182 L + L N + ++DV + ++IV+ + L + K Sbjct: 101 SKNE-----LKMENNLNYHLGEENLLNFRYRIIDVGKIKFEDIVKTKYYDLYTFLPVADK 155 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 RQ++ + + ++ + ++ + ++ + E I ++ + Sbjct: 156 DKRQKEKEAYLRKCAEVIRDMPVDKAKKSYIVTTAEILAGIIYDEEVIEKIFSEVIG--- 212 Query: 243 RIMTIAER------IHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 M+I E I G + I R LL+ G D I +IT LS E+++ L Sbjct: 213 --MSILEESKVYKNILEKGKKEKSIEIARELLKEGMDINKIAQITKLSVEEIKKL 265 >UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostridium kluyveri RepID=B9E303_CLOK1 Length = 304 Score = 97.7 bits (242), Expect = 4e-19, Method: Composition-based stats. 
Identities = 42/252 (16%), Positives = 92/252 (36%), Gaps = 39/252 (15%) Query: 81 IEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PLPLVIPMLFYHGSRSPYPW 133 +E QSR D M RL+ Y + + + +++ + LP +IPM+ Y+G ++ + Sbjct: 30 LEFQSRVDYRMPMRLLFYMVEIWREILKNTSKNDRSKKDFKLPSIIPMVLYNG-KNTWTA 88 Query: 134 SLCWLDEFADPTT-ARKLYNAAFPLVDVTVVPDDEIVQHRRV-ALLELIQKHIRQRDLMG 191 + D + + + + L D+ ++++ + + + L+ K I + DL+ Sbjct: 89 CKNFKDVLSGSKLFGENVIDFRYMLFDIYRYNEEQLEDMANMVSTVFLLDKEISKEDLVK 148 Query: 192 LIDQLVVLLVTECAN-----------------DSQITALLNYILLTGDEARFNEFISELT 234 + +L DS+ + IL + + +S L Sbjct: 149 RLRLTAYVLKKITPEQFDILKAWLKSIIKPRLDSESKIKIEEILEKSSQGEVDSMVSNLG 208 Query: 235 RRMPQ-HRERIMTIAERIHNDGYIKGEQRILRLLLQNG--------ADPEWIQKITGLSA 285 + + RE T E +G +G + + + G ++K T L Sbjct: 209 KTIDNIIREGRETGLEEGRREGRKEGRKEGRKEGRKEGRKEGKSELITKMLVKKFTKLPD 268 Query: 286 E---QMQALRQP 294 ++ +L Sbjct: 269 GYTHKINSLSDE 280 >UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseiflexus sp. RS-1 RepID=A5USQ0_ROSS1 Length = 330 Score = 93.5 bits (231), Expect = 9e-18, Method: Composition-based stats. 
Identities = 48/270 (17%), Positives = 92/270 (34%), Gaps = 24/270 (8%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR--ALHSDILWS 67 HDALFK LT R+F++ + DL D + +D++ Sbjct: 7 HDALFKLVLT--AFFREFID-LVAPDLAAALDPAPPVFLDKESFADLFDPDRREADLVAQ 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF--YH 125 V+ R+ + + +EHQ++ D + R+ RY + R+ + + P+ Y Sbjct: 64 VRLRQHPATLLIHLEHQAQADAALDRRMFRYFARLYDRYDQ--------PIYPIALCSYP 115 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI- 184 R P D R + + +V + + + A + L+ + Sbjct: 116 RPRRPAA------DRHEVRAAQRTVLTFQYQVVQLNRMDWRAYLTTTNPAAMALMARMRV 169 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNY--ILLTGDEARFNEFISELTRRMPQHRE 242 D + + LL +Q + + I L + +E+ R +E Sbjct: 170 APEDRWRVKAACLRLLAGAPLTGAQRRLIGQFVDIYLPLNAREEQALAAEVARLPGAAKE 229 Query: 243 RIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 +M + G +G + LR G Sbjct: 230 VVMELITSWERKGRAEGLREGLREGRAEGL 259 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 92.3 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 50/134 (37%), Positives = 73/134 (54%), Gaps = 29/134 (21%) Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYIL-LTGDEARFNEFISELTRRMPQHRER 243 RQRDL+GL++++ LLVT CAND Q+ AL NY++ G RF FI ++ +P +ER Sbjct: 37 RQRDLLGLVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKER 96 Query: 244 IMTIAERIHN--------------------DGYIKGEQ--------RILRLLLQNGADPE 275 +MT+ ERI +G KG + RI R +L +G D E Sbjct: 97 LMTLIERIRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGLDRE 156 Query: 276 WIQKITGLSAEQMQ 289 +Q+ TGL+AE++Q Sbjct: 157 TVQRFTGLTAEELQ 170 Score = 61.5 bits (148), Expect = 3e-08, Method: Composition-based stats. 
Identities = 42/100 (42%), Positives = 52/100 (52%), Gaps = 22/100 (22%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV------D 54 M TTSTPHDA+FKTFL HP+TARDFMEIHLP LR+ DL L AS + D Sbjct: 1 MKKSTTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQR-DLLGLVERIASLLVTGCAND 59 Query: 55 EKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFR 94 +L+AL + Y++I+H R Sbjct: 60 RQLKALFN---------------YLMIQHGHTPRFTTFIR 84 >UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FHW2_9AQUI Length = 211 Score = 91.2 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 70/158 (44%), Gaps = 9/158 (5%) Query: 108 EHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDE 167 K++ P +I ++FYHG R + L D + L+D+ +PD+E Sbjct: 3 RSHKKEYYPPIINIVFYHGEREWNIPTN--LPTVKDKDLQEYTQKLNYILIDLNKIPDEE 60 Query: 168 IVQH--RRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR 225 + + + ++ I R D + + ++ L++ +DS L +L+ D + Sbjct: 61 LKNRISKNMDVILAILVMKRIFDDIQNLRPILELIIK-HKSDSLFIILDYIVLIKKDAEK 119 Query: 226 FNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRI 263 + + E++ E++MT+ E+ +G++KG+ Sbjct: 120 VEKILKEIS----GGDEKMMTLTEKWKMEGWMKGKLEG 153 >UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UAM6_YERAL Length = 105 Score = 88.9 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 20/101 (19%) Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ--------- 261 +L+NY+L GD A FI EL RR PQH+E +MTIA+++ +G +G Q Sbjct: 3 KSLINYMLQDGDAATPKTFIWELARRSPQHKELLMTIAQKLKQEGRQEGRQEGRVEGIQI 62 Query: 262 -----------RILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + R +L NG D + K+TGLS + + + Sbjct: 63 GEANGLKKGKLEVARTMLVNGLDRATVMKMTGLSDKDLTQI 103 >UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermoanaerobacterales RepID=B0KCX4_THEP3 Length = 267 Score = 86.9 bits (214), Expect = 7e-16, Method: Composition-based stats. 
Identities = 57/294 (19%), Positives = 110/294 (37%), Gaps = 37/294 (12%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 S +D K ++ A D L + F K+ SDI+ Sbjct: 1 MSQKYDITIKDIFSN--MADDITAYFL------GLTYTKTDELNIEFT--KVEKRQSDIV 50 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 T +GD I V +E QS D M +R++RYS+ +M+++ + ++ Y Sbjct: 51 LKCTTEKGD--IAVHLEFQSDNDDKMPYRMLRYSLEIMEKYNLTPYQ--------LVIYM 100 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ---HRRVALLELIQK 182 G L ++ + + + ++DV + +I + + ALL ++ + Sbjct: 101 GKND-----LRMENKLDYNLGEENILDYRYKIIDVGTIKFLDITKTDYYDLYALLPIMDR 155 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 R+ + + + V + + ++ + + E I + + Sbjct: 156 ERRKTEGEKYLKECVEAIKNIPIDINKKKDITFKAEILSGLVYSREVIERVFTEV----M 211 Query: 243 RIMTIAE-----RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 ++ I E I G + RI + LL+ G D I KIT LS E+++ L Sbjct: 212 EMLRIEESEAYKMILEKGAKEKSLRIAKELLKEGMDINKIAKITELSIEEIKKL 265 >UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1D5_ABIDE Length = 297 Score = 86.9 bits (214), Expect = 8e-16, Method: Composition-based stats. 
Identities = 44/234 (18%), Positives = 83/234 (35%), Gaps = 16/234 (6%) Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 + + ++ IE+QS D M R++ Y A + + + + V+ ++ Y G Sbjct: 67 KNEVIFSFIGIENQSAPDKDMILRIISYDGATYKSQM---GNESIYPVLTIVIYWGKYEW 123 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH----RRVALLELIQKHIRQ 186 A + + F L+D+ + E+++ R VA QK + Sbjct: 124 KAPVSLQERINCPRELADIIPDYRFKLIDIGRLSGKELIKFKSDFRLVAEFIARQKEYKP 183 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER-IM 245 + ++ L+ A D + L + E R L + E+ I Sbjct: 184 GKEEIKHPEELLDLLDLLAGDKRFKELKGKVKNIRKEGRIINMCELLDEIENRGIEKGIE 243 Query: 246 TIAERIHNDGYIKGE--------QRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 E+ G KG RI + + + I K TGL+ E+++ L Sbjct: 244 QGIEQGIEKGIEKGRSEGEETATLRIAKKFKDSNVSIDIIMKATGLTKEEIEEL 297 >UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PZ06_9BACT Length = 238 Score = 84.6 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 69/200 (34%), Gaps = 16/200 (8%) Query: 96 MRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF 155 M+Y + + + + Q L VIP++ YHG + E D R + + Sbjct: 1 MKYLLKIWAANSKQ--MQRLIPVIPVILYHGKETWKVRRFRDYFEGIDEVFFRFIPEFEY 58 Query: 156 PLVDVTVVPDDEIVQH----RRVALLELIQKHIRQR----DLMGLIDQLVVLLVTECAND 207 L D++ ++EI + + L+ ++I D + ++ E Sbjct: 59 LLTDLSFYSNEEIKDKVFRRVSLQITMLLMRNIYNDKILGDKLKAFFEIGKQYFEEGEGL 118 Query: 208 SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHND----GYIKGEQRI 263 + +++ Y+ D I L + MTIA R+ G ++G Sbjct: 119 KFLESVIRYLYYASDIEE-ERVIDTLKEISEEGGRLSMTIAARLIEKGKIAGRMEGRAEG 177 Query: 264 LRLLLQNGADPEWIQKITGL 283 R G E I+ L Sbjct: 178 ERKGRMEGL-IEAIEIGLEL 196 >UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KKN3_9FIRM Length = 297 Score = 83.5 bits (205), Expect = 9e-15, Method: Composition-based stats. 
Identities = 50/309 (16%), Positives = 112/309 (36%), Gaps = 32/309 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLREL-CDLDSLKLESASFVDEKLRA 59 M T D+LF+ E + + + +L+ + + + Sbjct: 1 MCMKPKRTYKDSLFRHIFNDKRRLASLYESLTGRKVAPRDIAITTLRGVFFNDIKNDI-- 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 S + + D +++EHQS + +M R++ Y + R ++ + +I Sbjct: 59 -------SFRIGDRDI---ILMEHQSSWNPNMPLRMLWYVAKLYSRQLDSQEVVYRSRLI 108 Query: 120 PM------LFYHGSR-SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHR 172 P+ +FY+GS+ P L D FA T +L + ++ ++++ Sbjct: 109 PIPAPEFYVFYNGSQDEPDYQKLRLSDAFAHATDTLELAVDCY---NINYSTQNKLLDSC 165 Query: 173 RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFN--EFI 230 I + + ++ + + + + L+ + F+ F Sbjct: 166 YELRCYSIFVQKVREGIQNGLE--LRTAIRQAITYCKTHDLMGDYFQKNESEVFDMVNFK 223 Query: 231 SELTRRMPQHRERIMTIAE-RIHNDGYIKGEQ----RILRLLLQNGADPEWIQKITGLSA 285 + R + +E + I E R G + GE+ ++ LL+ G I + T LS Sbjct: 224 WDQKRALEVAKEDGVAIGEARGEARGKLLGERNAMMKVALSLLKKGLPVGVITESTNLSL 283 Query: 286 EQMQALRQP 294 E+++ + + Sbjct: 284 EEVRKIAKD 292 >UniRef50_A8VV66 ATPase associated with various cellular activities, AAA_3 n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VV66_9BACI Length = 214 Score = 82.7 bits (203), Expect = 1e-14, Method: Composition-based stats. 
Identities = 42/211 (19%), Positives = 79/211 (37%), Gaps = 15/211 (7%) Query: 107 IEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFA------DPTTARKLYNAAFPLVDV 160 + + P L+IP+L G R + D F+ + N + L D+ Sbjct: 2 RKEGRGNPRTLIIPILIAQGRRRWSRSTTLMADFFSHYSEALRDDCEPFIPNFRYLLYDI 61 Query: 161 TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAND------SQITALL 214 ++++H + + + + + D L ++ LL + Q+ LL Sbjct: 62 QEQDAADMIRHTLLKITIELMALVFEEDESKLEARMTELLTMSEIGEISDSYAEQVLRLL 121 Query: 215 NYILLTG---DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 Y++ D+A F +T + E IM A+++ G K E I L + G Sbjct: 122 EYVMRGNRHFDQAMFETIRQNVTTEAHEGSELIMNFADQLEQKGKHKKELAIFLKLTRRG 181 Query: 272 ADPEWIQKITGLSAEQMQALRQPLPERERYS 302 E I + L + +AL+ + E + S Sbjct: 182 ESKESIMDLLDLDDKSFEALQAEVNEMDENS 212 >UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RQ02_FIBSS Length = 360 Score = 81.5 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 57/312 (18%), Positives = 117/312 (37%), Gaps = 26/312 (8%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFME-----IHLPKDLRELCDLDSLKLESASF--VDE 55 N T HDA F+ AR +E H +LD+L S+ VD+ Sbjct: 5 NKVTKRKHDAYFRWLFADTTHARCLLELAGKINHEIDAFLTQINLDTLMRIPDSYSEVDD 64 Query: 56 KLRALHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP 114 +D+ + V G + +++EH+S D + ++ +Y +VM+ ++ Sbjct: 65 TG---EADLAFRVNVSTGAPILVGILLEHKSGRDPIIFDQISKYIHSVMKIQDKNRIFSG 121 Query: 115 LPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIV--QHR 172 +P + ++FY+G + P L L++ K+ V++ +PD + + ++ Sbjct: 122 IPT-MAIIFYNGRDNWNP--LKILEKSYPDYFRGKVLPFQCTFVNMADIPDSDCLACENT 178 Query: 173 RVALLELIQKHIRQRD-LMGLIDQLVVLLVTECAND--SQITALLNYILLTGDEARFNEF 229 + + KH +D L+ L+ Q L N+ + Y++ + E Sbjct: 179 ATGMGIIALKHAFNKDKLLELLPQFCKFLDKMPRNEASCLLEKTSIYLMEYLGKDFLKEL 238 Query: 230 ISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 ++ ++I + + Q++ LQ + + I + QM Sbjct: 239 NMAFVSIGQKYG--FVSIGDYFRQQ-LAEERQQMTEERLQMAEERQQITE----ERLQMA 291 Query: 290 ALRQPLPERERY 301 RQ + E 
Sbjct: 292 EERQQITEERLQ 303 >UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKN5_MOOTA Length = 304 Score = 80.4 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 66/302 (21%), Positives = 115/302 (38%), Gaps = 31/302 (10%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HD LFK LT R+FME+ P L D K + + + + DIL Sbjct: 5 HDRLFKELLT--TFFREFMELFFPAA-HTLIDYTDTKFLTQEVITDITAGDKHYVDILAE 61 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM-LFYHG 126 VK + DG + V IE Q+ A R+ Y + ++H + V+P+ +F H Sbjct: 62 VKIKGEDGCVLVHIEPQAYRQADFARRMFIYFSRLYEKHQKR--------VLPIAVFAHD 113 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ 186 S+ F K+ F + + +P + + L+ K Sbjct: 114 SKVEETNRHEVEFPF------LKVLQFEFYKIQLKRLPWRQYLNSNNPVAAALLSKMDYS 167 Query: 187 -RDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRM-PQHRE 242 R+ + + + + LL + +++ + L + +L+ + P+ + Sbjct: 168 PRERVQVKIEFLRLLTRMQLDPARMELITAFFDSYLVLNAEEEKSLQEKLSEELQPEEVQ 227 Query: 243 RIMTIAERIH----NDGYIKGEQRILRLLLQNGA---DPEWIQKITGLSAEQMQALRQPL 295 R+M + H G +G Q IL L+ PE KI LSAEQ+ L + + Sbjct: 228 RVMELTTSWHLKGWQQGRQEGRQEILLRQLRKRLGTTSPEVEAKIKTLSAEQLDDLAEKI 287 Query: 296 PE 297 + Sbjct: 288 LD 289 >UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NIZ1_GLOVI Length = 311 Score = 80.4 bits (197), Expect = 8e-14, Method: Composition-based stats. 
Identities = 50/319 (15%), Positives = 113/319 (35%), Gaps = 39/319 (12%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HD LFK L+ +F+++ D+ + S+ + +D++ Sbjct: 4 HDRLFKELLS--TFFVEFIDLFF-ADVGNYLERGSIVFLEKELFSDITAGERYEADLVVK 60 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + R+ + V IE+Q+ ++R+ RY + +++ + P+ + + Sbjct: 61 ARFRDHQSFFLVHIENQTEAQSIFSYRMFRYFARLYEKYQL--------PIYPIAVFSFT 112 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 F D + + +V + + + ++ L+ + Sbjct: 113 EPLRAEPTAHRVAFPD----FTVLEFHYRVVQLNRLDWRDFLRQPNPVASALMARMRIAP 168 Query: 188 DLMGLID----QLVVLLVTECANDSQITALLN-YILLTGDEARFNEFISELTRRMPQHRE 242 + +L+ L + A I+ ++ Y+ LT E R F +EL +E Sbjct: 169 ADRPRVKLECLRLLATLRLDPARTQLISGFVDTYLKLTAQEERL--FAAELATIGASEQE 226 Query: 243 RIMTIA--------ERIHNDGYIKGEQRILRLLLQNGADP-------EWIQKITGLSAEQ 287 ++ I E+ G +G Q ++ + ++++GLS Sbjct: 227 AVVQIVTSWMQQGLEQGRQVGRQEGRQEEALAIVLRQLSRRLGTLPAQNAERVSGLSTTA 286 Query: 288 MQALRQPLPERERYSWLKS 306 ++AL + L + S L S Sbjct: 287 LEALSEALLDFASISDLDS 305 >UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillus coagulans 36D1 RepID=C1PBU4_BACCO Length = 329 Score = 80.0 bits (196), Expect = 1e-13, Method: Composition-based stats. 
Identities = 58/343 (16%), Positives = 119/343 (34%), Gaps = 67/343 (19%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLR 58 M HD LFK + + ++FM+ P DL D ++ S + Sbjct: 5 MEKHAGYHVHDRLFKELIQN--FFQEFMDAFFP-DLSADLDYRRVRFLSQEQFTDFPGGE 61 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 DIL K + D I + +E QS + R+ RY M + RH + V Sbjct: 62 QKRVDILAETKVKGKDTVILIHVEPQSYYEKPFPERMFRYYMMISLRHRK--------PV 113 Query: 119 IPM-LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDD----EIVQHRR 173 +P+ +F + ++ P + + F + R F + + ++ + + Sbjct: 114 LPIAVFSYEEKTETPDTYTF--AFHNIEILR------FHYLSIHLMKQNWRNYIRSNNPV 165 Query: 174 VALLELIQKHIRQRDLMGLID--QLVVLLVTECANDSQITALLNYILLTGDEARFNEFIS 231 A L + + ++ +++ + + A + +Y L +E E + Sbjct: 166 AAALLSKMGYTETERVQVKLEFLRMLARMELDPAKMRLLHGFFDYYL-KLNEKEEAEVME 224 Query: 232 ELTRRMPQHRERIMTI------------------------AERIHNDGYIKGEQR----- 262 + P E+++ + E+ +G G ++ Sbjct: 225 NIKMLDPDEAEQVLKLPNSYFDRGYKKGKEEGREEGIEIGVEKGREEGIEIGVEKGREEE 284 Query: 263 ---------ILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 I +LQ G + + I + TGLS +++ ++Q L Sbjct: 285 RKEMLQTIPIAIKMLQEGRELQLIVEKTGLSQREVEKIKQQLE 327 >UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostridia RepID=A5D0D4_PELTS Length = 332 Score = 79.6 bits (195), Expect = 1e-13, Method: Composition-based stats. 
Identities = 50/311 (16%), Positives = 104/311 (33%), Gaps = 38/311 (12%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HD LFK L +FME+ P+ + DL+ +K + +DI+ Sbjct: 8 HDRLFKQLLE--TFFAEFMELFFPEA-AQATDLEYVKFLQQELFTDITAGEKHRADIIVE 64 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + ++ G I V +E QS R+ Y + +++ ++P+ + Sbjct: 65 TRLKDEPGLILVHVEPQSYIQKEFNERMFIYFSRLYEKYRRK--------ILPVAVF--- 113 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 Y D F + + F +++ + + ++ L+ K + Sbjct: 114 --TYDHIRNEPDSFEIGFSFLDVLRFHFYKLELKKLHWRDYIRSDNPVAAALLSKMGFRP 171 Query: 188 D----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 + + +++ + + A I L + EF EL + + E Sbjct: 172 EERVQVKLEFMRMLARMKLDPARTELIGGFFE-TYLKLNRQEEEEFYRELGKIDKKEVEL 230 Query: 244 IMTIA----ERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT-----------GLSAEQM 288 IM I E+ +G ++G G ++K GL + Sbjct: 231 IMQITTSWHEKGRMEGRLEGRLEGRLEGRLEGEARGKVEKAQEIICEYLKVRFGLDTSGI 290 Query: 289 QALRQPLPERE 299 + + L ++E Sbjct: 291 REKVRQLTDQE 301 >UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID=Q73P51_TREDE Length = 292 Score = 76.9 bits (188), Expect = 8e-13, Method: Composition-based stats. 
Identities = 49/301 (16%), Positives = 117/301 (38%), Gaps = 38/301 (12%) Query: 11 DALFKTFLTHPDTARD-FMEIHLP---KDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 D++F + + A++ F+ ++ +L C +++++L++ ++ + +D+ Sbjct: 10 DSVFVDLFSEDERAKENFLSLYNALHGTNLPMSCPVENIRLDNVMYM-----NIINDV-- 62 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-------RHIEHDKRQPLPLVI 119 S DG I ++ EHQS + +M R + Y + + R+++ + P P Sbjct: 63 SCLV---DGKIIILAEHQSTINENMPLRFLEYIARLYEKLQAPTDRYLKKLSKIPTPEFY 119 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-----HRRV 174 +FY+G + L + + ++++ ++I+ Sbjct: 120 --VFYNGKEDYPETTALKLSDAFITKPKQAPLELTVQVLNINTDKANKILTACKPLEEYS 177 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 +E ++K + G + + + + + + I + E ++ I+ Sbjct: 178 LFVEEVRKQTQLDPENGFTNAIKICIEKGILKEYLMRKSREVINMLVAEYDYDTDIAV-- 235 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGE----QRILRLLLQNGADPEWIQKITGLSAEQMQA 290 Q E + E+ G+ G I + Q G D + I + TGLS E+++ Sbjct: 236 ----QREESLRIGIEQGIRQGFSDGAYQKAIEIAKAFKQFGFDIDKIAEGTGLSREEIEK 291 Query: 291 L 291 L Sbjct: 292 L 292 >UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF6_YERPN Length = 105 Score = 76.2 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 31/104 (29%), Positives = 53/104 (50%), Gaps = 4/104 (3%) Query: 203 ECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR 262 + + Q+ AL++Y+L G+ A F+ EL +R+PQH + +MTIA+++ G KG ++ Sbjct: 3 DYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGIEK 62 Query: 263 ILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERERYSWLKS 306 ++L Q G + L E +R + RYS S Sbjct: 63 GIQLGEQKGKLEVGVS----LDIENYCTIRASPACQRRYSRFLS 102 >UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobacterium RepID=Q6D6X6_ERWCT Length = 135 Score = 76.2 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 34/132 (25%), Positives = 59/132 (44%), Gaps = 20/132 (15%) Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 D++ L + +L Q A+L YI +G+ ++ EFI + + + RE IM Sbjct: 4 HHDMLELAQDIGILFERWQIPLPQKRAILFYIARSGNTSKPAEFIEAVAQSLSTDREAIM 63 Query: 246 TIAERIHNDGYIKGE--------------------QRILRLLLQNGADPEWIQKITGLSA 285 TIA+++ G+ KG ++I R LL +G +P + ++T LSA Sbjct: 64 TIAQQLEKIGFEKGIKHGMQQGMQRGMEQGIKTSARQIARQLLLSGMEPAQVCQMTQLSA 123 Query: 286 EQMQALRQPLPE 297 ++ L E Sbjct: 124 AELAQLSNESNE 135 >UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RGS0_MOOTA Length = 310 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 45/287 (15%), Positives = 109/287 (37%), Gaps = 29/287 (10%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 +D K ++ + + R DL ++ ++ SD++ Sbjct: 7 NRYDITIKDLFADET--QELINYFGHFEARVTGDL-KIEF-------PQVETRVSDLVMK 56 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 ++++G I+ +E QSR D M +R++RY++ + K LP V ++ Y G Sbjct: 57 AESQQGPLAIH--LEFQSRNDDEMPYRMLRYALEI-------HKTYHLP-VYQIVIYFGQ 106 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH---RRVALLELIQKHI 184 W + + + L + + L+DV + +E+ R ++LL ++ + Sbjct: 107 -----WQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRLLSLLPVVDREK 161 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 RQ+ + + ++ + +L + + I + R + Q Sbjct: 162 RQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGLVFDKKAIDLVFREVEQMLSIE 221 Query: 245 MTIA-ERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQA 290 + +RI G KG ++ + ++ G + + + ++ ++ Sbjct: 222 ESAGYQRIFEKGMEKGIEKGMEKGMEKGIEKGQQESLLDVTIRLLRK 268 >UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=Q3C0L0_SODGL Length = 131 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. 
Identities = 26/59 (44%), Positives = 37/59 (62%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +T + HD +FK FL ARDF+EIHLP LR+ CD +L + S SF+++ L+ S Sbjct: 2 TSTLSHHDHVFKKFLGDIAVARDFLEIHLPPHLRKHCDFSTLAMASGSFIEDDLKGQCS 60 >UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XMU9_SYNP2 Length = 316 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 45/259 (17%), Positives = 99/259 (38%), Gaps = 20/259 (7%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HD LFK LT DF+ + P ++ E + +SL + ++ + DI+ Sbjct: 7 HDLLFKELLT--TFFWDFLALFAP-EILETAEQNSLTFLTQEVFNDLPGQTRRNVDIVAK 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + R + V +E+Q+ A R+ Y + +++ + P+ + Sbjct: 64 LHFRGQETCFLVHVENQATSQADFAERMFLYFARLYEKYRL--------PIYPIALF--- 112 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR-Q 186 S + F+ ++++ + +F + + +P + ++ L+ K Sbjct: 113 -SYRSPQRLEPETFSVAFPSKEILSFSFQTIQLNRLPWRDFLRQPNPVAAALMAKMNFSS 171 Query: 187 RDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRMPQHRERI 244 + + + + ++VT + ++I L L + A F EL R PQ ++ Sbjct: 172 EERPKVKLECLRMIVTLRLDSARIHLLSGFVDTYLRLNMAEQQVFEQELHRIQPQEEAQV 231 Query: 245 MTIAERIHNDGYIKGEQRI 263 + I +G +G Q Sbjct: 232 LRIVTSWMEEGLQQGRQEG 250 >UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW Length = 286 Score = 74.6 bits (182), Expect = 4e-12, Method: Composition-based stats. 
Identities = 45/290 (15%), Positives = 102/290 (35%), Gaps = 25/290 (8%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HD LFK LT + + E D L S + D+L Sbjct: 7 HDRLFKELLTTFFEEFILLFF---PHVHEHIDFRHLSFLSEELFTDVTAGEKYRVDLLIQ 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K + G I + +E+QS R+ Y + +++ + ++P+ + Sbjct: 64 TKLKGEAGIIIIHVENQSYMQSSFPERMFIYFSRLFEKYRTN--------ILPIAIF--- 112 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVP-DDEIVQHRRVALLELIQKHIRQ 186 Y + F + F V++ I +A L + + Sbjct: 113 --SYDFIRDEPSSFTLQFPFLHVLQFQFLAVELRKQNWRHYIRSENPIATALLSKMGYNE 170 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALL-----NYILLTGDEARFNEFISELT-RRMPQH 240 + + L Q +L+ + ++++ L+ L +E +F + ++ + Q Sbjct: 171 NERVELKKQFFRMLIRQNIDEAKRRLLIGFFETYVKLTEQEEEQFQNEVKKMGGKEGEQV 230 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQA 290 E I++ ++ G + E+ +++ +++ G I + S E+++ Sbjct: 231 MELIISYEQKGKIAGAKEKEREMIQKMVEKGMSITQIAHLLDRSEEEVRK 280 >UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubacterium biforme DSM 3989 RepID=B7CC32_9FIRM Length = 301 Score = 73.8 bits (180), Expect = 7e-12, Method: Composition-based stats. 
Identities = 53/294 (18%), Positives = 101/294 (34%), Gaps = 15/294 (5%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D K FL + DF + R L + ++L+S H D++ K Sbjct: 6 DKTMKEFLENNAYFVDFFNAYFFDGERVLKPENCMELDSEMNDSNMDLEKHVDVI--RKY 63 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQR----HIEHDKRQPLPLVIPMLFYHG 126 +G+ Y +IE+QS D M R Y R ++ ++ LP+V ++FY G Sbjct: 64 NDGNLYSAFIIENQSYVDASMVVRAAAYEFVAYDRMLKKLKKNKAKEKLPMVHILVFYTG 123 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT-----VVPDDEIVQHRRVALLELIQ 181 + + D ++ L+++T ++++ + Q Sbjct: 124 EKLWNAANKLSQLVEVDERFESYFHDYQMNLIEITGNTSYNFNEEDVYNLFYICRSIYDQ 183 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 ++ + + VL V + D + L E E + Sbjct: 184 SIYEEKSNGFGLVKSSVLKVVKTLTDVEWLDLEELEEKEEIEMCEAEKRWLEVKSKEWEA 243 Query: 242 ERIMTIAERIHNDGYIKG----EQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + I E+ G +G E + R ++ G + I I +S E ++ L Sbjct: 244 KGIKKGIEQGIEQGIEQGSEKKELEMYRKMMDKGFGIKAIASIFSVSEESIEKL 297 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 71.1 bits (173), Expect = 5e-11, Method: Composition-based stats. 
Identities = 57/282 (20%), Positives = 107/282 (37%), Gaps = 26/282 (9%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 R+ M LP ++ + L+ +E + + + +D+L V+ +G+ Y+ + + Sbjct: 16 KIFRENMHNTLPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYV-LHV 74 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF 141 E+Q+ MAFR+ YS+ + ++H LP V + Y G + +F Sbjct: 75 EYQTDNYPEMAFRMAEYSIMLQRKH-------KLP-VKQFVIYIGPAKANMATSITTKDF 126 Query: 142 ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQL----V 197 +L + L + + ++ + +A+L + + L ++ ++ Sbjct: 127 RFRYNLTELSAVNYKLFLKSDLVEE-----KMLAILSNLASESTESVLAQVVQEIETHTS 181 Query: 198 VLLVTECANDSQITALLNYILLTGDEAR------FNEFISELTRRMPQHRERIMTIAERI 251 L +I L + + F E L RR E I I Sbjct: 182 TLEQGRYFRQLRILLQLRNLNKKAIKDMALVGKIFKEEKDILYRRGEIKGEIKGEIKGEI 241 Query: 252 H--NDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 G + I L + G E+I KIT LS E++QAL Sbjct: 242 KGIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEIQAL 283 >UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultured bacterium RepID=B5U1X5_9BACT Length = 304 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. 
Identities = 64/319 (20%), Positives = 118/319 (36%), Gaps = 37/319 (11%) Query: 1 MTNFTTSTP---H-DALFKTFLT-HPDTARDFMEIHLPKDLRELCDLDSL-KLESASFVD 54 M N + H D+LF + + D + F+ ++ L D+L + + +D Sbjct: 1 MQNENPTNENRSHKDSLFVDYFSKDRDWKQHFLSLYNALHGTNLQVADTLLERVN---ID 57 Query: 55 EKL-RALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ 113 + L ++ ++DI V +G ++IEHQS + +M RL+ Y + ++ + Sbjct: 58 QVLYKSYYNDIAVLV-----NGQFILMIEHQSTINPNMPLRLLEYVARIYGNLVDSKAKF 112 Query: 114 PLPLVIPM------LFYHGSRSPYPWS-LCWLDEFADPTTARKLYNAAFPLVDVTVVPDD 166 LV P+ +FY G + P S L D F + + A L V Sbjct: 113 SRHLV-PLARPEFYVFYTGDQKLPPESYLHLSDSFPN-----QPPKADLTLELKVKVC-- 164 Query: 167 EIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARF 226 I ++ + + L+++ E + A+ IL E R Sbjct: 165 TIKSDHPSPVVHRCPDLEQYAQFLKLVEEAKAAGQAEPLTWAIQEAVRRNILRDYLERRG 224 Query: 227 NEFISELTRRMPQHRERIMTIAERIHN---DGYIKG----EQRILRLLLQNGADPEWIQK 279 E +S L + + E + G +G + R LL G P+ + + Sbjct: 225 GETLSILMAEYDYATDFAVQKEEAYEDGLFAGLERGAYQNKLETARSLLSEGLAPQMVAR 284 Query: 280 ITGLSAEQMQALRQPLPER 298 T L E +Q L + + + Sbjct: 285 CTSLPLETVQQLGREVSPK 303 >UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXX0_9FIRM Length = 301 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. 
Identities = 67/318 (21%), Positives = 117/318 (36%), Gaps = 49/318 (15%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RALH 61 T T D+LF+ + + + E L D + L + +DE L + Sbjct: 2 RNTKRTYKDSLFRDIFNNAERLPEIYEALLD----HKTTPDDITLAT---IDETLFTGVK 54 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLV 118 +DI + V G+ ++ +++EHQS + +M RL+ Y + + +R+++ D K++ +PL Sbjct: 55 NDIGFIV----GNQHV-LLVEHQSTINANMPLRLLMYLVEIYRRYVDKDAIYKKELIPLP 109 Query: 119 IP--MLFYHG-SRSPYPWSLCWLDEF----ADPTTARKLYNAAFPLVDVTVVPDDEIVQH 171 P +FY+G + P W+L D F +D K++N I Sbjct: 110 APKFYVFYNGLAEMPDIWALHLSDAFGGHDSDLELEVKVFN---------------INDK 154 Query: 172 RRVALLELIQKHIRQRDLMGLIDQLVVL---LVTECANDSQITALLNYILLTGDEARFNE 228 +LE + + + + L N Q +Y+ + + E Sbjct: 155 PNRPILEKCHALKSYSVFVAKVRECIKNGSSLEIAVGNAVQYCVAHDYLGEYFRQKQAKE 214 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA------DPEWIQKITG 282 L Q R + AE G G Q L L G + K Sbjct: 215 VFDMLNFVWNQER-ALEVRAEEAMEKGLRLGRQEGLSQGLSQGVLETTTASIRNVMKSMD 273 Query: 283 LSAEQMQALRQPLPERER 300 E+ + Q +PE ER Sbjct: 274 FPIEKAMDILQ-IPEEER 290 >UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3A9D Length = 324 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. 
Identities = 55/324 (16%), Positives = 114/324 (35%), Gaps = 61/324 (18%) Query: 11 DALFKTFLTHPDTARDFMEIHL-------PKDLRELCDLDSLKLESAS---FVDEKLRAL 60 D L K + T PD D + L + D+++ ++E + D +LR Sbjct: 20 DILLKDYFT-PDIFADAINAILYDGKSVVTPERMRTIDIETQRVEDENGNVTADTRLRD- 77 Query: 61 HSDILWSVKTREGDGYIYVV--IEHQSREDIHMAFRLMRYSMAVMQRHIEHDK--RQPLP 116 S K E D IY + IEHQS ED M R+M Y + R ++ +K + + Sbjct: 78 ------SAKVVEVDDAIYCLFAIEHQSVEDYTMPLRIMEYDVREYLRQVKSNKGVQVRIK 131 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPT--------TARKLYNAAFPLVDVTVVPDDEI 168 +I ++ Y + + D F T + + L + V ++++ Sbjct: 132 PIITIVMY-WKADKWNQPVSVKDMFDKNTVRWLEYNGLGGYIQDYRMHLFEPGTVKEEDL 190 Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 + + +D++ + + N+ L + +E ++ Sbjct: 191 EKFK-----------TELKDVIAYVKYSKSTEALKDYNEKYKPDLTKSTVTLINELTNSK 239 Query: 229 FISELTRRMPQHRE--RIMTIAERIHNDGYIKGEQRILR-----------LLLQNGADPE 275 ++ + +E + E + +G KG+ L+ L + G Sbjct: 240 YVFI------EGKERLDMCEAFEGLIEEGRAKGKAEELKEKYKSWVTLSNNLKKRGMSNP 293 Query: 276 WIQKITGLSAEQMQALRQPLPERE 299 I + G+ ++Q + + E + Sbjct: 294 EIASLLGVPETELQKAFKMIKEEK 317 >UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWJ8_9FIRM Length = 292 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. 
Identities = 48/302 (15%), Positives = 118/302 (39%), Gaps = 43/302 (14%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLR-ELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D+LF D +D +D+ + L +L+ +F +++ +D+ + Sbjct: 10 DSLFCDIFRRKDYLQDVYRGLFGRDVSLQEIQLMTLQ---GTFFNDE----KNDVSFLA- 61 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-----KRQPLPLVIPMLFY 124 G I V++EHQS + +M R+ Y + ++ + D +R LP +FY Sbjct: 62 ---GKRQI-VLMEHQSTLNENMPLRMFWYMAKLYRKQVPKDAPYRTRRLRLPAPCFYVFY 117 Query: 125 HG-SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV--ALLELIQ 181 +G +P W + + F ++ +L A+ ++ + +++ R + Sbjct: 118 NGLDPAPDEWEMRLSEAFEGECSSLELCVKAY---NINEMSGSRLLEKSRALKGYSVFVA 174 Query: 182 KHIRQRDLMGLIDQLVVL---------LVTECANDSQITALLNYILLTGDEARFNEFISE 232 + R+ +++ V L+ E + ++ + + + D Sbjct: 175 QIRRKTAAGVCLEEAVKQAIRYCIEQDLLAEYFLEREMEEVFDMVSFKWDPELAKRV--- 231 Query: 233 LTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA-DPEWIQKITGLSAEQMQAL 291 ++ + +E M E+ G KG I+ +L+ + I +++ +++++L Sbjct: 232 ---QLQEAQEIGM---EKGMEKGMEKGVTEIVLNMLKKKKWSLQDISEVSQWPLDKIESL 285 Query: 292 RQ 293 + Sbjct: 286 GK 287 >UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LE73_9FIRM Length = 326 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. 
Identities = 54/310 (17%), Positives = 118/310 (38%), Gaps = 30/310 (9%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV-------DEK-----LR 58 D + K + DF+ L + R L L V D++ Sbjct: 5 DIILKEYQRDSRHFCDFVNGALAQG-RPLLKRGQLVPVPTELVLVKDTEEDDENAVVKTV 63 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ------RHIEHDKR 112 DI + + G I V I++Q+ D M R+M ++ + K Sbjct: 64 QRFRDITGKAEADKNAGCIIVAIQNQTTVDYGMPLRVMLEDALEYDVQRRTKKNRKLHKG 123 Query: 113 QPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL--YNAAFPLVDVTVVP-DDEIV 169 + L LVI ++FY+G+ +P+ + + P R+L Y ++P+V VT D Sbjct: 124 EKLCLVITLVFYYGT-TPWRAPSDLAEMISVPREFRQLREYIQSYPIVVVTPENVDTACF 182 Query: 170 QHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQ-ITALLNYILLTGDEARFNE 228 + +LE++++ ++++ +++ + + ++ I AL +++ + E Sbjct: 183 RGGWQEILEILRRQNDEKEMGRYLEKNRAIYEKLPEDTNRVIFALTDHLDYYRELKEKGE 242 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 I+ ++ + E G +G ++ ++ + G D I + ++ Sbjct: 243 KITMCKAFTDHYKSGV----EEGKKQGMKRGRRQGIKQGKRQGMDMGIRAMIE--TCREL 296 Query: 289 QALRQPLPER 298 + R +R Sbjct: 297 KIPRNETKKR 306 >UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacteroidales RepID=A6LFA9_PARD8 Length = 305 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. 
Identities = 48/300 (16%), Positives = 99/300 (33%), Gaps = 23/300 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK + +D + L L + L++ + + E + +++ + Sbjct: 10 DFGFKHIFG-REMDKDILIEFLNDLLEGEYTIMDLRIMNNERLPETEQGRK--VIFDIHC 66 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIPMLFYH---- 125 G ++IE Q+RE H R + Y +V+++ I+ L V + F + Sbjct: 67 ETDKGE-RIIIEMQNREQPHFKDRALYYLSHSVVEQGIKGTWDYELAAVYGVFFLNFTLD 125 Query: 126 ---GSRSPYPWSLCWLDEFADPTTARKLYNAAF--PLVDVTVVPDDEIVQHRRVALLELI 180 G D +++N F +++ +E + Sbjct: 126 EENGPDKNGKEGKFRRDIILADRENGQVFNPKFRQIYIELPRFNKEEEECETDFERWIYV 185 Query: 181 QKHIRQRD---------LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFIS 231 KH+ D + ++++ + +Q A + F Sbjct: 186 LKHMDTLDRMPFKARKAIFERLERIGSMANLTPKQRAQYEAEWKMYNDYYNTLDFAVEKG 245 Query: 232 ELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + + +G KG++ R + G P IQK TGLS E+++ L Sbjct: 246 MKKGMEEGMEKGLQKGLQEGLQEGLQKGKESTARNMKAEGITPLIIQKCTGLSLEEIERL 305 >UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XJH0_CALS8 Length = 134 Score = 68.4 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 50/128 (39%), Gaps = 3/128 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N + +A+F+ + ++ + DS+++ + +E + Sbjct: 1 MNNNFSQDE-NAIFRLIFSDSKEILFLLKNVAKFSWVDRIQKDSIEVILVDYDNENVLKY 59 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D++ V YI+V + + M ++ + ++ I+ + +P +IP Sbjct: 60 KPDVIAKVTIENNTAYIFVFFVSKV-PECGMRNIILNNMLLFWEKKIKEGTDK-IPPIIP 117 Query: 121 MLFYHGSR 128 ++ Y+G Sbjct: 118 LVLYNGKE 125 >UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteria RepID=B1WSK8_CYAA5 Length = 260 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. 
Identities = 48/282 (17%), Positives = 111/282 (39%), Gaps = 30/282 (10%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 D F+ +D + ++L + L + +D L +++ + I + I Sbjct: 5 DNVCKFLAERFSRDFANWLLNEPIELTELKPTELSLNPIRADSLIFLQSDD----IVLHI 60 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF 141 E Q+ D + FR+ Y + V +R+ + Q ++ Y P L + + F Sbjct: 61 EFQTSPDEDIPFRMTDYRLRVYRRYPNKEMYQ-------VVIY---LKPSNSELVYQNTF 110 Query: 142 ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLV 201 + F ++ + D + + + ++ R+ + Q+ ++ Sbjct: 111 ELTNLRHQ-----FNVIRLWEENTDSFLNNSGLLPFAVLTCTDNPRETLT---QIAAIID 162 Query: 202 TECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ 261 + Q + +L+G + + L R +E + I + I ++G +KG++ Sbjct: 163 SMPNQQRQSDISASTAILSGLKLDQDSIKRIL--RSDIMKESV--IYQEIFHEGEVKGQK 218 Query: 262 R----ILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERE 299 + I +L+N + E I ++TGL+ ++++ L L E Sbjct: 219 QAIKNIALNMLRNHMNLEVISQLTGLNLQEIEQLNLSLNTEE 260 >UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3R2_ABIDE Length = 336 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. 
Identities = 57/303 (18%), Positives = 115/303 (37%), Gaps = 55/303 (18%) Query: 11 DALFKTFLTHPDTARDFMEI-HLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D++F + R + H D D + LE+ F++ A ++D+ ++VK Sbjct: 67 DSVFTLLFSDIKNIRKLYQSLHDDSDSYSDEDFKIITLENV-FIN----APYNDLGFTVK 121 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-------DKRQPLPLVIPML 122 + + ++ E QS + +M RL+ Y +I +K LP ++ Sbjct: 122 NK-----VIILAEAQSTFNPNMGLRLLIYIAQSYHDYISEYKFNIFSEKLIRLPNPEFIV 176 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 Y GS+ + D F T LV V V+ + + + +IQ+ Sbjct: 177 IYSGSKKTDITEIRLSDCFESGT------APNIELV-VKVIGGNNVKE-------GIIQE 222 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 +++ ++ D+ V + + ++ + G F + + + Sbjct: 223 YLKFCEM---YDEKVRSVKPSEEKAYSLKKVIKDCIDNGILKDFLTLHQK------EVED 273 Query: 243 RIMTI---AERIH-------NDGYIKGEQRI----LRLLLQNGADPEWIQKITGLSAEQM 288 +MT+ + + N G +G+ R +L+N + I +ITGLS EQ+ Sbjct: 274 MMMTVIPPEQALEYIKLEEYNKGIEQGKLDTSLNFARNMLKNNYSIDSIIEITGLSREQI 333 Query: 289 QAL 291 + L Sbjct: 334 KRL 336 >UniRef50_B3CQQ1 Putative transposase n=3 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CQQ1_ORITI Length = 153 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. 
Identities = 37/152 (24%), Positives = 66/152 (43%), Gaps = 31/152 (20%) Query: 176 LLELIQKHIRQRDLMGLIDQLV-----VLLVTECANDSQITALLNYILLTGDEARFNEFI 230 +LE + KHI QRD++ L ++ + VL++ + + + L Y E++ E Sbjct: 1 MLEYMLKHIHQRDMLKLWEEFLIKFKHVLILDKEKGYIYLRSFLWYTDTKLLESQQPELE 60 Query: 231 SELTRRM-PQHRERIM-TIAERIHNDGYIKGE------------------------QRIL 264 L + + + + IM TIA + ++G GE Q + Sbjct: 61 QVLAKYLSEEEKSNIMRTIAAKYIDEGIEIGETKGIAKGIAKGIAEGIEIGEVKAKQGLA 120 Query: 265 RLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 R LL+ G E+I + TGLS E++ L+ + Sbjct: 121 RNLLKAGFSVEFISENTGLSKEEVINLKNNIE 152 >UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C351D8 Length = 313 Score = 66.1 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 53/308 (17%), Positives = 104/308 (33%), Gaps = 31/308 (10%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D LF+ +D ++++ R+ + D L + + D + +D Sbjct: 5 KLNRNYKDRLFRLAFQEK---KDLLDLYNAVSGRQYTNPDDLII--TTLADAIYLGMKND 59 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PLP 116 I + V + EHQS + +M R + Y + +I+ + LP Sbjct: 60 ISFLVSDVLN------LYEHQSSFNPNMPVRGLNYFADTYREYIDRNGFDIYGEKLIRLP 113 Query: 117 LVIPMLFYHG-SRSPYPWSLCWLDEF-ADPTTARKLYNAAFPLVDVTVVPDDEIVQH--R 172 + ++FY+G P L D F + ++++ + E++ R Sbjct: 114 MPQYIVFYNGTKEEPDRIELRLSDAFLCQNPEEKGCLECRATMININYGHNKELMDRCRR 173 Query: 173 RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE 232 + + +D+ V V C + +L IL N + E Sbjct: 174 LKDYAVFVSRIRNNEKRGMALDEAVKQAVHSCIEEG----ILADILKKNRAEVCNLILYE 229 Query: 233 L--TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQA 290 R++ RE M +G + I+R + G +P I + GL ++ Sbjct: 230 YDEQRQLAIAREGAMKAG---REEGRAAEQVTIIRNMAGKGLNPSAIADMLGLEEGYVKK 286 Query: 291 LRQPLPER 298 + L E Sbjct: 287 VLYLLAEE 294 >UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. 
PS RepID=A7BWQ7_9GAMM Length = 290 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 60/311 (19%), Positives = 110/311 (35%), Gaps = 51/311 (16%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVD--EKLRALHSD 63 HD+LFK +T +F + P + F+ E L+ Sbjct: 3 NPKSHDSLFKWLIT--AFTTEFFGHYFPD-----IRIGEYTFIDKEFISKYENLKESLKG 55 Query: 64 ILW---SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 L+ V+ I + IEHQS + ++ R+ YS + V Sbjct: 56 DLFLGMEVEIDGLLREIIIQIEHQSERE-DVSERVYEYSCYAWLLKKK--------PVWS 106 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDE----IVQHRRVAL 176 ++ Y W ++F ++K + DV V ++ I +H + Sbjct: 107 IVIY---TDEAVWRKPVTEQFWYAFDSQK--GKQYHHFDVIKVKAEKSSDLIQKHSLMCK 161 Query: 177 LELIQKHIRQRDLMGLIDQLVVL--LVTECANDSQITALLNYI--LLTGDEARFNEFISE 232 L ++ RQ D L+ ++ L+ E + Q+ + ++ E R ++ E Sbjct: 162 LLALKADDRQTDPEKLVYEIYRAAALMKEQLTNEQLLLIDQWVSFYKKVSEKRLDKIKKE 221 Query: 233 LTRRMPQHRERIMTIAE------------RIHNDGYIKGEQRILRLLLQNGADPEWIQKI 280 + + TI+E +G KG ++ LL+ G D E IQK Sbjct: 222 IKMDFIE-----TTISEHVYNQGWIKGEAEGKAEGEAKGRKKTAINLLKMGIDVEIIQKA 276 Query: 281 TGLSAEQMQAL 291 TG S +++ + Sbjct: 277 TGFSDAEIKQM 287 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. 
Identities = 53/296 (17%), Positives = 107/296 (36%), Gaps = 34/296 (11%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKD---LRELCDLDSLKLESASFVDEKLRALHSDILW 66 HD FK D ++ ++ P+ ++ D++ L+ E +L D+ Sbjct: 4 HDHNFKNLF--LDFPKETLDWFFPQAGQSWGKVLDVEFLRQEPKKHNLSD-SSLELDMPI 60 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 ++ ++E Q + ++L+RY+ +M+ H + LVIP + + Sbjct: 61 LFNFENQQLLLW-LVEFQEDKSKFSIYKLLRYTTDLMETHPDA-------LVIPTVLFTD 112 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAF---PLVDVTVVPDDEIVQHRRVALLELIQKH 183 + WS L + R + + L D+ D V + V +L + Sbjct: 113 RKK---WSKAVLQQLHAQLHDRMFLHFEYVFHKLFDLNAR-DYYNVDNPVVKILLPKMHY 168 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 ++ D + +I Q L ++ +++I + + L + QH+E Sbjct: 169 KKE-DRIEVIRQAYAGLFQLVSS-GLFDKYVDFIDTYAEIEDQEQL--NLYNEIVQHKET 224 Query: 244 IMTIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 M +A+ I G +G + +R Q G I KI L + + Sbjct: 225 AM-LAQYIRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDLDVSMVNKI 279 >UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LJP2_9FIRM Length = 326 Score = 65.4 bits (158), Expect = 3e-09, Method: Composition-based stats. 
Identities = 40/236 (16%), Positives = 83/236 (35%), Gaps = 17/236 (7%) Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI------EHDKRQPLPLVIPML 122 K DG I V +++Q+ D M R+M + K + L VI ++ Sbjct: 81 KIVAPDGEIIVALQNQTTVDFGMPLRVMTEDALEYDVQRRMCKDEKLHKGEKLAPVITIV 140 Query: 123 FYHGSR-SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVP-DDEIVQHRRVALLELI 180 FY+G++ P L + + + K Y + ++ +T D + E++ Sbjct: 141 FYYGAQIWSGPTDLADMVKIPEEFKWLKKYIRPYAMLLITPENVDAAWFSGGWREVFEIL 200 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITAL----LNYILLTGDEARFNEFISELTRR 236 Q+ ++++ + + + + +++ L+Y + Sbjct: 201 QRRNDEKEMQRYLQKKRSVYEKLPEDTNRLIFALTGHLDYYNALKRKGERAVMCKAFEDH 260 Query: 237 MPQHRERIMTIA-ERIHNDGYIKGEQRILRLLLQNGADPEWI----QKITGLSAEQ 287 E I + + G +G ++R + G E I QK LS E+ Sbjct: 261 YKSGVEEGKNIGIHQGISQGLGRGIGAMIRENQEEGKTTESIIDKLQKYFSLSREE 316 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 57/299 (19%), Positives = 105/299 (35%), Gaps = 32/299 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D + K + P+ F+ L D+ + L+ +L F +++L + D+ K Sbjct: 13 DIMAKKIFSLPEVTVAFIRDILDLDVVDAQILEGTQLHKKDFDEDELFSTSVDV--RAKL 70 Query: 71 REGDGYIYVVIEHQSR--------EDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 +G V+IE Q R ++A +L+ + Q+ H + + V + Sbjct: 71 NDGTE---VIIEIQVRKQHYFLNRFHYYLANQLVENVQQLRQQGQTHKMYEQMEPVYGIA 127 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL--I 180 + P S A+ T + LY+ D + ++A LEL Sbjct: 128 ILEKTLLPDEESPINTYWMANSRTGKPLYSF---------YKDGKQQNLLQIAFLELDKY 178 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 K RD + L A +T + + + I E R + Sbjct: 179 NKDKHIRDEGRQWLEFFGNLPFSKAPSRAVTHADSLLDSSSWTQEEKAMIDERIRIQENY 238 Query: 241 RERIMTIAERIHNDGYIKGEQRI--------LRLLLQNGADPEWIQKITGLSAEQMQAL 291 + T + +G +G +R +R +L G E + +TGLS E++ L Sbjct: 239 DMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSLEELDGL 297 
>UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C353CE Length = 319 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 42/309 (13%), Positives = 109/309 (35%), Gaps = 47/309 (15%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D LF+ + + + ++ E + D L++ D+++ +K Sbjct: 29 DRLFRMVFNRKE---ELLSLYNAVSHSEYTNPDDLEI-----------NTLDDVIY-MKM 73 Query: 71 REGDGY----IYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL---PLVIPM-- 121 + + + + EHQS + +M R Y + +++I+ + + +P+ Sbjct: 74 KNDLAFLIDDVLNLWEHQSTWNPNMPVRGTFYIVEEYRKYIDQNGLNLYGSSRITLPVPQ 133 Query: 122 --LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQH--RRVA 175 +FY+G R + L + + F ++++ ++E+++ Sbjct: 134 FYVFYNGLREEPDYIELKLSDAFSRVHSEVEPCMEFKAVMLNINRGHNEELMRQCTTLRE 193 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDS--------QITALLNYILLTGDEARFN 227 E + + + + +++ + ++ C D + +L DE R Sbjct: 194 YAEFVARIRDETEDGTALEEAAMNVMDSCIRDGILAEFLSVHRAEVFEVLLTEYDEQRHI 253 Query: 228 EFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ 287 E++R RE M +G ++ + + L++ G E I G + Sbjct: 254 ASEKEISR-----REGHM----EGRTEGILEKAKEVAVNLIKKGFTVEDAASICGEDICR 304 Query: 288 MQALRQPLP 296 ++ + Sbjct: 305 VKEWHREWK 313 >UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostridium difficile RepID=C9XMT1_CLODC Length = 158 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. 
Identities = 28/123 (22%), Positives = 54/123 (43%), Gaps = 9/123 (7%)

Query: 14 FKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREG 73
++ + ++ ++ + + L +++LE SF+ E + DI++ V ++ G
Sbjct: 13 YRRMYSDKESFLSLIQNFTSVSIAKELTLKNIELE-TSFICE-YKGKEVDIIYKVFSKSG 70

Query: 74 DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE-------HDKRQPLPLVIPMLFYHG 126
Y+V+E Q+ D + RL Y + + I DK LP VIP++ Y G
Sbjct: 71 KVSHYIVLEFQTEMDTEIVPRLKSYREQIWKSFIMKKSLEEIEDKNFKLPKVIPVVLYSG 130

Query: 127 SRS 129
Sbjct: 131 PER 133

>UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Treponema vincentii ATCC 35580 RepID=C8PTN1_9SPIO
Length = 303

Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats.
Identities = 52/312 (16%), Positives = 118/312 (37%), Gaps = 52/312 (16%)

Query: 11 DALFKTFLTHPDTARD-FMEIHLP---KDLRELCDLDSLKLESASFVDEKLRALHSDILW 66
D++F + + A++ F+ ++ +L+ C ++++KL++ ++ + +D+
Sbjct: 10 DSVFVDLFSEDEKAKENFLSLYNALHGTNLQLSCPVENIKLDNVMYM-----NIVNDV-- 62

Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-------RHIEHDKRQPLPLVI 119
S I V+ EHQS + +M R ++Y + + R++ + P P
Sbjct: 63 SCLVDNK---IIVLAEHQSTINENMPLRFLQYIARLYEKLQKPTDRYLRTLSKIPTPEFY 119

Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179
+FY+G ++ L + R + + ++ E++
Sbjct: 120 --VFYNGLNDYPETTVLKLSDAFITKPERIPLDLEVKVYNINKSKGAEVLSRC------- 170

Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNY-ILLTGDEARFNEFISELTRR-- 236
K + + L +L L E + + + IL + + E I+ L
Sbjct: 171 --KTLDEYSLFIEEVRLQTQLDPENGFTNAVKICIEKGILKEYLQRKSREVINMLIAEYD 228

Query: 237 ----MPQHRERIMTIA-ERIHNDGYIKGE------------QRILRLLLQNGADPEWIQK 279
+ RE IA + + G +G RL+ Q + +I K
Sbjct: 229 YDTDIAVQREEAGKIAFAKGISQGLSQGISQGLSQGSHQKALETARLMKQANCEIPFIAK 288

Query: 280 ITGLSAEQMQAL 291
+TGL+ +++++
Sbjct: 289 MTGLTQAEVESI 300

>UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Rickettsia RepID=A8GY36_RICB8
Length = 279

Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats.
Identities = 46/296 (15%), Positives = 96/296 (32%), Gaps = 35/296 (11%)

Query: 11 DALFKTFLTHPDTARDFMEI--HLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68
D FK T F+ LP++LR + D LK S V + + S + V
Sbjct: 10 DVAFKKLFTDKARLISFLNNIMRLPEELR-IID---LKYISNEQVPDLGQNKRS--IVDV 63

Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIPMLFYHGS 127
K + G IY+V D +A R+ Y ++ K L V+ ++ G
Sbjct: 64 KVTDNSGNIYIVEMQNGYADAFLA-RVQFYGCVAFSSQLKRGKEYADLAPVVMVIITSGF 122

Query: 128 RS--PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDD----EIVQHRRVALLELIQ 181
++ + + +L ++ V++ + E ++ + ++
Sbjct: 123 QALPEEKECISYHQTINVGNGKNQLKCLSYVFVELDKFTKEANELETIEDDWLYMMAKFD 182

Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241
K ++ + + + Q
Sbjct: 183 KAKEPPK----------------HTQDEVVLSAYKTIEQFNWSEAEYDNYIKAMLAAQTE 226

Query: 242 ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297
E + + +G + + + +LQ+ E I K T LS E+++ L+ + +
Sbjct: 227 E--LNQKSKFK-EGKAERSIEMAKEMLQDNEPIEKIIKYTKLSKEEIEKLKLEIEK 279

>UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bacteroidales RepID=A6LF36_PARD8
Length = 273

Score = 64.2 bits (155), Expect = 5e-09, Method: Composition-based stats.
Identities = 45/286 (15%), Positives = 95/286 (33%), Gaps = 27/286 (9%)

Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70
D F + ++ + L + D++ + + E L I++ V
Sbjct: 10 DFGFHRIFGQ-EVHKELLIDFLNQLFFGEHDIEDITFLNPIQTPETLDDRG--IVFDVHC 66

Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--LPLVIPMLFYHGSR 128
++ +G ++V +E Q+ + R + Y + + K L V + +
Sbjct: 67 KDSNGNLFV-VEMQTGAQPYFHDRGLYYLARAISNQGQKGKDWKFALQPVYGVFLLNYKM 125

Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHRRVALLELIQKHIRQ 186
S D ++++ +++ + + KH+
Sbjct: 126 DVN--SKFRTDVILADRETGRMFSDRIRQVYLELPYFQKEPDECENDFERWIYLLKHMDT 183

Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARF-NEFISELTRRMPQHRERIM 245
+ M A+ + +L D A E + + ++R+
Sbjct: 184 LERMPF---------------KAKKAVFDKLLEVADVANLSKEERIQYDEALKRYRDYKN 228

Query: 246 TIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291
TI + G +KG++ R + G P IQK TGLS E ++ L
Sbjct: 229 TI-DYAEEKGILKGKESTARNMKAEGIAPLIIQKCTGLSLEDIEKL 273

>UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UZR7_CLOBO
Length = 334

Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats.
Identities = 50/340 (14%), Positives = 106/340 (31%), Gaps = 53/340 (15%) Query: 1 MTNFTTSTPHDALFKTFLTH-PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 MT D + K + + + ++ D L + + F+ + Sbjct: 1 MTVSNEKVKLDEILKFLFSTSKKVLVNLLNGIFEENFSS--DEVELSVSNNEFIMDTFDT 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRY--------------------- 98 L D+ + V E + +E Q++ D M R+ Y Sbjct: 59 LRGDVFFEVLNNEVSNKVTYHLEFQTKNDSTMIIRMFEYGFRKGKEQTGNRDDFKTIYFP 118 Query: 99 --SMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLD--EFADPTTARKLYNAA 154 + ++R+ + L +V+P + +S+ + E+ D Sbjct: 119 KQKVIFIERNNNIKEDIKLKIVLP------DEQSFIYSVPVMKYWEYTDNELIENKMYPL 172 Query: 155 FPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITA-- 212 PL + D E + H + + + ++ L Sbjct: 173 LPLQLFNLRKDLEYARRSNNIDKINDLSHEAKEIALKIANESKKLFDDNEIIGEDFHKML 232 Query: 213 -----LLNYILLTG-DEARFNEFISELTRRM---PQHRERIMTIAERIHNDGYIKGEQRI 263 L+ Y+ ++ R E +S +T+ + + I E+ G KG ++ Sbjct: 233 LAIQNLIEYLNRNYFNDDRLEEEVSTMTKTLYDPEVEKRGIEKGIEKGIEKGIEKGMEKG 292 Query: 264 LRL--------LLQNGADPEWIQKITGLSAEQMQALRQPL 295 + L+ G E + K TGL E+++ L+ + Sbjct: 293 IEKKAIEDAIGFLRLGVSEEIVSKGTGLPIEKVRELKDKI 332 >UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2V6_DESAS Length = 300 Score = 62.7 bits (151), Expect = 1e-08, Method: Composition-based stats. 
Identities = 43/253 (16%), Positives = 90/253 (35%), Gaps = 33/253 (13%) Query: 35 DLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVV-IEHQSREDIHMAF 93 ++ ++ ++ + SD+L+ V DGY Y++ IE Q R D M Sbjct: 22 EMVRGITVEDVQRVEKEAIA---VKRESDMLFRVS---EDGYEYLMAIEMQIRPDREMPR 75 Query: 94 RLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNA 153 RL+ Y+ MQ K+ P+++ + + + + LD + Sbjct: 76 RLLEYT--AMQH--REFKKPVYPVIVNLTGH--KKKDESYCFDCLD--------FTVVTF 121 Query: 154 AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 + ++++ +P + ++ V L+ L+ + + V + D + A Sbjct: 122 NYRQINLSDLPGQDFLRSGPVGLIPLVVLMRHDEAPEEVFAKCVQRVDE--VQDEGLRAD 179 Query: 214 LNYILLTGDEARFN-EFISELTR--------RMPQHRERIMTIAERI-HNDGYIKGEQRI 263 L L +F E I + RE+ + E+I G KG Q+ Sbjct: 180 LYLGLAVLSTIKFTREIILKYIEVNKMENSPLFDGIREKWIDQGEQIGFQKGIQKGIQQA 239 Query: 264 LRLLLQNGADPEW 276 ++ + + Sbjct: 240 MQQSILEALEENI 252 >UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W1F3_DESAS Length = 303 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. 
Identities = 43/271 (15%), Positives = 94/271 (34%), Gaps = 54/271 (19%) Query: 56 KLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL 115 +++ D ++ +K + +E Q+ + R++ Y +++++ + Sbjct: 55 EVKEKRIDFVFLLKDNS-----ILHLEFQTTIPKDILIRMVTYGSRLVEKYDQD------ 103 Query: 116 PLVIPMLFYHGSRSPYPW-----------SLCWLDEFADPTTARKLYNAAFPLVDVTVVP 164 V ++ Y G P ++ +F +++Y P Sbjct: 104 --VNTVVIYSGKIESAPRLLRKGSLTYKVKNIYMKKFDGDAEYKRIYEKI-----KNKKP 156 Query: 165 DDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL---------- 214 DEI R + L + K + ++ +L + E I A++ Sbjct: 157 LDEIDIQRLIFLPLMKSKEKSEDEMAIQAAELAKEIPNEPIRAFTIGAIVAISDNFLTEE 216 Query: 215 --NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN-- 270 +L + ++I E E + + +G +G + LR L+ Sbjct: 217 YKKRLLEVLRMTQIEQWIRE-----EGREEGLKEGLKEGREEGLKEGLKEGLREGLEKTA 271 Query: 271 ------GADPEWIQKITGLSAEQMQALRQPL 295 G D E I KIT LS E++ +L++ + Sbjct: 272 IAALREGFDIETIVKITNLSKEEILSLKKKI 302 >UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTD5_DYAFD Length = 308 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. 
Identities = 49/318 (15%), Positives = 105/318 (33%), Gaps = 58/318 (18%) Query: 11 DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK + D DF+ + + R + DL E+ + LR D+ Sbjct: 10 DFGFKRIFGSEANKDILIDFLNVLFAGE-RLVADLTFASNENNGRI-PILRRAIFDLC-- 65 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLFY 124 +G+ +I IE Q + R + YS ++++ +E R L V + Sbjct: 66 CTGADGEQFI---IEVQRVRQEYFKDRCLYYSASLIRDQVEAGGTNWRYDLKPVYLI--- 119 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFP---------------LVDVTVVPDDEIV 169 F D L+ +++ E Sbjct: 120 ----------GLMDFCFEDSDDGHYLHEIRLIKRSNGQVFYDKFGLTFIEMPAFQKKESD 169 Query: 170 QHRRVALLELIQKHIRQRDLMG------LIDQLVVLLVTECANDSQITALLNYILLTGDE 223 + + K++ + +++ + ++ + N + A Y+ D Sbjct: 170 LSTELDRWLYLLKNLSKLNIVPPVLTNPVYQKVFRVAEVCNLNKEEKMAWDAYLKAKWDN 229 Query: 224 ARFNEFISELTRRM-----------PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 ++ + R+ H+E + ++ G G++++++ +L G Sbjct: 230 ENSMDYAKKEAMRVGHEEGHKEGHKEGHKEGMKEGIKKGRETGIELGKRQVVKNMLAKGF 289 Query: 273 DPEWIQKITGLSAEQMQA 290 D + I ITGL+ EQ++ Sbjct: 290 DMQTISDITGLTFEQIRN 307 >UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicyclobacillus acidocaldarius RepID=C8WSD0_ALIAD Length = 270 Score = 61.5 bits (148), Expect = 3e-08, Method: Composition-based stats. 
Identities = 53/265 (20%), Positives = 102/265 (38%), Gaps = 40/265 (15%) Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 +++L+ + LR D W + GD +E Q R + + R + Y Sbjct: 34 VETLEPFTTELPASTLR---MDRAW--RMANGDV---FHLEFQDRRERTLH-RFLEYDAR 84 Query: 102 VMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT 161 + ++ R ++ YH + P L D TA F Sbjct: 85 L-ANQVKTRIRT-------VVLYHAQVASAPQEL-------DIGTAIYRVENVFLSALDG 129 Query: 162 VVPDDEIVQHRRVALLELIQK-------HIRQRDLMGLIDQLVVLLVTECANDSQITALL 214 DE+ H RV E + +R D + +++ LL +D + + Sbjct: 130 DGALDEVEAHLRVGRWEPADRLRLGLALSMRVEDRHQAMARVLNLLPR-VPDDEERELVA 188 Query: 215 NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQN 270 + +L GD A +E +L + + + + +AE ++ DG G+Q+ I LL Sbjct: 189 SAVLAFGDRALSDEDRRKLRKEL----KNVFRMAEELYEDGRHDGKQQAAEDIAHRLLAE 244 Query: 271 GADPEWIQKITGLSAEQMQALRQPL 295 G + ++K TGL E+++ +++ + Sbjct: 245 GVPVDVVEKATGLPRERLEQMKREV 269 >UniRef50_Q1NK38 Putative uncharacterized protein n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NK38_9DELT Length = 115 Score = 61.5 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 22/48 (45%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL 57 HD +K +HP D + + +D D +L+ S SFV + L Sbjct: 11 HDNSYKLLFSHPRMVEDLLRGFVREDWISEVDFTTLETVSGSFVSDDL 58 >UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolbachia RepID=Q5GSR2_WOLTR Length = 317 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. 
Identities = 44/310 (14%), Positives = 103/310 (33%), Gaps = 35/310 (11%) Query: 11 DALFKTFLT---HPDTARDFMEIHLP-KDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 D +FK + F+ L ++ + +++ L + +++ D+ Sbjct: 12 DLIFKKIFGTEKNKKIIICFLNNILGFAEINAIQEVEFLSAIIDPEIASNKQSIIVDVF- 70 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR--QPLPLVIPMLFY 124 K G + IE Q + R+ Y++ R ++ L V + Sbjct: 71 -CKDATGTRRV---IEVQLAINKGFEKRVQPYAVKAYSRQLDKSGNYIVDLKKVFFIAIS 126 Query: 125 H----GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA----L 176 + + Y + D T L + F +++ ++ Q + Sbjct: 127 NCNLLSEKVDYISTHNIHDT---KTNGHYLKDFQFIFIELPKFSKSKVEQLINIVEHWCF 183 Query: 177 LELIQKHIRQRDLMGLIDQ-LVVLLVTECANDSQITA--LLNYILLTGDEARFNEFIS-E 232 + + DL + + L++ L + ++ ++ Y + + + Sbjct: 184 FFKNAEDTTETDLKRVAKKVLIIKLAYDGLDEFHWNEEDIIAYEERVMNLQKEKAILEYR 243 Query: 233 LTRRMPQHRERIMTI---------AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGL 283 L + RE + I AE+ +G K + + + L+ G I +I GL Sbjct: 244 LDLATEKGREEGVKISKERGIKVGAEKGREEGVKKAKIAVAKNSLKAGMSIGAIAEIIGL 303 Query: 284 SAEQMQALRQ 293 S +++ L + Sbjct: 304 SVGKIKKLHE 313 >UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255_PLEBO Length = 295 Score = 60.7 bits (146), Expect = 5e-08, Method: Composition-based stats. 
Identities = 58/308 (18%), Positives = 111/308 (36%), Gaps = 45/308 (14%) Query: 1 MTNFTTSTP-HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 MT ++ +D +KTF+ R+F+ P + D F+D++L+ Sbjct: 1 MTQQSSENTDYDNPWKTFIE--LYFREFLAFFFPT-IEADVDWSKPVR----FLDKELQK 53 Query: 60 ---------LHSDILWSV-KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH 109 ++D L V + R + IE QS+E+ R+ Y+ + R+ Sbjct: 54 IVRDAEIPKRYADKLVEVHRLRGERTLVICHIEVQSQEERDFVARMYSYNYRLRDRY--- 110 Query: 110 DKRQPLPLV-IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVV----P 164 P+V + +L G P + DE T FP+V ++ Sbjct: 111 ----NCPVVSLAIL---GDDRPNWRPSRFYDELWGCATH-----FEFPIVKLSDYQSQWT 158 Query: 165 DDEIVQHRRVALLELIQK----HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLT 220 + E +Q+ + K H + + L +L ++ I L N++ Sbjct: 159 ELEAIQNPFAVVAMAHLKTKETHNQPLERKRWRYHLTTMLYDRGYSEQDILELHNFLDWL 218 Query: 221 GDEARFNEFISELTRRMPQHRERI-MTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQK 279 + E +L + E M + ++ +Q I +L+ D E I + Sbjct: 219 MNLP--EELERQLQAELETFEEARRMKYVSSLERRAKLEEKQAIALNMLRRNLDMELIAE 276 Query: 280 ITGLSAEQ 287 +TGL+ + Sbjct: 277 VTGLTIAE 284 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 60.7 bits (146), Expect = 6e-08, Method: Composition-based stats. 
Identities = 49/297 (16%), Positives = 105/297 (35%), Gaps = 27/297 (9%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK F+ L + R++ D+ L E + + + I++ + Sbjct: 10 DVGFKAVFQDKQVTIKFLNAALAGE-RQIKDITYLDKE----IKPETVENRT-IIFDLLC 63 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR--QPLPLVIPMLFYHGS- 127 + G +++ E Q+ + R Y ++ R + K+ L + + F + Sbjct: 64 EDVSGAKFIL-EMQNCPQHYFFNRGFYYLCRMVARQGQIGKQWQYRLLPIYGVYFLNFKL 122 Query: 128 ------RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVP-----DDEIVQHRRVAL 176 R+ + + + +++Y +FPL ++ + I + + L Sbjct: 123 PEFTDFRTDVVLANERTGKVFNEIKMKQIYI-SFPLFSLSKEECKSSFERWIYTLKNMNL 181 Query: 177 LELIQKHIRQRDLMGLID--QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 E Q + L+D + L E A + T D A+ + Sbjct: 182 FEQSPFKEEQETFLRLLDVANVNSLSEKERAIYEENLKNYRDWYATIDYAQTEGIEKGMQ 241 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 ++ + E+ G + + +I R + + G D E I + +GLS E ++ L Sbjct: 242 E---GMQKGMQKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIERL 295 >UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZPJ4_9SPHI Length = 302 Score = 60.7 bits (146), Expect = 7e-08, Method: Composition-based stats. 
Identities = 48/325 (14%), Positives = 112/325 (34%), Gaps = 68/325 (20%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR--- 58 ++ S +D +FK + + + +L +++ S+ + +KL+ Sbjct: 14 KSYDMSNQYDKIFKENIG--EHFLSLSKTYL-----------GIEVASSEELKDKLQTTL 60 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 +D L + T +G+ I + +E QS ++ MA R+ Y + Q++ LP + Sbjct: 61 EREADFLRKITTPKGEQMI-IQLEFQSTDEQGMAERMQLYFAILRQKY-------KLP-I 111 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA-LL 177 + Y GS+ P + + F L+D+ V + ++ +L Sbjct: 112 RQFVIYVGSKPPKMRTR----------LKPEEVFTGFELLDLRQVSYTQWLESDIPEEVL 161 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 + +Q+ + ++ Q++ +V + + + + + L Sbjct: 162 LAVLGDFQQKKVSTVLKQIISKIVKLIDDPGTLQKYIRQLATFAR-------LRNLVIET 214 Query: 238 PQHRERIMTIAE------------RIHNDGYIKGEQRILRLLLQNG-------------A 272 Q E + + + +G KG Q + + G Sbjct: 215 EQTLEYMGLTYDIEKDVFYQRGVKKGQQEGIEKGHQEGIEKGITQGVVKMVIALLKSGKM 274 Query: 273 DPEWIQKITGLSAEQMQALRQPLPE 297 E + +I LS +Q + + + Sbjct: 275 PLEEVARIAELSVIDVQKMADQIKK 299 >UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1J9_CLOB8 Length = 278 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. 
Identities = 46/293 (15%), Positives = 99/293 (33%), Gaps = 31/293 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCD-LDSLKLESASFVDEKLRALHSDILWSVK 69 D +FK +D + L L+ D L+ ++L + + E + K Sbjct: 8 DFVFKLLFGDEKN-KDLIIELLNSILKMPHDELEDIELINTELLREFAEDRKGILDVRAK 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLVIPMLFYHG 126 T+ G+ ++ IE Q +MA R + Y + I+ + + I ++ ++ Sbjct: 67 TKSGE---HIDIEIQVLYTYYMAERTLFYWSKMYNGQIKSGYTYDKLKKCITINIVDFNC 123 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFP----LVDVTVVPDDEIVQHRRVALLELIQK 182 + + E + + L D +P DE V + +Q Sbjct: 124 IEINKLHTSFHITEDETNKKLTDVLEIHYLELPKLFD-NNIPKDE--SEPLVQWMMFLQ- 179 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 R ++ ++ + + +I N + + + + Sbjct: 180 -SRNKEAFEMLAE----------KNEKIKKAYNILEVISKDDNARAAYEAREAELHDQ-- 226 Query: 243 RIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 MT + +G + + + L G D E + K TGLS +++ ++ L Sbjct: 227 --MTRLKSAREEGIKEATIKNAKNFLVMGLDVEMVAKGTGLSVDEVLKIKGEL 277 >UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BFV9_9PORP Length = 293 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. 
Identities = 34/291 (11%), Positives = 85/291 (29%), Gaps = 17/291 (5%) Query: 11 DALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK + D + + + L + E + + ++ Sbjct: 10 DRGFKHLFGQEDSKELLVDLLNGLFEGERV----ITELSFLNVEMPAESTDSRAA--VFD 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--LPLVIPMLFYH 125 +K ++ +G I++ +E Q+ + R + Y ++ L V + + Sbjct: 64 LKCKDKEGRIFI-VEVQNAPQTYFYERGLYYLCRIISDQDRRGNDWKFELYPVYGIFLLN 122 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 + T + +++ +E + K++ Sbjct: 123 FKSGKTDKVRTDIVLADRETGKQMSDTMRQIYLEMPFFNKEEAECETSLDYWLYTLKYME 182 Query: 186 QRDLMGL-----IDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 + + + + + + L + + L + + + M Sbjct: 183 KLETLPFKGQKQLFEKLERLAKIVNMNKKERMEYEESLKIYRDNQGVLDYAIEKGYMEGV 242 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + E+ G KG + + G D I +TGL+AE + L Sbjct: 243 EKGLKEGIEKGLEKGMEKGIYLVAAKMKMQGIDFATITSVTGLNAETIATL 293 >UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A7T9_9CLOT Length = 271 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. 
Identities = 48/289 (16%), Positives = 105/289 (36%), Gaps = 34/289 (11%) Query: 11 DALFKTFLT---HPDTARDFMEIHL-PKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 D +FK +P F+ L PKDL ++ + + +++++K D+ Sbjct: 10 DFVFKNIFGSEKNPKILISFLNATLKPKDLITSVEIKN-TDINKNYIEDKF--SRLDV-- 64 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR---QPLPLVIPMLF 123 KT + + IE Q + + +M R + Y + + + + I +L Sbjct: 65 KAKTSNDEI---INIEIQLKNEYNMIKRSLYYWSKLYSEQLGEGQDYSVLKRTICINILN 121 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 + ++ S L E + A +++ + D + V +E ++ Sbjct: 122 FKYLKTRKFHSGYRLKEIY--SNEELTNVAEIHFIEIPKLDDGADEKDMLVNWIEFLK-- 177 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 D + + N +I + ++ ++ E + + R++ Sbjct: 178 ----------DPESETVRSLEMNIEEIRQAKDELIRMSNDDTQREIYEMRAKTL---RDK 224 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 I + G +G++ I + LL D E I TGLS +++ L+ Sbjct: 225 I-SALNEAERKGIQQGKREIAKALLDV-LDIETIALKTGLSIDEINKLK 271 >UniRef50_UPI000190BD13 hypothetical protein SentesTyph_06309 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190BD13 Length = 105 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 21/104 (20%) Query: 210 ITALLNYILLTGDEAR-FNEFISELTRRMPQHRERIM-TIAERIHNDGY----------- 256 + A+L YI+ G + F+ EL +P+++E IM TIA+++ +G Sbjct: 1 MEAVLCYIIYNGMTSESITPFLYELAGEIPEYKELIMGTIAQQLKEEGIQQGIQQSIQQE 60 Query: 257 --------IKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 K LL NG E + K TGL+ E ++ R Sbjct: 61 RQASLEREQKTLLETAYALLDNGVSLEVVIKSTGLNRETLEQPR 104 >UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CSV6_9CLOT Length = 317 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. 
Identities = 54/307 (17%), Positives = 101/307 (32%), Gaps = 32/307 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D LF+ D + D D+L++ + D ++ +D+ + V Sbjct: 10 DRLFRLVFGDRRRLLDLYNAL---NGSHYEDPDALEI--TTLDDAVYLSMKNDLSFLV-- 62 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PLPLVIPMLF 123 G +Y EHQS + +M R Y V ++++ K LP ++F Sbjct: 63 -NGVLNLY---EHQSTYNPNMPVRGFFYLADVYRKYVVEHKLNLYGSRLAKLPSPKYLVF 118 Query: 124 YHGSRSPYPWSLCWL-DEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ--HRRVALLELI 180 Y+G + + L D F A ++++ + + +++ + + Sbjct: 119 YNGRKEEPDRKILRLSDAFQGGRNAEPCLELCAVMLNINLGRNQVLMERCRTLKEYAQFV 178 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 + R G ++ V V +C D IL + E + + + Sbjct: 179 DRVRRMIAETGALESAVDCAVEDCIRDG--------ILENFLSSHRAEVLDVILTDYNEQ 230 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNG--ADPEWIQKITGLSAEQMQALRQPLPER 298 M E +G +G L L G E I + G E + LR + Sbjct: 231 EYIAMEREEAWE-EGRAEGLTEGLSEGLSEGLSVSREAILDLLGEFGEVPEELRARICAE 289 Query: 299 ERYSWLK 305 LK Sbjct: 290 SDKETLK 296 >UniRef50_A7BN25 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. SS RepID=A7BN25_9GAMM Length = 219 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. 
Identities = 34/174 (19%), Positives = 65/174 (37%), Gaps = 17/174 (9%) Query: 110 DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPT--TARKLYNAAFPLVDVTVVPDDE 167 K+ LP V P++ Y+G+++ + + + + + + + L+D D + Sbjct: 2 KKKIKLPPVCPVVIYNGNKA-WNAAQEISELIEEVPGGLEKYRPHLRYFLIDEAKFADAD 60 Query: 168 IVQHRRVALLELIQKHIRQRD----LMGLIDQLVVLLVTECAN------DSQITALLNYI 217 + + + ++ R D L I Q++ LLV + I L + Sbjct: 61 LAPLHNLVAAIIRLENTRSFDDEKALAEAISQVLNLLVDWLKDSEFIQLRRDIVTWLRRV 120 Query: 218 LLTGDEARFN--EFISELTRRMPQHRERIMTIAERIHNDGYIKGE-QRILRLLL 268 LL + E I EL RE + + G +G+ Q I + LL Sbjct: 121 LLPKNLPDVEIPEVI-ELQEMNAMLRENMQLWYQTAEKKGEARGKAQGIAQTLL 173 >UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cyanobacteria RepID=A8YL21_MICAE Length = 149 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 48/126 (38%), Gaps = 16/126 (12%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLR 58 MTN HD LFK ++ +F+E+ P ++ D +S+ + + Sbjct: 1 MTNNID---HDRLFKELIS--TFFVEFIELFFP-EVMNYLDTESITFLDKEVFTDVTEGE 54 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 SD++ V+ R + + + +E Q R+ Y ++ + + Sbjct: 55 RHKSDLVAQVRFRGKESFFLIYVEAQESSRKWFNRRMFTYFARFHEKFVL--------PI 106 Query: 119 IPMLFY 124 P++ + Sbjct: 107 YPIVIF 112 >UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E7F Length = 324 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. 
Identities = 46/292 (15%), Positives = 98/292 (33%), Gaps = 27/292 (9%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 DALF+ + + L + LE+A +++ K +D+ + + Sbjct: 28 DALFRMIFNDKEALLSLYNAVGNTSYTDASQLQIVTLENAVYMNIK-----NDLAFLLNM 82 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-----RHIEHDKRQPLPLVIPMLFYH 125 + EHQS + +M R + Y + + I LP ++F++ Sbjct: 83 ELN------LYEHQSTWNPNMPLRDLFYVSREYEMLLANQSIYSSSLLKLPAPRFVVFFN 136 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 GS + L + + ++++ +DE++ R+ L E R Sbjct: 137 GSYDMGEQCVLKLSDAYEKKVEDPDLELKVTVLNINAGWNDELMNTCRL-LKEYSLYVAR 195 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 R ++ + V+ ++ +L L+ + I E + +E + Sbjct: 196 VRAYAKEME--LAEAVSRAVDECIKEGILRDFLMKYRAEAISVSIFEYDE--EREKELLR 251 Query: 246 TIA-ERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT-----GLSAEQMQAL 291 E +G +G + L ++ G I G S E ++ Sbjct: 252 KTEYEFGRQEGLSQGREEGLSQGIKEGMAQGVSAMIRHCRKAGASREDTLSI 303 >UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G1H3_9SPHI Length = 294 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. 
Identities = 57/311 (18%), Positives = 111/311 (35%), Gaps = 52/311 (16%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH--------- 61 D L+K L DF+ L D + DL +F+D++L L Sbjct: 6 DYLWKGVLED--VFDDFLR-FLYPDADSVFDLSR----GITFLDKELEQLFPPEGNEFAP 58 Query: 62 --SDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 D L V T +G + ++ + +E Q A R+ Y ++ ++ + Sbjct: 59 KVVDKLAQVYTHDGMEEWVLIHVEVQGTCRKDFASRMFTYYYRILDKYHKRITA------ 112 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 +L S P + +EF + F + D ++ L Sbjct: 113 FAILT---EASKKPRPNVYEEEFMGTSI-----QYRFNTYKIAEQDTDRLLASDNPFALV 164 Query: 179 LI------------QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI--LLTGDEA 224 ++ K + L+ QL L+ + +I L+N++ + D + Sbjct: 165 VLTAKAAFVGKNLNDKDESDKALLQTKIQLARELLERNMSKEKIRGLMNFLRYYVRFDNS 224 Query: 225 RFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKI 280 N + ++ + R M I E + N +G++ + R + ++G E I K Sbjct: 225 EVNTIFEQEVEKLTE-RSHTMGIEELLLNRAKKEGKRESLISVAREMKKDGIPVEQIVKF 283 Query: 281 TGLSAEQMQAL 291 T LS ++++ L Sbjct: 284 TKLSIKEIEKL 294 >UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BTR0_9GAMM Length = 309 Score = 58.0 bits (139), Expect = 3e-07, Method: Composition-based stats. 
Identities = 54/330 (16%), Positives = 106/330 (32%), Gaps = 62/330 (18%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T D K L D +E L L+E D+ L++ + Sbjct: 1 MPTETKLVRFDWALKNILRDKANF-DVLEGFLTALLQE--DISVLEILESESNQSDFAKK 57 Query: 61 HS--DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE-HDKRQPLPL 117 + DIL + ++IE Q+ + R++ + ++ +E + + + Sbjct: 58 FNRVDILVKDSHQRK-----MIIEVQNHRETGYLERILWGTSKLIVETLELGEDYRNISK 112 Query: 118 VIPM---------------LFY-----HGSRSPYPWSLCWL--DEFADPTTARKLYNAAF 155 VI + ++Y HG + P+ L D+ + ++ F Sbjct: 113 VISISIVYFDLGLSDDNEYVYYGVANLHGLQHNQPFRFRRLMADKTFKSLQTKDIF-PEF 171 Query: 156 PLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMG-----LIDQLVVLLVTECANDSQI 210 L+ V D + + + KH R + + LL Sbjct: 172 YLLRVEHFQD---IIKTDLDEWIYMLKHSTIRTDFKSKNINKAQEKLTLLQMNPQKRKDY 228 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR-------- 262 + + + D + E I + +G +G Q+ Sbjct: 229 EKYMVDMTVERDVLEAAQ------------EEGIQKGRQEGIQEGRQEGIQKGMEKKTVV 276 Query: 263 ILRLLLQNGADPEWIQKITGLSAEQMQALR 292 I++ LQ G + I +TGLS E++Q ++ Sbjct: 277 IVKNALQQGLELTLISSLTGLSIEEIQKIQ 306 >UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Treponema vincentii ATCC 35580 RepID=C8PLW8_9SPIO Length = 264 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. 
Identities = 49/289 (16%), Positives = 96/289 (33%), Gaps = 45/289 (15%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F + H R F+E+ + ++ L S + + + + + + D+L VK Sbjct: 14 DFMFCKVMEHESLCRPFLEMLFSTQIEKITYLSSQNIITTN---SEAKTVRLDVL--VKD 68 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 G Y IE Q + ++ R+ Y + ++ ++ Sbjct: 69 DIGTSY---DIEMQVGNEYNIPKRMRYYQAVLDVAFLDKGYS--------------YKAL 111 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH---RRVALLELIQKHIRQR 187 + ++ F R +Y + D I+ H +++ L K + Sbjct: 112 NNSVIIFVCLFDPIGNDRAVYTFENI-----CIEDKTILLHDGTKKIILNAKAFKKTDNQ 166 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 +L G + + T I + NE +P +M Sbjct: 167 ELRGFLQYVTTGKATTAYTGR--------IEQMIQTVKQNELARREYHILPA---ALMDA 215 Query: 248 AERIHNDGYIKGEQ----RILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + G KG + + LL G E I + TGLS +++AL+ Sbjct: 216 MDEGEARGLAKGSRQKALETAKNLLHFGLSVENIAQATGLSQAEVEALK 264 >UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococcaceae RepID=A5D5U3_PELTS Length = 292 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. 
Identities = 41/219 (18%), Positives = 76/219 (34%), Gaps = 20/219 (9%) Query: 59 ALHSDILWSVKTREGDGYIY-VVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 SD L V+ DGY Y +++E Q+R D MA RL+ Y+ H H+K P+ Sbjct: 44 QRTSDALVKVR---EDGYEYLMLVEFQARPDRKMARRLLEYTAM---HHCRHEKPV-YPV 96 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 +I + G W + + N + +++ + E++ V LL Sbjct: 97 IINL---TGGSLQDGW-------YTFECLDLTVVNFNYRQINLQDIAGRELLYRGPVGLL 146 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 L ++D+ L +E + L + + I + Sbjct: 147 PLAPLMSHDEPPEKVLDKCARRLQSEVEAEDDRALLYLALAALASLKYPKDLILRVLEVS 206 Query: 238 PQHRERIMT-IAERIHNDGYIKGEQRI-LRLLLQNGADP 274 + I E G I+G+ + +++ D Sbjct: 207 RLENIPLFDGIREEWEAKGRIEGKNEGKIEGMVEMLFDL 245 >UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfitobacterium hafniense RepID=Q24MW9_DESHY Length = 295 Score = 56.9 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 53/298 (17%), Positives = 97/298 (32%), Gaps = 35/298 (11%) Query: 11 DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS--DIL 65 D LFK + D F+ L + +L + L E L+ S DIL Sbjct: 12 DYLFKYIFGRQENKDILLSFLNAVLSPAGED--ELTDITLSDRELDPEHLKDKMSRLDIL 69 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 DG + + IE Q + ++ R + Y + Q ++ + Y Sbjct: 70 GVAN----DGSL-INIEVQIASEKNIDKRTLYYWAKIYQSQLQSG-----------MLYK 113 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 ++ D +++ + + D + + + + R Sbjct: 114 DLARTVTVNVLNFSFLPDAQRYHSMFSL-YEAHSGLRLNRDLEIHFLELEKWKALSTKPR 172 Query: 186 QR---DLMGLIDQLVVLLVTECANDSQITALLNY--ILLTGDEARFNEFISE--LTRRMP 238 R LM L + L ++ I L I L D+ R+ + E + + Sbjct: 173 TRLDKWLMYLSNTDPKELEEIAMSEPAIGKALTVEEIFLKNDKERYLYEMREKGIRDHLS 232 Query: 239 QHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQALR 292 E+ G +G +R I +L+ G I +IT L EQ++ +R Sbjct: 233 AMDNAKTEGIEQGLAQGIAQGIERGKTEIALSMLKKGLSLNMIAEITDLPIEQIEEIR 290 >UniRef50_C0F0J0 Putative uncharacterized protein n=1 
Tax=Eubacterium hallii DSM 3353 RepID=C0F0J0_9FIRM Length = 316 Score = 56.9 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 53/324 (16%), Positives = 99/324 (30%), Gaps = 62/324 (19%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL--------HS 62 DAL K +L++ + D +L D ++ ++L S + L Sbjct: 5 DALTKEYLSNNEIFADVF-NYLIYDGQQRILPENLIERDTSEITLPLGKRGELATIQKFR 63 Query: 63 DILWSVKTREGDGYIYVVI--EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP------ 114 DIL +E +YV+ E+QS M R M Y + ++ Sbjct: 64 DILKGCIAKEYKNTLYVLFGVENQSHIHYAMPVRNMLYDAINYSAQVNEKTKKYRKIRKQ 123 Query: 115 --------------------LPLVIPMLFYHGSRSPYPW-SLCWLDEFADPTTARKLYNA 153 L VI + Y G+ SL + D + L + Sbjct: 124 NPNFKETTEEFLSGWHPDDRLVPVITVTIYFGNDGWDAAKSLQEMFSETDESLKEFLPDY 183 Query: 154 AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 L+ + + H L I K I + M ++ Sbjct: 184 KLHLISCNNISNFTKF-HTEFGRLMHILKVISDEEQMDILLSDPG--------------- 227 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERI-MTIA-----ERIHNDGYIKGEQRILRLL 267 Y L+ A+ + L +P+ + I M A E +G+ + + + Sbjct: 228 --YSALSVTAAQIINTFTGLHFSIPEKEDTINMRNAWTDHKESGRREGFNEATTSYTQRM 285 Query: 268 LQNGADPEWIQKITGLSAEQMQAL 291 + G E I ++ +++ + Sbjct: 286 YKAGIPLEVIAEVIEKPVTEVEKI 309 >UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YMI0_ANASP Length = 314 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. 
Identities = 58/329 (17%), Positives = 123/329 (37%), Gaps = 39/329 (11%) Query: 1 MTNFTTSTPHDALFKTFLT--HPDTARDFMEIHLPKDLRELCDLDSL-KLESASFV---- 53 MT+ D+ +K L P + F + L + + + + F Sbjct: 1 MTDNNERADFDSPWKEILEAYFPQAVQFFF-----PETAALINWERPYEFLNTEFQQIAR 55 Query: 54 DEKLRALHSDILWSV-KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR 112 + + ++D L V + + + ++ + +E Q++++ + R+ Y+ + R + Sbjct: 56 EAEQGKPYADQLVKVWQIQGEEIWLLIHVEIQAQKEDDFSKRMFTYNFRIFDRFEK---- 111 Query: 113 QPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDV-TVVPDDEIVQH 171 P + + +R P + + + L+D + E + Sbjct: 112 ---PAISLAILCDTNRQWRPSNYSYNYPQTRLNFEFGIV----KLLDYENRFDELENNTN 164 Query: 172 RRVALLELIQKHIRQR----DLMGLIDQLVVLLVTECANDSQITALLNYI--LLTGDEAR 225 ++ K + R + L+ L + I L +I ++ +A Sbjct: 165 PFATVVMAHLKTQQTRSSPQERKIWKFSLIRRLYDLGLQEQDIRNLYRFIDWVMILPKAL 224 Query: 226 FNEFISELTRRMPQHRERIMTIAERI-HNDGYIKGEQRILRLLLQN---GADPEWIQKIT 281 N+ SE+ + + R +T AERI + G +GE I+ LL+ PE Q+I Sbjct: 225 ENQLCSEVQQLEQERTMRYVTSAERIGYERGIQEGELGIILKLLKRRLGELSPEIQQRIQ 284 Query: 282 GLSAEQMQALRQPLPE----RERYSWLKS 306 LS Q++ L + L + + +WL+S Sbjct: 285 SLSVNQLENLSEALLDFSNLTDLVNWLQS 313 >UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXQ3_9FIRM Length = 290 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. 
Identities = 45/307 (14%), Positives = 114/307 (37%), Gaps = 29/307 (9%) Query: 1 MTNFTTSTP----HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEK 56 MT T D LF+ + + + + D+D +++ + D Sbjct: 1 MTKINTGNANREYKDRLFRFVFGAEENKAYLLSLCNAVSGTDYTDVDDIEI--TTLSDAI 58 Query: 57 LRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMR-----YSMAVMQRHIEHDK 111 + +DI + + ++ + EHQS + +M R M Y + +++ +++ Sbjct: 59 YIKMKNDISFLIDSQMN------LFEHQSTFNPNMPLRGMECFAELYGIYIIENNLDIYV 112 Query: 112 RQPLPLVIPM--LFYHG-SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI 168 ++ P + Y+G + P L D F P + + + ++++ + ++ Sbjct: 113 SSLQKILTPRYYVIYNGTEKQPDVVKLKLSDAFQVPDDSGE-FEWTATMLNINYGHNRKL 171 Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 ++ + L E R+ +L + + + ++ E Sbjct: 172 LEQCQP-LYEYAHFIKLVREYSE-AMELKKAIDKAVEKAREWKCIGTFLYQCKSEVSVM- 228 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 ++E + +H + ++ + E+ +G K + + +L PE I K +S + + Sbjct: 229 LLTEFDEK--KHEDNLIKLGEK---EGREKERMKNICSMLALSLSPEIIAKACEVSVDYV 283 Query: 289 QALRQPL 295 L++ L Sbjct: 284 LNLKKEL 290 >UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C3K1_9GAMM Length = 272 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. 
Identities = 47/290 (16%), Positives = 110/290 (37%), Gaps = 32/290 (11%) Query: 13 LFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESA-SFVDEKLRALHSDILWSVKTR 71 K + P F++ L ++ ++D ++ E + S + + + + + Sbjct: 4 FLKKVFSKPHIFTAFVKDMLGIEI----EIDKVETEKSFSPIIGNVDSRFD-----LFAQ 54 Query: 72 EGDGYIYVVIEHQSREDIHMAFRLMRY-SMAVMQRHIEHDKRQPLPLVIPMLF------Y 124 + + V I+H+ +D + R + Y +A++++ +P V ++ + Sbjct: 55 DTKNRLIVDIQHKRYKDHY--DRFLHYHCVALLEQITSSANYKPDMQVYTIVVLTSGDKH 112 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKL-YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 +S LD + T K+ Y + D T P E ++ +L + +++ Sbjct: 113 KTDLLITDFSPKKLDGSSIAETQHKIVYVCPKYVTDETPKPYQEWLKAINDSLDKQVEES 172 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR--RMPQHR 241 +++ I L+ +++ DE E++ E T+ R Sbjct: 173 HYHNEVIQEIFSLIKKDKISPEEYARMK----------DEYSDEEYLQEQTQKARKEGME 222 Query: 242 ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + + G KG + + + + E I ++TGLS EQ++ L Sbjct: 223 KGMEKGIGKGIEKGIEKGVLMMAKNMKEAKVAIETIIEVTGLSIEQIEDL 272 >UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacteria RepID=A8F2U7_RICM5 Length = 281 Score = 56.1 bits (134), Expect = 1e-06, Method: Composition-based stats. 
Identities = 41/287 (14%), Positives = 92/287 (32%), Gaps = 22/287 (7%) Query: 11 DALFKTFLTHP-DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D FK + + + DL + E E R+ L+ +K Sbjct: 10 DIAFKKLFSDKVKLINLLNSLLRLSKGDRIIDLSYITTEQLPLFLEGRRS-----LFDLK 64 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLVIPMLFYHGSR 128 ++ G Y+ IE Q + + R Y I+ K + L V+ + Sbjct: 65 VKDETGRWYI-IEMQRKMEKDYLNRTQLYGCYTYVSQIKKGMKHKDLLPVVIISIIRAKA 123 Query: 129 SPYPWSLCWLDEFADPTTAR-KLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 P + + L++ + +++ +++ L L++ +++ Sbjct: 124 LPDELPYISYHHIKESNIHKQYLFSLTYVFIELGKFKKNDLKDDTD-EWLYLLKYASQEQ 182 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 + I +VL S +L Y + + + + + + E+ Sbjct: 183 EPPKEIKNEIVL--------SAYASLEQYKWTEQEHDDYFRAEMAIQQEIDKFEEK---- 230 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 G K + + +L E I + T L+ E+++ L+ Sbjct: 231 FNAGMEKGIEKEKIETAKEMLIENGPIEQIARYTKLTIEEIKKLKAE 277 >UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0R0H3_BRAHW Length = 292 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. 
Identities = 39/288 (13%), Positives = 99/288 (34%), Gaps = 16/288 (5%) Query: 11 DALFKTFLTHP---DTARDFME-IHLPKDLRELCDLDSLKLES-ASFVDEKLRALHSDIL 65 D + + DF+ I L ++ ++ L + ++ + +D+ Sbjct: 14 DYFVRYLFSDKGSEAILLDFINSIMLDSGMKTFRSVEILTPFNYKENYED--KETITDV- 70 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLVIPMLFY 124 T+ G V+IE Q + + R++ Y + + ++ K L VI + Sbjct: 71 -KCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLKQGEKYDALTPVISINLL 126 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 + + S+ D R L + ++++ + + L K Sbjct: 127 NFNLDDND-SIHSCYMIYDTNNKRLLTDHLQIHIIELKKFKYNSLEYDLNCWLKFFTMKD 185 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 ++++ + + + + E + + +++ + R + R Sbjct: 186 KDNKEVI-MSELVKEKPIMEEVQRRYNNFIKDRLMMNEYDKRQAYLYGNQIMLEEERRLG 244 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + E +G + + + R + D I ++TGLS E+++ L Sbjct: 245 RVEGKEEGIKEGIEQEKYSLARNMKNKNMDLNLISELTGLSIEKIEKL 292 >UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q938_9SPIR Length = 326 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. 
Identities = 43/301 (14%), Positives = 95/301 (31%), Gaps = 44/301 (14%) Query: 11 DALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D + +H + A +F+ + +++ + + E S + Sbjct: 50 DYFVRYLFSHDGNENIALNFINAVFKD--LNFETFNKIEILNPFNISENYDEKESIVDIK 107 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 T G I V+IE QSR + R + Y + FY G Sbjct: 108 ATTETG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGS-----------FYDGL 153 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL----IQKH 183 + ++ + + + L ++ + + H ++ LEL ++ Sbjct: 154 KPTVSINITNFILTDEDKVH-----SCYVLKELN--NNKILTDHCQLHFLELPKFNLKDI 206 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILL---------TGDEARFNEFISELT 234 L + + + + D I N I D +++ Sbjct: 207 SAIESLDNIHKEFISWIKFFKGEDMSILMKENTIFEEVEKKCLTFVNDSPVIDKYKKREV 266 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQA 290 ++ + I ++ +G +G + + + + D I KITGLS ++++ Sbjct: 267 DTYFFNKSMELDI-KKAKEEGIKEGIKENQILTAKNMKKENIDINIISKITGLSIQEIEN 325 Query: 291 L 291 L Sbjct: 326 L 326 >UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BNN6_9LACT Length = 302 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. 
Identities = 58/312 (18%), Positives = 109/312 (34%), Gaps = 43/312 (13%) Query: 11 DALFKTFL---THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D LFK + DF+E L+ + + ++E+ E L + + Sbjct: 8 DLLFKKMMTTAGKEYILEDFIEAVTGMKLKNVRPANPYQIETYQKTIENLNPVMYSTIVD 67 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 V DG ++IE Q + R+ Y ++ + + + +I ++ Sbjct: 68 VAATTEDGME-IMIEMQLYQHKDFFERIFNYMATAYTQNYKAETAK---PIISIVV---- 119 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK---HI 184 + EF + L N A+ + + + + R+ L+ L K + Sbjct: 120 -----TNFTVFPEFQEARIEIGLTNFAY----YQEIRNRKQQPYWRIYLVNLTDKAIVNG 170 Query: 185 RQRDLMGLIDQLVVLLVTECAND--SQITALLNYILLTGDEARFNEFISELTRRM----- 237 RD D L + ++ + ++N+ L G+E R E + + Sbjct: 171 ESRDFSEWRDFLKNGTIKPKSSRGLKEAQKIVNFSNLAGEERRLAELMEKYEDVYYQVMK 230 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-------------ADPEWIQKITGLS 284 Q E + E G GE+R + + G E IQK TGLS Sbjct: 231 HQLEEGLEQGIEIGRQQGVALGEKRGMEKGVALGERKGQVMICFKMNLPIEEIQKHTGLS 290 Query: 285 AEQMQALRQPLP 296 E+++A R+ + Sbjct: 291 IEEIEAFRKEME 302 >UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CVG1_ORITI Length = 96 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 50/124 (40%), Gaps = 34/124 (27%) Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 +LE + KHI QRD++ L ++ ++ D Sbjct: 1 MLEYMLKHIHQRDMLKLWEEFLIKFKHGLILDK--------------------------- 33 Query: 236 RMPQHRERIMTIAERIHNDGYIKGE----QRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + TIA + ++G KG Q + R LL+ G E+I + TGLS E++ + Sbjct: 34 ---EKGNSMRTIAAKYIDEGIAKGRAEAAQELTRNLLKAGFLVEFISETTGLSKEEVVNV 90 Query: 292 RQPL 295 + + Sbjct: 91 KNNM 94 >UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2D99 Length = 308 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. 
Identities = 39/272 (14%), Positives = 90/272 (33%), Gaps = 27/272 (9%) Query: 7 STPHDALFKT-FLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH--SD 63 T HD FK L +P A F P + + + D + + +L D Sbjct: 1 PTSHDQNFKNLILDYPRQALQF---FAPDEAKNIDDSAVITPIRQEQLKNRLGDRFYELD 57 Query: 64 ILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 + V+ +G + ++E ++ RL+ Y + + + V+P++ Sbjct: 58 VPLKVEWPDGRHAAMLFLLEEETDPARFSIHRLVSYCANLAELMGTNR-------VVPIV 110 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI-- 180 + S + + + + +P ++ + + Sbjct: 111 IF------LRSSPDIRRDLHLGVDGVNFLSFHYIACVLPDIPAEQYKDSTNIVARIALPT 164 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITAL-LNYILLTGDEARFNEFISELTRRMPQ 239 + R++ + + L L E D +I L ++ F +R PQ Sbjct: 165 MHYAREQVIDVMAWALRGLDTLEANGDKRIKYLDFIDTYSQLEDNERQLF----KQRYPQ 220 Query: 240 HRERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 + + +I +R + G +G + ++ + G Sbjct: 221 EEKTVTSIVQRAIHQGIHQGIHQGIQEGMLMG 252 >UniRef50_C1J8G9 YdgA n=11 Tax=Enterobacteriaceae RepID=C1J8G9_ECOLX Length = 81 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 8/64 (12%) Query: 236 RMPQHRERIMTIAERIHN----DGYIKGEQRILR----LLLQNGADPEWIQKITGLSAEQ 287 + + R MTIAER+ +G+ KG + R L G PE IQ+ TGLS E+ Sbjct: 13 FVSRSRANSMTIAERLIQKGFDEGFKKGALEVAREAACRLRDMGWTPERIQEATGLSGEE 72 Query: 288 MQAL 291 ++ L Sbjct: 73 LKKL 76 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. 
Identities = 52/329 (15%), Positives = 110/329 (33%), Gaps = 51/329 (15%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDS----LKLESASFVDEKLR-ALHSDI 64 +D L+K ++ + +F+ P DL E D L+ E + + + +D Sbjct: 19 YDGLWKKIIS--ELFEEFILFFAP-DLYETIDFGKGIVFLEQELHKVIIKHKKGKRIADK 75 Query: 65 LWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-----------DKR 112 + V + G+ Y+++ IE Q ++D + R+ Y + R E+ Sbjct: 76 IVKVSLKNGEEKYVFIHIEIQEKQDPDFSKRMFTYFYRLFDRFQENIYSIAILTDLSKSN 135 Query: 113 QPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAA--------FPLVDVTVVP 164 P FY G+ Y ++ +E P+ + A L + Sbjct: 136 NSEPFQYS--FY-GTELTYRFNTYKFNEADIPSLKKSTNPFAIAVLAGIYLHLTEKNYQK 192 Query: 165 DDEIVQHRRVALLELIQKHIR-----QRDLMGLIDQLVVLLVTECAND--SQITALLNYI 217 E+ + + Q + + L L + + I N++ Sbjct: 193 RYEVKKKLLKEFILSNQNLSSNYAEALCYFIDYLLYLPGELTKQLTKELFIHIEKEANHM 252 Query: 218 LLTGDEARFNEFISELTRRMPQH-----RERIMTIAERIHNDGYIKGEQRI--------L 264 L + + F L + + I E+ +G G ++ Sbjct: 253 LYSEELKEAPTFAEYLKTVKEEGIEIGIEKGIEKGIEKGKEEGIEIGIEKGKMEEKRNLA 312 Query: 265 RLLLQNGADPEWIQKITGLSAEQMQALRQ 293 LL+ G E + K+ LS ++++ +++ Sbjct: 313 AELLREGFSVEKVAKMVKLSIDEVKKIKK 341 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. 
Identities = 50/305 (16%), Positives = 92/305 (30%), Gaps = 35/305 (11%) Query: 11 DALFKTFLT-HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVD--EKLRALHSDILWS 67 D FK HP + LP E +K V E + D+L Sbjct: 11 DLTFKKIFGNHPKRLISLLNALLPLSDEEQI--REIKYLPTELVPQLEGGKNTIVDVL-- 66 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-QRHIEHDKRQPLPLVIPM----- 121 + G + +E Q R++ + + + + K L V + Sbjct: 67 --CTDVRGRKFC-VEMQMEWSDAFQQRVLFNASKLYVSQAKKGGKYSELQPVYSLNLIND 123 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 +F H + D+ ++ F +++ I R + L Sbjct: 124 IFAHDTPDFIHNYRIVHDKDSNKVIE----GLHFTFIELPKFTPHSIADKRMMVLWLRFL 179 Query: 182 KHIRQR------DLMG--LIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 I DL+ I + V L +D+++ A + E + + Sbjct: 180 TEINSNTKDIPADLLNDPEIGKAVEELEISGFSDAELRAYDKFWDSVSVERTLIDDSYQK 239 Query: 234 TRRMPQHR---ERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAE 286 + + E + E+ G +G+ I + LL G E + K T L E Sbjct: 240 GKEKGKQEGLAEGMEKGMEKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQVSKATQLPLE 299 Query: 287 QMQAL 291 ++ L Sbjct: 300 IIKNL 304 >UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3131 Length = 247 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. 
Identities = 56/275 (20%), Positives = 99/275 (36%), Gaps = 43/275 (15%) Query: 1 MTNFTTSTPH-DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 M N T + + D +F+ D + D+ LE+A ++ K Sbjct: 1 MNNETVNRKYKDTVFRLLFKDKSNLLSLFNAVNDTDFSDENDIKITTLENAIYMTSK--- 57 Query: 60 LHSDI--LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-----DKR 112 +DI + +K + EHQS + +M +R + Y +R++ + K Sbjct: 58 --NDISCIIDMKLN--------LFEHQSTVNPNMPYRNLEYVTKCFKRYVGNFDVYTGKA 107 Query: 113 QPLPLVIPMLFYHGSRSPYPWSLCWL-------DEFADPTTARKLYNAAFPLVDVTVVPD 165 LP ++FY+G P + L DE + YN LV+ T++ Sbjct: 108 LTLPNPKFVVFYNGVNEQPPIRVMRLSDLYAHKDEIPNLELVVIQYNIN-NLVNCTLMDR 166 Query: 166 DEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR 225 E ++ + I+ +++ D +D + D I + LT + Sbjct: 167 CEPLKE-YSEFIGCIRSNLKTMDKGEAVDSAI---------DYCIGNGILKDFLTNNRNE 216 Query: 226 FNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE 260 S +H + I IA + DGY KGE Sbjct: 217 VRSM-SLFEFDAEEHEKAIKQIA---YEDGYDKGE 247 >UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria RepID=C0QZ87_BRAHW Length = 309 Score = 55.0 bits (131), Expect = 4e-06, Method: Composition-based stats. 
Identities = 47/267 (17%), Positives = 101/267 (37%), Gaps = 43/267 (16%) Query: 55 EKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQ 113 E L+ D+ KT++G ++IE Q + + R++ Y + ++ ++ Sbjct: 56 ENLKESILDV--KAKTKDGK---KILIEIQLIGNNNFIKRILYYIAKNISSELKENENYI 110 Query: 114 PLPLVIPMLFYH-----GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI 168 + +I + F + GS S ++ KL + +++ EI Sbjct: 111 NISQMISISFLNFNLKIGSESDIKREHKCFQLSDINNSSLKLDDFQIHFIEIKRFA--EI 168 Query: 169 VQHRRVALL---ELIQKHIR--QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDE 223 +++ + +L+ +DL I++L+ ND + Y DE Sbjct: 169 LKNASIDDYNKNKLLSWIDFFTAKDLEKSINKLIG------GNDIMSKVMDKYKRFVADE 222 Query: 224 AR------FNEFISELT-----RRMPQHRERIMTIAERIHNDGYIKGEQR--------IL 264 + F+ R +E I ++ +G +G ++ I Sbjct: 223 KEMSAYNERDTFLYGQAAMLQYEREEGKKEGIEIGIQQGIKEGIEQGIEQGEKNKALSIA 282 Query: 265 RLLLQNGADPEWIQKITGLSAEQMQAL 291 R L ++G D ++I + TGL+ E+++ L Sbjct: 283 RSLKKSGLDDKFISENTGLTIEEIEKL 309 >UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8YTL4_ANASP Length = 270 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. 
Identities = 42/283 (14%), Positives = 99/283 (34%), Gaps = 26/283 (9%) Query: 23 TARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA--LHSDILWSVKTREGDGYIYVV 80 + P EL + + F +++ D L+ K + Y+ Sbjct: 6 IFYSLFQEF-PHIFFELINQSPQEASIYEFTSREVKQLAFRLDGLFLPKINDSTKPFYI- 63 Query: 81 IEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDE 140 +E Q + D +RL ++++ + P P + ++ Y ++ + DE Sbjct: 64 VEVQFQPDDDFYYRLFAELFLYLKQY-----KPPYPWQV-VVIYPSRGIERQQTIHF-DE 116 Query: 141 FADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLM--GLIDQLVV 198 +++Y L ++ V + + V +++L+ + ++ LI Q Sbjct: 117 ILVLNRVKRIY-----LDELGEVAETSL----GVGVVKLVIETEETAPVLARQLIAQAKQ 167 Query: 199 LLVTECANDSQITALLNYILLTGDEARFNEFISELT----RRMPQHRERIMTIAERIHND 254 L A I + I+ + E + L ++ ++E + + + Sbjct: 168 QLTDVTAKRDLINLIETIIVYKLPQKSREEIEAMLGLNELKQSRVYQEALEEGKQEGKQE 227 Query: 255 GYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 G + + + ++Q G E I ++ L E +Q Q + Sbjct: 228 GKQEAKLETIPRMVQFGLSVEAIAQLLDLPLEVVQQAVQQFNQ 270 >UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVT6_PEDHD Length = 317 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. 
Identities = 44/312 (14%), Positives = 109/312 (34%), Gaps = 51/312 (16%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDS-LKLESASFV------DEKLRALHSD 63 D K DF+ + D E+ D + ++ + + K +D Sbjct: 26 DEFLKGAFED--NFPDFLR-FVFSDADEILDFNREIEFLNNELFTIIPDRERKGGGRRAD 82 Query: 64 ILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 +L + ++G + ++ + +E + D R+ Y+ + ++ V + Sbjct: 83 LLAKLYLKDGTEKWVLLNVEIEGGNDRKFGQRVFEYNYRIRDKYKVS--------VASIA 134 Query: 123 FYHGSRSPYPWSLCWLDEFADPTT-----ARKLYN----------AAFPLVDV------- 160 + G ++ +LDE A +++ F L+ + Sbjct: 135 VFTGKKTQL-RPTEYLDELLGTVLSFKYTAYHVFDHQEDELLKSDNPFSLIALACQKALL 193 Query: 161 -TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILL 219 +PD+E+ R + +++ + +H D +I ++ L +I + + Sbjct: 194 EGKIPDEELADER-LVIVKALLRHGY--DRQRIISFILFLKNFIFIESEEINRKFDQQIE 250 Query: 220 TGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQK 279 + + + ++ ++ + +I +G + I R L + G E+I K Sbjct: 251 ELTKDKNPMGVIDVFKKWERQEAKI-----EGKLEGRREEALEIARELKKEGLTIEFIAK 305 Query: 280 ITGLSAEQMQAL 291 T L +++ L Sbjct: 306 TTKLPIAEIEKL 317 >UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKW0_9CYAN Length = 296 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. 
Identities = 51/305 (16%), Positives = 108/305 (35%), Gaps = 41/305 (13%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D K L + + L + LR+ + S+ ++ E + DIL + Sbjct: 9 DWAIKKLLRNKAN-YGVLAGFLSELLRKPITIQSILEGESNQQAEDDKLNRVDIL--AEN 65 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV-----IPML--- 122 G+ ++IE Q+ + R++ + ++ +E +P V + ++ Sbjct: 66 DRGEL---ILIEVQNSTEQDYFHRMLYGTSRLITDFLEKG--EPYGNVKKVYSVNIVYFS 120 Query: 123 -------FYHGSRSPYPWSLCWLDEFADPTTARKLYN--------AAFPLVDVTVVPDDE 167 YHG+ L D+ RKL+N + ++ V E Sbjct: 121 LGQGDDYIYHGTLEFRG--LHLDDKLGLSINQRKLFNSQDVYEIFPEYYVIKVNNFN--E 176 Query: 168 IVQHRRVALLELIQKHIRQRDL-MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARF 226 + + ++K + + + + L+ + ++++ L ++ E Sbjct: 177 VASDTLDEWIYFLKKSQIKEEFTAQGLAEAKENLLVDSLSEAERANYLRFMENRRYEISL 236 Query: 227 NEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAE 286 E + R E + + G + + I RLL Q G D + I TGL+ E Sbjct: 237 IE-----SSRSEGRLEGLEEGLKEGMEQGKQQEKVNIARLLKQQGTDLDTITAATGLTRE 291 Query: 287 QMQAL 291 +++ L Sbjct: 292 EIEEL 296 >UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF92_9FIRM Length = 307 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. 
Identities = 58/307 (18%), Positives = 109/307 (35%), Gaps = 44/307 (14%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D L++ + + D + DL+ LE ++ K +D+ + V Sbjct: 21 DRLWRMIFNNKEDLLQLYNAINHTDYQNPDDLEVNTLEDVLYLSMK-----NDVSFLV-- 73 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-------IEHDKRQPLPLVIPMLF 123 G +Y EH S + +M R + Y + + + I H+KR LP ++F Sbjct: 74 -GGTMNLY---EHLSTFNPNMPLRGVFYFSRLYEGYVADNNLMIYHEKRVRLPKPKYIVF 129 Query: 124 YHGSRS-PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA------- 175 Y+G+++ P L D F + ++++ + E+++H R Sbjct: 130 YNGTKNQPDSMELRLSDCFENTDNDAPCLECTATMLNINYGHNQELMKHCRRLEEYSIFV 189 Query: 176 --LLELIQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE 232 + E IQ D L ID + V + N IL T D+ + + + E Sbjct: 190 QCVREYIQSEPSVEDALEKAIDTCINQDVLADFLKKHRAEVTNMILTTYDKDLYEKTLKE 249 Query: 233 LTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLS--AEQMQA 290 R E +G ++G L Q ++ L A+ ++ Sbjct: 250 DAR-------------EEGREEGLMEGRAETRAELNQLTICLLNAKRYNDLEHAAKDIEY 296 Query: 291 LRQPLPE 297 ++ L E Sbjct: 297 QKKLLKE 303 >UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAJ2_9SPIR Length = 312 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. 
Identities = 41/311 (13%), Positives = 99/311 (31%), Gaps = 37/311 (11%) Query: 11 DALFKTFLTHPD---TARDFMEI-HLPKDLRELCDLDSLKLESASFVDEKLRALHSD--- 63 D + + D DF+ L +++ ++ L + + + D Sbjct: 9 DYFVRYLFSSKDSNFILLDFINSTMLDANMKTFRSVEILTPSPKAGSRLNYKENYDDKES 68 Query: 64 ---------------ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE 108 + T+ G V+IE Q + + R++ Y + + ++ Sbjct: 69 IAPKVARKVDRCRRRLDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLK 125 Query: 109 HD-KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDD 166 K L VI + + + D + R L + ++++ D+ Sbjct: 126 QGEKYDALTPVISINLLN-FNLDNNDCIHSCYMIYDTKSKRLLTDHLQIHIIEIKKFKDN 184 Query: 167 EIVQHR--RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEA 224 + + + + +K R+ + L+ + + E + + +++ + Sbjct: 185 LLDKDLDCWLKFFTIKEKDNREVIMSELVKEKP---IMEEVQKRYNNFIKDRLMMNEYDK 241 Query: 225 RFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKI 280 R + R I ++ G KG + + + D I I Sbjct: 242 REAYLYGNQIMLEEERRLGIEEGFKKGIEKGIEKGIKENQILTAKNMKNKNIDIALISDI 301 Query: 281 TGLSAEQMQAL 291 TGLS ++++ L Sbjct: 302 TGLSIKEIEEL 312 >UniRef50_A7BPH0 Putative uncharacterized protein n=5 Tax=Beggiatoa RepID=A7BPH0_9GAMM Length = 289 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. 
Identities = 50/300 (16%), Positives = 101/300 (33%), Gaps = 41/300 (13%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHL--PKDLRELCDLDSLKLESASFVDEKLRAL 60 +D +FK +HP ++ L ++ E+ S V E Sbjct: 20 KQVAPLRYDVIFKKAFSHPTIFTALVKDFLGIQLEIDEVKYNKGFVPSVNSLVSE----- 74 Query: 61 HSDILWSVKTREGDGYIYVVIEHQ--SREDIHMAFRLMRYSMAVM-QRHIEHDKRQPLPL 117 + + + + V ++H SR D R + Y + M + I + P+ Sbjct: 75 -----FDLFVEDKKNQLIVEMKHAYCSRSDYE---RFVYYQCSSMVEAVINSNSDYDFPM 126 Query: 118 -VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 +I ++F+ ++P P S + +F A L+D +I Q + + Sbjct: 127 TIITIVFFTWKKTPSPDSSIIVHDFESRDLATG------QLLD-------KIYQRKHQLI 173 Query: 177 LELIQKHIRQRDL---MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 + + + L E + L+ + ++ + Sbjct: 174 FVFTNDSTHENTPSTYREWMQAIDDSLDGEVDEEKYTNPLIQELFGVIEKDKITPEERAC 233 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGE-QRILRLLLQNG-ADPEWIQKITGLSAEQMQAL 291 + E + + NDG +G+ ++ R L N + I + TGLS E ++AL Sbjct: 234 MKDQYSQEEACI----KAFNDGMKQGQSKKTARNLKANSKLTEKEIARATGLSLEMVKAL 289 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 53.4 bits (127), Expect = 9e-06, Method: Composition-based stats. 
Identities = 31/246 (12%), Positives = 79/246 (32%), Gaps = 31/246 (12%) Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 +DI++ ++ + +E Q+ R + Y +++R V Sbjct: 53 ETRNDIIFLLEDDT-----LLHLEFQTTAGEQDLKRFLYYDARLVRRQERK--------V 99 Query: 119 IPMLFYHG---SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 ++ Y G L + + + + + + + +++ Sbjct: 100 HTIVIYSGRIEQARERLECGSILYQVENIYMKHYNGDQEYNRLK-HKIDNHQLLSETDTL 158 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 L + ++ L Q L ++ + +++ D+ +L Sbjct: 159 KLIFLPLMKSEQKEEELAIQ-AAELAKAAPDEKTKLFAIAALIVITDKIMSESNKRKLLE 217 Query: 236 RMPQHRERIMTIAERIHNDGYIKGE--------QRILRLLLQNGADPEWIQKITGLSAEQ 287 + ++ I + I +G +GE + + +L G PE I K T L E+ Sbjct: 218 VL-----KMTQIEQWIREEGRQEGELKGRRDEKRETAQTMLNLGMSPELIAKATKLPLEE 272 Query: 288 MQALRQ 293 + + + Sbjct: 273 ILEMAK 278 >UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methanospirillum hungatei JF-1 RepID=Q2FTW8_METHJ Length = 306 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. 
Identities = 57/318 (17%), Positives = 90/318 (28%), Gaps = 56/318 (17%) Query: 3 NFTTSTPHDALFKTFLTHP---DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 F S +D F+ P D D + LP + + D L + + Sbjct: 17 EFLMSPRNDFAFRLLFGDPNNSDILLDLLNAILPDHFQSVVCTDPHLLIPDTK-----KE 71 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 DI V + G +YV IE Q + M R + Y + + Sbjct: 72 CILDI--KVLSDSG---VYVDIEMQVLDLKSMEKRSLFYWAKMYLDQLNRGHS------- 119 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 YH + + LD P F D T D ++ Sbjct: 120 ----YHELKRTIV--INILDYMLMPVEDLHT---CFQAYDKTH---DILMSDVFEIHFLE 167 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL---------NYILLTGDEARFNEFI 230 + K R R D L L + +I L + Sbjct: 168 LPKVHRCRVPYKGTDLLSWLTFLNAYTEEEIIMAAEGKPAIQKAYNNLQIMSLDEETRRL 227 Query: 231 SELTRRMPQHRERIMTIAERIHNDGYIKGEQR------------ILRLLLQNGADPEWIQ 278 E + M A +G +G ++ ++ LL G D E+I+ Sbjct: 228 YEAREMFLHDQATRMYEA---KEEGLEEGMKKGREEGREEEREGFVKNLLSLGMDDEFIK 284 Query: 279 KITGLSAEQMQALRQPLP 296 K TGL + L++ L Sbjct: 285 KATGLDQSIIDKLKKSLS 302 >UniRef50_B7CCB3 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CCB3_9FIRM Length = 291 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. 
Identities = 31/222 (13%), Positives = 83/222 (37%), Gaps = 26/222 (11%) Query: 70 TREGDGYIYVV-IEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGS 127 ++G+ + + +E+Q+ E+ +M R+ Y A +R + + P+ V+ ++ + G Sbjct: 65 WQDGNALVAICGLENQTVEEKYMPLRVFSYDGASYRRQLLSENDENPIVPVVSLVLHFGM 124 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDD---EIVQHRRVALLELIQK 182 + S L D + Y + + ++ + D+ R+ +QK Sbjct: 125 KKWS--SPHNLKGVIDIPKELEPYVNDYKANIFNIAFLDDETVQMFQSDFRIVADFFVQK 182 Query: 183 -----HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 ++ + + +D+++ LL +D ++ + + R + Sbjct: 183 RKNKDYVPDKHKIKHVDEMLKLLQVLTGDDRYNVK-----FSETEKKEDIKMCDVMERAV 237 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQK 279 + +E + + I + ++ L + G E I + Sbjct: 238 NKGKEEV-------REEERINSIKVLISSLEEFGISSEAIIE 272 >UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostridiales RepID=A6BF26_9FIRM Length = 366 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 96/269 (35%), Gaps = 18/269 (6%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F+ + ++ + L + LE+A ++ K +D+ + Sbjct: 58 DTIFRMLYHDKENLLSLYNAVNGREYTDPEKLQVVTLENAIYMGMK-----NDLAF---I 109 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-----IEHDKRQPLPLVIPMLFYH 125 + + Y+Y EHQS + ++ R + Y QR + Q +P ++FY+ Sbjct: 110 MDMNLYLY---EHQSTYNPNIPLRNLFYIADEYQRLVVRKSLYSTVIQKIPTPRFLVFYN 166 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G++ S L + T +++V ++++H R L E Q R Sbjct: 167 GTKEVEDRSEFRLSSAYENPTENPDLELRVTMLNVNDGHSSDLMEHCR-TLKEYAQYVAR 225 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 R D + VT ++ +L LL I E + + + R Sbjct: 226 VRKYAAKQDVSLEEAVTRAVDECIEEGILAEFLLKNKTEVIRVSIYEYDKEFEEKKLRKA 285 Query: 246 TIAERIHNDGYIKGEQRILRLLLQNGADP 274 E DG G Q + + Q+G + Sbjct: 286 EY-EAGRQDGIEIGRQDGIEIGRQDGIEI 313 >UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Roseburia inulinivorans DSM 16841 RepID=C0G0A4_9FIRM Length = 
319 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 51/125 (40%), Gaps = 16/125 (12%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D +F+ + + + DL+ + LE+A ++ K +D+ + Sbjct: 55 NYKDTVFRMLFSDRKNLLSLYNAVNQSNYKNPEDLEIVTLENAIYMGIK-----NDLAF- 108 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-----KRQPLPLVIPML 122 + + Y+Y EHQS + +M R + Y + Q+ ++ Q +P + Sbjct: 109 --IMDTNLYLY---EHQSTYNPNMPLRDLFYICSEYQKLVDKKSLFSSTLQKIPAPNFIE 163 Query: 123 FYHGS 127 FY+GS Sbjct: 164 FYNGS 168 >UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQ96_CLOCL Length = 288 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 48/299 (16%), Positives = 94/299 (31%), Gaps = 27/299 (9%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV-DEKLRALH 61 NF S D +FK +D + L + +L + + E R Sbjct: 9 NFIMSPKIDFVFKLLFGDEKN-KDLLIAFL----SAVLNLPEREFVGIEILNTELFREFK 63 Query: 62 SD----ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE-HDKRQPLP 116 D + VKT G + IE Q M R + Y + ++ D L Sbjct: 64 EDKKGILDVRVKTVNGKQ---IDIEIQVLPTEFMPERTLFYWSKMYTTQVKPGDTYDKLK 120 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 I + P D T + +++ + D +I + + Sbjct: 121 KCITINIVDFKCIPLNKLHTSYHLIEDETGHKLTDILEVHFLEIPKLFDKQIEINEDDPI 180 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 ++ + +D ++ A ++ +L + I E R Sbjct: 181 IQWM----------EFLDGKSKGVMEMLAEKNESIKKAYNLLKIISKDEKARMIYE--AR 228 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 + R+++ I G + R+ +++ G I ++T LS E++ L+ L Sbjct: 229 EAELRDQLTRIR-SAEEKGANEKALRVAEKMIKRGDSINDIIELTELSKEKILELKNKL 286 >UniRef50_A7N2B6 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N2B6_VIBHB Length = 86 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. 
Identities = 16/50 (32%), Positives = 30/50 (60%) Query: 210 ITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG 259 +L+ Y+L G+ + + + L R++P+H ER MT+AE++ G +G Sbjct: 8 YDSLVEYLLRVGETSNLEDLMRTLARQVPEHEERFMTVAEQLEARGREQG 57 >UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G7H9_ABIDE Length = 305 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 35/267 (13%), Positives = 92/267 (34%), Gaps = 29/267 (10%) Query: 54 DEKLRALHSDILWSVKTREGDGYIYVV-IEHQSREDIHMAFRLMRYSMAVMQRHIEHDK- 111 D KL D+ S +EG+ + VV IE+Q++ + M R++ Y A + + Sbjct: 51 DGKLHEQERDV--SKYWKEGNTNLLVVGIENQTKAEKLMPARIIGYDGASYRSQLLKSTG 108 Query: 112 ---RQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI 168 + L V+ ++ Y G + + + ++ +P++++ Sbjct: 109 RLPKNKLTPVVTIVLYFGLTRWNQPKNLKGILDIPTGLEDFVSDYKINVFEIAFLPEEKV 168 Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 ++ + L+ K+ I + L + + A+L ++ + E Sbjct: 169 --NKFKSDFRLVAKY------FTNIRKNPYYLPADENEIKHVDAVLKFLSIMSGSEDIIE 220 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLL------------QNGADPEW 276 ++ + + +++ G +G + L + G E Sbjct: 221 KLT--ANNGSEVKNMTGGPLSQLYYKGVSEGREEGLLQGINETLLKVYLNCRSKGMSVEE 278 Query: 277 IQKITGLSAEQMQALRQPLPERERYSW 303 ++I + + + + +R++ Sbjct: 279 SEEIVHFADRESLDMAEEEYQRQKLGK 305 >UniRef50_B2JB68 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2JB68_NOSP7 Length = 192 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. 
Identities = 41/219 (18%), Positives = 77/219 (35%), Gaps = 37/219 (16%) Query: 91 MAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL 150 M FR+ Y + V +R + Q ++ Y LD+ + Sbjct: 1 MPFRMADYRLRVYRRFPKKRMHQ-------VVIY-------------LDKTESEKVYQTT 40 Query: 151 YNA-----AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECA 205 + A F ++ + P + + + ++ + + + Q + Sbjct: 41 FIAGSLQHEFSVIRLWEQPPEVFLTAPGLLPFAVL---SATENKVATLQQSSAAVDKIAD 97 Query: 206 NDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG----EQ 261 +Q +L G E I L R+ RE + I ++I +G +G + Sbjct: 98 RRTQSNIAAASAILAG-LVLEQEVIRRLFRK-DIMRESV--IYQQIKTEGEEEGSDKKAR 153 Query: 262 RILRLLLQNGADPEWIQKITGLSAEQMQALRQ-PLPERE 299 +I LL G + I + TGLS E +Q L+Q +E Sbjct: 154 KIAINLLAEGISVDVIARSTGLSIEVVQQLQQGEFENQE 192 >UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C371D2 Length = 317 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 102/295 (34%), Gaps = 40/295 (13%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV----DEKLR----ALHS 62 DA+ K ++ + D L R++ + LK + + ++ R + Sbjct: 5 DAVTKDYMQDSEHFADAF-NFLLYGGRQVIKPEQLKPLDTTSIALPYGDESRFVPIQKYR 63 Query: 63 DILWSVKTREGDGYIYVV--IEHQSREDIHMAFRLMRYSMAVMQRH-----IEHDKRQPL 115 D+L V E + Y++ IE+QS M R M Y EH K + + Sbjct: 64 DVLKMVTAMEDENATYLILGIENQSDIHYAMPIRNMLYDAIQYVNQADTIAKEHRKSKKM 123 Query: 116 P-----------------LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLV 158 P +I + Y G+ A+ + + N L+ Sbjct: 124 PETRAEYLSGFYKTDRILPIITLTLYFGADEWDAPRDLHSMLTANEDILKFVDNYHLHLI 183 Query: 159 DVTVVPDDEIVQ-HRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 + D++ + H ++ L K+++ + +V + + ++N + Sbjct: 184 APAEIEDEDFAKFHTELS---LALKYVKYSKDKKKLRDIVNEDTAFRSVSRKTADMVNVV 240 Query: 218 LLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 + E ++ + + R+ +AE +G +G R L L+++G Sbjct: 241 TSSNLHYNDGEERVDMCEAIEEIRKDA--LAE-GKAEGIEEGIIRTLIGLVKDGI 292 >UniRef50_UPI0001C369BC hypothetical protein 
ChatD1_02491 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369BC Length = 310 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 41/292 (14%), Positives = 86/292 (29%), Gaps = 48/292 (16%) Query: 11 DALFKTFLTHPDTARDFM-------EIHLPKDLRELCDLDS-LKLESASFVDEKLRALHS 62 D K L P D + L +L +S + + + S + ++ Sbjct: 5 DFYIKKLLQDPARFADLYNAEIFHGKQILKAELLSPVSTESGIAITNRSGRKQTIQRRR- 63 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRY--------SMAVMQRHIEHD---- 110 DI G +I E Q M R + Y + + H + Sbjct: 64 DIAMKASI--GACFIVAGCEAQGEIHYGMPIRSLTYDALDYTEQLTEIQKEHRKKKDLAK 121 Query: 111 ---------KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFAD-------PTTARKLYNAA 154 +R L V+ ++ Y G P+ D P L + Sbjct: 122 SPEFLSGITRRDKLQPVLTLVLYCGKD-PWDGPKSLYDMLDLRGPTECIPDLLAALPDYR 180 Query: 155 FPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL 214 LVD+ + + + + + +++ + I + +D+ +TA++ Sbjct: 181 INLVDIRKIENLSLYKTGLQQVFGMLKYSTDKSKFYNYITSNHDQI--SMLDDNALTAVM 238 Query: 215 NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRL 266 G + L + + + + DG ++G++ R Sbjct: 239 ------GLLGENRRLMKYLAAPGREEGYTMCQAIDDLIADGKLEGKREGKRR 284 >UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7AK04_9PORP Length = 299 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. 
Identities = 45/313 (14%), Positives = 109/313 (34%), Gaps = 59/313 (18%) Query: 11 DALFKTFL---THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK F ++ + F+ L KD++++ +++++ + K ++ Sbjct: 12 DYAFKRFFGTVSNKELTIGFLNSLLNKDIKDII-FHNVEMQGNNTDSRKA-------VFD 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ----------RHIEHDKRQPLPL 117 + DG +++ +E Q + + + R++ Y+ V+Q R + ++R+ Sbjct: 64 LFCEGSDGELFI-VEIQKKRQKYFSDRVLYYASFVIQMQADIESEKFRLAKEEERRRWNY 122 Query: 118 VIPMLF------------YHGSRSPYPWSLCWLDE-----FADPTTA--RKLYNAAFPLV 158 I ++ Y Y W + +D F++ +L Sbjct: 123 HINKVYVVCFLDFRLDTRY---TDKYRWDVVRMDRELKIPFSETLNEIYLELPKFNLNFE 179 Query: 159 DVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYIL 218 + + + ++ + K Q D + + + L A + + Sbjct: 180 ECDTFYKKFLYTMNNIDIMGQLSKETIQNDKLLRKLKSAIELQRMSAKER--------LA 231 Query: 219 LTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQ 278 A + + + + E + G +G ++I+ + Q G D I Sbjct: 232 YELSIAAERDLAACMATSFEEGEE-------KGIAKGITEGMRKIILNMKQAGMDLATIA 284 Query: 279 KITGLSAEQMQAL 291 K GL ++++AL Sbjct: 285 KTAGLPEKEVEAL 297 >UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptospira interrogans RepID=Q8F560_LEPIN Length = 278 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. 
Identities = 44/289 (15%), Positives = 96/289 (33%), Gaps = 31/289 (10%) Query: 14 FKTFL-THPDTARDFMEIHLPKDLRELCDLDSLKLESASFV--DEKLRALHSDILWSVKT 70 FK PD + L D ++K+ + V + + DI + Sbjct: 3 FKILFVKEPDLLISILNSVLFTDGEHTI--RNIKILNPELVGSSPNDKRSYLDI----RA 56 Query: 71 REGDGYIY---VVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLV--IPMLFY 124 ++ DG I+ + + HQS R + Y +++ + L V I ++ + Sbjct: 57 QDEDGKIFHVEIQVAHQSSFVK----RSLYYLSGLIRDQLNRGSMYSDLKPVYQINIVDF 112 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRVALLELIQKH 183 S S E ++P + +++ ++ + + + + KH Sbjct: 113 DLIPSENFHSKFKFREESNPDIIL-TDDVEIHFLELCKFVKRDVRELRNNLEIWLYVLKH 171 Query: 184 IRQ---RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 + ++ L+D+ L + L ++ +L R Sbjct: 172 TSELEEEEMRILVDKTPDLSKAFTILEQYSNDPQKRNELEAKLKSDRDYAYDLAARFEAG 231 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 + G K + + R +L+ G + I +ITGLS + ++ Sbjct: 232 EL-------QGIEKGAEKEKLKSARKMLEEGMRLDVILRITGLSKKDLK 273 >UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZGR2_EUBR3 Length = 370 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. 
Identities = 41/235 (17%), Positives = 78/235 (33%), Gaps = 27/235 (11%) Query: 80 VIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ--------PLPLV-IP----MLFYHG 126 + EHQS +M R + Y ++ + K+ LV IP ++FY+G Sbjct: 134 IYEHQSTVCPNMPVRSLIYFSVILSDMLSDKKKGTKSGKNIYGRRLVKIPTPHFVVFYNG 193 Query: 127 SRS-PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-----HRRVALLELI 180 P L D F PT L + ++ + I++ + + + + Sbjct: 194 EEEQPEVQELKLSDAFEKPTDEPNLELKC-KVYNINDGKNKAIMESCGWLNDYMTFVNKV 252 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 +++ L + + ND L Y + N + Sbjct: 253 REYHADGAFDDLAIDIEKAIDYCIDNDILKEFLKTYRSEVTKSMQLNYEFDRQLEL--ER 310 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGLSAEQMQALRQP 294 + I E G KG ++L L+ G D + + G+S + + L + Sbjct: 311 ADAI----EEGMEIGIEKGANKMLFTLVTKGKLDIDTAAEEAGVSVSEFEKLMRE 361 >UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P8S5_9BACT Length = 303 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 99/296 (33%), Gaps = 19/296 (6%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK +D + L + + + V + + ++ V Sbjct: 16 DFGFKRIFGT-AMNKDLLICFLNSLFNGRQVVKDVSYLNPEHVGDVYTDRRA--IFDVYC 72 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL--VIPMLFYH--- 125 +G ++ +E Q+ + R + YS ++ L + + + Sbjct: 73 EGENGEKFI-VEMQNAYQTYFKDRALFYSTFPIREQAPKGNEWDFKLNNIYTVALLNFNM 131 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNA-AFPLVDVTVVPD--DEI--VQHRRVALLELI 180 + + + D T + Y+ + V+++ +E+ + + + L+ + Sbjct: 132 NEDAFDKEKIRHHVQLCDTATHKVFYDKLEYIYVEISKFNKTLEELDTLYEKWLYALKNL 191 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 K ++ L D++ L E + + + + + + + Sbjct: 192 YKLTQRPK--ELCDKVFDRLFEEAEIAKFTPQEMR--EYETSKMAYRDIKNSVDTAKREG 247 Query: 241 -RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 E I E+ +G I R +L G D I +TGL++E+++ L+ + Sbjct: 248 IAEGIEIGMEKGRAEGMNLRSLEIARKMLAKGMDEASIMDMTGLTSEEIKLLKAEI 303 >UniRef50_A8SDU3 
Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDU3_9FIRM Length = 295 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 86/250 (34%), Gaps = 33/250 (13%) Query: 54 DEKLRALHSDILWSVKTREGDGYIYVV-IEHQSREDIHMAFRLMRYSMAVMQRHI--EHD 110 D +L + D+ + + G+ + + E+Q+ D M R++ Y A + + ++D Sbjct: 50 DGRLHEIERDVA--KRWKNGNIRVACIGFENQTASDPDMPLRVIGYDGAEYRAQLLGDND 107 Query: 111 KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADP-TTARKLYNAAFPLVDVTVVPDDEIV 169 P V ++ Y G P+ L + P + + L Sbjct: 108 TGSRYPAV-TLVLYFGHEKPWSGPLSLKERLNVPKEFEPYVNDYKINLF----------- 155 Query: 170 QHRRVALLELIQKHIRQRDLMGLIDQLVVL-----LVTECANDSQITALLNYILLTGDEA 224 ++A L Q + Q D + D V V + + + L + + ++ Sbjct: 156 ---QIAYLTREQVELFQSDFKVVADYFVQKRENGDYVPSSQDLTHVQETLQLLSIMTNDH 212 Query: 225 RFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRL-------LLQNGADPEWI 277 RF + + T + + +++ N G KG + L++ D I Sbjct: 213 RFEDAYNTSTDDRKGGPRNMCDVLDKVENRGIEKGIVKGESRGENKMALLVKKLLDQNRI 272 Query: 278 QKITGLSAEQ 287 + S ++ Sbjct: 273 DDVKRASEDE 282 >UniRef50_D1PGQ2 Transposase, ISNCY family n=2 Tax=Prevotella copri DSM 18205 RepID=D1PGQ2_9BACT Length = 118 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 16/43 (37%), Positives = 21/43 (48%) Query: 250 RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 G K I R +L NG D + +ITGLS Q+Q L+ Sbjct: 75 EGMEKGMNKRSLEIARKMLANGMDAATVMEITGLSESQLQQLK 117 >UniRef50_C9LBM4 Putative uncharacterized protein n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LBM4_RUMHA Length = 247 Score = 50.7 bits (120), Expect = 7e-05, Method: Composition-based stats. 
Identities = 18/74 (24%), Positives = 37/74 (50%), Gaps = 7/74 (9%) Query: 226 FNEFISELTRRMPQHRERIMTIAERI---HNDGYIKG----EQRILRLLLQNGADPEWIQ 278 F +F+ + + ER MT+ E + +G +G ++RI+ +L G E I Sbjct: 171 FLKFVKADLKESREMEERFMTLEEMLKDERKEGLKEGTVKAQKRIVSKMLSKGLSDEEIM 230 Query: 279 KITGLSAEQMQALR 292 ++ +S E+++ L+ Sbjct: 231 ELCDISLEELENLK 244 >UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B1D1_RUMGN Length = 323 Score = 50.3 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 50/296 (16%), Positives = 100/296 (33%), Gaps = 37/296 (12%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D F+ + + R F+ L E+ D ++L + + + V+ Sbjct: 53 DFCFQELMEDEEVRRGFIGAFLRIPPEEILD---MELLPKKLRKKYKEEKYGILDVRVRL 109 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPM------LF 123 REG+ + IE QS + R + Y + I E + L I + LF Sbjct: 110 REGEQ---LNIEMQSIAYDYWQERSLFYLGKMYVDQIHEGEDYDKLKKCIHVGILDFTLF 166 Query: 124 YHGSRSPYPWSLCWLDEFADP-TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 H Y W D D + +++ P + P E+++ + Sbjct: 167 EH--ERYYSCFHIWEDTIRDMYSDKFEIHVLELPKLAKYEYPQTELLRWAQFFG------ 218 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 R R+ + ++ + D I + + + E + + HR Sbjct: 219 -ARSREEIEVLAE----------KDEYIHKAYDKLEEISADEEKRLEYEERQKAIRDHRH 267 Query: 243 RIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 + + +G +G+ + R +L++ E I + +GLS E + L + Sbjct: 268 MLASGRREGLREGLREGKHEHAVEMARKMLEDKLPIEKIAEYSGLSPEDVHRLEEQ 323 >UniRef50_A7M2M6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M2M6_BACOV Length = 182 Score = 50.3 bits (119), Expect = 8e-05, Method: Composition-based stats. 
Identities = 26/145 (17%), Positives = 52/145 (35%), Gaps = 16/145 (11%) Query: 163 VPDDEIVQHRRVALLELIQKHIRQRDLMG--LIDQLVVLLVTECANDSQITA-------- 212 + D + + + L+ + R D + + + L L+ ++ A Sbjct: 38 LSDCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLVAVADLSEENRIAYDKALDRY 97 Query: 213 LLNYILLTGDEARFNEFISELTRR--MPQHRERIMTIAERIHNDGYIKGEQ----RILRL 266 +N I+ + + E + +E I + G KGEQ I R Sbjct: 98 RVNQIVEEDERRKNEEMRRKAAEEGMKEGLKEGIREGIKEGMEKGMEKGEQKKQIEIARK 157 Query: 267 LLQNGADPEWIQKITGLSAEQMQAL 291 + ++G + I K TGL + ++ L Sbjct: 158 MREDGISIDTIIKYTGLQSSDIENL 182 >UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Ruminococcus torques ATCC 27756 RepID=A5KR99_9FIRM Length = 317 Score = 50.3 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 46/320 (14%), Positives = 102/320 (31%), Gaps = 40/320 (12%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARD----FMEIHLPKDLRELCDLDSLKLESASFVDEK 56 M +++F ++A + L E + ++++ ++ Sbjct: 8 MAGKENREIKNSVFVDLFYEDESAEANEIALFNAIHDEPLPEGTKIRRFRVDNTIYM--- 64 Query: 57 LRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI---EHDKRQ 113 +DI + G ++ EHQS + +M R + Y +R + K++ Sbjct: 65 --NFQNDISFDA---GGKVIVFG--EHQSTINENMPLRSLLYIGRAYERLVPPRSRYKKK 117 Query: 114 PLPLVIPML--FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH 171 +PL P FY+G L + ++++ EI++ Sbjct: 118 IVPLPTPEFYTFYNGKEKWEKEKELRLSDAYIVKDGEPSLELKVKVINIRPEEHHEILEK 177 Query: 172 RRV-----ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITA---LLNYILLTGDE 223 +V +E++Q + + + + D + ++N +L D Sbjct: 178 CQVLKEYSQFMEIVQNYQISGEEEPYKKAIKECIEKGILADYLMRKGSEVVNMLLDEYDY 237 Query: 224 ARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRI--------LRLLLQNGADPE 275 E E R Q RE ++ +G +G + ++ L+ G Sbjct: 238 ETDIEVQREEAR--EQGREE---GRKQGREEGRKQGREEGRKAERSTLIQKKLEKGKTIS 292 Query: 276 WIQKITGLSAEQMQALRQPL 295 I + E + L + Sbjct: 293 QIADELEDTEENIACLIEQF 312 >UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ATN4_CHLCH Length = 287 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. 
Identities = 49/306 (16%), Positives = 106/306 (34%), Gaps = 58/306 (18%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D + K L A D I L +D +L +++ +D++ V+ Sbjct: 5 DVVSKDIL--KRIALDIARILL------HLKVDHAELLETEH--QRVEERRADVVVLVQ- 53 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 G + +E Q+ ++A+RL+RY + H +D +Q L Y G Sbjct: 54 -GESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAHKGYDIKQYL-------IYIGK--- 102 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI----QKHIRQ 186 P S+ + + + ++D+ V ++ L L K + Sbjct: 103 APLSMP-------TGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDFKGRSE 155 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMT 246 R+++ I Q + L E + + IL ++ + E +++ Sbjct: 156 REVVRYIIQRLQELTAENESRYHDYMRMLEILSANRS----------LEKIIEEEEAMLS 205 Query: 247 IAER--------IHNDGYIKGEQRILRLLLQNGADPEW-------IQKITGLSAEQMQAL 291 + ++ G +G Q+ L++ + + ++ L+ EQ++ L Sbjct: 206 VVDQTRLPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHHVARLDKLNIEQLEEL 265 Query: 292 RQPLPE 297 L + Sbjct: 266 SDALLD 271 >UniRef50_Q2FSG0 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSG0_METHJ Length = 291 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 27/140 (19%) Query: 158 VDVTV-VPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNY 216 +D+ +P+ ++ R L L K + Q+ L L ++L + V + A + + L Sbjct: 174 IDLEKELPEKDLRNKVRELTLILADKIVDQKILDELWEELRMFKVVKYAEEKGMEKGLEK 233 Query: 217 ILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEW 276 L G ++ G + + + + +L G + E Sbjct: 234 GLEKG--------------------------IKKGMEKGKKQERETVAKNMLSLGIEDEL 267 Query: 277 IQKITGLSAEQMQALRQPLP 296 I K TGL + L++ L Sbjct: 268 IIKATGLDQSIIDKLKKSLS 287 >UniRef50_A7C2W6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C2W6_9GAMM Length = 103 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 16/44 (36%), Positives = 24/44 (54%) Query: 250 RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 G KG+ +I + + Q G D E I ITGLS E +++L + Sbjct: 56 EGEQIGEEKGKLKIAQKMQQAGMDIETIATITGLSPEVIKSLSK 99 >UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKZ7_9FIRM Length = 329 Score = 49.6 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 94/301 (31%), Gaps = 62/301 (20%) Query: 15 KTFLTHPDTARDFMEIHL--------PKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + L HP DF + P+ L ++ + + + +++ DI+ Sbjct: 9 RKLLNHPARFADFYNGTVFGGRQVLRPEQLSDVPNEQGIVILDKDG-KKRVVERRRDIIK 67 Query: 67 SVKTREGDGYIYVVI---EHQSREDIHMAFRLMRY--------SMAVMQRHIEHD----- 110 Y ++ E+Q M R M Y + Q H Sbjct: 68 KASFGA-----YFILAAEENQDTIHYGMPVRNMMYDALDYTEQMECLKQAHKSRGDVLDG 122 Query: 111 --------KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARK-------LYNAAF 155 + L V+ ++ YHGS+ P+ D +A++ L + Sbjct: 123 GGFLSGITREDRLMPVVSLILYHGSK-PWDGPRSLYDMLGLDASAKETLALKQVLPDYRI 181 Query: 156 PLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAND-------- 207 L+D + + E+ + +++ + ++ G Q L+ + Sbjct: 182 NLIDASNIEHPELFCTSLQHVFSMLKYNTDKQKFYGYAKQHQKDLLDMDDDSMLAMLTLL 241 Query: 208 -SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRL 266 Q L + D + + + + I +I +G I+GE R+ L Sbjct: 242 GEQKRLLKILETSSNDTKEGTDVCIAIDELINDGK-----IEGKI--EGKIEGEHRLATL 294 Query: 267 L 267 + Sbjct: 295 M 295 >UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QWI7_BRAHW Length = 289 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. 
Identities = 27/168 (16%), Positives = 57/168 (33%), Gaps = 34/168 (20%) Query: 157 LVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNY 216 ++D ++ D +H + EL HI +L L + ++ A + Sbjct: 123 ILDFNLLKD-IDKEHSCYVIKELETNHILTNHFEMHFLELQKYLSSNSNLKEELDAWFYF 181 Query: 217 ILLTGDEARFNEFISELTRRMPQHRERI-------------MTIAE-------------- 249 + + + E + L ++ P +E AE Sbjct: 182 LTIKEKIEKMEEIMDILVKKNPIMKEVYDEYNKFADTKDLFENYAEYEKNYFDILALSEE 241 Query: 250 --RIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 R +G +G + + R + D + I ++TGL+ E+++ L Sbjct: 242 RIRGREEGIKEGIKETQISMARNMKNKNMDIKLIGELTGLTTEEIEKL 289 >UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA1_9CLOT Length = 302 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 46/287 (16%), Positives = 106/287 (36%), Gaps = 40/287 (13%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D+LF+ + ++ +E++ + + D L + D+L+ Sbjct: 18 DSLFRVIFSEK---KELLELYNAINGSHYENPDDLIIT-----------TIGDVLY---L 60 Query: 71 REGDGYIYV------VIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-----RQPLPLVI 119 + ++ + E QS + +M R + Y + Q +++ + R+PL L Sbjct: 61 GMKNDISFLIGQHLSLYEAQSTWNPNMPLRGLFYFSRLYQGYLKEHQLDLYSRRPLSLPF 120 Query: 120 P--MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH-RRVAL 176 P ++FY+G+ + L + ++++ ++E+++ R++ Sbjct: 121 PEFIVFYNGTMEQPDRTQLRLSDLFYQAEGVPCLECTATMININYGHNEEMMKSCRKLYE 180 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 + +R R GL + V + D +L LL E +SE Sbjct: 181 YAFLINAVRSRLNEGLHLEAA---VDQAVEDCIQHDVLKNFLLKHREEVREMILSEYDEE 237 Query: 237 MPQHRERIMTIAERIHN---DGYIKGEQRI---LRLLLQNGADPEWI 277 + + E+ ++ E + G G++R+ + L G + I Sbjct: 238 LHINSEKKISYEEGLEAGVVQGTQHGQERVNALITRLAAAGRADDII 284 >UniRef50_C1DU78 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DU78_SULAA Length = 163 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. 
Identities = 32/146 (21%), Positives = 58/146 (39%), Gaps = 14/146 (9%) Query: 159 DVTVVPDDEIVQHRRVALLEL----IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL 214 D+ + IV+ + L K+I +D L L LLV E I ++ Sbjct: 2 DLNKISSKRIVKEFYDDICLLSAILTLKNIF-KDFNDLKPILRNLLVAE--TKDCIYIII 58 Query: 215 NYI-LLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 NYI L D + E+ +E++MT+ E+ +G +G + ++ L++ Sbjct: 59 NYIALAKKDLKTVENILEEV-----GGKEKMMTLTEKWRIEGLQQGIEEGIKKQLKDDI- 112 Query: 274 PEWIQKITGLSAEQMQALRQPLPERE 299 E I+ G E + + + E Sbjct: 113 KEAIEIKFGEVMEDINTKIESIKSVE 138 >UniRef50_C9KZM2 Transposase n=8 Tax=cellular organisms RepID=C9KZM2_9BACE Length = 48 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 17/42 (40%), Positives = 22/42 (52%) Query: 250 RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +G ++G I R L G D E IQK TGL E++Q L Sbjct: 7 EGRMEGKMEGCIEIARNLKSMGLDTETIQKATGLLPEEIQKL 48 >UniRef50_C1TQY0 Putative transposase, YhgA n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQY0_9BACT Length = 133 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 36/85 (42%), Gaps = 10/85 (11%) Query: 220 TGDEARFNEFISELTRRMPQ--HRERIMTIAERIHNDGYIKGEQRILRL--------LLQ 269 D + ++E R + +E + R +G KG Q LR +++ Sbjct: 39 YNDLLEVDTMLAEKVRDWTKAWEKEGLRKGIHRGRREGMEKGRQEGLRKALARTAMRMIE 98 Query: 270 NGADPEWIQKITGLSAEQMQALRQP 294 G D E I ++TGL ++++ + Q Sbjct: 99 KGMDLETISELTGLDIDKVRDMSQN 123 >UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacteria RepID=B4SC57_PELPB Length = 299 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. 
Identities = 48/295 (16%), Positives = 97/295 (32%), Gaps = 21/295 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK + +D + I L + D + RA IL +K Sbjct: 9 DFAFKKLFGSEEN-KDLL-ISLINAIVSEEDQVVEIELKNPYNLADYRAGKISIL-DIKA 65 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLVIPMLFYHGS 127 + +G + +E Q ED + R + Y ++ + K + I +L Y+ Sbjct: 66 KAENGR-WFNVEMQISEDYNFDKRAIFYWAKLVTEQLSEGMMYKELKKTISINILDYN-- 122 Query: 128 RSPYPWSLCWLDEFADPTTA----RKLYN-AAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 P + + TA +L++ +++ + Sbjct: 123 --FVPDTTEVHSCYKIINTATGKDDRLHDVFELHYIELKKFNKLHHEISSTLDRWTTFLT 180 Query: 183 HIRQRDLMGLIDQLVV---LLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 Q D +L + ++ A D + ++ + S++ + + Sbjct: 181 TAHQLDREHTPKELALDKNIVKAIAAIDRMFNEEERQVYEVRKQSLVDA-ESKIASALEK 239 Query: 240 HRERIMTIA-ERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 E+ M + E+ ++G + I LL G I + TGLS ++ +L Q Sbjct: 240 GMEKGMEMGLEKGRDEGINAASKTIALNLLGKGIAIATIAEATGLSVLEITSLSQ 294 >UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G418_9FIRM Length = 312 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. 
Identities = 44/258 (17%), Positives = 88/258 (34%), Gaps = 28/258 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F+ L P A + +L LE+A ++ K +D+ + + T Sbjct: 42 DRVFRMLLKEPKVALEVYNAMNGTLYDNPDELIITTLENAVYLGMK-----NDVSFILGT 96 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV-IP----MLFYH 125 + V+ EHQS + +M R + Y V ++ D L+ IP ++FY+ Sbjct: 97 Q------LVLYEHQSTPNPNMPLRNLAYVACVYMAYVFGDNLYGRKLIKIPEPRFVVFYN 150 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G+ S+ L + + + V++ ++E+V+ L Q Sbjct: 151 GTDKMPEQSVLRLSDAYESKSEELDLELKIRFVNINPGYNEEMVEKSP----TLYQYVKF 206 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 + ++ E A D I + L + A E M Sbjct: 207 VDIVRKYQKEMPFPEAVEKAIDECIKKGILAEFLRKNRAEVLRV-----SIFEYDEEEHM 261 Query: 246 TIAERIHNDGYIKGEQRI 263 + + +G +++ Sbjct: 262 R---QEREESRQEGIEQV 276 >UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAA90 Length = 345 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. 
Identities = 41/304 (13%), Positives = 113/304 (37%), Gaps = 44/304 (14%) Query: 11 DALFKTFLTHPDTARDFMEIHL-------PKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D +F+ ++ + + F+E L +++ E+ L++ L+++ + + D Sbjct: 64 DFVFEKIFSNHERMKSFLESVLVGKNKILHEEINEVIYLNNNLLQNSLTQEYIPKKSMFD 123 Query: 64 ILWSVKTREGDGYIYVVIE-HQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM- 121 + +KT +G ++E ++ + R+ YS + + ++ L +I + Sbjct: 124 L--QIKTSQGT----FIVEIYKRSFQPFLK-RIQYYSAQSLSQQ-QNQTHTSLKPIISIA 175 Query: 122 -----LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDE--------I 168 LF + + T L + + +++ + + + Sbjct: 176 IVDDILF-----EDDVPCISFHKTIEQKTQKVFLNYSTYVFIELGKYDNKKYDQSCVHGV 230 Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 + + LL+ H + + L + E D + L ++ E Sbjct: 231 NEKEWLDLLKKSDIHRQYKTKEVLNAAQYAQFIQEKLFDEYVKHKLY------EDQFIEE 284 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 + + Q +E + +++ G++ +++ +L++G + I TGLS E++ Sbjct: 285 IKNAKVEGIQQGQEETIKLSKHYS---IKAGKEEVVKQMLKDGLSLQKIITYTGLSKEEI 341 Query: 289 QALR 292 ++ Sbjct: 342 DEIK 345 >UniRef50_Q8YKL8 Alr7276 protein n=12 Tax=Bacteria RepID=Q8YKL8_ANASP Length = 152 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 24/114 (21%), Positives = 44/114 (38%), Gaps = 9/114 (7%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDS-LKLESASFV----DE 55 MTN T T +D+ +K L +DFM+ P+ D D Sbjct: 1 MTNKTPKTEYDSPWKQMLQ--LYFQDFMQFFFPQA-HAQIDWSRGFVFLDKELQQVVRDA 57 Query: 56 KLRALHSDILWSV-KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE 108 +L D L + + + ++ + +E QS+E+ R+ Y+ + R+ Sbjct: 58 ELGKRLVDKLVKIYRIGGEESWLLIHVEVQSQEEDDFPKRMFVYNYRIFDRYDR 111 >UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Treponema RepID=Q8GBS6_TREMA Length = 262 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. 
Identities = 50/295 (16%), Positives = 106/295 (35%), Gaps = 59/295 (20%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F + + + + F+E+ L + + + S +S + + + + DIL Sbjct: 13 DFMFCQVMKNKNLCKTFLEMLLADKIGNITHIAS---QSTVAPESEAKFVRLDIL----V 65 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 ++ Y IE Q + ++A R+ Y A+ ++ + +Y + Sbjct: 66 QDEKNNFYD-IEMQVVNEHNVAKRMRYYQSALDVSFLDKGE-----------YYTNLKDS 113 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLM 190 Y + ++ F + +Y + D+ I R RD Sbjct: 114 Y---IIFVCLFDFIGKNKAVYFFENI-----CLEDEPI----------------RLRDGT 149 Query: 191 GLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER------- 243 I ++ + + D ++ L YI +F+E I ++ R + Q+ + Sbjct: 150 KKI--IINVDAFKNIKDKALSGFLEYIKTGCITTKFSERIEKMIRTIKQNEQARQEYRFI 207 Query: 244 ---IMTIAERIHNDGYIKG----EQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +M E + G+ G +++ L G I K TGLS +++ L Sbjct: 208 SAVVMDAKEEGRSQGFTDGVNQTKRKTAAALKAMGLAKSKIAKATGLSLAEIEKL 262 >UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z376_9FIRM Length = 316 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. 
Identities = 46/285 (16%), Positives = 97/285 (34%), Gaps = 46/285 (16%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW---- 66 D +F+ + + +E++ + + D L++ D+ + Sbjct: 9 DRVFRKLFGYEKNKGNLLELYNALNDSNYTNPDDLEI-----------NTLDDVFYMNMK 57 Query: 67 ---SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK----RQPLPLVI 119 S + + IY EHQS +M R RYS + +I + R+ L + I Sbjct: 58 NDVSC-IIDWNMAIY---EHQSTWSYNMPLRGYRYSAELYNDYIVRNNLDVFRRKL-IKI 112 Query: 120 PM----LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 P +FY+G+ + L + + ++++ ++E++ + Sbjct: 113 PTPQYYVFYNGNEKRPDREVLKLSDAFMVPCKDGEFEWTATVLNINAGHNEELMSKCSI- 171 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 R+ ++ ++ L I ++Y L F + Sbjct: 172 ----------LREYAIMVSKIKEFLAESLELKDAIKKAIDYCLDNNVLKEFLQDHRSEVE 221 Query: 236 --RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQ 278 ++ E T+A D Y +GEQ L + NG + I+ Sbjct: 222 DMLWREYNEEE-TMAH-WKEDFYEEGEQHGLEVGRANGEKIKLIK 264 >UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostridium RepID=B6FJ15_9CLOT Length = 310 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. 
Identities = 48/307 (15%), Positives = 111/307 (36%), Gaps = 41/307 (13%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F+ + + DL +E ++ K +D+ + + Sbjct: 21 DRIFRMIFHEKKELLELYNAVNNSNYTNPDDLTITTIEDVVYMGMK-----NDLSFLI-- 73 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL---PLVIP----MLF 123 G + + EHQS ++ R + Y ++ + +IE K + PL IP ++F Sbjct: 74 ----GDVMNLYEHQSSFSPNLPLRGLFYFSSLYKEYIEPVKHRLYTASPLHIPFPKYVVF 129 Query: 124 YHG-SRSPYPWSLCWLDEFADPTTARKL-YNAAFPLVDVTVVPDDEIVQHRR-----VAL 176 Y+G + P L D F + ++++ + + E+++ R Sbjct: 130 YNGTKKEPERQELKLSDLFLENKEETTPSLECTAVVLNINLGKNRELMEKCRPLKEYAEF 189 Query: 177 LELIQKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 + +I+K++ ++ D +++ V + +L IL + ++E Sbjct: 190 ISIIRKYLSEQMDFGNAVNKAVDFCIHN--------GILADILQKNRSEVVDMILTEYDE 241 Query: 236 RM--PQHRERIMT-IAERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT----GLSAEQM 288 RE ++ + N+G KG + + ++ E + + LS E+ Sbjct: 242 EEFRRAWREDLLNEGFRKGLNNGLSKGIKGTIHACMKFNVPKEDVMQNLMEEFSLSQEEA 301 Query: 289 QALRQPL 295 + + Sbjct: 302 EKYLEEY 308 >UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostridium RepID=C0CTJ7_9CLOT Length = 327 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. 
Identities = 45/323 (13%), Positives = 95/323 (29%), Gaps = 53/323 (16%) Query: 11 DALFKTFLTHPDTARDFMEIHL--PKDLRELCDLDSLKLESASFVDEKLR----ALHSDI 64 D + + + D + + + D+ L R + D Sbjct: 5 DMVLNRYFEDGERYADLINGYAFNGDQVVRKEDVQELDPRETGVAGRLGRRPGVQKYRDS 64 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH--------------- 109 + V ++ + +EHQ + M R M A R + Sbjct: 65 IRRVVLGAR--FVLIGLEHQDQVHYAMPVRAMLQDAAEYDRQLRRIRRVNRRVGGLTGAE 122 Query: 110 -----DKRQPLPLVIPMLFYHGSRSPYPWSLCW-----LDEFADPTTARKLYN-AAFPLV 158 ++ + VI ++ Y+G + PW +D P +L N ++ Sbjct: 123 FLGGFTRKDRVCPVITLVLYYGKK---PWDGAMDLHGLMDCAGYPEPMLRLVNNYRLHVL 179 Query: 159 DVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYIL 218 +V + + + IQ+ + ++ V D + ++ I Sbjct: 180 EVRRFVNIRRFRTDLYQVFGFIQRSGDKEAERRFTEE---NRVYFEGMDEEAFDVITAIT 236 Query: 219 LTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR------------ILRL 266 + + R E E R+ E I + E +G ++G+ + R Sbjct: 237 GSRELERVKEQYREEGGRI-NMCEAIRGMIEDGRIEGRLEGKIEGKYEGALEKTRTVARN 295 Query: 267 LLQNGADPEWIQKITGLSAEQMQ 289 + G E I + Q++ Sbjct: 296 MYLRGMSAEDAAAICEMDTAQIE 318 >UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FYK3_ABIDE Length = 365 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. 
Identities = 47/327 (14%), Positives = 101/327 (30%), Gaps = 50/327 (15%) Query: 11 DALFKTFLTHPDTARDFMEIHL-----PKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 D L K D DF+ + + EL DL + + + R + D Sbjct: 5 DILEKKLFMFNDVFADFLNGIIFNGRQIVEESELFDLSGW----SHYKADDSRHRYQDRD 60 Query: 66 WSVKTREGDGYIYVV-IEHQSREDIHMAFRLMRYSMAVM-----------QRHIEHDKRQ 113 ++ + I ++ IE+Q D M FR++ Y A ++H++ K Sbjct: 61 VVKLWKKKNVVISLIGIENQDVPDKDMVFRVLSYDGASYKTQLAKKDEDKRKHLKDKKNT 120 Query: 114 PLP-----------LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTV 162 + VI + Y+G + + + L+D+ Sbjct: 121 EIVEIGKEDEKDIFPVITFVVYYGEEEWKYETTLKKRLKIGDGLDEFVSDYKINLIDLKK 180 Query: 163 VPDDEIVQHRRVALLEL------------IQKHIRQRDLMGLIDQLVVLLVTECANDSQI 210 +D+I + ++ L + + ++ L+ +L + + Sbjct: 181 FTEDDINKFKKDFKLLVNYMVKGSNHDAGSIELNHPEEVSELVLRLTGEELPIPRENDGG 240 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN 270 + + + + M + + MT G +G+ + + L Sbjct: 241 KTMEKFFEPMFARMAEKAEARGMAKGMTEGMAKGMT---EGMAKGLAEGKAKGMTEGLAK 297 Query: 271 GADPEWIQKITGLSAEQMQALRQPLPE 297 G GL+ + + L + L E Sbjct: 298 GMTEG---MAKGLAEGKARGLAEGLVE 321 >UniRef50_C1J8S3 YdgA n=6 Tax=Escherichia coli RepID=C1J8S3_ECOLX Length = 68 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 21/63 (33%), Positives = 30/63 (47%), Gaps = 16/63 (25%) Query: 245 MTIAERIHNDGYIKG----------------EQRILRLLLQNGADPEWIQKITGLSAEQM 288 MTIAER+ G+ +G + I L G PE IQ++TGLS E++ Sbjct: 1 MTIAERLIQKGFDEGFKESFKEGFKEGALEVAREIACRLRDMGWPPERIQEVTGLSGEEL 60 Query: 289 QAL 291 + L Sbjct: 61 KKL 63 >UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostridium RepID=B1V1L4_CLOPE Length = 300 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. 
Identities = 44/296 (14%), Positives = 102/296 (34%), Gaps = 21/296 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +FK + ++D + L ++ + ++L+S + + + KT Sbjct: 8 DFVFKRLFGAEE-SKDSLISLLNAIIKSDNPIKDIELKSPDLEKQHIGDKFCRLDIKAKT 66 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI---EHDKRQPLPLVIPMLFYHGS 127 +G+ + +E Q R++ +M R + Y + + E+ K + I +L + Sbjct: 67 DKGEI---INVEIQVRDEYNMVQRTLYYWSKIYSDQLGASENYKNLARTVCINILNFKLL 123 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFP-LVDVTVVPDDEIVQHRRVALLELIQKHIRQ 186 + + L E + F L + +E+ + K Sbjct: 124 DNDRYHNTYRLKEITTNEELTDIEEIHFIELPKSKEIKSEEVNNIDSLLKWIEFIKEPES 183 Query: 187 R--DLMGLIDQLVVL-LVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 ++ L D+ + + Y ++A ++E + R +E Sbjct: 184 ETVRILELTDESIRKAKTQLYKLSLDKKTIEQY--RIREKAMYDEISALENSREKGLQEG 241 Query: 244 IMTIAERIHNDGYIKGE--------QRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + +G +GE ++I + LL G + + I KI L ++ + Sbjct: 242 VKIGRKEGKEEGLKEGEVRGKLKANRKIAKNLLSKGLELKEIAKILELDENLVEEI 297 >UniRef50_B6FTF1 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FTF1_9CLOT Length = 173 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. 
Identities = 35/185 (18%), Positives = 71/185 (38%), Gaps = 20/185 (10%) Query: 111 KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVV-PDDEIV 169 K L VI + Y+G + + C D + + + + + V D + Sbjct: 3 KEDRLHPVITLTVYYGEK-QWDGPYCLKDMIVEMPEEIAAIFSDYKMNLLEVRNSDRYVF 61 Query: 170 QHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEF 229 + V + I RD+ + + S + ++ G A E Sbjct: 62 NNTDVQSVFEI-----TRDIFAGHFERIQEKYGNKELGSDLLTVI------GQMAGSKEL 110 Query: 230 ISELTRRMPQHRE--RIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ 287 I RM ++RE + T E++ +G ++G++ ++ +LQN I K+ +S E+ Sbjct: 111 I-----RMSRNREVNNMCTALEKLKEEGKMEGKKEVILTMLQNDYPISEICKLLNISEEE 165 Query: 288 MQALR 292 + +R Sbjct: 166 VLEIR 170 >UniRef50_Q73KA7 Putative uncharacterized protein n=2 Tax=Treponema RepID=Q73KA7_TREDE Length = 172 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 45/112 (40%), Gaps = 10/112 (8%) Query: 190 MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER-----I 244 L ++ + + L Y+ + F I E+ + + Q+ + + Sbjct: 61 FPLSKIIINADAFNNTKNKALKGFLEYLKTGKTKNEFTRRIEEMIQTVKQNEQARQEYRL 120 Query: 245 MTIAER-IHNDGYIKG----EQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 M+ E G+ +G ++ ++L Q G + I ++TGL E+++ L Sbjct: 121 MSTFEMDARYKGFTEGTYNNKKETAKILKQLGDSIQKIMQVTGLPEEEIEKL 172 >UniRef50_C9LXS5 Transposase n=3 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXS5_9FIRM Length = 347 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. 
Identities = 44/323 (13%), Positives = 101/323 (31%), Gaps = 51/323 (15%) Query: 10 HDALFKTFLTHPDTARDFMEIHLP------------------KDLREL-CDLDS------ 44 +D K L D ++ +P ++ + D+D Sbjct: 32 YDQHAKRLLAQKDVVARILKGVVPEFRQMDLATIIGRCIEGEPEIGAIPIDMDKTNAARR 91 Query: 45 ----LKLESASFVDEKLRALHSDILWSVKTREGDGYIYVV--IEHQ-----SREDIHMAF 93 ++ ++ + DIL+ K + I ++ IE Q SR + Sbjct: 92 IPKEIRGDNTESASPTEGWIRFDILFRAKVPQTGARITLIVNIEAQKTQSNSRLGYALLR 151 Query: 94 RLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNA 153 R + Y+ ++ E + + Y+ + Y P + Sbjct: 152 RAIYYACRLISSQKETEFAKSN--------YNDIKKVY----SVWICMDAPDDKSAINFY 199 Query: 154 AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ-RDLMGLIDQLVVLLVTECANDSQITA 212 E + + ++ + +L+ + L V A +I Sbjct: 200 DMQERHFLHRTKAEKSDYDLLNIIMIYLGADDSGNELVRFLKLLFRDTVKSAAEKKKILE 259 Query: 213 LLNYILLTGDEARFNEFISELTRRMPQH--RERIMTIAERIHNDGYIKGEQRILRLLLQN 270 + ++GD + + L+ + + + I E+ G +GE ++ +L+ Sbjct: 260 SEFDLDISGDMEKEMNTMCNLSEGIFERGIEQGIEQGIEQGIEQGIEQGESGMILSMLKK 319 Query: 271 GADPEWIQKITGLSAEQMQALRQ 293 G D I I+ S ++++ L + Sbjct: 320 GYDLTSIADISQWSIKKIEQLAK 342 >UniRef50_Q6D2V6 Putative uncharacterized protein (Fragment) n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2V6_ERWCT Length = 77 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 20/67 (29%) Query: 245 MTIAERIHNDGYIKGEQR--------------------ILRLLLQNGADPEWIQKITGLS 284 MTIAE++ G+ +G QR I R LL G D E +++IT L Sbjct: 1 MTIAEQLKKMGFDEGIQRGIQQGLEQGIEQGMKNSARQIARELLLTGMDKEKVRQITRLD 60 Query: 285 AEQMQAL 291 E+++ L Sbjct: 61 DEELEQL 67 >UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=B8FTH9_DESHD Length = 325 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. 
Identities = 46/331 (13%), Positives = 105/331 (31%), Gaps = 49/331 (14%) Query: 3 NFTTSTPHDALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 S D FK + F+ L + ++ + + + Sbjct: 2 KEFISLKIDYAFKLIFGKEGNEAILIAFLNAALKLPQERRIEEITIINPELNKEYPEDKK 61 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLP 116 D+ T +G + + IE Q M R + Y + R I K Sbjct: 62 SILDV--RAITSQG---MQINIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKT 116 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVV----PDDEIVQHR 172 + I ++ ++ + + + + D + +++ + EI Sbjct: 117 VSINIVDFNYLKQTSSYHNVFH-LYEDEEKFQLTDVLEIHFMELPKLLAKWRKREISLWE 175 Query: 173 R-VALLELIQKHIRQRDLMGLIDQLVVLL------------------VTECANDSQITAL 213 + L+ + ++++ +++++ + + E D + L Sbjct: 176 NELVRWLLLLEGADNQEILQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAIL 235 Query: 214 LNYILLTGDEARFNEFISE-LTRRMPQHRERIMTIAE-----------RIHNDGYIKGEQ 261 + E R E + E + + + + R + IAE +G +G Sbjct: 236 DEKAAIREAELRLQEALEEGMAKGIAEGRAK--GIAEGKAEGKAEGRAEGRAEGRAEGRA 293 Query: 262 RILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + + LL G + I + TGLS E++ L+ Sbjct: 294 EVAKKLLVLGFEITKIAEATGLSEEEISGLK 324 >UniRef50_UPI0001C3858D hypothetical protein AplaP_08641 n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C3858D Length = 102 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 1/46 (2%) Query: 249 ERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 E G + ++ R LL + E I + TGLS E++ LR+ Sbjct: 54 EERRQQGIEQAKRETARNLLGQ-LNDEAIAQATGLSLEEIATLREE 98 >UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax=Proteobacteria RepID=C4ZLA7_THASP Length = 339 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. 
Identities = 52/296 (17%), Positives = 96/296 (32%), Gaps = 46/296 (15%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA- 59 M +D+ +K + H +F++ + P D F+D++L+ Sbjct: 1 MPASAAQDDYDSPWKEAVEH--AFPEFIDFYFPDA-GRQIDWARGHR----FLDKELQQI 53 Query: 60 --------LHSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD 110 H D L SV T G+ ++ V IE Q D A R+ Y+ + + Sbjct: 54 VRDAALGRRHVDKLASVTTHAGEEDWLCVHIEVQGSMDPDFARRMFVYNYRIYDSYDR-- 111 Query: 111 KRQPLPLVIPM-LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP---LVDVTVVPDD 166 V + + + P D F + +N FP LVD + Sbjct: 112 ------PVASLAVLADDDPAWRP------DRFGYERLGCR-HNLQFPVAKLVD-HAADEA 157 Query: 167 EIVQHRRVALLELIQKHIRQRDLMGLIDQL--VVLLVTECANDSQITALLNYILLTGDEA 224 ++ + L +R I + LV + D Sbjct: 158 ALLCNPNPFALVTAAHLYTRRTRRSPIARFDAKRRLVRLLYERDWTRQRILDFFSVLDWM 217 Query: 225 R--FNEFISELTRRMP----QHRERIMTIAERI-HNDGYIKGEQRILRLLLQNGAD 273 EF L + + + + + +T ER+ G KG ++ L + ++ G + Sbjct: 218 MRLPREFEQRLWQDIENIEGERKVKYVTSVERLAIERGLQKGMEQGLEIGIEKGIE 273 >UniRef50_C1DU30 Putative uncharacterized protein n=7 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DU30_SULAA Length = 313 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. 
Identities = 48/296 (16%), Positives = 100/296 (33%), Gaps = 62/296 (20%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLR-ELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D L K +P A +EI L K + +L LK+ +D++ ++ Sbjct: 7 DLLLKHLFKNP--ATKLIEIILGKKVNWQLLQDSDLKIVKT---------READLVVKLE 55 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 + IE QS D M +R+ Y + ++ D Q + Y G Sbjct: 56 DNT-----ILHIEIQSTNDPSMPYRMFEYFYLITDKYKPKDLIQ-------VCIYIGKE- 102 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI--------- 180 P + +F+D + + L+D+ +P E++ + + L Sbjct: 103 --PLKMSDKIQFSD-------WTYRYRLIDIKDIPCKELITSQNITDKLLAGLCKIEDPK 153 Query: 181 ---------QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNY--------ILLTGDE 223 K+ +D L + + + +I + + I T +E Sbjct: 154 FYVENVIKEIKNANPKDRKELFTLFLEISKIRNNIEEEIRSYIRQEDFEMPITIEWTREE 213 Query: 224 ARFNEFISELTR--RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWI 277 + ++ + + ++E + ++ +G +G Q+ L+ L G I Sbjct: 214 IESYPVLRDVLKIGKEEGYKEGLQQGLQQGLKEGLEQGVQQGLQKGLIEGLRQSVI 269 >UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W4R9_DYAFD Length = 293 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. 
Identities = 51/311 (16%), Positives = 105/311 (33%), Gaps = 51/311 (16%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA----------- 59 D L+K+ L + DF++ P L D+D ++D++L Sbjct: 5 DMLWKSILE--EIFDDFLKFFFPNA-EALFDMDR----GFEYLDQELEQLFPPEGNAIAT 57 Query: 60 LHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 + D L V R G + ++ V IE Q D R+ Y + ++ + + Sbjct: 58 RYVDKLVKVYCRSGAEAWLLVHIEVQGYRDETFPDRMFTYYYRICDKYRK--------PI 109 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR----- 173 + +L + F V ++E+ Sbjct: 110 TAIAILTDD------CRHFLPGQFEQACLGTSVCFRFNSYKVLEQSEEELAASDNPFAQV 163 Query: 174 -VALLELIQKHIRQRDLMGLID-QLVVLLVTECANDSQITAL---LNYILLTGDEARFNE 228 +A I+ D + + L L+ + ++ L L + + D+ E Sbjct: 164 ILATKLAIKGSRFSSDELYRLKIDLAKRLLKRNFSKRKVGRLMEFLKFYVSLEDDDLDRE 223 Query: 229 FISELTRR-----MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN-GADPEWIQKITG 282 ++ E+ R +P E TI + G + +++ L++ E I ++ Sbjct: 224 YLKEVQRLFNPEPIPMTWEE--TILYIVEEKGAEAAKTTVVQNLIRETNFTSEEIARLAD 281 Query: 283 LSAEQMQALRQ 293 +S E +Q ++Q Sbjct: 282 VSVEFVQKIKQ 292 >UniRef50_C3QLI8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. D1 RepID=C3QLI8_9BACE Length = 233 Score = 47.6 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 13/54 (24%), Positives = 24/54 (44%) Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +E + E+ G K + I R + ++G + I K TGL + ++ L Sbjct: 180 EGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSSDIENL 233 >UniRef50_A7BL62 Putative uncharacterized protein n=2 Tax=Beggiatoa RepID=A7BL62_9GAMM Length = 166 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. 
Identities = 21/113 (18%), Positives = 40/113 (35%), Gaps = 8/113 (7%) Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 + ++ + D L L+ ++ + +YI E R ++ E Sbjct: 56 PYREWMLAIQDSLDELVEENDYTVPEVRQIFDYIEKDLISPE------ERARMFDEYGEE 109 Query: 244 IMTIAERIHN--DGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 + + G K + I + +LQ G I +IT LS E + L+ Sbjct: 110 QVKQQHFVKGVAKGIEKEKLEIAQKMLQQGMAISLISQITKLSEEAITHLKNE 162 >UniRef50_A8V3I7 Putative uncharacterized protein (Fragment) n=2 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V3I7_9AQUI Length = 246 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 45/266 (16%), Positives = 99/266 (37%), Gaps = 35/266 (13%) Query: 33 PKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMA 92 PK ++ L + KL S +++ D L + DG I+ +E Q+ D +M Sbjct: 4 PKFIQILTGKSATKLLDTSL--PEVKDRRVDFLVEL----EDGKIF-HLELQTTNDKNMP 56 Query: 93 FRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYN 152 FR++ Y + ++ D ++ M+ Y G + + ++ Sbjct: 57 FRMLEYYTLISPKYPSKD-------ILQMVLYLGEK----------PLKMENKIEKENLK 99 Query: 153 AAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVT--ECANDSQI 210 ++ L D+ + +E+++ + +++ ++ +++ L E + Sbjct: 100 FSYILKDIKEIKCEELLESEDLTD-KILAVLCDVKNPSKYFREILTELSKLPERKRRDYL 158 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN 270 LLN LL+ E E + + M ++ DG KG+ + + N Sbjct: 159 KKLLN--LLSYRPKLMEELRKEENKMPLTIDKETME-KHPLYKDGVEKGKLEAKKEDILN 215 Query: 271 -----GADPEWIQKITGLSAEQMQAL 291 G D I ++ L E ++ + Sbjct: 216 LHKKIGWDSNKIAEVLELPVEFVKEI 241 >UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1Q2_EUBE2 Length = 321 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. 
Identities = 51/332 (15%), Positives = 103/332 (31%), Gaps = 61/332 (18%) Query: 3 NFTTSTPH--DALFKTFLTHPDTARDFMEIHL--------PKDLRELCDLDSLKLESASF 52 N + T H D KTF + D + P L E+ S + S S+ Sbjct: 2 NNSNRTTHQKDVSLKTFWRDNEHFADLFNATVFNGKQVLKPDKLTEMDTDVSATIHSKSY 61 Query: 53 VDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH--- 109 + R D++ K +G + + +E Q + M R M Y + Sbjct: 62 NESITRNR--DVV--KKMSDGVEFNILGLEIQDKTHYAMPLRTMTYDALGYIKEYNDIKK 117 Query: 110 -------------------DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL 150 +K +I ++ Y+G S + C D K Sbjct: 118 HHKLNKDSFSSHEEFLSGINKSDRFHPIITLVLYYGE-SLWDGPTCLSDMMISMPDNIKA 176 Query: 151 YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI 210 Y + + L V ++ D+ + RD+ +I + + + Sbjct: 177 YFSDYKLNLVQILDSDK-----------YTFYNEDVRDVFNIIRNIYNDDFDSIYREYES 225 Query: 211 TAL-LNYILLTGDEARFNEFISELTRRMPQHR-----ERIMTIAERIHNDGYIKGE--QR 262 + ++ + L + + + L Q E + + G +G ++ Sbjct: 226 RNVDIDVMELICNITSVPKLMD-LCTDTEQGGTVNMCEAMKRFQAECESKGMKEGIDSEK 284 Query: 263 ILRL--LLQNGADPEWIQKITGLSAEQMQALR 292 + + +L+ G E I +T + E ++ Sbjct: 285 VNSIISMLEFGITKEQI--LTRYTKEDLERAE 314 >UniRef50_UPI00019735B3 hypothetical protein ClM62_08045 n=1 Tax=Clostridium sp. M62/1 RepID=UPI00019735B3 Length = 255 Score = 46.9 bits (110), Expect = 9e-04, Method: Composition-based stats. 
Identities = 39/220 (17%), Positives = 77/220 (35%), Gaps = 27/220 (12%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D LF+ + D R+ L + LE+A +++ K +D+ + + Sbjct: 27 DTLFRMLFNDREALLSLYNAVGNTDYRDPSLLQIVTLENAVYMNVK-----NDLAFLLGF 81 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV---IP--MLFYH 125 + EHQS + +M R + Y+ + I L+ +P ++FY+ Sbjct: 82 ELN------LYEHQSTWNPNMPLRDLFYAAREYEMLIRDQSLYSSRLIKLPVPRFIVFYN 135 Query: 126 GSRSPYPWSLCWL-DEFADPTTA--RKLYNAAFPL--------VDVTVVPDDEIVQHRRV 174 G + L D F P + F L V + ++ + R Sbjct: 136 GREKQEERCVLKLSDAFETPVEECIHEGILRDFLLKYRAEVTNVSIFEYNEEREKELLRK 195 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL 214 A E +K ++ + I L+ A+ +++ L Sbjct: 196 AEYEFGKKEGMEQGMEQGICALIQTCRELGASRETVSSAL 235 >UniRef50_C4Z592 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C4Z592_EUBE2 Length = 315 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 91/270 (33%), Gaps = 48/270 (17%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL--RALHSDILWSV 68 DALF+ L + DE L D+++ Sbjct: 11 DALFRKVFEEKKDLLSLYNA----------------LNNTEHTDENLITVNTIEDVIY-- 52 Query: 69 KTREGDGYIYV------VIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PL 115 + +V + EHQS + +M R + Y + + +IE + + L Sbjct: 53 -VGYKNDIAFVIDSELNLYEHQSSVNKNMPIRGLIYFAELYKGYIERNSLRIYNETEVKL 111 Query: 116 PLVIPMLFYHGSRSPYPWSLCWL-DEFADPT---TARKLYNAAFPLVDVTVVPDDEIVQH 171 P ++FY+G + S+ L D F + + L+++ + EI+ Sbjct: 112 PFPRYVVFYNGEKDETEKSVQRLADLFVRNEANQNQKPCLDVEVLLLNINYGCNKEIMNK 171 Query: 172 RRVALLELIQKHIRQRDLM-GLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 + + ++ R ++ G L + + S+ + L +EA N + Sbjct: 172 -----CQKLMEYSRLIAMIRGKTADLAKIYSQDSIEKSKKEIFTEAVSLAIEEAISNNIL 226 Query: 231 SE-LTRRMPQHRERIMTIAERIHNDGYIKG 259 E L + + + ++T YI+G Sbjct: 227 REILIKNKAEVTDMLLT---EFDEKDYIEG 253 >UniRef50_B0BV37 Putative uncharacterized protein n=7 Tax=Rickettsia RepID=B0BV37_RICRO Length = 213 Score = 46.9 bits (110), Expect = 0.001, 
Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 4/150 (2%) Query: 155 FPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL 214 F +++ + ++ +H + L ++++K D+ LL + +L Sbjct: 64 FKDIELHTIEINKFAKHPKEELSDVVKKVKNALDIWLAFLTRNDLLNKDNLPKELDNDVL 123 Query: 215 NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE----QRILRLLLQN 270 L + N+ E + + +++ ++G +GE I + +L + Sbjct: 124 KKALTVLEVINLNDAEREEYENRLELLRIETSAFKKMKDEGRAEGEARRNIEIAKEMLID 183 Query: 271 GADPEWIQKITGLSAEQMQALRQPLPERER 300 E I K T LS E+++ L+ + + E+ Sbjct: 184 KEPLETIIKYTKLSKEEIEKLKAEIDKAEK 213 >UniRef50_A3JHY3 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=A3JHY3_9ALTE Length = 337 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 89/252 (35%), Gaps = 27/252 (10%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH--SDILWS 67 HD FK + D R +E+ P++ R + + + E+L D+ Sbjct: 9 HDQNFKNLI--IDYPRQAIELFSPEEARHIGPKARVVPLRQEQLKERLGERFRELDVPLM 66 Query: 68 VKT--REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 V+ E + ++V+ E ++ D RL Y + + + +R V+P++ + Sbjct: 67 VEWPGGEREALLFVL-EEETDPDRFSIHRLAHYCLDL--SELCKTRR-----VVPVVIFL 118 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVP-DDEIVQHRRVA-LLELIQKH 183 +R P L + + + + +D +A L + K Sbjct: 119 NTRGSEPVQLNL------GGDCHEYLRFHYIRCALQELNAEDYWESSNLIARLCLNLMKW 172 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITAL-LNYILLTGDEARFNEFISELTRRMPQHRE 242 ++ L + L E + Q+ + I D+ ++ + PQ Sbjct: 173 KEEQKLEVYARAVRGLSTLEPDPEKQLKYIDFIDIYSALDDNEMEQY----KAQYPQESN 228 Query: 243 RIMTIAERIHND 254 + T++ER+ + Sbjct: 229 TMATLSERLRAE 240 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 282 1e-74 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 
277 5e-73 UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 275 1e-72 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 269 8e-71 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 261 3e-68 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 259 1e-67 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 257 3e-67 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 251 2e-65 UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 247 3e-64 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 247 3e-64 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 243 6e-63 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 240 7e-62 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 238 2e-61 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 236 8e-61 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 235 2e-60 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 226 5e-58 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 216 7e-55 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 216 8e-55 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 209 1e-52 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 205 2e-51 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 202 1e-50 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 196 7e-49 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 195 2e-48 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 192 1e-47 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 191 3e-47 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 191 4e-47 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 
187 5e-46 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 186 7e-46 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 183 9e-45 UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 180 4e-44 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 180 5e-44 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 180 5e-44 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 179 8e-44 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 179 1e-43 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 177 3e-43 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 176 1e-42 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 176 1e-42 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 174 4e-42 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 172 1e-41 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 171 3e-41 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 170 5e-41 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 170 5e-41 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 170 6e-41 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 169 1e-40 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 169 1e-40 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 167 4e-40 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 167 5e-40 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 166 8e-40 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 164 3e-39 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 164 3e-39 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 163 1e-38 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 
162 1e-38 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 161 3e-38 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 159 2e-37 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 158 3e-37 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 158 3e-37 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 158 4e-37 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 156 6e-37 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 154 3e-36 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 154 3e-36 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 154 4e-36 UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 154 5e-36 UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 153 6e-36 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 151 2e-35 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 150 5e-35 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 150 5e-35 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 150 5e-35 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 150 6e-35 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 147 6e-34 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 146 9e-34 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 146 1e-33 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 143 1e-32 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 142 1e-32 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 139 1e-31 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 139 1e-31 UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 138 2e-31 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 
135 2e-30 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 134 3e-30 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 134 4e-30 UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 133 7e-30 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 133 8e-30 UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseifl... 132 1e-29 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 132 1e-29 UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 132 2e-29 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 130 6e-29 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 129 1e-28 UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 129 1e-28 UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NI... 128 2e-28 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 128 3e-28 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 125 2e-27 UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostri... 124 3e-27 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 122 1e-26 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 119 9e-26 UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synecho... 119 1e-25 UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax... 119 1e-25 UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostri... 118 2e-25 UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuok... 118 3e-25 UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 117 6e-25 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 117 7e-25 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 116 1e-24 UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bactero... 114 5e-24 UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorell... 
114 6e-24 UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID... 113 7e-24 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 113 7e-24 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 112 2e-23 UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax... 112 2e-23 UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultu... 111 4e-23 UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacter... 110 5e-23 UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Trepone... 110 7e-23 UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostr... 110 8e-23 UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotro... 109 9e-23 UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=... 109 1e-22 UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabac... 108 2e-22 UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubact... 107 6e-22 UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfi... 107 7e-22 UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadoba... 106 1e-21 UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillu... 106 1e-21 UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coproco... 105 2e-21 UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacter... 104 5e-21 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 104 5e-21 UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermo... 104 5e-21 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 104 6e-21 UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachys... 102 1e-20 UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 102 1e-20 UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 102 2e-20 UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenom... 101 3e-20 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 
101 3e-20 UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenom... 101 3e-20 UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 101 4e-20 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 100 5e-20 UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostri... 100 7e-20 UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW 100 9e-20 UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostr... 100 1e-19 UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 100 1e-19 UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermo... 99 2e-19 UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax... 98 4e-19 UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostri... 98 4e-19 UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevote... 98 5e-19 UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostri... 98 5e-19 UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryante... 96 2e-18 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 96 2e-18 UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacte... 96 2e-18 UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorell... 95 3e-18 UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolba... 95 3e-18 UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostri... 95 3e-18 UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Ricket... 95 3e-18 UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Rosebur... 94 4e-18 UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacte... 94 4e-18 UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfi... 94 5e-18 UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostri... 93 9e-18 UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteri... 93 2e-17 UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachys... 
92 2e-17 UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibroba... 91 4e-17 UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacte... 91 4e-17 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 91 4e-17 UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax... 91 4e-17 UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachys... 91 4e-17 UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacter... 91 6e-17 UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachys... 91 6e-17 UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotro... 91 6e-17 UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostri... 91 7e-17 UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Rumino... 90 1e-16 UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotro... 89 1e-16 UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methano... 89 2e-16 UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptosp... 89 2e-16 UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax... 89 2e-16 UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiat... 89 2e-16 UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255... 88 3e-16 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 88 3e-16 UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microco... 88 3e-16 UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=... 88 3e-16 UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YM... 88 4e-16 UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostri... 88 5e-16 UniRef50_C8PT67 Putative uncharacterized protein n=1 Tax=Trepone... 88 5e-16 UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax... 87 9e-16 UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xen... 86 1e-15 UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminoc... 
86 1e-15 UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea f... 86 2e-15 UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacte... 86 2e-15 UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Trepone... 85 2e-15 UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatex... 85 3e-15 UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulic... 85 4e-15 UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotro... 84 5e-15 UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostr... 84 7e-15 UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingoba... 83 9e-15 UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabac... 82 3e-14 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 82 3e-14 UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiat... 81 4e-14 UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorob... 81 5e-14 UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobac... 80 7e-14 UniRef50_A8VV66 ATPase associated with various cellular activiti... 80 1e-13 UniRef50_C4Z592 Putative uncharacterized protein n=2 Tax=Clostri... 79 1e-13 UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 ... 79 2e-13 UniRef50_C0DB21 Putative uncharacterized protein n=2 Tax=Clostri... 79 2e-13 UniRef50_UPI00019735B3 hypothetical protein ClM62_08045 n=1 Tax=... 79 2e-13 UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfuri... 79 2e-13 UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiat... 79 2e-13 UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadoba... 78 3e-13 UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria ... 78 3e-13 UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 78 3e-13 UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicycl... 78 3e-13 UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscil... 
78 3e-13 UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldice... 78 5e-13 UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Trepon... 78 5e-13 UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacte... 78 6e-13 UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachys... 77 6e-13 UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacteriu... 77 8e-13 UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfo... 76 1e-12 UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia... 76 1e-12 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 76 1e-12 UniRef50_A6EJS1 Putative uncharacterized protein n=2 Tax=Pedobac... 76 1e-12 UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candida... 76 1e-12 UniRef50_A7BPH0 Putative uncharacterized protein n=5 Tax=Beggiat... 76 2e-12 UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobac... 76 2e-12 UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostri... 75 2e-12 UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanoba... 74 5e-12 UniRef50_C9LXS5 Transposase n=3 Tax=Selenomonas sputigena ATCC 3... 73 1e-11 UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8Y... 73 2e-11 UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfo... 72 2e-11 UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cya... 71 4e-11 Sequences not found previously or not previously below threshold: UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bactero... 91 4e-17 UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobac... 90 1e-16 UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=... 88 3e-16 UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachys... 83 2e-14 UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=P... 82 2e-14 UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacter... 82 2e-14 UniRef50_B0NFN2 Putative uncharacterized protein n=4 Tax=Clostri... 
82 2e-14 UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminoc... 81 3e-14 UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobac... 81 6e-14 UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bactero... 79 1e-13 UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia... 79 2e-13 UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=... 78 4e-13 UniRef50_B0C251 Putative uncharacterized protein n=1 Tax=Acaryoc... 77 8e-13 UniRef50_C9RMD5 Putative uncharacterized protein n=1 Tax=Fibroba... 76 1e-12 UniRef50_C6Y2C7 Putative uncharacterized protein n=2 Tax=Pedobac... 76 1e-12 UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfi... 76 2e-12 UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax... 74 4e-12 UniRef50_C1QAK6 Putative uncharacterized protein n=1 Tax=Brachys... 74 4e-12 UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microco... 74 5e-12 UniRef50_A7C3X3 Putative uncharacterized protein n=7 Tax=Beggiat... 74 5e-12 UniRef50_B7I1C8 Putative uncharacterized protein n=16 Tax=Bacill... 74 5e-12 UniRef50_B3QUJ9 Putative uncharacterized protein n=8 Tax=Bacteri... 74 8e-12 UniRef50_C9LUC8 Putative uncharacterized protein n=5 Tax=Selenom... 73 1e-11 UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanoth... 73 1e-11 UniRef50_C4Z2A6 Putative uncharacterized protein n=2 Tax=Eubacte... 73 2e-11 UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostri... 73 2e-11 UniRef50_B0MQP0 Putative uncharacterized protein n=2 Tax=Eubacte... 71 5e-11 UniRef50_C9RP54 Putative uncharacterized protein n=1 Tax=Fibroba... 71 5e-11 UniRef50_C7GDU7 Putative uncharacterized protein n=1 Tax=Rosebur... 71 5e-11 UniRef50_C9RLI8 Putative uncharacterized protein n=1 Tax=Fibroba... 71 6e-11 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 282 bits (721), Expect = 1e-74, Method: Composition-based stats. 
Identities = 157/311 (50%), Positives = 210/311 (67%), Gaps = 21/311 (6%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 TT TPHDA F+ FLT PD ARDFME+HLP +LR +CDL +LKLES SFV++ LR Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 SD+L+S+KT GDGYI+V++EHQS D HMAFRL+RY++A MQRH+E + LPLVIP+ Sbjct: 63 SDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAG-HKKLPLVIPV 121 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 LFY G RSPYP+S WLDEF D A KLY++AFPLVDVTV+PDDEI HR +A L L+Q Sbjct: 122 LFYTGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQ 181 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 KHI QRDL L+D+L +L+ + SQ+ +L++YI+ G+ + F+ EL +R+PQH Sbjct: 182 KHIHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHG 241 Query: 242 ERIMTIAERIHNDGYIKGEQ--------------------RILRLLLQNGADPEWIQKIT 281 + +MTIA+++ G KG Q +I R +LQN D + K+T Sbjct: 242 DALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMT 301 Query: 282 GLSAEQMQALR 292 GL+ + + +R Sbjct: 302 GLTEDDLAQIR 312 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 277 bits (707), Expect = 5e-73, Method: Composition-based stats. 
Identities = 154/311 (49%), Positives = 209/311 (67%), Gaps = 21/311 (6%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 +T TPHDA F+ FLT P+ ARDFME+HLP +LR +CDL +LKLES SFV++ LR Sbjct: 3 KKNSTPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 SD+L+S+ T EG+GY++V+IEHQS D HMAFRL+RY++A MQRH+E LPLVIP+ Sbjct: 63 SDVLYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAG-HAKLPLVIPV 121 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 LFY G RSPYP+S WLDEF DP A KLY+ AFPLVDVTV+PDD+I++HR +A L L+Q Sbjct: 122 LFYVGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQ 181 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 KHI QRD+ L D+L LL+ + + Q+ AL++Y+L G+ A F+ EL +R+PQH Sbjct: 182 KHIHQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHG 241 Query: 242 ERIMTIAERIHND--------------------GYIKGEQRILRLLLQNGADPEWIQKIT 281 + +MTIA+++ G KG+ + R LL+ G E +Q+ T Sbjct: 242 DALMTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEAT 301 Query: 282 GLSAEQMQALR 292 GLS + + +R Sbjct: 302 GLSEDDLAQIR 312 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 275 bits (704), Expect = 1e-72, Method: Composition-based stats. 
Identities = 181/292 (61%), Positives = 229/292 (78%), Gaps = 5/292 (1%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT TTSTPHDA+FK+FL HPDTARDF++IHLP LR+LCDL +LKLE SF+DE LR Sbjct: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SD+LWSVKT+EG GYIYVVIEHQS+ + MAFR+MRYS+A MQ H++ + LPLV+P Sbjct: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 MLFYHG RSPYP+SLCWLDEFA+P ARK+Y++AFPLVD+TVVPDDEI+QHR++ALLELI Sbjct: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIRQRDL+GL+DQ+V LLVT ND Q+ AL NY+L TGD RF FI E+ R PQ Sbjct: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 Query: 241 RERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQM 288 +E++MTIA+R+ +G ++G+ I + +L G D E + +T LS + + Sbjct: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 269 bits (688), Expect = 8e-71, Method: Composition-based stats. 
Identities = 139/307 (45%), Positives = 191/307 (62%), Gaps = 17/307 (5%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M+ T TPHDA+F+ FL TA+DF +I LP D++ LCD ++LK ES SF+D ++ Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SV DGY+Y +IEHQS D MA+RLMRYSMA MQRH+E LPLV P Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAG-HDKLPLVFP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFY G +SP+P+S WLD F P A K+Y+ F L+DVT + DD I+QHRR+ALLELI Sbjct: 120 VLFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR+RD+ L+D +V LL D+Q+ ++NY++ G+ A FI+E+ +R +H Sbjct: 180 QKHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKH 239 Query: 241 RERIMTIAERIHND--------GYIKGEQR--------ILRLLLQNGADPEWIQKITGLS 284 E +MTIAE + + G +G Q+ I R +L G + ++ TGLS Sbjct: 240 EEALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLS 299 Query: 285 AEQMQAL 291 + L Sbjct: 300 DNALDNL 306 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 261 bits (666), Expect = 3e-68, Method: Composition-based stats. 
Identities = 153/334 (45%), Positives = 206/334 (61%), Gaps = 44/334 (13%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T TPHDA+FK FL+H DTARDF+EIHLP LR +CDLD+L+LES SF+++ LR Sbjct: 1 MKRKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVH 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SDIL+S+KT +G+ Y+Y VIEHQS D MAFRLMRYS++ MQ H+E + LPLVIP Sbjct: 61 YSDILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQG-HKKLPLVIP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG PYPWS W D F A ++Y++AFPLVDVTV+PDDEI+ H+RVALLE++ Sbjct: 120 VLFYHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIV 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIRQRD+ L +L +L + + ++LNYILL GD A FI +L + P++ Sbjct: 180 QKHIRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKY 239 Query: 241 RERIMTIAERIHNDGYIKGEQR-------------------------------------- 262 E +MTIA+++ + G+ +G + Sbjct: 240 EEVLMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEK 299 Query: 263 -----ILRLLLQNGADPEWIQKITGLSAEQMQAL 291 I R L+ NG D E I K TGLS +++ + Sbjct: 300 RASLKIARALMDNGIDRETIMKSTGLSQNELEQI 333 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. 
Identities = 165/312 (52%), Positives = 215/312 (68%), Gaps = 25/312 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT TTS+PHDA+FKTF+ P+TARDF+EIHLP+ LR+LC+L +L+LE SF+++ LRA Sbjct: 1 MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAY 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +SD+LWSV+T EGDGYIY VIEHQS + +MAFRLMRY+ A MQRH++ +PLV+P Sbjct: 61 YSDVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKG-YDRVPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG SPYP+SL WLDEF DP AR+LY AFPLVD+T+VPDDEI+QHRR+ALLELI Sbjct: 120 LLFYHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELI 179 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR RDL+G++D++ LLV NDSQ+ L NY+L GD +RF FI E+ R P Sbjct: 180 QKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQ 239 Query: 241 RERIMTIAERIHNDGYIKGEQR------------------------ILRLLLQNGADPEW 276 +E +MTIAER+ +G+ G Q I +L+ G + E Sbjct: 240 KEILMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREI 299 Query: 277 IQKITGLSAEQM 288 + T L+ + Sbjct: 300 VLAATQLTDADI 311 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 257 bits (657), Expect = 3e-67, Method: Composition-based stats. 
Identities = 141/302 (46%), Positives = 197/302 (65%), Gaps = 9/302 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T TPHDA+FK FL+ +TA+DF +I LP +++ LCDLDSLK+ES SF+D +++ Sbjct: 7 MTKKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNY 66 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SV T +G GYIYV+IEHQS D +A+RLMRYS+A MQ+H+E +Q LPLV P Sbjct: 67 QSDILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQ-LPLVFP 125 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFY G +SP+P+S WLD F D A +YN F L DVT + D EI+QH+R+ALLEL+ Sbjct: 126 ILFYCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELL 185 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 QKHIR+RD+ L+D +V LL D+Q+ + NY++ G+ R EFI+ + ++ +H Sbjct: 186 QKHIRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKH 245 Query: 241 RERIMTIAERIHNDGYIKGEQRI--------LRLLLQNGADPEWIQKITGLSAEQMQALR 292 +MTIA++I G KG Q+ + L NG D ++ TGLS E++ Sbjct: 246 EGALMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNKFE 305 Query: 293 QP 294 Sbjct: 306 NQ 307 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 251 bits (640), Expect = 2e-65, Method: Composition-based stats. 
Identities = 152/311 (48%), Positives = 202/311 (64%), Gaps = 25/311 (8%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 N TT TPHDA F++FL +PD ARDF+E+HLP + R+LCDL +LKLE A+FV+ L Sbjct: 5 KNTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYA 64 Query: 62 SDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDILWSVKT G DGY+Y +IEHQS E+++M FR++RYS+A MQRH+E K LPLVIP Sbjct: 65 SDILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQHK--TLPLVIP 122 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG RSPYP+S+ WLD F +P A K+Y FPLVD+TVV D+EI+ HRR+A L L+ Sbjct: 123 VLFYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLL 182 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 KHIRQRD++ +D LV L ++ QIT L NY+L G E EF+ L +R+PQH Sbjct: 183 MKHIRQRDMLMCLDNLVRALQDI-QDEEQITVLFNYLL-NGSEHVTVEFLQTLAQRLPQH 240 Query: 241 RERIMTIAERIHNDGYI--------------------KGEQRILRLLLQNGADPEWIQKI 280 + IMT+AER+ +G + + I R L G I ++ Sbjct: 241 EDSIMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAREIARELRNAGMPAAQICQL 300 Query: 281 TGLSAEQMQAL 291 TGLS +++ + Sbjct: 301 TGLSEAELKNI 311 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 247 bits (631), Expect = 3e-64, Method: Composition-based stats. 
Identities = 138/298 (46%), Positives = 194/298 (65%), Gaps = 10/298 (3%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++TPHDA+FK FL H +TARDF+EIHLP +LRELCDL++L LES SF++E L+ +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +SV+ + GY++VVIEHQS+ D MAFR+MRYS+A M RH+ LPLV+P+LFY Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-EADHDKLPLVVPILFYQ 123 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G +PYP S+CW D F P AR++YN+ FPLVD+T+ PDDEI+QHRR+A+LEL+QKHIR Sbjct: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 QRDLM L++QLV L+ + SQ+ A+ NY+L G + + R E +M Sbjct: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT-EQADLFYGVLRDRETGGESMM 242 Query: 246 TIAERIHNDGYIKGEQRI--------LRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 T+A+ G KG Q+ + LL G E + ++ L ++ + + Sbjct: 243 TLAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKVINLI 300 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 247 bits (631), Expect = 3e-64, Method: Composition-based stats. 
Identities = 136/318 (42%), Positives = 197/318 (61%), Gaps = 26/318 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT T HDALFK FLTHP+ ARDF +HLP ++ LCDL +L+LE ASFV+ +LR L Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP-LPLVI 119 HSD+L+SV+ EG+GYIY +IEHQS+ D M FRLM Y+M+ + H++ LPLV+ Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P LFY GS PYP+S+ WLD FADP A++LY +FPLVD++V+ D+EI+ H+ +ALLEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 180 IQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 +QKHIR RD LM ++ + ++ ++ Q+ +++ YI G + F S+L P Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSP 240 Query: 239 QHRERIMTIAERIHNDGYIKGEQRI------------------------LRLLLQNGADP 274 +++ + TIAE++ G KG ++ R LLQ G D Sbjct: 241 EYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLLQQGVDL 300 Query: 275 EWIQKITGLSAEQMQALR 292 I + TGL+ E++++L+ Sbjct: 301 NIIMQCTGLTREKIESLK 318 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 243 bits (620), Expect = 6e-63, Method: Composition-based stats. 
Identities = 123/323 (38%), Positives = 180/323 (55%), Gaps = 32/323 (9%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + PHD+ FK F++ D ARDF E+HLP ++ LC+ D+LKL SASFVD+ LR+ Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SD+L+SV+T +G GY Y ++EHQS D M +RLM Y+ M +H++ Q LPLV+P Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQG-HQSLPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG++SPYP+S W D F A LY PLVDVTV DDE++ HR+VA +EL+ Sbjct: 120 ILFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELV 179 Query: 181 QKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH R D+ GL ++L +L + + ++NY+ D + + L + + Sbjct: 180 FKHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEK 239 Query: 240 HRERIMTIAERIHNDGYIKGEQRILRL------------------------------LLQ 269 H+E +M IA+R+ N+G KG ++ + L+ Sbjct: 240 HQETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLK 299 Query: 270 NGADPEWIQKITGLSAEQMQALR 292 G + I +ITGLS + ALR Sbjct: 300 LGLSVDIISQITGLSPSDIHALR 322 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 240 bits (611), Expect = 7e-62, Method: Composition-based stats. 
Identities = 121/324 (37%), Positives = 179/324 (55%), Gaps = 33/324 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT + PHD+ FK F++ D ARDF EI+LP ++ LC+LD+LKL SASF+D+ LR+ Sbjct: 5 MTMQLIARPHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSR 64 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SD+L+SV+T +G GY Y+++EHQS D M +RLM Y+ M +H++ LPLV+P Sbjct: 65 FSDMLYSVQTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNA-LPLVVP 123 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG +SPYP+S W D F A LY PLVDVTV DDEIV HR+VA +EL+ Sbjct: 124 ILFYHGKQSPYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELV 183 Query: 181 QKHIRQRDLMGLI-DQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH RD + ++ ++L ++ + + ++NY+ D + + + L + Sbjct: 184 LKHSTLRDDLIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEG 243 Query: 240 HRERIMTIAERIHNDGYIKGEQRILRLLLQNG---------------------------- 271 ++E +MTIA+R+ N+G KG + G Sbjct: 244 YQETVMTIADRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSL 303 Query: 272 ---ADPEWIQKITGLSAEQMQALR 292 + I KITGL ++QA+R Sbjct: 304 DLGLSIDIISKITGLPHSEIQAMR 327 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. 
Identities = 137/295 (46%), Positives = 197/295 (66%), Gaps = 3/295 (1%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++TPHDA+FK FL H +TARDF++IHLP +LRELCDLD+L LES SF++E L+ +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +SV+ + GY++VVIEHQS+ D MAFR+MRYS+A M RH+ LPLV+P+LFY Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHL-EADHDKLPLVVPILFYQ 123 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G +PYP S+CW D F P AR++YN+ FPLVD+T+ PDDEI+QHRR+A+LEL+QKHIR Sbjct: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 QRDLM L++QLV L+ + SQ+ A+ NY+L G + + R + +M Sbjct: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHT-EQADLFYGVLRDRETGGKSMM 242 Query: 246 TIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ-MQALRQPLPERE 299 T+A+ G KG ++ + ++ G + Q +S E ++ L + +P + Sbjct: 243 TLAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPRED 297 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 236 bits (602), Expect = 8e-61, Method: Composition-based stats. 
Identities = 132/284 (46%), Positives = 176/284 (61%), Gaps = 7/284 (2%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDA+FK FL+ ARDF+ IHLP +RE CD ++L+LESASF+DEKLRA SD+L+S Sbjct: 2 PSHDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYS 61 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + T G GYIY VIEHQSR + MAFRL+RY +A MQ+H++ LPLV+P+LFYHG Sbjct: 62 LHTSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQG-HDRLPLVVPLLFYHGR 120 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 PYP+SL WLD FA P A+ LY FPLVD+TV+PDDEI HRR+ALLEL+QKHIR R Sbjct: 121 SRPYPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTR 180 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 D++ L ++ +L A S + I + F+E I ++ + Sbjct: 181 DMLELAREIGLLFERWAAPLSIGQEDIMTIAEQLKKMGFDEGIQR------GIQQGLAQG 234 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 E+ G ++I R LL G D +Q+ T L E+++ L Sbjct: 235 LEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLETEELEQL 278 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 235 bits (598), Expect = 2e-60, Method: Composition-based stats. 
Identities = 126/301 (41%), Positives = 185/301 (61%), Gaps = 14/301 (4%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 S PHDALFK FL+H AR F+EIHLP+ +RE CDLD L++ +F++ L AL+S Sbjct: 1 MSVVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYS 60 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D+L S+KT +G+GYIY +IEHQS D HM R+MRY++A +QRH++ +PLVIP+L Sbjct: 61 DVLLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEG-HHDVPLVIPIL 119 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 FY G SPYP+S+ WL+ F +P A++++ +FPLVDVTV+PD+EI+ HR VA LE+ K Sbjct: 120 FYQGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHK 179 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 IR RD++ ID + LL + +D ++ Y+L G+ + + L + PQ Sbjct: 180 IIRLRDILENIDPMATLLALDYNDD-LSIDVVFYLLRYGNTDDREKIVKILIQAKPQLEG 238 Query: 243 RIMTIAERIHNDGYIKGEQRI------------LRLLLQNGADPEWIQKITGLSAEQMQA 290 +IMTI E+ + +G Q + +L+ D I K+TGLS +++ Sbjct: 239 KIMTIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQ 298 Query: 291 L 291 L Sbjct: 299 L 299 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 226 bits (577), Expect = 5e-58, Method: Composition-based stats. 
Identities = 135/316 (42%), Positives = 186/316 (58%), Gaps = 25/316 (7%) Query: 1 MTNFTTSTP--HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 M N HD LFK FL PDTARDF+ +HLP D+R LD+LKLE SFVD+KLR Sbjct: 1 MDNEKGHNRPGHDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLR 60 Query: 59 ALHSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 LHSD+L+SV+T EG GYIY ++EHQS D MA+R+MRYSMAVM H++ LP+ Sbjct: 61 ELHSDVLYSVETAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKG-NGTLPV 119 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 V+P+LFY G PYP+S W+D F P AR++Y+ +PLVDV+V+ D ++ HRR+ALL Sbjct: 120 VVPLLFYQGMVRPYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALL 179 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE-FISELTRR 236 EL+Q+ IR RD L+ +V L+ +Q+ A+L YI+ G + F+ EL Sbjct: 180 ELVQRDIRHRDAASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGE 239 Query: 237 MPQHRERIM-TIAERIHN-------------------DGYIKGEQRILRLLLQNGADPEW 276 +P+++E IM TIA+++ + K LL NG E Sbjct: 240 IPEYKELIMGTIAQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEV 299 Query: 277 IQKITGLSAEQMQALR 292 + K TGL+ E ++ R Sbjct: 300 VIKSTGLNRETLEQPR 315 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 216 bits (550), Expect = 7e-55, Method: Composition-based stats. 
Identities = 114/328 (34%), Positives = 178/328 (54%), Gaps = 38/328 (11%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + HDA FK F+ + A+DF IHL +L+ CD +LKL+++SF+D KLR+ Sbjct: 1 MNKPLLISSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSR 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 SDIL+SVKT++G+ IY +IEHQSR D +A+R+M Y+ M +H++ LPLV+P Sbjct: 61 MSDILYSVKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQG-YTSLPLVVP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +LFYHG R PYP+S+ WLD F T A +LY F L+D+ + D+ ++ HR+ A++E+ Sbjct: 120 ILFYHGKRKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIA 179 Query: 181 QKHIRQ-RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 KH+ DL L L + + +D A++ Y+ D A F I+++ ++ Sbjct: 180 MKHVNSCDDLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDN 239 Query: 240 HRERIMTIAERIHNDGYIKGEQRI------------------------------------ 263 HRE IM IA R+ N G+ G+ Sbjct: 240 HRETIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQL 299 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQAL 291 + LL+ + E+I++ITGLS ++++ L Sbjct: 300 AKNLLKENVELEFIERITGLSIQELKIL 327 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 216 bits (550), Expect = 8e-55, Method: Composition-based stats. 
Identities = 108/305 (35%), Positives = 171/305 (56%), Gaps = 19/305 (6%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 HDA+FKTF T + A F+ I+LPK +++ CD +LK+E SFVD L+ HSDIL Sbjct: 5 IHNAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDIL 64 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 +S+K GY+Y+ +EHQS + M FR+ RY +A+MQ+H+ + LPLVI MLFYH Sbjct: 65 YSLKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNK-KLPLVISMLFYH 123 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G + YP+ L +D D A+ + L+D+ V+PD+EI +H+++A LE++QKHI Sbjct: 124 G-KGQYPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQKHIF 182 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 RDL + D +V L+ + L+ Y+L+ G+ A N+ I +L + + + E IM Sbjct: 183 TRDLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKL-KTIEDYEEDIM 241 Query: 246 TIAERIHNDGYIKGEQR----------------ILRLLLQNGADPEWIQKITGLSAEQMQ 289 A+++ G +G I + L+ G ++IQ +T LS ++ Sbjct: 242 NAAQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEVL 301 Query: 290 ALRQP 294 +L + Sbjct: 302 SLVEE 306 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 209 bits (531), Expect = 1e-52, Method: Composition-based stats. 
Identities = 110/272 (40%), Positives = 163/272 (59%), Gaps = 25/272 (9%) Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 L +L + S SF+++ L + SD+L+S+K+ GD YIY +IEHQS + MAFRL+RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 102 VMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT 161 M RH+E + +Q LP+VIP+LFYHGS SPYP++ WLD FAD A +Y AFPLVDVT Sbjct: 63 AMHRHLEQENKQ-LPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVT 121 Query: 162 VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG 221 + D+EI++HRR+AL+E++QKHIR R+++ L +L LL + Q L+ Y++L G Sbjct: 122 AMEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAG 181 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ-------------------- 261 + F+ L + P +RE +MTIAE++ G KG Q Sbjct: 182 NTTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKK 241 Query: 262 ----RILRLLLQNGADPEWIQKITGLSAEQMQ 289 +I R L NG + + ++ TGL+ + Sbjct: 242 QATLKIARQFLVNGVERDIVKMSTGLTDRDIN 273 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. 
Identities = 118/307 (38%), Positives = 174/307 (56%), Gaps = 24/307 (7%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 STPHD LFK F AR+F EIHLP + ++ SLK+ SF+D+ L+ HSD++ Sbjct: 3 ISTPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDMV 62 Query: 66 WSVKT-REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +S +T +GY+Y V+EHQS +D MAFR+ +YS+AVMQ+H++ LPLV+P+LFY Sbjct: 63 YSFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQG-HDTLPLVLPVLFY 121 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 HG +SPYP S+ W D F + AR L + FPLVDVT++P++EI++H ++ LE+ QK + Sbjct: 122 HGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMV 181 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 RD+M + L+ L ND +LL Y+ G+ A F L+ RE + Sbjct: 182 HTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSSTT--QRENV 239 Query: 245 MTIAERIH--------------------NDGYIKGEQRILRLLLQNGADPEWIQKITGLS 284 MTIAE + +G +G + I + LL NG + ++ TGLS Sbjct: 240 MTIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLS 299 Query: 285 AEQMQAL 291 + + L Sbjct: 300 EDSLNKL 306 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 202 bits (513), Expect = 1e-50, Method: Composition-based stats. 
Identities = 111/304 (36%), Positives = 162/304 (53%), Gaps = 19/304 (6%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 TS HDA F+ L P ARDF+E L + C+LD+++LE +FV E LR Sbjct: 4 KVNKTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSA 63 Query: 62 SDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D+L S+KT +G DGYIY +IEHQS D + R+MRY +AVM++HIE K P+VIP Sbjct: 64 CDVLLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEEHKCA--PVVIP 121 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLY--NAAFPLVDVTVVPDDEIVQHRRVALLE 178 +LFYHG++ PYP+ + W+D DP R++Y F LVDV+ + DDEI + R+A L Sbjct: 122 VLFYHGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALM 181 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 K D++ LI + + L + + + +L Y+L + F E ++ P Sbjct: 182 FTMKSGTSGDVIELIGKSIT-LTDKYGSSVHLNTVLTYLLELY-QMDFAELSEAVSTHYP 239 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRL------------LLQNGADPEWIQKITGLSAE 286 H+ IMTIAE++ G KG ++ L + Q G E I+ L+ E Sbjct: 240 SHKGVIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDE 299 Query: 287 QMQA 290 Q+ Sbjct: 300 QLLQ 303 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 196 bits (498), Expect = 7e-49, Method: Composition-based stats. 
Identities = 78/287 (27%), Positives = 129/287 (44%), Gaps = 14/287 (4%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 + +PHDA+F+ L P A + LP L DLD L + S VD LR H+ Sbjct: 1 MSSPPSPHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRHT 60 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPM 121 D+L++ + +IYV++EHQS D MAFR++RY + V R++ +H K LP V+P+ Sbjct: 61 DLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVPL 120 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTA----RKLYNAAFPLVDVTVVPDDEIVQHRR---- 173 + +H + + P A L F L D+ V + E+ + Sbjct: 121 VVHHNEHAWVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHSV 180 Query: 174 ---VALLELIQKHIRQ-RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEF 229 + LL+++ + R +DL +D+L +L + LL YI L G+ +E Sbjct: 181 RLTLLLLKIVPGNPRLAQDLRPWVDELRAVL-DGPDGREEFATLLRYIELVGEADARDEL 239 Query: 230 ISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEW 276 + P+ + MTIAE + +G ++G L ++ Sbjct: 240 HDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVESLLQLLTLKF 286 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. 
Identities = 71/313 (22%), Positives = 127/313 (40%), Gaps = 16/313 (5%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T+ +PHDALFK+ P A ++ L + + D +L+ E S++DE L HSD+ Sbjct: 4 TSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERHSDL 63 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S D Y+Y++IEHQS D M R++ Y V RH + LP ++P++ Sbjct: 64 LFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPVVVS 123 Query: 125 HGSRSPYPWSLCWLDE----FADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 H P + + D+T + D ++ + L+ Sbjct: 124 HAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFATLV 183 Query: 181 QKHIRQR-------DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 +R R D + + + +T + +YI + EF ++L Sbjct: 184 LWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFHAKL 243 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 +PQ RE + T E + +G KG + + G + I+ + + + + + Sbjct: 244 DEHVPQTREVMKTYYEELMEEGMAKGLAKG----REEGREQSRIETLQE-TLIDLLSAKF 298 Query: 294 PLPERERYSWLKS 306 L E E ++S Sbjct: 299 DLRELEHAERIRS 311 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. 
Identities = 90/247 (36%), Positives = 148/247 (59%), Gaps = 2/247 (0%) Query: 25 RDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQ 84 + F IHLP++L+ CD +L+L+++SF+D KLR+ SDIL+ VKT+EGD IY++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 85 SREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADP 144 SR D +A+R+M Y+ M +H++ + LPLV+P+LFYHG + PYP+ + W++ F Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQG-YKSLPLVVPILFYHGKKKPYPFPVNWMECFPLS 124 Query: 145 TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR-DLMGLIDQLVVLLVTE 203 + A +Y+ F L+D+T + DD ++ H++ A++E+ KH+ DL + L + + Sbjct: 125 SLANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQK 184 Query: 204 CANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRI 263 D A++ Y+ D + F I+++ R+ HRE IM IA R+ N G+ G Sbjct: 185 NCRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEG 244 Query: 264 LRLLLQN 270 + Sbjct: 245 FEIGKLK 251 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 191 bits (484), Expect = 3e-47, Method: Composition-based stats. 
Identities = 108/278 (38%), Positives = 149/278 (53%), Gaps = 12/278 (4%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 TT TPHDA F+ FLT PD ARDFME+HLP +LR +CDL +LKLES SFV++ LR Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIP 120 SD+L+S+KT GD I++ + S+ ++ F + A MQRH+E + LPLVIP Sbjct: 63 SDVLYSLKTTAGDD-IFMSWLNTSQHLTNICFPPDTLCVGAAMQRHLEAG-HKKLPLVIP 120 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF-PLVDVTVVPDDEIVQHRRVALLEL 179 +LFY G RSPYP+S WLDEF D R+ LVDVTV+PDDEI HR +A L L Sbjct: 121 VLFYTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTL 180 Query: 180 IQKHIRQ----RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 + ++I ++ + + A + T R + Sbjct: 181 LPENIFISGTWQNWLTGWRPFYGRISVFIAGNIAGTL----YSAGRRNIRRRSLCTRTGT 236 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 QH + +MTIA+++ G KG Q + ++ G Sbjct: 237 ACAQHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRS 274 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 191 bits (484), Expect = 4e-47, Method: Composition-based stats. 
Identities = 84/301 (27%), Positives = 161/301 (53%), Gaps = 15/301 (4%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 T HD LFK L+ A F++ L ++ +L ++++L+L SFV + R +HSDI Sbjct: 4 TIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIHSDI 63 Query: 65 LWSVKTREGDGYIYVVIEHQSRED-IHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 ++ + E GYI+ ++EH+S MAFR ++Y+++ M ++ + LP+V+P+ Sbjct: 64 VYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNK-KLPIVLPICV 122 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 YHG +SPYP S D F + AR++ F L+D+TV+ D+E+ + L+E++ KH Sbjct: 123 YHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLLKH 182 Query: 184 IRQRDLMGLI---DQLVVLLVTECANDSQITALLNYILLTGDE--ARFNEFISELTRRMP 238 R ++ + ++ + + L+ + + + I T DE + + L+ P Sbjct: 183 SRAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLSTAFP 242 Query: 239 QHRERIMTIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQA 290 + + +MT A+++ +G +G ++ I + LL +G + +Q++TGLS +++ Sbjct: 243 EEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEVMN 302 Query: 291 L 291 L Sbjct: 303 L 303 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 187 bits (474), Expect = 5e-46, Method: Composition-based stats. 
Identities = 79/312 (25%), Positives = 129/312 (41%), Gaps = 22/312 (7%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 TS PHDALF+ HP A + LP++L L D L+ + V L + Sbjct: 12 ESVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRT 71 Query: 63 DILWSVKTR-----EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 D+L+S +G +Y+ IEHQSR D M R++ Y + + +RH + LP Sbjct: 72 DLLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHG-GALPP 130 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADP-----TTARKLYNAAFPLVDVTVVPDDEIV--- 169 V ++ H ++ ++ F +P A L + D+ D E+ Sbjct: 131 VFCVVLSHAAKGWT-GPRSLVELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARH 189 Query: 170 QHRRVALLELIQKHIRQRD-----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEA 224 H AL + + R + L+ DQ++ LL + LL Y+ L G E Sbjct: 190 AHPLPALTLWLLRDARSPERLVHRLLDWRDQIIALLDYDHGERDLAQ-LLRYVALVGSEM 248 Query: 225 RFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGL 283 F EF + +P+ MTIAE++ + +G ++ R + G + + + G Sbjct: 249 DFEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREGRLEGQREGRAVGF 308 Query: 284 SAEQMQALRQPL 295 + Q L Q L Sbjct: 309 EEGRSQVLVQML 320 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 186 bits (472), Expect = 7e-46, Method: Composition-based stats. 
Identities = 69/275 (25%), Positives = 117/275 (42%), Gaps = 15/275 (5%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDALFK + + A + LP L D +L+L SFVDE L+ SD+L+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLVIPMLFYHG 126 E +Y++ EHQS + MAFRL+RY + + + H+ + LP ++P++ +H Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHHS 131 Query: 127 SRSPYPWSLCWLDEFADPTTAR-----KLYNAAFPLVDVTVVPDDEIVQHRRVAL---LE 178 + + D AR + F L D++ D+ + A + Sbjct: 132 ETGWTAAT-SFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVL 190 Query: 179 LIQKHIRQRDL----MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI-SEL 233 +H R+ D +G LV + + A+ YIL T + +E + L Sbjct: 191 WCLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLL 250 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLL 268 +E I++ A+++ G +G + LR Sbjct: 251 AAAGEPWKEEIVSAADQLMERGRQQGLREGLREGR 285 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 183 bits (463), Expect = 9e-45, Method: Composition-based stats. 
Identities = 78/286 (27%), Positives = 146/286 (51%), Gaps = 10/286 (3%) Query: 13 LFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTRE 72 +F+ L +P A +F HLP +++ L D SL +E+ +FV+ L+ SD+L+S K + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 73 GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSRSPY 131 DGY+++++EHQS+ D MAFRL +Y + + +R+ I++ K + LPL+ PM+F++G Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQEKYN 142 Query: 132 PWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMG 191 W D F + A++L+ + LV+V +PD+E Q +LE KHI +R+L+ Sbjct: 143 VARNLW-DLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 192 LIDQLVV---LLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM--PQHRERIMT 246 ++ L + +L Y L ++A + + L+ ++ + + Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTKIEQADKIKLKNLLSTKLNPEIGTRLMRS 261 Query: 247 IAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 +AE +G G L++ G I + G+ + + ++ Sbjct: 262 LAEHWQQEGKEIGILEGLQVGEAKGI---QIGEAKGIQIGKAEGIQ 304 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 180 bits (457), Expect = 4e-44, Method: Composition-based stats. 
Identities = 64/334 (19%), Positives = 135/334 (40%), Gaps = 42/334 (12%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 P+D ++ L + ++ + + E D D L L + S+V + +D++ Sbjct: 11 PHHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADVV 70 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE-------HDKRQPLPLV 118 + +KTR + YV++E QS D M FRL+ Y + + + K LP + Sbjct: 71 YRLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPPI 130 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL-L 177 IP + Y+G+ S + + L + + L DV ++E+++ + + Sbjct: 131 IPAVLYNGAGSWTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIAGI 190 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECAND------------------------------ 207 L+ + ++ DL G + +L +L ++ Sbjct: 191 FLLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGILNA 250 Query: 208 ---SQITALLNYILLTGDEARFNEFISELTRRMPQHR-ERIMTIAERIHNDGYIKGEQRI 263 ++ ++ + LT +E + + L + + E + +G ++G++ + Sbjct: 251 SNPWEVERMIYNLELTLEEMQRQALLKGLKEGEQKGKLEGKLEGKLEGKLEGKLEGKREV 310 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 R LL D E I K TGL+ E++ AL++ + + Sbjct: 311 ARNLLLLNVDIETIIKATGLALEEINALKKQMEQ 344 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 180 bits (457), Expect = 5e-44, Method: Composition-based stats. 
Identities = 68/323 (21%), Positives = 133/323 (41%), Gaps = 26/323 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT + PHD K L+ PD + LPK++ EL + L +F+D + R Sbjct: 1 MTK--ITQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREH 58 Query: 61 HSDILWSVKTREGDG-YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 +D L+ VKT+EG YIY +IEH+S D +AF+L+RY + + +R ++ +Q LP ++ Sbjct: 59 LTDRLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEG-QQKLPPIV 117 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P++ YHG+R + AD L + +F + D+ + DD++ Q + + Sbjct: 118 PLVVYHGAREWTVPNQFSALLEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALM 177 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 K+ Q + + + + +L Y++ T + + P Sbjct: 178 AMKYAFQG--AEGVVVIPQIGKGAQGDPEFAKLVLRYLIQTYRGMTMADVQAYAEEAFPG 235 Query: 240 HRERI-----MTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQ---------------K 279 E + + +G +G + + Q G ++ K Sbjct: 236 EAEHYASQFAREMMSKGRQEGRQEGRREGRQEGRQEGESSLLLRLLHRRFGDVPSWAELK 295 Query: 280 ITGLSAEQMQALRQPLPERERYS 302 + + ++++ + + + E Sbjct: 296 VANATIDELETWGEQIFDAETLE 318 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 180 bits (456), Expect = 5e-44, Method: Composition-based stats. 
Identities = 92/303 (30%), Positives = 160/303 (52%), Gaps = 11/303 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 S PHD L K L+HP+ ++F + + P D+ + DL SLKL + S+V E+LR H+ Sbjct: 6 KNDLSNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHN 65 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-DKRQPLPLVIPM 121 D+++S + GY + V+EHQS D MA R ++Y++A+++ +I+ ++ P P+++ + Sbjct: 66 DLVFSFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNI 125 Query: 122 LFYHG-SRSPYPWSLCWLDEFADPTTARKLYNA-AFPLVDVTVVPDDEIVQHRRVALLEL 179 YH + PYP+S D F DP TA+ L F L D+ P++ + QH + L+E Sbjct: 126 CLYHNANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEK 185 Query: 180 IQKHIRQRDLMGLID-QLVVLLVTECANDSQITALLNYILLTGDEARFNE--FISELTRR 236 + K+ R RD+ +I+ +L +L Y + +E +S Sbjct: 186 LLKYSRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEV 245 Query: 237 MPQHRERIM-----TIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + ++ E IM TI ER G + + I + +L+ G + +I++ITGLS + ++ L Sbjct: 246 LSKNEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIEKL 305 Query: 292 RQP 294 +Q Sbjct: 306 KQE 308 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 179 bits (455), Expect = 8e-44, Method: Composition-based stats. 
Identities = 73/308 (23%), Positives = 127/308 (41%), Gaps = 15/308 (4%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 HD+L K D A D LP + E DLD L L SFV ++LR H+D+L Sbjct: 2 PHDSHDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLL 61 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLVIPMLFY 124 + ++Y+++EHQS + M RL+RY ++ +RH+ LP ++P++ + Sbjct: 62 FRAPLDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLH 121 Query: 125 HGSRSPYPWS----LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA---LL 177 H + + L L + A L F L D++ PD+ ++ A L Sbjct: 122 HSEQGWTAPTSLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKLA 181 Query: 178 ELIQKHIRQ-RDLMGLID---QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 K+ R +DL+ L+ +++ VT + A++ Y L D + Sbjct: 182 LWALKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTDPDALMRFLI 241 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 +E MT AE++ + ++ G ++ G + +ALR Sbjct: 242 DSAGDPAKEAFMTGAEKLTQAVREQSLRQGRVEGRVEGRVEGRVE---GRVEGRTEALRT 298 Query: 294 PLPERERY 301 L ++ R Sbjct: 299 VLSKQLRQ 306 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 73/314 (23%), Positives = 134/314 (42%), Gaps = 30/314 (9%) Query: 14 FKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREG 73 +K +HP+ RD + + + E D +L+ S S++ E LR D++W V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 74 DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLFYHGSRSP 130 Y+Y+++E QS D MA R+M Y + Q I + LP V+P++ Y+G + Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 131 YPW-SLCWLDEFADPTTARKLYNAAFPLVDVT-VVPDDEIVQH-RRVALLELIQKHIR-Q 186 ++ L E R N A+ L+D V+ D E H R VA +H R + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 187 RDLMGLIDQLVVLLVTECAN--DSQITALLNYILLTG--------------DEARFNEFI 230 +D++ ++ LV L + +LL D ++ + Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 231 SELTRRMPQHRERIM------TIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGL 283 +E ++ P+ E + +G +G ++ R L++ G E I + TGL Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 284 SAEQMQALRQPLPE 297 + +++ LR+ + Sbjct: 313 TVAEVEGLREEDTQ 326 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 177 bits (449), Expect = 3e-43, Method: Composition-based stats. 
Identities = 65/317 (20%), Positives = 115/317 (36%), Gaps = 25/317 (7%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 + HD +K + P+ RD + +P D D +L+ S+V E DI+ Sbjct: 1 MANTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIV 60 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPML 122 W VK Y+Y++IE QS D +MA R+M Y + Q I+ + LP V+P++ Sbjct: 61 WRVKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIV 120 Query: 123 FYHGSRSPYPWSLCWLDEFADPTT-ARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 Y+GS+ + + P + + L+D D E+ + + Sbjct: 121 LYNGSQRWSAVTDVFELIPPVPGLVEQFKPRLKYLLIDENAWSDSELASLKNLVAAVFRI 180 Query: 182 KHIRQRDLMGLIDQLV-VLLVTECANDSQITALLNYILLTGDEARFN-EFISELTRRMPQ 239 +H +G + L+ L + L+ E R I +L Sbjct: 181 EHPASPAAIGDLLSLLDEWLAERPDLRRMFALWIRATLMRKAEYRIVLPRIDDLQELNVM 240 Query: 240 HRERIMTIAERIHNDGYIKGEQRILRLLLQNG-------------------ADPEWIQKI 280 ER+ A+ +G +G+ G P+ + +I Sbjct: 241 LAERLEEWAQAYKAEGKAEGKAEGKAEGKAEGKAEGEALALQKLLKKRFGAVPPDVLAQI 300 Query: 281 TGLSAEQMQALRQPLPE 297 + S EQ+ A + + Sbjct: 301 SRASLEQIDAWLDQVLD 317 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. 
Identities = 107/198 (54%), Positives = 144/198 (72%), Gaps = 1/198 (0%) Query: 96 MRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF 155 MRY++A MQ H++ + LP+V+P+LFYHG SPYP+SLCWLD FADP AR+LY +AF Sbjct: 1 MRYAIAAMQNHLDAG-YKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAF 59 Query: 156 PLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLN 215 PL+DVT++PDDEI+ HRR+ALLELIQKHIRQRDLMGL++Q+ LL + AN QI L N Sbjct: 60 PLIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFN 119 Query: 216 YILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPE 275 YIL TGD RFN+FI + +R P+H+ +MTIAER+ +G I +++L++G Sbjct: 120 YILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLRQEGEQSKALHIAKIMLESGVPLA 179 Query: 276 WIQKITGLSAEQMQALRQ 293 I + TG+S E++ A Q Sbjct: 180 DIMRFTGVSEEELAAASQ 197 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 76/333 (22%), Positives = 140/333 (42%), Gaps = 42/333 (12%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 PHD FK AR F++ +LP+++ L DL+++ + S++D++L+ S Sbjct: 1 MSLIHNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFS 60 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D+L+ VK + +GY+Y + EH+S +A +L++Y + + + ++ K LPL+IPM+ Sbjct: 61 DLLFQVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMV 120 Query: 123 FYHGSRSPYPWSLCWLDEFADPTT----ARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 YHG + + + L D++ D E+V + + ++ Sbjct: 121 VYHGQEKWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIIL 180 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQIT------ALLNYILLTGDEARFNEFISE 232 + I +D + L LL++ + Q L+ YIL T + Sbjct: 181 RTMRDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEI 240 Query: 233 LTRRMPQHRERIMTIAERIHNDGYIKG--------------------------------E 260 + E +MTIAE++ +G KG + Sbjct: 241 AKEVSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETK 300 Query: 261 QRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + R 
LL G + + + K TGLS E+++ L Sbjct: 301 LEVARNLLGLGIEMDKVAKATGLSEEEIRKLMN 333 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 174 bits (440), Expect = 4e-42, Method: Composition-based stats. Identities = 98/205 (47%), Positives = 136/205 (66%), Gaps = 9/205 (4%) Query: 91 MAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL 150 M FR++RYS+A MQRH+E K LPLVIP+LFYHG RSPYP+S+ WLD F +P A K+ Sbjct: 1 MPFRMLRYSVAAMQRHLEQHK--TLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKI 58 Query: 151 YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI 210 Y FPLVD+TVV D+EI+ HRR+A L L+ KHIR RD+M L+D+L ++V +D Q+ Sbjct: 59 YTKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMVEI--SDEQV 116 Query: 211 TALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRL 266 L++YI+ GD EF+ L R+PQH +++MTIAER+ G +G I Sbjct: 117 RVLIHYIVNAGDSVSP-EFMRALAERLPQHEDKLMTIAERLEQKGRQEGALEKALAIACQ 175 Query: 267 LLQNGADPEWIQKITGLSAEQMQAL 291 L + G PE I++ TGLS +++ + Sbjct: 176 LQKMGMTPEQIKQATGLSEAELKNI 200 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 172 bits (435), Expect = 1e-41, Method: Composition-based stats. 
Identities = 69/307 (22%), Positives = 134/307 (43%), Gaps = 21/307 (6%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 PHD FK + A+DFM +LP +L ++ D+++L E ++++ L+ SD+L Sbjct: 4 IHQPHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLL 63 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + +GY+Y + EH+S +A +L+ Y + + +K++ +P++IPM YH Sbjct: 64 FKANINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYH 123 Query: 126 GSRSPYPW----SLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 G + L E + + + + D++ DDE+ ++ ++ I Sbjct: 124 GKENWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKIL 183 Query: 182 KHIRQRD-----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 + I + D + +++ L + + YIL E + Sbjct: 184 RSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLVKEV 243 Query: 237 MPQHRERIMTIAERIHNDGYIKG------------EQRILRLLLQNGADPEWIQKITGLS 284 + + IMTIAE + +G KG ++ + R L+ G + + + K TGLS Sbjct: 244 SVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGLS 303 Query: 285 AEQMQAL 291 E++ L Sbjct: 304 EEEINKL 310 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. 
Identities = 85/303 (28%), Positives = 146/303 (48%), Gaps = 31/303 (10%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 +N + HD LFK ++ P AR+F+E +LP + +L+S+K+E SFV E LR Sbjct: 33 SNTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRL 92 Query: 62 SDILWSVKTREGD--------------GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH- 106 SD+++SV + + Y+YV+IEHQS D +AFRL +Y + + +RH Sbjct: 93 SDVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHK 152 Query: 107 --------IEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLV 158 + +K LPL+ P++ Y + PY + + F D TA+ + + LV Sbjct: 153 DANNNKSSVTKEKDNKLPLICPIVVY-ANDKPYNAPRSFWELFEDSKTAKDMMGDEYLLV 211 Query: 159 DVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI-----TAL 213 D+ DDEI + + + ++E + KHI+ RD++ L L+ + D + L Sbjct: 212 DLQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWL 271 Query: 214 LNYILLTGDEARFNEFISELTRRM--PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 L Y E + E S + + + E + TIA++ ++G KG + +++ G Sbjct: 272 LWYSDAKVSEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKGMVQGMQIGEARG 331 Query: 272 ADP 274 Sbjct: 332 MQI 334 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 170 bits (431), Expect = 5e-41, Method: Composition-based stats. 
Identities = 60/320 (18%), Positives = 121/320 (37%), Gaps = 26/320 (8%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N HD +K +H +T +F+ K+ L + D L L S++ Sbjct: 1 MKNNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEE 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-------QRHIEHDKRQ 113 SDIL+ + + YV++E QS+ D M RL+ Y + +++ K Sbjct: 61 ESDILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNF 120 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HR 172 LP ++P++ Y+G + + + L D+ D E++ Sbjct: 121 KLPSIVPIVLYNGKNKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNISN 180 Query: 173 RVALLELIQKHIRQRDLMG------------------LIDQLVVLLVTECANDSQITALL 214 ++ + L+ + I +++LM + + + +V D+ + Sbjct: 181 MISAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDNLQGEID 240 Query: 215 NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADP 274 + + + E + + I ++ G +G ++ + ++ G D Sbjct: 241 DVLEKSNQEEVDFMVSNLGKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAIEMGMDN 300 Query: 275 EWIQKITGLSAEQMQALRQP 294 E I +TGLS EQ+ +RQ Sbjct: 301 EIIMNLTGLSEEQINTIRQE 320 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 170 bits (431), Expect = 5e-41, Method: Composition-based stats. 
Identities = 87/345 (25%), Positives = 146/345 (42%), Gaps = 62/345 (17%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 HD LFK + P A DF+ LP +++ + DL+++K+E SFV+ LR D+L+SV Sbjct: 6 KHDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDVLFSV 65 Query: 69 KTR-EGDGYIYVVIEHQSREDIHMAFRLMRYSMAV-----MQRHIEHDKRQPLPLVIPML 122 KT+ D +IYV+IE + R D +AF+L +Y++++ +R LP+V+P++ Sbjct: 66 KTKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIVVPIV 125 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 YHG+ + F DP A++L + + L+D +PD EI + AL+ ++ Sbjct: 126 VYHGADRFN-APRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVHFMKY 184 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQ-----ITALLNYILLTGDEARFNEFISELTRR- 236 Q D++ L + L D + I +LL Y + + L Sbjct: 185 IHNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQLLDENL 244 Query: 237 -MPQHRERIMTIAERIHNDGYIKGEQRI-------------------------------- 263 + + TIA + ++G KG Sbjct: 245 SIEDRDRIMGTIAAQYIDEGKAKGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEG 304 Query: 264 ----------------LRLLLQNGADPEWIQKITGLSAEQMQALR 292 R LL+ G E+I + TGLS E++ L+ Sbjct: 305 IEIGETKGRAEAAQGLARNLLKAGFSVEFIAENTGLSNEEVVNLK 349 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 170 bits (430), Expect = 6e-41, Method: Composition-based stats. 
Identities = 87/304 (28%), Positives = 150/304 (49%), Gaps = 14/304 (4%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESAS-------FVDEKLR 58 HD++FK + + D A F+ +LPK+L EL D ++KLESA+ D + + Sbjct: 1 MKNVHDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQ 60 Query: 59 ALHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP-LP 116 SD+ + K ++G +G ++V IE Q+ +D + R Y + + +I+ K LP Sbjct: 61 KEQSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLP 120 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 LV+ + Y+ ++ P+ SL D FA+ A+K Y +D+ D+EI++H +A Sbjct: 121 LVVSI-IYYANQKPFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAG 178 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 ELI K IR++++ G +D + + E + L+ Y+ D +F +L Sbjct: 179 YELILKAIREKNIDGKLDIAINQI--EAYDHIARQVLIRYMSQYSD-METKDFHDKLIYS 235 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 P R +MT+AE+ G KG Q R L G E + K TGL + + L++ + Sbjct: 236 KPDLRGDVMTVAEQWEQKGIQKGIQTTARNFLLMGLSAEQVVKGTGLDQDTVLKLKKEVE 295 Query: 297 ERER 300 + + Sbjct: 296 QTQH 299 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 169 bits (427), Expect = 1e-40, Method: Composition-based stats. 
Identities = 86/302 (28%), Positives = 160/302 (52%), Gaps = 19/302 (6%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 HD+L K +T A++F+E +LP+D ++L DL + +E S+++E L +SDI++ + Sbjct: 6 KHDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGI 65 Query: 69 KTRE-GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 +T+E G G++Y++IE QS D A RL +Y++ + +RH +KR LPLV ++ Y+G Sbjct: 66 ETKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERH--KEKRNKLPLVYNLVIYNGK 123 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 + Y D F + A+KL + LVD+ + D+EIV+ + + +L+ I KHI +R Sbjct: 124 QV-YNAPRNLWDLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHER 182 Query: 188 DLMGLIDQLVVLLVTECANDSQI-----TALLNYILLTGDEARFNEFISELTRRM-PQHR 241 D++ L +Q + D + + L Y + + + + + PQH+ Sbjct: 183 DMIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHK 242 Query: 242 ERIM-TIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + IM TIA+ ++G +G++ I + + G I ++TGL ++++ Sbjct: 243 DNIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSII 302 Query: 293 QP 294 + Sbjct: 303 ES 304 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 169 bits (427), Expect = 1e-40, Method: Composition-based stats. 
Identities = 74/305 (24%), Positives = 138/305 (45%), Gaps = 16/305 (5%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK + P+ DF+ P+ +RE D +L E +F DE+L +D+++S Sbjct: 7 NPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHFADLVFS 66 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 V+ + +++EH+S + + F++ RY + + + I+ +QPL V+P+L YHG+ Sbjct: 67 VQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQ--KQPLTPVLPVLVYHGN 124 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL---LELIQKHI 184 R S+ T L + L+D++ + D+ + + L+Q Sbjct: 125 RRWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAILLQNSR 184 Query: 185 RQRDLMGLID---QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 R+R+L L+D +V L A ++ Y+ T + + E +R + Sbjct: 185 RKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKV-ELFGIFSRISSKIE 243 Query: 242 ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ-MQALRQPLPERER 300 MT+AE + +G ++ ++ E IQ+ L Q M A + L ++ER Sbjct: 244 SSTMTVAEELIQEGRELERRQT--RMVAE----ELIQQGRELERRQAMMAAEELLKQQER 297 Query: 301 YSWLK 305 + +K Sbjct: 298 QNKIK 302 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. 
Identities = 65/285 (22%), Positives = 128/285 (44%), Gaps = 8/285 (2%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 + PHD K L++P TA + LP+++ E D +L SF+DE LR + Sbjct: 1 MTEIAHPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLT 60 Query: 63 DILWSVKT-REGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIP 120 D L+ V+T +YV+IEH+S D+ + ++L++Y + A+ Q E+ + LP ++P Sbjct: 61 DRLYRVRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVP 120 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 +FYHG+ + A+ L N F ++D+ + D ++ + + L Sbjct: 121 FVFYHGAAAWKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLA 180 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 K+ + D + +L++ + A D + L+ Y++ T + R P+ Sbjct: 181 AKYATRDDRQLEVKELLIQTLVSVA-DEEFRFLMRYVVETYRSYDEPMVREIIRRVRPEE 239 Query: 241 RERIMTIA-----ERIHNDGYIKGEQRILRLLLQNGADPEWIQKI 280 E +M++ + +G +G Q + ++ G ++ Sbjct: 240 EETMMSMFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEEA 284 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 167 bits (422), Expect = 5e-40, Method: Composition-based stats. 
Identities = 56/307 (18%), Positives = 114/307 (37%), Gaps = 26/307 (8%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +K +HP+ D + L L CDL +L+ + S+V + LR DI+W + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR--QPLPLVIPMLFYHGSR 128 + +Y++IE QS+ D M R+M Y + Q I +P +IP++ Y+G Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGEI 124 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR- 187 +R + + + L+D + +++ R +A + Sbjct: 125 PWKVPHDIRETIQMPKPVSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSSGPL 184 Query: 188 ---DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE-------FISELTRRM 237 +L +++ + + + L D+ + + + + Sbjct: 185 ELFELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAERVNKWI 244 Query: 238 PQHRERIMTIAE-------------RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLS 284 Q++ + +G ++G IL+ + + G I ITGL Sbjct: 245 AQYKAEGRKEGKEEGKKEGLLEGRVEGKLEGKLEGMATILKRMKEKGMSVTEIATITGLP 304 Query: 285 AEQMQAL 291 +++Q L Sbjct: 305 EDEIQHL 311 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 166 bits (421), Expect = 8e-40, Method: Composition-based stats. 
Identities = 60/320 (18%), Positives = 115/320 (35%), Gaps = 29/320 (9%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 F S+ D+L+K HP+ RD + L D +++ + +AS+ + H D Sbjct: 37 FFMSSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDD 96 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLVIP 120 ++W + Y+Y+++E Q+R D MA R+ Y + Q + K LP V+P Sbjct: 97 VVWRARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLP 156 Query: 121 MLFYHGSRSPYPWSLCWLDEFADP-TTARKLYNAAFPLVDVTVVPDDEIV---------- 169 ++ YHG + P R + + L+D V Sbjct: 157 VVLYHGRGPWRAATALASLMLPAPSGLERYQPSQRYLLIDQHHGTARADVVSLLFRLLDA 216 Query: 170 --QHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAN---------DSQITALLNYIL 218 + L+L+ + IR RD+ + D L + + + T + Sbjct: 217 ATDLQLREALDLLAERIRARDMDPVRDSLTRWIQLTLQDAAVETSMDLEEAFTMKMRRKF 276 Query: 219 LTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQ 278 + F L + + I+ ++ +G +G L G + + Sbjct: 277 SYDEMFDPGMFERPLAKA---REKAIVEGLQQGREEGLERGRVEGLERGRVEGLERGREE 333 Query: 279 KIT-GLSAEQMQALRQPLPE 297 + GL + L++ L + Sbjct: 334 GLKAGLQEGLQEGLKEGLQQ 353 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. 
Identities = 76/313 (24%), Positives = 144/313 (46%), Gaps = 28/313 (8%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 + HD + ++ +P +++F E+HLP ++ L + LK+E SFVD++L+ DI Sbjct: 2 SQKPKHDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDI 61 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 L+S K E GY+Y+++EHQS + MA RL RY + + H + K + P + P++FY Sbjct: 62 LFSAKFGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFY 121 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 +G + Y + F + + ++ + L++V +PD+++ + +L+ KHI Sbjct: 122 NGVQK-YNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHI 180 Query: 185 RQRDLMGLIDQLVVLLVTECAND---SQITALLNYILLTGDEARFNEFISELTRRM--PQ 239 +RDL+ +++ LL D I +L Y L + E L ++ + Sbjct: 181 HERDLLKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKK 240 Query: 240 HRERIMTIAERIHNDGY----------------------IKGEQRILRLLLQNGADPEWI 277 + +IA G + + + + +++ G E + Sbjct: 241 RENVMKSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESV 300 Query: 278 QKITGLSAEQMQA 290 KIT LS E ++ Sbjct: 301 IKITKLSKEDLEK 313 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. 
Identities = 69/336 (20%), Positives = 132/336 (39%), Gaps = 44/336 (13%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M +D FK + + +F++ + + + DL SL+ SFV ++ Sbjct: 1 MEQKPPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEK 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D+++ K + D Y YV++E QS D M RL Y + QRHIE K L ++P Sbjct: 61 EADVIYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVP 120 Query: 121 MLFYHGSRSPYPWSLCW--LDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 ++ Y+G + +L + + F D + LVDV + D+ + + + Sbjct: 121 IVLYNGRSNWNVPTLIFKGWEIFKDDM-------FNYFLVDVNNIDDETLKNRLDLLSVI 173 Query: 179 LIQKHIRQ--RDLMGLIDQLVVLLVTECAND-SQITALLNYILLTGDEARFNEFISELTR 235 L R+ ++ + + ++ + L ++ I EL + Sbjct: 174 LYLDRSRKTAKEFIEKLKEVTEYISCLPTEQVKVFAMWLLRVIRPQMMEEVQGEIDELLK 233 Query: 236 RMPQ--------------------------------HRERIMTIAERIHNDGYIKGEQRI 263 R+ Q + E + +G ++ RI Sbjct: 234 RIEQEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRI 293 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERE 299 R ++ GA+ +I K+TGL E+++ LRQ + ++E Sbjct: 294 ARNMILAGAEDSFISKVTGLDIEKIKELRQNMTDKE 329 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 163 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 62/216 (28%), Positives = 114/216 (52%), Gaps = 5/216 (2%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 HD F+ L++P AR+F E +LP +++ L +L LE+ SF+D L+ +D+L+S Sbjct: 55 KHDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSA 114 Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGS 127 + D YIY++ EHQS D HMAFRL +Y + + ++H I H + P + P++ Y Sbjct: 115 RINNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSND 173 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 Y L D F + + ++ + L+ + + DD++ ++ +A L+++ K+I + Sbjct: 174 HKKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKP 233 Query: 188 DLMGLIDQLVVLLVTECAND---SQITALLNYILLT 220 ++ ++ L T A+ I + L+Y L Sbjct: 234 NVFDKWQEISGCLATIAASSSGIEYIKSALSYSLTK 269 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 162 bits (409), Expect = 1e-38, Method: Composition-based stats. Identities = 62/334 (18%), Positives = 130/334 (38%), Gaps = 44/334 (13%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M +D FK + +F+ ++ ++ D +SL+ SF+ ++ Sbjct: 1 MQQKVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEK 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D+++ + + D Y YV+IE QS D +M RL Y + +RH+E + LP ++P Sbjct: 61 EADVIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVP 120 Query: 121 MLFYHGSRSPYPWSLCW--LDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 ++ Y+G + + D F D + LVDV + D+++ + + Sbjct: 121 IVLYNGRSGWNIPTQIFKGFDIFKDDM-------FNYILVDVNRLDDEKLKSRLDLLSII 173 Query: 179 LIQKHIRQ--RDLMGLIDQLVVLLVTECA-NDSQITALLNYILLTGDEARFNEFISELTR 235 L + R+ + + + ++ + + L I+ I EL + Sbjct: 174 LYLEKSRRNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDELLK 233 Query: 236 RMP--------------------------------QHRERIMTIAERIHNDGYIKGEQRI 263 ++ + E I + +G + E+ I Sbjct: 234 KIEAEGVEDVGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKEEEI 293 Query: 264 
LRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 +R L+Q G + +I + TG+ E+++ +R+ E Sbjct: 294 VRRLIQKGFNDNFIAEATGVEIERIKKIREEYTE 327 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 161 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 77/313 (24%), Positives = 139/313 (44%), Gaps = 13/313 (4%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 N PHD FK + + ARDF++ +LP++ E+ DLD L E+ S VDE LR S Sbjct: 2 NELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLS 61 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D+L+ K + DGYIY+++EH+S + + F+L+RY ++ + K + +P++IPM+ Sbjct: 62 DMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEE-KYDPKTKKVPIIIPMV 120 Query: 123 FYHGSRSPYPWSLCWL----DEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 YHG + E L + + D ++ I+ + + Sbjct: 121 IYHGREIWNVETNLLNMVQGIEDLPNELKTYLPTYRYEICDFSIKRKKRIIGLTAMKVAI 180 Query: 179 LIQK---HIRQRDLMGLIDQLVVLLVTECAN--DSQITALLNYILLTGDEARFNEFISEL 233 + + + + + ++ + + Y+L ++ E + Sbjct: 181 EAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEILKVQ 240 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 MP E +MTIAE++ N+G KG+ R G + E+ +I LS L + Sbjct: 241 KEIMPGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEG-EREFAIRI--LSKRFGNQLTE 297 Query: 294 PLPERERYSWLKS 306 + +R R + K+ Sbjct: 298 EIKDRIREADEKT 310 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. 
Identities = 64/314 (20%), Positives = 138/314 (43%), Gaps = 22/314 (7%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T HDA + + + A D+ +P+++++L D +L+ ++V ++L+ Sbjct: 1 MDKHTP--KHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKS 58 Query: 61 HSDILWSVK--TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 SDI++ + + G+ I +++EH+S D + ++ Y + + + I + + L + Sbjct: 59 ISDIVYVCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQIGNKESPSL--I 116 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI--VQHRRVAL 176 IP+L YHG+ ++ L E +P + + + + D+ + D+EI + ++ +A Sbjct: 117 IPILLYHGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAA 176 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 L K+ +D + + L +L D + L + L G+ +F++ + Sbjct: 177 SLLAMKYSALKDQLNTL--LPTILTLASEVDRNLHKSLLFYTLVGNPLTEEQFLNLIKSV 234 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRI-----------LRLLLQNG-ADPEWIQKITGLS 284 Q +E IM I E G+ KG + +R L++ E I ++ Sbjct: 235 PNQKKEAIMDIFEIFEEKGWKKGIEEGRAEAEQKIETAVRNLIKQSVLTDEQIASAMNVT 294 Query: 285 AEQMQALRQPLPER 298 + + +R L Sbjct: 295 TDYVAEVRNNLAAE 308 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. 
Identities = 94/348 (27%), Positives = 156/348 (44%), Gaps = 62/348 (17%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M HDAL K LT A++F+E +LP D +EL DL +K+E SFV++ L+ Sbjct: 1 MAKKL---KHDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRK 57 Query: 61 HSDILWSVKTRE-GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 +SDI++SVKTR+ + ++YV+IE QS D +A RL +Y + + +RH + + LPL+ Sbjct: 58 YSDIIYSVKTRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH--ENNKNKLPLIC 115 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P+L Y+GS Y + + F P A+KL + LVD+ DDEI Q + + ++E Sbjct: 116 PLLIYNGSEV-YNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEY 174 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQI-----TALLNYILLTGDEARFNEFISELT 234 KHI QRD++ L D+ ++ D + + + Y E + E + Sbjct: 175 FLKHIHQRDMLKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQIIV 234 Query: 235 RRM--PQHRERIMTIAERIHNDGYI----------------------------------- 257 + + + + TIA++ ++G Sbjct: 235 KHLSTEEKDNIMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGK 294 Query: 258 -------------KGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + I R +L G D +I +TGL +++L Sbjct: 295 AEGKAEGKAEGKAEERVEIARKMLSQGCDFSFISSVTGLEEAFIRSLS 342 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 158 bits (398), Expect = 3e-37, Method: Composition-based stats. 
Identities = 63/320 (19%), Positives = 131/320 (40%), Gaps = 32/320 (10%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + HD +K +HP+ + +E P ++ L D ++LK S +++ D++W Sbjct: 3 TNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVW 62 Query: 67 SVKTR----EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVI 119 SV+ ++Y+++E QS+ D M RLM Y ++ + RQ LP + Sbjct: 63 SVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHYVACFYDHLLKTRETTVRQGLPPIF 122 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYN--AAFPLVDVTVVPDDEIVQHR-RVAL 176 PM+ Y+GS+ + P ++Y + L+D D+E++ R ++ Sbjct: 123 PMVLYNGSQRWSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTPLSG 182 Query: 177 LELIQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYI------------LLTGDE 223 + ++ + L +D++V ++ + D + +I L Sbjct: 183 IFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARLNLDRM 242 Query: 224 ARFNEFISELTRRMPQ-HRERIMTIAERIHNDGYIKG-------EQRILRLLLQNG-ADP 274 + E + L + ++ + + +G +G +++ +R LL G Sbjct: 243 SSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGVLSN 302 Query: 275 EWIQKITGLSAEQMQALRQP 294 + I TGLS +++ LR Sbjct: 303 DQIAVATGLSVDEIDKLRIE 322 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 158 bits (398), Expect = 4e-37, Method: Composition-based stats. 
Identities = 58/304 (19%), Positives = 123/304 (40%), Gaps = 13/304 (4%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD+ FK + P + ++I + + S+ + K + D+L+S Sbjct: 4 QPHDSFFKQIFSDPRRVKTLLDIFAKDVAKS---IHSITPVNTEKFSSKSQKFMLDLLFS 60 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K ++ D YI +V+EH+S D + +L Y+ A+ + I+ + P +I ++FYHG Sbjct: 61 CKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKEKEY--YPPIINIVFYHGK 118 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE----LIQKH 183 + L D + + + L+D+ V DDE++ + + KH Sbjct: 119 GEWNIPTS--LPVLEDQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKH 176 Query: 184 IRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 + + + + + + +V V ++ L + + + + Sbjct: 177 VHENIEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAENALKELIGGDK 236 Query: 243 RIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERERYS 302 + MT+ E+ +G KG+Q L+ L+ G I K + + ++ + +E Sbjct: 237 KAMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGLI-KAKKDDIKSVILIKFGVLPKELEE 295 Query: 303 WLKS 306 ++S Sbjct: 296 KIES 299 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 156 bits (395), Expect = 6e-37, Method: Composition-based stats. 
Identities = 55/251 (21%), Positives = 106/251 (42%), Gaps = 8/251 (3%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HDA +K +HP+ RD ++ + + + D +L+ S S+V + LR DI+W ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLVIPMLFYHG 126 +EG YIY+++E QS D +MA R++ Y + Q I+ Q LP V P++ Y+G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 127 SRSPYPWS-LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 + + L + R + + LVD D+ + + + ++ R Sbjct: 124 GPRWRAATEVGDLITPLEGGLERYRPSLRYLLVDEGDYQDEALAPLKNLVASLFRLENSR 183 Query: 186 Q-RDLMGLIDQLVVLLV---TECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 +L+ ++ L+ L + L +L ++ L Sbjct: 184 TPEELLQVLRNLLQWLQSPAQKGLERDFTLWLKRVLLPARLPGVEIPSVASLEEMNSMLA 243 Query: 242 ERIMTIAERIH 252 ER++ ++ Sbjct: 244 ERVVEWTQQWK 254 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. 
Identities = 61/301 (20%), Positives = 128/301 (42%), Gaps = 18/301 (5%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK + P + ++I P +L + DL+S++L ++ +K+ ++L+ Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNLLYE 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K ++ ++ EH+S D ++ +L+ Y+ + + + + P +I ++ YHG Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE---TGEYEEYPPIINIVLYHGK 120 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL----ELIQKH 183 R + L + R + L+D++ V D+E++ + L KH Sbjct: 121 RKWNIPAT--LPKTNSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLTMKH 178 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 I + DL + ++ V E D + +L+YI + + + E+ + Sbjct: 179 IFE-DLRKY--KHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEIL----GGEKD 231 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERERYSW 303 +MT+ E+ +G +G Q+ + + + IQ G E ++ L + E Sbjct: 232 MMTLTEKWKMEGLQQGLQQGMIEGQKKAI-LKSIQLKFGRVPENIEKLISNINNLEELDK 290 Query: 304 L 304 L Sbjct: 291 L 291 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. 
Identities = 60/340 (17%), Positives = 129/340 (37%), Gaps = 52/340 (15%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 S HD FKT + RDF+ LP ++ + D DSL+ + + H Sbjct: 1 MNEISGLHDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHM 60 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D++ + E Y++IEH+S D + +++RY +A+ R+ + +K PL V+P++ Sbjct: 61 DLVVECRISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRNRQDNK--PLVPVLPLV 118 Query: 123 FYHGSRSPYPWSLCWLDEFADPT-TARKLYNAAFPLVDVTVVPDDEIVQ---HRRVALLE 178 F+ G R P+ + + + F P + A L D++ V I + H ++ Sbjct: 119 FHQGGR-PWTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVL 177 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 + K+ + ++ L +++ + +LNY + + + ++R Sbjct: 178 TLLKYAFSGSVEDVLRALKE--TGGSFDETFLFGVLNYAIRAFEVKDPV-VVDAISRSF- 233 Query: 239 QHRERIMTIAERIHNDGY----------------------------------------IK 258 + + +I + +G + Sbjct: 234 GGEKIMPSIIDEWVEEGLKEGLKKGREEGREEGREEGKEEGRKEGREEGKEEGRKEGQKE 293 Query: 259 GEQRILRLLLQNG-ADPEWIQKITGLSAEQMQALRQPLPE 297 G+++ + LL G I + + ++ +R+ L + Sbjct: 294 GQRKTIEKLLAKGVLSVSEIASALDVDLQWVEQIRKDLEK 333 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 154 bits (388), Expect = 4e-36, Method: Composition-based stats. 
Identities = 66/274 (24%), Positives = 132/274 (48%), Gaps = 7/274 (2%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +F PH+A FK F P+ + F++ H+P+++ L DLD+L+++ + FV E+ R ++ Sbjct: 2 SFEIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYA 61 Query: 63 DILWSVKTRE--GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-PLPLVI 119 D++ +V+ + + IY+++EH+S + +++ Y + + Q LP++I Sbjct: 62 DVMVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVII 121 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADP--TTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 P++ YHG +S + D F P + + D++ + DDE + + Sbjct: 122 PVVIYHGKGRWN-FSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIF 180 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI-LLTGDEARFNEFISELTRR 236 L+ K+I +L + ++ LL T D L + + E + E TRR Sbjct: 181 HLLFKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISLERLGEYTRR 240 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN 270 +P E + T A++I + Y + Q ++L++ Sbjct: 241 LPGGDEAMQTAAQQIRQEAYNEFIQEQEKMLVER 274 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 154 bits (388), Expect = 5e-36, Method: Composition-based stats. 
Identities = 69/308 (22%), Positives = 111/308 (36%), Gaps = 19/308 (6%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 HDALFK P A LP L + D + E + +D +L D+LW Sbjct: 5 HAHDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWR 64 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + +G G IYV++EHQS + M R+ Y + H D+ PLP +IP++ H Sbjct: 65 TRFVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAE 123 Query: 128 RSPYPWSLCWLDEFADPT----TARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 W P A + N + D+T V D + L Sbjct: 124 HGWRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQVDDASLRGRSLPLFQTLALWL 183 Query: 184 IRQ-RDLMGLIDQL---------VVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 +R RD +++ + + + I LL Y E +EF +L Sbjct: 184 LRDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFHRKL 243 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 P E +T ++ N G+ +G + + + K + L LR+ Sbjct: 244 AAFHPPSAEMSLTFEQQAINRGHKRGLEEGRIKGRLELLEAQLHAKFSTLP----MRLRE 299 Query: 294 PLPERERY 301 L + + Sbjct: 300 RLDQADDL 307 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. 
Identities = 56/322 (17%), Positives = 118/322 (36%), Gaps = 30/322 (9%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M + +D +K ++ ++ ++ + L+L ++V Sbjct: 1 MCSNLPHNVNDLEYKYIFSNKSLFLRLLKRIDRINIFNKLTEEDLELVDKNYVLPDFSEQ 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-------RHIEHDKRQ 113 SD+L+ + +E + + Y++ EHQS D +MA RL+ Y + + ++ +K Sbjct: 61 ESDLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDIWRDWLKQFDKNQFKNKSF 120 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR 173 P V+P++ Y G + + + + L+D+ + Sbjct: 121 KFPPVVPIVLYDGDNPWTASVNLKERIMNFEVFGKYIVDFEYILIDLNDPDEMIFKYKDI 180 Query: 174 VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 ++L+ + K +++L L L L + + ++L + ++ Sbjct: 181 LSLILKLNKVKTEKELERLFLDLYEYLQGAKEKEINTLKICLPVVLKELGEDKVQEAKDM 240 Query: 234 TRRMPQHRERIM-------TIAERIHNDGYIKGEQ----------------RILRLLLQN 270 + E IM I E +++G KG Q I ++ Sbjct: 241 LECIDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDGLQQGLQQGLQKKELEIAERMIVK 300 Query: 271 GADPEWIQKITGLSAEQMQALR 292 G E I +ITGL E+++ LR Sbjct: 301 GYSDEEIHEITGLDIEKIKELR 322 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. 
Identities = 55/267 (20%), Positives = 106/267 (39%), Gaps = 7/267 (2%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FK+ L P ++ LP L L SL + V + L A D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 E I+V++EH+S D F+++ Y + R + + R P+P V P+LFYHG R Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLR-DKKESRSPIPFV-PVLFYHGLRP 125 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR---VALLELIQKHIRQ 186 + + + P++D+ + D +I + R + L+ KHI + Sbjct: 126 WNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLLLKHIFE 185 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMT 246 G + + + + I + ++Y++ E + + + Sbjct: 186 GAR-GSLRAFLQETNGKNLSRDIIISGMSYVIGVHHLESTAELSRLVNTILKEEG-MSQN 243 Query: 247 IAERIHNDGYIKGEQRILRLLLQNGAD 273 + E + +G Q+ ++ +Q G + Sbjct: 244 VVELWMEELIQQGVQKGIQQGVQLGIE 270 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 150 bits (379), Expect = 5e-35, Method: Composition-based stats. 
Identities = 59/278 (21%), Positives = 124/278 (44%), Gaps = 11/278 (3%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N + PH+ FK ++ +DF+ I L DL + L SL+L + + Sbjct: 1 MKNKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKH 59 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D+L+ K + + YI ++ EH+S D + +LM+Y+ + + ++ P +I Sbjct: 60 FLDLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKEKDY--YPPIIN 117 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ--HRRVALLE 178 ++FYHG + + D + + + L+D+ + D+ + + + V L+ Sbjct: 118 IVFYHGQAKWNFPTTI--PDIEDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLIM 175 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 + D + I L+ ++ EC+ D + L +L+ D + E E+ Sbjct: 176 EMLIMKHIHDRLERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKEII---- 231 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEW 276 E++M +++ +G ++G+ ILR + + D ++ Sbjct: 232 GGEEKMMLFTDKLKMEGKMEGKIEILRENIIDLIDVKF 269 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 150 bits (379), Expect = 5e-35, Method: Composition-based stats. 
Identities = 67/329 (20%), Positives = 122/329 (37%), Gaps = 43/329 (13%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD FK+ L PD ++ LP ++ D SL V E L + D+ +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS 129 + + I++++EH+S D F++ RY + R ++ + ++P+LFYHG Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPR--PLLPILFYHGVVP 124 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDL 189 S + PL+D+ V D+EI H V LE + + + + Sbjct: 125 WTLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHH--VDDLEAVLALLSLKHI 182 Query: 190 MGLIDQLVVLLVTECANDSQITALLNY-------ILLTGDEARFNEFISELTRRMPQHRE 242 ++ LV LL+ E A+L + + + + + R + ++ Sbjct: 183 FDGVETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREVGMAQD 242 Query: 243 RIMTIAERIHN--------------------DGYIKGEQRILR---------LLLQNGAD 273 + T + G KG Q+ R LL + Sbjct: 243 IVETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGFQQGARLKEEQVIRTLLKKKTFS 302 Query: 274 PEWIQKITGLSAEQMQALRQPLPERERYS 302 E I + G+ ++ +R+ ER S Sbjct: 303 FEEIASLVGV---ELSRVREVAESPERGS 328 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 150 bits (379), Expect = 5e-35, Method: Composition-based stats. 
Identities = 58/273 (21%), Positives = 111/273 (40%), Gaps = 13/273 (4%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 PHD+ +K F ++P+ + +P D E D +L+ S S+V + LR H DI+W + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 69 KTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLFY 124 ++G Y+ +V+E QS D MA R + Y+ ++ ++ K + LP V P++ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 125 HGSRSPY-PWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI-VQHRRVALLELIQK 182 +G ++ P + L + L+D + V DE+ VA L +++ Sbjct: 127 NGGKAWKAPQEVATLFAPMPDSLKHYCPQHRHFLLDESRVSGDELDKSQGLVAQLLKLER 186 Query: 183 HIRQRDLMGLIDQLVVLL---VTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ 239 + ++ +L+ L + L +L +L Sbjct: 187 AQEPEQVRQIVKELITRLHEPKYLLLRRAFTVWLSRVVLKRSGITEEIPEFQDLREVDAM 246 Query: 240 HRERIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 ER A + ++ +G+ + + G Sbjct: 247 LEER----AAQWKDEYIKQGKTEGISIGEARGI 275 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 150 bits (378), Expect = 6e-35, Method: Composition-based stats. 
Identities = 61/300 (20%), Positives = 123/300 (41%), Gaps = 18/300 (6%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 PHD FK + + A DF+ P ++ + DL +L +++S++DE+L+ SDI Sbjct: 2 EILNPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDI 61 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +++ ++ + I ++ EH+S +LM+Y + + + + Q L VIP++ Y Sbjct: 62 VYTCFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWE--ANSKQAQRLIPVIPVILY 119 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIV----QHRRVALLELI 180 HG + E D R + + L D++ ++EI + + + L+ Sbjct: 120 HGKEAWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLL 179 Query: 181 QKHIRQ----RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 ++I D + ++ + E + + + Y+ D A I L Sbjct: 180 MRNIFDEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAE-KRVIDTLKEI 238 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 + + MTIA ++ G I G G I G +++ L++ + Sbjct: 239 SEEGGKLSMTIAAKLIEKGKIAGRVEGRAEGRAEG-------AIEGERKGRIEGLKEAIE 291 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 147 bits (370), Expect = 6e-34, Method: Composition-based stats. 
Identities = 58/268 (21%), Positives = 120/268 (44%), Gaps = 7/268 (2%) Query: 5 TTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 +T+ HD+ K FL+ A ++ LP+++ + D + + E S++ + L+ +SD+ Sbjct: 2 STTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDL 61 Query: 65 LWSVKTREGD--GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 + SV T+ G ++ ++EH+S + + +RY + +++ ++ LP++IP+L Sbjct: 62 VVSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPIL 121 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVD-VTVVPDDEIVQHRRVALLELIQ 181 H P + L + + + F L D V P+D AL + Sbjct: 122 IAHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFT-LW 180 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDS---QITALLNYILLTGDEARFNEFISELTRRMP 238 ++ R + M + + L+ + +L+Y+ +T DE + + + Sbjct: 181 RYSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEID 240 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRL 266 + E + TIAE +G + EQR L+ Sbjct: 241 EGEEYMGTIAEMFRREGDERTEQRFLQE 268 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 146 bits (368), Expect = 9e-34, Method: Composition-based stats. 
Identities = 63/324 (19%), Positives = 129/324 (39%), Gaps = 34/324 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M++ HD+ FK HP ++ + + DS++L FVDE Sbjct: 1 MSSSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQK 60 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D++ + ++ + Y Y++IE+QS M RL+RY + + + I + LP +IP Sbjct: 61 RADVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVK-KLPAIIP 119 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHR----RVAL 176 ++ Y+G + S + EF K + +V+++ + ++Q + Sbjct: 120 IVTYNGLEKDWDVSQEIISEFDI----FKDDIFKYAVVNISKLDAKTLLQEEEDILSPVV 175 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLN-YILLTGDEARFNEFISELTR 235 L Q +L+ + ++ L N+++ + ++ E EL + Sbjct: 176 FYLEQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDELAQ 235 Query: 236 RMPQHRERIM------------------------TIAERIHNDGYIKGEQRILRLLLQNG 271 R+ Q R M +G I+G+ + + +++ G Sbjct: 236 RVEQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRG 295 Query: 272 ADPEWIQKITGLSAEQMQALRQPL 295 E I ++T L E+++ LR+ L Sbjct: 296 FSDEDIAELTELDIEKVKELRKEL 319 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 146 bits (367), Expect = 1e-33, Method: Composition-based stats. 
Identities = 58/274 (21%), Positives = 102/274 (37%), Gaps = 10/274 (3%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 DAL+ +HP A + +P+ + D ++ +A F D + D++W + T Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 71 REG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLPLVIPMLFYHG 126 +G D ++++ E QS D MA R Y + Q I K + LP V+ ++ Y+G Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSGDRLPPVLTLVLYNG 124 Query: 127 SRSPYPWSLCWLDEF--ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 + + + A A + L+D+ VP++E+ +A L +H Sbjct: 125 EQRWHAPTDTIPLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAALLFRLEHP 184 Query: 185 RQ-RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 R+ +L GLID +V D + L E + M + R Sbjct: 185 REPEELAGLIDDVVGWFRRHPGYDELRRL---FTELVRQAIEGYETSVAVPGDMMEMRSM 241 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEWI 277 + + E +G G I Sbjct: 242 LANLGETWKKRWLAEGIAEGEARGEARGEAKALI 275 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 143 bits (359), Expect = 1e-32, Method: Composition-based stats. 
Identities = 62/319 (19%), Positives = 122/319 (38%), Gaps = 32/319 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RA 59 M T PHD FK + + + LP+D+ D DSL V E L R+ Sbjct: 1 MAKNLT--PHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRS 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 +D+++SV E +G + V++EH+S D + F++++ + +++ R+PLP ++ Sbjct: 59 TRADLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREG-REPLP-IL 116 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI--VQHRRVALL 177 P+LFYHG S AR L + +D+ ++ D I +Q+ Sbjct: 117 PILFYHGQGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAA 176 Query: 178 ELIQKHIRQ--RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 L KH+ + R L+ + +I + + +E + + Sbjct: 177 LLSMKHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMA 236 Query: 236 RMPQ---HRERIMTIAERIHNDGYIKGEQ--------------------RILRLLLQNGA 272 + + + + +G KG Q + + L ++ Sbjct: 237 AITEEAGMETTTERLKKIWIEEGIQKGVQLGIQQGVQQGVQQGVRQNQIKTILSLSKHNF 296 Query: 273 DPEWIQKITGLSAEQMQAL 291 P+ I + L +++ + Sbjct: 297 TPQQIADLLSLELPEVERV 315 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. 
Identities = 68/266 (25%), Positives = 123/266 (46%), Gaps = 10/266 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 TS HD F+ L ARDF+ HLP+++ +LD++K+ S S+V + L+ + Sbjct: 7 MSDTSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMT 66 Query: 63 DILWSVK-TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 DI+ +++ IY+++EH+S D +L +Y V Q I K LP+++P+ Sbjct: 67 DIVITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFI-QKKTGTLPIIVPL 125 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTA--RKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 +FYHG+ +SL + D F P+ + + L +V V+ ++ + + L Sbjct: 126 VFYHGTARWN-YSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHL 184 Query: 180 IQKHIRQRDLMGLIDQLVVLLVTECANDSQIT---ALLNYILLTGDEARFNEFISELTRR 236 + ++I + I + + LL L+ Y+L+ D E E + Sbjct: 185 VLEYIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATD--ETPEEAEEKVKH 242 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQR 262 +P+ E + T AE + GY K + Sbjct: 243 LPKGGETVRTTAEVLEERGYNKAIKE 268 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. 
Identities = 61/243 (25%), Positives = 124/243 (51%), Gaps = 27/243 (11%) Query: 76 YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSL 135 Y+Y +IE+QS + MAF ++ Y++A+M++H+ Q LP+++ + Y G +SPYP+S Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEG-YQELPIIVNICIYTGKKSPYPYSQ 94 Query: 136 CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQ 195 D F AR+ F L+D++V+ +E+++ +E + + R+RD + I+ Sbjct: 95 DICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINN 154 Query: 196 LVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHN-- 253 VL+ +N +++ YIL T D+ + + + + + +E I+T A+++ Sbjct: 155 NQVLIWELVSN--YGLSIVIYILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVD 212 Query: 254 ----------DGYIKGEQRIL------------RLLLQNGADPEWIQKITGLSAEQMQAL 291 +G +G++ + + +L+ G + IQK+TG+S E ++ L Sbjct: 213 IQTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSMLKEGLEISLIQKVTGISREAIEKL 272 Query: 292 RQP 294 + Sbjct: 273 TKE 275 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. 
Identities = 64/311 (20%), Positives = 121/311 (38%), Gaps = 23/311 (7%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RAL 60 ++TPHD+ FK + + L +L SL+ + E L R+ Sbjct: 17 KTSISTTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARST 76 Query: 61 HSDILWSV-----KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL 115 SD+ S+ + GD I + EH+S H+ L+ A++ R + ++ Sbjct: 77 RSDLSASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREGRKP-- 134 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ---HR 172 VIP++ YHG + P A +L + L+D++ D+ + + H Sbjct: 135 CPVIPVVLYHGRAPWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAHP 194 Query: 173 RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL----LNYILLTGDEARFNE 228 + + KHI + ++ V L+ T + + + L+YI E Sbjct: 195 EPLVSLSVMKHIFEP-PESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQE 253 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQ-----RILRLLLQNGADPEWIQKITGL 283 + T + + E++ T+ + I +G +G Q I RLL + P+ I I + Sbjct: 254 IRTIFTTFLAE--EKMTTVLDLIKEEGIQEGIQMGRDEAITRLLQHSSLSPQQIASILNV 311 Query: 284 SAEQMQALRQP 294 ++ +L Sbjct: 312 DLSRVLSLANS 322 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. 
Identities = 51/320 (15%), Positives = 111/320 (34%), Gaps = 18/320 (5%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 + HD +K ++ +T ++ + ++L L S+V L SD Sbjct: 17 NKKNNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESD 76 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-------EHDKRQPLP 116 I++ + + + + Y+++E QS D M RL+ Y + + + + K LP Sbjct: 77 IVYKARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLP 136 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 V+P++ Y+G ++ + + + +DV DE+ +++ +A Sbjct: 137 AVVPIVVYNGEKNWTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIAS 196 Query: 177 LELIQKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLN---YILLTGDEARFNEFISE 232 + R + + +V+ + + + S Sbjct: 197 AIFLLDQSISRIEFYNRLKDIVIEFNKLTVEEKAQLKHWLVNVNSEENNYKENIEKIFSS 256 Query: 233 LTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-------ADPEWIQKITGLSA 285 R + I E++ +G I+G+ LL E+ +KI L Sbjct: 257 NKREVEIMTSNISKGLEKLKEEGKIEGKAEGKAELLIKQLNKKFKLLPMEYEKKIKALPE 316 Query: 286 EQMQALRQPLPERERYSWLK 305 + + + + E LK Sbjct: 317 KILDDIATDIFSLEEIDELK 336 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. 
Identities = 56/306 (18%), Positives = 131/306 (42%), Gaps = 30/306 (9%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK LT+ + + L + L L+ +++ + ++ + RA SD+++ +K Sbjct: 9 DEGFKKVLTNRTNIKWLLTELL-EVLPIQIGLEDIEVIATESINRQWRARRSDMVYKIKY 67 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 ++ YI V++E QS ++ + R++ Y + + +++ + LP+VIP++ Y G Sbjct: 68 KD--AYICVLLEFQSSKEELIHLRVLEYMLLIQKKYTTKN---LLPVVIPVVLYTGEEKW 122 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR-VALLELIQKHIRQRD- 188 P + + + + + VDV ++ D+++++ +A + K + Sbjct: 123 TPATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDKVSDNPEK 182 Query: 189 LMGLIDQLVVLLVTECANDSQITALLNYILLTG---DEARFNEFISELTRRMPQHRERIM 245 + ++ L + + L +++L G + +EF+ + E + Sbjct: 183 VAERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRLGVNEMFL 242 Query: 246 TIAERIHN-------------------DGYIKGEQRILRLLLQNGADPEWIQKITGLSAE 286 AE+I G + + + +++ GA+ +I K+TGL E Sbjct: 243 NTAEKIRKGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMIEEGAEDSFIAKVTGLDME 302 Query: 287 QMQALR 292 +++ LR Sbjct: 303 RIRQLR 308 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. 
Identities = 67/301 (22%), Positives = 131/301 (43%), Gaps = 16/301 (5%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 + PHD L + A F + LP ++ EL DL++L+L +SFV E+L+ + Sbjct: 1 MTEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQT 60 Query: 63 DILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 D+L+ + + G+ +Y++ EH+S + + +L+ Y + + + +VIP Sbjct: 61 DLLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRNQQRSGE--SFSVVIPF 118 Query: 122 LFYHGSRSP----YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 +FYHG + L + + + L D+ + + ++ + Sbjct: 119 VFYHGEKEWKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVT 178 Query: 178 ELIQKHIRQRDL-----MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE 232 + + IR+RDL + + L++ + E + + LL YI D Sbjct: 179 LGVVQRIRERDLEFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRVL 238 Query: 233 LTRRMPQHRERIMTIAERIHNDGYI----KGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 ++ Q+ E MT AER+ ++G +G+ R +L E + +ITGLS + + Sbjct: 239 AISKLEQYEELTMTTAERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSKQDL 298 Query: 289 Q 289 + Sbjct: 299 K 299 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 134 bits (337), Expect = 4e-30, Method: Composition-based stats. 
Identities = 56/311 (18%), Positives = 130/311 (41%), Gaps = 10/311 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 + HD FK+F + + RDF++ +LP+++++ DL ++++ ++ E+ + +S Sbjct: 2 SKKIPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFYS 61 Query: 63 DILWSVKTREG--DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVI 119 D++ V + + +Y + EH+S+ + + Y + R + K Q LP+++ Sbjct: 62 DVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPIIV 121 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTT--ARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 P++ Y+G +S +S+ + D F P+ + L D+ + + + + Sbjct: 122 PVVIYNGYKSWN-FSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEIF 180 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI-LLTGDEARFNEFISELTRR 236 L+ K+I +L I ++ LL ND L + + A + + E +R Sbjct: 181 HLLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLLEHAKR 240 Query: 237 MPQHRERIMTIAERIHNDGYIKGE---QRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 E I A I + Q+ ++ + ++ L ++ + Sbjct: 241 FSGGEEMIGLAAREIEERVEQTRKPYWQKQAKVENSQEMLIKSLKMRFDLVRPSIKEQIR 300 Query: 294 PLPERERYSWL 304 + + + + L Sbjct: 301 SIQDVDTLNDL 311 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 133 bits (335), Expect = 7e-30, Method: Composition-based stats. 
Identities = 47/326 (14%), Positives = 110/326 (33%), Gaps = 32/326 (9%) Query: 2 TNFTTSTPH--DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 T+ H D K P F++ + L + + ++ S+ F+ + Sbjct: 8 TSNGVHNTHTKDNAAKIVFGDPVLCAQFLKGYTDIPLFKEIKPEDIENVSSHFLPLFQES 67 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI----------EH 109 SD + + + Y+ +IEHQS D M+FR++RY + + + Sbjct: 68 RDSDTVNKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAAQQEKLHKGTTK 127 Query: 110 DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIV 169 K P ++P+++Y GS + F + + + +V + +++ Sbjct: 128 SKDFLYPPILPIVYYEGSSTWSAPLNFKNRVFLSDVFGDYIPSFNYLVVPLNKYSKQDLI 187 Query: 170 QHRRVALLELIQKHIRQRDLMGLIDQLVVLLVT---ECANDSQITALLNYILLTGDEAR- 225 + L + ++ + + E D + + I + + Sbjct: 188 EKNDELSLIFLINQLQSSSEFHALKDIPKKYTEHLTEDTPDYLLKIIGKVIAVLLHKLNV 247 Query: 226 FNEFISELTRRMPQHRERIM----------TIAERIHNDGYIKGEQRILRLLLQNGADPE 275 +E + E+T ++ + + +M +G ++G R G Sbjct: 248 PDEEVYEVTDQITRRKFSMMFDNFQAYDVQETRRVSREEGRLEGRIEGERAGRIEGERAG 307 Query: 276 WIQKITGLSAEQMQALRQPLPERERY 301 I+ E++ ++Q + E Sbjct: 308 RIE------GERLHLIKQVIKRIELQ 327 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 133 bits (334), Expect = 8e-30, Method: Composition-based stats. 
Identities = 58/269 (21%), Positives = 113/269 (42%), Gaps = 19/269 (7%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +PHD FK + F+EI LP+ L E +SLKL +K + Sbjct: 1 MSIEKSPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFL 59 Query: 63 DILWSVKTREG-----DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPL 117 D+ + K ++ DG IY+V EH+S D H ++ Y +M+ + +P Sbjct: 60 DLAFDCKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEE--DERLSRPYRP 117 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 VIP++FYHG +S + + L++ ++ L DV+ V + +++ + Sbjct: 118 VIPIVFYHGEKSWNIPTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLIEKIYLNAC 177 Query: 178 E----LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 K+I + + + ++ L+ + D + +++ D + + E+ Sbjct: 178 LISGVFTLKNIFKD--LKYLRPVLEKLILDDVKDCLYIIIDYTVIVKKDLETIEKILEEI 235 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQR 262 E++MT+ E+ +G KG + Sbjct: 236 -----GGEEKMMTLTEKWKMEGLKKGMEE 259 >UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseiflexus sp. RS-1 RepID=A5USQ0_ROSS1 Length = 330 Score = 132 bits (333), Expect = 1e-29, Method: Composition-based stats. 
Identities = 48/268 (17%), Positives = 91/268 (33%), Gaps = 20/268 (7%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HDALFK LT R+F+++ P DL D + +D++ Sbjct: 7 HDALFKLVLT--AFFREFIDLVAP-DLAAALDPAPPVFLDKESFADLFDPDRREADLVAQ 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 V+ R+ + + +EHQ++ D + R+ RY + R+ + + P+ Sbjct: 64 VRLRQHPATLLIHLEHQAQADAALDRRMFRYFARLYDRYDQ--------PIYPIALCSYP 115 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI-RQ 186 R P D R + + +V + + + A + L+ + Sbjct: 116 RPRRPA----ADRHEVRAAQRTVLTFQYQVVQLNRMDWRAYLTTTNPAAMALMARMRVAP 171 Query: 187 RDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRMPQHRERI 244 D + + LL +Q + I L + +E+ R +E + Sbjct: 172 EDRWRVKAACLRLLAGAPLTGAQRRLIGQFVDIYLPLNAREEQALAAEVARLPGAAKEVV 231 Query: 245 MTIAERIHNDGYIKGEQRILRLLLQNGA 272 M + G +G + LR G Sbjct: 232 MELITSWERKGRAEGLREGLREGRAEGL 259 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 132 bits (333), Expect = 1e-29, Method: Composition-based stats. 
Identities = 44/268 (16%), Positives = 103/268 (38%), Gaps = 9/268 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 HD +K L+ + + + ++ D ++ SFV + + Sbjct: 7 KEAIHNQHDKGYKFLLSSKRVFIELLRSFVKQEWVNDIDEANVVKVDKSFVLQDFADKEA 66 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-------RHIEHDKRQPL 115 D+++ VK ++ + Y+++E QS D M +RL+ Y + + + R K L Sbjct: 67 DLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDTPRKESRRKDFKL 126 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 P+++P++ Y+G + + T + + L+DV +E+++ + Sbjct: 127 PVIVPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFKYILIDVNRYTKEELLKLENLI 186 Query: 176 LLELIQKHIRQ-RDLMGLIDQLVVLLVTECANDS-QITALLNYILLTGDEARFNEFISEL 233 + + + ++M + +L +L ++ A ILL E I + Sbjct: 187 ASVFLLEQKVEFEEIMKRLKELSEILNNLDKDEILLFKAWFKKILLARLPEEERENIERI 246 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQ 261 + E I + + I + + ++ Sbjct: 247 IDENKEVEEMISNLEKTILQEMKEREKR 274 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. 
Identities = 42/300 (14%), Positives = 107/300 (35%), Gaps = 14/300 (4%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 HD +K ++ + D ++ + + D+++L + S++ L S Sbjct: 4 KKEMHHIHDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELES 63 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-------RHIEHDKRQPL 115 DI++ + Y+++E QS D M RL Y + + + K L Sbjct: 64 DIVYKATIDGREVIFYILLEFQSYVDYSMPIRLFLYMSEIWREVLKNTKQAEVKSKEFRL 123 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 P ++P++ Y+G + + + L+D+ +E+++ + + Sbjct: 124 PAIVPLVLYNGEYKWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKNLV 183 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFN----EFIS 231 + ++ D+ I ++ + + + +L + L E I Sbjct: 184 SAVFLL--DQKVDIEEFISRVKDIAIDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEKIE 241 Query: 232 ELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 ++ + R+ + + + + K + + ++ G + I+K E + L Sbjct: 242 DILIAKKEEVNRMTSNISKTIKETFAKTREEGMEKGIEEGIEKG-IEKARQKDVEIVLKL 300 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. 
Identities = 52/321 (16%), Positives = 122/321 (38%), Gaps = 33/321 (10%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 + HD+ FK +P + + S++++ +++ ++ + Sbjct: 6 KEKLPAKEHDSTFKLLFENPKDIYLLLSKIINYSWANEIRESSIEIKKTNYITKEFSQVE 65 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 +D++ + ++ D Y Y++IE+QS M RL+RY +++ I + + LP +IP+ Sbjct: 66 ADVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNGV-EKLPAIIPI 124 Query: 122 LFYHGSRSPYPWSLCW---LDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 + Y+G + S D F + K+ + +D+ +E V + LE Sbjct: 125 VVYNGLDRRWEVSTDIIGAFDIFKNDIFKYKVVD--IAQIDIKNYLQEEDVLTPIIFYLE 182 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL-NYILLTGDEARFNEFISELTRRM 237 ++ +L+ + ++ L N+ + L +++ + +L ++ Sbjct: 183 QVRN--DSNELVRRLQEIEQSLKKLSFNNIERFLLWSQHVIRPRLGNEQKKEYDKLVMKV 240 Query: 238 PQHRERIM-----TIAERIHNDGYIK-------------------GEQRILRLLLQNGAD 273 Q +M +A + + + ++Q G Sbjct: 241 RQEGVELMGEFVSNVARLLDETKTKEFLAGVQQGIQQGIQQGIQQERIETAKRMIQLGIS 300 Query: 274 PEWIQKITGLSAEQMQALRQP 294 E I K T LS E+++ + + Sbjct: 301 YEVISKATNLSIEEIEKIARE 321 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. 
Identities = 61/323 (18%), Positives = 124/323 (38%), Gaps = 38/323 (11%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 HD +F+ + P AR F+ LP +L D +L + S + + L D+++ + Sbjct: 36 HDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRIN 95 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-------------------RHIEHD 110 + + YV++EHQ++ + HMA R+M + + + R Sbjct: 96 VNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKADRQSRRR 155 Query: 111 KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTAR-----KLYNAAFPLVDVTVVPD 165 + PLVI M+ + G R D P + + F +V++ +P Sbjct: 156 ETDKFPLVISMVLHPGPRKWGK-IWRLADLIDVPPRMEKWARTFMPDCGFIVVELAGLPL 214 Query: 166 DEIVQ-HRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAND---SQITALLNYILLTG 221 +++ H A+L +Q + + I +L+ + ++ + + L +Y++ + Sbjct: 215 EKLADGHLARAILGALQGNRLGLIDIRKIKRLLDEMFSDPDRASVGAVVKQLWHYLISSS 274 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT 281 D + ++R IM ER+ G +K + + L+ D Sbjct: 275 DLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAGALKAQHNAVIEALEVRFDR------- 327 Query: 282 GLSAEQMQALRQPLPERERYSWL 304 E ++ Q + + ER L Sbjct: 328 --VPEGLREAIQGINDPERLRNL 348 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. 
Identities = 50/318 (15%), Positives = 126/318 (39%), Gaps = 36/318 (11%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 T +D +K ++ + F++ L ++ + + +++ + +++K + SDI+ Sbjct: 3 TYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDIV 62 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + +K ++ + + IE QSRED + RL Y + +++ +P+V+P++ Y+ Sbjct: 63 YKIKYKD--SFFCLTIEFQSREDKKILHRLYEYMHLI---QLKNKVNGEIPVVVPIVLYN 117 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G P N +D+ +P+++++ V + + + Sbjct: 118 GISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLISAANVLAIAVYIDQV- 176 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG------DEARFNEFISE------- 232 + ++++++ L N Q L +++ E E + Sbjct: 177 SNNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGLEVDE 236 Query: 233 -------------LTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPE 275 + +E + ++ G +G +R I + +L++ E Sbjct: 237 LFSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKDNQPIE 296 Query: 276 WIQKITGLSAEQMQALRQ 293 I K TGL+ E+++ L++ Sbjct: 297 LISKYTGLTPEEIKKLKK 314 >UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NIZ1_GLOVI Length = 311 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. 
Identities = 48/319 (15%), Positives = 108/319 (33%), Gaps = 35/319 (10%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEK--LRALHSDIL 65 T HD LFK L+ +F+++ D+ + S+ + +D++ Sbjct: 2 TDHDRLFKELLS--TFFVEFIDLFF-ADVGNYLERGSIVFLEKELFSDITAGERYEADLV 58 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + R+ + V IE+Q+ ++R+ RY + +++ + P+ + Sbjct: 59 VKARFRDHQSFFLVHIENQTEAQSIFSYRMFRYFARLYEKYQL--------PIYPIAVFS 110 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH-I 184 + F D + + +V + + + ++ L+ + I Sbjct: 111 FTEPLRAEPTAHRVAFPD----FTVLEFHYRVVQLNRLDWRDFLRQPNPVASALMARMRI 166 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRMPQHRE 242 D + + + LL T + ++ + L F +EL +E Sbjct: 167 APADRPRVKLECLRLLATLRLDPARTQLISGFVDTYLKLTAQEERLFAAELATIGASEQE 226 Query: 243 RIMTIAERIHNDGYIKGEQRILRLLLQN---------------GADPEWIQKITGLSAEQ 287 ++ I G +G Q + Q + ++++GLS Sbjct: 227 AVVQIVTSWMQQGLEQGRQVGRQEGRQEEALAIVLRQLSRRLGTLPAQNAERVSGLSTTA 286 Query: 288 MQALRQPLPERERYSWLKS 306 ++AL + L + S L S Sbjct: 287 LEALSEALLDFASISDLDS 305 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. 
Identities = 62/304 (20%), Positives = 127/304 (41%), Gaps = 20/304 (6%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 PHD FK + P + ++I +L + DL+S++L ++ +K+ D+L+ Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFY-SELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYE 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K ++ ++ EH+S D ++ +L+ Y+ + + + + +I ++ YHG Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE---TGEYKEYLPIINIVLYHGK 120 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL----ELIQKH 183 R + L + R + L+D++ V D+E++ V L KH Sbjct: 121 RKWNIPTT--LPKTNSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKH 178 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 I + DL + ++ V E D + +L+YI + + + E+ + Sbjct: 179 IFE-DLKKY--KHILKKVFEHYQDGCVFIILDYISVVNNPQEVENVLKEIL----GGEKE 231 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKI---TGLSAEQMQALRQPLPERER 300 + T+ E+ +G +G Q+ L+ L + I+ I G E + L + + E Sbjct: 232 MTTLTEKWKMEGLQQGLQQGLQQGLIKAKQEDIIKLIKVRFGNVPENVGKLISDINDLEE 291 Query: 301 YSWL 304 L Sbjct: 292 LDKL 295 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 49/138 (35%), Positives = 83/138 (60%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 PHD F+T ++ A++F E HLP ++ + DL+SL+L+ +SF+DE L+A + Sbjct: 1 MKKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMA 60 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 D+L+SVK GY Y+++EHQ D M +RL+RY + ++ H++ PLP+V+P++ Sbjct: 61 DVLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLV 120 Query: 123 FYHGSRSPYPWSLCWLDE 140 FY+G + + L Sbjct: 121 FYNGKKRYPFQRIFLLYL 138 >UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostridia RepID=B9MPV5_ANATD Length = 331 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. 
Identities = 47/330 (14%), Positives = 118/330 (35%), Gaps = 47/330 (14%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 S +D FK + F+ +P+ + +++ + ++ + +A SD+++ Sbjct: 4 SRSYDVGFKKLFSDKINVCWFITEIIPEPRLKNYTQSDIEIVATESINAQWKARRSDMVY 63 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 + +IY+++E QSR + M R+ Y + +++ + + V+ Y+G Sbjct: 64 RLPYS--SSWIYLLVEFQSRPNKQMHCRIYEYVFLIQRKYQIDKRLPVVVPVVL---YNG 118 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRVALLELIQKHIR 185 P + + + + +DV +P+D+++ + +A + + Sbjct: 119 VEKWQPVTQFADNVEYAEDFPEYVQRLNYIFIDVRDIPEDKLLNGNNVLAAALYVDQVAT 178 Query: 186 QRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR--------- 235 D ++ + +L + ++ L + +L + E + Sbjct: 179 NPDSVVERLLELGKNIRIPDEQREELAEWLYHAVLKSYKIPREEINELFAKSKILGVEEM 238 Query: 236 -------------------------------RMPQHRERIMTIAERIHNDGYIKGEQRIL 264 ++ E + +G ++ + I Sbjct: 239 FQSTAMKIKKGLAEEKKKIRLESKIEGKIEGKIEGKIEGKIEGKIEGKIEGRMEAQLEIA 298 Query: 265 RLLLQNGADPEWIQKITGLSAEQMQALRQP 294 R L+ GA+ +I K+TGL E+++ LR Sbjct: 299 RNLILEGAEDSFIAKVTGLDIEKVKELRNQ 328 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. 
Identities = 49/217 (22%), Positives = 76/217 (35%), Gaps = 14/217 (6%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKL-RA 59 MT TT TPHD+ FK + L D SL S + E L + Sbjct: 1 MT--TTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATS 58 Query: 60 LHSDIL-----WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP 114 SD++ ++EH+S + F+L A+ R + K P Sbjct: 59 FRSDLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGK--P 116 Query: 115 LPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH--- 171 V+P+L +HG + P A + + A ++D+T + DDEI + Sbjct: 117 PLPVVPILIHHGKSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEIRRKIPD 176 Query: 172 RRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDS 208 + KHI L + + LL N Sbjct: 177 PEPQMSLAAMKHIHDP-LPAFLRVMADLLKEIEENRD 212 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 119 bits (299), Expect = 9e-26, Method: Composition-based stats. Identities = 61/314 (19%), Positives = 127/314 (40%), Gaps = 13/314 (4%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 ++ D++FK DF++ LPK+ + LK E + + SDIL Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 66 WSV-KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------PLPLV 118 + + K D YIY+++EHQS+ D MAFR++ Y + + ++++ K++ LP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 I M+FY G L A + L++++ + ++ I+ ++ + Sbjct: 122 IGMVFYDGKAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGVI 181 Query: 179 LIQ-----KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 L+ + +L+ +I++ ++L ++E + +I L G + E Sbjct: 182 LLTDKPNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKERF 241 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITG-LSAEQMQALR 292 ++ E I K + G E + I L+ + Sbjct: 242 EELKEMEVPKMFNTLEEIAKRDREKAKLEGKAEGKVEGKLEERRELIIEILNQRFGEDFD 301 Query: 293 QPLPERERYSWLKS 306 + L E+ R + ++ Sbjct: 302 
KSLEEKIRNANEET 315 >UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XMU9_SYNP2 Length = 316 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 44/262 (16%), Positives = 96/262 (36%), Gaps = 20/262 (7%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH--SDILWS 67 HD LFK LT DF+ + P ++ E + +SL + ++ DI+ Sbjct: 7 HDLLFKELLT--TFFWDFLALFAP-EILETAEQNSLTFLTQEVFNDLPGQTRRNVDIVAK 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + R + V +E+Q+ A R+ Y + +++ + P+ + Sbjct: 64 LHFRGQETCFLVHVENQATSQADFAERMFLYFARLYEKYRL--------PIYPIALFSYR 115 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR-Q 186 + F +++ + +F + + +P + ++ L+ K Sbjct: 116 SPQRLEPETFSVAFPS----KEILSFSFQTIQLNRLPWRDFLRQPNPVAAALMAKMNFSS 171 Query: 187 RDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRMPQHRERI 244 + + + + ++VT + ++I L L + A F EL R PQ ++ Sbjct: 172 EERPKVKLECLRMIVTLRLDSARIHLLSGFVDTYLRLNMAEQQVFEQELHRIQPQEEAQV 231 Query: 245 MTIAERIHNDGYIKGEQRILRL 266 + I +G +G Q + Sbjct: 232 LRIVTSWMEEGLQQGRQEGRQE 253 >UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C351D8 Length = 313 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. 
Identities = 53/308 (17%), Positives = 98/308 (31%), Gaps = 31/308 (10%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D LF+ D + DL L A ++ +D Sbjct: 5 KLNRNYKDRLFRLAFQEKKDLLDLYNAVSGRQYTNPDDLIITTLADAIYLGM-----KND 59 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-------KRQPLP 116 I + V + + EHQS + +M R + Y + +I+ + K LP Sbjct: 60 ISFLVSD------VLNLYEHQSSFNPNMPVRGLNYFADTYREYIDRNGFDIYGEKLIRLP 113 Query: 117 LVIPMLFYHG-SRSPYPWSLCWLDEF-ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR- 173 + ++FY+G P L D F + ++++ + E++ R Sbjct: 114 MPQYIVFYNGTKEEPDRIELRLSDAFLCQNPEEKGCLECRATMININYGHNKELMDRCRR 173 Query: 174 -VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE 232 + + +D+ V V C + +L IL N + E Sbjct: 174 LKDYAVFVSRIRNNEKRGMALDEAVKQAVHSCIEE----GILADILKKNRAEVCNLILYE 229 Query: 233 LTRR--MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQA 290 + + RE M +G + I+R + G +P I + GL ++ Sbjct: 230 YDEQRQLAIAREGAMKA---GREEGRAAEQVTIIRNMAGKGLNPSAIADMLGLEEGYVKK 286 Query: 291 LRQPLPER 298 + L E Sbjct: 287 VLYLLAEE 294 >UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CSV6_9CLOT Length = 317 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. 
Identities = 57/317 (17%), Positives = 100/317 (31%), Gaps = 33/317 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 MT D LF+ D + L+ L+ A ++ Sbjct: 1 MTK-VNKKYKDRLFRLVFGDRRRLLDLYNALNGSHYEDPDALEITTLDDAVYLSM----- 54 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------- 113 +D+ + V G +Y EHQS + +M R Y V ++++ K Sbjct: 55 KNDLSFLV---NGVLNLY---EHQSTYNPNMPVRGFFYLADVYRKYVVEHKLNLYGSRLA 108 Query: 114 PLPLVIPMLFYHG-SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHR 172 LP ++FY+G P L D F A ++++ + + +++ Sbjct: 109 KLPSPKYLVFYNGRKEEPDRKILRLSDAFQGGRNAEPCLELCAVMLNINLGRNQVLMERC 168 Query: 173 R--VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 R + + + R G ++ V V +C D IL + E + Sbjct: 169 RTLKEYAQFVDRVRRMIAETGALESAVDCAVEDCIRDG--------ILENFLSSHRAEVL 220 Query: 231 SELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA--DPEWIQKITGLSAEQM 288 + + M E +G +G L L G E I + G E Sbjct: 221 DVILTDYNEQEYIAMEREEAW-EEGRAEGLTEGLSEGLSEGLSVSREAILDLLGEFGEVP 279 Query: 289 QALRQPLPERERYSWLK 305 + LR + LK Sbjct: 280 EELRARICAESDKETLK 296 >UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KKN3_9FIRM Length = 297 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. 
Identities = 53/308 (17%), Positives = 112/308 (36%), Gaps = 30/308 (9%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T D+LF+ E + + D+ L F D + Sbjct: 1 MCMKPKRTYKDSLFRHIFNDKRRLASLYESLTGRKVAPR-DIAITTLRGVFFND-----I 54 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +DI + + R+ +++EHQS + +M R++ Y + R ++ + +IP Sbjct: 55 KNDISFRIGDRD-----IILMEHQSSWNPNMPLRMLWYVAKLYSRQLDSQEVVYRSRLIP 109 Query: 121 M------LFYHGSRS-PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR 173 + +FY+GS+ P L D FA T +L + ++ ++++ Sbjct: 110 IPAPEFYVFYNGSQDEPDYQKLRLSDAFAHATDTLELAVDCY---NINYSTQNKLLDSCY 166 Query: 174 VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEA--RFNEFIS 231 I + + ++ + + + + L+ + F Sbjct: 167 ELRCYSIFVQKVREGIQNGLE--LRTAIRQAITYCKTHDLMGDYFQKNESEVFDMVNFKW 224 Query: 232 ELTRRMPQHRERIMTIAE-RIHNDGYIKGEQ----RILRLLLQNGADPEWIQKITGLSAE 286 + R + +E + I E R G + GE+ ++ LL+ G I + T LS E Sbjct: 225 DQKRALEVAKEDGVAIGEARGEARGKLLGERNAMMKVALSLLKKGLPVGVITESTNLSLE 284 Query: 287 QMQALRQP 294 +++ + + Sbjct: 285 EVRKIAKD 292 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 117 bits (292), Expect = 6e-25, Method: Composition-based stats. 
Identities = 57/320 (17%), Positives = 106/320 (33%), Gaps = 48/320 (15%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA--LHSDILW 66 PHD FK L +F+ + P +L D + + + + D+L Sbjct: 27 PHDEAFKKLLH--TFFAEFIALFFP-ELESQLDFSQTRFLMQEQLVDVVGEEARTLDLLL 83 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 K D +I + +E QS R+ Y + +RH + + L+IP+ + Sbjct: 84 ETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERHRKEHQ-----LIIPIAIFTS 138 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI-- 184 + S + + + F V++ P + L+ K Sbjct: 139 AESKNERNSLNMSI-----LGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYN 193 Query: 185 --RQRDL-MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 +R+L + + L+ L + + + D + E + EL ++ + Sbjct: 194 KGEERELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKES 253 Query: 242 ERIMT----------------IAERIHNDGYIKG------------EQRILRLLLQNGAD 273 E IM E+ G KG ++I R LL G Sbjct: 254 EVIMELMPAWMRQGYEKGLEEGLEKGIEQGIEKGFEKGIEQGTLIERRQIARRLLSKGFT 313 Query: 274 PEWIQKITGLSAEQMQALRQ 293 E I +T LS E+++ + Sbjct: 314 LEEIADMTQLSIEEIKKIMN 333 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 117 bits (292), Expect = 7e-25, Method: Composition-based stats. 
Identities = 72/181 (39%), Positives = 103/181 (56%), Gaps = 20/181 (11%) Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLM 190 P + P TA+ LY F L+DVTV+PDD++VQHRRVALLEL+QKHIRQRDL Sbjct: 10 TPHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLS 69 Query: 191 GLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAER 250 + + L +++ N Q+ L +Y+L G+ A F+ L RR+PQ+ E +M+IA++ Sbjct: 70 SITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQK 129 Query: 251 IHNDGYIKGEQR--------------------ILRLLLQNGADPEWIQKITGLSAEQMQA 290 + +G +G I +LQNG D E +QKITGLSA+++Q Sbjct: 130 LKQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQP 189 Query: 291 L 291 L Sbjct: 190 L 190 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 116 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 74/184 (40%), Positives = 106/184 (57%), Gaps = 26/184 (14%) Query: 135 LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLID 194 +CWL FADP AR++Y FPL+D+T PDDEI++HRRVA+LEL+QKHIRQRDLM L + Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 195 QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ--HRERIMTIAERIH 252 QLV LL + Q+ LL+Y+L G+ A F+ L + +P+ H+E +M IA+ + Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 253 NDGYIKG------------------------EQRILRLLLQNGADPEWIQKITGLSAEQM 288 G+ +G +RI R +L NG D + K+TGL+ E + Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 289 QALR 292 L+ Sbjct: 181 ARLQ 184 >UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bacteroidales RepID=A6LF36_PARD8 Length = 273 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. 
Identities = 40/283 (14%), Positives = 85/283 (30%), Gaps = 21/283 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D F + ++ + L + D++ + + E L I++ V Sbjct: 10 DFGFHRIFGQ-EVHKELLIDFLNQLFFGEHDIEDITFLNPIQTPETLDDRG--IVFDVHC 66 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--LPLVIPMLFYHGSR 128 ++ +G ++V +E Q+ + R + Y + + K L V + + Sbjct: 67 KDSNGNLFV-VEMQTGAQPYFHDRGLYYLARAISNQGQKGKDWKFALQPVYGVFLLNYKM 125 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 + T +++ + + KH+ + Sbjct: 126 DVNSKFRTDVILADRETGRMFSDRIRQVYLELPYFQKEPDECENDFERWIYLLKHMDTLE 185 Query: 189 LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIA 248 M A+ + +L D A ++ + Sbjct: 186 RMPF---------------KAKKAVFDKLLEVADVANLSKEERIQYDEALKRYRDYKNTI 230 Query: 249 ERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + G +KG++ R + G P IQK TGLS E ++ L Sbjct: 231 DYAEEKGILKGKESTARNMKAEGIAPLIIQKCTGLSLEDIEKL 273 >UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKN5_MOOTA Length = 304 Score = 114 bits (284), Expect = 6e-24, Method: Composition-based stats. 
Identities = 65/313 (20%), Positives = 117/313 (37%), Gaps = 31/313 (9%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEK--LRALHSD 63 HD LFK LT R+FME+ P L D K + + + + D Sbjct: 1 MPVDHDRLFKELLT--TFFREFMELFFPAA-HTLIDYTDTKFLTQEVITDITAGDKHYVD 57 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM-L 122 IL VK + DG + V IE Q+ A R+ Y + ++H + V+P+ + Sbjct: 58 ILAEVKIKGEDGCVLVHIEPQAYRQADFARRMFIYFSRLYEKHQKR--------VLPIAV 109 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 F H S+ F K+ F + + +P + + L+ K Sbjct: 110 FAHDSKVEETNRHEVEFPF------LKVLQFEFYKIQLKRLPWRQYLNSNNPVAAALLSK 163 Query: 183 H-IRQRDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRM-P 238 R+ + + + + LL + +++ + L + +L+ + P Sbjct: 164 MDYSPRERVQVKIEFLRLLTRMQLDPARMELITAFFDSYLVLNAEEEKSLQEKLSEELQP 223 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA-------DPEWIQKITGLSAEQMQAL 291 + +R+M + H G+ +G Q + +L PE KI LSAEQ+ L Sbjct: 224 EEVQRVMELTTSWHLKGWQQGRQEGRQEILLRQLRKRLGTTSPEVEAKIKTLSAEQLDDL 283 Query: 292 RQPLPERERYSWL 304 + + + + L Sbjct: 284 AEKILDITSEAEL 296 >UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID=Q73P51_TREDE Length = 292 Score = 113 bits (283), Expect = 7e-24, Method: Composition-based stats. 
Identities = 46/303 (15%), Positives = 112/303 (36%), Gaps = 26/303 (8%) Query: 3 NFTTSTPHDALFKTFLTH----PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 + + D++F + + +L C +++++L++ +++ Sbjct: 2 STSNRKYKDSVFVDLFSEDERAKENFLSLYNALHGTNLPMSCPVENIRLDNVMYMN---- 57 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-----DKRQ 113 + +D+ V DG I ++ EHQS + +M R + Y + ++ K Sbjct: 58 -IINDVSCLV-----DGKIIILAEHQSTINENMPLRFLEYIARLYEKLQAPTDRYLKKLS 111 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR 173 +P +FY+G + L + + ++++ ++I+ + Sbjct: 112 KIPTPEFYVFYNGKEDYPETTALKLSDAFITKPKQAPLELTVQVLNINTDKANKILTACK 171 Query: 174 VA-----LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 +E ++K + G + + + + + + I + E ++ Sbjct: 172 PLEEYSLFVEEVRKQTQLDPENGFTNAIKICIEKGILKEYLMRKSREVINMLVAEYDYDT 231 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 I+ +R R I + +DG + I + Q G D + I + TGLS E++ Sbjct: 232 DIA--VQREESLRIGIEQGIRQGFSDGAYQKAIEIAKAFKQFGFDIDKIAEGTGLSREEI 289 Query: 289 QAL 291 + L Sbjct: 290 EKL 292 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 113 bits (283), Expect = 7e-24, Method: Composition-based stats. 
Identities = 59/214 (27%), Positives = 104/214 (48%), Gaps = 12/214 (5%) Query: 90 HMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF-ADPTTAR 148 F++ RY A+M +H++ LP+V+ ML+Y G +PYP++ D F + T A Sbjct: 1 MTPFKIARYVHAIMDQHLKQG-HAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAE 59 Query: 149 KLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI-RQRDLMGLIDQLVVLLVTECAND 207 K+Y +P++D+T + DD I H +A+L+ QK+ RD+ I+ ++ L Sbjct: 60 KIYLRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTR 119 Query: 208 SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYI--------KG 259 Q LL Y D + +L + + + E IM++A +I G + Sbjct: 120 EQCQTLLYYTFRETDTDNVKMLLEQL-QTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEE 178 Query: 260 EQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + +I + +L G D +I+ +TGLS + + L Sbjct: 179 DLKIAKRMLAKGTDRGYIKDVTGLSDQDLLNLED 212 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 54/271 (19%), Positives = 103/271 (38%), Gaps = 29/271 (10%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 +D L +T + A D LP L + DLD+L L S ++V ++LR ++D+L+SV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-IEHDKRQPLPLVIPMLFYHGSR 128 +IY++++HQS D RL R +++ +R+ IE LP+++P++F+H + Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHHEAT 143 Query: 129 SPYP--------------WSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV 174 + + L F D+ Sbjct: 144 GWSDAVGLNGSLALGADVRTALSANRRDFRRLRYLLLVLCFQF-------DEASRAQNLN 196 Query: 175 ALLELIQK----HIRQRDL---MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFN 227 L L+ + +RDL + + ++ +V + ++ +IL + Sbjct: 197 EALGLLMRTFGVARPKRDLVASLKGWEDVIREVVATQRGREMLATVVQFILENSETDPDE 256 Query: 228 EFISELTRRMPQHRERIMTIAERIHNDGYIK 258 R MT A+R+ + Sbjct: 257 LKSFLEFTAGEPARTAFMTGADRLTQGVREE 287 >UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C353CE 
Length = 319 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 40/307 (13%), Positives = 105/307 (34%), Gaps = 29/307 (9%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D LF+ + + DL+ L+ ++ + +D Sbjct: 22 KVNKKYKDRLFRMVFNRKEELLSLYNAVSHSEYTNPDDLEINTLDDVIYM-----KMKND 76 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-------KRQPLP 116 + + + + + EHQS + +M R Y + +++I+ + R LP Sbjct: 77 LAFLI------DDVLNLWEHQSTWNPNMPVRGTFYIVEEYRKYIDQNGLNLYGSSRITLP 130 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHR-- 172 + +FY+G R + L + + F ++++ ++E+++ Sbjct: 131 VPQFYVFYNGLREEPDYIELKLSDAFSRVHSEVEPCMEFKAVMLNINRGHNEELMRQCTT 190 Query: 173 RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE 232 E + + + + +++ + ++ C D +L L F ++E Sbjct: 191 LREYAEFVARIRDETEDGTALEEAAMNVMDSCIRD----GILAEFLSVHRAEVFEVLLTE 246 Query: 233 LTRRMPQHRERIMTIAE---RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 + E+ ++ E +G ++ + + L++ G E I G +++ Sbjct: 247 YDEQRHIASEKEISRREGHMEGRTEGILEKAKEVAVNLIKKGFTVEDAASICGEDICRVK 306 Query: 290 ALRQPLP 296 + Sbjct: 307 EWHREWK 313 >UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultured bacterium RepID=B5U1X5_9BACT Length = 304 Score = 111 bits (276), Expect = 4e-23, Method: Composition-based stats. 
Identities = 58/315 (18%), Positives = 103/315 (32%), Gaps = 29/315 (9%) Query: 1 MTNFTTSTP----HDALFKTFLT-HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE 55 M N + D+LF + + D + F+ ++ L D+L LE + Sbjct: 1 MQNENPTNENRSHKDSLFVDYFSKDRDWKQHFLSLYNALHGTNLQVADTL-LERVNIDQV 59 Query: 56 KLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-----QRHIEHD 110 ++ ++DI V +G ++IEHQS + +M RL+ Y + + Sbjct: 60 LYKSYYNDIAVLV-----NGQFILMIEHQSTINPNMPLRLLEYVARIYGNLVDSKAKFSR 114 Query: 111 KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ 170 PL +FY G + P S L + + V I Sbjct: 115 HLVPLARPEFYVFYTGDQKLPPESYLHLSDSFPNQPPKADLTLEL------KVKVCTIKS 168 Query: 171 HRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 ++ + + L+++ E + A+ IL E R E + Sbjct: 169 DHPSPVVHRCPDLEQYAQFLKLVEEAKAAGQAEPLTWAIQEAVRRNILRDYLERRGGETL 228 Query: 231 SELTRRMP-------QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGL 283 S L Q E G + + R LL G P+ + + T L Sbjct: 229 SILMAEYDYATDFAVQKEEAYEDGLFAGLERGAYQNKLETARSLLSEGLAPQMVARCTSL 288 Query: 284 SAEQMQALRQPLPER 298 E +Q L + + + Sbjct: 289 PLETVQQLGREVSPK 303 >UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacteroidales RepID=A6LFA9_PARD8 Length = 305 Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats. 
Identities = 50/300 (16%), Positives = 99/300 (33%), Gaps = 23/300 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK + +D + L L + L++ + + E + +T Sbjct: 10 DFGFKHIFG-REMDKDILIEFLNDLLEGEYTIMDLRIMNNERLPETEQGRKVIFDIHCET 68 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSM-AVMQRHIEHDKRQPLPLVIPMLFYH---- 125 +G+ ++IE Q+RE H R + Y +V+++ I+ L V + F + Sbjct: 69 DKGER---IIIEMQNREQPHFKDRALYYLSHSVVEQGIKGTWDYELAAVYGVFFLNFTLD 125 Query: 126 ---GSRSPYPWSLCWLDEFADPTTARKLYNAAF--PLVDVTVVPDDEIVQHRRVALLELI 180 G D +++N F +++ +E + Sbjct: 126 EENGPDKNGKEGKFRRDIILADRENGQVFNPKFRQIYIELPRFNKEEEECETDFERWIYV 185 Query: 181 QKHIRQRDLM---------GLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFIS 231 KH+ D M ++++ + +Q A + F Sbjct: 186 LKHMDTLDRMPFKARKAIFERLERIGSMANLTPKQRAQYEAEWKMYNDYYNTLDFAVEKG 245 Query: 232 ELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + + +G KG++ R + G P IQK TGLS E+++ L Sbjct: 246 MKKGMEEGMEKGLQKGLQEGLQEGLQKGKESTARNMKAEGITPLIIQKCTGLSLEEIERL 305 >UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Treponema vincentii ATCC 35580 RepID=C8PTN1_9SPIO Length = 303 Score = 110 bits (274), Expect = 7e-23, Method: Composition-based stats. 
Identities = 40/309 (12%), Positives = 112/309 (36%), Gaps = 30/309 (9%) Query: 3 NFTTSTPHDALFKTFLTH----PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 + D++F + + +L+ C ++++KL++ +++ Sbjct: 2 STANRKYKDSVFVDLFSEDEKAKENFLSLYNALHGTNLQLSCPVENIKLDNVMYMN---- 57 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ----- 113 + +D+ V D I V+ EHQS + +M R ++Y + ++ + R Sbjct: 58 -IVNDVSCLV-----DNKIIVLAEHQSTINENMPLRFLQYIARLYEKLQKPTDRYLRTLS 111 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR 173 +P +FY+G ++ L + R + + ++ E++ + Sbjct: 112 KIPTPEFYVFYNGLNDYPETTVLKLSDAFITKPERIPLDLEVKVYNINKSKGAEVLSRCK 171 Query: 174 -VALLELIQKHIR-------QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR 225 + L + +R + + + + + + ++N ++ D Sbjct: 172 TLDEYSLFIEEVRLQTQLDPENGFTNAVKICIEKGILKEYLQRKSREVINMLIAEYDYDT 231 Query: 226 FNEFISELTRRMPQHR---ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITG 282 E ++ + + + + + G + RL+ Q + +I K+TG Sbjct: 232 DIAVQREEAGKIAFAKGISQGLSQGISQGLSQGSHQKALETARLMKQANCEIPFIAKMTG 291 Query: 283 LSAEQMQAL 291 L+ +++++ Sbjct: 292 LTQAEVESI 300 >UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostridia RepID=A5D0D4_PELTS Length = 332 Score = 110 bits (274), Expect = 8e-23, Method: Composition-based stats. 
Identities = 52/322 (16%), Positives = 105/322 (32%), Gaps = 36/322 (11%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEK--LRAL 60 HD LFK L +FME+ P + + DL+ +K + Sbjct: 1 MNKDQVDHDRLFKQLL--ETFFAEFMELFFP-EAAQATDLEYVKFLQQELFTDITAGEKH 57 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +DI+ + ++ G I V +E QS R+ Y + +++ ++P Sbjct: 58 RADIIVETRLKDEPGLILVHVEPQSYIQKEFNERMFIYFSRLYEKYRRK--------ILP 109 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 + + Y D F + + F +++ + + ++ L+ Sbjct: 110 VAVFT-----YDHIRNEPDSFEIGFSFLDVLRFHFYKLELKKLHWRDYIRSDNPVAAALL 164 Query: 181 QKHI-RQRDLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTRRM 237 K R + + + + + +L + ++ + L + EF EL + Sbjct: 165 SKMGFRPEERVQVKLEFMRMLARMKLDPARTELIGGFFETYLKLNRQEEEEFYRELGKID 224 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA-----------DPEWIQKIT----G 282 + E IM I H G ++G G E I + G Sbjct: 225 KKEVELIMQITTSWHEKGRMEGRLEGRLEGRLEGRLEGEARGKVEKAQEIICEYLKVRFG 284 Query: 283 LSAEQMQALRQPLPERERYSWL 304 L ++ + L ++E L Sbjct: 285 LDTSGIREKVRQLTDQEVLDRL 306 >UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1D5_ABIDE Length = 297 Score = 109 bits (273), Expect = 9e-23, Method: Composition-based stats. 
Identities = 50/297 (16%), Positives = 102/297 (34%), Gaps = 18/297 (6%) Query: 10 HDALFKTFLTHPDTARDFMEI--HLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D K L++ D D + +++ + +L + D+ + + Sbjct: 4 KDIAEKYLLSYNDVFADIVNGAVFGGEEIVKSNELADANGITQFKDDQNIHHEQVRDIAK 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + + ++ IE+QS D M R++ Y A + + + + + V+ ++ Y G Sbjct: 64 FWKKNEVIFSFIGIENQSAPDKDMILRIISYDGATYKSQMGN---ESIYPVLTIVIYWGK 120 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ----HRRVALLELIQKH 183 A + + F L+D+ + E+++ R VA QK Sbjct: 121 YEWKAPVSLQERINCPRELADIIPDYRFKLIDIGRLSGKELIKFKSDFRLVAEFIARQKE 180 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH-RE 242 + + ++ L+ A D + L + E R L + + Sbjct: 181 YKPGKEEIKHPEELLDLLDLLAGDKRFKELKGKVKNIRKEGRIINMCELLDEIENRGIEK 240 Query: 243 RIMTIAERIHNDGYIKGE--------QRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 I E+ G KG RI + + + I K TGL+ E+++ L Sbjct: 241 GIEQGIEQGIEKGIEKGRSEGEETATLRIAKKFKDSNVSIDIIMKATGLTKEEIEEL 297 >UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E7F Length = 324 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. 
Identities = 45/298 (15%), Positives = 96/298 (32%), Gaps = 25/298 (8%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 DALF+ + + L + LE+A +++ + +D+ + Sbjct: 24 RDYKDALFRMIFNDKEALLSLYNAVGNTSYTDASQLQIVTLENAVYMN-----IKNDLAF 78 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-----RHIEHDKRQPLPLVIPM 121 + + EHQS + +M R + Y + + I LP + Sbjct: 79 LLNMELN------LYEHQSTWNPNMPLRDLFYVSREYEMLLANQSIYSSSLLKLPAPRFV 132 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 +F++GS + L + + ++++ +DE++ R+ L E Sbjct: 133 VFFNGSYDMGEQCVLKLSDAYEKKVEDPDLELKVTVLNINAGWNDELMNTCRL-LKEYSL 191 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 R R ++ + V+ ++ +L L+ + I E + Sbjct: 192 YVARVRAYAKEME--LAEAVSRAVDECIKEGILRDFLMKYRAEAISVSIFEYDEE-REKE 248 Query: 242 ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT-----GLSAEQMQALRQP 294 T E +G +G + L ++ G I G S E ++ Sbjct: 249 LLRKTEYEFGRQEGLSQGREEGLSQGIKEGMAQGVSAMIRHCRKAGASREDTLSILME 306 >UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BFV9_9PORP Length = 293 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. 
Identities = 34/291 (11%), Positives = 85/291 (29%), Gaps = 17/291 (5%) Query: 11 DALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK + D + + + L + E + + ++ Sbjct: 10 DRGFKHLFGQEDSKELLVDLLNGLFEGERV----ITELSFLNVEMPAESTDSRAA--VFD 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP--LPLVIPMLFYH 125 +K ++ +G I++ +E Q+ + R + Y ++ L V + + Sbjct: 64 LKCKDKEGRIFI-VEVQNAPQTYFYERGLYYLCRIISDQDRRGNDWKFELYPVYGIFLLN 122 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 + T + +++ +E + K++ Sbjct: 123 FKSGKTDKVRTDIVLADRETGKQMSDTMRQIYLEMPFFNKEEAECETSLDYWLYTLKYME 182 Query: 186 QRDLM-----GLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 + + + + + + L + + L + + + M Sbjct: 183 KLETLPFKGQKQLFEKLERLAKIVNMNKKERMEYEESLKIYRDNQGVLDYAIEKGYMEGV 242 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + E+ G KG + + G D I +TGL+AE + L Sbjct: 243 EKGLKEGIEKGLEKGMEKGIYLVAAKMKMQGIDFATITSVTGLNAETIATL 293 >UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubacterium biforme DSM 3989 RepID=B7CC32_9FIRM Length = 301 Score = 107 bits (266), Expect = 6e-22, Method: Composition-based stats. 
Identities = 53/299 (17%), Positives = 102/299 (34%), Gaps = 15/299 (5%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 + D K FL + DF + R L + ++L+S H D++ Sbjct: 1 MNKIKDKTMKEFLENNAYFVDFFNAYFFDGERVLKPENCMELDSEMNDSNMDLEKHVDVI 60 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQR----HIEHDKRQPLPLVIPM 121 K +G+ Y +IE+QS D M R Y R ++ ++ LP+V + Sbjct: 61 R--KYNDGNLYSAFIIENQSYVDASMVVRAAAYEFVAYDRMLKKLKKNKAKEKLPMVHIL 118 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT-----VVPDDEIVQHRRVAL 176 +FY G + + D ++ L+++T ++++ + Sbjct: 119 VFYTGEKLWNAANKLSQLVEVDERFESYFHDYQMNLIEITGNTSYNFNEEDVYNLFYICR 178 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 Q ++ + + VL V + D + L E E + Sbjct: 179 SIYDQSIYEEKSNGFGLVKSSVLKVVKTLTDVEWLDLEELEEKEEIEMCEAEKRWLEVKS 238 Query: 237 MP----QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 ++ I E+ G K E + R ++ G + I I +S E ++ L Sbjct: 239 KEWEAKGIKKGIEQGIEQGIEQGSEKKELEMYRKMMDKGFGIKAIASIFSVSEESIEKL 297 >UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfitobacterium hafniense RepID=Q24MW9_DESHY Length = 295 Score = 107 bits (266), Expect = 7e-22, Method: Composition-based stats. 
Identities = 43/287 (14%), Positives = 90/287 (31%), Gaps = 13/287 (4%) Query: 11 DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D LFK + D F+ L + +L + L E L+ S + Sbjct: 12 DYLFKYIFGRQENKDILLSFLNAVLSPAGED--ELTDITLSDRELDPEHLKDKMSRLDIL 69 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIPMLFYHG 126 +G + IE Q + ++ R + Y + Q ++ + L + + + Sbjct: 70 GVANDGS---LINIEVQIASEKNIDKRTLYYWAKIYQSQLQSGMLYKDLARTVTVNVLNF 126 Query: 127 SRSPYPWSLCWLDEFADPTTARKLY-NAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 S P + + + +L + +++ R+ + + Sbjct: 127 SFLPDAQRYHSMFSLYEAHSGLRLNRDLEIHFLELEKWKALSTKPRTRLDKWLMYLSNTD 186 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 ++L + + + + N E R L+ E I Sbjct: 187 PKELEEIAMSEPAIGKALTVEEIFLK---NDKERYLYEMREKGIRDHLSAMDNAKTEGIE 243 Query: 246 TIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + G +G+ I +L+ G I +IT L EQ++ +R Sbjct: 244 QGLAQGIAQGIERGKTEIALSMLKKGLSLNMIAEITDLPIEQIEEIR 290 >UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTD5_DYAFD Length = 308 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. 
Identities = 43/305 (14%), Positives = 98/305 (32%), Gaps = 32/305 (10%) Query: 11 DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK + D DF+ + + + L S + Sbjct: 10 DFGFKRIFGSEANKDILIDFLNVLFAGERL----VADLTFASNENNGRIPILRRAIFDLC 65 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPMLF- 123 +G+ +I IE Q + R + YS ++++ +E R L V + Sbjct: 66 CTGADGEQFI---IEVQRVRQEYFKDRCLYYSASLIRDQVEAGGTNWRYDLKPVYLIGLM 122 Query: 124 -YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 + S L + +++ E + + K Sbjct: 123 DFCFEDSDDGHYLHEIRLIKRSNGQVFYDKFGLTFIEMPAFQKKESDLSTELDRWLYLLK 182 Query: 183 HIRQRDLMG------LIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRR 236 ++ + +++ + ++ + N + A Y+ D ++ + R Sbjct: 183 NLSKLNIVPPVLTNPVYQKVFRVAEVCNLNKEEKMAWDAYLKAKWDNENSMDYAKKEAMR 242 Query: 237 M-----------PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSA 285 + H+E + ++ G G++++++ +L G D + I ITGL+ Sbjct: 243 VGHEEGHKEGHKEGHKEGMKEGIKKGRETGIELGKRQVVKNMLAKGFDMQTISDITGLTF 302 Query: 286 EQMQA 290 EQ++ Sbjct: 303 EQIRN 307 >UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillus coagulans 36D1 RepID=C1PBU4_BACCO Length = 329 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. 
Identities = 54/339 (15%), Positives = 110/339 (32%), Gaps = 59/339 (17%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEK--LR 58 M HD LFK + + ++FM+ P DL D ++ S + Sbjct: 5 MEKHAGYHVHDRLFKELIQN--FFQEFMDAFFP-DLSADLDYRRVRFLSQEQFTDFPGGE 61 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 DIL K + D I + +E QS + R+ RY M + RH V Sbjct: 62 QKRVDILAETKVKGKDTVILIHVEPQSYYEKPFPERMFRYYMMISLRHR--------KPV 113 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 +P+ + D + ++ + + + ++ Sbjct: 114 LPIAVFSYEEKTETP-----DTYTFAFHNIEILRFHYLSIHLMKQNWRNYIRSNNPVAAA 168 Query: 179 LIQKHIRQR-DLMGLIDQLVVLLVTECANDSQITAL--LNYILLTGDEARFNEFISELTR 235 L+ K + + + + + +L + +++ L L +E E + + Sbjct: 169 LLSKMGYTETERVQVKLEFLRMLARMELDPAKMRLLHGFFDYYLKLNEKEEAEVMENIKM 228 Query: 236 RMPQHRERIMTI------------------------AERIHNDGYIKGEQR--------- 262 P E+++ + E+ +G G ++ Sbjct: 229 LDPDEAEQVLKLPNSYFDRGYKKGKEEGREEGIEIGVEKGREEGIEIGVEKGREEERKEM 288 Query: 263 -----ILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 I +LQ G + + I + TGLS +++ ++Q L Sbjct: 289 LQTIPIAIKMLQEGRELQLIVEKTGLSQREVEKIKQQLE 327 >UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF92_9FIRM Length = 307 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. 
Identities = 57/316 (18%), Positives = 107/316 (33%), Gaps = 44/316 (13%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 D L++ + + D + DL+ LE ++ Sbjct: 12 KQTHNRQYKDRLWRMIFNNKEDLLQLYNAINHTDYQNPDDLEVNTLEDVLYLSM-----K 66 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-------IEHDKRQP 114 +D+ + V G +Y EH S + +M R + Y + + + I H+KR Sbjct: 67 NDVSFLV---GGTMNLY---EHLSTFNPNMPLRGVFYFSRLYEGYVADNNLMIYHEKRVR 120 Query: 115 LPLVIPMLFYHGSR-SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR 173 LP ++FY+G++ P L D F + ++++ + E+++H R Sbjct: 121 LPKPKYIVFYNGTKNQPDSMELRLSDCFENTDNDAPCLECTATMLNINYGHNQELMKHCR 180 Query: 174 VA---------LLELIQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDE 223 + E IQ D L ID + V + N IL T D+ Sbjct: 181 RLEEYSIFVQCVREYIQSEPSVEDALEKAIDTCINQDVLADFLKKHRAEVTNMILTTYDK 240 Query: 224 ARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGL 283 + + + E R E +G ++G L Q ++ L Sbjct: 241 DLYEKTLKEDAR-------------EEGREEGLMEGRAETRAELNQLTICLLNAKRYNDL 287 Query: 284 S--AEQMQALRQPLPE 297 A+ ++ ++ L E Sbjct: 288 EHAAKDIEYQKKLLKE 303 >UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacteria RepID=B4SC57_PELPB Length = 299 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. 
Identities = 40/300 (13%), Positives = 89/300 (29%), Gaps = 13/300 (4%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M D FK + +D + + + E + ++L++ + + Sbjct: 1 MCKINPRV--DFAFKKLFGSEEN-KDLLISLINAIVSEEDQVVEIELKNPYNLADYRAGK 57 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPL 117 S + K G + +E Q ED + R + Y ++ + K + Sbjct: 58 ISILDIKAKAENGR---WFNVEMQISEDYNFDKRAIFYWAKLVTEQLSEGMMYKELKKTI 114 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVAL 176 I +L Y+ C+ +L++ +++ + Sbjct: 115 SINILDYNFVPDTTEVHSCYKIINTATGKDDRLHDVFELHYIELKKFNKLHHEISSTLDR 174 Query: 177 LELIQKHIRQRDLMGLIDQLV---VLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 Q D +L ++ A D + ++ + Sbjct: 175 WTTFLTTAHQLDREHTPKELALDKNIVKAIAAIDRMFNEEERQVYEVRKQSLVDAESKIA 234 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + + + E+ ++G + I LL G I + TGLS ++ +L Q Sbjct: 235 SALEKGMEKGMEMGLEKGRDEGINAASKTIALNLLGKGIAIATIAEATGLSVLEITSLSQ 294 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 104 bits (258), Expect = 5e-21, Method: Composition-based stats. Identities = 43/139 (30%), Positives = 66/139 (47%), Gaps = 5/139 (3%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 PHD FK TAR F+E +LP+++R L DL ++ + S++D++L+ S Sbjct: 1 MSLIHNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFS 60 Query: 63 DILWSVKTREGDGYIYVVIEHQSRE----DIHMAFRLMRYSMAVMQRHI-EHDKRQPLPL 117 D+L+ VK RE +GY Y + EH+ R M+ RL S+ QR + P Sbjct: 61 DLLFQVKIRENEGYFYFLFEHKVRPYADRRKKMSTRLADDSVLSKQREMFMQSVNHGKPP 120 Query: 118 VIPMLFYHGSRSPYPWSLC 136 I G+R+ C Sbjct: 121 YISRFIRKGNRTGSAACRC 139 >UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermoanaerobacterales RepID=B0K813_THEP3 Length = 267 Score = 104 bits (258), Expect = 5e-21, Method: Composition-based stats. 
Identities = 52/290 (17%), Positives = 106/290 (36%), Gaps = 29/290 (10%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 S +D K ++ A D L L + F + + SD++ Sbjct: 1 MSQEYDITAKNIFSN--LADDIASYFLG------LKFTKLDELNIEFTT--IESRESDMV 50 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 + T D I + IE Q+ D M +R++RY+ +M++H L ++ Y Sbjct: 51 FKCTTENRD--IALHIEFQTYNDSKMPYRMLRYATEIMEKHNL--------LPYQVVVYC 100 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI---QK 182 + L N + ++DV + ++IV+ + L + K Sbjct: 101 SKNELKMENNLNYHL-----GEENLLNFRYRIIDVGKIKFEDIVKTKYYDLYTFLPVADK 155 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 RQ++ + + ++ + ++ + ++ + E I ++ + Sbjct: 156 DKRQKEKEAYLRKCAEVIRDMPVDKAKKSYIVTTAEILAGIIYDEEVIEKIFSEVIGMSI 215 Query: 243 RIMTIAER-IHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + I G + I R LL+ G D I +IT LS E+++ L Sbjct: 216 LEESKVYKNILEKGKKEKSIEIARELLKEGMDINKIAQITKLSVEEIKKL 265 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 104 bits (258), Expect = 6e-21, Method: Composition-based stats. Identities = 61/125 (48%), Positives = 83/125 (66%) Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 +H +ALLELIQKHIRQRDLMGL++Q+ LL + AND QI L NYIL TGD RFN+ Sbjct: 12 RRHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFND 71 Query: 229 FISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQM 288 FI + R P+H+E +MTIAER+ +G I +++L++G I + TG+S E++ Sbjct: 72 FIDGVAERSPKHKESLMTIAERLRQEGEQSKALHIAKIMLESGVPLADIMRFTGVSEEEL 131 Query: 289 QALRQ 293 A Q Sbjct: 132 AAASQ 136 >UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0R0H3_BRAHW Length = 292 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. 
Identities = 34/294 (11%), Positives = 94/294 (31%), Gaps = 10/294 (3%) Query: 2 TNFTTSTPHDALFKTFLTHP---DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 +N + +D + + DF+ + + S+++ + E Sbjct: 5 SNNNFNVLNDYFVRYLFSDKGSEAILLDFINSIMLDSGMK--TFRSVEILTPFNYKENYE 62 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPL 117 + T+ G V+IE Q + + R++ Y + + ++ K L Sbjct: 63 DKETITDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLKQGEKYDALTP 119 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 VI + + + + + ++++ + + L Sbjct: 120 VISINLLNFNLDDNDSIHSCYMIYDTNNKRLLTDHLQIHIIELKKFKYNSLEYDLNCWLK 179 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 K ++++ + + + + E + + +++ + R Sbjct: 180 FFTMKDKDNKEVI-MSELVKEKPIMEEVQRRYNNFIKDRLMMNEYDKRQAYLYGNQIMLE 238 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + R + E +G + + + R + D I ++TGLS E+++ L Sbjct: 239 EERRLGRVEGKEEGIKEGIEQEKYSLARNMKNKNMDLNLISELTGLSIEKIEKL 292 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. 
Identities = 33/225 (14%), Positives = 80/225 (35%), Gaps = 11/225 (4%) Query: 45 LKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ 104 + L + S++ SDI++ D + YV++E QS D M RL+ Y + + + Sbjct: 1 MILVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWR 60 Query: 105 RHI-------EHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPL 157 + K LP ++P++ Y+G + + N + Sbjct: 61 DILRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEF 120 Query: 158 VDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 +D+ +E+ + ++ + R + ++L +++ + L + Sbjct: 121 IDINSYEKEELYNKQNISSAIFLLDQNINR--IEFYNRLKDIIIGFNNLSIEEKMHLKHW 178 Query: 218 LLTGDEAR--FNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE 260 L+ + F + I ++ Q + + + G+ Sbjct: 179 LVNINTEENNFKDNIEKIFNADKQEVLNMTSNISKGLEKLKEDGK 223 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 42/296 (14%), Positives = 90/296 (30%), Gaps = 25/296 (8%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK F+ L + + + + E + I++ + Sbjct: 10 DVGFKAVFQDKQVTIKFLNAALAGERQ----IKDITYLDKEIKPETVENRT--IIFDLLC 63 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ--PLPLVIPMLFYHGSR 128 + G +++ E Q+ + R Y ++ R + K+ L + + F + Sbjct: 64 EDVSGAKFIL-EMQNCPQHYFFNRGFYYLCRMVARQGQIGKQWQYRLLPIYGVYFLNFKL 122 Query: 129 SP-----------YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 + +E + + + I + + L Sbjct: 123 PEFTDFRTDVVLANERTGKVFNEIKMKQIYISFPLFSLSKEECKSSFERWIYTLKNMNLF 182 Query: 178 ELIQKHIRQRDLMGLID--QLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 E Q + L+D + L E A + T D A+ + Sbjct: 183 EQSPFKEEQETFLRLLDVANVNSLSEKERAIYEENLKNYRDWYATIDYAQTEGIEKGMQE 242 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 ++ + E+ G + + +I R + + G D E I + +GLS E ++ L Sbjct: 243 ---GMQKGMQKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIERL 295 >UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenomonas 
sputigena ATCC 35185 RepID=C9LXX0_9FIRM Length = 301 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 59/313 (18%), Positives = 104/313 (33%), Gaps = 39/313 (12%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 T T D+LF+ + + + E L D + L A+ + + + Sbjct: 2 RNTKRTYKDSLFRDIFNNAERLPEIYEALLD----HKTTPDDITL--ATIDETLFTGVKN 55 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-----IEHDKRQPLPL 117 DI + V +++EHQS + +M RL+ Y + + +R+ I + PLP Sbjct: 56 DIGFIV-----GNQHVLLVEHQSTINANMPLRLLMYLVEIYRRYVDKDAIYKKELIPLPA 110 Query: 118 VIPMLFYHG-SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 +FY+G + P W+L D F + ++ P+ I++ Sbjct: 111 PKFYVFYNGLAEMPDIWALHLSDAF---GGHDSDLELEVKVFNINDKPNRPILEKCHAL- 166 Query: 177 LELIQKHIRQRDLMGLIDQLVVL---LVTECANDSQITALLNYILLTGDEARFNEFISEL 233 + + + + L N Q +Y+ + + E L Sbjct: 167 -------KSYSVFVAKVRECIKNGSSLEIAVGNAVQYCVAHDYLGEYFRQKQAKEVFDML 219 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA------DPEWIQKITGLSAEQ 287 Q R + AE G G Q L L G + K E+ Sbjct: 220 NFVWNQER-ALEVRAEEAMEKGLRLGRQEGLSQGLSQGVLETTTASIRNVMKSMDFPIEK 278 Query: 288 MQALRQPLPERER 300 + Q +PE ER Sbjct: 279 AMDILQ-IPEEER 290 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. 
Identities = 45/312 (14%), Positives = 89/312 (28%), Gaps = 23/312 (7%) Query: 1 MTNFTTSTPHDALFKTFLT-HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 M D FK HP + LP L + + +K V + Sbjct: 1 MVMKYLDPKADLTFKKIFGNHPKRLISLLNALLP--LSDEEQIREIKYLPTELVPQLEGG 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLV 118 ++ + V + G + +E Q R++ + + + K L V Sbjct: 59 KNT--IVDVLCTDVRGRKFC-VEMQMEWSDAFQQRVLFNASKLYVSQAKKGGKYSELQPV 115 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVALL 177 + + + + K+ F +++ I R + L Sbjct: 116 YSLNLINDIFAHDTPDFIHNYRIVHDKDSNKVIEGLHFTFIELPKFTPHSIADKRMMVLW 175 Query: 178 ELIQKHIRQR------DLMG--LIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEF 229 I DL+ I + V L +D+++ A + E + Sbjct: 176 LRFLTEINSNTKDIPADLLNDPEIGKAVEELEISGFSDAELRAYDKFWDSVSVERTLIDD 235 Query: 230 ISELTRRM-------PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITG 282 + + + + E+ +G + I + LL G E + K T Sbjct: 236 SYQKGKEKGKQEGLAEGMEKGMEKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQVSKATQ 295 Query: 283 LSAEQMQALRQP 294 L E ++ L Sbjct: 296 LPLEIIKNLSNS 307 >UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWJ8_9FIRM Length = 292 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. 
Identities = 51/297 (17%), Positives = 113/297 (38%), Gaps = 25/297 (8%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 T D+LF D +D +D+ L ++ + L+ F DE +D+ + Sbjct: 6 RTYKDSLFCDIFRRKDYLQDVYRGLFGRDV-SLQEIQLMTLQGTFFNDE-----KNDVSF 59 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-----KRQPLPLVIPM 121 G I V++EHQS + +M R+ Y + ++ + D +R LP Sbjct: 60 LA----GKRQI-VLMEHQSTLNENMPLRMFWYMAKLYRKQVPKDAPYRTRRLRLPAPCFY 114 Query: 122 LFYHGSR-SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV--ALLE 178 +FY+G +P W + + F ++ ++ + +++ R Sbjct: 115 VFYNGLDPAPDEWEMRLSEAFEGECSS---LELCVKAYNINEMSGSRLLEKSRALKGYSV 171 Query: 179 LIQKHIRQRDLMGLIDQLVVL-LVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 + + R+ +++ V + D L + + ++ EL +R+ Sbjct: 172 FVAQIRRKTAAGVCLEEAVKQAIRYCIEQDLLAEYFLEREMEEVFDMVSFKWDPELAKRV 231 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLL-QNGADPEWIQKITGLSAEQMQALRQ 293 Q +E E+ G KG I+ +L + + I +++ +++++L + Sbjct: 232 -QLQEAQEIGMEKGMEKGMEKGVTEIVLNMLKKKKWSLQDISEVSQWPLDKIESLGK 287 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. 
Identities = 43/247 (17%), Positives = 106/247 (42%), Gaps = 9/247 (3%) Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQR-------HIEHDKRQPLP 116 +++ VK ++ + + Y+++E QS+ D M +RL+ Y + V + + + K LP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRVA 175 +IP++ Y+G + + + + L+DV ++E++Q ++ Sbjct: 61 AIIPIVLYNGVNRWTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNLIS 120 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 + L+ + I + +L +L +L + I + +++ E + Sbjct: 121 SIFLLDRKIDKEELTEKWGKLADVLKDISEEEFIILRNWLFSVVSRFLPEDKEKEIKEIL 180 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGLSAEQMQALRQP 294 + E +++ ER + + K + L+ L+ G + I K+ G +++ +R Sbjct: 181 VQSEGVEEMISNLERSLREEFRKTRREGLKEGLKKGKLEGLKIGKMEGRMEGKIEGIRMV 240 Query: 295 LPERERY 301 + E+ + Sbjct: 241 VFEQLKE 247 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. 
Identities = 42/255 (16%), Positives = 109/255 (42%), Gaps = 9/255 (3%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH-SDIL 65 PHD K L + A+ ++ HLP+++ + ++L++ + +D K ++ + +DI+ Sbjct: 14 QNPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADII 73 Query: 66 WSVKTR-EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFY 124 +S+KT D IYV+IEH+S +D H+ +L++ AV + I K + + P++ Y Sbjct: 74 YSLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEGK---ITPIYPIVIY 130 Query: 125 HGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH-RRVALLELIQKH 183 S + + +++ + + I + + + L + + Sbjct: 131 ASKEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMTLRI 190 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 I++ +++ L+ + T + ++ + + ++ +++ Sbjct: 191 IQEPTPENILN-LIKSIETLYNYKPKAVYVIALSYIFTIAKKDKNTYIKVKKQLEGG--N 247 Query: 244 IMTIAERIHNDGYIK 258 + ++ + +G K Sbjct: 248 MGSLLDMFIEEGLEK 262 >UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA1_9CLOT Length = 302 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. 
Identities = 43/282 (15%), Positives = 95/282 (33%), Gaps = 28/282 (9%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D+LF+ + + DL + ++ +DI + + Sbjct: 17 KDSLFRVIFSEKKELLELYNAINGSHYENPDDLIITTIGDVLYLGM-----KNDISFLI- 70 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PLPLVIPML 122 G + E QS + +M R + Y + Q +++ + LP ++ Sbjct: 71 -----GQHLSLYEAQSTWNPNMPLRGLFYFSRLYQGYLKEHQLDLYSRRPLSLPFPEFIV 125 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR-VALLELIQ 181 FY+G+ + L + ++++ ++E+++ R + + Sbjct: 126 FYNGTMEQPDRTQLRLSDLFYQAEGVPCLECTATMININYGHNEEMMKSCRKLYEYAFLI 185 Query: 182 KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHR 241 +R R GL + V + D +L LL E +SE + + Sbjct: 186 NAVRSRLNEGLHLEAA---VDQAVEDCIQHDVLKNFLLKHREEVREMILSEYDEELHINS 242 Query: 242 ERIMTIAERIHN---DGYIKGEQRI---LRLLLQNGADPEWI 277 E+ ++ E + G G++R+ + L G + I Sbjct: 243 EKKISYEEGLEAGVVQGTQHGQERVNALITRLAAAGRADDII 284 >UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW Length = 286 Score = 99.9 bits (247), Expect = 9e-20, Method: Composition-based stats. 
Identities = 44/293 (15%), Positives = 102/293 (34%), Gaps = 25/293 (8%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLRALHSDILWS 67 HD LFK LT + + E D L S + D+L Sbjct: 7 HDRLFKELLTTFFEEF---ILLFFPHVHEHIDFRHLSFLSEELFTDVTAGEKYRVDLLIQ 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 K + G I + +E+QS R+ Y + +++ + ++P+ + Sbjct: 64 TKLKGEAGIIIIHVENQSYMQSSFPERMFIYFSRLFEKYRTN--------ILPIAIFS-- 113 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI-RQ 186 Y + F + F V++ ++ L+ K + Sbjct: 114 ---YDFIRDEPSSFTLQFPFLHVLQFQFLAVELRKQNWRHYIRSENPIATALLSKMGYNE 170 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNY------ILLTGDEARFNEFISELTRRMPQH 240 + + L Q +L+ + ++++ L+ + + +E NE + Q Sbjct: 171 NERVELKKQFFRMLIRQNIDEAKRRLLIGFFETYVKLTEQEEEQFQNEVKKMGGKEGEQV 230 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 E I++ ++ G + E+ +++ +++ G I + S E+++ + + Sbjct: 231 MELIISYEQKGKIAGAKEKEREMIQKMVEKGMSITQIAHLLDRSEEEVRKVVE 283 >UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostridiales RepID=A6BF26_9FIRM Length = 366 Score = 99.9 bits (247), Expect = 1e-19, Method: Composition-based stats. 
Identities = 49/276 (17%), Positives = 95/276 (34%), Gaps = 18/276 (6%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D +F+ + ++ + L + LE+A ++ +D Sbjct: 51 KAKRMYKDTIFRMLYHDKENLLSLYNAVNGREYTDPEKLQVVTLENAIYMGM-----KND 105 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR-----QPLPLV 118 + + + + Y+Y EHQS + ++ R + Y QR + Q +P Sbjct: 106 LAF---IMDMNLYLY---EHQSTYNPNIPLRNLFYIADEYQRLVVRKSLYSTVIQKIPTP 159 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 ++FY+G++ S L + T +++V ++++H R L E Sbjct: 160 RFLVFYNGTKEVEDRSEFRLSSAYENPTENPDLELRVTMLNVNDGHSSDLMEHCR-TLKE 218 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 Q R R D + VT ++ +L LL I E + Sbjct: 219 YAQYVARVRKYAAKQDVSLEEAVTRAVDECIEEGILAEFLLKNKTEVIRVSIYEYDKEFE 278 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADP 274 + + R E DG G Q + + Q+G + Sbjct: 279 EKKLRKAE-YEAGRQDGIEIGRQDGIEIGRQDGIEI 313 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 99.9 bits (247), Expect = 1e-19, Method: Composition-based stats. 
Identities = 56/299 (18%), Positives = 104/299 (34%), Gaps = 32/299 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D + K + P+ F+ L D+ + L+ +L F +++L + D+ K Sbjct: 13 DIMAKKIFSLPEVTVAFIRDILDLDVVDAQILEGTQLHKKDFDEDELFSTSVDV--RAKL 70 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSM--------AVMQRHIEHDKRQPLPLVIPML 122 +G V+IE Q R+ + R Y + Q+ H + + V + Sbjct: 71 NDGTE---VIIEIQVRKQHYFLNRFHYYLANQLVENVQQLRQQGQTHKMYEQMEPVYGIA 127 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ- 181 + P S A+ T + LY+ D + ++A LEL + Sbjct: 128 ILEKTLLPDEESPINTYWMANSRTGKPLYSF---------YKDGKQQNLLQIAFLELDKY 178 Query: 182 -KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 K RD + L A +T + + + I E R + Sbjct: 179 NKDKHIRDEGRQWLEFFGNLPFSKAPSRAVTHADSLLDSSSWTQEEKAMIDERIRIQENY 238 Query: 241 RERIMTIAERIHNDGYI--------KGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + T + +G +G+ ++R +L G E + +TGLS E++ L Sbjct: 239 DMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSLEELDGL 297 >UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermoanaerobacterales RepID=B0KCX4_THEP3 Length = 267 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. 
Identities = 55/292 (18%), Positives = 105/292 (35%), Gaps = 29/292 (9%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 S +D K ++ D + + D +++ SDI+ Sbjct: 1 MSQKYDITIKDIFSN---MADDITAYFLGLTYTKTDELNIEFTKVE-------KRQSDIV 50 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 T +GD I V +E QS D M +R++RYS+ +M+++ ++ Y Sbjct: 51 LKCTTEKGD--IAVHLEFQSDNDDKMPYRMLRYSLEIMEKYNLT--------PYQLVIYM 100 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 G L ++ + + + ++DV + +I + L L+ R Sbjct: 101 GKND-----LRMENKLDYNLGEENILDYRYKIIDVGTIKFLDITKTDYYDLYALLPIMDR 155 Query: 186 QR---DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL-TRRMPQHR 241 +R + + + V + + ++ + + E I + T M R Sbjct: 156 ERRKTEGEKYLKECVEAIKNIPIDINKKKDITFKAEILSGLVYSREVIERVFTEVMEMLR 215 Query: 242 ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + I G + RI + LL+ G D I KIT LS E+++ L Sbjct: 216 IEESEAYKMILEKGAKEKSLRIAKELLKEGMDINKIAKITELSIEEIKKLMN 267 >UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3A9D Length = 324 Score = 97.9 bits (242), Expect = 4e-19, Method: Composition-based stats. 
Identities = 49/317 (15%), Positives = 105/317 (33%), Gaps = 43/317 (13%) Query: 9 PHDALFKTFLTHPDTARDFMEIHL-------PKDLRELCDLDSLKLESASFVDEKLRALH 61 D L K + T PD D + L + D+++ ++E + L Sbjct: 18 QKDILLKDYFT-PDIFADAINAILYDGKSVVTPERMRTIDIETQRVEDENGNVTADTRLR 76 Query: 62 SDILWSVKTREGDGYIYVV--IEHQSREDIHMAFRLMRYSMAVMQRHIEHDK--RQPLPL 117 S K E D IY + IEHQS ED M R+M Y + R ++ +K + + Sbjct: 77 D----SAKVVEVDDAIYCLFAIEHQSVEDYTMPLRIMEYDVREYLRQVKSNKGVQVRIKP 132 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEF--------ADPTTARKLYNAAFPLVDVTVVPDDEIV 169 +I ++ Y + + D F + + L + V ++++ Sbjct: 133 IITIVMY-WKADKWNQPVSVKDMFDKNTVRWLEYNGLGGYIQDYRMHLFEPGTVKEEDLE 191 Query: 170 QHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEF 229 + + +D++ + + N+ L + +E +++ Sbjct: 192 KFK-----------TELKDVIAYVKYSKSTEALKDYNEKYKPDLTKSTVTLINELTNSKY 240 Query: 230 ISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILR-------LLLQNGADPEWIQKITG 282 + + E + E G + + + L + G I + G Sbjct: 241 VFIEGKERLDMCEAFEGLIEEGRAKGKAEELKEKYKSWVTLSNNLKKRGMSNPEIASLLG 300 Query: 283 LSAEQMQALRQPLPERE 299 + ++Q + + E + Sbjct: 301 VPETELQKAFKMIKEEK 317 >UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostridium kluyveri RepID=B9E303_CLOK1 Length = 304 Score = 97.5 bits (241), Expect = 4e-19, Method: Composition-based stats. 
Identities = 41/239 (17%), Positives = 83/239 (34%), Gaps = 26/239 (10%) Query: 79 VVIEHQSREDIHMAFRLMRYSMAVMQ-------RHIEHDKRQPLPLVIPMLFYHGSRSPY 131 +E QSR D M RL+ Y + + + ++ K LP +IPM+ Y+G + Sbjct: 28 CFLEFQSRVDYRMPMRLLFYMVEIWREILKNTSKNDRSKKDFKLPSIIPMVLYNGKNTWT 87 Query: 132 PWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL-LELIQKHIRQRDLM 190 + + + L D+ ++++ + + L+ K I + DL+ Sbjct: 88 ACKNFKDVLSGSKLFGENVIDFRYMLFDIYRYNEEQLEDMANMVSTVFLLDKEISKEDLV 147 Query: 191 GLIDQLVVLLVTE-----------------CANDSQITALLNYILLTGDEARFNEFISEL 233 + +L DS+ + IL + + +S L Sbjct: 148 KRLRLTAYVLKKITPEQFDILKAWLKSIIKPRLDSESKIKIEEILEKSSQGEVDSMVSNL 207 Query: 234 TRRMPQ-HRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + RE T E +G +G + + + G + IT + ++ L Sbjct: 208 GKTIDNIIREGRETGLEEGRREGRKEGRKEGRKEGRKEGRKEGKSELITKMLVKKFTKL 266 >UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P8S5_9BACT Length = 303 Score = 97.5 bits (241), Expect = 5e-19, Method: Composition-based stats. 
Identities = 38/294 (12%), Positives = 92/294 (31%), Gaps = 15/294 (5%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK +D + L + + + V + + ++ V Sbjct: 16 DFGFKRIFGT-AMNKDLLICFLNSLFNGRQVVKDVSYLNPEHVGDVYTDRRA--IFDVYC 72 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ--PLPLVIPMLFYHG-- 126 +G ++ +E Q+ + R + YS ++ L + + + Sbjct: 73 EGENGEKFI-VEMQNAYQTYFKDRALFYSTFPIREQAPKGNEWDFKLNNIYTVALLNFNM 131 Query: 127 -SRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 + + + D T + Y+ + V+++ K++ Sbjct: 132 NEDAFDKEKIRHHVQLCDTATHKVFYDKLEYIYVEISKFNKTLEELDTLYEKWLYALKNL 191 Query: 185 RQ--RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH-R 241 + + L D++ L E + + + + + + + Sbjct: 192 YKLTQRPKELCDKVFDRLFEEAEIAKFTPQEMR--EYETSKMAYRDIKNSVDTAKREGIA 249 Query: 242 ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 E I E+ +G I R +L G D I +TGL++E+++ L+ + Sbjct: 250 EGIEIGMEKGRAEGMNLRSLEIARKMLAKGMDEASIMDMTGLTSEEIKLLKAEI 303 >UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1J9_CLOB8 Length = 278 Score = 97.5 bits (241), Expect = 5e-19, Method: Composition-based stats. 
Identities = 47/296 (15%), Positives = 95/296 (32%), Gaps = 25/296 (8%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCD-LDSLKLESASFVDEKLRALHSDI 64 S +D +FK +D + L L+ D L+ ++L + + E + Sbjct: 3 ISPKNDFVFKLLFGDEKN-KDLIIELLNSILKMPHDELEDIELINTELLREFAEDRKGIL 61 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLVIPMLF 123 KT+ G+ ++ IE Q +MA R + Y + I+ L I + Sbjct: 62 DVRAKTKSGE---HIDIEIQVLYTYYMAERTLFYWSKMYNGQIKSGYTYDKLKKCITINI 118 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI---VQHRRVALLELI 180 + D T + +++ + D+ I V + + Sbjct: 119 VDFNCIEINKLHTSFHITEDETNKKLTDVLEIHYLELPKLFDNNIPKDESEPLVQWMMFL 178 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 Q R ++ ++ + + IL + E Sbjct: 179 Q--SRNKEAFEMLAEKNEKIKKA-----------YNILEVISKDDNARAAYEAREAELHD 225 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 + MT + +G + + + L G D E + K TGLS +++ ++ L Sbjct: 226 Q---MTRLKSAREEGIKEATIKNAKNFLVMGLDVEMVAKGTGLSVDEVLKIKGELN 278 >UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LE73_9FIRM Length = 326 Score = 95.6 bits (236), Expect = 2e-18, Method: Composition-based stats. 
Identities = 52/311 (16%), Positives = 111/311 (35%), Gaps = 28/311 (9%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPK-----------DLRELCDLDSLKLESASFVDEKL 57 D + K + DF+ L + + L E K Sbjct: 3 EKDIILKEYQRDSRHFCDFVNGALAQGRPLLKRGQLVPVPTELVLVKDTEEDDENAVVKT 62 Query: 58 RALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR----- 112 DI + + G I V I++Q+ D M R+M K Sbjct: 63 VQRFRDITGKAEADKNAGCIIVAIQNQTTVDYGMPLRVMLEDALEYDVQRRTKKNRKLHK 122 Query: 113 -QPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKL--YNAAFPLVDVTVVP-DDEI 168 + L LVI ++FY+G+ +P+ + + P R+L Y ++P+V VT D Sbjct: 123 GEKLCLVITLVFYYGT-TPWRAPSDLAEMISVPREFRQLREYIQSYPIVVVTPENVDTAC 181 Query: 169 VQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAN-DSQITALLNYILLTGDEARFN 227 + +LE++++ ++++ +++ + + + I AL +++ + Sbjct: 182 FRGGWQEILEILRRQNDEKEMGRYLEKNRAIYEKLPEDTNRVIFALTDHLDYYRELKEKG 241 Query: 228 EFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQ 287 E I+ ++ + E G +G ++ ++ + G D I + + Sbjct: 242 EKITMCKAFTDHYK----SGVEEGKKQGMKRGRRQGIKQGKRQGMDMGIRAMIE--TCRE 295 Query: 288 MQALRQPLPER 298 ++ R +R Sbjct: 296 LKIPRNETKKR 306 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 95.6 bits (236), Expect = 2e-18, Method: Composition-based stats. 
Identities = 66/150 (44%), Positives = 96/150 (64%), Gaps = 24/150 (16%) Query: 163 VPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGD 222 +PDD+I+QHRR+ALLELIQKHIR+RDLMGL+++L +LLV AND+Q+ AL NY++ G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 223 EARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR-------------------- 262 F EF+ E+ R+PQH++++MTIAER+ +G++ G Q Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 263 ----ILRLLLQNGADPEWIQKITGLSAEQM 288 I + +G DP I +ITGL+AE + Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDL 150 >UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXQ3_9FIRM Length = 290 Score = 95.6 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 41/302 (13%), Positives = 103/302 (34%), Gaps = 25/302 (8%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 T D LF+ + + + + D+D +++ + D + Sbjct: 6 TGNANREYKDRLFRFVFGAEENKAYLLSLCNAVSGTDYTDVDDIEI--TTLSDAIYIKMK 63 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR-------QP 114 +DI + + ++ + EHQS + +M R M + +I + Q Sbjct: 64 NDISFLIDSQMN------LFEHQSTFNPNMPLRGMECFAELYGIYIIENNLDIYVSSLQK 117 Query: 115 LPLVIPMLFYHGSRS-PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRR 173 + + Y+G+ P L D F P + + + ++++ + ++++ + Sbjct: 118 ILTPRYYVIYNGTEKQPDVVKLKLSDAFQVPDDSGE-FEWTATMLNINYGHNRKLLEQCQ 176 Query: 174 VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 L E R+ + +L + + + ++ E ++E Sbjct: 177 P-LYEYAHFIKLVREYSEAM-ELKKAIDKAVEKAREWKCIGTFLYQCKSEVSVM-LLTEF 233 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + + + + +G K + + +L PE I K +S + + L++ Sbjct: 234 DEKKHED-----NLIKLGEKEGREKERMKNICSMLALSLSPEIIAKACEVSVDYVLNLKK 288 Query: 294 PL 295 L Sbjct: 289 EL 290 >UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RGS0_MOOTA Length = 310 Score = 95.2 bits (235), Expect = 3e-18, Method: Composition-based 
stats. Identities = 42/295 (14%), Positives = 111/295 (37%), Gaps = 29/295 (9%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 +D K ++ + + R DL ++ ++ SD++ Sbjct: 7 NRYDITIKDLFADET--QELINYFGHFEARVTGDL-KIEF-------PQVETRVSDLVMK 56 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 ++++G + + +E QSR D M +R++RY++ + + + V ++ Y G Sbjct: 57 AESQQGP--LAIHLEFQSRNDDEMPYRMLRYALEIHKTYHL--------PVYQIVIYFGQ 106 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH---RRVALLELIQKHI 184 W + + + L + + L+DV + +E+ R ++LL ++ + Sbjct: 107 -----WQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRLLSLLPVVDREK 161 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 RQ+ + + ++ + +L + + I + R + Q Sbjct: 162 RQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGLVFDKKAIDLVFREVEQMLSIE 221 Query: 245 MT-IAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPER 298 + +RI G KG ++ + ++ G + + + ++ ++ + +P Sbjct: 222 ESAGYQRIFEKGMEKGIEKGMEKGMEKGIEKGQQESLLDVTIRLLRKKFRKIPRE 276 >UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolbachia RepID=Q5GSR2_WOLTR Length = 317 Score = 94.8 bits (234), Expect = 3e-18, Method: Composition-based stats. 
Identities = 41/309 (13%), Positives = 99/309 (32%), Gaps = 29/309 (9%) Query: 9 PHDALFKTFLT---HPDTARDFMEIHLP-KDLRELCDLDSLKLESASFVDEKLRALHSDI 64 D +FK + F+ L ++ + +++ L + +++ D+ Sbjct: 10 KFDLIFKKIFGTEKNKKIIICFLNNILGFAEINAIQEVEFLSAIIDPEIASNKQSIIVDV 69 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ--PLPLVIPML 122 K G + IE Q + R+ Y++ R ++ L V + Sbjct: 70 F--CKDATGTRRV---IEVQLAINKGFEKRVQPYAVKAYSRQLDKSGNYIVDLKKVFFIA 124 Query: 123 FYHGSRSPYPWSLCWLDEFADPT-TARKLYNAAFPLVDVTVVPDDEIVQHRRVA----LL 177 + + D L + F +++ ++ Q + Sbjct: 125 ISNCNLLSEKVDYISTHNIHDTKTNGHYLKDFQFIFIELPKFSKSKVEQLINIVEHWCFF 184 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQI---TALLNYILLTGDEARFNEFIS-EL 233 + + DL + +++++ + D ++ Y + + + L Sbjct: 185 FKNAEDTTETDLKRVAKKVLIIKLAYDGLDEFHWNEEDIIAYEERVMNLQKEKAILEYRL 244 Query: 234 TRRMPQHRERIMTI---------AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLS 284 + RE + I AE+ +G K + + + L+ G I +I GLS Sbjct: 245 DLATEKGREEGVKISKERGIKVGAEKGREEGVKKAKIAVAKNSLKAGMSIGAIAEIIGLS 304 Query: 285 AEQMQALRQ 293 +++ L + Sbjct: 305 VGKIKKLHE 313 >UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQ96_CLOCL Length = 288 Score = 94.8 bits (234), Expect = 3e-18, Method: Composition-based stats. 
Identities = 43/295 (14%), Positives = 89/295 (30%), Gaps = 19/295 (6%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLR-ELCDLDSLKLESASFVDEKLRALH 61 NF S D +FK +D + L L + +++ + E Sbjct: 9 NFIMSPKIDFVFKLLFGDEKN-KDLLIAFLSAVLNLPEREFVGIEILNTELFREFKEDKK 67 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIP 120 + VKT G + IE Q M R + Y + ++ L I Sbjct: 68 GILDVRVKTVNGKQ---IDIEIQVLPTEFMPERTLFYWSKMYTTQVKPGDTYDKLKKCIT 124 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 + P D T + +++ + D +I + +++ + Sbjct: 125 INIVDFKCIPLNKLHTSYHLIEDETGHKLTDILEVHFLEIPKLFDKQIEINEDDPIIQWM 184 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 +D ++ A ++ +L + I E + Sbjct: 185 ----------EFLDGKSKGVMEMLAEKNESIKKAYNLLKIISKDEKARMIYEAREAELRD 234 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPL 295 + +T G + R+ +++ G I ++T LS E++ L+ L Sbjct: 235 Q---LTRIRSAEEKGANEKALRVAEKMIKRGDSINDIIELTELSKEKILELKNKL 286 >UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Rickettsia RepID=A8GY36_RICB8 Length = 279 Score = 94.8 bits (234), Expect = 3e-18, Method: Composition-based stats. 
Identities = 42/292 (14%), Positives = 88/292 (30%), Gaps = 27/292 (9%) Query: 11 DALFKTFLTHPDTARDFMEIH--LPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSV 68 D FK T F+ LP++LR + LK S V + + S + V Sbjct: 10 DVAFKKLFTDKARLISFLNNIMRLPEELR----IIDLKYISNEQVPDLGQNKRS--IVDV 63 Query: 69 KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIPMLFYHGS 127 K + G IY+ +E Q+ R+ Y ++ K L V+ ++ G Sbjct: 64 KVTDNSGNIYI-VEMQNGYADAFLARVQFYGCVAFSSQLKRGKEYADLAPVVMVIITSGF 122 Query: 128 RS--PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 ++ + + +L ++ V++ + + Sbjct: 123 QALPEEKECISYHQTINVGNGKNQLKCLSYVFVELDKFTKEANELETIEDDWLYMM---- 178 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 + ++ + + + Q E Sbjct: 179 --------AKFDKAKEPPKHTQDEVVLSAYKTIEQFNWSEAEYDNYIKAMLAAQTEELNQ 230 Query: 246 TIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 +G + + + +LQ+ E I K T LS E+++ L+ + + Sbjct: 231 KSK---FKEGKAERSIEMAKEMLQDNEPIEKIIKYTKLSKEEIEKLKLEIEK 279 >UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Roseburia inulinivorans DSM 16841 RepID=C0G0A4_9FIRM Length = 319 Score = 94.5 bits (233), Expect = 4e-18, Method: Composition-based stats. 
Identities = 40/252 (15%), Positives = 88/252 (34%), Gaps = 19/252 (7%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 D +F+ + + + DL+ + LE+A ++ + +D+ Sbjct: 53 NRNYKDTVFRMLFSDRKNLLSLYNAVNQSNYKNPEDLEIVTLENAIYMG-----IKNDLA 107 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-----RQPLPLVIP 120 + + + Y+Y EHQS + +M R + Y + Q+ ++ Q +P Sbjct: 108 F---IMDTNLYLY---EHQSTYNPNMPLRDLFYICSEYQKLVDKKSLFSSTLQKIPAPNF 161 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 + FY+GS + L + T +++V + +++QH + L E Sbjct: 162 IEFYNGSTVISDCTELRLSSAFECLTGEPKLELIVTVLNVNEGHNADLMQHCSM-LKEYA 220 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 Q R R D + V ++ +L L + I E + + + Sbjct: 221 QYVARVRHYAS--DMPLNEAVKHAVDECIREGILAEFLTQNRNEVISMSIFEYDKELEEK 278 Query: 241 RERIMTIAERIH 252 ++ + Sbjct: 279 NYEKQSLRQDAK 290 >UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z376_9FIRM Length = 316 Score = 94.5 bits (233), Expect = 4e-18, Method: Composition-based stats. 
Identities = 44/304 (14%), Positives = 99/304 (32%), Gaps = 28/304 (9%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 + D +F+ + + +E++ + + D L++ + D + + Sbjct: 1 MEGSKKHKDRVFRKLFGYEKNKGNLLELYNALNDSNYTNPDDLEI--NTLDDVFYMNMKN 58 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-------EHDKRQPL 115 D+ + + IY EHQS +M R RYS + +I K + Sbjct: 59 DVSC---IIDWNMAIY---EHQSTWSYNMPLRGYRYSAELYNDYIVRNNLDVFRRKLIKI 112 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 P +FY+G+ + L + + ++++ ++E++ + Sbjct: 113 PTPQYYVFYNGNEKRPDREVLKLSDAFMVPCKDGEFEWTATVLNINAGHNEELMSKCSIL 172 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 R+ ++ ++ L I ++Y L F + Sbjct: 173 -----------REYAIMVSKIKEFLAESLELKDAIKKAIDYCLDNNVLKEFLQDHRSEVE 221 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITG--LSAEQMQALRQ 293 M D Y +GEQ L + NG + I+ + + + ++ + Sbjct: 222 DMLWREYNEEETMAHWKEDFYEEGEQHGLEVGRANGEKIKLIKLVCKKLVKNKSIEEIAD 281 Query: 294 PLPE 297 L E Sbjct: 282 DLEE 285 >UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=B8FTH9_DESHD Length = 325 Score = 94.5 bits (233), Expect = 5e-18, Method: Composition-based stats. 
Identities = 36/330 (10%), Positives = 94/330 (28%), Gaps = 45/330 (13%) Query: 3 NFTTSTPHDALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 S D FK + F+ L + + + + E Sbjct: 2 KEFISLKIDYAFKLIFGKEGNEAILIAFLNAALKLPQERRI--EEITIINPELNKEYPED 59 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLP 116 S + T +G + + IE Q M R + Y + R I K Sbjct: 60 KKSILDVRAITSQG---MQINIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKT 116 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVV-----PDDEIVQH 171 + I ++ ++ + + + D + +++ + + + Sbjct: 117 VSINIVDFNYLKQTSSY-HNVFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRKREISLWE 175 Query: 172 RRVALLELIQKHIRQRDLMGLIDQLVV-------------------LLVTECANDSQITA 212 + L+ + ++++ +++++ + + + + Sbjct: 176 NELVRWLLLLEGADNQEILQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAIL 235 Query: 213 LLNYILLTGDEARFNEFISELTRRMPQHR---------ERIMTIAERIHNDGYIKGEQRI 263 + + + + + + R E +G +G + Sbjct: 236 DEKAAIREAELRLQEALEEGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEGRAEV 295 Query: 264 LRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + LL G + I + TGLS E++ L+ Sbjct: 296 AKKLLVLGFEITKIAEATGLSEEEISGLKD 325 >UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostridium RepID=B6FJ15_9CLOT Length = 310 Score = 93.3 bits (230), Expect = 9e-18, Method: Composition-based stats. 
Identities = 39/313 (12%), Positives = 104/313 (33%), Gaps = 41/313 (13%) Query: 4 FTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D +F+ + + DL +E ++ +D Sbjct: 14 KINKKYKDRIFRMIFHEKKELLELYNAVNNSNYTNPDDLTITTIEDVVYMGM-----KND 68 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------PLP 116 + + + G + + EHQS ++ R + Y ++ + +IE K + +P Sbjct: 69 LSFLI------GDVMNLYEHQSSFSPNLPLRGLFYFSSLYKEYIEPVKHRLYTASPLHIP 122 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFP--LVDVTVVPDDEIVQHRRV 174 ++FY+G++ L + + ++++ + + E+++ R Sbjct: 123 FPKYVVFYNGTKKEPERQELKLSDLFLENKEETTPSLECTAVVLNINLGKNRELMEKCRP 182 Query: 175 -----ALLELIQKHIRQR-DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 + +I+K++ ++ D +++ V +L IL + Sbjct: 183 LKEYAEFISIIRKYLSEQMDFGNAVNKAVDF--------CIHNGILADILQKNRSEVVDM 234 Query: 229 FISELTRR---MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT---- 281 ++E + + + N+G KG + + ++ E + + Sbjct: 235 ILTEYDEEEFRRAWREDLLNEGFRKGLNNGLSKGIKGTIHACMKFNVPKEDVMQNLMEEF 294 Query: 282 GLSAEQMQALRQP 294 LS E+ + + Sbjct: 295 SLSQEEAEKYLEE 307 >UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteria RepID=B1WSK8_CYAA5 Length = 260 Score = 92.5 bits (228), Expect = 2e-17, Method: Composition-based stats. 
Identities = 43/278 (15%), Positives = 100/278 (35%), Gaps = 22/278 (7%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 D F+ +D + ++L + L + +D L +++ + I + I Sbjct: 5 DNVCKFLAERFSRDFANWLLNEPIELTELKPTELSLNPIRADSLIFLQSDD----IVLHI 60 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF 141 E Q+ D + FR+ Y + V +R+ + Q ++ Y P L + + F Sbjct: 61 EFQTSPDEDIPFRMTDYRLRVYRRYPNKEMYQ-------VVIY---LKPSNSELVYQNTF 110 Query: 142 ADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLV 201 F ++ + D + + + ++ R+ + Q+ ++ Sbjct: 111 ELTNLRH-----QFNVIRLWEENTDSFLNNSGLLPFAVLTCTDNPRE---TLTQIAAIID 162 Query: 202 TECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQ 261 + Q + +L+G + + L + + I G + + Sbjct: 163 SMPNQQRQSDISASTAILSGLKLDQDSIKRILRSDIMKESVIYQEIFHEGEVKGQKQAIK 222 Query: 262 RILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERE 299 I +L+N + E I ++TGL+ ++++ L L E Sbjct: 223 NIALNMLRNHMNLEVISQLTGLNLQEIEQLNLSLNTEE 260 >UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q938_9SPIR Length = 326 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. 
Identities = 37/294 (12%), Positives = 88/294 (29%), Gaps = 12/294 (4%) Query: 2 TNFTTSTPHDALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 T + +D + +H + A +F+ + +++ + + E Sbjct: 41 TINNLNRINDYFVRYLFSHDGNENIALNFINAVFKD--LNFETFNKIEILNPFNISENYD 98 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPL 117 S + T G I V+IE QSR + R + Y + L Sbjct: 99 EKESIVDIKATTETG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDGLKP 155 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 + + + + + +++ +I + L Sbjct: 156 TVSINITNFILTDEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFNLKDI---SAIESL 212 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 + I K + + +L+ + L ++ + ++ + + Sbjct: 213 DNIHKEFISWIKFFKGEDMSILMKENTIFEEVEKKCLTFVNDSPVIDKYKKREVDTYFFN 272 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 I E +G + + + + + D I KITGLS ++++ L Sbjct: 273 KSMELDIKKAKEEGIKEGIKENQILTAKNMKKENIDINIISKITGLSIQEIENL 326 >UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RQ02_FIBSS Length = 360 Score = 91.4 bits (225), Expect = 4e-17, Method: Composition-based stats. 
Identities = 56/308 (18%), Positives = 113/308 (36%), Gaps = 18/308 (5%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFME-----IHLPKDLRELCDLDSLKLESASFVDEKL 57 N T HDA F+ AR +E H +LD+L S+ E Sbjct: 5 NKVTKRKHDAYFRWLFADTTHARCLLELAGKINHEIDAFLTQINLDTLMRIPDSY-SEVD 63 Query: 58 RALHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLP 116 +D+ + V G + +++EH+S D + ++ +Y +VM+ ++ +P Sbjct: 64 DTGEADLAFRVNVSTGAPILVGILLEHKSGRDPIIFDQISKYIHSVMKIQDKNRIFSGIP 123 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIV--QHRRV 174 + ++FY+G + P L L++ K+ V++ +PD + + ++ Sbjct: 124 -TMAIIFYNGRDNWNP--LKILEKSYPDYFRGKVLPFQCTFVNMADIPDSDCLACENTAT 180 Query: 175 ALLELIQKHIRQRD-LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 + + KH +D L+ L+ Q L N++ I L + + Sbjct: 181 GMGIIALKHAFNKDKLLELLPQFCKFLDKMPRNEASCLLEKTSIYLMEYLGKDFLKELNM 240 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 + ++I + + Q++ LQ + + I + QM RQ Sbjct: 241 AFVSIGQKYGFVSIGDYFRQQ-LAEERQQMTEERLQMAEERQQITE----ERLQMAEERQ 295 Query: 294 PLPERERY 301 + E Sbjct: 296 QITEERLQ 303 >UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=C3R531_9BACE Length = 325 Score = 91.4 bits (225), Expect = 4e-17, Method: Composition-based stats. 
Identities = 42/321 (13%), Positives = 89/321 (27%), Gaps = 44/321 (13%) Query: 8 TPH-DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 P+ D FK + + F+ ++ + + + K + Sbjct: 12 NPYTDFAFKLLFGTDLNKEILIGFLNALFDGKQV----IEDVTYLNTEHLGSKETDRRA- 66 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK--RQPLPLVIPM 121 ++ V G ++IE Q E R + Y+ ++ + L V + Sbjct: 67 -VFDVYCENEKGEK-ILIEMQRGEQQFFKDRSIYYATYPIREQAIKGEIWDYELKAVYVI 124 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYN--AAFPLVDVTVVPDDEIVQHRRVALLEL 179 + + S + TT +++ F +++ E Sbjct: 125 GILNFALDDVSSSSFRHEVKLMDTTTHEVFFDKLTFVYLEMPKFHKTEQELDTLFDKWMF 184 Query: 180 IQKHI----------RQRDLMGLID--QLVVLLVTECANDSQITALLNYILLTGDEARFN 227 + K++ ++R L + ++ + + D A Sbjct: 185 VLKNLARLMERPTALQERVFNRLFEAAEIAQFSKENLYAYEESLKVYRDWNNVIDTAIQK 244 Query: 228 EFISELTRRM---------PQHRERIMTIAERIHNDGYIKGEQR--------ILRLLLQN 270 + + E I+ E G KG I L Sbjct: 245 GIARGMEEGLVKGMEEGIAKGMEEGIVKGMEEGIAKGMEKGIAEGEWMKAQTIAGNLKNA 304 Query: 271 GADPEWIQKITGLSAEQMQAL 291 G I K+TGLS +++ +L Sbjct: 305 GLSIAEIAKVTGLSEDEINSL 325 >UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0J0_9FIRM Length = 316 Score = 91.0 bits (224), Expect = 4e-17, Method: Composition-based stats. 
Identities = 50/318 (15%), Positives = 94/318 (29%), Gaps = 50/318 (15%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL--------HS 62 DAL K +L++ + D +L D ++ ++L S + L Sbjct: 5 DALTKEYLSNNEIFADVFN-YLIYDGQQRILPENLIERDTSEITLPLGKRGELATIQKFR 63 Query: 63 DILWSVKTREGDGYIYVV--IEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------- 113 DIL +E +YV+ +E+QS M R M Y + ++ Sbjct: 64 DILKGCIAKEYKNTLYVLFGVENQSHIHYAMPVRNMLYDAINYSAQVNEKTKKYRKIRKQ 123 Query: 114 -------------------PLPLVIPMLFYHGSRSPYPW-SLCWLDEFADPTTARKLYNA 153 L VI + Y G+ SL + D + L + Sbjct: 124 NPNFKETTEEFLSGWHPDDRLVPVITVTIYFGNDGWDAAKSLQEMFSETDESLKEFLPDY 183 Query: 154 AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 L+ + + H L I K I + M ++ +D +AL Sbjct: 184 KLHLISCNNISNFT-KFHTEFGRLMHILKVISDEEQMDIL-----------LSDPGYSAL 231 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 + F R E +G+ + + + + G Sbjct: 232 SVTAAQIINTFTGLHFSIPEKEDTINMRNAWTDHKESGRREGFNEATTSYTQRMYKAGIP 291 Query: 274 PEWIQKITGLSAEQMQAL 291 E I ++ +++ + Sbjct: 292 LEVIAEVIEKPVTEVEKI 309 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 91.0 bits (224), Expect = 4e-17, Method: Composition-based stats. 
Identities = 51/281 (18%), Positives = 107/281 (38%), Gaps = 24/281 (8%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 R+ M LP ++ + L+ +E + + + +D+L V+ +G+ Y+ + + Sbjct: 16 KIFRENMHNTLPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYV-LHV 74 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF 141 E+Q+ MAFR+ YS+ + ++H V + Y G + +F Sbjct: 75 EYQTDNYPEMAFRMAEYSIMLQRKHKL--------PVKQFVIYIGPAKANMATSITTKDF 126 Query: 142 ADPTTARKLYNAAFPLVDVTVVPDDEIV-----------QHRRVALLELIQKHIRQRDLM 190 +L + L + + +++++ + +++ I+ H + Sbjct: 127 RFRYNLTELSAVNYKLFLKSDLVEEKMLAILSNLASESTESVLAQVVQEIETHTSTLEQG 186 Query: 191 GLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAER 250 QL +LL N + + L G + + I + + + + Sbjct: 187 RYFRQLRILLQLRNLN----KKAIKDMALVGKIFKEEKDILYRRGEIKGEIKGEIKGEIK 242 Query: 251 IHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 G + I L + G E+I KIT LS E++QAL Sbjct: 243 GIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEIQAL 283 >UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369BC Length = 310 Score = 91.0 bits (224), Expect = 4e-17, Method: Composition-based stats. 
Identities = 37/292 (12%), Positives = 72/292 (24%), Gaps = 46/292 (15%) Query: 11 DALFKTFLTHPDTARDFMEI-------HLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D K L P D L +L +S + ++ D Sbjct: 5 DFYIKKLLQDPARFADLYNAEIFHGKQILKAELLSPVSTESGIAITNRSGRKQTIQRRRD 64 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-------------- 109 I G +I E Q M R + Y + Sbjct: 65 IAMKASI--GACFIVAGCEAQGEIHYGMPIRSLTYDALDYTEQLTEIQKEHRKKKDLAKS 122 Query: 110 -------DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFA-------DPTTARKLYNAAF 155 +R L V+ ++ Y G + P+ D P L + Sbjct: 123 PEFLSGITRRDKLQPVLTLVLYCG-KDPWDGPKSLYDMLDLRGPTECIPDLLAALPDYRI 181 Query: 156 PLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLN 215 LVD+ + + + + + +++ + I + N L Sbjct: 182 NLVDIRKIENLSLYKTGLQQVFGMLKYSTDKSKFYNYITSNHDQISMLDDNALTAVMGL- 240 Query: 216 YILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLL 267 + L + + + + DG ++G++ R Sbjct: 241 -------LGENRRLMKYLAAPGREEGYTMCQAIDDLIADGKLEGKREGKRRG 285 >UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAJ2_9SPIR Length = 312 Score = 91.0 bits (224), Expect = 4e-17, Method: Composition-based stats. 
Identities = 38/308 (12%), Positives = 96/308 (31%), Gaps = 31/308 (10%) Query: 11 DALFKTFLTHPD---TARDFMEI-HLPKDLRELCDLDSLKLESASFVDEKLRALHSD--- 63 D + + D DF+ L +++ ++ L + + + D Sbjct: 9 DYFVRYLFSSKDSNFILLDFINSTMLDANMKTFRSVEILTPSPKAGSRLNYKENYDDKES 68 Query: 64 ---------------ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE 108 + T+ G V+IE Q + + R++ Y + + ++ Sbjct: 69 IAPKVARKVDRCRRRLDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLK 125 Query: 109 HD-KRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDE 167 K L VI + + + + + + ++++ D+ Sbjct: 126 QGEKYDALTPVISINLLNFNLDNNDCIHSCYMIYDTKSKRLLTDHLQIHIIEIKKFKDNL 185 Query: 168 IVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFN 227 + + L K R+++ + + + + E + + +++ + R Sbjct: 186 LDKDLDCWLKFFTIKEKDNREVI-MSELVKEKPIMEEVQKRYNNFIKDRLMMNEYDKREA 244 Query: 228 EFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGL 283 + R I ++ G KG + + + D I ITGL Sbjct: 245 YLYGNQIMLEEERRLGIEEGFKKGIEKGIEKGIKENQILTAKNMKNKNIDIALISDITGL 304 Query: 284 SAEQMQAL 291 S ++++ L Sbjct: 305 SIKEIEEL 312 >UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacteria RepID=A8F2U7_RICM5 Length = 281 Score = 90.6 bits (223), Expect = 6e-17, Method: Composition-based stats. 
Identities = 35/287 (12%), Positives = 84/287 (29%), Gaps = 22/287 (7%) Query: 11 DALFKTFLTHP-DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D FK + + + DL + E E R+ L+ +K Sbjct: 10 DIAFKKLFSDKVKLINLLNSLLRLSKGDRIIDLSYITTEQLPLFLEGRRS-----LFDLK 64 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPLVIPMLFYHGSR 128 ++ G Y+ IE Q + + R Y I+ K + L V+ + Sbjct: 65 VKDETGRWYI-IEMQRKMEKDYLNRTQLYGCYTYVSQIKKGMKHKDLLPVVIISIIRAKA 123 Query: 129 SPYPWSLCWLDEFADPTTAR-KLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 P + + L++ + +++ +++ + K+ Q Sbjct: 124 LPDELPYISYHHIKESNIHKQYLFSLTYVFIELGKFKKNDLKDDT--DEWLYLLKYASQE 181 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 + + + + + + ++ + Q ++ Sbjct: 182 -----------QEPPKEIKNEIVLSAYASLEQYKWTEQEHDDYFRAEMAIQQEIDKFEEK 230 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 G K + + +L E I + T L+ E+++ L+ Sbjct: 231 FNAGMEKGIEKEKIETAKEMLIENGPIEQIARYTKLTIEEIKKLKAE 277 >UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QZQ8_BRAHW Length = 309 Score = 90.6 bits (223), Expect = 6e-17, Method: Composition-based stats. 
Identities = 37/302 (12%), Positives = 93/302 (30%), Gaps = 20/302 (6%) Query: 2 TNFTTSTPHDALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 T + +D + +H + A +F+ +++ + + E Sbjct: 16 TIENLNRINDYFIRYLFSHTGNENIALNFINAVFKD--LNFETFQKIEILNPFNIAENYD 73 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPL 117 S + T G I V+IE QSR + R + Y + L Sbjct: 74 EKESIVDIKATTESG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDELKP 130 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTV-----VPDDEIVQHR 172 + + + + + V++ + + E + + Sbjct: 131 TVSINITNFILTDEDKVHSCYILKELNNNKILTDHCQLHFVELPKSNLKNISEIESLDNT 190 Query: 173 RVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISE 232 + + K + D+ L+ + + E + + ++ + + + Sbjct: 191 HKEFISWV-KFFKGEDMSILMKE--NTIFEEVERKCRTFVNDSPVMDKYKKREVDTYFLN 247 Query: 233 LTRRMPQH---RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 + + E I + +G + + + + + ++ D I K TGLS E+++ Sbjct: 248 KSMELDIRKAKEEGIKEGIKEGIKEGIKENQISMAKNMKKDKVDFNIISKYTGLSIEEIK 307 Query: 290 AL 291 L Sbjct: 308 KL 309 >UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FYK3_ABIDE Length = 365 Score = 90.6 bits (223), Expect = 6e-17, Method: Composition-based stats. 
Identities = 40/325 (12%), Positives = 92/325 (28%), Gaps = 42/325 (12%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLES-ASFVDEKLRALHSDILWS 67 D L K D DF+ + + + + + L + + + R + D Sbjct: 3 EKDILEKKLFMFNDVFADFLNGIIFNGRQIVEESELFDLSGWSHYKADDSRHRYQDRDVV 62 Query: 68 VKTREGDGYI-YVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLP---------- 116 ++ + I + IE+Q D M FR++ Y A + + Sbjct: 63 KLWKKKNVVISLIGIENQDVPDKDMVFRVLSYDGASYKTQLAKKDEDKRKHLKDKKNTEI 122 Query: 117 ------------LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVP 164 VI + Y+G + + + L+D+ Sbjct: 123 VEIGKEDEKDIFPVITFVVYYGEEEWKYETTLKKRLKIGDGLDEFVSDYKINLIDLKKFT 182 Query: 165 DDEIVQHRRVALLEL------------IQKHIRQRDLMGLIDQLVVLLVTECANDSQITA 212 +D+I + ++ L + + ++ L+ +L + + Sbjct: 183 EDDINKFKKDFKLLVNYMVKGSNHDAGSIELNHPEEVSELVLRLTGEELPIPRENDGGKT 242 Query: 213 LLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 + + + + E + G +G+ + + L G Sbjct: 243 MEKFFEPMFARMAEKAEARGMAK---GMTEGMAKGMTEGMAKGLAEGKAKGMTEGLAKGM 299 Query: 273 DPEWIQKITGLSAEQMQALRQPLPE 297 GL+ + + L + L E Sbjct: 300 TEG---MAKGLAEGKARGLAEGLVE 321 >UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostridium RepID=C0CTJ7_9CLOT Length = 327 Score = 90.6 bits (223), Expect = 7e-17, Method: Composition-based stats. 
Identities = 44/328 (13%), Positives = 93/328 (28%), Gaps = 45/328 (13%) Query: 11 DALFKTFLTHPDTARDFME--IHLPKDLRELCDLDSLKLESASFVDEKLR----ALHSDI 64 D + + + D + + D+ L R + D Sbjct: 5 DMVLNRYFEDGERYADLINGYAFNGDQVVRKEDVQELDPRETGVAGRLGRRPGVQKYRDS 64 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH--------------- 109 + V G ++ + +EHQ + M R M A R + Sbjct: 65 IRRVVL--GARFVLIGLEHQDQVHYAMPVRAMLQDAAEYDRQLRRIRRVNRRVGGLTGAE 122 Query: 110 -----DKRQPLPLVIPMLFYHGSRSPYPW--SLCWLDEFADPTTARKLYN-AAFPLVDVT 161 ++ + VI ++ Y+G + +D P +L N +++V Sbjct: 123 FLGGFTRKDRVCPVITLVLYYGKKPWDGAMDLHGLMDCAGYPEPMLRLVNNYRLHVLEVR 182 Query: 162 VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG 221 + + + IQ+ + ++ V D + ++ I + Sbjct: 183 RFVNIRRFRTDLYQVFGFIQRSGDKEAERRFTEENR---VYFEGMDEEAFDVITAITGSR 239 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAE---RIHNDGYIKGEQRI--------LRLLLQN 270 + R E E R+ I + +G I+G+ R + Sbjct: 240 ELERVKEQYREEGGRINMCEAIRGMIEDGRIEGRLEGKIEGKYEGALEKTRTVARNMYLR 299 Query: 271 GADPEWIQKITGLSAEQMQALRQPLPER 298 G E I + Q++ + +R Sbjct: 300 GMSAEDAAAICEMDTAQIEVWFREWGKR 327 >UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Ruminococcus torques ATCC 27756 RepID=A5KR99_9FIRM Length = 317 Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats. 
Identities = 42/315 (13%), Positives = 95/315 (30%), Gaps = 30/315 (9%) Query: 1 MTNFTTSTPHDALFKTFL----THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEK 56 M +++F + + L E + ++++ +++ Sbjct: 8 MAGKENREIKNSVFVDLFYEDESAEANEIALFNAIHDEPLPEGTKIRRFRVDNTIYMN-- 65 Query: 57 LRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-----EHDK 111 +DI + G + V EHQS + +M R + Y +R + K Sbjct: 66 ---FQNDISFDA-----GGKVIVFGEHQSTINENMPLRSLLYIGRAYERLVPPRSRYKKK 117 Query: 112 RQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH 171 PLP FY+G L + ++++ EI++ Sbjct: 118 IVPLPTPEFYTFYNGKEKWEKEKELRLSDAYIVKDGEPSLELKVKVINIRPEEHHEILEK 177 Query: 172 RRV-----ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITA---LLNYILLTGDE 223 +V +E++Q + + + + D + ++N +L D Sbjct: 178 CQVLKEYSQFMEIVQNYQISGEEEPYKKAIKECIEKGILADYLMRKGSEVVNMLLDEYDY 237 Query: 224 ARFNEFISELTRRM---PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKI 280 E E R ++ ++ +G +++ L+ G I Sbjct: 238 ETDIEVQREEAREQGREEGRKQGREEGRKQGREEGRKAERSTLIQKKLEKGKTISQIADE 297 Query: 281 TGLSAEQMQALRQPL 295 + E + L + Sbjct: 298 LEDTEENIACLIEQF 312 >UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV81_PEDHD Length = 318 Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats. 
Identities = 42/297 (14%), Positives = 91/297 (30%), Gaps = 20/297 (6%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK ++ + + L + + ++ F E + + ++ V Sbjct: 28 DFSFKRLFATEES-KPILIGLLNHLFKGRKYITEIEYGKNEFPGEIAQEGGA--VFDVYC 84 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ----PLPLVIPMLFYHG 126 + +G ++ IE Q + R + Y + R+ L V + F Sbjct: 85 TDVNGSKFI-IEVQRGNQEYFKERALFYVSRAISEQAPKGDRKGWAYKLTEVYLLAFLED 143 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAA---FPLVDVTVVPDDEIVQHRRVALLELIQKH 183 P ++ + + F +++ + + KH Sbjct: 144 FNLPDSPKSEYVQDICLANRHTGIIFYDKVGFIFIEMLNFVKGSDELYTELDKWLYALKH 203 Query: 184 IRQRDLMGL------IDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 + + DQL L + + + D + + + + Sbjct: 204 LTEFKQRPEYLSGPEFDQLFTLANYASLTPEERDMYNSSLKRKWDNKNVLD--YAVKKSL 261 Query: 238 PQH-RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQ 293 Q + + E+ G K I +L N E I K+T LS E++Q+L++ Sbjct: 262 EQGLEQGLEQGREQGREQGIHKKAIEIALEMLVNKYPIEEIIKLTKLSKEEIQSLQK 318 >UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3R2_ABIDE Length = 336 Score = 89.5 bits (220), Expect = 1e-16, Method: Composition-based stats. 
Identities = 45/297 (15%), Positives = 97/297 (32%), Gaps = 25/297 (8%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 T D++F + R + + + F++ + Sbjct: 58 TQEIKYAVKDSVFTLLFSDIKNIRKLYQSLHDDSDSYSDEDFKIITLENVFINAP----Y 113 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------P 114 +D+ ++VK + + ++ E QS + +M RL+ Y +I K Sbjct: 114 NDLGFTVKNK-----VIILAEAQSTFNPNMGLRLLIYIAQSYHDYISEYKFNIFSEKLIR 168 Query: 115 LPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV 174 LP ++ Y GS+ + D F T N + + E + + Sbjct: 169 LPNPEFIVIYSGSKKTDITEIRLSDCFESGT----APNIELVVKVIGGNNVKEGIIQEYL 224 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 E+ + +R + ++ +C ++ + L T + + + + Sbjct: 225 KFCEMYDEKVRSVKPSEEKAYSLKKVIKDCIDNGILKDFL-----TLHQKEVEDMMMTVI 279 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + G + R +L+N + I +ITGLS EQ++ L Sbjct: 280 PPEQALEYIKLEEYNKGIEQGKLDTSLNFARNMLKNNYSIDSIIEITGLSREQIKRL 336 >UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methanospirillum hungatei JF-1 RepID=Q2FTW8_METHJ Length = 306 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. 
Identities = 48/310 (15%), Positives = 92/310 (29%), Gaps = 40/310 (12%) Query: 3 NFTTSTPHDALFKTFLTHP---DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 F S +D F+ P D D + LP + + D L + + + Sbjct: 17 EFLMSPRNDFAFRLLFGDPNNSDILLDLLNAILPDHFQSVVCTDPHLL-----IPDTKKE 71 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLP 116 DI V + G +YV IE Q + M R + Y + + Sbjct: 72 CILDI--KVLSDSG---VYVDIEMQVLDLKSMEKRSLFYWAKMYLDQLNRGHSYHELKRT 126 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEI--VQHRRV 174 +VI +L Y + + + +++ V + + Sbjct: 127 IVINILDYMLMPVEDLHTCFQAYDKTHDILMSDV--FEIHFLELPKVHRCRVPYKGTDLL 184 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 + L + + + ++ I N + + + Sbjct: 185 SWLTFLNAYTEEE------------IIMAAEGKPAIQKAYNNLQIMSLDEETRRLYEARE 232 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAE 286 + R+ E +G KG + ++ LL G D E+I+K TGL Sbjct: 233 MFLHDQATRMYEAKEEGLEEGMKKGREEGREEEREGFVKNLLSLGMDDEFIKKATGLDQS 292 Query: 287 QMQALRQPLP 296 + L++ L Sbjct: 293 IIDKLKKSLS 302 >UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptospira interrogans RepID=Q8F560_LEPIN Length = 278 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. 
Identities = 40/285 (14%), Positives = 89/285 (31%), Gaps = 21/285 (7%) Query: 13 LFKTFL-THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTR 71 +FK PD + L D ++K+ + V S + + Sbjct: 2 MFKILFVKEPDLLISILNSVLFTDGEHTI--RNIKILNPELVGSSPNDKRSYLDIRAQDE 59 Query: 72 EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLV--IPMLFYHGSR 128 +G +E Q R + Y +++ + L V I ++ + Sbjct: 60 DGK---IFHVEIQVAHQSSFVKRSLYYLSGLIRDQLNRGSMYSDLKPVYQINIVDFDLIP 116 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQ-HRRVALLELIQKHI--- 184 S S E ++P + +++ ++ + + + + KH Sbjct: 117 SENFHSKFKFREESNPDIIL-TDDVEIHFLELCKFVKRDVRELRNNLEIWLYVLKHTSEL 175 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 + ++ L+D+ L + L ++ +L R Sbjct: 176 EEEEMRILVDKTPDLSKAFTILEQYSNDPQKRNELEAKLKSDRDYAYDLAARFEAGEL-- 233 Query: 245 MTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 + G K + + R +L+ G + I +ITGLS + ++ Sbjct: 234 -----QGIEKGAEKEKLKSARKMLEEGMRLDVILRITGLSKKDLK 273 >UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3131 Length = 247 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. 
Identities = 43/266 (16%), Positives = 86/266 (32%), Gaps = 25/266 (9%) Query: 1 MTNFTTSTPH-DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 M N T + + D +F+ D + D+ LE+A ++ Sbjct: 1 MNNETVNRKYKDTVFRLLFKDKSNLLSLFNAVNDTDFSDENDIKITTLENAIYMT----- 55 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-----DKRQP 114 +DI + + + EHQS + +M +R + Y +R++ + K Sbjct: 56 SKNDISCIIDMKLN------LFEHQSTVNPNMPYRNLEYVTKCFKRYVGNFDVYTGKALT 109 Query: 115 LPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRV 174 LP ++FY+G P + L + ++ + + ++ Sbjct: 110 LPNPKFVVFYNGVNEQPPIRVMRLSDLYAHKDEIPNLELVVIQYNINNLVNCTLMDRCEP 169 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 + +G I + + A DS I + +L + ++ Sbjct: 170 L--------KEYSEFIGCIRSNLKTMDKGEAVDSAIDYCIGNGILKDFLTNNRNEVRSMS 221 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGE 260 E I + + DGY KGE Sbjct: 222 LFEFDAEEHEKAIKQIAYEDGYDKGE 247 >UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7BWQ7_9GAMM Length = 290 Score = 88.7 bits (218), Expect = 2e-16, Method: Composition-based stats. 
Identities = 51/298 (17%), Positives = 99/298 (33%), Gaps = 23/298 (7%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 HD+LFK +T +F + P + S ++ + Sbjct: 3 NPKSHDSLFKWLIT--AFTTEFFGHYFPDIRIGEYTFIDKEFISKYENLKESLKGDLFLG 60 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYH 125 V+ I + IEHQS + ++ R+ YS V ++ Y Sbjct: 61 MEVEIDGLLREIIIQIEHQSERE-DVSERVYEYSCYAW--------LLKKKPVWSIVIYT 111 Query: 126 GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 D ++ ++ V D I +H + L ++ R Sbjct: 112 DEAVWRKPVTEQFWYAFDSQKGKQYHHFDVIKVKAEKSSDL-IQKHSLMCKLLALKADDR 170 Query: 186 QRDLMGLIDQLVVL--LVTECANDSQITALLNYI--LLTGDEARFNEFISELTRRMPQ-- 239 Q D L+ ++ L+ E + Q+ + ++ E R ++ E+ + Sbjct: 171 QTDPEKLVYEIYRAAALMKEQLTNEQLLLIDQWVSFYKKVSEKRLDKIKKEIKMDFIETT 230 Query: 240 -----HRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + + + +G KG ++ LL+ G D E IQK TG S +++ + Sbjct: 231 ISEHVYNQGWIKGEAEGKAEGEAKGRKKTAINLLKMGIDVEIIQKATGFSDAEIKQMS 288 >UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255_PLEBO Length = 295 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. 
Identities = 53/302 (17%), Positives = 106/302 (35%), Gaps = 33/302 (10%) Query: 1 MTNFTTSTP-HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV----DE 55 MT ++ +D +KTF+ R+F+ P ++ ++ D Sbjct: 1 MTQQSSENTDYDNPWKTFI--ELYFREFLAFFFPTIEADVDWSKPVRFLDKELQKIVRDA 58 Query: 56 KLRALHSDILWSV-KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQP 114 ++ ++D L V + R + IE QS+E+ R+ Y+ + R+ Sbjct: 59 EIPKRYADKLVEVHRLRGERTLVICHIEVQSQEERDFVARMYSYNYRLRDRYNC------ 112 Query: 115 LPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVV----PDDEIVQ 170 V+ + G P + DE T FP+V ++ + E +Q Sbjct: 113 --PVVSLAIL-GDDRPNWRPSRFYDELWGCATH-----FEFPIVKLSDYQSQWTELEAIQ 164 Query: 171 HRRVALLELIQK----HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARF 226 + + K H + + L +L ++ I L N++ + Sbjct: 165 NPFAVVAMAHLKTKETHNQPLERKRWRYHLTTMLYDRGYSEQDILELHNFLDWLMNLPE- 223 Query: 227 NEFISELTRRMPQHRERI-MTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSA 285 E +L + E M + ++ +Q I +L+ D E I ++TGL+ Sbjct: 224 -ELERQLQAELETFEEARRMKYVSSLERRAKLEEKQAIALNMLRRNLDMELIAEVTGLTI 282 Query: 286 EQ 287 + Sbjct: 283 AE 284 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 32/104 (30%), Positives = 60/104 (57%), Gaps = 3/104 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M PH+ LF + D AR F++ H+ +++++ DLD+L+LE ++VDEKL+ Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 61 HSDILWSVKT---REGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 +SD+++SV+ + IY++ EH+S D ++++Y Sbjct: 61 YSDLVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKYMAL 104 >UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKW0_9CYAN Length = 296 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. 
Identities = 52/308 (16%), Positives = 107/308 (34%), Gaps = 37/308 (12%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 T D K L + + L + LR+ + S+ ++ E + DIL Sbjct: 4 THIRFDWAIKKLLRNKAN-YGVLAGFLSELLRKPITIQSILEGESNQQAEDDKLNRVDIL 62 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK---RQPLPLVIPML 122 E D ++IE Q+ + R++ + ++ +E + + ++ Sbjct: 63 A-----ENDRGELILIEVQNSTEQDYFHRMLYGTSRLITDFLEKGEPYGNVKKVYSVNIV 117 Query: 123 ----------FYHGSRSPYPWSLCWLDEFADPTTARKLYN--------AAFPLVDVTVVP 164 YHG+ L D+ RKL+N + ++ V Sbjct: 118 YFSLGQGDDYIYHGTLEF--RGLHLDDKLGLSINQRKLFNSQDVYEIFPEYYVIKVNNFN 175 Query: 165 DDEIVQHRRVALLELIQKHIRQRDL-MGLIDQLVVLLVTECANDSQITALLNYILLTGDE 223 E+ + ++K + + + + L+ + ++++ L ++ E Sbjct: 176 --EVASDTLDEWIYFLKKSQIKEEFTAQGLAEAKENLLVDSLSEAERANYLRFM-----E 228 Query: 224 ARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGL 283 R E + R E + + G + + I RLL Q G D + I TGL Sbjct: 229 NRRYEISLIESSRSEGRLEGLEEGLKEGMEQGKQQEKVNIARLLKQQGTDLDTITAATGL 288 Query: 284 SAEQMQAL 291 + E+++ L Sbjct: 289 TREEIEEL 296 >UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=cellular organisms RepID=A5CBY6_ORITB Length = 324 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. 
Identities = 44/321 (13%), Positives = 87/321 (27%), Gaps = 37/321 (11%) Query: 8 TPHDALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDI 64 +D FK + D F+ L + ++ + S + Sbjct: 9 PKNDVAFKKIFGSEKNKDILIHFLNDILLFEGNREI--TEVEFLGTILDADIASKKESIV 66 Query: 65 LWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE---HDKRQPLPLVIPM 121 K + G YI IE Q R Y+ R K L VI + Sbjct: 67 DVLCKDKNGAQYI---IEMQVDPTQGFEKRAQYYAAKAYGRQPNRGKEGKYSDLKEVIFI 123 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTT-ARKLYNAAFPLVDVTVVPDDEIVQHRRV-ALLEL 179 P D T L + +F +++ + + + + Sbjct: 124 AIADYKLFPNKEDYISRHVILDKKTYEHDLKDFSFTFIELPKFKKNRVEELSDITEKWCY 183 Query: 180 IQKHIRQRDLMGL---------IDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 KH ++ L G I + L ++ ++ + D ++ Sbjct: 184 FFKHAKETTLDGYHKIIGEDLIIKRAYEALDQFNWSEDELITYEQELKRIWDNKAVEDYK 243 Query: 231 SELTR---------------RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPE 275 E + E +G + ++ LL++ E Sbjct: 244 LERAKAEGIKLGEAKGIKLGEAKGKAEGKAEGKAEGKAEGKAEAKKDFAIKLLKSELSVE 303 Query: 276 WIQKITGLSAEQMQALRQPLP 296 I + T LS +++ L+ + Sbjct: 304 TIAEYTDLSIQEVLNLKNSVK 324 >UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C371D2 Length = 317 Score = 87.9 bits (216), Expect = 3e-16, Method: Composition-based stats. 
Identities = 43/314 (13%), Positives = 96/314 (30%), Gaps = 39/314 (12%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR--------ALH 61 DA+ K ++ + D L R++ + LK + + + Sbjct: 4 KDAVTKDYMQDSEHFADAFN-FLLYGGRQVIKPEQLKPLDTTSIALPYGDESRFVPIQKY 62 Query: 62 SDILWSVKTREGDG--YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH---------- 109 D+L V E + Y+ + IE+QS M R M Y + Sbjct: 63 RDVLKMVTAMEDENATYLILGIENQSDIHYAMPIRNMLYDAIQYVNQADTIAKEHRKSKK 122 Query: 110 ------------DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPL 157 K + +I + Y G+ A+ + + N L Sbjct: 123 MPETRAEYLSGFYKTDRILPIITLTLYFGADEWDAPRDLHSMLTANEDILKFVDNYHLHL 182 Query: 158 VDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 + + D++ + L L K+++ + +V + + ++N + Sbjct: 183 IAPAEIEDEDFAKFH--TELSLALKYVKYSKDKKKLRDIVNEDTAFRSVSRKTADMVNVV 240 Query: 218 LLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA-DPEW 276 + E ++ + + R+ +G +G R L L+++G Sbjct: 241 TSSNLHYNDGEERVDMCEAIEEIRK---DALAEGKAEGIEEGIIRTLIGLVKDGILTIAD 297 Query: 277 IQKITGLSAEQMQA 290 K ++ + + Sbjct: 298 AAKRADMTVPEFEE 311 >UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YMI0_ANASP Length = 314 Score = 87.9 bits (216), Expect = 4e-16, Method: Composition-based stats. 
Identities = 54/326 (16%), Positives = 115/326 (35%), Gaps = 33/326 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV----DEK 56 MT+ D+ +K L ++ P+ + + + F + + Sbjct: 1 MTDNNERADFDSPWKEIL--EAYFPQAVQFFFPETAALINWERPYEFLNTEFQQIAREAE 58 Query: 57 LRALHSDILWSV-KTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL 115 ++D L V + + + ++ + +E Q++++ + R+ Y+ + R Sbjct: 59 QGKPYADQLVKVWQIQGEEIWLLIHVEIQAQKEDDFSKRMFTYNFRIFDRF-------EK 111 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVD-VTVVPDDEIVQHRRV 174 P + + +R P + + + L+D + E + Sbjct: 112 PAISLAILCDTNRQWRPSNYSYNYPQTRLNFEFGIV----KLLDYENRFDELENNTNPFA 167 Query: 175 ALLELIQKHIRQR----DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE-- 228 ++ K + R + L+ L + I L +I + E Sbjct: 168 TVVMAHLKTQQTRSSPQERKIWKFSLIRRLYDLGLQEQDIRNLYRFIDWVMILPKALENQ 227 Query: 229 FISELTRRMPQHRERIMTIAER-IHNDGYIKGEQRILRLLLQNG---ADPEWIQKITGLS 284 SE+ + + R +T AER + G +GE I+ LL+ PE Q+I LS Sbjct: 228 LCSEVQQLEQERTMRYVTSAERIGYERGIQEGELGIILKLLKRRLGELSPEIQQRIQSLS 287 Query: 285 AEQMQALRQPLPE----RERYSWLKS 306 Q++ L + L + + +WL+S Sbjct: 288 VNQLENLSEALLDFSNLTDLVNWLQS 313 >UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A7T9_9CLOT Length = 271 Score = 87.5 bits (215), Expect = 5e-16, Method: Composition-based stats. 
Identities = 41/285 (14%), Positives = 96/285 (33%), Gaps = 26/285 (9%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +FK + + L L+ + S+++++ + S + KT Sbjct: 10 DFVFKNIFGSEKNPK-ILISFLNATLKPKDLITSVEIKNTDINKNYIEDKFSRLDVKAKT 68 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLPLVIPMLFYHGS 127 + + IE Q + + +M R + Y + + + + I +L + Sbjct: 69 SNDE---IINIEIQLKNEYNMIKRSLYYWSKLYSEQLGEGQDYSVLKRTICINILNFKYL 125 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 ++ S L E + A +++ + D + V +E ++ Sbjct: 126 KTRKFHSGYRLKEIYSNEELTNV--AEIHFIEIPKLDDGADEKDMLVNWIEFLK------ 177 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 D + + N +I + ++ ++ E + + ++ Sbjct: 178 ------DPESETVRSLEMNIEEIRQAKDELIRMSNDDTQREIYEMRAKTLRDK----ISA 227 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 G +G++ I + LL D E I TGLS +++ L+ Sbjct: 228 LNEAERKGIQQGKREIAKALLDV-LDIETIALKTGLSIDEINKLK 271 >UniRef50_C8PT67 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PT67_9SPIO Length = 285 Score = 87.5 bits (215), Expect = 5e-16, Method: Composition-based stats. 
Identities = 37/283 (13%), Positives = 93/283 (32%), Gaps = 13/283 (4%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F + P+ + + L + + ++E +D+ RA + + V Sbjct: 13 DYMFYRVMEDPEICKMLLNRVLQGKVDTIT-----EIELQKTIDDAGRAKG--VRFDVWA 65 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIPMLFYHGSRS 129 ++ +G IY IE Q+ + +A R+ Y A+ + K + LP + F Sbjct: 66 KDCNGRIY-DIEMQAIDKKDLAKRIRYYQAAIDVSILGKSKPYESLPDTFILFFCTFDYL 124 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDL 189 + + +L + ++ + + + LE + + + Sbjct: 125 EKTLPVYTFKTMCSEDSRIELGDGVTKII-INSKAAEHEKNEKLKVFLEYMNGKVSNDEF 183 Query: 190 MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAE 249 + ++Q + + + + ++ + + + Q + Sbjct: 184 IQRLEQRIKEVKANEELRREYMLVNTIERDARNDGWKAGIAQGIAQGIAQGKSL---GLA 240 Query: 250 RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 G R L G E I + TGL+ ++++ + Sbjct: 241 EGEARGSHHKALETARNLRSMGLSIEKIAQATGLTVQEVETIA 283 >UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax=Proteobacteria RepID=C4ZLA7_THASP Length = 339 Score = 86.8 bits (213), Expect = 9e-16, Method: Composition-based stats. 
Identities = 53/334 (15%), Positives = 96/334 (28%), Gaps = 52/334 (15%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV----DEK 56 M +D+ +K + H +F++ + P R++ + D Sbjct: 1 MPASAAQDDYDSPWKEAVEH--AFPEFIDFYFPDAGRQIDWARGHRFLDKELQQIVRDAA 58 Query: 57 LRALHSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL 115 L H D L SV T G+ ++ V IE Q D A R+ Y+ + + Sbjct: 59 LGRRHVDKLASVTTHAGEEDWLCVHIEVQGSMDPDFARRMFVYNYRIYDSY-------DR 111 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDEFADPTTA--RKLYNAAFPLVDVTVVPDDEIVQHRR 173 P+ + + P D F L LVD + ++ + Sbjct: 112 PVASLAVLADDDPAWRP------DRFGYERLGCRHNLQFPVAKLVD-HAADEAALLCNPN 164 Query: 174 VALLELIQKHIRQRDLMGLIDQL--VVLLVTECANDSQITALLNYILLTGDEAR--FNEF 229 L +R I + LV + D EF Sbjct: 165 PFALVTAAHLYTRRTRRSPIARFDAKRRLVRLLYERDWTRQRILDFFSVLDWMMRLPREF 224 Query: 230 ISELTRRMPQ-------------HRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEW 276 L + + R I ++ G G ++ + ++ G + Sbjct: 225 EQRLWQDIENIEGERKVKYVTSVERLAIERGLQKGMEQGLEIGIEKGIEQGIEKGIEKGR 284 Query: 277 IQ------------KITGLSAEQMQALRQPLPER 298 Q + LS + ++ L Q PE+ Sbjct: 285 AQGSASVLLRLLNRRFGPLSPDIIRRLSQSTPEQ 318 >UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2D99 Length = 308 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. 
Identities = 35/269 (13%), Positives = 91/269 (33%), Gaps = 21/269 (7%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH--SDI 64 T HD FK + D R ++ P + + + D + + +L D+ Sbjct: 1 PTSHDQNFKNLI--LDYPRQALQFFAPDEAKNIDDSAVITPIRQEQLKNRLGDRFYELDV 58 Query: 65 LWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 V+ +G + ++E ++ RL+ Y + + + V+P++ Sbjct: 59 PLKVEWPDGRHAAMLFLLEEETDPARFSIHRLVSYCANLAELMGTNR-------VVPIVI 111 Query: 124 YHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKH 183 + S + + + + +P ++ + + Sbjct: 112 F------LRSSPDIRRDLHLGVDGVNFLSFHYIACVLPDIPAEQYKDSTNIVARIALPTM 165 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR-RMPQHRE 242 R+ +ID + L ++ + Y+ ++ + +L + R PQ + Sbjct: 166 HYARE--QVIDVMAWALRGLDTLEANGDKRIKYLDFIDTYSQLEDNERQLFKQRYPQEEK 223 Query: 243 RIMTIAERIHNDGYIKGEQRILRLLLQNG 271 + +I +R + G +G + ++ + G Sbjct: 224 TVTSIVQRAIHQGIHQGIHQGIQEGMLMG 252 >UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B1D1_RUMGN Length = 323 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. 
Identities = 43/298 (14%), Positives = 94/298 (31%), Gaps = 23/298 (7%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 F D F+ + + R F+ L E+ D ++L + + Sbjct: 44 QKFIMLPSVDFCFQELMEDEEVRRGFIGAFLRIPPEEILD---MELLPKKLRKKYKEEKY 100 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIP 120 + V+ REG+ + IE QS + R + Y + I + L I Sbjct: 101 GILDVRVRLREGEQ---LNIEMQSIAYDYWQERSLFYLGKMYVDQIHEGEDYDKLKKCIH 157 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 + + + + D ++++ + E Q + + Sbjct: 158 VGILDFTLFEHERYYSCFHIWEDTIRDMYSDKFEIHVLELPKLAKYEYPQTELLRWAQFF 217 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 R R+ + ++ + D I + + + E + + H Sbjct: 218 --GARSREEIEVLAE----------KDEYIHKAYDKLEEISADEEKRLEYEERQKAIRDH 265 Query: 241 RERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 R + + +G +G+ + R +L++ E I + +GLS E + L + Sbjct: 266 RHMLASGRREGLREGLREGKHEHAVEMARKMLEDKLPIEKIAEYSGLSPEDVHRLEEQ 323 >UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G418_9FIRM Length = 312 Score = 85.6 bits (210), Expect = 2e-15, Method: Composition-based stats. 
Identities = 42/258 (16%), Positives = 83/258 (32%), Gaps = 24/258 (9%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDIL 65 D +F+ L P A + +L LE+A ++ +D+ Sbjct: 37 NREYKDRVFRMLLKEPKVALEVYNAMNGTLYDNPDELIITTLENAVYLGM-----KNDVS 91 Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-----IEHDKRQPLPLVIP 120 + + T+ V+ EHQS + +M R + Y V + + K +P Sbjct: 92 FILGTQ------LVLYEHQSTPNPNMPLRNLAYVACVYMAYVFGDNLYGRKLIKIPEPRF 145 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 ++FY+G+ S+ L + + + V++ ++E+V+ L Sbjct: 146 VVFYNGTDKMPEQSVLRLSDAYESKSEELDLELKIRFVNINPGYNEEMVEKSP----TLY 201 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 Q + ++ E A D I IL E + + Sbjct: 202 QYVKFVDIVRKYQKEMPFPEAVEKAIDECIKKG---ILAEFLRKNRAEVLRVSIFEYDE- 257 Query: 241 RERIMTIAERIHNDGYIK 258 E + E +G + Sbjct: 258 EEHMRQEREESRQEGIEQ 275 >UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZGR2_EUBR3 Length = 370 Score = 85.6 bits (210), Expect = 2e-15, Method: Composition-based stats. 
Identities = 41/297 (13%), Positives = 86/297 (28%), Gaps = 28/297 (9%) Query: 13 LFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTRE 72 +F + + A D++ + ++ +D + V + Sbjct: 78 VFSMLMQDKERALQLYNAMNGSSYDNPEDVEMVI-----HDGGISLSVRNDASFIV---D 129 Query: 73 GDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR-------------QPLPLVI 119 IY EHQS +M R + Y ++ + K+ +P Sbjct: 130 ARLSIY---EHQSTVCPNMPVRSLIYFSVILSDMLSDKKKGTKSGKNIYGRRLVKIPTPH 186 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHR-RVALLE 178 ++FY+G L + + T + ++ + I++ + Sbjct: 187 FVVFYNGEEEQPEVQELKLSDAFEKPTDEPNLELKCKVYNINDGKNKAIMESCGWLNDYM 246 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 +R+ G D L + + + +L L T Sbjct: 247 TFVNKVREYHADGAFDDLAIDIEK-AIDYCIDNDILKEFLKTYRSEVTKSMQLNY-EFDR 304 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGLSAEQMQALRQP 294 Q E G KG ++L L+ G D + + G+S + + L + Sbjct: 305 QLELERADAIEEGMEIGIEKGANKMLFTLVTKGKLDIDTAAEEAGVSVSEFEKLMRE 361 >UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Treponema vincentii ATCC 35580 RepID=C8PLW8_9SPIO Length = 264 Score = 85.2 bits (209), Expect = 2e-15, Method: Composition-based stats. 
Identities = 47/286 (16%), Positives = 94/286 (32%), Gaps = 39/286 (13%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F + H R F+E+ + ++ L S + + + + + + D+L VK Sbjct: 14 DFMFCKVMEHESLCRPFLEMLFSTQIEKITYLSSQNIITTN---SEAKTVRLDVL--VKD 68 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSP 130 G Y IE Q + ++ R+ Y + ++ ++ Sbjct: 69 DIGTSY---DIEMQVGNEYNIPKRMRYYQAVLDVAFLDKGYSY--------------KAL 111 Query: 131 YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQH---RRVALLELIQKHIRQR 187 + ++ F R +Y + D I+ H +++ L K + Sbjct: 112 NNSVIIFVCLFDPIGNDRAVYTFENI-----CIEDKTILLHDGTKKIILNAKAFKKTDNQ 166 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH-RERIMT 246 +L G + + T I + NE +P + + Sbjct: 167 ELRGFLQYVTTGKATTAYTGR--------IEQMIQTVKQNELARREYHILPAALMDAMDE 218 Query: 247 IAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 R G + + LL G E I + TGLS +++AL+ Sbjct: 219 GEARGLAKGSRQKALETAKNLLHFGLSVENIAQATGLSQAEVEALK 264 >UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LJP2_9FIRM Length = 326 Score = 84.8 bits (208), Expect = 3e-15, Method: Composition-based stats. 
Identities = 40/239 (16%), Positives = 84/239 (35%), Gaps = 17/239 (7%) Query: 66 WSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH------DKRQPLPLVI 119 ++ K DG I V +++Q+ D M R+M K + L VI Sbjct: 78 FNKKIVAPDGEIIVALQNQTTVDFGMPLRVMTEDALEYDVQRRMCKDEKLHKGEKLAPVI 137 Query: 120 PMLFYHGSRSPY-PWSLCWLDEFADPTTARKLYNAAFPLVDVTVVP-DDEIVQHRRVALL 177 ++FY+G++ P L + + + K Y + ++ +T D + Sbjct: 138 TIVFYYGAQIWSGPTDLADMVKIPEEFKWLKKYIRPYAMLLITPENVDAAWFSGGWREVF 197 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQIT----ALLNYILLTGDEARFNEFISEL 233 E++Q+ ++++ + + + + +++ L+Y + Sbjct: 198 EILQRRNDEKEMQRYLQKKRSVYEKLPEDTNRLIFALTGHLDYYNALKRKGERAVMCKAF 257 Query: 234 TRRMPQHRERIMTI-AERIHNDGYIKGEQRILRLLLQNGADPEWI----QKITGLSAEQ 287 E I + + G +G ++R + G E I QK LS E+ Sbjct: 258 EDHYKSGVEEGKNIGIHQGISQGLGRGIGAMIRENQEEGKTTESIIDKLQKYFSLSREE 316 >UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BNN6_9LACT Length = 302 Score = 84.8 bits (208), Expect = 4e-15, Method: Composition-based stats. 
Identities = 60/317 (18%), Positives = 110/317 (34%), Gaps = 43/317 (13%) Query: 6 TSTPHDALFKTFL---THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +D LFK + DF+E L+ + + ++E+ E L + Sbjct: 3 IKPTNDLLFKKMMTTAGKEYILEDFIEAVTGMKLKNVRPANPYQIETYQKTIENLNPVMY 62 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML 122 + V DG ++IE Q + R+ Y + + K + +I ++ Sbjct: 63 STIVDVAATTEDGME-IMIEMQLYQHKDFFERIFNYMATAYTQ---NYKAETAKPIISIV 118 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 + + EF + L N A+ + + + + R+ L+ L K Sbjct: 119 VTNFT---------VFPEFQEARIEIGLTNFAYY----QEIRNRKQQPYWRIYLVNLTDK 165 Query: 183 HI---RQRDLMGLIDQLVVLLVTECAND--SQITALLNYILLTGDEARFNEFISELTRRM 237 I RD D L + ++ + ++N+ L G+E R E + + Sbjct: 166 AIVNGESRDFSEWRDFLKNGTIKPKSSRGLKEAQKIVNFSNLAGEERRLAELMEKYEDVY 225 Query: 238 -----PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-------------ADPEWIQK 279 Q E + E G GE+R + + G E IQK Sbjct: 226 YQVMKHQLEEGLEQGIEIGRQQGVALGEKRGMEKGVALGERKGQVMICFKMNLPIEEIQK 285 Query: 280 ITGLSAEQMQALRQPLP 296 TGLS E+++A R+ + Sbjct: 286 HTGLSIEEIEAFRKEME 302 >UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G7H9_ABIDE Length = 305 Score = 84.4 bits (207), Expect = 5e-15, Method: Composition-based stats. 
Identities = 42/313 (13%), Positives = 109/313 (34%), Gaps = 30/313 (9%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLK--LESASFVDEKLRALHSDILWS 67 +D K + D D + L D +E DSL+ ++++ E + + S Sbjct: 4 YDVTEKLLEDYNDVFADIVNTLLF-DGKERVKEDSLEDSKINSAYKAEDGKLHEQERDVS 62 Query: 68 VKTREGDGYIYVV-IEHQSREDIHMAFRLMRYSMAVMQRHIEHDK----RQPLPLVIPML 122 +EG+ + VV IE+Q++ + M R++ Y A + + + L V+ ++ Sbjct: 63 KYWKEGNTNLLVVGIENQTKAEKLMPARIIGYDGASYRSQLLKSTGRLPKNKLTPVVTIV 122 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK 182 Y G + + + ++ +P++++ ++ + L+ K Sbjct: 123 LYFGLTRWNQPKNLKGILDIPTGLEDFVSDYKINVFEIAFLPEEKV--NKFKSDFRLVAK 180 Query: 183 HIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRE 242 + I + L + + A+L ++ + E ++ + + Sbjct: 181 YFTN------IRKNPYYLPADENEIKHVDAVLKFLSIMSGSEDIIEKLT--ANNGSEVKN 232 Query: 243 RIMTIAERIHNDGYIKGEQRILRLLL------------QNGADPEWIQKITGLSAEQMQA 290 +++ G +G + L + G E ++I + + Sbjct: 233 MTGGPLSQLYYKGVSEGREEGLLQGINETLLKVYLNCRSKGMSVEESEEIVHFADRESLD 292 Query: 291 LRQPLPERERYSW 303 + + +R++ Sbjct: 293 MAEEEYQRQKLGK 305 >UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostridium RepID=B1V1L4_CLOPE Length = 300 Score = 83.7 bits (205), Expect = 7e-15, Method: Composition-based stats. 
Identities = 39/298 (13%), Positives = 104/298 (34%), Gaps = 19/298 (6%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +FK ++ +D + L ++ + ++L+S + + + KT Sbjct: 8 DFVFKRLFGAEES-KDSLISLLNAIIKSDNPIKDIELKSPDLEKQHIGDKFCRLDIKAKT 66 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYHGSRS 129 +G+ + +E Q R++ +M R + Y + + + + L + + + Sbjct: 67 DKGE---IINVEIQVRDEYNMVQRTLYYWSKIYSDQLGASENYKNLARTVCINILNFKLL 123 Query: 130 PYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD- 188 T +++ + + + + L + I++ + Sbjct: 124 DNDRYHNTYRLKEITTNEELTDIEEIHFIELPKSKEIKSEEVNNIDSLLKWIEFIKEPES 183 Query: 189 ----LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 ++ L D+ + T+ S + ++A ++E + R +E + Sbjct: 184 ETVRILELTDESIRKAKTQLYKLSLDKKTIEQ-YRIREKAMYDEISALENSREKGLQEGV 242 Query: 245 MTIAERIHNDGYIKGE--------QRILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 + +G +GE ++I + LL G + + I KI L ++ + + Sbjct: 243 KIGRKEGKEEGLKEGEVRGKLKANRKIAKNLLSKGLELKEIAKILELDENLVEEIIKD 300 >UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G1H3_9SPHI Length = 294 Score = 83.3 bits (204), Expect = 9e-15, Method: Composition-based stats. 
Identities = 50/311 (16%), Positives = 100/311 (32%), Gaps = 42/311 (13%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV------DEKLRA 59 D L+K L DF+ P + + Sbjct: 1 MKQKDDYLWKGVLED--VFDDFLRFLYPDADSVFDLSRGITFLDKELEQLFPPEGNEFAP 58 Query: 60 LHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 D L V T +G + ++ + +E Q A R+ Y ++ ++ + Sbjct: 59 KVVDKLAQVYTHDGMEEWVLIHVEVQGTCRKDFASRMFTYYYRILDKYHKR--------- 109 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 I S P + +EF + F + D ++ L Sbjct: 110 ITAFAILTEASKKPRPNVYEEEFMGTSI-----QYRFNTYKIAEQDTDRLLASDNPFALV 164 Query: 179 LI------------QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI--LLTGDEA 224 ++ K + L+ QL L+ + +I L+N++ + D + Sbjct: 165 VLTAKAAFVGKNLNDKDESDKALLQTKIQLARELLERNMSKEKIRGLMNFLRYYVRFDNS 224 Query: 225 RFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR----ILRLLLQNGADPEWIQKI 280 N + ++ + R M I E + N +G++ + R + ++G E I K Sbjct: 225 EVNTIFEQEVEKLTE-RSHTMGIEELLLNRAKKEGKRESLISVAREMKKDGIPVEQIVKF 283 Query: 281 TGLSAEQMQAL 291 T LS ++++ L Sbjct: 284 TKLSIKEIEKL 294 >UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0QWG9_BRAHW Length = 301 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. 
Identities = 36/294 (12%), Positives = 83/294 (28%), Gaps = 12/294 (4%) Query: 2 TNFTTSTPHDALFKTFLTHP---DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 T + +D + +H + A +F+ E +++ + + E Sbjct: 16 TINNLNRINDYFIRYLFSHEGNENIALNFINAVFKDLGFE--TFKKIEILNPFNIAENYD 73 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPL 117 S + T G I V+IE Q+R + R + Y + L Sbjct: 74 EKESIVDIKAITESG---ITVLIEIQARGNEDFIKRALYYWAYNYSSSLNRGSFYDELKP 130 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALL 177 + + + + + +++ + + L Sbjct: 131 TVSINITNFILTNEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFN---LKNISAIESL 187 Query: 178 ELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRM 237 + I K + + +L+ + ++ T ++ + + Sbjct: 188 DNIHKEFISWVKFFKGEDMSILMKENTIFEEVEKKCRTFVNNTPVMDKYKKREVDAYFFD 247 Query: 238 PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + E G I + G D + I + TGLS E+++ L Sbjct: 248 KSIELDLKKAKEEGIEQGEKNKAISIAKSFKNAGIDIKIISENTGLSIEEVEKL 301 >UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2B5_PEDHD Length = 310 Score = 82.1 bits (201), Expect = 2e-14, Method: Composition-based stats. 
Identities = 40/291 (13%), Positives = 86/291 (29%), Gaps = 24/291 (8%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK + +D L L+ ++ SL+ + E I K Sbjct: 34 DLGFKRLFSAEQN-KDITITFLNHVLKGKREVVSLEFLKNEYPGETQEEGGVIIDIVCKD 92 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ----PLPLVIPMLFYH- 125 + G + ++E Q + + R + Y+ ++ H R+ L V + Sbjct: 93 QIGA---FFLVEMQKSWNQNFKERSLFYASRLITEQAPHGNRKEWAYSLKDVYVIALLEK 149 Query: 126 -----GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 G++ + + ++ +L F +++ E + Sbjct: 150 FTINAGNKGKWLHDIALVNTDTGKVFNERL---RFTYIELLSFKKTENQLETDLEKWIYA 206 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 K+++ LL A +++ ++ + Sbjct: 207 LKNLKHLKQAPAAFTEPQLLQFCQAARYINLTKEEKNMISAKTKARWDYYYAIDGAKIMG 266 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 RE G + +I L G IQ++T LS +++ L Sbjct: 267 RE-------EGETRGAHQKAAQIAIKLKNKGVPFTEIQELTELSITEIKNL 310 >UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacteroidetes/Chlorobi group RepID=Q3ARM2_CHLCH Length = 322 Score = 82.1 bits (201), Expect = 2e-14, Method: Composition-based stats. 
Identities = 40/311 (12%), Positives = 91/311 (29%), Gaps = 39/311 (12%) Query: 11 DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK + D F+ LP + + D L V + ++ Sbjct: 13 DFGFKKLFGSEMNKDLLIAFLNTLLPIEAGTIAD---LTFLPNDRVGRSEFDRRA--IFD 67 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPML----- 122 + + G Y ++E Q + + R + Y+ +Q + K I M+ Sbjct: 68 LHCKNEKGE-YFIVEMQQAKQDYFKDRSVFYASFPIQEQAQKGKWNYCLQPIYMVGILDF 126 Query: 123 FYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPD--DEIVQH--RRVALLE 178 + +++ + T F +++ DE+ + LL Sbjct: 127 IFDENKADDTIVHHEIKLVNLSTGKVFYEKLTFIYLELPKFTKSVDELESDFDKWCYLLS 186 Query: 179 LIQKHIRQRDLMGL--------IDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 + + + + ++ E + + + D A Sbjct: 187 NLPDLTDRPARLQEKVFLKVFELAEIAKYTPEEAREYEKSLKVYRDLKNVIDCAYDEGKA 246 Query: 231 SELTRRMPQHRE-------------RIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWI 277 + + + +E + ++ G +KG+ I R L+ G + Sbjct: 247 EGIEEGIEKGKEIGVLEGMVKGKELGLQEGLQKGMEAGLLKGKLEIARKLMVKGMSADEA 306 Query: 278 QKITGLSAEQM 288 I G+ E++ Sbjct: 307 AGIAGVDVERL 317 >UniRef50_B0NFN2 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=B0NFN2_EUBSP Length = 341 Score = 82.1 bits (201), Expect = 2e-14, Method: Composition-based stats. 
Identities = 36/288 (12%), Positives = 90/288 (31%), Gaps = 22/288 (7%) Query: 12 ALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTR 71 LF+ + + + L+ LE+A ++ H+DI + + +R Sbjct: 44 RLFEMIFSQKKELLELYNAVNGTSYDDPELLEINTLENAIYMSM-----HNDISFIIDSR 98 Query: 72 EGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-----QRHIEHDKRQPLPLVIPMLFYHG 126 + EHQS ++ R + Y + ++ + +P ++FY+G Sbjct: 99 ------LALYEHQSTYSPNLPLRHLMYVTDLYSAMIRDANLYGSRIVRVPTPRFLIFYNG 152 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ 186 + + L + ++++ + ++++ R L + + R Sbjct: 153 EQEQPERRILRLSDAYTVPEESPALELEAVMLNINEGKNRQLMESCR-TLSDYARYTQRV 211 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQ---HRER 243 R +++ + V + +L+ L I E E Sbjct: 212 RGYARVME--ISAAVERAVTECIAEGILSEFLSKNRAEASKVSIYEYDEEKHMRQVREEG 269 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 M ++G ++ Q G E + +++ + Sbjct: 270 QMDGRNEGRSEGEALKLITQIQKKCQKGKSLEETAEDLEEKPGEIEGI 317 >UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7AK04_9PORP Length = 299 Score = 81.8 bits (200), Expect = 3e-14, Method: Composition-based stats. 
Identities = 40/312 (12%), Positives = 98/312 (31%), Gaps = 53/312 (16%) Query: 11 DALFKTFL---THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK F ++ + F+ L K D+ + + + + ++ Sbjct: 12 DYAFKRFFGTVSNKELTIGFLNSLLNK------DIKDIIFHNVEMQGNNTDSRKA--VFD 63 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ----------RHIEHDKRQPLPL 117 + DG +++ +E Q + + + R++ Y+ V+Q R + ++R+ Sbjct: 64 LFCEGSDGELFI-VEIQKKRQKYFSDRVLYYASFVIQMQADIESEKFRLAKEEERRRWNY 122 Query: 118 VIPMLF------------YHGSRSPYPWSLCWLDEFA----DPTTARKLYNAAFPLVDVT 161 I ++ Y + + +L + Sbjct: 123 HINKVYVVCFLDFRLDTRYTDKYRWDVVRMDRELKIPFSETLNEIYLELPKFNLNFEECD 182 Query: 162 VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTG 221 + + ++ + K Q D + + + L A + + Sbjct: 183 TFYKKFLYTMNNIDIMGQLSKETIQNDKLLRKLKSAIELQRMSAKER--------LAYEL 234 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKIT 281 A + + + + E + G +G ++I+ + Q G D I K Sbjct: 235 SIAAERDLAACMATSFEEGEE-------KGIAKGITEGMRKIILNMKQAGMDLATIAKTA 287 Query: 282 GLSAEQMQALRQ 293 GL ++++AL + Sbjct: 288 GLPEKEVEALLK 299 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 81.8 bits (200), Expect = 3e-14, Method: Composition-based stats. 
Identities = 50/336 (14%), Positives = 107/336 (31%), Gaps = 51/336 (15%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDS-LKLESASF----VDEKL 57 N+ T +D L+K ++ + +F+ + DL E D + + K Sbjct: 12 NYLPGTDYDGLWKKIIS--ELFEEFI-LFFAPDLYETIDFGKGIVFLEQELHKVIIKHKK 68 Query: 58 RALHSDILWSVKTREGD-GYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH------- 109 +D + V + G+ Y+++ IE Q ++D + R+ Y + R E+ Sbjct: 69 GKRIADKIVKVSLKNGEEKYVFIHIEIQEKQDPDFSKRMFTYFYRLFDRFQENIYSIAIL 128 Query: 110 ----DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF--------PL 157 P ++G+ Y ++ +E P+ + A L Sbjct: 129 TDLSKSNNSEPFQYS---FYGTELTYRFNTYKFNEADIPSLKKSTNPFAIAVLAGIYLHL 185 Query: 158 VDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 + E+ + + Q +L + L +I Sbjct: 186 TEKNYQKRYEVKKKLLKEFILSNQNLSSNYAEALCYFIDYLLYLPGELTKQLTKELFIHI 245 Query: 218 LLTGDEARFNEFISELTRRMPQ----HRERIMTIAERIHNDGYIKGEQRILR-------- 265 + ++E + E E I E+ G KG++ + Sbjct: 246 EKEANHMLYSEELKEAPTFAEYLKTVKEEGIEIGIEKGIEKGIEKGKEEGIEIGIEKGKM 305 Query: 266 --------LLLQNGADPEWIQKITGLSAEQMQALRQ 293 LL+ G E + K+ LS ++++ +++ Sbjct: 306 EEKRNLAAELLREGFSVEKVAKMVKLSIDEVKKIKK 341 >UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CRG1_9FIRM Length = 356 Score = 81.4 bits (199), Expect = 3e-14, Method: Composition-based stats. 
Identities = 39/314 (12%), Positives = 89/314 (28%), Gaps = 44/314 (14%) Query: 18 LTHPDTARDFMEIHL--------PKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 L D L P++L + ++ L+ ++ + DI+ + Sbjct: 42 LKDTKRFADLFNAILFQGKAVILPENLYPSPETTAVSLQDTQG-KNVVKKQYRDII--MN 98 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD------------------K 111 ++ + + +E Q+ ++M Y + K Sbjct: 99 WQDQALLMLLAVESQTAIHYAAPLKVMLYDSMEYAEQVRVKWKERPPRLSSAEFLSRFQK 158 Query: 112 RQPLPLVIPMLFYHGSRSPYPWSLCWLDEFAD-------PTTARKLYNAAFPLVDVTVVP 164 L VI ++FY+G+ L F + L N LVDV + Sbjct: 159 NDKLIPVITLIFYYGTEEWD-GPLELHQMFDLGTEKKHAELMKKYLPNYHINLVDVRRLK 217 Query: 165 DDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL---NYILLTG 221 + E Q + ++Q + L + ++ Sbjct: 218 NLESFQSDLQIIFGMLQYSQDKYALRTYVANHKDYFQKLDLETYHALGAFLNSRQLMEIN 277 Query: 222 DEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG----EQRILRLLLQNGADPEWI 277 E E + + + + E+ G +G ++ + + + G + I Sbjct: 278 VEKNEREELDMCKALEDIYNDGVQDGMEQGRRSGIAEGEASHKKEVAFQMQKLGYSLDAI 337 Query: 278 QKITGLSAEQMQAL 291 + S + + + Sbjct: 338 AAVLRESVDGISQI 351 >UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C3K1_9GAMM Length = 272 Score = 81.4 bits (199), Expect = 4e-14, Method: Composition-based stats. 
Identities = 42/287 (14%), Positives = 98/287 (34%), Gaps = 26/287 (9%) Query: 13 LFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTRE 72 K + P F++ L ++ D + + + D + + ++ Sbjct: 4 FLKKVFSKPHIFTAFVKDMLGIEIE--IDKVETEKSFSPIIGN------VDSRFDLFAQD 55 Query: 73 GDGYIYVVIEHQSREDIHMAFRLMRY-SMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPY 131 + V I+H+ +D + R + Y +A++++ +P V ++ Sbjct: 56 TKNRLIVDIQHKRYKDHY--DRFLHYHCVALLEQITSSANYKPDMQVYTIVVLTSGDKHK 113 Query: 132 ------PWSLCWLDEFADPTTARKL-YNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 +S LD + T K+ Y + D T P E ++ +L + +++ Sbjct: 114 TDLLITDFSPKKLDGSSIAETQHKIVYVCPKYVTDETPKPYQEWLKAINDSLDKQVEESH 173 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 +++ I L+ + Y + + + E + + Sbjct: 174 YHNEVIQEIFSLIKKDKISP--EEYARMKDEYSDEEYLQEQTQKARKE------GMEKGM 225 Query: 245 MTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + G KG + + + + E I ++TGLS EQ++ L Sbjct: 226 EKGIGKGIEKGIEKGVLMMAKNMKEAKVAIETIIEVTGLSIEQIEDL 272 >UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ATN4_CHLCH Length = 287 Score = 81.0 bits (198), Expect = 5e-14, Method: Composition-based stats. 
Identities = 48/309 (15%), Positives = 100/309 (32%), Gaps = 42/309 (13%) Query: 8 TPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D + K L A D I L +D +L +++ +D++ Sbjct: 2 HAKDVVSKDIL--KRIALDIARILLH------LKVDHAELLETEH--QRVEERRADVVVL 51 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 V + G + +E Q+ ++A+RL+RY + H +D +Q L Y G Sbjct: 52 V--QGESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAHKGYDIKQYL-------IYIGK 102 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI----QKH 183 + + + ++D+ V ++ L L K Sbjct: 103 A----------PLSMPTGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDFKG 152 Query: 184 IRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRER 243 +R+++ I Q + L E + + IL + I E + + Sbjct: 153 RSEREVVRYIIQRLQELTAENESRYHDYMRMLEILSANRSLE--KIIEEEEAMLSVVDQT 210 Query: 244 IMTIAERIHNDGYIKGEQRILRLLLQNGADPEW-------IQKITGLSAEQMQALRQPLP 296 + G +G Q+ L++ + + ++ L+ EQ++ L L Sbjct: 211 RLPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHHVARLDKLNIEQLEELSDALL 270 Query: 297 ERERYSWLK 305 + + Sbjct: 271 DFNTVTDFD 279 >UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAN2_9SPHI Length = 317 Score = 80.6 bits (197), Expect = 6e-14, Method: Composition-based stats. 
Identities = 32/318 (10%), Positives = 88/318 (27%), Gaps = 32/318 (10%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 T D FK +D + L + + L + Sbjct: 4 TTKYIDPLIDFAFKKIFGGDPN-KDLLIDLLNALFKGRKIIIDLTYNKNEHPGDSEHEGA 62 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR----QPLPL 117 + ++ + +G ++ IE Q + + R + Y+ ++ R L Sbjct: 63 A--VFDLLCTGQNGEQFI-IEIQRAKQENFKERALFYTSRLISSQAPKGNRASWGYRLTE 119 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAA---FPLVDVTVVPDDEIVQHRRV 174 V + + +L + + +++ + Sbjct: 120 VYLIALMEDTTLNDESEHEFLHDICLCKRDTGKVFYEKLGYLYIELRKFVKSSTELQTDL 179 Query: 175 ALLELIQKHIRQRD------LMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNE 228 + K++ D + ++L + + + + + + D E Sbjct: 180 DRWLFLLKNLSSMDKIPVYLRKPIFEKLFSIAEYSNLSKEEKMSYDSRMKYKWDNENVRE 239 Query: 229 FISE--LTRRMPQHRER-------------IMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 + + L + + + RE+ + +G + +I + Sbjct: 240 YARKEGLEKGLEEGREKGRLEGKLEGKLEGKLEGKLEGKLEGRKEAAIKIAGEMKSANLP 299 Query: 274 PEWIQKITGLSAEQMQAL 291 + I + T LS E+++ + Sbjct: 300 LDQIARFTKLSLEEIEGI 317 >UniRef50_A6EA97 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EA97_9SPHI Length = 293 Score = 80.2 bits (196), Expect = 7e-14, Method: Composition-based stats. 
Identities = 47/282 (16%), Positives = 107/282 (37%), Gaps = 21/282 (7%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 R+ ME+ LP+ +RE+ L+ L E + R D L V +G+ ++ + I Sbjct: 16 KIIRENMEVTLPEVIREVLGLEILLSEELPDDVQHTRERKPDALKKVTDIQGNTFV-LHI 74 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRS------PYPWSL 135 E Q ++ M +R+ YS+ +M+R+ LP+ ++F ++ P + Sbjct: 75 EFQVEDEKEMVYRMAEYSIMLMRRYQ-------LPVKQYVIFLKDTKPRMPTGLKTPKLV 127 Query: 136 CWLDEFADPTTARKLY----NAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMG 191 D + KL+ N ++ V D+ + +++ + H + Sbjct: 128 YSFDLIRIAEISYKLFIKSDNPEVKMLAVLANFDEADREGALTSIITGLLSHSKGDFAER 187 Query: 192 LIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERI 251 + + + + ++ Q + + T + + F + R E + Sbjct: 188 RHFKQLRIFMQLRSSIEQHFDKVMDSVSTFFKEENDYFYRKGEARGEIKGEA--KGEAKG 245 Query: 252 HNDGYIKGEQRILRLLLQN-GADPEWIQKITGLSAEQMQALR 292 G K + ++ L+ G E +I ++ + ++ +R Sbjct: 246 EAKGEAKKSRAVVENLIAKLGFSDEQAAEIAEVTVDFVKDIR 287 >UniRef50_A8VV66 ATPase associated with various cellular activities, AAA_3 n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VV66_9BACI Length = 214 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. 
Identities = 42/210 (20%), Positives = 79/210 (37%), Gaps = 15/210 (7%) Query: 108 EHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFAD------PTTARKLYNAAFPLVDVT 161 + + P L+IP+L G R + D F+ + N + L D+ Sbjct: 3 KEGRGNPRTLIIPILIAQGRRRWSRSTTLMADFFSHYSEALRDDCEPFIPNFRYLLYDIQ 62 Query: 162 VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECAND------SQITALLN 215 ++++H + + + + + D L ++ LL + Q+ LL Sbjct: 63 EQDAADMIRHTLLKITIELMALVFEEDESKLEARMTELLTMSEIGEISDSYAEQVLRLLE 122 Query: 216 YILL---TGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGA 272 Y++ D+A F +T + E IM A+++ G K E I L + G Sbjct: 123 YVMRGNRHFDQAMFETIRQNVTTEAHEGSELIMNFADQLEQKGKHKKELAIFLKLTRRGE 182 Query: 273 DPEWIQKITGLSAEQMQALRQPLPERERYS 302 E I + L + +AL+ + E + S Sbjct: 183 SKESIMDLLDLDDKSFEALQAEVNEMDENS 212 >UniRef50_C4Z592 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C4Z592_EUBE2 Length = 315 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 45/312 (14%), Positives = 101/312 (32%), Gaps = 34/312 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M DALF+ + + + +E +V Sbjct: 1 MGVEINRKFKDALFRKVFEEKKDLLSLYNALNNTEHTDENLITVNTIEDVIYVG-----Y 55 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ------- 113 +DI + + + + +Y EHQS + +M R + Y + + +IE + + Sbjct: 56 KNDIAFVI---DSELNLY---EHQSSVNKNMPIRGLIYFAELYKGYIERNSLRIYNETEV 109 Query: 114 PLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTA----RKLYNAAFPLVDVTVVPDDEIV 169 LP ++FY+G + S+ L + A + + L+++ + EI+ Sbjct: 110 KLPFPRYVVFYNGEKDETEKSVQRLADLFVRNEANQNQKPCLDVEVLLLNINYGCNKEIM 169 Query: 170 QHR-------RVALLEL-----IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 R+ + + K Q + ++ V+ ++ +L I Sbjct: 170 NKCQKLMEYSRLIAMIRGKTADLAKIYSQDSIEKSKKEIFTEAVSLAIEEAISNNILREI 229 Query: 218 LLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWI 277 L+ + ++E + R + + I L+ E Sbjct: 230 LIKNKAEVTDMLLTEFDEKDYIEGVREEGERKGREEGREEGRNKMIYSLVEDKSISMEKG 289 Query: 278 QKITGLSAEQMQ 289 + G+S E+++ Sbjct: 290 AQKLGISVEKLK 301 >UniRef50_D0TYF1 Putative 
uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYF1_9BACE Length = 349 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 48/355 (13%), Positives = 103/355 (29%), Gaps = 70/355 (19%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M+ + D FK P + ++ + L + L + L + + Sbjct: 1 MSKYVNPFT-DIGFKIIFGQPAS-KNLLITLLNELLAGEHHITELTFLDKEDHADNVSDK 58 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH----------- 109 I++ + R G Y+++E Q+R + R + Y + R IE Sbjct: 59 G--IIYDLYCRTASGE-YIIVEMQNRWHSNFLDRTLYYVCRAVSRQIESPSSKEVPVPED 115 Query: 110 -----------DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLV 158 K+ LP + + + + D K+ N + Sbjct: 116 PMTAREPLVSYGKQYRLPTIYGIFLTNFKEE-NLEAKFRTDTVLSDRDTGKIVNPHLRQI 174 Query: 159 DV------TVVPDDEIVQHRRVALLELIQKHIRQRDLMG--LIDQLVVLLVTECANDS-- 208 + + D + + + L+ + R D + + + L L ++ Sbjct: 175 YLQFPYFTKDLSDCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLAAVADLSEENR 234 Query: 209 ------QITALLNYILLTGDEARFNEFISELTRRM------------------------- 237 +N I+ + + E + Sbjct: 235 IAYDKALDRYRVNQIVEEDERRKNEEMRRKAAEEGLKEGMKAGLEKGVKKGRLEGIKEGM 294 Query: 238 -PQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 +E + E+ G K + I R + ++G + I K TGL + ++ L Sbjct: 295 KEGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSSDIENL 349 >UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAA90 Length = 345 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. 
Identities = 40/298 (13%), Positives = 107/298 (35%), Gaps = 32/298 (10%) Query: 11 DALFKTFLTHPDTARDFMEIHL-------PKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D +F+ ++ + + F+E L +++ E+ L++ L+++ + + D Sbjct: 64 DFVFEKIFSNHERMKSFLESVLVGKNKILHEEINEVIYLNNNLLQNSLTQEYIPKKSMFD 123 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLF 123 + +KT +G ++E R R+ YS + + L +I + Sbjct: 124 L--QIKTSQGT----FIVEIYKRSFQPFLKRIQYYSAQSLSQQQNQ-THTSLKPIISIAI 176 Query: 124 YHGSR-SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDE--------IVQHRRV 174 + + T L + + +++ + + + + + Sbjct: 177 VDDILFEDDVPCISFHKTIEQKTQKVFLNYSTYVFIELGKYDNKKYDQSCVHGVNEKEWL 236 Query: 175 ALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELT 234 LL+ H + + L + E D + L ++ E + Sbjct: 237 DLLKKSDIHRQYKTKEVLNAAQYAQFIQEKLFDEYVKHKLY------EDQFIEEIKNAKV 290 Query: 235 RRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALR 292 + Q +E + +++ G++ +++ +L++G + I TGLS E++ ++ Sbjct: 291 EGIQQGQEETIKLSKHYS---IKAGKEEVVKQMLKDGLSLQKIITYTGLSKEEIDEIK 345 >UniRef50_C0DB21 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DB21_9CLOT Length = 328 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. 
Identities = 40/310 (12%), Positives = 87/310 (28%), Gaps = 52/310 (16%) Query: 16 TFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESAS----FVD----EKLRALHSDILWS 67 L+ P D L + + L S + D +K+ D+ Sbjct: 10 KLLSDPVYFSDLCNGVLFR-GEMYLKPEDLMPVKGSQGVLYADRKGVKKVLERRRDVAMR 68 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-------------- 113 +K G Y + +E+Q+ M R + Y ++ +++ Sbjct: 69 LK--SGTRYAVIAVENQANIHYAMVIRSLLYDALDYTDQVQIQEKELRQAGRRPSGDGFL 126 Query: 114 -------PLPLVIPMLFYHGSRSPYPWSLCWLDEF------ADPTTARKLYNAAFPLVDV 160 L V+ ++ Y GS + P A + + LV+ Sbjct: 127 SGVGPKLRLEPVVTLVLYWGSGHWDGSTSLHELLGLKDGKGEAPELAGYIPDYRLNLVNA 186 Query: 161 TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQ-------ITAL 213 + D I + + +++ + L + L Sbjct: 187 ANMDDPSIFRTHLQQIFSMLKYKSDKAALYRYAQENRTELRDMDGTAKLALLSMMGEQKR 246 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQR---ILRLLLQN 270 L I+ + + + + ER G+ +GE++ ++ LL Sbjct: 247 LQKIMEEAEGEEEFDMCKAIDDLIADGES---RGFERGDRQGFERGERQLSSLISRLLAE 303 Query: 271 GADPEWIQKI 280 + I++ Sbjct: 304 NRP-DLIEQA 312 >UniRef50_UPI00019735B3 hypothetical protein ClM62_08045 n=1 Tax=Clostridium sp. M62/1 RepID=UPI00019735B3 Length = 255 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. 
Identities = 35/241 (14%), Positives = 80/241 (33%), Gaps = 27/241 (11%) Query: 7 STPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 D LF+ + D R+ L + LE+A +++ + +D+ + Sbjct: 23 RDYKDTLFRMLFNDREALLSLYNAVGNTDYRDPSLLQIVTLENAVYMN-----VKNDLAF 77 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ-----RHIEHDKRQPLPLVIPM 121 + G+ + EHQS + +M R + Y+ + + + + LP+ + Sbjct: 78 LL------GFELNLYEHQSTWNPNMPLRDLFYAAREYEMLIRDQSLYSSRLIKLPVPRFI 131 Query: 122 LFYHGSRSPYPWSLCWL---------DEFADPTTARKLYNAAFPLVDVT--VVPDDEIVQ 170 +FY+G + L + + L + +V+ ++ + Sbjct: 132 VFYNGREKQEERCVLKLSDAFETPVEECIHEGILRDFLLKYRAEVTNVSIFEYNEEREKE 191 Query: 171 HRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFI 230 R A E +K ++ + I L+ A+ +++ L E Sbjct: 192 LLRKAEYEFGKKEGMEQGMEQGICALIQTCRELGASRETVSSALIRRFSISCEEAEGYLE 251 Query: 231 S 231 Sbjct: 252 R 252 >UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FHW2_9AQUI Length = 211 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 86/199 (43%), Gaps = 10/199 (5%) Query: 108 EHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDE 167 K++ P +I ++FYHG R + L D + L+D+ +PD+E Sbjct: 3 RSHKKEYYPPIINIVFYHGEREWNIPTN--LPTVKDKDLQEYTQKLNYILIDLNKIPDEE 60 Query: 168 IVQH--RRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR 225 + + + ++ I R D + + ++ L++ ++ + +L+YI+L +A Sbjct: 61 LKNRISKNMDVILAILVMKRIFDDIQNLRPILELIIKHKSDS--LFIILDYIVLIKKDAE 118 Query: 226 FNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSA 285 E ++ + + E++MT+ E+ +G++KG+ + + IQ G Sbjct: 119 KVE---KILKEISGGDEKMMTLTEKWKMEGWMKGKLEGRLEAQRKAI-IKLIQLKFGNIP 174 Query: 286 EQMQALRQPLPERERYSWL 304 E +++ + ++ + Sbjct: 175 ESLESFINKCEDADKLDEI 193 >UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia RepID=A6MYW5_9RICK Length = 296 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. 
Identities = 37/295 (12%), Positives = 87/295 (29%), Gaps = 23/295 (7%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK + +D + + + + + + L + S + Sbjct: 9 DLAFKKIFGVEEN-KDLLISLINSIVSKEDQIVDVTLLNPYNPQNFRNDKLSILDIKALG 67 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-QRHIEHDKRQPLPLVIPMLFYHGSRS 129 G + IE Q ++ R + Y + + L I + + + Sbjct: 68 ESGKRF---NIEIQITDEADYDKRALYYWAKLYTEALQASQDYSSLNKAIGIHILNFTSI 124 Query: 130 PYPWSLCWLDEFADPTTAR-KLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 P + + + + +++ + + L ++++K D Sbjct: 125 PETNKYHNIFHITEKDSGLLYFKDLELHTIELNKFSN-----NPNEELADILKKVGNSLD 179 Query: 189 LMGLIDQLVVLLVTECANDSQITALLNYILLTGD----EARFNEFISELTRRMPQHRERI 244 + LL + A L L D + + + + + + Sbjct: 180 IWSAFLTRHDLLNSNNLPKKLDNASLKKALTVLDVMNFTSEERDAYEDHLKWLRIEANTL 239 Query: 245 MTIAERIHNDGYIKGEQ--------RILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + G ++G Q I R L ++G I + TGL+ +Q++ L Sbjct: 240 KKYEAQARVRGKVEGIQIGKTEEKIAIARNLKRSGVAITIISESTGLTKKQIEEL 294 >UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BTR0_9GAMM Length = 309 Score = 78.7 bits (192), Expect = 2e-13, Method: Composition-based stats. 
Identities = 57/318 (17%), Positives = 104/318 (32%), Gaps = 34/318 (10%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M T D K L D +E L L+E + + ++ D + Sbjct: 1 MPTETKLVRFDWALKNILRDKANF-DVLEGFLTALLQEDISVLEILESESNQSDFAKKFN 59 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVI 119 DIL + ++IE Q+ + R++ + ++ +E + + + VI Sbjct: 60 RVDILVKDSHQRK-----MIIEVQNHRETGYLERILWGTSKLIVETLELGEDYRNISKVI 114 Query: 120 PM---------------LFY-----HGSRSPYPWSLCWLD-EFADPTTARKLYNAAFPLV 158 + ++Y HG + P+ L + + K F L+ Sbjct: 115 SISIVYFDLGLSDDNEYVYYGVANLHGLQHNQPFRFRRLMADKTFKSLQTKDIFPEFYLL 174 Query: 159 DVTVVPDDEIVQHRRVALLELIQKH--IRQRDLMGLIDQLVVLLVTECANDSQITALLNY 216 V D + + + KH IR I++ L N + Y Sbjct: 175 RVEHFQD---IIKTDLDEWIYMLKHSTIRTDFKSKNINKAQEKLTLLQMNPQKRKDYEKY 231 Query: 217 ILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEW 276 ++ E E E +E I + G K I++ LQ G + Sbjct: 232 MVDMTVERDVLEAAQEEG-IQKGRQEGIQEGRQEGIQKGMEKKTVVIVKNALQQGLELTL 290 Query: 277 IQKITGLSAEQMQALRQP 294 I +TGLS E++Q ++ Sbjct: 291 ISSLTGLSIEEIQKIQND 308 >UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W4R9_DYAFD Length = 293 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 45/306 (14%), Positives = 98/306 (32%), Gaps = 37/306 (12%) Query: 9 PHDALFKTFLTHPDTARDFMEIHLPKD---LRELCDLDSLKLESASFVDEKLRA---LHS 62 +D L+K+ L + DF++ P + L E + A + Sbjct: 3 RNDMLWKSIL--EEIFDDFLKFFFPNAEALFDMDRGFEYLDQELEQLFPPEGNAIATRYV 60 Query: 63 DILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 D L V R G + ++ V IE Q D R+ Y + ++ + + Sbjct: 61 DKLVKVYCRSGAEAWLLVHIEVQGYRDETFPDRMFTYYYRICDKYR--------KPITAI 112 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQ 181 +L + F V ++E+ ++ Sbjct: 113 AILTDD------CRHFLPGQFEQACLGTSVCFRFNSYKVLEQSEEELAASDNPFAQVILA 166 Query: 182 KHI-------RQRDLMGLIDQLVVLLVTECANDSQITALLNYI---LLTGDEARFNEFIS 231 + +L L L L+ + ++ L+ ++ + D+ E++ Sbjct: 167 TKLAIKGSRFSSDELYRLKIDLAKRLLKRNFSKRKVGRLMEFLKFYVSLEDDDLDREYLK 226 Query: 232 ELTRRMPQHRERI---MTIAERIHNDGYIKGEQRILRLLLQN-GADPEWIQKITGLSAEQ 287 E+ R + TI + G + +++ L++ E I ++ +S E Sbjct: 227 EVQRLFNPEPIPMTWEETILYIVEEKGAEAAKTTVVQNLIRETNFTSEEIARLADVSVEF 286 Query: 288 MQALRQ 293 +Q ++Q Sbjct: 287 VQKIKQ 292 >UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria RepID=C0QZ87_BRAHW Length = 309 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 44/314 (14%), Positives = 109/314 (34%), Gaps = 30/314 (9%) Query: 3 NFTTSTPHDALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 + +D + + D + + L E + +L++ + + E Sbjct: 1 MKEINRLNDLFVRYLIGTEGDEDILENIVNAVLNDVGFE--SVSNLEIINPYNLAENENL 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLV 118 S + KT++G ++IE Q + + R++ Y + ++ ++ + + Sbjct: 59 KESILDVKAKTKDGKK---ILIEIQLIGNNNFIKRILYYIAKNISSELKENENYINISQM 115 Query: 119 IPMLFYH-----GSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPD-------D 166 I + F + GS S ++ KL + +++ + D Sbjct: 116 ISISFLNFNLKIGSESDIKREHKCFQLSDINNSSLKLDDFQIHFIEIKRFAEILKNASID 175 Query: 167 EIVQHRRVALLEL---------IQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYI 217 + +++ ++ ++ I K I D+M + V + S ++ Sbjct: 176 DYNKNKLLSWIDFFTAKDLEKSINKLIGGNDIMSKVMDKYKRFVADEKEMSAYNERDTFL 235 Query: 218 LLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWI 277 ++ + ++ I E+ G I R L ++G D ++I Sbjct: 236 YGQAAMLQYEREEGKKEGIEIGIQQGIKEGIEQGIEQGEKNKALSIARSLKKSGLDDKFI 295 Query: 278 QKITGLSAEQMQAL 291 + TGL+ E+++ L Sbjct: 296 SENTGLTIEEIEKL 309 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 49/297 (16%), Positives = 105/297 (35%), Gaps = 30/297 (10%) Query: 10 HDALFKTFLTHPDTARDFMEIHLPKD---LRELCDLDSLKLESASFVDEKLRALHSDILW 66 HD FK D ++ ++ P+ ++ D++ L+ E + +L D+ Sbjct: 4 HDHNFKNLF--LDFPKETLDWFFPQAGQSWGKVLDVEFLRQEPKKH-NLSDSSLELDMPI 60 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHG 126 ++ ++E Q + ++L+RY+ +M+ H + LVIP + + Sbjct: 61 LFNFENQQLLLW-LVEFQEDKSKFSIYKLLRYTTDLMETHPDA-------LVIPTVLFTD 112 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQK-HIR 185 + S L + R + + + + + + L+ K H + Sbjct: 113 RKKW---SKAVLQQLHAQLHDRMFLHFEYVFHKLFDLNARDYYNVDNPVVKILLPKMHYK 169 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIM 245 + D + +I Q L ++ +++I + + L + QH+E M Sbjct: 170 KEDRIEVIRQAYAGLFQLVSS-GLFDKYVDFIDTYAEIEDQEQL--NLYNEIVQHKETAM 226 Query: 246 TIAERIHNDGYIKGEQR--------ILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 +A+ I G +G + +R Q G I KI L + + Sbjct: 227 -LAQYIRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDLDVSMVNKILNN 282 >UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicyclobacillus acidocaldarius RepID=C8WSD0_ALIAD Length = 270 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 47/261 (18%), Positives = 94/261 (36%), Gaps = 32/261 (12%) Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMA 101 +++L+ + LR D W + + +E Q R + + R + Y Sbjct: 34 VETLEPFTTELPASTLR---MDRAWRMANGD-----VFHLEFQDRRERTLH-RFLEYDAR 84 Query: 102 VMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVT 161 + + + ++ YH + P L D TA F Sbjct: 85 LANQVKTR--------IRTVVLYHAQVASAPQEL-------DIGTAIYRVENVFLSALDG 129 Query: 162 VVPDDEIVQHRRVALLELIQK-------HIRQRDLMGLIDQLVVLLVTECANDSQITALL 214 DE+ H RV E + +R D + +++ LL +D + + Sbjct: 130 DGALDEVEAHLRVGRWEPADRLRLGLALSMRVEDRHQAMARVLNLLPRVP-DDEERELVA 188 Query: 215 NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADP 274 + +L GD A +E +L + + + E +DG + + I LL G Sbjct: 189 SAVLAFGDRALSDEDRRKLRKELKNVFRMAEELYEDGRHDGKQQAAEDIAHRLLAEGVPV 248 Query: 275 EWIQKITGLSAEQMQALRQPL 295 + ++K TGL E+++ +++ + Sbjct: 249 DVVEKATGLPRERLEQMKREV 269 >UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZPJ4_9SPHI Length = 302 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 49/320 (15%), Positives = 108/320 (33%), Gaps = 54/320 (16%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 ++ S +D +FK + + + +L ++ +L + Sbjct: 14 KSYDMSNQYDKIFKENIG--EHFLSLSKTYLGIEVASSEELKD--------KLQTTLERE 63 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPM 121 +D L + T +G+ I + +E QS ++ MA R+ Y + Q++ + Sbjct: 64 ADFLRKITTPKGEQMI-IQLEFQSTDEQGMAERMQLYFAILRQKYKL--------PIRQF 114 Query: 122 LFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA-LLELI 180 + Y GS+ P + +E F L+D+ V + ++ +L + Sbjct: 115 VIYVGSKPPKMRTRLKPEEVFTG----------FELLDLRQVSYTQWLESDIPEEVLLAV 164 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 +Q+ + ++ Q++ +V + L YI AR + E + + Sbjct: 165 LGDFQQKKVSTVLKQIISKIVKLIDD---PGTLQKYIRQLATFARLRNLVIETEQTLEYM 221 Query: 241 RERIMTIAERIHNDGYIKGEQRILRLLLQNG---------------------ADPEWIQK 279 + + G KG+Q + Q G E + + Sbjct: 222 GLTYDIEKDVFYQRGVKKGQQEGIEKGHQEGIEKGITQGVVKMVIALLKSGKMPLEEVAR 281 Query: 280 ITGLSAEQMQALRQPLPERE 299 I LS +Q + + + + Sbjct: 282 IAELSVIDVQKMADQIKKPD 301 >UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=Bacteroides fragilis 3_1_12 RepID=UPI0001B4A8CA Length = 282 Score = 77.9 bits (190), Expect = 4e-13, Method: Composition-based stats. 
Identities = 43/284 (15%), Positives = 86/284 (30%), Gaps = 13/284 (4%) Query: 11 DALFKTFLT-HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVK 69 D FK HPD F+ LP L E ++ + V E +S + + Sbjct: 9 DLTFKRVFGEHPDLVMSFLNALLPLRLEESI--TDIEYLPSGMVPENSLPKNSIVYVRCR 66 Query: 70 TREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIPMLFYHGSR 128 +G +I +E Q +M + R ++ + L V + + Sbjct: 67 DSKGRSFI---VEMQMIWSPEFKQCVMFNASKAYVRQMDSGEQYDLLQPVYSLNLVNDIF 123 Query: 129 SPYPWSLCWLD-EFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 P T R + V++ + + L I ++ Sbjct: 124 EPDIKEYYHYYRLVHVEHTERVINGLHLVFVELPKFTPHTYSEKKMHILWLRYLTEIDEK 183 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 ++ L+ +T L + +F ++ Sbjct: 184 -----THEVPEELLENPEIKKAVTVLEESAFTPEQLLGYEKFWDIISVEKTLISSAERKE 238 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 E +G ++ + + + G + I +TGLSAE+++ L Sbjct: 239 KEEGRKEGELQEKLLVASNAKKQGLSLDIISSLTGLSAEEIERL 282 >UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XJH0_CALS8 Length = 134 Score = 77.5 bits (189), Expect = 5e-13, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 50/137 (36%), Gaps = 3/137 (2%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 M N + +A+F+ + ++ + DS+++ + +E + Sbjct: 1 MNNNFSQDE-NAIFRLIFSDSKEILFLLKNVAKFSWVDRIQKDSIEVILVDYDNENVLKY 59 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 D++ V YI+V + + M ++ + ++ I+ +P +IP Sbjct: 60 KPDVIAKVTIENNTAYIFVFFVSKV-PECGMRNIILNNMLLFWEKKIKEGT-DKIPPIIP 117 Query: 121 MLFYHGSRSPYPWSLCW 137 ++ Y+G + Sbjct: 118 LVLYNGKEIWTEPREIY 134 >UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Treponema RepID=Q8GBS6_TREMA Length = 262 Score = 77.5 bits (189), Expect = 5e-13, Method: Composition-based stats. 
Identities = 36/284 (12%), Positives = 86/284 (30%), Gaps = 37/284 (13%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F + + + + F+E+ L + + + S + + + + DIL Sbjct: 13 DFMFCQVMKNKNLCKTFLEMLLADKIGNITHIASQSTVAPE---SEAKFVRLDILV---- 65 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLPLVIPMLFYHGS 127 ++ Y IE Q + ++A R+ Y A+ ++ + +I + + Sbjct: 66 QDEKNNFY-DIEMQVVNEHNVAKRMRYYQSALDVSFLDKGEYYTNLKDSYIIFVCLFDFI 124 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 D + ++V + I LE I+ Sbjct: 125 GKNKAVYFFENICLEDEPIRLRDGTKKII-INVDAFKN--IKDKALSGFLEYIKTGCITT 181 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 I++++ + + + ++ + Sbjct: 182 KFSERIEKMIRTIKQNEQARQEYRFISAVVM-----------------------DAKEEG 218 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + DG + +++ L G I K TGLS +++ L Sbjct: 219 RSQGFTDGVNQTKRKTAAALKAMGLAKSKIAKATGLSLAEIEKL 262 >UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1Q2_EUBE2 Length = 321 Score = 77.5 bits (189), Expect = 6e-13, Method: Composition-based stats. 
Identities = 47/336 (13%), Positives = 96/336 (28%), Gaps = 57/336 (16%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHL--------PKDLRELCDLDSLKLESASFV 53 + T+ D KTF + D + P L E+ S + S S+ Sbjct: 3 NSNRTTHQKDVSLKTFWRDNEHFADLFNATVFNGKQVLKPDKLTEMDTDVSATIHSKSYN 62 Query: 54 DEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH---- 109 + R D++ K +G + + +E Q + M R M Y + Sbjct: 63 ESITRNR--DVVK--KMSDGVEFNILGLEIQDKTHYAMPLRTMTYDALGYIKEYNDIKKH 118 Query: 110 ------------------DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLY 151 +K +I ++ Y+G C D K Y Sbjct: 119 HKLNKDSFSSHEEFLSGINKSDRFHPIITLVLYYGESLWD-GPTCLSDMMISMPDNIKAY 177 Query: 152 NAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQIT 211 + + L V ++ D+ + RD+ +I + + + Sbjct: 178 FSDYKLNLVQILDSDK-----------YTFYNEDVRDVFNIIRNIYNDDFDSIYREYESR 226 Query: 212 ALLNYILLTGDEARFNEFISELTRRMPQHR-----ERIMTIAERIHNDGYIKG----EQR 262 + ++ + +L Q E + + G +G + Sbjct: 227 NVDIDVMELICNITSVPKLMDLCTDTEQGGTVNMCEAMKRFQAECESKGMKEGIDSEKVN 286 Query: 263 ILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPER 298 + +L+ G E I +T + E ++ + Sbjct: 287 SIISMLEFGITKEQI--LTRYTKEDLERAEAAIANE 320 >UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QWI7_BRAHW Length = 289 Score = 77.1 bits (188), Expect = 6e-13, Method: Composition-based stats. 
Identities = 35/293 (11%), Positives = 92/293 (31%), Gaps = 13/293 (4%) Query: 4 FTTSTPHDALFKTFL---THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL 60 + +D + H + + + +K+ + + E + Sbjct: 5 KNINVLNDYFMRYMFAKEGHERILLNLINAVRTD--YNQEPFEEVKVLNTFNLKETINDK 62 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVI 119 S + T+ G+ V++E Q + +R + Y ++ ++K L VI Sbjct: 63 QSIVDVRAVTKSGET---VLVEIQRIGNQSFVYRSLYYWAKCYVSNLRNNEKYNDLKQVI 119 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVALLE 178 + + + T L N +++ + Sbjct: 120 VINILDFNLLKDIDKEHSCYVIKELETNHILTNHFEMHFLELQKYLSSNSNLKEELDAWF 179 Query: 179 LIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMP 238 ++ + +++++ +LV + ++ N T D + + Sbjct: 180 YFL---TIKEKIEKMEEIMDILVKKNPIMKEVYDEYNKFADTKDLFENYAEYEKNYFDIL 236 Query: 239 QHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 E + E +G + + + R + D + I ++TGL+ E+++ L Sbjct: 237 ALSEERIRGREEGIKEGIKETQISMARNMKNKNMDIKLIGELTGLTTEEIEKL 289 >UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKZ7_9FIRM Length = 329 Score = 76.7 bits (187), Expect = 8e-13, Method: Composition-based stats. 
Identities = 44/289 (15%), Positives = 90/289 (31%), Gaps = 42/289 (14%) Query: 15 KTFLTHPDTARDFMEIHL--------PKDLRELCDLDSLKLESASFVDEKLRALHSDILW 66 + L HP DF + P+ L ++ + + + +++ DI+ Sbjct: 9 RKLLNHPARFADFYNGTVFGGRQVLRPEQLSDVPNEQGIVILDKDG-KKRVVERRRDIIK 67 Query: 67 SVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIE------------------ 108 G +I E+Q M R M Y +E Sbjct: 68 KASF--GAYFILAAEENQDTIHYGMPVRNMMYDALDYTEQMECLKQAHKSRGDVLDGGGF 125 Query: 109 ---HDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARK-------LYNAAFPLV 158 + L V+ ++ YHGS+ P+ D +A++ L + L+ Sbjct: 126 LSGITREDRLMPVVSLILYHGSK-PWDGPRSLYDMLGLDASAKETLALKQVLPDYRINLI 184 Query: 159 DVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYIL 218 D + + E+ + +++ + ++ G Q +D + A+L + Sbjct: 185 DASNIEHPELFCTSLQHVFSMLKYNTDKQKFYGYAKQHQK--DLLDMDDDSMLAMLTLLG 242 Query: 219 LTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLL 267 + E S T+ I + +G I+G+ L Sbjct: 243 EQKRLLKILETSSNDTKEGTDVCIAIDELINDGKIEGKIEGKIEGEHRL 291 >UniRef50_B0C251 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C251_ACAM1 Length = 313 Score = 76.7 bits (187), Expect = 8e-13, Method: Composition-based stats. 
Identities = 50/320 (15%), Positives = 117/320 (36%), Gaps = 40/320 (12%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESAS-----------FVDEKLRA 59 D LF+ L + +F+++ P + D S++ + Sbjct: 5 DRLFRDLLKN--FFLEFVDLFFP-KIAVAIDPKSIRFLEDEESLKPQEQGEHSPASTKQE 61 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVI 119 S++L V+ R + +V +E+ S +I + R+ + +++ + Sbjct: 62 ASSNVLVQVRLRGQESCFWVHLENSSETNIKLERRIFHTFARLDEKYNL--------PIY 113 Query: 120 PMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLEL 179 P++ +S + + EF D R++ + +F + + + + +Q R L Sbjct: 114 PIILQSSDKSQRLETNGYRVEFVD----RRVLDFSFVAIQLHRLNWRDFLQRRNPVAAAL 169 Query: 180 IQKH-IRQRDLMGLIDQLVVLLVTECANDSQITALLNYI--LLTGDEARFNEFISELTRR 236 + ++ D + + + LL + ++ + +I L + A +E+ R Sbjct: 170 MPTMNVQTFDRPVVKAECLRLLTNLRLDAKKVKVISQFIEAFLHLNAAEEQVLQTEMERM 229 Query: 237 MPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN-------GADPEWIQKITGLSAEQMQ 289 RERI + +G +R L+ +P+ ++ L Q++ Sbjct: 230 GLLERERITNLLTSTTQANQQQGAEREALSLVFRLLKRRIGDLNPDLEAQVRSLPVNQVE 289 Query: 290 ALRQPL----PERERYSWLK 305 L + L E + +WL+ Sbjct: 290 DLGEALLDFNNEEDLKNWLR 309 >UniRef50_C9RMD5 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RMD5_FIBSS Length = 344 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 44/313 (14%), Positives = 94/313 (30%), Gaps = 40/313 (12%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 DA FK FL+ + +F+ + + +K + + + DI KT Sbjct: 38 DAAFKAFLSDEEALVNFLNGVFHLNEDNKIESVVIKNSEINIIFPSAKQFRLDI--RAKT 95 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM----------------------QRHIE 108 +G I + IE Q + R++ A M +R Sbjct: 96 SKG---ICINIEMQKARPDYFVDRVLLQQSAFMLQSKYEWDKLNFGDLPSCLTKEERAER 152 Query: 109 HDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAA-FPLVDVTVVPDDE 167 R +P + S D L + + L D+T Sbjct: 153 EIHRYEVPPTYAIWICDFSIG--KQKSFRGDWAVRNKKGLTLTDKMMYILYDLTKFNKPY 210 Query: 168 IVQHRRVALLELIQKHIRQRDLMG-----LIDQLVVLLVTECANDSQITALLNYILLTGD 222 + K+ + + + +I + + ++ A++ I N ++ T + Sbjct: 211 KKITTTEDRWLYLLKYAGKAENLPDFNNSIIAKAINRILVNRASEKLIREQANDMVWTEE 270 Query: 223 EARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITG 282 E + ++ + ++ G +G + +L + I K + Sbjct: 271 ELDHLALLEVRAE-----KKGLKQGLKQGLEQGLEQGRVEMALAMLADNEPIGKIVKYSH 325 Query: 283 LSAEQMQALRQPL 295 L ++ L+ L Sbjct: 326 LPESKILELKASL 338 >UniRef50_C6Y2C7 Putative uncharacterized protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2C7_PEDHD Length = 283 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 36/285 (12%), Positives = 84/285 (29%), Gaps = 19/285 (6%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D FK + +D M L + ++ E ++ +L+ + Sbjct: 14 DYGFKRLFGNEPD-KDIMIEFLNALFEGEKIVIDIRYSPTEHAGEDVKEKK--VLFDLTC 70 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLPLVIPMLFYHGS 127 DG ++ IE Q + R + Y ++ + PL V + Sbjct: 71 TGADGETFI-IEMQRADQEFFRDRCVFYMSRLISAQLPRGTSNWDVPLKEVYLIGIMEFQ 129 Query: 128 RSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ 186 + + + T + Y + +++ E + + K++ Sbjct: 130 FNNINSNYLHNIALMNRDTGKVFYKGMGYKFLELPNFDKKESDLVTELDKWFYLLKNLSH 189 Query: 187 RDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMT 246 D + + + + ++ E + I Sbjct: 190 LDKI-----------PDFLDKRVFQKIFKIAEMSKMTKEERELYDSDVKAKSDWNAGIRY 238 Query: 247 IAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 ++ +G ++ + I R L E I + TGLS ++++ L Sbjct: 239 AEKKAKEEGKLEEKLEIARNLKSKAIAFEIIAETTGLSIDEIEKL 283 >UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W1F3_DESAS Length = 303 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 43/277 (15%), Positives = 93/277 (33%), Gaps = 44/277 (15%) Query: 45 LKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQ 104 + + +++ D ++ +K + +E Q+ + R++ Y +++ Sbjct: 44 VSVMPTVLPVVEVKEKRIDFVFLLKDNS-----ILHLEFQTTIPKDILIRMVTYGSRLVE 98 Query: 105 RHIEHDKRQPLPLVIPMLFYHGS-----------RSPYPWSLCWLDEFADPTTARKLYNA 153 ++ + V ++ Y G Y ++ +F +++Y Sbjct: 99 KYDQD--------VNTVVIYSGKIESAPRLLRKGSLTYKVKNIYMKKFDGDAEYKRIYEK 150 Query: 154 AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 P DEI R + L + K + ++ +L + E I A+ Sbjct: 151 I-----KNKKPLDEIDIQRLIFLPLMKSKEKSEDEMAIQAAELAKEIPNEPIRAFTIGAI 205 Query: 214 LNYILLTGDEARFNEFISELTR-------RMPQHRERIMTIAERIHNDGYIKGEQRILRL 266 + E + L R E + + +G +G + LR Sbjct: 206 VAISDNFLTEEYKKRLLEVLRMTQIEQWIREEGREEGLKEGLKEGREEGLKEGLKEGLRE 265 Query: 267 LLQN--------GADPEWIQKITGLSAEQMQALRQPL 295 L+ G D E I KIT LS E++ +L++ + Sbjct: 266 GLEKTAIAALREGFDIETIVKITNLSKEEILSLKKKI 302 >UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia intestinalis ATCC 50581 RepID=C6LTE0_GIALA Length = 353 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 34/301 (11%), Positives = 89/301 (29%), Gaps = 34/301 (11%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKT 70 D +F + + L L+ + ++++ + Sbjct: 73 DFVFYQIFGVEKH-KSVLISLLNSILKGNPHVKDVRIDPTEHKRTTPDGKSVRLDIKATI 131 Query: 71 REGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH--IEHDKRQPLPLVIPMLFYHGSR 128 +G V +E Q + R + Y +++ + + + +P VI + Sbjct: 132 NDGT---IVDVEMQCINTGDIYHRSIYYQSLILRDYTIKQGQSYKSIPDVIII------- 181 Query: 129 SPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRD 188 D T + + P+ + EI + + + K + Sbjct: 182 ---------WIMNQDITNRKGCMHEIVPMYKANGIDQIEIASEKMRQFIIELTKLGNTSN 232 Query: 189 L--------MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 + + E + + + + + Sbjct: 233 FCYNKAFTAWMTFIKDPSSISGELLEVEGVQTAMKELTYLSENKETRAIYDARRIALLDL 292 Query: 241 RERIMTIAERIHNDGYIKGE----QRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLP 296 I E+ +G ++G +R+ +L +G D E+I + +GLS ++++ +++ Sbjct: 293 NSAIEHGIEKGKAEGLVEGRDKERERMAEQMLSDGLDIEFIVRYSGLSMQEIENVKKMAS 352 Query: 297 E 297 E Sbjct: 353 E 353 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 31/258 (12%), Positives = 82/258 (31%), Gaps = 31/258 (12%) Query: 47 LESASFVDEKLRALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH 106 L + + +DI++ ++ + +E Q+ R + Y +++R Sbjct: 41 LIPSVHPAVEANETRNDIIFLLEDDT-----LLHLEFQTTAGEQDLKRFLYYDARLVRRQ 95 Query: 107 IEHDKRQPLPLVIPMLFYHGS---RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVV 163 V ++ Y G L + + + + + + Sbjct: 96 ERK--------VHTIVIYSGRIEQARERLECGSILYQVENIYMKHYNGDQEYNRL-KHKI 146 Query: 164 PDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDE 223 + +++ L + ++ L Q L ++ + +++ D+ Sbjct: 147 DNHQLLSETDTLKLIFLPLMKSEQKEEELAIQ-AAELAKAAPDEKTKLFAIAALIVITDK 205 Query: 224 ARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKG--------EQRILRLLLQNGADPE 275 +L + ++ I + I +G +G ++ + +L G PE Sbjct: 206 IMSESNKRKLLEVL-----KMTQIEQWIREEGRQEGELKGRRDEKRETAQTMLNLGMSPE 260 Query: 276 WIQKITGLSAEQMQALRQ 293 I K T L E++ + + Sbjct: 261 LIAKATKLPLEEILEMAK 278 >UniRef50_A6EJS1 Putative uncharacterized protein n=2 Tax=Pedobacter sp. BAL39 RepID=A6EJS1_9SPHI Length = 293 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 47/291 (16%), Positives = 94/291 (32%), Gaps = 29/291 (9%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 R+ M+ LP + +L + + E + + D L + +G YI + I Sbjct: 20 KIVRENMQKVLPALINDLLGIKIIDREDLPESIQYTNEVIPDQLSKITDSQGGTYI-LHI 78 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEF 141 E QSR+D+ M R++ Y + K L + H ++ Sbjct: 79 EWQSRKDVCMTNRMLTYRAML------RRKYNLLVKQYVIFLEHSNQK------------ 120 Query: 142 ADPTTARKLYNAAFPLVDVTVVPDD-EIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLL 200 P + ++ ++++ + L I +I Q++ L Sbjct: 121 ISPEIEEEQLKFSYHMIELRQYDYQLFLKSEIPEQQLLAIFGDFGAVPPTEVISQIMQRL 180 Query: 201 VTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGE 260 D LL + + + + + + G +G Sbjct: 181 KKNPDGDLTTNKLLKQLRVLTQLRKLQSEFKTAMGTITKFKAEKDPYYIEGFEKGIQQGV 240 Query: 261 QR--------ILRLLLQN-GADPEWIQKITGLSAEQMQALRQPLPERERYS 302 Q+ I+ L+Q E K G+ E +Q +R+ L +++R S Sbjct: 241 QQGVQRRNHAIVINLIQQCRFSDEQAAKAVGVPIELVQQIRKELEDKDRDS 291 >UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PZ06_9BACT Length = 238 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. 
Identities = 36/193 (18%), Positives = 66/193 (34%), Gaps = 11/193 (5%) Query: 96 MRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAF 155 M+Y + + + Q L VIP++ YHG + E D R + + Sbjct: 1 MKYLLKIWA--ANSKQMQRLIPVIPVILYHGKETWKVRRFRDYFEGIDEVFFRFIPEFEY 58 Query: 156 PLVDVTVVPDDEIVQHRR----VALLELIQKHIRQR----DLMGLIDQLVVLLVTECAND 207 L D++ ++EI + + L+ ++I D + ++ E Sbjct: 59 LLTDLSFYSNEEIKDKVFRRVSLQITMLLMRNIYNDKILGDKLKAFFEIGKQYFEEGEGL 118 Query: 208 SQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLL 267 + +++ Y+ D I L + MTIA R+ G I G Sbjct: 119 KFLESVIRYLYYASDIEE-ERVIDTLKEISEEGGRLSMTIAARLIEKGKIAGRMEGRAEG 177 Query: 268 LQNGADPEWIQKI 280 + G I+ I Sbjct: 178 ERKGRMEGLIEAI 190 >UniRef50_A7BPH0 Putative uncharacterized protein n=5 Tax=Beggiatoa RepID=A7BPH0_9GAMM Length = 289 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 91/297 (30%), Gaps = 35/297 (11%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPK--DLRELCDLDSLKLESASFVDEKLRAL 60 +D +FK +HP ++ L ++ E+ S V E Sbjct: 20 KQVAPLRYDVIFKKAFSHPTIFTALVKDFLGIQLEIDEVKYNKGFVPSVNSLVSE----- 74 Query: 61 HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVM-QRHIEHDKRQPLPL-V 118 + + + + V ++H + R + Y + M + I + P+ + Sbjct: 75 -----FDLFVEDKKNQLIVEMKH-AYCSRSDYERFVYYQCSSMVEAVINSNSDYDFPMTI 128 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLE 178 I ++F+ ++P P S + +F A D+I Q + + Sbjct: 129 ITIVFFTWKKTPSPDSSIIVHDFESRDLATGQLL-------------DKIYQRKHQLIFV 175 Query: 179 LIQKHIRQRDL---MGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 + + + L E + L+ + ++ + + Sbjct: 176 FTNDSTHENTPSTYREWMQAIDDSLDGEVDEEKYTNPLIQELFGVIEKDKITPEERACMK 235 Query: 236 RMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-ADPEWIQKITGLSAEQMQAL 291 E + G K + R L N + I + TGLS E ++AL Sbjct: 236 DQYSQEEACIKAFNDGMKQGQSK---KTARNLKANSKLTEKEIARATGLSLEMVKAL 289 >UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVT6_PEDHD Length = 317 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats.
Identities = 39/318 (12%), Positives = 100/318 (31%), Gaps = 47/318 (14%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV------DEK 56 D K DF+ L ++ + + K Sbjct: 18 EERPRRKDDEFLKGAFED--NFPDFLRFVFSDADEILDFNREIEFLNNELFTIIPDRERK 75 Query: 57 LRALHSDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPL 115 +D+L + ++G + ++ + +E + D R+ Y+ + ++ Sbjct: 76 GGGRRADLLAKLYLKDGTEKWVLLNVEIEGGNDRKFGQRVFEYNYRIRDKYKVS------ 129 Query: 116 PLVIPMLFYHGSRSPYPWSLCWLDE--------------FADPTTARKLYNAAFPLVDV- 160 V + + G ++ + + F + F L+ + Sbjct: 130 --VASIAVFTGKKTQLRPTEYLDELLGTVLSFKYTAYHVFDHQEDELLKSDNPFSLIALA 187 Query: 161 -------TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 +PD+E+ R+ +++ + +H D +I ++ L +I Sbjct: 188 CQKALLEGKIPDEELADE-RLVIVKALLRHGY--DRQRIISFILFLKNFIFIESEEINRK 244 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 + + + + + ++ ++ + +I +G + I R L + G Sbjct: 245 FDQQIEELTKDKNPMGVIDVFKKWERQEAKI-----EGKLEGRREEALEIARELKKEGLT 299 Query: 274 PEWIQKITGLSAEQMQAL 291 E+I K T L +++ L Sbjct: 300 IEFIAKTTKLPIAEIEKL 317 >UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=Q24Y19_DESHY Length = 248 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats.
Identities = 26/247 (10%), Positives = 75/247 (30%), Gaps = 33/247 (13%) Query: 79 VVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD---KRQPLPLVIPMLFYHGSRSPYPWSL 135 + IE Q M R + Y + R I K + I ++ ++ + + Sbjct: 3 INIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDFNYLKQTSNY-H 61 Query: 136 CWLDEFADPTTARKLYNAAFPLVDVTVV-----PDDEIVQHRRVALLELIQKHIRQRDLM 190 + D + +++ + + + + L+ + ++++ Sbjct: 62 NVFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRRREISLWENELVRWLLLLEGADNQEIL 121 Query: 191 GLIDQLVV-------------------LLVTECANDSQITALLNYILLTGDEARFNEFIS 231 +++++ + + + + + + Sbjct: 122 QILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIREAELRLQEALEE 181 Query: 232 ELTRRMPQHR-----ERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAE 286 + + + + R E +G +G + + LL G + I + TGLS E Sbjct: 182 GMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEVAKKLLVLGFEITKIAEATGLSEE 241 Query: 287 QMQALRQ 293 ++ L+ Sbjct: 242 EISGLKD 248 >UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UZR7_CLOBO Length = 334 Score = 75.2 bits (183), Expect = 2e-12, Method: Composition-based stats. 
Identities = 45/336 (13%), Positives = 91/336 (27%), Gaps = 41/336 (12%) Query: 1 MTNFTTSTPHDALFKTFLTH-PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA 59 MT D + K + + + ++ D L + + F+ + Sbjct: 1 MTVSNEKVKLDEILKFLFSTSKKVLVNLLNGIFEENFSS--DEVELSVSNNEFIMDTFDT 58 Query: 60 LHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-----RQP 114 L D+ + V E + +E Q++ D M R+ Y + + P Sbjct: 59 LRGDVFFEVLNNEVSNKVTYHLEFQTKNDSTMIIRMFEYGFRKGKEQTGNRDDFKTIYFP 118 Query: 115 LPLVIPMLFYHGSRSP--------------YPWSLCWLDEFADPTTARKLYNAAFPLVDV 160 VI + + + Y + E+ D PL Sbjct: 119 KQKVIFIERNNNIKEDIKLKIVLPDEQSFIYSVPVMKYWEYTDNELIENKMYPLLPLQLF 178 Query: 161 TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITA-------L 213 + D E + H + + + ++ L L Sbjct: 179 NLRKDLEYARRSNNIDKINDLSHEAKEIALKIANESKKLFDDNEIIGEDFHKMLLAIQNL 238 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQN--- 270 + Y+ E + E E+ G KG ++ + ++ Sbjct: 239 IEYLNRNYFNDDRLEEEVSTMTKTLYDPEVEKRGIEKGIEKGIEKGIEKGMEKGIEKKAI 298 Query: 271 ---------GADPEWIQKITGLSAEQMQALRQPLPE 297 G E + K TGL E+++ L+ + Sbjct: 299 EDAIGFLRLGVSEEIVSKGTGLPIEKVRELKDKINN 334 >UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366FA Length = 342 Score = 74.4 bits (181), Expect = 4e-12, Method: Composition-based stats. 
Identities = 41/324 (12%), Positives = 86/324 (26%), Gaps = 39/324 (12%) Query: 11 DALFKTFLTHPDTARDFMEI-------HLPKDLRELCDLDSLKLESASFVDEKLRALHSD 63 D K L DF+ + L D L ++ + + ++ D Sbjct: 8 DYYMKILLEDRARFADFINVNVFHGKQVLAADKLSLLPNEAGIVVVDADGVKRTIQRRRD 67 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEH-------------- 109 ++ + G + V E+Q + MA R M Y I Sbjct: 68 VVMKAEF--GAYFCVVASENQGKVHYGMAVREMMYDALDYTEQIRKIEEKHRAEGDKLEG 125 Query: 110 -------DKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARK------LYNAAFP 156 K L V+ + Y+G+ + + D L + Sbjct: 126 ADFLSHVTKADRLIPVVTLTLYYGNEAWDGPRSLYEMMGIDEEWEETALVKKCLPDYKIN 185 Query: 157 LVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALL-- 214 L+D+ + + + L++ + ++ L + L Sbjct: 186 LIDIREGEKLDQYKTSLQHVFGLVKYNKNKQKLYEYTRVHREEINRMDRESKAAALALIG 245 Query: 215 -NYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 L E++ E + + + R G K + +R + Sbjct: 246 EQKRLQKILESKREEEMDMCQAIDELIADGEVRGEVRGILMGMEKTKINFIRKQYKKQLS 305 Query: 274 PEWIQKITGLSAEQMQALRQPLPE 297 I I L ++ + + + Sbjct: 306 SSQIANILDLDERYVEKVIKLFKQ 329 >UniRef50_C1QAK6 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAK6_9SPIR Length = 290 Score = 74.4 bits (181), Expect = 4e-12, Method: Composition-based stats. 
Identities = 37/289 (12%), Positives = 85/289 (29%), Gaps = 15/289 (5%) Query: 11 DALFKTFL---THPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D + H + + + +K+ + + E + S + Sbjct: 9 DYFMRYMFAKEGHEHILLNLINAIRTD--YNQEPFEEVKVLNTFNLKETINDKQSIVDVR 66 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHI-EHDKRQPLPLVIPMLFYHG 126 T+ G+ V++E Q + +R + Y ++ ++K L VI + Sbjct: 67 AITKSGET---VLVEIQRVGNQSFVYRSLYYWAKGYISNLRNNEKYNDLKQVIVINILDF 123 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYN-AAFPLVDVTVVPDDEIVQHRRVALLELIQKHIR 185 + + T L N +++ + Sbjct: 124 NLLKDINKEHSCYVIKELETNHILTNHLEMHFLELPKYLFSSSRLTDELYAWFYFLTIKE 183 Query: 186 QRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI- 244 +R+ M I ++ L+ ++ + E+ + + ERI Sbjct: 184 KREKMEEIMEM--LVKKNPIMKEVYDEYNKFVNTKDLFDNYTEYEKNYFDMLALNEERIK 241 Query: 245 --MTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + G + + + ++ D I K TGLS E+++ L Sbjct: 242 GREEGLKEGIEKGEKNKAISMAKNMKKDKVDFNTISKYTGLSIEEIENL 290 >UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKU9_9CYAN Length = 323 Score = 74.4 bits (181), Expect = 5e-12, Method: Composition-based stats. 
Identities = 43/322 (13%), Positives = 82/322 (25%), Gaps = 25/322 (7%) Query: 2 TNFTTSTPHDALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 S D FK D F+ + + + + ++ E L+ Sbjct: 3 KKKFISPKIDYAFKKIFGSDQSEDILISFLNAIV-YNGKSVISSLTIVNPYNPGQVETLK 61 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-PLPL 117 + DI + + E V+IE Q R+ + + Sbjct: 62 DSYLDIRAVLNSGE-----IVLIEMQVARIAAFYKRVTYNLCKAYANQLTSGDYYLEITP 116 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTT--ARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 VI + F D + + V++ Sbjct: 117 VIAVTITDFILFKENPKCIHHFVFKDKESSSEYPEHELQLIFVELPRFVKKLPELQTLAE 176 Query: 176 LLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTR 235 + +DL + + L + E A A L R + E+ R Sbjct: 177 KWIYFM--TQAQDLEEIPESLAEVTAIEKALTIANQANLTPAEAEEVSRRAMQLRDEIGR 234 Query: 236 R----MPQHRERIMTIAERIHNDGYIKGEQRILR----LLLQNGADPEWIQ---KITGLS 284 +E + +G +G R LL + + + GLS Sbjct: 235 IKYATEEASKEAREEGRQEGRQEGRQEGRITEARALVLRLLNKRFPDQTAELNSLVEGLS 294 Query: 285 AEQMQALRQPLPERERYSWLKS 306 ++ L + E + L + Sbjct: 295 LSALEGLSDAMFELNNWQDLLT 316 >UniRef50_A7C3X3 Putative uncharacterized protein n=7 Tax=Beggiatoa RepID=A7C3X3_9GAMM Length = 308 Score = 74.4 bits (181), Expect = 5e-12, Method: Composition-based stats. 
Identities = 42/317 (13%), Positives = 105/317 (33%), Gaps = 36/317 (11%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHS 62 +D +FK P F+ L DL+ +E D + + + Sbjct: 2 KEVAPLRYDVIFKKAFGVPKIFTAFVHDFLN------IDLEIDTVEKDKVYDPPIGNVAA 55 Query: 63 DILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRY-SMAVMQRHIEHDKRQPLPLVIPM 121 + + + + V ++H D + R + Y A++++ + +P V + Sbjct: 56 --KFDLYAEDKKNRVIVDMQHVRFPDHY--DRFLHYHCAALLEQVVYSKDYRPNLKVFTL 111 Query: 122 LFYH-GSRSPYPWSLCWLDEFADP-----TTARKLYNAAFPLVDVTVVP--DDEIVQHRR 173 + G R ++ D T K+ + ++ P E ++ Sbjct: 112 VILTSGDRHKKDITITDFDPKDLEGNPIGETEHKIIHICPKYLNKAHTPPQYHEWMEAIE 171 Query: 174 VALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL 233 +L E + + + I +L+ +++ + + ++ + + ++ Sbjct: 172 DSLDEQVDESKYTHPEIQQIFKLIEKDKVTPQERAKMFDEYSMDAVKQEKIQKIQKKAKE 231 Query: 234 TRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNG-----------------ADPEW 276 +E + + +G +G + L+ G E Sbjct: 232 EGLKEGLKEGLKEGLKEGKEEGLKEGLKEGKEEGLKEGEHKAKEELVRNLWSIGMLTEEQ 291 Query: 277 IQKITGLSAEQMQALRQ 293 I + TGL+ E+++AL++ Sbjct: 292 IAQTTGLTLEKVKALKE 308 >UniRef50_B7I1C8 Putative uncharacterized protein n=16 Tax=Bacillus cereus group RepID=B7I1C8_BACC7 Length = 307 Score = 74.4 bits (181), Expect = 5e-12, Method: Composition-based stats. 
Identities = 46/305 (15%), Positives = 92/305 (30%), Gaps = 17/305 (5%) Query: 2 TNFTTSTPHDALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR 58 + D FK + D F+ L + E SL L+ E+ Sbjct: 6 KKNLVNLRVDYAFKRLFGVEGNEDILIGFLNAVLQSSIDEEI--TSLHLDDPHLPREQKD 63 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHD-KRQPLPL 117 S + G I + IE Q R+ M R + Y + + K L Sbjct: 64 DKLSILDLRATLNSG---IKINIEIQVRDKKDMIERSLFYWSGMYYSQMTQGMKYTELRP 120 Query: 118 VIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNA-AFPLVDVTVVPDDEIVQH----- 171 I + P ++ + + R + +++ V + + Sbjct: 121 TICINIVDFILFPEEQEFHSINTVMNKKSKRIITENMQLHFLEIPKVIQEWQGKRMDPWE 180 Query: 172 RRVALLELIQKHIRQRDLMGLIDQLVVLL--VTECANDSQITALLNYILLTGDEARFNEF 229 +A L+ L +++ + + V + A + + L EAR Sbjct: 181 DSLARWLLLFPAHEDERLTTILEAIAMEKDPVLKKAIEDWERLSSDKDFLRLYEAREKAI 240 Query: 230 ISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQ 289 ++ AE + I + ++ + + G E I K+ LS E++ Sbjct: 241 KDRISEIETAEERAAKKAAEIATEETKIATKIEMIENMFKIGLPIEKIAKVAELSVEEVN 300 Query: 290 ALRQP 294 + + Sbjct: 301 EIIRN 305 >UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B8HNA0_CYAP4 Length = 315 Score = 74.0 bits (180), Expect = 5e-12, Method: Composition-based stats. 
Identities = 46/317 (14%), Positives = 104/317 (32%), Gaps = 39/317 (12%) Query: 22 DTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI 81 D F+ ++ + + L + + + +D L ++ + + + + Sbjct: 4 DNICKFLAESFSTEVATWLLGERISLFKLEPTELSVEPIRADSLILLEAED----LILHV 59 Query: 82 EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ---PLPLVIPMLFYHG----SRSPYPWS 134 E Q+ D M R++ Y + +++R + RQ L +L Y + + ++ Sbjct: 60 EFQTGPDADMPLRMLDYRVRLLRRSPQKVVRQFVIYLRQTTSVLVYQTELQLESTWHEFN 119 Query: 135 -LCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLI 193 + + DP A + P + + E + L I+ Q +L Sbjct: 120 VVRLWECSTDPLLASRGL---LPFAVLGQTSNPEATLAQVAQRLSTIENRTEQSNLTAAS 176 Query: 194 DQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH-----RERIMTIA 248 L L++ + + + L + R + Q + Sbjct: 177 AILAGLVLDQQTIQRLLRREIMRESLFYQGILEEGMQKGVERGIAQGIQLGLEQGRQEGL 236 Query: 249 ERIHNDGYIKGEQRILRLLLQNG---------------ADPEWIQKITGLSAEQMQALRQ 293 E+ +G +G Q + +Q G P+ +I+ L M+ L + Sbjct: 237 EQGRQEGRQEGRQEGRQEGIQQGVLSLVLRSLTRKFGNISPDLQARISQLPLFVMENLSE 296 Query: 294 PLPER----ERYSWLKS 306 L + E +WL++ Sbjct: 297 DLLDFTNLDELLNWLQA 313 >UniRef50_B3QUJ9 Putative uncharacterized protein n=8 Tax=Bacteria RepID=B3QUJ9_CHLT3 Length = 286 Score = 73.7 bits (179), Expect = 8e-12, Method: Composition-based stats. 
Identities = 39/295 (13%), Positives = 99/295 (33%), Gaps = 36/295 (12%) Query: 11 DALFKTFLT---HPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D FK + DF+ LP+ + + L V ++ + ++ Sbjct: 14 DFGFKKLFGTEPNKILLMDFLNQILPEKHQ----IQELSYSKNEHVGQQELDRKA--IFD 67 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIPMLFYHG 126 V G ++ +E Q + + R + Y+ +Q + L V + Sbjct: 68 VYCVGQSGERFI-VEVQKAKQNYFKDRSIYYASFPIQEQAKRGNWDDKLEPVYTVGILDF 126 Query: 127 SRSPYPWSLCWLDEFADPTTARKLYN--AAFPLVDVTVVPDDEIVQHRRVALLELIQKHI 184 + + A +A++ ++ F +++ E + + +H+ Sbjct: 127 IFDDHKLDAELIHVVALKKSAQRSFSDKLKFIYIELPKFKKTEAELETQFDKWLYVFRHL 186 Query: 185 RQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERI 244 Q + + + + A+F++ ++ + Sbjct: 187 SQ---------------LQKRPTKFQEKIFEKLFEAAEIAKFSKNELVAYEESLKYYRDM 231 Query: 245 MTIAERIHNDGYIKGEQ--------RILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + + +G+++G++ I R+L G + I +ITGL+A++++ L Sbjct: 232 KNVVDTSKEEGWLEGQKAGCEQRNYEIARVLKAKGMPIQEISEITGLTAQEIEHL 286 >UniRef50_C9LUC8 Putative uncharacterized protein n=5 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LUC8_9FIRM Length = 325 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. 
Identities = 48/300 (16%), Positives = 97/300 (32%), Gaps = 33/300 (11%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 D +F+ + + ++ + L E L + LE L + Sbjct: 42 KEKKGRQYQDTVFRMYFNEEERLKE-VAGALHGRSYEEEPLKIVTLEGT-----FLSQIK 95 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQ-----PLP 116 +DI + + G + +EHQS + +M R + Y ++++I K LP Sbjct: 96 NDISFLL-----AGRHLIFMEHQSTANQNMPLRCLYYVCEQLRQYIPAKKLYQNTPIKLP 150 Query: 117 LVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVAL 176 +FY G+ L + T ++ Sbjct: 151 APEFHVFYTGNNDMPETCQMKLSDAYVKTDEEIHLELKANFHNI-----------AYDNA 199 Query: 177 LELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNY-----ILLTGDEARFNEFIS 231 L+Q+ D I ++ + I + Y I+ + E + Sbjct: 200 KILLQRSRSIHDYSFFIARIKRNMAAGMERAQAIREAMRYCEESDIMKEFLQQHEREVVD 259 Query: 232 ELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQAL 291 + Q ++ I E G +G+ ++ +L++ E I +I+ S E++Q L Sbjct: 260 MVNFEWNQ-KDFEEAILEEGMERGREEGKVDMVLEMLRDKLPLETIARISKFSMERVQEL 318 >UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B8HL58_CYAP4 Length = 334 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. 
Identities = 41/291 (14%), Positives = 91/291 (31%), Gaps = 44/291 (15%) Query: 6 TSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFV----DEKLRALH 61 + D+ +K L ++ + P+ ++ + F D + + Sbjct: 4 PRSDKDSAWKEIL--RQYFQEAIVFFFPQTAEQVDWTRPYEFLDKEFQQIAPDAETGKRY 61 Query: 62 SDILWSVKTREG-DGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIP 120 +D L V ++G + ++ + +E Q+ + A R+ Y++ + R P + Sbjct: 62 ADQLVKVWLKDGAELWLLIHVEVQAARESEFAQRMFTYNLRIFDRFNH-------PAISL 114 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDD--EIVQHRRVALLE 178 + S P S + +F D + + F V + + E+ Q + Sbjct: 115 AILCDESVRWRPESFSF--DFPDTSL-----SFRFGRVKLLDYRERISELEQSPNPFSIV 167 Query: 179 LIQ--------KHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEAR-FNEF 229 ++ K +QR L L+ L ++ L +I E Sbjct: 168 VMAHLRAQATRKDDQQRKFWKLT--LIRRLYEGGYGRQEVINLFRFIDWVMILPEGLKEE 225 Query: 230 ISELTRRMPQHRER----------IMTIAERIHNDGYIKGEQRILRLLLQN 270 + + + R E+ +G +G Q + Q Sbjct: 226 FWQELKIYEEERRMPFITSVEEIGFERGLEQGRQEGRQEGRQEGRQEGRQE 276 >UniRef50_C9LXS5 Transposase n=3 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXS5_9FIRM Length = 347 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. 
Identities = 38/262 (14%), Positives = 85/262 (32%), Gaps = 22/262 (8%) Query: 42 LDSLKLESASFVDEKLRALHSDILWSVKTREGDGYI--YVVIEHQ-----SREDIHMAFR 94 ++ ++ + DIL+ K + I V IE Q SR + R Sbjct: 93 PKEIRGDNTESASPTEGWIRFDILFRAKVPQTGARITLIVNIEAQKTQSNSRLGYALLRR 152 Query: 95 LMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAA 154 + Y+ ++ E + + Y+ + Y P + Sbjct: 153 AIYYACRLISSQKETEFAKSN--------YNDIKKVYS----VWICMDAPDDKSAINFYD 200 Query: 155 FPLVDVTVVPDDEIVQHRRVALLELIQKHIRQ-RDLMGLIDQLVVLLVTECANDSQITAL 213 E + + ++ + +L+ + L V A +I Sbjct: 201 MQERHFLHRTKAEKSDYDLLNIIMIYLGADDSGNELVRFLKLLFRDTVKSAAEKKKILES 260 Query: 214 LNYILLTGDEARFNEFISELTRRMPQH--RERIMTIAERIHNDGYIKGEQRILRLLLQNG 271 + ++GD + + L+ + + + I E+ G +GE ++ +L+ G Sbjct: 261 EFDLDISGDMEKEMNTMCNLSEGIFERGIEQGIEQGIEQGIEQGIEQGESGMILSMLKKG 320 Query: 272 ADPEWIQKITGLSAEQMQALRQ 293 D I I+ S ++++ L + Sbjct: 321 YDLTSIADISQWSIKKIEQLAK 342 >UniRef50_C4Z2A6 Putative uncharacterized protein n=2 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z2A6_EUBE2 Length = 336 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. 
Identities = 38/315 (12%), Positives = 92/315 (29%), Gaps = 29/315 (9%) Query: 2 TNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALH 61 + TS D +F + + DL L S++ Sbjct: 5 KDNVTSKFKDNVFCMLYRDKRNLLELYNALNNSAYTNVDDLQVTTLNGGSYM-----KYK 59 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRH-----IEHDKRQPLP 116 +D + + + E QS ++ +M R + Y V + + +P Sbjct: 60 NDASFLLCMS------LYMFEQQSSKNPNMPLRFLHYVSDVFRELFSNSMLHRRSTIKIP 113 Query: 117 LVIPMLFYHGSRSP-YPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVA 175 + + FY+G L + + T ++++ Sbjct: 114 VPHFVTFYNGLEKWIEDEDEIRLSDMYEIPTDNPELELKVRVININKDVHILNKCKTLRD 173 Query: 176 LLELIQKHIRQRDLM---------GLIDQLVVLLVTECANDSQITALLNYILLTGDEARF 226 + + K + + +D+ + + + ++ + DE Sbjct: 174 YMTFVNKVRFKMGVEGDDVRIAVTEAMDECIDEDILVDFFEQHREEVVEVSIYDYDEEDV 233 Query: 227 NEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSA- 285 + E + M + E T+ E + + + Q + Q+G + + +I Sbjct: 234 RRTLFEEAKEMAKE-ELKETVIEELKREAKEELTQEVFAEGEQSGEQLKIVNQIIKKVKK 292 Query: 286 -EQMQALRQPLPERE 299 + ++ + L E E Sbjct: 293 SKTLETIASELEEEE 307 >UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8YTL4_ANASP Length = 270 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. 
Identities = 39/288 (13%), Positives = 95/288 (32%), Gaps = 26/288 (9%) Query: 18 LTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRA--LHSDILWSVKTREGDG 75 + + P EL + + F +++ D L+ K + Sbjct: 1 MKTDTIFYSLFQEF-PHIFFELINQSPQEASIYEFTSREVKQLAFRLDGLFLPKINDSTK 59 Query: 76 YIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSL 135 Y+ +E Q + D +RL ++++ + P P + ++ Y ++ Sbjct: 60 PFYI-VEVQFQPDDDFYYRLFAELFLYLKQY-----KPPYPWQV-VVIYPSRGIERQQTI 112 Query: 136 CWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLM--GLI 193 + + L ++ V + + V +++L+ + ++ LI Sbjct: 113 HFDEILVLNRVK------RIYLDELGEVAETSL----GVGVVKLVIETEETAPVLARQLI 162 Query: 194 DQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISEL----TRRMPQHRERIMTIAE 249 Q L A I + I+ + E + L ++ ++E + + Sbjct: 163 AQAKQQLTDVTAKRDLINLIETIIVYKLPQKSREEIEAMLGLNELKQSRVYQEALEEGKQ 222 Query: 250 RIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPE 297 +G + + + ++Q G E I ++ L E +Q Q + Sbjct: 223 EGKQEGKQEAKLETIPRMVQFGLSVEAIAQLLDLPLEVVQQAVQQFNQ 270 >UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D7Q8_9CLOT Length = 351 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. 
Identities = 37/301 (12%), Positives = 81/301 (26%), Gaps = 39/301 (12%) Query: 11 DALFKTFLTHPDTARDFMEI--HLPKDLRELCDLDSLKLESASFVDEKLRALHS-----D 63 D + + D K + + DL + E+ + D Sbjct: 17 DYSVNKLMRNSVRFADLYNGTVFRGKQVLKPEDLSDVPDENGIAIVGLDGKRRLIRRSRD 76 Query: 64 ILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKR----------- 112 ++ G ++ + E+Q + M R M Y +E KR Sbjct: 77 VIKKASF--GAYFVLLAEENQDKVHYAMPVRSMLYDALEYTEQVEALKRRHRECGDRLEG 134 Query: 113 ----------QPLPLVIPMLFYHGSRSPYPWSLCW------LDEFADPTTARKLYNAAFP 156 + V+ + YHG++ + D L + Sbjct: 135 DAFLSGITRDDRIMPVVTLTVYHGAKPWDGPRSLYDMLEMDRDSKEWEALKEVLPDYRLN 194 Query: 157 LVDVTVVPDDEIVQHRRVALLELIQKH-IRQRDLMGLIDQLVVLLVTECANDSQITALLN 215 LV++ + E + + +++ + +R ++ L +D + A+L Sbjct: 195 LVELNNMQHLERFRSSLQPIFTVLKYNRKDKRKFYEYLENHREELRKM--DDDSVRAMLA 252 Query: 216 YILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGADPE 275 + R E + I + +G +G+ G + Sbjct: 253 LLGEQKRLLRMLELPGGEGKERMDVYNAIDELIADGREEGKAEGKAEGRVEGKAIGLELG 312 Query: 276 W 276 Sbjct: 313 Q 313 >UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2V6_DESAS Length = 300 Score = 72.1 bits (175), Expect = 2e-11, Method: Composition-based stats. 
Identities = 45/277 (16%), Positives = 96/277 (34%), Gaps = 29/277 (10%) Query: 35 DLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVV-IEHQSREDIHMAF 93 ++ ++ ++ + SD+L+ V DGY Y++ IE Q R D M Sbjct: 22 EMVRGITVEDVQRVEKEAIA---VKRESDMLFRVS---EDGYEYLMAIEMQIRPDREMPR 75 Query: 94 RLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNA 153 RL+ Y+ K+ P+++ + + + + LD + Sbjct: 76 RLLEYTAM----QHREFKKPVYPVIVNLTGH--KKKDESYCFDCLD--------FTVVTF 121 Query: 154 AFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITAL 213 + ++++ +P + ++ V L+ L+ + + V + D + A Sbjct: 122 NYRQINLSDLPGQDFLRSGPVGLIPLVVLMRHDEAPEEVFAKCVQRVDE--VQDEGLRAD 179 Query: 214 LNYILLTGDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLLLQNGAD 273 L L +F I + + + ++ + + I +GEQ + +Q G Sbjct: 180 LYLGLAVLSTIKFTREI--ILKYIEVNKMENSPLFDGIREKWIDQGEQIGFQKGIQKGIQ 237 Query: 274 PEWIQKITGLSAEQM----QALRQPLPERERYSWLKS 306 Q I E + + L S LK+ Sbjct: 238 QAMQQSILEALEENIGMCSSKIGNKLSSIRDISVLKT 274 >UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cyanobacteria RepID=A8YL21_MICAE Length = 149 Score = 71.4 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 55/145 (37%), Gaps = 16/145 (11%) Query: 1 MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDE--KLR 58 MTN HD LFK ++ +F+E+ P ++ D +S+ + + Sbjct: 1 MTNNID---HDRLFKELIS--TFFVEFIELFFP-EVMNYLDTESITFLDKEVFTDVTEGE 54 Query: 59 ALHSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLV 118 SD++ V+ R + + + +E Q R+ Y ++ + + Sbjct: 55 RHKSDLVAQVRFRGKESFFLIYVEAQESSRKWFNRRMFTYFARFHEKFVL--------PI 106 Query: 119 IPMLFYHGSRSPYPWSLCWLDEFAD 143 P++ + S+ ++ +F D Sbjct: 107 YPIVIFSYSQPKKEAISQYVVDFPD 131 >UniRef50_B0MQP0 Putative uncharacterized protein n=2 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQP0_9FIRM Length = 289 Score = 71.0 bits (172), Expect = 5e-11, Method: Composition-based stats. 
Identities = 30/284 (10%), Positives = 87/284 (30%), Gaps = 18/284 (6%) Query: 3 NFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCD-LDSLKLESASFVDEKLRALH 61 + D +FK T + ++ +L L D +++L + ++ + + + + Sbjct: 19 SNIVKAKLDIIFKKLFTDEGN-QHLLQAYLSDTLGIPYDSIENLVVLNSEIMPDSITEKY 77 Query: 62 SDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDK-RQPLPLVIP 120 S + +K + +E Q +++ R + Y + ++ + L I Sbjct: 78 SRMDIRMKANGR----LINVEMQIKDEGDYKDRSLYYLSKLYSGQLKSGEVYGSLNQCIS 133 Query: 121 MLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELI 180 + + + D + ++ + + +++ L LI Sbjct: 134 INIINFNLFDCEKYHSSFSMREDSRNEQLTDKFTAHYFELKKIGKNIDKNNKQELWLRLI 193 Query: 181 QKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH 240 + +L Q ++ + + ++ R + E + Sbjct: 194 NAETEDE---------LDMLQQTGVKQIQDAVVVLHKMSADEKTRELAEMREKALHIEAT 244 Query: 241 RERIMTIAER--IHNDGYIKGEQRILRLLLQNGADPEWIQKITG 282 + G + E ++ + ++G E I+ I Sbjct: 245 EKAHARAEGEAVGLKKGEKRKEAEMISKMRKSGLSEEQIKAILN 288 >UniRef50_C9RP54 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RP54_FIBSS Length = 312 Score = 71.0 bits (172), Expect = 5e-11, Method: Composition-based stats. 
Identities = 42/287 (14%), Positives = 90/287 (31%), Gaps = 20/287 (6%) Query: 11 DALFKTFLTH---PDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D +FK + + P+ F+ L + +L + V + DI + Sbjct: 35 DGIFKMLIANEAKPERTVKFLNAMLGLTGDKAIKTYTLGVPENPGVLND-KTAIFDIYGT 93 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 + E V+IE Q + RL+ Y+ V+ R ++ + LP + + + Sbjct: 94 TQAGEP-----VLIEVQQNFNTLFVDRLIYYTARVISRTVKKAQDYNLPHIYVLSILTEN 148 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 + P LV++ E + ++ Q Sbjct: 149 QFPRERDTYLHHAQLVRNRHLFYSKLDIYLVELEKFFAIEDRT---------LPENREQS 199 Query: 188 DLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQHRERIMTI 247 D ++ +L + + ++ LL+ + + L ++ Sbjct: 200 DRAEMLRIFRDVLEDKDIPEEKLKRLLDKDFANDVSFKGYTDETLLNEV--DGMTDMLYE 257 Query: 248 AERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQP 294 + + G I +L G E I ++T LS ++ L+ Sbjct: 258 KQGSYLQGKDDERNEIAIAMLAEGDSIEKIARVTKLSENDVRKLQAE 304 >UniRef50_C7GDU7 Putative uncharacterized protein n=1 Tax=Roseburia intestinalis L1-82 RepID=C7GDU7_9FIRM Length = 318 Score = 71.0 bits (172), Expect = 5e-11, Method: Composition-based stats. 
Identities = 34/300 (11%), Positives = 89/300 (29%), Gaps = 42/300 (14%) Query: 11 DALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLR---ALHSDILWS 67 D + K ++ D +L +++ D L + + + D+L + Sbjct: 5 DIVTKEYMRENAIFADAFN-YLIYGGKKVIDPAGLTEIDTATSAVGKKDALQKYRDVLKA 63 Query: 68 VKTREGDGYIYVV--IEHQSREDIHMAFRLMRYSMAVMQRHIEH---------------- 109 ++ + YV+ +E+Q+ M R Y + + Sbjct: 64 AVIKQDEKMSYVLLGVENQTDVHYAMPVRNAIYDALQYGKQVSDIAAGHRRSQKDYSGKT 123 Query: 110 --------DKRQPLPLVIPMLFYHGSRSPY-PWSLCWLDEFADPTTARKLYNAAFPLVDV 160 K + VI ++ + G+ P SL + D + N L+D Sbjct: 124 GGEYLSGFLKEDHIKPVITLVIHFGAEEWDGPLSLHEMMPIRDMEILSYVENYRIHLIDP 183 Query: 161 TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLT 220 + ++E+ ++ + + +I+ + + + ++ Sbjct: 184 AKLTEEEL--NKFSTSMREVMGYIKYSNNKEKLLDFLRT--------DTHKSIEMNAARV 233 Query: 221 GDEARFNEFISELTRRMPQHRERIMTIAERIHNDGYIKGEQRILRLL-LQNGADPEWIQK 279 + + + I + G ++G+ + + L+ + I K Sbjct: 234 IKTITKTPIKISEEKEGIEMCQAIEELIAESEARGEVRGKAEGMIEMCLEMNIPKKGIIK 293 >UniRef50_C9RLI8 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLI8_FIBSS Length = 323 Score = 70.6 bits (171), Expect = 6e-11, Method: Composition-based stats. 
Identities = 46/301 (15%), Positives = 95/301 (31%), Gaps = 36/301 (11%) Query: 11 DALFKTFLTHPD---TARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWS 67 D +FK T + L + SL+++ + K + DI+ + Sbjct: 39 DGVFKIVFTEEKSHSLLISLLNAMLDLHGGDAIGEISLEMQEFPGIFNK-KNCIVDIIGT 97 Query: 68 VKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGS 127 E V++E Q ++D R+ Y V++ + ++ LP + + Sbjct: 98 TNAGEK-----VLVEIQQQKDKFFKDRVEYYVSRVIENQVHKSEKFELPHIYFLGLLDFE 152 Query: 128 RSPYPWSLCWLDEFADPTTARKLYNAAFPLVDVTVVPDDEIVQHRRVALLELIQKHIRQR 187 P + V++ E +L + Sbjct: 153 LFPEEEHEYIHHVDEMCHGKKFFPKIQKVFVEIEKFFKLE----------KLGFTKDDES 202 Query: 188 DLMGLIDQLVVLLVTECANDSQI-TALLNYILLTGDEARFNEFI-----SELTRRMPQHR 241 D + + V++ E A + + +L + F E + ++T M + Sbjct: 203 DAAQWLRAIRVVIKEEPAPEKIMQNETFRRLLESVKLINFAEELFNCEVKKMTEVMAERE 262 Query: 242 ERIMTIAERIHNDGYIKG-----------EQRILRLLLQNGADPEWIQKITGLSAEQMQA 290 E GY +G ++++ + L + D I K TG S E++ + Sbjct: 263 NAYAEGKEEGRAVGYAEGASAERTKADQEKRQMAKSLKEQNVDVSIIAKSTGFSEEEILS 322 Query: 291 L 291 L Sbjct: 323 L 323 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.125 0.309 Lambda K H 0.267 0.0386 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,597,378,274 Number of Sequences: 3077464 Number of extensions: 61736433 Number of successful extensions: 365564 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 993 Number of HSP's successfully gapped in prelim test: 1156 Number of HSP's that attempted gapping in prelim test: 349833 Number of HSP's gapped (non-prelim): 8072 length of query: 306 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 178 effective length of database: 646,480,964 effective search space: 115073611592 effective search space used: 115073611592 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.9 bits) 
S2: 92 (40.1 bits)