BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (296 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 612 e-174 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 405 e-111 UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 342 1e-92 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 327 2e-88 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 313 6e-84 UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 312 8e-84 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 310 5e-83 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 306 7e-82 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 304 2e-81 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 290 5e-77 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 265 1e-69 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 264 3e-69 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 262 1e-68 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 253 4e-66 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 247 4e-64 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 243 6e-63 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 239 6e-62 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 224 2e-57 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 221 2e-56 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 221 3e-56 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 213 8e-54 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 201 2e-50 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 199 1e-49 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 194 3e-48 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 181 3e-44 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 168 2e-40 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 155 2e-36 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 154 3e-36 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 150 4e-35 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 143 7e-33 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 138 2e-31 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 131 3e-29 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 128 2e-28 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 126 1e-27 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 123 8e-27 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 121 3e-26 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 114 6e-24 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 108 2e-22 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 108 2e-22 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 108 3e-22 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 107 5e-22 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 106 8e-22 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 105 1e-21 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 105 1e-21 UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ... 105 2e-21 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 104 3e-21 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 102 1e-20 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 100 8e-20 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 99 2e-19 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 95 3e-18 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 95 3e-18 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 91 3e-17 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 89 2e-16 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 88 3e-16 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 87 1e-15 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 86 2e-15 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 82 2e-14 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 82 2e-14 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 82 2e-14 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 80 6e-14 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 80 1e-13 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 77 6e-13 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 75 2e-12 UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersini... 73 9e-12 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 73 1e-11 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 72 3e-11 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 70 1e-10 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 69 1e-10 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 69 3e-10 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 68 5e-10 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 67 7e-10 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 67 1e-09 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 65 3e-09 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 65 3e-09 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 65 4e-09 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 64 8e-09 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 62 3e-08 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 62 3e-08 UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=... 60 9e-08 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 59 3e-07 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 55 2e-06 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 55 2e-06 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 55 4e-06 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 54 7e-06 UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 54 8e-06 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 53 1e-05 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 52 2e-05 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 50 1e-04 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 49 3e-04 UniRef50_B7UFQ6 Predicted protein n=11 Tax=Escherichia RepID=B7U... 48 4e-04 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 48 5e-04 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 47 7e-04 UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF... 47 7e-04 UniRef50_B1EI63 Putative uncharacterized protein n=1 Tax=Escheri... 47 0.001 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 47 0.001 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 46 0.001 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 45 0.002 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 44 0.005 UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 43 0.012 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 43 0.013 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 42 0.020 UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 41 0.061 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 612 bits (1579), Expect = e-174, Method: Compositional matrix adjust. Identities = 296/296 (100%), Positives = 296/296 (100%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY Sbjct: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM Sbjct: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ Sbjct: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK Sbjct: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH Sbjct: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 405 bits (1040), Expect = e-111, Method: Compositional matrix adjust. Identities = 194/311 (62%), Positives = 242/311 (77%), Gaps = 20/311 (6%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT STTS+PHDAVFK+F+ P+TARDF++IHLP PLRKLC+L TL+LEP SFI++ LR Y Sbjct: 1 MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSD+LWSV+T EG GYIY VIEHQS E+ MAFR+MRY+ AAMQ HLD GY +PLV+P+ Sbjct: 61 YSDVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPL 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG SPYPYSL WLDEF +P +AR++Y+ AFPLVDIT+VPDDEIMQHR++ALLELIQ Sbjct: 121 LFYHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR RDL+G+VD+I +LLV G TND QL+ LFNY+LQ GD RF FI EIAER+P +K Sbjct: 181 KHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQK 240 Query: 241 EKLMTIADRLRE--------------------EGAMQGKHEEALRIAQEMLDRGLDRELV 280 E LMTIA+RLR+ EG +G HE+A++IA ML++G +RE+V Sbjct: 241 EILMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREIV 300 Query: 281 MMVTRLSPDDL 291 + T+L+ D+ Sbjct: 301 LAATQLTDADI 311 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 342 bits (877), Expect = 1e-92, Method: Compositional matrix adjust. Identities = 164/309 (53%), Positives = 222/309 (71%), Gaps = 16/309 (5%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 +TT TPHDA F+ FL PD ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+SD Sbjct: 5 NTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSD 64 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 +L+S+KT G GYI+V++EHQS P++ MAFR++RY++AAMQ HL+AG+K+LPLV+P+LFY Sbjct: 65 VLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVLFY 124 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 G RSPYPYS WLDEF + A+A K+YSSAFPLVD+TV+PDDEI HR MA L L+QKHI Sbjct: 125 TGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHI 184 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 QRDL LVD++ +L+ G + Q+ +L +Y++Q G+ AF+ E+A+R PQ + L Sbjct: 185 HQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDAL 244 Query: 244 MTIADRLR----------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 MTIA +L E+G +G+ E L+IA+ ML +DR VM +T L+ Sbjct: 245 MTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTGLT 304 Query: 288 PDDLIAQSH 296 DDL H Sbjct: 305 EDDLAQIRH 313 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 327 bits (839), Expect = 2e-88, Method: Compositional matrix adjust. Identities = 160/309 (51%), Positives = 220/309 (71%), Gaps = 16/309 (5%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 ++T TPHDA F+ FL P+ ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+SD Sbjct: 5 NSTPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSD 64 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 +L+S+ T EG GY++V+IEHQS P++ MAFR++RY+IAAMQ HL+AG+ +LPLV+P+LFY Sbjct: 65 VLYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAKLPLVIPVLFY 124 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 G RSPYPYS WLDEF +P +A K+YS AFPLVD+TV+PDD+IM+HR MA L L+QKHI Sbjct: 125 VGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQKHI 184 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 QRD+ L D++ +LL+ + Q+ AL +Y+LQ G++ AF+ E+A+R PQ + L Sbjct: 185 HQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDAL 244 Query: 244 MTIADRLR----EEGAMQGKHE------------EALRIAQEMLDRGLDRELVMMVTRLS 287 MTIA +L E+G M+G+ E L +A+ +L G+ E V T LS Sbjct: 245 MTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATGLS 304 Query: 288 PDDLIAQSH 296 DDL H Sbjct: 305 EDDLAQIRH 313 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 313 bits (801), Expect = 6e-84, Method: Compositional matrix adjust. Identities = 156/330 (47%), Positives = 215/330 (65%), Gaps = 39/330 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M T TPHDA+FK FL H DTARDF++IHLPA LR +CDL TL+LE SFI+++LR + Sbjct: 1 MKRKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVH 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSD+L+S+KT +G Y+Y VIEHQS P+++MAFR+MRYSI+AMQ HL+ G+K+LPLV+P+ Sbjct: 61 YSDILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKKLPLVIPV 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG PYP+S W D F A+A +IYSSAFPLVD+TV+PDDEI+ H+++ALLE++Q Sbjct: 121 LFYHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIVQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRD+ L ++ L LK++ NY+L GD FI ++AE+ P+ + Sbjct: 181 KHIRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKYE 240 Query: 241 EKLMTIADRLREEGAMQGKHE--------------------------------------- 261 E LMTIA +L+ +G +G E Sbjct: 241 EVLMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEKR 300 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 +L+IA+ ++D G+DRE +M T LS ++L Sbjct: 301 ASLKIARALMDNGIDRETIMKSTGLSQNEL 330 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 312 bits (800), Expect = 8e-84, Method: Compositional matrix adjust. Identities = 152/285 (53%), Positives = 207/285 (72%), Gaps = 5/285 (1%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 ++TPHDAVFK FL H +TARDF++IHLP LR+LCDL TL LE SFI+E L+ + +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY G Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHIRQ Sbjct: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 RDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R E +MT Sbjct: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG-GESMMT 243 Query: 246 IADRLR----EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 +A E+G QG+ E + AQ +L +G+ RE V + L Sbjct: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANL 288 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 310 bits (793), Expect = 5e-83, Method: Compositional matrix adjust. Identities = 159/303 (52%), Positives = 206/303 (67%), Gaps = 12/303 (3%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M+ T TPHDAVF+ FL TA+DF DI LP ++ LCD TLK E SFID D++ Y Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SV GY+Y +IEHQS P++LMA+R+MRYS+AAMQ HL+AG+ +LPLV P+ Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDKLPLVFPV 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFY G +SP+PYS WLD F P IA KIYS F L+D+T + DD IMQHR+MALLELIQ Sbjct: 121 LFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR+RD+ L+D IV LL D Q+ + NY++Q G+A R FI EIA+RA + + Sbjct: 181 KHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKHE 240 Query: 241 EKLMTIADRL------------REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 E LMTIA+ L R+EG QG+H A++IA++ML RG+ R+ V T LS Sbjct: 241 EALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLSD 300 Query: 289 DDL 291 + L Sbjct: 301 NAL 303 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 306 bits (783), Expect = 7e-82, Method: Compositional matrix adjust. Identities = 149/295 (50%), Positives = 207/295 (70%), Gaps = 4/295 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT T TPHDAVFK FL +TA+DF DI LP ++ LCDL +LK+E SFID +++ Y Sbjct: 7 MTKKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNY 66 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SV T +G GYIYV+IEHQS P++L+A+R+MRYS+AAMQ HL+ G K+LPLV P+ Sbjct: 67 QSDILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQLPLVFPI 126 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFY G +SP+PYS WLD F + +A IY++ F L D+T + D EIMQH+++ALLEL+Q Sbjct: 127 LFYCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELLQ 186 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR+RD+ L+D IV LL D Q+ +FNY++Q G+AQR FI IA++A + + Sbjct: 187 KHIRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKHE 246 Query: 241 EKLMTIADRLRE----EGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 LMTIA ++ E +G QG + + +A++ L G+DR V + T LS ++L Sbjct: 247 GALMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEEL 301 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 304 bits (779), Expect = 2e-81, Method: Compositional matrix adjust. Identities = 153/303 (50%), Positives = 210/303 (69%), Gaps = 21/303 (6%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 + ++TPHDAVFK FL H +TARDF+DIHLPA LR+LCDL TL LE SFI+E L+ + +D Sbjct: 3 APSTTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHSTD 62 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 +L+SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY Sbjct: 63 VLYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFY 122 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 G +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHI Sbjct: 123 QGEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHI 182 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 RQRDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R K + Sbjct: 183 RQRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGK-SM 241 Query: 244 MTIA--------------------DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 MT+A ++ E+G QG+ E + A +L +G+ RE V + Sbjct: 242 MTLAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPREDVAEM 301 Query: 284 TRL 286 L Sbjct: 302 ANL 304 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 290 bits (741), Expect = 5e-77, Method: Compositional matrix adjust. Identities = 156/310 (50%), Positives = 209/310 (67%), Gaps = 20/310 (6%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 +TT TPHDA F+SFL +PD ARDF+++HLPA R+LCDL+TLKLEP +F++ DL QY SD Sbjct: 7 TTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYASD 66 Query: 64 LLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 +LWSVKT G GY+Y +IEHQS M FRM+RYS+AAMQ HL+ +K LPLV+P+LF Sbjct: 67 ILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVLF 125 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 YHG RSPYPYS+ WLD F PA+A KIY+ FPLVDITVV D+EIM HR+MA L L+ KH Sbjct: 126 YHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLMKH 185 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 IRQRD+L +D +V L ++ Q+ LFNY+L G F+ +A+R PQ ++ Sbjct: 186 IRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNYLL-NGSEHVTVEFLQTLAQRLPQHEDS 243 Query: 243 LMTIADRLREEGAMQGKH----------------EEALRIAQEMLDRGLDRELVMMVTRL 286 +MT+A+RL++EG QG ++A IA+E+ + G+ + +T L Sbjct: 244 IMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAREIARELRNAGMPAAQICQLTGL 303 Query: 287 SPDDLIAQSH 296 S +L +H Sbjct: 304 SEAELKNITH 313 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 265 bits (678), Expect = 1e-69, Method: Compositional matrix adjust. Identities = 132/297 (44%), Positives = 192/297 (64%), Gaps = 9/297 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S S PHDA+FK FL H AR F++IHLP +R+ CDL L++ P +FI+ DL YS Sbjct: 1 MSVVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+L S+KT +G GYIY +IEHQS P++ M RMMRY++AA+Q HLD G+ ++PLV+P+LF Sbjct: 61 DVLLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHDVPLVIPILF 120 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 Y G SPYPYS+ WL+ F P +A++I+ +FPLVD+TV+PD+EIM HR +A LE+ K Sbjct: 121 YQGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKI 180 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 IR RD+L +D + +LL +D + +F Y+L+ G+ + + + PQ + K Sbjct: 181 IRLRDILENIDPMATLLALDYNDDLSIDVVF-YLLRYGNTDDREKIVKILIQAKPQLEGK 239 Query: 243 LMTIADRL--------REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 +MTI ++ R+EG +G+ E L +AQ ML D +M +T LS +L Sbjct: 240 IMTIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGEL 296 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 264 bits (675), Expect = 3e-69, Method: Compositional matrix adjust. Identities = 125/201 (62%), Positives = 160/201 (79%), Gaps = 4/201 (1%) Query: 96 MRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP 155 MRY+IAAMQNHLDAGYK LP+V+P+LFYHG SPYPYSLCWLD FA+P +AR++Y+SAFP Sbjct: 1 MRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAFP 60 Query: 156 LVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY 215 L+D+T++PDDEIM HR+MALLELIQKHIRQRDL+GLV+Q+ LL +G N RQ+K LFNY Sbjct: 61 LIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFNY 120 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 +LQTGDA RF FI +A+R+P+ K LMTIA+RLR+E G+ +AL IA+ ML+ G+ Sbjct: 121 ILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLRQE----GEQSKALHIAKIMLESGV 176 Query: 276 DRELVMMVTRLSPDDLIAQSH 296 +M T +S ++L A S Sbjct: 177 PLADIMRFTGVSEEELAAASQ 197 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 262 bits (669), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 128/291 (43%), Positives = 199/291 (68%), Gaps = 5/291 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT +T HDA+FK FL HP+ ARDF +HLPA + LCDL+TL+LEP SF++ LRQ Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVL 118 +SD+L+SV+ EG GYIY +IEHQSKP+ LM FR+M Y+++A+ +HL K LPLV+ Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 P LFY G PYPYS+ WLD FA+PA+A+++Y+ +FPLVD++V+ D+EI+ H+ +ALLEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTG--NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 +QKHIR RD L V I++ ++ NT D Q++++ Y+ G F ++ + Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVD-QVRSVIEYIAYQGYILDESRFFSQLIALS 239 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 P+ K L TIA++L ++G +G + + ++ +++G+++ + + V +++ Sbjct: 240 PEYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVA 290 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 253 bits (647), Expect = 4e-66, Method: Compositional matrix adjust. Identities = 141/304 (46%), Positives = 183/304 (60%), Gaps = 54/304 (17%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA+FK FL ARDF+ IHLP +R+ CD TL+LE SFIDE LR SD+L+S+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 T G GYIY VIEHQS+PE+ MAFR++RY +AAMQ HLD G+ LPLV+P+LFYHG P Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDRLPLVVPLLFYHGRSRP 123 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLL 189 YPYSL WLD FA P +A+ +Y FPLVD+TV+PDDEI HR+MALLEL+QKHIR RD+L Sbjct: 124 YPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRDML 183 Query: 190 GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRA--FIGEIAERAPQEKEKLMTIA 247 L R++ LF +R+ A IG+ E +MTIA Sbjct: 184 ELA--------------REIGLLF---------ERWAAPLSIGQ---------EDIMTIA 211 Query: 248 DRLR--------------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 ++L+ E+G QG A +IA+ +L G+D+ V T+L Sbjct: 212 EQLKKMGFDEGIQRGIQQGLAQGLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLE 271 Query: 288 PDDL 291 ++L Sbjct: 272 TEEL 275 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 247 bits (630), Expect = 4e-64, Method: Compositional matrix adjust. Identities = 130/305 (42%), Positives = 191/305 (62%), Gaps = 18/305 (5%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD +FK FLR PDTARDF+ +HLPA +R L TLKLEP SF+D+ LR+ +SD+L+SV+ Sbjct: 12 HDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLRELHSDVLYSVE 71 Query: 70 TQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 T EG GYIY ++EHQS + +MA+RMMRYS+A M HL G LP+V+P+LFY G Sbjct: 72 TAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGNGTLPVVVPLLFYQGMVR 131 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 PYPYS W+D F PA+AR++YS +PLVD++V+ D ++ HR+MALLEL+Q+ IR RD Sbjct: 132 PYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLELVQRDIRHRDA 191 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG-DAQRFRAFIGEIAERAPQEKEKLM-TI 246 L+ +V L+ Q++A+ Y++ G ++ F+ E+A P+ KE +M TI Sbjct: 192 ASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGEIPEYKELIMGTI 251 Query: 247 ADRLR---------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 A +L+ + +++ + + L A +LD G+ E+V+ T L+ + L Sbjct: 252 AQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEVVIKSTGLNRETL 311 Query: 292 IAQSH 296 H Sbjct: 312 EQPRH 316 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 243 bits (620), Expect = 6e-63, Method: Compositional matrix adjust. Identities = 120/320 (37%), Positives = 182/320 (56%), Gaps = 27/320 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + PHD+ FK F+ D ARDF ++HLP ++ LC+ TLKL SF+D+ LR Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +SD+L+SV+T +G GY Y ++EHQS P++LM +R+M Y+ AM HL G++ LPLV+P+ Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQGHQSLPLVVPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG +SPYPYS W D F +A +Y + PLVD+TV DDE+M HRK+A +EL+ Sbjct: 121 LFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELVF 180 Query: 181 KHIRQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH R D+ GL +++ +L + + + NY+ D + + + ++ + Sbjct: 181 KHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEKH 240 Query: 240 KEKLMTIADRLREEGAMQG-----------------------KHEEALRIAQEM---LDR 273 +E +M IA RLR EG +G + + AL + Q+ L Sbjct: 241 QETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLKL 300 Query: 274 GLDRELVMMVTRLSPDDLIA 293 GL +++ +T LSP D+ A Sbjct: 301 GLSVDIISQITGLSPSDIHA 320 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 239 bits (611), Expect = 6e-62, Method: Compositional matrix adjust. Identities = 115/259 (44%), Positives = 167/259 (64%), Gaps = 1/259 (0%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT+ + PHD+ FK F+ D ARDF +I+LP ++ LC+L TLKL SFID+ LR Sbjct: 5 MTMQLIARPHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSR 64 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +SD+L+SV+T +G GY Y+++EHQS P++LM +R+M Y+ AM HL G LPLV+P+ Sbjct: 65 FSDMLYSVQTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNALPLVVPI 124 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG +SPYPYS W D F +A +Y + PLVD+TV DDEI+ HRK+A +EL+ Sbjct: 125 LFYHGKQSPYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELVL 184 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDR-QLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH RD L ++ + ++ +++ N N R + + NY+ D + + + E+ Sbjct: 185 KHSTLRDDLIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEGY 244 Query: 240 KEKLMTIADRLREEGAMQG 258 +E +MTIADRLR EG +G Sbjct: 245 QETVMTIADRLRNEGLEKG 263 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 224 bits (571), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 112/270 (41%), Positives = 171/270 (63%), Gaps = 20/270 (7%) Query: 42 LTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 L+TL + SFI++DL SD+L+S+K+ G YIY +IEHQS PE +MAFR++RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 102 AMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITV 161 AM HL+ K+LP+V+P+LFYHG SPYPY+ WLD FA+ +A +Y AFPLVD+T Sbjct: 63 AMHRHLEQENKQLPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVTA 122 Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 + D+EI++HR+MAL+E++QKHIR R++L L ++ +LL + Q K L Y++ G+ Sbjct: 123 MEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAGN 182 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLR--------------------EEGAMQGKHE 261 F+ +A+ AP +E +MTIA++L +EG GK + Sbjct: 183 TTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKKQ 242 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 L+IA++ L G++R++V M T L+ D+ Sbjct: 243 ATLKIARQFLVNGVERDIVKMSTGLTDRDI 272 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 221 bits (564), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 123/302 (40%), Positives = 184/302 (60%), Gaps = 19/302 (6%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 STPHD +FK F AR+F +IHLP+ + K+ +LK+ P SFID+ L+Q +SD+++ Sbjct: 4 STPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDMVY 63 Query: 67 SVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 S +T G GY+Y V+EHQS +++MAFRM +YS+A MQ HLD G+ LPLVLP+LFYHG Sbjct: 64 SFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDTLPLVLPVLFYHG 123 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 +SPYP+S+ W D F E +AR + S FPLVD+T++P++EIM+H ++ LE+ QK + Sbjct: 124 QKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMVHT 183 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 RD++ + ++ L ND K+L Y+ Q G+ F ++ + ++E +MT Sbjct: 184 RDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALS--STTQRENVMT 241 Query: 246 IADRLR----------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 IA+ L+ EEG +G+ E IA+ +L+ G + V M T LS D Sbjct: 242 IAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLSED 301 Query: 290 DL 291 L Sbjct: 302 SL 303 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 221 bits (562), Expect = 3e-56, Method: Compositional matrix adjust. Identities = 107/255 (41%), Positives = 158/255 (61%), Gaps = 1/255 (0%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 + HDA FK F+ + A+DF IHL L+ CD +TLKL+ +SFID LR SD+L+S Sbjct: 8 SSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSRMSDILYS 67 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 VKT++G IY +IEHQS+P++++A+RMM Y+ M HL GY LPLV+P+LFYHG R Sbjct: 68 VKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQGYTSLPLVVPILFYHGKR 127 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 PYP+S+ WLD F +A ++Y + F L+D+ + D+ ++ HRK A++E+ KH+ D Sbjct: 128 KPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAMKHVNSCD 187 Query: 188 LLGLVDQIVSLLVT-GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 L + ++S + N +D A+ Y+ DA F + I +IAE+ +E +M I Sbjct: 188 DLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDNHRETIMNI 247 Query: 247 ADRLREEGAMQGKHE 261 A RL +G GK E Sbjct: 248 AWRLENKGFKLGKME 262 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 213 bits (541), Expect = 8e-54, Method: Compositional matrix adjust. Identities = 112/298 (37%), Positives = 177/298 (59%), Gaps = 14/298 (4%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HDA+FK+F + A FI I+LP +++ CD +TLK+EP SF+D DL+Q++SD+L+S Sbjct: 7 NAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDILYS 66 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 +K GY+Y+ +EHQS EELM FRM RY +A MQ HL+ G K+LPLV+ MLFYHG + Sbjct: 67 LKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKKLPLVISMLFYHG-K 125 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 YPY L +D + A+ + L+D+ V+PD+EI +H+++A LE++QKHI RD Sbjct: 126 GQYPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQKHIFTRD 185 Query: 188 LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIA 247 L + D IV L+ + L Y+L G+ I ++ E E +M A Sbjct: 186 LEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKLKTIEDYE-EDIMNAA 244 Query: 248 DRL------------REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 +L R+EG +G++ +A+ IA++++ G + + +T LS +++++ Sbjct: 245 QQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEVLS 302 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 201 bits (512), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 104/206 (50%), Positives = 145/206 (70%), Gaps = 4/206 (1%) Query: 91 MAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIY 150 M FRM+RYS+AAMQ HL+ +K LPLV+P+LFYHG RSPYPYS+ WLD F EPA+A KIY Sbjct: 1 MPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKIY 59 Query: 151 SSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLK 210 + FPLVDITVV D+EIM HR+MA L L+ KHIR RD++ L+D++ ++V +D Q++ Sbjct: 60 TKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMV--EISDEQVR 117 Query: 211 ALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 L +Y++ GD+ F+ +AER PQ ++KLMTIA+RL ++G +G E+AL IA ++ Sbjct: 118 VLIHYIVNAGDSVS-PEFMRALAERLPQHEDKLMTIAERLEQKGRQEGALEKALAIACQL 176 Query: 271 LDRGLDRELVMMVTRLSPDDLIAQSH 296 G+ E + T LS +L +H Sbjct: 177 QKMGMTPEQIKQATGLSEAELKNITH 202 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 199 bits (506), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 98/238 (41%), Positives = 146/238 (61%), Gaps = 1/238 (0%) Query: 25 RDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQ 84 + F IHLP L+ CD +TL+L+ +SFID LR SD+L+ VKT+EG IY++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 85 SKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPA 144 S+P++++A+RMM Y+ M HL GYK LPLV+P+LFYHG + PYP+ + W++ F + Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKSLPLVVPILFYHGKKKPYPFPVNWMECFPLSS 125 Query: 145 IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVT-GN 203 +A IYS+ F L+D+T + DD ++ H+K A++E+ KH+ L + ++S + N Sbjct: 126 LANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQKN 185 Query: 204 TNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 D A+ Y+ DA F I +IAER +E +M IA RL +G G E Sbjct: 186 CRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDE 243 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 194 bits (493), Expect = 3e-48, Method: Compositional matrix adjust. Identities = 114/264 (43%), Positives = 156/264 (59%), Gaps = 17/264 (6%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 +TT TPHDA F+ FL PD ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+SD Sbjct: 5 NTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSD 64 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSI-AAMQNHLDAGYKELPLVLPMLF 122 +L+S+KT G I++ + S+ + F + AAMQ HL+AG+K+LPLV+P+LF Sbjct: 65 VLYSLKTTAGDD-IFMSWLNTSQHLTNICFPPDTLCVGAAMQRHLEAGHKKLPLVIPVLF 123 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP-LVDITVVPDDEIMQHRKMALLELIQK 181 Y G RSPYPYS WLDEF + A R+ LVD+TV+PDDEI HR MA L L+ + Sbjct: 124 YTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLLPE 183 Query: 182 HI-----RQRDLLGL--VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 +I Q L G +S+ + GN A + R R+ Sbjct: 184 NIFISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNI-------RRRSLCTRTGT 236 Query: 235 RAPQEKEKLMTIADRLREEGAMQG 258 Q + LMTIA +L ++G +G Sbjct: 237 ACAQHGDALMTIAQQLEQKGIEKG 260 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 181 bits (459), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 103/283 (36%), Positives = 159/283 (56%), Gaps = 14/283 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ TS HDA F+ L+ P ARDF++ L + C+L T++LEP +F+ E LRQ Sbjct: 5 VNKTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSAC 64 Query: 63 DLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D+L S+KT +G GYIY +IEHQS P++ + RMMRY +A M+ H++ +K P+V+P+L Sbjct: 65 DVLLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEE-HKCAPVVIPVL 123 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYS--SAFPLVDITVVPDDEIMQHRKMALLELI 179 FYHG + PYPY + W+D +PA R+IY F LVD++ + DDEI + +MA L Sbjct: 124 FYHGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMFT 183 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 K D++ L+ + ++ L + L + Y+L+ F ++ P Sbjct: 184 MKSGTSGDVIELIGKSIT-LTDKYGSSVHLNTVLTYLLELYQMD-FAELSEAVSTHYPSH 241 Query: 240 KEKLMTIADRLR--------EEGAMQGKHEEALRIAQEMLDRG 274 K +MTIA++L E+G +G+ EE R+ M RG Sbjct: 242 KGVIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRG 284 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 168 bits (426), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 101/299 (33%), Positives = 165/299 (55%), Gaps = 12/299 (4%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T HD +FK L A F+ L + + KL ++ TL+L SF+ + R+ +SD+ Sbjct: 4 TIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREIHSDI 63 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPE-ELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 ++ + E GYI+ ++EH+S ELMAFR ++Y+I+AM + G K+LP+VLP+ Y Sbjct: 64 VYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKKLPIVLPICVY 123 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 HG +SPYP+S D F IAR+I F L+D+TV+ D+E+ + L+E++ KH Sbjct: 124 HGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEMLLKHS 183 Query: 184 RQRDLLGL----VDQIVSLLVTGNTNDRQ--LKALFNYVLQTGDAQRFRAFIGEIAERAP 237 R ++ L + ++ I SLL R +K + N Q + ++ P Sbjct: 184 RAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINET-QDESPNAVEQLVQTLSTAFP 242 Query: 238 QEKEKLMTIADRLR----EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 +EK +MT A +LR E+G QG++EEA+ IA+ +L G+ + V +T LS +++ Sbjct: 243 EEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKEVM 301 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 155 bits (391), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 91/296 (30%), Positives = 165/296 (55%), Gaps = 14/296 (4%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S PHD + K+ L HP+ ++F + PA + K DL +LKL S++ E+LR++++DL++ Sbjct: 10 SNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHNDLVF 69 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-DAGYKE-LPLVLPMLFYH 124 S + GY + V+EHQS P+ LMA R ++Y+IA ++ ++ + G K P+++ + YH Sbjct: 70 SFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNICLYH 129 Query: 125 GCR-SPYPYSLCWLDEFAEPAIARKI-YSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 PYPYS D F +P A+ + + F L D+ P++ + QH + L+E + K+ Sbjct: 130 NANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEKLLKY 189 Query: 183 IRQRDLLGLVDQIVS-----LLVTGNTNDRQLKALFNYVL--QTGDAQRFRAFIGEIAER 235 R RD+ ++++ + L+V G+ + + +YV+ + + + E+ + Sbjct: 190 SRHRDIFNVIEKELKRSKGYLIVRGDYW-KTILIYSSYVIGQEEKSEKDLVSLFKEVLSK 248 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 E+E ++TIA + E G M+GK E + IA+ ML +G + + +T LS D+ Sbjct: 249 --NEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDI 302 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 154 bits (390), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 81/155 (52%), Positives = 106/155 (68%), Gaps = 20/155 (12%) Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 +PDD+IMQHR+MALLELIQKHIR+RDL+GLV+++ LLV G+ ND QLKALFNY++Q G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAM--------------------QGKHE 261 F F+ E+AER PQ K+KLMTIA+RLR+EG + QGK E Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EALRIA M G+D ++ +T L+ +DL +SH Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDLATRSH 155 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 150 bits (380), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 86/185 (46%), Positives = 113/185 (61%), Gaps = 22/185 (11%) Query: 134 LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVD 193 +CWL FA+P IAR+IY FPL+DIT PDDEIM+HR++A+LEL+QKHIRQRDL+ L + Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 194 QIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE--KEKLMTIADRLR 251 Q+V LL G T+ RQLK L +Y+LQ G+A AF+ +A+ P+ KE LM IA L Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 252 --------------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 E+G QG+ + A RIA+ ML GLD LV +T L+P+ L Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 292 IAQSH 296 H Sbjct: 181 ARLQH 185 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 143 bits (361), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 74/177 (41%), Positives = 111/177 (62%), Gaps = 16/177 (9%) Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 P+ + P A+ +Y F L+D+TV+PDD+++QHR++ALLEL+QKHIRQRDL Sbjct: 11 PHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSS 70 Query: 191 LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRL 250 + + + ++++ G TN RQL+ LF+Y+LQ G+ F+ +A R PQ +E LM+IA +L Sbjct: 71 ITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKL 130 Query: 251 REEGAMQGKHE----------------EALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 ++EG +G+ E EALRIA ML GLD+E+V +T LS D+L Sbjct: 131 KQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADEL 187 Score = 43.5 bits (101), Expect = 0.008, Method: Compositional matrix adjust. Identities = 18/25 (72%), Positives = 20/25 (80%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTAR 25 M TSTPHDAVFK FLRHP+TA+ Sbjct: 3 MKKRMTSTPHDAVFKRFLRHPETAK 27 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 138 bits (348), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 72/128 (56%), Positives = 93/128 (72%), Gaps = 4/128 (3%) Query: 169 QHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 +H MALLELIQKHIRQRDL+GLV+Q+ LL +G NDRQ+K LFNY+LQTGDA RF F Sbjct: 13 RHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFNDF 72 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 I +AER+P+ KE LMTIA+RLR+E G+ +AL IA+ ML+ G+ +M T +S Sbjct: 73 IDGVAERSPKHKESLMTIAERLRQE----GEQSKALHIAKIMLESGVPLADIMRFTGVSE 128 Query: 289 DDLIAQSH 296 ++L A S Sbjct: 129 EELAAASQ 136 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 131 bits (329), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 84/306 (27%), Positives = 155/306 (50%), Gaps = 25/306 (8%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD + +S +P +++F ++HLP ++ L LK+E +SF+D+ L++ D+L+S K Sbjct: 7 HDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILFSAK 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-GYKELPLVLPMLFYHGCRS 128 E GY+Y+++EHQS PE MA R+ RY + H + K+ P + P++FY+G + Sbjct: 67 FGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFYNGVQK 126 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 W + F + + +S + L+++ +PD+++ + +L+ KHI +RDL Sbjct: 127 YNAPRNLW-ELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHERDL 185 Query: 189 LGLVDQIVSLLVTGNTND---RQLKALFNY---------------VLQTG-DAQRFRAFI 229 L +++ LL D ++ + Y +LQ+ + ++ + Sbjct: 186 LKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKKRENVM 245 Query: 230 GEIAERAPQ---EKEKLMTIADRLREEGAMQGK-HEEALRIAQEMLDRGLDRELVMMVTR 285 IA Q E+EK + + E+ M K EE + +A+EM+ G E V+ +T+ Sbjct: 246 KSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESVIKITK 305 Query: 286 LSPDDL 291 LS +DL Sbjct: 306 LSKEDL 311 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 128 bits (322), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 84/291 (28%), Positives = 158/291 (54%), Gaps = 14/291 (4%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD++ K + A++F++ +LP +KL DL+ + +E S+I+E L + YSD+++ + Sbjct: 6 KHDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGI 65 Query: 69 KTQE-GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 +T+E G G++Y++IE QS + A R+ +Y++ + H + K LPLV ++ Y+G + Sbjct: 66 ETKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERHKEKRNK-LPLVYNLVIYNGKQ 124 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 W D F +A+K+ + LVD+ + D+EI++ + + +L+ I KHI +RD Sbjct: 125 VYNAPRNLW-DLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERD 183 Query: 188 LLGLVDQIVS-----LLVTGNTNDRQLKALFNYV-LQTGDAQRFRAFIGEIAERAPQEKE 241 ++ L +Q ++ +++ LK+ Y + Q+ R +PQ K+ Sbjct: 184 MIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKD 243 Query: 242 KLM-TIADRLREEGAMQGKHE----EALRIAQEMLDRGLDRELVMMVTRLS 287 +M TIAD +EG +GK E +A+ IA++M +G ++ +T L Sbjct: 244 NIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLK 294 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 126 bits (316), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 92/332 (27%), Positives = 160/332 (48%), Gaps = 58/332 (17%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HDA+ K L A++F++ +LP+ ++L DL +K+E SF+++DL++ YSD+++SV Sbjct: 6 KHDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSV 65 Query: 69 KTQ-EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 KT+ + ++YV+IE QS + +A R+ +Y + + H + K LPL+ P+L Y+G Sbjct: 66 KTRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERHENNKNK-LPLICPLLIYNGS- 123 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 Y + + F +P A+K+ + LVD+ DDEI Q + + ++E KHI QRD Sbjct: 124 EVYNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRD 183 Query: 188 LLGLVDQIV-----SLLVTGNTNDRQLKALFNYV---LQTGDAQRFRAFIGEIAERAPQE 239 +L L D+ + S+++ + L++ Y + Q I + + +E Sbjct: 184 MLKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQII--VKHLSTEE 241 Query: 240 KEKLM-TIADRL--------------------------------------------REEG 254 K+ +M TIA + + EG Sbjct: 242 KDNIMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGKAEGKAEG 301 Query: 255 AMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 +GK EE + IA++ML +G D + VT L Sbjct: 302 KAEGKAEERVEIARKMLSQGCDFSFISSVTGL 333 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 123 bits (308), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 81/288 (28%), Positives = 147/288 (51%), Gaps = 35/288 (12%) Query: 4 STTSTP-HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +T+ P HD +FK + P AR+F++ +LP + +L ++K+E SF+ EDLR+ S Sbjct: 34 NTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRLS 93 Query: 63 DLLWSVK----------TQEGV----GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD 108 D+++SV T++ V Y+YV+IEHQS + +AFR+ +Y + + H D Sbjct: 94 DVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHKD 153 Query: 109 AGY----------KELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD 158 A +LPL+ P++ Y + PY + + F + A+ + + LVD Sbjct: 154 ANNNKSSVTKEKDNKLPLICPIVVYANDK-PYNAPRSFWELFEDSKTAKDMMGDEYLLVD 212 Query: 159 ITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALF-NYVL 217 + DDEI + + + ++E + KHI+ RD+L L ++ + D++ ++ ++L Sbjct: 213 LQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWLL 272 Query: 218 QTGDAQ-------RFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQG 258 DA+ + I + ++ QE E + TIAD+ +EG +G Sbjct: 273 WYSDAKVSEDKQVELASIIAKHLKKEDQE-ELMRTIADKYIDEGVQKG 319 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 61/189 (32%), Positives = 110/189 (58%), Gaps = 3/189 (1%) Query: 13 VFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQE 72 +F+ L +P A +F + HLP ++ L D +L +E +F++ L+ SD+L+S K + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 73 GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYHGCRSPY 130 GY+++++EHQSK + MAFR+ +Y I + +L + K LPL+ PM+F++G + Y Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNG-QEKY 141 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 + D F +A++++ + + LV++ +PD+E Q +LE KHI +R+LL Sbjct: 142 NVARNLWDLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 191 LVDQIVSLL 199 +I +L Sbjct: 202 RWQEISDIL 210 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust. Identities = 77/259 (29%), Positives = 133/259 (51%), Gaps = 32/259 (12%) Query: 61 YSDLLWSVK--TQEGVGY--------IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG 110 Y ++W+V+ + G+ Y +Y +IE+QS +LMAF M+ Y++A M+ HL+ G Sbjct: 11 YDIIIWAVRWYCKYGISYPDLAEMLYVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEG 70 Query: 111 YKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH 170 Y+ELP+++ + Y G +SPYPYS D F +AR+ F L+D++V+ +E+++ Sbjct: 71 YQELPIIVNICIYTGKKSPYPYSQDICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKD 130 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 +E + + R+RD L ++ +L+ ++ L + Y+L T D + Sbjct: 131 GTFGSVEALLRQGRERDYLNWINN-NQVLIWELVSNYGLSIVI-YILTTDDKNDADYLMQ 188 Query: 231 EIAERAPQEKEKLMTIADRLRE------------EGAMQGKHE--------EALRIAQEM 270 I E ++KE ++T A +LR+ EG QGK E +A I + M Sbjct: 189 AIIEAVLEQKEIIVTAAQQLRQVDIQTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSM 248 Query: 271 LDRGLDRELVMMVTRLSPD 289 L GL+ L+ VT +S + Sbjct: 249 LKEGLEISLIQKVTGISRE 267 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 77/274 (28%), Positives = 149/274 (54%), Gaps = 21/274 (7%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 + + HD +FK + P A DFI+ LP ++ + DL T+K+E SF++ +LR+ D+ Sbjct: 2 SENLKHDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDV 61 Query: 65 LWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK------ELPLV 117 L+SVKT+ +IYV+IE + + + +AF++ +Y+++ ++ H K +LP+V Sbjct: 62 LFSVKTKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIV 121 Query: 118 LPMLFYHGC-RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 +P++ YHG R P SL L F +P +A+++ S + L+D +PD EI + AL+ Sbjct: 122 VPIVVYHGADRFNAPRSLWEL--FDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALV 179 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQ-----LKALFNYVLQT---GDAQRFRAF 228 ++ Q D++ L + + L D++ +++L Y + + R + Sbjct: 180 HFMKYIHNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQL 239 Query: 229 IGEIAERAPQEKEKLM-TIADRLREEGAMQGKHE 261 + E + ++++++M TIA + +EG +G+ E Sbjct: 240 LDE--NLSIEDRDRIMGTIAAQYIDEGKAKGRAE 271 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 88/331 (26%), Positives = 163/331 (49%), Gaps = 45/331 (13%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S PHD FK AR F+ +LP + L DL T+ + +S+ID++L++ +S Sbjct: 1 MSLIHNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-DAGYKELPLVLPML 121 DLL+ VK + GY+Y + EH+S P + +A ++++Y + ++ L ++ +LPL++PM+ Sbjct: 61 DLLFQVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMV 120 Query: 122 FYHGCRSPYPYSL---CWLDEFAE--PAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 YHG + + SL +D + + A+ + I + L D++ D E++ + + ++ Sbjct: 121 VYHG-QEKWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLII 179 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKA------LFNYVLQT-GDAQRFRAFI 229 + I +D + + LL++ + Q K L Y+L T D + R + Sbjct: 180 LRTMRDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIY- 238 Query: 230 GEIAERAPQEK-EKLMTIADRL----------------------------REEGAMQGKH 260 EIA+ E+ E +MTIA++L REEG +G+ Sbjct: 239 -EIAKEVSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGRE 297 Query: 261 EEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 E L +A+ +L G++ + V T LS +++ Sbjct: 298 ETKLEVARNLLGLGIEMDKVAKATGLSEEEI 328 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 75/257 (29%), Positives = 134/257 (52%), Gaps = 14/257 (5%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PH+A FK F + P+ + FI H+P + L DL TL+++ + F+ E+ R+YY+D++ + Sbjct: 7 NPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYADVMVT 66 Query: 68 VKTQ---EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPMLF 122 V+ + E V IY+++EH+S PE L +++ Y + + G + LP+++P++ Sbjct: 67 VQLKGHTENVN-IYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVIIPVVI 125 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP--LVDITVVPDDEIMQHRKMALLELIQ 180 YHG + + +S + D F P+ + + F + DI+ + DDE + + L+ Sbjct: 126 YHG-KGRWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIFHLLF 184 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDR---QLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 K+I +L + +I LL T D+ L+A+ YV G R +GE R P Sbjct: 185 KYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISLER--LGEYTRRLP 242 Query: 238 QEKEKLMTIADRLREEG 254 E + T A ++R+E Sbjct: 243 GGDEAMQTAAQQIRQEA 259 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 107 bits (267), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 70/206 (33%), Positives = 110/206 (53%), Gaps = 7/206 (3%) Query: 93 FRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAE-PAIARKIYS 151 F++ RY A M HL G+ LP+V+ ML+Y G +PYPY+ D F + IA KIY Sbjct: 4 FKIARYVHAIMDQHLKQGHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEKIYL 63 Query: 152 SAFPLVDITVVPDDEIMQHRKMALLELIQKHIR-QRDLLGLVDQIVSLLVTGNTNDRQLK 210 +P++DIT + DD I H +A+L+ QK+ RD+ ++ I+ L G Q + Sbjct: 64 RPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTREQCQ 123 Query: 211 ALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLR----EEGAMQGKHEEALRI 266 L Y + D + + ++ E E +M++A ++ + G QG++EE L+I Sbjct: 124 TLLYYTFRETDTDNVKMLLEQLQTIRIYE-EDIMSVAHKIEQQGLQRGLQQGRYEEDLKI 182 Query: 267 AQEMLDRGLDRELVMMVTRLSPDDLI 292 A+ ML +G DR + VT LS DL+ Sbjct: 183 AKRMLAKGTDRGYIKDVTGLSDQDLL 208 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 106 bits (265), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 51/141 (36%), Positives = 87/141 (61%), Gaps = 2/141 (1%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD F++ + A++F + HLP + K DL +L+L+ +SFIDE L+ + Sbjct: 1 MKKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMA 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLPML 121 D+L+SVK GY Y+++EHQ P++LM +R++RY + + +HL Y LP+V+P++ Sbjct: 61 DVLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLV 120 Query: 122 FYHGCRSPYPYSLCWLDEFAE 142 FY+G + YP+ +L A+ Sbjct: 121 FYNGKKR-YPFQRIFLLYLAK 140 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 105 bits (263), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 80/272 (29%), Positives = 137/272 (50%), Gaps = 22/272 (8%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD K+ L PD + LP + +L L +FID + R++ + Sbjct: 1 MTKITQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREHLT 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D L+ VKTQEG YIY +IEH+S +E +AF+++RY + + L G ++LP ++P++ Sbjct: 61 DRLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQQKLPPIVPLV 120 Query: 122 FYHGCRS-PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ--HRKMALLEL 178 YHG R P L E A+ + + +F + D+ + DD++ Q H + AL+ + Sbjct: 121 VYHGAREWTVPNQFSALLE-ADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALMAM 179 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL-KALFNYVLQT------GDAQRF--RAFI 229 K+ Q G+V ++ + G D + K + Y++QT D Q + AF Sbjct: 180 --KYAFQ-GAEGVV--VIPQIGKGAQGDPEFAKLVLRYLIQTYRGMTMADVQAYAEEAFP 234 Query: 230 GEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 GE A Q ++M+ + R+EG +G+ E Sbjct: 235 GEAEHYASQFAREMMS---KGRQEGRQEGRRE 263 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 105 bits (263), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 87/275 (31%), Positives = 132/275 (48%), Gaps = 28/275 (10%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HDA+FK+ + A + LP L D L+L P SF+DE L++ SDLL+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYH- 124 E +Y++ EHQS E LMAFR++RY + ++HL G K LP +LP++ +H Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHHS 131 Query: 125 --GCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA----LLE 177 G + + L LDE A + + F L DI+ DE ++ R M+ L+ Sbjct: 132 ETGWTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDIS-QEGDEALKARAMSAFSRLVL 190 Query: 178 LIQKHIRQRD----LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDA-------QRFR 226 +H R+ D LG +V+ + L+A++ Y+L T + QR Sbjct: 191 WCLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLL 250 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 A GE KE++++ AD+L E G QG E Sbjct: 251 AAAGE------PWKEEIVSAADQLMERGRQQGLRE 279 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 63/139 (45%), Positives = 82/139 (58%), Gaps = 25/139 (17%) Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVL-QTGDAQRFRAFIGEIAERAPQEKE 241 +RQRDLLGLV++I SLLVTG NDRQLKALFNY++ Q G RF FI ++ P KE Sbjct: 36 LRQRDLLGLVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKE 95 Query: 242 KLMTIADRLR------------------------EEGAMQGKHEEALRIAQEMLDRGLDR 277 +LMT+ +R+R E+G +G+H ALRIA++ML GLDR Sbjct: 96 RLMTLIERIRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGLDR 155 Query: 278 ELVMMVTRLSPDDLIAQSH 296 E V T L+ ++L SH Sbjct: 156 ETVQRFTGLTAEELQDVSH 174 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 29/38 (76%), Positives = 34/38 (89%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRK 38 M STTSTPHDAVFK+FLRHP+TARDF++IHLP LR+ Sbjct: 1 MKKSTTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQ 38 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 104 bits (260), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 84/275 (30%), Positives = 136/275 (49%), Gaps = 19/275 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S+ +PHDAVF+ L P A + LPA L DL L + P S +D LR ++ Sbjct: 1 MSSPPSPHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRHT 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--ELPLVLPM 120 DLL++ +IYV++EHQS + LMAFRM+RY + +L +K LP V+P+ Sbjct: 61 DLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVPL 120 Query: 121 LFYHGCRSPY-PYSLCWLDEFAEPAIARKIYSSAFP----LVDITVVPDDEIMQHRKMA- 174 + +H + P + L + A P +A + P L+D V D+ ++ R + Sbjct: 121 VVHHNEHAWVAPTQVLDLVDLA-PDLA-GAWREHLPRFQFLLDDLVRVDERELRERPLTH 178 Query: 175 -------LLELIQKHIR-QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 LL+++ + R +DL VD++ ++L G + L Y+ G+A Sbjct: 179 SVRLTLLLLKIVPGNPRLAQDLRPWVDELRAVL-DGPDGREEFATLLRYIELVGEADARD 237 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 IA P+ ++ MTIA+ LR EG ++G+ E Sbjct: 238 ELHDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVE 272 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 102 bits (255), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 57/194 (29%), Positives = 105/194 (54%), Gaps = 3/194 (1%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD F+ L +P AR+F + +LP ++ L TTL LE +SFID +L++ +D+L+S + Sbjct: 56 HDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSAR 115 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--GYKELPLVLPMLFYHGCR 127 YIY++ EHQS + MAFR+ +Y + + HL + K+ P + P++ Y Sbjct: 116 INNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSNDH 174 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 Y L D F + + +S+ + L+ + + DD++ ++ +A L+++ K+I + + Sbjct: 175 KKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKPN 234 Query: 188 LLGLVDQIVSLLVT 201 + +I L T Sbjct: 235 VFDKWQEISGCLAT 248 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 100 bits (248), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 75/302 (24%), Positives = 145/302 (48%), Gaps = 20/302 (6%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 PHD FK + A+DF+ +LP L K+ D+ TL E +I++DL++ +SDLL+ Sbjct: 7 PHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFSDLLFKA 66 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH-LDAGYKELPLVLPMLFYHGCR 127 GY+Y + EH+S P + +A +++ Y + + L +++P+++PM YHG + Sbjct: 67 NINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMTVYHG-K 125 Query: 128 SPYPYSLCWLD-----EFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 + +L D E I + I + + D++ DDE+ ++ ++ I + Sbjct: 126 ENWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVIKILRS 185 Query: 183 IRQRD--LLGLVDQIVSLLVTGNTNDRQL---KALFNYVLQTGDAQRFRAFIGEIAERAP 237 I + D + + V +L ++ + K Y+L + E + Sbjct: 186 IFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLVKEVSV 245 Query: 238 QEKEKLMTIADRL--------REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 + +++MTIA+ L E+G +GK EE +A+ ++ G++ + VM T LS + Sbjct: 246 ERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKATGLSEE 305 Query: 290 DL 291 ++ Sbjct: 306 EI 307 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 79/284 (27%), Positives = 131/284 (46%), Gaps = 51/284 (17%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD FK + ARDF+ +LP ++ DL L E NS +DE+LR+ SD+L+ Sbjct: 7 NPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLSDMLYK 66 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 K + GYIY+++EH+S E + F+++RY + + D K++P+++PM+ YHG Sbjct: 67 TKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYDPKTKKVPIIIPMVIYHG-- 124 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDD---EIMQHRKMALLELIQKHI- 183 R+I++ L+++ +D E+ + E+ I Sbjct: 125 -------------------REIWNVETNLLNMVQGIEDLPNELKTYLPTYRYEICDFSIK 165 Query: 184 RQRDLLGLVDQIVSL-------LVTGNTNDRQLKALFNYVLQTGDAQRFRAF-------- 228 R++ ++GL V++ +T +L+ +F Y+ Q Q F Sbjct: 166 RKKRIIGLTAMKVAIEAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLL 225 Query: 229 -------IGEI----AERAPQEKEKLMTIADRLREEGAMQGKHE 261 I EI E P E +MTIA++LR EG +GK E Sbjct: 226 NVREDVTIEEILKVQKEIMPGRGEIVMTIAEKLRNEGMEKGKIE 269 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 94.7 bits (234), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 82/295 (27%), Positives = 148/295 (50%), Gaps = 23/295 (7%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR---------QY 60 HD+VFK + + D A F+ +LP L +L D T+KLE + E +R + Sbjct: 5 HDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANV--EHVRQQQKDNQKQKE 62 Query: 61 YSDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--AGYKELPLV 117 SDL + K ++G G ++V IE Q+ + + R Y + + +++ K LPLV Sbjct: 63 QSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLPLV 122 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + +++Y + P+ +SL D FA +A+K Y+ +D+ D+EI++H +A E Sbjct: 123 VSIIYY-ANQKPFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAGYE 180 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 LI K IR++++ G +D ++ + + RQ+ L Y+ Q D + + F ++ P Sbjct: 181 LILKAIREKNIDGKLDIAINQIEAYDHIARQV--LIRYMSQYSDMET-KDFHDKLIYSKP 237 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 + +MT+A++ ++G +G A+ L GL E V+ T L D ++ Sbjct: 238 DLRGDVMTVAEQWEQKGIQKGIQ----TTARNFLLMGLSAEQVVKGTGLDQDTVL 288 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 94.7 bits (234), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 72/267 (26%), Positives = 124/267 (46%), Gaps = 12/267 (4%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 T+ +PHDA+FKS + P A + L P+ D +TL+ EP S+IDE L + +SD Sbjct: 3 GTSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERHSD 62 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-GYKELPLVLPMLF 122 LL+S Y+Y++IEHQS + M RM+ Y H A ++LP +LP++ Sbjct: 63 LLFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPVVV 122 Query: 123 YH---GCRSPYPY-SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 H G +P + SL P + I + D+T + D ++ + L Sbjct: 123 SHAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFATL 182 Query: 179 IQKHIRQR-DLLGLVDQIVSL------LVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 + +R R ++ L+D + + + + + +F+Y+ + + F + Sbjct: 183 VLWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFHAK 242 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQG 258 + E PQ +E + T + L EEG +G Sbjct: 243 LDEHVPQTREVMKTYYEELMEEGMAKG 269 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 91.3 bits (225), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 80/286 (27%), Positives = 139/286 (48%), Gaps = 15/286 (5%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD FK P+ DF++ P +R+ D TTL E ++F DE L ++++DL++S Sbjct: 7 NPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHFADLVFS 66 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 V+ + +++EH+S EE F++ RY + ++ + P VLP+L YHG R Sbjct: 67 VQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQKQPLTP-VLPVLVYHGNR 125 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFP--LVDITVVPDDEI--MQHRKMALLE-LIQKH 182 S+ D FA Y AF L+D++ + D+ + +Q L L+Q Sbjct: 126 RWKQRSIP--DYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAILLQNS 183 Query: 183 IRQRDLLGLVD---QIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 R+R+L L+D +V L R + F Y+ T + + F G + + + Sbjct: 184 RRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVELF-GIFSRISSKI 242 Query: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG--LDRELVMMV 283 + MT+A+ L +EG + + + +A+E++ +G L+R MM Sbjct: 243 ESSTMTVAEELIQEGRELERRQTRM-VAEELIQQGRELERRQAMMA 287 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 83/274 (30%), Positives = 128/274 (46%), Gaps = 27/274 (9%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD++ K+ D A D LP + + DL L L P SF+ ++LRQ ++DLL+ Sbjct: 6 HDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLLFRAP 65 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--AGYKELPLVLPMLFYH--- 124 ++Y+++EHQS E +M R++RY + + HL G LP +LP++ +H Sbjct: 66 LDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLHHSEQ 125 Query: 125 GCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLVDITVVPDD-----EIMQHRKMALLEL 178 G +P L L + A A+ + F L D++ PD+ E+ K+AL L Sbjct: 126 GWTAPTSLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKLALWAL 185 Query: 179 IQKHIRQ-RDLLGLV---DQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR---FRAFIGE 231 K+ R +DLL L+ ++ VT L A+ Y LQ D R I Sbjct: 186 --KNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTDPDALMRFLIDS 243 Query: 232 IAERAPQEKEKLMTIADRL----REEGAMQGKHE 261 + A KE MT A++L RE+ QG+ E Sbjct: 244 AGDPA---KEAFMTGAEKLTQAVREQSLRQGRVE 274 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 76/303 (25%), Positives = 152/303 (50%), Gaps = 33/303 (10%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD K+ L +P TA + LP + + +L SFIDE LR + + Sbjct: 1 MTEIAHPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLT 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN--HLDAGYKELPLVLP 119 D L+ V+T G +YV+IEH+S P+ + +++++Y + A++ + ++ LP ++P Sbjct: 61 DRLYRVRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVP 120 Query: 120 MLFYHGC---RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 +FYHG + P + L +D AE + + F ++D+ + D ++ + + Sbjct: 121 FVFYHGAAAWKVPDAF-LALVD--AEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAW 177 Query: 177 ELIQKHIRQRDL-LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF----IGE 231 L K+ + D L + + ++ LV+ D + + L YV++T +R++ + E Sbjct: 178 LLAAKYATRDDRQLEVKELLIQTLVS--VADEEFRFLMRYVVET-----YRSYDEPMVRE 230 Query: 232 IAERA-PQEKEKLMT-----IADRLREEGAMQGKHE---EALRIAQEMLDRGLDRELVMM 282 I R P+E+E +M+ + + R+EG +G+ E E +++ ++ RG E M Sbjct: 231 IIRRVRPEEEETMMSMFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQ---RGRQEEAAYM 287 Query: 283 VTR 285 + + Sbjct: 288 LLK 290 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 82/293 (27%), Positives = 123/293 (41%), Gaps = 30/293 (10%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 S TS PHDA+F++ HP A + LP L L D + L+ N + L + +D Sbjct: 13 SVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRTD 72 Query: 64 LLWSVKTQ-----EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 LL+S + +G +Y+ IEHQS+ + M R++ Y + + H LP V Sbjct: 73 LLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHGGALPPVF 132 Query: 119 PMLFYHGCRS-PYPYSLCWLDEFAEPA-----IARKIYSSAFPLVDITVVPDDEIMQHRK 172 ++ H + P SL L F EP IA + + D+ D E+ Sbjct: 133 CVVLSHAAKGWTGPRSLVEL--FPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARHA 190 Query: 173 MALLELIQKHIRQ--------RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR 224 L L +R LL DQI++LL + +R L L YV G Sbjct: 191 HPLPALTLWLLRDARSPERLVHRLLDWRDQIIALL-DYDHGERDLAQLLRYVALVGSEMD 249 Query: 225 FRAFIGEIAERAPQEKEKLMTIADRL--------REEGAMQGKHEEALRIAQE 269 F F +A P+ + MTIA++L RE+G +G+ E L +E Sbjct: 250 FEEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREGRLEGQRE 302 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 71/259 (27%), Positives = 122/259 (47%), Gaps = 26/259 (10%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HD FKSF + RDFI +LP ++K DLT ++++ ++ E+ +++YSD++ Sbjct: 7 NAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFYSDVVAK 66 Query: 68 VKTQEGVG--YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFY 123 V + V +Y + EH+SKP + + Y + L G + LP+++P++ Y Sbjct: 67 VYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPIIVPVVIY 126 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP--LVDITVVPDDEIMQHRKMALLELIQK 181 +G +S + +S+ + D F P+ K + F L DI + + M + L+ K Sbjct: 127 NGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEIFHLLLK 185 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALF---NYVLQTG---------DAQRFRA-- 227 +I +L + +I LL ND+ LF YV+ +G A+RF Sbjct: 186 YIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLLEHAKRFSGGE 245 Query: 228 -FIG----EIAERAPQEKE 241 IG EI ER Q ++ Sbjct: 246 EMIGLAAREIEERVEQTRK 264 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 63/258 (24%), Positives = 123/258 (47%), Gaps = 9/258 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S TS HD F++ L ARDF+ HLP + + +L T+K+ S++ ++L++ + Sbjct: 7 MSDTSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMT 66 Query: 63 DLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D++ +++ G IY+++EH+S + ++ +Y Q+ + LP+++P++ Sbjct: 67 DIVITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQKKTGTLPIIVPLV 126 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP--LVDITVVPDDEIMQHRKMALLELI 179 FYHG + + YSL + D F P+ + Y F L ++ V+ ++ + + L+ Sbjct: 127 FYHGT-ARWNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLV 185 Query: 180 QKHIRQRDLLGLVDQIVSLLVTG---NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 ++I + + + + LL G L Y+L D A E + Sbjct: 186 LEYIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATDETPEEA--EEKVKHL 243 Query: 237 PQEKEKLMTIADRLREEG 254 P+ E + T A+ L E G Sbjct: 244 PKGGETVRTTAEVLEERG 261 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 67/261 (25%), Positives = 131/261 (50%), Gaps = 12/261 (4%) Query: 8 TP-HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 TP HDA ++ + + A D+ +P ++ L D +TL+ P++++ ++L++ SD+++ Sbjct: 5 TPKHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSISDIVY 64 Query: 67 SVKTQEGVGYIYV--VIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP-LVLPMLFY 123 + G G + + ++EH+S ++ ++ Y + + + G KE P L++P+L Y Sbjct: 65 VCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQI--GNKESPSLIIPILLY 122 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI--MQHRKMALLELIQK 181 HG ++ L E EPA+ + I + D+ + D+EI + ++ +A L K Sbjct: 123 HGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASLLAMK 182 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQL-KALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 + +D L + + ++L + DR L K+L Y L G+ F+ I Q+K Sbjct: 183 YSALKDQLNTL--LPTILTLASEVDRNLHKSLLFYTL-VGNPLTEEQFLNLIKSVPNQKK 239 Query: 241 EKLMTIADRLREEGAMQGKHE 261 E +M I + E+G +G E Sbjct: 240 EAIMDIFEIFEEKGWKKGIEE 260 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 68/274 (24%), Positives = 136/274 (49%), Gaps = 20/274 (7%) Query: 7 STP-HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 S P D++FK DF+ LP K T LK E I +D SD+L Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 66 WSVKTQEGVG-YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-------LPLV 117 + ++ + G YIY+++EHQSK ++LMAFRM+ Y + + ++++ KE LP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAE-PAIARKIYSSAFPLVDITVVPDDEIMQHRK-MAL 175 + M+FY G ++ + + ++ E + + + + L++++ + ++ I+ +K + + Sbjct: 122 IGMVFYDG-KAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGV 180 Query: 176 LELIQK-HIRQRD---LLGLVDQIVSLLVTGNTNDRQLK---ALFNYVLQTGDAQRFRAF 228 + L K ++R ++ LL ++++ + L ++ ++ K A + D + + Sbjct: 181 ILLTDKPNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKER 240 Query: 229 IGEIAE-RAPQEKEKLMTIADRLREEGAMQGKHE 261 E+ E P+ L IA R RE+ ++GK E Sbjct: 241 FEELKEMEVPKMFNTLEEIAKRDREKAKLEGKAE 274 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 66/270 (24%), Positives = 133/270 (49%), Gaps = 24/270 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + PH+ FK + +DF+ I L + L + L++L+L P+ + +++ Sbjct: 1 MKNKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKH 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-DAGYKELPLVLP 119 + DLL+ K + YI ++ EH+S ++ + ++M+Y+ + L + Y P ++ Sbjct: 60 FLDLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKEKDY--YPPIIN 117 Query: 120 MLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK----MA 174 ++FYHG + +P ++ + + + + I + L+D+ + D+ + ++ K + Sbjct: 118 IVFYHGQAKWNFPTTIP---DIEDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLI 174 Query: 175 LLELIQKHIRQRDLLGLVDQIVSLL--VTGNTNDRQLKALFNY-VLQTGDAQRFRAFIGE 231 + LI KHI R +++I +LL V ++ + NY VL D ++ + E Sbjct: 175 MEMLIMKHIHDR-----LERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKE 229 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHE 261 I +EK+M D+L+ EG M+GK E Sbjct: 230 II----GGEEKMMLFTDKLKMEGKMEGKIE 255 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 40/120 (33%), Positives = 68/120 (56%), Gaps = 4/120 (3%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA +K HP+ RD + + P + D +TL+ S++ +DLR+ D++W ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFYHG 125 QEG YIY+++E QS + MA R++ Y Q+ + A Y ++LP V P++ Y+G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 80/306 (26%), Positives = 144/306 (47%), Gaps = 25/306 (8%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD + + + A F LP + +L DL L+L +SF+ E+L+Q + Sbjct: 1 MTEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQT 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 DLL+ + + G +Y++ EH+S E + +++ Y +N +G + +V+P + Sbjct: 61 DLLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRNQQRSG-ESFSVVIPFV 119 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM------AL 175 FYHG + + + D+F ++ P I + + I +K+ Sbjct: 120 FYHGEKE-WKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVT 178 Query: 176 LELIQKHIRQRDL--LGLVDQIVSLLVTGNTNDRQ---LKALFNYV-----LQTGDAQRF 225 L ++Q+ IR+RDL + + + SLL+ ++ L+ L Y+ L+ + +R Sbjct: 179 LGVVQR-IRERDLEFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRV 237 Query: 226 RAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 A + Q +E MT A+RL EG QGK E + A+ ML + E V+ +T Sbjct: 238 LAI-----SKLEQYEELTMTTAERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITG 292 Query: 286 LSPDDL 291 LS DL Sbjct: 293 LSKQDL 298 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 36/104 (34%), Positives = 63/104 (60%), Gaps = 9/104 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M PH+ +F ++ D AR F+ H+ ++K DL TL+LEP +++DE L+++ Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 61 YSDLLWSVKTQEGVGY------IYVVIEHQSKPEELMAFRMMRY 98 YSDL++SV+ +GY IY++ EH+S P+ L ++++Y Sbjct: 61 YSDLVFSVRL---IGYKNQFAKIYLLFEHKSSPDPLTGVQVLKY 101 >UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UAM6_YERAL Length = 105 Score = 73.2 bits (178), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 46/103 (44%), Positives = 59/103 (57%), Gaps = 16/103 (15%) Query: 210 KALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE-------- 261 K+L NY+LQ GDA + FI E+A R+PQ KE LMTIA +L++EG +G+ E Sbjct: 3 KSLINYMLQDGDAATPKTFIWELARRSPQHKELLMTIAQKLKQEGRQEGRQEGRVEGIQI 62 Query: 262 -EA-------LRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EA L +A+ ML GLDR VM +T LS DL H Sbjct: 63 GEANGLKKGKLEVARTMLVNGLDRATVMKMTGLSDKDLTQIHH 105 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 34/85 (40%), Positives = 53/85 (62%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S PHD FK TAR F++ +LP +R L DL T+ + +S+ID++L++ +S Sbjct: 1 MSLIHNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKP 87 DLL+ VK +E GY Y + EH+ +P Sbjct: 61 DLLFQVKIRENEGYFYFLFEHKVRP 85 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 71.6 bits (174), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 35/125 (28%), Positives = 69/125 (55%), Gaps = 5/125 (4%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 PHD+ +K F +P+ + +PA + D +TL+ S++ +DLR+ + D++W + Sbjct: 7 PHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHDDIVWRI 66 Query: 69 KTQEGVG-YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFY 123 ++G Y+ +V+E QS P+ MA R + Y+ + + + G + LP V P++ Y Sbjct: 67 GWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPVFPIVIY 126 Query: 124 HGCRS 128 +G ++ Sbjct: 127 NGGKA 131 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 40/125 (32%), Positives = 61/125 (48%), Gaps = 4/125 (3%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 + HD +K P+ RD I +P D +TL+ P S++ ED D++W Sbjct: 2 ANTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVW 61 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLF 122 VK Y+Y++IE QS ++ MA RMM Y Q+ + G LP VLP++ Sbjct: 62 RVKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIVL 121 Query: 123 YHGCR 127 Y+G + Sbjct: 122 YNGSQ 126 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 75/306 (24%), Positives = 134/306 (43%), Gaps = 27/306 (8%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +K HP+ D I L L CDL+TL+ S++ +DLR+ D++W + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY---KELPLVLPMLFYHGCR 127 + +Y++IE QSKP+ M R+M Y Q+ + +G +P ++P++ Y+G Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNG-E 123 Query: 128 SPY--PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL---------- 175 P+ P+ + + +P ++R I S + L+D + +M+ R +A Sbjct: 124 IPWKVPHDIRETIQMPKP-VSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSSG 182 Query: 176 -LELIQKHIRQRDLLGLVDQIVSL-----LVTGNTNDRQLKALFNYVLQTGD--AQRFRA 227 LEL + R + + S+ L NT R + Q G A+R Sbjct: 183 PLELFELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAERVNK 242 Query: 228 FIGE--IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 +I + R ++E R EG ++GK E I + M ++G+ + +T Sbjct: 243 WIAQYKAEGRKEGKEEGKKEGLLEGRVEGKLEGKLEGMATILKRMKEKGMSVTEIATITG 302 Query: 286 LSPDDL 291 L D++ Sbjct: 303 LPEDEI 308 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 66/268 (24%), Positives = 124/268 (46%), Gaps = 22/268 (8%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD FK + A DF+ P + K DL+TL + +S+IDE+L++++SD++++ Sbjct: 5 NPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDIVYT 64 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 ++ I ++ EH+S ++M+Y + + + + +P V+P++ YHG Sbjct: 65 CFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQAQRLIP-VIPVILYHGKE 123 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH--RKMAL---LELIQKH 182 + E + R I + L DI+ ++EI R+++L + L++ Sbjct: 124 AWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLLMRNI 183 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER------- 235 ++ L + + + D LK L A R+ + +IAE+ Sbjct: 184 FDEKYLEDKLKDFFEIGIQYFEEDEGLKFL-------ESAIRYLYYASDIAEKRVIDTLK 236 Query: 236 -APQEKEKL-MTIADRLREEGAMQGKHE 261 +E KL MTIA +L E+G + G+ E Sbjct: 237 EISEEGGKLSMTIAAKLIEKGKIAGRVE 264 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 72/315 (22%), Positives = 135/315 (42%), Gaps = 37/315 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M+ S HD+ FK HP + + K +++L F+DE Q Sbjct: 1 MSSSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +D++ + ++ Y Y++IE+QS E M R++RY I + G K+LP ++P+ Sbjct: 61 RADVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKKLPAIIPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI- 179 + Y+G + S + EF I+ + +V+I+ + ++Q + L ++ Sbjct: 121 VTYNGLEKDWDVSQEIISEF--DIFKDDIFK--YAVVNISKLDAKTLLQEEEDILSPVVF 176 Query: 180 ---QKHIRQRDLLGLVDQIVSLL--VTGNTNDRQLKALFNYV---LQTGDAQRFRAFIGE 231 Q +L+ + +I L ++ N +R L N + L D +++ E Sbjct: 177 YLEQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKY----DE 232 Query: 232 IAERAPQEKEKLM-----TIADRLRE---------------EGAMQGKHEEALRIAQEML 271 +A+R Q + M +A L E EG ++GK E + +A++M+ Sbjct: 233 LAQRVEQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMI 292 Query: 272 DRGLDRELVMMVTRL 286 RG E + +T L Sbjct: 293 RRGFSDEDIAELTEL 307 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 67.0 bits (162), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 37/127 (29%), Positives = 66/127 (51%), Gaps = 10/127 (7%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD +K HP+ + ++ P+ + L D TLK ++I + + D++WSV+ Sbjct: 6 HDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVWSVE 65 Query: 70 -TQEGVG---YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-----DAGYKELPLVLPM 120 T EG+ ++Y+++E QSK + M R+M Y +A +HL + LP + PM Sbjct: 66 VTWEGITQRVFLYILLEFQSKIDSTMPLRLMHY-VACFYDHLLKTRETTVRQGLPPIFPM 124 Query: 121 LFYHGCR 127 + Y+G + Sbjct: 125 VLYNGSQ 131 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 72/260 (27%), Positives = 110/260 (42%), Gaps = 26/260 (10%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD FKS L PD + LP + D +L + E L DL +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 + I++++EH+S P+ F++ RY L G + PL LP+LFYHG Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPRPL-LPILFYHGV--- 122 Query: 130 YPYSL-CWLDEFAEPAIARKIYSSAF--PLVDITVVPDDEIMQHRK-----MALLELIQK 181 P++L L E P + F PL+D+ V D+EI H +ALL L K Sbjct: 123 VPWTLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHHVDDLEAVLALLSL--K 180 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQ----LKALFNY---VLQTGDAQRFRAFIGEIAE 234 HI V+ +V LL+ + LK NY V + ++Q + + IA Sbjct: 181 HI-----FDGVETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAR 235 Query: 235 RAPQEKEKLMTIADRLREEG 254 ++ + T D ++G Sbjct: 236 EVGMAQDIVETWLDEYLQQG 255 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 36/119 (30%), Positives = 66/119 (55%), Gaps = 6/119 (5%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 +D + ++ + A D LP L K DL L L +++ ++LRQYY+D+L+SV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL----DAGYKELPLVLPMLFYH 124 +IY++++HQS + + R+ R ++ + +L DA LP++LP++F+H Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDA--TTLPVILPIVFHH 140 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 68/260 (26%), Positives = 120/260 (46%), Gaps = 14/260 (5%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 +T+ HD+ K FL A + LP + K D + E +S++ + L+ YYSDL Sbjct: 2 STTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDL 61 Query: 65 LWSVKTQEG--VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-DAGYKELPLVLPML 121 + SV T+ G V ++ ++EH+S ++ + +RY + + + + G LP+++P+L Sbjct: 62 VVSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPIL 121 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV---PDDEIMQHRKMALLEL 178 H P + L + P+ KI+ F + V P+D AL L Sbjct: 122 IAHPEEGWKPTKVSDLVDL--PSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTL 179 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL---KALFNYVLQTGDAQRFRAFIGEIAER 235 ++ R + + V + L+ + R L + + +Y+ T D + + I +IAE Sbjct: 180 -WRYSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYID-IQKIAET 237 Query: 236 APQEKEKLM-TIADRLREEG 254 E E+ M TIA+ R EG Sbjct: 238 EIDEGEEYMGTIAEMFRREG 257 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 51/166 (30%), Positives = 83/166 (50%), Gaps = 12/166 (7%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQYYSDLLW 66 TPHD FK + + LP + + D +L P + E L R +DL++ Sbjct: 6 TPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTRADLVF 65 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 SV E G + V++EH+S P+ + F++++ + +L G + LP +LP+LFYHG Sbjct: 66 SVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREGREPLP-ILPILFYHGQ 124 Query: 127 RSPYPYSLCWLDEFAEP-AIARKI--YSSAFPL--VDITVVPDDEI 167 S +S+ D F+E I R+I Y F L +D+ ++ D I Sbjct: 125 GS---WSIP--DRFSERMKIPREIARYLPDFELLRIDLGLIDDTRI 165 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 63.5 bits (153), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 72/292 (24%), Positives = 124/292 (42%), Gaps = 42/292 (14%) Query: 14 FKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEG 73 +K HP+ RD + + + D +TL+ S+I EDLR D++W V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 74 VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD----AGYKELPLVLPMLFYHG-CRS 128 Y+Y+++E QS + MA R+M Y Q+ + +LP VLP++ Y+G R Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVD-ITVVPDDEIMQH-RKMALLELIQKHIR-Q 185 ++ L E + R + A+ L+D V+ D E H R +A +H R + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI--AERAPQEK--- 240 +D+L ++ +V L QTG + F +I + RAP + Sbjct: 193 QDMLEVLGTLVEWLKAPE--------------QTGLRRAFVVWIRRVLLPNRAPGMELPE 238 Query: 241 ---------------EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDR 277 E++ +R E+G +G+ E QE RG+++ Sbjct: 239 FNELQDLHEVHDMLAERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEK 290 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 71/300 (23%), Positives = 129/300 (43%), Gaps = 44/300 (14%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD +F+ P AR F+ LP L D TL + S I + L + D+++ + Sbjct: 36 HDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRIN 95 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ----NHLDAGYKE------------ 113 + YV++EHQ+K E+ MA R+M + + + +A KE Sbjct: 96 VNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKADRQSRRR 155 Query: 114 ----LPLVLPMLFYHGCRSPYPYSLCW--LDEFAEPAIARKIYSSAFP-----LVDITVV 162 PLV+ M+ + G P + W D P K + P +V++ + Sbjct: 156 ETDKFPLVISMVLHPG---PRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVELAGL 212 Query: 163 PDDEIMQ-HRKMALLELIQKHIRQRDLLGLVD--QIVSLLVTGNTN-DRQ-----LKALF 213 P +++ H A+L +Q + LGL+D +I LL ++ DR +K L+ Sbjct: 213 PLEKLADGHLARAILGALQG-----NRLGLIDIRKIKRLLDEMFSDPDRASVGAVVKQLW 267 Query: 214 NYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 +Y++ + D + + IA + + +M +RL++ GA++ +H + + DR Sbjct: 268 HYLISSSDLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAGALKAQHNAVIEALEVRFDR 327 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 61.6 bits (148), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 34/128 (26%), Positives = 64/128 (50%), Gaps = 5/128 (3%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S+ D+++K HP+ RD + L A + + + S+ + + D++W Sbjct: 40 SSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDDVVW 99 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN----HLDAGYKELPLVLPMLF 122 + Y+Y+++E Q++P++ MA RM Y Q+ H + + +LP VLP++ Sbjct: 100 RARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLPVVL 159 Query: 123 YHGCRSPY 130 YHG R P+ Sbjct: 160 YHG-RGPW 166 >UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=Q3C0L0_SODGL Length = 131 Score = 60.1 bits (144), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 28/56 (50%), Positives = 39/56 (69%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 +++T + HD VFK FL ARDF++IHLP LRK CD +TL + SFI++DL+ Sbjct: 1 MTSTLSHHDHVFKKFLGDIAVARDFLEIHLPPHLRKHCDFSTLAMASGSFIEDDLK 56 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 59/252 (23%), Positives = 112/252 (44%), Gaps = 8/252 (3%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD FK+ + RDF+ LP + + D +L+ I + + DL+ + Sbjct: 8 HDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHMDLVVECR 67 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIA-AMQNHLDAGYKELPLVLPMLFYHGCRS 128 E Y++IEH+S P+ + +M+RY +A +N D K L VLP++F+ G R Sbjct: 68 ISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRNRQDN--KPLVPVLPLVFHQGGR- 124 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLV-DITVVPDDEIMQHRKMALLELIQKHIRQRD 187 P+ + + + F P + PL+ D++ V I + A ++ ++ Sbjct: 125 PWTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLTLLKYAF 184 Query: 188 LLGLVDQIVSLLVTGNTNDRQ-LKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 + D + +L TG + D L + NY ++ + + + + R+ ++ + +I Sbjct: 185 SGSVEDVLRALKETGGSFDETFLFGVLNYAIRAFEVK--DPVVVDAISRSFGGEKIMPSI 242 Query: 247 ADRLREEGAMQG 258 D EEG +G Sbjct: 243 IDEWVEEGLKEG 254 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 55.5 bits (132), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 70/302 (23%), Positives = 118/302 (39%), Gaps = 47/302 (15%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 DA++ HP A + +P + D ++ F D D ++ D++W + T Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 71 QEGVGYI-YVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-----ELPLVLPMLFYH 124 +G + +++ E QS + MA R Y + HL A K LP VL ++ Y+ Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYE-GLLWQHLIAERKLKSGDRLPPVLTLVLYN 123 Query: 125 G-CRSPYPYSLCWLDEFAEPAIARKIYSSAFP--------LVDITVVPDDEIMQHRKMAL 175 G R P P IA S +P L+D+ VP++E+ +A Sbjct: 124 GEQRWHAPTDTI-------PLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAA 176 Query: 176 LELIQKHIRQ-RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT------------GDA 222 L +H R+ +L GL+D +V D +L+ LF +++ GD Sbjct: 177 LLFRLEHPREPEELAGLIDDVVGWFRRHPGYD-ELRRLFTELVRQAIEGYETSVAVPGDM 235 Query: 223 QRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 R+ + + E T R EG +G+ R + L R L++ + Sbjct: 236 MEMRSMLANLGE----------TWKKRWLAEGIAEGEARGEARGEAKALIRLLEKRFGQL 285 Query: 283 VT 284 T Sbjct: 286 PT 287 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 55.5 bits (132), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 75/317 (23%), Positives = 135/317 (42%), Gaps = 53/317 (16%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 +D FK + +F+ + P DL +L+ SF+ ++ + +D+++ K Sbjct: 10 YDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEKEADVIYRAK 69 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL--PLVLPMLFYHGCR 127 ++ Y YV++E QS ++ M R+ Y Q H++ +L P+V P++ Y+G Sbjct: 70 IEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIV-PIVLYNGRS 128 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR- 186 + +L F I + + F LVD+ + DDE +++R + LL +I R R Sbjct: 129 NWNVPTLI----FKGWEIFKDDMFNYF-LVDVNNI-DDETLKNR-LDLLSVILYLDRSRK 181 Query: 187 ------DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ-E 239 + L V + +S L T Q+K ++L+ Q GEI E + E Sbjct: 182 TAKEFIEKLKEVTEYISCLPT-----EQVKVFAMWLLRVIRPQMMEEVQGEIDELLKRIE 236 Query: 240 KEKLMTIAD------RLRE------------------------EGAMQGKHEEALRIAQE 269 +E + + D RL + EG ++G+ E +RIA+ Sbjct: 237 QEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRIARN 296 Query: 270 MLDRGLDRELVMMVTRL 286 M+ G + + VT L Sbjct: 297 MILAGAEDSFISKVTGL 313 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 54.7 bits (130), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 66/291 (22%), Positives = 128/291 (43%), Gaps = 43/291 (14%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 PHD+ FK P + +DI + + +T + E S +++ DLL+S Sbjct: 5 PHDSFFKQIFSDPRRVKTLLDIFAKDVAKSIHSITPVNTEKFS---SKSQKFMLDLLFSC 61 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG-CR 127 K ++ YI +V+EH+S ++ + ++ Y+ A + + + P ++ ++FYHG Sbjct: 62 KVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKEK-EYYPPIINIVFYHGKGE 120 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL----ELIQKHI 183 P SL L+ + + + + + L+D+ V DDE++ + + KH+ Sbjct: 121 WNIPTSLPVLE---DQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKHV 177 Query: 184 RQRDLLGLVDQIVSLL------VTGNTNDRQLKAL---FNYVLQT-GDAQRFRAFIGEIA 233 + +++I ++ V + ++ L FNY+ GD + A Sbjct: 178 HEN-----IEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKE--------A 224 Query: 234 ERAPQE----KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 E A +E +K MT+ ++ EG +GK E QE L++G L+ Sbjct: 225 ENALKELIGGDKKAMTLIEKWIMEGLEKGKQEG----LQEGLEKGKQEGLI 271 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 53.9 bits (128), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 50/172 (29%), Positives = 79/172 (45%), Gaps = 9/172 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQYY 61 ++TT TPHD+ FK + L AP D ++L I E L + Sbjct: 1 MTTTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFR 60 Query: 62 SDLLWSV----KTQEGVGYIYV-VIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 SDL+ S+ T +G +V ++EH+S P + F++ A L G LP Sbjct: 61 SDLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGKPPLP- 119 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFA-EPAIARKIYSSAFPLVDITVVPDDEI 167 V+P+L +HG +SP+ L + P +A + A ++D+T + DDEI Sbjct: 120 VVPILIHHG-KSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEI 170 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 53.5 bits (127), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 49/179 (27%), Positives = 81/179 (45%), Gaps = 15/179 (8%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA+FK+ P A LP L + D EP + +D L + D+LW + Sbjct: 7 HDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWRTR 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLPMLFY---HG 125 +G G IYV++EHQS E M R+ Y H + LP ++P++ HG Sbjct: 67 FVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAEHG 125 Query: 126 CRSPYPYSLCWLDEFAE-----PAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 R+P + ++F+ P +A + + + D+T V DD ++ R + L + + Sbjct: 126 WRAPRSF----WEQFSPSPDCIPGLAPFVPNFQLLIDDLTQV-DDASLRGRSLPLFQTL 179 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 52.8 bits (125), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 31/135 (22%), Positives = 69/135 (51%), Gaps = 16/135 (11%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLT--------TLKLEPNSFIDEDLR 58 + HD+ FK +P DI+L L K+ + + +++++ ++I ++ Sbjct: 11 AKEHDSTFKLLFENPK------DIYLL--LSKIINYSWANEIRESSIEIKKTNYITKEFS 62 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 Q +D++ + ++ Y Y++IE+QS + M R++RY I+ + G ++LP ++ Sbjct: 63 QVEADVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNGVEKLPAII 122 Query: 119 PMLFYHGCRSPYPYS 133 P++ Y+G + S Sbjct: 123 PIVVYNGLDRRWEVS 137 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 52.0 bits (123), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 55/261 (21%), Positives = 116/261 (44%), Gaps = 22/261 (8%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T PHD FK P + +DI P +K+ DL +++L + + + + +L Sbjct: 2 TDLQPHDQFFKQIFSEPKRVKSLLDIFYPELSQKI-DLESIRLLNSEKYSQKVGKSLLNL 60 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 L+ K + ++ ++ EH+S ++ + +++ Y+ + Y+E P ++ ++ YH Sbjct: 61 LYECKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYEEYPPIINIVLYH 118 Query: 125 GCRS-PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE----LI 179 G R P +L + A K+ + L+D++ V D+E++ + L Sbjct: 119 GKRKWNIPATLPKTNSEIIERFANKL---NYHLIDLSKVADEEMISKLYLDFCTVSALLT 175 Query: 180 QKHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 KHI + R ++ ++ + D + + +Y+ + Q + EI Sbjct: 176 MKHIFEDLRKYKHILKKVFE-----HYQDGCVFIILDYISVVNNPQEVENVLKEI---LG 227 Query: 238 QEKEKLMTIADRLREEGAMQG 258 EK+ +MT+ ++ + EG QG Sbjct: 228 GEKD-MMTLTEKWKMEGLQQG 247 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 30/118 (25%), Positives = 63/118 (53%), Gaps = 4/118 (3%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFID-EDLRQYYSDLLW 66 PHD K L+ + A+ +D HLP + + TL++ +D ++ +Y++D+++ Sbjct: 15 NPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADIIY 74 Query: 67 SVKTQEGVGY-IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 S+KT G IYV+IEH+S ++ + ++++ A + G ++ + P++ Y Sbjct: 75 SLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEG--KITPIYPIVIY 130 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 60/319 (18%), Positives = 134/319 (42%), Gaps = 33/319 (10%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + HD +K H +T +F+ L + L L S+I D + Sbjct: 1 MKNNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEE 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------- 112 SD+L+ + YV++E QSK + M R++ Y ++ L K Sbjct: 61 ESDILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNF 120 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIAR-KIYSSAFPLVDITVVPDDEIMQHR 171 +LP ++P++ Y+G ++ + + + + + + I + L DI D E++ Sbjct: 121 KLPSIVPIVLYNG-KNKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNIS 179 Query: 172 KM-ALLELIQKHIRQRDLL------------------GLVDQIVSLLVTGNTNDRQLKAL 212 M + + L+ + I +++L+ + + + +V D L+ Sbjct: 180 NMISAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRD-NLQGE 238 Query: 213 FNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD 272 + VL+ + + + + + + ++K + +R ++G QG + + A++ ++ Sbjct: 239 IDDVLEKSNQEEVDFMVSNLGKTIERMQDKAI---ERGLKKGIEQGIEQGIEQTAKKAIE 295 Query: 273 RGLDRELVMMVTRLSPDDL 291 G+D E++M +T LS + + Sbjct: 296 MGMDNEIIMNLTGLSEEQI 314 >UniRef50_B7UFQ6 Predicted protein n=11 Tax=Escherichia RepID=B7UFQ6_ECO27 Length = 73 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 20/73 (27%) Query: 244 MTIADRLREEG------------AMQGK--------HEEALRIAQEMLDRGLDRELVMMV 283 MTIA+RLR+EG +GK HE+A++IA ML++G+DR+ V+ Sbjct: 1 MTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQESMHEQAIKIALRMLEQGIDRDQVLAA 60 Query: 284 TRLSPDDLIAQSH 296 T+LS DL A++H Sbjct: 61 TQLSETDLAAKNH 73 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 74/283 (26%), Positives = 124/283 (43%), Gaps = 47/283 (16%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPA-------PLRKLCDLTTLKLEPNSFIDED 56 S ++TPHD+ FK P HLP+ L +L++L+ P I ED Sbjct: 19 SISTTPHDSFFKDVF-GPGKG------HLPSLIPLIDGSLASRIELSSLEYLPGESIAED 71 Query: 57 L-RQYYSDL-----LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG 110 L R SDL + + + G I + EH+S + ++ A + L G Sbjct: 72 LARSTRSDLSASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREG 131 Query: 111 YKELPLVLPMLFYHGCRSPY--PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDD--- 165 K P V+P++ YHG R+P+ P L + + P +A ++ L+D++ D+ Sbjct: 132 RKPCP-VIPVVLYHG-RAPWTLPARLSEALDLS-PELAPRLPDFELTLIDLSRFSDETLK 188 Query: 166 EIMQHRKMALLELIQKHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKAL-------FNYV 216 E + H + + + KHI + +LG V L+ T + + LK + +YV Sbjct: 189 EKIAHPEPLVSLSVMKHIFEPPESVLG---HFVRLIKTLSPSRDILKRIVDTTLHYISYV 245 Query: 217 LQTGDAQRFRA-FIGEIAERAPQEKEKLMTIADRLREEGAMQG 258 ++ Q R F +AE EK+ T+ D ++EEG +G Sbjct: 246 KKSHHPQEIRTIFTTFLAE------EKMTTVLDLIKEEGIQEG 282 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 29/123 (23%), Positives = 58/123 (47%), Gaps = 4/123 (3%) Query: 7 STPH---DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 PH D FK + + +F+ ++ D +L+ SFI ++ + +D Sbjct: 4 KVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEKEAD 63 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLPMLF 122 +++ + ++ Y YV+IE QS + M R+ Y + H++ E LP ++P++ Sbjct: 64 VIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVPIVL 123 Query: 123 YHG 125 Y+G Sbjct: 124 YNG 126 >UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF6_YERPN Length = 105 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 22/51 (43%), Positives = 35/51 (68%) Query: 208 QLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQG 258 Q+ AL +Y+LQ G++ AF+ E+A+R PQ + LMTIA +L ++G +G Sbjct: 9 QVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKG 59 >UniRef50_B1EI63 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1EI63_9ESCH Length = 78 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 22/36 (61%), Positives = 29/36 (80%), Gaps = 2/36 (5%) Query: 42 LTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYI 77 + TLKLE +SFID+DLR+ YSD+LWSVK +GY+ Sbjct: 1 MKTLKLESSSFIDDDLRESYSDVLWSVKYL--IGYL 34 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 36/162 (22%), Positives = 75/162 (46%), Gaps = 7/162 (4%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 PHD FK P + +DI + L + DL +++L + + + + DLL+ Sbjct: 6 PHDQFFKQIFSEPKRVKSLLDI-FYSELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYEC 64 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 K + ++ ++ EH+S ++ + +++ Y+ + YKE ++ ++ YHG R Sbjct: 65 KIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYKEYLPIINIVLYHGKRK 122 Query: 129 -PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ 169 P + L + I R + L+D++ V D+E++ Sbjct: 123 WNIPTT---LPKTNSEIIERFSNKLNYHLIDLSKVADEEMIN 161 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 41/161 (25%), Positives = 68/161 (42%), Gaps = 7/161 (4%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD FKS L P + LP L L +L + + + L D+ + Sbjct: 8 HDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLLDMAFEAT 67 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 E I+V++EH+S P+ F+++ Y +P V P+LFYHG R Sbjct: 68 FGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLRDKKESRSPIPFV-PVLFYHGLR-- 124 Query: 130 YPYSL-CWLDEFAEPA--IARKIYSSAFPLVDITVVPDDEI 167 P++L L E +P + + P++D+ + D +I Sbjct: 125 -PWNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDI 164 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 64/275 (23%), Positives = 121/275 (44%), Gaps = 32/275 (11%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S +PHD FK F++I LP L + +LKL + ++++ Sbjct: 1 MSIEKSPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFL 59 Query: 63 DLLW--SVKTQEGV---GYIYVVIEHQSKPEELMAFRMMRYSIAAMQN--HLDAGYKELP 115 DL + +K +EG G IY+V EH+S P++ ++ Y M+ L Y+ Sbjct: 60 DLAFDCKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEEDERLSRPYRP-- 117 Query: 116 LVLPMLFYHGCRS-----PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH 170 V+P++FYHG +S P L + + ++S ++ L D++ V D+ + Sbjct: 118 -VIPIVFYHGEKSWNIPTDIPQQFNTLGN-----LEKYLHSLSYILFDVSKV-DESFLIE 170 Query: 171 RKMALLELIQKHIRQRDL---LGLVDQIVSLLVTGNTNDRQLKALFNY-VLQTGDAQRFR 226 + LI +++ L + ++ L+ + D L + +Y V+ D + Sbjct: 171 KIYLNACLISGVFTLKNIFKDLKYLRPVLEKLILDDVKD-CLYIIIDYTVIVKKDLETIE 229 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 + EI +EK+MT+ ++ + EG +G E Sbjct: 230 KILEEIGG-----EEKMMTLTEKWKMEGLKKGMEE 259 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 60/246 (24%), Positives = 109/246 (44%), Gaps = 33/246 (13%) Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 +DLL V+ +G Y+ + +E+Q+ MAFRM YSI + H +LP V + Sbjct: 56 TDLLKKVRDNKGNRYV-LHVEYQTDNYPEMAFRMAEYSIMLQRKH------KLP-VKQFV 107 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVP-----DDEIMQHRKMALL 176 Y G P ++ +I K + + L +++ V ++++ + +A+L Sbjct: 108 IYIG---PAKANMA-------TSITTKDFRFRYNLTELSAVNYKLFLKSDLVEEKMLAIL 157 Query: 177 ELIQKHIRQRDLLGLVDQI---VSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 + + L +V +I S L G RQL+ L A + A +G+I Sbjct: 158 SNLASESTESVLAQVVQEIETHTSTLEQGRYF-RQLRILLQLRNLNKKAIKDMALVGKIF 216 Query: 234 ERAPQEKEKLMTIADRLREE------GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 + + I ++ E G +G++EEA+ IA E+ GL E + +T+LS Sbjct: 217 KEEKDILYRRGEIKGEIKGEIKGEIKGIEKGRYEEAMEIALELKKEGLATEFIAKITKLS 276 Query: 288 PDDLIA 293 +++ A Sbjct: 277 IEEIQA 282 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 66/329 (20%), Positives = 140/329 (42%), Gaps = 49/329 (14%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 P+D ++ L + + + + D L L S++ +D + +D+++ + Sbjct: 14 PYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADVVYRL 73 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ---NHLDAGYKE-----LPLVLPM 120 KT+ YV++E QS + LM FR++ Y + + N+ G +E LP ++P Sbjct: 74 KTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPPIIPA 133 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSS-----AFPLVDITVVPDDEIMQHRKM-A 174 + Y+G S + +L F E + + +S + L D+ ++E+++ + A Sbjct: 134 VLYNGAGS-WTAALS----FKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIA 188 Query: 175 LLELIQKHIRQRDLLGLVDQIVSLLVTGNTND-RQLKALFNYVLQTGDAQRFRAFI-GEI 232 + L+ + ++ DL G + ++ +L ++ R V+Q F I G + Sbjct: 189 GIFLLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGIL 248 Query: 233 AERAPQEKEKL-----MTIADRLRE-----------------------EGAMQGKHEEAL 264 P E E++ +T+ + R+ EG ++GK E Sbjct: 249 NASNPWEVERMIYNLELTLEEMQRQALLKGLKEGEQKGKLEGKLEGKLEGKLEGKLEGKR 308 Query: 265 RIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 +A+ +L +D E ++ T L+ +++ A Sbjct: 309 EVARNLLLLNVDIETIIKATGLALEEINA 337 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust. Identities = 32/132 (24%), Positives = 67/132 (50%), Gaps = 18/132 (13%) Query: 51 SFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-DA 109 SF+ +D +DL++ VK ++ Y+++E QS + M +R++ Y + ++ L D Sbjct: 55 SFVLQDFADKEADLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDT 114 Query: 110 GYKE-------LPLVLPMLFYHG-----CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV 157 KE LP+++P++ Y+G ++ Y +L + F E A+ K + L+ Sbjct: 115 PRKESRRKDFKLPVIVPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFK-----YILI 169 Query: 158 DITVVPDDEIMQ 169 D+ +E+++ Sbjct: 170 DVNRYTKEELLK 181 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 42.4 bits (98), Expect = 0.020, Method: Compositional matrix adjust. Identities = 26/117 (22%), Positives = 58/117 (49%), Gaps = 4/117 (3%) Query: 53 IDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK 112 I+ R SD+++ +K ++ YI V++E QS EEL+ R++ Y + + + Sbjct: 50 INRQWRARRSDMVYKIKYKDA--YICVLLEFQSSKEELIHLRVLEYMLLIQKKYTTKNL- 106 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ 169 LP+V+P++ Y G P + + + + + VD+ ++ D+++++ Sbjct: 107 -LPVVIPVVLYTGEEKWTPATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLK 162 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 40.8 bits (94), Expect = 0.061, Method: Compositional matrix adjust. Identities = 54/275 (19%), Positives = 117/275 (42%), Gaps = 20/275 (7%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + HD +K + +T I + L L S++ D + S Sbjct: 16 VNKKNNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELES 75 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAA----MQNHLDAGYKE----L 114 D+++ + + + Y+++E QS + M R++ Y I ++N + +K L Sbjct: 76 DIVYKARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRL 135 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAI-ARKIYSSAFPLVDITVVPDDEIMQHRKM 173 P V+P++ Y+G ++ + + + + I I + +D+ DE+ +++ + Sbjct: 136 PAVVPIVVYNGEKN-WTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNI 194 Query: 174 A-LLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDR-QLKALFNYVLQTGDAQRFRAFIGE 231 A + L+ + I + + + IV ++ QLK V + ++ I + Sbjct: 195 ASAIFLLDQSISRIEFYNRLKDIVIEFNKLTVEEKAQLKHWL--VNVNSEENNYKENIEK 252 Query: 232 IAERAPQEKEKLMTIA-----DRLREEGAMQGKHE 261 I +E E +MT ++L+EEG ++GK E Sbjct: 253 IFSSNKREVE-IMTSNISKGLEKLKEEGKIEGKAE 286 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 405 e-112 UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 373 e-102 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 361 1e-98 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 358 2e-97 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 356 4e-97 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 348 1e-94 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 348 1e-94 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 320 3e-86 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 318 2e-85 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 315 1e-84 UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 313 5e-84 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 313 6e-84 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 310 3e-83 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 304 2e-81 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 299 8e-80 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 295 1e-78 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 286 4e-76 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 283 4e-75 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 281 2e-74 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 279 8e-74 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 264 2e-69 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 264 3e-69 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 261 2e-68 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 246 5e-64 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 239 7e-62 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 239 8e-62 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 239 1e-61 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 239 1e-61 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 238 2e-61 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 234 4e-60 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 232 9e-60 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 230 4e-59 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 229 9e-59 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 229 1e-58 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 228 2e-58 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 226 7e-58 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 226 9e-58 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 225 1e-57 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 220 4e-56 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 211 3e-53 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 208 2e-52 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 207 4e-52 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 207 4e-52 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 205 1e-51 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 205 2e-51 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 203 5e-51 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 202 9e-51 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 202 1e-50 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 202 2e-50 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 198 2e-49 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 197 3e-49 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 196 6e-49 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 196 6e-49 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 195 2e-48 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 194 2e-48 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 194 4e-48 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 193 6e-48 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 193 7e-48 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 191 3e-47 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 186 7e-46 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 185 2e-45 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 185 2e-45 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 184 3e-45 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 184 5e-45 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 180 3e-44 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 175 1e-42 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 175 2e-42 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 174 4e-42 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 174 4e-42 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 174 5e-42 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 173 6e-42 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 170 4e-41 UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 170 6e-41 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 169 1e-40 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 167 4e-40 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 167 5e-40 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 166 8e-40 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 164 3e-39 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 162 1e-38 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 160 7e-38 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 160 7e-38 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 155 2e-36 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 153 9e-36 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 152 2e-35 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 143 8e-33 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 142 1e-32 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 141 4e-32 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 124 4e-27 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 116 1e-24 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 113 6e-24 UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ... 106 1e-21 UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersini... 102 1e-20 UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=... 78 3e-13 UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF... 73 9e-12 UniRef50_B7UFQ6 Predicted protein n=11 Tax=Escherichia RepID=B7U... 55 4e-06 Sequences not found previously or not previously below threshold: UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 171 2e-41 UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 150 6e-35 UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 150 6e-35 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 145 2e-33 UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 143 7e-33 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 135 2e-30 UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 125 2e-27 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 124 3e-27 UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostri... 112 2e-23 UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 110 6e-23 UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 109 1e-22 UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 108 3e-22 UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostri... 105 1e-21 UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candida... 104 4e-21 UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotro... 101 4e-20 UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostr... 98 3e-19 UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 97 1e-18 UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermo... 95 3e-18 UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfuri... 94 4e-18 UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseifl... 92 2e-17 UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuok... 91 4e-17 UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermo... 87 5e-16 UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synecho... 87 6e-16 UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NI... 85 4e-15 UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW 84 6e-15 UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorell... 83 7e-15 UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibroba... 81 4e-14 UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldice... 79 1e-13 UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Trepone... 79 2e-13 UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID... 78 4e-13 UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillu... 77 6e-13 UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorell... 75 2e-12 UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobac... 75 2e-12 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 75 3e-12 UniRef50_A8VV66 ATPase associated with various cellular activiti... 73 7e-12 UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotro... 73 1e-11 UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax... 72 2e-11 UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultu... 72 3e-11 UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 71 4e-11 UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubact... 70 6e-11 UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiat... 70 7e-11 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 70 8e-11 UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenom... 70 1e-10 UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryante... 70 1e-10 UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenom... 69 2e-10 UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfo... 68 3e-10 UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax... 68 3e-10 UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfo... 68 4e-10 UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteri... 68 4e-10 UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 68 5e-10 UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscil... 68 5e-10 UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax... 68 5e-10 UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostri... 66 2e-09 UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococca... 66 2e-09 UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingoba... 65 2e-09 UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 65 3e-09 UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacter... 65 4e-09 UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatex... 65 4e-09 UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=... 65 4e-09 UniRef50_B3CQQ1 Putative transposase n=3 Tax=Orientia tsutsugamu... 64 5e-09 UniRef50_Q1NK38 Putative uncharacterized protein n=2 Tax=delta p... 64 6e-09 UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=... 64 7e-09 UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Trepone... 64 7e-09 UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostri... 63 1e-08 UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobac... 63 1e-08 UniRef50_UPI000190BD13 hypothetical protein SentesTyph_06309 n=2... 63 1e-08 UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfi... 63 1e-08 UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabac... 63 1e-08 UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacte... 63 1e-08 UniRef50_A7BN25 Putative uncharacterized protein n=3 Tax=Beggiat... 63 1e-08 UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orienti... 62 2e-08 UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bactero... 62 2e-08 UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostri... 62 3e-08 UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachys... 62 3e-08 UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotro... 61 4e-08 UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicycl... 61 6e-08 UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Ricket... 60 7e-08 UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotro... 60 8e-08 UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Rumino... 60 9e-08 UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptosp... 60 9e-08 UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax... 60 1e-07 UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiat... 60 1e-07 UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cya... 59 2e-07 UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminoc... 59 2e-07 UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coproco... 59 2e-07 UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecali... 59 2e-07 UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax... 59 2e-07 UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanoth... 58 5e-07 UniRef50_C3QLI8 Putative uncharacterized protein n=1 Tax=Bactero... 57 7e-07 UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostr... 57 7e-07 UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Trepon... 57 8e-07 UniRef50_A7M2M6 Putative uncharacterized protein n=2 Tax=Bactero... 57 8e-07 UniRef50_B7CCB3 Putative uncharacterized protein n=1 Tax=Eubacte... 57 1e-06 UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfi... 57 1e-06 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 57 1e-06 UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevote... 56 2e-06 UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255... 56 2e-06 UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminoc... 56 2e-06 UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacte... 55 2e-06 UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microco... 55 2e-06 UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacter... 55 3e-06 UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostri... 55 3e-06 UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiat... 55 3e-06 UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobac... 55 3e-06 UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostri... 55 3e-06 UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8Y... 55 4e-06 UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostri... 55 4e-06 UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachys... 54 5e-06 UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadoba... 54 5e-06 UniRef50_A7BL62 Putative uncharacterized protein n=2 Tax=Beggiat... 54 5e-06 UniRef50_Q3ARU8 Putative uncharacterized protein n=12 Tax=Chloro... 54 6e-06 UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Rosebur... 54 6e-06 UniRef50_C1DU30 Putative uncharacterized protein n=7 Tax=Sulfuri... 53 8e-06 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 53 8e-06 UniRef50_D1PGQ2 Transposase, ISNCY family n=2 Tax=Prevotella cop... 53 9e-06 UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia... 53 1e-05 UniRef50_A6L0F5 Putative uncharacterized protein n=6 Tax=Bactero... 53 1e-05 UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachys... 53 1e-05 UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostri... 53 1e-05 UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabac... 53 1e-05 UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanoba... 53 1e-05 UniRef50_C1DU78 Putative uncharacterized protein n=1 Tax=Sulfuri... 53 1e-05 UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfi... 53 2e-05 UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachys... 52 2e-05 UniRef50_Q73KA7 Putative uncharacterized protein n=2 Tax=Trepone... 52 2e-05 UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacteriu... 52 2e-05 UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostri... 52 2e-05 UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacte... 52 2e-05 UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostri... 52 2e-05 UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacte... 52 2e-05 UniRef50_B0C251 Putative uncharacterized protein n=1 Tax=Acaryoc... 52 2e-05 UniRef50_Q6ZEK6 Slr5124 protein n=11 Tax=Chroococcales RepID=Q6Z... 52 3e-05 UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xen... 52 3e-05 UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=P... 52 3e-05 UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostri... 52 3e-05 UniRef50_A7N2B6 Putative uncharacterized protein n=1 Tax=Vibrio ... 52 3e-05 UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacter... 52 4e-05 UniRef50_A7C854 Putative uncharacterized protein n=3 Tax=Beggiat... 51 4e-05 UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostr... 51 4e-05 UniRef50_A8PPL6 Putative uncharacterized protein n=1 Tax=Rickett... 51 5e-05 UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachys... 51 6e-05 UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachys... 51 6e-05 UniRef50_C1J8G9 YdgA n=11 Tax=Enterobacteriaceae RepID=C1J8G9_ECOLX 51 6e-05 UniRef50_A8PKB8 Putative uncharacterized protein n=1 Tax=Rickett... 50 7e-05 UniRef50_C4Z592 Putative uncharacterized protein n=2 Tax=Clostri... 50 7e-05 UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea f... 50 7e-05 UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=... 50 8e-05 UniRef50_C4G2Y1 Putative uncharacterized protein n=7 Tax=Abiotro... 50 9e-05 UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolba... 50 9e-05 UniRef50_Q2FSG0 Putative uncharacterized protein n=1 Tax=Methano... 50 1e-04 UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 ... 50 1e-04 UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microco... 50 1e-04 UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax... 50 1e-04 UniRef50_C8PR55 Transposase (Fragment) n=5 Tax=Treponema RepID=C... 50 1e-04 UniRef50_A7C5R5 Putative uncharacterized protein n=2 Tax=Beggiat... 50 1e-04 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 405 bits (1042), Expect = e-112, Method: Composition-based stats. Identities = 296/296 (100%), Positives = 296/296 (100%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY Sbjct: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM Sbjct: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ Sbjct: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK Sbjct: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH Sbjct: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 163/311 (52%), Positives = 222/311 (71%), Gaps = 16/311 (5%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +TT TPHDA F+ FL PD ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+ Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 SD+L+S+KT G GYI+V++EHQS P++ MAFR++RY++AAMQ HL+AG+K+LPLV+P+L Sbjct: 63 SDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVL 122 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 FY G RSPYPYS WLDEF + A+A K+YSSAFPLVD+TV+PDDEI HR MA L L+QK Sbjct: 123 FYTGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQK 182 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 HI QRDL LVD++ +L+ G + Q+ +L +Y++Q G+ AF+ E+A+R PQ + Sbjct: 183 HIHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGD 242 Query: 242 KLMTIADRLREEGAM----------------QGKHEEALRIAQEMLDRGLDRELVMMVTR 285 LMTIA +L ++G +G+ E L+IA+ ML +DR VM +T Sbjct: 243 ALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTG 302 Query: 286 LSPDDLIAQSH 296 L+ DDL H Sbjct: 303 LTEDDLAQIRH 313 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 361 bits (928), Expect = 1e-98, Method: Composition-based stats. Identities = 158/311 (50%), Positives = 219/311 (70%), Gaps = 16/311 (5%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 ++T TPHDA F+ FL P+ ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+ Sbjct: 3 KKNSTPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 SD+L+S+ T EG GY++V+IEHQS P++ MAFR++RY+IAAMQ HL+AG+ +LPLV+P+L Sbjct: 63 SDVLYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAKLPLVIPVL 122 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 FY G RSPYPYS WLDEF +P +A K+YS AFPLVD+TV+PDD+IM+HR MA L L+QK Sbjct: 123 FYVGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQK 182 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 HI QRD+ L D++ +LL+ + Q+ AL +Y+LQ G++ AF+ E+A+R PQ + Sbjct: 183 HIHQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGD 242 Query: 242 KLMTIADRLREEGAMQGKHE----------------EALRIAQEMLDRGLDRELVMMVTR 285 LMTIA +L ++G +G+ E L +A+ +L G+ E V T Sbjct: 243 ALMTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATG 302 Query: 286 LSPDDLIAQSH 296 LS DDL H Sbjct: 303 LSEDDLAQIRH 313 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 358 bits (918), Expect = 2e-97, Method: Composition-based stats. Identities = 194/311 (62%), Positives = 242/311 (77%), Gaps = 20/311 (6%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT STTS+PHDAVFK+F+ P+TARDF++IHLP PLRKLC+L TL+LEP SFI++ LR Y Sbjct: 1 MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSD+LWSV+T EG GYIY VIEHQS E+ MAFR+MRY+ AAMQ HLD GY +PLV+P+ Sbjct: 61 YSDVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPL 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG SPYPYSL WLDEF +P +AR++Y+ AFPLVDIT+VPDDEIMQHR++ALLELIQ Sbjct: 121 LFYHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR RDL+G+VD+I +LLV G TND QL+ LFNY+LQ GD RF FI EIAER+P +K Sbjct: 181 KHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQK 240 Query: 241 EKLMTIADRLRE--------------------EGAMQGKHEEALRIAQEMLDRGLDRELV 280 E LMTIA+RLR+ EG +G HE+A++IA ML++G +RE+V Sbjct: 241 EILMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREIV 300 Query: 281 MMVTRLSPDDL 291 + T+L+ D+ Sbjct: 301 LAATQLTDADI 311 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 356 bits (915), Expect = 4e-97, Method: Composition-based stats. Identities = 159/306 (51%), Positives = 206/306 (67%), Gaps = 12/306 (3%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M+ T TPHDAVF+ FL TA+DF DI LP ++ LCD TLK E SFID D++ Y Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SV GY+Y +IEHQS P++LMA+R+MRYS+AAMQ HL+AG+ +LPLV P+ Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDKLPLVFPV 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFY G +SP+PYS WLD F P IA KIYS F L+D+T + DD IMQHR+MALLELIQ Sbjct: 121 LFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR+RD+ L+D IV LL D Q+ + NY++Q G+A R FI EIA+RA + + Sbjct: 181 KHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKHE 240 Query: 241 EKLMTIADRL------------REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 E LMTIA+ L R+EG QG+H A++IA++ML RG+ R+ V T LS Sbjct: 241 EALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLSD 300 Query: 289 DDLIAQ 294 + L Sbjct: 301 NALDNL 306 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 348 bits (893), Expect = 1e-94, Method: Composition-based stats. Identities = 149/295 (50%), Positives = 207/295 (70%), Gaps = 4/295 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT T TPHDAVFK FL +TA+DF DI LP ++ LCDL +LK+E SFID +++ Y Sbjct: 7 MTKKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNY 66 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SV T +G GYIYV+IEHQS P++L+A+R+MRYS+AAMQ HL+ G K+LPLV P+ Sbjct: 67 QSDILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQLPLVFPI 126 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFY G +SP+PYS WLD F + +A IY++ F L D+T + D EIMQH+++ALLEL+Q Sbjct: 127 LFYCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELLQ 186 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR+RD+ L+D IV LL D Q+ +FNY++Q G+AQR FI IA++A + + Sbjct: 187 KHIRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKHE 246 Query: 241 EKLMTIADRLRE----EGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 LMTIA ++ E +G QG + + +A++ L G+DR V + T LS ++L Sbjct: 247 GALMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEEL 301 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 348 bits (893), Expect = 1e-94, Method: Composition-based stats. Identities = 157/335 (46%), Positives = 216/335 (64%), Gaps = 39/335 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M T TPHDA+FK FL H DTARDF++IHLPA LR +CDL TL+LE SFI+++LR + Sbjct: 1 MKRKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVH 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSD+L+S+KT +G Y+Y VIEHQS P+++MAFR+MRYSI+AMQ HL+ G+K+LPLV+P+ Sbjct: 61 YSDILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKKLPLVIPV 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG PYP+S W D F A+A +IYSSAFPLVD+TV+PDDEI+ H+++ALLE++Q Sbjct: 121 LFYHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIVQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRD+ L ++ L LK++ NY+L GD FI ++AE+ P+ + Sbjct: 181 KHIRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKYE 240 Query: 241 EKLMTIADRLREEGAMQGKHEE-------------------------------------- 262 E LMTIA +L+ +G +G E Sbjct: 241 EVLMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEKR 300 Query: 263 -ALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 +L+IA+ ++D G+DRE +M T LS ++L H Sbjct: 301 ASLKIARALMDNGIDRETIMKSTGLSQNELEQIHH 335 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 320 bits (821), Expect = 3e-86, Method: Composition-based stats. Identities = 156/312 (50%), Positives = 209/312 (66%), Gaps = 20/312 (6%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +TT TPHDA F+SFL +PD ARDF+++HLPA R+LCDL+TLKLEP +F++ DL QY Sbjct: 5 KNTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYA 64 Query: 62 SDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+LWSVKT G GY+Y +IEHQS M FRM+RYS+AAMQ HL+ +K LPLV+P+ Sbjct: 65 SDILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQ-HKTLPLVIPV 123 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG RSPYPYS+ WLD F PA+A KIY+ FPLVDITVV D+EIM HR+MA L L+ Sbjct: 124 LFYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLM 183 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRD+L +D +V L ++ Q+ LFNY+L G F+ +A+R PQ + Sbjct: 184 KHIRQRDMLMCLDNLVRAL-QDIQDEEQITVLFNYLL-NGSEHVTVEFLQTLAQRLPQHE 241 Query: 241 EKLMTIADRLREEGAMQGKH----------------EEALRIAQEMLDRGLDRELVMMVT 284 + +MT+A+RL++EG QG ++A IA+E+ + G+ + +T Sbjct: 242 DSIMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAREIARELRNAGMPAAQICQLT 301 Query: 285 RLSPDDLIAQSH 296 LS +L +H Sbjct: 302 GLSEAELKNITH 313 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 318 bits (814), Expect = 2e-85, Method: Composition-based stats. Identities = 127/323 (39%), Positives = 187/323 (57%), Gaps = 28/323 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT+ + PHD+ FK F+ D ARDF +I+LP ++ LC+L TLKL SFID+ LR Sbjct: 5 MTMQLIARPHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSR 64 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +SD+L+SV+T +G GY Y+++EHQS P++LM +R+M Y+ AM HL G LPLV+P+ Sbjct: 65 FSDMLYSVQTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNALPLVVPI 124 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG +SPYPYS W D F +A +Y + PLVD+TV DDEI+ HRK+A +EL+ Sbjct: 125 LFYHGKQSPYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELVL 184 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDR-QLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH RD L ++ + ++ +++ N N R + + NY+ D + + + E+ Sbjct: 185 KHSTLRDDLIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEGY 244 Query: 240 KEKLMTIADRL------------REEGAMQGKHEEALR-------IAQEM--------LD 272 +E +MTIADRL REEG +GK E IA++ LD Sbjct: 245 QETVMTIADRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSLD 304 Query: 273 RGLDRELVMMVTRLSPDDLIAQS 295 GL +++ +T L ++ A Sbjct: 305 LGLSIDIISKITGLPHSEIQAMR 327 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 315 bits (808), Expect = 1e-84, Method: Composition-based stats. Identities = 120/322 (37%), Positives = 183/322 (56%), Gaps = 27/322 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + PHD+ FK F+ D ARDF ++HLP ++ LC+ TLKL SF+D+ LR Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +SD+L+SV+T +G GY Y ++EHQS P++LM +R+M Y+ AM HL G++ LPLV+P+ Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQGHQSLPLVVPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG +SPYPYS W D F +A +Y + PLVD+TV DDE+M HRK+A +EL+ Sbjct: 121 LFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELVF 180 Query: 181 KHIRQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH R D+ GL +++ +L + + + NY+ D + + + ++ + Sbjct: 181 KHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEKH 240 Query: 240 KEKLMTIADRLR----EEGAMQGKHEEALRIAQEM----------------------LDR 273 +E +M IA RLR E+G +G+ EE + Q++ L Sbjct: 241 QETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLKL 300 Query: 274 GLDRELVMMVTRLSPDDLIAQS 295 GL +++ +T LSP D+ A Sbjct: 301 GLSVDIISQITGLSPSDIHALR 322 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 313 bits (802), Expect = 5e-84, Method: Composition-based stats. Identities = 152/290 (52%), Positives = 211/290 (72%), Gaps = 5/290 (1%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 ++TPHDAVFK FL H +TARDF++IHLP LR+LCDL TL LE SFI+E L+ + +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY G Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHIRQ Sbjct: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 RDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R E +MT Sbjct: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETG-GESMMT 243 Query: 246 IA----DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 +A ++ E+G QG+ E + AQ +L +G+ RE V + L ++ Sbjct: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEI 293 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 313 bits (801), Expect = 6e-84, Method: Composition-based stats. Identities = 130/319 (40%), Positives = 202/319 (63%), Gaps = 23/319 (7%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT +T HDA+FK FL HP+ ARDF +HLPA + LCDL+TL+LEP SF++ LRQ Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--GYKELPLVL 118 +SD+L+SV+ EG GYIY +IEHQSKP+ LM FR+M Y+++A+ +HL K LPLV+ Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 P LFY G PYPYS+ WLD FA+PA+A+++Y+ +FPLVD++V+ D+EI+ H+ +ALLEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 179 IQKHIRQRD-LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 +QKHIR RD L+ ++ I ++ + + Q++++ Y+ G F ++ +P Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSP 240 Query: 238 QEKEKLMTIADRLREEGAMQGKHE--------------------EALRIAQEMLDRGLDR 277 + K L TIA++L ++G +G + ++A+ +L +G+D Sbjct: 241 EYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLLQQGVDL 300 Query: 278 ELVMMVTRLSPDDLIAQSH 296 ++M T L+ + + + H Sbjct: 301 NIIMQCTGLTREKIESLKH 319 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 310 bits (795), Expect = 3e-83, Method: Composition-based stats. Identities = 131/301 (43%), Positives = 192/301 (63%), Gaps = 9/301 (2%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S S PHDA+FK FL H AR F++IHLP +R+ CDL L++ P +FI+ DL YS Sbjct: 1 MSVVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+L S+KT +G GYIY +IEHQS P++ M RMMRY++AA+Q HLD G+ ++PLV+P+LF Sbjct: 61 DVLLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHDVPLVIPILF 120 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 Y G SPYPYS+ WL+ F P +A++I+ +FPLVD+TV+PD+EIM HR +A LE+ K Sbjct: 121 YQGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKI 180 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 IR RD+L +D + +LL +D + +F Y+L+ G+ + + + PQ + K Sbjct: 181 IRLRDILENIDPMATLLALDYNDDLSIDVVF-YLLRYGNTDDREKIVKILIQAKPQLEGK 239 Query: 243 LMTIADRLREEGAMQGKHEEA--------LRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +MTI ++ R+E +G+ E L +AQ ML D +M +T LS +L Sbjct: 240 IMTIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQL 299 Query: 295 S 295 + Sbjct: 300 N 300 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 153/306 (50%), Positives = 211/306 (68%), Gaps = 21/306 (6%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 ++TPHDAVFK FL H +TARDF+DIHLPA LR+LCDL TL LE SFI+E L+ + +D+L Sbjct: 5 STTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHSTDVL 64 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY G Sbjct: 65 YSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQG 124 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHIRQ Sbjct: 125 EATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIRQ 184 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 RDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R K +MT Sbjct: 185 RDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGK-SMMT 243 Query: 246 IA--------------------DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 +A ++ E+G QG+ E + A +L +G+ RE V + Sbjct: 244 LAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPREDVAEMAN 303 Query: 286 LSPDDL 291 L ++ Sbjct: 304 LPLAEI 309 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 299 bits (766), Expect = 8e-80, Method: Composition-based stats. Identities = 130/305 (42%), Positives = 191/305 (62%), Gaps = 18/305 (5%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD +FK FLR PDTARDF+ +HLPA +R L TLKLEP SF+D+ LR+ +SD+L+SV+ Sbjct: 12 HDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLRELHSDVLYSVE 71 Query: 70 TQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 T EG GYIY ++EHQS + +MA+RMMRYS+A M HL G LP+V+P+LFY G Sbjct: 72 TAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGNGTLPVVVPLLFYQGMVR 131 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 PYPYS W+D F PA+AR++YS +PLVD++V+ D ++ HR+MALLEL+Q+ IR RD Sbjct: 132 PYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLELVQRDIRHRDA 191 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG-DAQRFRAFIGEIAERAPQEKEKLM-TI 246 L+ +V L+ Q++A+ Y++ G ++ F+ E+A P+ KE +M TI Sbjct: 192 ASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGEIPEYKELIMGTI 251 Query: 247 ADRLR---------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 A +L+ + +++ + + L A +LD G+ E+V+ T L+ + L Sbjct: 252 AQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEVVIKSTGLNRETL 311 Query: 292 IAQSH 296 H Sbjct: 312 EQPRH 316 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 115/325 (35%), Positives = 176/325 (54%), Gaps = 33/325 (10%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + HDA FK F+ + A+DF IHL L+ CD +TLKL+ +SFID LR Sbjct: 1 MNKPLLISSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSR 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SVKT++G IY +IEHQS+P++++A+RMM Y+ M HL GY LPLV+P+ Sbjct: 61 MSDILYSVKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQGYTSLPLVVPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG R PYP+S+ WLD F +A ++Y + F L+D+ + D+ ++ HRK A++E+ Sbjct: 121 LFYHGKRKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAM 180 Query: 181 KHIRQRDLLGLVDQIVSL-LVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH+ D L + ++S + N +D A+ Y+ DA F + I +IAE+ Sbjct: 181 KHVNSCDDLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDNH 240 Query: 240 KEKLMTIADRLREEGAMQGKHEEA--------------------------------LRIA 267 +E +M IA RL +G GK E +++A Sbjct: 241 RETIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQLA 300 Query: 268 QEMLDRGLDRELVMMVTRLSPDDLI 292 + +L ++ E + +T LS +L Sbjct: 301 KNLLKENVELEFIERITGLSIQELK 325 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 286 bits (733), Expect = 4e-76, Method: Composition-based stats. Identities = 110/301 (36%), Positives = 176/301 (58%), Gaps = 14/301 (4%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 HDA+FK+F + A FI I+LP +++ CD +TLK+EP SF+D DL+Q++SD+L Sbjct: 5 IHNAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDIL 64 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +S+K GY+Y+ +EHQS EELM FRM RY +A MQ HL+ G K+LPLV+ MLFYHG Sbjct: 65 YSLKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKKLPLVISMLFYHG 124 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + YPY L +D + A+ + L+D+ V+PD+EI +H+++A LE++QKHI Sbjct: 125 -KGQYPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQKHIFT 183 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 RDL + D IV L+ + L Y+L G+ I ++ +E +M Sbjct: 184 RDLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKLKT-IEDYEEDIMN 242 Query: 246 IADRLREEGAMQGKHEE------------ALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 A +L+++G +G +E A+ IA++++ G + + +T LS +++++ Sbjct: 243 AAQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEVLS 302 Query: 294 Q 294 Sbjct: 303 L 303 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 283 bits (725), Expect = 4e-75, Method: Composition-based stats. Identities = 132/287 (45%), Positives = 173/287 (60%), Gaps = 14/287 (4%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA+FK FL ARDF+ IHLP +R+ CD TL+LE SFIDE LR SD+L+S+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 T G GYIY VIEHQS+PE+ MAFR++RY +AAMQ HLD G+ LPLV+P+LFYHG P Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDRLPLVVPLLFYHGRSRP 123 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLL 189 YPYSL WLD FA P +A+ +Y FPLVD+TV+PDDEI HR+MALLEL+QKHIR RD+L Sbjct: 124 YPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRDML 183 Query: 190 GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA--PQEKEKLMTIA 247 L +I L L G + ++ + + + Sbjct: 184 ELAREIGLLFERWAA-----------PLSIG-QEDIMTIAEQLKKMGFDEGIQRGIQQGL 231 Query: 248 DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + E+G QG A +IA+ +L G+D+ V T+L ++L Sbjct: 232 AQGLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLETEELEQL 278 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 281 bits (720), Expect = 2e-74, Method: Composition-based stats. Identities = 112/273 (41%), Positives = 171/273 (62%), Gaps = 20/273 (7%) Query: 42 LTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 L+TL + SFI++DL SD+L+S+K+ G YIY +IEHQS PE +MAFR++RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 102 AMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITV 161 AM HL+ K+LP+V+P+LFYHG SPYPY+ WLD FA+ +A +Y AFPLVD+T Sbjct: 63 AMHRHLEQENKQLPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVTA 122 Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 + D+EI++HR+MAL+E++QKHIR R++L L ++ +LL + Q K L Y++ G+ Sbjct: 123 MEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAGN 182 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLR--------------------EEGAMQGKHE 261 F+ +A+ AP +E +MTIA++L +EG GK + Sbjct: 183 TTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKKQ 242 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 L+IA++ L G++R++V M T L+ D+ Sbjct: 243 ATLKIARQFLVNGVERDIVKMSTGLTDRDINDV 275 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 279 bits (714), Expect = 8e-74, Method: Composition-based stats. Identities = 123/304 (40%), Positives = 183/304 (60%), Gaps = 19/304 (6%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 STPHD +FK F AR+F +IHLP+ + K+ +LK+ P SFID+ L+Q +SD+ Sbjct: 2 KISTPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDM 61 Query: 65 LWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 ++S +T G GY+Y V+EHQS +++MAFRM +YS+A MQ HLD G+ LPLVLP+LFY Sbjct: 62 VYSFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDTLPLVLPVLFY 121 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 HG +SPYP+S+ W D F E +AR + S FPLVD+T++P++EIM+H ++ LE+ QK + Sbjct: 122 HGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMV 181 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 RD++ + ++ L ND K+L Y+ Q G+ F ++ ++E + Sbjct: 182 HTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSSTT--QRENV 239 Query: 244 MTIADRLR----------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 MTIA+ L+ EEG +G+ E IA+ +L+ G + V M T LS Sbjct: 240 MTIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLS 299 Query: 288 PDDL 291 D L Sbjct: 300 EDSL 303 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 264 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 106/304 (34%), Positives = 166/304 (54%), Gaps = 14/304 (4%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 ++ TS HDA F+ L+ P ARDF++ L + C+L T++LEP +F+ E LRQ Sbjct: 4 KVNKTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSA 63 Query: 62 SDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 D+L S+KT +G GYIY +IEHQS P++ + RMMRY +A M+ H++ +K P+V+P+ Sbjct: 64 CDVLLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEE-HKCAPVVIPV 122 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIY--SSAFPLVDITVVPDDEIMQHRKMALLEL 178 LFYHG + PYPY + W+D +PA R+IY F LVD++ + DDEI + +MA L Sbjct: 123 LFYHGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMF 182 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 K D++ L+ + ++ L + L + Y+L+ F ++ P Sbjct: 183 TMKSGTSGDVIELIGKSIT-LTDKYGSSVHLNTVLTYLLELYQM-DFAELSEAVSTHYPS 240 Query: 239 EKEKLMTIADRLR--------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 K +MTIA++L E+G +G+ EE R+ M RG E + L+ + Sbjct: 241 HKGVIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDEQ 300 Query: 291 LIAQ 294 L+ Sbjct: 301 LLQA 304 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 264 bits (674), Expect = 3e-69, Method: Composition-based stats. Identities = 98/239 (41%), Positives = 146/239 (61%), Gaps = 1/239 (0%) Query: 25 RDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQ 84 + F IHLP L+ CD +TL+L+ +SFID LR SD+L+ VKT+EG IY++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 85 SKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPA 144 S+P++++A+RMM Y+ M HL GYK LPLV+P+LFYHG + PYP+ + W++ F + Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKSLPLVVPILFYHGKKKPYPFPVNWMECFPLSS 125 Query: 145 IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSL-LVTGN 203 +A IYS+ F L+D+T + DD ++ H+K A++E+ KH+ L + ++S + N Sbjct: 126 LANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQKN 185 Query: 204 TNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEE 262 D A+ Y+ DA F I +IAER +E +M IA RL +G G E Sbjct: 186 CRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEG 244 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. Identities = 101/306 (33%), Positives = 167/306 (54%), Gaps = 15/306 (4%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + T HD +FK L A F+ L + + KL ++ TL+L SF+ + R+ Sbjct: 1 MAM-TIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREI 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSK-PEELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 +SD+++ + E GYI+ ++EH+S ELMAFR ++Y+I+AM + G K+LP+VLP Sbjct: 60 HSDIVYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKKLPIVLP 119 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 + YHG +SPYP+S D F IAR+I F L+D+TV+ D+E+ + L+E++ Sbjct: 120 ICVYHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEML 179 Query: 180 QKHIRQRDLLGL----VDQIVSLLVTGNTNDRQLKALFNYVL---QTGDAQRFRAFIGEI 232 KH R ++ L + ++ I SLL R + Y++ Q + + Sbjct: 180 LKHSRAKNFLSILHRRIEFIQSLLNRFGKEYRWF--VVKYMINETQDESPNAVEQLVQTL 237 Query: 233 AERAPQEKEKLMTIADRLREE----GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 + P+EK +MT A +LR+E G QG++EEA+ IA+ +L G+ + V +T LS Sbjct: 238 STAFPEEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSE 297 Query: 289 DDLIAQ 294 +++ Sbjct: 298 KEVMNL 303 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 246 bits (629), Expect = 5e-64, Method: Composition-based stats. Identities = 116/282 (41%), Positives = 160/282 (56%), Gaps = 17/282 (6%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +TT TPHDA F+ FL PD ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+ Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSI-AAMQNHLDAGYKELPLVLPM 120 SD+L+S+KT G I++ + S+ + F + AAMQ HL+AG+K+LPLV+P+ Sbjct: 63 SDVLYSLKTTAGDD-IFMSWLNTSQHLTNICFPPDTLCVGAAMQRHLEAGHKKLPLVIPV 121 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAF-PLVDITVVPDDEIMQHRKMALLELI 179 LFY G RSPYPYS WLDEF + A R+ LVD+TV+PDDEI HR MA L L+ Sbjct: 122 LFYTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLL 181 Query: 180 QKHIR-----QRDLLGL--VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 ++I Q L G +S+ + GN A + R R+ Sbjct: 182 PENIFISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNI-------RRRSLCTRT 234 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 Q + LMTIA +L ++G +G R ++ G Sbjct: 235 GTACAQHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEG 276 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 239 bits (611), Expect = 7e-62, Method: Composition-based stats. Identities = 80/328 (24%), Positives = 148/328 (45%), Gaps = 39/328 (11%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S PHD FK AR F+ +LP + L DL T+ + +S+ID++L++ +S Sbjct: 1 MSLIHNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY-KELPLVLPML 121 DLL+ VK + GY+Y + EH+S P + +A ++++Y + ++ L +LPL++PM+ Sbjct: 61 DLLFQVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMV 120 Query: 122 FYHGCRSPYP----YSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 YHG + E A+ + I + L D++ D E++ + + ++ Sbjct: 121 VYHGQEKWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIIL 180 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQ------LKALFNYVLQTGDAQRFRAFIGE 231 + I +D + + LL++ + Q + L Y+L T Sbjct: 181 RTMRDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEI 240 Query: 232 IAERAPQEKEKLMTIADRL----------------------------REEGAMQGKHEEA 263 E + + E +MTIA++L REEG +G+ E Sbjct: 241 AKEVSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETK 300 Query: 264 LRIAQEMLDRGLDRELVMMVTRLSPDDL 291 L +A+ +L G++ + V T LS +++ Sbjct: 301 LEVARNLLGLGIEMDKVAKATGLSEEEI 328 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 239 bits (610), Expect = 8e-62, Method: Composition-based stats. Identities = 125/201 (62%), Positives = 160/201 (79%), Gaps = 4/201 (1%) Query: 96 MRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP 155 MRY+IAAMQNHLDAGYK LP+V+P+LFYHG SPYPYSLCWLD FA+P +AR++Y+SAFP Sbjct: 1 MRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAFP 60 Query: 156 LVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY 215 L+D+T++PDDEIM HR+MALLELIQKHIRQRDL+GLV+Q+ LL +G N RQ+K LFNY Sbjct: 61 LIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFNY 120 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 +LQTGDA RF FI +A+R+P+ K LMTIA+RLR+E G+ +AL IA+ ML+ G+ Sbjct: 121 ILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLRQE----GEQSKALHIAKIMLESGV 176 Query: 276 DRELVMMVTRLSPDDLIAQSH 296 +M T +S ++L A S Sbjct: 177 PLADIMRFTGVSEEELAAASQ 197 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 74/308 (24%), Positives = 143/308 (46%), Gaps = 20/308 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD FK + A+DF+ +LP L K+ D+ TL E +I++DL++ +S Sbjct: 1 MGIIHQPHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH-LDAGYKELPLVLPML 121 DLL+ GY+Y + EH+S P + +A +++ Y + + L +++P+++PM Sbjct: 61 DLLFKANINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMT 120 Query: 122 FYHGCRSPYPYSLCWLDEFA-----EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 YHG + + +L D I + I + + D++ DDE+ ++ ++ Sbjct: 121 VYHG-KENWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIV 179 Query: 177 ELIQKHIRQRD--LLGLVDQIVSLLVTGNTND---RQLKALFNYVLQTGDAQRFRAFIGE 231 I + I + D + + V +L + K Y+L Sbjct: 180 IKILRSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDL 239 Query: 232 IAERAPQEKEKLMTIADRL--------REEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 + E + + +++MTIA+ L E+G +GK EE +A+ ++ G++ + VM Sbjct: 240 VKEVSVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKA 299 Query: 284 TRLSPDDL 291 T LS +++ Sbjct: 300 TGLSEEEI 307 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 83/297 (27%), Positives = 158/297 (53%), Gaps = 14/297 (4%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD++ K + A++F++ +LP +KL DL+ + +E S+I+E L + YSD+++ ++ Sbjct: 7 HDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGIE 66 Query: 70 TQE-GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 T+E G G++Y++IE QS + A R+ +Y++ + H +LPLV ++ Y+G + Sbjct: 67 TKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERH-KEKRNKLPLVYNLVIYNGKQV 125 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 Y D F +A+K+ + LVD+ + D+EI++ + + +L+ I KHI +RD+ Sbjct: 126 -YNAPRNLWDLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERDM 184 Query: 189 LGLVDQIVS-----LLVTGNTNDRQLKALFNYV-LQTGDAQRFRAFIGEIAERAPQEKEK 242 + L +Q ++ +++ LK+ Y + Q+ R +PQ K+ Sbjct: 185 IQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKDN 244 Query: 243 LM-TIADRLREEGAMQGKHEE----ALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +M TIAD +EG +GK E A+ IA++M +G ++ +T L + + Sbjct: 245 IMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSI 301 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 89/304 (29%), Positives = 161/304 (52%), Gaps = 12/304 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + S PHD + K+ L HP+ ++F + PA + K DL +LKL S++ E+LR++++ Sbjct: 6 KNDLSNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHN 65 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPM 120 DL++S + GY + V+EHQS P+ LMA R ++Y+IA ++ ++ ++ P+++ + Sbjct: 66 DLVFSFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNI 125 Query: 121 LFYHGC-RSPYPYSLCWLDEFAEPAIARKI-YSSAFPLVDITVVPDDEIMQHRKMALLEL 178 YH PYPYS D F +P A+ + + F L D+ P++ + QH + L+E Sbjct: 126 CLYHNANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEK 185 Query: 179 IQKHIRQRDLLGLVDQIVS-----LLVTGNTNDRQLKALFNYVLQTGD-AQRFRAFIGEI 232 + K+ R RD+ ++++ + L+V G+ L + Q + + E+ Sbjct: 186 LLKYSRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEV 245 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 + E+E ++TIA + E G M+GK E + IA+ ML +G + + +T LS D+ Sbjct: 246 --LSKNEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIE 303 Query: 293 AQSH 296 Sbjct: 304 KLKQ 307 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 234 bits (596), Expect = 4e-60, Method: Composition-based stats. Identities = 104/206 (50%), Positives = 145/206 (70%), Gaps = 4/206 (1%) Query: 91 MAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIY 150 M FRM+RYS+AAMQ HL+ +K LPLV+P+LFYHG RSPYPYS+ WLD F EPA+A KIY Sbjct: 1 MPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKIY 59 Query: 151 SSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLK 210 + FPLVDITVV D+EIM HR+MA L L+ KHIR RD++ L+D++ ++V +D Q++ Sbjct: 60 TKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMV--EISDEQVR 117 Query: 211 ALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 L +Y++ GD+ F+ +AER PQ ++KLMTIA+RL ++G +G E+AL IA ++ Sbjct: 118 VLIHYIVNAGDSVSPE-FMRALAERLPQHEDKLMTIAERLEQKGRQEGALEKALAIACQL 176 Query: 271 LDRGLDRELVMMVTRLSPDDLIAQSH 296 G+ E + T LS +L +H Sbjct: 177 QKMGMTPEQIKQATGLSEAELKNITH 202 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 232 bits (593), Expect = 9e-60, Method: Composition-based stats. Identities = 76/304 (25%), Positives = 131/304 (43%), Gaps = 14/304 (4%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T+ +PHDA+FKS + P A + L P+ D +TL+ EP S+IDE L + +SDL Sbjct: 4 TSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERHSDL 63 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLPMLFY 123 L+S Y+Y++IEHQS + M RM+ Y H A LP +LP++ Sbjct: 64 LFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPVVVS 123 Query: 124 H---GCRSPYPY-SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 H G +P + SL P + I + D+T + D ++ + L+ Sbjct: 124 HAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFATLV 183 Query: 180 QKHIRQR-------DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 +R R D + + + + + +F+Y+ + + F ++ Sbjct: 184 LWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFHAKL 243 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 E PQ +E + T + L EEG +G + ++ L L+ +++ + DL Sbjct: 244 DEHVPQTREVMKTYYEELMEEGMAKGLAKGREEGREQSRIETLQETLIDLLS--AKFDLR 301 Query: 293 AQSH 296 H Sbjct: 302 ELEH 305 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 230 bits (587), Expect = 4e-59, Method: Composition-based stats. Identities = 93/340 (27%), Positives = 163/340 (47%), Gaps = 58/340 (17%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA+ K L A++F++ +LP+ ++L DL +K+E SF+++DL++ YSD+++SVK Sbjct: 7 HDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSVK 66 Query: 70 TQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 T++ ++YV+IE QS + +A R+ +Y + + H + K LPL+ P+L Y+G Sbjct: 67 TRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERHENNKNK-LPLICPLLIYNGSEV 125 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 Y + + F +P A+K+ + LVD+ DDEI Q + + ++E KHI QRD+ Sbjct: 126 -YNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRDM 184 Query: 189 LGLVDQIV-----SLLVTGNTNDRQLKALFNYV---LQTGDAQRFRAFIGEIAERAPQEK 240 L L D+ + S+++ + L++ Y + Q I + + +EK Sbjct: 185 LKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQII--VKHLSTEEK 242 Query: 241 EKLM-TIADRLREEGAM------------------------------------------- 256 + +M TIA + +EG Sbjct: 243 DNIMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGKAEGKAEGK 302 Query: 257 -QGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +GK EE + IA++ML +G D + VT L + + S Sbjct: 303 AEGKAEERVEIARKMLSQGCDFSFISSVTGLEEAFIRSLS 342 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 229 bits (584), Expect = 9e-59, Method: Composition-based stats. Identities = 81/283 (28%), Positives = 135/283 (47%), Gaps = 15/283 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S+ +PHDAVF+ L P A + LPA L DL L + P S +D LR ++ Sbjct: 1 MSSPPSPHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRHT 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--ELPLVLPM 120 DLL++ +IYV++EHQS + LMAFRM+RY + +L +K LP V+P+ Sbjct: 61 DLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVPL 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIA----RKIYSSAFPLVDITVVPDDEIMQHR----- 171 + +H + + P +A + F L D+ V + E+ + Sbjct: 121 VVHHNEHAWVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHSV 180 Query: 172 --KMALLELIQKHIRQ-RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 + LL+++ + R +DL VD++ ++L G + L Y+ G+A Sbjct: 181 RLTLLLLKIVPGNPRLAQDLRPWVDELRAVL-DGPDGREEFATLLRYIELVGEADARDEL 239 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEML 271 IA P+ ++ MTIA+ LR EG ++G+ E + ++L Sbjct: 240 HDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVESLLQLL 282 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 229 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 82/280 (29%), Positives = 128/280 (45%), Gaps = 14/280 (5%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HDA+FK+ + A + LP L D L+L P SF+DE L++ SDLL+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--GYKELPLVLPMLFYHG 125 E +Y++ EHQS E LMAFR++RY + ++HL G K LP +LP++ +H Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHHS 131 Query: 126 CRSPYPYS----LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA---LLEL 178 + L LDE A + + F L DI+ D+ + A L+ Sbjct: 132 ETGWTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVLW 191 Query: 179 IQKHIRQRD----LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE-IA 233 +H R+ D LG +V+ + L+A++ Y+L T + + +A Sbjct: 192 CLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLLA 251 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 KE++++ AD+L E G QG E ML + Sbjct: 252 AAGEPWKEEIVSAADQLMERGRQQGLREGLREGRCHMLLK 291 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 79/301 (26%), Positives = 123/301 (40%), Gaps = 28/301 (9%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 S TS PHDA+F++ HP A + LP L L D + L+ N + L + +D Sbjct: 13 SVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRTD 72 Query: 64 LLWSVKT-----QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 LL+S +G +Y+ IEHQS+ + M R++ Y + + H LP V Sbjct: 73 LLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHGGALPPVF 132 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEP-----AIARKIYSSAFPLVDITVVPDDEIMQHRKM 173 ++ H + + ++ F EP IA + + D+ D E+ Sbjct: 133 CVVLSHAAKG-WTGPRSLVELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARHAH 191 Query: 174 ALLELIQKHIRQ--------RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF 225 L L +R LL DQI++LL + +R L L YV G F Sbjct: 192 PLPALTLWLLRDARSPERLVHRLLDWRDQIIALL-DYDHGERDLAQLLRYVALVGSEMDF 250 Query: 226 RAFIGEIAERAPQEKEKLMTIADRL--------REEGAMQGKHEEALRIAQEMLDRGLDR 277 F +A P+ + MTIA++L RE+G +G+ E L +E G + Sbjct: 251 EEFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREGRLEGQREGRAVGFEE 310 Query: 278 E 278 Sbjct: 311 G 311 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 226 bits (576), Expect = 7e-58, Method: Composition-based stats. Identities = 76/303 (25%), Positives = 142/303 (46%), Gaps = 33/303 (10%) Query: 2 TISTTSTP-HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 + +T+ P HD +FK + P AR+F++ +LP + +L ++K+E SF+ EDLR+ Sbjct: 32 SSNTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKR 91 Query: 61 YSDLLWSVKTQ--------------EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH 106 SD+++SV + Y+YV+IEHQS + +AFR+ +Y + + H Sbjct: 92 LSDVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERH 151 Query: 107 L----------DAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPL 156 +LPL+ P++ Y + PY + + F + A+ + + L Sbjct: 152 KDANNNKSSVTKEKDNKLPLICPIVVYANDK-PYNAPRSFWELFEDSKTAKDMMGDEYLL 210 Query: 157 VDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKAL-FNY 215 VD+ DDEI + + + ++E + KHI+ RD+L L ++ + D++ + + Sbjct: 211 VDLQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKW 270 Query: 216 VLQTGDAQRFRAFIGEIAERAPQE------KEKLMTIADRLREEGAMQGKHEEALRIAQE 269 +L DA+ E+A + +E + TIAD+ +EG +G + Sbjct: 271 LLWYSDAKVSEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKGMVQGMQIGEAR 330 Query: 270 MLD 272 + Sbjct: 331 GMQ 333 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 226 bits (576), Expect = 9e-58, Method: Composition-based stats. Identities = 81/307 (26%), Positives = 152/307 (49%), Gaps = 25/307 (8%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD + +S +P +++F ++HLP ++ L LK+E +SF+D+ L++ D+L+S K Sbjct: 7 HDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILFSAK 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-GYKELPLVLPMLFYHGCRS 128 E GY+Y+++EHQS PE MA R+ RY + H + K+ P + P++FY+G + Sbjct: 67 FGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFYNGVQK 126 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 Y + F + + +S + L+++ +PD+++ + +L+ KHI +RDL Sbjct: 127 -YNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHERDL 185 Query: 189 LGLVDQIVSLLVTGNTND---RQLKALFNYVLQTGDAQRFRAFIGEIAERA-PQEKEKLM 244 L +++ LL D ++ + Y L + + P+++E +M Sbjct: 186 LKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKKRENVM 245 Query: 245 -TIADRLREEGAMQGK------------------HEEALRIAQEMLDRGLDRELVMMVTR 285 +IA ++G + K EE + +A+EM+ G E V+ +T+ Sbjct: 246 KSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESVIKITK 305 Query: 286 LSPDDLI 292 LS +DL Sbjct: 306 LSKEDLE 312 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 225 bits (574), Expect = 1e-57, Method: Composition-based stats. Identities = 81/333 (24%), Positives = 150/333 (45%), Gaps = 52/333 (15%) Query: 13 VFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQE 72 +F+ L +P A +F + HLP ++ L D +L +E +F++ L+ SD+L+S K + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 73 GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYHGCRSPY 130 GY+++++EHQSK + MAFR+ +Y I + +L + K LPL+ PM+F++G + Y Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNG-QEKY 141 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 + D F +A++++ + + LV++ +PD+E Q +LE KHI +R+LL Sbjct: 142 NVARNLWDLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 191 LVDQ---IVSLLVTGNTNDRQLKALFNYVLQTGDAQRF----------------RAFIGE 231 + I+ L L+ + Y L + + Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTKIEQADKIKLKNLLSTKLNPEIGTRLMRS 261 Query: 232 IAERAPQEK------------------------------EKLMTIADRLREEGAMQGKHE 261 +AE QE E + + EG +G++ Sbjct: 262 LAEHWQQEGKEIGILEGLQVGEAKGIQIGEAKGIQIGKAEGIQIGKAEGKAEGKAEGEYN 321 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +A+ +A++ML +G + L+ VT L + + Sbjct: 322 KAVEVAKKMLTQGCNVSLISSVTGLDEAFISSL 354 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 220 bits (561), Expect = 4e-56, Method: Composition-based stats. Identities = 71/275 (25%), Positives = 125/275 (45%), Gaps = 6/275 (2%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD K+ L PD + LP + +L L +FID + R++ + Sbjct: 1 MTKITQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREHLT 60 Query: 63 DLLWSVKTQEGVG-YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D L+ VKTQEG YIY +IEH+S +E +AF+++RY + + L G ++LP ++P++ Sbjct: 61 DRLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQQKLPPIVPLV 120 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 YHG R + A+ + + +F + D+ + DD++ Q + + K Sbjct: 121 VYHGAREWTVPNQFSALLEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALMAMK 180 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQL-KALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 + Q G+V ++ + G D + K + Y++QT E P E Sbjct: 181 YAFQG-AEGVV--VIPQIGKGAQGDPEFAKLVLRYLIQTYRGMTMADVQAYAEEAFPGEA 237 Query: 241 EKLMT-IADRLREEGAMQGKHEEALRIAQEMLDRG 274 E + A + +G +G+ E QE G Sbjct: 238 EHYASQFAREMMSKGRQEGRQEGRREGRQEGRQEG 272 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 77/346 (22%), Positives = 155/346 (44%), Gaps = 63/346 (18%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD +FK + P A DFI+ LP ++ + DL T+K+E SF++ +LR+ D+L+SVK Sbjct: 7 HDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDVLFSVK 66 Query: 70 TQ-EGVGYIYVVIEHQSKPEELMAFRMMRYSIAA------MQNHLDAGYKELPLVLPMLF 122 T+ +IYV+IE + + + +AF++ +Y+++ +LP+V+P++ Sbjct: 67 TKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIVVPIVV 126 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 YHG + + F +P +A+++ S + L+D +PD EI + AL+ ++ Sbjct: 127 YHGADR-FNAPRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVHFMKYI 185 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQ-----LKALFNYVL---QTGDAQRFRAFIGEIAE 234 Q D++ L + + L D++ +++L Y + + R + + E Sbjct: 186 HNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQLLDE--N 243 Query: 235 RAPQEKEKLM-TIADRLREEGAMQGKHEEALR---------------------------- 265 + ++++++M TIA + +EG +G+ E Sbjct: 244 LSIEDRDRIMGTIAAQYIDEGKAKGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAE 303 Query: 266 ----------------IAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +A+ +L G E + T LS ++++ Sbjct: 304 GIEIGETKGRAEAAQGLARNLLKAGFSVEFIAENTGLSNEEVVNLK 349 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 208 bits (529), Expect = 2e-52, Method: Composition-based stats. Identities = 63/287 (21%), Positives = 125/287 (43%), Gaps = 10/287 (3%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 PHD FK + A DF+ P + K DL+TL + +S+IDE+L++++SD+ Sbjct: 2 EILNPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDI 61 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 +++ ++ I ++ EH+S ++M+Y + + + + +P V+P++ YH Sbjct: 62 VYTCFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQAQRLIP-VIPVILYH 120 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIM----QHRKMALLELIQ 180 G + E + R I + L DI+ ++EI + + + L+ Sbjct: 121 GKEAWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLLM 180 Query: 181 KHIR-QRDLLGLVDQIVSLLVTGNTNDRQLKAL---FNYVLQTGDAQRFRAFIGEIAERA 236 ++I ++ L + + + D LK L Y+ D + I + E + Sbjct: 181 RNIFDEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAE-KRVIDTLKEIS 239 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 + + MTIA +L E+G + G+ E E G + + + Sbjct: 240 EEGGKLSMTIAAKLIEKGKIAGRVEGRAEGRAEGAIEGERKGRIEGL 286 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 207 bits (527), Expect = 4e-52, Method: Composition-based stats. Identities = 81/302 (26%), Positives = 129/302 (42%), Gaps = 18/302 (5%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 HD++ K+ D A D LP + + DL L L P SF+ ++LRQ ++DLL Sbjct: 2 PHDSHDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLL 61 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFY 123 + ++Y+++EHQS E +M R++RY + + HL G LP +LP++ + Sbjct: 62 FRAPLDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLH 121 Query: 124 HGCRSPYPYS----LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA---LL 176 H + + L L + A A+ + F L D++ PD+ ++ A L Sbjct: 122 HSEQGWTAPTSLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKLA 181 Query: 177 ELIQKHIRQ-RDLLGL---VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 K+ R +DLL L ++ VT L A+ Y LQ D I Sbjct: 182 LWALKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTDPDALMRFLI 241 Query: 233 AERAPQEKEKLMTIADRL----REEGAMQGKHEEALRIAQEMLDRG-LDRELVMMVTRLS 287 KE MT A++L RE+ QG+ E + E G ++ + T LS Sbjct: 242 DSAGDPAKEAFMTGAEKLTQAVREQSLRQGRVEGRVEGRVEGRVEGRVEGRTEALRTVLS 301 Query: 288 PD 289 Sbjct: 302 KQ 303 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 207 bits (526), Expect = 4e-52, Method: Composition-based stats. Identities = 69/294 (23%), Positives = 140/294 (47%), Gaps = 11/294 (3%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD FK P+ DF++ P +R+ D TTL E ++F DE L ++++DL++S Sbjct: 7 NPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHFADLVFS 66 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 V+ + +++EH+S EE F++ RY + ++ + + L VLP+L YHG R Sbjct: 67 VQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQK-QPLTPVLPVLVYHGNR 125 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI--MQHRKMALLELIQKHIRQ 185 S+ + + + + L+D++ + D+ + +Q L ++ ++ R+ Sbjct: 126 RWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAILLQNSRR 185 Query: 186 R----DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 + LL +V L R + F Y+ T + + F G + + + + Sbjct: 186 KRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVELF-GIFSRISSKIES 244 Query: 242 KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 MT+A+ L +EG + + +A+E++ +G + E + ++ ++L+ Q Sbjct: 245 STMTVAEELIQEGRELERRQTR-MVAEELIQQGRELERRQAM--MAAEELLKQQ 295 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 60/213 (28%), Positives = 112/213 (52%), Gaps = 6/213 (2%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD F+ L +P AR+F + +LP ++ L TTL LE +SFID +L++ +D+L+S + Sbjct: 56 HDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSAR 115 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCR 127 YIY++ EHQS + MAFR+ +Y + + HL + K+ P + P++ Y Sbjct: 116 INNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSNDH 174 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 Y L D F + + +S+ + L+ + + DD++ ++ +A L+++ K+I + + Sbjct: 175 KKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKPN 234 Query: 188 LLGLVDQIVSLLVTGNTND---RQLKALFNYVL 217 + +I L T + +K+ +Y L Sbjct: 235 VFDKWQEISGCLATIAASSSGIEYIKSALSYSL 267 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 68/309 (22%), Positives = 126/309 (40%), Gaps = 27/309 (8%) Query: 14 FKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEG 73 +K HP+ RD + + + D +TL+ S+I EDLR D++W V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 74 VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG----YKELPLVLPMLFYHGCRSP 129 Y+Y+++E QS + MA R+M Y Q+ + +LP VLP++ Y+G + Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 130 -YPYSLCWLDEFAEPAIARKIYSSAFPLVDIT-VVPDDEIMQH-RKMALLELIQKHIR-Q 185 ++ L E + R + A+ L+D V+ D E H R +A +H R + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 186 RDLLGLVDQIVSLLVTGNTN---DRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK-- 240 +D+L ++ +V L + + +L E+ + Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 241 -EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDR-------------ELVMMVTRL 286 E++ +R E+G +G+ E QE RG+++ E + T L Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 287 SPDDLIAQS 295 + ++ Sbjct: 313 TVAEVEGLR 321 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 203 bits (517), Expect = 5e-51, Method: Composition-based stats. Identities = 71/294 (24%), Positives = 138/294 (46%), Gaps = 15/294 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD K+ L +P TA + LP + + +L SFIDE LR + + Sbjct: 1 MTEIAHPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLT 60 Query: 63 DLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAA---MQNHLDAGYKELPLVL 118 D L+ V+T G +YV+IEH+S P+ + +++++Y + A + + ++ LP ++ Sbjct: 61 DRLYRVRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERE-NPAWERLPAIV 119 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 P +FYHG + AE + + F ++D+ + D ++ + + L Sbjct: 120 PFVFYHGAAAWKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLL 179 Query: 179 IQKHIRQRDL-LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA- 236 K+ + D L + + ++ LV+ D + + L YV++T + + EI R Sbjct: 180 AAKYATRDDRQLEVKELLIQTLVS--VADEEFRFLMRYVVETYRSYD-EPMVREIIRRVR 236 Query: 237 PQEKEKLMT-----IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 P+E+E +M+ + + R+EG +G+ E + RG E M+ + Sbjct: 237 PEEEETMMSMFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEEAAYMLLK 290 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 202 bits (515), Expect = 9e-51, Method: Composition-based stats. Identities = 59/323 (18%), Positives = 129/323 (39%), Gaps = 31/323 (9%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + HD +K H +T +F+ L + L L S+I D + Sbjct: 1 MKNNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEE 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------- 112 SD+L+ + YV++E QSK + M R++ Y ++ L K Sbjct: 61 ESDILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNF 120 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARK-IYSSAFPLVDITVVPDDEIMQHR 171 +LP ++P++ Y+G ++ + + + + + + I + L DI D E++ Sbjct: 121 KLPSIVPIVLYNG-KNKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNIS 179 Query: 172 KM-ALLELIQKHIRQRDLL---------------GLVDQIVSLLVT--GNTNDRQLKALF 213 M + + L+ + I +++L+ L L+ Sbjct: 180 NMISAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDNLQGEI 239 Query: 214 NYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 + VL+ + + + + + + ++K + +R ++G QG + + A++ ++ Sbjct: 240 DDVLEKSNQEEVDFMVSNLGKTIERMQDKAI---ERGLKKGIEQGIEQGIEQTAKKAIEM 296 Query: 274 GLDRELVMMVTRLSPDDLIAQSH 296 G+D E++M +T LS + + Sbjct: 297 GMDNEIIMNLTGLSEEQINTIRQ 319 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 74/274 (27%), Positives = 136/274 (49%), Gaps = 12/274 (4%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 PH+A FK F + P+ + FI H+P + L DL TL+++ + F+ E+ R+YY+D+ Sbjct: 4 EIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYADV 63 Query: 65 LWSVKTQE--GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPM 120 + +V+ + IY+++EH+S PE L +++ Y + + G LP+++P+ Sbjct: 64 MVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVIIPV 123 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAF--PLVDITVVPDDEIMQHRKMALLEL 178 + YHG + + +S + D F P+ + + F + DI+ + DDE + + L Sbjct: 124 VIYHG-KGRWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIFHL 182 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDR---QLKALFNYVLQTGDAQRFRAFIGEIAER 235 + K+I +L + +I LL T D+ L+A+ YV G R +GE R Sbjct: 183 LFKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISLER--LGEYTRR 240 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQE 269 P E + T A ++R+E + E+ + + Sbjct: 241 LPGGDEAMQTAAQQIRQEAYNEFIQEQEKMLVER 274 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 69/298 (23%), Positives = 145/298 (48%), Gaps = 14/298 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + + HDA ++ + + A D+ +P ++ L D +TL+ P++++ ++L++ S Sbjct: 1 MDKHTPKHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSIS 60 Query: 63 DLLWSVKTQEGVGYIYV--VIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP-LVLP 119 D+++ + G G + + ++EH+S ++ ++ Y + + + G KE P L++P Sbjct: 61 DIVYVCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQI--GNKESPSLIIP 118 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI--MQHRKMALLE 177 +L YHG ++ L E EPA+ + I + D+ + D+EI + ++ +A Sbjct: 119 ILLYHGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASL 178 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL-KALFNYVLQTGDAQRFRAFIGEIAERA 236 L K+ +D L + + ++L + DR L K+L Y L G+ F+ I Sbjct: 179 LAMKYSALKDQLNTL--LPTILTLASEVDRNLHKSLLFYTL-VGNPLTEEQFLNLIKSVP 235 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 Q+KE +M I + E+G +G E A++ ++ + R L+ L+ + + + Sbjct: 236 NQKKEAIMDIFEIFEEKGWKKGIEEGRAE-AEQKIETAV-RNLIKQSV-LTDEQIASA 290 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 81/298 (27%), Positives = 147/298 (49%), Gaps = 19/298 (6%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSF-------IDEDLRQY 60 HD+VFK + + D A F+ +LP L +L D T+KLE + D ++ Sbjct: 3 NVHDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQKE 62 Query: 61 YSDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--KELPLV 117 SDL + K ++G G ++V IE Q+ + + R Y + + +++ K LPLV Sbjct: 63 QSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLPLV 122 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + +++Y + P+ +SL D FA +A+K Y+ +D+ D+EI++H +A E Sbjct: 123 VSIIYY-ANQKPFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAGYE 180 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 LI K IR++++ G +D ++ + + RQ+ L Y+ Q D + + F ++ P Sbjct: 181 LILKAIREKNIDGKLDIAINQIEAYDHIARQV--LIRYMSQYSDMET-KDFHDKLIYSKP 237 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + +MT+A++ ++G +G A+ L GL E V+ T L D ++ Sbjct: 238 DLRGDVMTVAEQWEQKGIQKGIQ----TTARNFLLMGLSAEQVVKGTGLDQDTVLKLK 291 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 197 bits (502), Expect = 3e-49, Method: Composition-based stats. Identities = 71/306 (23%), Positives = 130/306 (42%), Gaps = 25/306 (8%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +K HP+ D I L L CDL+TL+ S++ +DLR+ D++W + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY---KELPLVLPMLFYHGC- 126 + +Y++IE QSKP+ M R+M Y Q+ + +G +P ++P++ Y+G Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGEI 124 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 P+ + + +P ++R I S + L+D + +M+ R +A + Sbjct: 125 PWKVPHDIRETIQMPKP-VSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSSGP 183 Query: 187 -DLLGLVDQIVSLLVTGNTND---RQLKALFNYVLQTGD--------------AQRFRAF 228 +L L ++ + T D R F L+ D A+R + Sbjct: 184 LELFELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAERVNKW 243 Query: 229 IGEIA--ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 I + R ++E R EG ++GK E I + M ++G+ + +T L Sbjct: 244 IAQYKAEGRKEGKEEGKKEGLLEGRVEGKLEGKLEGMATILKRMKEKGMSVTEIATITGL 303 Query: 287 SPDDLI 292 D++ Sbjct: 304 PEDEIQ 309 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 196 bits (499), Expect = 6e-49, Method: Composition-based stats. Identities = 63/268 (23%), Positives = 129/268 (48%), Gaps = 20/268 (7%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + PH+ FK + +DF+ I L + L + L++L+L P+ + +++ Sbjct: 1 MKNKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKH 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 + DLL+ K + YI ++ EH+S ++ + ++M+Y+ + L P ++ + Sbjct: 60 FLDLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKEK-DYYPPIINI 118 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK----MALL 176 +FYHG ++ + + D + + + I + L+D+ + D+ + ++ K + + Sbjct: 119 VFYHG-QAKWNFPTTIPD-IEDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLIME 176 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVT--GNTNDRQLKALFNYV-LQTGDAQRFRAFIGEIA 233 LI KHI R +++I +LL ++ + NY+ L D ++ + EI Sbjct: 177 MLIMKHIHDR-----LERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKEII 231 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHE 261 +EK+M D+L+ EG M+GK E Sbjct: 232 ----GGEEKMMLFTDKLKMEGKMEGKIE 255 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 196 bits (499), Expect = 6e-49, Method: Composition-based stats. Identities = 76/302 (25%), Positives = 137/302 (45%), Gaps = 13/302 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD + + + A F LP + +L DL L+L +SF+ E+L+Q + Sbjct: 1 MTEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQT 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 DLL+ + + G +Y++ EH+S E + +++ Y +N +G + +V+P + Sbjct: 61 DLLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRNQQRSG-ESFSVVIPFV 119 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL--- 178 FYHG + + + D+F ++ P I + + I +K+ + Sbjct: 120 FYHGEKE-WKLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVT 178 Query: 179 --IQKHIRQRDLL--GLVDQIVSLLVTGNTNDRQ---LKALFNYVLQTGDAQRFRAFIGE 231 + + IR+RDL + + SLL+ ++ L+ L Y+ D + Sbjct: 179 LGVVQRIRERDLEFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRVL 238 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 + Q +E MT A+RL EG QGK E + A+ ML + E V+ +T LS DL Sbjct: 239 AISKLEQYEELTMTTAERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSKQDL 298 Query: 292 IA 293 Sbjct: 299 KD 300 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 113/258 (43%), Gaps = 6/258 (2%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ S HD FK+ + RDF+ LP + + D +L+ I + + Sbjct: 1 MNEISGLHDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHM 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 DL+ + E Y++IEH+S P+ + +M+RY +A + K L VLP++F Sbjct: 61 DLVVECRISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRN-RQDNKPLVPVLPLVF 119 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV-DITVVPDDEIMQHRKMALLELIQK 181 + G R P+ + + + F P + PL+ D++ V I + A ++ Sbjct: 120 HQGGR-PWTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLT 178 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDR-QLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 ++ + D + +L TG + D L + NY ++ + + + + R+ + Sbjct: 179 LLKYAFSGSVEDVLRALKETGGSFDETFLFGVLNYAIRAFEVKDP--VVVDAISRSFGGE 236 Query: 241 EKLMTIADRLREEGAMQG 258 + + +I D EEG +G Sbjct: 237 KIMPSIIDEWVEEGLKEG 254 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 194 bits (494), Expect = 2e-48, Method: Composition-based stats. Identities = 63/266 (23%), Positives = 125/266 (46%), Gaps = 9/266 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S TS HD F++ L ARDF+ HLP + + +L T+K+ S++ ++L++ + Sbjct: 7 MSDTSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMT 66 Query: 63 DLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D++ +++ G IY+++EH+S + ++ +Y Q+ + LP+++P++ Sbjct: 67 DIVITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQKKTGTLPIIVPLV 126 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIA--RKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 FYHG + + YSL + D F P+ + I L ++ V+ ++ + + L+ Sbjct: 127 FYHG-TARWNYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLV 185 Query: 180 QKHIRQRDLLGLVDQIVSLLVTG---NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 ++I + + + + LL G L Y+L D A E + Sbjct: 186 LEYIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATDETPEEA--EEKVKHL 243 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEE 262 P+ E + T A+ L E G + E+ Sbjct: 244 PKGGETVRTTAEVLEERGYNKAIKEK 269 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 194 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 70/296 (23%), Positives = 124/296 (41%), Gaps = 15/296 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 PHD FK + ARDF+ +LP ++ DL L E NS +DE+LR+ S Sbjct: 2 NELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLS 61 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+L+ K + GYIY+++EH+S E + F+++RY + + D K++P+++PM+ Sbjct: 62 DMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYDPKTKKVPIIIPMVI 121 Query: 123 YHGCRSPYPYSLCWLDEFA-----EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 YHG R + L+ + + + + + D ++ I+ M + Sbjct: 122 YHG-REIWNVETNLLNMVQGIEDLPNELKTYLPTYRYEICDFSIKRKKRIIGLTAMKVAI 180 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVT-----GNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 + +++ + + Y+L + + Sbjct: 181 EAMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEILKVQ 240 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHE----EALRIAQEMLDRGLDRELVMMVT 284 E P E +MTIA++LR EG +GK E L +E R L + +T Sbjct: 241 KEIMPGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEGEREFAIRILSKRFGNQLT 296 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 193 bits (491), Expect = 6e-48, Method: Composition-based stats. Identities = 52/281 (18%), Positives = 120/281 (42%), Gaps = 20/281 (7%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T PHD FK P + +DI P L + DL +++L + + + + +L Sbjct: 2 TDLQPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNL 60 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 L+ K + ++ ++ EH+S ++ + +++ Y+ + Y+E P ++ ++ YH Sbjct: 61 LYECKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYEEYPPIINIVLYH 118 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL----ELIQ 180 G R + L + I R + L+D++ V D+E++ + L Sbjct: 119 GKRKWNIPAT--LPKTNSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLTM 176 Query: 181 KHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 KHI + R ++ ++ + D + + +Y+ + Q + EI Sbjct: 177 KHIFEDLRKYKHILKKVFE-----HYQDGCVFIILDYISVVNNPQEVENVLKEIL----G 227 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 ++ +MT+ ++ + EG QG + + ++ + + + + Sbjct: 228 GEKDMMTLTEKWKMEGLQQGLQQGMIEGQKKAILKSIQLKF 268 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 193 bits (490), Expect = 7e-48, Method: Composition-based stats. Identities = 65/332 (19%), Positives = 130/332 (39%), Gaps = 45/332 (13%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M +D FK + +F+ + P DL +L+ SF+ ++ + Sbjct: 1 MEQKPPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLP 119 +D+++ K ++ Y YV++E QS ++ M R+ Y Q H++ + L ++P Sbjct: 61 EADVIYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVP 120 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 ++ Y+G + +L F I + + LVD+ + D+ + + LL +I Sbjct: 121 IVLYNGRSNWNVPTLI----FKGWEIFKDDM-FNYFLVDVNNIDDETLKNR--LDLLSVI 173 Query: 180 QKHIRQR----DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 R R + + + ++ + T Q+K ++L+ Q GEI E Sbjct: 174 LYLDRSRKTAKEFIEKLKEVTEYISCLPT--EQVKVFAMWLLRVIRPQMMEEVQGEIDEL 231 Query: 236 APQ-EKEKLMTI------ADRLRE------------------------EGAMQGKHEEAL 264 + E+E + + RL + EG ++G+ E + Sbjct: 232 LKRIEQEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATI 291 Query: 265 RIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 RIA+ M+ G + + VT L + + Sbjct: 292 RIARNMILAGAEDSFISKVTGLDIEKIKELRQ 323 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 69/303 (22%), Positives = 117/303 (38%), Gaps = 17/303 (5%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 + HD +K P+ RD I +P D +TL+ P S++ ED D++W Sbjct: 2 ANTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVW 61 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLF 122 VK Y+Y++IE QS ++ MA RMM Y Q+ + G LP VLP++ Sbjct: 62 RVKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIVL 121 Query: 123 YHGCRSPYPYSLCWLDEFAEPAI-ARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 Y+G + + + P + + + L+D D E+ + + + Sbjct: 122 YNGSQRWSAVTDVFELIPPVPGLVEQFKPRLKYLLIDENAWSDSELASLKNLVAAVFRIE 181 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY-----VLQTGDAQRFRAFIGEIAERA 236 H + ++SLL L+ +F +++ + + I ++ E Sbjct: 182 HPASP---AAIGDLLSLLDEWLAERPDLRRMFALWIRATLMRKAEYRIVLPRIDDLQELN 238 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG----LDRELVMMVTRLSPDDLI 292 E+L A + EG +GK E E G L + L + PD L Sbjct: 239 VMLAERLEEWAQAYKAEGKAEGKAEGKAEGKAEGKAEGEALALQKLLKKRFGAVPPDVLA 298 Query: 293 AQS 295 S Sbjct: 299 QIS 301 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. Identities = 73/295 (24%), Positives = 122/295 (41%), Gaps = 19/295 (6%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD FKS L PD + LP + D +L + E L DL +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 + I++++EH+S P+ F++ RY L G + PL LP+LFYHG P Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPRPL-LPILFYHGVV-P 124 Query: 130 YPYSLCWLDEFAEP-AIARKIYSSAFPLVDITVVPDDEIMQH---RKMALLELIQKHIRQ 185 + + P + PL+D+ V D+EI H + L L KHI Sbjct: 125 WTLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHHVDDLEAVLALLSLKHIF- 183 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQ----LKALFNYVL---QTGDAQRFRAFIGEIAERAPQ 238 V+ +V LL+ + LK NY+ + ++Q + + IA Sbjct: 184 ----DGVETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREVGM 239 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 ++ + T D ++G +G + + Q+ L++GL++ RL + +I Sbjct: 240 AQDIVETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGF-QQGARLKEEQVIR 293 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 57/280 (20%), Positives = 114/280 (40%), Gaps = 25/280 (8%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD+ FK P + +DI + + ++ +++ DLL+S Sbjct: 4 QPHDSFFKQIFSDPRRVKTLLDIFAKDVAKSI---HSITPVNTEKFSSKSQKFMLDLLFS 60 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC- 126 K ++ YI +V+EH+S ++ + ++ Y+ A + + + P ++ ++FYHG Sbjct: 61 CKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKEK-EYYPPIINIVFYHGKG 119 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL----ELIQKH 182 P SL L+ + + + + + L+D+ V DDE++ + + KH Sbjct: 120 EWNIPTSLPVLE---DQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKH 176 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQ------LKALFNYVLQT-GDAQRFRAFIGEIAER 235 + + + + + LV L FNY+ GD + + E+ Sbjct: 177 VHEN--IEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAENALKELI-- 232 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 +K MT+ ++ EG +GK E ++ GL Sbjct: 233 --GGDKKAMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGL 270 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 77/266 (28%), Positives = 132/266 (49%), Gaps = 32/266 (12%) Query: 60 YYSDLLWSVKTQEGVG----------YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA 109 Y ++W+V+ G Y+Y +IE+QS +LMAF M+ Y++A M+ HL+ Sbjct: 10 KYDIIIWAVRWYCKYGISYPDLAEMLYVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNE 69 Query: 110 GYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ 169 GY+ELP+++ + Y G +SPYPYS D F +AR+ F L+D++V+ +E+++ Sbjct: 70 GYQELPIIVNICIYTGKKSPYPYSQDICDYFEGVELAREQMFKHFKLLDLSVLSQEELLK 129 Query: 170 HRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFI 229 +E + + R+RD L ++ +L+ ++ L + Y+L T D + Sbjct: 130 DGTFGSVEALLRQGRERDYLNWINN-NQVLIWELVSNYGLSIVI-YILTTDDKNDADYLM 187 Query: 230 GEIAERAPQEKEKLMTIADRLRE------------EGAMQGKHEE--------ALRIAQE 269 I E ++KE ++T A +LR+ EG QGK E A I + Sbjct: 188 QAIIEAVLEQKEIIVTAAQQLRQVDIQTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKS 247 Query: 270 MLDRGLDRELVMMVTRLSPDDLIAQS 295 ML GL+ L+ VT +S + + + Sbjct: 248 MLKEGLEISLIQKVTGISREAIEKLT 273 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 184 bits (467), Expect = 3e-45, Method: Composition-based stats. Identities = 69/321 (21%), Positives = 133/321 (41%), Gaps = 31/321 (9%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M+ S HD+ FK HP + + K +++L F+DE Q Sbjct: 1 MSSSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +D++ + ++ Y Y++IE+QS E M R++RY I + G K+LP ++P+ Sbjct: 61 RADVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKKLPAIIPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 + Y+G + S + EF K + +V+I+ + ++Q + L ++ Sbjct: 121 VTYNGLEKDWDVSQEIISEFDI----FKDDIFKYAVVNISKLDAKTLLQEEEDILSPVVF 176 Query: 181 KHIRQRDLLGLVDQIVSLLV------TGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 + RD + + + + + N +R L N V++ + + E+A+ Sbjct: 177 YLEQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGN-VIRPRLVKEDKEKYDELAQ 235 Query: 235 RAPQEKEKLM-----TIADRLRE---------------EGAMQGKHEEALRIAQEMLDRG 274 R Q + M +A L E EG ++GK E + +A++M+ RG Sbjct: 236 RVEQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRG 295 Query: 275 LDRELVMMVTRLSPDDLIAQS 295 E + +T L + + Sbjct: 296 FSDEDIAELTELDIEKVKELR 316 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 184 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 51/329 (15%), Positives = 117/329 (35%), Gaps = 41/329 (12%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M +D FK + + +F+ ++ D +L+ SFI ++ + Sbjct: 1 MQQKVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLP 119 +D+++ + ++ Y YV+IE QS + M R+ Y + H++ E LP ++P Sbjct: 61 EADVIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVP 120 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 ++ Y+G + + I + + LVD+ + D+++ + + L Sbjct: 121 IVLYNGRSGWNIPTQIFKGF----DIFKDDM-FNYILVDVNRLDDEKLKSRLDLLSIILY 175 Query: 180 QKHIRQR--DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA- 236 + R+ + + + ++ + QLK +++L+ Q I E Sbjct: 176 LEKSRRNAEEFVEKLSEVSEYICKLPQV--QLKVFCSWLLRIVKPQVREEMESRIDELLK 233 Query: 237 ----------------------PQEKEKLMTIADRLREEGAMQGKHEE--------ALRI 266 +E ++ EEG +G E I Sbjct: 234 KIEAEGVEDVGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKEEEI 293 Query: 267 AQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + ++ +G + + T + + + Sbjct: 294 VRRLIQKGFNDNFIAEATGVEIERIKKIR 322 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 180 bits (458), Expect = 3e-44, Method: Composition-based stats. Identities = 72/290 (24%), Positives = 129/290 (44%), Gaps = 16/290 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 HD FKSF + RDFI +LP ++K DLT ++++ ++ E+ +++YS Sbjct: 2 SKKIPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFYS 61 Query: 63 DLLWSVKTQEG--VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVL 118 D++ V + +Y + EH+SKP + + Y + L G + LP+++ Sbjct: 62 DVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPIIV 121 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAF--PLVDITVVPDDEIMQHRKMALL 176 P++ Y+G +S + +S+ + D F P+ K + F L DI + + M + Sbjct: 122 PVVIYNGYKS-WNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEIF 180 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDR---QLKALFNYVLQTGDAQRFRAFIGEIA 233 L+ K+I +L + +I LL ND+ L + YV+ +G R E A Sbjct: 181 HLLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLL--EHA 238 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGK----HEEALRIAQEMLDRGLDREL 279 +R +E + A + E K + + +QEML + L Sbjct: 239 KRFSGGEEMIGLAAREIEERVEQTRKPYWQKQAKVENSQEMLIKSLKMRF 288 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 175 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 63/303 (20%), Positives = 118/303 (38%), Gaps = 31/303 (10%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 DA++ HP A + +P + D ++ F D D ++ D++W + T Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 71 QEGVGYI-YVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFYHG 125 +G + +++ E QS + MA R Y Q+ + LP VL ++ Y+G Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSGDRLPPVLTLVLYNG 124 Query: 126 CRSPYPY--SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 + + ++ + A + + + L+D+ VP++E+ +A L +H Sbjct: 125 EQRWHAPTDTIPLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAALLFRLEHP 184 Query: 184 RQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT------------GDAQRFRAFIG 230 R+ +L GL+D +V D L+ LF +++ GD R+ + Sbjct: 185 REPEELAGLIDDVVGWFRRHPGYDE-LRRLFTELVRQAIEGYETSVAVPGDMMEMRSMLA 243 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 + E T R EG +G+ R + L R L++ + T Sbjct: 244 NLGE----------TWKKRWLAEGIAEGEARGEARGEAKALIRLLEKRFGQLPTDTRERV 293 Query: 291 LIA 293 L A Sbjct: 294 LAA 296 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 59/264 (22%), Positives = 113/264 (42%), Gaps = 8/264 (3%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 +T+ HD+ K FL A + LP + K D + E +S++ + L+ YYSDL Sbjct: 2 STTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDL 61 Query: 65 LWSVKTQEG--VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-GYKELPLVLPML 121 + SV T+ G V ++ ++EH+S ++ + +RY + + + G LP+++P+L Sbjct: 62 VVSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPIL 121 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD-ITVVPDDEIMQHRKMALLELIQ 180 H P + L + + F L D + P+D AL L Sbjct: 122 IAHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTL-W 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDR---QLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 ++ R + + V + L+ + R ++ + +Y+ T D + + Sbjct: 181 RYSRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEID 240 Query: 238 QEKEKLMTIADRLREEGAMQGKHE 261 + +E + TIA+ R EG + + Sbjct: 241 EGEEYMGTIAEMFRREGDERTEQR 264 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 174 bits (440), Expect = 4e-42, Method: Composition-based stats. Identities = 63/319 (19%), Positives = 126/319 (39%), Gaps = 31/319 (9%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 + HD +K HP+ + ++ P+ + L D TLK ++I + + D++W Sbjct: 3 TNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVW 62 Query: 67 SVKTQ----EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG----YKELPLVL 118 SV+ ++Y+++E QSK + M R+M Y + L + LP + Sbjct: 63 SVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHYVACFYDHLLKTRETTVRQGLPPIF 122 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAE-PAIARKIYSSA--FPLVDITVVPDDEIMQHR-KMA 174 PM+ Y+G + + D P ++Y + L+D D+E++ R ++ Sbjct: 123 PMVLYNGSQR-WSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTPLS 181 Query: 175 LLELIQKHIRQRDLLGL-VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFI---- 229 + ++ + L VD+IV ++ DR K + ++ + +A + Sbjct: 182 GIFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARLNLDR 241 Query: 230 -GEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG------------LD 276 + E E L + + R EG +G+ E + L+ L Sbjct: 242 MSSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGVLS 301 Query: 277 RELVMMVTRLSPDDLIAQS 295 + + + T LS D++ Sbjct: 302 NDQIAVATGLSVDEIDKLR 320 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 174 bits (440), Expect = 4e-42, Method: Composition-based stats. Identities = 50/267 (18%), Positives = 99/267 (37%), Gaps = 10/267 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ HD FKS L P + LP L L +L + + + L Sbjct: 1 MTIDGPLHDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLL 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+ + E I+V++EH+S P+ F+++ Y +P V P+LF Sbjct: 61 DMAFEATFGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLRDKKESRSPIPFV-PVLF 119 Query: 123 YHGCRSPYPYSLCWLDEFAEP-AIARKIYSSAFPLVDITVVPDDEIMQHRK---MALLEL 178 YHG R P+ + P + + P++D+ + D +I + + + L Sbjct: 120 YHGLR-PWNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLL 178 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD---AQRFRAFIGEIAER 235 + KHI + G + + N + + + +YV+ + I + Sbjct: 179 LLKHIFEG-ARGSLRAFLQETNGKNLSRDIIISGMSYVIGVHHLESTAELSRLVNTILKE 237 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEE 262 + + + L ++G +G + Sbjct: 238 EGMSQNVVELWMEELIQQGVQKGIQQG 264 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 174 bits (440), Expect = 5e-42, Method: Composition-based stats. Identities = 60/304 (19%), Positives = 117/304 (38%), Gaps = 39/304 (12%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S+ D+++K HP+ RD + L A + + + S+ + + D++W Sbjct: 40 SSRTDSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDDVVW 99 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ----NHLDAGYKELPLVLPMLF 122 + Y+Y+++E Q++P++ MA RM Y Q H + + +LP VLP++ Sbjct: 100 RARIGGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLPVVL 159 Query: 123 YHGCRSPYPYSLCWLDEFAEPA-IARKIYSSAFPLVD-----------------ITVVPD 164 YHG + P+ + R S + L+D + D Sbjct: 160 YHGRGPWRAATALASLMLPAPSGLERYQPSQRYLLIDQHHGTARADVVSLLFRLLDAATD 219 Query: 165 DEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLL---------VTGNTNDRQLKALFNY 215 ++ + L+L+ + IR RD+ + D + + T + Sbjct: 220 LQLRE-----ALDLLAERIRARDMDPVRDSLTRWIQLTLQDAAVETSMDLEEAFTMKMRR 274 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 + F +A+ ++ ++ + REEG +G+ E R E L+RG Sbjct: 275 KFSYDEMFDPGMFERPLAKA---REKAIVEGLQQGREEGLERGRVEGLERGRVEGLERGR 331 Query: 276 DREL 279 + L Sbjct: 332 EEGL 335 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 173 bits (439), Expect = 6e-42, Method: Composition-based stats. Identities = 57/251 (22%), Positives = 108/251 (43%), Gaps = 9/251 (3%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA +K HP+ RD + + P + D +TL+ S++ +DLR+ D++W ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFYHG 125 QEG YIY+++E QS + MA R++ Y Q+ + A Y ++LP V P++ Y+G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 126 C-RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 R + L E + R S + LVD D+ + + + ++ R Sbjct: 124 GPRWRAATEVGDLITPLEGGLERYRPSLRYLLVDEGDYQDEALAPLKNLVASLFRLENSR 183 Query: 185 QR-DLLGLVDQIVSLLVTGNT---NDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 +LL ++ ++ L + L +L + + E Sbjct: 184 TPEELLQVLRNLLQWLQSPAQKGLERDFTLWLKRVLLPARLPGVEIPSVASLEEMNSMLA 243 Query: 241 EKLMTIADRLR 251 E+++ + + Sbjct: 244 ERVVEWTQQWK 254 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 171 bits (434), Expect = 2e-41, Method: Composition-based stats. Identities = 59/331 (17%), Positives = 133/331 (40%), Gaps = 43/331 (12%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 P+D ++ L + + + + D L L S++ +D + +D++ Sbjct: 11 PHHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADVV 70 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN--------HLDAGYKELPLV 117 + +KT+ YV++E QS + LM FR++ Y + + ++ + LP + Sbjct: 71 YRLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPPI 130 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAE-PAIARKIYSSAFPLVDITVVPDDEIMQHRKM--A 174 +P + Y+G S + +L + + + + + L D+ ++E+++ + Sbjct: 131 IPAVLYNGAGS-WTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIAG 189 Query: 175 LLELIQKHIRQRDLLGLVDQIVSLLVTGNTND-RQLKALFNYVLQTGDAQRFRAFIGEIA 233 + L QK + DL G + ++ +L ++ R V+Q F I I Sbjct: 190 IFLLDQKM-QPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGIL 248 Query: 234 ERAPQEK-EKLM-----TIADRLRE-----------------------EGAMQGKHEEAL 264 + + E+++ T+ + R+ EG ++GK E Sbjct: 249 NASNPWEVERMIYNLELTLEEMQRQALLKGLKEGEQKGKLEGKLEGKLEGKLEGKLEGKR 308 Query: 265 RIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +A+ +L +D E ++ T L+ +++ A Sbjct: 309 EVARNLLLLNVDIETIIKATGLALEEINALK 339 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 170 bits (431), Expect = 4e-41, Method: Composition-based stats. Identities = 56/281 (19%), Positives = 113/281 (40%), Gaps = 14/281 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD+ +K F +P+ + +PA + D +TL+ S++ +DLR+ + Sbjct: 1 MGKERIPHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHD 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLV 117 D++W + ++G Y+ +V+E QS P+ MA R + Y+ + + + G + LP V Sbjct: 61 DIVWRIGWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPV 120 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPA-IARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 P++ Y+G ++ P + L+D + V DE+ + L+ Sbjct: 121 FPIVIYNGGKAWKAPQEVATLFAPMPDSLKHYCPQHRHFLLDESRVSGDEL--DKSQGLV 178 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL------KALFNYVLQTGDAQRFRAFIG 230 + K R ++ + + L+ + L L VL+ Sbjct: 179 AQLLKLERAQEPEQVRQIVKELITRLHEPKYLLLRRAFTVWLSRVVLKRSGITEEIPEFQ 238 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEML 271 ++ E +E+ D ++G +G R + L Sbjct: 239 DLREVDAMLEERAAQWKDEYIKQGKTEGISIGEARGIRSAL 279 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 170 bits (430), Expect = 6e-41, Method: Composition-based stats. Identities = 64/281 (22%), Positives = 110/281 (39%), Gaps = 18/281 (6%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HDA+FK+ P A LP L + D EP + +D L + D+LW Sbjct: 5 HAHDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWR 64 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLPMLFYHGC 126 + +G G IYV++EHQS E M R+ Y H + LP ++P++ H Sbjct: 65 TRFVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAE 123 Query: 127 RSPYPYSLCWLDEFAE-----PAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 + + ++F+ P +A + + + D+T V D + L Sbjct: 124 HG-WRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQVDDASLRGRSLPLFQTLALW 182 Query: 182 HIRQ-RD---LLGLVDQIVSLL--VTGNTNDRQ----LKALFNYVLQTGDAQRFRAFIGE 231 +R RD +L VD+ + + + G + Q ++ L Y F + Sbjct: 183 LLRDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFHRK 242 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD 272 +A P E +T + G +G E ++ E+L+ Sbjct: 243 LAAFHPPSAEMSLTFEQQAINRGHKRGLEEGRIKGRLELLE 283 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 60/295 (20%), Positives = 119/295 (40%), Gaps = 11/295 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQYY 61 ++ TPHD FK + + LP + + D +L P + E L R Sbjct: 1 MAKNLTPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTR 60 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 +DL++SV E G + V++EH+S P+ + F++++ + +L G + LP +LP+L Sbjct: 61 ADLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREGREPLP-ILPIL 119 Query: 122 FYHGCRSPYPYSLCWLDEFAEP-AIARKIYSSAFPLVDITVVPDDEI--MQHRKMALLEL 178 FYHG + + + + P IAR + +D+ ++ D I +Q+ L Sbjct: 120 FYHG-QGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAALL 178 Query: 179 IQKHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKAL-FNYVLQTGDAQRFRAFIGEIAER 235 KH+ + R L+ + + ++ + +Y +A Sbjct: 179 SMKHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMAAI 238 Query: 236 APQ--EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 + + + EEG +G + Q+ + +G+ + + + LS Sbjct: 239 TEEAGMETTTERLKKIWIEEGIQKGVQLGIQQGVQQGVQQGVRQNQIKTILSLSK 293 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 48/250 (19%), Positives = 105/250 (42%), Gaps = 16/250 (6%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD FK P + +DI + L + DL +++L + + + + DLL+ Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIFY-SELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYE 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 K + ++ ++ EH+S ++ + +++ Y+ + YKE ++ ++ YHG R Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYKEYLPIINIVLYHGKR 121 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL----ELIQKHI 183 + L + I R + L+D++ V D+E++ + L KHI Sbjct: 122 KWNIPTT--LPKTNSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKHI 179 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 + L I+ + D + + +Y+ + Q + EI ++++ Sbjct: 180 F--EDLKKYKHILKKVFEHY-QDGCVFIILDYISVVNNPQEVENVLKEIL----GGEKEM 232 Query: 244 MTIADRLREE 253 T+ ++ + E Sbjct: 233 TTLTEKWKME 242 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 167 bits (422), Expect = 5e-40, Method: Composition-based stats. Identities = 66/316 (20%), Positives = 131/316 (41%), Gaps = 31/316 (9%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD +F+ P AR F+ LP L D TL + S I + L + D+++ + Sbjct: 36 HDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRIN 95 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH--------------------LDA 109 + YV++EHQ+K E+ MA R+M + + Sbjct: 96 VNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKADRQSRRR 155 Query: 110 GYKELPLVLPMLFYHGCRSPYP-YSLCWLDEFAE--PAIAR-KIYSSAFPLVDITVVPDD 165 + PLV+ M+ + G R + L L + AR + F +V++ +P + Sbjct: 156 ETDKFPLVISMVLHPGPRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVELAGLPLE 215 Query: 166 EIMQ-HRKMALLELIQKH----IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG 220 ++ H A+L +Q + I R + L+D++ S + +K L++Y++ + Sbjct: 216 KLADGHLARAILGALQGNRLGLIDIRKIKRLLDEMFSDPDRASVGAV-VKQLWHYLISSS 274 Query: 221 DAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 D + + IA + + +M +RL++ GA++ +H + + DR + L Sbjct: 275 DLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAGALKAQHNAVIEALEVRFDR-VPEGLR 333 Query: 281 MMVTRLSPDDLIAQSH 296 + ++ + + H Sbjct: 334 EAIQGINDPERLRNLH 349 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 166 bits (421), Expect = 8e-40, Method: Composition-based stats. Identities = 86/185 (46%), Positives = 113/185 (61%), Gaps = 22/185 (11%) Query: 134 LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVD 193 +CWL FA+P IAR+IY FPL+DIT PDDEIM+HR++A+LEL+QKHIRQRDL+ L + Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 194 QIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ--EKEKLMTIADRLR 251 Q+V LL G T+ RQLK L +Y+LQ G+A AF+ +A+ P+ KE LM IA L Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 252 --------------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 E+G QG+ + A RIA+ ML GLD LV +T L+P+ L Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 292 IAQSH 296 H Sbjct: 181 ARLQH 185 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 66/314 (21%), Positives = 136/314 (43%), Gaps = 29/314 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D++FK DF+ LP K T LK E I +D SD+L+ ++ Sbjct: 7 DSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDILYKIEK 66 Query: 71 QEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------ELPLVLPMLF 122 + G YIY+++EHQSK ++LMAFRM+ Y + + ++++ K +LP+++ M+F Sbjct: 67 RNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVIIGMVF 126 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-HRKMALLELIQK 181 Y G + + + + L++++ + ++ I+ + + ++ L K Sbjct: 127 YDGKAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGVILLTDK 186 Query: 182 -HIRQR---DLLGLVDQIVSLLVTGNTNDRQLK---ALFNYVLQTGDAQRFRAFIGEIAE 234 ++R + +LL ++++ + L ++ ++ K A + D + + E+ E Sbjct: 187 PNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKERFEELKE 246 Query: 235 -RAPQEKEKLMTIADRLREEGAMQGKHEEA------------LRIAQEMLDRGLDRELVM 281 P+ L IA R RE+ ++GK E + I + D+ L Sbjct: 247 MEVPKMFNTLEEIAKRDREKAKLEGKAEGKVEGKLEERRELIIEILNQRFGEDFDKSLEE 306 Query: 282 MVTRLSPDDLIAQS 295 + + + + Sbjct: 307 KIRNANEETINQIK 320 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. Identities = 69/211 (32%), Positives = 110/211 (52%), Gaps = 7/211 (3%) Query: 90 LMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEF-AEPAIARK 148 + F++ RY A M HL G+ LP+V+ ML+Y G +PYPY+ D F IA K Sbjct: 1 MTPFKIARYVHAIMDQHLKQGHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEK 60 Query: 149 IYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH-IRQRDLLGLVDQIVSLLVTGNTNDR 207 IY +P++DIT + DD I H +A+L+ QK+ RD+ ++ I+ L G Sbjct: 61 IYLRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTRE 120 Query: 208 QLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLR----EEGAMQGKHEEA 263 Q + L Y + D + + ++ +E +M++A ++ + G QG++EE Sbjct: 121 QCQTLLYYTFRETDTDNVKMLLEQLQTIRI-YEEDIMSVAHKIEQQGLQRGLQQGRYEED 179 Query: 264 LRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 L+IA+ ML +G DR + VT LS DL+ Sbjct: 180 LKIAKRMLAKGTDRGYIKDVTGLSDQDLLNL 210 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 160 bits (404), Expect = 7e-38, Method: Composition-based stats. Identities = 63/288 (21%), Positives = 116/288 (40%), Gaps = 25/288 (8%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQY 60 S ++TPHD+ FK + + L +L++L+ P I EDL R Sbjct: 17 KTSISTTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARST 76 Query: 61 YSDL-----LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP 115 SDL + + + G I + EH+S + ++ A + L G K P Sbjct: 77 RSDLSASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREGRKPCP 136 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAE-PAIARKIYSSAFPLVDITVVPDDEIMQ---HR 171 V+P++ YHG R+P+ + P +A ++ L+D++ D+ + + H Sbjct: 137 -VIPVVLYHG-RAPWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAHP 194 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKAL-------FNYVLQTGDAQR 224 + + + KHI + ++ V L+ T + + LK + +YV ++ Q Sbjct: 195 EPLVSLSVMKHIFEP-PESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQE 253 Query: 225 FRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD 272 R +EK+ T+ D ++EEG +G +L Sbjct: 254 IRTIFTTFL-----AEEKMTTVLDLIKEEGIQEGIQMGRDEAITRLLQ 296 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 160 bits (404), Expect = 7e-38, Method: Composition-based stats. Identities = 73/178 (41%), Positives = 110/178 (61%), Gaps = 16/178 (8%) Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 P+ + P A+ +Y F L+D+TV+PDD+++QHR++ALLEL+QKHIRQRDL Sbjct: 11 PHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSS 70 Query: 191 LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRL 250 + + + ++++ G TN RQL+ LF+Y+LQ G+ F+ +A R PQ +E LM+IA +L Sbjct: 71 ITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKL 130 Query: 251 REEGAMQGKHEE----------------ALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 ++EG +G+ E ALRIA ML GLD+E+V +T LS D+L Sbjct: 131 KQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQ 188 Score = 40.0 bits (92), Expect = 0.097, Method: Composition-based stats. Identities = 18/27 (66%), Positives = 20/27 (74%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDF 27 M TSTPHDAVFK FLRHP+TA+ Sbjct: 3 MKKRMTSTPHDAVFKRFLRHPETAKTL 29 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 52/317 (16%), Positives = 127/317 (40%), Gaps = 28/317 (8%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 + HD+ FK +P + + ++++++ ++I ++ Q Sbjct: 6 KEKLPAKEHDSTFKLLFENPKDIYLLLSKIINYSWANEIRESSIEIKKTNYITKEFSQVE 65 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 +D++ + ++ Y Y++IE+QS + M R++RY I+ + G ++LP ++P++ Sbjct: 66 ADVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNGVEKLPAIIPIV 125 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVP-DDEIMQHRKMALLELIQ 180 Y+G + S + F I+ + +VDI + + + + + + Sbjct: 126 VYNGLDRRWEVSTDIIGAFDI--FKNDIFK--YKVVDIAQIDIKNYLQEEDVLTPIIFYL 181 Query: 181 KHIR--QRDLLGLVDQIVSLLVT--GNTNDRQLKALFNYV---LQTGDAQRFRAFIGEIA 233 + +R +L+ + +I L N +R L + + L + + + ++ Sbjct: 182 EQVRNDSNELVRRLQEIEQSLKKLSFNNIERFLLWSQHVIRPRLGNEQKKEYDKLVMKVR 241 Query: 234 ER-APQEKEKLMTIADRLREEGAMQ---------------GKHEEALRIAQEMLDRGLDR 277 + E + +A L E + G +E + A+ M+ G+ Sbjct: 242 QEGVELMGEFVSNVARLLDETKTKEFLAGVQQGIQQGIQQGIQQERIETAKRMIQLGISY 301 Query: 278 ELVMMVTRLSPDDLIAQ 294 E++ T LS +++ Sbjct: 302 EVISKATNLSIEEIEKI 318 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 51/143 (35%), Positives = 88/143 (61%), Gaps = 2/143 (1%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD F++ + A++F + HLP + K DL +L+L+ +SFIDE L+ + Sbjct: 1 MKKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMA 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLPML 121 D+L+SVK GY Y+++EHQ P++LM +R++RY + + +HL Y LP+V+P++ Sbjct: 61 DVLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLV 120 Query: 122 FYHGCRSPYPYSLCWLDEFAEPA 144 FY+G + YP+ +L A+ + Sbjct: 121 FYNGKKR-YPFQRIFLLYLAKKS 142 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 58/217 (26%), Positives = 92/217 (42%), Gaps = 13/217 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQYY 61 ++TT TPHD+ FK + L AP D ++L I E L + Sbjct: 1 MTTTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFR 60 Query: 62 SDLLWSV----KTQEGVGYIYV-VIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 SDL+ S+ T +G +V ++EH+S P + F++ A L G LP Sbjct: 61 SDLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGKPPLP- 119 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFA-EPAIARKIYSSAFPLVDITVVPDDEIMQH---RK 172 V+P+L +HG +SP+ L + P +A + A ++D+T + DDEI + + Sbjct: 120 VVPILIHHG-KSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEIRRKIPDPE 178 Query: 173 MALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL 209 + KHI L + + LL N L Sbjct: 179 PQMSLAAMKHIHDP-LPAFLRVMADLLKEIEENRDIL 214 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 150 bits (378), Expect = 6e-35, Method: Composition-based stats. Identities = 54/323 (16%), Positives = 114/323 (35%), Gaps = 29/323 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + +D +K + + + L+L +++ D + Sbjct: 1 MCSNLPHNVNDLEYKYIFSNKSLFLRLLKRIDRINIFNKLTEEDLELVDKNYVLPDFSEQ 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------- 112 SDLL+ + QE + Y++ EHQS + MA R++ Y ++ L K Sbjct: 61 ESDLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDIWRDWLKQFDKNQFKNKSF 120 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAE-PAIARKIYSSAFPLVDITVVPDDEIMQHR 171 + P V+P++ Y G +P+ S+ + + I + L+D+ + Sbjct: 121 KFPPVVPIVLYDGD-NPWTASVNLKERIMNFEVFGKYIVDFEYILIDLNDPDEMIFKYKD 179 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 ++L+ + K +++L L + L + + V+ + + Sbjct: 180 ILSLILKLNKVKTEKELERLFLDLYEYLQGAKEKEINTLKICLPVVLKELGEDKVQEAKD 239 Query: 232 IAERAPQEKEKLM-------TIADRLREEGAMQGKH------------EEALRIAQEMLD 272 + E E +M I + EG +G ++ L IA+ M+ Sbjct: 240 MLECIDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDGLQQGLQQGLQKKELEIAERMIV 299 Query: 273 RGLDRELVMMVTRLSPDDLIAQS 295 +G E + +T L + + Sbjct: 300 KGYSDEEIHEITGLDIEKIKELR 322 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 150 bits (378), Expect = 6e-35, Method: Composition-based stats. Identities = 52/307 (16%), Positives = 107/307 (34%), Gaps = 15/307 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + HD +K + +T I + L L S++ D + S Sbjct: 16 VNKKNNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELES 75 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--------L 114 D+++ + + + Y+++E QS + M R++ Y I + L ++ L Sbjct: 76 DIVYKARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRL 135 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM- 173 P V+P++ Y+G ++ I + +D+ DE+ +++ + Sbjct: 136 PAVVPIVVYNGEKNWTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIA 195 Query: 174 -ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD--AQRFRAFIG 230 A+ L Q R L D ++ QLK V + + Sbjct: 196 SAIFLLDQSISRIEFYNRLKDIVIEFNKLTVEEKAQLKHWLVNVNSEENNYKENIEKIFS 255 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD---RGLDRELVMMVTRLS 287 + ++L+EEG ++GK E + + L+ + L E + L Sbjct: 256 SNKREVEIMTSNISKGLEKLKEEGKIEGKAEGKAELLIKQLNKKFKLLPMEYEKKIKALP 315 Query: 288 PDDLIAQ 294 L Sbjct: 316 EKILDDI 322 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 56/268 (20%), Positives = 113/268 (42%), Gaps = 18/268 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S +PHD FK F++I LP L + +LKL + ++++ Sbjct: 1 MSIEKSPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFL 59 Query: 63 DLLWSVKTQEGVG-----YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 DL + K ++ G IY+V EH+S P++ ++ Y M+ P V Sbjct: 60 DLAFDCKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEEDERLSRPYRP-V 118 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 +P++FYHG +S + + + ++S ++ L D++ V + +++ + Sbjct: 119 IPIVFYHGEKSWNIPTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLIEKIYLNACL 178 Query: 178 ----LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 K+I + L + ++ L+ + D + V+ D + + EI Sbjct: 179 ISGVFTLKNIFK--DLKYLRPVLEKLILDDVKDCLYIIIDYTVIVKKDLETIEKILEEI- 235 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHE 261 +EK+MT+ ++ + EG +G E Sbjct: 236 ----GGEEKMMTLTEKWKMEGLKKGMEE 259 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 143 bits (361), Expect = 7e-33, Method: Composition-based stats. Identities = 49/297 (16%), Positives = 106/297 (35%), Gaps = 26/297 (8%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 HD +K + + D I + + K ++L S+I D + S Sbjct: 4 KKEMHHIHDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELES 63 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--------EL 114 D+++ Y+++E QS + M R+ Y + L + L Sbjct: 64 DIVYKATIDGREVIFYILLEFQSYVDYSMPIRLFLYMSEIWREVLKNTKQAEVKSKEFRL 123 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM- 173 P ++P++ Y+G I + L+DI +E+M+ + + Sbjct: 124 PAIVPLVLYNGEYKWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKNLV 183 Query: 174 -ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 A+ L QK D+ + ++ + + N + K + + L+ + + +GE Sbjct: 184 SAVFLLDQKV----DIEEFISRVKDIAIDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEK 239 Query: 233 AE--RAPQEKE----------KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDR 277 E +++E + + REEG +G E + ++ + ++ Sbjct: 240 IEDILIAKKEEVNRMTSNISKTIKETFAKTREEGMEKGIEEGIEKGIEKARQKDVEI 296 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. Identities = 62/293 (21%), Positives = 114/293 (38%), Gaps = 47/293 (16%) Query: 5 TTSTPH-----------------DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKL 47 TT PH D + ++ + A D LP L K DL L L Sbjct: 2 TTHQPHTKTLDDRNEAPMSQDFYDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSL 61 Query: 48 EPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL 107 +++ ++LRQYY+D+L+SV +IY++++HQS + + R+ R ++ + +L Sbjct: 62 RSGTYVSDELRQYYTDVLYSVLLDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYL 121 Query: 108 --DAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSA------------ 153 LP++LP++F+H + ++ A A R S+ Sbjct: 122 IERQDATTLPVILPIVFHHEATG-WSDAVGLNGSLALGADVRTALSANRRDFRRLRYLLL 180 Query: 154 ---FPLVDITVVPDDEIMQHRKMALLELIQKHIR-QRDL---LGLVDQIVSLLVTGNTND 206 F + + + + + LL R +RDL L + ++ +V Sbjct: 181 VLCFQFDEASRAQN----LNEALGLLMRTFGVARPKRDLVASLKGWEDVIREVVATQRGR 236 Query: 207 RQLKALFNYVL--QTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQ 257 L + ++L D ++F+ A + MT ADRL + + Sbjct: 237 EMLATVVQFILENSETDPDELKSFLEFTAG--EPARTAFMTGADRLTQGVREE 287 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 72/129 (55%), Positives = 93/129 (72%), Gaps = 4/129 (3%) Query: 168 MQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRA 227 +H MALLELIQKHIRQRDL+GLV+Q+ LL +G NDRQ+K LFNY+LQTGDA RF Sbjct: 12 RRHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFND 71 Query: 228 FIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 FI +AER+P+ KE LMTIA+RLR+E G+ +AL IA+ ML+ G+ +M T +S Sbjct: 72 FIDGVAERSPKHKESLMTIAERLRQE----GEQSKALHIAKIMLESGVPLADIMRFTGVS 127 Query: 288 PDDLIAQSH 296 ++L A S Sbjct: 128 EEELAAASQ 136 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 81/155 (52%), Positives = 105/155 (67%), Gaps = 20/155 (12%) Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 +PDD+IMQHR+MALLELIQKHIR+RDL+GLV+++ LLV G+ ND QLKALFNY++Q G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLREE--------------------GAMQGKHE 261 F F+ E+AER PQ K+KLMTIA+RLR+E G QGK E Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EALRIA M G+D ++ +T L+ +DL +SH Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDLATRSH 155 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 103/265 (38%), Gaps = 10/265 (3%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 HD +K L + + + D + SF+ +D +DL+ Sbjct: 10 IHNQHDKGYKFLLSSKRVFIELLRSFVKQEWVNDIDEANVVKVDKSFVLQDFADKEADLV 69 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--------ELPLV 117 + VK ++ Y+++E QS + M +R++ Y + ++ L + +LP++ Sbjct: 70 YRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDTPRKESRRKDFKLPVI 129 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 +P++ Y+G + + + L+D+ +E+++ + Sbjct: 130 VPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFKYILIDVNRYTKEELLKLENLIASV 189 Query: 178 LIQKH-IRQRDLLGLVDQIVSLLVTGNTNDRQL-KALFNYVLQTGDAQRFRAFIGEIAER 235 + + + +++ + ++ +L + ++ L KA F +L + R I I + Sbjct: 190 FLLEQKVEFEEIMKRLKELSEILNNLDKDEILLFKAWFKKILLARLPEEERENIERIIDE 249 Query: 236 APQEKEKLMTIADRLREEGAMQGKH 260 + +E + + + +E + K Sbjct: 250 NKEVEEMISNLEKTILQEMKEREKR 274 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 50/316 (15%), Positives = 117/316 (37%), Gaps = 29/316 (9%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T +D +K + + F+ L K + + +++ I++ ++ SD+ Sbjct: 2 KTYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDI 61 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 ++ +K ++ + + IE QS+ ++ + R+ Y + E+P+V+P++ Y+ Sbjct: 62 VYKIKYKD--SFFCLTIEFQSREDKKILHRLYEYMHLI--QLKNKVNGEIPVVVPIVLYN 117 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-MALLELIQKHI 183 G P + +DI +P+++++ +A+ I + Sbjct: 118 GISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLISAANVLAIAVYIDQVS 177 Query: 184 RQRD-LLGLVDQIVSLLVTGNTNDRQLK------ALFNYVLQTGDAQRF--------RAF 228 + +L + + + +L L +Y + +A+ Sbjct: 178 NNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGLEVDEL 237 Query: 229 IGEIAERAPQE---------KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 AE+ Q KE + + ++G QG IA++ML EL Sbjct: 238 FSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKDNQPIEL 297 Query: 280 VMMVTRLSPDDLIAQS 295 + T L+P+++ Sbjct: 298 ISKYTGLTPEEIKKLK 313 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 54/314 (17%), Positives = 122/314 (38%), Gaps = 27/314 (8%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S D FK L + + + L L L +++ I+ R S Sbjct: 1 MSRKRRSADEGFKKVLTNRTNIKWLLTELL-EVLPIQIGLEDIEVIATESINRQWRARRS 59 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+++ +K ++ YI V++E QS EEL+ R++ Y + LP+V+P++ Sbjct: 60 DMVYKIKYKD--AYICVLLEFQSSKEELIHLRVLEYMLLI--QKKYTTKNLLPVVIPVVL 115 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-MALLELIQK 181 Y G P + + + + + VD+ ++ D+++++ +A + K Sbjct: 116 YTGEEKWTPATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDK 175 Query: 182 HIRQRDLLGLVDQIVS--LLVTGNTNDRQLKALFNYVLQTG--DAQRFRAFIGEIAERAP 237 + + + +S + + + + L++ VL+ + F+ + Sbjct: 176 VSDNPEKVAERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRL 235 Query: 238 QEKEKLMTIADRLREEGAMQG----------------KHEEALRIAQEMLDRGLDRELVM 281 E + A+++R +G + K + L +AQ+M++ G + + Sbjct: 236 GVNEMFLNTAEKIR-KGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMIEEGAEDSFIA 294 Query: 282 MVTRLSPDDLIAQS 295 VT L + + Sbjct: 295 KVTGLDMERIRQLR 308 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 124 bits (311), Expect = 4e-27, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 112/256 (43%), Gaps = 12/256 (4%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFID-EDLRQYYSDLL 65 PHD K L+ + A+ +D HLP + + TL++ +D ++ +Y++D++ Sbjct: 14 QNPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADII 73 Query: 66 WSVKTQEGVGY-IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 +S+KT G IYV+IEH+S ++ + ++++ A + G ++ + P++ Y Sbjct: 74 YSLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEG--KITPIYPIVIY- 130 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFP-LVDITVVPDDEIMQ-HRKMALLELIQKH 182 + + + + +K + + +++ + + I + ++ + L + + Sbjct: 131 ASKEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMTLRI 190 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKAL-FNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 I++ +++ I S+ N + + + +Y+ + I + E Sbjct: 191 IQEPTPENILNLIKSIETLYNYKPKAVYVIALSYIFTIAKKDKNTY----IKVKKQLEGG 246 Query: 242 KLMTIADRLREEGAMQ 257 + ++ D EEG + Sbjct: 247 NMGSLLDMFIEEGLEK 262 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 34/85 (40%), Positives = 53/85 (62%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S PHD FK TAR F++ +LP +R L DL T+ + +S+ID++L++ +S Sbjct: 1 MSLIHNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKP 87 DLL+ VK +E GY Y + EH+ +P Sbjct: 61 DLLFQVKIRENEGYFYFLFEHKVRP 85 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 113 bits (284), Expect = 6e-24, Method: Composition-based stats. Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 3/104 (2%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M PH+ +F ++ D AR F+ H+ ++K DL TL+LEP +++DE L+++ Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 61 YSDLLWSVKT---QEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 YSDL++SV+ + IY++ EH+S P+ L ++++Y Sbjct: 61 YSDLVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKYMAL 104 >UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostridia RepID=B9MPV5_ANATD Length = 331 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 51/335 (15%), Positives = 115/335 (34%), Gaps = 49/335 (14%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M +S + +D FK FI +P P K + +++ I+ + Sbjct: 1 MKLSRS---YDVGFKKLFSDKINVCWFITEIIPEPRLKNYTQSDIEIVATESINAQWKAR 57 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+++ + IY+++E QS+P + M R+ Y + + +V+P+ Sbjct: 58 RSDMVYRLPYSSSW--IYLLVEFQSRPNKQMHCRIYEYVFLIQRKYQIDKRLP--VVVPV 113 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-HRKMALLELI 179 + Y+G P + + + + +D+ +P+D+++ + +A + Sbjct: 114 VLYNGVEKWQPVTQFADNVEYAEDFPEYVQRLNYIFIDVRDIPEDKLLNGNNVLAAALYV 173 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKAL------------------FNYVLQTGD 221 + D +V++++ L D Q + L N + Sbjct: 174 DQVATNPD--SVVERLLELGKNIRIPDEQREELAEWLYHAVLKSYKIPREEINELFAKSK 231 Query: 222 AQRFRAFIG---------------------EIAERAPQEKEKLMTIADRLREEGAMQGKH 260 +I + + E + + EG ++G+ Sbjct: 232 ILGVEEMFQSTAMKIKKGLAEEKKKIRLESKIEGKIEGKIEGKIEGKIEGKIEGKIEGRM 291 Query: 261 EEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 E L IA+ ++ G + + VT L + + Sbjct: 292 EAQLEIARNLILEGAEDSFIAKVTGLDIEKVKELR 326 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 110 bits (275), Expect = 6e-23, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 75/223 (33%), Gaps = 12/223 (5%) Query: 45 LKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ 104 + L S+I D + SD+++ + YV++E QS + M R++ Y I + Sbjct: 1 MILVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWR 60 Query: 105 NHLDAGYKE--------LPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPL 156 + L + LP ++P++ Y+G + I + + Sbjct: 61 DILRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEF 120 Query: 157 VDITVVPDDEIMQHRKM--ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFN 214 +DI +E+ + + A+ L Q R L D I+ LK Sbjct: 121 IDINSYEKEELYNKQNISSAIFLLDQNINRIEFYNRLKDIIIGFNNLSIEEKMHLKHWLV 180 Query: 215 YVLQTGD--AQRFRAFIGEIAERAPQEKEKLMTIADRLREEGA 255 + + + + ++L+E+G Sbjct: 181 NINTEENNFKDNIEKIFNADKQEVLNMTSNISKGLEKLKEDGK 223 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 55/314 (17%), Positives = 110/314 (35%), Gaps = 33/314 (10%) Query: 2 TISTTSTPH--DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 T + H D K P F+ + PL K ++ + F+ Sbjct: 8 TSNGVHNTHTKDNAAKIVFGDPVLCAQFLKGYTDIPLFKEIKPEDIENVSSHFLPLFQES 67 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK------- 112 SD + + Y+ +IEHQS+ + M+FR++RY + ++ K Sbjct: 68 RDSDTVNKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAAQQEKLHKGTTK 127 Query: 113 ----ELPLVLPMLFYHGCRSPYPYSLCWLDE-FAEPAIARKIYSSAFPLVDITVVPDDE- 166 P +LP+++Y G S + L + + F I S + +V + + Sbjct: 128 SKDFLYPPILPIVYYEGS-STWSAPLNFKNRVFLSDVFGDYIPSFNYLVVPLNKYSKQDL 186 Query: 167 IMQHRKMALLELI--QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDA-- 222 I ++ +++L+ LI + + L + + + +T +T D LK + + Sbjct: 187 IEKNDELSLIFLINQLQSSSEFHALKDIPKKYTEHLTEDTPDYLLKIIGKVIAVLLHKLN 246 Query: 223 ---QRFRAFIGEIAERAPQE----------KEKLMTIADRLREEGAMQGKHEEALRIAQE 269 + +I R +E + R EG ++G+ + + Sbjct: 247 VPDEEVYEVTDQITRRKFSMMFDNFQAYDVQETRRVSREEGRLEGRIEGERAGRIEGERA 306 Query: 270 MLDRGLDRELVMMV 283 G L+ V Sbjct: 307 GRIEGERLHLIKQV 320 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 95/227 (41%), Gaps = 15/227 (6%) Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN--------HLDAGYKELP 115 +++ VK ++ + Y+++E QSK + M +R++ Y I + +LP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIA-RKIYSSAFPLVDITVVPDDEIMQHRKM- 173 ++P++ Y+G + SL + + + I + L+D+ ++E++Q + Sbjct: 61 AIIPIVLYNGVNR-WTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNLI 119 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 + + L+ + I + +L ++ +L ++ + L N++ EI Sbjct: 120 SSIFLLDRKIDKEELTEKWGKLADVLKDI--SEEEFIILRNWLFSVVSRFLPEDKEKEIK 177 Query: 234 ERAPQEK--EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRE 278 E Q + E++++ +R E + + E ++ GL Sbjct: 178 EILVQSEGVEEMISNLERSLREEFRKTRREGLKEGLKKGKLEGLKIG 224 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 63/140 (45%), Positives = 82/140 (58%), Gaps = 25/140 (17%) Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV-LQTGDAQRFRAFIGEIAERAPQEK 240 +RQRDLLGLV++I SLLVTG NDRQLKALFNY+ +Q G RF FI ++ P K Sbjct: 35 SLRQRDLLGLVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTK 94 Query: 241 EKLMTIADRLR------------------------EEGAMQGKHEEALRIAQEMLDRGLD 276 E+LMT+ +R+R E+G +G+H ALRIA++ML GLD Sbjct: 95 ERLMTLIERIRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGLD 154 Query: 277 RELVMMVTRLSPDDLIAQSH 296 RE V T L+ ++L SH Sbjct: 155 RETVQRFTGLTAEELQDVSH 174 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 29/39 (74%), Positives = 34/39 (87%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKL 39 M STTSTPHDAVFK+FLRHP+TARDF++IHLP LR+ Sbjct: 1 MKKSTTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQR 39 >UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostridium kluyveri RepID=B9E303_CLOK1 Length = 304 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 90/242 (37%), Gaps = 33/242 (13%) Query: 79 VVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--------ELPLVLPMLFYHGCRSPY 130 +E QS+ + M R++ Y + + L K +LP ++PM+ Y+G ++ + Sbjct: 28 CFLEFQSRVDYRMPMRLLFYMVEIWREILKNTSKNDRSKKDFKLPSIIPMVLYNG-KNTW 86 Query: 131 PYSLCWLDEFAEPAIA-RKIYSSAFPLVDITVVPDDEIMQHRKM-ALLELIQKHIRQRDL 188 + D + + + + L DI ++++ M + + L+ K I + DL Sbjct: 87 TACKNFKDVLSGSKLFGENVIDFRYMLFDIYRYNEEQLEDMANMVSTVFLLDKEISKEDL 146 Query: 189 LGLV---------------DQIVSLLVT--GNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 + + D + + L + D + K +L+ + + Sbjct: 147 VKRLRLTAYVLKKITPEQFDILKAWLKSIIKPRLDSESKIKIEEILEKSSQGEVDSMVSN 206 Query: 232 IAERAPQ-EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG----LDRELVMMVTRL 286 + + +E T + R EG +G+ E +E G + + LV T+L Sbjct: 207 LGKTIDNIIREGRETGLEEGRREGRKEGRKEGRKEGRKEGRKEGKSELITKMLVKKFTKL 266 Query: 287 SP 288 Sbjct: 267 PD 268 >UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PZ06_9BACT Length = 238 Score = 104 bits (259), Expect = 4e-21, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 72/188 (38%), Gaps = 10/188 (5%) Query: 96 MRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP 155 M+Y + + + +P V+P++ YHG + E + R I + Sbjct: 1 MKYLLKIWAANSKQMQRLIP-VIPVILYHGKETWKVRRFRDYFEGIDEVFFRFIPEFEYL 59 Query: 156 LVDITVVPDDEIM----QHRKMALLELIQKHIRQRDLL-GLVDQIVSLLVTGNTNDRQLK 210 L D++ ++EI + + + L+ ++I +L + + LK Sbjct: 60 LTDLSFYSNEEIKDKVFRRVSLQITMLLMRNIYNDKILGDKLKAFFEIGKQYFEEGEGLK 119 Query: 211 AL---FNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIA 267 L Y+ D + R I + E + + MTIA RL E+G + G+ E Sbjct: 120 FLESVIRYLYYASDIEEER-VIDTLKEISEEGGRLSMTIAARLIEKGKIAGRMEGRAEGE 178 Query: 268 QEMLDRGL 275 ++ GL Sbjct: 179 RKGRMEGL 186 >UniRef50_C4UAM6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UAM6_YERAL Length = 105 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 16/103 (15%) Query: 210 KALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE-------- 261 K+L NY+LQ GDA + FI E+A R+PQ KE LMTIA +L++EG +G+ E Sbjct: 3 KSLINYMLQDGDAATPKTFIWELARRSPQHKELLMTIAQKLKQEGRQEGRQEGRVEGIQI 62 Query: 262 --------EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 L +A+ ML GLDR VM +T LS DL H Sbjct: 63 GEANGLKKGKLEVARTMLVNGLDRATVMKMTGLSDKDLTQIHH 105 >UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1D5_ABIDE Length = 297 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 101/234 (43%), Gaps = 13/234 (5%) Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPY 130 + V + ++ IE+QS P++ M R++ Y A ++ + G + + VL ++ Y G Sbjct: 67 KNEVIFSFIGIENQSAPDKDMILRIISYDGATYKSQM--GNESIYPVLTIVIYWGKYEWK 124 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 +A I F L+DI + E+++ + + + RQ++ Sbjct: 125 APVSLQERINCPRELADIIPDYRFKLIDIGRLSGKELIKFKS-DFRLVAEFIARQKEYKP 183 Query: 191 LVDQI--------VSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE--K 240 ++I + L+ G+ ++LK + + G + EI R ++ + Sbjct: 184 GKEEIKHPEELLDLLDLLAGDKRFKELKGKVKNIRKEGRIINMCELLDEIENRGIEKGIE 243 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + + ++ E+G +G+ LRIA++ D + +++M T L+ +++ Sbjct: 244 QGIEQGIEKGIEKGRSEGEETATLRIAKKFKDSNVSIDIIMKATGLTKEEIEEL 297 >UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostridia RepID=A5D0D4_PELTS Length = 332 Score = 98.1 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 51/286 (17%), Positives = 102/286 (35%), Gaps = 20/286 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQY 60 ++ HD +FK L +F+++ P + DL +K + ++ Sbjct: 1 MNKDQVDHDRLFKQLLE--TFFAEFMELFFPEA-AQATDLEYVKFLQQELFTDITAGEKH 57 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +D++ + ++ G I V +E QS ++ RM Y + + +LP+ Sbjct: 58 RADIIVETRLKDEPGLILVHVEPQSYIQKEFNERMFIYFSRLYEKYRRK-------ILPV 110 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVP-DDEIMQHRKMALLELI 179 + Y + D F + F +++ + D I +A L Sbjct: 111 AVF-----TYDHIRNEPDSFEIGFSFLDVLRFHFYKLELKKLHWRDYIRSDNPVAAALLS 165 Query: 180 QKHIRQRDLLGLVDQIVSLL--VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 + R + + + + + +L + + +L F + Q F E+ + Sbjct: 166 KMGFRPEERVQVKLEFMRMLARMKLDPARTELIGGFFETYLKLNRQEEEEFYRELGKIDK 225 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 +E E +M I E+G M+G+ E L E G R V Sbjct: 226 KEVELIMQITTSWHEKGRMEGRLEGRLEGRLEGRLEGEARGKVEKA 271 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 96.6 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 54/317 (17%), Positives = 105/317 (33%), Gaps = 43/317 (13%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ--YYSDLLW 66 PHD FK L +FI + P L D + + + + + + DLL Sbjct: 27 PHDEAFKKLLH--TFFAEFIALFFP-ELESQLDFSQTRFLMQEQLVDVVGEEARTLDLLL 83 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 K +I + +E QS + RM Y + H L++P+ + Sbjct: 84 ETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERHRKEHQ----LIIPIAIFTSA 139 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR-- 184 S + + I F V++ P + L+ K Sbjct: 140 ESK-----NERNSLNMSILGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYNK 194 Query: 185 --QRDL-LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 +R+L L + ++ L + L + D ++ + E+A++ +E E Sbjct: 195 GEERELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKESE 254 Query: 242 KLMTIADRLREEGAMQGKHEE------------------------ALRIAQEMLDRGLDR 277 +M + +G +G E +IA+ +L +G Sbjct: 255 VIMELMPAWMRQGYEKGLEEGLEKGIEQGIEKGFEKGIEQGTLIERRQIARRLLSKGFTL 314 Query: 278 ELVMMVTRLSPDDLIAQ 294 E + +T+LS +++ Sbjct: 315 EEIADMTQLSIEEIKKI 331 >UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermoanaerobacterales RepID=B0K813_THEP3 Length = 267 Score = 95.1 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 55/294 (18%), Positives = 119/294 (40%), Gaps = 36/294 (12%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S +D K+ + A D L T L F + R+ SD+++ Sbjct: 2 SQEYDITAKNIFSN--LADDIASYFL------GLKFTKLDELNIEFTTIESRE--SDMVF 51 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 T+ I + IE Q+ + M +RM+RY+ M+ H LP ++ Y Sbjct: 52 KCTTENRD--IALHIEFQTYNDSKMPYRMLRYATEIMEKH-----NLLP--YQVVVYCSK 102 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI---QKHI 183 L + + + + ++D+ + ++I++ + L + K Sbjct: 103 NE-----LKMENNLNYHLGEENLLNFRYRIIDVGKIKFEDIVKTKYYDLYTFLPVADKDK 157 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 RQ++ + + ++ + KA +Y++ T + + E+ E+ E + Sbjct: 158 RQKEKEAYLRKCAEVIRDMPVD----KAKKSYIVTTAEILAGIIYDEEVIEKI--FSEVI 211 Query: 244 -MTIADRLR--EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 M+I + + + +GK E+++ IA+E+L G+D + +T+LS +++ Sbjct: 212 GMSILEESKVYKNILEKGKKEKSIEIARELLKEGMDINKIAQITKLSVEEIKKL 265 >UniRef50_C4FHW2 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FHW2_9AQUI Length = 211 Score = 94.3 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 77/174 (44%), Gaps = 13/174 (7%) Query: 110 GYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ 169 + P ++ ++FYHG R + L + + + L+D+ +PD+E+ Sbjct: 6 KKEYYPPIINIVFYHGEREWNIPTN--LPTVKDKDLQEYTQKLNYILIDLNKIPDEELKN 63 Query: 170 H----RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF 225 + L L+ K I D + + I+ L++ + +D L VL DA++ Sbjct: 64 RISKNMDVILAILVMKRIF--DDIQNLRPILELIIK-HKSDSLFIILDYIVLIKKDAEKV 120 Query: 226 RAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + EI+ EK+MT+ ++ + EG M+GK E L ++ + + + + Sbjct: 121 EKILKEIS----GGDEKMMTLTEKWKMEGWMKGKLEGRLEAQRKAIIKLIQLKF 170 >UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseiflexus sp. RS-1 RepID=A5USQ0_ROSS1 Length = 330 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 51/271 (18%), Positives = 94/271 (34%), Gaps = 21/271 (7%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR--QYYSDLLWS 67 HDA+FK L R+FID+ P L D + + +DL+ Sbjct: 7 HDALFKLVLT--AFFREFIDLVAP-DLAAALDPAPPVFLDKESFADLFDPDRREADLVAQ 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 V+ ++ + + +EHQ++ + + RM RY + + P+ Sbjct: 64 VRLRQHPATLLIHLEHQAQADAALDRRMFRYFARLYDRYDQ-------PIYPIAL----- 111 Query: 128 SPYPYSLC-WLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH-IRQ 185 YP D A R + + + +V + + + A + L+ + + Sbjct: 112 CSYPRPRRPAADRHEVRAAQRTVLTFQYQVVQLNRMDWRAYLTTTNPAAMALMARMRVAP 171 Query: 186 RDLLGLVDQIVSLLVTGN--TNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 D + + LL R+L F + +A+ +A E+A KE + Sbjct: 172 EDRWRVKAACLRLLAGAPLTGAQRRLIGQFVDIYLPLNAREEQALAAEVARLPGAAKEVV 231 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 M + +G +G E E L G Sbjct: 232 MELITSWERKGRAEGLREGLREGRAEGLREG 262 >UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KKN3_9FIRM Length = 297 Score = 91.2 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 51/308 (16%), Positives = 103/308 (33%), Gaps = 33/308 (10%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKL-CDLTTLKLEPNSFIDEDLRQ 59 M + T D++F+ + + +TTL+ + I D+ Sbjct: 1 MCMKPKRTYKDSLFRHIFNDKRRLASLYESLTGRKVAPRDIAITTLRGVFFNDIKNDI-- 58 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-----GYKEL 114 S + + +++EHQS M RM+ Y LD+ + + Sbjct: 59 -------SFRIGDRDI---ILMEHQSSWNPNMPLRMLWYVAKLYSRQLDSQEVVYRSRLI 108 Query: 115 PLVLP--MLFYHGCR-SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 P+ P +FY+G + P L D FA ++ + ++ + Sbjct: 109 PIPAPEFYVFYNGSQDEPDYQKLRLSDAFAHATDTLELAVDCYN-INYST------QNKL 161 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKA--LFNYVLQTGDAQ--RFRA 227 + EL I + + + + L K L Q +++ Sbjct: 162 LDSCYELRCYSIFVQKVREGIQNGLELRTAIRQAITYCKTHDLMGDYFQKNESEVFDMVN 221 Query: 228 FIGEIAERAPQEKEKLMTIAD-RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 F + KE + I + R G + G+ +++A +L +GL ++ T L Sbjct: 222 FKWDQKRALEVAKEDGVAIGEARGEARGKLLGERNAMMKVALSLLKKGLPVGVITESTNL 281 Query: 287 SPDDLIAQ 294 S +++ Sbjct: 282 SLEEVRKI 289 >UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermoanaerobacterales RepID=B0KCX4_THEP3 Length = 267 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 57/296 (19%), Positives = 103/296 (34%), Gaps = 40/296 (13%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S +D K + A D L T F + + SD++ Sbjct: 2 SQKYDITIKDIFSN--MADDITAYFL------GLTYTKTDELNIEFT--KVEKRQSDIVL 51 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 T++G I V +E QS ++ M +RM+RYS+ M+ + Y+ ++ Y G Sbjct: 52 KCTTEKGD--IAVHLEFQSDNDDKMPYRMLRYSLEIMEKYNLTPYQ-------LVIYMGK 102 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 L ++ I + ++D+ + +I + L L+ R+R Sbjct: 103 N-----DLRMENKLDYNLGEENILDYRYKIIDVGTIKFLDITKTDYYDLYALLPIMDRER 157 Query: 187 DLLGLVDQIVSLLVTGNT--------NDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 + + D KA L R F + + Sbjct: 158 RKTEGEKYLKECVEAIKNIPIDINKKKDITFKAEILSGLVYSREVIERVFTEVMEMLRIE 217 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 E E I ++ G E++LRIA+E+L G+D + +T LS +++ Sbjct: 218 ESEAYKMILEK--------GAKEKSLRIAKELLKEGMDINKIAKITELSIEEIKKL 265 >UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XMU9_SYNP2 Length = 316 Score = 87.4 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 50/287 (17%), Positives = 107/287 (37%), Gaps = 23/287 (8%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQYYSDLLWS 67 HD +FK L DF+ + P + + + +L ++ + D++ Sbjct: 7 HDLLFKELLT--TFFWDFLALFAP-EILETAEQNSLTFLTQEVFNDLPGQTRRNVDIVAK 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 + + V +E+Q+ + A RM Y + + LP + P+ + Sbjct: 64 LHFRGQETCFLVHVENQATSQADFAERMFLYFARLYEKY------RLP-IYPIALF---- 112 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 S + F+ +++I S +F + + +P + ++ L+ K + Sbjct: 113 SYRSPQRLEPETFSVAFPSKEILSFSFQTIQLNRLPWRDFLRQPNPVAAALMAKMNFSSE 172 Query: 188 -----LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 L + IV+L ++ L + F + + F E+ PQE+ + Sbjct: 173 ERPKVKLECLRMIVTL--RLDSARIHLLSGFVDTYLRLNMAEQQVFEQELHRIQPQEEAQ 230 Query: 243 LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 ++ I EEG QG+ E A +++ R + + V+ +P Sbjct: 231 VLRIVTSWMEEGLQQGRQEGRQEEACKLILRFVQQRFPEQVSGFAPQ 277 >UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NIZ1_GLOVI Length = 311 Score = 84.7 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 56/309 (18%), Positives = 113/309 (36%), Gaps = 44/309 (14%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQYYSDLLWS 67 HD +FK L +FID+ A + + ++ + +Y +DL+ Sbjct: 4 HDRLFKELLS--TFFVEFIDLFF-ADVGNYLERGSIVFLEKELFSDITAGERYEADLVVK 60 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 + ++ + V IE+Q++ + + ++RM RY + + + P+ + Sbjct: 61 ARFRDHQSFFLVHIENQTEAQSIFSYRMFRYFARLYEKYQL-------PIYPIAVF---- 109 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVD-------ITVVPDDEIMQHRKMALLELIQ 180 + + A ++ F +++ + + + ++ L+ Sbjct: 110 -------SFTEPLRAEPTAHRVAFPDFTVLEFHYRVVQLNRLDWRDFLRQPNPVASALMA 162 Query: 181 KH-IRQRDLLGLVDQIVSLL--VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 + I D + + + LL + + QL + F AQ R F E+A Sbjct: 163 RMRIAPADRPRVKLECLRLLATLRLDPARTQLISGFVDTYLKLTAQEERLFAAELATIGA 222 Query: 238 QEKEKLMTIADRLREEGA--------MQGKHEEALRIAQEMLDR---GLDRELVMMVTRL 286 E+E ++ I ++G +G+ EEAL I L R L + V+ L Sbjct: 223 SEQEAVVQIVTSWMQQGLEQGRQVGRQEGRQEEALAIVLRQLSRRLGTLPAQNAERVSGL 282 Query: 287 SPDDLIAQS 295 S L A S Sbjct: 283 STTALEALS 291 >UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW Length = 286 Score = 83.9 bits (206), Expect = 6e-15, Method: Composition-based stats. Identities = 49/290 (16%), Positives = 98/290 (33%), Gaps = 20/290 (6%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQYYSDLLWS 67 HD +FK L + + + D L + +Y DLL Sbjct: 7 HDRLFKELLTTFFEEFILLFF---PHVHEHIDFRHLSFLSEELFTDVTAGEKYRVDLLIQ 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 K + G I + +E+QS + RM Y + + +LP+ + Sbjct: 64 TKLKGEAGIIIIHVENQSYMQSSFPERMFIYFSRLFEKYRTN-------ILPIAIFS--- 113 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVP-DDEIMQHRKMALLELIQKHIRQR 186 Y + F + F V++ I +A L + + Sbjct: 114 --YDFIRDEPSSFTLQFPFLHVLQFQFLAVELRKQNWRHYIRSENPIATALLSKMGYNEN 171 Query: 187 DLLGLVDQIVSLLVTGN--TNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 + + L Q +L+ N R+L F Q F E+ + +E E++M Sbjct: 172 ERVELKKQFFRMLIRQNIDEAKRRLLIGFFETYVKLTEQEEEQFQNEVKKMGGKEGEQVM 231 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + ++G + G E+ + Q+M+++G+ + + S +++ Sbjct: 232 ELIISYEQKGKIAGAKEKEREMIQKMVEKGMSITQIAHLLDRSEEEVRKV 281 >UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKN5_MOOTA Length = 304 Score = 83.5 bits (205), Expect = 7e-15, Method: Composition-based stats. Identities = 61/292 (20%), Positives = 111/292 (38%), Gaps = 26/292 (8%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQYYSDLLWS 67 HD +FK L R+F+++ PA L D T K I + ++Y D+L Sbjct: 5 HDRLFKELLT--TFFREFMELFFPAA-HTLIDYTDTKFLTQEVITDITAGDKHYVDILAE 61 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM-LFYHGC 126 VK + G + V IE Q+ + A RM Y + H VLP+ +F H Sbjct: 62 VKIKGEDGCVLVHIEPQAYRQADFARRMFIYFSRLYEKHQKR-------VLPIAVFAHDS 114 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ- 185 + F K+ F + + +P + + L+ K Sbjct: 115 KVEETNRHEVEFPFL------KVLQFEFYKIQLKRLPWRQYLNSNNPVAAALLSKMDYSP 168 Query: 186 RDLLGLVDQIVSLL--VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA-PQEKEK 242 R+ + + + + LL + + +L F +A+ ++ +++E P+E ++ Sbjct: 169 RERVQVKIEFLRLLTRMQLDPARMELITAFFDSYLVLNAEEEKSLQEKLSEELQPEEVQR 228 Query: 243 LMTIADRLREEGAMQGKHEEALRIAQEMLDRGL---DRELVMMVTRLSPDDL 291 +M + +G QG+ E I L + L E+ + LS + L Sbjct: 229 VMELTTSWHLKGWQQGRQEGRQEILLRQLRKRLGTTSPEVEAKIKTLSAEQL 280 >UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RQ02_FIBSS Length = 360 Score = 81.2 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 58/307 (18%), Positives = 122/307 (39%), Gaps = 39/307 (12%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFID-----IHLPAPLRKLCDLTTLKLEPNSF--IDE 55 T HDA F+ AR ++ H +L TL P+S+ +D+ Sbjct: 5 NKVTKRKHDAYFRWLFADTTHARCLLELAGKINHEIDAFLTQINLDTLMRIPDSYSEVDD 64 Query: 56 DLRQYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ-NHLDAGYKE 113 +DL + V G + +++EH+S + ++ ++ +Y + M+ + + Sbjct: 65 TG---EADLAFRVNVSTGAPILVGILLEHKSGRDPIIFDQISKYIHSVMKIQDKNRIFSG 121 Query: 114 LPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIM--QHR 171 +P + ++FY+G + P L L++ K+ V++ +PD + + ++ Sbjct: 122 IPT-MAIIFYNGRDNWNP--LKILEKSYPDYFRGKVLPFQCTFVNMADIPDSDCLACENT 178 Query: 172 KMALLELIQKHIRQRD-LLGLVDQIVSLLVTGNTN------DRQLKALFNYVLQTGDAQR 224 + + KH +D LL L+ Q L N ++ L Y+ + + Sbjct: 179 ATGMGIIALKHAFNKDKLLELLPQFCKFLDKMPRNEASCLLEKTSIYLMEYLGKDFLKEL 238 Query: 225 FRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKH---------EEALRIAQEMLDRGL 275 AF+ + +K ++I D R++ A + + EE +I +E L Sbjct: 239 NMAFV------SIGQKYGFVSIGDYFRQQLAEERQQMTEERLQMAEERQQITEERLQMAE 292 Query: 276 DRELVMM 282 +R+ + Sbjct: 293 ERQQITE 299 >UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XJH0_CALS8 Length = 134 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 50/129 (38%), Gaps = 2/129 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + + +A+F+ + ++++ + +E++ +Y Sbjct: 1 MNNNFSQDE-NAIFRLIFSDSKEILFLLKNVAKFSWVDRIQKDSIEVILVDYDNENVLKY 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 D++ V + YI+V + PE M ++ + + + G ++P ++P+ Sbjct: 60 KPDVIAKVTIENNTAYIFVFFVSK-VPECGMRNIILNNMLLFWEKKIKEGTDKIPPIIPL 118 Query: 121 LFYHGCRSP 129 + Y+G Sbjct: 119 VLYNGKEIW 127 >UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Treponema vincentii ATCC 35580 RepID=C8PTN1_9SPIO Length = 303 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 106/311 (34%), Gaps = 30/311 (9%) Query: 3 ISTTSTPH-DAVFKSFLRH----PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 +ST + + D+VF + + L+ C + +KL+ ++ Sbjct: 1 MSTANRKYKDSVFVDLFSEDEKAKENFLSLYNALHGTNLQLSCPVENIKLDNVMYM---- 56 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 +D+ S I V+ EHQS E M R ++Y + + L + Sbjct: 57 -NIVNDV--SCLVDNK---IIVLAEHQSTINENMPLRFLQYIARLYEKLQKPTDRYLRTL 110 Query: 118 --LPM----LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH- 170 +P +FY+G ++ L + R + +I E++ Sbjct: 111 SKIPTPEFYVFYNGLNDYPETTVLKLSDAFITKPERIPLDLEVKVYNINKSKGAEVLSRC 170 Query: 171 RKMALLELIQKHIR-------QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQ 223 + + L + +R + V + + R+ + + N ++ D Sbjct: 171 KTLDEYSLFIEEVRLQTQLDPENGFTNAVKICIEKGILKEYLQRKSREVINMLIAEYDYD 230 Query: 224 RFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 A E A + K + + +G QG H++AL A+ M + + + Sbjct: 231 TDIAVQREEAGKIAFAKG-ISQGLSQGISQGLSQGSHQKALETARLMKQANCEIPFIAKM 289 Query: 284 TRLSPDDLIAQ 294 T L+ ++ + Sbjct: 290 TGLTQAEVESI 300 >UniRef50_Q3C0L0 TpnA protein n=2 Tax=Sodalis glossinidius RepID=Q3C0L0_SODGL Length = 131 Score = 78.1 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 29/60 (48%), Positives = 40/60 (66%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +++T + HD VFK FL ARDF++IHLP LRK CD +TL + SFI++DL+ S Sbjct: 1 MTSTLSHHDHVFKKFLGDIAVARDFLEIHLPPHLRKHCDFSTLAMASGSFIEDDLKGQCS 60 >UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID=Q73P51_TREDE Length = 292 Score = 78.1 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 54/312 (17%), Positives = 112/312 (35%), Gaps = 40/312 (12%) Query: 3 ISTTSTPH-DAVFKSFLRHPDTARD-FIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 +ST++ + D+VF + A++ F+ ++ L P I Sbjct: 1 MSTSNRKYKDSVFVDLFSEDERAKENFLSLY-----NALHGTNLPMSCPVENI------R 49 Query: 61 YSDLLWSVKTQEG----VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ-------NHLDA 109 ++++ + G I ++ EHQS E M R + Y + +L Sbjct: 50 LDNVMYMNIINDVSCLVDGKIIILAEHQSTINENMPLRFLEYIARLYEKLQAPTDRYLKK 109 Query: 110 GYK-ELPLVLPMLFYHGCRS-PYPYSLCWLDEF-AEPAIARKIYSSAFPLVDITVVPDDE 166 K P +FY+G P +L D F +P A +++I ++ Sbjct: 110 LSKIPTPEFY--VFYNGKEDYPETTALKLSDAFITKPKQAP--LELTVQVLNINTDKANK 165 Query: 167 IMQH-RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF 225 I+ + + L + +R++ L + + + K + L + Sbjct: 166 ILTACKPLEEYSLFVEEVRKQTQLDPENGFTNAIKICIE-----KGILKEYLMRKSREVI 220 Query: 226 RAFIGEIA---ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 + E + A Q +E L ++ +G G +++A+ IA+ G D + + Sbjct: 221 NMLVAEYDYDTDIAVQREESLRIGIEQGIRQGFSDGAYQKAIEIAKAFKQFGFDIDKIAE 280 Query: 283 VTRLSPDDLIAQ 294 T LS +++ Sbjct: 281 GTGLSREEIEKL 292 >UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillus coagulans 36D1 RepID=C1PBU4_BACCO Length = 329 Score = 77.3 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 58/335 (17%), Positives = 111/335 (33%), Gaps = 54/335 (16%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLR 58 M HD +FK +++ ++F+D P L D ++ + Sbjct: 5 MEKHAGYHVHDRLFKELIQN--FFQEFMDAFFP-DLSADLDYRRVRFLSQEQFTDFPGGE 61 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 Q D+L K + I + +E QS E+ RM RY + H VL Sbjct: 62 QKRVDILAETKVKGKDTVILIHVEPQSYYEKPFPERMFRYYMMISLRHRK-------PVL 114 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 P+ + + F I R Y + + + I + +A L Sbjct: 115 PIAVFS-YEEKTETPDTYTFAFHNIEILRFHY---LSIHLMKQNWRNYIRSNNPVAAALL 170 Query: 179 IQKHIRQRDLLGLVDQIVSLLVT---GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 + + + + + + + +L R L F+Y L+ + + + I Sbjct: 171 SKMGYTETERVQVKLEFLRMLARMELDPAKMRLLHGFFDYYLKL-NEKEEAEVMENIKML 229 Query: 236 APQEKEKLMTI------------------------ADRLREEGAMQGKHE---------- 261 P E E+++ + ++ REEG G + Sbjct: 230 DPDEAEQVLKLPNSYFDRGYKKGKEEGREEGIEIGVEKGREEGIEIGVEKGREEERKEML 289 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 + + IA +ML G + +L++ T LS ++ Sbjct: 290 QTIPIAIKMLQEGRELQLIVEKTGLSQREVEKIKQ 324 >UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RGS0_MOOTA Length = 310 Score = 75.4 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 44/288 (15%), Positives = 103/288 (35%), Gaps = 37/288 (12%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 + +D K H A + + ++E SDL Sbjct: 4 KSGNRYDITIKDLFADETQELINYFGHFEARVTGDLKIEFPQVET----------RVSDL 53 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL-PLVLPMLFY 123 + +K + G + + +E QS+ ++ M +RM+RY++ +K V ++ Y Sbjct: 54 V--MKAESQQGPLAIHLEFQSRNDDEMPYRMLRYALEI--------HKTYHLPVYQIVIY 103 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 G + + + + + + L+D+ + +E+ LL L+ Sbjct: 104 FGQ-----WQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRLLSLLPVVD 158 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG-------DAQRFRAFIGEIAERA 236 R++ G + + +D L+ +L+ D + E+ + Sbjct: 159 REKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGLVFDKKAIDLVFREVEQML 218 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVT 284 E+ R+ E+G +G + + ++ +++G +E ++ VT Sbjct: 219 SIEESA---GYQRIFEKGMEKGIEKGMEKGMEKGIEKG-QQESLLDVT 262 >UniRef50_Q6D6X6 Putative transposase (Fragment) n=2 Tax=Pectobacterium RepID=Q6D6X6_ERWCT Length = 135 Score = 75.4 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 32/127 (25%), Positives = 56/127 (44%), Gaps = 16/127 (12%) Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 D+L L I L Q +A+ Y+ ++G+ + FI +A+ ++E +M Sbjct: 4 HHDMLELAQDIGILFERWQIPLPQKRAILFYIARSGNTSKPAEFIEAVAQSLSTDREAIM 63 Query: 245 TIADRLR----------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 TIA +L + G QG A +IA+++L G++ V +T+LS Sbjct: 64 TIAQQLEKIGFEKGIKHGMQQGMQRGMEQGIKTSARQIARQLLLSGMEPAQVCQMTQLSA 123 Query: 289 DDLIAQS 295 +L S Sbjct: 124 AELAQLS 130 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 61/286 (21%), Positives = 118/286 (41%), Gaps = 31/286 (10%) Query: 22 DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVI 81 R+ + LP ++ + L +E + + ++ +DLL V+ +G Y+ + + Sbjct: 16 KIFRENMHNTLPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYV-LHV 74 Query: 82 EHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA 141 E+Q+ MAFRM YSI + H +LP V + Y G P ++ Sbjct: 75 EYQTDNYPEMAFRMAEYSIMLQRKH------KLP-VKQFVIYIG---PAKANMA------ 118 Query: 142 EPAIARKIYSSAFPLVDITVVP-----DDEIMQHRKMALLELIQKHIRQRDLLGLVDQIV 196 +I K + + L +++ V ++++ + +A+L + + L +V +I Sbjct: 119 -TSITTKDFRFRYNLTELSAVNYKLFLKSDLVEEKMLAILSNLASESTESVLAQVVQEIE 177 Query: 197 SLLVTGNTND--RQLKALFN----YVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIAD-- 248 + T RQL+ L D E + + E I Sbjct: 178 THTSTLEQGRYFRQLRILLQLRNLNKKAIKDMALVGKIFKEEKDILYRRGEIKGEIKGEI 237 Query: 249 RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + +G +G++EEA+ IA E+ GL E + +T+LS +++ A Sbjct: 238 KGEIKGIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEIQAL 283 >UniRef50_A8VV66 ATPase associated with various cellular activities, AAA_3 n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VV66_9BACI Length = 214 Score = 73.5 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 40/195 (20%), Positives = 76/195 (38%), Gaps = 19/195 (9%) Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIA------RKIYSSAFPLVDITVVPDDEIMQ 169 L++P+L G R + D F+ + A I + + L DI ++++ Sbjct: 12 LIIPILIAQGRRRWSRSTTLMADFFSHYSEALRDDCEPFIPNFRYLLYDIQEQDAADMIR 71 Query: 170 HRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTND------RQLKALFNYVLQTG--- 220 H + + + + + D L ++ LL + Q+ L YV++ Sbjct: 72 HTLLKITIELMALVFEEDESKLEARMTELLTMSEIGEISDSYAEQVLRLLEYVMRGNRHF 131 Query: 221 DAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 D F + A + E +M AD+L ++ GKH++ L I ++ RG +E + Sbjct: 132 DQAMFETIRQNVTTEAHEGSELIMNFADQLEQK----GKHKKELAIFLKLTRRGESKESI 187 Query: 281 MMVTRLSPDDLIAQS 295 M + L A Sbjct: 188 MDLLDLDDKSFEALQ 202 >UniRef50_C4GYF6 Transposase n=20 Tax=Yersinia pestis RepID=C4GYF6_YERPN Length = 105 Score = 73.5 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 25/81 (30%), Positives = 46/81 (56%), Gaps = 1/81 (1%) Query: 200 VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGK 259 + + Q+ AL +Y+LQ G++ AF+ E+A+R PQ + LMTIA +L ++G +G Sbjct: 1 MADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGI 60 Query: 260 HEEALRIAQEM-LDRGLDREL 279 + Q+ L+ G+ ++ Sbjct: 61 EKGIQLGEQKGKLEVGVSLDI 81 >UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3R2_ABIDE Length = 336 Score = 72.7 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 52/303 (17%), Positives = 111/303 (36%), Gaps = 52/303 (17%) Query: 11 DAVFKSFLRHPDTARDFIDI-HLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D+VF R H + D + LE FI+ Y+DL ++VK Sbjct: 67 DSVFTLLFSDIKNIRKLYQSLHDDSDSYSDEDFKIITLENV-FINAP----YNDLGFTVK 121 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--------GYKELPLVLPML 121 + + ++ E QS M R++ Y + +++ LP ++ Sbjct: 122 NK-----VIILAEAQSTFNPNMGLRLLIYIAQSYHDYISEYKFNIFSEKLIRLPNPEFIV 176 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 Y G + + D F + LV + V+ + + + +IQ+ Sbjct: 177 IYSGSKKTDITEIRLSDCFESGT------APNIELV-VKVIGGNNVKE-------GIIQE 222 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 +++ ++ +++ S+ + LK + + G + F + + ++ Sbjct: 223 YLKFCEMYD--EKVRSVKPSEEKAYS-LKKVIKDCIDNGILKDFLTLHQK------EVED 273 Query: 242 KLMTIAD--------RLRE--EGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 +MT+ +L E +G QGK + +L A+ ML + ++ +T LS + + Sbjct: 274 MMMTVIPPEQALEYIKLEEYNKGIEQGKLDTSLNFARNMLKNNYSIDSIIEITGLSREQI 333 Query: 292 IAQ 294 Sbjct: 334 KRL 336 >UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3A9D Length = 324 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 49/315 (15%), Positives = 103/315 (32%), Gaps = 56/315 (17%) Query: 11 DAVFKSFLRHPDTARDFIDIHL-------PAPLRKLCDLTT--LKLEPNSFIDEDLRQYY 61 D + K + PD D I+ L + D+ T ++ E + + Sbjct: 20 DILLKDYFT-PDIFADAINAILYDGKSVVTPERMRTIDIETQRVEDENGNVTADT---RL 75 Query: 62 SDLLWSVKTQEGVGYIYVV--IEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPL 116 D S K E IY + IEHQS + M R+M Y + + + + Sbjct: 76 RD---SAKVVEVDDAIYCLFAIEHQSVEDYTMPLRIMEYDVREYLRQVKSNKGVQVRIKP 132 Query: 117 VLPMLFYHGCRS-PYPYSLCWLDEFAE--------PAIARKIYSSAFPLVDITVVPDDEI 167 ++ ++ Y ++ + + D F + + I L + V ++++ Sbjct: 133 IITIVMY--WKADKWNQPVSVKDMFDKNTVRWLEYNGLGGYIQDYRMHLFEPGTVKEEDL 190 Query: 168 MQ-HRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 + ++ + K+ + + L ++ + L N + + Sbjct: 191 EKFKTELKDVIAYVKYSKSTEALKDYNE-----KYKPDLTKSTVTLINEL------TNSK 239 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALR-------IAQEMLDRGLDREL 279 E ER + + L EEG +GK EE ++ + RG+ Sbjct: 240 YVFIEGKERL-----DMCEAFEGLIEEGRAKGKAEELKEKYKSWVTLSNNLKKRGMSNPE 294 Query: 280 VMMVTRLSPDDLIAQ 294 + + + +L Sbjct: 295 IASLLGVPETELQKA 309 >UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultured bacterium RepID=B5U1X5_9BACT Length = 304 Score = 71.6 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 56/311 (18%), Positives = 107/311 (34%), Gaps = 32/311 (10%) Query: 1 MTISTTSTP---H-DAVFKSFLR-HPDTARDFIDIHLPAPLRKLCDLTTL-KLEPNSFID 54 M + H D++F + D + F+ ++ L TL + ID Sbjct: 1 MQNENPTNENRSHKDSLFVDYFSKDRDWKQHFLSLYNALHGTNLQVADTLLERVN---ID 57 Query: 55 EDL-RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE 113 + L + YY+D+ V G ++IEHQS M R++ Y N +D+ K Sbjct: 58 QVLYKSYYNDIAVLV-----NGQFILMIEHQSTINPNMPLRLLEYVARIYGNLVDSKAKF 112 Query: 114 LPLVLPM------LFYHGCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDE 166 ++P+ +FY G + P S L D F + L V Sbjct: 113 SRHLVPLARPEFYVFYTGDQKLPPESYLHLSDSFPNQP-----PKADLTL--ELKVKVCT 165 Query: 167 IMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 I ++ + L LV++ + +A+ +L+ +R Sbjct: 166 IKSDHPSPVVHRCPDLEQYAQFLKLVEEAKAAGQAEPLTWAIQEAVRRNILRDYLERRGG 225 Query: 227 AFIGEIAERAPQEKEKLMTIADRLRE---EGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 + + + + + + G +G ++ L A+ +L GL ++V Sbjct: 226 ETLSILMAEYDYATDFAVQKEEAYEDGLFAGLERGAYQNKLETARSLLSEGLAPQMVARC 285 Query: 284 TRLSPDDLIAQ 294 T L + + Sbjct: 286 TSLPLETVQQL 296 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 56/299 (18%), Positives = 113/299 (37%), Gaps = 28/299 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK+ + F++ L R++ D+T L E I + + + +++ + Sbjct: 10 DVGFKAVFQDKQVTIKFLNAALAGE-RQIKDITYLDKE----IKPETVENRT-IIFDLLC 63 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFYHGC- 126 ++ G +++ E Q+ P+ R Y + G + L + + F + Sbjct: 64 EDVSGAKFIL-EMQNCPQHYFFNRGFYYLCRMVARQGQIGKQWQYRLLPIYGVYFLNFKL 122 Query: 127 ------RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 R+ + + ++IY S FPL ++ + + R + L+ Sbjct: 123 PEFTDFRTDVVLANERTGKVFNEIKMKQIYIS-FPLFSLSK-EECKSSFERWIYTLK-NM 179 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI-----AER 235 Q + + LL N N K Y + + +R + I Sbjct: 180 NLFEQSPFKEEQETFLRLLDVANVNSLSEKERAIY---EENLKNYRDWYATIDYAQTEGI 236 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +E + + E+G +G+ EE L+IA++M +GLD EL+ + LS +D+ Sbjct: 237 EKGMQEGMQKGMQKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIERL 295 >UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubacterium biforme DSM 3989 RepID=B7CC32_9FIRM Length = 301 Score = 70.4 bits (171), Expect = 6e-11, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 108/296 (36%), Gaps = 16/296 (5%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR-QYYSDLLWSVK 69 D K FL + DF + + ++ + D ++ + + D++ K Sbjct: 6 DKTMKEFLENNAYFVDFFNAYF-FDGERVLKPENCMELDSEMNDSNMDLEKHVDVI--RK 62 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN-----HLDAGYKELPLVLPMLFYH 124 +G Y +IE+QS + M R Y A + ++LP+V ++FY Sbjct: 63 YNDGNLYSAFIIENQSYVDASMVVRAAAYEFVAYDRMLKKLKKNKAKEKLPMVHILVFYT 122 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVP----DDEIMQHRKMALLELIQ 180 G + + + + L++IT ++E + + + Sbjct: 123 GEKLWNAANKLSQLVEVDERFESYFHDYQMNLIEITGNTSYNFNEEDVYNLFYICRSIYD 182 Query: 181 KHIRQR--DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 + I + + GLV V +V T+ L + + E+ + + Sbjct: 183 QSIYEEKSNGFGLVKSSVLKVVKTLTDVEWLDLEELEEKEEIEMCEAEKRWLEVKSKEWE 242 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 K + ++ E+G QG ++ L + ++M+D+G + + + +S + + Sbjct: 243 AKG-IKKGIEQGIEQGIEQGSEKKELEMYRKMMDKGFGIKAIASIFSVSEESIEKL 297 >UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7BWQ7_9GAMM Length = 290 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 57/309 (18%), Positives = 107/309 (34%), Gaps = 42/309 (13%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFID--EDLRQYYSD 63 HD++FK + +F + P + FI E+L++ Sbjct: 3 NPKSHDSLFKWLIT--AFTTEFFGHYFPD-----IRIGEYTFIDKEFISKYENLKESLKG 55 Query: 64 LLW---SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 L+ V+ + I + IEHQS+ E ++ R+ YS A V + Sbjct: 56 DLFLGMEVEIDGLLREIIIQIEHQSERE-DVSERVYEYSCYAWLLKKK-------PVWSI 107 Query: 121 LFYHGCRSP-YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 + Y P + + F + F ++ + +++Q + L Sbjct: 108 VIYTDEAVWRKPVTEQFWYAFDSQKGKQYH---HFDVIKVKAEKSSDLIQKHSLMCKLLA 164 Query: 180 QK-HIRQRDLLGLVDQIVSL--LVTGNTNDRQLKALFNYV--LQTGDAQRFRAFIGEIAE 234 K RQ D LV +I L+ + QL + +V + +R EI Sbjct: 165 LKADDRQTDPEKLVYEIYRAAALMKEQLTNEQLLLIDQWVSFYKKVSEKRLDKIKKEIKM 224 Query: 235 RAPQEKEKLMTIAD--------RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 + TI++ + EG +G+ + + A +L G+D E++ T Sbjct: 225 DFIE-----TTISEHVYNQGWIKGEAEGKAEGEAKGRKKTAINLLKMGIDVEIIQKATGF 279 Query: 287 SPDDLIAQS 295 S ++ S Sbjct: 280 SDAEIKQMS 288 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 61/316 (19%), Positives = 104/316 (32%), Gaps = 32/316 (10%) Query: 1 MTISTTSTPHDAVFKSFLR-HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFID--EDL 57 M + D FK HP ++ LP L + +K P + E Sbjct: 1 MVMKYLDPKADLTFKKIFGNHPKRLISLLNALLP--LSDEEQIREIKYLPTELVPQLEGG 58 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--ELP 115 + D+L + G + +E Q + + R++ + + G K EL Sbjct: 59 KNTIVDVL----CTDVRGRKFC-VEMQMEWSDAFQQRVLFNASKLYVSQAKKGGKYSELQ 113 Query: 116 LVLPM-----LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH 170 V + +F H P + + + I F +++ I Sbjct: 114 PVYSLNLINDIFAH----DTPDFIHNYRIVHDKDSNKVIEGLHFTFIELPKFTPHSIADK 169 Query: 171 RKMALLELIQKHIRQR------DLLG--LVDQIVSLLVTGNTNDRQLKA---LFNYVLQT 219 R M L I DLL + + V L +D +L+A ++ V Sbjct: 170 RMMVLWLRFLTEINSNTKDIPADLLNDPEIGKAVEELEISGFSDAELRAYDKFWDSVSVE 229 Query: 220 GDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 G+ + E + ++ E+G +GKHE IAQ +L GL E Sbjct: 230 RTLIDDSYQKGKEKGKQEGLAEGMEKGMEKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQ 289 Query: 280 VMMVTRLSPDDLIAQS 295 V T+L + + S Sbjct: 290 VSKATQLPLEIIKNLS 305 >UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWJ8_9FIRM Length = 292 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 52/284 (18%), Positives = 107/284 (37%), Gaps = 29/284 (10%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLR-KLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D++F R D +D + + L TL+ +F +++ +D+ + Sbjct: 10 DSLFCDIFRRKDYLQDVYRGLFGRDVSLQEIQLMTLQ---GTFFNDE----KNDVSFLA- 61 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL------DAGYKELPLVLPMLFY 123 G I V++EHQS E M RM Y + + LP +FY Sbjct: 62 ---GKRQI-VLMEHQSTLNENMPLRMFWYMAKLYRKQVPKDAPYRTRRLRLPAPCFYVFY 117 Query: 124 HGCR-SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH-RKMALLELIQK 181 +G +P + + + F + ++ A+ +I + +++ R + + Sbjct: 118 NGLDPAPDEWEMRLSEAFEGECSSLELCVKAY---NINEMSGSRLLEKSRALKGYSVFVA 174 Query: 182 HIRQRDLLGL-VDQIVSLLVTGNTNDRQLKALF--NYVLQTGDAQRFRAFIGEIAERAPQ 238 IR++ G+ +++ V + L F + + D F+ + E+A+R Q Sbjct: 175 QIRRKTAAGVCLEEAVKQAIRYCIEQDLLAEYFLEREMEEVFDMVSFK-WDPELAKRV-Q 232 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 +E ++ E+G +G E L + ++ D V Sbjct: 233 LQEAQEIGMEKGMEKGMEKGVTEIVLNMLKKKKWSLQDISEVSQ 276 >UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LE73_9FIRM Length = 326 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 91/242 (37%), Gaps = 16/242 (6%) Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY------- 111 Q + D+ + + G I V I++Q+ + M R+M Sbjct: 64 QRFRDITGKAEADKNAGCIIVAIQNQTTVDYGMPLRVMLEDALEYDVQRRTKKNRKLHKG 123 Query: 112 KELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKI--YSSAFPLVDIT-VVPDDEIM 168 ++L LV+ ++FY+G +P+ + + P R++ Y ++P+V +T D Sbjct: 124 EKLCLVITLVFYYG-TTPWRAPSDLAEMISVPREFRQLREYIQSYPIVVVTPENVDTACF 182 Query: 169 QHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALF-----NYVLQTGDAQ 223 + +LE++++ ++++ +++ ++ + ++ Y + Sbjct: 183 RGGWQEILEILRRQNDEKEMGRYLEKNRAIYEKLPEDTNRVIFALTDHLDYYRELKEKGE 242 Query: 224 RFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 + +E R R +G QGK + + M++ + ++ Sbjct: 243 KITMCKAFTDHYKSGVEEGKKQGMKRGRRQGIKQGKRQGMDMGIRAMIETCRELKIPRNE 302 Query: 284 TR 285 T+ Sbjct: 303 TK 304 >UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXX0_9FIRM Length = 301 Score = 68.9 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 58/304 (19%), Positives = 107/304 (35%), Gaps = 34/304 (11%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSF--IDEDL-RQY 60 +T T D++F+ + + LP L D T + + IDE L Sbjct: 3 NTKRTYKDSLFRDIFNNAER--------LPEIYEALLDHKT-TPDDITLATIDETLFTGV 53 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA----GYKELPL 116 +D+ + V G ++ +++EHQS M R++ Y + + ++D + +PL Sbjct: 54 KNDIGFIV----GNQHV-LLVEHQSTINANMPLRLLMYLVEIYRRYVDKDAIYKKELIPL 108 Query: 117 VLP--MLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM 173 P +FY+G P ++L D F ++ F + D P E H Sbjct: 109 PAPKFYVFYNGLAEMPDIWALHLSDAFGGHDSDLELEVKVFNINDKPNRPILE-KCHALK 167 Query: 174 ALLELIQKHIRQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 + + K +R+ ++ V V L F Q + F+ Sbjct: 168 SYSVFVAK-VRECIKNGSSLEIAVGNAVQYCVAHDYLGEYFR-QKQAKEVFDMLNFVWNQ 225 Query: 233 AERAP-QEKEKLMTIADRLREEGAMQGKHEEALR-----IAQEMLDRGLDRELVMMVTRL 286 + +E + R+EG QG + L I M E M + ++ Sbjct: 226 ERALEVRAEEAMEKGLRLGRQEGLSQGLSQGVLETTTASIRNVMKSMDFPIEKAMDILQI 285 Query: 287 SPDD 290 ++ Sbjct: 286 PEEE 289 >UniRef50_C8W2V6 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2V6_DESAS Length = 300 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 36/252 (14%), Positives = 90/252 (35%), Gaps = 24/252 (9%) Query: 35 PLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVV-IEHQSKPEELMAF 93 + + + ++ I + SD+L+ V GY Y++ IE Q +P+ M Sbjct: 22 EMVRGITVEDVQRVEKEAIA---VKRESDMLFRVS---EDGYEYLMAIEMQIRPDREMPR 75 Query: 94 RMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSA 153 R++ Y+ H + P+++ + + + Y LD + + Sbjct: 76 RLLEYTA---MQHREFKKPVYPVIVNLTGH--KKKDESYCFDCLDFT--------VVTFN 122 Query: 154 FPLVDITVVPDDEIMQHRKMALLELI--QKHIRQRD--LLGLVDQIVSLLVTGNTNDRQL 209 + ++++ +P + ++ + L+ L+ +H + V ++ + G D L Sbjct: 123 YRQINLSDLPGQDFLRSGPVGLIPLVVLMRHDEAPEEVFAKCVQRVDEVQDEGLRADLYL 182 Query: 210 KALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQE 269 ++ + E +P D+ + G +G + + Q+ Sbjct: 183 GLAVLSTIKFTREIILKYIEVNKMENSPLFDGIREKWIDQGEQIGFQKGIQKGIQQAMQQ 242 Query: 270 MLDRGLDRELVM 281 + L+ + M Sbjct: 243 SILEALEENIGM 254 >UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C351D8 Length = 313 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 99/300 (33%), Gaps = 38/300 (12%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY-- 61 D +F+ + D + DL L ++ + Sbjct: 5 KLNRNYKDRLFRLAFQEKKDLLDLYNAVSGRQYTNPDDLIITTLADAIYLGMKNDISFLV 64 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--------AGYKE 113 SD+L + EHQS M R + Y + ++D Sbjct: 65 SDVL-------------NLYEHQSSFNPNMPVRGLNYFADTYREYIDRNGFDIYGEKLIR 111 Query: 114 LPLVLPMLFYHG-CRSPYPYSLCWLDEF-AEPAIARKIYSSAFPLVDITVVPDDEIMQH- 170 LP+ ++FY+G P L D F + + +++I + E+M Sbjct: 112 LPMPQYIVFYNGTKEEPDRIELRLSDAFLCQNPEEKGCLECRATMININYGHNKELMDRC 171 Query: 171 RKMALLELIQKHIRQRDLLGL-VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFI 229 R++ + IR + G+ +D+ V V + + +L+ A+ + Sbjct: 172 RRLKDYAVFVSRIRNNEKRGMALDEAVKQAVHSCIEE----GILADILKKNRAEVCNLIL 227 Query: 230 GEIAERAPQEKEKLMTIADRLREE-GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 E E+ + + IA + G +G+ E + I + M +GL+ + + L Sbjct: 228 YEYDEQ------RQLAIAREGAMKAGREEGRAAEQVTIIRNMAGKGLNPSAIADMLGLEE 281 >UniRef50_C8W1F3 Putative uncharacterized protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W1F3_DESAS Length = 303 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 90/263 (34%), Gaps = 41/263 (15%) Query: 56 DLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP 115 ++++ D ++ +K + +E Q+ + + RM+ Y ++ + Sbjct: 55 EVKEKRIDFVFLLKDNS-----ILHLEFQTTIPKDILIRMVTYGSRLVEKYDQDVNT--- 106 Query: 116 LVLPMLFYHGCRSPYPYSLCW-----------LDEFAEPAIARKIY-----SSAFPLVDI 159 ++ Y G P L + +F A ++IY +DI Sbjct: 107 ----VVIYSGKIESAPRLLRKGSLTYKVKNIYMKKFDGDAEYKRIYEKIKNKKPLDEIDI 162 Query: 160 TVV-------PDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKAL 212 + ++ + EL ++ + + IV+ + + K L Sbjct: 163 QRLIFLPLMKSKEKSEDEMAIQAAELAKEIPNEPIRAFTIGAIVA-ISDNFLTEEYKKRL 221 Query: 213 FNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD 272 + + +I E R KE L + +EG +G E + A L Sbjct: 222 LEVL----RMTQIEQWIRE-EGREEGLKEGLKEGREEGLKEGLKEGLREGLEKTAIAALR 276 Query: 273 RGLDRELVMMVTRLSPDDLIAQS 295 G D E ++ +T LS +++++ Sbjct: 277 EGFDIETIVKITNLSKEEILSLK 299 >UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteria RepID=B1WSK8_CYAA5 Length = 260 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 43/274 (15%), Positives = 100/274 (36%), Gaps = 25/274 (9%) Query: 22 DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVI 81 D F+ ++L + L +D L +++ + I + I Sbjct: 5 DNVCKFLAERFSRDFANWLLNEPIELTELKPTELSLNPIRADSLIFLQSDD----IVLHI 60 Query: 82 EHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA 141 E Q+ P+E + FRM Y + + + + + ++ Y P L + + F Sbjct: 61 EFQTSPDEDIPFRMTDYRLRVYRRYPNKE------MYQVVIY---LKPSNSELVYQNTFE 111 Query: 142 EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVT 201 + F ++ + D + + + ++ R+ L QI +++ + Sbjct: 112 LTNL-----RHQFNVIRLWEENTDSFLNNSGLLPFAVLTCTDNPRETLT---QIAAIIDS 163 Query: 202 GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 RQ + + +G + + R+ KE + I + EG ++G+ + Sbjct: 164 MPNQQRQSDISASTAILSGLKLDQDSIKRIL--RSDIMKESV--IYQEIFHEGEVKGQKQ 219 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 IA ML ++ E++ +T L+ ++ + Sbjct: 220 AIKNIALNMLRNHMNLEVISQLTGLNLQEIEQLN 253 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 65/298 (21%), Positives = 103/298 (34%), Gaps = 33/298 (11%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D + K P+ FI L + L +L F +++L D+ K Sbjct: 13 DIMAKKIFSLPEVTVAFIRDILDLDVVDAQILEGTQLHKKDFDEDELFSTSVDV--RAKL 70 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAM----QNHLDAG-----YKELPLVLPML 121 +G V+IE Q + + R Y + Q G Y+++ V + Sbjct: 71 NDGTE---VIIEIQVRKQHYFLNRFHYYLANQLVENVQQLRQQGQTHKMYEQMEPVYGIA 127 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL--- 178 P S A + +YS D + ++A LEL Sbjct: 128 ILEKTLLPDEESPINTYWMANSRTGKPLYSF---------YKDGKQQNLLQIAFLELDKY 178 Query: 179 -IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 KHIR D + L R + + + + Q +A I E Sbjct: 179 NKDKHIR--DEGRQWLEFFGNLPFSKAPSRAVTHADSLLDSSSWTQEEKAMIDERIRIQE 236 Query: 238 QEKEKLMTIADRLREEGAMQ----GKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 + T D REEG Q G++E L + ++ML +GL E+V VT LS ++L Sbjct: 237 NYDMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSLEEL 294 >UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZPJ4_9SPHI Length = 302 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 54/284 (19%), Positives = 110/284 (38%), Gaps = 38/284 (13%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR---QYYSD 63 S +D +FK + + +L +++ + + + L+ + +D Sbjct: 19 SNQYDKIFKENIG--EHFLSLSKTYLG-----------IEVASSEELKDKLQTTLEREAD 65 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 L + T +G I + +E QS E+ MA RM Y Q + +LP + + Y Sbjct: 66 FLRKITTPKGEQMI-IQLEFQSTDEQGMAERMQLYFAILRQKY------KLP-IRQFVIY 117 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA-LLELIQKH 182 G + P + +E + F L+D+ V + ++ +L + Sbjct: 118 VGSKPPKMRTRLKPEEV----------FTGFELLDLRQVSYTQWLESDIPEEVLLAVLGD 167 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 +Q+ + ++ QI+S +V + L+ Y+ Q R R + E + Sbjct: 168 FQQKKVSTVLKQIISKIVKLIDDPGTLQ---KYIRQLATFARLRNLVIETEQTLEYMGLT 224 Query: 243 LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 D + G +G+ E + QE +++G+ + +V MV L Sbjct: 225 YDIEKDVFYQRGVKKGQQEGIEKGHQEGIEKGITQGVVKMVIAL 268 >UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C353CE Length = 319 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 47/293 (16%), Positives = 97/293 (33%), Gaps = 40/293 (13%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F+ + + + DL L D+++ +K Sbjct: 29 DRLFRMVFNRKEELLSLYNAVSHSEYTNPDDLEINTL--------------DDVIY-MKM 73 Query: 71 QEGVGY----IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA------GYKELPLVLP- 119 + + + + + EHQS M R Y + + ++D G + L +P Sbjct: 74 KNDLAFLIDDVLNLWEHQSTWNPNMPVRGTFYIVEEYRKYIDQNGLNLYGSSRITLPVPQ 133 Query: 120 -MLFYHGCR-SPYPYSLCWLDEFAEP-AIARKIYSSAFPLVDITVVPDDEIMQH--RKMA 174 +FY+G R P L D F+ + +++I ++E+M+ Sbjct: 134 FYVFYNGLREEPDYIELKLSDAFSRVHSEVEPCMEFKAVMLNINRGHNEELMRQCTTLRE 193 Query: 175 LLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 E + + + + +++ ++ D + L A+ F + E E Sbjct: 194 YAEFVARIRDETEDGTALEEAAMNVMDSCIRD----GILAEFLSVHRAEVFEVLLTEYDE 249 Query: 235 --RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 EKE EG +G E+A +A ++ +G E + Sbjct: 250 QRHIASEKEISR---REGHMEGRTEGILEKAKEVAVNLIKKGFTVEDAASICG 299 >UniRef50_C9XMT1 Putative uncharacterized protein n=4 Tax=Clostridium difficile RepID=C9XMT1_CLODC Length = 158 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 51/123 (41%), Gaps = 10/123 (8%) Query: 14 FKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEG 73 ++ ++ I + K L ++LE SFI E + D+++ V ++ G Sbjct: 13 YRRMYSDKESFLSLIQNFTSVSIAKELTLKNIELE-TSFICE-YKGKEVDIIYKVFSKSG 70 Query: 74 VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ--------NHLDAGYKELPLVLPMLFYHG 125 Y+V+E Q++ + + R+ Y + ++ +LP V+P++ Y G Sbjct: 71 KVSHYIVLEFQTEMDTEIVPRLKSYREQIWKSFIMKKSLEEIEDKNFKLPKVIPVVLYSG 130 Query: 126 CRS 128 Sbjct: 131 PER 133 >UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococcaceae RepID=A5D5U3_PELTS Length = 292 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 83/223 (37%), Gaps = 18/223 (8%) Query: 59 QYYSDLLWSVKTQEGVGYIYV-VIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 Q SD L V+ GY Y+ ++E Q++P+ MA R++ Y+ +H P++ Sbjct: 44 QRTSDALVKVR---EDGYEYLMLVEFQARPDRKMARRLLEYTA---MHHCRHEKPVYPVI 97 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + + Y + + + + + +++ + E++ + LL Sbjct: 98 INLTGGSLQDGWYTF----------ECLDLTVVNFNYRQINLQDIAGRELLYRGPVGLLP 147 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 L ++D+ L + + L+ + + + I + E + Sbjct: 148 LAPLMSHDEPPEKVLDKCARRLQSEVEAEDDRALLYLALAALASLKYPKDLILRVLEVSR 207 Query: 238 QEKEKLMT-IADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 E L I + +G ++GK+E + EML ++ Sbjct: 208 LENIPLFDGIREEWEAKGRIEGKNEGKIEGMVEMLFDLVEARF 250 >UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G1H3_9SPHI Length = 294 Score = 65.4 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 61/305 (20%), Positives = 114/305 (37%), Gaps = 37/305 (12%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY--------- 61 D ++K L DF+ L + DL+ +F+D++L Q + Sbjct: 6 DYLWKGVLED--VFDDFLR-FLYPDADSVFDLSR----GITFLDKELEQLFPPEGNEFAP 58 Query: 62 --SDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 D L V T +G ++ + +E Q + A RM Y + + +K + Sbjct: 59 KVVDKLAQVYTHDGMEEWVLIHVEVQGTCRKDFASRMFTYYYRILDKY----HKRITA-F 113 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARK--IYSSAFPLVDITVVPDDEIMQHRKMALL 176 +L S P + +EF +I + Y A D + D+ A Sbjct: 114 AILT---EASKKPRPNVYEEEFMGTSIQYRFNTYKIAEQDTDRLLASDNPFALVVLTAKA 170 Query: 177 ELIQKHIRQRD-----LLGLVDQIVSLLVTGNTNDRQLKALFNYV--LQTGDAQRFRAFI 229 + K++ +D LL Q+ L+ N + +++ L N++ D Sbjct: 171 AFVGKNLNDKDESDKALLQTKIQLARELLERNMSKEKIRGLMNFLRYYVRFDNSEVNTIF 230 Query: 230 GEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 + E+ E+ M I + L +GK E + +A+EM G+ E ++ T+LS Sbjct: 231 EQEVEKLT-ERSHTMGIEELLLNRAKKEGKRESLISVAREMKKDGIPVEQIVKFTKLSIK 289 Query: 290 DLIAQ 294 ++ Sbjct: 290 EIEKL 294 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 100/284 (35%), Gaps = 23/284 (8%) Query: 10 HDAVFKSFLRH-PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD FK+ P D+ K+ D+ L+ EP D D+ Sbjct: 4 HDHNFKNLFLDFPKETLDWFFPQAGQSWGKVLDVEFLRQEPKKHNLSD-SSLELDMPILF 62 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 + ++ ++E Q + ++++RY+ M+ H DA LV+P + + + Sbjct: 63 NFENQQLLLW-LVEFQEDKSKFSIYKLLRYTTDLMETHPDA------LVIPTVLFTDRKK 115 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 L L + L D+ D + + L + H ++ D Sbjct: 116 WSKAVLQQLHAQLHDRMFLHFEYVFHKLFDLNAR--DYYNVDNPVVKILLPKMHYKKEDR 173 Query: 189 LGLVDQIVS---LLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 + ++ Q + LV+ D+ + + Y + D ++ + Q KE M Sbjct: 174 IEVIRQAYAGLFQLVSSGLFDKYVDFIDTY-AEIEDQEQLNLY-----NEIVQHKETAML 227 Query: 246 ---IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 I +R +EG + + + + ++ G+ + + L Sbjct: 228 AQYIRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDL 271 >UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacteroidales RepID=A6LFA9_PARD8 Length = 305 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 43/304 (14%), Positives = 99/304 (32%), Gaps = 28/304 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + +D + L L + L++ N + E + +++ + Sbjct: 10 DFGFKHIFG-REMDKDILIEFLNDLLEGEYTIMDLRIMNNERLPETEQGRK--VIFDIHC 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYS-IAAMQNHLDAGYK-ELPLVLPMLFYH---- 124 + G ++IE Q++ + R + Y + ++ + + EL V + F + Sbjct: 67 ETDKGE-RIIIEMQNREQPHFKDRALYYLSHSVVEQGIKGTWDYELAAVYGVFFLNFTLD 125 Query: 125 ---GCRSPYPYSLCWLDEFAEPAIARKIYSSAF--PLVDITVVPDDEIMQHRKMALLELI 179 G D ++++ F +++ +E + Sbjct: 126 EENGPDKNGKEGKFRRDIILADRENGQVFNPKFRQIYIELPRFNKEEEECETDFERWIYV 185 Query: 180 QKHIRQRDLL---------GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 KH+ D + +++I S+ Q +A + D F Sbjct: 186 LKHMDTLDRMPFKARKAIFERLERIGSMANLTPKQRAQYEAEWK---MYNDYYNTLDFAV 242 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 E +E + + +EG +G + A+ M G+ ++ T LS ++ Sbjct: 243 E-KGMKKGMEEGMEKGLQKGLQEGLQEGLQKGKESTARNMKAEGITPLIIQKCTGLSLEE 301 Query: 291 LIAQ 294 + Sbjct: 302 IERL 305 >UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LJP2_9FIRM Length = 326 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 86/240 (35%), Gaps = 16/240 (6%) Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMM-----RYSI--AAMQNHLDAGYKELPLVL 118 ++ K G I V +++Q+ + M R+M Y + ++ ++L V+ Sbjct: 78 FNKKIVAPDGEIIVALQNQTTVDFGMPLRVMTEDALEYDVQRRMCKDEKLHKGEKLAPVI 137 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIAR--KIYSSAFPLVDIT-VVPDDEIMQHRKMAL 175 ++FY+G + + D P + K Y + ++ IT D + Sbjct: 138 TIVFYYGAQI-WSGPTDLADMVKIPEEFKWLKKYIRPYAMLLITPENVDAAWFSGGWREV 196 Query: 176 LELIQKHIRQRDLLGLVDQIVSLLVTGNTN-DRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 E++Q+ ++++ + + S+ + +R + AL ++ +R Sbjct: 197 FEILQRRNDEKEMQRYLQKKRSVYEKLPEDTNRLIFALTGHLDYYNALKRKGERAVMCKA 256 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV----MMVTRLSPDD 290 K + + +G QG + +E + G E + LS ++ Sbjct: 257 FEDHYKSGVEEGKNIGIHQGISQGLGRGIGAMIRENQEEGKTTESIIDKLQKYFSLSREE 316 >UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E7F Length = 324 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 52/293 (17%), Positives = 103/293 (35%), Gaps = 26/293 (8%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 DA+F+ + + L + LE +++ +DL + + Sbjct: 28 DALFRMIFNDKEALLSLYNAVGNTSYTDASQLQIVTLENAVYMNIK-----NDLAFLLNM 82 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL------DAGYKELPLVLPMLFYH 124 + + EHQS M R + Y + L + +LP ++F++ Sbjct: 83 ELN------LYEHQSTWNPNMPLRDLFYVSREYEMLLANQSIYSSSLLKLPAPRFVVFFN 136 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 G + L + E + +++I +DE+M ++ L E R Sbjct: 137 GSYDMGEQCVLKLSDAYEKKVEDPDLELKVTVLNINAGWNDELMNTCRL-LKEYSLYVAR 195 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLK-ALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 R ++ ++ D +K + L A+ I E E +EKE L Sbjct: 196 VRAYAKEMELAEAV---SRAVDECIKEGILRDFLMKYRAEAISVSIFEYDE--EREKELL 250 Query: 244 -MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV-MMVTRLSPDDLIAQ 294 T + R+EG QG+ E + +E + +G+ + S +D ++ Sbjct: 251 RKTEYEFGRQEGLSQGREEGLSQGIKEGMAQGVSAMIRHCRKAGASREDTLSI 303 >UniRef50_B3CQQ1 Putative transposase n=3 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CQQ1_ORITI Length = 153 Score = 64.2 bits (155), Expect = 5e-09, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 57/150 (38%), Gaps = 31/150 (20%) Query: 175 LLELIQKHIRQRDLLGLVDQIV-----SLLVTGNTNDRQLKALFNYV---LQTGDAQRFR 226 +LE + KHI QRD+L L ++ + L++ L++ Y L Sbjct: 1 MLEYMLKHIHQRDMLKLWEEFLIKFKHVLILDKEKGYIYLRSFLWYTDTKLLESQQPELE 60 Query: 227 AFIGEIAERAPQEKEKLM-TIADRLREEGAMQGKHEEALR-------------------- 265 + + +EK +M TIA + +EG G+ + + Sbjct: 61 QVLA--KYLSEEEKSNIMRTIAAKYIDEGIEIGETKGIAKGIAKGIAEGIEIGEVKAKQG 118 Query: 266 IAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +A+ +L G E + T LS +++I Sbjct: 119 LARNLLKAGFSVEFISENTGLSKEEVINLK 148 >UniRef50_Q1NK38 Putative uncharacterized protein n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NK38_9DELT Length = 115 Score = 63.9 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 1/58 (1%) Query: 1 MTIS-TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 M + + + HD +K HP D + + D TTL+ SF+ +DL Sbjct: 1 MAMKPDSKSDHDNSYKLLFSHPRMVEDLLRGFVREDWISEVDFTTLETVSGSFVSDDL 58 >UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C371D2 Length = 317 Score = 63.9 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 89/298 (29%), Gaps = 43/298 (14%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFI----DEDLR----QYYS 62 DAV K +++ + D L R++ LK + I ++ R Q Y Sbjct: 5 DAVTKDYMQDSEHFADAF-NFLLYGGRQVIKPEQLKPLDTTSIALPYGDESRFVPIQKYR 63 Query: 63 DLLWSVKTQEGVGYIYVV--IEHQSKPEELMAFRMMRYSIA--------AMQNHLDAGY- 111 D+L V E Y++ IE+QS M R M Y + H + Sbjct: 64 DVLKMVTAMEDENATYLILGIENQSDIHYAMPIRNMLYDAIQYVNQADTIAKEHRKSKKM 123 Query: 112 --------------KELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV 157 + ++ + Y G A I + + + L+ Sbjct: 124 PETRAEYLSGFYKTDRILPIITLTLYFGADEWDAPRDLHSMLTANEDILKFVDNYHLHLI 183 Query: 158 DITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVL 217 + D++ + L L K+++ + IV D +++ Sbjct: 184 APAEIEDEDFA--KFHTELSLALKYVKYSKDKKKLRDIV-------NEDTAFRSVSRKTA 234 Query: 218 QTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 + E E + I EG +G E +R ++ G+ Sbjct: 235 DMVNVVTSSNLHYNDGEERVDMCEAIEEIRKDALAEGKAEGIEEGIIRTLIGLVKDGI 292 >UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Treponema vincentii ATCC 35580 RepID=C8PLW8_9SPIO Length = 264 Score = 63.9 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 56/295 (18%), Positives = 98/295 (33%), Gaps = 54/295 (18%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F + H R F+++ + K+ L++ + + + + D+L VK Sbjct: 14 DFMFCKVMEHESLCRPFLEMLFSTQIEKITYLSSQNIITTN---SEAKTVRLDVL--VKD 68 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPY 130 G Y IE Q E + RM Y LD GY ++ Sbjct: 69 DIGTSYD---IEMQVGNEYNIPKRMRYYQAVLDVAFLDKGYS-------------YKALN 112 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 + ++ F R +Y+ + D I+ H G Sbjct: 113 NSVIIFVCLFDPIGNDRAVYTFENI-----CIEDKTILLHD------------------G 149 Query: 191 LVDQIVSLLVTGNTNDRQLKALFNYV----LQTGDAQRFRAFIGEIAERAPQEKE----- 241 I++ T++++L+ YV T R I + + +E Sbjct: 150 TKKIILNAKAFKKTDNQELRGFLQYVTTGKATTAYTGRIEQMIQTVKQNELARREYHILP 209 Query: 242 -KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 LM D G +G ++AL A+ +L GL E + T LS ++ A Sbjct: 210 AALMDAMDEGEARGLAKGSRQKALETAKNLLHFGLSVENIAQATGLSQAEVEALK 264 >UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CSV6_9CLOT Length = 317 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 47/301 (15%), Positives = 97/301 (32%), Gaps = 46/301 (15%) Query: 3 ISTTSTPH-DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 ++ + + D +F+ D + + L L+ ++ Sbjct: 1 MTKVNKKYKDRLFRLVFGDRRRLLDLYNALNGSHYEDPDALEITTLDDAVYLSMK----- 55 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--------KE 113 +DL + V GV +Y EHQS M R Y + ++ + Sbjct: 56 NDLSFLV---NGVLNLY---EHQSTYNPNMPVRGFFYLADVYRKYVVEHKLNLYGSRLAK 109 Query: 114 LPLVLPMLFYHGCR-SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK 172 LP ++FY+G + P L D F A + +M + Sbjct: 110 LPSPKYLVFYNGRKEEPDRKILRLSDAFQGGRNAE------------PCLELCAVMLNIN 157 Query: 173 MALLELIQKHIRQ-RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF-----R 226 + +++ + R ++ VD++ ++ + + ++ G + F Sbjct: 158 LGRNQVLMERCRTLKEYAQFVDRVRRMIAETGALESAVDCAVEDCIRDGILENFLSSHRA 217 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL--DRELVMMVT 284 + I +++ M REE +G+ E E L GL RE ++ + Sbjct: 218 EVLDVILTDYNEQEYIAME-----REEAWEEGRAEGLTEGLSEGLSEGLSVSREAILDLL 272 Query: 285 R 285 Sbjct: 273 G 273 >UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVT6_PEDHD Length = 317 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 48/307 (15%), Positives = 112/307 (36%), Gaps = 38/307 (12%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTT-LKLEPNSFI------DEDLRQYYSD 63 D K DF+ + + ++ D ++ N + +D Sbjct: 26 DEFLKGAFED--NFPDFLR-FVFSDADEILDFNREIEFLNNELFTIIPDRERKGGGRRAD 82 Query: 64 LLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 LL + ++G ++ + +E + + R+ Y+ + + V + Sbjct: 83 LLAKLYLKDGTEKWVLLNVEIEGGNDRKFGQRVFEYNYRIRDKYKVS-------VASIAV 135 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL-IQK 181 + G ++ +LDE ++ K +A+ + D D+ + +L+ L QK Sbjct: 136 FTGKKTQL-RPTEYLDELLGTVLSFK--YTAYHVFD--HQEDELLKSDNPFSLIALACQK 190 Query: 182 H-----IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV--LQTGDAQRFRAFIGEIAE 234 I +L IV L+ + +++ + ++ +++ + E Sbjct: 191 ALLEGKIPDEELADERLVIVKALLRHGYDRQRIISFILFLKNFIFIESEEINRKFDQQIE 250 Query: 235 RAPQEKEKL--MTIADRLRE-----EGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 ++K + + + + EG ++G+ EEAL IA+E+ GL E + T+L Sbjct: 251 ELTKDKNPMGVIDVFKKWERQEAKIEGKLEGRREEALEIARELKKEGLTIEFIAKTTKLP 310 Query: 288 PDDLIAQ 294 ++ Sbjct: 311 IAEIEKL 317 >UniRef50_UPI000190BD13 hypothetical protein SentesTyph_06309 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190BD13 Length = 105 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 27/105 (25%), Positives = 51/105 (48%), Gaps = 17/105 (16%) Query: 209 LKALFNYVLQTG-DAQRFRAFIGEIAERAPQEKEKLM-TIADRLREE------------- 253 ++A+ Y++ G ++ F+ E+A P+ KE +M TIA +L+EE Sbjct: 1 MEAVLCYIIYNGMTSESITPFLYELAGEIPEYKELIMGTIAQQLKEEGIQQGIQQSIQQE 60 Query: 254 --GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 +++ + + L A +LD G+ E+V+ T L+ + L H Sbjct: 61 RQASLEREQKTLLETAYALLDNGVSLEVVIKSTGLNRETLEQPRH 105 >UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfitobacterium hafniense RepID=Q24MW9_DESHY Length = 295 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 95/301 (31%), Gaps = 36/301 (11%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS--DLL 65 D +FK + D F++ L +LT + L E L+ S D+L Sbjct: 12 DYLFKYIFGRQENKDILLSFLNAVLSPAGED--ELTDITLSDRELDPEHLKDKMSRLDIL 69 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +G + IE Q E+ + R + Y Q+ L +G + Y Sbjct: 70 GVA--NDGSL---INIEVQIASEKNIDKRTLYYWAKIYQSQLQSG----------MLYKD 114 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 ++ F A S + + D + ++ + + R Sbjct: 115 LARTVTVNV-LNFSFLPDAQRYHSMFSLYEAHSGLRLNRDLEIHFLELEKWKALSTKPRT 173 Query: 186 RDLLGLV--------DQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE--IAER 235 R L + ++ + ++ + L + D +R+ + E I + Sbjct: 174 R-LDKWLMYLSNTDPKELEEIAMSEPAIGKALT--VEEIFLKNDKERYLYEMREKGIRDH 230 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 ++ +G QG IA ML +GL ++ +T L + + Sbjct: 231 LSAMDNAKTEGIEQGLAQGIAQGIERGKTEIALSMLKKGLSLNMIAEITDLPIEQIEEIR 290 Query: 296 H 296 H Sbjct: 291 H 291 >UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BFV9_9PORP Length = 293 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 41/296 (13%), Positives = 97/296 (32%), Gaps = 24/296 (8%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + +++ + L +T L E + ++ +K Sbjct: 10 DRGFKHLFGQ-EDSKELLVDLLNGLFEGERVITELSFLNVEMPAESTDSRAA--VFDLKC 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFYH--- 124 ++ G I++ +E Q+ P+ R + Y + + G EL V + + Sbjct: 67 KDKEGRIFI-VEVQNAPQTYFYERGLYYLCRIISDQDRRGNDWKFELYPVYGIFLLNFKS 125 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 G + D ++ + +++ +E + K++ Sbjct: 126 GKTDKVRTDIVLADRETGKQMS-DTMRQIY--LEMPFFNKEEAECETSLDYWLYTLKYME 182 Query: 185 QRDLL------GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 + + L L +++ L N N ++ Y + + + E+ Sbjct: 183 KLETLPFKGQKQLFEKLERLAKIVNMNKKER---MEYEESLKIYRDNQGVLDYAIEK--G 237 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 E + E+G +G + +A +M +G+D + VT L+ + + Sbjct: 238 YMEGVEKGLKEGIEKGLEKGMEKGIYLVAAKMKMQGIDFATITSVTGLNAETIATL 293 >UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0J0_9FIRM Length = 316 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 54/312 (17%), Positives = 94/312 (30%), Gaps = 57/312 (18%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY--------YS 62 DA+ K +L + + D +L ++ L S I L + + Sbjct: 5 DALTKEYLSNNEIFADVF-NYLIYDGQQRILPENLIERDTSEITLPLGKRGELATIQKFR 63 Query: 63 DLLWSVKTQEGVGYIYVVI--EHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE------- 113 D+L +E +YV+ E+QS M R M Y ++ K+ Sbjct: 64 DILKGCIAKEYKNTLYVLFGVENQSHIHYAMPVRNMLYDAINYSAQVNEKTKKYRKIRKQ 123 Query: 114 --------------------LPLVLPMLFYHGCRSP-YPYSLCWLDEFAEPAIARKIYSS 152 L V+ + Y G SL + + ++ + Sbjct: 124 NPNFKETTEEFLSGWHPDDRLVPVITVTIYFGNDGWDAAKSLQEMFSETDESLKEFLPDY 183 Query: 153 AFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKAL 212 L+ + + H + L I K I + + ++ +D AL Sbjct: 184 KLHLISCNNISNFT-KFHTEFGRLMHILKVISDEEQMDIL-----------LSDPGYSAL 231 Query: 213 FNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRL-REEGAMQGKHEEALRIAQEML 271 AQ F G +E M A +E G +G +E Q M Sbjct: 232 -----SVTAAQIINTFTGLHFSIPEKEDTINMRNAWTDHKESGRREGFNEATTSYTQRMY 286 Query: 272 DRGLDRELVMMV 283 G+ E++ V Sbjct: 287 KAGIPLEVIAEV 298 >UniRef50_A7BN25 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. SS RepID=A7BN25_9GAMM Length = 219 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 61/177 (34%), Gaps = 12/177 (6%) Query: 109 AGYKELPLVLPMLFYHGCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI 167 +LP V P++ Y+G ++ + L E + + + L+D D ++ Sbjct: 2 KKKIKLPPVCPVVIYNGNKAWNAAQEISELIEEVPGGLEKYRPHLRYFLIDEAKFADADL 61 Query: 168 MQHRKMALLELIQKHIRQRD----LLGLVDQIVSLLVTGNTNDRQLK-------ALFNYV 216 + + ++ R D L + Q+++LLV + ++ L + Sbjct: 62 APLHNLVAAIIRLENTRSFDDEKALAEAISQVLNLLVDWLKDSEFIQLRRDIVTWLRRVL 121 Query: 217 LQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 L + E+ E +E + ++G +GK + + L Sbjct: 122 LPKNLPDVEIPEVIELQEMNAMLRENMQLWYQTAEKKGEARGKAQGIAQTLLYFLTE 178 >UniRef50_B3CVG1 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi str. Ikeda RepID=B3CVG1_ORITI Length = 96 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 48/121 (39%), Gaps = 30/121 (24%) Query: 175 LLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 +LE + KHI QRD+L L ++ + G D+ Sbjct: 1 MLEYMLKHIHQRDMLKLWEEFLIKFKHGLILDK--------------------------- 33 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++ + TIA + +EG +G+ E A + + +L G E + T LS ++++ Sbjct: 34 ---EKGNSMRTIAAKYIDEGIAKGRAEAAQELTRNLLKAGFLVEFISETTGLSKEEVVNV 90 Query: 295 S 295 Sbjct: 91 K 91 >UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYF1_9BACE Length = 349 Score = 61.9 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 46/356 (12%), Positives = 113/356 (31%), Gaps = 71/356 (19%) Query: 3 ISTTSTP-HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +S P D FK P +++ + L L +T L +++ Sbjct: 1 MSKYVNPFTDIGFKIIFGQPA-SKNLLITLLNELLAGEHHITELTFLDKEDHADNVSDKG 59 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA------------ 109 +++ + + G Y+++E Q++ R + Y A+ +++ Sbjct: 60 --IIYDLYCRTASGE-YIIVEMQNRWHSNFLDRTLYYVCRAVSRQIESPSSKEVPVPEDP 116 Query: 110 -----------GYKELPLVLPMLFYHGCRSP--------YPYSLCWLDEFAEPAIARKIY 150 LP + + + S + P + + Sbjct: 117 MTAREPLVSYGKQYRLPTIYGIFLTNFKEENLEAKFRTDTVLSDRDTGKIVNPHLRQIYL 176 Query: 151 SSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG--LVDQIVSLLVTGNTNDR- 207 + D++ D + + + L+ + R D L + + + L + ++ Sbjct: 177 QFPYFTKDLS---DCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLAAVADLSEEN 233 Query: 208 ---QLKALFNY----VLQTGDAQRFRAFIGEIAER----------------------APQ 238 KAL Y +++ + ++ + AE Sbjct: 234 RIAYDKALDRYRVNQIVEEDERRKNEEMRRKAAEEGLKEGMKAGLEKGVKKGRLEGIKEG 293 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 KE + E+G +G+ ++ + IA++M + G+ ++++ T L D+ Sbjct: 294 MKEGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSSDIENL 349 >UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UZR7_CLOBO Length = 334 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 50/336 (14%), Positives = 102/336 (30%), Gaps = 48/336 (14%) Query: 1 MTISTTSTPHDAVFKSFLRH-PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 MT+S D + K + ++ D L + N FI + Sbjct: 1 MTVSNEKVKLDEILKFLFSTSKKVLVNLLNGIFEENFSS--DEVELSVSNNEFIMDTFDT 58 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 D+ + V E + +E Q+K + M RM Y + + P Sbjct: 59 LRGDVFFEVLNNEVSNKVTYHLEFQTKNDSTMIIRMFEYGFRKGKEQTGNRDDFKTIYFP 118 Query: 120 ---MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 ++F + P IYS P++ D+E+++++ LL Sbjct: 119 KQKVIF---IERNNNIKEDIKLKIVLPDEQSFIYSV--PVMKYWEYTDNELIENKMYPLL 173 Query: 177 ELIQKHIRQ--------------RDL--------LGLVDQIVSLLVTGNTNDRQLKALF- 213 L ++R+ DL L + ++ L + Sbjct: 174 PLQLFNLRKDLEYARRSNNIDKINDLSHEAKEIALKIANESKKLFDDNEIIGEDFHKMLL 233 Query: 214 -----------NYVLQTGDAQRFRAFIGEIAERAPQE---KEKLMTIADRLREEGAMQGK 259 NY + + + ++ ++ + ++ E+G +G Sbjct: 234 AIQNLIEYLNRNYFNDDRLEEEVSTMTKTLYDPEVEKRGIEKGIEKGIEKGIEKGMEKGI 293 Query: 260 HEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 ++A+ A L G+ E+V T L + + Sbjct: 294 EKKAIEDAIGFLRLGVSEEIVSKGTGLPIEKVRELK 329 >UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q938_9SPIR Length = 326 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 51/309 (16%), Positives = 106/309 (34%), Gaps = 37/309 (11%) Query: 1 MTISTTSTPHDAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTT---LKLEPNSFID 54 +TI+ + +D + H + A +FI+ K + T +++ I Sbjct: 40 ITINNLNRINDYFVRYLFSHDGNENIALNFINAVF-----KDLNFETFNKIEILNPFNIS 94 Query: 55 EDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL 114 E+ + S + T+ G I V+IE QS+ E R + Y + L+ G Sbjct: 95 ENYDEKESIVDIKATTETG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGS--- 148 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA 174 FY G + ++ F + ++ + D H + Sbjct: 149 -------FYDGLKPTVSINIT---NFILTDEDKVHSCYVLKELNNNKILTDHCQLH-FLE 197 Query: 175 LLELIQKHIRQRDLLG-LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR-----AF 228 L + K I + L + + +S + D + N + + + + Sbjct: 198 LPKFNLKDISAIESLDNIHKEFISWIKFFKGEDMSILMKENTIFEEVEKKCLTFVNDSPV 257 Query: 229 IGEIAERAPQE---KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 I + +R + + + +EEG +G E + A+ M +D ++ +T Sbjct: 258 IDKYKKREVDTYFFNKSMELDIKKAKEEGIKEGIKENQILTAKNMKKENIDINIISKITG 317 Query: 286 LSPDDLIAQ 294 LS ++ Sbjct: 318 LSIQEIENL 326 >UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FYK3_ABIDE Length = 365 Score = 61.2 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 52/310 (16%), Positives = 108/310 (34%), Gaps = 37/310 (11%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPN--SFIDEDLRQYYSDLLWSV 68 D + K D DF++ + R++ + + L + +D R Y D Sbjct: 5 DILEKKLFMFNDVFADFLNGII-FNGRQIVEESELFDLSGWSHYKADDSRHRYQDRDVVK 63 Query: 69 KTQEGVGYIYVV-IEHQSKPEELMAFRMMRYSIAAM-----------QNHLD-------- 108 ++ I ++ IE+Q P++ M FR++ Y A+ + HL Sbjct: 64 LWKKKNVVISLIGIENQDVPDKDMVFRVLSYDGASYKTQLAKKDEDKRKHLKDKKNTEIV 123 Query: 109 ----AGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPD 164 K++ V+ + Y+G + + + L+D+ + Sbjct: 124 EIGKEDEKDIFPVITFVVYYGEEEWKYETTLKKRLKIGDGLDEFVSDYKINLIDLKKFTE 183 Query: 165 DEI---MQHRKMALLELIQKHIRQRDLLGL-VDQIVSLLVTGNTNDRQLKALFNYVLQTG 220 D+I + K+ + +++ + L + VS LV T + N +T Sbjct: 184 DDINKFKKDFKLLVNYMVKGSNHDAGSIELNHPEEVSELVLRLTGEELPIPRENDGGKTM 243 Query: 221 DAQRFRAFIGEIAERAP--QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRE 278 + + F +AE+A + + + EG +G E + E L +G+ Sbjct: 244 E-KFFEPMFARMAEKAEARGMAKGMTEGMAKGMTEGMAKGLAEGKAKGMTEGLAKGMTEG 302 Query: 279 LVMMVTRLSP 288 + L+ Sbjct: 303 MAK---GLAE 309 >UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicyclobacillus acidocaldarius RepID=C8WSD0_ALIAD Length = 270 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 49/260 (18%), Positives = 89/260 (34%), Gaps = 35/260 (13%) Query: 43 TTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAA 102 TL+ LR D W + + +E Q + E + R + Y Sbjct: 35 ETLEPFTTELPASTLR---MDRAWRMANGD-----VFHLEFQDRRERTLH-RFLEYDARL 85 Query: 103 MQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV 162 + ++ YH + P L A + F Sbjct: 86 ANQVKTR-------IRTVVLYHAQVASAPQELDI-------GTAIYRVENVFLSALDGDG 131 Query: 163 PDDEIMQHRKMALLELIQK-------HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY 215 DE+ H ++ E + +R D + ++++LL +D + + + + Sbjct: 132 ALDEVEAHLRVGRWEPADRLRLGLALSMRVEDRHQAMARVLNLLPRVP-DDEERELVASA 190 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 VL GD RA E + +E + + +A+ L E+G GK + A IA +L G+ Sbjct: 191 VLAFGD----RALSDEDRRKLRKELKNVFRMAEELYEDGRHDGKQQAAEDIAHRLLAEGV 246 Query: 276 DRELVMMVTRLSPDDLIAQS 295 ++V T L + L Sbjct: 247 PVDVVEKATGLPRERLEQMK 266 >UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Rickettsia RepID=A8GY36_RICB8 Length = 279 Score = 60.4 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 50/297 (16%), Positives = 103/297 (34%), Gaps = 44/297 (14%) Query: 11 DAVFKSFLRHPDTARDFIDI--HLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 D FK F++ LP LR + D LK N + + + S + V Sbjct: 10 DVAFKKLFTDKARLISFLNNIMRLPEELR-IID---LKYISNEQVPDLGQNKRS--IVDV 63 Query: 69 KTQEGVGYIYVVIEHQS-KPEELMAFRMMRYSIAAMQNHLDAGYK--ELPLVLPMLFYHG 125 K + G IY+ +E Q+ + +A R+ Y A + L G + +L V+ ++ G Sbjct: 64 KVTDNSGNIYI-VEMQNGYADAFLA-RVQFYGCVAFSSQLKRGKEYADLAPVVMVIITSG 121 Query: 126 CRS--PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDD----EIMQHRKMALLELI 179 ++ + + ++ ++ V++ + E ++ + ++ Sbjct: 122 FQALPEEKECISYHQTINVGNGKNQLKCLSYVFVELDKFTKEANELETIEDDWLYMMAKF 181 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 K Q + + + F AE Sbjct: 182 DK------------------AKEPPKHTQDEVVL------SAYKTIEQFNWSEAEYDNYI 217 Query: 240 KEKLMTIADRLREEGA-MQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 K L + L ++ +GK E ++ +A+EML E ++ T+LS +++ Sbjct: 218 KAMLAAQTEELNQKSKFKEGKAERSIEMAKEMLQDNEPIEKIIKYTKLSKEEIEKLK 274 >UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G7H9_ABIDE Length = 305 Score = 60.0 bits (144), Expect = 8e-08, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 87/250 (34%), Gaps = 24/250 (9%) Query: 54 DEDLRQYYSDLLWSVKTQEGVGYIYVV-IEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK 112 D L + D+ S +EG + VV IE+Q+K E+LM R++ Y A+ ++ L Sbjct: 51 DGKLHEQERDV--SKYWKEGNTNLLVVGIENQTKAEKLMPARIIGYDGASYRSQLLKSTG 108 Query: 113 ELP-----LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI 167 LP V+ ++ Y G + + + +I +P++++ Sbjct: 109 RLPKNKLTPVVTIVLYFGLTRWNQPKNLKGILDIPTGLEDFVSDYKINVFEIAFLPEEKV 168 Query: 168 -MQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV-LQTGDAQRF 225 L+ +IR+ L + + A+ ++ + +G Sbjct: 169 NKFKSDFRLVAKYFTNIRKNPYY---------LPADENEIKHVDAVLKFLSIMSGSEDII 219 Query: 226 RAFIGEIAERAPQEKEKLMT-----IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 ++ REEG +QG +E L++ +G+ E Sbjct: 220 EKLTANNGSEVKNMTGGPLSQLYYKGVSEGREEGLLQGINETLLKVYLNCRSKGMSVEES 279 Query: 281 MMVTRLSPDD 290 + + + Sbjct: 280 EEIVHFADRE 289 >UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Ruminococcus torques ATCC 27756 RepID=A5KR99_9FIRM Length = 317 Score = 60.0 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 44/310 (14%), Positives = 102/310 (32%), Gaps = 29/310 (9%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARD----FIDIHLPAPLRKLCDLTTLKLEPNSFIDED 56 M ++VF ++A + PL + + +++ ++ Sbjct: 8 MAGKENREIKNSVFVDLFYEDESAEANEIALFNAIHDEPLPEGTKIRRFRVDNTIYM--- 64 Query: 57 LRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 + +D+ + G ++ EHQS E M R + Y A + + + Sbjct: 65 --NFQNDISFDA---GGKVIVFG--EHQSTINENMPLRSLLYIGRAYERLVPPRSRYKKK 117 Query: 117 VLPM------LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ- 169 ++P+ FY+G L + +++I EI++ Sbjct: 118 IVPLPTPEFYTFYNGKEKWEKEKELRLSDAYIVKDGEPSLELKVKVINIRPEEHHEILEK 177 Query: 170 ----HRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKA---LFNYVLQTGDA 222 +E++Q + + I + G D ++ + N +L D Sbjct: 178 CQVLKEYSQFMEIVQNYQISGEEEPYKKAIKECIEKGILADYLMRKGSEVVNMLLDEYDY 237 Query: 223 QRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 + E A R +E + R++G +G+ E + Q+ L++G + Sbjct: 238 ETDIEVQREEA-REQGREEGRKQGREEGRKQGREEGRKAERSTLIQKKLEKGKTISQIAD 296 Query: 283 VTRLSPDDLI 292 + +++ Sbjct: 297 ELEDTEENIA 306 >UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptospira interrogans RepID=Q8F560_LEPIN Length = 278 Score = 60.0 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 46/288 (15%), Positives = 106/288 (36%), Gaps = 24/288 (8%) Query: 14 FKSFL-RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQE 72 FK + PD ++ L + +K+ + S L + ++ Sbjct: 3 FKILFVKEPDLLISILNSVLFTD--GEHTIRNIKILNPELVGSSPNDKRSYLDIRAQDED 60 Query: 73 GVGY-IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLV--LPMLFYHGCR 127 G + + + + HQS + R + Y +++ L+ G Y +L V + ++ + Sbjct: 61 GKIFHVEIQVAHQSSFVK----RSLYYLSGLIRDQLNRGSMYSDLKPVYQINIVDFDLIP 116 Query: 128 SPYPYSLCWLDEFAEPA--IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 S +S E + P + + L E+ + + + + KH + Sbjct: 117 SENFHSKFKFREESNPDIILTDDVEIHFLELCKFVKRDVRELRNN--LEIWLYVLKHTSE 174 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 + +++ L+ + L Y + D Q+ ++ + L Sbjct: 175 LE----EEEMRILVDKTPDLSKAFTILEQY---SNDPQKRNELEAKLKSDRDYAYD-LAA 226 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 + +G +G +E L+ A++ML+ G+ ++++ +T LS DL Sbjct: 227 RFEAGELQGIEKGAEKEKLKSARKMLEEGMRLDVILRITGLSKKDLKD 274 >UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369BC Length = 310 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 49/297 (16%), Positives = 86/297 (28%), Gaps = 57/297 (19%) Query: 11 DAVFKSFLRHPDTARDFI-------DIHLPAPLRKLCDLTT-LKLEPNSFIDEDLRQYYS 62 D K L+ P D L A L + + + S + +++ Sbjct: 5 DFYIKKLLQDPARFADLYNAEIFHGKQILKAELLSPVSTESGIAITNRSGRKQTIQRRR- 63 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRY--------SIAAMQNHLDAG---- 110 D+ G +I E Q + M R + Y + H Sbjct: 64 DIAMKASI--GACFIVAGCEAQGEIHYGMPIRSLTYDALDYTEQLTEIQKEHRKKKDLAK 121 Query: 111 ----------YKELPLVLPMLFYHGCRSPYPYSLCWLDEFAE-------PAIARKIYSSA 153 +L VL ++ Y G + P+ D P + + Sbjct: 122 SPEFLSGITRRDKLQPVLTLVLYCG-KDPWDGPKSLYDMLDLRGPTECIPDLLAALPDYR 180 Query: 154 FPLVDITVVPDDEIMQHRKMALLELI----QKHIRQRDLLGLVDQIVSLLVTGNTNDRQL 209 LVDI + + + + + ++ K + DQI L +D L Sbjct: 181 INLVDIRKIENLSLYKTGLQQVFGMLKYSTDKSKFYNYITSNHDQISML------DDNAL 234 Query: 210 KALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRI 266 A+ G R + +A +E + D L +G ++GK E R Sbjct: 235 TAVM------GLLGENRRLMKYLAAPGREEGYTMCQAIDDLIADGKLEGKREGKRRG 285 >UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BTR0_9GAMM Length = 309 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 58/325 (17%), Positives = 120/325 (36%), Gaps = 49/325 (15%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M T D K+ LR D ++ L A L++ D++ L++ + D + Sbjct: 1 MPTETKLVRFDWALKNILRDKANF-DVLEGFLTALLQE--DISVLEILESESNQSDFAKK 57 Query: 61 YS--DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMM----RYSIAAMQNHLDAGYKEL 114 ++ D+L Q ++IE Q+ E R++ + + ++ L Y+ + Sbjct: 58 FNRVDILVKDSHQRK-----MIIEVQNHRETGYLERILWGTSKLIVETLE--LGEDYRNI 110 Query: 115 PLVLPM---------------LFY-----HGCRSPYPYSLCWL--DEFAEPAIARKIYSS 152 V+ + ++Y HG + P+ L D+ + + I+ Sbjct: 111 SKVISISIVYFDLGLSDDNEYVYYGVANLHGLQHNQPFRFRRLMADKTFKSLQTKDIF-P 169 Query: 153 AFPLVDITVVPDDEIMQHRKMALLELIQKH--IRQRDLLGLVDQIVSLLVTGNTNDRQLK 210 F L+ + D + + + KH IR +++ L N ++ K Sbjct: 170 EFYLLRVEHFQD---IIKTDLDEWIYMLKHSTIRTDFKSKNINKAQEKLTLLQMNPQKRK 226 Query: 211 ALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 Y++ D R + E Q+ R+EG +G ++ + I + Sbjct: 227 DYEKYMV---DMTVERDVLEAAQEEGIQKGR--QEGIQEGRQEGIQKGMEKKTVVIVKNA 281 Query: 271 LDRGLDRELVMMVTRLSPDDLIAQS 295 L +GL+ L+ +T LS +++ Sbjct: 282 LQQGLELTLISSLTGLSIEEIQKIQ 306 >UniRef50_A8YL21 Genome sequencing data, contig C325 n=27 Tax=Cyanobacteria RepID=A8YL21_MICAE Length = 149 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 46/125 (36%), Gaps = 15/125 (12%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLR 58 MT + HD +FK + +FI++ P + D ++ + + Sbjct: 1 MTNNID---HDRLFKELIS--TFFVEFIELFFP-EVMNYLDTESITFLDKEVFTDVTEGE 54 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 ++ SDL+ V+ + + + +E Q + RM Y K + + Sbjct: 55 RHKSDLVAQVRFRGKESFFLIYVEAQESSRKWFNRRMFTYFARF-------HEKFVLPIY 107 Query: 119 PMLFY 123 P++ + Sbjct: 108 PIVIF 112 >UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B1D1_RUMGN Length = 323 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 54/295 (18%), Positives = 101/295 (34%), Gaps = 38/295 (12%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D F+ + + R FI L P ++ D ++L P + + Y L V+ Sbjct: 53 DFCFQELMEDEEVRRGFIGAFLRIPPEEILD---MELLPKKLRKKYKEEKYGILDVRVRL 109 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPY 130 +EG + IE QS + R + Y + + G Y + Sbjct: 110 REGEQ---LNIEMQSIAYDYWQERSLFYLGKMYVDQIHEGED----------YDKLKKCI 156 Query: 131 PYSLCWLDEFAEP----------AIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 + F R +YS F + + +P ++ + LL Q Sbjct: 157 HVGILDFTLFEHERYYSCFHIWEDTIRDMYSDKFEIH-VLELPKLAKYEYPQTELLRWAQ 215 Query: 181 K-HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 R R+ + ++ + + ++ A + R E + Sbjct: 216 FFGARSREEIEVLAEKDEYIHKAYDKLEEISA----------DEEKRLEYEERQKAIRDH 265 Query: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + L + EG +GKHE A+ +A++ML+ L E + + LSP+D+ Sbjct: 266 RHMLASGRREGLREGLREGKHEHAVEMARKMLEDKLPIEKIAEYSGLSPEDVHRL 320 >UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF92_9FIRM Length = 307 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 42/271 (15%), Positives = 89/271 (32%), Gaps = 35/271 (12%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +++ + + + + DL LE ++ +D+ + V Sbjct: 21 DRLWRMIFNNKEDLLQLYNAINHTDYQNPDDLEVNTLEDVLYLSMK-----NDVSFLV-- 73 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--------DAGYKELPLVLPMLF 122 G +Y EH S M R + Y + ++ LP ++F Sbjct: 74 -GGTMNLY---EHLSTFNPNMPLRGVFYFSRLYEGYVADNNLMIYHEKRVRLPKPKYIVF 129 Query: 123 YHGCRS-PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH-RKMALLELIQ 180 Y+G ++ P L D F +++I + E+M+H R++ + Sbjct: 130 YNGTKNQPDSMELRLSDCFENTDNDAPCLECTATMLNINYGHNQELMKHCRRLEEYSIFV 189 Query: 181 KHIR-----QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 + +R + + +++ + + + LK I ++ Sbjct: 190 QCVREYIQSEPSVEDALEKAIDTCINQDVLADFLK---------KHRAEVTNMILTTYDK 240 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRI 266 EK + REEG M+G+ E + Sbjct: 241 DLYEKTLKEDAREEGREEGLMEGRAETRAEL 271 >UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDU3_9FIRM Length = 295 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 81/250 (32%), Gaps = 30/250 (12%) Query: 54 DEDLRQYYSDLLWSVKTQEGVGYIYVV-IEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK 112 D L + D+ + + + G + + E+Q+ + M R++ Y A + L Sbjct: 50 DGRLHEIERDV--AKRWKNGNIRVACIGFENQTASDPDMPLRVIGYDGAEYRAQLLGDND 107 Query: 113 ---ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEP-AIARKIYSSAFPLVDITVVPDDEIM 168 P V ++ Y G P+ L + P + L Sbjct: 108 TGSRYPAV-TLVLYFGHEKPWSGPLSLKERLNVPKEFEPYVNDYKINLF----------- 155 Query: 169 QHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNT-----NDRQLKALFNYVLQTGDAQ 223 ++A L Q + Q D + D V G+ + ++ + + Sbjct: 156 ---QIAYLTREQVELFQSDFKVVADYFVQKRENGDYVPSSQDLTHVQETLQLLSIMTNDH 212 Query: 224 RFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM---LDRGLDRELV 280 RF + + + D++ G +G + R +M + + LD+ + Sbjct: 213 RFEDAYNTSTDDRKGGPRNMCDVLDKVENRGIEKGIVKGESRGENKMALLVKKLLDQNRI 272 Query: 281 MMVTRLSPDD 290 V R S D+ Sbjct: 273 DDVKRASEDE 282 >UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3131 Length = 247 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 61/182 (33%), Gaps = 28/182 (15%) Query: 1 MTISTTSTPH-DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 M T + + D VF+ + + D+ LE ++ Sbjct: 1 MNNETVNRKYKDTVFRLLFKDKSNLLSLFNAVNDTDFSDENDIKITTLENAIYMTSK--- 57 Query: 60 YYSDL--LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA----GYKE 113 +D+ + +K + EHQS M +R + Y + ++ K Sbjct: 58 --NDISCIIDMKLN--------LFEHQSTVNPNMPYRNLEYVTKCFKRYVGNFDVYTGKA 107 Query: 114 LPLVLP--MLFYHGCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLV--DITVVPDDEIM 168 L L P ++FY+G P + D +A +I + ++ +I + + +M Sbjct: 108 LTLPNPKFVVFYNGVNEQPPIRVMRLSDLYAHKD---EIPNLELVVIQYNINNLVNCTLM 164 Query: 169 QH 170 Sbjct: 165 DR 166 >UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B8HL58_CYAP4 Length = 334 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 110/311 (35%), Gaps = 55/311 (17%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR---- 58 ++ + D+ +K L ++ I P + D T P F+D++ + Sbjct: 1 MTQPRSDKDSAWKEIL--RQYFQEAIVFFFPQT-AEQVDWTR----PYEFLDKEFQQIAP 53 Query: 59 -----QYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK 112 + Y+D L V ++G ++ + +E Q+ E A RM Y++ Sbjct: 54 DAETGKRYADQLVKVWLKDGAELWLLIHVEVQAARESEFAQRMFTYNLRIFDRFNH---- 109 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK 172 P + + P S + +F + +++ + L+D + ++ Sbjct: 110 --PAISLAILCDESVRWRPESFSF--DFPDTSLSFRFGRVK--LLDYRERISE--LEQSP 161 Query: 173 MALLELIQKHIR----QRDLLG---LVDQIVSLLVTGNTNDRQLKALFNYV--------- 216 ++ H+R ++D ++ L G +++ LF ++ Sbjct: 162 NPFSIVVMAHLRAQATRKDDQQRKFWKLTLIRRLYEGGYGRQEVINLFRFIDWVMILPEG 221 Query: 217 --------LQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQ 268 L+ + +R FI + E + L R+EG +G+ E A+ Sbjct: 222 LKEEFWQELKIYEEERRMPFITSVEEI--GFERGLEQGRQEGRQEGRQEGRQEGRQEEAR 279 Query: 269 EMLDRGLDREL 279 ++ R L R+ Sbjct: 280 ALILRPLTRKF 290 >UniRef50_C3QLI8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. D1 RepID=C3QLI8_9BACE Length = 233 Score = 56.9 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 30/65 (46%) Query: 230 GEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 G + KE + E+G +G+ ++ + IA++M + G+ ++++ T L Sbjct: 169 GRLEGIKEGMKEGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSS 228 Query: 290 DLIAQ 294 D+ Sbjct: 229 DIENL 233 >UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostridiales RepID=A6BF26_9FIRM Length = 366 Score = 56.9 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 44/275 (16%), Positives = 91/275 (33%), Gaps = 19/275 (6%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F+ + + L + LE ++ +DL + Sbjct: 58 DTIFRMLYHDKENLLSLYNAVNGREYTDPEKLQVVTLENAIYMGMK-----NDLAF---I 109 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV--LP----MLFYH 124 + Y+Y EHQS + R + Y Q + ++ +P ++FY+ Sbjct: 110 MDMNLYLY---EHQSTYNPNIPLRNLFYIADEYQRLVVRKSLYSTVIQKIPTPRFLVFYN 166 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 G + S L E ++++ ++M+H + L E Q R Sbjct: 167 GTKEVEDRSEFRLSSAYENPTENPDLELRVTMLNVNDGHSSDLMEHCRT-LKEYAQYVAR 225 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 R D + VT ++ + + L + R I E + +EK+ Sbjct: 226 VRKYAAKQDVSLEEAVTRAVDECIEEGILAEFLLKNKTEVIRVSIYEYDKEF-EEKKLRK 284 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + R++G G+ + Q+ ++ G + Sbjct: 285 AEYEAGRQDGIEIGRQDGIEIGRQDGIEIGRQDGI 319 >UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Treponema RepID=Q8GBS6_TREMA Length = 262 Score = 56.9 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 54/298 (18%), Positives = 101/298 (33%), Gaps = 62/298 (20%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPA---PLRKLCDLTTLKLEP-NSFIDEDLRQYYSDLLW 66 D +F +++ + + F+++ L + + +T+ E F+ D+L Sbjct: 13 DFMFCQVMKNKNLCKTFLEMLLADKIGNITHIASQSTVAPESEAKFV-------RLDIL- 64 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 Q+ Y IE Q E +A RM Y A + LD G +Y Sbjct: 65 ---VQDEKNNFYD-IEMQVVNEHNVAKRMRYYQSALDVSFLDKGE----------YYTNL 110 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 + Y +C D I + F + + D+ IR R Sbjct: 111 KDSYIIFVCLFDF-----IGKNKAVYFFENI---CLEDEP----------------IRLR 146 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYV----LQTGDAQRFRAFIGEIAERAPQEKE- 241 D + I+++ N D+ L Y+ + T ++R I I + +E Sbjct: 147 DGTKKI--IINVDAFKNIKDKALSGFLEYIKTGCITTKFSERIEKMIRTIKQNEQARQEY 204 Query: 242 -----KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +M + R +G G ++ + A + GL + + T LS ++ Sbjct: 205 RFISAVVMDAKEEGRSQGFTDGVNQTKRKTAAALKAMGLAKSKIAKATGLSLAEIEKL 262 >UniRef50_A7M2M6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M2M6_BACOV Length = 182 Score = 56.9 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 59/145 (40%), Gaps = 12/145 (8%) Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLG--LVDQIVSLLVTGNTNDR----QLKALFNY 215 + D + + + L+ + R D L + + + L+ + ++ KAL Y Sbjct: 38 LSDCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLVAVADLSEENRIAYDKALDRY 97 Query: 216 ----VLQTGDAQRFRAFIGEIAER--APQEKEKLMTIADRLREEGAMQGKHEEALRIAQE 269 +++ + ++ + AE KE + E+G +G+ ++ + IA++ Sbjct: 98 RVNQIVEEDERRKNEEMRRKAAEEGMKEGLKEGIREGIKEGMEKGMEKGEQKKQIEIARK 157 Query: 270 MLDRGLDRELVMMVTRLSPDDLIAQ 294 M + G+ + ++ T L D+ Sbjct: 158 MREDGISIDTIIKYTGLQSSDIENL 182 >UniRef50_B7CCB3 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CCB3_9FIRM Length = 291 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 87/238 (36%), Gaps = 29/238 (12%) Query: 56 DLRQYYSDLLWSVKTQEGVGYIYVV-IEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL 114 L + D+ + Q+G + + +E+Q+ E+ M R+ Y A+ + L + E Sbjct: 53 KLHEQERDV--AKYWQDGNALVAICGLENQTVEEKYMPLRVFSYDGASYRRQLLSENDEN 110 Query: 115 P--LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI---MQ 169 P V+ ++ + G + + + + +I + D+ + Sbjct: 111 PIVPVVSLVLHFGMKKWSSPHNLKGVIDIPKELEPYVNDYKANIFNIAFLDDETVQMFQS 170 Query: 170 HRKMALLELIQK-----HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR 224 ++ +QK ++ + + VD+++ LL +DR Y ++ + ++ Sbjct: 171 DFRIVADFFVQKRKNKDYVPDKHKIKHVDEMLKLLQVLTGDDR-------YNVKFSETEK 223 Query: 225 FRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 + + + ++ +EE + + + + + G+ E ++ Sbjct: 224 KEDI---------KMCDVMERAVNKGKEEVREEERINSIKVLISSLEEFGISSEAIIE 272 >UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=B8FTH9_DESHD Length = 325 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 46/318 (14%), Positives = 103/318 (32%), Gaps = 44/318 (13%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D FK + F++ L P + + T+ + + ++ D+ Sbjct: 10 DYAFKLIFGKEGNEAILIAFLNAALKLPQERRIEEITIINPELNKEYPEDKKSILDV--R 67 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELP--LVLPMLFY 123 T +G + + IE Q + M R + Y + G YKEL + + ++ + Sbjct: 68 AITSQG---MQINIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDF 124 Query: 124 HGCRSPYPYSLCWLDEFAEPA------IARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + + Y + E + L + + ++ Sbjct: 125 NYLKQTSSYHNVFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRKR--EISLWENELVRWL 182 Query: 178 LIQKHIRQRDLLGLVDQIV-------SLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 L+ + +++L ++++I + + Y + +A I Sbjct: 183 LLLEGADNQEILQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIR 242 Query: 231 EIAERAPQ--EKEKLMTIAD---------------RLREEGAMQGKHEEALRIAQEMLDR 273 E R + E+ IA+ R EG +G+ E +A+++L Sbjct: 243 EAELRLQEALEEGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEGRAEVAKKLLVL 302 Query: 274 GLDRELVMMVTRLSPDDL 291 G + + T LS +++ Sbjct: 303 GFEITKIAEATGLSEEEI 320 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 59/335 (17%), Positives = 113/335 (33%), Gaps = 62/335 (18%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTT----LKLEPNSFIDEDLR-QYYSDL 64 +D ++K + + +FI P L + D L+ E + I + + + +D Sbjct: 19 YDGLWKKIIS--ELFEEFILFFAP-DLYETIDFGKGIVFLEQELHKVIIKHKKGKRIADK 75 Query: 65 LWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 + V + G Y+++ IE Q K + + RM Y + Y + +L Sbjct: 76 IVKVSLKNGEEKYVFIHIEIQEKQDPDFSKRMFTYFYRLFDRFQENIYS-----IAILT- 129 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 S S + F + + + F DI + + +A+L I H+ Sbjct: 130 --DLSKSNNSEPFQYSFYGTELTYRFNTYKFNEADIPSL--KKSTNPFAIAVLAGIYLHL 185 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKA--------LFNYVLQ---------------TG 220 +++ + LL +++ L + +Y+L Sbjct: 186 TEKNYQKRYEVKKKLLKEFILSNQNLSSNYAEALCYFIDYLLYLPGELTKQLTKELFIHI 245 Query: 221 DAQRFRAFIGEIAERAPQEKEKLMTIADRLREE--------------------GAMQGKH 260 + + E + AP E L T+ + E G +GK Sbjct: 246 EKEANHMLYSEELKEAPTFAEYLKTVKEEGIEIGIEKGIEKGIEKGKEEGIEIGIEKGKM 305 Query: 261 EEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 EE +A E+L G E V + +LS D++ Sbjct: 306 EEKRNLAAELLREGFSVEKVAKMVKLSIDEVKKIK 340 >UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P8S5_9BACT Length = 303 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 37/291 (12%), Positives = 94/291 (32%), Gaps = 18/291 (6%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK +D + L + + + + + + ++ V Sbjct: 16 DFGFKRIFGT-AMNKDLLICFLNSLFNGRQVVKDVSYLNPEHVGDVYTDRRA--IFDVYC 72 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFYH--- 124 + G ++ +E Q+ + R + YS ++ G + +L + + + Sbjct: 73 EGENGEKFI-VEMQNAYQTYFKDRALFYSTFPIREQAPKGNEWDFKLNNIYTVALLNFNM 131 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSS-AFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 + + + + A + Y + V+I+ K++ Sbjct: 132 NEDAFDKEKIRHHVQLCDTATHKVFYDKLEYIYVEISKFNKTLEELDTLYEKWLYALKNL 191 Query: 184 R--QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 + L D++ L + +R + +E Sbjct: 192 YKLTQRPKELCDKVFDRLFEEAEIAKFTPQEMRE--YETSKMAYRDIKNSVDTAK---RE 246 Query: 242 KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 + + E+G +G + +L IA++ML +G+D +M +T L+ +++ Sbjct: 247 GIAEGIEIGMEKGRAEGMNLRSLEIARKMLAKGMDEASIMDMTGLTSEEIK 297 >UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255_PLEBO Length = 295 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 58/306 (18%), Positives = 112/306 (36%), Gaps = 45/306 (14%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ---- 59 S+ +T +D +K+F+ R+F+ P + D + F+D++L++ Sbjct: 5 SSENTDYDNPWKTFIE--LYFREFLAFFFPT-IEADVDWSKPVR----FLDKELQKIVRD 57 Query: 60 -----YYSDLLWSV-KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE 113 Y+D L V + + + IE QS+ E RM Y+ + Sbjct: 58 AEIPKRYADKLVEVHRLRGERTLVICHIEVQSQEERDFVARMYSYNYRLRDRY------N 111 Query: 114 LPLV-LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV-PDDEIMQHR 171 P+V L +L G P + DE A FP+V ++ ++ Sbjct: 112 CPVVSLAIL---GDDRPNWRPSRFYDEL--WGCATH---FEFPIVKLSDYQSQWTELEAI 163 Query: 172 KMALLELIQKHIRQRDLLG-------LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR 224 + + H++ ++ + ++L +++ + L N++ + Sbjct: 164 QNPFAVVAMAHLKTKETHNQPLERKRWRYHLTTMLYDRGYSEQDILELHNFLDWLMNLPE 223 Query: 225 FRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVT 284 + AE E+ + M L ++ K IA ML R LD EL+ VT Sbjct: 224 ELERQLQ-AELETFEEARRMKYVSSLERRAKLEEKQ----AIALNMLRRNLDMELIAEVT 278 Query: 285 RLSPDD 290 L+ + Sbjct: 279 GLTIAE 284 >UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CRG1_9FIRM Length = 356 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 86/273 (31%), Gaps = 46/273 (16%) Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL- 116 ++ Y D++ + Q+ + + +E Q+ ++M Y + +KE P Sbjct: 89 KKQYRDII--MNWQDQALLMLLAVESQTAIHYAAPLKVMLYDSMEYAEQVRVKWKERPPR 146 Query: 117 ------------------VLPMLFYHGCRSPYPYSLCWLDEFAE-------PAIARKIYS 151 V+ ++FY+G + L F + + + + Sbjct: 147 LSSAEFLSRFQKNDKLIPVITLIFYYGTEE-WDGPLELHQMFDLGTEKKHAELMKKYLPN 205 Query: 152 SAFPLVDITVVPDDEIMQHRKMALLELIQKH----------IRQRDLLGLVDQIVSLLVT 201 LVD+ + + E Q + ++Q +D +D + Sbjct: 206 YHINLVDVRRLKNLESFQSDLQIIFGMLQYSQDKYALRTYVANHKDYFQKLDLETYHALG 265 Query: 202 GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 N RQL + + + + + + + ++ R G +G+ Sbjct: 266 AFLNSRQL----MEINVEKNEREELDMCKALEDI---YNDGVQDGMEQGRRSGIAEGEAS 318 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +A +M G + + V R S D + Sbjct: 319 HKKEVAFQMQKLGYSLDAIAAVLRESVDGISQI 351 >UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z376_9FIRM Length = 316 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 47/302 (15%), Positives = 95/302 (31%), Gaps = 53/302 (17%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + + D VF+ + + ++++ + L++ Sbjct: 1 MEGSKKHKDRVFRKLFGYEKNKGNLLELYNALNDSNYTNPDDLEI-----------NTLD 49 Query: 63 DLLW-------SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-------- 107 D+ + S + IY EHQS M R RYS +++ Sbjct: 50 DVFYMNMKNDVSC-IIDWNMAIY---EHQSTWSYNMPLRGYRYSAELYNDYIVRNNLDVF 105 Query: 108 DAGYKELPLVLPMLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDE 166 ++P +FY+G + P L D F P + + +++I ++E Sbjct: 106 RRKLIKIPTPQYYVFYNGNEKRPDREVLKLSDAFMVPCKDGE-FEWTATVLNINAGHNEE 164 Query: 167 IMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 +M + R+ +V +I L +K +Y L D + Sbjct: 165 LMSKCSI-----------LREYAIMVSKIKEFLAESLELKDAIKKAIDYCL---DNNVLK 210 Query: 227 AFIGEIAERAP-------QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 F+ + E+E + + EEG G + L + + ++L Sbjct: 211 EFLQDHRSEVEDMLWREYNEEETMAHWKEDFYEEGEQHGLEVGRANGEKIKLIKLVCKKL 270 Query: 280 VM 281 V Sbjct: 271 VK 272 >UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKW0_9CYAN Length = 296 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 51/315 (16%), Positives = 115/315 (36%), Gaps = 42/315 (13%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + T D K LR+ + L LRK + ++ ++ ED + Sbjct: 1 MPPTHIRFDWAIKKLLRNKAN-YGVLAGFLSELLRKPITIQSILEGESNQQAEDDKLNRV 59 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVL 118 D+L + G ++IE Q+ E+ RM+ + + + L+ G + + Sbjct: 60 DIL--AENDRGEL---ILIEVQNSTEQDYFHRMLYGTSRLITDFLEKGEPYGNVKKVYSV 114 Query: 119 PML----------FYHGCRSPYPYSLCWLDEFAEPAIARKIYS--------SAFPLVDIT 160 ++ YHG L D+ RK+++ + ++ + Sbjct: 115 NIVYFSLGQGDDYIYHGTLE--FRGLHLDDKLGLSINQRKLFNSQDVYEIFPEYYVIKVN 172 Query: 161 VVPDDEIMQHRKMALLELIQKH-IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT 219 E+ + ++K I++ + + L+ + ++ + NY+ Sbjct: 173 NFN--EVASDTLDEWIYFLKKSQIKEEFTAQGLAEAKENLLVDSLSEAERA---NYLRFM 227 Query: 220 GDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + + + I + R+ E L + +EG QGK +E + IA+ + +G D + Sbjct: 228 ENRRYEISLIE--SSRSEGRLEGL----EEGLKEGMEQGKQQEKVNIARLLKQQGTDLDT 281 Query: 280 VMMVTRLSPDDLIAQ 294 + T L+ +++ Sbjct: 282 ITAATGLTREEIEEL 296 >UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacteria RepID=B4SC57_PELPB Length = 299 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 98/291 (33%), Gaps = 10/291 (3%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + +D + + A + + + ++L+ + + S L K Sbjct: 9 DFAFKKLFGSEEN-KDLLISLINAIVSEEDQVVEIELKNPYNLADYRAGKISILDIKAKA 67 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKEL--PLVLPMLFYHGC 126 + G + +E Q + R + Y + L G YKEL + + +L Y+ Sbjct: 68 ENGRWF---NVEMQISEDYNFDKRAIFYWAKLVTEQLSEGMMYKELKKTISINILDYNFV 124 Query: 127 -RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + +S + A R +++ + Q Sbjct: 125 PDTTEVHSCYKIINTATGKDDRLHDVFELHYIELKKFNKLHHEISSTLDRWTTFLTTAHQ 184 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 D ++ +L + +FN + R ++ + ++ A ++ + Sbjct: 185 LDREHTPKEL-ALDKNIVKAIAAIDRMFNEEERQVYEVRKQSLVDAESKIASALEKGMEK 243 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 + E+G +G + + IA +L +G+ + T LS ++ + S Sbjct: 244 GMEMGLEKGRDEGINAASKTIALNLLGKGIAIATIAEATGLSVLEITSLSQ 294 >UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1J9_CLOB8 Length = 278 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 51/286 (17%), Positives = 96/286 (33%), Gaps = 20/286 (6%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCD-LTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D VFK +D I L + L+ D L ++L + E L K Sbjct: 8 DFVFKLLFGDEKN-KDLIIELLNSILKMPHDELEDIELINTELLREFAEDRKGILDVRAK 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 T+ G ++ IE Q MA R + Y + +GY Y + Sbjct: 67 TKSGE---HIDIEIQVLYTYYMAERTLFYWSKMYNGQIKSGYT----------YDKLKKC 113 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLL 189 ++ + + + + D T ++++ + L +L +I + + Sbjct: 114 ITINIVDFNCIEINKLHTSFHITE----DETNKKLTDVLEIHYLELPKLFDNNIPKDESE 169 Query: 190 GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADR 249 LV ++ L L + + + + + +L R Sbjct: 170 PLVQWMMFLQSRNKEAFEMLAEKNEKIKKAYNILEVISKDDNARAAYEAREAELHDQMTR 229 Query: 250 LREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 L+ +G E ++ A+ L GLD E+V T LS D+++ Sbjct: 230 LKS-AREEGIKEATIKNAKNFLVMGLDVEMVAKGTGLSVDEVLKIK 274 >UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C3K1_9GAMM Length = 272 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 45/292 (15%), Positives = 102/292 (34%), Gaps = 33/292 (11%) Query: 13 VFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPN-SFIDEDLRQYYSDLLWSVKTQ 71 K P F+ L ++ ++ E + S I ++ + + Q Sbjct: 4 FLKKVFSKPHIFTAFVKDMLGIE----IEIDKVETEKSFSPIIGNVDSRFD-----LFAQ 54 Query: 72 EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP--LVLPMLFYHGCRSP 129 + + V I+H+ + R + Y A+ + + P V ++ Sbjct: 55 DTKNRLIVDIQHKRYKDHY--DRFLHYHCVALLEQITSSANYKPDMQVYTIVVLTSGDKH 112 Query: 130 YP------YSLCWLDEFAEPAIARKI-YSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 +S LD + KI Y + D T P E ++ +L + +++ Sbjct: 113 KTDLLITDFSPKKLDGSSIAETQHKIVYVCPKYVTDETPKPYQEWLKAINDSLDKQVEES 172 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 + ++ +I SL+ + + + D ++ E ++A KE Sbjct: 173 HYHNE---VIQEIFSLIKKDKISPEEYARM-------KDEYSDEEYLQEQTQKA--RKEG 220 Query: 243 LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + ++ +G +G + L +A+ M + + E ++ VT LS + + Sbjct: 221 MEKGMEKGIGKGIEKGIEKGVLMMAKNMKEAKVAIETIIEVTGLSIEQIEDL 272 >UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV81_PEDHD Length = 318 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 113/315 (35%), Gaps = 55/315 (17%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + ++ + L + +T ++ N F E ++ + ++ V Sbjct: 28 DFSFKRLFATEE-SKPILIGLLNHLFKGRKYITEIEYGKNEFPGEIAQEGGA--VFDVYC 84 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC--RS 128 + G ++ IE Q +E R + Y A+ G ++ G + Sbjct: 85 TDVNGSKFI-IEVQRGNQEYFKERALFYVSRAISEQAPKGDRK-----------GWAYKL 132 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSA-----------------FPLVDITVVPDDEIMQHR 171 Y L +L++F P + Y F +++ + Sbjct: 133 TEVYLLAFLEDFNLPDSPKSEYVQDICLANRHTGIIFYDKVGFIFIEMLNFVKGSDELYT 192 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF------ 225 ++ KH+ + Q L +G D QL L NY T + + Sbjct: 193 ELDKWLYALKHLTE------FKQRPEYL-SGPEFD-QLFTLANYASLTPEERDMYNSSLK 244 Query: 226 -----RAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 + + +++ ++ L ++ RE+G QG H++A+ IA EML E + Sbjct: 245 RKWDNKNVLDYAVKKSLEQG--LEQGLEQGREQGREQGIHKKAIEIALEMLVNKYPIEEI 302 Query: 281 MMVTRLSPDDLIAQS 295 + +T+LS +++ + Sbjct: 303 IKLTKLSKEEIQSLQ 317 >UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostridium RepID=B6FJ15_9CLOT Length = 310 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 43/285 (15%), Positives = 95/285 (33%), Gaps = 40/285 (14%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F+ + + + DLT +E ++ +DL + + Sbjct: 21 DRIFRMIFHEKKELLELYNAVNNSNYTNPDDLTITTIEDVVYMGMK-----NDLSFLI-- 73 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL----PLVLP----MLF 122 G + + EHQS + R + Y + + +++ L PL +P ++F Sbjct: 74 ----GDVMNLYEHQSSFSPNLPLRGLFYFSSLYKEYIEPVKHRLYTASPLHIPFPKYVVF 129 Query: 123 YHG-CRSPYPYSLCWLDEF-AEPAIARKIYSSAFPLVDITVVPDDEIMQH-RKMALLELI 179 Y+G + P L D F +++I + + E+M+ R + Sbjct: 130 YNGTKKEPERQELKLSDLFLENKEETTPSLECTAVVLNINLGKNRELMEKCRPLKEYAEF 189 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY-----VLQTGDAQRFRAFIGEIAE 234 IR + + GN ++ + + +LQ ++ + E E Sbjct: 190 ISIIR--------KYLSEQMDFGNAVNKAVDFCIHNGILADILQKNRSEVVDMILTEYDE 241 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 +E + L EG +G + + + + + + Sbjct: 242 -----EEFRRAWREDLLNEGFRKGLNNGLSKGIKGTIHACMKFNV 281 >UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8YTL4_ANASP Length = 270 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 93/291 (31%), Gaps = 33/291 (11%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ--YYSDLLWSV 68 D +F S + P +L + + + F +++Q + D L+ Sbjct: 4 DTIFYSLFQE-----------FPHIFFELINQSPQEASIYEFTSREVKQLAFRLDGLFLP 52 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 K + Y+ +E Q +P++ +R+ +L P + ++ Y Sbjct: 53 KINDSTKPFYI-VEVQFQPDDDFYYRLFAELFL----YLKQYKPPYPWQV-VVIYPSRGI 106 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI---MQHRKMALLELIQKHIRQ 185 ++ + DE ++IY L ++ V + + + + E RQ Sbjct: 107 ERQQTIHF-DEILVLNRVKRIY-----LDELGEVAETSLGVGVVKLVIETEETAPVLARQ 160 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 L+ Q L + + ++ + + ++ Sbjct: 161 -----LIAQAKQQLTDVTAKRDLINLIETIIVYKLPQKSREEIEAMLGLNELKQSRVYQE 215 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 + ++EG +GK E L M+ GL E + + L + + Sbjct: 216 ALEEGKQEGKQEGKQEAKLETIPRMVQFGLSVEAIAQLLDLPLEVVQQAVQ 266 >UniRef50_B7UFQ6 Predicted protein n=11 Tax=Escherichia RepID=B7UFQ6_ECO27 Length = 73 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 36/62 (58%) Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +E + + EG + HE+A++IA ML++G+DR+ V+ T+LS DL A+ Sbjct: 12 HQIGWQEGKIEGWQEGKLEGLQESMHEQAIKIALRMLEQGIDRDQVLAATQLSETDLAAK 71 Query: 295 SH 296 +H Sbjct: 72 NH 73 >UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostridium RepID=C0CTJ7_9CLOT Length = 327 Score = 54.6 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 46/321 (14%), Positives = 96/321 (29%), Gaps = 46/321 (14%) Query: 11 DAVFKSFLRHPDTARDFIDIHL--PAPLRKLCDLTTLKLEPNSFIDEDLR----QYYSDL 64 D V + + D I+ + + + D+ L R Q Y D Sbjct: 5 DMVLNRYFEDGERYADLINGYAFNGDQVVRKEDVQELDPRETGVAGRLGRRPGVQKYRDS 64 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD---------------- 108 + V G ++ + +EHQ + M R M A L Sbjct: 65 IRRVVL--GARFVLIGLEHQDQVHYAMPVRAMLQDAAEYDRQLRRIRRVNRRVGGLTGAE 122 Query: 109 -----AGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA----EPAIARKIYSSAFPLVDI 159 + V+ ++ Y+G + P+ ++ + R + + ++++ Sbjct: 123 FLGGFTRKDRVCPVITLVLYYGKK-PWDGAMDLHGLMDCAGYPEPMLRLVNNYRLHVLEV 181 Query: 160 TVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT 219 + + + IQ R D V D + + + + Sbjct: 182 RRFVNIRRFRTDLYQVFGFIQ---RSGDKEAERRFTEENRVYFEGMDEEAFDVITAITGS 238 Query: 220 GDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALR--------IAQEML 271 + +R + E R E + + + R EG ++GK E +A+ M Sbjct: 239 RELERVKEQYREEGGRI-NMCEAIRGMIEDGRIEGRLEGKIEGKYEGALEKTRTVARNMY 297 Query: 272 DRGLDRELVMMVTRLSPDDLI 292 RG+ E + + + Sbjct: 298 LRGMSAEDAAAICEMDTAQIE 318 >UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0R0H3_BRAHW Length = 292 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 47/293 (16%), Positives = 96/293 (32%), Gaps = 23/293 (7%) Query: 11 DAVFKSFLRHP---DTARDFID-IHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 D + DFI+ I L + ++ + L ED ++ +D+ Sbjct: 14 DYFVRYLFSDKGSEAILLDFINSIMLDSGMKTFRSVEILTPFNYKENYED-KETITDV-- 70 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--ELPLVLPMLFYH 124 TQ G V+IE Q + R++ Y + L G K L V+ + + Sbjct: 71 KCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLKQGEKYDALTPVISINLLN 127 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD---ITVVPDDEIMQHRKMALLELIQK 181 ++L D + + L D I ++ + + L K Sbjct: 128 -------FNLDDNDSIHSCYMIYDTNNKRL-LTDHLQIHIIELKKFKYNSLEYDLNCWLK 179 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 +D + L+ + + N++ + + +E Sbjct: 180 FFTMKDKDNKEVIMSELVKEKPIMEEVQRRYNNFIKDRLMMNEYDKRQAYLYGNQIMLEE 239 Query: 242 KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + +EEG +G +E +A+ M ++ +D L+ +T LS + + Sbjct: 240 ERRLGRVEGKEEGIKEGIEQEKYSLARNMKNKNMDLNLISELTGLSIEKIEKL 292 >UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTD5_DYAFD Length = 308 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 103/310 (33%), Gaps = 37/310 (11%) Query: 11 DAVFKSFLR---HPDTARDFIDI-HLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 D FK + D DF+++ L + E N I LR+ DL Sbjct: 10 DFGFKRIFGSEANKDILIDFLNVLFAGERLVADLTFAS--NENNGRI-PILRRAIFDLC- 65 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVLPM-- 120 +G +I IE Q +E R + YS + +++ ++AG +L V + Sbjct: 66 -CTGADGEQFI---IEVQRVRQEYFKDRCLYYSASLIRDQVEAGGTNWRYDLKPVYLIGL 121 Query: 121 ---LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 F Y + + + K +++ E ++ Sbjct: 122 MDFCFEDSDDGHYLHEIRLIKRSNGQVFYDK---FGLTFIEMPAFQKKESDLSTELDRWL 178 Query: 178 LIQKHIRQRDLLG------LVDQIVSLLVTGNTNDRQLKALFNYVLQTGD-------AQR 224 + K++ + +++ + ++ + N N + A Y+ D A++ Sbjct: 179 YLLKNLSKLNIVPPVLTNPVYQKVFRVAEVCNLNKEEKMAWDAYLKAKWDNENSMDYAKK 238 Query: 225 FRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVT 284 +G KE ++G G ++ + ML +G D + + +T Sbjct: 239 EAMRVGHEEGHKEGHKEGHKEGMKEGIKKGRETGIELGKRQVVKNMLAKGFDMQTISDIT 298 Query: 285 RLSPDDLIAQ 294 L+ + + Sbjct: 299 GLTFEQIRNA 308 >UniRef50_A7BL62 Putative uncharacterized protein n=2 Tax=Beggiatoa RepID=A7BL62_9GAMM Length = 166 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 55/136 (40%), Gaps = 17/136 (12%) Query: 168 MQHRKMALLELIQKHIRQR-------DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT- 219 + ++ + K++ + +L + D + L+ + +++ +F+Y+ + Sbjct: 34 LNEIPHKMMYICPKYLNDKTPAPYREWMLAIQDSLDELVEENDYTVPEVRQIFDYIEKDL 93 Query: 220 GDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + E E + + +G +G +E L IAQ+ML +G+ L Sbjct: 94 ISPEERARMFDEYGEE---------QVKQQHFVKGVAKGIEKEKLEIAQKMLQQGMAISL 144 Query: 280 VMMVTRLSPDDLIAQS 295 + +T+LS + + Sbjct: 145 ISQITKLSEEAITHLK 160 >UniRef50_Q3ARU8 Putative uncharacterized protein n=12 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ARU8_CHLCH Length = 324 Score = 54.2 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 48/325 (14%), Positives = 102/325 (31%), Gaps = 57/325 (17%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR---------QY 60 +D+ +K + +F+ P D + P F+D++LR + Sbjct: 14 YDSPWKEAIE--LYFPEFMAFFYPNAFLA-IDWSK----PYHFLDQELRSILPEAENGKR 66 Query: 61 YSDLLWSVKTQEGVGY-IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 D L V G +Y+ IE Q E R+ + + V Sbjct: 67 IVDKLVQVHLLGGKERCLYIQIEVQGNREADFPRRIFICNYRIFDKYGK-------PVAS 119 Query: 120 MLF----------------YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD---IT 160 + + G + + + L +F +AF LV + Sbjct: 120 FVILTDSDSSWRPTTYSYEFAGSKMTLEFDMVKLLDFEPRIKELLASDNAFALVTAAHLL 179 Query: 161 VVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTG----NTNDRQLKALFNYV 216 E R A +LI+ ++ V ++ ++ ++QL+ + Sbjct: 180 TQKTREKSFERLDAKSQLIRLLYNKQWTKERVKELFRVIDWFMELPKELEQQLQTEIYNI 239 Query: 217 LQTGDAQRFRAFIGEIAER----------APQEKEKLMTIADRLREEGAMQGKHEEALRI 266 + + + E+ ++ LM +R +G G + L I Sbjct: 240 EEEQKMKYISSIERYAMEKGWSEGMERGILEGMEKGLMEGMERGMAKGKEIGAEQTKLDI 299 Query: 267 AQEMLDRGLDRELVMMVTRLSPDDL 291 A+ ++ G+ + ++ +S + L Sbjct: 300 ARRLVASGISKAEAALLAGVSLETL 324 >UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Roseburia inulinivorans DSM 16841 RepID=C0G0A4_9FIRM Length = 319 Score = 53.8 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 17/125 (13%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D VF+ + + + DL + LE ++ +DL + Sbjct: 55 NYKDTVFRMLFSDRKNLLSLYNAVNQSNYKNPEDLEIVTLENAIYMGIK-----NDLAF- 108 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY------KELPLVLPML 121 + Y+Y EHQS M R + Y + Q +D +++P + Sbjct: 109 --IMDTNLYLY---EHQSTYNPNMPLRDLFYICSEYQKLVDKKSLFSSTLQKIPAPNFIE 163 Query: 122 FYHGC 126 FY+G Sbjct: 164 FYNGS 168 >UniRef50_C1DU30 Putative uncharacterized protein n=7 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DU30_SULAA Length = 313 Score = 53.5 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 105/310 (33%), Gaps = 62/310 (20%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLR-KLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D + K ++P A I+I L + +L + LK+ +DL+ VK Sbjct: 7 DLLLKHLFKNP--ATKLIEIILGKKVNWQLLQDSDLKIVKT---------READLV--VK 53 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 ++ + IE QS + M +RM Y + ++ + Y G Sbjct: 54 LEDNTI---LHIEIQSTNDPSMPYRMFEYFYLITDKYKPKD------LIQVCIYIGKE-- 102 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI---------- 179 P + +F++ ++ + L+DI +P E++ + + L Sbjct: 103 -PLKMSDKIQFSD-------WTYRYRLIDIKDIPCKELITSQNITDKLLAGLCKIEDPKF 154 Query: 180 --------QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY--------VLQTGDAQ 223 K+ +D L + + N + ++++ + T + Sbjct: 155 YVENVIKEIKNANPKDRKELFTLFLEISKIRNNIEEEIRSYIRQEDFEMPITIEWTREEI 214 Query: 224 RFRAFIGEI--AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVM 281 + ++ + KE L + +EG QG + + E L + + + + Sbjct: 215 ESYPVLRDVLKIGKEEGYKEGLQQGLQQGLKEGLEQGVQQGLQKGLIEGLRQSV-IDTIE 273 Query: 282 MVTRLSPDDL 291 + DDL Sbjct: 274 LKFGYVEDDL 283 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 53.5 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 85/241 (35%), Gaps = 22/241 (9%) Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 + +D+++ ++ + +E Q+ E R + Y ++ + Sbjct: 53 ETRNDIIFLLEDDT-----LLHLEFQTTAGEQDLKRFLYYDARLVRRQERKVHT------ 101 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDI-----TVVPDDEIMQHRKM 173 ++ Y G + L+ + IY + + + +++ Sbjct: 102 -IVIYSGR---IEQARERLECGSILYQVENIYMKHYNGDQEYNRLKHKIDNHQLLSETDT 157 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 L + ++ L Q L +L A+ ++ D + ++ Sbjct: 158 LKLIFLPLMKSEQKEEELAIQAAELAKAAPDEKTKLFAIAA-LIVITDKIMSESNKRKLL 216 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 E + + I + R+EG ++G+ +E AQ ML+ G+ EL+ T+L ++++ Sbjct: 217 E-VLKMTQIEQWIREEGRQEGELKGRRDEKRETAQTMLNLGMSPELIAKATKLPLEEILE 275 Query: 294 Q 294 Sbjct: 276 M 276 >UniRef50_D1PGQ2 Transposase, ISNCY family n=2 Tax=Prevotella copri DSM 18205 RepID=D1PGQ2_9BACT Length = 118 Score = 53.5 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 27/55 (49%) Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 E + + EG +G ++ +L IA++ML G+D VM +T LS L Sbjct: 63 EGMEMGLVKGLAEGMEKGMNKRSLEIARKMLANGMDAATVMEITGLSESQLQQLK 117 >UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia intestinalis ATCC 50581 RepID=C6LTE0_GIALA Length = 353 Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 42/300 (14%), Positives = 100/300 (33%), Gaps = 39/300 (13%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLP--APLRK-LCDLTTLKLEPNSFIDEDLRQYYSDL 64 D VF H ++ L ++ D T K D + D+ Sbjct: 73 DFVFYQIFGVEKHKSVLISLLNSILKGNPHVKDVRIDPTEHKRT-----TPDGKSVRLDI 127 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH-LDAG--YKELPLVLPML 121 +G V +E Q + R + Y ++++ + G YK +P V+ + Sbjct: 128 --KATINDGTI---VDVEMQCINTGDIYHRSIYYQSLILRDYTIKQGQSYKSIPDVIIIW 182 Query: 122 FYHGCRSPYPYSLCWLDEFAEP------AIARKIYSSAFPLVDITVVPDDEIMQHRKMAL 175 + + + + + IA + ++++T + + + K Sbjct: 183 IMNQDITNRKGCMHEIVPMYKANGIDQIEIASEKMRQF--IIELTKLGNTSNFCYNK--- 237 Query: 176 LELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 +D + +++ + ++ + + + RA Sbjct: 238 -AFTAWMTFIKDPSSISGELLEV--------EGVQTAMKELTYLSENKETRAIYDARRIA 288 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + ++ + EG ++G+ +E R+A++ML GLD E ++ + LS ++ Sbjct: 289 LLDLNSAIEHGIEKGKAEGLVEGRDKERERMAEQMLSDGLDIEFIVRYSGLSMQEIENVK 348 >UniRef50_A6L0F5 Putative uncharacterized protein n=6 Tax=Bacteroides RepID=A6L0F5_BACV8 Length = 125 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 31/61 (50%) Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 + +E L ++ +EG QG+ EE + A++M G+ +++ VT LS ++ + Sbjct: 65 QLKKGHEEGLKEGIEKGLKEGLEQGRKEECFKNAKKMKQAGIAFDVIAQVTGLSIGEIAS 124 Query: 294 Q 294 Sbjct: 125 L 125 >UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAJ2_9SPIR Length = 312 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 48/313 (15%), Positives = 100/313 (31%), Gaps = 38/313 (12%) Query: 11 DAVFKSFLRHPD---TARDFIDI-HLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD--- 63 D + D DFI+ L A ++ + L P + + ++ Y D Sbjct: 9 DYFVRYLFSSKDSNFILLDFINSTMLDANMKTFRSVEILTPSPKAGSRLNYKENYDDKES 68 Query: 64 ---------------LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD 108 L TQ G V+IE Q + R++ Y + L Sbjct: 69 IAPKVARKVDRCRRRLDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLK 125 Query: 109 AGYK--ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSA---FPLVDITVVP 163 G K L V+ + + C + K + +++I Sbjct: 126 QGEKYDALTPVISINL---LNFNLDNNDCIHSCYMIYDTKSKRLLTDHLQIHIIEIKKFK 182 Query: 164 DDEIMQHR--KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 D+ + + + + +K R+ + LV + + R + + ++ Sbjct: 183 DNLLDKDLDCWLKFFTIKEKDNREVIMSELVKEKP---IMEEVQKRYNNFIKDRLMMNEY 239 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVM 281 +R G + + + + E+G +G E + A+ M ++ +D L+ Sbjct: 240 DKREAYLYGNQIMLEEERRLGIEEGFKKGIEKGIEKGIKENQILTAKNMKNKNIDIALIS 299 Query: 282 MVTRLSPDDLIAQ 294 +T LS ++ Sbjct: 300 DITGLSIKEIEEL 312 >UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA1_9CLOT Length = 302 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 50/271 (18%), Positives = 97/271 (35%), Gaps = 36/271 (13%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D++F+ + + D+ + D+L+ + Sbjct: 18 DSLFRVIFSEKKELLEL--------------YNAINGSHYENPDDLIITTIGDVLY-LGM 62 Query: 71 QEGVGYI---YV-VIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL----PLVLP--- 119 + + ++ ++ + E QS M R + Y Q +L +L PL LP Sbjct: 63 KNDISFLIGQHLSLYEAQSTWNPNMPLRGLFYFSRLYQGYLKEHQLDLYSRRPLSLPFPE 122 Query: 120 -MLFYHGC-RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH-RKMALL 176 ++FY+G P L D F + + +++I ++E+M+ RK+ Sbjct: 123 FIVFYNGTMEQPDRTQLRLSDLFYQAEGVPCL-ECTATMININYGHNEEMMKSCRKLYEY 181 Query: 177 ELIQKHIRQRDLLGL-VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 + +R R GL ++ V V LK N++L+ + R + E E Sbjct: 182 AFLINAVRSRLNEGLHLEAAVDQAVEDCIQHDVLK---NFLLKHREEVR-EMILSEYDEE 237 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRI 266 EK ++ + E G +QG R+ Sbjct: 238 LHINSEKKISY-EEGLEAGVVQGTQHGQERV 267 >UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7AK04_9PORP Length = 299 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 47/311 (15%), Positives = 103/311 (33%), Gaps = 52/311 (16%) Query: 11 DAVFKSFL---RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D FK F + + F++ L ++ + ++++ N+ ++ Sbjct: 12 DYAFKRFFGTVSNKELTIGFLNSLLNKDIKDII-FHNVEMQGNNTDSRKA-------VFD 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL----------- 116 + + G +++ +E Q K ++ + R++ Y+ +Q D ++ L Sbjct: 64 LFCEGSDGELFI-VEIQKKRQKYFSDRVLYYASFVIQMQADIESEKFRLAKEEERRRWNY 122 Query: 117 ----VLPMLF--------YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPD 164 V + F Y + + +IY P ++ Sbjct: 123 HINKVYVVCFLDFRLDTRYTDKYRWDVVRMDRELKIPFSETLNEIY-LELPKFNLNFEEC 181 Query: 165 DEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV-LQTGDAQ 223 D + +I D++G + + ND+ L+ L + + LQ A+ Sbjct: 182 DTFYKK-----FLYTMNNI---DIMGQLSK------ETIQNDKLLRKLKSAIELQRMSAK 227 Query: 224 RFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 A+ IA + T + E+G +G E +I M G+D + Sbjct: 228 ERLAYELSIAAERDLAA-CMATSFEEGEEKGIAKGITEGMRKIILNMKQAGMDLATIAKT 286 Query: 284 TRLSPDDLIAQ 294 L ++ A Sbjct: 287 AGLPEKEVEAL 297 >UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B8HNA0_CYAP4 Length = 315 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 83/234 (35%), Gaps = 21/234 (8%) Query: 79 VVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL----PMLFYHGC----RSPY 130 + +E Q+ P+ M RM+ Y + ++ ++ + L +L Y + + Sbjct: 57 LHVEFQTGPDADMPLRMLDYRVRLLRRSPQKVVRQFVIYLRQTTSVLVYQTELQLESTWH 116 Query: 131 PYSL-CWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLL 189 +++ + +P +A + L + E + L I+ Q +L Sbjct: 117 EFNVVRLWECSTDPLLASRGLLPFAVL---GQTSNPEATLAQVAQRLSTIENRTEQSNLT 173 Query: 190 GLVDQIVSLLVTGNTNDRQLKALFN-----Y--VLQTGDAQRFRAFIGEIAERAPQEKEK 242 + L++ T R L+ Y +L+ G + I + + ++ Sbjct: 174 AASAILAGLVLDQQTIQRLLRREIMRESLFYQGILEEGMQKGVERGIAQGIQLGLEQGR- 232 Query: 243 LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 ++ R+EG +G+ E Q+ + + R L +SPD S Sbjct: 233 -QEGLEQGRQEGRQEGRQEGRQEGIQQGVLSLVLRSLTRKFGNISPDLQARISQ 285 >UniRef50_C1DU78 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DU78_SULAA Length = 163 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 52/127 (40%), Gaps = 13/127 (10%) Query: 158 DITVVPDDEIMQHRKMALLEL----IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALF 213 D+ + I++ + L K+I +D L + +LLV + + Sbjct: 2 DLNKISSKRIVKEFYDDICLLSAILTLKNIF-KDFNDLKPILRNLLVA--ETKDCIYIII 58 Query: 214 NYV-LQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD 272 NY+ L D + + E+ KEK+MT+ ++ R EG QG E + ++ + Sbjct: 59 NYIALAKKDLKTVENILEEV-----GGKEKMMTLTEKWRIEGLQQGIEEGIKKQLKDDIK 113 Query: 273 RGLDREL 279 ++ + Sbjct: 114 EAIEIKF 120 >UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=Q24Y19_DESHY Length = 248 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 76/243 (31%), Gaps = 32/243 (13%) Query: 79 VVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELP--LVLPMLFYHGCRSPYPYSL 134 + IE Q + M R + Y + G YKEL + + ++ ++ + Y Sbjct: 3 INIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDFNYLKQTSNYHN 62 Query: 135 CWLDEFAEPA------IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 + E + L + + ++ L+ + +++ Sbjct: 63 VFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRRR--EISLWENELVRWLLLLEGADNQEI 120 Query: 189 LGLVDQIV-------SLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI--------- 232 L ++++I + + Y + +A I E Sbjct: 121 LQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIREAELRLQEALE 180 Query: 233 ----AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 A + + + EG +G+ E +A+++L G + + T LS Sbjct: 181 EGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEVAKKLLVLGFEITKIAEATGLSE 240 Query: 289 DDL 291 +++ Sbjct: 241 EEI 243 >UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QZQ8_BRAHW Length = 309 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 43/298 (14%), Positives = 104/298 (34%), Gaps = 27/298 (9%) Query: 11 DAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTT---LKLEPNSFIDEDLRQYYSDL 64 D + H + A +FI+ K + T +++ I E+ + S + Sbjct: 25 DYFIRYLFSHTGNENIALNFINAVF-----KDLNFETFQKIEILNPFNIAENYDEKESIV 79 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPL-----V 117 T+ G I V+IE QS+ E R + Y + L+ G Y EL + Sbjct: 80 DIKATTESG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDELKPTVSINI 136 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + + Y L L+ +++ P ++ + + E + + + Sbjct: 137 TNFILTDEDKVHSCYILKELNNNKILTDHCQLHFVELPKSNLKNISEIESLDNTHKEFIS 196 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 ++ + + + ++ + ++ +R+ + N ++ + + Sbjct: 197 WVK--FFKGEDMSILMKENTIF---EEVERKCRTFVNDSPVMDKYKKREVDTYFLNKSME 251 Query: 238 -QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++ +EG +G E + +A+ M +D ++ T LS +++ Sbjct: 252 LDIRKAKEEGIKEGIKEGIKEGIKENQISMAKNMKKDKVDFNIISKYTGLSIEEIKKL 309 >UniRef50_Q73KA7 Putative uncharacterized protein n=2 Tax=Treponema RepID=Q73KA7_TREDE Length = 172 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 22/110 (20%), Positives = 44/110 (40%), Gaps = 6/110 (5%) Query: 191 LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE-----KLMT 245 L I++ NT ++ LK Y+ F I E+ + Q ++ +LM+ Sbjct: 63 LSKIIINADAFNNTKNKALKGFLEYLKTGKTKNEFTRRIEEMIQTVKQNEQARQEYRLMS 122 Query: 246 IADRL-REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + R +G +G + A+ + G + +M VT L +++ Sbjct: 123 TFEMDARYKGFTEGTYNNKKETAKILKQLGDSIQKIMQVTGLPEEEIEKL 172 >UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKZ7_9FIRM Length = 329 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 94/296 (31%), Gaps = 53/296 (17%) Query: 15 KSFLRHPDTARDFIDIHL--------PAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 + L HP DF + + P L + + + + + + + D++ Sbjct: 9 RKLLNHPARFADFYNGTVFGGRQVLRPEQLSDVPNEQGIVILDKDG-KKRVVERRRDIIK 67 Query: 67 SVKTQEGVGYIYVVI---EHQSKPEELMAFRMMRY--------SIAAMQNHLDAG----- 110 Y ++ E+Q M R M Y Q H G Sbjct: 68 KASFGA-----YFILAAEENQDTIHYGMPVRNMMYDALDYTEQMECLKQAHKSRGDVLDG 122 Query: 111 ---------YKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARK-------IYSSAF 154 L V+ ++ YHG + P+ D A A++ + Sbjct: 123 GGFLSGITREDRLMPVVSLILYHGSK-PWDGPRSLYDMLGLDASAKETLALKQVLPDYRI 181 Query: 155 PLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFN 214 L+D + + E+ + +++ + ++ G Q + +D + A+ Sbjct: 182 NLIDASNIEHPELFCTSLQHVFSMLKYNTDKQKFYGYAKQ--HQKDLLDMDDDSMLAMLT 239 Query: 215 YVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 + G+ +R + E + +E + D L +G ++GK E + + Sbjct: 240 LL---GEQKRLLKIL-ETSSNDTKEGTDVCIAIDELINDGKIEGKIEGKIEGEHRL 291 >UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D7Q8_9CLOT Length = 351 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 85/277 (30%), Gaps = 40/277 (14%) Query: 33 PAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIE-HQSKPEELM 91 P L + D + + +R+ D++ Y ++ E +Q K M Sbjct: 47 PEDLSDVPDENGIAIVGLDGKRRLIRRSR-DVIKKASFG---AYFVLLAEENQDKVHYAM 102 Query: 92 AFRMMRY--------SIAAMQNHLDAGY--------------KELPLVLPMLFYHGCRSP 129 R M Y A + H + G + V+ + YHG + P Sbjct: 103 PVRSMLYDALEYTEQVEALKRRHRECGDRLEGDAFLSGITRDDRIMPVVTLTVYHGAK-P 161 Query: 130 YPYSLCWLDEFA-------EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 + D A+ + LV++ + E + + + K+ Sbjct: 162 WDGPRSLYDMLEMDRDSKEWEALKEVLPDYRLNLVELNNMQHLE-RFRSSLQPIFTVLKY 220 Query: 183 IRQ--RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 R+ R ++ L +D ++A+ + + R G + Sbjct: 221 NRKDKRKFYEYLENHREELRKM--DDDSVRAMLALLGEQKRLLRMLELPGGEGKERMDVY 278 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDR 277 + + REEG +GK E + L+ G R Sbjct: 279 NAIDELIADGREEGKAEGKAEGRVEGKAIGLELGQKR 315 >UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1Q2_EUBE2 Length = 321 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 47/328 (14%), Positives = 96/328 (29%), Gaps = 54/328 (16%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFI-------DIHLPAPLRKLCDLT-TLKLEPNSFID 54 + T+ D K+F R + D L D + + S+ + Sbjct: 4 SNRTTHQKDVSLKTFWRDNEHFADLFNATVFNGKQVLKPDKLTEMDTDVSATIHSKSYNE 63 Query: 55 EDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH---LDAGY 111 R D++ K +GV + + +E Q K M R M Y + + Sbjct: 64 SITRNR--DVV--KKMSDGVEFNILGLEIQDKTHYAMPLRTMTYDALGYIKEYNDIKKHH 119 Query: 112 K--------------------ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYS 151 K ++ ++ Y+G S + C D K Y Sbjct: 120 KLNKDSFSSHEEFLSGINKSDRFHPIITLVLYYG-ESLWDGPTCLSDMMISMPDNIKAYF 178 Query: 152 SAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQI----VSLLVTGNTNDR 207 S + L + ++ D+ + RD+ ++ I + + Sbjct: 179 SDYKLNLVQILDSDK-----------YTFYNEDVRDVFNIIRNIYNDDFDSIYREYESRN 227 Query: 208 QLKALFNYVLQTGDAQRFRAFIGEIAER-APQEKEKLMTIADRLREEGAMQGKHEEALRI 266 + + + + + E + +G +G E + Sbjct: 228 VDIDVMELICNITSVPKLMDLCTDTEQGGTVNMCEAMKRFQAECESKGMKEGIDSEKVNS 287 Query: 267 AQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ML+ G+ +E + +TR + +DL Sbjct: 288 IISMLEFGITKEQI--LTRYTKEDLERA 313 >UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A7T9_9CLOT Length = 271 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 50/291 (17%), Positives = 102/291 (35%), Gaps = 43/291 (14%) Query: 11 DAVFKSFLR---HPDTARDFIDIHL-PAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 D VFK+ +P F++ L P L ++ ++I++ D+ Sbjct: 10 DFVFKNIFGSEKNPKILISFLNATLKPKDLITSVEIKN-TDINKNYIEDKF--SRLDV-- 64 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVLPMLF 122 KT + IE Q K E M R + Y L G + + + +L Sbjct: 65 KAKTSNDEI---INIEIQLKNEYNMIKRSLYYWSKLYSEQLGEGQDYSVLKRTICINIL- 120 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 + +F ++IY ++E+ ++ +E I K Sbjct: 121 --------NFKYLKTRKFHSGYRLKEIY------------SNEELTNVAEIHFIE-IPKL 159 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 D ++ + L + +++L + + A+ + + + + Sbjct: 160 DDGADEKDMLVNWIEFLKDPES--ETVRSLEMNIEEIRQAKDELIRMSNDDTQREIYEMR 217 Query: 243 LMTIADRL--REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 T+ D++ E +G + IA+ +LD LD E + + T LS D++ Sbjct: 218 AKTLRDKISALNEAERKGIQQGKREIAKALLDV-LDIETIALKTGLSIDEI 267 >UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXQ3_9FIRM Length = 290 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 43/298 (14%), Positives = 100/298 (33%), Gaps = 38/298 (12%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D +F+ + + + D+ L +I + +D+ + Sbjct: 15 DRLFRFVFGAEENKAYLLSLCNAVSGTDYTDVDDIEITTLSDAIYI-----KMKNDISFL 69 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMR-----YSIAAMQNHLDAGYKELPLVLP--- 119 + +Q + EHQS M R M Y I ++N+LD L +L Sbjct: 70 IDSQMN------LFEHQSTFNPNMPLRGMECFAELYGIYIIENNLDIYVSSLQKILTPRY 123 Query: 120 MLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH-RKMALLE 177 + Y+G + P L D F P + + + +++I + ++++ + + Sbjct: 124 YVIYNGTEKQPDVVKLKLSDAFQVPDDSGE-FEWTATMLNINYGHNRKLLEQCQPLYEYA 182 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 K +R+ + + + V + + F + Sbjct: 183 HFIKLVREYSEAMELKKAIDKAVEKAREWKCIGTFLYQCKSEVSVMLLTEFDEK------ 236 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + ++ L+ + G +G+ +E ++ ML L E++ +S D ++ Sbjct: 237 KHEDNLIKL-------GEKEGREKERMKNICSMLALSLSPEIIAKACEVSVDYVLNLK 287 >UniRef50_B0C251 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C251_ACAM1 Length = 313 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 105/277 (37%), Gaps = 28/277 (10%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLK-LEPNSFIDEDLRQYYS------- 62 D +F+ L++ +F+D+ P + D +++ LE + + +S Sbjct: 5 DRLFRDLLKN--FFLEFVDLFFPK-IAVAIDPKSIRFLEDEESLKPQEQGEHSPASTKQE 61 Query: 63 ---DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 ++L V+ + +V +E+ S+ + R+ + + LP + P Sbjct: 62 ASSNVLVQVRLRGQESCFWVHLENSSETNIKLERRIFHTFARLDEKY------NLP-IYP 114 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 ++ +S + + + R++ +F + + + + +Q R L+ Sbjct: 115 IIL----QSSDKSQRLETNGYRVEFVDRRVLDFSFVAIQLHRLNWRDFLQRRNPVAAALM 170 Query: 180 QK-HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV--LQTGDAQRFRAFIGEIAERA 236 +++ D + + + LL + +++K + ++ +A + E+ Sbjct: 171 PTMNVQTFDRPVVKAECLRLLTNLRLDAKKVKVISQFIEAFLHLNAAEEQVLQTEMERMG 230 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 E+E++ + + QG EAL + +L R Sbjct: 231 LLERERITNLLTSTTQANQQQGAEREALSLVFRLLKR 267 >UniRef50_Q6ZEK6 Slr5124 protein n=11 Tax=Chroococcales RepID=Q6ZEK6_SYNY3 Length = 276 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 89/265 (33%), Gaps = 23/265 (8%) Query: 27 FIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSK 86 F+ +KL S + L +D L ++++ + + +E Q++ Sbjct: 8 FLAESFSEDYAAWLLGRPIKLTKLSPTELSLEPIRADSLILEQSED----LVLHLEFQTE 63 Query: 87 PEELMAFRMMRYSIAAMQNHLDAGYKELPLVL----PMLFYH-----GCRSPYPYSLCWL 137 P+ M FRM+ Y + + + + L L Y G ++ Sbjct: 64 PDPTMGFRMLDYRVRVYRRFPQKTMHQFVIYLKRSSNDLVYQDSFQVGETLHRYQAIRLW 123 Query: 138 DEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVS 197 ++ P+ A PL +T D + LE I+ + + +L+ Sbjct: 124 EQ---PSEAFLQSPGLLPLAVLTQTSDPTLKLREVATALEQIEDNRVKANLMAATSVFGG 180 Query: 198 LLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQ 257 +L+ L++ ++ ++ + E + + E + + EG ++ Sbjct: 181 ILLAPELIKTILRSEIM-----KESAVYQEILEE--GKIAGKLEGRLEGKLEGKLEGKLE 233 Query: 258 GKHEEALRIAQEMLDRGLDRELVMM 282 G+ E L + GL + Sbjct: 234 GRLEAKLETIPLLKKLGLTITEIAK 258 >UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2D99 Length = 308 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 97/281 (34%), Gaps = 34/281 (12%) Query: 7 STPHDAVFKS-FLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY--SD 63 T HD FK+ L +P A F P + + D + + L + D Sbjct: 1 PTSHDQNFKNLILDYPRQALQF---FAPDEAKNIDDSAVITPIRQEQLKNRLGDRFYELD 57 Query: 64 LLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 + V+ +G + ++E ++ P R++ Y + L + V+P++ Sbjct: 58 VPLKVEWPDGRHAAMLFLLEEETDPARFSIHRLVSYCANLAE--LMGTNR----VVPIVI 111 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA--LLELIQ 180 + D + S + + +P ++ + + Sbjct: 112 F-----LRSSPDIRRDLHLGVDGVNFL-SFHYIACVLPDIPAEQYKDSTNIVARIALPTM 165 Query: 181 KHIRQR--DLLGL-VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 + R++ D++ + + +L G+ + L + Y Q D +R +R P Sbjct: 166 HYAREQVIDVMAWALRGLDTLEANGDKRIKYLDFIDTY-SQLEDNER-----QLFKQRYP 219 Query: 238 QEKEKLMTIADR----LREEGAMQGKHEEALRIAQEMLDRG 274 QE++ + +I R +G QG E L QE G Sbjct: 220 QEEKTVTSIVQRAIHQGIHQGIHQGIQEGMLMGRQEGRQEG 260 >UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2B5_PEDHD Length = 310 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 49/299 (16%), Positives = 107/299 (35%), Gaps = 37/299 (12%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ--YYSDLLWSV 68 D FK +D L L+ ++ +L+ N + E + D++ Sbjct: 34 DLGFKRLFSAEQN-KDITITFLNHVLKGKREVVSLEFLKNEYPGETQEEGGVIIDIV--- 89 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 ++ +G + ++E Q + R + Y+ + ++ P HG R Sbjct: 90 -CKDQIG-AFFLVEMQKSWNQNFKERSLFYASRLI-------TEQAP--------HGNRK 132 Query: 129 PYPYSLCW------LDEFAEPAIARKIYSSAFPLV--DITVVPDDEIM-QHRKMALLELI 179 + YSL L++F A + + LV D V ++ + + ++ + Sbjct: 133 EWAYSLKDVYVIALLEKFTINAGNKGKWLHDIALVNTDTGKVFNERLRFTYIELLSFKKT 192 Query: 180 QKHIRQRDLLGLVDQIVSL--LVTGNT--NDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 + + + DL + + +L L + QL + + I + Sbjct: 193 ENQL-ETDLEKWIYALKNLKHLKQAPAAFTEPQLLQFCQAARYINLTKEEKNMISAKTKA 251 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + REEG +G H++A +IA ++ ++G+ + +T LS ++ Sbjct: 252 RWDYYYAIDGAKIMGREEGETRGAHQKAAQIAIKLKNKGVPFTEIQELTELSITEIKNL 310 >UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQ96_CLOCL Length = 288 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 50/297 (16%), Positives = 100/297 (33%), Gaps = 42/297 (14%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPL-RKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D VFK +D + L A L + +++ E L VK Sbjct: 17 DFVFKLLFGDEKN-KDLLIAFLSAVLNLPEREFVGIEILNTELFREFKEDKKGILDVRVK 75 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 T G + IE Q P E M R + Y + G Y + Sbjct: 76 TVNGKQ---IDIEIQVLPTEFMPERTLFYWSKMYTTQVKPGDT----------YDKLKKC 122 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLV-DITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 ++ + +++ L+ D T +I++ + + +L K I + Sbjct: 123 ITINIVDFKCIPLNKLH-----TSYHLIEDETGHKLTDILEVHFLEIPKLFDKQIEINED 177 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALF----------NYVLQTGDAQRFRAFIGEIAERAPQ 238 D I+ + + + + + +L+ I E E + Sbjct: 178 ----DPIIQWMEFLDGKSKGVMEMLAEKNESIKKAYNLLKIISKDEKARMIYEAREAELR 233 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 ++ + A+ +G +E+ALR+A++M+ RG ++ +T LS + ++ Sbjct: 234 DQLTRIRSAE-------EKGANEKALRVAEKMIKRGDSINDIIELTELSKEKILELK 283 >UniRef50_A7N2B6 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N2B6_VIBHB Length = 86 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 12/74 (16%) Query: 208 QLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGK-------- 259 +L Y+L+ G+ + +A + P+ +E+ MT+A++L G QG Sbjct: 7 AYDSLVEYLLRVGETSNLEDLMRTLARQVPEHEERFMTVAEQLEARGREQGLQQGRQEGE 66 Query: 260 ----HEEALRIAQE 269 E L IA++ Sbjct: 67 QQGWQEAQLSIAKK 80 >UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacteria RepID=A8F2U7_RICM5 Length = 281 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 47/292 (16%), Positives = 101/292 (34%), Gaps = 33/292 (11%) Query: 11 DAVFKSFLRHP-DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D FK + + ++ DL+ + E + L S L+ +K Sbjct: 10 DIAFKKLFSDKVKLINLLNSLLRLSKGDRIIDLSYITTEQ---LPLFLEGRRS--LFDLK 64 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCR 127 ++ G Y+ IE Q K E+ R Y + + G +K+L V+ + Sbjct: 65 VKDETGRWYI-IEMQRKMEKDYLNRTQLYGCYTYVSQIKKGMKHKDLLPVVIISIIRAKA 123 Query: 128 SPYPYSLCWLDEFAEPAIAR-KIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 P E I + ++S + +++ +++ + K+ + Sbjct: 124 LPDELPYISYHHIKESNIHKQYLFSLTYVFIELGKFKKNDLKDDT--DEWLYLLKYA-SQ 180 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 + + ++++ + Q K Q + AE A Q++ Sbjct: 181 EQEPPKEIKNEIVLSAYASLEQYKW---------TEQEHDDYFR--AEMAIQQE------ 223 Query: 247 ADRLREE---GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 D+ E+ G +G +E + A+EML E + T+L+ +++ Sbjct: 224 IDKFEEKFNAGMEKGIEKEKIETAKEMLIENGPIEQIARYTKLTIEEIKKLK 275 >UniRef50_A7C854 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C854_9GAMM Length = 69 Score = 51.1 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 34/57 (59%) Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +++K ++EG +G+ ++AL IA+ +L +G++ +V T LSP++L S Sbjct: 11 QQDKFEKGKTVGKKEGLKEGQQQKALEIARSLLAKGIEPPIVAETTGLSPNELATLS 67 >UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostridium RepID=B1V1L4_CLOPE Length = 300 Score = 51.1 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 47/298 (15%), Positives = 106/298 (35%), Gaps = 22/298 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D VFK + ++D + L A ++ + ++L+ + + + L KT Sbjct: 8 DFVFKRLFGAEE-SKDSLISLLNAIIKSDNPIKDIELKSPDLEKQHIGDKFCRLDIKAKT 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKEL--PLVLPMLFYHGC 126 +G + +E Q + E M R + Y + L YK L + + +L + Sbjct: 67 DKGEI---INVEIQVRDEYNMVQRTLYYWSKIYSDQLGASENYKNLARTVCINILNFKLL 123 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 + ++ L E I +++ + + + + L + I++ Sbjct: 124 DNDRYHNTYRLKEITTNEELTDI--EEIHFIELPKSKEIKSEEVNNIDSLLKWIEFIKEP 181 Query: 187 D-----LLGLVDQIVSLLVTG-NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 + +L L D+ + T K + Y + + + R + Sbjct: 182 ESETVRILELTDESIRKAKTQLYKLSLDKKTIEQY--RIREKAMYDEISALENSREKGLQ 239 Query: 241 EKLMTIADRLREEGAMQGKHEEALR----IAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 E + +EEG +G+ L+ IA+ +L +GL+ + + + L + + Sbjct: 240 EGVKIGRKEGKEEGLKEGEVRGKLKANRKIAKNLLSKGLELKEIAKILELDENLVEEI 297 >UniRef50_A8PPL6 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PPL6_9COXI Length = 53 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 20/51 (39%), Positives = 31/51 (60%) Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 M+IA ++ ++G +G +EE L+IA+ ML RG D + V LS DL+ Sbjct: 1 MSIAHKIEQQGLRKGWYEEDLKIARRMLARGADCGYMKDVIGLSDQDLLNL 51 >UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0QWG9_BRAHW Length = 301 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 110/306 (35%), Gaps = 33/306 (10%) Query: 2 TISTTSTPHDAVFKSFLRHPDT---ARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 TI+ + +D + H A +FI+ +++ I E+ Sbjct: 16 TINNLNRINDYFIRYLFSHEGNENIALNFINAVFKD--LGFETFKKIEILNPFNIAENYD 73 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPL 116 + S + T+ G I V+IE Q++ E R + Y + L+ G Y EL Sbjct: 74 EKESIVDIKAITESG---ITVLIEIQARGNEDFIKRALYYWAYNYSSSLNRGSFYDELKP 130 Query: 117 -----VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 + + + + Y L L+ +++ P ++ + E + + Sbjct: 131 TVSINITNFILTNEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFNLKNISAIESLDNI 190 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR---AF 228 + ++ + + + ++ + ++ +++ + N ++ F Sbjct: 191 HKEFISWVK--FFKGEDMSILMKENTIF---EEVEKKCRTFVNNTPVMDKYKKREVDAYF 245 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 + E + +EEG QG+ +A+ IA+ + G+D +++ T LS Sbjct: 246 FDKSIELD----------LKKAKEEGIEQGEKNKAISIAKSFKNAGIDIKIISENTGLSI 295 Query: 289 DDLIAQ 294 +++ Sbjct: 296 EEVEKL 301 >UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QWI7_BRAHW Length = 289 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 37/262 (14%), Positives = 96/262 (36%), Gaps = 25/262 (9%) Query: 42 LTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 +K+ + E + S + T+ G V++E Q + +R + Y Sbjct: 44 FEEVKVLNTFNLKETINDKQSIVDVRAVTKSGET---VLVEIQRIGNQSFVYRSLYYWAK 100 Query: 102 AMQNHLDAGYK----ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIA-RKIYSSAF-- 154 ++L K + +V+ +L ++ + + + I ++ F Sbjct: 101 CYVSNLRNNEKYNDLKQVIVINILDFNLLKD----IDKEHSCYVIKELETNHILTNHFEM 156 Query: 155 PLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFN 214 +++ ++ ++ + +++I+ +LV N +++ +N Sbjct: 157 HFLELQKYLSSNSNLKEELDAWFYFLTI---KEKIEKMEEIMDILVKKNPIMKEVYDEYN 213 Query: 215 YVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLR--EEGAMQGKHEEALRIAQEMLD 272 + AE + L +R+R EEG +G E + +A+ M + Sbjct: 214 ------KFADTKDLFENYAEYEKNYFDILALSEERIRGREEGIKEGIKETQISMARNMKN 267 Query: 273 RGLDRELVMMVTRLSPDDLIAQ 294 + +D +L+ +T L+ +++ Sbjct: 268 KNMDIKLIGELTGLTTEEIEKL 289 >UniRef50_C1J8G9 YdgA n=11 Tax=Enterobacteriaceae RepID=C1J8G9_ECOLX Length = 81 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 4/55 (7%) Query: 244 MTIADRLREEGAMQGKHEEALRIAQE----MLDRGLDRELVMMVTRLSPDDLIAQ 294 MTIA+RL ++G +G + AL +A+E + D G E + T LS ++L Sbjct: 22 MTIAERLIQKGFDEGFKKGALEVAREAACRLRDMGWTPERIQEATGLSGEELKKL 76 >UniRef50_A8PKB8 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PKB8_9COXI Length = 172 Score = 50.4 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 54/134 (40%), Gaps = 5/134 (3%) Query: 155 PLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFN 214 L+ + D + K+ L +L+ + RD + + +++ + + Sbjct: 35 HLIFLETKKDPKARFIMKLRLTQLLYERGYGRDYVINLLKVIDWALVIPKDLE-----LE 89 Query: 215 YVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 Y + + + + + +E L + R+EG QG+ EE +A+ +L G Sbjct: 90 YKEKLHELEEEKNMSYITSFERLSREEGLQQGLQKGRQEGLEQGREEERYEMAKNLLAEG 149 Query: 275 LDRELVMMVTRLSP 288 L ELV VT+L Sbjct: 150 LPLELVKKVTKLPD 163 >UniRef50_C4Z592 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C4Z592_EUBE2 Length = 315 Score = 50.4 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 69/214 (32%), Gaps = 43/214 (20%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL--RQYYSDLLWSV 68 DA+F+ + L DE+L D+++ V Sbjct: 11 DALFRKVFEEKKDLLSLYNA----------------LNNTEHTDENLITVNTIEDVIY-V 53 Query: 69 KTQEGVGYI----YVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--------AGYKELPL 116 + + ++ + EHQS + M R + Y + +++ +LP Sbjct: 54 GYKNDIAFVIDSELNLYEHQSSVNKNMPIRGLIYFAELYKGYIERNSLRIYNETEVKLPF 113 Query: 117 VLPMLFYHGCRSPYPYSLCWL-DEFAEPAI---ARKIYSSAFPLVDITVVPDDEIMQHRK 172 ++FY+G + S+ L D F + L++I + EIM Sbjct: 114 PRYVVFYNGEKDETEKSVQRLADLFVRNEANQNQKPCLDVEVLLLNINYGCNKEIMNK-- 171 Query: 173 MALLELIQKHIRQRDLLGLVDQIVSLLVTGNTND 206 QK + L+ ++ + L + D Sbjct: 172 ------CQKLMEYSRLIAMIRGKTADLAKIYSQD 199 >UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G418_9FIRM Length = 312 Score = 50.4 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 95/258 (36%), Gaps = 35/258 (13%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D VF+ L+ P A + + +L LE ++ +D+ + + T Sbjct: 42 DRVFRMLLKEPKVALEVYNAMNGTLYDNPDELIITTLENAVYLGMK-----NDVSFILGT 96 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLP----MLFYH 124 Q V+ EHQS P M R + Y ++ D Y + +P ++FY+ Sbjct: 97 QL------VLYEHQSTPNPNMPLRNLAYVACVYMAYVFGDNLYGRKLIKIPEPRFVVFYN 150 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR-----KMALLELI 179 G S+ L + E V+I ++E+++ + ++++ Sbjct: 151 GTDKMPEQSVLRLSDAYESKSEELDLELKIRFVNINPGYNEEMVEKSPTLYQYVKFVDIV 210 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 +K+ ++ V++ + + K + L+ A+ R I E E Sbjct: 211 RKYQKEMPFPEAVEKAIDECIK--------KGILAEFLRKNRAEVLRVSIFEYDE----- 257 Query: 240 KEKLMTIADRLREEGAMQ 257 +E + + R+EG Q Sbjct: 258 EEHMRQEREESRQEGIEQ 275 >UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=Bacteroides fragilis 3_1_12 RepID=UPI0001B4A8CA Length = 282 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 98/288 (34%), Gaps = 18/288 (6%) Query: 11 DAVFKSFLR-HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D FK HPD F++ LP L + T ++ P+ + E+ S + + Sbjct: 9 DLTFKRVFGEHPDLVMSFLNALLPLRLEESI--TDIEYLPSGMVPENSLPKNSIVYVRCR 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPMLFYHGCR 127 +G +I +E Q +M + A +D+G + L V + + Sbjct: 67 DSKGRSFI---VEMQMIWSPEFKQCVMFNASKAYVRQMDSGEQYDLLQPVYSLNLVNDIF 123 Query: 128 SP-YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 P + R I V++ + + L I ++ Sbjct: 124 EPDIKEYYHYYRLVHVEHTERVINGLHLVFVELPKFTPHTYSEKKMHILWLRYLTEIDEK 183 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 ++ L+ + + L + F I+ EK + + Sbjct: 184 -----THEVPEELLENPEIKKAVTVLEESAFTPEQLLGYEKFWDIISV----EKTLISSA 234 Query: 247 ADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + +EEG +G+ +E L +A +GL +++ +T LS +++ Sbjct: 235 ERKEKEEGRKEGELQEKLLVASNAKKQGLSLDIISSLTGLSAEEIERL 282 >UniRef50_C4G2Y1 Putative uncharacterized protein n=7 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G2Y1_ABIDE Length = 272 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 50/291 (17%), Positives = 108/291 (37%), Gaps = 43/291 (14%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F+ H + + + + L D + E + + Q S ++ K Sbjct: 18 DVMFRKMAEHKEFCEEILRVILD-------DDNLIVTESGAQWEGTNLQGRS-VVLDAKC 69 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPY 130 G G ++ IE Q ++ + +RY+ + + ++ + V P Sbjct: 70 ISGDGR-HINIEVQKGNDDD-HLKRVRYNGSILTTNITNPGTKFEAV-P----------- 115 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPL--VDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 + ++ +F I++ FP+ VD V EI+ + K R Sbjct: 116 DVCIIFISKF-------DIFNCGFPVYHVDKVVRETGEIIDDGLTEIYVTTVKKDDSR-- 166 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALFNYV-----LQTGDAQRFRAFIGEIAERAPQEKEKL 243 V +++ L + + + + + + + + G + + +I +E EK Sbjct: 167 ---VSKLMELFIKDDAYNTEDFPVTSEIKARFKISEGGRKEMNEMLEKIIREEKEESEK- 222 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 R + G +G +EE +RIA++M + L L+ +T L ++A Sbjct: 223 -RGEKRGEKRGKKEGANEEKIRIAKDMKKKSLPFSLIAEITGLPEKKILAL 272 >UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolbachia RepID=Q5GSR2_WOLTR Length = 317 Score = 50.0 bits (118), Expect = 9e-05, Method: Composition-based stats. Identities = 43/309 (13%), Positives = 107/309 (34%), Gaps = 32/309 (10%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLP-APLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 D +FK + F++ L A + + ++ L ++ ID ++ ++ Sbjct: 12 DLIFKKIFGTEKNKKIIICFLNNILGFAEINAIQEVEFL----SAIIDPEIASNKQSIIV 67 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFY 123 V ++ G V IE Q + R+ Y++ A LD +L V + Sbjct: 68 DVFCKDATGTRRV-IEVQLAINKGFEKRVQPYAVKAYSRQLDKSGNYIVDLKKVFFIAIS 126 Query: 124 H----GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL-LEL 178 + + Y + D + F +++ ++ Q + Sbjct: 127 NCNLLSEKVDYISTHNIHDTKTN---GHYLKDFQFIFIELPKFSKSKVEQLINIVEHWCF 183 Query: 179 IQKHI---RQRDLLGLVDQIVSLLVTGNTNDRQ---LKALFNYVLQTGDAQRFRAFIG-- 230 K+ + DL + +++ + + + D + + Y + + Q+ +A + Sbjct: 184 FFKNAEDTTETDLKRVAKKVLIIKLAYDGLDEFHWNEEDIIAYEERVMNLQKEKAILEYR 243 Query: 231 -EIAERAPQEKEKLMT---IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 ++A +E+ ++ E+G +G + + +A+ L G+ + + L Sbjct: 244 LDLATEKGREEGVKISKERGIKVGAEKGREEGVKKAKIAVAKNSLKAGMSIGAIAEIIGL 303 Query: 287 SPDDLIAQS 295 S + Sbjct: 304 SVGKIKKLH 312 >UniRef50_Q2FSG0 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSG0_METHJ Length = 291 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 61/145 (42%), Gaps = 10/145 (6%) Query: 144 AIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGN 203 A+ + + L+ + ++ ++ + + ++K + ++DL V ++ +L Sbjct: 142 ALEKGEPVNELELIFLPLM-KSKLTKIELLRRTIDLEKELPEKDLRNKVRELTLILADKI 200 Query: 204 TNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEA 263 + + L L+ + R F ++ L ++ ++G +GK +E Sbjct: 201 VDQKILDELW---------EELRMFKVVKYAEEKGMEKGLEKGLEKGIKKGMEKGKKQER 251 Query: 264 LRIAQEMLDRGLDRELVMMVTRLSP 288 +A+ ML G++ EL++ T L Sbjct: 252 ETVAKNMLSLGIEDELIIKATGLDQ 276 >UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAA90 Length = 345 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 106/311 (34%), Gaps = 55/311 (17%) Query: 11 DAVFKSFLRHPDTARDFIDIHL-------PAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 D VF+ + + + F++ L + ++ L L+ + + ++ D Sbjct: 64 DFVFEKIFSNHERMKSFLESVLVGKNKILHEEINEVIYLNNNLLQNSLTQEYIPKKSMFD 123 Query: 64 LLWSVKTQEGVGYIYVVIE-HQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM-- 120 L +KT +G ++E ++ + + R+ YS ++ + + L ++ + Sbjct: 124 L--QIKTSQGT----FIVEIYKRSFQPFLK-RIQYYSAQSLSQQQNQTHTSLKPIISIAI 176 Query: 121 ----LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 LF + + + + S + +++ + + Q Sbjct: 177 VDDILF-----EDDVPCISFHKTIEQKTQKVFLNYSTYVFIELGKYDNKKYDQSCVHG-- 229 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 + +++ L L+ + S + L A Q + + F + + Sbjct: 230 ------VNEKEWLDLLKK--SDIHRQYKTKEVLNA-----AQYAQFIQEKLFDEYVKHKL 276 Query: 237 PQEKEKLMTIADRLREEGAMQGKHE------------EALRIAQEMLDRGLDRELVMMVT 284 +++ + + EG QG+ E + ++ML GL + ++ T Sbjct: 277 --YEDQFIEEIKNAKVEGIQQGQEETIKLSKHYSIKAGKEEVVKQMLKDGLSLQKIITYT 334 Query: 285 RLSPDDLIAQS 295 LS +++ Sbjct: 335 GLSKEEIDEIK 345 >UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKU9_9CYAN Length = 323 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 46/291 (15%), Positives = 91/291 (31%), Gaps = 24/291 (8%) Query: 1 MTISTTSTPH-DAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED 56 MT +P D FK D F++ + + LT + + E Sbjct: 1 MTKKKFISPKIDYAFKKIFGSDQSEDILISFLNAIVYNGKSVISSLTIVNPYNPGQV-ET 59 Query: 57 LRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKEL 114 L+ Y D+ G V+IE Q R+ A N L +G Y E+ Sbjct: 60 LKDSYLDI--RAVLNSGEI---VLIEMQVARIAAFYKRVTYNLCKAYANQLTSGDYYLEI 114 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDIT------VVPDDEIM 168 V+ + F + + + L+ + +P+ + + Sbjct: 115 TPVIAVTITDFILFKENPKCIHHFVFKDKESSSEYPEHELQLIFVELPRFVKKLPELQTL 174 Query: 169 QHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 + + + Q DL + + + + +A ++R Sbjct: 175 AEKWIYFMTQAQ------DLEEIPESLAEVTAIEKALTIANQANLTPAEAEEVSRRAMQL 228 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 EI +E + R+EG +G+ E + A+ ++ R L++ Sbjct: 229 RDEIGRIKYATEEASKEAREEGRQEGRQEGRQEGRITEARALVLRLLNKRF 279 >UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax=Proteobacteria RepID=C4ZLA7_THASP Length = 339 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 104/307 (33%), Gaps = 47/307 (15%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ- 59 M S +D+ +K + H +FID + P + D F+D++L+Q Sbjct: 1 MPASAAQDDYDSPWKEAVEH--AFPEFIDFYFPDA-GRQIDWARGHR----FLDKELQQI 53 Query: 60 --------YYSDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG 110 + D L SV T G ++ V IE Q + A RM Y+ ++ Sbjct: 54 VRDAALGRRHVDKLASVTTHAGEEDWLCVHIEVQGSMDPDFARRMFVYNYRIYDSYDR-- 111 Query: 111 YKELPLVLPM-LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDI-TVVPDDEIM 168 V + + + P D F + + + FP+ + D+ + Sbjct: 112 -----PVASLAVLADDDPAWRP------DRFGYERLGCRH-NLQFPVAKLVDHAADEAAL 159 Query: 169 QHRKMALLELIQKHIRQRD-------LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 + H+ R ++V LL + +++ F+ + Sbjct: 160 LCNPNPFALVTAAHLYTRRTRRSPIARFDAKRRLVRLLYERDWTRQRILDFFSVLDWMMR 219 Query: 222 -AQRFRAFIGEIAERAPQEKEKL------MTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 + F + + E E++ +R ++G QG + ++ +++G Sbjct: 220 LPREFEQRLWQDIENIEGERKVKYVTSVERLAIERGLQKGMEQGLEIGIEKGIEQGIEKG 279 Query: 275 LDRELVM 281 +++ Sbjct: 280 IEKGRAQ 286 >UniRef50_C8PR55 Transposase (Fragment) n=5 Tax=Treponema RepID=C8PR55_9SPIO Length = 53 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 19/47 (40%) Query: 249 RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 G +G + L A+ ML G + + +T LS ++ A Sbjct: 7 EGEARGRSEGSRQAKLETAKTMLSMGYPQADICKITGLSKAEIEAIK 53 >UniRef50_A7C5R5 Putative uncharacterized protein n=2 Tax=Beggiatoa sp. PS RepID=A7C5R5_9GAMM Length = 263 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 52/289 (17%), Positives = 108/289 (37%), Gaps = 52/289 (17%) Query: 28 IDIHLPAPLRKLCDLTTLKLEPNSFIDE-DLRQYYSDLLWSVKTQEGVGYIYVVIEHQSK 86 ++ L A LR+ + + LE S + + + + D+L K + YV+IE Q+ Sbjct: 1 MEGFLSAVLRQDVSIIEI-LESESNVSDIEQKLNRVDILIQDKLKR-----YVIIEIQNC 54 Query: 87 PEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPM-------------LFYHGCR---- 127 R++ + +++ AG Y+E+ V+ + +FY Sbjct: 55 HITAYLERILFGVSKIIVDNVKAGEDYREISKVVSISILYFNLGLGEDYVFYGNTEFRGL 114 Query: 128 ---SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 +P + D+ E +++I+ + L+++ D + K Sbjct: 115 HDNAPLIFRRRREDKTLEKLKSQEIF-PEYYLINVERFSD--------------VMK--- 156 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 DL + + + + L + + R E++ + Sbjct: 157 -TDLDEWIYLFKHAALPPHCQAKNLDKAGEKLDVLKMSAEERHRYDRYLVAMVNEQDAID 215 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 T +G +G+ +A +IA++ML +G+D E V +T LSPD + Sbjct: 216 TA----HNKGWQEGEDAKARKIAKKMLAKGMDIETVAAMTDLSPDIIER 260 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaprot... 284 3e-75 UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammap... 279 8e-74 UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q... 271 2e-71 UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4... 264 2e-69 UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae Re... 258 2e-67 UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7... 254 2e-66 UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriacea... 253 6e-66 UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2... 246 8e-64 UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX 241 3e-62 UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobact... 238 1e-61 UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Provide... 235 1e-60 UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax... 235 1e-60 UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae ... 234 3e-60 UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax... 233 7e-60 UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteri... 232 8e-60 UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC 227 5e-58 UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2L... 217 4e-55 UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickett... 209 1e-52 UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q... 204 3e-51 UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamilton... 198 2e-49 UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia... 197 4e-49 UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escheri... 197 4e-49 UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickett... 192 9e-48 UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesioc... 188 2e-46 UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus... 186 8e-46 UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesioc... 185 2e-45 UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica ... 182 1e-44 UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magneto... 181 2e-44 UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangiu... 181 3e-44 UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteo... 179 9e-44 UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumuli... 178 3e-43 UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteri... 177 3e-43 UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae ... 176 1e-42 UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostri... 173 7e-42 UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochrace... 173 9e-42 UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methano... 172 1e-41 UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=... 170 5e-41 UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli R... 170 5e-41 UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkalip... 168 2e-40 UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostri... 165 2e-39 UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=... 163 7e-39 UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=O... 162 1e-38 UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK 162 2e-38 UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaeroc... 162 2e-38 UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria Rep... 160 5e-38 UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=... 160 6e-38 UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spiroso... 159 1e-37 UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostri... 158 2e-37 UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C... 157 3e-37 UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=... 157 4e-37 UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus... 156 7e-37 UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria Rep... 156 1e-36 UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostri... 156 1e-36 UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=... 155 2e-36 UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfo... 153 6e-36 UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostri... 153 9e-36 UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taenios... 152 1e-35 UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. ... 152 1e-35 UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldice... 149 1e-34 UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptosp... 147 4e-34 UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadoba... 146 6e-34 UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesioc... 146 8e-34 UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfo... 146 1e-33 UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candida... 145 2e-33 UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magneto... 144 4e-33 UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptosp... 144 5e-33 UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petroto... 143 6e-33 UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquific... 142 1e-32 UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptosp... 142 1e-32 UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfo... 140 6e-32 UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfuri... 139 1e-31 UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=... 139 1e-31 UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostri... 137 4e-31 UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteri... 135 2e-30 UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candida... 135 2e-30 UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfo... 134 3e-30 UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfuri... 132 1e-29 UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptosp... 132 1e-29 UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldice... 129 1e-28 UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opituta... 129 2e-28 UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfo... 128 2e-28 UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaeroc... 128 3e-28 UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostri... 126 8e-28 UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synecho... 124 2e-27 UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferr... 124 2e-27 UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostri... 124 3e-27 UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseifl... 124 5e-27 UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuok... 123 8e-27 UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfuri... 122 1e-26 UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NI... 122 1e-26 UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevote... 122 1e-26 UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 121 4e-26 UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsie... 120 5e-26 UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Trepone... 119 9e-26 UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermo... 119 1e-25 UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A... 119 1e-25 UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostr... 118 2e-25 UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID... 118 2e-25 UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella ... 118 2e-25 UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotro... 118 2e-25 UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea f... 118 3e-25 UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacter... 118 3e-25 UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfuri... 118 3e-25 UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petroto... 117 4e-25 UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorell... 115 2e-24 UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfi... 115 2e-24 UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax... 114 5e-24 UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceu... 112 1e-23 UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadoba... 112 2e-23 UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax... 111 3e-23 UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax... 108 2e-22 UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bactero... 108 3e-22 UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobac... 108 3e-22 UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabac... 106 1e-21 UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultu... 106 1e-21 UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacter... 105 2e-21 UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfi... 105 2e-21 UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevote... 104 3e-21 UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachys... 104 3e-21 UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=... 104 4e-21 UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostri... 104 5e-21 UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Strepto... 103 7e-21 UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=P... 103 8e-21 UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenom... 103 8e-21 UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia... 103 8e-21 UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW 103 9e-21 UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermo... 102 1e-20 UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escheri... 102 1e-20 UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostri... 102 2e-20 UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=... 101 3e-20 UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachys... 99 9e-20 UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostri... 99 9e-20 UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=... 99 1e-19 UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillu... 99 1e-19 UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenom... 99 2e-19 UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolba... 99 2e-19 UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobac... 99 2e-19 UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachys... 99 2e-19 UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bactero... 98 3e-19 UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostri... 98 4e-19 UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax... 97 7e-19 UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Ricket... 97 8e-19 UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachys... 97 8e-19 UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bactero... 96 1e-18 UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterob... 96 1e-18 UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptosp... 95 3e-18 UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostri... 95 3e-18 UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubact... 95 3e-18 UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachys... 95 3e-18 UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coproco... 95 4e-18 UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermo... 95 4e-18 UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostri... 94 5e-18 UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostr... 94 5e-18 UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingoba... 93 8e-18 UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Trepone... 93 8e-18 UniRef50_C8PT67 Putative uncharacterized protein n=1 Tax=Trepone... 93 1e-17 UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Rosebur... 93 1e-17 UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminoc... 93 1e-17 UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Rumino... 93 2e-17 UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryante... 92 2e-17 UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacte... 92 3e-17 UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microco... 91 3e-17 UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria ... 91 4e-17 UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostri... 91 4e-17 UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermo... 91 5e-17 UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiat... 91 6e-17 UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=... 90 7e-17 UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfi... 90 7e-17 UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacte... 90 7e-17 UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobac... 90 9e-17 UniRef50_C1QAK6 Putative uncharacterized protein n=1 Tax=Brachys... 90 9e-17 UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteri... 90 1e-16 UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacte... 89 2e-16 UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotro... 89 2e-16 UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microco... 89 2e-16 UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfuri... 89 2e-16 UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfi... 88 2e-16 UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostr... 88 3e-16 UniRef50_C9RP54 Putative uncharacterized protein n=1 Tax=Fibroba... 88 4e-16 UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorell... 88 4e-16 UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanoth... 88 4e-16 UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacter... 88 5e-16 UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminoc... 88 5e-16 UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostri... 87 6e-16 UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax... 87 7e-16 UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotro... 87 8e-16 UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatex... 87 8e-16 UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulic... 87 9e-16 UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax... 86 1e-15 UniRef50_Q8YK35 All8083 protein n=6 Tax=Cyanobacteria RepID=Q8YK... 86 1e-15 UniRef50_Q8ZS56 Alr7656 protein n=6 Tax=Nostocaceae RepID=Q8ZS56... 86 2e-15 UniRef50_B0C251 Putative uncharacterized protein n=1 Tax=Acaryoc... 86 2e-15 UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibroba... 85 2e-15 UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methano... 85 3e-15 UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255... 85 3e-15 UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachys... 85 4e-15 UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatrono... 85 4e-15 UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillu... 85 4e-15 UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabac... 84 5e-15 UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostri... 84 8e-15 UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea f... 83 9e-15 UniRef50_C0R2N1 Putative uncharacterized protein n=4 Tax=Wolbach... 83 1e-14 UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostri... 83 1e-14 UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiat... 83 1e-14 UniRef50_UPI00019735B3 hypothetical protein ClM62_08045 n=1 Tax=... 83 1e-14 UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacte... 83 2e-14 UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiat... 82 2e-14 UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobac... 82 2e-14 UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacte... 81 4e-14 UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostri... 81 5e-14 UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Trepon... 81 6e-14 UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotro... 80 7e-14 UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfo... 80 9e-14 UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococca... 80 9e-14 UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscil... 80 9e-14 UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacteriu... 79 2e-13 UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax... 79 2e-13 UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8Y... 79 2e-13 UniRef50_Q3ARU8 Putative uncharacterized protein n=12 Tax=Chloro... 78 3e-13 UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 ... 78 3e-13 UniRef50_C8NHS0 Putative uncharacterized protein n=1 Tax=Granuli... 78 3e-13 UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecali... 77 7e-13 UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. ... 77 7e-13 UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldice... 77 8e-13 UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanoba... 77 8e-13 UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xen... 75 2e-12 UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorob... 75 3e-12 UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicycl... 74 4e-12 UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candida... 74 4e-12 UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptoco... 73 1e-11 Sequences not found previously or not previously below threshold: UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bactero... 100 5e-20 UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia... 90 7e-17 UniRef50_B3QUJ9 Putative uncharacterized protein n=8 Tax=Bacteri... 86 1e-15 UniRef50_C6Y2C7 Putative uncharacterized protein n=2 Tax=Pedobac... 85 4e-15 UniRef50_Q2FSM2 Putative uncharacterized protein n=3 Tax=Methano... 81 6e-14 UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacter... 80 8e-14 UniRef50_B0NFN2 Putative uncharacterized protein n=4 Tax=Clostri... 78 3e-13 UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YM... 78 3e-13 UniRef50_B4VQ19 Putative uncharacterized protein n=3 Tax=Microco... 78 4e-13 UniRef50_C9RLI8 Putative uncharacterized protein n=1 Tax=Fibroba... 78 6e-13 UniRef50_C9RMD5 Putative uncharacterized protein n=1 Tax=Fibroba... 77 6e-13 UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadoba... 76 1e-12 UniRef50_B7I1C8 Putative uncharacterized protein n=16 Tax=Bacill... 76 1e-12 UniRef50_B4VTF8 Putative uncharacterized protein n=7 Tax=Oscilla... 76 2e-12 UniRef50_B7K6I4 Putative uncharacterized protein n=2 Tax=Cyanoth... 76 2e-12 UniRef50_Q111X0 Putative uncharacterized protein n=10 Tax=Oscill... 73 8e-12 UniRef50_B0MQP0 Putative uncharacterized protein n=2 Tax=Eubacte... 73 1e-11 UniRef50_C0DB21 Putative uncharacterized protein n=2 Tax=Clostri... 73 1e-11 UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax... 73 2e-11 UniRef50_C9LT45 Putative uncharacterized protein n=2 Tax=Selenom... 72 2e-11 >UniRef50_P77768 Uncharacterized protein yfcI n=175 Tax=Gammaproteobacteria RepID=YFCI_ECOLI Length = 296 Score = 284 bits (726), Expect = 3e-75, Method: Composition-based stats. Identities = 296/296 (100%), Positives = 296/296 (100%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY Sbjct: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM Sbjct: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ Sbjct: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK Sbjct: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH Sbjct: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 >UniRef50_P37415 Uncharacterized protein pSLT051 n=256 Tax=Gammaproteobacteria RepID=YTL2_SALTY Length = 313 Score = 279 bits (713), Expect = 8e-74, Method: Composition-based stats. Identities = 163/311 (52%), Positives = 221/311 (71%), Gaps = 16/311 (5%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +TT TPHDA F+ FL PD ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+ Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 SD+L+S+KT G GYI+V++EHQS P++ MAFR++RY++AAMQ HL+AG+K+LPLV+P+L Sbjct: 63 SDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVL 122 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 FY G RSPYPYS WLDEF + A+A K+YSSAFPLVD+TV+PDDEI HR MA L L+QK Sbjct: 123 FYTGKRSPYPYSTRWLDEFDDTALADKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQK 182 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 HI QRDL LVD++ +L+ G + Q+ +L +Y++Q G+ AF+ E+A+R PQ + Sbjct: 183 HIHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGD 242 Query: 242 KLMTIADRLREEGAMQGKH----------------EEALRIAQEMLDRGLDRELVMMVTR 285 LMTIA +L ++G +G E L+IA+ ML +DR VM +T Sbjct: 243 ALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNCIDRNTVMKMTG 302 Query: 286 LSPDDLIAQSH 296 L+ DDL H Sbjct: 303 LTEDDLAQIRH 313 >UniRef50_Q1CC76 Transposase n=27 Tax=Gammaproteobacteria RepID=Q1CC76_YERPN Length = 313 Score = 271 bits (692), Expect = 2e-71, Method: Composition-based stats. Identities = 158/311 (50%), Positives = 219/311 (70%), Gaps = 16/311 (5%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 ++T TPHDA F+ FL P+ ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+ Sbjct: 3 KKNSTPTPHDATFRQFLTQPEIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 SD+L+S+ T EG GY++V+IEHQS P++ MAFR++RY+IAAMQ HL+AG+ +LPLV+P+L Sbjct: 63 SDVLYSLDTVEGEGYVHVLIEHQSSPDKHMAFRLIRYAIAAMQRHLEAGHAKLPLVIPVL 122 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 FY G RSPYPYS WLDEF +P +A K+YS AFPLVD+TV+PDD+IM+HR MA L L+QK Sbjct: 123 FYVGKRSPYPYSTRWLDEFDDPELAHKLYSGAFPLVDVTVIPDDDIMEHRSMAALTLLQK 182 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 HI QRD+ L D++ +LL+ + Q+ AL +Y+LQ G++ AF+ E+A+R PQ + Sbjct: 183 HIHQRDIATLTDRLATLLMADYLSSPQVMALIHYLLQAGESADSEAFVRELAQRVPQHGD 242 Query: 242 KLMTIADRLREEGAMQGKHEEA----------------LRIAQEMLDRGLDRELVMMVTR 285 LMTIA +L ++G +G+ E L +A+ +L G+ E V T Sbjct: 243 ALMTIAQQLEQKGIEKGRMEGRTEGIQLGEQRGIEKGKLEVARSLLKMGMPIESVQEATG 302 Query: 286 LSPDDLIAQSH 296 LS DDL H Sbjct: 303 LSEDDLAQIRH 313 >UniRef50_Q4LC22 TpnA protein n=9 Tax=Enterobacteriaceae RepID=Q4LC22_SODGL Length = 308 Score = 264 bits (675), Expect = 2e-69, Method: Composition-based stats. Identities = 158/306 (51%), Positives = 205/306 (66%), Gaps = 12/306 (3%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M+ T TPHDAVF+ FL TA+DF DI LP ++ LCD TLK E SFID D++ Y Sbjct: 1 MSKKFTPTPHDAVFRQFLHDKATAQDFFDIWLPDDIKALCDWETLKPESGSFIDPDMKPY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SV GY+Y +IEHQS P++LMA+R+MRYS+AAMQ HL+AG+ +LPLV P+ Sbjct: 61 QSDILYSVNANGVDGYVYCLIEHQSTPDKLMAWRLMRYSMAAMQRHLEAGHDKLPLVFPV 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFY G +SP+PYS WLD F P IA KIYS F L+D+T + DD IMQHR+MALLELIQ Sbjct: 121 LFYCGEKSPHPYSTNWLDCFERPDIAAKIYSQPFRLMDVTTLDDDAIMQHRRMALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR+RD+ L+D IV LL D Q+ + NY++Q G+A R FI EIA+RA + + Sbjct: 181 KHIRRRDMTELLDSIVKLLSYNYYTDTQVVTMMNYLVQEGNAASPRTFITEIAKRAEKHE 240 Query: 241 EKLMTIAD------------RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 E LMTIA+ R+EG QG+H A++IA++ML RG+ R+ V T LS Sbjct: 241 EALMTIAEALKQEGYQIGRDDGRQEGIQQGEHAAAMKIARQMLSRGIARDAVKACTGLSD 300 Query: 289 DDLIAQ 294 + L Sbjct: 301 NALDNL 306 >UniRef50_B7UFQ5 Predicted protein n=14 Tax=Enterobacteriaceae RepID=B7UFQ5_ECO27 Length = 315 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 194/311 (62%), Positives = 242/311 (77%), Gaps = 20/311 (6%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT STTS+PHDAVFK+F+ P+TARDF++IHLP PLRKLC+L TL+LEP SFI++ LR Y Sbjct: 1 MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAY 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSD+LWSV+T EG GYIY VIEHQS E+ MAFR+MRY+ AAMQ HLD GY +PLV+P+ Sbjct: 61 YSDVLWSVETSEGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPL 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG SPYPYSL WLDEF +P +AR++Y+ AFPLVDIT+VPDDEIMQHR++ALLELIQ Sbjct: 121 LFYHGEASPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR RDL+G+VD+I +LLV G TND QL+ LFNY+LQ GD RF FI EIAER+P +K Sbjct: 181 KHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQK 240 Query: 241 EKLMTIADRLRE--------------------EGAMQGKHEEALRIAQEMLDRGLDRELV 280 E LMTIA+RLR+ EG +G HE+A++IA ML++G +RE+V Sbjct: 241 EILMTIAERLRQEGHQIGWQEGKIEGWQEGKLEGLQEGMHEQAIKIALRMLEQGFEREIV 300 Query: 281 MMVTRLSPDDL 291 + T+L+ D+ Sbjct: 301 LAATQLTDADI 311 >UniRef50_Q7N1D0 Transposase, ISNCY family n=36 Tax=root RepID=Q7N1D0_PHOLL Length = 335 Score = 254 bits (649), Expect = 2e-66, Method: Composition-based stats. Identities = 157/335 (46%), Positives = 216/335 (64%), Gaps = 39/335 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M T TPHDA+FK FL H DTARDF++IHLPA LR +CDL TL+LE SFI+++LR + Sbjct: 1 MKRKNTPTPHDAIFKKFLSHIDTARDFLEIHLPATLRAVCDLDTLRLESGSFIEDNLRVH 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 YSD+L+S+KT +G Y+Y VIEHQS P+++MAFR+MRYSI+AMQ HL+ G+K+LPLV+P+ Sbjct: 61 YSDILYSLKTTQGESYVYCVIEHQSSPDKMMAFRLMRYSISAMQWHLEQGHKKLPLVIPV 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG PYP+S W D F A+A +IYSSAFPLVD+TV+PDDEI+ H+++ALLE++Q Sbjct: 121 LFYHGKIRPYPWSTNWFDCFDASALAEEIYSSAFPLVDVTVIPDDEILTHKRVALLEIVQ 180 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRD+ L ++ L LK++ NY+L GD FI ++AE+ P+ + Sbjct: 181 KHIRQRDMAELQQELTMLFAYDYYTYELLKSMLNYILLVGDTADPEGFIRQLAEQFPKYE 240 Query: 241 EKLMTIADRLREEGAMQGKHEE-------------------------------------- 262 E LMTIA +L+ +G +G E Sbjct: 241 EVLMTIAQKLQHKGHQEGLKEGLQKCQDAREEGLQEGLQKGEKKGEKKGEKKGEEKGEKR 300 Query: 263 -ALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 +L+IA+ ++D G+DRE +M T LS ++L H Sbjct: 301 ASLKIARALMDNGIDRETIMKSTGLSQNELEQIHH 335 >UniRef50_D2U4R8 Transposase (Fragment) n=4 Tax=Enterobacteriaceae RepID=D2U4R8_9ENTR Length = 308 Score = 253 bits (645), Expect = 6e-66, Method: Composition-based stats. Identities = 148/297 (49%), Positives = 206/297 (69%), Gaps = 4/297 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT T TPHDAVFK FL +TA+DF DI LP ++ LCDL +LK+E SFID +++ Y Sbjct: 7 MTKKFTPTPHDAVFKQFLSEKETAKDFFDIWLPDEIKALCDLDSLKMESGSFIDSEMKNY 66 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SV T +G GYIYV+IEHQS P++L+A+R+MRYS+AAMQ HL+ G K+LPLV P+ Sbjct: 67 QSDILYSVSTTKGSGYIYVLIEHQSTPDKLIAWRLMRYSLAAMQKHLEDGNKQLPLVFPI 126 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFY G +SP+PYS WLD F + +A IY++ F L D+T + D EIMQH+++ALLEL+Q Sbjct: 127 LFYCGEQSPHPYSTHWLDCFEDRKLAESIYNNPFKLADVTTLDDGEIMQHKRIALLELLQ 186 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIR+RD+ L+D IV LL D Q+ +FNY++Q G+AQR FI IA++A + + Sbjct: 187 KHIRRRDMTELLDSIVKLLSYNYYTDNQVITMFNYLIQEGNAQRPMEFITNIAKQAEKHE 246 Query: 241 EKLMTIADRLREEGAMQGKHEE----ALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 LMTIA ++ E G +G + + +A++ L G+DR V + T LS ++L Sbjct: 247 GALMTIAQQIEEIGIQKGIQQGIQKTKIELAKQFLANGVDRNTVKISTGLSDEELNK 303 >UniRef50_C2LLN3 Transposase n=37 Tax=Enterobacteriaceae RepID=C2LLN3_PROMI Length = 319 Score = 246 bits (627), Expect = 8e-64, Method: Composition-based stats. Identities = 130/319 (40%), Positives = 202/319 (63%), Gaps = 23/319 (7%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT +T HDA+FK FL HP+ ARDF +HLPA + LCDL+TL+LEP SF++ LRQ Sbjct: 1 MTKNTQQPVHDALFKQFLTHPENARDFFSVHLPANILPLCDLSTLRLEPASFVERRLRQL 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVL 118 +SD+L+SV+ EG GYIY +IEHQSKP+ LM FR+M Y+++A+ +HL K LPLV+ Sbjct: 61 HSDVLYSVQMTEGEGYIYCLIEHQSKPDRLMGFRLMHYAMSAIAHHLKKSPADKTLPLVV 120 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 P LFY G PYPYS+ WLD FA+PA+A+++Y+ +FPLVD++V+ D+EI+ H+ +ALLEL Sbjct: 121 PFLFYQGSVCPYPYSMNWLDGFADPALAQQLYTRSFPLVDLSVLSDEEILTHKGIALLEL 180 Query: 179 IQKHIRQRD-LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 +QKHIR RD L+ ++ I ++ + + Q++++ Y+ G F ++ +P Sbjct: 181 VQKHIRTRDGLMAVLPIIAQIINSQHNTVDQVRSVIEYIAYQGYILDESRFFSQLIALSP 240 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEA--------------------LRIAQEMLDRGLDR 277 + K L TIA++L ++G +G + ++A+ +L +G+D Sbjct: 241 EYKTMLTTIAEQLEQKGIEKGIEKGIEKGIEKGIEKGIEKGIGLGVEKVARSLLQQGVDL 300 Query: 278 ELVMMVTRLSPDDLIAQSH 296 ++M T L+ + + + H Sbjct: 301 NIIMQCTGLTREKIESLKH 319 >UniRef50_Q7B1W7 YadD homologue n=11 Tax=root RepID=Q7B1W7_ECOLX Length = 313 Score = 241 bits (614), Expect = 3e-62, Method: Composition-based stats. Identities = 152/312 (48%), Positives = 204/312 (65%), Gaps = 20/312 (6%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +TT TPHDA F+SFL +PD ARDF+++HLPA R+LCDL+TLKLEP +F++ DL QY Sbjct: 5 KNTTTPTPHDAAFRSFLANPDVARDFLELHLPAEYRQLCDLSTLKLEPATFVEPDLHQYA 64 Query: 62 SDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+LWSVKT G GY+Y +IEHQS M FRM+RYS+AAMQ HL+ +K LPLV+P+ Sbjct: 65 SDILWSVKTTGGEDGYVYTLIEHQSTENLYMPFRMLRYSVAAMQRHLEQ-HKTLPLVIPV 123 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG RSPYPYS+ WLD F PA+A KIY+ FPLVDITVV D+EIM HR+MA L L+ Sbjct: 124 LFYHGERSPYPYSMNWLDCFENPALAAKIYTKPFPLVDITVVDDNEIMNHRRMAALTLLM 183 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 KHIRQRD+L +D +V L + D + + L G F+ +A+R PQ + Sbjct: 184 KHIRQRDMLMCLDNLVRALQ--DIQDEEQITVLFNYLLNGSEHVTVEFLQTLAQRLPQHE 241 Query: 241 EKLMTIADRLREEGAMQGKH----------------EEALRIAQEMLDRGLDRELVMMVT 284 + +MT+A+RL++EG QG ++A IA+E+ + G+ + +T Sbjct: 242 DSIMTLAERLKQEGIQQGIQQGIQQGIQQGVQQGALQKAREIARELRNAGMPAAQICQLT 301 Query: 285 RLSPDDLIAQSH 296 LS +L +H Sbjct: 302 GLSEAELKNITH 313 >UniRef50_P31665 Uncharacterized protein yadD n=59 Tax=Enterobacteriaceae RepID=YADD_ECOLI Length = 300 Score = 238 bits (608), Expect = 1e-61, Method: Composition-based stats. Identities = 148/293 (50%), Positives = 207/293 (70%), Gaps = 3/293 (1%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 ++TPHDAVFK FL H +TARDF++IHLP LR+LCDL TL LE SFI+E L+ + +D+ Sbjct: 4 PSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGSFIEESLKGHSTDV 63 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 L+SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY Sbjct: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 G +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHIR Sbjct: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL- 243 QRDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G + +R + + Sbjct: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGESMMT 243 Query: 244 --MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++ E+G QG+ E + AQ +L +G+ RE V + L ++ Sbjct: 244 LAQWFEEKGIEKGIQQGRQEVSQEFAQRLLSKGMSREDVAEMANLPLAEIDKV 296 >UniRef50_B6XDZ7 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XDZ7_9ENTR Length = 327 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 121/323 (37%), Positives = 182/323 (56%), Gaps = 28/323 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT+ + PHD+ FK F+ D ARDF +I+LP ++ LC+L TLKL SFID+ LR Sbjct: 5 MTMQLIARPHDSTFKGFMSKVDNARDFFEIYLPNRIKPLCNLDTLKLASASFIDKTLRSR 64 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +SD+L+SV+T +G GY Y+++EHQS P++LM +R+M Y+ AM HL G LPLV+P+ Sbjct: 65 FSDMLYSVQTLKGKGYFYLLVEHQSTPDKLMGWRLMHYAFCAMNQHLQQGNNALPLVVPI 124 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG +SPYPYS W D F +A +Y + PLVD+TV DDEI+ HRK+A +EL+ Sbjct: 125 LFYHGKQSPYPYSQVWTDCFPWADLAYDLYCNPLPLVDVTVASDDEIVNHRKVAAMELVL 184 Query: 181 KHIRQRDLLGLV-DQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH RD L ++ +++ ++ + + + NY+ D + + + E+ Sbjct: 185 KHSTLRDDLIVLSERLAQVISENENHRDDVILIINYLFSVMDTPTYTQIVKTLIEQTEGY 244 Query: 240 KEKLMTIADRLREEGAMQGKHEEALR-------------------IAQEM--------LD 272 +E +MTIADRLR EG +G + IA++ LD Sbjct: 245 QETVMTIADRLRNEGLEKGLIKGREEGKAEGKAEGREEARQEEQAIARQRTYTQVITSLD 304 Query: 273 RGLDRELVMMVTRLSPDDLIAQS 295 GL +++ +T L ++ A Sbjct: 305 LGLSIDIISKITGLPHSEIQAMR 327 >UniRef50_C8QFJ7 Putative transposase YhgA family protein n=4 Tax=Pantoea sp. At-9b RepID=C8QFJ7_9ENTR Length = 301 Score = 235 bits (599), Expect = 1e-60, Method: Composition-based stats. Identities = 131/301 (43%), Positives = 191/301 (63%), Gaps = 9/301 (2%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S S PHDA+FK FL H AR F++IHLP +R+ CDL L++ P +FI+ DL YS Sbjct: 1 MSVVSAPHDALFKKFLSHLPVARQFLEIHLPQSIREHCDLDKLQVVPTTFIERDLSALYS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+L S+KT +G GYIY +IEHQS P++ M RMMRY++AA+Q HLD G+ ++PLV+P+LF Sbjct: 61 DVLLSMKTDDGEGYIYALIEHQSTPDKHMTLRMMRYTLAAIQRHLDEGHHDVPLVIPILF 120 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 Y G SPYPYS+ WL+ F P +A++I+ +FPLVD+TV+PD+EIM HR +A LE+ K Sbjct: 121 YQGKTSPYPYSMNWLESFRNPVLAKQIFCHSFPLVDVTVIPDEEIMAHRDVARLEMAHKI 180 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 IR RD+L +D + +LL + ND + Y+L+ G+ + + + PQ + K Sbjct: 181 IRLRDILENIDPMATLL-ALDYNDDLSIDVVFYLLRYGNTDDREKIVKILIQAKPQLEGK 239 Query: 243 LMTIADRLREEGAMQGKHEEA--------LRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +MTI ++ R+E +G+ E L +AQ ML D +M +T LS +L Sbjct: 240 IMTIEEQWRQESRQEGRQEGRKEGRQEVMLELAQRMLREQFDLNTIMKLTGLSEGELRQL 299 Query: 295 S 295 + Sbjct: 300 N 300 >UniRef50_C2DMU4 Possible transposase n=6 Tax=Enterobacteriaceae RepID=C2DMU4_ECOLX Length = 314 Score = 234 bits (596), Expect = 3e-60, Method: Composition-based stats. Identities = 149/309 (48%), Positives = 206/309 (66%), Gaps = 19/309 (6%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 ++TPHDAVFK FL H +TARDF+DIHLPA LR+LCDL TL LE SFI+E L+ + +D+ Sbjct: 4 PSTTPHDAVFKQFLMHAETARDFLDIHLPAELRELCDLDTLHLESGSFIEESLKGHSTDV 63 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 L+SV+ Q GY++VVIEHQSKP++ MAFRMMRYSIAAM HL+A + +LPLV+P+LFY Sbjct: 64 LYSVQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIAAMHRHLEADHDKLPLVVPILFYQ 123 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 G +PYP S+CW D F P +AR++Y+S FPLVDIT+ PDDEIMQHR++A+LEL+QKHIR Sbjct: 124 GEATPYPLSMCWFDMFYSPELARRVYNSPFPLVDITITPDDEIMQHRRIAILELLQKHIR 183 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG-------------- 230 QRDL+ L++Q+V+L+ G T+ QL A+ NY+LQ G ++ F G Sbjct: 184 QRDLMLLLEQLVTLIDEGYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKSMMT 243 Query: 231 -----EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 E ++ + ++ E+G QG+ E + A +L +G+ RE V + Sbjct: 244 LAQWFEEKGIEKGIEKGIEKGMEKGIEKGIQQGRQEVSQEFALRLLSKGMPREDVAEMAN 303 Query: 286 LSPDDLIAQ 294 L ++ Sbjct: 304 LPLAEIDKL 312 >UniRef50_D0KLJ7 Putative transposase YhgA family protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLJ7_PECWW Length = 288 Score = 233 bits (593), Expect = 7e-60, Method: Composition-based stats. Identities = 130/287 (45%), Positives = 172/287 (59%), Gaps = 14/287 (4%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA+FK FL ARDF+ IHLP +R+ CD TL+LE SFIDE LR SD+L+S+ Sbjct: 4 HDAIFKQFLSDIAVARDFLTIHLPDSIRERCDFNTLQLESASFIDEKLRARISDVLYSLH 63 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 T G GYIY VIEHQS+PE+ MAFR++RY +AAMQ HLD G+ LPLV+P+LFYHG P Sbjct: 64 TSVGKGYIYCVIEHQSRPEKQMAFRLLRYCLAAMQQHLDQGHDRLPLVVPLLFYHGRSRP 123 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLL 189 YPYSL WLD FA P +A+ +Y FPLVD+TV+PDDEI HR+MALLEL+QKHIR RD+L Sbjct: 124 YPYSLRWLDSFAAPVLAQTLYEQPFPLVDLTVMPDDEIRTHRRMALLELVQKHIRTRDML 183 Query: 190 GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER--APQEKEKLMTIA 247 L +I L + + ++ + + + Sbjct: 184 ELAREIGLLFERWAA------------PLSIGQEDIMTIAEQLKKMGFDEGIQRGIQQGL 231 Query: 248 DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + E+G QG A +IA+ +L G+D+ V T+L ++L Sbjct: 232 AQGLEQGIEQGMKNSARQIARHLLLTGMDKNSVQQATQLETEELEQL 278 >UniRef50_D1P284 Transposase, ISNCY family n=10 Tax=Enterobacteriaceae RepID=D1P284_9ENTR Length = 322 Score = 232 bits (592), Expect = 8e-60, Method: Composition-based stats. Identities = 118/322 (36%), Positives = 179/322 (55%), Gaps = 27/322 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + PHD+ FK F+ D ARDF ++HLP ++ LC+ TLKL SF+D+ LR Sbjct: 1 MATQSIVAPHDSTFKGFMSKVDNARDFFEVHLPNRIKHLCNFDTLKLASASFVDKTLRSR 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +SD+L+SV+T +G GY Y ++EHQS P++LM +R+M Y+ AM HL G++ LPLV+P+ Sbjct: 61 FSDMLYSVQTLKGKGYFYFLVEHQSSPDKLMGWRLMHYAFCAMNQHLQQGHQSLPLVVPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG +SPYPYS W D F +A +Y + PLVD+TV DDE+M HRK+A +EL+ Sbjct: 121 LFYHGNQSPYPYSQSWTDCFQWSDLAHDLYCNPLPLVDVTVACDDELMNHRKVAAMELVF 180 Query: 181 KHIRQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH R D+ GL +++ +L + + + NY+ D + + + ++ + Sbjct: 181 KHASLRGDVFGLSERLAQVLNNNQNHQDDVILIINYLFSVMDTPAYTHIVKTLVDQTEKH 240 Query: 240 KEKLMTIADRLREEGAMQGKHEEALR-------------------IAQEM-------LDR 273 +E +M IA RLR EG +G + +A + L Sbjct: 241 QETVMNIAQRLRNEGMEKGMEKGRKEERMISQQKLANERQHYQQQMALNLQQQAIMSLKL 300 Query: 274 GLDRELVMMVTRLSPDDLIAQS 295 GL +++ +T LSP D+ A Sbjct: 301 GLSVDIISQITGLSPSDIHALR 322 >UniRef50_C0Q5B1 Ytl2 n=4 Tax=Enterobacteriaceae RepID=C0Q5B1_SALPC Length = 316 Score = 227 bits (577), Expect = 5e-58, Method: Composition-based stats. Identities = 131/316 (41%), Positives = 190/316 (60%), Gaps = 20/316 (6%) Query: 1 MTISTTSTP--HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 M HD +FK FLR PDTARDF+ +HLPA +R L TLKLEP SF+D+ LR Sbjct: 1 MDNEKGHNRPGHDGLFKLFLREPDTARDFLAVHLPADIRAQVRLDTLKLEPGSFVDQKLR 60 Query: 59 QYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 + +SD+L+SV+T EG GYIY ++EHQS + +MA+RMMRYS+A M HL G LP+V Sbjct: 61 ELHSDVLYSVETAEGHAGYIYCLVEHQSTADRMMAWRMMRYSMAVMDAHLKKGNGTLPVV 120 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 +P+LFY G PYPYS W+D F PA+AR++YS +PLVD++V+ D ++ HR+MALLE Sbjct: 121 VPLLFYQGMVRPYPYSTDWMDCFDVPALAREVYSRPWPLVDVSVMEDCDLQSHRRMALLE 180 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR-FRAFIGEIAERA 236 L+Q+ IR RD L+ +V L+ Q++A+ Y++ G F+ E+A Sbjct: 181 LVQRDIRHRDAASLLRDVVQLIRLAGNTRAQVEAVLCYIIYNGMTSESITPFLYELAGEI 240 Query: 237 PQEKEKLM-TIADRLR---------------EEGAMQGKHEEALRIAQEMLDRGLDRELV 280 P+ KE +M TIA +L+ + +++ + + L A +LD G+ E+V Sbjct: 241 PEYKELIMGTIAQQLKEEGIQQGIQQGIQQERQASLEREQKTLLETAYALLDNGVSLEVV 300 Query: 281 MMVTRLSPDDLIAQSH 296 + T L+ + L H Sbjct: 301 IKSTGLNRETLEQPRH 316 >UniRef50_C2LF55 Transposase n=3 Tax=Enterobacteriaceae RepID=C2LF55_PROMI Length = 330 Score = 217 bits (552), Expect = 4e-55, Method: Composition-based stats. Identities = 115/325 (35%), Positives = 176/325 (54%), Gaps = 33/325 (10%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + HDA FK F+ + A+DF IHL L+ CD +TLKL+ +SFID LR Sbjct: 1 MNKPLLISSHDAAFKRFMMNISNAKDFFFIHLSDELKSYCDFSTLKLQNSSFIDIKLRSR 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 SD+L+SVKT++G IY +IEHQS+P++++A+RMM Y+ M HL GY LPLV+P+ Sbjct: 61 MSDILYSVKTKKGNISIYFLIEHQSRPDKMIAWRMMHYAFCTMNQHLQQGYTSLPLVVPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 LFYHG R PYP+S+ WLD F +A ++Y + F L+D+ + D+ ++ HRK A++E+ Sbjct: 121 LFYHGKRKPYPFSVNWLDCFPLSTLANQLYLNNFALIDLNSIDDEILLTHRKAAVMEIAM 180 Query: 181 KHIRQRDLLGLVDQIVSL-LVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 KH+ D L + ++S + N +D A+ Y+ DA F + I +IAE+ Sbjct: 181 KHVNSCDDLDKLAMLLSKAINQKNCSDEDTIAVVQYLFSIMDAADFESIINKIAEQVDNH 240 Query: 240 KEKLMTIADRLREEGAMQGKHEEA--------------------------------LRIA 267 +E +M IA RL +G GK E +++A Sbjct: 241 RETIMNIAWRLENKGFKLGKMEGIEIGKNEGIEIGKNEGIEIGKNEGIEIGKKIVQIQLA 300 Query: 268 QEMLDRGLDRELVMMVTRLSPDDLI 292 + +L ++ E + +T LS +L Sbjct: 301 KNLLKENVELEFIERITGLSIQELK 325 >UniRef50_A8PLK1 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PLK1_9COXI Length = 308 Score = 209 bits (531), Expect = 1e-52, Method: Composition-based stats. Identities = 109/301 (36%), Positives = 174/301 (57%), Gaps = 14/301 (4%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 HDA+FK+F + A FI I+LP +++ CD +TLK+EP SF+D DL+Q++SD+L Sbjct: 5 IHNAHDAIFKTFFTDIEVATHFITIYLPKHMKQACDFSTLKIEPGSFVDADLKQHHSDIL 64 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +S+K GY+Y+ +EHQS EELM FRM RY +A MQ HL+ G K+LPLV+ MLFYHG Sbjct: 65 YSLKVNGMHGYVYLNLEHQSTAEELMPFRMHRYKVAIMQQHLNQGNKKLPLVISMLFYHG 124 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + YPY L +D + A+ + L+D+ V+PD+EI +H+++A LE++QKHI Sbjct: 125 -KGQYPYCLKLIDCVEDTPFAKAHFFDDPLLIDLNVLPDEEIYRHKQLAFLEIVQKHIFT 183 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 RDL + D IV L+ + L Y+L G+ I ++ +E +M Sbjct: 184 RDLEDIADHIVRLVKQVKPDHDLFNQLVYYMLVKGETANVNQVIEKLKT-IEDYEEDIMN 242 Query: 246 IADRLREEGAMQGKHEEALR------------IAQEMLDRGLDRELVMMVTRLSPDDLIA 293 A +L+++G +G +E IA++++ G + + +T LS +++++ Sbjct: 243 AAQQLKQQGRQEGLYEGRQEGLQKGEYRKAITIAKKLIAEGRSIQYIQDLTNLSENEVLS 302 Query: 294 Q 294 Sbjct: 303 L 303 >UniRef50_Q3C0L1 TpnA protein n=16 Tax=Enterobacteriaceae RepID=Q3C0L1_SODGL Length = 277 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 110/270 (40%), Positives = 170/270 (62%), Gaps = 20/270 (7%) Query: 42 LTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 L+TL + SFI++DL SD+L+S+K+ G YIY +IEHQS PE +MAFR++RY++ Sbjct: 3 LSTLVMVSGSFIEDDLCSQCSDMLYSLKSTLGDAYIYCLIEHQSCPEPMMAFRLLRYAVT 62 Query: 102 AMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITV 161 AM HL+ K+LP+V+P+LFYHG SPYPY+ WLD FA+ +A +Y AFPLVD+T Sbjct: 63 AMHRHLEQENKQLPVVIPILFYHGSTSPYPYTTHWLDCFADRKLAESVYEKAFPLVDVTA 122 Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 + D+EI++HR+MAL+E++QKHIR R++L L ++ +LL + Q K L Y++ G+ Sbjct: 123 MEDEEILRHRRMALMEIVQKHIRTRNMLELAGELANLLEQWKFSKEQCKTLVYYLVLAGN 182 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKH--------------------E 261 F+ +A+ AP +E +MTIA++L +G +G + Sbjct: 183 TTDGEGFLRTLAQPAPSYREDMMTIAEQLEAKGMQKGIQLGEKKGIERGLQEGIQLGKKQ 242 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 L+IA++ L G++R++V M T L+ D+ Sbjct: 243 ATLKIARQFLVNGVERDIVKMSTGLTDRDI 272 >UniRef50_C3M8C1 Putative transposase n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8C1_HAMD5 Length = 308 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 123/307 (40%), Positives = 183/307 (59%), Gaps = 19/307 (6%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 STPHD +FK F AR+F +IHLP+ + K+ +LK+ P SFID+ L+Q +SD+ Sbjct: 2 KISTPHDRLFKKFFGDIALARNFFEIHLPSSILKIVSFPSLKMVPGSFIDKSLKQSHSDM 61 Query: 65 LWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 ++S +T G GY+Y V+EHQS +++MAFRM +YS+A MQ HLD G+ LPLVLP+LFY Sbjct: 62 VYSFETSTGKEGYLYCVVEHQSTDDKMMAFRMKKYSLAVMQQHLDQGHDTLPLVLPVLFY 121 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 HG +SPYP+S+ W D F E +AR + S FPLVD+T++P++EIM+H ++ LE+ QK + Sbjct: 122 HGQKSPYPHSMDWRDCFCEKELARILDSQPFPLVDVTMLPEEEIMKHGIISWLEMSQKMV 181 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 RD++ + ++ L ND K+L Y+ Q G+ F ++ ++E + Sbjct: 182 HTRDMMEIAPYLIRLDKLFPLNDELFKSLLYYLFQEGETADRMLFFDALSSTT--QRENV 239 Query: 244 MTIADRLR----------------EEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 MTIA+ L+ EEG +G+ E IA+ +L+ G + V M T LS Sbjct: 240 MTIAEELKREGREEGREEGREEGREEGREEGREEGREEIAKNLLNNGFSFKQVKMYTGLS 299 Query: 288 PDDLIAQ 294 D L Sbjct: 300 EDSLNKL 306 >UniRef50_Q2J904 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J904_FRASC Length = 323 Score = 197 bits (500), Expect = 4e-49, Method: Composition-based stats. Identities = 81/283 (28%), Positives = 133/283 (46%), Gaps = 15/283 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S+ +PHDAVF+ L P A + LPA L DL L + P S +D LR ++ Sbjct: 1 MSSPPSPHDAVFRRVLGVPSNAASQLRATLPAALVARLDLDRLAIVPGSLVDATLRWRHT 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--ELPLVLPM 120 DLL++ +IYV++EHQS + LMAFRM+RY + +L +K LP V+P+ Sbjct: 61 DLLFTAPLDGHEAFIYVLVEHQSSSDPLMAFRMLRYVVRVWDRYLADHHKAARLPAVVPL 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIA----RKIYSSAFPLVDITVVPDDEIMQH---RKM 173 + +H + + P +A + F L D+ V + E+ + + Sbjct: 121 VVHHNEHAWVAPTQVLDLVDLAPDLAGAWREHLPRFQFLLDDLVRVDERELRERPLTHSV 180 Query: 174 ALLELIQKHIR-----QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 L L+ K + +DL VD++ ++L G + L Y+ G+A Sbjct: 181 RLTLLLLKIVPGNPRLAQDLRPWVDELRAVL-DGPDGREEFATLLRYIELVGEADARDEL 239 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEML 271 IA P+ ++ MTIA+ LR EG ++G+ E + ++L Sbjct: 240 HDLIAGLGPEAEDAYMTIAEMLRAEGRVEGRVEGRVESLLQLL 282 >UniRef50_B7MZS6 Putative uncharacterized protein n=3 Tax=Escherichia coli ED1a RepID=B7MZS6_ECO81 Length = 319 Score = 197 bits (500), Expect = 4e-49, Method: Composition-based stats. Identities = 106/303 (34%), Positives = 165/303 (54%), Gaps = 14/303 (4%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 ++ TS HDA F+ L+ P ARDF++ L + C+L T++LEP +F+ E LRQ Sbjct: 4 KVNKTSLIHDAAFRKTLKDPAAARDFLEQVLTPYQKSRCNLDTIELEPTTFVAESLRQSA 63 Query: 62 SDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 D+L S+KT +G GYIY +IEHQS P++ + RMMRY +A M+ H++ +K P+V+P+ Sbjct: 64 CDVLLSMKTNDGKDGYIYTLIEHQSSPDKFIPLRMMRYILAVMEQHIEE-HKCAPVVIPV 122 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIY--SSAFPLVDITVVPDDEIMQHRKMALLEL 178 LFYHG + PYPY + W+D +PA R+IY F LVD++ + DDEI + +MA L Sbjct: 123 LFYHGAKRPYPYPMNWVDCLDDPAYGREIYGEQKPFSLVDVSTLTDDEIEHYHRMAALMF 182 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 K D++ L+ + ++ L + L + Y+L+ F ++ P Sbjct: 183 TMKSGTSGDVIELIGKSIT-LTDKYGSSVHLNTVLTYLLELYQM-DFAELSEAVSTHYPS 240 Query: 239 EKEKLMTIADRLREE--------GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 K +MTIA++L E G +G+ EE R+ M RG E + L+ + Sbjct: 241 HKGVIMTIAEQLEERGLKKGLEKGLEKGRAEERSRLVLMMRQRGKSLEEIKDFLDLTDEQ 300 Query: 291 LIA 293 L+ Sbjct: 301 LLQ 303 >UniRef50_A8PQ66 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PQ66_9COXI Length = 307 Score = 192 bits (488), Expect = 9e-48, Method: Composition-based stats. Identities = 93/304 (30%), Positives = 159/304 (52%), Gaps = 11/304 (3%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + T HD +FK L A F+ L + + KL ++ TL+L SF+ + R+ Sbjct: 1 MAM-TIHQAHDKLFKYSLSKKTIAISFLKSRLSSEIYKLINIETLQLTDKSFVLPEFREI 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPE-ELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 +SD+++ + E GYI+ ++EH+S ELMAFR ++Y+I+AM + G K+LP+VLP Sbjct: 60 HSDIVYQCQINEKKGYIFFILEHESTAHVELMAFRQLQYTISAMDQYCRQGNKKLPIVLP 119 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 + YHG +SPYP+S D F IAR+I F L+D+TV+ D+E+ + L+E++ Sbjct: 120 ICVYHGIKSPYPHSQDVYDNFENLQIARQIVFKPFTLIDLTVLSDEELAKDGPAYLMEML 179 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF-----RAFIGEIAE 234 KH R ++ L ++ + + + + + F + Q + ++ Sbjct: 180 LKHSRAKNFLSILHRRIEFIQSLLNRFGKEYRWFVVKYMINETQDESPNAVEQLVQTLST 239 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALR----IAQEMLDRGLDRELVMMVTRLSPDD 290 P+EK +MT A +LR+EG QG + IA+ +L G+ + V +T LS + Sbjct: 240 AFPEEKNTMMTFAQQLRQEGLEQGLEQGRYEEAIAIAKNLLGDGMSFKAVQRLTGLSEKE 299 Query: 291 LIAQ 294 ++ Sbjct: 300 VMNL 303 >UniRef50_A6G4N5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4N5_9DELT Length = 343 Score = 188 bits (477), Expect = 2e-46, Method: Composition-based stats. Identities = 74/304 (24%), Positives = 129/304 (42%), Gaps = 14/304 (4%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T+ +PHDA+FKS + P A + L P+ D +TL+ EP S+IDE L + +SDL Sbjct: 4 TSPSPHDALFKSAFKDPKDAAKLLQNVLDEPIAHAIDWSTLRPEPGSYIDETLAERHSDL 63 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLPMLFY 123 L+S Y+Y++IEHQS + M RM+ Y H A ++LP +LP++ Sbjct: 64 LFSASIGGEDAYVYLLIEHQSTVDRDMPLRMLVYLTRVWLRHRSAHPGRDLPPILPVVVS 123 Query: 124 HGCRSPYPY----SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 H SL P + I + D+T + D ++ + L+ Sbjct: 124 HAPGGWTAPVTFESLVRPGPTDLPELTPHIPRFELVINDLTHLSDQQLREWSMRGFATLV 183 Query: 180 QKHIRQR-------DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 +R R D + + + + + +F+Y+ + + F ++ Sbjct: 184 LWILRTRHEIPELIDGVSTWRDMFREVFEAPDGVQAMTKIFHYIACIAQRVQVQEFHAKL 243 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 E PQ +E + T + L EEG +G + ++ L L+ +++ + DL Sbjct: 244 DEHVPQTREVMKTYYEELMEEGMAKGLAKGREEGREQSRIETLQETLIDLLS--AKFDLR 301 Query: 293 AQSH 296 H Sbjct: 302 ELEH 305 >UniRef50_C0AXL8 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXL8_9ENTR Length = 254 Score = 186 bits (472), Expect = 8e-46, Method: Composition-based stats. Identities = 98/239 (41%), Positives = 146/239 (61%), Gaps = 1/239 (0%) Query: 25 RDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQ 84 + F IHLP L+ CD +TL+L+ +SFID LR SD+L+ VKT+EG IY++IEHQ Sbjct: 6 KTFFFIHLPEELKSQCDFSTLQLQNSSFIDIKLRSRMSDILYLVKTKEGDVPIYLLIEHQ 65 Query: 85 SKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPA 144 S+P++++A+RMM Y+ M HL GYK LPLV+P+LFYHG + PYP+ + W++ F + Sbjct: 66 SRPDKMIAWRMMHYAFCTMNQHLQQGYKSLPLVVPILFYHGKKKPYPFPVNWMECFPLSS 125 Query: 145 IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSL-LVTGN 203 +A IYS+ F L+D+T + DD ++ H+K A++E+ KH+ L + ++S + N Sbjct: 126 LANHIYSNDFSLIDLTSIDDDILLTHKKAAVMEIAMKHVNSCHDLNKIAMLLSKAINQKN 185 Query: 204 TNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEE 262 D A+ Y+ DA F I +IAER +E +M IA RL +G G E Sbjct: 186 CRDEDTVAVVQYLFSIMDASDFEFIINKIAERVDNHRETIMNIAWRLENKGFKLGIDEG 244 >UniRef50_A6G0X2 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0X2_9DELT Length = 363 Score = 185 bits (468), Expect = 2e-45, Method: Composition-based stats. Identities = 79/300 (26%), Positives = 124/300 (41%), Gaps = 26/300 (8%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 S TS PHDA+F++ HP A + LP L L D + L+ N + L + +D Sbjct: 13 SVTSRPHDALFRATFEHPSHAGSLLRSALPRELAALIDWSRLRPAANELVSSSLGERRTD 72 Query: 64 LLWSVKTQ-----EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 LL+S + +G +Y+ IEHQS+ + M R++ Y + + H LP V Sbjct: 73 LLFSTALEGPGAGDGARVVYLHIEHQSRVDTTMPLRVLGYRVRIWERHRKRHGGALPPVF 132 Query: 119 PMLFYHGCRSPY-PYSLCWLDEFAEPAIAR---KIYSSAFPLVDITVVPDDEIM---QHR 171 ++ H + P SL L +A + + D+ D E+ H Sbjct: 133 CVVLSHAAKGWTGPRSLVELFPEPVRTLAPIAAHLPRCPLIVEDLGRRADAELRARHAHP 192 Query: 172 KMALLELIQKHIRQRD-----LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 AL + + R + LL DQI++LL + +R L L YV G F Sbjct: 193 LPALTLWLLRDARSPERLVHRLLDWRDQIIALL-DYDHGERDLAQLLRYVALVGSEMDFE 251 Query: 227 AFIGEIAERAPQEKEKLMTIADR--------LREEGAMQGKHEEALRIAQEMLDRGLDRE 278 F +A P+ + MTIA++ RE+G +G+ E L +E G + Sbjct: 252 EFHRFVAHHIPEVEAMTMTIAEQLCREALQRGREQGQREGQREGRLEGQREGRAVGFEEG 311 >UniRef50_Q52101 ORF n=1 Tax=Salmonella enterica subsp. enterica serovar Enteritidis RepID=Q52101_SALEN Length = 292 Score = 182 bits (461), Expect = 1e-44, Method: Composition-based stats. Identities = 116/282 (41%), Positives = 160/282 (56%), Gaps = 17/282 (6%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +TT TPHDA F+ FL PD ARDF+++HLPA LR +CDL+TLKLE SF+++DLRQY+ Sbjct: 3 KKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYF 62 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSI-AAMQNHLDAGYKELPLVLPM 120 SD+L+S+KT G I++ + S+ + F + AAMQ HL+AG+K+LPLV+P+ Sbjct: 63 SDVLYSLKTTAGDD-IFMSWLNTSQHLTNICFPPDTLCVGAAMQRHLEAGHKKLPLVIPV 121 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAF-PLVDITVVPDDEIMQHRKMALLELI 179 LFY G RSPYPYS WLDEF + A R+ LVD+TV+PDDEI HR MA L L+ Sbjct: 122 LFYTGKRSPYPYSTRWLDEFDDTAPGRQTLQQRLSRLVDVTVIPDDEIAGHRSMAALTLL 181 Query: 180 QKHIR-----QRDLLGLVDQI--VSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 ++I Q L G +S+ + GN A + R R+ Sbjct: 182 PENIFISGTWQNWLTGWRPFYGRISVFIAGNIAGTLYSAGRRNI-------RRRSLCTRT 234 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 Q + LMTIA +L ++G +G R ++ G Sbjct: 235 GTACAQHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEG 276 >UniRef50_A0LBL3 Putative uncharacterized protein n=6 Tax=Magnetococcus sp. MC-1 RepID=A0LBL3_MAGSM Length = 322 Score = 181 bits (459), Expect = 2e-44, Method: Composition-based stats. Identities = 69/276 (25%), Positives = 119/276 (43%), Gaps = 4/276 (1%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD K+ L PD + LP + +L L +FID + R++ + Sbjct: 1 MTKITQPHDRFLKALLSDPDKTGTLLRERLPKEVAELLSSEPPVLVDGTFIDGEFREHLT 60 Query: 63 DLLWSVKTQEGVG-YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D L+ VKTQEG YIY +IEH+S +E +AF+++RY + + L G ++LP ++P++ Sbjct: 61 DRLFKVKTQEGKAAYIYALIEHKSYADEWVAFQLLRYMVRIWERFLKEGQQKLPPIVPLV 120 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 YHG R + A+ + + +F + D+ + DD++ Q + + K Sbjct: 121 VYHGAREWTVPNQFSALLEADKGLLHHLLDFSFAVTDLGRIADDDLSQDTHLRAALMAMK 180 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 + Q V I + + K + Y++QT E P E E Sbjct: 181 YAFQG--AEGVVVIPQIGKGAQGDPEFAKLVLRYLIQTYRGMTMADVQAYAEEAFPGEAE 238 Query: 242 KLM-TIADRLREEGAMQGKHEEALRIAQEMLDRGLD 276 A + +G +G+ E QE G Sbjct: 239 HYASQFAREMMSKGRQEGRQEGRREGRQEGRQEGES 274 >UniRef50_A9EVM7 Similar to putative transposase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVM7_SORC5 Length = 336 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 81/280 (28%), Positives = 127/280 (45%), Gaps = 14/280 (5%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HDA+FK+ + A + LP L D L+L P SF+DE L++ SDLL+S Sbjct: 12 NAHDALFKAAFSQVEHAAGELRQALPPALSARIDFAALRLRPGSFVDEALKERQSDLLFS 71 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--GYKELPLVLPMLFYHG 125 E +Y++ EHQS E LMAFR++RY + ++HL G K LP +LP++ +H Sbjct: 72 ASMGEARVLLYLLFEHQSTVEPLMAFRLLRYMVRIWEHHLAEHPGSKRLPAILPVVLHHS 131 Query: 126 CRSPYPYS----LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL---LEL 178 + L LDE A + + F L DI+ D+ + A + Sbjct: 132 ETGWTAATSFEDLLDLDEGARAVMVDHVPRFRFVLDDISQEGDEALKARAMSAFSRLVLW 191 Query: 179 IQKHIRQRD----LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE-IA 233 +H R+ D LG +V+ + L+A++ Y+L T + + +A Sbjct: 192 CLRHGREPDELLRQLGKWLDLVNEVRRAPNGVEALRAIWRYILATNERDEADEVLQRLLA 251 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 KE++++ AD+L E G QG E ML + Sbjct: 252 AAGEPWKEEIVSAADQLMERGRQQGLREGLREGRCHMLLK 291 >UniRef50_Q1QWV4 Putative uncharacterized protein n=11 Tax=Proteobacteria RepID=Q1QWV4_CHRSD Length = 326 Score = 179 bits (454), Expect = 9e-44, Method: Composition-based stats. Identities = 70/309 (22%), Positives = 127/309 (41%), Gaps = 27/309 (8%) Query: 14 FKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEG 73 +K HP+ RD + + + D +TL+ S+I EDLR D++W V+ + Sbjct: 13 YKLLFSHPEMVRDLLTGFVKEAWVEQLDFSTLEKVSGSYITEDLRDREDDVIWRVRWGDD 72 Query: 74 VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG----YKELPLVLPMLFYHGCRSP 129 Y+Y+++E QS + MA R+M Y Q+ + +LP VLP++ Y+G + Sbjct: 73 WLYVYLLLEFQSSVDRFMAVRVMTYLGLLYQDLIRQEAFTPNGKLPPVLPIVLYNGEKRW 132 Query: 130 Y-PYSLCWLDEFAEPAIARKIYSSAFPLVDIT-VVPDDEIMQH-RKMALLELIQKHIR-Q 185 ++ L E + R + A+ L+D V+ D E H R +A +H R + Sbjct: 133 TAAQNVADLVEQVPGGLERYRPNLAYLLLDEGAVISDPEWSDHMRNVAAALFRLEHNRDE 192 Query: 186 RDLLGLVDQIVSLLVTGNTN--DRQLKALFN-----------YVLQTGDAQRFRAFIGEI 232 +D+L ++ +V L R + + + Q + Sbjct: 193 QDMLEVLGTLVEWLKAPEQTGLRRAFVVWIRRVLLPNRAPGMELPEFNELQDLHEVHDML 252 Query: 233 AERAPQ-----EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG-LDRELVMMVTRL 286 AER Q E++ R+EG +G+ + A+ ++ G L E + T L Sbjct: 253 AERIKQWPERWEEKGRQEGRQEGRKEGRQEGEQRGIEKTARNLIKLGVLSDEQIAEATGL 312 Query: 287 SPDDLIAQS 295 + ++ Sbjct: 313 TVAEVEGLR 321 >UniRef50_C7RR52 Putative transposase n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RR52_9PROT Length = 330 Score = 178 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 67/310 (21%), Positives = 119/310 (38%), Gaps = 26/310 (8%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HD +K P+ RD I +P D +TL+ P S++ ED D++W Sbjct: 3 NTHDTGYKLLFSTPELVRDLILGFVPDDWLHGLDYSTLERVPGSYVTEDFTNRADDIVWR 62 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFY 123 VK Y+Y++IE QS ++ MA RMM Y Q+ + G LP VLP++ Y Sbjct: 63 VKVGGEWVYLYLLIEFQSSVDKYMALRMMVYGGLLYQDLIKRGEVLADGRLPPVLPIVLY 122 Query: 124 HGCRSPYPYSLCWLDEFAEPAI-ARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 +G + + + P + + + L+D D E+ + + +H Sbjct: 123 NGSQRWSAVTDVFELIPPVPGLVEQFKPRLKYLLIDENAWSDSELASLKNLVAAVFRIEH 182 Query: 183 IRQR----DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 DLL L+D+ L + + +++ + + I ++ E Sbjct: 183 PASPAAIGDLLSLLDEW--LAERPDLRRMFALWIRATLMRKAEYRIVLPRIDDLQELNVM 240 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG---------------LDRELVMMV 283 E+L A + EG +GK E E G + +++ + Sbjct: 241 LAERLEEWAQAYKAEGKAEGKAEGKAEGKAEGKAEGEALALQKLLKKRFGAVPPDVLAQI 300 Query: 284 TRLSPDDLIA 293 +R S + + A Sbjct: 301 SRASLEQIDA 310 >UniRef50_B3ESQ9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3ESQ9_AMOA5 Length = 308 Score = 177 bits (449), Expect = 3e-43, Method: Composition-based stats. Identities = 86/302 (28%), Positives = 158/302 (52%), Gaps = 8/302 (2%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + S PHD + K+ L HP+ ++F + PA + K DL +LKL S++ E+LR++++ Sbjct: 6 KNDLSNPHDLLVKATLSHPEAIQEFAKAYFPADILKRVDLPSLKLTNKSYVTEELREFHN 65 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPM 120 DL++S + GY + V+EHQS P+ LMA R ++Y+IA ++ ++ ++ P+++ + Sbjct: 66 DLVFSFTIDKQPGYAFFVLEHQSTPDPLMALRFVKYNIALIEEYIKEKGEKTPWPIIVNI 125 Query: 121 LFYHG-CRSPYPYSLCWLDEFAEPAIARKI-YSSAFPLVDITVVPDDEIMQHRKMALLEL 178 YH PYPYS D F +P A+ + + F L D+ P++ + QH + L+E Sbjct: 126 CLYHNANEKPYPYSTSVYDLFKDPLTAKALEMFTKFYLADLNSTPNEVLEQHGSIGLMEK 185 Query: 179 IQKHIRQRDLLGLVDQIVS----LLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 + K+ R RD+ ++++ + L+ + + +YV+ + Sbjct: 186 LLKYSRHRDIFNVIEKELKRSKGYLIVRGDYWKTILIYSSYVIGQEEKSEKDLVSLFKEV 245 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + E+E ++TIA + E G M+GK E + IA+ ML +G + + +T LS D+ Sbjct: 246 LSKNEEEIMITIAQTIEERGEMRGKRREKIAIAKNMLKKGCEISFIEEITGLSRKDIEKL 305 Query: 295 SH 296 Sbjct: 306 KQ 307 >UniRef50_C2DIT3 Possible transposase n=5 Tax=Enterobacteriaceae RepID=C2DIT3_ECOLX Length = 197 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 124/201 (61%), Positives = 160/201 (79%), Gaps = 4/201 (1%) Query: 96 MRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP 155 MRY+IAAMQNHLDAGYK LP+V+P+LFYHG SPYPYSLCWLD FA+P +AR++Y+SAFP Sbjct: 1 MRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPNLARQLYASAFP 60 Query: 156 LVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY 215 L+D+T++PDDEIM HR+MALLELIQKHIRQRDL+GLV+Q+ LL +G N RQ+K LFNY Sbjct: 61 LIDVTLMPDDEIMLHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANGRQIKGLFNY 120 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 +LQTGDA RF FI +A+R+P+ K LMTIA+RLR+ +G+ +AL IA+ ML+ G+ Sbjct: 121 ILQTGDAVRFNDFIDGVAKRSPKHKVSLMTIAERLRQ----EGEQSKALHIAKIMLESGV 176 Query: 276 DRELVMMVTRLSPDDLIAQSH 296 +M T +S ++L A S Sbjct: 177 PLADIMRFTGVSEEELAAASQ 197 >UniRef50_Q24W02 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24W02_DESHY Length = 333 Score = 173 bits (438), Expect = 7e-42, Method: Composition-based stats. Identities = 80/331 (24%), Positives = 147/331 (44%), Gaps = 39/331 (11%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S PHD FK AR F+ +LP + L DL T+ + +S+ID++L++ +S Sbjct: 1 MSLIHNPHDKFFKETFGDVGMARSFLKNYLPQEILALVDLETILPQKDSYIDQELQESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY-KELPLVLPML 121 DLL+ VK + GY+Y + EH+S P + +A ++++Y + ++ L +LPL++PM+ Sbjct: 61 DLLFQVKIHKNEGYLYFLFEHKSYPSQGIALQLLKYMVRIWESKLKESKPDKLPLIIPMV 120 Query: 122 FYHGCRSPYPY----SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 YHG + E A+ + I + L D++ D E++ + + ++ Sbjct: 121 VYHGQEKWNSSLKLSGIIDNYEQLPNAVTQYIPEYEYILYDLSTYTDQEMVGNMLLLIIL 180 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLK------ALFNYVLQTGDAQRFRAFIGE 231 + I +D + + LL++ + Q K L Y+L T Sbjct: 181 RTMRDIFIKDTEAFHNILHELLISFERVEDQEKGMQFFETLIRYILSTRQDLELERIYEI 240 Query: 232 IAERAPQEKEKLMTIADR----------------------------LREEGAMQGKHEEA 263 E + + E +MTIA++ REEG +G+ E Sbjct: 241 AKEVSLERGEVMMTIAEKLIMEGMEKGLKKGREEGLKKGREEGLEKGREEGLEKGREETK 300 Query: 264 LRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 L +A+ +L G++ + V T LS +++ Sbjct: 301 LEVARNLLGLGIEMDKVAKATGLSEEEIRKL 331 >UniRef50_D0LMM4 Putative transposase n=10 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMM4_HALO1 Length = 345 Score = 173 bits (437), Expect = 9e-42, Method: Composition-based stats. Identities = 80/319 (25%), Positives = 132/319 (41%), Gaps = 29/319 (9%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 HD++ K+ D A D LP + + DL L L P SF+ ++LRQ ++DLL Sbjct: 2 PHDSHDSLVKATFARLDFAADEFRAVLPPAILERLDLDKLALCPGSFVSDELRQQHTDLL 61 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--GYKELPLVLPMLFY 123 + ++Y+++EHQS E +M R++RY + + HL G LP +LP++ + Sbjct: 62 FRAPLDGEPAFLYLLLEHQSSVERMMPLRLLRYVASIWERHLGEHPGAATLPPILPVVLH 121 Query: 124 HGCRSPYPYS----LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA---LL 176 H + + L L + A A+ + F L D++ PD+ ++ A L Sbjct: 122 HSEQGWTAPTSLGQLFALSDGAREALGPYLPELRFLLDDLSHQPDEALLMREMAAQAKLA 181 Query: 177 ELIQKHIRQ-RDLLGLVDQ---IVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 K+ R +DLL L+ ++ VT L A+ Y LQ D I Sbjct: 182 LWALKNARHAQDLLALLRPWSPVILEAVTAPGGIDALAAIVRYTLQHADTDPDALMRFLI 241 Query: 233 AERAPQEKEKLMTIADRLRE----------------EGAMQGKHEEALRIAQEMLDRGLD 276 KE MT A++L + EG ++G+ E + E L L Sbjct: 242 DSAGDPAKEAFMTGAEKLTQAVREQSLRQGRVEGRVEGRVEGRVEGRVEGRTEALRTVLS 301 Query: 277 RELVMMVTRLSPDDLIAQS 295 ++L L + + Sbjct: 302 KQLRQRFGTLPSEVTERLN 320 >UniRef50_Q2FP14 Putative uncharacterized protein n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FP14_METHJ Length = 312 Score = 172 bits (435), Expect = 1e-41, Method: Composition-based stats. Identities = 63/307 (20%), Positives = 116/307 (37%), Gaps = 23/307 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +K HP+ D I L L CDL+TL+ S++ +DLR+ D++W + Sbjct: 5 DHPYKRLFSHPEMIADLIRGFLDPKLVSGCDLSTLERCNGSYVTDDLREREDDIIWRLAY 64 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY---KELPLVLPMLFYHGCR 127 + +Y++IE QSKP+ M R+M Y Q+ + +G +P ++P++ Y+G Sbjct: 65 GDRTLILYLLIEFQSKPDYSMPIRIMSYMALLWQDLIRSGVIVPSRIPGIIPIVLYNGEI 124 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR- 186 ++R I S + L+D + +M+ R +A + Sbjct: 125 PWKVPHDIRETIQMPKPVSRFIPSVPYLLIDELRLSVHHLMEVRNLAACLFGLEQSSGPL 184 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 +L L ++ + T D + + T + + I Sbjct: 185 ELFELGARLNRWMQTDPNLDSMRRDFSLFFENTLKRDDDISISNPFQGGTMLAERVNKWI 244 Query: 247 AD-------------------RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 A R EG ++GK E I + M ++G+ + +T L Sbjct: 245 AQYKAEGRKEGKEEGKKEGLLEGRVEGKLEGKLEGMATILKRMKEKGMSVTEIATITGLP 304 Query: 288 PDDLIAQ 294 D++ Sbjct: 305 EDEIQHL 311 >UniRef50_A8GX51 Transposase and inactivated derivative n=11 Tax=Rickettsia RepID=A8GX51_RICB8 Length = 355 Score = 170 bits (430), Expect = 5e-41, Method: Composition-based stats. Identities = 80/333 (24%), Positives = 147/333 (44%), Gaps = 52/333 (15%) Query: 13 VFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQE 72 +F+ L +P A +F + HLP ++ L D +L +E +F++ L+ SD+L+S K + Sbjct: 23 IFRKALENPLVAHEFFNAHLPPNIKSLIDFPSLAMENTTFVESSLKDSISDVLFSCKFDK 82 Query: 73 GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYHGCRSPY 130 GY+++++EHQSK + MAFR+ +Y I + +L + K LPL+ PM+F++G Y Sbjct: 83 QDGYLFLLVEHQSKADHFMAFRLFKYMINICERYLIQNPKAKTLPLIYPMIFFNGQEK-Y 141 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 + D F +A++++ + + LV++ +PD+E Q +LE KHI +R+LL Sbjct: 142 NVARNLWDLFTNNKLAKELWINDYQLVNVHEIPDEEFKQRIWSGILEFFLKHIHERELLK 201 Query: 191 LVDQIVSL---LVTGNTNDRQLKALFNYVLQTGDAQRFRAF------------------- 228 +I + L L+ + Y L + Sbjct: 202 RWQEISDILPELTKITIGYDYLEMILYYTLTKIEQADKIKLKNLLSTKLNPEIGTRLMRS 261 Query: 229 ---------------------------IGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 IGE + E + + EG +G++ Sbjct: 262 LAEHWQQEGKEIGILEGLQVGEAKGIQIGEAKGIQIGKAEGIQIGKAEGKAEGKAEGEYN 321 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +A+ +A++ML +G + L+ VT L + + Sbjct: 322 KAVEVAKKMLTQGCNVSLISSVTGLDEAFISSL 354 >UniRef50_C1J8H0 Truncated transposase n=3 Tax=Escherichia coli RepID=C1J8H0_ECOLX Length = 202 Score = 170 bits (430), Expect = 5e-41, Method: Composition-based stats. Identities = 104/206 (50%), Positives = 145/206 (70%), Gaps = 4/206 (1%) Query: 91 MAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIY 150 M FRM+RYS+AAMQ HL+ +K LPLV+P+LFYHG RSPYPYS+ WLD F EPA+A KIY Sbjct: 1 MPFRMLRYSVAAMQRHLEQ-HKTLPLVIPVLFYHGERSPYPYSMNWLDCFEEPALAAKIY 59 Query: 151 SSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLK 210 + FPLVDITVV D+EIM HR+MA L L+ KHIR RD++ L+D++ ++V +D Q++ Sbjct: 60 TKPFPLVDITVVDDNEIMNHRRMAALTLLMKHIRHRDMMELLDKLPQVMV--EISDEQVR 117 Query: 211 ALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 L +Y++ GD+ F+ +AER PQ ++KLMTIA+RL ++G +G E+AL IA ++ Sbjct: 118 VLIHYIVNAGDSVSPE-FMRALAERLPQHEDKLMTIAERLEQKGRQEGALEKALAIACQL 176 Query: 271 LDRGLDRELVMMVTRLSPDDLIAQSH 296 G+ E + T LS +L +H Sbjct: 177 QKMGMTPEQIKQATGLSEAELKNITH 202 >UniRef50_A6TJT5 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJT5_ALKMQ Length = 312 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 73/310 (23%), Positives = 140/310 (45%), Gaps = 18/310 (5%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD FK + A+DF+ +LP L K+ D+ TL E +I++DL++ +S Sbjct: 1 MGIIHQPHDKFFKEMFGNLALAKDFMTNYLPLELLKIVDIETLTPEKEHYIEDDLKESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH-LDAGYKELPLVLPML 121 DLL+ GY+Y + EH+S P + +A +++ Y + + L +++P+++PM Sbjct: 61 DLLFKANINGREGYLYFLFEHKSYPSKRIAIQLLHYMVRIWDDKSLKEKKEKIPMIIPMT 120 Query: 122 FYHGCRSPYPY----SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 YHG + L E I + I + + D++ DDE+ ++ ++ Sbjct: 121 VYHGKENWNVALRLSDLMEGYEELPEEIRKYIPEYEYLIYDLSGYTDDEVKGDVQLQIVI 180 Query: 178 LIQKHIRQRD--LLGLVDQIVSLLVTGNTND---RQLKALFNYVLQTGDAQRFRAFIGEI 232 I + I + D + + V +L + K Y+L + Sbjct: 181 KILRSIFRNDEEFFKVFKEAVEVLDKLEKQEKGIEYFKTFIYYILSARKGVTLTEIYDLV 240 Query: 233 AERAPQEKEKLMTIADR--------LREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVT 284 E + + +++MTIA+ E+G +GK EE +A+ ++ G++ + VM T Sbjct: 241 KEVSVERSDEIMTIAEELLKEGMEKGMEKGMEKGKLEEKREVARNLIGLGVELDKVMKAT 300 Query: 285 RLSPDDLIAQ 294 LS +++ Sbjct: 301 GLSEEEINKL 310 >UniRef50_Q2RLW6 Putative uncharacterized protein n=9 Tax=Clostridia RepID=Q2RLW6_MOOTA Length = 344 Score = 165 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 48/330 (14%), Positives = 121/330 (36%), Gaps = 39/330 (11%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 P+D ++ L + + + + D L L S++ +D + +D+ Sbjct: 10 PPHHPYDKGYRQLLADKRVFLELLKTFVREAWVEAIDADDLILVNKSYVLQDFSEKEADV 69 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN--------HLDAGYKELPL 116 ++ +KT+ YV++E QS + LM FR++ Y + + ++ + LP Sbjct: 70 VYRLKTRNRNVIFYVLLELQSTVDYLMPFRLLLYMVEIWREIYNNTPQGERESKHFRLPP 129 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL- 175 ++P + Y+G S + + + + L D+ ++E+++ + Sbjct: 130 IIPAVLYNGAGSWTAALSFKEMLNSYQDFSGHLLDFRYLLFDVNRYSEEELIRAANLIAG 189 Query: 176 LELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALF---------------------- 213 + L+ + ++ DL G + ++ +L ++ + + Sbjct: 190 IFLLDQKMQPEDLAGRLQKLAGVLRRLTPDEFRHFTTWLKNVVQPRMPGDFSEKIDGILN 249 Query: 214 --------NYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALR 265 + + E++ + + EG ++GK E Sbjct: 250 ASNPWEVERMIYNLELTLEEMQRQALLKGLKEGEQKGKLEGKLEGKLEGKLEGKLEGKRE 309 Query: 266 IAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +A+ +L +D E ++ T L+ +++ A Sbjct: 310 VARNLLLLNVDIETIIKATGLALEEINALK 339 >UniRef50_Q1RJ73 Transposase and inactivated derivative n=10 Tax=Rickettsieae RepID=Q1RJ73_RICBR Length = 305 Score = 163 bits (412), Expect = 7e-39, Method: Composition-based stats. Identities = 78/298 (26%), Positives = 157/298 (52%), Gaps = 14/298 (4%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD++ K + A++F++ +LP +KL DL+ + +E S+I+E L + YSD+++ + Sbjct: 6 KHDSLVKIIMTDKIAAQEFLEYYLPEDFKKLIDLSKITVEQESYIEESLSKKYSDIVYGI 65 Query: 69 KTQE-GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 +T+E G G++Y++IE QS + A R+ +Y++ + H +LPLV ++ Y+G + Sbjct: 66 ETKEYGKGFVYILIEAQSTVDYWTALRLWKYTLLLCERH-KEKRNKLPLVYNLVIYNGKQ 124 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 Y D F +A+K+ + LVD+ + D+EI++ + + +L+ I KHI +RD Sbjct: 125 V-YNAPRNLWDLFTNSVMAKKLMMEDYQLVDLQAMSDNEIVKKKHIGMLDYILKHIHERD 183 Query: 188 LLGLVDQIVS-----LLVTGNTNDRQLKALFNYVL-QTGDAQRFRAFIGEIAERAPQEKE 241 ++ L +Q ++ +++ LK+ Y + Q+ R +PQ K+ Sbjct: 184 MIQLWEQFLANFNHVIMLDKEKGYIYLKSFLWYTDAKISKKQQPRLVQVFDKYLSPQHKD 243 Query: 242 KLMTI-----ADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +M D ++EG +G++ +A+ IA++M +G ++ +T L + + Sbjct: 244 NIMKTIADVYIDEGKQEGKREGEYNKAVMIAKKMFSQGFKIPVIAELTGLKETLIRSI 301 >UniRef50_A5CC03 Transposase and inactivated derivative n=9 Tax=Orientia tsutsugamushi RepID=A5CC03_ORITB Length = 355 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. Identities = 77/345 (22%), Positives = 144/345 (41%), Gaps = 59/345 (17%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD +FK + P A DFI+ LP ++ + DL T+K+E SF++ +LR+ D+L+SV Sbjct: 6 KHDGLFKDLMNEPKAALDFINDFLPNEVKNVLDLNTIKVEQESFVEANLRRSMCDVLFSV 65 Query: 69 KTQ-EGVGYIYVVIEHQSKPEELMAFRMMRYSIAA------MQNHLDAGYKELPLVLPML 121 KT+ +IYV+IE + + + +AF++ +Y+++ +LP+V+P++ Sbjct: 66 KTKNNNDAFIYVLIEAELRSDYWIAFKLWQYTLSILKRHKKGLKKRKKERGKLPIVVPIV 125 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 YHG + + F +P +A+++ S + L+D +PD EI + AL+ ++ Sbjct: 126 VYHGADR-FNAPRSLWELFDDPKLAKELMGSEYLLIDWQAMPDSEIKRKATAALVHFMKY 184 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQL-----KALFNYVLQT---GDAQRFRAFIGE-- 231 Q D++ L + + L D++ ++L Y + + R + + E Sbjct: 185 IHNQPDIIELWAKFFNTLQEIVQKDKEEGFLYIRSLLYYTISKVSQNEQPRLKQLLDENL 244 Query: 232 ---------------------IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRI---- 266 RA E R EG +G+ E Sbjct: 245 SIEDRDRIMGTIAAQYIDEGKAKGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEGRAEG 304 Query: 267 ----------------AQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 A+ +L G E + T LS ++++ Sbjct: 305 IEIGETKGRAEAAQGLARNLLKAGFSVEFIAENTGLSNEEVVNLK 349 >UniRef50_C4YU05 Transposase n=5 Tax=Rickettsieae RepID=C4YU05_9RICK Length = 342 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 85/339 (25%), Positives = 155/339 (45%), Gaps = 54/339 (15%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HDA+ K L A++F++ +LP+ ++L DL +K+E SF+++DL++ YSD+++SV Sbjct: 6 KHDALVKKILTEKIAAQEFLEHYLPSDFKELIDLREIKVEKESFVEDDLKRKYSDIIYSV 65 Query: 69 KTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 KT++ ++YV+IE QS + +A R+ +Y + + H + +LPL+ P+L Y+G Sbjct: 66 KTRDQEEAFVYVLIEAQSSCDYWIALRLWKYMLLLCERH-ENNKNKLPLICPLLIYNGSE 124 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 Y + + F +P A+K+ + LVD+ DDEI Q + + ++E KHI QRD Sbjct: 125 V-YNAPRNFWELFTKPERAKKLMVQDYQLVDLQNQSDDEIEQKKHLGMMEYFLKHIHQRD 183 Query: 188 LLGLVDQIVS---------------------LLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 +L L D+ + ++ + + L +++ + Sbjct: 184 MLKLWDEFLIRFKPSIIMDKESGYIYLRSFVWYTDAKISEEKQQELEQIIVKHLSTEEKD 243 Query: 227 AFIGEIAERAPQEK------------------------------EKLMTIADRLREEGAM 256 + IA++ E + + EG Sbjct: 244 NIMRTIAQKYIDEGVQHGIIQGIQQGIQQGVEKGKAEGLKIGEAKGKAEGKAEGKAEGKA 303 Query: 257 QGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 +GK EE + IA++ML +G D + VT L + + S Sbjct: 304 EGKAEERVEIARKMLSQGCDFSFISSVTGLEEAFIRSLS 342 >UniRef50_B9MMR0 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMR0_ANATD Length = 333 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 58/330 (17%), Positives = 124/330 (37%), Gaps = 41/330 (12%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M +D FK + +F+ + P DL +L+ SF+ ++ + Sbjct: 1 MEQKPPHNQYDLTFKRIFSFKEVFLNFLKSTIKRPWVDKIDLQSLEFVDRSFVKDEFVEK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP-LVLP 119 +D+++ K ++ Y YV++E QS ++ M R+ Y Q H++ +L ++P Sbjct: 61 EADVIYRAKIEDTDIYFYVLLEAQSTTDKTMPRRLFEYMNLIWQRHIEETKDDLLSPIVP 120 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 ++ Y+G + +L F I + + LVD+ + D+ + + + L Sbjct: 121 IVLYNGRSNWNVPTLI----FKGWEIFKDDM-FNYFLVDVNNIDDETLKNRLDLLSVILY 175 Query: 180 QKHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 R+ ++ + + ++ + T Q+K ++L+ Q GEI E Sbjct: 176 LDRSRKTAKEFIEKLKEVTEYISCLPT--EQVKVFAMWLLRVIRPQMMEEVQGEIDELLK 233 Query: 238 QEKE-------------------------------KLMTIADRLREEGAMQGKHEEALRI 266 + ++ + EG ++G+ E +RI Sbjct: 234 RIEQEGVTDVGDFVFNVQRLMQEYYKEAEEKGKEKGYEEGKLEGKLEGKLEGELEATIRI 293 Query: 267 AQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 A+ M+ G + + VT L + + Sbjct: 294 ARNMILAGAEDSFISKVTGLDIEKIKELRQ 323 >UniRef50_Q3JB06 Putative transposase n=17 Tax=Proteobacteria RepID=Q3JB06_NITOC Length = 350 Score = 160 bits (404), Expect = 5e-38, Method: Composition-based stats. Identities = 56/251 (22%), Positives = 107/251 (42%), Gaps = 9/251 (3%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HDA +K HP+ RD + + P + D +TL+ S++ +DLR+ D++W ++ Sbjct: 4 HDASYKRLFSHPEMVRDLLQGFVREPWVQQLDFSTLEKVSGSYVTDDLREREDDIIWRLR 63 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFYHG 125 QEG YIY+++E QS + MA R++ Y Q+ + A Y ++LP V P++ Y+G Sbjct: 64 HQEGWMYIYLLLEFQSTVDPYMAVRVLAYVGLLYQDLIKARYIAPNQKLPPVFPLVLYNG 123 Query: 126 CRSPYPYSLC-WLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 + L E + R S + LVD D+ + + + ++ R Sbjct: 124 GPRWRAATEVGDLITPLEGGLERYRPSLRYLLVDEGDYQDEALAPLKNLVASLFRLENSR 183 Query: 185 QR-DLLGLVDQIVSLLVTGNT---NDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 +LL ++ ++ L + L +L + + E Sbjct: 184 TPEELLQVLRNLLQWLQSPAQKGLERDFTLWLKRVLLPARLPGVEIPSVASLEEMNSMLA 243 Query: 241 EKLMTIADRLR 251 E+++ + + Sbjct: 244 ERVVEWTQQWK 254 >UniRef50_Q1RGR6 Transposase and inactivated derivative n=15 Tax=Rickettsia RepID=Q1RGR6_RICBR Length = 313 Score = 160 bits (404), Expect = 6e-38, Method: Composition-based stats. Identities = 78/311 (25%), Positives = 148/311 (47%), Gaps = 25/311 (8%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 HD + +S +P +++F ++HLP ++ L LK+E +SF+D+ L++ D+L+ Sbjct: 4 KPKHDEIIRSAFENPLVSKEFFEMHLPPHIQNLISFEKLKMEKDSFVDKRLKKSIVDILF 63 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-GYKELPLVLPMLFYHG 125 S K E GY+Y+++EHQS PE MA R+ RY + H + K+ P + P++FY+G Sbjct: 64 SAKFGEKKGYLYLLLEHQSTPEYKMALRLFRYMFKIAEYHKKSTKSKKFPFIYPLIFYNG 123 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + Y + F + + +S + L+++ +PD+++ + +L+ KHI + Sbjct: 124 VQK-YNAPRNLWELFENSELVKSTWSGDYQLINVHDIPDEKLKEKAWSGILQFFMKHIHE 182 Query: 186 RDLLGLVDQIVSLLVTGNTND---RQLKALFNYVLQTGDAQRFRAFIGEIAERA--PQEK 240 RDLL +++ LL D ++ + Y L + + + + Sbjct: 183 RDLLKRWEEVADLLPKFAKIDIGIEHIELILCYTLTRIKQDDIIEVEKLLQSKLNPKKRE 242 Query: 241 EKLMTIADRLREEGAMQGK------------------HEEALRIAQEMLDRGLDRELVMM 282 + +IA ++G + K EE + +A+EM+ G E V+ Sbjct: 243 NVMKSIAHHWIQQGREEEKAIMLKKMQEEKVIMAEKVQEEKVMMAKEMMKEGFSLESVIK 302 Query: 283 VTRLSPDDLIA 293 +T+LS +DL Sbjct: 303 ITKLSKEDLEK 313 >UniRef50_D2QBD7 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QBD7_9SPHI Length = 341 Score = 159 bits (401), Expect = 1e-37, Method: Composition-based stats. Identities = 69/296 (23%), Positives = 138/296 (46%), Gaps = 11/296 (3%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 PHD FK P+ DF++ P +R+ D TTL E ++F DE L ++++DL+ Sbjct: 5 PDNPHDRFFKESFSQPEILIDFLNAFAPEAVRERIDYTTLTREVDTFTDEQLAEHFADLV 64 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 +SV+ + +++EH+S EE F++ RY + ++ + + L VLP+L YHG Sbjct: 65 FSVQYNGQPIRLVILLEHKSYTEEYPHFQINRYLLNLWESQIKQK-QPLTPVLPVLVYHG 123 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL---LELIQKH 182 R S+ + + + + L+D++ + D+ + + L+Q Sbjct: 124 NRRWKQRSIPDYFAPLHETLTPYLPAFEYLLIDLSTLSDERLPTLQSDYARLTAILLQNS 183 Query: 183 IRQRDLLGLVDQIVSL---LVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 R+R+L L+D + L R + F Y+ T + + G + + + Sbjct: 184 RRKRELTRLLDAFADVVRRLTDTTAGQRFVSTGFLYLSYTANLTKVE-LFGIFSRISSKI 242 Query: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + MT+A+ L +EG + + +A+E++ +G + E + ++ ++L+ Q Sbjct: 243 ESSTMTVAEELIQEGRELERRQTR-MVAEELIQQGRELERRQAM--MAAEELLKQQ 295 >UniRef50_C5UWW9 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UWW9_CLOBO Length = 323 Score = 158 bits (399), Expect = 2e-37, Method: Composition-based stats. Identities = 62/319 (19%), Positives = 122/319 (38%), Gaps = 23/319 (7%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + HD +K H +T +F+ L + L L S+I D + Sbjct: 1 MKNNNVHHEHDVGYKHIFSHKETFLEFLRSFTKKEWANLINEDDLILVDKSYILSDFEEE 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------- 112 SD+L+ + YV++E QSK + M R++ Y ++ L K Sbjct: 61 ESDILYKANIDDKEVIFYVLLEFQSKVDFQMPMRLLFYMTEIWRDVLKNTEKNERKRKNF 120 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK 172 +LP ++P++ Y+G I + L DI D E++ Sbjct: 121 KLPSIVPIVLYNGKNKWSAKISFKEMLSGYELFEDNILDFNYMLFDINRYSDHELLNISN 180 Query: 173 -MALLELIQKHIRQRDLL-GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 ++ + L+ + I +++L+ L I L K +++ + I Sbjct: 181 MISAVFLLDQEIDEQELMRRLKKIIYILKKISPEQFSVFKKWLKNIVKPRVRDNLQGEID 240 Query: 231 EIAERAPQEKEKLM-------------TIADRLREEGAMQGKHEEALRIAQEMLDRGLDR 277 ++ E++ QE+ M +R ++G QG + + A++ ++ G+D Sbjct: 241 DVLEKSNQEEVDFMVSNLGKTIERMQDKAIERGLKKGIEQGIEQGIEQTAKKAIEMGMDN 300 Query: 278 ELVMMVTRLSPDDLIAQSH 296 E++M +T LS + + Sbjct: 301 EIIMNLTGLSEEQINTIRQ 319 >UniRef50_C5JAV2 Transposase n=2 Tax=uncultured bacterium RepID=C5JAV2_9BACT Length = 334 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 68/285 (23%), Positives = 132/285 (46%), Gaps = 11/285 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD K+ L +P TA + LP + + +L SFIDE LR + + Sbjct: 1 MTEIAHPHDRFLKALLSNPATAGTLLRERLPREVAEALSDDPPELLEGSFIDEALRPHLT 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLP 119 D L+ V+T G +YV+IEH+S P+ + +++++Y + A++ ++ LP ++P Sbjct: 61 DRLYRVRTVTGRTALLYVLIEHKSSPDLRIGWQLLKYLVEALKQWERENPAWERLPAIVP 120 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 +FYHG + AE + + F ++D+ + D ++ + + L Sbjct: 121 FVFYHGAAAWKVPDAFLALVDAEEGWRSHLLNFRFTVLDLGQIDDRQLSRQPNLQAWLLA 180 Query: 180 QKHIRQRDL-LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 K+ + D L + + ++ LV+ D + + L YV++T + I P+ Sbjct: 181 AKYATRDDRQLEVKELLIQTLVS--VADEEFRFLMRYVVETYRSYDEPMVREIIRRVRPE 238 Query: 239 EKEKLMT-----IADRLREEGAMQGKHEEALRIAQEMLDRGLDRE 278 E+E +M+ + + R+EG +G+ E + RG E Sbjct: 239 EEETMMSMFAQDMMAKGRQEGRQEGRQEGRQEGIKLGEQRGRQEE 283 >UniRef50_Q1RKI3 Transposase and inactivated derivative n=10 Tax=Rickettsia RepID=Q1RKI3_RICBR Length = 270 Score = 157 bits (397), Expect = 4e-37, Method: Composition-based stats. Identities = 56/193 (29%), Positives = 103/193 (53%), Gaps = 3/193 (1%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD F+ L +P AR+F + +LP ++ L TTL LE +SFID +L++ +D+L+S Sbjct: 55 KHDKFFQKALSNPIVAREFFEEYLPTEIKALFSPTTLTLENDSFIDPNLKESITDVLYSA 114 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYHGC 126 + YIY++ EHQS + MAFR+ +Y + + HL K+ P + P++ Y Sbjct: 115 RINNRDCYIYILCEHQSSSDPHMAFRLFKYMLNIAEKHLISHPDSKKFPFIYPLV-YSND 173 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 Y L D F + + +S+ + L+ + + DD++ ++ +A L+++ K+I + Sbjct: 174 HKKYTAPLNLWDLFENSELVKDTWSNNYQLISLRDISDDKLKENPWLAPLQILMKYIHKP 233 Query: 187 DLLGLVDQIVSLL 199 ++ +I L Sbjct: 234 NVFDKWQEISGCL 246 >UniRef50_B9TA29 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TA29_RICCO Length = 411 Score = 156 bits (395), Expect = 7e-37, Method: Composition-based stats. Identities = 53/292 (18%), Positives = 106/292 (36%), Gaps = 27/292 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D+++K HP+ RD + L A + + + S+ + + D++W + Sbjct: 44 DSLYKQLFAHPEIVRDLVAGFLAADWARGLTVEAFERVNASYASDHGHVRHDDVVWRARI 103 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ----NHLDAGYKELPLVLPMLFYHGC 126 Y+Y+++E Q++P++ MA RM Y Q H + + +LP VLP++ YHG Sbjct: 104 GGEWVYVYILLEFQARPDKWMALRMQVYVGLLYQDLVAQHKLSKHGKLPPVLPVVLYHGR 163 Query: 127 RSPYPYSLCWLDEFAEPA-IARKIYSSAFPLVD------------ITVVPDDEIMQHRKM 173 + P+ + R S + L+D + D + Sbjct: 164 GPWRAATALASLMLPAPSGLERYQPSQRYLLIDQHHGTARADVVSLLFRLLDAATDLQLR 223 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTN---------DRQLKALFNYVLQTGDAQR 224 L+L+ + IR RD+ + D + + + + + Sbjct: 224 EALDLLAERIRARDMDPVRDSLTRWIQLTLQDAAVETSMDLEEAFTMKMRRKFSYDEMFD 283 Query: 225 FRAFIGEIAERAPQ-EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 F +A+ + E L + E G ++G + + + GL Sbjct: 284 PGMFERPLAKAREKAIVEGLQQGREEGLERGRVEGLERGRVEGLERGREEGL 335 >UniRef50_A3JHZ5 Putative transposase n=11 Tax=Proteobacteria RepID=A3JHZ5_9ALTE Length = 325 Score = 156 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 60/318 (18%), Positives = 124/318 (38%), Gaps = 29/318 (9%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 + HD +K HP+ + ++ P+ + L D TLK ++I + + D++W Sbjct: 3 TNHHDTGYKELFSHPEFVQQLVEGFAPSEIAGLMDFNTLKNHSGNYITPLFEEKFEDVVW 62 Query: 67 SVKTQ----EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVL 118 SV+ ++Y+++E QSK + M R+M Y + L + LP + Sbjct: 63 SVEVTWEGITQRVFLYILLEFQSKIDSTMPLRLMHYVACFYDHLLKTRETTVRQGLPPIF 122 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYS--SAFPLVDITVVPDDEIM-QHRKMAL 175 PM+ Y+G + + P ++Y + L+D D+E++ + ++ Sbjct: 123 PMVLYNGSQRWSARQDIYDMVQPAPPEFLRVYQPHLRYYLIDEGRYTDEELISKRTPLSG 182 Query: 176 LELIQKHIRQRDLL-GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR-----AFI 229 + ++ + L VD+IV ++ DR K + ++ + + + Sbjct: 183 IFGVENAGHSWEALQQAVDRIVEIVKADPNKDRVDKIVTRWIKRHLQRVAPKARLNLDRM 242 Query: 230 GEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG------------LDR 277 + E E L + + R EG +G+ E + L+ L Sbjct: 243 SSLVEDRNMLAENLENLVKKERLEGRQEGRQEGRQEGDRRALEEKRKTVRHLLSFGVLSN 302 Query: 278 ELVMMVTRLSPDDLIAQS 295 + + + T LS D++ Sbjct: 303 DQIAVATGLSVDEIDKLR 320 >UniRef50_A4XMD0 Putative uncharacterized protein n=5 Tax=Clostridia RepID=A4XMD0_CALS8 Length = 329 Score = 156 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 44/327 (13%), Positives = 116/327 (35%), Gaps = 37/327 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M +D FK + + +F+ ++ D +L+ SFI ++ + Sbjct: 1 MQQKVPHNQYDLTFKRLFQFKEVFLNFLRGNINREWVNRIDAESLEFVDRSFIKDEFVEK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLP 119 +D+++ + ++ Y YV+IE QS + M R+ Y + H++ E LP ++P Sbjct: 61 EADVIYRARLEDTDVYFYVLIEPQSTADRNMPRRLFEYMTLIWKRHMEEKADELLPPIVP 120 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 ++ Y+G + F I + + LVD+ + D+++ + + L Sbjct: 121 IVLYNGRSGWNIPTQI----FKGFDIFKDDM-FNYILVDVNRLDDEKLKSRLDLLSIILY 175 Query: 180 QKHIRQR--DLLGLVDQIVSLLVT-----------------GNTNDRQLKALFNYVLQTG 220 + R+ + + + ++ + ++++ + +L+ Sbjct: 176 LEKSRRNAEEFVEKLSEVSEYICKLPQVQLKVFCSWLLRIVKPQVREEMESRIDELLKKI 235 Query: 221 DAQRFRAF------------IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQ 268 +A+ +++ +EG +G + I + Sbjct: 236 EAEGVEDVGEFIFNVQQLIQEYYREAEEKGKEKGYEEGIQEGIKEGIKEGIQRKEEEIVR 295 Query: 269 EMLDRGLDRELVMMVTRLSPDDLIAQS 295 ++ +G + + T + + + Sbjct: 296 RLIQKGFNDNFIAEATGVEIERIKKIR 322 >UniRef50_C3PPD7 Transposase and inactivated derivative n=13 Tax=spotted fever group RepID=C3PPD7_RICAE Length = 361 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 74/302 (24%), Positives = 138/302 (45%), Gaps = 32/302 (10%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 + ++ HD +FK + P AR+F++ +LP + +L ++K+E SF+ EDLR+ Sbjct: 33 SNTSERPRHDELFKKVMSEPVAAREFLEHYLPVTFKNKINLNSIKIEKESFVTEDLRKRL 92 Query: 62 SDLLWSVKTQEG--------------VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL 107 SD+++SV + Y+YV+IEHQS + +AFR+ +Y + + H Sbjct: 93 SDVVYSVSLKNDNIKDSTTEKSVHNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCERHK 152 Query: 108 ----------DAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV 157 +LPL+ P++ Y PY + + F + A+ + + LV Sbjct: 153 DANNNKSSVTKEKDNKLPLICPIVVY-ANDKPYNAPRSFWELFEDSKTAKDMMGDEYLLV 211 Query: 158 DITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL-KALFNYV 216 D+ DDEI + + + ++E + KHI+ RD+L L ++ + D++ ++ Sbjct: 212 DLQKQSDDEIEKKKHLGMMEYMLKHIKARDILNLWQSLLEKFESSIEIDKENGYIYIKWL 271 Query: 217 LQTGDAQRFRAFIGEIAERA------PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 L DA+ E+A ++E + TIAD+ +EG +G + Sbjct: 272 LWYSDAKVSEDKQVELASIIAKHLKKEDQEELMRTIADKYIDEGVQKGMVQGMQIGEARG 331 Query: 271 LD 272 + Sbjct: 332 MQ 333 >UniRef50_B6WXP3 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WXP3_9DELT Length = 330 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. Identities = 56/281 (19%), Positives = 115/281 (40%), Gaps = 14/281 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD+ +K F +P+ + +PA + D +TL+ S++ +DLR+ + Sbjct: 1 MGKERIPHDSAYKQFFSNPEMVESLLRDFVPADFIEDLDFSTLERCSGSYVTDDLRERHD 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLV 117 D++W + ++G Y+ +V+E QS P+ MA R + Y+ + + + G + LP V Sbjct: 61 DIVWRIGWKKGAWCYVALVLEFQSTPDYWMALRTLSYTALLLLDLVKTGKVHEGEGLPPV 120 Query: 118 LPMLFYHGCRSPY-PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 P++ Y+G ++ P + L ++ L+D + V DE+ + L+ Sbjct: 121 FPIVIYNGGKAWKAPQEVATLFAPMPDSLKHYCPQHRHFLLDESRVSGDEL--DKSQGLV 178 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTN------DRQLKALFNYVLQTGDAQRFRAFIG 230 + K R ++ + + L+ + L VL+ Sbjct: 179 AQLLKLERAQEPEQVRQIVKELITRLHEPKYLLLRRAFTVWLSRVVLKRSGITEEIPEFQ 238 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEML 271 ++ E +E+ D ++G +G R + L Sbjct: 239 DLREVDAMLEERAAQWKDEYIKQGKTEGISIGEARGIRSAL 279 >UniRef50_A4XFI8 Putative uncharacterized protein n=7 Tax=Clostridia RepID=A4XFI8_CALS8 Length = 321 Score = 153 bits (385), Expect = 9e-36, Method: Composition-based stats. Identities = 63/320 (19%), Positives = 126/320 (39%), Gaps = 29/320 (9%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M+ S HD+ FK HP + + K +++L F+DE Q Sbjct: 1 MSSSLPPQEHDSTFKFLFEHPKDILFLVKDVIGYSWAKEIKEDSIELADKEFVDETFHQK 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +D++ + ++ Y Y++IE+QS E M R++RY I + G K+LP ++P+ Sbjct: 61 RADVIAKARLKDREVYFYIIIENQSTVAEDMPERLLRYMILLWAKKIREGVKKLPAIIPI 120 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 + Y+G + S + EF K + +V+I+ + ++Q + L ++ Sbjct: 121 VTYNGLEKDWDVSQEIISEFDI----FKDDIFKYAVVNISKLDAKTLLQEEEDILSPVVF 176 Query: 181 KHIRQRDLLGLVDQIVSLL-----VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 + RD + + + + N + V++ + + E+A+R Sbjct: 177 YLEQVRDDTEELVKRLKEIEPKLTKLSQNNAERFLIWAGNVIRPRLVKEDKEKYDELAQR 236 Query: 236 APQEKEKLM--------------------TIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 Q + M + EG ++GK E + +A++M+ RG Sbjct: 237 VEQGGSRQMGEFVSNVAKLLDEVQMRKFNEGKIEGKIEGKIEGKIEGKIEVAKKMIRRGF 296 Query: 276 DRELVMMVTRLSPDDLIAQS 295 E + +T L + + Sbjct: 297 SDEDIAELTELDIEKVKELR 316 >UniRef50_Q6TFF6 Putative transposase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFF6_CAETA Length = 299 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 79/299 (26%), Positives = 144/299 (48%), Gaps = 19/299 (6%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSF-------IDEDLRQ 59 HD+VFK + + D A F+ +LP L +L D T+KLE + D ++ Sbjct: 2 KNVHDSVFKDLIANRDFAVSFLMTYLPKELVELVDWQTVKLESANVEHVRQQQKDNQKQK 61 Query: 60 YYSDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--KELPL 116 SDL + K ++G G ++V IE Q+ + + R Y + + +++ K LPL Sbjct: 62 EQSDLTFLFKFKDGKNGAVFVHIESQTGDDGTILIRTRHYQTSYLLDYIKRHKTVKGLPL 121 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 V+ +++Y + P+ +SL D FA +A+K Y+ +D+ D+EI++H +A Sbjct: 122 VVSIIYY-ANQKPFSHSLNIHDYFANTELAKK-YAFTTQFIDLNRYSDEEILEHGFIAGY 179 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 ELI K IR++++ G +D ++ + + + L Y+ Q D + F ++ Sbjct: 180 ELILKAIREKNIDGKLDIAINQIEA--YDHIARQVLIRYMSQYSDM-ETKDFHDKLIYSK 236 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 P + +MT+A++ ++G +G A+ L GL E V+ T L D ++ Sbjct: 237 PDLRGDVMTVAEQWEQKGIQKGIQ----TTARNFLLMGLSAEQVVKGTGLDQDTVLKLK 291 >UniRef50_A3ET28 Probable transposase n=6 Tax=Leptospirillum sp. Group II RepID=A3ET28_9BACT Length = 335 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 123/334 (36%), Gaps = 47/334 (14%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ S HD FK+ + RDF+ LP + + D +L+ I + + Sbjct: 1 MNEISGLHDRFFKTSFGRIEVLRDFLTGFLPPEISQSIDPDSLRFLNTESIGLSFEKSHM 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 DL+ + E Y++IEH+S P+ + +M+RY +A + K L VLP++F Sbjct: 61 DLVVECRISETPAQFYLLIEHKSVPDPEVFLQMLRYMVALWTRN-RQDNKPLVPVLPLVF 119 Query: 123 YHGCRSPYPYSLCWLDEFA-EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA---LLEL 178 + G P+ + + + F + A L D++ V I + A ++ Sbjct: 120 HQG-GRPWTLPVRFQETFPVPETLKAHAVDFAPLLFDLSTVSGTTIRERSAHAETVVVLT 178 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 + K+ + ++ + G+ ++ L + NY ++ + + + + R+ Sbjct: 179 LLKYAFSGSVEDVLRALKE--TGGSFDETFLFGVLNYAIRAFEVKD--PVVVDAISRSFG 234 Query: 239 EKEKLMTIADRLREEGAMQG------------------------------------KHEE 262 ++ + +I D EEG +G + E Sbjct: 235 GEKIMPSIIDEWVEEGLKEGLKKGREEGREEGREEGKEEGRKEGREEGKEEGRKEGQKEG 294 Query: 263 ALRIAQEMLDRG-LDRELVMMVTRLSPDDLIAQS 295 + +++L +G L + + + Sbjct: 295 QRKTIEKLLAKGVLSVSEIASALDVDLQWVEQIR 328 >UniRef50_A4XG55 Putative uncharacterized protein n=2 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XG55_CALS8 Length = 327 Score = 149 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 52/322 (16%), Positives = 108/322 (33%), Gaps = 27/322 (8%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + +D +K + + + L+L +++ D + Sbjct: 1 MCSNLPHNVNDLEYKYIFSNKSLFLRLLKRIDRINIFNKLTEEDLELVDKNYVLPDFSEQ 60 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------- 112 SDLL+ + QE + Y++ EHQS + MA R++ Y ++ L K Sbjct: 61 ESDLLYKARLQEEELFFYILFEHQSTVDYNMAMRLLFYITDIWRDWLKQFDKNQFKNKSF 120 Query: 113 ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK 172 + P V+P++ Y G + I + L+D+ + Sbjct: 121 KFPPVVPIVLYDGDNPWTASVNLKERIMNFEVFGKYIVDFEYILIDLNDPDEMIFKYKDI 180 Query: 173 MALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 ++L+ + K +++L L + L + + V+ + ++ Sbjct: 181 LSLILKLNKVKTEKELERLFLDLYEYLQGAKEKEINTLKICLPVVLKELGEDKVQEAKDM 240 Query: 233 AERAPQEKEKLM-------TIADRLREEGAMQGKH------------EEALRIAQEMLDR 273 E E +M I + EG +G ++ L IA+ M+ + Sbjct: 241 LECIDVGGEGIMPLFQNLRKIREEWYHEGIQKGIQDGLQQGLQQGLQKKELEIAERMIVK 300 Query: 274 GLDRELVMMVTRLSPDDLIAQS 295 G E + +T L + + Sbjct: 301 GYSDEEIHEITGLDIEKIKELR 322 >UniRef50_C6HXQ0 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HXQ0_9BACT Length = 341 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 54/331 (16%), Positives = 110/331 (33%), Gaps = 41/331 (12%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ HD FKS L P + LP L L +L + + + L Sbjct: 1 MTIDGPLHDRFFKSTLGRPKRMEHILKAFLPPALSALLAPGSLVPLFSEVVGDSLDASLL 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+ + E I+V++EH+S P+ F+++ Y +P V P+LF Sbjct: 61 DMAFEATFGERKTRIHVLVEHKSSPDPWAHFQILHYLAELWLRDKKESRSPIPFV-PVLF 119 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK---MALLELI 179 YHG R + + + P++D+ + D +I + + + L+ Sbjct: 120 YHGLRPWNLPTRLSEMLDPPSELLPFVPDYLLPVIDLGKIDDLDIREKIRDFETSACLLL 179 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQ---RFRAFIGEIAERA 236 KHI + G + + N + + + +YV+ + + I + Sbjct: 180 LKHIFEG-ARGSLRAFLQETNGKNLSRDIIISGMSYVIGVHHLESTAELSRLVNTILKEE 238 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEAL-------------------------------- 264 + + + L ++G +G + Sbjct: 239 GMSQNVVELWMEELIQQGVQKGIQQGVQLGIEQGIQQGIQQGVQQGVRQGVQQGIRITQD 298 Query: 265 RIAQEMLDRG-LDRELVMMVTRLSPDDLIAQ 294 +++L++G L E + L D + Sbjct: 299 DTIRKLLNKGQLSVEQIAFFLDLPTDRIREV 329 >UniRef50_C6VTM0 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTM0_DYAFD Length = 308 Score = 146 bits (369), Expect = 6e-34, Method: Composition-based stats. Identities = 60/305 (19%), Positives = 134/305 (43%), Gaps = 15/305 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + + HDA ++ + + A D+ +P ++ L D +TL+ P++++ ++L++ S Sbjct: 1 MDKHTPKHDAFIRAIMGNKQIALDYFRASIPQNIQDLLDFSTLRQLPDTYVSKELQKSIS 60 Query: 63 DLLWSVK--TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 D+++ + + G I +++EH+S ++ ++ Y + + + L++P+ Sbjct: 61 DIVYVCQKASGNGEVKISLLVEHKSYVDKYTPIQIGSYIFSGLLKQIGNKESP-SLIIPI 119 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI--MQHRKMALLEL 178 L YHG ++ L E EPA+ + I + D+ + D+EI + ++ +A L Sbjct: 120 LLYHGADRWEYKTVADLFENPEPALQQFIPDYQYIFHDLGQISDEEIQSLHNKFLAASLL 179 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 K+ +D L + + ++L + DR L + G+ F+ I Q Sbjct: 180 AMKYSALKDQLNTL--LPTILTLASEVDRNLHKSLLFYTLVGNPLTEEQFLNLIKSVPNQ 237 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALR-------IAQEMLDRG-LDRELVMMVTRLSPDD 290 +KE +M I + E+G +G E + ++ + L E + ++ D Sbjct: 238 KKEAIMDIFEIFEEKGWKKGIEEGRAEAEQKIETAVRNLIKQSVLTDEQIASAMNVTTDY 297 Query: 291 LIAQS 295 + Sbjct: 298 VAEVR 302 >UniRef50_A6G1G8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G1G8_9DELT Length = 329 Score = 146 bits (368), Expect = 8e-34, Method: Composition-based stats. Identities = 63/304 (20%), Positives = 104/304 (34%), Gaps = 20/304 (6%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 HDA+FK+ P A LP L + D EP + +D L + D+LW Sbjct: 5 HAHDALFKAAFGAPAHAARLCRALLPPALVAVLDWRASTSEPTAVLDLRLSERRCDVLWR 64 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD-AGYKELPLVLPMLFYHGC 126 + +G G IYV++EHQS E M R+ Y H + LP ++P++ H Sbjct: 65 TRFVDG-GPIYVLLEHQSTRERDMPLRIEGYLARIWAGHRRGDRHGPLPPIIPIVVSHAE 123 Query: 127 RSPYPYSLCWLDEFAEPA----IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 W P +A + + + D+T V D + L Sbjct: 124 HGWRAPRSFWEQFSPSPDCIPGLAPFVPNFQLLIDDLTQVDDASLRGRSLPLFQTLALWL 183 Query: 183 IRQ-RDLLGLVDQIVSL-----LVTGNTNDRQL----KALFNYVLQTGDAQRFRAFIGEI 232 +R RD +++ + + G + Q + L Y F ++ Sbjct: 184 LRDARDPGRVLESVDEWNTWIHRLRGESQHEQDGGDIEQLLRYAYAVMGEGEDSEFHRKL 243 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLI 292 A P E +T + G +G E ++ E+L+ L + L Sbjct: 244 AAFHPPSAEMSLTFEQQAINRGHKRGLEEGRIKGRLELLEAQLH----AKFSTLPMRLRE 299 Query: 293 AQSH 296 Sbjct: 300 RLDQ 303 >UniRef50_C0GW46 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW46_9DELT Length = 341 Score = 146 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 66/271 (24%), Positives = 123/271 (45%), Gaps = 6/271 (2%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 PH+A FK F + P+ + FI H+P + L DL TL+++ + F+ E+ R+YY+D+ Sbjct: 4 EIPNPHNACFKDFFKDPEFVKAFIKYHIPEEICSLLDLDTLQVDLSGFVSEEHREYYADV 63 Query: 65 LWSVKTQE--GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPM 120 + +V+ + IY+++EH+S PE L +++ Y + + G LP+++P+ Sbjct: 64 MVTVQLKGHTENVNIYILLEHKSTPEFLTRLQILNYEVQKWMDLKRKGQLQGYLPVIIPV 123 Query: 121 LFYHGCRSPY-PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 + YHG L + + + + DI+ + DDE + + L+ Sbjct: 124 VIYHGKGRWNFSRKFSDLFDLPSEVLRPFVPEFKHMIHDISSMEDDEFKTTAILEIFHLL 183 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF-RAFIGEIAERAPQ 238 K+I +L + +I LL T D+ + L V +GE R P Sbjct: 184 FKYIHYPELETKLQEIYDLLETIPDQDKVKQYLQAIVQYVAVQGPISLERLGEYTRRLPG 243 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQE 269 E + T A ++R+E + E+ + + Sbjct: 244 GDEAMQTAAQQIRQEAYNEFIQEQEKMLVER 274 >UniRef50_Q1Q296 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q296_9BACT Length = 338 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 62/284 (21%), Positives = 123/284 (43%), Gaps = 8/284 (2%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 PHD FK + A DF+ P + K DL+TL + +S+IDE+L++++SD+ Sbjct: 2 EILNPHDKFFKETFSIRENAIDFLSGRFPPEILKKLDLSTLTQDNSSYIDEELKEHFSDI 61 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 +++ ++ I ++ EH+S ++M+Y + + + + L V+P++ YH Sbjct: 62 VYTCFCKDKEIRITLLFEHKSYAVACPYLQLMKYLLKIWEANSKQA-QRLIPVIPVILYH 120 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI----MQHRKMALLELIQ 180 G + E + R I + L DI+ ++EI + + + L+ Sbjct: 121 GKEAWKVRRFREYFEGIDEVFYRFIPEFEYLLTDISCYSNEEIKDRVFRRVSLQITMLLM 180 Query: 181 KHIR-QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR--FRAFIGEIAERAP 237 ++I ++ L + + + D LK L + + A + I + E + Sbjct: 181 RNIFDEKYLEDKLKDFFEIGIQYFEEDEGLKFLESAIRYLYYASDIAEKRVIDTLKEISE 240 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVM 281 + + MTIA +L E+G + G+ E E G + + Sbjct: 241 EGGKLSMTIAAKLIEKGKIAGRVEGRAEGRAEGAIEGERKGRIE 284 >UniRef50_A4U3R1 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3R1_9PROT Length = 322 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 61/292 (20%), Positives = 114/292 (39%), Gaps = 9/292 (3%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 DA++ HP A + +P + D ++ F D D ++ D++W + T Sbjct: 5 DALYHRLFSHPLMAEQLVREFVPEAMAVGLDFARMERVNAKFHDRDGKRREGDVIWRIPT 64 Query: 71 QEGVGYI-YVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY----KELPLVLPMLFYHG 125 +G + +++ E QS + MA R Y Q+ + LP VL ++ Y+G Sbjct: 65 ADGEDVVLHILCEFQSTTDWWMAVRTQVYEGLLWQHLIAERKLKSGDRLPPVLTLVLYNG 124 Query: 126 CRSPYPY--SLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 + + ++ + A + + + L+D+ VP++E+ +A L +H Sbjct: 125 EQRWHAPTDTIPLIALPAGSPLWPWQPRACYHLLDMGAVPEEELAIRDSLAALLFRLEHP 184 Query: 184 RQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD-AQRFRAFIGEIAERAPQEKE 241 R+ +L GL+D +V D + V Q + + A G++ E Sbjct: 185 REPEELAGLIDDVVGWFRRHPGYDELRRLFTELVRQAIEGYETSVAVPGDMMEMRSMLAN 244 Query: 242 KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 T R EG +G+ R + L R L++ + T L A Sbjct: 245 LGETWKKRWLAEGIAEGEARGEARGEAKALIRLLEKRFGQLPTDTRERVLAA 296 >UniRef50_C6I158 Putative uncharacterized protein n=3 Tax=Leptospirillum ferrodiazotrophum RepID=C6I158_9BACT Length = 328 Score = 144 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 69/288 (23%), Positives = 116/288 (40%), Gaps = 9/288 (3%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 HD FKS L PD + LP + D +L + E L DL +S + Sbjct: 7 HDRFFKSTLGRPDRLGKVLKAFLPTNISASLDPGSLVPLGTESVGEGLDSSLMDLAFSAR 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSP 129 + I++++EH+S P+ F++ RY L G + PL LP+LFYHG Sbjct: 67 FGDQEARIHLIVEHKSSPDPRTHFQIARYLCGLWIRELKEGLQPRPL-LPILFYHGVVPW 125 Query: 130 YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH---RKMALLELIQKHIRQR 186 S + PL+D+ V D+EI H + L L KHI Sbjct: 126 TLPSRLTEVLRPPSELLAVTPDFVLPLIDLRRVDDEEIRHHVDDLEAVLALLSLKHIFDG 185 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYV---LQTGDAQRFRAFIGEIAERAPQEKEKL 243 + LV ++ + LK NY+ + ++Q + + IA ++ + Sbjct: 186 -VETLVRLLLREIWERKAPHAILKPEMNYMAGVYKITNSQEMKQIVDPIAREVGMAQDIV 244 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 T D ++G +G + + Q+ L++GL++ RL + + Sbjct: 245 ETWLDEYLQQGLQKGLEQGLQQGLQQGLEKGLEKGF-QQGARLKEEQV 291 >UniRef50_A9BGB6 Putative uncharacterized protein n=3 Tax=Petrotoga mobilis SJ95 RepID=A9BGB6_PETMO Length = 331 Score = 143 bits (361), Expect = 6e-33, Method: Composition-based stats. Identities = 69/304 (22%), Positives = 128/304 (42%), Gaps = 13/304 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 PHD FK + ARDF+ +LP ++ DL L E NS +DE+LR+ S Sbjct: 2 NELVHNPHDRFFKLIFSDKEIARDFLQNYLPQEAVEIVDLDYLIPENNSHVDENLRESLS 61 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+L+ K + GYIY+++EH+S E + F+++RY + + D K++P+++PM+ Sbjct: 62 DMLYKTKIKGQDGYIYILMEHKSYIEGKVIFQLLRYITSIWEEKYDPKTKKVPIIIPMVI 121 Query: 123 YHGCRSPYPYSLCWL----DEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 YHG + E + + + + + D ++ I+ M + Sbjct: 122 YHGREIWNVETNLLNMVQGIEDLPNELKTYLPTYRYEICDFSIKRKKRIIGLTAMKVAIE 181 Query: 179 IQK---HIRQRDLLGLVDQIVSLLVTGNTN--DRQLKALFNYVLQTGDAQRFRAFIGEIA 233 + + + + + ++ + + + Y+L + + Sbjct: 182 AMRAGTAMTKEEFKERLRRVFAYIKQLPKEQVHEWFEECMIYLLNVREDVTIEEILKVQK 241 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGK----HEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 E P E +MTIA++LR EG +GK + L +E R L + +T D Sbjct: 242 EIMPGRGEIVMTIAEKLRNEGMEKGKIEGERKGKLEGEREFAIRILSKRFGNQLTEEIKD 301 Query: 290 DLIA 293 + Sbjct: 302 RIRE 305 >UniRef50_B4U689 Putative uncharacterized protein n=8 Tax=Aquificales RepID=B4U689_HYDS0 Length = 323 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 49/275 (17%), Positives = 107/275 (38%), Gaps = 15/275 (5%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD+ FK P + +DI + + ++ +++ DLL+S Sbjct: 4 QPHDSFFKQIFSDPRRVKTLLDIFAKDVAKS---IHSITPVNTEKFSSKSQKFMLDLLFS 60 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 K ++ YI +V+EH+S ++ + ++ Y+ A + + + P ++ ++FYHG Sbjct: 61 CKVKDQDAYIRIVLEHKSYLDKELPIQLSYYNAAIWEEAIKEK-EYYPPIINIVFYHGKG 119 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE----LIQKHI 183 + L + + + + + L+D+ V DDE++ + + KH+ Sbjct: 120 EWNIPTS--LPVLEDQNLEKYVSKLNYILIDLNKVSDDELINEAYIDFCFTSAVIAMKHV 177 Query: 184 RQRDLLGLVDQIVSLLVTG---NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 + + + + LV + ++ LF + + Sbjct: 178 HEN--IEKIKAVFRPLVEYVQIHEDEEGYHCLFFSFNYISYVKGDTKEAENALKELIGGD 235 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL 275 +K MT+ ++ EG +GK E ++ GL Sbjct: 236 KKAMTLIEKWIMEGLEKGKQEGLQEGLEKGKQEGL 270 >UniRef50_C6HY29 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY29_9BACT Length = 319 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 56/316 (17%), Positives = 111/316 (35%), Gaps = 25/316 (7%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQYY 61 ++ TPHD FK + + LP + + D +L P + E L R Sbjct: 1 MAKNLTPHDVFFKEIFSQREILSSALSELLPEDVVRRMDFDSLAYLPGESVGEGLSRSTR 60 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 +DL++SV E G + V++EH+S P+ + F++++ + +L G + LP +LP+L Sbjct: 61 ADLVFSVSFGEREGRLVVILEHKSHPDPRVHFQILQMMVMGWMQNLREGREPLP-ILPIL 119 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM--ALLELI 179 FYHG S IAR + +D+ ++ D I + + L Sbjct: 120 FYHGQGSWSIPDRFSERMKIPREIARYLPDFELLRIDLGLIDDTRIRSLQNVLAGAALLS 179 Query: 180 QKHIRQ--RDLLGLVDQIVSLLVT-GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA--E 234 KH+ + R L+ + + ++ + +Y +A Sbjct: 180 MKHVFENPRRFFHLLIEFGRERSAPHDIIEKIVLVALDYAGHVHKNIPDEELYNIMAAIT 239 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKH----------------EEALRIAQEMLDRGLDRE 278 + + EEG +G + ++ + + Sbjct: 240 EEAGMETTTERLKKIWIEEGIQKGVQLGIQQGVQQGVQQGVRQNQIKTILSLSKHNFTPQ 299 Query: 279 LVMMVTRLSPDDLIAQ 294 + + L ++ Sbjct: 300 QIADLLSLELPEVERV 315 >UniRef50_C0GW49 Putative uncharacterized protein n=6 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GW49_9DELT Length = 339 Score = 140 bits (352), Expect = 6e-32, Method: Composition-based stats. Identities = 58/264 (21%), Positives = 119/264 (45%), Gaps = 5/264 (1%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S TS HD F++ L ARDF+ HLP + + +L T+K+ S++ ++L++ + Sbjct: 7 MSDTSKYHDHTFRAILGREPVARDFVRYHLPEEITRDMNLDTVKVSSRSYVSDNLKESMT 66 Query: 63 DLLWSVK-TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 D++ +++ IY+++EH+S + ++ +Y Q+ + LP+++P++ Sbjct: 67 DIVITLELITGEPAEIYILVEHKSDLDAWTKIQLFKYMNEVWQSFIQKKTGTLPIIVPLV 126 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIA--RKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 FYHG YSL + D F P+ + I L ++ V+ ++ + + L+ Sbjct: 127 FYHGTARW-NYSLEFSDLFNLPSEHYRKYIPKFEHLLHEVPVINKKKVKSSITLEVFHLV 185 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA-ERAPQ 238 ++I + + + + LL G + + A E + P+ Sbjct: 186 LEYIFYPEKRDQIYEALELLFKGLDAKEAHEIFAILIKYLLIATDETPEEAEEKVKHLPK 245 Query: 239 EKEKLMTIADRLREEGAMQGKHEE 262 E + T A+ L E G + E+ Sbjct: 246 GGETVRTTAEVLEERGYNKAIKEK 269 >UniRef50_C4FIM1 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FIM1_9AQUI Length = 316 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 53/296 (17%), Positives = 124/296 (41%), Gaps = 25/296 (8%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T PHD FK P + +DI P L + DL +++L + + + + +L Sbjct: 2 TDLQPHDQFFKQIFSEPKRVKSLLDIFYP-ELSQKIDLESIRLLNSEKYSQKVGKSLLNL 60 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 L+ K + ++ ++ EH+S ++ + +++ Y+ + Y+E P ++ ++ YH Sbjct: 61 LYECKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYEEYPPIINIVLYH 118 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL----ELIQ 180 G R + L + I R + L+D++ V D+E++ + L Sbjct: 119 GKRKWNIPAT--LPKTNSEIIERFANKLNYHLIDLSKVADEEMISKLYLDFCTVSALLTM 176 Query: 181 KHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 KHI + R ++ ++ + D + + +Y+ + Q + EI Sbjct: 177 KHIFEDLRKYKHILKKVFE-----HYQDGCVFIILDYISVVNNPQEVENVLKEI----LG 227 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++ +MT+ ++ + EG QG + + ++ + + + + P+++ Sbjct: 228 GEKDMMTLTEKWKMEGLQQGLQQGMIEGQKKAILKSIQLKF-----GRVPENIEKL 278 >UniRef50_Q04UG3 Transposase, YhgA-like n=8 Tax=Leptospira RepID=Q04UG3_LEPBJ Length = 304 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 75/302 (24%), Positives = 133/302 (44%), Gaps = 13/302 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 ++ + PHD + + + A F LP + +L DL L+L +SF+ E+L+Q + Sbjct: 1 MTEVNNPHDRLIRETFQDKKEAATFFKNTLPPEVVELLDLENLELTESSFVSEELKQEQT 60 Query: 63 DLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 DLL+ + + G +Y++ EH+S E + +++ Y +N +G + +V+P + Sbjct: 61 DLLFQIPLKSGNKSNVYLLFEHKSYLENTIYIQLLGYLTEIYRNQQRSG-ESFSVVIPFV 119 Query: 122 FYHGCRSPYPYSLCWLDEF-----AEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 FYHG + + D+F I L D+ + + ++ + Sbjct: 120 FYHGEKEW-KLGDRFSDQFVLTKQETDVFQDFIPDFKIDLFDLEGIELKKKLESITFQVT 178 Query: 177 ELIQKHIRQRDLL--GLVDQIVSLLVTGNTNDR---QLKALFNYVLQTGDAQRFRAFIGE 231 + + IR+RDL + + SLL+ + L+ L Y+ D + Sbjct: 179 LGVVQRIRERDLEFVSHLPGLFSLLLGIEEESKRVAILRKLLLYIYWARDLKPTELKRVL 238 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 + Q +E MT A+RL EG QGK E + A+ ML + E V+ +T LS DL Sbjct: 239 AISKLEQYEELTMTTAERLISEGIQQGKIEGKIETARNMLSEDIQLEAVLRITGLSKQDL 298 Query: 292 IA 293 Sbjct: 299 KD 300 >UniRef50_C5RH90 Putative uncharacterized protein n=2 Tax=Clostridium cellulovorans 743B RepID=C5RH90_CLOCL Length = 339 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 52/306 (16%), Positives = 106/306 (34%), Gaps = 15/306 (4%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 + + HD +K + +T I + L L S++ D + SD Sbjct: 17 NKKNNLHDKSYKDLFSNKETFLSLIQTFVSNTWGSKLTKENLVLVDKSYVLSDYEELESD 76 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--------LP 115 +++ + + + Y+++E QS + M R++ Y I + L ++ LP Sbjct: 77 IVYKARIGDHEVFFYMLLEFQSYVDYRMPIRLLLYMIEIWREILKNTSEKEFKRKSFRLP 136 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM-- 173 V+P++ Y+G ++ I + +D+ DE+ +++ + Sbjct: 137 AVVPIVVYNGEKNWTVARTLKEVISNSDIFGESILDFRYEFLDVNRFKKDELYENQNIAS 196 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD--AQRFRAFIGE 231 A+ L Q R L D ++ QLK V + + Sbjct: 197 AIFLLDQSISRIEFYNRLKDIVIEFNKLTVEEKAQLKHWLVNVNSEENNYKENIEKIFSS 256 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD---RGLDRELVMMVTRLSP 288 + ++L+EEG ++GK E + + L+ + L E + L Sbjct: 257 NKREVEIMTSNISKGLEKLKEEGKIEGKAEGKAELLIKQLNKKFKLLPMEYEKKIKALPE 316 Query: 289 DDLIAQ 294 L Sbjct: 317 KILDDI 322 >UniRef50_B9MN47 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B9MN47_ANATD Length = 324 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 51/317 (16%), Positives = 124/317 (39%), Gaps = 28/317 (8%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 + HD+ FK +P + + ++++++ ++I ++ Q Sbjct: 6 KEKLPAKEHDSTFKLLFENPKDIYLLLSKIINYSWANEIRESSIEIKKTNYITKEFSQVE 65 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPML 121 +D++ + ++ Y Y++IE+QS + M R++RY I+ + G ++LP ++P++ Sbjct: 66 ADVVAKARLKDRDVYFYILIENQSTVAKDMPERLLRYMISIWAEEIRNGVEKLPAIIPIV 125 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVP-DDEIMQHRKMALLELIQ 180 Y+G + S + F + +VDI + + + + + + Sbjct: 126 VYNGLDRRWEVSTDIIGAFDIFKND----IFKYKVVDIAQIDIKNYLQEEDVLTPIIFYL 181 Query: 181 KHIR--QRDLLGLVDQIVSLLVT-GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 + +R +L+ + +I L N + +V++ + ++ + Sbjct: 182 EQVRNDSNELVRRLQEIEQSLKKLSFNNIERFLLWSQHVIRPRLGNEQKKEYDKLVMKVR 241 Query: 238 QEKEKLM-----TIADRLREEGAMQGK---------------HEEALRIAQEMLDRGLDR 277 QE +LM +A L E + +E + A+ M+ G+ Sbjct: 242 QEGVELMGEFVSNVARLLDETKTKEFLAGVQQGIQQGIQQGIQQERIETAKRMIQLGISY 301 Query: 278 ELVMMVTRLSPDDLIAQ 294 E++ T LS +++ Sbjct: 302 EVISKATNLSIEEIEKI 318 >UniRef50_B3ETR6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETR6_AMOA5 Length = 275 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 65/239 (27%), Positives = 116/239 (48%), Gaps = 22/239 (9%) Query: 76 YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLC 135 Y+Y +IE+QS +LMAF M+ Y++A M+ HL+ GY+ELP+++ + Y G +SPYPYS Sbjct: 36 YVYTLIENQSTHNKLMAFSMLSYNVALMEQHLNEGYQELPIIVNICIYTGKKSPYPYSQD 95 Query: 136 WLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQI 195 D F +AR+ F L+D++V+ +E+++ +E + + R+RD L ++ Sbjct: 96 ICDYFEGVELAREQMFKHFKLLDLSVLSQEELLKDGTFGSVEALLRQGRERDYLNWINN- 154 Query: 196 VSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE-------------- 241 + ++ ++ Y+L T D + I E ++KE Sbjct: 155 -NQVLIWELVSNYGLSIVIYILTTDDKNDADYLMQAIIEAVLEQKEIIVTAAQQLRQVDI 213 Query: 242 ------KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + ++ +EEG G +A I + ML GL+ L+ VT +S + + Sbjct: 214 QTGLIKGIKEGIEQGKEEGVKLGIQAKAQAIDKSMLKEGLEISLIQKVTGISREAIEKL 272 >UniRef50_C0GTX5 Putative uncharacterized protein n=8 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTX5_9DELT Length = 338 Score = 134 bits (338), Expect = 3e-30, Method: Composition-based stats. Identities = 54/266 (20%), Positives = 109/266 (40%), Gaps = 6/266 (2%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 T+ HD+ K FL A + LP + K D + E +S++ + L+ YYSDL+ Sbjct: 3 TTNIHDSTIKYFLSDRLNAISLLKSMLPEEIVKQLDFNKIYYEKDSYLPKSLQGYYSDLV 62 Query: 66 WSVKTQEGV--GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-GYKELPLVLPMLF 122 SV T+ G ++ ++EH+S ++ + +RY + + + G LP+++P+L Sbjct: 63 VSVPTKCGSYVAKVFFLLEHKSTFKKNTPLQFLRYILEFWEQYQKNTGETRLPVIIPILI 122 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 H P + L + + F L D ++ + L + ++ Sbjct: 123 AHPEEGWKPTKVSDLVDLPSDDFKIFVPDFNFLLYDAVNDDPEDYDFDETLKALFTLWRY 182 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDR---QLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 R + + V + L+ + R ++ + +Y+ T D + + + Sbjct: 183 SRSPEFMQGVQKAFQLIKKVDPKARLLDFVQMILHYLEVTRDEKEYIDIQKIAETEIDEG 242 Query: 240 KEKLMTIADRLREEGAMQGKHEEALR 265 +E + TIA+ R EG + + Sbjct: 243 EEYMGTIAEMFRREGDERTEQRFLQE 268 >UniRef50_B2V9N0 Putative uncharacterized protein n=4 Tax=Sulfurihydrogenibium RepID=B2V9N0_SULSY Length = 312 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 62/302 (20%), Positives = 133/302 (44%), Gaps = 17/302 (5%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + PH+ FK + +DF+ I L + L + L++L+L P+ + +++ Sbjct: 1 MKNKESIQPHNWFFKQVFSNSKNVQDFLSIFL-SDLSQKIQLSSLELVPSEKFSNNQKKH 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 + DLL+ K + YI ++ EH+S ++ + ++M+Y+ + L P ++ + Sbjct: 60 FLDLLYKCKLNDKEAYIRLIFEHKSYVDKKLPLQLMQYNAVIWEEALKEK-DYYPPIINI 118 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR----KMALL 176 +FYHG + + + + + I + L+D+ + D+ + ++ + + Sbjct: 119 VFYHGQAKWNFPTTI--PDIEDEELDKYIQKLNYILIDLNEIEDENLKRYLKKNVDLIME 176 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 LI KHI D L + ++ ++ + D + L VL D ++ + EI Sbjct: 177 MLIMKHIH--DRLERIKTLLKDVIDECSEDCFVIILNYLVLVKKDYEKVKEVFKEII--- 231 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEML--DRGLDRELVMMVTRLSPDDLIAQ 294 +EK+M D+L+ EG M+GK E +++ G+ + + D++ Sbjct: 232 -GGEEKMMLFTDKLKMEGKMEGKIEILRENIIDLIDVKFGVVDKSITEKVN-QIDNIETL 289 Query: 295 SH 296 Sbjct: 290 KQ 291 >UniRef50_C6HZP6 Putative uncharacterized protein n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZP6_9BACT Length = 334 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 59/307 (19%), Positives = 115/307 (37%), Gaps = 18/307 (5%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQY 60 S ++TPHD+ FK + + L +L++L+ P I EDL R Sbjct: 17 KTSISTTPHDSFFKDVFGPGKGHLPSLIPLIDGSLASRIELSSLEYLPGESIAEDLARST 76 Query: 61 YSDLLWS-----VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP 115 SDL S + G I + EH+S + ++ A + L G K P Sbjct: 77 RSDLSASLLISNARIDGGDARIAFIFEHKSFLPHHIHIPLLSLVSALLSRDLREGRKPCP 136 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL 175 V+P++ YHG + P +A ++ L+D++ D+ + + Sbjct: 137 -VIPVVLYHGRAPWTLPARLSEALDLSPELAPRLPDFELTLIDLSRFSDETLKEKIAHPE 195 Query: 176 LEL---IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKAL----FNYVLQTGDAQRFRAF 228 + + KHI + ++ V L+ T + + LK + +Y+ + + Sbjct: 196 PLVSLSVMKHIFEP-PESVLGHFVRLIKTLSPSRDILKRIVDTTLHYISYVKKSHHPQEI 254 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD-RGLDRELVMMVTRLS 287 +EK + T+ D ++EEG +G +L L + + + + Sbjct: 255 RTIFTTFLAEEK--MTTVLDLIKEEGIQEGIQMGRDEAITRLLQHSSLSPQQIASILNVD 312 Query: 288 PDDLIAQ 294 +++ Sbjct: 313 LSRVLSL 319 >UniRef50_A4XMU7 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMU7_CALS8 Length = 313 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 52/313 (16%), Positives = 119/313 (38%), Gaps = 25/313 (7%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S D FK L + + + L L L +++ I+ R S Sbjct: 1 MSRKRRSADEGFKKVLTNRTNIKWLLTELL-EVLPIQIGLEDIEVIATESINRQWRARRS 59 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 D+++ +K ++ YI V++E QS EEL+ R++ Y + + + LP+V+P++ Sbjct: 60 DMVYKIKYKD--AYICVLLEFQSSKEELIHLRVLEYMLLIQKKY--TTKNLLPVVIPVVL 115 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM-ALLELIQK 181 Y G P + + + + + VD+ ++ D+++++ + A + K Sbjct: 116 YTGEEKWTPATCFEQNVVYGEDFKQFVQKFSLVFVDVRMIDDEKLLKSPNLLAAALYVDK 175 Query: 182 HIRQRDL-LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG---DAQRFRAFIGEIAERAP 237 + ++ + + + +V+ G + F+ + Sbjct: 176 VSDNPEKVAERLEYLSKHVKFSEEQKEEFCEWLYHVVLKGYGFSDEEVDEFLFKSDFLRL 235 Query: 238 QEKEKLMTIADRLREEGAMQ---------------GKHEEALRIAQEMLDRGLDRELVMM 282 E + A+++R+ + GK + L +AQ+M++ G + + Sbjct: 236 GVNEMFLNTAEKIRKGLEKELEKERKQGIQQGIQQGKEQALLEVAQKMIEEGAEDSFIAK 295 Query: 283 VTRLSPDDLIAQS 295 VT L + + Sbjct: 296 VTGLDMERIRQLR 308 >UniRef50_C0A240 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A240_9BACT Length = 365 Score = 129 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 59/317 (18%), Positives = 123/317 (38%), Gaps = 31/317 (9%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD +F+ P AR F+ LP L D TL + S I + L + D+++ + Sbjct: 35 DHDRIFRHAFSLPAVARQFLRTWLPPELVAQADWHTLTVTRISGISDTLGERREDVVYRI 94 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH--------------------LD 108 + YV++EHQ+K E+ MA R+M + + Sbjct: 95 NVNGRNVHFYVLMEHQTKTEKHMARRIMEETFLIWRQDEHDRAEAAKKEAPGKADRQSRR 154 Query: 109 AGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIAR----KIYSSAFPLVDITVVPD 164 + PLV+ M+ + G R P + + + F +V++ +P Sbjct: 155 RETDKFPLVISMVLHPGPRKWGKIWRLADLIDVPPRMEKWARTFMPDCGFIVVELAGLPL 214 Query: 165 DEIMQHRKMALLELIQK-----HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT 219 +++ + + I R + L+D++ S + +K L++Y++ + Sbjct: 215 EKLADGHLARAILGALQGNRLGLIDIRKIKRLLDEMFSDPDRASVGA-VVKQLWHYLISS 273 Query: 220 GDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 D + + IA + + +M +RL++ GA++ +H + + DR + L Sbjct: 274 SDLKEEQTKDIVIAHIPEEYRSNIMNTVERLKQAGALKAQHNAVIEALEVRFDR-VPEGL 332 Query: 280 VMMVTRLSPDDLIAQSH 296 + ++ + + H Sbjct: 333 REAIQGINDPERLRNLH 349 >UniRef50_C0GWA6 Putative uncharacterized protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GWA6_9DELT Length = 334 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 66/287 (22%), Positives = 117/287 (40%), Gaps = 10/287 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 HD FKSF + RDFI +LP ++K DLT ++++ ++ E+ +++YS Sbjct: 2 SKKIPNAHDICFKSFFSREEFVRDFIQYYLPEEIKKHLDLTIIEIDMEGYLSEEFKEFYS 61 Query: 63 DLLWSVKTQEG--VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--KELPLVL 118 D++ V + +Y + EH+SKP + + Y + L G + LP+++ Sbjct: 62 DVVAKVYFNDRVHELELYFLFEHKSKPYRFTILQTLNYQVQKWMRLLVEGKLNQHLPIIV 121 Query: 119 PMLFYHGCRSPY-PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 P++ Y+G +S L + I L DI + + M + Sbjct: 122 PVVIYNGYKSWNFSVQFEDLFQLPSEYYKDFIPQFRHILHDIGQMDEASFKTTTIMEIFH 181 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF-RAFIGEIAERA 236 L+ K+I +L + +I LL ND+ LF V + + E A+R Sbjct: 182 LLLKYIYYPELDTKIHEIYDLLEKLPDNDKLTDYLFIIVRYVMASGAIPEKRLLEHAKRF 241 Query: 237 PQEKEKLMTIADRLREEGAMQGK----HEEALRIAQEMLDRGLDREL 279 +E + A + E K + + +QEML + L Sbjct: 242 SGGEEMIGLAAREIEERVEQTRKPYWQKQAKVENSQEMLIKSLKMRF 288 >UniRef50_B9MMM9 Putative uncharacterized protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MMM9_ANATD Length = 315 Score = 128 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 48/316 (15%), Positives = 114/316 (36%), Gaps = 29/316 (9%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T +D +K + + F+ L K + + +++ I++ ++ SD+ Sbjct: 2 KTYKKYDEGYKKLFSNKENLIWFLQNVLNEERFKKIEKSDVEIIATESINKKWQKKISDI 61 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 ++ +K ++ + + IE QS+ ++ + R+ Y + E+P+V+P++ Y+ Sbjct: 62 VYKIKYKD--SFFCLTIEFQSREDKKILHRLYEYMHLI--QLKNKVNGEIPVVVPIVLYN 117 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-MALLELIQKHI 183 G P + +DI +P+++++ +A+ I + Sbjct: 118 GISHWKPNEQYNEIILFAKDFPEYAQNFKIIFLDIKSIPEEKLISAANVLAIAVYIDQVS 177 Query: 184 RQRDL-------------------LGLVDQIVSLLVTGN-----TNDRQLKALFNYVLQT 219 + L D + +++ + K V + Sbjct: 178 NNPERVLNRILNLRGKIHLNWEQREELADWLYEVILRSYGVSEEEAEEMFKKSGLEVDEL 237 Query: 220 GDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + + G E+ KE + + ++G QG IA++ML EL Sbjct: 238 FSSTAEKIKQGIEREKKKIAKEAMKQGMKQGMKQGMKQGMKRAIKLIAKQMLKDNQPIEL 297 Query: 280 VMMVTRLSPDDLIAQS 295 + T L+P+++ Sbjct: 298 ISKYTGLTPEEIKKLK 313 >UniRef50_C6PYR3 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYR3_9CLOT Length = 344 Score = 126 bits (317), Expect = 8e-28, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 99/285 (34%), Gaps = 10/285 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 HD +K + + D I + + K ++L S+I D + S Sbjct: 4 KKEMHHIHDKSYKDLFSNKELLVDMIQNFVKSSWIKEIKKDNIELVNKSYILSDYEELES 63 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--------EL 114 D+++ Y+++E QS + M R+ Y + L + L Sbjct: 64 DIVYKATIDGREVIFYILLEFQSYVDYSMPIRLFLYMSEIWREVLKNTKQAEVKSKEFRL 123 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM- 173 P ++P++ Y+G I + L+DI +E+M+ + + Sbjct: 124 PAIVPLVLYNGEYKWTVEKKFKNIINKSELFGNNIIDFEYILIDINKYEKEELMELKNLV 183 Query: 174 -ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 A+ L QK + + + D + L+ L I +I Sbjct: 184 SAVFLLDQKVDIEEFISRVKDIAIDFNNLTEEQKMMLRHWLRVTLSDELKGNLGEKIEDI 243 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDR 277 +E ++ + + +E + + E + +E +++G+++ Sbjct: 244 LIAKKEEVNRMTSNISKTIKETFAKTREEGMEKGIEEGIEKGIEK 288 >UniRef50_B1XMU9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XMU9_SYNP2 Length = 316 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 105/286 (36%), Gaps = 19/286 (6%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ--YYSDLLW 66 HD +FK L DF+ + P + + + +L ++ Q D++ Sbjct: 6 DHDLLFKELLT--TFFWDFLALFAP-EILETAEQNSLTFLTQEVFNDLPGQTRRNVDIVA 62 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 + + V +E+Q+ + A RM Y + + + P+ + Sbjct: 63 KLHFRGQETCFLVHVENQATSQADFAERMFLYFARLYEKYR-------LPIYPIALF--- 112 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR-Q 185 S + F+ +++I S +F + + +P + ++ L+ K Sbjct: 113 -SYRSPQRLEPETFSVAFPSKEILSFSFQTIQLNRLPWRDFLRQPNPVAAALMAKMNFSS 171 Query: 186 RDLLGLVDQIVSLLV--TGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 + + + + ++V ++ L + F + + F E+ PQE+ ++ Sbjct: 172 EERPKVKLECLRMIVTLRLDSARIHLLSGFVDTYLRLNMAEQQVFEQELHRIQPQEEAQV 231 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPD 289 + I EEG QG+ E A +++ R + + V+ +P Sbjct: 232 LRIVTSWMEEGLQQGRQEGRQEEACKLILRFVQQRFPEQVSGFAPQ 277 >UniRef50_C6HTR6 Probable transposase n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HTR6_9BACT Length = 216 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 53/215 (24%), Positives = 83/215 (38%), Gaps = 13/215 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL-RQYY 61 ++TT TPHD+ FK + L AP D ++L I E L + Sbjct: 1 MTTTPTPHDSFFKDVFGPGKANLPALLSLLDAPFASRIDPSSLTFLSGETIGEGLATSFR 60 Query: 62 SD-----LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 SD L+ ++EH+S P + F++ A L G LP Sbjct: 61 SDLVGSLLVADATVDGKPLEFVFLVEHKSSPARDIQFKLACLVTALWARFLREGKPPLP- 119 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEF-AEPAIARKIYSSAFPLVDITVVPDDEIMQHRK--- 172 V+P+L +HG +SP+ L + P +A + A ++D+T + DDEI + Sbjct: 120 VVPILIHHG-KSPWNQPLRLYETLGLRPELATGMLDYALHVIDLTRIEDDEIRRKIPDPE 178 Query: 173 MALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDR 207 + KHI L + + LL N Sbjct: 179 PQMSLAAMKHIHDP-LPAFLRVMADLLKEIEENRD 212 >UniRef50_B9MPV5 Putative uncharacterized protein n=5 Tax=Clostridia RepID=B9MPV5_ANATD Length = 331 Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 48/329 (14%), Positives = 116/329 (35%), Gaps = 42/329 (12%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 S +D FK FI +P P K + +++ I+ + SD+ Sbjct: 2 KLSRSYDVGFKKLFSDKINVCWFITEIIPEPRLKNYTQSDIEIVATESINAQWKARRSDM 61 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 ++ + +IY+++E QS+P + M R+ Y + + K LP+V+P++ Y+ Sbjct: 62 VYRLPYSS--SWIYLLVEFQSRPNKQMHCRIYEYVFLIQRKY--QIDKRLPVVVPVVLYN 117 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-MALLELIQKHI 183 G P + + + + +D+ +P+D+++ +A + + Sbjct: 118 GVEKWQPVTQFADNVEYAEDFPEYVQRLNYIFIDVRDIPEDKLLNGNNVLAAALYVDQVA 177 Query: 184 RQRD-LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG---------------------- 220 D ++ + ++ + + +L + + Sbjct: 178 TNPDSVVERLLELGKNIRIPDEQREELAEWLYHAVLKSYKIPREEINELFAKSKILGVEE 237 Query: 221 --------------DAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRI 266 + ++ +I + + E + + EG ++G+ E L I Sbjct: 238 MFQSTAMKIKKGLAEEKKKIRLESKIEGKIEGKIEGKIEGKIEGKIEGKIEGRMEAQLEI 297 Query: 267 AQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 A+ ++ G + + VT L + + Sbjct: 298 ARNLILEGAEDSFIAKVTGLDIEKVKELR 326 >UniRef50_A5USQ0 Putative uncharacterized protein n=4 Tax=Roseiflexus sp. RS-1 RepID=A5USQ0_ROSS1 Length = 330 Score = 124 bits (310), Expect = 5e-27, Method: Composition-based stats. Identities = 52/271 (19%), Positives = 96/271 (35%), Gaps = 19/271 (7%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQYYSDLLW 66 HDA+FK L R+FID+ P L D + D + +DL+ Sbjct: 6 DHDALFKLVLT--AFFREFIDLVAP-DLAAALDPAPPVFLDKESFADLFDPDRREADLVA 62 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 V+ ++ + + +EHQ++ + + RM RY + + P+ Sbjct: 63 QVRLRQHPATLLIHLEHQAQADAALDRRMFRYFARLYDRYDQ-------PIYPIALCSYP 115 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH-IRQ 185 R P + D A R + + + +V + + + A + L+ + + Sbjct: 116 RPRRPAA----DRHEVRAAQRTVLTFQYQVVQLNRMDWRAYLTTTNPAAMALMARMRVAP 171 Query: 186 RDLLGLVDQIVSLLVTGNTN--DRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 D + + LL R+L F + +A+ +A E+A KE + Sbjct: 172 EDRWRVKAACLRLLAGAPLTGAQRRLIGQFVDIYLPLNAREEQALAAEVARLPGAAKEVV 231 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 M + +G +G E E L G Sbjct: 232 MELITSWERKGRAEGLREGLREGRAEGLREG 262 >UniRef50_C9KKN3 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KKN3_9FIRM Length = 297 Score = 123 bits (308), Expect = 8e-27, Method: Composition-based stats. Identities = 46/307 (14%), Positives = 97/307 (31%), Gaps = 31/307 (10%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + T D++F+ + + D+ L F D Sbjct: 1 MCMKPKRTYKDSLFRHIFNDKRRLASLYESLTGRKVA-PRDIAITTLRGVFFND-----I 54 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY-------KE 113 +D+ + + ++ +++EHQS M RM+ Y LD+ Sbjct: 55 KNDISFRIGDRD-----IILMEHQSSWNPNMPLRMLWYVAKLYSRQLDSQEVVYRSRLIP 109 Query: 114 LPLVLPMLFYHGC-RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK 172 +P +FY+G P L D FA ++ + + + Sbjct: 110 IPAPEFYVFYNGSQDEPDYQKLRLSDAFAHATDTLELAVDCYNI-------NYSTQNKLL 162 Query: 173 MALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKA--LFNYVLQTGDAQRFRAF-- 228 + EL I + + + + L K L Q +++ F Sbjct: 163 DSCYELRCYSIFVQKVREGIQNGLELRTAIRQAITYCKTHDLMGDYFQKNESEVFDMVNF 222 Query: 229 -IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 + +++ + R G + G+ +++A +L +GL ++ T LS Sbjct: 223 KWDQKRALEVAKEDGVAIGEARGEARGKLLGERNAMMKVALSLLKKGLPVGVITESTNLS 282 Query: 288 PDDLIAQ 294 +++ Sbjct: 283 LEEVRKI 289 >UniRef50_C1DXM1 Putative uncharacterized protein n=5 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXM1_SULAA Length = 342 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 55/268 (20%), Positives = 114/268 (42%), Gaps = 18/268 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S +PHD FK F++I LP L + +LKL + ++++ Sbjct: 1 MSIEKSPHDWFFKMIFSQKQNVESFLEIFLPQ-LYECIIPNSLKLSDTEKFSKKYKKFFL 59 Query: 63 DLLWSVKTQEGVG-----YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 DL + K ++ G IY+V EH+S P++ ++ Y M+ + + V Sbjct: 60 DLAFDCKLKDKEGNTIDGQIYIVFEHKSYPDKHTPSQISFYKSVMMEED-ERLSRPYRPV 118 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 +P++FYHG +S + + + ++S ++ L D++ V + +++ + Sbjct: 119 IPIVFYHGEKSWNIPTDIPQQFNTLGNLEKYLHSLSYILFDVSKVDESFLIEKIYLNACL 178 Query: 178 ----LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 K+I + L + ++ L+ + D + V+ D + + EI Sbjct: 179 ISGVFTLKNIFK--DLKYLRPVLEKLILDDVKDCLYIIIDYTVIVKKDLETIEKILEEI- 235 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHE 261 +EK+MT+ ++ + EG +G E Sbjct: 236 ----GGEEKMMTLTEKWKMEGLKKGMEE 259 >UniRef50_Q7NIZ1 Gll2041 protein n=9 Tax=Cyanobacteria RepID=Q7NIZ1_GLOVI Length = 311 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 57/304 (18%), Positives = 111/304 (36%), Gaps = 30/304 (9%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED--LRQYYSDLL 65 T HD +FK L +FID+ A + + ++ + +Y +DL+ Sbjct: 2 TDHDRLFKELLS--TFFVEFIDLFF-ADVGNYLERGSIVFLEKELFSDITAGERYEADLV 58 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 + ++ + V IE+Q++ + + ++RM RY + + + P+ + Sbjct: 59 VKARFRDHQSFFLVHIENQTEAQSIFSYRMFRYFARLYEKYQ-------LPIYPIAVFSF 111 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH-IR 184 F + + + +V + + + ++ L+ + I Sbjct: 112 TEPLRAEPTAHRVAFPD----FTVLEFHYRVVQLNRLDWRDFLRQPNPVASALMARMRIA 167 Query: 185 QRDLLGLVDQIVSLL--VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 D + + + LL + + QL + F AQ R F E+A E+E Sbjct: 168 PADRPRVKLECLRLLATLRLDPARTQLISGFVDTYLKLTAQEERLFAAELATIGASEQEA 227 Query: 243 --------LMTIADRLREEGAMQGKHEEALRIAQEMLDR---GLDRELVMMVTRLSPDDL 291 + ++ R+ G +G+ EEAL I L R L + V+ LS L Sbjct: 228 VVQIVTSWMQQGLEQGRQVGRQEGRQEEALAIVLRQLSRRLGTLPAQNAERVSGLSTTAL 287 Query: 292 IAQS 295 A S Sbjct: 288 EALS 291 >UniRef50_D1PHY3 Putative uncharacterized protein n=2 Tax=Prevotella copri DSM 18205 RepID=D1PHY3_9BACT Length = 307 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 54/310 (17%), Positives = 96/310 (30%), Gaps = 20/310 (6%) Query: 1 MTISTTSTPHDAVFKSFLR-HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 M + D FK HP ++ LP L + +K P + + Sbjct: 1 MVMKYLDPKADLTFKKIFGNHPKRLISLLNALLP--LSDEEQIREIKYLPTELVPQLEGG 58 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLV 117 + + G + +E Q + + R++ + + G Y EL V Sbjct: 59 KNTIVDVLCTDVRGRKF---CVEMQMEWSDAFQQRVLFNASKLYVSQAKKGGKYSELQPV 115 Query: 118 LPMLFYHGCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 + + + + + + I F +++ I R M L Sbjct: 116 YSLNLINDIFAHDTPDFIHNYRIVHDKDSNKVIEGLHFTFIELPKFTPHSIADKRMMVLW 175 Query: 177 ELIQKHIRQR------DLLG--LVDQIVSLLVTGNTND---RQLKALFNYVLQTGDAQRF 225 I DLL + + V L +D R ++ V Sbjct: 176 LRFLTEINSNTKDIPADLLNDPEIGKAVEELEISGFSDAELRAYDKFWDSVSVERTLIDD 235 Query: 226 RAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 G+ + E + ++ E+G +GKHE IAQ +L GL E V T+ Sbjct: 236 SYQKGKEKGKQEGLAEGMEKGMEKGMEKGRAEGKHEANTEIAQRLLAMGLPAEQVSKATQ 295 Query: 286 LSPDDLIAQS 295 L + + S Sbjct: 296 LPLEIIKNLS 305 >UniRef50_C6IY67 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IY67_9BACL Length = 333 Score = 121 bits (302), Expect = 4e-26, Method: Composition-based stats. Identities = 54/317 (17%), Positives = 105/317 (33%), Gaps = 43/317 (13%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ--YYSDLLW 66 PHD FK L +FI + P L D + + + + + + DLL Sbjct: 27 PHDEAFKKLLH--TFFAEFIALFFP-ELESQLDFSQTRFLMQEQLVDVVGEEARTLDLLL 83 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 K +I + +E QS + RM Y + H L++P+ + Sbjct: 84 ETKYIGTDAFILIHLEPQSYRQADFHERMFIYFSRLFERHRKEHQ----LIIPIAIFTSA 139 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR-- 184 S + + I F V++ P + L+ K Sbjct: 140 ESKNER-----NSLNMSILGEDILQFRFLKVELINQPWRRFIDSNNPVAAALLAKMGYNK 194 Query: 185 --QRDL-LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 +R+L L + ++ L + L + D ++ + E+A++ +E E Sbjct: 195 GEERELRLAYLRMLLQLSQRLDQARLALVMSIADLYFEPDPRQDEEMLRELAKQYAKESE 254 Query: 242 KLMTIADRLREEGAMQGKHEE------------------------ALRIAQEMLDRGLDR 277 +M + +G +G E +IA+ +L +G Sbjct: 255 VIMELMPAWMRQGYEKGLEEGLEKGIEQGIEKGFEKGIEQGTLIERRQIARRLLSKGFTL 314 Query: 278 ELVMMVTRLSPDDLIAQ 294 E + +T+LS +++ Sbjct: 315 EEIADMTQLSIEEIKKI 331 >UniRef50_C8T759 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T759_KLEPR Length = 185 Score = 120 bits (301), Expect = 5e-26, Method: Composition-based stats. Identities = 81/185 (43%), Positives = 109/185 (58%), Gaps = 22/185 (11%) Query: 134 LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVD 193 +CWL FA+P IAR+IY FPL+DIT PDDEIM+HR++A+LEL+QKHIRQRDL+ L + Sbjct: 1 MCWLAGFADPDIARRIYGEDFPLIDITSTPDDEIMRHRRVAMLELLQKHIRQRDLMDLHE 60 Query: 194 QIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP---------------- 237 Q+V LL G T+ RQLK L +Y+LQ G+A AF+ +A+ P Sbjct: 61 QLVRLLALGYTSRRQLKTLLHYLLQAGNAADPVAFLRHLAQNVPRRPHKETLMNIAQFLE 120 Query: 238 ------QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 K+ L + E+G QG+ + A RIA+ ML GLD LV +T L+P+ L Sbjct: 121 QRGHQQGLKQGLEQGLQQGIEQGIEQGEQQTAERIARAMLANGLDLSLVAKLTGLAPECL 180 Query: 292 IAQSH 296 H Sbjct: 181 ARLQH 185 >UniRef50_C8PTN1 Putative uncharacterized protein n=4 Tax=Treponema vincentii ATCC 35580 RepID=C8PTN1_9SPIO Length = 303 Score = 119 bits (299), Expect = 9e-26, Method: Composition-based stats. Identities = 48/311 (15%), Positives = 106/311 (34%), Gaps = 30/311 (9%) Query: 3 ISTTSTPH-DAVFKSFLRH----PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 +ST + + D+VF + + L+ C + +KL+ +++ Sbjct: 1 MSTANRKYKDSVFVDLFSEDEKAKENFLSLYNALHGTNLQLSCPVENIKLDNVMYMN--- 57 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----- 112 +D+ V I V+ EHQS E M R ++Y + + Sbjct: 58 --IVNDVSCLV-----DNKIIVLAEHQSTINENMPLRFLQYIARLYEKLQKPTDRYLRTL 110 Query: 113 -ELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH- 170 ++P +FY+G ++ L + R + +I E++ Sbjct: 111 SKIPTPEFYVFYNGLNDYPETTVLKLSDAFITKPERIPLDLEVKVYNINKSKGAEVLSRC 170 Query: 171 RKMALLELIQKHIR-------QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQ 223 + + L + +R + V + + R+ + + N ++ D Sbjct: 171 KTLDEYSLFIEEVRLQTQLDPENGFTNAVKICIEKGILKEYLQRKSREVINMLIAEYDYD 230 Query: 224 RFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 A E A + + + + +G QG H++AL A+ M + + + Sbjct: 231 TDIAVQREEAGKIA-FAKGISQGLSQGISQGLSQGSHQKALETARLMKQANCEIPFIAKM 289 Query: 284 TRLSPDDLIAQ 294 T L+ ++ + Sbjct: 290 TGLTQAEVESI 300 >UniRef50_B0K503 Putative uncharacterized protein n=12 Tax=Thermoanaerobacteraceae RepID=B0K503_THEPX Length = 360 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 39/268 (14%), Positives = 101/268 (37%), Gaps = 10/268 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 HD +K L + + + D + SF+ +D + Sbjct: 7 KEAIHNQHDKGYKFLLSSKRVFIELLRSFVKQEWVNDIDEANVVKVDKSFVLQDFADKEA 66 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--------AGYKEL 114 DL++ VK ++ Y+++E QS + M +R++ Y + ++ L +L Sbjct: 67 DLVYRVKLKDKEVIFYILMELQSTVDYQMPYRLLLYMVEIWRSILKDTPRKESRRKDFKL 126 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMA 174 P+++P++ Y+G + + + L+D+ +E+++ + Sbjct: 127 PVIVPIVLYNGDHKWTAKTSYKETLNSYETFGEYAVDFKYILIDVNRYTKEELLKLENLI 186 Query: 175 LLELIQKH-IRQRDLLGLVDQIVSLLVTGNTNDR-QLKALFNYVLQTGDAQRFRAFIGEI 232 + + + +++ + ++ +L + ++ KA F +L + R I I Sbjct: 187 ASVFLLEQKVEFEEIMKRLKELSEILNNLDKDEILLFKAWFKKILLARLPEEERENIERI 246 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKH 260 + + +E + + + +E + K Sbjct: 247 IDENKEVEEMISNLEKTILQEMKEREKR 274 >UniRef50_A8PLG1 Transposase n=1 Tax=Rickettsiella grylli RepID=A8PLG1_9COXI Length = 212 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 68/210 (32%), Positives = 107/210 (50%), Gaps = 5/210 (2%) Query: 90 LMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEF-AEPAIARK 148 + F++ RY A M HL G+ LP+V+ ML+Y G +PYPY+ D F IA K Sbjct: 1 MTPFKIARYVHAIMDQHLKQGHAFLPIVVAMLYYRGKVTPYPYTGNIFDCFGKNKTIAEK 60 Query: 149 IYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI-RQRDLLGLVDQIVSLLVTGNTNDR 207 IY +P++DIT + DD I H +A+L+ QK+ RD+ ++ I+ L G Sbjct: 61 IYLRPYPIIDITALSDDAIRGHGSIAILDFAQKYAAFNRDIQDGIEHIIGELKKGYLTRE 120 Query: 208 QLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL---MTIADRLREEGAMQGKHEEAL 264 Q + L Y + D + + ++ E++ + I + + G QG++EE L Sbjct: 121 QCQTLLYYTFRETDTDNVKMLLEQLQTIRIYEEDIMSVAHKIEQQGLQRGLQQGRYEEDL 180 Query: 265 RIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +IA+ ML +G DR + VT LS DL+ Sbjct: 181 KIAKRMLAKGTDRGYIKDVTGLSDQDLLNL 210 >UniRef50_A5D0D4 Putative uncharacterized protein n=10 Tax=Clostridia RepID=A5D0D4_PELTS Length = 332 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 49/286 (17%), Positives = 102/286 (35%), Gaps = 20/286 (6%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED--LRQY 60 ++ HD +FK L +F+++ P + DL +K + ++ Sbjct: 1 MNKDQVDHDRLFKQLL--ETFFAEFMELFFPEA-AQATDLEYVKFLQQELFTDITAGEKH 57 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +D++ + ++ G I V +E QS ++ RM Y + + +LP+ Sbjct: 58 RADIIVETRLKDEPGLILVHVEPQSYIQKEFNERMFIYFSRLYEKYRRK-------ILPV 110 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 + Y + D F + F +++ + + ++ L+ Sbjct: 111 AVF-----TYDHIRNEPDSFEIGFSFLDVLRFHFYKLELKKLHWRDYIRSDNPVAAALLS 165 Query: 181 KH-IRQRDLLGLVDQIVSLL--VTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 K R + + + + + +L + + +L F + Q F E+ + Sbjct: 166 KMGFRPEERVQVKLEFMRMLARMKLDPARTELIGGFFETYLKLNRQEEEEFYRELGKIDK 225 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 +E E +M I E+G M+G+ E L E G R V Sbjct: 226 KEVELIMQITTSWHEKGRMEGRLEGRLEGRLEGRLEGEARGKVEKA 271 >UniRef50_Q73P51 Conserved domain protein n=7 Tax=Treponema RepID=Q73P51_TREDE Length = 292 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 45/304 (14%), Positives = 105/304 (34%), Gaps = 24/304 (7%) Query: 3 ISTTSTPH-DAVFKSFLRH----PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 +ST++ + D+VF + + L C + ++L+ +++ Sbjct: 1 MSTSNRKYKDSVFVDLFSEDERAKENFLSLYNALHGTNLPMSCPVENIRLDNVMYMN--- 57 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD------AGY 111 +D+ V G I ++ EHQS E M R + Y + Sbjct: 58 --IINDVSCLV-----DGKIIILAEHQSTINENMPLRFLEYIARLYEKLQAPTDRYLKKL 110 Query: 112 KELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-H 170 ++P +FY+G + L + + +++I ++I+ Sbjct: 111 SKIPTPEFYVFYNGKEDYPETTALKLSDAFITKPKQAPLELTVQVLNINTDKANKILTAC 170 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 + + L + +R++ L + + + + L + + Sbjct: 171 KPLEEYSLFVEEVRKQTQLDPENGFTNAIKICIEKGILKEYLMRKSREVINMLVAE--YD 228 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 + A Q +E L ++ +G G +++A+ IA+ G D + + T LS ++ Sbjct: 229 YDTDIAVQREESLRIGIEQGIRQGFSDGAYQKAIEIAKAFKQFGFDIDKIAEGTGLSREE 288 Query: 291 LIAQ 294 + Sbjct: 289 IEKL 292 >UniRef50_B6J6C6 Hypothetical cytosolic protein n=1 Tax=Coxiella burnetii CbuK_Q154 RepID=B6J6C6_COXB1 Length = 143 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 48/138 (34%), Positives = 82/138 (59%), Gaps = 1/138 (0%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + PHD F++ + A++F + HLP + K DL +L+L+ +SFIDE L+ + Sbjct: 1 MKKIHNPHDYYFRTAMSDTRVAKEFFEYHLPNNILKAADLNSLQLQKSSFIDEHLKASMA 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG-YKELPLVLPML 121 D+L+SVK GY Y+++EHQ P++LM +R++RY + + +HL Y LP+V+P++ Sbjct: 61 DVLYSVKLNRRPGYFYIIVEHQRNPDKLMPYRLLRYILRIIDHHLKKKDYLPLPIVVPLV 120 Query: 122 FYHGCRSPYPYSLCWLDE 139 FY+G + + L Sbjct: 121 FYNGKKRYPFQRIFLLYL 138 >UniRef50_C4G1D5 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1D5_ABIDE Length = 297 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 52/297 (17%), Positives = 115/297 (38%), Gaps = 15/297 (5%) Query: 10 HDAVFKSFLRHPDTARDFIDI--HLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D K L + D D ++ + K +L D+++ + Sbjct: 4 KDIAEKYLLSYNDVFADIVNGAVFGGEEIVKSNELADANGITQFKDDQNIHHEQVRDIAK 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 + V + ++ IE+QS P++ M R++ Y A ++ + G + + VL ++ Y G Sbjct: 64 FWKKNEVIFSFIGIENQSAPDKDMILRIISYDGATYKSQM--GNESIYPVLTIVIYWGKY 121 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 +A I F L+DI + E+++ + + + RQ++ Sbjct: 122 EWKAPVSLQERINCPRELADIIPDYRFKLIDIGRLSGKELIKFKS-DFRLVAEFIARQKE 180 Query: 188 LLGLVDQIV--------SLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER--AP 237 ++I L+ G+ ++LK + + G + EI R Sbjct: 181 YKPGKEEIKHPEELLDLLDLLAGDKRFKELKGKVKNIRKEGRIINMCELLDEIENRGIEK 240 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++ + ++ E+G +G+ LRIA++ D + +++M T L+ +++ Sbjct: 241 GIEQGIEQGIEKGIEKGRSEGEETATLRIAKKFKDSNVSIDIIMKATGLTKEEIEEL 297 >UniRef50_B0G834 Putative uncharacterized protein n=3 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G834_9FIRM Length = 369 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 49/315 (15%), Positives = 105/315 (33%), Gaps = 31/315 (9%) Query: 2 TISTTSTPH--DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 T + H D K P F+ + PL K ++ + F+ Sbjct: 8 TSNGVHNTHTKDNAAKIVFGDPVLCAQFLKGYTDIPLFKEIKPEDIENVSSHFLPLFQES 67 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL-----------D 108 SD + + Y+ +IEHQS+ + M+FR++RY + ++ Sbjct: 68 RDSDTVNKIWIGNSEIYLIALIEHQSENDFDMSFRILRYIVFIWTDYAAQQEKLHKGTTK 127 Query: 109 AGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIM 168 + P +LP+++Y G + F I S + +V + +++ Sbjct: 128 SKDFLYPPILPIVYYEGSSTWSAPLNFKNRVFLSDVFGDYIPSFNYLVVPLNKYSKQDLI 187 Query: 169 QHRKMALLELI---QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF 225 + L + + + L + + + +T +T D LK + + Sbjct: 188 EKNDELSLIFLINQLQSSSEFHALKDIPKKYTEHLTEDTPDYLLKIIGKVIAVLLHKLNV 247 Query: 226 -RAFIGEIAERAPQEKEKLM--------------TIADRLREEGAMQGKHEEALRIAQEM 270 + E+ ++ + K +M + R EG ++G+ + + Sbjct: 248 PDEEVYEVTDQITRRKFSMMFDNFQAYDVQETRRVSREEGRLEGRIEGERAGRIEGERAG 307 Query: 271 LDRGLDRELVMMVTR 285 G L+ V + Sbjct: 308 RIEGERLHLIKQVIK 322 >UniRef50_B4SC57 Putative uncharacterized protein n=14 Tax=Bacteria RepID=B4SC57_PELPB Length = 299 Score = 118 bits (294), Expect = 3e-25, Method: Composition-based stats. Identities = 40/299 (13%), Positives = 94/299 (31%), Gaps = 10/299 (3%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + + D FK + +D + + A + + + ++L+ + + S Sbjct: 1 MCKINPRVDFAFKKLFGSEEN-KDLLISLINAIVSEEDQVVEIELKNPYNLADYRAGKIS 59 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPM 120 L K + G + +E Q + R + Y + L G YKEL + + Sbjct: 60 ILDIKAKAENGR---WFNVEMQISEDYNFDKRAIFYWAKLVTEQLSEGMMYKELKKTISI 116 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARK---IYSSAFPLVDITVVPDDEIMQHRKMALLE 177 P + + A + +++ + Sbjct: 117 NILDYNFVPDTTEVHSCYKIINTATGKDDRLHDVFELHYIELKKFNKLHHEISSTLDRWT 176 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 Q D ++ L + +FN + R ++ + ++ A Sbjct: 177 TFLTTAHQLDREHTPKELA-LDKNIVKAIAAIDRMFNEEERQVYEVRKQSLVDAESKIAS 235 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 ++ + + E+G +G + + IA +L +G+ + T LS ++ + S Sbjct: 236 ALEKGMEKGMEMGLEKGRDEGINAASKTIALNLLGKGIAIATIAEATGLSVLEITSLSQ 294 >UniRef50_B2V697 Putative uncharacterized protein n=6 Tax=Sulfurihydrogenibium RepID=B2V697_SULSY Length = 311 Score = 118 bits (294), Expect = 3e-25, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 97/244 (39%), Gaps = 12/244 (4%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 PHD FK P + +DI + L + DL +++L + + + + DLL+ Sbjct: 5 QPHDQFFKQIFSEPKRVKSLLDIF-YSELSQKIDLESIRLLNSEKYSQKIGKSLLDLLYE 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 K + ++ ++ EH+S ++ + +++ Y+ + YKE ++ ++ YHG R Sbjct: 64 CKIENEKSFLRIIFEHKSYIDKNLPSQLLYYNGILWEE--TGEYKEYLPIINIVLYHGKR 121 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL----ELIQKHI 183 + L + I R + L+D++ V D+E++ + L KHI Sbjct: 122 KWNIPTT--LPKTNSEIIERFSNKLNYHLIDLSKVADEEMINKLYVDFCTASALLTMKHI 179 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 + DL + + + D + + +Y+ + Q + EI + Sbjct: 180 FE-DLKKYKHILKKVFE--HYQDGCVFIILDYISVVNNPQEVENVLKEILGGEKEMTTLT 236 Query: 244 MTIA 247 Sbjct: 237 EKWK 240 >UniRef50_A9BGB3 Putative uncharacterized protein n=2 Tax=Petrotoga mobilis SJ95 RepID=A9BGB3_PETMO Length = 336 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. Identities = 66/319 (20%), Positives = 134/319 (42%), Gaps = 29/319 (9%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 ++ D++FK DF+ LP K T LK E I +D SD+L Sbjct: 2 SNPIKDSIFKELFEDRTVFYDFLKAFLPKETTKQIKETDLKREQTELIGKDFSIKRSDIL 61 Query: 66 WSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-------ELPLV 117 + ++ + G YIY+++EHQSK ++LMAFRM+ Y + + ++++ K +LP++ Sbjct: 62 YKIEKRNGQDVYIYLLLEHQSKVDQLMAFRMLAYKVRIWEQYVNSHKKESEQKGFKLPVI 121 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-HRKMALL 176 + M+FY G + + + + L++++ + ++ I+ + + ++ Sbjct: 122 IGMVFYDGKAKWTSPMDVKEKITEIKNMEEYLIKANYELINLSNIKEETIINMKKALGVI 181 Query: 177 ELIQK-HIRQRDLLGLVDQIVSLLVTGNTNDRQLK------ALFNYVLQTGDAQRFRAFI 229 L K ++R ++ L+ I ++ + + Q K A + D + + Sbjct: 182 LLTDKPNVRVKNAEELLKIINKDILLKLSEEEQEKFNKHRNAFIELFGKRTDYEEIKERF 241 Query: 230 GEIAE-RAPQEKEKLMTIADRLREEGAMQGKHEEA------------LRIAQEMLDRGLD 276 E+ E P+ L IA R RE+ ++GK E + I + D Sbjct: 242 EELKEMEVPKMFNTLEEIAKRDREKAKLEGKAEGKVEGKLEERRELIIEILNQRFGEDFD 301 Query: 277 RELVMMVTRLSPDDLIAQS 295 + L + + + + Sbjct: 302 KSLEEKIRNANEETINQIK 320 >UniRef50_Q2RKN5 Putative uncharacterized protein n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKN5_MOOTA Length = 304 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 58/297 (19%), Positives = 108/297 (36%), Gaps = 24/297 (8%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED--LRQYYSDL 64 HD +FK L R+F+++ PA L D T K I + ++Y D+ Sbjct: 2 PVDHDRLFKELLT--TFFREFMELFFPAA-HTLIDYTDTKFLTQEVITDITAGDKHYVDI 58 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 L VK + G + V IE Q+ + A RM Y + H VLP+ + Sbjct: 59 LAEVKIKGEDGCVLVHIEPQAYRQADFARRMFIYFSRLYEKHQKR-------VLPIAVFA 111 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 + + K+ F + + +P + + L+ K Sbjct: 112 HDSK-----VEETNRHEVEFPFLKVLQFEFYKIQLKRLPWRQYLNSNNPVAAALLSKMDY 166 Query: 185 Q-RDLLGLVDQIVSLLVTG--NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA-PQEK 240 R+ + + + + LL + +L F +A+ ++ +++E P+E Sbjct: 167 SPRERVQVKIEFLRLLTRMQLDPARMELITAFFDSYLVLNAEEEKSLQEKLSEELQPEEV 226 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGL---DRELVMMVTRLSPDDLIAQ 294 +++M + +G QG+ E I L + L E+ + LS + L Sbjct: 227 QRVMELTTSWHLKGWQQGRQEGRQEILLRQLRKRLGTTSPEVEAKIKTLSAEQLDDL 283 >UniRef50_Q24MW9 Putative uncharacterized protein n=4 Tax=Desulfitobacterium hafniense RepID=Q24MW9_DESHY Length = 295 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 44/303 (14%), Positives = 95/303 (31%), Gaps = 19/303 (6%) Query: 1 MTI-STTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED 56 M + + +D +FK + D F++ L +LT + L E Sbjct: 1 MYMAERLNRINDYLFKYIFGRQENKDILLSFLNAVLSP--AGEDELTDITLSDRELDPEH 58 Query: 57 LRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKEL 114 L+ S L +G + IE Q E+ + R + Y Q+ L +G YK+L Sbjct: 59 LKDKMSRLDILGVANDGS---LINIEVQIASEKNIDKRTLYYWAKIYQSQLQSGMLYKDL 115 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIY-SSAFPLVDITVVPDDEIMQHRKM 173 + + + P + E ++ +++ ++ Sbjct: 116 ARTVTVNVLNFSFLPDAQRYHSMFSLYEAHSGLRLNRDLEIHFLELEKWKALSTKPRTRL 175 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 + + ++L + ++ + L ++ + + I Sbjct: 176 DKWLMYLSNTDPKELEE-------IAMSEPAIGKALTVEEIFLKNDKERYLYEMREKGIR 228 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 + ++ +G QG IA ML +GL ++ +T L + + Sbjct: 229 DHLSAMDNAKTEGIEQGLAQGIAQGIERGKTEIALSMLKKGLSLNMIAEITDLPIEQIEE 288 Query: 294 QSH 296 H Sbjct: 289 IRH 291 >UniRef50_UPI0001C351D8 hypothetical protein ChatD1_33675 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C351D8 Length = 313 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 99/304 (32%), Gaps = 30/304 (9%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 I D +F+ + D + DL L ++ Sbjct: 3 QIKLNRNYKDRLFRLAFQEKKDLLDLYNAVSGRQYTNPDDLIITTLADAIYLGMK----- 57 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--------AGYKE 113 +D+ + V + + EHQS M R + Y + ++D Sbjct: 58 NDISFLVSD------VLNLYEHQSSFNPNMPVRGLNYFADTYREYIDRNGFDIYGEKLIR 111 Query: 114 LPLVLPMLFYHG-CRSPYPYSLCWLDEF-AEPAIARKIYSSAFPLVDITVVPDDEIMQH- 170 LP+ ++FY+G P L D F + + +++I + E+M Sbjct: 112 LPMPQYIVFYNGTKEEPDRIELRLSDAFLCQNPEEKGCLECRATMININYGHNKELMDRC 171 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 R++ + IR + G+ + V + + + +L+ A+ + Sbjct: 172 RRLKDYAVFVSRIRNNEKRGM---ALDEAVKQAVHSCIEEGILADILKKNRAEVCNLILY 228 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 E E + +L + + G +G+ E + I + M +GL+ + + L Sbjct: 229 EYDE-----QRQLAIAREGAMKAGREEGRAAEQVTIIRNMAGKGLNPSAIADMLGLEEGY 283 Query: 291 LIAQ 294 + Sbjct: 284 VKKV 287 >UniRef50_D0LPI9 Putative transposase n=2 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPI9_HALO1 Length = 338 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 53/270 (19%), Positives = 106/270 (39%), Gaps = 28/270 (10%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 +D + ++ + A D LP L K DL L L +++ ++LRQYY+D+L+SV Sbjct: 24 YDVLVETTFARREYAADTFRTMLPPALVKRLDLDALSLRSGTYVSDELRQYYTDVLYSVL 83 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYHGCR 127 +IY++++HQS + + R+ R ++ + +L LP++LP++F+H Sbjct: 84 LDGEQAFIYLLLKHQSATDPMFPLRLPRNVLSIWERYLIERQDATTLPVILPIVFHHEAT 143 Query: 128 SP--------------YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM 173 + + + + F + + + + + Sbjct: 144 GWSDAVGLNGSLALGADVRTALSANRRDFRRLRYLLLVLCFQFDEASRAQN----LNEAL 199 Query: 174 ALLELIQKHIR-QRDL---LGLVDQIVSLLVTGNTNDRQLKALFNYV--LQTGDAQRFRA 227 LL R +RDL L + ++ +V L + ++ D ++ Sbjct: 200 GLLMRTFGVARPKRDLVASLKGWEDVIREVVATQRGREMLATVVQFILENSETDPDELKS 259 Query: 228 FIGEIAERAPQEKEKLMTIADRLREEGAMQ 257 F+ A + MT ADRL + + Sbjct: 260 FLEFTAG--EPARTAFMTGADRLTQGVREE 287 >UniRef50_C6VTD5 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VTD5_DYAFD Length = 308 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 41/302 (13%), Positives = 92/302 (30%), Gaps = 23/302 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK +D + L + L N + Sbjct: 10 DFGFKRIFGSEAN-KDILIDFLNVLFAGERLVADLTFASNENNGRIPILRRAIFDLCCTG 68 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVLPMLFYHG- 125 +G +I IE Q +E R + YS + +++ ++AG +L V + Sbjct: 69 ADGEQFI---IEVQRVRQEYFKDRCLYYSASLIRDQVEAGGTNWRYDLKPVYLIGLMDFC 125 Query: 126 -CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 S + L + +++ E ++ + K++ Sbjct: 126 FEDSDDGHYLHEIRLIKRSNGQVFYDKFGLTFIEMPAFQKKESDLSTELDRWLYLLKNLS 185 Query: 185 QRDLLG------LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA-- 236 + +++ + ++ + N N + A Y+ D + + + A R Sbjct: 186 KLNIVPPVLTNPVYQKVFRVAEVCNLNKEEKMAWDAYLKAKWDNENSMDYAKKEAMRVGH 245 Query: 237 -----PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 KE ++G G ++ + ML +G D + + +T L+ + + Sbjct: 246 EEGHKEGHKEGHKEGMKEGIKKGRETGIELGKRQVVKNMLAKGFDMQTISDITGLTFEQI 305 Query: 292 IA 293 Sbjct: 306 RN 307 >UniRef50_D0YJF1 Putative transposase YhgA family protein n=1 Tax=Klebsiella variicola At-22 RepID=D0YJF1_KLEVA Length = 190 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 70/180 (38%), Positives = 107/180 (59%), Gaps = 16/180 (8%) Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 P+ + P A+ +Y F L+D+TV+PDD+++QHR++ALLEL+QKHIRQRDL Sbjct: 11 PHDAVFKRFLRHPETAKTLYGCPFTLIDVTVMPDDDLVQHRRVALLELMQKHIRQRDLSS 70 Query: 191 LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRL 250 + + + ++++ G TN RQL+ LF+Y+LQ G+ F+ +A R PQ +E LM+IA +L Sbjct: 71 ITESLAAVVMLGYTNRRQLRMLFHYMLQYGNTAEPGVFLRRLARRLPQYEETLMSIAQKL 130 Query: 251 REEGAMQGKHEEALR----------------IAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++EG +G+ E IA ML GLD+E+V +T LS D+L Sbjct: 131 KQEGRQEGRLEGREEGHQEGLQEGSRREALRIAGSMLQNGLDKEMVQKITGLSADELQPL 190 >UniRef50_UPI0001C353CE hypothetical protein ChatD1_20495 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C353CE Length = 319 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 95/294 (32%), Gaps = 28/294 (9%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 D +F+ + + + DL L+ ++ + +D Sbjct: 22 KVNKKYKDRLFRMVFNRKEELLSLYNAVSHSEYTNPDDLEINTLDDVIYM-----KMKND 76 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--------YKELP 115 L + + + + EHQS M R Y + + ++D LP Sbjct: 77 LAFLI------DDVLNLWEHQSTWNPNMPVRGTFYIVEEYRKYIDQNGLNLYGSSRITLP 130 Query: 116 LVLPMLFYHGC-RSPYPYSLCWLDEFAEPAIA-RKIYSSAFPLVDITVVPDDEIMQH--R 171 + +FY+G P L D F+ +++I ++E+M+ Sbjct: 131 VPQFYVFYNGLREEPDYIELKLSDAFSRVHSEVEPCMEFKAVMLNINRGHNEELMRQCTT 190 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 E + + + + +++ ++ D + L A+ F + E Sbjct: 191 LREYAEFVARIRDETEDGTALEEAAMNVMDSCIRD----GILAEFLSVHRAEVFEVLLTE 246 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 E+ EK ++ EG +G E+A +A ++ +G E + Sbjct: 247 YDEQRHIASEKEIS-RREGHMEGRTEGILEKAKEVAVNLIKKGFTVEDAASICG 299 >UniRef50_A6LFH9 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LFH9_PARD8 Length = 295 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 39/294 (13%), Positives = 95/294 (32%), Gaps = 18/294 (6%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK+ + F++ L + + E + +++ + Sbjct: 10 DVGFKAVFQDKQVTIKFLNA----ALAGERQIKDITYLDKEIKPETVENRT--IIFDLLC 63 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFYHGCR 127 ++ G +++ E Q+ P+ R Y + G + L + + F + + Sbjct: 64 EDVSGAKFIL-EMQNCPQHYFFNRGFYYLCRMVARQGQIGKQWQYRLLPIYGVYFLNF-K 121 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAF--PLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 P A + + + + K++ Sbjct: 122 LPEFTDFRTDVVLANERTGKVFNEIKMKQIYISFPLFSLSKEECKSSFERWIYTLKNMNL 181 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI-----AERAPQEK 240 + ++ + L + + + + + + +R + I + Sbjct: 182 FEQSPFKEEQETFLRLLDVANVNSLSEKERAIYEENLKNYRDWYATIDYAQTEGIEKGMQ 241 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 E + + E+G +G+ EE L+IA++M +GLD EL+ + LS +D+ Sbjct: 242 EGMQKGMQKGIEKGIEKGRQEEKLQIARKMKKQGLDSELIAQCSGLSVEDIERL 295 >UniRef50_C6XV81 Putative uncharacterized protein n=4 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV81_PEDHD Length = 318 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 102/296 (34%), Gaps = 17/296 (5%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + ++ + L + +T ++ N F E ++ + ++ V Sbjct: 28 DFSFKRLFATEE-SKPILIGLLNHLFKGRKYITEIEYGKNEFPGEIAQEGGA--VFDVYC 84 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-----LPLVLPMLFYHG 125 + G ++ IE Q +E R + Y A+ G ++ L V + F Sbjct: 85 TDVNGSKFI-IEVQRGNQEYFKERALFYVSRAISEQAPKGDRKGWAYKLTEVYLLAFLED 143 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSA---FPLVDITVVPDDEIMQHRKMALLELIQKH 182 P ++ + I F +++ + ++ KH Sbjct: 144 FNLPDSPKSEYVQDICLANRHTGIIFYDKVGFIFIEMLNFVKGSDELYTELDKWLYALKH 203 Query: 183 IRQRDLLGLV---DQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 + + + L N + Y + + +++ ++ Sbjct: 204 LTEFKQRPEYLSGPEFDQLFTLANYASLTPEERDMYNSSLKRKWDNKNVLDYAVKKSLEQ 263 Query: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 L ++ RE+G QG H++A+ IA EML E ++ +T+LS +++ + Sbjct: 264 --GLEQGLEQGREQGREQGIHKKAIEIALEMLVNKYPIEEIIKLTKLSKEEIQSLQ 317 >UniRef50_B7BFV9 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BFV9_9PORP Length = 293 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 39/290 (13%), Positives = 90/290 (31%), Gaps = 12/290 (4%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK D +++ + L +T L E + ++ +K Sbjct: 10 DRGFKHLFGQED-SKELLVDLLNGLFEGERVITELSFLNVEMPAESTDSRAA--VFDLKC 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFYHGCR 127 ++ G I++ +E Q+ P+ R + Y + + G EL V + + Sbjct: 67 KDKEGRIFI-VEVQNAPQTYFYERGLYYLCRIISDQDRRGNDWKFELYPVYGIFLLNFKS 125 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 + + + +++ +E + K++ + + Sbjct: 126 GKTDKVRTDIVLADRETGKQMSDTMRQIYLEMPFFNKEEAECETSLDYWLYTLKYMEKLE 185 Query: 188 LL--GLVDQIVSLLVTG-NTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 L Q+ L + K Y + + + E+ E + Sbjct: 186 TLPFKGQKQLFEKLERLAKIVNMNKKERMEYEESLKIYRDNQGVLDYAIEK--GYMEGVE 243 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 E+G +G + +A +M +G+D + VT L+ + + Sbjct: 244 KGLKEGIEKGLEKGMEKGIYLVAAKMKMQGIDFATITSVTGLNAETIATL 293 >UniRef50_B5U1X5 Putative uncharacterized protein n=1 Tax=uncultured bacterium RepID=B5U1X5_9BACT Length = 304 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 28/306 (9%) Query: 2 TISTTSTPHDAVFKSFLRH----PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 + + D++F + + L+ L L+ + Sbjct: 6 PTNENRSHKDSLFVDYFSKDRDWKQHFLSLYNALHGTNLQVADTL--LERVNIDQV--LY 61 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 + YY+D+ V G ++IEHQS M R++ Y N +D+ K + Sbjct: 62 KSYYNDIAVLV-----NGQFILMIEHQSTINPNMPLRLLEYVARIYGNLVDSKAKFSRHL 116 Query: 118 LPM------LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 +P+ +FY G + P S L + + + V I Sbjct: 117 VPLARPEFYVFYTGDQKLPPESYLHLSDSFPNQPPKADLTLEL------KVKVCTIKSDH 170 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 ++ + L LV++ + +A+ +L+ +R + Sbjct: 171 PSPVVHRCPDLEQYAQFLKLVEEAKAAGQAEPLTWAIQEAVRRNILRDYLERRGGETLSI 230 Query: 232 IAERAP---QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 + + + G +G ++ L A+ +L GL ++V T L Sbjct: 231 LMAEYDYATDFAVQKEEAYEDGLFAGLERGAYQNKLETARSLLSEGLAPQMVARCTSLPL 290 Query: 289 DDLIAQ 294 + + Sbjct: 291 ETVQQL 296 >UniRef50_A6LFA9 Putative uncharacterized protein n=22 Tax=Bacteroidales RepID=A6LFA9_PARD8 Length = 305 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 41/301 (13%), Positives = 94/301 (31%), Gaps = 22/301 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + +D + L L + L++ N + E + +T Sbjct: 10 DFGFKHIFG-REMDKDILIEFLNDLLEGEYTIMDLRIMNNERLPETEQGRKVIFDIHCET 68 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSI-AAMQNHLDAGYKE-LPLVLPMLFYH---- 124 +G ++IE Q++ + R + Y + ++ + + L V + F + Sbjct: 69 DKGER---IIIEMQNREQPHFKDRALYYLSHSVVEQGIKGTWDYELAAVYGVFFLNFTLD 125 Query: 125 ---GCRSPYPYSLCWLDEFAEPAIARKIYSSAF--PLVDITVVPDDEIMQHRKMALLELI 179 G D ++++ F +++ +E + Sbjct: 126 EENGPDKNGKEGKFRRDIILADRENGQVFNPKFRQIYIELPRFNKEEEECETDFERWIYV 185 Query: 180 QKHIRQRDLL------GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 KH+ D + + +++ + N +Q D F E Sbjct: 186 LKHMDTLDRMPFKARKAIFERLERIGSMANLTPKQRAQYEAEWKMYNDYYNTLDFAVE-K 244 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 +E + + +EG +G + A+ M G+ ++ T LS +++ Sbjct: 245 GMKKGMEEGMEKGLQKGLQEGLQEGLQKGKESTARNMKAEGITPLIIQKCTGLSLEEIER 304 Query: 294 Q 294 Sbjct: 305 L 305 >UniRef50_B8FTH9 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=B8FTH9_DESHD Length = 325 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 41/321 (12%), Positives = 93/321 (28%), Gaps = 42/321 (13%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D FK + F++ L P + + + E S L Sbjct: 10 DYAFKLIFGKEGNEAILIAFLNAALKLPQERRI--EEITIINPELNKEYPEDKKSILDVR 67 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL----PLVLPMLFY 123 T +G + + IE Q + M R + Y + G + + ++ + Sbjct: 68 AITSQG---MQINIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDF 124 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV-----PDDEIMQHRKMALLEL 178 + + Y + E +++ + + + ++ L Sbjct: 125 NYLKQTSSYHNVFHLYEDEEKFQL-TDVLEIHFMELPKLLAKWRKREISLWENELVRWLL 183 Query: 179 IQKHIRQRDLLGLVDQI-------VSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 + + +++L ++++I + + Y + +A I E Sbjct: 184 LLEGADNQEILQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIRE 243 Query: 232 IA-----------------ERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 RA E R EG +G+ E +A+++L G Sbjct: 244 AELRLQEALEEGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEGRAEVAKKLLVLG 303 Query: 275 LDRELVMMVTRLSPDDLIAQS 295 + + T LS +++ Sbjct: 304 FEITKIAEATGLSEEEISGLK 324 >UniRef50_D1P8S5 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1P8S5_9BACT Length = 303 Score = 104 bits (260), Expect = 3e-21, Method: Composition-based stats. Identities = 37/310 (11%), Positives = 98/310 (31%), Gaps = 25/310 (8%) Query: 1 MTISTTSTPH-----DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE 55 M + + D FK +D + L + + + + + Sbjct: 1 MIMKQVEERYISLLTDFGFKRIFGT-AMNKDLLICFLNSLFNGRQVVKDVSYLNPEHVGD 59 Query: 56 DLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--- 112 + ++ V + G ++ +E Q+ + R + YS ++ G + Sbjct: 60 VYTDRRA--IFDVYCEGENGEKFI-VEMQNAYQTYFKDRALFYSTFPIREQAPKGNEWDF 116 Query: 113 ELPLVLPMLFYH---GCRSPYPYSLCWLDEFAEPAIARKIYS-SAFPLVDITVVPDDEIM 168 +L + + + + + + + A + Y + V+I+ Sbjct: 117 KLNNIYTVALLNFNMNEDAFDKEKIRHHVQLCDTATHKVFYDKLEYIYVEISKFNKTLEE 176 Query: 169 QHRKMALLELIQKHIRQ--RDLLGLVDQIV-SLLVTGNTNDRQLKALFNYVLQTGDAQRF 225 K++ + + L D++ L + + Y + Sbjct: 177 LDTLYEKWLYALKNLYKLTQRPKELCDKVFDRLFEEAEIAKFTPQEMREYETSKMAYRDI 236 Query: 226 RAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 + + + + E+G +G + +L IA++ML +G+D +M +T Sbjct: 237 KNSVDTAKRE------GIAEGIEIGMEKGRAEGMNLRSLEIARKMLAKGMDEASIMDMTG 290 Query: 286 LSPDDLIAQS 295 L+ +++ Sbjct: 291 LTSEEIKLLK 300 >UniRef50_C0R0H3 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0R0H3_BRAHW Length = 292 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 37/298 (12%), Positives = 91/298 (30%), Gaps = 15/298 (5%) Query: 2 TISTTSTPHDAVFKSFLRHP---DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 + + + +D + DFI+ + K ++++ E+ Sbjct: 5 SNNNFNVLNDYFVRYLFSDKGSEAILLDFINSIMLDSGMK--TFRSVEILTPFNYKENYE 62 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPL 116 + TQ G V+IE Q + R++ Y + L G K L Sbjct: 63 DKETITDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWASNYSKLLKQGEKYDALTP 119 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 V+ + + + ++++ + + L Sbjct: 120 VISINLLNFNLDDNDSIHSCYMIYDTNNKRLLTDHLQIHIIELKKFKYNSLEYDLNCWLK 179 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 K ++++ + L+ + + N++ + + Sbjct: 180 FFTMKDKDNKEVI-----MSELVKEKPIMEEVQRRYNNFIKDRLMMNEYDKRQAYLYGNQ 234 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +E+ +EEG +G +E +A+ M ++ +D L+ +T LS + + Sbjct: 235 IMLEEERRLGRVEGKEEGIKEGIEQEKYSLARNMKNKNMDLNLISELTGLSIEKIEKL 292 >UniRef50_A5CBY6 Transposase and inactivated derivative n=47 Tax=cellular organisms RepID=A5CBY6_ORITB Length = 324 Score = 104 bits (259), Expect = 4e-21, Method: Composition-based stats. Identities = 48/316 (15%), Positives = 98/316 (31%), Gaps = 32/316 (10%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPL--RKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 +D FK +D + L L ++T ++ + + S + Sbjct: 9 PKNDVAFKKIFGSEKN-KDILIHFLNDILLFEGNREITEVEFLGTILDADIASKKESIVD 67 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG----YKELPLVLPML 121 K + G YI IE Q P + R Y+ A + G Y +L V+ + Sbjct: 68 VLCKDKNGAQYI---IEMQVDPTQGFEKRAQYYAAKAYGRQPNRGKEGKYSDLKEVIFIA 124 Query: 122 FYHGCRSPYPYS-LCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM-ALLELI 179 P + + + +F +++ + + + + Sbjct: 125 IADYKLFPNKEDYISRHVILDKKTYEHDLKDFSFTFIELPKFKKNRVEELSDITEKWCYF 184 Query: 180 QKHIRQRDLLG---------LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF-- 228 KH ++ L G ++ + L N ++ +L + + D + + Sbjct: 185 FKHAKETTLDGYHKIIGEDLIIKRAYEALDQFNWSEDELITYEQELKRIWDNKAVEDYKL 244 Query: 229 ---------IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 +GE E + + EG +GK E A ++L L E Sbjct: 245 ERAKAEGIKLGEAKGIKLGEAKGKAEGKAEGKAEGKAEGKAEAKKDFAIKLLKSELSVET 304 Query: 280 VMMVTRLSPDDLIAQS 295 + T LS +++ Sbjct: 305 IAEYTDLSIQEVLNLK 320 >UniRef50_C0CSV6 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CSV6_9CLOT Length = 317 Score = 104 bits (258), Expect = 5e-21, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 86/296 (29%), Gaps = 34/296 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 MT D +F+ D + + L L+ ++ Sbjct: 1 MT-KVNKKYKDRLFRLVFGDRRRLLDLYNALNGSHYEDPDALEITTLDDAVYLSMK---- 55 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--------K 112 +DL + V + + EHQS M R Y + ++ Sbjct: 56 -NDLSFLV------NGVLNLYEHQSTYNPNMPVRGFFYLADVYRKYVVEHKLNLYGSRLA 108 Query: 113 ELPLVLPMLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 +LP ++FY+G P L D F A +++I + + +M+ Sbjct: 109 KLPSPKYLVFYNGRKEEPDRKILRLSDAFQGGRNAEPCLELCAVMLNINLGRNQVLMERC 168 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 + ++ VD++ ++ + + ++ G + F + Sbjct: 169 R-----------TLKEYAQFVDRVRRMIAETGALESAVDCAVEDCIRDGILENFLSSHRA 217 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDR--ELVMMVTR 285 REE +G+ E E L GL E ++ + Sbjct: 218 EVLDVILTDYNEQEYIAMEREEAWEEGRAEGLTEGLSEGLSEGLSVSREAILDLLG 273 >UniRef50_C2LUG6 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LUG6_STRSL Length = 299 Score = 103 bits (257), Expect = 7e-21, Method: Composition-based stats. Identities = 61/299 (20%), Positives = 101/299 (33%), Gaps = 29/299 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D + K P+ FI L + L +L F +++L D+ K Sbjct: 13 DIMAKKIFSLPEVTVAFIRDILDLDVVDAQILEGTQLHKKDFDEDELFSTSVDV--RAKL 70 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD---------AGYKELPLVLPML 121 +G V+IE Q + + R Y + ++ Y+++ V + Sbjct: 71 NDGTE---VIIEIQVRKQHYFLNRFHYYLANQLVENVQQLRQQGQTHKMYEQMEPVYGIA 127 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL--I 179 P S A + +YS D + ++A LEL Sbjct: 128 ILEKTLLPDEESPINTYWMANSRTGKPLYSF---------YKDGKQQNLLQIAFLELDKY 178 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 K RD + L R + + + + Q +A I E Sbjct: 179 NKDKHIRDEGRQWLEFFGNLPFSKAPSRAVTHADSLLDSSSWTQEEKAMIDERIRIQENY 238 Query: 240 KEKLMTIADRLREEGAMQG----KHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + T D REEG QG ++E L + ++ML +GL E+V VT LS ++L Sbjct: 239 DMTMETAIDEAREEGLEQGLKRGRYEGQLELIRKMLAKGLSLEVVSDVTGLSLEELDGL 297 >UniRef50_C6Y2B5 Transposase and inactivated derivative n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2B5_PEDHD Length = 310 Score = 103 bits (256), Expect = 8e-21, Method: Composition-based stats. Identities = 41/295 (13%), Positives = 93/295 (31%), Gaps = 29/295 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK +D L L+ ++ +L+ N + E + + K Sbjct: 34 DLGFKRLFSAEQN-KDITITFLNHVLKGKREVVSLEFLKNEYPGETQEEGGVIIDIVCKD 92 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-----LPLVLPMLFYH- 124 Q G + ++E Q + R + Y+ + G ++ L V + Sbjct: 93 QIGA---FFLVEMQKSWNQNFKERSLFYASRLITEQAPHGNRKEWAYSLKDVYVIALLEK 149 Query: 125 -----GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 G + + + + ++ ++ F +++ E + Sbjct: 150 FTINAGNKGKWLHDIALVNTDTGKVFNERL---RFTYIELLSFKKTENQLETDLEKWIYA 206 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 K+ L + Q + + QL + + I + Sbjct: 207 LKN------LKHLKQAPAAF-----TEPQLLQFCQAARYINLTKEEKNMISAKTKARWDY 255 Query: 240 KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + REEG +G H++A +IA ++ ++G+ + +T LS ++ Sbjct: 256 YYAIDGAKIMGREEGETRGAHQKAAQIAIKLKNKGVPFTEIQELTELSITEIKNL 310 >UniRef50_C9LWJ8 Putative uncharacterized protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWJ8_9FIRM Length = 292 Score = 103 bits (256), Expect = 8e-21, Method: Composition-based stats. Identities = 44/298 (14%), Positives = 100/298 (33%), Gaps = 26/298 (8%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 T D++F R D +D + L +F +++ +D+ Sbjct: 5 KRTYKDSLFCDIFRRKDYLQDVYRGLFGRDVS--LQEIQLMTLQGTFFNDE----KNDVS 58 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK------ELPLVLP 119 + G I V++EHQS E M RM Y + + LP Sbjct: 59 FLA----GKRQI-VLMEHQSTLNENMPLRMFWYMAKLYRKQVPKDAPYRTRRLRLPAPCF 113 Query: 120 MLFYHGCR-SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-MALLE 177 +FY+G +P + + + F + ++ A+ +I + +++ + + Sbjct: 114 YVFYNGLDPAPDEWEMRLSEAFEGECSSLELCVKAY---NINEMSGSRLLEKSRALKGYS 170 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 + IR++ G+ + V + L + + + + Sbjct: 171 VFVAQIRRKTAAGV---CLEEAVKQAIRYCIEQDLLAEYFLEREMEEVFDMVSFKWDPEL 227 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEML-DRGLDRELVMMVTRLSPDDLIAQ 294 ++ +L + E+G +G + I ML + + + V++ D + + Sbjct: 228 AKRVQLQEAQEIGMEKGMEKGMEKGVTEIVLNMLKKKKWSLQDISEVSQWPLDKIESL 285 >UniRef50_C6LTE0 Putative uncharacterized protein n=1 Tax=Giardia intestinalis ATCC 50581 RepID=C6LTE0_GIALA Length = 353 Score = 103 bits (256), Expect = 8e-21, Method: Composition-based stats. Identities = 38/292 (13%), Positives = 101/292 (34%), Gaps = 23/292 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D VF + + L + L+ + ++++P L Sbjct: 73 DFVFYQIFGVEKH-KSVLISLLNSILKGNPHVKDVRIDPTEHKRTTPDGKSVRLDIKATI 131 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH-LDAG--YKELPLVLPMLFYHGCR 127 +G V +E Q + R + Y ++++ + G YK +P V+ + + Sbjct: 132 NDGT---IVDVEMQCINTGDIYHRSIYYQSLILRDYTIKQGQSYKSIPDVIIIWIMNQDI 188 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFP----LVDITVVPDDEIMQHRKMALLELIQKHI 183 + + + + +I ++ ++++T + + + K Sbjct: 189 TNRKGCMHEIVPMYKANGIDQIEIASEKMRQFIIELTKLGNTSNFCYNK----AFTAWMT 244 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 +D + +++ + ++ + + + RA + Sbjct: 245 FIKDPSSISGELLEV--------EGVQTAMKELTYLSENKETRAIYDARRIALLDLNSAI 296 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 ++ + EG ++G+ +E R+A++ML GLD E ++ + LS ++ Sbjct: 297 EHGIEKGKAEGLVEGRDKERERMAEQMLSDGLDIEFIVRYSGLSMQEIENVK 348 >UniRef50_B7GJZ4 Transposase n=10 Tax=Bacillaceae RepID=B7GJZ4_ANOFW Length = 286 Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats. Identities = 47/291 (16%), Positives = 100/291 (34%), Gaps = 20/291 (6%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDE--DLRQYYSDLLW 66 HD +FK L + + + D L + +Y DLL Sbjct: 6 DHDRLFKELLTTFFEEF---ILLFFPHVHEHIDFRHLSFLSEELFTDVTAGEKYRVDLLI 62 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 K + G I + +E+QS + RM Y + + +LP+ + Sbjct: 63 QTKLKGEAGIIIIHVENQSYMQSSFPERMFIYFSRLFEKYRTN-------ILPIAIFS-- 113 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH-IRQ 185 Y + F + F V++ ++ L+ K + Sbjct: 114 ---YDFIRDEPSSFTLQFPFLHVLQFQFLAVELRKQNWRHYIRSENPIATALLSKMGYNE 170 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKAL--FNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 + + L Q +L+ N ++ + + L F Q F E+ + +E E++ Sbjct: 171 NERVELKKQFFRMLIRQNIDEAKRRLLIGFFETYVKLTEQEEEQFQNEVKKMGGKEGEQV 230 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 M + ++G + G E+ + Q+M+++G+ + + S +++ Sbjct: 231 MELIISYEQKGKIAGAKEKEREMIQKMVEKGMSITQIAHLLDRSEEEVRKV 281 >UniRef50_B0K813 Putative uncharacterized protein n=13 Tax=Thermoanaerobacterales RepID=B0K813_THEP3 Length = 267 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 51/292 (17%), Positives = 114/292 (39%), Gaps = 32/292 (10%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S +D K+ + A D L T L F + SD+++ Sbjct: 2 SQEYDITAKNIFSN--LADDIASYFLG------LKFTKLDELNIEFT--TIESRESDMVF 51 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 T+ I + IE Q+ + M +RM+RY+ M+ H +L Y Sbjct: 52 KCTTENRD--IALHIEFQTYNDSKMPYRMLRYATEIMEKH------------NLLPYQVV 97 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI---QKHI 183 L + + + + ++D+ + ++I++ + L + K Sbjct: 98 VYCSKNELKMENNLNYHLGEENLLNFRYRIIDVGKIKFEDIVKTKYYDLYTFLPVADKDK 157 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 RQ++ + + ++ + KA +Y++ T + + E+ E+ E + Sbjct: 158 RQKEKEAYLRKCAEVIRDMPVD----KAKKSYIVTTAEILAGIIYDEEVIEKIFSEVIGM 213 Query: 244 MTIAD-RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + + ++ + +GK E+++ IA+E+L G+D + +T+LS +++ Sbjct: 214 SILEESKVYKNILEKGKKEKSIEIARELLKEGMDINKIAQITKLSVEEIKKL 265 >UniRef50_D2NBJ3 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NBJ3_ECOLX Length = 136 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 71/129 (55%), Positives = 93/129 (72%), Gaps = 4/129 (3%) Query: 168 MQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRA 227 +H MALLELIQKHIRQRDL+GLV+Q+ LL +G NDRQ+K LFNY+LQTGDA RF Sbjct: 12 RRHASMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFND 71 Query: 228 FIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 FI +AER+P+ KE LMTIA+RLR+ +G+ +AL IA+ ML+ G+ +M T +S Sbjct: 72 FIDGVAERSPKHKESLMTIAERLRQ----EGEQSKALHIAKIMLESGVPLADIMRFTGVS 127 Query: 288 PDDLIAQSH 296 ++L A S Sbjct: 128 EEELAAASQ 136 >UniRef50_A6M1J9 Putative uncharacterized protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1J9_CLOB8 Length = 278 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 52/291 (17%), Positives = 98/291 (33%), Gaps = 20/291 (6%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCD-LTTLKLEPNSFIDEDLRQYYSDL 64 S +D VFK +D I L + L+ D L ++L + E L Sbjct: 3 ISPKNDFVFKLLFGDEKN-KDLIIELLNSILKMPHDELEDIELINTELLREFAEDRKGIL 61 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 KT+ G ++ IE Q MA R + Y + +GY Y Sbjct: 62 DVRAKTKSGE---HIDIEIQVLYTYYMAERTLFYWSKMYNGQIKSGYT----------YD 108 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 + ++ + + + + D T ++++ + L +L +I Sbjct: 109 KLKKCITINIVDFNCIEINKLHTSFHITE----DETNKKLTDVLEIHYLELPKLFDNNIP 164 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 + + LV ++ L L + + + + + +L Sbjct: 165 KDESEPLVQWMMFLQSRNKEAFEMLAEKNEKIKKAYNILEVISKDDNARAAYEAREAELH 224 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 RL+ +G E ++ A+ L GLD E+V T LS D+++ Sbjct: 225 DQMTRLK-SAREEGIKEATIKNAKNFLVMGLDVEMVAKGTGLSVDEVLKIK 274 >UniRef50_UPI0001C34E7F hypothetical protein ClM62_15401 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E7F Length = 324 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 94/276 (34%), Gaps = 21/276 (7%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 DA+F+ + + L + LE +++ +DL + Sbjct: 24 RDYKDALFRMIFNDKEALLSLYNAVGNTSYTDASQLQIVTLENAVYMN-----IKNDLAF 78 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK------ELPLVLPM 120 + + + EHQS M R + Y + L +LP + Sbjct: 79 LLNME------LNLYEHQSTWNPNMPLRDLFYVSREYEMLLANQSIYSSSLLKLPAPRFV 132 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 +F++G + L + E + +++I +DE+M ++ L E Sbjct: 133 VFFNGSYDMGEQCVLKLSDAYEKKVEDPDLELKVTVLNINAGWNDELMNTCRL-LKEYSL 191 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEK 240 R R + ++ V+ ++ + + L A+ I E E +E Sbjct: 192 YVARVRAYAK--EMELAEAVSRAVDECIKEGILRDFLMKYRAEAISVSIFEYDEEREKE- 248 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLD 276 T + R+EG QG+ E + +E + +G+ Sbjct: 249 LLRKTEYEFGRQEGLSQGREEGLSQGIKEGMAQGVS 284 >UniRef50_A6LF36 Putative uncharacterized protein n=7 Tax=Bacteroidales RepID=A6LF36_PARD8 Length = 273 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. Identities = 33/287 (11%), Positives = 76/287 (26%), Gaps = 26/287 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D F + ++ + L D+ + E L +++ V Sbjct: 10 DFGFHRIFGQ-EVHKELLIDFLNQLFFGEHDIEDITFLNPIQTPETLDDRG--IVFDVHC 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFYHGCR 127 ++ G ++V +E Q+ + R + Y A+ N G L V + + Sbjct: 67 KDSNGNLFV-VEMQTGAQPYFHDRGLYYLARAISNQGQKGKDWKFALQPVYGVFLLNYKM 125 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 + +++ + + KH+ + Sbjct: 126 DVNSKFRTDVILADRETGRMFSDRIRQVYLELPYFQKEPDECENDFERWIYLLKHMDTLE 185 Query: 188 LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIA 247 + + L ++ R E +R K + Sbjct: 186 RMPFKAKKA-----------VFDKLLEVADVANLSKEERIQYDEALKRYRDYKNTI---- 230 Query: 248 DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + +G + A+ M G+ ++ T LS +D+ Sbjct: 231 ----DYAEEKGILKGKESTARNMKAEGIAPLIIQKCTGLSLEDIEKL 273 >UniRef50_C0QZQ8 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QZQ8_BRAHW Length = 309 Score = 99 bits (247), Expect = 9e-20, Method: Composition-based stats. Identities = 42/303 (13%), Positives = 101/303 (33%), Gaps = 19/303 (6%) Query: 2 TISTTSTPHDAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 TI + +D + H + A +FI+ +++ I E+ Sbjct: 16 TIENLNRINDYFIRYLFSHTGNENIALNFINAVFKD--LNFETFQKIEILNPFNIAENYD 73 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPL 116 + S + T+ G I V+IE QS+ E R + Y + L+ G Y EL Sbjct: 74 EKESIVDIKATTESG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDELKP 130 Query: 117 -----VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 + + + Y L L+ +++ P ++ + + E + + Sbjct: 131 TVSINITNFILTDEDKVHSCYILKELNNNKILTDHCQLHFVELPKSNLKNISEIESLDNT 190 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 + ++ + + + ++ + + + + + + +R Sbjct: 191 HKEFISWVK--FFKGEDMSIL--MKENTIFEEVERKCRTFVNDSPVMDKYKKREVDTYFL 246 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 ++ +EG +G E + +A+ M +D ++ T LS +++ Sbjct: 247 NKSMELDIRKAKEEGIKEGIKEGIKEGIKENQISMAKNMKKDKVDFNIISKYTGLSIEEI 306 Query: 292 IAQ 294 Sbjct: 307 KKL 309 >UniRef50_C1I6Y7 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I6Y7_9CLOT Length = 226 Score = 99 bits (247), Expect = 9e-20, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 73/221 (33%), Gaps = 12/221 (5%) Query: 47 LEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH 106 L S+I D + SD+++ + YV++E QS + M R++ Y I ++ Sbjct: 3 LVNKSYILSDYEEQESDIVYKANFNGNDVFFYVLLEFQSSVDFRMPIRLLLYMIEIWRDI 62 Query: 107 L--------DAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD 158 L LP ++P++ Y+G + I + + +D Sbjct: 63 LRNTELKEFKRKTFRLPSIVPIVLYNGKKKWTAAKELKHAISNSDVFGDNILNFKYEFID 122 Query: 159 ITVVPDDEIMQHRKM--ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV 216 I +E+ + + A+ L Q R L D I+ LK + Sbjct: 123 INSYEKEELYNKQNISSAIFLLDQNINRIEFYNRLKDIIIGFNNLSIEEKMHLKHWLVNI 182 Query: 217 LQTGD--AQRFRAFIGEIAERAPQEKEKLMTIADRLREEGA 255 + + + ++L+E+G Sbjct: 183 NTEENNFKDNIEKIFNADKQEVLNMTSNISKGLEKLKEDGK 223 >UniRef50_UPI0001B4A8CA hypothetical protein Bfra3_22303 n=1 Tax=Bacteroides fragilis 3_1_12 RepID=UPI0001B4A8CA Length = 282 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 100/296 (33%), Gaps = 18/296 (6%) Query: 3 ISTTSTPHDAVFKSFLR-HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 + + D FK HPD F++ LP L + T ++ P+ + E+ Sbjct: 1 MRYLNPKADLTFKRVFGEHPDLVMSFLNALLPLRLEESI--TDIEYLPSGMVPENSLPKN 58 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLP 119 S + + +G +I +E Q +M + A +D+G + L V Sbjct: 59 SIVYVRCRDSKGRSFI---VEMQMIWSPEFKQCVMFNASKAYVRQMDSGEQYDLLQPVYS 115 Query: 120 MLFYHGCRSPYPYSL-CWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 + + P + R I V++ + + L Sbjct: 116 LNLVNDIFEPDIKEYYHYYRLVHVEHTERVINGLHLVFVELPKFTPHTYSEKKMHILWLR 175 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 I ++ ++ L+ + + L + F I+ Sbjct: 176 YLTEIDEKT-----HEVPEELLENPEIKKAVTVLEESAFTPEQLLGYEKFWDIIS----V 226 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 EK + + + +EEG +G+ +E L +A +GL +++ +T LS +++ Sbjct: 227 EKTLISSAERKEKEEGRKEGELQEKLLVASNAKKQGLSLDIISSLTGLSAEEIERL 282 >UniRef50_C1PBU4 Putative uncharacterized protein n=4 Tax=Bacillus coagulans 36D1 RepID=C1PBU4_BACCO Length = 329 Score = 99.1 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 52/335 (15%), Positives = 108/335 (32%), Gaps = 54/335 (16%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED--LR 58 M HD +FK +++ ++F+D P L D ++ + Sbjct: 5 MEKHAGYHVHDRLFKELIQN--FFQEFMDAFFP-DLSADLDYRRVRFLSQEQFTDFPGGE 61 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 Q D+L K + I + +E QS E+ RM RY + H VL Sbjct: 62 QKRVDILAETKVKGKDTVILIHVEPQSYYEKPFPERMFRYYMMISLRHRK-------PVL 114 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 P+ + D + +I + + + ++ L Sbjct: 115 PIAVFSYEEKTETP-----DTYTFAFHNIEILRFHYLSIHLMKQNWRNYIRSNNPVAAAL 169 Query: 179 IQKHIR-QRDLLGLVDQIVSLLVTGNTNDRQLKAL--FNYVLQTGDAQRFRAFIGEIAER 235 + K + + + + + + +L + +++ L F + + + I Sbjct: 170 LSKMGYTETERVQVKLEFLRMLARMELDPAKMRLLHGFFDYYLKLNEKEEAEVMENIKML 229 Query: 236 APQEKEKLMTI------------------------ADRLREEGAMQGKHE---------- 261 P E E+++ + ++ REEG G + Sbjct: 230 DPDEAEQVLKLPNSYFDRGYKKGKEEGREEGIEIGVEKGREEGIEIGVEKGREEERKEML 289 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 + + IA +ML G + +L++ T LS ++ Sbjct: 290 QTIPIAIKMLQEGRELQLIVEKTGLSQREVEKIKQ 324 >UniRef50_C9LXX0 Putative uncharacterized protein n=6 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LXX0_9FIRM Length = 301 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 98/304 (32%), Gaps = 28/304 (9%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 +T T D++F+ + + + + L D+T ++ F +D Sbjct: 3 NTKRTYKDSLFRDIFNNAERLPEIYEALL-DHKTTPDDITLATIDETLFTG-----VKND 56 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL------DAGYKELPLV 117 + + V G ++ +++EHQS M R++ Y + + ++ LP Sbjct: 57 IGFIV----GNQHV-LLVEHQSTINANMPLRLLMYLVEIYRRYVDKDAIYKKELIPLPAP 111 Query: 118 LPMLFYHGC-RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-MAL 175 +FY+G P ++L D F + +I P+ I++ + Sbjct: 112 KFYVFYNGLAEMPDIWALHLSDAFGGHDSD---LELEVKVFNINDKPNRPILEKCHALKS 168 Query: 176 LELIQKHIRQRDLLG-LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 + +R+ G ++ V V L F + Sbjct: 169 YSVFVAKVRECIKNGSSLEIAVGNAVQYCVAHDYLGEYFRQKQAKEVFDMLNFVWNQERA 228 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALR-----IAQEMLDRGLDRELVMMVTRLSPD 289 + +E + R+EG QG + L I M E M + ++ + Sbjct: 229 LEVRAEEAMEKGLRLGRQEGLSQGLSQGVLETTTASIRNVMKSMDFPIEKAMDILQIPEE 288 Query: 290 DLIA 293 + Sbjct: 289 ERAK 292 >UniRef50_Q5GSR2 Uncharacterized conserved protein n=15 Tax=Wolbachia RepID=Q5GSR2_WOLTR Length = 317 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 96/305 (31%), Gaps = 24/305 (7%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D +FK + F++ L ++ + ++ ID ++ ++ Sbjct: 12 DLIFKKIFGTEKNKKIIICFLNNILG--FAEINAIQEVEFLSAI-IDPEIASNKQSIIVD 68 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE---LPLVLPMLFYH 124 V ++ G V IE Q + R+ Y++ A LD L V + + Sbjct: 69 VFCKDATGTRRV-IEVQLAINKGFEKRVQPYAVKAYSRQLDKSGNYIVDLKKVFFIAISN 127 Query: 125 -GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL-LELIQKH 182 S + + + F +++ ++ Q + K+ Sbjct: 128 CNLLSEKVDYISTHNIHDTKTNGHYLKDFQFIFIELPKFSKSKVEQLINIVEHWCFFFKN 187 Query: 183 I---RQRDLLGLVDQIVSLLVTGNTNDRQ---LKALFNYVLQTGDAQRFRAFIG------ 230 + DL + +++ + + + D + + Y + + Q+ +A + Sbjct: 188 AEDTTETDLKRVAKKVLIIKLAYDGLDEFHWNEEDIIAYEERVMNLQKEKAILEYRLDLA 247 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 R K E+G +G + + +A+ L G+ + + LS Sbjct: 248 TEKGREEGVKISKERGIKVGAEKGREEGVKKAKIAVAKNSLKAGMSIGAIAEIIGLSVGK 307 Query: 291 LIAQS 295 + Sbjct: 308 IKKLH 312 >UniRef50_A6EAN2 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAN2_9SPHI Length = 317 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 40/309 (12%), Positives = 96/309 (31%), Gaps = 29/309 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK +D + L A + + L N + + + ++ + Sbjct: 13 DFAFKKIFGGDPN-KDLLIDLLNALFKGRKIIIDLTYNKNEHPGDSEHEGAA--VFDLLC 69 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-----ELPLVLPMLFYHG 125 G ++ IE Q +E R + Y+ + + G + L V + Sbjct: 70 TGQNGEQFI-IEIQRAKQENFKERALFYTSRLISSQAPKGNRASWGYRLTEVYLIALMED 128 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSA---FPLVDITVVPDDEIMQHRKMALLELIQKH 182 +L + + +++ + + K+ Sbjct: 129 TTLNDESEHEFLHDICLCKRDTGKVFYEKLGYLYIELRKFVKSSTELQTDLDRWLFLLKN 188 Query: 183 IRQRD------LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI---- 232 + D + +++ S+ N + + + + + D + R + + Sbjct: 189 LSSMDKIPVYLRKPIFEKLFSIAEYSNLSKEEKMSYDSRMKYKWDNENVREYARKEGLEK 248 Query: 233 -------AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTR 285 R + E + + EG ++G+ E A++IA EM L + + T+ Sbjct: 249 GLEEGREKGRLEGKLEGKLEGKLEGKLEGKLEGRKEAAIKIAGEMKSANLPLDQIARFTK 308 Query: 286 LSPDDLIAQ 294 LS +++ Sbjct: 309 LSLEEIEGI 317 >UniRef50_C1QAJ2 Putative uncharacterized protein n=2 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAJ2_9SPIR Length = 312 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 46/318 (14%), Positives = 101/318 (31%), Gaps = 32/318 (10%) Query: 3 ISTTSTPHDAVFKSFLRHPD---TARDFIDI-HLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 + + +D + D DFI+ L A ++ + L P + + + Sbjct: 1 MRDINVLNDYFVRYLFSSKDSNFILLDFINSTMLDANMKTFRSVEILTPSPKAGSRLNYK 60 Query: 59 QYYSD------------------LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSI 100 + Y D L TQ G V+IE Q + R++ Y Sbjct: 61 ENYDDKESIAPKVARKVDRCRRRLDVKCITQNGTV---VIIEIQLQGNSRFPERILYYWA 117 Query: 101 AAMQNHLDAGYKE--LPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD 158 + L G K L V+ + + + + +++ Sbjct: 118 SNYSKLLKQGEKYDALTPVISINLLNFNLDNNDCIHSCYMIYDTKSKRLLTDHLQIHIIE 177 Query: 159 ITVVPDDEIMQH--RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV 216 I D+ + + + + +K R+ + LV + + R + + + Sbjct: 178 IKKFKDNLLDKDLDCWLKFFTIKEKDNREVIMSELVKEKP---IMEEVQKRYNNFIKDRL 234 Query: 217 LQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLD 276 + +R G + + + + E+G +G E + A+ M ++ +D Sbjct: 235 MMNEYDKREAYLYGNQIMLEEERRLGIEEGFKKGIEKGIEKGIKENQILTAKNMKNKNID 294 Query: 277 RELVMMVTRLSPDDLIAQ 294 L+ +T LS ++ Sbjct: 295 IALISDITGLSIKEIEEL 312 >UniRef50_C3R531 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=C3R531_9BACE Length = 325 Score = 98.4 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 41/318 (12%), Positives = 89/318 (27%), Gaps = 35/318 (11%) Query: 8 TPH-DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 P+ D FK D ++ + L A + + + + ++ Sbjct: 12 NPYTDFAFKLLFGT-DLNKEILIGFLNALFDGKQVIEDVTYLNTEHLGSKETDRRA--VF 68 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVLPMLFY 123 V + G ++IE Q ++ R + Y+ ++ G EL V + Sbjct: 69 DVYCENEKGEK-ILIEMQRGEQQFFKDRSIYYATYPIREQAIKGEIWDYELKAVYVIGIL 127 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV--DITVVPDDEIMQHRKMALLELIQK 181 + S + +++ V ++ E + K Sbjct: 128 NFALDDVSSSSFRHEVKLMDTTTHEVFFDKLTFVYLEMPKFHKTEQELDTLFDKWMFVLK 187 Query: 182 HI----------------RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDA--- 222 ++ R + + L + + + N + Sbjct: 188 NLARLMERPTALQERVFNRLFEAAEIAQFSKENLYAYEESLKVYRDWNNVIDTAIQKGIA 247 Query: 223 --QRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEE----ALRIAQEMLDRGLD 276 G A +E ++ + +G +G E A IA + + GL Sbjct: 248 RGMEEGLVKGMEEGIAKGMEEGIVKGMEEGIAKGMEKGIAEGEWMKAQTIAGNLKNAGLS 307 Query: 277 RELVMMVTRLSPDDLIAQ 294 + VT LS D++ + Sbjct: 308 IAEIAKVTGLSEDEINSL 325 >UniRef50_C0CTJ7 Putative uncharacterized protein n=5 Tax=Clostridium RepID=C0CTJ7_9CLOT Length = 327 Score = 98.0 bits (242), Expect = 4e-19, Method: Composition-based stats. Identities = 47/320 (14%), Positives = 96/320 (30%), Gaps = 44/320 (13%) Query: 11 DAVFKSFLRHPDTARDFID--IHLPAPLRKLCDLTTLKLEPNSFIDEDLR----QYYSDL 64 D V + + D I+ + + D+ L R Q Y D Sbjct: 5 DMVLNRYFEDGERYADLINGYAFNGDQVVRKEDVQELDPRETGVAGRLGRRPGVQKYRDS 64 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA--------------- 109 + V G ++ + +EHQ + M R M A L Sbjct: 65 IRRVVL--GARFVLIGLEHQDQVHYAMPVRAMLQDAAEYDRQLRRIRRVNRRVGGLTGAE 122 Query: 110 ------GYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA----EPAIARKIYSSAFPLVDI 159 + V+ ++ Y+G + P+ ++ + R + + ++++ Sbjct: 123 FLGGFTRKDRVCPVITLVLYYGKK-PWDGAMDLHGLMDCAGYPEPMLRLVNNYRLHVLEV 181 Query: 160 TVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT 219 + + + IQ R D V D + + + + Sbjct: 182 RRFVNIRRFRTDLYQVFGFIQ---RSGDKEAERRFTEENRVYFEGMDEEAFDVITAITGS 238 Query: 220 GDAQRFRAFIGEIAERA---PQEKEKLMTIADRLREEGAMQGKHEEALR----IAQEMLD 272 + +R + E R + + R EG ++GK+E AL +A+ M Sbjct: 239 RELERVKEQYREEGGRINMCEAIRGMIEDGRIEGRLEGKIEGKYEGALEKTRTVARNMYL 298 Query: 273 RGLDRELVMMVTRLSPDDLI 292 RG+ E + + + Sbjct: 299 RGMSAEDAAAICEMDTAQIE 318 >UniRef50_C4ZLA7 Conserved hypothetical cytosolic protein n=2 Tax=Proteobacteria RepID=C4ZLA7_THASP Length = 339 Score = 96.8 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 52/328 (15%), Positives = 108/328 (32%), Gaps = 47/328 (14%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFI----DED 56 M S +D+ +K + H +FID + P R++ + D Sbjct: 1 MPASAAQDDYDSPWKEAVEH--AFPEFIDFYFPDAGRQIDWARGHRFLDKELQQIVRDAA 58 Query: 57 LRQYYSDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP 115 L + + D L SV T G ++ V IE Q + A RM Y+ ++ P Sbjct: 59 LGRRHVDKLASVTTHAGEEDWLCVHIEVQGSMDPDFARRMFVYNYRIYDSYDR------P 112 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDI-TVVPDDEIMQHRKMA 174 + + + P D F + + + FP+ + D+ + Sbjct: 113 VASLAVLADDDPAWRP------DRFGYERLGCRH-NLQFPVAKLVDHAADEAALLCNPNP 165 Query: 175 LLELIQKHIRQRD-------LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD-AQRFR 226 + H+ R ++V LL + +++ F+ + + F Sbjct: 166 FALVTAAHLYTRRTRRSPIARFDAKRRLVRLLYERDWTRQRILDFFSVLDWMMRLPREFE 225 Query: 227 AFIGEIAERAPQEKE----------KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLD 276 + + E E++ + + E+G G + + ++ +++G Sbjct: 226 QRLWQDIENIEGERKVKYVTSVERLAIERGLQKGMEQGLEIGIEKGIEQGIEKGIEKGRA 285 Query: 277 RELVMMVTR--------LSPDDLIAQSH 296 + ++ R LSPD + S Sbjct: 286 QGSASVLLRLLNRRFGPLSPDIIRRLSQ 313 >UniRef50_A8GY36 Putative uncharacterized protein n=15 Tax=Rickettsia RepID=A8GY36_RICB8 Length = 279 Score = 96.8 bits (239), Expect = 8e-19, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 91/291 (31%), Gaps = 30/291 (10%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 +D FK F++ + L + + LK N + + + S + V Sbjct: 9 NDVAFKKLFTDKARLISFLNNIMR--LPEELRIIDLKYISNEQVPDLGQNKRSIVDVKVT 66 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPMLFYHGCR 127 G YI +E Q+ + R+ Y A + L G + L V+ ++ G + Sbjct: 67 DNSGNIYI---VEMQNGYADAFLARVQFYGCVAFSSQLKRGKEYADLAPVVMVIITSGFQ 123 Query: 128 S--PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + + + ++ ++ V++ + + Sbjct: 124 ALPEEKECISYHQTINVGNGKNQLKCLSYVFVELDKFTKEANELETIEDDWLYMM----- 178 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 + K + V + F AE K L Sbjct: 179 --------------AKFDKAKEPPKHTQDEV-VLSAYKTIEQFNWSEAEYDNYIKAMLAA 223 Query: 246 IADRLREEGA-MQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + L ++ +GK E ++ +A+EML E ++ T+LS +++ Sbjct: 224 QTEELNQKSKFKEGKAERSIEMAKEMLQDNEPIEKIIKYTKLSKEEIEKLK 274 >UniRef50_C1Q938 Putative uncharacterized protein n=4 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q938_9SPIR Length = 326 Score = 96.8 bits (239), Expect = 8e-19, Method: Composition-based stats. Identities = 43/304 (14%), Positives = 103/304 (33%), Gaps = 27/304 (8%) Query: 1 MTISTTSTPHDAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 +TI+ + +D + H + A +FI+ +++ I E+ Sbjct: 40 ITINNLNRINDYFVRYLFSHDGNENIALNFINAVFKD--LNFETFNKIEILNPFNISENY 97 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELP 115 + S + T+ G I V+IE QS+ E R + Y + L+ G Y L Sbjct: 98 DEKESIVDIKATTETG---ITVLIEIQSRGNEDFIKRALYYWAYNYSSSLNRGSFYDGLK 154 Query: 116 L-----VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH 170 + + + Y L L+ +++ P ++ + E + + Sbjct: 155 PTVSINITNFILTDEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFNLKDISAIESLDN 214 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 + I+ + + + ++ + ++ +++ N ++ Sbjct: 215 IHKEFISWIK--FFKGEDMSILMKENTIF---EEVEKKCLTFVNDSPVIDKYKKREV--- 266 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 + + + +EEG +G E + A+ M +D ++ +T LS + Sbjct: 267 ----DTYFFNKSMELDIKKAKEEGIKEGIKENQILTAKNMKKENIDINIISKITGLSIQE 322 Query: 291 LIAQ 294 + Sbjct: 323 IENL 326 >UniRef50_D0TYF1 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYF1_9BACE Length = 349 Score = 96.4 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 44/356 (12%), Positives = 110/356 (30%), Gaps = 71/356 (19%) Query: 3 ISTTSTPH-DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 +S P D FK P +++ + L L +T L +++ Sbjct: 1 MSKYVNPFTDIGFKIIFGQPA-SKNLLITLLNELLAGEHHITELTFLDKEDHADNVSDKG 59 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA------------ 109 +++ + + G Y+++E Q++ R + Y A+ +++ Sbjct: 60 --IIYDLYCRTASGE-YIIVEMQNRWHSNFLDRTLYYVCRAVSRQIESPSSKEVPVPEDP 116 Query: 110 -----------GYKELPLVLPMLFYHGCRSP--------YPYSLCWLDEFAEPAIARKIY 150 LP + + + S + P + + Sbjct: 117 MTAREPLVSYGKQYRLPTIYGIFLTNFKEENLEAKFRTDTVLSDRDTGKIVNPHLRQIYL 176 Query: 151 SSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG--LVDQIVSLLVTGNTNDRQ 208 + D+ D + + + L+ + R D L + + + L + ++ Sbjct: 177 QFPYFTKDL---SDCHTLYDKLIYALKNMSNWNRMPDALKEQVFEHLARLAAVADLSEEN 233 Query: 209 LKAL--------FNYVLQTGDAQRFRAFIGEIAER----------------------APQ 238 A N +++ + ++ + AE Sbjct: 234 RIAYDKALDRYRVNQIVEEDERRKNEEMRRKAAEEGLKEGMKAGLEKGVKKGRLEGIKEG 293 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 KE + E+G +G+ ++ + IA++M + G+ ++++ T L D+ Sbjct: 294 MKEGMKEGMKEGLEKGLEKGEQKKQIEIARKMREDGISIDIIIKYTGLQSSDIENL 349 >UniRef50_C1MD86 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C1MD86_9ENTR Length = 155 Score = 96.1 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 81/155 (52%), Positives = 105/155 (67%), Gaps = 20/155 (12%) Query: 162 VPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGD 221 +PDD+IMQHR+MALLELIQKHIR+RDL+GLV+++ LLV G+ ND QLKALFNY++Q G+ Sbjct: 1 MPDDKIMQHRRMALLELIQKHIRKRDLMGLVEKLAILLVKGHANDNQLKALFNYLMQAGN 60 Query: 222 AQRFRAFIGEIAERAPQEKEKLMTIADRLREE--------------------GAMQGKHE 261 F F+ E+AER PQ K+KLMTIA+RLR+E G QGK E Sbjct: 61 TTHFGEFLHEVAERLPQHKDKLMTIAERLRQEGHLNGLQEGHRKGLQEGLQTGLQQGKRE 120 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EALRIA M G+D ++ +T L+ +DL +SH Sbjct: 121 EALRIASTMQADGIDPLTIIRITGLTAEDLATRSH 155 >UniRef50_Q8F560 Putative uncharacterized protein n=1 Tax=Leptospira interrogans RepID=Q8F560_LEPIN Length = 278 Score = 94.9 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 96/286 (33%), Gaps = 18/286 (6%) Query: 13 VFKSFL-RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQ 71 +FK + PD ++ L +K+ + S L + + Sbjct: 2 MFKILFVKEPDLLISILNSVLFTDGEHTI--RNIKILNPELVGSSPNDKRSYLDIRAQDE 59 Query: 72 EGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCRSP 129 +G + +E Q + R + Y +++ L+ G Y +L V + P Sbjct: 60 DGKIF---HVEIQVAHQSSFVKRSLYYLSGLIRDQLNRGSMYSDLKPVYQINIVDFDLIP 116 Query: 130 -YPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-HRKMALLELIQKHIRQRD 187 + + +++ ++ + + + + KH + + Sbjct: 117 SENFHSKFKFREESNPDIILTDDVEIHFLELCKFVKRDVRELRNNLEIWLYVLKHTSELE 176 Query: 188 LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIA 247 +++ L+ + L Y + D Q+ ++ L Sbjct: 177 E----EEMRILVDKTPDLSKAFTILEQY---SNDPQKRNELEAKLKSDR-DYAYDLAARF 228 Query: 248 DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 + +G +G +E L+ A++ML+ G+ ++++ +T LS DL Sbjct: 229 EAGELQGIEKGAEKEKLKSARKMLEEGMRLDVILRITGLSKKDLKD 274 >UniRef50_C0DAA1 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA1_9CLOT Length = 302 Score = 94.9 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 42/284 (14%), Positives = 86/284 (30%), Gaps = 29/284 (10%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D++F+ + + + DL + ++ +D+ + + Sbjct: 17 KDSLFRVIFSEKKELLELYNAINGSHYENPDDLIITTIGDVLYLGMK-----NDISFLI- 70 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--------LPLVLPML 121 G + E QS M R + Y Q +L + LP ++ Sbjct: 71 -----GQHLSLYEAQSTWNPNMPLRGLFYFSRLYQGYLKEHQLDLYSRRPLSLPFPEFIV 125 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-HRKMALLELIQ 180 FY+G + L + A +++I ++E+M+ RK+ + Sbjct: 126 FYNGTMEQPDRTQLRLSDLFYQAEGVPCLECTATMININYGHNEEMMKSCRKLYEYAFLI 185 Query: 181 KHIRQRDLLGL-VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 +R R GL ++ V V LK + R I + Sbjct: 186 NAVRSRLNEGLHLEAAVDQAVEDCIQHDVLKNFL-----LKHREEVREMILSEYDEELHI 240 Query: 240 KEKLMTIADRLREEGAMQGKHEEALRI---AQEMLDRGLDRELV 280 + + E G +QG R+ + G +++ Sbjct: 241 NSEKKISYEEGLEAGVVQGTQHGQERVNALITRLAAAGRADDII 284 >UniRef50_B7CC32 Putative uncharacterized protein n=10 Tax=Eubacterium biforme DSM 3989 RepID=B7CC32_9FIRM Length = 301 Score = 94.9 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 47/300 (15%), Positives = 108/300 (36%), Gaps = 16/300 (5%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR-QYYSDLL 65 + D K FL + DF + + ++ + D ++ + + D++ Sbjct: 2 NKIKDKTMKEFLENNAYFVDFFNAYF-FDGERVLKPENCMELDSEMNDSNMDLEKHVDVI 60 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN-----HLDAGYKELPLVLPM 120 K +G Y +IE+QS + M R Y A + ++LP+V + Sbjct: 61 R--KYNDGNLYSAFIIENQSYVDASMVVRAAAYEFVAYDRMLKKLKKNKAKEKLPMVHIL 118 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDIT-----VVPDDEIMQHRKMAL 175 +FY G + + + + L++IT ++++ + Sbjct: 119 VFYTGEKLWNAANKLSQLVEVDERFESYFHDYQMNLIEITGNTSYNFNEEDVYNLFYICR 178 Query: 176 LELIQKHIRQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 Q ++ + GLV V +V T+ L + + E+ Sbjct: 179 SIYDQSIYEEKSNGFGLVKSSVLKVVKTLTDVEWLDLEELEEKEEIEMCEAEKRWLEVKS 238 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + + K + ++ E+G QG ++ L + ++M+D+G + + + +S + + Sbjct: 239 KEWEAK-GIKKGIEQGIEQGIEQGSEKKELEMYRKMMDKGFGIKAIASIFSVSEESIEKL 297 >UniRef50_C0QWG9 Putative uncharacterized protein n=8 Tax=Brachyspira RepID=C0QWG9_BRAHW Length = 301 Score = 94.5 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 46/303 (15%), Positives = 109/303 (35%), Gaps = 27/303 (8%) Query: 2 TISTTSTPHDAVFKSFLRHP---DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 TI+ + +D + H + A +FI+ + +++ I E+ Sbjct: 16 TINNLNRINDYFIRYLFSHEGNENIALNFINAVFKDLGFE--TFKKIEILNPFNIAENYD 73 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPL 116 + S + T+ G I V+IE Q++ E R + Y + L+ G Y EL Sbjct: 74 EKESIVDIKAITESG---ITVLIEIQARGNEDFIKRALYYWAYNYSSSLNRGSFYDELKP 130 Query: 117 -----VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 + + + + Y L L+ +++ P ++ + E + + Sbjct: 131 TVSINITNFILTNEDKVHSCYVLKELNNNKILTDHCQLHFLELPKFNLKNISAIESLDNI 190 Query: 172 KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGE 231 + ++ + + + ++ + ++ R + + + F + Sbjct: 191 HKEFISWVK--FFKGEDMSILMKENTIFEEVEKKCRTFVNNTPVMDKYKKREVDAYFFDK 248 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDL 291 E + +EEG QG+ +A+ IA+ + G+D +++ T LS +++ Sbjct: 249 SIELD----------LKKAKEEGIEQGEKNKAISIAKSFKNAGIDIKIISENTGLSIEEV 298 Query: 292 IAQ 294 Sbjct: 299 EKL 301 >UniRef50_C0BF92 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF92_9FIRM Length = 307 Score = 94.5 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 87/276 (31%), Gaps = 27/276 (9%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY 61 + D +++ + + + + DL LE ++ Sbjct: 12 KQTHNRQYKDRLWRMIFNNKEDLLQLYNAINHTDYQNPDDLEVNTLEDVLYLSMK----- 66 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--------DAGYKE 113 +D+ + V G +Y EH S M R + Y + ++ Sbjct: 67 NDVSFLV---GGTMNLY---EHLSTFNPNMPLRGVFYFSRLYEGYVADNNLMIYHEKRVR 120 Query: 114 LPLVLPMLFYHGCRS-PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH-R 171 LP ++FY+G ++ P L D F +++I + E+M+H R Sbjct: 121 LPKPKYIVFYNGTKNQPDSMELRLSDCFENTDNDAPCLECTATMLNINYGHNQELMKHCR 180 Query: 172 KMALLELIQKHIRQRDLLG-LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 ++ + + +R+ V+ + + N L I Sbjct: 181 RLEEYSIFVQCVREYIQSEPSVEDALEKAIDTCINQDVLADFL-----KKHRAEVTNMIL 235 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRI 266 ++ EK + REEG M+G+ E + Sbjct: 236 TTYDKDLYEKTLKEDAREEGREEGLMEGRAETRAEL 271 >UniRef50_B0KCX4 Putative uncharacterized protein n=12 Tax=Thermoanaerobacterales RepID=B0KCX4_THEP3 Length = 267 Score = 94.5 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 52/289 (17%), Positives = 103/289 (35%), Gaps = 26/289 (8%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S +D K + A D L T F + + SD++ Sbjct: 2 SQKYDITIKDIFSN--MADDITAYFLG------LTYTKTDELNIEFT--KVEKRQSDIVL 51 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 T++G I V +E QS ++ M +RM+RYS+ M+ + ++ Y G Sbjct: 52 KCTTEKGD--IAVHLEFQSDNDDKMPYRMLRYSLEIMEKYN-------LTPYQLVIYMGK 102 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 L ++ I + ++D+ + +I + L L+ R+R Sbjct: 103 ND-----LRMENKLDYNLGEENILDYRYKIIDVGTIKFLDITKTDYYDLYALLPIMDRER 157 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 + + + + + E+ ER E +++ I Sbjct: 158 RKTEGEKYLKECVEAIKNI-PIDINKKKDITFKAEILSGLVYSREVIERVFTEVMEMLRI 216 Query: 247 AD-RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + + +G E++LRIA+E+L G+D + +T LS +++ Sbjct: 217 EESEAYKMILEKGAKEKSLRIAKELLKEGMDINKIAKITELSIEEIKKL 265 >UniRef50_B9E303 Putative uncharacterized protein n=2 Tax=Clostridium kluyveri RepID=B9E303_CLOK1 Length = 304 Score = 94.1 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 79/241 (32%), Gaps = 31/241 (12%) Query: 79 VVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK--------ELPLVLPMLFYHGCRSPY 130 +E QS+ + M R++ Y + + L K +LP ++PM+ Y+G + Sbjct: 28 CFLEFQSRVDYRMPMRLLFYMVEIWREILKNTSKNDRSKKDFKLPSIIPMVLYNGKNTWT 87 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM-ALLELIQKHIRQRDLL 189 + + L DI ++++ M + + L+ K I + DL+ Sbjct: 88 ACKNFKDVLSGSKLFGENVIDFRYMLFDIYRYNEEQLEDMANMVSTVFLLDKEISKEDLV 147 Query: 190 GLVDQIVSLLVT-----------------GNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 + +L D + K +L+ + + + Sbjct: 148 KRLRLTAYVLKKITPEQFDILKAWLKSIIKPRLDSESKIKIEEILEKSSQGEVDSMVSNL 207 Query: 233 AE-----RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 + + L R+EG +G+ E +E + + LV T+L Sbjct: 208 GKTIDNIIREGRETGLEEGRREGRKEGRKEGRKEGRKEGRKEGKSELITKMLVKKFTKLP 267 Query: 288 P 288 Sbjct: 268 D 268 >UniRef50_A6BF26 Putative uncharacterized protein n=14 Tax=Clostridiales RepID=A6BF26_9FIRM Length = 366 Score = 94.1 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 42/285 (14%), Positives = 84/285 (29%), Gaps = 21/285 (7%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 D +F+ + + L + LE ++ +D Sbjct: 51 KAKRMYKDTIFRMLYHDKENLLSLYNAVNGREYTDPEKLQVVTLENAIYMGMK-----ND 105 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY------KELPLV 117 L + + Y+Y EHQS + R + Y Q + +++P Sbjct: 106 LAF---IMDMNLYLY---EHQSTYNPNIPLRNLFYIADEYQRLVVRKSLYSTVIQKIPTP 159 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 ++FY+G + S L E ++++ ++M+H + L E Sbjct: 160 RFLVFYNGTKEVEDRSEFRLSSAYENPTENPDLELRVTMLNVNDGHSSDLMEHCR-TLKE 218 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 Q R R D + VT ++ + + L + R I E + Sbjct: 219 YAQYVARVRKYAAKQDVSLEEAVTRAVDECIEEGILAEFLLKNKTEVIRVSIYEYDKEFE 278 Query: 238 QEKEK---LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 ++K + E G G + G++ Sbjct: 279 EKKLRKAEYEAGRQDGIEIGRQDGIEIGRQDGIEIGRQDGIEIGK 323 >UniRef50_C2G1H3 Hypothetical cytosolic protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G1H3_9SPHI Length = 294 Score = 93.4 bits (230), Expect = 8e-18, Method: Composition-based stats. Identities = 51/300 (17%), Positives = 100/300 (33%), Gaps = 27/300 (9%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFI------DEDLRQYYSDL 64 D ++K L DF+ P + + D Sbjct: 6 DYLWKGVLED--VFDDFLRFLYPDADSVFDLSRGITFLDKELEQLFPPEGNEFAPKVVDK 63 Query: 65 LWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 L V T +G ++ + +E Q + A RM Y + + +K + + Sbjct: 64 LAQVYTHDGMEEWVLIHVEVQGTCRKDFASRMFTYYYRILDKY----HKRITAFAILT-- 117 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV--PDDEIMQHRKMALLELIQK 181 S P + +EF +I + + D + D+ A + K Sbjct: 118 --EASKKPRPNVYEEEFMGTSIQYRFNTYKIAEQDTDRLLASDNPFALVVLTAKAAFVGK 175 Query: 182 HIRQRD-----LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT--GDAQRFRAFIGEIAE 234 ++ +D LL Q+ L+ N + +++ L N++ D + E Sbjct: 176 NLNDKDESDKALLQTKIQLARELLERNMSKEKIRGLMNFLRYYVRFDNSEVNTIFEQEVE 235 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + + M I + L +GK E + +A+EM G+ E ++ T+LS ++ Sbjct: 236 KLTERSHT-MGIEELLLNRAKKEGKRESLISVAREMKKDGIPVEQIVKFTKLSIKEIEKL 294 >UniRef50_C8PLW8 Putative uncharacterized protein n=2 Tax=Treponema vincentii ATCC 35580 RepID=C8PLW8_9SPIO Length = 264 Score = 93.4 bits (230), Expect = 8e-18, Method: Composition-based stats. Identities = 53/285 (18%), Positives = 93/285 (32%), Gaps = 34/285 (11%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F + H R F+++ + K+ L++ + + + + D+L VK Sbjct: 14 DFMFCKVMEHESLCRPFLEMLFSTQIEKITYLSSQNIITTN---SEAKTVRLDVL--VKD 68 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPY 130 G Y IE Q E + RM Y LD GY ++ Sbjct: 69 DIGTSY---DIEMQVGNEYNIPKRMRYYQAVLDVAFLDKGYSY-------------KALN 112 Query: 131 PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLG 190 + ++ F R +Y+ + D I+ H + L K ++ Sbjct: 113 NSVIIFVCLFDPIGNDRAVYTFENI-----CIEDKTILLHDGTKKIILNAK-AFKKTDNQ 166 Query: 191 LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRL 250 + + + TG KA Y + + LM D Sbjct: 167 ELRGFLQYVTTG-------KATTAYTGRIEQMIQTVKQNELARREYHILPAALMDAMDEG 219 Query: 251 REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 G +G ++AL A+ +L GL E + T LS ++ A Sbjct: 220 EARGLAKGSRQKALETAKNLLHFGLSVENIAQATGLSQAEVEALK 264 >UniRef50_C8PT67 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PT67_9SPIO Length = 285 Score = 93.4 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 96/288 (33%), Gaps = 22/288 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F + P+ + ++ L + T ++E ID+ R + + V Sbjct: 13 DYMFYRVMEDPEICKMLLNRVLQGKVD-----TITEIELQKTIDDAGRAKG--VRFDVWA 65 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELP--LVLPMLFYHGC 126 ++ G IY IE Q+ ++ +A R+ Y A + L Y+ LP +L + Sbjct: 66 KDCNGRIY-DIEMQAIDKKDLAKRIRYYQAAIDVSILGKSKPYESLPDTFILFFCTFDYL 124 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 P + I + I + + LE + + Sbjct: 125 EKTLPVYTFKTMCSEDSRIELGDGVTKII---INSKAAEHEKNEKLKVFLEYMNGKVSND 181 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 + + ++Q + + R+ + +R G A A + + Sbjct: 182 EFIQRLEQRIKEVKANEELRREY-------MLVNTIERDARNDGWKAGIAQGIAQGIAQG 234 Query: 247 ADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 EG +G H +AL A+ + GL E + T L+ ++ Sbjct: 235 KSLGLAEGEARGSHHKALETARNLRSMGLSIEKIAQATGLTVQEVETI 282 >UniRef50_C0G0A4 Putative uncharacterized protein n=2 Tax=Roseburia inulinivorans DSM 16841 RepID=C0G0A4_9FIRM Length = 319 Score = 93.0 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 81/252 (32%), Gaps = 20/252 (7%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 D VF+ + + + DL + LE ++ +DL Sbjct: 53 NRNYKDTVFRMLFSDRKNLLSLYNAVNQSNYKNPEDLEIVTLENAIYMG-----IKNDLA 107 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY------KELPLVLP 119 + + Y+Y EHQS M R + Y + Q +D +++P Sbjct: 108 F---IMDTNLYLY---EHQSTYNPNMPLRDLFYICSEYQKLVDKKSLFSSTLQKIPAPNF 161 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 + FY+G + L E ++++ + ++MQH M L E Sbjct: 162 IEFYNGSTVISDCTELRLSSAFECLTGEPKLELIVTVLNVNEGHNADLMQHCSM-LKEYA 220 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 Q R R D ++ V ++ + + L + I E + ++ Sbjct: 221 QYVARVRHYAS--DMPLNEAVKHAVDECIREGILAEFLTQNRNEVISMSIFEYDKELEEK 278 Query: 240 KEKLMTIADRLR 251 + ++ + Sbjct: 279 NYEKQSLRQDAK 290 >UniRef50_A7B1D1 Putative uncharacterized protein n=3 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B1D1_RUMGN Length = 323 Score = 92.6 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 48/286 (16%), Positives = 101/286 (35%), Gaps = 20/286 (6%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D F+ + + R FI L P ++ D ++L P + + Y L V+ Sbjct: 53 DFCFQELMEDEEVRRGFIGAFLRIPPEEILD---MELLPKKLRKKYKEEKYGILDVRVRL 109 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCRS 128 +EG + IE QS + R + Y + + G Y +L + + Sbjct: 110 REGEQ---LNIEMQSIAYDYWQERSLFYLGKMYVDQIHEGEDYDKLKKCIHVGILDFTLF 166 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 + + + ++++ + E Q + + R R+ Sbjct: 167 EHERYYSCFHIWEDTIRDMYSDKFEIHVLELPKLAKYEYPQTELLRWAQFF--GARSREE 224 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIAD 248 + ++ + D + ++ + + + R E + + L + Sbjct: 225 IEVLAE----------KDEYIHKAYDKLEEISADEEKRLEYEERQKAIRDHRHMLASGRR 274 Query: 249 RLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 EG +GKHE A+ +A++ML+ L E + + LSP+D+ Sbjct: 275 EGLREGLREGKHEHAVEMARKMLEDKLPIEKIAEYSGLSPEDVHRL 320 >UniRef50_A5KR99 Putative uncharacterized protein n=11 Tax=Ruminococcus torques ATCC 27756 RepID=A5KR99_9FIRM Length = 317 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 43/311 (13%), Positives = 95/311 (30%), Gaps = 27/311 (8%) Query: 1 MTISTTSTPHDAVFKSFL----RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED 56 M ++VF + PL + + +++ +++ Sbjct: 8 MAGKENREIKNSVFVDLFYEDESAEANEIALFNAIHDEPLPEGTKIRRFRVDNTIYMN-- 65 Query: 57 LRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL------DAG 110 + +D+ + G + V EHQS E M R + Y A + + Sbjct: 66 ---FQNDISFDAG-----GKVIVFGEHQSTINENMPLRSLLYIGRAYERLVPPRSRYKKK 117 Query: 111 YKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH 170 LP FY+G L + +++I EI++ Sbjct: 118 IVPLPTPEFYTFYNGKEKWEKEKELRLSDAYIVKDGEPSLELKVKVINIRPEEHHEILEK 177 Query: 171 RKM-----ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF 225 ++ +E++Q + + I + G D ++ V D + Sbjct: 178 CQVLKEYSQFMEIVQNYQISGEEEPYKKAIKECIEKGILADYLMRKGSEVVNMLLDEYDY 237 Query: 226 RAFIGEIAER--APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 I E +E + R++G +G+ E + Q+ L++G + Sbjct: 238 ETDIEVQREEAREQGREEGRKQGREEGRKQGREEGRKAERSTLIQKKLEKGKTISQIADE 297 Query: 284 TRLSPDDLIAQ 294 + +++ Sbjct: 298 LEDTEENIACL 308 >UniRef50_C6LE73 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LE73_9FIRM Length = 326 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 111/303 (36%), Gaps = 27/303 (8%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPA-----------PLRKLCDLTTLKLEPNSFIDEDL 57 D + K + R DF++ L P+ L E + Sbjct: 3 EKDIILKEYQRDSRHFCDFVNGALAQGRPLLKRGQLVPVPTELVLVKDTEEDDENAVVKT 62 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY------ 111 Q + D+ + + G I V I++Q+ + M R+M Sbjct: 63 VQRFRDITGKAEADKNAGCIIVAIQNQTTVDYGMPLRVMLEDALEYDVQRRTKKNRKLHK 122 Query: 112 -KELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKI--YSSAFPLVDIT-VVPDDEI 167 ++L LV+ ++FY+G +P+ + + P R++ Y ++P+V +T D Sbjct: 123 GEKLCLVITLVFYYG-TTPWRAPSDLAEMISVPREFRQLREYIQSYPIVVVTPENVDTAC 181 Query: 168 MQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTN-DRQLKALFNYVLQTGDAQRFR 226 + +LE++++ ++++ +++ ++ + +R + AL +++ + + Sbjct: 182 FRGGWQEILEILRRQNDEKEMGRYLEKNRAIYEKLPEDTNRVIFALTDHLDYYRELKEKG 241 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALR----IAQEMLDRGLDRELVMM 282 I K + + + G QG + + + M++ + ++ Sbjct: 242 EKITMCKAFTDHYKSGVEEGKKQGMKRGRRQGIKQGKRQGMDMGIRAMIETCRELKIPRN 301 Query: 283 VTR 285 T+ Sbjct: 302 ETK 304 >UniRef50_C0F0J0 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F0J0_9FIRM Length = 316 Score = 91.8 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 46/321 (14%), Positives = 96/321 (29%), Gaps = 53/321 (16%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR-------QYYSD 63 DA+ K +L + + D + + +++ ++ + + + Q + D Sbjct: 5 DALTKEYLSNNEIFADVFNYLIYDGQQRILPENLIERDTSEITLPLGKRGELATIQKFRD 64 Query: 64 LLWSVKTQEGVGYIYVV--IEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-------- 113 +L +E +YV+ +E+QS M R M Y ++ K+ Sbjct: 65 ILKGCIAKEYKNTLYVLFGVENQSHIHYAMPVRNMLYDAINYSAQVNEKTKKYRKIRKQN 124 Query: 114 -------------------LPLVLPMLFYHGCRSPYPY-SLCWLDEFAEPAIARKIYSSA 153 L V+ + Y G SL + + ++ + Sbjct: 125 PNFKETTEEFLSGWHPDDRLVPVITVTIYFGNDGWDAAKSLQEMFSETDESLKEFLPDYK 184 Query: 154 FPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALF 213 L+ + + H + L I K I + + ++ +D AL Sbjct: 185 LHLISCNNISNFT-KFHTEFGRLMHILKVISDEEQMDIL-----------LSDPGYSALS 232 Query: 214 NYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR 273 Q + F E + +E G +G +E Q M Sbjct: 233 VTAAQIINTFTGLHFSIPEKED----TINMRNAWTDHKESGRREGFNEATTSYTQRMYKA 288 Query: 274 GLDRELVMMVTRLSPDDLIAQ 294 G+ E++ V ++ Sbjct: 289 GIPLEVIAEVIEKPVTEVEKI 309 >UniRef50_B4VKU9 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKU9_9CYAN Length = 323 Score = 91.4 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 86/295 (29%), Gaps = 16/295 (5%) Query: 1 MTISTTSTPH-DAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDED 56 MT +P D FK D F++ + + LT + + E Sbjct: 1 MTKKKFISPKIDYAFKKIFGSDQSEDILISFLNAIVYNGKSVISSLTIVNPYNPGQV-ET 59 Query: 57 LRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKEL 114 L+ Y D+ G V+IE Q R+ A N L +G Y E+ Sbjct: 60 LKDSYLDI--RAVLNSGE---IVLIEMQVARIAAFYKRVTYNLCKAYANQLTSGDYYLEI 114 Query: 115 PLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV--DITVVPDDEIMQHRK 172 V+ + F + + + L+ ++ Sbjct: 115 TPVIAVTITDFILFKENPKCIHHFVFKDKESSSEYPEHELQLIFVELPRFVKKLPELQTL 174 Query: 173 MALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI 232 +DL + + + + +A ++R EI Sbjct: 175 AEKWIYFMTQA--QDLEEIPESLAEVTAIEKALTIANQANLTPAEAEEVSRRAMQLRDEI 232 Query: 233 AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 +E + R+EG +G+ E + A+ ++ R L++ L+ Sbjct: 233 GRIKYATEEASKEAREEGRQEGRQEGRQEGRITEARALVLRLLNKRFPDQTAELN 287 >UniRef50_C0QZ87 Chromosome segregation ATPase n=19 Tax=Bacteria RepID=C0QZ87_BRAHW Length = 309 Score = 91.1 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 43/315 (13%), Positives = 101/315 (32%), Gaps = 29/315 (9%) Query: 3 ISTTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 + + +D + + D + ++ L + ++ L++ + E+ Sbjct: 1 MKEINRLNDLFVRYLIGTEGDEDILENIVNAVLNDVGFE--SVSNLEIINPYNLAENENL 58 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL--V 117 S L KT++G ++IE Q R++ Y + + L + + + Sbjct: 59 KESILDVKAKTKDGKK---ILIEIQLIGNNNFIKRILYYIAKNISSELKENENYINISQM 115 Query: 118 LPMLFYH-----GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV----PDDEIM 168 + + F + G S + K+ ++I + I Sbjct: 116 ISISFLNFNLKIGSESDIKREHKCFQLSDINNSSLKLDDFQIHFIEIKRFAEILKNASID 175 Query: 169 QHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 + K LL I +DL +++++ + + K + F Sbjct: 176 DYNKNKLLSWID-FFTAKDLEKSINKLIGGNDIMSKVMDKYKRFVADEKEMSAYNERDTF 234 Query: 229 ---------IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + + + E+G QG+ +AL IA+ + GLD + Sbjct: 235 LYGQAAMLQYEREEGKKEGIEIGIQQGIKEGIEQGIEQGEKNKALSIARSLKKSGLDDKF 294 Query: 280 VMMVTRLSPDDLIAQ 294 + T L+ +++ Sbjct: 295 ISENTGLTIEEIEKL 309 >UniRef50_C5RQ96 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RQ96_CLOCL Length = 288 Score = 91.1 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 47/288 (16%), Positives = 94/288 (32%), Gaps = 24/288 (8%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPL-RKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D VFK +D + L A L + +++ E L VK Sbjct: 17 DFVFKLLFGDEKN-KDLLIAFLSAVLNLPEREFVGIEILNTELFREFKEDKKGILDVRVK 75 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCR 127 T G + IE Q P E M R + Y + G Y +L + + Sbjct: 76 TVNGKQ---IDIEIQVLPTEFMPERTLFYWSKMYTTQVKPGDTYDKLKKCITINIVDFKC 132 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRD 187 P + + ++I + D +I + +++ Sbjct: 133 IPLNKLHTSYHLIEDETGHKLTDILEVHFLEIPKLFDKQIEINEDDPIIQW--------- 183 Query: 188 LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIA 247 + +D ++ + +L+ I E E +++ + A Sbjct: 184 -MEFLDGKSKGVMEMLAEKNESIKKAYNLLKIISKDEKARMIYEAREAELRDQLTRIRSA 242 Query: 248 DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + +G +E+ALR+A++M+ RG ++ +T LS + ++ Sbjct: 243 E-------EKGANEKALRVAEKMIKRGDSINDIIELTELSKEKILELK 283 >UniRef50_B0K519 Putative uncharacterized protein n=14 Tax=Thermoanaerobacteraceae RepID=B0K519_THEPX Length = 288 Score = 90.7 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 90/229 (39%), Gaps = 13/229 (5%) Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN--------HLDAGYKELP 115 +++ VK ++ + Y+++E QSK + M +R++ Y I + +LP Sbjct: 1 MVYQVKLKDKEVFFYILLELQSKVDFQMPYRLLLYIIEVWREILKDTSLNQQKRKDYKLP 60 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM-A 174 ++P++ Y+G + I + L+D+ ++E++Q + + Sbjct: 61 AIIPIVLYNGVNRWTASLSFKETIDSYQLFGENIIDFKYILIDVNRYNEEELLQLSNLIS 120 Query: 175 LLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 + L+ + I + +L ++ +L + ++ + L N++ EI E Sbjct: 121 SIFLLDRKIDKEELTEKWGKLADVLK--DISEEEFIILRNWLFSVVSRFLPEDKEKEIKE 178 Query: 235 RAPQEKEK--LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVM 281 Q + +++ +R E + + E ++ GL + Sbjct: 179 ILVQSEGVEEMISNLERSLREEFRKTRREGLKEGLKKGKLEGLKIGKME 227 >UniRef50_A7BWQ7 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7BWQ7_9GAMM Length = 290 Score = 90.7 bits (223), Expect = 6e-17, Method: Composition-based stats. Identities = 46/299 (15%), Positives = 94/299 (31%), Gaps = 22/299 (7%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLL 65 HD++FK + +F + P + ++ + L Sbjct: 3 NPKSHDSLFKWLIT--AFTTEFFGHYFPDIRIGEYTFIDKEFISKYENLKESLKGDLFLG 60 Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG 125 V+ + I + IEHQS+ E ++ R+ YS A V ++ Y Sbjct: 61 MEVEIDGLLREIIIQIEHQSERE-DVSERVYEYSCYAWLLKKK-------PVWSIVIYTD 112 Query: 126 CRSPY-PYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK-HI 183 P + + F + F ++ + +++Q + L K Sbjct: 113 EAVWRKPVTEQFWYAFDSQKGKQYH---HFDVIKVKAEKSSDLIQKHSLMCKLLALKADD 169 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE---- 239 RQ D LV +I + L + + + +I + + Sbjct: 170 RQTDPEKLVYEIYRAAALMKEQLTNEQLLLIDQWVSFYKKVSEKRLDKIKKEIKMDFIET 229 Query: 240 ---KEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + + EG +G+ + + A +L G+D E++ T S ++ S Sbjct: 230 TISEHVYNQGWIKGEAEGKAEGEAKGRKKTAINLLKMGIDVEIIQKATGFSDAEIKQMS 288 >UniRef50_A6MYW5 Chromosome segregation ATPase n=4 Tax=Rickettsia RepID=A6MYW5_9RICK Length = 296 Score = 90.3 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 43/300 (14%), Positives = 90/300 (30%), Gaps = 14/300 (4%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + + D FK + +D + + + + K + + L S Sbjct: 1 MERITPRVDLAFKKIFGVEEN-KDLLISLINSIVSKEDQIVDVTLLNPYNPQNFRNDKLS 59 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLF 122 L + G + IE Q E R + Y L A L + Sbjct: 60 ILDIKALGESGKRF---NIEIQITDEADYDKRALYYWAKLYTEALQASQDYSSLNKAIGI 116 Query: 123 YHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKH 182 + + P + + + F + F +++ + ++ + L ++++K Sbjct: 117 HILNFTSIPETNKYHNIFHITEKDSGLLY--FKDLELHTIELNKFSNNPNEELADILKKV 174 Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEK 242 D+ LL + N + A L D F + + E + Sbjct: 175 GNSLDIWSAFLTRHDLLNSNNLPKKLDNASLKKALTVLDVMNFTSEERDAYEDHLKWLRI 234 Query: 243 LMTIADR--------LREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + + EG GK EE + IA+ + G+ ++ T L+ + Sbjct: 235 EANTLKKYEAQARVRGKVEGIQIGKTEEKIAIARNLKRSGVAITIISESTGLTKKQIEEL 294 >UniRef50_UPI0001C371D2 hypothetical protein RflaF_10865 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C371D2 Length = 317 Score = 90.3 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 48/319 (15%), Positives = 91/319 (28%), Gaps = 44/319 (13%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR--------QY 60 DAV K +++ + D + L R++ LK + I Q Sbjct: 3 DKDAVTKDYMQDSEHFADAFN-FLLYGGRQVIKPEQLKPLDTTSIALPYGDESRFVPIQK 61 Query: 61 YSDLLWSVKTQEGVG--YIYVVIEHQSKPEELMAFRMMRYSIAAM--------QNHLDAG 110 Y D+L V E Y+ + IE+QS M R M Y + H + Sbjct: 62 YRDVLKMVTAMEDENATYLILGIENQSDIHYAMPIRNMLYDAIQYVNQADTIAKEHRKSK 121 Query: 111 Y---------------KELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP 155 + ++ + Y G A I + + + Sbjct: 122 KMPETRAEYLSGFYKTDRILPIITLTLYFGADEWDAPRDLHSMLTANEDILKFVDNYHLH 181 Query: 156 LVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY 215 L+ + D++ + L L K+++ + IV+ D +++ Sbjct: 182 LIAPAEIEDEDFA--KFHTELSLALKYVKYSKDKKKLRDIVNE-------DTAFRSVSRK 232 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG- 274 + E E + I EG +G E +R ++ G Sbjct: 233 TADMVNVVTSSNLHYNDGEERVDMCEAIEEIRKDALAEGKAEGIEEGIIRTLIGLVKDGI 292 Query: 275 LDRELVMMVTRLSPDDLIA 293 L ++ + Sbjct: 293 LTIADAAKRADMTVPEFEE 311 >UniRef50_B8FP58 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FP58_DESHD Length = 167 Score = 90.3 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 34/88 (38%), Positives = 53/88 (60%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S PHD FK TAR F++ +LP +R L DL T+ + +S+ID++L++ +S Sbjct: 1 MSLIHNPHDKFFKETFGDVGTARSFLENYLPQEVRALVDLKTVLPQKDSYIDQELQESFS 60 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEEL 90 DLL+ VK +E GY Y + EH+ +P Sbjct: 61 DLLFQVKIRENEGYFYFLFEHKVRPYAD 88 >UniRef50_A5Z376 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z376_9FIRM Length = 316 Score = 90.3 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 41/307 (13%), Positives = 92/307 (29%), Gaps = 29/307 (9%) Query: 3 ISTTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 + + D VF+ + + + + DL L+ +++ Sbjct: 1 MEGSKKHKDRVFRKLFGYEKNKGNLLELYNALNDSNYTNPDDLEINTLDDVFYMNMKNDV 60 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--------DAGY 111 + IY EHQS M R RYS +++ Sbjct: 61 SC--------IIDWNMAIY---EHQSTWSYNMPLRGYRYSAELYNDYIVRNNLDVFRRKL 109 Query: 112 KELPLVLPMLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH 170 ++P +FY+G + P L D F P + + +++I ++E+M Sbjct: 110 IKIPTPQYYVFYNGNEKRPDREVLKLSDAFMVPCKDGE-FEWTATVLNINAGHNEELMSK 168 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 + L E + ++ L ++ + + + LQ ++ Sbjct: 169 CSI-LREYAIMVSKIKEFLAESLELKDAIKKA-IDYCLDNNVLKEFLQDHRSEVEDMLWR 226 Query: 231 EIAERAPQ---EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLS 287 E E +++ E G G+ + +++ + L + E + Sbjct: 227 EYNEEETMAHWKEDFYEEGEQHGLEVGRANGEKIKLIKLVCKKLVKNKSIEEIADDLEED 286 Query: 288 PDDLIAQ 294 + Sbjct: 287 VSTIEKI 293 >UniRef50_C6XV94 Putative uncharacterized protein n=7 Tax=Pedobacter heparinus DSM 2366 RepID=C6XV94_PEDHD Length = 283 Score = 89.9 bits (221), Expect = 9e-17, Method: Composition-based stats. Identities = 50/281 (17%), Positives = 109/281 (38%), Gaps = 21/281 (7%) Query: 22 DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVI 81 R+ + LP ++ + L +E + + ++ +DLL V+ +G Y+ + + Sbjct: 16 KIFRENMHNTLPGIIKHVLHLNVNTVEELADDVQFTKERKTDLLKKVRDNKGNRYV-LHV 74 Query: 82 EHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA 141 E+Q+ MAFRM YSI + H V + Y G + +F Sbjct: 75 EYQTDNYPEMAFRMAEYSIMLQRKHK-------LPVKQFVIYIGPAKANMATSITTKDFR 127 Query: 142 EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVT 201 ++ + + L ++++ + +A+L + + L +V +I + T Sbjct: 128 FRYNLTELSAVNYKL-----FLKSDLVEEKMLAILSNLASESTESVLAQVVQEIETHTST 182 Query: 202 GNTNDRQLKALFNYVLQTGDAQRFRAF--------IGEIAERAPQEKEKLMTIADRLREE 253 + L+ + + + + E + + + + Sbjct: 183 LEQGRYFRQLRILLQLRNLNKKAIKDMALVGKIFKEEKDILYRRGEIKGEIKGEIKGEIK 242 Query: 254 GAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 G +G++EEA+ IA E+ GL E + +T+LS +++ A Sbjct: 243 GIEKGRYEEAMEIALELKKEGLATEFIAKITKLSIEEIQAL 283 >UniRef50_C1QAK6 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QAK6_9SPIR Length = 290 Score = 89.9 bits (221), Expect = 9e-17, Method: Composition-based stats. Identities = 31/299 (10%), Positives = 90/299 (30%), Gaps = 16/299 (5%) Query: 3 ISTTSTPHDAVFKSFL---RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 + + + +D + H + I+ +K+ + E + Sbjct: 1 MRSINVLNDYFMRYMFAKEGHEHILLNLINAIRTD--YNQEPFEEVKVLNTFNLKETIND 58 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELP 115 S + T+ G V++E Q + +R + Y ++L K + Sbjct: 59 KQSIVDVRAITKSGET---VLVEIQRVGNQSFVYRSLYYWAKGYISNLRNNEKYNDLKQV 115 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL 175 +V+ +L ++ + +++ ++ Sbjct: 116 IVINILDFNLLK-DINKEHSCYVIKELETNHILTNHLEMHFLELPKYLFSSSRLTDELYA 174 Query: 176 LELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 +R+ + + ++ L+ +V + + + Sbjct: 175 WFYFLTIKEKREKMEEIMEM--LVKKNPIMKEVYDEYNKFVNTKDLFDNYTEYEKNYFDM 232 Query: 236 APQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 +E++ + +EG +G+ +A+ +A+ M +D + T LS +++ Sbjct: 233 LALNEERIK-GREEGLKEGIEKGEKNKAISMAKNMKKDKVDFNTISKYTGLSIEEIENL 290 >UniRef50_B1WSK8 CHP1784-containing protein n=11 Tax=Cyanobacteria RepID=B1WSK8_CYAA5 Length = 260 Score = 89.9 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 97/274 (35%), Gaps = 25/274 (9%) Query: 22 DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVI 81 D F+ ++L + L +D L +++ + I + I Sbjct: 5 DNVCKFLAERFSRDFANWLLNEPIELTELKPTELSLNPIRADSLIFLQSDD----IVLHI 60 Query: 82 EHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA 141 E Q+ P+E + FRM Y + + + + + ++ Y P L + + F Sbjct: 61 EFQTSPDEDIPFRMTDYRLRVYRRYPNKE------MYQVVIY---LKPSNSELVYQNTFE 111 Query: 142 EPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVT 201 + F ++ + D + + + ++ R+ + QI +++ + Sbjct: 112 LTNL-----RHQFNVIRLWEENTDSFLNNSGLLPFAVLTCTDNPRET---LTQIAAIIDS 163 Query: 202 GNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHE 261 RQ + + +G + + +E I + EG ++G+ + Sbjct: 164 MPNQQRQSDISASTAILSGLKLDQDSIKRILRSDIMKESV----IYQEIFHEGEVKGQKQ 219 Query: 262 EALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 IA ML ++ E++ +T L+ ++ + Sbjct: 220 AIKNIALNMLRNHMNLEVISQLTGLNLQEIEQLN 253 >UniRef50_C0EXQ3 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXQ3_9FIRM Length = 290 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 93/307 (30%), Gaps = 38/307 (12%) Query: 2 TISTTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 T + D +F+ + + + D+ L +I Sbjct: 6 TGNANREYKDRLFRFVFGAEENKAYLLSLCNAVSGTDYTDVDDIEITTLSDAIYI----- 60 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY------- 111 + +D+ + + +Q + EHQS M R M ++ Sbjct: 61 KMKNDISFLIDSQMN------LFEHQSTFNPNMPLRGMECFAELYGIYIIENNLDIYVSS 114 Query: 112 -KELPLVLPMLFYHG-CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ 169 +++ + Y+G + P L D F P + + + +++I + ++++ Sbjct: 115 LQKILTPRYYVIYNGTEKQPDVVKLKLSDAFQVPDDSGE-FEWTATMLNINYGHNRKLLE 173 Query: 170 HR-KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 + K +R+ + + + V + + F Sbjct: 174 QCQPLYEYAHFIKLVREYSEAMELKKAIDKAVEKAREWKCIGTFLYQCKSEVSVMLLTEF 233 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 + E D L + G +G+ +E ++ ML L E++ +S Sbjct: 234 DEKKHE-------------DNLIKLGEKEGREKERMKNICSMLALSLSPEIIAKACEVSV 280 Query: 289 DDLIAQS 295 D ++ Sbjct: 281 DYVLNLK 287 >UniRef50_C4G3R2 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G3R2_ABIDE Length = 336 Score = 88.7 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 43/293 (14%), Positives = 91/293 (31%), Gaps = 30/293 (10%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 D+VF R + + N FI+ Y+DL ++VK Sbjct: 66 KDSVFTLLFSDIKNIRKLYQSLHDDSDSYSDEDFKIITLENVFINAP----YNDLGFTVK 121 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--------KELPLVLPML 121 + + ++ E QS M R++ Y + +++ LP ++ Sbjct: 122 NK-----VIILAEAQSTFNPNMGLRLLIYIAQSYHDYISEYKFNIFSEKLIRLPNPEFIV 176 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 Y G + + D F + + I E + + E+ + Sbjct: 177 IYSGSKKTDITEIRLSDCFESGT----APNIELVVKVIGGNNVKEGIIQEYLKFCEMYDE 232 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 +R + ++ ++ LK + + + ++ Sbjct: 233 KVRSVKPSEEKAYSLKKVIKDCIDNGILKDFLTL-----HQKEVEDMMMTVI----PPEQ 283 Query: 242 KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 L I +G QGK + +L A+ ML + ++ +T LS + + Sbjct: 284 ALEYIKLEEYNKGIEQGKLDTSLNFARNMLKNNYSIDSIIEITGLSREQIKRL 336 >UniRef50_B4VKW0 Putative uncharacterized protein n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VKW0_9CYAN Length = 296 Score = 88.7 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 44/314 (14%), Positives = 101/314 (32%), Gaps = 40/314 (12%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + T D K LR+ + L LRK + ++ ++ ED + Sbjct: 1 MPPTHIRFDWAIKKLLRNKAN-YGVLAGFLSELLRKPITIQSILEGESNQQAEDDKLNRV 59 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-------LP 115 D+L E ++IE Q+ E+ RM+ + + + L+ G Sbjct: 60 DILA-----ENDRGELILIEVQNSTEQDYFHRMLYGTSRLITDFLEKGEPYGNVKKVYSV 114 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIY-------------SSAFPLVDITVV 162 ++ G Y +L + + + I + ++ + Sbjct: 115 NIVYFSLGQGDDYIYHGTLEFRGLHLDDKLGLSINQRKLFNSQDVYEIFPEYYVIKVNNF 174 Query: 163 PDDEIMQHRKMALLELIQK--HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG 220 + + + K I++ + + L+ + ++ + + Sbjct: 175 NE---VASDTLDEWIYFLKKSQIKEEFTAQGLAEAKENLLVDSLSEAERANYLRF----- 226 Query: 221 DAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 EI+ E + + +EG QGK +E + IA+ + +G D + + Sbjct: 227 ----MENRRYEISLIESSRSEGRLEGLEEGLKEGMEQGKQQEKVNIARLLKQQGTDLDTI 282 Query: 281 MMVTRLSPDDLIAQ 294 T L+ +++ Sbjct: 283 TAATGLTREEIEEL 296 >UniRef50_C1DXV7 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DXV7_SULAA Length = 357 Score = 88.7 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 106/254 (41%), Gaps = 8/254 (3%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR-QYYSDLL 65 PHD K L+ + A+ +D HLP + + TL++ +D + +Y++D++ Sbjct: 14 QNPHDTYAKELLKDEEVAQVLLDAHLPQEINSIIKKETLEIINTENLDYKEKSKYFADII 73 Query: 66 WSVKTQEGVGY-IYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 +S+KT G IYV+IEH+S ++ + ++++ A + G ++ + P++ Y Sbjct: 74 YSLKTIYGEDLKIYVLIEHKSYDDKHLPLQLIKNMTAVWSKEILEG--KITPIYPIVIYA 131 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQ-HRKMALLELIQKHI 183 S + + +++ + + I + ++ + L + + I Sbjct: 132 SKEKLSLESKFSNYYKISDNMKKFFLDFYVSTLNLNELDEKTIKEKYKNIYTLIMTLRII 191 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 ++ +++ I S+ N + + + + T + +I + E + Sbjct: 192 QEPTPENILNLIKSIETLYNYKPKAVYVIALSYIFTIAKKDKNTYIKVKKQL---EGGNM 248 Query: 244 MTIADRLREEGAMQ 257 ++ D EEG + Sbjct: 249 GSLLDMFIEEGLEK 262 >UniRef50_Q24Y19 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense RepID=Q24Y19_DESHY Length = 248 Score = 88.4 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 28/247 (11%), Positives = 72/247 (29%), Gaps = 32/247 (12%) Query: 79 VVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL----PLVLPMLFYHGCRSPYPYSL 134 + IE Q + M R + Y + G + + ++ ++ + Y Sbjct: 3 INIEIQLSNQYDMEKRSLYYWAQMYSRQIREGMAYKELTKTVSINIVDFNYLKQTSNYHN 62 Query: 135 CWLDEFAEPA------IARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 + E + L + + ++ L+ + +++ Sbjct: 63 VFHLYEDEEKFQLTDVLEIHFMELPKLLAKWRRR--EISLWENELVRWLLLLEGADNQEI 120 Query: 189 LGLVDQI-------VSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEI--------- 232 L ++++I + + Y + +A I E Sbjct: 121 LQILEEIAMKDPVLYQAMNAWEETSEDPRIREAYFDRRKAILDEKAAIREAELRLQEALE 180 Query: 233 ----AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 A + + + EG +G+ E +A+++L G + + T LS Sbjct: 181 EGMAKGIAEGRAKGIAEGKAEGKAEGRAEGRAEGRAEVAKKLLVLGFEITKIAEATGLSE 240 Query: 289 DDLIAQS 295 +++ Sbjct: 241 EEISGLK 247 >UniRef50_B1V1L4 Putative uncharacterized protein n=38 Tax=Clostridium RepID=B1V1L4_CLOPE Length = 300 Score = 88.0 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 41/294 (13%), Positives = 96/294 (32%), Gaps = 14/294 (4%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D VFK + ++D + L A ++ + ++L+ + + + L KT Sbjct: 8 DFVFKRLFGAEE-SKDSLISLLNAIIKSDNPIKDIELKSPDLEKQHIGDKFCRLDIKAKT 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL--DAGYKELPLVLPMLFYHGCRS 128 +G + +E Q + E M R + Y + L YK L + + + Sbjct: 67 DKGE---IINVEIQVRDEYNMVQRTLYYWSKIYSDQLGASENYKNLARTVCINILNFKLL 123 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 +++ + + + + L + I++ + Sbjct: 124 DNDRYHNTYRLKEITTNEELTDIEEIHFIELPKSKEIKSEEVNNIDSLLKWIEFIKEPES 183 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIAD 248 + ++ + K + R +A EI+ ++ L Sbjct: 184 ETVRILELTDESIRKAKTQLYKLSLDKKTIEQYRIREKAMYDEISALENSREKGLQEGVK 243 Query: 249 RLREEGAMQGKHEEAL--------RIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 R+EG +G E + +IA+ +L +GL+ + + + L + + Sbjct: 244 IGRKEGKEEGLKEGEVRGKLKANRKIAKNLLSKGLELKEIAKILELDENLVEEI 297 >UniRef50_C9RP54 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RP54_FIBSS Length = 312 Score = 88.0 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 50/290 (17%), Positives = 95/290 (32%), Gaps = 27/290 (9%) Query: 11 DAVFKSFLRH---PDTARDFIDIHLPAPLRKLCDLTTL-KLEPNSFIDEDLRQYYSDLLW 66 D +FK + + P+ F++ L K TL E +++ ++ Sbjct: 35 DGIFKMLIANEAKPERTVKFLNAMLGLTGDKAIKTYTLGVPENPGVLNDKTA------IF 88 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE-LPLVLPMLFYHG 125 + G V+IE Q L R++ Y+ + + LP + + Sbjct: 89 DIYGTTQAGEP-VLIEVQQNFNTLFVDRLIYYTARVISRTVKKAQDYNLPHIYVLSILTE 147 Query: 126 CRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + P LV++ E + ++ Q Sbjct: 148 NQFPRERDTYLHHAQLVRNRHLFYSKLDIYLVELEKFFAIEDRT---------LPENREQ 198 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 D ++ +L + + +LK L + D +F G E E + + Sbjct: 199 SDRAEMLRIFRDVLEDKDIPEEKLKRLLD-----KDFANDVSFKGYTDETLLNEVDGMTD 253 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + ++ +QGK +E IA ML G E + VT+LS +D+ Sbjct: 254 MLYE-KQGSYLQGKDDERNEIAIAMLAEGDSIEKIARVTKLSENDVRKLQ 302 >UniRef50_Q2RGS0 Putative uncharacterized protein n=2 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RGS0_MOOTA Length = 310 Score = 88.0 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 41/303 (13%), Positives = 99/303 (32%), Gaps = 36/303 (11%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 + +D K H A + + ++E SDL Sbjct: 4 KSGNRYDITIKDLFADETQELINYFGHFEARVTGDLKIEFPQVET----------RVSDL 53 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYH 124 + ++Q+G + + +E QS+ ++ M +RM+RY++ V ++ Y Sbjct: 54 VMKAESQQGP--LAIHLEFQSRNDDEMPYRMLRYALEI-------HKTYHLPVYQIVIYF 104 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 G + + + + + + L+D+ + +E+ LL L+ R Sbjct: 105 GQ-----WQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRLLSLLPVVDR 159 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG-------DAQRFRAFIGEIAERAP 237 ++ G + + +D L+ +L+ D + E+ + Sbjct: 160 EKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGLVFDKKAIDLVFREVEQMLS 219 Query: 238 -QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV----MMVTRLSPDDLI 292 +E I ++ E+G +G + + ++ L + ++ + L Sbjct: 220 IEESAGYQRIFEKGMEKGIEKGMEKGMEKGIEKGQQESLLDVTIRLLRKKFRKIPREYLA 279 Query: 293 AQS 295 Sbjct: 280 RIK 282 >UniRef50_B8HL58 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B8HL58_CYAP4 Length = 334 Score = 87.6 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 40/300 (13%), Positives = 98/300 (32%), Gaps = 33/300 (11%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSF----IDEDLR 58 ++ + D+ +K L ++ I P ++ + F D + Sbjct: 1 MTQPRSDKDSAWKEIL--RQYFQEAIVFFFPQTAEQVDWTRPYEFLDKEFQQIAPDAETG 58 Query: 59 QYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 + Y+D L V ++G ++ + +E Q+ E A RM Y++ P + Sbjct: 59 KRYADQLVKVWLKDGAELWLLIHVEVQAARESEFAQRMFTYNLRIFDRFNH------PAI 112 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + P S + +F + +++ + + ++ + ++ Sbjct: 113 SLAILCDESVRWRPES--FSFDFPDTSLSFRFGRVKLLDYRERISELEQSPNPFSIVVMA 170 Query: 178 LIQKHIRQRDLLG---LVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR--FRAFIGEI 232 ++ ++D ++ L G +++ LF ++ F E+ Sbjct: 171 HLRAQATRKDDQQRKFWKLTLIRRLYEGGYGRQEVINLFRFIDWVMILPEGLKEEFWQEL 230 Query: 233 AERAPQEKE-------------KLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + + L R+EG +G+ E A+ ++ R L R+ Sbjct: 231 KIYEEERRMPFITSVEEIGFERGLEQGRQEGRQEGRQEGRQEGRQEEARALILRPLTRKF 290 >UniRef50_A8F2U7 Putative uncharacterized protein n=15 Tax=Bacteria RepID=A8F2U7_RICM5 Length = 281 Score = 87.6 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 44/289 (15%), Positives = 98/289 (33%), Gaps = 25/289 (8%) Query: 10 HDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 +D FK + ++ L K + L + L S L+ +K Sbjct: 9 NDIAFKKLFSDK--VKLINLLNSLLRLSKGDRIIDLSYITTEQLPLFLEGRRS--LFDLK 64 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCR 127 ++ G Y+ IE Q K E+ R Y + + G +K+L V+ + Sbjct: 65 VKDETGRWYI-IEMQRKMEKDYLNRTQLYGCYTYVSQIKKGMKHKDLLPVVIISIIRAKA 123 Query: 128 SPYPYSLCWLDEFAEPAIAR-KIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 P E I + ++S + +++ +++ + K+ + Sbjct: 124 LPDELPYISYHHIKESNIHKQYLFSLTYVFIELGKFKKNDLKDDT--DEWLYLLKYA-SQ 180 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 + + ++++ + Q K + + F E+A + +K Sbjct: 181 EQEPPKEIKNEIVLSAYASLEQYKW--------TEQEHDDYFRAEMAIQQEIDK------ 226 Query: 247 ADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + G +G +E + A+EML E + T+L+ +++ Sbjct: 227 FEEKFNAGMEKGIEKEKIETAKEMLIENGPIEQIARYTKLTIEEIKKLK 275 >UniRef50_B5CRG1 Putative uncharacterized protein n=4 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CRG1_9FIRM Length = 356 Score = 87.6 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 40/314 (12%), Positives = 91/314 (28%), Gaps = 41/314 (13%) Query: 18 LRHPDTARDFIDIHL--------PAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVK 69 L+ D + L P L + T + L+ + +QY + + Sbjct: 42 LKDTKRFADLFNAILFQGKAVILPENLYPSPETTAVSLQDTQGKNVVKKQYRDII---MN 98 Query: 70 TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD-------------------AG 110 Q+ + + +E Q+ ++M Y + Sbjct: 99 WQDQALLMLLAVESQTAIHYAAPLKVMLYDSMEYAEQVRVKWKERPPRLSSAEFLSRFQK 158 Query: 111 YKELPLVLPMLFYHGCRSPYPYSLCWLDEFAE-------PAIARKIYSSAFPLVDITVVP 163 +L V+ ++FY+G L F + + + + LVD+ + Sbjct: 159 NDKLIPVITLIFYYGTEEWD-GPLELHQMFDLGTEKKHAELMKKYLPNYHINLVDVRRLK 217 Query: 164 DDEIMQHRKMALLELIQKHIRQ---RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG 220 + E Q + ++Q + R + L + +++ Sbjct: 218 NLESFQSDLQIIFGMLQYSQDKYALRTYVANHKDYFQKLDLETYHALGAFLNSRQLMEIN 277 Query: 221 DAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 + R + + + ++ R G +G+ +A +M G + + Sbjct: 278 VEKNEREELDMCKALEDIYNDGVQDGMEQGRRSGIAEGEASHKKEVAFQMQKLGYSLDAI 337 Query: 281 MMVTRLSPDDLIAQ 294 V R S D + Sbjct: 338 AAVLRESVDGISQI 351 >UniRef50_B6FJ15 Putative uncharacterized protein n=5 Tax=Clostridium RepID=B6FJ15_9CLOT Length = 310 Score = 87.2 bits (214), Expect = 6e-16, Method: Composition-based stats. Identities = 44/310 (14%), Positives = 101/310 (32%), Gaps = 40/310 (12%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 D +F+ + + + DLT +E ++ +D Sbjct: 14 KINKKYKDRIFRMIFHEKKELLELYNAVNNSNYTNPDDLTITTIEDVVYMGMK-----ND 68 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKEL--------P 115 L + + G + + EHQS + R + Y + + +++ L P Sbjct: 69 LSFLI------GDVMNLYEHQSSFSPNLPLRGLFYFSSLYKEYIEPVKHRLYTASPLHIP 122 Query: 116 LVLPMLFYHG-CRSPYPYSLCWLDEFAEPAIA-RKIYSSAFPLVDITVVPDDEIMQHRKM 173 ++FY+G + P L D F E +++I + + E+M+ + Sbjct: 123 FPKYVVFYNGTKKEPERQELKLSDLFLENKEETTPSLECTAVVLNINLGKNRELMEKCRP 182 Query: 174 -----ALLELIQKHIRQR-DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRA 227 + +I+K++ ++ D V++ V + + +LQ ++ Sbjct: 183 LKEYAEFISIIRKYLSEQMDFGNAVNKAVDFCIHN--------GILADILQKNRSEVVDM 234 Query: 228 FIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVT--- 284 + E E + + + +G G + + + +E VM Sbjct: 235 ILTEYDEE-EFRRAWREDLLNEGFRKGLNNGLSKGIKGTIHACMKFNVPKEDVMQNLMEE 293 Query: 285 -RLSPDDLIA 293 LS ++ Sbjct: 294 FSLSQEEAEK 303 >UniRef50_UPI0001BC3A9D hypothetical protein BcroD2_08902 n=3 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3A9D Length = 324 Score = 86.8 bits (213), Expect = 7e-16, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 94/307 (30%), Gaps = 38/307 (12%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLP-------APLRKLCDLTTLKLEPNSFIDEDLRQYY 61 D + K + PD D I+ L + D+ T ++E + Sbjct: 18 QKDILLKDYFT-PDIFADAINAILYDGKSVVTPERMRTIDIETQRVED-ENGNVTADTRL 75 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---ELPLVL 118 D V+ + Y IEHQS + M R+M Y + + + + ++ Sbjct: 76 RDSAKVVEVDD-AIYCLFAIEHQSVEDYTMPLRIMEYDVREYLRQVKSNKGVQVRIKPII 134 Query: 119 PMLFYHGCRSPYPYSLCWLDEFA--------EPAIARKIYSSAFPLVDITVVPDDEIMQ- 169 ++ Y + D F + I L + V ++++ + Sbjct: 135 TIVMYWKADKW-NQPVSVKDMFDKNTVRWLEYNGLGGYIQDYRMHLFEPGTVKEEDLEKF 193 Query: 170 HRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFI 229 ++ + K+ + + L + N++ L + + ++ Sbjct: 194 KTELKDVIAYVKYSKSTEALK------------DYNEKYKPDLTKSTVTLINELTNSKYV 241 Query: 230 GEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALR---IAQEMLDRGLDRELVMMVTRL 286 + E + + R +G + E+ ++ + RG+ + + + Sbjct: 242 FIEGKERLDMCEAFEGLIEEGRAKGKAEELKEKYKSWVTLSNNLKKRGMSNPEIASLLGV 301 Query: 287 SPDDLIA 293 +L Sbjct: 302 PETELQK 308 >UniRef50_C4FYK3 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FYK3_ABIDE Length = 365 Score = 86.8 bits (213), Expect = 8e-16, Method: Composition-based stats. Identities = 51/303 (16%), Positives = 100/303 (33%), Gaps = 34/303 (11%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEP--NSFIDEDLRQYYSDLLW 66 D + K D DF++ + R++ + + L + + +D R Y D Sbjct: 3 EKDILEKKLFMFNDVFADFLNGII-FNGRQIVEESELFDLSGWSHYKADDSRHRYQDRDV 61 Query: 67 SVKTQEGVGYI-YVVIEHQSKPEELMAFRMMRYSIAAMQNHL------------------ 107 ++ I + IE+Q P++ M FR++ Y A+ + L Sbjct: 62 VKLWKKKNVVISLIGIENQDVPDKDMVFRVLSYDGASYKTQLAKKDEDKRKHLKDKKNTE 121 Query: 108 -----DAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV 162 K++ V+ + Y+G + + + L+D+ Sbjct: 122 IVEIGKEDEKDIFPVITFVVYYGEEEWKYETTLKKRLKIGDGLDEFVSDYKINLIDLKKF 181 Query: 163 PDDEI-MQHRKMALLELIQKHIRQRDLLGL---VDQIVSLLVTGNTNDRQLKALFNYVLQ 218 +D+I + LL D + + VS LV T + N + Sbjct: 182 TEDDINKFKKDFKLLVNYMVKGSNHDAGSIELNHPEEVSELVLRLTGEELPIPRENDGGK 241 Query: 219 TGDAQRFRAFIGEIAERAP--QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLD 276 T + F +AE+A + + + EG +G E + E L +G+ Sbjct: 242 TMEKF-FEPMFARMAEKAEARGMAKGMTEGMAKGMTEGMAKGLAEGKAKGMTEGLAKGMT 300 Query: 277 REL 279 + Sbjct: 301 EGM 303 >UniRef50_C6LJP2 Putative transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LJP2_9FIRM Length = 326 Score = 86.8 bits (213), Expect = 8e-16, Method: Composition-based stats. Identities = 35/240 (14%), Positives = 81/240 (33%), Gaps = 16/240 (6%) Query: 66 WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD-------AGYKELPLVL 118 ++ K G I V +++Q+ + M R+M ++L V+ Sbjct: 78 FNKKIVAPDGEIIVALQNQTTVDFGMPLRVMTEDALEYDVQRRMCKDEKLHKGEKLAPVI 137 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKI--YSSAFPLVDIT-VVPDDEIMQHRKMAL 175 ++FY+G + D P + + Y + ++ IT D + Sbjct: 138 TIVFYYGAQIW-SGPTDLADMVKIPEEFKWLKKYIRPYAMLLITPENVDAAWFSGGWREV 196 Query: 176 LELIQKHIRQRDLLGLVDQIVSLLVTGNTN-DRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 E++Q+ ++++ + + S+ + +R + AL ++ +R Sbjct: 197 FEILQRRNDEKEMQRYLQKKRSVYEKLPEDTNRLIFALTGHLDYYNALKRKGERAVMCKA 256 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV----MMVTRLSPDD 290 K + + +G QG + +E + G E + LS ++ Sbjct: 257 FEDHYKSGVEEGKNIGIHQGISQGLGRGIGAMIRENQEEGKTTESIIDKLQKYFSLSREE 316 >UniRef50_D0BNN6 ATP-dependent DNA helicase RecQ n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BNN6_9LACT Length = 302 Score = 86.8 bits (213), Expect = 9e-16, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 104/318 (32%), Gaps = 43/318 (13%) Query: 1 MTISTTSTPHDAVFKSFL---RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 M I T+ D +FK + DFI+ L+ + ++E E+L Sbjct: 1 MKIKPTN---DLLFKKMMTTAGKEYILEDFIEAVTGMKLKNVRPANPYQIETYQKTIENL 57 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 + V G + ++IE Q + R+ Y A + A + + Sbjct: 58 NPVMYSTIVDVAATTEDG-MEIMIEMQLYQHKDFFERIFNYMATAYTQNYKAETAK--PI 114 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + ++ + P EF E I + + A+ + + + + ++ L+ Sbjct: 115 ISIVVTNFTVFP---------EFQEARIEIGLTNFAYY----QEIRNRKQQPYWRIYLVN 161 Query: 178 LIQKHI---RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAE 234 L K I RD D + + + + R LK V + A R + + Sbjct: 162 LTDKAIVNGESRDFSEWRDFLKNGTIK-PKSSRGLKEAQKIVNFSNLAGEERRLAELMEK 220 Query: 235 RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG-----------------LDR 277 + + + E+G G+ + + +++G L Sbjct: 221 YEDVYYQVMKHQLEEGLEQGIEIGRQQGVALGEKRGMEKGVALGERKGQVMICFKMNLPI 280 Query: 278 ELVMMVTRLSPDDLIAQS 295 E + T LS +++ A Sbjct: 281 EEIQKHTGLSIEEIEAFR 298 >UniRef50_B3QUJ9 Putative uncharacterized protein n=8 Tax=Bacteria RepID=B3QUJ9_CHLT3 Length = 286 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 37/290 (12%), Positives = 85/290 (29%), Gaps = 23/290 (7%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + + L L + + L N + + + ++ V Sbjct: 14 DFGFKKLFGTEPN-KILLMDFLNQILPEKHQIQELSYSKNEHVGQQELDRKA--IFDVYC 70 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--KELPLVLP--MLFYHGC 126 G ++ +E Q + R + Y+ +Q G +L V +L + Sbjct: 71 VGQSGERFI-VEVQKAKQNYFKDRSIYYASFPIQEQAKRGNWDDKLEPVYTVGILDFIFD 129 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ- 185 L + + A F +++ E + + +H+ Q Sbjct: 130 DHKLDAELIHVVALKKSAQRSFSDKLKFIYIELPKFKKTEAELETQFDKWLYVFRHLSQL 189 Query: 186 -RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 + ++I L + ++ E + K + Sbjct: 190 QKRPTKFQEKIFEKLFEAAEIAKF-------------SKNELVAYEESLKYYRDMKNVVD 236 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 T + EG G + IA+ + +G+ + + +T L+ ++ Sbjct: 237 TSKEEGWLEGQKAGCEQRNYEIARVLKAKGMPIQEISEITGLTAQEIEHL 286 >UniRef50_UPI0001C369BC hypothetical protein ChatD1_02491 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369BC Length = 310 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 45/292 (15%), Positives = 78/292 (26%), Gaps = 47/292 (16%) Query: 11 DAVFKSFLRHPDTARDFIDI-------HLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 D K L+ P D + L A L + N + Q D Sbjct: 5 DFYIKKLLQDPARFADLYNAEIFHGKQILKAELLSPVSTESGIAITNRSGRKQTIQRRRD 64 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----------- 112 + G +I E Q + M R + Y L K Sbjct: 65 IAMKASI--GACFIVAGCEAQGEIHYGMPIRSLTYDALDYTEQLTEIQKEHRKKKDLAKS 122 Query: 113 -----------ELPLVLPMLFYHGCRSPYPYSLCWLDEFA-------EPAIARKIYSSAF 154 +L VL ++ Y G P+ D P + + Sbjct: 123 PEFLSGITRRDKLQPVLTLVLYCGKD-PWDGPKSLYDMLDLRGPTECIPDLLAALPDYRI 181 Query: 155 PLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFN 214 LVDI + + + + + +++ + + T N + + Sbjct: 182 NLVDIRKIENLSLYKTGLQQVFGMLKYSTDKSKFYNYI--------TSNHDQISMLDDNA 233 Query: 215 YVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRI 266 G R + +A +E + D L +G ++GK E R Sbjct: 234 LTAVMGLLGENRRLMKYLAAPGREEGYTMCQAIDDLIADGKLEGKREGKRRG 285 >UniRef50_Q8YK35 All8083 protein n=6 Tax=Cyanobacteria RepID=Q8YK35_ANASP Length = 313 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 45/310 (14%), Positives = 102/310 (32%), Gaps = 38/310 (12%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTT----LKLEPNSFIDE-DL 57 +S +D +K F+ P ++ D L+ E I E ++ Sbjct: 1 MSEVRADYDGAWKE--GVEQYFEAFLAFFFP-EIQAEIDWERGYEFLEQELQQLIKESEV 57 Query: 58 RQYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 + + D L V ++G ++ + +E QS+ + RM Y + Sbjct: 58 GKQFVDKLIKVWLKDGKETWLLIHLEIQSQVDPNFTKRMFSYHYRIFDRYNQE------- 110 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDD-EIMQHRKMAL 175 V+ + ++ + S FP+V + ++ Sbjct: 111 VVSLAILGDNQANWRPQE------YSYGRWGCRLSLQFPIVKLLDYESRWSELEQSDSPF 164 Query: 176 LELIQKHIR----QRDLLGLVDQIVSLLVTGNT---NDRQLKALFNYVLQTGD--AQRFR 226 L+ H+R +DL G + +SL+ + +++ +F + + + Sbjct: 165 AVLVMAHLRTQATTQDLTGRLQWKLSLIKRMYEVGYSRDKIQQIFRLLDRLMTLPPELDL 224 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 F E+ +++ MT +R+ G Q E + + + EL+ + ++ Sbjct: 225 NFQAELERFEAEQEMTYMTSIERI---GRAQTLQESITEVLETRF-NNVPPELIEQLKKI 280 Query: 287 SPDDLIAQSH 296 +L Sbjct: 281 Y--ELDRLKQ 288 >UniRef50_Q8ZS56 Alr7656 protein n=6 Tax=Nostocaceae RepID=Q8ZS56_ANASP Length = 319 Score = 86.0 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 46/307 (14%), Positives = 102/307 (33%), Gaps = 36/307 (11%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTT----LKLEPNSFIDE-DL 57 +S +D +K F+ P ++ D L E I E ++ Sbjct: 1 MSEVRADYDGAWKE--GVEQYFEAFLAFFFP-EIQAEIDWERGYDFLDQELQQLIRESEI 57 Query: 58 RQYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 + + D L V ++G ++ + +E QS+ + RM Y + Sbjct: 58 GKQFVDKLIKVWLKDGKETWLLIHLEIQSQVDTNFPKRMFSYHYRIFDRYNQE------- 110 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDD-EIMQHRKMAL 175 V+ + ++ + S FP+V + ++ Sbjct: 111 VVSLAILGDNQANWRPQE------YSYGRWGCHLSLQFPIVKLLDYESRWSELEQSDSPF 164 Query: 176 LELIQKHIR----QRDLLGLVDQIVSLLVTGNT---NDRQLKALFNYVLQTGD--AQRFR 226 L+ H+R +DL G + +SL+ + +++ +F + + + Sbjct: 165 AVLVMAHLRTQATTQDLAGRLQWKLSLIKRMYELGYSRDKIQQIFRLLDRLMTLPPELDL 224 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL 286 F E+ +++ MT +R+ G + + +I + ELV + +L Sbjct: 225 NFKAELERFEAEQEMTYMTSIERI---GIAEATQKYIAQILTIRFKD-IPTELVEKLNKL 280 Query: 287 SPDDLIA 293 +L+ Sbjct: 281 YDIELLN 287 >UniRef50_B0C251 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C251_ACAM1 Length = 313 Score = 85.7 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 40/301 (13%), Positives = 109/301 (36%), Gaps = 31/301 (10%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNS-----------FIDEDLRQ 59 D +F+ L++ +F+D+ P + D +++ + ++ Sbjct: 5 DRLFRDLLKN--FFLEFVDLFFPK-IAVAIDPKSIRFLEDEESLKPQEQGEHSPASTKQE 61 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 S++L V+ + +V +E+ S+ + R+ + + + P Sbjct: 62 ASSNVLVQVRLRGQESCFWVHLENSSETNIKLERRIFHTFARLDEKYN-------LPIYP 114 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 ++ +S + + + R++ +F + + + + +Q R L+ Sbjct: 115 IIL----QSSDKSQRLETNGYRVEFVDRRVLDFSFVAIQLHRLNWRDFLQRRNPVAAALM 170 Query: 180 Q-KHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV--LQTGDAQRFRAFIGEIAERA 236 +++ D + + + LL + +++K + ++ +A + E+ Sbjct: 171 PTMNVQTFDRPVVKAECLRLLTNLRLDAKKVKVISQFIEAFLHLNAAEEQVLQTEMERMG 230 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR---GLDRELVMMVTRLSPDDLIA 293 E+E++ + + QG EAL + +L R L+ +L V L + + Sbjct: 231 LLERERITNLLTSTTQANQQQGAEREALSLVFRLLKRRIGDLNPDLEAQVRSLPVNQVED 290 Query: 294 Q 294 Sbjct: 291 L 291 >UniRef50_C9RQ02 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RQ02_FIBSS Length = 360 Score = 85.3 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 54/298 (18%), Positives = 111/298 (37%), Gaps = 21/298 (7%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFID-----IHLPAPLRKLCDLTTLKLEPNSFIDEDL 57 T HDA F+ AR ++ H +L TL P+S+ E Sbjct: 5 NKVTKRKHDAYFRWLFADTTHARCLLELAGKINHEIDAFLTQINLDTLMRIPDSY-SEVD 63 Query: 58 RQYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 +DL + V G + +++EH+S + ++ ++ +Y + M+ Sbjct: 64 DTGEADLAFRVNVSTGAPILVGILLEHKSGRDPIIFDQISKYIHSVMKIQDKNRIFSGIP 123 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIM--QHRKMA 174 + ++FY+G + P L L++ K+ V++ +PD + + ++ Sbjct: 124 TMAIIFYNGRDNWNP--LKILEKSYPDYFRGKVLPFQCTFVNMADIPDSDCLACENTATG 181 Query: 175 LLELIQKHIRQRD-LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 + + KH +D LL L+ Q L N+ + + +A Sbjct: 182 MGIIALKHAFNKDKLLELLPQFCKFLDKMPRNEASCLLEKTSIYLMEYLGKDFLKELNMA 241 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGKH---------EEALRIAQEMLDRGLDRELVMM 282 + +K ++I D R++ A + + EE +I +E L +R+ + Sbjct: 242 FVSIGQKYGFVSIGDYFRQQLAEERQQMTEERLQMAEERQQITEERLQMAEERQQITE 299 >UniRef50_Q2FTW8 Putative uncharacterized protein n=2 Tax=Methanospirillum hungatei JF-1 RepID=Q2FTW8_METHJ Length = 306 Score = 84.9 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 40/301 (13%), Positives = 94/301 (31%), Gaps = 35/301 (11%) Query: 7 STPHDAVFKSFLRHP---DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 S +D F+ P D D ++ LP + + + +P+ I + ++ D Sbjct: 21 SPRNDFAFRLLFGDPNNSDILLDLLNAILPDHFQSV-----VCTDPHLLIPDTKKECILD 75 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVLP 119 + V + G +YV IE Q + M R + Y + L+ G+ + +V+ Sbjct: 76 I--KVLSDSG---VYVDIEMQVLDLKSMEKRSLFYWAKMYLDQLNRGHSYHELKRTIVIN 130 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQH-RKMALLEL 178 +L Y ++ + + + +++ V + + Sbjct: 131 ILDYMLMPVEDLHTCFQAYDKTHDILMSDV--FEIHFLELPKVHRCRVPYKGTDLLSWLT 188 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 + + ++ ++ +N + + R Sbjct: 189 FLNAYTEEE-----------IIMAAEGKPAIQKAYNNLQIMSLDEETRRLYEAREMFLHD 237 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALR----IAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + ++ + EEG +G+ E + +L G+D E + T L + Sbjct: 238 QATRMYEAKEEGLEEGMKKGREEGREEEREGFVKNLLSLGMDDEFIKKATGLDQSIIDKL 297 Query: 295 S 295 Sbjct: 298 K 298 >UniRef50_Q00255 ORF295 n=1 Tax=Leptolyngbya boryana RepID=Q00255_PLEBO Length = 295 Score = 84.9 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 104/300 (34%), Gaps = 33/300 (11%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFI----DEDLRQ 59 S+ +T +D +K+F+ R+F+ P + ++ D ++ + Sbjct: 5 SSENTDYDNPWKTFI--ELYFREFLAFFFPTIEADVDWSKPVRFLDKELQKIVRDAEIPK 62 Query: 60 YYSDLLWSV-KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL 118 Y+D L V + + + IE QS+ E RM Y+ + V+ Sbjct: 63 RYADKLVEVHRLRGERTLVICHIEVQSQEERDFVARMYSYNYRLRDRYN-------CPVV 115 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV-PDDEIMQHRKMALLE 177 + G P + DE A FP+V ++ ++ + Sbjct: 116 SLAIL-GDDRPNWRPSRFYDELWGCATH-----FEFPIVKLSDYQSQWTELEAIQNPFAV 169 Query: 178 LIQKHIRQRD-------LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 + H++ ++ + ++L +++ + L N++ + Sbjct: 170 VAMAHLKTKETHNQPLERKRWRYHLTTMLYDRGYSEQDILELHNFLDWLMNLPEELERQL 229 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 + +E ++ + + K EE IA ML R LD EL+ VT L+ + Sbjct: 230 QAELETFEEARRM-----KYVSSLERRAKLEEKQAIALNMLRRNLDMELIAEVTGLTIAE 284 >UniRef50_C0QWI7 Putative uncharacterized protein n=4 Tax=Brachyspira RepID=C0QWI7_BRAHW Length = 289 Score = 84.5 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 36/298 (12%), Positives = 96/298 (32%), Gaps = 20/298 (6%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLP-APLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 + +D + R +++ +K+ + E + S Sbjct: 5 KNINVLNDYFMRYMFAKEGHERILLNLINAVRTDYNQEPFEEVKVLNTFNLKETINDKQS 64 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVL 118 + T+ G V++E Q + +R + Y ++L K + +V+ Sbjct: 65 IVDVRAVTKSGET---VLVEIQRIGNQSFVYRSLYYWAKCYVSNLRNNEKYNDLKQVIVI 121 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 +L ++ + +++ ++ Sbjct: 122 NILDFNLLK-DIDKEHSCYVIKELETNHILTNHFEMHFLELQKYLSSNSNLKEELDAWFY 180 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 ++ + +++I+ +LV N +++ + + AE Sbjct: 181 FLTI---KEKIEKMEEIMDILVKKNPIMKEVY------DEYNKFADTKDLFENYAEYEKN 231 Query: 239 EKEKLMTIADR--LREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + L +R REEG +G E + +A+ M ++ +D +L+ +T L+ +++ Sbjct: 232 YFDILALSEERIRGREEGIKEGIKETQISMARNMKNKNMDIKLIGELTGLTTEEIEKL 289 >UniRef50_C6Y2C7 Putative uncharacterized protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y2C7_PEDHD Length = 283 Score = 84.5 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 42/289 (14%), Positives = 92/289 (31%), Gaps = 24/289 (8%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D FK + +D + L A + ++ P ED+++ +L+ + Sbjct: 14 DYGFKRLFGNEPD-KDIMIEFLNALFEGEKIVIDIRYSPTEHAGEDVKEKK--VLFDLTC 70 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK----ELPLVLPMLFYHGC 126 G ++ IE Q +E R + Y + L G L V + Sbjct: 71 TGADGETFI-IEMQRADQEFFRDRCVFYMSRLISAQLPRGTSNWDVPLKEVYLIGIMEFQ 129 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYS-SAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQ 185 + + + Y + +++ E ++ + K++ Sbjct: 130 FNNINSNYLHNIALMNRDTGKVFYKGMGYKFLELPNFDKKESDLVTELDKWFYLLKNLSH 189 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 +D+I L R + +F + + R + + Sbjct: 190 ------LDKIPDFLDK-----RVFQKIFKIAEMSKMTKEERELYDSDVKAKSDWNAGIRY 238 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + +EEG ++ E L IA+ + + + E++ T LS D++ Sbjct: 239 AEKKAKEEGKLE----EKLEIARNLKSKAIAFEIIAETTGLSIDEIEKL 283 >UniRef50_C0GV86 Transposase, ISNCY family n=7 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV86_9DELT Length = 125 Score = 84.5 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 3/104 (2%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M PH+ +F ++ D AR F+ H+ ++K DL TL+LEP +++DE L+++ Sbjct: 1 MATKRNQAPHEGLFLKIFQNLDNARHFLKNHMSEEIQKRFDLDTLRLEPTTYVDEKLKKH 60 Query: 61 YSDLLWSVKT---QEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 YSDL++SV+ + IY++ EH+S P+ L ++++Y Sbjct: 61 YSDLVFSVRLIGYKNQFAKIYLLFEHKSSPDPLTGVQVLKYMAL 104 >UniRef50_C1P7A8 Putative uncharacterized protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1P7A8_BACCO Length = 345 Score = 84.5 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 49/338 (14%), Positives = 99/338 (29%), Gaps = 60/338 (17%) Query: 6 TSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSF----IDEDLRQYY 61 T +D ++K + + +FI P + + I + Sbjct: 15 PGTDYDGLWKKIIS--ELFEEFILFFAPDLYETIDFGKGIVFLEQELHKVIIKHKKGKRI 72 Query: 62 SDLLWSVKTQEGVG-YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 +D + V + G Y+++ IE Q K + + RM Y + + + Sbjct: 73 ADKIVKVSLKNGEEKYVFIHIEIQEKQDPDFSKRMFTYFYRLFDRFQEN-------IYSI 125 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQ 180 S + F + + + F DI + + +A+L I Sbjct: 126 AILTDLSKSNN-SEPFQYSFYGTELTYRFNTYKFNEADIPSLK--KSTNPFAIAVLAGIY 182 Query: 181 KHIRQRDLLGLVDQIVSLLVTGNTND-----------------------RQLKALFNYVL 217 H+ +++ + LL ++ K L + Sbjct: 183 LHLTEKNYQKRYEVKKKLLKEFILSNQNLSSNYAEALCYFIDYLLYLPGELTKQLTKELF 242 Query: 218 QTGDAQRFRAFIGEIAERAP--------------------QEKEKLMTIADRLREEGAMQ 257 + + E + AP ++ + + E G + Sbjct: 243 IHIEKEANHMLYSEELKEAPTFAEYLKTVKEEGIEIGIEKGIEKGIEKGKEEGIEIGIEK 302 Query: 258 GKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 GK EE +A E+L G E V + +LS D++ Sbjct: 303 GKMEEKRNLAAELLREGFSVEKVAKMVKLSIDEVKKIK 340 >UniRef50_A7AK04 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7AK04_9PORP Length = 299 Score = 84.1 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 38/304 (12%), Positives = 88/304 (28%), Gaps = 38/304 (12%) Query: 11 DAVFKSFL---RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D FK F + + F++ L ++ + + + ++ Sbjct: 12 DYAFKRFFGTVSNKELTIGFLNSLLNKDIKDII------FHNVEMQGNNTDSRKA--VFD 63 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL----------- 116 + + G +++ +E Q K ++ + R++ Y+ +Q D ++ L Sbjct: 64 LFCEGSDGELFI-VEIQKKRQKYFSDRVLYYASFVIQMQADIESEKFRLAKEEERRRWNY 122 Query: 117 ----VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV--DITVVPDDEIMQH 170 V + F D + +S + ++ + Sbjct: 123 HINKVYVVCFLDFRLDTRYTDKYRWDVVRMDRELKIPFSETLNEIYLELPKFNLNFEECD 182 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 +I D + L ND+ L+ L + + + + R Sbjct: 183 TFYKKFLYTMNNI---------DIMGQLSKETIQNDKLLRKLKSAIELQRMSAKERLAYE 233 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 + T + E+G +G E +I M G+D + L + Sbjct: 234 LSIAAERDLAACMATSFEEGEEKGIAKGITEGMRKIILNMKQAGMDLATIAKTAGLPEKE 293 Query: 291 LIAQ 294 + A Sbjct: 294 VEAL 297 >UniRef50_B0A7T9 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A7T9_9CLOT Length = 271 Score = 83.7 bits (205), Expect = 8e-15, Method: Composition-based stats. Identities = 43/289 (14%), Positives = 91/289 (31%), Gaps = 31/289 (10%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D VFK+ + + L A L+ +T+++++ + +S L KT Sbjct: 10 DFVFKNIFGSEKNPK-ILISFLNATLKPKDLITSVEIKNTDINKNYIEDKFSRLDVKAKT 68 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPMLFYHGCRS 128 + IE Q K E M R + Y L G L + + + Sbjct: 69 SNDE---IINIEIQLKNEYNMIKRSLYYWSKLYSEQLGEGQDYSVLKRTICINILNFKYL 125 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 + + ++I + D + + +E + Sbjct: 126 KTRKFHSGYRLKEIYSNEELTNVAEIHFIEIPKLDDGADEKDMLVNWIEFL--------- 176 Query: 189 LGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIAD 248 + +++L + + A+ + + + + T+ D Sbjct: 177 -------------KDPESETVRSLEMNIEEIRQAKDELIRMSNDDTQREIYEMRAKTLRD 223 Query: 249 RL--REEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 ++ E +G + IA+ +LD LD E + + T LS D++ Sbjct: 224 KISALNEAERKGIQQGKREIAKALLDV-LDIETIALKTGLSIDEINKLK 271 >UniRef50_B0G418 Putative uncharacterized protein n=5 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G418_9FIRM Length = 312 Score = 83.3 bits (204), Expect = 9e-15, Method: Composition-based stats. Identities = 42/284 (14%), Positives = 98/284 (34%), Gaps = 26/284 (9%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 D VF+ L+ P A + + +L LE ++ +D+ Sbjct: 36 PNREYKDRVFRMLLKEPKVALEVYNAMNGTLYDNPDELIITTLENAVYLGMK-----NDV 90 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHL------DAGYKELPLVL 118 + + TQ V+ EHQS P M R + Y ++ ++P Sbjct: 91 SFILGTQ------LVLYEHQSTPNPNMPLRNLAYVACVYMAYVFGDNLYGRKLIKIPEPR 144 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-----M 173 ++FY+G S+ L + E V+I ++E+++ + Sbjct: 145 FVVFYNGTDKMPEQSVLRLSDAYESKSEELDLELKIRFVNINPGYNEEMVEKSPTLYQYV 204 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTG--DAQRFRAFIGE 231 +++++K+ ++ V++ + + L+ VL+ + E Sbjct: 205 KFVDIVRKYQKEMPFPEAVEKAIDECIKKGILAEFLRKNRAEVLRVSIFEYDEEEHMRQE 264 Query: 232 IAERAPQEKEKLMTIADRLREEGAMQGKHEEAL--RIAQEMLDR 273 E + E++ + D+L + + + +++L+ Sbjct: 265 REESRQEGIEQVNDLYDKLHDLNREEDIWKAIKDVEYRKKLLEE 308 >UniRef50_C0R2N1 Putative uncharacterized protein n=4 Tax=Wolbachia RepID=C0R2N1_WOLWR Length = 277 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 29/237 (12%), Positives = 70/237 (29%), Gaps = 24/237 (10%) Query: 83 HQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLPMLFYHGCRSPYPYSLCWLDEF 140 +Q + R Y+ A D G Y L ++ + P Sbjct: 41 NQVAKTKGFEKRAQYYAAKAYSRQADKGDQYHNLKEIIFIAIADCVLFPNKSEYKSKHTI 100 Query: 141 AEPAIARKIY-SSAFPLVDITVVP-DDEIMQHRKMALLELIQKHIRQRDLLGL------- 191 + F +++ P + E + ++ + L Sbjct: 101 RDEDTNEHDLKDFYFIFIELPKFPKNKEDQLENIVEKWVYFFRYADETSEEELEKIIGSD 160 Query: 192 --VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRF-----------RAFIGEIAERAPQ 238 + + L N ++++ A + + D Q G+ + Sbjct: 161 VIIKKAYEELNRFNWSEKEFIAYEQEIKRILDEQAVLAQKLDDATEKGREEGKEEGKEEG 220 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + + GA +G+ + + +A+ +L G+ +++ T L+ D++ S Sbjct: 221 IQIGHEKGRAEGIQIGAEKGEKQAKITVAKNLLKAGVSIDIIAQTTGLTVDEVKDLS 277 >UniRef50_C5UZR7 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UZR7_CLOBO Length = 334 Score = 83.0 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 47/333 (14%), Positives = 95/333 (28%), Gaps = 42/333 (12%) Query: 1 MTISTTSTPHDAVFKSFLRH-PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 MT+S D + K + ++ D L + N FI + Sbjct: 1 MTVSNEKVKLDEILKFLFSTSKKVLVNLLNGIFEENFSS--DEVELSVSNNEFIMDTFDT 58 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLP 119 D+ + V E + +E Q+K + M RM Y + + P Sbjct: 59 LRGDVFFEVLNNEVSNKVTYHLEFQTKNDSTMIIRMFEYGFRKGKEQTGNRDDFKTIYFP 118 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI 179 + P IYS P++ D+E+++++ LL L Sbjct: 119 KQKVIFIERNNNIKEDIKLKIVLPDEQSFIYSV--PVMKYWEYTDNELIENKMYPLLPLQ 176 Query: 180 QKHIRQ----------------------RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVL 217 ++R+ L + ++ L + + Sbjct: 177 LFNLRKDLEYARRSNNIDKINDLSHEAKEIALKIANESKKLFDDNEIIGEDFHKMLLAIQ 236 Query: 218 QTGDAQRFRAFIGEIAERA---------------PQEKEKLMTIADRLREEGAMQGKHEE 262 + F + E ++ + ++ E+G +G ++ Sbjct: 237 NLIEYLNRNYFNDDRLEEEVSTMTKTLYDPEVEKRGIEKGIEKGIEKGIEKGMEKGIEKK 296 Query: 263 ALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 A+ A L G+ E+V T L + + Sbjct: 297 AIEDAIGFLRLGVSEEIVSKGTGLPIEKVRELK 329 >UniRef50_A7BTR0 Putative uncharacterized protein n=3 Tax=Beggiatoa RepID=A7BTR0_9GAMM Length = 309 Score = 83.0 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 107/317 (33%), Gaps = 33/317 (10%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M T D K+ LR D ++ L A L++ D++ L++ + D + Sbjct: 1 MPTETKLVRFDWALKNILRDKANF-DVLEGFLTALLQE--DISVLEILESESNQSDFAKK 57 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE----LPL 116 ++ + VK I IE Q+ E R++ + + L+ G + Sbjct: 58 FNRVDILVKDSHQRKMI---IEVQNHRETGYLERILWGTSKLIVETLELGEDYRNISKVI 114 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSA----------------FPLVDIT 160 + ++++ S + + + + FP + Sbjct: 115 SISIVYFDLGLSDDNEYVYYGVANLHGLQHNQPFRFRRLMADKTFKSLQTKDIFPEFYLL 174 Query: 161 VVPDDEIMQHRKMALLELIQKH--IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQ 218 V + + + + KH IR +++ L N ++ K Y++ Sbjct: 175 RVEHFQDIIKTDLDEWIYMLKHSTIRTDFKSKNINKAQEKLTLLQMNPQKRKDYEKYMVD 234 Query: 219 TGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRE 278 + A + ++ R+EG +G ++ + I + L +GL+ Sbjct: 235 MTVERDVLE-----AAQEEGIQKGRQEGIQEGRQEGIQKGMEKKTVVIVKNALQQGLELT 289 Query: 279 LVMMVTRLSPDDLIAQS 295 L+ +T LS +++ Sbjct: 290 LISSLTGLSIEEIQKIQ 306 >UniRef50_UPI00019735B3 hypothetical protein ClM62_08045 n=1 Tax=Clostridium sp. M62/1 RepID=UPI00019735B3 Length = 255 Score = 82.6 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 27/218 (12%), Positives = 70/218 (32%), Gaps = 28/218 (12%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 D +F+ + + R L + LE +++ +DL + Sbjct: 23 RDYKDTLFRMLFNDREALLSLYNAVGNTDYRDPSLLQIVTLENAVYMN-----VKNDLAF 77 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD------AGYKELPLVLPM 120 + G+ + EHQS M R + Y+ + + + +LP+ + Sbjct: 78 LL------GFELNLYEHQSTWNPNMPLRDLFYAAREYEMLIRDQSLYSSRLIKLPVPRFI 131 Query: 121 LFYHGCRSPYPYSLCWL---------DEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHR 171 +FY+G + L + E + + + ++++ +E + Sbjct: 132 VFYNGREKQEERCVLKLSDAFETPVEECIHEGILRDFLLKYRAEVTNVSIFEYNEEREKE 191 Query: 172 KM--ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDR 207 + A E +K ++ + + ++ + Sbjct: 192 LLRKAEYEFGKKEGMEQGMEQGICALIQTCRELGASRE 229 >UniRef50_C4ZGR2 Putative uncharacterized protein n=2 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZGR2_EUBR3 Length = 370 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 44/313 (14%), Positives = 88/313 (28%), Gaps = 43/313 (13%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYS 62 +S VF ++ + A + + D+ + + Sbjct: 68 MSGNREYKSDVFSMLMQDKERALQLYNAMNGSSYDNPEDVEMVI-----HDGGISLSVRN 122 Query: 63 DLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---------- 112 D + V + IY EHQS M R + Y + + L K Sbjct: 123 DASFIV---DARLSIY---EHQSTVCPNMPVRSLIYFSVILSDMLSDKKKGTKSGKNIYG 176 Query: 113 ----ELPLVLPMLFYHGCRS-PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI 167 ++P ++FY+G P L D F +P + + +I + I Sbjct: 177 RRLVKIPTPHFVVFYNGEEEQPEVQELKLSDAFEKPTDEPNL-ELKCKVYNINDGKNKAI 235 Query: 168 MQHR-KMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFR 226 M+ + +R+ G D + + +Y + + F Sbjct: 236 MESCGWLNDYMTFVNKVREYHADGAFDDLAIDIEKA----------IDYCIDNDILKEFL 285 Query: 227 AFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEML-----DRGLDRELVM 281 ++ Q + + R + +G + A +ML LD + Sbjct: 286 KTYRSEVTKSMQLNYEFDRQLELERADAIEEGMEIGIEKGANKMLFTLVTKGKLDIDTAA 345 Query: 282 MVTRLSPDDLIAQ 294 +S + Sbjct: 346 EEAGVSVSEFEKL 358 >UniRef50_A7C3K1 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C3K1_9GAMM Length = 272 Score = 82.2 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 36/291 (12%), Positives = 89/291 (30%), Gaps = 31/291 (10%) Query: 13 VFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQE 72 K P F+ L ++ K+E + D + + Q+ Sbjct: 4 FLKKVFSKPHIFTAFVKDMLG------IEIEIDKVETEKSFSPIIGN--VDSRFDLFAQD 55 Query: 73 GVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL--VLPMLFYHGCRSPY 130 + V I+H+ + R + Y A+ + + P V ++ Sbjct: 56 TKNRLIVDIQHKRYKDHY--DRFLHYHCVALLEQITSSANYKPDMQVYTIVVLTSGDKHK 113 Query: 131 P-------YSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 + +Y + D T P E ++ +L + +++ Sbjct: 114 TDLLITDFSPKKLDGSSIAETQHKIVYVCPKYVTDETPKPYQEWLKAINDSLDKQVEESH 173 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 + ++ +I SL+ + + Y + + + + + Sbjct: 174 YHNE---VIQEIFSLIKKDKISPEE------YARMKDEYSDEEYLQEQTQKARKE---GM 221 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 ++ +G +G + L +A+ M + + E ++ VT LS + + Sbjct: 222 EKGMEKGIGKGIEKGIEKGVLMMAKNMKEAKVAIETIIEVTGLSIEQIEDL 272 >UniRef50_C6XVT6 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XVT6_PEDHD Length = 317 Score = 82.2 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 42/313 (13%), Positives = 101/313 (32%), Gaps = 36/313 (11%) Query: 4 STTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFI------DEDL 57 D K DF+ L ++ N + Sbjct: 19 ERPRRKDDEFLKGAFED--NFPDFLRFVFSDADEILDFNREIEFLNNELFTIIPDRERKG 76 Query: 58 RQYYSDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL 116 +DLL + ++G ++ + +E + + R+ Y+ + + Sbjct: 77 GGRRADLLAKLYLKDGTEKWVLLNVEIEGGNDRKFGQRVFEYNYRIRDKYKVS------- 129 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 V + + G ++ + + + A+ + D D+ + +L+ Sbjct: 130 VASIAVFTGKKTQLRPTEYLDELLGTVLSFKYT---AYHVFD--HQEDELLKSDNPFSLI 184 Query: 177 EL------IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYV--LQTGDAQRFRAF 228 L ++ I +L IV L+ + +++ + ++ +++ Sbjct: 185 ALACQKALLEGKIPDEELADERLVIVKALLRHGYDRQRIISFILFLKNFIFIESEEINRK 244 Query: 229 IGEIAERAPQEKEKL-------MTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVM 281 + E ++K + + EG ++G+ EEAL IA+E+ GL E + Sbjct: 245 FDQQIEELTKDKNPMGVIDVFKKWERQEAKIEGKLEGRREEALEIARELKKEGLTIEFIA 304 Query: 282 MVTRLSPDDLIAQ 294 T+L ++ Sbjct: 305 KTTKLPIAEIEKL 317 >UniRef50_C4Z1Q2 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1Q2_EUBE2 Length = 321 Score = 81.0 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 45/325 (13%), Positives = 91/325 (28%), Gaps = 48/325 (14%) Query: 2 TISTTSTPHDAVFKSFLRHPDTARDFIDI-------HLPAPLRKLCDLT-TLKLEPNSFI 53 + T+ D K+F R + D + L D + + S+ Sbjct: 3 NSNRTTHQKDVSLKTFWRDNEHFADLFNATVFNGKQVLKPDKLTEMDTDVSATIHSKSYN 62 Query: 54 DEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD----- 108 + R D++ K +GV + + +E Q K M R M Y + Sbjct: 63 ESITRNR--DVV--KKMSDGVEFNILGLEIQDKTHYAMPLRTMTYDALGYIKEYNDIKKH 118 Query: 109 ------------------AGYKELPLVLPMLFYHGCRSPY-PYSLCWLDEFAEPAIARKI 149 ++ ++ Y+G P L + I Sbjct: 119 HKLNKDSFSSHEEFLSGINKSDRFHPIITLVLYYGESLWDGPTCLSDMMISMPDNIKAYF 178 Query: 150 YSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQL 209 LV I D + + + I ++I D + + + Sbjct: 179 SDYKLNLVQILD-SDKYTFYNEDVRDVFNIIRNIYNDDFDSIYRE--------YESRNVD 229 Query: 210 KALFNYVLQTGDAQRFRAFIGEIAERAP-QEKEKLMTIADRLREEGAMQGKHEEALRIAQ 268 + + + + + E + +G +G E + Sbjct: 230 IDVMELICNITSVPKLMDLCTDTEQGGTVNMCEAMKRFQAECESKGMKEGIDSEKVNSII 289 Query: 269 EMLDRGLDRELVMMVTRLSPDDLIA 293 ML+ G+ +E + +TR + +DL Sbjct: 290 SMLEFGITKEQI--LTRYTKEDLER 312 >UniRef50_C0D7Q8 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D7Q8_9CLOT Length = 351 Score = 81.0 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 46/302 (15%), Positives = 90/302 (29%), Gaps = 44/302 (14%) Query: 11 DAVFKSFLRHPDTARDFIDI--HLPAPLRKLCDLTTLKLEPNSFI-----DEDLRQYYSD 63 D +R+ D + + K DL+ + E I L + D Sbjct: 17 DYSVNKLMRNSVRFADLYNGTVFRGKQVLKPEDLSDVPDENGIAIVGLDGKRRLIRRSRD 76 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-------------- 109 ++ G ++ + E+Q K M R M Y ++A Sbjct: 77 VIKKASF--GAYFVLLAEENQDKVHYAMPVRSMLYDALEYTEQVEALKRRHRECGDRLEG 134 Query: 110 --------GYKELPLVLPMLFYHGCRSPYPYSLCWLDEFA-------EPAIARKIYSSAF 154 + V+ + YHG + P+ D A+ + Sbjct: 135 DAFLSGITRDDRIMPVVTLTVYHGAK-PWDGPRSLYDMLEMDRDSKEWEALKEVLPDYRL 193 Query: 155 PLVDITVVPDDEIMQHRKMALLELIQKHIRQ--RDLLGLVDQIVSLLVTGNTNDRQLKAL 212 LV++ + E + + + K+ R+ R ++ L + D ++A+ Sbjct: 194 NLVELNNMQHLE-RFRSSLQPIFTVLKYNRKDKRKFYEYLENHREELRKMD--DDSVRAM 250 Query: 213 FNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLD 272 + + R G + + + REEG +GK E + L+ Sbjct: 251 LALLGEQKRLLRMLELPGGEGKERMDVYNAIDELIADGREEGKAEGKAEGRVEGKAIGLE 310 Query: 273 RG 274 G Sbjct: 311 LG 312 >UniRef50_Q8GBS6 Putative uncharacterized protein n=12 Tax=Treponema RepID=Q8GBS6_TREMA Length = 262 Score = 80.7 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 36/288 (12%), Positives = 90/288 (31%), Gaps = 42/288 (14%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 D +F +++ + + F+++ L + +T + + + + + D+L Sbjct: 13 DFMFCQVMKNKNLCKTFLEMLLADKIGN---ITHIASQSTVAPESEAKFVRLDILVQ--- 66 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPL----VLPMLFYHGC 126 + Y IE Q E +A RM Y A + LD G L ++ + + Sbjct: 67 -DEKNNFY-DIEMQVVNEHNVAKRMRYYQSALDVSFLDKGEYYTNLKDSYIIFVCLFDFI 124 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 + I + + ++++ + I LE I+ Sbjct: 125 GKNKAVYFFENICLEDEPIRLRDGTKKI-IINVDAFKN--IKDKALSGFLEYIKTGCITT 181 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 +++++ + ++ + + +M Sbjct: 182 KFSERIEKMIRTIKQNEQARQEYRFI---------------------------SAVVMDA 214 Query: 247 ADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + R +G G ++ + A + GL + + T LS ++ Sbjct: 215 KEEGRSQGFTDGVNQTKRKTAAALKAMGLAKSKIAKATGLSLAEIEKL 262 >UniRef50_Q2FSM2 Putative uncharacterized protein n=3 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSM2_METHJ Length = 304 Score = 80.7 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 30/291 (10%), Positives = 82/291 (28%), Gaps = 14/291 (4%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S +D +F+ + ++ + ++ + + L Sbjct: 23 SPKNDFLFRLLFGDDGNEE---LLASLLSSILHEEIEHVVIKNPYILKLFSEDKETILDI 79 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLVLPMLFYH 124 V IE Q + R++ Y + + G L +L Sbjct: 80 KAAINSKK---LVDIEIQLWNSPCLMSRILFYWARLYASQIKQGNDYTVLQKTTSILILD 136 Query: 125 GCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIR 184 ++++ + + + + L ++ + + Sbjct: 137 DLNHSSEDYHACSHLHDWKQHITLTDMIEVHVLELPKLHNLKQLDKSNTLLQWMLFFNAQ 196 Query: 185 QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLM 244 R+ +++ + + L Q + A + R + + Sbjct: 197 TRE------ELIMVSEANPVIKKATDLLITMSRDEETRQMYEAREEYLLGRQIEIQGAKG 250 Query: 245 TIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + R EG ++G+ E IA +++ G++ E + +T L + + Sbjct: 251 EGREEGRIEGRIEGRETERKDIAMRLIEEGMNDEFIKKITGLDIAVIRSLH 301 >UniRef50_C4G7H9 Putative uncharacterized protein n=2 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G7H9_ABIDE Length = 305 Score = 80.3 bits (196), Expect = 7e-14, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 103/297 (34%), Gaps = 25/297 (8%) Query: 9 PHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLK--LEPNSFIDEDLRQYYSDLLW 66 +D K + D D ++ L ++ +L+ +++ ED + + + Sbjct: 3 DYDVTEKLLEDYNDVFADIVNTLL-FDGKERVKEDSLEDSKINSAYKAEDGKLHEQERDV 61 Query: 67 SVKTQEGVGYIYVV-IEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP-----LVLPM 120 S +EG + VV IE+Q+K E+LM R++ Y A+ ++ L LP V+ + Sbjct: 62 SKYWKEGNTNLLVVGIENQTKAEKLMPARIIGYDGASYRSQLLKSTGRLPKNKLTPVVTI 121 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI-MQHRKMALLELI 179 + Y G + + + +I +P++++ L+ Sbjct: 122 VLYFGLTRWNQPKNLKGILDIPTGLEDFVSDYKINVFEIAFLPEEKVNKFKSDFRLVAKY 181 Query: 180 QKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQE 239 +IR+ L + + A+ ++ ++ + + Sbjct: 182 FTNIRKNPY---------YLPADENEIKHVDAVLKFLSIMSGSEDIIEKLTANNGSEVKN 232 Query: 240 KEK------LMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 REEG +QG +E L++ +G+ E + + + Sbjct: 233 MTGGPLSQLYYKGVSEGREEGLLQGINETLLKVYLNCRSKGMSVEESEEIVHFADRE 289 >UniRef50_Q3ARM2 Putative uncharacterized protein n=10 Tax=Bacteroidetes/Chlorobi group RepID=Q3ARM2_CHLCH Length = 322 Score = 80.3 bits (196), Expect = 8e-14, Method: Composition-based stats. Identities = 31/311 (9%), Positives = 83/311 (26%), Gaps = 36/311 (11%) Query: 11 DAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D FK + D F++ LP + D L PN + + ++ Sbjct: 13 DFGFKKLFGSEMNKDLLIAFLNTLLPIEAGTIAD---LTFLPNDRVGRSEFDRRA--IFD 67 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGY--KELPLVLPM----L 121 + + G Y ++E Q ++ R + Y+ +Q G L + + Sbjct: 68 LHCKNEKGE-YFIVEMQQAKQDYFKDRSVFYASFPIQEQAQKGKWNYCLQPIYMVGILDF 126 Query: 122 FYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 + ++ + F +++ + Sbjct: 127 IFDENKADDTIVHHEIKLVNLSTGKVFYEKLTFIYLELPKFTKSVDELESDFDKWCYLLS 186 Query: 182 HIRQ------RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 ++ R + ++ L + + + D + + + Sbjct: 187 NLPDLTDRPARLQEKVFLKVFELAEIAKYTPEEAREYEKSLKVYRDLKNVIDCAYDEGKA 246 Query: 236 A---------------PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELV 280 + ++G G + L IA++++ +G+ + Sbjct: 247 EGIEEGIEKGKEIGVLEGMVKGKELGLQEGLQKGMEAGLLKGKLEIARKLMVKGMSADEA 306 Query: 281 MMVTRLSPDDL 291 + + + L Sbjct: 307 AGIAGVDVERL 317 >UniRef50_C0QGW4 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGW4_DESAH Length = 298 Score = 79.9 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 46/289 (15%), Positives = 97/289 (33%), Gaps = 17/289 (5%) Query: 10 HDAVFKSFLRH-PDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSV 68 HD FK+ P D+ K+ D+ L+ EP D D+ Sbjct: 4 HDHNFKNLFLDFPKETLDWFFPQAGQSWGKVLDVEFLRQEPKKHNLSD-SSLELDMPILF 62 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRS 128 + ++ ++E Q + ++++RY+ M+ H DA LV+P + + + Sbjct: 63 NFENQQLLLW-LVEFQEDKSKFSIYKLLRYTTDLMETHPDA------LVIPTVLFTDRKK 115 Query: 129 PYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDL 188 L L + L D+ D + + L + H ++ D Sbjct: 116 WSKAVLQQLHAQLHDRMFLHFEYVFHKLFDLNAR--DYYNVDNPVVKILLPKMHYKKEDR 173 Query: 189 LGLVDQIVS---LLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 + ++ Q + LV+ D+ + + Y + Q EI + Sbjct: 174 IEVIRQAYAGLFQLVSSGLFDKYVDFIDTY--AEIEDQEQLNLYNEIVQHKETAMLA-QY 230 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 I +R +EG + + + + ++ G+ + + L + Sbjct: 231 IRERGMQEGRKEERKQSLISFIRKAKQEGVSVPTIAKIVDLDVSMVNKI 279 >UniRef50_A5D5U3 Hypothetical membrane protein n=3 Tax=Peptococcaceae RepID=A5D5U3_PELTS Length = 292 Score = 79.9 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 88/236 (37%), Gaps = 16/236 (6%) Query: 45 LKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQ 104 ++ + + Q SD L V+ ++G Y+ +++E Q++P+ MA R++ Y+ Sbjct: 30 VEQIEDKDKEAVAVQRTSDALVKVR-EDGYEYL-MLVEFQARPDRKMARRLLEYTAM--- 84 Query: 105 NHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPD 164 +H P+++ + Y + + + + + +++ + Sbjct: 85 HHCRHEKPVYPVIINLTGGSLQDGWYTF----------ECLDLTVVNFNYRQINLQDIAG 134 Query: 165 DEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR 224 E++ + LL L ++D+ L + + L+ + + Sbjct: 135 RELLYRGPVGLLPLAPLMSHDEPPEKVLDKCARRLQSEVEAEDDRALLYLALAALASLKY 194 Query: 225 FRAFIGEIAERAPQEKEKLMTIA-DRLREEGAMQGKHEEALRIAQEMLDRGLDREL 279 + I + E + E L + +G ++GK+E + EML ++ Sbjct: 195 PKDLILRVLEVSRLENIPLFDGIREEWEAKGRIEGKNEGKIEGMVEMLFDLVEARF 250 >UniRef50_A1ZPJ4 Hypothetical conserved protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZPJ4_9SPHI Length = 302 Score = 79.9 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 52/295 (17%), Positives = 103/295 (34%), Gaps = 38/295 (12%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 S +D +FK + + +L + E + + +D L Sbjct: 19 SNQYDKIFKENIG--EHFLSLSKTYLGIEVASS--------EELKDKLQTTLEREADFLR 68 Query: 67 SVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGC 126 + T +G I + +E QS E+ MA RM Y Q + + + Y G Sbjct: 69 KITTPKGEQMI-IQLEFQSTDEQGMAERMQLYFAILRQKYK-------LPIRQFVIYVGS 120 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL-IQKHIRQ 185 + P + +E + F L+D+ V + ++ + L + +Q Sbjct: 121 KPPKMRTRLKPEEV----------FTGFELLDLRQVSYTQWLESDIPEEVLLAVLGDFQQ 170 Query: 186 RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMT 245 + + ++ QI+S +V L Y+ Q R R + E + Sbjct: 171 KKVSTVLKQIISKIVKLI---DDPGTLQKYIRQLATFARLRNLVIETEQTLEYMGLTYDI 227 Query: 246 IADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRL------SPDDLIAQ 294 D + G +G+ E + QE +++G+ + +V MV L +++ Sbjct: 228 EKDVFYQRGVKKGQQEGIEKGHQEGIEKGITQGVVKMVIALLKSGKMPLEEVARI 282 >UniRef50_C5EKZ7 Predicted protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKZ7_9FIRM Length = 329 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 90/292 (30%), Gaps = 45/292 (15%) Query: 15 KSFLRHPDTARDFIDI--HLPAPLRKLCDLTTLKLEPNSFIDEDLR-----QYYSDLLWS 67 + L HP DF + + + L+ + E I + + D++ Sbjct: 9 RKLLNHPARFADFYNGTVFGGRQVLRPEQLSDVPNEQGIVILDKDGKKRVVERRRDIIKK 68 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD------------------- 108 G +I E+Q M R M Y ++ Sbjct: 69 ASF--GAYFILAAEENQDTIHYGMPVRNMMYDALDYTEQMECLKQAHKSRGDVLDGGGFL 126 Query: 109 ---AGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARK-------IYSSAFPLVD 158 L V+ ++ YHG + P+ D A A++ + L+D Sbjct: 127 SGITREDRLMPVVSLILYHGSK-PWDGPRSLYDMLGLDASAKETLALKQVLPDYRINLID 185 Query: 159 ITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQ 218 + + E+ + +++ + ++ G Q + D ++ + Sbjct: 186 ASNIEHPELFCTSLQHVFSMLKYNTDKQKFYGYAKQ-----HQKDLLDMDDDSMLAMLTL 240 Query: 219 TGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 G+ +R + E + +E + D L +G ++GK E + + Sbjct: 241 LGEQKRLLKIL-ETSSNDTKEGTDVCIAIDELINDGKIEGKIEGKIEGEHRL 291 >UniRef50_UPI0001BC3131 hypothetical protein BcroD2_12630 n=4 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3131 Length = 247 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 34/266 (12%), Positives = 74/266 (27%), Gaps = 26/266 (9%) Query: 1 MTISTTSTPH-DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 M T + + D VF+ + + D+ LE ++ Sbjct: 1 MNNETVNRKYKDTVFRLLFKDKSNLLSLFNAVNDTDFSDENDIKITTLENAIYMTSKNDI 60 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA------GYKE 113 + +K + EHQS M +R + Y + ++ Sbjct: 61 SC---IIDMKLN--------LFEHQSTVNPNMPYRNLEYVTKCFKRYVGNFDVYTGKALT 109 Query: 114 LPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKM 173 LP ++FY+G P + L + +I + + +M + Sbjct: 110 LPNPKFVVFYNGVNEQPPIRVMRLSDLYAHKDEIPNLELVVIQYNINNLVNCTLMDRCEP 169 Query: 174 ALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 + +G + + + G D + + R + ++ Sbjct: 170 LK--------EYSEFIGCIRSNLKTMDKGEAVDSAIDYCIGNGILKDFLTNNRNEVRSMS 221 Query: 234 ERAPQEKEKLMTIADRLREEGAMQGK 259 +E I E+G +G+ Sbjct: 222 LFEFDAEEHEKAIKQIAYEDGYDKGE 247 >UniRef50_Q8YTL4 All2703 protein n=13 Tax=Cyanobacteria RepID=Q8YTL4_ANASP Length = 270 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 35/284 (12%), Positives = 87/284 (30%), Gaps = 23/284 (8%) Query: 18 LRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ--YYSDLLWSVKTQEGVG 75 ++ P +L + + + F +++Q + D L+ K + Sbjct: 1 MKTDTIFYSLFQEF-PHIFFELINQSPQEASIYEFTSREVKQLAFRLDGLFLPKINDSTK 59 Query: 76 YIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLC 135 Y+ +E Q +P++ +R+ +L P + ++ Y ++ Sbjct: 60 PFYI-VEVQFQPDDDFYYRLFAELFL----YLKQYKPPYPWQV-VVIYPSRGIERQQTIH 113 Query: 136 WLDEFAEPAIARKIYSSAFPLVDITVVPDDEI---MQHRKMALLELIQKHIRQRDLLGLV 192 + + + R L ++ V + + + + E RQ L+ Sbjct: 114 FDEILVLNRVKR------IYLDELGEVAETSLGVGVVKLVIETEETAPVLARQ-----LI 162 Query: 193 DQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLRE 252 Q L + + ++ + + ++ + ++ Sbjct: 163 AQAKQQLTDVTAKRDLINLIETIIVYKLPQKSREEIEAMLGLNELKQSRVYQEALEEGKQ 222 Query: 253 EGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 EG +GK E L M+ GL E + + L + + Sbjct: 223 EGKQEGKQEAKLETIPRMVQFGLSVEAIAQLLDLPLEVVQQAVQ 266 >UniRef50_Q3ARU8 Putative uncharacterized protein n=12 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ARU8_CHLCH Length = 324 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 45/315 (14%), Positives = 97/315 (30%), Gaps = 31/315 (9%) Query: 7 STPHDAVFKSFLR--HPDTARDFI-DIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSD 63 +D+ +K + P+ F + L K +L + + + D Sbjct: 11 RDDYDSPWKEAIELYFPEFMAFFYPNAFLAIDWSKPYHFLDQELRSI-LPEAENGKRIVD 69 Query: 64 LLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNH---------LDAGYKE 113 L V G +Y+ IE Q E R+ + + L Sbjct: 70 KLVQVHLLGGKERCLYIQIEVQGNREADFPRRIFICNYRIFDKYGKPVASFVILTDSDSS 129 Query: 114 LPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVD---ITVVPDDEIMQH 170 + G + + + L +F +AF LV + E Sbjct: 130 WRPTTYSYEFAGSKMTLEFDMVKLLDFEPRIKELLASDNAFALVTAAHLLTQKTREKSFE 189 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 R A +LI+ ++ V ++ ++ ++L+ + + ++ +I Sbjct: 190 RLDAKSQLIRLLYNKQWTKERVKELFRVIDWFMELPKELEQQLQTEIYNIEEEQKMKYIS 249 Query: 231 EIAE--------------RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLD 276 I ++ LM +R +G G + L IA+ ++ G+ Sbjct: 250 SIERYAMEKGWSEGMERGILEGMEKGLMEGMERGMAKGKEIGAEQTKLDIARRLVASGIS 309 Query: 277 RELVMMVTRLSPDDL 291 + ++ +S + L Sbjct: 310 KAEAALLAGVSLETL 324 >UniRef50_UPI00006CAA90 hypothetical protein TTHERM_00670420 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAA90 Length = 345 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 29/289 (10%), Positives = 88/289 (30%), Gaps = 11/289 (3%) Query: 11 DAVFKSFLRHPDTARDFIDIHL---PAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D VF+ + + + F++ L L + + ++ Sbjct: 64 DFVFEKIFSNHERMKSFLESVLVGKNKILHEEINEVIYLNNNLLQNSLTQEYIPKKSMFD 123 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHG-C 126 ++ + G V I ++ + R+ YS ++ + + L ++ + Sbjct: 124 LQIKTSQGTFIVEI-YKRSFQP-FLKRIQYYSAQSLSQQQNQTHTSLKPIISIAIVDDIL 181 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 + + + + S + +++ + + Q + E + ++ Sbjct: 182 FEDDVPCISFHKTIEQKTQKVFLNYSTYVFIELGKYDNKKYDQSCVHGVNEKEWLDLLKK 241 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTI 246 + + +L + LF+ ++ + I E + E + Sbjct: 242 SDIHRQYKTKEVLNAAQYAQFIQEKLFDEYVKHKLYED-----QFIEEIKNAKVEGIQQG 296 Query: 247 ADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQS 295 + + + ++ML GL + ++ T LS +++ Sbjct: 297 QEETIKLSKHYSIKAGKEEVVKQMLKDGLSLQKIITYTGLSKEEIDEIK 345 >UniRef50_B0NFN2 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=B0NFN2_EUBSP Length = 341 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 38/298 (12%), Positives = 92/298 (30%), Gaps = 21/298 (7%) Query: 5 TTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDL 64 T +F+ + + L LE ++ ++D+ Sbjct: 37 PNRTYKARLFEMIFSQKKELLELYNAVNGTSYDDPELLEINTLENAIYMSM-----HNDI 91 Query: 65 LWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK------ELPLVL 118 + + ++ + EHQS + R + Y + +P Sbjct: 92 SFIIDSR------LALYEHQSTYSPNLPLRHLMYVTDLYSAMIRDANLYGSRIVRVPTPR 145 Query: 119 PMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLEL 178 ++FY+G + + L + +++I + ++M+ + L + Sbjct: 146 FLIFYNGEQEQPERRILRLSDAYTVPEESPALELEAVMLNINEGKNRQLMESCR-TLSDY 204 Query: 179 IQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQ 238 + R R +++ +S V + + + + L A+ + I E E Sbjct: 205 ARYTQRVRGYARVME--ISAAVERAVTECIAEGILSEFLSKNRAEASKVSIYEYDEE-KH 261 Query: 239 EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 ++ R EG +G+ + + Q+ +G E P ++ H Sbjct: 262 MRQVREEGQMDGRNEGRSEGEALKLITQIQKKCQKGKSLEETAEDLEEKPGEIEGIYH 319 >UniRef50_C8NHS0 Putative uncharacterized protein n=1 Tax=Granulicatella adiacens ATCC 49175 RepID=C8NHS0_9LACT Length = 278 Score = 78.0 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 38/297 (12%), Positives = 91/297 (30%), Gaps = 28/297 (9%) Query: 3 ISTTSTPHDAVFKSFL---RHPDTARDFIDIHLPAPLRKLCDLTTLKLEPN--SFIDEDL 57 + +D +FK + ++FI + L + +++ + +L Sbjct: 1 MRKILPTNDLMFKKMMTSEGKEYILQNFIQVVTGMKLSNVKPTNPYQIQKYRENLAGVNL 60 Query: 58 RQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 Y + + + T+EG I ++IE Q R+ Y + + AG++ V Sbjct: 61 EMYQTIVDIAATTEEG---IDIIIEMQLYKHRGFFERIRYYMASTYMDSYSAGHQTYKPV 117 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLE 177 + ++ LV++ + + + + Sbjct: 118 ISIVVTDFSVFKEDPE----------------PRVEIGLVNLEKNREVLNEKGQPFERVY 161 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 L+ + ++ + L G + K + D + ++AE+ Sbjct: 162 LVNLATTLPNQDEAFNEWRNFLKNGTITAKASKEI-QDAYAVVDFYNLDSEEMKMAEQME 220 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQ 294 + +E + +E G E L L ++ T LS +++ Sbjct: 221 KYEEVYWKTIEYAKETAREAGLKEGQ---VLAFLKMNLPITEIIKHTGLSEEEIQKI 274 >UniRef50_Q8YMI0 Alr4953 protein n=8 Tax=Cyanobacteria RepID=Q8YMI0_ANASP Length = 314 Score = 78.0 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 43/313 (13%), Positives = 104/313 (33%), Gaps = 36/313 (11%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFI----DED 56 MT + D+ +K L + P + + F + + Sbjct: 1 MTDNNERADFDSPWKEIL--EAYFPQAVQFFFPETAALINWERPYEFLNTEFQQIAREAE 58 Query: 57 LRQYYSDLLWSVK-TQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP 115 + Y+D L V Q ++ + +E Q++ E+ + RM Y+ P Sbjct: 59 QGKPYADQLVKVWQIQGEEIWLLIHVEIQAQKEDDFSKRMFTYNFRIFDRFEK------P 112 Query: 116 LVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDD-EIMQHRKMA 174 + + R P + + + + F +V + + + +++ Sbjct: 113 AISLAILCDTNRQWRPSNYSYNYP-------QTRLNFEFGIVKLLDYENRFDELENNTNP 165 Query: 175 LLELIQKHIRQ-------RDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR--F 225 ++ H++ ++ ++ L ++ ++ L+ ++ + Sbjct: 166 FATVVMAHLKTQQTRSSPQERKIWKFSLIRRLYDLGLQEQDIRNLYRFIDWVMILPKALE 225 Query: 226 RAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR---GLDRELVMM 282 E+ + + + +T A+R+ G +G E L I ++L R L E+ Sbjct: 226 NQLCSEVQQLEQERTMRYVTSAERI---GYERGIQEGELGIILKLLKRRLGELSPEIQQR 282 Query: 283 VTRLSPDDLIAQS 295 + LS + L S Sbjct: 283 IQSLSVNQLENLS 295 >UniRef50_B4VQ19 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VQ19_9CYAN Length = 318 Score = 77.6 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 41/303 (13%), Positives = 86/303 (28%), Gaps = 20/303 (6%) Query: 3 ISTTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 + S D FK D F++ + + DL + + + L+ Sbjct: 1 MRFISPKTDFAFKKIFGAKDSKDILISFLNALIYNANPVIQDLEIIDPYNPGDVVD-LKD 59 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLD--AGYKELPLV 117 Y D+ G V+IE Q R++ N L GY L Sbjct: 60 SYLDV--RAVLDNGST---VLIEMQVLNVASFEKRVIYNLTKTYANQLKYGEGYSHLKPA 114 Query: 118 LPMLFYHGC--RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMAL 175 + + + + + + V++T Sbjct: 115 IALTITDFQLFDQTQRFLTRFGLKEKQELFDYTDPEIELIFVELTKFNKKLEQLDNLTDK 174 Query: 176 LELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAER 235 K L ++ + V + ++ F+ + Sbjct: 175 WIYFIKDAPS---LEVIPPTFRQVPELEKAMNIANQANLSVEELEKIRKREVFLEDQRGF 231 Query: 236 APQ-EKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR---GLDRELVMMVTRLSPDDL 291 + ++E + R EG ++G+ EE +R +L+R + ++ + LS ++L Sbjct: 232 IVKAKQEGRVEGRVEGRVEGRVEGRVEEGIRWTLRLLERQFGSIPPAIINQIQNLSVEEL 291 Query: 292 IAQ 294 Sbjct: 292 EDL 294 >UniRef50_C9RLI8 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLI8_FIBSS Length = 323 Score = 77.6 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 35/301 (11%), Positives = 90/301 (29%), Gaps = 33/301 (10%) Query: 11 DAVFKSFLRHPD---TARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D VFK ++ L +L+++ I + + D++ + Sbjct: 39 DGVFKIVFTEEKSHSLLISLLNAMLDLHGGDAIGEISLEMQEFPGI-FNKKNCIVDIIGT 97 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK-ELPLVLPMLFYHGC 126 E V++E Q + ++ R+ Y ++N + K ELP + + Sbjct: 98 TNAGEK-----VLVEIQQQKDKFFKDRVEYYVSRVIENQVHKSEKFELPHIYFLGLLDFE 152 Query: 127 RSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQR 186 P + V+I E + K + Sbjct: 153 LFPEEEHEYIHHVDEMCHGKKFFPKIQKVFVEIEKFFKLEKLGFTK----------DDES 202 Query: 187 DLLGLVDQIVSLLVTGNTNDRQLK--ALFNYVLQTGDAQRFRAFI----GEIAERAPQEK 240 D + I ++ ++ ++ + ++ E + + Sbjct: 203 DAAQWLRAIRVVIKEEPAPEKIMQNETFRRLLESVKLINFAEELFNCEVKKMTEVMAERE 262 Query: 241 EKLMTIADRLREEGAMQGK-------HEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIA 293 + R G +G +E ++A+ + ++ +D ++ T S +++++ Sbjct: 263 NAYAEGKEEGRAVGYAEGASAERTKADQEKRQMAKSLKEQNVDVSIIAKSTGFSEEEILS 322 Query: 294 Q 294 Sbjct: 323 L 323 >UniRef50_C9RMD5 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RMD5_FIBSS Length = 344 Score = 77.2 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 40/313 (12%), Positives = 94/313 (30%), Gaps = 43/313 (13%) Query: 11 DAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKT 70 DA FK+FL + +F++ + +K + I +Q+ D+ KT Sbjct: 38 DAAFKAFLSDEEALVNFLNGVFHLNEDNKIESVVIKNSEINIIFPSAKQFRLDI--RAKT 95 Query: 71 QEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN-----------------------HL 107 +G I + IE Q + R++ A M Sbjct: 96 SKG---ICINIEMQKARPDYFVDRVLLQQSAFMLQSKYEWDKLNFGDLPSCLTKEERAER 152 Query: 108 DAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEI 167 + E+P + + + + L D+T Sbjct: 153 EIHRYEVPPTYAIWICDFSIGKQKSFRGDWAVRNKKGLTL-TDKMMYILYDLTKFNKPYK 211 Query: 168 MQHRKMALLELIQKHIRQRDLL-----GLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDA 222 + K+ + + L ++ + ++ ++ +++ ++ N ++ T + Sbjct: 212 KITTTEDRWLYLLKYAGKAENLPDFNNSIIAKAINRILVNRASEKLIREQANDMVWTEEE 271 Query: 223 QRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 A + AE + ++G QG + + +A ML ++ Sbjct: 272 LDHLALLEVRAE---------KKGLKQGLKQGLEQGLEQGRVEMALAMLADNEPIGKIVK 322 Query: 283 VTRLSPDDLIAQS 295 + L ++ Sbjct: 323 YSHLPESKILELK 335 >UniRef50_A8SDU3 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDU3_9FIRM Length = 295 Score = 77.2 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 93/292 (31%), Gaps = 22/292 (7%) Query: 9 PHDAVFKSFLRHPDTARDFID--IHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLW 66 DA K + D D ++ + + +L D L + D+ Sbjct: 3 EKDASEKILESYNDVFSDIVNVLLFNGRQVLGADELEDQAPRTYYKADGRLHEIERDVA- 61 Query: 67 SVKTQEGVGYI-YVVIEHQSKPEELMAFRMMRYSIAAMQNHL---DAGYKELPLVLPMLF 122 + + G + + E+Q+ + M R++ Y A + L + P V ++ Sbjct: 62 -KRWKNGNIRVACIGFENQTASDPDMPLRVIGYDGAEYRAQLLGDNDTGSRYPAV-TLVL 119 Query: 123 YHGCRSPYPYSLCWLDEFAEP-AIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQK 181 Y G P+ L + P + L I + +++ + + Sbjct: 120 YFGHEKPWSGPLSLKERLNVPKEFEPYVNDYKINLFQIAYLTREQVELFQS-DFKVVADY 178 Query: 182 HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKE 241 +++R+ V L ++ + + RF + Sbjct: 179 FVQKRENGDYVPSSQDL--------THVQETLQLLSIMTNDHRFEDAYNTSTDDRKGGPR 230 Query: 242 KLMTIADRLREEGAMQGKHEEALRIAQEM---LDRGLDRELVMMVTRLSPDD 290 + + D++ G +G + R +M + + LD+ + V R S D+ Sbjct: 231 NMCDVLDKVENRGIEKGIVKGESRGENKMALLVKKLLDQNRIDDVKRASEDE 282 >UniRef50_B5Q357 Transposase n=10 Tax=Salmonella enterica subsp. enterica RepID=B5Q357_SALVI Length = 174 Score = 77.2 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 61/139 (43%), Positives = 79/139 (56%), Gaps = 25/139 (17%) Query: 183 IRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVL-QTGDAQRFRAFIGEIAERAPQEKE 241 +RQRDLLGLV++I SLLVTG NDRQLKALFNY++ Q G RF FI ++ P KE Sbjct: 36 LRQRDLLGLVERIASLLVTGCANDRQLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKE 95 Query: 242 KLMTIAD------------------------RLREEGAMQGKHEEALRIAQEMLDRGLDR 277 +LMT+ + E+G +G+H ALRIA++ML GLDR Sbjct: 96 RLMTLIERIRAADRRKGERQGRQLGLEEGLAEGLEKGLEKGQHVAALRIARQMLADGLDR 155 Query: 278 ELVMMVTRLSPDDLIAQSH 296 E V T L+ ++L SH Sbjct: 156 ETVQRFTGLTAEELQDVSH 174 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 35/115 (30%), Positives = 49/115 (42%), Gaps = 2/115 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M STTSTPHDAVFK+FLRHP+TARDF++IHLP LR+ L ++ + + Sbjct: 1 MKKSTTSTPHDAVFKTFLRHPETARDFMEIHLPVSLRQRDLLGLVERIASLLVTGCANDR 60 Query: 61 YSDLL--WSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE 113 L + + I R+M G ++ Sbjct: 61 QLKALFNYLMIQHGHTPRFTTFIRDVVGHVPHTKERLMTLIERIRAADRRKGERQ 115 >UniRef50_A4XJH0 Putative uncharacterized protein n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XJH0_CALS8 Length = 134 Score = 76.8 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 50/133 (37%), Gaps = 2/133 (1%) Query: 1 MTISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQY 60 M + + + A+F+ + ++++ + +E++ +Y Sbjct: 1 MNNNFSQDEN-AIFRLIFSDSKEILFLLKNVAKFSWVDRIQKDSIEVILVDYDNENVLKY 59 Query: 61 YSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 D++ V + YI+V + PE M ++ + + + G ++P ++P+ Sbjct: 60 KPDVIAKVTIENNTAYIFVFFVSK-VPECGMRNIILNNMLLFWEKKIKEGTDKIPPIIPL 118 Query: 121 LFYHGCRSPYPYS 133 + Y+G Sbjct: 119 VLYNGKEIWTEPR 131 >UniRef50_B8HNA0 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B8HNA0_CYAP4 Length = 315 Score = 76.8 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 39/289 (13%), Positives = 94/289 (32%), Gaps = 21/289 (7%) Query: 22 DTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVI 81 D F+ + + L + + +D L ++ ++ + + + Sbjct: 4 DNICKFLAESFSTEVATWLLGERISLFKLEPTELSVEPIRADSLILLEAED----LILHV 59 Query: 82 EHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVL----PMLFYH----GCRSPYPYS 133 E Q+ P+ M RM+ Y + ++ ++ + L +L Y + + ++ Sbjct: 60 EFQTGPDADMPLRMLDYRVRLLRRSPQKVVRQFVIYLRQTTSVLVYQTELQLESTWHEFN 119 Query: 134 L-CWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLV 192 + + +P +A + P + + E + L I+ Q +L Sbjct: 120 VVRLWECSTDPLLASRGL---LPFAVLGQTSNPEATLAQVAQRLSTIENRTEQSNLTAAS 176 Query: 193 DQIVSLLVTGNTN-----DRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIA 247 + L++ T ++ Y + + G ++ Sbjct: 177 AILAGLVLDQQTIQRLLRREIMRESLFYQGILEEGMQKGVERGIAQGIQLGLEQGRQEGL 236 Query: 248 DRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDDLIAQSH 296 ++ R+EG +G+ E Q+ + + R L +SPD S Sbjct: 237 EQGRQEGRQEGRQEGRQEGIQQGVLSLVLRSLTRKFGNISPDLQARISQ 285 >UniRef50_C6W4R9 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W4R9_DYAFD Length = 293 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 41/306 (13%), Positives = 83/306 (27%), Gaps = 32/306 (10%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAP---LRKLCDLTTLKLEPNSFIDEDLR---QYY 61 +D ++KS L + DF+ P L E + Y Sbjct: 2 KRNDMLWKSIL--EEIFDDFLKFFFPNAEALFDMDRGFEYLDQELEQLFPPEGNAIATRY 59 Query: 62 SDLLWSVKTQEG-VGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPM 120 D L V + G ++ V IE Q +E RM Y + + + Sbjct: 60 VDKLVKVYCRSGAEAWLLVHIEVQGYRDETFPDRMFTYYYRICDKYRK-------PITAI 112 Query: 121 LFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRK-----MAL 175 + F + ++E+ + Sbjct: 113 AILTDDCRHFLPGQFEQACLGTSV------CFRFNSYKVLEQSEEELAASDNPFAQVILA 166 Query: 176 LELIQKHIR--QRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIA 233 +L K R +L L + L+ N + R++ L ++ + + Sbjct: 167 TKLAIKGSRFSSDELYRLKIDLAKRLLKRNFSKRKVGRLMEFLKFYVSLEDDDLDREYLK 226 Query: 234 E--RAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR-GLDRELVMMVTRLSPDD 290 E R + MT + + +G + Q ++ E + + +S + Sbjct: 227 EVQRLFNPEPIPMTWEETILYIVEEKGAEAAKTTVVQNLIRETNFTSEEIARLADVSVEF 286 Query: 291 LIAQSH 296 + Sbjct: 287 VQKIKQ 292 >UniRef50_B7I1C8 Putative uncharacterized protein n=16 Tax=Bacillus cereus group RepID=B7I1C8_BACC7 Length = 307 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 93/304 (30%), Gaps = 18/304 (5%) Query: 2 TISTTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLR 58 + + D FK + D F++ L + + + T+L L+ E Sbjct: 6 KKNLVNLRVDYAFKRLFGVEGNEDILIGFLNAVLQSSIDEEI--TSLHLDDPHLPREQKD 63 Query: 59 QYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPL 116 S L G I + IE Q + ++ M R + Y + + G Y EL Sbjct: 64 DKLSILDLRATLNSG---IKINIEIQVRDKKDMIERSLFYWSGMYYSQMTQGMKYTELRP 120 Query: 117 VLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSS-AFPLVDITVV-----PDDEIMQH 170 + + P ++ R I + ++I V Sbjct: 121 TICINIVDFILFPEEQEFHSINTVMNKKSKRIITENMQLHFLEIPKVIQEWQGKRMDPWE 180 Query: 171 RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIG 230 +A L+ L +++ I + + ++ + + A Sbjct: 181 DSLARWLLLFPAHEDERLTTILEAIA--MEKDPVLKKAIEDWERLSSDKDFLRLYEAREK 238 Query: 231 EIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSPDD 290 I +R + + A + E + K + + + M GL E + V LS ++ Sbjct: 239 AIKDRISEIETAEERAAKKAAEIATEETKIATKIEMIENMFKIGLPIEKIAKVAELSVEE 298 Query: 291 LIAQ 294 + Sbjct: 299 VNEI 302 >UniRef50_B4VTF8 Putative uncharacterized protein n=7 Tax=Oscillatoriales RepID=B4VTF8_9CYAN Length = 306 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 38/306 (12%), Positives = 82/306 (26%), Gaps = 21/306 (6%) Query: 3 ISTTSTPHDAVFKSFLR---HPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQ 59 + + D FK +P+ F++ L +T L++ + Sbjct: 1 MIFINPKTDFAFKKIFGSEQNPEILISFLNSLL---YGGHPRITELEIINPYLAPKIQGI 57 Query: 60 YYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKE--LPLV 117 + L K + V+IE Q R++ + A L+ G L V Sbjct: 58 KDTFLDVKAKLTDETT---VIIEMQVLNLSGFEKRILYNAAKAYSIQLEPGDDYTLLNPV 114 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPA--IARKIYSSAFPLVDITVVPDDEIMQHRKMAL 175 + + + E I V++ + Sbjct: 115 IALTLTDFEMFEDLPQVISNFVLKEKKVLTDYPINDLELVFVELPKFTKELDELETLADK 174 Query: 176 LELIQKHIRQRDLLGL-------VDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAF 228 K R + + + + + N +L+AL + D + Sbjct: 175 WIYFIKCARGLETIPETMAQVPEIRKAFEVANQANMTREELEALEQREIYIHDQRNAIKL 234 Query: 229 IGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMVTRLSP 288 ++ + ++ +EG QG E + Q + + +L Sbjct: 235 -ALRQGIQLGREQGIQVGREQGIQEGREQGIQEGREQEKQRTREAEQLAQQERQRAKLLE 293 Query: 289 DDLIAQ 294 + L + Sbjct: 294 ERLRSL 299 >UniRef50_B7K6I4 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B7K6I4_CYAP8 Length = 319 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 46/315 (14%), Positives = 101/315 (32%), Gaps = 38/315 (12%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSF----IDEDLR 58 + ST D+ +K DF+ P ++ + D L Sbjct: 1 MKDPSTSFDSPWKDI--VEAYLPDFMAFFFPDAYEQINWEQGFEFLDKELGQVVRDAQLG 58 Query: 59 QYYSDLLWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLV 117 + + D L V + G ++ + +E QS+ E A R+ Y + V Sbjct: 59 KRFVDKLVKVYRRSGEETWVLIHLEIQSQYEAGFAERIYVYQYRIYDRYRRK-------V 111 Query: 118 LPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVV-PDDEIMQHRKMALL 176 ++ + S + F + + +V + + D E + + Sbjct: 112 ASLVVLGDESPTWKPSEFGYEIFGVEI------NYRYRVVKLLDLGQDWEALSANENPFA 165 Query: 177 ELIQKHI-------RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQR--FRA 227 ++ H+ +++ L + L + + LF ++ Sbjct: 166 TVVMAHLKAGQTKKNRQERLQWKLSLTRQLYQKGYLRQDVINLFRFIDWILSLPDNLESE 225 Query: 228 FIGEIAERAPQEKEKLMT-----IADRLREEGAMQGKHEEALRIAQEMLDR---GLDREL 279 F E+ + +++ +T +R REEG ++G EA + L+R + + Sbjct: 226 FWSELRQYEEEQRMPYITSVERLGRERGREEGRLEGMQREAANMVLRQLNRRLGQVSPSV 285 Query: 280 VMMVTRLSPDDLIAQ 294 + +L + L Sbjct: 286 EEQIRQLRVEQLEDL 300 >UniRef50_UPI00006A2D99 UPI00006A2D99 related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2D99 Length = 308 Score = 75.3 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 35/274 (12%), Positives = 82/274 (29%), Gaps = 20/274 (7%) Query: 7 STPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYY--SDL 64 T HD FK+ + R + P + + D + + L + D+ Sbjct: 1 PTSHDQNFKNLILD--YPRQALQFFAPDEAKNIDDSAVITPIRQEQLKNRLGDRFYELDV 58 Query: 65 LWSVKTQEGV-GYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFY 123 V+ +G + ++E ++ P R++ Y + L + +P+V+ + Sbjct: 59 PLKVEWPDGRHAAMLFLLEEETDPARFSIHRLVSYCANLAE--LMGTNRVVPIVIFL--- 113 Query: 124 HGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELIQKHI 183 S + S + + +P ++ + + Sbjct: 114 -------RSSPDIRRDLHLGVDGVNFLSFHYIACVLPDIPAEQYKDSTNIVARIALPTMH 166 Query: 184 RQRD-LLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRA--FIGEIAERAPQEK 240 R+ ++ ++ + L T N + +++ + F + Sbjct: 167 YAREQVIDVMAWALRGLDTLEANGDKRIKYLDFIDTYSQLEDNERQLFKQRYPQEEKTVT 226 Query: 241 EKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 + + +G QG E L QE G Sbjct: 227 SIVQRAIHQGIHQGIHQGIQEGMLMGRQEGRQEG 260 >UniRef50_Q3ATN4 Putative uncharacterized protein n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ATN4_CHLCH Length = 287 Score = 74.9 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 44/295 (14%), Positives = 89/295 (30%), Gaps = 37/295 (12%) Query: 8 TPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCDLTTLKLEPNSFIDEDLRQYYSDLLWS 67 D V K L A D I L + +L + + + +D++ Sbjct: 2 HAKDVVSKDIL--KRIALDIARILL------HLKVDHAELLETEH--QRVEERRADVVVL 51 Query: 68 VKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELPLVLPMLFYHGCR 127 V+ G + +E Q+ + +A+R++RY H K+ + + Sbjct: 52 VQ--GESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAHKGYDIKQYLIYIG-------- 101 Query: 128 SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALLELI----QKHI 183 I + + ++D+ V ++ L L K Sbjct: 102 --------KAPLSMPTGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDFKGR 153 Query: 184 RQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKL 243 +R+++ + Q + L N + +L + I E + Sbjct: 154 SEREVVRYIIQRLQELTAENESRYHDYMRMLEILSANRS--LEKIIEEEEAMLSVVDQTR 211 Query: 244 MTIADRLREEGAMQGKHEEALRIAQEMLDR---GLDRELVMMVTRLSPDDLIAQS 295 + G QG + L + + L R L V + +L+ + L S Sbjct: 212 LPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHHVARLDKLNIEQLEELS 266 >UniRef50_C8WSD0 Putative uncharacterized protein n=5 Tax=Alicyclobacillus acidocaldarius RepID=C8WSD0_ALIAD Length = 270 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 84/261 (32%), Gaps = 35/261 (13%) Query: 42 LTTLKLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIA 101 + TL+ LR D W + + +E Q + E + R + Y Sbjct: 34 VETLEPFTTELPASTLR---MDRAWRMANGD-----VFHLEFQDRRERTLH-RFLEYDAR 84 Query: 102 AMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITV 161 + ++ YH + P L A + F Sbjct: 85 LANQVKTR-------IRTVVLYHAQVASAPQELDI-------GTAIYRVENVFLSALDGD 130 Query: 162 VPDDEIMQHRKMALLELIQK-------HIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFN 214 DE+ H ++ E + +R D + ++++LL ++ + Sbjct: 131 GALDEVEAHLRVGRWEPADRLRLGLALSMRVEDRHQAMARVLNLLPRVPDDEERELVASA 190 Query: 215 YVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRG 274 + RA E + +E + + +A+ L E+G GK + A IA +L G Sbjct: 191 VLAFG-----DRALSDEDRRKLRKELKNVFRMAEELYEDGRHDGKQQAAEDIAHRLLAEG 245 Query: 275 LDRELVMMVTRLSPDDLIAQS 295 + ++V T L + L Sbjct: 246 VPVDVVEKATGLPRERLEQMK 266 >UniRef50_Q1PZ06 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PZ06_9BACT Length = 238 Score = 74.5 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 73/188 (38%), Gaps = 10/188 (5%) Query: 96 MRYSIAAMQNHLDAGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFP 155 M+Y + + + L V+P++ YHG + E + R I + Sbjct: 1 MKYLLKIWAANSKQ-MQRLIPVIPVILYHGKETWKVRRFRDYFEGIDEVFFRFIPEFEYL 59 Query: 156 LVDITVVPDDEIMQH----RKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTND----R 207 L D++ ++EI + + L+ ++I +LG + + + + Sbjct: 60 LTDLSFYSNEEIKDKVFRRVSLQITMLLMRNIYNDKILGDKLKAFFEIGKQYFEEGEGLK 119 Query: 208 QLKALFNYVLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIA 267 L+++ Y+ D + I + E + + MTIA RL E+G + G+ E Sbjct: 120 FLESVIRYLYYASDIEE-ERVIDTLKEISEEGGRLSMTIAARLIEKGKIAGRMEGRAEGE 178 Query: 268 QEMLDRGL 275 ++ GL Sbjct: 179 RKGRMEGL 186 >UniRef50_Q111X0 Putative uncharacterized protein n=10 Tax=Oscillatoriales RepID=Q111X0_TRIEI Length = 309 Score = 73.3 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 41/300 (13%), Positives = 80/300 (26%), Gaps = 22/300 (7%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCD-LTTLKLEPNSFIDEDLRQYY 61 + S D FK ++D + L A + + +L + + L Sbjct: 1 MKFVSPKIDYAFKKIFGS-QQSQDILISFLNAIIYGGKKIIQSLTIANPFNPGQLLSLKD 59 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYKELP--LVLP 119 + L +G VVIE Q R+ A N L+ G L + Sbjct: 60 TYLDIKAVLVDGS---IVVIEMQVARMTGFNKRVAYNLAKAYANQLETGEDYLLLNPAIG 116 Query: 120 MLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPL--VDITVVPDDEIMQHRKMALLE 177 + + F + K L V++ + H Sbjct: 117 VTITDFILFENNEDIINKFVFQQETKKFKFLEQELQLFFVELPKFKKNLSELHTLSDKWI 176 Query: 178 LIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERAP 237 K +D+I L + ++ N + + Sbjct: 177 YFLKSA------SRLDEIPENLREVSEIEKA----LNIANKINMTAEELDIVERRGIAMQ 226 Query: 238 QEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDR--GLDRELV-MMVTRLSPDDLIAQ 294 E+ ++ + G +G+ + ++ L + G E + + LS L Sbjct: 227 DERGRITYAEQQGERRGEQKGEQKGRGQLIIRQLKKRFGEVPEAITSQIEGLSVAHLDNL 286 >UniRef50_B0MQP0 Putative uncharacterized protein n=2 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQP0_9FIRM Length = 289 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 42/287 (14%), Positives = 88/287 (30%), Gaps = 25/287 (8%) Query: 3 ISTTSTPHDAVFKSFLRHPDTARDFIDIHLPAPLRKLCD-LTTLKLEPNSFIDEDLRQYY 61 + D +FK + + +L L D + L + + + + + + Y Sbjct: 19 SNIVKAKLDIIFKKLFTDEGN-QHLLQAYLSDTLGIPYDSIENLVVLNSEIMPDSITEKY 77 Query: 62 SDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAG--YKELPLVLP 119 S + +K + +E Q K E R + Y L +G Y L + Sbjct: 78 SRMDIRMKANGR----LINVEMQIKDEGDYKDRSLYYLSKLYSGQLKSGEVYGSLNQCIS 133 Query: 120 MLFYHGCR---SPYPYSLCWLDEFAEPAIARKIYSSAFPLVDITVVPDDEIMQHRKMALL 176 + + Y S ++ + K + F L I D Q + L+ Sbjct: 134 INIINFNLFDCEKYHSSFSMREDSRNEQLTDKFTAHYFELKKIGKNIDKNNKQELWLRLI 193 Query: 177 ELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDAQRFRAFIGEIAERA 236 + D L ++ Q T +Q++ + + ++ R + Sbjct: 194 -----NAETEDELDMLQQ---------TGVKQIQDAVVVLHKMSADEKTRELAEMREKAL 239 Query: 237 PQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMMV 283 E + G +G+ + + +M GL E + + Sbjct: 240 HIEATEKAHARAEGEAVGLKKGEKRKEAEMISKMRKSGLSEEQIKAI 286 >UniRef50_Q24Y59 Putative uncharacterized protein n=4 Tax=Peptococcaceae RepID=Q24Y59_DESHY Length = 283 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 35/252 (13%), Positives = 85/252 (33%), Gaps = 18/252 (7%) Query: 46 KLEPNSFIDEDLRQYYSDLLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQN 105 L P+ + + +D+++ ++ + +E Q+ E R + Y ++ Sbjct: 40 DLIPSVHPAVEANETRNDIIFLLEDDT-----LLHLEFQTTAGEQDLKRFLYYDARLVRR 94 Query: 106 HLDAGYKELPLVLPMLFYHGCRSPYPYSL---CWLDEFAEPAIARKIYSSAFPLVDITVV 162 V ++ Y G L L + + + + + Sbjct: 95 QERK-------VHTIVIYSGRIEQARERLECGSILYQVENIYMKHYNGDQEYNRLK-HKI 146 Query: 163 PDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQTGDA 222 + +++ L + ++ L Q L +L A+ ++ T Sbjct: 147 DNHQLLSETDTLKLIFLPLMKSEQKEEELAIQAAELAKAAPDEKTKLFAIAALIVITDKI 206 Query: 223 QRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQEMLDRGLDRELVMM 282 + + + I + R+EG ++G+ +E AQ ML+ G+ EL+ Sbjct: 207 MSESNKRKLL--EVLKMTQIEQWIREEGRQEGELKGRRDEKRETAQTMLNLGMSPELIAK 264 Query: 283 VTRLSPDDLIAQ 294 T+L ++++ Sbjct: 265 ATKLPLEEILEM 276 >UniRef50_C0DB21 Putative uncharacterized protein n=2 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DB21_9CLOT Length = 328 Score = 72.6 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 83/292 (28%), Gaps = 45/292 (15%) Query: 16 SFLRHPDTARDFIDIHL--PAPLRKLCDLTTLKLEPNSFIDEDLR-----QYYSDLLWSV 68 L P D + L K DL +K + + D+ + Sbjct: 10 KLLSDPVYFSDLCNGVLFRGEMYLKPEDLMPVKGSQGVLYADRKGVKKVLERRRDVA--M 67 Query: 69 KTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDAGYK---------------- 112 + + G Y + +E+Q+ M R + Y + + K Sbjct: 68 RLKSGTRYAVIAVENQANIHYAMVIRSLLYDALDYTDQVQIQEKELRQAGRRPSGDGFLS 127 Query: 113 ------ELPLVLPMLFYHGCRSPYPYSLCWLDEF-------AEPAIARKIYSSAFPLVDI 159 L V+ ++ Y G + S + P +A I LV+ Sbjct: 128 GVGPKLRLEPVVTLVLYWGS-GHWDGSTSLHELLGLKDGKGEAPELAGYIPDYRLNLVNA 186 Query: 160 TVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVLQT 219 + D I + + +++ + L + + L + + + Sbjct: 187 ANMDDPSIFRTHLQQIFSMLKYKSDKAALYRYAQENRTELRDMDGTAK-----LALLSMM 241 Query: 220 GDAQRFRAFIGEIAERAP-QEKEKLMTIADRLREEGAMQGKHEEALRIAQEM 270 G+ +R + + E + + + G +G + R +++ Sbjct: 242 GEQKRLQKIMEEAEGEEEFDMCKAIDDLIADGESRGFERGDRQGFERGERQL 293 >UniRef50_UPI0001C366FA hypothetical protein ChatD1_09620 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366FA Length = 342 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 37/323 (11%), Positives = 87/323 (26%), Gaps = 46/323 (14%) Query: 11 DAVFKSFLRHPDTARDFIDI--HLPAPLRKLCDLTTLKLEPNSFIDEDLR-----QYYSD 63 D K L DFI++ + L+ L E + + Q D Sbjct: 8 DYYMKILLEDRARFADFINVNVFHGKQVLAADKLSLLPNEAGIVVVDADGVKRTIQRRRD 67 Query: 64 LLWSVKTQEGVGYIYVVIEHQSKPEELMAFRMMRYSIAAMQNHLDA-------------- 109 ++ + G + V E+Q K MA R M Y + Sbjct: 68 VVMKAEF--GAYFCVVASENQGKVHYGMAVREMMYDALDYTEQIRKIEEKHRAEGDKLEG 125 Query: 110 --------GYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIA------RKIYSSAFP 155 L V+ + Y+G + + + + + Sbjct: 126 ADFLSHVTKADRLIPVVTLTLYYGNEAWDGPRSLYEMMGIDEEWEETALVKKCLPDYKIN 185 Query: 156 LVDITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNY 215 L+DI + + + L++ + ++ L + + + Sbjct: 186 LIDIREGEKLDQYKTSLQHVFGLVKYNKNKQKLYEYTRVHREEINRMDRESKAAA----- 240 Query: 216 VLQTGDAQRFRAFIGEIAERAPQEKEKLMTIADRLREEGAMQGK----HEEALRIAQEML 271 + G+ +R + + E + + + G ++G + + ++ Sbjct: 241 LALIGEQKRLQKILESKREEEMDMCQAIDELIADGEVRGEVRGILMGMEKTKINFIRKQY 300 Query: 272 DRGLDRELVMMVTRLSPDDLIAQ 294 + L + + L + Sbjct: 301 KKQLSSSQIANILDLDERYVEKV 323 >UniRef50_C9LT45 Putative uncharacterized protein n=2 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LT45_9FIRM Length = 374 Score = 72.2 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 34/266 (12%), Positives = 76/266 (28%), Gaps = 37/266 (13%) Query: 50 NSFIDEDLRQYYSDLLW--SVKTQEGVGYIYVVIEHQSKPE-----ELMAFRMMRYSIAA 102 I D+L+ V + + +E Q + + R + Y+ Sbjct: 118 TENIGITEGWVRFDILFHARVPQSGERITLIINVEAQRTQKRAKLGYALLRRAVYYASRL 177 Query: 103 MQNHLD-----AGYKELPLVLPMLFYHGCRSPYPYSLCWLDEFAEPAIARKIYSSAFPLV 157 + + + + Y E+ V + P I Sbjct: 178 ISSQKETEFTGSSYDEIKKVYSIWL----------------CMDSPDGRSAINRYDLAEH 221 Query: 158 DITVVPDDEIMQHRKMALLELIQKHIRQRDLLGLVDQIVSLLVTGNTNDRQLKALFNYVL 217 I + + M+++ + + RQ+D L+ + L + K L Sbjct: 222 HILHHHKGKRADYDLMSIITIYLGNERQQDEDWLIRFLQILFKDMEISPAAKKQLLKNEF 281 Query: 218 QTGDAQRFRAFIGEI---------AERAPQEKEKLMTIADRLREEGAMQGKHEEALRIAQ 268 + + + + + +R E G +G+ E + I Sbjct: 282 DMDISADIEEEMRTMCNLSTGIYEQGMERGMERGMERGMERGMERGMERGREEGKVDIVL 341 Query: 269 EMLDRGLDRELVMMVTRLSPDDLIAQ 294 EML L E++ +++ S + + Sbjct: 342 EMLRNKLPLEMIASMSKFSLEKVKEL 367 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.120 0.289 Lambda K H 0.267 0.0371 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,457,219,731 Number of Sequences: 3077464 Number of extensions: 54731466 Number of successful extensions: 245460 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 846 Number of HSP's successfully gapped in prelim test: 767 Number of HSP's that attempted gapping in prelim test: 240392 Number of HSP's gapped (non-prelim): 2771 length of query: 296 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 168 effective length of database: 646,480,964 effective search space: 108608801952 effective search space used: 108608801952 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 92 (40.2 bits)