BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (384 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P37007 Uncharacterized protein yagA n=15 Tax=Bacteria R... 798 0.0 UniRef50_B0T7X0 Integrase catalytic region n=15 Tax=Alphaproteob... 346 6e-94 UniRef50_B6J0H9 Transposase n=3 Tax=Coxiella burnetii RepID=B6J0... 251 3e-65 UniRef50_A9ER25 Transposase n=4 Tax=Sorangium cellulosum 'So ce ... 218 4e-55 UniRef50_D1TPC8 Putative transposase integrase n=1 Tax=Burkholde... 215 3e-54 UniRef50_UPI00017F3A47 integrase catalytic subunit n=1 Tax=Esche... 204 5e-51 UniRef50_D2LAL9 Integrase catalytic region n=1 Tax=Desulfovibrio... 197 8e-49 UniRef50_C6AZP5 Integrase catalytic region n=3 Tax=Rhizobium leg... 195 2e-48 UniRef50_B8GMF2 Integrase catalytic region n=2 Tax=Gammaproteoba... 192 2e-47 UniRef50_A4WDP4 Integrase, catalytic region n=4 Tax=Enterobacter... 185 2e-45 UniRef50_Q01QQ4 Integrase, catalytic region n=9 Tax=Bacteria Rep... 184 4e-45 UniRef50_B4UMG9 Integrase catalytic region n=3 Tax=Bacteria RepI... 184 5e-45 UniRef50_Q07SS7 Integrase, catalytic region n=5 Tax=Alphaproteob... 179 2e-43 UniRef50_D2ML31 Integrase, catalytic region (Fragment) n=1 Tax=C... 178 3e-43 UniRef50_C3K093 Putative integrase n=2 Tax=Pseudomonas fluoresce... 178 3e-43 UniRef50_UPI0001913B70 CP4-6 prophage; DNA-binding transcription... 177 5e-43 UniRef50_Q5ZTP2 Transposase (ISmav2) n=14 Tax=Proteobacteria Rep... 177 8e-43 UniRef50_C3MF40 Integrase catalytic core domain protein n=3 Tax=... 171 6e-41 UniRef50_A9BRN3 Integrase catalytic region n=7 Tax=Proteobacteri... 170 8e-41 UniRef50_UPI00019025E5 ISHne2, transposase n=1 Tax=Rhizobium etl... 149 1e-34 UniRef50_Q82H05 Putative IS481 family ISMav2-like transposase n=... 147 5e-34 UniRef50_B1ZP18 Integrase catalytic region n=1 Tax=Opitutus terr... 146 1e-33 UniRef50_A1T2L4 Integrase, catalytic region n=4 Tax=Actinomyceta... 144 4e-33 UniRef50_B2HR82 Transposase for ISMyma05 n=5 Tax=Mycobacterium R... 137 6e-31 UniRef50_UPI0001B540A0 transposase for IS3514a n=1 Tax=Streptomy... 132 3e-29 UniRef50_UPI0001B45627 transposase for ISMyma05 n=1 Tax=Mycobact... 130 6e-29 UniRef50_C8XFB0 Integrase catalytic region n=4 Tax=Actinomycetal... 127 1e-27 UniRef50_Q4JUH9 Transposase for IS3514b n=9 Tax=Corynebacterium ... 123 1e-26 UniRef50_UPI0001B453C4 transposase for IS3514a n=1 Tax=Mycobacte... 121 4e-26 UniRef50_Q4JWW8 Transposase for IS3511a n=7 Tax=Corynebacterium ... 120 8e-26 UniRef50_C8NDJ4 ISHne2 transposase (Fragment) n=1 Tax=Cardiobact... 115 3e-24 UniRef50_A1UAJ3 Integrase, catalytic region n=7 Tax=Actinomyceta... 109 2e-22 UniRef50_C7MHU2 Integrase family protein n=3 Tax=Actinomycetales... 103 9e-21 UniRef50_D1VRH7 Integrase catalytic region n=1 Tax=Frankia sp. E... 103 1e-20 UniRef50_A3TPQ6 Transposase n=7 Tax=Actinomycetales RepID=A3TPQ6... 102 2e-20 UniRef50_UPI0001C30E87 Integrase catalytic region n=1 Tax=Conexi... 101 4e-20 UniRef50_A3J543 Helix-turn-helix, Fis-type protein n=6 Tax=Bacte... 100 7e-20 UniRef50_C1DTA7 Putative transposase n=2 Tax=Sulfurihydrogenibiu... 100 7e-20 UniRef50_A0JV34 Integrase, catalytic region n=12 Tax=Actinomycet... 100 1e-19 UniRef50_A6DU50 Putative ISmav2-like transposase n=1 Tax=Lentisp... 98 4e-19 UniRef50_Q3A8V0 ISChy3, transposase n=5 Tax=Clostridia RepID=Q3A... 94 6e-18 UniRef50_Q3SW20 Helix-turn-helix, Fis-type n=112 Tax=Bacteria Re... 94 1e-17 UniRef50_C2GDW5 IS3514a transposase n=1 Tax=Corynebacterium gluc... 92 4e-17 UniRef50_C1F5F4 IS3 family transposase orfB n=1 Tax=Acidobacteri... 91 8e-17 UniRef50_C7RJ38 Integrase catalytic region n=5 Tax=Proteobacteri... 89 3e-16 UniRef50_D1TPC7 Putative transposase integrase n=1 Tax=Burkholde... 88 5e-16 UniRef50_C1F9W4 ISAca1, transposase n=2 Tax=Acidobacterium capsu... 88 6e-16 UniRef50_B1KCL6 Integrase catalytic region n=100 Tax=Proteobacte... 87 7e-16 UniRef50_A6V7Q6 Transposase n=92 Tax=Bacteria RepID=A6V7Q6_PSEA7 86 2e-15 UniRef50_Q12FI2 Integrase, catalytic region n=28 Tax=Proteobacte... 85 3e-15 UniRef50_C8PWM8 Transposase B n=2 Tax=Enhydrobacter aerosaccus S... 85 6e-15 UniRef50_B4E5J2 Transposase n=21 Tax=Proteobacteria RepID=B4E5J2... 84 8e-15 UniRef50_A9G353 Putative transposase n=1 Tax=Sorangium cellulosu... 84 1e-14 UniRef50_Q8XPL1 Isrso16-transposase orfb protein n=2 Tax=Ralston... 84 1e-14 UniRef50_C8X9D2 Integrase catalytic region n=6 Tax=Actinomycetal... 84 1e-14 UniRef50_C5CAM0 Transposase n=26 Tax=Actinomycetales RepID=C5CAM... 82 3e-14 UniRef50_C4KRZ4 Integrase core domain protein n=59 Tax=Proteobac... 82 3e-14 UniRef50_C1F6R2 ISAca4, transposase orfB n=1 Tax=Acidobacterium ... 82 4e-14 UniRef50_A6BYW2 Integrase, catalytic region n=1 Tax=Planctomyces... 80 9e-14 UniRef50_B0STB8 Putative transposase n=1 Tax=Leptospira biflexa ... 80 1e-13 UniRef50_A4JLW8 Integrase, catalytic region n=9 Tax=Proteobacter... 80 1e-13 UniRef50_Q92X98 Putative transposase protein n=1 Tax=Sinorhizobi... 79 3e-13 UniRef50_B1K7U4 Integrase catalytic region n=7 Tax=Bacteria RepI... 79 3e-13 UniRef50_A5GAT8 Integrase, catalytic region n=1 Tax=Geobacter ur... 79 3e-13 UniRef50_B4S6V0 Integrase catalytic region n=10 Tax=Bacteria Rep... 78 7e-13 UniRef50_C7NJB3 Integrase family protein n=3 Tax=Actinomycetales... 77 1e-12 UniRef50_Q1GCB4 Integrase catalytic region n=29 Tax=Alphaproteob... 77 1e-12 UniRef50_B4RA95 Transposase, IS1477 n=35 Tax=Proteobacteria RepI... 77 1e-12 UniRef50_B0UC72 Integrase catalytic region n=4 Tax=Alphaproteoba... 76 3e-12 UniRef50_C5D6W5 Integrase catalytic region n=19 Tax=Firmicutes R... 75 4e-12 UniRef50_B0TDR5 Transposase, putative n=5 Tax=Firmicutes RepID=B... 75 4e-12 UniRef50_UPI0001B511C3 integrase n=1 Tax=Streptomyces hygroscopi... 75 5e-12 UniRef50_Q2Y8D0 Integrase, catalytic region n=1 Tax=Nitrosospira... 74 6e-12 UniRef50_B3E8B6 Integrase catalytic region n=1 Tax=Geobacter lov... 73 2e-11 UniRef50_C6VW29 Integrase catalytic region n=2 Tax=Sphingobacter... 73 2e-11 UniRef50_B8J8P0 Integrase catalytic region n=1 Tax=Anaeromyxobac... 72 2e-11 UniRef50_A3JW74 Transposase n=12 Tax=Proteobacteria RepID=A3JW74... 72 3e-11 UniRef50_B9XMR2 Putative uncharacterized protein n=4 Tax=bacteri... 71 7e-11 UniRef50_A8HUC5 Transposase n=2 Tax=Alphaproteobacteria RepID=A8... 70 9e-11 UniRef50_C6PFD7 Integrase catalytic region n=3 Tax=Thermoanaerob... 70 1e-10 UniRef50_Q8NL32 Predicted transposase n=7 Tax=Corynebacterium Re... 70 1e-10 UniRef50_C0QA21 Transposase /integrase family protein n=1 Tax=De... 70 2e-10 UniRef50_C1A8I3 Putative transposase orfB for insertion sequence... 70 2e-10 UniRef50_A1JLT7 Transposase for insertion element IS1222 n=8 Tax... 69 2e-10 UniRef50_A4TG41 Integrase, catalytic region n=32 Tax=Actinomycet... 69 2e-10 UniRef50_A3ZSC8 Transposase orfB n=1 Tax=Blastopirellula marina ... 69 3e-10 UniRef50_A0AXB8 Integrase, catalytic region n=27 Tax=Betaproteob... 69 3e-10 UniRef50_A2DEY0 Integrase core domain containing protein n=11 Ta... 69 4e-10 UniRef50_Q1NW03 Integrase, catalytic region n=7 Tax=Proteobacter... 68 5e-10 UniRef50_Q8PGV8 ISxac4 transposase n=3 Tax=Xanthomonas axonopodi... 67 7e-10 UniRef50_O28862 ISA0963-5, putative transposase n=5 Tax=Archaeog... 67 9e-10 UniRef50_D2MKS7 Transposase (Fragment) n=3 Tax=Candidatus Poriba... 67 9e-10 UniRef50_Q1DAH7 Transposase orfB, IS3 family n=29 Tax=Proteobact... 67 1e-09 UniRef50_A1UD36 Integrase, catalytic region n=28 Tax=Actinomycet... 67 1e-09 UniRef50_B3PKR5 Transposase n=7 Tax=Gammaproteobacteria RepID=B3... 67 2e-09 UniRef50_B8ER74 Integrase catalytic region n=103 Tax=Bacteria Re... 67 2e-09 UniRef50_A3YV04 Transposase n=3 Tax=Synechococcus sp. WH 5701 Re... 66 2e-09 UniRef50_C3LLF8 IS1627, transposase n=27 Tax=Bacillaceae RepID=C... 66 2e-09 UniRef50_C1F2E9 IS3 family transposase orfB n=1 Tax=Acidobacteri... 66 3e-09 UniRef50_A3PPM4 Integrase, catalytic region n=5 Tax=Rhodobactera... 65 3e-09 UniRef50_C0WLI3 Transposase n=14 Tax=Corynebacterium RepID=C0WLI... 65 3e-09 UniRef50_Q2CG00 Integrase, catalytic domain n=8 Tax=Rhodobactera... 65 3e-09 UniRef50_B4RV10 Integrase, catalytic region n=16 Tax=Proteobacte... 65 3e-09 UniRef50_Q1BK79 Integrase, catalytic region n=37 Tax=Proteobacte... 65 4e-09 UniRef50_C1FA08 Integrase core domain protein n=1 Tax=Acidobacte... 65 5e-09 UniRef50_B0NHH2 Putative uncharacterized protein (Fragment) n=2 ... 65 6e-09 UniRef50_Q4FQT2 Transposase OrfB n=179 Tax=Bacteria RepID=Q4FQT2... 64 7e-09 UniRef50_A7B7Y8 Putative uncharacterized protein n=4 Tax=Clostri... 64 1e-08 UniRef50_A1WCB6 Integrase, catalytic region n=2 Tax=Burkholderia... 64 1e-08 UniRef50_C6BTX7 Integrase catalytic region n=88 Tax=Bacteria Rep... 64 1e-08 UniRef50_A9B8L4 Integrase catalytic region n=5 Tax=Herpetosiphon... 63 2e-08 UniRef50_A4LGI6 YD repeat protein n=14 Tax=Proteobacteria RepID=... 63 2e-08 UniRef50_Q64B23 Transposase n=1 Tax=uncultured archaeon GZfos27G... 62 3e-08 UniRef50_UPI0000E49FEF PREDICTED: similar to LReO_3 n=2 Tax=Stro... 61 6e-08 UniRef50_UPI0001925317 PREDICTED: similar to COS41.3 n=1 Tax=Hyd... 61 8e-08 UniRef50_C4URW7 Integrase n=14 Tax=Proteobacteria RepID=C4URW7_Y... 60 1e-07 UniRef50_A4A249 Transposase orfB n=4 Tax=Planctomycetaceae RepID... 60 1e-07 UniRef50_D1K5D0 Transposase n=1 Tax=Bacteroides sp. 3_1_33FAA Re... 60 1e-07 UniRef50_Q9FZN9 Retroelement pol polyprotein-like n=10 Tax=Arabi... 60 1e-07 UniRef50_C2GFW3 Possible transposase n=1 Tax=Corynebacterium glu... 60 1e-07 UniRef50_A1R4J8 ISAau1, transposase orfB n=3 Tax=Actinomycetales... 60 1e-07 UniRef50_Q0P7I8 IS1400 transposase B n=231 Tax=Bacteria RepID=Q0... 60 2e-07 UniRef50_B1LE64 Putative uncharacterized protein n=1 Tax=Escheri... 60 2e-07 UniRef50_Q04ND9 Putative uncharacterized protein n=3 Tax=Leptosp... 60 2e-07 UniRef50_Q2AA50 Retrotransposon gag protein n=6 Tax=Asparagus of... 60 2e-07 UniRef50_B8KLM8 Integrase, catalytic region n=2 Tax=gamma proteo... 59 2e-07 UniRef50_C1XPR1 Transcriptional regulator/sugar kinase n=14 Tax=... 59 2e-07 UniRef50_C5CE17 Integrase catalytic region n=3 Tax=Kosmotoga ole... 59 2e-07 UniRef50_P24577 Insertion element IS407 uncharacterized 31.7 kDa... 59 3e-07 UniRef50_Q3BT31 Transposase n=22 Tax=Bacteria RepID=Q3BT31_XANC5 59 3e-07 UniRef50_UPI0001924F80 PREDICTED: similar to COS41.3, partial n=... 59 3e-07 UniRef50_UPI0001924F1E PREDICTED: similar to COS41.3 n=2 Tax=Hyd... 59 4e-07 UniRef50_C4UEN4 Transposase n=1 Tax=Yersinia aldovae ATCC 35236 ... 59 4e-07 UniRef50_B8FAB2 Integrase catalytic region n=70 Tax=Bacteria Rep... 59 4e-07 UniRef50_UPI00015B43F4 PREDICTED: similar to pol polyprotein n=1... 59 4e-07 UniRef50_UPI00005104D7 transposase n=1 Tax=Brevibacterium linens... 58 5e-07 UniRef50_P51517 Integrase n=76 Tax=root RepID=POL_SRV2 58 6e-07 UniRef50_A1VJC3 Integrase, catalytic region n=25 Tax=Bacteria Re... 58 6e-07 UniRef50_UPI0001792303 PREDICTED: similar to zinc finger protein... 58 6e-07 UniRef50_C4V4D7 Transposase n=5 Tax=Clostridiales RepID=C4V4D7_9... 58 7e-07 UniRef50_A5D1X6 Transposase and inactivated derivatives n=2 Tax=... 57 8e-07 UniRef50_Q39TE2 Putative uncharacterized protein n=2 Tax=Geobact... 57 8e-07 UniRef50_A5BQ80 Putative uncharacterized protein n=1 Tax=Vitis v... 57 9e-07 UniRef50_P63135 Integrase n=404 Tax=root RepID=POK12_HUMAN 57 9e-07 UniRef50_A3DCZ2 Integrase, catalytic region n=10 Tax=Clostridium... 57 1e-06 UniRef50_A5BFS9 Putative uncharacterized protein n=9 Tax=Vitis v... 57 1e-06 UniRef50_UPI0001986237 PREDICTED: hypothetical protein n=1 Tax=V... 57 1e-06 UniRef50_A5BJN2 Putative uncharacterized protein n=5 Tax=Vitis v... 57 1e-06 UniRef50_A3PLB1 Integrase, catalytic region n=59 Tax=Proteobacte... 57 2e-06 UniRef50_A5B9R1 Putative uncharacterized protein n=10 Tax=Vitis ... 57 2e-06 UniRef50_Q3A4V8 Transposase and inactivated derivatives n=9 Tax=... 57 2e-06 UniRef50_C0VUG3 Transposase n=2 Tax=Corynebacterium glucuronolyt... 57 2e-06 UniRef50_UPI0001791F50 PREDICTED: similar to putative gag-pol pr... 56 2e-06 UniRef50_A5BPP5 Putative uncharacterized protein n=1 Tax=Vitis v... 56 2e-06 UniRef50_Q1M9G1 Putative transposase-related protein n=2 Tax=Rhi... 56 2e-06 UniRef50_A5CA04 Putative uncharacterized protein n=3 Tax=Vitis v... 56 2e-06 UniRef50_A5AWA7 Putative uncharacterized protein n=6 Tax=Vitis v... 56 2e-06 UniRef50_UPI000050FEEF transposase n=1 Tax=Brevibacterium linens... 56 3e-06 UniRef50_A9LH60 Integrase n=1 Tax=uncultured planctomycete 13FN ... 56 3e-06 UniRef50_A5BI07 Putative uncharacterized protein n=7 Tax=Vitis v... 56 3e-06 UniRef50_A5ASA6 Putative uncharacterized protein n=2 Tax=Vitis v... 56 3e-06 UniRef50_Q3I6J4 Pol-polyprotein n=4 Tax=Eukaryota RepID=Q3I6J4_S... 55 3e-06 UniRef50_UPI0001792682 PREDICTED: similar to putative gag-pol pr... 55 3e-06 UniRef50_A5BJP7 Putative uncharacterized protein n=6 Tax=Vitis v... 55 3e-06 UniRef50_B1FI87 Integrase, catalytic region n=1 Tax=Burkholderia... 55 3e-06 UniRef50_A5C2R0 Putative uncharacterized protein n=10 Tax=Vitis ... 55 3e-06 UniRef50_A5AH70 Putative uncharacterized protein n=20 Tax=Vitis ... 55 3e-06 UniRef50_A5C623 Putative uncharacterized protein n=4 Tax=Vitis v... 55 4e-06 UniRef50_A5BXG2 Putative uncharacterized protein n=1 Tax=Vitis v... 55 4e-06 UniRef50_A5B5G8 Putative uncharacterized protein n=2 Tax=Vitis v... 55 4e-06 UniRef50_A5ASD2 Putative uncharacterized protein n=7 Tax=Vitis v... 55 4e-06 UniRef50_A5AHC2 Putative uncharacterized protein n=3 Tax=Vitis v... 55 4e-06 UniRef50_A5BKJ4 Putative uncharacterized protein n=2 Tax=Vitis v... 55 4e-06 UniRef50_C7C202 Gag-Pol polyprotein n=1 Tax=Schistosoma japonicu... 55 5e-06 UniRef50_A5CBG5 Putative uncharacterized protein n=4 Tax=Vitis v... 55 5e-06 UniRef50_UPI0001927064 PREDICTED: similar to vascular endothelia... 55 5e-06 UniRef50_A5BTM1 Putative uncharacterized protein n=31 Tax=Vitis ... 55 5e-06 UniRef50_D1PR45 Transposase n=1 Tax=Subdoligranulum variabile DS... 55 6e-06 UniRef50_A5BI69 Putative uncharacterized protein n=1 Tax=Vitis v... 55 6e-06 UniRef50_B0RNN4 ISxcc1 transposase ORFB, n=4 Tax=Xanthomonas Rep... 55 6e-06 UniRef50_A5BT93 Putative uncharacterized protein n=1 Tax=Vitis v... 55 6e-06 UniRef50_A5C4R5 Putative uncharacterized protein n=1 Tax=Vitis v... 54 7e-06 UniRef50_A5B2X9 Putative uncharacterized protein n=4 Tax=Vitis v... 54 7e-06 UniRef50_UPI00017B5545 UPI00017B5545 related cluster n=4 Tax=Tet... 54 8e-06 UniRef50_Q0ZCB7 Integrase n=4 Tax=Eukaryota RepID=Q0ZCB7_POPTR 54 8e-06 UniRef50_UPI00015B4786 PREDICTED: hypothetical protein n=2 Tax=N... 54 8e-06 UniRef50_A5AKZ0 Putative uncharacterized protein n=18 Tax=Vitis ... 54 8e-06 UniRef50_A5AYT6 Putative uncharacterized protein n=3 Tax=Vitis v... 54 8e-06 UniRef50_P25438 Insertion element IS476 uncharacterized 39.2 kDa... 54 8e-06 UniRef50_B2Q343 Putative uncharacterized protein n=1 Tax=Provide... 54 9e-06 UniRef50_C4FKG6 Integrase, catalytic region n=1 Tax=Sulfurihydro... 54 9e-06 UniRef50_A5CBC9 Putative uncharacterized protein n=1 Tax=Vitis v... 54 9e-06 UniRef50_A1UPZ6 Histidine kinase n=2 Tax=Mycobacterium RepID=A1U... 54 1e-05 UniRef50_A4J392 Integrase, catalytic region n=22 Tax=Clostridia ... 54 1e-05 UniRef50_Q2W8I9 Transposase and inactivated derivative n=89 Tax=... 54 1e-05 UniRef50_C7MBS5 Transposase n=2 Tax=Micrococcineae RepID=C7MBS5_... 54 1e-05 UniRef50_B0SFL5 Transposase n=2 Tax=Leptospira biflexa serovar P... 54 1e-05 UniRef50_A5AWF5 Putative uncharacterized protein n=16 Tax=Vitis ... 54 1e-05 UniRef50_A5AKV0 Putative uncharacterized protein n=16 Tax=Vitis ... 54 1e-05 UniRef50_P31623 Integrase n=24 Tax=root RepID=POL_JSRV 54 1e-05 UniRef50_UPI0000E49205 PREDICTED: similar to novel transposon n=... 53 1e-05 UniRef50_UPI0000F1E4F0 PREDICTED: similar to LReO_3 n=1 Tax=Dani... 53 2e-05 UniRef50_Q9LHC0 Retroelement pol polyprotein-like n=440 Tax=Sper... 53 2e-05 UniRef50_A5AQ03 Putative uncharacterized protein n=5 Tax=Vitis v... 53 2e-05 UniRef50_A5G4C5 Putative uncharacterized protein n=1 Tax=Geobact... 53 2e-05 UniRef50_UPI000038392B COG2801: Transposase and inactivated deri... 53 2e-05 UniRef50_A5C1P8 Putative uncharacterized protein n=4 Tax=Vitis v... 53 2e-05 UniRef50_A0L7D7 Integrase, catalytic region n=5 Tax=Bacteria Rep... 53 2e-05 UniRef50_A4WYC8 Putative uncharacterized protein n=2 Tax=Rhodoba... 53 2e-05 UniRef50_A1V109 A, transposase OrfB n=56 Tax=Proteobacteria RepI... 53 2e-05 UniRef50_UPI0001B416F4 ISA0963-5 transposase n=6 Tax=Ferroplasma... 53 2e-05 UniRef50_A5BAC6 Putative uncharacterized protein n=1 Tax=Vitis v... 53 2e-05 UniRef50_A6Q4E4 Transposase n=2 Tax=Nitratiruptor sp. SB155-2 Re... 53 2e-05 UniRef50_A5AY91 Putative uncharacterized protein n=1 Tax=Vitis v... 53 2e-05 UniRef50_Q1N8F6 Transposase n=2 Tax=Sphingomonas RepID=Q1N8F6_9SPHN 53 2e-05 UniRef50_Q2ILP8 Integrase n=2 Tax=Anaeromyxobacter RepID=Q2ILP8_... 53 2e-05 UniRef50_Q0ZCC0 Gag protein n=2 Tax=Populus trichocarpa RepID=Q0... 53 2e-05 UniRef50_UPI00016E16D4 UPI00016E16D4 related cluster n=2 Tax=Tak... 52 3e-05 UniRef50_A5C4S0 Putative uncharacterized protein n=1 Tax=Vitis v... 52 3e-05 UniRef50_A5ACN5 Putative uncharacterized protein n=7 Tax=Vitis v... 52 3e-05 UniRef50_UPI000192724E PREDICTED: similar to RETRotransposon-lik... 52 3e-05 UniRef50_A0P341 Transposase n=5 Tax=Alphaproteobacteria RepID=A0... 52 3e-05 UniRef50_A2EZM6 Integrase core domain containing protein n=19 Ta... 52 3e-05 UniRef50_UPI00015B47AA PREDICTED: similar to pol polyprotein n=1... 52 3e-05 UniRef50_UPI000179C74F UPI000179C74F related cluster n=8 Tax=Bos... 52 3e-05 UniRef50_A5BY78 Putative uncharacterized protein n=2 Tax=Vitis v... 52 3e-05 UniRef50_A5AIU0 Putative uncharacterized protein n=2 Tax=Vitis v... 52 3e-05 UniRef50_A5BWY6 Putative uncharacterized protein n=3 Tax=Vitis v... 52 3e-05 UniRef50_C7C200 Gag-Pol polyprotein n=2 Tax=Schistosoma japonicu... 52 4e-05 UniRef50_C2BWQ2 IS21 family transposase n=1 Tax=Mobiluncus curti... 52 4e-05 UniRef50_P03365 Integrase n=46 Tax=root RepID=POL_MMTVB 52 4e-05 UniRef50_UPI00017615A3 PREDICTED: similar to Os07g0444200 n=2 Ta... 52 4e-05 UniRef50_D1QR90 ISPg5, transposase n=4 Tax=Bacteroidales RepID=D... 52 4e-05 UniRef50_A5BG34 Putative uncharacterized protein n=2 Tax=Vitis v... 52 4e-05 UniRef50_A5C050 Putative uncharacterized protein n=32 Tax=Vitis ... 52 4e-05 UniRef50_A5BFP8 Putative uncharacterized protein n=4 Tax=Vitis v... 52 4e-05 UniRef50_Q9SHM3 F7F22.17 n=4 Tax=Arabidopsis thaliana RepID=Q9SH... 52 4e-05 UniRef50_UPI000179ECC6 UPI000179ECC6 related cluster n=2 Tax=Bos... 52 5e-05 UniRef50_UPI000179E089 UPI000179E089 related cluster n=3 Tax=Bos... 52 5e-05 UniRef50_B9K5X7 IS3 family transposase n=7 Tax=Proteobacteria Re... 52 5e-05 UniRef50_A5BWH5 Putative uncharacterized protein n=17 Tax=Vitis ... 52 5e-05 UniRef50_B1ZXZ1 Integrase catalytic region n=2 Tax=Verrucomicrob... 51 6e-05 UniRef50_A4G2L6 Transposase IS3 family, part 2 n=17 Tax=Bacteria... 51 6e-05 UniRef50_UPI000180AEE9 PREDICTED: similar to pumilio 2 n=1 Tax=C... 51 6e-05 UniRef50_A5B7N0 Putative uncharacterized protein n=17 Tax=Vitis ... 51 7e-05 UniRef50_A5B346 Putative uncharacterized protein n=2 Tax=Vitis v... 51 7e-05 UniRef50_B7ZFS7 Polyprotein n=2 Tax=Eukaryota RepID=B7ZFS7_9METZ 51 7e-05 UniRef50_C5PML6 Transposase OrfB n=1 Tax=Sphingobacterium spirit... 51 8e-05 UniRef50_UPI0001793640 PREDICTED: similar to SD02026p, partial n... 51 8e-05 UniRef50_A5BPW1 Putative uncharacterized protein n=3 Tax=Vitis v... 51 8e-05 >UniRef50_P37007 Uncharacterized protein yagA n=15 Tax=Bacteria RepID=YAGA_ECOLI Length = 384 Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust. Identities = 384/384 (100%), Positives = 384/384 (100%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ Sbjct: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH Sbjct: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 Query: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT Sbjct: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH Sbjct: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ Sbjct: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE Sbjct: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 Query: 361 VWWYSTKVGVIDLKKKSITMGKGC 384 VWWYSTKVGVIDLKKKSITMGKGC Sbjct: 361 VWWYSTKVGVIDLKKKSITMGKGC 384 >UniRef50_B0T7X0 Integrase catalytic region n=15 Tax=Alphaproteobacteria RepID=B0T7X0_CAUSK Length = 400 Score = 346 bits (888), Expect = 6e-94, Method: Compositional matrix adjust. Identities = 183/352 (51%), Positives = 227/352 (64%), Gaps = 7/352 (1%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R EFV A +GAN R LCRRFGISP GYKWL R ++ G L DR R Sbjct: 1 MPWREVSVMEQRREFVRLARLEGANRRELCRRFGISPEVGYKWLAR-SKAGDEALADRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP 124 PH+SP RS+ +I A + D H WGARKI WLED+G PA ST+H ++ RHG + Sbjct: 60 RPHNSPWRSAAEIEAAVLAVRDAHPAWGARKIGAWLEDRGVDPPAVSTIHAILRRHGRID 119 Query: 125 G--ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAHCTD 181 SPG A RFE PN+LWQMDFKG F G+ CHPLT++DDHSR S CL C D Sbjct: 120 DFPTSPGK-AWRRFEKAEPNQLWQMDFKGWFRLSSGQPCHPLTIVDDHSRLSPCLKACAD 178 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGIRVGHSRPYH 240 ++ +TV+ L + F RYGLP +DNG PWG+ +G WT LE+WL++LG+ V HSRPYH Sbjct: 179 QQGQTVRPHLEAAFRRYGLPLAFFVDNGPPWGEPSGERWTRLEVWLLKLGVDVLHSRPYH 238 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQ++GK+ERFHRSL AEVL + F ++QRAFD WR VYN ERPHEALD+ P +RYQ Sbjct: 239 PQSRGKIERFHRSLAAEVLDLQRFDSFAQVQRAFDRWREVYNFERPHEALDLDCPANRYQ 298 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDIS-GKLSVKGVSLSAGKAFRGERVGLK 351 PS R + P YD G ++R V + + KG KAF+GER+ L+ Sbjct: 299 PSPRAMPDHPPEPRYDSGEILRTVSTTKAYVRFKGRLWRVPKAFQGERLALR 350 >UniRef50_B6J0H9 Transposase n=3 Tax=Coxiella burnetii RepID=B6J0H9_COXB2 Length = 317 Score = 251 bits (641), Expect = 3e-65, Method: Compositional matrix adjust. Identities = 129/281 (45%), Positives = 170/281 (60%), Gaps = 3/281 (1%) Query: 98 RWLEDQGHTMPAFSTVHNLMARHG-LLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFG 156 R + +G+ MP TV+ ++ R+G + S RFEH+ PN LWQMDFKGHF Sbjct: 33 RNIVKKGYIMPCIKTVNRILKRYGRITIEESLKRKKFIRFEHEHPNDLWQMDFKGHFRLT 92 Query: 157 GG-RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT 215 RCHPLTLLDD +R+SL + C DER ETV+Q L+ +F ++GLP RMTMDNG+PWG + Sbjct: 93 NKIRCHPLTLLDDCTRYSLGIIACGDERLETVKQALIDIFRKWGLPKRMTMDNGAPWGYS 152 Query: 216 -TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAF 274 + +T L +WL++ I V HSRPYHPQTQGKLERFHR+ K E L +F + Q+ F Sbjct: 153 GSQNYTQLTVWLIQQTIYVSHSRPYHPQTQGKLERFHRTFKQEFLNRYYFDTLAQAQKVF 212 Query: 275 DHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKG 334 D WR YN ERPH A++ P Y S R Y P EY + VRKV+ G +S KG Sbjct: 213 DWWRDFYNDERPHSAIEAYSPSEIYHRSERSYCEKIQPYEYATEMDVRKVNQKGIMSYKG 272 Query: 335 VSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKK 375 G+AF G+ +GL E+ V++ KV +DL + Sbjct: 273 RRYFVGEAFGGQAMGLMPSNENDIVNVYFCHQKVFKLDLNQ 313 >UniRef50_A9ER25 Transposase n=4 Tax=Sorangium cellulosum 'So ce 56' RepID=A9ER25_SORC5 Length = 387 Score = 218 bits (554), Expect = 4e-55, Method: Compositional matrix adjust. Identities = 134/394 (34%), Positives = 196/394 (49%), Gaps = 25/394 (6%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW ++ R F+ ++ LCRRFGIS TGYKW++R+ Q G +GL++R Sbjct: 1 MPWKETCSVDERLRFIAQVNESDETFAELCRRFGISRKTGYKWVERYEQAGPSGLEERRP 60 Query: 65 IPHHSPNRSSDDIT-ALLRMAHDRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGL 122 + H P+ + + AL+ + +R WG +K++ LE G +PA ST+ L+ +HGL Sbjct: 61 VAHTFPHATPTVLVDALIELRKER-PTWGPKKLRARLESLGLEGLPAASTIGELLKKHGL 119 Query: 123 LPGA------------SPGIPATGRFEHDAPNRLWQMDFKGHFPFGG-GRCHPLTLLDDH 169 + SP PA + PN W DFKGHF G RCHPLTL D Sbjct: 120 IRPRRRRVVTPTTAMPSPLAPA------EQPNDTWCADFKGHFALGDRTRCHPLTLTDQA 173 Query: 170 SRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMR 228 SR+ L +V+ F +GLP R+ DNG P+ G +AL + ++ Sbjct: 174 SRYLLKCEGVAKPHEASVRPHFERAFREFGLPHRIRSDNGPPFATIGIGGLSALSVSWIK 233 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 LGI P PQ G+ ER H++LKAE A+ QR FD +R YN +RPHE Sbjct: 234 LGIHPERIEPGKPQQNGRHERMHKTLKAEATSPPE-ANLAAQQRVFDRFRHEYNDQRPHE 292 Query: 289 ALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERV 348 AL P SRY PS R + PEY + + VR++D G++ G + GE V Sbjct: 293 ALGQRTPASRYTPSRRSMPSKPSSPEYPDTMAVRRLDEQGRMLFGGAQTNVSTLLAGEPV 352 Query: 349 GLKEMQEDGSYEVWWYSTKVGVIDLKKKSITMGK 382 GL + +D +E+++ + + LK K + + + Sbjct: 353 GLTPIADD-VWELYYGPVLLAQVTLKNKELKLAR 385 >UniRef50_D1TPC8 Putative transposase integrase n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1TPC8_9BURK Length = 255 Score = 215 bits (547), Expect = 3e-54, Method: Compositional matrix adjust. Identities = 120/233 (51%), Positives = 145/233 (62%), Gaps = 6/233 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRP 63 MPW+ R+TM+LR EFV A Q+GAN R LCRRFGIS TGYKWL R AQ+ A L DR Sbjct: 1 MPWNPRETMNLRLEFVCLALQEGANRRELCRRFGISAKTGYKWLSRHAQDSTAMALADRS 60 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGL 122 R P +P R++ + + H WG RKI R L D GHT +PA STV +++ RHGL Sbjct: 61 RRPRQTPARTAPCVEQQVVQLRQAHPAWGGRKISRRLSDLGHTDVPAPSTVTDILHRHGL 120 Query: 123 LPGASPGIPAT-GRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAHCT 180 + A+ RFEH+ PN LWQMDFKG F GR C PLT+LDDHSR+++ L C Sbjct: 121 IDAAASAAATPWQRFEHEQPNDLWQMDFKGWFDLQDGRHCSPLTMLDDHSRYNVTLDACI 180 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT--GTWTALELWLMRLGI 231 TVQ L F RYGLP R+ DNGSPWG + G T L +WL+RLGI Sbjct: 181 GTDTRTVQHHLERTFRRYGLPLRINADNGSPWGSPSQAGQLTELAIWLIRLGI 233 >UniRef50_UPI00017F3A47 integrase catalytic subunit n=1 Tax=Escherichia coli O157:H7 str. EC4024 RepID=UPI00017F3A47 Length = 365 Score = 204 bits (518), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 140/365 (38%), Positives = 187/365 (51%), Gaps = 17/365 (4%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R +F+ + +LCR FGIS TGYKWLQR+ + L DR R Sbjct: 1 MPWTETRPMQ-RLDFIRACHAGTDSFSALCRLFGISRKTGYKWLQRFDPSDLSSLSDRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 PH DDI A L +H WG +K++ WL + T+PA ST+ +++ R GL Sbjct: 60 APHSHSRTVPDDIAAQLTALRQKHPDWGPKKLRMWLLNHHADFTVPAASTIGDILKREGL 119 Query: 123 LPGA-----SPGI--PATGRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSL 174 +P +PG P T E+ N++W DFKG F CHP TL D+HSR+ L Sbjct: 120 VPDKKRKRRTPGNRQPLTTISEN---NQVWSADFKGKFRLLSREYCHPFTLTDNHSRYLL 176 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRV 233 E V+Q L F YGLP+ + DNG P+ T + L +WL+RLGIR Sbjct: 177 SCRGTDRESEPFVRQCLTDAFLEYGLPEVLRTDNGQPFAGTGIAGLSRLAVWLIRLGIRP 236 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMA 293 R HP+ G+ ER HRSLK+ V G F E QR F +R +N ERPHE+L A Sbjct: 237 ERIRKGHPEENGRHERMHRSLKSAVSHGNTFMTMEEQQRWFSDYREEFNHERPHESLAGA 296 Query: 294 VPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSV-KGVSLSAGKAFRGERVGLKE 352 PG +QPS RQ+ G Y EG V +V G L + K ++ +A E + L+E Sbjct: 297 TPGMVWQPSCRQWDGRVPDYAYPEGGTVYRVKSRGTLYMGKKGTVFLSEALTDEYIMLEE 356 Query: 353 MQEDG 357 ++DG Sbjct: 357 -RDDG 360 >UniRef50_D2LAL9 Integrase catalytic region n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2LAL9_9DELT Length = 390 Score = 197 bits (500), Expect = 8e-49, Method: Compositional matrix adjust. Identities = 128/364 (35%), Positives = 184/364 (50%), Gaps = 16/364 (4%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + M R F++ SQ + +LCR++GIS TGYKWL+R+ + GL +R R Sbjct: 1 MPWKKVNPMEERARFIVELSQRRESFAALCRKYGISRETGYKWLRRY--QAGEGLGERSR 58 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM---PAFSTVHNLMARHG 121 + H P+++ D + LL + WG +K+ R L+D H + PA ST +++ RHG Sbjct: 59 VARHCPHKTPDAVVTLLLALRQENPYWGPKKLVRLLQDV-HGIEYPPAKSTAGDILKRHG 117 Query: 122 LLPGASPGIPATG----RFEHDAP---NRLWQMDFKGHFPFGG-GRCHPLTLLDDHSRFS 173 L+ +G R + P N +W D+KG F CHPLT+ D SR+ Sbjct: 118 LITATKAKRRQSGGRLRREDLRQPKQANDVWSADYKGWFRLEDRSICHPLTISDIFSRYV 177 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIR 232 L + E ++ + VF RYGLP + +DNG+P+G T T L +W ++LGI Sbjct: 178 LGCYVFPTQTLERTKEAMRRVFMRYGLPRAIRVDNGTPFGSTGIAGLTGLSVWWLQLGIV 237 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDM 292 V P P+ G ER HR+LK E A+ E Q + WR +N RPHEALD Sbjct: 238 VDFIAPGKPEQNGCHERMHRTLKLEATIPP-SANLREQQERLESWRERFNSHRPHEALDQ 296 Query: 293 AVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKE 352 A P S Y+PS+R+ N EY R V G + +G + G+AF R+GL Sbjct: 297 ATPASIYRPSSRRLPRNEPCFEYPSSFESRTVRRDGMFNWEGRQIFLGEAFAKCRIGLTR 356 Query: 353 MQED 356 +D Sbjct: 357 NYDD 360 >UniRef50_C6AZP5 Integrase catalytic region n=3 Tax=Rhizobium leguminosarum bv. trifolii WSM1325 RepID=C6AZP5_RHILS Length = 402 Score = 195 bits (496), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 128/388 (32%), Positives = 186/388 (47%), Gaps = 22/388 (5%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M R FV + +LC +GIS TGYKWL+R+ G AGL D PR Sbjct: 1 MVWRETGIMDERLRFVGECLAGEETMTALCAAYGISRKTGYKWLERYRALGPAGLIDLPR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 P ++ ++ A + + + +WG +K+ L+ + PA ST+ ++ RHGL Sbjct: 61 APLEHGRATAAELVARIVAEKEANPQWGPKKVLARLKRSAPQLCWPAASTIGEILKRHGL 120 Query: 123 L---------PGASPGIPATGRFEHDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRF 172 + G P PA G PN +W D+KG F G RC PLT++D SRF Sbjct: 121 VGRRRHRWRAAGCGPFAPANG------PNAVWSADYKGWFRTRDGRRCEPLTVMDTASRF 174 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGI 231 L L C +F +GLP+R DNGSP+ T T L + ++LGI Sbjct: 175 LLALEACATPAEVEAWPVFERLFAEHGLPERFRSDNGSPFAAIGVTGLTTLAVRFIKLGI 234 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 + +P PQ G+ ERFH ++ + + D Q FD +R YN ERPHEAL Sbjct: 235 GLERIQPGKPQQNGRHERFHLTMLPLAMAPE--VDHAAQQAVFDAFRQNYNAERPHEALA 292 Query: 292 MAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLK 351 M VP Y+PS R+ P+Y VR+V +G++ G + A GE V ++ Sbjct: 293 MDVPADHYRPSLRRLPDRLPEPDYPAEAAVRRVRSNGEIKWNGDLVYVAAALAGEVVAIE 352 Query: 352 EMQEDGSYEVWWYSTKVGVIDLKKKSIT 379 E E G + + +++ +G+ID K K + Sbjct: 353 E-SEAGIWTLRFHAHPLGIIDKKTKRLV 379 >UniRef50_B8GMF2 Integrase catalytic region n=2 Tax=Gammaproteobacteria RepID=B8GMF2_THISH Length = 391 Score = 192 bits (488), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 135/394 (34%), Positives = 187/394 (47%), Gaps = 25/394 (6%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R +F+ + +LCR FGIS TG KW++R A G GL++ R Sbjct: 1 MPWKETCAMDQRVQFIGAWLSGRYSKSALCRHFGISRPTGDKWIRRHALVGVDGLKESSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 PH+ PNR S+ + + A H+ WG +K+ WL + + PA ST ++ R GL Sbjct: 61 APHNQPNRISEALCERIVQAKLAHQDWGPKKVLDWLRAREPEVVWPADSTGGEILRRAGL 120 Query: 123 LPGASPGIPATGRFEHDAP-------NRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSL 174 P H+AP N +W +DFKG + G G RC+PLTL D SR+ L Sbjct: 121 ---VKPRRRRRVVPPHEAPFADCEQSNAVWAVDFKGDYRLGEGRRCYPLTLSDSFSRYLL 177 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRV 233 V F YGLP + DNG+P+ G +AL W + LGI Sbjct: 178 LCRGLARPSGAAVHPWFEWAFREYGLPQAIRSDNGAPFASRAVGGLSALSKWWIDLGIHP 237 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEA 289 RP P G+ ER HRSLK W + QR + +R YN ER HEA Sbjct: 238 ERIRPGRPDQNGRHERMHRSLKG------WLGTPAQGLEAEQRRLEAFRAEYNWERSHEA 291 Query: 290 LDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVG 349 L PGS Y S R Y PP+YD+GV VR+V +G++ +G + + GE V Sbjct: 292 LSRRTPGSLYAASPRPYPPCIEPPDYDQGVEVRRVRNNGEIKWRGRLIYLSEVLIGEPVA 351 Query: 350 LKEMQEDGSYEVWWYSTKVGVIDLKKKSITMGKG 383 L E DG +E+ + +G+++ + IT +G Sbjct: 352 L-EPAGDGLWELRYRFHPLGLLNEQNDRITPARG 384 >UniRef50_A4WDP4 Integrase, catalytic region n=4 Tax=Enterobacter sp. 638 RepID=A4WDP4_ENT38 Length = 382 Score = 185 bits (469), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 123/361 (34%), Positives = 173/361 (47%), Gaps = 13/361 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW TM R +FV + + +CRRF IS TGYKWL R++ + A L DR R Sbjct: 1 MPWTETVTMQ-RLQFVAACLEGNLPVAEVCRRFNISRKTGYKWLARFSPDDTASLADRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLL 123 HH N + + + LL +H WG KI++ L + T +PA ST+ L HGL+ Sbjct: 60 ARHHQ-NSTPEPMVQLLLDTKQQHPLWGPDKIRQRLLNLNITGVPAASTIGELFRVHGLV 118 Query: 124 PGASPGIPATGRFEHDA-----PNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLA 177 P + R H+ PN +W DFKG F GGR CHP TL D+ SR L Sbjct: 119 KKRRPPAFKSTR-PHELHTVAHPNDVWSADFKGKFTHTGGRWCHPFTLTDNCSRIVLACD 177 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW--TALELWLMRLGIRVGH 235 V L VF G+P + DNG P+ G W + + +WL++ G+ Sbjct: 178 ATYMPDGRFVIPCLERVFRECGMPQVLRTDNGPPFAGA-GLWGLSQMSIWLIKCGVLPER 236 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 RP P G+ ER HR+LK + + F E Q D WR+ +N RPH+AL P Sbjct: 237 IRPGKPTENGRHERMHRTLKDALKRHTKFTSLEEQQAWLDAWRSEFNDIRPHKALGGKTP 296 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 GS + PS R ++G + +V + G L + +A RGE + +K+++E Sbjct: 297 GSVWYPSERIFTGPLKAMPVPDDARTLRVSVKGDLCFNSTRIFLSEALRGEWIWMKQVEE 356 Query: 356 D 356 D Sbjct: 357 D 357 >UniRef50_Q01QQ4 Integrase, catalytic region n=9 Tax=Bacteria RepID=Q01QQ4_SOLUE Length = 395 Score = 184 bits (467), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 128/388 (32%), Positives = 194/388 (50%), Gaps = 16/388 (4%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW R ++ ++G +I L +G+S T YKWL+R ++G GLQ + R Sbjct: 1 MPWQEIRVEEQRL-LMIRDHEEGMSISELAEVYGVSRKTVYKWLERHDEQGFLGLQAQSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWG----ARKIKRWLEDQGHTMPAFSTVHNLMARH 120 PH SPN+ + ++ + A RH +WG ++K + +D PA ST+ ++ + Sbjct: 60 RPHRSPNQVTSEVEGAIIAA--RH-KWGWGPGKLRVKLFQQDSRVPWPAVSTIAAVLKAN 116 Query: 121 GLLPG--ASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLC 175 GL+ P +P D PN +W +D+KG F G G R PLT+ D SR+ L Sbjct: 117 GLVVSRRNRPRVPIQRPPYLAADGPNAVWNIDYKGWFRCGDGTRVDPLTISDGFSRYLLR 176 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVG 234 H E + V+ F+ +GLP + DNG+P+ G + L +W ++LGI V Sbjct: 177 CQHVEQTGYELTRAVFVATFQEFGLPGAIHSDNGTPFASVAPGGLSRLSIWFVKLGIVVE 236 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 SRP PQ G+ ER HR+LKA + A Q+AF ++ YN ERPHEALD Sbjct: 237 RSRPACPQDNGRHERMHRTLKAATAKPPQ-ATVRLQQQAFHAFQREYNEERPHEALDNKT 295 Query: 295 PGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 P S YQ SAR Y EY + + R + G L KGV + F E +G+K + Sbjct: 296 PHSCYQASARSYPRRVPELEYGDDMETRVISQQGSLKWKGVRTFISEVFAYETLGIKVID 355 Query: 355 EDGSYEVWWYSTKVGVIDLKKKSITMGK 382 E E+++ ++G +D +++ + K Sbjct: 356 ERW-VELYFGPIRLGWLDGYRQTFSRRK 382 >UniRef50_B4UMG9 Integrase catalytic region n=3 Tax=Bacteria RepID=B4UMG9_ANASK Length = 407 Score = 184 bits (467), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 124/378 (32%), Positives = 187/378 (49%), Gaps = 13/378 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW MS + EFV A GAN+ +LCR FGIS T +KWL+R+ +G GL ++ R Sbjct: 1 MPWKELRPMSQKLEFVEKAIVPGANVSALCRDFGISRQTAHKWLRRYRDQGYLGLVEKSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ-GHTMPAFSTVHNLMARHGLL 123 P SP +++D+ + +H WG +KI L + G P+ +TV ++ R G + Sbjct: 61 RPASSPLATAEDVVVSIIELRSKHASWGPQKIAGVLARRLGPEAPSPTTVARVLRRLGKV 120 Query: 124 PGASPG-----IPATGRFEHDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLA 177 P + R E A N LW +DFKG + G +C PLT+ D SR L +A Sbjct: 121 KRRRPAARIWSVDGRPRIEVKASNDLWTIDFKGWWRALNGDKCEPLTVRDAFSRRVLAVA 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW--GDTTGTWTALELWLMRLGIRVGH 235 V++ L +F ++GLP + DNGSP+ + G T L WL+ LGIR+ Sbjct: 181 LVPATTAAHVRRVLELLFRKHGLPSAIQSDNGSPFICSRSRGGLTVLSAWLVSLGIRIVR 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SRP HPQ G ER HR L LQ QR D W +N RPH+AL P Sbjct: 241 SRPGHPQDNGGHERMHRDLSE--LQLSPARSRRAQQRQCDRWMLDFNHVRPHDALGGKTP 298 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y+ S R+ S + P Y + R+ + +G + + G + A + +GL++ + Sbjct: 299 AELYRNSTRR-SLSPLLPTYPPEWLTRRANKAGYVRINGDQVFVATALARQLIGLRQ-ES 356 Query: 356 DGSYEVWWYSTKVGVIDL 373 + + ++ +G+I++ Sbjct: 357 ELRWSARFFDVDLGMIEI 374 >UniRef50_Q07SS7 Integrase, catalytic region n=5 Tax=Alphaproteobacteria RepID=Q07SS7_RHOP5 Length = 582 Score = 179 bits (454), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 129/389 (33%), Positives = 179/389 (46%), Gaps = 11/389 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W + R FV+ + +CRRFG+S TGYKWL+R+ EG AGL DR R Sbjct: 1 MGWMETRVVDERMRFVMAVADHEEAFAVVCRRFGVSRRTGYKWLERYDAEGVAGLMDRSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL-EDQGHTM-PAFSTVHNLMARHGL 122 PH P + + H WG KI+ WL E G T PA ST+ L+ R GL Sbjct: 61 APHSHPQAIAAPLAERCLAVRRAHPTWGPVKIRHWLAERDGATEWPAPSTIGALLDREGL 120 Query: 123 LPGASPGIPATGR---FEH-DAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLA 177 + F H N +W MDFKG F G G C PLTL D +SR+ L Sbjct: 121 TVKRRLRRRSPPSSVPFGHCGGANDIWCMDFKGWFLTGDGSCCEPLTLSDAYSRYLLRCQ 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHS 236 V L + F +GLP R+ DNG P+ G + L + +++ G+ Sbjct: 181 ALARTDTAHVWPVLEAAFREFGLPHRLRSDNGPPFASCGAGGLSRLAVQVIKAGVVPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P PQ G+LER H +LK + +L+R ++ +YN ERPH+AL P Sbjct: 241 APGKPQQNGRLERLHLTLKQDTAMPPAQTLPEQLKR-LRAFQRLYNEERPHQALGNDTPS 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 Y S R++ G P+Y VR+V +G + G + +A GE +GL E Q + Sbjct: 300 QHYARSPRRFDGCLRAPDYGPDQTVRRVRSNGAIKWGGNEIYINEALAGEPIGLTE-QPN 358 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGK-GC 384 GS+ + +GVI + + K GC Sbjct: 359 GSFAASYGPIVLGVIAHRGNQLRKAKRGC 387 >UniRef50_D2ML31 Integrase, catalytic region (Fragment) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2ML31_9BACT Length = 411 Score = 178 bits (452), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 133/383 (34%), Positives = 178/383 (46%), Gaps = 13/383 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R FV +LC +GIS TG KWL R+ +G AGL D R Sbjct: 1 MPWKEIKIMDQREHFVSDYLTGDYPKGALCELYGISRPTGDKWLARYHAQGVAGLADLAR 60 Query: 65 IPHHSPNRS-SDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHG 121 PH P+++ + I A+L M H RH +G +KI+ L P STV ++ R G Sbjct: 61 RPHTQPHQTPAAVIEAILTMKH-RHPSFGPKKIRDRLRAVAPEEAWPVESTVGVILKRAG 119 Query: 122 LLPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCL 176 L+ + AP W DFKG FP G G RC+PLT++D SR+ L Sbjct: 120 LVRPRRVRRRVPADPQRLSRGTAPAPTWSADFKGDFPLGTGPRCYPLTVMDHASRYLLRG 179 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGH 235 R VQ + VF YGLP + DNG P+ T G + L W +RLG+R Sbjct: 180 EGLLQPTRAAVQPWVAWVFHEYGLPATIRTDNGPPFASTALGGLSRLAAWWVRLGLRPER 239 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 RP P G ER HR+LKA V G A QR F + YN R HEA+ P Sbjct: 240 IRPGTPSENGCHERMHRALKAAV--GPPAATLAAQQRRFAAFVDEYNWARSHEAVARQPP 297 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 G YQPS R Y P EY G +VR+V +G + +G + E VG ++ E Sbjct: 298 GQVYQPSPRAYPAKLPPIEYAPGTLVRQVRQNGAVRWRGHGRYLSEVLAPEPVGFTQIGE 357 Query: 356 DGSYEVWWYSTKVGVIDLKKKSI 378 ++ + + + G +D + +I Sbjct: 358 R-TWAIHYRFHRRGTLDDRTLTI 379 >UniRef50_C3K093 Putative integrase n=2 Tax=Pseudomonas fluorescens SBW25 RepID=C3K093_PSEFS Length = 382 Score = 178 bits (451), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 122/361 (33%), Positives = 175/361 (48%), Gaps = 14/361 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW+ M+ R + V L RRFG+S T KW+ R + L + R Sbjct: 1 MPWNQESPMNQRIKLVADWLSGNFTKSQLARRFGVSRPTVDKWISRHNGD-LKSLAEVSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL--EDQGHTMPAFSTVHNLMARHGL 122 PH+SPN++ D+I A + + H++WG +K+ L ED P+ ST + R GL Sbjct: 60 RPHNSPNKTDDEILARVVAMKEAHDKWGPKKLIELLRIEDPSIDWPSPSTAGQWLDRLGL 119 Query: 123 LPGAS----PGIPATGRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSL-CL 176 + G E + PN+ W D+KG F + C PLT+ D SR L C Sbjct: 120 VNKRRFKRRHGTSHIEMREANDPNKTWCADYKGQFKMLNAQMCFPLTVTDHASRLILACR 179 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGH 235 AH + + V+Q +F+ YG+P+ + DNG P+ + L +W +RLGI Sbjct: 180 AH-PKIKTQPVKQTFERLFQEYGMPEVIRSDNGVPFASPGLARMSTLAVWWIRLGIYPER 238 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 + P P G+ ER HRSLK E+ G ++ E Q +H++ +N RPHEAL M P Sbjct: 239 TMPGRPAQNGRHERMHRSLKLELPLG---SNLVEQQLLLEHFKHEFNYVRPHEALGMKRP 295 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 G Y PS R Y G EY + VR V G + G + +A GER+GLKE ++ Sbjct: 296 GDVYMPSTRLYPGCLPDVEYPAEMRVRSVRQDGSIKWNGKLVFVSEALSGERIGLKEAED 355 Query: 356 D 356 D Sbjct: 356 D 356 >UniRef50_UPI0001913B70 CP4-6 prophage; DNA-binding transcriptional regulator n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. M223 RepID=UPI0001913B70 Length = 84 Score = 177 bits (450), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 84/84 (100%), Positives = 84/84 (100%) Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS Sbjct: 1 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 60 Query: 128 PGIPATGRFEHDAPNRLWQMDFKG 151 PGIPATGRFEHDAPNRLWQMDFKG Sbjct: 61 PGIPATGRFEHDAPNRLWQMDFKG 84 >UniRef50_Q5ZTP2 Transposase (ISmav2) n=14 Tax=Proteobacteria RepID=Q5ZTP2_LEGPH Length = 341 Score = 177 bits (448), Expect = 8e-43, Method: Compositional matrix adjust. Identities = 108/314 (34%), Positives = 153/314 (48%), Gaps = 14/314 (4%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + + +FV DG + SLC+ FGIS TG+K R+ + G GL DR R Sbjct: 1 MPWQECTKVDEKIKFVA-RLLDGEQMSSLCQEFGISRKTGHKIYNRYKESGLEGLNDRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 PH N+ + + WGA KI+ + Q + PA ST+H ++ ++GL Sbjct: 60 KPHRYANQLPFQLEKEILKVKKEKPTWGAPKIREKILRQYPDVKSPAISTIHTILDKYGL 119 Query: 123 LPGASP---GIPATGRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAH 178 + T PN LW D+KG F G C+PLTL D +SR+ L Sbjct: 120 VTKRKRRRYKAEGTKLTNGKTPNELWCADYKGEFQLGSKEYCYPLTLTDFNSRYLLACEG 179 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW--TALELWLMRLGIRVGHS 236 + + + VF+ YGLP+ + DNG P+ + + L +W +RLGI + Sbjct: 180 LSTTKEQYAITVFERVFKEYGLPNAIRTDNGVPFSSVQALFGLSKLSVWWLRLGISIERI 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQ--GKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 RP +PQ G+ ER H +LK E + G+ F Q FD + YN ERPH+ALDM Sbjct: 240 RPGNPQENGRHERMHLTLKKETTKPSGENFLQQ---QEKFDRFIDEYNNERPHQALDMRY 296 Query: 295 PGSRYQPSARQYSG 308 PG Y PS ++Y G Sbjct: 297 PGEVYIPSNKEYKG 310 >UniRef50_C3MF40 Integrase catalytic core domain protein n=3 Tax=Rhizobium sp. NGR234 RepID=C3MF40_RHISN Length = 400 Score = 171 bits (432), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 126/382 (32%), Positives = 178/382 (46%), Gaps = 10/382 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M R +FV + LC +GIS TGYKWL+R+ G AGL+D PR Sbjct: 1 MVWRETGIMEERLKFVAACLSGEETMAGLCALYGISRKTGYKWLRRFQLRGPAGLEDLPR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL--EDQGHTMPAFSTVHNLMARHGL 122 P + ++ ++ + + H WG +KI L +D P+ ST ++ RHGL Sbjct: 61 APLNHGRATAAELVERIVAEKEAHPLWGPKKIVARLARQDPATAWPSASTAGAILNRHGL 120 Query: 123 LPGASPGIPATGRF---EHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAH 178 + G E PN +W D KG F G RC PLT++D SR+ L L Sbjct: 121 VGRRRARWKGAGNGPWPEPAMPNAVWTGDHKGWFTTRDGWRCEPLTVMDVKSRYLLALEA 180 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGIRVGHSR 237 E +F+ +GLPDR+ DNG P+ T T L L +RLGI + Sbjct: 181 TGSTGDEEAWPVFERLFDEHGLPDRIRTDNGPPFAAAGVTGLTPLSLRFVRLGITLERIA 240 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P PQ GK ERFH ++ L AD AF+ +R YN ERPHE L M P Sbjct: 241 PGKPQQNGKHERFHLTMLP--LAKAPAADRAAQAEAFEAFRREYNEERPHETLGMDTPAE 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 Y+ S R+ + P+Y VRKV +G + +G + GE V ++E E G Sbjct: 299 HYRASTRKMPVSPPEPDYPAEAAVRKVRHNGAVKWQGAEIYVSATLVGEVVAIEET-ESG 357 Query: 358 SYEVWWYSTKVGVIDLKKKSIT 379 + + +Y+ ++G ID K+ + Sbjct: 358 EWAMRFYAHRLGFIDEKRGRLV 379 >UniRef50_A9BRN3 Integrase catalytic region n=7 Tax=Proteobacteria RepID=A9BRN3_DELAS Length = 395 Score = 170 bits (431), Expect = 8e-41, Method: Compositional matrix adjust. Identities = 116/383 (30%), Positives = 185/383 (48%), Gaps = 23/383 (6%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M + F+ + GA + LCRR+GIS T YKW++R+ Q G GLQ+R R Sbjct: 1 MPWKECAPMDEKLLFIADHLRGGAPLSELCRRYGISRKTAYKWVERYRQLGMDGLQERSR 60 Query: 65 IPH-HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHG 121 PH ++ S A++ + + + G +K+ L + P+ +T++N++ G Sbjct: 61 RPHGNNQAISYAQRRAIIELRTQQRSQMGPKKLHALLLQRWGPQETPSKTTIYNVLKAEG 120 Query: 122 LL----------PGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHS 170 L+ P A P + PN +W D+KG F G C+PLT++D S Sbjct: 121 LVCSRRVRRRSVPTAQPLRTS------KQPNGVWSADYKGQFKTADGHWCYPLTIMDHAS 174 Query: 171 RFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRL 229 R+ L + E ++ VF +YGLP+R+ DNG P+ T + L +W +RL Sbjct: 175 RYLLAVHVYDSPNYEDAKRSFEQVFRQYGLPERIRSDNGPPFATTGVAGLSRLAIWWIRL 234 Query: 230 GIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEA 289 GIR PQ G+ ER HR+LK L + AD LQ D + YN +RPHEA Sbjct: 235 GIRPERIERGKPQQNGRHERMHRTLK-HALGKEPAADKAALQMQLDAFVEHYNQQRPHEA 293 Query: 290 LDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVG 349 L ++P Y SAR Y +Y + +V +G + + + + G G+ +G Sbjct: 294 LQQSMPAQHYSDSARPYPSKLPELQYPKHWERVRVSHNGLIYWRALRVYIGYLLAGQWIG 353 Query: 350 LKEMQEDGSYEVWWYSTKVGVID 372 ++E+ G ++V+ ++G + Sbjct: 354 MQEVAA-GQWDVYLGPVRLGCFN 375 >UniRef50_UPI00019025E5 ISHne2, transposase n=1 Tax=Rhizobium etli Brasil 5 RepID=UPI00019025E5 Length = 392 Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 124/357 (34%), Positives = 167/357 (46%), Gaps = 22/357 (6%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR----IPHHSPNRSSDDITALLRMA 84 ++ LCRR+G+S T Y W +R DR PH +P D + AL Sbjct: 17 SVSDLCRRYGVSRETFYSWRKRQMSGADDWFVDRSHGTVSCPHRTPAALVDQVIAL---- 72 Query: 85 HDRHERWGARKIKRWLEDQGHTMP--AFSTVHNLMARHGLLPGASPGIPATGR----FEH 138 R G RK+ L+ Q P A ST+ +++ R GL+ A A + E Sbjct: 73 RQRFPHMGPRKLLALLQRQSAQTPWPAASTIGDILKRAGLVEVAKRRRRALDQSRPFTEA 132 Query: 139 DAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 N W +DFKG F R PLT+ D +SRF L + E V+ F Sbjct: 133 TQANDEWSVDFKGWFRTRDQQRIDPLTISDSYSRF-LIDVRIAPQTIEGVRPVFEEAFRT 191 Query: 198 YGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 +GLP + DNGSP+G G T L W ++LGI P PQ G+ ER HR+LKA Sbjct: 192 HGLPFAIRCDNGSPFGSHGAGGLTRLSTWWIKLGIEAHFIAPASPQENGRHERMHRTLKA 251 Query: 257 EVLQGKWFAD-SGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP-SARQYSGNTTPPE 314 + K AD +G+ Q FD +R YN ERPHEAL P Y+P R P Sbjct: 252 QT--SKPPADNAGQQQVRFDAFRQHYNEERPHEALGQRPPADLYRPCQPRAMPERLDDPW 309 Query: 315 YDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI 371 YD VR+V SG++ KG L +A GE VGL E+ E+G + V + + +G+I Sbjct: 310 YDADHQVRRVRDSGEIKWKGGRLFVSEALAGELVGLSEL-ENGDHVVRFCNRDIGLI 365 >UniRef50_Q82H05 Putative IS481 family ISMav2-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82H05_STRAW Length = 589 Score = 147 bits (372), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 94/290 (32%), Positives = 141/290 (48%), Gaps = 15/290 (5%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 G + + R+G+S + + W++++ Q G AGL DR P P+R + ++ A++ Sbjct: 15 GVPVTQVAARYGVSRQSVHSWVRKYEQSGLAGLTDRSHRPASCPHRIASEVEAVVCELRR 74 Query: 87 RHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLLPGASPGI-----PATGRFEHDA 140 RH WG R++ LE +G +P+ +TV+ ++ R+ L+ PG+ R+E A Sbjct: 75 RHPTWGPRRLVHELERRGLAPVPSRATVYRVLIRNSLI---EPGVRRRRRSDYRRWERSA 131 Query: 141 PNRLWQMDFKGHFPFG-GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 LWQMD G GG C +T +DDHSRF + V R+G Sbjct: 132 AMELWQMDIVGGLLLADGGECKMVTGIDDHSRFMVIAKVVQRATARAVCSAFGEALVRFG 191 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRL----GIRVGHSRPYHPQTQGKLERFHRSLK 255 +P+ + DNG + E R+ GI ++P P T GK+ERFH++L+ Sbjct: 192 VPEEVLTDNGKQFTARFSPGKPGEAMFDRICRENGITHRLTKPRSPTTTGKIERFHQTLR 251 Query: 256 AEVL-QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 E+L Q F D Q D W YN RPH+ LDMAVP SR+ P R Sbjct: 252 RELLDQQDPFTDLATAQATVDAWLEEYNRMRPHQGLDMAVPASRFVPRPR 301 >UniRef50_B1ZP18 Integrase catalytic region n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZP18_OPITP Length = 387 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 119/384 (30%), Positives = 165/384 (42%), Gaps = 28/384 (7%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + R ++ ++ +LC RFG+S T YKW R+ +G GL R Sbjct: 1 MPWKIKTAEQQRQALAREMTRGTVSVTALCARFGVSRTTAYKWAARYVAQGVNGLVAR-- 58 Query: 65 IPHHSPNRSSDDITALLR------MAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNL 116 P R AL R +A WGA K++ WLE G +P T+H Sbjct: 59 ----QPGRPKQVSQALARWHARVLLARQARPSWGAPKLRWWLERTHPGERVPCSRTLHRW 114 Query: 117 MARHGLLPG------ASPGIPATGRFEHDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDH 169 + G + A PG PAT E N +W DFKG F G LT+ D + Sbjct: 115 LVAAGRVHQRRRKLRAGPGRPATVLAERV--NAVWTADFKGDFYTKDGAWILALTVRDLY 172 Query: 170 SRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW-GDTTGTWTALELWLMR 228 SRF L + V++ +F R+G+P + +D G+P+ G TAL LW R Sbjct: 173 SRFMLTAHPVPRQSEPVVRRVFARLFRRFGVPQAIRVDRGTPFCGSGPYGLTALSLWWQR 232 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 LGI V E+ HR LKAE + +++R W YN +RPHE Sbjct: 233 LGIEVQFVSRKRRLDNNAHEQMHRMLKAEAATPVSRSYGAQVRR-LQRWCGRYNHDRPHE 291 Query: 289 ALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERV 348 L P S Y+PS R PP+Y G + R+V G + + G G+AF G V Sbjct: 292 GLAGRTPASLYRPSTRLLP-RLVPPQYPLGCVTRRVRPHGYVKLDGSHRHIGRAFVGLTV 350 Query: 349 GLKEMQEDGSYEVWWYSTKVGVID 372 ++ Y V + S +G ID Sbjct: 351 AFTPYRQ--LYRVHFDSLLLGTID 372 >UniRef50_A1T2L4 Integrase, catalytic region n=4 Tax=Actinomycetales RepID=A1T2L4_MYCVP Length = 597 Score = 144 bits (364), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 95/288 (32%), Positives = 142/288 (49%), Gaps = 14/288 (4%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAH 85 DG + + + G S + + W+ R+ G AGL DR R P SPN S + A++ Sbjct: 26 DGVSKSQVAQECGASRQSVHSWVIRYEALGVAGLADRSRRPLTSPNELSPAVVAMVCELR 85 Query: 86 DRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLLPGASPGIP-ATGRFEHDAPNR 143 + RWGA++I L +G P+ S+V+ ++ RHGL+ R++ DAP + Sbjct: 86 RTYPRWGAQRIAHELALRGVDAPPSRSSVYRILVRHGLVAAQQQNHKRKYRRWQRDAPMQ 145 Query: 144 LWQMDFK-GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPD 202 LWQ+D G F G C +T +DDHSRF + D V + YG+P Sbjct: 146 LWQIDIMGGVFLVDGRECKVVTGIDDHSRFVVMATVVADPGARAVCAAFTATMAIYGVPS 205 Query: 203 RMTMDNGSPWGDTTGTWTA---LELWLMRL----GIRVGHSRPYHPQTQGKLERFHRSLK 255 + DNG + TG +T E+ R+ GI ++P P T GK+ERFH++L+ Sbjct: 206 EVLTDNGKQF---TGRFTKPYPAEVLFERICRENGITTRLTKPRSPTTTGKIERFHKTLR 262 Query: 256 AEVLQGKW-FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 E+L FA Q A D W YN RPH++L MA P + ++P+ Sbjct: 263 RELLDSAGPFASIEVAQEAIDAWVHGYNHSRPHQSLGMATPATMFRPA 310 >UniRef50_B2HR82 Transposase for ISMyma05 n=5 Tax=Mycobacterium RepID=B2HR82_MYCMM Length = 518 Score = 137 bits (346), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 91/280 (32%), Positives = 141/280 (50%), Gaps = 12/280 (4%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRPRIPHHSPNRSSDDITALLRMA 84 DGA + ++ RRFG+S T + WL+R+A+EGAA L+DR PH P++ ++ A + Sbjct: 20 DGAPVTAVARRFGVSRQTVHAWLRRYAEEGAALNLEDRSSRPHRCPHQMPVEVEARVLTL 79 Query: 85 HDRHERWGARKIKRWLE-DQGHTMPAFSTVHNLMARHGLLPGASPGIPATG--RFEHDAP 141 D H RWG +I L+ D +P S+V+ + R+G + A + R+E P Sbjct: 80 RDAHPRWGPTRIVYELQRDVVPVVPGRSSVYRALVRNGRIDPAKRRRRRSDYKRWERGRP 139 Query: 142 NRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 LWQMD G G +T +DD+SRF + V L+ R+G+ Sbjct: 140 MELWQMDVVGGLHLRDGIEVKVVTGIDDNSRFVVSAKVVARATARPVCAALLEALRRHGV 199 Query: 201 PDRMTMDNGSPWGDTTGT-WTALELWLMRLGIR--VGH--SRPYHPQTQGKLERFHRSLK 255 P+++ DNG + G ++ E+ R+ + +GH + P P T GK+ER H++++ Sbjct: 200 PEQILTDNGKVFTGRFGPGGSSAEVLFDRVCVENGIGHLLTAPRSPTTTGKVERLHKTMR 259 Query: 256 AEVLQ--GKWFADSGELQRAFDHWRTVYNLERPHEALDMA 293 AE+ F ELQ A D W YN RPH++L M Sbjct: 260 AEIFAEVDGVFDAIAELQAAIDRWVQYYNTARPHQSLGMV 299 >UniRef50_UPI0001B540A0 transposase for IS3514a n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B540A0 Length = 383 Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 111/372 (29%), Positives = 165/372 (44%), Gaps = 35/372 (9%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT-ALLRMA 84 + NI CR G+S +K+L R+ EGA G R PH P ++ A+LR Sbjct: 16 EDVNIARFCREHGVSRTVFHKYLNRFRAEGADGFTRRSTAPHRRPTALGTEVAEAVLRAR 75 Query: 85 HDRHERW---GARKIKRWLEDQGHT-MPAFSTVHNLMARHG-LLPGASPGIPATGRFEHD 139 + + G I+ LE QG +P+ S V+ ++ HG ++P RFE+ Sbjct: 76 KELADEGLDNGPISIRWRLEAQGAAAVPSQSAVYRILRAHGQIVPQPRKKPRTRRRFEYA 135 Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 PN WQ+D H G + + +LDDHSR + T E L F +G Sbjct: 136 DPNGCWQIDGMEHHLADGTKVCIIQILDDHSRLDVGAYAATGETTAATWAALQHAFAGHG 195 Query: 200 LPDRMTMDNGSPW-GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 LP + DNG + G G LE L LGI + P+HPQT GK ER H++L+ Sbjct: 196 LPVALLSDNGLAFSGKHRGRMVELERRLAALGITAIAAAPHHPQTCGKNERSHQTLQ--- 252 Query: 259 LQGKWFADS------GELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTP 312 KW A +LQ D +RT+YN R H++L+ P RY AR + T Sbjct: 253 ---KWLAARPAAGTLAQLQELLDEYRTIYN-HRRHQSLNGDTPRQRY--DARPKAVPATG 306 Query: 313 PEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI- 371 P G+ R V +G ++ G S+ G+ + G + V+W +V V+ Sbjct: 307 PRRPSGLATRPVSATGVIAFSGCSIVLGRRWAGH-----------TASVYWQGDRVTVMI 355 Query: 372 -DLKKKSITMGK 382 D + +T+ + Sbjct: 356 NDTIARQLTLDR 367 >UniRef50_UPI0001B45627 transposase for ISMyma05 n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B45627 Length = 519 Score = 130 bits (328), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 97/290 (33%), Positives = 148/290 (51%), Gaps = 12/290 (4%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRPRIPHHSPNRSSDDITALLRMA 84 DGA I ++ R+G+S T + WL+R+A+EGA L+DR PH P++ + ++ A + + Sbjct: 20 DGAVISTVACRYGVSRQTVHAWLRRYAREGAVLNLEDRSSRPHGCPHQMAAELEARVLVL 79 Query: 85 HDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLLPGASPGIPATG--RFEHDAP 141 D H RWG +I L +G +P S+V+ + R+G + A R+E P Sbjct: 80 RDAHPRWGPTRIVYELVREGVVAVPGRSSVYRALVRNGRIDPARRRRRRADYKRWERGRP 139 Query: 142 NRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 LWQMD G G +T +DD+SRF +C A V + L++ R+G+ Sbjct: 140 MELWQMDVVGGVHLCDGVEVKVITGIDDNSRFVVCAAVVARATARPVCEALLAALARHGV 199 Query: 201 PDRMTMDNGSPWGDTTGTW-TALELWLMRL----GIRVGHSRPYHPQTQGKLERFHRSLK 255 P+++ DNG + G ++ E R+ GIR + P P T GK+ER H++++ Sbjct: 200 PEQILTDNGKVFTGRFGPGGSSSEALFDRVCAENGIRHLLTAPRSPTTTGKVERLHKTMR 259 Query: 256 AEVLQGK--WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 AE FA ELQ A D W YN ERPH++L M P R+ +A Sbjct: 260 AEFFTDADGRFATIAELQAALDGWVGQYNTERPHQSLGMRPPAERFALAA 309 >UniRef50_C8XFB0 Integrase catalytic region n=4 Tax=Actinomycetales RepID=C8XFB0_NAKMY Length = 607 Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 90/291 (30%), Positives = 139/291 (47%), Gaps = 16/291 (5%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 GA + + G+S + + WL+R+ EG GL DR P P+++ D+++ + Sbjct: 22 GATVTEVAAAVGVSRVSVHAWLRRYLTEGVTGLADRSHRPRSCPHQAGDEVSVRVAELRR 81 Query: 87 RHERWGARKIKRWL--EDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG--RFEHDAPN 142 H RWGA++I+ L + G T+P+ +T++ ++ RHGL+ P + R+E P Sbjct: 82 THPRWGAKRIRMELLRKPAGLTVPSTATINRILIRHGLVTPRRRKRPRSSYQRWERPGPM 141 Query: 143 RLWQMDFKGHF----PFGGGR--CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 +LWQ+D G P G +T +DDHSRF + A V L + Sbjct: 142 QLWQLDIVGDVWLVNPATGVLRGVKVVTGVDDHSRFCVIAAVVERATGRAVCLALAAALA 201 Query: 197 RYGLPDRMTMDNGSPWGDT--TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 R+G+P + DNG + G + GI ++P P T GK+ERFH +L Sbjct: 202 RFGVPGEILTDNGKQFTARFGRGGEVLFDKICRHNGITHRLTQPASPTTTGKIERFHLTL 261 Query: 255 KAEVLQG-KWFADSGELQRAFDHWRTVYNLERPHEALD---MAVPGSRYQP 301 + E+L + F Q A D + VYN ERPH+ALD P R+ P Sbjct: 262 RRELLDDHEPFESLAAAQAAVDEFVRVYNTERPHQALDGQRPVSPADRFTP 312 >UniRef50_Q4JUH9 Transposase for IS3514b n=9 Tax=Corynebacterium RepID=Q4JUH9_CORJK Length = 407 Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 105/354 (29%), Positives = 145/354 (40%), Gaps = 36/354 (10%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP-------- 70 V + G + + +RF IS YK L ++ GA + + R PH P Sbjct: 10 IVKAVREQGEPVTKVAKRFRISRQRIYKILSQFDAGGADAIAPKSRAPHTHPQAVPTSLR 69 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 N+ D L+R D G I L QG +P+ ST+ ++ GL+ Sbjct: 70 NQIIDMRKQLVRSGLD----AGPETIAFHLHRQGLRVPSTSTIRRIITNAGLVTPQPQKK 125 Query: 131 PATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 P + RFE PN WQ D G R L +DDHSR+ L + V Sbjct: 126 PRSSFIRFEAAMPNECWQADITHLHLLDGTRLEVLDFIDDHSRYLLSITAAASFSGPAVA 185 Query: 189 QQLVSVFERYGLPDRMTMDNG----SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 +L + YG P DNG + G A E L + I+ + RP HPQTQ Sbjct: 186 AELQRLIATYGPPASTLTDNGLVFTARLAGARGGRNAFEKTLNKYRIQQKNGRPGHPQTQ 245 Query: 245 GKLERFHRSLKAEVLQGKWFADSG------ELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 GK+ERFH++LK KW A ELQR D + YN RPH AL P Sbjct: 246 GKIERFHQTLK------KWIAAQSPAITLVELQRQLDTFADYYNTVRPHRALGRRTPHEV 299 Query: 299 YQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVS----LSAGKAFRGERV 348 Y + + PE + V V +GK++V+ S L G+ + GE + Sbjct: 300 YTTGPKAEPNDK--PEEEWRVRNDVVTPNGKVTVRYASRLYQLGIGRKYTGETI 351 >UniRef50_UPI0001B453C4 transposase for IS3514a n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B453C4 Length = 397 Score = 121 bits (304), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 96/342 (28%), Positives = 154/342 (45%), Gaps = 25/342 (7%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI-------- 77 +G + ++ R + +S ++ ++R+ EG A + R R PH +P + D+ Sbjct: 14 EGRSKSAVARDYEVSRYWVHQLVKRYEAEGPAAFEPRSRRPHTNPRAVAGDLEERIVRLR 73 Query: 78 TALLRMAHDRHERWGARKIKRWL--EDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG- 134 LLR +D GA I L + T+PA +T+ +++R G + P + Sbjct: 74 KTLLREGYD----AGAATIAEHLARDPAVATVPALATIWRVLSRRGFITAQPQKRPRSSW 129 Query: 135 -RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 RFE D PN+ WQ D L ++DDHSR ++ V + + Sbjct: 130 KRFEADLPNQCWQADVTHWQLADHTSAEILNIIDDHSRLAIASTAYRTVTAPDVVEAFTA 189 Query: 194 VFERYGLPDRMTMDNGSPWGDTT--GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 F +G P + DNG+ + T G TAL++ L LGI +SRPYHPQT GK+ERFH Sbjct: 190 AFATWGTPAALLTDNGAVFTATPRRGGRTALQILLGELGITYINSRPYHPQTCGKVERFH 249 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY-SGNT 310 ++LK + ELQ + + YN RPH A+ P + + +G Sbjct: 250 QTLKKRLTAVPPATTITELQSHLNEFVNYYNTVRPHRAVGRRTPHHAFTSRPAAFPTGYH 309 Query: 311 TPPEYDEGVMVRKVDISGKLSVKGVS----LSAGKAFRGERV 348 PP + + ++D +G ++V+ S + K RG V Sbjct: 310 IPPHFR--LRHDRIDAAGVITVRYNSRLHHIGLSKHLRGTHV 349 >UniRef50_Q4JWW8 Transposase for IS3511a n=7 Tax=Corynebacterium RepID=Q4JWW8_CORJK Length = 405 Score = 120 bits (301), Expect = 8e-26, Method: Compositional matrix adjust. Identities = 92/313 (29%), Positives = 139/313 (44%), Gaps = 16/313 (5%) Query: 34 CRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS-DDITALLRMAHDRHER-- 90 + FG++ +R+ + G L + + PH +P ++ D + +L++ ++ R Sbjct: 24 AQHFGVTTRWIRTLQKRYNEGGVEALTPKSKRPHTNPRATTPDTVDRILQLRNELTNRGT 83 Query: 91 -WGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLLPGASPGIPATG--RFEHDAPNRLWQ 146 GA I+ LE + T +PA +T+H ++ +G + P + RF+ D PN WQ Sbjct: 84 DAGAHTIRWHLEQEDTTPLPATATIHRILKNNGHVTLQPQKRPRSSWIRFQADQPNETWQ 143 Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 MD+ G R LT+LDDHSR+ L V + +G P Sbjct: 144 MDYSDWTIAGHQRVVILTILDDHSRYVLRCQAFNSATVTHVIEAFAYTAAIHGYPQSTLT 203 Query: 207 DNGSPWGD----TTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK 262 DNG + T E L+ LGI + RPYHPQTQGK+ERFH +LK + Sbjct: 204 DNGRAFTTSNDRTNPARNGFEQLLLDLGIEQKNGRPYHPQTQGKVERFHYTLKLALRNKP 263 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ--PSARQYSGNTTPPEYDEGVM 320 D L D YN +RPH AL+ P Y P A G T +D + Sbjct: 264 QARDIDNLNEQLDDIIDYYNNKRPHRALNRCTPAEAYNALPKAHPRPGAKT---HDYRLR 320 Query: 321 VRKVDISGKLSVK 333 KV +GK +++ Sbjct: 321 TDKVAKNGKTTLR 333 >UniRef50_C8NDJ4 ISHne2 transposase (Fragment) n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8NDJ4_9GAMM Length = 240 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 79/215 (36%), Positives = 103/215 (47%), Gaps = 8/215 (3%) Query: 2 ESLMPWDARDTMSLRTEFV-LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 E MPW + M R F+ + SQ + I +LCR+FGIS TG KW+ R+ Q G A L Sbjct: 27 EIPMPWRQTNVMQQREMFINAWLSQKYSKI-ALCRQFGISRVTGDKWIVRFKQGGMAALA 85 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMA 118 D P P + + LL A H WGA+K+ L + PA ST ++ Sbjct: 86 DHSSRPAGCPRATDARLCELLCAAKREHPSWGAKKLLALLRRRAPHEAWPADSTGDLILK 145 Query: 119 RHGLLPGASP--GIPATGR--FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 R GL+ P GI A DAPN+ W +DFKG F GGRC PLT+ D++SR L Sbjct: 146 RAGLVKARKPRRGISADASPFTAADAPNQSWSVDFKGDFAMRGGRCFPLTVSDNYSRKLL 205 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 C V Q +F G+P + DNG Sbjct: 206 CCHGLASTAYAGVWPQFERLFAENGMPWSILSDNG 240 >UniRef50_A1UAJ3 Integrase, catalytic region n=7 Tax=Actinomycetales RepID=A1UAJ3_MYCSK Length = 426 Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 93/337 (27%), Positives = 146/337 (43%), Gaps = 31/337 (9%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEG-AAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH 88 + + C +GIS + Y+ +R EG AA L+ R P SP++ SD++ Sbjct: 28 VSTFCAEYGISRKSFYELRKRVKTEGPAAVLEPMTRRPKSSPSKLSDEVKEQALAVRAAL 87 Query: 89 ERWGARKIKRWLEDQGHTM-----PAFSTVHNLMARHGLLPGASPGIPATG--RFEHDAP 141 E G + D+ H M P+ +++ + G+ P + RF + AP Sbjct: 88 EATGLDHGPISVHDKMHAMGLERVPSTASLARVFREAGVARLEPKKKPRSAWRRFVYPAP 147 Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N WQ+D + GG RC L+DDHSR++L E + + +G+P Sbjct: 148 NACWQLDATEYVLSGGRRCVIFQLIDDHSRYALASHVALSETAKEAIAVVDKAIAAHGVP 207 Query: 202 DRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 R+ DNG + G L L LG+ +PY P TQGK ERFH++L L Sbjct: 208 QRLLSDNGIALNPSRRGHVGQLVAHLAALGVEAITGKPYKPTTQGKNERFHQTL-FRYLD 266 Query: 261 GKWFADS-GELQRAFDHWRTVYNLERPHEALDMAV-PGSRYQPSA--------------- 303 + A+S ELQ D + +YN ERPH+ L V P + ++ +A Sbjct: 267 KQPIAESLAELQCHVDAFDGIYNTERPHQGLPGRVTPRTAWEATAKAPAPRPKPDPPSFD 326 Query: 304 ----RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVS 336 R + TP + G V+ ++ +G + GV+ Sbjct: 327 HAVVRPHRPAPTPADLPHGTSVKTLNTAGAFVLAGVT 363 >UniRef50_C7MHU2 Integrase family protein n=3 Tax=Actinomycetales RepID=C7MHU2_BRAFD Length = 434 Score = 103 bits (258), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 78/273 (28%), Positives = 117/273 (42%), Gaps = 14/273 (5%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEG-AAGLQDRPRIPHHSPNRSSDDIT-------AL 80 + S C GIS T Y R +EG AA L+ + R P SP R +D+ A Sbjct: 26 TVTSFCVEHGISRKTFYVLRARLREEGPAAVLEPKSRRPSSSPTRIGEDVKDQAVAVRAA 85 Query: 81 LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG--RFEH 138 L + H G + + G P+ + + + G+ P + RF + Sbjct: 86 LEASGLDH---GPISVFDRMGAMGLESPSVAALARIFRERGVARADPKKKPRSAYRRFVY 142 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 APN WQ+D G+ G C L DDHSR ++ E + + RY Sbjct: 143 PAPNACWQLDATGYVLIDGRSCTIFQLQDDHSRLAVASLVAPAETTQAALDVFLKGVARY 202 Query: 199 GLPDRMTMDNGSPWGDTTGTW-TALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE 257 G+P R+ DNG+ T W + L LG++ +P+ P TQGK ERFH++L Sbjct: 203 GVPQRLLTDNGAAMNPTRRGWPSPLVTHATGLGVQAITGKPFKPTTQGKNERFHQTLFRW 262 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 + Q +LQ D++ +YN +R H+ L Sbjct: 263 LDQQPLAETISQLQAMVDNFDIIYNQQRRHQGL 295 >UniRef50_D1VRH7 Integrase catalytic region n=1 Tax=Frankia sp. EuI1c RepID=D1VRH7_9ACTO Length = 410 Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 104/336 (30%), Positives = 143/336 (42%), Gaps = 37/336 (11%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL-LRMA 84 +G + RR+ +S YK R+ EG + R R P SP +S + L LR+ Sbjct: 14 EGQTAAQVARRYEVSRGWVYKLKARYDAEGEVAFEPRSRRPVSSPTATSVAMVDLVLRLR 73 Query: 85 HDRHER---WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG--RFEHD 139 + E GA I L T + +T++ ++ R G + P + RF+ D Sbjct: 74 KELAEAGLDAGADTIGWHLAHHHDTTLSRATINRILNRAGAVTPEPAKRPRSSYIRFQAD 133 Query: 140 APNRLWQMDFKGHF----PFG--GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 PN WQ DF H+ P G G LT LDDHSRF+L ++ V Sbjct: 134 QPNECWQSDFT-HYRLTRPNGKIGIDTEILTWLDDHSRFALRVSAHLKITGRIVVASFRQ 192 Query: 194 VFERYGLPDRMTMDNGSPW------GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + +G P DNG + G T E L RLGI +SRP HP T GK+ Sbjct: 193 AADLHGYPASTLTDNGMVYTVRLASAGVAGGRTGFEAELRRLGIVQKNSRPNHPTTCGKV 252 Query: 248 ERFHRSLKAEVLQGKWF-------ADSGELQRAFDHWRTVYNLERPHEALD-MAVPGSRY 299 ERF ++LK KW A + LQ D + YN RPH +L P Y Sbjct: 253 ERFQQTLK------KWLAAQPVQPASTYALQTLIDQFVETYNQHRPHRSLPGRCTPAVAY 306 Query: 300 QPSARQYSGNTTPPEYDEGVMVRK--VDISGKLSVK 333 Q AR + T D VR+ VD +GKL+++ Sbjct: 307 Q--ARPKARPNTDRSADSHDRVRRDHVDANGKLTLR 340 >UniRef50_A3TPQ6 Transposase n=7 Tax=Actinomycetales RepID=A3TPQ6_9MICO Length = 402 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 92/321 (28%), Positives = 141/321 (43%), Gaps = 14/321 (4%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD-DITALLRMA 84 +G + RR+G+ A YK R+ EG A + R R P SP + + + +LR+ Sbjct: 14 EGLKPAEVSRRYGVHRAWVYKLKARYEAEGEAAFEPRSRRPTTSPRATPEGTVDLVLRLR 73 Query: 85 HDRHERW--GARKIKRWLEDQGHTMP-AFSTVHNLMARHGLL---PGASPGIPATGRFEH 138 D + G W GH + + +TVH ++ R G + PG P + RFE Sbjct: 74 EDLTGKGLDGGADTIVWHLLHGHGVTLSRATVHRILTRAGKVTAEPGKRPK-SSFIRFEA 132 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + PN WQ DF + G +T LDDHSR++L ++ T + V + Sbjct: 133 EQPNETWQSDFTHYRLSTGADVEVITWLDDHSRYALHVSAHTRTTAKIVLATFRAATAEQ 192 Query: 199 GLPDRMTMDNGSPW----GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 G P DNG + G TALE L L I +SRP HP T GK+ERF +++ Sbjct: 193 GCPAGTLTDNGMVYTVRFATGPGGRTALEHELRTLNIVQKNSRPNHPTTCGKVERFQQTM 252 Query: 255 KAEV-LQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMAVPGSRYQPSARQYSGNTTP 312 K + Q A EL + T YN RPH +L + P + Y + Sbjct: 253 KNWLRAQPDQPATVAELNTLLAAFVTEYNTRRPHRSLPHRSTPATAYNARPKATPTTDRT 312 Query: 313 PEYDEGVMVRKVDISGKLSVK 333 + + V K++ +G ++++ Sbjct: 313 DDTHDRVRTDKINKNGVVTLR 333 >UniRef50_UPI0001C30E87 Integrase catalytic region n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30E87 Length = 318 Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 99/325 (30%), Positives = 134/325 (41%), Gaps = 44/325 (13%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R VL + G +I + G+S T KW+ R+ EG+ GL DR P SP R+ + Sbjct: 14 RERMVLRVVEQGWSIAEAAQAAGVSDRTCSKWIGRYRAEGSMGLVDRASTPKRSPTRTPE 73 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR 135 D L+ A R R A +I L T+ A NL R L P P R Sbjct: 74 DRVQLI--AALRRLRMTAAEIALCLGMALSTVSAVLRRINLGKRSRLDPPEPPN-----R 126 Query: 136 FEHDAPNRLWQMDFKGHFPFGGGRCHPLT-------------------LLDDHSRFSLCL 176 +E P L +D K GG H +T +DD +R + Sbjct: 127 YERARPGELLHIDVKKLGRIHGGAGHRVTGRKSGMHRARGAGWDYVHVCVDDATRLAYVE 186 Query: 177 AHCTDERRETVQ---QQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRL-GI 231 DER TV ++ + + R+G+ +R+ DNGS + T L RL G+ Sbjct: 187 V-LPDERGTTVAGFLRRAIRHYRRHGITVERVMTDNGSGYRST------LHAIACRLQGV 239 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 R +RPY P+T GK ERF R++ G +A S E A D W YN RPH +L Sbjct: 240 RHLRTRPYRPRTNGKAERFIRTMIEGWAYGAIYASSAERTAALDGWLFTYNHRRPHGSL- 298 Query: 292 MAVPGSRYQPSARQYSGNTTPPEYD 316 S P+AR N P Y Sbjct: 299 -----SHKPPAARLRELNNLPSSYS 318 >UniRef50_A3J543 Helix-turn-helix, Fis-type protein n=6 Tax=Bacteria RepID=A3J543_9FLAO Length = 336 Score = 100 bits (250), Expect = 7e-20, Method: Compositional matrix adjust. Identities = 75/296 (25%), Positives = 135/296 (45%), Gaps = 10/296 (3%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R T+S + E + ++ + R GI+ + Y W +++ G GL R + Sbjct: 2 RLTVSEKQEIIHMVTRSEIGVNRTLREIGINKSMFYNWYHAYSENGVEGLLPTKRASNRQ 61 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG 129 N + L+ + +R++ + D+ + S+V+ ++ GL+ + Sbjct: 62 WNSIPQEQKNLVVKLALEYPDLSSRELAYKVTDEQQIFLSESSVYRILKSRGLITAPAHI 121 Query: 130 IPATGRFEHDAPN---RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + G D + ++WQ DF G G + T+LDD+SR+ + C++ + + Sbjct: 122 FLSAGNEFTDKTSFVHQMWQTDFTYFKILGWGWYYLSTVLDDYSRYIVHWELCSNMKADD 181 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALEL--WLM-RLGIRVGHSRPYHPQT 243 V++ + S ++ L +T D + A EL +L ++ H RP HPQT Sbjct: 182 VKRTVDSAIKKAKL---VTKQKPKLLSDKGSCYIASELKTYLKDNYQMQQVHGRPNHPQT 238 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 QGK+ER+HR++K V +FA EL+ A + + YN ER HE+L+ P Y Sbjct: 239 QGKIERYHRTIKNVVKLDNYFAPE-ELEAALEKFVYRYNNERYHESLNNLTPADVY 293 >UniRef50_C1DTA7 Putative transposase n=2 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DTA7_SULAA Length = 327 Score = 100 bits (250), Expect = 7e-20, Method: Compositional matrix adjust. Identities = 74/292 (25%), Positives = 131/292 (44%), Gaps = 33/292 (11%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS--PNRSSDDITALLR 82 QD NI CR FGIS T YKW +R+ ++G GL DRP+ P ++ P + +++ Sbjct: 43 QDTKNISKTCRYFGISRTTFYKWFERYKKDGLEGLLDRPKTPKNTRKPTIRNQYREQIIK 102 Query: 83 MAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP-GASPGIPATGR------ 135 + ++ W KI +L+++ + + STV+ ++ GL+ S I + Sbjct: 103 V-RKQNPTWSKEKISAYLQEEKNIKVSPSTVYKVLKEEGLIERTKSIKIQNKRKKSIKKK 161 Query: 136 -----FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 + AP + Q+D K H G + T +D +SRF + + ++T ++ Sbjct: 162 RTKRGLQAQAPGDVVQIDVK-HLNIAGATYYQFTAIDKYSRFCFARVYESKNSKKT-KEF 219 Query: 191 LVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 + + E + R+ DNGS + G + +L +G+ S P P+T G +ER Sbjct: 220 YIELNEYFEFEIKRVQTDNGSEF---LGEFNK---YLTDIGVEHYFSYPRSPKTNGVVER 273 Query: 250 FHRSLKAEVLQGKWFADS-----GELQRAFDHWRTVYNLERPHEALDMAVPG 296 R+++ E+ W + E+ + + YN RPH +L P Sbjct: 274 LIRTIEEEL----WLIEGLDYTLEEMNKKLRKYVRKYNFIRPHHSLGYKRPA 321 >UniRef50_A0JV34 Integrase, catalytic region n=12 Tax=Actinomycetales RepID=A0JV34_ARTS2 Length = 325 Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 97/333 (29%), Positives = 135/333 (40%), Gaps = 52/333 (15%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 M + +A T R + + G +R RF SPAT KW+ R+ G G+ Sbjct: 1 MSYVTHANADLTPKARGKLARLVIEQGWTLRRAAERFQCSPATAKKWVDRYRARGEDGMA 60 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRH-ERWGARKIKRWLEDQGHTMPAFSTVHNLMAR 119 D P SPNR+ D+ R+ R RWG +I H A STV ++ R Sbjct: 61 DLSSRPRRSPNRT--DVRTERRILALRFTRRWGPHRI------AAHLHLARSTVGKVLTR 112 Query: 120 HGL--LPGASPGI------PATGRFEHDAPNRLWQMDFK--GHFPFGGGR---------- 159 + + L G PA R+EHD P L +D K G P GGG Sbjct: 113 YRMPRLACLDQGTGLPIRKPAPQRYEHDHPGDLVHVDIKKLGRIPDGGGHRALGRAAGRK 172 Query: 160 --------CHPLTLLDDHSRFSLCLAHCTDERRETVQQ---QLVSVFERYGLPDRMTM-D 207 + +DDHSR + TDE++ET + S F +G+ R + D Sbjct: 173 NRRAGTGYAYLHHAVDDHSRLAYSEI-LTDEKKETATAFWFRAASFFAAHGITVRAVLTD 231 Query: 208 NGSPWGDTTGTWTALELWLMRLGIRVGH--SRPYHPQTQGKLERFHRSLKAEVLQGKWFA 265 NG+ + T LG + H +RPY PQT GK+ERF+R+L E + + Sbjct: 232 NGACYRSRAFTAA--------LGPNIKHRRTRPYRPQTNGKVERFNRTLNTEWAYARPYT 283 Query: 266 DSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 E + W YN R H + P SR Sbjct: 284 SEAERAATYPGWLHQYNHHRTHTGIGGKTPISR 316 >UniRef50_A6DU50 Putative ISmav2-like transposase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DU50_9BACT Length = 598 Score = 98.2 bits (243), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 82/289 (28%), Positives = 132/289 (45%), Gaps = 30/289 (10%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL 80 A++ GAN++SL W++R+ +EG GL+ RP+ + + Sbjct: 30 FVAAESGANLKSLG-----------AWIKRYNEEGPQGLKPRPKGKKGRQQIHPETKEKI 78 Query: 81 LRMAHDRHERWGARKI----KR--WLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG 134 + + ++ +G ++I KR +L+ T+ NL+ + P +P P Sbjct: 79 IELKK-QYPIFGIKRISDLLKRVFFLKASPETVRKTLNEENLIQKERKKPRKNPQKPRF- 136 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 FE PN++WQ D F GG + L +DD+SR+ + L RR+T + L+ V Sbjct: 137 -FERSRPNQMWQTDI-FSFRLGGQAAYLLAFIDDYSRYMVGLGLY---RRQTAEN-LLEV 190 Query: 195 FER----YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 + R Y P M DNG + + GT T E L + I+ S+P+HP T GK+ERF Sbjct: 191 YRRATGEYNCPAEMLTDNGRQYTNWRGT-TRFEKELKKDRIKHIRSQPHHPMTLGKIERF 249 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 +++ E L F Q+ W YN +RPH+ + P RY Sbjct: 250 WKTIWTEFLDRCQFDCMETAQQRITLWIKYYNHQRPHQGIGGLCPADRY 298 >UniRef50_Q3A8V0 ISChy3, transposase n=5 Tax=Clostridia RepID=Q3A8V0_CARHZ Length = 448 Score = 94.4 bits (233), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 100/371 (26%), Positives = 169/371 (45%), Gaps = 31/371 (8%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA- 79 L A++ + + R IS T ++LQ + Q+G +GL + R + S S +I Sbjct: 30 LEAAEKRQRRKEILARSEISSRTLRRYLQLYRQQGLSGLMPKIRSDNGSSRTISHEIIEE 89 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGLLPG-ASPGIPATGR 135 +++ + ER +I LE + M A ST+ ++R GL A+ I R Sbjct: 90 AVKLKEELPER-SVSQIIAILEGEKKVPAGMLARSTLGRHLSRLGLTQKEANQKISGHRR 148 Query: 136 FEHDAPNRLWQMDFK-GHF------PFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 F + NRLWQ D K G + P R + + +DD +R D++R ++ Sbjct: 149 FAKEQRNRLWQADIKYGPYLPHPKNPKRKVRTYLVAFIDDATRLLCHGEFYLDQKRPVLE 208 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 + G+PD + +DNG + W L RLGIR +++PY P+++GK+E Sbjct: 209 DCFRKAILKRGIPDAVYVDNGKIF---VSRW--FRLGCARLGIRPINTKPYSPESKGKIE 263 Query: 249 RFHRSLKA-----EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 RF+R++++ E+ Q + A EL +AF W PH +L+ P +R+Q Sbjct: 264 RFNRTVESFIAEIELQQPETLA---ELNQAFAVWVEEGYNHHPHSSLENETPANRFQKDT 320 Query: 304 RQYSGNTTPPEYDEGVM---VRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED-GSY 359 R+ + E E + R+VD +G + ++G G + + V L+ D S Sbjct: 321 RRLRFASL-EECREAFLWEASRRVDKTGCIKLEGRFYEIGLEWIRKTVDLRYDPFDLESI 379 Query: 360 EVWWYSTKVGV 370 E W+ K G+ Sbjct: 380 EFWYNGQKQGL 390 >UniRef50_Q3SW20 Helix-turn-helix, Fis-type n=112 Tax=Bacteria RepID=Q3SW20_NITWN Length = 785 Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 87/303 (28%), Positives = 132/303 (43%), Gaps = 30/303 (9%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 S + E + Q + + GI AT Y+W R+ G L D P NR Sbjct: 465 SEKAEIIALVEQSHLPAKRTLDKLGIPRATFYRWYDRYRAGGIEALADHRSRPDRVWNRI 524 Query: 74 SDDITA-LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG--- 129 DD+ ++ +A + E R++ D+ + ++V+ L+ H L+ SP Sbjct: 525 PDDVRGQIIDLALELPE-LSPRELAVRFTDERKYFVSEASVYRLLKAHDLI--TSPAYVV 581 Query: 130 IPATGRFEHD--APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF----SLCLAHCTDER 183 I A F+ A N+LWQ DF G G + T+LDD SR+ L C + Sbjct: 582 IKAANEFKDKTTAANQLWQTDFTYLKITGWGWYYLSTVLDDFSRYIVAWRLGPTMCASDV 641 Query: 184 RETVQQQL-------VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 T+ Q L VSV +R R+ DNGS + L WL ++ Sbjct: 642 TATLDQALAASGLDHVSVRQR----PRLLSDNGSSY-----VADDLATWLRAKDMQHVRG 692 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 PYHPQTQGK+ER+H++LK +L ++ +L+R + YN +R HE++ P Sbjct: 693 APYHPQTQGKIERWHQTLKNRILLENYYL-PDDLKRQVAAFVEHYNHDRYHESIGNVTPA 751 Query: 297 SRY 299 Y Sbjct: 752 DVY 754 >UniRef50_C2GDW5 IS3514a transposase n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GDW5_9CORY Length = 322 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 66/221 (29%), Positives = 95/221 (42%), Gaps = 10/221 (4%) Query: 45 YKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD----ITALLRMAHDRHERWGARKIKRWL 100 Y L+R+ + G ++ R P P + S++ I + R + G I L Sbjct: 36 YTLLRRYEEGGPEAVKPRSTAPKTHPTKVSEEVIKQIIKIRRELASKGADNGPETIAWVL 95 Query: 101 EDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGG 158 E +G PA ST+ ++ ++G++ P RFE PN WQ D G Sbjct: 96 EQRGFHAPAESTIRRILTKNGMVTPQPKKRPKAYLRRFEATLPNECWQADVTSTRLLNGQ 155 Query: 159 RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG----SPWGD 214 L LDDHSRF L L TV ++ ++YG P DNG + Sbjct: 156 VVEILDFLDDHSRFLLYLGAYKRVAGPTVVTAAETITKKYGFPQSTLTDNGLVFTARLAG 215 Query: 215 TTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 G E +L + I + R HPQTQGK+ERFH++LK Sbjct: 216 AKGGKNGFEKFLEKHSILQKNGRAGHPQTQGKIERFHQTLK 256 >UniRef50_C1F5F4 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F5F4_ACIC5 Length = 296 Score = 90.9 bits (224), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 64/214 (29%), Positives = 99/214 (46%), Gaps = 13/214 (6%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-LPGASPGIPATG------RFEHDAPN 142 R+G RKI+ L +G + + V+ L GL L P +F+ AP+ Sbjct: 60 RYGYRKIRVLLNREGWNVGRY-LVYPLYCEEGLCLQRMRPAGKHKASRSRAEKFKATAPD 118 Query: 143 RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPD 202 + W MDF GG R LT++D ++R ++ + + E V + L V + G+P Sbjct: 119 QAWSMDFVSDQLQGGTRFRSLTIVDVYTREAVVIEAGQSLKGEDVVRTLNRVKQERGVPK 178 Query: 203 RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK 262 + DNGS + T A++LW R ++ SRP P +E F+ + ++E L Sbjct: 179 ILFCDNGSEF-----TSQAMDLWAYRNNTKIDFSRPGKPTDNAFVEGFNGTFRSECLNTH 233 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 WFAD E + + WR YN RPH +L P Sbjct: 234 WFADLREAKVLIEAWRKEYNESRPHASLADRTPS 267 >UniRef50_C7RJ38 Integrase catalytic region n=5 Tax=Proteobacteria RepID=C7RJ38_9PROT Length = 441 Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 82/294 (27%), Positives = 130/294 (44%), Gaps = 29/294 (9%) Query: 29 NIRSLCRRFGISPATGYK---------WLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 ++R L R + P T + W R+ G GL + R ++ S + A Sbjct: 32 SLRELATREYVIPGTDRRLLGEKTIEGWYYRYRARGLDGLIPKVR-ADRGQSKLSASVQA 90 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGL--LPGASPGIPATG 134 + A + R R+I+R LE G + S++H L+ +HGL LPG S +P Sbjct: 91 AILAAKRENPRRSIRQIQRVLEIGGIVARGTLSRSSLHRLLQQHGLSRLPG-SASLPEEK 149 Query: 135 R-FEHDAPNRLWQMDFK--GHFPFGG--GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 R F LW D P GG G+ + ++L DD SR A C E ++ Sbjct: 150 RSFVAACAGELWYSDVMHGPRVPIGGRLGKSYLVSLFDDASRLVAHGAFCRGETALDIEG 209 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L + G+P ++ +DNG+ + T L+ RLGI + H RPY P+++GK+ER Sbjct: 210 VLKQALLKRGVPVKLVVDNGAAYVAQT-----LQGICARLGIVLVHCRPYAPESKGKIER 264 Query: 250 FHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 +HR+ + + L + + +L W PH+ L+ P +RYQ Sbjct: 265 WHRTCRDQFLSEVEERHVLSLDDLNARLWAWLEQVYHRTPHDGLEGQTPLARYQ 318 >UniRef50_D1TPC7 Putative transposase integrase n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1TPC7_9BURK Length = 121 Score = 88.2 bits (217), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 44/120 (36%), Positives = 69/120 (57%), Gaps = 1/120 (0%) Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 ++ G F D QRAFD WR+VYN +RPH+ALDMA P +RY+PS R Y P EY E Sbjct: 1 MIAGHHFKDLPSAQRAFDAWRSVYNHQRPHQALDMATPVTRYRPSPRAYPEILPPIEYGE 60 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKE-MQEDGSYEVWWYSTKVGVIDLKKK 376 +V++V +G+L + A V ++E + E+ Y++++ + G I+L ++ Sbjct: 61 NDIVQRVGWNGELRFRKRRFKVSSALHNLPVAIRERVGEENCYDLFFAHHRFGTINLNQQ 120 >UniRef50_C1F9W4 ISAca1, transposase n=2 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9W4_ACIC5 Length = 310 Score = 87.8 bits (216), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 89/312 (28%), Positives = 126/312 (40%), Gaps = 31/312 (9%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T R + G ++ F +S T KW++R+ EG+ GL+DR PH Sbjct: 6 NARLTPYSREQLARKVICTGCTLKLAAASFNVSAKTAGKWVRRYRAEGSDGLRDRSSRPH 65 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 SP R + + L + R +I R ++ L L P Sbjct: 66 RSPRRLPEALR--LSVIELRRGYMPGYQIARRSAVSVSSVSRILRRARLSRWRDLNPP-- 121 Query: 128 PGIPATGRFEHDAPNRLWQMDFKGHFPFG-----------GGRCHPLTL-----LDDHSR 171 P R+EH AP L +D KG FG G + HP L +DDHSR Sbjct: 122 ---PPVVRYEHAAPGDLLHLDIKGMTRFGEVSLRGDGRLRGKKEHPGFLALHVAVDDHSR 178 Query: 172 --FSLCLAHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMR 228 F+ LA E V F +G+ R + DNGS + + Sbjct: 179 MVFAQMLADQKAETTIGFLHAAVEFFASHGIGIRALLTDNGSSYRSRQ-----FRQACQQ 233 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 + I+ +RPY P+T GK ERF ++ E K + DS + + W YN ERPH Sbjct: 234 MAIKHSRTRPYTPRTNGKAERFIQTAMREWAYAKHWTDSSQRDQHLQSWIHYYNHERPHG 293 Query: 289 ALDMAVPGSRYQ 300 +L+ P SR Q Sbjct: 294 SLNYKPPISRSQ 305 >UniRef50_B1KCL6 Integrase catalytic region n=100 Tax=Proteobacteria RepID=B1KCL6_BURCC Length = 316 Score = 87.4 bits (215), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 96/319 (30%), Positives = 138/319 (43%), Gaps = 36/319 (11%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T + R E V ++ G++++ G++ T KWL R+ GA L D P Sbjct: 6 NARLTFARRLEMVQEITEFGSSVQQAAADHGVTAPTVRKWLGRYLVGGAPALADASSRPA 65 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL--LPG 125 SP RS TALL + E R ++R + Q + STV ++AR GL L Sbjct: 66 RSP-RSIAPATALLIV-----ELRQQRLLQRQITRQAGV--SASTVSRVLARAGLSRLSD 117 Query: 126 ASPGIPATGRFEHDAPNRLWQMDFK--------GHFPFGGGRC--------HPLTLLDDH 169 P P R+EH+AP L +D K GH G R + +DDH Sbjct: 118 LQPREPVQ-RYEHEAPGDLLHIDIKKLGRIARPGHRVTGNRRDTVDGVGWEYLFVAVDDH 176 Query: 170 SRFSLCLAHCTDERRETVQ--QQLVSVFERYGL-PDRMTMDNGSPWGDTTGTWTALELWL 226 +R + H + +R VQ + V+ + +G+ R+ DNGS + EL Sbjct: 177 ARVAFTAMHPDETKRSAVQFLRDAVAWYAGFGVRVRRLLTDNGSAFRSHEFARACQEL-- 234 Query: 227 MRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERP 286 GIR +R Y PQT GK ERF +S E + S A W+ YN R Sbjct: 235 ---GIRHKFTRAYRPQTNGKAERFIQSALREWAYAWTYQSSAHRIEALASWQHHYNWHRA 291 Query: 287 HEALDMAVPGSRYQPSARQ 305 H A+ P +R P++R Sbjct: 292 HSAIGGIAPMARL-PASRN 309 >UniRef50_A6V7Q6 Transposase n=92 Tax=Bacteria RepID=A6V7Q6_PSEA7 Length = 481 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 91/319 (28%), Positives = 130/319 (40%), Gaps = 54/319 (16%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 T R +DG + F +SP T KW R+ +EG G+QDR PH P Sbjct: 161 TPRARLRLARLIVEDGYPATIAAKMFMVSPITARKWAGRYREEGEFGMQDRSSKPHRIPG 220 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 R+ + + + R R G +I L + STVH ++ R + S Sbjct: 221 RTPEHVKKKIINLRWR-LRLGPAQIAARLG------LSTSTVHAVLVR-CRVNRLSHIDR 272 Query: 132 ATG----RFEHDAPNRLWQMDFK--GHFPFGGGR-------------------------- 159 TG R+EH P L +D G+ P GGG Sbjct: 273 VTGEPLRRYEHPHPGSLIHVDVTKFGNIPDGGGHRYVGRQQGARNKLATPGLPRGKDHKP 332 Query: 160 ----CHPLTLLDDHSRFSLCLAHCTDERRET---VQQQLVSVFERYGLP-DRMTMDNGSP 211 T++DDHSR + +DE+ T V ++ V+ F G+ +R+ DNGS Sbjct: 333 RTGTAFVHTVIDDHSRVAYAEI-WSDEQASTAVGVLERAVAWFAERGVTVERVLSDNGSA 391 Query: 212 WGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQ 271 + A + RLGIR +RPY PQT GK+ERFHR+L +++ E + Sbjct: 392 YRSH-----AWRDFCARLGIRHKRTRPYRPQTNGKIERFHRTLGDGWAYARFYGSEAERR 446 Query: 272 RAFDHWRTVYNLERPHEAL 290 A W YN R H A+ Sbjct: 447 LALPGWLHFYNHHRHHSAI 465 >UniRef50_Q12FI2 Integrase, catalytic region n=28 Tax=Proteobacteria RepID=Q12FI2_POLSJ Length = 315 Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 91/285 (31%), Positives = 116/285 (40%), Gaps = 40/285 (14%) Query: 38 GISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS-SDDITALLRMAHDRHERWGARKI 96 G+SP T KW QR+AQEG AGL DR P P RS + I +R+ R +R +I Sbjct: 34 GLSPRTARKWQQRYAQEGRAGLLDRSSRPLVCPQRSCASKIERAVRL--RRTQRLTYERI 91 Query: 97 KRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFK------ 150 E G + A + L A P R+E +P L +D K Sbjct: 92 A---ERVGLSRSAIARACKAAGAAKL--PAFQNAPPVVRYERASPGELLHLDTKKLHRFD 146 Query: 151 --GH---------FPFGGGRCHPLTLLDDHSR--FSLCLAHCTDERRETVQQQLVSVFER 197 GH P G + + + DDHSR FSL L DE L++ Sbjct: 147 KPGHRVTGDRTQNTPRAGSQALHVAI-DDHSRVGFSLLL---PDETARCACAHLLAALRY 202 Query: 198 Y---GLPDRMTM-DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 Y G+ M DNGS + L RLGIR +RPY P+T GK ERF ++ Sbjct: 203 YKALGVRVAQVMTDNGSAYKSKR-----FAKLLRRLGIRHIRTRPYTPRTNGKAERFIQT 257 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 L E + DS + W YN RPH A P SR Sbjct: 258 LLREWAYAFIYPDSDARAHELEPWMHHYNFRRPHSATSHRPPASR 302 >UniRef50_C8PWM8 Transposase B n=2 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PWM8_9GAMM Length = 271 Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 68/236 (28%), Positives = 103/236 (43%), Gaps = 12/236 (5%) Query: 67 HHSPNRSSDD-ITALLRMAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGLL 123 ++ P + DD I L D+H RWG K + L G+ V+ M + L Sbjct: 36 YYEPKLNDDDAIVDKLTELTDKHTRWGFPKCYKRLRKLGYVWNHKRVYRVYTAM-KLNLR 94 Query: 124 PGASPGIPATGRFEHDAPNRL---WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 A +P PN L W MDF R ++DD++R L + T Sbjct: 95 RKAKRRLPTRAPEPLTVPNSLDHTWSMDFMSDKLHNNSRFRTFNVIDDYNRELLGIDIGT 154 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 V + L + E +G P+++ +DNGS + T+ +T W I V + +P Sbjct: 155 SIPSLRVIRYLDQLAECHGYPNKIRIDNGSEF--TSSVFTD---WAASHSILVDYIKPGC 209 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P +ERF+RS + EVL F + E+++ D W VYN ERPH++L P Sbjct: 210 PYQNAYIERFNRSYRNEVLDCYLFNNLNEVRQLTDEWINVYNHERPHDSLGNMTPA 265 >UniRef50_B4E5J2 Transposase n=21 Tax=Proteobacteria RepID=B4E5J2_BURCJ Length = 276 Score = 84.0 bits (206), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 55/178 (30%), Positives = 76/178 (42%), Gaps = 9/178 (5%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 A N +W MDF F G R LT++D+++R L + R E V L + + Sbjct: 103 AINEIWSMDFVADALFDGRRLRTLTIVDNYTRECLAIEVDGSLRGEHVVAALTRLAQHRP 162 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 LP + DNGS + T L+ W G+ + SRP P K E F+ + E L Sbjct: 163 LPRYIKADNGSEFISKT-----LDKWAYENGVEIDFSRPGKPTDNAKNESFNGRFREECL 217 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 WF + +R + WR YN RPH AL P ARQ + P +E Sbjct: 218 NAHWFLSLEDARRKIEVWREYYNEARPHSALQWMTPAE----FARQCTDRADPARPEE 271 >UniRef50_A9G353 Putative transposase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G353_SORC5 Length = 428 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 75/226 (33%), Positives = 106/226 (46%), Gaps = 19/226 (8%) Query: 46 KWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGH 105 +WL + G GL+ +PR S+++ LL H I + L D+G Sbjct: 40 RWLYAYRSAGLDGLRPQPRSDRGFAQDLSEELRTLLLDIRREHPDASVPLILKTLVDEGR 99 Query: 106 ---TMPAFSTVHNLMARHGLLPGAS--PGIPATG-RFEHDAPNRLWQMDFKGHFP--FGG 157 T + TV L A HGL A+ G P T R++ + P LW D H G Sbjct: 100 LEATQVSEPTVRRLYAAHGLRRRAARAEGEPKTRLRWQVERPGALWHGDV-CHVTGCTVG 158 Query: 158 GRCHPLT---LLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW-G 213 G+ PL LLDD SR+ + L T E+ + V R+G PD + +DNGS + G Sbjct: 159 GKAMPLRIHGLLDDASRYVVALEAHTTEKEIDMLAMTVDALRRHGKPDALYLDNGSTYRG 218 Query: 214 DTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 D T A RLGI + H++PY P+ +GK+ERF R+L+ L Sbjct: 219 DVLKTACA------RLGITLLHAKPYDPEARGKMERFWRTLREGCL 258 >UniRef50_Q8XPL1 Isrso16-transposase orfb protein n=2 Tax=Ralstonia solanacearum RepID=Q8XPL1_RALSO Length = 269 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 52/175 (29%), Positives = 82/175 (46%), Gaps = 8/175 (4%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N +W MDF G R LT++D+++R SL + + + V + L +V ++G P Sbjct: 94 NEIWSMDFVSDALLDGQRLRALTVVDNYTRESLAIEVGQSLKGKDVVRVLDAVVAQHGTP 153 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + +DNG+ + A++ W G+ + SRP P K+E F+ + E L Sbjct: 154 QTIKVDNGTEF-----ISKAMDRWAYEHGVELDFSRPGTPTDNAKVESFNGRFRQECLNE 208 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSG--NTTPPE 314 WF + Q WR YN RPH AL A P + AR+ + ++T PE Sbjct: 209 HWFLSLEDAQSKIADWRRHYNESRPHSALQWATP-DEFARQARKSASMDDSTTPE 262 >UniRef50_C8X9D2 Integrase catalytic region n=6 Tax=Actinomycetales RepID=C8X9D2_NAKMY Length = 347 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 94/309 (30%), Positives = 132/309 (42%), Gaps = 53/309 (17%) Query: 38 GISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR---SSDDITALLRMAHDRHERWGAR 94 G++ T KW RW +G AGLQDR P SP + +D+ LR RH + G Sbjct: 51 GVARQTLAKWHARWKADGPAGLQDRSSRPVSSPGQVDAQVEDVVEYLR----RHLKLGPV 106 Query: 95 KIKRWLEDQGHTMPAFSTVHNLMARHGL-------LPGASPGIPATGRFEHDAPNRLWQM 147 + L + G T+ A ST+H ++ R G+ + G P R+E AP L + Sbjct: 107 MLAAELREFGITL-APSTIHRVLVRRGISRLRDLDVTGHQLREPVR-RYEWAAPGDLIHV 164 Query: 148 DFK--GHFPFGGG-RCHPL--------------------TLLDDHSRFSLCLAHCTDERR 184 D K G P GGG R H T +DD SR + DE+ Sbjct: 165 DVKKIGRIPDGGGWRIHGRGNDAHRASQRGQRPGYAFLHTAIDDRSRLAYT-EELADEKS 223 Query: 185 ETVQ---QQLVSVFERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 T + V F +G+ R+ DNGS + TAL + + ++RPY Sbjct: 224 VTAAGFWARAVEFFAAHGIERIHRVLTDNGSCYRGKDFN-TALGATVHK------YTRPY 276 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 PQT GK+ER+HR+L E + ++ + E A + YN RPH AL P R Sbjct: 277 RPQTNGKVERYHRTLAREWAYRQAWSCNDERAAALAGFVHRYNYHRPHTALKGKPPAYR- 335 Query: 300 QPSARQYSG 308 P+ SG Sbjct: 336 TPAVTNLSG 344 >UniRef50_C5CAM0 Transposase n=26 Tax=Actinomycetales RepID=C5CAM0_MICLC Length = 335 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 87/312 (27%), Positives = 130/312 (41%), Gaps = 51/312 (16%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 DG + F +S T W+ R+ EG AGLQD PH SP + ++ A ++ Sbjct: 22 HDGIPQAHVAAEFRVSRPTVATWVARYRAEGEAGLQDLSSRPHRSPAQLDPEVVAQIQTL 81 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL--LPGASPGIPATG-------- 134 R +W AR+I L +GH + TV + R G+ LP +P TG Sbjct: 82 R-RERKWSARRIHHHLVSEGHRV-CLRTVGRWLHRLGISRLPDLAP----TGEDLRQRPQ 135 Query: 135 RFEHDAPNRLWQMDFK--GHFPFGGG-RCHPL----------------------TLLDDH 169 + P + +D K G P GGG R H + +D Sbjct: 136 KITARGPGHMVHLDVKKIGRIPEGGGWRAHGRDSENARAAKRGPGRRVGYTYLHSAIDGF 195 Query: 170 SRFSLCLAHCTDERRETVQQ---QLVSVFERYGL-PDRMTMDNGSPWGDTTGTWTALELW 225 +R + A DER T + + F +G+ DR+ DNG+ + +TA Sbjct: 196 TRLAYTEA-LEDERAATTVSFYCRARAFFAAHGIRIDRVVTDNGNNY--RAADFTAK--- 249 Query: 226 LMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLER 285 ++ LG R RPY P+ GK+ER++R + EVL + ++ + A W YN R Sbjct: 250 VVSLGGRHHRIRPYTPRHNGKVERYNRLMVDEVLYARPYSSETARREALQVWVNHYNYHR 309 Query: 286 PHEALDMAVPGS 297 PH + A P S Sbjct: 310 PHTSCGDAPPAS 321 >UniRef50_C4KRZ4 Integrase core domain protein n=59 Tax=Proteobacteria RepID=C4KRZ4_BURPS Length = 318 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 65/237 (27%), Positives = 95/237 (40%), Gaps = 13/237 (5%) Query: 67 HHSPNRSSDD--ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-- 122 H+ R DD +T + + R+G R+I L+ G + L ++ GL Sbjct: 66 HYESRRRVDDEALTGRMMAIAAQKRRYGYRRIHVLLQRDG-CFANHKRIWRLYSKAGLSV 124 Query: 123 LPGASPGIPATGRFEH---DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 I A R PN+ W MDF G R L ++DD++R L + Sbjct: 125 RKRRRKRIAAVERTPLPLPTGPNQSWSMDFVSDGLAYGRRFRCLNVVDDYTRECLAIEVD 184 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 T VQQ L + E GLP +T+DNG + L+ W G+ + RP Sbjct: 185 TSLPGLRVQQVLARLKEMRGLPASITVDNGPEFAGKV-----LDAWAYEAGVTLSFIRPG 239 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P +E F+ + E L WF ++ + WR YN ERPH +L P Sbjct: 240 KPVENAYIESFNGRFRDECLNEHWFVSMRHAKQLIEEWRIEYNTERPHSSLGYLTPA 296 >UniRef50_C1F6R2 ISAca4, transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F6R2_ACIC5 Length = 308 Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 64/231 (27%), Positives = 100/231 (43%), Gaps = 11/231 (4%) Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL----LPG 125 P+R+++ L+++A + R+G R++ LE +G T+ V+ L A GL Sbjct: 23 PDRNAELRDELVKLARQK-PRYGYRRLHAVLERRGQTV-NVKRVYRLYAEEGLAVRRRRR 80 Query: 126 ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 G + N+ W MDF G L+++D +R L L T Sbjct: 81 KRLVRERVGEVQLIRANQEWAMDFIVDGLANGRMVRILSVVDAFTRECLALEADTSLGSG 140 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V + L + E GLP+ + DNG + T + W I + H +P P G Sbjct: 141 RVTRALDRLIEERGLPENVRSDNGPEF-----TSRRMLGWAEERKINLVHIQPGRPMQNG 195 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 +E FH L+ E L WF +++R D++R YN ERPH +L P Sbjct: 196 HVESFHGRLRDECLNVSWFRTLNDVRRTLDNYRQEYNCERPHSSLAYRTPA 246 >UniRef50_A6BYW2 Integrase, catalytic region n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYW2_9PLAN Length = 269 Score = 80.5 bits (197), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 70/266 (26%), Positives = 113/266 (42%), Gaps = 20/266 (7%) Query: 61 DRPRIPHHSPNRSSDDITALLRMAHD---RHERWGARKIKRWLEDQGHTMPAFSTVHNLM 117 ++PR + DD ALL+ D RH R+G R+I R ++ G + ++ L Sbjct: 2 NQPRSSQRYQSEPPDDEPALLKQILDLVRRHPRFGYRRIGRMIQADGWKV-NLKRIYRLW 60 Query: 118 ARHGL-LP-------GASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDH 169 R GL +P G A + N +W DF G L++LD++ Sbjct: 61 RREGLKVPRKQKKKRALGTGANACHLRRAERKNHVWCWDFIFDRTETGTTLKWLSVLDEY 120 Query: 170 SRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRL 229 +R L L E V L +F+ +G+P+ + DNGS + A+ WL ++ Sbjct: 121 TRECLVLKVDRHITSEDVINVLAELFKTHGVPEHIRSDNGSEF-----VAQAIREWLKQI 175 Query: 230 GIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEA 289 G+ + P P G E FH ++ E + + F + ++ D W+ YN RPH + Sbjct: 176 GVETLYIEPASPWENGYAESFHSRVRDEFMNCEIFENLRSARKQTDSWKEFYNEVRPHSS 235 Query: 290 LDMAVPGSRYQPSARQYSGNTTPPEY 315 L P Q S + S + TP + Sbjct: 236 LGYLTPR---QFSQQCISSSRTPSAF 258 >UniRef50_B0STB8 Putative transposase n=1 Tax=Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)' RepID=B0STB8_LEPBP Length = 150 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 42/123 (34%), Positives = 65/123 (52%), Gaps = 8/123 (6%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + + LR +FVL + Q G N LC ++GIS GYKW +R+ +EG GL D+ + Sbjct: 3 MPWKEFNIVDLRFQFVLDSFQIGGNFTELCAQYGISTQCGYKWKERFIKEGKEGLFDKKK 62 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM-----PAFSTVHNLMAR 119 P +SP + +++ + + + WGA+KI LE T P ST+ ++ Sbjct: 63 TPKNSPAKIAEETILEIIKIKNHRKFWGAKKI---LETYKKTFPNRKAPKRSTIERILKN 119 Query: 120 HGL 122 GL Sbjct: 120 VGL 122 >UniRef50_A4JLW8 Integrase, catalytic region n=9 Tax=Proteobacteria RepID=A4JLW8_BURVG Length = 277 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 59/216 (27%), Positives = 93/216 (43%), Gaps = 13/216 (6%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG------IPATGRFEHDAPNR 143 R+G RKI+ L +G+ + + ++ L GL P + R + A N+ Sbjct: 51 RYGYRKIRVLLLREGYQVSK-NRLYRLYREEGLSLRYRPNRKRRAQMSRPARAKSTAANQ 109 Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 W +DF G R LT++D +R +L + V + L + + G P Sbjct: 110 AWSLDFVADQLSNGQRFRALTIIDVFTREALAIDVGQRLSASDVVRVLDELRSKRGAPRT 169 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKW 263 + DNGS + T ++LW + + SRP P +E F+ +L+ E L W Sbjct: 170 LFCDNGSEF-----TSQVMDLWAYHHKVEIAFSRPGKPTDNAFVESFNGTLRDECLNVHW 224 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 F + + + WR YN RPH ALD VP + Y Sbjct: 225 FTSLADAREQIERWRVEYNESRPHRALD-EVPPAEY 259 >UniRef50_Q92X98 Putative transposase protein n=1 Tax=Sinorhizobium meliloti RepID=Q92X98_RHIME Length = 148 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 45/106 (42%), Positives = 54/106 (50%), Gaps = 1/106 (0%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW TM R FV ++G N R LCRRFGISP YKWL RW + G L DR R Sbjct: 10 MPWREVSTMGERRGFVRLPLEEGVNRRELCRRFGISPDMRYKWLARW-EAGDGELADRSR 68 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAF 110 PH SP R ++ + A + D H WG L + T+P F Sbjct: 69 RPHISPMRCNEAVEAEVLAMRDAHRAWGTWADDYMLTGKHDTLPYF 114 >UniRef50_B1K7U4 Integrase catalytic region n=7 Tax=Bacteria RepID=B1K7U4_BURCC Length = 282 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 54/171 (31%), Positives = 74/171 (43%), Gaps = 9/171 (5%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN +W +DF G R LT++D SR +L + R E V L + + Sbjct: 117 PNEVWSLDFVADQLADGTRLCALTVVDIFSREALAIEVGKRLRAEDVVSVLNRLVAQRRA 176 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNGS + L++W +++ SRP P +E F+ S + E L Sbjct: 177 PRFLFADNGSEFSGRL-----LDMWAYHYKVQIDFSRPGKPTDNSFIETFNGSFRDECLN 231 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTT 311 WF E +R + WR YN RPH AL PG ARQYS T Sbjct: 232 LHWFESLAEAKREIEAWRCDYNETRPHMALKELTPGE----FARQYSLRPT 278 >UniRef50_A5GAT8 Integrase, catalytic region n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GAT8_GEOUR Length = 426 Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 55/183 (30%), Positives = 81/183 (44%), Gaps = 10/183 (5%) Query: 109 AFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMD-FKGHFPFGGGR---CHPLT 164 + ST + + RH L+ G P +FE + PN LWQ D G G + + + Sbjct: 129 SLSTAYRFLHRHDLM-GKQPAPVDRRKFEAELPNDLWQSDVMHGPMLLSGDKRRKTYLIA 187 Query: 165 LLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALEL 224 +DDHSR E R GLP ++ +DNGS + +TA L Sbjct: 188 FIDDHSRLIPHGRFYLSEGVACFMSAFSDAVLRRGLPRKLYVDNGSAFRSRQLEYTAAAL 247 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLE 284 GI + H+RPY PQ +GK+ERF ++++ L E+ AF+ W Y + Sbjct: 248 -----GIALVHARPYQPQGKGKIERFFKNVRTSFLPSFKGETLEEINEAFELWLNDYYHQ 302 Query: 285 RPH 287 R H Sbjct: 303 RSH 305 >UniRef50_B4S6V0 Integrase catalytic region n=10 Tax=Bacteria RepID=B4S6V0_PROA2 Length = 282 Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 67/236 (28%), Positives = 97/236 (41%), Gaps = 27/236 (11%) Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTV-----HNLMAR----HGLL 123 S++ I LR D +RWG R++ L +G + T NLM R + Sbjct: 44 SNEPIRKRLRELADERKRWGYRRLHYLLRREGFQINHKRTERLYREENLMLRVRRRRKMA 103 Query: 124 PGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 + P R H W MDF + G R L +LD +SR L T Sbjct: 104 SESRVAPPPPERKNH-----CWAMDFMSDNLYNGRRFRVLNVLDSYSRDYLGFEVDTS-- 156 Query: 184 RETVQQQLVSVFERY----GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 +++ SV ER GLP+ +T+DNG + AL+ W R G+++ +RP Sbjct: 157 --INGKRVCSVLERIAWFKGLPELITVDNGPEF-----IGKALDAWAHRHGVKLVFNRPG 209 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P +E F+ L+ E L WF G + W+ YN RPH +L P Sbjct: 210 KPVDNTYIESFNGRLRDECLNVNWFMSLGHAREVIAEWQEDYNSVRPHSSLGTRTP 265 >UniRef50_C7NJB3 Integrase family protein n=3 Tax=Actinomycetales RepID=C7NJB3_KYTSD Length = 422 Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 73/239 (30%), Positives = 96/239 (40%), Gaps = 47/239 (19%) Query: 109 AFSTVHNLMARHGLLPGASPGIPATG----RFEHDAPNRLWQMDFK--GHFPFGGG---- 158 A STVH ++ R+ L S ATG R+EHD P + +D K G+ P GGG Sbjct: 10 APSTVHRIL-RNARLNRLSHVDRATGEPIRRYEHDHPGAMLHVDVKKLGNIPDGGGWRYV 68 Query: 159 --------------------------RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 + + T++DDH+R + H DE T LV Sbjct: 69 GRQQGEKIRASTPGKPRNKYSDPLMGKAYVHTVIDDHTRVAYAEIH-DDETAPTATAVLV 127 Query: 193 SVFE----RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 E R +R+ DNG + T EL GI +RPY PQT GK+E Sbjct: 128 RAVEWFNQRGVTVERVLSDNGGAYRSHLWRETCAEL-----GITHKRTRPYRPQTNGKVE 182 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 RFHR++ + + E + D W YN RPH A P SR QYS Sbjct: 183 RFHRTMADGWAYARCYTSEAERRGELDGWLHYYNRHRPHTACGNKPPFSRLTNVTGQYS 241 Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 48/151 (31%), Positives = 66/151 (43%), Gaps = 16/151 (10%) Query: 168 DHSRFSLCLAHCTDERRET---VQQQLVSVF-ERYGLPDRMTMDNGSPWGDTTGTWTALE 223 DHSR + H DE T V + V F +R + +R+ DNG+ + Sbjct: 278 DHSRLAYSEVH-DDETAITAVAVLHRAVDWFVDRSVIMERVLSDNGAAYRSF-------- 328 Query: 224 LW---LMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV 280 LW L + +RPY PQT GK+ER HR++ + + G+ + + D W Sbjct: 329 LWRDACEALRVTPKRTRPYRPQTNGKVERLHRTMADGWAYSRCYTSEGDRRASLDGWLHQ 388 Query: 281 YNLERPHEALDMAVPGSRYQPSARQYSGNTT 311 YN RPH A D P SR QYS T Sbjct: 389 YNQHRPHSACDNQPPFSRLINVPDQYSKGTV 419 >UniRef50_Q1GCB4 Integrase catalytic region n=29 Tax=Alphaproteobacteria RepID=Q1GCB4_SILST Length = 265 Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 58/212 (27%), Positives = 85/212 (40%), Gaps = 10/212 (4%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD-----APNRL 144 R+G R++ L +G + A T L +P + D +PN + Sbjct: 48 RFGYRRVHVLLRREGWEINAKKTYRIYKELGMQLRSKTPKRRVKAKLRDDRKEAVSPNDV 107 Query: 145 WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRM 204 W MDF G + LT++D SR+ L R E V L V +R G P + Sbjct: 108 WAMDFVHDQLATGWKLRVLTVVDTFSRYVPVLDARFTYRGEDVVATLEQVCKRTGYPATI 167 Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWF 264 +D GS + ++LW + + SRP P +E F+ +AE L WF Sbjct: 168 RVDQGSEFISKD-----MDLWAYANDVTLDFSRPGKPTDNAFIEAFNGRFRAECLNAHWF 222 Query: 265 ADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 + + WR YN ERPH A+ VP Sbjct: 223 MSLEDAAEKLEAWRRDYNEERPHGAIGNKVPA 254 >UniRef50_B4RA95 Transposase, IS1477 n=35 Tax=Proteobacteria RepID=B4RA95_PHEZH Length = 361 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 74/260 (28%), Positives = 109/260 (41%), Gaps = 13/260 (5%) Query: 51 WAQEGAAGL-QDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPA 109 ++Q A GL Q P+ ++ A LR R+G R++ LE +G +M Sbjct: 80 FSQRRACGLVQVDPKTVRRVAQPGDAEVRARLRGLAAERRRFGYRRLGILLEREGVSMNK 139 Query: 110 FSTVHNLMARHGL-LPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGGRCHPLT 164 + L GL + ATG D PN+ W +DF G R L Sbjct: 140 -KKLFRLYREEGLAVRRRRGRKRATGTRAPMALPDGPNQRWSLDFVADTLSWGRRFRILC 198 Query: 165 LLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALEL 224 ++DD +R +L L T + ++L ++ R G P + DNG T T A+ Sbjct: 199 IVDDFTREALALVVDTSIGGHRMARELDALIARRGRPATIVSDNG-----TEMTSRAMLE 253 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLE 284 W R G+ + P PQ G +E F+ L+ E L + FA+ E + + WR YN Sbjct: 254 WTNRTGVDWHYIAPGKPQQNGFVESFNGKLRDECLNEEVFANLAEARAVIERWRLDYNHV 313 Query: 285 RPHEALDMAVP-GSRYQPSA 303 RPH A P R P+A Sbjct: 314 RPHSAHGGLTPEAVRLNPAA 333 >UniRef50_B0UC72 Integrase catalytic region n=4 Tax=Alphaproteobacteria RepID=B0UC72_METS4 Length = 263 Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 48/160 (30%), Positives = 73/160 (45%), Gaps = 10/160 (6%) Query: 129 GIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 G PA G PN +W +DF F G LT++D H+R +L LA + R V Sbjct: 110 GRPAIG-----GPNEVWAIDFMSDRLFDGRPFRILTVVDCHTREALSLAPRANFRAYQVV 164 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 + L ++ G P + +DNG + L+ W+ G+ + SRP P +E Sbjct: 165 EALDALVRLRGRPKSLRVDNGPEFAGRM-----LDRWVYLNGVELYFSRPGKPTDNAYIE 219 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 F+ L+AE L WF + + + WR YN +RP + Sbjct: 220 NFNGRLRAECLNASWFLSLTDARERIEEWRPHYNKDRPRQ 259 >UniRef50_C5D6W5 Integrase catalytic region n=19 Tax=Firmicutes RepID=C5D6W5_GEOSW Length = 417 Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 70/274 (25%), Positives = 113/274 (41%), Gaps = 19/274 (6%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPH-HSPNRSSDDITALLRMAHDRHERWGARKIK 97 I+ T W R+ + G L+ + R HS S DD +L + + H Sbjct: 49 IAAKTILDWCTRYKKGGFDALKPKRRSDRGHSRRLSPDDEDHILALRKE-HPTMPVTVFY 107 Query: 98 RWLEDQGH---TMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFP 154 L +QG ++ T++ L+ +H L+ +P RF +D N LWQ D H P Sbjct: 108 EHLIEQGEIPENHISYFTIYRLLKKHNLVGKEILPMPERKRFAYDQINELWQGDL-SHGP 166 Query: 155 F-----GGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 + + +DD SRF E+ + ++ R G P R+ DNG Sbjct: 167 TIRVNGKAQKTFLIAYIDDCSRFVPYAQFFPSEKFDGLRIVTKEAVLRCGKPKRIYSDNG 226 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE---VLQGKWFAD 266 + + E+ GI + H++PY PQ++GK+ERF R+++ +L+ Sbjct: 227 KIYRSEVLQYACAEM-----GITLIHTQPYDPQSKGKIERFFRTVQTRFYPLLELDPPKS 281 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 EL F W +PH +LD P +Q Sbjct: 282 LEELNERFWRWLEEEYHRKPHASLDGKTPHEVFQ 315 >UniRef50_B0TDR5 Transposase, putative n=5 Tax=Firmicutes RepID=B0TDR5_HELMI Length = 451 Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 98/386 (25%), Positives = 155/386 (40%), Gaps = 68/386 (17%) Query: 9 ARDTMSLRTEFV---LFASQDGANIR----SLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 A D S R + + L D A R +C + GIS T ++L ++ ++G +GL+ Sbjct: 7 AEDIASQRVQLLSPLLAEGLDAARARLMKQQICEQAGISERTLRRYLSQYREKGFSGLKP 66 Query: 62 RPR--------IPH-----------HSPNRSSDDITALLRMAHDRHERWGAR----KIKR 98 + + IPH P RS I +L W + K+KR Sbjct: 67 KGKGRSRSEEAIPHALLEEAILLRREVPRRSIAQIIQILE--------WEGKAEPGKLKR 118 Query: 99 WLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFK--GHFPFG 156 + +ST H M A+ G+ A RF+ N+LW D K + P G Sbjct: 119 STLQEKLAERGYSTRHMQMY-------ANTGV-AARRFQQKHRNQLWHSDIKYGPYLPIG 170 Query: 157 ----GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW 212 + + +T DD +RF L + V+ +YG P+ + DNG + Sbjct: 171 PDGAKKQVYLVTFFDDATRFVLHGQFYPTLDQVIVEDCFRQAILKYGAPEAVFFDNGKQY 230 Query: 213 GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQR 272 W + ++GIR+ ++PY P++ GK+ERF+R++ A LQ L R Sbjct: 231 ---RTKW--MHRACAKMGIRLLFAKPYSPESTGKVERFNRTVDA-FLQEAALEKPHTLDR 284 Query: 273 ---AFDHWRTVYNLERPHEALDMAV-PGSRYQPSARQYSGNTTPPEYDEGVMV----RKV 324 F W +PH AL V P + Y+ + + P+ + RKV Sbjct: 285 LNQLFWVWLDECYQNKPHSALAGNVSPDTAYRSDKK--AVKFLDPDVVANAFLHCESRKV 342 Query: 325 DISGKLSVKGVSLSAGKAFRGERVGL 350 D SG +S +G G +F G V + Sbjct: 343 DKSGCISFEGRKYEVGLSFIGCTVDV 368 >UniRef50_UPI0001B511C3 integrase n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B511C3 Length = 319 Score = 74.7 bits (182), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 79/304 (25%), Positives = 123/304 (40%), Gaps = 32/304 (10%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R + + N+ CR FGIS Y W +R+ EG GL+ R + P SPN + Sbjct: 18 RLAVIRHVEEVTGNVAMSCRYFGISRQAYYTWYRRYQAEGVEGLRTRSKAPKTSPNATHV 77 Query: 76 DITA-LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMAR--HGLLPGASPGIPA 132 ++ ++ + + H +G KI +L+ + S V ++ R G LP + Sbjct: 78 EVVGKIIYLRQNYH--FGPEKIAMYLKRYHDVTISKSGVWRILNRLDMGRLPASQRYKRH 135 Query: 133 T---GRFEHDAPNRLWQMDFKGHFPFG-------GGR--CHPLTLLDDHSRFSLCLAHCT 180 R+E P Q+D K P GGR + T +DD +R + + Sbjct: 136 DRRWKRYEKQLPGHRVQIDVKFIEPLANTAQGRRGGRNKYYQFTAIDDCTRLRILRIYPQ 195 Query: 181 DERRETVQQQLVSVFERYGLP---DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 ++ V Q L V +R LP + + DNG+ + +A ++ GI + + Sbjct: 196 LNQKTAV-QFLDYVLQR--LPFQVEVIQTDNGAEFQ------SAFHWHVLDKGIAHTYIK 246 Query: 238 PYHPQTQGKLERFHRSLKAE---VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 P P+ GK+ER HR E +L G D+ W YN RPH L Sbjct: 247 PRTPRLNGKVERSHRIDAEEFYRLLDGVVIDDAEVFNDKLREWEDYYNYHRPHGGLGGHT 306 Query: 295 PGSR 298 P R Sbjct: 307 PYER 310 >UniRef50_Q2Y8D0 Integrase, catalytic region n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y8D0_NITMU Length = 167 Score = 74.3 bits (181), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 50/165 (30%), Positives = 75/165 (45%), Gaps = 11/165 (6%) Query: 165 LLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALEL 224 ++D+++R SL + + E V + L + R GLP + +DNGS + ++ Sbjct: 1 MVDNYTRESLAIEVGQSLKGEDVVKTLNHIATRRGLPSIIKVDNGSEF-----ISRVMDK 55 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLE 284 W GI + SRP P ++E F+ + E L WF + +R D WR YN Sbjct: 56 WAYERGIELDFSRPGKPTDNARVESFNGRFRQECLNAHWFLSLEDARRKIDEWRQYYNEM 115 Query: 285 RPHEALDMAVPGSRYQPSARQYS--GNTTPPEY---DEGVMVRKV 324 RPH AL A P + AR+ + T PE+ D G R V Sbjct: 116 RPHSALQWATPAE-FARRARENALPDRPTEPEFSTLDRGAFNRSV 159 >UniRef50_B3E8B6 Integrase catalytic region n=1 Tax=Geobacter lovleyi SZ RepID=B3E8B6_GEOLS Length = 269 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 48/157 (30%), Positives = 71/157 (45%), Gaps = 9/157 (5%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDE--RRETVQQQLVSVFERYG 199 N W MDF F G R LT++D+ SR CLA D+ + + V + + + Sbjct: 108 NDSWSMDFVADSLFNGRRFRALTVVDNWSR--QCLAIRVDQAMKGDDVVDAMSELTQIRN 165 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 P R+ +DNGS + +L+ W G+ + SRP P +E F+ S + E L Sbjct: 166 CPKRIFLDNGSEF-----ISKSLDRWAYENGVTLDFSRPGKPTDNALIESFNGSFRDECL 220 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 WF + ++ + WR YN RPH AL P Sbjct: 221 SVNWFLSMDDARQKIEDWRQEYNDFRPHTALKNLTPN 257 >UniRef50_C6VW29 Integrase catalytic region n=2 Tax=Sphingobacteriales RepID=C6VW29_DYAFD Length = 273 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 79/183 (43%), Gaps = 22/183 (12%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N +W +DF G + ++DD SR +L + T + + + L + E G P Sbjct: 107 NEVWSVDFMSDSMVGNRKFRTFNVIDDCSREALAIEIDTSLSAKRIIRTLNRIGESRGFP 166 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + DNG + T+G +T +W GI +P P G +ERF+R + VL Sbjct: 167 MAIRSDNGPEF--TSGNFT---IWCEEKGIEAKFIQPGKPTQNGYIERFNRLYREAVLDA 221 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV 321 F D ++++ W YN RPHE L GN TP E+ E ++ Sbjct: 222 YLFFDLDQVRQLTAEWIEEYNQRRPHEGL-----------------GNLTPFEWKESLVK 264 Query: 322 RKV 324 +K+ Sbjct: 265 KKI 267 >UniRef50_B8J8P0 Integrase catalytic region n=1 Tax=Anaeromyxobacter dehalogenans 2CP-1 RepID=B8J8P0_ANAD2 Length = 281 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 32/288 (11%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 +LR + ++ R CR G++P+T Y +R P R+ Sbjct: 6 ALRPAVIELGAKFAMKKRRACRVVGLAPSTLYYCSRR-------------------PERA 46 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 ++ A LR + RWG R++ L+ +GH + V L GL T Sbjct: 47 --EVRARLRDLAAQRPRWGYRRLHVLLDREGHHLN-HKLVFRLYRSEGLAVRRKRRKRIT 103 Query: 134 GRFEHDAPN-----RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 P + W MDF G + L L+D +R L + V Sbjct: 104 SSLRVVPPPPTRPRQQWTMDFTQDSLASGRQFRTLNLIDAFTRECLLIEADHSLTGARVV 163 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 + L + E +G P+ + +DNG+ + T +A++ W +R+ P P G +E Sbjct: 164 RALERLRELHGTPEVIRIDNGTEF-----TSSAVDAWAYTNQVRLDFITPGKPTENGHIE 218 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 F+ + E L WF +++R + +R YN RPH +LD P Sbjct: 219 SFNGKFRDECLNENWFISLDDVRRKVEAYRVDYNEVRPHSSLDNRTPN 266 >UniRef50_A3JW74 Transposase n=12 Tax=Proteobacteria RepID=A3JW74_9RHOB Length = 303 Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 78/303 (25%), Positives = 126/303 (41%), Gaps = 20/303 (6%) Query: 2 ESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 E M + RD R VL ++ N R CR FGI ++ Y+W + + G +GL++ Sbjct: 3 EVSMTNEERDIQ--RKLRVLQHAEKIGNARKACRYFGIGRSSFYRWRDAYQKHGESGLKN 60 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 IP + N++ +I + + R G +I +L + + V+ ++ R+G Sbjct: 61 AKSIPKNPANQTPPEIVEKV-LYLRRKYHLGPIRIVWYLARYHGIKISDAGVYRILKRNG 119 Query: 122 L--LP-GASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPL-----TLLDDHSRFS 173 L LP G T R++ P Q+D K F G R + T +DD +R Sbjct: 120 LNRLPRGTRMRKLHTKRYQKQVPGHHIQVDVK-FLTFKGKRGEKVRRFQFTAIDDATRVR 178 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMRLGIR 232 L + + + + E++ R + DNG + W +L GIR Sbjct: 179 -ALKIYEKHTQASAIDFIDHIIEKFPFRIREVRTDNGHEF-QAKFHWHVEDL-----GIR 231 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDM 292 + + PQ GK+ER HRS + Q + +L+ D W YN RPH A + Sbjct: 232 HAYIKRGTPQLNGKVERSHRSDQQAFYQLLSYKGDVDLEAKLDEWERFYNFARPHGAHNG 291 Query: 293 AVP 295 P Sbjct: 292 QTP 294 >UniRef50_B9XMR2 Putative uncharacterized protein n=4 Tax=bacterium Ellin514 RepID=B9XMR2_9BACT Length = 237 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 51/160 (31%), Positives = 71/160 (44%), Gaps = 9/160 (5%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M FV+ A + LC +FGIS TGYK L R+A +G GL R Sbjct: 1 MAWKTVTPMEEMIRFVMLARSARFTVTELCEQFGISRKTGYKHLARYAADGLQGLAQRSH 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLE-DQG-HTMPAFSTVHNLMARHGL 122 P P R+ + AL+ H+ W +K+ + LE + G + PA ST+ ++ RHGL Sbjct: 61 RPLQFPQRTDLAVEALVLAERRLHQTWEPKKLHKVLELNHGIESPPAPSTIGEILRRHGL 120 Query: 123 ------LPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFG 156 G + G PN +W +DF FG Sbjct: 121 SVKRRRKAGLYVAL-NEGLTVPTHPNHVWTVDFNFFRDFG 159 >UniRef50_A8HUC5 Transposase n=2 Tax=Alphaproteobacteria RepID=A8HUC5_AZOC5 Length = 314 Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 84/289 (29%), Positives = 116/289 (40%), Gaps = 43/289 (14%) Query: 38 GISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIK 97 GIS T YKWL R+ G A L D P P +S + A + R +R I Sbjct: 36 GISVRTAYKWLARFRAGGEAALHDASSAPGRKPRATSGETVAAIEAL--RRQRLSGPAI- 92 Query: 98 RWLEDQGHTMP-AFSTVHNLMARHGL--LPGASPGIPATGRFEHDAPNRLWQMDFK---- 150 H++ A STV ++ R GL L PA R++ P L MD K Sbjct: 93 ------AHSLGLARSTVGAILRRIGLSRLAALDEKRPAN-RYQKAMPGELIHMDTKKLGR 145 Query: 151 ----GHFPFG-----------GGRCHPLTLLDDHSRFSLCLAHCTDERRETV---QQQLV 192 GH G G C + + DD SR + DE++ TV + + Sbjct: 146 IDGIGHRITGDRTRQSNRRGTGWECLHVAI-DDASRLAYTEV-LPDEKKGTVCAFTARAL 203 Query: 193 SVFERYGL-PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 F R+G+ R+ DNGS + L G+R +RPY P+T GK ERF Sbjct: 204 GWFARHGVVTARLMTDNGSAYKSHD-----FRDLLRAAGVRHVRTRPYTPRTNGKAERFI 258 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 ++ E + S + +A W YNL RPH A + P +R Sbjct: 259 QTSLREWAYAVPYTSSRQRTQAMPGWIDTYNLNRPHSAHNGLSPWTRLN 307 >UniRef50_C6PFD7 Integrase catalytic region n=3 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFD7_CLOTS Length = 412 Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 74/311 (23%), Positives = 133/311 (42%), Gaps = 43/311 (13%) Query: 18 EFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI 77 E++ + ++ SL +R SP T WL + + G GL + R + +DD+ Sbjct: 32 EYMEVITSKVYDVPSLGKR-EFSPNTIKTWLYCYRKYGFEGLYPKSRCDKGASRVLTDDV 90 Query: 78 TALLRMAHDRHERWGARKI------KRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 A ++ + R A+ I K+++E ++ STV + R + ++ Sbjct: 91 KAYIKNLKLDNPRRSAKSIYQELLVKKFIELDKVSL---STVQRYL-RKTKISTSALNTK 146 Query: 132 ATGRFEHDAPNRLWQMDFK-GHFPFGGGR---CHPLTLLDDHSRFSLCLAHCTDERRETV 187 FE + PN WQ D G + + + + LDD SR + H + V Sbjct: 147 DRRSFEMEYPNDCWQSDISMGPYLIINDKKIKTYLIAFLDDSSRL---ITHAEFYDTDNV 203 Query: 188 QQQLVSVFERY----GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 L+ F++ G+P ++ +DNG + L L LG + ++ PY P++ Sbjct: 204 IS-LIDAFKKAVSKRGVPKKLFVDNGKVFQSE-----QLHLICASLGTSLCYAEPYSPES 257 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +GK+ERF R+LK + + G FD W+ + +++ +E L+ + G +Q Sbjct: 258 KGKIERFFRTLKDQWMYG------------FD-WQKISSIDELNENLNKYIEGIYHQ--T 302 Query: 304 RQYSGNTTPPE 314 S N P E Sbjct: 303 VHSSTNMKPIE 313 >UniRef50_Q8NL32 Predicted transposase n=7 Tax=Corynebacterium RepID=Q8NL32_CORGL Length = 500 Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 75/287 (26%), Positives = 113/287 (39%), Gaps = 17/287 (5%) Query: 18 EFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI 77 +F FA + +I C R IS + Y R+ Q+ A L P + + I Sbjct: 34 DFDPFAP-NSPSIEEFCSRLKISRRSFYNIRNRYQQDANAALHPHSSAPITARRTYDESI 92 Query: 78 TALLRMAHDRHE----RWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGLLPGASPGI 130 T+ L R + +G I+ G +P+ ST+ L+ G + Sbjct: 93 TSTLLSIRARLKAQGWEYGPISIRFEGISTGELTAPIPSVSTIARLLRAAGAVESNPKKR 152 Query: 131 PATG--RFEHDAPNRLWQMD---FKGHFPFGGGRCHPLTLLDDHSRFSL-CLAHCTDERR 184 P + RF+ +WQ+D + H R +LDD +RF + +E Sbjct: 153 PKSSVVRFQRGQAMEMWQIDGFIYTLH-DTDLTRVTIYQILDDATRFDVGTCVFPANENS 211 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSRPYHPQT 243 + L +G P + DNGS + G +LE +L +G +P HPQT Sbjct: 212 VDARTALEQAIAHFGAPHELLSDNGSAFNRMRQGYVGSLESYLATVGCLSITGKPGHPQT 271 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 QGK ER HR+L LQ E + +R YN RPH+ L Sbjct: 272 QGKNERSHRTL-FRFLQAHQPHTLEECAHYIEQFRDHYNNRRPHQGL 317 >UniRef50_C0QA21 Transposase /integrase family protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QA21_DESAH Length = 402 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 83/328 (25%), Positives = 133/328 (40%), Gaps = 28/328 (8%) Query: 40 SPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRW 99 SP T KW R+ G L + PR + I L + H RW ++ Sbjct: 53 SPDTLKKWFYRYRNGGLPALNNSPRKDIGTHGTIPQTIVDRLFKLREEHPRWTLSRMLDQ 112 Query: 100 LEDQG---HTMPAFSTVHNLMARHGLL--PGASPGIPATGRFEHDAPNRLWQMDFKGHFP 154 L + PA ST++ L P + +PA F + +LW DF H P Sbjct: 113 LVQENLWDKKSPARSTLYRFAQTANLKRDPHLAAHVPARP-FAYSFFGQLWMADFL-HGP 170 Query: 155 F----GGGRCHPL-TLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 G R L ++DD +R+ + T E E + +L++ +G P R DNG Sbjct: 171 KIREKGKKRKTYLHAIIDDATRYIVHAGFFTAESTEVMMAELMASVRTHGKPIRFYTDNG 230 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 + + L+ LGI + H+ P P+ +GK+ERF RS++ + L GK Sbjct: 231 ACYASK-----HLKFVCANLGIHLIHTPPGKPRGRGKVERFFRSVRDQFLDGKKAPAKTL 285 Query: 270 --LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVR----- 322 L +AF W Y+ +R H +L ++ R + Q + P + + R Sbjct: 286 DGLNKAFREWVASYH-KRIHSSLGISPLQKRL---SHQSACKALPETVEIEPLFRMKRRC 341 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGL 350 KV ++ + +K A G+RV + Sbjct: 342 KVYLNNTIRLKRRIYEVIDALPGQRVDV 369 >UniRef50_C1A8I3 Putative transposase orfB for insertion sequence element n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A8I3_GEMAT Length = 295 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 48/158 (30%), Positives = 76/158 (48%), Gaps = 14/158 (8%) Query: 139 DAP---NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD---ERRETVQQQLV 192 DAP N +DF + G R L +LD+ +R +L + T R +V +QL+ Sbjct: 122 DAPPQLNHTRALDFMHDMLYDGRRFRTLNVLDEGNREALAIEVSTSLPGTRVVSVLEQLL 181 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 ++ +G P + DNG AL W + G+R+ H +P P +ERF+R Sbjct: 182 AI---HGAPCTIRCDNGPELISH-----ALTTWCEQHGVRLQHIQPGKPNQNAYIERFNR 233 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 + + EVL FA +++ + W YN ERPH++L Sbjct: 234 TYRREVLDAYIFASLAQVRAETETWLMTYNTERPHDSL 271 >UniRef50_A1JLT7 Transposase for insertion element IS1222 n=8 Tax=Yersinia RepID=A1JLT7_YERE8 Length = 249 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 46/157 (29%), Positives = 65/157 (41%), Gaps = 5/157 (3%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 APN W MDF G R LT +DD ++ L + V + L S+ G Sbjct: 78 APNLTWSMDFVMDALATGRRIKCLTCVDDFTKECLTVTVAFGISGVQVTRILDSIALFRG 137 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 P + D G + T AL+ W G+ + +P P G +E F+ + E L Sbjct: 138 YPATIRTDQGPEF-----TCRALDQWAFEHGVELRLIQPGKPTQNGFIESFNGRFRDECL 192 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 WF+D ++ WR YN RPH AL+ P Sbjct: 193 NEHWFSDVSHARKTISEWRQDYNECRPHSALNYQTPS 229 >UniRef50_A4TG41 Integrase, catalytic region n=32 Tax=Actinomycetales RepID=A4TG41_MYCGI Length = 522 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 79/283 (27%), Positives = 114/283 (40%), Gaps = 33/283 (11%) Query: 21 LFASQDGANIRSLCRRFGISP---------ATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 L Q G +R + R I P AT +W++R+ G L PR Sbjct: 58 LSTKQRGKLVREIADRRHIDPFGAQVQVARATLDRWIRRYRTGGFEALVPEPR---RLGT 114 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 R+ + L + ++ R L P+ ST+ R L+ G + G P Sbjct: 115 RTDTQVLELAVSLKRENPARTVAQVARILRTATGWAPSESTLLRHFHRCELM-GPTAGQP 173 Query: 132 AT--GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 GRFE PN LW D G + + LDDHSR L + H +TV+ Sbjct: 174 GEVFGRFEAADPNELWVGDALHGPRVGDRKTYLFAFLDDHSR--LVVGHRFGFAEDTVRL 231 Query: 190 QLVSVFERY--GLPDRMTMDNGSPWGDTTGTWTALELWLMR----LGIRVGHSRPYHPQT 243 G+P + +DNGS + D WL+R LGIR+ HS P PQ Sbjct: 232 AAALKPALAARGVPASIYVDNGSAFVDA---------WLLRACAKLGIRLVHSAPGRPQG 282 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRA-FDHWRTVYNLER 285 +GK+ERF R+++ + L + +L A DH + L R Sbjct: 283 RGKIERFFRTVRDQFLVEVTDTSAEDLTAAGVDHRGALLELNR 325 >UniRef50_A3ZSC8 Transposase orfB n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZSC8_9PLAN Length = 281 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 57/218 (26%), Positives = 91/218 (41%), Gaps = 20/218 (9%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-----------LPGASPGIPATGRFEH 138 R+G R I R L +G + F V+ L R GL L GI R + Sbjct: 43 RYGYRMITRLLRQEGWQV-NFKRVYRLWRREGLKVPVKQAKKRRLGTVDGGI---TRRQA 98 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + PN +W +DF G L+L+D+ +R + L + + + L +F Sbjct: 99 ERPNHVWSIDFIFDRTENGRPLKILSLVDEFTRECIALEVNRKFTGDHLVELLADLFAIR 158 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 G+P+ + DNG + ++ +L ++ + + + P P G +ERFH L+ E Sbjct: 159 GVPEFIRSDNGPEFISRR-----VQKFLEKIDVGMSYIEPGSPWQNGYVERFHSRLRDEC 213 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 L + F E + WR YN RPH +L P Sbjct: 214 LACELFTTLAEARTVIAAWRQTYNHRRPHSSLGGQTPA 251 >UniRef50_A0AXB8 Integrase, catalytic region n=27 Tax=Betaproteobacteria RepID=A0AXB8_BURCH Length = 279 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 73/295 (24%), Positives = 111/295 (37%), Gaps = 47/295 (15%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGA------AGLQDRPRIPHHS 69 R E + ++ G + R CR G+S L++ ++ + A Q+ PR + Sbjct: 7 RREALEVLTRRGLSQRKACRYLGLSRRVAIYTLKQPEKDRSLGERLIAASQEVPRFGYRR 66 Query: 70 PNRSSDDITALLRMAHDRHER-WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 I+A L + R R W A K+ +P LPGA+ Sbjct: 67 -------ISAWLSLGESRVRRMWRALKL---------NIPKRRPRRRRCGSDIRLPGAT- 109 Query: 129 GIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 PN +W DF G L ++D+++R L + R + V Sbjct: 110 -----------KPNSVWSYDFVHDQLVDGRVLKMLCVIDEYTRECLAIEVGASLRSQDVI 158 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 L + YG P + DNG+ + T + WL I P +P G +E Sbjct: 159 LVLSRLMRLYGKPAFIRSDNGAEF-----TAAKVMRWLRDAAIGPAFITPGNPWQNGFVE 213 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 F+ L+ E+L +WF E + + WR YN RPH A RYQP A Sbjct: 214 SFNGKLRDELLNREWFRSRAEAKVLIERWRQFYNERRPHSA-------HRYQPPA 261 >UniRef50_A2DEY0 Integrase core domain containing protein n=11 Tax=Trichomonas vaginalis RepID=A2DEY0_TRIVA Length = 324 Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 55/200 (27%), Positives = 88/200 (44%), Gaps = 16/200 (8%) Query: 136 FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 +E P+ +W +D HF +G R ++DD SRF L L ++ E+ + Sbjct: 138 YEATKPDTIWHVDV--HFLYGSQRLPVYGIIDDKSRFLLALKILPNKSSESTTRVAEETI 195 Query: 196 ERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 Y P DNG G+ G + WLMR I++ H+ P+ P+ GK+ER SL+ Sbjct: 196 GLYQKPFCFWSDNG---GENMGQFYN---WLMRNNIQIRHTHPHMPRQNGKIERLWPSLE 249 Query: 256 AEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEY 315 V + +++ ++Q D R YN PH+ L P ++P + N TP Sbjct: 250 RNVGRS---SNAKQIQEFLDKIRKNYN-SLPHKTL----PKKGFRPMTPEECYNETPHWK 301 Query: 316 DEGVMVRKVDISGKLSVKGV 335 +V I G +K + Sbjct: 302 QGETPFWRVMIDGHFEIKEI 321 >UniRef50_Q1NW03 Integrase, catalytic region n=7 Tax=Proteobacteria RepID=Q1NW03_9DELT Length = 447 Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 60/232 (25%), Positives = 98/232 (42%), Gaps = 14/232 (6%) Query: 36 RFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARK 95 R ++ T WL+++ ++G GL + R ++ LL + + R+ Sbjct: 52 RTRVACETIRDWLKKYRKDGFNGLLPKGRNDKGRSRSLPPEVADLLIATKEENPELSIRQ 111 Query: 96 IKRWLEDQGHTMPAFSTVHNLMARHGLLP--GASPGIPATGRFEHDAPNRLWQMDFKGHF 153 + D+ PA STVH L+A GL+ G P RF + L+ D H Sbjct: 112 VIAATADRIPVQPAPSTVHALLAGKGLMKKKGEDPDSKDHRRFSYQFAGDLFMCDVM-HG 170 Query: 154 PF----GGGR--CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMD 207 P G R + + +DD +R A E R G+P R+ +D Sbjct: 171 PTVRTSGNKRRKTYLIAFIDDATRVIAFAAFAMSESTADFMTVFKQTIIRRGIPLRLFVD 230 Query: 208 NGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 NG+ + L L +LGI + H+R YH Q +GK+ER+ R+++ + L Sbjct: 231 NGAAFRSQH-----LALVCAKLGITLIHARAYHAQAKGKIERWFRTIRLQFL 277 >UniRef50_Q8PGV8 ISxac4 transposase n=3 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PGV8_XANAC Length = 274 Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 15/170 (8%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER--- 197 P+ +W +DF G R ++DD +R L + T +LV VFE+ Sbjct: 108 PDTVWSVDFMSDALACGRRFRTFNVVDDSNREVLHIEVDT----SINSHRLVRVFEQIKH 163 Query: 198 -YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 +GLP + DNG + A WL G+ + + +P P +ERF+R+ + Sbjct: 164 DHGLPQIVRSDNGPEF-----LGEAFTSWLKVNGVAIKYIQPGKPNQNAFIERFNRTFRE 218 Query: 257 EVLQGKWFADSGELQRAFDHWRTV-YNLERPHEALDMAVPGSRYQPSARQ 305 EVL F ++++A HWR + YN ERPH++L P AR+ Sbjct: 219 EVLDQHLFTCLDDIRQAI-HWRMIDYNEERPHDSLSGLTPTEYRNQHARR 267 >UniRef50_O28862 ISA0963-5, putative transposase n=5 Tax=Archaeoglobus fulgidus RepID=O28862_ARCFU Length = 357 Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 63/305 (20%), Positives = 123/305 (40%), Gaps = 24/305 (7%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P R + + +++ GA ++ + ++P Y+ +++ + G + Sbjct: 56 PLHVRKLTNKKIRWIIRQLDKGAPVKEIAAVMRVTPRRIYQLKKQYEETGQI---PELKQ 112 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P P ++ ++ A+ ++ R +++ +E + +T++ ++ +HGL+ Sbjct: 113 PGRKPKEIDEETEKIILQAYKKY-RLSPVPLEKLIERDYGIHISHNTIYKVLLKHGLVEE 171 Query: 126 ASPGIPATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 R+E LWQ D+K G + +DD SRF C Sbjct: 172 NMSKKKRRKWVRYERTHSMSLWQGDWKR-----LGEKWIIAFMDDASRFITCYGVFDSAT 226 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA---LELWLMRLGIRVGHSRPYH 240 E + L F YG+PD + D+G+ + A +L G+R +R H Sbjct: 227 TENTIRVLKVGFREYGIPDEILTDHGTQFVAAKSREKAKHRFREFLAENGVRHVLARINH 286 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQT GK+ERF ++ ++ L + D + YN +PH +L+ + YQ Sbjct: 287 PQTNGKIERFFGLMEQKI----------HLFDSLDEFIYWYNYVKPHMSLNFDELETPYQ 336 Query: 301 PSARQ 305 R+ Sbjct: 337 AFLRK 341 >UniRef50_D2MKS7 Transposase (Fragment) n=3 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKS7_9BACT Length = 327 Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 72/266 (27%), Positives = 108/266 (40%), Gaps = 49/266 (18%) Query: 50 RWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPA 109 +W Q G ++ + H + +I LL A K+KR P+ Sbjct: 77 KWKQRRRYGYIEQQVLQHRQQGVNRYEICQLL-----------APKLKRL-------TPS 118 Query: 110 FSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLT---LL 166 STV+ + ++GL P R L +D CH L+ + Sbjct: 119 PSTVYRITHQYGLNRLTPPLQQEKRRIVKQKAGELGHLD-----------CHHLSKDLMA 167 Query: 167 DDHSRFSL-CLAH----------CTDERRETVQQQLVSVF----ERYGLP-DRMTMDNGS 210 D +R+ L C+ TD + TV + F +RY L + DNGS Sbjct: 168 TDPTRYYLVCVIDACTRLAWAEVVTDLKSLTVMFSALKSFNLLHQRYQLQFAEVLTDNGS 227 Query: 211 PWGDTTGTWT-ALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 + T T E L+ LGI+ ++RPY PQT GK+ERF R+L +++ G F E Sbjct: 228 EFAARTPPATHPFERMLLELGIKHRYTRPYRPQTNGKVERFWRTLNDDLIAGTTFGSLEE 287 Query: 270 LQRAFDHWRTVYNLERPHEALDMAVP 295 + + + YN RPH+ALD P Sbjct: 288 FRDDLEQYLLYYNEGRPHQALDGKTP 313 >UniRef50_Q1DAH7 Transposase orfB, IS3 family n=29 Tax=Proteobacteria RepID=Q1DAH7_MYXXD Length = 293 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 91/230 (39%), Gaps = 21/230 (9%) Query: 77 ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFST--VHNLMARHGLLPGASPGIPATG 134 + A LR R+G R+ L +G PA + VH L + GL Sbjct: 48 LVAQLRDIARARPRFGYRRAWALLRREG---PAVNVKRVHRLWRKEGLALSRRRPRKRLR 104 Query: 135 RFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL---AHCTDERRET 186 + P N +W DF G + LT++D+HSR L + + R Sbjct: 105 LGQQRQPKPEGVNSVWAWDFVHDRCANGQKLKCLTVVDEHSRECLAIDVAGRISARRVIE 164 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 V +LV+V +G P + DNG + AL WL GI+ + P P G Sbjct: 165 VLSRLVAV---HGPPKYLRSDNGPEF-----IAKALRRWLEANGIQTAYIAPGKPWQNGT 216 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 E F+ + E L +WF+ E + WR YN +RPH +L P Sbjct: 217 NESFNGRFRDECLSAEWFSTRREAVVLIEAWRRDYNEKRPHSSLGYKTPA 266 >UniRef50_A1UD36 Integrase, catalytic region n=28 Tax=Actinomycetales RepID=A1UD36_MYCSK Length = 341 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 78/289 (26%), Positives = 112/289 (38%), Gaps = 48/289 (16%) Query: 37 FGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKI 96 G+S + W+ R+ +G AGL DR PH SP R++ + + +A R R G +I Sbjct: 45 MGVSRKCVHTWISRFEADGEAGLIDRSSRPHTSPMRTAQRLENQI-VAWRRRHRCGPEEI 103 Query: 97 KRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG-----------RFEHDAPNRLW 145 L TV ++ R G P P TG R+E P L Sbjct: 104 GAKLGVSA------RTVSRVLHRRGA-PYLRDCDPMTGQVIRASKSTAVRYERGRPGELV 156 Query: 146 QMDFK--GHFPFGGG------RCHP-------------LTLLDDHSRFSLCLAHCTDERR 184 MD K G P GGG C P +L+DDHSR + DE+ Sbjct: 157 HMDVKKLGRIPDGGGWRAHGRGCAPDRKRLRGNGFDYIHSLVDDHSRLAYSEI-LPDEKG 215 Query: 185 ETVQ---QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 T ++ F +G+ + + W +L +LG R RP+ P Sbjct: 216 STCAGFLERAAHYFRAHGITTIEQVMTDNAWAYRY----SLRDVCTQLGARQIFIRPHCP 271 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 GK+ER +R+L+ E + F + A W YN +R H AL Sbjct: 272 WQNGKVERLNRTLQTEWAYKRVFTSNAHRAAALAPWLKHYNTQRRHSAL 320 >UniRef50_B3PKR5 Transposase n=7 Tax=Gammaproteobacteria RepID=B3PKR5_CELJU Length = 280 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 47/156 (30%), Positives = 68/156 (43%), Gaps = 5/156 (3%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N W +DF + G R L +LD+ +R L + T E V + L + GLP Sbjct: 116 NHQWALDFMHDSLYCGKRFRTLNVLDEGTRECLAIEVDTSLPAERVVRALEQIKVERGLP 175 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 ++ +DNG L W GIR+ + P PQ G +ERF+ S + E L Sbjct: 176 TQLRVDNGPELISAR-----LTDWCEENGIRLVYIEPGKPQQNGFVERFNGSFRREFLNA 230 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 F +++ WR YN ER HE+L P + Sbjct: 231 YLFESLTQVREMAWFWRMDYNEERTHESLGHLPPAA 266 >UniRef50_B8ER74 Integrase catalytic region n=103 Tax=Bacteria RepID=B8ER74_METSB Length = 309 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 42/156 (26%), Positives = 65/156 (41%), Gaps = 5/156 (3%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN +W DF G + L L+D+ +R +L + V + L + G Sbjct: 122 PNHVWSYDFVADRTQDGRKFRMLCLIDEFTREALAIQVKRRLNATDVLETLADLMILRGT 181 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG + AL W+ +G + + P P G E F+ +L+ E+L Sbjct: 182 PAYVRSDNGPEF-----IAVALREWIAAVGSKTAYIEPGSPWENGACESFNSNLRDELLN 236 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 G+ F E Q + WR +N RPH +L P Sbjct: 237 GELFFSPAEAQAMIEAWRRHFNAVRPHSSLGYRSPA 272 >UniRef50_A3YV04 Transposase n=3 Tax=Synechococcus sp. WH 5701 RepID=A3YV04_9SYNE Length = 312 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 88/317 (27%), Positives = 127/317 (40%), Gaps = 40/317 (12%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T R + +G +++L + GIS + YKWL R+ G L DR Sbjct: 6 NARLTPISRERLIRRHLNEGEPLKALAAQAGISLRSAYKWLARFRDGGVTALADR--RSV 63 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL--LPG 125 R + D L + RH+R R+I + L+ S+V M GL L Sbjct: 64 RRTQRRTLDPQQLQQAVDLRHQRCTLRRIAKALK------APLSSVGRAMNALGLGRLRN 117 Query: 126 ASPGIPATGRFEHDAPNRLWQMDFK--------GHFPFGGGR--CHP-------LTLLDD 168 P P R++ + P + +D K GH G R C P +DD Sbjct: 118 LEPKKPVQ-RYQWERPGDMIHVDTKQLARFERVGHRITGDRRQGCSPGAGYEKVHVAIDD 176 Query: 169 HSRFSLCLAHCTDERRETV--QQQLVSVFERYGLPDRMTM-DNGSPW--GDTTGTWTALE 223 +R + ++R TV + V F G+ R + DNG + GD AL+ Sbjct: 177 ATRLAYVEVLADEQRATTVGFLARAVGWFSEQGITCRRILSDNGPAYRSGDWRKACQALD 236 Query: 224 LWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNL 283 L +R ++PY PQT GK ERF ++L AE + S E R + +YN Sbjct: 237 LKPIR-------TKPYTPQTNGKAERFIKTLLAEWAYVMAYQTSEERNRWLPRYLGIYNG 289 Query: 284 ERPHEALDMAVPGSRYQ 300 R H AL P Q Sbjct: 290 HRCHMALGGLTPQQSLQ 306 >UniRef50_C3LLF8 IS1627, transposase n=27 Tax=Bacillaceae RepID=C3LLF8_BACAC Length = 274 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 65/287 (22%), Positives = 115/287 (40%), Gaps = 38/287 (13%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 +D +I+ +C GI +T Y+W + A L+ A+L + Sbjct: 2 KDEYSIKEICILIGIPRSTYYRWKNKEKDVKEAKLEQ-----------------AILTIC 44 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-----------LPGASPGIPA- 132 H R+G RK+ L+ + + P TV +M + L + G S + Sbjct: 45 MTNHFRYGHRKVTALLKRKYNYHPNRKTVQKIMQKKNLQCRVKRKRRTWINGESRIVVEN 104 Query: 133 --TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 F+ + PN W D + PFG + L+++D ++ + +A+ R++ V Sbjct: 105 LLNRNFQANKPNEKWVTDI-TYLPFGTEMLYLLSIMDLYN--NEIIAYEISNRQD-VTLV 160 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 L +V + L + + S G ++ A + + GI SR + +E F Sbjct: 161 LRTVEKAIKLQQKTQIILHSDQGAVYTSY-AFQTLSKKNGITTSMSRKGNCHDNAVIESF 219 Query: 251 HRSLKAEVL--QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 H SLK+E+ Q K + L++ + YN ER E L+ P Sbjct: 220 HSSLKSELFYSQEKQIHSTSTLKQLIHDYIEYYNTERIQEKLNYLSP 266 >UniRef50_C1F2E9 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2E9_ACIC5 Length = 309 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 63/249 (25%), Positives = 102/249 (40%), Gaps = 22/249 (8%) Query: 57 AGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNL 116 A ++ RPR + +++ + LR D RWG R++ L+ +G + + V+ + Sbjct: 34 AMVRYRPRASRYEA--ANEKLRKRLRELADERRRWGYRRLHILLKREGWKVNS-KRVYRI 90 Query: 117 MARHGLLPGASPGIP---ATGRFEHDAPNRL---WQMDFKGHFPFGGGRCHPLTLLDDHS 170 L+ A R P RL W MDF G + L++ D ++ Sbjct: 91 YVEEKLVVRRRRRRRRVCAQARVPLLPPTRLNETWTMDFLHDALANGRKLRTLSIEDAYT 150 Query: 171 RFSLCLAHCTDERRETVQQQLVSVFERY----GLPDRMTMDNGSPWGDTTGTWTALELWL 226 R L + T ++V V ER GLP+R+ +D+G T T L+ W Sbjct: 151 REMLAIEVDTS----LPALRVVRVLERLRLERGLPERIVIDHG-----TEFTSKLLDQWA 201 Query: 227 MRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERP 286 + + + P P G +E FH + E L WF + ++ + WR YN RP Sbjct: 202 YKNQVTLHFITPGLPMENGYIESFHGKFREECLNEHWFLMLDDARQTIESWRIDYNWVRP 261 Query: 287 HEALDMAVP 295 H +L P Sbjct: 262 HSSLGYLTP 270 >UniRef50_A3PPM4 Integrase, catalytic region n=5 Tax=Rhodobacteraceae RepID=A3PPM4_RHOS1 Length = 348 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 69/305 (22%), Positives = 118/305 (38%), Gaps = 41/305 (13%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 VL +++ N+ CR+ G+ + Y+W +R+ +G GL+D P I P + + A Sbjct: 26 VLELAKELGNVAEACRQRGLDRTSFYEWKRRFQTQGFEGLKDLPPIHKSHPQSTPPETVA 85 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-----------LPGASP 128 ++ H +G + + L +G + + T+ ++ +GL A Sbjct: 86 RIKTLALAHPAYGCNRFEAMLALEGIRVSSI-TIQKILNENGLGTKSDRWLALEQANAEK 144 Query: 129 GIPATGR----------------FEHDAPNRLWQMD--FKGHFPFGGGRCHPLTLLDDHS 170 I T E AP L D F G G GR + ++D Sbjct: 145 RIELTAEQAAFIEKLNPCFRERHVESSAPGELLSADTFFVGALK-GIGRVYLHAVVDTFG 203 Query: 171 RFSLCLAHCTDERRETV---QQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWL 226 ++ H + + V ++ + LP + DNG + T EL+L Sbjct: 204 SYAFGFLHVSKQPEAAVAVLHNDVLPFYRNLDLPVGAVLTDNGREFCGT--ERHPYELYL 261 Query: 227 MRLGIRVGHSRPYHPQTQGKLERFHRSLKAE----VLQGKWFADSGELQRAFDHWRTVYN 282 GI +R P+T G +ERF+ ++ E ++ ++ LQ D W YN Sbjct: 262 DLNGIEHRRTRVRTPKTNGFVERFNGTILDEFFRVAMRDNFYESVEALQADLDAWLVHYN 321 Query: 283 LERPH 287 ERPH Sbjct: 322 TERPH 326 >UniRef50_C0WLI3 Transposase n=14 Tax=Corynebacterium RepID=C0WLI3_9CORY Length = 497 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 72/292 (24%), Positives = 118/292 (40%), Gaps = 18/292 (6%) Query: 13 MSLRTEFVLFAS-QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 ++LR + F ++G ++ C+ G+S T Y R A+ G AG+ P + P Sbjct: 5 ITLRKKIADFDPIREGITVQQFCKNIGVSKQTYYNIKARIAERGRAGIVPDSTAPLN-PR 63 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRW------LEDQGHTMP-AFSTVHNLMARHGLLP 124 R DD + R + W L++ G+ P + + + + G+ Sbjct: 64 RVYDDKIRQQVLQARGTLRARGQDCGPWSIFYFFLDELGYDQPPSRALIAQWLHEAGVAD 123 Query: 125 GASPGIPATG--RFEHDAPNRLWQMDFKGH--FPFGGGRCHPLTLLDDHSRFSL-CLAHC 179 + P F N LWQ+D + F + ++DD SRF + A Sbjct: 124 INARKRPRKSYRHFARGEVNELWQIDAFAYRLFDVPHTQVTIYQVVDDASRFDVGSQAFG 183 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSRP 238 T E + L + YGLP + DNG + G + E WL LG++ S Sbjct: 184 TPENGTDARITLSGAIDAYGLPQEVLSDNGDAFATYHRGRLSQTERWLASLGVQ--SSAG 241 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 + P TQGK ER H+++ L + ++Q+ +R YN R H+ L Sbjct: 242 FAPTTQGKDERSHQTM-TRFLDARTPTTLAQVQQLIVDYRNFYNTRRRHQGL 292 >UniRef50_Q2CG00 Integrase, catalytic domain n=8 Tax=Rhodobacterales RepID=Q2CG00_9RHOB Length = 340 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 60/215 (27%), Positives = 89/215 (41%), Gaps = 13/215 (6%) Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-----LPGASPGIPATGRFEHDAP 141 + R+G R+I LE +G M ++ L GL T E P Sbjct: 43 KRRRFGYRRIGILLERKGMLM-NHKKLYRLYREEGLSVKRRGGRKRARGSRTPMPEAAHP 101 Query: 142 NRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 W +DF FG R L ++DD R +LCL T V ++L ++ YG Sbjct: 102 KARWSLDFLAD-SFGASRKFRILAVIDDCCRENLCLTADTSISGARVARELDALVRIYGT 160 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG+ + T A+ W + G+ + P PQ +E F+ SL+ E+L Sbjct: 161 PACIVSDNGTEF-----TSRAILKWADKNGVPWHYIDPGKPQQNAFIESFNGSLRDELLY 215 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 + F + +R WR YN RPH +L P Sbjct: 216 EEIFVTLEDARRKLALWRYDYNAVRPHSSLGNQTP 250 >UniRef50_B4RV10 Integrase, catalytic region n=16 Tax=Proteobacteria RepID=B4RV10_ALTMD Length = 267 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 46/159 (28%), Positives = 70/159 (44%), Gaps = 5/159 (3%) Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + N W +DF + G R L ++D+ +R L + T V + L + Sbjct: 113 EQANYQWALDFMHDTLYCGKRFRTLNVVDEGTRECLAIEVDTSLPAGRVVRVLEQLKTER 172 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 GLP ++ MDNG T L W I + + +P PQ G +ERF+ S + E Sbjct: 173 GLPKQLRMDNGPELISAT-----LTDWCQNHNIELLYIQPGKPQQNGFVERFNGSFRREF 227 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 L F + G+++ WR YN ER HE+L P + Sbjct: 228 LDAYLFENIGQVREMSWFWRLDYNEERTHESLGNLPPAA 266 >UniRef50_Q1BK79 Integrase, catalytic region n=37 Tax=Proteobacteria RepID=Q1BK79_BURCA Length = 339 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 77/274 (28%), Positives = 112/274 (40%), Gaps = 25/274 (9%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR--SSDDITALLRMAHDRHER 90 +C R GIS T KWL+R+ + G GL+ + R P SPNR S D +LR+ +R + Sbjct: 18 VCTRCGISRPTLRKWLRRYQEAGEEGLRSQSRRPLTSPNRKVSDADRATILRLRAER--K 75 Query: 91 WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA-SPGIPATGRFEHDAPNRLWQMDF 149 GAR+I+ L + +T+H ++ + P R+ P QMD Sbjct: 76 GGARRIQNELRLNEQRELSLATIHKVLCEALVKPLVRPRRPAQPRRYSRPVPGDRVQMD- 134 Query: 150 KGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP-DRMTMDN 208 G + T +DD SRF + + R T+ L V E P R+ D Sbjct: 135 --TMKIARG-VYQYTAIDDCSRFRVLAVYPRRNARNTL-FFLDRVIEEMPFPIQRIQTDR 190 Query: 209 GSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG 268 G + +++ LM I+ P P GK+ER + E WF Sbjct: 191 GGEF-----FAESVQRRLMNECIKFRPIPPRSPHLNGKVERSQLTDLNEF----WF-HHA 240 Query: 269 ELQRAFD----HWRTVYNLERPHEALDMAVPGSR 298 +RA D W+ YN RPH +L P R Sbjct: 241 PTERAIDLRIEEWQFDYNWRRPHGSLGGKTPVDR 274 >UniRef50_C1FA08 Integrase core domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1FA08_ACIC5 Length = 697 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 70/285 (24%), Positives = 107/285 (37%), Gaps = 56/285 (19%) Query: 25 QDGANIRSLCRRF-------GISPATGYKWLQRWAQEGAAGLQDRPRIPH-----HSPNR 72 +DG+ + S R G+ T +WL+RW G A L DR R + Sbjct: 143 KDGSQVTSQTRMLEYQAEISGVYSRTLKRWLKRWRDGGLAALADRNRQDKGLSRWFEEHP 202 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHN------------LMARH 120 + D+TA + ++ + ++RW G P +STV +AR Sbjct: 203 EAKDVTAAAYLQPEQSKIAAYEALQRWCSRAGILAPHYSTVRRWLDSDDLPEPVVTLARE 262 Query: 121 GLLPGASPGIPATGRFEHD-APNRLWQMD------------FKGHFPFGGGRCHPLTLLD 167 G +P + D APN++W D F+G P R ++D Sbjct: 263 GERALKERHLPFLRKAYTDIAPNQIWVSDHMIHDVLVRNDCFEGAAPNAAIRLRFTAMID 322 Query: 168 DHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT------------ 215 SR + + ++ L ERYG P + +DNG + Sbjct: 323 YRSRKPMGYCWTVEGSSRSIALALRRGIERYGAPSTLYVDNGKDYKRVARGAAPVWKRVA 382 Query: 216 TGTWTALELW------LMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 + + A W L RLGI V H YHPQ++ +ERF R+L Sbjct: 383 SEQFVADVAWVESLGVLSRLGIEVQHCLKYHPQSK-HIERFFRTL 426 >UniRef50_B0NHH2 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium scindens ATCC 35704 RepID=B0NHH2_EUBSP Length = 422 Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 72/341 (21%), Positives = 128/341 (37%), Gaps = 21/341 (6%) Query: 37 FGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKI 96 F SP T KW+ + G L R R + D + R + +I Sbjct: 55 FRYSPKTISKWVSLYQNGGIDALMPRERSDKGATRVLPDTAIEEICRLKAAFPRLNSTQI 114 Query: 97 KRWLEDQGHTMPAFST--VHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFP 154 + L ++ + S V + +H L ++P + FE DA ++WQ D + P Sbjct: 115 HKHLVEEAFIPASVSVCAVQRFVKKHDLKSASNPNLRDRKAFEEDAFGKMWQAD-TCYLP 173 Query: 155 FGGG-----RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 + R + + ++DDHSRF + ++ Q+ L +G+P ++ +DNG Sbjct: 174 YITENGQRRRVYCILVIDDHSRFLVGGGLFYNDTAYNFQKVLKDAVAAHGIPSKLYVDNG 233 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE---VLQGKWFAD 266 + L L +G + H++ ++ K+ER R+LK L Sbjct: 234 CSY-----VGAQLSLICGSIGTVLLHTKVRDGASKAKIERQFRTLKETWLYTLDMDSITS 288 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ---PSARQYSGNTTPPEYDEGVMVRK 323 + + YN H + P +RYQ S R+ E + RK Sbjct: 289 LAQFNGLLKDYMRSYNTS-VHSGIG-TTPLARYQQTRSSIRRPKSREWLEECFLNRITRK 346 Query: 324 VDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWY 364 V+ +S+ V+ F +V ++ + +D S Y Sbjct: 347 VNKDSTVSIDRVAYDVPMQFISSKVEIRFLPDDMSSAFILY 387 >UniRef50_Q4FQT2 Transposase OrfB n=179 Tax=Bacteria RepID=Q4FQT2_PSYA2 Length = 288 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 42/148 (28%), Positives = 75/148 (50%), Gaps = 7/148 (4%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPL-TLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 PN+ W MDF H GR L ++DD++R +L + + V + L + E G Sbjct: 131 PNQSWSMDFM-HDALTDGRAFRLFNVIDDYNREALTVEIDFSLPAQRVIRSLNQLIEYRG 189 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 P ++ DNG+ + AL+ W + GI + + P +PQ +ER++R+++ + L Sbjct: 190 KPVQVRCDNGAEYISN-----ALKDWAVNQGITIRYIEPGNPQQNAYVERYNRTMRYDWL 244 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPH 287 + F D ++++ + W YN ERP+ Sbjct: 245 NQELFTDLDQVRQQAEDWLYHYNNERPN 272 >UniRef50_A7B7Y8 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A7B7Y8_RUMGN Length = 417 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 65/277 (23%), Positives = 110/277 (39%), Gaps = 30/277 (10%) Query: 40 SPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRW 99 +PAT KW + G GL + R + +++ +R + R A I R Sbjct: 57 APATIEKWYLDYQNHGFEGLVPKGRSDAGMSRKLDEELQERIRYFKTNYPRMSAAAIYRQ 116 Query: 100 LEDQGHTMP---AFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLW----------- 145 L+ G + + STV + R +P R+E N +W Sbjct: 117 LKSDGSVINGQVSESTVSRFVKRLQSELRQTPN-KDMRRYERPHINEVWCGDSSVGPRLT 175 Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 D K H R + + L+DD SRF + ++ + + S +YG P Sbjct: 176 DSDGKKH------RIYIIALIDDASRFITGIDVFYNDNFINLMSVMRSAIAKYGRPKVFN 229 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE---VLQGK 262 DNG + + +EL R+G + + +PY P + K+ER+ R++K + L + Sbjct: 230 FDNGKSYKNKQ-----MELLAARIGTTLSYCQPYTPTGKAKIERWFRTMKDQWMAALDMR 284 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 F EL+ + + YN + PH +L P R+ Sbjct: 285 DFHSLEELRGSLHAFVQRYN-QSPHSSLHGLSPQDRF 320 >UniRef50_A1WCB6 Integrase, catalytic region n=2 Tax=Burkholderiales RepID=A1WCB6_ACISJ Length = 270 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 45/155 (29%), Positives = 67/155 (43%), Gaps = 5/155 (3%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN+ W DF + G R ++D+ +R L + T V + L + E G Sbjct: 108 PNQGWSCDFMADALWSGRRFRTFNVIDEFNREGLRIEVDTSLPATRVIRALNELVEVRGA 167 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + +DNG + AL W GI + H +P P +ERF+++ + EVL Sbjct: 168 PLSIRLDNGPEF-----IAHALSEWAKSKGIALNHIQPGKPTQNAYVERFNKTYRTEVLD 222 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 F + E++ W YN RPHEAL P Sbjct: 223 CYVFDNLQEVRDMTADWLHRYNHHRPHEALGRIPP 257 >UniRef50_C6BTX7 Integrase catalytic region n=88 Tax=Bacteria RepID=C6BTX7_DESAD Length = 289 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 59/214 (27%), Positives = 89/214 (41%), Gaps = 13/214 (6%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-LPGASPG--IPATGRF---EHDAPNR 143 R+G +I L +G + VH + GL L P I A R E ++ Sbjct: 60 RYGCHRIYILLRREGWYV-NHKKVHRIYCEEGLNLRSKRPRRHISAARRMDRPELSTIDQ 118 Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL-PD 202 W MDF + G R LT++D+ SR L + + + + V +L + G P Sbjct: 119 CWSMDFVADNLYNGRRIRALTVVDNFSRECLDIYVDSSIKGDKVVARLEWLRVISGRKPI 178 Query: 203 RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK 262 R+ +DNGS + AL+ W + + SRP P +E F+ S + E L Sbjct: 179 RIQVDNGSEF-----ISKALDKWAYENEVVLDFSRPGKPTDNPFIESFNGSFRDECLNTH 233 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 WF + + + WR YN RPH +L P Sbjct: 234 WFLSVSDARTRIETWRKEYNEFRPHSSLGDQTPN 267 >UniRef50_A9B8L4 Integrase catalytic region n=5 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B8L4_HERA2 Length = 435 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 49/186 (26%), Positives = 80/186 (43%), Gaps = 17/186 (9%) Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMD------NGSPWGDTTGTWTALELWLMRLGIRV 233 TD +V + S+ ++ GLP R+ MD + D + L L+ LGI+V Sbjct: 194 TDFHMASVIRTTASILQQIGLPARIRMDCDVRLVSNKRVADFPSPFQRL---LLNLGIQV 250 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE--ALD 291 P+ P + +ERFH++ K E + W E Q D + Y ERPH+ A Sbjct: 251 DVCPPHRPDLKPFVERFHKNYKGESVYPNWPTTEAEAQVQVDAYCDWYRTERPHQGRACG 310 Query: 292 MAVPGSRYQPSAR------QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRG 345 P + Q + + D VR+V+ GKL + G + +AG A+ G Sbjct: 311 NRPPAEAFPELPVLPPVPAQVDADGWLKQIDGWTFVRRVNAQGKLMLDGATYTAGIAYAG 370 Query: 346 ERVGLK 351 + + ++ Sbjct: 371 QELAVQ 376 >UniRef50_A4LGI6 YD repeat protein n=14 Tax=Proteobacteria RepID=A4LGI6_BURPS Length = 643 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 34/110 (30%), Positives = 48/110 (43%), Gaps = 5/110 (4%) Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 VQQ L + E GLP +T+DNG + L+ W G+ + RP P Sbjct: 517 VQQVLARLKEMRGLPASITVDNGPEFAGKV-----LDAWAYEAGVTLSFIRPGKPVENAY 571 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 +E F+ + E L WF ++ + WR YN ERPH +L P Sbjct: 572 IESFNGRFRDECLNEHWFVSMRHAKQLIEEWRIEYNTERPHSSLGYLTPA 621 >UniRef50_Q64B23 Transposase n=1 Tax=uncultured archaeon GZfos27G5 RepID=Q64B23_9ARCH Length = 414 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 79/364 (21%), Positives = 148/364 (40%), Gaps = 43/364 (11%) Query: 46 KWLQRWAQEGAAGLQDRPR----IPHHSPNRSSDDITALLRMAHD------RHERWGARK 95 KWL R+ +D P+ IPH + R + + + D ++ GA Sbjct: 39 KWLGRYKTGRKGWYKDLPKRARVIPHKTSERIEQIVVNIRKALMDGTEDSTKYSCVGAEA 98 Query: 96 IKRWLEDQGHT---MPAFSTVHNLMARHGL---LPGASPGIPATGRFEHDAP---NRLWQ 146 I+ +E+ G+ +P+ ST+ ++ R+ L P + + GR+ P + L Q Sbjct: 99 IQFHMEELGYKPSEIPSISTIKRIIKRNKLRANKPERYKRVRSKGRYTILNPKHIDELHQ 158 Query: 147 MDFKG--HFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRM 204 +D+ G H G G + + L D R + ++ + V L+ +++ +P + Sbjct: 159 LDYVGPRHIK-GYGPINSIHLKDVAGR-QVAGQQYNEKSMDNVMDFLMGYWKQCPIPKYL 216 Query: 205 TMDNGSPW-GDTT--GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 +DNG + GD +++ + +GI V P P G +E F++ Q Sbjct: 217 QVDNGMCFAGDYKHPKSFSRFVRLALYVGIEVVFIAPSRPWMNGTIEEFNKGFDKRFWQK 276 Query: 262 KWFADSGELQRA----------FDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTT 311 + F D ++++ F+ W+ +E L + P R P + N Sbjct: 277 ELFTDLNDIRKKSVIFFEKENKFNAWKL------RNEKLKVVDP-KRMLPGDFTITVNRL 329 Query: 312 PPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI 371 P E +R+VD GK+SV G+ + GE V ++ V++ + V Sbjct: 330 PLVTGEIHFIRRVDSRGKISVLNEYFDVGREYTGEYVWATIETMKQTHIVYYKDENLVVR 389 Query: 372 DLKK 375 ++KK Sbjct: 390 EIKK 393 >UniRef50_UPI0000E49FEF PREDICTED: similar to LReO_3 n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49FEF Length = 1320 Score = 61.2 bits (147), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 37/112 (33%), Positives = 61/112 (54%), Gaps = 7/112 (6%) Query: 158 GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTG 217 G + LT++D +RF + + +TV L+ F RYGLP + D GS + Sbjct: 474 GSQYLLTIMDTTTRFPEAFP-LRNIKAKTVIDALLMFFTRYGLPHEIQSDQGSNF----- 527 Query: 218 TWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 T + + +LGI+ +S YHPQ+QG LER+H++LK +++ +SG+ Sbjct: 528 TSNVFQEVMYQLGIKQINSSAYHPQSQGALERYHQTLKT-MIKSYCLENSGD 578 >UniRef50_UPI0001925317 PREDICTED: similar to COS41.3 n=1 Tax=Hydra magnipapillata RepID=UPI0001925317 Length = 1187 Score = 60.8 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 35/118 (29%), Positives = 59/118 (50%), Gaps = 6/118 (5%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 P +DFKG P + LT++D++SRF + C D +TV + L VF +GL Sbjct: 856 PLERLNLDFKGPLPSETNNKYFLTIIDEYSRFPFAIP-CPDISAQTVIKCLSQVFSIFGL 914 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 P + D GS + L+ +L+ GI H+ Y+PQ G+ E+++ ++ + Sbjct: 915 PSYIHSDRGSAF-----ISKELKQYLLEKGIATSHTTAYNPQGNGQAEKYNGTIMKSI 967 >UniRef50_C4URW7 Integrase n=14 Tax=Proteobacteria RepID=C4URW7_YERRO Length = 302 Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 75/302 (24%), Positives = 118/302 (39%), Gaps = 32/302 (10%) Query: 4 LMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRP 63 +M A+ ++ +T+ + +A+ + NI CR F IS T Y W + + + G GL + Sbjct: 1 MMNAKAKRDITHKTKVLNYAN-NTKNIAKTCRHFSISRRTYYTWKKAYERYGEQGLINHK 59 Query: 64 RIPHHSPNRSSDDI---TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 P + R + I LR + +G ++I +L + + S + ++ R+ Sbjct: 60 PCPENPTRRVAKHIEEQIIYLRTTY----HFGPQRISWYLLRFHNIKVSRSGCYYVLLRN 115 Query: 121 GL--LPGASP--GIPATGRFEHDAPNRLWQMDFKGHFPF----GGGRCHPL--TLLDDHS 170 L LP P R+E P Q+D K F F G R T +DD + Sbjct: 116 RLNQLPQNQRQRSKPLFKRYEKQVPGHHVQVDVK--FLFFNSPNGQRIKRFQYTAIDDAT 173 Query: 171 RFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM---DNGSPWGDTTGTWTALELWLM 227 R + ER + P R+ DNG + W EL + Sbjct: 174 RIRALKIY---ERHNQANAINFIDYVVNKFPFRLKTIRTDNGHEF-QAKFNWHVHELGME 229 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPH 287 + I+ P P+ GK+ER H + K E Q + D +L W YN RPH Sbjct: 230 HVYIK-----PATPRLNGKVERSHLTDKQEFYQLIDYTDDVDLHEKLAEWEAFYNCHRPH 284 Query: 288 EA 289 A Sbjct: 285 SA 286 >UniRef50_A4A249 Transposase orfB n=4 Tax=Planctomycetaceae RepID=A4A249_9PLAN Length = 279 Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 55/219 (25%), Positives = 85/219 (38%), Gaps = 22/219 (10%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL-LPGASPGIPATGR-----------FE 137 RWG R+I + L +G T+ ++ L GL +P ATG F Sbjct: 35 RWGYRQICQLLRREGETL-NMKKMYRLWKAAGLKVPQKRRKKRATGVSTNACHVQPAGFR 93 Query: 138 HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 HD +W DF G L ++D+++R L + E L +F Sbjct: 94 HD----VWTWDFIQSSTIDGRTIRFLNIVDEYTRQCLAIKVGRSITSEDAIDTLAELFAM 149 Query: 198 YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE 257 +G+P R+ DNG + A++ WL +G+ V + P P G F+ L+ E Sbjct: 150 HGVPKRIRCDNGPEFIS-----CAIKTWLDLIGVEVLYIEPGSPWQNGLCVSFNSRLRDE 204 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 L + + WR +N RPH +L P Sbjct: 205 YLHQTDLLSLEDARIKARAWREDFNHNRPHSSLGYLTPA 243 >UniRef50_D1K5D0 Transposase n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1K5D0_9BACE Length = 380 Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 86/362 (23%), Positives = 139/362 (38%), Gaps = 62/362 (17%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI-T 78 +L SQ N+ C+ G S + Y++ + + Q G LQ+ R NR + I Sbjct: 14 LLELSQQLGNVSRACKIMGYSRDSFYRFKELYEQGGEIALQEISRRKPVIKNRVEEHIEQ 73 Query: 79 ALLRMAHDRHERWGARKIKRWLEDQG--------------HTMPAF----STVHNLMARH 120 A++ MA D + G ++ L +G H M F + + + Sbjct: 74 AVVGMAID-NPALGQVRVSNELRKKGILVSPGGVRSIWLRHDMETFQKRLKALSAKVEQE 132 Query: 121 GL---------LPGASPGIPATGRFEHDAPNRLWQMD--FKGHFPFGGGRCHPLTLLDDH 169 G+ L A A G E P L D + GH G G + T++D + Sbjct: 133 GIILDENQVAALEKAKEEKQAHGEIETYYPGFLVAQDTYYVGHIK-GVGHIYQQTVIDTY 191 Query: 170 SRFSLCLAHCTDERRETV-----QQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALE 223 S+ A D + V ++V FE++ L RM D G+ + E Sbjct: 192 SKIGF--AKLYDRKNALVAADMLNDRIVPFFEQHDLKLMRMLTDRGTEYCGNRENH-EYE 248 Query: 224 LWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE----VLQGKWFADSGELQRAFDHWRT 279 L+L I + PQT G ERF+R+++ E + K + +LQ D W Sbjct: 249 LYLAVEDIDHSKIKAKSPQTNGICERFNRTVQNEFYAIAFRKKIYTSIEQLQTDLDAWMN 308 Query: 280 VYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV-RKVDISGKLSVK--GVS 336 YN +R H S + G T + EG+ V RK + + ++K GV+ Sbjct: 309 SYNTQRTH--------------SGKYCFGKTPMQTFIEGIAVARKYKLQNQETIKPNGVN 354 Query: 337 LS 338 ++ Sbjct: 355 IT 356 >UniRef50_Q9FZN9 Retroelement pol polyprotein-like n=10 Tax=Arabidopsis thaliana RepID=Q9FZN9_ARATH Length = 1864 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 42/141 (29%), Positives = 70/141 (49%), Gaps = 8/141 (5%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ +A T++ R ++ +F R+G+P Sbjct: 1568 VWGIDFMGPFPSSYGNKYILVAVDYVSKWVEAIASPTNDARVVLKLFKTIIFPRFGVPRI 1627 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA--EVLQG 261 M D G + + E L + G++ + PYHPQT G++E +R +KA E + G Sbjct: 1628 MISDGGKHFIN-----KVFENLLKKHGVKHKVATPYHPQTSGQVEISNREIKAILEKIVG 1682 Query: 262 KWFAD-SGELQRAFDHWRTVY 281 D S +L A +RT + Sbjct: 1683 STRKDWSAKLDDALWAYRTAF 1703 >UniRef50_C2GFW3 Possible transposase n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GFW3_9CORY Length = 282 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 50/173 (28%), Positives = 77/173 (44%), Gaps = 8/173 (4%) Query: 122 LLPGASPGIPATGRFEHDAPNRLWQMDFKGH--FPFGGGRCHPLTLLDDHSRFSL-CLAH 178 L PG + + TG F D N LWQ+D + F + ++DD SRF + A Sbjct: 26 LTPG-NDSVARTGGFTRDKVNELWQIDGLVYRLFDHDHTQITVYQVIDDASRFDVGTQAF 84 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHSR 237 E + L + F+ YGL + DNG + G ++ ELWL + G V Sbjct: 85 PAAENGNDARFVLAAAFDTYGLSQEVLSDNGDAFATYHWGRLSSTELWLAQKG--VAAIA 142 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 + P QGK ER H++L L + +++ +R YN +R H++L Sbjct: 143 GFAPTVQGKYERSHKTL-TRFLDARQPTMLTHVRQLLTQFRQFYNTQRIHQSL 194 >UniRef50_A1R4J8 ISAau1, transposase orfB n=3 Tax=Actinomycetales RepID=A1R4J8_ARTAT Length = 279 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 58/255 (22%), Positives = 95/255 (37%), Gaps = 19/255 (7%) Query: 60 QDRPRIPHHSPNRSSDD--ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAF---STVH 114 Q+R + P S ++ + A LR +H WG +K + L Q V Sbjct: 19 QNRSALRKKKPEMSFEETRLRADLRAVAQKHPAWGWKKARWHLRAQPQWQDVALNKKRVR 78 Query: 115 NLMARHGLL--------PGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLL 166 L GL+ P R + + P + DF+ G ++ Sbjct: 79 RLWRDEGLVCKPKPKKKRRTGPDAGEQKRLKAEYPMHVISFDFQSDVTSCGRHIRFFNVI 138 Query: 167 DDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL-PDRMTMDNGSPWGDTTGTWTALELW 225 D+ +R +L + + V L ++ G+ P + DNG + T AL W Sbjct: 139 DECTRTALAIVPRRSFKASDVVAVLENIIAETGIEPAYVRCDNGPEF-----TAAALIEW 193 Query: 226 LMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLER 285 G++ P P G +E F+ + E L G+ E + D W+ +YN ER Sbjct: 194 CSTAGVKTAFIDPGSPWQNGFIESFNAQFRREQLSGEIIDTMAEAKYLADEWKDIYNHER 253 Query: 286 PHEALDMAVPGSRYQ 300 PH +LD P + + Sbjct: 254 PHGSLDGMTPSNYWN 268 >UniRef50_Q0P7I8 IS1400 transposase B n=231 Tax=Bacteria RepID=Q0P7I8_ECOLX Length = 159 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 37/139 (26%), Positives = 64/139 (46%), Gaps = 5/139 (3%) Query: 157 GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT 216 G R ++DD +R +L + + + V + L + G P + +DNG + Sbjct: 8 GRRFRMFNVVDDFNREALSIEIDLNLPAQRVVRVLDRIAANRGYPAMLRLDNGPEFISL- 66 Query: 217 GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDH 276 AL W + I++ +P P +ERF+R+ + E+L F E++ + Sbjct: 67 ----ALAEWAEKHAIKLEFIQPGKPTQNAFIERFNRTYRTEILDFYLFRTLNEVREITEK 122 Query: 277 WRTVYNLERPHEALDMAVP 295 W + YN ERPHE+L+ P Sbjct: 123 WLSKYNCERPHESLNNMTP 141 >UniRef50_B1LE64 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LE64_ECOSM Length = 49 Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 26/35 (74%), Positives = 28/35 (80%) Query: 107 MPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP 141 +PAFSTVHNLMAR+GLLPG PATGRFEH P Sbjct: 9 LPAFSTVHNLMARNGLLPGLVAAAPATGRFEHAEP 43 >UniRef50_Q04ND9 Putative uncharacterized protein n=3 Tax=Leptospira RepID=Q04ND9_LEPBJ Length = 120 Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 38/105 (36%), Positives = 48/105 (45%), Gaps = 1/105 (0%) Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNT 310 HR+LK E + E Q AFD +R YNLERPHEAL+ P Y PS R + Sbjct: 2 HRTLKQETALPPR-SSLKEQQEAFDRFRIEYNLERPHEALEYKTPEKIYIPSERVFPIRI 60 Query: 311 TPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y ++V V G + G GERVG +E+ E Sbjct: 61 PEIAYATNIVVETVLDDGTAKYGPYRIFFGSPLIGERVGFEEVTE 105 >UniRef50_Q2AA50 Retrotransposon gag protein n=6 Tax=Asparagus officinalis RepID=Q2AA50_ASPOF Length = 1788 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 33/112 (29%), Positives = 59/112 (52%), Gaps = 5/112 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 LW +DF G FP G + L ++ S++ +A T++ + V+ ++F R+G+P Sbjct: 1425 LWGIDFMGPFPNSFGNVYILVAVEYMSKWVEAVACKTNDNKVVVKFLKENIFARFGVPRA 1484 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + DNG+ + + + E + + I S PYHPQT G++E +R +K Sbjct: 1485 IISDNGTHFCN-----RSFEALMRKYSITHKLSTPYHPQTSGQVEVTNRQIK 1531 >UniRef50_B8KLM8 Integrase, catalytic region n=2 Tax=gamma proteobacterium NOR5-3 RepID=B8KLM8_9GAMM Length = 272 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 40/154 (25%), Positives = 62/154 (40%), Gaps = 5/154 (3%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N+ W +DF G R L ++DD SR + V + L +FE P Sbjct: 116 NQRWSIDFVSDQLSSGRRFRVLNVVDDFSREMVGQLVAVSITGSQVARFLSELFEDREKP 175 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 ++ DNG T T A+ W G+++G +P P +E + + E L Sbjct: 176 QKIICDNG-----TECTSKAMFFWSQESGVKLGFIQPGKPTQNAFVESLNGKFRNECLNR 230 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 WF + + W+ YN RPH +L+ P Sbjct: 231 HWFRSLDDAKTEIMLWQNQYNNVRPHSSLNYLPP 264 Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust. Identities = 41/172 (23%), Positives = 69/172 (40%), Gaps = 12/172 (6%) Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 V + L + E G P+ + DNG + AL W + I + + +P P Sbjct: 10 VIRALDQIIEWRGKPEALRCDNGPEYISQ-----ALVAWANQQRITLMYIQPGKPTQNAY 64 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 +ERF+R+++ E L F Q W YN ERP+ A+ P +++ Sbjct: 65 IERFNRTVRHEWLDLHSFVSLDHAQNLATQWLWQYNNERPNTAIGGVPPRVN-----QRW 119 Query: 307 SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVG--LKEMQED 356 S + + G R +++ S + V + G +V L E+ ED Sbjct: 120 SIDFVSDQLSSGRRFRVLNVVDDFSREMVGQLVAVSITGSQVARFLSELFED 171 >UniRef50_C1XPR1 Transcriptional regulator/sugar kinase n=14 Tax=Meiothermus silvanus DSM 9946 RepID=C1XPR1_9DEIN Length = 777 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 75/283 (26%), Positives = 108/283 (38%), Gaps = 32/283 (11%) Query: 37 FGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHER--WGAR 94 GIS AT ++W + ++G AGL+ R R P H + L+R+ R E WG Sbjct: 57 VGISRATYHRWQKALKEKGLAGLKPRSRRPKHLRTKVHWTPGLLIRIETLRKENPTWGRW 116 Query: 95 KIKRWLEDQGHTMPAFSTVHNLMA---RHGLLPGASPGIPATGR---------------- 135 I L +G M + TV ++A +H + + + T R Sbjct: 117 SIWLTLRKEGFQM-SERTVGRILAYLEKHRRIESVAGYLARTQRGKLKRRVNRPYAKRKP 175 Query: 136 --FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 +E AP L Q+D G + +D HSRF L H + + + L Sbjct: 176 RGYEARAPGDLVQVDTLTLTLGPGSMVKHFSAIDLHSRFVLAEVHSRATAKLS-EGFLSL 234 Query: 194 VFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 + R P R + +D GS + E LGI + P P+ G +ER R Sbjct: 235 LLARAPFPIRAIQVDGGSEF------MAEFEEACCALGIALFVLPPRSPKLNGHVERMQR 288 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 + K E ELQ D + YN RPH AL P Sbjct: 289 TFKEEFYTRPLPTPLSELQAELDTYLDYYNRRRPHMALGGLAP 331 >UniRef50_C5CE17 Integrase catalytic region n=3 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CE17_KOSOT Length = 367 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 86/344 (25%), Positives = 139/344 (40%), Gaps = 37/344 (10%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQR-WAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 ++G + ++ RR GI+P T K+L++ W Q G + P D I L+ Sbjct: 15 REGLSKLAIARRLGIAPNTVKKYLEKEWCQMAKRGSKLDP---------FKDYIEKRLQ- 64 Query: 84 AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNR 143 + A + + L ++G+T T+ M + P P I RFE + P R Sbjct: 65 ---EYPELTATVLFKELVERGYTGKL--TILR-MYVSSIRPKGKPEIVV--RFETE-PGR 115 Query: 144 LWQMDF-KGHFPFGGGR--CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG- 199 +Q+D+ G G + ++ +SR L DE+ ET+ Q + FE +G Sbjct: 116 QFQVDWGTGTTVIAGEKTTVKFFIMVLSYSRM-LYAEIVPDEKLETLIQAHLHAFEYFGG 174 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRL----GIRVGHSRPYHPQTQGKLERFHRSLK 255 P DN M G +V RPY+P+ +GK+ER ++ Sbjct: 175 YPSEGLYDNMKTVVKKLQKQKEYNARFMDFANFYGFKVITHRPYNPKAKGKVERLVPYVR 234 Query: 256 AEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEY 315 +L G+ ++ EL+ W + N +R H L P R++ N Y Sbjct: 235 ENILYGQSYSSLTELKNVLRDWLAIAN-QRLHSELK-ETPLERFEREKNHL--NKLSKSY 290 Query: 316 D-EGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGS 358 + R V G++ K + GK + GERV L Q +GS Sbjct: 291 PIRRLNTRLVRDKGQIVYKERAYHVGKKYTGERVNL---QVEGS 331 >UniRef50_P24577 Insertion element IS407 uncharacterized 31.7 kDa protein n=164 Tax=Proteobacteria RepID=YI71_BURM1 Length = 277 Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 60/263 (22%), Positives = 103/263 (39%), Gaps = 30/263 (11%) Query: 70 PNRSSDDITA-LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL------ 122 P+ ++ + A L+ +AH+R R+G R++ +E +G T ++ L GL Sbjct: 32 PDHENEVLAARLVELAHER-RRFGYRRLHALVEREG-THANHKRIYRLYREAGLAVRRRR 89 Query: 123 ------LPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 + +P APN +W +DF G R LT++DD ++ ++ + Sbjct: 90 KRQGVMIEREQLALPG-------APNEVWSIDFVMDALSNGRRVKCLTVVDDFTKEAVDI 142 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 V + L G P + D G + T AL+ W G+ + Sbjct: 143 VVDHGISGLYVARALDRAARFRGYPKAVRTDQGPEF-----TSRALDQWAYANGVTLKLI 197 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 + P +E F+ + E L WF + WR YN +RPH AL+ P Sbjct: 198 QAGKPTQNAYIESFNGKFRDECLNEHWFTTLAHARAVIAAWRQDYNEQRPHSALNYLAPS 257 Query: 297 SRYQPSARQYSGNTTPPEYDEGV 319 + +A+ + P + E V Sbjct: 258 ---EFAAKHRATADAPAAFQELV 277 >UniRef50_Q3BT31 Transposase n=22 Tax=Bacteria RepID=Q3BT31_XANC5 Length = 343 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 44/160 (27%), Positives = 63/160 (39%), Gaps = 5/160 (3%) Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 D PN+ W +DF R L ++DD++R L L T V ++L + Sbjct: 161 DGPNQRWSLDFVSDTLTCSRRFRILCVVDDYTRECLALVADTSLSGVRVARELTRLIGMR 220 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 G P + DNG T T +A+ W + + P P G +E F+ L+ E Sbjct: 221 GKPHTVVSDNG-----TELTSSAILRWSQERRVEWHYIAPGKPMQNGFVESFNGRLRDEC 275 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 L F + D WR YN RPH L P + Sbjct: 276 LNETLFTSLPHARFVLDAWRHDYNHVRPHSKLGGRTPAEK 315 >UniRef50_UPI0001924F80 PREDICTED: similar to COS41.3, partial n=2 Tax=Hydra magnipapillata RepID=UPI0001924F80 Length = 777 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 43/139 (30%), Positives = 65/139 (46%), Gaps = 12/139 (8%) Query: 131 PATGRFEHDAPNRLWQ---MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 P GR R W+ MDF G + L ++D++SR+ C D TV Sbjct: 561 PPAGRLIQAT--RPWERLSMDFVGRLQSTSANKYILVIVDEYSRYPFGFP-CKDITANTV 617 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 Q L+S+F +G P M D G+ + +L+++L R G+ + PYHPQ G+ Sbjct: 618 IQHLLSLFSLFGAPSSMHTDRGTQFES-----ISLKVFLERNGVIRTRTTPYHPQGNGQC 672 Query: 248 ERFHRS-LKAEVLQGKWFA 265 ER + + LKA L K + Sbjct: 673 ERMNGTILKAISLALKTLS 691 >UniRef50_UPI0001924F1E PREDICTED: similar to COS41.3 n=2 Tax=Hydra magnipapillata RepID=UPI0001924F1E Length = 2009 Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 38/127 (29%), Positives = 59/127 (46%), Gaps = 11/127 (8%) Query: 131 PATGRFEHDAPNRLWQ---MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 P GR R W+ MDF G + L ++D++SR+ C D TV Sbjct: 1657 PPAGRLIQ--ATRPWERLSMDFVGPLQSTSANKYILVIVDEYSRYPFAFP-CKDITANTV 1713 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 Q L+S+F +G P M D G+ + + L+ +L R G+ + PYHPQ G+ Sbjct: 1714 IQHLLSLFSLFGAPSSMHTDRGTQFESIS-----LKNFLERNGVIRTRTTPYHPQGNGQR 1768 Query: 248 ERFHRSL 254 ER + ++ Sbjct: 1769 ERMNGTI 1775 >UniRef50_C4UEN4 Transposase n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UEN4_YERAL Length = 281 Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 5/152 (3%) Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + P+ W +DF + G R L ++D+ +R L + T + V + L + + Sbjct: 114 NLPDIQWALDFMHDALYCGKRFRTLNIIDEGTRECLAIEVDTSLPTDRVIRVLDRLKKER 173 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 GLP ++ +DNG P + E I + H +P PQ G +ERF+ S + E Sbjct: 174 GLPQQLRVDNG-PELISVNLLNYCEYN----HITLCHIQPGKPQQNGFIERFNGSFRREF 228 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 L F +++ W+ YNL R HE+L Sbjct: 229 LNAYLFESLSQVREMAWFWQQDYNLNRTHESL 260 >UniRef50_B8FAB2 Integrase catalytic region n=70 Tax=Bacteria RepID=B8FAB2_DESAA Length = 327 Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 56/247 (22%), Positives = 97/247 (39%), Gaps = 20/247 (8%) Query: 67 HHSPNRSSDDITALLRMAHDRHER---WGARKIKRWLEDQGHTM--PAFSTVHNLMARHG 121 ++ P SD L+R+ +++ + WG+R ++ +L G+ + + +M Sbjct: 34 YYRPKPVSDHDLELMRLIDEQYLKQPTWGSRSMRNFLRGLGYKINRKKVRRLMRIMGICA 93 Query: 122 LLPGASPGIPATGRFEH---------DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF 172 + P +P G + D N++W D + P G + ++D HSR Sbjct: 94 VYPKPRTSLPHPGHKVYPYLLKGVSIDRANQVWSSDIT-YIPMRKGFMYLCAVIDWHSRK 152 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 L + +RYG P+ D G + T+ +T L L GIR Sbjct: 153 VLSWRLSNTMDADFCVDAAAEAIDRYGPPEIFNTDQGVQF--TSADFTGL---LKGHGIR 207 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDM 292 + +ER +LK + + F D +L++ W YN ERPH++LD Sbjct: 208 ISMDGKGRCLDNIFVERLWWTLKYHYVYLRDFEDGVQLRKGLAGWFDFYNRERPHQSLDG 267 Query: 293 AVPGSRY 299 P Y Sbjct: 268 KTPNEAY 274 >UniRef50_UPI00015B43F4 PREDICTED: similar to pol polyprotein n=1 Tax=Nasonia vitripennis RepID=UPI00015B43F4 Length = 905 Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 45/130 (34%), Positives = 63/130 (48%), Gaps = 21/130 (16%) Query: 134 GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 RFEH MD PF G + LT++D SR+ + + D R ET+ + S Sbjct: 569 NRFEH------VHMDIIV-MPFVGDLRYCLTMIDRFSRWPVVVP-IADIRAETIAR---S 617 Query: 194 VFER----YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 FE YG P +T D G+ + +AL +G R H+ PYHPQ +ER Sbjct: 618 FFEHWVAYYGTPITITTDQGTQF------ESALFALAQMIGSRRIHTTPYHPQANRLIER 671 Query: 250 FHRSLKAEVL 259 FHR+LKA ++ Sbjct: 672 FHRTLKAALM 681 >UniRef50_UPI00005104D7 transposase n=1 Tax=Brevibacterium linens BL2 RepID=UPI00005104D7 Length = 401 Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 52/198 (26%), Positives = 80/198 (40%), Gaps = 5/198 (2%) Query: 104 GHTMPAFSTVHNLMARHGLLPGASPGIPATGR--FEHDAPNRLWQMDFKGHFPFGGGRCH 161 G +P+ +T+ L+A G + + P + F LWQ+D + G Sbjct: 36 GGKVPSPATIARLLASVGHVEASPKKRPKSCYIPFARSTAMALWQLDAFEYTLTTGTIVT 95 Query: 162 PLTLLDDHSRFSL-CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTW 219 LLDD +RF + AH E + L + YG P + DN S + G Sbjct: 96 IYQLLDDATRFDVGTSAHSRAENSADAHEILAAAITEYGAPKEVLSDNSSAFNQLRQGRI 155 Query: 220 TALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT 279 A+E +L G P P TQGK ER ++L L A +++ + Sbjct: 156 GAVETFLASKGAMPISGLPGKPTTQGKNERSRQTL-IRFLDANTPASLEKIRALLRRFHD 214 Query: 280 VYNLERPHEALDMAVPGS 297 YN RPH+++ A P + Sbjct: 215 HYNNRRPHQSIGGATPAT 232 >UniRef50_P51517 Integrase n=76 Tax=root RepID=POL_SRV2 Length = 867 Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 45/147 (30%), Positives = 68/147 (46%), Gaps = 10/147 (6%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 PN LWQMD + FG + +++ D S F L T E + V L+ F G Sbjct: 649 VPNMLWQMDVTHYSEFGKLKYVHVSI-DTFSGF-LVATLQTGEATKHVIAHLLHCFSIIG 706 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK---A 256 P + DNG G T+ + A + +L I+ PY+PQ QG +ER H SLK Sbjct: 707 QPIHIKTDNGP--GYTSSNFRA---FCSKLHIKHTFGIPYNPQGQGIVERAHLSLKNTLE 761 Query: 257 EVLQGKWFADSGELQRAFDHWRTVYNL 283 ++ +G+W+ G + +H + N Sbjct: 762 KIKKGEWYPTQGSPRNILNHALFILNF 788 >UniRef50_A1VJC3 Integrase, catalytic region n=25 Tax=Bacteria RepID=A1VJC3_POLNA Length = 325 Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 69/261 (26%), Positives = 103/261 (39%), Gaps = 26/261 (9%) Query: 57 AGLQDRPRIPHHSPNRSSDDITALLRMA---HDRHERWGARKIKRWLEDQGHTMPAFSTV 113 AG+ P R D LLR+ + RH +G+RK+ L GH++ Sbjct: 8 AGISRAALYARRKPKRIVQDDELLLRLIDEEYTRHPFYGSRKMVVHLGRCGHSVNR-KWA 66 Query: 114 HNLMARHGLLPGASPGIPATGRF--EHDA------------PNRLWQMDFKGHFPFGGGR 159 LM GL G +PG P T R +H PN++W D + G Sbjct: 67 QRLMRSLGL-AGMAPG-PNTSRAHPQHKVYPYLLRGVAISRPNQVWSTDIT-YIRLARGF 123 Query: 160 CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW 219 + + ++D +SR L L +G P+ D GS + T+ + Sbjct: 124 AYLVAVIDWYSRRVLSWRISNSMETVFCVDCLEEALRIHGKPEVFNTDQGSQF--TSEAF 181 Query: 220 TALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT 279 T++ L R G+ + +ER RS+K E + K ++ GEL + T Sbjct: 182 TSV---LKREGVIISMDGRGRALDNIFVERLWRSVKHEDVYLKGYSAMGELLIGLTQYFT 238 Query: 280 VYNLERPHEALDMAVPGSRYQ 300 YN ERPH+AL P YQ Sbjct: 239 FYNGERPHQALKNLTPDVVYQ 259 >UniRef50_UPI0001792303 PREDICTED: similar to zinc finger protein n=2 Tax=Acyrthosiphon pisum RepID=UPI0001792303 Length = 808 Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 37/122 (30%), Positives = 57/122 (46%), Gaps = 12/122 (9%) Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 I AT FE +DFKG P G + LT++D+ SRF + C D TV + Sbjct: 512 IKATSPFER------LNIDFKGPIPSKTGNNYILTIVDEFSRFPFAIP-CRDLSSATVIK 564 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L +F +G+P + D G+ + L +L GI + PY+P G++ER Sbjct: 565 CLSYLFSIFGMPAYIHSDRGAAFLS-----QELTTFLCTRGIATSRTTPYNPAGNGQVER 619 Query: 250 FH 251 ++ Sbjct: 620 YN 621 >UniRef50_C4V4D7 Transposase n=5 Tax=Clostridiales RepID=C4V4D7_9FIRM Length = 329 Score = 57.8 bits (138), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 61/224 (27%), Positives = 89/224 (39%), Gaps = 20/224 (8%) Query: 58 GLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLM 117 L R R PHH PN SD AL+R R G L +G+T + + ++ M Sbjct: 65 SLDPRSRRPHHHPNEHSDREIALIRRMRKRRPNTGLICFWVHLRKKGYTR-SITGLYRCM 123 Query: 118 ARHGLLPGAS--PGIPATGRFEHDAPNRLWQMDFK--------GHFPFGGGRCHPLTLLD 167 R GL G + P + P + Q+D K G G + + T +D Sbjct: 124 KRLGLKAGKAKKPVYKPKPYEQATFPGQKVQIDVKVVPSVCIVGQAKEQGEKMYQYTAID 183 Query: 168 DHSRFSLCLAHCTDERRETV--QQQLVSVFERYGLPDRMTMDNGSPW-----GDTTGTWT 220 +++RF A ++ QQL+ F + + ++ DNG+ + T Sbjct: 184 EYTRFRFIAAFKEQSTYSSMCFLQQLIRRFP-FKI-HKVQTDNGAEFTKRFQAADEANLT 241 Query: 221 ALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWF 264 E L RLGI RPY P+ GK+ER HR E F Sbjct: 242 LFEKELKRLGIAHQKIRPYTPRHNGKVERSHRKDNEEFYASHTF 285 >UniRef50_A5D1X6 Transposase and inactivated derivatives n=2 Tax=Pelotomaculum thermopropionicum SI RepID=A5D1X6_PELTS Length = 308 Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 69/282 (24%), Positives = 118/282 (41%), Gaps = 36/282 (12%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT-ALLRMAHDRHERWGARKIK 97 +S T YK L R+ + G AGL D+ R+P PN++ D+ A+L D H +G ++I Sbjct: 1 MSHTTFYKLLDRFKEHGEAGLYDKERVPGIKPNQTPTDVEGAILAFVLD-HPTYGPKRIS 59 Query: 98 RWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKG------ 151 L+ + + + V+ ++ R+ L + + W++D + Sbjct: 60 AELKKRCIRV-GETAVYGVLKRNSL-NTRRDRLKWVDSLQPPQEKTAWELDKEASQHRHV 117 Query: 152 HFPFGG----------------GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 H P G G+ + +D S + + TD+ ++ L+ V Sbjct: 118 HAPQPGYLMGQDGKLVGRLANIGKVYVQVGVDCASSYGWAKLY-TDKTADSAADFLIHVH 176 Query: 196 ---ERYGLP-DRMTMDNGSPWGDTTGTWT-ALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 + G+ R+ DNG +G + T+ + LGI+ ++ HP T G ERF Sbjct: 177 SDCQSKGVEVQRVLTDNGKEYGSSEPTYGHTYGAACLILGIKHKTTKVKHPWTNGYAERF 236 Query: 251 HRSLKAEVLQG----KWFADSGELQRAFDHWRTVYNLERPHE 288 ++L E Q K + ELQ D + YN ERPH+ Sbjct: 237 VQTLYQEFFQVALRRKRYTSVEELQADLDRYLLYYNWERPHQ 278 >UniRef50_Q39TE2 Putative uncharacterized protein n=2 Tax=Geobacter metallireducens GS-15 RepID=Q39TE2_GEOMG Length = 389 Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 51/196 (26%), Positives = 91/196 (46%), Gaps = 13/196 (6%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 ++G S+C G S + YKW+ R + +R R P PNR++ +I +++M Sbjct: 17 RNGETPESICTSLGKSRSWLYKWVARQNGDDPVWSDERSRCPQSMPNRTTAEIEEIVKMV 76 Query: 85 ----HDRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLLPGASPGIPATGR---- 135 +++ GA+ I LED G +P+ T++ ++AR+ L + A G Sbjct: 77 RLNLYNKGLFCGAQAILWELEDLGVKPLPSTRTINRILARNELTHRRTGKYEAKGTLYPV 136 Query: 136 FEHDAPNRLWQMDFKG-HFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR-ETVQQQLVS 193 PN+ Q D G + G R + L ++D + C H + + + V L Sbjct: 137 LPSALPNQTHQADLVGPCYLTGPIRFYSLNVVDTAT--VRCGLHSSRSKAGQMVIDGLWE 194 Query: 194 VFERYGLPDRMTMDNG 209 V++R G+P+R+ +DN Sbjct: 195 VWKRLGIPERLQVDNA 210 >UniRef50_A5BQ80 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BQ80_VITVI Length = 305 Score = 57.4 bits (137), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 15 VWGIDFMGPFPMSFGYTYILVGVDYVSKWVKAVPCKYNDYRVVIKFLKENIFSRFGVPKA 74 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 75 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQIELANREIKNILMK 126 >UniRef50_P63135 Integrase n=404 Tax=root RepID=POK12_HUMAN Length = 1459 Score = 57.4 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 46/148 (31%), Positives = 69/148 (46%), Gaps = 13/148 (8%) Query: 123 LPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC-TD 181 LP G+ G PN LWQMD H P G + +D +S F A C T Sbjct: 632 LPTQEAGVNPRGL----CPNALWQMDVT-HVPSFGRLSYVHVTVDTYSHF--IWATCQTG 684 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 E V++ L+S F G+P+++ DNG + A + +L + I PY+ Sbjct: 685 ESTSHVKKHLLSCFAVMGVPEKIKTDNGPGYCSK-----AFQKFLSQWKISHTTGIPYNS 739 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGE 269 Q Q +ER +R+LK ++++ K DS E Sbjct: 740 QGQAIVERTNRTLKTQLVKQKEGGDSKE 767 >UniRef50_A3DCZ2 Integrase, catalytic region n=10 Tax=Clostridium RepID=A3DCZ2_CLOTH Length = 278 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 47/192 (24%), Positives = 80/192 (41%), Gaps = 15/192 (7%) Query: 117 MARHGLLPGASPGIPATGR---------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLD 167 M HG PG + G+ + D PN++W +D + G + + ++D Sbjct: 82 MGIHGFCPGPNLSKRIHGKNLYPYLLRNLKIDHPNQVWSIDV-TYCRMKRGFMYMVAIID 140 Query: 168 DHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLM 227 +SR+ + + V + + +RYG P+ M D GS + T+ + L L Sbjct: 141 WYSRYIVGFELSNTLDKTFVIEAIQKAIKRYGKPEIMNSDQGSQF--TSDDYINL---LK 195 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPH 287 GI++ ++ERF RS K E L + +L++ + YN RPH Sbjct: 196 NNGIKISMDGKGRALDNQRIERFFRSYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPH 255 Query: 288 EALDMAVPGSRY 299 ++LD P Y Sbjct: 256 QSLDYKTPAEYY 267 >UniRef50_A5BFS9 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5BFS9_VITVI Length = 2326 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 31/117 (26%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P Sbjct: 931 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVVLKFLKENIFSRFGVPKV 990 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + GI+ + PYHPQT G++E +R +K +++ Sbjct: 991 IISDGGTHFCN-----KPFEALLAKYGIKHKVATPYHPQTSGQVELANREIKNILMK 1042 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 58/117 (49%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1767 VWGIDFMGPFPMSFGYSYILVRVDYVSKWVEAIPCNHNDHRVVLKFLKENIFSRFGVPKA 1826 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + ++ + YHPQT G++E +R +K +++ Sbjct: 1827 IISDGGTHFCN-----KPFETLLAKYRVKHKVATLYHPQTNGQVELANRKIKNILMK 1878 >UniRef50_UPI0001986237 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001986237 Length = 1360 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 14/143 (9%) Query: 145 WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRM 204 W +DF G FP G + L +D S++ +A +++ + ++ ++F R+G+P + Sbjct: 1062 WGLDFMGPFPHSFGNLYILVGVDYVSKWVEAVACKSNDHKVVLKFLKENIFSRFGIPRAI 1121 Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK------AEV 258 D GS + + L + G+R S PYHPQT G+ E +R +K Sbjct: 1122 ISDGGSHFCN-----KPFSTLLQKYGVRHKVSTPYHPQTNGQAELANREIKRILTKVVNT 1176 Query: 259 LQGKWFADSGELQRAFDHWRTVY 281 ++ W S +L A +RT Y Sbjct: 1177 IRKDW---STKLSDALWAYRTAY 1196 >UniRef50_A5BJN2 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5BJN2_VITVI Length = 1380 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + + ++ R ++ ++F R+G+P Sbjct: 919 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPYKHNDHRVVLKFLKENIFSRFGVPKA 978 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 979 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1030 >UniRef50_A3PLB1 Integrase, catalytic region n=59 Tax=Proteobacteria RepID=A3PLB1_RHOS1 Length = 273 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 61/239 (25%), Positives = 97/239 (40%), Gaps = 18/239 (7%) Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 R +D I A++ ++ +G R + L +T+ + + R + G P I Sbjct: 46 RFADPIKAMI----EKEPSFGYRTVAWLLGFNKNTVQRIFQIKSWQVRKRQI-GMRPRIE 100 Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 A APN W D + G ++D H+R L + T L Sbjct: 101 AVPSVAQ-APNERWSTDLCRVWAGRDGWATLALVIDCHTRELLGWHLSRSGKASTAASAL 159 Query: 192 V-SVFERYGLPDRMTM------DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ R+G R+T DNG + T+ +TAL + G++ P+ PQ Sbjct: 160 EHALINRFGTLGRVTKEFLLRSDNGLVF--TSRKYTAL---VRSYGLKQEFITPHCPQQN 214 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 G +ER R+LK + + + F RA W YN RPH+ALDM P + +A Sbjct: 215 GMVERVIRTLKEQCVHRRRFDSLQHAARAIGDWIAFYNHRRPHQALDMKTPAEAFALAA 273 >UniRef50_A5B9R1 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5B9R1_VITVI Length = 2171 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1429 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVXAIPCXXNDHRVVLKFLKENIFSRFGVPKA 1488 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1489 IISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1540 >UniRef50_Q3A4V8 Transposase and inactivated derivatives n=9 Tax=Bacteria RepID=Q3A4V8_PELCD Length = 336 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 71/305 (23%), Positives = 126/305 (41%), Gaps = 29/305 (9%) Query: 5 MPWDARDTMSLRTEFVL-FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRP 63 MP D RD + +FV +A + G + + G++ + Y W QR+ + + Sbjct: 1 MPHDIRDAV---IDFVKHWAKRTGIAVTHIIDWLGLAVSKFYNWQQRYGKAN----EHNA 53 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL 123 IP + A+++ + + G R++ + DQ + S+V+ ++ GLL Sbjct: 54 LIPRDFWLEEKEK-QAIIKFYQQKPQE-GYRRLTFMMLDQDVVAVSPSSVYRVLNAAGLL 111 Query: 124 P--GASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 TG + P+ W +D + G + LLD SR+ + H Sbjct: 112 RRWNGKQSKKGTGFVQPLKPHEHWHIDV-SYINICGTFYYLCCLLDGCSRY---IVHW-- 165 Query: 182 ERRETVQQQLVSVF-----ERY-GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 E RE + + V + E++ R+ DNG + + ++ G+ Sbjct: 166 ELREAMTEANVEIILQRAREKHPAATPRIISDNGPQF-----ITKDFKEFIRVAGMTHVR 220 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 + PY+PQ+ GKLERFH ++K E ++ K E + + YN ER H AL P Sbjct: 221 TSPYYPQSNGKLERFHGTIKQECIRPKVPLSLEEARAQVADYIRYYNDERLHSALGYVAP 280 Query: 296 GSRYQ 300 + + Sbjct: 281 KVKLE 285 >UniRef50_C0VUG3 Transposase n=2 Tax=Corynebacterium glucuronolyticum ATCC 51867 RepID=C0VUG3_9CORY Length = 259 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 46/174 (26%), Positives = 77/174 (44%), Gaps = 11/174 (6%) Query: 12 TMSLRTEFVLF-ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 ++SLR F +DG +R CR G S T Y +R + G AG+ PH Sbjct: 4 SLSLRKRIASFDPVRDGKTVRQFCREVGCSRQTYYNIKKRVKERGEAGIIPDSTAPHTPH 63 Query: 71 NRSSDDITALLRMAHDRHERWGAR----KIKRWLEDQGHTM--PAFSTVHNLMARHGLLP 124 + DD L+ +R +++GA I L D+ + + P+ ST+ +++ HG+ Sbjct: 64 TKFDDDDHRLIPETRERLKKFGADYGPWSIYYALMDENNRLDGPSRSTIARVLSVHGVTE 123 Query: 125 GASPGIPATG--RFEHDAPNRLWQMDFKGH--FPFGGGRCHPLTLLDDHSRFSL 174 + P + RF A N LWQ+D + F + ++DD +R + Sbjct: 124 ENARKRPRSSLKRFARGAANELWQIDAMIYRLFDLHHTQITIYQVIDDATRLDV 177 >UniRef50_UPI0001791F50 PREDICTED: similar to putative gag-pol protein n=1 Tax=Acyrthosiphon pisum RepID=UPI0001791F50 Length = 1042 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 14/143 (9%) Query: 117 MARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 + RH P +P T RF+ +D G P G + LT +D ++ + Sbjct: 730 VIRHTKSPLQHFELPQT-RFQQ------VHIDLIGPLPPSKGNVYCLTCIDRYTSWPEVF 782 Query: 177 AHCTDERRETVQQQLVSV-FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 D + +TV + + R+G+P+++ D G + + +TA + LGI Sbjct: 783 P-IADMKADTVAEAFFNGWIARFGVPEKIITDQGRQFE--SNLFTA---FTNILGINRAR 836 Query: 236 SRPYHPQTQGKLERFHRSLKAEV 258 + PYHPQ GK+ERFH++LK + Sbjct: 837 TTPYHPQANGKIERFHKTLKQSI 859 >UniRef50_A5BPP5 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BPP5_VITVI Length = 1583 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 57/233 (24%), Positives = 106/233 (45%), Gaps = 28/233 (12%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W + F G FP G + L +D S++ + ++ R + ++F R+G+P Sbjct: 1250 VWGIXFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLXFLKENIFSRFGVPKA 1309 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG-- 261 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1310 IISDGGTHFCNK-----PFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMESGN 1364 Query: 262 -KWF-ADSGELQRAFD--HWR----TVYNLERPHEALDMAVPGSRYQP-----SARQYSG 308 K F + L +A D WR ++ + + L+ V G RY+P +Q +G Sbjct: 1365 LKCFNSPEPTLGKASDLRPWRFTSLSLASFWEVKDHLEWQVLGERYEPLQGASEKKQVTG 1424 Query: 309 NTTPPEYDE------GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 TP Y+E + ++ KL VK +++ +A + L EM+E Sbjct: 1425 --TPFLYEEYEPSDLKLQETFFFLNTKLGVKKLNMDLIRAGAKRCLDLNEMEE 1475 >UniRef50_Q1M9G1 Putative transposase-related protein n=2 Tax=Rhizobium RepID=Q1M9G1_RHIL3 Length = 151 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/113 (30%), Positives = 48/113 (42%), Gaps = 5/113 (4%) Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 E V Q L V G P + +D G+ + L+LW G + SRP P Sbjct: 32 EDVVQTLERVCRNVGYPKTIRVDQGTEFVSRD-----LDLWAYAKGATLDFSRPGKPTDN 86 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 +E F+ +AE L WF + + + WR YN ERPH A+ P S Sbjct: 87 AFIEAFNGRFRAECLNLHWFLTLADAREKMEDWRRYYNEERPHGAIGNKPPIS 139 >UniRef50_A5CA04 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5CA04_VITVI Length = 2174 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 5/112 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + + ++ R ++ ++F R+G+P Sbjct: 1440 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPYKQNDHRVVLKFLKENIFSRFGVPKA 1499 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 1500 IISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIK 1546 >UniRef50_A5AWA7 Putative uncharacterized protein n=6 Tax=Vitis vinifera RepID=A5AWA7_VITVI Length = 2136 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1852 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKA 1911 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1912 IISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1963 >UniRef50_UPI000050FEEF transposase n=1 Tax=Brevibacterium linens BL2 RepID=UPI000050FEEF Length = 316 Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 3/128 (2%) Query: 165 LLDDHSRFSLCLAHCTDERRETVQQQLV-SVFERYGLPDRMTMDNGSPWG-DTTGTWTAL 222 ++DD +R+ + + C D T V + +G+P + DNGS + G TAL Sbjct: 4 IVDDSTRYDVGTSACVDPENGTDAVTTVRAAIAAHGVPQELLSDNGSAFNLARQGAVTAL 63 Query: 223 ELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYN 282 +++L G + R P TQGK ER H+++ + + +H+R YN Sbjct: 64 QMYLADQGCKPISGRIKSPTTQGKNERSHQTI-CQYFDAHAPKSVDSVHTLIEHYREYYN 122 Query: 283 LERPHEAL 290 R H++L Sbjct: 123 HRRHHQSL 130 >UniRef50_A9LH60 Integrase n=1 Tax=uncultured planctomycete 13FN RepID=A9LH60_9BACT Length = 209 Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 5/156 (3%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 + +W DF G + L ++D+ +R L + V L +F G P Sbjct: 31 DHVWSYDFVADRLEDGRKIRLLVIIDEFTRECLAIEVARSFTAMQVIDVLQYLFAVRGSP 90 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + DNG + L WL + + P P G +E F+ L+ E+L G Sbjct: 91 KHIRSDNGPEF-----VARKLTKWLKQAAVETLFIAPGSPWENGYVESFNGKLRDELLNG 145 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 + F GE + D W+ YN R H ++D P + Sbjct: 146 ELFLSLGEARWIIDRWQLDYNHHRLHSSIDYQTPAA 181 >UniRef50_A5BI07 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5BI07_VITVI Length = 1803 Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1203 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKA 1262 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1263 IISDGGAHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1314 >UniRef50_A5ASA6 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5ASA6_VITVI Length = 1839 Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 1447 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVDAIPCRSNDHKVVLKFLKENIFSRFGVPKA 1506 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1507 IISDXGTHFCN-----XXFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1558 >UniRef50_Q3I6J4 Pol-polyprotein n=4 Tax=Eukaryota RepID=Q3I6J4_SILLA Length = 1307 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 20/134 (14%) Query: 141 PNRLWQMDFKGHFPFG-GGRCHPLTLLDDHSRFSLCLA---HCTDERRETVQQQLVSVFE 196 P W MD G FP GGR + + +D +++ +A T R+ + + +++ Sbjct: 1023 PFAQWGMDLLGPFPTASGGRKYLIVAVDYFTKWVEAVAVPAKTTAAVRKVIWENIIT--- 1079 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALEL-WLMRLGIRVGHSRPYHPQTQGKLERFHRSL- 254 R+GLP M D+G + W+ + + WL LGI+ +S HPQ+ G+ E ++++ Sbjct: 1080 RFGLPQVMVFDHGREF------WSDMVMNWLEELGIKFAYSSVCHPQSNGQAEAANKTIL 1133 Query: 255 -----KAEVLQGKW 263 K E L+G+W Sbjct: 1134 NGLKKKVEDLKGRW 1147 >UniRef50_UPI0001792682 PREDICTED: similar to putative gag-pol protein, partial n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792682 Length = 1213 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 45/143 (31%), Positives = 64/143 (44%), Gaps = 14/143 (9%) Query: 117 MARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 + RH + P P + RF H +D G P G + LT +D ++R+ Sbjct: 879 VQRHVIAP-VMPFVTPKRRFGH------IHIDLVGPLPSSDGHEYLLTAIDRYTRWPEAY 931 Query: 177 AHCTDERRETVQQQLVSV-FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 T+ V +LVS F R+G PD +T D G + + + AL G R Sbjct: 932 P-LTNMSAHAVADKLVSQWFSRFGTPDVVTTDQGRQF--ESELFAALS---QTYGFRHSR 985 Query: 236 SRPYHPQTQGKLERFHRSLKAEV 258 + PYHPQ G +ER HR LKA + Sbjct: 986 TSPYHPQANGLIERLHRPLKAAL 1008 >UniRef50_A5BJP7 Putative uncharacterized protein n=6 Tax=Vitis vinifera RepID=A5BJP7_VITVI Length = 1265 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 385 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKDNIFSRFGVPKA 444 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 445 IISDGGAHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 496 >UniRef50_B1FI87 Integrase, catalytic region n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FI87_9BURK Length = 223 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 34/94 (36%), Positives = 50/94 (53%), Gaps = 5/94 (5%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR--SSDDITALLRMAHDRHER 90 +C R GIS T KWL+R+ + G GL+ + R P SPNR S D +LR+ R ER Sbjct: 11 VCTRCGISRPTLRKWLRRYQEAGEDGLRSQSRRPLTSPNRKVSDADRATILRL---RAER 67 Query: 91 WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP 124 GAR+I+ L + +T+H ++ + P Sbjct: 68 KGARRIQNELRLHEQRELSLATIHKVLCEATVKP 101 >UniRef50_A5C2R0 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5C2R0_VITVI Length = 2116 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ + ++ ++F R+G+P Sbjct: 1819 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHKVVLKFLKENIFSRFGVPKA 1878 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L R G++ + PYHPQT G++E +R +K +++ Sbjct: 1879 IISDGGAHFCN-----KPFEALLSRYGVKHKVATPYHPQTSGQVELANREIKNILMK 1930 >UniRef50_A5AH70 Putative uncharacterized protein n=20 Tax=Vitis vinifera RepID=A5AH70_VITVI Length = 2203 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 31/117 (26%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P Sbjct: 1519 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVVLKFLKENIFSRFGVPKA 1578 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + GI + PYHPQT G++E R +K +++ Sbjct: 1579 IISDGGTHFCN-----KPFEALLAKYGINHKVATPYHPQTSGQVELAKREIKNILMK 1630 >UniRef50_A5C623 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5C623_VITVI Length = 1694 Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G P Sbjct: 1008 VWGIDFMGPFPMSFGYSYILVRVDYISKWVEAIPCKRNDHRVVLKYLKENIFSRFGEPKA 1067 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E+ L++ G++ + PYHPQT G++E +R +K +++ Sbjct: 1068 IISDGGTHFCN-----KPFEILLVKYGVKHKVATPYHPQTFGQVELANRDIKNILMK 1119 >UniRef50_A5BXG2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BXG2_VITVI Length = 1268 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 5/112 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + + ++ R ++ ++F R+G+P Sbjct: 1110 VWGIDFMGPFPMSFGNSYILVRVDYVSKWVEAIPNKHNDHRVVLKFLKENIFLRFGVPKA 1169 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 1170 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIK 1216 >UniRef50_A5B5G8 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5G8_VITVI Length = 1856 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 1420 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKENIFSRFGVPKA 1479 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1480 IINDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANRKIKNILMK 1531 >UniRef50_A5ASD2 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5ASD2_VITVI Length = 1801 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1504 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKKNIFSRFGVPKA 1563 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1564 IISDRGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1615 >UniRef50_A5AHC2 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AHC2_VITVI Length = 1270 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 385 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGVPKA 444 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 445 IISDGGTHFCN-----KPFETLLAKYGVKHKEATPYHPQTSGQVELANREIKNILMK 496 >UniRef50_A5BKJ4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BKJ4_VITVI Length = 922 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 36/144 (25%), Positives = 69/144 (47%), Gaps = 14/144 (9%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 127 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAILCKQNDHRVVLKFLKENIFSRFGVPKA 186 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG-- 261 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 187 IISDEGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTFGQVELANREIKNILMKVVN 241 Query: 262 ----KWFADSGELQRAFDHWRTVY 281 WF +L + +RT Y Sbjct: 242 TSRRNWFV---KLHDSLWAYRTTY 262 >UniRef50_C7C202 Gag-Pol polyprotein n=1 Tax=Schistosoma japonicum RepID=C7C202_SCHJA Length = 1507 Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 39/136 (28%), Positives = 68/136 (50%), Gaps = 15/136 (11%) Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRF-SLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G PF G + L +D +S++ + H + TV+ L +F R+G+PD + Sbjct: 1208 LDFAG--PFQGA--YFLVCVDAYSKWPEIFPMHHITLQATTVK--LRHLFSRFGVPDVLV 1261 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFA 265 DNG T T + + + G++ + PYHPQ+ G+ ERF + K +L+ K Sbjct: 1262 TDNG-----TQFTSSIFSQFCGKFGVKHVRAPPYHPQSNGQAERFVDTFKRALLKAK--- 1313 Query: 266 DSGELQRAFDHWRTVY 281 G+++ D + +Y Sbjct: 1314 GEGKIKEILDDFLLIY 1329 >UniRef50_A5CBG5 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5CBG5_VITVI Length = 2329 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 30/112 (26%), Positives = 57/112 (50%), Gaps = 5/112 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 2102 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCRKNDHRVVLKFLKENIFSRFGVPKA 2161 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 2162 IISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIK 2208 >UniRef50_UPI0001927064 PREDICTED: similar to vascular endothelial growth factor receptor n=1 Tax=Hydra magnipapillata RepID=UPI0001927064 Length = 1829 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 12/122 (9%) Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 I AT FE + +DFKG P + LT++D++SRF D +T+ Sbjct: 98 IKATQPFERIS------IDFKGPLPSSTPEQYMLTIVDEYSRFPFAYP-VKDMTTQTIIN 150 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L +F +G+P + D GS + L+ W + GI + PY+P G++ER Sbjct: 151 CLDDLFSMFGMPSYVLSDRGSSLMSSE-----LKHWFLSKGIATNRTTPYNPTGNGQVER 205 Query: 250 FH 251 ++ Sbjct: 206 YN 207 >UniRef50_A5BTM1 Putative uncharacterized protein n=31 Tax=Vitis vinifera RepID=A5BTM1_VITVI Length = 2292 Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1469 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVKAIPCKHNDHRVVLKFLKENIFSRFGVPKA 1528 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E+ ++ +K +++ Sbjct: 1529 IISDGGTHFCN-----RPFETLLAKYGVKHKVATPYHPQTSGQVEQANKGIKNILMK 1580 >UniRef50_D1PR45 Transposase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PR45_9FIRM Length = 394 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 65/277 (23%), Positives = 97/277 (35%), Gaps = 33/277 (11%) Query: 34 CRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT-ALLRMAHDRH-ERW 91 CR FGI + Y QR P + N+ D++ L+ + R +R Sbjct: 110 CRLFGIRKSNFYYRTQRR--------------PAKTQNQQRDEVLRPLVEKIYLRSGKRI 155 Query: 92 GARKIKRWLEDQGHTMP---AFSTVHNLMARHGLLPGASPGIPATGRFEH---------- 138 + I++ L DQG ++ S + A G A RF H Sbjct: 156 SSEAIRQKLLDQGISISKRKVLSFLQEWKADKGTTSARMSAAAAQRRFCHLNLLDRQFNP 215 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 +APN+ W D + GG+ + +LD SR L V + + F R Sbjct: 216 EAPNKAWVSDI-TELHYAGGKLYLCVVLDLFSRKVLAARASCQNDTALVARTFETAFLRR 274 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 G P + + T+ + L L +R S P P +E F SLK E Sbjct: 275 GRPRGLLFHSDQGRQYTSDYFREL---LEEFSVRQSFSTPGVPYDNAVMESFFASLKKEE 331 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 + G L + + YN +RPH L P Sbjct: 332 YHRYCYKSIGALLDSVQQYLLFYNRQRPHSRLGYRTP 368 >UniRef50_A5BI69 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BI69_VITVI Length = 1628 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 1065 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEVIPCRSNDHKVVLKFLKENIFARFGVPKS 1124 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1125 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1176 >UniRef50_B0RNN4 ISxcc1 transposase ORFB, n=4 Tax=Xanthomonas RepID=B0RNN4_XANCB Length = 166 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 5/133 (3%) Query: 164 TLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALE 223 T++DD SR +L + + V + L + G P ++ +DNG + AL Sbjct: 23 TVIDDFSREALAIEVDLNLPAARVIRTLERIAAWRGYPGKLRLDNGPEF-----VALALA 77 Query: 224 LWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNL 283 W R GI + P P G +ERF+ S + VL F E++ +HW YN Sbjct: 78 EWAERKGIALDFIEPGQPMQNGFIERFNGSYRRGVLDMHIFRTLSEVREQTEHWLADYNQ 137 Query: 284 ERPHEALDMAVPG 296 + PH++L P Sbjct: 138 QIPHDSLGGLTPA 150 >UniRef50_A5BT93 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BT93_VITVI Length = 1184 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +BF G FP G + L +D S++ + T+ + ++ ++F R+G+P Sbjct: 798 VWGIBFMGPFPMSFGHXYILVGVDYVSKWVEAIPCRTNXHKVVLKFLKENIFSRFGVPKA 857 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 858 IISDGGTHFCN-----KPFEALLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 909 >UniRef50_A5C4R5 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C4R5_VITVI Length = 1398 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + + R ++ ++F R+G+P Sbjct: 1101 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNVHRVVLKFLKENIFSRFGVPKA 1160 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1161 IISDGGTHFCNK-----PFEALLAKYGVKHKLATPYHPQTSGQVELANREIKNILMK 1212 >UniRef50_A5B2X9 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5B2X9_VITVI Length = 1595 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 1298 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGVPKA 1357 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1358 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1409 >UniRef50_UPI00017B5545 UPI00017B5545 related cluster n=4 Tax=Tetraodontidae RepID=UPI00017B5545 Length = 1392 Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 12/129 (9%) Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 IPA RF+H +D G P G H LT++D +R+ + + + + Sbjct: 169 IPAR-RFDH------VHVDLVGPLPPSHGYTHLLTMVDRTTRWPEVVPLSSTTSADVARA 221 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L R+G P +T D G + + W+++ LG +V + YHPQ G ER Sbjct: 222 FLSVWVARFGSPSDITSDRGPQF--VSDLWSSMA---RSLGTQVHRTTAYHPQANGLCER 276 Query: 250 FHRSLKAEV 258 FHRSLKA + Sbjct: 277 FHRSLKAAL 285 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 12/129 (9%) Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 IPA RF+H +D G P G H LT++D +R+ + + + + Sbjct: 997 IPAR-RFDH------VHVDLVGPLPPSHGYTHLLTMVDRTTRWPEVVPLSSTTSADVARA 1049 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L R+G P +T D G + + W+++ LG +V + YHPQ G ER Sbjct: 1050 FLSVWVARFGSPSDITSDRGPQF--VSELWSSMA---RSLGTQVHRTTAYHPQANGLCER 1104 Query: 250 FHRSLKAEV 258 FHRSLKA + Sbjct: 1105 FHRSLKAAL 1113 >UniRef50_Q0ZCB7 Integrase n=4 Tax=Eukaryota RepID=Q0ZCB7_POPTR Length = 1332 Score = 54.3 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 30/111 (27%), Positives = 56/111 (50%), Gaps = 5/111 (4%) Query: 145 WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRM 204 W +DF G FP G + L +D S++ + ++ + ++ ++ R+G+P M Sbjct: 899 WGIDFMGPFPPSFGFLYILVAVDYVSKWIEAIPSRNNDHKTVIKFLKENILSRFGIPRAM 958 Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 D G+ + +T+ E + + GI + PYHPQT G++E +R +K Sbjct: 959 ISDGGTHFCNTS-----FESLMKKYGITHKVATPYHPQTSGQIELANREIK 1004 >UniRef50_UPI00015B4786 PREDICTED: hypothetical protein n=2 Tax=Nasonia vitripennis RepID=UPI00015B4786 Length = 1208 Score = 54.3 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 58/126 (46%), Gaps = 15/126 (11%) Query: 128 PGIPATGRFEHDAPNRLW---QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 P IP T PN+ W DF G PF G L ++D HS++ + + + Sbjct: 939 PNIPLT---PWPWPNKAWSRIHCDFLG--PFMGHMY--LVVIDAHSKWPEVIDFHNNTKA 991 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 E + + +F R+GL + + DNG + T + +L +LGI+ S PYHP T Sbjct: 992 EKLISKFRDIFARHGLANHIVTDNGPQF-----TSDLFQNYLKKLGIKHTFSPPYHPATN 1046 Query: 245 GKLERF 250 G E F Sbjct: 1047 GAAENF 1052 >UniRef50_A5AKZ0 Putative uncharacterized protein n=18 Tax=Vitis vinifera RepID=A5AKZ0_VITVI Length = 2140 Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 1369 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGVPKA 1428 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1429 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1480 >UniRef50_A5AYT6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AYT6_VITVI Length = 1897 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 29/114 (25%), Positives = 59/114 (51%), Gaps = 5/114 (4%) Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P + Sbjct: 1445 IDFMGPFPMSFGHSYILVGVDYVSKWVEVIPCXTNDHKVVLKFLRENIFSRFGVPKAIIS 1504 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1505 DGGTHFCN-----KPFEALLAKYGVKHKVATPYHPQTSGQVELSNREIKNILMK 1553 >UniRef50_P25438 Insertion element IS476 uncharacterized 39.2 kDa protein n=21 Tax=Proteobacteria RepID=YI61_XANEU Length = 346 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 45/173 (26%), Positives = 65/173 (37%), Gaps = 6/173 (3%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN W MDF R LT++DD +R S+ +A V + L G Sbjct: 173 PNDTWSMDFVFDALANARRIKCLTVVDDFTRESVDIAVDHGISGAYVVRLLDQAACFRGY 232 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG + T A W + GI P P +E F+ + E L Sbjct: 233 PRAVRTDNGPEF-----TSRAFIAWTQQHGIEHILIEPGAPTQNAYIESFNGKFRDECLN 287 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPP 313 WF + + WR YN RPH + +P +++ + R N P Sbjct: 288 EHWFTSLAQARDVIADWRRHYNQIRPHSSCGR-IPPAQFAANYRTQQANNAVP 339 >UniRef50_B2Q343 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q343_PROST Length = 156 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 41/154 (26%), Positives = 69/154 (44%), Gaps = 9/154 (5%) Query: 196 ERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 E G P+++T DNG+ + T A+ W GI +G +P +P +E + + Sbjct: 2 EERGKPNKVTCDNGTEF-----TSKAMFFWSKETGITLGFIQPENPTQNAFVESLNGKFR 56 Query: 256 AEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMAVPGSRYQPSARQYSGNTTPPE 314 + L F E + + WR YN RPH +L DM Q +++ YS Sbjct: 57 NKCLNPHGFRTLDEARYEIELWREHYNHVRPHSSLNDMPPVAYAKQVTSQAYS--RLGYI 114 Query: 315 YDEGVMV-RKVDISGKLSVKGVSLSAGKAFRGER 347 Y EG+ V R + ISG+ ++ + + +G + Sbjct: 115 YQEGIGVKRNLGISGQYFIQANVANTLNSLKGNK 148 >UniRef50_C4FKG6 Integrase, catalytic region n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FKG6_9AQUI Length = 305 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 53/229 (23%), Positives = 90/229 (39%), Gaps = 22/229 (9%) Query: 85 HDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGLLPGASPGIPATGRFEH---- 138 + H +GAR++++ LE G + S + M L P I ++ Sbjct: 66 YTEHPYYGARRMQKALESIGIKVGKRKLSRTYKFMGIRALYPPPKTTILNKENKKYPYLL 125 Query: 139 ------DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC--LAHCTDERRETVQQQ 190 PN++W D + G + T++D HS+ L L + D T Sbjct: 126 EQITTTQRPNQIWSGDI-TYIKLEKGYAYLATIIDWHSKKVLSWKLGNTMDSYLTT--SI 182 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 L ERYG P+ D GS + +E+ L + GI++ + +ERF Sbjct: 183 LEEAIERYGKPEIFNSDQGSQYTSKE----HIEI-LEKNGIKISMNANGRSIDNTVIERF 237 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 R+LK E + K + E + + + +YN +R H ++ P Y Sbjct: 238 WRALKYENVYPKGYNTIKEAREGINQYIEIYNSQRIHSSIGYKTPDMVY 286 >UniRef50_A5CBC9 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5CBC9_VITVI Length = 926 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 57/112 (50%), Gaps = 5/112 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 350 VWGIDFMGPFPMSFGYSYILVGVDYVSKWVEAVPCKHNDHRVVLKFLKENIFSRFGVPKA 409 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 410 IISDGGTHFCN-----KPFETLLTKYGVKHKVATPYHPQTSGQVELANREIK 456 >UniRef50_A1UPZ6 Histidine kinase n=2 Tax=Mycobacterium RepID=A1UPZ6_MYCSK Length = 151 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 29/74 (39%), Positives = 37/74 (50%) Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMA 293 G SRPY PQT GK+ERFH++L E + + + F W YN R H AL Sbjct: 78 GRSRPYRPQTNGKVERFHQTLGDEWAYTRLYTSDAQRCEQFPIWLHTYNYHRGHTALGGQ 137 Query: 294 VPGSRYQPSARQYS 307 P +R + QYS Sbjct: 138 PPATRIPNLSGQYS 151 >UniRef50_A4J392 Integrase, catalytic region n=22 Tax=Clostridia RepID=A4J392_DESRM Length = 459 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 67/329 (20%), Positives = 122/329 (37%), Gaps = 23/329 (6%) Query: 40 SPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRW 99 SP T WL + + G L+ R + S +I +R + R + + Sbjct: 52 SPKTLMCWLSDYRRGGLDSLKPGYRSDKGKSRKVSLEIADEIRKKRSQMPRITSALLYEE 111 Query: 100 LEDQGHTMP---AFSTVHNLMARHGLLPGA----SPGIPATGRFEHDAPNRLWQMD--FK 150 L +P + +T + + + L +PG RF H N LWQ D F Sbjct: 112 LVKDKVILPEKLSRATFYRFLVANPELAAGKDPENPGEKELKRFSHQRINELWQTDIMFG 171 Query: 151 GHFPFGGGR--CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDN 208 + G + + + +DD SR + ++ L + G+P + DN Sbjct: 172 PYISIGKSKKQAYLIAFIDDASRLITHAQFFFFQNFVALRVALKEAVLKRGIPKMIYTDN 231 Query: 209 GSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ---GKWFA 265 G + L + LG + H+ P+ P ++GK+ERF +++ L Sbjct: 232 GKVYRSDQ-----LNMLCAGLGCSLIHTEPFTPTSKGKIERFFHTVRQRFLSRLDPTKLK 286 Query: 266 DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV---R 322 +L F W + H AL+M+ P + + P +E ++ R Sbjct: 287 SLDQLNLYFWQWLEEDYQCKTHSALNMS-PLDFFMAQVHNINFLANPQLLEEHFLLRVTR 345 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLK 351 KV+ LSV+ + ++ R+ ++ Sbjct: 346 KVNHDATLSVESILYETEQSLANSRLEVR 374 >UniRef50_Q2W8I9 Transposase and inactivated derivative n=89 Tax=Bacteria RepID=Q2W8I9_MAGSA Length = 416 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 44/172 (25%), Positives = 70/172 (40%), Gaps = 7/172 (4%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N W +DF G R L ++DD +R L T V ++L ++ R G P Sbjct: 116 NARWSLDFVHDQLACGRRFRILNVVDDVTRECLAAIPDTSISGARVTRELSALIARRGRP 175 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + + DNG T T A+ W + I + P P G +E F+ ++ E+L Sbjct: 176 EMIVSDNG-----TELTSNAVLAWKQQQRIDWHYIAPGKPMQNGFVESFNGRMRDELLNE 230 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPP 313 F + + W YN RPH +L P + + + G ++PP Sbjct: 231 TVFTSLPQARAVIAAWADDYNTARPHSSLGYQTPAA--HAAKLKAMGPSSPP 280 >UniRef50_C7MBS5 Transposase n=2 Tax=Micrococcineae RepID=C7MBS5_BRAFD Length = 365 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 73/291 (25%), Positives = 107/291 (36%), Gaps = 42/291 (14%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 G I + F I+ AT KW+ R+ GAAGL++ P H P+R + L+ Sbjct: 24 GRPIAHVAAEFHIARATLSKWVGRYRAAGAAGLEEHSSAPAHRPSRLEGWVVELIEHWR- 82 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL--LPGASP---GIPATGRFEHDAP 141 R ++W AR+I R L D TV + R GL + +P + G P Sbjct: 83 RKQKWSARRIARELADGHGVHCCVRTVTRWLDRLGLNRIRDITPDGGNLRQPGTITARYP 142 Query: 142 NRLWQMDFK--GHFPFGG-----------------------GRCHPLTLLDDHSRFSLCL 176 + +D K G P GG G + + +D SR + Sbjct: 143 GHMIHVDVKKVGKIPDGGGWKVHGRDSALGRASKRGKGRRVGYTYLHSAIDGFSRLAYTE 202 Query: 177 AHCTDERRETV---QQQLVSVFERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 DE T + + F +G+ R+ DNG + A + Sbjct: 203 P-LEDETAATTIGFLHRAFAFFAAHGITRITRLISDNGPNYRS-----NAFARSIRGKVS 256 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYN 282 R +RPY P+ GK+ERF R E L + F E + W YN Sbjct: 257 RHQRTRPYTPRHNGKVERFQRITVDEFLYAEVFESEQERRNRHGVWLHHYN 307 >UniRef50_B0SFL5 Transposase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SFL5_LEPBA Length = 206 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 41/155 (26%), Positives = 67/155 (43%), Gaps = 8/155 (5%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN +W +DF G + L+ +D +R ++ L+ E + + L S+ E L Sbjct: 51 PNEVWAIDFLHERTIDGRKARILSGVDLCTRENVVLSADYSISSERLIRFLESLPE---L 107 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P DNG + T WL + I + + P P +E F+ L++E LQ Sbjct: 108 PKSFITDNGPEF-----TSRVFINWLSKNNIGISYIDPGKPTQNAFVESFNGKLRSECLQ 162 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 + D EL+ ++ YN ER H +L+ P Sbjct: 163 LSFCRDLTELRNELSKFQKDYNEERLHSSLNYLTP 197 >UniRef50_A5AWF5 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5AWF5_VITVI Length = 2072 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 58/117 (49%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ + ++ ++F R+G+P Sbjct: 1476 VWGIDFMGPFPMSFGNSYILVRVDYVSKWVEAIPCKHNDHKVVLKFLKENIFSRFGVPKA 1535 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G T E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1536 IISDGG-----THFCIRPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNMLMK 1587 >UniRef50_A5AKV0 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5AKV0_VITVI Length = 2067 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P Sbjct: 553 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVVLKFLKENIFSRFGVPKA 612 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + ++ + PYHPQT G++E +R +K +++ Sbjct: 613 IISDGGTHFCN-----KPFEALLAKYRVKHKVATPYHPQTSGQVELANREIKNILMK 664 >UniRef50_P31623 Integrase n=24 Tax=root RepID=POL_JSRV Length = 870 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 7/122 (5%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN LWQ D H P G + +D S F + H + R +Q L+ F G+ Sbjct: 648 PNHLWQTDVT-HIPQFGRLKYVHVSIDTFSNFLMASLHTGESTRHCIQH-LLFCFSTSGI 705 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG + T + + + + I PY+PQ QG +ER H+ +K ++L+ Sbjct: 706 PQTLKTDNGPGY-----TSRSFQRFCLSFQIHHKTGIPYNPQGQGIVERAHQRIKHQLLK 760 Query: 261 GK 262 K Sbjct: 761 QK 762 >UniRef50_UPI0000E49205 PREDICTED: similar to novel transposon n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49205 Length = 188 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 29/97 (29%), Positives = 50/97 (51%), Gaps = 7/97 (7%) Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ET +L ++F RYG+P + DNG+ + T + + ++ GI+ S PYHP + Sbjct: 7 ETTISRLRALFSRYGIPQILVSDNGTQF-----TSSKFQQFVKSNGIKHKFSAPYHPSSN 61 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVY 281 G+ ERF ++LK + + D G +Q + + Y Sbjct: 62 GQAERFVQTLKQALRAAR--KDGGSIQAKLERFLFAY 96 >UniRef50_UPI0000F1E4F0 PREDICTED: similar to LReO_3 n=1 Tax=Danio rerio RepID=UPI0000F1E4F0 Length = 1276 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 39/131 (29%), Positives = 62/131 (47%), Gaps = 9/131 (6%) Query: 140 APNRLWQMDFKGHFP-FGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 P R MD G G + L + D +R+ T + VQ L +F R Sbjct: 487 TPFRRIAMDIVGPLERSSAGHRYILVVCDYATRYPEAFPLRTVTTSKVVQA-LTELFSRV 545 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 G+PD + D G+ + T + +LGI+ + PYHPQT G +ERF+ +LK+ Sbjct: 546 GIPDEIITDQGTNFMSRVMTQ-----FHQQLGIKALKTTPYHPQTDGLVERFNGTLKS-- 598 Query: 259 LQGKWFADSGE 269 + K+ +D+G+ Sbjct: 599 MLRKFVSDTGK 609 >UniRef50_Q9LHC0 Retroelement pol polyprotein-like n=440 Tax=Spermatophyta RepID=Q9LHC0_ARATH Length = 897 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 31/113 (27%), Positives = 56/113 (49%), Gaps = 5/113 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ +A T++ + ++ +F R+G+P Sbjct: 601 VWGIDFMGLFPSSYGNKYILVAIDYVSKWVEAIAIPTNDAKVVLKLFKTIIFPRFGVPRV 660 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 + D G + + E L + G++ + PYHPQT G++E R +K Sbjct: 661 VISDGGKHFIN-----KVFENLLKKHGVKHKVATPYHPQTSGQVEISDREIKT 708 >UniRef50_A5AQ03 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5AQ03_VITVI Length = 1873 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ +F R+G+P Sbjct: 1588 VWGIDFMGPFPMSFGHSYILVGVDYISKWVEAIPCRSNDHKVVLKFLKDHIFARFGVPKA 1647 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1648 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1699 >UniRef50_A5G4C5 Putative uncharacterized protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5G4C5_GEOUR Length = 390 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 57/265 (21%), Positives = 109/265 (41%), Gaps = 16/265 (6%) Query: 45 YKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE-----RWGARKIKRW 99 +KWL R+ ++ R P P+ S ++ +R + + G IK Sbjct: 34 FKWLNRYQSGATDWYKEHSRAPLKRPSELSIVDKEIIVSTRNRLDSSPFAQIGVSAIKWE 93 Query: 100 LEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRF----EHDAPNRLWQMDFKG-HFP 154 L G + ST++ + R GL+ + P + E N + QMD G + Sbjct: 94 LHKLGLPFRSDSTINRTLKREGLVKKKTRYSPKGVEYPYFTEALCCNNIHQMDLVGPRYI 153 Query: 155 FGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD 214 GR + + ++D +S ++ T E + + Q L+ ++ GLPD + MDN + Sbjct: 154 KSDGRFYSMNVIDLYSHRVFIESNRTKED-DNIAQGLLRCWKSMGLPDFLQMDNELSFRG 212 Query: 215 TTGTWTALELWLMRLGIRVGHSRPYHPQTQ----GKLERFHRSLKAEVLQGKWFADSGEL 270 + +L L ++RL + G + P + G +E F+ + + + +WF L Sbjct: 213 SNRYPRSLGL-VLRLCLYFGVHPVFIPVAEPWRDGVIESFNDTYDKKFFRRQWFTSYSML 271 Query: 271 QRAFDHWRTVYNLERPHEALDMAVP 295 +R +++ +N + L P Sbjct: 272 KRQSKNFQQFHNKNHRYSYLKGKTP 296 >UniRef50_UPI000038392B COG2801: Transposase and inactivated derivatives n=2 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI000038392B Length = 239 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 50/203 (24%), Positives = 84/203 (41%), Gaps = 23/203 (11%) Query: 111 STVHNLMARHGL--LPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDD 168 S++H + RHG+ LP P RF+ +D G + L ++D+ Sbjct: 33 SSLHRCLQRHGISRLPEVDGDKPRRSRFKAYP----LVLDVSE-----GRKFRMLNVVDE 83 Query: 169 HSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMR 228 +R L + + V L +F G+P + DNG + +++ W+ Sbjct: 84 FTRECLAIRVSCKLKAADVIDVLSDLFILRGVPGHVRSDNGPEF-----IARSVQSWIAA 138 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 +G + + P P +E F+ L+ E+L G+ F E Q + WR +N RPH Sbjct: 139 VGSQTAYIAPGSPWENSYVESFNARLRDELLNGEIFYTLQEAQIIIESWRRHHNTIRPHG 198 Query: 289 AL-------DMAVPGSRYQPSAR 304 AL ++ VP P+AR Sbjct: 199 ALGYKPSAPEVFVPAPTAWPAAR 221 >UniRef50_A5C1P8 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5C1P8_VITVI Length = 1601 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 56/111 (50%), Gaps = 5/111 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1307 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKS 1366 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 + D G+ + + E L + G++ + PYHPQT G++E +R + Sbjct: 1367 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREI 1412 >UniRef50_A0L7D7 Integrase, catalytic region n=5 Tax=Bacteria RepID=A0L7D7_MAGSM Length = 272 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 53/216 (24%), Positives = 91/216 (42%), Gaps = 9/216 (4%) Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP--GIPATGRFEHDAPNRLW 145 H G R++ + D + ++V ++ GL+ SP TG + P++ W Sbjct: 30 HPDEGYRRLTYMMLDADVVAVSPASVLRVLRAAGLMRKWSPPPSQKGTGFKQPLEPHKHW 89 Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY-GLPDRM 204 +D + G + ++LD SRF L + + V+ L+ E Y R+ Sbjct: 90 HIDI-SYLNIQGTFYYLCSVLDGCSRFILHWEIRESMKEDEVEVILLRAKEAYPEAKPRV 148 Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWF 264 DNG + + ++ G+ + PY+PQ+ GKLERFH +LK E ++ + Sbjct: 149 ISDNGPQF-----VAKDFKTFIRESGMTHVRTSPYYPQSNGKLERFHGTLKRECIRPQTP 203 Query: 265 ADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 + QR + + YN R H A P R + Sbjct: 204 LSLEDAQRVVEGYVEHYNTYRLHSATGYITPKDRLE 239 >UniRef50_A4WYC8 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WYC8_RHOS5 Length = 174 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 26/79 (32%), Positives = 36/79 (45%) Query: 220 TALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT 279 TA+ W+ +G R P P G E F+ L+ E+L G+ F E + + WR Sbjct: 70 TAVRTWIAAVGARCAFIEPGSPWENGYCESFNSKLRDELLNGEIFYSLAEARIVIEFWRQ 129 Query: 280 VYNLERPHEALDMAVPGSR 298 YN RPH +L P R Sbjct: 130 HYNTRRPHSSLGYRPPAPR 148 >UniRef50_A1V109 A, transposase OrfB n=56 Tax=Proteobacteria RepID=A1V109_BURMS Length = 797 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 58/235 (24%), Positives = 94/235 (40%), Gaps = 31/235 (13%) Query: 70 PNRSSDDITA-LLRMAHDRHERWGARKIKRWLEDQG-HT---------MPAFSTVHNLMA 118 P+ ++ + A L+++AH+R R+G R++ +E +G H A V Sbjct: 32 PDHENEVLAARLVKLAHERR-RFGYRRLHALVEREGTHANHKRIYRLYREAGLAVRRRRK 90 Query: 119 RHGLLPG----ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 RHG++ A PG APN +W +DF G R LT++DD ++ ++ Sbjct: 91 RHGVMIEREQLALPG----------APNEVWSIDFVMDALSNGRRVKCLTVVDDFTKEAV 140 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 + V + L G P + D G + T AL+ W G+ + Sbjct: 141 DIVVDHGISGLYVARALDRAARFRGYPKAVRTDQGPEF-----TSRALDQWAYANGVTLK 195 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEA 289 + P +E F+ + E L WF + WR YN +RPH A Sbjct: 196 LIQAGKPTQNAYIESFNGKFRDECLNEHWFTTLAHARAVIAAWRQGYNEQRPHHA 250 >UniRef50_UPI0001B416F4 ISA0963-5 transposase n=6 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0001B416F4 Length = 318 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 51/242 (21%), Positives = 103/242 (42%), Gaps = 33/242 (13%) Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 +S DI + ++ ++ + G +I+++L+ +G + A + ++ ++ ++ ++ Sbjct: 90 NSKDIEIVKKIRYE-YPMSGPERIRKYLKRKG-IIIAKNNIYRILLLLNMVDNSNNKKKQ 147 Query: 133 TG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 ++E N LW MD+ + + + DD SRF + + +E + + Sbjct: 148 RKYIKYERKHSNSLWHMDWTKY----SDSEKLIIIEDDASRFIVGMGIYGEETIDNTIEA 203 Query: 191 LVSVFERYGLPDRMTMDNGSPW-----GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 L YG P+ + D+G+ + G + +L I+ R HP+T G Sbjct: 204 LEIAINTYGKPEEILTDHGTQFFSNGKNGIPGDHNKFQEYLDNSNIKHILGRVKHPETNG 263 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV---YNLERPHEAL----DMAVPGSR 298 KLER + ++K L+ F W V YN ER H++L ++ P Sbjct: 264 KLERLNYTIK-------------RLRPYFSTWEEVVYHYNYERMHDSLSDGDNIVTPAMA 310 Query: 299 YQ 300 Y+ Sbjct: 311 YK 312 >UniRef50_A5BAC6 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BAC6_VITVI Length = 1485 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 56/112 (50%), Gaps = 5/112 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1208 VWGIDFMGPFPMSFGNSYILVEVDYVSKWVEAILCKHNDHRVVLKFLRENIFSRFGVPKA 1267 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + D G+ + + E L + G++ + PYHPQ G++E +R +K Sbjct: 1268 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQNSGQVELANREIK 1314 >UniRef50_A6Q4E4 Transposase n=2 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q4E4_NITSB Length = 271 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 42/170 (24%), Positives = 69/170 (40%), Gaps = 12/170 (7%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSR----FSLCLAHCTDERRETVQQQLVSVF- 195 PN+ W +D + G G ++D ++R + L + +Q+ L+ F Sbjct: 107 PNQRWAIDMTRVYSSGDGWSTLACVIDTYTREIVGWRLSKSGKATTAEAVLQEGLIYRFG 166 Query: 196 --ERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 +R P + DNG + + T TA + + I PY P+ G +ERF R+ Sbjct: 167 KLKRLQEPIILRSDNGLVFSSKSFTKTAQDYNFTQEFIT-----PYTPEQNGMIERFFRT 221 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +K E + F E + W YN +R H AL P ++ A Sbjct: 222 IKEECIWHYNFKSLKEANKIIGEWINFYNQKRKHSALQYKTPAEVFRLVA 271 >UniRef50_A5AY91 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AY91_VITVI Length = 1162 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ +F R+G+P Sbjct: 156 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDHIFARFGVPKA 215 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 216 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANRXIKNILMK 267 >UniRef50_Q1N8F6 Transposase n=2 Tax=Sphingomonas RepID=Q1N8F6_9SPHN Length = 466 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 65/264 (24%), Positives = 107/264 (40%), Gaps = 47/264 (17%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR-------IPHHSPNRSSDDIT 78 DG + L R +S T +WL R+ EG AGL PR +P H + Sbjct: 24 DGIPLADLARTGTLSERTLQRWLGRYRAEGLAGLARLPRNDRGRLHLPEH--------LV 75 Query: 79 ALLRMAHDRHERWGARKIKRWLED----QGHTMPAFSTVHNLMARHGLLPGASPGIPATG 134 L R + R I R +++ GH P+++ V ++ A+ PA Sbjct: 76 ELTRTLATKRPRPPVAAIHRKVQELAIAHGHRTPSYAAVARVVRAIPASQIAAASDPAVY 135 Query: 135 RFEHD--------APNRLWQMDFKGH----FPFGGGRCHP--LTLLDDHSR----FSLCL 176 R +H+ N +WQ D G P ++DDHSR + L L Sbjct: 136 RDQHELVHRREAATSNEMWQADHTVLDILVLDDAGTPVRPWLTVIVDDHSRAIAGYFLSL 195 Query: 177 -----AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 + R+ + ++ + G+P+++ +DNGS + +E + L I Sbjct: 196 DAPSALNTALALRQAIWRKPNPEWIVSGIPEQLYVDNGSDF-----ISEHIEQACIALKI 250 Query: 232 RVGHSRPYHPQTQGKLERFHRSLK 255 R+ HS P P+ +GK+ER R++ Sbjct: 251 RLIHSLPGRPRGRGKIERLFRTIN 274 >UniRef50_Q2ILP8 Integrase n=2 Tax=Anaeromyxobacter RepID=Q2ILP8_ANADE Length = 280 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 28/92 (30%), Positives = 43/92 (46%), Gaps = 5/92 (5%) Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWF 264 ++DNGS + T ++ W GI++ P P G +E F+ + E L WF Sbjct: 180 SVDNGSEF-----TSHTVDAWAYERGIKLDFITPGKPTENGHIESFNGKFRDECLNENWF 234 Query: 265 ADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 + +R F+ +R YN RPH +LD P Sbjct: 235 ISLDDARRKFEVFRVDYNEVRPHSSLDNQTPN 266 >UniRef50_Q0ZCC0 Gag protein n=2 Tax=Populus trichocarpa RepID=Q0ZCC0_POPTR Length = 1886 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 30/111 (27%), Positives = 55/111 (49%), Gaps = 5/111 (4%) Query: 145 WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRM 204 W +DF G FP G + L +D S++ + T++ + ++ ++ R+G+P M Sbjct: 960 WGIDFMGPFPPSFGFLYILVAVDYVSKWIEAIPSRTNDHKTVIKFLKENILSRFGIPRAM 1019 Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 D G+ + + E + + GI + PYHPQT G++E +R +K Sbjct: 1020 ISDGGTHFCN-----KPFESLMKKYGITHKVATPYHPQTSGQVELANREIK 1065 >UniRef50_UPI00016E16D4 UPI00016E16D4 related cluster n=2 Tax=Takifugu rubripes RepID=UPI00016E16D4 Length = 899 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 40/130 (30%), Positives = 60/130 (46%), Gaps = 12/130 (9%) Query: 129 GIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 IPA RF+H +D G P G H LT++D R+ + + + + Sbjct: 651 AIPAR-RFDH------VHVDPVGPLPPSHGYTHLLTMVDRTIRWPEVVPLSSTTSADVAR 703 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 L + R+G P +T D G + + W+++ LG +V + YHPQ G E Sbjct: 704 AFLSAWVARFGSPSDITSDRGPQF--VSELWSSMA---RSLGTQVHRTTAYHPQANGLCE 758 Query: 249 RFHRSLKAEV 258 RFHRSLKA + Sbjct: 759 RFHRSLKAAL 768 >UniRef50_A5C4S0 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C4S0_VITVI Length = 1374 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF HFP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1156 VWGIDFMRHFPMSFGYSYILVGVDYVSKWVEAIPCKRNDHRVVIKFLKENIFSRFGVPKA 1215 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + ++ + PYHPQT G+++ +R +K +++ Sbjct: 1216 IISDGGTHFCN-----KPFETLLAKYEVKHKVATPYHPQTSGQVKLANREIKKVLMR 1267 >UniRef50_A5ACN5 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5ACN5_VITVI Length = 1390 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF FP G + L +D S+ + +++ R ++ ++F R+G+P Sbjct: 606 VWGIDFMXPFPMSFGHSYILVGVDYVSKXVEAIPCRSNDHRVVLKFLKDNIFARFGVPKA 665 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 666 IISDGGTHFCNK-----PFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 717 >UniRef50_UPI000192724E PREDICTED: similar to RETRotransposon-like family member (retr-1) n=3 Tax=Hydra magnipapillata RepID=UPI000192724E Length = 1235 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 5/70 (7%) Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 TV+ L SVF R G+P+ + DN + + DTT L WL ++G + PYHPQ+ G Sbjct: 985 TVKAVLQSVFSRNGVPEVLVSDNAAEFHDTT-----LHQWLKKVGCVPYKTPPYHPQSNG 1039 Query: 246 KLERFHRSLK 255 ER ++K Sbjct: 1040 AAERMVETVK 1049 >UniRef50_A0P341 Transposase n=5 Tax=Alphaproteobacteria RepID=A0P341_9RHOB Length = 197 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 41/154 (26%), Positives = 61/154 (39%), Gaps = 10/154 (6%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W MDF G + L ++D SRFS + R E V L G P Sbjct: 43 VWAMDFVHDQLATGRKIRVLKVVDTFSRFSPVVNPRFSYRGEDVVATLEQACRFVGYPKT 102 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKW 263 + +D GS + L+L + + + SRPY +E F+ + E L W Sbjct: 103 IRVDQGSEFISRD-----LDLLAYQRDVELDFSRPY-----AFIESFNGKFRTECLNAHW 152 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 F + + + WR Y+ RPH A+ P S Sbjct: 153 FLTFEDARSKMEEWRKDYSTVRPHIAIGNKPPIS 186 >UniRef50_A2EZM6 Integrase core domain containing protein n=19 Tax=Trichomonas vaginalis RepID=A2EZM6_TRIVA Length = 326 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 41/155 (26%), Positives = 68/155 (43%), Gaps = 14/155 (9%) Query: 136 FEHDAPNRLWQMDFKGHFPFGGGRCHPLT-LLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 +E PN +W +D HF + P+ ++DD SR+ L L ++ + ++ Sbjct: 140 YEAILPNTIWHVDI--HF-LKEPKTLPVYGIIDDKSRYLLALKLLRNKSSTETSKVAIAT 196 Query: 195 FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 ++YG P DNG + G +T L I++ S PY P+ GK+ER SL Sbjct: 197 VQKYGAPFCFWSDNGK---ENEGEFTKF---LSTYDIQIRKSAPYMPRQNGKIERLWPSL 250 Query: 255 KAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEA 289 + F + + + +R YN + PH A Sbjct: 251 DKNAPKSSEFES---INQELEEFRQKYN-DMPHGA 281 >UniRef50_UPI00015B47AA PREDICTED: similar to pol polyprotein n=1 Tax=Nasonia vitripennis RepID=UPI00015B47AA Length = 1193 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 13/111 (11%) Query: 153 FPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER----YGLPDRMTMDN 208 P G + LT++D SR + + D R ET+ + S FE YG P +T D Sbjct: 867 MPLVGDLRYCLTMIDRFSRGPVVVP-IADIRAETIAR---SFFEHWVAHYGTPITITTDQ 922 Query: 209 GSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 G+ + + + +L ++ I H+ PYHPQ G +ERFHR LKA ++ Sbjct: 923 GTQF--ESALFASLAQMIVSRRI---HTTPYHPQANGLIERFHRMLKAALM 968 >UniRef50_UPI000179C74F UPI000179C74F related cluster n=8 Tax=Bos taurus RepID=UPI000179C74F Length = 1431 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 11/136 (8%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN LWQMD F +T+ D S + A T E + V Q L + F GL Sbjct: 1182 PNALWQMDVTHISSFAKLSFVHVTV-DTFSHVIVATAR-TGEAVKDVIQHLFTCFSYMGL 1239 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DN + T + + + ++ I+ PY+PQ Q +ER H++LK ++ + Sbjct: 1240 PKALKTDNAPAY-----TSKSFQEFCLKFQIKHNTGIPYNPQGQAIVERAHQTLKTQIQK 1294 Query: 261 GKWFADSGELQRAFDH 276 K GE + + H Sbjct: 1295 LK----EGEFKYSSPH 1306 >UniRef50_A5BY78 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BY78_VITVI Length = 1947 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + T++ + ++ ++F R+ +P Sbjct: 1360 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEXIPCRTNDHKVVLKFLKENIFSRFXVPKA 1419 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + GI+ + PYHP T G++E +R +K +++ Sbjct: 1420 IIXDXGTHFCN-----KPFEALLAKYGIKHKVATPYHPXTSGQVELANREIKNILMK 1471 >UniRef50_A5AIU0 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5AIU0_VITVI Length = 1753 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L ++ S++ + ++ R ++ ++F R+G+P Sbjct: 1204 VWDIDFMGPFPMSFGNSYILVGVNYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKA 1263 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHP T G++E +R +K +++ Sbjct: 1264 IISDEGTHFCN-----KPFETLLAKYGVKHKVATPYHPXTSGQVELANREIKNILMK 1315 >UniRef50_A5BWY6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BWY6_VITVI Length = 1068 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 29/117 (24%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 530 VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKA 589 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ + PYHPQT ++E +R +K +++ Sbjct: 590 IISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSEQVELANREIKNILMK 641 >UniRef50_C7C200 Gag-Pol polyprotein n=2 Tax=Schistosoma japonicum RepID=C7C200_SCHJA Length = 1367 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 36/100 (36%), Positives = 48/100 (48%), Gaps = 6/100 (6%) Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 Q L +F R GLPD + DNGS + T + + + RLGI PYHPQ+ G+ E Sbjct: 1123 QILSEIFSRNGLPDMIVSDNGSQF-----TSSQFQEFCRRLGIIHYRFPPYHPQSNGQAE 1177 Query: 249 RFHRSLKAEVLQGKWFADSGE-LQRAFDHWRTVYNLERPH 287 RF + K +L+ K E LQ +RT N P Sbjct: 1178 RFVDTFKRALLKSKGEETPMESLQNFLFVYRTTPNDALPE 1217 >UniRef50_C2BWQ2 IS21 family transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BWQ2_9ACTO Length = 329 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 38/145 (26%), Positives = 63/145 (43%), Gaps = 4/145 (2%) Query: 169 HSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMR 228 +SR+++C+ T + +Q V G+P R DN S G + ++ Sbjct: 77 YSRYTVCVVIPTRTTADLLQGMWEGVQRFGGVPRRFIWDNESGIGRGNHLAAGVSGFMGV 136 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 LG + P P+++G +ERF+R + + G+ F +LQ D W N R H Sbjct: 137 LGATLKQLPPRDPESKGTIERFNRYAETSFMPGRHFLSPQDLQAQLDDWMVKAN-GRTHA 195 Query: 289 ALDMAVPGSRYQPSARQYSGNTTPP 313 L A+P YQ ++ + PP Sbjct: 196 TLH-AIPAEMYQEELTHFA--SLPP 217 >UniRef50_P03365 Integrase n=46 Tax=root RepID=POL_MMTVB Length = 899 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 42/138 (30%), Positives = 65/138 (47%), Gaps = 11/138 (7%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 P LWQMD FG + +T+ D +S F+ A T E + V Q L F G+ Sbjct: 638 PRVLWQMDVTHVSEFGKLKYVHVTV-DTYSHFTFATAR-TGEATKDVLQHLAQSFAYMGI 695 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV-- 258 P ++ DN + +++ +L R I PY+PQ Q +ER H+++KA++ Sbjct: 696 PQKIKTDNAPAYVSR-----SIQEFLARWKISHVTGIPYNPQGQAIVERTHQNIKAQLNK 750 Query: 259 LQ--GKWFADSGELQRAF 274 LQ GK++ L A Sbjct: 751 LQKAGKYYTPHHLLAHAL 768 >UniRef50_UPI00017615A3 PREDICTED: similar to Os07g0444200 n=2 Tax=Danio rerio RepID=UPI00017615A3 Length = 1901 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 11/119 (9%) Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT-DERRETVQQQL-VSVFE 196 +AP L +DF GG + L + D SRF+ A+ T D++ TV + L + F Sbjct: 1359 NAPMELLCIDFLVLEKSRGGFENVLVVTDHFSRFAQ--AYPTRDQKAVTVAKVLWKNFFC 1416 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRL-GIRVGHSRPYHPQTQGKLERFHRSL 254 R+G P R+ D G + +A+ L +L G+ H+ PYHPQ G ERF+R+L Sbjct: 1417 RFGFPARLHADQGRNF------ESAVVKELCKLIGVTKTHTTPYHPQGNGTTERFNRTL 1469 >UniRef50_D1QR90 ISPg5, transposase n=4 Tax=Bacteroidales RepID=D1QR90_9BACT Length = 329 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 67/299 (22%), Positives = 115/299 (38%), Gaps = 60/299 (20%) Query: 30 IRSLCRRFGISPATGYKW-----LQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + LC FG++ YK+ LQR AQE A + + Sbjct: 1 MEGLCTLFGVTKQAYYKYDENRVLQRVAQEKFA--------------------VSFINEI 40 Query: 85 HDRHERWGARKI----KRWLEDQG--------HTMPAFSTVHNLMARHGLLPGASPGIPA 132 ++ G K+ KR +D + + A+ L R ++ G+P Sbjct: 41 REQDPGIGGMKLWYMYKRRFQDNAPLGRDRFENIVDAYGLKVRLRIRKPRTTDSTHGLPV 100 Query: 133 TGRFEHD----APNRLWQMDFK--------GHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 + APNRLW D H+ F C+ +LD ++ C Sbjct: 101 FPNLIKEYIPLAPNRLWVSDITYITVWLDTEHYCF----CYLSLILDAYT--EEIAGWCV 154 Query: 181 DERRETVQ--QQLVSVFERYG--LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 + ET + L ER + + + + S G + +E+ L + GI++ + Sbjct: 155 GDTLETEYPVKALGVALERIKDIAKEEVKLIHHSDRGCQYASAKYVEI-LRQYGIKISMT 213 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P+ + ER + ++K E+L+GK FA++ ++ A YN ERPH ++DM P Sbjct: 214 ECGDPKDNAQAERINNTMKNELLKGKHFANTDQVIEAVRAAVAFYNEERPHMSIDMMTP 272 >UniRef50_A5BG34 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BG34_VITVI Length = 1654 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 31/120 (25%), Positives = 62/120 (51%), Gaps = 10/120 (8%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ +A ++ + V+ ++F R+G+P Sbjct: 939 IWGIDFMGLFPLSFGYSYILVGVDYVSKWVEAIACKHNDHKVVVKFLKENIFTRFGVPKA 998 Query: 204 MTMDNGSPWGDTTGTWTALELW---LMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + + +G GT +++ L R G++ + PYHPQT G++E + +K +++ Sbjct: 999 IIISDG-------GTHFCNKIFNNLLARYGVKHKVATPYHPQTSGQVELANCEIKNILME 1051 >UniRef50_A5C050 Putative uncharacterized protein n=32 Tax=Vitis vinifera RepID=A5C050_VITVI Length = 2064 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+P Sbjct: 1459 VWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGVPKA 1518 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + G++ PYHPQT G++E +R + +++ Sbjct: 1519 IISDGGTHFCN-----KPFETLLAKYGVKHKVVTPYHPQTSGQVELANREINNILMK 1570 >UniRef50_A5BFP8 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5BFP8_VITVI Length = 1563 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L ++ ++ + ++ R ++ ++F R+G+P Sbjct: 1006 VWGIDFMGPFPMSFGYSYILVGVNYVFKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKA 1065 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E+ L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1066 IISDGGTHFCNK-----PFEMLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1117 >UniRef50_Q9SHM3 F7F22.17 n=4 Tax=Arabidopsis thaliana RepID=Q9SHM3_ARATH Length = 1799 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 30/113 (26%), Positives = 57/113 (50%), Gaps = 5/113 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D S++ +A T++ + ++ +F R+G+P Sbjct: 1503 VWGIDFMGPFPSSYGNKYILVAVDYVSKWVEAIASPTNDAKVVLKLFKTIIFPRFGVPRV 1562 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 + D G + + E L + G++ + PY+PQT G++E +R +K Sbjct: 1563 VISDGGKHFIN-----KVFENLLKKHGVKHKVATPYNPQTSGQVEISNREIKT 1610 >UniRef50_UPI000179ECC6 UPI000179ECC6 related cluster n=2 Tax=Bos taurus RepID=UPI000179ECC6 Length = 801 Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 11/136 (8%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN LWQMD F +T+ D S + A T E + V Q L + F GL Sbjct: 576 PNALWQMDVTHISSFAKLSFVHVTV-DTFSHVIVATAR-TGEAVKDVIQHLFTCFSYMGL 633 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DN + T + + + ++ I+ PY+PQ Q +ER H++LK ++ + Sbjct: 634 PKALKTDNAPAY-----TSKSFQEFCLKFQIKHNTGIPYNPQGQAIVERAHQTLKTQIQK 688 Query: 261 GKWFADSGELQRAFDH 276 K GE + + H Sbjct: 689 LK----EGEFKYSSPH 700 >UniRef50_UPI000179E089 UPI000179E089 related cluster n=3 Tax=Bos taurus RepID=UPI000179E089 Length = 792 Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 11/135 (8%) Query: 122 LLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFS-LCLAHCT 180 L P P+T + + PN LWQMD FG +T+ FS + ++ Sbjct: 664 LCPNCPTFNPST-QLGVNPPNALWQMDVTHIAAFGKLSFVHVTM----DTFSHVIISSRL 718 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 DE V + L F + GLP+++ DN + T + + + + I PY+ Sbjct: 719 DEATRDVTEHLFQCFSQIGLPEQIKTDNAPAY-----TSSDFKRFCQQFSIVHSTEIPYN 773 Query: 241 PQTQGKLERFHRSLK 255 PQ Q +ER H++LK Sbjct: 774 PQDQAIVERVHQTLK 788 >UniRef50_B9K5X7 IS3 family transposase n=7 Tax=Proteobacteria RepID=B9K5X7_AGRVS Length = 359 Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 35/109 (32%), Positives = 48/109 (44%), Gaps = 5/109 (4%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N +W MDF G + LT++D SRFS + + E V Q L V + G P Sbjct: 205 NDVWAMDFVHDQLATGRKIRVLTVVDTFSRFSPAVDARFSYKGEDVVQTLERVCRQVGYP 264 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 + MDNGS + L+LW G+ + SRP P +E F Sbjct: 265 ATIRMDNGSEFISRN-----LDLWAYHRGVVLDFSRPGKPTDNSYIESF 308 >UniRef50_A5BWH5 Putative uncharacterized protein n=17 Tax=Vitis vinifera RepID=A5BWH5_VITVI Length = 2160 Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 59/117 (50%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W ++F G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1257 VWGINFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKA 1316 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + E L + ++ + PYHPQT G++E +R +K +++ Sbjct: 1317 IISDGGAHFCN-----KPFEALLSKYXVKHKVATPYHPQTSGQVELANREIKNTLMK 1368 >UniRef50_B1ZXZ1 Integrase catalytic region n=2 Tax=Verrucomicrobia RepID=B1ZXZ1_OPITP Length = 298 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 42/156 (26%), Positives = 60/156 (38%), Gaps = 5/156 (3%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN W +DF G LT+ DD++R L + T V + L + E G Sbjct: 108 PNERWSLDFVHDRLANGRSLRLLTVHDDYTRECLWIEADTSLSGPRVARVLDYLTELRGR 167 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG + ALE W + P P G +E F+ L+ E L Sbjct: 168 PGSLLTDNGPEFAG-----LALERWTHERQVNHRFITPGKPSQNGYIESFNGKLRDECLN 222 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 F + + +R YN +RPH +L P Sbjct: 223 ETEFLSVSHARDLLEAFREDYNHQRPHSSLHDLTPA 258 >UniRef50_A4G2L6 Transposase IS3 family, part 2 n=17 Tax=Bacteria RepID=A4G2L6_HERAR Length = 288 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 69/269 (25%), Positives = 103/269 (38%), Gaps = 16/269 (5%) Query: 34 CRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE---R 90 C G+S + Y WL R + + R HHS +S D R+ HD R Sbjct: 19 CDTLGVSRSGFYAWLTRTPCKRRTENEQLGRAVHHSFIQS-DRTYGARRVWHDLLASGYR 77 Query: 91 WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR-FEHDAPNRLWQMDF 149 G +++R + Q + A +L G P R F+ APNR W DF Sbjct: 78 CGLHRVERLM--QAQALRARPRRRSLPIDRGERPVIGIAANVLDRQFDASAPNRKWVADF 135 Query: 150 KGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT--MD 207 + G + +LD +SR + + + + V L+ R G P+ + D Sbjct: 136 T-YIWSAEGWLYLAVVLDLYSRRVIGWSMKPEMNAQLVADALMMAVWRRGKPESVMHHSD 194 Query: 208 NGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADS 267 GS + T + L+ LG+ SR + +E F SLK E L K F Sbjct: 195 RGSQY-----TSEQFQRLLLELGVTCSMSRAGNVWDNSAMESFFSSLKTERLSRKMFRTR 249 Query: 268 GELQ-RAFDHWRTVYNLERPHEALDMAVP 295 +++ FD+ YN R H L P Sbjct: 250 DDIRAEVFDYIERFYNPVRRHSTLGYISP 278 >UniRef50_UPI000180AEE9 PREDICTED: similar to pumilio 2 n=1 Tax=Ciona intestinalis RepID=UPI000180AEE9 Length = 570 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 11/125 (8%) Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 RF+H +D G P G LT++D +R+ + V+ + + Sbjct: 263 RFQH------VNIDIVGPLPPSQGYRFLLTIVDRFTRWPEAIPIADTMTITCVRAFIFNW 316 Query: 195 FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 RYG+P ++ D G + T+ W +GIR+ + +HPQ G +ERFHR L Sbjct: 317 IARYGIPSDISSDQGPQF--TSEFWKTFN---QMMGIRIHRTTAFHPQANGLVERFHRHL 371 Query: 255 KAEVL 259 K+ ++ Sbjct: 372 KSALM 376 >UniRef50_A5B7N0 Putative uncharacterized protein n=17 Tax=Vitis vinifera RepID=A5B7N0_VITVI Length = 2000 Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 29/114 (25%), Positives = 58/114 (50%), Gaps = 5/114 (4%) Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P + Sbjct: 1261 IDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKAIIS 1320 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1321 DGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTFGQVELANREIKNILMK 1369 >UniRef50_A5B346 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B346_VITVI Length = 916 Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 32/129 (24%), Positives = 62/129 (48%), Gaps = 6/129 (4%) Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 TG D N +W +DF G FP G + L +D S++ + ++ R ++ Sbjct: 167 VTGEIPIDLFN-VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLN 225 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++F R+ +P + D G+ + + E L + G++ + PYHPQT G++E + Sbjct: 226 ENIFSRFRVPKVIISDRGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELAN 280 Query: 252 RSLKAEVLQ 260 +K +++ Sbjct: 281 TEIKNILMK 289 >UniRef50_B7ZFS7 Polyprotein n=2 Tax=Eukaryota RepID=B7ZFS7_9METZ Length = 1425 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 5/74 (6%) Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 D+ T+ +LV +F YG+P+ + D G+ + T T L G++ H+ PYH Sbjct: 1111 DQTAATISTELVKLFCTYGIPEIIHSDQGANFESTIIGQT-----LEAFGVKKSHTTPYH 1165 Query: 241 PQTQGKLERFHRSL 254 P+ G +ERF+R+L Sbjct: 1166 PEGDGMVERFNRTL 1179 >UniRef50_C5PML6 Transposase OrfB n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PML6_9SPHI Length = 267 Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 38/154 (24%), Positives = 60/154 (38%), Gaps = 5/154 (3%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N W +DF G R L ++DD++R SL + Q++ + P Sbjct: 119 NETWSIDFMSDSLANGRRFRVLNVIDDYNRESLINEAFYSIPGGRLVQKIKELIIDRSTP 178 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 R+ DNG + T E GI + + +P P +ER +R+ + +VL Sbjct: 179 KRIRTDNGPEFLSKVFTDFCTEN-----GIELQYIQPGKPAQNAYIERLNRTFREDVLDA 233 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 F E+ W+ YN P AL+ P Sbjct: 234 YLFDSLTEVNAIAYEWQIDYNENHPDTALNGLSP 267 >UniRef50_UPI0001793640 PREDICTED: similar to SD02026p, partial n=1 Tax=Acyrthosiphon pisum RepID=UPI0001793640 Length = 776 Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 51/192 (26%), Positives = 83/192 (43%), Gaps = 18/192 (9%) Query: 105 HTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFG-GGRCHPL 163 H F + + R G PG IP R P MD G F GG + L Sbjct: 427 HIRSCFECLLTRVPR-GKRPGLLHSIPVGKR-----PFYTVHMDHVGPFVTAPGGFRYIL 480 Query: 164 TLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALE 223 L+D+ +++ + L TD R + + + YGLP R+ D G+ + T+GT+ E Sbjct: 481 VLVDNLTKY-VSLYAVTDTRTRPLINCVEQFVKEYGLPGRLITDRGTCY--TSGTF---E 534 Query: 224 LWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL-----QGKWFADSGELQRAFDHWR 278 + I++ + HPQ G++ER H + A ++ + +W E+QR ++ Sbjct: 535 QFCASQRIKLVWTSSRHPQANGQVERTHSVVMATLMTMGGAEDQWAELLPEVQRLLNNSE 594 Query: 279 TVYNLERPHEAL 290 T + P E L Sbjct: 595 TKVTGKTPFEML 606 >UniRef50_A5BPW1 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BPW1_VITVI Length = 1335 Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 28/117 (23%), Positives = 61/117 (52%), Gaps = 5/117 (4%) Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 +W +DF G FP G + L +D ++ + + ++ R ++ ++F R+G+P Sbjct: 644 VWGIDFMGPFPMSFGNSYILVGVDYVFKWVEPIPYKHNDHRVVLKFLKENIFLRFGVPKA 703 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 + D G+ + + L+ L + G++ + PYHPQT G+++ +R +K +++ Sbjct: 704 IISDGGTHFCNK-----PLDTLLAKYGVKHKVATPYHPQTSGQVKLANREIKNILMK 755 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37007 Uncharacterized protein yagA n=15 Tax=Bacteria R... 502 e-140 UniRef50_C6AZP5 Integrase catalytic region n=3 Tax=Rhizobium leg... 332 2e-89 UniRef50_B0T7X0 Integrase catalytic region n=15 Tax=Alphaproteob... 330 6e-89 UniRef50_Q07SS7 Integrase, catalytic region n=5 Tax=Alphaproteob... 330 8e-89 UniRef50_A9ER25 Transposase n=4 Tax=Sorangium cellulosum 'So ce ... 329 1e-88 UniRef50_B8GMF2 Integrase catalytic region n=2 Tax=Gammaproteoba... 320 8e-86 UniRef50_B4UMG9 Integrase catalytic region n=3 Tax=Bacteria RepI... 313 6e-84 UniRef50_C3MF40 Integrase catalytic core domain protein n=3 Tax=... 313 7e-84 UniRef50_A9BRN3 Integrase catalytic region n=7 Tax=Proteobacteri... 309 1e-82 UniRef50_C3K093 Putative integrase n=2 Tax=Pseudomonas fluoresce... 305 2e-81 UniRef50_A4WDP4 Integrase, catalytic region n=4 Tax=Enterobacter... 305 2e-81 UniRef50_D2LAL9 Integrase catalytic region n=1 Tax=Desulfovibrio... 302 1e-80 UniRef50_Q01QQ4 Integrase, catalytic region n=9 Tax=Bacteria Rep... 302 1e-80 UniRef50_D2ML31 Integrase, catalytic region (Fragment) n=1 Tax=C... 301 3e-80 UniRef50_UPI00017F3A47 integrase catalytic subunit n=1 Tax=Esche... 293 7e-78 UniRef50_UPI00019025E5 ISHne2, transposase n=1 Tax=Rhizobium etl... 278 2e-73 UniRef50_B6J0H9 Transposase n=3 Tax=Coxiella burnetii RepID=B6J0... 269 1e-70 UniRef50_Q5ZTP2 Transposase (ISmav2) n=14 Tax=Proteobacteria Rep... 264 5e-69 UniRef50_Q4JUH9 Transposase for IS3514b n=9 Tax=Corynebacterium ... 261 3e-68 UniRef50_Q82H05 Putative IS481 family ISMav2-like transposase n=... 260 5e-68 UniRef50_B1ZP18 Integrase catalytic region n=1 Tax=Opitutus terr... 255 2e-66 UniRef50_Q4JWW8 Transposase for IS3511a n=7 Tax=Corynebacterium ... 250 5e-65 UniRef50_A1T2L4 Integrase, catalytic region n=4 Tax=Actinomyceta... 250 6e-65 UniRef50_B2HR82 Transposase for ISMyma05 n=5 Tax=Mycobacterium R... 243 8e-63 UniRef50_UPI0001B45627 transposase for ISMyma05 n=1 Tax=Mycobact... 241 2e-62 UniRef50_UPI0001B540A0 transposase for IS3514a n=1 Tax=Streptomy... 241 4e-62 UniRef50_UPI0001B453C4 transposase for IS3514a n=1 Tax=Mycobacte... 240 6e-62 UniRef50_C8XFB0 Integrase catalytic region n=4 Tax=Actinomycetal... 236 1e-60 UniRef50_A3TPQ6 Transposase n=7 Tax=Actinomycetales RepID=A3TPQ6... 229 1e-58 UniRef50_Q3A8V0 ISChy3, transposase n=5 Tax=Clostridia RepID=Q3A... 224 3e-57 UniRef50_D1VRH7 Integrase catalytic region n=1 Tax=Frankia sp. E... 218 3e-55 UniRef50_A1UAJ3 Integrase, catalytic region n=7 Tax=Actinomyceta... 218 4e-55 UniRef50_A6DU50 Putative ISmav2-like transposase n=1 Tax=Lentisp... 213 1e-53 UniRef50_C5D6W5 Integrase catalytic region n=19 Tax=Firmicutes R... 209 1e-52 UniRef50_B0TDR5 Transposase, putative n=5 Tax=Firmicutes RepID=B... 207 5e-52 UniRef50_Q3SW20 Helix-turn-helix, Fis-type n=112 Tax=Bacteria Re... 207 5e-52 UniRef50_D1TPC8 Putative transposase integrase n=1 Tax=Burkholde... 206 9e-52 UniRef50_C7MHU2 Integrase family protein n=3 Tax=Actinomycetales... 205 3e-51 UniRef50_A3J543 Helix-turn-helix, Fis-type protein n=6 Tax=Bacte... 205 3e-51 UniRef50_B0NHH2 Putative uncharacterized protein (Fragment) n=2 ... 201 3e-50 UniRef50_C6PFD7 Integrase catalytic region n=3 Tax=Thermoanaerob... 201 5e-50 UniRef50_A7B7Y8 Putative uncharacterized protein n=4 Tax=Clostri... 198 3e-49 UniRef50_A0JV34 Integrase, catalytic region n=12 Tax=Actinomycet... 197 5e-49 UniRef50_B4RV10 Integrase, catalytic region n=16 Tax=Proteobacte... 197 7e-49 UniRef50_C7RJ38 Integrase catalytic region n=5 Tax=Proteobacteri... 196 1e-48 UniRef50_C4KRZ4 Integrase core domain protein n=59 Tax=Proteobac... 195 2e-48 UniRef50_A4J392 Integrase, catalytic region n=22 Tax=Clostridia ... 195 3e-48 UniRef50_B8J8P0 Integrase catalytic region n=1 Tax=Anaeromyxobac... 193 8e-48 UniRef50_C0QA21 Transposase /integrase family protein n=1 Tax=De... 192 2e-47 UniRef50_C1DTA7 Putative transposase n=2 Tax=Sulfurihydrogenibiu... 191 4e-47 UniRef50_B3PKR5 Transposase n=7 Tax=Gammaproteobacteria RepID=B3... 190 6e-47 UniRef50_B4S6V0 Integrase catalytic region n=10 Tax=Bacteria Rep... 190 8e-47 UniRef50_C1F9W4 ISAca1, transposase n=2 Tax=Acidobacterium capsu... 189 2e-46 UniRef50_C2GDW5 IS3514a transposase n=1 Tax=Corynebacterium gluc... 189 2e-46 UniRef50_A6V7Q6 Transposase n=92 Tax=Bacteria RepID=A6V7Q6_PSEA7 188 3e-46 UniRef50_B4RA95 Transposase, IS1477 n=35 Tax=Proteobacteria RepI... 187 7e-46 UniRef50_C8NDJ4 ISHne2 transposase (Fragment) n=1 Tax=Cardiobact... 186 1e-45 UniRef50_UPI0001C30E87 Integrase catalytic region n=1 Tax=Conexi... 185 2e-45 UniRef50_Q8NL32 Predicted transposase n=7 Tax=Corynebacterium Re... 185 2e-45 UniRef50_P24577 Insertion element IS407 uncharacterized 31.7 kDa... 185 3e-45 UniRef50_Q64B23 Transposase n=1 Tax=uncultured archaeon GZfos27G... 183 7e-45 UniRef50_C5CAM0 Transposase n=26 Tax=Actinomycetales RepID=C5CAM... 183 7e-45 UniRef50_C8PWM8 Transposase B n=2 Tax=Enhydrobacter aerosaccus S... 183 1e-44 UniRef50_P25438 Insertion element IS476 uncharacterized 39.2 kDa... 183 1e-44 UniRef50_O28862 ISA0963-5, putative transposase n=5 Tax=Archaeog... 183 1e-44 UniRef50_Q1GCB4 Integrase catalytic region n=29 Tax=Alphaproteob... 182 2e-44 UniRef50_A3JW74 Transposase n=12 Tax=Proteobacteria RepID=A3JW74... 182 2e-44 UniRef50_C4UEN4 Transposase n=1 Tax=Yersinia aldovae ATCC 35236 ... 181 3e-44 UniRef50_Q1DAH7 Transposase orfB, IS3 family n=29 Tax=Proteobact... 181 3e-44 UniRef50_A1JLT7 Transposase for insertion element IS1222 n=8 Tax... 181 4e-44 UniRef50_C1XPR1 Transcriptional regulator/sugar kinase n=14 Tax=... 181 5e-44 UniRef50_A3ZSC8 Transposase orfB n=1 Tax=Blastopirellula marina ... 181 5e-44 UniRef50_C6VW29 Integrase catalytic region n=2 Tax=Sphingobacter... 180 6e-44 UniRef50_B4E5J2 Transposase n=21 Tax=Proteobacteria RepID=B4E5J2... 180 7e-44 UniRef50_Q3BT31 Transposase n=22 Tax=Bacteria RepID=Q3BT31_XANC5 180 7e-44 UniRef50_A5GAT8 Integrase, catalytic region n=1 Tax=Geobacter ur... 179 1e-43 UniRef50_B1KCL6 Integrase catalytic region n=100 Tax=Proteobacte... 179 1e-43 UniRef50_A4JLW8 Integrase, catalytic region n=9 Tax=Proteobacter... 179 2e-43 UniRef50_UPI0001B511C3 integrase n=1 Tax=Streptomyces hygroscopi... 179 2e-43 UniRef50_C1A8I3 Putative transposase orfB for insertion sequence... 179 2e-43 UniRef50_C1F6R2 ISAca4, transposase orfB n=1 Tax=Acidobacterium ... 178 3e-43 UniRef50_Q1NW03 Integrase, catalytic region n=7 Tax=Proteobacter... 178 3e-43 UniRef50_C1F5F4 IS3 family transposase orfB n=1 Tax=Acidobacteri... 178 4e-43 UniRef50_B3E8B6 Integrase catalytic region n=1 Tax=Geobacter lov... 177 4e-43 UniRef50_Q2W8I9 Transposase and inactivated derivative n=89 Tax=... 177 5e-43 UniRef50_A6BYW2 Integrase, catalytic region n=1 Tax=Planctomyces... 177 7e-43 UniRef50_A4A249 Transposase orfB n=4 Tax=Planctomycetaceae RepID... 176 1e-42 UniRef50_C1F2E9 IS3 family transposase orfB n=1 Tax=Acidobacteri... 176 1e-42 UniRef50_A4TG41 Integrase, catalytic region n=32 Tax=Actinomycet... 176 1e-42 UniRef50_A1UD36 Integrase, catalytic region n=28 Tax=Actinomycet... 176 1e-42 UniRef50_A1R4J8 ISAau1, transposase orfB n=3 Tax=Actinomycetales... 176 2e-42 UniRef50_A8HUC5 Transposase n=2 Tax=Alphaproteobacteria RepID=A8... 176 2e-42 UniRef50_Q8XPL1 Isrso16-transposase orfb protein n=2 Tax=Ralston... 176 2e-42 UniRef50_C4URW7 Integrase n=14 Tax=Proteobacteria RepID=C4URW7_Y... 175 2e-42 UniRef50_A3PPM4 Integrase, catalytic region n=5 Tax=Rhodobactera... 175 3e-42 UniRef50_Q4FQT2 Transposase OrfB n=179 Tax=Bacteria RepID=Q4FQT2... 174 5e-42 UniRef50_C0WLI3 Transposase n=14 Tax=Corynebacterium RepID=C0WLI... 174 5e-42 UniRef50_C6BTX7 Integrase catalytic region n=88 Tax=Bacteria Rep... 174 5e-42 UniRef50_A0AXB8 Integrase, catalytic region n=27 Tax=Betaproteob... 173 9e-42 UniRef50_Q3A4V8 Transposase and inactivated derivatives n=9 Tax=... 173 1e-41 UniRef50_B8FAB2 Integrase catalytic region n=70 Tax=Bacteria Rep... 173 1e-41 UniRef50_C8X9D2 Integrase catalytic region n=6 Tax=Actinomycetal... 172 1e-41 UniRef50_B8ER74 Integrase catalytic region n=103 Tax=Bacteria Re... 170 7e-41 UniRef50_Q2CG00 Integrase, catalytic domain n=8 Tax=Rhodobactera... 170 9e-41 UniRef50_A5G4C5 Putative uncharacterized protein n=1 Tax=Geobact... 169 1e-40 UniRef50_A1WCB6 Integrase, catalytic region n=2 Tax=Burkholderia... 168 3e-40 UniRef50_A3YV04 Transposase n=3 Tax=Synechococcus sp. WH 5701 Re... 167 7e-40 UniRef50_A3DCZ2 Integrase, catalytic region n=10 Tax=Clostridium... 166 2e-39 UniRef50_Q1BK79 Integrase, catalytic region n=37 Tax=Proteobacte... 164 5e-39 UniRef50_B1K7U4 Integrase catalytic region n=7 Tax=Bacteria RepI... 164 6e-39 UniRef50_B1ZXZ1 Integrase catalytic region n=2 Tax=Verrucomicrob... 163 1e-38 UniRef50_A9G353 Putative transposase n=1 Tax=Sorangium cellulosu... 162 2e-38 UniRef50_Q8PGV8 ISxac4 transposase n=3 Tax=Xanthomonas axonopodi... 162 2e-38 UniRef50_D1K5D0 Transposase n=1 Tax=Bacteroides sp. 3_1_33FAA Re... 161 6e-38 UniRef50_A5D1X6 Transposase and inactivated derivatives n=2 Tax=... 160 7e-38 UniRef50_A1V109 A, transposase OrfB n=56 Tax=Proteobacteria RepI... 160 8e-38 UniRef50_A3PLB1 Integrase, catalytic region n=59 Tax=Proteobacte... 160 8e-38 UniRef50_Q12FI2 Integrase, catalytic region n=28 Tax=Proteobacte... 160 9e-38 UniRef50_A6Q4E4 Transposase n=2 Tax=Nitratiruptor sp. SB155-2 Re... 160 9e-38 UniRef50_D2MKS7 Transposase (Fragment) n=3 Tax=Candidatus Poriba... 160 1e-37 UniRef50_B5YKC0 Putative transposase n=3 Tax=Thermodesulfovibrio... 160 1e-37 UniRef50_C3LLF8 IS1627, transposase n=27 Tax=Bacillaceae RepID=C... 159 1e-37 UniRef50_A9LH60 Integrase n=1 Tax=uncultured planctomycete 13FN ... 159 2e-37 UniRef50_A5BPP5 Putative uncharacterized protein n=1 Tax=Vitis v... 159 2e-37 UniRef50_A8F1E6 Transposase and inactivated derivative n=159 Tax... 157 4e-37 UniRef50_B9NFA6 Predicted protein n=5 Tax=cellular organisms Rep... 157 4e-37 UniRef50_A5ASD2 Putative uncharacterized protein n=7 Tax=Vitis v... 157 4e-37 UniRef50_C7MBS5 Transposase n=2 Tax=Micrococcineae RepID=C7MBS5_... 157 4e-37 UniRef50_A5C4R5 Putative uncharacterized protein n=1 Tax=Vitis v... 157 8e-37 UniRef50_A4TG51 Integrase, catalytic region n=13 Tax=Actinomycet... 157 8e-37 UniRef50_A9B8L4 Integrase catalytic region n=5 Tax=Herpetosiphon... 157 8e-37 UniRef50_A1VJC3 Integrase, catalytic region n=25 Tax=Bacteria Re... 156 1e-36 UniRef50_C4FKG6 Integrase, catalytic region n=1 Tax=Sulfurihydro... 156 2e-36 UniRef50_C5PML6 Transposase OrfB n=1 Tax=Sphingobacterium spirit... 156 2e-36 UniRef50_A5CBG5 Putative uncharacterized protein n=4 Tax=Vitis v... 155 3e-36 UniRef50_B8KLM8 Integrase, catalytic region n=2 Tax=gamma proteo... 154 3e-36 UniRef50_B0UC72 Integrase catalytic region n=4 Tax=Alphaproteoba... 154 4e-36 UniRef50_A5C1P8 Putative uncharacterized protein n=4 Tax=Vitis v... 154 4e-36 UniRef50_Q39TE2 Putative uncharacterized protein n=2 Tax=Geobact... 154 6e-36 UniRef50_B0SFL5 Transposase n=2 Tax=Leptospira biflexa serovar P... 153 1e-35 UniRef50_A6VYF3 Integrase catalytic region n=14 Tax=Bacteria Rep... 152 1e-35 UniRef50_C1XUW8 Transposase n=2 Tax=Meiothermus silvanus DSM 994... 152 2e-35 UniRef50_A5BJN2 Putative uncharacterized protein n=5 Tax=Vitis v... 151 4e-35 UniRef50_A5BI07 Putative uncharacterized protein n=7 Tax=Vitis v... 151 4e-35 UniRef50_A0L7D7 Integrase, catalytic region n=5 Tax=Bacteria Rep... 151 5e-35 UniRef50_A9B8J0 Integrase catalytic region n=3 Tax=Herpetosiphon... 151 6e-35 UniRef50_A5AQ03 Putative uncharacterized protein n=5 Tax=Vitis v... 151 6e-35 UniRef50_Q1N8F6 Transposase n=2 Tax=Sphingomonas RepID=Q1N8F6_9SPHN 150 7e-35 UniRef50_UPI0001B416F4 ISA0963-5 transposase n=6 Tax=Ferroplasma... 149 1e-34 UniRef50_A5C2R0 Putative uncharacterized protein n=10 Tax=Vitis ... 149 1e-34 UniRef50_A7VEZ2 Putative uncharacterized protein n=3 Tax=Bacteri... 149 1e-34 UniRef50_UPI00005104D7 transposase n=1 Tax=Brevibacterium linens... 149 2e-34 UniRef50_A5B2X9 Putative uncharacterized protein n=4 Tax=Vitis v... 149 2e-34 UniRef50_A8LT45 Integrase n=5 Tax=Bacteria RepID=A8LT45_DINSH 148 3e-34 UniRef50_A4SIH8 IS3-family transposase n=42 Tax=Proteobacteria R... 148 3e-34 UniRef50_A5BTM1 Putative uncharacterized protein n=31 Tax=Vitis ... 148 3e-34 UniRef50_A5CA04 Putative uncharacterized protein n=3 Tax=Vitis v... 147 5e-34 UniRef50_A5AWA7 Putative uncharacterized protein n=6 Tax=Vitis v... 147 6e-34 UniRef50_Q2Y8D0 Integrase, catalytic region n=1 Tax=Nitrosospira... 146 9e-34 UniRef50_A5C046 Putative uncharacterized protein n=3 Tax=Vitis v... 146 9e-34 UniRef50_A5AKZ0 Putative uncharacterized protein n=18 Tax=Vitis ... 146 1e-33 UniRef50_A5BFN4 Putative uncharacterized protein n=2 Tax=Vitis v... 146 1e-33 UniRef50_C4V4D7 Transposase n=5 Tax=Clostridiales RepID=C4V4D7_9... 146 2e-33 UniRef50_A5B9R1 Putative uncharacterized protein n=10 Tax=Vitis ... 146 2e-33 UniRef50_Q0P7I8 IS1400 transposase B n=231 Tax=Bacteria RepID=Q0... 145 2e-33 UniRef50_A3QMY0 Transposase n=37 Tax=Bacilli RepID=A3QMY0_ENTFC 145 2e-33 UniRef50_A5B5G8 Putative uncharacterized protein n=2 Tax=Vitis v... 144 4e-33 UniRef50_A4Z1R9 Putative transposase, probably encoded by an uni... 144 4e-33 UniRef50_A5AMG6 Putative uncharacterized protein n=3 Tax=Vitis v... 144 4e-33 UniRef50_A5BKJ4 Putative uncharacterized protein n=2 Tax=Vitis v... 144 5e-33 UniRef50_A5BWY6 Putative uncharacterized protein n=3 Tax=Vitis v... 144 5e-33 UniRef50_C5CE17 Integrase catalytic region n=3 Tax=Kosmotoga ole... 144 7e-33 UniRef50_A5BYC4 Putative uncharacterized protein n=5 Tax=Vitis v... 143 9e-33 UniRef50_D1PR45 Transposase n=1 Tax=Subdoligranulum variabile DS... 143 1e-32 UniRef50_A5BFS9 Putative uncharacterized protein n=9 Tax=Vitis v... 143 1e-32 UniRef50_A5BXG2 Putative uncharacterized protein n=1 Tax=Vitis v... 143 1e-32 UniRef50_A5B5S6 Putative uncharacterized protein n=2 Tax=Vitis v... 142 1e-32 UniRef50_A5BI69 Putative uncharacterized protein n=1 Tax=Vitis v... 142 2e-32 UniRef50_A4G2L6 Transposase IS3 family, part 2 n=17 Tax=Bacteria... 142 2e-32 UniRef50_A5AKV0 Putative uncharacterized protein n=16 Tax=Vitis ... 142 2e-32 UniRef50_A5BWF3 Putative uncharacterized protein n=9 Tax=Vitis v... 142 2e-32 UniRef50_D2U9E3 Putative integrase protein n=1 Tax=Xanthomonas a... 142 2e-32 UniRef50_UPI0001986237 PREDICTED: hypothetical protein n=1 Tax=V... 142 2e-32 UniRef50_A5BT93 Putative uncharacterized protein n=1 Tax=Vitis v... 141 3e-32 UniRef50_A5AH70 Putative uncharacterized protein n=20 Tax=Vitis ... 141 4e-32 UniRef50_A5BYU9 Putative uncharacterized protein n=8 Tax=Vitis v... 141 6e-32 UniRef50_Q8GAC2 Putative transposase n=2 Tax=Micrococcineae RepI... 141 6e-32 UniRef50_A5BJP7 Putative uncharacterized protein n=6 Tax=Vitis v... 140 8e-32 UniRef50_A5B346 Putative uncharacterized protein n=2 Tax=Vitis v... 140 8e-32 UniRef50_A5AH69 Putative uncharacterized protein n=3 Tax=Vitis v... 140 9e-32 UniRef50_A5AWI1 Putative uncharacterized protein n=2 Tax=Vitis v... 139 1e-31 UniRef50_A5BQ80 Putative uncharacterized protein n=1 Tax=Vitis v... 139 1e-31 UniRef50_UPI000038392B COG2801: Transposase and inactivated deri... 139 2e-31 UniRef50_A5CA05 Putative uncharacterized protein n=2 Tax=Vitis v... 139 2e-31 UniRef50_A5ASA6 Putative uncharacterized protein n=2 Tax=Vitis v... 139 2e-31 UniRef50_A5BAC6 Putative uncharacterized protein n=1 Tax=Vitis v... 139 2e-31 UniRef50_A5AHC2 Putative uncharacterized protein n=3 Tax=Vitis v... 138 3e-31 UniRef50_Q0ZCC0 Gag protein n=2 Tax=Populus trichocarpa RepID=Q0... 138 3e-31 UniRef50_A5AWF5 Putative uncharacterized protein n=16 Tax=Vitis ... 138 3e-31 UniRef50_A5AY91 Putative uncharacterized protein n=1 Tax=Vitis v... 138 4e-31 UniRef50_A5AQQ3 Putative uncharacterized protein n=1 Tax=Vitis v... 138 4e-31 UniRef50_B7WRK9 Integrase catalytic region n=2 Tax=Comamonas tes... 138 4e-31 UniRef50_A5ACN5 Putative uncharacterized protein n=7 Tax=Vitis v... 137 5e-31 UniRef50_A5B7N0 Putative uncharacterized protein n=17 Tax=Vitis ... 137 5e-31 UniRef50_A0P341 Transposase n=5 Tax=Alphaproteobacteria RepID=A0... 137 6e-31 UniRef50_A5C050 Putative uncharacterized protein n=32 Tax=Vitis ... 137 7e-31 UniRef50_A5AIU0 Putative uncharacterized protein n=2 Tax=Vitis v... 137 7e-31 UniRef50_Q0ZCB7 Integrase n=4 Tax=Eukaryota RepID=Q0ZCB7_POPTR 137 7e-31 UniRef50_A5AFC7 Putative uncharacterized protein n=8 Tax=Vitis v... 137 7e-31 UniRef50_C7NJB3 Integrase family protein n=3 Tax=Actinomycetales... 136 1e-30 UniRef50_A5AYI6 Putative uncharacterized protein n=7 Tax=Vitis v... 136 1e-30 UniRef50_A5CBC9 Putative uncharacterized protein n=1 Tax=Vitis v... 136 2e-30 UniRef50_A5BWH5 Putative uncharacterized protein n=17 Tax=Vitis ... 135 2e-30 UniRef50_A5AYT6 Putative uncharacterized protein n=3 Tax=Vitis v... 135 2e-30 UniRef50_A5BNX4 Putative uncharacterized protein n=2 Tax=Vitis v... 135 3e-30 UniRef50_A5BIQ2 Putative uncharacterized protein n=1 Tax=Vitis v... 135 3e-30 Sequences not found previously or not previously below threshold: UniRef50_C4YYX7 Integrase catalytic region n=2 Tax=Rickettsia en... 165 2e-39 UniRef50_Q2P621 ISXoo3 transposase n=194 Tax=Proteobacteria RepI... 165 3e-39 UniRef50_B3EBT2 Integrase catalytic region n=87 Tax=Bacteria Rep... 163 9e-39 UniRef50_B7AC35 Integrase catalytic region n=11 Tax=Bacteria Rep... 159 2e-37 UniRef50_A1UPS7 Integrase, catalytic region n=17 Tax=Bacteria Re... 154 6e-36 UniRef50_A1VBQ7 Integrase, catalytic region n=9 Tax=Proteobacter... 149 2e-34 UniRef50_Q46NI5 Integrase, catalytic region n=26 Tax=Proteobacte... 149 2e-34 UniRef50_C6W069 Integrase catalytic region n=3 Tax=Dyadobacter f... 148 4e-34 UniRef50_Q24VK2 Putative uncharacterized protein n=3 Tax=Clostri... 148 4e-34 UniRef50_C8WWR8 Integrase catalytic region n=1 Tax=Alicyclobacil... 147 5e-34 UniRef50_B5CNC7 Putative uncharacterized protein n=5 Tax=Clostri... 146 1e-33 UniRef50_A9VK05 Integrase catalytic region n=17 Tax=Bacillaceae ... 143 9e-33 UniRef50_C8VYH9 Transposase IS3/IS911 family protein n=11 Tax=Ba... 142 3e-32 UniRef50_C5CF68 Integrase catalytic region n=6 Tax=Thermotogacea... 142 3e-32 UniRef50_A8F1V3 Transposase and inactivated derivative n=4 Tax=B... 142 3e-32 UniRef50_C6N0S2 Putative uncharacterized protein n=1 Tax=Legione... 141 4e-32 UniRef50_B7GET7 Transposase n=6 Tax=Bacillales RepID=B7GET7_ANOFW 140 8e-32 UniRef50_C7I620 Integrase catalytic region n=4 Tax=Proteobacteri... 140 1e-31 UniRef50_UPI0001725BBF transposase n=1 Tax=Micrococcus luteus NC... 139 1e-31 UniRef50_A7VUR6 Putative uncharacterized protein n=3 Tax=Clostri... 139 1e-31 UniRef50_A5AMM4 Putative uncharacterized protein n=14 Tax=Vitis ... 139 1e-31 UniRef50_C1F0V3 IS3 family transposase orfB n=1 Tax=Acidobacteri... 139 2e-31 UniRef50_A5BSN7 Putative uncharacterized protein n=16 Tax=Vitis ... 139 2e-31 UniRef50_Q1LNW1 Integrase, catalytic region n=42 Tax=Bacteria Re... 139 2e-31 UniRef50_B8FYC8 Transposase IS3/IS911 family protein n=6 Tax=Clo... 139 2e-31 UniRef50_Q9ANU8 OrfB (Fragment) n=8 Tax=Bacteria RepID=Q9ANU8_RUMGN 137 4e-31 UniRef50_A5BJ10 Putative uncharacterized protein n=3 Tax=Vitis v... 137 5e-31 UniRef50_B3PDG9 IS3 family transposase, orfB n=3 Tax=Gammaproteo... 137 9e-31 UniRef50_A5APG9 Putative uncharacterized protein n=11 Tax=Vitis ... 136 1e-30 UniRef50_Q2YZQ9 Transposase n=3 Tax=Bacteria RepID=Q2YZQ9_9DELT 136 1e-30 UniRef50_A9B827 Integrase catalytic region n=1 Tax=Herpetosiphon... 136 1e-30 UniRef50_Q43917 ORF2 gene product (Fragment) n=15 Tax=cellular o... 136 2e-30 UniRef50_B0MP11 Putative uncharacterized protein n=1 Tax=Eubacte... 135 3e-30 UniRef50_A8ZKJ8 Integrase, catalytic region n=10 Tax=Bacteria Re... 135 3e-30 >UniRef50_P37007 Uncharacterized protein yagA n=15 Tax=Bacteria RepID=YAGA_ECOLI Length = 384 Score = 502 bits (1292), Expect = e-140, Method: Composition-based stats. Identities = 384/384 (100%), Positives = 384/384 (100%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ Sbjct: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH Sbjct: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 Query: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT Sbjct: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH Sbjct: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ Sbjct: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE Sbjct: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 Query: 361 VWWYSTKVGVIDLKKKSITMGKGC 384 VWWYSTKVGVIDLKKKSITMGKGC Sbjct: 361 VWWYSTKVGVIDLKKKSITMGKGC 384 >UniRef50_C6AZP5 Integrase catalytic region n=3 Tax=Rhizobium leguminosarum bv. trifolii WSM1325 RepID=C6AZP5_RHILS Length = 402 Score = 332 bits (850), Expect = 2e-89, Method: Composition-based stats. Identities = 125/383 (32%), Positives = 183/383 (47%), Gaps = 10/383 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M R FV + +LC +GIS TGYKWL+R+ G AGL D PR Sbjct: 1 MVWRETGIMDERLRFVGECLAGEETMTALCAAYGISRKTGYKWLERYRALGPAGLIDLPR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGL 122 P ++ ++ A + + + +WG +K+ L+ PA ST+ ++ RHGL Sbjct: 61 APLEHGRATAAELVARIVAEKEANPQWGPKKVLARLKRSAPQLCWPAASTIGEILKRHGL 120 Query: 123 LPGASPGIPATGR---FEHDAPNRLWQMDFKGHFPFG-GGRCHPLTLLDDHSRFSLCLAH 178 + A G + PN +W D+KG F G RC PLT++D SRF L L Sbjct: 121 VGRRRHRWRAAGCGPFAPANGPNAVWSADYKGWFRTRDGRRCEPLTVMDTASRFLLALEA 180 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGIRVGHSR 237 C +F +GLP+R DNGSP+ T T L + ++LGI + + Sbjct: 181 CATPAEVEAWPVFERLFAEHGLPERFRSDNGSPFAAIGVTGLTTLAVRFIKLGIGLERIQ 240 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P PQ G+ ERFH ++ + + D Q FD +R YN ERPHEAL M VP Sbjct: 241 PGKPQQNGRHERFHLTMLPLAMAPEV--DHAAQQAVFDAFRQNYNAERPHEALAMDVPAD 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 Y+PS R+ P+Y VR+V +G++ G + A GE V ++E E G Sbjct: 299 HYRPSLRRLPDRLPEPDYPAEAAVRRVRSNGEIKWNGDLVYVAAALAGEVVAIEE-SEAG 357 Query: 358 SYEVWWYSTKVGVIDLKKKSITM 380 + + +++ +G+ID K K + Sbjct: 358 IWTLRFHAHPLGIIDKKTKRLVR 380 >UniRef50_B0T7X0 Integrase catalytic region n=15 Tax=Alphaproteobacteria RepID=B0T7X0_CAUSK Length = 400 Score = 330 bits (846), Expect = 6e-89, Method: Composition-based stats. Identities = 180/354 (50%), Positives = 223/354 (62%), Gaps = 5/354 (1%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R EFV A +GAN R LCRRFGISP GYKWL R ++ G L DR R Sbjct: 1 MPWREVSVMEQRREFVRLARLEGANRRELCRRFGISPEVGYKWLAR-SKAGDEALADRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP 124 PH+SP RS+ +I A + D H WGARKI WLED+G PA ST+H ++ RHG + Sbjct: 60 RPHNSPWRSAAEIEAAVLAVRDAHPAWGARKIGAWLEDRGVDPPAVSTIHAILRRHGRID 119 Query: 125 GASPGI-PATGRFEHDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLAHCTDE 182 A RFE PN+LWQMDFKG F G CHPLT++DDHSR S CL C D+ Sbjct: 120 DFPTSPGKAWRRFEKAEPNQLWQMDFKGWFRLSSGQPCHPLTIVDDHSRLSPCLKACADQ 179 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTG-TWTALELWLMRLGIRVGHSRPYHP 241 + +TV+ L + F RYGLP +DNG PWG+ +G WT LE+WL++LG+ V HSRPYHP Sbjct: 180 QGQTVRPHLEAAFRRYGLPLAFFVDNGPPWGEPSGERWTRLEVWLLKLGVDVLHSRPYHP 239 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 Q++GK+ERFHRSL AEVL + F ++QRAFD WR VYN ERPHEALD+ P +RYQP Sbjct: 240 QSRGKIERFHRSLAAEVLDLQRFDSFAQVQRAFDRWREVYNFERPHEALDLDCPANRYQP 299 Query: 302 SARQYSGNTTPPEYDEGVMVRKVD-ISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 S R + P YD G ++R V + KG KAF+GER+ L+ + Sbjct: 300 SPRAMPDHPPEPRYDSGEILRTVSTTKAYVRFKGRLWRVPKAFQGERLALRPPE 353 >UniRef50_Q07SS7 Integrase, catalytic region n=5 Tax=Alphaproteobacteria RepID=Q07SS7_RHOP5 Length = 582 Score = 330 bits (845), Expect = 8e-89, Method: Composition-based stats. Identities = 123/389 (31%), Positives = 176/389 (45%), Gaps = 11/389 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W + R FV+ + +CRRFG+S TGYKWL+R+ EG AGL DR R Sbjct: 1 MGWMETRVVDERMRFVMAVADHEEAFAVVCRRFGVSRRTGYKWLERYDAEGVAGLMDRSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGL 122 PH P + + H WG KI+ WL ++ PA ST+ L+ R GL Sbjct: 61 APHSHPQAIAAPLAERCLAVRRAHPTWGPVKIRHWLAERDGATEWPAPSTIGALLDREGL 120 Query: 123 LPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLA 177 + N +W MDFKG F G G C PLTL D +SR+ L Sbjct: 121 TVKRRLRRRSPPSSVPFGHCGGANDIWCMDFKGWFLTGDGSCCEPLTLSDAYSRYLLRCQ 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGHS 236 V L + F +GLP R+ DNG P+ G + L + +++ G+ Sbjct: 181 ALARTDTAHVWPVLEAAFREFGLPHRLRSDNGPPFASCGAGGLSRLAVQVIKAGVVPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P PQ G+LER H +LK + +L+R ++ +YN ERPH+AL P Sbjct: 241 APGKPQQNGRLERLHLTLKQDTAMPPAQTLPEQLKR-LRAFQRLYNEERPHQALGNDTPS 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 Y S R++ G P+Y VR+V +G + G + +A GE +GL E Q + Sbjct: 300 QHYARSPRRFDGCLRAPDYGPDQTVRRVRSNGAIKWGGNEIYINEALAGEPIGLTE-QPN 358 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMG-KGC 384 GS+ + +GVI + + +GC Sbjct: 359 GSFAASYGPIVLGVIAHRGNQLRKAKRGC 387 >UniRef50_A9ER25 Transposase n=4 Tax=Sorangium cellulosum 'So ce 56' RepID=A9ER25_SORC5 Length = 387 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 128/387 (33%), Positives = 189/387 (48%), Gaps = 11/387 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW ++ R F+ ++ LCRRFGIS TGYKW++R+ Q G +GL++R Sbjct: 1 MPWKETCSVDERLRFIAQVNESDETFAELCRRFGISRKTGYKWVERYEQAGPSGLEERRP 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLL 123 + H P+ + + L WG +K++ LE G +PA ST+ L+ +HGL+ Sbjct: 61 VAHTFPHATPTVLVDALIELRKERPTWGPKKLRARLESLGLEGLPAASTIGELLKKHGLI 120 Query: 124 PGASPGIP------ATGRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCL 176 + + + PN W DFKGHF G RCHPLTL D SR+ L Sbjct: 121 RPRRRRVVTPTTAMPSPLAPAEQPNDTWCADFKGHFALGDRTRCHPLTLTDQASRYLLKC 180 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGH 235 +V+ F +GLP R+ DNG P+ G +AL + ++LGI Sbjct: 181 EGVAKPHEASVRPHFERAFREFGLPHRIRSDNGPPFATIGIGGLSALSVSWIKLGIHPER 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P PQ G+ ER H++LKAE A+ QR FD +R YN +RPHEAL P Sbjct: 241 IEPGKPQQNGRHERMHKTLKAEATSPPE-ANLAAQQRVFDRFRHEYNDQRPHEALGQRTP 299 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 SRY PS R + PEY + + VR++D G++ G + GE VGL + + Sbjct: 300 ASRYTPSRRSMPSKPSSPEYPDTMAVRRLDEQGRMLFGGAQTNVSTLLAGEPVGLTPIAD 359 Query: 356 DGSYEVWWYSTKVGVIDLKKKSITMGK 382 D +E+++ + + LK K + + + Sbjct: 360 D-VWELYYGPVLLAQVTLKNKELKLAR 385 >UniRef50_B8GMF2 Integrase catalytic region n=2 Tax=Gammaproteobacteria RepID=B8GMF2_THISH Length = 391 Score = 320 bits (819), Expect = 8e-86, Method: Composition-based stats. Identities = 129/387 (33%), Positives = 181/387 (46%), Gaps = 11/387 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R +F+ + +LCR FGIS TG KW++R A G GL++ R Sbjct: 1 MPWKETCAMDQRVQFIGAWLSGRYSKSALCRHFGISRPTGDKWIRRHALVGVDGLKESSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGH--TMPAFSTVHNLMARHGL 122 PH+ PNR S+ + + A H+ WG +K+ WL + PA ST ++ R GL Sbjct: 61 APHNQPNRISEALCERIVQAKLAHQDWGPKKVLDWLRAREPEVVWPADSTGGEILRRAGL 120 Query: 123 LPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPF-GGGRCHPLTLLDDHSRFSLCLA 177 + + N +W +DFKG + G RC+PLTL D SR+ L Sbjct: 121 VKPRRRRRVVPPHEAPFADCEQSNAVWAVDFKGDYRLGEGRRCYPLTLSDSFSRYLLLCR 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHS 236 V F YGLP + DNG+P+ G +AL W + LGI Sbjct: 181 GLARPSGAAVHPWFEWAFREYGLPQAIRSDNGAPFASRAVGGLSALSKWWIDLGIHPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP P G+ ER HRSLK G QR + +R YN ER HEAL PG Sbjct: 241 RPGRPDQNGRHERMHRSLKG--WLGTPAQGLEAEQRRLEAFRAEYNWERSHEALSRRTPG 298 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 S Y S R Y PP+YD+GV VR+V +G++ +G + + GE V L+ D Sbjct: 299 SLYAASPRPYPPCIEPPDYDQGVEVRRVRNNGEIKWRGRLIYLSEVLIGEPVALEPAG-D 357 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGKG 383 G +E+ + +G+++ + IT +G Sbjct: 358 GLWELRYRFHPLGLLNEQNDRITPARG 384 >UniRef50_B4UMG9 Integrase catalytic region n=3 Tax=Bacteria RepID=B4UMG9_ANASK Length = 407 Score = 313 bits (802), Expect = 6e-84, Method: Composition-based stats. Identities = 125/378 (33%), Positives = 185/378 (48%), Gaps = 13/378 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW MS + EFV A GAN+ +LCR FGIS T +KWL+R+ +G GL ++ R Sbjct: 1 MPWKELRPMSQKLEFVEKAIVPGANVSALCRDFGISRQTAHKWLRRYRDQGYLGLVEKSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ-GHTMPAFSTVHNLMARHGLL 123 P SP +++D+ + +H WG +KI L + G P+ +TV ++ R G + Sbjct: 61 RPASSPLATAEDVVVSIIELRSKHASWGPQKIAGVLARRLGPEAPSPTTVARVLRRLGKV 120 Query: 124 PGASPGIP-----ATGRFEHDAPNRLWQMDFKGHFP-FGGGRCHPLTLLDDHSRFSLCLA 177 P R E A N LW +DFKG + G +C PLT+ D SR L +A Sbjct: 121 KRRRPAARIWSVDGRPRIEVKASNDLWTIDFKGWWRALNGDKCEPLTVRDAFSRRVLAVA 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW--GDTTGTWTALELWLMRLGIRVGH 235 V++ L +F ++GLP + DNGSP+ + G T L WL+ LGIR+ Sbjct: 181 LVPATTAAHVRRVLELLFRKHGLPSAIQSDNGSPFICSRSRGGLTVLSAWLVSLGIRIVR 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SRP HPQ G ER HR L LQ QR D W +N RPH+AL P Sbjct: 241 SRPGHPQDNGGHERMHRDLSE--LQLSPARSRRAQQRQCDRWMLDFNHVRPHDALGGKTP 298 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y+ S R+ S + P Y + R+ + +G + + G + A + +GL++ E Sbjct: 299 AELYRNSTRR-SLSPLLPTYPPEWLTRRANKAGYVRINGDQVFVATALARQLIGLRQESE 357 Query: 356 DGSYEVWWYSTKVGVIDL 373 + ++ +G+I++ Sbjct: 358 -LRWSARFFDVDLGMIEI 374 >UniRef50_C3MF40 Integrase catalytic core domain protein n=3 Tax=Rhizobium sp. NGR234 RepID=C3MF40_RHISN Length = 400 Score = 313 bits (802), Expect = 7e-84, Method: Composition-based stats. Identities = 126/383 (32%), Positives = 177/383 (46%), Gaps = 10/383 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M R +FV + LC +GIS TGYKWL+R+ G AGL+D PR Sbjct: 1 MVWRETGIMEERLKFVAACLSGEETMAGLCALYGISRKTGYKWLRRFQLRGPAGLEDLPR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 P + ++ ++ + + H WG +KI L Q P+ ST ++ RHGL Sbjct: 61 APLNHGRATAAELVERIVAEKEAHPLWGPKKIVARLARQDPATAWPSASTAGAILNRHGL 120 Query: 123 LPGASPGIPATGR---FEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAH 178 + G E PN +W D KG F G RC PLT++D SR+ L L Sbjct: 121 VGRRRARWKGAGNGPWPEPAMPNAVWTGDHKGWFTTRDGWRCEPLTVMDVKSRYLLALEA 180 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGIRVGHSR 237 E +F+ +GLPDR+ DNG P+ T T L L +RLGI + Sbjct: 181 TGSTGDEEAWPVFERLFDEHGLPDRIRTDNGPPFAAAGVTGLTPLSLRFVRLGITLERIA 240 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P PQ GK ERFH ++ L AD AF+ +R YN ERPHE L M P Sbjct: 241 PGKPQQNGKHERFHLTML--PLAKAPAADRAAQAEAFEAFRREYNEERPHETLGMDTPAE 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 Y+ S R+ + P+Y VRKV +G + +G + GE V ++E E G Sbjct: 299 HYRASTRKMPVSPPEPDYPAEAAVRKVRHNGAVKWQGAEIYVSATLVGEVVAIEE-TESG 357 Query: 358 SYEVWWYSTKVGVIDLKKKSITM 380 + + +Y+ ++G ID K+ + Sbjct: 358 EWAMRFYAHRLGFIDEKRGRLVR 380 >UniRef50_A9BRN3 Integrase catalytic region n=7 Tax=Proteobacteria RepID=A9BRN3_DELAS Length = 395 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 113/378 (29%), Positives = 182/378 (48%), Gaps = 11/378 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M + F+ + GA + LCRR+GIS T YKW++R+ Q G GLQ+R R Sbjct: 1 MPWKECAPMDEKLLFIADHLRGGAPLSELCRRYGISRKTAYKWVERYRQLGMDGLQERSR 60 Query: 65 IPHHSPNRSS-DDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHG 121 PH + S A++ + + + G +K+ L + P+ +T++N++ G Sbjct: 61 RPHGNNQAISYAQRRAIIELRTQQRSQMGPKKLHALLLQRWGPQETPSKTTIYNVLKAEG 120 Query: 122 LLPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCL 176 L+ + + PN +W D+KG F G C+PLT++D SR+ L + Sbjct: 121 LVCSRRVRRRSVPTAQPLRTSKQPNGVWSADYKGQFKTADGHWCYPLTIMDHASRYLLAV 180 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGH 235 E ++ VF +YGLP+R+ DNG P+ T + L +W +RLGIR Sbjct: 181 HVYDSPNYEDAKRSFEQVFRQYGLPERIRSDNGPPFATTGVAGLSRLAIWWIRLGIRPER 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 PQ G+ ER HR+LK L + AD LQ D + YN +RPHEAL ++P Sbjct: 241 IERGKPQQNGRHERMHRTLK-HALGKEPAADKAALQMQLDAFVEHYNQQRPHEALQQSMP 299 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y SAR Y +Y + +V +G + + + + G G+ +G++E+ Sbjct: 300 AQHYSDSARPYPSKLPELQYPKHWERVRVSHNGLIYWRALRVYIGYLLAGQWIGMQEVAA 359 Query: 356 DGSYEVWWYSTKVGVIDL 373 G ++V+ ++G + Sbjct: 360 -GQWDVYLGPVRLGCFNE 376 >UniRef50_C3K093 Putative integrase n=2 Tax=Pseudomonas fluorescens SBW25 RepID=C3K093_PSEFS Length = 382 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 118/375 (31%), Positives = 172/375 (45%), Gaps = 13/375 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW+ M+ R + V L RRFG+S T KW+ R L + R Sbjct: 1 MPWNQESPMNQRIKLVADWLSGNFTKSQLARRFGVSRPTVDKWISRHN-GDLKSLAEVSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLE--DQGHTMPAFSTVHNLMARHGL 122 PH+SPN++ D+I A + + H++WG +K+ L D P+ ST + R GL Sbjct: 60 RPHNSPNKTDDEILARVVAMKEAHDKWGPKKLIELLRIEDPSIDWPSPSTAGQWLDRLGL 119 Query: 123 LPGASPGIPATGR----FEHDAPNRLWQMDFKGHFPFGG-GRCHPLTLLDDHSRFSLCLA 177 + E + PN+ W D+KG F C PLT+ D SR L Sbjct: 120 VNKRRFKRRHGTSHIEMREANDPNKTWCADYKGQFKMLNAQMCFPLTVTDHASRLILACR 179 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHS 236 + + V+Q +F+ YG+P+ + DNG P+ + L +W +RLGI + Sbjct: 180 AHPKIKTQPVKQTFERLFQEYGMPEVIRSDNGVPFASPGLARMSTLAVWWIRLGIYPERT 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P P G+ ER HRSLK E ++ E Q +H++ +N RPHEAL M PG Sbjct: 240 MPGRPAQNGRHERMHRSLKLE---LPLGSNLVEQQLLLEHFKHEFNYVRPHEALGMKRPG 296 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 Y PS R Y G EY + VR V G + G + +A GER+GLKE ED Sbjct: 297 DVYMPSTRLYPGCLPDVEYPAEMRVRSVRQDGSIKWNGKLVFVSEALSGERIGLKE-AED 355 Query: 357 GSYEVWWYSTKVGVI 371 ++++ +G + Sbjct: 356 DVWDLYLCDYPLGRL 370 >UniRef50_A4WDP4 Integrase, catalytic region n=4 Tax=Enterobacter sp. 638 RepID=A4WDP4_ENT38 Length = 382 Score = 305 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 123/384 (32%), Positives = 176/384 (45%), Gaps = 10/384 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW TM R +FV + + +CRRF IS TGYKWL R++ + A L DR R Sbjct: 1 MPWTETVTM-QRLQFVAACLEGNLPVAEVCRRFNISRKTGYKWLARFSPDDTASLADRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLL 123 HH N + + + LL +H WG KI++ L + T +PA ST+ L HGL+ Sbjct: 60 ARHHQ-NSTPEPMVQLLLDTKQQHPLWGPDKIRQRLLNLNITGVPAASTIGELFRVHGLV 118 Query: 124 PGASPGIPATGRF----EHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAH 178 P + R PN +W DFKG F GGR CHP TL D+ SR L Sbjct: 119 KKRRPPAFKSTRPHELHTVAHPNDVWSADFKGKFTHTGGRWCHPFTLTDNCSRIVLACDA 178 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHSR 237 V L VF G+P + DNG P+ + + +WL++ G+ R Sbjct: 179 TYMPDGRFVIPCLERVFRECGMPQVLRTDNGPPFAGAGLWGLSQMSIWLIKCGVLPERIR 238 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P P G+ ER HR+LK + + F E Q D WR+ +N RPH+AL PGS Sbjct: 239 PGKPTENGRHERMHRTLKDALKRHTKFTSLEEQQAWLDAWRSEFNDIRPHKALGGKTPGS 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 + PS R ++G + +V + G L + +A RGE + +K+++ED Sbjct: 299 VWYPSERIFTGPLKAMPVPDDARTLRVSVKGDLCFNSTRIFLSEALRGEWIWMKQVEEDL 358 Query: 358 SYEVWWYSTKVGVIDLKKKSITMG 381 E+ + + D + I Sbjct: 359 D-EIGFGELILARYDRRNHRIIRA 381 >UniRef50_D2LAL9 Integrase catalytic region n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2LAL9_9DELT Length = 390 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 127/376 (33%), Positives = 184/376 (48%), Gaps = 15/376 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + M R F++ SQ + +LCR++GIS TGYKWL+R+ GL +R R Sbjct: 1 MPWKKVNPMEERARFIVELSQRRESFAALCRKYGISRETGYKWLRRYQAG--EGLGERSR 58 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 + H P+++ D + LL + WG +K+ R L+D PA ST +++ RHGL Sbjct: 59 VARHCPHKTPDAVVTLLLALRQENPYWGPKKLVRLLQDVHGIEYPPAKSTAGDILKRHGL 118 Query: 123 LPGASPGIPATG-------RFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSL 174 + +G + N +W D+KG F CHPLT+ D SR+ L Sbjct: 119 ITATKAKRRQSGGRLRREDLRQPKQANDVWSADYKGWFRLEDRSICHPLTISDIFSRYVL 178 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRV 233 + E ++ + VF RYGLP + +DNG+P+G T T L +W ++LGI V Sbjct: 179 GCYVFPTQTLERTKEAMRRVFMRYGLPRAIRVDNGTPFGSTGIAGLTGLSVWWLQLGIVV 238 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMA 293 P P+ G ER HR+LK E A+ E Q + WR +N RPHEALD A Sbjct: 239 DFIAPGKPEQNGCHERMHRTLKLEATIPPS-ANLREQQERLESWRERFNSHRPHEALDQA 297 Query: 294 VPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEM 353 P S Y+PS+R+ N EY R V G + +G + G+AF R+GL Sbjct: 298 TPASIYRPSSRRLPRNEPCFEYPSSFESRTVRRDGMFNWEGRQIFLGEAFAKCRIGLTRN 357 Query: 354 QEDGSYEVWWYSTKVG 369 +D + V+ +G Sbjct: 358 YDD-RWLVYLGEHLLG 372 >UniRef50_Q01QQ4 Integrase, catalytic region n=9 Tax=Bacteria RepID=Q01QQ4_SOLUE Length = 395 Score = 302 bits (773), Expect = 1e-80, Method: Composition-based stats. Identities = 127/386 (32%), Positives = 191/386 (49%), Gaps = 12/386 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW R + ++G +I L +G+S T YKWL+R ++G GLQ + R Sbjct: 1 MPWQEIRVEEQRLLMIRDH-EEGMSISELAEVYGVSRKTVYKWLERHDEQGFLGLQAQSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL--EDQGHTMPAFSTVHNLMARHGL 122 PH SPN+ + ++ + A + WG K++ L +D PA ST+ ++ +GL Sbjct: 60 RPHRSPNQVTSEVEGAIIAARHKW-GWGPGKLRVKLFQQDSRVPWPAVSTIAAVLKANGL 118 Query: 123 L--PGASPGIP--ATGRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLA 177 + P +P D PN +W +D+KG F G G R PLT+ D SR+ L Sbjct: 119 VVSRRNRPRVPIQRPPYLAADGPNAVWNIDYKGWFRCGDGTRVDPLTISDGFSRYLLRCQ 178 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHS 236 H E + V+ F+ +GLP + DNG+P+ G + L +W ++LGI V S Sbjct: 179 HVEQTGYELTRAVFVATFQEFGLPGAIHSDNGTPFASVAPGGLSRLSIWFVKLGIVVERS 238 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP PQ G+ ER HR+LKA + A Q+AF ++ YN ERPHEALD P Sbjct: 239 RPACPQDNGRHERMHRTLKAATAKPPQ-ATVRLQQQAFHAFQREYNEERPHEALDNKTPH 297 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 S YQ SAR Y EY + + R + G L KGV + F E +G+K + E Sbjct: 298 SCYQASARSYPRRVPELEYGDDMETRVISQQGSLKWKGVRTFISEVFAYETLGIKVIDER 357 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGK 382 E+++ ++G +D +++ + K Sbjct: 358 W-VELYFGPIRLGWLDGYRQTFSRRK 382 >UniRef50_D2ML31 Integrase, catalytic region (Fragment) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2ML31_9BACT Length = 411 Score = 301 bits (770), Expect = 3e-80, Method: Composition-based stats. Identities = 126/386 (32%), Positives = 173/386 (44%), Gaps = 11/386 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R FV +LC +GIS TG KWL R+ +G AGL D R Sbjct: 1 MPWKEIKIMDQREHFVSDYLTGDYPKGALCELYGISRPTGDKWLARYHAQGVAGLADLAR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGL 122 PH P+++ + + RH +G +KI+ L P STV ++ R GL Sbjct: 61 RPHTQPHQTPAAVIEAILTMKHRHPSFGPKKIRDRLRAVAPEEAWPVESTVGVILKRAGL 120 Query: 123 LPGASPGIPATGRFEHDAPNR----LWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLA 177 + + + W DFKG FP G G RC+PLT++D SR+ L Sbjct: 121 VRPRRVRRRVPADPQRLSRGTAPAPTWSADFKGDFPLGTGPRCYPLTVMDHASRYLLRGE 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHS 236 R VQ + VF YGLP + DNG P+ T G + L W +RLG+R Sbjct: 181 GLLQPTRAAVQPWVAWVFHEYGLPATIRTDNGPPFASTALGGLSRLAAWWVRLGLRPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP P G ER HR+LKA V G A QR F + YN R HEA+ PG Sbjct: 241 RPGTPSENGCHERMHRALKAAV--GPPAATLAAQQRRFAAFVDEYNWARSHEAVARQPPG 298 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 YQPS R Y P EY G +VR+V +G + +G + E VG ++ E Sbjct: 299 QVYQPSPRAYPAKLPPIEYAPGTLVRQVRQNGAVRWRGHGRYLSEVLAPEPVGFTQIGER 358 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGK 382 ++ + + + G +D + +I + Sbjct: 359 -TWAIHYRFHRRGTLDDRTLTIIPVR 383 >UniRef50_UPI00017F3A47 integrase catalytic subunit n=1 Tax=Escherichia coli O157:H7 str. EC4024 RepID=UPI00017F3A47 Length = 365 Score = 293 bits (750), Expect = 7e-78, Method: Composition-based stats. Identities = 134/360 (37%), Positives = 179/360 (49%), Gaps = 10/360 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R +F+ + +LCR FGIS TGYKWLQR+ + L DR R Sbjct: 1 MPWTETRPM-QRLDFIRACHAGTDSFSALCRLFGISRKTGYKWLQRFDPSDLSSLSDRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGL 122 PH DDI A L +H WG +K++ WL + T+PA ST+ +++ R GL Sbjct: 60 APHSHSRTVPDDIAAQLTALRQKHPDWGPKKLRMWLLNHHADFTVPAASTIGDILKREGL 119 Query: 123 LPGASPGIPATGRFEHDAP----NRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLA 177 +P G + N++W DFKG F CHP TL D+HSR+ L Sbjct: 120 VPDKKRKRRTPGNRQPLTTISENNQVWSADFKGKFRLLSREYCHPFTLTDNHSRYLLSCR 179 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHS 236 E V+Q L F YGLP+ + DNG P+ T + L +WL+RLGIR Sbjct: 180 GTDRESEPFVRQCLTDAFLEYGLPEVLRTDNGQPFAGTGIAGLSRLAVWLIRLGIRPERI 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 R HP+ G+ ER HRSLK+ V G F E QR F +R +N ERPHE+L A PG Sbjct: 240 RKGHPEENGRHERMHRSLKSAVSHGNTFMTMEEQQRWFSDYREEFNHERPHESLAGATPG 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSV-KGVSLSAGKAFRGERVGLKEMQE 355 +QPS RQ+ G Y EG V +V G L + K ++ +A E + L+E + Sbjct: 300 MVWQPSCRQWDGRVPDYAYPEGGTVYRVKSRGTLYMGKKGTVFLSEALTDEYIMLEERDD 359 >UniRef50_UPI00019025E5 ISHne2, transposase n=1 Tax=Rhizobium etli Brasil 5 RepID=UPI00019025E5 Length = 392 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 119/368 (32%), Positives = 166/368 (45%), Gaps = 12/368 (3%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 M R + ++ LCRR+G+S T Y W +R DR P+R Sbjct: 1 MEERVRMLSDYVSGHWSVSDLCRRYGVSRETFYSWRKRQMSGADDWFVDRSHGTVSCPHR 60 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGLLPGASPGI 130 + + + R G RK+ L+ Q PA ST+ +++ R GL+ A Sbjct: 61 TPAALVDQVIALRQRFPHMGPRKLLALLQRQSAQTPWPAASTIGDILKRAGLVEVAKRRR 120 Query: 131 PA----TGRFEHDAPNRLWQMDFKGHFPFGG-GRCHPLTLLDDHSRFSLCLAHCTDERRE 185 A E N W +DFKG F R PLT+ D +SRF L + E Sbjct: 121 RALDQSRPFTEATQANDEWSVDFKGWFRTRDQQRIDPLTISDSYSRF-LIDVRIAPQTIE 179 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 V+ F +GLP + DNGSP+G G T L W ++LGI P PQ Sbjct: 180 GVRPVFEEAFRTHGLPFAIRCDNGSPFGSHGAGGLTRLSTWWIKLGIEAHFIAPASPQEN 239 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP-SA 303 G+ ER HR+LKA+ + ++G+ Q FD +R YN ERPHEAL P Y+P Sbjct: 240 GRHERMHRTLKAQTSKPP-ADNAGQQQVRFDAFRQHYNEERPHEALGQRPPADLYRPCQP 298 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWW 363 R P YD VR+V SG++ KG L +A GE VGL E+ E+G + V + Sbjct: 299 RAMPERLDDPWYDADHQVRRVRDSGEIKWKGGRLFVSEALAGELVGLSEL-ENGDHVVRF 357 Query: 364 YSTKVGVI 371 + +G+I Sbjct: 358 CNRDIGLI 365 >UniRef50_B6J0H9 Transposase n=3 Tax=Coxiella burnetii RepID=B6J0H9_COXB2 Length = 317 Score = 269 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 128/277 (46%), Positives = 167/277 (60%), Gaps = 3/277 (1%) Query: 102 DQGHTMPAFSTVHNLMARHGLLP-GASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGG-R 159 +G+ MP TV+ ++ R+G + S RFEH+ PN LWQMDFKGHF R Sbjct: 37 KKGYIMPCIKTVNRILKRYGRITIEESLKRKKFIRFEHEHPNDLWQMDFKGHFRLTNKIR 96 Query: 160 CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWG-DTTGT 218 CHPLTLLDD +R+SL + C DER ETV+Q L+ +F ++GLP RMTMDNG+PWG + Sbjct: 97 CHPLTLLDDCTRYSLGIIACGDERLETVKQALIDIFRKWGLPKRMTMDNGAPWGYSGSQN 156 Query: 219 WTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR 278 +T L +WL++ I V HSRPYHPQTQGKLERFHR+ K E L +F + Q+ FD WR Sbjct: 157 YTQLTVWLIQQTIYVSHSRPYHPQTQGKLERFHRTFKQEFLNRYYFDTLAQAQKVFDWWR 216 Query: 279 TVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLS 338 YN ERPH A++ P Y S R Y P EY + VRKV+ G +S KG Sbjct: 217 DFYNDERPHSAIEAYSPSEIYHRSERSYCEKIQPYEYATEMDVRKVNQKGIMSYKGRRYF 276 Query: 339 AGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKK 375 G+AF G+ +GL E+ V++ KV +DL + Sbjct: 277 VGEAFGGQAMGLMPSNENDIVNVYFCHQKVFKLDLNQ 313 >UniRef50_Q5ZTP2 Transposase (ISmav2) n=14 Tax=Proteobacteria RepID=Q5ZTP2_LEGPH Length = 341 Score = 264 bits (674), Expect = 5e-69, Method: Composition-based stats. Identities = 109/335 (32%), Positives = 158/335 (47%), Gaps = 11/335 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + + +FV DG + SLC+ FGIS TG+K R+ + G GL DR R Sbjct: 1 MPWQECTKVDEKIKFVARLL-DGEQMSSLCQEFGISRKTGHKIYNRYKESGLEGLNDRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 PH N+ + + WGA KI+ + Q + PA ST+H ++ ++GL Sbjct: 60 KPHRYANQLPFQLEKEILKVKKEKPTWGAPKIREKILRQYPDVKSPAISTIHTILDKYGL 119 Query: 123 LPGASPGI---PATGRFEHDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLAH 178 + T PN LW D+KG F C+PLTL D +SR+ L Sbjct: 120 VTKRKRRRYKAEGTKLTNGKTPNELWCADYKGEFQLGSKEYCYPLTLTDFNSRYLLACEG 179 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW--TALELWLMRLGIRVGHS 236 + + + VF+ YGLP+ + DNG P+ + + L +W +RLGI + Sbjct: 180 LSTTKEQYAITVFERVFKEYGLPNAIRTDNGVPFSSVQALFGLSKLSVWWLRLGISIERI 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP +PQ G+ ER H +LK E + + + Q FD + YN ERPH+ALDM PG Sbjct: 240 RPGNPQENGRHERMHLTLKKETTKPSG-ENFLQQQEKFDRFIDEYNNERPHQALDMRYPG 298 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLS 331 Y PS ++Y G +Y + V G+L Sbjct: 299 EVYIPSNKEYKG-LPEVDYPFHDKMITVTHCGRLC 332 >UniRef50_Q4JUH9 Transposase for IS3514b n=9 Tax=Corynebacterium RepID=Q4JUH9_CORJK Length = 407 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 96/352 (27%), Positives = 141/352 (40%), Gaps = 16/352 (4%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 ++ + V + G + + +RF IS YK L ++ GA + + R PH P Sbjct: 2 NSPNRNLAIVKAVREQGEPVTKVAKRFRISRQRIYKILSQFDAGGADAIAPKSRAPHTHP 61 Query: 71 NRSSDDITALLRMAHDRHER----WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA 126 + + + R G I L QG +P+ ST+ ++ GL+ Sbjct: 62 QAVPTSLRNQIIDMRKQLVRSGLDAGPETIAFHLHRQGLRVPSTSTIRRIITNAGLVTPQ 121 Query: 127 SPGIPATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 P + RFE PN WQ D G R L +DDHSR+ L + Sbjct: 122 PQKKPRSSFIRFEAAMPNECWQADITHLHLLDGTRLEVLDFIDDHSRYLLSITAAASFSG 181 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDT----TGTWTALELWLMRLGIRVGHSRPYH 240 V +L + YG P DNG + G A E L + I+ + RP H Sbjct: 182 PAVAAELQRLIATYGPPASTLTDNGLVFTARLAGARGGRNAFEKTLNKYRIQQKNGRPGH 241 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQTQGK+ERFH++LK + ELQR D + YN RPH AL P Y Sbjct: 242 PQTQGKIERFHQTLKKWIAAQSPAITLVELQRQLDTFADYYNTVRPHRALGRRTPHEVYT 301 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGV----SLSAGKAFRGERV 348 + + E+ V V +GK++V+ L G+ + GE + Sbjct: 302 TGPKAEPNDKPEEEW--RVRNDVVTPNGKVTVRYASRLYQLGIGRKYTGETI 351 >UniRef50_Q82H05 Putative IS481 family ISMav2-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82H05_STRAW Length = 589 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. Identities = 105/380 (27%), Positives = 159/380 (41%), Gaps = 40/380 (10%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 + R V+ + G + + R+G+S + + W++++ Q G AGL DR P P+R Sbjct: 2 VEQRYHAVMEVAA-GVPVTQVAARYGVSRQSVHSWVRKYEQSGLAGLTDRSHRPASCPHR 60 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLLPG--ASPG 129 + ++ A++ RH WG R++ LE +G +P+ +TV+ ++ R+ L+ Sbjct: 61 IASEVEAVVCELRRRHPTWGPRRLVHELERRGLAPVPSRATVYRVLIRNSLIEPGVRRRR 120 Query: 130 IPATGRFEHDAPNRLWQMDFK-GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R+E A LWQMD G GG C +T +DDHSRF + V Sbjct: 121 RSDYRRWERSAAMELWQMDIVGGLLLADGGECKMVTGIDDHSRFMVIAKVVQRATARAVC 180 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIRVGHSRPYHPQTQ 244 R+G+P+ + DNG + + GI ++P P T Sbjct: 181 SAFGEALVRFGVPEEVLTDNGKQFTARFSPGKPGEAMFDRICRENGITHRLTKPRSPTTT 240 Query: 245 GKLERFHRSLKAEVL-QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 GK+ERFH++L+ E+L Q F D Q D W YN RPH+ LDMAVP SR+ P Sbjct: 241 GKIERFHQTLRRELLDQQDPFTDLATAQATVDAWLEEYNRMRPHQGLDMAVPASRFVPRP 300 Query: 304 RQYSGNTT-------------------PPEYDEGVMV-----------RKVDISGKLSVK 333 R P + R V SG LS++ Sbjct: 301 RAEQDALPVRLPARLDPVPAPASAEPEPATVPRAWPMTEGEVGAIEVDRVVPASGNLSLR 360 Query: 334 GVSLSAGKAFRGERVGLKEM 353 G + G A G V L+ Sbjct: 361 GQQIWFGPALAGTTVTLRID 380 >UniRef50_B1ZP18 Integrase catalytic region n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZP18_OPITP Length = 387 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. Identities = 108/378 (28%), Positives = 159/378 (42%), Gaps = 12/378 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + R ++ ++ +LC RFG+S T YKW R+ +G GL R Sbjct: 1 MPWKIKTAEQQRQALAREMTRGTVSVTALCARFGVSRTTAYKWAARYVAQGVNGLVARQP 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 +++ A + +A WGA K++ WLE G +P T+H + G Sbjct: 61 GRPKQVSQALARWHARVLLARQARPSWGAPKLRWWLERTHPGERVPCSRTLHRWLVAAGR 120 Query: 123 LPGASPGIPATGRFE----HDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLA 177 + + A + N +W DFKG F G LT+ D +SRF L Sbjct: 121 VHQRRRKLRAGPGRPATVLAERVNAVWTADFKGDFYTKDGAWILALTVRDLYSRFMLTAH 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWG-DTTGTWTALELWLMRLGIRVGHS 236 + V++ +F R+G+P + +D G+P+ TAL LW RLGI V Sbjct: 181 PVPRQSEPVVRRVFARLFRRFGVPQAIRVDRGTPFCGSGPYGLTALSLWWQRLGIEVQFV 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 E+ HR LKAE + +++R W YN +RPHE L P Sbjct: 241 SRKRRLDNNAHEQMHRMLKAEAATPVSRSYGAQVRR-LQRWCGRYNHDRPHEGLAGRTPA 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 S Y+PS R PP+Y G + R+V G + + G G+AF G V Sbjct: 300 SLYRPSTRLLP-RLVPPQYPLGCVTRRVRPHGYVKLDGSHRHIGRAFVGLTVAFTPY--R 356 Query: 357 GSYEVWWYSTKVGVIDLK 374 Y V + S +G ID + Sbjct: 357 QLYRVHFDSLLLGTIDPR 374 >UniRef50_Q4JWW8 Transposase for IS3511a n=7 Tax=Corynebacterium RepID=Q4JWW8_CORJK Length = 405 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. Identities = 92/348 (26%), Positives = 141/348 (40%), Gaps = 16/348 (4%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 + G + FG++ +R+ + G L + + PH +P ++ D Sbjct: 9 IIETMLATGMTQAEAAQHFGVTTRWIRTLQKRYNEGGVEALTPKSKRPHTNPRATTPDTV 68 Query: 79 ALLRMAHD----RHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLLPGASPGIPAT 133 + + R GA I+ LE + T +PA +T+H ++ +G + P + Sbjct: 69 DRILQLRNELTNRGTDAGAHTIRWHLEQEDTTPLPATATIHRILKNNGHVTLQPQKRPRS 128 Query: 134 G--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 RF+ D PN WQMD+ G R LT+LDDHSR+ L V + Sbjct: 129 SWIRFQADQPNETWQMDYSDWTIAGHQRVVILTILDDHSRYVLRCQAFNSATVTHVIEAF 188 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIRVGHSRPYHPQTQGKL 247 +G P DNG + + E L+ LGI + RPYHPQTQGK+ Sbjct: 189 AYTAAIHGYPQSTLTDNGRAFTTSNDRTNPARNGFEQLLLDLGIEQKNGRPYHPQTQGKV 248 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 ERFH +LK + D L D YN +RPH AL+ P Y + + Sbjct: 249 ERFHYTLKLALRNKPQARDIDNLNEQLDDIIDYYNNKRPHRALNRCTPAEAYNALPKAHP 308 Query: 308 GNTTPPEYDEGVMVRKVDISGK--LSVKG--VSLSAGKAFRGERVGLK 351 +D + KV +GK L G + G+ + GE + + Sbjct: 309 -RPGAKTHDYRLRTDKVAKNGKTTLRWGGQLRRIYIGRRWTGEPITIM 355 >UniRef50_A1T2L4 Integrase, catalytic region n=4 Tax=Actinomycetales RepID=A1T2L4_MYCVP Length = 597 Score = 250 bits (638), Expect = 6e-65, Method: Composition-based stats. Identities = 93/300 (31%), Positives = 138/300 (46%), Gaps = 9/300 (3%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 + R V DG + + + G S + + W+ R+ G AGL DR R P SPN Sbjct: 13 VIEHRYRAVRQVL-DGVSKSQVAQECGASRQSVHSWVIRYEALGVAGLADRSRRPLTSPN 71 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLLPGASPGI 130 S + A++ + RWGA++I L +G P+ S+V+ ++ RHGL+ Sbjct: 72 ELSPAVVAMVCELRRTYPRWGAQRIAHELALRGVDAPPSRSSVYRILVRHGLVAAQQQNH 131 Query: 131 PAT-GRFEHDAPNRLWQMDFK-GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R++ DAP +LWQ+D G F G C +T +DDHSRF + D V Sbjct: 132 KRKYRRWQRDAPMQLWQIDIMGGVFLVDGRECKVVTGIDDHSRFVVMATVVADPGARAVC 191 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIRVGHSRPYHPQTQ 244 + YG+P + DNG + E GI ++P P T Sbjct: 192 AAFTATMAIYGVPSEVLTDNGKQFTGRFTKPYPAEVLFERICRENGITTRLTKPRSPTTT 251 Query: 245 GKLERFHRSLKAEVLQ-GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 GK+ERFH++L+ E+L FA Q A D W YN RPH++L MA P + ++P+ Sbjct: 252 GKIERFHKTLRRELLDSAGPFASIEVAQEAIDAWVHGYNHSRPHQSLGMATPATMFRPAP 311 >UniRef50_B2HR82 Transposase for ISMyma05 n=5 Tax=Mycobacterium RepID=B2HR82_MYCMM Length = 518 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. Identities = 106/389 (27%), Positives = 166/389 (42%), Gaps = 33/389 (8%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRPRIP 66 +R VL DGA + ++ RRFG+S T + WL+R+A+EGAA L+DR P Sbjct: 3 QELRMSEMRYRAVLEVL-DGAPVTAVARRFGVSRQTVHAWLRRYAEEGAALNLEDRSSRP 61 Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ-GHTMPAFSTVHNLMARHGLLPG 125 H P++ ++ A + D H RWG +I L+ +P S+V+ + R+G + Sbjct: 62 HRCPHQMPVEVEARVLTLRDAHPRWGPTRIVYELQRDVVPVVPGRSSVYRALVRNGRIDP 121 Query: 126 ASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAHCTDE 182 A + R+E P LWQMD G G +T +DD+SRF + Sbjct: 122 AKRRRRRSDYKRWERGRPMELWQMDVVGGLHLRDGIEVKVVTGIDDNSRFVVSAKVVARA 181 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA-----LELWLMRLGIRVGHSR 237 V L+ R+G+P+++ DNG + G + + + GI + Sbjct: 182 TARPVCAALLEALRRHGVPEQILTDNGKVFTGRFGPGGSSAEVLFDRVCVENGIGHLLTA 241 Query: 238 PYHPQTQGKLERFHRSLKAEVLQ--GKWFADSGELQRAFDHWRTVYNLERPHEALDM--- 292 P P T GK+ER H++++AE+ F ELQ A D W YN RPH++L M Sbjct: 242 PRSPTTTGKVERLHKTMRAEIFAEVDGVFDAIAELQAAIDRWVQYYNTARPHQSLGMVAP 301 Query: 293 -------------AVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSA 339 V P+ Q+ G + + + R V+ G +SV G Sbjct: 302 AARFALAAGPDLSVVEPVAAVPAEGQHPGALV--DLRDAGVRRWVNRHGSISVAGFRYRV 359 Query: 340 GKAFRGERVGLKEMQEDGSYEVWWYSTKV 368 GE V + + D ++ + V Sbjct: 360 PIVLAGEPVSV--VVADNLVSIYHHDVLV 386 >UniRef50_UPI0001B45627 transposase for ISMyma05 n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B45627 Length = 519 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 110/388 (28%), Positives = 170/388 (43%), Gaps = 30/388 (7%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRPRIP 66 +R VL DGA I ++ R+G+S T + WL+R+A+EGA L+DR P Sbjct: 3 RELRVSEMRYRAVLEVL-DGAVISTVACRYGVSRQTVHAWLRRYAREGAVLNLEDRSSRP 61 Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLLPG 125 H P++ + ++ A + + D H RWG +I L +G +P S+V+ + R+G + Sbjct: 62 HGCPHQMAAELEARVLVLRDAHPRWGPTRIVYELVREGVVAVPGRSSVYRALVRNGRIDP 121 Query: 126 ASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAHCTDE 182 A R+E P LWQMD G G +T +DD+SRF +C A Sbjct: 122 ARRRRRRADYKRWERGRPMELWQMDVVGGVHLCDGVEVKVITGIDDNSRFVVCAAVVARA 181 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTG-----TWTALELWLMRLGIRVGHSR 237 V + L++ R+G+P+++ DNG + G + + GIR + Sbjct: 182 TARPVCEALLAALARHGVPEQILTDNGKVFTGRFGPGGSSSEALFDRVCAENGIRHLLTA 241 Query: 238 PYHPQTQGKLERFHRSLKAEVL--QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P P T GK+ER H++++AE FA ELQ A D W YN ERPH++L M P Sbjct: 242 PRSPTTTGKVERLHKTMRAEFFTDADGRFATIAELQAALDGWVGQYNTERPHQSLGMRPP 301 Query: 296 GSRY---------------QPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAG 340 R+ A T P+ + R VD G++ + G Sbjct: 302 AERFALAAAGPDPAVVDPIAAVAVHQQSPTRRPDLRHAGVQRWVDQRGRIRLAGFGYRVP 361 Query: 341 KAFRGERVGLKEMQEDGSYEVWWYSTKV 368 GE V + D +++ + V Sbjct: 362 IVLAGEPVEA--VVADNLVQIYHHDVLV 387 >UniRef50_UPI0001B540A0 transposase for IS3514a n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B540A0 Length = 383 Score = 241 bits (615), Expect = 4e-62, Method: Composition-based stats. Identities = 105/366 (28%), Positives = 158/366 (43%), Gaps = 23/366 (6%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL---- 81 + NI CR G+S +K+L R+ EGA G R PH P ++ + Sbjct: 16 EDVNIARFCREHGVSRTVFHKYLNRFRAEGADGFTRRSTAPHRRPTALGTEVAEAVLRAR 75 Query: 82 RMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGL-LPGASPGIPATGRFEHD 139 + D G I+ LE QG +P+ S V+ ++ HG +P RFE+ Sbjct: 76 KELADEGLDNGPISIRWRLEAQGAAAVPSQSAVYRILRAHGQIVPQPRKKPRTRRRFEYA 135 Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 PN WQ+D H G + + +LDDHSR + T E L F +G Sbjct: 136 DPNGCWQIDGMEHHLADGTKVCIIQILDDHSRLDVGAYAATGETTAATWAALQHAFAGHG 195 Query: 200 LPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 LP + DNG + G LE L LGI + P+HPQT GK ER H++L+ + Sbjct: 196 LPVALLSDNGLAFSGKHRGRMVELERRLAALGITAIAAAPHHPQTCGKNERSHQTLQKWL 255 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 +LQ D +RT+YN R H++L+ P RY AR + T P G Sbjct: 256 AARPAAGTLAQLQELLDEYRTIYNHRR-HQSLNGDTPRQRY--DARPKAVPATGPRRPSG 312 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI--DLKKK 376 + R V +G ++ G S+ G+ + G + V+W +V V+ D + Sbjct: 313 LATRPVSATGVIAFSGCSIVLGRRWAGH-----------TASVYWQGDRVTVMINDTIAR 361 Query: 377 SITMGK 382 +T+ + Sbjct: 362 QLTLDR 367 >UniRef50_UPI0001B453C4 transposase for IS3514a n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B453C4 Length = 397 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. Identities = 97/386 (25%), Positives = 158/386 (40%), Gaps = 17/386 (4%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 MS + +G + ++ R + +S ++ ++R+ EG A + R R PH +P Sbjct: 1 MSKAQLVITAVVLEGRSKSAVARDYEVSRYWVHQLVKRYEAEGPAAFEPRSRRPHTNPRA 60 Query: 73 SSDDITALLRMAH----DRHERWGARKIKRWLEDQGH--TMPAFSTVHNLMARHGLLPGA 126 + D+ + GA I L T+PA +T+ +++R G + Sbjct: 61 VAGDLEERIVRLRKTLLREGYDAGAATIAEHLARDPAVATVPALATIWRVLSRRGFITAQ 120 Query: 127 SPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 P + RFE D PN+ WQ D L ++DDHSR ++ Sbjct: 121 PQKRPRSSWKRFEADLPNQCWQADVTHWQLADHTSAEILNIIDDHSRLAIASTAYRTVTA 180 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDT--TGTWTALELWLMRLGIRVGHSRPYHPQ 242 V + + F +G P + DNG+ + T G TAL++ L LGI +SRPYHPQ Sbjct: 181 PDVVEAFTAAFATWGTPAALLTDNGAVFTATPRRGGRTALQILLGELGITYINSRPYHPQ 240 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 T GK+ERFH++LK + ELQ + + YN RPH A+ P + Sbjct: 241 TCGKVERFHQTLKKRLTAVPPATTITELQSHLNEFVNYYNTVRPHRAVGRRTPHHAFTSR 300 Query: 303 ARQYS-GNTTPPEYDEGVMVRKVDISGKLSVKGVS----LSAGKAFRGERVGLKEMQEDG 357 + G PP + + ++D +G ++V+ S + K RG V + D Sbjct: 301 PAAFPTGYHIPPHF--RLRHDRIDAAGVITVRYNSRLHHIGLSKHLRGTHVIVLINNRDI 358 Query: 358 SYEVWWYSTKVGVIDLKKKSITMGKG 383 + + L +G Sbjct: 359 RVLARDTGQLIRKLTLDPTRDYQPRG 384 >UniRef50_C8XFB0 Integrase catalytic region n=4 Tax=Actinomycetales RepID=C8XFB0_NAKMY Length = 607 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 91/313 (29%), Positives = 142/313 (45%), Gaps = 17/313 (5%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M + R + V GA + + G+S + + WL+R+ EG GL DR Sbjct: 1 MALVVLSKVEQRLDAVRAVLA-GATVTEVAAAVGVSRVSVHAWLRRYLTEGVTGLADRSH 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 P P+++ D+++ + H RWGA++I+ L + G T+P+ +T++ ++ RHGL Sbjct: 60 RPRSCPHQAGDEVSVRVAELRRTHPRWGAKRIRMELLRKPAGLTVPSTATINRILIRHGL 119 Query: 123 LPGASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGG------GRCHPLTLLDDHSRFSL 174 + P + R+E P +LWQ+D G +T +DDHSRF + Sbjct: 120 VTPRRRKRPRSSYQRWERPGPMQLWQLDIVGDVWLVNPATGVLRGVKVVTGVDDHSRFCV 179 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT--TGTWTALELWLMRLGIR 232 A V L + R+G+P + DNG + G + GI Sbjct: 180 IAAVVERATGRAVCLALAAALARFGVPGEILTDNGKQFTARFGRGGEVLFDKICRHNGIT 239 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVL-QGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 ++P P T GK+ERFH +L+ E+L + F Q A D + VYN ERPH+ALD Sbjct: 240 HRLTQPASPTTTGKIERFHLTLRRELLDDHEPFESLAAAQAAVDEFVRVYNTERPHQALD 299 Query: 292 MA---VPGSRYQP 301 P R+ P Sbjct: 300 GQRPVSPADRFTP 312 >UniRef50_A3TPQ6 Transposase n=7 Tax=Actinomycetales RepID=A3TPQ6_9MICO Length = 402 Score = 229 bits (584), Expect = 1e-58, Method: Composition-based stats. Identities = 94/354 (26%), Positives = 143/354 (40%), Gaps = 16/354 (4%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 MS + +G + RR+G+ A YK R+ EG A + R R P SP Sbjct: 1 MSKARLVITALFVEGLKPAEVSRRYGVHRAWVYKLKARYEAEGEAAFEPRSRRPTTSPRA 60 Query: 73 SSDDITALLRMAHD----RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 + + L+ + + GA I L + +TVH ++ R G + Sbjct: 61 TPEGTVDLVLRLREDLTGKGLDGGADTIVWHLLHGHGVTLSRATVHRILTRAGKVTAEPG 120 Query: 129 GIPATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 P + RFE + PN WQ DF + G +T LDDHSR++L ++ T + Sbjct: 121 KRPKSSFIRFEAEQPNETWQSDFTHYRLSTGADVEVITWLDDHSRYALHVSAHTRTTAKI 180 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSP----WGDTTGTWTALELWLMRLGIRVGHSRPYHPQ 242 V + G P DNG + G TALE L L I +SRP HP Sbjct: 181 VLATFRAATAEQGCPAGTLTDNGMVYTVRFATGPGGRTALEHELRTLNIVQKNSRPNHPT 240 Query: 243 TQGKLERFHRSLKAE-VLQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMAVPGSRYQ 300 T GK+ERF +++K Q A EL + T YN RPH +L + P + Y Sbjct: 241 TCGKVERFQQTMKNWLRAQPDQPATVAELNTLLAAFVTEYNTRRPHRSLPHRSTPATAYN 300 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKG----VSLSAGKAFRGERVGL 350 + + + V K++ +G ++++ + G+ RV L Sbjct: 301 ARPKATPTTDRTDDTHDRVRTDKINKNGVVTLRYQGTLHKIGVGRTHARTRVFL 354 >UniRef50_Q3A8V0 ISChy3, transposase n=5 Tax=Clostridia RepID=Q3A8V0_CARHZ Length = 448 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. Identities = 93/366 (25%), Positives = 159/366 (43%), Gaps = 21/366 (5%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL 80 L A++ + + R IS T ++LQ + Q+G +GL + R + S S +I Sbjct: 30 LEAAEKRQRRKEILARSEISSRTLRRYLQLYRQQGLSGLMPKIRSDNGSSRTISHEIIEE 89 Query: 81 LRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGLLPG-ASPGIPATGRF 136 + +I LE + M A ST+ ++R GL A+ I RF Sbjct: 90 AVKLKEELPERSVSQIIAILEGEKKVPAGMLARSTLGRHLSRLGLTQKEANQKISGHRRF 149 Query: 137 EHDAPNRLWQMDFK-GHF------PFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 + NRLWQ D K G + P R + + +DD +R D++R ++ Sbjct: 150 AKEQRNRLWQADIKYGPYLPHPKNPKRKVRTYLVAFIDDATRLLCHGEFYLDQKRPVLED 209 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 + G+PD + +DNG + L RLGIR +++PY P+++GK+ER Sbjct: 210 CFRKAILKRGIPDAVYVDNGKIFVSRW-----FRLGCARLGIRPINTKPYSPESKGKIER 264 Query: 250 FHRSLKA--EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 F+R++++ ++ + EL +AF W PH +L+ P +R+Q R+ Sbjct: 265 FNRTVESFIAEIELQQPETLAELNQAFAVWVEEGYNHHPHSSLENETPANRFQKDTRRLR 324 Query: 308 GNTTPP--EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED-GSYEVWWY 364 + E R+VD +G + ++G G + + V L+ D S E W+ Sbjct: 325 FASLEECREAFLWEASRRVDKTGCIKLEGRFYEIGLEWIRKTVDLRYDPFDLESIEFWYN 384 Query: 365 STKVGV 370 K G+ Sbjct: 385 GQKQGL 390 >UniRef50_D1VRH7 Integrase catalytic region n=1 Tax=Frankia sp. EuI1c RepID=D1VRH7_9ACTO Length = 410 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. Identities = 97/349 (27%), Positives = 138/349 (39%), Gaps = 21/349 (6%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 MS + +G + RR+ +S YK R+ EG + R R P SP Sbjct: 1 MSKARLVITALVVEGQTAAQVARRYEVSRGWVYKLKARYDAEGEVAFEPRSRRPVSSPTA 60 Query: 73 SS----DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 +S D + L + + GA I L T + +T++ ++ R G + Sbjct: 61 TSVAMVDLVLRLRKELAEAGLDAGADTIGWHLAHHHDTTLSRATINRILNRAGAVTPEPA 120 Query: 129 GIPATG--RFEHDAPNRLWQMDFKGHFPFG-----GGRCHPLTLLDDHSRFSLCLAHCTD 181 P + RF+ D PN WQ DF + G LT LDDHSRF+L ++ Sbjct: 121 KRPRSSYIRFQADQPNECWQSDFTHYRLTRPNGKIGIDTEILTWLDDHSRFALRVSAHLK 180 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPW------GDTTGTWTALELWLMRLGIRVGH 235 V + +G P DNG + G T E L RLGI + Sbjct: 181 ITGRIVVASFRQAADLHGYPASTLTDNGMVYTVRLASAGVAGGRTGFEAELRRLGIVQKN 240 Query: 236 SRPYHPQTQGKLERFHRSLKAE-VLQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMA 293 SRP HP T GK+ERF ++LK Q A + LQ D + YN RPH +L Sbjct: 241 SRPNHPTTCGKVERFQQTLKKWLAAQPVQPASTYALQTLIDQFVETYNQHRPHRSLPGRC 300 Query: 294 VPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVK--GVSLSAG 340 P YQ + + + V VD +GKL+++ G G Sbjct: 301 TPAVAYQARPKARPNTDRSADSHDRVRRDHVDANGKLTLRVNGRLHHIG 349 >UniRef50_A1UAJ3 Integrase, catalytic region n=7 Tax=Actinomycetales RepID=A1UAJ3_MYCSK Length = 426 Score = 218 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 86/339 (25%), Positives = 139/339 (41%), Gaps = 29/339 (8%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAG-LQDRPRIPHHSPNRSSDDITALLRMAHDRH 88 + + C +GIS + Y+ +R EG A L+ R P SP++ SD++ Sbjct: 28 VSTFCAEYGISRKSFYELRKRVKTEGPAAVLEPMTRRPKSSPSKLSDEVKEQALAVRAAL 87 Query: 89 E----RWGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLLPGASPGIPAT--GRFEHDAP 141 E G + + G +P+ +++ + G+ P + RF + AP Sbjct: 88 EATGLDHGPISVHDKMHAMGLERVPSTASLARVFREAGVARLEPKKKPRSAWRRFVYPAP 147 Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N WQ+D + GG RC L+DDHSR++L E + + +G+P Sbjct: 148 NACWQLDATEYVLSGGRRCVIFQLIDDHSRYALASHVALSETAKEAIAVVDKAIAAHGVP 207 Query: 202 DRMTMDNGSPWG-DTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 R+ DNG G L L LG+ +PY P TQGK ERFH++L + + Sbjct: 208 QRLLSDNGIALNPSRRGHVGQLVAHLAALGVEAITGKPYKPTTQGKNERFHQTLFRYLDK 267 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEAL-DMAVPGSRYQPSA---------------- 303 ELQ D + +YN ERPH+ L P + ++ +A Sbjct: 268 QPIAESLAELQCHVDAFDGIYNTERPHQGLPGRVTPRTAWEATAKAPAPRPKPDPPSFDH 327 Query: 304 ---RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSA 339 R + TP + G V+ ++ +G + GV+ Sbjct: 328 AVVRPHRPAPTPADLPHGTSVKTLNTAGAFVLAGVTYKV 366 >UniRef50_A6DU50 Putative ISmav2-like transposase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DU50_9BACT Length = 598 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 69/288 (23%), Positives = 119/288 (41%), Gaps = 7/288 (2%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 + + V ++G + + G + + W++R+ +EG GL+ RP+ + Sbjct: 14 KLKCVKLCLEEGYPRKFVAAESGANLKSLGAWIKRYNEEGPQGLKPRPKGKKGR-QQIHP 72 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA--- 132 + + ++ +G ++I L+ + TV + L+ Sbjct: 73 ETKEKIIELKKQYPIFGIKRISDLLKRVFFLKASPETVRKTLNEENLIQKERKKPRKNPQ 132 Query: 133 -TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 FE PN++WQ D F GG + L +DD+SR+ + L + E + + Sbjct: 133 KPRFFERSRPNQMWQTDIFS-FRLGGQAAYLLAFIDDYSRYMVGLGLYRRQTAENLLEVY 191 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 Y P M DNG + + GT T E L + I+ S+P+HP T GK+ERF Sbjct: 192 RRATGEYNCPAEMLTDNGRQYTNWRGT-TRFEKELKKDRIKHIRSQPHHPMTLGKIERFW 250 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 +++ E L F Q+ W YN +RPH+ + P RY Sbjct: 251 KTIWTEFLDRCQFDCMETAQQRITLWIKYYNHQRPHQGIGGLCPADRY 298 >UniRef50_C5D6W5 Integrase catalytic region n=19 Tax=Firmicutes RepID=C5D6W5_GEOSW Length = 417 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 74/329 (22%), Positives = 122/329 (37%), Gaps = 18/329 (5%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 I+ T W R+ + G L+ + R R S D + H Sbjct: 49 IAAKTILDWCTRYKKGGFDALKPKRRSDRGHSRRLSPDDEDHILALRKEHPTMPVTVFYE 108 Query: 99 WLEDQGHT---MPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPF 155 L +QG ++ T++ L+ +H L+ +P RF +D N LWQ D Sbjct: 109 HLIEQGEIPENHISYFTIYRLLKKHNLVGKEILPMPERKRFAYDQINELWQGDLSHGPTI 168 Query: 156 G----GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSP 211 + + +DD SRF E+ + ++ R G P R+ DNG Sbjct: 169 RVNGKAQKTFLIAYIDDCSRFVPYAQFFPSEKFDGLRIVTKEAVLRCGKPKRIYSDNGKI 228 Query: 212 WGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV---LQGKWFADSG 268 + L+ +GI + H++PY PQ++GK+ERF R+++ L+ Sbjct: 229 YRSEV-----LQYACAEMGITLIHTQPYDPQSKGKIERFFRTVQTRFYPLLELDPPKSLE 283 Query: 269 ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVM---VRKVD 325 EL F W +PH +LD P +Q S D + RKV Sbjct: 284 ELNERFWRWLEEEYHRKPHASLDGKTPHEVFQSQVHLVSFIEDGDWLDAIFLKREHRKVK 343 Query: 326 ISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 G +++ F G+ + L+ + Sbjct: 344 ADGTITLNKQLYEVPPRFIGQSIELRYDE 372 >UniRef50_B0TDR5 Transposase, putative n=5 Tax=Firmicutes RepID=B0TDR5_HELMI Length = 451 Score = 207 bits (527), Expect = 5e-52, Method: Composition-based stats. Identities = 85/376 (22%), Positives = 148/376 (39%), Gaps = 34/376 (9%) Query: 8 DARDTMSLRTEFVLFASQDGAN-------IRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 A D S R + + +G + + +C + GIS T ++L ++ ++G +GL+ Sbjct: 6 KAEDIASQRVQLLSPLLAEGLDAARARLMKQQICEQAGISERTLRRYLSQYREKGFSGLK 65 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMP---AFSTVHNLM 117 + + S + + R +I + LE +G P ST+ + Sbjct: 66 PKGKGRSRSEEAIPHALLEEAILLRREVPRRSIAQIIQILEWEGKAEPGKLKRSTLQEKL 125 Query: 118 ARHGLLPGASPGIPATG----RFEHDAPNRLWQMDFK-GHFPFGG-----GRCHPLTLLD 167 A G TG RF+ N+LW D K G + G + + +T D Sbjct: 126 AERGYSTRHMQMYANTGVAARRFQQKHRNQLWHSDIKYGPYLPIGPDGAKKQVYLVTFFD 185 Query: 168 DHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLM 227 D +RF L + V+ +YG P+ + DNG + + Sbjct: 186 DATRFVLHGQFYPTLDQVIVEDCFRQAILKYGAPEAVFFDNGKQYRTKW-----MHRACA 240 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKA--EVLQGKWFADSGELQRAFDHWRTVYNLER 285 ++GIR+ ++PY P++ GK+ERF+R++ A + + L + F W + Sbjct: 241 KMGIRLLFAKPYSPESTGKVERFNRTVDAFLQEAALEKPHTLDRLNQLFWVWLDECYQNK 300 Query: 286 PHEAL-DMAVPGSRYQPSARQYSGNTTPPEYDEGVMV----RKVDISGKLSVKGVSLSAG 340 PH AL P + Y+ + + P+ + RKVD SG +S +G G Sbjct: 301 PHSALAGNVSPDTAYRSDKK--AVKFLDPDVVANAFLHCESRKVDKSGCISFEGRKYEVG 358 Query: 341 KAFRGERVGLKEMQED 356 +F G V + D Sbjct: 359 LSFIGCTVDVIYDPAD 374 >UniRef50_Q3SW20 Helix-turn-helix, Fis-type n=112 Tax=Bacteria RepID=Q3SW20_NITWN Length = 785 Score = 207 bits (527), Expect = 5e-52, Method: Composition-based stats. Identities = 80/300 (26%), Positives = 120/300 (40%), Gaps = 16/300 (5%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R S + E + Q + + GI AT Y+W R+ G L D P Sbjct: 461 RYPASEKAEIIALVEQSHLPAKRTLDKLGIPRATFYRWYDRYRAGGIEALADHRSRPDRV 520 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP-GASP 128 NR DD+ + R++ D+ + ++V+ L+ H L+ A Sbjct: 521 WNRIPDDVRGQIIDLALELPELSPRELAVRFTDERKYFVSEASVYRLLKAHDLITSPAYV 580 Query: 129 GIPATGRF--EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 I A F + A N+LWQ DF G G + T+LDD SR+ + Sbjct: 581 VIKAANEFKDKTTAANQLWQTDFTYLKITGWGWYYLSTVLDDFSRYIVAWRLGPTMCASD 640 Query: 187 VQQQLVSVFERYGL-------PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 V L GL R+ DNGS + L WL ++ PY Sbjct: 641 VTATLDQALAASGLDHVSVRQRPRLLSDNGSSYVADD-----LATWLRAKDMQHVRGAPY 695 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 HPQTQGK+ER+H++LK +L ++ +L+R + YN +R HE++ P Y Sbjct: 696 HPQTQGKIERWHQTLKNRILLENYYL-PDDLKRQVAAFVEHYNHDRYHESIGNVTPADVY 754 >UniRef50_D1TPC8 Putative transposase integrase n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1TPC8_9BURK Length = 255 Score = 206 bits (525), Expect = 9e-52, Method: Composition-based stats. Identities = 120/233 (51%), Positives = 144/233 (61%), Gaps = 6/233 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRP 63 MPW+ R+TM+LR EFV A Q+GAN R LCRRFGIS TGYKWL R AQ+ A L DR Sbjct: 1 MPWNPRETMNLRLEFVCLALQEGANRRELCRRFGISAKTGYKWLSRHAQDSTAMALADRS 60 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGL 122 R P +P R++ + + H WG RKI R L D GHT +PA STV +++ RHGL Sbjct: 61 RRPRQTPARTAPCVEQQVVQLRQAHPAWGGRKISRRLSDLGHTDVPAPSTVTDILHRHGL 120 Query: 123 LPGASPGIPATG-RFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAHCT 180 + A+ RFEH+ PN LWQMDFKG F GR C PLT+LDDHSR+++ L C Sbjct: 121 IDAAASAAATPWQRFEHEQPNDLWQMDFKGWFDLQDGRHCSPLTMLDDHSRYNVTLDACI 180 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWG--DTTGTWTALELWLMRLGI 231 TVQ L F RYGLP R+ DNGSPWG G T L +WL+RLGI Sbjct: 181 GTDTRTVQHHLERTFRRYGLPLRINADNGSPWGSPSQAGQLTELAIWLIRLGI 233 >UniRef50_C7MHU2 Integrase family protein n=3 Tax=Actinomycetales RepID=C7MHU2_BRAFD Length = 434 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 95/418 (22%), Positives = 153/418 (36%), Gaps = 53/418 (12%) Query: 5 MPWDARDTMSLRTEFVLFASQD-GANIRSLCRRFGISPATGYKWLQRWAQEGAAG-LQDR 62 M + +R + + S C GIS T Y R +EG A L+ + Sbjct: 1 MSKNQPVDPRVRLAISRWPEDAPRGTVTSFCVEHGISRKTFYVLRARLREEGPAAVLEPK 60 Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHE----RWGARKIKRWLEDQGHTMPAFSTVHNLMA 118 R P SP R +D+ E G + + G P+ + + + Sbjct: 61 SRRPSSSPTRIGEDVKDQAVAVRAALEASGLDHGPISVFDRMGAMGLESPSVAALARIFR 120 Query: 119 RHGLLPGASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 G+ P + RF + APN WQ+D G+ G C L DDHSR ++ Sbjct: 121 ERGVARADPKKKPRSAYRRFVYPAPNACWQLDATGYVLIDGRSCTIFQLQDDHSRLAVAS 180 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW-TALELWLMRLGIRVGH 235 E + + RYG+P R+ DNG+ T W + L LG++ Sbjct: 181 LVAPAETTQAALDVFLKGVARYGVPQRLLTDNGAAMNPTRRGWPSPLVTHATGLGVQAIT 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMA-V 294 +P+ P TQGK ERFH++L + Q +LQ D++ +YN +R H+ L Sbjct: 241 GKPFKPTTQGKNERFHQTLFRWLDQQPLAETISQLQAMVDNFDIIYNQQRRHQGLPGRIT 300 Query: 295 PGSRY--------------------------------QPSARQYSGNTTPPEYDEGVMVR 322 P + +P A Y+ P + + G V Sbjct: 301 PQQAWDATPVAEAPKPPAAPIDVLLPAPLDEVLHPGEEPQALDYTAWGDPFDKEAGQRVL 360 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKSITM 380 + +G + ++ ++ K GE V V W +T + VID+ + + Sbjct: 361 RTGSNGSIVLRRITFYLSKRRAGEHV-----------RVIWDATGLVVIDVHGEVLIK 407 >UniRef50_A3J543 Helix-turn-helix, Fis-type protein n=6 Tax=Bacteria RepID=A3J543_9FLAO Length = 336 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 74/298 (24%), Positives = 137/298 (45%), Gaps = 14/298 (4%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R T+S + E + ++ + R GI+ + Y W +++ G GL R + Sbjct: 2 RLTVSEKQEIIHMVTRSEIGVNRTLREIGINKSMFYNWYHAYSENGVEGLLPTKRASNRQ 61 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG 129 N + L+ + +R++ + D+ + S+V+ ++ GL+ + Sbjct: 62 WNSIPQEQKNLVVKLALEYPDLSSRELAYKVTDEQQIFLSESSVYRILKSRGLITAPAHI 121 Query: 130 IPATGRFEHDAP---NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + G D +++WQ DF G G + T+LDD+SR+ + C++ + + Sbjct: 122 FLSAGNEFTDKTSFVHQMWQTDFTYFKILGWGWYYLSTVLDDYSRYIVHWELCSNMKADD 181 Query: 187 VQQQLVSVFERYGL----PDRMTMDNGSPWGDTTGTWTALELWLMR-LGIRVGHSRPYHP 241 V++ + S ++ L ++ D GS + + L+ +L ++ H RP HP Sbjct: 182 VKRTVDSAIKKAKLVTKQKPKLLSDKGSCY-----IASELKTYLKDNYQMQQVHGRPNHP 236 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 QTQGK+ER+HR++K V +FA EL+ A + + YN ER HE+L+ P Y Sbjct: 237 QTQGKIERYHRTIKNVVKLDNYFA-PEELEAALEKFVYRYNNERYHESLNNLTPADVY 293 >UniRef50_B0NHH2 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium scindens ATCC 35704 RepID=B0NHH2_EUBSP Length = 422 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 72/341 (21%), Positives = 128/341 (37%), Gaps = 21/341 (6%) Query: 37 FGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKI 96 F SP T KW+ + G L R R + D + R + +I Sbjct: 55 FRYSPKTISKWVSLYQNGGIDALMPRERSDKGATRVLPDTAIEEICRLKAAFPRLNSTQI 114 Query: 97 KRWLEDQGHTMPAFS--TVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFP 154 + L ++ + S V + +H L ++P + FE DA ++WQ D + P Sbjct: 115 HKHLVEEAFIPASVSVCAVQRFVKKHDLKSASNPNLRDRKAFEEDAFGKMWQAD-TCYLP 173 Query: 155 F-----GGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 + R + + ++DDHSRF + ++ Q+ L +G+P ++ +DNG Sbjct: 174 YITENGQRRRVYCILVIDDHSRFLVGGGLFYNDTAYNFQKVLKDAVAAHGIPSKLYVDNG 233 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ---GKWFAD 266 + L L +G + H++ ++ K+ER R+LK L Sbjct: 234 CSYVG-----AQLSLICGSIGTVLLHTKVRDGASKAKIERQFRTLKETWLYTLDMDSITS 288 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP---SARQYSGNTTPPEYDEGVMVRK 323 + + YN H + P +RYQ S R+ E + RK Sbjct: 289 LAQFNGLLKDYMRSYNTS-VHSGIG-TTPLARYQQTRSSIRRPKSREWLEECFLNRITRK 346 Query: 324 VDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWY 364 V+ +S+ V+ F +V ++ + +D S Y Sbjct: 347 VNKDSTVSIDRVAYDVPMQFISSKVEIRFLPDDMSSAFILY 387 >UniRef50_C6PFD7 Integrase catalytic region n=3 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFD7_CLOTS Length = 412 Score = 201 bits (510), Expect = 5e-50, Method: Composition-based stats. Identities = 70/369 (18%), Positives = 141/369 (38%), Gaps = 21/369 (5%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 + T E++ + ++ SL +R SP T WL + + G GL + R Sbjct: 22 NENYTQETAKEYMEVITSKVYDVPSLGKR-EFSPNTIKTWLYCYRKYGFEGLYPKSRCDK 80 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGLLP 124 + +DD+ A ++ + R A+ I + L + + STV + R + Sbjct: 81 GASRVLTDDVKAYIKNLKLDNPRRSAKSIYQELLVKKFIELDKVSLSTVQRYL-RKTKIS 139 Query: 125 GASPGIPATGRFEHDAPNRLWQMDF-KGHFPFGGG---RCHPLTLLDDHSRFSLCLAHCT 180 ++ FE + PN WQ D G + + + + LDD SR Sbjct: 140 TSALNTKDRRSFEMEYPNDCWQSDISMGPYLIINDKKIKTYLIAFLDDSSRLITHAEFYD 199 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 + ++ + G+P ++ +DNG + L L LG + ++ PY Sbjct: 200 TDNVISLIDAFKKAVSKRGVPKKLFVDNGKVFQSEQ-----LHLICASLGTSLCYAEPYS 254 Query: 241 PQTQGKLERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P+++GK+ERF R+LK + + G + + EL + + + H + +M P Sbjct: 255 PESKGKIERFFRTLKDQWMYGFDWQKISSIDELNENLNKYIEGIYHQTVHSSTNMK-PIE 313 Query: 298 RYQPSARQYSGNTTPPEYDEGV---MVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 ++ + + D + R+V +S++ + + G+ V ++ Sbjct: 314 KFIKYTDTMKFINSKEDLDNIFLYRVKRRVIKDATVSIEKIKFEVPMQYIGDYVNIRYYP 373 Query: 355 EDGSYEVWW 363 + + Sbjct: 374 KSLDKAYIF 382 >UniRef50_A7B7Y8 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A7B7Y8_RUMGN Length = 417 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. Identities = 70/335 (20%), Positives = 128/335 (38%), Gaps = 22/335 (6%) Query: 35 RRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGAR 94 + +PAT KW + G GL + R + +++ +R + R A Sbjct: 52 KLHHYAPATIEKWYLDYQNHGFEGLVPKGRSDAGMSRKLDEELQERIRYFKTNYPRMSAA 111 Query: 95 KIKRWLEDQGHTMP---AFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMD-FK 150 I R L+ G + + STV + R +P R+E N +W D Sbjct: 112 AIYRQLKSDGSVINGQVSESTVSRFVKRLQSELRQTPN-KDMRRYERPHINEVWCGDSSV 170 Query: 151 GHFPFG----GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 G R + + L+DD SRF + ++ + + S +YG P Sbjct: 171 GPRLTDSDGKKHRIYIIALIDDASRFITGIDVFYNDNFINLMSVMRSAIAKYGRPKVFNF 230 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV---LQGKW 263 DNG + + +EL R+G + + +PY P + K+ER+ R++K + L + Sbjct: 231 DNGKSYKN-----KQMELLAARIGTTLSYCQPYTPTGKAKIERWFRTMKDQWMAALDMRD 285 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV-- 321 F EL+ + + YN PH +L P R+ Q + + + ++ Sbjct: 286 FHSLEELRGSLHAFVQRYNQS-PHSSLHGLSPQDRFFSEPEQIR-RLSEEDITQNFLLEI 343 Query: 322 -RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 R+V + + + F +R+ L+ + Sbjct: 344 ERRVSADSVIVIDQIEYEVDYRFARQRIRLRYSPD 378 >UniRef50_A0JV34 Integrase, catalytic region n=12 Tax=Actinomycetales RepID=A0JV34_ARTS2 Length = 325 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 84/331 (25%), Positives = 119/331 (35%), Gaps = 32/331 (9%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 M + +A T R + + G +R RF SPAT KW+ R+ G G+ Sbjct: 1 MSYVTHANADLTPKARGKLARLVIEQGWTLRRAAERFQCSPATAKKWVDRYRARGEDGMA 60 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHN--LMA 118 D P SPNR+ + RWG +I L T+ T + +A Sbjct: 61 DLSSRPRRSPNRTDVRTERRILALR-FTRRWGPHRIAAHLHLARSTVGKVLTRYRMPRLA 119 Query: 119 RHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGG-------------------- 158 G PA R+EHD P L +D K G Sbjct: 120 CLDQGTGLPIRKPAPQRYEHDHPGDLVHVDIKKLGRIPDGGGHRALGRAAGRKNRRAGTG 179 Query: 159 RCHPLTLLDDHSR--FSLCLAHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDT 215 + +DDHSR +S L E + S F +G+ R + DNG+ + Sbjct: 180 YAYLHHAVDDHSRLAYSEILTDEKKETATAFWFRAASFFAAHGITVRAVLTDNGACYRSR 239 Query: 216 TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFD 275 T + I+ +RPY PQT GK+ERF+R+L E + + E + Sbjct: 240 AFTA------ALGPNIKHRRTRPYRPQTNGKVERFNRTLNTEWAYARPYTSEAERAATYP 293 Query: 276 HWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 W YN R H + P SR Y Sbjct: 294 GWLHQYNHHRTHTGIGGKTPISRVHNLRGNY 324 >UniRef50_B4RV10 Integrase, catalytic region n=16 Tax=Proteobacteria RepID=B4RV10_ALTMD Length = 267 Score = 197 bits (500), Expect = 7e-49, Method: Composition-based stats. Identities = 65/289 (22%), Positives = 102/289 (35%), Gaps = 30/289 (10%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + E G +I C GIS +T Y P ++ Sbjct: 4 EKRECASILVDAGLSIVKACLFVGISRSTFY-------------------RPERDWRKAD 44 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG 134 + + D+ R G K + +G V+ + + GL Sbjct: 45 AAVIDAINAVLDKSPRAGFWKCFGRMRFKGFPF-NHKRVYRVYCQMGLNLRRRTKRVLPK 103 Query: 135 RFEH-----DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 R + N W +DF + G R L ++D+ +R L + T V + Sbjct: 104 RIAQPLEVLEQANYQWALDFMHDTLYCGKRFRTLNVVDEGTRECLAIEVDTSLPAGRVVR 163 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L + GLP ++ MDNG T L W I + + +P PQ G +ER Sbjct: 164 VLEQLKTERGLPKQLRMDNGPELISAT-----LTDWCQNHNIELLYIQPGKPQQNGFVER 218 Query: 250 FHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 F+ S + E L F + G+++ WR YN ER HE+L P + Sbjct: 219 FNGSFRREFLDAYLFENIGQVREMSWFWRLDYNEERTHESLGNLPPAAY 267 >UniRef50_C7RJ38 Integrase catalytic region n=5 Tax=Proteobacteria RepID=C7RJ38_9PROT Length = 441 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 91/373 (24%), Positives = 147/373 (39%), Gaps = 33/373 (8%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYK---------WLQRWAQEGAAGLQDR 62 + R Q ++R L R + P T + W R+ G GL + Sbjct: 17 PLISRQRLARGELQK--SLRELATREYVIPGTDRRLLGEKTIEGWYYRYRARGLDGLIPK 74 Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMP---AFSTVHNLMAR 119 R ++ S + A + A + R R+I+R LE G + S++H L+ + Sbjct: 75 VRADRGQ-SKLSASVQAAILAAKRENPRRSIRQIQRVLEIGGIVARGTLSRSSLHRLLQQ 133 Query: 120 HG--LLPGASPGIPATGRFEHDAPNRLWQMDFKG--HFPFGG--GRCHPLTLLDDHSRFS 173 HG LPG++ F LW D P GG G+ + ++L DD SR Sbjct: 134 HGLSRLPGSASLPEEKRSFVAACAGELWYSDVMHGPRVPIGGRLGKSYLVSLFDDASRLV 193 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRV 233 A C E ++ L + G+P ++ +DNG+ + T L+ RLGI + Sbjct: 194 AHGAFCRGETALDIEGVLKQALLKRGVPVKLVVDNGAAYVAQT-----LQGICARLGIVL 248 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEAL 290 H RPY P+++GK+ER+HR+ + + L + +L W PH+ L Sbjct: 249 VHCRPYAPESKGKIERWHRTCRDQFLSEVEERHVLSLDDLNARLWAWLEQVYHRTPHDGL 308 Query: 291 DMAVPGSRYQ---PSARQYSGNTTPPEYDEGVMVRK-VDISGKLSVKGVSLSAGKAFRGE 346 + P +RYQ P R + VR+ V G +S +G G+ Sbjct: 309 EGQTPLARYQQDLPKIRLLGPLAATLDTLFLHRVRRLVRKDGTVSYQGGRFEVPFELTGK 368 Query: 347 RVGLKEMQEDGSY 359 V L+ + Sbjct: 369 TVCLRVDPHTETV 381 >UniRef50_C4KRZ4 Integrase core domain protein n=59 Tax=Proteobacteria RepID=C4KRZ4_BURPS Length = 318 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 65/300 (21%), Positives = 104/300 (34%), Gaps = 31/300 (10%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 + R + ++ + C GIS + + R Sbjct: 29 KVASPQAKREAVRILMTERTMGVTRACGLVGISRSLLHY---------------ESRR-- 71 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 + +T + + R+G R+I L+ G + L ++ GL Sbjct: 72 ---RVDDEALTGRMMAIAAQKRRYGYRRIHVLLQRDG-CFANHKRIWRLYSKAGLSVRKR 127 Query: 128 -----PGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDE 182 + T PN+ W MDF G R L ++DD++R L + T Sbjct: 128 RRKRIAAVERTPLPLPTGPNQSWSMDFVSDGLAYGRRFRCLNVVDDYTRECLAIEVDTSL 187 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQ 242 VQQ L + E GLP +T+DNG + L+ W G+ + RP P Sbjct: 188 PGLRVQQVLARLKEMRGLPASITVDNGPEFAG-----KVLDAWAYEAGVTLSFIRPGKPV 242 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 +E F+ + E L WF ++ + WR YN ERPH +L P + Sbjct: 243 ENAYIESFNGRFRDECLNEHWFVSMRHAKQLIEEWRIEYNTERPHSSLGYLTPAQFARAH 302 >UniRef50_A4J392 Integrase, catalytic region n=22 Tax=Clostridia RepID=A4J392_DESRM Length = 459 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. Identities = 67/334 (20%), Positives = 121/334 (36%), Gaps = 23/334 (6%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 SP T WL + + G L+ R + S +I +R + R + + Sbjct: 51 YSPKTLMCWLSDYRRGGLDSLKPGYRSDKGKSRKVSLEIADEIRKKRSQMPRITSALLYE 110 Query: 99 WLEDQGHTMP---AFSTVHNLMARHGLLP----GASPGIPATGRFEHDAPNRLWQMDFK- 150 L +P + +T + + + L +PG RF H N LWQ D Sbjct: 111 ELVKDKVILPEKLSRATFYRFLVANPELAAGKDPENPGEKELKRFSHQRINELWQTDIMF 170 Query: 151 GHFPFGG---GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMD 207 G + G + + + +DD SR + ++ L + G+P + D Sbjct: 171 GPYISIGKSKKQAYLIAFIDDASRLITHAQFFFFQNFVALRVALKEAVLKRGIPKMIYTD 230 Query: 208 NGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV---LQGKWF 264 NG + L + LG + H+ P+ P ++GK+ERF +++ L Sbjct: 231 NGKVYRSDQ-----LNMLCAGLGCSLIHTEPFTPTSKGKIERFFHTVRQRFLSRLDPTKL 285 Query: 265 ADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGV---MV 321 +L F W + H AL+M P + + P +E + Sbjct: 286 KSLDQLNLYFWQWLEEDYQCKTHSALNM-SPLDFFMAQVHNINFLANPQLLEEHFLLRVT 344 Query: 322 RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 RKV+ LSV+ + ++ R+ ++ + Sbjct: 345 RKVNHDATLSVESILYETEQSLANSRLEVRYDPD 378 >UniRef50_B8J8P0 Integrase catalytic region n=1 Tax=Anaeromyxobacter dehalogenans 2CP-1 RepID=B8J8P0_ANAD2 Length = 281 Score = 193 bits (491), Expect = 8e-48, Method: Composition-based stats. Identities = 68/291 (23%), Positives = 110/291 (37%), Gaps = 32/291 (10%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 +LR + ++ R CR G++P+T Y R P Sbjct: 4 PAALRPAVIELGAKFAMKKRRACRVVGLAPSTLYYC---------------SRRPER--- 45 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ A LR + RWG R++ L+ +GH + V L GL Sbjct: 46 ---AEVRARLRDLAAQRPRWGYRRLHVLLDREGHHL-NHKLVFRLYRSEGLAVRRKRRKR 101 Query: 132 ATGRFEHDAPN-----RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 T P + W MDF G + L L+D +R L + Sbjct: 102 ITSSLRVVPPPPTRPRQQWTMDFTQDSLASGRQFRTLNLIDAFTRECLLIEADHSLTGAR 161 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 V + L + E +G P+ + +DNG+ + T +A++ W +R+ P P G Sbjct: 162 VVRALERLRELHGTPEVIRIDNGTEF-----TSSAVDAWAYTNQVRLDFITPGKPTENGH 216 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 +E F+ + E L WF +++R + +R YN RPH +LD P Sbjct: 217 IESFNGKFRDECLNENWFISLDDVRRKVEAYRVDYNEVRPHSSLDNRTPNE 267 >UniRef50_C0QA21 Transposase /integrase family protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QA21_DESAH Length = 402 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 81/347 (23%), Positives = 133/347 (38%), Gaps = 33/347 (9%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 SP T KW R+ G L + PR + I L + H RW ++ Sbjct: 52 YSPDTLKKWFYRYRNGGLPALNNSPRKDIGTHGTIPQTIVDRLFKLREEHPRWTLSRMLD 111 Query: 99 WLEDQGHT---MPAFSTVHNL-----MARHGLLPGASPGIPATGRFEHDAPNRLWQMDFK 150 L + PA ST++ + R L P P F + +LW DF Sbjct: 112 QLVQENLWDKKSPARSTLYRFAQTANLKRDPHLAAHVPARP----FAYSFFGQLWMADFL 167 Query: 151 GHFPFG----GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 + + ++DD +R+ + T E E + +L++ +G P R Sbjct: 168 HGPKIREKGKKRKTYLHAIIDDATRYIVHAGFFTAESTEVMMAELMASVRTHGKPIRFYT 227 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK--WF 264 DNG+ + L+ LGI + H+ P P+ +GK+ERF RS++ + L GK Sbjct: 228 DNGACYASKH-----LKFVCANLGIHLIHTPPGKPRGRGKVERFFRSVRDQFLDGKKAPA 282 Query: 265 ADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVR-- 322 L +AF W Y +R H +L ++ R + Q + P + + R Sbjct: 283 KTLDGLNKAFREWVASY-HKRIHSSLGISPLQKRL---SHQSACKALPETVEIEPLFRMK 338 Query: 323 ---KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYST 366 KV ++ + +K A G+RV + M + VW+ Sbjct: 339 RRCKVYLNNTIRLKRRIYEVIDALPGQRVDVWFMPWNLD-MVWYGPE 384 >UniRef50_C1DTA7 Putative transposase n=2 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DTA7_SULAA Length = 327 Score = 191 bits (485), Expect = 4e-47, Method: Composition-based stats. Identities = 72/299 (24%), Positives = 125/299 (41%), Gaps = 26/299 (8%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH--HSPNR 72 R +++ QD NI CR FGIS T YKW +R+ ++G GL DRP+ P P Sbjct: 34 KRLKWI-QHYQDTKNISKTCRYFGISRTTFYKWFERYKKDGLEGLLDRPKTPKNTRKPTI 92 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS----- 127 + +++ ++ W KI +L+++ + + STV+ ++ GL+ Sbjct: 93 RNQYREQIIK-VRKQNPTWSKEKISAYLQEEKNIKVSPSTVYKVLKEEGLIERTKSIKIQ 151 Query: 128 -------PGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 + AP + Q+D K H G + T +D +SRF + + Sbjct: 152 NKRKKSIKKKRTKRGLQAQAPGDVVQIDVK-HLNIAGATYYQFTAIDKYSRFCFARVYES 210 Query: 181 DERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 ++T ++ + + E + R+ DNGS + +L +G+ S P Sbjct: 211 KNSKKT-KEFYIELNEYFEFEIKRVQTDNGSEFLG------EFNKYLTDIGVEHYFSYPR 263 Query: 240 HPQTQGKLERFHRSLKAE-VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P+T G +ER R+++ E L E+ + + YN RPH +L P Sbjct: 264 SPKTNGVVERLIRTIEEELWLIEGLDYTLEEMNKKLRKYVRKYNFIRPHHSLGYKRPAD 322 >UniRef50_B3PKR5 Transposase n=7 Tax=Gammaproteobacteria RepID=B3PKR5_CELJU Length = 280 Score = 190 bits (483), Expect = 6e-47, Method: Composition-based stats. Identities = 63/289 (21%), Positives = 100/289 (34%), Gaps = 30/289 (10%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + + G +I S C+ +S A+ Y P + Sbjct: 4 EKKTCAQALVEHGIDIASACKLADLSRASYY-------------------RPERDWRKCD 44 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG 134 + + R + G K + +G+ V+ + + GL Sbjct: 45 AAVIDAINNELKRSPQAGFWKCYGRIRHKGYPF-NHKRVYRVYCQMGLNLKRRVKRVLPR 103 Query: 135 RFEHD-----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 R N W +DF + G R L +LD+ +R L + T E V + Sbjct: 104 RIVQPLAVVAQANHQWALDFMHDSLYCGKRFRTLNVLDEGTRECLAIEVDTSLPAERVVR 163 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L + GLP ++ +DNG L W GIR+ + P PQ G +ER Sbjct: 164 ALEQIKVERGLPTQLRVDNGPELISAR-----LTDWCEENGIRLVYIEPGKPQQNGFVER 218 Query: 250 FHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 F+ S + E L F +++ WR YN ER HE+L P + Sbjct: 219 FNGSFRREFLNAYLFESLTQVREMAWFWRMDYNEERTHESLGHLPPAAY 267 >UniRef50_B4S6V0 Integrase catalytic region n=10 Tax=Bacteria RepID=B4S6V0_PROA2 Length = 282 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. Identities = 69/293 (23%), Positives = 107/293 (36%), Gaps = 32/293 (10%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 + R G ++R LC G++ ++ Y + +P Sbjct: 2 VTPEAKRNAVKHLHDTFGQSLRKLCILIGLNRSSWYY-------------EPQP------ 42 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG 129 S++ I LR D +RWG R++ L +G + T L L+ Sbjct: 43 --DSNEPIRKRLRELADERKRWGYRRLHYLLRREGFQINHKRT-ERLYREENLMLRVRRR 99 Query: 130 IP-----ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 + N W MDF + G R L +LD +SR L T Sbjct: 100 RKMASESRVAPPPPERKNHCWAMDFMSDNLYNGRRFRVLNVLDSYSRDYLGFEVDTSING 159 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 + V L + GLP+ +T+DNG + AL+ W R G+++ +RP P Sbjct: 160 KRVCSVLERIAWFKGLPELITVDNGPEFIG-----KALDAWAHRHGVKLVFNRPGKPVDN 214 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 +E F+ L+ E L WF G + W+ YN RPH +L P Sbjct: 215 TYIESFNGRLRDECLNVNWFMSLGHAREVIAEWQEDYNSVRPHSSLGTRTPEE 267 >UniRef50_C1F9W4 ISAca1, transposase n=2 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9W4_ACIC5 Length = 310 Score = 189 bits (480), Expect = 2e-46, Method: Composition-based stats. Identities = 82/315 (26%), Positives = 119/315 (37%), Gaps = 31/315 (9%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T R + G ++ F +S T KW++R+ EG+ GL+DR PH Sbjct: 6 NARLTPYSREQLARKVICTGCTLKLAAASFNVSAKTAGKWVRRYRAEGSDGLRDRSSRPH 65 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 SP R + + L + R +I R ++ L L P Sbjct: 66 RSPRRLPEAL--RLSVIELRRGYMPGYQIARRSAVSVSSVSRILRRARLSRWRDLNPP-- 121 Query: 128 PGIPATGRFEHDAPNRLWQMDFKGHFPF----------------GGGRCHPLTLLDDHSR 171 P R+EH AP L +D KG F G +DDHSR Sbjct: 122 ---PPVVRYEHAAPGDLLHLDIKGMTRFGEVSLRGDGRLRGKKEHPGFLALHVAVDDHSR 178 Query: 172 FSLCLAHCTDERRETV--QQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMR 228 + T+ V F +G+ R + DNGS + + Sbjct: 179 MVFAQMLADQKAETTIGFLHAAVEFFASHGIGIRALLTDNGSSYRSRQ-----FRQACQQ 233 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 + I+ +RPY P+T GK ERF ++ E K + DS + + W YN ERPH Sbjct: 234 MAIKHSRTRPYTPRTNGKAERFIQTAMREWAYAKHWTDSSQRDQHLQSWIHYYNHERPHG 293 Query: 289 ALDMAVPGSRYQPSA 303 +L+ P SR Q Sbjct: 294 SLNYKPPISRSQEGT 308 >UniRef50_C2GDW5 IS3514a transposase n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GDW5_9CORY Length = 322 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 69/260 (26%), Positives = 103/260 (39%), Gaps = 10/260 (3%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + V ++ + RF S Y L+R+ + G ++ R P P + Sbjct: 5 NANLAIVKAITEQHLTVSEASVRFKRSRQWIYTLLRRYEEGGPEAVKPRSTAPKTHPTKV 64 Query: 74 SDDITALL----RMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG 129 S+++ + R + G I LE +G PA ST+ ++ ++G++ Sbjct: 65 SEEVIKQIIKIRRELASKGADNGPETIAWVLEQRGFHAPAESTIRRILTKNGMVTPQPKK 124 Query: 130 IPAT--GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 P RFE PN WQ D G L LDDHSRF L L TV Sbjct: 125 RPKAYLRRFEATLPNECWQADVTSTRLLNGQVVEILDFLDDHSRFLLYLGAYKRVAGPTV 184 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDT----TGTWTALELWLMRLGIRVGHSRPYHPQT 243 ++ ++YG P DNG + G E +L + I + R HPQT Sbjct: 185 VTAAETITKKYGFPQSTLTDNGLVFTARLAGAKGGKNGFEKFLEKHSILQKNGRAGHPQT 244 Query: 244 QGKLERFHRSLKAEVLQGKW 263 QGK+ERFH++LK Sbjct: 245 QGKIERFHQTLKNGSAHDHP 264 >UniRef50_A6V7Q6 Transposase n=92 Tax=Bacteria RepID=A6V7Q6_PSEA7 Length = 481 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. Identities = 90/344 (26%), Positives = 135/344 (39%), Gaps = 51/344 (14%) Query: 2 ESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 + + +A T R +DG + F +SP T KW R+ +EG G+QD Sbjct: 151 QLMTHPNALLTPRARLRLARLIVEDGYPATIAAKMFMVSPITARKWAGRYREEGEFGMQD 210 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 R PH P R+ + + + R R G +I L + STVH ++ R Sbjct: 211 RSSKPHRIPGRTPEHVKKKIINLRWRL-RLGPAQIAARLG------LSTSTVHAVLVRCR 263 Query: 122 LLPGASPGI---PATGRFEHDAPNRLWQMDFK--GHFPFGGGRCHP-------------- 162 + + R+EH P L +D G+ P GGG + Sbjct: 264 VNRLSHIDRVTGEPLRRYEHPHPGSLIHVDVTKFGNIPDGGGHRYVGRQQGARNKLATPG 323 Query: 163 ----------------LTLLDDHSR--FSLCLAHCTDERRETVQQQLVSVFERYGLP-DR 203 T++DDHSR ++ + V ++ V+ F G+ +R Sbjct: 324 LPRGKDHKPRTGTAFVHTVIDDHSRVAYAEIWSDEQASTAVGVLERAVAWFAERGVTVER 383 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKW 263 + DNGS + A + RLGIR +RPY PQT GK+ERFHR+L ++ Sbjct: 384 VLSDNGSAYRS-----HAWRDFCARLGIRHKRTRPYRPQTNGKIERFHRTLGDGWAYARF 438 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 + E + A W YN R H A+ VP R +S Sbjct: 439 YGSEAERRLALPGWLHFYNHHRHHSAIGG-VPFDRLNNVPGHHS 481 >UniRef50_B4RA95 Transposase, IS1477 n=35 Tax=Proteobacteria RepID=B4RA95_PHEZH Length = 361 Score = 187 bits (474), Expect = 7e-46, Method: Composition-based stats. Identities = 72/316 (22%), Positives = 113/316 (35%), Gaps = 38/316 (12%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 + R + ++ G + R C + P T Sbjct: 64 PAARREAVLRLMAERGFSQRRACGLVQVDPKTV----------------------RRVAQ 101 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ A LR R+G R++ LE +G +M + Sbjct: 102 PGDAEVRARLRGLAAERRRFGYRRLGILLEREGVSMNKKKLFRLYREEGLAVRRRRGRKR 161 Query: 132 ATGRFE----HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 ATG D PN+ W +DF G R L ++DD +R +L L T + Sbjct: 162 ATGTRAPMALPDGPNQRWSLDFVADTLSWGRRFRILCIVDDFTREALALVVDTSIGGHRM 221 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 ++L ++ R G P + DNG+ T A+ W R G+ + P PQ G + Sbjct: 222 ARELDALIARRGRPATIVSDNGTE-----MTSRAMLEWTNRTGVDWHYIAPGKPQQNGFV 276 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP-GSRYQPSARQY 306 E F+ L+ E L + FA+ E + + WR YN RPH A P R P+A + Sbjct: 277 ESFNGKLRDECLNEEVFANLAEARAVIERWRLDYNHVRPHSAHGGLTPEAVRLNPAAGRL 336 Query: 307 ------SGNTTPPEYD 316 + PP + Sbjct: 337 RNLISSTARPLPPALE 352 >UniRef50_C8NDJ4 ISHne2 transposase (Fragment) n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8NDJ4_9GAMM Length = 240 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. Identities = 74/214 (34%), Positives = 98/214 (45%), Gaps = 6/214 (2%) Query: 2 ESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 E MPW + M R F+ + +LCR+FGIS TG KW+ R+ Q G A L D Sbjct: 27 EIPMPWRQTNVMQQREMFINAWLSQKYSKIALCRQFGISRVTGDKWIVRFKQGGMAALAD 86 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMAR 119 P P + + LL A H WGA+K+ L + PA ST ++ R Sbjct: 87 HSSRPAGCPRATDARLCELLCAAKREHPSWGAKKLLALLRRRAPHEAWPADSTGDLILKR 146 Query: 120 HGLLPGASPGI----PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 GL+ P A+ DAPN+ W +DFKG F GGRC PLT+ D++SR LC Sbjct: 147 AGLVKARKPRRGISADASPFTAADAPNQSWSVDFKGDFAMRGGRCFPLTVSDNYSRKLLC 206 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 V Q +F G+P + DNG Sbjct: 207 CHGLASTAYAGVWPQFERLFAENGMPWSILSDNG 240 >UniRef50_UPI0001C30E87 Integrase catalytic region n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30E87 Length = 318 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 88/319 (27%), Positives = 124/319 (38%), Gaps = 38/319 (11%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 +A R VL + G +I + G+S T KW+ R+ EG+ GL DR Sbjct: 4 HANAPLGPKGRERMVLRVVEQGWSIAEAAQAAGVSDRTCSKWIGRYRAEGSMGLVDRAST 63 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P SP R+ +D L+ R A +I L A STV ++ R L Sbjct: 64 PKRSPTRTPEDRVQLIAALRRL--RMTAAEIALCLGM------ALSTVSAVLRRINLGKR 115 Query: 126 ASPGIPATG-RFEHDAPNRLWQMDFKGHFPFGGG-------------------RCHPLTL 165 + P R+E P L +D K GG + Sbjct: 116 SRLDPPEPPNRYERARPGELLHIDVKKLGRIHGGAGHRVTGRKSGMHRARGAGWDYVHVC 175 Query: 166 LDDHSRFSLCLAHCTDERRETVQQQLVSVFE---RYGLP-DRMTMDNGSPWGDTTGTWTA 221 +DD +R + + DER TV L R+G+ +R+ DNGS + T Sbjct: 176 VDDATRLAY-VEVLPDERGTTVAGFLRRAIRHYRRHGITVERVMTDNGSGYRSTLHAIA- 233 Query: 222 LELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVY 281 G+R +RPY P+T GK ERF R++ G +A S E A D W Y Sbjct: 234 ----CRLQGVRHLRTRPYRPRTNGKAERFIRTMIEGWAYGAIYASSAERTAALDGWLFTY 289 Query: 282 NLERPHEALDMAVPGSRYQ 300 N RPH +L P +R + Sbjct: 290 NHRRPHGSLSHKPPAARLR 308 >UniRef50_Q8NL32 Predicted transposase n=7 Tax=Corynebacterium RepID=Q8NL32_CORGL Length = 500 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 76/313 (24%), Positives = 121/313 (38%), Gaps = 19/313 (6%) Query: 4 LMPWDARDTMSLRTEFVLFA--SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 +MP R + + F + + +I C R IS + Y R+ Q+ A L Sbjct: 19 IMP--KPLPPETRRKIIDFDPFAPNSPSIEEFCSRLKISRRSFYNIRNRYQQDANAALHP 76 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHER----WGARKIKRWLEDQGH---TMPAFSTVH 114 P + + IT+ L R + +G I+ G +P+ ST+ Sbjct: 77 HSSAPITARRTYDESITSTLLSIRARLKAQGWEYGPISIRFEGISTGELTAPIPSVSTIA 136 Query: 115 NLMARHGLLPGASPGIPATG--RFEHDAPNRLWQMDFKGHFPFGG--GRCHPLTLLDDHS 170 L+ G + P + RF+ +WQ+D + R +LDD + Sbjct: 137 RLLRAAGAVESNPKKRPKSSVVRFQRGQAMEMWQIDGFIYTLHDTDLTRVTIYQILDDAT 196 Query: 171 RFSLC-LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMR 228 RF + +E + L +G P + DNGS + G +LE +L Sbjct: 197 RFDVGTCVFPANENSVDARTALEQAIAHFGAPHELLSDNGSAFNRMRQGYVGSLESYLAT 256 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 +G +P HPQTQGK ER HR+L LQ E + +R YN RPH+ Sbjct: 257 VGCLSITGKPGHPQTQGKNERSHRTLFR-FLQAHQPHTLEECAHYIEQFRDHYNNRRPHQ 315 Query: 289 AL-DMAVPGSRYQ 300 L + P + ++ Sbjct: 316 GLPNNLTPAAAWE 328 >UniRef50_P24577 Insertion element IS407 uncharacterized 31.7 kDa protein n=164 Tax=Proteobacteria RepID=YI71_BURM1 Length = 277 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 60/292 (20%), Positives = 102/292 (34%), Gaps = 34/292 (11%) Query: 31 RSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHER 90 R CR G+S + + + + P+ ++ + A L R Sbjct: 13 RRACRLVGLSRSVLH-----YDAK---------------PDHENEVLAARLVELAHERRR 52 Query: 91 WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI-----PATGRFEHDAPNRLW 145 +G R++ +E +G T ++ L GL APN +W Sbjct: 53 FGYRRLHALVEREG-THANHKRIYRLYREAGLAVRRRRKRQGVMIEREQLALPGAPNEVW 111 Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G R LT++DD ++ ++ + V + L G P + Sbjct: 112 SIDFVMDALSNGRRVKCLTVVDDFTKEAVDIVVDHGISGLYVARALDRAARFRGYPKAVR 171 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFA 265 D G + T AL+ W G+ + + P +E F+ + E L WF Sbjct: 172 TDQGPEF-----TSRALDQWAYANGVTLKLIQAGKPTQNAYIESFNGKFRDECLNEHWFT 226 Query: 266 DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + WR YN +RPH AL+ P +A+ + P + E Sbjct: 227 TLAHARAVIAAWRQDYNEQRPHSALNYLAPSE---FAAKHRATADAPAAFQE 275 >UniRef50_Q64B23 Transposase n=1 Tax=uncultured archaeon GZfos27G5 RepID=Q64B23_9ARCH Length = 414 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. Identities = 74/389 (19%), Positives = 149/389 (38%), Gaps = 30/389 (7%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 R + + + + + R S KWL R+ +D P+ P+++ Sbjct: 8 EERIKAIQRCLEGEREV-EIYRSLERSKGWFSKWLGRYKTGRKGWYKDLPKRARVIPHKT 66 Query: 74 SDDITALLRMAH----------DRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARH 120 S+ I ++ ++ GA I+ +E+ G+ +P+ ST+ ++ R+ Sbjct: 67 SERIEQIVVNIRKALMDGTEDSTKYSCVGAEAIQFHMEELGYKPSEIPSISTIKRIIKRN 126 Query: 121 ---GLLPGASPGIPATGRFE---HDAPNRLWQMDFKGHFPFGG-GRCHPLTLLDDHSRFS 173 P + + GR+ + L Q+D+ G G G + + L D R Sbjct: 127 KLRANKPERYKRVRSKGRYTILNPKHIDELHQLDYVGPRHIKGYGPINSIHLKDVAGR-Q 185 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLM---RLG 230 + ++ + V L+ +++ +P + +DNG + + ++ +G Sbjct: 186 VAGQQYNEKSMDNVMDFLMGYWKQCPIPKYLQVDNGMCFAGDYKHPKSFSRFVRLALYVG 245 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYN----LERP 286 I V P P G +E F++ Q + F D ++++ + N + Sbjct: 246 IEVVFIAPSRPWMNGTIEEFNKGFDKRFWQKELFTDLNDIRKKSVIFFEKENKFNAWKLR 305 Query: 287 HEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGE 346 +E L + P R P + N P E +R+VD GK+SV G+ + GE Sbjct: 306 NEKLKVVDP-KRMLPGDFTITVNRLPLVTGEIHFIRRVDSRGKISVLNEYFDVGREYTGE 364 Query: 347 RVGLKEMQEDGSYEVWWYSTKVGVIDLKK 375 V ++ V++ + V ++KK Sbjct: 365 YVWATIETMKQTHIVYYKDENLVVREIKK 393 >UniRef50_C5CAM0 Transposase n=26 Tax=Actinomycetales RepID=C5CAM0_MICLC Length = 335 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. Identities = 81/328 (24%), Positives = 126/328 (38%), Gaps = 41/328 (12%) Query: 4 LMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRP 63 + +A T + R V DG + F +S T W+ R+ EG AGLQD Sbjct: 1 MTHANAPLTPTGRLRMVHRHLHDGIPQAHVAAEFRVSRPTVATWVARYRAEGEAGLQDLS 60 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL- 122 PH SP + ++ A ++ + W AR+I L +GH + TV + R G+ Sbjct: 61 SRPHRSPAQLDPEVVAQIQTLRRERK-WSARRIHHHLVSEGHRV-CLRTVGRWLHRLGIS 118 Query: 123 -----LPGASPGIPATGRFEHDAPNRLWQMDFK--GHFPFGGGR---------------- 159 P + P + +D K G P GGG Sbjct: 119 RLPDLAPTGEDLRQRPQKITARGPGHMVHLDVKKIGRIPEGGGWRAHGRDSENARAAKRG 178 Query: 160 -------CHPLTLLDDHSRFSLCLAHCTDERRETVQQ--QLVSVFERYGLP-DRMTMDNG 209 + + +D +R + A + TV + + F +G+ DR+ DNG Sbjct: 179 PGRRVGYTYLHSAIDGFTRLAYTEALEDERAATTVSFYCRARAFFAAHGIRIDRVVTDNG 238 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 + + T + LG R RPY P+ GK+ER++R + EVL + ++ Sbjct: 239 NNYRAADFTAKVV-----SLGGRHHRIRPYTPRHNGKVERYNRLMVDEVLYARPYSSETA 293 Query: 270 LQRAFDHWRTVYNLERPHEALDMAVPGS 297 + A W YN RPH + A P S Sbjct: 294 RREALQVWVNHYNYHRPHTSCGDAPPAS 321 >UniRef50_C8PWM8 Transposase B n=2 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PWM8_9GAMM Length = 271 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. Identities = 76/301 (25%), Positives = 115/301 (38%), Gaps = 37/301 (12%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MP R ++ + + +Q +I C+ IS Y L D Sbjct: 1 MPVVERKALAQQLQ-----AQHNISIVVSCQIVCISRTAYYY---------EPKLND--- 43 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMA--RHGL 122 D I L D+H RWG K + L G+ V+ + + L Sbjct: 44 ---------DDAIVDKLTELTDKHTRWGFPKCYKRLRKLGYVW-NHKRVYRVYTAMKLNL 93 Query: 123 LPGASPGIPATGRFEHDAPNRL---WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 A +P PN L W MDF R ++DD++R L + Sbjct: 94 RRKAKRRLPTRAPEPLTVPNSLDHTWSMDFMSDKLHNNSRFRTFNVIDDYNRELLGIDIG 153 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 T V + L + E +G P+++ +DNGS + T + W I V + +P Sbjct: 154 TSIPSLRVIRYLDQLAECHGYPNKIRIDNGSEF-----TSSVFTDWAASHSILVDYIKPG 208 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 P +ERF+RS + EVL F + E+++ D W VYN ERPH++L P Sbjct: 209 CPYQNAYIERFNRSYRNEVLDCYLFNNLNEVRQLTDEWINVYNHERPHDSLGNMTPAEFK 268 Query: 300 Q 300 Q Sbjct: 269 Q 269 >UniRef50_P25438 Insertion element IS476 uncharacterized 39.2 kDa protein n=21 Tax=Proteobacteria RepID=YI61_XANEU Length = 346 Score = 183 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 62/310 (20%), Positives = 94/310 (30%), Gaps = 32/310 (10%) Query: 10 RDTMSLRTEFVLFASQDG-ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHH 68 + E + + + R CR G+S P Sbjct: 57 TLAPQRKREAIRRMLEHTPLSERRACRLAGLSRDAFR------------------HAP-- 96 Query: 69 SPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNL-----MARHGLL 123 P ++ ++A L H R+G R++ L + ++ L + Sbjct: 97 VPTPATQALSARLVELAQTHRRFGYRRLHDLLRPE-FPSVNHKKIYRLYEEAELKVRKRR 155 Query: 124 PGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 P PN W MDF R LT++DD +R S+ +A Sbjct: 156 KAKRPVGERQKLLASSMPNDTWSMDFVFDALANARRIKCLTVVDDFTRESVDIAVDHGIS 215 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 V + L G P + DNG + T A W + GI P P Sbjct: 216 GAYVVRLLDQAACFRGYPRAVRTDNGPEF-----TSRAFIAWTQQHGIEHILIEPGAPTQ 270 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E F+ + E L WF + + WR YN RPH + P Sbjct: 271 NAYIESFNGKFRDECLNEHWFTSLAQARDVIADWRRHYNQIRPHSSCGRIPPAQFAANYR 330 Query: 304 RQYSGNTTPP 313 Q + N P Sbjct: 331 TQQANNAVPF 340 >UniRef50_O28862 ISA0963-5, putative transposase n=5 Tax=Archaeoglobus fulgidus RepID=O28862_ARCFU Length = 357 Score = 183 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 65/320 (20%), Positives = 126/320 (39%), Gaps = 25/320 (7%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P R + + +++ GA ++ + ++P Y+ +++ + G + Sbjct: 56 PLHVRKLTNKKIRWIIRQLDKGAPVKEIAAVMRVTPRRIYQLKKQYEETGQ---IPELKQ 112 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P P ++ ++ A+ ++ R +++ +E + +T++ ++ +HGL+ Sbjct: 113 PGRKPKEIDEETEKIILQAYKKY-RLSPVPLEKLIERDYGIHISHNTIYKVLLKHGLVEE 171 Query: 126 ASPGIPATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 R+E LWQ D+K G + +DD SRF C Sbjct: 172 NMSKKKRRKWVRYERTHSMSLWQGDWKRL-----GEKWIIAFMDDASRFITCYGVFDSAT 226 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA---LELWLMRLGIRVGHSRPYH 240 E + L F YG+PD + D+G+ + A +L G+R +R H Sbjct: 227 TENTIRVLKVGFREYGIPDEILTDHGTQFVAAKSREKAKHRFREFLAENGVRHVLARINH 286 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQT GK+ERF ++ ++ L + D + YN +PH +L+ + YQ Sbjct: 287 PQTNGKIERFFGLMEQKI----------HLFDSLDEFIYWYNYVKPHMSLNFDELETPYQ 336 Query: 301 PSARQYSGNTTPPEYDEGVM 320 R+ EY + Sbjct: 337 AFLRKLPAERV-FEYGRWLF 355 >UniRef50_Q1GCB4 Integrase catalytic region n=29 Tax=Alphaproteobacteria RepID=Q1GCB4_SILST Length = 265 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 68/275 (24%), Positives = 99/275 (36%), Gaps = 31/275 (11%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 +IR C P T R+ + R P + +R Sbjct: 7 VSIRRACEVIRFDPRTY-----RY----------KSRRPGQ------AALEQRIREICQT 45 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP-----ATGRFEHDAPN 142 R+G R++ L +G + A T L +P R E +PN Sbjct: 46 RVRFGYRRVHVLLRREGWEINAKKTYRIYKELGMQLRSKTPKRRVKAKLRDDRKEAVSPN 105 Query: 143 RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPD 202 +W MDF G + LT++D SR+ L R E V L V +R G P Sbjct: 106 DVWAMDFVHDQLATGWKLRVLTVVDTFSRYVPVLDARFTYRGEDVVATLEQVCKRTGYPA 165 Query: 203 RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK 262 + +D GS + ++LW + + SRP P +E F+ +AE L Sbjct: 166 TIRVDQGSEFIS-----KDMDLWAYANDVTLDFSRPGKPTDNAFIEAFNGRFRAECLNAH 220 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 WF + + WR YN ERPH A+ VP Sbjct: 221 WFMSLEDAAEKLEAWRRDYNEERPHGAIGNKVPAD 255 >UniRef50_A3JW74 Transposase n=12 Tax=Proteobacteria RepID=A3JW74_9RHOB Length = 303 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 74/309 (23%), Positives = 123/309 (39%), Gaps = 22/309 (7%) Query: 2 ESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 E M + RD R VL ++ N R CR FGI ++ Y+W + + G +GL++ Sbjct: 3 EVSMTNEERDI--QRKLRVLQHAEKIGNARKACRYFGIGRSSFYRWRDAYQKHGESGLKN 60 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 IP + N++ +I + ++ G +I +L + + V+ ++ R+G Sbjct: 61 AKSIPKNPANQTPPEIVEKVLYLRRKYH-LGPIRIVWYLARYHGIKISDAGVYRILKRNG 119 Query: 122 LLP---GASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGG------RCHPLTLLDDHSRF 172 L G T R++ P Q+D K F G R T +DD +R Sbjct: 120 LNRLPRGTRMRKLHTKRYQKQVPGHHIQVDVK--FLTFKGKRGEKVRRFQFTAIDDATR- 176 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMRLGI 231 L + + + + E++ R + DNG + + LGI Sbjct: 177 VRALKIYEKHTQASAIDFIDHIIEKFPFRIREVRTDNGHEFQAK------FHWHVEDLGI 230 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 R + + PQ GK+ER HRS + Q + +L+ D W YN RPH A + Sbjct: 231 RHAYIKRGTPQLNGKVERSHRSDQQAFYQLLSYKGDVDLEAKLDEWERFYNFARPHGAHN 290 Query: 292 MAVPGSRYQ 300 P + Sbjct: 291 GQTPYEALR 299 >UniRef50_C4UEN4 Transposase n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UEN4_YERAL Length = 281 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 57/292 (19%), Positives = 105/292 (35%), Gaps = 30/292 (10%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 ++ + + G ++ + C+ +S A+ Y+ W ++ Sbjct: 2 VITEKKSCAGLLTASGLSVITACKLTSLSRASFYRRGTDWREK----------------- 44 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS---- 127 + ++ + G K L +G V+ + R GL Sbjct: 45 --DKVVIDAIQAVLSESPQAGFWKCYYRLRFKGFIF-NHKRVYRVYCRLGLNLKRRIKKT 101 Query: 128 -PGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 P + P+ W +DF + G R L ++D+ +R L + T + Sbjct: 102 LPKRENKPLSIVNLPDIQWALDFMHDALYCGKRFRTLNIIDEGTRECLAIEVDTSLPTDR 161 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 V + L + + GLP ++ +DNG L + I + H +P PQ G Sbjct: 162 VIRVLDRLKKERGLPQQLRVDNGPELISVN-----LLNYCEYNHITLCHIQPGKPQQNGF 216 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 +ERF+ S + E L F +++ W+ YNL R HE+L P + Sbjct: 217 IERFNGSFRREFLNAYLFESLSQVREMAWFWQQDYNLNRTHESLGHLPPETY 268 >UniRef50_Q1DAH7 Transposase orfB, IS3 family n=29 Tax=Proteobacteria RepID=Q1DAH7_MYXXD Length = 293 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 67/297 (22%), Positives = 100/297 (33%), Gaps = 31/297 (10%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R V +A G + R C ++ ++ G R Sbjct: 7 RRRQVRYAMGKGVSQRRACALLQVAGSSL-------------GYASRK-------EAKDA 46 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR 135 + A LR R+G R+ L +G + VH L + GL Sbjct: 47 ALVAQLRDIARARPRFGYRRAWALLRREGPAV-NVKRVHRLWRKEGLALSRRRPRKRLRL 105 Query: 136 FEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 + P N +W DF G + LT++D+HSR L + V + Sbjct: 106 GQQRQPKPEGVNSVWAWDFVHDRCANGQKLKCLTVVDEHSRECLAIDVAGRISARRVIEV 165 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 L + +G P + DNG + AL WL GI+ + P P G E F Sbjct: 166 LSRLVAVHGPPKYLRSDNGPEF-----IAKALRRWLEANGIQTAYIAPGKPWQNGTNESF 220 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 + + E L +WF+ E + WR YN +RPH +L P A Sbjct: 221 NGRFRDECLSAEWFSTRREAVVLIEAWRRDYNEKRPHSSLGYKTPAEVGARRAHAGP 277 >UniRef50_A1JLT7 Transposase for insertion element IS1222 n=8 Tax=Yersinia RepID=A1JLT7_YERE8 Length = 249 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. Identities = 64/278 (23%), Positives = 96/278 (34%), Gaps = 48/278 (17%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 +L G + R CR G+S +T R+ + A ++ Sbjct: 1 MLMCDATGLSQRRACRLTGLSLSTC-----RYEAQRPAA---------------DAHLSG 40 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD 139 + R+G R+I + L +G LP P P Sbjct: 41 RIIELALERRRFGYRRIWQLLRRKGLATE-------------RLPLLRPAAP-------- 79 Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 N W MDF G R LT +DD ++ L + V + L S+ G Sbjct: 80 --NLTWSMDFVMDALATGRRIKCLTCVDDFTKECLTVTVAFGISGVQVTRILDSIALFRG 137 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 P + D G + T AL+ W G+ + +P P G +E F+ + E L Sbjct: 138 YPATIRTDQGPEF-----TCRALDQWAFEHGVELRLIQPGKPTQNGFIESFNGRFRDECL 192 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 WF+D ++ WR YN RPH AL+ P Sbjct: 193 NEHWFSDVSHARKTISEWRQDYNECRPHSALNYQTPSE 230 >UniRef50_C1XPR1 Transcriptional regulator/sugar kinase n=14 Tax=Meiothermus silvanus DSM 9946 RepID=C1XPR1_9DEIN Length = 777 Score = 181 bits (458), Expect = 5e-44, Method: Composition-based stats. Identities = 87/392 (22%), Positives = 141/392 (35%), Gaps = 39/392 (9%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + V + + + + GIS AT ++W + ++G AGL+ R R P H + Sbjct: 35 RKLRLVKALRESKKSWKEIQDLVGISRATYHRWQKALKEKGLAGLKPRSRRPKHLRTKVH 94 Query: 75 DDITALLRM--AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMA-------------- 118 L+R+ + WG I L +G M + TV ++A Sbjct: 95 WTPGLLIRIETLRKENPTWGRWSIWLTLRKEGFQM-SERTVGRILAYLEKHRRIESVAGY 153 Query: 119 ----RHGLLPGA--SPGIPATGR-FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSR 171 + G L P R +E AP L Q+D G + +D HSR Sbjct: 154 LARTQRGKLKRRVNRPYAKRKPRGYEARAPGDLVQVDTLTLTLGPGSMVKHFSAIDLHSR 213 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMRLG 230 F L H + + + L + R P R + +D GS + E LG Sbjct: 214 FVLAEVHSRATAKLS-EGFLSLLLARAPFPIRAIQVDGGSEFM------AEFEEACCALG 266 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 I + P P+ G +ER R+ K E ELQ D + YN RPH AL Sbjct: 267 IALFVLPPRSPKLNGHVERMQRTFKEEFYTRPLPTPLSELQAELDTYLDYYNRRRPHMAL 326 Query: 291 DMAVPGSRY-----QPSARQYSGNTTPPEYDEGVMVR-KVDISGKLSVKGVSLSAGKAFR 344 P + ++ S T Y ++ GKL++ G + + + Sbjct: 327 GGLAPLEFLAKMQEESVPQRVSNVLTDYTYLTPRAGWSRLSSFGKLTIGG-LVPMSPSRK 385 Query: 345 GERVGLKEMQEDGSYEVWWYSTKVGVIDLKKK 376 G+ + +K + E S + +L ++ Sbjct: 386 GDPLEMKRINRRAILEALKGSRSLTRAELARR 417 >UniRef50_A3ZSC8 Transposase orfB n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZSC8_9PLAN Length = 281 Score = 181 bits (458), Expect = 5e-44, Method: Composition-based stats. Identities = 54/219 (24%), Positives = 88/219 (40%), Gaps = 14/219 (6%) Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------GRFEH 138 R+G R I R L +G + F V+ L R GL R + Sbjct: 40 EFPRYGYRMITRLLRQEGWQV-NFKRVYRLWRREGLKVPVKQAKKRRLGTVDGGITRRQA 98 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + PN +W +DF G L+L+D+ +R + L + + + L +F Sbjct: 99 ERPNHVWSIDFIFDRTENGRPLKILSLVDEFTRECIALEVNRKFTGDHLVELLADLFAIR 158 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 G+P+ + DNG + ++ +L ++ + + + P P G +ERFH L+ E Sbjct: 159 GVPEFIRSDNGPEFISRR-----VQKFLEKIDVGMSYIEPGSPWQNGYVERFHSRLRDEC 213 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 L + F E + WR YN RPH +L P Sbjct: 214 LACELFTTLAEARTVIAAWRQTYNHRRPHSSLGGQTPAD 252 >UniRef50_C6VW29 Integrase catalytic region n=2 Tax=Sphingobacteriales RepID=C6VW29_DYAFD Length = 273 Score = 180 bits (457), Expect = 6e-44, Method: Composition-based stats. Identities = 54/236 (22%), Positives = 88/236 (37%), Gaps = 11/236 (4%) Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA 126 ++S + ++ L+ +H +G RK+ +L G VH + L Sbjct: 28 YYSSKKDDSEVIESLQDLAFKHPSYGFRKLFAYLRRSGKPW-NHKRVHRIYQVLKLNKRR 86 Query: 127 SPGIPATGRF-----EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 R + N +W +DF G + ++DD SR +L + T Sbjct: 87 KGKRRLPDRVRQPLAQPAQVNEVWSVDFMSDSMVGNRKFRTFNVIDDCSREALAIEIDTS 146 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 + + + L + E G P + DNG + T +W GI +P P Sbjct: 147 LSAKRIIRTLNRIGESRGFPMAIRSDNGPEFTSGNFT-----IWCEEKGIEAKFIQPGKP 201 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 G +ERF+R + VL F D ++++ W YN RPHE L P Sbjct: 202 TQNGYIERFNRLYREAVLDAYLFFDLDQVRQLTAEWIEEYNQRRPHEGLGNLTPFE 257 >UniRef50_B4E5J2 Transposase n=21 Tax=Proteobacteria RepID=B4E5J2_BURCJ Length = 276 Score = 180 bits (457), Expect = 7e-44, Method: Composition-based stats. Identities = 70/298 (23%), Positives = 105/298 (35%), Gaps = 34/298 (11%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + GA+ R C +S T Y R+ + ++ Sbjct: 4 RFGASQRQTCALLQLSR-TVY----RYESVARDQSA----------------LEMRIKEI 42 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG-ASPGIPATGRFEHD---- 139 + +GA ++ L +G V + GL P + R Sbjct: 43 TEVRVHYGAPRVYVMLRREGW-RDNHKRVERVYRELGLSLRHKRPRRNKSARRRQPKQSV 101 Query: 140 -APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 A N +W MDF F G R LT++D+++R L + R E V L + + Sbjct: 102 SAINEIWSMDFVADALFDGRRLRTLTIVDNYTRECLAIEVDGSLRGEHVVAALTRLAQHR 161 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 LP + DNGS + T L+ W G+ + SRP P K E F+ + E Sbjct: 162 PLPRYIKADNGSEFISKT-----LDKWAYENGVEIDFSRPGKPTDNAKNESFNGRFREEC 216 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA-RQYSGNTTPPEY 315 L WF + +R + WR YN RPH AL P + R PE+ Sbjct: 217 LNAHWFLSLEDARRKIEVWREYYNEARPHSALQWMTPAEFARQCTDRADPARPEEPEF 274 >UniRef50_Q3BT31 Transposase n=22 Tax=Bacteria RepID=Q3BT31_XANC5 Length = 343 Score = 180 bits (457), Expect = 7e-44, Method: Composition-based stats. Identities = 71/324 (21%), Positives = 108/324 (33%), Gaps = 38/324 (11%) Query: 7 WDARDTMSLRTEFVLFASQD-GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 T + E V A + G + R C G+S R+ R Sbjct: 45 HKKMVTPGAKREAVAHAREHHGLSERRACNLVGVSRRVI-----RY----------RSSR 89 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P P + LR R+G R++ L +G T P + + L Sbjct: 90 PDDGP------LRQRLRELAAERRRFGYRRLGYLLAREGIT-PNHKKLLRVYREENLRVR 142 Query: 126 ASPGIP-----ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 G D PN+ W +DF R L ++DD++R L L T Sbjct: 143 RRGGRKRALGTRAPMVLPDGPNQRWSLDFVSDTLTCSRRFRILCVVDDYTRECLALVADT 202 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 V ++L + G P + DNG+ T +A+ W + + P Sbjct: 203 SLSGVRVARELTRLIGMRGKPHTVVSDNGTE-----LTSSAILRWSQERRVEWHYIAPGK 257 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS--- 297 P G +E F+ L+ E L F + D WR YN RPH L P Sbjct: 258 PMQNGFVESFNGRLRDECLNETLFTSLPHARFVLDAWRHDYNHVRPHSKLGGRTPAEKAG 317 Query: 298 --RYQPSARQYSGNTTPPEYDEGV 319 ++ + RQ + +T G+ Sbjct: 318 KPVWEHAPRQVAITSTNHHVGAGL 341 >UniRef50_A5GAT8 Integrase, catalytic region n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GAT8_GEOUR Length = 426 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 80/361 (22%), Positives = 130/361 (36%), Gaps = 25/361 (6%) Query: 9 ARDTMSLRTEFVLFASQDGANIRSLCR-RFGISPATGYKWLQRW--AQEGAAGLQDRPRI 65 AR R + I C + +S +T +W+ + + L + R Sbjct: 24 ARLDHGERERLLREKCARKWEIP--CSNQTRLSRSTILRWIGLYLKERGKLEALYPQGRN 81 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMP----AFSTVHNLMARHG 121 + L + ++ + + ++ + ST + + RH Sbjct: 82 DRGMSRVLDQETGLALAQLRRQQPELTVPELVKQMHERRLVTDGVGLSLSTAYRFLHRHD 141 Query: 122 LLPGASPGIPATGRFEHDAPNRLWQMDFK-GHFPFGG---GRCHPLTLLDDHSRFSLCLA 177 L+ G P +FE + PN LWQ D G G + + + +DDHSR Sbjct: 142 LM-GKQPAPVDRRKFEAELPNDLWQSDVMHGPMLLSGDKRRKTYLIAFIDDHSRLIPHGR 200 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 E R GLP ++ +DNGS + LE LGI + H+R Sbjct: 201 FYLSEGVACFMSAFSDAVLRRGLPRKLYVDNGSAFRSRQ-----LEYTAAALGIALVHAR 255 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 PY PQ +GK+ERF ++++ L E+ AF+ W Y +R H + P Sbjct: 256 PYQPQGKGKIERFFKNVRTSFLPSFKGETLEEINEAFELWLNDYYHQRSHGSTG-ETPFK 314 Query: 298 RYQPS---ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 R+ R N +Y + R V+ + V A G+RV L + Sbjct: 315 RFTSRMECLRPAPDNLK--DYFRKTVRRLVNKDRSVVVDRRLFEAPVELIGKRVELLYFE 372 Query: 355 E 355 E Sbjct: 373 E 373 >UniRef50_B1KCL6 Integrase catalytic region n=100 Tax=Proteobacteria RepID=B1KCL6_BURCC Length = 316 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 85/315 (26%), Positives = 125/315 (39%), Gaps = 33/315 (10%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T + R E V ++ G++++ G++ T KWL R+ GA L D P Sbjct: 6 NARLTFARRLEMVQEITEFGSSVQQAAADHGVTAPTVRKWLGRYLVGGAPALADASSRPA 65 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 SP + L+ + R+I R + STV ++AR GL + Sbjct: 66 RSPRSIAPATALLIVELRQQRLL--QRQITR------QAGVSASTVSRVLARAGLSRLSD 117 Query: 128 -PGIPATGRFEHDAPNRLWQMDFK--------GHFPFGGGR--------CHPLTLLDDHS 170 R+EH+AP L +D K GH G R + +DDH+ Sbjct: 118 LQPREPVQRYEHEAPGDLLHIDIKKLGRIARPGHRVTGNRRDTVDGVGWEYLFVAVDDHA 177 Query: 171 RFSLCLAHCTDERRETVQQQ--LVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLM 227 R + H + +R VQ V+ + +G+ R+ DNGS + Sbjct: 178 RVAFTAMHPDETKRSAVQFLRDAVAWYAGFGVRVRRLLTDNGSAFRS-----HEFARACQ 232 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPH 287 LGIR +R Y PQT GK ERF +S E + S A W+ YN R H Sbjct: 233 ELGIRHKFTRAYRPQTNGKAERFIQSALREWAYAWTYQSSAHRIEALASWQHHYNWHRAH 292 Query: 288 EALDMAVPGSRYQPS 302 A+ P +R S Sbjct: 293 SAIGGIAPMARLPAS 307 >UniRef50_A4JLW8 Integrase, catalytic region n=9 Tax=Proteobacteria RepID=A4JLW8_BURVG Length = 277 Score = 179 bits (454), Expect = 2e-43, Method: Composition-based stats. Identities = 58/246 (23%), Positives = 95/246 (38%), Gaps = 12/246 (4%) Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 R + S + + R+G RKI+ L +G+ + + + ++ L G Sbjct: 23 RSVYQYRSHRDPETALRQRMCEIAATRVRYGYRKIRVLLLREGYQV-SKNRLYRLYREEG 81 Query: 122 LLPGASPGIPATGRFEHDAP------NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 L P + A N+ W +DF G R LT++D +R +L Sbjct: 82 LSLRYRPNRKRRAQMSRPARAKSTAANQAWSLDFVADQLSNGQRFRALTIIDVFTREALA 141 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + V + L + + G P + DNGS + T ++LW + + Sbjct: 142 IDVGQRLSASDVVRVLDELRSKRGAPRTLFCDNGSEF-----TSQVMDLWAYHHKVEIAF 196 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SRP P +E F+ +L+ E L WF + + + WR YN RPH ALD P Sbjct: 197 SRPGKPTDNAFVESFNGTLRDECLNVHWFTSLADAREQIERWRVEYNESRPHRALDEVPP 256 Query: 296 GSRYQP 301 + Sbjct: 257 AEYVRQ 262 >UniRef50_UPI0001B511C3 integrase n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B511C3 Length = 319 Score = 179 bits (453), Expect = 2e-43, Method: Composition-based stats. Identities = 72/307 (23%), Positives = 110/307 (35%), Gaps = 26/307 (8%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 R + + N+ CR FGIS Y W +R+ EG GL+ R + P SPN + Sbjct: 17 RRLAVIRHVEEVTGNVAMSCRYFGISRQAYYTWYRRYQAEGVEGLRTRSKAPKTSPNATH 76 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNL-----MARHGLLPGASPG 129 ++ + + +G KI +L+ + S V + M R Sbjct: 77 VEVVGKIIYLRQNYH-FGPEKIAMYLKRYHDVTISKSGVWRILNRLDMGRLPASQRYKRH 135 Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGG---------RCHPLTLLDDHSRFSLCLAHCT 180 R+E P Q+D K P + + T +DD +R L Sbjct: 136 DRRWKRYEKQLPGHRVQIDVKFIEPLANTAQGRRGGRNKYYQFTAIDDCTRLR-ILRIYP 194 Query: 181 DERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 ++T Q L V +R + + DNG+ + A ++ GI + +P Sbjct: 195 QLNQKTAVQFLDYVLQRLPFQVEVIQTDNGAEFQS------AFHWHVLDKGIAHTYIKPR 248 Query: 240 HPQTQGKLERFHRSLKAEV---LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P+ GK+ER HR E L G D+ W YN RPH L P Sbjct: 249 TPRLNGKVERSHRIDAEEFYRLLDGVVIDDAEVFNDKLREWEDYYNYHRPHGGLGGHTPY 308 Query: 297 SRYQPSA 303 R + Sbjct: 309 ERLKQKT 315 >UniRef50_C1A8I3 Putative transposase orfB for insertion sequence element n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A8I3_GEMAT Length = 295 Score = 179 bits (453), Expect = 2e-43, Method: Composition-based stats. Identities = 68/292 (23%), Positives = 104/292 (35%), Gaps = 30/292 (10%) Query: 9 ARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHH 68 AR ++R +Q G ++ CR G++ A Y P Sbjct: 10 ARARPAMRQVATTLVTQHGLSVVRACRIAGLARAAYYT-------------------PLS 50 Query: 69 SPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 ++ A L RWG K L GH + VH + L Sbjct: 51 DRVERDAEVIAALTTLAAARPRWGFWKCFDRLRLDGHGW-NWKRVHRVYCALRLNLPRRT 109 Query: 129 GIPATGRFE-----HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 R + N +DF + G R L +LD+ +R +L + T Sbjct: 110 KRRLPQRVQQPLDAPPQLNHTRALDFMHDMLYDGRRFRTLNVLDEGNREALAIEVSTSLP 169 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 V L + +G P + DNG AL W + G+R+ H +P P Sbjct: 170 GTRVVSVLEQLLAIHGAPCTIRCDNGPELIS-----HALTTWCEQHGVRLQHIQPGKPNQ 224 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 +ERF+R+ + EVL FA +++ + W YN ERPH++L P Sbjct: 225 NAYIERFNRTYRREVLDAYIFASLAQVRAETETWLMTYNTERPHDSLGGVPP 276 >UniRef50_C1F6R2 ISAca4, transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F6R2_ACIC5 Length = 308 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. Identities = 66/271 (24%), Positives = 102/271 (37%), Gaps = 31/271 (11%) Query: 31 RSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHER 90 R C+ G+ A+ R+ RP + ++ L + R Sbjct: 4 RRACKLLGVDRASY-----RYE--------PRPDR--------NAELRDELVKLARQKPR 42 Query: 91 WGARKIKRWLEDQGHTMPAFSTVHNLMARHG----LLPGASPGIPATGRFEHDAPNRLWQ 146 +G R++ LE +G T+ V+ L A G G + N+ W Sbjct: 43 YGYRRLHAVLERRGQTV-NVKRVYRLYAEEGLAVRRRRRKRLVRERVGEVQLIRANQEWA 101 Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 MDF G L+++D +R L L T V + L + E GLP+ + Sbjct: 102 MDFIVDGLANGRMVRILSVVDAFTRECLALEADTSLGSGRVTRALDRLIEERGLPENVRS 161 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFAD 266 DNG + + W I + H +P P G +E FH L+ E L WF Sbjct: 162 DNGPEFTSRR-----MLGWAEERKINLVHIQPGRPMQNGHVESFHGRLRDECLNVSWFRT 216 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 +++R D++R YN ERPH +L P Sbjct: 217 LNDVRRTLDNYRQEYNCERPHSSLAYRTPAE 247 >UniRef50_Q1NW03 Integrase, catalytic region n=7 Tax=Proteobacteria RepID=Q1NW03_9DELT Length = 447 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 72/334 (21%), Positives = 123/334 (36%), Gaps = 19/334 (5%) Query: 36 RFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARK 95 R ++ T WL+++ ++G GL + R ++ LL + + R+ Sbjct: 52 RTRVACETIRDWLKKYRKDGFNGLLPKGRNDKGRSRSLPPEVADLLIATKEENPELSIRQ 111 Query: 96 IKRWLEDQGHTMPAFSTVHNLMARHGLLPGA--SPGIPATGRFEHDAPNRLWQMDFKGHF 153 + D+ PA STVH L+A GL+ P RF + L+ D Sbjct: 112 VIAATADRIPVQPAPSTVHALLAGKGLMKKKGEDPDSKDHRRFSYQFAGDLFMCDVMHGP 171 Query: 154 PFGG-----GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDN 208 + + + +DD +R A E R G+P R+ +DN Sbjct: 172 TVRTSGNKRRKTYLIAFIDDATRVIAFAAFAMSESTADFMTVFKQTIIRRGIPLRLFVDN 231 Query: 209 GSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV---LQGKWFA 265 G+ + L L +LGI + H+R YH Q +GK+ER+ R+++ + L Sbjct: 232 GAAFRSQH-----LALVCAKLGITLIHARAYHAQAKGKIERWFRTIRLQFLPLLDPASTD 286 Query: 266 DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV---R 322 L RA + + H L P R+ + + D+ + R Sbjct: 287 SLEALNRALWSYVEMEYHRNHHRMLG-ETPLDRWARLGHKVRYPEPGLDLDDLFLFEAKR 345 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 KV +S+ ++ A GE V L+ Sbjct: 346 KVHKDRTVSLNTLAYEVDAALVGETVTLRFDPSR 379 >UniRef50_C1F5F4 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F5F4_ACIC5 Length = 296 Score = 178 bits (451), Expect = 4e-43, Method: Composition-based stats. Identities = 75/293 (25%), Positives = 116/293 (39%), Gaps = 35/293 (11%) Query: 13 MSLRTEFVLFASQ-DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 S R V + G R CR + AT R+ L R Sbjct: 3 PSGRGPMVEHLERMHGVAERRACRVLCVPRATY-----RYRSC----LDPR--------- 44 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP--- 128 ++ +R R+G RKI+ L +G + + V+ L GL Sbjct: 45 ---TELRMRIREIAQSRVRYGYRKIRVLLNREGWNVGRYL-VYPLYCEEGLCLQRMRPAG 100 Query: 129 ----GIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 +F+ AP++ W MDF GG R LT++D ++R ++ + + Sbjct: 101 KHKASRSRAEKFKATAPDQAWSMDFVSDQLQGGTRFRSLTIVDVYTREAVVIEAGQSLKG 160 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 E V + L V + G+P + DNGS + T A++LW R ++ SRP P Sbjct: 161 EDVVRTLNRVKQERGVPKILFCDNGSEF-----TSQAMDLWAYRNNTKIDFSRPGKPTDN 215 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 +E F+ + ++E L WFAD E + + WR YN RPH +L P Sbjct: 216 AFVEGFNGTFRSECLNTHWFADLREAKVLIEAWRKEYNESRPHASLADRTPSE 268 >UniRef50_B3E8B6 Integrase catalytic region n=1 Tax=Geobacter lovleyi SZ RepID=B3E8B6_GEOLS Length = 269 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. Identities = 55/247 (22%), Positives = 90/247 (36%), Gaps = 12/247 (4%) Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 R + + + +R + R+G +I L +G + V+ + Sbjct: 23 RGSYRYVHHGKDDSALRLRIRQIAETRIRYGYLRIHTLLRREGWHV-NHKRVYRIYCEEC 81 Query: 122 LLPGASPGIP------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 L R + N W MDF F G R LT++D+ SR L Sbjct: 82 LNLRRKRPRRRVSAAHRANRPVASSLNDSWSMDFVADSLFNGRRFRALTVVDNWSRQCLA 141 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + + + V + + + P R+ +DNGS + +L+ W G+ + Sbjct: 142 IRVDQAMKGDDVVDAMSELTQIRNCPKRIFLDNGSEFIS-----KSLDRWAYENGVTLDF 196 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SRP P +E F+ S + E L WF + ++ + WR YN RPH AL P Sbjct: 197 SRPGKPTDNALIESFNGSFRDECLSVNWFLSMDDARQKIEDWRQEYNDFRPHTALKNLTP 256 Query: 296 GSRYQPS 302 Sbjct: 257 NEYANQF 263 >UniRef50_Q2W8I9 Transposase and inactivated derivative n=89 Tax=Bacteria RepID=Q2W8I9_MAGSA Length = 416 Score = 177 bits (450), Expect = 5e-43, Method: Composition-based stats. Identities = 62/293 (21%), Positives = 97/293 (33%), Gaps = 32/293 (10%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 + R SQ + R C T R+ R R P Sbjct: 2 RPAAKREAVAHLRSQFEMSERRACAVIAADRMTI-----RY----------RSRRPR--- 43 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + + A LR + R+G R++ L + G S ++ L GL Sbjct: 44 ---DEALRARLRDLANLRRRFGYRRLFILLHETGEP-SGLSRIYRLYREEGLAVRKRKTR 99 Query: 131 P-----ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 N W +DF G R L ++DD +R L T Sbjct: 100 RKAIGSRAPILVEARANARWSLDFVHDQLACGRRFRILNVVDDVTRECLAAIPDTSISGA 159 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V ++L ++ R G P+ + DNG+ T A+ W + I + P P G Sbjct: 160 RVTRELSALIARRGRPEMIVSDNGTE-----LTSNAVLAWKQQQRIDWHYIAPGKPMQNG 214 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 +E F+ ++ E+L F + + W YN RPH +L P + Sbjct: 215 FVESFNGRMRDELLNETVFTSLPQARAVIAAWADDYNTARPHSSLGYQTPAAH 267 >UniRef50_A6BYW2 Integrase, catalytic region n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYW2_9PLAN Length = 269 Score = 177 bits (448), Expect = 7e-43, Method: Composition-based stats. Identities = 66/268 (24%), Positives = 108/268 (40%), Gaps = 20/268 (7%) Query: 60 QDRPRIPHHSPNRSSDDITALLRMAH---DRHERWGARKIKRWLEDQGHTMPAFSTVHNL 116 ++PR + DD ALL+ RH R+G R+I R ++ G + ++ L Sbjct: 1 MNQPRSSQRYQSEPPDDEPALLKQILDLVRRHPRFGYRRIGRMIQADGWKV-NLKRIYRL 59 Query: 117 MARHGLLPGASPGIPA--------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDD 168 R GL + N +W DF G L++LD+ Sbjct: 60 WRREGLKVPRKQKKKRALGTGANACHLRRAERKNHVWCWDFIFDRTETGTTLKWLSVLDE 119 Query: 169 HSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMR 228 ++R L L E V L +F+ +G+P+ + DNGS + A+ WL + Sbjct: 120 YTRECLVLKVDRHITSEDVINVLAELFKTHGVPEHIRSDNGSEF-----VAQAIREWLKQ 174 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 +G+ + P P G E FH ++ E + + F + ++ D W+ YN RPH Sbjct: 175 IGVETLYIEPASPWENGYAESFHSRVRDEFMNCEIFENLRSARKQTDSWKEFYNEVRPHS 234 Query: 289 ALDMAVPGSRYQPSARQYSGNTTPPEYD 316 +L P Q S + S + TP + Sbjct: 235 SLGYLTPR---QFSQQCISSSRTPSAFQ 259 >UniRef50_A4A249 Transposase orfB n=4 Tax=Planctomycetaceae RepID=A4A249_9PLAN Length = 279 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 59/264 (22%), Positives = 97/264 (36%), Gaps = 17/264 (6%) Query: 61 DRPRIPHHSPNRSSDD---ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLM 117 D+PR + D+ +T + + RWG R+I + L +G T+ ++ L Sbjct: 3 DQPRSSQRFEGKPKDEDVRLTKRILELVRQRPRWGYRQICQLLRREGETL-NMKKMYRLW 61 Query: 118 ARHG-LLPGASPGIPATGRF-------EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDH 169 G +P ATG + +W DF G L ++D++ Sbjct: 62 KAAGLKVPQKRRKKRATGVSTNACHVQPAGFRHDVWTWDFIQSSTIDGRTIRFLNIVDEY 121 Query: 170 SRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRL 229 +R L + E L +F +G+P R+ DNG + A++ WL + Sbjct: 122 TRQCLAIKVGRSITSEDAIDTLAELFAMHGVPKRIRCDNGPEFIS-----CAIKTWLDLI 176 Query: 230 GIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEA 289 G+ V + P P G F+ L+ E L + + WR +N RPH + Sbjct: 177 GVEVLYIEPGSPWQNGLCVSFNSRLRDEYLHQTDLLSLEDARIKARAWREDFNHNRPHSS 236 Query: 290 LDMAVPGSRYQPSARQYSGNTTPP 313 L P + A S P Sbjct: 237 LGYLTPAEFARRCAASTSVAALLP 260 >UniRef50_C1F2E9 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2E9_ACIC5 Length = 309 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 55/242 (22%), Positives = 94/242 (38%), Gaps = 14/242 (5%) Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 RPR + +++ + LR D RWG R++ L+ +G + + V+ + Sbjct: 39 RPRASRYE--AANEKLRKRLRELADERRRWGYRRLHILLKREGWKVNS-KRVYRIYVEEK 95 Query: 122 LLPGASPGIP------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 L+ N W MDF G + L++ D ++R L Sbjct: 96 LVVRRRRRRRRVCAQARVPLLPPTRLNETWTMDFLHDALANGRKLRTLSIEDAYTREMLA 155 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + T V + L + GLP+R+ +D+G+ + T L+ W + + + Sbjct: 156 IEVDTSLPALRVVRVLERLRLERGLPERIVIDHGTEF-----TSKLLDQWAYKNQVTLHF 210 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P P G +E FH + E L WF + ++ + WR YN RPH +L P Sbjct: 211 ITPGLPMENGYIESFHGKFREECLNEHWFLMLDDARQTIESWRIDYNWVRPHSSLGYLTP 270 Query: 296 GS 297 Sbjct: 271 EE 272 >UniRef50_A4TG41 Integrase, catalytic region n=32 Tax=Actinomycetales RepID=A4TG41_MYCGI Length = 522 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 89/381 (23%), Positives = 135/381 (35%), Gaps = 32/381 (8%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 + R + V + D +I + ++ AT +W++R+ G L PR Sbjct: 58 LSTKQRGKLVREIA-DRRHIDPFGAQVQVARATLDRWIRRYRTGGFEALVPEPRR---LG 113 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL-PGASPG 129 R+ + L + ++ R L P+ ST+ R L+ P A Sbjct: 114 TRTDTQVLELAVSLKRENPARTVAQVARILRTATGWAPSESTLLRHFHRCELMGPTAGQP 173 Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 GRFE PN LW D G + + LDDHSR + E + Sbjct: 174 GEVFGRFEAADPNELWVGDALHGPRVGDRKTYLFAFLDDHSRLVVGHRFGFAEDTVRLAA 233 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L G+P + +DNGS + D L +LGIR+ HS P PQ +GK+ER Sbjct: 234 ALKPALAARGVPASIYVDNGSAFVDAW-----LLRACAKLGIRLVHSAPGRPQGRGKIER 288 Query: 250 FHRSLKAEVLQGKWFADSG--------------ELQRAFDHWRTVYNLERPHEALDMAVP 295 F R+++ + L + EL R F W R H P Sbjct: 289 FFRTVRDQFLVEVTDTSAEDLTAAGVDHRGALLELNRLFMAWTETEYHRRTHSETG-QSP 347 Query: 296 GSRYQPSARQYSGNTTPPEYDE------GVMVRKVDISGKLSVKGVSLSAGKAFRGERVG 349 R++ + G+ P + R V + +S+ + A G RV Sbjct: 348 LDRWEDGWDRLGGSPALPTAADLTEAFLWSEFRVVTKTATVSLHSNTYRVDPALAGRRVE 407 Query: 350 LKEMQED-GSYEVWWYSTKVG 369 L D S EV + G Sbjct: 408 LVFSPFDLESIEVRYRDQSFG 428 >UniRef50_A1UD36 Integrase, catalytic region n=28 Tax=Actinomycetales RepID=A1UD36_MYCSK Length = 341 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 78/317 (24%), Positives = 114/317 (35%), Gaps = 47/317 (14%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T R V Q G + G+S + W+ R+ +G AGL DR PH Sbjct: 17 NARTTFHGRLLMVRRH-QAGWPKAHIASAMGVSRKCVHTWISRFEADGEAGLIDRSSRPH 75 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG------ 121 SP R++ + + RH R G +I L + TV ++ R G Sbjct: 76 TSPMRTAQRLENQIVAWRRRH-RCGPEEIGAKLG------VSARTVSRVLHRRGAPYLRD 128 Query: 122 ----LLPGASPGIPATGRFEHDAPNRLWQMDFK--------GHFPFGGGRC--------- 160 R+E P L MD K G + G C Sbjct: 129 CDPMTGQVIRASKSTAVRYERGRPGELVHMDVKKLGRIPDGGGWRAHGRGCAPDRKRLRG 188 Query: 161 ----HPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV---FERYGLPDRMTMDNGSPWG 213 + +L+DDHSR + DE+ T L F +G+ + + W Sbjct: 189 NGFDYIHSLVDDHSRLAYS-EILPDEKGSTCAGFLERAAHYFRAHGITTIEQVMTDNAWA 247 Query: 214 DTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRA 273 +L +LG R RP+ P GK+ER +R+L+ E + F + A Sbjct: 248 YRY----SLRDVCTQLGARQIFIRPHCPWQNGKVERLNRTLQTEWAYKRVFTSNAHRAAA 303 Query: 274 FDHWRTVYNLERPHEAL 290 W YN +R H AL Sbjct: 304 LAPWLKHYNTQRRHSAL 320 >UniRef50_A1R4J8 ISAau1, transposase orfB n=3 Tax=Actinomycetales RepID=A1R4J8_ARTAT Length = 279 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 58/256 (22%), Positives = 95/256 (37%), Gaps = 19/256 (7%) Query: 60 QDRPRIPHHSPNRSSDD--ITALLRMAHDRHERWGARKIKRWLEDQGHTMP---AFSTVH 114 Q+R + P S ++ + A LR +H WG +K + L Q V Sbjct: 19 QNRSALRKKKPEMSFEETRLRADLRAVAQKHPAWGWKKARWHLRAQPQWQDVALNKKRVR 78 Query: 115 NLMARHGLL--------PGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLL 166 L GL+ P R + + P + DF+ G ++ Sbjct: 79 RLWRDEGLVCKPKPKKKRRTGPDAGEQKRLKAEYPMHVISFDFQSDVTSCGRHIRFFNVI 138 Query: 167 DDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL-PDRMTMDNGSPWGDTTGTWTALELW 225 D+ +R +L + + V L ++ G+ P + DNG + T AL W Sbjct: 139 DECTRTALAIVPRRSFKASDVVAVLENIIAETGIEPAYVRCDNGPEF-----TAAALIEW 193 Query: 226 LMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLER 285 G++ P P G +E F+ + E L G+ E + D W+ +YN ER Sbjct: 194 CSTAGVKTAFIDPGSPWQNGFIESFNAQFRREQLSGEIIDTMAEAKYLADEWKDIYNHER 253 Query: 286 PHEALDMAVPGSRYQP 301 PH +LD P + + Sbjct: 254 PHGSLDGMTPSNYWNQ 269 >UniRef50_A8HUC5 Transposase n=2 Tax=Alphaproteobacteria RepID=A8HUC5_AZOC5 Length = 314 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 79/316 (25%), Positives = 111/316 (35%), Gaps = 35/316 (11%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 AR T R + +G GIS T YKWL R+ G A L D P Sbjct: 6 HARMTFHGRVLLAQRITVEGWRTADAAGAAGISVRTAYKWLARFRAGGEAALHDASSAPG 65 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 P +S + A + + R I L A STV ++ R GL A+ Sbjct: 66 RKPRATSGETVAAIEALRRQ--RLSGPAIAHSLG------LARSTVGAILRRIGLSRLAA 117 Query: 128 -PGIPATGRFEHDAPNRLWQMDFKGHFPFGG------------------GRCHPLTLLDD 168 R++ P L MD K G G +DD Sbjct: 118 LDEKRPANRYQKAMPGELIHMDTKKLGRIDGIGHRITGDRTRQSNRRGTGWECLHVAIDD 177 Query: 169 HSRFSLCLAHCTDERRETVQQQLVS--VFERYGL-PDRMTMDNGSPWGDTTGTWTALELW 225 SR + +++ + F R+G+ R+ DNGS + Sbjct: 178 ASRLAYTEVLPDEKKGTVCAFTARALGWFARHGVVTARLMTDNGSAYKS-----HDFRDL 232 Query: 226 LMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLER 285 L G+R +RPY P+T GK ERF ++ E + S + +A W YNL R Sbjct: 233 LRAAGVRHVRTRPYTPRTNGKAERFIQTSLREWAYAVPYTSSRQRTQAMPGWIDTYNLNR 292 Query: 286 PHEALDMAVPGSRYQP 301 PH A + P +R Sbjct: 293 PHSAHNGLSPWTRLNN 308 >UniRef50_Q8XPL1 Isrso16-transposase orfb protein n=2 Tax=Ralstonia solanacearum RepID=Q8XPL1_RALSO Length = 269 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 56/243 (23%), Positives = 93/243 (38%), Gaps = 11/243 (4%) Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 +R + + +G ++ + + H A L P Sbjct: 18 SRKRKRLVGIACTHQRDCGHYGYHRVHVMQQRESWKDNHKRVYHLYRAEGLSLRHKRPKC 77 Query: 131 PATGRFEHDA-----PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 + R N +W MDF G R LT++D+++R SL + + + Sbjct: 78 NKSARLRQPKSIVMGINEIWSMDFVSDALLDGQRLRALTVVDNYTRESLAIEVGQSLKGK 137 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V + L +V ++G P + +DNG+ + A++ W G+ + SRP P Sbjct: 138 DVVRVLDAVVAQHGTPQTIKVDNGTEFIS-----KAMDRWAYEHGVELDFSRPGTPTDNA 192 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQ 305 K+E F+ + E L WF + Q WR YN RPH AL A P + AR+ Sbjct: 193 KVESFNGRFRQECLNEHWFLSLEDAQSKIADWRRHYNESRPHSALQWATPDE-FARQARK 251 Query: 306 YSG 308 + Sbjct: 252 SAS 254 >UniRef50_C4URW7 Integrase n=14 Tax=Proteobacteria RepID=C4URW7_YERRO Length = 302 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 67/305 (21%), Positives = 114/305 (37%), Gaps = 18/305 (5%) Query: 4 LMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRP 63 +M A+ ++ +T+ + +A+ + NI CR F IS T Y W + + + G GL + Sbjct: 1 MMNAKAKRDITHKTKVLNYAN-NTKNIAKTCRHFSISRRTYYTWKKAYERYGEQGLINHK 59 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL 123 P + R + I + + +G ++I +L + + S + ++ R+ L Sbjct: 60 PCPENPTRRVAKHIEEQIIYLRTTYH-FGPQRISWYLLRFHNIKVSRSGCYYVLLRNRLN 118 Query: 124 P----GASPGIPATGRFEHDAPNRLWQMDFKGHF--PFGGGRCHPL--TLLDDHSRFSLC 175 P R+E P Q+D K F G R T +DD +R Sbjct: 119 QLPQNQRQRSKPLFKRYEKQVPGHHVQVDVKFLFFNSPNGQRIKRFQYTAIDDATR-IRA 177 Query: 176 LAHCTDERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 L + + V ++ + DNG + + LG+ Sbjct: 178 LKIYERHNQANAINFIDYVVNKFPFRLKTIRTDNGHEFQAK------FNWHVHELGMEHV 231 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 + +P P+ GK+ER H + K E Q + D +L W YN RPH A Sbjct: 232 YIKPATPRLNGKVERSHLTDKQEFYQLIDYTDDVDLHEKLAEWEAFYNCHRPHSAHGGKT 291 Query: 295 PGSRY 299 P Sbjct: 292 PYEVL 296 >UniRef50_A3PPM4 Integrase, catalytic region n=5 Tax=Rhodobacteraceae RepID=A3PPM4_RHOS1 Length = 348 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. Identities = 71/311 (22%), Positives = 118/311 (37%), Gaps = 43/311 (13%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 VL +++ N+ CR+ G+ + Y+W +R+ +G GL+D P I P + + A Sbjct: 26 VLELAKELGNVAEACRQRGLDRTSFYEWKRRFQTQGFEGLKDLPPIHKSHPQSTPPETVA 85 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP----------- 128 ++ H +G + + L +G + + T+ ++ +GL + Sbjct: 86 RIKTLALAHPAYGCNRFEAMLALEGIRVSSI-TIQKILNENGLGTKSDRWLALEQANAEK 144 Query: 129 GIPAT----------------GRFEHDAPNRLWQMD--FKGHFPFGGGRCHPLTLLDDHS 170 I T E AP L D F G G GR + ++D Sbjct: 145 RIELTAEQAAFIEKLNPCFRERHVESSAPGELLSADTFFVG-ALKGIGRVYLHAVVDTFG 203 Query: 171 RFSLCLAHCTDERRETVQQQLVSVFERYG---LP-DRMTMDNGSPWGDTTGTWTALELWL 226 ++ H + + V V Y LP + DNG + T EL+L Sbjct: 204 SYAFGFLHVSKQPEAAVAVLHNDVLPFYRNLDLPVGAVLTDNGREFCGTER--HPYELYL 261 Query: 227 MRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV----LQGKWFADSGELQRAFDHWRTVYN 282 GI +R P+T G +ERF+ ++ E ++ ++ LQ D W YN Sbjct: 262 DLNGIEHRRTRVRTPKTNGFVERFNGTILDEFFRVAMRDNFYESVEALQADLDAWLVHYN 321 Query: 283 LERPHEALDMA 293 ERPH L Sbjct: 322 TERPH--LGYR 330 >UniRef50_Q4FQT2 Transposase OrfB n=179 Tax=Bacteria RepID=Q4FQT2_PSYA2 Length = 288 Score = 174 bits (441), Expect = 5e-42, Method: Composition-based stats. Identities = 57/286 (19%), Positives = 103/286 (36%), Gaps = 30/286 (10%) Query: 23 ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLR 82 + +IR C F IS Y + + + I LL Sbjct: 28 IKERRISIRRACLIFNISVTCYY--------------------HKSAASDENKQIADLLV 67 Query: 83 MAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD--- 139 +++ WG L + V+ + L P Sbjct: 68 ELTTQNKNWGFGLCFLSLRNVLGLPYNHKRVYRIYCELELNLRIKPKRRIKRVKPVPLAV 127 Query: 140 --APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 PN+ W MDF G ++DD++R +L + + V + L + E Sbjct: 128 PVEPNQSWSMDFMHDALTDGRAFRLFNVIDDYNREALTVEIDFSLPAQRVIRSLNQLIEY 187 Query: 198 YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE 257 G P ++ DNG+ + AL+ W + GI + + P +PQ +ER++R+++ + Sbjct: 188 RGKPVQVRCDNGAEYIS-----NALKDWAVNQGITIRYIEPGNPQQNAYVERYNRTMRYD 242 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 L + F D ++++ + W YN ERP+ P + +A Sbjct: 243 WLNQELFTDLDQVRQQAEDWLYHYNNERPNMGNGGFTPIQKLHQAA 288 >UniRef50_C0WLI3 Transposase n=14 Tax=Corynebacterium RepID=C0WLI3_9CORY Length = 497 Score = 174 bits (441), Expect = 5e-42, Method: Composition-based stats. Identities = 71/317 (22%), Positives = 119/317 (37%), Gaps = 20/317 (6%) Query: 12 TMSLRTEFVLF-ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 ++LR + F ++G ++ C+ G+S T Y R A+ G AG+ P + Sbjct: 4 PITLRKKIADFDPIREGITVQQFCKNIGVSKQTYYNIKARIAERGRAGIVPDSTAPLNPR 63 Query: 71 NRSSDDITALLRM----AHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGLLP 124 D I + R + G I + D+ P+ + + + G+ Sbjct: 64 RVYDDKIRQQVLQARGTLRARGQDCGPWSIFYFFLDELGYDQPPSRALIAQWLHEAGVAD 123 Query: 125 GASPGIPAT--GRFEHDAPNRLWQMDFKGHFPF--GGGRCHPLTLLDDHSRFSLCL-AHC 179 + P F N LWQ+D + F + ++DD SRF + A Sbjct: 124 INARKRPRKSYRHFARGEVNELWQIDAFAYRLFDVPHTQVTIYQVVDDASRFDVGSQAFG 183 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWG-DTTGTWTALELWLMRLGIRVGHSRP 238 T E + L + YGLP + DNG + G + E WL LG++ S Sbjct: 184 TPENGTDARITLSGAIDAYGLPQEVLSDNGDAFATYHRGRLSQTERWLASLGVQ--SSAG 241 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL----DMAV 294 + P TQGK ER H+++ L + ++Q+ +R YN R H+ L Sbjct: 242 FAPTTQGKDERSHQTM-TRFLDARTPTTLAQVQQLIVDYRNFYNTRRRHQGLLRGKMHIT 300 Query: 295 PGSRYQPSARQYSGNTT 311 P ++ + Sbjct: 301 PAQAWEIISHAQPPTQP 317 >UniRef50_C6BTX7 Integrase catalytic region n=88 Tax=Bacteria RepID=C6BTX7_DESAD Length = 289 Score = 174 bits (441), Expect = 5e-42, Method: Composition-based stats. Identities = 60/255 (23%), Positives = 95/255 (37%), Gaps = 13/255 (5%) Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 R + S + + +R + R+G +I L +G + VH + G Sbjct: 32 RSSYYYKSVRKDDTPLRLRIRDIAESRVRYGCHRIYILLRREGWYV-NHKKVHRIYCEEG 90 Query: 122 -LLPGASPGI-----PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 L P R E ++ W MDF + G R LT++D+ SR L Sbjct: 91 LNLRSKRPRRHISAARRMDRPELSTIDQCWSMDFVADNLYNGRRIRALTVVDNFSRECLD 150 Query: 176 LAHCTDERRETVQQQLVSVFERYG-LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 + + + + V +L + G P R+ +DNGS + AL+ W + + Sbjct: 151 IYVDSSIKGDKVVARLEWLRVISGRKPIRIQVDNGSEFIS-----KALDKWAYENEVVLD 205 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 SRP P +E F+ S + E L WF + + + WR YN RPH +L Sbjct: 206 FSRPGKPTDNPFIESFNGSFRDECLNTHWFLSVSDARTRIETWRKEYNEFRPHSSLGDQT 265 Query: 295 PGSRYQPSARQYSGN 309 P G Sbjct: 266 PNDCALAHKTPSEGR 280 >UniRef50_A0AXB8 Integrase, catalytic region n=27 Tax=Betaproteobacteria RepID=A0AXB8_BURCH Length = 279 Score = 173 bits (438), Expect = 9e-42, Method: Composition-based stats. Identities = 64/296 (21%), Positives = 103/296 (34%), Gaps = 36/296 (12%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 ++ + R E + ++ G + R CR G+S L++ ++ + G Sbjct: 2 NSPTGRREALEVLTRRGLSQRKACRYLGLSRRVAIYTLKQPEKDRSLG------------ 49 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 L A R+G R+I WL S V + L Sbjct: 50 --------ERLIAASQEVPRFGYRRISAWLS------LGESRVRRMWRALKLNIPKRRPR 95 Query: 131 PAT-----GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 PN +W DF G L ++D+++R L + R + Sbjct: 96 RRRCGSDIRLPGATKPNSVWSYDFVHDQLVDGRVLKMLCVIDEYTRECLAIEVGASLRSQ 155 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V L + YG P + DNG+ + T + WL I P +P G Sbjct: 156 DVILVLSRLMRLYGKPAFIRSDNGAEF-----TAAKVMRWLRDAAIGPAFITPGNPWQNG 210 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 +E F+ L+ E+L +WF E + + WR YN RPH A P + + Sbjct: 211 FVESFNGKLRDELLNREWFRSRAEAKVLIERWRQFYNERRPHSAHRYQPPATVRRA 266 >UniRef50_Q3A4V8 Transposase and inactivated derivatives n=9 Tax=Bacteria RepID=Q3A4V8_PELCD Length = 336 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. Identities = 67/295 (22%), Positives = 117/295 (39%), Gaps = 19/295 (6%) Query: 5 MPWDARDTMSLRTEFVL-FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRP 63 MP D RD + +FV +A + G + + G++ + Y W QR+ + + Sbjct: 1 MPHDIRDAV---IDFVKHWAKRTGIAVTHIIDWLGLAVSKFYNWQQRYGK----ANEHNA 53 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL 123 IP + + + + + G R++ + DQ + S+V+ ++ GLL Sbjct: 54 LIPRDFW--LEEKEKQAIIKFYQQKPQEGYRRLTFMMLDQDVVAVSPSSVYRVLNAAGLL 111 Query: 124 PGA--SPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 TG + P+ W +D + G + LLD SR+ + Sbjct: 112 RRWNGKQSKKGTGFVQPLKPHEHWHIDV-SYINICGTFYYLCCLLDGCSRYIVHWELREA 170 Query: 182 ERRETVQQQLVSVFERY-GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 V+ L E++ R+ DNG + + ++ G+ + PY+ Sbjct: 171 MTEANVEIILQRAREKHPAATPRIISDNGPQFIT-----KDFKEFIRVAGMTHVRTSPYY 225 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 PQ+ GKLERFH ++K E ++ K E + + YN ER H AL P Sbjct: 226 PQSNGKLERFHGTIKQECIRPKVPLSLEEARAQVADYIRYYNDERLHSALGYVAP 280 >UniRef50_B8FAB2 Integrase catalytic region n=70 Tax=Bacteria RepID=B8FAB2_DESAA Length = 327 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. Identities = 63/320 (19%), Positives = 110/320 (34%), Gaps = 37/320 (11%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R + V S + +I C +S ++ Y + +P H Sbjct: 6 RKQAV-EPSSEELSITRQCELLSMSRSSYYY-------------RPKPVSDH------DL 45 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFS--TVHNLMARHGLLPGASPGIPAT 133 ++ L+ + + WG+R ++ +L G+ + + +M + P +P Sbjct: 46 ELMRLIDEQYLKQPTWGSRSMRNFLRGLGYKINRKKVRRLMRIMGICAVYPKPRTSLPHP 105 Query: 134 GRFEH---------DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 G + D N++W D + P G + ++D HSR L Sbjct: 106 GHKVYPYLLKGVSIDRANQVWSSDIT-YIPMRKGFMYLCAVIDWHSRKVLSWRLSNTMDA 164 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 + +RYG P+ D G + T L GIR+ Sbjct: 165 DFCVDAAAEAIDRYGPPEIFNTDQGVQF-----TSADFTGLLKGHGIRISMDGKGRCLDN 219 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 +ER +LK + + F D +L++ W YN ERPH++LD P Y + Sbjct: 220 IFVERLWWTLKYHYVYLRDFEDGVQLRKGLAGWFDFYNRERPHQSLDGKTPNEAYFNVSG 279 Query: 305 QYSGNTTPPEYDEGVMVRKV 324 S P + + R V Sbjct: 280 PISWVREPMKPVRSMEQRGV 299 >UniRef50_C8X9D2 Integrase catalytic region n=6 Tax=Actinomycetales RepID=C8X9D2_NAKMY Length = 347 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. Identities = 83/325 (25%), Positives = 120/325 (36%), Gaps = 45/325 (13%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +A T R G I + G++ T KW RW +G AGLQDR P Sbjct: 22 NAPLTPEGRRRLCQRV-DAGRPICHVAAEAGVARQTLAKWHARWKADGPAGLQDRSSRPV 80 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 SP + + ++ RH + G + L + G T+ A ST+H ++ R G+ Sbjct: 81 SSPGQVDAQVEDVVEYLR-RHLKLGPVMLAAELREFGITL-APSTIHRVLVRRGISRLRD 138 Query: 128 ------PGIPATGRFEHDAPNRLWQMDFKGHFPFGGG----------------------- 158 R+E AP L +D K G Sbjct: 139 LDVTGHQLREPVRRYEWAAPGDLIHVDVKKIGRIPDGGGWRIHGRGNDAHRASQRGQRPG 198 Query: 159 RCHPLTLLDDHSR--FSLCLAHCTDERRETVQQQLVSVFERYGLP--DRMTMDNGSPWGD 214 T +DD SR ++ LA + V F +G+ R+ DNGS + Sbjct: 199 YAFLHTAIDDRSRLAYTEELADEKSVTAAGFWARAVEFFAAHGIERIHRVLTDNGSCYRG 258 Query: 215 TTGTWTALELWLMRLGIR-VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRA 273 LG ++RPY PQT GK+ER+HR+L E + ++ + E A Sbjct: 259 KDFNTA--------LGATVHKYTRPYRPQTNGKVERYHRTLAREWAYRQAWSCNDERAAA 310 Query: 274 FDHWRTVYNLERPHEALDMAVPGSR 298 + YN RPH AL P R Sbjct: 311 LAGFVHRYNYHRPHTALKGKPPAYR 335 >UniRef50_B8ER74 Integrase catalytic region n=103 Tax=Bacteria RepID=B8ER74_METSB Length = 309 Score = 170 bits (431), Expect = 7e-41, Method: Composition-based stats. Identities = 64/270 (23%), Positives = 97/270 (35%), Gaps = 29/270 (10%) Query: 50 RWAQEGAAGLQDRPRIPHHSPNRSSDDITAL---LRMAHDRHERWGARKIKRWLEDQGHT 106 R ++ A +PR P R DD AL + + R+G R+I L G Sbjct: 9 RVSERRACAALRQPRSTQRKPARGRDDEAALTADIVELAKAYGRYGYRRITALLRHAGWV 68 Query: 107 MPAFSTVHNLMA----------RHGLLPGASPGIPATGRF----------EHDAPNRLWQ 146 + A V + R P GR + PN +W Sbjct: 69 VNA-KRVQRIWRADKFTQSAQQRDCEGLKVPQKHPKRGRLWLNDGSCVRPRAERPNHVWS 127 Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 DF G + L L+D+ +R +L + V + L + G P + Sbjct: 128 YDFVADRTQDGRKFRMLCLIDEFTREALAIQVKRRLNATDVLETLADLMILRGTPAYVRS 187 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFAD 266 DNG + AL W+ +G + + P P G E F+ +L+ E+L G+ F Sbjct: 188 DNGPEFIAV-----ALREWIAAVGSKTAYIEPGSPWENGACESFNSNLRDELLNGELFFS 242 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPG 296 E Q + WR +N RPH +L P Sbjct: 243 PAEAQAMIEAWRRHFNAVRPHSSLGYRSPA 272 >UniRef50_Q2CG00 Integrase, catalytic domain n=8 Tax=Rhodobacterales RepID=Q2CG00_9RHOB Length = 340 Score = 170 bits (430), Expect = 9e-41, Method: Composition-based stats. Identities = 64/281 (22%), Positives = 101/281 (35%), Gaps = 33/281 (11%) Query: 23 ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLR 82 + R C+ G+ P T + P P +I ++ Sbjct: 1 MRDHDISQRRACQLVGVDPKTVRR-----------------TRPPDCP-----EIHEEMK 38 Query: 83 MAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG-----LLPGASPGIPATGRFE 137 + R+G R+I LE +G M ++ L G T E Sbjct: 39 EIAGKRRRFGYRRIGILLERKGMLM-NHKKLYRLYREEGLSVKRRGGRKRARGSRTPMPE 97 Query: 138 HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 P W +DF + L ++DD R +LCL T V ++L ++ Sbjct: 98 AAHPKARWSLDFLADSFGASRKFRILAVIDDCCRENLCLTADTSISGARVARELDALVRI 157 Query: 198 YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE 257 YG P + DNG+ + T A+ W + G+ + P PQ +E F+ SL+ E Sbjct: 158 YGTPACIVSDNGTEF-----TSRAILKWADKNGVPWHYIDPGKPQQNAFIESFNGSLRDE 212 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 +L + F + +R WR YN RPH +L P Sbjct: 213 LLYEEIFVTLEDARRKLALWRYDYNAVRPHSSLGNQTPLEA 253 >UniRef50_A5G4C5 Putative uncharacterized protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5G4C5_GEOUR Length = 390 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 69/382 (18%), Positives = 140/382 (36%), Gaps = 23/382 (6%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 LR + V G + +++ S +KWL R+ ++ R P P+ Sbjct: 4 ELRKQAVQRHLA-GESPKAVYTSLDRSKKWFFKWLNRYQSGATDWYKEHSRAPLKRPSEL 62 Query: 74 SDDITALLRMAHDR-----HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 S ++ +R + G IK L G + ST++ + R GL+ + Sbjct: 63 SIVDKEIIVSTRNRLDSSPFAQIGVSAIKWELHKLGLPFRSDSTINRTLKREGLVKKKTR 122 Query: 129 GIPATGRF----EHDAPNRLWQMDFKGH-FPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 P + E N + QMD G + GR + + ++D +S ++ T E Sbjct: 123 YSPKGVEYPYFTEALCCNNIHQMDLVGPRYIKSDGRFYSMNVIDLYSHRVFIESNRTKED 182 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTAL---ELWLMRLGIRVGHSRPYH 240 + + Q L+ ++ GLPD + MDN + + +L + G+ Sbjct: 183 -DNIAQGLLRCWKSMGLPDFLQMDNELSFRGSNRYPRSLGLVLRLCLYFGVHPVFIPVAE 241 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP----- 295 P G +E F+ + + + +WF L+R +++ +N + L P Sbjct: 242 PWRDGVIESFNDTYDKKFFRRQWFTSYSMLKRQSKNFQQFHNKNHRYSYLKGKTPLEVIE 301 Query: 296 GSRYQPSARQYSGNTTPPEY-DEGV--MVRKVDISGKLSVKGVSLSAGKAFRGERVGLKE 352 +++P + ++ +G ++R + L++ G K V Sbjct: 302 ADKFKPLTLGPNTRMPKLDFLPDGTISLIRFIRSDRTLNIFGEKFEVSKDLVYSYVRAMI 361 Query: 353 MQEDGSYEVWWYSTKVGVIDLK 374 + E + +V+ V + + Sbjct: 362 VTEIHTLQVYLGEDFVQSFEYR 383 >UniRef50_A1WCB6 Integrase, catalytic region n=2 Tax=Burkholderiales RepID=A1WCB6_ACISJ Length = 270 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 95/282 (33%), Gaps = 32/282 (11%) Query: 22 FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL 81 + G + R C+ GI+ +T R+ R + + Sbjct: 6 LMVEHGMSQRRACQASGIARSTL-----RYR--------PIARDDSG--------VITFI 44 Query: 82 RMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD-- 139 R + R G + QG + + L R + Sbjct: 45 RAYMALNPRHGFGLLYDSARHQGKPWGKT-VLWRVYCELRLNLPRRGKKRLPARIKQPLH 103 Query: 140 ---APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 PN+ W DF + G R ++D+ +R L + T V + L + E Sbjct: 104 AAAQPNQGWSCDFMADALWSGRRFRTFNVIDEFNREGLRIEVDTSLPATRVIRALNELVE 163 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 G P + +DNG + AL W GI + H +P P +ERF+++ + Sbjct: 164 VRGAPLSIRLDNGPEF-----IAHALSEWAKSKGIALNHIQPGKPTQNAYVERFNKTYRT 218 Query: 257 EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 EVL F + E++ W YN RPHEAL P Sbjct: 219 EVLDCYVFDNLQEVRDMTADWLHRYNHHRPHEALGRIPPVEY 260 >UniRef50_A3YV04 Transposase n=3 Tax=Synechococcus sp. WH 5701 RepID=A3YV04_9SYNE Length = 312 Score = 167 bits (422), Expect = 7e-40, Method: Composition-based stats. Identities = 74/315 (23%), Positives = 114/315 (36%), Gaps = 34/315 (10%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +AR T R + +G +++L + GIS + YKWL R+ G L DR + Sbjct: 6 NARLTPISRERLIRRHLNEGEPLKALAAQAGISLRSAYKWLARFRDGGVTALADRRSVRR 65 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 L + RH+R R+I + L+ S+V M GL + Sbjct: 66 TQRRTLDP--QQLQQAVDLRHQRCTLRRIAKALK------APLSSVGRAMNALGLGRLRN 117 Query: 128 PG-IPATGRFEHDAPNRLWQMD-------------FKGHFPFG----GGRCHPLTLLDDH 169 R++ + P + +D G G G +DD Sbjct: 118 LEPKKPVQRYQWERPGDMIHVDTKQLARFERVGHRITGDRRQGCSPGAGYEKVHVAIDDA 177 Query: 170 SRFSLCLAHCTDERRETV--QQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWL 226 +R + ++R TV + V F G+ R+ DNG + Sbjct: 178 TRLAYVEVLADEQRATTVGFLARAVGWFSEQGITCRRILSDNGPAYRSGDWRKA-----C 232 Query: 227 MRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERP 286 L ++ ++PY PQT GK ERF ++L AE + S E R + +YN R Sbjct: 233 QALDLKPIRTKPYTPQTNGKAERFIKTLLAEWAYVMAYQTSEERNRWLPRYLGIYNGHRC 292 Query: 287 HEALDMAVPGSRYQP 301 H AL P Q Sbjct: 293 HMALGGLTPQQSLQR 307 >UniRef50_A3DCZ2 Integrase, catalytic region n=10 Tax=Clostridium RepID=A3DCZ2_CLOTH Length = 278 Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats. Identities = 51/248 (20%), Positives = 94/248 (37%), Gaps = 21/248 (8%) Query: 67 HHSPNRSSDD---ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMAR---H 120 ++ P +++ I ++ + + +G R++ L H M H Sbjct: 26 YYKPAPVNEEEYLIKRIIDEIYASYPEYGYRRMTSILNKDYHIHINRKRTRRYMREMGIH 85 Query: 121 GLLPGASPGIPATGR---------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSR 171 G PG + G+ + D PN++W +D + G + + ++D +SR Sbjct: 86 GFCPGPNLSKRIHGKNLYPYLLRNLKIDHPNQVWSIDVT-YCRMKRGFMYMVAIIDWYSR 144 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 + + + V + + +RYG P+ M D GS + L GI Sbjct: 145 YIVGFELSNTLDKTFVIEAIQKAIKRYGKPEIMNSDQGSQFTSDDYI-----NLLKNNGI 199 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 ++ ++ERF RS K E L + +L++ + YN RPH++LD Sbjct: 200 KISMDGKGRALDNQRIERFFRSYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPHQSLD 259 Query: 292 MAVPGSRY 299 P Y Sbjct: 260 YKTPAEYY 267 >UniRef50_C4YYX7 Integrase catalytic region n=2 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YYX7_9RICK Length = 280 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 53/285 (18%), Positives = 98/285 (34%), Gaps = 36/285 (12%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAH 85 +I C I+ ++ Y + +G + +I ++ + Sbjct: 15 SNLSIARQCTLLFINKSSYY-----YKPQGL--------------TQKDLEIMQVIDEIY 55 Query: 86 DRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGLLPGASPGIPATGR-------- 135 +H +GAR++ L G T+ A S + +MA + P + Sbjct: 56 TQHPYFGARRMSEHLVPFGITIGREAVSRYYRIMAIEAIYPKMNLSKRNQAHKIYPYLLK 115 Query: 136 -FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 E N++W D + G + + ++D SR+ + + L Sbjct: 116 GVEIIKVNQVWSTDIT-YIRMAQGFVYLVAIIDRFSRYIVSWKVSISLESDFCIDALEEA 174 Query: 195 FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 +YG P+ D GS + T L++ I++ +ERF RSL Sbjct: 175 IIKYGQPEIFNTDQGSQFTSKNFTDK-----LIKREIKIIMDGKGRALDNVFIERFWRSL 229 Query: 255 KAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 K E + E + A + YN +R H++L+ P Y Sbjct: 230 KQEKIYLMVLNTVKEAKNAITDYINFYNRKRMHQSLEYLTPEQVY 274 >UniRef50_Q2P621 ISXoo3 transposase n=194 Tax=Proteobacteria RepID=Q2P621_XANOM Length = 349 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 56/293 (19%), Positives = 92/293 (31%), Gaps = 32/293 (10%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R V GA+ R G+S + L+ RPR + Sbjct: 75 RRALVREWIAGGASERCALAAIGMSAS---------------ALRYRPREDRNV------ 113 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP---- 131 ++ + RH R+G I L +G + + V L L Sbjct: 114 ELRERILALAHRHRRYGVGMISLKLRQEGRLV-NYKRVERLYCEQQLQVRRRKRKKVPLG 172 Query: 132 -ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 N +W MD G L ++DD + ++ + V + Sbjct: 173 ERAPLLRPTKANPVWSMDVVFDRTAEGRAIKCLVIVDDATHEAVAIDVERAISGHGVVRV 232 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 L + GLP + DNG + A+ W +++ +P P +E F Sbjct: 233 LDRLAHSRGLPKMIRTDNGKEFCG-----KAMVAWAHANRVQLRQIQPGKPNQNAYVESF 287 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 + L+ E L F + + WR YN RP + + P + Q A Sbjct: 288 NGRLRDECLNKHGFPTLLHARTEIERWRREYNEHRPKKTIGGMTPAAYAQQLA 340 >UniRef50_Q1BK79 Integrase, catalytic region n=37 Tax=Proteobacteria RepID=Q1BK79_BURCA Length = 339 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 71/285 (24%), Positives = 108/285 (37%), Gaps = 14/285 (4%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R +V + G N +C R GIS T KWL+R+ + G GL+ + R P SPNR Sbjct: 2 RLRWVRMYHETG-NAGLVCTRCGISRPTLRKWLRRYQEAGEEGLRSQSRRPLTSPNRKVS 60 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG-ASPGIPATG 134 D + + GAR+I+ L + +T+H ++ + P Sbjct: 61 DADRATILRLRAERKGGARRIQNELRLNEQRELSLATIHKVLCEALVKPLVRPRRPAQPR 120 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 R+ P QMD + T +DD SRF + + R T+ L V Sbjct: 121 RYSRPVPGDRVQMDTMK----IARGVYQYTAIDDCSRFRVLAVYPRRNARNTL-FFLDRV 175 Query: 195 FERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 E P R+ D G + +++ LM I+ P P GK+ER + Sbjct: 176 IEEMPFPIQRIQTDRGGEF-----FAESVQRRLMNECIKFRPIPPRSPHLNGKVERSQLT 230 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 E + + + W+ YN RPH +L P R Sbjct: 231 DLNEFWFHHAPTERA-IDLRIEEWQFDYNWRRPHGSLGGKTPVDR 274 >UniRef50_B1K7U4 Integrase catalytic region n=7 Tax=Bacteria RepID=B1K7U4_BURCC Length = 282 Score = 164 bits (414), Expect = 6e-39, Method: Composition-based stats. Identities = 76/307 (24%), Positives = 117/307 (38%), Gaps = 38/307 (12%) Query: 12 TMSLRTEFVLFASQD-GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 +L+ + V + G + CR + +T Y R ++ L+ R Sbjct: 3 APALKRQAVSYIVDHYGLPTQRACRLVKQARSTHYY---RSVKDPQTALRQR-------- 51 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMA------RHGLLP 124 +R R+G R++ L+ +G + + + V+ L A R L Sbjct: 52 ----------MREIAQTRVRYGYRRVHVLLKREGWRV-SRNRVYRLYAEEQLQLRSKLPK 100 Query: 125 GASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 + R PN +W +DF G R LT++D SR +L + R Sbjct: 101 RRKMVVSRRERCVPVRPNEVWSLDFVADQLADGTRLCALTVVDIFSREALAIEVGKRLRA 160 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 E V L + + P + DNGS + L++W +++ SRP P Sbjct: 161 EDVVSVLNRLVAQRRAPRFLFADNGSEFSGRL-----LDMWAYHYKVQIDFSRPGKPTDN 215 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 +E F+ S + E L WF E +R + WR YN RPH AL PG AR Sbjct: 216 SFIETFNGSFRDECLNLHWFESLAEAKREIEAWRCDYNETRPHMALKELTPGE----FAR 271 Query: 305 QYSGNTT 311 QYS T Sbjct: 272 QYSLRPT 278 >UniRef50_B3EBT2 Integrase catalytic region n=87 Tax=Bacteria RepID=B3EBT2_GEOLS Length = 305 Score = 163 bits (413), Expect = 9e-39, Method: Composition-based stats. Identities = 64/298 (21%), Positives = 102/298 (34%), Gaps = 38/298 (12%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + R ++ +I C G+S + Y +P + NR Sbjct: 12 TARKRELIDWRHPTISIARQCELLGVSRSCLYYH----------------PVPASNENRL 55 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL---PGASPGI 130 + LL + RH G K+ WL QG+ V L+ GL+ PG + Sbjct: 56 ---LMRLLDEEYTRHPFLGVIKLTNWLRSQGYWHIGTRRVRRLLRLMGLMAIYPGPNLSK 112 Query: 131 PATG---------RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 PA G E + N++W D + G + + ++D SR+ L Sbjct: 113 PAPGHKIYPYLLRNVEVERVNQVWSADIT-YIRLKTGFVYLVAVVDWCSRYILAFEISIT 171 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 + + L R G P+ D GS + T L G+R+ Sbjct: 172 LEADFCIEALQQALTR-GTPEIFNSDQGSQFTSPRHT-----EILHLAGVRISMDGKGRA 225 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 +ERF RS+K E + + E + + YN ER H++L P Y Sbjct: 226 LDNIFVERFWRSVKYEEVYLHDYESVQEARIGLKRYIEYYNNERQHQSLGYQTPAEVY 283 >UniRef50_B1ZXZ1 Integrase catalytic region n=2 Tax=Verrucomicrobia RepID=B1ZXZ1_OPITP Length = 298 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 61/271 (22%), Positives = 86/271 (31%), Gaps = 12/271 (4%) Query: 48 LQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM 107 + R G L R + + LR RWG R++ L +G Sbjct: 10 VSRTRACGLLRLAVSSYYYQSHGRRDELPVRSALRRHAVVRRRWGYRRLLVLLRREGIA- 68 Query: 108 PAFSTVHNLMARHGLLPGASPGIPATGRF------EHDAPNRLWQMDFKGHFPFGGGRCH 161 V+ L GL PN W +DF G Sbjct: 69 DNHKRVYRLYRAEGLQVRQRRRRKQRLARGVEAVAAPQRPNERWSLDFVHDRLANGRSLR 128 Query: 162 PLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA 221 LT+ DD++R L + T V + L + E G P + DNG + A Sbjct: 129 LLTVHDDYTRECLWIEADTSLSGPRVARVLDYLTELRGRPGSLLTDNGPEFAGL-----A 183 Query: 222 LELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVY 281 LE W + P P G +E F+ L+ E L F + + +R Y Sbjct: 184 LERWTHERQVNHRFITPGKPSQNGYIESFNGKLRDECLNETEFLSVSHARDLLEAFREDY 243 Query: 282 NLERPHEALDMAVPGSRYQPSARQYSGNTTP 312 N +RPH +L P AR G Sbjct: 244 NHQRPHSSLHDLTPAQFAAKIARAPMGAPVD 274 >UniRef50_A9G353 Putative transposase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G353_SORC5 Length = 428 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 87/327 (26%), Positives = 130/327 (39%), Gaps = 21/327 (6%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 S T +WL + G GL+ +PR S+++ LL H I + Sbjct: 33 YSVPTLERWLYAYRSAGLDGLRPQPRSDRGFAQDLSEELRTLLLDIRREHPDASVPLILK 92 Query: 99 WLEDQGH---TMPAFSTVHNLMARHGLLPGASP--GIPATG-RFEHDAPNRLWQMDFKGH 152 L D+G T + TV L A HGL A+ G P T R++ + P LW D Sbjct: 93 TLVDEGRLEATQVSEPTVRRLYAAHGLRRRAARAEGEPKTRLRWQVERPGALWHGDVCHV 152 Query: 153 F-PFGGGRCHPLTL---LDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDN 208 GG+ PL + LDD SR+ + L T E+ + V R+G PD + +DN Sbjct: 153 TGCTVGGKAMPLRIHGLLDDASRYVVALEAHTTEKEIDMLAMTVDALRRHGKPDALYLDN 212 Query: 209 GSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL-QGKWFADS 267 GS + L+ RLGI + H++PY P+ +GK+ERF R+L+ L A Sbjct: 213 GSTYRGDV-----LKTACARLGITLLHAKPYDPEARGKMERFWRTLREGCLTYLGAVASL 267 Query: 268 GELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA---RQYSGNTTPPEYDEGVMVRKV 324 ++ + PH L P Y E R+V Sbjct: 268 VDINTTLRAFLDRRYHPAPHAGLLGQTPAKVYAARPAAEGSVDEKALRVALTERTR-RRV 326 Query: 325 DISGKLSVKGVSLSAGKAF-RGERVGL 350 +SV GV+ + + G+ V + Sbjct: 327 SGDNIVSVDGVAWQLDQGYLAGQIVSV 353 >UniRef50_Q8PGV8 ISxac4 transposase n=3 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PGV8_XANAC Length = 274 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 53/285 (18%), Positives = 93/285 (32%), Gaps = 30/285 (10%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAH 85 +R C G++ A Y P + + + Sbjct: 8 HARPLRRSCACVGLARAAWY-------------------APPLDWTVCDAGLISAIARVV 48 Query: 86 DRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR-----FEHDA 140 + G K L ++ + L + R + Sbjct: 49 EDRPSRGFWKCSDVLRRTRPDW-NPKRIYRVYKAMRLNLRRAAKRRLPKRERVALYVPRL 107 Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 P+ +W +DF G R ++DD +R L + T + + + +GL Sbjct: 108 PDTVWSVDFMSDALACGRRFRTFNVVDDSNREVLHIEVDTSINSHRLVRVFEQIKHDHGL 167 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG + A WL G+ + + +P P +ERF+R+ + EVL Sbjct: 168 PQIVRSDNGPEFLGE-----AFTSWLKVNGVAIKYIQPGKPNQNAFIERFNRTFREEVLD 222 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQ 305 F ++++A YN ERPH++L P AR+ Sbjct: 223 QHLFTCLDDIRQAIHWRMIDYNEERPHDSLSGLTPTEYRNQHARR 267 >UniRef50_D1K5D0 Transposase n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1K5D0_9BACE Length = 380 Score = 161 bits (406), Expect = 6e-38, Method: Composition-based stats. Identities = 69/358 (19%), Positives = 120/358 (33%), Gaps = 54/358 (15%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 +L SQ N+ C+ G S + Y++ + + Q G LQ+ R NR + I Sbjct: 14 LLELSQQLGNVSRACKIMGYSRDSFYRFKELYEQGGEIALQEISRRKPVIKNRVEEHIEQ 73 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG---------- 129 + + G ++ L +G + + Sbjct: 74 AVVGMAIDNPALGQVRVSNELRKKGILVSPGGVRSIWLRHDMETFQKRLKALSAKVEQEG 133 Query: 130 -----------------IPATGRFEHDAPNRLWQMD--FKGHFPFGGGRCHPLTLLDDHS 170 A G E P L D + GH G + T++D +S Sbjct: 134 IILDENQVAALEKAKEEKQAHGEIETYYPGFLVAQDTYYVGHIKGV-GHIYQQTVIDTYS 192 Query: 171 RFSLCLAHCTDE---RRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWL 226 + + + + ++V FE++ L RM D G+ + EL+L Sbjct: 193 KIGFAKLYDRKNALVAADMLNDRIVPFFEQHDLKLMRMLTDRGTEYCGNRENH-EYELYL 251 Query: 227 MRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV----LQGKWFADSGELQRAFDHWRTVYN 282 I + PQT G ERF+R+++ E + K + +LQ D W YN Sbjct: 252 AVEDIDHSKIKAKSPQTNGICERFNRTVQNEFYAIAFRKKIYTSIEQLQTDLDAWMNSYN 311 Query: 283 LERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV-RKVDISGKLSVKGVSLSA 339 +R H + G T + EG+ V RK + + ++K ++ Sbjct: 312 TQRTHSG--------------KYCFGKTPMQTFIEGIAVARKYKLQNQETIKPNGVNI 355 >UniRef50_A5D1X6 Transposase and inactivated derivatives n=2 Tax=Pelotomaculum thermopropionicum SI RepID=A5D1X6_PELTS Length = 308 Score = 160 bits (405), Expect = 7e-38, Method: Composition-based stats. Identities = 65/293 (22%), Positives = 113/293 (38%), Gaps = 36/293 (12%) Query: 40 SPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRW 99 S T YK L R+ + G AGL D+ R+P PN++ D+ + H +G ++I Sbjct: 2 SHTTFYKLLDRFKEHGEAGLYDKERVPGIKPNQTPTDVEGAILAFVLDHPTYGPKRISAE 61 Query: 100 LEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMD----------- 148 L+ + + + V+ ++ R+ L + + W++D Sbjct: 62 LKKRCIRV-GETAVYGVLKRN-SLNTRRDRLKWVDSLQPPQEKTAWELDKEASQHRHVHA 119 Query: 149 -----FKGH------FPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV--- 194 G G+ + +D S + TD+ ++ L+ V Sbjct: 120 PQPGYLMGQDGKLVGRLANIGKVYVQVGVDCASSYGWA-KLYTDKTADSAADFLIHVHSD 178 Query: 195 FERYGLP-DRMTMDNGSPWGDTTGTW-TALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 + G+ R+ DNG +G + T+ + LGI+ ++ HP T G ERF + Sbjct: 179 CQSKGVEVQRVLTDNGKEYGSSEPTYGHTYGAACLILGIKHKTTKVKHPWTNGYAERFVQ 238 Query: 253 SLKAEV----LQGKWFADSGELQRAFDHWRTVYNLERPHEAL--DMAVPGSRY 299 +L E L+ K + ELQ D + YN ERPH+ P + Sbjct: 239 TLYQEFFQVALRRKRYTSVEELQADLDRYLLYYNWERPHQGRRTRGRTPAQAF 291 >UniRef50_A1V109 A, transposase OrfB n=56 Tax=Proteobacteria RepID=A1V109_BURMS Length = 797 Score = 160 bits (405), Expect = 8e-38, Method: Composition-based stats. Identities = 55/264 (20%), Positives = 90/264 (34%), Gaps = 31/264 (11%) Query: 31 RSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHER 90 R CR G+S + + P+ ++ + A L R Sbjct: 13 RRACRLVGLSRSVLHY--------------------DAKPDHENEVLAARLVKLAHERRR 52 Query: 91 WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI-----PATGRFEHDAPNRLW 145 +G R++ +E +G T ++ L GL APN +W Sbjct: 53 FGYRRLHALVEREG-THANHKRIYRLYREAGLAVRRRRKRHGVMIEREQLALPGAPNEVW 111 Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G R LT++DD ++ ++ + V + L G P + Sbjct: 112 SIDFVMDALSNGRRVKCLTVVDDFTKEAVDIVVDHGISGLYVARALDRAARFRGYPKAVR 171 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFA 265 D G + T AL+ W G+ + + P +E F+ + E L WF Sbjct: 172 TDQGPEF-----TSRALDQWAYANGVTLKLIQAGKPTQNAYIESFNGKFRDECLNEHWFT 226 Query: 266 DSGELQRAFDHWRTVYNLERPHEA 289 + WR YN +RPH A Sbjct: 227 TLAHARAVIAAWRQGYNEQRPHHA 250 >UniRef50_A3PLB1 Integrase, catalytic region n=59 Tax=Proteobacteria RepID=A3PLB1_RHOS1 Length = 273 Score = 160 bits (404), Expect = 8e-38, Method: Composition-based stats. Identities = 61/283 (21%), Positives = 96/283 (33%), Gaps = 32/283 (11%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 + LC FG+ T Y P + + ++ ++ Sbjct: 16 VPLTKLCAWFGVPRRTVYY------------------KPTKAAPKVDARFADPIKAMIEK 57 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQM 147 +G R + L +T+ + + R + G P I A APN W Sbjct: 58 EPSFGYRTVAWLLGFNKNTVQRIFQIKSWQVRKRQI-GMRPRIEAVPSV-AQAPNERWST 115 Query: 148 DFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV-FERYGLPDRMT- 205 D + G ++D H+R L + T L R+G R+T Sbjct: 116 DLCRVWAGRDGWATLALVIDCHTRELLGWHLSRSGKASTAASALEHALINRFGTLGRVTK 175 Query: 206 -----MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 DNG + T + G++ P+ PQ G +ER R+LK + + Sbjct: 176 EFLLRSDNGLVFTSRKYT-----ALVRSYGLKQEFITPHCPQQNGMVERVIRTLKEQCVH 230 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 + F RA W YN RPH+ALDM P + +A Sbjct: 231 RRRFDSLQHAARAIGDWIAFYNHRRPHQALDMKTPAEAFALAA 273 >UniRef50_Q12FI2 Integrase, catalytic region n=28 Tax=Proteobacteria RepID=Q12FI2_POLSJ Length = 315 Score = 160 bits (404), Expect = 9e-38, Method: Composition-based stats. Identities = 85/312 (27%), Positives = 110/312 (35%), Gaps = 34/312 (10%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 +A T R V ++ G + G+SP T KW QR+AQEG AGL DR P Sbjct: 6 NASMTPKGRAHLVQEIARIGLKPAAAAA--GLSPRTARKWQQRYAQEGRAGLLDRSSRPL 63 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 P RS R +R +I + + A A Sbjct: 64 VCPQRSCASKIERAVRLR-RTQRLTYERIAERVGLSRSAIARACK-----AAGAAKLPAF 117 Query: 128 PGIPATGRFEHDAPNRLWQMDFK--GHFPFGGGRCH--------------PLTLLDDHSR 171 P R+E +P L +D K F G R +DDHSR Sbjct: 118 QNAPPVVRYERASPGELLHLDTKKLHRFDKPGHRVTGDRTQNTPRAGSQALHVAIDDHSR 177 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLP----DRMTMDNGSPWGDTTGTWTALELWLM 227 L DE L++ Y ++ DNGS + L Sbjct: 178 VGFSL-LLPDETARCACAHLLAALRYYKALGVRVAQVMTDNGSAYKSKR-----FAKLLR 231 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPH 287 RLGIR +RPY P+T GK ERF ++L E + DS + W YN RPH Sbjct: 232 RLGIRHIRTRPYTPRTNGKAERFIQTLLREWAYAFIYPDSDARAHELEPWMHHYNFRRPH 291 Query: 288 EALDMAVPGSRY 299 A P SR Sbjct: 292 SATSHRPPASRL 303 >UniRef50_A6Q4E4 Transposase n=2 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q4E4_NITSB Length = 271 Score = 160 bits (404), Expect = 9e-38, Method: Composition-based stats. Identities = 52/280 (18%), Positives = 91/280 (32%), Gaps = 32/280 (11%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 +I LC F + + Y P + + ++ ++ Sbjct: 14 ISITKLCSLFDLPRRSFYY------------------KPIKRMQKLDEGRVKKVKEMIEK 55 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQM 147 +G R++ L + + + R G P H PN+ W + Sbjct: 56 FPTYGYRRLALLLGMNKKAVQRILQLKSWQVRKRS-KGHRPRAKMMPSRSH-YPNQRWAI 113 Query: 148 DFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL-VSVFERYGL------ 200 D + G G ++D ++R + + T + L + R+G Sbjct: 114 DMTRVYSSGDGWSTLACVIDTYTREIVGWRLSKSGKATTAEAVLQEGLIYRFGKLKRLQE 173 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNG + + T TA PY P+ G +ERF R++K E + Sbjct: 174 PIILRSDNGLVFSSKSFTKTA-----QDYNFTQEFITPYTPEQNGMIERFFRTIKEECIW 228 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 F E + W YN +R H AL P ++ Sbjct: 229 HYNFKSLKEANKIIGEWINFYNQKRKHSALQYKTPAEVFR 268 >UniRef50_D2MKS7 Transposase (Fragment) n=3 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKS7_9BACT Length = 327 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. Identities = 71/271 (26%), Positives = 111/271 (40%), Gaps = 19/271 (7%) Query: 41 PATGYKWLQRWAQEGAA-GLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARK--IK 97 T +K+ R+ + GA L + R P R I + +H + G + I Sbjct: 50 RQTFFKYYHRFRESGADHALVPQKRGPKWKQRRRYGYIEQQV----LQHRQQGVNRYEIC 105 Query: 98 RWLE-DQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGH---- 152 + L P+ STV+ + ++GL P R L +D Sbjct: 106 QLLAPKLKRLTPSPSTVYRITHQYGLNRLTPPLQQEKRRIVKQKAGELGHLDCHHLSKDL 165 Query: 153 FPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF----ERYGLP-DRMTMD 207 R + + ++D +R + TD + TV + F +RY L + D Sbjct: 166 MATDPTRYYLVCVIDACTRLAWA-EVVTDLKSLTVMFSALKSFNLLHQRYQLQFAEVLTD 224 Query: 208 NGSPWGDTTGTWT-ALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFAD 266 NGS + T T E L+ LGI+ ++RPY PQT GK+ERF R+L +++ G F Sbjct: 225 NGSEFAARTPPATHPFERMLLELGIKHRYTRPYRPQTNGKVERFWRTLNDDLIAGTTFGS 284 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 E + + + YN RPH+ALD P Sbjct: 285 LEEFRDDLEQYLLYYNEGRPHQALDGKTPKQ 315 >UniRef50_B5YKC0 Putative transposase n=3 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKC0_THEYD Length = 286 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. Identities = 64/305 (20%), Positives = 111/305 (36%), Gaps = 35/305 (11%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD-DI 77 V ++G ++ C+ G+S + Y ++ E P + ++ +I Sbjct: 1 MVKQLLKEGYTVKESCKASGLSRSRYYSFINLREIE--------------KPKKINEIEI 46 Query: 78 TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA---SPGIPATG 134 ++ H WG R++ WL + + TV +M H LL A G Sbjct: 47 LEKIKAIKSEHPFWGYRRVTAWLRHREGVLINHKTVSKIMKEHSLLASQTVHKAKRKAEG 106 Query: 135 RFEHDA-PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH----CTDERRETVQQ 189 R P +W +D G + + +LD +++ + T E + + + Sbjct: 107 RKPRTQRPKEIWGIDMTKFMIPCIGWAYLVVVLDWYTKKIVGWEISLRGRTAEWKSALDK 166 Query: 190 QLVSVFER--YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 LVS F+ G ++ DNGS T A + LGI + +P+ Sbjct: 167 GLVSEFKEGVRGRGLKLVSDNGSQ-----PTSRAFMKEMAVLGIEQIFTSYDNPKGNADT 221 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV-YNLERPHEALDMAVPGS----RYQPS 302 ER R++K E++ F E + + W T YN H AL P Y+ Sbjct: 222 ERVIRTIKEELIWLNEFRSLDEARERIEDWITNCYNKLYVHSALGYLSPEEYELKYYREQ 281 Query: 303 ARQYS 307 R + Sbjct: 282 QRNVA 286 >UniRef50_C3LLF8 IS1627, transposase n=27 Tax=Bacillaceae RepID=C3LLF8_BACAC Length = 274 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 58/292 (19%), Positives = 103/292 (35%), Gaps = 40/292 (13%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 +D +I+ +C GI +T Y+W + A L+ A+L + Sbjct: 1 MKDEYSIKEICILIGIPRSTYYRWKNKEKDVKEAKLE-----------------QAILTI 43 Query: 84 AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP------------ 131 H R+G RK+ L+ + + P TV +M + L Sbjct: 44 CMTNHFRYGHRKVTALLKRKYNYHPNRKTVQKIMQKKNLQCRVKRKRRTWINGESRIVVE 103 Query: 132 --ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 F+ + PN W D + PFG + L+++D ++ + + V + Sbjct: 104 NLLNRNFQANKPNEKWVTDIT-YLPFGTEMLYLLSIMDLYNNEIIAYEISNRQDVTLVLR 162 Query: 190 QLVSVFERYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 + + + D G+ + T A + + GI SR + +E Sbjct: 163 TVEKAIKLQQKTQIILHSDQGAVY-----TSYAFQTLSKKNGITTSMSRKGNCHDNAVIE 217 Query: 249 RFHRSLKAEVLQG--KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 FH SLK+E+ K + L++ + YN ER E L+ P Sbjct: 218 SFHSSLKSELFYSQEKQIHSTSTLKQLIHDYIEYYNTERIQEKLNYLSPIEY 269 >UniRef50_B7AC35 Integrase catalytic region n=11 Tax=Bacteria RepID=B7AC35_THEAQ Length = 333 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 75/307 (24%), Positives = 108/307 (35%), Gaps = 32/307 (10%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + + V + G + + GIS AT Y+W +R +EG AGL+ R R P R Sbjct: 14 RKLKQVEAFRKHGVSWPEIQELLGISRATYYRWRKRLKEEGLAGLKPRSRRPQRLRRRIY 73 Query: 75 D--DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMAR------------- 119 D+ + + WG I L +G + + TV ++A Sbjct: 74 WSSDLLIRVEALRKENPTWGRWPIWLTLRKEGFAV-SERTVGRILAHLEARGRVERVAAF 132 Query: 120 -------HGLLPGASPGIPATGR-FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSR 171 G P R +E AP L Q+D G + LD +R Sbjct: 133 LARARRGKGRPRPQRPYAQRKPRGYEARAPGDLVQLDTLTVTLGPGEVVKHFSALDLVTR 192 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWTALELWLMRLG 230 FSL H T L ++ + P R + +D GS + E LG Sbjct: 193 FSLAQVH-TRATANLAAGFLSALVTKAPFPIRAVQVDGGSEFM------AEFEEACRSLG 245 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 IR+ P P+ G +ER R+ + E LQ D + YN RPH AL Sbjct: 246 IRLFVLPPRSPKLNGHVERMQRTFRDEFYTLPLPRGLVRLQAELDAYLAYYNHRRPHMAL 305 Query: 291 DMAVPGS 297 P Sbjct: 306 GGLAPLE 312 >UniRef50_A9LH60 Integrase n=1 Tax=uncultured planctomycete 13FN RepID=A9LH60_9BACT Length = 209 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 45/200 (22%), Positives = 72/200 (36%), Gaps = 14/200 (7%) Query: 126 ASPGIPATG------RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 +P G R + +W DF G + L ++D+ +R L + Sbjct: 9 RRKRLPRRGSENSCIRRCAQYKDHVWSYDFVADRLEDGRKIRLLVIIDEFTRECLAIEVA 68 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 V L +F G P + DNG + L WL + + P Sbjct: 69 RSFTAMQVIDVLQYLFAVRGSPKHIRSDNGPEFVARK-----LTKWLKQAAVETLFIAPG 123 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 P G +E F+ L+ E+L G+ F GE + D W+ YN R H ++D P + Sbjct: 124 SPWENGYVESFNGKLRDELLNGELFLSLGEARWIIDRWQLDYNHHRLHSSIDYQTPAA-- 181 Query: 300 QPSARQYSGNTTPPEYDEGV 319 +AR S + + Sbjct: 182 -FAARCSSSDRPTASLQKNT 200 >UniRef50_A5BPP5 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BPP5_VITVI Length = 1583 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 60/307 (19%), Positives = 113/307 (36%), Gaps = 31/307 (10%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPA-FSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G T P+ F H + Sbjct: 1174 EEQQGILNXCHENACGGHFASQKXAMKVXXSGFTWPSLFKDAHIICRSCDRCQRLGKLTK 1233 Query: 132 ATGRFEHD----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W + F G FP G + L +D S++ + ++ R + Sbjct: 1234 RNQMPMNPILIVELFDVWGIXFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVL 1293 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++ Sbjct: 1294 XFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQV 1348 Query: 248 ERFHRSLKAEVL---QGKWFADSGEL---QRAFDHWRTVY----NLERPHEALDMAVPGS 297 E +R +K ++ K F WR + + L+ V G Sbjct: 1349 ELANREIKNILMESGNLKCFNSPEPTLGKASDLRPWRFTSLSLASFWEVKDHLEWQVLGE 1408 Query: 298 RYQPSARQYSGNT---TPPEYDE------GVMVRKVDISGKLSVKGVSLSAGKAFRGERV 348 RY+P TP Y+E + ++ KL VK +++ +A + Sbjct: 1409 RYEPLQGASEKKQVTGTPFLYEEYEPSDLKLQETFFFLNTKLGVKKLNMDLIRAGAKRCL 1468 Query: 349 GLKEMQE 355 L EM+E Sbjct: 1469 DLNEMEE 1475 >UniRef50_A8F1E6 Transposase and inactivated derivative n=159 Tax=Bacteria RepID=A8F1E6_RICM5 Length = 373 Score = 157 bits (398), Expect = 4e-37, Method: Composition-based stats. Identities = 71/356 (19%), Positives = 127/356 (35%), Gaps = 49/356 (13%) Query: 7 WDARDTMSLRTEFV------LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 W TM+L + + L ++ +I S C+ G S + Y++ + + G L Sbjct: 6 WKV--TMNLNQKIIKPKLGLLELAKSLGSISSACKAMGYSRDSYYRFKELYETGGEEALY 63 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMA-- 118 + R NR I + + +G ++ L+ +G + + Sbjct: 64 EISRRKPIIANRVDPAIEKAVMDMAIEYPAYGQLRVSNELKKRGILVSPGGVRSIWLRND 123 Query: 119 -------------------------RHGLLPGASPGIPATGRFEHDAPNRLWQMD--FKG 151 + +L A G E P L D + G Sbjct: 124 LNNISKRLKALEAKMAQDGIVLTEAQLQVLEKRRNEKEADGEIETQHPGYLGCQDTYYVG 183 Query: 152 HFPFGGGRCHPLTLLDDHSRFSLCLAHCTD---ERRETVQQQLVSVFERYGLPD-RMTMD 207 +F G+ + +D +SR + + + + +++ +E G+P R+ D Sbjct: 184 NFKGI-GKVYSQVFIDSYSRVADAKLYTDKTALTAADMLNDRVLPWYETQGIPILRILTD 242 Query: 208 NGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV----LQGKW 263 GS + A EL+L GI ++ Y PQT G ERF++++K E ++ K Sbjct: 243 RGSEYKGNIE-HHAFELFLSIEGIEHTTTKAYSPQTNGMCERFNKTMKQEFFDTAMRKKI 301 Query: 264 FADSGELQRAFDHWRTVYNLERPHEA--LDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + D +LQ D W +N ERPH P ++ S + Y E Sbjct: 302 YTDLDDLQLDLDIWLEYFNNERPHSGKYCYGKTPMQTFKDSKKLAVEKNNEILYLE 357 >UniRef50_B9NFA6 Predicted protein n=5 Tax=cellular organisms RepID=B9NFA6_POPTR Length = 736 Score = 157 bits (398), Expect = 4e-37, Method: Composition-based stats. Identities = 88/385 (22%), Positives = 135/385 (35%), Gaps = 46/385 (11%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH 88 I +FG S + + R Q+ L++R R S I L + +H Sbjct: 47 PITGADVQFGASSIERWYYRARHVQDPVDQLKNRLRDDCGHFVSLSPAIIEALVEQYRQH 106 Query: 89 ERWG----ARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG---------- 134 W ++ +D T+P+++TV + GL+ +P G Sbjct: 107 PGWTMQLHYDNLRVATKDGPDTLPSYTTVCRYLKAQGLVRKPTPRSSTDGAIAAAARREA 166 Query: 135 ----RFEHDAPNRLWQMDFKG---HFPFGGGRCH---PLTLLDDHSRFSLCLAHCTDERR 184 +E D LW +DF G+ H L ++DDHSR L DE Sbjct: 167 REVRSYEVDHVAALWHLDFHHGSRKVLTPDGQWHKPLLLCIMDDHSRLVCHLQWFLDETT 226 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ + + GLP + DNG+ L LGI + PY P Sbjct: 227 ASLVHGVSQAIMKRGLPRAIMTDNGAAMMADE-----FVEGLASLGILHQTTLPYSPYQN 281 Query: 245 GKLERFHRSLKAE---VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 K ERF L++ +L+G+ +L A W + H L+ A P RY Sbjct: 282 AKQERFWGQLESRLMAMLEGESHLTLEQLNLATQAWVEQEYHHKEHAELE-ATPLQRYLS 340 Query: 302 SA---RQYSGNT-TPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRG-ERVGLKEMQED 356 A R + R+ G ++ GV A+R E+V L Sbjct: 341 CADVSRPSPEALALRRAFRIRQHRRQRRTDGTFTLDGVRFEIPGAYRHLEQVCL------ 394 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMG 381 SY W S V +ID + +I Sbjct: 395 -SYARWDLSQ-VDLIDARIGAILAA 417 >UniRef50_A5ASD2 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5ASD2_VITVI Length = 1801 Score = 157 bits (398), Expect = 4e-37, Method: Composition-based stats. Identities = 55/293 (18%), Positives = 111/293 (37%), Gaps = 39/293 (13%) Query: 75 DDITALLRMAHDRH---ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + I + L+ G T P+ ++M R + Sbjct: 1428 EEQQGILSHCHENACGGHFASQKTIMKVLQS-GFTWPSLFKDSHIMCRSYDRCQRLGKLT 1486 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1487 RRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVV 1546 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G+ Sbjct: 1547 LKFLKKNIFSRFGVPKAIISDRGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQ 1601 Query: 247 LERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 +E +R +K +++ + L + +RT Y L M S Y+ Sbjct: 1602 VELANREIKNILMKV-VITTRRDWSIKLHDSLWAYRTTYKTI-----LGM----SSYRLV 1651 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L EM+E Sbjct: 1652 YGKACHLPVEVEYKAWWAIKKLN-----------MDLIRAGAKRCLDLNEMEE 1693 >UniRef50_C7MBS5 Transposase n=2 Tax=Micrococcineae RepID=C7MBS5_BRAFD Length = 365 Score = 157 bits (398), Expect = 4e-37, Method: Composition-based stats. Identities = 69/314 (21%), Positives = 106/314 (33%), Gaps = 40/314 (12%) Query: 4 LMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRP 63 + +A T R V G I + F I+ AT KW+ R+ GAAGL++ Sbjct: 1 MTHRNAPLTAVGRRRAVDQVLARGRPIAHVAAEFHIARATLSKWVGRYRAAGAAGLEEHS 60 Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL 123 P H P+R + L+ + ++W AR+I R L D TV + R GL Sbjct: 61 SAPAHRPSRLEGWVVELIEHWRRK-QKWSARRIARELADGHGVHCCVRTVTRWLDRLGLN 119 Query: 124 PGAS-----PGIPATGRFEHDAPNRLWQMDFKGHFPFGG--------------------- 157 + G P + +D K Sbjct: 120 RIRDITPDGGNLRQPGTITARYPGHMIHVDVKKVGKIPDGGGWKVHGRDSALGRASKRGK 179 Query: 158 ----GRCHPLTLLDDHSRFSLCLAHCTDERRETV--QQQLVSVFERYGLPD--RMTMDNG 209 G + + +D SR + + T+ + + F +G+ R+ DNG Sbjct: 180 GRRVGYTYLHSAIDGFSRLAYTEPLEDETAATTIGFLHRAFAFFAAHGITRITRLISDNG 239 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 + A + R +RPY P+ GK+ERF R E L + F E Sbjct: 240 PNYRS-----NAFARSIRGKVSRHQRTRPYTPRHNGKVERFQRITVDEFLYAEVFESEQE 294 Query: 270 LQRAFDHWRTVYNL 283 + W YN Sbjct: 295 RRNRHGVWLHHYNY 308 >UniRef50_A5C4R5 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C4R5_VITVI Length = 1398 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. Identities = 54/294 (18%), Positives = 111/294 (37%), Gaps = 41/294 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 ++ +L H+ + ++K + G T P+ ++M R Sbjct: 1025 EEQQGILNHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDSHIMCR--SCDRCQRLGKL 1082 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + + R Sbjct: 1083 TKRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNVHRV 1142 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 1143 VLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFEALLAKYGVKHKLATPYHPQTSG 1197 Query: 246 KLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 ++E +R +K +++ + L + +RT Y L M+ Y+ Sbjct: 1198 QVELANREIKNILMKV-VITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRL 1247 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++++++ + +A + L EM+E Sbjct: 1248 VYGKACHLPMEVEYKAWWVIKRLN-----------MDLIRAGAKRCLDLNEMEE 1290 >UniRef50_A4TG51 Integrase, catalytic region n=13 Tax=Actinomycetales RepID=A4TG51_MYCGI Length = 278 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. Identities = 59/277 (21%), Positives = 97/277 (35%), Gaps = 31/277 (11%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIP-HHSPNRSSDDITALLRMAHDR 87 + R CR G+ +T R P +P D+ A LR+ Sbjct: 9 SKRLACRAVGLPRSTY------------------ARTPVAQTPADPDADLRATLRIYARE 50 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL-----PGASPGIPATGRFEHDAPN 142 H G R+ L+ + VH L GL P G+ + E DAP Sbjct: 51 HPLHGFRRAWAHLKHDQGVLVNKKKVHRLWKEEGLQVRIYHPRKRAGVSTMPQIEADAPK 110 Query: 143 RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG-LP 201 +W +DF+ +++D+H+R SL E + +L F +G P Sbjct: 111 VVWAIDFQFDSTVDDKAIKICSMIDEHTRLSLLNIVERSITAERLTVELDKAFALWGGPP 170 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + MDNG + L+ + I + + P P G +E F+ L+ E L Sbjct: 171 LVLRMDNGPEFIS-----HVLQQFCGD-RIGISYIPPGTPWNNGHIESFNNRLRKECLNR 224 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 + E + + ++ +N H +L P Sbjct: 225 NHWTSLLEARVVIEDFKDDHNNRHRHSSLGYLTPAEY 261 >UniRef50_A9B8L4 Integrase catalytic region n=5 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B8L4_HERA2 Length = 435 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. Identities = 77/369 (20%), Positives = 130/369 (35%), Gaps = 34/369 (9%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQ----EGAAGLQDRPRIPHHSP 70 RT + L + + R +SP W QR + Q + R P Sbjct: 10 RRTLYELLHTNPDWSNRQFATALNVSPDWVRLWKQRIGSPPHPDPDVVCQSQSRARKTPP 69 Query: 71 NRSSDDITALLRMAHDR-----HERWGARKIKRWLEDQGH-----TMPAFSTVHNLMARH 120 SD + + H GA+ I +L+ + +TV+ ++ H Sbjct: 70 PAWSDRVIHRILTLRQELAAQFHRTVGAKTILAYLQRDPDLADDRIPRSPTTVNRILRDH 129 Query: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFG----GGRCHP---LTLLDDHSRFS 173 LL + ++DF G R H +D + Sbjct: 130 QLLVDPPTHQRQPRTPCPP--MQEIEIDFTDVTTIPTNPDGKRQHAAEAFMWVDAGTSIR 187 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW---TALELWLMRLG 230 + TD +V + S+ ++ GLP R+ MD + + L+ LG Sbjct: 188 VAARISTDFHMASVIRTTASILQQIGLPARIRMDCDVRLVSNKRVADFPSPFQRLLLNLG 247 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE-- 288 I+V P+ P + +ERFH++ K E + W E Q D + Y ERPH+ Sbjct: 248 IQVDVCPPHRPDLKPFVERFHKNYKGESVYPNWPTTEAEAQVQVDAYCDWYRTERPHQGR 307 Query: 289 ALDMAVPGSRYQPSAR------QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKA 342 A P + Q + + D VR+V+ GKL + G + +AG A Sbjct: 308 ACGNRPPAEAFPELPVLPPVPAQVDADGWLKQIDGWTFVRRVNAQGKLMLDGATYTAGIA 367 Query: 343 FRGERVGLK 351 + G+ + ++ Sbjct: 368 YAGQELAVQ 376 >UniRef50_A1VJC3 Integrase, catalytic region n=25 Tax=Bacteria RepID=A1VJC3_POLNA Length = 325 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 68/288 (23%), Positives = 99/288 (34%), Gaps = 42/288 (14%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA---HD 86 + C GIS A Y P R D LLR+ + Sbjct: 1 MSRQCVLAGISRAALYA--------------------RRKPKRIVQDDELLLRLIDEEYT 40 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD------- 139 RH +G+RK+ L GH++ LM GL A + +H Sbjct: 41 RHPFYGSRKMVVHLGRCGHSV-NRKWAQRLMRSLGLAGMAPGPNTSRAHPQHKVYPYLLR 99 Query: 140 -----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 PN++W D + G + + ++D +SR L L Sbjct: 100 GVAISRPNQVWSTDIT-YIRLARGFAYLVAVIDWYSRRVLSWRISNSMETVFCVDCLEEA 158 Query: 195 FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 +G P+ D GS + T A L R G+ + +ER RS+ Sbjct: 159 LRIHGKPEVFNTDQGSQF-----TSEAFTSVLKREGVIISMDGRGRALDNIFVERLWRSV 213 Query: 255 KAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 K E + K ++ GEL + T YN ERPH+AL P YQ + Sbjct: 214 KHEDVYLKGYSAMGELLIGLTQYFTFYNGERPHQALKNLTPDVVYQRA 261 >UniRef50_C4FKG6 Integrase, catalytic region n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FKG6_9AQUI Length = 305 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 53/247 (21%), Positives = 90/247 (36%), Gaps = 20/247 (8%) Query: 67 HHSPNRSSDD--ITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 P S +D + + + H +GAR++++ LE G + S + M L Sbjct: 46 KKEPFSSEEDKILLDAIDKIYTEHPYYGARRMQKALESIGIKVGKRKLSRTYKFMGIRAL 105 Query: 123 LPGASPGIPATGRFEHD----------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF 172 P I ++ PN++W D + G + T++D HS+ Sbjct: 106 YPPPKTTILNKENKKYPYLLEQITTTQRPNQIWSGDIT-YIKLEKGYAYLATIIDWHSKK 164 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 L L ERYG P+ D GS + T L + GI+ Sbjct: 165 VLSWKLGNTMDSYLTTSILEEAIERYGKPEIFNSDQGSQY-----TSKEHIEILEKNGIK 219 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDM 292 + + +ERF R+LK E + K + E + + + +YN +R H ++ Sbjct: 220 ISMNANGRSIDNTVIERFWRALKYENVYPKGYNTIKEAREGINQYIEIYNSQRIHSSIGY 279 Query: 293 AVPGSRY 299 P Y Sbjct: 280 KTPDMVY 286 >UniRef50_C5PML6 Transposase OrfB n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PML6_9SPHI Length = 267 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 51/287 (17%), Positives = 90/287 (31%), Gaps = 34/287 (11%) Query: 15 LRTEFVL-FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 R E V + G +I C+ +S + Y ++ Sbjct: 9 ERKELVDGEMKEQGISIHRACKIVCMSRSMYYY----------------------VHKKN 46 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + + L R+ G + + +G V + L Sbjct: 47 DQTVISKLMDLAARYPSRGFQTYYGKIRLEGLLW-NRKRVLRVYRSINLKLRIKRKRCIP 105 Query: 134 GRFE-----HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R + + N W +DF G R L ++DD++R SL + Sbjct: 106 PRIKEKLLVPGSVNETWSIDFMSDSLANGRRFRVLNVIDDYNRESLINEAFYSIPGGRLV 165 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 Q++ + P R+ DNG + + GI + + +P P +E Sbjct: 166 QKIKELIIDRSTPKRIRTDNGPEFLS-----KVFTDFCTENGIELQYIQPGKPAQNAYIE 220 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 R +R+ + +VL F E+ W+ YN P AL+ P Sbjct: 221 RLNRTFREDVLDAYLFDSLTEVNAIAYEWQIDYNENHPDTALNGLSP 267 >UniRef50_A5CBG5 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5CBG5_VITVI Length = 2329 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 56/293 (19%), Positives = 111/293 (37%), Gaps = 39/293 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 D+ +L H+ + ++K + G T P+ ++M R Sbjct: 2026 DEQQGILNHCHENACGGHFASQKTAMKVLQSGFTWPSXFKDAHIMCR--SCDRCQRLGKL 2083 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 2084 TKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCRKNDHRV 2143 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 2144 VLKFLKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSG 2198 Query: 246 KLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K +++ + L + +RT Y L M+ Y+ Sbjct: 2199 QVELANREIKNILMKVVNASRKDWSIRLHDSLWAYRTXYKTI-----LGMSP----YRLV 2249 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L EM+E Sbjct: 2250 YGKACHLPVEVEYKAWXAIKKLN-----------MDLIRAGAKRCLDLNEMEE 2291 >UniRef50_B8KLM8 Integrase, catalytic region n=2 Tax=gamma proteobacterium NOR5-3 RepID=B8KLM8_9GAMM Length = 272 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 66/166 (39%), Gaps = 5/166 (3%) Query: 138 HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 N+ W +DF G R L ++DD SR + V + L +FE Sbjct: 112 PPRVNQRWSIDFVSDQLSSGRRFRVLNVVDDFSREMVGQLVAVSITGSQVARFLSELFED 171 Query: 198 YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE 257 P ++ DNG+ T A+ W G+++G +P P +E + + E Sbjct: 172 REKPQKIICDNGTE-----CTSKAMFFWSQESGVKLGFIQPGKPTQNAFVESLNGKFRNE 226 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 L WF + + W+ YN RPH +L+ P + + +A Sbjct: 227 CLNRHWFRSLDDAKTEIMLWQNQYNNVRPHSSLNYLPPVAFARQAA 272 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 46/117 (39%), Gaps = 5/117 (4%) Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 V + L + E G P+ + DNG + AL W + I + + +P Sbjct: 2 DFSLPAPRVIRALDQIIEWRGKPEALRCDNGPEYIS-----QALVAWANQQRITLMYIQP 56 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P +ERF+R+++ E L F Q W YN ERP+ A+ P Sbjct: 57 GKPTQNAYIERFNRTVRHEWLDLHSFVSLDHAQNLATQWLWQYNNERPNTAIGGVPP 113 >UniRef50_B0UC72 Integrase catalytic region n=4 Tax=Alphaproteobacteria RepID=B0UC72_METS4 Length = 263 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 53/211 (25%), Positives = 86/211 (40%), Gaps = 10/211 (4%) Query: 81 LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA 140 L+ R+G R+++ L +G + T + P R+ Sbjct: 52 LKELAATRVRYGYRRLQILLRREGWAVNHKRTYRLYRDEGLSIRPKLPRQKRAWRYRQGR 111 Query: 141 P-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 P N +W +DF F G LT++D H+R +L LA + R V + L ++ Sbjct: 112 PAIGGPNEVWAIDFMSDRLFDGRPFRILTVVDCHTREALSLAPRANFRAYQVVEALDALV 171 Query: 196 ERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 G P + +DNG + L+ W+ G+ + SRP P +E F+ L+ Sbjct: 172 RLRGRPKSLRVDNGPEFAGRM-----LDRWVYLNGVELYFSRPGKPTDNAYIENFNGRLR 226 Query: 256 AEVLQGKWFADSGELQRAFDHWRTVYNLERP 286 AE L WF + + + WR YN +RP Sbjct: 227 AECLNASWFLSLTDARERIEEWRPHYNKDRP 257 >UniRef50_A5C1P8 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5C1P8_VITVI Length = 1601 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 50/292 (17%), Positives = 106/292 (36%), Gaps = 37/292 (12%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPA-FSTVHNLMARHGLLPGASPGIP 131 ++ +L H+R + ++K + G + P+ F H + Sbjct: 1231 EEQQGILSHCHERACGGHFTSQKTTMKVLQSGFSWPSLFKNAHTMCRSCDRYQRLRKLTR 1290 Query: 132 ATGRFEHDAP----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W +DF G FP G + L +D S++ + ++ R + Sbjct: 1291 RNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVL 1350 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++ Sbjct: 1351 KFLKENIFSRFGVPKSIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQV 1405 Query: 248 ERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 E +R + +++ + L + ++T Y + + P Y Sbjct: 1406 ELANREIMNILMKVMS-TSRRDWSIKLHDSLWAYKTTY------KTIFGMSP---YHLVY 1455 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++KV+ + + + L EM+E Sbjct: 1456 GKACHLPVEVEYKAWWAIKKVN-----------MDLIRVGAKRCLDLNEMEE 1496 >UniRef50_Q39TE2 Putative uncharacterized protein n=2 Tax=Geobacter metallireducens GS-15 RepID=Q39TE2_GEOMG Length = 389 Score = 154 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 78/385 (20%), Positives = 142/385 (36%), Gaps = 24/385 (6%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 LR V ++G S+C G S + YKW+ R + +R R P PNR+ Sbjct: 7 QLRVLAVQR-FRNGETPESICTSLGKSRSWLYKWVARQNGDDPVWSDERSRCPQSMPNRT 65 Query: 74 SDDITALLRMAH----DRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLLPGASP 128 + +I +++M ++ GA+ I LED G +P+ T++ ++AR+ L + Sbjct: 66 TAEIEEIVKMVRLNLYNKGLFCGAQAILWELEDLGVKPLPSTRTINRILARNELTHRRTG 125 Query: 129 GIPATGR----FEHDAPNRLWQMDFKGH-FPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 A G PN+ Q D G + G R + L ++D + L + Sbjct: 126 KYEAKGTLYPVLPSALPNQTHQADLVGPCYLTGPIRFYSLNVVDTAT-VRCGLHSSRSKA 184 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWT---ALELWLMRLGIRVGHSRPYH 240 + V L V++R G+P+R+ +DN + + L + + Sbjct: 185 GQMVIDGLWEVWKRLGIPERLQVDNAMSFFGSPTHPRGMGPLIRLCLHNDVEPWFIPMAE 244 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS--- 297 P G +E+F+ + L EL+ + +N + + L + P Sbjct: 245 PWRNGMIEKFNDRYQQRFLGKVIMTSEEELKVGSLTFEQRHNSKYRYSKLKGSTPLKALA 304 Query: 298 ------RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLK 351 R+ PE + VR + KL++ G S E V Sbjct: 305 TSHVQLRFPTEDEPPRHRLKKPEVGKYHAVRLIRSDLKLNIFGECFSVPPETALEYVVAT 364 Query: 352 EMQEDGSYEVWWYSTKVGVIDLKKK 376 ++ +++ +V D K + Sbjct: 365 IDVKEQKLKLFLDKNQVEEFDYKLR 389 >UniRef50_A1UPS7 Integrase, catalytic region n=17 Tax=Bacteria RepID=A1UPS7_MYCSK Length = 358 Score = 154 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 59/291 (20%), Positives = 89/291 (30%), Gaps = 32/291 (10%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 R+ + + G + R C G+ +T L P Sbjct: 79 RKRSAVIALRERFGVSERRACTVVGLHRSTMR-------------LTPAPV------TTE 119 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA- 132 ++ A LR RWG R+ + G + L GL Sbjct: 120 EAELRAWLRRFSTDRPRWGWRRAAKMARRAGWKANN-KRIRRLWREEGLRVPQRRRKKRL 178 Query: 133 ------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 G PN +W MDF+ G L ++D+ +R +L + + Sbjct: 179 TGIGVAVGAMSPIRPNVIWAMDFQFDTTADGRTLKMLNVIDEFTREALAIEVDRAINADG 238 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 V L + YG P + DNG + A+ W P P Sbjct: 239 VVDVLDRLALTYGAPHYVRFDNGPEF-----VANAVADWCRFNSAGSLFIDPGSPWQNAW 293 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 +E F+ L+ E+L F E + + WR YN RPH A P Sbjct: 294 IESFNGRLRDELLNLWRFDSLLEARVIIEDWRRDYNANRPHSAHGELTPAE 344 >UniRef50_B0SFL5 Transposase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SFL5_LEPBA Length = 206 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. Identities = 48/207 (23%), Positives = 79/207 (38%), Gaps = 14/207 (6%) Query: 96 IKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT-----GRFEHDAPNRLWQMDFK 150 I + + G + V L ++ GL P PN +W +DF Sbjct: 2 IYKSMRKAGWRI-NHKKVCRLYSQEGLKIRTKPRKKRKLADSKPIPIPTRPNEVWAIDFL 60 Query: 151 GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGS 210 G + L+ +D +R ++ L+ E + + L S+ E LP DNG Sbjct: 61 HERTIDGRKARILSGVDLCTRENVVLSADYSISSERLIRFLESLPE---LPKSFITDNGP 117 Query: 211 PWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGEL 270 + T WL + I + + P P +E F+ L++E LQ + D EL Sbjct: 118 EF-----TSRVFINWLSKNNIGISYIDPGKPTQNAFVESFNGKLRSECLQLSFCRDLTEL 172 Query: 271 QRAFDHWRTVYNLERPHEALDMAVPGS 297 + ++ YN ER H +L+ P Sbjct: 173 RNELSKFQKDYNEERLHSSLNYLTPLE 199 >UniRef50_A6VYF3 Integrase catalytic region n=14 Tax=Bacteria RepID=A6VYF3_MARMS Length = 290 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 57/284 (20%), Positives = 98/284 (34%), Gaps = 40/284 (14%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH 88 +I+ C I+ +T Y + G + ++ L+ H ++ Sbjct: 26 SIKRQCELLNIARSTAY-----YQPIGL--------------STEEIELRRLIDEIHLQY 66 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------- 140 G+R+I+ L + H + + +M G + P T + Sbjct: 67 PYMGSRRIRTELAKKDHHV-NRKRIVRIMRDMG-IGAIYPKPKTTVTNQAHKVYPYLLRD 124 Query: 141 -----PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 PN+ W +D + P G + + ++D +SR L + L Sbjct: 125 IKVTYPNQAWAIDIT-YIPMAKGFLYLVAIIDWYSRKVLSWRLSNTMDVSFCIEALEEAL 183 Query: 196 ERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 + YG PD D GS + T T L+ G+R+ +ER RSLK Sbjct: 184 KHYGPPDIFNSDQGSQFTSTEFTQKLLD-----HGVRISMDGKGRWVDNVFIERLWRSLK 238 Query: 256 AEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 E + K + E + H+ YN R H+ L+ P Y Sbjct: 239 YEEVYLKAYTTPREAELEISHYMVFYNEARHHQGLNELTPDEVY 282 >UniRef50_C1XUW8 Transposase n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XUW8_9DEIN Length = 278 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. Identities = 66/292 (22%), Positives = 111/292 (38%), Gaps = 43/292 (14%) Query: 22 FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL 81 A ++ +R LCR G+ +T Y + +G PN + L Sbjct: 1 MALKEAYPLRLLCRALGVPRSTLY-----YRSKG--------------PNPEEAVLRGRL 41 Query: 82 RMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR------ 135 R R+G R++ L +G + V +LM R GLL P P T Sbjct: 42 RELAGAWPRYGYRRLAALLRGEGFGV-GEKRVRSLMRREGLLLTRKPLKPRTTLPEELLP 100 Query: 136 ---------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 E +++W D + G G + ++D H+R L +A + Sbjct: 101 EGVPNLLLGLEVTGFHQVWVAD-LSYVVLGEGVAYLAVVMDLHTRKILGVALGPRL-SQG 158 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 + + + R G P+ D G + T A L+ LG+R+ ++ P G Sbjct: 159 LALAALEMALREGCPEVHHSDRGVQY-----TSRAYVERLLGLGVRLSYAGTGRPWENGH 213 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPGS 297 ER R++K E + + + E + + + + VYN +RPH AL P + Sbjct: 214 AERLIRTVKEEWVDLREYRTLEEARASVEAFVFEVYNRKRPHSALGYLTPEA 265 >UniRef50_A5BJN2 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5BJN2_VITVI Length = 1380 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 54/290 (18%), Positives = 110/290 (37%), Gaps = 41/290 (14%) Query: 77 ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRF 136 + ++R E+ G + E+ T P+ ++M R T R Sbjct: 849 VDQIIRKCVPEEEQQGI--LSHCHENAWFTWPSLFKDSHIMCR--SCDRCQRLGKLTKRN 904 Query: 137 EHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 + +W +DF G FP G + L +D S++ + + ++ R ++ Sbjct: 905 QMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPYKHNDHRVVLKF 964 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++E Sbjct: 965 LKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVEL 1019 Query: 250 FHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQ 305 +R +K +++ + L + +RT Y L M+ Y+ + Sbjct: 1020 ANREIKNILMKV-VITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRLVYGK 1069 Query: 306 YSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY +++++ + +A + L EM+E Sbjct: 1070 ACHLPVEVEYKAWWAIKRLN-----------MDLIRARAKRCLDLNEMEE 1108 >UniRef50_A5BI07 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5BI07_VITVI Length = 1803 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 45/219 (20%), Positives = 89/219 (40%), Gaps = 28/219 (12%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G Sbjct: 1199 KLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFG 1258 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 +P + D G+ + + E L + G++ + PYHPQT G++E +R +K ++ Sbjct: 1259 VPKAIISDGGAHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILM 1313 Query: 260 QGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYD 316 + + L + +RT Y L M+ Y+ + EY Sbjct: 1314 KVVNASRKDWSIRLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKACHLLMEVEYK 1364 Query: 317 EGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L EM+E Sbjct: 1365 AWWAIKKLN-----------MDLIRAGAKRCLDLNEMEE 1392 >UniRef50_A0L7D7 Integrase, catalytic region n=5 Tax=Bacteria RepID=A0L7D7_MAGSM Length = 272 Score = 151 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 57/240 (23%), Positives = 98/240 (40%), Gaps = 11/240 (4%) Query: 64 RIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLL 123 RIP + A + H+ H G R++ + D + ++V ++ GL+ Sbjct: 8 RIPRDFW-LEEWEREATVNFFHE-HPDEGYRRLTYMMLDADVVAVSPASVLRVLRAAGLM 65 Query: 124 PG--ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 P TG + P++ W +D + G + ++LD SRF L Sbjct: 66 RKWSPPPSQKGTGFKQPLEPHKHWHIDI-SYLNIQGTFYYLCSVLDGCSRFILHWEIRES 124 Query: 182 ERRETVQQQLVSVFERYG-LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 + + V+ L+ E Y R+ DNG + + ++ G+ + PY+ Sbjct: 125 MKEDEVEVILLRAKEAYPEAKPRVISDNGPQF-----VAKDFKTFIRESGMTHVRTSPYY 179 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQ+ GKLERFH +LK E ++ + + QR + + YN R H A P R + Sbjct: 180 PQSNGKLERFHGTLKRECIRPQTPLSLEDAQRVVEGYVEHYNTYRLHSATGYITPKDRLE 239 >UniRef50_A9B8J0 Integrase catalytic region n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B8J0_HERA2 Length = 577 Score = 151 bits (380), Expect = 6e-35, Method: Composition-based stats. Identities = 73/381 (19%), Positives = 130/381 (34%), Gaps = 28/381 (7%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P+ R +L S +G + + IS +T Y RW +EG AGLQ + R Sbjct: 139 PFHDNPDPIQRRHAILVLSLEGWTKKRIATYLQISRSTVYNTFARWHKEGFAGLQAKSRA 198 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P + + I +R H GA ++ L +G + + T +MA + L Sbjct: 199 PRRRHPKVTIAIQQRVRRLQRNH-LLGAWRMHAALRREGIRL-SPRTCGRIMAVNRDLCP 256 Query: 126 ASPGIPATGR-------FEHDAPNRLWQMDF--KGHFPFGGGRCHPLTLLDDHSRFSLCL 176 P + + F ++ W +D GGG + +++++++SR L Sbjct: 257 ELPKRQRSRKHEPRAMPFAAQYRHQYWTIDIRYLDMHRLGGGHIYCISIVENYSRAILSS 316 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 A + V + L +YG PD + D+GS + L+ L I+ Sbjct: 317 AISRIQDTTAVLKVLYDAVAKYGCPDGIVSDSGSVFRSHR-----LQEVCQHLRIQQCPI 371 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDH----WRTVYNLERPHEAL-- 290 P +E + + + + + A H W YN E H A Sbjct: 372 EKRQPWQS-YIETTFGIQRRMADEAEEGFRAAQSWDALWHAHRTWLLHYNTE-VHWAHRQ 429 Query: 291 ---DMAVPGSRY-QPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGE 346 P R Y + + R++D G L V+ + + E Sbjct: 430 RQDGRETPAEVLTWIRGRPYPERLLQRIFAATRVKRRLDRVGFLRVRRWRIYSEIGLAKE 489 Query: 347 RVGLKEMQEDGSYEVWWYSTK 367 V + + + + + Sbjct: 490 AVEVWLEAQHVTITYADHHLR 510 >UniRef50_A5AQ03 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5AQ03_VITVI Length = 1873 Score = 151 bits (380), Expect = 6e-35, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 88/215 (40%), Gaps = 34/215 (15%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ +F R+G+ Sbjct: 1585 IFDVWGIDFMGPFPMSFGHSYILVGVDYISKWVEAIPCRSNDHKVVLKFLKDHIFARFGV 1644 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1645 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1699 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVM 320 L + +RT Y L M+ Y+ + EY Sbjct: 1700 ---------LLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEYKAWWA 1741 Query: 321 VRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1742 IKKLN-----------MDLIRAGLKRCLDLNELEE 1765 >UniRef50_Q1N8F6 Transposase n=2 Tax=Sphingomonas RepID=Q1N8F6_9SPHN Length = 466 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 86/404 (21%), Positives = 141/404 (34%), Gaps = 53/404 (13%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 R + DG + L R +S T +WL R+ EG AGL PR + Sbjct: 13 ERYRVLEPHLADGIPLADLARTGTLSERTLQRWLGRYRAEGLAGLARLPRNDRGRLH-LP 71 Query: 75 DDITALLRMAHDRHERWGARKIKRWLED----QGHTMPAFSTVHNLMARHGLLPGASPGI 130 + + L R + R I R +++ GH P+++ V ++ A+ Sbjct: 72 EHLVELTRTLATKRPRPPVAAIHRKVQELAIAHGHRTPSYAAVARVVRAIPASQIAAASD 131 Query: 131 PATGRFEHD--------APNRLWQMDFKG-HFPFGGG-----RCHPLTLLDDHSRFSLCL 176 PA R +H+ N +WQ D R ++DDHSR Sbjct: 132 PAVYRDQHELVHRREAATSNEMWQADHTVLDILVLDDAGTPVRPWLTVIVDDHSRAIAGY 191 Query: 177 AHCTDERRE-TVQQQLVSVFERY--------GLPDRMTMDNGSPWGDTTGTWTALELWLM 227 D L R G+P+++ +DNGS + +E + Sbjct: 192 FLSLDAPSALNTALALRQAIWRKPNPEWIVSGIPEQLYVDNGSDFISEH-----IEQACI 246 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL------------QGKWFADSGELQRAFD 275 L IR+ HS P P+ +GK+ER R++ L EL+ F+ Sbjct: 247 ALKIRLIHSLPGRPRGRGKIERLFRTINDMFLPDLPGHLIAGKPLSAPVLTLDELRARFE 306 Query: 276 HWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV----RKVDISGKLS 331 + RPH + P +R+Q + + + D ++ RKV G + Sbjct: 307 AFVCGVYHRRPHGSTG-EPPITRWQKGGFLPAMPDSLEQLDMLLVHVPKPRKVLRDG-IR 364 Query: 332 VKGVSLS--AGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDL 373 + G AF GE+V D + ++ + L Sbjct: 365 LMGRRYVEPTLAAFVGEQVEAVYDPRDLTEIHVYHQGRFVCRAL 408 >UniRef50_UPI0001B416F4 ISA0963-5 transposase n=6 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0001B416F4 Length = 318 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 49/300 (16%), Positives = 113/300 (37%), Gaps = 32/300 (10%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + ++++ + + + I+ +++ G I Sbjct: 36 EKKIKYIIREKNKRRSSTEIAKEMKITTRYVNYIYKKYRDNGEY------TIGKRKHREL 89 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + +++ + G +I+++L+ +G + A + ++ ++ ++ ++ Sbjct: 90 NSKDIEIVKKIRYEYPMSGPERIRKYLKRKGIII-AKNNIYRILLLLNMVDNSNNKKKQR 148 Query: 134 G--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 ++E N LW MD+ + + + DD SRF + + +E + + L Sbjct: 149 KYIKYERKHSNSLWHMDWTKYSDSEK----LIIIEDDASRFIVGMGIYGEETIDNTIEAL 204 Query: 192 VSVFERYGLPDRMTMDNGSPWGDT-----TGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 YG P+ + D+G+ + G + +L I+ R HP+T GK Sbjct: 205 EIAINTYGKPEEILTDHGTQFFSNGKNGIPGDHNKFQEYLDNSNIKHILGRVKHPETNGK 264 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL----DMAVPGSRYQPS 302 LER + ++K +F+ E+ YN ER H++L ++ P Y+ Sbjct: 265 LERLNYTIKR---LRPYFSTWEEV-------VYHYNYERMHDSLSDGDNIVTPAMAYKNK 314 >UniRef50_A5C2R0 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5C2R0_VITVI Length = 2116 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 56/294 (19%), Positives = 111/294 (37%), Gaps = 41/294 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 D+ +L H+ + ++K + G T P ++M R Sbjct: 1743 DEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPFLFKDAHIMCR--SCDRCQRLGKL 1800 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ + Sbjct: 1801 TKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHKV 1860 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L R G++ + PYHPQT G Sbjct: 1861 VLKFLKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSRYGVKHKVATPYHPQTSG 1915 Query: 246 KLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 ++E +R +K +++ + + L + +RT Y L M+ Y+ Sbjct: 1916 QVELANREIKNILMKVVN-SSRKDWSIRLHDSLWAYRTTYKTI-----LGMSP----YRL 1965 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K+++ +A + L EM+E Sbjct: 1966 VHGKACHLPVEVEYKAWRAIKKLNLD-----------LIRAGEKRYLDLNEMEE 2008 >UniRef50_A7VEZ2 Putative uncharacterized protein n=3 Tax=Bacteria RepID=A7VEZ2_9CLOT Length = 327 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 61/310 (19%), Positives = 114/310 (36%), Gaps = 25/310 (8%) Query: 7 WDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIP 66 M R + +A + G + ++ + Y+W +R+ L+DR R P Sbjct: 19 ATITQDMRYRLSLIKYAERFG--VTKAAIKYKTNRQYIYRWKRRY-DGSIESLRDRSRRP 75 Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA 126 HH PN+ + + L+ R+ G L+ +G++ + ++ + + G++ Sbjct: 76 HHHPNQHTPEEIKLIFDMRRRNPNAGLVVFWVKLKQRGYSR-SIPGLYRFLRKQGIMAVH 134 Query: 127 SPGIP--ATGRFEHDAPNRLWQMDFKGHFPFG-------GGRCHPLTLLDDHSRFSLCLA 177 P + D P + Q+D K G + T +D++SR+ A Sbjct: 135 PPNPKYIPKPYEQMDYPGQRIQVDVKFVPSACLKNPKVIGKQFFQYTAIDEYSRWRFVEA 194 Query: 178 HCTDERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTA----LELWLMRLGIR 232 + + + + + LP + DNG+ + + T ++ L + GIR Sbjct: 195 FEEHNTYSSAM-FIEHLVKAFPLPIQCIQTDNGAEFTNRFTTHRDKPTLFQVHLKQHGIR 253 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHW-RTVYNL--ERPHEA 289 RP+ P+ GK+ER HR F + + + R YN RP Sbjct: 254 HKVIRPFTPRHNGKVERSHRKDNERFYATHTFYSFEDFAKQLKVYNRRDYNNFPMRP--- 310 Query: 290 LDMAVPGSRY 299 L P Sbjct: 311 LGWKSPNQVL 320 >UniRef50_A1VBQ7 Integrase, catalytic region n=9 Tax=Proteobacteria RepID=A1VBQ7_DESVV Length = 349 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 66/325 (20%), Positives = 113/325 (34%), Gaps = 39/325 (12%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHH-SPN 71 ++ R +L + + N+ CR G S Y+ + + GA GL DR P PN Sbjct: 7 VARRKLSLLELASELDNVSKACRIMGYSRQQFYEIRRNYQTFGAEGLADRLPGPREPHPN 66 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNL-----MARHGLLPGA 126 R + + G ++ + L QG + + + RH L Sbjct: 67 RVDEATEDRILAYSLEFPTHGPVRVAQQLVLQGVQVSSGGVRGVWSRNGMLTRHERLLRL 126 Query: 127 SPGIPATG---------------------RFEHDAPNRLWQMDFKGHFPFGG-GRCHPLT 164 + TG E L +D G G+ + + Sbjct: 127 ERHVRDTGIALNDDQVRTLERFSPEFRERHIETRCSGDLVAVDTFFVGTLKGVGKIYLQS 186 Query: 165 LLDDHSRFSLCLAHCTDERRETVQ---QQLVSVFERYGLPDR-MTMDNGSPWGDTTGTWT 220 +D HSR++ + + V + ++ FE + P + DNG + Sbjct: 187 AIDCHSRYAFGRLYTSKLPVTAVHMLNESVLPFFEEHDTPVVTVLSDNGREFCGRPDRH- 245 Query: 221 ALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL----QGKWFADSGELQRAFDH 276 EL+L I +R PQ+ G +ER HR+L E + W+ E+Q + Sbjct: 246 PYELFLQLENIEHRTTRVRRPQSNGFVERLHRTLLDEHFRIQGRRNWYESLDEMQSDLNA 305 Query: 277 WRTVYNLERPHEA--LDMAVPGSRY 299 + YN ER H+ ++ P + Sbjct: 306 YLHHYNHERAHQGRNMNGRTPYQAF 330 >UniRef50_Q46NI5 Integrase, catalytic region n=26 Tax=Proteobacteria RepID=Q46NI5_RALEJ Length = 276 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 61/286 (21%), Positives = 96/286 (33%), Gaps = 38/286 (13%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH 88 + + GI+ ++ Y Q RP + + H H Sbjct: 9 PVSRQAKLVGIARSSAYY-------------QPRPVSDA------DLKLMRRIDELHLEH 49 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEH---------- 138 GAR + R L +G + V LM R G+ + H Sbjct: 50 PFAGARMLGRLLRREGIPV-GRRHVRTLMKRMGIEALYRRPNTSRKHAAHKIWPYLLRDR 108 Query: 139 --DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 D N++W +D + P G + ++D SR L + L F Sbjct: 109 KIDRANQVWALD-TSYIPMARGFVYLTAVVDWASRKVLAYRLAITLESCHAVEALEEAFA 167 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 +YG P+ + D GS + T T L GI + + +ER RS+K Sbjct: 168 KYGTPEIVNTDQGSQFTATEFTDAVLNP-----GILLSMDGKGSWRDNVFVERLWRSVKY 222 Query: 257 EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 E + K + +R+ ++ T YN RPH +L P Y S Sbjct: 223 EEVYLKAYDSVSHARRSIANYLTWYNQRRPHSSLADQTPDEAYFAS 268 >UniRef50_UPI00005104D7 transposase n=1 Tax=Brevibacterium linens BL2 RepID=UPI00005104D7 Length = 401 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 58/275 (21%), Positives = 98/275 (35%), Gaps = 11/275 (4%) Query: 76 DITALLRMAHDRHERWGARKIKRWL----EDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ + + +G + I G +P+ +T+ L+A G + + P Sbjct: 4 ELVRIRKQLKTGGWDYGPKTIHYEAIIADAFPGGKVPSPATIARLLASVGHVEASPKKRP 63 Query: 132 ATGR--FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL-AHCTDERRETVQ 188 + F LWQ+D + G LLDD +RF + AH E Sbjct: 64 KSCYIPFARSTAMALWQLDAFEYTLTTGTIVTIYQLLDDATRFDVGTSAHSRAENSADAH 123 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + L + YG P + DN S + G A+E +L G P P TQGK Sbjct: 124 EILAAAITEYGAPKEVLSDNSSAFNQLRQGRIGAVETFLASKGAMPISGLPGKPTTQGKN 183 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 ER ++L L A +++ + YN RPH+++ A P + + + Sbjct: 184 ERSRQTLIR-FLDANTPASLEKIRALLRRFHDHYNNRRPHQSIGGATPATAWNLLEHTPA 242 Query: 308 GNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKA 342 P E + + ++ G A Sbjct: 243 TGPIPMAVLEAKAAEYLSKR--IRLRRNLNQVGLA 275 >UniRef50_A5B2X9 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5B2X9_VITVI Length = 1595 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 90/218 (41%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1295 IFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGV 1354 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1355 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1409 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y+ + EY Sbjct: 1410 VVNVNRKDWSIKLLDSLWAYRTTYKTI-----LGMSP----YRLVYGKACHLPMEIEYKA 1460 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1461 WWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1487 >UniRef50_A8LT45 Integrase n=5 Tax=Bacteria RepID=A8LT45_DINSH Length = 497 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 84/409 (20%), Positives = 135/409 (33%), Gaps = 54/409 (13%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P D R R + D + + R GI T +W R+ + G AGL PR Sbjct: 19 PEDRRAEALRRFNILRQHLIDEVPLTEVARVSGIPLRTLQRWTSRYQRFGLAGLARAPRS 78 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL----EDQGHTMPAFSTVHNLMAR-- 119 R S ++ L+ R I R + + + +P+++T+H+++ Sbjct: 79 DAGQ-RRLSSELVELIEGLALHKPRLSTAAIHRRIIPIVKSRDWPVPSYATIHSIVNSLD 137 Query: 120 -------HGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGG------RCHPLTLL 166 H R + PN +WQ D R +L Sbjct: 138 PALVTLAHDGAAAYRDRFEMIHRHRAERPNAVWQTDHTQLDLIILDTNGAPVRPWLTIVL 197 Query: 167 DDHSRFSLCLAHC-TDERRETVQQQLVSVFERY--------GLPDRMTMDNGSPWGDTTG 217 DDHSR A L R GLPD + D+GS + Sbjct: 198 DDHSRAVAGYAVFVGAPSAIQTALALRQAIWRKDTPSWPICGLPDVLYTDHGSDF----- 252 Query: 218 TWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL------------QGKWFA 265 T LE L I + S PQ +GK+ERF ++ E+L Sbjct: 253 TSKHLEQVAADLRIELVFSTVGRPQGRGKIERFFGTINTELLPELPGALSNGKPASPPRL 312 Query: 266 DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS---AR-QYSGNTTPPEYDEGVMV 321 GEL+ A + T R H +D P ++ R S + Sbjct: 313 SLGELEVAVKTFVTAVYNARKHSEID-VPPNEAWRGDGWLPRMPNSLEQLDLLLVMALKT 371 Query: 322 RKVDISGKLSVKGVSLS--AGKAFRGERVGLKEMQEDGSYEVWWYSTKV 368 R+V G + +G+ + A+ G+ V ++ D + ++ + Sbjct: 372 RQVRRDG-IRFQGLLYTDPTLAAYVGKTVNIRYDPRDITELRVFHRDRF 419 >UniRef50_A4SIH8 IS3-family transposase n=42 Tax=Proteobacteria RepID=A4SIH8_AERS4 Length = 387 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 63/292 (21%), Positives = 103/292 (35%), Gaps = 39/292 (13%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 ++ A I +C F + + Y +L R ++ R L R+ Sbjct: 107 LREQAPITLVCCAFDVPKSCFYDYLARKRTINRERMKQRS---------------ELRRL 151 Query: 84 AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA------------SPGIP 131 + + G+R + + + G+ + F V NLM GL P IP Sbjct: 152 FKESRDSAGSRALMSMMRELGYQIGRFK-VRNLMKEAGLASKQPGAHRYKVACSERPDIP 210 Query: 132 --ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 F+ PN++W D + + +LD H+R + A E + Sbjct: 211 NLLAREFDVPQPNQVWCGDIT-YVWTSARWHYLAVVLDLHTRRVVGWAMSDKPDAELAIK 269 Query: 190 QLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 L +++ G P + D GS +G A L R + SR + + Sbjct: 270 ALEMAYQQRGCPSGVLFHSDQGSQYGSR-----AFRQRLWRYRMTQSMSRRGNCWDNAPM 324 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPGSR 298 ER RSLK+E L + E +R ++ YN RPH+ D P Sbjct: 325 ERLFRSLKSEWLPATGYVSLREAKRDISYYLMDYYNWRRPHQHNDGIPPAEA 376 >UniRef50_A5BTM1 Putative uncharacterized protein n=31 Tax=Vitis vinifera RepID=A5BTM1_VITVI Length = 2292 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 51/292 (17%), Positives = 112/292 (38%), Gaps = 37/292 (12%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPA-FSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G + P+ F H + I Sbjct: 1393 EEQQGILSHCHENACGGHFASKKTAMKVLQSGLSWPSLFKDAHTMCRSCDRCQRLEKLIR 1452 Query: 132 ATGRFEHDAP----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W +DF G FP G + L +D S++ + ++ R + Sbjct: 1453 RNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVKAIPCKHNDHRVVL 1512 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++ Sbjct: 1513 KFLKENIFSRFGVPKAIISDGGTHFCNR-----PFETLLAKYGVKHKVATPYHPQTSGQV 1567 Query: 248 ERFHRSLKAEVLQ----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 E+ ++ +K +++ + + +L + +RT Y L M+ Y+ Sbjct: 1568 EQANKGIKNILMKVVITSRKYWSI-KLHDSLWAYRTAYKTI-----LGMSP----YRLVY 1617 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY +++++ + +A + L EM+E Sbjct: 1618 GKACHLLVEVEYKAWWAIKRLN-----------MDLIRAGAKRCLDLNEMEE 1658 >UniRef50_C6W069 Integrase catalytic region n=3 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W069_DYAFD Length = 325 Score = 148 bits (373), Expect = 4e-34, Method: Composition-based stats. Identities = 66/307 (21%), Positives = 109/307 (35%), Gaps = 18/307 (5%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 R ++V+ Q G NI RR GI+ +T +W++R E DR PH + Sbjct: 9 EARQKWVMLYRQIG-NISVAARRCGIARSTLQRWIKR---EDENLFTDRSHRPHRLGRQK 64 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS-PGIPA 132 L + ++G +I L + STV ++ +H + P Sbjct: 65 YRPEDEALVLKVRDEFKYGKLRICSHLFRLHDLKISTSTVARILEKHSVAPIRRFTKHSP 124 Query: 133 TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 R+ P Q+D + T +DD +R + + + L Sbjct: 125 PIRYAKLVPGERVQLDVCKIRA----GLYQYTAIDDCTRLRVLKLYTRRSAANS-IDFLD 179 Query: 193 SVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 + + + P R+ D G + A + LM I+ +P P GK+ER Sbjct: 180 KLIDEFHYPIQRIQTDRGQEF-----FAVAFQQKLMDYCIKFRPIKPRSPHLNGKVERSQ 234 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS-RYQPSARQYSGNT 310 ++ E D +L W+ YN RPH +L P + SA+ + Sbjct: 235 QTDLQEFYTTVDLRDP-QLDDKLAQWQFHYNYFRPHSSLGGKTPIEFASEHSAKASFWDE 293 Query: 311 TPPEYDE 317 YDE Sbjct: 294 IEAIYDE 300 >UniRef50_Q24VK2 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24VK2_DESHY Length = 284 Score = 148 bits (373), Expect = 4e-34, Method: Composition-based stats. Identities = 47/289 (16%), Positives = 95/289 (32%), Gaps = 39/289 (13%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + + ++ ++ Y P+ + + + Sbjct: 16 SKELPLSTAAELLDVNRSSAYY-------------------KAKEPSETELAVKNAIDKM 56 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMAR---HGLLPGASPGIPATGRFEHD-- 139 H + WG+R++ + L+ G + T M H + P + PA G + Sbjct: 57 HTDNPAWGSRQLSKKLKRLGFDIGRLKT-RRYMQEMDIHTIYPKPNLSKPAKGHKVYPYL 115 Query: 140 -------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 PN+ W +D + G + ++D +SR + V+ L Sbjct: 116 LRNANITRPNQAWSIDIT-YIRLKHGFVYLTAIIDWYSRLIVGWELDDTLSTTMVKCALE 174 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 F P+ + D GS + T + +++ +ER+ R Sbjct: 175 KAFSVAK-PEILNSDQGSQF-----TGHEYINLVESNRVKISMDGKSRWADNIMIERWFR 228 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 +LK + + K + + + ++ + YN E+ H ALD P Y P Sbjct: 229 TLKYDEVYLKDYENIKDARKQIGEFIHTYNFEKLHSALDYQTPAENYYP 277 >UniRef50_C8WWR8 Integrase catalytic region n=1 Tax=Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 RepID=C8WWR8_ALIAD Length = 271 Score = 147 bits (372), Expect = 5e-34, Method: Composition-based stats. Identities = 53/287 (18%), Positives = 96/287 (33%), Gaps = 26/287 (9%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 V +++G + + + G++ Y L+ + D+ + Sbjct: 1 MVRQLAKEGFPVPVIAKALGLNRTYCYSLLKPPVPKPKRPPVDK-----------DALVK 49 Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG----IPATG 134 +R + +G R+I+ L + + V+ LM GLL A G Sbjct: 50 QWIRRLCEEFPTYGYRRIQVMLRRRYNLRVNHKRVYRLMKEMGLLVKAPKRGASRTKRRG 109 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 + N +Q D + G + ++D + R + + R E + + Sbjct: 110 KIPVTRSNEHFQCDMTKVWCGKDGWGYLFAVIDAYDREIVGYSFSRFCRTEDLLNAVDRA 169 Query: 195 FERY------GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 G + DNG T + I + +P G +E Sbjct: 170 LNYRFPNGVQGAGLTLRTDNGCQ-----MTSRRFIEAMKACQINHERTGFNNPDADGYIE 224 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 RF RSLK E + + ++ E + + + YN ERPH AL P Sbjct: 225 RFFRSLKEEEVWLQEYSSFAEAKAGIESYIHFYNTERPHSALGYRSP 271 >UniRef50_A5CA04 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5CA04_VITVI Length = 2174 Score = 147 bits (372), Expect = 5e-34, Method: Composition-based stats. Identities = 59/306 (19%), Positives = 117/306 (38%), Gaps = 43/306 (14%) Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGAR----KIKRWLEDQGHTMPAFSTVHNLMARH 120 IP+ + ++ ++H G K + G T P+ ++M R+ Sbjct: 1352 IPNQIIRKCVLEVEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDAHIMCRN 1411 Query: 121 GLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFS 173 T R + +W +DF G FP G + L +D S++ Sbjct: 1412 --CDRCQRLGKLTKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWV 1469 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRV 233 + + ++ R ++ ++F R+G+P + D G+ + + E L + G++ Sbjct: 1470 EAIPYKQNDHRVVLKFLKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSKYGVKH 1524 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEA 289 + PYHPQT G++E +R +K +++ ++ + L + +RT Y Sbjct: 1525 KVATPYHPQTSGQVELANREIKNILMKVVN-SNRKDWSIRLHDSLWAYRTAYKTI----- 1578 Query: 290 LDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVG 349 L M+ Y+ + EY +RK++++ KA + Sbjct: 1579 LRMSP----YRLVYCKACHLPVEVEYKAWWAIRKLNMN-----------LIKAGEKRFLD 1623 Query: 350 LKEMQE 355 L EM+E Sbjct: 1624 LNEMEE 1629 >UniRef50_A5AWA7 Putative uncharacterized protein n=6 Tax=Vitis vinifera RepID=A5AWA7_VITVI Length = 2136 Score = 147 bits (371), Expect = 6e-34, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 91/220 (41%), Gaps = 30/220 (13%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G Sbjct: 1848 KIFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFG 1907 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 +P + D G+ + + E L + G++ + PYHPQT G++E +R +K ++ Sbjct: 1908 VPKAIISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKNILM 1962 Query: 260 QGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEY 315 + ++ + L + +RT Y L M+ Y+ + EY Sbjct: 1963 KVVN-SNRKDWSIRLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEY 2012 Query: 316 DEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + KA + L EM+E Sbjct: 2013 KAWWAIKKLN-----------MDLIKAGEKRFLDLNEMEE 2041 >UniRef50_Q2Y8D0 Integrase, catalytic region n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y8D0_NITMU Length = 167 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 48/166 (28%), Positives = 71/166 (42%), Gaps = 9/166 (5%) Query: 165 LLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALEL 224 ++D+++R SL + + E V + L + R GLP + +DNGS + ++ Sbjct: 1 MVDNYTRESLAIEVGQSLKGEDVVKTLNHIATRRGLPSIIKVDNGSEFISRV-----MDK 55 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLE 284 W GI + SRP P ++E F+ + E L WF + +R D WR YN Sbjct: 56 WAYERGIELDFSRPGKPTDNARVESFNGRFRQECLNAHWFLSLEDARRKIDEWRQYYNEM 115 Query: 285 RPHEALDMAVPGS-RYQPSARQYSGNTTPPEY---DEGVMVRKVDI 326 RPH AL A P + T PE+ D G R V Sbjct: 116 RPHSALQWATPAEFARRARENALPDRPTEPEFSTLDRGAFNRSVQH 161 >UniRef50_A5C046 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5C046_VITVI Length = 1565 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 52/273 (19%), Positives = 97/273 (35%), Gaps = 31/273 (11%) Query: 89 ERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD-APNRLW 145 + ++I + G + T + R L + +W Sbjct: 1111 PKEEQQRILIHCHENACGGHFTSQKTAMKELCRCQRLGKLTRRNQMPINPILIVDLFDVW 1170 Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G F G L +D S++ + ++ ++ ++F R+G+P + Sbjct: 1171 GIDFMGQFLMSFGNSFILVGVDYVSKWVEVIPCKHNDHSVXLKFLKENIFSRFGVPKAII 1230 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE---VLQGK 262 D G+ + + E L + G++ + PYHPQT G++E +R +K V+ Sbjct: 1231 SDGGTHFCN-----KPFETLLTKYGVKHKVATPYHPQTSGQVELANREIKNILTKVVNTS 1285 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVR 322 S +L + +RT Y L M S Y + EY ++ Sbjct: 1286 RRDWSVKLHDSLWAYRTAYKTI-----LGM----SLYSLVYGKVCHLLVEVEYKAWWAIK 1336 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 KV+ + +A + L EM+E Sbjct: 1337 KVN-----------MDLIRARAKRCLDLNEMEE 1358 >UniRef50_A5AKZ0 Putative uncharacterized protein n=18 Tax=Vitis vinifera RepID=A5AKZ0_VITVI Length = 2140 Score = 146 bits (369), Expect = 1e-33, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 90/218 (41%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1366 IFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGV 1425 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1426 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1480 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y+ + EY Sbjct: 1481 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEYKA 1531 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1532 WWAIKKLN-----------MDLIRAGLKRCLDLNELEE 1558 >UniRef50_B5CNC7 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B5CNC7_9FIRM Length = 384 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 54/258 (20%), Positives = 93/258 (36%), Gaps = 24/258 (9%) Query: 60 QDRPRIPHHSPNRSSDDI--TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLM 117 +R I + + S +++ ++ H + WGAR++ L+++G+ + M Sbjct: 129 INRTSIYYKTSPVSDEELACKEIIDHLHTDNPTWGARQMSAQLKNRGYHVGRRKA-RRYM 187 Query: 118 ARHGLLPGASPGIPATGRFEH-------------DAPNRLWQMDFKGHFPFGGGRCHPLT 164 + P P + + R + DAPN+ W +D + P G + Sbjct: 188 NEMDIYP-IYPKMNLSKRMQQAKVCPYLLRNVVIDAPNQAWSIDIT-YIPIRHGFLYLTA 245 Query: 165 LLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALEL 224 ++D +SR + V L F P + D G + T Sbjct: 246 VIDWYSRCIVGWEVDDTLDTRMVINVLKKAFAVSK-PQILNSDQGCQF-----TSQKYIE 299 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLE 284 ++ GIR +ER+ RS K E + + E + A + YN E Sbjct: 300 FVKENGIRQSMDGKSRWADNIMIERWFRSFKYEEAYLTLYNNIKEARVAIGRYVYTYNFE 359 Query: 285 RPHEALDMAVPGSRYQPS 302 R H ALD P Y P+ Sbjct: 360 RCHSALDYKTPAECYYPA 377 >UniRef50_A5BFN4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BFN4_VITVI Length = 1956 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 100/273 (36%), Gaps = 28/273 (10%) Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P + S+ ++ + M K + G T P+ ++M R Sbjct: 1166 PRSVSLKKSNKGSSAIAMRMHVEATLPLMKAAMKVLQSGFTWPSLFKDSHIMCR--SCDR 1223 Query: 126 ASPGIPATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 T R + +W +DF G FP G + L +D S++ + Sbjct: 1224 CQRLEKLTKRNQMPMNPILIVDIFYVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPC 1283 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 ++ R ++ ++F R+G+P + D G+ + + L + G++ + P Sbjct: 1284 KHNDHRVVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFVTLLAKYGVKHKVATP 1338 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAV 294 YHPQT ++E +R +K +++ + L + +RT Y L M+ Sbjct: 1339 YHPQTSRQVELANREIKNILMKM-VITSRKDWSIKLHDSLWAYRTTYKTI-----LGMSP 1392 Query: 295 PGSRYQPSARQYSGNTTPPEYDEGVMVRKVDIS 327 Y+ + EY ++++++ Sbjct: 1393 ----YRLVYGKACHLPMEVEYKAWWAIKRLNMD 1421 >UniRef50_C4V4D7 Transposase n=5 Tax=Clostridiales RepID=C4V4D7_9FIRM Length = 329 Score = 146 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 67/288 (23%), Positives = 100/288 (34%), Gaps = 24/288 (8%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE 89 + ++ Y+ R+ L R R PHH PN SD AL+R R Sbjct: 38 VTKAAIKYHTYRQFIYRLRNRY-DGTPESLDPRSRRPHHHPNEHSDREIALIRRMRKRRP 96 Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA--SPGIPATGRFEHDAPNRLWQM 147 G L +G+T + + ++ M R GL G P + P + Q+ Sbjct: 97 NTGLICFWVHLRKKGYTR-SITGLYRCMKRLGLKAGKAKKPVYKPKPYEQATFPGQKVQI 155 Query: 148 DFK--------GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 D K G G + + T +D+++RF A ++ L + R+ Sbjct: 156 DVKVVPSVCIVGQAKEQGEKMYQYTAIDEYTRFRFIAAFKEQSTYSSMC-FLQQLIRRFP 214 Query: 200 LP-DRMTMDNGSPWGDT-----TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 ++ DNG+ + T E L RLGI RPY P+ GK+ER HR Sbjct: 215 FKIHKVQTDNGAEFTKRFQAADEANLTLFEKELKRLGIAHQKIRPYTPRHNGKVERSHRK 274 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRTVYNL--ERPHEALDMAVPGSRY 299 E F + + YN RP L P Sbjct: 275 DNEEFYASHTFYSFEDFKMQLARRNREYNNFPMRP---LGWKSPREAL 319 >UniRef50_A5B9R1 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5B9R1_VITVI Length = 2171 Score = 146 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 56/294 (19%), Positives = 109/294 (37%), Gaps = 41/294 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 D+ +L H+ + ++K + G T P+ +M R Sbjct: 1353 DEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDAXIMCR--SCDRCQRLGKL 1410 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1411 TKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVXAIPCXXNDHRV 1470 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 1471 VLKFLKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSG 1525 Query: 246 KLERFHRSLKAEVLQ----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 ++E +R +K +++ + L + + T Y L M S Y+ Sbjct: 1526 QVELANREIKNILMKVVNSXRKDXSIR-LHDSLWAYXTAYKTI-----LGM----SXYRL 1575 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L EM+E Sbjct: 1576 VYGKAXHLPVEVEYKAWWAIKKLN-----------MDLIRAGEKRYLDLNEMEE 1618 >UniRef50_Q0P7I8 IS1400 transposase B n=231 Tax=Bacteria RepID=Q0P7I8_ECOLX Length = 159 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 38/153 (24%), Positives = 65/153 (42%), Gaps = 5/153 (3%) Query: 150 KGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 G R ++DD +R +L + + + V + L + G P + +DNG Sbjct: 1 MHDALVCGRRFRMFNVVDDFNREALSIEIDLNLPAQRVVRVLDRIAANRGYPAMLRLDNG 60 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 + AL W + I++ +P P +ERF+R+ + E+L F E Sbjct: 61 PEFISL-----ALAEWAEKHAIKLEFIQPGKPTQNAFIERFNRTYRTEILDFYLFRTLNE 115 Query: 270 LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++ + W + YN ERPHE+L+ P Q Sbjct: 116 VREITEKWLSKYNCERPHESLNNMTPEEYRQRH 148 >UniRef50_A3QMY0 Transposase n=37 Tax=Bacilli RepID=A3QMY0_ENTFC Length = 366 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 60/303 (19%), Positives = 106/303 (34%), Gaps = 38/303 (12%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R ++ +F+ +D ++ LCR I + Y + + P + Sbjct: 82 RKKLNSLVQFIEKWCKD-YHVSLLCRLLEIPRSVYYFYKNK---------------PLTA 125 Query: 70 PNRSSDDITALLR-MAHDRHERWGARKIKRWLEDQGHTM---PAFSTVHNLMARHGLLPG 125 ++ + + + +R+GA KI + L +G ++ + L R ++ Sbjct: 126 TEIRNNKLKKKISTIFFTNKQRYGATKIHQVLLKEGISVSLKHVLKLIKQLNLRSIVVKK 185 Query: 126 ASPGIPATGR----------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 P F + W D G C+ +++D H++ + Sbjct: 186 YRPQRSNKPIISKENLLNQDFSTETICEKWAADITYIPTKKNGWCYLSSIMDLHTKKIIS 245 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMT--MDNGSPWGDTTGTWTALELWLMRLGIRV 233 + V Q L Y +P+ M D GS + T +E WL IR Sbjct: 246 YTFSKRMTVDCVIQTLNKAKIHYHIPEGMILHTDLGSQY-----TAREVEQWLKTNKIRH 300 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDM 292 +SR P +E FH SLK E + ++D E RA + YN R H ++ Sbjct: 301 SYSRKGTPYDNAGIESFHASLKKEEVYTTSYSDFEEANRALFSYIEGFYNRNRIHSSIHY 360 Query: 293 AVP 295 P Sbjct: 361 LTP 363 Score = 45.5 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 46/118 (38%), Gaps = 1/118 (0%) Query: 17 TEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD 76 + ++ +Q G ++R L + +G+S AT YKW + + + GL + N ++ Sbjct: 9 KQMIVELNQTGRSVRGLAKEYGLSEATIYKWKNLYLPDQSTGLTGKEVAELRKENARLNE 68 Query: 77 ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG 134 +L+ A R + +++E S + L+ + P T Sbjct: 69 ELEILKKAAAIFSRKKLNSLVQFIEKWCKDY-HVSLLCRLLEIPRSVYYFYKNKPLTA 125 >UniRef50_A5B5G8 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5G8_VITVI Length = 1856 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 90/218 (41%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1417 VFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKENIFSRFGV 1476 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1477 PKAIINDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANRKIKNILMK 1531 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y+ + EY Sbjct: 1532 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEYKA 1582 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1583 WWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1609 >UniRef50_A4Z1R9 Putative transposase, probably encoded by an unidentified IS element protein n=4 Tax=Bradyrhizobium RepID=A4Z1R9_BRASO Length = 285 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 67/298 (22%), Positives = 103/298 (34%), Gaps = 43/298 (14%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 + A+++ LC G+ AT Y+ L R R R ++ Sbjct: 1 MTAQTADSSAHVQRLCALAGLPRATYYRHLNR-----------RSRAEAEC------ELR 43 Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG--IPATGR- 135 L+ +H +G R++ L G + A V LM + LL P T R Sbjct: 44 DQLQRICLKHPFYGYRRVTAALRRLGMAVNAKK-VLRLMRQDNLLAQRKTPFLKPPTERP 102 Query: 136 ------------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 AP+++W D + + +LD SR ++ A Sbjct: 103 TDVIVVPNLIRGLAPSAPDQIWVADIT-YVHLAKTFAYLAVILDGFSRKAVGWAFDNTLD 161 Query: 184 RETVQQQLVSVF-ERYGLPDRMT--MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 L R P + D G + A L I++ S P + Sbjct: 162 ASLAIAALDKALKSRNPKPGSLIHHSDRGVQYASI-----AYRQRLADREIKISMSSPGN 216 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPGS 297 P K E F ++LKAE + GK FAD + +R + + +YN ER H AL P Sbjct: 217 PFDNAKAESFMKTLKAEEVNGKTFADVNDARRRINSFIAEIYNKERLHSALGYRSPLE 274 >UniRef50_A5AMG6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AMG6_VITVI Length = 1704 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 100/266 (37%), Gaps = 30/266 (11%) Query: 75 DDITALLRMAHDR--HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + +L H+ + ++KI + G T P+ ++M R Sbjct: 1175 QEQQGILNHCHENACEGHFASQKIAMKVLQSGFTWPSLFKDAHIMCR--SCDRCQRLGKL 1232 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1233 TKRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYDSKWVEAIPCKHNDHRV 1292 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+ +P + D G+ + + E L + G++ + YHPQT G Sbjct: 1293 VLKFLKENIFLRFRVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATSYHPQTSG 1347 Query: 246 KLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 ++E +R +K +++ + L + +RT Y L M+ Y Sbjct: 1348 QVELANREIKNILMKV-VITSRKDWSIKLHDSLWAYRTTYKTI-----LGMSP----YHL 1397 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDIS 327 + EY ++++++ Sbjct: 1398 IYGKACHVPVEVEYKVWWAIKRLNMD 1423 >UniRef50_A5BKJ4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BKJ4_VITVI Length = 922 Score = 144 bits (364), Expect = 5e-33, Method: Composition-based stats. Identities = 53/272 (19%), Positives = 104/272 (38%), Gaps = 41/272 (15%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDF 149 + +KI + G + P+ L + + G+ D +W +DF Sbjct: 80 HFAYQKIAMKVLQSGFSWPS------LFKDAHAMCKSXDRYQRLGKLTFD-FFDVWGIDF 132 Query: 150 KGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 G FP G + L +D S++ + ++ R ++ ++F R+G+P + D G Sbjct: 133 MGPFPMSFGNSYILVGVDYVSKWVEAILCKQNDHRVVLKFLKENIFSRFGVPKAIISDEG 192 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ------GKW 263 + + + E L + G++ + PYHPQT G++E +R +K +++ W Sbjct: 193 THFCN-----KPFETLLAKYGVKHKVATPYHPQTFGQVELANREIKNILMKVVNTSRRNW 247 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRK 323 F +L + +RT Y L M+ Y+ + +Y ++K Sbjct: 248 FV---KLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKACHLPVEVQYKAWWAIKK 295 Query: 324 VDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 V+ + + L EM+E Sbjct: 296 VNXD-----------LXRVGMKRCLDLNEMEE 316 >UniRef50_A5BWY6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BWY6_VITVI Length = 1068 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 51/287 (17%), Positives = 103/287 (35%), Gaps = 37/287 (12%) Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEH 138 ++R E+ G I G + T ++ G T R + Sbjct: 460 QIIRKCVPEEEQQGIL-IHCHENACGGHFASQKTAMKVLQS-GSCDRCQRLGKLTKRNQM 517 Query: 139 DA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 +W +DF G FP G + L +D S++ + ++ R ++ Sbjct: 518 PMNPIIIVDLFNVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLK 577 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++F R+G+P + D G+ + + E L + G++ + PYHPQT ++E + Sbjct: 578 ENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSEQVELAN 632 Query: 252 RSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSG 308 R +K +++ +L + +R Y L M+ Y+ + Sbjct: 633 REIKNILMKVVIMRRKDWSIKLHDSLWAYRIAYKTI-----LGMSP----YRLVYGKACH 683 Query: 309 NTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY +++++ + +A + L EM+E Sbjct: 684 LPVEVEYKAWXAIKRLN-----------MDLIRAGAKRCLDLNEMEE 719 >UniRef50_C5CE17 Integrase catalytic region n=3 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CE17_KOSOT Length = 367 Score = 144 bits (362), Expect = 7e-33, Method: Composition-based stats. Identities = 75/370 (20%), Positives = 133/370 (35%), Gaps = 33/370 (8%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQR-WAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 + ++G + ++ RR GI+P T K+L++ W Q G + P + Sbjct: 10 IRILYREGLSKLAIARRLGIAPNTVKKYLEKEWCQMAKRGSKLDPFKDY----------- 58 Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEH 138 + + A + + L ++G+T T+ M + P P I E Sbjct: 59 --IEKRLQEYPELTATVLFKELVERGYT--GKLTILR-MYVSSIRPKGKPEIVVRFETEP 113 Query: 139 DAPNRL-WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 ++ W ++ +SR DE+ ET+ Q + FE Sbjct: 114 GRQFQVDWGT-GTTVIAGEKTTVKFFIMVLSYSRMLYA-EIVPDEKLETLIQAHLHAFEY 171 Query: 198 YG-LPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 +G P DN + G +V RPY+P+ +GK+ER Sbjct: 172 FGGYPSEGLYDNMKTVVKKLQKQKEYNARFMDFANFYGFKVITHRPYNPKAKGKVERLVP 231 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTP 312 ++ +L G+ ++ EL+ W + N +R H L P R++ N Sbjct: 232 YVRENILYGQSYSSLTELKNVLRDWLAIAN-QRLHSELK-ETPLERFEREKNHL--NKLS 287 Query: 313 PEYD-EGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI 371 Y + R V G++ K + GK + GERV L Q +GS + ++ + Sbjct: 288 KSYPIRRLNTRLVRDKGQIVYKERAYHVGKKYTGERVNL---QVEGSLLKIYDGDELITV 344 Query: 372 DLKKKSITMG 381 K + Sbjct: 345 HPLKDQVEKR 354 >UniRef50_A5BYC4 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5BYC4_VITVI Length = 1855 Score = 143 bits (361), Expect = 9e-33, Method: Composition-based stats. Identities = 50/301 (16%), Positives = 114/301 (37%), Gaps = 34/301 (11%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 ++ +L H+ + ++K + G T P+ ++M R Sbjct: 1489 EEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDSHIMCR--SCDRCQRLGKL 1546 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +BF FP + L +D S++ + ++ R Sbjct: 1547 TKRNQMPMNPILIVDIFXVWGIBFMRPFPMSFSNSYILVGVDYVSKWVEAIPCKHNDHRV 1606 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 1607 VLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVAIPYHPQTSG 1661 Query: 246 KLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 ++E +R +K +++ + L + +RT Y L M+ Y+ Sbjct: 1662 QVELANREIKNILMKV-VITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRL 1711 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEV 361 + EY ++++++ + V + K + + + KE++ + Sbjct: 1712 VYGKACHLPVEVEYKAWWAIKRLNMD-LIRVGAKRM---KKWHDQLISNKELRNGQRVLL 1767 Query: 362 W 362 + Sbjct: 1768 Y 1768 >UniRef50_A9VK05 Integrase catalytic region n=17 Tax=Bacillaceae RepID=A9VK05_BACWK Length = 293 Score = 143 bits (361), Expect = 9e-33, Method: Composition-based stats. Identities = 52/299 (17%), Positives = 97/299 (32%), Gaps = 33/299 (11%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + E + ++ G + LC G++ + YKWL+R P + Sbjct: 8 KKFEVIHEMTKTGYTVTILCDIAGVTRSGYYKWLKRH------------TTPSKKQSEDI 55 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP--- 131 + +L +G R+I+ WL+ + + LM+ G+ P Sbjct: 56 EIKKKILECHKKLRGIYGYRRIQVWLKATYNLHLNHKHIQRLMSELGIKAVIRKKRPYYG 115 Query: 132 -----------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 F+ PN W D + F G R + + D ++ + Sbjct: 116 KKEAYVISENHLNREFQASKPNEKWVTDIT-YLIFNGQRLYLSAIKDLYNNEIVAYETSR 174 Query: 181 DERRETVQQQLVSVFERYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 + V L ++ + + D GS + T L + ++ SR Sbjct: 175 RNDLKLVLDTLKKAKKKRNVKGILLHSDQGSQY-----TSRQYNQLLKKYQMKASMSRRG 229 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 + +E F KAE F + E++ A + YN +R + L+ P Sbjct: 230 NCWDNACMENFFSHFKAECFHLYSFRKANEVKLAVRKYMHFYNHQRFQKKLNNLSPYKY 288 >UniRef50_D1PR45 Transposase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PR45_9FIRM Length = 394 Score = 143 bits (361), Expect = 1e-32, Method: Composition-based stats. Identities = 63/281 (22%), Positives = 95/281 (33%), Gaps = 37/281 (13%) Query: 34 CRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL--RMAHDRHERW 91 CR FGI + Y + + R P + N+ D++ L ++ +R Sbjct: 110 CRLFGIRKSNFY-----YRTQ---------RRPAKTQNQQRDEVLRPLVEKIYLRSGKRI 155 Query: 92 GARKIKRWLEDQGHTMPAF---STVHNLMARHGLLPGASPGIPATGRFEH---------- 138 + I++ L DQG ++ S + A G A RF H Sbjct: 156 SSEAIRQKLLDQGISISKRKVLSFLQEWKADKGTTSARMSAAAAQRRFCHLNLLDRQFNP 215 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 +APN+ W D + GG+ + +LD SR L V + + F R Sbjct: 216 EAPNKAWVSDITE-LHYAGGKLYLCVVLDLFSRKVLAARASCQNDTALVARTFETAFLRR 274 Query: 199 GLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 G P + D G + L +R S P P +E F SLK Sbjct: 275 GRPRGLLFHSDQGRQYTSDY-----FRELLEEFSVRQSFSTPGVPYDNAVMESFFASLKK 329 Query: 257 EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 E + G L + + YN +RPH L P Sbjct: 330 EEYHRYCYKSIGALLDSVQQYLLFYNRQRPHSRLGYRTPEE 370 >UniRef50_A5BFS9 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5BFS9_VITVI Length = 2326 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 45/219 (20%), Positives = 93/219 (42%), Gaps = 27/219 (12%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P Sbjct: 929 FYVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVVLKFLKENIFSRFGVP 988 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ- 260 + D G+ + + E L + GI+ + PYHPQT G++E +R +K +++ Sbjct: 989 KVIISDGGTHFCN-----KPFEALLAKYGIKHKVATPYHPQTSGQVELANREIKNILMKV 1043 Query: 261 -----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEY 315 W + + + +RT Y L M+ Y+ + E+ Sbjct: 1044 VNTNRKDWSVNLLD---SLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEF 1091 Query: 316 DEGVMVRKVDIS----GKLSVKGVSLSAGKAFRGERVGL 350 ++K+++ ++ + F+G+RV L Sbjct: 1092 KAWWAIKKLNMDLTKENLKRWHDQLVTKKEFFKGQRVLL 1130 Score = 134 bits (337), Expect = 5e-30, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 108/293 (36%), Gaps = 39/293 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 ++ +L H+ + ++K W+ G P+ + M + + Sbjct: 1691 EEQQGILSHCHENACGGHFASQKTAMWVLQSGFYWPSLFKDAHTMCKSCDRCQRLGKLTR 1750 Query: 133 TGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W +DF G FP G + L +D S++ + ++ R + Sbjct: 1751 RNMMPLNPILIVDLFYVWGIDFMGPFPMSFGYSYILVRVDYVSKWVEAIPCNHNDHRVVL 1810 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++F R+G+P + D G+ + + E L + ++ + YHPQT G++ Sbjct: 1811 KFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYRVKHKVATLYHPQTNGQV 1865 Query: 248 ERFHRSLKAEVLQ-----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 E +R +K +++ K + +L + ++T Y L+M+ Y+ Sbjct: 1866 ELANRKIKNILMKVVNTNRKDW--PVKLLDSLWAYKTAYKTI-----LEMSP----YRLV 1914 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY +K +++ + + L EM+E Sbjct: 1915 YGKACHLLVELEYKAWWA-----------IKQLNMDLSRVGLKRFLDLNEMEE 1956 >UniRef50_A5BXG2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BXG2_VITVI Length = 1268 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 10/144 (6%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + + ++ R ++ ++F R+G+P Sbjct: 1108 FDVWGIDFMGPFPMSFGNSYILVRVDYVSKWVEAIPNKHNDHRVVLKFLKENIFLRFGVP 1167 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ- 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1168 KAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMKV 1222 Query: 261 ---GKWFADSGELQRAFDHWRTVY 281 + + +L + +RT Y Sbjct: 1223 VITSRKYWSI-KLHDSLWAYRTTY 1245 >UniRef50_A5B5S6 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5S6_VITVI Length = 1310 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 86/218 (39%), Gaps = 30/218 (13%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+ Sbjct: 1014 FDVWGIDFMGPFPMSFGNSYILVGVDYVSKWFEAIPCKHNDHRVVLKFLKENIFSRFGVS 1073 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYHPQ G++E +R +K +++ Sbjct: 1074 KAIISDGGTHFYN-----KPFETLLAKYGVKHKVATPYHPQIFGQVELANREIKNILIKV 1128 Query: 262 KWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + L + +RT Y L M+ Y+ + EY Sbjct: 1129 VN-TSRRDWSVKLHDSLWAYRTAYKTI-----LGMSP----YRLVXGKACHLPVEVEYKX 1178 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++KV+ + + + L EM+E Sbjct: 1179 WWAIKKVN-----------MDLTRXXIKRCLDLNEMEE 1205 >UniRef50_A5BI69 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BI69_VITVI Length = 1628 Score = 142 bits (359), Expect = 2e-32, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 91/218 (41%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1062 VFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEVIPCRSNDHKVVLKFLKENIFARFGV 1121 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1122 PKSIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1176 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L+M+ Y+ + EY Sbjct: 1177 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LEMSP----YRLVYGKACHLPVEVEYKA 1227 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1228 WWAIKKLN-----------MDLTRARLKRCLDLNELEE 1254 >UniRef50_A4G2L6 Transposase IS3 family, part 2 n=17 Tax=Bacteria RepID=A4G2L6_HERAR Length = 288 Score = 142 bits (359), Expect = 2e-32, Method: Composition-based stats. Identities = 69/280 (24%), Positives = 106/280 (37%), Gaps = 16/280 (5%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + + C G+S + Y WL R + + R HHS + SD R+ Sbjct: 10 RGIWPVALTCDTLGVSRSGFYAWLTRTPCKRRTENEQLGRAVHHSFIQ-SDRTYGARRVW 68 Query: 85 HDRH---ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR-FEHDA 140 HD R G +++R ++ Q + A +L G P R F+ A Sbjct: 69 HDLLASGYRCGLHRVERLMQAQ--ALRARPRRRSLPIDRGERPVIGIAANVLDRQFDASA 126 Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PNR W DF + G + +LD +SR + + + + V L+ R G Sbjct: 127 PNRKWVADFT-YIWSAEGWLYLAVVLDLYSRRVIGWSMKPEMNAQLVADALMMAVWRRGK 185 Query: 201 PDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 P+ + D GS + T + L+ LG+ SR + +E F SLK E Sbjct: 186 PESVMHHSDRGSQY-----TSEQFQRLLLELGVTCSMSRAGNVWDNSAMESFFSSLKTER 240 Query: 259 LQGKWFADSGELQ-RAFDHWRTVYNLERPHEALDMAVPGS 297 L K F +++ FD+ YN R H L P Sbjct: 241 LSRKMFRTRDDIRAEVFDYIERFYNPVRRHSTLGYISPID 280 >UniRef50_A5AKV0 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5AKV0_VITVI Length = 2067 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 53/274 (19%), Positives = 108/274 (39%), Gaps = 41/274 (14%) Query: 94 RKIKRWLEDQGHTMPAF-------STVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQ 146 +K+ + G P+ S + R G L + +P D +W Sbjct: 498 QKMAMRVLQSGFWWPSLFKDAHEVSKGCDKCQRLGKLSRRNM-MPLNPILIVD-LFDVWG 555 Query: 147 MDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P + Sbjct: 556 IDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVVLKFLKENIFSRFGVPKAIIS 615 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ-----G 261 D G+ + + E L + ++ + PYHPQT G++E +R +K +++ Sbjct: 616 DGGTHFCN-----KPFEALLAKYRVKHKVATPYHPQTSGQVELANREIKNILMKVVNTNR 670 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV 321 K ++ +L + +RT Y L M+ Y+ + E+ + Sbjct: 671 KDWS--VKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEFKAWWAI 719 Query: 322 RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +K++ + KA + L E++E Sbjct: 720 KKLN-----------MDLTKAGLKRSLDLNELEE 742 >UniRef50_A5BWF3 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5BWF3_VITVI Length = 1924 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 49/298 (16%), Positives = 105/298 (35%), Gaps = 39/298 (13%) Query: 71 NRSSDDITALLRMAHDRHERWGAR----KIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA 126 + + ++H G K + G T P+ ++M R Sbjct: 1545 RKCVPEEEQQEILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDSHIMCRSCDRCER 1604 Query: 127 SPGIPATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 + + + +W +DF FP G + L + S++ + + + Sbjct: 1605 LGKLTKRNQMPMNPILIVDLFDVWGIDFMRPFPMSFGNSYILVGVGYVSKWVEAIPYKHN 1664 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 + R ++ ++F R+G+P + D G+ + + E L + G++ + PYHP Sbjct: 1665 DHRVVLKFLKDNIFSRFGVPKSIINDGGTHFCN-----KPFETLLAKYGVKHKVATPYHP 1719 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGS 297 QT G++E +R +K +++ + L + +RT Y L M+ Sbjct: 1720 QTSGQVELANREIKNILMKV-VITSRKDWSIKLHDSLWAYRTTYKTI-----LGMSPCRL 1773 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y + EY +++++ + + + L EM+E Sbjct: 1774 VYGKACHL----PMEVEYKAWWAIKRLN-----------MDLIRVGEKRCLDLNEMEE 1816 >UniRef50_D2U9E3 Putative integrase protein n=1 Tax=Xanthomonas albilineans RepID=D2U9E3_XANAL Length = 307 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 58/284 (20%), Positives = 89/284 (31%), Gaps = 43/284 (15%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 R T + A I +LC+ IS Sbjct: 2 KKRHTEEQIITILREAEAGNVPITTLCKGHNIS--------------------------E 35 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 H P + ++ L R+G R++ WL+ + V L GL Sbjct: 36 HQPEK-DRSLSERLLATSPHVPRFGYRRMAAWLD------VGQARVRRLWRALGLNIPPR 88 Query: 128 PGIPAT-----GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDE 182 PN +W DF G L ++D+++R L + Sbjct: 89 RPKRRRSGSDIRLPGAVRPNAVWSYDFVHDQMVDGRGLKLLCVIDEYTRECLAIEVGARF 148 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQ 242 + V L + YG P + DNG+ + T + L I P P Sbjct: 149 SSQDVILTLSRLMRLYGKPAFVRSDNGAEF-----TAAKVMRCLRDAAIGPTFIAPGSPW 203 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERP 286 G +E F+ L+ E L +WF E + + WR YN +RP Sbjct: 204 QNGFVESFNGKLRDEPLNREWFRSRTEAKVLIERWRQFYNEQRP 247 >UniRef50_UPI0001986237 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001986237 Length = 1360 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 58/288 (20%), Positives = 108/288 (37%), Gaps = 34/288 (11%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 D+ +LRM H+ + +RK + G P N P Sbjct: 985 DEQQDILRMCHEGACGGHFASRKTSAKILQSGFYWPTMFKDCN--THCKSCPQCQQLGKI 1042 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 R++ W +DF G FP G + L +D S++ +A +++ + Sbjct: 1043 NTRYQMPQNHICVVEVFDCWGLDFMGPFPHSFGNLYILVGVDYVSKWVEAVACKSNDHKV 1102 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D GS + + L + G+R S PYHPQT G Sbjct: 1103 VLKFLKENIFSRFGIPRAIISDGGSHFCN-----KPFSTLLQKYGVRHKVSTPYHPQTNG 1157 Query: 246 KLERFHRSLKAEVLQ-----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 + E +R +K + + K ++ +L A +RT Y L M+ Y+ Sbjct: 1158 QAELANREIKRILTKVVNTIRKDWST--KLSDALWAYRTAYKTV-----LGMSP----YR 1206 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAG--KAFRGE 346 + + E+ ++K++ + +A+R E Sbjct: 1207 IAYGKACHLPVELEHRAYWAIKKMNFDSDQAGAKRKYDLNELEAYRNE 1254 >UniRef50_C8VYH9 Transposase IS3/IS911 family protein n=11 Tax=Bacteria RepID=C8VYH9_DESAS Length = 369 Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats. Identities = 57/300 (19%), Positives = 97/300 (32%), Gaps = 38/300 (12%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 T R V +++ ++ + GI+ Y + P P Sbjct: 96 LTKEERLALVEPDNKE-ISLTAQADLLGINRTRIYY-------------KPAP------P 135 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVH--NLMARHGLLPGASP 128 + I + + RH +G+R+I L + + H M G+ PG + Sbjct: 136 SAEEIAIRHRIDEIYTRHPYYGSRRITAQLCRENIPVNRKRVQHYMRDMGLAGICPGPNL 195 Query: 129 GIPATGRF---------EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 T + PN +W +D + G + + +LD SRF + Sbjct: 196 SKRNTEHRVYPYLLRGVTANHPNHIWGIDIT-YIRLKEGWMYLVAILDWFSRFIISWELD 254 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 V + ++ +P D GS + T L G+ + Sbjct: 255 QVLEIPFVLVAVKRALDK-KMPLIWNSDQGSHFTSPQYT-----QLLQNAGVLISMDGKG 308 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 ER RS+K E + + E +R + YN ERPH++LD P Y Sbjct: 309 RAIDNIFTERLWRSIKYEEVYLNDYMSPREARRGIRRYIDFYNNERPHQSLDYKTPFEIY 368 >UniRef50_C5CF68 Integrase catalytic region n=6 Tax=Thermotogaceae RepID=C5CF68_KOSOT Length = 276 Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats. Identities = 62/293 (21%), Positives = 102/293 (34%), Gaps = 34/293 (11%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 + +G ++ R IS +T Y P R++D+ Sbjct: 1 MTLLLNEGFSVSEALRYLKISRSTYYY------------------KPREYSRRTNDEEIL 42 Query: 80 LLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP---ATGR 135 R H WG R+I L +G+ + V+ +M +GLL + Sbjct: 43 KEIEELKREHPYWGYRRIWAMLRKKGNKL-NRKRVYRIMKENGLLFKVEHKKACRTIQKK 101 Query: 136 FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC----LAHCTDERRETVQQQL 191 + P + +D F G + + ++D ++R L L T+E + + + L Sbjct: 102 IKPTRPREVLGIDMTKVFTRDAGWAYYIAVIDWYTREILGSEISLRCRTEEWLKALDRAL 161 Query: 192 VSVFER--YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 + G + DNGS T T LGI+ + +P+ ER Sbjct: 162 NEGYPEGVRGEEVILVSDNGSQ-----PTSTKFLKECAVLGIKQIFTSYNNPKGNANTER 216 Query: 250 FHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 + R+ K EV EL + T YN E PH AL P ++ S Sbjct: 217 YFRTYKEEVAWVLDNPSYEELIEKTKTFETFYNEEYPHSALGYKSPKEVFEES 269 >UniRef50_A8F1V3 Transposase and inactivated derivative n=4 Tax=Bacteria RepID=A8F1V3_RICM5 Length = 289 Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats. Identities = 55/305 (18%), Positives = 107/305 (35%), Gaps = 42/305 (13%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 M +R ++ ++ +I ++C+ +S + Y+WL P + +R Sbjct: 1 MPIRYAWIKE-NEGNFSIAAMCKFMKVSRSGYYEWLN---------------NPGCNRDR 44 Query: 73 SSDDITALLRMA-HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 +++T +++ + +G R IK+ L Q + + + LM L+ Sbjct: 45 EDNELTNRIKIIFKEGRGNYGTRPIKKELSRQSIIV-SRRRIARLMKEASLICKTKRKFK 103 Query: 132 AT---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 AT +F N W D + P G + T++D SR + Sbjct: 104 ATTDSNHNKQIAPNLLDRKFTVPDANCYWVGDIT-YVPTSEGWLYLATVIDLFSRKIIGW 162 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMT--MDNGSPWGDTTGTWTALELWLMRLGIRVG 234 + + + + V L+ + P + D GS + + + + GI+ Sbjct: 163 SMNNNMKADLVNNALLMAIWQRKPPKGLIWHTDRGSQYCSDSHL-----KIIKQHGIKQS 217 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMA 293 SR + E F ++K E++ F E + + YN R H A D Sbjct: 218 MSRKGNCWDNAVAESFFHTIKTELVYQHKFKTREEAKHTIFEYIEVFYNRIRMHSANDYL 277 Query: 294 VPGSR 298 P Sbjct: 278 SPVKY 282 >UniRef50_A5BT93 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BT93_VITVI Length = 1184 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 44/219 (20%), Positives = 92/219 (42%), Gaps = 32/219 (14%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +BF G FP G + L +D S++ + T+ + ++ ++F R+G+P Sbjct: 796 FDVWGIBFMGPFPMSFGHXYILVGVDYVSKWVEAIPCRTNXHKVVLKFLKENIFSRFGVP 855 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ- 260 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 856 KAIISDGGTHFCN-----KPFEALLAKYGVKHKVATPYHPQTSGQVELANREIKNILMKV 910 Query: 261 ----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYD 316 K ++ +L + +RT Y L M+ Y+ + E+ Sbjct: 911 VNTNRKDWS--VKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEFK 959 Query: 317 EGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + KA + L E++E Sbjct: 960 AWWAIKKLN-----------MDLTKAGLKRSLDLNELEE 987 >UniRef50_A5AH70 Putative uncharacterized protein n=20 Tax=Vitis vinifera RepID=A5AH70_VITVI Length = 2203 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 90/219 (41%), Gaps = 32/219 (14%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P Sbjct: 1517 FXVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVVLKFLKENIFSRFGVP 1576 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ- 260 + D G+ + + E L + GI + PYHPQT G++E R +K +++ Sbjct: 1577 KAIISDGGTHFCN-----KPFEALLAKYGINHKVATPYHPQTSGQVELAKREIKNILMKV 1631 Query: 261 ----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYD 316 K ++ +L + +RT Y L M+ Y + E+ Sbjct: 1632 VNTNRKDWS--VKLLDSLWAYRTAYKTI-----LGMSP----YHLVYGKACHLPVEIEFK 1680 Query: 317 EGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + KA + L E++E Sbjct: 1681 TWWAIKKLN-----------MDLTKAGLKRSLDLNELEE 1708 >UniRef50_C6N0S2 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N0S2_9GAMM Length = 502 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 69/407 (16%), Positives = 131/407 (32%), Gaps = 56/407 (13%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 D RD + + + D ++++ GI T W++ + G AGL R R Sbjct: 28 DERDKACQKYQIIEPYINDEVLLKTIAGNSGIPIRTLGSWIKNYRSHGLAGLVRRSRDDK 87 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIK----RWLEDQGHTMPAFSTVHNL------- 116 P + + + + + I + +P++ +V + Sbjct: 88 GLPRQYDAILQKTIEGIYLKKPMLSGANIHTLISEYCNKNNLKVPSYRSVCRIISNIPDD 147 Query: 117 ---MARHGLLPGASPGIPATGRFEHDAPNRLWQMDFK---GHFPFGG---GRCHPLTLLD 167 + + G R D PN LWQ D + ++D Sbjct: 148 MVLLGQQGSKTYRQQYDLLHIR-ATDKPNELWQADHVLLDFDILNDKNKPQKPWLTVVID 206 Query: 168 DHSRFSLCLAHCT-DERRETVQQQLVSVFERY--------GLPDRMTMDNGSPWGDTTGT 218 D SR + R G+P+ D+GS + T Sbjct: 207 DCSRAICGYELSFLSPSAQKTSLCFRHAIWRKSDPDWNILGVPEIFYTDHGSDF-----T 261 Query: 219 WTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ----------GKWFADSG 268 +E + L IR+ S+ P+ +GK+ERF R+L +++ K F + Sbjct: 262 SKHIEQVCIDLKIRLIFSQVAQPRGRGKIERFFRTLNQKLIHTLQAVTQNGTQKIFINLK 321 Query: 269 ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ-----PSARQYSGNTTPPEYDEGVMVRK 323 L + YN H + P +R+Q P +E R+ Sbjct: 322 SLDALVSAFIIEYNH-TIHPEIG-ETPAARWQRNGFLPQLLDSLEYLDLLLLNEAKP-RR 378 Query: 324 VDISGKLSVKGVSL--SAGKAFRGERVGLKEMQEDGSYEVWWYSTKV 368 + G + +G+ + ++ GE V ++ + + +Y + Sbjct: 379 ILRDG-IRFQGLRYIDTILASYIGESVVIRYSPSNITSIRVFYKGRF 424 >UniRef50_A5BYU9 Putative uncharacterized protein n=8 Tax=Vitis vinifera RepID=A5BYU9_VITVI Length = 2103 Score = 141 bits (354), Expect = 6e-32, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 86/218 (39%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ +F R+G+ Sbjct: 1638 IFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKEDIFSRFGV 1697 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + P HPQT G++E +R +K +++ Sbjct: 1698 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPDHPQTSGQVELANREIKNILMK 1752 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +R Y L M+ Y + EY Sbjct: 1753 VVNVNRKDWSIKLLDSLWAYRNAYKTI-----LGMSP----YHLVYGKACHLLVEVEYKA 1803 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1804 WWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1830 >UniRef50_Q8GAC2 Putative transposase n=2 Tax=Micrococcineae RepID=Q8GAC2_ARTNI Length = 487 Score = 141 bits (354), Expect = 6e-32, Method: Composition-based stats. Identities = 80/407 (19%), Positives = 137/407 (33%), Gaps = 57/407 (14%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 T R + +G + L G+ T +W+ ++ G +R Sbjct: 7 TAEERAILLRRHLDEGIPLSRLSADSGVPARTLSRWMSQYRANGTVESLERRPRSDRGRR 66 Query: 72 RSSDDITALLRMAHDRHERWG----ARKIKRWLEDQGHTMPAFSTVHNL---------MA 118 D+ + R R+I +GH +P +STV L M Sbjct: 67 AIPQDLIEAIEGLALRQPPPTTAFIHRRIADIASARGHPVPGYSTVRALIAGIDPGLKML 126 Query: 119 RHGLLPGASPGIPATGRFEHDAPNRLWQMDFK------GHFPFGGGRCHPLTLLDDHSR- 171 H R + PN WQ D GR +LDD+SR Sbjct: 127 AHQGETAYRDTFELVYRRDATRPNEQWQADHTLLDIEIIDQKGRTGRPWLTIILDDYSRA 186 Query: 172 ---FSLCLAHCTDERRETVQQQLVS-----VFERYGLPDRMTMDNGSPWGDTTGTWTALE 223 +++ + + ER Q + ++ GLPD + D+G+ + T LE Sbjct: 187 AAGYTVFVGAPSAERTALALHQAIRGKTNPLWPVMGLPDMLYSDHGTDF-----TSARLE 241 Query: 224 LWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL---------------QGKWFADSG 268 + I++ HS+ PQ +GK+ERF+ ++ E+L Sbjct: 242 RVCLDTHIQLIHSKIGVPQGRGKIERFYLTITTELLPHLPGYIPHGTRGRPSRPAELTLE 301 Query: 269 ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV----RKV 324 +L + + + RPH + + A P R+ S P + D ++ RKV Sbjct: 302 QLDQILEKFIVTEYNHRPHSSTNQA-PTERWSASGFIPRTPEHPEDLDLLLLTVATARKV 360 Query: 325 DISGKLSVKGVSLS--AGKAFRGERVGLKEMQED-GSYEVWWYSTKV 368 G + A+ GE V ++ D G V++ + Sbjct: 361 QRDG-IQFASTRYISPVLAAYVGEHVTVRYDPRDAGEIRVYFNDEFL 406 >UniRef50_A5BJP7 Putative uncharacterized protein n=6 Tax=Vitis vinifera RepID=A5BJP7_VITVI Length = 1265 Score = 140 bits (353), Expect = 8e-32, Method: Composition-based stats. Identities = 45/224 (20%), Positives = 93/224 (41%), Gaps = 20/224 (8%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 383 FDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKDNIFSRFGVP 442 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 443 KAIISDGGAHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMKV 497 Query: 262 KWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 + L + +RT Y L M+ Y+ + EY Sbjct: 498 VNASRKNWSIRLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAW 548 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVW 362 ++K+++ S + K + + + KE Q+ ++ Sbjct: 549 WAIKKLNMDLIQSWSKERM---KKWHDQLISNKEFQKGQRVLLY 589 >UniRef50_B7GET7 Transposase n=6 Tax=Bacillales RepID=B7GET7_ANOFW Length = 275 Score = 140 bits (353), Expect = 8e-32, Method: Composition-based stats. Identities = 51/288 (17%), Positives = 105/288 (36%), Gaps = 41/288 (14%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAH-DRH 88 + +C +S + YKW R P + +++T +R + + Sbjct: 1 MEKMCEVLKVSRSGYYKWRDR---------------PKSARQERREELTQEVRRVYIESR 45 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP------- 141 + +G+ K+ + L +G + + TV +M G+ AT +H+ P Sbjct: 46 QLYGSPKVTKKLNHEGIKV-SQKTVSRIMKEKGMKSRTVKKHKATTNSKHNHPVHENVLN 104 Query: 142 --------NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 N +W D + P G + +++D +SR + ++E V L Sbjct: 105 QNFTVTKPNEVWVADIT-YVPTDEGWLYLASVMDLYSRKIVGWHIDCSMKKELVLSALKQ 163 Query: 194 VFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++R + D GS + + L+ G++ SR + ++ FH Sbjct: 164 AYQRQQPQGSILHHSDRGSQYAS-----NDYQAKLIEYGMKCSMSRKGNCYDNACIKSFH 218 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 +K E++ + E +++ + YN +R H A + P Sbjct: 219 GIIKKELIYQTRYKTREEAKKSIFEYIEIFYNNKRIHSATEYFSPSEY 266 >UniRef50_A5B346 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B346_VITVI Length = 916 Score = 140 bits (353), Expect = 8e-32, Method: Composition-based stats. Identities = 49/229 (21%), Positives = 92/229 (40%), Gaps = 35/229 (15%) Query: 133 TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 TG D N +W +DF G FP G + L +D S++ + ++ R ++ Sbjct: 168 TGEIPIDLFN-VWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLNE 226 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 ++F R+ +P + D G+ + + E L + G++ + PYHPQT G++E + Sbjct: 227 NIFSRFRVPKVIISDRGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANT 281 Query: 253 SLKAEVLQ------GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 +K +++ W+ +L + +RT Y L M+ Y+ + Sbjct: 282 EIKNILMKVVNTSRRDWYV---KLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKT 329 Query: 307 SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++KV+ + +A + L EM E Sbjct: 330 CHLPVEVEYKAWWAIKKVN-----------MDLNRAGMKRCLDLNEMDE 367 >UniRef50_A5AH69 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AH69_VITVI Length = 1631 Score = 140 bits (352), Expect = 9e-32, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 79/190 (41%), Gaps = 19/190 (10%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +D G FP G + L +D S++ + + ++ R ++ ++F R+G+P Sbjct: 1426 FDVWGIDLMGPFPMSFGNSYILVGVDYVSKWIKAIPYKHNDHRVVLKFLKENIFSRFGVP 1485 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYHPQ G++E + +K +++ Sbjct: 1486 KAIISDRGTHFCNR-----PFETLLAKYGVKHKVATPYHPQNSGQVELANMEIKNILMKV 1540 Query: 262 KWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + L + +RT Y L M+ Y+ + EY Sbjct: 1541 -VITSRRDWSIKLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKACHPRVEVEYKA 1590 Query: 318 GVMVRKVDIS 327 ++++++ Sbjct: 1591 WWAIKRLNMD 1600 >UniRef50_C7I620 Integrase catalytic region n=4 Tax=Proteobacteria RepID=C7I620_THIIN Length = 276 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 56/296 (18%), Positives = 92/296 (31%), Gaps = 36/296 (12%) Query: 23 ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLR 82 +I G+S + Y D PR P+ + + + Sbjct: 3 CRDHPVSITQQAHLLGMSRSAVY---------------DLPR----PPSAADLALMRRID 43 Query: 83 MAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGLLPGASPGIPATGR----- 135 H H G+R+I R L G T+ M H L P G Sbjct: 44 EIHLEHPFMGSRQIVRALRRDGLQAGRLHVRTLMRKMGLHALAPQPGTSQRHPGHKVFPY 103 Query: 136 ----FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 N++W +D + P G + ++D SR L + + + Sbjct: 104 LLRALAIVRSNQVWALD-TTYIPMARGFVYLTAVVDVFSRRILAHRVAITLEAQHAVEAI 162 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 R+G P+ + D GS + T + G ++ + +ER Sbjct: 163 QEALARHGTPEIVNTDQGSQF-----TAQDFVDAVQNSGAQLSMDGRGAWRDNVFVERVW 217 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 RS+K E + + +A E + + YN ERPH + D P Y Sbjct: 218 RSVKYERVYLRAYASVREARADIGQYIDWYNRERPHSSQDGNTPEQAYLAGLPTLP 273 >UniRef50_UPI0001725BBF transposase n=1 Tax=Micrococcus luteus NCTC 2665 RepID=UPI0001725BBF Length = 274 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 61/170 (35%), Gaps = 8/170 (4%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN +W +DF+ G +++D+H+R + + + L + G Sbjct: 104 PNVVWAVDFQFDADEHGRPIKICSIVDEHTRECIGGLVERSITADRLTAHLEDLVAARGA 163 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWL-MRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 P + DNG + A+ W R G + + P P G +E F+ ++ E L Sbjct: 164 PAVLRSDNGPEFISE-----AMADWAGTRTG--LSYIPPGSPWRNGYVESFNSRIRDECL 216 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGN 309 F Q W+ YN R H +L P + Q + Sbjct: 217 NINSFYSLLHAQVIIGDWKDEYNHHRRHSSLGYLTPAEYARQCTHQMETD 266 >UniRef50_A5AWI1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5AWI1_VITVI Length = 905 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 46/261 (17%), Positives = 105/261 (40%), Gaps = 35/261 (13%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF FP G + L ++D S++ + ++ + ++ ++F R+G+P Sbjct: 606 FYVWGIDFMRPFPMSFGYSYILVVVDYVSKWVEAIPCKRNDHKVVLKFLKENIFSRFGVP 665 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ- 260 + D G+ + + + E L + G++ + PYHPQT ++E ++ +K +++ Sbjct: 666 RAIISDGGTHFCN-----KSFETLLAKYGVKHKVATPYHPQTSRQVELANQEIKNILIKV 720 Query: 261 -----GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQ---------Y 306 WF +L + ++T Y L M+ Y + + Sbjct: 721 VNTSRRDWFV---KLHDSLWAYKTAYKTI-----LGMSPYCLVYGKACHLPIEVQYKVWW 772 Query: 307 SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLS-------AGKAFRGERVGLKEMQEDGSY 359 + + + M R +D++ ++ + + K + + V LKE Q+ Sbjct: 773 AIKMLNMDLNRADMKRFLDLNEMEELRNDAYNNSNIAKQILKRWHDQLVSLKEFQKGQRV 832 Query: 360 EVWWYSTKVGVIDLKKKSITM 380 ++ + LK + I + Sbjct: 833 LLYDSKLHIFPRKLKSRWIGL 853 >UniRef50_A7VUR6 Putative uncharacterized protein n=3 Tax=Clostridium leptum DSM 753 RepID=A7VUR6_9CLOT Length = 381 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 56/302 (18%), Positives = 96/302 (31%), Gaps = 34/302 (11%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 + + + + ++ SLC +S T Y + R + + R Sbjct: 84 SPLQEKLK-AMEPLYGQFSVHSLCDALEVSRGTFYNHIFRNKKGDTLAAKRRS------- 135 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + T + + D + +GA KI L++QG + V LM G+ ++ Sbjct: 136 ----ELKTQIQSIYDDSGQIYGAGKIAAILQNQG-VKTSKKYVSQLMKELGIGSVSTTAK 190 Query: 131 PATGR-------------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLA 177 + F + PN++W D F F + ++D SR + Sbjct: 191 KEYKKWEKGQNRNFLQQQFRTERPNQVWVSDITV-FKFHDKYYYLCVIIDLFSRKVISYR 249 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + + + + + D G+ + A L G+ Sbjct: 250 ISHKSSTQLLTKTFKQAYADRQPKAELMFHSDRGTQYMSY-----AFVHLLDDFGVEQSF 304 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SR P E F K E L + + G L R H+ YN ERPH L P Sbjct: 305 SRTACPHDNAVSEAFFSIFKKEELYRRHYTSEGGLMRGIAHFIAFYNTERPHSTLQYKTP 364 Query: 296 GS 297 Sbjct: 365 EQ 366 >UniRef50_A5BQ80 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BQ80_VITVI Length = 305 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 67/148 (45%), Gaps = 14/148 (9%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G Sbjct: 11 HLFYVWGIDFMGPFPMSFGYTYILVGVDYVSKWVKAVPCKYNDYRVVIKFLKENIFSRFG 70 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 +P + D G+ + + E L + G++ + PYHPQT G++E +R +K ++ Sbjct: 71 VPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQIELANREIKNILM 125 Query: 260 Q------GKWFADSGELQRAFDHWRTVY 281 + W +L +RT Y Sbjct: 126 KVVNVNRKYWSIKLLDL---LWAYRTAY 150 >UniRef50_A5AMM4 Putative uncharacterized protein n=14 Tax=Vitis vinifera RepID=A5AMM4_VITVI Length = 2056 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 86/218 (39%), Gaps = 30/218 (13%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G F G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1394 FDVWSIDFMGPFLMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVP 1453 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + G+ + + E L + G++ + PYHPQT ++E +R +K +++ Sbjct: 1454 KAIISYGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSRQVELENREIKNILMKV 1508 Query: 262 KWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + L + +RT Y L M+ Y+ + EY Sbjct: 1509 -VITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKA 1558 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L EM+E Sbjct: 1559 WSAIKKLN-----------MDLIRAGAKRCLDLNEMEE 1585 >UniRef50_UPI000038392B COG2801: Transposase and inactivated derivatives n=2 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI000038392B Length = 239 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 48/214 (22%), Positives = 82/214 (38%), Gaps = 17/214 (7%) Query: 111 STVHNLMARHG--LLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDD 168 S++H + RHG LP P RF+ +D G + L ++D+ Sbjct: 33 SSLHRCLQRHGISRLPEVDGDKPRRSRFKAYPL----VLDV-----SEGRKFRMLNVVDE 83 Query: 169 HSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMR 228 +R L + + V L +F G+P + DNG + + ++ W+ Sbjct: 84 FTRECLAIRVSCKLKAADVIDVLSDLFILRGVPGHVRSDNGPEFIARS-----VQSWIAA 138 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 +G + + P P +E F+ L+ E+L G+ F E Q + WR +N RPH Sbjct: 139 VGSQTAYIAPGSPWENSYVESFNARLRDELLNGEIFYTLQEAQIIIESWRRHHNTIRPHG 198 Query: 289 ALDMAVPG-SRYQPSARQYSGNTTPPEYDEGVMV 321 AL + P+ + P + V Sbjct: 199 ALGYKPSAPEVFVPAPTAWPAARAQPASPAKLPV 232 >UniRef50_C1F0V3 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F0V3_ACIC5 Length = 349 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 53/294 (18%), Positives = 98/294 (33%), Gaps = 42/294 (14%) Query: 22 FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL 81 Q +I +C G+S A Y++L+ P+ ++ + + Sbjct: 1 MPLQGSLSIERMCLLAGVSRAGFYRFLK-----------------AQVPSEEETEVRSAI 43 Query: 82 RMAHDRHER-WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP------------ 128 + +H R +G R++ L+ +G + V +M LL Sbjct: 44 QQVALQHRRRYGYRRVTAELKRRGMKV-NHKRVARIMREDNLLALQPKEFATTTDSNEPL 102 Query: 129 --GIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + + R + + ++LW D + + +LD +SR + Sbjct: 103 EVYLNLSRRMQLNWVDQLWVADIT-YIRVQTEFVYLAVILDGYSRKVVGWKLDRSLTSRL 161 Query: 187 VQQQLVSVFE-RYGLPDRMT-MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 L + R P + D G + T L G+ SRP +P Sbjct: 162 AVNALDGAIKLRRPRPGVVHHSDRGVQY-----TSPEYVAILKLHGMVQSMSRPANPYDN 216 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGS 297 E F ++LK E + + D +L+ + + YN +R H AL P Sbjct: 217 ASCESFIKTLKREEIYANKYRDLQDLRSHIEEFIDGYYNQKRLHSALGYRTPEE 270 >UniRef50_A5CA05 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5CA05_VITVI Length = 1066 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 89/218 (40%), Gaps = 30/218 (13%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W ++F G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 767 FYVWGINFMGPFPMPFGYSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVP 826 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYH QT G++E +R +K +++ Sbjct: 827 KAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHHQTSGQVELANREIKNILMKV 881 Query: 262 ----KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + + L + +RT Y L M+ Y+ + + EY Sbjct: 882 VNTNRKYWSIKLL-DSLWAYRTTYKTI-----LGMSP----YRLVYGKACHLSVELEYKA 931 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +K +++ KA + L EM+E Sbjct: 932 WWA-----------IKQLNMDLSKAGLKRFLDLNEMEE 958 >UniRef50_A5BSN7 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5BSN7_VITVI Length = 2019 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 53/293 (18%), Positives = 110/293 (37%), Gaps = 39/293 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 D+ +L H+ ++ ++K + G T P+ ++M R Sbjct: 1356 DEQQGILSHCHENACGGQFASQKTTMKVLQSGFTWPSLFKDAHIMCR--SCDRCQRLGKL 1413 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1414 TKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRV 1473 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + G+ + + E L + ++ + PYHPQT Sbjct: 1474 VLKFLKENIFSRFGVPKAIISGGGAHFCN-----KPFEALLSKYRVKHKVATPYHPQTSR 1528 Query: 246 KLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K +++ + +L + +RT Y L M+ Y+ Sbjct: 1529 QVELANREIKNILMKVVNSSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRLV 1579 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L EM+E Sbjct: 1580 YGKACHLPVEVEYKAWWAIKKLN-----------MDLIRAKEKRYLDLNEMEE 1621 >UniRef50_Q1LNW1 Integrase, catalytic region n=42 Tax=Bacteria RepID=Q1LNW1_RALME Length = 463 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 65/340 (19%), Positives = 120/340 (35%), Gaps = 37/340 (10%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 M + +A T +R E + +R L +RFG+S +T +W +R A + Sbjct: 1 MHLRLHKNATTTPRIRAEIQV----SKEPMRVLAQRFGVSVSTIARWKKR------ASVH 50 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 D PH + +++ + + + H + + +H ++ RH Sbjct: 51 DASHTPHRLQTTLTPAQESIVLVLRKSLGLSLDDL-LAVVREFIHPTVSRAALHRMLKRH 109 Query: 121 GLLPGASPG--IPATGRFEHDAPNRLWQMDFKGHFPFGG--GRCHPLTLLDDHSRFSLCL 176 G+ + P T F+ P + +D K R + +D +R+ + Sbjct: 110 GVSAREALSVDRPRTKPFKAYEPGFV-HIDVKYLPQMADETTRRYLFVAIDRATRWVF-V 167 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDR-MTMDNGSPWGDTTGTW--------TALELWLM 227 + ++ L + + R + DNG + D T + Sbjct: 168 RVYASKSATNARRFLKELHKAAAFRIRTILTDNGKEFTDRFITRGERTPTGRHQFDQLCE 227 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPH 287 LGI +RP HPQT G +ERF+ + L+ F EL+ + +YN + P Sbjct: 228 ELGIEHRLTRPKHPQTNGMVERFNGRIADI-LRTHHFHSGEELEATILRYVWLYNHQLPQ 286 Query: 288 EALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDIS 327 +AL P + R + + R+V Sbjct: 287 KALGHVSPIQAMKQWQRSHP----------ELFNRRVTNQ 316 >UniRef50_A5ASA6 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5ASA6_VITVI Length = 1839 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 83/192 (43%), Gaps = 21/192 (10%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1444 VFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVDAIPCRSNDHKVVLKFLKENIFSRFGV 1503 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE--- 257 P + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 1504 PKAIISDXGTHFCNXX-----FETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 1558 Query: 258 --VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEY 315 + K ++ +L + +RT Y L M+ Y+ + EY Sbjct: 1559 VVNVNRKDWST--KLLDSLWAYRTAYKTI-----LRMSP----YRLVYGKACHLPVEVEY 1607 Query: 316 DEGVMVRKVDIS 327 ++K+++ Sbjct: 1608 KAWWAIKKLNMD 1619 >UniRef50_B8FYC8 Transposase IS3/IS911 family protein n=6 Tax=Clostridiales RepID=B8FYC8_DESHD Length = 393 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 50/285 (17%), Positives = 98/285 (34%), Gaps = 11/285 (3%) Query: 18 EFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI 77 + + AS + +C +S + Y W +R + + ++ S ++ I Sbjct: 103 QMIKKASSSSCPVEKMCETLEVSRSGYYDWDRREPSKRQKENETILKVMKESHTKAQAMI 162 Query: 78 --TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR 135 L + + G ++ R + G + +P + + Sbjct: 163 GLDKLWSDVKEAGFQCGRNRVYRLQKQHGLYSVRKKPYRVCLTDSNHDLPKAPNL-LNQK 221 Query: 136 FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 F+ + PN++W D G + + + D + + A R E + L + Sbjct: 222 FKVEHPNKVWVTDITEFKTAKGSKLYLAAIKDLFHKEIVGWALAEHMRTELCLEALRNAV 281 Query: 196 ERYGLPDRMT--MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 +R+ + D G + T L G+ SR +P E F + Sbjct: 282 KRHRPLKGLIHHSDQGRQYCSTVYV-----EELKHWGMIRSMSRKGNPFDNACAESFFST 336 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGS 297 +K+E L K + D E +R + YN +R H+AL P + Sbjct: 337 IKSERLHHKTYKDIEEARRDIFWYIECFYNRQRRHQALGNLTPAA 381 >UniRef50_A5BAC6 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BAC6_VITVI Length = 1485 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 65/157 (41%), Gaps = 6/157 (3%) Query: 112 TVHNLMARHGLLPGASPGIPATGRFEHD-APNRLWQMDFKGHFPFGGGRCHPLTLLDDHS 170 T+ R L + +W +DF G FP G + L +D S Sbjct: 1175 TMCRSCDRCQRLGKLTHRNQMPMNPILIVDIFDVWGIDFMGPFPMSFGNSYILVEVDYVS 1234 Query: 171 RFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLG 230 ++ + ++ R ++ ++F R+G+P + D G+ + + E L + G Sbjct: 1235 KWVEAILCKHNDHRVVLKFLRENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYG 1289 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADS 267 ++ + PYHPQ G++E +R +K +++ + Sbjct: 1290 VKHKVATPYHPQNSGQVELANREIKNILMKRIAYKTI 1326 >UniRef50_A5AHC2 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AHC2_VITVI Length = 1270 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 50/297 (16%), Positives = 106/297 (35%), Gaps = 37/297 (12%) Query: 71 NRSSDDITALLRMAHDRHERWGAR----KIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA 126 + + ++H G KI + G P+ + M + Sbjct: 303 RKCVPEQEQSRILSHCHDSACGGHFASQKIAMKVIQSGFWWPSLFKDAHSMCKGCDRCQR 362 Query: 127 SPGIPATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 + + +W +DF G FP G + L +D S++ + ++ Sbjct: 363 LGKLTRRNMMPLNPILIVDIFYVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSN 422 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 + + ++ ++F R+G+P + D G+ + + E L + G++ + PYHP Sbjct: 423 DHKVVLKFLKDNIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKEATPYHP 477 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 QT G++E +R +K +++ +L + +RT Y L M+ Sbjct: 478 QTSGQVELANREIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKXI-----LGMSP---- 528 Query: 299 YQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y+ + EY ++K++ + +A + L E+ E Sbjct: 529 YRLVYGKACHLPVEIEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELXE 574 >UniRef50_Q0ZCC0 Gag protein n=2 Tax=Populus trichocarpa RepID=Q0ZCC0_POPTR Length = 1886 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 76/190 (40%), Gaps = 17/190 (8%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 W +DF G FP G + L +D S++ + T++ + ++ ++ R+G+ Sbjct: 956 IFECWGIDFMGPFPPSFGFLYILVAVDYVSKWIEAIPSRTNDHKTVIKFLKENILSRFGI 1015 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE--- 257 P M D G+ + + E + + GI + PYHPQT G++E +R +K Sbjct: 1016 PRAMISDGGTHFCN-----KPFESLMKKYGITHKVATPYHPQTSGQVELANREIKQILEK 1070 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 + S L A +RT Y +L M+ Y+ + E+ Sbjct: 1071 TVNPNRKDWSLRLNDALWAYRTAY-----KTSLGMSP----YRLVYGKPCHLPVEIEHKA 1121 Query: 318 GVMVRKVDIS 327 ++ + + Sbjct: 1122 YWAIKAFNSN 1131 Score = 90.9 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 50/109 (45%), Gaps = 5/109 (4%) Query: 161 HPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWT 220 H + +D S++ + T++ + ++ ++ R+G+P M D G+ + + Sbjct: 727 HTSSGVDYVSKWIEAIPSRTNDHKTVIKFLKENILSRFGIPRAMISDGGTHFCN-----K 781 Query: 221 ALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE 269 E + + GI + PYHPQT G+ + R ++ +L ++ + Sbjct: 782 PFESLMKKYGITHKVATPYHPQTSGQKDSKARLVRWILLLQEFDITIKD 830 >UniRef50_A5AWF5 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5AWF5_VITVI Length = 2072 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 66/143 (46%), Gaps = 8/143 (5%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + ++ + ++ ++F R+G+P Sbjct: 1474 FDVWGIDFMGPFPMSFGNSYILVRVDYVSKWVEAIPCKHNDHKVVLKFLKENIFSRFGVP 1533 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1534 KAIISDGGTHFCIR-----PFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNMLMKV 1588 Query: 262 KWFADSG---ELQRAFDHWRTVY 281 +L + +RT Y Sbjct: 1589 VITRRRDWSIKLHDSLWAYRTAY 1611 >UniRef50_A5AY91 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AY91_VITVI Length = 1162 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 89/218 (40%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ +F R+G+ Sbjct: 153 IFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDHIFARFGV 212 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 213 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANRXIKNILMK 267 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y+ + EY Sbjct: 268 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEYKA 318 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 319 WWAIKKLN-----------MDLTRAGLKRCLDLNELEE 345 >UniRef50_A5AQQ3 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AQQ3_VITVI Length = 1599 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 76/176 (43%), Gaps = 13/176 (7%) Query: 112 TVHNLMARHGLLPGASPG--IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDH 169 T+ R L + +P + N +W +DF G FP G + L +D Sbjct: 1150 TMCRSCDRCQRLGKLTQRNQMPMNPILIVNLFN-VWGIDFMGPFPMSFGNSYILVGVDYV 1208 Query: 170 SRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRL 229 S++ + ++ + ++ ++F R+G+P + D G+ + + E L + Sbjct: 1209 SKWVEVIPCKHNDHKVVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKY 1263 Query: 230 GIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVY 281 G++ + YHPQT G++E +R +K+ +++ + L + +RT Y Sbjct: 1264 GVKHKVATHYHPQTSGQVELANREIKSILMKVVN-TSIRDWSVKLHDSLWAYRTAY 1318 >UniRef50_B7WRK9 Integrase catalytic region n=2 Tax=Comamonas testosteroni KF-1 RepID=B7WRK9_COMTE Length = 390 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 79/385 (20%), Positives = 132/385 (34%), Gaps = 52/385 (13%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 M + AR T +R E L RSL + +G++P T KW R + Sbjct: 1 MAIRLHGSARTTPRIRAELQLATGSH----RSLAKLYGLNPKTVAKWRAR------TSVL 50 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMP--AFSTVHNLMA 118 D P P +R+S ++ + G + + T P + S +H + Sbjct: 51 DEPMGPR---DRASQHLSQEQEHLAIALRKQGHLSLDDLMGQLLETAPKLSRSALHRCLQ 107 Query: 119 RHGLLPGASPGIPAT-GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLA 177 RHG+ + +P G+FE + +D GR H +D ++F+ +A Sbjct: 108 RHGISRQPATQVPRRHGKFEETTLGFV-HID-SAEMKISSGRQHMFVAIDRVTKFT-HVA 164 Query: 178 HCTDERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTG---------TWTALELWLM 227 + Q L V + + DNG + E Sbjct: 165 FFDRATKANAAQFLRQVLVVFPYRIHTVLTDNGMAFTGQERFRGGVTDTCIGHIFERICK 224 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPH 287 GI H++PYHP T G +ER +R++K ++ + +L+ + YN + Sbjct: 225 FNGIEKRHTKPYHPWTNGMVERMNRTIKESTIKAYEYEGLEQLREHVQAFVQSYNFGKHL 284 Query: 288 EALDMAVP-----------GSRYQPSARQYSG--NTTPPEY--------DEGVMVRKVDI 326 +AL P SR++ Q++ N P Y R D Sbjct: 285 KALRWKTPFRAMCEAWEKEPSRFRLHPHQFTVGLNILVPLYTHPAGDSSPPTAGGR--DP 342 Query: 327 SGKLSVKGVSLSAGKAFRGERVGLK 351 + V G R ER+ + Sbjct: 343 NYHSPVNGAWYFVPFHLRDERIEAR 367 >UniRef50_Q9ANU8 OrfB (Fragment) n=8 Tax=Bacteria RepID=Q9ANU8_RUMGN Length = 303 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 63/302 (20%), Positives = 108/302 (35%), Gaps = 43/302 (14%) Query: 18 EFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI 77 F+ ++D +R L R+F I P Y +L+ + ++ D+I Sbjct: 19 RFIQKYNKD-FGLRWLLRKFNICPNAYYNYLKNRKAD---------------YHQQKDEI 62 Query: 78 TALLRMAHDRHERW-GARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG-- 134 +R + H G R I +L +G+ + + TVH M L S T Sbjct: 63 KDSIREIYHSHGGVDGYRTIHAYLIRKGYDI-SRLTVHKYMNTEMQLFSISRKKKTTYEH 121 Query: 135 -------------RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 F D N W DF F G + + T++D H R + + Sbjct: 122 GVAHKVYENKLNQNFRADEINHKWCTDFTYLFLTDGSKRYNCTIIDLHDRSVIASITDRN 181 Query: 182 ERRETVQQQLVSVF-ERYGL---PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 + ++ L + G+ + D GS + T + LGI S+ Sbjct: 182 ITADLAKRTLEKAIHSQPGIDLSKLLVHSDQGSQY-----TSKEFTEFCEELGITQSMSK 236 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV-YNLERPHEALDMAVPG 296 +P +ER+ +LK +++ ++ EL A + + V YN RPH + P Sbjct: 237 AGYPYDNAPMERYFNTLKNDLIYQHYYRTEEELYTAIEEFAYVQYNHVRPHSYNNYKTPY 296 Query: 297 SR 298 Sbjct: 297 EA 298 >UniRef50_A5ACN5 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5ACN5_VITVI Length = 1390 Score = 137 bits (346), Expect = 5e-31, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 88/218 (40%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF FP G + L +D S+ + +++ R ++ ++F R+G+ Sbjct: 603 VFDVWGIDFMXPFPMSFGHSYILVGVDYVSKXVEAIPCRSNDHRVVLKFLKDNIFARFGV 662 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 663 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMK 717 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y+ + EY Sbjct: 718 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEIEYKA 768 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 769 WWAIKKLN-----------MDLTRAGLKRCLDLNELEE 795 >UniRef50_A5B7N0 Putative uncharacterized protein n=17 Tax=Vitis vinifera RepID=A5B7N0_VITVI Length = 2000 Score = 137 bits (346), Expect = 5e-31, Method: Composition-based stats. Identities = 45/214 (21%), Positives = 89/214 (41%), Gaps = 30/214 (14%) Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P + Sbjct: 1260 GIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKAII 1319 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFA 265 D G+ + + E L + G++ + PYHPQT G++E +R +K +++ + Sbjct: 1320 SDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTFGQVELANREIKNILMKVVN-S 1373 Query: 266 DSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV 321 + + L + +RT Y L M+ Y+ + EY + Sbjct: 1374 NRKDWSIRLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWWAI 1424 Query: 322 RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +K++ + KA + L EM+E Sbjct: 1425 KKLN-----------MDLIKAGEKRYLDLNEMEE 1447 >UniRef50_A5BJ10 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BJ10_VITVI Length = 1362 Score = 137 bits (346), Expect = 5e-31, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 93/250 (37%), Gaps = 33/250 (13%) Query: 112 TVHNLMARHGLLPGASPG--IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDH 169 T+ R L + +P D N +W +DF FP + L +D Sbjct: 920 TMCRSCDRCQRLGKLTRKNQMPMNPILIIDLFN-VWGIDFVRPFPMSFDNSYILVGVDYV 978 Query: 170 SRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRL 229 S++ ++ ++ R + ++F R+G+P + D G+ + + E L Sbjct: 979 SKWVEAISCKHNDHRIVLMFFKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLANY 1033 Query: 230 GIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLER 285 G++ + PYHPQT ++E +R +K +++ + L + +RT Y Sbjct: 1034 GVKHKVATPYHPQTSRQVELANREIKNILMKVVN-TSRRDWSVKLYDSLWAYRTTYKTI- 1091 Query: 286 PHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRG 345 L M Y+ + EY ++KV+ + KA Sbjct: 1092 ----LGMFP----YRLVYGKACHLLVEVEYKAWWAIKKVN-----------MDLNKAGMK 1132 Query: 346 ERVGLKEMQE 355 + L +M+E Sbjct: 1133 RCLDLNDMKE 1142 >UniRef50_A0P341 Transposase n=5 Tax=Alphaproteobacteria RepID=A0P341_9RHOB Length = 197 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 44/182 (24%), Positives = 66/182 (36%), Gaps = 11/182 (6%) Query: 117 MARHGLLPGASPGIPAT-GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 + P R E + +W MDF G + L ++D SRFS Sbjct: 15 LQIRNKTPKRRVKAKLREDRAEAVHADDVWAMDFVHDQLATGRKIRVLKVVDTFSRFSPV 74 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + R E V L G P + +D GS + L+L + + + Sbjct: 75 VNPRFSYRGEDVVATLEQACRFVGYPKTIRVDQGSEFISRD-----LDLLAYQRDVELDF 129 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SRPY +E F+ + E L WF + + + WR Y+ RPH A+ P Sbjct: 130 SRPY-----AFIESFNGKFRTECLNAHWFLTFEDARSKMEEWRKDYSTVRPHIAIGNKPP 184 Query: 296 GS 297 S Sbjct: 185 IS 186 >UniRef50_A5C050 Putative uncharacterized protein n=32 Tax=Vitis vinifera RepID=A5C050_VITVI Length = 2064 Score = 137 bits (345), Expect = 7e-31, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 64/142 (45%), Gaps = 8/142 (5%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1456 VFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGV 1515 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ PYHPQT G++E +R + +++ Sbjct: 1516 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVVTPYHPQTSGQVELANREINNILMK 1570 Query: 261 G---KWFADSGELQRAFDHWRT 279 + +S + W Sbjct: 1571 EMRNDAYLNSKIAKERLKKWHD 1592 >UniRef50_A5AIU0 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5AIU0_VITVI Length = 1753 Score = 137 bits (345), Expect = 7e-31, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 107/293 (36%), Gaps = 35/293 (11%) Query: 73 SSDDITALLRMAHDR--HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 S ++ +L H+ + +K + G + P+ + M + Sbjct: 1126 SEEEQQXILSHCHESAYXGHFAXQKTXMKVLQSGFSWPSLFKDAHTMCXSCDRSQRLRKL 1185 Query: 131 PATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 + + +W +DF G FP G + L ++ S++ + ++ R Sbjct: 1186 TXRNQMPMNPILIVDLFDVWDIDFMGPFPMSFGNSYILVGVNYVSKWVEAIPCKHNDHRV 1245 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHP T G Sbjct: 1246 VLKFLKENIFSRFGVPKAIISDEGTHFCN-----KPFETLLAKYGVKHKVATPYHPXTSG 1300 Query: 246 KLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K +++ + + + +RT Y L M+ Y+ Sbjct: 1301 QVELANREIKNILMKVVNTSKRDWSVKFHDSLXAYRTAYKTI-----LGMSP----YRLV 1351 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++KV+ + + + L EM+E Sbjct: 1352 YGKACHLXXEVEYKAWWTIKKVN-----------MDLTRXXMKRCLDLNEMEE 1393 >UniRef50_Q0ZCB7 Integrase n=4 Tax=Eukaryota RepID=Q0ZCB7_POPTR Length = 1332 Score = 137 bits (344), Expect = 7e-31, Method: Composition-based stats. Identities = 38/157 (24%), Positives = 68/157 (43%), Gaps = 13/157 (8%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 W +DF G FP G + L +D S++ + ++ + ++ ++ R+G+ Sbjct: 895 IFDCWGIDFMGPFPPSFGFLYILVAVDYVSKWIEAIPSRNNDHKTVIKFLKENILSRFGI 954 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE--- 257 P M D G+ + +T+ E + + GI + PYHPQT G++E +R +K Sbjct: 955 PRAMISDGGTHFCNTS-----FESLMKKYGITHKVATPYHPQTSGQIELANREIKQILEK 1009 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 + S L A +RT Y +L M+ Sbjct: 1010 TVNPNRKDWSLRLNDALWAYRTAY-----KTSLGMSP 1041 >UniRef50_A5AFC7 Putative uncharacterized protein n=8 Tax=Vitis vinifera RepID=A5AFC7_VITVI Length = 1717 Score = 137 bits (344), Expect = 7e-31, Method: Composition-based stats. Identities = 33/146 (22%), Positives = 68/146 (46%), Gaps = 14/146 (9%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF FP G + L +D R+ ++ ++ R ++ ++F R+G+P Sbjct: 1160 FYVWGIDFMRPFPMSFGYSYILVGVDYVFRWVEAISCKCNDHRVVLKFLKENIFSRFGVP 1219 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ- 260 + D G+ + + E L + ++ + PYHPQT G++E +R +K +++ Sbjct: 1220 KAIISDGGTHFCN-----KPFETLLAKYEVKHKVATPYHPQTSGQVELANREIKNILMKV 1274 Query: 261 -----GKWFADSGELQRAFDHWRTVY 281 WF +L+ + ++T Y Sbjct: 1275 VNTRRRYWFV---KLRDSLWAYKTTY 1297 >UniRef50_B3PDG9 IS3 family transposase, orfB n=3 Tax=Gammaproteobacteria RepID=B3PDG9_CELJU Length = 284 Score = 137 bits (344), Expect = 9e-31, Method: Composition-based stats. Identities = 60/300 (20%), Positives = 102/300 (34%), Gaps = 40/300 (13%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R F+ + ++ LCR +S + Y+WL + + P R Sbjct: 2 RFAFIREHASR-CRVKHLCRMLSVSRSRYYEWLGQQQDK-----------PDPEQQRLET 49 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT-- 133 + AL + + G+R++ R L+ QG + V LM + GL+ T Sbjct: 50 CMRALFV---ESNSSMGSRRMARRLQAQGFAAGRY-RVRRLMKKRGLVVKQKRKFRITTN 105 Query: 134 -------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 +F PN+ W D + G + ++D +SR + Sbjct: 106 SNHKLPVAENILDRQFNPVTPNQAWAADIT-YIWTVEGWLYLAVVIDLYSRRVVGWCMDK 164 Query: 181 DERRETVQQQLVSVFERYGLPDRMT--MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 + + V + L+ + D GS + + L + GI SR Sbjct: 165 RQTKSLVIRALMMAVNMRKPSAGLIHHSDRGSQYASLK-----YQASLKQHGIVCSMSRK 219 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHW-RTVYNLERPHEALDMAVPGS 297 + +ERF SLK E ++ + + R + T YN RPH L P Sbjct: 220 GNCWDNAVVERFFSSLKREWIRDNLYRYREDAIRDVRAYIVTWYNSRRPHSTLGYKSPIE 279 >UniRef50_A5APG9 Putative uncharacterized protein n=11 Tax=Vitis vinifera RepID=A5APG9_VITVI Length = 1754 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 86/218 (39%), Gaps = 28/218 (12%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF G FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1454 VFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKENIFARFGV 1513 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + + L + G++ + PYH Q G++E + +K +++ Sbjct: 1514 PKAIISDGGTHFCN-----KPFQTLLAKYGVKHKVATPYHSQRSGQVELANWEIKNILMK 1568 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y + EY Sbjct: 1569 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSPYSLVYGKACHL----PVEVEYKA 1619 Query: 318 GVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + +A + L E++E Sbjct: 1620 WWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1646 >UniRef50_C7NJB3 Integrase family protein n=3 Tax=Actinomycetales RepID=C7NJB3_KYTSD Length = 422 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 70/238 (29%), Positives = 96/238 (40%), Gaps = 45/238 (18%) Query: 109 AFSTVHNLMARHGLLPGASPGIPATG----RFEHDAPNRLWQMDFK--GHFPFGGG---- 158 A STVH ++ R+ L S ATG R+EHD P + +D K G+ P GGG Sbjct: 10 APSTVHRIL-RNARLNRLSHVDRATGEPIRRYEHDHPGAMLHVDVKKLGNIPDGGGWRYV 68 Query: 159 --------------------------RCHPLTLLDDHSRFSLCLAHC--TDERRETVQQQ 190 + + T++DDH+R + H T V + Sbjct: 69 GRQQGEKIRASTPGKPRNKYSDPLMGKAYVHTVIDDHTRVAYAEIHDDETAPTATAVLVR 128 Query: 191 LVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 V F + G+ +R+ DNG + T LGI +RPY PQT GK+ER Sbjct: 129 AVEWFNQRGVTVERVLSDNGGAYRSHLWRET-----CAELGITHKRTRPYRPQTNGKVER 183 Query: 250 FHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 FHR++ + + E + D W YN RPH A P SR QYS Sbjct: 184 FHRTMADGWAYARCYTSEAERRGELDGWLHYYNRHRPHTACGNKPPFSRLTNVTGQYS 241 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 43/152 (28%), Positives = 62/152 (40%), Gaps = 9/152 (5%) Query: 159 RCHPLTLLDDHSRFSLCLAHC--TDERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDT 215 T++D HSR + H T V + V F + +R+ DNG+ + Sbjct: 270 YAFVHTVID-HSRLAYSEVHDDETAITAVAVLHRAVDWFVDRSVIMERVLSDNGAAYRSF 328 Query: 216 TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFD 275 L + +RPY PQT GK+ER HR++ + + G+ + + D Sbjct: 329 LWRDA-----CEALRVTPKRTRPYRPQTNGKVERLHRTMADGWAYSRCYTSEGDRRASLD 383 Query: 276 HWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 W YN RPH A D P SR QYS Sbjct: 384 GWLHQYNQHRPHSACDNQPPFSRLINVPDQYS 415 >UniRef50_A5AYI6 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5AYI6_VITVI Length = 2067 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 78/187 (41%), Gaps = 17/187 (9%) Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 +W +DF FP G + L +D S++ + +++ + ++ ++F R+G+ Sbjct: 1463 VFDVWGIDFMXPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDNIFARFGV 1522 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + D G+ + + E L + G++ + PYHPQT G++E + +K +++ Sbjct: 1523 PKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANWEIKNILMK 1577 Query: 261 GKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE 317 +L + +RT Y L M+ Y+ + EY Sbjct: 1578 VVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKA 1628 Query: 318 GVMVRKV 324 ++K+ Sbjct: 1629 WWAIKKL 1635 >UniRef50_Q2YZQ9 Transposase n=3 Tax=Bacteria RepID=Q2YZQ9_9DELT Length = 282 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 51/291 (17%), Positives = 94/291 (32%), Gaps = 39/291 (13%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + ++ +C+ G+S + Y W G +R + + + Sbjct: 5 RSEFAVKKMCQVLGVSRSGYYLW----------GKHNRSARQKQNERL----MVHIREAY 50 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA---- 140 +G+ +I L+D G + + LM +G+ AT RF+HD Sbjct: 51 ARGRGVYGSPRITAELKDNGIP-CGKNRIARLMKSNGIKAKTKRRFKATKRFKHDFLVAD 109 Query: 141 -----------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 N++W D G + +LD +R + + E + Sbjct: 110 NLLNQRFSADVANQIWVSDIT-FIWTREGWLYLAAILDIFNRKIVGWSMDNKLSHEVIAD 168 Query: 190 QLVSVFE-RYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 L R P + D G+ + T A + + G S + + Sbjct: 169 ALHKAIRQRRPKPGVLFHSDRGTQY-----TSYAFRDLMEQYGFVQSMSSSGNCYDNAVM 223 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGS 297 E F +LK E++ + + E + + YN R H AL+ P Sbjct: 224 ESFFHTLKTELVYFEKYRTRQEARGGIFEYIEVFYNCVRRHSALNYCSPAE 274 >UniRef50_A9B827 Integrase catalytic region n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B827_HERA2 Length = 578 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 64/364 (17%), Positives = 120/364 (32%), Gaps = 19/364 (5%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 R VL +G + + S T + + R+ E A L R R Sbjct: 147 QTRRSAVLRLHLEGWPTKRIAAYLQTSRQTVHTIVTRFRSEDLAMLYPRSRARKPGARVV 206 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + + A ++ + R GA +I+ L+ Q + + T++ ++AR L P T Sbjct: 207 TTEGVAAIQQLR-ANPRLGAFRIRAALKQQQGIVYSRRTINRVLARLRALDPPPAKPPTT 265 Query: 134 GR----FEHDAPNRLWQMDF--KGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 F + +W +D G G + +T+LD++SR L A + Sbjct: 266 PAQPMPFAAATAHAVWSVDIRYLDMQDIGAGMLYAITILDNYSRAVLASAVSPRQDLNAY 325 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 Q L YG+P + D+G+ + + LGI P + Sbjct: 326 LQVLFIAVRNYGVPQCLVSDSGAVFRAQRA-----QTIYRMLGITKRVIARRQPWQN-YI 379 Query: 248 ERFHRSLKA-EVLQGKWFADSGELQRAFDHWRTVYNLERP--HEAL--DMAVPGSRY-QP 301 E + + +++ A D W YN + H+A P + Sbjct: 380 ETMFNIQRRMADYHFEQAQTWTDIRDAHDQWVMNYNHQEHWAHQARPDGQGTPMQVLDRA 439 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEV 361 Y+ + R+++ G + + + A + G R + + E + Sbjct: 440 YGTVYTEADLRLVFHAVRSQRRINRHGFIQYRAWRIYAEEGLAGIRTAVWLLDETLTIAY 499 Query: 362 WWYS 365 Sbjct: 500 HDEP 503 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 44/130 (33%), Gaps = 3/130 (2%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 R E + G RR G T Y ++++ ++G GL + PH Sbjct: 23 PEQRRYETIRPIVLFGDAAEHQARRTGTPIRTLYHRVKQFDRQGIKGLLEAE-APH--AA 79 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 R ++ L+R + + ++ R E Q P +T+ ++ P P Sbjct: 80 RLPAEVPRLIRRLIADYPDFTPHELGRICEVQLAYRPHHTTIQRILRETPPAPAVVRRFP 139 Query: 132 ATGRFEHDAP 141 +H Sbjct: 140 VFHSMDHQTR 149 >UniRef50_A5CBC9 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5CBC9_VITVI Length = 926 Score = 136 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 78/206 (37%), Gaps = 17/206 (8%) Query: 81 LRMAHDRHERWGAR----KIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRF 136 + H WG KI + G P+ + M + G Sbjct: 278 IIRNHCHENAWGGHFASQKIAIRVLQSGFCWPSLFKDAHTMCKSCD-RCQRLGKLTCRNM 336 Query: 137 EHDAP------NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 P +W +DF G FP G + L +D S++ + ++ R ++ Sbjct: 337 MPLNPILIVDFFYVWGIDFMGPFPMSFGYSYILVGVDYVSKWVEAVPCKHNDHRVVLKFL 396 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++E Sbjct: 397 KENIFSRFGVPKAIISDGGTHFCN-----KPFETLLTKYGVKHKVATPYHPQTSGQVELA 451 Query: 251 HRSLKAEVLQG-KWFADSGELQRAFD 275 +R +K ++ L+R D Sbjct: 452 NREIKNISMKMLNMDLSRAGLKRFLD 477 >UniRef50_Q43917 ORF2 gene product (Fragment) n=15 Tax=cellular organisms RepID=Q43917_ACIAD Length = 305 Score = 136 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 54/302 (17%), Positives = 91/302 (30%), Gaps = 44/302 (14%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + + + + S C+ G+S + Y W +R Sbjct: 12 QEKYTVIQDLDVNEVTVSSACKCLGVSTSGYYAWRKR-------------------QTNL 52 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + L + H R GA + + D G++M + TV ++ + GL + T Sbjct: 53 AQKYNDLKAVYWQHHARLGAPSLVHDMHDLGYSM-SERTVGRMLKKLGLRSKIARKYKHT 111 Query: 134 ---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 +F + PN++W D + G + +LD SR + Sbjct: 112 TDSNHRLPTAPNLLDRQFTVNEPNKIWTTDIT-YIRTKQGWLYLCVMLDLFSRRIVGWQT 170 Query: 179 CTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 R+ V R G P + D GS + L+ S Sbjct: 171 SHRIDRQLVCDAFHYAMARQGYPMGVMVHSDQGSQYCSRD-----FRALLLTNNCVQSMS 225 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVP 295 R + E F +LK ++ G FA E + YN R H P Sbjct: 226 RRGNCWDNAVTESFFHTLKGHMVHGSVFATRKEANAVLFDYIEIYYNRIRRHSTNGWLSP 285 Query: 296 GS 297 + Sbjct: 286 EA 287 >UniRef50_A5BWH5 Putative uncharacterized protein n=17 Tax=Vitis vinifera RepID=A5BWH5_VITVI Length = 2160 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 87/217 (40%), Gaps = 28/217 (12%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W ++F G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1255 FDVWGINFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVP 1314 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + ++ + PYHPQT G++E +R +K +++ Sbjct: 1315 KAIISDGGAHFCN-----KPFEALLSKYXVKHKVATPYHPQTSGQVELANREIKNTLMKV 1369 Query: 262 KWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 L + +RT Y L M+ Y+ + EY Sbjct: 1370 VNSXRKDWSIRLHDSLWAYRTAYKTI-----LRMSP----YRLVYGKACHLPVEVEYKAW 1420 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + KA + L EM+E Sbjct: 1421 WAIKKLN-----------MDLIKAGEKRYLXLNEMEE 1446 >UniRef50_A5AYT6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AYT6_VITVI Length = 1897 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 44/215 (20%), Positives = 91/215 (42%), Gaps = 32/215 (14%) Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G FP G + L +D S++ + T++ + ++ ++F R+G+P + Sbjct: 1444 GIDFMGPFPMSFGHSYILVGVDYVSKWVEVIPCXTNDHKVVLKFLRENIFSRFGVPKAII 1503 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ----- 260 D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1504 SDGGTHFCN-----KPFEALLAKYGVKHKVATPYHPQTSGQVELSNREIKNILMKVVNTN 1558 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVM 320 K ++ +L + +RT Y L M+ Y+ + E+ Sbjct: 1559 RKDWS--VKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPIEIEFKAWWA 1607 Query: 321 VRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + KA + L E++E Sbjct: 1608 IKKLN-----------MDLTKAGLKRSLDLNELEE 1631 >UniRef50_A5BNX4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BNX4_VITVI Length = 1468 Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 79/214 (36%), Gaps = 43/214 (20%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P Sbjct: 1190 FDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVP 1249 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYHPQT G++E + K E + Sbjct: 1250 KAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTFGQVEPSKQGNK-EHIDE 1303 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV 321 + L M+ Y + EY + Sbjct: 1304 SAYKTI----------------------LGMSP----YCLVYGKVCHLPVEVEYKAWWAI 1337 Query: 322 RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +K++ + +A + L EM+E Sbjct: 1338 KKLN-----------MDLIRAGAKRCLDLNEMEE 1360 >UniRef50_B0MP11 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MP11_9FIRM Length = 379 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 58/307 (18%), Positives = 102/307 (33%), Gaps = 35/307 (11%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P R + R + N+ +C F I T Y + R ++ Sbjct: 81 PCTPRAPLRQRL-YAAEQLYGKYNVHVICDAFDIPRGTFYNHVLRNKKDNT--------- 130 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 R + + + + ++ +GA KI ++ +G + V LM GL+ Sbjct: 131 --WYAKRREELRLRIQEIYDESNQIFGAAKIAAVMKSEGFKVSNEM-VRTLMRDMGLVSI 187 Query: 126 ASPGIPA------------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFS 173 F+ PN +W D +F +G + ++D SR Sbjct: 188 RQSAKKLYEDEGRKYKNLLNQEFDTCKPNEVWVSDVT-YFKYGENAYYICVIIDLFSRMV 246 Query: 174 LCLAHCTDERRETVQQQLVSVFERYGLPD---RMTMDNGSPWGDTTGTWTALELWLMRLG 230 + + V+ ++ PD D GS + T + ++ L Sbjct: 247 VGYKISKTNSTQLVKSTFQIAYKARQ-PDSSLVFHTDRGSNYRSKT-----MNDYMRSLH 300 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 I SR + P +E F S+K E L + + + A D + YN +RPH+ L Sbjct: 301 ITHSFSRAHVPYDNSVMESFFASMKREELYRTKYRSESDFRSAVDKYMIFYNTKRPHKKL 360 Query: 291 DMAVPGS 297 P Sbjct: 361 QYKTPEQ 367 >UniRef50_A8ZKJ8 Integrase, catalytic region n=10 Tax=Bacteria RepID=A8ZKJ8_ACAM1 Length = 290 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 54/287 (18%), Positives = 95/287 (33%), Gaps = 40/287 (13%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 I +C+ +S + Y W++R P H N + + + Sbjct: 8 VTITLMCKVLKLSRSGYYAWMKR------------QPSPRHQENAILSERIQQIHD--ES 53 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT-------------- 133 + +G+ +I L +G + V LMA+ G+ A T Sbjct: 54 RQTYGSPRIHASLIARGF-RASRQRVVRLMAQLGICAQAKRPFKVTTDSEHDGPIAPNIL 112 Query: 134 -GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 F + P++ W D + G + ++D SR + + R V L Sbjct: 113 DRTFTTEEPDQAWVADIT-YIRTHEGWLYLAVIIDLFSRRVVGWSMAEHMRTPLVLNALK 171 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTA---LELWLMRLGIRVGHSRPYHPQTQGKLER 249 + L R+ G + G+ A + L++ GI SR + E Sbjct: 172 AA-----LGQRIPAQTGLIFHSDRGSQYASGDYQQALLKRGITCSMSRRANCWDNAVAES 226 Query: 250 FHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVP 295 F +LK E++ FA+ + W YN +R H + P Sbjct: 227 FFGTLKTELIYPTTFANRAMAKTVIAEWIEVFYNRQRLHSTIGYCTP 273 >UniRef50_A5BIQ2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BIQ2_VITVI Length = 1420 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 93/241 (38%), Gaps = 28/241 (11%) Query: 120 HGLLPGASPGIPATGRFEHDAPN--------RLWQMDFKGHFPFGGGRCHPLTLLDDHSR 171 + L+ G P EHDA N +W +DF G FP G + L +D S+ Sbjct: 1019 NYLVTGEVPKAWEVDTPEHDALNPILIVDLFDVWGIDFIGPFPMSFGYSYILVGVDYVSK 1078 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 + + ++ R ++ ++F R+G+P + D + + + + L + G+ Sbjct: 1079 WVEAVPCKHNDHRMVLKFLKENIFSRFGVPKAIISDGSTHFYN-----KPFQTLLAKYGV 1133 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPH 287 + + PYH QT G++E +R +K ++ + + L + +RT Y Sbjct: 1134 KHKVATPYHXQTSGQVELANREIKNISMKVVN-TNRKDWSIKLLDSLWAYRTTYKTI--- 1189 Query: 288 EALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGER 347 L M+ Y + + + R +D++ ++ + E+ Sbjct: 1190 --LGMSP----YHLVYGKACHLPLNMDLSRAGLKRFLDLNEMEELRNDTY-INSKIAKEK 1242 Query: 348 V 348 + Sbjct: 1243 L 1243 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37007 Uncharacterized protein yagA n=15 Tax=Bacteria R... 405 e-111 UniRef50_C6AZP5 Integrase catalytic region n=3 Tax=Rhizobium leg... 312 2e-83 UniRef50_Q07SS7 Integrase, catalytic region n=5 Tax=Alphaproteob... 305 2e-81 UniRef50_C3MF40 Integrase catalytic core domain protein n=3 Tax=... 302 1e-80 UniRef50_A4WDP4 Integrase, catalytic region n=4 Tax=Enterobacter... 302 2e-80 UniRef50_A9BRN3 Integrase catalytic region n=7 Tax=Proteobacteri... 299 8e-80 UniRef50_A9ER25 Transposase n=4 Tax=Sorangium cellulosum 'So ce ... 299 1e-79 UniRef50_B8GMF2 Integrase catalytic region n=2 Tax=Gammaproteoba... 295 2e-78 UniRef50_B0T7X0 Integrase catalytic region n=15 Tax=Alphaproteob... 293 7e-78 UniRef50_C3K093 Putative integrase n=2 Tax=Pseudomonas fluoresce... 293 9e-78 UniRef50_Q01QQ4 Integrase, catalytic region n=9 Tax=Bacteria Rep... 292 1e-77 UniRef50_B4UMG9 Integrase catalytic region n=3 Tax=Bacteria RepI... 292 2e-77 UniRef50_D2LAL9 Integrase catalytic region n=1 Tax=Desulfovibrio... 284 5e-75 UniRef50_UPI00019025E5 ISHne2, transposase n=1 Tax=Rhizobium etl... 278 2e-73 UniRef50_D2ML31 Integrase, catalytic region (Fragment) n=1 Tax=C... 278 3e-73 UniRef50_UPI00017F3A47 integrase catalytic subunit n=1 Tax=Esche... 275 1e-72 UniRef50_A5ASD2 Putative uncharacterized protein n=7 Tax=Vitis v... 268 3e-70 UniRef50_Q82H05 Putative IS481 family ISMav2-like transposase n=... 265 2e-69 UniRef50_A5C4R5 Putative uncharacterized protein n=1 Tax=Vitis v... 263 9e-69 UniRef50_Q4JUH9 Transposase for IS3514b n=9 Tax=Corynebacterium ... 262 2e-68 UniRef50_A5CBG5 Putative uncharacterized protein n=4 Tax=Vitis v... 261 3e-68 UniRef50_A5C1P8 Putative uncharacterized protein n=4 Tax=Vitis v... 259 2e-67 UniRef50_B2HR82 Transposase for ISMyma05 n=5 Tax=Mycobacterium R... 257 4e-67 UniRef50_A5BYC4 Putative uncharacterized protein n=5 Tax=Vitis v... 256 7e-67 UniRef50_A1T2L4 Integrase, catalytic region n=4 Tax=Actinomyceta... 256 1e-66 UniRef50_A5AMG6 Putative uncharacterized protein n=3 Tax=Vitis v... 254 4e-66 UniRef50_Q4JWW8 Transposase for IS3511a n=7 Tax=Corynebacterium ... 253 7e-66 UniRef50_UPI0001B453C4 transposase for IS3514a n=1 Tax=Mycobacte... 252 1e-65 UniRef50_A5BTM1 Putative uncharacterized protein n=31 Tax=Vitis ... 251 3e-65 UniRef50_A5CA04 Putative uncharacterized protein n=3 Tax=Vitis v... 250 5e-65 UniRef50_Q5ZTP2 Transposase (ISmav2) n=14 Tax=Proteobacteria Rep... 250 5e-65 UniRef50_A5C2R0 Putative uncharacterized protein n=10 Tax=Vitis ... 250 7e-65 UniRef50_A5AQ03 Putative uncharacterized protein n=5 Tax=Vitis v... 247 6e-64 UniRef50_A5BFN4 Putative uncharacterized protein n=2 Tax=Vitis v... 246 9e-64 UniRef50_UPI0001B45627 transposase for ISMyma05 n=1 Tax=Mycobact... 246 1e-63 UniRef50_A5AKZ0 Putative uncharacterized protein n=18 Tax=Vitis ... 246 1e-63 UniRef50_A5B5G8 Putative uncharacterized protein n=2 Tax=Vitis v... 244 4e-63 UniRef50_A5B9R1 Putative uncharacterized protein n=10 Tax=Vitis ... 244 6e-63 UniRef50_A5BI69 Putative uncharacterized protein n=1 Tax=Vitis v... 243 1e-62 UniRef50_A5BJN2 Putative uncharacterized protein n=5 Tax=Vitis v... 242 1e-62 UniRef50_A5BWF3 Putative uncharacterized protein n=9 Tax=Vitis v... 242 2e-62 UniRef50_A5BSN7 Putative uncharacterized protein n=16 Tax=Vitis ... 242 2e-62 UniRef50_A5BJ10 Putative uncharacterized protein n=3 Tax=Vitis v... 242 2e-62 UniRef50_C8XFB0 Integrase catalytic region n=4 Tax=Actinomycetal... 240 6e-62 UniRef50_A5B2X9 Putative uncharacterized protein n=4 Tax=Vitis v... 240 7e-62 UniRef50_A5BFS9 Putative uncharacterized protein n=9 Tax=Vitis v... 240 7e-62 UniRef50_B6J0H9 Transposase n=3 Tax=Coxiella burnetii RepID=B6J0... 240 8e-62 UniRef50_A5AIU0 Putative uncharacterized protein n=2 Tax=Vitis v... 240 9e-62 UniRef50_A5BYU9 Putative uncharacterized protein n=8 Tax=Vitis v... 239 1e-61 UniRef50_A8F1V3 Transposase and inactivated derivative n=4 Tax=B... 239 2e-61 UniRef50_B1ZP18 Integrase catalytic region n=1 Tax=Opitutus terr... 237 7e-61 UniRef50_A5AHC2 Putative uncharacterized protein n=3 Tax=Vitis v... 236 9e-61 UniRef50_A5AKV0 Putative uncharacterized protein n=16 Tax=Vitis ... 236 1e-60 UniRef50_A5AH69 Putative uncharacterized protein n=3 Tax=Vitis v... 235 2e-60 UniRef50_A5BKJ4 Putative uncharacterized protein n=2 Tax=Vitis v... 235 2e-60 UniRef50_A5BT93 Putative uncharacterized protein n=1 Tax=Vitis v... 235 3e-60 UniRef50_A5BMC5 Putative uncharacterized protein n=9 Tax=Vitis v... 234 6e-60 UniRef50_A5APG9 Putative uncharacterized protein n=11 Tax=Vitis ... 233 7e-60 UniRef50_A5AYI6 Putative uncharacterized protein n=7 Tax=Vitis v... 233 1e-59 UniRef50_A1UAJ3 Integrase, catalytic region n=7 Tax=Actinomyceta... 232 1e-59 UniRef50_A6DU50 Putative ISmav2-like transposase n=1 Tax=Lentisp... 232 2e-59 UniRef50_A5AY91 Putative uncharacterized protein n=1 Tax=Vitis v... 232 2e-59 UniRef50_C5D6W5 Integrase catalytic region n=19 Tax=Firmicutes R... 231 4e-59 UniRef50_A5BWY6 Putative uncharacterized protein n=3 Tax=Vitis v... 230 5e-59 UniRef50_A3TPQ6 Transposase n=7 Tax=Actinomycetales RepID=A3TPQ6... 230 8e-59 UniRef50_A5AH70 Putative uncharacterized protein n=20 Tax=Vitis ... 229 1e-58 UniRef50_A5AWF5 Putative uncharacterized protein n=16 Tax=Vitis ... 229 2e-58 UniRef50_UPI0001B540A0 transposase for IS3514a n=1 Tax=Streptomy... 228 2e-58 UniRef50_Q3SW20 Helix-turn-helix, Fis-type n=112 Tax=Bacteria Re... 228 3e-58 UniRef50_B7GET7 Transposase n=6 Tax=Bacillales RepID=B7GET7_ANOFW 228 4e-58 UniRef50_A8ZKJ8 Integrase, catalytic region n=10 Tax=Bacteria Re... 227 6e-58 UniRef50_A5B960 Putative uncharacterized protein n=1 Tax=Vitis v... 227 6e-58 UniRef50_A5BSG6 Putative uncharacterized protein n=1 Tax=Vitis v... 226 1e-57 UniRef50_UPI0001986237 PREDICTED: hypothetical protein n=1 Tax=V... 226 1e-57 UniRef50_A5C4S0 Putative uncharacterized protein n=1 Tax=Vitis v... 226 1e-57 UniRef50_A5C046 Putative uncharacterized protein n=3 Tax=Vitis v... 225 2e-57 UniRef50_A5AMM4 Putative uncharacterized protein n=14 Tax=Vitis ... 225 2e-57 UniRef50_A5AVQ5 Putative uncharacterized protein n=9 Tax=Vitis v... 225 2e-57 UniRef50_A5AQQ3 Putative uncharacterized protein n=1 Tax=Vitis v... 225 2e-57 UniRef50_A5ASA6 Putative uncharacterized protein n=2 Tax=Vitis v... 225 2e-57 UniRef50_A5BI07 Putative uncharacterized protein n=7 Tax=Vitis v... 224 3e-57 UniRef50_A5BVP5 Putative uncharacterized protein n=2 Tax=Vitis v... 224 5e-57 UniRef50_A5AWA7 Putative uncharacterized protein n=6 Tax=Vitis v... 223 6e-57 UniRef50_A5BPW1 Putative uncharacterized protein n=3 Tax=Vitis v... 223 6e-57 UniRef50_A5B3F9 Putative uncharacterized protein n=5 Tax=Vitis v... 223 8e-57 UniRef50_C6PFD7 Integrase catalytic region n=3 Tax=Thermoanaerob... 222 1e-56 UniRef50_Q2JC89 Integrase n=5 Tax=Actinomycetales RepID=Q2JC89_F... 222 2e-56 UniRef50_A5B213 Putative uncharacterized protein n=1 Tax=Vitis v... 222 3e-56 UniRef50_A5BJP7 Putative uncharacterized protein n=6 Tax=Vitis v... 221 3e-56 UniRef50_A5ACN5 Putative uncharacterized protein n=7 Tax=Vitis v... 221 4e-56 UniRef50_Q3A8V0 ISChy3, transposase n=5 Tax=Clostridia RepID=Q3A... 220 6e-56 UniRef50_Q43917 ORF2 gene product (Fragment) n=15 Tax=cellular o... 219 1e-55 UniRef50_A5BWH5 Putative uncharacterized protein n=17 Tax=Vitis ... 219 1e-55 UniRef50_C4KRZ4 Integrase core domain protein n=59 Tax=Proteobac... 219 1e-55 UniRef50_B8FAB2 Integrase catalytic region n=70 Tax=Bacteria Rep... 219 1e-55 UniRef50_A5BY78 Putative uncharacterized protein n=2 Tax=Vitis v... 219 1e-55 UniRef50_C3LLF8 IS1627, transposase n=27 Tax=Bacillaceae RepID=C... 219 2e-55 UniRef50_A5B504 Putative uncharacterized protein n=11 Tax=Vitis ... 218 3e-55 UniRef50_D1VRH7 Integrase catalytic region n=1 Tax=Frankia sp. E... 217 4e-55 UniRef50_A5AFC7 Putative uncharacterized protein n=8 Tax=Vitis v... 217 4e-55 UniRef50_B4RV10 Integrase, catalytic region n=16 Tax=Proteobacte... 217 6e-55 UniRef50_Q2YZQ9 Transposase n=3 Tax=Bacteria RepID=Q2YZQ9_9DELT 217 7e-55 UniRef50_A5CA05 Putative uncharacterized protein n=2 Tax=Vitis v... 217 7e-55 UniRef50_P25438 Insertion element IS476 uncharacterized 39.2 kDa... 216 9e-55 UniRef50_A7B7Y8 Putative uncharacterized protein n=4 Tax=Clostri... 216 1e-54 UniRef50_A5AHS1 Putative uncharacterized protein n=2 Tax=Vitis v... 215 2e-54 UniRef50_A5C995 Putative uncharacterized protein n=1 Tax=Vitis v... 215 2e-54 UniRef50_A3J543 Helix-turn-helix, Fis-type protein n=6 Tax=Bacte... 215 2e-54 UniRef50_D1YV08 Putative transposase orfB for insertion sequence... 215 2e-54 UniRef50_A5AWI1 Putative uncharacterized protein n=2 Tax=Vitis v... 214 5e-54 UniRef50_P24577 Insertion element IS407 uncharacterized 31.7 kDa... 213 7e-54 UniRef50_A4G2L6 Transposase IS3 family, part 2 n=17 Tax=Bacteria... 213 9e-54 UniRef50_C8XFJ9 Integrase catalytic region n=1 Tax=Nakamurella m... 213 1e-53 UniRef50_C7MHU2 Integrase family protein n=3 Tax=Actinomycetales... 213 1e-53 UniRef50_A5AYT6 Putative uncharacterized protein n=3 Tax=Vitis v... 212 1e-53 UniRef50_A5B5S6 Putative uncharacterized protein n=2 Tax=Vitis v... 212 2e-53 UniRef50_B8FYC8 Transposase IS3/IS911 family protein n=6 Tax=Clo... 212 2e-53 UniRef50_B3PDG9 IS3 family transposase, orfB n=3 Tax=Gammaproteo... 212 2e-53 UniRef50_C1F0V3 IS3 family transposase orfB n=1 Tax=Acidobacteri... 212 3e-53 UniRef50_A5BN44 Putative uncharacterized protein n=2 Tax=Vitis v... 211 3e-53 UniRef50_Q7M7E8 Transposase and inactivated derivative n=14 Tax=... 211 4e-53 UniRef50_A3XG72 Transposase-like n=3 Tax=Leeuwenhoekiella blande... 211 4e-53 UniRef50_A5B346 Putative uncharacterized protein n=2 Tax=Vitis v... 211 4e-53 UniRef50_B3PKR5 Transposase n=7 Tax=Gammaproteobacteria RepID=B3... 210 5e-53 UniRef50_A5BPP5 Putative uncharacterized protein n=1 Tax=Vitis v... 210 7e-53 UniRef50_Q1DAH7 Transposase orfB, IS3 family n=29 Tax=Proteobact... 210 7e-53 UniRef50_B4RA95 Transposase, IS1477 n=35 Tax=Proteobacteria RepI... 210 7e-53 UniRef50_A9VK05 Integrase catalytic region n=17 Tax=Bacillaceae ... 210 8e-53 UniRef50_B0TDR5 Transposase, putative n=5 Tax=Firmicutes RepID=B... 210 9e-53 UniRef50_B7VPV6 Transposase (OrfB) of insertion sequence ISVisp1... 210 9e-53 UniRef50_A4TG41 Integrase, catalytic region n=32 Tax=Actinomycet... 209 1e-52 UniRef50_A5B7N0 Putative uncharacterized protein n=17 Tax=Vitis ... 209 1e-52 UniRef50_A5APW4 Putative uncharacterized protein n=3 Tax=Vitis v... 208 2e-52 UniRef50_C1XPR1 Transcriptional regulator/sugar kinase n=14 Tax=... 208 2e-52 UniRef50_C6VW29 Integrase catalytic region n=2 Tax=Sphingobacter... 208 2e-52 UniRef50_C7RJ38 Integrase catalytic region n=5 Tax=Proteobacteri... 208 3e-52 UniRef50_Q486I3 ISCps2, transposase orfB n=1 Tax=Colwellia psych... 208 3e-52 UniRef50_C1RGI3 Transposase n=2 Tax=Actinomycetales RepID=C1RGI3... 208 4e-52 UniRef50_A6VE65 Transposase InsF for insertion sequence IS3A/B/C... 207 4e-52 UniRef50_C1PC72 Integrase catalytic region n=1 Tax=Bacillus coag... 207 5e-52 UniRef50_A3ZSC8 Transposase orfB n=1 Tax=Blastopirellula marina ... 207 6e-52 UniRef50_A4JLW8 Integrase, catalytic region n=9 Tax=Proteobacter... 207 7e-52 UniRef50_B4S6V0 Integrase catalytic region n=10 Tax=Bacteria Rep... 207 9e-52 UniRef50_Q0ZCB7 Integrase n=4 Tax=Eukaryota RepID=Q0ZCB7_POPTR 206 1e-51 UniRef50_Q3BT31 Transposase n=22 Tax=Bacteria RepID=Q3BT31_XANC5 206 1e-51 UniRef50_D1ZYT7 Whole genome shotgun sequence assembly, contig_4... 206 1e-51 UniRef50_A7VUR6 Putative uncharacterized protein n=3 Tax=Clostri... 205 2e-51 UniRef50_A5BAC6 Putative uncharacterized protein n=1 Tax=Vitis v... 205 2e-51 UniRef50_B0NHH2 Putative uncharacterized protein (Fragment) n=2 ... 205 2e-51 UniRef50_A5BFP8 Putative uncharacterized protein n=4 Tax=Vitis v... 205 3e-51 UniRef50_Q122F4 Integrase, catalytic region n=19 Tax=Proteobacte... 204 4e-51 UniRef50_C8PWM8 Transposase B n=2 Tax=Enhydrobacter aerosaccus S... 204 4e-51 UniRef50_B5K429 Integrase, catalytic region n=41 Tax=cellular or... 204 5e-51 UniRef50_Q24ZR5 Putative uncharacterized protein n=3 Tax=Desulfi... 204 6e-51 UniRef50_B4E5J2 Transposase n=21 Tax=Proteobacteria RepID=B4E5J2... 204 6e-51 UniRef50_C3TSE1 Putative transposase n=1 Tax=Enterococcus faeciu... 203 6e-51 UniRef50_B2GC00 Transposase n=14 Tax=root RepID=B2GC00_LACF3 203 7e-51 UniRef50_B2JXI0 Integrase catalytic region n=13 Tax=Bacteria Rep... 203 7e-51 UniRef50_A5B281 Putative uncharacterized protein n=2 Tax=Vitis v... 203 8e-51 UniRef50_A4G5E5 Transposase IS3 family, partial pseudogene n=7 T... 203 8e-51 UniRef50_B4UYZ0 Integrase n=7 Tax=Streptomyces RepID=B4UYZ0_9ACTO 203 8e-51 UniRef50_C4FKG6 Integrase, catalytic region n=1 Tax=Sulfurihydro... 203 9e-51 UniRef50_A3QMY0 Transposase n=37 Tax=Bacilli RepID=A3QMY0_ENTFC 203 9e-51 UniRef50_B8J8P0 Integrase catalytic region n=1 Tax=Anaeromyxobac... 203 1e-50 UniRef50_C6P7L8 Integrase catalytic region n=1 Tax=Sideroxydans ... 202 1e-50 UniRef50_A5WDE3 Integrase, catalytic region n=13 Tax=Moraxellace... 202 1e-50 UniRef50_A3DCZ2 Integrase, catalytic region n=10 Tax=Clostridium... 202 1e-50 UniRef50_C7PFG8 Integrase catalytic region n=3 Tax=Chitinophaga ... 202 2e-50 UniRef50_D0KDH0 Integrase catalytic region n=3 Tax=Gammaproteoba... 202 2e-50 UniRef50_C8WWR8 Integrase catalytic region n=1 Tax=Alicyclobacil... 202 2e-50 UniRef50_Q1NW03 Integrase, catalytic region n=7 Tax=Proteobacter... 202 2e-50 UniRef50_A5B5S8 Putative uncharacterized protein n=2 Tax=Vitis v... 201 3e-50 UniRef50_A4Z1R9 Putative transposase, probably encoded by an uni... 201 3e-50 UniRef50_Q0RW72 Possible transposase n=13 Tax=Actinomycetales Re... 201 4e-50 UniRef50_A4J392 Integrase, catalytic region n=22 Tax=Clostridia ... 201 4e-50 UniRef50_C1DTA7 Putative transposase n=2 Tax=Sulfurihydrogenibiu... 201 4e-50 UniRef50_C4YYX7 Integrase catalytic region n=2 Tax=Rickettsia en... 200 5e-50 UniRef50_A6VYF3 Integrase catalytic region n=14 Tax=Bacteria Rep... 200 5e-50 UniRef50_A4SIH8 IS3-family transposase n=42 Tax=Proteobacteria R... 200 8e-50 UniRef50_Q2AA50 Retrotransposon gag protein n=6 Tax=Asparagus of... 200 9e-50 UniRef50_C4UEN4 Transposase n=1 Tax=Yersinia aldovae ATCC 35236 ... 200 9e-50 UniRef50_D1PJY7 ISMca2, transposase n=1 Tax=Subdoligranulum vari... 200 9e-50 UniRef50_A6Q4E4 Transposase n=2 Tax=Nitratiruptor sp. SB155-2 Re... 200 9e-50 UniRef50_B3E8B6 Integrase catalytic region n=1 Tax=Geobacter lov... 199 1e-49 UniRef50_C1F5F4 IS3 family transposase orfB n=1 Tax=Acidobacteri... 199 1e-49 UniRef50_Q1GCB4 Integrase catalytic region n=29 Tax=Alphaproteob... 199 2e-49 UniRef50_A5C747 Putative uncharacterized protein n=1 Tax=Vitis v... 199 2e-49 UniRef50_D0BAE7 IS3 family transposase (Fragment) n=153 Tax=Bact... 199 2e-49 UniRef50_C2GDW5 IS3514a transposase n=1 Tax=Corynebacterium gluc... 199 2e-49 UniRef50_A8LD08 Integrase catalytic region n=4 Tax=Frankia RepID... 198 2e-49 UniRef50_C1A8I3 Putative transposase orfB for insertion sequence... 198 3e-49 UniRef50_B4X146 Integrase core domain protein n=3 Tax=Alcanivora... 198 3e-49 UniRef50_C9XPC5 Transposase n=10 Tax=Bacteria RepID=C9XPC5_CLODC 198 3e-49 UniRef50_B3EBT2 Integrase catalytic region n=87 Tax=Bacteria Rep... 198 4e-49 UniRef50_C3KV53 Transposase n=9 Tax=Clostridium botulinum RepID=... 198 4e-49 UniRef50_B5CNC7 Putative uncharacterized protein n=5 Tax=Clostri... 197 4e-49 UniRef50_Q9JMT3 Transposase insF for insertion sequence IS3fB n=... 197 5e-49 UniRef50_A5GAT8 Integrase, catalytic region n=1 Tax=Geobacter ur... 197 5e-49 UniRef50_C9XK98 Integrase, catalytic region n=10 Tax=Clostridium... 197 5e-49 UniRef50_A5B8V2 Putative uncharacterized protein n=1 Tax=Vitis v... 197 5e-49 UniRef50_Q3A4V8 Transposase and inactivated derivatives n=9 Tax=... 197 5e-49 UniRef50_A1UQ90 Integrase, catalytic region n=31 Tax=Actinomycet... 197 8e-49 UniRef50_Q7TT98 InsB n=4 Tax=Bacteria RepID=Q7TT98_RHOBA 197 8e-49 UniRef50_O28862 ISA0963-5, putative transposase n=5 Tax=Archaeog... 197 8e-49 UniRef50_Q8XPL1 Isrso16-transposase orfb protein n=2 Tax=Ralston... 196 1e-48 UniRef50_B9XEB0 Integrase catalytic region n=1 Tax=bacterium Ell... 196 1e-48 UniRef50_Q01TL7 Integrase, catalytic region n=5 Tax=Bacteria Rep... 196 1e-48 UniRef50_A5BTF9 Putative uncharacterized protein n=1 Tax=Vitis v... 196 1e-48 UniRef50_B7IVG1 Integrase core domain protein n=61 Tax=Bacillus ... 196 1e-48 UniRef50_A0JV34 Integrase, catalytic region n=12 Tax=Actinomycet... 196 1e-48 UniRef50_B0MP11 Putative uncharacterized protein n=1 Tax=Eubacte... 196 1e-48 UniRef50_C8VYH9 Transposase IS3/IS911 family protein n=11 Tax=Ba... 196 1e-48 UniRef50_Q24VK2 Putative uncharacterized protein n=3 Tax=Clostri... 196 1e-48 UniRef50_A4JLF1 Integrase, catalytic region n=12 Tax=Proteobacte... 196 1e-48 UniRef50_A6WME4 Transposase IS3/IS911 family protein n=33 Tax=Ga... 196 1e-48 UniRef50_Q24NL7 Putative uncharacterized protein n=1 Tax=Desulfi... 196 1e-48 UniRef50_C0ZBG4 Putative transposase orfB for insertion sequence... 196 1e-48 UniRef50_Q4FQT2 Transposase OrfB n=179 Tax=Bacteria RepID=Q4FQT2... 195 2e-48 UniRef50_C0QA21 Transposase /integrase family protein n=1 Tax=De... 195 2e-48 UniRef50_Q310V5 Transposase-like n=4 Tax=Deltaproteobacteria Rep... 195 2e-48 UniRef50_Q315H1 Putative transposase B n=2 Tax=Desulfovibrio des... 195 2e-48 UniRef50_B2SZK4 Transposase IS3/IS911 family protein n=10 Tax=ro... 195 2e-48 UniRef50_Q1WRA6 Transposase ISLasa15, IS3 family n=27 Tax=Firmic... 195 2e-48 UniRef50_Q0ZCC0 Gag protein n=2 Tax=Populus trichocarpa RepID=Q0... 195 3e-48 UniRef50_Q033F5 Transposase n=46 Tax=Streptococcaceae RepID=Q033... 195 3e-48 UniRef50_A6LT97 Integrase, catalytic region n=5 Tax=Firmicutes R... 195 3e-48 UniRef50_B0BYN1 Integrase, catalytic region n=6 Tax=Bacteria Rep... 195 3e-48 UniRef50_A6BYW2 Integrase, catalytic region n=1 Tax=Planctomyces... 194 4e-48 UniRef50_A1T428 Integrase, catalytic region n=2 Tax=Actinobacter... 194 4e-48 UniRef50_C1F6R2 ISAca4, transposase orfB n=1 Tax=Acidobacterium ... 194 5e-48 UniRef50_A0AXB8 Integrase, catalytic region n=27 Tax=Betaproteob... 194 5e-48 UniRef50_A8L908 Integrase catalytic region n=8 Tax=Actinomycetal... 194 6e-48 UniRef50_A5BNX4 Putative uncharacterized protein n=2 Tax=Vitis v... 193 6e-48 UniRef50_Q2P621 ISXoo3 transposase n=194 Tax=Proteobacteria RepI... 193 6e-48 UniRef50_A1SU53 Integrase, catalytic region n=15 Tax=Proteobacte... 193 7e-48 UniRef50_A4A249 Transposase orfB n=4 Tax=Planctomycetaceae RepID... 193 7e-48 UniRef50_C1XUW8 Transposase n=2 Tax=Meiothermus silvanus DSM 994... 193 7e-48 UniRef50_B5YKC0 Putative transposase n=3 Tax=Thermodesulfovibrio... 193 8e-48 UniRef50_C1F2E9 IS3 family transposase orfB n=1 Tax=Acidobacteri... 193 1e-47 UniRef50_A1VJC3 Integrase, catalytic region n=25 Tax=Bacteria Re... 193 1e-47 UniRef50_A8LT45 Integrase n=5 Tax=Bacteria RepID=A8LT45_DINSH 193 1e-47 UniRef50_Q8NL32 Predicted transposase n=7 Tax=Corynebacterium Re... 192 1e-47 UniRef50_UPI0001B46226 putative transposase n=1 Tax=Mycobacteriu... 192 2e-47 UniRef50_A1JLT7 Transposase for insertion element IS1222 n=8 Tax... 192 2e-47 Sequences not found previously or not previously below threshold: UniRef50_A5CBG2 Putative uncharacterized protein n=1 Tax=Vitis v... 211 4e-53 UniRef50_A5C6P4 Putative uncharacterized protein n=7 Tax=Vitis v... 211 5e-53 UniRef50_A5BI47 Putative uncharacterized protein n=10 Tax=Vitis ... 205 3e-51 UniRef50_A5B4Q4 Putative uncharacterized protein n=13 Tax=Vitis ... 198 3e-49 UniRef50_A5AHC9 Putative uncharacterized protein n=14 Tax=Vitis ... 197 9e-49 UniRef50_B6BJU9 Transposase n=3 Tax=Campylobacterales bacterium ... 192 2e-47 >UniRef50_P37007 Uncharacterized protein yagA n=15 Tax=Bacteria RepID=YAGA_ECOLI Length = 384 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 384/384 (100%), Positives = 384/384 (100%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ Sbjct: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH Sbjct: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 Query: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT Sbjct: 121 GLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH Sbjct: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ Sbjct: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE Sbjct: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 Query: 361 VWWYSTKVGVIDLKKKSITMGKGC 384 VWWYSTKVGVIDLKKKSITMGKGC Sbjct: 361 VWWYSTKVGVIDLKKKSITMGKGC 384 >UniRef50_C6AZP5 Integrase catalytic region n=3 Tax=Rhizobium leguminosarum bv. trifolii WSM1325 RepID=C6AZP5_RHILS Length = 402 Score = 312 bits (799), Expect = 2e-83, Method: Composition-based stats. Identities = 124/384 (32%), Positives = 181/384 (47%), Gaps = 10/384 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M R FV + +LC +GIS TGYKWL+R+ G AGL D PR Sbjct: 1 MVWRETGIMDERLRFVGECLAGEETMTALCAAYGISRKTGYKWLERYRALGPAGLIDLPR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGL 122 P ++ ++ A + + + +WG +K+ L+ PA ST+ ++ RHGL Sbjct: 61 APLEHGRATAAELVARIVAEKEANPQWGPKKVLARLKRSAPQLCWPAASTIGEILKRHGL 120 Query: 123 LPGASPGIPAT---GRFEHDAPNRLWQMDFKGHFPF-GGGRCHPLTLLDDHSRFSLCLAH 178 + A + PN +W D+KG F G RC PLT++D SRF L L Sbjct: 121 VGRRRHRWRAAGCGPFAPANGPNAVWSADYKGWFRTRDGRRCEPLTVMDTASRFLLALEA 180 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGIRVGHSR 237 C +F +GLP+R DNGSP+ T T L + ++LGI + + Sbjct: 181 CATPAEVEAWPVFERLFAEHGLPERFRSDNGSPFAAIGVTGLTTLAVRFIKLGIGLERIQ 240 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P PQ G+ ERFH ++ L D Q FD +R YN ERPHEAL M VP Sbjct: 241 PGKPQQNGRHERFHLTML--PLAMAPEVDHAAQQAVFDAFRQNYNAERPHEALAMDVPAD 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 Y+PS R+ P+Y VR+V +G++ G + A GE V ++E + G Sbjct: 299 HYRPSLRRLPDRLPEPDYPAEAAVRRVRSNGEIKWNGDLVYVAAALAGEVVAIEESEA-G 357 Query: 358 SYEVWWYSTKVGVIDLKKKSITMG 381 + + +++ +G+ID K K + Sbjct: 358 IWTLRFHAHPLGIIDKKTKRLVRP 381 >UniRef50_Q07SS7 Integrase, catalytic region n=5 Tax=Alphaproteobacteria RepID=Q07SS7_RHOP5 Length = 582 Score = 305 bits (782), Expect = 2e-81, Method: Composition-based stats. Identities = 122/389 (31%), Positives = 175/389 (44%), Gaps = 11/389 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W + R FV+ + +CRRFG+S TGYKWL+R+ EG AGL DR R Sbjct: 1 MGWMETRVVDERMRFVMAVADHEEAFAVVCRRFGVSRRTGYKWLERYDAEGVAGLMDRSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGL 122 PH P + + H WG KI+ WL ++ PA ST+ L+ R GL Sbjct: 61 APHSHPQAIAAPLAERCLAVRRAHPTWGPVKIRHWLAERDGATEWPAPSTIGALLDREGL 120 Query: 123 LPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGGRCH-PLTLLDDHSRFSLCLA 177 + N +W MDFKG F G G C PLTL D +SR+ L Sbjct: 121 TVKRRLRRRSPPSSVPFGHCGGANDIWCMDFKGWFLTGDGSCCEPLTLSDAYSRYLLRCQ 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGHS 236 V L + F +GLP R+ DNG P+ G + L + +++ G+ Sbjct: 181 ALARTDTAHVWPVLEAAFREFGLPHRLRSDNGPPFASCGAGGLSRLAVQVIKAGVVPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P PQ G+LER H +LK + +L+R ++ +YN ERPH+AL P Sbjct: 241 APGKPQQNGRLERLHLTLKQDTAMPPAQTLPEQLKR-LRAFQRLYNEERPHQALGNDTPS 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 Y S R++ G P+Y VR+V +G + G + +A GE +GL E + Sbjct: 300 QHYARSPRRFDGCLRAPDYGPDQTVRRVRSNGAIKWGGNEIYINEALAGEPIGLTEQP-N 358 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMG-KGC 384 GS+ + +GVI + + +GC Sbjct: 359 GSFAASYGPIVLGVIAHRGNQLRKAKRGC 387 >UniRef50_C3MF40 Integrase catalytic core domain protein n=3 Tax=Rhizobium sp. NGR234 RepID=C3MF40_RHISN Length = 400 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 125/383 (32%), Positives = 176/383 (45%), Gaps = 10/383 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M W M R +FV + LC +GIS TGYKWL+R+ G AGL+D PR Sbjct: 1 MVWRETGIMEERLKFVAACLSGEETMAGLCALYGISRKTGYKWLRRFQLRGPAGLEDLPR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGL 122 P + ++ ++ + + H WG +KI L Q P+ ST ++ RHGL Sbjct: 61 APLNHGRATAAELVERIVAEKEAHPLWGPKKIVARLARQDPATAWPSASTAGAILNRHGL 120 Query: 123 LPGASPGIPATGR---FEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAH 178 + G E PN +W D KG F G C PLT++D SR+ L L Sbjct: 121 VGRRRARWKGAGNGPWPEPAMPNAVWTGDHKGWFTTRDGWRCEPLTVMDVKSRYLLALEA 180 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT-WTALELWLMRLGIRVGHSR 237 E +F+ +GLPDR+ DNG P+ T T L L +RLGI + Sbjct: 181 TGSTGDEEAWPVFERLFDEHGLPDRIRTDNGPPFAAAGVTGLTPLSLRFVRLGITLERIA 240 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P PQ GK ERFH ++ L AD AF+ +R YN ERPHE L M P Sbjct: 241 PGKPQQNGKHERFHLTML--PLAKAPAADRAAQAEAFEAFRREYNEERPHETLGMDTPAE 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 Y+ S R+ + P+Y VRKV +G + +G + GE V ++E E G Sbjct: 299 HYRASTRKMPVSPPEPDYPAEAAVRKVRHNGAVKWQGAEIYVSATLVGEVVAIEE-TESG 357 Query: 358 SYEVWWYSTKVGVIDLKKKSITM 380 + + +Y+ ++G ID K+ + Sbjct: 358 EWAMRFYAHRLGFIDEKRGRLVR 380 >UniRef50_A4WDP4 Integrase, catalytic region n=4 Tax=Enterobacter sp. 638 RepID=A4WDP4_ENT38 Length = 382 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 122/384 (31%), Positives = 175/384 (45%), Gaps = 10/384 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW TM R +FV + + +CRRF IS TGYKWL R++ + A L DR R Sbjct: 1 MPWTETVTM-QRLQFVAACLEGNLPVAEVCRRFNISRKTGYKWLARFSPDDTASLADRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLL 123 HH N + + + LL +H WG KI++ L + T +PA ST+ L HGL+ Sbjct: 60 ARHHQ-NSTPEPMVQLLLDTKQQHPLWGPDKIRQRLLNLNITGVPAASTIGELFRVHGLV 118 Query: 124 PGASPGIPATGRF----EHDAPNRLWQMDFKGHFP-FGGGRCHPLTLLDDHSRFSLCLAH 178 P + R PN +W DFKG F GG CHP TL D+ SR L Sbjct: 119 KKRRPPAFKSTRPHELHTVAHPNDVWSADFKGKFTHTGGRWCHPFTLTDNCSRIVLACDA 178 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSR 237 V L VF G+P + DNG P+ + + +WL++ G+ R Sbjct: 179 TYMPDGRFVIPCLERVFRECGMPQVLRTDNGPPFAGAGLWGLSQMSIWLIKCGVLPERIR 238 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P P G+ ER HR+LK + + F E Q D WR+ +N RPH+AL PGS Sbjct: 239 PGKPTENGRHERMHRTLKDALKRHTKFTSLEEQQAWLDAWRSEFNDIRPHKALGGKTPGS 298 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDG 357 + PS R ++G + +V + G L + +A RGE + +K+++ED Sbjct: 299 VWYPSERIFTGPLKAMPVPDDARTLRVSVKGDLCFNSTRIFLSEALRGEWIWMKQVEEDL 358 Query: 358 SYEVWWYSTKVGVIDLKKKSITMG 381 E+ + + D + I Sbjct: 359 D-EIGFGELILARYDRRNHRIIRA 381 >UniRef50_A9BRN3 Integrase catalytic region n=7 Tax=Proteobacteria RepID=A9BRN3_DELAS Length = 395 Score = 299 bits (767), Expect = 8e-80, Method: Composition-based stats. Identities = 111/378 (29%), Positives = 178/378 (47%), Gaps = 11/378 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M + F+ + GA + LCRR+GIS T YKW++R+ Q G GLQ+R R Sbjct: 1 MPWKECAPMDEKLLFIADHLRGGAPLSELCRRYGISRKTAYKWVERYRQLGMDGLQERSR 60 Query: 65 IPHHSPNRSSDDITALLRMAH-DRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHG 121 PH + S + + + G +K+ L + P+ +T++N++ G Sbjct: 61 RPHGNNQAISYAQRRAIIELRTQQRSQMGPKKLHALLLQRWGPQETPSKTTIYNVLKAEG 120 Query: 122 LLPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCL 176 L+ + + PN +W D+KG F G C+PLT++D SR+ L + Sbjct: 121 LVCSRRVRRRSVPTAQPLRTSKQPNGVWSADYKGQFKTADGHWCYPLTIMDHASRYLLAV 180 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW-GDTTGTWTALELWLMRLGIRVGH 235 E ++ VF +YGLP+R+ DNG P+ + L +W +RLGIR Sbjct: 181 HVYDSPNYEDAKRSFEQVFRQYGLPERIRSDNGPPFATTGVAGLSRLAIWWIRLGIRPER 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 PQ G+ ER HR+LK L + AD LQ D + YN +RPHEAL ++P Sbjct: 241 IERGKPQQNGRHERMHRTLK-HALGKEPAADKAALQMQLDAFVEHYNQQRPHEALQQSMP 299 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y SAR Y +Y + +V +G + + + + G G+ +G++E+ Sbjct: 300 AQHYSDSARPYPSKLPELQYPKHWERVRVSHNGLIYWRALRVYIGYLLAGQWIGMQEVAA 359 Query: 356 DGSYEVWWYSTKVGVIDL 373 G ++V+ ++G + Sbjct: 360 -GQWDVYLGPVRLGCFNE 376 >UniRef50_A9ER25 Transposase n=4 Tax=Sorangium cellulosum 'So ce 56' RepID=A9ER25_SORC5 Length = 387 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 128/387 (33%), Positives = 189/387 (48%), Gaps = 11/387 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW ++ R F+ ++ LCRRFGIS TGYKW++R+ Q G +GL++R Sbjct: 1 MPWKETCSVDERLRFIAQVNESDETFAELCRRFGISRKTGYKWVERYEQAGPSGLEERRP 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHGLL 123 + H P+ + + L WG +K++ LE G +PA ST+ L+ +HGL+ Sbjct: 61 VAHTFPHATPTVLVDALIELRKERPTWGPKKLRARLESLGLEGLPAASTIGELLKKHGLI 120 Query: 124 PGASPGIP------ATGRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCL 176 + + + PN W DFKGHF G RCHPLTL D SR+ L Sbjct: 121 RPRRRRVVTPTTAMPSPLAPAEQPNDTWCADFKGHFALGDRTRCHPLTLTDQASRYLLKC 180 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGH 235 +V+ F +GLP R+ DNG P+ G +AL + ++LGI Sbjct: 181 EGVAKPHEASVRPHFERAFREFGLPHRIRSDNGPPFATIGIGGLSALSVSWIKLGIHPER 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P PQ G+ ER H++LKAE A+ QR FD +R YN +RPHEAL P Sbjct: 241 IEPGKPQQNGRHERMHKTLKAEATSPPE-ANLAAQQRVFDRFRHEYNDQRPHEALGQRTP 299 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 SRY PS R + PEY + + VR++D G++ G + GE VGL + + Sbjct: 300 ASRYTPSRRSMPSKPSSPEYPDTMAVRRLDEQGRMLFGGAQTNVSTLLAGEPVGLTPIAD 359 Query: 356 DGSYEVWWYSTKVGVIDLKKKSITMGK 382 D +E+++ + + LK K + + + Sbjct: 360 D-VWELYYGPVLLAQVTLKNKELKLAR 385 >UniRef50_B8GMF2 Integrase catalytic region n=2 Tax=Gammaproteobacteria RepID=B8GMF2_THISH Length = 391 Score = 295 bits (756), Expect = 2e-78, Method: Composition-based stats. Identities = 130/387 (33%), Positives = 181/387 (46%), Gaps = 11/387 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R +F+ + +LCR FGIS TG KW++R A G GL++ R Sbjct: 1 MPWKETCAMDQRVQFIGAWLSGRYSKSALCRHFGISRPTGDKWIRRHALVGVDGLKESSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 PH+ PNR S+ + + A H+ WG +K+ WL + PA ST ++ R GL Sbjct: 61 APHNQPNRISEALCERIVQAKLAHQDWGPKKVLDWLRAREPEVVWPADSTGGEILRRAGL 120 Query: 123 LPGASPGIPATGRFEH----DAPNRLWQMDFKG-HFPFGGGRCHPLTLLDDHSRFSLCLA 177 + + N +W +DFKG + G RC+PLTL D SR+ L Sbjct: 121 VKPRRRRRVVPPHEAPFADCEQSNAVWAVDFKGDYRLGEGRRCYPLTLSDSFSRYLLLCR 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHS 236 V F YGLP + DNG+P+ G +AL W + LGI Sbjct: 181 GLARPSGAAVHPWFEWAFREYGLPQAIRSDNGAPFASRAVGGLSALSKWWIDLGIHPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP P G+ ER HRSLK G QR + +R YN ER HEAL PG Sbjct: 241 RPGRPDQNGRHERMHRSLKG--WLGTPAQGLEAEQRRLEAFRAEYNWERSHEALSRRTPG 298 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 S Y S R Y PP+YD+GV VR+V +G++ +G + + GE V L E D Sbjct: 299 SLYAASPRPYPPCIEPPDYDQGVEVRRVRNNGEIKWRGRLIYLSEVLIGEPVAL-EPAGD 357 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGKG 383 G +E+ + +G+++ + IT +G Sbjct: 358 GLWELRYRFHPLGLLNEQNDRITPARG 384 >UniRef50_B0T7X0 Integrase catalytic region n=15 Tax=Alphaproteobacteria RepID=B0T7X0_CAUSK Length = 400 Score = 293 bits (751), Expect = 7e-78, Method: Composition-based stats. Identities = 180/354 (50%), Positives = 221/354 (62%), Gaps = 5/354 (1%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R EFV A +GAN R LCRRFGISP GYKWL R G L DR R Sbjct: 1 MPWREVSVMEQRREFVRLARLEGANRRELCRRFGISPEVGYKWLARSKA-GDEALADRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP 124 PH+SP RS+ +I A + D H WGARKI WLED+G PA ST+H ++ RHG + Sbjct: 60 RPHNSPWRSAAEIEAAVLAVRDAHPAWGARKIGAWLEDRGVDPPAVSTIHAILRRHGRID 119 Query: 125 GASPGI-PATGRFEHDAPNRLWQMDFKGHFPFG-GGRCHPLTLLDDHSRFSLCLAHCTDE 182 A RFE PN+LWQMDFKG F G CHPLT++DDHSR S CL C D+ Sbjct: 120 DFPTSPGKAWRRFEKAEPNQLWQMDFKGWFRLSSGQPCHPLTIVDDHSRLSPCLKACADQ 179 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTG-TWTALELWLMRLGIRVGHSRPYHP 241 + +TV+ L + F RYGLP +DNG PWG+ +G WT LE+WL++LG+ V HSRPYHP Sbjct: 180 QGQTVRPHLEAAFRRYGLPLAFFVDNGPPWGEPSGERWTRLEVWLLKLGVDVLHSRPYHP 239 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 Q++GK+ERFHRSL AEVL + F ++QRAFD WR VYN ERPHEALD+ P +RYQP Sbjct: 240 QSRGKIERFHRSLAAEVLDLQRFDSFAQVQRAFDRWREVYNFERPHEALDLDCPANRYQP 299 Query: 302 SARQYSGNTTPPEYDEGVMVRKVD-ISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 S R + P YD G ++R V + KG KAF+GER+ L+ + Sbjct: 300 SPRAMPDHPPEPRYDSGEILRTVSTTKAYVRFKGRLWRVPKAFQGERLALRPPE 353 >UniRef50_C3K093 Putative integrase n=2 Tax=Pseudomonas fluorescens SBW25 RepID=C3K093_PSEFS Length = 382 Score = 293 bits (750), Expect = 9e-78, Method: Composition-based stats. Identities = 117/375 (31%), Positives = 175/375 (46%), Gaps = 13/375 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW+ M+ R + V L RRFG+S T KW+ R L + R Sbjct: 1 MPWNQESPMNQRIKLVADWLSGNFTKSQLARRFGVSRPTVDKWISRHN-GDLKSLAEVSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGL 122 PH+SPN++ D+I A + + H++WG +K+ L + P+ ST + R GL Sbjct: 60 RPHNSPNKTDDEILARVVAMKEAHDKWGPKKLIELLRIEDPSIDWPSPSTAGQWLDRLGL 119 Query: 123 LPGASPGIPATG----RFEHDAPNRLWQMDFKGHFPF-GGGRCHPLTLLDDHSRFSLCLA 177 + E + PN+ W D+KG F C PLT+ D SR L Sbjct: 120 VNKRRFKRRHGTSHIEMREANDPNKTWCADYKGQFKMLNAQMCFPLTVTDHASRLILACR 179 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGHS 236 + + V+Q +F+ YG+P+ + DNG P+ + L +W +RLGI + Sbjct: 180 AHPKIKTQPVKQTFERLFQEYGMPEVIRSDNGVPFASPGLARMSTLAVWWIRLGIYPERT 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 P P G+ ER HRSLK E+ G ++ E Q +H++ +N RPHEAL M PG Sbjct: 240 MPGRPAQNGRHERMHRSLKLELPLG---SNLVEQQLLLEHFKHEFNYVRPHEALGMKRPG 296 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 Y PS R Y G EY + VR V G + G + +A GER+GLKE ++D Sbjct: 297 DVYMPSTRLYPGCLPDVEYPAEMRVRSVRQDGSIKWNGKLVFVSEALSGERIGLKEAEDD 356 Query: 357 GSYEVWWYSTKVGVI 371 ++++ +G + Sbjct: 357 -VWDLYLCDYPLGRL 370 >UniRef50_Q01QQ4 Integrase, catalytic region n=9 Tax=Bacteria RepID=Q01QQ4_SOLUE Length = 395 Score = 292 bits (749), Expect = 1e-77, Method: Composition-based stats. Identities = 125/386 (32%), Positives = 188/386 (48%), Gaps = 12/386 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW R + ++G +I L +G+S T YKWL+R ++G GLQ + R Sbjct: 1 MPWQEIRVEEQRLLMIRDH-EEGMSISELAEVYGVSRKTVYKWLERHDEQGFLGLQAQSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL--EDQGHTMPAFSTVHNLMARHGL 122 PH SPN+ + ++ + A + WG K++ L +D PA ST+ ++ +GL Sbjct: 60 RPHRSPNQVTSEVEGAIIAARHKWG-WGPGKLRVKLFQQDSRVPWPAVSTIAAVLKANGL 118 Query: 123 LPGASPGIP----ATGRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLA 177 + D PN +W +D+KG F G G R PLT+ D SR+ L Sbjct: 119 VVSRRNRPRVPIQRPPYLAADGPNAVWNIDYKGWFRCGDGTRVDPLTISDGFSRYLLRCQ 178 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD-TTGTWTALELWLMRLGIRVGHS 236 H E + V+ F+ +GLP + DNG+P+ G + L +W ++LGI V S Sbjct: 179 HVEQTGYELTRAVFVATFQEFGLPGAIHSDNGTPFASVAPGGLSRLSIWFVKLGIVVERS 238 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP PQ G+ ER HR+LKA + A Q+AF ++ YN ERPHEALD P Sbjct: 239 RPACPQDNGRHERMHRTLKAATAKPPQ-ATVRLQQQAFHAFQREYNEERPHEALDNKTPH 297 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 S YQ SAR Y EY + + R + G L KGV + F E +G+K + E Sbjct: 298 SCYQASARSYPRRVPELEYGDDMETRVISQQGSLKWKGVRTFISEVFAYETLGIKVIDER 357 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGK 382 E+++ ++G +D +++ + K Sbjct: 358 W-VELYFGPIRLGWLDGYRQTFSRRK 382 >UniRef50_B4UMG9 Integrase catalytic region n=3 Tax=Bacteria RepID=B4UMG9_ANASK Length = 407 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 125/378 (33%), Positives = 183/378 (48%), Gaps = 13/378 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW MS + EFV A GAN+ +LCR FGIS T +KWL+R+ +G GL ++ R Sbjct: 1 MPWKELRPMSQKLEFVEKAIVPGANVSALCRDFGISRQTAHKWLRRYRDQGYLGLVEKSR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL-EDQGHTMPAFSTVHNLMARHGLL 123 P SP +++D+ + +H WG +KI L G P+ +TV ++ R G + Sbjct: 61 RPASSPLATAEDVVVSIIELRSKHASWGPQKIAGVLARRLGPEAPSPTTVARVLRRLGKV 120 Query: 124 PGASPGIPA-----TGRFEHDAPNRLWQMDFKGHFP-FGGGRCHPLTLLDDHSRFSLCLA 177 P R E A N LW +DFKG + G +C PLT+ D SR L +A Sbjct: 121 KRRRPAARIWSVDGRPRIEVKASNDLWTIDFKGWWRALNGDKCEPLTVRDAFSRRVLAVA 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW-GDT-TGTWTALELWLMRLGIRVGH 235 V++ L +F ++GLP + DNGSP+ G T L WL+ LGIR+ Sbjct: 181 LVPATTAAHVRRVLELLFRKHGLPSAIQSDNGSPFICSRSRGGLTVLSAWLVSLGIRIVR 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SRP HPQ G ER HR L LQ QR D W +N RPH+AL P Sbjct: 241 SRPGHPQDNGGHERMHRDLSE--LQLSPARSRRAQQRQCDRWMLDFNHVRPHDALGGKTP 298 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y+ S R+ S + P Y + R+ + +G + + G + A + +GL++ E Sbjct: 299 AELYRNSTRR-SLSPLLPTYPPEWLTRRANKAGYVRINGDQVFVATALARQLIGLRQESE 357 Query: 356 DGSYEVWWYSTKVGVIDL 373 + ++ +G+I++ Sbjct: 358 -LRWSARFFDVDLGMIEI 374 >UniRef50_D2LAL9 Integrase catalytic region n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2LAL9_9DELT Length = 390 Score = 284 bits (726), Expect = 5e-75, Method: Composition-based stats. Identities = 129/390 (33%), Positives = 188/390 (48%), Gaps = 16/390 (4%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + M R F++ SQ + +LCR++GIS TGYKWL+R+ GL +R R Sbjct: 1 MPWKKVNPMEERARFIVELSQRRESFAALCRKYGISRETGYKWLRRYQAG--EGLGERSR 58 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT--MPAFSTVHNLMARHGL 122 + H P+++ D + LL + WG +K+ R L+D PA ST +++ RHGL Sbjct: 59 VARHCPHKTPDAVVTLLLALRQENPYWGPKKLVRLLQDVHGIEYPPAKSTAGDILKRHGL 118 Query: 123 LPGASPGIPATG-------RFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSL 174 + +G + N +W D+KG F CHPLT+ D SR+ L Sbjct: 119 ITATKAKRRQSGGRLRREDLRQPKQANDVWSADYKGWFRLEDRSICHPLTISDIFSRYVL 178 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRV 233 + E ++ + VF RYGLP + +DNG+P+G T T L +W ++LGI V Sbjct: 179 GCYVFPTQTLERTKEAMRRVFMRYGLPRAIRVDNGTPFGSTGIAGLTGLSVWWLQLGIVV 238 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMA 293 P P+ G ER HR+LK E A+ E Q + WR +N RPHEALD A Sbjct: 239 DFIAPGKPEQNGCHERMHRTLKLEATIPPS-ANLREQQERLESWRERFNSHRPHEALDQA 297 Query: 294 VPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEM 353 P S Y+PS+R+ N EY R V G + +G + G+AF R+GL Sbjct: 298 TPASIYRPSSRRLPRNEPCFEYPSSFESRTVRRDGMFNWEGRQIFLGEAFAKCRIGLTRN 357 Query: 354 QEDGSYEVWWYSTKVGVIDLK-KKSITMGK 382 +D + V+ +G K K + + Sbjct: 358 YDD-RWLVYLGEHLLGGFCPKDPKRVVPVR 386 >UniRef50_UPI00019025E5 ISHne2, transposase n=1 Tax=Rhizobium etli Brasil 5 RepID=UPI00019025E5 Length = 392 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. Identities = 118/368 (32%), Positives = 167/368 (45%), Gaps = 12/368 (3%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 M R + ++ LCRR+G+S T Y W +R DR P+R Sbjct: 1 MEERVRMLSDYVSGHWSVSDLCRRYGVSRETFYSWRKRQMSGADDWFVDRSHGTVSCPHR 60 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGLLPGASPGI 130 + + + R G RK+ L+ Q PA ST+ +++ R GL+ A Sbjct: 61 TPAALVDQVIALRQRFPHMGPRKLLALLQRQSAQTPWPAASTIGDILKRAGLVEVAKRRR 120 Query: 131 PA----TGRFEHDAPNRLWQMDFKGHFPFGG-GRCHPLTLLDDHSRFSLCLAHCTDERRE 185 A E N W +DFKG F R PLT+ D +SRF + + + E Sbjct: 121 RALDQSRPFTEATQANDEWSVDFKGWFRTRDQQRIDPLTISDSYSRFLIDVRIAP-QTIE 179 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDT-TGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 V+ F +GLP + DNGSP+G G T L W ++LGI P PQ Sbjct: 180 GVRPVFEEAFRTHGLPFAIRCDNGSPFGSHGAGGLTRLSTWWIKLGIEAHFIAPASPQEN 239 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP-SA 303 G+ ER HR+LKA+ + ++G+ Q FD +R YN ERPHEAL P Y+P Sbjct: 240 GRHERMHRTLKAQTSKP-PADNAGQQQVRFDAFRQHYNEERPHEALGQRPPADLYRPCQP 298 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWW 363 R P YD VR+V SG++ KG L +A GE VGL E+ E+G + V + Sbjct: 299 RAMPERLDDPWYDADHQVRRVRDSGEIKWKGGRLFVSEALAGELVGLSEL-ENGDHVVRF 357 Query: 364 YSTKVGVI 371 + +G+I Sbjct: 358 CNRDIGLI 365 >UniRef50_D2ML31 Integrase, catalytic region (Fragment) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2ML31_9BACT Length = 411 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 128/386 (33%), Positives = 174/386 (45%), Gaps = 11/386 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R FV +LC +GIS TG KWL R+ +G AGL D R Sbjct: 1 MPWKEIKIMDQREHFVSDYLTGDYPKGALCELYGISRPTGDKWLARYHAQGVAGLADLAR 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLE--DQGHTMPAFSTVHNLMARHGL 122 PH P+++ + + RH +G +KI+ L P STV ++ R GL Sbjct: 61 RPHTQPHQTPAAVIEAILTMKHRHPSFGPKKIRDRLRAVAPEEAWPVESTVGVILKRAGL 120 Query: 123 LPGASPGIPATGRFEH----DAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLA 177 + + AP W DFKG FP G G RC+PLT++D SR+ L Sbjct: 121 VRPRRVRRRVPADPQRLSRGTAPAPTWSADFKGDFPLGTGPRCYPLTVMDHASRYLLRGE 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHS 236 R VQ + VF YGLP + DNG P+ T G + L W +RLG+R Sbjct: 181 GLLQPTRAAVQPWVAWVFHEYGLPATIRTDNGPPFASTALGGLSRLAAWWVRLGLRPERI 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP P G ER HR+LKA V G A QR F + YN R HEA+ PG Sbjct: 241 RPGTPSENGCHERMHRALKAAV--GPPAATLAAQQRRFAAFVDEYNWARSHEAVARQPPG 298 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 YQPS R Y P EY G +VR+V +G + +G + E VG ++ E Sbjct: 299 QVYQPSPRAYPAKLPPIEYAPGTLVRQVRQNGAVRWRGHGRYLSEVLAPEPVGFTQIGER 358 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGK 382 ++ + + + G +D + +I + Sbjct: 359 -TWAIHYRFHRRGTLDDRTLTIIPVR 383 >UniRef50_UPI00017F3A47 integrase catalytic subunit n=1 Tax=Escherichia coli O157:H7 str. EC4024 RepID=UPI00017F3A47 Length = 365 Score = 275 bits (705), Expect = 1e-72, Method: Composition-based stats. Identities = 134/366 (36%), Positives = 180/366 (49%), Gaps = 10/366 (2%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW M R +F+ + +LCR FGIS TGYKWLQR+ + L DR R Sbjct: 1 MPWTETRPM-QRLDFIRACHAGTDSFSALCRLFGISRKTGYKWLQRFDPSDLSSLSDRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG--HTMPAFSTVHNLMARHGL 122 PH DDI A L +H WG +K++ WL + T+PA ST+ +++ R GL Sbjct: 60 APHSHSRTVPDDIAAQLTALRQKHPDWGPKKLRMWLLNHHADFTVPAASTIGDILKREGL 119 Query: 123 LPGASPGIPATGRFEHDAP----NRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLA 177 +P G + N++W DFKG F CHP TL D+HSR+ L Sbjct: 120 VPDKKRKRRTPGNRQPLTTISENNQVWSADFKGKFRLLSREYCHPFTLTDNHSRYLLSCR 179 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHS 236 E V+Q L F YGLP+ + DNG P+ T + L +WL+RLGIR Sbjct: 180 GTDRESEPFVRQCLTDAFLEYGLPEVLRTDNGQPFAGTGIAGLSRLAVWLIRLGIRPERI 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 R HP+ G+ ER HRSLK+ V G F E QR F +R +N ERPHE+L A PG Sbjct: 240 RKGHPEENGRHERMHRSLKSAVSHGNTFMTMEEQQRWFSDYREEFNHERPHESLAGATPG 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSV-KGVSLSAGKAFRGERVGLKEMQE 355 +QPS RQ+ G Y EG V +V G L + K ++ +A E + L+E + Sbjct: 300 MVWQPSCRQWDGRVPDYAYPEGGTVYRVKSRGTLYMGKKGTVFLSEALTDEYIMLEERDD 359 Query: 356 DGSYEV 361 + Sbjct: 360 GLEAII 365 >UniRef50_A5ASD2 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5ASD2_VITVI Length = 1801 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 58/326 (17%), Positives = 118/326 (36%), Gaps = 49/326 (15%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRS--SDDITALLRMAHDRH--ERWGARKIKRW 99 Y + + G P+ + ++ +L H+ + ++K Sbjct: 1403 WYAHIANYLVTGEV--------PNQIIRKCVPEEEQQGILSHCHENACGGHFASQKTIMK 1454 Query: 100 LEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGH 152 + G T P+ ++M R T R + +W +DF G Sbjct: 1455 VLQSGFTWPSLFKDSHIMCR--SYDRCQRLGKLTRRNQMPMNPILIVDLFDVWGIDFMGP 1512 Query: 153 FPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW 212 FP G + L +D S++ + ++ R ++ ++F R+G+P + D G+ + Sbjct: 1513 FPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKKNIFSRFGVPKAIISDRGTHF 1572 Query: 213 GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG---E 269 + E L + G++ + PYHPQT G++E +R +K +++ + Sbjct: 1573 CN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMKVVITTRRDWSIK 1627 Query: 270 LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGK 329 L + +RT Y L M+ Y+ + EY ++K++ Sbjct: 1628 LHDSLWAYRTTYKTI-----LGMSS----YRLVYGKACHLPVEVEYKAWWAIKKLN---- 1674 Query: 330 LSVKGVSLSAGKAFRGERVGLKEMQE 355 + +A + L EM+E Sbjct: 1675 -------MDLIRAGAKRCLDLNEMEE 1693 >UniRef50_Q82H05 Putative IS481 family ISMav2-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82H05_STRAW Length = 589 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 104/388 (26%), Positives = 160/388 (41%), Gaps = 40/388 (10%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 + R V+ + G + + R+G+S + + W++++ Q G AGL DR P P+R Sbjct: 2 VEQRYHAVMEVAA-GVPVTQVAARYGVSRQSVHSWVRKYEQSGLAGLTDRSHRPASCPHR 60 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGH-TMPAFSTVHNLMARHGLLPG--ASPG 129 + ++ A++ RH WG R++ LE +G +P+ +TV+ ++ R+ L+ Sbjct: 61 IASEVEAVVCELRRRHPTWGPRRLVHELERRGLAPVPSRATVYRVLIRNSLIEPGVRRRR 120 Query: 130 IPATGRFEHDAPNRLWQMDFK-GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R+E A LWQMD G GG C +T +DDHSRF + V Sbjct: 121 RSDYRRWERSAAMELWQMDIVGGLLLADGGECKMVTGIDDHSRFMVIAKVVQRATARAVC 180 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIRVGHSRPYHPQTQ 244 R+G+P+ + DNG + + GI ++P P T Sbjct: 181 SAFGEALVRFGVPEEVLTDNGKQFTARFSPGKPGEAMFDRICRENGITHRLTKPRSPTTT 240 Query: 245 GKLERFHRSLKAEVLQGK-WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 GK+ERFH++L+ E+L + F D Q D W YN RPH+ LDMAVP SR+ P Sbjct: 241 GKIERFHQTLRRELLDQQDPFTDLATAQATVDAWLEEYNRMRPHQGLDMAVPASRFVPRP 300 Query: 304 RQYSGNTT-------------------PPEYDEGVMV-----------RKVDISGKLSVK 333 R P + R V SG LS++ Sbjct: 301 RAEQDALPVRLPARLDPVPAPASAEPEPATVPRAWPMTEGEVGAIEVDRVVPASGNLSLR 360 Query: 334 GVSLSAGKAFRGERVGLKEMQEDGSYEV 361 G + G A G V L+ + Sbjct: 361 GQQIWFGPALAGTTVTLRIDVNRLHVLI 388 >UniRef50_A5C4R5 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C4R5_VITVI Length = 1398 Score = 263 bits (672), Expect = 9e-69, Method: Composition-based stats. Identities = 57/329 (17%), Positives = 121/329 (36%), Gaps = 54/329 (16%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G T P+ ++M R Sbjct: 1024 EEEQQGILNHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDSHIMCR--SCDRCQRLGK 1081 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + + R Sbjct: 1082 LTKRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNVHR 1141 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1142 VVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFEALLAKYGVKHKLATPYHPQTS 1196 Query: 245 GKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + L + +RT Y L M+ Y+ Sbjct: 1197 GQVELANREIKNILMKVV-ITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YR 1246 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 + EY ++++++ + +A + L EM+E + Sbjct: 1247 LVYGKACHLPMEVEYKAWWVIKRLN-----------MDLIRAGAKRCLDLNEMEE-LRND 1294 Query: 361 VW------------WYSTKVGVIDLKKKS 377 + W+ + +L+K+ Sbjct: 1295 AYINSKVAKQRMKKWHDQLISNKELRKRQ 1323 >UniRef50_Q4JUH9 Transposase for IS3514b n=9 Tax=Corynebacterium RepID=Q4JUH9_CORJK Length = 407 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 93/365 (25%), Positives = 136/365 (37%), Gaps = 20/365 (5%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 + V + G + + +RF IS YK L ++ GA + + R PH P Sbjct: 4 PNRNLAIVKAVREQGEPVTKVAKRFRISRQRIYKILSQFDAGGADAIAPKSRAPHTHPQA 63 Query: 73 SSDDITALLRMAHDRHER----WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 + + + R G I L QG +P+ ST+ ++ GL+ Sbjct: 64 VPTSLRNQIIDMRKQLVRSGLDAGPETIAFHLHRQGLRVPSTSTIRRIITNAGLVTPQPQ 123 Query: 129 GIPATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 P + RFE PN WQ D G R L +DDHSR+ L + Sbjct: 124 KKPRSSFIRFEAAMPNECWQADITHLHLLDGTRLEVLDFIDDHSRYLLSITAAASFSGPA 183 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWT----ALELWLMRLGIRVGHSRPYHPQ 242 V +L + YG P DNG + A E L + I+ + RP HPQ Sbjct: 184 VAAELQRLIATYGPPASTLTDNGLVFTARLAGARGGRNAFEKTLNKYRIQQKNGRPGHPQ 243 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 TQGK+ERFH++LK + ELQR D + YN RPH AL P Y Sbjct: 244 TQGKIERFHQTLKKWIAAQSPAITLVELQRQLDTFADYYNTVRPHRALGRRTPHEVYTTG 303 Query: 303 ARQYSGNTTPPEYDEGVMVRK--VDISG--KLSVKGVSLSA--GKAFRGERVGLKEMQED 356 + + E VR V +G + G+ + GE + + Sbjct: 304 PKAEPNDKPEEE----WRVRNDVVTPNGKVTVRYASRLYQLGIGRKYTGETILMVITDNH 359 Query: 357 GSYEV 361 + + Sbjct: 360 VTTSL 364 >UniRef50_A5CBG5 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5CBG5_VITVI Length = 2329 Score = 261 bits (668), Expect = 3e-68, Method: Composition-based stats. Identities = 61/323 (18%), Positives = 120/323 (37%), Gaps = 46/323 (14%) Query: 45 YKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH--ERWGARKIKRWLED 102 Y W + + + A R +P D+ +L H+ + ++K + Sbjct: 2003 YYWEEPFLFKYCADQIXRKCVP-------EDEQQGILNHCHENACGGHFASQKTAMKVLQ 2055 Query: 103 QGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGHFPF 155 G T P+ ++M R T R + +W +DF G FP Sbjct: 2056 SGFTWPSXFKDAHIMCR--SCDRCQRLGKLTKRNQMPMNPILIVELFDVWGIDFMGPFPM 2113 Query: 156 GGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT 215 G + L +D S++ + ++ R ++ ++F R+G+P + D G+ + + Sbjct: 2114 SFGNSYILVGVDYVSKWVEAIPCRKNDHRVVLKFLKENIFSRFGVPKAIISDGGAHFCN- 2172 Query: 216 TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG---ELQR 272 E L + G++ + PYHPQT G++E +R +K +++ + L Sbjct: 2173 ----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKNILMKVVNASRKDWSIRLHD 2228 Query: 273 AFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSV 332 + +RT Y L M+ Y+ + EY ++K++ Sbjct: 2229 SLWAYRTXYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWXAIKKLN------- 2272 Query: 333 KGVSLSAGKAFRGERVGLKEMQE 355 + +A + L EM+E Sbjct: 2273 ----MDLIRAGAKRCLDLNEMEE 2291 >UniRef50_A5C1P8 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5C1P8_VITVI Length = 1601 Score = 259 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 52/294 (17%), Positives = 108/294 (36%), Gaps = 39/294 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+R + ++K + G + P+ + M R Sbjct: 1230 EEEQQGILSHCHERACGGHFTSQKTTMKVLQSGFSWPSLFKNAHTMCR--SCDRYQRLRK 1287 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1288 LTRRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHR 1347 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1348 VVLKFLKENIFSRFGVPKSIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTS 1402 Query: 245 GKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R + +++ + +L + ++T Y + P Y Sbjct: 1403 GQVELANREIMNILMKVMSTSRRDWSIKLHDSLWAYKTTYKT------IFGMSP---YHL 1453 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++KV+ + + + L EM+E Sbjct: 1454 VYGKACHLPVEVEYKAWWAIKKVN-----------MDLIRVGAKRCLDLNEMEE 1496 >UniRef50_B2HR82 Transposase for ISMyma05 n=5 Tax=Mycobacterium RepID=B2HR82_MYCMM Length = 518 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 103/396 (26%), Positives = 164/396 (41%), Gaps = 29/396 (7%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRPRIP 66 +R VL DGA + ++ RRFG+S T + WL+R+A+EGAA L+DR P Sbjct: 3 QELRMSEMRYRAVLEVL-DGAPVTAVARRFGVSRQTVHAWLRRYAEEGAALNLEDRSSRP 61 Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM-PAFSTVHNLMARHGLLPG 125 H P++ ++ A + D H RWG +I L+ + P S+V+ + R+G + Sbjct: 62 HRCPHQMPVEVEARVLTLRDAHPRWGPTRIVYELQRDVVPVVPGRSSVYRALVRNGRIDP 121 Query: 126 ASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLAHCTDE 182 A + R+E P LWQMD G G +T +DD+SRF + Sbjct: 122 AKRRRRRSDYKRWERGRPMELWQMDVVGGLHLRDGIEVKVVTGIDDNSRFVVSAKVVARA 181 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA-----LELWLMRLGIRVGHSR 237 V L+ R+G+P+++ DNG + G + + + GI + Sbjct: 182 TARPVCAALLEALRRHGVPEQILTDNGKVFTGRFGPGGSSAEVLFDRVCVENGIGHLLTA 241 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGK--WFADSGELQRAFDHWRTVYNLERPHEALDM--- 292 P P T GK+ER H++++AE+ F ELQ A D W YN RPH++L M Sbjct: 242 PRSPTTTGKVERLHKTMRAEIFAEVDGVFDAIAELQAAIDRWVQYYNTARPHQSLGMVAP 301 Query: 293 -----------AVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGK 341 + + + + R V+ G +SV G Sbjct: 302 AARFALAAGPDLSVVEPVAAVPAEGQHPGALVDLRDAGVRRWVNRHGSISVAGFRYRVPI 361 Query: 342 AFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKS 377 GE V + + D ++ + V +K+ Sbjct: 362 VLAGEPVSV--VVADNLVSIYHHDVLVASHVQHRKT 395 >UniRef50_A5BYC4 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5BYC4_VITVI Length = 1855 Score = 256 bits (655), Expect = 7e-67, Method: Composition-based stats. Identities = 53/320 (16%), Positives = 119/320 (37%), Gaps = 34/320 (10%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G T P+ ++M R Sbjct: 1488 EEEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDSHIMCR--SCDRCQRLGK 1545 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +BF FP + L +D S++ + ++ R Sbjct: 1546 LTKRNQMPMNPILIVDIFXVWGIBFMRPFPMSFSNSYILVGVDYVSKWVEAIPCKHNDHR 1605 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1606 VVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVAIPYHPQTS 1660 Query: 245 GKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + L + +RT Y L M+ Y+ Sbjct: 1661 GQVELANREIKNILMKVV-ITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YR 1710 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 + EY ++++++ + V + K + + + KE++ Sbjct: 1711 LVYGKACHLPVEVEYKAWWAIKRLNMD-LIRVGAKRM---KKWHDQLISNKELRNGQRVL 1766 Query: 361 VWWYSTKVGVIDLKKKSITM 380 ++ + LK + I Sbjct: 1767 LYDSRLHIFPGKLKSRWIGP 1786 >UniRef50_A1T2L4 Integrase, catalytic region n=4 Tax=Actinomycetales RepID=A1T2L4_MYCVP Length = 597 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 96/312 (30%), Positives = 142/312 (45%), Gaps = 10/312 (3%) Query: 1 MESLMPWDARD-TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGL 59 M +P D + R V DG + + + G S + + W+ R+ G AGL Sbjct: 1 MTQQLPNSPNDWVIEHRYRAVRQVL-DGVSKSQVAQECGASRQSVHSWVIRYEALGVAGL 59 Query: 60 QDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG-HTMPAFSTVHNLMA 118 DR R P SPN S + A++ + RWGA++I L +G P+ S+V+ ++ Sbjct: 60 ADRSRRPLTSPNELSPAVVAMVCELRRTYPRWGAQRIAHELALRGVDAPPSRSSVYRILV 119 Query: 119 RHGLLPGASPGIPAT-GRFEHDAPNRLWQMDFK-GHFPFGGGRCHPLTLLDDHSRFSLCL 176 RHGL+ R++ DAP +LWQ+D G F G C +T +DDHSRF + Sbjct: 120 RHGLVAAQQQNHKRKYRRWQRDAPMQLWQIDIMGGVFLVDGRECKVVTGIDDHSRFVVMA 179 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIR 232 D V + YG+P + DNG + E GI Sbjct: 180 TVVADPGARAVCAAFTATMAIYGVPSEVLTDNGKQFTGRFTKPYPAEVLFERICRENGIT 239 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQ-GKWFADSGELQRAFDHWRTVYNLERPHEALD 291 ++P P T GK+ERFH++L+ E+L FA Q A D W YN RPH++L Sbjct: 240 TRLTKPRSPTTTGKIERFHKTLRRELLDSAGPFASIEVAQEAIDAWVHGYNHSRPHQSLG 299 Query: 292 MAVPGSRYQPSA 303 MA P + ++P+ Sbjct: 300 MATPATMFRPAP 311 >UniRef50_A5AMG6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AMG6_VITVI Length = 1704 Score = 254 bits (650), Expect = 4e-66, Method: Composition-based stats. Identities = 54/320 (16%), Positives = 116/320 (36%), Gaps = 34/320 (10%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++KI + G T P+ ++M R Sbjct: 1174 EQEQQGILNHCHENACEGHFASQKIAMKVLQSGFTWPSLFKDAHIMCR--SCDRCQRLGK 1231 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1232 LTKRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYDSKWVEAIPCKHNDHR 1291 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+ +P + D G+ + + E L + G++ + YHPQT Sbjct: 1292 VVLKFLKENIFLRFRVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATSYHPQTS 1346 Query: 245 GKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + L + +RT Y L M+ Y Sbjct: 1347 GQVELANREIKNILMKVV-ITSRKDWSIKLHDSLWAYRTTYKTI-----LGMSP----YH 1396 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 + EY ++++++ + + K + + + KE + Sbjct: 1397 LIYGKACHVPVEVEYKVWWAIKRLNMD-LIKAGAKRM---KRWHDQLISNKEFWKGQRVL 1452 Query: 361 VWWYSTKVGVIDLKKKSITM 380 ++ + LK + I Sbjct: 1453 LYDSRLHIFPGKLKSRWIGP 1472 >UniRef50_Q4JWW8 Transposase for IS3511a n=7 Tax=Corynebacterium RepID=Q4JWW8_CORJK Length = 405 Score = 253 bits (647), Expect = 7e-66, Method: Composition-based stats. Identities = 92/376 (24%), Positives = 144/376 (38%), Gaps = 16/376 (4%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 + G + FG++ +R+ + G L + + PH +P ++ D Sbjct: 9 IIETMLATGMTQAEAAQHFGVTTRWIRTLQKRYNEGGVEALTPKSKRPHTNPRATTPDTV 68 Query: 79 ALLRMAHDR----HERWGARKIKRWLEDQGHTM-PAFSTVHNLMARHGLLPGASPGIPAT 133 + + GA I+ LE + T PA +T+H ++ +G + P + Sbjct: 69 DRILQLRNELTNRGTDAGAHTIRWHLEQEDTTPLPATATIHRILKNNGHVTLQPQKRPRS 128 Query: 134 G--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 RF+ D PN WQMD+ G R LT+LDDHSR+ L V + Sbjct: 129 SWIRFQADQPNETWQMDYSDWTIAGHQRVVILTILDDHSRYVLRCQAFNSATVTHVIEAF 188 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTA----LELWLMRLGIRVGHSRPYHPQTQGKL 247 +G P DNG + + E L+ LGI + RPYHPQTQGK+ Sbjct: 189 AYTAAIHGYPQSTLTDNGRAFTTSNDRTNPARNGFEQLLLDLGIEQKNGRPYHPQTQGKV 248 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 ERFH +LK + D L D YN +RPH AL+ P Y + + Sbjct: 249 ERFHYTLKLALRNKPQARDIDNLNEQLDDIIDYYNNKRPHRALNRCTPAEAYNALPKAHP 308 Query: 308 GNTTPPEYDEGVMVRKVDISG--KLSVKG--VSLSAGKAFRGERVGLKEMQEDGSYEVWW 363 +D + KV +G L G + G+ + GE + + + ++ Sbjct: 309 -RPGAKTHDYRLRTDKVAKNGKTTLRWGGQLRRIYIGRRWTGEPITIMCVDNTADIKITA 367 Query: 364 YSTKVGVIDLKKKSIT 379 + L I Sbjct: 368 TGQHIAHYTLTPDKIY 383 >UniRef50_UPI0001B453C4 transposase for IS3514a n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B453C4 Length = 397 Score = 252 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 95/386 (24%), Positives = 155/386 (40%), Gaps = 17/386 (4%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 MS + +G + ++ R + +S ++ ++R+ EG A + R R PH +P Sbjct: 1 MSKAQLVITAVVLEGRSKSAVARDYEVSRYWVHQLVKRYEAEGPAAFEPRSRRPHTNPRA 60 Query: 73 SSDDITALLRMAH----DRHERWGARKIKRWLEDQGH--TMPAFSTVHNLMARHGLLPGA 126 + D+ + GA I L T+PA +T+ +++R G + Sbjct: 61 VAGDLEERIVRLRKTLLREGYDAGAATIAEHLARDPAVATVPALATIWRVLSRRGFITAQ 120 Query: 127 SPGIPA--TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 P RFE D PN+ WQ D L ++DDHSR ++ Sbjct: 121 PQKRPRSSWKRFEADLPNQCWQADVTHWQLADHTSAEILNIIDDHSRLAIASTAYRTVTA 180 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGD--TTGTWTALELWLMRLGIRVGHSRPYHPQ 242 V + + F +G P + DNG+ + G TAL++ L LGI +SRPYHPQ Sbjct: 181 PDVVEAFTAAFATWGTPAALLTDNGAVFTATPRRGGRTALQILLGELGITYINSRPYHPQ 240 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 T GK+ERFH++LK + ELQ + + YN RPH A+ P + Sbjct: 241 TCGKVERFHQTLKKRLTAVPPATTITELQSHLNEFVNYYNTVRPHRAVGRRTPHHAFTSR 300 Query: 303 ARQYS-GNTTPPEYDEGVMVRKVDISGKLSVKGV----SLSAGKAFRGERVGLKEMQEDG 357 + G PP + + ++D +G ++V+ + K RG V + D Sbjct: 301 PAAFPTGYHIPPHF--RLRHDRIDAAGVITVRYNSRLHHIGLSKHLRGTHVIVLINNRDI 358 Query: 358 SYEVWWYSTKVGVIDLKKKSITMGKG 383 + + L +G Sbjct: 359 RVLARDTGQLIRKLTLDPTRDYQPRG 384 >UniRef50_A5BTM1 Putative uncharacterized protein n=31 Tax=Vitis vinifera RepID=A5BTM1_VITVI Length = 2292 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. Identities = 51/295 (17%), Positives = 112/295 (37%), Gaps = 41/295 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G + P+ + M R Sbjct: 1392 EEEQQGILSHCHENACGGHFASKKTAMKVLQSGLSWPSLFKDAHTMCR--SCDRCQRLEK 1449 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1450 LIRRNQMPMNPILIVDLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVKAIPCKHNDHR 1509 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1510 VVLKFLKENIFSRFGVPKAIISDGGTHFCNR-----PFETLLAKYGVKHKVATPYHPQTS 1564 Query: 245 GKLERFHRSLKAEVLQG----KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E+ ++ +K +++ + + +L + +RT Y L M+ Y+ Sbjct: 1565 GQVEQANKGIKNILMKVVITSRKYWSI-KLHDSLWAYRTAYKTI-----LGMSP----YR 1614 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY +++++ + +A + L EM+E Sbjct: 1615 LVYGKACHLLVEVEYKAWWAIKRLN-----------MDLIRAGAKRCLDLNEMEE 1658 >UniRef50_A5CA04 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5CA04_VITVI Length = 2174 Score = 250 bits (640), Expect = 5e-65, Method: Composition-based stats. Identities = 62/326 (19%), Positives = 122/326 (37%), Gaps = 49/326 (15%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA--LLRMAHDRH--ERWGARKIKRW 99 Y + + G IP+ + ++ +L H+ + ++K Sbjct: 1339 WYAHIANYLVTGE--------IPNQIIRKCVLEVEQQGILSHCHENACGGHFASQKTAMK 1390 Query: 100 LEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGH 152 + G T P+ ++M R+ T R + +W +DF G Sbjct: 1391 VLQSGFTWPSLFKDAHIMCRN--CDRCQRLGKLTKRNQMPMNPILIVELFDVWGIDFMGP 1448 Query: 153 FPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW 212 FP G + L +D S++ + + ++ R ++ ++F R+G+P + D G+ + Sbjct: 1449 FPMSFGNSYILVGVDYVSKWVEAIPYKQNDHRVVLKFLKENIFSRFGVPKAIISDGGAHF 1508 Query: 213 GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG---KWFADSGE 269 + E L + G++ + PYHPQT G++E +R +K +++ S Sbjct: 1509 CN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKNILMKVVNSNRKDWSIR 1563 Query: 270 LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGK 329 L + +RT Y L M+ Y+ + EY +RK++ Sbjct: 1564 LHDSLWAYRTAYKTI-----LRMSP----YRLVYCKACHLPVEVEYKAWWAIRKLN---- 1610 Query: 330 LSVKGVSLSAGKAFRGERVGLKEMQE 355 ++ KA + L EM+E Sbjct: 1611 -------MNLIKAGEKRFLDLNEMEE 1629 >UniRef50_Q5ZTP2 Transposase (ISmav2) n=14 Tax=Proteobacteria RepID=Q5ZTP2_LEGPH Length = 341 Score = 250 bits (640), Expect = 5e-65, Method: Composition-based stats. Identities = 109/335 (32%), Positives = 157/335 (46%), Gaps = 11/335 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + + +FV DG + SLC+ FGIS TG+K R+ + G GL DR R Sbjct: 1 MPWQECTKVDEKIKFVARLL-DGEQMSSLCQEFGISRKTGHKIYNRYKESGLEGLNDRSR 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM--PAFSTVHNLMARHGL 122 PH N+ + + WGA KI+ + Q + PA ST+H ++ ++GL Sbjct: 60 KPHRYANQLPFQLEKEILKVKKEKPTWGAPKIREKILRQYPDVKSPAISTIHTILDKYGL 119 Query: 123 LPGASPGI---PATGRFEHDAPNRLWQMDFKGHF-PFGGGRCHPLTLLDDHSRFSLCLAH 178 + T PN LW D+KG F C+PLTL D +SR+ L Sbjct: 120 VTKRKRRRYKAEGTKLTNGKTPNELWCADYKGEFQLGSKEYCYPLTLTDFNSRYLLACEG 179 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT--GTWTALELWLMRLGIRVGHS 236 + + + VF+ YGLP+ + DNG P+ + L +W +RLGI + Sbjct: 180 LSTTKEQYAITVFERVFKEYGLPNAIRTDNGVPFSSVQALFGLSKLSVWWLRLGISIERI 239 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 RP +PQ G+ ER H +LK E + + + Q FD + YN ERPH+ALDM PG Sbjct: 240 RPGNPQENGRHERMHLTLKKETTKPSG-ENFLQQQEKFDRFIDEYNNERPHQALDMRYPG 298 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLS 331 Y PS ++Y G +Y + V G+L Sbjct: 299 EVYIPSNKEYKGLP-EVDYPFHDKMITVTHCGRLC 332 >UniRef50_A5C2R0 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5C2R0_VITVI Length = 2116 Score = 250 bits (638), Expect = 7e-65, Method: Composition-based stats. Identities = 56/295 (18%), Positives = 111/295 (37%), Gaps = 41/295 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 D+ +L H+ + ++K + G T P ++M R Sbjct: 1742 EDEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPFLFKDAHIMCR--SCDRCQRLGK 1799 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + ++ + Sbjct: 1800 LTKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHK 1859 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L R G++ + PYHPQT Sbjct: 1860 VVLKFLKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSRYGVKHKVATPYHPQTS 1914 Query: 245 GKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + + L + +RT Y L M+ Y+ Sbjct: 1915 GQVELANREIKNILMKVVN-SSRKDWSIRLHDSLWAYRTTYKTI-----LGMSP----YR 1964 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K+++ +A + L EM+E Sbjct: 1965 LVHGKACHLPVEVEYKAWRAIKKLNLD-----------LIRAGEKRYLDLNEMEE 2008 >UniRef50_A5AQ03 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5AQ03_VITVI Length = 1873 Score = 247 bits (631), Expect = 6e-64, Method: Composition-based stats. Identities = 52/291 (17%), Positives = 107/291 (36%), Gaps = 45/291 (15%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + + +L HD + ++K + G P+ + M + Sbjct: 1511 EQEQSGILSHCHDSACGGHFASQKTAMKVIQSGFWWPSLFKDAHSMCKG--CDRCQRLGK 1568 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF G FP G + L +D S++ + +++ + Sbjct: 1569 LTRRNMMPLNPILIVDIFDVWGIDFMGPFPMSFGHSYILVGVDYISKWVEAIPCRSNDHK 1628 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ +F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1629 VVLKFLKDHIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTS 1683 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 G++E +R +K +++ L + +RT Y L M+ Y+ Sbjct: 1684 GQVELANREIKNILMK---------LLDSLWAYRTAYKTI-----LGMSP----YRLVYG 1725 Query: 305 QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E++E Sbjct: 1726 KACHLPVEIEYKAWWAIKKLN-----------MDLIRAGLKRCLDLNELEE 1765 >UniRef50_A5BFN4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BFN4_VITVI Length = 1956 Score = 246 bits (629), Expect = 9e-64, Method: Composition-based stats. Identities = 56/347 (16%), Positives = 121/347 (34%), Gaps = 40/347 (11%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ 103 Y + + G P + S+ ++ + M K + Sbjct: 1152 WYAHIANYLVTGEV--------PRSVSLKKSNKGSSAIAMRMHVEATLPLMKAAMKVLQS 1203 Query: 104 GHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGHFPFG 156 G T P+ ++M R T R + +W +DF G FP Sbjct: 1204 GFTWPSLFKDSHIMCR--SCDRCQRLEKLTKRNQMPMNPILIVDIFYVWGIDFMGPFPMS 1261 Query: 157 GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT 216 G + L +D S++ + ++ R ++ ++F R+G+P + D G+ + + Sbjct: 1262 FGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKAIISDGGTHFCN-- 1319 Query: 217 GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGE----LQR 272 L + G++ + PYHPQT ++E +R +K +++ + L Sbjct: 1320 ---KPFVTLLAKYGVKHKVATPYHPQTSRQVELANREIKNILMKMV-ITSRKDWSIKLHD 1375 Query: 273 AFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSV 332 + +RT Y L M+ Y+ + EY ++++++ + Sbjct: 1376 SLWAYRTTYKTI-----LGMSP----YRLVYGKACHLPMEVEYKAWWAIKRLNMD-LIRA 1425 Query: 333 KGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKSIT 379 + K + + + KE ++ ++ + LK + I Sbjct: 1426 SAKRM---KRWHDQLISNKEFRKGQRVLLYDSRLHIFPGKLKSRWIG 1469 >UniRef50_UPI0001B45627 transposase for ISMyma05 n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B45627 Length = 519 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 108/396 (27%), Positives = 169/396 (42%), Gaps = 30/396 (7%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAA-GLQDRPRIP 66 +R VL CR +G+S T + WL+R+A+EGA L+DR P Sbjct: 3 RELRVSEMRYRAVLEVLDGAVISTVACR-YGVSRQTVHAWLRRYAREGAVLNLEDRSSRP 61 Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGH-TMPAFSTVHNLMARHGLLPG 125 H P++ + ++ A + + D H RWG +I L +G +P S+V+ + R+G + Sbjct: 62 HGCPHQMAAELEARVLVLRDAHPRWGPTRIVYELVREGVVAVPGRSSVYRALVRNGRIDP 121 Query: 126 ASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGGR-CHPLTLLDDHSRFSLCLAHCTDE 182 A R+E P LWQMD G G +T +DD+SRF +C A Sbjct: 122 ARRRRRRADYKRWERGRPMELWQMDVVGGVHLCDGVEVKVITGIDDNSRFVVCAAVVARA 181 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA-----LELWLMRLGIRVGHSR 237 V + L++ R+G+P+++ DNG + G + + GIR + Sbjct: 182 TARPVCEALLAALARHGVPEQILTDNGKVFTGRFGPGGSSSEALFDRVCAENGIRHLLTA 241 Query: 238 PYHPQTQGKLERFHRSLKAEVL--QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 P P T GK+ER H++++AE FA ELQ A D W YN ERPH++L M P Sbjct: 242 PRSPTTTGKVERLHKTMRAEFFTDADGRFATIAELQAALDGWVGQYNTERPHQSLGMRPP 301 Query: 296 GSRYQPSAR---------------QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAG 340 R+ +A T P+ + R VD G++ + G Sbjct: 302 AERFALAAAGPDPAVVDPIAAVAVHQQSPTRRPDLRHAGVQRWVDQRGRIRLAGFGYRVP 361 Query: 341 KAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKK 376 GE V + D +++ + V ++K Sbjct: 362 IVLAGEPVEA--VVADNLVQIYHHDVLVASHVQRRK 395 >UniRef50_A5AKZ0 Putative uncharacterized protein n=18 Tax=Vitis vinifera RepID=A5AKZ0_VITVI Length = 2140 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 51/285 (17%), Positives = 105/285 (36%), Gaps = 37/285 (12%) Query: 81 LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD- 139 + M + ++K + G P+ + M + T R Sbjct: 1301 IAMIXHVEGHFASQKTAMKVIQSGFWWPSLFKDAHSMCKG--CDRCQRLGKLTRRNMMPL 1358 Query: 140 ------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 +W +DF G FP G + L +D S++ + +++ + ++ + Sbjct: 1359 NPILIVDIFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLKFLKDN 1418 Query: 194 VFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 +F R+G+P + D G+ + + E L + G++ + PYHPQT G++E +R Sbjct: 1419 IFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANRE 1473 Query: 254 LKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNT 310 +K ++ S +L + +RT Y L M+ Y+ + Sbjct: 1474 IKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLP 1524 Query: 311 TPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + +A + L E++E Sbjct: 1525 VEIEYKAWWAIKKLN-----------MDLIRAGLKRCLDLNELEE 1558 >UniRef50_A5B5G8 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5G8_VITVI Length = 1856 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 51/293 (17%), Positives = 109/293 (37%), Gaps = 35/293 (11%) Query: 73 SSDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + + +L HD + ++K + G P+ + M + + Sbjct: 1342 LEQEKSGILSHCHDSACGGHFASQKTAMRVVQSGFWWPSLFKDAHSMCKGCDRCQRQGKL 1401 Query: 131 PATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 + +W +DF G FP G + L +D S++ + +++ + Sbjct: 1402 TRQNMMPLNPILIVDVFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKV 1461 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 1462 VLKFLKENIFSRFGVPKAIINDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSG 1516 Query: 246 KLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K ++ S +L + +RT Y L M+ Y+ Sbjct: 1517 QVELANRKIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLV 1567 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E++E Sbjct: 1568 YGKACHLPVEIEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1609 >UniRef50_A5B9R1 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5B9R1_VITVI Length = 2171 Score = 244 bits (622), Expect = 6e-63, Method: Composition-based stats. Identities = 55/293 (18%), Positives = 108/293 (36%), Gaps = 39/293 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 D+ +L H+ + ++K + G T P+ +M R Sbjct: 1353 DEQQGILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDAXIMCR--SCDRCQRLGKL 1410 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1411 TKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVXAIPCXXNDHRV 1470 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 1471 VLKFLKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSG 1525 Query: 246 KLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K +++ L + + T Y L M+ Y+ Sbjct: 1526 QVELANREIKNILMKVVNSXRKDXSIRLHDSLWAYXTAYKTI-----LGMS----XYRLV 1576 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L EM+E Sbjct: 1577 YGKAXHLPVEVEYKAWWAIKKLN-----------MDLIRAGEKRYLDLNEMEE 1618 >UniRef50_A5BI69 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BI69_VITVI Length = 1628 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 53/293 (18%), Positives = 109/293 (37%), Gaps = 39/293 (13%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + +L HD + ++K + G P+ + M + Sbjct: 989 PKQSGILSHCHDSACGGHFASQKTAMKVIQSGFWWPSLFKDAHTMCKG--CDRCQRLGKL 1046 Query: 133 TGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R +W +DF G FP G + L +D S++ + +++ + Sbjct: 1047 TRRNMMPLNPILIVDVFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEVIPCRSNDHKV 1106 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G Sbjct: 1107 VLKFLKENIFARFGVPKSIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSG 1161 Query: 246 KLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K ++ S +L + +RT Y L+M+ Y+ Sbjct: 1162 QVELANREIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LEMSP----YRLV 1212 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E++E Sbjct: 1213 YGKACHLPVEVEYKAWWAIKKLN-----------MDLTRARLKRCLDLNELEE 1254 >UniRef50_A5BJN2 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5BJN2_VITVI Length = 1380 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 57/335 (17%), Positives = 118/335 (35%), Gaps = 54/335 (16%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD-----------ITALLRMAHDRHERWG 92 Y + + G + + + H + + ++R E+ G Sbjct: 805 WYAHIANYLVTGEVPSEWKAQDKKHFFAKIHAYYWEEPFLFKYCVDQIIRKCVPEEEQQG 864 Query: 93 ARKIKRWLEDQGH-TMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRL 144 I + T P+ ++M R T R + + Sbjct: 865 ---ILSHCHENAWFTWPSLFKDSHIMCR--SCDRCQRLGKLTKRNQMPMNPILIVDLFDV 919 Query: 145 WQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRM 204 W +DF G FP G + L +D S++ + + ++ R ++ ++F R+G+P + Sbjct: 920 WGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPYKHNDHRVVLKFLKENIFSRFGVPKAI 979 Query: 205 TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWF 264 D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 980 ISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKNILMKVV-I 1033 Query: 265 ADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVM 320 + L + +RT Y L M+ Y+ + EY Sbjct: 1034 TSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWWA 1084 Query: 321 VRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +++++ + +A + L EM+E Sbjct: 1085 IKRLN-----------MDLIRARAKRCLDLNEMEE 1108 >UniRef50_A5BWF3 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5BWF3_VITVI Length = 1924 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 50/295 (16%), Positives = 108/295 (36%), Gaps = 41/295 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G T P+ ++M R Sbjct: 1550 EEEQQEILSHCHENACGGHFASQKTAMKVLQSGFTWPSLFKDSHIMCR--SCDRCERLGK 1607 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF FP G + L + S++ + + ++ R Sbjct: 1608 LTKRNQMPMNPILIVDLFDVWGIDFMRPFPMSFGNSYILVGVGYVSKWVEAIPYKHNDHR 1667 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1668 VVLKFLKDNIFSRFGVPKSIINDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTS 1722 Query: 245 GKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + L + +RT Y L M+ + Sbjct: 1723 GQVELANREIKNILMKVV-ITSRKDWSIKLHDSLWAYRTTYKTI-----LGMSPC----R 1772 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY +++++ + + + L EM+E Sbjct: 1773 LVYGKACHLPMEVEYKAWWAIKRLN-----------MDLIRVGEKRCLDLNEMEE 1816 >UniRef50_A5BSN7 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5BSN7_VITVI Length = 2019 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 54/294 (18%), Positives = 110/294 (37%), Gaps = 39/294 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 D+ +L H+ ++ ++K + G T P+ ++M R Sbjct: 1355 EDEQQGILSHCHENACGGQFASQKTTMKVLQSGFTWPSLFKDAHIMCR--SCDRCQRLGK 1412 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + ++ R Sbjct: 1413 LTKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHR 1472 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + G+ + + E L + ++ + PYHPQT Sbjct: 1473 VVLKFLKENIFSRFGVPKAIISGGGAHFCN-----KPFEALLSKYRVKHKVATPYHPQTS 1527 Query: 245 GKLERFHRSLKAEVLQGK---WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 ++E +R +K +++ S +L + +RT Y L M+ Y+ Sbjct: 1528 RQVELANREIKNILMKVVNSSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRL 1578 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L EM+E Sbjct: 1579 VYGKACHLPVEVEYKAWWAIKKLN-----------MDLIRAKEKRYLDLNEMEE 1621 >UniRef50_A5BJ10 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BJ10_VITVI Length = 1362 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 113/322 (35%), Gaps = 42/322 (13%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH--ERWGARKIKRWLE 101 Y W + + + A R +P + +L H+ + ++K + Sbjct: 853 VYYWEEPFLFKYCADQIIRKCVPK-------QEQQGILSHCHESACGGHFASQKTTMKVL 905 Query: 102 DQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP-----NRLWQMDFKGHFPFG 156 G P+ + M R + + + +W +DF FP Sbjct: 906 QSGFNWPSLFKDAHTMCRSCDRCQRLGKLTRKNQMPMNPILIIDLFNVWGIDFVRPFPMS 965 Query: 157 GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT 216 + L +D S++ ++ ++ R + ++F R+G+P + D G+ + + Sbjct: 966 FDNSYILVGVDYVSKWVEAISCKHNDHRIVLMFFKENIFSRFGVPKAIISDGGTHFCN-- 1023 Query: 217 GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG---ELQRA 273 E L G++ + PYHPQT ++E +R +K +++ + +L + Sbjct: 1024 ---KPFETLLANYGVKHKVATPYHPQTSRQVELANREIKNILMKVVNTSRRDWSVKLYDS 1080 Query: 274 FDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVK 333 +RT Y L M Y+ + EY ++KV+ Sbjct: 1081 LWAYRTTYKTI-----LGMFP----YRLVYGKACHLLVEVEYKAWWAIKKVN-------- 1123 Query: 334 GVSLSAGKAFRGERVGLKEMQE 355 + KA + L +M+E Sbjct: 1124 ---MDLNKAGMKRCLDLNDMKE 1142 >UniRef50_C8XFB0 Integrase catalytic region n=4 Tax=Actinomycetales RepID=C8XFB0_NAKMY Length = 607 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. Identities = 94/329 (28%), Positives = 146/329 (44%), Gaps = 17/329 (5%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 M + R + V GA + + G+S + + WL+R+ EG GL DR Sbjct: 1 MALVVLSKVEQRLDAVRAVLA-GATVTEVAAAVGVSRVSVHAWLRRYLTEGVTGLADRSH 59 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 P P+++ D+++ + H RWGA++I+ L + G T+P+ +T++ ++ RHGL Sbjct: 60 RPRSCPHQAGDEVSVRVAELRRTHPRWGAKRIRMELLRKPAGLTVPSTATINRILIRHGL 119 Query: 123 LPGASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGG------GRCHPLTLLDDHSRFSL 174 + P + R+E P +LWQ+D G +T +DDHSRF + Sbjct: 120 VTPRRRKRPRSSYQRWERPGPMQLWQLDIVGDVWLVNPATGVLRGVKVVTGVDDHSRFCV 179 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTA--LELWLMRLGIR 232 A V L + R+G+P + DNG + G + GI Sbjct: 180 IAAVVERATGRAVCLALAAALARFGVPGEILTDNGKQFTARFGRGGEVLFDKICRHNGIT 239 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVL-QGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 ++P P T GK+ERFH +L+ E+L + F Q A D + VYN ERPH+ALD Sbjct: 240 HRLTQPASPTTTGKIERFHLTLRRELLDDHEPFESLAAAQAAVDEFVRVYNTERPHQALD 299 Query: 292 MAVPGSRYQPSARQYSGNTTPPEYDEGVM 320 P S P+ R N E + Sbjct: 300 GQRPVS---PADRFTPINPAEAELVPLWL 325 >UniRef50_A5B2X9 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5B2X9_VITVI Length = 1595 Score = 240 bits (613), Expect = 7e-62, Method: Composition-based stats. Identities = 54/290 (18%), Positives = 106/290 (36%), Gaps = 43/290 (14%) Query: 79 ALLRMAHDRHERWGARKIKRWLEDQ---GHTMPAFSTVHNLMARHGLLPGASPGIPATGR 135 ++R E+ G I D G P+ + M + T R Sbjct: 1228 QIIRKCVPEQEQSG---ILSHCHDSACGGFWWPSLFKDAHSMCK--RCDRCQRLGKLTRR 1282 Query: 136 FEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 +W +DF G FP G + L +D S++ + +++ + ++ Sbjct: 1283 NMMPLNPILIVDIFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVLK 1342 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++E Sbjct: 1343 FLKDNIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVE 1397 Query: 249 RFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQ 305 +R +K ++ S +L + +RT Y L M+ Y+ + Sbjct: 1398 LANREIKNILMKVVNVNRKDWSIKLLDSLWAYRTTYKTI-----LGMSP----YRLVYGK 1448 Query: 306 YSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + +A + L E++E Sbjct: 1449 ACHLPMEIEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1487 >UniRef50_A5BFS9 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5BFS9_VITVI Length = 2326 Score = 240 bits (613), Expect = 7e-62, Method: Composition-based stats. Identities = 52/336 (15%), Positives = 115/336 (34%), Gaps = 56/336 (16%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K W+ G P+ + M + Sbjct: 1690 EEEQQGILSHCHENACGGHFASQKTAMWVLQSGFYWPSLFKDAHTMCK--SCDRCQRLGK 1747 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF G FP G + L +D S++ + ++ R Sbjct: 1748 LTRRNMMPLNPILIVDLFYVWGIDFMGPFPMSFGYSYILVRVDYVSKWVEAIPCNHNDHR 1807 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + ++ + YHPQT Sbjct: 1808 VVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYRVKHKVATLYHPQTN 1862 Query: 245 GKLERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R +K +++ +L + ++T Y L+M+ Y+ Sbjct: 1863 GQVELANRKIKNILMKVVNTNRKDWPVKLLDSLWAYKTAYKTI-----LEMSP----YRL 1913 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE------ 355 + EY +++++ + + + L EM+E Sbjct: 1914 VYGKACHLLVELEYKAWWAIKQLN-----------MDLSRVGLKRFLDLNEMEELRNDAY 1962 Query: 356 -----------DGSYEVWWYSTKVGVIDLKKKSITM 380 ++ + + LK + I Sbjct: 1963 INSKIAKEKLKRQRILLYDSKLHIFLGKLKSRWIGP 1998 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 115/318 (36%), Gaps = 33/318 (10%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H + ++K + G P+ + +++ + Sbjct: 854 EQEKHGILSHCHGNACGGHFASQKTAMRVLQSGFWWPSLFKDAHEVSKGCDKCQRIGKLS 913 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +DF G FP G + L +D S++ + T++ + Sbjct: 914 RRNMMPLNPILIVDLFYVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVV 973 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + GI+ + PYHPQT G+ Sbjct: 974 LKFLKENIFSRFGVPKVIISDGGTHFCN-----KPFEALLAKYGIKHKVATPYHPQTSGQ 1028 Query: 247 LERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 +E +R +K +++ + + L + +RT Y L M+ Y+ Sbjct: 1029 VELANREIKNILMKVVN-TNRKDWSVNLLDSLWAYRTAYKTI-----LGMSP----YRLV 1078 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVW 362 + E+ ++K+++ K + + V KE + ++ Sbjct: 1079 YGKACHLPVEIEFKAWWAIKKLNMDLTK-------ENLKRWHDQLVTKKEFFKGQRVLLY 1131 Query: 363 WYSTKVGVIDLKKKSITM 380 + LK + + Sbjct: 1132 DSKLHLFPGKLKSRWVGP 1149 >UniRef50_B6J0H9 Transposase n=3 Tax=Coxiella burnetii RepID=B6J0H9_COXB2 Length = 317 Score = 240 bits (612), Expect = 8e-62, Method: Composition-based stats. Identities = 128/277 (46%), Positives = 167/277 (60%), Gaps = 3/277 (1%) Query: 102 DQGHTMPAFSTVHNLMARHGLLP-GASPGIPATGRFEHDAPNRLWQMDFKGHF-PFGGGR 159 +G+ MP TV+ ++ R+G + S RFEH+ PN LWQMDFKGHF R Sbjct: 37 KKGYIMPCIKTVNRILKRYGRITIEESLKRKKFIRFEHEHPNDLWQMDFKGHFRLTNKIR 96 Query: 160 CHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWG-DTTGT 218 CHPLTLLDD +R+SL + C DER ETV+Q L+ +F ++GLP RMTMDNG+PWG + Sbjct: 97 CHPLTLLDDCTRYSLGIIACGDERLETVKQALIDIFRKWGLPKRMTMDNGAPWGYSGSQN 156 Query: 219 WTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR 278 +T L +WL++ I V HSRPYHPQTQGKLERFHR+ K E L +F + Q+ FD WR Sbjct: 157 YTQLTVWLIQQTIYVSHSRPYHPQTQGKLERFHRTFKQEFLNRYYFDTLAQAQKVFDWWR 216 Query: 279 TVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLS 338 YN ERPH A++ P Y S R Y P EY + VRKV+ G +S KG Sbjct: 217 DFYNDERPHSAIEAYSPSEIYHRSERSYCEKIQPYEYATEMDVRKVNQKGIMSYKGRRYF 276 Query: 339 AGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKK 375 G+AF G+ +GL E+ V++ KV +DL + Sbjct: 277 VGEAFGGQAMGLMPSNENDIVNVYFCHQKVFKLDLNQ 313 >UniRef50_A5AIU0 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5AIU0_VITVI Length = 1753 Score = 240 bits (612), Expect = 9e-62, Method: Composition-based stats. Identities = 56/340 (16%), Positives = 118/340 (34%), Gaps = 48/340 (14%) Query: 73 SSDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 S ++ +L H+ + +K + G + P+ + M + Sbjct: 1126 SEEEQQXILSHCHESAYXGHFAXQKTXMKVLQSGFSWPSLFKDAHTM--CXSCDRSQRLR 1183 Query: 131 PATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 T R + +W +DF G FP G + L ++ S++ + ++ Sbjct: 1184 KLTXRNQMPMNPILIVDLFDVWDIDFMGPFPMSFGNSYILVGVNYVSKWVEAIPCKHNDH 1243 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 R ++ ++F R+G+P + D G+ + + E L + G++ + PYHP T Sbjct: 1244 RVVLKFLKENIFSRFGVPKAIISDEGTHFCN-----KPFETLLAKYGVKHKVATPYHPXT 1298 Query: 244 QGKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + + + +RT Y L M+ Y+ Sbjct: 1299 SGQVELANREIKNILMKVVNTSKRDWSVKFHDSLXAYRTAYKTI-----LGMSP----YR 1349 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAG-------------------- 340 + EY ++KV++ L Sbjct: 1350 LVYGKACHLXXEVEYKAWWTIKKVNMDLTRXXMKRCLDLNEMEELRNDAYNNSKVAKQRM 1409 Query: 341 KAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKSITM 380 K + + + KE Q+ ++ S + LK + I Sbjct: 1410 KRWHDQLISNKEFQKGQRVLLYDSSLHIFPGKLKSRWIGP 1449 >UniRef50_A5BYU9 Putative uncharacterized protein n=8 Tax=Vitis vinifera RepID=A5BYU9_VITVI Length = 2103 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 51/294 (17%), Positives = 105/294 (35%), Gaps = 39/294 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + + +L HD + ++K + G P+ + M + Sbjct: 1564 EQEQSGILSHCHDSACGGHFASQKTAMRVVQSGFLWPSLFKDAHSMCKG--CERCQRLGK 1621 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF G FP G + L +D S++ + +++ + Sbjct: 1622 LTRRNMMPLNPILIVDIFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHK 1681 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ +F R+G+P + D G+ + + E L + G++ + P HPQT Sbjct: 1682 VVLKFLKEDIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPDHPQTS 1736 Query: 245 GKLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R +K ++ S +L + +R Y L M+ Y Sbjct: 1737 GQVELANREIKNILMKVVNVNRKDWSIKLLDSLWAYRNAYKTI-----LGMSP----YHL 1787 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E++E Sbjct: 1788 VYGKACHLLVEVEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1830 >UniRef50_A8F1V3 Transposase and inactivated derivative n=4 Tax=Bacteria RepID=A8F1V3_RICM5 Length = 289 Score = 239 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 55/305 (18%), Positives = 106/305 (34%), Gaps = 42/305 (13%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 M +R ++ ++ +I ++C+ +S + Y+WL P + +R Sbjct: 1 MPIRYAWIKE-NEGNFSIAAMCKFMKVSRSGYYEWLN---------------NPGCNRDR 44 Query: 73 SSDDITALLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 +++T +++ +G R IK+ L Q + + + LM L+ Sbjct: 45 EDNELTNRIKIIFKEGRGNYGTRPIKKELSRQSIIV-SRRRIARLMKEASLICKTKRKFK 103 Query: 132 AT---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 AT +F N W D + P G + T++D SR + Sbjct: 104 ATTDSNHNKQIAPNLLDRKFTVPDANCYWVGDIT-YVPTSEGWLYLATVIDLFSRKIIGW 162 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 + + + + V L+ + P + D GS + + + + GI+ Sbjct: 163 SMNNNMKADLVNNALLMAIWQRKPPKGLIWHTDRGSQYCSDSH-----LKIIKQHGIKQS 217 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMA 293 SR + E F ++K E++ F E + + YN R H A D Sbjct: 218 MSRKGNCWDNAVAESFFHTIKTELVYQHKFKTREEAKHTIFEYIEVFYNRIRMHSANDYL 277 Query: 294 VPGSR 298 P Sbjct: 278 SPVKY 282 >UniRef50_B1ZP18 Integrase catalytic region n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZP18_OPITP Length = 387 Score = 237 bits (604), Expect = 7e-61, Method: Composition-based stats. Identities = 107/381 (28%), Positives = 158/381 (41%), Gaps = 12/381 (3%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MPW + R ++ ++ +LC RFG+S T YKW R+ +G GL R Sbjct: 1 MPWKIKTAEQQRQALAREMTRGTVSVTALCARFGVSRTTAYKWAARYVAQGVNGLVARQP 60 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGL 122 +++ A + +A WGA K++ WLE G +P T+H + G Sbjct: 61 GRPKQVSQALARWHARVLLARQARPSWGAPKLRWWLERTHPGERVPCSRTLHRWLVAAGR 120 Query: 123 LPGASPGIP----ATGRFEHDAPNRLWQMDFKGHFPFGGG-RCHPLTLLDDHSRFSLCLA 177 + + + N +W DFKG F G LT+ D +SRF L Sbjct: 121 VHQRRRKLRAGPGRPATVLAERVNAVWTADFKGDFYTKDGAWILALTVRDLYSRFMLTAH 180 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWG-DTTGTWTALELWLMRLGIRVGHS 236 + V++ +F R+G+P + +D G+P+ TAL LW RLGI V Sbjct: 181 PVPRQSEPVVRRVFARLFRRFGVPQAIRVDRGTPFCGSGPYGLTALSLWWQRLGIEVQFV 240 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 E+ HR LKAE + +++R W YN +RPHE L P Sbjct: 241 SRKRRLDNNAHEQMHRMLKAEAATPVSRSYGAQVRR-LQRWCGRYNHDRPHEGLAGRTPA 299 Query: 297 SRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQED 356 S Y+PS R PP+Y G + R+V G + + G G+AF G V Sbjct: 300 SLYRPSTRLLP-RLVPPQYPLGCVTRRVRPHGYVKLDGSHRHIGRAFVGLTVAFT--PYR 356 Query: 357 GSYEVWWYSTKVGVIDLKKKS 377 Y V + S +G ID + Sbjct: 357 QLYRVHFDSLLLGTIDPRLTR 377 >UniRef50_A5AHC2 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AHC2_VITVI Length = 1270 Score = 236 bits (603), Expect = 9e-61, Method: Composition-based stats. Identities = 54/294 (18%), Positives = 109/294 (37%), Gaps = 39/294 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + + +L HD + ++KI + G P+ + M + Sbjct: 308 EQEQSRILSHCHDSACGGHFASQKIAMKVIQSGFWWPSLFKDAHSMCKG--CDRCQRLGK 365 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF G FP G + L +D S++ + +++ + Sbjct: 366 LTRRNMMPLNPILIVDIFYVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHK 425 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 426 VVLKFLKDNIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKEATPYHPQTS 480 Query: 245 GKLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R +K ++ S +L + +RT Y L M+ Y+ Sbjct: 481 GQVELANREIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKXI-----LGMSP----YRL 531 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E+ E Sbjct: 532 VYGKACHLPVEIEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELXE 574 >UniRef50_A5AKV0 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5AKV0_VITVI Length = 2067 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 50/303 (16%), Positives = 114/303 (37%), Gaps = 36/303 (11%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++K+ + G P+ + +++ + Sbjct: 476 EQEKHGILSHCHENACGGHFASQKMAMRVLQSGFWWPSLFKDAHEVSKGCDKCQRLGKLS 535 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +DF G FP G + L +D S++ + T++ + Sbjct: 536 RRNMMPLNPILIVDLFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVV 595 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + ++ + PYHPQT G+ Sbjct: 596 LKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFEALLAKYRVKHKVATPYHPQTSGQ 650 Query: 247 LERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E +R +K +++ S +L + +RT Y L M+ Y+ Sbjct: 651 VELANREIKNILMKVVNTNRKDWSVKLLDSLWAYRTAYKTI-----LGMSP----YRLVY 701 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWW 363 + E+ ++K++ + KA + L E++E + ++ Sbjct: 702 GKACHLPVEIEFKAWWAIKKLN-----------MDLTKAGLKRSLDLNELEE-LRNDAYF 749 Query: 364 YST 366 S Sbjct: 750 NSK 752 >UniRef50_A5AH69 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AH69_VITVI Length = 1631 Score = 235 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 48/306 (15%), Positives = 107/306 (34%), Gaps = 51/306 (16%) Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 ++ +L H+ + G P+ + M R T Sbjct: 1361 EEEQQRILIHCHENAST--------KVLQSGFYWPSLFKDAHTMCR--SCDRCQRLGKLT 1410 Query: 134 GRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 R + +W +D G FP G + L +D S++ + + ++ R Sbjct: 1411 RRNQMPMNPILIVDLFDVWGIDLMGPFPMSFGNSYILVGVDYVSKWIKAIPYKHNDHRVV 1470 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQ G+ Sbjct: 1471 LKFLKENIFSRFGVPKAIISDRGTHFCNR-----PFETLLAKYGVKHKVATPYHPQNSGQ 1525 Query: 247 LERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E + +K +++ + +L + +RT Y L M+ Y+ Sbjct: 1526 VELANMEIKNILMKVVITSRRDWSIKLHDSLWAYRTTYKTI-----LGMSP----YRLVY 1576 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGER------VGLKEMQEDG 357 + EY +++++ + +A + LK Q++ Sbjct: 1577 GKACHPRVEVEYKAWWAIKRLN-----------MDLIRAGAKRCDATHGSLALKAYQQNL 1625 Query: 358 SYEVWW 363 + ++ Sbjct: 1626 YLKYYY 1631 >UniRef50_A5BKJ4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BKJ4_VITVI Length = 922 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 52/287 (18%), Positives = 105/287 (36%), Gaps = 37/287 (12%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L + + +KI + G + P+ + M + Sbjct: 62 EQEQQGILSHCLESACGGHFAYQKIAMKVLQSGFSWPSLFKDAHAMCK--SXDRYQRLGK 119 Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 T F +W +DF G FP G + L +D S++ + ++ R ++ Sbjct: 120 LTFDF-----FDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAILCKQNDHRVVLKFLK 174 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++E + Sbjct: 175 ENIFSRFGVPKAIISDEGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTFGQVELAN 229 Query: 252 RSLKAEVLQGK---WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSG 308 R +K +++ +L + +RT Y L M+ Y+ + Sbjct: 230 REIKNILMKVVNTSRRNWFVKLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKACH 280 Query: 309 NTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +Y ++KV+ + + L EM+E Sbjct: 281 LPVEVQYKAWWAIKKVNXD-----------LXRVGMKRCLDLNEMEE 316 >UniRef50_A5BT93 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BT93_VITVI Length = 1184 Score = 235 bits (599), Expect = 3e-60, Method: Composition-based stats. Identities = 49/292 (16%), Positives = 109/292 (37%), Gaps = 35/292 (11%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++K + G P+ + +++ + Sbjct: 721 EQEKHGILSHCHENACGGHFASQKTAMRVLQSGFWWPSLFKDAHEVSKGCDKCQRLGKLS 780 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +BF G FP G + L +D S++ + T+ + Sbjct: 781 RRNMMPLNPILIVDLFDVWGIBFMGPFPMSFGHXYILVGVDYVSKWVEAIPCRTNXHKVV 840 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT G+ Sbjct: 841 LKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFEALLAKYGVKHKVATPYHPQTSGQ 895 Query: 247 LERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E +R +K +++ S +L + +RT Y L M+ Y+ Sbjct: 896 VELANREIKNILMKVVNTNRKDWSVKLLDSLWAYRTAYKTI-----LGMSP----YRLVY 946 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + E+ ++K++ + KA + L E++E Sbjct: 947 GKACHLPVEIEFKAWWAIKKLN-----------MDLTKAGLKRSLDLNELEE 987 >UniRef50_A5BMC5 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5BMC5_VITVI Length = 1382 Score = 234 bits (596), Expect = 6e-60, Method: Composition-based stats. Identities = 51/291 (17%), Positives = 105/291 (36%), Gaps = 35/291 (12%) Query: 75 DDITALLRMAHDR--HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 +L H+ + ++K + G + P+ + M R + Sbjct: 376 PKQHGILSHCHESTCGGHFASQKTAMKVLQLGFSWPSLFKDAHTMCRSCDRCQRLGNLTR 435 Query: 133 TGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + + +W +DF FP G + L +D S++ + ++ R + Sbjct: 436 RNQMPMNPILIVDLFDVWGIDFMRPFPMSFGNSYILVGIDYVSKWVEAILCKQNDHRIVL 495 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++F R+G+P + D G+ + + E L + G++ + PYHPQT ++ Sbjct: 496 KFLKENIFLRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSRQV 550 Query: 248 ERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 E +R +K ++ + +L + +RT Y L M+ Y Sbjct: 551 ELANREIKNILMTVVNTSRRDWSVKLHDSLCAYRTTYKTI-----LGMSS----YCLVYG 601 Query: 305 QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++KV+ + +A + L EM+E Sbjct: 602 KACHLPVEVEYKAWWAIKKVN-----------MDLNRAGMKRCLYLNEMEE 641 >UniRef50_A5APG9 Putative uncharacterized protein n=11 Tax=Vitis vinifera RepID=A5APG9_VITVI Length = 1754 Score = 233 bits (595), Expect = 7e-60, Method: Composition-based stats. Identities = 47/291 (16%), Positives = 104/291 (35%), Gaps = 35/291 (12%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + +L HD + ++K + G P+ + M + + Sbjct: 1381 PKQSGILSHCHDSACGGHFASQKTAMKVIQSGFWWPSLFKDAHSMCKGCDRCQRLGKLTR 1440 Query: 133 TGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W +DF G FP G + L +D S++ + +++ + + Sbjct: 1441 QNMMLLNPILIVDVFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKVVL 1500 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++F R+G+P + D G+ + + + L + G++ + PYH Q G++ Sbjct: 1501 KFLKENIFARFGVPKAIISDGGTHFCN-----KPFQTLLAKYGVKHKVATPYHSQRSGQV 1555 Query: 248 ERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 E + +K ++ S +L + +RT Y L M+ Y Sbjct: 1556 ELANWEIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YSLVYG 1606 Query: 305 QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E++E Sbjct: 1607 KACHLPVEVEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELEE 1646 >UniRef50_A5AYI6 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5AYI6_VITVI Length = 2067 Score = 233 bits (594), Expect = 1e-59, Method: Composition-based stats. Identities = 53/327 (16%), Positives = 113/327 (34%), Gaps = 40/327 (12%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + + +L HD + ++K + G P+ + M + Sbjct: 1389 EQEQSGILSHCHDSACGSHFASQKTSMKVIQSGFWWPSPFKDAHSMCKG--CDRCQRLGK 1446 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF FP G + L +D S++ + +++ + Sbjct: 1447 LTRRNMMPLNPILIVDVFDVWGIDFMXPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHK 1506 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1507 VVLKFLKDNIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTS 1561 Query: 245 GKLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E + +K ++ S +L + +RT Y L M+ Y+ Sbjct: 1562 GQVELANWEIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRL 1612 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDI---SGKLSVKGVSLSAGKAFRGERVGLKEMQ---- 354 + EY ++K+ ++ +K + ER+ Q Sbjct: 1613 VYGKACHLPVEVEYKAWWAIKKLIHGFDKSRVEMKNDAY-LNSKIAKERLKKWHDQLVNQ 1671 Query: 355 ----EDGSYEVWWYSTKVGVIDLKKKS 377 + ++ + LK + Sbjct: 1672 KNFAKGQRVLLYDSKLHLFPGKLKSRW 1698 >UniRef50_A1UAJ3 Integrase, catalytic region n=7 Tax=Actinomycetales RepID=A1UAJ3_MYCSK Length = 426 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 89/395 (22%), Positives = 152/395 (38%), Gaps = 30/395 (7%) Query: 4 LMPWDARDTMSLRTEFVLFA-SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAG-LQD 61 ++ + +R + + + + C +GIS + Y+ +R EG A L+ Sbjct: 1 MVAVNEPIDPLVRLAISQWPDNAPRGAVSTFCAEYGISRKSFYELRKRVKTEGPAAVLEP 60 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHER----WGARKIKRWLEDQGH-TMPAFSTVHNL 116 R P SP++ SD++ E G + + G +P+ +++ + Sbjct: 61 MTRRPKSSPSKLSDEVKEQALAVRAALEATGLDHGPISVHDKMHAMGLERVPSTASLARV 120 Query: 117 MARHGLLPGASPGIPA--TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 G+ P RF + APN WQ+D + GG RC L+DDHSR++L Sbjct: 121 FREAGVARLEPKKKPRSAWRRFVYPAPNACWQLDATEYVLSGGRRCVIFQLIDDHSRYAL 180 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWG-DTTGTWTALELWLMRLGIRV 233 E + + +G+P R+ DNG G L L LG+ Sbjct: 181 ASHVALSETAKEAIAVVDKAIAAHGVPQRLLSDNGIALNPSRRGHVGQLVAHLAALGVEA 240 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DM 292 +PY P TQGK ERFH++L + + ELQ D + +YN ERPH+ L Sbjct: 241 ITGKPYKPTTQGKNERFHQTLFRYLDKQPIAESLAELQCHVDAFDGIYNTERPHQGLPGR 300 Query: 293 AVPGSRYQPSA-------------------RQYSGNTTPPEYDEGVMVRKVDISGKLSVK 333 P + ++ +A R + TP + G V+ ++ +G + Sbjct: 301 VTPRTAWEATAKAPAPRPKPDPPSFDHAVVRPHRPAPTPADLPHGTSVKTLNTAGAFVLA 360 Query: 334 GVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKV 368 GV+ E+V + + + + Sbjct: 361 GVTYKVDGRRSLEQVLVVIDGDKITAADLDGEVLI 395 >UniRef50_A6DU50 Putative ISmav2-like transposase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DU50_9BACT Length = 598 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. Identities = 69/288 (23%), Positives = 120/288 (41%), Gaps = 7/288 (2%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 + + V ++G + + G + + W++R+ +EG GL+ RP+ + Sbjct: 14 KLKCVKLCLEEGYPRKFVAAESGANLKSLGAWIKRYNEEGPQGLKPRPKGKKG-RQQIHP 72 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR 135 + + ++ +G ++I L+ + TV + L+ + Sbjct: 73 ETKEKIIELKKQYPIFGIKRISDLLKRVFFLKASPETVRKTLNEENLIQKERKKPRKNPQ 132 Query: 136 ----FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 FE PN++WQ D F GG + L +DD+SR+ + L + E + + Sbjct: 133 KPRFFERSRPNQMWQTDIFS-FRLGGQAAYLLAFIDDYSRYMVGLGLYRRQTAENLLEVY 191 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 Y P M DNG + + GT T E L + I+ S+P+HP T GK+ERF Sbjct: 192 RRATGEYNCPAEMLTDNGRQYTNWRGT-TRFEKELKKDRIKHIRSQPHHPMTLGKIERFW 250 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 +++ E L F Q+ W YN +RPH+ + P RY Sbjct: 251 KTIWTEFLDRCQFDCMETAQQRITLWIKYYNHQRPHQGIGGLCPADRY 298 >UniRef50_A5AY91 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AY91_VITVI Length = 1162 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 53/294 (18%), Positives = 108/294 (36%), Gaps = 39/294 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + + +L HD + ++K + G P+ + M + Sbjct: 79 EQEQSGILSXCHDSACGGHFASQKTAMKVIQSGFWWPSLFKDAHSMCKG--CDRCQRLGK 136 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF G FP G + L +D S++ + +++ + Sbjct: 137 LTXRNMMPLNPILIVDIFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHK 196 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ +F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 197 VVLKFLKDHIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTS 251 Query: 245 GKLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R +K ++ S +L + +RT Y L M+ Y+ Sbjct: 252 GQVELANRXIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRL 302 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY ++K++ + +A + L E++E Sbjct: 303 VYGKACHLPVEIEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELEE 345 >UniRef50_C5D6W5 Integrase catalytic region n=19 Tax=Firmicutes RepID=C5D6W5_GEOSW Length = 417 Score = 231 bits (589), Expect = 4e-59, Method: Composition-based stats. Identities = 77/344 (22%), Positives = 125/344 (36%), Gaps = 20/344 (5%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 I+ T W R+ + G L+ + R R S D + H Sbjct: 49 IAAKTILDWCTRYKKGGFDALKPKRRSDRGHSRRLSPDDEDHILALRKEHPTMPVTVFYE 108 Query: 99 WLEDQGHT---MPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPF 155 L +QG ++ T++ L+ +H L+ +P RF +D N LWQ D Sbjct: 109 HLIEQGEIPENHISYFTIYRLLKKHNLVGKEILPMPERKRFAYDQINELWQGDLSHGPTI 168 Query: 156 G----GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSP 211 + + +DD SRF E+ + ++ R G P R+ DNG Sbjct: 169 RVNGKAQKTFLIAYIDDCSRFVPYAQFFPSEKFDGLRIVTKEAVLRCGKPKRIYSDNGKI 228 Query: 212 WGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV---LQGKWFADSG 268 + L+ +GI + H++PY PQ++GK+ERF R+++ L+ Sbjct: 229 YRSEV-----LQYACAEMGITLIHTQPYDPQSKGKIERFFRTVQTRFYPLLELDPPKSLE 283 Query: 269 ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDE---GVMVRKVD 325 EL F W +PH +LD P +Q S D RKV Sbjct: 284 ELNERFWRWLEEEYHRKPHASLDGKTPHEVFQSQVHLVSFIEDGDWLDAIFLKREHRKVK 343 Query: 326 ISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVG 369 G +++ F G+ + L+ + V+ KV Sbjct: 344 ADGTITLNKQLYEVPPRFIGQSIELRYDERG--VYVYEDGRKVA 385 >UniRef50_A5BWY6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BWY6_VITVI Length = 1068 Score = 230 bits (588), Expect = 5e-59, Method: Composition-based stats. Identities = 55/333 (16%), Positives = 114/333 (34%), Gaps = 48/333 (14%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD-----------ITALLRMAHDRHERWG 92 Y + + G + + + H + ++R E+ G Sbjct: 414 WYAHIANYLVTGEVPRKWKAQDRKHFFAKIDAYFWEEPFLFKYCADQIIRKCVPEEEQQG 473 Query: 93 ARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLW 145 I G + T ++ G T R + +W Sbjct: 474 I-LIHCHENACGGHFASQKTAMKVLQS-GSCDRCQRLGKLTKRNQMPMNPIIIVDLFNVW 531 Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P + Sbjct: 532 GIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKAII 591 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG---K 262 D G+ + + E L + G++ + PYHPQT ++E +R +K +++ + Sbjct: 592 SDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSEQVELANREIKNILMKVVIMR 646 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVR 322 S +L + +R Y L M+ Y+ + EY ++ Sbjct: 647 RKDWSIKLHDSLWAYRIAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWXAIK 697 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 +++ + +A + L EM+E Sbjct: 698 RLN-----------MDLIRAGAKRCLDLNEMEE 719 >UniRef50_A3TPQ6 Transposase n=7 Tax=Actinomycetales RepID=A3TPQ6_9MICO Length = 402 Score = 230 bits (586), Expect = 8e-59, Method: Composition-based stats. Identities = 97/387 (25%), Positives = 150/387 (38%), Gaps = 16/387 (4%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 MS + +G + RR+G+ A YK R+ EG A + R R P SP Sbjct: 1 MSKARLVITALFVEGLKPAEVSRRYGVHRAWVYKLKARYEAEGEAAFEPRSRRPTTSPRA 60 Query: 73 SSDDITALLRMAHD----RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 + + L+ + + GA I L + +TVH ++ R G + Sbjct: 61 TPEGTVDLVLRLREDLTGKGLDGGADTIVWHLLHGHGVTLSRATVHRILTRAGKVTAEPG 120 Query: 129 GIPATG--RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 P + RFE + PN WQ DF + G +T LDDHSR++L ++ T + Sbjct: 121 KRPKSSFIRFEAEQPNETWQSDFTHYRLSTGADVEVITWLDDHSRYALHVSAHTRTTAKI 180 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGT----WTALELWLMRLGIRVGHSRPYHPQ 242 V + G P DNG + T TALE L L I +SRP HP Sbjct: 181 VLATFRAATAEQGCPAGTLTDNGMVYTVRFATGPGGRTALEHELRTLNIVQKNSRPNHPT 240 Query: 243 TQGKLERFHRSLKAEV-LQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMAVPGSRYQ 300 T GK+ERF +++K + Q A EL + T YN RPH +L + P + Y Sbjct: 241 TCGKVERFQQTMKNWLRAQPDQPATVAELNTLLAAFVTEYNTRRPHRSLPHRSTPATAYN 300 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGV----SLSAGKAFRGERVGLKEMQED 356 + + + V K++ +G ++++ + G+ RV L + Sbjct: 301 ARPKATPTTDRTDDTHDRVRTDKINKNGVVTLRYQGTLHKIGVGRTHARTRVFLLVQDLN 360 Query: 357 GSYEVWWYSTKVGVIDLKKKSITMGKG 383 + + L G G Sbjct: 361 VRIVDAATGELLRELTLDPHRTYHGTG 387 >UniRef50_A5AH70 Putative uncharacterized protein n=20 Tax=Vitis vinifera RepID=A5AH70_VITVI Length = 2203 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 51/292 (17%), Positives = 107/292 (36%), Gaps = 35/292 (11%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H + ++K+ + G P+ + +++ + Sbjct: 1442 EQEKHGILSHCHXNACGGHFASQKMAMRVXQSGFWWPSLFKDAHEVSKGCDKCQRLXKLS 1501 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +DF G FP G + L +D S++ + T++ + Sbjct: 1502 RRNMMPLNPILIVDLFXVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRTNDHKVV 1561 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + GI + PYHPQT G+ Sbjct: 1562 LKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFEALLAKYGINHKVATPYHPQTSGQ 1616 Query: 247 LERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E R +K +++ S +L + +RT Y L M+ Y Sbjct: 1617 VELAKREIKNILMKVVNTNRKDWSVKLLDSLWAYRTAYKTI-----LGMSP----YHLVY 1667 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + E+ ++K++ + KA + L E++E Sbjct: 1668 GKACHLPVEIEFKTWWAIKKLN-----------MDLTKAGLKRSLDLNELEE 1708 >UniRef50_A5AWF5 Putative uncharacterized protein n=16 Tax=Vitis vinifera RepID=A5AWF5_VITVI Length = 2072 Score = 229 bits (584), Expect = 2e-58, Method: Composition-based stats. Identities = 50/294 (17%), Positives = 103/294 (35%), Gaps = 50/294 (17%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G T P+ ++M Sbjct: 1399 EEEQQGILSHCHESACGGHFASQKTAMKVLQSGCTWPSLFKDAHIM--FRSCDRCQRLGK 1456 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + ++ + Sbjct: 1457 LTRRNQMPMNLILIVDLFDVWGIDFMGPFPMSFGNSYILVRVDYVSKWVEAIPCKHNDHK 1516 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + E L + G++ + PYHPQT Sbjct: 1517 VVLKFLKENIFSRFGVPKAIISDGGTHFC-----IRPFETLLAKYGVKHKVATPYHPQTS 1571 Query: 245 GKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R +K +++ +L + +RT Y L M+ Y+ Sbjct: 1572 GQVELANREIKNMLMKVVITRRRDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRL 1622 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + ++ +A + L EM+E Sbjct: 1623 VYGKACHLPVEL----------------------NMDLIRAGTKRCLDLNEMEE 1654 >UniRef50_UPI0001B540A0 transposase for IS3514a n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B540A0 Length = 383 Score = 228 bits (583), Expect = 2e-58, Method: Composition-based stats. Identities = 103/366 (28%), Positives = 157/366 (42%), Gaps = 23/366 (6%) Query: 26 DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL---- 81 + NI CR G+S +K+L R+ EGA G R PH P ++ + Sbjct: 16 EDVNIARFCREHGVSRTVFHKYLNRFRAEGADGFTRRSTAPHRRPTALGTEVAEAVLRAR 75 Query: 82 RMAHDRHERWGARKIKRWLEDQGHT-MPAFSTVHNLMARHG-LLPGASPGIPATGRFEHD 139 + D G I+ LE QG +P+ S V+ ++ HG ++P RFE+ Sbjct: 76 KELADEGLDNGPISIRWRLEAQGAAAVPSQSAVYRILRAHGQIVPQPRKKPRTRRRFEYA 135 Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 PN WQ+D H G + + +LDDHSR + T E L F +G Sbjct: 136 DPNGCWQIDGMEHHLADGTKVCIIQILDDHSRLDVGAYAATGETTAATWAALQHAFAGHG 195 Query: 200 LPDRMTMDNGSPWGDTT-GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 LP + DNG + G LE L LGI + P+HPQT GK ER H++L+ + Sbjct: 196 LPVALLSDNGLAFSGKHRGRMVELERRLAALGITAIAAAPHHPQTCGKNERSHQTLQKWL 255 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 +LQ D +RT+YN R H++L+ P RY + T P G Sbjct: 256 AARPAAGTLAQLQELLDEYRTIYNH-RRHQSLNGDTPRQRYDARPKAVP--ATGPRRPSG 312 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI--DLKKK 376 + R V +G ++ G S+ G+ + G + V+W +V V+ D + Sbjct: 313 LATRPVSATGVIAFSGCSIVLGRRWAG-----------HTASVYWQGDRVTVMINDTIAR 361 Query: 377 SITMGK 382 +T+ + Sbjct: 362 QLTLDR 367 >UniRef50_Q3SW20 Helix-turn-helix, Fis-type n=112 Tax=Bacteria RepID=Q3SW20_NITWN Length = 785 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 80/300 (26%), Positives = 120/300 (40%), Gaps = 16/300 (5%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R S + E + Q + + GI AT Y+W R+ G L D P Sbjct: 461 RYPASEKAEIIALVEQSHLPAKRTLDKLGIPRATFYRWYDRYRAGGIEALADHRSRPDRV 520 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP-GASP 128 NR DD+ + R++ D+ + ++V+ L+ H L+ A Sbjct: 521 WNRIPDDVRGQIIDLALELPELSPRELAVRFTDERKYFVSEASVYRLLKAHDLITSPAYV 580 Query: 129 GIPATGRF--EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 I A F + A N+LWQ DF G G + T+LDD SR+ + Sbjct: 581 VIKAANEFKDKTTAANQLWQTDFTYLKITGWGWYYLSTVLDDFSRYIVAWRLGPTMCASD 640 Query: 187 VQQQLVSVFERYGL-------PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 V L GL R+ DNGS + L WL ++ PY Sbjct: 641 VTATLDQALAASGLDHVSVRQRPRLLSDNGSSYVADD-----LATWLRAKDMQHVRGAPY 695 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 HPQTQGK+ER+H++LK +L ++ +L+R + YN +R HE++ P Y Sbjct: 696 HPQTQGKIERWHQTLKNRILLENYYL-PDDLKRQVAAFVEHYNHDRYHESIGNVTPADVY 754 >UniRef50_B7GET7 Transposase n=6 Tax=Bacillales RepID=B7GET7_ANOFW Length = 275 Score = 228 bits (581), Expect = 4e-58, Method: Composition-based stats. Identities = 51/295 (17%), Positives = 106/295 (35%), Gaps = 41/295 (13%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAH-DRH 88 + +C +S + YKW R P + +++T +R + + Sbjct: 1 MEKMCEVLKVSRSGYYKWRDR---------------PKSARQERREELTQEVRRVYIESR 45 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------------- 133 + +G+ K+ + L +G + + TV +M G+ AT Sbjct: 46 QLYGSPKVTKKLNHEGIKV-SQKTVSRIMKEKGMKSRTVKKHKATTNSKHNHPVHENVLN 104 Query: 134 GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 F PN +W D + P G + +++D +SR + ++E V L Sbjct: 105 QNFTVTKPNEVWVADIT-YVPTDEGWLYLASVMDLYSRKIVGWHIDCSMKKELVLSALKQ 163 Query: 194 VFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++R + D GS + + L+ G++ SR + ++ FH Sbjct: 164 AYQRQQPQGSILHHSDRGSQYASND-----YQAKLIEYGMKCSMSRKGNCYDNACIKSFH 218 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSARQ 305 +K E++ + E +++ + YN +R H A + P + ++ Sbjct: 219 GIIKKELIYQTRYKTREEAKKSIFEYIEIFYNNKRIHSATEYFSPSEYERMYYKK 273 >UniRef50_A8ZKJ8 Integrase, catalytic region n=10 Tax=Bacteria RepID=A8ZKJ8_ACAM1 Length = 290 Score = 227 bits (579), Expect = 6e-58, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 99/306 (32%), Gaps = 42/306 (13%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL 80 + + + I +C+ +S + Y W++R P H N + Sbjct: 1 MESEKVNVTITLMCKVLKLSRSGYYAWMKRQPS------------PRHQENAILSERIQQ 48 Query: 81 LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT------- 133 + + + +G+ +I L +G + V LMA+ G+ A T Sbjct: 49 IHD--ESRQTYGSPRIHASLIARGFR-ASRQRVVRLMAQLGICAQAKRPFKVTTDSEHDG 105 Query: 134 --------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 F + P++ W D + G + ++D SR + + R Sbjct: 106 PIAPNILDRTFTTEEPDQAWVADIT-YIRTHEGWLYLAVIIDLFSRRVVGWSMAEHMRTP 164 Query: 186 TVQQQLVSVFERYGLPD----RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 V L + + +P D GS + + L++ GI SR + Sbjct: 165 LVLNALKAALGQR-IPAQTGLIFHSDRGSQYASGD-----YQQALLKRGITCSMSRRANC 218 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQ 300 E F +LK E++ FA+ + W YN +R H + P + Sbjct: 219 WDNAVAESFFGTLKTELIYPTTFANRAMAKTVIAEWIEVFYNRQRLHSTIGYCTPVQFEE 278 Query: 301 PSARQY 306 R Sbjct: 279 NYWRTL 284 >UniRef50_A5B960 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5B960_VITVI Length = 1144 Score = 227 bits (579), Expect = 6e-58, Method: Composition-based stats. Identities = 44/291 (15%), Positives = 104/291 (35%), Gaps = 35/291 (12%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 +L H+ + ++KI + G P+ + M + + Sbjct: 863 PKQQGILSHCHESACGGHFSSQKIAMKVLQSGFCWPSLFKDAHTMCKSCDRCQRLGKLTL 922 Query: 133 TGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W +DF G FP G + L +D S++ + ++ R + Sbjct: 923 KNMMPLNLILIVDLFYVWAIDFMGPFPMSFGYSYILVGVDYVSKWVEAIPCKRNDHRVVL 982 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++F ++ +P + D + + + + E+ L + G++ + PYHPQT ++ Sbjct: 983 KFLKENIFSKFRVPKAIISDGSTHFCN-----KSFEILLAKYGVKHKVATPYHPQTSSQV 1037 Query: 248 ERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 + +K +++ + +L ++T YN L ++ Y+ Sbjct: 1038 GLANWEIKNILMKVVNASIRDWSVKLHDLLWAYKTAYNTI-----LGISP----YRLVYG 1088 Query: 305 QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + +Y ++ ++ + KA + L EM+E Sbjct: 1089 KACHLPVEVQYKAWWAIKMLN-----------IDLNKADMKRFLDLNEMEE 1128 >UniRef50_A5BSG6 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BSG6_VITVI Length = 1013 Score = 226 bits (577), Expect = 1e-57, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 104/295 (35%), Gaps = 35/295 (11%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + + H+ + ++K + G P+F + M R + Sbjct: 519 EQEQQGIFSHCHNSACGGHFASQKTAMKVLQSGFCCPSFFKDTHTMCRSCDKCQRLGKLT 578 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +D FP G L +D S++ + T++ R Sbjct: 579 HRNMMPLNPILIVDFFYVWGIDCMRPFPMSFGYSFILMGVDYVSKWVEAIPCKTNDHRVV 638 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E + + G++ + PYH QT + Sbjct: 639 LKFLKENIFSRFGVPKVIISDGGTHFCN-----KPFETLVAKYGVKHKVATPYHSQTSRQ 693 Query: 247 LERFHRSLKAEVLQGKW---FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E +R +K +++ S +L + ++T Y L M+ Y+ Sbjct: 694 VELANREIKNILMKVVNTSIRDWSVKLHDSLWAYKTTYKTI-----LGMSP----YRLVY 744 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGS 358 EY ++K++ + +A + + L EM+E Sbjct: 745 GNACHLPMEVEYKAWWAIKKLN-----------MDLSQAGMKKFLDLNEMEETWR 788 >UniRef50_UPI0001986237 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001986237 Length = 1360 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 67/354 (18%), Positives = 127/354 (35%), Gaps = 41/354 (11%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH--ERWGARKI 96 +S A Y W + + R +P D+ +LRM H+ + +RK Sbjct: 956 LSRAKHYAWDDPYLYKFCPDQIMRRCVP-------EDEQQDILRMCHEGACGGHFASRKT 1008 Query: 97 KRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDF 149 + G P N P R++ W +DF Sbjct: 1009 SAKILQSGFYWPTMFKDCNT--HCKSCPQCQQLGKINTRYQMPQNHICVVEVFDCWGLDF 1066 Query: 150 KGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 G FP G + L +D S++ +A +++ + ++ ++F R+G+P + D G Sbjct: 1067 MGPFPHSFGNLYILVGVDYVSKWVEAVACKSNDHKVVLKFLKENIFSRFGIPRAIISDGG 1126 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK---WFAD 266 S + + L + G+R S PYHPQT G+ E +R +K + + Sbjct: 1127 SHFCN-----KPFSTLLQKYGVRHKVSTPYHPQTNGQAELANREIKRILTKVVNTIRKDW 1181 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDI 326 S +L A +RT Y L M+ Y+ + + E+ ++K++ Sbjct: 1182 STKLSDALWAYRTAYKTV-----LGMSP----YRIAYGKACHLPVELEHRAYWAIKKMNF 1232 Query: 327 SGKLSVKGVSLSAG--KAFRGERVG-LKEMQEDGSYEVWWYSTKVGVIDLKKKS 377 + +A+R E L+ +E +++ + + K+ Sbjct: 1233 DSDQAGAKRKYDLNELEAYRNESYECLRNAREKHK---FYHDKLILRREFKQGE 1283 >UniRef50_A5C4S0 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C4S0_VITVI Length = 1374 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 47/292 (16%), Positives = 103/292 (35%), Gaps = 35/292 (11%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++KI + G P+ M R + Sbjct: 1079 EQEQQGILSYCHESAYGGHFASQKIAMKVLQSGFCWPSLFKDALTMCRSCDKCQRLGKLT 1138 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +DF HFP G + L +D S++ + ++ R Sbjct: 1139 RKNMMPLNPILIVDLFYVWGIDFMRHFPMSFGYSYILVGVDYVSKWVEAIPCKRNDHRVV 1198 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + ++ + PYHPQT G+ Sbjct: 1199 IKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYEVKHKVATPYHPQTSGQ 1253 Query: 247 LERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 ++ +R +K +++ + +L + ++ Y +L Y Sbjct: 1254 VKLANREIKKVLMRVVNTSRRDWCVKLHDSLWAYKIAYKTILR-MSL--------YCLVY 1304 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + +Y ++ ++ + KA + L EM+E Sbjct: 1305 GKACHLPVEVQYKAWWAIKTLN-----------MDLNKADMKRFLDLNEMEE 1345 >UniRef50_A5C046 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5C046_VITVI Length = 1565 Score = 225 bits (575), Expect = 2e-57, Method: Composition-based stats. Identities = 57/316 (18%), Positives = 109/316 (34%), Gaps = 52/316 (16%) Query: 45 YKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQG 104 Y W + + + A R +P ++ +L H+ G Sbjct: 1090 YYWEEPFLFKYCADQIIRKCVPK-------EEQQRILIHCHEN--------------ACG 1128 Query: 105 HTMPAFSTVHNLMARHGLLPG--ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHP 162 + T + R L +P D +W +DF G F G Sbjct: 1129 GHFTSQKTAMKELCRCQRLGKLTRRNQMPINPILIVD-LFDVWGIDFMGQFLMSFGNSFI 1187 Query: 163 LTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTAL 222 L +D S++ + ++ ++ ++F R+G+P + D G+ + + Sbjct: 1188 LVGVDYVSKWVEVIPCKHNDHSVXLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPF 1242 Query: 223 ELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRT 279 E L + G++ + PYHPQT G++E +R +K + + + +L + +RT Sbjct: 1243 ETLLTKYGVKHKVATPYHPQTSGQVELANREIKNILTKVVNTSRRDWSVKLHDSLWAYRT 1302 Query: 280 VYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSA 339 Y L M+ Y + EY ++KV+ + Sbjct: 1303 AYKTI-----LGMS----LYSLVYGKVCHLLVEVEYKAWWAIKKVN-----------MDL 1342 Query: 340 GKAFRGERVGLKEMQE 355 +A + L EM+E Sbjct: 1343 IRARAKRCLDLNEMEE 1358 >UniRef50_A5AMM4 Putative uncharacterized protein n=14 Tax=Vitis vinifera RepID=A5AMM4_VITVI Length = 2056 Score = 225 bits (575), Expect = 2e-57, Method: Composition-based stats. Identities = 49/288 (17%), Positives = 103/288 (35%), Gaps = 53/288 (18%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + +K + G T P+ ++M Sbjct: 1345 EEEQQGILSHCHENACGGHFAFQKTTMKVLQSGFTWPSLFKDAHIM-------------- 1390 Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 +W +DF G F G + L +D S++ + ++ R ++ Sbjct: 1391 -------FDLFDVWSIDFMGPFLMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLK 1443 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++F R+G+P + G+ + + E L + G++ + PYHPQT ++E + Sbjct: 1444 ENIFSRFGVPKAIISYGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSRQVELEN 1498 Query: 252 RSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 R +K +++ + L + +RT Y L M+ Y+ + Sbjct: 1499 REIKNILMKVV-ITSRKDWSIKLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKAC 1548 Query: 308 GNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + +A + L EM+E Sbjct: 1549 HLPVEVEYKAWSAIKKLN-----------MDLIRAGAKRCLDLNEMEE 1585 >UniRef50_A5AVQ5 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5AVQ5_VITVI Length = 1928 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 58/326 (17%), Positives = 109/326 (33%), Gaps = 66/326 (20%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRS--SDDITALLRMAHDRH--ERWGARKIKRW 99 Y + + G IP+ + D+ +L H+ + ++K Sbjct: 1118 WYAHIANYLVTGE--------IPNQIIRKCVPEDEQQGILSHCHENACGGHFASQKTAMK 1169 Query: 100 LEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGH 152 + G T P+ ++M R T R + +W +DF G Sbjct: 1170 VLQSGFTWPSLFKDAHIMCR--SCDRCQRLGKLTKRNQMPMNPILIVELFDVWGIDFMGP 1227 Query: 153 FPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW 212 FP G + L +D S++ + ++ R + D G+ + Sbjct: 1228 FPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRV-----------------AIISDGGAHF 1270 Query: 213 GDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG---KWFADSGE 269 + E L + G++ + PYHPQT G++E +R +K +++ S Sbjct: 1271 CN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKNILMKVVNSNRKDWSIR 1325 Query: 270 LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGK 329 L + +RT Y L M+ Y+ + EY ++K++ Sbjct: 1326 LHDSLWAYRTTYKTI-----LGMSP----YRLVYGKACHLPXEVEYKAWWAIKKLN---- 1372 Query: 330 LSVKGVSLSAGKAFRGERVGLKEMQE 355 + KA + L EM+E Sbjct: 1373 -------MDLIKAGEKXFLDLNEMEE 1391 >UniRef50_A5AQQ3 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AQQ3_VITVI Length = 1599 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 47/261 (18%), Positives = 99/261 (37%), Gaps = 28/261 (10%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 +L H+ + + ++K + G + + + M R Sbjct: 1107 PKQQGILSHCHESACGDHFASQKTTMKVLQSGFSWSSLFKDAHTMCR--SCDRCQRLGKL 1164 Query: 133 TGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R + +W +DF G FP G + L +D S++ + ++ + Sbjct: 1165 TQRNQMPMNPILIVNLFNVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEVIPCKHNDHKV 1224 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + G++ + YHPQT G Sbjct: 1225 VLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATHYHPQTSG 1279 Query: 246 KLERFHRSLKAEVLQGKW---FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K+ +++ S +L + +RT Y L M+ Y+ Sbjct: 1280 QVELANREIKSILMKVVNTSIRDWSVKLHDSLWAYRTAYKTM-----LAMSP----YRLV 1330 Query: 303 ARQYSGNTTPPEYDEGVMVRK 323 + EY +++K Sbjct: 1331 YGKTCHLPVEVEYKAWWVIKK 1351 >UniRef50_A5ASA6 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5ASA6_VITVI Length = 1839 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 51/286 (17%), Positives = 105/286 (36%), Gaps = 35/286 (12%) Query: 80 LLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFE 137 +L HD + ++K + G P + M + + Sbjct: 1376 ILSHCHDSACGGHFASQKTAMRVVQSGFWWPFLFKDAHSMCKGCDQCQRLGKLTCRNMMP 1435 Query: 138 HDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 + +W +DF G FP G + L +D S++ + +++ + ++ Sbjct: 1436 LNPILIVDVFDVWGIDFMGPFPMSFGHSYILVGVDYVSKWVDAIPCRSNDHKVVLKFLKE 1495 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++E +R Sbjct: 1496 NIFSRFGVPKAIISDXGTHFCNXX-----FETLLAKYGVKHKVATPYHPQTSGQVELANR 1550 Query: 253 SLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGN 309 +K ++ S +L + +RT Y L M+ Y+ + Sbjct: 1551 EIKNILMKVVNVNRKDWSTKLLDSLWAYRTAYKTI-----LRMSP----YRLVYGKACHL 1601 Query: 310 TTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + A + L E++E Sbjct: 1602 PVEVEYKAWWAIKKLN-----------MDLTSAGLKRYLHLNELEE 1636 >UniRef50_A5BI07 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5BI07_VITVI Length = 1803 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 89/222 (40%), Gaps = 28/222 (12%) Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 +W +DF G FP G + L +D S++ + ++ R ++ ++F Sbjct: 1196 TAMKLFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFS 1255 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 R+G+P + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 1256 RFGVPKAIISDGGAHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANREIKN 1310 Query: 257 EVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPP 313 +++ + L + +RT Y L M+ Y+ + Sbjct: 1311 ILMKVVNASRKDWSIRLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKACHLLMEV 1361 Query: 314 EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + +A + L EM+E Sbjct: 1362 EYKAWWAIKKLN-----------MDLIRAGAKRCLDLNEMEE 1392 >UniRef50_A5BVP5 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BVP5_VITVI Length = 1979 Score = 224 bits (571), Expect = 5e-57, Method: Composition-based stats. Identities = 53/321 (16%), Positives = 113/321 (35%), Gaps = 42/321 (13%) Query: 45 YKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH--ERWGARKIKRWLED 102 Y W + + + AG R +P + + L HD + ++K + Sbjct: 1271 YYWEEPFLFKYCAGQIIRKCVP-------EQEQSGXLSHCHDSACGXHFASQKTAMRVVQ 1323 Query: 103 QGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP-----NRLWQMDFKGHFPFGG 157 G P+ M + + + +W +DF FP Sbjct: 1324 SGFWXPSXFKDAXSMCKGCDRCQRLGKLTRRNMMXLNPILIVDVFDVWGIDFMXPFPMSF 1383 Query: 158 GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTG 217 G + +D S++ + +++ + ++ ++F R+G+ + D G+ + + Sbjct: 1384 GHSYIXVGVDYVSKWVEAIPCRSNDHKVVLKFLKENIFSRFGVXKAIISDGGTHFCN--- 1440 Query: 218 TWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL---QGKWFADSGELQRAF 274 E L + G++ + PYHP T G++E +R +K ++ S +L + Sbjct: 1441 --KPFETLLAKYGVKHKVATPYHPXTSGQVELANREIKNILMKVVNVNRKDWSIKLLDSL 1498 Query: 275 DHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKG 334 +RT Y L M+ Y+ + EY ++K++ Sbjct: 1499 WAYRTXYXTI-----LGMSP----YRLVYGKACHLPXEVEYKAWWAIKKLN--------- 1540 Query: 335 VSLSAGKAFRGERVGLKEMQE 355 + +A + L E++E Sbjct: 1541 --MDLTRARLKRCLDLNELEE 1559 >UniRef50_A5AWA7 Putative uncharacterized protein n=6 Tax=Vitis vinifera RepID=A5AWA7_VITVI Length = 2136 Score = 223 bits (570), Expect = 6e-57, Method: Composition-based stats. Identities = 47/222 (21%), Positives = 89/222 (40%), Gaps = 28/222 (12%) Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 +W +DF G FP G + L +D S++ + ++ R ++ ++F Sbjct: 1845 TAMKIFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFS 1904 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 R+G+P + D G+ + + E L + G++ + PYHPQT G++E +R +K Sbjct: 1905 RFGVPKAIISDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTSGQVELANREIKN 1959 Query: 257 EVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPP 313 +++ S L + +RT Y L M+ Y+ + Sbjct: 1960 ILMKVVNSNRKDWSIRLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEV 2010 Query: 314 EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + KA + L EM+E Sbjct: 2011 EYKAWWAIKKLN-----------MDLIKAGEKRFLDLNEMEE 2041 >UniRef50_A5BPW1 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BPW1_VITVI Length = 1335 Score = 223 bits (570), Expect = 6e-57, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 107/295 (36%), Gaps = 41/295 (13%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + T P+ + M R Sbjct: 567 EEEQQGILSHCHENACGGHFASQKTTMKVLQSEFTWPSLFKDAHTMCR--SCDKCQRLGK 624 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D ++ + + ++ R Sbjct: 625 LTRRNQMPMNPILIVDFFYVWGIDFMGPFPMSFGNSYILVGVDYVFKWVEPIPYKHNDHR 684 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + L+ L + G++ + PYHPQT Sbjct: 685 VVLKFLKENIFLRFGVPKAIISDGGTHFCN-----KPLDTLLAKYGVKHKVATPYHPQTS 739 Query: 245 GKLERFHRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G+++ +R +K +++ + L + +R L M+ Y+ Sbjct: 740 GQVKLANREIKNILMKVV-ITSRKDWSIKLHDSLWAYRAARKTI-----LGMSP----YR 789 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY +++++ + + + L EM+E Sbjct: 790 LVNGKACHLPMEVEYKAWWAIKRLN-----------MDLIRVGVKRCLNLNEMEE 833 >UniRef50_A5B3F9 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5B3F9_VITVI Length = 1686 Score = 223 bits (569), Expect = 8e-57, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 100/286 (34%), Gaps = 35/286 (12%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 L H+ + ++K + G +P+ + M + + Sbjct: 1056 PKQQRTLSHFHENACGGHFASQKTAMRVLQSGFCLPSLFKDAHTMCKSCNRCQRVGKLTR 1115 Query: 133 TGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 + +W +DF FP G + L +D S++ + ++ R + Sbjct: 1116 KNMMPLNPILIVDLFYVWGIDFMRPFPMSFGYSYILVGVDYVSKWVEVVPSKHNDHRVVL 1175 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + ++ R+G+P + + G+ + + E L + G++ + PYHPQT G++ Sbjct: 1176 KFLKENICSRFGVPKAIISNGGTHFCN-----KPFETLLTKYGVKHKVATPYHPQTSGQV 1230 Query: 248 ERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 E + +K +++ S +L + +RT Y L M+ Y+ Sbjct: 1231 ELANWEIKNILMKVVNTNRKDWSAKLFDSLWAYRTTYKTI-----LGMSP----YRLVYD 1281 Query: 305 QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGL 350 + EY ++K++ + KA + L Sbjct: 1282 KACHLPVELEYKAWWAIKKLN-----------MDLSKARMKRFLDL 1316 >UniRef50_C6PFD7 Integrase catalytic region n=3 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFD7_CLOTS Length = 412 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. Identities = 71/384 (18%), Positives = 144/384 (37%), Gaps = 23/384 (5%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 + T E++ + ++ SL +R SP T WL + + G GL + R Sbjct: 22 NENYTQETAKEYMEVITSKVYDVPSLGKR-EFSPNTIKTWLYCYRKYGFEGLYPKSRCDK 80 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGLLP 124 + +DD+ A ++ + R A+ I + L + + STV + + + Sbjct: 81 GASRVLTDDVKAYIKNLKLDNPRRSAKSIYQELLVKKFIELDKVSLSTVQRYLRKTKIST 140 Query: 125 GASPGIPATGRFEHDAPNRLWQMDF-KGHFPF---GGGRCHPLTLLDDHSRFSLCLAHCT 180 ++ FE + PN WQ D G + + + + LDD SR Sbjct: 141 -SALNTKDRRSFEMEYPNDCWQSDISMGPYLIINDKKIKTYLIAFLDDSSRLITHAEFYD 199 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 + ++ + G+P ++ +DNG + L L LG + ++ PY Sbjct: 200 TDNVISLIDAFKKAVSKRGVPKKLFVDNGKVFQSE-----QLHLICASLGTSLCYAEPYS 254 Query: 241 PQTQGKLERFHRSLKAEVLQGKWF---ADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P+++GK+ERF R+LK + + G + + EL + + + H + +M P Sbjct: 255 PESKGKIERFFRTLKDQWMYGFDWQKISSIDELNENLNKYIEGIYHQTVHSSTNMK-PIE 313 Query: 298 RYQPSARQYSGNTTPPEYD---EGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQ 354 ++ + + D + R+V +S++ + + G+ V ++ Sbjct: 314 KFIKYTDTMKFINSKEDLDNIFLYRVKRRVIKDATVSIEKIKFEVPMQYIGDYVNIRYYP 373 Query: 355 EDGSYEVWW--YSTKVGVIDLKKK 376 + + + I K Sbjct: 374 KSLDKAYIFSEDGKLLQTIHPVNK 397 >UniRef50_Q2JC89 Integrase n=5 Tax=Actinomycetales RepID=Q2JC89_FRASC Length = 281 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 60/297 (20%), Positives = 96/297 (32%), Gaps = 38/297 (12%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAH-DRH 88 + CR G+S + Y W+ R P + + + + H + Sbjct: 3 VAVACRVLGVSRSGYYDWIGR---------------PPSLREQENTLLAKQIERIHLESR 47 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA------------TGRF 136 +G ++ L V LM GL +F Sbjct: 48 GTYGWPRVHAELALGLGVPVNHKRVARLMREAGLQGVYRRRARRGPVAEATAEDLVNRQF 107 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 DAP+RLW D H P G G+ + ++D +SR + + R E V L Sbjct: 108 AVDAPDRLWLTDITEH-PTGDGKLYCAAVMDAYSRRIIGWSIAHHIRTELVLDALGMAIL 166 Query: 197 RYGLPD---RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 R P+ + D+G+ + A L G+ +E F + Sbjct: 167 RRRPPEKQTILHSDHGTQYTSW-----AFGNRLRIAGLLPSMGTVGDCYDNSMMESFWGT 221 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSARQYSGN 309 L+ EVL + + EL A W YN +R H +L M P + S + Sbjct: 222 LQLEVLDRHTWENRDELANAIFEWIECWYNPKRRHSSLGMLSPIDYEAAHLPRSSPD 278 >UniRef50_A5B213 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5B213_VITVI Length = 1197 Score = 222 bits (565), Expect = 3e-56, Method: Composition-based stats. Identities = 47/287 (16%), Positives = 104/287 (36%), Gaps = 41/287 (14%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G P+ + M +P Sbjct: 801 EEEQQGILSHCHENACGGHFASQKTTMRVLQSGFYWPSLFKDAHTMN----------MMP 850 Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 D +W ++ G FP G + L +D S++ + ++ ++ Sbjct: 851 LNPILVVD-LFYVWGINLMGPFPMSFGYSYILVGVDYVSKWVEAVPCKHNDHGMVLKFLN 909 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++ R+G+P + D + + + E L + G++ + PYHPQT G++E + Sbjct: 910 ENISSRFGVPKAIISDGATHFCN-----KPFETLLAKYGVKHKVAIPYHPQTSGQVELAN 964 Query: 252 RSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSG 308 R +K +++ S +L + +RT Y L M+ Y+ + Sbjct: 965 REIKNILMKVMNTNRKDWSAKLLDSLWAYRTTYKTI-----LGMSP----YRLVYGKTCH 1015 Query: 309 NTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + + + L E++E Sbjct: 1016 LPVELEYKAWWAIKKLN-----------MDLSRVGFKRFLDLNELEE 1051 >UniRef50_A5BJP7 Putative uncharacterized protein n=6 Tax=Vitis vinifera RepID=A5BJP7_VITVI Length = 1265 Score = 221 bits (564), Expect = 3e-56, Method: Composition-based stats. Identities = 51/259 (19%), Positives = 102/259 (39%), Gaps = 27/259 (10%) Query: 131 PATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 T R + +W +DF G FP G + L +D S++ + ++ Sbjct: 365 RLTKRNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDH 424 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 R ++ ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 425 RVVLKFLKDNIFSRFGVPKAIISDGGAHFCN-----KPFETLLAKYGVKHKVATPYHPQT 479 Query: 244 QGKLERFHRSLKAEVLQGKWFA---DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 G++E +R +K +++ + S L + +RT Y L M+ Y+ Sbjct: 480 SGQVELANREIKNILMKVVNASRKNWSIRLHDSLWAYRTAYKTI-----LGMSP----YR 530 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYE 360 + EY ++K+++ S + K + + + KE Q+ Sbjct: 531 LVYGKACHLPVEVEYKAWWAIKKLNMDLIQSWSKERM---KKWHDQLISNKEFQKGQRVL 587 Query: 361 VWWYSTKVGVIDLKKKSIT 379 ++ + LK + I Sbjct: 588 LYDTRLHIFPGKLKSRWIG 606 >UniRef50_A5ACN5 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5ACN5_VITVI Length = 1390 Score = 221 bits (563), Expect = 4e-56, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 103/298 (34%), Gaps = 34/298 (11%) Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P + A + + + + + P+ + M + Sbjct: 524 PXEWSAQDKRHFFAKIHAYYWEEPFL-FKYCADQIIRKWFWWPSLFKDAHSMCKGCDRCQ 582 Query: 126 ASPGIPATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 + + +W +DF FP G + L +D S+ + + Sbjct: 583 XLGKLTRRNMMPLNPILIVDVFDVWGIDFMXPFPMSFGHSYILVGVDYVSKXVEAIPCRS 642 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 ++ R ++ ++F R+G+P + D G+ + + E L + G++ + PYH Sbjct: 643 NDHRVVLKFLKDNIFARFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYH 697 Query: 241 PQTQGKLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 PQT G++E +R +K ++ S +L + +RT Y L M+ Sbjct: 698 PQTSGQVELANREIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP--- 749 Query: 298 RYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y+ + EY ++K++ + +A + L E++E Sbjct: 750 -YRLVYGKACHLPVEIEYKAWWAIKKLN-----------MDLTRAGLKRCLDLNELEE 795 >UniRef50_Q3A8V0 ISChy3, transposase n=5 Tax=Clostridia RepID=Q3A8V0_CARHZ Length = 448 Score = 220 bits (562), Expect = 6e-56, Method: Composition-based stats. Identities = 92/382 (24%), Positives = 158/382 (41%), Gaps = 28/382 (7%) Query: 15 LRTEFVL-------FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 R + A++ + + R IS T ++LQ + Q+G +GL + R + Sbjct: 17 KRFALIAPLLEPDLEAAEKRQRRKEILARSEISSRTLRRYLQLYRQQGLSGLMPKIRSDN 76 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLMARHGLLP 124 S S +I + +I LE + M A ST+ ++R GL Sbjct: 77 GSSRTISHEIIEEAVKLKEELPERSVSQIIAILEGEKKVPAGMLARSTLGRHLSRLGLTQ 136 Query: 125 G-ASPGIPATGRFEHDAPNRLWQMDFKGHF-------PFGGGRCHPLTLLDDHSRFSLCL 176 A+ I RF + NRLWQ D K P R + + +DD +R Sbjct: 137 KEANQKISGHRRFAKEQRNRLWQADIKYGPYLPHPKNPKRKVRTYLVAFIDDATRLLCHG 196 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 D++R ++ + G+PD + +DNG + L RLGIR ++ Sbjct: 197 EFYLDQKRPVLEDCFRKAILKRGIPDAVYVDNGKIFVSRW-----FRLGCARLGIRPINT 251 Query: 237 RPYHPQTQGKLERFHRSLKA--EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 +PY P+++GK+ERF+R++++ ++ + EL +AF W PH +L+ Sbjct: 252 KPYSPESKGKIERFNRTVESFIAEIELQQPETLAELNQAFAVWVEEGYNHHPHSSLENET 311 Query: 295 PGSRYQPSARQYSGNTTPP--EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKE 352 P +R+Q R+ + E R+VD +G + ++G G + + V L+ Sbjct: 312 PANRFQKDTRRLRFASLEECREAFLWEASRRVDKTGCIKLEGRFYEIGLEWIRKTVDLRY 371 Query: 353 MQEDG-SYEVWWYSTKVGVIDL 373 D S E W+ K G+ Sbjct: 372 DPFDLESIEFWYNGQKQGLAKP 393 >UniRef50_Q43917 ORF2 gene product (Fragment) n=15 Tax=cellular organisms RepID=Q43917_ACIAD Length = 305 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 54/310 (17%), Positives = 93/310 (30%), Gaps = 45/310 (14%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + + + + S C+ G+S + Y W +R Sbjct: 12 QEKYTVIQDLDVNEVTVSSACKCLGVSTSGYYAWRKR-------------------QTNL 52 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + L + H R GA + + D G++M + TV ++ + GL + T Sbjct: 53 AQKYNDLKAVYWQHHARLGAPSLVHDMHDLGYSM-SERTVGRMLKKLGLRSKIARKYKHT 111 Query: 134 ---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 +F + PN++W D + G + +LD SR + Sbjct: 112 TDSNHRLPTAPNLLDRQFTVNEPNKIWTTDIT-YIRTKQGWLYLCVMLDLFSRRIVGWQT 170 Query: 179 CTDERRETVQQQLVSVFERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 R+ V R G P + D GS + L+ S Sbjct: 171 SHRIDRQLVCDAFHYAMARQGYPMGVMVHSDQGSQYCSRD-----FRALLLTNNCVQSMS 225 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVP 295 R + E F +LK ++ G FA E + YN R H P Sbjct: 226 RRGNCWDNAVTESFFHTLKGHMVHGSVFATRKEANAVLFDYIEIYYNRIRRHSTNGWLSP 285 Query: 296 GSRYQPSARQ 305 ++ + Sbjct: 286 -EAFEQKYFK 294 >UniRef50_A5BWH5 Putative uncharacterized protein n=17 Tax=Vitis vinifera RepID=A5BWH5_VITVI Length = 2160 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 53/289 (18%), Positives = 104/289 (35%), Gaps = 44/289 (15%) Query: 79 ALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGLLPGASPGIPATGRF 136 ++R E+ G I + G + T ++ T R Sbjct: 1190 QIIRKCVXEDEQQG---ILSHCHENACGGHFASQKTAMKVL----SCDRCQRLGKLTKRN 1242 Query: 137 EHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 + +W ++F G FP G + L +D S++ + ++ R ++ Sbjct: 1243 QMPMNXILIVELFDVWGINFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKF 1302 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 ++F R+G+P + D G+ + + E L + ++ + PYHPQT G++E Sbjct: 1303 LKENIFSRFGVPKAIISDGGAHFCN-----KPFEALLSKYXVKHKVATPYHPQTSGQVEL 1357 Query: 250 FHRSLKAEVLQGK---WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 +R +K +++ S L + +RT Y L M+ Y+ + Sbjct: 1358 ANREIKNTLMKVVNSXRKDWSIRLHDSLWAYRTAYKTI-----LRMSP----YRLVYGKA 1408 Query: 307 SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + KA + L EM+E Sbjct: 1409 CHLPVEVEYKAWWAIKKLN-----------MDLIKAGEKRYLXLNEMEE 1446 >UniRef50_C4KRZ4 Integrase core domain protein n=59 Tax=Proteobacteria RepID=C4KRZ4_BURPS Length = 318 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 64/300 (21%), Positives = 102/300 (34%), Gaps = 31/300 (10%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 + R + ++ + C GIS + + Sbjct: 29 KVASPQAKREAVRILMTERTMGVTRACGLVGISRSLLHY--------------------E 68 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS 127 + +T + + R+G R+I L+ G + L ++ GL Sbjct: 69 SRRRVDDEALTGRMMAIAAQKRRYGYRRIHVLLQRDG-CFANHKRIWRLYSKAGLSVRKR 127 Query: 128 PGI-----PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDE 182 T PN+ W MDF G R L ++DD++R L + T Sbjct: 128 RRKRIAAVERTPLPLPTGPNQSWSMDFVSDGLAYGRRFRCLNVVDDYTRECLAIEVDTSL 187 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQ 242 VQQ L + E GLP +T+DNG + L+ W G+ + RP P Sbjct: 188 PGLRVQQVLARLKEMRGLPASITVDNGPEFAG-----KVLDAWAYEAGVTLSFIRPGKPV 242 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 +E F+ + E L WF ++ + WR YN ERPH +L P + Sbjct: 243 ENAYIESFNGRFRDECLNEHWFVSMRHAKQLIEEWRIEYNTERPHSSLGYLTPAQFARAH 302 >UniRef50_B8FAB2 Integrase catalytic region n=70 Tax=Bacteria RepID=B8FAB2_DESAA Length = 327 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 63/323 (19%), Positives = 107/323 (33%), Gaps = 39/323 (12%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R + V S + +I C +S ++ Y + +P H Sbjct: 6 RKQAV-EPSSEELSITRQCELLSMSRSSYYY-------------RPKPVSDH------DL 45 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR 135 ++ L+ + + WG+R ++ +L G+ + V LM G+ + Sbjct: 46 ELMRLIDEQYLKQPTWGSRSMRNFLRGLGYKI-NRKKVRRLMRIMGICAVYPKPRTSLPH 104 Query: 136 FEH------------DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 H D N++W D + P G + ++D HSR L Sbjct: 105 PGHKVYPYLLKGVSIDRANQVWSSDIT-YIPMRKGFMYLCAVIDWHSRKVLSWRLSNTMD 163 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 + +RYG P+ D G + L GIR+ Sbjct: 164 ADFCVDAAAEAIDRYGPPEIFNTDQGVQFTSAD-----FTGLLKGHGIRISMDGKGRCLD 218 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +ER +LK + + F D +L++ W YN ERPH++LD P Y + Sbjct: 219 NIFVERLWWTLKYHYVYLRDFEDGVQLRKGLAGWFDFYNRERPHQSLDGKTPNEAYFNVS 278 Query: 304 RQYSGNTTPPEYDEGVMVRKVDI 326 S P + + R V Sbjct: 279 GPISWVREPMKPVRSMEQRGVTA 301 >UniRef50_A5BY78 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BY78_VITVI Length = 1947 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 48/292 (16%), Positives = 104/292 (35%), Gaps = 35/292 (11%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H + ++K + G P +++ + Sbjct: 1283 EQEKHGILSHCHGNACGGHFASQKTAMRVLQSGFWWPXLFKDAXEVSKGCDKCQRLGKLS 1342 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +DF G FP G + L +D S++ + T++ + Sbjct: 1343 RRNMMPLNPILIVDLFYVWGIDFMGPFPMSFGHSYILVGVDYVSKWVEXIPCRTNDHKVV 1402 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+ +P + D G+ + + E L + GI+ + PYHP T G+ Sbjct: 1403 LKFLKENIFSRFXVPKAIIXDXGTHFCN-----KPFEALLAKYGIKHKVATPYHPXTSGQ 1457 Query: 247 LERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E +R +K +++ S +L + +R Y L M+ Y+ Sbjct: 1458 VELANREIKNILMKVXNTNRKDWSXKLLDSLWAYRXAYKTI-----LGMS----XYRLVY 1508 Query: 304 RQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + E+ ++K++ + KA + L E++E Sbjct: 1509 GKACHLPVEIEFKAWWAIKKLN-----------MDLTKAGLKRSLDLNELEE 1549 >UniRef50_C3LLF8 IS1627, transposase n=27 Tax=Bacillaceae RepID=C3LLF8_BACAC Length = 274 Score = 219 bits (558), Expect = 2e-55, Method: Composition-based stats. Identities = 58/296 (19%), Positives = 104/296 (35%), Gaps = 40/296 (13%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 +D +I+ +C GI +T Y+W + A L+ A+L + Sbjct: 2 KDEYSIKEICILIGIPRSTYYRWKNKEKDVKEAKLE-----------------QAILTIC 44 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP------------- 131 H R+G RK+ L+ + + P TV +M + L Sbjct: 45 MTNHFRYGHRKVTALLKRKYNYHPNRKTVQKIMQKKNLQCRVKRKRRTWINGESRIVVEN 104 Query: 132 -ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 F+ + PN W D + PFG + L+++D ++ + + V + Sbjct: 105 LLNRNFQANKPNEKWVTDIT-YLPFGTEMLYLLSIMDLYNNEIIAYEISNRQDVTLVLRT 163 Query: 191 LVSVFERYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 + + + D G+ + A + + GI SR + +E Sbjct: 164 VEKAIKLQQKTQIILHSDQGAVYTSY-----AFQTLSKKNGITTSMSRKGNCHDNAVIES 218 Query: 250 FHRSLKAEVLQ--GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 FH SLK+E+ K + L++ + YN ER E L+ P + A Sbjct: 219 FHSSLKSELFYSQEKQIHSTSTLKQLIHDYIEYYNTERIQEKLNYLSPIEYKKQVA 274 >UniRef50_A5B504 Putative uncharacterized protein n=11 Tax=Vitis vinifera RepID=A5B504_VITVI Length = 2320 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. Identities = 50/274 (18%), Positives = 100/274 (36%), Gaps = 32/274 (11%) Query: 75 DDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + + +L HD + + K + G P+ + M + Sbjct: 1384 QEQSGILSHCHDSACGGHFASXKTAMKVIQSGFWWPSLFKDAHXMCKG--CDRCQRLGKL 1441 Query: 133 TGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 T R +W +DF G FP G + L +D S++ + +++ + Sbjct: 1442 TRRNMMPLNPILIVDIFDVWGVDFMGPFPMSFGHSYILVGVDYVSKWVEAIPCRSNDHKV 1501 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ +F R+G+P + D G+ + + E L + ++ + PYHPQT G Sbjct: 1502 VLKFLKDHIFARFGVPKAIISDGGTHFCN-----KPFETLLAKXXVKHKVATPYHPQTSG 1556 Query: 246 KLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K ++ S +L + +RT Y L M+ Y+ Sbjct: 1557 QVELANREIKNILMKVVNVNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLV 1607 Query: 303 ARQYSGNTTPPEYDEGV----MVRKVDISGKLSV 332 + EY ++ +V +G + V Sbjct: 1608 YGKACHLPVEIEYKAWWTGPFIIHEVHPNGVVEV 1641 >UniRef50_D1VRH7 Integrase catalytic region n=1 Tax=Frankia sp. EuI1c RepID=D1VRH7_9ACTO Length = 410 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 98/394 (24%), Positives = 144/394 (36%), Gaps = 23/394 (5%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 MS + +G + RR+ +S YK R+ EG + R R P SP Sbjct: 1 MSKARLVITALVVEGQTAAQVARRYEVSRGWVYKLKARYDAEGEVAFEPRSRRPVSSPTA 60 Query: 73 SSDDITALLRMAHDRHE----RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 +S + L+ GA I L T + +T++ ++ R G + Sbjct: 61 TSVAMVDLVLRLRKELAEAGLDAGADTIGWHLAHHHDTTLSRATINRILNRAGAVTPEPA 120 Query: 129 GIPATG--RFEHDAPNRLWQMDFKGHFPFG-----GGRCHPLTLLDDHSRFSLCLAHCTD 181 P + RF+ D PN WQ DF + G LT LDDHSRF+L ++ Sbjct: 121 KRPRSSYIRFQADQPNECWQSDFTHYRLTRPNGKIGIDTEILTWLDDHSRFALRVSAHLK 180 Query: 182 ERRETVQQQLVSVFERYGLPDRMTMDNGSPWG------DTTGTWTALELWLMRLGIRVGH 235 V + +G P DNG + G T E L RLGI + Sbjct: 181 ITGRIVVASFRQAADLHGYPASTLTDNGMVYTVRLASAGVAGGRTGFEAELRRLGIVQKN 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEV-LQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMA 293 SRP HP T GK+ERF ++LK + Q A + LQ D + YN RPH +L Sbjct: 241 SRPNHPTTCGKVERFQQTLKKWLAAQPVQPASTYALQTLIDQFVETYNQHRPHRSLPGRC 300 Query: 294 VPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGV----SLSAGKAFRGERVG 349 P YQ + + + V VD +GKL+++ + G+ V Sbjct: 301 TPAVAYQARPKARPNTDRSADSHDRVRRDHVDANGKLTLRVNGRLHHIGIGRTHARTPVL 360 Query: 350 LKEMQEDGSYEVWWYSTKVGVIDLKKKSITMGKG 383 L + + + G Sbjct: 361 LLVHDLHARIIHATTGEIIRELTIDPTRDYQPTG 394 >UniRef50_A5AFC7 Putative uncharacterized protein n=8 Tax=Vitis vinifera RepID=A5AFC7_VITVI Length = 1717 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 94/258 (36%), Gaps = 24/258 (9%) Query: 73 SSDDITALLRMAHDR--HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + +L H+ + ++K + G P+ + M R + Sbjct: 1084 LEQEQQRILSHCHESTCGGHFASQKTTMKVLQSGFCWPSLFKDAHTMCRSYDKCQRLGKL 1143 Query: 131 PATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 + +W +DF FP G + L +D R+ ++ ++ R Sbjct: 1144 TRKNMMPLNPILIVDLFYVWGIDFMRPFPMSFGYSYILVGVDYVFRWVEAISCKCNDHRV 1203 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 ++ ++F R+G+P + D G+ + + E L + ++ + PYHPQT G Sbjct: 1204 VLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYEVKHKVATPYHPQTSG 1258 Query: 246 KLERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K +++ + +L+ + ++T Y L M+ Y+ Sbjct: 1259 QVELANREIKNILMKVVNTRRRYWFVKLRDSLWAYKTTYKTI-----LGMSP----YRLV 1309 Query: 303 ARQYSGNTTPPEYDEGVM 320 + +Y + Sbjct: 1310 YGKACHLPVEVQYKAWWV 1327 >UniRef50_B4RV10 Integrase, catalytic region n=16 Tax=Proteobacteria RepID=B4RV10_ALTMD Length = 267 Score = 217 bits (553), Expect = 6e-55, Method: Composition-based stats. Identities = 65/290 (22%), Positives = 103/290 (35%), Gaps = 30/290 (10%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + + E G +I C GIS +T Y+ P ++ Sbjct: 3 AEKRECASILVDAGLSIVKACLFVGISRSTFYR-------------------PERDWRKA 43 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + + D+ R G K + +G V+ + + GL Sbjct: 44 DAAVIDAINAVLDKSPRAGFWKCFGRMRFKGFPF-NHKRVYRVYCQMGLNLRRRTKRVLP 102 Query: 134 GRFEHD-----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R N W +DF + G R L ++D+ +R L + T V Sbjct: 103 KRIAQPLEVLEQANYQWALDFMHDTLYCGKRFRTLNVVDEGTRECLAIEVDTSLPAGRVV 162 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 + L + GLP ++ MDNG T T W I + + +P PQ G +E Sbjct: 163 RVLEQLKTERGLPKQLRMDNGPELISATLT-----DWCQNHNIELLYIQPGKPQQNGFVE 217 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 RF+ S + E L F + G+++ WR YN ER HE+L P + Sbjct: 218 RFNGSFRREFLDAYLFENIGQVREMSWFWRLDYNEERTHESLGNLPPAAY 267 >UniRef50_Q2YZQ9 Transposase n=3 Tax=Bacteria RepID=Q2YZQ9_9DELT Length = 282 Score = 217 bits (552), Expect = 7e-55, Method: Composition-based stats. Identities = 47/301 (15%), Positives = 94/301 (31%), Gaps = 39/301 (12%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL 80 + + ++ +C+ G+S + Y W G +R + + + Sbjct: 1 MENHRSEFAVKKMCQVLGVSRSGYYLW----------GKHNRSARQKQNER----LMVHI 46 Query: 81 LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFE--- 137 +G+ +I L+D G + + LM +G+ AT RF+ Sbjct: 47 REAYARGRGVYGSPRITAELKDNGIP-CGKNRIARLMKSNGIKAKTKRRFKATKRFKHDF 105 Query: 138 ------------HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 D N++W D G + +LD +R + + E Sbjct: 106 LVADNLLNQRFSADVANQIWVSDIT-FIWTREGWLYLAAILDIFNRKIVGWSMDNKLSHE 164 Query: 186 TVQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 + L + + D G+ + A + + G S + Sbjct: 165 VIADALHKAIRQRRPKPGVLFHSDRGTQYTSY-----AFRDLMEQYGFVQSMSSSGNCYD 219 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPS 302 +E F +LK E++ + + E + + YN R H AL+ P + + Sbjct: 220 NAVMESFFHTLKTELVYFEKYRTRQEARGGIFEYIEVFYNCVRRHSALNYCSPAEFERRA 279 Query: 303 A 303 Sbjct: 280 C 280 >UniRef50_A5CA05 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5CA05_VITVI Length = 1066 Score = 217 bits (552), Expect = 7e-55, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 99/280 (35%), Gaps = 48/280 (17%) Query: 120 HGLLPGASPGIPATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF 172 + T R +W ++F G FP G + L +D S++ Sbjct: 738 CKSCDRSQRLGKLTCRKMMPLNPILIVDLFYVWGINFMGPFPMPFGYSYILVGVDYVSKW 797 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 + ++ R ++ ++F R+G+P + D G+ + + E L + G++ Sbjct: 798 VEAIPCKHNDHRVVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVK 852 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQG----KWFADSGELQRAFDHWRTVYNLERPHE 288 + PYH QT G++E +R +K +++ + + L + +RT Y Sbjct: 853 HKVATPYHHQTSGQVELANREIKNILMKVVNTNRKYWSIK-LLDSLWAYRTTYKTI---- 907 Query: 289 ALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERV 348 L M+ Y+ + + EY +++++ + KA + Sbjct: 908 -LGMSP----YRLVYGKACHLSVELEYKAWWAIKQLN-----------MDLSKAGLKRFL 951 Query: 349 GLKEMQE-----------DGSYEVWWYSTKVGVIDLKKKS 377 L EM+E W+ + + +K Sbjct: 952 DLNEMEELRNDAYINSKIAKKKLKRWHDQLISCKEFRKGQ 991 >UniRef50_P25438 Insertion element IS476 uncharacterized 39.2 kDa protein n=21 Tax=Proteobacteria RepID=YI61_XANEU Length = 346 Score = 216 bits (551), Expect = 9e-55, Method: Composition-based stats. Identities = 61/311 (19%), Positives = 92/311 (29%), Gaps = 32/311 (10%) Query: 9 ARDTMSLRTEFVLFASQDG-ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 + E + + + R CR G+S Sbjct: 56 KTLAPQRKREAIRRMLEHTPLSERRACRLAGLSRDAF--------------------RHA 95 Query: 68 HSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL----- 122 P ++ ++A L H R+G R++ L + ++ L L Sbjct: 96 PVPTPATQALSARLVELAQTHRRFGYRRLHDLLRPE-FPSVNHKKIYRLYEEAELKVRKR 154 Query: 123 LPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDE 182 P PN W MDF R LT++DD +R S+ +A Sbjct: 155 RKAKRPVGERQKLLASSMPNDTWSMDFVFDALANARRIKCLTVVDDFTRESVDIAVDHGI 214 Query: 183 RRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQ 242 V + L G P + DNG + A W + GI P P Sbjct: 215 SGAYVVRLLDQAACFRGYPRAVRTDNGPEFTSR-----AFIAWTQQHGIEHILIEPGAPT 269 Query: 243 TQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 +E F+ + E L WF + + WR YN RPH + P Sbjct: 270 QNAYIESFNGKFRDECLNEHWFTSLAQARDVIADWRRHYNQIRPHSSCGRIPPAQFAANY 329 Query: 303 ARQYSGNTTPP 313 Q + N P Sbjct: 330 RTQQANNAVPF 340 >UniRef50_A7B7Y8 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A7B7Y8_RUMGN Length = 417 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 72/357 (20%), Positives = 134/357 (37%), Gaps = 22/357 (6%) Query: 35 RRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGAR 94 + +PAT KW + G GL + R + +++ +R + R A Sbjct: 52 KLHHYAPATIEKWYLDYQNHGFEGLVPKGRSDAGMSRKLDEELQERIRYFKTNYPRMSAA 111 Query: 95 KIKRWLEDQGHTM---PAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMD-FK 150 I R L+ G + + STV + R +P R+E N +W D Sbjct: 112 AIYRQLKSDGSVINGQVSESTVSRFVKRLQSELRQTPN-KDMRRYERPHINEVWCGDSSV 170 Query: 151 GHFPFG----GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTM 206 G R + + L+DD SRF + ++ + + S +YG P Sbjct: 171 GPRLTDSDGKKHRIYIIALIDDASRFITGIDVFYNDNFINLMSVMRSAIAKYGRPKVFNF 230 Query: 207 DNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL---QGKW 263 DNG + + +EL R+G + + +PY P + K+ER+ R++K + + + Sbjct: 231 DNGKSYKN-----KQMELLAARIGTTLSYCQPYTPTGKAKIERWFRTMKDQWMAALDMRD 285 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMV-- 321 F EL+ + + YN PH +L P R+ Q + + + ++ Sbjct: 286 FHSLEELRGSLHAFVQRYNQS-PHSSLHGLSPQDRFFSEPEQIR-RLSEEDITQNFLLEI 343 Query: 322 -RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKS 377 R+V + + + F +R+ L+ + + + I L K+ Sbjct: 344 ERRVSADSVIVIDQIEYEVDYRFARQRIRLRYSPDMKEIFIVESDGTLTPIRLLNKT 400 >UniRef50_A5AHS1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5AHS1_VITVI Length = 1410 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 46/281 (16%), Positives = 100/281 (35%), Gaps = 43/281 (15%) Query: 73 SSDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 ++ +L H+ + ++K + G P+ + M + Sbjct: 734 LEEEQHGILNHCHENACGGHFASQKTAMRVLQLGFCWPSLFKDAHTM----------DMM 783 Query: 131 PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 P D +W +DF G FP G + L +D S++ + + ++ Sbjct: 784 PLNPILIVD-LFYVWGIDFMGPFPMSLGYSYILVGVDYVSKWVKAVPCKHNNHIVVLKFL 842 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G+++ Sbjct: 843 KQNIFSRFGVPKTIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVKLA 897 Query: 251 HRSLKAEVLQGKWFADSGE----LQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 +R +K +++ + + L + + T Y L M+ Y+ + Sbjct: 898 NREIKNILMKMVN-TNREDWSVKLLDSLWAYITAYKTI-----LGMSP----YRIVYGKA 947 Query: 307 SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGER 347 EY ++K++ + +A Sbjct: 948 CLLPVEVEYKAWWTIKKLN-----------MDLSRAGSKRP 977 >UniRef50_A5C995 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C995_VITVI Length = 1549 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. Identities = 47/276 (17%), Positives = 97/276 (35%), Gaps = 45/276 (16%) Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PN 142 + ++K + G P+ ++M + T R + Sbjct: 1259 HFASQKTAMKVLQSGFCWPSLFKDAHIMCK--SCDRCQRLGKLTKRNQMPMNPILIVDLF 1316 Query: 143 RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPD 202 +W +DF FP G + L +D S++ + + ++ R + ++F R+G+P Sbjct: 1317 DVWGIDFMRPFPMSFGNSYILVEVDYVSKWVEAIPYKHNDHRVVFKFLKENIFSRFGVPK 1376 Query: 203 RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK 262 + D G+ + + + PYHPQT G++E ++ +K +++ Sbjct: 1377 AIINDGGAHFCNRLFE-------------THKVATPYHPQTFGQVELANKEIKNILMKVV 1423 Query: 263 WFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGV 319 + +L + +RT Y L M+ Y+ + EY Sbjct: 1424 ITSRRDWSIKLHDSLWAYRTAYKTI-----LSMSP----YRLVYGKACHLLVEVEYKAWW 1474 Query: 320 MVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K+ + KA + L EM+E Sbjct: 1475 AIKKLS-----------MDLIKAGATRCLDLNEMEE 1499 >UniRef50_A3J543 Helix-turn-helix, Fis-type protein n=6 Tax=Bacteria RepID=A3J543_9FLAO Length = 336 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. Identities = 72/298 (24%), Positives = 137/298 (45%), Gaps = 14/298 (4%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R T+S + E + ++ + R GI+ + Y W +++ G GL R + Sbjct: 2 RLTVSEKQEIIHMVTRSEIGVNRTLREIGINKSMFYNWYHAYSENGVEGLLPTKRASNRQ 61 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG 129 N + L+ + +R++ + D+ + S+V+ ++ GL+ + Sbjct: 62 WNSIPQEQKNLVVKLALEYPDLSSRELAYKVTDEQQIFLSESSVYRILKSRGLITAPAHI 121 Query: 130 IPATGRFEHDA---PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + G D +++WQ DF G G + T+LDD+SR+ + C++ + + Sbjct: 122 FLSAGNEFTDKTSFVHQMWQTDFTYFKILGWGWYYLSTVLDDYSRYIVHWELCSNMKADD 181 Query: 187 VQQQLVSVFERYGL----PDRMTMDNGSPWGDTTGTWTALELWLMR-LGIRVGHSRPYHP 241 V++ + S ++ L ++ D GS + + L+ +L ++ H RP HP Sbjct: 182 VKRTVDSAIKKAKLVTKQKPKLLSDKGSCYI-----ASELKTYLKDNYQMQQVHGRPNHP 236 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 QTQGK+ER+HR++K V++ + EL+ A + + YN ER HE+L+ P Y Sbjct: 237 QTQGKIERYHRTIKN-VVKLDNYFAPEELEAALEKFVYRYNNERYHESLNNLTPADVY 293 >UniRef50_D1YV08 Putative transposase orfB for insertion sequence element n=2 Tax=Methanocella paludicola SANAE RepID=D1YV08_METPS Length = 287 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. Identities = 52/293 (17%), Positives = 89/293 (30%), Gaps = 39/293 (13%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 + + G+S + Y+WL P S IT ++ Sbjct: 6 CRGSLPVSRAVELMGVSRSGYYRWLH--------------TRNMIQPTESDLLITEEIQR 51 Query: 84 AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP-- 141 + R+G R+ L ++G + + V+ LM ++ LL T HD P Sbjct: 52 IALDYPRYGYRRAWVELGNRGFIV-SRKKVYMLMRQYNLLCVRRRYRVCTTDSNHDKPVY 110 Query: 142 ------------NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 N+LW D + ++D +SR + Sbjct: 111 ENLAGGMRVSGINQLWVADITYIQLSRE-FVYLAVVIDVYSRRCVGWQLSHSIDTRLTLG 169 Query: 190 QLVSVFERYGL---PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 L + E D G + L+ GIRV SR +P Sbjct: 170 ALHNALETRRTELGGLVHHSDQGVQYAS-----KEYVECLLEHGIRVSMSRRGNPYDNAF 224 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV-YNLERPHEALDMAVPGSR 298 E F ++LK E + + + + + YN +R H ++ P Sbjct: 225 AESFMKTLKYEEVYLNEYETFKDAMENIERFIDEVYNQKRLHSSIGYQSPIEY 277 >UniRef50_A5AWI1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5AWI1_VITVI Length = 905 Score = 214 bits (545), Expect = 5e-54, Method: Composition-based stats. Identities = 53/333 (15%), Positives = 117/333 (35%), Gaps = 60/333 (18%) Query: 73 SSDDITALLRMAHDR--HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + +L H + + ++K M R G L + + Sbjct: 556 LEQEQQEILSHCHKSACRDHFASQKTA-------------------MKRLGKLTHTNM-M 595 Query: 131 PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 P D +W +DF FP G + L ++D S++ + ++ + ++ Sbjct: 596 PLNPVLIVD-LFYVWGIDFMRPFPMSFGYSYILVVVDYVSKWVEAIPCKRNDHKVVLKFL 654 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 ++F R+G+P + D G+ + + + E L + G++ + PYHPQT ++E Sbjct: 655 KENIFSRFGVPRAIISDGGTHFCN-----KSFETLLAKYGVKHKVATPYHPQTSRQVELA 709 Query: 251 HRSLKAEVLQGK---WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYS 307 ++ +K +++ +L + ++T Y L M+ Y + Sbjct: 710 NQEIKNILIKVVNTSRRDWFVKLHDSLWAYKTAYKTI-----LGMSP----YCLVYGKAC 760 Query: 308 GNTTPPEYDEGVMV-------------RKVDISGKLSVKGVSL-------SAGKAFRGER 347 +Y + R +D++ ++ + K + + Sbjct: 761 HLPIEVQYKVWWAIKMLNMDLNRADMKRFLDLNEMEELRNDAYNNSNIAKQILKRWHDQL 820 Query: 348 VGLKEMQEDGSYEVWWYSTKVGVIDLKKKSITM 380 V LKE Q+ ++ + LK + I + Sbjct: 821 VSLKEFQKGQRVLLYDSKLHIFPRKLKSRWIGL 853 >UniRef50_P24577 Insertion element IS407 uncharacterized 31.7 kDa protein n=164 Tax=Proteobacteria RepID=YI71_BURM1 Length = 277 Score = 213 bits (544), Expect = 7e-54, Method: Composition-based stats. Identities = 57/289 (19%), Positives = 96/289 (33%), Gaps = 32/289 (11%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 + R CR G+S + + P+ ++ + A L Sbjct: 9 NISERRACRLVGLSRSVLHY--------------------DAKPDHENEVLAARLVELAH 48 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP-----ATGRFEHDAP 141 R+G R++ +E +G T ++ L GL AP Sbjct: 49 ERRRFGYRRLHALVEREG-THANHKRIYRLYREAGLAVRRRRKRQGVMIEREQLALPGAP 107 Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 N +W +DF G R LT++DD ++ ++ + V + L G P Sbjct: 108 NEVWSIDFVMDALSNGRRVKCLTVVDDFTKEAVDIVVDHGISGLYVARALDRAARFRGYP 167 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G + AL+ W G+ + + P +E F+ + E L Sbjct: 168 KAVRTDQGPEFTSR-----ALDQWAYANGVTLKLIQAGKPTQNAYIESFNGKFRDECLNE 222 Query: 262 KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNT 310 WF + WR YN +RPH AL+ P + R + Sbjct: 223 HWFTTLAHARAVIAAWRQDYNEQRPHSALNYLAPSE-FAAKHRATADAP 270 >UniRef50_A4G2L6 Transposase IS3 family, part 2 n=17 Tax=Bacteria RepID=A4G2L6_HERAR Length = 288 Score = 213 bits (543), Expect = 9e-54, Method: Composition-based stats. Identities = 62/299 (20%), Positives = 95/299 (31%), Gaps = 40/299 (13%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 + + C G+S + Y WL R + + R HHS Sbjct: 9 HRGIWPVALTCDTLGVSRSGFYAWLTRTPCKRRTENEQLGRAVHHS-------------- 54 Query: 84 AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG-------------- 129 +GAR++ L G+ V LM L Sbjct: 55 FIQSDRTYGARRVWHDLLASGYR-CGLHRVERLMQAQALRARPRRRSLPIDRGERPVIGI 113 Query: 130 --IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 +F+ APNR W DF + G + +LD +SR + + + + V Sbjct: 114 AANVLDRQFDASAPNRKWVADFT-YIWSAEGWLYLAVVLDLYSRRVIGWSMKPEMNAQLV 172 Query: 188 QQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 L+ R G P+ D GS + + L+ LG+ SR + Sbjct: 173 ADALMMAVWRRGKPESVMHHSDRGSQYTSE-----QFQRLLLELGVTCSMSRAGNVWDNS 227 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSA 303 +E F SLK E L K F +++ + YN R H L P Q + Sbjct: 228 AMESFFSSLKTERLSRKMFRTRDDIRAEVFDYIERFYNPVRRHSTLGYISPIDFEQQAQ 286 >UniRef50_C8XFJ9 Integrase catalytic region n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XFJ9_NAKMY Length = 294 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. Identities = 57/300 (19%), Positives = 94/300 (31%), Gaps = 41/300 (13%) Query: 18 EFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI 77 +A + +C++ G+S + Y W R P + D+ Sbjct: 5 AIADWADAGEFPVEFMCQQLGVSRSGYYAWRTR---------------PVSHRKLTDIDL 49 Query: 78 TALLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--- 133 T+ +R H + G R+++ L +G + +H LM GL T Sbjct: 50 TSRIRRIHQQGRGNPGVRRVRAGLAAEGIR-CGLARIHRLMQAAGLQGRHPKAWRRTTIA 108 Query: 134 ------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 F AP++ W D + G + T++D HSR + A Sbjct: 109 GAKPVSAPDLIGRNFTAPAPDKAWCGDIT-YVKTWTGWAYVATVIDLHSRMVVGWAVADH 167 Query: 182 ERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 R V L +R P D G+ + + + IR R Sbjct: 168 MRTSLVLDALQMALDRRRPPAGVIFHSDRGTQYTSQ-----EFADFCRKNDIRRSLGRTG 222 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVY-NLERPHEALDMAVPGSR 298 E F + K E++ + + +L+ W Y N R H LD P Sbjct: 223 VCWDNAVAESFFATYKKELIHNRPWPTINQLKTETFSWIEAYHNRTRRHSTLDYLTPSEY 282 >UniRef50_C7MHU2 Integrase family protein n=3 Tax=Actinomycetales RepID=C7MHU2_BRAFD Length = 434 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. Identities = 95/418 (22%), Positives = 153/418 (36%), Gaps = 53/418 (12%) Query: 5 MPWDARDTMSLRTEFVLFASQD-GANIRSLCRRFGISPATGYKWLQRWAQEGAAG-LQDR 62 M + +R + + S C GIS T Y R +EG A L+ + Sbjct: 1 MSKNQPVDPRVRLAISRWPEDAPRGTVTSFCVEHGISRKTFYVLRARLREEGPAAVLEPK 60 Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHER----WGARKIKRWLEDQGHTMPAFSTVHNLMA 118 R P SP R +D+ E G + + G P+ + + + Sbjct: 61 SRRPSSSPTRIGEDVKDQAVAVRAALEASGLDHGPISVFDRMGAMGLESPSVAALARIFR 120 Query: 119 RHGLLPGASPGIPAT--GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 G+ P + RF + APN WQ+D G+ G C L DDHSR ++ Sbjct: 121 ERGVARADPKKKPRSAYRRFVYPAPNACWQLDATGYVLIDGRSCTIFQLQDDHSRLAVAS 180 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTW-TALELWLMRLGIRVGH 235 E + + RYG+P R+ DNG+ T W + L LG++ Sbjct: 181 LVAPAETTQAALDVFLKGVARYGVPQRLLTDNGAAMNPTRRGWPSPLVTHATGLGVQAIT 240 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL-DMAV 294 +P+ P TQGK ERFH++L + Q +LQ D++ +YN +R H+ L Sbjct: 241 GKPFKPTTQGKNERFHQTLFRWLDQQPLAETISQLQAMVDNFDIIYNQQRRHQGLPGRIT 300 Query: 295 PGSRY--------------------------------QPSARQYSGNTTPPEYDEGVMVR 322 P + +P A Y+ P + + G V Sbjct: 301 PQQAWDATPVAEAPKPPAAPIDVLLPAPLDEVLHPGEEPQALDYTAWGDPFDKEAGQRVL 360 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKSITM 380 + +G + ++ ++ K GE V V W +T + VID+ + + Sbjct: 361 RTGSNGSIVLRRITFYLSKRRAGEHV-----------RVIWDATGLVVIDVHGEVLIK 407 >UniRef50_A5AYT6 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5AYT6_VITVI Length = 1897 Score = 212 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 49/287 (17%), Positives = 103/287 (35%), Gaps = 59/287 (20%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++K + G P+ + Sbjct: 1399 EQEKHGILSHCHENACGGHFASQKTAMRVLQSGFWWPSLFKDAHE--------------- 1443 Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 +DF G FP G + L +D S++ + T++ + ++ Sbjct: 1444 --------------GIDFMGPFPMSFGHSYILVGVDYVSKWVEVIPCXTNDHKVVLKFLR 1489 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++F R+G+P + D G+ + + E L + G++ + PYHPQT G++E + Sbjct: 1490 ENIFSRFGVPKAIISDGGTHFCN-----KPFEALLAKYGVKHKVATPYHPQTSGQVELSN 1544 Query: 252 RSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSG 308 R +K +++ S +L + +RT Y L M+ Y+ + Sbjct: 1545 REIKNILMKVVNTNRKDWSVKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACH 1595 Query: 309 NTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 E+ ++K++ + KA + L E++E Sbjct: 1596 LPIEIEFKAWWAIKKLN-----------MDLTKAGLKRSLDLNELEE 1631 >UniRef50_A5B5S6 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5S6_VITVI Length = 1310 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 87/217 (40%), Gaps = 28/217 (12%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+G+ Sbjct: 1014 FDVWGIDFMGPFPMSFGNSYILVGVDYVSKWFEAIPCKHNDHRVVLKFLKENIFSRFGVS 1073 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYHPQ G++E +R +K +++ Sbjct: 1074 KAIISDGGTHFYN-----KPFETLLAKYGVKHKVATPYHPQIFGQVELANREIKNILIKV 1128 Query: 262 KWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 + +L + +RT Y L M+ Y+ + EY Sbjct: 1129 VNTSRRDWSVKLHDSLWAYRTAYKTI-----LGMSP----YRLVXGKACHLPVEVEYKXW 1179 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++KV+ + + + L EM+E Sbjct: 1180 WAIKKVN-----------MDLTRXXIKRCLDLNEMEE 1205 >UniRef50_B8FYC8 Transposase IS3/IS911 family protein n=6 Tax=Clostridiales RepID=B8FYC8_DESHD Length = 393 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 53/311 (17%), Positives = 104/311 (33%), Gaps = 41/311 (13%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 + + + AS + +C +S + Y W +R + Sbjct: 98 PEIIYQMIKKASSSSCPVEKMCETLEVSRSGYYDWDRREPS---------------KRQK 142 Query: 73 SSDDITALLRMAHDRHERW-GARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ I +++ +H + + G K+ +++ G + V+ L +HGL Sbjct: 143 ENETILKVMKESHTKAQAMIGLDKLWSDVKEAGFQ-CGRNRVYRLQKQHGLYSVRKKPYR 201 Query: 132 ----------------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 +F+ + PN++W D G + + + D + + Sbjct: 202 VCLTDSNHDLPKAPNLLNQKFKVEHPNKVWVTDITEFKTAKGSKLYLAAIKDLFHKEIVG 261 Query: 176 LAHCTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRV 233 A R E + L + +R+ D G + T L G+ Sbjct: 262 WALAEHMRTELCLEALRNAVKRHRPLKGLIHHSDQGRQYCS-----TVYVEELKHWGMIR 316 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDM 292 SR +P E F ++K+E L K + D E +R + YN +R H+AL Sbjct: 317 SMSRKGNPFDNACAESFFSTIKSERLHHKTYKDIEEARRDIFWYIECFYNRQRRHQALGN 376 Query: 293 AVPGSRYQPSA 303 P + + Sbjct: 377 LTPAAFLKKHC 387 >UniRef50_B3PDG9 IS3 family transposase, orfB n=3 Tax=Gammaproteobacteria RepID=B3PDG9_CELJU Length = 284 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 60/302 (19%), Positives = 101/302 (33%), Gaps = 40/302 (13%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 +R F+ ++ LCR +S + Y+WL + + P R Sbjct: 1 MRFAFIREH-ASRCRVKHLCRMLSVSRSRYYEWLGQQQDK-----------PDPEQQRLE 48 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT- 133 + AL + + G+R++ R L+ QG + V LM + GL+ T Sbjct: 49 TCMRAL---FVESNSSMGSRRMARRLQAQGFAAGRY-RVRRLMKKRGLVVKQKRKFRITT 104 Query: 134 --------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 +F PN+ W D + G + ++D +SR + Sbjct: 105 NSNHKLPVAENILDRQFNPVTPNQAWAADIT-YIWTVEGWLYLAVVIDLYSRRVVGWCMD 163 Query: 180 TDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 + + V + L+ D GS + + L + GI SR Sbjct: 164 KRQTKSLVIRALMMAVNMRKPSAGLIHHSDRGSQYASLK-----YQASLKQHGIVCSMSR 218 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPG 296 + +ERF SLK E ++ + + R + T YN RPH L P Sbjct: 219 KGNCWDNAVVERFFSSLKREWIRDNLYRYREDAIRDVRAYIVTWYNSRRPHSTLGYKSPI 278 Query: 297 SR 298 Sbjct: 279 EF 280 >UniRef50_C1F0V3 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F0V3_ACIC5 Length = 349 Score = 212 bits (539), Expect = 3e-53, Method: Composition-based stats. Identities = 54/311 (17%), Positives = 99/311 (31%), Gaps = 42/311 (13%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 Q +I +C G+S A Y++L+ P+ ++ + ++ Sbjct: 3 LQGSLSIERMCLLAGVSRAGFYRFLK-----------------AQVPSEEETEVRSAIQQ 45 Query: 84 AHDRHER-WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------- 133 +H R +G R++ L+ +G + V +M LL T Sbjct: 46 VALQHRRRYGYRRVTAELKRRGMKV-NHKRVARIMREDNLLALQPKEFATTTDSNEPLEV 104 Query: 134 -----GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R + + ++LW D + + +LD +SR + Sbjct: 105 YLNLSRRMQLNWVDQLWVADIT-YIRVQTEFVYLAVILDGYSRKVVGWKLDRSLTSRLAV 163 Query: 189 QQLVSVFE-RYGLPDRMT-MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 L + R P + D G + L G+ SRP +P Sbjct: 164 NALDGAIKLRRPRPGVVHHSDRGVQYTSP-----EYVAILKLHGMVQSMSRPANPYDNAS 218 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSARQ 305 E F ++LK E + + D +L+ + + YN +R H AL P + + Sbjct: 219 CESFIKTLKREEIYANKYRDLQDLRSHIEEFIDGYYNQKRLHSALGYRTPEEFEAQTHGK 278 Query: 306 YSGNTTPPEYD 316 P Sbjct: 279 TQAELYAPTLR 289 >UniRef50_A5BN44 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BN44_VITVI Length = 972 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 43/289 (14%), Positives = 95/289 (32%), Gaps = 53/289 (18%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++K + G P+ + M + + Sbjct: 424 EKEQQRILSHCHENAYGGHFASQKTTMRVLQSGFCWPSLFKYAHTMCKSCDRCQRLGKLI 483 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + +W +DF G FP + L +D S++ + ++ R Sbjct: 484 RRNMMPLNPILIVDLFDVWGIDFMGPFPMSFDYSYILVGVDYVSKWVEAIPCKHNDHRMV 543 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++F R+G+P + D G+ + + E L + ++ + PYHPQT G+ Sbjct: 544 LKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYRVKHKVATPYHPQTSGQ 598 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 +E ++ +K ++ L M+ Y + Sbjct: 599 VELANKEIKNILMNVTI--------------------------LGMSP----YHLVYGKT 628 Query: 307 SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 EY ++K++ + + + L EM+E Sbjct: 629 CHLPMELEYKAWWAIKKLN-----------MDLSRDGLKRFLDLNEMEE 666 >UniRef50_Q7M7E8 Transposase and inactivated derivative n=14 Tax=Proteobacteria RepID=Q7M7E8_VIBVY Length = 283 Score = 211 bits (538), Expect = 4e-53, Method: Composition-based stats. Identities = 56/302 (18%), Positives = 97/302 (32%), Gaps = 40/302 (13%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 ++ EF+ + +I +CR +S YKW E R + Sbjct: 1 MKYEFIESYT-GEYSISLMCRTLEVSRGGYYKWCHHTQSE-------RSKRRERFEQLV- 51 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP--- 131 + R+G+ +I L + GH + V ++M G+ G Sbjct: 52 ------MCTFAQYRARYGSVRIAEELNEAGHA-CCVNYVADIMKEKGIRARNGKGFKYSK 104 Query: 132 ------------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 FE + PN+ W D + T++D HSR + + Sbjct: 105 DVAAMTNVADNLLRRDFESEMPNQKWVTDIT-YIWVKSRWLFLATVMDLHSRRIVGWSLG 163 Query: 180 TDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 T E + + L FE P + D G + + ++ + G SR Sbjct: 164 TTMTVELITKALKMAFESRKPPKGVIIHSDRGVQYRAYK-----YQDFMRKHGGVPSMSR 218 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPG 296 + +E F+ LK E++ + + E + + YN R H AL P Sbjct: 219 QGNCWDNAVMESFYSRLKVELIYAEDYQTVEEARMGIFEYIEVFYNRRRRHSALGHVSPV 278 Query: 297 SR 298 Sbjct: 279 EY 280 >UniRef50_A3XG72 Transposase-like n=3 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XG72_9FLAO Length = 284 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 49/290 (16%), Positives = 90/290 (31%), Gaps = 44/290 (15%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 + +C +S + Y WL+ P + + +++L++ + Sbjct: 2 KFPVEKMCTMLAVSKSGYYHWLK--------------SGPSTLW-KENQKLSSLIKDIFE 46 Query: 87 -RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR---------- 135 H+ +GA +IK L+ G + + V +M + L AT Sbjct: 47 DSHQSYGAPRIKAELKALGFKV-SKPRVARIMKANYLYAKRKRKFKATTDSNHKYPIAPN 105 Query: 136 -----FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE-TVQQ 189 F N++W D + G + ++D +R + A E TV + Sbjct: 106 LLNQCFNVARANQVWVSDIT-YVQTNQGWSYLTVIIDLFNRKVIGWALSDTLNTEDTVIK 164 Query: 190 QLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLG--IRVGHSRPYHPQTQG 245 + L P D G + L ++ SR + Sbjct: 165 AWQMAIKNTTLTQPLIFHSDQGIQYASQR-----FTNLLKSYNDLVKQSMSRKGNCWDNA 219 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAV 294 E F +SLK E + + + + + W YN R H L Sbjct: 220 VAESFFKSLKVEWVYWHKYKLKSQAELSIFQWIETWYNTRRRHSYLGNRT 269 >UniRef50_A5B346 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B346_VITVI Length = 916 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 87/217 (40%), Gaps = 28/217 (12%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ + ++ R ++ ++F R+ +P Sbjct: 176 FNVWGIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLNENIFSRFRVP 235 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ + PYHPQT G++E + +K +++ Sbjct: 236 KVIISDRGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELANTEIKNILMKV 290 Query: 262 KWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 + +L + +RT Y L M+ Y+ + EY Sbjct: 291 VNTSRRDWYVKLHDSLWAYRTTYKTI-----LGMSP----YRLVYGKTCHLPVEVEYKAW 341 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++KV+ + +A + L EM E Sbjct: 342 WAIKKVN-----------MDLNRAGMKRCLDLNEMDE 367 >UniRef50_A5CBG2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5CBG2_VITVI Length = 1297 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 47/324 (14%), Positives = 98/324 (30%), Gaps = 62/324 (19%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH-----ERWGARKIKR 98 Y + + G + + + H + L + + H + ++K Sbjct: 1003 WYAHIANYLVIGEVPREWKAQDRKHFFAKIHAYYWEXLFLCNHCHENTCGGHFTSQKTTM 1062 Query: 99 WLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKG 151 + G P+ + M R T R + +W +D G Sbjct: 1063 KVLQSGFNWPSLFKDAHTMCR--SCDRCQRLGKLTRRNQMPMNPILIVDLFNVWGIDIMG 1120 Query: 152 HFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSP 211 FP G + L +D S++ + ++ R ++ ++F R+G+P + D G+ Sbjct: 1121 PFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFGVPKAIISDGGTH 1180 Query: 212 WGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQ 271 + E L + G++ + S +L Sbjct: 1181 FCS-----KPFETPLAKYGVKHKV-----------------------VNTSRRDWSVKLH 1212 Query: 272 RAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLS 331 + +RT Y L M+ Y+ + EY +++V+ Sbjct: 1213 DSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWWAIKRVN------ 1257 Query: 332 VKGVSLSAGKAFRGERVGLKEMQE 355 + +A + L EM+E Sbjct: 1258 -----MDLNRARMKRCLDLNEMKE 1276 >UniRef50_A5C6P4 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5C6P4_VITVI Length = 1866 Score = 211 bits (537), Expect = 5e-53, Method: Composition-based stats. Identities = 55/324 (16%), Positives = 108/324 (33%), Gaps = 57/324 (17%) Query: 42 ATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH--ERWGARKIKRW 99 A+ Y + + G Q + + +L H+R R+ KI Sbjct: 1569 ASWYAHIANYLVTGEVRNQIIRKCVPKQ------EQQRILNHCHERACGGRFPYHKIVMK 1622 Query: 100 LEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP-----NRLWQMDFKGHFP 154 + G + P+ + M R + + + +W +DF G FP Sbjct: 1623 VLQSGFSWPSLFKDGHTMCRSCDRCQRLGKLTRRNQIPMNPILIVDLFDVWVIDFIGPFP 1682 Query: 155 FGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD 214 G + L S++ + ++ R ++ ++F R+G Sbjct: 1683 ISFGNSYILVGAGYVSKWVEAIPCKHNDHRVVLKFLKENIFSRFG--------------- 1727 Query: 215 TTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG---ELQ 271 E L + G++ + PYHPQT ++E +R +K +++ + +L Sbjct: 1728 ------PFETLLAKYGVKHKVATPYHPQTSRQVELANREIKNILMKVVNTSRRDWSVKLH 1781 Query: 272 RAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLS 331 + +RT Y LDM+ Y+ + EY ++KV+ Sbjct: 1782 DSLWAYRTAYKTI-----LDMSP----YRLVYGKACHLLVEVEYKAWWAIKKVN------ 1826 Query: 332 VKGVSLSAGKAFRGERVGLKEMQE 355 + + + L EM+E Sbjct: 1827 -----MDLNRVRMKRCLDLNEMEE 1845 >UniRef50_B3PKR5 Transposase n=7 Tax=Gammaproteobacteria RepID=B3PKR5_CELJU Length = 280 Score = 210 bits (536), Expect = 5e-53, Method: Composition-based stats. Identities = 63/290 (21%), Positives = 102/290 (35%), Gaps = 30/290 (10%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + + + G +I S C+ +S A+ Y+ P + Sbjct: 3 AEKKTCAQALVEHGIDIASACKLADLSRASYYR-------------------PERDWRKC 43 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + + R + G K + +G+ V+ + + GL Sbjct: 44 DAAVIDAINNELKRSPQAGFWKCYGRIRHKGYPF-NHKRVYRVYCQMGLNLKRRVKRVLP 102 Query: 134 GRFEHD-----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 R N W +DF + G R L +LD+ +R L + T E V Sbjct: 103 RRIVQPLAVVAQANHQWALDFMHDSLYCGKRFRTLNVLDEGTRECLAIEVDTSLPAERVV 162 Query: 189 QQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 + L + GLP ++ +DNG T W GIR+ + P PQ G +E Sbjct: 163 RALEQIKVERGLPTQLRVDNGPELISARLT-----DWCEENGIRLVYIEPGKPQQNGFVE 217 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 RF+ S + E L F +++ WR YN ER HE+L P + Sbjct: 218 RFNGSFRREFLNAYLFESLTQVREMAWFWRMDYNEERTHESLGHLPPAAY 267 >UniRef50_A5BPP5 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BPP5_VITVI Length = 1583 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 62/312 (19%), Positives = 115/312 (36%), Gaps = 39/312 (12%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G T P+ +++ R Sbjct: 1173 EEEQQGILNXCHENACGGHFASQKXAMKVXXSGFTWPSLFKDAHIICR--SCDRCQRLGK 1230 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W + F G FP G + L +D S++ + ++ R Sbjct: 1231 LTKRNQMPMNPILIVELFDVWGIXFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHR 1290 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 + ++F R+G+P + D G+ + + E L + G++ + PYHPQT Sbjct: 1291 VVLXFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTS 1345 Query: 245 GKLERFHRSLKA---EVLQGKWFADSGEL---QRAFDHWRTV------YNLERPHEALDM 292 G++E +R +K E K F WR + + H L+ Sbjct: 1346 GQVELANREIKNILMESGNLKCFNSPEPTLGKASDLRPWRFTSLSLASFWEVKDH--LEW 1403 Query: 293 AVPGSRYQPSARQYSGNTT---PPEYDEG------VMVRKVDISGKLSVKGVSLSAGKAF 343 V G RY+P P Y+E + ++ KL VK +++ +A Sbjct: 1404 QVLGERYEPLQGASEKKQVTGTPFLYEEYEPSDLKLQETFFFLNTKLGVKKLNMDLIRAG 1463 Query: 344 RGERVGLKEMQE 355 + L EM+E Sbjct: 1464 AKRCLDLNEMEE 1475 >UniRef50_Q1DAH7 Transposase orfB, IS3 family n=29 Tax=Proteobacteria RepID=Q1DAH7_MYXXD Length = 293 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 64/299 (21%), Positives = 99/299 (33%), Gaps = 31/299 (10%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R V +A G + R C ++ ++ Sbjct: 7 RRRQVRYAMGKGVSQRRACALLQVAGSSL--------------------GYASRKEAKDA 46 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA--- 132 + A LR R+G R+ L +G + VH L + GL Sbjct: 47 ALVAQLRDIARARPRFGYRRAWALLRREGPAV-NVKRVHRLWRKEGLALSRRRPRKRLRL 105 Query: 133 --TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 + + + N +W DF G + LT++D+HSR L + V + Sbjct: 106 GQQRQPKPEGVNSVWAWDFVHDRCANGQKLKCLTVVDEHSRECLAIDVAGRISARRVIEV 165 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 L + +G P + DNG + AL WL GI+ + P P G E F Sbjct: 166 LSRLVAVHGPPKYLRSDNGPEFI-----AKALRRWLEANGIQTAYIAPGKPWQNGTNESF 220 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGN 309 + + E L +WF+ E + WR YN +RPH +L P A Sbjct: 221 NGRFRDECLSAEWFSTRREAVVLIEAWRRDYNEKRPHSSLGYKTPAEVGARRAHAGPVA 279 >UniRef50_B4RA95 Transposase, IS1477 n=35 Tax=Proteobacteria RepID=B4RA95_PHEZH Length = 361 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 70/304 (23%), Positives = 110/304 (36%), Gaps = 34/304 (11%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 + R + ++ G + R C + P T + Sbjct: 64 PAARREAVLRLMAERGFSQRRACGLVQVDPKTVRR----------------------VAQ 101 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI- 130 ++ A LR R+G R++ LE +G +M + L GL G Sbjct: 102 PGDAEVRARLRGLAAERRRFGYRRLGILLEREGVSM-NKKKLFRLYREEGLAVRRRRGRK 160 Query: 131 ----PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 D PN+ W +DF G R L ++DD +R +L L T Sbjct: 161 RATGTRAPMALPDGPNQRWSLDFVADTLSWGRRFRILCIVDDFTREALALVVDTSIGGHR 220 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 + ++L ++ R G P + DNG+ A+ W R G+ + P PQ G Sbjct: 221 MARELDALIARRGRPATIVSDNGTEMTSR-----AMLEWTNRTGVDWHYIAPGKPQQNGF 275 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP-GSRYQPSARQ 305 +E F+ L+ E L + FA+ E + + WR YN RPH A P R P+A + Sbjct: 276 VESFNGKLRDECLNEEVFANLAEARAVIERWRLDYNHVRPHSAHGGLTPEAVRLNPAAGR 335 Query: 306 YSGN 309 Sbjct: 336 LRNL 339 >UniRef50_A9VK05 Integrase catalytic region n=17 Tax=Bacillaceae RepID=A9VK05_BACWK Length = 293 Score = 210 bits (535), Expect = 8e-53, Method: Composition-based stats. Identities = 52/305 (17%), Positives = 96/305 (31%), Gaps = 35/305 (11%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + E + ++ G + LC G++ + YKWL+R Sbjct: 8 KKFEVIHEMTKTGYTVTILCDIAGVTRSGYYKWLKRHTT-------------PSKKQSED 54 Query: 75 DDITALLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP-- 131 +I + H + +G R+I+ WL+ + + LM+ G+ P Sbjct: 55 IEIKKKILECHKKLRGIYGYRRIQVWLKATYNLHLNHKHIQRLMSELGIKAVIRKKRPYY 114 Query: 132 ------------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 F+ PN W D F G R + + D ++ + Sbjct: 115 GKKEAYVISENHLNREFQASKPNEKWVTDITYLI-FNGQRLYLSAIKDLYNNEIVAYETS 173 Query: 180 TDERRETVQQQLVSVFERYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 + V L ++ + + D GS + L + ++ SR Sbjct: 174 RRNDLKLVLDTLKKAKKKRNVKGILLHSDQGSQYTSR-----QYNQLLKKYQMKASMSRR 228 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 + +E F KAE F + E++ A + YN +R + L+ P Sbjct: 229 GNCWDNACMENFFSHFKAECFHLYSFRKANEVKLAVRKYMHFYNHQRFQKKLNNLSPYKY 288 Query: 299 YQPSA 303 A Sbjct: 289 RTQVA 293 >UniRef50_B0TDR5 Transposase, putative n=5 Tax=Firmicutes RepID=B0TDR5_HELMI Length = 451 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. Identities = 83/375 (22%), Positives = 142/375 (37%), Gaps = 30/375 (8%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSL-------CRRFGISPATGYKWLQRWAQEGAAGLQ 60 A D S R + + +G + C + GIS T ++L ++ ++G +GL+ Sbjct: 6 KAEDIASQRVQLLSPLLAEGLDAARARLMKQQICEQAGISERTLRRYLSQYREKGFSGLK 65 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPA---FSTVHNLM 117 + + S + + R +I + LE +G P ST+ + Sbjct: 66 PKGKGRSRSEEAIPHALLEEAILLRREVPRRSIAQIIQILEWEGKAEPGKLKRSTLQEKL 125 Query: 118 ARHGLLPGASPGIPAT----GRFEHDAPNRLWQMDFKG--HFPFG----GGRCHPLTLLD 167 A G T RF+ N+LW D K + P G + + +T D Sbjct: 126 AERGYSTRHMQMYANTGVAARRFQQKHRNQLWHSDIKYGPYLPIGPDGAKKQVYLVTFFD 185 Query: 168 DHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLM 227 D +RF L + V+ +YG P+ + DNG + + Sbjct: 186 DATRFVLHGQFYPTLDQVIVEDCFRQAILKYGAPEAVFFDNGKQYR-----TKWMHRACA 240 Query: 228 RLGIRVGHSRPYHPQTQGKLERFHRSLKA--EVLQGKWFADSGELQRAFDHWRTVYNLER 285 ++GIR+ ++PY P++ GK+ERF+R++ A + + L + F W + Sbjct: 241 KMGIRLLFAKPYSPESTGKVERFNRTVDAFLQEAALEKPHTLDRLNQLFWVWLDECYQNK 300 Query: 286 PHEAL-DMAVPGSRYQPSARQYSGNTTPPEYDE--GVMVRKVDISGKLSVKGVSLSAGKA 342 PH AL P + Y+ + + RKVD SG +S +G G + Sbjct: 301 PHSALAGNVSPDTAYRSDKKAVKFLDPDVVANAFLHCESRKVDKSGCISFEGRKYEVGLS 360 Query: 343 FRGERVGLKEMQEDG 357 F G V + D Sbjct: 361 FIGCTVDVIYDPADI 375 >UniRef50_B7VPV6 Transposase (OrfB) of insertion sequence ISVisp1 ; IS3 family subgroup IS3 n=10 Tax=Gammaproteobacteria RepID=B7VPV6_VIBSL Length = 295 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. Identities = 53/303 (17%), Positives = 102/303 (33%), Gaps = 40/303 (13%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 S + F+ + +I +CR +S + Y WL R P Sbjct: 10 STKFRFI-AHYKTRYSIVLMCRFLSVSKSGYYAWLDRE--------------PSRYDQEE 54 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP---GI 130 ++ + E +G+ ++ L QG + + V +M GL + + Sbjct: 55 QALKKRIIEVFTQSRETYGSPRVHAELRRQGVLV-SRKRVARIMREQGLRARSYRIYMKM 113 Query: 131 PATGRF------------EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 RF + A N+ W D + G + ++D +SR + + Sbjct: 114 AKLHRFYQSIKNIKKDTPKPTAVNQQWSGDLT-YIKQGKRWMYLAVVIDLYSRKIVGWSL 172 Query: 179 CTDERRETVQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 + + + L +R+ D GS + ++ L + GI + Sbjct: 173 GSKKSTQLTMSSLRMAIRNRKPQERLLFHTDRGSEYR-----AHEVQALLSKNGIVPSMN 227 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVP 295 RP H ++E F +LK ++++ F +L+ + YN R H +L P Sbjct: 228 RPGHCTDNAEVESFFHTLKGDIIRKNSFKSEKQLRDKLAGYIQHFYNRYRLHSSLGYRTP 287 Query: 296 GSR 298 Sbjct: 288 HEY 290 >UniRef50_A4TG41 Integrase, catalytic region n=32 Tax=Actinomycetales RepID=A4TG41_MYCGI Length = 522 Score = 209 bits (533), Expect = 1e-52, Method: Composition-based stats. Identities = 87/381 (22%), Positives = 132/381 (34%), Gaps = 32/381 (8%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 + R + V + I + ++ AT +W++R+ G L PR Sbjct: 58 LSTKQRGKLVREIADRRH-IDPFGAQVQVARATLDRWIRRYRTGGFEALVPEPRR---LG 113 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGAS-PG 129 R+ + L + ++ R L P+ ST+ R L+ + Sbjct: 114 TRTDTQVLELAVSLKRENPARTVAQVARILRTATGWAPSESTLLRHFHRCELMGPTAGQP 173 Query: 130 IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 GRFE PN LW D G + + LDDHSR + E + Sbjct: 174 GEVFGRFEAADPNELWVGDALHGPRVGDRKTYLFAFLDDHSRLVVGHRFGFAEDTVRLAA 233 Query: 190 QLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 L G+P + +DNGS + D L +LGIR+ HS P PQ +GK+ER Sbjct: 234 ALKPALAARGVPASIYVDNGSAFVDAW-----LLRACAKLGIRLVHSAPGRPQGRGKIER 288 Query: 250 FHRSLKAEVLQGKWFADSG--------------ELQRAFDHWRTVYNLERPHEALDMAVP 295 F R+++ + L + EL R F W R H P Sbjct: 289 FFRTVRDQFLVEVTDTSAEDLTAAGVDHRGALLELNRLFMAWTETEYHRRTHSETG-QSP 347 Query: 296 GSRYQPSARQYSGNTTPP------EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVG 349 R++ + G+ P E R V + +S+ + A G RV Sbjct: 348 LDRWEDGWDRLGGSPALPTAADLTEAFLWSEFRVVTKTATVSLHSNTYRVDPALAGRRVE 407 Query: 350 LKEMQEDG-SYEVWWYSTKVG 369 L D S EV + G Sbjct: 408 LVFSPFDLESIEVRYRDQSFG 428 >UniRef50_A5B7N0 Putative uncharacterized protein n=17 Tax=Vitis vinifera RepID=A5B7N0_VITVI Length = 2000 Score = 209 bits (533), Expect = 1e-52, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 87/213 (40%), Gaps = 28/213 (13%) Query: 146 QMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMT 205 +DF G FP G + L +D S++ + ++ R ++ ++F R+G+P + Sbjct: 1260 GIDFMGPFPMSFGNSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKAII 1319 Query: 206 MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG---K 262 D G+ + + E L + G++ + PYHPQT G++E +R +K +++ Sbjct: 1320 SDGGAHFCN-----KPFEALLSKYGVKHKVATPYHPQTFGQVELANREIKNILMKVVNSN 1374 Query: 263 WFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVR 322 S L + +RT Y L M+ Y+ + EY ++ Sbjct: 1375 RKDWSIRLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWWAIK 1425 Query: 323 KVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 K++ + KA + L EM+E Sbjct: 1426 KLN-----------MDLIKAGEKRYLDLNEMEE 1447 >UniRef50_A5APW4 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5APW4_VITVI Length = 1536 Score = 208 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 108/315 (34%), Gaps = 43/315 (13%) Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP--GIPATGRF 136 ++R + E+ G + G + R L + +P Sbjct: 895 EIIRKCVPKAEQQGILR-HCHENACGGHFASQKNTMRSCDRCQRLGKLTRGNMMPLNPIL 953 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 D +W +DF G F G + L + S++ + ++ R ++ ++F Sbjct: 954 IVD-LFYVWGIDFMGPFSMSFGYSYILVGVYYISKWVETVPCKHNDHRVVLKFLKENIFS 1012 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 R+G+P + D G+ + + E L + G++ + PYHPQT G++E ++ +K Sbjct: 1013 RFGVPKVIISDEGTHFCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELTNKEIKN 1067 Query: 257 EVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPP 313 +++ S +L + +R Y L M+ Y+ + Sbjct: 1068 ILMKMVNTNRKDWSVKLFDSLWAYRKTYKTI-----LGMSP----YRLVYGKAYHLPVEL 1118 Query: 314 EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE-----------DGSYEVW 362 EY ++KV+ + K + L EM+E Sbjct: 1119 EYKAWWAIKKVN-----------MDLSKVGLKRFLDLNEMEELRNDAYINSKIAKEILKR 1167 Query: 363 WYSTKVGVIDLKKKS 377 W+ + D +K Sbjct: 1168 WHDQLISYKDFQKGQ 1182 >UniRef50_C1XPR1 Transcriptional regulator/sugar kinase n=14 Tax=Meiothermus silvanus DSM 9946 RepID=C1XPR1_9DEIN Length = 777 Score = 208 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 83/392 (21%), Positives = 138/392 (35%), Gaps = 39/392 (9%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS- 73 + V + + + + GIS AT ++W + ++G AGL+ R R P H + Sbjct: 35 RKLRLVKALRESKKSWKEIQDLVGISRATYHRWQKALKEKGLAGLKPRSRRPKHLRTKVH 94 Query: 74 -SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMA-------------- 118 + + + + WG I L +G M + TV ++A Sbjct: 95 WTPGLLIRIETLRKENPTWGRWSIWLTLRKEGFQM-SERTVGRILAYLEKHRRIESVAGY 153 Query: 119 ----RHGLLPGA--SPGIPATGR-FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSR 171 + G L P R +E AP L Q+D G + +D HSR Sbjct: 154 LARTQRGKLKRRVNRPYAKRKPRGYEARAPGDLVQVDTLTLTLGPGSMVKHFSAIDLHSR 213 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLG 230 F L + + + L + R P + +D GS + E LG Sbjct: 214 FVLA-EVHSRATAKLSEGFLSLLLARAPFPIRAIQVDGGSEF------MAEFEEACCALG 266 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 I + P P+ G +ER R+ K E ELQ D + YN RPH AL Sbjct: 267 IALFVLPPRSPKLNGHVERMQRTFKEEFYTRPLPTPLSELQAELDTYLDYYNRRRPHMAL 326 Query: 291 DMAVPGSRY-----QPSARQYSGNTTPPEYDEGVMVR-KVDISGKLSVKGVSLSAGKAFR 344 P + ++ S T Y ++ GKL++ G + + + Sbjct: 327 GGLAPLEFLAKMQEESVPQRVSNVLTDYTYLTPRAGWSRLSSFGKLTIGG-LVPMSPSRK 385 Query: 345 GERVGLKEMQEDGSYEVWWYSTKVGVIDLKKK 376 G+ + +K + E S + +L ++ Sbjct: 386 GDPLEMKRINRRAILEALKGSRSLTRAELARR 417 >UniRef50_C6VW29 Integrase catalytic region n=2 Tax=Sphingobacteriales RepID=C6VW29_DYAFD Length = 273 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 55/282 (19%), Positives = 92/282 (32%), Gaps = 33/282 (11%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL 80 + + C+ + + Y S + ++ Sbjct: 4 EIVREHDIAVSRACKIVSLVRSQYYY----------------------SSKKDDSEVIES 41 Query: 81 LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRF---- 136 L+ +H +G RK+ +L G VH + L R Sbjct: 42 LQDLAFKHPSYGFRKLFAYLRRSGKPW-NHKRVHRIYQVLKLNKRRKGKRRLPDRVRQPL 100 Query: 137 -EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 + N +W +DF G + ++DD SR +L + T + + + L + Sbjct: 101 AQPAQVNEVWSVDFMSDSMVGNRKFRTFNVIDDCSREALAIEIDTSLSAKRIIRTLNRIG 160 Query: 196 ERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 E G P + DNG + +W GI +P P G +ERF+R + Sbjct: 161 ESRGFPMAIRSDNGPEFTSGN-----FTIWCEEKGIEAKFIQPGKPTQNGYIERFNRLYR 215 Query: 256 AEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 VL F D ++++ W YN RPHE L P Sbjct: 216 EAVLDAYLFFDLDQVRQLTAEWIEEYNQRRPHEGLGNLTPFE 257 >UniRef50_C7RJ38 Integrase catalytic region n=5 Tax=Proteobacteria RepID=C7RJ38_9PROT Length = 441 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 87/375 (23%), Positives = 142/375 (37%), Gaps = 37/375 (9%) Query: 12 TMSLRTEFVLFASQDGANIRSLC-----------RRFGISPATGYKWLQRWAQEGAAGLQ 60 + R Q ++R L R G T W R+ G GL Sbjct: 17 PLISRQRLARGELQK--SLRELATREYVIPGTDRRLLG--EKTIEGWYYRYRARGLDGLI 72 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHT---MPAFSTVHNLM 117 + R ++ S + A + A + R R+I+R LE G + S++H L+ Sbjct: 73 PKVRADRGQ-SKLSASVQAAILAAKRENPRRSIRQIQRVLEIGGIVARGTLSRSSLHRLL 131 Query: 118 ARHGL--LPGASPGIPATGRFEHDAPNRLWQMDFKG----HFPFGGGRCHPLTLLDDHSR 171 +HGL LPG++ F LW D G+ + ++L DD SR Sbjct: 132 QQHGLSRLPGSASLPEEKRSFVAACAGELWYSDVMHGPRVPIGGRLGKSYLVSLFDDASR 191 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 A C E ++ L + G+P ++ +DNG+ + T L+ RLGI Sbjct: 192 LVAHGAFCRGETALDIEGVLKQALLKRGVPVKLVVDNGAAYVAQT-----LQGICARLGI 246 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVL---QGKWFADSGELQRAFDHWRTVYNLERPHE 288 + H RPY P+++GK+ER+HR+ + + L + + +L W PH+ Sbjct: 247 VLVHCRPYAPESKGKIERWHRTCRDQFLSEVEERHVLSLDDLNARLWAWLEQVYHRTPHD 306 Query: 289 ALDMAVPGSRYQPSARQY----SGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFR 344 L+ P +RYQ + T + R V G +S +G Sbjct: 307 GLEGQTPLARYQQDLPKIRLLGPLAATLDTLFLHRVRRLVRKDGTVSYQGGRFEVPFELT 366 Query: 345 GERVGLKEMQEDGSY 359 G+ V L+ + Sbjct: 367 GKTVCLRVDPHTETV 381 >UniRef50_Q486I3 ISCps2, transposase orfB n=1 Tax=Colwellia psychrerythraea 34H RepID=Q486I3_COLP3 Length = 272 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 51/285 (17%), Positives = 82/285 (28%), Gaps = 41/285 (14%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHER-W 91 +C IS + Y WL R P + + +T + ++ + Sbjct: 2 MCALLDISRSGYYAWLSR---------------PLCKTKQENIQLTEQISKVFEQSRCVY 46 Query: 92 GARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA---------------TGRF 136 GA +I+ L G + V LM L F Sbjct: 47 GAPRIRAALNADGQA-CGKNRVARLMRVLQLKGRPKRVFRRSAKSNPYVEPAPNLLHQNF 105 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 A N++W D + G + ++D +SR + + + R L Sbjct: 106 VSKAVNQVWSSDIT-YIQTKQGFVYLAVVMDLYSRKIIGWSMDKNMGRHIAMNALGMAVA 164 Query: 197 RYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 D + D GS + + L GI S + +E F SL Sbjct: 165 ARNPSDGLIIHSDRGSQYISDD-----YQQMLNENGILCSMSARGNCYDNAVVESFFGSL 219 Query: 255 KAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 K E ++ + E Q + YN R H L P Sbjct: 220 KRERIRRYNYRTRQEAQIDVFDYIECFYNKRRLHSYLKYKNPSDF 264 >UniRef50_C1RGI3 Transposase n=2 Tax=Actinomycetales RepID=C1RGI3_9CELL Length = 291 Score = 208 bits (529), Expect = 4e-52, Method: Composition-based stats. Identities = 61/298 (20%), Positives = 92/298 (30%), Gaps = 38/298 (12%) Query: 18 EFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDI 77 V + DG + CR GIS + Y + R P + + + Sbjct: 4 RLVQELAADGVPVALTCRVLGISRSGLY---------------EALRRPPSARQLADAAL 48 Query: 78 TALLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA---- 132 + + H R +GA ++ L V LM GL+ Sbjct: 49 SNTIAAIHHRSRATYGAPRVHAELRLGLGVACGRKRVARLMRAAGLVGVCHRRKRRGQRP 108 Query: 133 ---------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 RF D P+RLW D H P G+ + +LD +R + + R Sbjct: 109 LPAPHEDLVQRRFVADGPDRLWCTDITEH-PTATGKVYCAAVLDVFTRKIVGWSIADHMR 167 Query: 184 RETVQQQLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHP 241 E V L R + D GS + L G+ R Sbjct: 168 AELVVDALQMAIWRRRPAPGAIVHADRGSQYTSWI-----FGHRLRAAGLLGSMGRVASS 222 Query: 242 QTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 +E F +L+ E+L + +A +L A W YN R H +L P Sbjct: 223 VDNTMMESFWSTLQRELLDRRSWASQADLASAIFEWIEGFYNPRRRHSSLGYRSPNQF 280 >UniRef50_A6VE65 Transposase InsF for insertion sequence IS3A/B/C/D/E/fA n=6 Tax=Proteobacteria RepID=A6VE65_PSEA7 Length = 288 Score = 207 bits (528), Expect = 4e-52, Method: Composition-based stats. Identities = 62/299 (20%), Positives = 92/299 (30%), Gaps = 38/299 (12%) Query: 22 FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL 81 A + +CR +S + Y W QR + + Sbjct: 7 EALAGRYPVAPMCRLLQVSRSGFYAWQQRPPSVREMANRRLS--------------KEIR 52 Query: 82 RMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT-------- 133 + H+ + +G R+IK L G V LM GL + Sbjct: 53 TIHHEVNGIYGHRRIKAELTAMGQA-CGRHRVARLMREAGLRVRSRKRWRLVSSSRHDLP 111 Query: 134 -------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 +F D NR W D + G + +LD +SR + A ++ Sbjct: 112 IAPNHLDRQFVSDRANRHWVSDMT-YVRTAQGWLYLAVVLDLYSRAVVGWAMHHRMQQAL 170 Query: 187 VQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 V L R + D GS + + L R I HSRP + Sbjct: 171 VHAALEMAVARRQPQAEVLLHSDRGSQYCAYD-----YQALLRRHRIVPSHSRPGNCWDN 225 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E F RSLKAE + +A E + + YN R H L P + A Sbjct: 226 AAMESFFRSLKAERVYLTRYASYQEAKTDLFDYIRFYNHRRRHSTLGYLSPMEFERRYA 284 >UniRef50_C1PC72 Integrase catalytic region n=1 Tax=Bacillus coagulans 36D1 RepID=C1PC72_BACCO Length = 262 Score = 207 bits (528), Expect = 5e-52, Method: Composition-based stats. Identities = 50/266 (18%), Positives = 94/266 (35%), Gaps = 25/266 (9%) Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHG 121 R + P + + + H+ +G+ ++ L + G+T+ + TV LM G Sbjct: 2 RNQGPSKKEAYLKEIRQKISKSFHESQGTYGSPRVHNDLVEWGYTI-SQKTVARLMKEMG 60 Query: 122 LLPGASPGIPAT---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLL 166 L + T +F+ + PN++W +D + G + +++ Sbjct: 61 LSASSKEKFVVTTDSNHDMKIYPNLLKRQFKTEGPNQVWVVDIT-YIWTLEGWVYLSSVM 119 Query: 167 DDHSRFSLCLAHCTDERRETVQQQLVSVFERY--GLPDRMTMDNGSPWGDTTGTWTALEL 224 D SR + ++E Q L G D GS + Sbjct: 120 DLFSRKIVGWRMGGRMKKELPIQALNMAITSRQPGEGLVHHSDRGSQYCS-----KEYTD 174 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNL 283 L GI++ SR P +E FH ++K +++ + F RA +++ + YN Sbjct: 175 ILKANGIQISMSRKGDPYDNACIESFHATIKKDLIHRRRFETRAAAMRAINYYISSFYNE 234 Query: 284 ERPHEALDMAVPGSRYQPSARQYSGN 309 R H L P + + N Sbjct: 235 RRKHSTLGYVSPNQFERKHQQITEEN 260 >UniRef50_A3ZSC8 Transposase orfB n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZSC8_9PLAN Length = 281 Score = 207 bits (527), Expect = 6e-52, Method: Composition-based stats. Identities = 54/232 (23%), Positives = 88/232 (37%), Gaps = 14/232 (6%) Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------GRFEH 138 R+G R I R L +G + F V+ L R GL R + Sbjct: 40 EFPRYGYRMITRLLRQEGWQV-NFKRVYRLWRREGLKVPVKQAKKRRLGTVDGGITRRQA 98 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + PN +W +DF G L+L+D+ +R + L + + + L +F Sbjct: 99 ERPNHVWSIDFIFDRTENGRPLKILSLVDEFTRECIALEVNRKFTGDHLVELLADLFAIR 158 Query: 199 GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV 258 G+P+ + DNG + ++ +L ++ + + + P P G +ERFH L+ E Sbjct: 159 GVPEFIRSDNGPEFISRR-----VQKFLEKIDVGMSYIEPGSPWQNGYVERFHSRLRDEC 213 Query: 259 LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNT 310 L + F E + WR YN RPH +L P Sbjct: 214 LACELFTTLAEARTVIAAWRQTYNHRRPHSSLGGQTPADFASQWPASVPAAP 265 >UniRef50_A4JLW8 Integrase, catalytic region n=9 Tax=Proteobacteria RepID=A4JLW8_BURVG Length = 277 Score = 207 bits (527), Expect = 7e-52, Method: Composition-based stats. Identities = 62/281 (22%), Positives = 99/281 (35%), Gaps = 33/281 (11%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 G R CR IS + S + + Sbjct: 9 GVAERRACRLVAISRSVYQY---------------------RSHRDPETALRQRMCEIAA 47 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA------ 140 R+G RKI+ L +G+ + + + ++ L GL P + A Sbjct: 48 TRVRYGYRKIRVLLLREGYQV-SKNRLYRLYREEGLSLRYRPNRKRRAQMSRPARAKSTA 106 Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 N+ W +DF G R LT++D +R +L + V + L + + G Sbjct: 107 ANQAWSLDFVADQLSNGQRFRALTIIDVFTREALAIDVGQRLSASDVVRVLDELRSKRGA 166 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + DNGS + ++LW + + SRP P +E F+ +L+ E L Sbjct: 167 PRTLFCDNGSEFTSQV-----MDLWAYHHKVEIAFSRPGKPTDNAFVESFNGTLRDECLN 221 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 WF + + + WR YN RPH ALD P + Sbjct: 222 VHWFTSLADAREQIERWRVEYNESRPHRALDEVPPAEYVRQ 262 >UniRef50_B4S6V0 Integrase catalytic region n=10 Tax=Bacteria RepID=B4S6V0_PROA2 Length = 282 Score = 207 bits (526), Expect = 9e-52, Method: Composition-based stats. Identities = 67/299 (22%), Positives = 103/299 (34%), Gaps = 32/299 (10%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 + R G ++R LC G++ ++ Y Sbjct: 2 VTPEAKRNAVKHLHDTFGQSLRKLCILIGLNRSSWYY---------------------EP 40 Query: 70 PNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG 129 S++ I LR D +RWG R++ L +G + L L+ Sbjct: 41 QPDSNEPIRKRLRELADERKRWGYRRLHYLLRREGFQI-NHKRTERLYREENLMLRVRRR 99 Query: 130 IP-----ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 + N W MDF + G R L +LD +SR L T Sbjct: 100 RKMASESRVAPPPPERKNHCWAMDFMSDNLYNGRRFRVLNVLDSYSRDYLGFEVDTSING 159 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 + V L + GLP+ +T+DNG + AL+ W R G+++ +RP P Sbjct: 160 KRVCSVLERIAWFKGLPELITVDNGPEFIG-----KALDAWAHRHGVKLVFNRPGKPVDN 214 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E F+ L+ E L WF G + W+ YN RPH +L P Sbjct: 215 TYIESFNGRLRDECLNVNWFMSLGHAREVIAEWQEDYNSVRPHSSLGTRTPEEFLVQQT 273 >UniRef50_Q0ZCB7 Integrase n=4 Tax=Eukaryota RepID=Q0ZCB7_POPTR Length = 1332 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 98/264 (37%), Gaps = 24/264 (9%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++++++++ H + +RK + G P + + I Sbjct: 821 DNEVSSVIKFCHSEACGGHFSSRKTTAKILQSGFYWPTMFKDSHAFCKTCENYQKLGSIS 880 Query: 132 ATGRFEHDAP-----NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 + W +DF G FP G + L +D S++ + ++ + Sbjct: 881 KHHMMPLNPILVIEIFDCWGIDFMGPFPPSFGFLYILVAVDYVSKWIEAIPSRNNDHKTV 940 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++ R+G+P M D G+ + + T+ E + + GI + PYHPQT G+ Sbjct: 941 IKFLKENILSRFGIPRAMISDGGTHFCN-----TSFESLMKKYGITHKVATPYHPQTSGQ 995 Query: 247 LERFHRSLKAEV---LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E +R +K + + S L A +RT Y +L M+ Y+ Sbjct: 996 IELANREIKQILEKTVNPNRKDWSLRLNDALWAYRTAYKT-----SLGMSP----YKLVY 1046 Query: 304 RQYSGNTTPPEYDEGVMVRKVDIS 327 + E+ ++ + + Sbjct: 1047 GKPCHLPVELEHKAYWAIKAFNSN 1070 >UniRef50_Q3BT31 Transposase n=22 Tax=Bacteria RepID=Q3BT31_XANC5 Length = 343 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 67/324 (20%), Positives = 102/324 (31%), Gaps = 37/324 (11%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 + R G + R C G+S R Sbjct: 45 HKKMVTPGAKREAVAHAREHHGLSERRACNLVGVSRRVIRY---------------RSSR 89 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P P + LR R+G R++ L +G T P + + L Sbjct: 90 PDDGP------LRQRLRELAAERRRFGYRRLGYLLAREGIT-PNHKKLLRVYREENLRVR 142 Query: 126 ASPGI-----PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 G D PN+ W +DF R L ++DD++R L L T Sbjct: 143 RRGGRKRALGTRAPMVLPDGPNQRWSLDFVSDTLTCSRRFRILCVVDDYTRECLALVADT 202 Query: 181 DERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH 240 V ++L + G P + DNG+ T +A+ W + + P Sbjct: 203 SLSGVRVARELTRLIGMRGKPHTVVSDNGTE-----LTSSAILRWSQERRVEWHYIAPGK 257 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS--- 297 P G +E F+ L+ E L F + D WR YN RPH L P Sbjct: 258 PMQNGFVESFNGRLRDECLNETLFTSLPHARFVLDAWRHDYNHVRPHSKLGGRTPAEKAG 317 Query: 298 --RYQPSARQYSGNTTPPEYDEGV 319 ++ + RQ + +T G+ Sbjct: 318 KPVWEHAPRQVAITSTNHHVGAGL 341 >UniRef50_D1ZYT7 Whole genome shotgun sequence assembly, contig_4407 n=1 Tax=Sordaria macrospora RepID=D1ZYT7_SORMA Length = 379 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 62/315 (19%), Positives = 100/315 (31%), Gaps = 43/315 (13%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R + +R EFV + + +C G+S + + WL R A Sbjct: 85 RQGLDMRFEFVAKH-RGIWPVSWICEALGVSRSGFHAWLVRAPSARA------------- 130 Query: 70 PNRSSDDITALLRM-AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 RS D+ A +R + +GAR++ R L + + + LM GL Sbjct: 131 --RSDDEFGARIRASFISSYRTYGARRVWRDLLAEDLS-CGLHRIERLMRVQGLRARPRR 187 Query: 129 GIPAT--------------GRFEHDAPNRLWQMDFKGHFPFGG---GRCHPLTLLDDHSR 171 +F +APN+ W DF + G L+ SR Sbjct: 188 RGLPKDDGLRSVIADNILDRQFTAEAPNQRWIADFTYIWTADGPPKDGSTSLSSSTCFSR 247 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRL 229 + + + V L+ R G PD + D GS + + + Sbjct: 248 RVVGWSMSDSMTAQLVTDALMMAIWRRGKPDALLHHSDQGSQYTSEK-----FQRLMTDN 302 Query: 230 GIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHE 288 G+ SR + +E F SLK E K + + + YN R H Sbjct: 303 GVTCSMSRSGNVWDNAAMESFFSSLKTERTGRKTYRSRNHAKAHVFDYIERFYNPTRRHS 362 Query: 289 ALDMAVPGSRYQPSA 303 L P + + Sbjct: 363 TLGYLSPMEFERQAQ 377 >UniRef50_A7VUR6 Putative uncharacterized protein n=3 Tax=Clostridium leptum DSM 753 RepID=A7VUR6_9CLOT Length = 381 Score = 205 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 56/302 (18%), Positives = 95/302 (31%), Gaps = 34/302 (11%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 + + + + ++ SLC +S T Y + R + + R Sbjct: 84 SPLQEKLKAMEPLY-GQFSVHSLCDALEVSRGTFYNHIFRNKKGDTLAAKRRS------- 135 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + T + + D + +GA KI L++QG + V LM G+ ++ Sbjct: 136 ----ELKTQIQSIYDDSGQIYGAGKIAAILQNQGVKT-SKKYVSQLMKELGIGSVSTTAK 190 Query: 131 P-------------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLA 177 +F + PN++W D F F + ++D SR + Sbjct: 191 KEYKKWEKGQNRNFLQQQFRTERPNQVWVSDIT-VFKFHDKYYYLCVIIDLFSRKVISYR 249 Query: 178 HCTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + + + + D G+ + A L G+ Sbjct: 250 ISHKSSTQLLTKTFKQAYADRQPKAELMFHSDRGTQYMSY-----AFVHLLDDFGVEQSF 304 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 SR P E F K E L + + G L R H+ YN ERPH L P Sbjct: 305 SRTACPHDNAVSEAFFSIFKKEELYRRHYTSEGGLMRGIAHFIAFYNTERPHSTLQYKTP 364 Query: 296 GS 297 Sbjct: 365 EQ 366 Score = 41.8 bits (97), Expect = 0.039, Method: Composition-based stats. Identities = 14/75 (18%), Positives = 27/75 (36%), Gaps = 1/75 (1%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 T + FV QDG + +C I +T Y+W+Q + Q Sbjct: 4 TKEEKIAFVKRY-QDGETVIKICNENQIPRSTFYRWIQDYQQTVTDTGTVVTPQEFQYLK 62 Query: 72 RSSDDITALLRMAHD 86 R + + ++++ Sbjct: 63 RRINKLEDMIQVLKT 77 >UniRef50_A5BAC6 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BAC6_VITVI Length = 1485 Score = 205 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 52/319 (16%), Positives = 104/319 (32%), Gaps = 75/319 (23%) Query: 44 GYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQ 103 Y W + + + A R +P + +L H+ Sbjct: 1127 VYYWEEPFLFKYCADQIIRKCVPK-------QEQQGILSHFHES---------------- 1163 Query: 104 GHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDA-------PNRLWQMDFKGHFPFG 156 + P+ + M R T R + +W +DF G FP Sbjct: 1164 -FSWPSIFKDAHTMCR--SCDRCQRLGKLTHRNQMPMNPILIVDIFDVWGIDFMGPFPMS 1220 Query: 157 GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTT 216 G + L +D S++ + ++ R ++ ++F R+G+P + D G+ + + Sbjct: 1221 FGNSYILVEVDYVSKWVEAILCKHNDHRVVLKFLRENIFSRFGVPKAIISDGGTHFCN-- 1278 Query: 217 GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDH 276 E L + G++ + PYHPQ G++E +R +K +++ + Sbjct: 1279 ---KPFETLLAKYGVKHKVATPYHPQNSGQVELANREIKNILMKRIAYKTI--------- 1326 Query: 277 WRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVS 336 L M+ Y+ + EY ++KV+ Sbjct: 1327 -------------LCMSP----YRLVYGKACHLPVEVEYXXXWXIKKVN----------- 1358 Query: 337 LSAGKAFRGERVGLKEMQE 355 + + + L EM+E Sbjct: 1359 MDLNRXXMKRCLDLNEMEE 1377 >UniRef50_B0NHH2 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium scindens ATCC 35704 RepID=B0NHH2_EUBSP Length = 422 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. Identities = 72/342 (21%), Positives = 127/342 (37%), Gaps = 19/342 (5%) Query: 36 RFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARK 95 F SP T KW+ + G L R R + D + R + + Sbjct: 54 VFRYSPKTISKWVSLYQNGGIDALMPRERSDKGATRVLPDTAIEEICRLKAAFPRLNSTQ 113 Query: 96 IKRWLEDQGHTMPAFS--TVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKG-- 151 I + L ++ + S V + +H L ++P + FE DA ++WQ D Sbjct: 114 IHKHLVEEAFIPASVSVCAVQRFVKKHDLKSASNPNLRDRKAFEEDAFGKMWQADTCYLP 173 Query: 152 HFPFGG--GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNG 209 + G R + + ++DDHSRF + ++ Q+ L +G+P ++ +DNG Sbjct: 174 YITENGQRRRVYCILVIDDHSRFLVGGGLFYNDTAYNFQKVLKDAVAAHGIPSKLYVDNG 233 Query: 210 SPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ---GKWFAD 266 + L L +G + H++ ++ K+ER R+LK L Sbjct: 234 CSYVGA-----QLSLICGSIGTVLLHTKVRDGASKAKIERQFRTLKETWLYTLDMDSITS 288 Query: 267 SGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP---SARQYSGNTTPPEYDEGVMVRK 323 + + YN H + P +RYQ S R+ E + RK Sbjct: 289 LAQFNGLLKDYMRSYNTS-VHSGIG-TTPLARYQQTRSSIRRPKSREWLEECFLNRITRK 346 Query: 324 VDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYS 365 V+ +S+ V+ F +V ++ + +D S Y Sbjct: 347 VNKDSTVSIDRVAYDVPMQFISSKVEIRFLPDDMSSAFILYE 388 >UniRef50_A5BFP8 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5BFP8_VITVI Length = 1563 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 47/249 (18%), Positives = 94/249 (37%), Gaps = 30/249 (12%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + +KI + G P+ + M L P Sbjct: 945 EEEQHGILSHCHENACGGHFAYQKIVMRVLQSGFCSPSLFKDAHTMNMMPLNP------- 997 Query: 132 ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 +W +DF G FP G + L ++ ++ + ++ R ++ Sbjct: 998 ----ILVIDLFYVWGIDFMGPFPMSFGYSYILVGVNYVFKWVEAIPCKHNDHRVVLKFLK 1053 Query: 192 VSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 ++F R+G+P + D G+ + + E+ L + G++ + PYHPQT G++E + Sbjct: 1054 ENIFSRFGVPKAIISDGGTHFCN-----KPFEMLLAKYGVKHKVATPYHPQTSGQVELAN 1108 Query: 252 RSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSG 308 R +K +++ S +L + +RT Y L M+ Y+ + Sbjct: 1109 REIKNILMKVVNTNRKDWSIKLLDSLWAYRTAYKTI-----LGMSP----YRLVYGKACH 1159 Query: 309 NTTPPEYDE 317 EY Sbjct: 1160 LPVELEYKA 1168 >UniRef50_A5BI47 Putative uncharacterized protein n=10 Tax=Vitis vinifera RepID=A5BI47_VITVI Length = 1486 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 50/327 (15%), Positives = 102/327 (31%), Gaps = 63/327 (19%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ +L H+ + ++K + G P+ + M + Sbjct: 1125 EEEQQGILNHCHENACGGHFASQKTTMRVLQSGFCWPSLFKDAHTMCK--SCDRCQRLGK 1182 Query: 132 ATGRFEHD-------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R +W +DF FP G + L +D S++ + ++ R Sbjct: 1183 LTRRNMMPLNLILIVDLFYVWGIDFMRTFPMSFGYSYILVGVDYVSKWVEAVPCKHNDHR 1242 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 + ++ R+G+P + D +G++ + PYHPQT Sbjct: 1243 VVLNFLKENILSRFGVPKAIISDE------------------PSMGVKHKVATPYHPQTS 1284 Query: 245 GKLERFHRSLKAEVLQG---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 G++E +R +K +++ +L + +RT Y P Y Sbjct: 1285 GQVELANREIKNILMKVVNTNRKNWLIQLLNSLWAYRTAYKTIL------WISP---YHL 1335 Query: 302 SARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE------ 355 + EY ++K++ + +A + L EM+E Sbjct: 1336 VYGKTGHLLVELEYKAWWAIKKLN-----------MDLSRAGLKRFLNLNEMEELRNDAY 1384 Query: 356 -----DGSYEVWWYSTKVGVIDLKKKS 377 W+ + D +K Sbjct: 1385 INSKIAKEKLKRWHDQLISRKDFRKGQ 1411 >UniRef50_Q122F4 Integrase, catalytic region n=19 Tax=Proteobacteria RepID=Q122F4_POLSJ Length = 298 Score = 204 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 65/319 (20%), Positives = 103/319 (32%), Gaps = 44/319 (13%) Query: 17 TEFVLFASQD---GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + V + + CR IS + Y +R Q Sbjct: 3 YQLVEDLQKKATPQVPVSQTCRILEISRSGYYAARKRSQQAPVVC--------------- 47 Query: 74 SDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 L +G+R+++ L +G T+ V +LM HGL T Sbjct: 48 -AASVHLQAAFAASGRAYGSRRLRAALHARGVTVGRH-RVRSLMRAHGLRSVWRRKFVHT 105 Query: 134 ---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 +FE AP+R W D + G + +LD HSR + A Sbjct: 106 TNSKHGLAVSPNVLDRQFEQTAPDRAWVCDIT-YIRTRSGWLYLAAVLDLHSRRIVGWAT 164 Query: 179 CTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 D + V L + + D G+ + +A + L + G+ S Sbjct: 165 AGDMQATLVTTALQIAIAQRNPSPGLIVHSDRGTQYAS-----SAHQALLKKHGLVGSMS 219 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVP 295 R + +ERF +LK E + K +A+ E + YN ER H L P Sbjct: 220 RKGNCWDNAVMERFFLNLKMERVWQKDYANHSEATNDVADYIVGFYNCERLHSKLGNLSP 279 Query: 296 GSRYQPSARQYSGNTTPPE 314 + Q SA Q + + Sbjct: 280 IAFEQKSATQQPIDVSEIT 298 >UniRef50_C8PWM8 Transposase B n=2 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PWM8_9GAMM Length = 271 Score = 204 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 70/299 (23%), Positives = 109/299 (36%), Gaps = 37/299 (12%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 MP R ++ + + Q +I C+ IS Y L D Sbjct: 1 MPVVERKALAQQLQA-----QHNISIVVSCQIVCISRTAYYY---------EPKLND--- 43 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP 124 D I L D+H RWG K + L G+ V+ + L Sbjct: 44 ---------DDAIVDKLTELTDKHTRWGFPKCYKRLRKLGYVW-NHKRVYRVYTAMKLNL 93 Query: 125 GASPGIPATGRFE-----HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 R ++ + W MDF R ++DD++R L + Sbjct: 94 RRKAKRRLPTRAPEPLTVPNSLDHTWSMDFMSDKLHNNSRFRTFNVIDDYNRELLGIDIG 153 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 T V + L + E +G P+++ +DNGS + + W I V + +P Sbjct: 154 TSIPSLRVIRYLDQLAECHGYPNKIRIDNGSEFTS-----SVFTDWAASHSILVDYIKPG 208 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 P +ERF+RS + EVL F + E+++ D W VYN ERPH++L P Sbjct: 209 CPYQNAYIERFNRSYRNEVLDCYLFNNLNEVRQLTDEWINVYNHERPHDSLGNMTPAEF 267 >UniRef50_B5K429 Integrase, catalytic region n=41 Tax=cellular organisms RepID=B5K429_9RHOB Length = 397 Score = 204 bits (519), Expect = 5e-51, Method: Composition-based stats. Identities = 51/309 (16%), Positives = 90/309 (29%), Gaps = 37/309 (11%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 TM+ + FV + LCR IS Y G Q + + Sbjct: 98 TMTNKRAFVTAHKAQ-YAVSILCRLLEISRGWFY---------GFPASQPARDQRQANRD 147 Query: 72 RSSDDITALLRMA-HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + ++ + +G+++I + L + + V +M H + P Sbjct: 148 ARDQALLPKIKTFFKASKKCYGSKRIHQDLLADSE-VVSERRVARIMKEHKVSPLLRKRR 206 Query: 131 PAT----------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 +F PN +W D + G + + D +R + Sbjct: 207 KPITTDSNHKLKPSPNLLEQKFHSQTPNAVWLADIT-YIDTDEGWLYLAGVKDMTTREIV 265 Query: 175 CLAHCTDERRETVQQQLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 A R E L R G D G + + + + Sbjct: 266 GWAMEDHMRAELCCAALEMALGRRGPVPGLIHHSDRGGQYAGGD-----YRKLIKKAKLT 320 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALD 291 SR +E F SLK E++ + F E + A + YN +R H + Sbjct: 321 QSMSRKGQCLDNAPMESFFASLKKEMVHQRRFRTHAEAKAAIFEYIEVFYNRQRRHSGVG 380 Query: 292 MAVPGSRYQ 300 P ++ Sbjct: 381 YKTPKQAFE 389 >UniRef50_Q24ZR5 Putative uncharacterized protein n=3 Tax=Desulfitobacterium hafniense Y51 RepID=Q24ZR5_DESHY Length = 329 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 54/310 (17%), Positives = 99/310 (31%), Gaps = 43/310 (13%) Query: 13 MSLRTEFVL-FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 + E + + G + +C+ +S + Y W R S Sbjct: 33 PEIIYEIIKKESHGSGFPVEKMCQTLEVSRSGFYDWDGREPS---------------SRQ 77 Query: 72 RSSDDITALLRMAHDRHERW-GARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + ++I +L+ H + G K+ +++ G + V+ L + L Sbjct: 78 KEDEEILKVLKKKHTEAQGIIGLDKLWEDVKEAGFQ-CGRNRVYRLQKANQLYSVRKKPF 136 Query: 131 P----------------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 F+ D PN +W D F G + + + D + + Sbjct: 137 RIGLTDSNHSLPKAPNLLNQNFQADTPNTVWVTDITQ-FKVGSQKVYLAAIKDLFHKAIV 195 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 A R E + L + +R+ D GS +G A L + GI Sbjct: 196 GWAVANHMRTELCLEALRNALKRHRPAKGLIHHSDGGSQYGSA-----AYIQELEQHGII 250 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALD 291 S + E F ++K+E L + + D E + + YN +R H+AL Sbjct: 251 RSMSSKGNCYDNASAETFFSTIKSERLHHRKYRDLTEARNDIFWYIESFYNRKRRHQALG 310 Query: 292 MAVPGSRYQP 301 P + Sbjct: 311 HIAPEEFLKR 320 >UniRef50_B4E5J2 Transposase n=21 Tax=Proteobacteria RepID=B4E5J2_BURCJ Length = 276 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 68/300 (22%), Positives = 104/300 (34%), Gaps = 34/300 (11%) Query: 23 ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLR 82 + GA+ R C +S T Y++ S R + ++ Sbjct: 2 MQRFGASQRQTCALLQLSR-TVYRY--------------------ESVARDQSALEMRIK 40 Query: 83 MAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP------ATGRF 136 + +GA ++ L +G V + GL + Sbjct: 41 EITEVRVHYGAPRVYVMLRREGWR-DNHKRVERVYRELGLSLRHKRPRRNKSARRRQPKQ 99 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 A N +W MDF F G R LT++D+++R L + R E V L + + Sbjct: 100 SVSAINEIWSMDFVADALFDGRRLRTLTIVDNYTRECLAIEVDGSLRGEHVVAALTRLAQ 159 Query: 197 RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 LP + DNGS + L+ W G+ + SRP P K E F+ + Sbjct: 160 HRPLPRYIKADNGSEFIS-----KTLDKWAYENGVEIDFSRPGKPTDNAKNESFNGRFRE 214 Query: 257 EVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA-RQYSGNTTPPEY 315 E L WF + +R + WR YN RPH AL P + R PE+ Sbjct: 215 ECLNAHWFLSLEDARRKIEVWREYYNEARPHSALQWMTPAEFARQCTDRADPARPEEPEF 274 >UniRef50_C3TSE1 Putative transposase n=1 Tax=Enterococcus faecium RepID=C3TSE1_ENTFC Length = 307 Score = 203 bits (518), Expect = 6e-51, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 92/303 (30%), Gaps = 41/303 (13%) Query: 17 TEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD 76 E + ++ + ++ G+S + W +R +P + R + Sbjct: 14 KEIKSLPVKRQVSVSGILKKLGVSRSGYNAWKKR--------------VPSDTSVRRAVL 59 Query: 77 ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG-- 134 + ++ + H+ +GA KI L G + + TV N M + G+ T Sbjct: 60 KEKIQKIYEESHQNYGAPKITAELRKSGEYV-SEKTVGNYMRQMGIRAHWVKPYIQTTID 118 Query: 135 -------------RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 P+ +W D + G + +++D +SR + Sbjct: 119 SDLSQKLKNILNEECNPAHPDAVWCTDIT-YIWTFEGFVYLTSVMDLYSRKIISWVLSET 177 Query: 182 ERRETVQQQLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 V + + + P D G + A + G+ +S+ Sbjct: 178 LEASHVVECVEKAKRVRNVEKPLIFHCDRGCQYVSE-----AFQK--ATKGMIHSYSKKA 230 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 +P +E FH +K E + + + + YN R H P Sbjct: 231 YPWDNACIESFHALMKREWINRFKIFNYAHAHKLIFEYIETFYNTVRIHSHCGYLSPNEY 290 Query: 299 YQP 301 + Sbjct: 291 EEQ 293 >UniRef50_B2GC00 Transposase n=14 Tax=root RepID=B2GC00_LACF3 Length = 280 Score = 203 bits (518), Expect = 7e-51, Method: Composition-based stats. Identities = 56/288 (19%), Positives = 89/288 (30%), Gaps = 41/288 (14%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWG 92 +CR G+S A Y++ R P D +LR+ + +R+G Sbjct: 1 MCRILGVSRAQYYRY--------------RSPKPSKRRAEDVDLKQRILRIFAEFKQRYG 46 Query: 93 ARKIKRWLEDQGHT---MPAFSTVHNLMARHGL---------------LPGASPGIPATG 134 KI L + + + LM + Sbjct: 47 VMKIHHELNLELQPLQRRCSPRRISRLMKELDIHSVTVNKWKAASASKTKVEQRPNLLKQ 106 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 F N+ W D G C+ T++D HSR + + + V + L S Sbjct: 107 DFSTTGLNQKWTADMTYIQTKRNGWCYLSTIMDLHSRRIIGYSFSKKMDTDLVLKTLESA 166 Query: 195 FERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 + + + D GS + L L IR +SR +P +E FH Sbjct: 167 VKNRTITGDPIIHTDLGSQYTSDD-----YNQRLTELHIRHSYSRKGYPYDNAPMESFHA 221 Query: 253 SLKAEVLQGKW-FADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 SLK E + F + + YN +R H +L P Sbjct: 222 SLKKECVYPVPVFENYETAAAVLFEYVHAFYNRKRIHSSLGYQTPLQV 269 >UniRef50_B2JXI0 Integrase catalytic region n=13 Tax=Bacteria RepID=B2JXI0_BURP8 Length = 319 Score = 203 bits (518), Expect = 7e-51, Method: Composition-based stats. Identities = 63/323 (19%), Positives = 99/323 (30%), Gaps = 42/323 (13%) Query: 21 LFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL 80 + A++ I ++ R +S + Y WL R + S + A Sbjct: 1 MKANRARWPIATMARLLAVSTSGYYAWLVREPS---------------AHACSDAQLLAR 45 Query: 81 LRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR---- 135 +R H +GA +I L +G + V LM GL + P T R Sbjct: 46 IRTLHASSRGTYGAPRIHAQLAREGVHVG-RKRVARLMRIAGLCGASRRRWPHTTRPRAG 104 Query: 136 -----------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 F A N LW D + P G + +LD SR + A Sbjct: 105 ARRAPDLVQRHFSAGAANVLWVADAT-YIPTDEGFLYLAVVLDVFSRRIVGWAMSNHLYT 163 Query: 185 ETVQQQLVSVFERYGLPDRMT-MDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 E + + L + + D G + A G+ Sbjct: 164 ELMLRALDMALLQRRPEGVIHHSDQGCQYTSI-----AFGRRCREAGVHPSMGTAGDAYD 218 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPS 302 E F +L+AE+L + FA + +R + YN+ R H +L P Sbjct: 219 NAMCESFFGTLEAELLSREHFATHEQARRRLFSFLEGWYNVRRLHSSLGYHSPLEFENLH 278 Query: 303 AR--QYSGNTTPPEYDEGVMVRK 323 A+ S P R+ Sbjct: 279 AKDQISSHCGLPTARQRHGRDRR 301 >UniRef50_A5B281 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B281_VITVI Length = 721 Score = 203 bits (517), Expect = 8e-51, Method: Composition-based stats. Identities = 57/355 (16%), Positives = 123/355 (34%), Gaps = 46/355 (12%) Query: 45 YKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH--ERWGARKIKRWLED 102 Y W + + + A R IP + +L H+ + ++K + Sbjct: 40 YYWEEPFLFKYCADQIIRKFIP-------EQEQQGILSHCHESACGGHFDSQKTAMKVLQ 92 Query: 103 QGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDF--------KGHFP 154 G P+ M R T + N + +D G FP Sbjct: 93 SGFCGPSLFKDALTMCR--SCDKCQRLGKLTCKNMMP-LNPILIVDLFMYGALTSMGPFP 149 Query: 155 FGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGD 214 G + L +D S++ + ++ R ++ ++F R+G+P + D G+ + + Sbjct: 150 MSFGYSYILVGVDYVSKWVEAIPCKQNDHRVVLKFLKENIFSRFGVPKAIISDGGTHFCN 209 Query: 215 TTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSG---ELQ 271 E+ L + G++ + PYHPQT G++E +R +K +++ + L Sbjct: 210 -----KPFEILLAKYGVKHKVATPYHPQTFGQVELANREIKNILMKVVNTSRRDCSVRLH 264 Query: 272 RAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPP----EYDEGVMVRKVDIS 327 + +RT Y L M+ Y+ + + ++++ Sbjct: 265 DSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVASMKRFLYLNEMKELRND 315 Query: 328 GKL--SVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKSITM 380 + ++ K + + V K+ Q+ ++ + LK + I + Sbjct: 316 AYINSNIAKQR---LKRWHDQLVSHKQFQKGQRVLLYDSKLHIFPGKLKSRWIGL 367 >UniRef50_A4G5E5 Transposase IS3 family, partial pseudogene n=7 Tax=Proteobacteria RepID=A4G5E5_HERAR Length = 301 Score = 203 bits (517), Expect = 8e-51, Method: Composition-based stats. Identities = 57/315 (18%), Positives = 100/315 (31%), Gaps = 40/315 (12%) Query: 8 DARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPH 67 A + +F+ +D ++ S+C+ + + Y WL+ P Sbjct: 5 KASSKVRQAYKFIESH-RDEFSVISMCQALDVERSGYYAWLK---------------NPL 48 Query: 68 HSPNRSSDDITALLRM-AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGA 126 + + L+R H +GA ++ + L ++G + + V LM +GL Sbjct: 49 SDRAQEDARLLKLIRASFMASHGIYGAPRVFQDLRERGE-LCSKHRVARLMRENGLRALH 107 Query: 127 SPGIPA--------------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF 172 I +F PN W D + G + ++D SR Sbjct: 108 GYRIRHIPVSKPSPLIPNLLQRQFTVTRPNEAWVTDIT-YIRTWQGWLYLAVVMDLFSRK 166 Query: 173 SLCLAHCTDERRETVQQQLVSVFE-RYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 + A RE V + R + D G+ +G A + + Sbjct: 167 VVGWAVRPTIHRELVLDAIAKAVRSRRPRHTLIHSDQGTQYGSD-----AWRRFCKANHL 221 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEAL 290 SR + +E F SLK E ++ +AD + YN R H L Sbjct: 222 EPSMSRRANCWDNAVVESFFSSLKKERVKKHIYADRETATLDLAEYIDDFYNGVRRHSHL 281 Query: 291 DMAVPGSRYQPSARQ 305 P + R+ Sbjct: 282 GGVSPNTFETAHRRR 296 >UniRef50_B4UYZ0 Integrase n=7 Tax=Streptomyces RepID=B4UYZ0_9ACTO Length = 290 Score = 203 bits (517), Expect = 8e-51, Method: Composition-based stats. Identities = 54/301 (17%), Positives = 87/301 (28%), Gaps = 43/301 (14%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 F+ G N++ C +S Y R S + ++T Sbjct: 6 FIEAEKTAGHNVKRTCELLKVSRTAYYT---------------RRNGTPGSRSVRDAELT 50 Query: 79 ALLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG------------ 125 + H+R +GA ++ L+ +G V LM + GL Sbjct: 51 EYITAVHERSRGTYGAPRVHAVLKREG-AGCGRRRVARLMRQAGLAGRHRRRRHRTTVPD 109 Query: 126 -----ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 + + + A + W D + G + T++D SR + A Sbjct: 110 PHAVTRPDLVLRNFQPDPAAIDTRWCGDIT-YIATDEGWLYLATVIDIASRRVVGWATAD 168 Query: 181 DERRETVQQQLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 R E + L + P D G + + L GIR+ R Sbjct: 169 HLRTELIADALTAACRTRRPAGPVIFHSDRGCQYTS-----SELASLATDFGIRLSVGRT 223 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGS 297 E F +LK E+ + A W YNL R H +L P Sbjct: 224 GQCWDNALAESFFSTLKNELGDTHPWPTRAAAHTAIFEWIESWYNLHRLHSSLGYRSPAE 283 Query: 298 R 298 Sbjct: 284 Y 284 >UniRef50_C4FKG6 Integrase, catalytic region n=1 Tax=Sulfurihydrogenibium yellowstonense SS-5 RepID=C4FKG6_9AQUI Length = 305 Score = 203 bits (517), Expect = 9e-51, Method: Composition-based stats. Identities = 49/285 (17%), Positives = 90/285 (31%), Gaps = 37/285 (12%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 ++ IS Y + + + + + Sbjct: 26 LSLNRQLELLSISKTAYYYTKK-----------------EPFSSEEDKILLDAIDKIYTE 68 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI-----PATGRFE----- 137 H +GAR++++ LE G + + G+ P ++ Sbjct: 69 HPYYGARRMQKALESIGIKVGKRK-LSRTYKFMGIRALYPPPKTTILNKENKKYPYLLEQ 127 Query: 138 ---HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 PN++W D + G + T++D HS+ L L Sbjct: 128 ITTTQRPNQIWSGDIT-YIKLEKGYAYLATIIDWHSKKVLSWKLGNTMDSYLTTSILEEA 186 Query: 195 FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 ERYG P+ D GS + L + GI++ + +ERF R+L Sbjct: 187 IERYGKPEIFNSDQGSQYTSKEHI-----EILEKNGIKISMNANGRSIDNTVIERFWRAL 241 Query: 255 KAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 K E + K + E + + + +YN +R H ++ P Y Sbjct: 242 KYENVYPKGYNTIKEAREGINQYIEIYNSQRIHSSIGYKTPDMVY 286 >UniRef50_A3QMY0 Transposase n=37 Tax=Bacilli RepID=A3QMY0_ENTFC Length = 366 Score = 203 bits (517), Expect = 9e-51, Method: Composition-based stats. Identities = 59/304 (19%), Positives = 105/304 (34%), Gaps = 40/304 (13%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R ++ +F+ +D ++ LCR I + Y + + P + Sbjct: 82 RKKLNSLVQFIEKWCKD-YHVSLLCRLLEIPRSVYYFYKNK---------------PLTA 125 Query: 70 PNRSSDDITALLRMAH-DRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 ++ + + +R+GA KI + L +G ++ + V L+ + L Sbjct: 126 TEIRNNKLKKKISTIFFTNKQRYGATKIHQVLLKEGISV-SLKHVLKLIKQLNLRSIVVK 184 Query: 129 GIPATGR--------------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 F + W D G C+ +++D H++ + Sbjct: 185 KYRPQRSNKPIISKENLLNQDFSTETICEKWAADITYIPTKKNGWCYLSSIMDLHTKKII 244 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 + V Q L Y +P+ + D GS + T +E WL IR Sbjct: 245 SYTFSKRMTVDCVIQTLNKAKIHYHIPEGMILHTDLGSQY-----TAREVEQWLKTNKIR 299 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALD 291 +SR P +E FH SLK E + ++D E RA + YN R H ++ Sbjct: 300 HSYSRKGTPYDNAGIESFHASLKKEEVYTTSYSDFEEANRALFSYIEGFYNRNRIHSSIH 359 Query: 292 MAVP 295 P Sbjct: 360 YLTP 363 Score = 46.8 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 44/144 (30%), Gaps = 4/144 (2%) Query: 17 TEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD 76 + ++ +Q G ++R L + +G+S AT YKW + + + GL + + + Sbjct: 9 KQMIVELNQTGRSVRGLAKEYGLSEATIYKWKNLYLPDQSTGLTGKEVA---ELRKENAR 65 Query: 77 ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRF 136 + L + + +K+ L V L + Sbjct: 66 LNEELEILKKAAAIFSRKKL-NSLVQFIEKWCKDYHVSLLCRLLEIPRSVYYFYKNKPLT 124 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRC 160 + N + F R Sbjct: 125 ATEIRNNKLKKKISTIFFTNKQRY 148 >UniRef50_B8J8P0 Integrase catalytic region n=1 Tax=Anaeromyxobacter dehalogenans 2CP-1 RepID=B8J8P0_ANAD2 Length = 281 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. Identities = 68/304 (22%), Positives = 109/304 (35%), Gaps = 32/304 (10%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 +LR + ++ R CR G++P+T Y R P Sbjct: 4 PAALRPAVIELGAKFAMKKRRACRVVGLAPSTLYYC---------------SRRPER--- 45 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 ++ A LR + RWG R++ L+ +GH V L GL Sbjct: 46 ---AEVRARLRDLAAQRPRWGYRRLHVLLDREGH-HLNHKLVFRLYRSEGLAVRRKRRKR 101 Query: 132 AT-----GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 T P + W MDF G + L L+D +R L + Sbjct: 102 ITSSLRVVPPPPTRPRQQWTMDFTQDSLASGRQFRTLNLIDAFTRECLLIEADHSLTGAR 161 Query: 187 VQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGK 246 V + L + E +G P+ + +DNG+ + +A++ W +R+ P P G Sbjct: 162 VVRALERLRELHGTPEVIRIDNGTEFTS-----SAVDAWAYTNQVRLDFITPGKPTENGH 216 Query: 247 LERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 +E F+ + E L WF +++R + +R YN RPH +LD P Sbjct: 217 IESFNGKFRDECLNENWFISLDDVRRKVEAYRVDYNEVRPHSSLDNRTPNELAHSLTGLA 276 Query: 307 SGNT 310 S Sbjct: 277 SSAA 280 >UniRef50_C6P7L8 Integrase catalytic region n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6P7L8_9PROT Length = 286 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 57/307 (18%), Positives = 98/307 (31%), Gaps = 41/307 (13%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 ++ F+ ++ + SLCR +S + Y W R P S + Sbjct: 1 MKYAFIQKY-ENEYRVSSLCRVMQVSRSGYYTWRDR---------------PAKSDAPQN 44 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT- 133 + ++ + R+ + +GA+K L+ +G T S V L + G+ T Sbjct: 45 ELLSQIRRVHMQSRQAYGAKKTWLALKSRGVTCGKHS-VARLRKQAGIEARRKRRFRITV 103 Query: 134 --------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 +F D PNR+W D + G + LLD +SR + + Sbjct: 104 ENHATAPAAPNLVQQQFRVDHPNRIWVGDMT-YIRTRQGWLYLAILLDLYSRRVVGWSMS 162 Query: 180 TDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 + L E+ D G + + GI+ S Sbjct: 163 DRPDLALILNALDMALEQRQPRAGLIHHTDQGPIYAARK-----YRERMAAHGIQPSMSA 217 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPG 296 + E F +LK EV+ F + A + YN R H++L P Sbjct: 218 KGNAYDNAVAESFFGNLKNEVIHHIDFESRDTARAAVFDYIELFYNRSRMHQSLGYVSPV 277 Query: 297 SRYQPSA 303 + Sbjct: 278 EFERSMC 284 >UniRef50_A5WDE3 Integrase, catalytic region n=13 Tax=Moraxellaceae RepID=A5WDE3_PSYWF Length = 284 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 52/299 (17%), Positives = 95/299 (31%), Gaps = 39/299 (13%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 I+ + F +S + Y WL+R + NR + + D Sbjct: 7 KYCIQRMATVFEVSLSGYYDWLKRGMSKR-----------KQHHNRCELLVKS---AHMD 52 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT------------- 133 + +G ++ + L QGH + + V + HG+ T Sbjct: 53 TQQSYGHERLHQHLTSQGHDI-SLYMVRQIKQEHGIYCKRHKRSKVTTDSNHNKPVYPNL 111 Query: 134 --GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 +F+ APN W D + G + D +S+ + A + V + L Sbjct: 112 LEQQFDVAAPNIAWVSDIT-YIWTNEGWVYLAAFKDLYSKEIVGYALNKRMTADLVCEAL 170 Query: 192 VSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLER 249 + + + D GS + + + G SR + +E Sbjct: 171 NNAIKYKRPARGLIVHSDRGSQYCS-----HQYRQIIDKYGFAGSMSRKGNCYDNAPIES 225 Query: 250 FHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSARQYS 307 F LK E++ K + E + + YN +R + L P +Q RQ + Sbjct: 226 FWGQLKNELIYHKVYETRDEAIKDVVRYIEIFYNRQRIQKGLGFKSPTQVFQDFYRQAA 284 >UniRef50_A3DCZ2 Integrase, catalytic region n=10 Tax=Clostridium RepID=A3DCZ2_CLOTH Length = 278 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 51/287 (17%), Positives = 94/287 (32%), Gaps = 37/287 (12%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + +I ++ + Y N I ++ Sbjct: 6 EKKLSITRQAELLSLNRTSVYY-------------------KPAPVNEEEYLIKRIIDEI 46 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL---LPGASPGIPAT-------- 133 + + +G R++ L H M G+ PG + Sbjct: 47 YASYPEYGYRRMTSILNKDYHIHINRKRTRRYMREMGIHGFCPGPNLSKRIHGKNLYPYL 106 Query: 134 -GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 + D PN++W +D + G + + ++D +SR+ + + V + + Sbjct: 107 LRNLKIDHPNQVWSIDVT-YCRMKRGFMYMVAIIDWYSRYIVGFELSNTLDKTFVIEAIQ 165 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 +RYG P+ M D GS + L GI++ ++ERF R Sbjct: 166 KAIKRYGKPEIMNSDQGSQFTSDD-----YINLLKNNGIKISMDGKGRALDNQRIERFFR 220 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 S K E L + +L++ + YN RPH++LD P Y Sbjct: 221 SYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPHQSLDYKTPAEYY 267 >UniRef50_C7PFG8 Integrase catalytic region n=3 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PFG8_CHIPD Length = 274 Score = 202 bits (515), Expect = 2e-50, Method: Composition-based stats. Identities = 53/286 (18%), Positives = 84/286 (29%), Gaps = 41/286 (14%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWG 92 +C+ F +S + Y WL R +P + SD I A+ R+G Sbjct: 1 MCKVFKVSRSGYYAWLIR-----------KPSKQAIENHALSDRIEAI---YRSGKGRYG 46 Query: 93 ARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP----------------ATGRF 136 + K+ R L+ G + + V +M GL F Sbjct: 47 SPKVTRVLKSDGIHV-SQRRVARIMRSKGLRSVIVGKFKVCTTDSNHDKEVSSNILNREF 105 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR-ETVQQQLVSVF 195 +P+ W D + G + ++D R L A ETV Sbjct: 106 TATSPSEKWVSDIT-YIRTKSGWLYLTVIMDLFDRKILGWAMSKGMTAAETVVAAWKMAI 164 Query: 196 ERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 + + + D G + + L IR SR + E F + Sbjct: 165 RNRSVKEHMIIHSDRGVQYAS-----HEFRILLKSGQIRQSMSRKGNCWDNSVCENFFKI 219 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 LK+E + +R + YN R H L P Sbjct: 220 LKSETGINIIYDSFEVARREIFEFIEIWYNRRRIHSRLGYMTPEQF 265 >UniRef50_D0KDH0 Integrase catalytic region n=3 Tax=Gammaproteobacteria RepID=D0KDH0_PECWW Length = 280 Score = 202 bits (515), Expect = 2e-50, Method: Composition-based stats. Identities = 52/288 (18%), Positives = 87/288 (30%), Gaps = 41/288 (14%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD-RH 88 ++ +C +SP W R S D ++ L H Sbjct: 11 VKQICLLLNVSPRGYQSWRNRTLS---------------QRQLSDDILSQRLIQLHRDSR 55 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------------- 133 +G R+++ L+D+ S + LM + GL T Sbjct: 56 GTYGIRRLQSDLQDE-QRFHGKSRISRLMKQCGLKAANQTRYKVTTNSQHDYPIAPNLLN 114 Query: 134 GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 F A + W D + G + T++D +SR + + + + V L Sbjct: 115 REFSPAAADVAWATDIT-YIRTEEGWLYLATVIDLYSRRIIGWSLSKRLKTQVVIDALEM 173 Query: 194 VFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 + P D GS + L +R S +E F+ Sbjct: 174 AIRQRKPTRPVITHSDRGSQYASYR-----YRDVLKDNDLRCSMSGKGCCYDNAVMESFY 228 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 +LK E+++GK F + A + YN R H L P Sbjct: 229 HTLKTELMRGKAFVSREQAMNALFDYIEVFYNRRRKHSTLGYQTPVDY 276 >UniRef50_C8WWR8 Integrase catalytic region n=1 Tax=Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 RepID=C8WWR8_ALIAD Length = 271 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. Identities = 52/287 (18%), Positives = 95/287 (33%), Gaps = 26/287 (9%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 V +++G + + + G++ Y L+ + D+ + Sbjct: 1 MVRQLAKEGFPVPVIAKALGLNRTYCYSLLKPPVPKPKRPPVDK-----------DALVK 49 Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI----PATG 134 +R + +G R+I+ L + + V+ LM GLL A G Sbjct: 50 QWIRRLCEEFPTYGYRRIQVMLRRRYNLRVNHKRVYRLMKEMGLLVKAPKRGASRTKRRG 109 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 + N +Q D + G + ++D + R + + R E + + Sbjct: 110 KIPVTRSNEHFQCDMTKVWCGKDGWGYLFAVIDAYDREIVGYSFSRFCRTEDLLNAVDRA 169 Query: 195 FERY------GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 G + DNG + I + +P G +E Sbjct: 170 LNYRFPNGVQGAGLTLRTDNGCQMTSRR-----FIEAMKACQINHERTGFNNPDADGYIE 224 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 RF RSLK E + + ++ E + + + YN ERPH AL P Sbjct: 225 RFFRSLKEEEVWLQEYSSFAEAKAGIESYIHFYNTERPHSALGYRSP 271 >UniRef50_Q1NW03 Integrase, catalytic region n=7 Tax=Proteobacteria RepID=Q1NW03_9DELT Length = 447 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 74/346 (21%), Positives = 124/346 (35%), Gaps = 21/346 (6%) Query: 43 TGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLED 102 T WL+++ ++G GL + R ++ LL + + R++ D Sbjct: 59 TIRDWLKKYRKDGFNGLLPKGRNDKGRSRSLPPEVADLLIATKEENPELSIRQVIAATAD 118 Query: 103 QGHTMPAFSTVHNLMARHGLLPGAS--PGIPATGRFEHDAPNRLWQMDFKGHFPFG---- 156 + PA STVH L+A GL+ P RF + L+ D Sbjct: 119 RIPVQPAPSTVHALLAGKGLMKKKGEDPDSKDHRRFSYQFAGDLFMCDVMHGPTVRTSGN 178 Query: 157 -GGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDT 215 + + + +DD +R A E R G+P R+ +DNG+ + Sbjct: 179 KRRKTYLIAFIDDATRVIAFAAFAMSESTADFMTVFKQTIIRRGIPLRLFVDNGAAFRSQ 238 Query: 216 TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWF---ADSGELQR 272 L L +LGI + H+R YH Q +GK+ER+ R+++ + L L R Sbjct: 239 H-----LALVCAKLGITLIHARAYHAQAKGKIERWFRTIRLQFLPLLDPASTDSLEALNR 293 Query: 273 AFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGV---MVRKVDISGK 329 A + + H L P R+ + + D+ RKV Sbjct: 294 ALWSYVEMEYHRNHHRMLG-ETPLDRWARLGHKVRYPEPGLDLDDLFLFEAKRKVHKDRT 352 Query: 330 LSVKGVSLSAGKAFRGERVGLKEMQED--GSYEVWWYSTKVGVIDL 373 +S+ ++ A GE V L+ +VW + V Sbjct: 353 VSLNTLAYEVDAALVGETVTLRFDPSRPGEPVQVWHQGSFVHTAKP 398 >UniRef50_A5B5S8 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5S8_VITVI Length = 1494 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 46/272 (16%), Positives = 101/272 (37%), Gaps = 26/272 (9%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L H+ + ++K + G + P+ + M R Sbjct: 943 EQEQQGILSHCHESACGGHFASQKTTMKVLQSGFSWPSLFKDAHTMCR--SCDRCQRLGK 1000 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF FP + L +D S++ + ++ R Sbjct: 1001 LTQRNQMPMNPILIVDLYDVWGIDFMRPFPMSFSNSYILVGVDYVSKWVEAIPCKHNDHR 1060 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+G+P + D G+ + + E L + G+ + PYHPQ Sbjct: 1061 VVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGVEHKVATPYHPQIF 1115 Query: 245 GKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVPG-SRYQ 300 G++E +R +K +++ + +L + +RT Y L M+ S ++ Sbjct: 1116 GQVELANREIKNILMKVVNTSRRDWSVKLHDSLWAYRTAYKTI-----LGMSPYRLSLWK 1170 Query: 301 PSARQYSGNTTPPEYDEGVMVRKVDISGKLSV 332 + G ++ +V ++G + + Sbjct: 1171 SMLHIFPGKLKSRWIRLYIIH-QVHLNGVVEL 1201 >UniRef50_A4Z1R9 Putative transposase, probably encoded by an unidentified IS element protein n=4 Tax=Bradyrhizobium RepID=A4Z1R9_BRASO Length = 285 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 62/288 (21%), Positives = 97/288 (33%), Gaps = 43/288 (14%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE 89 ++ LC G+ AT Y+ L R S + ++ L+ +H Sbjct: 12 VQRLCALAGLPRATYYRHLNR-----------------RSRAEAECELRDQLQRICLKHP 54 Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG--IPATGRFE---------- 137 +G R++ L G + V LM + LL P T R Sbjct: 55 FYGYRRVTAALRRLGMAV-NAKKVLRLMRQDNLLAQRKTPFLKPPTERPTDVIVVPNLIR 113 Query: 138 ---HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 AP+++W D + + +LD SR ++ A L Sbjct: 114 GLAPSAPDQIWVADIT-YVHLAKTFAYLAVILDGFSRKAVGWAFDNTLDASLAIAALDKA 172 Query: 195 FERYGLPD---RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 + D G + A L I++ S P +P K E F Sbjct: 173 LKSRNPKPGSLIHHSDRGVQYASI-----AYRQRLADREIKISMSSPGNPFDNAKAESFM 227 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPGSR 298 ++LKAE + GK FAD + +R + + +YN ER H AL P Sbjct: 228 KTLKAEEVNGKTFADVNDARRRINSFIAEIYNKERLHSALGYRSPLEF 275 >UniRef50_Q0RW72 Possible transposase n=13 Tax=Actinomycetales RepID=Q0RW72_RHOSR Length = 299 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 60/307 (19%), Positives = 92/307 (29%), Gaps = 40/307 (13%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 + R +F+ + ++ LCR G S + YK L Sbjct: 3 TRRWDFISDHRAE-FGVQRLCRALGTSRSAYYKHLITEPA-------------RIERQAE 48 Query: 74 SDDITALLRMAHDRHE-RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 A +R H H +GA ++ L +G + V LM H ++ Sbjct: 49 EAATVAEIRAIHTEHRSAYGAPRVHAELRSRGRKI-NRKRVTRLMRIHHVVGRHLRRSKR 107 Query: 133 T---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLA 177 T F A + W D + P G T++D SR + + Sbjct: 108 TTIADKSAPSVPDLVMRDFTATAVDTKWCGDIT-YIPVGSSWLFLATVIDICSRRVVGWS 166 Query: 178 HCTDERRETVQQQLVSVFERYGL---PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 R E V + G D GS + T + R GI+ Sbjct: 167 IADHMRTELVTDAIEMAVRTRGGDVGGVIFHSDRGSQY-----TAASFVDVCRRHGIQQS 221 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 R E F + LK E L G+ + + + W + +N R H AL Sbjct: 222 RGRVGSSYDNALAESFFQGLKREWLHGRSWTSKSQARLELFEWLSYFNRRRRHSALGYLT 281 Query: 295 PGSRYQP 301 P Q Sbjct: 282 PVEFEQR 288 >UniRef50_A4J392 Integrase, catalytic region n=22 Tax=Clostridia RepID=A4J392_DESRM Length = 459 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 68/353 (19%), Positives = 123/353 (34%), Gaps = 28/353 (7%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 SP T WL + + G L+ R + S +I +R + R + + Sbjct: 51 YSPKTLMCWLSDYRRGGLDSLKPGYRSDKGKSRKVSLEIADEIRKKRSQMPRITSALLYE 110 Query: 99 WLEDQGHTMP---AFSTVHNLMARHGLLP----GASPGIPATGRFEHDAPNRLWQMDFKG 151 L +P + +T + + + L +PG RF H N LWQ D Sbjct: 111 ELVKDKVILPEKLSRATFYRFLVANPELAAGKDPENPGEKELKRFSHQRINELWQTDIMF 170 Query: 152 HFPFGGGRC----HPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMD 207 G+ + + +DD SR + ++ L + G+P + D Sbjct: 171 GPYISIGKSKKQAYLIAFIDDASRLITHAQFFFFQNFVALRVALKEAVLKRGIPKMIYTD 230 Query: 208 NGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFA-- 265 NG + LG + H+ P+ P ++GK+ERF +++ L Sbjct: 231 NGKVYRSDQLNM-----LCAGLGCSLIHTEPFTPTSKGKIERFFHTVRQRFLSRLDPTKL 285 Query: 266 -DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG---VMV 321 +L F W + H AL+M P + + P +E + Sbjct: 286 KSLDQLNLYFWQWLEEDYQCKTHSALNM-SPLDFFMAQVHNINFLANPQLLEEHFLLRVT 344 Query: 322 RKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE-----DGSYEVWWYSTKVG 369 RKV+ LSV+ + ++ R+ ++ + + ++ KVG Sbjct: 345 RKVNHDATLSVESILYETEQSLANSRLEVRYDPDWLANSNQPILLYRDGMKVG 397 >UniRef50_C1DTA7 Putative transposase n=2 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DTA7_SULAA Length = 327 Score = 201 bits (511), Expect = 4e-50, Method: Composition-based stats. Identities = 69/303 (22%), Positives = 123/303 (40%), Gaps = 24/303 (7%) Query: 10 RDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHS 69 R +++ QD NI CR FGIS T YKW +R+ ++G GL DRP+ P ++ Sbjct: 29 TKEAKKRLKWIQ-HYQDTKNISKTCRYFGISRTTFYKWFERYKKDGLEGLLDRPKTPKNT 87 Query: 70 PNRSSDDI-TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 + + + ++ W KI +L+++ + + STV+ ++ GL+ Sbjct: 88 RKPTIRNQYREQIIKVRKQNPTWSKEKISAYLQEEKNIKVSPSTVYKVLKEEGLIERTKS 147 Query: 129 GIPATGR------------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 R + AP + Q+D K G + T +D +SRF Sbjct: 148 IKIQNKRKKSIKKKRTKRGLQAQAPGDVVQIDVKH-LNIAGATYYQFTAIDKYSRFCFA- 205 Query: 177 AHCTDERRETVQQQLVSVFERYGLP-DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + + ++ + + E + R+ DNGS + +L +G+ Sbjct: 206 RVYESKNSKKTKEFYIELNEYFEFEIKRVQTDNGSEFLG------EFNKYLTDIGVEHYF 259 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFA-DSGELQRAFDHWRTVYNLERPHEALDMAV 294 S P P+T G +ER R+++ E+ + E+ + + YN RPH +L Sbjct: 260 SYPRSPKTNGVVERLIRTIEEELWLIEGLDYTLEEMNKKLRKYVRKYNFIRPHHSLGYKR 319 Query: 295 PGS 297 P Sbjct: 320 PAD 322 >UniRef50_C4YYX7 Integrase catalytic region n=2 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YYX7_9RICK Length = 280 Score = 200 bits (510), Expect = 5e-50, Method: Composition-based stats. Identities = 52/300 (17%), Positives = 101/300 (33%), Gaps = 37/300 (12%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 +++R V +I C I+ ++ Y + +G Sbjct: 1 MDLAIRKNMVDKDC-SNLSIARQCTLLFINKSSYY-----YKPQGL-------------- 40 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAF--STVHNLMARHGLLPGASP 128 + +I ++ + +H +GAR++ L G T+ S + +MA + P + Sbjct: 41 TQKDLEIMQVIDEIYTQHPYFGARRMSEHLVPFGITIGREAVSRYYRIMAIEAIYPKMNL 100 Query: 129 GIPATGRFEHDAP---------NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 + N++W D + G + + ++D SR+ + Sbjct: 101 SKRNQAHKIYPYLLKGVEIIKVNQVWSTDIT-YIRMAQGFVYLVAIIDRFSRYIVSWKVS 159 Query: 180 TDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 + L +YG P+ D GS + L++ I++ Sbjct: 160 ISLESDFCIDALEEAIIKYGQPEIFNTDQGSQFTS-----KNFTDKLIKREIKIIMDGKG 214 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 +ERF RSLK E + E + A + YN +R H++L+ P Y Sbjct: 215 RALDNVFIERFWRSLKQEKIYLMVLNTVKEAKNAITDYINFYNRKRMHQSLEYLTPEQVY 274 >UniRef50_A6VYF3 Integrase catalytic region n=14 Tax=Bacteria RepID=A6VYF3_MARMS Length = 290 Score = 200 bits (510), Expect = 5e-50, Method: Composition-based stats. Identities = 54/285 (18%), Positives = 95/285 (33%), Gaps = 38/285 (13%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 +I+ C I+ +T Y + G + ++ L+ H Sbjct: 24 QLSIKRQCELLNIARSTAY-----YQPIGL--------------STEEIELRRLIDEIHL 64 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT------------G 134 ++ G+R+I+ L + H + + +M G+ Sbjct: 65 QYPYMGSRRIRTELAKKDHHV-NRKRIVRIMRDMGIGAIYPKPKTTVTNQAHKVYPYLLR 123 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 + PN+ W +D + P G + + ++D +SR L + L Sbjct: 124 DIKVTYPNQAWAIDIT-YIPMAKGFLYLVAIIDWYSRKVLSWRLSNTMDVSFCIEALEEA 182 Query: 195 FERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSL 254 + YG PD D GS + T L+ G+R+ +ER RSL Sbjct: 183 LKHYGPPDIFNSDQGSQFTS-----TEFTQKLLDHGVRISMDGKGRWVDNVFIERLWRSL 237 Query: 255 KAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 K E + K + E + H+ YN R H+ L+ P Y Sbjct: 238 KYEEVYLKAYTTPREAELEISHYMVFYNEARHHQGLNELTPDEVY 282 >UniRef50_A4SIH8 IS3-family transposase n=42 Tax=Proteobacteria RepID=A4SIH8_AERS4 Length = 387 Score = 200 bits (509), Expect = 8e-50, Method: Composition-based stats. Identities = 59/292 (20%), Positives = 99/292 (33%), Gaps = 39/292 (13%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 ++ A I +C F + + Y +L R ++ R L R+ Sbjct: 107 LREQAPITLVCCAFDVPKSCFYDYLARKRTINRERMKQRS---------------ELRRL 151 Query: 84 AHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP------------ 131 + + G+R + + + G+ + V NLM GL Sbjct: 152 FKESRDSAGSRALMSMMRELGYQIG-RFKVRNLMKEAGLASKQPGAHRYKVACSERPDIP 210 Query: 132 --ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQ 189 F+ PN++W D + + +LD H+R + A E + Sbjct: 211 NLLAREFDVPQPNQVWCGDIT-YVWTSARWHYLAVVLDLHTRRVVGWAMSDKPDAELAIK 269 Query: 190 QLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 L +++ G P + D GS +G A L R + SR + + Sbjct: 270 ALEMAYQQRGCPSGVLFHSDQGSQYGSR-----AFRQRLWRYRMTQSMSRRGNCWDNAPM 324 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHW-RTVYNLERPHEALDMAVPGSR 298 ER RSLK+E L + E +R ++ YN RPH+ D P Sbjct: 325 ERLFRSLKSEWLPATGYVSLREAKRDISYYLMDYYNWRRPHQHNDGIPPAEA 376 >UniRef50_Q2AA50 Retrotransposon gag protein n=6 Tax=Asparagus officinalis RepID=Q2AA50_ASPOF Length = 1788 Score = 200 bits (508), Expect = 9e-50, Method: Composition-based stats. Identities = 53/265 (20%), Positives = 104/265 (39%), Gaps = 26/265 (9%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFST----VHNLMARHGLLPG-- 125 +++ ++L H++ +G RK + G P R LL Sbjct: 1348 TEETRSVLSFCHEQACGGHFGPRKTAEKVLQSGLYWPTLFKDSFEFCKTCNRCQLLGKVT 1407 Query: 126 ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 +P + LW +DF G FP G + L ++ S++ +A T++ + Sbjct: 1408 RRNMMPLQPILSVE-LFDLWGIDFMGPFPNSFGNVYILVAVEYMSKWVEAVACKTNDNKV 1466 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V+ ++F R+G+P + DNG+ + + + E + + I S PYHPQT G Sbjct: 1467 VVKFLKENIFARFGVPRAIISDNGTHFCNR-----SFEALMRKYSITHKLSTPYHPQTSG 1521 Query: 246 KLERFHRSLKAEV---LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 ++E +R +K + + S +L A +RT + L M+ Y+ Sbjct: 1522 QVEVTNRQIKQILEKTVNHNRKDWSVKLCDALWAYRTAFKAN-----LGMSP----YRLV 1572 Query: 303 ARQYSGNTTPPEYDEGVMVRKVDIS 327 + E+ +++++ Sbjct: 1573 FGKACHLPVELEHRAMWAIKQLNFD 1597 >UniRef50_C4UEN4 Transposase n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UEN4_YERAL Length = 281 Score = 200 bits (508), Expect = 9e-50, Method: Composition-based stats. Identities = 56/296 (18%), Positives = 102/296 (34%), Gaps = 30/296 (10%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 ++ + + G ++ + C+ +S A+ Y+ Sbjct: 1 MVITEKKSCAGLLTASGLSVITACKLTSLSRASFYR-------------------RGTDW 41 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + ++ + G K L +G V+ + R GL Sbjct: 42 REKDKVVIDAIQAVLSESPQAGFWKCYYRLRFKGFIF-NHKRVYRVYCRLGLNLKRRIKK 100 Query: 131 PATGRFEHDA-----PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 R P+ W +DF + G R L ++D+ +R L + T + Sbjct: 101 TLPKRENKPLSIVNLPDIQWALDFMHDALYCGKRFRTLNIIDEGTRECLAIEVDTSLPTD 160 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V + L + + GLP ++ +DNG L + I + H +P PQ G Sbjct: 161 RVIRVLDRLKKERGLPQQLRVDNGPELISVN-----LLNYCEYNHITLCHIQPGKPQQNG 215 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 +ERF+ S + E L F +++ W+ YNL R HE+L P + + Sbjct: 216 FIERFNGSFRREFLNAYLFESLSQVREMAWFWQQDYNLNRTHESLGHLPPETYRKQ 271 >UniRef50_D1PJY7 ISMca2, transposase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PJY7_9FIRM Length = 274 Score = 200 bits (508), Expect = 9e-50, Method: Composition-based stats. Identities = 54/296 (18%), Positives = 86/296 (29%), Gaps = 41/296 (13%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE 89 + +CR +S + Y+WL R + ++ L H R+ Sbjct: 1 MELVCRLLEVSRSGYYEWLGRKPS---------------LRRQKDQELKRRLLSLHQRYP 45 Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT---------------G 134 G + + Q + +H LM + AT Sbjct: 46 ALGLDSLYHLIRPQ--LSCSRKRIHRLMNEMNISSTRRRAYKATTNSRHAHPIAPNLLAR 103 Query: 135 RFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV 194 RF D P+ W D + P G G + + D ++ + A L Sbjct: 104 RFSFDKPDTAWVGDIT-YIPTGEGWLYCAVVKDLCTKQIVGYAFSDRIDTNLTLAALGMA 162 Query: 195 FERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 R D G + A L LGIR SR P E F Sbjct: 163 VRRRKPLPGLIFHSDRGVQYA-----AYAYRQRLASLGIRQSMSRKGDPYDNAVAENFFS 217 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSARQYS 307 LK E + + FA + + YN RPH ++ P + + + + Sbjct: 218 CLKCECVHLRHFASRAQAMADVFAYIETFYNPVRPHSSIGWRPPDAFARALSEHPA 273 >UniRef50_A6Q4E4 Transposase n=2 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q4E4_NITSB Length = 271 Score = 200 bits (508), Expect = 9e-50, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 88/300 (29%), Gaps = 42/300 (14%) Query: 17 TEFVLFASQDGANIR--SLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 + +G +I LC F + + Y P + Sbjct: 1 MQVQSEMRNEGYSISITKLCSLFDLPRRSFYY------------------KPIKRMQKLD 42 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG 134 + ++ ++ +G R++ L V ++ Sbjct: 43 EGRVKKVKEMIEKFPTYGYRRLALLLG------MNKKAVQRILQLKSWQVRKRSKGHRPR 96 Query: 135 RFEHD----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 PN+ W +D + G G ++D ++R + + T + Sbjct: 97 AKMMPSRSHYPNQRWAIDMTRVYSSGDGWSTLACVIDTYTREIVGWRLSKSGKATTAEAV 156 Query: 191 LVS-VFERYGL------PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 L + R+G P + DNG + + PY P+ Sbjct: 157 LQEGLIYRFGKLKRLQEPIILRSDNGLVFSS-----KSFTKTAQDYNFTQEFITPYTPEQ 211 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 G +ERF R++K E + F E + W YN +R H AL P ++ A Sbjct: 212 NGMIERFFRTIKEECIWHYNFKSLKEANKIIGEWINFYNQKRKHSALQYKTPAEVFRLVA 271 >UniRef50_B3E8B6 Integrase catalytic region n=1 Tax=Geobacter lovleyi SZ RepID=B3E8B6_GEOLS Length = 269 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 56/249 (22%), Positives = 91/249 (36%), Gaps = 12/249 (4%) Query: 59 LQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMA 118 L R + + + +R + R+G +I L +G + V+ + Sbjct: 20 LLCRGSYRYVHHGKDDSALRLRIRQIAETRIRYGYLRIHTLLRREGWHV-NHKRVYRIYC 78 Query: 119 RHGLLPGASPGIP------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF 172 L R + N W MDF F G R LT++D+ SR Sbjct: 79 EECLNLRRKRPRRRVSAAHRANRPVASSLNDSWSMDFVADSLFNGRRFRALTVVDNWSRQ 138 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIR 232 L + + + V + + + P R+ +DNGS + +L+ W G+ Sbjct: 139 CLAIRVDQAMKGDDVVDAMSELTQIRNCPKRIFLDNGSEFIS-----KSLDRWAYENGVT 193 Query: 233 VGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDM 292 + SRP P +E F+ S + E L WF + ++ + WR YN RPH AL Sbjct: 194 LDFSRPGKPTDNALIESFNGSFRDECLSVNWFLSMDDARQKIEDWRQEYNDFRPHTALKN 253 Query: 293 AVPGSRYQP 301 P Sbjct: 254 LTPNEYANQ 262 >UniRef50_C1F5F4 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F5F4_ACIC5 Length = 296 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 75/299 (25%), Positives = 116/299 (38%), Gaps = 35/299 (11%) Query: 13 MSLRTEFVLFASQ-DGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 S R V + G R CR + AT R+ L R Sbjct: 3 PSGRGPMVEHLERMHGVAERRACRVLCVPRATY-----RYRSC----LDPR--------- 44 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG-- 129 ++ +R R+G RKI+ L +G + + V+ L GL Sbjct: 45 ---TELRMRIREIAQSRVRYGYRKIRVLLNREGWNVGRYL-VYPLYCEEGLCLQRMRPAG 100 Query: 130 -----IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 +F+ AP++ W MDF GG R LT++D ++R ++ + + Sbjct: 101 KHKASRSRAEKFKATAPDQAWSMDFVSDQLQGGTRFRSLTIVDVYTREAVVIEAGQSLKG 160 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 E V + L V + G+P + DNGS + A++LW R ++ SRP P Sbjct: 161 EDVVRTLNRVKQERGVPKILFCDNGSEFTSQ-----AMDLWAYRNNTKIDFSRPGKPTDN 215 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 +E F+ + ++E L WFAD E + + WR YN RPH +L P A Sbjct: 216 AFVEGFNGTFRSECLNTHWFADLREAKVLIEAWRKEYNESRPHASLADRTPSEFASQYA 274 >UniRef50_Q1GCB4 Integrase catalytic region n=29 Tax=Alphaproteobacteria RepID=Q1GCB4_SILST Length = 265 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 66/277 (23%), Positives = 99/277 (35%), Gaps = 33/277 (11%) Query: 27 GANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHD 86 +IR C P T + R P + +R Sbjct: 6 QVSIRRACEVIRFDPRTYRY---------------KSRRPGQ------AALEQRIREICQ 44 Query: 87 RHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG------RFEHDA 140 R+G R++ L +G + A T + + G+ + R E + Sbjct: 45 TRVRFGYRRVHVLLRREGWEINAKKT-YRIYKELGMQLRSKTPKRRVKAKLRDDRKEAVS 103 Query: 141 PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL 200 PN +W MDF G + LT++D SR+ L R E V L V +R G Sbjct: 104 PNDVWAMDFVHDQLATGWKLRVLTVVDTFSRYVPVLDARFTYRGEDVVATLEQVCKRTGY 163 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 P + +D GS + ++LW + + SRP P +E F+ +AE L Sbjct: 164 PATIRVDQGSEFIS-----KDMDLWAYANDVTLDFSRPGKPTDNAFIEAFNGRFRAECLN 218 Query: 261 GKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 WF + + WR YN ERPH A+ VP Sbjct: 219 AHWFMSLEDAAEKLEAWRRDYNEERPHGAIGNKVPAD 255 >UniRef50_A5C747 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5C747_VITVI Length = 615 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 86/217 (39%), Gaps = 28/217 (12%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF FP G + + +D S++ + ++ R ++ ++F R+G P Sbjct: 24 FDVWGIDFMRPFPISFGYSYIIVGVDYVSKWVEAILCRYNDHRIVLKFLKENIFLRFGAP 83 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + E L + G++ + PYHPQT G++E ++ +K +++ Sbjct: 84 KVIISDEGAHCCN-----KPFETLLAKYGVKHKVATPYHPQTSGQVELVNKEIKNILMKV 138 Query: 262 KWF---ADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 S ++ + +RT Y L M Y+ + EY Sbjct: 139 VNASRKDWSVKIHDSLWAYRTTYKTI-----LGMLP----YRLVYGKGCHLPMEVEYKAW 189 Query: 319 VMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 ++K++ + KA + L EM+E Sbjct: 190 WAIKKLN-----------MDLNKASMKRFLDLNEMEE 215 >UniRef50_D0BAE7 IS3 family transposase (Fragment) n=153 Tax=Bacteria RepID=D0BAE7_BRUME Length = 301 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 49/306 (16%), Positives = 92/306 (30%), Gaps = 47/306 (15%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 ++ F+ + ++ LC G+S + Y W R L D Sbjct: 1 MKFAFI-DTEKAHMSLSRLCAFAGVSISGYYAWKHRLPSRRQ--LDDMSI---------- 47 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT- 133 + + E +G+ ++ L ++G T LM +GL T Sbjct: 48 --LAHIRNQFALSRETYGSPRMHVELNEEGIRAGRHRT-ARLMRENGLKARQKTRFKRTT 104 Query: 134 --------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 F D P++ W +D + G + ++D +SR + Sbjct: 105 DSNHGEPVAPNLLDQDFTCDRPDQKWGVDI-SYIWTAEGWLYLAIVVDLYSRRIIGWEAR 163 Query: 180 TDERRETVQQQLVSVFE-RYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 +++ L RY P + D GS + L + S Sbjct: 164 DRMKKDLAICALKKAIAIRYPKPGLIQHSDRGSQYASY-----EYRKILKSHSMLPSMSG 218 Query: 238 PYHPQTQGKLERFHR-------SLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEA 289 + +E + ++K+E++ F + +A + YN R H Sbjct: 219 KGNCYDNAMVETVFKTIKSELITIKSELIWRAAFQTRNDAIKAIGKYIDGFYNPVRRHST 278 Query: 290 LDMAVP 295 L P Sbjct: 279 LGYKSP 284 >UniRef50_C2GDW5 IS3514a transposase n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GDW5_9CORY Length = 322 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 67/257 (26%), Positives = 99/257 (38%), Gaps = 10/257 (3%) Query: 17 TEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD 76 V ++ + RF S Y L+R+ + G ++ R P P + S++ Sbjct: 8 LAIVKAITEQHLTVSEASVRFKRSRQWIYTLLRRYEEGGPEAVKPRSTAPKTHPTKVSEE 67 Query: 77 ITALLRMAHDR----HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + + G I LE +G PA ST+ ++ ++G++ P Sbjct: 68 VIKQIIKIRRELASKGADNGPETIAWVLEQRGFHAPAESTIRRILTKNGMVTPQPKKRPK 127 Query: 133 T--GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 RFE PN WQ D G L LDDHSRF L L TV Sbjct: 128 AYLRRFEATLPNECWQADVTSTRLLNGQVVEILDFLDDHSRFLLYLGAYKRVAGPTVVTA 187 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWT----ALELWLMRLGIRVGHSRPYHPQTQGK 246 ++ ++YG P DNG + E +L + I + R HPQTQGK Sbjct: 188 AETITKKYGFPQSTLTDNGLVFTARLAGAKGGKNGFEKFLEKHSILQKNGRAGHPQTQGK 247 Query: 247 LERFHRSLKAEVLQGKW 263 +ERFH++LK Sbjct: 248 IERFHQTLKNGSAHDHP 264 >UniRef50_A8LD08 Integrase catalytic region n=4 Tax=Frankia RepID=A8LD08_FRASN Length = 296 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 55/301 (18%), Positives = 91/301 (30%), Gaps = 39/301 (12%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 M+ + E + A +D + +C+ +S + Y W +R A Sbjct: 1 MTDKYELI-AAEKDNYPVTKMCQWLVVSTSGFYDWHRRPASTRT-----------RRHTT 48 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + A+ + +GAR+I L G + + +M R GL Sbjct: 49 LDTHVRAV---FAASRQTYGARRIAAALTASGLRV-SVRLARRIMRRAGLEACQPRAYRR 104 Query: 133 T--------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 T F P D + G G + T++D +SR + Sbjct: 105 TTIPGQAPAPADHVRRDFTATRPGEKLIGDIT-YIRTGEGWLYLATVIDCYSRKVAGWSM 163 Query: 179 CTDERRETVQQQLVSVFERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 T R E + R L D G+ + A L R G+R Sbjct: 164 ATHLRTELIIDAFTMAASRTTLAPNALFHSDRGAQYTSD-----AYHRVLKRHGVRPSVG 218 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVP 295 E F +LK E++ + + + A + YN +R H L P Sbjct: 219 ATGVCWDNSAAESFFGTLKNELVYRENYPTRRAARTAIAEYIEVFYNRQRLHSTLGYRTP 278 Query: 296 G 296 Sbjct: 279 E 279 >UniRef50_C1A8I3 Putative transposase orfB for insertion sequence element n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A8I3_GEMAT Length = 295 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 68/292 (23%), Positives = 104/292 (35%), Gaps = 30/292 (10%) Query: 9 ARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHH 68 AR ++R +Q G ++ CR G++ A Y P Sbjct: 10 ARARPAMRQVATTLVTQHGLSVVRACRIAGLARAAYY-------------------TPLS 50 Query: 69 SPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 ++ A L RWG K L GH + VH + L Sbjct: 51 DRVERDAEVIAALTTLAAARPRWGFWKCFDRLRLDGHGW-NWKRVHRVYCALRLNLPRRT 109 Query: 129 GIPATGRFE-----HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 R + N +DF + G R L +LD+ +R +L + T Sbjct: 110 KRRLPQRVQQPLDAPPQLNHTRALDFMHDMLYDGRRFRTLNVLDEGNREALAIEVSTSLP 169 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 V L + +G P + DNG AL W + G+R+ H +P P Sbjct: 170 GTRVVSVLEQLLAIHGAPCTIRCDNGPELIS-----HALTTWCEQHGVRLQHIQPGKPNQ 224 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 +ERF+R+ + EVL FA +++ + W YN ERPH++L P Sbjct: 225 NAYIERFNRTYRREVLDAYIFASLAQVRAETETWLMTYNTERPHDSLGGVPP 276 >UniRef50_B4X146 Integrase core domain protein n=3 Tax=Alcanivorax sp. DG881 RepID=B4X146_9GAMM Length = 270 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 57/290 (19%), Positives = 94/290 (32%), Gaps = 40/290 (13%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE 89 +R LC+ G+S + Y W R E ++ + + H Sbjct: 1 MRRLCQLLGVSRSGYYAWRSRSESE---------------RSQYDGRLKEAMVELHQGFR 45 Query: 90 R-WGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI---PATGRF--------- 136 R +GAR++ + L G T + V LM G+ + P F Sbjct: 46 RAYGARRLHQQLRKNGFT-CSVRRVSRLMKEAGIHASSKGLYVWNPGRHEFYSSAGNVLG 104 Query: 137 ---EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 D+ + W DF + G G + ++D +SR + + E + L Sbjct: 105 AEESADSEGKHWAGDFT-YIRTGSGWLYHAVVVDLYSRRVVGWSFSRKRNSELTKSALRM 163 Query: 194 VFERY--GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 R L D G + + G+R SR P +E F Sbjct: 164 ALSREQPRLGCVFHSDQGIEYA-----AHEYRDLVAAAGLRRSMSRKATPLDNAMVESFF 218 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 +LK E++ + F + E + YN ER H + P S + Sbjct: 219 HTLKTELVHLRKFENDIEAVAGIVEFIEFYNRERLHSGIGYQSPASYVRA 268 >UniRef50_C9XPC5 Transposase n=10 Tax=Bacteria RepID=C9XPC5_CLODC Length = 275 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 51/293 (17%), Positives = 91/293 (31%), Gaps = 33/293 (11%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 +I S+C+ G+S + YKWL + G++DR + ++ + Sbjct: 2 SRKYSISSMCKLIGVSRSGYYKWLSYSKKSSDRGIKDRIIKDY------------IIEIH 49 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT----------- 133 +G ++I +L + V+ LM G+ + Sbjct: 50 KKYRGTYGRKRICTYLNKILDSPINHKKVYRLMKELGIKSIIRKKVYRRKFKSYEVYDNI 109 Query: 134 --GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQL 191 F + P MD + P G + D + + T + V + + Sbjct: 110 LNREFRANQPLEKICMDIT-YIPIGKKFLYMNVAKDLFNGEIVAYEISTKMDTKLVNKTV 168 Query: 192 VSVFERYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 + D + D GS + + L GI SR + +E F Sbjct: 169 NQLINMNLAKDCILHTDQGSQYTSR-----SYSKRLKDNGIIQSMSRRGNCWDNAPIESF 223 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 K+E++ D E+ + + YN ER M P SA Sbjct: 224 FSHFKSELIYLIDTTDPKEMISLINDYIYFYNNERIQLKNGM-SPIEYRTHSA 275 >UniRef50_A5B4Q4 Putative uncharacterized protein n=13 Tax=Vitis vinifera RepID=A5B4Q4_VITVI Length = 1595 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 90/267 (33%), Gaps = 50/267 (18%) Query: 97 KRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP-----NRLWQMDFKG 151 + T P+ + M R + + + +W +DF G Sbjct: 782 TMKVLQSRFTWPSLFKDAHTMCRSCDRGQRLEKLTKRNQVSMNPIPIVDLFDVWGIDFMG 841 Query: 152 HFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSP 211 FP G + L +D S++ + ++ + ++ ++F R+G+P + D G+ Sbjct: 842 PFPMSFGNSYILVGVDYVSKWVEVIPCKHNDHKVVLKFLKENIFSRFGVPKAIISDGGTH 901 Query: 212 WGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG---KWFADSG 268 + + E L + ++E ++ +K +++ + S Sbjct: 902 FCNR-----PFETLLAK-----------------QVELENKEIKNILMKVVITRRKDWSI 939 Query: 269 ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISG 328 +L + +RT Y L M+ Y+ + EY ++K++ Sbjct: 940 KLHDSLWAYRTAYKTI-----LGMSP----YRLVYGKACHLPVEVEYKAWWAIKKLN--- 987 Query: 329 KLSVKGVSLSAGKAFRGERVGLKEMQE 355 + +A + L EM+E Sbjct: 988 --------MDLIRAGAKRCLDLNEMEE 1006 >UniRef50_B3EBT2 Integrase catalytic region n=87 Tax=Bacteria RepID=B3EBT2_GEOLS Length = 305 Score = 198 bits (503), Expect = 4e-49, Method: Composition-based stats. Identities = 62/303 (20%), Positives = 96/303 (31%), Gaps = 40/303 (13%) Query: 9 ARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHH 68 A T R +I C G+S + Y +P Sbjct: 9 ACSTARKRELI--DWRHPTISIARQCELLGVSRSCLYYH----------------PVPAS 50 Query: 69 SPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 + NR + LL + RH G K+ WL QG+ V L+ GL+ Sbjct: 51 NENRL---LMRLLDEEYTRHPFLGVIKLTNWLRSQGYWHIGTRRVRRLLRLMGLMAIYPG 107 Query: 129 G---IPAT---------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 PA E + N++W D + G + + ++D SR+ L Sbjct: 108 PNLSKPAPGHKIYPYLLRNVEVERVNQVWSADIT-YIRLKTGFVYLVAVVDWCSRYILAF 166 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 + + L G P+ D GS + T L G+R+ Sbjct: 167 EISITLEADFCIEALQQALT-RGTPEIFNSDQGSQFTSPRHT-----EILHLAGVRISMD 220 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 +ERF RS+K E + + E + + YN ER H++L P Sbjct: 221 GKGRALDNIFVERFWRSVKYEEVYLHDYESVQEARIGLKRYIEYYNNERQHQSLGYQTPA 280 Query: 297 SRY 299 Y Sbjct: 281 EVY 283 >UniRef50_C3KV53 Transposase n=9 Tax=Clostridium botulinum RepID=C3KV53_CLOB6 Length = 291 Score = 198 bits (503), Expect = 4e-49, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 88/301 (29%), Gaps = 43/301 (14%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 ++ + +I+ LC IS + YKW Q + D+ Sbjct: 9 LIIKSLSSKFSIKLLCEISNISRSAYYKWTNTSKQCKDQEIMDK---------------- 52 Query: 79 ALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI-------- 130 +L + +D + +G R+IK L+ V LM + + Sbjct: 53 -ILVIYNDNRKVYGYRRIKISLKRTFGINVNHKKVLRLMQKLKIQSIIKMKKFKYKNPIA 111 Query: 131 ---------PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTD 181 FE N+ W D G R + ++D ++ + T Sbjct: 112 NEYAKIENNVLNRNFEAKTLNQKWVTDITYLRYGNGCRAYLSAMMDLNNNEIIGYKLSTS 171 Query: 182 ERRETVQQQLVSVFERYGLPD----RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 E V + + + + D G + + + L R I SR Sbjct: 172 LEIEFVIDTVKTAISKCSREKLKDLIIHSDQGCHYMSR-----SYKNLLKRYKITQSMSR 226 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 + +E F LK E++ + +L + + YN ER P Sbjct: 227 KGNCYDNACIESFFSKLKTELIYQNKYYSKKDLFESIHKYIYWYNNERFQSKFKNHTPVE 286 Query: 298 R 298 Sbjct: 287 Y 287 >UniRef50_B5CNC7 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B5CNC7_9FIRM Length = 384 Score = 197 bits (502), Expect = 4e-49, Method: Composition-based stats. Identities = 53/306 (17%), Positives = 95/306 (31%), Gaps = 44/306 (14%) Query: 12 TMSLRTEFVL-FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 ++ R V + + I+ + Y + Sbjct: 101 DLTTRVSLVQNLLTTKEIPASVGAKLLDINRTSIYY---------------------KTS 139 Query: 71 NRSSDDI--TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASP 128 S +++ ++ H + WGAR++ L+++G+ + M + P Sbjct: 140 PVSDEELACKEIIDHLHTDNPTWGARQMSAQLKNRGYHVGRRKA-RRYMNEMDIYPIYPK 198 Query: 129 GIPATGRFEH------------DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCL 176 + + DAPN+ W +D + P G + ++D +SR + Sbjct: 199 MNLSKRMQQAKVCPYLLRNVVIDAPNQAWSIDIT-YIPIRHGFLYLTAVIDWYSRCIVGW 257 Query: 177 AHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 V L F P + D G + ++ GIR Sbjct: 258 EVDDTLDTRMVINVLKKAFAV-SKPQILNSDQGCQFTSQK-----YIEFVKENGIRQSMD 311 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 +ER+ RS K E + + E + A + YN ER H ALD P Sbjct: 312 GKSRWADNIMIERWFRSFKYEEAYLTLYNNIKEARVAIGRYVYTYNFERCHSALDYKTPA 371 Query: 297 SRYQPS 302 Y P+ Sbjct: 372 ECYYPA 377 >UniRef50_Q9JMT3 Transposase insF for insertion sequence IS3fB n=147 Tax=Proteobacteria RepID=INF7_ECOLI Length = 288 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 57/305 (18%), Positives = 93/305 (30%), Gaps = 38/305 (12%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 ++ F+ Q +I+++CR ++ + Y W QR R RI R Sbjct: 1 MKYVFIEKH-QAEFSIKAMCRVLRVARSGWYTWCQR-----------RTRISTRQQFRQH 48 Query: 75 DDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT- 133 D +L +R+ A ++ L QG+ TV + GL AS Sbjct: 49 CDSV-VLAAFTRSKQRYCAPRLTDELRAQGYPF-NVKTVAASLRCQGLRAKASRKFSPVS 106 Query: 134 --------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 F PN+ W D + G + ++D SR + + Sbjct: 107 YRAHGLPVSENLLEQDFYASGPNQKWAGDIT-YLRTDEGWLYLAVVIDLWSRAVIGWSMS 165 Query: 180 TDERRETVQQQLVSVFERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 + L R P + D G + + L R +R S Sbjct: 166 PRMTAQLACDALQMALWRRKRPWNVIVHTDRGGQYCSAD-----YQAQLKRHNLRGSMSA 220 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPG 296 +E F SLK E + G+ F ++ ++ YN R H P Sbjct: 221 KGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPE 280 Query: 297 SRYQP 301 Sbjct: 281 QFENQ 285 >UniRef50_A5GAT8 Integrase, catalytic region n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GAT8_GEOUR Length = 426 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 76/358 (21%), Positives = 126/358 (35%), Gaps = 19/358 (5%) Query: 9 ARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEG--AAGLQDRPRIP 66 AR R + I + +S +T +W+ + +E L + R Sbjct: 24 ARLDHGERERLLREKCARKWEIP-CSNQTRLSRSTILRWIGLYLKERGKLEALYPQGRND 82 Query: 67 HHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTM----PAFSTVHNLMARHGL 122 + L + ++ + + ++ + ST + + RH L Sbjct: 83 RGMSRVLDQETGLALAQLRRQQPELTVPELVKQMHERRLVTDGVGLSLSTAYRFLHRHDL 142 Query: 123 LPGASPGIPATGRFEHDAPNRLWQMDFKG-HFPFGG---GRCHPLTLLDDHSRFSLCLAH 178 + P +FE + PN LWQ D G + + + +DDHSR Sbjct: 143 MGK-QPAPVDRRKFEAELPNDLWQSDVMHGPMLLSGDKRRKTYLIAFIDDHSRLIPHGRF 201 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 E R GLP ++ +DNGS + LE LGI + H+RP Sbjct: 202 YLSEGVACFMSAFSDAVLRRGLPRKLYVDNGSAFRSR-----QLEYTAAALGIALVHARP 256 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 Y PQ +GK+ERF ++++ L E+ AF+ W Y +R H + P R Sbjct: 257 YQPQGKGKIERFFKNVRTSFLPSFKGETLEEINEAFELWLNDYYHQRSHGSTG-ETPFKR 315 Query: 299 YQPSARQYSGNTTPP-EYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + +Y + R V+ + V A G+RV L +E Sbjct: 316 FTSRMECLRPAPDNLKDYFRKTVRRLVNKDRSVVVDRRLFEAPVELIGKRVELLYFEE 373 >UniRef50_C9XK98 Integrase, catalytic region n=10 Tax=Clostridium difficile RepID=C9XK98_CLODC Length = 299 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 49/316 (15%), Positives = 89/316 (28%), Gaps = 40/316 (12%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 M + + ++ A++ LC ++ + YKWL R Sbjct: 1 MVITLSKNE----DKYKAIQSLFNEKKASLIQLCEIAKVNRSGYYKWLNR---------- 46 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRH-ERWGARKIKRWLEDQGHTMPAFSTVHNLMAR 119 + + + L+ ++ +G+ ++ L + V+ LM Sbjct: 47 -----DKSNLELENIKLATLITKIYEEKKGVFGSLRMTLQLNREYKLNVNHKRVYRLMRA 101 Query: 120 HGLLPGASPGI--------------PATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTL 165 GL +F D ++ W D G + + + Sbjct: 102 VGLRSICRKKKFSYVKCTQEVIAENVLNRKFSADKTSQKWLTDVTEFKLTDGTKAYLSAI 161 Query: 166 LDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG-LPDRMTMDNGSPWGDTTGTWTALEL 224 LD R + V + E Y D G + L Sbjct: 162 LDLGDRSIVSYVIGKSNNNNLVFETFNKAIEVYPNAKPIFHSDRGYQYTSRVFKSKLLTQ 221 Query: 225 WLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLE 284 G+ SR H +E F LK E+ + F +L +A D + YN Sbjct: 222 -----GMIQSMSRVGHCIDNAPMEGFWGILKVEMYYLRKFDTYEQLVKAIDDYIYYYNNF 276 Query: 285 RPHEALDMAVPGSRYQ 300 R + L+ P + Sbjct: 277 RYQKRLNSMSPLEFRK 292 >UniRef50_A5B8V2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5B8V2_VITVI Length = 1257 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 94/251 (37%), Gaps = 26/251 (10%) Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLP 201 +W +DF G FP G + L +D S++ +++ ++ ++F R+ +P Sbjct: 809 FYVWXIDFMGPFPMSFGYSYILVGVDYVSKWVEXXPCKHNDQXVILKFLKKNIFSRFVVP 868 Query: 202 DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG 261 + D G+ + + E L + G++ PYHPQT G++E +R +K +++ Sbjct: 869 KAIISDGGTHFCN-----KPFETLLAKYGVKHKVXTPYHPQTSGQVELANREIKNILMKV 923 Query: 262 ---KWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEG 318 S +L + +R Y L M+ Y+ + EY Sbjct: 924 VNTNRKDXSVKLLDSLWAYRIXYKTI-----LGMSP----YRLVYGKACHLPMKSEYKAW 974 Query: 319 VMVRKVDISGK-LSVKGVSLSAG--------KAFRGERVGLKEMQEDGSYEVWWYSTKVG 369 ++K+++ +K + + + + K+ + + + Sbjct: 975 WAIKKLNVDLSRAGLKRNDAYINSKIXTKKLRRWHDQLIAHKDFHKGQRVLLHDSKLHIF 1034 Query: 370 VIDLKKKSITM 380 + LK + I Sbjct: 1035 LGKLKSRWIXP 1045 >UniRef50_Q3A4V8 Transposase and inactivated derivatives n=9 Tax=Bacteria RepID=Q3A4V8_PELCD Length = 336 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 68/332 (20%), Positives = 123/332 (37%), Gaps = 21/332 (6%) Query: 5 MPWDARDTMSLRTEFVLFASQD-GANIRSLCRRFGISPATGYKWLQRWAQEGA-AGLQDR 62 MP D RD + +FV ++ G + + G++ + Y W QR+ + L R Sbjct: 1 MPHDIRDAV---IDFVKHWAKRTGIAVTHIIDWLGLAVSKFYNWQQRYGKANEHNALIPR 57 Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL 122 + + + + + G R++ + DQ + S+V+ ++ GL Sbjct: 58 DFW-------LEEKEKQAIIKFYQQKPQEGYRRLTFMMLDQDVVAVSPSSVYRVLNAAGL 110 Query: 123 LPGASPG--IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 L + TG + P+ W +D + G + LLD SR+ + Sbjct: 111 LRRWNGKQSKKGTGFVQPLKPHEHWHID-VSYINICGTFYYLCCLLDGCSRYIVHWELRE 169 Query: 181 DERRETVQQQLVSVFERYGL-PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPY 239 V+ L E++ R+ DNG + + ++ G+ + PY Sbjct: 170 AMTEANVEIILQRAREKHPAATPRIISDNGPQFI-----TKDFKEFIRVAGMTHVRTSPY 224 Query: 240 HPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRY 299 +PQ+ GKLERFH ++K E ++ K E + + YN ER H AL P + Sbjct: 225 YPQSNGKLERFHGTIKQECIRPKVPLSLEEARAQVADYIRYYNDERLHSALGYVAPKVKL 284 Query: 300 QPSARQYSGNTTPPEYDEGVMVRKVDISGKLS 331 + Q ++ K+ Sbjct: 285 EGREEQIFKERDSKLEAAREARKQKRRQEKIR 316 >UniRef50_A1UQ90 Integrase, catalytic region n=31 Tax=Actinomycetales RepID=A1UQ90_MYCSK Length = 299 Score = 197 bits (500), Expect = 8e-49, Method: Composition-based stats. Identities = 59/305 (19%), Positives = 93/305 (30%), Gaps = 39/305 (12%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R +FV +D ++ LC ++ ++ Y WL A + + Sbjct: 3 RFQFVADH-RDTFEVKWLCAVVEVARSSFYAWLAAADGRAAR-------------RAADE 48 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQ--GHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + A +R HD +GA +I L D V +M G+ T Sbjct: 49 VLEARIRTVHDTDNTYGAPRITAELNDGAPPDERVNHKRVARVMRTAGIAGYRRRRRVKT 108 Query: 134 ---------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 F N + D GG + T++D SR A Sbjct: 109 TQSDPANQKVPDLLKRDFTAAQVNTRYVGDITYLPLATGGNLYLATVIDCCSRRVTGWAI 168 Query: 179 CTDERRETVQQQLVSVFERYG--LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 R E V L + G D+GS + +G+ Sbjct: 169 ADHMRTELVADALSAATALRGSMAGAVFHTDHGSQYTSRD-----FATLCGEMGVIQSMG 223 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGK-WFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 E F+ +LK EVLQ + + D+ +R W T YN +R H + P Sbjct: 224 GVGSSADNALAESFNGTLKREVLQDRACWPDAATCRREVFRWLTRYNTQRRHSHCRYSSP 283 Query: 296 GSRYQ 300 + Sbjct: 284 AIYER 288 >UniRef50_Q7TT98 InsB n=4 Tax=Bacteria RepID=Q7TT98_RHOBA Length = 292 Score = 197 bits (500), Expect = 8e-49, Method: Composition-based stats. Identities = 54/304 (17%), Positives = 92/304 (30%), Gaps = 41/304 (13%) Query: 17 TEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDD 76 +F+ +D I LCR ++ A Y++ R P + Sbjct: 1 MKFIGEC-RDRWPIAVLCRTLEVTRAAYYRFAGR-----------GPTATEIKQTQIIQA 48 Query: 77 ITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP----- 131 + + H H+ +G+ +++R + +G + +TV M G+ Sbjct: 49 VKEIRLEKH--HDAYGSPRMQRAIVKRG-VVCCRNTVAKCMRHAGIQANRRTKFRISTTD 105 Query: 132 -----------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCT 180 F +A NR+W D + P G + +D HSR + Sbjct: 106 SNHDQPIASNLLGQNFTTEAINRVWLTDIT-YIPTQEGSTYLCAFVDLHSRKIVSWKTSR 164 Query: 181 DERRETVQQQLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 + E V + D GS + L G+ SR Sbjct: 165 NMDSELVVGAFDQALTFRKPNAGLIVHSDRGSQFASDH-----FRRRLAASGLVQSMSRR 219 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGS 297 + +E F +S K E Q + + R + YN R H +L P Sbjct: 220 GNCYDNAPMESFFKSYKTEEAQ-QIYDTHEHATRGVSDYIERFYNPHRLHSSLGYLSPID 278 Query: 298 RYQP 301 Q Sbjct: 279 FEQA 282 >UniRef50_O28862 ISA0963-5, putative transposase n=5 Tax=Archaeoglobus fulgidus RepID=O28862_ARCFU Length = 357 Score = 197 bits (500), Expect = 8e-49, Method: Composition-based stats. Identities = 64/320 (20%), Positives = 126/320 (39%), Gaps = 25/320 (7%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P R + + +++ GA ++ + ++P Y+ +++ + G + Sbjct: 56 PLHVRKLTNKKIRWIIRQLDKGAPVKEIAAVMRVTPRRIYQLKKQYEETGQ---IPELKQ 112 Query: 66 PHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPG 125 P P ++ ++ A+ ++ R +++ +E + +T++ ++ +HGL+ Sbjct: 113 PGRKPKEIDEETEKIILQAYKKY-RLSPVPLEKLIERDYGIHISHNTIYKVLLKHGLVEE 171 Query: 126 --ASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDER 183 + R+E LWQ D+K G + +DD SRF C Sbjct: 172 NMSKKKRRKWVRYERTHSMSLWQGDWKRL-----GEKWIIAFMDDASRFITCYGVFDSAT 226 Query: 184 RETVQQQLVSVFERYGLPDRMTMDNGSPWG---DTTGTWTALELWLMRLGIRVGHSRPYH 240 E + L F YG+PD + D+G+ + +L G+R +R H Sbjct: 227 TENTIRVLKVGFREYGIPDEILTDHGTQFVAAKSREKAKHRFREFLAENGVRHVLARINH 286 Query: 241 PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 PQT GK+ERF ++ ++ L + D + YN +PH +L+ + YQ Sbjct: 287 PQTNGKIERFFGLMEQKI----------HLFDSLDEFIYWYNYVKPHMSLNFDELETPYQ 336 Query: 301 PSARQYSGNTTPPEYDEGVM 320 R+ EY + Sbjct: 337 AFLRKLPAERV-FEYGRWLF 355 >UniRef50_A5AHC9 Putative uncharacterized protein n=14 Tax=Vitis vinifera RepID=A5AHC9_VITVI Length = 2107 Score = 197 bits (500), Expect = 9e-49, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 92/291 (31%), Gaps = 59/291 (20%) Query: 74 SDDITALLRMAHDRH--ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 + +L HD + ++K + G P+ + M R Sbjct: 1773 EKEQQGILSHYHDNACGGHFASQKTAMEVLQSGFCCPSLFKDAHTMCR--SCDRCQRLGK 1830 Query: 132 ATGRFEHDA-------PNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 T R + +W +DF G FP G + L +D S++ + ++ + Sbjct: 1831 LTKRNQMPMNPILIVDLLDVWGIDFMGSFPMSFGNSYILVGVDYVSKWVEAIPCKHNDHK 1890 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 ++ ++F R+ +P + D G+ + + + L + G++ Sbjct: 1891 VVLKFLKENIFSRFXVPKAIISDGGTHFCNR-----PFKTLLAKYGVKHKV--------- 1936 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 + S +L + +RT Y LDM+ Y+ Sbjct: 1937 --------------VITSRRDWSIKLHDSLWAYRTAYKTI-----LDMSP----YRLVYG 1973 Query: 305 QYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 + EY +++++ + +A + L EM+E Sbjct: 1974 KACHLPVEVEYKASWAIKRLN-----------MDLIRAGAKRCLDLNEMEE 2013 >UniRef50_Q8XPL1 Isrso16-transposase orfb protein n=2 Tax=Ralstonia solanacearum RepID=Q8XPL1_RALSO Length = 269 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 54/243 (22%), Positives = 94/243 (38%), Gaps = 12/243 (4%) Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPG- 129 +R + + +G ++ ++ + V++L GL Sbjct: 18 SRKRKRLVGIACTHQRDCGHYGYHRVHV-MQQRESWKDNHKRVYHLYRAEGLSLRHKRPK 76 Query: 130 -----IPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERR 184 + N +W MDF G R LT++D+++R SL + + Sbjct: 77 CNKSARLRQPKSIVMGINEIWSMDFVSDALLDGQRLRALTVVDNYTRESLAIEVGQSLKG 136 Query: 185 ETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 + V + L +V ++G P + +DNG+ + A++ W G+ + SRP P Sbjct: 137 KDVVRVLDAVVAQHGTPQTIKVDNGTEFIS-----KAMDRWAYEHGVELDFSRPGTPTDN 191 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 K+E F+ + E L WF + Q WR YN RPH AL A P + + + Sbjct: 192 AKVESFNGRFRQECLNEHWFLSLEDAQSKIADWRRHYNESRPHSALQWATPDEFARQARK 251 Query: 305 QYS 307 S Sbjct: 252 SAS 254 >UniRef50_B9XEB0 Integrase catalytic region n=1 Tax=bacterium Ellin514 RepID=B9XEB0_9BACT Length = 280 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 85/298 (28%), Gaps = 43/298 (14%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLR- 82 + + L ++ + Y P + + + ++ Sbjct: 4 LKADYQVEELAHALEVTSSGFYAHQ---------------HKPEGARRQQDQKLLKRIQP 48 Query: 83 MAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR------- 135 + + +G+ +I L+ QG + + V LM ++ L Sbjct: 49 IFKESRSTYGSPRIHAALKRQGEP-CSKNRVARLMRQNHLRARQKRRFVPRTTQSDHDLP 107 Query: 136 ---------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRET 186 D PNR+W +D + G + +LD SR + + + Sbjct: 108 IAPNWLAKVPTPDRPNRVWVVDIT-YIATAEGWTYLAVVLDACSRKVVGWSMASSLETFL 166 Query: 187 VQQQLVSVFERY--GLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQ 244 V + L + D G + +A L I SR +P Sbjct: 167 VTEALARAQKERLPQPGLLHHSDRGVQYAS-----SAYRALLADYQITPSMSRAANPYDN 221 Query: 245 GKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQP 301 E F +LK E K GE + + YN +R H AL P Sbjct: 222 ALAESFMATLKTECFD-KPPNTHGEAKLMIFDYLETFYNSKRLHSALGYQSPVEFENQ 278 >UniRef50_Q01TL7 Integrase, catalytic region n=5 Tax=Bacteria RepID=Q01TL7_SOLUE Length = 280 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 83/294 (28%), Gaps = 42/294 (14%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 ++ +C GIS A Y++L P + D+ + Sbjct: 1 MSVVQMCESLGISRAGYYRFL-----------------DPEKPAPADMDLRDEMHRVALD 43 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP---------------A 132 +G+R+I L+ +G + + LM LL Sbjct: 44 WPCYGSRRIVEELKARGWEV-NRKRIQRLMREDNLLCVIKRKFVVATTDSRHGLKVYPNR 102 Query: 133 TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 ++LW D + + +L+ SR + L Sbjct: 103 AAEMTLTGVDQLWVADIT-YIRLEEEFVYLAVILEAFSRRVIGWHLGETLEVSLTLNALR 161 Query: 193 SVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 + + D G + L GI + SR +P E F Sbjct: 162 MALGQRSVSPGPVHHSDRGVQYAS-----HDYTQLLQDNGIEISMSRKANPWDNAACESF 216 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSA 303 ++LK E + + + + + + +YN +R H +L P A Sbjct: 217 MKTLKYEEVHRTEYRNLAHARASIKTFLEKIYNQKRLHSSLSYRSPVEFEHSLA 270 >UniRef50_A5BTF9 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BTF9_VITVI Length = 1209 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 89/240 (37%), Gaps = 29/240 (12%) Query: 119 RHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 GL +P D +W +DF G FP + L +D S++ + Sbjct: 967 ARGLGNTRKNMMPLNPILIVD-LFYVWGIDFMGPFPMSFNYSYILVGVDYVSKWVEVIPC 1025 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 ++ R ++ ++F R+G+P + D G+ + + E+ L + G++ + P Sbjct: 1026 KRNDHRVVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFEILLAKYGVKHKVATP 1080 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSG---ELQRAFDHWRTVYNLERPHEALDMAVP 295 YHPQT ++E + +K +++ + L + ++ Y L P Sbjct: 1081 YHPQTSRQVELANLEIKNILMKVVNTSRRDWSVRLHDSLWAYKIAYKTI-----LG-LSP 1134 Query: 296 GSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQE 355 Y+ + +Y ++ ++ + +A + L EM+E Sbjct: 1135 ---YRLVYGKAWHLLVEVQYKAWWAIKTLN-----------MDLNRADMKRFLNLNEMEE 1180 >UniRef50_B7IVG1 Integrase core domain protein n=61 Tax=Bacillus RepID=B7IVG1_BACC2 Length = 294 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 52/297 (17%), Positives = 95/297 (31%), Gaps = 44/297 (14%) Query: 24 SQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRM 83 Q ++ LC + +T Y+WLQR DDI ++ Sbjct: 23 VQTKITVKDLCNVLELPRSTFYRWLQRTED-------------------LKDDIEEKVKD 63 Query: 84 AHDRHE-RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------- 133 RH+ R+G R++ L G + V +M ++ +L Sbjct: 64 VCLRHKFRYGYRRVTATLRKMGLCV-NHKKVLRIMRQNHILSKVRRKKKKYINGAEPMVA 122 Query: 134 -----GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQ 188 +F+ PN W D + FG + T++D +R + + Sbjct: 123 PHRLERQFDASKPNEKWFTDVT-YLLFGERTLYVSTIMDAFNREIISCVISESQTLTLAM 181 Query: 189 QQLVSVFERYGLPDRM-TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + L + D + D GS + T + + G+ SR + + Sbjct: 182 KTLKQAMRGRKVKDVLLHSDQGSIY-----TAKEFQAYAKENGMITSMSRRGNCHDNAVM 236 Query: 248 ERFHRSLKAEVLQGKWFA--DSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 E F LK+E + + +++ + YN R E L+ P + Sbjct: 237 ESFFGHLKSEAFYSQKITKVSNTTVRKIVLEYIHYYNCVRIQEKLNHLSPKEFREQV 293 >UniRef50_A0JV34 Integrase, catalytic region n=12 Tax=Actinomycetales RepID=A0JV34_ARTS2 Length = 325 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 79/331 (23%), Positives = 117/331 (35%), Gaps = 32/331 (9%) Query: 1 MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQ 60 M + +A T R + + G +R RF SPAT KW+ R+ G G+ Sbjct: 1 MSYVTHANADLTPKARGKLARLVIEQGWTLRRAAERFQCSPATAKKWVDRYRARGEDGMA 60 Query: 61 DRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARH 120 D P SPNR+ + RWG +I L T+ T + + Sbjct: 61 DLSSRPRRSPNRTDVRTERRILALRFTR-RWGPHRIAAHLHLARSTVGKVLTRYRMPRLA 119 Query: 121 GLLPGA--SPGIPATGRFEHDAPNRLWQMDFKGHFPFGGG-------------------- 158 L G PA R+EHD P L +D K G Sbjct: 120 CLDQGTGLPIRKPAPQRYEHDHPGDLVHVDIKKLGRIPDGGGHRALGRAAGRKNRRAGTG 179 Query: 159 RCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSV--FERYGLP-DRMTMDNGSPWGDT 215 + +DDHSR + +++ + F +G+ + DNG+ + Sbjct: 180 YAYLHHAVDDHSRLAYSEILTDEKKETATAFWFRAASFFAAHGITVRAVLTDNGACYRSR 239 Query: 216 TGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFD 275 T + I+ +RPY PQT GK+ERF+R+L E + + E + Sbjct: 240 AFTA------ALGPNIKHRRTRPYRPQTNGKVERFNRTLNTEWAYARPYTSEAERAATYP 293 Query: 276 HWRTVYNLERPHEALDMAVPGSRYQPSARQY 306 W YN R H + P SR Y Sbjct: 294 GWLHQYNHHRTHTGIGGKTPISRVHNLRGNY 324 >UniRef50_B0MP11 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MP11_9FIRM Length = 379 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 57/316 (18%), Positives = 107/316 (33%), Gaps = 35/316 (11%) Query: 6 PWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRI 65 P R + R + N+ +C F I T Y + R ++ + R Sbjct: 81 PCTPRAPLRQRL-YAAEQLYGKYNVHVICDAFDIPRGTFYNHVLRNKKDNTWYAKRR--- 136 Query: 66 PHHSPNRSSDDITALLRMAHDR-HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP 124 +++ ++ +D ++ +GA KI ++ +G + V LM GL+ Sbjct: 137 ---------EELRLRIQEIYDESNQIFGAAKIAAVMKSEGFKVSNE-MVRTLMRDMGLVS 186 Query: 125 GASPGIPA------------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRF 172 F+ PN +W D +F +G + ++D SR Sbjct: 187 IRQSAKKLYEDEGRKYKNLLNQEFDTCKPNEVWVSDVT-YFKYGENAYYICVIIDLFSRM 245 Query: 173 SLCLAHCTDERRETVQQQLVSVFERYGL--PDRMTMDNGSPWGDTTGTWTALELWLMRLG 230 + + V+ ++ D GS + + ++ L Sbjct: 246 VVGYKISKTNSTQLVKSTFQIAYKARQPDSSLVFHTDRGSNYRS-----KTMNDYMRSLH 300 Query: 231 IRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEAL 290 I SR + P +E F S+K E L + + + A D + YN +RPH+ L Sbjct: 301 ITHSFSRAHVPYDNSVMESFFASMKREELYRTKYRSESDFRSAVDKYMIFYNTKRPHKKL 360 Query: 291 DMAVPGSRYQPSARQY 306 P + + A + Sbjct: 361 QYKTPEQKEEEYALKL 376 >UniRef50_C8VYH9 Transposase IS3/IS911 family protein n=11 Tax=Bacteria RepID=C8VYH9_DESAS Length = 369 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 55/301 (18%), Positives = 94/301 (31%), Gaps = 40/301 (13%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 T R V +++ ++ + GI+ Y P Sbjct: 96 LTKEERLALVEPDNKE-ISLTAQADLLGINRTRIYY-------------------KPAPP 135 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGI 130 + I + + RH +G+R+I L + + V + M GL Sbjct: 136 SAEEIAIRHRIDEIYTRHPYYGSRRITAQLCRENIPV-NRKRVQHYMRDMGLAGICPGPN 194 Query: 131 PATGRFE------------HDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAH 178 + E + PN +W +D + G + + +LD SRF + Sbjct: 195 LSKRNTEHRVYPYLLRGVTANHPNHIWGIDIT-YIRLKEGWMYLVAILDWFSRFIISWEL 253 Query: 179 CTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRP 238 V + ++ +P D GS + T L G+ + Sbjct: 254 DQVLEIPFVLVAVKRALDK-KMPLIWNSDQGSHFTSPQYT-----QLLQNAGVLISMDGK 307 Query: 239 YHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 ER RS+K E + + E +R + YN ERPH++LD P Sbjct: 308 GRAIDNIFTERLWRSIKYEEVYLNDYMSPREARRGIRRYIDFYNNERPHQSLDYKTPFEI 367 Query: 299 Y 299 Y Sbjct: 368 Y 368 >UniRef50_Q24VK2 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=Q24VK2_DESHY Length = 284 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 46/290 (15%), Positives = 94/290 (32%), Gaps = 39/290 (13%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + + ++ ++ Y P+ + + + Sbjct: 16 SKELPLSTAAELLDVNRSSAYY-------------------KAKEPSETELAVKNAIDKM 56 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMAR---HGLLPGASPGIPATGRFEHD-- 139 H + WG+R++ + L+ G + T M H + P + PA G + Sbjct: 57 HTDNPAWGSRQLSKKLKRLGFDIGRLKT-RRYMQEMDIHTIYPKPNLSKPAKGHKVYPYL 115 Query: 140 -------APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 PN+ W +D + G + ++D +SR + V+ L Sbjct: 116 LRNANITRPNQAWSIDIT-YIRLKHGFVYLTAIIDWYSRLIVGWELDDTLSTTMVKCALE 174 Query: 193 SVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHR 252 F P+ + D GS + + +++ +ER+ R Sbjct: 175 KAFSVA-KPEILNSDQGSQFTG-----HEYINLVESNRVKISMDGKSRWADNIMIERWFR 228 Query: 253 SLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPS 302 +LK + + K + + + ++ + YN E+ H ALD P Y P Sbjct: 229 TLKYDEVYLKDYENIKDARKQIGEFIHTYNFEKLHSALDYQTPAENYYPV 278 >UniRef50_A4JLF1 Integrase, catalytic region n=12 Tax=Proteobacteria RepID=A4JLF1_BURVG Length = 283 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 51/296 (17%), Positives = 90/296 (30%), Gaps = 37/296 (12%) Query: 25 QDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMA 84 + + +CR G+S + Y W +R P + +L Sbjct: 2 RQDYPVPPMCRVLGVSVSGYYAWRKR--------------GPSERTQQEPRLEAEVLAAH 47 Query: 85 HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT----------- 133 E +G ++K+ L+++G + T Sbjct: 48 QRTRESFGPERLKQHLDERGVRIGVHRIRRLRRKLGLRCKQKRRFKATTNSKHDLPVAPN 107 Query: 134 ---GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 F APN+ W D + G + L D +S + A + V Q Sbjct: 108 LLNQDFSVTAPNQAWCGDIT-YIATDEGWLYLAGLKDLYSGEIVGYAMSERMTKNLVMQA 166 Query: 191 LVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 L P D GS + A + + + +R SR + +E Sbjct: 167 LFRAVATRRPPAGLIHHSDRGSQYCAL-----AYQALIGQFDMRASMSRRGNCYDNAPIE 221 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSA 303 F +LK E++ + FA + + A + YN +R L+ P + Q Sbjct: 222 SFWGTLKNELVYHQRFATREQARLAISEYIEIFYNRQRTQARLNYQSPVAFTQRFY 277 >UniRef50_A6WME4 Transposase IS3/IS911 family protein n=33 Tax=Gammaproteobacteria RepID=A6WME4_SHEB8 Length = 386 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 96/309 (31%), Gaps = 41/309 (13%) Query: 14 SLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRS 73 +R F+ + I LCR +S + Y W +R P + + Sbjct: 96 EVRFRFIKQQ-SNLFPITLLCRVMSVSKSGYYDWHKR---------------PANVISVE 139 Query: 74 SDDITALLRMA-HDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 + + L+R G R++ + L +G+ + + V +M R L Sbjct: 140 TLKLYRLVRQLFKQSRGSLGNREMVKKLRKEGYQVGRYL-VRKIMHRLRLKATQRRAYKV 198 Query: 133 TGR---------------FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLA 177 T + F + N +W D + G G + ++D +SR + Sbjct: 199 TTQRKHSDAVADNLLNMNFNPVSANEVWAGDVT-YLKTGEGWMYLAVVMDLYSRRIVGWH 257 Query: 178 HCTDERRETVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGH 235 + + + L+ + D GS + L GIR Sbjct: 258 IDKRMTTDLISKALIKAYNLRQPARGLVFHSDRGSQYTS-----KQFGRLLSSYGIRASM 312 Query: 236 SRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVP 295 +ERF SLK + + +++ + YNLER H A D P Sbjct: 313 GDVGACWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYMKYYNLERLHSANDDLSP 372 Query: 296 GSRYQPSAR 304 + Sbjct: 373 VEFENSQVK 381 >UniRef50_Q24NL7 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense Y51 RepID=Q24NL7_DESHY Length = 282 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 50/293 (17%), Positives = 89/293 (30%), Gaps = 45/293 (15%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWG 92 +C +S + Y W+ R + A + + + + + + +G Sbjct: 1 MCSVLRVSKSGYYAWINRHDSKRAEDNK--------------LLLQDIRDVFNASNGVYG 46 Query: 93 ARKI-----KRWLEDQG--HTMPAFSTVHNLMARHGLLPGASPGIPAT------------ 133 + K+ + LED+ + +M G+L AT Sbjct: 47 SIKVKKAIEAKKLEDRYSKLARINHKRIERIMRNEGILSKVHKKYKATTNSNHNLPVAEN 106 Query: 134 ---GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 FE PN D + G + + D + L+ + V Sbjct: 107 ILNREFEAKRPNEKLVSDIT-YISTEEGWLYVAGIQDLCGGKMVGLSMSDRMTKALVLNA 165 Query: 191 LVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLE 248 L + G P + D GS + A + L G SR + +E Sbjct: 166 LHDAYRTAGRPQNAILHSDRGSQYCSY-----AYQEKLKEYGYTCSMSRKGNCWDNAPME 220 Query: 249 RFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPGSRYQ 300 F +K E + G+ F E + + YN +R H + D P + Y Sbjct: 221 SFWGKMKQEWINGQKFKTREEAKAKVFEYIMIFYNRKRLHASYDYKTPDAYYN 273 >UniRef50_C0ZBG4 Putative transposase orfB for insertion sequence element IS3 family n=5 Tax=Paenibacillaceae RepID=C0ZBG4_BREBN Length = 297 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 50/308 (16%), Positives = 92/308 (29%), Gaps = 45/308 (14%) Query: 15 LRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSS 74 V+ +D +++ LC GIS + Y +++R + Sbjct: 11 QNKHIVVDELRDKRSVQELCACLGISRSGYYAYVKRKDNDP------------------D 52 Query: 75 DDITALLRMAHDRH-ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT 133 + + +R ++ + G R+++ L Q + V LM G+ Sbjct: 53 EHLKRKIRSIYEERDKTVGYRRVQDELYRQYNLKVNHKKVLRLMQELGMQAIIRRKYIHR 112 Query: 134 ------------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLC 175 F + PN+ W D F R + + D + + Sbjct: 113 TSYETAVSDGRITENLLQRNFTAEGPNQKWVTDVTQ-FRVFDHRIYLSAIKDLWNNEIVA 171 Query: 176 LAHCTDERRETVQQQLVSVFERYG--LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRV 233 V + FE+ + D G + A L ++G ++ Sbjct: 172 YHLSQRNDNPLVLETFKKAFEKQKDVAGLIVHSDQGYQYTS-----HAYHDMLPKVGAQI 226 Query: 234 GHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMA 293 SR + +E F LK E L E QR + + YN +R L+ Sbjct: 227 SMSRRGNCYDNASMESFFSHLKVEALYPYDIRSIEEAQRRIEEFILFYNEKRAQRKLNKL 286 Query: 294 VPGSRYQP 301 P + Sbjct: 287 TPVEYRRQ 294 >UniRef50_Q4FQT2 Transposase OrfB n=179 Tax=Bacteria RepID=Q4FQT2_PSYA2 Length = 288 Score = 195 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 59/306 (19%), Positives = 108/306 (35%), Gaps = 33/306 (10%) Query: 3 SLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDR 62 MP + + ++ +IR C F IS Y Sbjct: 11 YRMPCQKSGSAAWACHYIKE---RRISIRRACLIFNISVTCYYH---------------- 51 Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL 122 + + + I LL +++ WG L + V+ + L Sbjct: 52 ----KSAASDENKQIADLLVELTTQNKNWGFGLCFLSLRNVLGLPYNHKRVYRIYCELEL 107 Query: 123 LPGASPGIPATGRFEHD-----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLA 177 P PN+ W MDF G ++DD++R +L + Sbjct: 108 NLRIKPKRRIKRVKPVPLAVPVEPNQSWSMDFMHDALTDGRAFRLFNVIDDYNREALTVE 167 Query: 178 HCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSR 237 + V + L + E G P ++ DNG+ + AL+ W + GI + + Sbjct: 168 IDFSLPAQRVIRSLNQLIEYRGKPVQVRCDNGAEYISN-----ALKDWAVNQGITIRYIE 222 Query: 238 PYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGS 297 P +PQ +ER++R+++ + L + F D ++++ + W YN ERP+ P Sbjct: 223 PGNPQQNAYVERYNRTMRYDWLNQELFTDLDQVRQQAEDWLYHYNNERPNMGNGGFTPIQ 282 Query: 298 RYQPSA 303 + +A Sbjct: 283 KLHQAA 288 >UniRef50_C0QA21 Transposase /integrase family protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QA21_DESAH Length = 402 Score = 195 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 75/341 (21%), Positives = 125/341 (36%), Gaps = 21/341 (6%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKR 98 SP T KW R+ G L + PR + I L + H RW ++ Sbjct: 52 YSPDTLKKWFYRYRNGGLPALNNSPRKDIGTHGTIPQTIVDRLFKLREEHPRWTLSRMLD 111 Query: 99 WLEDQGHT---MPAFSTVHNLMARHGLLP-GASPGIPATGRFEHDAPNRLWQMDFKGHFP 154 L + PA ST++ L F + +LW DF Sbjct: 112 QLVQENLWDKKSPARSTLYRFAQTANLKRDPHLAAHVPARPFAYSFFGQLWMADFLHGPK 171 Query: 155 FGG----GRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGS 210 + + ++DD +R+ + T E E + +L++ +G P R DNG+ Sbjct: 172 IREKGKKRKTYLHAIIDDATRYIVHAGFFTAESTEVMMAELMASVRTHGKPIRFYTDNGA 231 Query: 211 PWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGK--WFADSG 268 + L+ LGI + H+ P P+ +GK+ERF RS++ + L GK Sbjct: 232 CYASKH-----LKFVCANLGIHLIHTPPGKPRGRGKVERFFRSVRDQFLDGKKAPAKTLD 286 Query: 269 ELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVMVRK---VD 325 L +AF W Y +R H +L P + E + +++ V Sbjct: 287 GLNKAFREWVASY-HKRIHSSLG-ISPLQKRLSHQSACKALPETVEIEPLFRMKRRCKVY 344 Query: 326 ISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYST 366 ++ + +K A G+RV + M + VW+ Sbjct: 345 LNNTIRLKRRIYEVIDALPGQRVDVWFMPWNLD-MVWYGPE 384 >UniRef50_Q310V5 Transposase-like n=4 Tax=Deltaproteobacteria RepID=Q310V5_DESDG Length = 279 Score = 195 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 55/300 (18%), Positives = 89/300 (29%), Gaps = 43/300 (14%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE 89 ++ +C G+S + Y W R P R + + +R + H+ Sbjct: 1 MKKMCHMLGVSQSGFYSWATR---------------PPSQRARHNAQLRVRIRELFEEHK 45 Query: 90 R-WGARKIKRWLEDQG-HTMPAFSTVHNLMARHGLLPGASPGIPAT-------------- 133 R G+ I L + + S V M GL T Sbjct: 46 RTAGSPMITEDLRAEDAFRHVSRSRVARHMREMGLRCQTMKKFVVTTDSSHSEPVAPNVL 105 Query: 134 -GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 RF APN W D + G + +D +SR + R +V Sbjct: 106 NRRFSASAPNVTWVSDIT-YIKIGRTWHYLTVFIDLYSRLVVGWDLSASLERHSVIHAFR 164 Query: 193 SVFERYGLPDR--MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 R + D G + L R G SR + E F Sbjct: 165 KAVARRRPSKGLLVHSDRGIQYASRD-----FRKELHRHGCIQSMSRKGNCWDNAVAESF 219 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPG--SRYQPSARQYS 307 +LK +++ + F D E + A H+ YN R H + P + ++ + Sbjct: 220 FHTLKTQLIYQRKFTDRREAELALFHYIEAYYNRRRRHSSNGWMSPAAFENFNQEYKKVA 279 >UniRef50_Q315H1 Putative transposase B n=2 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q315H1_DESDG Length = 272 Score = 195 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 55/286 (19%), Positives = 88/286 (30%), Gaps = 40/286 (13%) Query: 39 ISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH-ERWGARKIK 97 +SP+ Y WL+ L R + + +R H R +G RKI Sbjct: 2 VSPSGFYSWLK----APEDALMSRS-----------ESLRKAIRFYHQRSSGVYGYRKIH 46 Query: 98 RWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP---------------ATGRFEHDAPN 142 + + ++ TV M GL + F P+ Sbjct: 47 KDIIEETTLRCCRETVRRAMKSDGLRSKVTRKHRYPANIEQIPRAAPNVLARDFTAATPD 106 Query: 143 RLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY--GL 200 + W D + G + +LD SR + A Q L + + GL Sbjct: 107 QKWTADIT-YLWTNEGWLYLAVVLDLFSRRVVGWAMSERADAHLACQALEAAIQLRRPGL 165 Query: 201 PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQ 260 D G + A L + GI SR + E F LK E ++ Sbjct: 166 GLLHHSDQGCQYTSG-----AFSAVLDQYGIVCSMSRRGNCWDNAVTESFFSKLKREWVR 220 Query: 261 GKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQPSARQ 305 GK + + ++ + YN +R H L P + Q + Sbjct: 221 GKRYRTRNDARQDIFLYLEAFYNRKRRHAFLGYQSPEAFEQLFYQA 266 >UniRef50_B2SZK4 Transposase IS3/IS911 family protein n=10 Tax=root RepID=B2SZK4_BURPP Length = 378 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 56/296 (18%), Positives = 102/296 (34%), Gaps = 40/296 (13%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 V+ + + + +CR + ++ Y + R + GA S + A Sbjct: 101 VIRSLKKAWPVSLMCRLLKVPRSSYYAFAARPPKPGA----------------SPPLLKA 144 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT------ 133 + ++ + +G+R++ + L+ QG+ + + +LM L Sbjct: 145 VRQIHSESRSSYGSRRMAQALQQQGYAIGRY-RARSLMREAQLAVARRRTHRYRKAEGEA 203 Query: 134 --------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 +FE A NR+W D + G + ++D HSR + A E Sbjct: 204 LIAPNLLERQFEPGAINRVWAGDIT-YVRTRQGWSYLAIVMDLHSRRIVGWAFALQADTE 262 Query: 186 TVQQQLVSVFERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 V Q L ++ + D G + L GI SR + Sbjct: 263 LVIQALQQARQKRRPAAGLMFHSDQGCQYTSER-----FVGDLKANGIVQSMSRKGNCWD 317 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPGSR 298 +ERF RSLK+E + + + + +R + YN R H A + P Sbjct: 318 NAVVERFFRSLKSEWIGEQEYGNHELARRDIAGYIADFYNYRRIHSAANNLPPVRY 373 Score = 44.1 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 29/92 (31%), Gaps = 4/92 (4%) Query: 11 DTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSP 70 T + E V G I C G+ + +W+ +W + A R + Sbjct: 6 LTDEFKAEAVQLVVAQGYPITKACEALGVGDSALRRWVAQWRAQQAE--PPRSDVQIQRD 63 Query: 71 NRSSDDITALLRMAHDRHERWGARKIKRWLED 102 R ++ A + E +K+ L Sbjct: 64 QRRIRELEARVMELEREREIL--KKVYGLLRQ 93 >UniRef50_Q1WRA6 Transposase ISLasa15, IS3 family n=27 Tax=Firmicutes RepID=Q1WRA6_LACS1 Length = 275 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 53/286 (18%), Positives = 91/286 (31%), Gaps = 42/286 (14%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHER-W 91 + +R G+S + Y R +G + ++R I L+ + + + Sbjct: 1 MLKRIGLSKSGYY----RHKFKGLSKSKERQ-----------LKIKDLIMEIWEESQHLY 45 Query: 92 GARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGR---------------F 136 GA KI L G + + TV M + T R F Sbjct: 46 GAPKITAKLRKMGIQI-SERTVGKYMKELKIRAKYCKPFTVTTRDSNLNNKLKNTLDEQF 104 Query: 137 EHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFE 196 +AP+ +W +D + P G + +++D SR + T + + V Sbjct: 105 NPEAPDTIWCID-TTYIPTKEGFVYLTSIMDLFSRRIISWDLSTTLETTNIISLINKVKA 163 Query: 197 RYGL-PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLK 255 P + D GS + GI +S+ +P +E FH +K Sbjct: 164 TRKSNPKIIHSDRGSQFTSKEYCEIT-------AGITRSYSKKAYPWDNACIESFHAIIK 216 Query: 256 AEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRYQ 300 E L + + +R + YN R H P Q Sbjct: 217 RECLNEYNISSYNKARRLVFSYIDGFYNTVRIHSHCGYISPLEFEQ 262 >UniRef50_Q0ZCC0 Gag protein n=2 Tax=Populus trichocarpa RepID=Q0ZCC0_POPTR Length = 1886 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 77/191 (40%), Gaps = 17/191 (8%) Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 W +DF G FP G + L +D S++ + T++ + ++ ++ R+G Sbjct: 955 EIFECWGIDFMGPFPPSFGFLYILVAVDYVSKWIEAIPSRTNDHKTVIKFLKENILSRFG 1014 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEV- 258 +P M D G+ + + E + + GI + PYHPQT G++E +R +K + Sbjct: 1015 IPRAMISDGGTHFCN-----KPFESLMKKYGITHKVATPYHPQTSGQVELANREIKQILE 1069 Query: 259 --LQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYD 316 + S L A +RT Y +L M+ Y+ + E+ Sbjct: 1070 KTVNPNRKDWSLRLNDALWAYRTAYKT-----SLGMSP----YRLVYGKPCHLPVEIEHK 1120 Query: 317 EGVMVRKVDIS 327 ++ + + Sbjct: 1121 AYWAIKAFNSN 1131 Score = 98.4 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 5/83 (6%) Query: 164 TLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALE 223 + +D S++ + T++ + ++ ++ R+G+P M D G+ + + E Sbjct: 730 SGVDYVSKWIEAIPSRTNDHKTVIKFLKENILSRFGIPRAMISDGGTHFCN-----KPFE 784 Query: 224 LWLMRLGIRVGHSRPYHPQTQGK 246 + + GI + PYHPQT G+ Sbjct: 785 SLMKKYGITHKVATPYHPQTSGQ 807 >UniRef50_Q033F5 Transposase n=46 Tax=Streptococcaceae RepID=Q033F5_LACLS Length = 284 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. Identities = 54/286 (18%), Positives = 97/286 (33%), Gaps = 36/286 (12%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 N+R C+ + ++ Y+ + R P + R + ++ + Sbjct: 15 LNVRLSCQLLDVPESSYYQRINRH--------------PSKTQLRRQYLSLKISQLFNAN 60 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATG------------R 135 E +GA KI L QG + V LM + L + Sbjct: 61 REIYGAPKIHHLLLKQGEKVG-LKLVQKLMKQLQLKSVVIKKFKPGYSLSDHINRKNLIQ 119 Query: 136 FEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVF 195 E N++W D + P G + T++D +++ + E VQ+ L Sbjct: 120 TEPTKKNKVWSTDIT-YIPTQQGWAYLSTIMDRYTKKVIAWDLGKRMTVELVQRTLNKAI 178 Query: 196 ERYGLPDRM--TMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRS 253 + P+ + D GS + E L G+ SR +P LE +H Sbjct: 179 KSQDYPEAVILHSDQGSQYTSL-----EYEELLKYYGMTHSFSRRGYPYHNASLESWHGH 233 Query: 254 LKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 LK E + + + E ++ + YN +R H++L P Sbjct: 234 LKREWVYQFKYKNFEEAYQSIFWYIEAFYNSKRIHQSLGYLTPNQF 279 >UniRef50_A6LT97 Integrase, catalytic region n=5 Tax=Firmicutes RepID=A6LT97_CLOB8 Length = 271 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. Identities = 46/280 (16%), Positives = 86/280 (30%), Gaps = 37/280 (13%) Query: 33 LCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWG 92 +C+ + +T Y Q P S +L++ + + R+G Sbjct: 1 MCKVLHMPKSTYY--------------QSHHYTPSRRTLESETLKDQILKIYTESNRRYG 46 Query: 93 ARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP--------------ATGRFEH 138 A KI++ LE GH + + V M + + F Sbjct: 47 APKIQKILEQSGHKI-SIKRVQRFMKQLNIYSIVIKKFRPAKANRKVIERKNLLKQDFTT 105 Query: 139 DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERY 198 + N W D + G C+ +++D +R + + L + + Sbjct: 106 NKINEKWVGDITYIYTLKDGWCYLASVMDLCTRKIIGYSFSKTMDASVAVAALNNAYTLQ 165 Query: 199 GL--PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKA 256 D G + T L + ++ +SR P +E FH LK Sbjct: 166 QPVGSVIFHSDLGVQYTS-----TDFLNRLKKYKMKSSNSRKGCPYDNACIESFHSILKK 220 Query: 257 EVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVP 295 E + + D + + YN +R H ++ P Sbjct: 221 EQVNNVQYLDFESAKLDLFIFIESWYNRKRIHGSIGYITP 260 >UniRef50_B0BYN1 Integrase, catalytic region n=6 Tax=Bacteria RepID=B0BYN1_ACAM1 Length = 285 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. Identities = 47/306 (15%), Positives = 91/306 (29%), Gaps = 46/306 (15%) Query: 22 FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALL 81 Q +IR +C+ + Y H ++ A + Sbjct: 3 RQLQQDYSIRQICQVLNYPRSQVYY--------------------HARGQPDESELKAAI 42 Query: 82 RMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEH--- 138 + +G R+I L+ QG+ + V LM + G++ T EH Sbjct: 43 AGVAGAYPTYGYRRITAQLQRQGYCV-NHKRVARLMRQIGIMAKTKVKRKRTTNSEHSFP 101 Query: 139 -----------DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 D P ++W D + + ++D +R ++ Sbjct: 102 RYGNRVLNLSIDHPEQVWVADIT-YIRLQQEFVYLAVVMDVFTRAIRGWHLSRHIDQQLT 160 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + L E P+ D G + A L + +++ + G Sbjct: 161 LRALNKALE-RATPEIHHSDQGVQYA-----AAAYMQLLQQHQVQISMAEVGQAWQNGYA 214 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV-YNLERPHEALDMAVPGSR---YQPSA 303 ER R++K E + + + E + + Y +R H +L P ++ Sbjct: 215 ERLMRTIKEEEVDLSDYRNFTEAYEHIEQFLEDVYMHKRIHSSLGYLTPCEYEQQWRQQN 274 Query: 304 RQYSGN 309 Y N Sbjct: 275 NHYCMN 280 >UniRef50_A6BYW2 Integrase, catalytic region n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYW2_9PLAN Length = 269 Score = 194 bits (494), Expect = 4e-48, Method: Composition-based stats. Identities = 56/247 (22%), Positives = 94/247 (38%), Gaps = 14/247 (5%) Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL 122 + P + + RH R+G R+I R ++ G + ++ L R GL Sbjct: 7 SQRYQSEPPDDEPALLKQILDLVRRHPRFGYRRIGRMIQADGWKV-NLKRIYRLWRREGL 65 Query: 123 LPGASPGIPA--------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 + N +W DF G L++LD+++R L Sbjct: 66 KVPRKQKKKRALGTGANACHLRRAERKNHVWCWDFIFDRTETGTTLKWLSVLDEYTRECL 125 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 L E V L +F+ +G+P+ + DNGS + A+ WL ++G+ Sbjct: 126 VLKVDRHITSEDVINVLAELFKTHGVPEHIRSDNGSEFV-----AQAIREWLKQIGVETL 180 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 + P P G E FH ++ E + + F + ++ D W+ YN RPH +L Sbjct: 181 YIEPASPWENGYAESFHSRVRDEFMNCEIFENLRSARKQTDSWKEFYNEVRPHSSLGYLT 240 Query: 295 PGSRYQP 301 P Q Sbjct: 241 PRQFSQQ 247 >UniRef50_A1T428 Integrase, catalytic region n=2 Tax=Actinobacteria (class) RepID=A1T428_MYCVP Length = 297 Score = 194 bits (494), Expect = 4e-48, Method: Composition-based stats. Identities = 57/302 (18%), Positives = 95/302 (31%), Gaps = 37/302 (12%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R E + D +I +L G+S + Y W R R P D Sbjct: 3 RFELIAAECADH-DIATLTELLGVSRSGYYAWEARQ----------RRTEPTARQQWRRD 51 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT-- 133 +L + +G+ +I L G ++ + +TV +MA G+ + Sbjct: 52 LEVKILAHWKESKRTYGSPRITADLHAAGVSV-SVNTVAAIMAEMGIEGISPRTFKVKTT 110 Query: 134 --------------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHC 179 F+ + +W D + G G + + D+HSR L + Sbjct: 111 VVDPAASFPPDRVGRIFDQGRLDAVWTSDIT-YLTCGEGDAYLCAIRDEHSRRVLGWSVA 169 Query: 180 TDERRETVQQQLVSVFERYG---LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHS 236 R E V+Q + + G + D GS + L G+R Sbjct: 170 DHMRTELVEQAVDAAVFTRGGSVAGTILHSDRGSQYTS-----HDLAKACSDHGLRRSMG 224 Query: 237 RPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPG 296 E ++K E + F L D++ YN +R H +L M P Sbjct: 225 ATGICWDNAGSESLWSTVKHEYYKRHAFTTYANLTAGLDNYLHYYNHDRRHSSLGMISPI 284 Query: 297 SR 298 Sbjct: 285 DF 286 >UniRef50_C1F6R2 ISAca4, transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F6R2_ACIC5 Length = 308 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. Identities = 66/275 (24%), Positives = 103/275 (37%), Gaps = 31/275 (11%) Query: 28 ANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR 87 + R C+ G+ A+ + RP + ++ L + Sbjct: 1 MSERRACKLLGVDRASYRY-------------EPRPDR--------NAELRDELVKLARQ 39 Query: 88 HERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL----LPGASPGIPATGRFEHDAPNR 143 R+G R++ LE +G T+ V+ L A GL G + N+ Sbjct: 40 KPRYGYRRLHAVLERRGQTV-NVKRVYRLYAEEGLAVRRRRRKRLVRERVGEVQLIRANQ 98 Query: 144 LWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDR 203 W MDF G L+++D +R L L T V + L + E GLP+ Sbjct: 99 EWAMDFIVDGLANGRMVRILSVVDAFTRECLALEADTSLGSGRVTRALDRLIEERGLPEN 158 Query: 204 MTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKW 263 + DNG + + W I + H +P P G +E FH L+ E L W Sbjct: 159 VRSDNGPEFTSRR-----MLGWAEERKINLVHIQPGRPMQNGHVESFHGRLRDECLNVSW 213 Query: 264 FADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 F +++R D++R YN ERPH +L P Sbjct: 214 FRTLNDVRRTLDNYRQEYNCERPHSSLAYRTPAEF 248 >UniRef50_A0AXB8 Integrase, catalytic region n=27 Tax=Betaproteobacteria RepID=A0AXB8_BURCH Length = 279 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. Identities = 64/294 (21%), Positives = 101/294 (34%), Gaps = 36/294 (12%) Query: 13 MSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNR 72 + R E + ++ G + R CR G+S L++ ++ + G Sbjct: 4 PTGRREALEVLTRRGLSQRKACRYLGLSRRVAIYTLKQPEKDRSLG-------------- 49 Query: 73 SSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPA 132 L A R+G R+I WL S V + L Sbjct: 50 ------ERLIAASQEVPRFGYRRISAWL------SLGESRVRRMWRALKLNIPKRRPRRR 97 Query: 133 T-----GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 PN +W DF G L ++D+++R L + R + V Sbjct: 98 RCGSDIRLPGATKPNSVWSYDFVHDQLVDGRVLKMLCVIDEYTRECLAIEVGASLRSQDV 157 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 L + YG P + DNG+ + T + WL I P +P G + Sbjct: 158 ILVLSRLMRLYGKPAFIRSDNGAEF-----TAAKVMRWLRDAAIGPAFITPGNPWQNGFV 212 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP 301 E F+ L+ E+L +WF E + + WR YN RPH A P + + Sbjct: 213 ESFNGKLRDELLNREWFRSRAEAKVLIERWRQFYNERRPHSAHRYQPPATVRRA 266 >UniRef50_A8L908 Integrase catalytic region n=8 Tax=Actinomycetales RepID=A8L908_FRASN Length = 304 Score = 194 bits (493), Expect = 6e-48, Method: Composition-based stats. Identities = 54/290 (18%), Positives = 88/290 (30%), Gaps = 40/290 (13%) Query: 29 NIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRH 88 ++ +C +S + Y+W R A + R T ++R Sbjct: 26 SVIQMCAWLSVSRSGFYEWRDRPASA--------------TATRRGALATLVVRSFTASD 71 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------------- 133 +G R++ L G + V LM GL+P T Sbjct: 72 GTYGYRRVHSDLLAWG-RPCSPELVRALMREQGLVPCQPRPWRHTLTEPGQTPAAIPDLL 130 Query: 134 -GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLV 192 F D P D + P G + T++D H++ + A + + ++ + Sbjct: 131 QRDFTADTPGTKMVGDIT-YIPTWEGWLYLATVIDCHTKAVIGWAMDDNYKTGLIETAIT 189 Query: 193 SVFERYGLP--DRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 + L D GS + T L LGIR R E F Sbjct: 190 MAARNHQLTDGAIFHSDRGSNYTS-----TQFAATLKNLGIRQSVGRTGICYDNAMAESF 244 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSRY 299 +LK E + + +R + YN R H LD P + Sbjct: 245 FAALKNERVHRTAYPTREHARRDIARYIELRYNTTRRHSGLDYRTPQQVH 294 >UniRef50_A5BNX4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BNX4_VITVI Length = 1468 Score = 193 bits (492), Expect = 6e-48, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 80/244 (32%), Gaps = 47/244 (19%) Query: 116 LMARHGLLPGASPGIPATGRFEHD----APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSR 171 + + +W +DF G FP G + L +D S+ Sbjct: 1160 IRKSCDRCQRLGKLAKGNQMPMNPILIVELFDVWGIDFMGPFPMSFGNSYILVGVDYVSK 1219 Query: 172 FSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGI 231 + + ++ R ++ ++F R+G+P + D G+ + + E L + G+ Sbjct: 1220 WVEAIPCKQNDHRVVLKFLKENIFSRFGVPKAIISDGGTHFCN-----KPFETLLAKYGV 1274 Query: 232 RVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALD 291 + + PYHPQT G++E + + + Y L Sbjct: 1275 KHKVATPYHPQTFGQVE------------PSKQGNKEHIDES------AYKTI-----LG 1311 Query: 292 MAVPGSRYQPSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGLK 351 M+ Y + EY ++K++ + +A + L Sbjct: 1312 MSP----YCLVYGKVCHLPVEVEYKAWWAIKKLN-----------MDLIRAGAKRCLDLN 1356 Query: 352 EMQE 355 EM+E Sbjct: 1357 EMEE 1360 >UniRef50_Q2P621 ISXoo3 transposase n=194 Tax=Proteobacteria RepID=Q2P621_XANOM Length = 349 Score = 193 bits (492), Expect = 6e-48, Method: Composition-based stats. Identities = 54/293 (18%), Positives = 88/293 (30%), Gaps = 32/293 (10%) Query: 16 RTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD 75 R V GA+ R G+S + P N Sbjct: 75 RRALVREWIAGGASERCALAAIGMSASALRY------------------RPREDRNV--- 113 Query: 76 DITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP---- 131 ++ + RH R+G I L +G + + V L L Sbjct: 114 ELRERILALAHRHRRYGVGMISLKLRQEGRLV-NYKRVERLYCEQQLQVRRRKRKKVPLG 172 Query: 132 -ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQ 190 N +W MD G L ++DD + ++ + V + Sbjct: 173 ERAPLLRPTKANPVWSMDVVFDRTAEGRAIKCLVIVDDATHEAVAIDVERAISGHGVVRV 232 Query: 191 LVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF 250 L + GLP + DNG + A+ W +++ +P P +E F Sbjct: 233 LDRLAHSRGLPKMIRTDNGKEFCG-----KAMVAWAHANRVQLRQIQPGKPNQNAYVESF 287 Query: 251 HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSA 303 + L+ E L F + + WR YN RP + + P + Q A Sbjct: 288 NGRLRDECLNKHGFPTLLHARTEIERWRREYNEHRPKKTIGGMTPAAYAQQLA 340 >UniRef50_A1SU53 Integrase, catalytic region n=15 Tax=Proteobacteria RepID=A1SU53_PSYIN Length = 276 Score = 193 bits (492), Expect = 7e-48, Method: Composition-based stats. Identities = 48/285 (16%), Positives = 89/285 (31%), Gaps = 40/285 (14%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDR-H 88 +R +C G+ Y +L+R + D+ +++ + Sbjct: 1 MRVICPLIGVEIRNYYSYLKRRETKKFD--------------PDHGDMIEMIQKISESSD 46 Query: 89 ERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPAT--------------- 133 +G+R++++ L + + T LM G+ T Sbjct: 47 HTYGSRRMQKSLNALSFPVGRWKT-AQLMKEAGVWVRYKKKYKVTTNSEHKKPVYKNVLK 105 Query: 134 GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 F++D PN+ + D + G + ++D +SR + + + + V L Sbjct: 106 QNFKYDQPNKAYVGDIT-YIWTTEGWLYLAVIIDLYSRKVVGWSMSSRMKASLVCDALTR 164 Query: 194 VFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFH 251 + P + D G + L S+ E F Sbjct: 165 AIWQRNPPPGLIVHSDQGVQYASD-----QYRKLLKSHNFIGSMSKKGCCWDNAVAESFF 219 Query: 252 RSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVP 295 SLK E + +A E Q+ H+ T YN R H L P Sbjct: 220 GSLKQERVHWNNYATRYEAQQDILHYITMWYNSNRLHSYLGYCCP 264 >UniRef50_A4A249 Transposase orfB n=4 Tax=Planctomycetaceae RepID=A4A249_9PLAN Length = 279 Score = 193 bits (492), Expect = 7e-48, Method: Composition-based stats. Identities = 53/259 (20%), Positives = 88/259 (33%), Gaps = 14/259 (5%) Query: 63 PRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL 122 + P +T + + RWG R+I + L +G T+ ++ L GL Sbjct: 8 SQRFEGKPKDEDVRLTKRILELVRQRPRWGYRQICQLLRREGETL-NMKKMYRLWKAAGL 66 Query: 123 LPGASPGIPA--------TGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSL 174 + +W DF G L ++D+++R L Sbjct: 67 KVPQKRRKKRATGVSTNACHVQPAGFRHDVWTWDFIQSSTIDGRTIRFLNIVDEYTRQCL 126 Query: 175 CLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVG 234 + E L +F +G+P R+ DNG + A++ WL +G+ V Sbjct: 127 AIKVGRSITSEDAIDTLAELFAMHGVPKRIRCDNGPEFIS-----CAIKTWLDLIGVEVL 181 Query: 235 HSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAV 294 + P P G F+ L+ E L + + WR +N RPH +L Sbjct: 182 YIEPGSPWQNGLCVSFNSRLRDEYLHQTDLLSLEDARIKARAWREDFNHNRPHSSLGYLT 241 Query: 295 PGSRYQPSARQYSGNTTPP 313 P + A S P Sbjct: 242 PAEFARRCAASTSVAALLP 260 >UniRef50_C1XUW8 Transposase n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XUW8_9DEIN Length = 278 Score = 193 bits (492), Expect = 7e-48, Method: Composition-based stats. Identities = 66/290 (22%), Positives = 109/290 (37%), Gaps = 43/290 (14%) Query: 23 ASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLR 82 A ++ +R LCR G+ +T Y + +G PN + LR Sbjct: 2 ALKEAYPLRLLCRALGVPRSTLY-----YRSKG--------------PNPEEAVLRGRLR 42 Query: 83 MAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAP- 141 R+G R++ L +G + V +LM R GLL P P T E P Sbjct: 43 ELAGAWPRYGYRRLAALLRGEGFGVG-EKRVRSLMRREGLLLTRKPLKPRTTLPEELLPE 101 Query: 142 --------------NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETV 187 +++W D G G + ++D H+R L +A + + Sbjct: 102 GVPNLLLGLEVTGFHQVWVADLSYVVL-GEGVAYLAVVMDLHTRKILGVALGPRLS-QGL 159 Query: 188 QQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 + + R G P+ D G + A L+ LG+R+ ++ P G Sbjct: 160 ALAALEMALREGCPEVHHSDRGVQYTSR-----AYVERLLGLGVRLSYAGTGRPWENGHA 214 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWR-TVYNLERPHEALDMAVPG 296 ER R++K E + + + E + + + + VYN +RPH AL P Sbjct: 215 ERLIRTVKEEWVDLREYRTLEEARASVEAFVFEVYNRKRPHSALGYLTPE 264 >UniRef50_B5YKC0 Putative transposase n=3 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKC0_THEYD Length = 286 Score = 193 bits (491), Expect = 8e-48, Method: Composition-based stats. Identities = 55/299 (18%), Positives = 100/299 (33%), Gaps = 31/299 (10%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSD-DI 77 V ++G ++ C+ G+S + Y ++ P + ++ +I Sbjct: 1 MVKQLLKEGYTVKESCKASGLSRSRYYSFINL--------------REIEKPKKINEIEI 46 Query: 78 TALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLP----GASPGIPAT 133 ++ H WG R++ WL + + TV +M H LL + Sbjct: 47 LEKIKAIKSEHPFWGYRRVTAWLRHREGVLINHKTVSKIMKEHSLLASQTVHKAKRKAEG 106 Query: 134 GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVS 193 + P +W +D G + + +LD +++ + R + L Sbjct: 107 RKPRTQRPKEIWGIDMTKFMIPCIGWAYLVVVLDWYTKKIVGWEISLRGRTAEWKSALDK 166 Query: 194 VFER------YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKL 247 G ++ DNGS T A + LGI + +P+ Sbjct: 167 GLVSEFKEGVRGRGLKLVSDNGSQ-----PTSRAFMKEMAVLGIEQIFTSYDNPKGNADT 221 Query: 248 ERFHRSLKAEVLQGKWFADSGELQRAFDHWRTV-YNLERPHEALDMAVPGSRYQPSARQ 305 ER R++K E++ F E + + W T YN H AL P R+ Sbjct: 222 ERVIRTIKEELIWLNEFRSLDEARERIEDWITNCYNKLYVHSALGYLSPEEYELKYYRE 280 >UniRef50_C1F2E9 IS3 family transposase orfB n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2E9_ACIC5 Length = 309 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. Identities = 57/295 (19%), Positives = 102/295 (34%), Gaps = 29/295 (9%) Query: 12 TMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPN 71 + R +F ++ R C + + A R Sbjct: 4 PQAEREAVRVFREATKSSERRACGQLEVVRAMVRY---RPRASRYEA------------- 47 Query: 72 RSSDDITALLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP 131 +++ + LR D RWG R++ L+ +G + + V+ + L+ Sbjct: 48 -ANEKLRKRLRELADERRRWGYRRLHILLKREGWKVNS-KRVYRIYVEEKLVVRRRRRRR 105 Query: 132 ------ATGRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 N W MDF G + L++ D ++R L + T Sbjct: 106 RVCAQARVPLLPPTRLNETWTMDFLHDALANGRKLRTLSIEDAYTREMLAIEVDTSLPAL 165 Query: 186 TVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQG 245 V + L + GLP+R+ +D+G+ + L+ W + + + P P G Sbjct: 166 RVVRVLERLRLERGLPERIVIDHGTEFTS-----KLLDQWAYKNQVTLHFITPGLPMENG 220 Query: 246 KLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ 300 +E FH + E L WF + ++ + WR YN RPH +L P + Sbjct: 221 YIESFHGKFREECLNEHWFLMLDDARQTIESWRIDYNWVRPHSSLGYLTPEEFRR 275 >UniRef50_A1VJC3 Integrase, catalytic region n=25 Tax=Bacteria RepID=A1VJC3_POLNA Length = 325 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. Identities = 63/287 (21%), Positives = 97/287 (33%), Gaps = 36/287 (12%) Query: 30 IRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHE 89 + C GIS A Y R + + + L+ + RH Sbjct: 1 MSRQCVLAGISRAALYA-----------------RRKPKRIVQDDELLLRLIDEEYTRHP 43 Query: 90 RWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEH----------- 138 +G+RK+ L GH++ LM GL A + +H Sbjct: 44 FYGSRKMVVHLGRCGHSV-NRKWAQRLMRSLGLAGMAPGPNTSRAHPQHKVYPYLLRGVA 102 Query: 139 -DAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFER 197 PN++W D + G + + ++D +SR L L Sbjct: 103 ISRPNQVWSTDIT-YIRLARGFAYLVAVIDWYSRRVLSWRISNSMETVFCVDCLEEALRI 161 Query: 198 YGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAE 257 +G P+ D GS + A L R G+ + +ER RS+K E Sbjct: 162 HGKPEVFNTDQGSQFTSE-----AFTSVLKREGVIISMDGRGRALDNIFVERLWRSVKHE 216 Query: 258 VLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSAR 304 + K ++ GEL + T YN ERPH+AL P YQ + Sbjct: 217 DVYLKGYSAMGELLIGLTQYFTFYNGERPHQALKNLTPDVVYQRAQG 263 >UniRef50_A8LT45 Integrase n=5 Tax=Bacteria RepID=A8LT45_DINSH Length = 497 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. Identities = 84/413 (20%), Positives = 137/413 (33%), Gaps = 54/413 (13%) Query: 5 MPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPR 64 +P D R R + D + + R GI T +W R+ + G AGL PR Sbjct: 18 LPEDRRAEALRRFNILRQHLIDEVPLTEVARVSGIPLRTLQRWTSRYQRFGLAGLARAPR 77 Query: 65 IPHHSPNRSSDDITALLRMAHDRHERWG----ARKIKRWLEDQGHTMPAFSTVHNLMA-- 118 R S ++ L+ R R+I ++ + +P+++T+H+++ Sbjct: 78 SDAGQ-RRLSSELVELIEGLALHKPRLSTAAIHRRIIPIVKSRDWPVPSYATIHSIVNSL 136 Query: 119 -------RHGLLPGASPGIPATGRFEHDAPNRLWQMDFKG---HFPFGGG---RCHPLTL 165 H R + PN +WQ D G R + Sbjct: 137 DPALVTLAHDGAAAYRDRFEMIHRHRAERPNAVWQTDHTQLDLIILDTNGAPVRPWLTIV 196 Query: 166 LDDHSRFSLCLAHC-TDERRETVQQQLVSVFER--------YGLPDRMTMDNGSPWGDTT 216 LDDHSR A L R GLPD + D+GS + Sbjct: 197 LDDHSRAVAGYAVFVGAPSAIQTALALRQAIWRKDTPSWPICGLPDVLYTDHGSDFTSKH 256 Query: 217 GTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQG------------KWF 264 LE L I + S PQ +GK+ERF ++ E+L Sbjct: 257 -----LEQVAADLRIELVFSTVGRPQGRGKIERFFGTINTELLPELPGALSNGKPASPPR 311 Query: 265 ADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQP---SAR-QYSGNTTPPEYDEGVM 320 GEL+ A + T R H +D P ++ R S + Sbjct: 312 LSLGELEVAVKTFVTAVYNARKHSEID-VPPNEAWRGDGWLPRMPNSLEQLDLLLVMALK 370 Query: 321 VRKVDISGKLSVKGVSLS--AGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVI 371 R+V G + +G+ + A+ G+ V ++ D + ++ + Sbjct: 371 TRQVRRDG-IRFQGLLYTDPTLAAYVGKTVNIRYDPRDITELRVFHRDRFLCR 422 >UniRef50_Q8NL32 Predicted transposase n=7 Tax=Corynebacterium RepID=Q8NL32_CORGL Length = 500 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 77/339 (22%), Positives = 126/339 (37%), Gaps = 22/339 (6%) Query: 4 LMPWDARDTMSLRTEFVL--FASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQD 61 +MP R + + + + +I C R IS + Y R+ Q+ A L Sbjct: 19 IMPK--PLPPETRRKIIDFDPFAPNSPSIEEFCSRLKISRRSFYNIRNRYQQDANAALHP 76 Query: 62 RPRIPHHSPNRSSDDITALLRMAHDRHER----WGARKIKRWLEDQGH---TMPAFSTVH 114 P + + IT+ L R + +G I+ G +P+ ST+ Sbjct: 77 HSSAPITARRTYDESITSTLLSIRARLKAQGWEYGPISIRFEGISTGELTAPIPSVSTIA 136 Query: 115 NLMARHGLLPGASPGIPAT--GRFEHDAPNRLWQMDFKGHFP--FGGGRCHPLTLLDDHS 170 L+ G + P + RF+ +WQ+D + R +LDD + Sbjct: 137 RLLRAAGAVESNPKKRPKSSVVRFQRGQAMEMWQIDGFIYTLHDTDLTRVTIYQILDDAT 196 Query: 171 RFSLC-LAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPW-GDTTGTWTALELWLMR 228 RF + +E + L +G P + DNGS + G +LE +L Sbjct: 197 RFDVGTCVFPANENSVDARTALEQAIAHFGAPHELLSDNGSAFNRMRQGYVGSLESYLAT 256 Query: 229 LGIRVGHSRPYHPQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHE 288 +G +P HPQTQGK ER HR+L LQ E + +R YN RPH+ Sbjct: 257 VGCLSITGKPGHPQTQGKNERSHRTLFR-FLQAHQPHTLEECAHYIEQFRDHYNNRRPHQ 315 Query: 289 AL-DMAVPGSRYQP---SARQYSGNTTPPEYDEGVMVRK 323 L + P + ++ +Q + + R+ Sbjct: 316 GLPNNLTPAAAWEIVGCVEQQPPIDPVVLQQQADHYARR 354 >UniRef50_UPI0001B46226 putative transposase n=1 Tax=Mycobacterium intracellulare ATCC 13950 RepID=UPI0001B46226 Length = 288 Score = 192 bits (489), Expect = 2e-47, Method: Composition-based stats. Identities = 57/296 (19%), Positives = 88/296 (29%), Gaps = 37/296 (12%) Query: 19 FVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDIT 78 + GA + C+ GI+ YK R P +T Sbjct: 1 MIDRLVDAGAPVDRCCKVLGITRQNYYKHK---------------RTPTTPTQLRRQWLT 45 Query: 79 ALLRMAH-DRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGL--LPGASPGIPAT-- 133 L+R H +G R+I L TV LM + G+ LPG + Sbjct: 46 GLIREVHVASRGTYGYRRIHAELTLGMGITVCSRTVSVLMTQAGIYGLPGPTRLKRLRGV 105 Query: 134 --------GRFEHDAPNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRE 185 +F PN LW D H G + +LD SR + + + Sbjct: 106 VTADDLVNRKFHRLHPNELWVTDITQH-RTREGWLYCCAVLDAFSRRIVGWSMDSRADST 164 Query: 186 TVQQQLVSVFERYGLPD--RMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQT 243 V L P + D+G+ + + G+ Sbjct: 165 LVVNALDMAIRNRRPPAGGIVHADHGTQFTSWV-----FGEKIRAAGLVPSFGTIGDGLD 219 Query: 244 QGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 +E F S++ E+L + + EL A + +N R H AL P Sbjct: 220 NAMMESFWSSMQIELLDRRKWRTRVELANAIFDYLEIFHNRRRRHSALGYRTPIEY 275 >UniRef50_A1JLT7 Transposase for insertion element IS1222 n=8 Tax=Yersinia RepID=A1JLT7_YERE8 Length = 249 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 62/279 (22%), Positives = 94/279 (33%), Gaps = 48/279 (17%) Query: 20 VLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITA 79 +L G + R CR G+S +T R+ + A ++ Sbjct: 1 MLMCDATGLSQRRACRLTGLSLSTC-----RYEAQRPAA---------------DAHLSG 40 Query: 80 LLRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHD 139 + R+G R+I + L +G Sbjct: 41 RIIELALERRRFGYRRIWQLLRRKGLAT-----------------------ERLPLLRPA 77 Query: 140 APNRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYG 199 APN W MDF G R LT +DD ++ L + V + L S+ G Sbjct: 78 APNLTWSMDFVMDALATGRRIKCLTCVDDFTKECLTVTVAFGISGVQVTRILDSIALFRG 137 Query: 200 LPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 P + D G + T AL+ W G+ + +P P G +E F+ + E L Sbjct: 138 YPATIRTDQGPEF-----TCRALDQWAFEHGVELRLIQPGKPTQNGFIESFNGRFRDECL 192 Query: 260 QGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSR 298 WF+D ++ WR YN RPH AL+ P Sbjct: 193 NEHWFSDVSHARKTISEWRQDYNECRPHSALNYQTPSEF 231 >UniRef50_B6BJU9 Transposase n=3 Tax=Campylobacterales bacterium GD 1 RepID=B6BJU9_9PROT Length = 270 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 82/280 (29%), Gaps = 39/280 (13%) Query: 37 FGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKI 96 + + Y+WL + Q + + + + + +G R I Sbjct: 1 MQVHRSGYYQWLNQPISNRELENQ--------------ELLIQIKEAYKESNGVYGHRNI 46 Query: 97 KRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIP---------------ATGRFEHDAP 141 + L++ G + V LM+ L + F P Sbjct: 47 HKDLKELGIHV-NKKRVARLMSEAKLYGVGTYKRKPYSKAGPVHKAHPNHLHQCFISGKP 105 Query: 142 NRLWQMDFKGHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL- 200 N W D + G T++D +SR + + + + L R Sbjct: 106 NDTWVSDIT-YIRTKEGWIFLATVIDLYSRKIIGWSTGHRQTTSLIISALKMAVARLSKD 164 Query: 201 -PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERFHRSLKAEVL 259 + D GS + + + I + SR + E F ++LK E++ Sbjct: 165 GKVILHSDQGSQYSSY-----EYKKFAKEHNIILSMSRRGNCYDNAVAESFFKTLKKELV 219 Query: 260 QGKWFADSGELQRAFDHWRT-VYNLERPHEALDMAVPGSR 298 + + F + YN +R H LD P Sbjct: 220 RKQIFLTREIAASKIFEYIEMFYNSKRRHSYLDYISPNEF 259 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.152 0.432 Lambda K H 0.267 0.0463 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,305,394,279 Number of Sequences: 3077464 Number of extensions: 97008989 Number of successful extensions: 402135 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 5179 Number of HSP's successfully gapped in prelim test: 5829 Number of HSP's that attempted gapping in prelim test: 378063 Number of HSP's gapped (non-prelim): 13779 length of query: 384 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 254 effective length of database: 640,326,036 effective search space: 162642813144 effective search space used: 162642813144 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 94 (40.6 bits)