BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (376 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1


                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_Q47688 Putative uncharacterized protein ykfC n=18 Tax=P...   772   0.0
UniRef50_B0EYP4 Reverse transcriptase-like protein n=48 Tax=cell...   744   0.0
UniRef50_B9K440 18S rRNA intron 1 protein n=1 Tax=Agrobacterium ...   287   4e-76
UniRef50_C3B599 Group II intron-encoded protein LtrA n=3 Tax=cel...   212   1e-53
UniRef50_A9B955 RNA-directed DNA polymerase n=1 Tax=Herpetosipho...   211   3e-53
UniRef50_D2QVU8 RNA-directed DNA polymerase (Reverse transcripta...   211   5e-53
UniRef50_A7VUZ6 Putative uncharacterized protein n=1 Tax=Clostri...   209   1e-52
UniRef50_A4XCB1 RNA-directed DNA polymerase n=7 Tax=Actinomyceta...   207   6e-52
UniRef50_C3QDF8 RNA-directed DNA polymerase n=9 Tax=Bacteroidale...   206   1e-51
UniRef50_Q02718 Reverse transcriptase homologue COI iA grp II pr...   206   1e-51
UniRef50_Q9T654 Cox1I1a maturase (Fragment) n=2 Tax=cellular org...   203   7e-51
UniRef50_B0I1N9 Reverse transcriptase homolog n=7 Tax=cellular o...   203   7e-51
UniRef50_B9K434 Reverse transcriptase-like protein n=9 Tax=Prote...   202   2e-50
UniRef50_Q56VE2 MatR n=6 Tax=Bacteria RepID=Q56VE2_BACFR              200   8e-50
UniRef50_C0YLQ1 RNA-directed DNA polymerase n=40 Tax=Bacteria Re...   196   2e-48
UniRef50_Q1J4S7 Reverse transcriptase / RNA maturase / Endonucle...   195   3e-48
UniRef50_B1L2I7 Reverse transcriptase/endonuclease protein n=37 ...   194   3e-48
UniRef50_Q2G689 RNA-directed DNA polymerase n=5 Tax=Bacteria Rep...   194   5e-48
UniRef50_B8FP60 RNA-directed DNA polymerase n=3 Tax=Firmicutes R...   193   9e-48
UniRef50_A0L945 RNA-directed DNA polymerase (Reverse transcripta...   193   1e-47
UniRef50_B4WUH1 Group II intron, maturase-specific domain family...   192   1e-47
UniRef50_C8VXL4 RNA-directed DNA polymerase (Reverse transcripta...   192   2e-47
UniRef50_Q92Y56 Reverse transcriptase n=3 Tax=Alphaproteobacteri...   192   2e-47
UniRef50_UPI00019088D4 mobile mitochondrial group II intron of C...   190   9e-47
UniRef50_Q3ESS6 Reverse transcriptase / RNA maturase / Endonucle...   189   1e-46
UniRef50_Q3CZ44 Prophage LambdaSa1, reverse transcriptase/matura...   188   2e-46
UniRef50_A8MI91 RNA-directed DNA polymerase (Reverse transcripta...   187   4e-46
UniRef50_B9IYU4 Reverse transcriptase n=21 Tax=Bacteria RepID=B9...   187   4e-46
UniRef50_C3FAV3 RNA-directed DNA polymerase n=11 Tax=Bacteria Re...   186   8e-46
UniRef50_Q9MD87 Putative maturase n=1 Tax=Cryphonectria parasiti...   186   1e-45
UniRef50_P0A3U1 DNA endonuclease n=8 Tax=Firmicutes RepID=LTRA_L...   186   1e-45
UniRef50_A3WXE2 Putative reverse transcriptase n=1 Tax=Nitrobact...   184   3e-45
UniRef50_A5CZB8 Retron-type reverse transcriptase n=20 Tax=Bacte...   184   3e-45
UniRef50_C6J6N9 RNA-directed DNA polymerase n=1 Tax=Paenibacillu...   184   5e-45
UniRef50_C6CA58 RNA-directed DNA polymerase n=3 Tax=Enterobacter...   184   5e-45
UniRef50_C8R0Q7 RNA-directed DNA polymerase (Reverse transcripta...   184   5e-45
UniRef50_C5CID2 RNA-directed DNA polymerase (Reverse transcripta...   183   8e-45
UniRef50_C0JX29 Putative reverse transcriptase and intron matura...   183   1e-44
UniRef50_C5RJ16 RNA-directed DNA polymerase (Reverse transcripta...   181   4e-44
UniRef50_B0TA92 Reverse transcriptase (RNA-dependent DNA polymer...   181   5e-44
UniRef50_A5CZJ0 Retron-type reverse transcriptase n=1 Tax=Peloto...   181   5e-44
UniRef50_Q02717 Reverse transcriptase homologue COI ialpha grp I...   178   3e-43
UniRef50_UPI0001C42942 reverse transcriptase n=1 Tax=Bacillus ps...   177   5e-43
UniRef50_P38478 Uncharacterized mitochondrial protein ymf40 n=1 ...   176   1e-42
UniRef50_A5ZWA2 Putative uncharacterized protein n=2 Tax=Clostri...   176   1e-42
UniRef50_Q35056 CoxII intron2 ORF n=4 Tax=Embryophyta RepID=Q350...   175   2e-42
UniRef50_A9AUN7 RNA-directed DNA polymerase n=4 Tax=Bacteria Rep...   175   2e-42
UniRef50_C9S0G0 RNA-directed DNA polymerase n=2 Tax=Geobacillus ...   175   2e-42
UniRef50_D2M2V6 RNA-directed DNA polymerase (Reverse transcripta...   174   4e-42
UniRef50_O47500 RT-like protein n=1 Tax=Venturia inaequalis RepI...   174   5e-42
UniRef50_A8VT23 S-layer domain protein n=12 Tax=Bacilli RepID=A8...   174   6e-42
UniRef50_B8FP59 RNA-directed DNA polymerase n=9 Tax=Firmicutes R...   173   9e-42
UniRef50_A9IAV6 Mobile mitochondrial group II intron of COX1 whi...   173   1e-41
UniRef50_Q3A299 Prophage LambdaSa1, reverse transcriptase/matura...   173   1e-41
UniRef50_B7I148 Reverse transcriptase n=9 Tax=Bacillus RepID=B7I...   172   1e-41
UniRef50_B9M2H7 RNA-directed DNA polymerase (Reverse transcripta...   172   1e-41
UniRef50_C3FJT8 RNA-directed DNA polymerase (Reverse transcripta...   172   2e-41
UniRef50_Q1Q0X4 Similar to Group II intron encoded reverse trans...   172   2e-41
UniRef50_A7BUU9 RNA-directed DNA polymerase n=1 Tax=Beggiatoa sp...   172   3e-41
UniRef50_B4D9Y9 RNA-directed DNA polymerase (Reverse transcripta...   171   3e-41
UniRef50_Q24QQ9 Putative uncharacterized protein n=1 Tax=Desulfi...   171   4e-41
UniRef50_A9BGC0 RNA-directed DNA polymerase (Reverse transcripta...   171   4e-41
UniRef50_C3B585 Reverse transcriptase/endonuclease protein n=1 T...   171   5e-41
UniRef50_B2A0J9 RNA-directed DNA polymerase (Reverse transcripta...   170   6e-41
UniRef50_C9P0Q5 Retron-type reverse transcriptase n=3 Tax=Vibrio...   169   2e-40
UniRef50_A9IAY4 Reverse transcriptase n=38 Tax=Bacteria RepID=A9...   168   3e-40
UniRef50_D2CJC8 Putative uncharacterized protein orf2 (Fragment)...   168   3e-40
UniRef50_C4ZCX5 RNA-directed DNA polymerase n=24 Tax=Bacteria Re...   167   6e-40
UniRef50_Q08WW1 Prophage LambdaSa1, reverse transcriptase/matura...   166   9e-40
UniRef50_A4KVN1 Probable reverse transcriptase n=2 Tax=Sinorhizo...   166   1e-39
UniRef50_P03876 Putative COX1/OXI3 intron 2 protein n=3 Tax=Sacc...   166   1e-39
UniRef50_B4D379 RNA-directed DNA polymerase (Reverse transcripta...   166   1e-39
UniRef50_C7RV41 RNA-directed DNA polymerase (Reverse transcripta...   165   3e-39
UniRef50_B1VA32 Retron-type reverse transcriptase n=7 Tax=Candid...   165   3e-39
UniRef50_Q5U7I7 Maturase-related protein n=20 Tax=Gammaproteobac...   164   6e-39
UniRef50_C9B0U1 RNA directed DNA polymerase n=2 Tax=Enterococcus...   163   1e-38
UniRef50_B1HW67 Possible group II intron reverse transcriptase/m...   163   1e-38
UniRef50_Q3S275 ORF718 n=2 Tax=Eukaryota RepID=Q3S275_THAPS           162   1e-38
UniRef50_A7GTD4 RNA-directed DNA polymerase n=1 Tax=Bacillus cyt...   162   2e-38
UniRef50_C1PA09 RNA-directed DNA polymerase n=3 Tax=Firmicutes R...   161   3e-38
UniRef50_Q1PUN9 Strong similarity to group II intron-encoded pro...   161   3e-38
UniRef50_B4D301 RNA-directed DNA polymerase (Reverse transcripta...   160   5e-38
UniRef50_Q3A4Z2 Group II intron-encoding maturase n=98 Tax=Bacte...   159   1e-37
UniRef50_Q188V0 Group II intron reverse transcriptase/maturase n...   159   2e-37
UniRef50_Q94Z00 Orf757 n=4 Tax=stramenopiles RepID=Q94Z00_PYLLI       159   2e-37
UniRef50_A6DJK4 Reverse transcriptase/maturase n=21 Tax=Chlamydi...   159   2e-37
UniRef50_C3BJV7 D-alanine--D-alanine ligase A (D-alanylalanine s...   159   2e-37
UniRef50_C2KES2 Reverse transcriptase/maturase n=14 Tax=Firmicut...   158   2e-37
UniRef50_C3LL08 Group II intron reverse transcriptase/maturase n...   158   3e-37
UniRef50_B9J6F8 Reverse transcriptase n=7 Tax=Bacillus cereus gr...   158   3e-37
UniRef50_C2XKK9 D-alanine--D-alanine ligase A (D-alanylalanine s...   158   3e-37
UniRef50_Q1QGR6 RNA-directed DNA polymerase (Reverse transcripta...   158   4e-37
UniRef50_Q024N3 RNA-directed DNA polymerase n=6 Tax=Bacteria Rep...   158   4e-37
UniRef50_Q5ZTU1 Reverse transcriptase n=1 Tax=Legionella pneumop...   157   4e-37
UniRef50_B7JTB6 Group II intron reverse transcriptase/maturase n...   157   8e-37
UniRef50_B2AJV8 RNA-directed DNA polymerase, retrotranscriptase ...   156   1e-36
UniRef50_Q35062 CoxI intron2 ORF n=2 Tax=Marchantia polymorpha R...   156   1e-36
UniRef50_Q82RB7 Putative reverse transcriptase homolog; similar ...   155   3e-36
UniRef50_B1N1A3 NicA n=1 Tax=Pseudomonas putida RepID=B1N1A3_PSEPU    154   3e-36
UniRef50_A6LY84 RNA-directed DNA polymerase (Reverse transcripta...   154   3e-36
UniRef50_B1C301 Putative uncharacterized protein n=6 Tax=Clostri...   154   4e-36
UniRef50_B4CYA7 RNA-directed DNA polymerase (Reverse transcripta...   154   4e-36
UniRef50_A5VLF2 RNA-directed DNA polymerase (Reverse transcripta...   154   4e-36
UniRef50_Q1VQM5 Prophage LambdaSa1, reverse transcriptase/matura...   154   5e-36
UniRef50_D0LS09 RNA-directed DNA polymerase n=1 Tax=Haliangium o...   154   5e-36
UniRef50_Q11ZP4 RNA-directed DNA polymerase n=33 Tax=Bacteria Re...   154   5e-36
UniRef50_D2FQY0 Regulatory protein GntR n=2 Tax=Staphylococcus a...   154   7e-36
UniRef50_C6IQ61 Putative uncharacterized protein n=10 Tax=Bacter...   154   7e-36
UniRef50_Q01P79 RNA-directed DNA polymerase (Reverse transcripta...   153   8e-36
UniRef50_Q64E53 Prophage LambdaSa1 transcriptase/maturase family...   153   9e-36
UniRef50_A6YEC9 Putative reverse transcriptase and intron matura...   153   1e-35
UniRef50_A7BYN3 RNA-directed DNA polymerase n=2 Tax=Beggiatoa sp...   153   1e-35
UniRef50_D2CK02 Putative uncharacterized protein orf3 (Fragment)...   152   2e-35
UniRef50_A5VH22 RNA-directed DNA polymerase n=1 Tax=Sphingomonas...   152   2e-35
UniRef50_C3EEI5 Group II intron reverse transcriptase/maturase n...   152   2e-35
UniRef50_A1T776 RNA-directed DNA polymerase n=1 Tax=Mycobacteriu...   152   2e-35
UniRef50_B8R181 Putative intron-encoded reverse transcriptase n=...   152   2e-35
UniRef50_Q7YAJ3 Putative reverse transcriptase and intron matura...   152   2e-35
UniRef50_Q0S063 RNA-directed DNA polymerase (Reverse transcripta...   151   3e-35
UniRef50_A7MS60 Putative uncharacterized protein n=21 Tax=Vibrio...   151   4e-35
UniRef50_C1BDP2 Putative RNA-directed DNA polymerase n=2 Tax=Rho...   151   4e-35
UniRef50_UPI0001C42A66 RNA-directed DNA polymerase (Reverse tran...   151   4e-35
UniRef50_Q74P60 Group II intron reverse transcriptase/maturase n...   151   4e-35
UniRef50_B8I7I5 RNA-directed DNA polymerase (Reverse transcripta...   151   4e-35
UniRef50_Q9G8T2 Orf762 n=2 Tax=Eukaryota RepID=Q9G8T2_RHDSA           150   5e-35
UniRef50_C6MRB5 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...   150   7e-35
UniRef50_Q7UY81 Reverse transcriptase/maturase n=1 Tax=Rhodopire...   150   7e-35
UniRef50_A2TD24 Intron encoded protein n=2 Tax=Bacillaceae RepID...   150   9e-35
UniRef50_B1I9Z1 GBSi1, group II intron, maturase n=33 Tax=Firmic...   149   2e-34
UniRef50_A1ZX33 Group II intron-encoded protein LtrA n=2 Tax=Bac...   149   2e-34
UniRef50_Q8HQ89 ORF777 (Fragment) n=1 Tax=Schizosaccharomyces oc...   148   2e-34
UniRef50_B7CEC9 Putative uncharacterized protein n=1 Tax=Eubacte...   148   3e-34
UniRef50_B7C9E4 Putative uncharacterized protein n=1 Tax=Eubacte...   148   3e-34
UniRef50_C4ZES6 RNA-directed DNA polymerase n=27 Tax=Bacteria Re...   147   7e-34
UniRef50_B0K6R3 RNA-directed DNA polymerase (Reverse transcripta...   147   8e-34
UniRef50_C3KST3 Group II intron reverse transcriptase/maturase n...   146   1e-33
UniRef50_Q93PB4 MS117, putative maturase n=1 Tax=Microscilla sp....   146   1e-33
UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=...   145   2e-33
UniRef50_B4WW73 Group II intron, maturase-specific domain family...   145   2e-33
UniRef50_C7V8C7 Reverse transcriptase n=1 Tax=Enterococcus faeca...   145   2e-33
UniRef50_B7KM76 RNA-directed DNA polymerase (Reverse transcripta...   145   3e-33
UniRef50_B7HM08 Group II intron reverse transcriptase/maturase n...   145   3e-33
UniRef50_B2JXR4 RNA-directed DNA polymerase n=10 Tax=Bacteria Re...   143   8e-33
UniRef50_C9BNF1 Group II intron reverse transcriptase/maturase n...   143   9e-33
UniRef50_Q0AW97 RNA-directed DNA polymerase (Reverse transcripta...   143   1e-32
UniRef50_B0URY2 RNA-directed DNA polymerase n=25 Tax=cellular or...   142   1e-32
UniRef50_C1L365 Group II intron-encoded protein n=1 Tax=Bacillus...   142   1e-32
UniRef50_B3PDY2 Putative maturase n=1 Tax=Cellvibrio japonicus U...   142   2e-32
UniRef50_P03875 Putative COX1/OXI3 intron 1 protein n=3 Tax=Fung...   142   2e-32
UniRef50_A4C8M3 RNA-directed DNA polymerase (Reverse transcripta...   142   2e-32
UniRef50_Q10VN2 RNA-directed DNA polymerase (Reverse transcripta...   142   3e-32
UniRef50_A7UDN1 Putative reverse transcriptase n=2 Tax=Candida z...   142   3e-32
UniRef50_A0RHJ0 Reverse transcriptase/endonuclease protein n=6 T...   141   4e-32
UniRef50_A6P1G1 Putative uncharacterized protein n=1 Tax=Bactero...   140   7e-32
UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepI...   140   1e-31
UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcripta...   139   1e-31
UniRef50_Q47DU4 RNA-directed DNA polymerase (Reverse transcripta...   139   2e-31
UniRef50_P05511 Uncharacterized 91 kDa protein in cob intron n=1...   139   2e-31
UniRef50_Q35064 Atp9 intron ORF n=1 Tax=Marchantia polymorpha Re...   139   2e-31
UniRef50_Q119U8 RNA-directed DNA polymerase n=30 Tax=Bacteria Re...   139   2e-31
UniRef50_C6MXE9 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...   138   3e-31
UniRef50_B0I1N8 Reverse transcriptase homolog n=2 Tax=Pylaiella ...   138   3e-31
UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0...   138   4e-31
UniRef50_B3GTB4 Putative reverse-transcriptase protein n=1 Tax=V...   137   5e-31
UniRef50_Q6TFE1 Putative group II intron-encoded maturase n=1 Ta...   137   6e-31
UniRef50_C5ER86 RNA-directed DNA polymerase n=1 Tax=Clostridiale...   136   1e-30
UniRef50_C5D9G3 RNA-directed DNA polymerase (Reverse transcripta...   136   2e-30
UniRef50_D2LU13 RNA-directed DNA polymerase (Reverse transcripta...   135   2e-30
UniRef50_Q94Z24 Orf568 n=1 Tax=Pylaiella littoralis RepID=Q94Z24...   135   2e-30
UniRef50_Q6EI10 Reverse transcriptase/HNH endonuclease n=2 Tax=E...   135   3e-30
UniRef50_A8ZN56 RNA-directed DNA polymerase n=2 Tax=Cyanobacteri...   134   4e-30
UniRef50_Q3B1V7 RNA-directed DNA polymerase n=31 Tax=Bacteria Re...   134   4e-30
UniRef50_C6MS68 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...   134   6e-30
UniRef50_C5ZZQ5 Putative uncharacterized protein n=1 Tax=Escheri...   133   1e-29
UniRef50_C6I8L1 CRISPR-associated protein n=1 Tax=Bacteroides sp...   132   3e-29
UniRef50_A9ENQ0 Integron/retron-type RNA-directed DNA polymerase...   131   3e-29
UniRef50_D0VMZ3 Putative reverse-transcriptase protein n=1 Tax=V...   131   5e-29
UniRef50_Q7GEU5 Putative uncharacterized protein (Fragment) n=3 ...   130   6e-29
UniRef50_A5IEI2 Reverse transcriptase n=7 Tax=Bacteria RepID=A5I...   130   7e-29
UniRef50_Q1D1V6 Group II intron, maturase n=2 Tax=Myxococcales R...   130   9e-29
UniRef50_Q1Q3I7 Putative uncharacterized protein n=1 Tax=Candida...   130   1e-28
UniRef50_UPI00016C4F75 RNA-directed DNA polymerase (Reverse tran...   129   1e-28
UniRef50_Q94Z25 Orf557 n=2 Tax=Pylaiella littoralis RepID=Q94Z25...   129   1e-28
UniRef50_O99479 Reverse transcriptase homolog n=2 Tax=Eukaryota ...   129   1e-28
UniRef50_Q8TJY1 Reverse transcriptase n=5 Tax=Methanosarcina Rep...   129   2e-28
UniRef50_Q8A4I4 Reverse transcriptase n=1 Tax=Bacteroides thetai...   129   3e-28
UniRef50_Q2FUJ3 RNA-directed DNA polymerase n=1 Tax=Methanospiri...   128   4e-28
UniRef50_B3JM52 Putative uncharacterized protein n=1 Tax=Bactero...   128   4e-28
UniRef50_A6DE66 Putative uncharacterized protein n=1 Tax=Caminib...   127   5e-28
UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular or...   127   8e-28
UniRef50_Q67M30 Group II intron-encoding maturase n=1 Tax=Symbio...   125   2e-27
UniRef50_A8LGE6 RNA-directed DNA polymerase n=1 Tax=Frankia sp. ...   124   6e-27
UniRef50_Q8RSV8 Maturase n=1 Tax=uncultured marine bacterium Rep...   122   2e-26
UniRef50_P19593 Probable reverse transcriptase n=2 Tax=Scenedesm...   122   3e-26
UniRef50_Q47277 Orf protein n=63 Tax=cellular organisms RepID=Q4...   121   4e-26
UniRef50_B4WV39 Group II intron, maturase-specific domain family...   121   5e-26
UniRef50_Q8GAR1 Reverse transcriptase n=20 Tax=Enterobacteriacea...   119   1e-25
UniRef50_D1RME6 Reverse transcriptase family protein n=1 Tax=Leg...   119   1e-25
UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8Y...   119   2e-25
UniRef50_B6FPD0 Putative uncharacterized protein n=1 Tax=Clostri...   119   3e-25
UniRef50_B1I2T7 Retron-type reverse transcriptase-like protein n...   118   4e-25
UniRef50_Q8YWX6 Alr1468 protein n=4 Tax=Cyanobacteria RepID=Q8YW...   118   4e-25
UniRef50_A6YE98 Putative reverse transcriptase and intron matura...   118   4e-25
UniRef50_C8PKY6 Putative CRISPR-associated protein Cas1 n=1 Tax=...   117   7e-25
UniRef50_D2CMF7 Reverse transcriptase n=1 Tax=Agaricus bisporus ...   117   7e-25
UniRef50_Q9G8T4 Orf621 n=1 Tax=Rhodomonas salina RepID=Q9G8T4_RHDSA   116   1e-24
UniRef50_D2R8Z2 CRISPR-associated protein Cas1 n=1 Tax=Pirellula...   114   5e-24
UniRef50_C0JWS6 Putative reverse transcriptase and intron matura...   114   9e-24
UniRef50_B4UZZ3 RNA-directed DNA polymerase n=1 Tax=Streptomyces...   113   1e-23
UniRef50_D2LF37 RNA-directed DNA polymerase (Reverse transcripta...   112   1e-23
UniRef50_A1BI39 CRISPR-associated protein Cas1 n=5 Tax=Chlorobia...   112   2e-23
UniRef50_A1KEP0 Possible maturase n=10 Tax=Mycobacterium tubercu...   112   2e-23
UniRef50_B3CUR8 Reverse transcriptase n=24 Tax=Orientia tsutsuga...   112   2e-23
UniRef50_Q8HQ84 ORF786 n=1 Tax=Schizosaccharomyces octosporus Re...   110   8e-23
UniRef50_Q9MR93 Reverse transcriptase homologue ND5 i4 grp II pr...   108   2e-22
UniRef50_C6PFC6 RNA-directed DNA polymerase (Reverse transcripta...   108   3e-22
UniRef50_Q7YAJ6 Putative reverse transcriptase and intron matura...   108   3e-22
UniRef50_Q0QIN8 Putative reverse transcriptase n=1 Tax=Oltmannsi...   107   8e-22
UniRef50_C1DF40 Group II intron-encoding maturase n=1 Tax=Azotob...   106   2e-21
UniRef50_B8R160 Reverse transcriptase n=2 Tax=Volvox carteri Rep...   105   2e-21
UniRef50_D2MKC4 RNA-directed DNA polymerase (Reverse transcripta...   105   2e-21
UniRef50_B9VL91 Putative maturase/reverse transcriptase n=1 Tax=...   105   4e-21
UniRef50_P38456 Uncharacterized mitochondrial protein ymf11 n=1 ...   103   1e-20
UniRef50_Q7M1J5 Reverse transcription like protein 2, intron-enc...   102   2e-20
UniRef50_UPI0001C15D3C hypothetical protein CRC_00192 n=2 Tax=No...   102   2e-20
UniRef50_A8KXN1 RNA-directed DNA polymerase (Reverse transcripta...   101   4e-20
UniRef50_A7NMI6 RNA-directed DNA polymerase (Reverse transcripta...   101   4e-20
UniRef50_C0FSR2 Putative uncharacterized protein n=1 Tax=Rosebur...   101   4e-20
UniRef50_Q4C6J4 RNA-directed DNA polymerase (Reverse transcripta...   100   6e-20
UniRef50_C0A8Z3 RNA-directed DNA polymerase (Reverse transcripta...   100   1e-19
UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU    99   2e-19
UniRef50_B0VI85 RNA-directed DNA polymerase (Reverse transcripta...    99   2e-19
UniRef50_Q8TIC7 Reverse transcriptase n=1 Tax=Methanosarcina ace...    99   3e-19
UniRef50_Q5P2A1 Reverse transcriptase/retron type n=2 Tax=Proteo...    98   4e-19
UniRef50_C3AUZ6 Reverse transcriptase n=3 Tax=Firmicutes RepID=C...    97   1e-18
UniRef50_Q9FJR9 Similarity to maturase-related protein n=4 Tax=M...    96   2e-18
UniRef50_A5N448 Predicted reverse transcriptase/maturase family ...    96   3e-18
UniRef50_Q4E9Y9 Reverse transcriptase n=1 Tax=Wolbachia endosymb...    94   7e-18
UniRef50_D1I8B4 Whole genome shotgun sequence of line PN40024, s...    94   8e-18
UniRef50_Q7YAJ4 Putative reverse transcriptase and intron matura...    94   1e-17
UniRef50_A9AYP7 RNA-directed DNA polymerase (Reverse transcripta...    93   1e-17
UniRef50_B3JKP4 Putative uncharacterized protein n=1 Tax=Bactero...    91   7e-17
UniRef50_Q35063 CoxI intron1 ORF n=2 Tax=Eukaryota RepID=Q35063_...    90   2e-16
UniRef50_B6IMH7 Phage-encoded reverse transcriptase, putative n=...    89   2e-16
UniRef50_Q7MTH7 CRISPR-associated protein Cas1 n=3 Tax=Porphyrom...    89   2e-16
UniRef50_C8VZI0 RNA-directed DNA polymerase (Reverse transcripta...    89   2e-16
UniRef50_Q4FUJ8 Possible RNA-directed DNA polymerase (Reverse tr...    89   3e-16
UniRef50_A4BSH6 Transposase n=1 Tax=Nitrococcus mobilis Nb-231 R...    88   4e-16
UniRef50_UPI0001C388AF RNA-directed DNA polymerase n=1 Tax=Arthr...    86   3e-15
UniRef50_UPI00016C45C3 RNA-directed DNA polymerase (Reverse tran...    85   3e-15

>UniRef50_Q47688 Putative uncharacterized protein ykfC n=18 Tax=Proteobacteria RepID=YKFC_ECOLI
          Length = 376

 Score =  772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 376/376 (100%), Positives = 376/376 (100%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL
Sbjct: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES
Sbjct: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180
           DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR
Sbjct: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180

Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240
           RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS
Sbjct: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240

Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300
           GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR
Sbjct: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300

Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360
           GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL
Sbjct: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360

Query: 361 TALLWKVRISGEILLG 376
           TALLWKVRISGEILLG
Sbjct: 361 TALLWKVRISGEILLG 376


>UniRef50_B0EYP4 Reverse transcriptase-like protein n=48 Tax=cellular organisms RepID=B0EYP4_ECOLX
          Length = 507

 Score =  744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 361/364 (99%), Positives = 362/364 (99%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL
Sbjct: 6   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 65

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES
Sbjct: 66  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 125

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180
           DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR
Sbjct: 126 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 185

Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240
           RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS
Sbjct: 186 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 245

Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300
           GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ EAIREECR
Sbjct: 246 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQAEAIREECR 305

Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360
           GVLEGSLKLRLNMDKTKI HVNDGFIFLGHR+IRKRSRYGEMRVVSTIPQEKARNFAASL
Sbjct: 306 GVLEGSLKLRLNMDKTKITHVNDGFIFLGHRIIRKRSRYGEMRVVSTIPQEKARNFAASL 365

Query: 361 TALL 364
           TALL
Sbjct: 366 TALL 369


>UniRef50_B9K440 18S rRNA intron 1 protein n=1 Tax=Agrobacterium vitis S4 RepID=B9K440_AGRVS
          Length = 257

 Score =  287 bits (735), Expect = 4e-76, Method: Compositional matrix adjust.
 Identities = 154/245 (62%), Positives = 183/245 (74%), Gaps = 7/245 (2%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQ KLATWA +DP+ R RLLRLI EWLAE AR+ L+S GA TPG+DG++K LQ +L
Sbjct: 6   MQHKLATWAESDPNRRFDRLLRLIANREWLAETARMVLASSGARTPGIDGMDKQRLQVKL 65

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           L LR LL Y+P P +R+YIPK NGKLRPL IP L DRIVQRAMLMAM PIWES
Sbjct: 66  DQHLDDLRTSLLEESYRPQPVKRIYIPKPNGKLRPLDIPTLTDRIVQRAMLMAMGPIWES 125

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVR 179
           DFH LSYGFR ER+VHHA+RTV++QL D + TRGRW+IEGDL+SYFDTVHHRLL++ VR
Sbjct: 126 DFHRLSYGFRSERNVHHAVRTVRIQLQDGADTTRGRWIIEGDLASYFDTVHHRLLLRCVR 185

Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ--YLHER 237
           RR+ D RF+ LLW+ +KAGHID GLF A+SEGVPQGG L S +DQ +H
Sbjct: 186 RRVQDGRFVDLLWRFLKAGHIDRGLFTASSEGVPQGG----LWSADHKPPYDQRCKMHSA 241

Query: 238 YLSGK 242
           +L G+
Sbjct: 242 WLQGR 246


>UniRef50_C3B599 Group II intron-encoded protein LtrA n=3 Tax=cellular organisms RepID=C3B599_BACMY
          Length = 623

 Score =  212 bits (540), Expect = 1e-53, Method: Compositional matrix adjust.
 Identities = 125/392 (31%), Positives = 196/392 (50%), Gaps = 57/392 (14%)

Query: 10  ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69
           + + + + RL R + P + +A S++G T G DG N + ++ IL +
Sbjct: 15  SNNKNYKYNRLYRNLYNPAFYLKAYTNISSNQGNMTKGTDGKN---IDGFSLEKINILIE 71

Query: 70  ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129
           L YQP P++RV+IPK NG RPLGIP +D+++Q + M +E I+E F S+GF
Sbjct: 72  SLKDESYQPHPSKRVFIPKKNGSKRPLGIPTFKDKLLQEVIRMILEAIYEMSFKESSHGF 131

Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189
           RP+RS H A+ V+ T +W +EGD+S +FD + H L+ +RRRI+D +F+
Sbjct: 132 RPKRSCHTALHKVRKTFTGV-----KWFVEGDISGFFDNIDHHTLIALLRRRITDEKFIR 186

Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE------------- 236
           L+WK ++AG+++ +R + G PQGG+ISPLLSN+ L E D ++ E
Sbjct: 187 LIWKFLRAGYLEEWKYRGSYSGTPQGGIISPLLSNVYLTELDTFMEEYQKEFSKGAKRKT 246

Query: 237 -------RYLSGKARKD-RWYWNNSIQRGRSTAV------RENWQWKPA----------V 272
           YL+ K RK + W ++ + + +E P +
Sbjct: 247 TKAYKRQEYLTYKHRKQLKENWGQLTEQEKKQGIIKYKSLKEELLKTPFGDPMDDSYKRI 306

Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332
           Y RYADDF++ + G KA ++E+ L LKL L+ +KT I H + FLG+ +
Sbjct: 307 QYVRYADDFLVGIIGNKADAVKVKEDLTNFLRDKLKLELSQEKTLITHSSKKARFLGYNI 366

Query: 333 IRKR------------SRYGEMRVVSTIPQEK 352
           R +R+ MR +P EK
Sbjct: 367 TVSRMPVTKRDKNGFLTRHQIMRCKLYLPSEK 398


>UniRef50_A9B955 RNA-directed DNA polymerase n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B955_HERA2
          Length = 587

 Score =  211 bits (538), Expect = 3e-53, Method: Compositional matrix adjust.
 Identities = 116/299 (38%), Positives = 174/299 (58%), Gaps = 29/299 (9%)

Query: 38  LSSKGAHTPGVDGVNKTMLQARLAVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRP 95
           LS+ GA T G+DG+ K + + +Q + +L + Y+P P RRVYIPK+NG+ RP
Sbjct: 55  LSNTGARTAGIDGMTKKHIATDTEQQALVQEIWHDLTTHQYRPAPVRRVYIPKANGQQRP 114

Query: 96  LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG- 154
           LGIP ++DR+VQ + + ++PI+ES F+ SYGFRP R+ HHA+ ++L D RG
Sbjct: 115 LGIPTIKDRVVQEMVRLILDPIYESTFYRHSYGFRPYRATHHAV----VRLRDLIGRRGY 170

Query: 155 RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQ 214
           + +EGD+ + FD +HH L++ +RR I D R +T++ + +KAG +D G +R +G PQ
Sbjct: 171 QMALEGDIRACFDRIHHTTLIRILRRTIKDERLITVIHQMLKAGVMDDGQWRVTEDGTPQ 230

Query: 215 GGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAY 274
           GG++SPLL+NI LNE DQ++ +RW ++R + K Y
Sbjct: 231 GGIVSPLLANIYLNELDQWV----------ANRWDTYTPLER--------YYHRKAGTGY 272

Query: 275 ----CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329
           RYADDFV+++ GT A+ ++ L L L L+ +KT I V GF FLG
Sbjct: 273 PCQITRYADDFVVLLHGTHAEATTLKTALATFLADHLHLELSAEKTLITPVEQGFDFLG 331


>UniRef50_D2QVU8 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QVU8_9SPHI
          Length = 507

 Score =  211 bits (536), Expect = 5e-53, Method: Compositional matrix adjust.
 Identities = 142/386 (36%), Positives = 207/386 (53%), Gaps = 34/386 (8%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           +QRKL W+ T P+ + L +T L EA R +KG TPG+DG+ ++ R+
Sbjct: 15  VQRKLYQWSQTHPTEAYRELWNWLTDLRNLREAWRRVAQNKGKRTPGIDGMTVGSIRQRI 74

Query: 61  --AVELQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRDRIVQRAMLMAMEP 116
           A L L+ +L +G Y+P P RR IPK+ GK RPLGIP + DR+VQ A+ +EP
Sbjct: 75  GEAPFLATLQQQLRTGSYKPSPCRRKLIPKAGKPGKFRPLGIPTIADRVVQSAIKQVLEP 134

Query: 117 IWESDFHTLSYGFRPERSVHHAI---------RTVKLQLTDCGETRGRWVIEGDLSSYFD 167
           I E+ F +SYGFRP R H A+ R V Q E +WVIEGD+ S FD
Sbjct: 135 ILEARFWPVSYGFRPGRGCHGALEHIRMSMRPRKVNKQDNKRHEMPYQWVIEGDIQSCFD 194

Query: 168 TVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIML 227
           + H LM +R+ +D R LL + +KAG + F G PQGG++SPLL+N+ L
Sbjct: 195 HIDHHQLMDRIRQHSADRRVNQLLVQFLKAGILSEEQFLRTDAGTPQGGIVSPLLANVAL 254

Query: 228 NEFDQYLHERYLS--GKARKDRW-------YWNNSIQRGRSTAVRENWQWKPAVAYCRYA 278
           ++ +ER+++ K R+ R W+ S+ R AV RYA
Sbjct: 255 GLIEER-YERWVNHQTKRRQSRQCDGIKAAMWSRSVDRQAGRAV---------YFPFRYA 304

Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL-IRKRS 337
           DDFV++V GT+ +A R+ + +L+ + L L+ +KTKI + +GF FLGHR+ +R
Sbjct: 305 DDFVILVSGTQENAQAERKVLQTLLQEKMGLTLSPEKTKITPLTEGFQFLGHRVSMRWDY 364

Query: 338 RYGEMRVVSTIPQEKARNFAASLTAL 363
           RYG + IP++KA + + L
Sbjct: 365 RYGWTPRLE-IPKQKAADLRYRIKQL 389


>UniRef50_A7VUZ6 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VUZ6_9CLOT
          Length = 605

 Score =  209 bits (532), Expect = 1e-52, Method: Compositional matrix adjust.
 Identities = 132/372 (35%), Positives = 202/372 (54%), Gaps = 34/372 (9%)

Query: 5   LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64
           L + D S R R+ R + ++ A + + +G TPG DG T + ++
Sbjct: 16  LKQKSKQDESHRYDRIYRNLFNEDFFLRAYQKIHAKQGNMTPGTDG---TTIDGFSRKQI 72

Query: 65  QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124
           L + L YQP P RR YIPK NGK+RPLGIPA D++VQ + +E I+E F
Sbjct: 73  SQLIELLKWERYQPKPVRRTYIPKKNGKMRPLGIPAFADKLVQEVVRQILEAIYEPIFSD 132

Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184
           S+GFRP RS H A+ +K + C T WVIEGD++ FD + H +L+K + ++I D
Sbjct: 133 NSHGFRPNRSCHTALYQIK---STCRGTN--WVIEGDITGCFDHIDHEILLKILLKKIDD 187

Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSG 241
           RF+ L+WK +KAG+++ + G PQGG+ISP+L+NI L+EFD+++ Y G
Sbjct: 188 GRFLELIWKFLKAGYLEFNQKYNSLSGTPQGGIISPILANIYLHEFDKFMEGISAEYTKG 247

Query: 242 KARKD-------RWYWNNSIQRGRSTAVRE---NWQWKPA----------VAYCRYADDF 281
           K R+ ++ N + ++G E Q PA V Y RYADDF
Sbjct: 248 KQRRPYREYQILQYKRNRAKKKGNQEQADEYLRQMQNIPALDPMDKNYQRVKYVRYADDF 307

Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYG 340
           V+ + G+KA I+E G ++ L L L+ +KTKI +++D + FLG+ + +S+
Sbjct: 308 VVCIIGSKATANEIKERIAGFMQKELHLELSREKTKITNLSDKRVRFLGYEIT--KSQEN 365

Query: 341 EMRVVSTIPQEK 352
           +VV +I ++K
Sbjct: 366 TKQVVDSIGRKK 377


>UniRef50_A4XCB1 RNA-directed DNA polymerase n=7 Tax=Actinomycetales RepID=A4XCB1_SALTO
          Length = 488

 Score =  207 bits (527), Expect = 6e-52, Method: Compositional matrix adjust.
 Identities = 135/369 (36%), Positives = 199/369 (53%), Gaps = 22/369 (5%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQ KL WAA P R L L+ P L A + GA T GVDG+ ++ +
Sbjct: 25  MQAKLHRWAAAGPGRRFDDLFNLVHDPATLLVAYSRVAGNLGARTAGVDGMTVADVERHI 84

Query: 61  AVE--LQILRDELLSGHYQPLPARRVYIPK--SNGKLRPLGIPALRDRIVQRAMLMAMEP 116
           V L LR ++ +G ++PLP R IPK +GK+R LGIP + DR+VQ A+ + +EP
Sbjct: 85  GVPGFLDDLRVQVKTGTFRPLPVRERKIPKPGGSGKVRRLGIPTVADRVVQAALKLVLEP 144

Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176
           I+E+DF +SYGFRP+R AI + G RWV++ D+ + FD++ H LM
Sbjct: 145 IFEADFLPVSYGFRPKRRAQDAIAEIHY----YGTHGYRWVLDADIEACFDSIDHVALMD 200

Query: 177 AVRRRISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235
           VR RI D R +TL+ +KAG + ++G R G PQGG++SPLL+NI L D++L
Sbjct: 201 RVRTRIKDKRVLTLVKAFLKAGILTELGDRRDTHTGTPQGGILSPLLANIALTVLDEHLM 260

Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295
           + + +RG +T W+ RYADDFV++V G + +A+
Sbjct: 261 AGWRPDATMASEYRRAQLRKRGEAT-------WR----LVRYADDFVVLVHGGEDHAQAL 309

Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR-YGEMRVVSTIPQEKAR 354
           RE+ +L L LRL+ KT++ H++DGF FLG + +R R + V + I + R
Sbjct: 310 REDVATML-APLGLRLSPAKTRVVHLSDGFDFLGFHIQWRRKRGTNKWHVYTFIAKRPIR 368

Query: 355 NFAASLTAL 363
           + A + AL
Sbjct: 369 SLKAKVRAL 377


>UniRef50_C3QDF8 RNA-directed DNA polymerase n=9 Tax=Bacteroidales RepID=C3QDF8_9BACE
          Length = 606

 Score =  206 bits (524), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/403 (32%), Positives = 203/403 (50%), Gaps = 64/403 (15%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDG--VNKTMLQARLAVELQILRDELLS 73 + +RL R++ E A + + G TPG DG +N+ LQ R+ + LRDE Sbjct: 25 KFERLYRILFNEEMFHVAYQRIYAKPGNMTPGTDGKTINRMSLQ-RINKVIASLRDE--- 80 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y+P PA+R +IPK NGK RPLGIP+ D++VQ + M +E I+E F S+GFRP R Sbjct: 81 -SYKPNPAKRTHIPKKNGKKRPLGIPSFEDKLVQEVVRMILEAIYEEVFANTSHGFRPNR 139 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S H A+ ++ T +W +EGD+ +FD + H +L+ +R+RI D RF+ L+ K Sbjct: 140 SCHTALTHIQKTFTGT-----KWFVEGDIKGFFDNIDHNVLIATLRKRIDDNRFLRLIRK 194 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDRWYW 250 + AG+I+ F ++G PQGG ISP+L+NI L+ FD+Y+ E R+ GK R + Sbjct: 195 LLNAGYIEDWRFHNTNKGTPQGGNISPILANIYLDNFDKYMEEYALRFNKGKERHITKEY 254 Query: 251 ---------------------------NNSIQRGRSTAVRENWQWKPAVA-------YCR 276 + ++ GR R+ + + ++ Y R Sbjct: 255 KQLSGKMQGILKSIKNIKDADARLQLRDEYVKLGRE---RQKIESRDSMDETYRRFRYVR 311 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR 336 YADDF++ V G+KA I+ + +E +LKL L+ +KT I + FLG + ++ Sbjct: 312 YADDFLIGVIGSKADCVKIKSDITNYMEENLKLELSQEKTLITNAQTPAKFLGFEVSVRK 371 Query: 337 S------------RYGEMRVVSTIPQEKARNFAASLTALLWKV 367 S RY ++V + E RN +A+ +KV Sbjct: 372 SDVVKRNKNNVSARYYNGKIVLKVAIETVRNKLEEYSAIRYKV 414 >UniRef50_Q02718 Reverse transcriptase homologue COI iA grp II protein (Fragment) n=2 Tax=Fungi RepID=Q02718_PODAN Length = 790 Score = 206 bits (523), Expect = 1e-51, Method: Compositional matrix adjust. 
Identities = 130/360 (36%), Positives = 193/360 (53%), Gaps = 41/360 (11%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 + L +LI + L +A R S+ G TP +D + + L+ L EL S Sbjct: 193 KFVNLYQLICSKDLLIQAYRNVRSNPGGMTPSIDNITYDGINDEF---LEKLILELKSER 249 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 ++ +RVYIPK+NGK RPLGIP +D+IVQ AM + +E I+E F +S+GFRP+RS Sbjct: 250 FKFTSVKRVYIPKANGKTRPLGIPTSKDKIVQEAMKILLELIYEPIFLDVSHGFRPKRSC 309 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A+ Q++ T W++EGD+ +F+ V H++L+K + ++I D RF LLWK Sbjct: 310 HTALH----QISKWNGTT--WMLEGDIKGFFNEVDHQVLIKILEKKIKDQRFFDLLWKLF 363 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY-WNNSI 254 +AG+ID G+ GVPQGGVISP+LSNI L+EFD ++ + KD N I Sbjct: 364 RAGYIDDGVKYNTYTGVPQGGVISPVLSNIYLHEFDLFVETLIKKYSSEKDFISKVNPVI 423 Query: 255 QRGRSTAVRENWQWKPA----------------------------VAYCRYADDFVLIVK 286 + S R N +++ V Y RYADD+V+ + Sbjct: 424 VKYSSKLSRLNDEYQTTKDKEILKEIIKLRAERNKLPSRIRNGIRVRYTRYADDWVIGII 483 Query: 287 GTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGEMRVV 345 G + V I+EEC+ L LKL L+ +KTKI ++ + + FLG + RK S GE +++ Sbjct: 484 GDQELVAKIKEECKAFLRDILKLELSEEKTKITNITEKEVRFLGVDIKRKDS--GESKII 541 >UniRef50_Q9T654 Cox1I1a maturase (Fragment) n=2 Tax=cellular organisms RepID=Q9T654_SCHPO Length = 787 Score = 203 bits (517), Expect = 7e-51, Method: Compositional matrix adjust. 
Identities = 127/386 (32%), Positives = 204/386 (52%), Gaps = 54/386 (13%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPAR 82 L+ + ++A +I+ S+KG+ T G+D K L E++ ++L +Q PAR Sbjct: 203 LLNEDLFIAAYQKIS-SNKGSVTAGID---KITLDGYSINEIKKTIEQLKDHSFQFKPAR 258 Query: 83 RVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTV 142 R YIPK+NGKLRPLGIP+ RD+IVQ+ M+ +E I+E F S GFRP + H A++++ Sbjct: 259 REYIPKANGKLRPLGIPSPRDKIVQQVMVFVLESIFEQKFLDCSNGFRPNKGTHTALKSI 318 Query: 143 KLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV 202 G WVIEGD+ SYFD + H+ L+ + I+D F+ L WK I+AG+++V Sbjct: 319 ------AGWKALDWVIEGDIKSYFDLIDHQTLISLLSNVINDKEFIDLCWKAIRAGYVEV 372 Query: 203 GLFRAASE--GVPQGGVISPLLSNIMLNEFDQYLHER----------------------- 237 + + G PQG V+SP+L+NI L+EFD+++ E+ Sbjct: 373 KMNKKIDTIIGTPQGSVLSPILANIYLHEFDKFMMEKVNLSLDSGSTSKRFKPYRLLEAK 432 Query: 238 --YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA--------VAYCRYADDFVLIVKG 287 Y+ RK+ N ++ + + N P+ + Y RYADDF++ + G Sbjct: 433 INYIYQLERKNGSLTNEQVKSLKKLTIERNKL--PSTIGGPGYRIYYVRYADDFLIGING 490 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVS 346 + ++ E L +LKL ++++KTK+ ++ D + +FLG + R SR +V+S Sbjct: 491 KRTLALQLKSEINEFLTNTLKLTMSVEKTKVTNIKDDYALFLGAEIHRLTSRNNNSKVIS 550 Query: 347 TIPQEKARNF----AASLTALLWKVR 368 Q + NF A S T+LL V+ Sbjct: 551 --KQYSSGNFNSRVANSRTSLLIPVK 574 >UniRef50_B0I1N9 Reverse transcriptase homolog n=7 Tax=cellular organisms RepID=B0I1N9_PYLLI Length = 749 Score = 203 bits (517), Expect = 7e-51, Method: Compositional matrix adjust. 
Identities = 134/361 (37%), Positives = 201/361 (55%), Gaps = 31/361 (8%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSK-GAHTPGVDGVNKTMLQARLAVELQILRDE 70 +P+ +R+L L++ + L EAA I + SK G T GVDG KT+ + L+ L + Sbjct: 182 NPNHLNERILSLVSSYDML-EAAYIKIKSKPGNMTKGVDG--KTLDGVNVG-WLKSLSRD 237 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + SG Y+PLP+RRV IPK G RPLGIP+ RD+IVQ ++ ++ I+E F S+GFR Sbjct: 238 VGSGSYKPLPSRRVMIPKPQGGERPLGIPSPRDKIVQESIRTVLQAIYEPSFIACSHGFR 297 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P RS H A++ KL + W IEGD+ FD++ HR+L + RRI D FM L Sbjct: 298 PGRSCHTALKEAKLTFANT-----TWFIEGDIEKCFDSIDHRVLSTLLERRIKDKGFMDL 352 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERYLSGKARKDR 247 WK +K G++ +G + +G PQG V+SPLLSNI L+E D+++ E + G RK Sbjct: 353 YWKMVKVGYMSLGKINQSDKGTPQGSVVSPLLSNIYLHELDKWMTRKKESFDKGTRRKAN 412 Query: 248 WYWNNSIQ--RGRSTAVRENWQWKPAVA-------YCRYADDFVLIVKGTKAQVEAIREE 298 + ++ G S A R N + Y RYA DF++ + G+K + +E Sbjct: 413 PVYTKYVRVAGGASAARRLNIPSADPLDPNFKRLRYVRYAGDFLIGIIGSKTDGICLIKE 472 Query: 299 CRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLGH--RLIR------KRSRYGEMRVVSTIP 349 + L LKL LN+ KTK+ H +++ FLG ++I K+++ G++ VS+ P Sbjct: 473 LKEFLHDILKLDLNLTKTKLTHTMSEKAYFLGTWVKIIPVSGFQIKKNKSGKITRVSSRP 532 Query: 350 Q 350 Q Sbjct: 533 Q 533 >UniRef50_B9K434 Reverse transcriptase-like protein n=9 Tax=Proteobacteria RepID=B9K434_AGRVS Length = 278 Score = 202 bits (514), Expect = 2e-50, Method: Compositional matrix adjust. 
Identities = 94/139 (67%), Positives = 110/139 (79%) Query: 226 MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 ML+EFD +L +YL+ KARKDRW WN IQ+GR VREN QWKPAVAYCRYADDFV+IV Sbjct: 1 MLHEFDMWLEAKYLNKKARKDRWAWNFGIQQGRPITVRENRQWKPAVAYCRYADDFVVIV 60 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVV 345 KGT+AQ E IREECR LEG LKL LNM+KT + HVNDGF+FLGHR+IRKR +G M +V Sbjct: 61 KGTRAQAEEIREECRTFLEGELKLTLNMEKTHVTHVNDGFVFLGHRIIRKRGTHGRMSIV 120 Query: 346 STIPQEKARNFAASLTALL 364 +TIP+EKA+ F LT L Sbjct: 121 TTIPKEKAKGFVRRLTETL 139 >UniRef50_Q56VE2 MatR n=6 Tax=Bacteria RepID=Q56VE2_BACFR Length = 599 Score = 200 bits (508), Expect = 8e-50, Method: Compositional matrix adjust. Identities = 122/363 (33%), Positives = 187/363 (51%), Gaps = 44/363 (12%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + +P+ + +RL RL+ A A ++ G T G DG + + + +Q + D Sbjct: 15 SQNPNYKFERLYRLLFNENLYALAYQMMSKKTGNMTKGTDGQTISGMSIK---RIQSIID 71 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 +L YQP PA+R+YIPK NGK RPLGIP+ D++VQ+ + M +E I+E F S+GF Sbjct: 72 KLRDESYQPHPAKRIYIPKKNGKQRPLGIPSFEDKLVQKVIQMILESIYEGSFEKCSHGF 131 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP R+ H A+ ++ G RW IEGD+ +FD + H +++ + RI+D RF+ Sbjct: 132 RPHRNCHTAMASIME-----GFDGTRWFIEGDIKGFFDNIDHDIMITILSERIADERFLR 186 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS----GKARK 245 L+ K + AG+++ F G PQGG+ISP+L+NI L++ D+Y+ E Y+S GK RK Sbjct: 187 LIRKFLNAGYLEKWKFHKTFSGTPQGGIISPILANIYLDQLDKYVVE-YISQFNRGKMRK 245 Query: 246 DRWYWNNSIQRG-------------------RSTAVR--ENWQWKPA----------VAY 274 + R RS V Q PA + Y Sbjct: 246 RNPEYKRIASRKDKRVKKLKTETDEQKRAALRSEIVELHREMQKHPATLDMDEDFRRMRY 305 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR 334 RYADDF++ + G+K I+ + + L LKL L+ +KT I H +D FLG + Sbjct: 306 VRYADDFLIGIIGSKDDCVNIKADIKRFLCEKLKLELSDEKTLITHGHDHAKFLGFEVTI 365 Query: 335 KRS 337 ++S Sbjct: 366 RKS 368 
>UniRef50_C0YLQ1 RNA-directed DNA polymerase n=40 Tax=Bacteria RepID=C0YLQ1_9FLAO Length = 624 Score = 196 bits (497), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 117/353 (33%), Positives = 186/353 (52%), Gaps = 44/353 (12%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDE 70 + + +RL +++ E A + S G T GVDG M +R+ + LR+E Sbjct: 37 NTDYKFERLYKVLFNEEMYFIAYQKIYSKVGNMTAGVDGKTIDGMSISRIERLIASLRNE 96 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 YQP P++R YIPK NGK RPLGIP+ D++VQ + M +E I+E F S+GFR Sbjct: 97 T----YQPNPSKRTYIPKKNGKKRPLGIPSFDDKLVQEVIRMILEAIYEGSFEHTSHGFR 152 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P RS H A+ +V+ T RW IEGD+ +FD ++H +L+ ++ RI+D RF+ L Sbjct: 153 PNRSCHTALLSVQQSFTAV-----RWFIEGDIKGFFDNINHEILIGILKERIADDRFIRL 207 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY----------------- 233 + K + AG+I+ ++ G PQGG++SP+L+NI L++ D+Y Sbjct: 208 IRKFLNAGYIEDWVYHKTYSGTPQGGIVSPILANIYLDKLDKYVKDYIKDFDKGKRTTAT 267 Query: 234 ----LHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK---PA----------VAYCR 276 LHE+ A+K + + +++ ++E Q + PA + Y R Sbjct: 268 RQYRLHEQRRYRLAKKLKCETDETVREQMIKDIKELRQERNKYPAYDKMDGSFRKLKYVR 327 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 YADDF++ V G+K + I+E+ + L+ LKL L+ +KT + + FLG Sbjct: 328 YADDFLIGVIGSKEDCKKIKEDIKVYLDEKLKLELSDEKTLVTNAKKPAKFLG 380 >UniRef50_Q1J4S7 Reverse transcriptase / RNA maturase / Endonuclease n=2 Tax=Bacteria RepID=Q1J4S7_STRPF Length = 625 Score = 195 bits (495), Expect = 3e-48, Method: Compositional matrix adjust. 
Identities = 117/378 (30%), Positives = 188/378 (49%), Gaps = 51/378 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 +RL R + + +A + ++ G T GVD N+T+ L +I+ D L Y Sbjct: 43 KRLYRNLYNIDLFLQAYQNIYANAGNMTKGVD--NQTISAMSLERINKII-DSLKDESYS 99 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P +RVYIPK NGKLRPLGIP++ D++VQ M + I++ F S+GFR RS H Sbjct: 100 PTPTKRVYIPKKNGKLRPLGIPSIGDKLVQEVCRMLLNSIYDESFEDTSHGFRDNRSCHT 159 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+R ++ + C +W +EGD+ +FD + H +++ + +RI D RF+ L+ K +K+ Sbjct: 160 ALRQIQNRFVRC-----KWFVEGDIKGFFDNIDHNIMIDILSKRIDDERFLRLIRKFLKS 214 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKARKD-------- 246 G+++ + G+PQG +ISP+LSNI L++FD+Y+ E + G RK Sbjct: 215 GYMEQNQYHNTYSGMPQGSIISPILSNIYLDKFDKYMQNYKESFDKGNKRKQNKEYKALY 274 Query: 247 --RWYWNNSIQRGRSTA------------------------VRENWQWKPAVAYCRYADD 280 R N + + + + EN++ + Y RYADD Sbjct: 275 DRRKRLENKLSKTTNKTEIDDIKSEIEEINKRYFNIPCLNPMDENFK---RIQYVRYADD 331 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYG 340 F++ + G+KA E ++++ ++ L L L+ +KT + D FLG + R G Sbjct: 332 FIIGIIGSKADAEMVKQDIGQFIKSELNLELSDEKTLVTKSTDRAKFLGFDI---RVTPG 388 Query: 341 EMRVVSTIPQEKARNFAA 358 T KARNF Sbjct: 389 SNHTKRTKAGIKARNFGG 406 >UniRef50_B1L2I7 Reverse transcriptase/endonuclease protein n=37 Tax=Firmicutes RepID=B1L2I7_CLOBM Length = 607 Score = 194 bits (494), Expect = 3e-48, Method: Compositional matrix adjust. 
Identities = 117/317 (36%), Positives = 177/317 (55%), Gaps = 30/317 (9%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 ++QRL+ L ++ L R+T +KG T G+DG R+ + L+D + + Sbjct: 52 KLQRLM-LRSKANLLISIKRVTQINKGKRTAGIDGFKVITEWDRIKL-FNSLKDYSIK-N 108 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 + PA+R YIPK NGKLRPLGIP ++DRI Q + A+EP WES F +++YGFRP+RS Sbjct: 109 IKSQPAKRTYIPKKNGKLRPLGIPIIKDRIYQNIVKNALEPQWESKFESIAYGFRPKRST 168 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H AI + L+L G R +W+ EGD FD ++H +M+ I+D +++ + Sbjct: 169 HDAIEQLYLKLRK-GSKR-QWIFEGDFKGCFDNLNHEYIMEC----INDFPAKEAVYRWL 222 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 KAG+ID +FR +EG PQGG+ISPLL+NI L+ ++ L +Y Sbjct: 223 KAGYIDNNVFRNTNEGTPQGGIISPLLANIALHGMEEELGVKY--------------QFT 268 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 + + +R+N ++ +YADDFV++ K TK + E + E L+ L L DK Sbjct: 269 KRQGYCLRDN-----SIGIVKYADDFVILCK-TKEEAETMYERLSPYLKKR-GLELAEDK 321 Query: 316 TKIPHVNDGFIFLGHRL 332 T I H++ GF FLG + Sbjct: 322 TGITHISKGFDFLGFNI 338 >UniRef50_Q2G689 RNA-directed DNA polymerase n=5 Tax=Bacteria RepID=Q2G689_NOVAD Length = 633 Score = 194 bits (493), Expect = 5e-48, Method: Compositional matrix adjust. 
Identities = 118/356 (33%), Positives = 189/356 (53%), Gaps = 47/356 (13%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 ++ L RL+ P A + +KGA TPGVDG +++ + + L +G Sbjct: 22 KVNGLYRLLKSPLLWEHAYQRIAPNKGAMTPGVDG---QTFDGFSPDKVRSIIERLANGT 78 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y+P PARRVYIPK+NG+ RPLG+P D++VQ + +E I+E F S+GFRP+RS Sbjct: 79 YRPQPARRVYIPKANGQKRPLGVPTTEDKLVQEVVRTILEQIYEPLFSRHSHGFRPKRSC 138 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A+ +++ T +W+I+ D+ +FD + H +L+ + +RI+D RF+ L+ + Sbjct: 139 HTALESIRAIWTGV-----KWLIDVDVVGFFDNIDHDVLVSLLEKRIADRRFVRLIRGLL 193 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH----------ERYLSGKARK 245 KAG+++ +F G PQGGV+SP+L+NI L+E D ++ +R S AR+ Sbjct: 194 KAGYVEDWVFHKTYSGTPQGGVVSPMLANIYLHELDMFMQAKMAGFDKGKQRSPSPDARR 253 Query: 246 DR---WYWNNSIQRGRST-----------------------AVRENWQWKPA---VAYCR 276 R Y ++ + R+ AV + + P + YCR Sbjct: 254 IRNRLSYVRRTVDQLRAKGRGDDPRVTSFLEEIGRLKAERLAVPASDAFDPNYRRLRYCR 313 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 YADDF++ V G+K++ I EE R L LKL ++ +K+ I +DG FLG+ + Sbjct: 314 YADDFIIGVTGSKSEARQIMEEVRTYLSDHLKLAVSAEKSGIHKASDGARFLGYEV 369 >UniRef50_B8FP60 RNA-directed DNA polymerase n=3 Tax=Firmicutes RepID=B8FP60_DESHD Length = 607 Score = 193 bits (490), Expect = 9e-48, Method: Compositional matrix adjust. 
Identities = 129/386 (33%), Positives = 197/386 (51%), Gaps = 57/386 (14%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 R +RL R + P++ A + ++ GA TPGVD +T L ++ L L + Sbjct: 21 RFERLYRNLYNPDFYLLAYQKLYANNGAMTPGVD---RTTLDGTGMERIESLIQSLKNRS 77 Query: 76 YQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 YQP PARR YIPK +GK RPLGI A D++VQ + M +E I+E F S+GFRP RS Sbjct: 78 YQPQPARRRYIPKKSGKGQRPLGIQAANDKLVQEVVRMLLESIYEPTFLDSSHGFRPNRS 137 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ ++Q + G +W +EGD+ +YFDT+ H +L+ +RRRI D F++L+WK Sbjct: 138 CHTAL--ARMQRSFNGV---KWFVEGDIKAYFDTIDHHMLVNILRRRIQDENFISLIWKF 192 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSG---------- 241 ++AG+++ + A G PQG SPLL+N+ L+E D Y+ E R+ G Sbjct: 193 LRAGYLEDWQYNATYSGSPQGSGASPLLANLYLHELDLYMEEYKQRFDKGNRRQAGKDYG 252 Query: 242 ---------KARKDRWYWNNSIQRGRST-----AVRENWQWKPA----------VAYCRY 277 K + DR + + + ++ A+R+ PA + Y RY Sbjct: 253 RAQRTHQYRKVKYDRLWLTLTDEEKKAAQREIRALRKRMLECPANDPMDGTYRRIQYVRY 312 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH------- 330 DDF++ + G+K E ++ + L LKL L+ +KT I H D FLG+ Sbjct: 313 CDDFLVGIIGSKTDAEKVKADICRFLSDKLKLTLSPEKTLITHGQDKARFLGYDIAVCQD 372 Query: 331 ----RLIRKRSRYGEMRVVSTIPQEK 352 R R +SR +V +P++K Sbjct: 373 NTTRRTSRGQSRVHSGKVKLYVPKDK 398 >UniRef50_A0L945 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L945_MAGSM Length = 433 Score = 193 bits (490), Expect = 1e-47, Method: Compositional matrix adjust. 
Identities = 132/360 (36%), Positives = 194/360 (53%), Gaps = 44/360 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQAR 59 ++RK+ A D S R L + +PE L A + +KGA PG+DGV + + ++ Sbjct: 11 LRRKIYRKAKIDKSWRFWGLYHHVCKPETLNTAYEMARKNKGA--PGIDGVTFEAIEESG 68 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L +R EL+SG Y+PL RR IPK +GK R LGIP++RDR+VQ A+ + +EPI+E Sbjct: 69 VEQFLGEVRKELVSGSYRPLKNRRKAIPKGDGKERVLGIPSIRDRVVQGALKLILEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF + SYG+RP+R H A+ V + + G+T+ VI+ DL SYFDTV H L ++ V Sbjct: 129 ADFQSGSYGYRPKRMAHQAVNRVAIAIAQ-GKTQ---VIDADLKSYFDTVQHDLALRKVS 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R+ D + M LL K + GVPQGGVISPL+SN+ LNE D+ L Sbjct: 185 ERVDDDQVMHLLKLIFKT---------SGKRGVPQGGVISPLISNLYLNEVDKMLERA-- 233 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 ++G+ T + Y R+ADD V++V G R+ Sbjct: 234 -----------KEVTRKGKYT----------HIEYARFADDLVILVDGHHRWNGLARKVY 272 Query: 300 RGVLE--GSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKARN 355 + + E LK++LN++KT++ + G F FLG IR+R + V IP+ KAR Sbjct: 273 QRLGEELAKLKVQLNLEKTRVVDLTRGEDFTFLGFN-IRQRMTLQGKQGVLCIPRMKART 331 >UniRef50_B4WUH1 Group II intron, maturase-specific domain family n=2 Tax=Cyanobacteria RepID=B4WUH1_9SYNE Length = 621 Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust. 
Identities = 126/298 (42%), Positives = 171/298 (57%), Gaps = 31/298 (10%) Query: 36 ITLSSKGAHTPGVDGVNKTMLQARLAVEL-QILRDELLSGHYQPLPARRVYIPKSNGKLR 94 +T ++G T G+DG +T L A V+L L+D L +Q LP +RVYIPK+NGKLR Sbjct: 103 VTQENQGRQTAGLDG--QTALTAEKRVQLVNRLQDHSL---WQVLPTKRVYIPKANGKLR 157 Query: 95 PLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG 154 PLGIPAL +R+ Q M A+EP WE+ F SYGFRP RS H AI L+L +T Sbjct: 158 PLGIPALENRVAQTIMKNALEPHWEARFEGHSYGFRPGRSCHDAIEQCFLRLRHGCDT-- 215 Query: 155 RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQ 214 WV++ DL FD + H ++ + + R + W +KAG+++ +F A +G PQ Sbjct: 216 -WVLDADLKGAFDNLSHSFILDTI--GLVPGRELIKQW--LKAGYVEAEMFHATPKGAPQ 270 Query: 215 GGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAY 274 GG ISPLL NI LN ++ L LS R Y +S + +S+ R + P Y Sbjct: 271 GGSISPLLLNIALNGMEKLL----LSFTT--TRTYQPSSKAKSQSSYKRTS----PTYGY 320 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHVNDGFIFLG 329 CRYADDFV+ K TKA +EA+ +L+ LK L LNM+KT+I +V GF FLG Sbjct: 321 CRYADDFVVTAK-TKADIEAVVP----ILQAWLKPRGLTLNMEKTQIVNVQQGFPFLG 373 >UniRef50_C8VXL4 RNA-directed DNA polymerase (Reverse transcriptase) n=5 Tax=Bacteria RepID=C8VXL4_DESAS Length = 434 Score = 192 bits (488), Expect = 2e-47, Method: Compositional matrix adjust. 
Identities = 133/364 (36%), Positives = 197/364 (54%), Gaps = 54/364 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 ++RK+ A D + R L + + E L EA ++ + GA PG+DG+ ++A Sbjct: 11 LRRKIYIKAKADKTWRFWGLYVHVCKIETLQEAYKMAKNKNGA--PGIDGITFDNIEASG 68 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + + LQ ++ EL+SG Y P RR IPK +GK R LGIP +RDR+VQ A+ + +EPI+E Sbjct: 69 IEIFLQQIQKELISGTYWPTQNRRKEIPKGDGKYRILGIPTIRDRVVQGALKLILEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYG+RP+R+ H AI V + + +TR VI+ DL SYFDTV H LL+K V Sbjct: 129 ADFQEGSYGYRPKRNPHQAIDRVAKAVVE-NKTR---VIDLDLRSYFDTVRHDLLLKKVA 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +R++D M LL +KA + GVPQGGVISPLL+N+ LNE D+ L Sbjct: 185 KRVNDENVMRLLKLILKA---------SGKRGVPQGGVISPLLANLYLNEVDKML----- 230 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKG-------TKAQV 292 ++ + E + + Y R+ADD V+++ KA Sbjct: 231 ---------------EKAKEVTRHEQY---THIEYARFADDIVILIDAYPKWNWLEKAVY 272 Query: 293 EAIREECRGVLEGSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQ 350 + + EE L ++LN +KT+I ++ +G F FLG R R+R G+ V+ T P+ Sbjct: 273 QRLLEEL-----TKLDVQLNEEKTRIVNLANGESFGFLGFDFRRSRTRKGKWGVLFT-PK 326 Query: 351 EKAR 354 KAR Sbjct: 327 MKAR 330 >UniRef50_Q92Y56 Reverse transcriptase n=3 Tax=Alphaproteobacteria RepID=Q92Y56_RHIME Length = 505 Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust. 
Identities = 139/394 (35%), Positives = 205/394 (52%), Gaps = 23/394 (5%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN----KTML 56 +QRKL W+ +P + + + +T L A + S+KG T GVDG+ + Sbjct: 15 IQRKLYQWSKANPDDQWRDMWGWLTDLRVLRHAWQRVASNKGGRTAGVDGMTVGRIRNRS 74 Query: 57 QARLAVELQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRDRIVQRAMLMAM 114 + R V+LQ +L SG Y+P PARR IPK+ G+ RPLGIP +RDR+VQ A + + Sbjct: 75 EHRFLVDLQA---DLRSGAYRPSPARRKLIPKAGKPGQFRPLGIPTIRDRVVQGAAKILL 131 Query: 115 EPIWESDFHTLSYGFRPERSVHHAIRTVK----LQLTDCGETRGR----WVIEGDLSSYF 166 EPI+E+ F +SYGFRP R+ H A+ ++ Q D R R WVIEGD+ F Sbjct: 132 EPIFEAQFWHVSYGFRPGRNTHGALEYIRRAALPQKRDEDTRRNRLPYPWVIEGDIKGCF 191 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIM 226 D ++H L++ +R+RI D R + L+ +KAG + F G PQGG+ISPLL+NI Sbjct: 192 DNINHHHLLERMRKRIGDRRVVRLVGLFLKAGVLTEDQFLRTDAGTPQGGIISPLLANIA 251 Query: 227 LNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK 286 L+ ++ +ER+ + + +N + S + + RYADDFV++V Sbjct: 252 LSAIEER-YERWTYHRKKTQARRKSNGVAAAASARDSDRIAGRCVYLPVRYADDFVVLVS 310 Query: 287 GTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL-IRKRSRYGEMRVV 345 G+ + A + L + L L +KTK+ + +GF FLG R + RYG V Sbjct: 311 GSLEEAMAEKSALADYLIKTTGLTLLPEKTKVTAMTEGFEFLGFRFSVHWDKRYGYGPRV 370 Query: 346 STIPQEKARNFAASLTALLWKVRIS---GEILLG 376 IP+ KA N + L + IS GE L G Sbjct: 371 E-IPKAKAANLRHKVKQLTQRDSISVSLGEKLRG 403 >UniRef50_UPI00019088D4 mobile mitochondrial group II intron of COX1 which IS involved in pre-mRNA splicing and in deletion of introns from n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI00019088D4 Length = 533 Score = 190 bits (482), Expect = 9e-47, Method: Compositional matrix adjust. 
Identities = 125/368 (33%), Positives = 187/368 (50%), Gaps = 45/368 (12%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 RI L RL+ P A+A + GA TPG+D N L L + + G Sbjct: 22 RINGLHRLLDCPNIWAQAYEAIARNSGALTPGIDPRN--TLDGFSLDRLNGIMRRVKEGS 79 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y+ P RR YIPK+NGKLRPLGIP D++VQ A+ + +E I+E +F S+GFRP+RS Sbjct: 80 YRFKPVRRHYIPKANGKLRPLGIPDADDKLVQAAVKLVLEQIYEPNFSRRSHGFRPKRSC 139 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A+ ++ Q T G W++E D++ +FD + H +LM +R+RI D RF+ L+ + Sbjct: 140 HTALASI--QKTWGGTV---WLVEADIAGFFDNIDHDILMNLLRKRIDDERFLKLIRGML 194 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LHERYLSGKARKDRWYWNN 252 AG+++ + A+ G PQGGVISPLL+N+ L+EFD++ L R+ G R + Sbjct: 195 TAGYMEDWKWHASYSGTPQGGVISPLLANVYLHEFDEFMDSLKTRFDRGIERPTNPEYQK 254 Query: 253 SIQRG---------------RSTAVRENWQWKPAVA-------------------YCRYA 278 + +G + A R Q +P + Y RYA Sbjct: 255 LLSKGAHCRQRIAKLRSSGREAEAERLRAQLQPLIVAARKLPSKDFHDTFFRRLQYVRYA 314 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRS 337 DDF++ V GTK + I E L+ L L + +K+ I +DG + FLG+ + + Sbjct: 315 DDFLISVIGTKQEAADILNEVTSFLQDQLHLEVAPEKSGITKADDGGVTFLGYAVRSTKR 374 Query: 338 RYGEMRVV 345 + E RV Sbjct: 375 AFREKRVT 382 >UniRef50_Q3ESS6 Reverse transcriptase / RNA maturase / Endonuclease n=13 Tax=Firmicutes RepID=Q3ESS6_BACTI Length = 607 Score = 189 bits (481), Expect = 1e-46, Method: Compositional matrix adjust. 
Identities = 119/356 (33%), Positives = 181/356 (50%), Gaps = 48/356 (13%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 + +RL R + PE+ A + GA T GVD K + ++ L + L Sbjct: 21 KFRRLYRNLYNPEFYFTAYDNLSKNDGALTMGVD---KRSIDGFSIEIIEELIETLKQRT 77 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 YQP P++RVYIPK NGK RPLGIP+ D++VQ + M +E I+E F S+ ++ +S Sbjct: 78 YQPFPSKRVYIPKKNGKKRPLGIPSFADKLVQEVVRMILEAIYEPTFSISSHAYQKGKSC 137 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A++ ++ T +W IEGD+ +FD ++H L+ +R+RI D F+ L+WK + Sbjct: 138 HTALQEIQRTFTG-----SKWFIEGDIKGFFDNINHHTLIGILRKRIEDEAFIELIWKFL 192 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKARKDRWYWNN 252 +AG+++ F G PQGG+ISPLLSNI LNE D+Y+ +++ GK RK + Sbjct: 193 RAGYMEEWKFHNTFSGAPQGGIISPLLSNIYLNELDKYMMDFIQKFNQGKKRKINPDYER 252 Query: 253 SIQRGRSTAVR------ENWQWKPA-----------------------------VAYCRY 277 + R A+R EN Q A + Y RY Sbjct: 253 KYTQMRK-AIRKYKIALENEQMGEAEQHLEQAKALKKELSSIPYSNPMDSNYKRLTYVRY 311 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRL 332 ADDF++ V G+K I+E L +L+L L+ +KT I H ++ FLG+ + Sbjct: 312 ADDFLIGVIGSKYDARNIKETLTEYLMETLQLELSQEKTLITHASENHAGFLGYNI 367 >UniRef50_Q3CZ44 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=2 Tax=Streptococcus agalactiae RepID=Q3CZ44_STRAG Length = 439 Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust. 
Identities = 121/339 (35%), Positives = 173/339 (51%), Gaps = 39/339 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRK+ D + L + + + L A +KG + G+D ++A Sbjct: 16 FQRKIYLSTKADNKRKFGVLYDKVYRKDILKVAWFYVKRNKG--SAGIDDFTIEEIEAYG 73 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L + D+L + YQP +RVYIPK+NGK RPLGIP +RDR+VQ A+ + +EPI+E Sbjct: 74 VQKFLDEIEDQLRNKKYQPKAVKRVYIPKANGKKRPLGIPTVRDRVVQTAVKIVIEPIFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS + AIR + L E WVI+ DL YFDT+ H L+ V+ Sbjct: 134 ADFQKFSYGFRPKRSANQAIREIYKYLNYGCE----WVIDADLKGYFDTIPHDKLLLLVK 189 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R++D + LL ++AG ++ R+ G PQGGVISPLL+NI LN D+ Sbjct: 190 ERVTDKSIIKLLSLWLEAGIMEDNQVRSNILGTPQGGVISPLLANIYLNALDR------- 242 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGT-KAQVEAIREE 298 YW N+ GR RYADDFV++ K + ++ Sbjct: 243 ---------YWKNNRLEGRGHDAH----------LIRYADDFVILCSNNPKKYYQYAKQR 283 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS 337 L L LN +KT+I H +GF FLG+ L + +S Sbjct: 284 I-----DKLGLTLNEEKTRIVHATEGFDFLGYTLRKSKS 317 >UniRef50_A8MI91 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=A8MI91_ALKOO Length = 421 Score = 187 bits (476), Expect = 4e-46, Method: Compositional matrix adjust. 
Identities = 116/310 (37%), Positives = 162/310 (52%), Gaps = 39/310 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L+ I + E L A + + GA PG+DG L + ++ L D+L + Y+P Sbjct: 7 LIDKIYRKENLELAFKYVKKNNGA--PGIDGETVFNFHLNLELNIEFLHDKLKTNGYEPS 64 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P RRV I K +G +R LGIP ++DR+VQ+A++ +EPI++ FH SYG+RP S H A+ Sbjct: 65 PVRRVEIQKPDGGVRLLGIPTVKDRVVQQAIVNIIEPIFDKTFHPSSYGYRPNHSQHGAV 124 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + + G V++ DLS FDT+ H ++MKAV RISD R + L+ K +KAG Sbjct: 125 AKAERFMNKYGLEH---VVDMDLSKCFDTLDHEIMMKAVSERISDGRVLKLIEKFLKAGV 181 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 + F G PQGGVISPLLSNI LN+FDQ R S Sbjct: 182 MHSDNFSRTEVGSPQGGVISPLLSNIYLNQFDQ-----------------------RMMS 218 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 +R R+ADD +LI K + VLE LKL++N +KTK+ Sbjct: 219 KGIR----------IVRFADD-ILIFAKDKKTAGNYKAYATQVLENELKLKVNNEKTKLT 267 Query: 320 HVNDGFIFLG 329 +VN+G FLG Sbjct: 268 NVNEGVEFLG 277 >UniRef50_B9IYU4 Reverse transcriptase n=21 Tax=Bacteria RepID=B9IYU4_BACCQ Length = 607 Score = 187 bits (476), Expect = 4e-46, Method: Compositional matrix adjust. 
Identities = 114/363 (31%), Positives = 179/363 (49%), Gaps = 45/363 (12%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 T A + RL R + ++ +A G T G D K + ++ Sbjct: 16 TKNAVKENYIFTRLYRNLYNKKFFLDAYGNIYHKPGNMTQGTD---KETIDGFSMDWIEN 72 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + L Y+P P+RRVYIPK + K RPLGIP+++D+I+Q + + ++E F S Sbjct: 73 IISSLKDESYKPNPSRRVYIPKKDDKQRPLGIPSIKDKIIQEVVKEILVSMYEPIFSKAS 132 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFRP +S H A+ +K+ +W IEGD+ +FD + H +L+ +R+RI D + Sbjct: 133 HGFRPNKSCHSALNDIKMTFGGI-----KWWIEGDIKGFFDNIDHHVLIGILRKRIKDEK 187 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKA 243 F+ L+WK +KAG+++ F G PQGG+ISP+L+NI L+E D ++ + ++ GK Sbjct: 188 FIKLIWKFLKAGYMEDWKFNKTFSGTPQGGIISPVLANIYLHELDAFMEKQIIKFDEGKR 247 Query: 244 RKDR----------WYWNNSI------------------------QRGRSTAVRENWQWK 269 R+D WY N + +R + +AV Sbjct: 248 RRDNPVYKKYNTAIWYRKNKLKEKWNTLNDDERKELQSEISTLEKEREKHSAVDNMDASF 307 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 + Y RYADDFV+ V G+K + I+EE L SLKL L+ +KT I + FLG Sbjct: 308 KRLKYVRYADDFVVGVIGSKEDSKRIKEEITEFLHTSLKLELSQEKTLITSNKNLIKFLG 367 Query: 330 HRL 332 + + Sbjct: 368 YEI 370 >UniRef50_C3FAV3 RNA-directed DNA polymerase n=11 Tax=Bacteria RepID=C3FAV3_BACTU Length = 652 Score = 186 bits (473), Expect = 8e-46, Method: Compositional matrix adjust. 
Identities = 124/411 (30%), Positives = 199/411 (48%), Gaps = 71/411 (17%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG-----VDGVNKTMLQARLAVELQILR 68 + + +RL R + PE+ + + ++ G T G VDG +K + + Sbjct: 40 NYKFKRLYRNLYNPEFYYKGYQEIYANPGNMTRGTINNTVDGFSKNRVSKII-------- 91 Query: 69 DELLSGHYQPLPARRVYIPKSNGKL-RPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 + + +G+Y+P P +RVYI K K RPLG+P D++VQ + +E I+E +F S+ Sbjct: 92 NNIKNGNYKPTPVKRVYIDKKGSKKKRPLGVPTFDDKLVQLVIKYILEAIYEPNFSENSH 151 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFR R H A++ +K +W IEGD+ +FD + H +L+ +R+RI+D Sbjct: 152 GFRKNRGCHTALKQIKKSGNGT-----KWFIEGDIQGFFDNIDHHILINLLRKRINDETL 206 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL--HERYLSGKARK 245 + L+WK ++AG+++ F G PQGG++SPLL+NI LNE D Y+ + + K Sbjct: 207 IGLIWKFLRAGYMEDWQFHKTFSGTPQGGILSPLLANIYLNELDIYMGKYAKKFGKGQPK 266 Query: 246 DR------WYWNNSIQRGRSTA----------------------VRENWQ---WKP---- 270 DR Y + I+RGR A V+E Q + P Sbjct: 267 DREVDKRYQYLHLKIKRGRKKADLLREQGKHNEAQELIEQVNEWVKERGQRPYYNPMSDK 326 Query: 271 --AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFL 328 ++ Y RYADDF++++ G+K +AI+ + L LKL L+ +KT I H + FL Sbjct: 327 FKSLKYVRYADDFIVMIIGSKDDAKAIKSDIAQFLNEELKLTLSEEKTLITHSSKKATFL 386 Query: 329 GHRLIRKRS-------------RYGEMRVVSTIPQEKARNFAASLTALLWK 366 G+ + R+ R+ ++V IP E RN +L L K Sbjct: 387 GYNVNITRNELFTKYSVKGVKRRHHNLKVRLEIPHEAWRNKLLALNVLEMK 437 >UniRef50_Q9MD87 Putative maturase n=1 Tax=Cryphonectria parasitica RepID=Q9MD87_CRYPA Length = 778 Score = 186 bits (472), Expect = 1e-45, Method: Compositional matrix adjust. 
Identities = 118/340 (34%), Positives = 180/340 (52%), Gaps = 39/340 (11%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 +L ++ P +L +G T G D L V+ EL +G Y Sbjct: 213 ILNVLADPNFLIACYDEIKGKQGNMTRGYDKATLDGLDYNWFVKTA---GELKAGKYNFK 269 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P+RRV IPK+NGK RPLG+ + RD+IVQ+A+ +E I+E F S+GFRP RS H A+ Sbjct: 270 PSRRVEIPKANGKTRPLGVGSPRDKIVQKALHAILEAIFEPLFLPSSHGFRPNRSTHSAL 329 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V L + WVI+GD++ FD++ H +++K + +I D +++ L+ K ++AGH Sbjct: 330 LKVYLS-----GNKHNWVIQGDITKCFDSIPHSIILKRIGAQIGDKKYLNLISKYLEAGH 384 Query: 200 ID--VGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LHERYLSGKARKDRWYWNNSI 254 ID G + G PQGG++SP+LSNI+L+EFD+Y L E + GK R+ WN + Sbjct: 385 IDPKTGTKVVLNYGTPQGGILSPILSNIVLHEFDKYMAKLSESFHKGKKRR----WNPAY 440 Query: 255 Q-----RGRSTAVRENWQ-------------WKPA---VAYCRYADDFVLIVKGTKAQVE 293 + RGR+ ++ E + P + Y RYADDFV+ + G+ Sbjct: 441 KRLLARRGRTKSLEEKQTLLKQMRTMRSIDAFDPNFRRLDYVRYADDFVVFISGSSKDAL 500 Query: 294 AIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRL 332 IR + L+ + L LN+DKT I ++ + + FLG L Sbjct: 501 FIRNNLKDYLKVNCGLELNVDKTAISNLATEKWKFLGAEL 540 >UniRef50_P0A3U1 DNA endonuclease n=8 Tax=Firmicutes RepID=LTRA_LACLM Length = 599 Score = 186 bits (471), Expect = 1e-45, Method: Compositional matrix adjust. 
Identities = 125/388 (32%), Positives = 199/388 (51%), Gaps = 65/388 (16%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGV-DGVNKTMLQARLAVELQILRDELLSGHYQ 77 RL R + +P+ A + S+KGA T G+ D + ++ +Q L+D G Y Sbjct: 25 RLYRYLLRPDIYYVAYQNLYSNKGASTKGILDDTADGFSEEKIKKIIQSLKD----GTYY 80 Query: 78 PLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P RR+YI K N K +RPLGIP D+++Q A+ + +E I+E F +S+GFRP+RS H Sbjct: 81 PQPVRRMYIAKKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCH 140 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A++T+K + RW +EGD+ FD + H L+ + +I D + L++K +K Sbjct: 141 TALKTIKREFGG-----ARWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLK 195 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY------- 249 AG+++ + G PQGG++SPLL+NI L+E D+++ L K + DR Sbjct: 196 AGYLENWQYHKTYSGTPQGGILSPLLANIYLHELDKFV----LQLKMKFDRESPERITPE 251 Query: 250 ---WNNSIQR---------GRSTA-VRENWQWK-------PAVA-------YCRYADDFV 282 +N I+R G A V +Q K P + Y RYADDF+ Sbjct: 252 YRELHNEIKRISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFI 311 Query: 283 LIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL---------- 332 + VKG+K + I+E+ + + LK+ L+ +KT I H + FLG+ + Sbjct: 312 ISVKGSKEDCQWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIRVRRSGTIKR 371 Query: 333 ---IRKRSRYGEMRVVSTIP-QEKARNF 356 ++KR+ G + ++ IP Q+K R F Sbjct: 372 SGKVKKRTLNGSVELL--IPLQDKIRQF 397 >UniRef50_A3WXE2 Putative reverse transcriptase n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3WXE2_9BRAD Length = 486 Score = 184 bits (468), Expect = 3e-45, Method: Compositional matrix adjust. 
Identities = 118/329 (35%), Positives = 177/329 (53%), Gaps = 15/329 (4%) Query: 48 VDGVNKTMLQARLAVE--LQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRD 103 +D V ++ R+ VE L R LL G Y+P RRV IPK G RPLG+P ++D Sbjct: 1 MDKVTVKHIETRIGVERFLTTTRTMLLDGSYRPQAVRRVMIPKRGRPGLFRPLGVPTVQD 60 Query: 104 RIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL--------TDCGETRGR 155 R+VQ A+L +EPI+E+ F T+SYGFRP+R+ A+ ++ + TD + Sbjct: 61 RVVQAALLQLLEPIFEAVFLTVSYGFRPKRACRDALEHIRNAIRPTGQKTETDWPRPPYQ 120 Query: 156 WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQG 215 WVIEGD+ FD + H +M +RRR+SD R L+ +KAG + G G PQG Sbjct: 121 WVIEGDIKRCFDNIDHHHVMTCLRRRVSDRRVTRLVRAFLKAGVLSEGSLVRTKAGTPQG 180 Query: 216 GVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC 275 GV+SPLL+N+ML+ ++ + +Y+ + +D + R E + Sbjct: 181 GVLSPLLANVMLDGIERR-YAKYVVPRLTRDGKPYARPGNELRKFRHYERKAGRVVFLPI 239 Query: 276 RYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL-IR 334 RYADDFV++V GT+ Q A +E L +KL L +KT + + +GF FLGHR+ +R Sbjct: 240 RYADDFVVLVNGTEEQARAEKEALAVFLREEMKLTLAPEKTHVTSLTEGFEFLGHRVRLR 299 Query: 335 KRSRYGEMRVVSTIPQEKARNFAASLTAL 363 R+G V IP+ + ++F + L Sbjct: 300 WDDRWGYWPRVE-IPKARVKDFHHRIKQL 327 >UniRef50_A5CZB8 Retron-type reverse transcriptase n=20 Tax=Bacteria RepID=A5CZB8_PELTS Length = 423 Score = 184 bits (468), Expect = 3e-45, Method: Compositional matrix adjust. 
Identities = 108/300 (36%), Positives = 158/300 (52%), Gaps = 39/300 (13%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 L +A + +++GA PG+DG L L L EL +G Y+P P +RV IPK Sbjct: 17 LEKAYQAVRANRGA--PGIDGETVEAFGQNLGQRLIQLHHELKTGTYEPQPVKRVEIPKP 74 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 +G RPLGIP +RDR+VQ+A+L ++PI+E FH SYG+RP RS H A+ + + Sbjct: 75 DGSTRPLGIPTVRDRVVQQALLNILQPIFEPGFHPSSYGYRPGRSCHQAVAKAERFMNKY 134 Query: 150 GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAAS 209 G +V++ DLS FD + H L+++ V R+ISD + L+ K + AG + G + Sbjct: 135 GL---EYVVDMDLSKCFDRLDHELILEEVNRKISDGSVLKLIKKFLTAGVMKDGQWDEID 191 Query: 210 EGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK 269 G PQGGVISPLL+NI L+ FDQ + R Sbjct: 192 TGSPQGGVISPLLANIYLDRFDQAMKSR-------------------------------- 219 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 + RYADD +L+ T+ + R+ +LEG LKL +N +KT + V++G +LG Sbjct: 220 -GIRIVRYADD-ILVFARTRKEAGNYRQVATQILEGELKLEVNKEKTHLTSVHEGVAYLG 277 >UniRef50_C6J6N9 RNA-directed DNA polymerase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J6N9_9BACL Length = 430 Score = 184 bits (467), Expect = 5e-45, Method: Compositional matrix adjust. 
Identities = 132/351 (37%), Positives = 184/351 (52%), Gaps = 50/351 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN----KTML 56 +Q KL A + R L + + + L EA R +++G + GVDG + Sbjct: 16 LQGKLGHAAKENKKRRFHALYDKVYRVDILWEAWRRVRANEG--SAGVDGETLADIEKQG 73 Query: 57 QARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEP 116 + R +E Q L L G Y P P RR YIPK +GKLRPLGIP +RDR++Q A + MEP Sbjct: 74 EMRFVLECQRL---LKEGKYHPQPVRRHYIPKKDGKLRPLGIPTVRDRVIQMATKLVMEP 130 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I+E+DF S+GFRP+RS A+ ++ C +G WV++ D+ YFD ++ LMK Sbjct: 131 IFEADFQDTSFGFRPKRSAKQALERIR---KACNR-KGNWVVDVDIQGYFDNINQEKLMK 186 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + RISD R + L+ K + AG ++ G R + G PQGGVISPLL+NI LN FD L E Sbjct: 187 LIEMRISDRRILKLVRKWLGAGVMEEGNIRRSDLGTPQGGVISPLLANIYLNYFD-LLWE 245 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 R+ GK G T RYADD V+I K TK + Sbjct: 246 RH-GGKL-------------GELT---------------RYADDLVIICK-TKKDAQRAY 275 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHV---NDGFIFLG--HRLIRKRSRYGEM 342 E R ++E L+L L+ KT+I + +GF FLG HR + + G++ Sbjct: 276 ELIRAIME-RLELTLHPTKTRIVGLWTGEEGFDFLGMHHRKTKAETSKGQV 325 >UniRef50_C6CA58 RNA-directed DNA polymerase n=3 Tax=Enterobacteriaceae RepID=C6CA58_DICDC Length = 626 Score = 184 bits (467), Expect = 5e-45, Method: Compositional matrix adjust. 
Identities = 122/400 (30%), Positives = 210/400 (52%), Gaps = 59/400 (14%) Query: 10 ATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 A+ +I++L +++ + L A+A S++GA T G++ N TM + ++V+ I Sbjct: 16 ASTQGYKIKKLHKIMCSNKDLWAQAYANIYSNQGAMTRGIN--NNTMDE--MSVDRIINL 71 Query: 69 DELL-SGHYQPLPARRVYIPKS----NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 +L+ S Y+P P RR +IPK NGK RPLGIP D+++Q M M +E I+E F Sbjct: 72 IQLINSDSYKPKPCRRTHIPKDARKPNGKKRPLGIPTGDDKLIQEVMRMLLEEIYEPVFS 131 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGFRP+RS H A++ ++ G +WV + D+ YFD + H LL+K + +RI+ Sbjct: 132 DWSYGFRPKRSCHSALKEIR-----NGWKGTKWVCDVDIKGYFDNIDHDLLLKFLSKRIA 186 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL--------- 234 D +F+ LL K +K G++D + G PQGG+ISP+L+N+ L+E D+++ Sbjct: 187 DNKFLALLKKFLKTGYLDNWRYFGTHSGTPQGGIISPILANVFLHELDEFMKNRISEFGT 246 Query: 235 ---------HERYLSGKARKDRW---------------------YWNNSIQ---RGRSTA 261 ++R L +A + +W Y + ++ R S+ Sbjct: 247 GGRRKPNPIYKRALQNRANRIKWIRQGFGASGMPADEQKIQKWKYEADELEKQLRTLSSV 306 Query: 262 VRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV 321 + ++ ++K + Y RYADDF++ V G+K++ + I +E +E L L ++ +K+ I Sbjct: 307 IMDDTEFK-RMRYVRYADDFLIGVIGSKSEAKKIMKEVVDFVENELHLEISKEKSGIIDP 365 Query: 322 NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLT 361 GF FLG+ I+ R ++ V + + ++ A T Sbjct: 366 KKGFTFLGYE-IKTRRESKRVKCVVGLNTDGSKTHAVKRT 404 >UniRef50_C8R0Q7 RNA-directed DNA polymerase (Reverse transcriptase) n=6 Tax=Bacteria RepID=C8R0Q7_9DELT Length = 508 Score = 184 bits (467), Expect = 5e-45, Method: Compositional matrix adjust. 
Identities = 119/327 (36%), Positives = 169/327 (51%), Gaps = 45/327 (13%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 A P L R++++ L R+ ++KGA PG+D + A + +R Sbjct: 81 ADQPDPDANLLERILSRANMLKAWERVK-ANKGA--PGMDNMPIADFMAFAREHWEEIRA 137 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 LL+G YQPLP +RV IPK G RPLGIP + DR++Q+AM + PI++ DF SYGF Sbjct: 138 SLLAGTYQPLPVKRVEIPKPTGGTRPLGIPTVLDRLIQQAMAQVLLPIFDPDFSEASYGF 197 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP RS H AI V+ D R ++ DLS +FDTV H LLM V R++ D R + Sbjct: 198 RPGRSAHDAIHRVR----DYIRQGYRVAVDADLSKFFDTVDHDLLMNRVGRKVRDQRVLR 253 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 L+ K ++AG + G R +GVPQGG +SPLLSNI+L++ D+ L R Sbjct: 254 LVGKYLRAGVMIDGRRRETRKGVPQGGPLSPLLSNILLDDLDKELERR------------ 301 Query: 250 WNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKL 309 + RYADDF+++VK +A + R LE LKL Sbjct: 302 ---------------------GHRFARYADDFIILVKSRRAGERVMTGITR-FLESKLKL 339 Query: 310 RLNMDKTKIPHVND----GFIFLGHRL 332 +N +K+K+ N+ GFIF G ++ Sbjct: 340 VVNQEKSKVAPTNESGFLGFIFKGAKI 366 >UniRef50_C5CID2 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CID2_KOSOT Length = 423 Score = 183 bits (465), Expect = 8e-45, Method: Compositional matrix adjust. 
Identities = 119/347 (34%), Positives = 181/347 (52%), Gaps = 51/347 (14%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 LA+A + GA PG+DGV L ++ L ++L G Y+P P +RV IPK Sbjct: 17 LAKAYHKVRRNNGA--PGIDGVTVQEYGENLLERIKKLSEKLRKGEYRPSPVKRVEIPKG 74 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 NGK R LGIP + DRIVQ+++ MEPI+E FH SYG+R R+ H A+ K C Sbjct: 75 NGKTRMLGIPTVEDRIVQQSLKEIMEPIFEEGFHPSSYGYRKGRNPHQAVE--KAYAFAC 132 Query: 150 GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAAS 209 + + ++V++ DLS FDT+ H ++ AV RISD + + L+ +K+G I ++ + Sbjct: 133 -KYKMKYVVQLDLSQCFDTLDHEKMIDAVAERISDGKILRLIRSFLKSGVI-TDQYQPSE 190 Query: 210 EGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK 269 G PQGGVISPLL+NI LN+FDQ + R Sbjct: 191 MGSPQGGVISPLLANIYLNKFDQKMMAR-------------------------------- 218 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 + RYADD ++ K K+ + ++ R +LE LKL++N +KT+I ++DG FLG Sbjct: 219 -GIRIVRYADDILIFAKSYKSAEKYLKIAIR-ILEKELKLKVNKEKTRITTIDDGIEFLG 276 Query: 330 HRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS--GEIL 374 + + + R I ++K + F A + L + + + GEI+ Sbjct: 277 FTIQKGKIR---------IQEKKIKRFKAKVKTLTRRNQCTPIGEII 314 >UniRef50_C0JX29 Putative reverse transcriptase and intron maturase n=1 Tax=Pyramimonas parkeae RepID=C0JX29_9CHLO Length = 608 Score = 183 bits (464), Expect = 1e-44, Method: Compositional matrix adjust. 
Identities = 121/357 (33%), Positives = 191/357 (53%), Gaps = 56/357 (15%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL-QILRDELLSGHYQP 78 L RL+ + L A + S G TPG D + + RL + + LRD+ ++ Sbjct: 36 LYRLLCNKQLLTLAYNLIKSKPGNMTPGTDKLTLDKMSERLIDKTSRQLRDQT----FKF 91 Query: 79 LPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P RRV+IPK N K +RPLG+P+ RD++VQ+AML+ M+ I+E+ F T S+GFRP RS H Sbjct: 92 KPVRRVFIPKGNSKDIRPLGVPSSRDKVVQKAMLLIMDNIYETTFSTHSHGFRPGRSCHS 151 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A++ ++ + + +W IEGD+ +D V+H++L+ +R +I D RF+ LLWK ++A Sbjct: 152 ALKEIRSEWSGI-----KWAIEGDIKGCYDNVNHQILINILREKIKDERFIQLLWKLLRA 206 Query: 198 GHIDVG-LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS-------GKARKDRWY 249 G I+V + G PQGG++SPLL+NI LNEFD+++ LS K R+D Sbjct: 207 G-IEVNRTIERSKIGTPQGGILSPLLANIYLNEFDKFVSN--LSQKIGLTYNKTRRDNPE 263 Query: 250 WNN---SIQRGRST-AVRENWQWKPA-----------------------------VAYCR 276 ++ I R R+ V +P+ + + R Sbjct: 264 YHKIRGKIYRLRTKRTVSGTVNVQPSKSDLKQIQILSKTQRTLPSKDPFDPQYRKILFIR 323 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI-PHVNDGFIFLGHRL 332 YADD+++ V G + I+E+ + L+ L+L L+ +KTKI P + FLG++L Sbjct: 324 YADDWIVGVIGGHEFAQGIKEQIQTFLKEQLELTLSPEKTKITPFSSKKVTFLGYKL 380 >UniRef50_C5RJ16 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=C5RJ16_CLOCL Length = 626 Score = 181 bits (459), Expect = 4e-44, Method: Compositional matrix adjust. 
Identities = 106/299 (35%), Positives = 169/299 (56%), Gaps = 23/299 (7%) Query: 39 SSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS------GHYQPLPARRVYIPKSNGK 92 S+ G+ TPG D + + LQ+ ++E++S +Y+P PARR YIPKSNGK Sbjct: 52 SNSGSKTPGTD-------RNTIDKYLQMSKEEVISLVKKSASNYKPKPARREYIPKSNGK 104 Query: 93 LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGET 152 RPLGIP + DRI+ + + +EPI E+ F+ SYGFRP R+ HAI ++ ++ + Sbjct: 105 KRPLGIPTVIDRIILECIRIVIEPICEAKFYPHSYGFRPYRACSHAIASIVHVISSTSKD 164 Query: 153 RGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEG 211 +VIEGD+ SYFD ++H++L+ + + + D R + L+ +KAG+I+ LF G Sbjct: 165 IPHYVIEGDIKSYFDNINHKVLINKLWKMGVHDKRMLCLIKLMLKAGYIERDLFYLTEAG 224 Query: 212 VPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA 271 PQGG+ISPLL+N+ LN FD + Y K + + ++ ++ R T ++ + Sbjct: 225 TPQGGIISPLLANVYLNSFDWMIGRMYQEPKGIETKNDRSHCREKLRRTGIKPKY----- 279 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLG 329 RYADD+V I+ ++ + E + R + LKL L+ +KT I + + FLG Sbjct: 280 --LVRYADDWV-ILTTSRQEAERLLHYIRRYFKHKLKLELSEEKTVITDIKCEKVKFLG 335 >UniRef50_B0TA92 Reverse transcriptase (RNA-dependent DNA polymerase) n=44 Tax=Bacteria RepID=B0TA92_HELMI Length = 475 Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust. 
Identities = 112/336 (33%), Positives = 173/336 (51%), Gaps = 44/336 (13%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L A P L+ + + + EA + +++KGA G+DG+ L+ L E Sbjct: 42 LPAQEAKQPREETYDLMEKVVERGNMTEAYKRVMANKGAA--GIDGMGLESLRPYLKEEW 99 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 ++ ELL G Y+P P RRV IPK G R LGIP + DR++Q+A+ + PI++ DF T Sbjct: 100 SRIKQELLEGTYRPQPVRRVEIPKPQGGTRKLGIPTVVDRLIQQALNQILMPIFDPDFST 159 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP +S H A++ K + D RWV++ DL+ +FD V+H +LM V R++ D Sbjct: 160 NSYGFRPGKSAHQAVKKAKEYIADG----YRWVVDMDLAQFFDRVNHDILMARVARKVKD 215 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 R + L+ + +KAG + G+ + EG PQGG +SPLL+NI+L++ D+ L R Sbjct: 216 KRILKLIREYLKAGVMLNGIRVKSEEGTPQGGPLSPLLANIILDDLDKALESR------- 268 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 +CRYADD + V+ +A + + E LE Sbjct: 269 --------------------------GHRFCRYADDCNVYVRSRRAG-QRVMEGMAKFLE 301 Query: 305 GSLKLRLNMDKTKIPHVND----GFIFLGHRLIRKR 336 G LKL++N +K+ + + GF F H+ + R Sbjct: 302 GRLKLQVNWEKSAVDRPWNRKFLGFSFTWHKAAKIR 337 >UniRef50_A5CZJ0 Retron-type reverse transcriptase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5CZJ0_PELTS Length = 428 Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust. 
Identities = 125/360 (34%), Positives = 191/360 (53%), Gaps = 46/360 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 QRKL A + + R L + + + L A + ++KGA PG DG + ++ ++ Sbjct: 15 FQRKLYVKAKQEKTFRFYSLYDKLYREDVLQYAWQQCRANKGA--PGADGQSFKDIEEKV 72 Query: 61 AVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 VE L+ + +EL +G Y+P+P RRVYI K +G RPLGIP ++DRI Q A L ++PI+ Sbjct: 73 GVERFLKEIAEELRNGTYRPMPVRRVYILKPDGSQRPLGIPTIKDRIAQMACLTVIQPIF 132 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP+R+ H AI + + G T V + DL+ FD++ HRL+M ++ Sbjct: 133 EADFLDCSYGFRPKRNAHQAIGAITENIKQ-GFT---AVYDADLTKCFDSIQHRLIMDSL 188 Query: 179 RRRISDARFMTLLWKTIKAGHIDVG---LFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 RI+D + + L+ ++A ++ G R +G PQGGVISPLL+NI+LN D+ H Sbjct: 189 AERITDGKVLRLIKGWLEAPIVEPGGPKQGRKNYQGTPQGGVISPLLANIVLNRLDRLWH 248 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 R RE + + RYADDFV++ + E I Sbjct: 249 ----------------------RPGGPRERYNAR----LVRYADDFVVLARFIG---EPI 279 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDGFI--FLGHRLIRKRSRYGEMRVVSTIPQEKA 353 + E ++ S+ L LN KT+I +N G I FLG+ + R + R ++ P +KA Sbjct: 280 KNELESIIT-SMGLNLNEKKTRILDLNKGDILNFLGYSIRISRDK---NRRITIKPSDKA 335 >UniRef50_Q02717 Reverse transcriptase homologue COI ialpha grp II protein (Fragment) n=3 Tax=Podospora anserina RepID=Q02717_PODAN Length = 788 Score = 178 bits (451), Expect = 3e-43, Method: Compositional matrix adjust. 
Identities = 121/371 (32%), Positives = 185/371 (49%), Gaps = 59/371 (15%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVD-----GVNKTMLQARLAVELQILRDELLSGHYQ 77 +I + E L A S G TP VD G++K + + ++L S ++ Sbjct: 188 VICKLEALYTAYMNIKSEPGNMTPRVDSETLDGISKEWFEK--------ISEQLKSEQFR 239 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P RRVYIPK+NGK+RPLGI + RD+IVQ +E + E FH+ S+GFRP R H Sbjct: 240 FRPTRRVYIPKANGKMRPLGIASPRDKIVQEVFRAILEQVLEPRFHSSSHGFRPGRGCHS 299 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+ T++ +W IEGD+ +FD + H +L K + + D RF+ L WK +KA Sbjct: 300 ALATIRYW------NGIKWFIEGDIKGFFDNIDHHILEKLLVKHFQDQRFIDLYWKMVKA 353 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------HERYLSGK-ARKDRWY 249 G+++ +++ GVPQGG+ SP+LSN++LNE D+++ +E+ GK K+ Y Sbjct: 354 GYVEFDKDKSSIIGVPQGGIASPILSNLVLNELDEFVQNIVDEFNEKLKGGKHTSKNPAY 413 Query: 250 WNNSIQRGRSTAVRENWQWK----------------------------PAVA---YCRYA 278 + G+ T + + K P +A Y RYA Sbjct: 414 VVIDSRIGKITRLERKLKSKGQELDSGRKLERMKLIKVRATMPSMIPNPDLAKIYYVRYA 473 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRS 337 DD+++ V G+ AI+E L+ LKL L+M+KT I + + D FLG + R S Sbjct: 474 DDWLIGVAGSSETARAIKERIAAYLKDILKLELSMEKTLITNASEDKAYFLGTEIQRISS 533 Query: 338 RYGEMRVVSTI 348 GE++ I Sbjct: 534 VKGEIKRFKNI 544 >UniRef50_UPI0001C42942 reverse transcriptase n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42942 Length = 644 Score = 177 bits (449), Expect = 5e-43, Method: Compositional matrix adjust. 
Identities = 122/344 (35%), Positives = 184/344 (53%), Gaps = 23/344 (6%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTM-LQARLAVELQILRDELLSGHYQP 78 L+ L+ + + A S+KG+ T G+D NKT+ L E + + + Y P Sbjct: 35 LIELMKNQQTIMTALHNIKSNKGSKTVGID--NKTIDYYLHLPYEDLVSQVQTCIEDYNP 92 Query: 79 LPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P RR YIPK N KLRPLGIP + DRI+Q + +EPI E+ F+ SYGFRP RS H Sbjct: 93 EPVRRKYIPKENSDKLRPLGIPTMIDRIIQEITRLVIEPIAEAKFYKFSYGFRPMRSAEH 152 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIK 196 A+ + L +++ WVIEGD+ YFD ++H L+ + + I D R ++++ K +K Sbjct: 153 AMAEI---LEKARKSKTYWVIEGDIKGYFDNINHNKLITMLWKIGIKDKRVLSIIKKMLK 209 Query: 197 AGHIDV-GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 +G ++ G + G PQGG+ISPLL+NI LN FD + E + D+ ++ N+ + Sbjct: 210 SGIVEEDGEIYPSDLGSPQGGIISPLLANIYLNFFDWMIAEEF-------DQHHYINNYE 262 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 R R +R + V RYADD+V++ +K Q + + + R L+ L L L+ +K Sbjct: 263 R-RDKGLRAIRRDHKPVYSIRYADDWVVLC-SSKKQADTLLIKIRKYLKHQLSLELSEEK 320 Query: 316 TKIPH-VNDGFIFLGHRLI---RKRSRYGEMRVVSTIPQEKARN 355 TKI + V + FLG RK+ + G V IP K N Sbjct: 321 TKITNLVEEKASFLGFEFFVEPRKKGK-GNKMVAKMIPDRKKSN 363 >UniRef50_P38478 Uncharacterized mitochondrial protein ymf40 n=1 Tax=Marchantia polymorpha RepID=YMF40_MARPO Length = 502 Score = 176 bits (447), Expect = 1e-42, Method: Compositional matrix adjust. 
Identities = 118/334 (35%), Positives = 173/334 (51%), Gaps = 40/334 (11%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDELLSGHYQPLPAR 82 + PE A + S G PG D QA + ++ L+DE +Q P+R Sbjct: 9 LLDPEIFRLAYELKKSKSGNMKPGADKETLDGFSQAYVEKVVRQLKDE----SFQFRPSR 64 Query: 83 RVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTV 142 R +IPK++GKLR LGIP+ RD+IVQ M +EP++E F S+GFRP RS H A+R + Sbjct: 65 REFIPKADGKLRSLGIPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRSPHTALRQI 124 Query: 143 KLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV 202 + T W+IEGD+ YFD + H LL + + D R + L WK ++AG+++ Sbjct: 125 RRW------TGTSWMIEGDIKGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAGYVNQ 178 Query: 203 GLFRAASE---GVPQGGVISPLLSNIMLNEFDQYLHE---RYLSG-----------KARK 245 G +A GVPQG ++SPLLSNI L++FD ++ E +Y + KAR Sbjct: 179 G--KAEPHLLTGVPQGRILSPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYLKARN 236 Query: 246 DRWYWNNSIQRGRSTAVRE---------NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + S++ + +R Q V Y RYADD+V+ V G KA I+ Sbjct: 237 KYYKLVKSLKASSAEIIRARRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKALAVQIK 296 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLG 329 EE L+ LKL L +KT+I +++ +FLG Sbjct: 297 EEVSTFLQEKLKLSLQAEKTRITNLSRSEALFLG 330 >UniRef50_A5ZWA2 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5ZWA2_9FIRM Length = 428 Score = 176 bits (446), Expect = 1e-42, Method: Compositional matrix adjust. 
Identities = 123/352 (34%), Positives = 186/352 (52%), Gaps = 50/352 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +Q KL A R L + + + L EA + ++KG + GVDG+ ++ ++ Sbjct: 13 LQNKLYLTAKKCQKRRFHALYDKVYRDDVLIEAWKRVKANKG--SSGVDGIRIEDIE-KM 69 Query: 61 AVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 +E L+ L+ EL+ G Y P P +RV IPK +G RPLGIP +RDRIVQ A +A+EP++ Sbjct: 70 GIEKYLKELKKELIEGKYIPSPVKRVMIPKPDGSERPLGIPTVRDRIVQMAAKIAIEPVF 129 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP+RS A+ V+ +G +V++ D+ +FD V+ LM + Sbjct: 130 EADFRECSYGFRPKRSAKQALEVVR----KACNNKGYYVVDADIEKFFDNVNQDKLMILI 185 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +RISD R + L+ + +++G + + + G QG VISPLL+NI LN D+ L E+Y Sbjct: 186 EQRISDRRILKLIRQWMRSGILYGSILTISELGTSQGSVISPLLANIYLNTLDR-LWEKY 244 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 GR+ + RYAD+ V+I K K+ A + Sbjct: 245 ------------------GRTHGI-----------LVRYADNTVIICKNKKSVNHA--QS 273 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV---NDGFIFLG--HR----LIRKRSRYGE 341 + G L LRL+ KTKI ++ +GF FLG HR + +K +RYGE Sbjct: 274 LLQYIMGKLDLRLHPVKTKIVNMWDGTEGFDFLGLHHRRFLKINKKGNRYGE 325 >UniRef50_Q35056 CoxII intron2 ORF n=4 Tax=Embryophyta RepID=Q35056_MARPO Length = 827 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. 
Identities = 110/303 (36%), Positives = 169/303 (55%), Gaps = 19/303 (6%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSGH--YQPLPARRVYIPKSNGKLRPLGIP 99 G+ PG G+ R +LQ LRD +L Y+ + + P K+ LGIP Sbjct: 356 GSMNPGTHGLTIDGTSFR---KLQALRDAVLDSESPYEWGGTKIITKPGKREKIS-LGIP 411 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 +DRIVQ + M +EPI+ES F S+G+RP RS H A+RT++ +D +T W++ Sbjct: 412 CFQDRIVQEVLKMLLEPIYESIFSRRSHGWRPGRSAHTALRTIR---SDFKKTN--WIVP 466 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVI 218 G+++ FD V+H +L +RR+I D + + L+ +KA H+ G ++ G PQG ++ Sbjct: 467 GNINKLFDIVNHGILCHIMRRKIRDKKLLKLIAGGLKAKIHMPYGNIEESNLGTPQGRIL 526 Query: 219 SPLLSNIMLNEFDQYLHER---YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA---V 272 SP+LSNI L+EFD ++ ER Y G+ W + R +R + + P + Sbjct: 527 SPILSNIYLHEFDIWIEERIQQYNLGRKETRSWVLLRKQGKMRKARLRSD-PFNPLYRRM 585 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RY DDF++ ++G + +AIR+EC L LKL LNM+KT I H++ G FLGHR+ Sbjct: 586 EYRRYGDDFLIAIRGPLSDAKAIRQECETFLREKLKLLLNMEKTHIKHISVGIPFLGHRI 645 Query: 333 IRK 335 R+ Sbjct: 646 GRR 648 >UniRef50_A9AUN7 RNA-directed DNA polymerase n=4 Tax=Bacteria RepID=A9AUN7_HERA2 Length = 595 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. 
Identities = 112/335 (33%), Positives = 177/335 (52%), Gaps = 41/335 (12%) Query: 40 SKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIP 99 +KG+ +DG M A++ + LR E Y+ P RRVYIPK+ GK RPLG+P Sbjct: 49 TKGSTDETIDG----MSMAKIHRIIADLRRET----YRWTPVRRVYIPKATGKTRPLGVP 100 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 D++VQ + ++ ++ S+GFRP R H A++ ++ T RW IE Sbjct: 101 TWSDKLVQEVLRSILDAYYDPQMSDHSHGFRPNRGCHTALKAIQRCWTGT-----RWFIE 155 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 GD++ YFDT++H L+ + +RI D RF+ L+ ++AG++ ++ G PQGGVIS Sbjct: 156 GDIAQYFDTINHTTLLTILAKRIHDGRFLRLIQTLLQAGYLHDWVYHPTLSGTPQGGVIS 215 Query: 220 PLLSNIMLNEFDQYLHER----YLSGKARKDRWYWNNSIQR----------GRSTAVREN 265 PLL+NI L+EFDQ++ Y G+ RK + QR T + + Sbjct: 216 PLLANIYLHEFDQFVEHTLIPAYTKGQRRKVNPAYAQMEQRISKLRRQREYASVTPLLKE 275 Query: 266 WQWKPA----------VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 + P+ + Y RYADDF+L GTK + EAI+++ L L+L+L+ K Sbjct: 276 LRTLPSRDVHDPDYRRLRYVRYADDFLLGFAGTKVEAEAIKQQINVWLYDHLQLKLSTQK 335 Query: 316 TKIPHV-NDGFIFLGHRLIRKRS---RYGEMRVVS 346 T I H +D FLG+ ++ +++ + G R+V+ Sbjct: 336 TLITHASSDPAHFLGYDIVTQQANSKQTGNRRIVN 370 >UniRef50_C9S0G0 RNA-directed DNA polymerase n=2 Tax=Geobacillus RepID=C9S0G0_GEOSY Length = 635 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. 
Identities = 116/330 (35%), Positives = 179/330 (54%), Gaps = 36/330 (10%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDG--VNKTMLQARLAVELQILRDELLSGHYQ 77 +L L+ + EA R S+KG+ T G+D V+ +L V +++ +L Y+ Sbjct: 34 MLELLQNDVVILEAIRNIKSNKGSKTAGIDQKIVDDYLLMPTEKV-FGMIKAKL--NDYK 90 Query: 78 PLPARRVYIPKSN--------------GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 P+P RR PK N G+ RPLGI A+ DRI+Q + + +EPI+E+ F+ Sbjct: 91 PIPVRRCNKPKGNAKSSKRKGNSPNEEGETRPLGISAVTDRIIQEMLRIVLEPIFEAQFY 150 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRI 182 SYGFRP RS HA+ + L ++ WV++GD+ SYFD ++H+ L+ + + Sbjct: 151 PHSYGFRPYRSTEHALAWM---LKIINGSKLYWVVKGDIESYFDHINHKKLLNIMWNMGV 207 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D R + ++ K +KAG + G F ++G+PQGG+ISPLL+N+ LN FD + + Y Sbjct: 208 RDKRVLCIVKKMLKAGQVIQGKFYPTAKGIPQGGIISPLLANVYLNSFDWMVGQEY---- 263 Query: 243 ARKDRWYWNNSIQRGRSTAVR--ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 ++ NN+ R + A+ N P V Y RYADD+V I+ TK E IRE+C+ Sbjct: 264 ----EYHPNNANYREKKNALAALRNKGHHP-VFYIRYADDWV-ILTDTKEYAEKIREQCK 317 Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFI-FLG 329 L L L L+ +KT I + + + FLG Sbjct: 318 QYLACELHLTLSDEKTFIADIREQRVKFLG 347 >UniRef50_D2M2V6 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M2V6_BACS4 Length = 441 Score = 174 bits (441), Expect = 4e-42, Method: Compositional matrix adjust. 
Identities = 105/304 (34%), Positives = 160/304 (52%), Gaps = 41/304 (13%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 + EA + ++G+ GVDGV+ + + +Q+L+ EL Y+P P +RV+IPK+ Sbjct: 35 MEEAFKEVKRNRGS--AGVDGVSIRTFEHGVEDNVQVLQRELKEKAYRPRPVKRVFIPKT 92 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 +G RPLGIP +RDR+VQ A+ +EPI+E F S+GFRP +S H A+ ++ L D Sbjct: 93 DGTKRPLGIPTVRDRVVQAAVRRIIEPIFEDKFLDCSFGFRPNKSAHMALEKIRKDLMD- 151 Query: 150 GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAAS 209 G +VI+ DL +YFDT+ L++AVR + D + L+ ++AG +D G F Sbjct: 152 GYV---YVIDADLKAYFDTIPQDKLIQAVREEVVDGSVIRLIQSFLQAGVMDGGSFHLTE 208 Query: 210 EGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK 269 +G PQGGVISPLL+NI L+ D+ + +R Sbjct: 209 KGTPQGGVISPLLANIYLHPLDELMTKR-------------------------------- 236 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK-IPHVNDGFIFL 328 RYADDFV+ K K ++ R L L L ++ +KTK + ++ + F+FL Sbjct: 237 -GHRITRYADDFVICCKSQKGAERVLKSVTR-FLNEELGLTVHPEKTKVVNNLEEPFLFL 294 Query: 329 GHRL 332 GH Sbjct: 295 GHEF 298 >UniRef50_O47500 RT-like protein n=1 Tax=Venturia inaequalis RepID=O47500_VENIN Length = 760 Score = 174 bits (441), Expect = 5e-42, Method: Compositional matrix adjust. 
Identities = 115/374 (30%), Positives = 196/374 (52%), Gaps = 48/374 (12%) Query: 4 KLATWAATDPSLRIQR-LLRLITQPEWLAEAARITLSSKGAHTPGV-----DGVNKTMLQ 57 KL+ + ++P+ I R L L T + L A S G T GV DG+++ Sbjct: 246 KLSIRSKSNPNSIIDRELYTLATSVDTLIYAYENIKSKPGNMTQGVLPETLDGISRE--- 302 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 +L L D L S + P+RR+ IPK++G RPL I + D+IVQ AM + +E I Sbjct: 303 -----KLTKLSDSLRSEKFSFSPSRRIQIPKASGGSRPLSIASPMDKIVQEAMRLVLEAI 357 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 ++ F S+GFRP +S H A+++V + +WVIEGDL+ +FD++ H LMK Sbjct: 358 YDPVFLDCSHGFRPNKSCHTALKSVSQEFQPV-----QWVIEGDLAKFFDSISHSKLMKL 412 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---L 234 V +I+D RF L+WK + AG+ + ++++ G PQG ++SP+L+NI L++ D + L Sbjct: 413 VESKITDRRFTNLIWKALTAGYFEFKIYKSNIVGTPQGSIVSPILANIFLHQLDLFVNCL 472 Query: 235 HERYLSG----KARKDRWYWNNSI----------------QRGRSTAVRENWQWKPAVAY 274 + G +++ R+Y +++ +R ++ ++ + + Y Sbjct: 473 KRDFDKGTRAPRSKSSRYYEYHTLKARKAGDTLQLQKLIAERSQNPSIDFGSESFKRLVY 532 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLI 333 RYADD+++ ++GT+ Q + I + R S+ L L+ KTK+ ++ +FLG + Sbjct: 533 VRYADDWIIGIRGTREQAKYILTKVREFCT-SIDLELSEHKTKLTSLHSQPILFLGTSIS 591 Query: 334 R----KRSRYGEMR 343 R + SR G +R Sbjct: 592 RSSHVRYSRIGSVR 605 >UniRef50_A8VT23 S-layer domain protein n=12 Tax=Bacilli RepID=A8VT23_9BACI Length = 422 Score = 174 bits (440), Expect = 6e-42, Method: Compositional matrix adjust. 
Identities = 106/300 (35%), Positives = 157/300 (52%), Gaps = 40/300 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +L+ + P+ L A +S+KG PGVDG+ L+A + + L ++ G YQP Sbjct: 2 QLIDRVVCPDNLNLAMNRVISNKG--NPGVDGMTVDQLEAHVRQYAKPLIAKIQKGTYQP 59 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 LP +RV IPK NGK R LGIPA+RDR+VQ+A+ +EPI + F SYGFRP ++ A Sbjct: 60 LPVKRVEIPKENGKKRKLGIPAVRDRMVQQAIFQVIEPIIDPHFSPNSYGFRPGKNAKQA 119 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I+ Q + + V++ DL SYFDT+ H+ LM + + I D + L+WK +K+G Sbjct: 120 IK----QAAKYYDEGFKMVVDIDLKSYFDTIPHQKLMNYLEQYIQDPIILKLIWKFLKSG 175 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + + ++ G PQGG +SP+LSN+ L+E D+ L R Sbjct: 176 IMIGDNWESSRNGAPQGGNLSPILSNVYLHELDKELERR--------------------- 214 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF + VK +A E + LEG+LKL +N +K+ I Sbjct: 215 ------------GHRFVRYADDFCIYVKSRRA-AERVLLNTTTFLEGTLKLSVNQEKSAI 261 >UniRef50_B8FP59 RNA-directed DNA polymerase n=9 Tax=Firmicutes RepID=B8FP59_DESHD Length = 320 Score = 173 bits (438), Expect = 9e-42, Method: Compositional matrix adjust. 
Identities = 113/321 (35%), Positives = 162/321 (50%), Gaps = 58/321 (18%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML-QARLAV 62 K T + D S + QRL R + PE+ A + ++KG+ TPG+DG + + + R+ Sbjct: 9 KSLTEKSKDESYKFQRLYRNLYNPEFYWLAYQNIYANKGSMTPGMDGTTISGINEERIRQ 68 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 + L+D+ YQP PARRVYI K N K RPLGI D++VQ + M +E I+E Sbjct: 69 IIASLKDQ----SYQPHPARRVYIEKKNSQKKRPLGISTANDKLVQEVVRMILESIFEPT 124 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP RS A+ ++ T W +EGD+ + FD+ H +L++ ++RR Sbjct: 125 FSDKSHGFRPVRSCQTALLQIQGNFTGVN-----WFVEGDIEACFDSFDHHVLIELLQRR 179 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFD---------- 231 I DA F++L+WK +KAG+++ + +GVPQG ISP+LSNI L+E D Sbjct: 180 IDDASFISLMWKFLKAGYMEQWEYNMTYDGVPQGSGISPILSNIYLSELDTFMSEYKAAF 239 Query: 232 ------------QYLHERYLSGKARKD-RWYWNN----------SIQRGRSTAVRENWQW 268 Y H RY++ K RK+ R W N QR S RE Sbjct: 240 DIRKTHGRKPSSNYCHARYMASKYRKESRAIWGNLTPSEKKSRMKEQRALSVKQRET--- 296 Query: 269 KPA----------VAYCRYAD 279 PA + Y RYAD Sbjct: 297 -PAHDVFDDTFKIIQYARYAD 316 >UniRef50_A9IAV6 Mobile mitochondrial group II intron of COX1 which is involved in pre-mRNA splicing and in deletion of introns from mitochondrial DNA n=1 Tax=Bordetella petrii DSM 12804 RepID=A9IAV6_BORPD Length = 606 Score = 173 bits (438), Expect = 1e-41, Method: Compositional matrix adjust. 
Identities = 126/395 (31%), Positives = 194/395 (49%), Gaps = 62/395 (15%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDELLSG 74 RI L RL+ P +A ++ GA T GVDG + M RLA + ++ SG Sbjct: 22 RINGLSRLMENPILWKQAYVNIYANSGATTAGVDGSSLDGMSYERLAGLMAAVK----SG 77 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +Y+ P RRV IPKSNGK RPLGIP D++VQ + M + I+E F S+GFR RS Sbjct: 78 NYRFKPVRRVLIPKSNGKTRPLGIPTGDDKLVQEVVRMLLVKIYEPVFSDDSHGFRNGRS 137 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ V+ + T +W++ D+ YFD + H +L+ + +RI D RF+ L+ Sbjct: 138 CHTALMQVRQKWTGM-----KWIVNMDIKGYFDNIDHEVLVDVLAKRIDDKRFLGLIHSM 192 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LHERYLSGKAR-KDRWY- 249 +KAG+++ F G PQGGV+SP+L+NI L+E D+Y L + G R +R Y Sbjct: 193 LKAGYMEDWKFHDTFSGTPQGGVVSPVLANIYLHELDEYVAGLKAEFNRGNRRASNREYK 252 Query: 250 -WNNSIQR-----------GRSTAVREN-------WQWKPAVA-------------YCRY 277 + +I+R G S V E + + A++ Y RY Sbjct: 253 RISGAIERLMKRIDAYKADGDSPKVEEAKRELAELYLRRKALSSSDPMDANYRRLVYVRY 312 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH------- 330 ADDF++ + G++ + + + G + L L + +K+ + H +DG FLG+ Sbjct: 313 ADDFLIGIIGSRDEAVTVMQRVAGFISDKLHLEIAEEKSGVVHASDGVRFLGYDVRTYSG 372 Query: 331 ----RLIRK----RSRYGEMRVVSTIPQEKARNFA 357 R +R +R R+ +P EK R+F Sbjct: 373 DRNVRTVRSGRSITARSVSERMQLHVPAEKLRSFC 407 >UniRef50_Q3A299 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=5 Tax=Proteobacteria RepID=Q3A299_PELCD Length = 446 Score = 173 bits (438), Expect = 1e-41, Method: Compositional matrix adjust. 
Identities = 124/350 (35%), Positives = 179/350 (51%), Gaps = 48/350 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +QRKL A +P+ R L + + + L+ A + ++KG + G+DGV ++ R Sbjct: 11 LQRKLYRKAKQEPACRFHALYDKVYRADILSHAYALVRANKG--SAGIDGVTFAAIEERE 68 Query: 61 AVELQI--LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 V I L + L S Y+P P +RV IPK++G RPLGIP +RDR+ Q A+ + +EPI+ Sbjct: 69 GVSALIAELEEALRSKTYKPDPVKRVMIPKADGSQRPLGIPTIRDRVAQMAVKLVVEPIF 128 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP++S H A+ V + + G T VI+ D+S YFDT+ H LM V Sbjct: 129 EADFCDTSYGFRPKKSAHDAVDDVAYAM-NIGYTE---VIDADISKYFDTIPHTNLMAVV 184 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGL---------FRAASEGVPQGGVISPLLSNIMLNE 229 RI D + L+ +K+ ++VG + G PQGGVISPLL+N+ L+ Sbjct: 185 AERICDGAILHLIQMWLKSSVMEVGKDGKKRNVGGGKGNRRGTPQGGVISPLLANLYLHI 244 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 D+ W R N Q + RYADD VL+ + K Sbjct: 245 LDR----------------IWE-----------RRNLQQRLNARIVRYADDTVLLCRRNK 277 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLGHRLIRKRSR 338 + EA+ R +LE L L LN KTK+ + GF FLG + +SR Sbjct: 278 SD-EAM-AVLRQILE-RLGLTLNEAKTKVVNGYKGGFDFLGFSIRMGKSR 324 >UniRef50_B7I148 Reverse transcriptase n=9 Tax=Bacillus RepID=B7I148_BACC7 Length = 632 Score = 172 bits (437), Expect = 1e-41, Method: Compositional matrix adjust. 
Identities = 108/299 (36%), Positives = 176/299 (58%), Gaps = 24/299 (8%) Query: 39 SSKGAHTPGVDGVNKTMLQA-RLAVELQILRDELLSGHYQPLPA---RRVYIPKSNGKLR 94 S+KG+ TPGVDG KT+ RL+ E I EL+ G A +RV+IPK+NG R Sbjct: 59 SNKGSMTPGVDG--KTIQDYLRLSEEKLI---ELIRGRLTNFKAHLIKRVFIPKANGGQR 113 Query: 95 PLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG 154 PLGIP + DRI+Q+ M +EP+ E+ F S+GFRPER+ +HA+ VK+ + + G Sbjct: 114 PLGIPTIEDRIIQQMMKQVLEPVLEAQFFKYSFGFRPERTTYHALERVKVLVHNTGY--- 170 Query: 155 RWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVP 213 W++EGD+ +FD V+HR+L+K + I D R + L+ + +KAG I + R + G P Sbjct: 171 HWIVEGDIRQFFDKVNHRILIKKLWSMGIKDRRILCLITEFLKAG-IFKNIIRNDN-GTP 228 Query: 214 QGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVA 273 QGG++SPLL+N+ L+ FD+++ +++ R + ++ ++ +S+ ++ + Sbjct: 229 QGGILSPLLANVYLHSFDKWVAKQFEEFTTRHEYSKHDHKLRGLKSSNLKPGY------- 281 Query: 274 YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHR 331 RYADD+VL+ K+ + + L+ LKL L+ +KT+I ++ I FLG + Sbjct: 282 LIRYADDWVLVT-NNKSHAYRWKTVIKNFLQKELKLELSEEKTRITNIRHKPIEFLGFK 339 >UniRef50_B9M2H7 RNA-directed DNA polymerase (Reverse transcriptase) n=7 Tax=Bacteria RepID=B9M2H7_GEOSF Length = 446 Score = 172 bits (437), Expect = 1e-41, Method: Compositional matrix adjust. 
Identities = 116/338 (34%), Positives = 165/338 (48%), Gaps = 44/338 (13%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 DP LL I PE + A + ++KGA PGVDGVN +R Sbjct: 18 VVDPQTPENHLLERILSPENMELAWKRVRANKGA--PGVDGVNIDDFPDITRPLWGDIRA 75 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 L +G Y P P RV IPK G RPLGIP + DR++Q+++ + PI++ F S+GF Sbjct: 76 SLATGSYLPKPVLRVEIPKPTGGNRPLGIPTVLDRLIQQSIAQVLTPIFDPGFSESSFGF 135 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP RS H A+R QL + R ++ DL+ +FDTV+H LLM V R++ D R + Sbjct: 136 RPGRSAHDAVR----QLREYLRQGYRIAVDIDLAKFFDTVNHDLLMTFVGRKVRDKRVLA 191 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 L+ + ++AG G GVPQGG +SPLL+NI+L+ D+ L +R Sbjct: 192 LIGRYLRAGVEVDGRLEKTRMGVPQGGPLSPLLANILLDHLDKELEKR------------ 239 Query: 250 WNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKL 309 + RYADDFV++VK +A E + R L LKL Sbjct: 240 ---------------------GHKFVRYADDFVILVKSERAG-ERVMGSVRKHLTTKLKL 277 Query: 310 RLNMDKTKIPHVND----GFIFLGHRLIRKRSRYGEMR 343 +N DK+K+ + GF+F G +++ Y E R Sbjct: 278 TVNEDKSKVAKSDQISFLGFVFKGTKILWSDKAYKEFR 315 >UniRef50_C3FJT8 RNA-directed DNA polymerase (Reverse transcriptase) n=11 Tax=Bacteria RepID=C3FJT8_BACTB Length = 443 Score = 172 bits (436), Expect = 2e-41, Method: Compositional matrix adjust. 
Identities = 125/351 (35%), Positives = 187/351 (53%), Gaps = 43/351 (12%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE-LQIL 67 A ++P+ R L IT+ L EA + + GA PG+DG + ++ + L + Sbjct: 27 AKSEPTHRFWGLFTHITKMTTLHEAYQQARKNNGA--PGIDGKSFADIELEGVIPFLTGI 84 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 ++EL +G Y+P R+V IPK+NGK+R L IP +RDR+VQ A+ + +E I+E+DF SY Sbjct: 85 QEELQAGIYRPQSNRKVEIPKANGKMRTLQIPCIRDRVVQGALKLILEAIFEADFCPNSY 144 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFRP+RS H A+ V+ + R +I+ DLS YFDT+ H +L++ + +R+ D + Sbjct: 145 GFRPKRSPHQALAEVRRSIL----RRMTIIIDVDLSRYFDTIRHNILLEKIAKRVQDPQV 200 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDR 247 M L+ + IKA A GVPQGG SPL +NI LNE D Sbjct: 201 MHLVKQVIKA---------AGKIGVPQGGPFSPLAANIYLNEVD---------------- 235 Query: 248 WYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE--G 305 W + ++I+R + E AV Y R+ADD V+ V G ++ R + E Sbjct: 236 WTF-DAIRRKTAEGNYE------AVNYHRFADDIVIAVSGHSSKSGWAELALRRLWEQLK 288 Query: 306 SLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 L + LN++KT++ +V G F FLG L R +R V IP++KAR Sbjct: 289 PLGVELNLEKTQMINVLKGESFGFLGFDLRRIPNRNKNGFFVFMIPKKKAR 339 >UniRef50_Q1Q0X4 Similar to Group II intron encoded reverse transcriptase n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q0X4_9BACT Length = 298 Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust. 
Identities = 97/284 (34%), Positives = 154/284 (54%), Gaps = 38/284 (13%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G+D V+ ++ L V + + EL + Y P P RVYIPK RPLGIP ++DRIV Sbjct: 32 GLDRVSIKQFESNLDVNIMSIHQELKTAIYNPAPVLRVYIPKGRHDKRPLGIPIVKDRIV 91 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q+A +EPI+E F S+GFRP+R H AI+ ++ Q G T V++ D+ +++ Sbjct: 92 QQAFRQIIEPIFEKGFSDNSFGFRPDRCCHDAIKRLE-QYKQEGYTS---VLDADIMAFY 147 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIM 226 DT+ H+L+M ++R +I+D + + +KAG ++ G+ ++G PQGGVISPLL+N++ Sbjct: 148 DTIPHKLIMDSLREKIADGWVLNSIENMLKAGVMEDGIVHETNKGTPQGGVISPLLANLI 207 Query: 227 LNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK 286 + D+ L K + RYADDFV++ K Sbjct: 208 GDIIDKELE---------------------------------KAGYKFVRYADDFVVMTK 234 Query: 287 GTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH 330 TK ++ A + ++ G L ++L+ DKTK+ + GF FLG+ Sbjct: 235 -TKDELPAALSYVKEIIAGKLGMKLSEDKTKLTNFERGFRFLGY 277 >UniRef50_A7BUU9 RNA-directed DNA polymerase n=1 Tax=Beggiatoa sp. PS RepID=A7BUU9_9GAMM Length = 585 Score = 172 bits (435), Expect = 3e-41, Method: Compositional matrix adjust. 
Identities = 118/324 (36%), Positives = 167/324 (51%), Gaps = 35/324 (10%) Query: 30 LAEAARITLSSKGAHTPGVDG--VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIP 87 L ++T ++G T GVDG + + RLA ++I G PL RRV+IP Sbjct: 65 LLSVRKVTQDNRGKKTAGVDGKVITSEKDRWRLASNVRI------DGKSNPL--RRVWIP 116 Query: 88 KSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 KSN K LRPLGIP + DR+ Q + + +EPI+E YGFRP RSVH AI + + Sbjct: 117 KSNSKELRPLGIPTIEDRVKQMMLKLEIEPIYEVQAEPNVYGFRPARSVHDAIEACFIAI 176 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR-ISDARFMTLLWKTIKAGHIDVGLF 205 C + G WV+EGD S +FD ++ L+ ++ + I+D + + IK+G ID +F Sbjct: 177 -GC-KKEGAWVLEGDFSKFFDNINKEHLLNMMKSKGITDKETLQQVQAWIKSGVIDKEVF 234 Query: 206 RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVREN 265 +G PQGGVISPLL+NI L+ + LH+ + K K R N Sbjct: 235 TKTDKGTPQGGVISPLLANIALHGMENMLHDWVDTWKGTK-----------------RSN 277 Query: 266 WQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGF 325 Q + + RYADDFV+I K KA +E + L+ + ++LN KTKI H +GF Sbjct: 278 HQ---SFSVIRYADDFVVIHK-DKAVIEEAKLCIEEWLDKGVGVKLNQTKTKITHTTEGF 333 Query: 326 IFLGHRLIRKRSRYGEMRVVSTIP 349 FLG + + R G T P Sbjct: 334 DFLGFNVRQYRVNNGSQLKFLTKP 357 >UniRef50_B4D9Y9 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Chthoniobacter flavus Ellin428 RepID=B4D9Y9_9BACT Length = 495 Score = 171 bits (434), Expect = 3e-41, Method: Compositional matrix adjust. 
Identities = 131/378 (34%), Positives = 190/378 (50%), Gaps = 51/378 (13%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE---LQ 65 A + P R L + + + LA A R ++ GA GVDGV L A VE L Sbjct: 85 AQSKPEYRFWSLYGEVQRADVLAAAWRRVKANAGA--AGVDGVTIEKLAADAQVEAAWLN 142 Query: 66 ILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 LR+EL Y+P P RRV IPK++G R LGIP L+DR+VQ A+ + + PI+E+DFH Sbjct: 143 GLREELHGKTYRPAPVRRVKIPKASGGYRGLGIPTLKDRVVQMAVYLVLMPIFEADFHPR 202 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 SYGFRP R+ H A+ ++ L G+T V++ DL+ YFDT+ HRLLM+ V RR+SD Sbjct: 203 SYGFRPGRNAHQAVEEIREALR-MGKTE---VVDADLAQYFDTIPHRLLMRQVARRVSDG 258 Query: 186 RFMTLLWKTIKAGHIDVG-----LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240 + L+ ++A ++ +A G PQGGVISPLL+NI L+ D Sbjct: 259 MILKLIKAWLRAPILEEEEGGGRRMKANPCGTPQGGVISPLLANIYLHPLDD-------- 310 Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 +V ++ Q KP + RYADD V++ + + ++E Sbjct: 311 --------------------SVNDHCQQKPRM--IRYADDLVILCR--PGEGRGMKERLA 346 Query: 301 GVLEGSLKLRLNMDKTKIPH-VNDGFIFLGHRLIRKRSRYGEMRV---VSTIPQEKARNF 356 L+ S L LN KT++ GF FLG ++S+ G V S ++ RN Sbjct: 347 RWLQ-SRGLTLNETKTRVVQSCESGFEFLGFTFRWQQSKKGTPYVHTEPSPAAKQSLRNR 405 Query: 357 AASLTALLWKVRISGEIL 374 LT R++G+ + Sbjct: 406 VRELTRRSTTWRVTGQTV 423 >UniRef50_Q24QQ9 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense Y51 RepID=Q24QQ9_DESHY Length = 591 Score = 171 bits (433), Expect = 4e-41, Method: Compositional matrix adjust. 
Identities = 120/356 (33%), Positives = 179/356 (50%), Gaps = 40/356 (11%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILR 68 T+P + L RL + A S+ GA T G DG + + + + +R Sbjct: 18 TTNPGYVNEDLYRLFYSRDLYIIAYNSVKSNDGAETSGADGTSLHGFCEEWITQLITSMR 77 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 DE YQP P R IPK +GKLR L P +D++VQ A+ + +E I+E F LS+G Sbjct: 78 DE----SYQPQPNRTTMIPKKSGKLRKLSFPNGKDKLVQEAIRIILECIYEPTFSNLSHG 133 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 FRP+RS AI V+ RG W IEGD+S+ FD + HR L +R RI D RF Sbjct: 134 FRPKRSTQSAIAEVQTW-------RGTIWFIEGDISACFDDIDHRTLETILRERIRDERF 186 Query: 188 MTLLWKTIKAGHIDVG-LFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKA 243 + L+ K +KAG+ D+ L++ G QG SPLL NI L++ D+++ E+ G Sbjct: 187 IRLVNKVLKAGYFDMQHLYQKTKTGNAQGSCCSPLLCNIYLDKLDKFMENVMEQDTMGGY 246 Query: 244 R-------KDRWYWNNSIQRGRSTAVRENWQ-----------WKPA---VAYCRYADDFV 282 R K R+ + +++ G ++ + + P V Y RYADDF+ Sbjct: 247 RRQNPDYAKARYLYKKALKSGSDPQTVQHLKRTMEHLPTTDRYDPNFRRVNYVRYADDFL 306 Query: 283 LIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRS 337 + V +K ++ + L+ L LRL+ +KTKI H D + FLG+ ++RK S Sbjct: 307 IGVIASKKYALDLKLNLKEFLQNELSLRLSDEKTKITHAADKHVSFLGY-ILRKGS 361 >UniRef50_A9BGC0 RNA-directed DNA polymerase (Reverse transcriptase) n=49 Tax=Bacteria RepID=A9BGC0_PETMO Length = 472 Score = 171 bits (433), Expect = 4e-41, Method: Compositional matrix adjust. 
Identities = 115/325 (35%), Positives = 162/325 (49%), Gaps = 42/325 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 +L I + + +A + ++KGA PG+DG+ L L + LR ELL G Y P Sbjct: 53 MLEKILSKDNMNKAYKKVKANKGA--PGIDGMKVEELFGYLRQHGEELRQELLEGRYTPK 110 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 RR IPK +G R LGIP DR++Q+++ + PI+E F SYGFRP R AI Sbjct: 111 SVRRKEIPKPDGGKRLLGIPTSIDRVIQQSIAQVLTPIYEKKFVDNSYGFRPLRDAKQAI 170 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 R K L G T WV++ DL YFDTV+H LM+ + + + D R ++L+ K +K+G Sbjct: 171 RKSKEYLNK-GHT---WVVDIDLERYFDTVNHDKLMRIISKDVKDGRVISLIRKYLKSGV 226 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 + G+ EG PQGG +SPLLSNIML+E D L +R Sbjct: 227 MVNGVVIETEEGTPQGGPLSPLLSNIMLHELDVELTKR---------------------- 264 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 +CRYADD + VK K+ + E +E LKL++N K+K+ Sbjct: 265 -----------GHKFCRYADDCNIYVKSEKSAYRVM-ESITKYIEKKLKLKVNSKKSKVV 312 Query: 320 HVNDGFIFLGHRLIRKRSRYGEMRV 344 D +LG K +Y E+RV Sbjct: 313 RPWD-LKYLGFSFYVKEEKY-EIRV 335 >UniRef50_C3B585 Reverse transcriptase/endonuclease protein n=1 Tax=Bacillus mycoides Rock3-17 RepID=C3B585_BACMY Length = 614 Score = 171 bits (433), Expect = 5e-41, Method: Compositional matrix adjust. 
Identities = 110/317 (34%), Positives = 173/317 (54%), Gaps = 31/317 (9%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 +QRLL + ++ L ++T +KG T GVDG R+ + ++ L+ H Sbjct: 53 LQRLL-MRSEATLLISIRQVTQLNKGKRTAGVDGFKAIKPTERIKL-FHKMKAMNLATH- 109 Query: 77 QPLPARRVYIPKSNG--KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +P P +R+ IPK KLRPLGIP + DR+ Q + +A+EP WE F SYGFRP+R Sbjct: 110 KPSPVKRIEIPKDTAGKKLRPLGIPIIIDRVYQNVVKLALEPQWEVHFEPTSYGFRPKRG 169 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 AI ++ L+L ++ RWV EGD FD ++H +++ +I D + ++ K Sbjct: 170 CQDAITSIFLKLKTT--SKKRWVFEGDFKGCFDNLNHDYILE----QIKDLPYKEIVKKW 223 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL--SGKARKDRWYWNN 252 ++AG + G+F + G PQGG+ISPLL+NI L+ ++ + +Y+ + +K YW Sbjct: 224 LRAGFVHNGVFNLTNNGTPQGGIISPLLANIALHGMEEEIGVKYINRTHPRKKGERYW-- 281 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 ++Q +S RYADDFV I+ TK + E++ E+ + L L L Sbjct: 282 TVQDTKSVV--------------RYADDFV-IMTDTKEEAESMYEKLKPYLVKR-GLELA 325 Query: 313 MDKTKIPHVNDGFIFLG 329 +KTK+ HV++GF FLG Sbjct: 326 PEKTKVVHVSEGFDFLG 342 >UniRef50_B2A0J9 RNA-directed DNA polymerase (Reverse transcriptase) n=43 Tax=Bacteria RepID=B2A0J9_NATTJ Length = 475 Score = 170 bits (431), Expect = 6e-41, Method: Compositional matrix adjust. 
Identities = 105/325 (32%), Positives = 165/325 (50%), Gaps = 44/325 (13%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 ++ +L LL I + + +A + S+KG+H G+DG+ L L LR Sbjct: 49 SNANLSKGNLLEEILDRDNMNKAFKKIKSNKGSH--GIDGMGVDELLQYLKENGDHLRQR 106 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 +L G Y+P P RRV IPK +GK R LGIP + DR++Q+A+ + PI+E F SYGFR Sbjct: 107 VLDGKYRPNPVRRVEIPKEDGKKRKLGIPTVVDRVIQQAIAQVLSPIYEEQFSDNSYGFR 166 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P RS H AI+ + + + ++V++ DL YFDTV+ L++ + + I D R ++L Sbjct: 167 PGRSTHDAIKKSQQNINEG----YKYVVDMDLEKYFDTVNQSKLIEVLSKTIKDGRVISL 222 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 + K ++AG + ++ GVPQGG +SP+LSNIML+E D+ L +R Sbjct: 223 INKYLRAGVMIKHTYKDTEVGVPQGGPLSPILSNIMLHELDKELEKR------------- 269 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR 310 + RYADD ++ K ++ ++ +E L L+ Sbjct: 270 --------------------GHEFVRYADDLLIFCKSRRSAGRTLK-NILPFIENKLFLK 308 Query: 311 LNMDKTKIPHVND----GFIFLGHR 331 +N DKT + +V GF F H+ Sbjct: 309 VNKDKTVVAYVGKVRFLGFGFYRHK 333 >UniRef50_C9P0Q5 Retron-type reverse transcriptase n=3 Tax=Vibrio RepID=C9P0Q5_VIBME Length = 436 Score = 169 bits (427), Expect = 2e-40, Method: Compositional matrix adjust. 
Identities = 112/316 (35%), Positives = 160/316 (50%), Gaps = 43/316 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVD--GVNKTMLQARLAVELQILRDELLSGHY 76 RLL + P L A + +KG GVD + T+ + R Q LR LL G Y Sbjct: 5 RLLEQMFSPGNLNAATKQVKRNKGCG--GVDRLTITATLEKLRQLDNGQQLRQSLLDGSY 62 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 QP P V IPK G +R LGIP ++DRIVQ+AM + ++E F S+GFRP RS H Sbjct: 63 QPSPVLGVEIPKPKGGVRQLGIPTVQDRIVQQAMAQLLTQLYEPKFSKSSFGFRPRRSAH 122 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 HA+ + E RG +V++ DL YFDTV+H LM + + I+D R + L+ K ++ Sbjct: 123 HALSKASEYIR---EGRG-YVVDIDLEKYFDTVNHDRLMYRLSQDIADKRVLKLIRKYLQ 178 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 +G + G+ G PQGG +SPLLSNI+L+E D+ L R Sbjct: 179 SGLMRNGVIERRQRGTPQGGPLSPLLSNIVLDELDKELERR------------------- 219 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 +CRYADD + V G++A + ++ LE +LKLR+N +K+ Sbjct: 220 --------------GHKFCRYADDCQIYV-GSEAAAQRVKTSVTEFLEQTLKLRVNREKS 264 Query: 317 KIPHVNDGFIFLGHRL 332 V++ +LGHR Sbjct: 265 AATRVSERS-YLGHRF 279 >UniRef50_A9IAY4 Reverse transcriptase n=38 Tax=Bacteria RepID=A9IAY4_BORPD Length = 572 Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust. 
Identities = 109/305 (35%), Positives = 166/305 (54%), Gaps = 39/305 (12%) Query: 31 AEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSN 90 A A R ++G TPGVDG+ + +A+ L I R Y+P P +RVYIPK+N Sbjct: 65 ALAVRRVTENQGKKTPGVDGITWSTPEAKSQAMLSIKRR-----GYRPQPLKRVYIPKAN 119 Query: 91 GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG 150 GK+RPLGIP ++DR +Q L+A+EP+ E+ S+GFRPERS AI QL Sbjct: 120 GKMRPLGIPTMKDRAMQALYLLALEPVAETTADRSSFGFRPERSTADAIGLCFTQL--AL 177 Query: 151 ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASE 210 + +W++EGD+ FD + H LM + +D ++ K +KAG+++ Sbjct: 178 KRSPKWILEGDIKGCFDNISHDWLMGHI---PTDREILS---KWLKAGYMEDRQLFPTEA 231 Query: 211 GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKP 270 G PQGG+ISP L+N++L+ + L + G+AR Y N G+ T + Sbjct: 232 GTPQGGIISPTLANLVLDGLEAKLEAVF--GRAR----YIN-----GKQTRL-------- 272 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHVNDGFIF 327 AV Y RYADDF++ + E + +E ++E ++ L L+ +KTKI H+++GF F Sbjct: 273 AVNYVRYADDFIVTARSK----ELLEQEVMPLVEEFMRERGLTLSPEKTKITHIDEGFDF 328 Query: 328 LGHRL 332 LG + Sbjct: 329 LGQNI 333 >UniRef50_D2CJC8 Putative uncharacterized protein orf2 (Fragment) n=1 Tax=Candida sojae RepID=D2CJC8_9ASCO Length = 773 Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust. 
Identities = 111/349 (31%), Positives = 170/349 (48%), Gaps = 32/349 (9%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 KL P + + +I+ PE+L A + S G TPG NK L + Sbjct: 182 KLLKEGLNKPDEVYKNIRPIISDPEFLMYAYSLIKSKPGNMTPGT---NKETLDGITSET 238 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 +++ E+ SG Y+ P RR+ IPK+ G +RPL I + RD+IVQ AM + +E I+E Sbjct: 239 FKVMGREIGSGAYKFRPNRRIEIPKAKGGIRPLSIASPRDKIVQMAMKIILEAIFEPHMS 298 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S+GFRP RS H A+ ++ + W IE D++ FDT+ L++K V +RI Sbjct: 299 DFSHGFRPNRSTHTALYQLRGIFHEVS-----WFIEADITKCFDTLPQDLIIKEVEKRIK 353 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERYLS 240 D F+ L+ K AG+I+ F+ S G PQG +ISP+L NI+L D++L ER+ Sbjct: 354 DQVFLDLIHKCFNAGYIENN-FKIPSAGTPQGSIISPILCNILLTVMDEWLMEYSERFSV 412 Query: 241 GKARKDRWYWNN-------------------SIQRGRSTAVRENWQWKPAVAYCRYADDF 281 G R+ + I R + ++ N + + RYADDF Sbjct: 413 GTRRRANPVYTKLVRGINKASLLNQKINIRAQIHRDKKRSLLGNDPNFKRMRFVRYADDF 472 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLG 329 ++ V G+ I+++ L+ L++ L+ DKT I D FLG Sbjct: 473 IIGVIGSYQDSCKIKQDLTNFLKDRLRVELSQDKTLITSATKDKAHFLG 521 >UniRef50_C4ZCX5 RNA-directed DNA polymerase n=24 Tax=Bacteria RepID=C4ZCX5_EUBR3 Length = 464 Score = 167 bits (423), Expect = 6e-40, Method: Compositional matrix adjust. 
Identities = 115/318 (36%), Positives = 160/318 (50%), Gaps = 42/318 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 RLL I + A + ++KGA PG+DG+ L Q + D + G Y P Sbjct: 41 RLLETILYKDNFNRAYKRVKANKGA--PGIDGMTIEEALPYLKEHQQEITDRIYRGKYTP 98 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRV IPK +G +R LGIP + DR +Q+A+ + PI+E F SYG+RP RS A Sbjct: 99 SPVRRVEIPKPDGGVRKLGIPTVIDRTLQQAITQQLVPIYEPLFADGSYGYRPNRSAKDA 158 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I VK + + G T + + DLS YFDT++H +L+ +R+ + D R + L+ + +K+G Sbjct: 159 ILKVK-EYAEQGYT---FAVVLDLSKYFDTLNHEILINLLRKNVKDERVVQLIKRYLKSG 214 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 ++ G+ EG PQGG +SPLL+NI LNEFDQ YL Sbjct: 215 VMENGVVIDTEEGSPQGGNLSPLLANIYLNEFDQ----EYL------------------- 251 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 K V RYADD VL+ K +A E + E LE LKL +N +K++ Sbjct: 252 ----------KRGVPCIRYADDIVLLAKSKRAS-ERLLESSTKYLEERLKLTVNREKSRT 300 Query: 319 PHVND--GFIFLGHRLIR 334 V F FLG L R Sbjct: 301 VSVFAIRNFKFLGFALGR 318 >UniRef50_Q08WW1 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=2 Tax=Bacteria RepID=Q08WW1_STIAU Length = 421 Score = 166 bits (421), Expect = 9e-40, Method: Compositional matrix adjust. 
Identities = 103/277 (37%), Positives = 145/277 (52%), Gaps = 39/277 (14%) Query: 28 EWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIP 87 EWL A T A G+D +A L V L+ L + + SG Y+ P RR YIP Sbjct: 13 EWLRYAYEQTRKDGAA---GIDRQTAKDYEANLEVNLKSLLERIKSGRYKAPPVRRTYIP 69 Query: 88 KSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT 147 K++G RPLGIP D++ QRA+++ +EPI+E DF S+GFRP RS H A+R ++ + Sbjct: 70 KADGSQRPLGIPTFEDKVAQRAIVLLLEPIYEQDFRPFSFGFRPGRSAHQALRELRSSIL 129 Query: 148 DCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRA 207 E GRWV++ DL YFDT+ H L + + RR++D ++ K +KAG ++ G Sbjct: 130 ---ERNGRWVLDVDLRRYFDTIEHGKLREVLARRVADGVVRRMIDKWLKAGVLEEGPLLR 186 Query: 208 ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQ 267 +G PQGGVISPLL+N+ YLH Y+ D WY + R Sbjct: 187 LEQGTPQGGVISPLLANV-------YLH--YVL-----DEWYEREVVPR----------- 221 Query: 268 WKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 K + RYADD V++ + +CR VLE Sbjct: 222 MKGKCSLIRYADDLVMVFEDFL--------DCRRVLE 250 >UniRef50_A4KVN1 Probable reverse transcriptase n=2 Tax=Sinorhizobium meliloti RepID=A4KVN1_RHIME Length = 490 Score = 166 bits (421), Expect = 1e-39, Method: Compositional matrix adjust. 
Identities = 119/357 (33%), Positives = 182/357 (50%), Gaps = 43/357 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +LL + LA A + +KGA PG DG M +A+ + LR ELL+G Y+P Sbjct: 56 QLLEEVASEANLATALLNVVRNKGA--PGRDGQTVDMAEAKATSIIGRLRRELLNGKYRP 113 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 RRV++PK+ G R LGIP + DR+VQ+A+L +EPI+E FH S+GFRP+R H A Sbjct: 114 GDVRRVWLPKAGGGRRGLGIPNIVDRVVQQAVLQVLEPIFEPVFHDSSHGFRPKRGAHTA 173 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I L + +T +++ DL+S+FD VHH+ L+ + +R+ D R +TL+ +KA Sbjct: 174 IAEASKYLKEGYQT----IVDLDLASFFDRVHHQRLLARIAQRVKDQRIITLINLMLKAA 229 Query: 199 HI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 + G A EG PQGG +SPLLSNI+L+E D+ L R L Sbjct: 230 VVMPDGTRVAPQEGTPQGGPLSPLLSNIVLDELDRELARRRLR----------------- 272 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD + V+ +A + + R LE ++L++N +K+ Sbjct: 273 ----------------FVRYADDSNIFVRSERAG-QRVMSSIRDFLERRMRLQVNEEKSG 315 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRV-VSTIPQEKARNFAASLTALLWKVRISGEI 373 + N+ FLG R + G++ V +S +++ R +T W I+ I Sbjct: 316 MRTPNE-VHFLGFRFRCPKGEGGDVVVLLSRKAEQRLRAKVREMTPPTWGRSIASCI 371 >UniRef50_P03876 Putative COX1/OXI3 intron 2 protein n=3 Tax=Saccharomycetaceae RepID=AI2M_YEAST Length = 854 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. 
Identities = 106/348 (30%), Positives = 173/348 (49%), Gaps = 56/348 (16%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPG-----VDGVNKTMLQARLAVELQILRDELLS 73 R+L+L++ L A S KG + G +DG+N + L L ++ + Sbjct: 284 RILKLMSDIRMLLIAYNKIKSKKGNMSKGSNNITLDGINISYLNK--------LSKDINT 335 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 ++ P RRV IPK++G RPL + R++IVQ +M M +E I+ + F S+GFRP Sbjct: 336 NMFKFSPVRRVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSFSYYSHGFRPNL 395 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S AI K + C W I+ DL+ FDT+ H +L+ + RI D FM LL+K Sbjct: 396 SCLTAIIQCKNYMQYCN-----WFIKVDLNKCFDTIPHNMLINVLNERIKDKGFMDLLYK 450 Query: 194 TIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 ++AG++D + + G+PQG V+SP+L NI L++ D+YL ++ ++ + N Sbjct: 451 LLRAGYVDKNNNYHNTTLGIPQGSVVSPILCNIFLDKLDKYLENKF------ENEFNTGN 504 Query: 253 SIQRGRSTA----------------------VRENWQ--------WKPAVAYCRYADDFV 282 RGR+ +R+++Q +K A + RYADD + Sbjct: 505 MSNRGRNPIYNSLSSKIYRCKLLSEKLKLIRLRDHYQRNMGSDKSFKRAY-FVRYADDII 563 Query: 283 LIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH 330 + V G+ + I + L+ +L + +NMDK+ I H +G FLG+ Sbjct: 564 IGVMGSHNDCKNILNDINNFLKENLGMSINMDKSVIKHSKEGVSFLGY 611 >UniRef50_B4D379 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D379_9BACT Length = 441 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. 
Identities = 110/295 (37%), Positives = 156/295 (52%), Gaps = 40/295 (13%) Query: 28 EWLAEA-ARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 EW+ EA R+ S PGVDG + L L L D SG YQ P RRV+I Sbjct: 38 EWMREAYGRVRKGS----APGVDGKSVADYGRELDKNLGGLIDRAKSGSYQAPPVRRVHI 93 Query: 87 PKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQ 145 PK+NGK RP+G+P + D+I+QRA++M +EP++E +F SYGFRP RS H A++ + Sbjct: 94 PKANGKETRPIGMPTVEDKILQRAVVMLLEPMYEREFGDFSYGFRPGRSAHQALKAI--- 150 Query: 146 LTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLF 205 T+ WV++ D+ ++FDT+ H +LM +++R+ D + L+ K +KAG ++ G Sbjct: 151 WQGINRTQAGWVVDVDIRAFFDTLDHGVLMGILQKRVKDGVILKLVAKWLKAGVMEAGAL 210 Query: 206 RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR--GRSTAVR 263 G PQGGVISPLLSNI YLHE D W+ I R GR V Sbjct: 211 SYPEAGTPQGGVISPLLSNI-------YLHEVL-------DEWFEAAVIPRLQGRGFMV- 255 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 RYADDFV+ + + E + + G G L+L+ KT++ Sbjct: 256 ------------RYADDFVMGFE-CREDAERVMKALPGRF-GRYGLKLHEGKTRL 296 >UniRef50_C7RV41 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RV41_9PROT Length = 453 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. 
Identities = 104/322 (32%), Positives = 163/322 (50%), Gaps = 41/322 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L+ + P L +A R S++GA PG+DG+ A +R L G YQP Sbjct: 38 LMEAVLSPANLKQAWRRVKSNRGA--PGIDGLRIEDFPAYACEHWPAIRQTLSEGRYQPQ 95 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 RRV IPK NG R LGIP + DR+VQ+A+ M PI++ +F SYGFRP RS H A+ Sbjct: 96 AVRRVIIPKPNGGERALGIPTVVDRVVQQAIAQIMTPIFDPEFSESSYGFRPRRSAHGAL 155 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + V+ L + R ++ DL+ +FD V H +LM V R++SD R + L+ + ++AG Sbjct: 156 KQVRADL----KAGYRIAVDLDLAKFFDNVDHDILMARVARKVSDKRLLALIGRYLRAGV 211 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 + + + G PQGG +SPLL+NI+L++ D R L G+ + Sbjct: 212 MIGSTLQPSELGTPQGGPLSPLLANILLDDLD-----RTLEGRGHR-------------- 252 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + RYADD +++VK +A + ++ L LKL +N K+++ Sbjct: 253 --------------FARYADDLMVLVKSERAG-QRVKASLTAYLGRQLKLPVNEKKSQVA 297 Query: 320 HVNDGFIFLGHRLIRKRSRYGE 341 + +FLG + + R+ + Sbjct: 298 KIEQ-CVFLGFTFRKNKLRWSD 318 >UniRef50_B1VA32 Retron-type reverse transcriptase n=7 Tax=Candidatus Phytoplasma RepID=B1VA32_PHYAS Length = 521 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. 
Identities = 105/316 (33%), Positives = 166/316 (52%), Gaps = 13/316 (4%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L R + E +A ++KG+ T GV NKT+ +L + ++ E ++ Y P Sbjct: 32 LKREMNNIENTYKAFNSIATNKGSGTEGVG--NKTIDGIKLEM-IKKYHKEYVNNQYNPQ 88 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P ++V IPK K RPLGIP ++DRI+Q+AM + +E+ F S+GFR ++S H A+ Sbjct: 89 PVKKVLIPKGKNKTRPLGIPTIKDRIIQKAMEQLLTLYFENIFLEWSFGFRSKKSCHDAV 148 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + VK + D+ YF+T++H +L K + + I A+ + + + +KAG Sbjct: 149 KRVKQRFKGIDYIIKI-----DIKGYFETINHDILNKMLNKYIRKAKTLKTINQWLKAGI 203 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA--RKDRWYWNNSIQRG 257 ++ G+ + G PQGG+ISPLLSNI L+ D+ + E +GK + + YW ++ Sbjct: 204 MENGIKYESLSGTPQGGIISPLLSNIYLHYIDKKMEELIRNGKPIMKANPEYWKAYTKKQ 263 Query: 258 R---STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 S N P + Y RYADDF++ +KG + E I+ LE LKL +N D Sbjct: 264 HHNLSIDSEINLNPNPRIEYIRYADDFIIGIKGEHHEAERIKTHVLKWLEQDLKLVVNRD 323 Query: 315 KTKIPHVNDGFIFLGH 330 K+KI G FL + Sbjct: 324 KSKIVKTTKGTRFLSY 339 >UniRef50_Q5U7I7 Maturase-related protein n=20 Tax=Gammaproteobacteria RepID=Q5U7I7_ECOLX Length = 451 Score = 164 bits (414), Expect = 6e-39, Method: Compositional matrix adjust. 
Identities = 110/339 (32%), Positives = 177/339 (52%), Gaps = 55/339 (16%) Query: 40 SKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIP 99 +KGA G+D ++ + ++ +LL+G YQPLP +RV IPK +G R LGIP Sbjct: 54 NKGA--AGIDNMSIEEFNDFAKLHWLGIKQQLLNGSYQPLPVKRVMIPKPDGGERMLGIP 111 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 A+ DR++Q+A+ + P +E F SYG+RP + A+ V+ C + + ++ Sbjct: 112 AVIDRVIQQAIAQVISPYFEPQFSPHSYGYRPHKRASQAVNHVQ----SCVKQGYKTAVD 167 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG--HIDVGLFRAASEGVPQGGV 217 DLS +FD V H +LM V R+I D M LL K ++AG + GL+ +++GVPQGG Sbjct: 168 IDLSKFFDEVDHDMLMNRVSRKIKDKALMRLLGKYLRAGIAERETGLWFESTKGVPQGGP 227 Query: 218 ISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRY 277 +SPLLSNI+L+E D+ L ++L + RY Sbjct: 228 LSPLLSNILLDELDKKLTYKHLK---------------------------------FARY 254 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS 337 ADD +++VK TK++ I+ E + LKL++N K+++ V+ G FLG Sbjct: 255 ADDIIILVK-TKSEGLIIQREITAFITKRLKLKVNESKSRVGPVS-GSKFLGFTF----- 307 Query: 338 RYGEMRVVSTIPQEKARNFAAS---LTALLWKVRISGEI 373 RYG+++ I ++ + F A+ LT W + ++ +I Sbjct: 308 RYGQVQ----IHEQALKKFKANVRELTNRNWGISMTLQI 342 >UniRef50_C9B0U1 RNA directed DNA polymerase n=2 Tax=Enterococcus casseliflavus RepID=C9B0U1_ENTCA Length = 620 Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust. 
Identities = 119/360 (33%), Positives = 188/360 (52%), Gaps = 26/360 (7%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L+ ++T P + A R + G+ +PGVD N L+A +E I + Y+P Sbjct: 38 LMEIVTSPNNILLAFRNVKGNSGSTSPGVDKKNIDDLKAIPNIEF-IKTVQTKFSEYKPQ 96 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH-- 137 P +RV IPK NGK RPLGIP + DRI+Q+ +L +EPI E+ FH +YGFRP RS HH Sbjct: 97 PVKRVDIPKPNGKTRPLGIPTIWDRIIQQCLLQVLEPIMEAKFHDKNYGFRPNRSAHHAF 156 Query: 138 --AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKT 194 A+R +L ++ +V++ D+ +FD V+H L+K + D + ++ Sbjct: 157 AQAVRMAQL-------SKLTFVVDIDIEGFFDNVNHSKLIKQFWTLGVRDKWLLGVIRAM 209 Query: 195 IKAG--HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD-RWYWN 251 +KA H D G +G PQGGV+SPLL+N++LNE D ++ ++ + RK+ Y Sbjct: 210 LKAPIIHRD-GRIEHPKKGTPQGGVLSPLLANVVLNELDWWISSQWETHPTRKNYDCYHQ 268 Query: 252 NSIQRGRSTAVRENWQWKPAVAY-CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR 310 QR +S R K Y RYADDF + + ++ + I + L+ LKL Sbjct: 269 TRKQRIKSNKYRALRASKLKEIYIVRYADDFKIFCR-KRSDADKIFLATKLWLKDRLKLD 327 Query: 311 LNMDKTKIPHV---NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 ++ +K+K+ ++ F+ +L+RKR + V+ T KA + A++ A KV Sbjct: 328 ISAEKSKVVNLKKQKSDFLGFTMKLVRKRKSF----VIETHMCAKAMSAASNKLAKQIKV 383 >UniRef50_B1HW67 Possible group II intron reverse transcriptase/maturase n=23 Tax=Firmicutes RepID=B1HW67_LYSSC Length = 601 Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust. 
Identities = 106/316 (33%), Positives = 178/316 (56%), Gaps = 28/316 (8%) Query: 33 AARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG--HYQPLPARRVYIPKSN 90 A R ++ G+ T G DG+ T+ Q ++ +++ DE+ + +Y+P RRV IPK N Sbjct: 47 AYRNIKANTGSKTAGTDGI--TIEQYKIE-DVETFVDEIRATLKNYKPQTVRRVEIPKPN 103 Query: 91 GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG 150 GK RPLGIP +RDR++Q+ +EPI E+ F+ SYGFRP RS HHA+ + L + Sbjct: 104 GKTRPLGIPTMRDRLIQQMFKQILEPICEARFYNHSYGFRPNRSTHHAMGRCQF-LANIA 162 Query: 151 ETRGRWVIEGDLSSYFDTVHHRLLMKAVRR-RISDARFMTLLWKTIKAGHIDVGLFRAAS 209 + V++ D+ +FD V H L+K + I D R ++++ K +KA +G+ + Sbjct: 163 --LNQHVVDIDIQGFFDNVSHSKLLKQMYSIGICDKRVLSVVSKMLKAPIKGIGI---PT 217 Query: 210 EGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR---KDRWYWNNSIQRGRSTAVRENW 266 +G PQGG++SPLLSNI+LN+ D ++ ++ + K + K+R N + R+T ++E + Sbjct: 218 KGTPQGGILSPLLSNIVLNDLDWWISNQWENMKTKFNYKER--KNKVLMIKRTTTLKEMY 275 Query: 267 QWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN---D 323 RYADDF + K K ++ + +G L+ L L ++ +K+KI ++ Sbjct: 276 -------IVRYADDFKIFTKSHKNAIK-LYHAVKGYLKNHLNLDISNEKSKITNLRKRAS 327 Query: 324 GFIFLGHRLIRKRSRY 339 F+ ++I+K RY Sbjct: 328 EFLGFSLKVIKKGKRY 343 >UniRef50_Q3S275 ORF718 n=2 Tax=Eukaryota RepID=Q3S275_THAPS Length = 718 Score = 162 bits (411), Expect = 1e-38, Method: Compositional matrix adjust. 
Identities = 108/347 (31%), Positives = 187/347 (53%), Gaps = 23/347 (6%) Query: 20 LLRLITQPEWLAEA-ARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 L +++ P +L A ARI S+ G+ T + +K L L+ + + +G +Q Sbjct: 163 LSSIMSDPNFLIAAWARIR-SNSGSLTFAL---SKETLDGIALSWLEETANTMRNGIFQF 218 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P+RR YI KS+G RPL IP+ RD+IVQ AM + ++E DF S+G+ R H A Sbjct: 219 SPSRRTYISKSDGGKRPLTIPSPRDKIVQEAMRFLLMLVFEGDFSKNSHGWVSGRGCHTA 278 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + +K++ W+IEGD+ F +++H++L+ ++ +I D F+ L++K ++ G Sbjct: 279 LNQIKMEF-----AHDNWLIEGDIDQQFPSLNHQVLVNLLKTKIDDQAFIDLIYKYLRVG 333 Query: 199 HIDV-GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE----RYLSGKARKDRWYWNNS 253 + + G QGGV+SP+L+NI + FD+++ +Y GK RK + Sbjct: 334 YGESPDKIVKMRIGTSQGGVLSPVLANIYMTPFDKWVERDLIPKYTKGKRRKANPVYTKM 393 Query: 254 IQRGRST-----AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 I+ G+ T ++ + + + Y RYADDF++ + G K + I +EC+ L LK Sbjct: 394 IRSGKVTDHSIPSLYAHDRNFIRLHYVRYADDFIMGLNGPKVYCKQIVDECKTFLFEQLK 453 Query: 309 LRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 L LN++KTKI H D FLG+R+ +++ +M++ + + +R Sbjct: 454 LTLNIEKTKITHSQLDSATFLGYRVY--KTKLSKMKIAHNLKGQLSR 498 >UniRef50_A7GTD4 RNA-directed DNA polymerase n=1 Tax=Bacillus cytotoxicus NVH 391-98 RepID=A7GTD4_BACCN Length = 543 Score = 162 bits (409), Expect = 2e-38, Method: Compositional matrix adjust. 
Identities = 113/371 (30%), Positives = 180/371 (48%), Gaps = 66/371 (17%) Query: 40 SKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIP 99 +KG +DG LQ ++ + D+L G ++ P RR YI K+NGK RPLG+P Sbjct: 50 TKGTSEETIDGF---YLQ-----KIDEIIDQLKKGMFRFAPVRRAYISKANGKKRPLGVP 101 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 +D++VQ M M +E ++E F S+GFR RS H A+ +K T G T W IE Sbjct: 102 NFKDKLVQEVMRMILENVYEPTFSDNSHGFREGRSCHTALSQIK--NTWKGLT---WCIE 156 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 G + +FD + H +L+ + +R++D RF+ L+ + +G ++ ++ G PQGG++S Sbjct: 157 GAIKGFFDHIDHSVLINLISKRMNDHRFLLLIHNALASGVMENWTYQKTYSGTPQGGILS 216 Query: 220 PLLSNIMLNEFDQYLHE------------------------RYLSGKARKDRWYWNNSIQ 255 PLL+NI L+EFD +L + R LS K + + + Sbjct: 217 PLLANIYLHEFDIFLEKQIEKFDKEKLRARNKEYTKIHSEIRSLSRKVKSLDDRTGHRLW 276 Query: 256 RGRSTAVRENWQWK----------------PAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 +GR + + K + Y RYADDFV+ + G+K I+E Sbjct: 277 KGREKVIETIAELKRKQIGISSVNPMDNDYQKMKYVRYADDFVIGIAGSKDCAVNIKETI 336 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR----KRSRY----GEMRVVS----- 346 + L+ L L L+ +KT I H+ + FLG+ + KR+R + R +S Sbjct: 337 KNFLKQELHLELSEEKTLINHLENPISFLGYEFRKWNEIKRTRVLYKNHKQRALSRAIKL 396 Query: 347 TIPQEKARNFA 357 IP++K + FA Sbjct: 397 EIPKKKMKEFA 407 >UniRef50_C1PA09 RNA-directed DNA polymerase n=3 Tax=Firmicutes RepID=C1PA09_BACCO Length = 432 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. 
Identities = 117/360 (32%), Positives = 187/360 (51%), Gaps = 44/360 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q +L A D + + I + + L EA + + GA GVD V+ ++A Sbjct: 16 FQNRLYLAAKADRKRKFYAIYDKIYRKDILEEAWKRVKQNGGA--GGVDKVSIEDVKAYG 73 Query: 61 AVEL-QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +L + +EL + Y+ P RR YIPK +G+ R LGIP ++DR+VQ A + +EP++E Sbjct: 74 EEKLLNEIAEELRTEKYRCKPVRRTYIPKQDGRKRALGIPTIKDRVVQMATKIVIEPVFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 ++F SYGFRP+RS A+ + ++ D G WVI+ D+ YF +++H L+ V+ Sbjct: 134 ANFQPCSYGFRPKRSAKQAMDRI-FEVADKGG--ALWVIDADIKDYFGSINHDKLLLLVK 190 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +RI+D R + L+ +KAG ++ + ++ G PQGGVISPLLSN+ LN FD Sbjct: 191 QRITDRRVLKLIKGWLKAGVLEDSQYSESTVGAPQGGVISPLLSNVYLNYFDI------- 243 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 YWN + G + RYADDFV++ K EA+R Sbjct: 244 ---------YWNKAF--GHLGEL------------VRYADDFVILCKRLSHAEEALR-AV 279 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLG--HRLIRKRSRYGEMR-VVSTIPQEKA 353 + ++ L+L L+ +KT++ + D F FLG +R R R++ + + +P +KA Sbjct: 280 KWIMR-KLELTLHSEKTRLVDMYFGKDSFDFLGFNNRFQRFRNKSWQWYWTLQQVPSKKA 338 >UniRef50_Q1PUN9 Strong similarity to group II intron-encoded protein LtrA n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUN9_9BACT Length = 432 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. 
Identities = 125/339 (36%), Positives = 179/339 (52%), Gaps = 45/339 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 QRKL A + R L + +L EA + ++ G + G DG+ +++ Sbjct: 21 FQRKLYRKAKQEEGFRFYVLYDKVRMLHFLREAYKRCKANGG--SAGADGITFEDVES-Y 77 Query: 61 AVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 VE L + +EL + Y+P P RVYIPK+NGK RPLGIP ++DR+VQ ++ + +EPI+ Sbjct: 78 GVEKFLGEIIEELENKTYEPQPVLRVYIPKTNGKTRPLGIPVIKDRVVQMSVKLVIEPIF 137 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP RS A+R +K +L + G+T V + DLSSYFDT+ H+ L+ + Sbjct: 138 EADFEDSSYGFRPGRSAGDAVRKIKEKLRE-GKTE---VFDADLSSYFDTIPHKELLLLI 193 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 RISD + L+ +KA I+ G R G PQG VISPLL+NI L+ D+ ++ Sbjct: 194 GMRISDKNVLHLIKMWLKAPVIEEGKPGGGRKNKIGTPQGSVISPLLANIYLHMLDKAVN 253 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQ-WKPAVAYCRYADDFVLIVKGTKAQVEA 294 REN +K + RYADD+VL+ K + EA Sbjct: 254 ---------------------------RENGVFYKYGITIIRYADDWVLMAK--RIPREA 284 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRL 332 + R + LKL LN DK+KI + F FLGH + Sbjct: 285 LDYLNRLLK--KLKLSLNEDKSKIVKAEEESFDFLGHTI 321 >UniRef50_B4D301 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Chthoniobacter flavus Ellin428 RepID=B4D301_9BACT Length = 415 Score = 160 bits (406), Expect = 5e-38, Method: Compositional matrix adjust. 
Identities = 103/300 (34%), Positives = 153/300 (51%), Gaps = 40/300 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 I++L+ PE A + +S+ GA PG+DG+ + L L + +R +LL+G Y Sbjct: 2 IEQLMEEAVSPENWHTAWKAVVSNGGA--PGIDGMRCSELVEHLQRHGEAIRAKLLAGRY 59 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P R IPK G R LGIP + DR VQ+ +L + PI+E F SYGFRP RS H Sbjct: 60 TPSPVLRTKIPKPGGGERDLGIPTVLDRFVQQLLLQVLTPIYEPRFSARSYGFRPGRSTH 119 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A+R + + + G++ +VI+ D+ +FD V+H LLM +R +SD R TL+ + +K Sbjct: 120 DAVRQAQAYVKE-GKS---YVIDLDIEKFFDRVNHNLLMHRLRETVSDVRVRTLIGRYLK 175 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 AG + G+ + EG PQGG +SPLL+NI L+ D L R Sbjct: 176 AGVMVNGVVQDNEEGTPQGGPLSPLLANIYLDPLDWELEGR------------------- 216 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 +AY RYADD + V + A + + G +E L+L++N K+ Sbjct: 217 --------------GLAYVRYADDCNIYVS-SAAAAQRVLSSLIGWIEKKLRLKVNQTKS 261 >UniRef50_Q3A4Z2 Group II intron-encoding maturase n=98 Tax=Bacteria RepID=Q3A4Z2_PELCD Length = 529 Score = 159 bits (403), Expect = 1e-37, Method: Compositional matrix adjust. 
Identities = 97/300 (32%), Positives = 155/300 (51%), Gaps = 40/300 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 RL+ + + A + + +KGA G+DG+ L+ L + +++ELL+G YQP Sbjct: 110 RLMEEVVSRGNMMAAYQRVVRNKGA--AGIDGMPVGDLKTYLQEQWPRIKEELLTGTYQP 167 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P R+V IPK G +R LGIP + DR++Q+A+ + ++E +F SYGFRP RS H A Sbjct: 168 QPVRKVEIPKPGGGMRMLGIPTVLDRLIQQALHQELMRLFEPEFSEHSYGFRPGRSAHQA 227 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 +++ + + + RW ++ DL +FD + H +LM V R++ D R + L+ + + G Sbjct: 228 VQSARRHVA----SGRRWAVDIDLEKFFDRMGHDILMSRVARKVKDRRVLGLIRRYLTVG 283 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 ++ G+ +G PQGG +SPLLSNI+L+EFD+ L R Sbjct: 284 VLEGGIISPRVQGTPQGGPLSPLLSNILLDEFDKELERR--------------------- 322 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 A+CRYADD + V +A E + LE LKL++N K+ + Sbjct: 323 ------------GHAFCRYADDCNIYVHSRRA-AERVMTSLTRFLEQQLKLKVNRVKSAV 369 >UniRef50_Q188V0 Group II intron reverse transcriptase/maturase n=7 Tax=Firmicutes RepID=Q188V0_CLOD6 Length = 609 Score = 159 bits (402), Expect = 2e-37, Method: Compositional matrix adjust. 
Identities = 109/322 (33%), Positives = 181/322 (56%), Gaps = 19/322 (5%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD-ELLSGHYQP 78 L+ ITQ E + A R S+KG+ T G + +T++ Q+++ + ++QP Sbjct: 40 LMSYITQEENILLAYRNIKSNKGSKTAGTNK--RTIIDVGEENPYQLVQYVQNRFNNFQP 97 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 RRV IPK NGK RPLGIP + DR+VQ+ + +EPI E+ FH SYGFRPERS HHA Sbjct: 98 HSIRRVEIPKPNGKTRPLGIPTIEDRLVQQCIKQILEPILEAKFHKHSYGFRPERSSHHA 157 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIKA 197 I + Q T G +V++ D+ +FD V+H L+K + +I D F+++L + +KA Sbjct: 158 IAIFQ-QWTFKG---FHYVVDIDIKGFFDNVNHGKLLKQLWTMKIRDKTFISILSRMLKA 213 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 +G +++G PQGG++SPLL+N++LNE D ++ ++ G K ++ S Sbjct: 214 EVKGIG---KSTKGTPQGGILSPLLANVVLNELDWWIDSQW-DGFPTKRKY----SSLLS 265 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 ++ ++R+ K + RYADDF ++ K + I + L+ L L ++ +K+K Sbjct: 266 KTQSIRKYSNLK-EIKIVRYADDFKIMCKDYHT-AQKIFLATKQWLKVRLDLDISPEKSK 323 Query: 318 IPHVNDGFI-FLGHRLIRKRSR 338 + ++ + FLG +L K+ + Sbjct: 324 VTNLRKNYSDFLGFKLKVKKGK 345 >UniRef50_Q94Z00 Orf757 n=4 Tax=stramenopiles RepID=Q94Z00_PYLLI Length = 757 Score = 159 bits (402), Expect = 2e-37, Method: Compositional matrix adjust. 
Identities = 98/301 (32%), Positives = 157/301 (52%), Gaps = 28/301 (9%) Query: 69 DELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 + L +G ++ ARRV+IPK + KLRPLG+ + RD+++ A+L +EP +E F +S+ Sbjct: 223 NNLKAGKFKFSNARRVHIPKPGSSKLRPLGVVSPRDKVILTAVLQVLEPFYEKKFLDISH 282 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 FRP R H A+ ++L+ + W IEGD++ FD + H +L+ +RR I + Sbjct: 283 AFRPGRGCHTALNFIQLRFGN-----SNWAIEGDIARCFDDIDHDILLGILRRDIKCDKT 337 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LHERYLSGKAR 244 + L+ K++K ++ G+ +G QG +SP L NI L+E D + L E Y+SG R Sbjct: 338 IALIKKSLKNPFVEDGVTVKPQKGTFQGSPLSPFLCNIYLHEMDLFIKGLSEDYISGTHR 397 Query: 245 KDRWYWNN--------SIQRGRSTAVRENWQWKPA----------VAYCRYADDFVLIVK 286 + + S++ + + + P+ +Y RYADDFV+ + Sbjct: 398 RKSPQYRKIQYELSKPSLKVTERKLLNKKLRAIPSKDPVDPDFRRFSYVRYADDFVIGIT 457 Query: 287 GTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVV 345 G K E +R + R L L L L+MDKT I H N +G FLG R+ + R +R + Sbjct: 458 GPKKDCEEVRNKLREFLTKILALELSMDKTIISHFNQEGITFLGTRISGNKEREKVIRKI 517 Query: 346 S 346 S Sbjct: 518 S 518 >UniRef50_A6DJK4 Reverse transcriptase/maturase n=21 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DJK4_9BACT Length = 446 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. 
Identities = 108/312 (34%), Positives = 156/312 (50%), Gaps = 45/312 (14%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 + EA S+KG H GVD V+ ++ L L +EL G Y P RRV IPK Sbjct: 57 IMEAWEKVCSNKGKH--GVDMVSIERYESELEYNNAKLLEELQDGRYDPSAVRRVEIPKG 114 Query: 90 NG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTD 148 +G K RPLGIP +RDR+VQ A+ +EPI++ DF S+GFRP+ A+R V +L Sbjct: 115 DGRKTRPLGIPTVRDRVVQTALKHVIEPIFDIDFSPYSFGFRPKLGCKDALRRVN-ELLK 173 Query: 149 CGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAA 208 G +V++ D+ SYFDT+ H LM V+ +I D + + L+ + +KA D Sbjct: 174 QGYL---YVMDADIQSYFDTIPHEKLMSRVKEKIIDGKILDLIEQFLKANIFDGLKHWEP 230 Query: 209 SEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQW 268 EG PQGG+ISPLL+NI L+ FD + E Sbjct: 231 EEGTPQGGIISPLLANIYLDLFDHKMTE-------------------------------- 258 Query: 269 KPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG---F 325 RYADDF+++ K ++ A+R+ R + LK L+ +KT+I + + F Sbjct: 259 -AGFEIVRYADDFLIMCKSKESAKRALRKTRRWMKANGLK--LHPEKTRIVDMTEKCEYF 315 Query: 326 IFLGHRLIRKRS 337 FLG+ R R+ Sbjct: 316 EFLGYHFERTRN 327 >UniRef50_C3BJV7 D-alanine--D-alanine ligase A (D-alanylalanine synthetaseA) n=2 Tax=Bacillus RepID=C3BJV7_9BACI Length = 620 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. 
Identities = 108/325 (33%), Positives = 172/325 (52%), Gaps = 16/325 (4%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQI 66 + + + R L + + A +++GA+TPGVDG + + LQ +++ Sbjct: 21 YELSKKNTRFHSLYEMAFNETTIITAIHKIKANRGANTPGVDGHDIRRYLQMDKNNVIKL 80 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + + +Y+ PARRVYI K++G RPLGIP + DRI+Q + +EPI E+ F+ S Sbjct: 81 ITKA--ARNYKSKPARRVYIEKADGSQRPLGIPTVVDRIIQECIRTILEPIVEAKFYDHS 138 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDA 185 YGFRP RS HA+R V + T+ + IEGD+ YFD ++HR L+K + R I D Sbjct: 139 YGFRPYRSSKHAVRQVNHFI---NTTKSYYAIEGDIKGYFDNINHRFLIKKLWRLGIRDK 195 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 R + ++ +KAG+++ +G PQGG+ISPLL+N+ LN+FD + R+ K Sbjct: 196 RIIKIIQIMLKAGYMEYDFKFTTEKGTPQGGIISPLLANVYLNDFDWMVARRFYKAKPTG 255 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 + ++ R VR Q + RYADD++++ + T + E R Sbjct: 256 ----ISKEPRKQRERLVR---QGRNKCYLVRYADDWIILTQ-TYQEARRYLEYLRKYFRI 307 Query: 306 SLKLRLNMDKTKIPHVNDG-FIFLG 329 LKL L+ +KT I + + +FLG Sbjct: 308 KLKLELSKEKTVITDLREKPALFLG 332 >UniRef50_C2KES2 Reverse transcriptase/maturase n=14 Tax=Firmicutes RepID=C2KES2_9LACO Length = 432 Score = 158 bits (400), Expect = 2e-37, Method: Compositional matrix adjust. 
Identities = 102/299 (34%), Positives = 152/299 (50%), Gaps = 40/299 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L+ I L EA R +KGA PGVD L + ++D ++ Y+P Sbjct: 3 LIEQILSQNNLKEAIRRVKINKGA--PGVDRRTVDELDSYFKKHQVEIKDAIMKMKYRPQ 60 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 RRVYIPK+NGK RPLGIP + DR++Q+A+ + I++ F SYGFRP RS H AI Sbjct: 61 AVRRVYIPKANGKKRPLGIPTVVDRVIQQAIAQVLMKIYDPHFSEHSYGFRPGRSAHDAI 120 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V L+ + G +WV++ D+ YFDTV+H L+ +R +++D + L+ +KAG Sbjct: 121 EQV-LEYLNEGY---QWVVDLDIEKYFDTVNHDKLISIIREQVNDKTTLHLIRAFLKAGV 176 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 ++ G + GVPQGG +SP+LSNI L++ D+ L +R Sbjct: 177 MEDGWVKPNKLGVPQGGPLSPILSNIYLDKMDKELEQR---------------------- 214 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + + RYADD + VK K+ + + LE L L++N KTK+ Sbjct: 215 -----------GLRFVRYADDCNIFVKSGKS-AKRVMNSISSWLERKLFLKVNATKTKV 261 >UniRef50_C3LL08 Group II intron reverse transcriptase/maturase n=34 Tax=Firmicutes RepID=C3LL08_BACAC Length = 643 Score = 158 bits (400), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 105/314 (33%), Positives = 171/314 (54%), Gaps = 20/314 (6%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS--G 74 + L+ +I E + A R +KG+ T D VN ++ +E +E+ Sbjct: 39 FKNLMSIIISDENILLAYRNIKGNKGSRTAACDNVNIKNIEG---MEQSYFLNEVKRRFQ 95 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +YQP RR I K NG+ RPLGIPA+ DRI+Q+ +L MEPI E+ F SYGFRP RS Sbjct: 96 NYQPQKVRRKEISKPNGQTRPLGIPAMWDRIIQQCILQVMEPICEAHFSNRSYGFRPNRS 155 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWK 193 HA+ +++ T +V++ D+ +FD V+H LM+ + I D + + ++ K Sbjct: 156 AEHALADASVRVNKQNLT---YVVDVDIKGFFDEVNHVKLMRQLWTLGIRDKQLLVIIRK 212 Query: 194 TIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD-----R 247 +KA + G ++G PQGG++SP+L+N+ LNEFD ++ ++ + KA+K R Sbjct: 213 ILKAPVQMPDGTTMFPTKGTPQGGILSPILANVNLNEFDWWISRQWETFKAKKVKPRCMR 272 Query: 248 WYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSL 307 W N + +T + + + KP + RYADDF I T++ E I + + LE L Sbjct: 273 GIWCNDVV---TTQLTKTSKMKP-MYIVRYADDFK-IFTNTRSNAEKIFKATQMWLEERL 327 Query: 308 KLRLNMDKTKIPHV 321 KL ++ +K+K+ ++ Sbjct: 328 KLSISAEKSKVTNL 341 >UniRef50_B9J6F8 Reverse transcriptase n=7 Tax=Bacillus cereus group RepID=B9J6F8_BACCQ Length = 624 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 106/336 (31%), Positives = 176/336 (52%), Gaps = 16/336 (4%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDELLSGHYQP 78 ++ L+ + + A S++G+ T G+D + LQ ++++R + +Y+P Sbjct: 35 IIELMKNKQTIKTAIHNIKSNRGSMTVGIDKKDVNYYLQMEAKQLIKLIRQHI--DNYKP 92 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RR YI K NGK RPLGIP + DRI+Q + +EPI E+ F SYGFRP RS H+A Sbjct: 93 NPVRREYINKGNGKKRPLGIPTMIDRIIQEIARIVLEPIAEAKFFNHSYGFRPYRSCHYA 152 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIKA 197 I V L ++ IEGD+ S+FD ++H L++ + I D RF+ ++ K ++A Sbjct: 153 IGRV---LNTISRSKTYIAIEGDIKSFFDHINHNKLVEMMWNMGIKDKRFLIIIKKMLRA 209 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G ++ + G PQGG+ISPLL+NI LN FD + + + +A R+ ++ + G Sbjct: 210 GVLEDKVILPTEIGTPQGGIISPLLANIYLNNFDWMVAKEFEEHRA---RYTVKHAFRSG 266 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + R + + RYADD++++ + T Q + + + LKL L+ +KT Sbjct: 267 LTKVGRRHKK----CFLIRYADDWIILCEDT-VQARILLTKIDKYYKHILKLELSKEKTF 321 Query: 318 IPHVNDGFI-FLGHRLIRKRSRYGEMRVVSTIPQEK 352 I + + FLG + ++ R + IP +K Sbjct: 322 ITDLREKPARFLGFDIKAEKMRLKDRIAGKAIPNKK 357 >UniRef50_C2XKK9 D-alanine--D-alanine ligase A (D-alanylalanine synthetaseA) n=1 Tax=Bacillus cereus F65185 RepID=C2XKK9_BACCE Length = 647 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 106/336 (31%), Positives = 176/336 (52%), Gaps = 16/336 (4%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDELLSGHYQP 78 ++ L+ + + A S++G+ T G+D + LQ ++++R + +Y+P Sbjct: 58 IIELMKNKQTIKTAIHNIKSNRGSMTVGIDKKDVNYYLQMEAKQLIKLIRQHI--DNYKP 115 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RR YI K NGK RPLGIP + DRI+Q + +EPI E+ F SYGFRP RS H+A Sbjct: 116 NPVRREYINKGNGKKRPLGIPTMIDRIIQEIARIVLEPIAEAKFFNHSYGFRPYRSCHYA 175 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIKA 197 I V L ++ IEGD+ S+FD ++H L++ + I D RF+ ++ K ++A Sbjct: 176 IGRV---LNTISRSKTYIAIEGDIKSFFDHINHNKLVEMMWNMGIKDKRFLIIIKKMLRA 232 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G ++ + G PQGG+ISPLL+NI LN FD + + + +A R+ ++ + G Sbjct: 233 GVLEDKVILPTEIGTPQGGIISPLLANIYLNNFDWMVAKEFEEHRA---RYTVKHAFRSG 289 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + R + + RYADD++++ + T Q + + + LKL L+ +KT Sbjct: 290 LTKVGRRHKK----CFLIRYADDWIILCEDT-VQARILLTKIDKYYKHILKLELSKEKTF 344 Query: 318 IPHVNDGFI-FLGHRLIRKRSRYGEMRVVSTIPQEK 352 I + + FLG + ++ R + IP +K Sbjct: 345 ITDLREKPARFLGFDIKAEKMRLKDRIAGKAIPNKK 380 >UniRef50_Q1QGR6 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Bradyrhizobiaceae RepID=Q1QGR6_NITHX Length = 440 Score = 158 bits (399), Expect = 4e-37, Method: Compositional matrix adjust. 
Identities = 120/342 (35%), Positives = 172/342 (50%), Gaps = 47/342 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ-AR 59 +QRKL A +P+ R L I + + L A TL+ A PGVDG+ ++ A Sbjct: 12 LQRKLYRKAKAEPAFRFYILYDKICREDVLLRA--YTLARANAGAPGVDGMTFGQIEGAG 69 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L LR++L+S YQP P RRV IPK G RPLGIP +RDR+VQ A + +EPI+E Sbjct: 70 VDAWLAGLREDLVSKTYQPDPVRRVMIPKPGGGERPLGIPTIRDRVVQAAAKIVLEPIFE 129 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + F +YG+RP RS A++ +L G T V++ DLS YFDT+ H L+++V Sbjct: 130 AGFEDGAYGYRPRRSAIDAVKETH-RLLCRGYTD---VVDADLSKYFDTIPHADLLRSVA 185 Query: 180 RRISDARFMTL--LWKTIKA------GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFD 231 RR+ D + L LW + G + ++++ G PQGGV+SPLLS I +N F Sbjct: 186 RRVLDRNVLRLIKLWLQVPVEERDGDGKRHMSGGKSSTRGTPQGGVVSPLLSVIYMNRFL 245 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 ++ W + GR + YADDFV++ +G Sbjct: 246 KH----------------WRLT---GRGEVFHAH--------VISYADDFVILSRG---H 275 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRL 332 E R V+ L L LN KT + + +GF FLG+ L Sbjct: 276 AEEALTWTRAVMT-KLGLTLNEAKTSVKNARREGFDFLGYTL 316 >UniRef50_Q024N3 RNA-directed DNA polymerase n=6 Tax=Bacteria RepID=Q024N3_SOLUE Length = 435 Score = 158 bits (399), Expect = 4e-37, Method: Compositional matrix adjust. 
Identities = 123/368 (33%), Positives = 183/368 (49%), Gaps = 52/368 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQAR 59 +++KL A +P R L I + + L A + GA PGVDGV + +++ Sbjct: 8 LRQKLGQKAKQEPKFRFYALYDRIWRKDVLETAWERVRQNDGA--PGVDGVTIEEIMKTD 65 Query: 60 LAVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 V L+ + + L Y+P +RVYI K NGKLRPLGIP +RDR+VQ A L+ +EPI Sbjct: 66 QGVAGFLEGIENSLRRKTYRPEAVQRVYIEKENGKLRPLGIPTVRDRVVQMATLLILEPI 125 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E+DF SYGFRP RS H A+ ++ + E + V + DL YFD++ H L+ Sbjct: 126 FEADFLDCSYGFRPGRSAHQALEEIRGHV----EAGYQAVYDADLKGYFDSIPHTQLLAC 181 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASE------GVPQGGVISPLLSNIMLNEFD 231 VR R+ D + L+ ++A ++ S+ G PQGGV SPLL+N+ L+ FD Sbjct: 182 VRMRVVDRSVLKLIRMWLEAPVVEREEGGGGSKWSRPEKGTPQGGVASPLLANLYLHWFD 241 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK--GTK 289 + G G++ A RYADDFV++ K GT+ Sbjct: 242 ALFYGPEGPG---------------GKADA-----------KLVRYADDFVVMAKQMGTE 275 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND---GFIFLGHRLIRKRSRYG-EMRVV 345 +E I E R LEG +L +N +KT++ + + FL H R R G + + + Sbjct: 276 T-IEFI--ESR--LEGKFQLEINREKTRVVDLREEGASLDFLSHTFRRDRDLKGRDRKYL 330 Query: 346 STIPQEKA 353 + P KA Sbjct: 331 NVFPSAKA 338 >UniRef50_Q5ZTU1 Reverse transcriptase n=1 Tax=Legionella pneumophila subsp. pneumophila str. Philadelphia 1 RepID=Q5ZTU1_LEGPH Length = 506 Score = 157 bits (398), Expect = 4e-37, Method: Compositional matrix adjust. 
Identities = 100/312 (32%), Positives = 164/312 (52%), Gaps = 52/312 (16%) Query: 33 AARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGK 92 A R ++KG+ TPG+DGV T + + + +R+ L + Y+ P RR+YIPK NGK Sbjct: 67 AVRRVTTAKGSKTPGIDGVVWTTSEEKC----EAVRN-LKARGYKATPLRRIYIPKKNGK 121 Query: 93 LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT--DCG 150 RPL IP L+DR +Q L+A+EP+ E+ SYGFRP+RS H AI L +C Sbjct: 122 ERPLSIPTLKDRAMQALYLLALEPVGETTADLNSYGFRPKRSTHDAIYQCYATLARKNCA 181 Query: 151 ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASE 210 + W++EGD+ + FD + H L + I D R +T + ++AG+++ + Sbjct: 182 Q----WILEGDIKACFDEIDHGWLKSNI---IIDQRVLT---QWLQAGYMEKNQLFETAR 231 Query: 211 GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKP 270 G PQGG SPLL+N++L+ ++ +H SG + ++ Sbjct: 232 GTPQGGPASPLLANMVLDGLEREIH----SGCGQGNK----------------------- 264 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHVNDGFIF 327 + Y R+ADDF++ T + ++E+ ++ L L L+ +KTKI H+ +GF F Sbjct: 265 -INYIRFADDFIV----TANSPDILKEKVMPIISNFLAQRGLSLSQEKTKIVHIEEGFDF 319 Query: 328 LGHRLIRKRSRY 339 LG + + + ++ Sbjct: 320 LGFNVRKYKGKF 331 >UniRef50_B7JTB6 Group II intron reverse transcriptase/maturase n=6 Tax=Bacillus cereus RepID=B7JTB6_BACC0 Length = 599 Score = 157 bits (396), Expect = 8e-37, Method: Compositional matrix adjust. 
Identities = 116/360 (32%), Positives = 191/360 (53%), Gaps = 28/360 (7%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE---LQILRDELLS 73 + LL ++ E + A R S+ G+ TPG D +KT+L + + +R+ +L+ Sbjct: 32 FKNLLEIVISDENILLAYRQVKSNTGSKTPGTD--DKTILDLANTNQDEFIHYMRELVLN 89 Query: 74 GHYQPLPARRVYIPK--SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 Y+P RRV+I K S GK RPLGIP ++DRIVQ+ L +EPI E F+ SYGFRP Sbjct: 90 --YKPKSVRRVWIDKNYSKGK-RPLGIPCIQDRIVQQMFLNVLEPICEGKFYNHSYGFRP 146 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTL 190 R+ HA+ V+ T + + ++ D+ +FD V+H +L+K V I D R + + Sbjct: 147 TRTTRHAVARVQ---TLVNINKYHYTVDIDIKGFFDNVNHSILLKQVWNIGIRDKRVIAV 203 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 + K +KA G+ ++GVPQGG++SPLLSNI+LN+ DQ++ +++ + R + Sbjct: 204 ISKMLKAPIKGEGI---PTKGVPQGGILSPLLSNIVLNDLDQWVADQWECFETR-----Y 255 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR 310 S+ + +R N + K RYADDF ++ + V+ L LKL Sbjct: 256 QYSVNYSKYVNLRRNSKLKEGFL-VRYADDFRIMTNTHDSAVKWF-HAVVDFLNKRLKLE 313 Query: 311 LNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRV-VSTIPQEKARNFAASLTALLWKVR 368 ++ +K+KI ++ FLG++ K + G RV S I +K + L +++++ Sbjct: 314 ISPNKSKIINLRKKSSSFLGYKF--KSTIKGNKRVFFSHIDDDKQKQIITKLKERIYEIQ 371 >UniRef50_B2AJV8 RNA-directed DNA polymerase, retrotranscriptase n=45 Tax=root RepID=B2AJV8_CUPTR Length = 607 Score = 156 bits (395), Expect = 1e-36, Method: Compositional matrix adjust. 
Identities = 106/280 (37%), Positives = 148/280 (52%), Gaps = 34/280 (12%) Query: 41 KGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA 100 K + GVDGV + L + L D + +G Y+ LP+RRVYIPK++GK RPLGI A Sbjct: 209 KKSAAAGVDGVTWHDYEECLVERIGKLWDAVQAGRYRALPSRRVYIPKADGKQRPLGIAA 268 Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 L D+IVQ+A++ + PI+ESDF SYGFRP R H A+ + + L + WV++ Sbjct: 269 LEDKIVQQAVVTVLTPIYESDFLGFSYGFRPGRGQHQALDALWVGLH---WKKVNWVLDA 325 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISP 220 D+ S+FDTV H +M+ + RI+D R + L+ K + AG I+ G G PQG VISP Sbjct: 326 DIRSFFDTVDHGWMMRFLEHRIADKRLLRLIRKWLTAGVIENGAKTEIRVGTPQGAVISP 385 Query: 221 LLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYAD 279 LL+NI L+ FD +L RW ++ K V RYAD Sbjct: 386 LLANIYLHYVFDLWLQ-----------RWRRRDA---------------KGDVIVVRYAD 419 Query: 280 DFVLIVKGTKAQVEAIRE-ECRGVLEGSLKLRLNMDKTKI 318 D V+ G +A+ +A R E L LN KT++ Sbjct: 420 DSVV---GFEAEADASRFLEALKARFAQFGLSLNEQKTRV 456 >UniRef50_Q35062 CoxI intron2 ORF n=2 Tax=Marchantia polymorpha RepID=Q35062_MARPO Length = 802 Score = 156 bits (394), Expect = 1e-36, Method: Compositional matrix adjust. 
Identities = 119/401 (29%), Positives = 203/401 (50%), Gaps = 52/401 (12%) Query: 9 AATDPSLRIQRLLRLITQPEWLA---EAARITLSSKGAHTPGVDGVNKTMLQARLAVELQ 65 A P R L+ +I+ P +LA E+ R + G+ +DG + +Q Sbjct: 299 AYLGPDNRYNGLIHIISDPTFLALCYESIRGKPGTSGSDAKPLDGP-EWFVQ-------- 349 Query: 66 ILRDELLSGHYQPLPARRVYIPKSNGKLRPLGI-------PALRDRIVQRAMLMAMEPIW 118 + ++L G ++ PARR+ P K RPLGI ++IVQ+A+ + +E I+ Sbjct: 350 -VGEKLKKGQFEFSPARRITKPGKKEK-RPLGINSPVKQKKCYGEKIVQKALQLVLEAIY 407 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E F S+GFR RS H A++ + L+ WV+EG++ +FD++ H++++ + Sbjct: 408 EPIFLDCSHGFRIHRSCHTALKRLCLE-----GGHYPWVVEGNIRKFFDSIPHKVILHKI 462 Query: 179 RRRISDARFMTLLWKTIKAGHID--VGLFRAASEGVPQGGVISPLLSNIMLNEFDQY--- 233 +++ R + LL + ++AG+ D G + EG QG V+SPLL NI+L+ D++ Sbjct: 463 SQKVKCHRTLELLQRALRAGYKDPTSGQVISLDEGTSQGSVLSPLLCNIILHYLDEFVMK 522 Query: 234 LHERYLSGKARKDRWYWN------NSIQRGRSTAVRENW--------QWKPAVAYCRYAD 279 L +R+ GK+R+ + N+ ++ RS ++ + + Y RYAD Sbjct: 523 LRDRFNKGKSRRINPEYKLLTRHMNANRQDRSLLIKRRLIPSKDPLDPYFRRILYVRYAD 582 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSR 338 DFV++V GT+ + AI+ + L SL+L L+++KT + H+ N GF FLG R RSR Sbjct: 583 DFVILVSGTRLETFAIQASLQNFLHRSLRLELSLEKTVVSHLANKGFHFLGTYCKRTRSR 642 Query: 339 YGEMRVVS----TIPQEKARNF--AASLTALLWKVRISGEI 373 + V + TI Q A +T L +K++ G + Sbjct: 643 HRIFHVRTVRGKTIKQRSTERLRVCAPITKLFYKLKEKGFV 683 >UniRef50_Q82RB7 Putative reverse transcriptase homolog; similar to GII intron n=1 Tax=Streptomyces avermitilis RepID=Q82RB7_STRAW Length = 588 Score = 155 bits (391), Expect = 3e-36, Method: Compositional matrix adjust. 
Identities = 120/358 (33%), Positives = 169/358 (47%), Gaps = 47/358 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 +Q+L+R ++ L R+ S G T G+DG + R QIL D + Sbjct: 55 LQKLMRR-SRANTLTSVRRVCQVSTGKKTAGIDGQKALSPEKRGKTARQILADPM----S 109 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P RRVYIPK+NGK RPLGIP +RDR+ Q A+EP WE+ F SYGFRP R Sbjct: 110 HPQPVRRVYIPKANGKRRPLGIPVIRDRVDQARFKNALEPEWEARFEARSYGFRPGRGAW 169 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 AI + + + WV++ DLS+ FD + H+ LM +V + R W ++ Sbjct: 170 DAIEMI-FNVAGRRTAKRLWVLDADLSAAFDHISHQHLMDSV--GLFPGRRQIQQW--LR 224 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 AG ++ G F + EG PQGGVISPLL NI L+ + + G R WN Sbjct: 225 AGVMEDGRFVSTPEGTPQGGVISPLLMNIALHGMGEVI------GANRP----WNAKTT- 273 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 + RYADDFV+ ++A +++ LE L N +KT Sbjct: 274 --------------SPTLVRYADDFVVFCTTENEAIKA-KQDLAAWLE-PRGLSFNEEKT 317 Query: 317 KIPHVNDGFIFLG-------HRLIRKRSRYGEMRV---VSTIPQEKARNFAASLTALL 364 ++ H++ G FLG +LI K SR R +ST +E + + SL L Sbjct: 318 RVVHLSSGVDFLGFNVGRFRQKLIIKPSRDALQRARKRISTTARENSGSPTESLVRAL 375 >UniRef50_B1N1A3 NicA n=1 Tax=Pseudomonas putida RepID=B1N1A3_PSEPU Length = 618 Score = 154 bits (390), Expect = 3e-36, Method: Compositional matrix adjust. 
Identities = 113/363 (31%), Positives = 173/363 (47%), Gaps = 51/363 (14%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL-QIL 67 A +DPS R+ RL+ + + A S G TPG DG R + + Sbjct: 17 ANSDPSYVNDRIYRLMYKEDLYIAAYEKIKSKPGNMTPGQDGTTLDEFSIRTIRNIINKM 76 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 +DE + ARRV IPK+NGK RPL + D++VQ + +E I+E F S+ Sbjct: 77 KDESFTFR----GARRVLIPKANGKTRPLSVAPPTDKVVQEVIRSILEAIYEPTFSKNSH 132 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFR +S H A++ V+ + G T WVIEGD+ FD + H L+ +R RI D RF Sbjct: 133 GFRAGKSCHTALKQVRESWS--GVT---WVIEGDIKGCFDNISHSKLIDQLRLRIKDERF 187 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL------HERYLSG 241 + L+ K + AG+ + G F +A+ G PQG +ISP+L+N+ L++ D+ + H + G Sbjct: 188 INLIRKALNAGYFENGAFFSATLGTPQGSIISPILANVFLDQLDRKVEQLIKDHHQGEEG 247 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPA------------------------------ 271 D Y +QR +++ ++ + + A Sbjct: 248 DKITDPAY--RKLQRQKTSLRKKAEKQEGAERDATLSLAREANSKLLSMSPYLTRNNGFI 305 Query: 272 -VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLG 329 V Y RYADD+++ V G K E +R LE + L L+++KT I H ++ FLG Sbjct: 306 RVKYVRYADDWIIGVNGPKLLAEELRSVVGEFLENA-GLELSIEKTHIRHAKSETAKFLG 364 Query: 330 HRL 332 L Sbjct: 365 TNL 367 >UniRef50_A6LY84 RNA-directed DNA polymerase (Reverse transcriptase) n=17 Tax=Firmicutes RepID=A6LY84_CLOB8 Length = 433 Score = 154 bits (390), Expect = 3e-36, Method: Compositional matrix adjust. 
Identities = 90/276 (32%), Positives = 149/276 (53%), Gaps = 32/276 (11%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPAL 101 G G+D V K L ++ L +L + Y+P +RVYIPK +GK RPLGIP+ Sbjct: 41 GNKATGIDDVTKQEYSKELDNNIENLIVKLRNHSYKPQAVKRVYIPKGDGKTRPLGIPSY 100 Query: 102 RDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGD 161 D++VQ A+ ++ I+E++F SYGFRP+R+ H AI+ + ++ + G R +V++ D Sbjct: 101 EDKLVQMALNKILQSIYEAEFKDFSYGFRPKRNCHSAIKALN-KVIENG--RINYVVDAD 157 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPL 221 + +F+ V+H ++K + RI D ++L+ K +KAG +D G+ + G PQG ++SP Sbjct: 158 IKGFFNNVNHEWMIKFLEVRIGDPNIISLVKKFLKAGLMDNGIIKTTEIGTPQGSIVSPT 217 Query: 222 LSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDF 281 L+NI YLH D W+ ++ N++ + + RYADDF Sbjct: 218 LANI-------YLHYSL-------DLWF---------EKVIKRNFRGQSEIT--RYADDF 252 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 V + + EA R+ CR ++ K L +++TK Sbjct: 253 VCCF---QYESEA-RQFCRLLVSRLNKFNLEVERTK 284 >UniRef50_B1C301 Putative uncharacterized protein n=6 Tax=Clostridium spiroforme DSM 1552 RepID=B1C301_9FIRM Length = 270 Score = 154 bits (390), Expect = 4e-36, Method: Compositional matrix adjust. 
Identities = 81/208 (38%), Positives = 125/208 (60%), Gaps = 6/208 (2%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 ++LL I + + +A + +S+KG+ GVD + ++ A L+ ++ GHY Sbjct: 52 EKLLETIMEDANIEKAIQRVMSNKGS--GGVDKMQVAEVRTHFAQHWSYLKKLIMEGHYS 109 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P +RV IPK NGK R LGIP + DR++Q+A++ + PI+E F SYGFRP R+ H Sbjct: 110 PQAVKRVEIPKDNGKKRELGIPTVTDRVIQQAIVQVLTPIFEPQFSDNSYGFRPRRNAHQ 169 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+R V ++ + G R+ ++ DL YFDTV+H L++ + + I D R ++L+ K + A Sbjct: 170 AVRKV-VEYANEGY---RYTVDLDLEKYFDTVNHSRLIQILSQTIKDGRVISLIHKYLNA 225 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNI 225 G I F ++GVPQGG +SPLLSNI Sbjct: 226 GVIVKHKFEETTKGVPQGGPLSPLLSNI 253 >UniRef50_B4CYA7 RNA-directed DNA polymerase (Reverse transcriptase) n=6 Tax=Bacteria RepID=B4CYA7_9BACT Length = 482 Score = 154 bits (390), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 123/367 (33%), Positives = 179/367 (48%), Gaps = 53/367 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q L A T+PS R L + + ++L EA R + G + GVDG ++A Sbjct: 17 LQSSLQAKAKTEPSYRFYSLWDKVCRGDFLVEAYRRCRRNGG--SAGVDGETFEQIEAAG 74 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L L L++EL + Y+ P RV+IPKSNG RPLGIP +RDR+VQ A +M + PI+E Sbjct: 75 LDAWLGKLQEELRTKQYRTQPLLRVWIPKSNGGQRPLGIPTVRDRVVQMATVMVLGPIFE 134 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +D GFRP R A+R V Q+ G V++ DLS YF+TV H LMK++ Sbjct: 135 TDLCEEQMGFRPGRDAKTAVRLVYYQVRQKGRQE---VVDADLSDYFNTVPHGALMKSLS 191 Query: 180 RRISDARFMTLLWKTIKA----GHIDVGLFRA-----ASEGVPQGGVISPLLSNIMLNEF 230 RRI+D + ++++ + ++A D L R+ A G PQGGVISPLL+N+ F Sbjct: 192 RRIADGQVLSVIARWLEAPVEECTPDGRLVRSTPAKDAGRGTPQGGVISPLLANVYFRRF 251 Query: 231 DQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK---G 287 L + L + D N YADDFV+ + G Sbjct: 252 
--VLAWKQLGYEQAFDSVIVN-------------------------YADDFVICCRPGNG 284 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVS 346 A+ R + G + L +N KTK+ V + F FLG+ + + G+ + Sbjct: 285 NDARKAMTR------VMGKIGLTVNEQKTKVVRVPGESFDFLGYTIGGFYGQGGKP-YIG 337 Query: 347 TIPQEKA 353 T P +KA Sbjct: 338 TRPSKKA 344 >UniRef50_A5VLF2 RNA-directed DNA polymerase (Reverse transcriptase) n=40 Tax=Lactobacillus RepID=A5VLF2_LACRD Length = 460 Score = 154 bits (390), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 101/279 (36%), Positives = 144/279 (51%), Gaps = 40/279 (14%) Query: 40 SKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIP 99 +KGA G+D + L L L L G Y+P P +RV IPK NG +R LGIP Sbjct: 62 NKGA--AGIDDMTVNDLLPYLRENKTELIASLREGKYKPAPVKRVEIPKPNGGVRKLGIP 119 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 + DR+VQ+A+ + PI+E F S+GFRP R H AI V + L + G R V++ Sbjct: 120 TVVDRMVQQAVAQILTPIFERVFSDNSFGFRPHRGAHDAIAKV-VDLYNQGYRR---VVD 175 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 DL +YFD V+H L++K +++ I D + L+ K + +G +D GLF + +G PQGG +S Sbjct: 176 LDLKAYFDNVNHDLMIKYLQQYIDDPWTLRLIRKFLTSGVLDHGLFAKSEKGTPQGGPLS 235 Query: 220 PLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYAD 279 PLL+NI LNE D+ L R + RYAD Sbjct: 236 PLLANIYLNELDKELTRR---------------------------------GHHFVRYAD 262 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 D + VK +A +R + LE LK+++N DKTK+ Sbjct: 263 DCNIYVKSQRAGERVMRSITQ-FLEKRLKVKVNSDKTKV 300 >UniRef50_Q1VQM5 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=7 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VQM5_9FLAO Length = 456 Score = 154 bits (389), Expect = 5e-36, Method: Compositional matrix adjust. 
Identities = 115/346 (33%), Positives = 177/346 (51%), Gaps = 42/346 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLS--SKGAHTPGVDGVNKTMLQA 58 QRKL A D + L + + L EA R S SKG GVD + ++ Sbjct: 26 FQRKLYIRAKQDKGFKAYSLYGKLCEDHTLIEAYRRVRSNYSKGV---GVDNQSSDAIEK 82 Query: 59 R-LAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 + ++V L ++ +L Y+ ++ IPK G R LGIP +RDR+VQ A+ M +EP Sbjct: 83 QGISVFLGEIQQDLQGHTYRSQAVKQKLIPKEKEGDFRVLGIPTIRDRVVQMAVKMLIEP 142 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 +WE+DF S+GFRP+R AI+ VK + D R ++V + DLS YFDT+ H L Sbjct: 143 LWEADFEHTSFGFRPKRGAKDAIKQVKQNIYD----RHQFVYDADLSKYFDTIPHTKLFI 198 Query: 177 AVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 +++R+ D ++L+ + + A + G A+++G PQGGVISPLLSNI L+ FDQ + Sbjct: 199 LLKKRLVDHSILSLIHQWLTAPVRLPNGKLVASTKGSPQGGVISPLLSNIYLHAFDQIV- 257 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 NN +G+ K + RYADDF+L+ G + I Sbjct: 258 ---------------NNP--KGKFA--------KANIRIVRYADDFLLM--GKWYFSKEI 290 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYG 340 + +++ ++ L LN +KTK+ H + FLG +S++G Sbjct: 291 LDYITSIMD-NMGLTLNKEKTKLLHSSKSSLFFLGFEFRSIKSKFG 335 >UniRef50_D0LS09 RNA-directed DNA polymerase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LS09_HALO1 Length = 449 Score = 154 bits (389), Expect = 5e-36, Method: Compositional matrix adjust. 
Identities = 117/347 (33%), Positives = 167/347 (48%), Gaps = 58/347 (16%) Query: 29 WLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG-HYQPLPARRVYIP 87 WL EA R T + PG+DG L L+ L + G Y+ +P RRV IP Sbjct: 40 WLREAYRRTRKNAA---PGIDGQTGRAYAEALESNLESLLERAKDGDRYRAMPVRRVAIP 96 Query: 88 KSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT 147 K +G++RPLGIP D+++QRA++M +E ++E DF SYG+R RS H A++ V+ Sbjct: 97 KGDGRMRPLGIPTFEDKVLQRAVVMVLEAVYEQDFLDCSYGYRRGRSAHDAVKAVRAHTM 156 Query: 148 DCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRA 207 + RG WV+E D+ ++FDTV H L + +R+R+ D + + K + AG ++ G Sbjct: 157 ---KLRGGWVLEADIEAFFDTVDHAKLREILRQRVRDGVLLRWIGKWLNAGVMEEGNVYY 213 Query: 208 ASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENW 266 G PQGGVISP+L+NI LNE DQ W + R R Sbjct: 214 PEGGTPQGGVISPVLANIFLNEVIDQ-----------------WFEHVVRPRL------- 249 Query: 267 QWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKI----- 318 K RYADD V+I + + +A R E VL L LR++ +KT++ Sbjct: 250 --KGQGYLVRYADDLVMIF---EREDDARRVET--VLPKRLSKYGLRIHPEKTRLIQFLR 302 Query: 319 ------PHVNDG-----FIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 P DG F FLG RSR G V+ ++ R Sbjct: 303 PGYGTRPTRRDGNRPGTFDFLGFTHYWARSRKGSWVVMQKTAAKRLR 349 >UniRef50_Q11ZP4 RNA-directed DNA polymerase n=33 Tax=Bacteria RepID=Q11ZP4_POLSJ Length = 461 Score = 154 bits (389), Expect = 5e-36, Method: Compositional matrix adjust. 
Identities = 103/325 (31%), Positives = 166/325 (51%), Gaps = 46/325 (14%) Query: 39 SSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGI 98 ++KGA GVDG++ + +R ELL+G Y+P P RRV IPK +G R LGI Sbjct: 72 ANKGA--AGVDGLDIEHTAQTIRNHWSQIRQELLAGTYRPSPVRRVMIPKPDGSQRELGI 129 Query: 99 PALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVI 158 P + DR++Q+A+L ++P+ + F S+GFRP R H A++ + + ++ R V+ Sbjct: 130 PTVLDRLIQQALLQVLQPLIDPTFSEHSHGFRPGRRAHDAVKAARAHV----QSGKRVVV 185 Query: 159 EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVI 218 + DLS +FD V+H +L+ +R+R+ DA + L+ + AG +D G+ +G PQGG + Sbjct: 186 DVDLSKFFDRVNHDILIDRLRKRVDDAGVIRLIRAYLNAGIMDGGVVMDRQQGTPQGGPL 245 Query: 219 SPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYA 278 SPLL+N++L+E D+ L R S + RYA Sbjct: 246 SPLLANVLLDEVDKVLEARGYS---------------------------------FARYA 272 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR 338 DD + V KA + + E R + G LKL++N K+ + G FLG+ L + + Sbjct: 273 DDCNVYVGSVKAG-QRVMELLRKLYAG-LKLQINEAKSAVASAF-GRKFLGYALWVAKGK 329 Query: 339 YGEMRVVSTIPQEKARNFAASLTAL 363 V + ++ R+F A + L Sbjct: 330 ----EVKCKVAEKPLRDFKARIRQL 350 >UniRef50_D2FQY0 Regulatory protein GntR n=2 Tax=Staphylococcus aureus subsp. aureus RepID=D2FQY0_STAAU Length = 431 Score = 154 bits (388), Expect = 7e-36, Method: Compositional matrix adjust. 
Identities = 93/274 (33%), Positives = 139/274 (50%), Gaps = 39/274 (14%) Query: 46 PGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRI 105 PG+DG+ + +Q A ++ +LL G Y+P ++V IPK+NGK R LGIP +RDR+ Sbjct: 35 PGIDGMKVSEIQGHFAQYFPEIKQKLLEGTYKPQAVKKVEIPKANGKKRVLGIPVVRDRV 94 Query: 106 VQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSY 165 +Q+A+ +EP + F S+GFRP RS A++ + G T ++ DL Sbjct: 95 IQQAIKQVIEPSIDRTFSKHSHGFRPNRSTGTALKEC-ASYYEAGYT---IAVDCDLKQC 150 Query: 166 FDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV-GLFRAASEGVPQGGVISPLLSN 224 FD ++H LM R I D T + ++++ G ID+ G G PQGGVISPLL N Sbjct: 151 FDNINHDKLMYLFERHIKDKAVSTFIRRSLQVGAIDLSGEVAERKIGAPQGGVISPLLCN 210 Query: 225 IMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLI 284 I L+E D+ L +R+ + RYADDFV+ Sbjct: 211 IYLHELDKELEKRHHR---------------------------------FVRYADDFVIF 237 Query: 285 VKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 VK TK E + + + + +LKL +N DK+K+ Sbjct: 238 VK-TKRAGERVMDSIKTFIHKTLKLEVNNDKSKV 270 >UniRef50_C6IQ61 Putative uncharacterized protein n=10 Tax=Bacteroidales RepID=C6IQ61_9BACE Length = 560 Score = 154 bits (388), Expect = 7e-36, Method: Compositional matrix adjust. 
Identities = 106/314 (33%), Positives = 163/314 (51%), Gaps = 42/314 (13%) Query: 31 AEAARITLSSKGAHTPGVDGV--NKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPK 88 A A + S+KG +T GVD V + + +A EL+ RD Y P+P +RV I K Sbjct: 64 ALAVKRVTSNKGKNTSGVDKVLWSTPIAKANAITELK-RRD------YNPMPLKRVNIRK 116 Query: 89 SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTD 148 SNGKLRPLGIP ++DR +Q LMA++P+ E+ SYGFR ER AI + L+ Sbjct: 117 SNGKLRPLGIPTMKDRAMQALYLMALDPVAETTADNHSYGFRKERCTGDAIHQCYINLSK 176 Query: 149 CGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAA 208 E+ +W++EGD+ FD ++H L+ + +L K +K+G I Sbjct: 177 --ESSPQWILEGDIKGCFDHINHEWLLNNI------PMDKVMLRKWLKSGFIFNKQLFPT 228 Query: 209 SEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQW 268 EG PQGG+ISP L+N+ L+ L ++ R D + S +R + Sbjct: 229 EEGTPQGGIISPTLANMALDGLQTMLEAKF----HRVDLY----SPKRS----------Y 270 Query: 269 KPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR---LNMDKTKIPHVNDGF 325 P V RYADDF++ T E + +E +++ L+ R L+ +KTKI H+++GF Sbjct: 271 YPKVHLIRYADDFII----TSISKEMLEQEIMPMVKEFLQARGLTLSEEKTKITHIDEGF 326 Query: 326 IFLGHRLIRKRSRY 339 FLG + + + ++ Sbjct: 327 DFLGFNIRKYKGKF 340 >UniRef50_Q01P79 RNA-directed DNA polymerase (Reverse transcriptase) n=16 Tax=Bacteria RepID=Q01P79_SOLUE Length = 462 Score = 153 bits (387), Expect = 8e-36, Method: Compositional matrix adjust. 
Identities = 102/322 (31%), Positives = 159/322 (49%), Gaps = 44/322 (13%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 T+ RL+ + + E L A + ++KG +PGVDG+ ++ L +R Sbjct: 41 GTENPASTNRLMEEVCERENLKAALQRVKANKG--SPGVDGMTVIGIKDYLKQHWPAIRG 98 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 +LLSG Y+P P RRV I K +G +R LGIP + DR +Q+A++ ++ W+ F SYGF Sbjct: 99 QLLSGTYEPKPVRRVEIAKPDGGVRKLGIPTVLDRFIQQAVMQVLQRRWDRTFSDYSYGF 158 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP RS A+ + + E G W ++ DL +FD V+H LM + +RI+D R + Sbjct: 159 RPGRSAQQAVAQAQQYIA---EGHG-WCVDLDLEKFFDRVNHDKLMGQIAKRIADKRLLK 214 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 L+ + AG ++ GL + EG PQGG +SPLLSN++L+EFD+ L R Sbjct: 215 LIRAFLNAGVMENGLVSPSVEGTPQGGPLSPLLSNLVLDEFDRELERR------------ 262 Query: 250 WNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKL 309 + RYADD + V+ +A + + E + LKL Sbjct: 263 ---------------------GHRFVRYADDCNIYVRSERAG-QRVMESITQFITQKLKL 300 Query: 310 RLNMDKTKIPHVND----GFIF 327 ++N K+ + + GF F Sbjct: 301 KVNETKSAVARPQERKFLGFSF 322 >UniRef50_Q64E53 Prophage LambdaSa1 transcriptase/maturase family protein n=1 Tax=uncultured archaeon GZfos14B8 RepID=Q64E53_9ARCH Length = 430 Score = 153 bits (387), Expect = 9e-36, Method: Compositional matrix adjust. 
Identities = 86/208 (41%), Positives = 126/208 (60%), Gaps = 7/208 (3%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 L EA ++GA G+D V + L L ++ L Y P P +RVYIPK Sbjct: 62 LNEAWEKVKQNRGAG--GIDDVTIDEFERNLEQNLNEIQRLLRQDRYVPKPVKRVYIPKP 119 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 +GK RPLGIP +RDR+VQ+A+ +EPI+E++F S+G+RP +S AI ++ + D Sbjct: 120 DGKQRPLGIPTIRDRVVQQALKNVIEPIFEAEFLDSSFGYRPGKSAKQAIEQIE-TVRDE 178 Query: 150 GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAAS 209 G WV++ D+ ++FDTV+H L+ AV RISD R + L+ ++A ++ G RA + Sbjct: 179 GH---EWVVDADIKAFFDTVNHEKLIDAVAERISDGRVLGLIRAFLEADIMEQGQGRAKN 235 Query: 210 -EGVPQGGVISPLLSNIMLNEFDQYLHE 236 G PQGGVISPLL+NI L+ FD+ + E Sbjct: 236 VVGTPQGGVISPLLANIYLHYFDERMAE 263 >UniRef50_A6YEC9 Putative reverse transcriptase and intron maturase n=1 Tax=Chlorokybus atmophyticus RepID=A6YEC9_CHLAT Length = 755 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 115/359 (32%), Positives = 176/359 (49%), Gaps = 48/359 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 + T+A D +L+ I P+ L A + S G T GV+ L V Sbjct: 146 RFVTFAPLDAG----KLIHAIAHPDLLWLAYELIKSKPGNMT---RGVSTETLDGLSRVW 198 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKL--RPLGIPALRDRIVQRAMLMAMEPIWESD 121 + EL +G Y+ ARR+ IPK GK RPL + + R+++VQ+A+ M ++ ++E Sbjct: 199 IDKTSSELRAGKYRFGLARRIMIPKV-GKPGERPLTMASFREKVVQKAIQMVLQELFEPR 257 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP R H A LQ+ D G+WVIE D++ FDT+ H L+ +RR Sbjct: 258 FLNTSHGFRPGRGCHTA-----LQMVDQHFRGGKWVIEADITKCFDTIPHDKLLAVLRRH 312 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASE---GVPQGGVISPLLSNIMLNEFDQY---LH 235 I+ ++ + L+ +KAG++ +G +A+ E G PQG ++SPLL+NI L+ DQ+ L Sbjct: 313 ITCSKTLALIHSGLKAGYVVLG--KASQEQMVGTPQGSILSPLLNNIFLHLLDQFMERLS 370 Query: 236 ERYLSGKARKDRWYWNN-----SIQRGRSTAV-----------------RENWQWKPA-- 271 ++ GK R+ + S +G + A+ + Q P Sbjct: 371 
AKHTLGKTRRKNPEYRKLQSELSKNKGDADAMSKLRRRKLPRSGRIWLMQSKDQMDPGFR 430 Query: 272 -VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 +AY RYAD F++ V G + E+ R L+ L L LN KT I +DG FLG Sbjct: 431 RLAYVRYADHFLICVTGPHQLAVDVMEQVRTFLDKELGLELNQSKTLITKFSDGINFLG 489 >UniRef50_A7BYN3 RNA-directed DNA polymerase n=2 Tax=Beggiatoa sp. PS RepID=A7BYN3_9GAMM Length = 497 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 100/296 (33%), Positives = 151/296 (51%), Gaps = 40/296 (13%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSG--HYQPLPARRVYIPKSNGKLRPLGIP 99 G + G+DG+ +AR + +++ D L G +Y+ PA+R YIPK+NGKLRPLGIP Sbjct: 2 GKRSSGIDGLKYLTPKARERLAKRLM-DWALKGWDNYKAKPAKRKYIPKANGKLRPLGIP 60 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 DR++Q + A+EP +E+ F + SYGFRP + H AI + ++T +WV++ Sbjct: 61 TQEDRVIQHVIKSALEPFYEAQFESNSYGFRPAQGCHDAIEAI-FKITS---HEPKWVLD 116 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 D+ FD + H L + + + + W +KAG+ D G G PQGG+IS Sbjct: 117 ADIKGCFDNIDHNYLTECI---THGQKKLVKEW--LKAGYTDDGHIHPTKNGTPQGGIIS 171 Query: 220 PLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYAD 279 PLL+NI L+ + L ++ G + + R T V RYAD Sbjct: 172 PLLANIALDGLETNLRQKLQIG-------IYQTQFNQSRLTVV-------------RYAD 211 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHVNDGFIFLGHRL 332 DFV+I K K + +E + ++ LK L L+ +KTKI GF FLG + Sbjct: 212 DFVIIHKDKK-----VIKESQLIISQWLKKRGLELSPEKTKIVKTPQGFDFLGFNI 262 >UniRef50_D2CK02 Putative uncharacterized protein orf3 (Fragment) n=1 Tax=Candida viswanathii RepID=D2CK02_9ASCO Length = 801 Score = 152 bits (385), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 104/332 (31%), Positives = 161/332 (48%), Gaps = 33/332 (9%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPAR 82 I + L A S G TPG+ N T L L + ++L + ++ P + Sbjct: 216 FILNKDLLRTAYEKLKSRPGMMTPGI---NPTTLDGMSEERLDNIINKLRNKSFKFTPGK 272 Query: 83 RVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTV 142 R+ IPKSNGK RPL + + D++VQ M M +E I+E F +S+G+RP+RS H A+R + Sbjct: 273 RIIIPKSNGKRRPLTLGSPEDKLVQEVMRMVLEAIYEPLFLDVSHGYRPKRSCHSALRAI 332 Query: 143 KLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV 202 + C W IEGD+ S FD + H LMK + +I D F+ L+ K +KAG++ Sbjct: 333 FTKFKGC-----TWWIEGDIKSCFDDIPHDKLMKVLSNKIKDQSFLELIRKCLKAGYMYQ 387 Query: 203 GLFRAASEGVPQGGVISPLLSNIMLNEFD---------------QYLHE------RYLSG 241 + GVPQG VISP+L+NI L++ D +Y H+ +Y Sbjct: 388 YTNKTDIIGVPQGSVISPILANIYLHQLDLFIMNIKDSFDWKGPRYKHDIGHAKLQYQLR 447 Query: 242 KARK---DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 KA+K D+ + R+ + + + Y RY DD+++ + G+ Q I Sbjct: 448 KAKKAGSDKRVLHKMAVELRNHKMNFKGERTNKLTYVRYVDDWIVAINGSHKQAVEILSS 507 Query: 299 CRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLG 329 L L ++ +KTKI + D +FLG Sbjct: 508 ISEYCMNELGLTISPEKTKITNSYKDHILFLG 539 >UniRef50_A5VH22 RNA-directed DNA polymerase n=1 Tax=Sphingomonas wittichii RW1 RepID=A5VH22_SPHWW Length = 572 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 116/325 (35%), Positives = 159/325 (48%), Gaps = 50/325 (15%) Query: 45 TPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDR 104 TPGVDG +T LA L L + G Y+P P RRVYIPK NGK+RPLGIP DR Sbjct: 2 TPGVDG--QTFDGMTLA-RLDRLTQGVAEGRYRPRPVRRVYIPKGNGKMRPLGIPTADDR 58 Query: 105 IVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSS 164 IVQ A M + I+E F S+GFR RS H A+ ++ T +W+IE D+ Sbjct: 59 IVQEAARMILAAIYEPVFSKHSHGFRAGRSCHTALEEIRRTWTG-----AKWLIEVDVRG 113 Query: 165 YFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSN 224 +FD + H +L+ + RRI D F+ L+ +KAG +D F G PQGGVISPLL+N Sbjct: 114 FFDNIDHDILLSLLARRIDDPVFIDLIGTMLKAGCMDEWKFERTYSGTPQGGVISPLLAN 173 Query: 225 IMLNEFDQYLHE---RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVA-------- 273 I L+E D ++ E R+ G R+ + +Q + A+R+ AV Sbjct: 174 IYLHELDLFMEEMRARFDKGVKRRANPVY--VVQSQKIAALRKEIDAIRAVGADEAEVRT 231 Query: 274 -----------------------------YCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 YCRYADDF++ V G+KA I + + L Sbjct: 232 RLARIEAINRDRRKISSVDQMDPNFRRLRYCRYADDFLVGVIGSKADAVRIMADIQHFLA 291 Query: 305 GSLKLRLNMDKTKIPHVNDGFIFLG 329 L L ++ +KT + + G FLG Sbjct: 292 DRLNLTVSPEKTGVRDASRGSPFLG 316 >UniRef50_C3EEI5 Group II intron reverse transcriptase/maturase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EEI5_BACTU Length = 539 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 92/262 (35%), Positives = 153/262 (58%), Gaps = 17/262 (6%) Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 RL VE + + + + Y+P RRV+IPK NG RPLGIP + DRI Q+ +L ++PI Sbjct: 8 RLHVEDVVEKVKAMFSWYEPQTVRRVFIPKPNGDQRPLGIPTIWDRIFQQCILQVLDPIC 67 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ FH SYGFR RS HHA+ K + G ++ I D+ +FD V+H L+K + Sbjct: 68 EAKFHKHSYGFRSNRSTHHALGRFKNLINIAGFSQ---CIAIDIKGFFDNVNHGKLLKQI 124 Query: 179 -RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 I D + +++L + +K+ ++G +G PQGG++SPLLSNI LNE D ++ + Sbjct: 125 WSLGIRDKKLLSILSRLLKS---NIGGEGILEKGTPQGGILSPLLSNIALNELDWWVSNQ 181 Query: 238 YLSGKARKDRWYWNNSIQRG-RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + + ++ K +++ +S+ + + T ++E + RYADDF ++ + T++Q + Sbjct: 182 WETFRS-KYKYFMTSSMYKALKKTKLKECY-------IIRYADDFKILCR-TRSQANRMF 232 Query: 297 EECRGVLEGSLKLRLNMDKTKI 318 + L+ LKL +NM+K+KI Sbjct: 233 LAVKQFLKERLKLDINMEKSKI 254 >UniRef50_A1T776 RNA-directed DNA polymerase n=1 Tax=Mycobacterium vanbaalenii PYR-1 RepID=A1T776_MYCVP Length = 454 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 113/334 (33%), Positives = 164/334 (49%), Gaps = 46/334 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +QR L A R L + + + L EA ++GA GVD V ++ Sbjct: 30 LQRTLWAAAKQSQGRRFHALYDRVYRGDVLWEAWERVRKNRGA--AGVDRVTLVAVEE-Y 86 Query: 61 AVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 V+ L+ LR +L G Y P PARRV IPK G RPLGIP +RDR+ Q A + +EPI+ Sbjct: 87 GVDRMLRELRHDLREGVYCPAPARRVEIPKPRGGTRPLGIPTVRDRVAQAAAKIVLEPIF 146 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF + SYGFRP+RS A+ +++ + + +V+E D++++F + H L+ V Sbjct: 147 EADFMSCSYGFRPKRSATQAMERLRVGFIEGSQ----FVVEFDIANFFGEIDHDRLLAEV 202 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 RR+SD R + LL ++AG + G+ G PQGGVISPLL+NI L+ D L R Sbjct: 203 SRRVSDRRVLKLLRLWLQAGVMVDGVVSRTVAGTPQGGVISPLLANIYLHVLDTELARRN 262 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + RYADD V++ + A+ Sbjct: 263 VG--------------------------------ELVRYADDGVVLCRSAAQAEHALAAV 290 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV---NDGFIFLG 329 G + SL LRL+ DKTK+ + +G FLG Sbjct: 291 --GEILASLGLRLHPDKTKVVDLREGGEGLDFLG 322 >UniRef50_B8R181 Putative intron-encoded reverse transcriptase n=2 Tax=Volvox carteri RepID=B8R181_VOLCA Length = 598 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 107/344 (31%), Positives = 184/344 (53%), Gaps = 16/344 (4%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 P +A A R +KG+ TPGVD V+ L+++ + + I + ++ +Y+P RRV+I Sbjct: 45 PRNIALAFRNLKFNKGSKTPGVDDVHIGNLKSK-PLNIFIRDIQKMAKNYKPSLVRRVWI 103 Query: 87 PKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 PK NGK RP+GIP L DR+ Q+ + +EPI E+ FH SY FRP RS A+ + Sbjct: 104 PKPNGKKRPIGIPTLADRLFQQCIKQVIEPICEAKFHPHSYVFRPNRSTSDALARALFLM 163 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHH-RLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLF 205 + +V++ D+ S+FDT+ H +LL + I D + ++++ K +KA + G+ Sbjct: 164 ---NQNELHYVVDIDIQSFFDTIDHGKLLKQCWAIGIRDKKILSIMSKMLKAEVVGEGI- 219 Query: 206 RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVREN 265 +++G PQGG++SPLLSNI LNE D + ++ + ++ + + R A+R N Sbjct: 220 --STKGTPQGGILSPLLSNICLNELDWWYTSQWATFPT---KYPYKRTSHAAR--ALRGN 272 Query: 266 WQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDG 324 + K RYADDF + + K V+ I R L+ L L+++ +K+ I ++ Sbjct: 273 SKLK-EFHSVRYADDFKIFCRDYKTAVK-IFAATRLWLKDRLNLQISSEKSSITNLRKKS 330 Query: 325 FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 FLG L ++ G+ S + + +++ A + K++ Sbjct: 331 SPFLGISLRACHNKNGKFSCRSRVLDKAMETMKSTIKAAIIKLQ 374 >UniRef50_Q7YAJ3 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ3_CHAVU Length = 760 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 106/294 (36%), Positives = 151/294 (51%), Gaps = 28/294 (9%) Query: 45 TPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDR 104 TPG D +T + +L LRD ++ G ++ A R+ P N K RPLG+ +DR Sbjct: 323 TPGPD---QTTIDGTSLEKLLKLRDAIVKGEFE-WGATRIPKPGKNEK-RPLGVSCFQDR 377 Query: 105 IVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG-RWVIEGDLS 163 IVQ + M +EPI+E F T S+GFRP RS H A+ + G G +W IEG L Sbjct: 378 IVQEVLRMILEPIYEPRFSTYSHGFRPGRSAHTALNVI------MGTFHGAQWYIEGSLE 431 Query: 164 SYF-DTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPL 221 + V+H L K +RR I D R + L+ ++A H+ G A+ G + +SPL Sbjct: 432 AEGPGAVNHGTLYKIIRRTIRDKRILKLIRSGLQAFFHMPHGEIEEATIGAARPFGLSPL 491 Query: 222 LSNIMLNEFDQYLHERYLS-GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADD 280 LSNI LNE D ++ KAR R+ A R N + + Y R+ADD Sbjct: 492 LSNIYLNELDHFIETTIREYNKAR-------------RADASRLNPTNRRRMHYIRFADD 538 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR 334 F++ + G +++ +R E L L+L L+MDKT I HV+ G FLGH + R Sbjct: 539 FLVAISGPRSEAVKLRSELESFLRDKLQLTLSMDKTHITHVSKGVPFLGHNIFR 592 >UniRef50_Q0S063 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Actinomycetales RepID=Q0S063_RHOSR Length = 459 Score = 151 bits (382), Expect = 3e-35, Method: Compositional matrix adjust. 
Identities = 116/350 (33%), Positives = 168/350 (48%), Gaps = 53/350 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ--- 57 +Q L A DP R L+ +++ + L + GA PG+D + ++ Sbjct: 22 LQHALYRAAKVDPGRRFHALMDKVSRRDVLWRGWVAVRRNNGA--PGIDRITLEEVEEYG 79 Query: 58 -ARLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAME 115 ARL EL + EL G Y+PLPARRV+IPK + RPL IP++RDRIVQ A + E Sbjct: 80 VARLLDELAV---ELKEGSYRPLPARRVFIPKPGTVEQRPLSIPSVRDRIVQAAWKLVAE 136 Query: 116 PIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM 175 P++E+DF S+GFRP R H A++ L D RWV+E D+++ F+ + LM Sbjct: 137 PVFEADFLPCSFGFRPRRGAHDALQV----LIDESWRGCRWVVETDIANCFEAIPIEKLM 192 Query: 176 KAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 +AV R+ D F+ LL ++AG ++ G R G PQGGV S LL N+ L+ D+ Sbjct: 193 QAVEERVCDQPFLKLLRVMLRAGVMEEGQVRRPVTGTPQGGVASALLCNVYLHRLDR--- 249 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 W RYADD +++ + ++ Q EA Sbjct: 250 -----------------------------AWDVDEHGVLVRYADDALVMCR-SRRQAEAA 279 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHV---NDGFIFLG--HRLIRKRSRYG 340 R +L L L KT+I H+ +G FLG HRL+ +R G Sbjct: 280 LTRLRELL-ADLGLEPKEAKTRIVHLRVGGEGVDFLGFHHRLVNAPARPG 328 >UniRef50_A7MS60 Putative uncharacterized protein n=21 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7MS60_VIBHB Length = 430 Score = 151 bits (382), Expect = 4e-35, Method: Compositional matrix adjust. 
Identities = 109/305 (35%), Positives = 153/305 (50%), Gaps = 43/305 (14%) Query: 30 LAEAARITLSSKGAHTPGVD--GVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIP 87 L +A R +KG GVD ++ T+ + R A Q LR LL G Y+P P V IP Sbjct: 10 LNQALRRVKKNKGC--AGVDKLDIDATIFKLRQASNGQALRQSLLDGSYRPQPVLGVGIP 67 Query: 88 KSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT 147 K +G +R LGIP + DRIVQ+A+ + I+E+ F SYGFRP RS HHA+ + Sbjct: 68 KPSGGVRQLGIPTVLDRIVQQAITSVLSDIYEAKFSNSSYGFRPNRSAHHALAAASRYIR 127 Query: 148 DCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRA 207 E RG +V++ DL+ YFDTV+H LM + I+D R + L+ ++AG + GL Sbjct: 128 ---EGRG-YVVDIDLAKYFDTVNHDRLMHRLSEDIADKRVLKLIRSYLQAGIMRNGLVEQ 183 Query: 208 ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQ 267 G PQGG +SPLLSNI+L+E D+ L R Sbjct: 184 RQRGTPQGGPLSPLLSNIVLDELDKELERR------------------------------ 213 Query: 268 WKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIF 327 +CRYADD + V G++ ++E LE LKL +N +K+ V + + Sbjct: 214 ---GHKFCRYADDCQIYV-GSEEAAYRVKESITEYLEQKLKLTVNREKSAATRVTE-RTY 268 Query: 328 LGHRL 332 L HR Sbjct: 269 LSHRF 273 >UniRef50_C1BDP2 Putative RNA-directed DNA polymerase n=2 Tax=Rhodococcus opacus B4 RepID=C1BDP2_RHOOB Length = 605 Score = 151 bits (382), Expect = 4e-35, Method: Compositional matrix adjust. 
Identities = 107/309 (34%), Positives = 150/309 (48%), Gaps = 31/309 (10%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 L ++T + G T G+DG +AR + +++ ++P+P RRVYIPK+ Sbjct: 68 LVSVRQVTQRNAGRRTAGIDGETALSPEARANMAVRVHESR---SSWEPVPVRRVYIPKA 124 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 GK RPLGIP + DR Q + A+EP WE+ F SYGFRP RS AI + L Sbjct: 125 GGKRRPLGIPVVMDRCHQARVRTALEPEWEARFEARSYGFRPGRSCADAIGALYSTLNGS 184 Query: 150 GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAAS 209 R W+++ DLS+ FD + H L+ + L+ K + AG I+ G F ++ Sbjct: 185 RAKRV-WILDADLSAAFDRIDHPRLLDT----LGSFPARELIGKWLTAGVIENGRFASSE 239 Query: 210 EGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK 269 EG PQGGVISPLL N+ L+ ++ RYL D RS Sbjct: 240 EGTPQGGVISPLLLNVALHGLEEAAGVRYLKAADTLD----------ARSV--------- 280 Query: 270 PAV-AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIF 327 P RYADD V ++ Q E +++ G L L N DKT I H+ +GF F Sbjct: 281 PGTPVLVRYADDLVACCH-SRQQAELVKDRLAGWL-APRGLAFNEDKTHIVHLEEEGFDF 338 Query: 328 LGHRLIRKR 336 LG+ + R R Sbjct: 339 LGYNIRRYR 347 >UniRef50_UPI0001C42A66 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42A66 Length = 624 Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust. 
Identities = 112/314 (35%), Positives = 164/314 (52%), Gaps = 18/314 (5%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDELLSGHYQP 78 LL +I E + A ++KG++T G DG +LQ + +R L+ Y P Sbjct: 40 LLEIIQSDEVILTAIHKIKANKGSNTKGTDGETIDDILQDGYESVISRVRKCFLA--YNP 97 Query: 79 LPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 RRV+I K K RPLGIPA+ DRI+Q + M +EPI E+ F + SYGFRP RS H Sbjct: 98 KLLRRVHIDKQVSKDKRPLGIPAIIDRIIQECIRMIIEPILEAQFFSHSYGFRPYRSAEH 157 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR-ISDARFMTLLWKTIK 196 A+ V D T WV+EGD+ +FD V+H +L+K + I D R + ++ ++ Sbjct: 158 ALSKVTNTAYD---TNYCWVVEGDIKKFFDNVNHTILIKKLYSMGIRDRRVLMIIKAMLQ 214 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 G + G + G PQGG+ISPLL+N L+ D ++ R K K + S Sbjct: 215 CGVL--GEAEQTTVGTPQGGIISPLLANAYLDSLDHWI-TREWENKETKHEY----SRLD 267 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 G+ A++ KPA + RYADD+VLI +KA ++ L+ LKL L+ +KT Sbjct: 268 GKYRALKNASNLKPA-HFVRYADDWVLIT-NSKANAIKWKQRIAKHLKEQLKLELSEEKT 325 Query: 317 KIPHVNDGFI-FLG 329 I ++ I F+G Sbjct: 326 LITNIKKKAIKFVG 339 >UniRef50_Q74P60 Group II intron reverse transcriptase/maturase n=13 Tax=Bacilli RepID=Q74P60_BACC1 Length = 627 Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust. 
Identities = 109/336 (32%), Positives = 183/336 (54%), Gaps = 26/336 (7%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 ++L+++IT + A R + G+ T G+DGV ++ +L+ E I + +Y Sbjct: 39 FKQLMKVITSESNILLAFRNIKRNSGSITEGIDGVTIKDVE-KLSQEDFIKIVQKRFSNY 97 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P RRV IPK NGK RPLGIP++ DRI Q+ + +EPI E+ F+ S+GFRP RS Sbjct: 98 TPRKVRRVEIPKPNGKTRPLGIPSMWDRIAQQCIKQVLEPICEAKFNKHSHGFRPNRSPE 157 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTI 195 A+ L++ + ++V+ D+ +FD V+H+ LM+ + I D + + ++ K + Sbjct: 158 TAMADATLRV---NRSHMQYVVNVDIQGFFDEVNHKKLMRQLWTMGIRDKQLLVIIRKML 214 Query: 196 KAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEF---------DQYLHERYLSGKARK 245 KA + G + ++G PQGG++SPLL+NI LNEF D+ L E L+ K Sbjct: 215 KAPIVLPNGEMQYPNKGTPQGGILSPLLANINLNEFDWWITNQWEDRLLKELSLTIKKGG 274 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 + + + ++TA++E + RYADDF I TK+ + I + C L+ Sbjct: 275 HVDKYPHYSKMRKTTALKEMY-------IVRYADDFK-IFTATKSNAQKIFKACEMWLQE 326 Query: 306 SLKLRLNMDKTKIPHV-NDGFIFLGH--RLIRKRSR 338 LKL ++ +K+KI ++ + FLG ++++K S+ Sbjct: 327 RLKLPISKEKSKITNLRKESSEFLGFEIKMVKKGSK 362 >UniRef50_B8I7I5 RNA-directed DNA polymerase (Reverse transcriptase) n=29 Tax=Bacteria RepID=B8I7I5_CLOCE Length = 618 Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust. 
Identities = 113/364 (31%), Positives = 194/364 (53%), Gaps = 38/364 (10%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE--LQ 65 +A + L+++I+ E + A R S+ G+HT G D +N ++ +L+VE ++ Sbjct: 23 YAQSKQGKSFTNLMKVISSEENIRLAYRNIKSNSGSHTSGTDTLNIKDIE-KLSVEKLVE 81 Query: 66 ILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 +++ +L YQP P +RV IPK NGK RPLGIP + DR+VQ+ +L +EPI E+ F+ Sbjct: 82 MMQRKL--AWYQPKPVKRVEIPKPNGKTRPLGIPTIVDRLVQQCILQVLEPICEAKFYER 139 Query: 126 SYGFRPERSVHHAI----RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RR 180 S GFRP RS HA+ R V+ Q +V++ D+ +FD V+H L++ + Sbjct: 140 SNGFRPNRSAEHAMAQCYRMVQKQ-------NLYFVVDVDIKGFFDNVNHSKLIRQMWAM 192 Query: 181 RISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 I D + + ++ + +KA + G ++G PQGG++SPLL+NI+LNE D ++ ++ Sbjct: 193 GIRDKQLICIIKQMLKAPVVMPDGETLYPTKGTPQGGILSPLLANIVLNELDWWISSQWE 252 Query: 240 SGKARKDRWYW---NNSIQRG------RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKA 290 ++ + N S+ + R +A++E + RYADDF + + ++ Sbjct: 253 DMLTHREYYVSVNNNGSLNKSGVFRTLRRSALKEMY-------IVRYADDFKIFCR-KRS 304 Query: 291 QVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGEMRVVSTIP 349 I + L+ LKL ++ +K+K+ ++ + FLG + K R G VV + Sbjct: 305 DANKIFVAVKKWLKDRLKLEISEEKSKVVNLKKHYSEFLGFQF--KTVRKGRKFVVRSHM 362 Query: 350 QEKA 353 EKA Sbjct: 363 SEKA 366 >UniRef50_Q9G8T2 Orf762 n=2 Tax=Eukaryota RepID=Q9G8T2_RHDSA Length = 762 Score = 150 bits (380), Expect = 5e-35, Method: Compositional matrix adjust. 
Identities = 102/340 (30%), Positives = 174/340 (51%), Gaps = 31/340 (9%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 D + +RL+ +I + L A S G G ++ T+ LA ++ + + Sbjct: 171 DKNFVNKRLINIIGDVQTLIVAYEFVKSKPGQMVKG--SIDSTLDDIDLA-WIKSISKVI 227 Query: 72 LSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 +G ++ +P+RR+Y+ K+ K RP+ RD++VQ+A+ + +EPI+E+ F S+GFR Sbjct: 228 KAGKFKFIPSRRIYVSKTGCKERRPIMTGFPRDKLVQKAIQLVLEPIYENVFLENSHGFR 287 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P R H A++++K G T WVIE D++S F +V+H +L+ ++ RI + + L Sbjct: 288 PARGCHTALKSIKQGFH--GVT---WVIESDIASCFSSVNHEVLLSIIKERIKCVKTLAL 342 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER-----YLSGK--- 242 + +++G++D+G F + G+PQG +SPLL NI L++FD +++E Y S K Sbjct: 343 IRNLLESGYVDLGAFCKSKLGIPQGSSLSPLLCNIYLHKFDTFMYELKQRFVYTSSKDPR 402 Query: 243 -----ARKDRWYWNNS--------IQRGRSTAVRENWQWKP-AVAYCRYADDFVLIVKGT 288 R R N IQ R T ++ + K + Y RYADDF + + G Sbjct: 403 INPAYKRLQRQIQNTPGLVEKSKFIQELRKTPSKDLFDPKYRRLFYIRYADDFSIGITGQ 462 Query: 289 KAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFL 328 K I ++ + L LK+ L K ++ H+ IF Sbjct: 463 KKDAVEILDQAKIFLSEELKMDLKESKIRVVHLKKQSIFF 502 >UniRef50_C6MRB5 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MRB5_9DELT Length = 433 Score = 150 bits (379), Expect = 7e-35, Method: Compositional matrix adjust. 
Identities = 96/260 (36%), Positives = 142/260 (54%), Gaps = 11/260 (4%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 +RL+ I + A + +KGA PGVD + T L+ L +++ELL+G Y Sbjct: 171 ERLMEEIVSRGNMMAAYSKVVGNKGA--PGVDNMPVTELKGYLQEHWPRIKEELLAGKYI 228 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P R+V IPK +G R LGIP + DR++Q+A+ + ++ F SYG+ P RS H Sbjct: 229 PQPVRKVEIPKPDGGKRMLGIPTVLDRLIQQAVSQVLGRLFIPCFSKHSYGYIPGRSTHQ 288 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 AI+ + + E R RW ++ DL +FD V+H +LM V+R++ D + + L+ +KA Sbjct: 289 AIQAARQYVA---EGR-RWAVDIDLEKFFDRVNHDILMSLVKRKVKDRQVLKLIDSYLKA 344 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G GL EG PQG +SPLLSNIML+E D+ L +R G A R+ +I Sbjct: 345 GMFIGGLVSPRQEGTPQGSPLSPLLSNIMLDELDKELEKR---GHAFC-RYAVMPTIATS 400 Query: 258 RSTAVRE-NWQWKPAVAYCR 276 R N W+P+ A+CR Sbjct: 401 MLPPRRAVNGSWQPSPAFCR 420 >UniRef50_Q7UY81 Reverse transcriptase/maturase n=1 Tax=Rhodopirellula baltica RepID=Q7UY81_RHOBA Length = 459 Score = 150 bits (379), Expect = 7e-35, Method: Compositional matrix adjust. 
Identities = 112/340 (32%), Positives = 163/340 (47%), Gaps = 46/340 (13%) Query: 33 AARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGK 92 +AR + KGA GVD + + E++ L ++L +G Y+P RRV IPK K Sbjct: 82 SARKVVGKKGA--AGVDRQSTEDFSEKEIAEIKQLYEQLRTGTYRPQAVRRVQIPKPGSK 139 Query: 93 -LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGE 151 RPLGIP +RDR+VQ A++ +EPI++++FH S+GFR RS H A+R V+ L E Sbjct: 140 QTRPLGIPTVRDRVVQTALVNVIEPIFDNEFHERSFGFRHGRSCHDALRVVEELL----E 195 Query: 152 TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEG 211 T +V++ DL YFDT+ L+ V +ISD R + L+ + + ++ G Sbjct: 196 TDHVFVVDADLQGYFDTIPKDRLLALVSEKISDRRVLDLVKRFLDQSILEELREWTPESG 255 Query: 212 VPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA 271 VPQG V+SPLLSN+ LNE D + D Y Sbjct: 256 VPQGAVLSPLLSNLYLNELDHRM----------ADLGY---------------------- 283 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHR 331 RYADDFV++ + + A+ E R V E L L VN F FLG+ Sbjct: 284 -EMVRYADDFVILCRSQEQAELALEEVKRFVCEAGLTLHPEKTHIVDSRVN-SFDFLGYS 341 Query: 332 LIRKRSRY----GEMRVVSTIPQEKARNFAASLTALLWKV 367 R + R+ ++V TI + R SL A + ++ Sbjct: 342 F-RGKLRFPRAKSHQKMVDTIRRLTPRKSGQSLEATIVQI 380 >UniRef50_A2TD24 Intron encoded protein n=2 Tax=Bacillaceae RepID=A2TD24_BACSO Length = 604 Score = 150 bits (378), Expect = 9e-35, Method: Compositional matrix adjust. 
Identities = 108/327 (33%), Positives = 166/327 (50%), Gaps = 29/327 (8%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS-- 73 R + L+ + + + A S++G T G DG KT+ L + DE ++ Sbjct: 16 RFKGLVEIASSDVVIVSAIHKIKSNQGNSTAGTDG--KTISDI-----LTLNYDEAINFV 68 Query: 74 ----GHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 Y P P RRV+IPK K RPLGI + DRI+Q + M +EPI E+ F SYG Sbjct: 69 KRCFKKYTPNPIRRVHIPKPGKKEKRPLGILTIADRIIQECVRMVIEPILEAQFFQHSYG 128 Query: 129 FRPERSVHHAI-RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDAR 186 FRP R AI R V + C WVIEGD+ +FD V+H +L+K + I D R Sbjct: 129 FRPYRDAKQAIERCVFI----CNRIGYNWVIEGDIKGFFDNVNHTILIKQLWHMGIRDRR 184 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + ++ +KAG I + G PQGG+ISPLL+N+ L++ DQ++ + K R Sbjct: 185 MLMIIKAMLKAGVIKET--KINEMGTPQGGIISPLLANVYLHKLDQWITREWEEKKMRN- 241 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 +I+ + ++R++ Y RYADD++L ++ E + + L+ + Sbjct: 242 ----GTTIRTAKYKSLRDHSTITKPEFYVRYADDWILFT-NSRGNAEKWKYRIKKYLKEN 296 Query: 307 LKLRLNMDKTKIPHVNDGFI-FLGHRL 332 LKL L+ DKT I ++ + FLG ++ Sbjct: 297 LKLELSDDKTLITNIKKKPMKFLGFKI 323 >UniRef50_B1I9Z1 GBSi1, group II intron, maturase n=33 Tax=Firmicutes RepID=B1I9Z1_STRPI Length = 425 Score = 149 bits (376), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 99/302 (32%), Positives = 146/302 (48%), Gaps = 40/302 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 + +LL I E + EA S+KG + G+DG+ + L ++ ++ + Y Sbjct: 1 MSKLLDKILSRENMLEAYNQVKSNKG--SAGIDGMTIEEMDNYLRQNWRLTKELIKQRKY 58 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 +P P +V IPK +G +R LGIP + DR++Q+A++ + PI E F +SYGFRP RS Sbjct: 59 KPQPVLKVEIPKPDGGIRQLGIPTVMDRMIQQAIVQVISPICEPHFSDMSYGFRPNRSCE 118 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 AI L D E W+++ DL +FDTV LM V I D +L+ K + Sbjct: 119 KAIMKFLEYLNDGYE----WIVDIDLEKFFDTVPQDRLMSLVHNIIEDGDTESLIRKYLH 174 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 +G I G G PQGG +SPLLSN+MLNE D+ L +R Sbjct: 175 SGVIINGQRHKTLVGTPQGGNLSPLLSNVMLNELDKELEKR------------------- 215 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 + + RYADD V+ V G++A + + +E L L++NM K Sbjct: 216 --------------GLRFVRYADDCVITV-GSEAAAKRVMYSASRFIEKRLGLKVNMTKA 260 Query: 317 KI 318 KI Sbjct: 261 KI 262 >UniRef50_A1ZX33 Group II intron-encoded protein LtrA n=2 Tax=Bacteria RepID=A1ZX33_9SPHI Length = 594 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 104/348 (29%), Positives = 175/348 (50%), Gaps = 43/348 (12%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGV-----DGVNKTMLQARLAVELQILRD 69 L I+R+ RL+ P A +KGA T G+ DG++ +Q + Sbjct: 19 LPIERVYRLLYNPNLYLLAYSNLYGNKGALTSGITPETADGMSLDKIQDIIC-------- 70 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 +L Y+ P++R++IPK NG+ RPL IP D+++Q + + +E +E F S+GF Sbjct: 71 KLKQESYRWKPSKRIFIPKKNGQPRPLSIPCWSDKLLQEVIRLILEAYFEPQFCESSHGF 130 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R R H A++ ++L+ +W IEGD+ FD ++H+L++K + ++ D RF+ Sbjct: 131 RTGRGCHSALKQMRLKGKG-----SKWFIEGDIQGCFDNINHQLIIKLLSDKLYDPRFIR 185 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL----SGKARK 245 L+ + +K G+I+ + GVPQG +I P+L+NI+LNE D+++ + + GK R+ Sbjct: 186 LISQLLKTGYIEGWKYNKTYSGVPQGSIIGPILTNIVLNELDKFVENKLIPANTKGKRRR 245 Query: 246 D-------RWYWNNSIQRGRSTAVRE-NWQWKPAVA------------YCRYADDFVLIV 285 + + + ++G RE N Q + + Y RYA+D +L Sbjct: 246 SCPKYALIKRQASKARKQGDMDKCRELNKQAQKIPSRDTNDPKYRRLWYIRYANDTLLGY 305 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRL 332 G K + I+E+ L L L LN DKT I H + FLG+ + Sbjct: 306 IGKKEEAIKIKEQIADFLANELHLTLNSDKTLITHAQSQKASFLGYHI 353 >UniRef50_Q8HQ89 ORF777 (Fragment) n=1 Tax=Schizosaccharomyces octosporus RepID=Q8HQ89_SCHOT Length = 777 Score = 148 bits (374), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 88/301 (29%), Positives = 148/301 (49%), Gaps = 49/301 (16%) Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P+R++ + N + R + I +DR++Q+A M ++ +E F S+GFRP RS H AI Sbjct: 258 PSRKIMVTCRNNQKREISIANGKDRVIQQAFKMILQSAYEPIFLNYSHGFRPGRSPHSAI 317 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V+ TR W+I+GD+ +FD V H +L + ++I D + WK ++AG+ Sbjct: 318 FEVR------KWTRITWMIKGDIKGFFDNVDHHILANLLSKKIKDKNLIDFYWKLVRAGY 371 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------------------HERYLS 240 ++ G ++ ++ G+PQGG++SPLL+NI L+E D Y+ H R S Sbjct: 372 VNNGNYKVSNLGIPQGGILSPLLANIYLHELDVYMEQLIQKYTVNKPVSKKNKEHTRLFS 431 Query: 241 GKARKDRWYWNN---------------SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 K + + + S R ST R + Y RY DD+V+ V Sbjct: 432 EITAKSKKKFPDFELIKRMRKELRRIPSTIRDSSTGTR--------IYYNRYGDDYVIGV 483 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGEMRV 344 G K E I+ L+ L + LN +K++I H+ + +LG+ + R+ +Y E ++ Sbjct: 484 VGPKNLAETIQNLVSDFLKNELLIDLNKEKSQITHLTSKSLKYLGYEIFRRNRKYSESQL 543 Query: 345 V 345 Sbjct: 544 T 544 >UniRef50_B7CEC9 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CEC9_9FIRM Length = 458 Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust. 
Identities = 101/326 (30%), Positives = 158/326 (48%), Gaps = 36/326 (11%) Query: 41 KGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA 100 K PG+D V K + Q L + L L S Y+P P RRV I K NGK RPLGIP Sbjct: 66 KKDKAPGIDMVTKEVYQENLNENIDDLMHRLKSFSYKPQPVRRVEIDKGNGKKRPLGIPV 125 Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 DR+ Q AM + ++E F SYGFRP R H AI+ + + + ++++ Sbjct: 126 YEDRLFQGAMADILSDVYEPRFLDCSYGFRPNRKAHDAIKVINDTVM---HKKINYILDC 182 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISP 220 D+ +FD V+H LMK +R I+D ++ + + +K+G + G + S G PQGG+ISP Sbjct: 183 DIKGFFDNVNHEWLMKFLRNDIADPNYLKYIARMLKSGVMIEGKYEDTSVGTPQGGLISP 242 Query: 221 LLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADD 280 +L+N+ YLH Y+ D W+ I++ Q R+ADD Sbjct: 243 ILANV-------YLH--YVL-----DLWF-EKCIKK----------QLCGEAYLVRFADD 277 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI------PHVNDGFIFLGHRLIR 334 F+++ + + + + E +E L L+ +KT+I + F FLG Sbjct: 278 FLIMFQYER-DAQRVYEAVINRME-LFGLELSKEKTRILPFGRYSDSRETFDFLGFTHFN 335 Query: 335 KRSRYGEMRVVSTIPQEKARNFAASL 360 ++R G V I ++K + F ++L Sbjct: 336 SKTRKGYYSVGHKISRKKKKQFKSNL 361 >UniRef50_B7C9E4 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C9E4_9FIRM Length = 623 Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust. 
Identities = 114/351 (32%), Positives = 183/351 (52%), Gaps = 24/351 (6%) Query: 1 MQRKLATWAATDPSLRI-QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ+ + A S + L+ +I+ P + A R + G+HT G DG +T+ Sbjct: 19 MQKTFDSLYADSKSGEVFGHLMDIISAPSNIKLAFRNIKGNDGSHTAGTDG--RTIESLA 76 Query: 60 LAVELQILRDELLSGH---YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEP 116 + E + ++ L+ Y+P +RV IPK NGK+RPLGIP + DRIVQ+ +L MEP Sbjct: 77 VMPEDKFVK--LIQKQFRRYEPKAVKRVEIPKPNGKMRPLGIPCIIDRIVQQCILQVMEP 134 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I E+ F+ SYGFRP RS +AI + L + +V++ D+ +FD V HR L+K Sbjct: 135 ICEAKFYEHSYGFRPCRSAENAI-SYAYGLAQ--RNKLHYVVDVDVKGFFDNVDHRKLLK 191 Query: 177 AV-RRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL 234 + I D + + ++ +KA + G S+G PQGG++SPLL+NI+LNE D ++ Sbjct: 192 QIWTLGIRDTKLIQIIKAMLKAPIEMPDGENVLPSKGTPQGGILSPLLANIVLNELDWWI 251 Query: 235 HE------RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGT 288 R++ K +Y N + ++ S + K + RYADDF + + T Sbjct: 252 ASQWDEMVRHMKHPC-KVTYYPNGAEKKCNSYTALKKSNLK-EMRIVRYADDFKIFCR-T 308 Query: 289 KAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRL-IRKRS 337 K E + L LKL ++ +K+K+ ++ FLG R+ +R++S Sbjct: 309 KEDAEKTYYAVKDWLWKRLKLEVSDEKSKVTNLRKRDSEFLGFRIKLRRKS 359 >UniRef50_C4ZES6 RNA-directed DNA polymerase n=27 Tax=Bacteria RepID=C4ZES6_EUBR3 Length = 554 Score = 147 bits (370), Expect = 7e-34, Method: Compositional matrix adjust. 
Identities = 106/311 (34%), Positives = 153/311 (49%), Gaps = 51/311 (16%) Query: 31 AEAARITLSSKGAHTPGVDG-VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 A A + S+KG +T GVD + KT A+E +L Y+P P RRVYIPK Sbjct: 61 ALAVKRVTSNKGKNTAGVDHELWKTPKGKFEAIE------KLKRRGYKPQPLRRVYIPKK 114 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 NGKLRPL IP + DR +Q A+EP+ E+ SYGFR RS H AI L Sbjct: 115 NGKLRPLSIPTMTDRAMQTLYKFALEPLAETLADPNSYGFRIGRSTHDAIGQCFNDLCRA 174 Query: 150 GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAAS 209 G +W++EGD+ FD + H L+ + D + +L K +K G ++ Sbjct: 175 GSP--QWILEGDIKGCFDHISHNWLLANI---PMDKK---MLGKWLKCGFVETKKLFPTE 226 Query: 210 EGVPQGGVISPLLSNIMLNEFDQYLHERY-----LSGKARKDRWYWNNSIQRGRSTAVRE 264 EG PQGG ISP+L N+ L+ ++ L ER+ ++GK D+ Sbjct: 227 EGTPQGGTISPVLMNMTLDGLERILKERFPMRRTVAGKTVYDQ----------------- 269 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHV 321 + + RYADDF++ T E +R E +++ L L+L+ +KT I H+ Sbjct: 270 -------INFVRYADDFIV----TGKSPETLRNEVMPLIKDFLAERGLQLSEEKTVITHI 318 Query: 322 NDGFIFLGHRL 332 +DGF FLG + Sbjct: 319 SDGFDFLGQNV 329 >UniRef50_B0K6R3 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Thermoanaerobacter RepID=B0K6R3_THEPX Length = 427 Score = 147 bits (370), Expect = 8e-34, Method: Compositional matrix adjust. 
Identities = 105/340 (30%), Positives = 163/340 (47%), Gaps = 45/340 (13%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 ++Q L+ + PE L +A + K A GVD V + ++ L ++ Sbjct: 17 KVQNLISYVN-PETL-KAKHEEMPKKKA--SGVDKVTWEEYDVNVDENVETLIAKMKRFS 72 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y+P PARRVYIPK+NGKLRPLGIP D++V M + ++E+ F SYGFRP RS Sbjct: 73 YRPQPARRVYIPKANGKLRPLGIPCYEDKLVAAVMADILNEVYENIFLDTSYGFRPGRSC 132 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H AI+ + + C + +V+E D+ +FD V + LM+ + I D F + + + Sbjct: 133 HDAIKELNRIIGRC---KISYVLEADIKGFFDNVDQKQLMEFIAHDIDDKNFSRYIVRFL 189 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 K+G ++ G + + +G QG +SP+L+NI YLH D W+ Sbjct: 190 KSGIMEEGKYHESDKGTAQGSPLSPILANI-------YLHYTL-------DVWF------ 229 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 ++ N +++ RYADDFV++ + K+ + + E + L L MDK Sbjct: 230 ----AYLKRNGKFRGEAYIVRYADDFVMLFQ-YKSDADKMYEALPKRM-AKFGLELAMDK 283 Query: 316 TKI--------PHVNDG----FIFLGHRLIRKRSRYGEMR 343 TKI + DG F FLG +R G+ R Sbjct: 284 TKILPFGRFAKQNSKDGKTETFDFLGFTFSNGTTRNGKYR 323 >UniRef50_C3KST3 Group II intron reverse transcriptase/maturase n=10 Tax=Firmicutes RepID=C3KST3_CLOB6 Length = 626 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. 
Identities = 105/338 (31%), Positives = 175/338 (51%), Gaps = 24/338 (7%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVD--GVNKTMLQARLAVELQILRDELLS 73 + + L++ IT E + A R + G+HT G + +N ++ + ++ +R L Sbjct: 42 QFKNLIKTITSKENILLAYRNIKKNDGSHTKGTNHKTINDIAGESEDEI-IEYVRKRL-- 98 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 + P +R+YIPK+NG RPLGIP + DR++QR++L +EPI E+ FH SYGFRP R Sbjct: 99 NKFYPHSVKRIYIPKNNGDKRPLGIPTIEDRLIQRSILQVLEPICEAKFHPHSYGFRPNR 158 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV----RRRISDARFMT 189 S HAI +T + +V++ D+ +FD V+H L+K + + + ++ Sbjct: 159 STEHAIARA---MTLINMNKLHYVVDVDIKGFFDNVNHGKLLKQLWTLGIKDKKLIKIIS 215 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD--- 246 L+ +KA D + +G PQGG+ISPLL+N++LNE D ++ ++ + + + + Sbjct: 216 LM---LKAQIKDGSMITNPVKGTPQGGIISPLLANVVLNELDWWISSQWETFETKHNYSK 272 Query: 247 -RWYWNNSIQRGRSTAVRENWQWKPAVAY-CRYADDFVLIVKGTKAQVEAIREECRGVLE 304 R + N + +S R K Y RYADDF + K K E I + L+ Sbjct: 273 LRTFKNGTTTIDKSHKYRALRNGKLKEIYIVRYADDFKVFCKNPK-DAEKIFIAIKLWLK 331 Query: 305 GSLKLRLNMDKTKIPHVNDGFI-FLGHRLI--RKRSRY 339 L L + +K+K+ ++ FLG L +KR +Y Sbjct: 332 ERLDLETSPEKSKVTNLRKHPTEFLGFELKAEKKRKKY 369 >UniRef50_Q93PB4 MS117, putative maturase n=1 Tax=Microscilla sp. PRE1 RepID=Q93PB4_9SPHI Length = 462 Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust. 
Identities = 102/330 (30%), Positives = 159/330 (48%), Gaps = 47/330 (14%) Query: 41 KGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA 100 K +PGVDG+ L+ + Q L ++L G+Y+P+ + IPK G +R LGIP Sbjct: 64 KNGGSPGVDGMQVKELRYWFSNNHQKLIEQLKEGNYRPMTIKGQEIPKPGGGVRQLGIPT 123 Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 ++DR+VQ+A+ + ++ F SYGFR R+ H A+R + + +V++ Sbjct: 124 VQDRLVQQAIAQQLSKRYDPTFSQYSYGFRKGRNAHQALRQAGAYVKEGFN----YVVDL 179 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISP 220 DL +FD V+H LM + RRISD R + L+ K +++G + GL G PQG +SP Sbjct: 180 DLEKFFDKVNHDRLMWLLGRRISDKRVLKLIGKFLRSGILIGGLENQRISGTPQGSPLSP 239 Query: 221 LLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADD 280 LLSNI+L+E D+ L R + RYADD Sbjct: 240 LLSNIVLDELDKELERR---------------------------------GHRFVRYADD 266 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI--PHVNDGFIFLGHRLI----R 334 +L+V+ +A E +E L L++N DK++I P+ + FLGH ++ Sbjct: 267 MILLVRSQEA-AERAYSSITSFIENRLLLKVNKDKSRICRPYQLN---FLGHSIMWDGKL 322 Query: 335 KRSRYGEMRVVSTIPQEKARNFAASLTALL 364 SR E R + + RN SL ++ Sbjct: 323 GLSRQSEQRFKEKVKKVTRRNRGISLEQMV 352 >UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=10 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N9_HAMD5 Length = 570 Score = 145 bits (367), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 107/329 (32%), Positives = 162/329 (49%), Gaps = 48/329 (14%) Query: 9 AATDPSLRIQRLLRLI--TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 +AT +++ L +L+ ++ L ++T ++G HT GVD N+ + + L Sbjct: 40 SATGDLKKVRNLQKLMMKSRANHLLAIRKVTQVNRGKHTAGVD--NQVINDHKGREHLYK 97 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L + S + P +RVYI K NGK RPLGIP + DR Q + A+EP WE+ F +S Sbjct: 98 LLSQTTSE--KVYPVKRVYIAKKNGKKRPLGIPTILDRCRQAIVKSALEPYWEAKFEPVS 155 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 YGFRP RS H AI+ + + TR WV++ D+ FD + H L+K +I Sbjct: 156 YGFRPGRSAHDAIQKI-FCIARARGTR-HWVLDADIKGAFDNIDHNFLIK----KIGGFP 209 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 ++ + ++AG ++ G + G PQGG+ISPLL+NI L+ + L +Y Sbjct: 210 ERNMIKQWLQAGVLEHGNYIPNVAGTPQGGIISPLLANIALHGMETLLGIQYWK------ 263 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG----- 301 N + ++G+ AV RYADDFV+ K REEC Sbjct: 264 ----NGTPKQGQPYAV------------VRYADDFVVFGKS--------REECETAKIKL 299 Query: 302 -VLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 + L L+ +KT I H+ +GF FLG Sbjct: 300 QIWLAQRGLALSEEKTSIKHLKEGFDFLG 328 >UniRef50_B4WW73 Group II intron, maturase-specific domain family n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WW73_9SYNE Length = 479 Score = 145 bits (367), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 97/286 (33%), Positives = 147/286 (51%), Gaps = 35/286 (12%) Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P RRVYIPKSNGKLRPLGI + DR +Q + A+EP WE+ +YG+RP RS H A+ Sbjct: 25 PTRRVYIPKSNGKLRPLGISTIADRCLQMVVKTALEPEWEAKLEGSTYGYRPGRSCHDAV 84 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + KL + + WV++ DL FD + L+ + R AR + W +++G+ Sbjct: 85 K--KLYFMSLPKNKKHWVVDADLQGCFDNIDQSFLLAKLER--FPARGLVEQW--LQSGY 138 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 ++ G ++ + G PQG ISPLL+NI L+ ++ L Y S RGR Sbjct: 139 VEYGRWQPTTAGTPQGNCISPLLANIALHGMEEALGITYDS---------------RGRI 183 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG-VLEGSLKLRLNMDKTKI 318 R +YADDFV++ + +KA EA E+ + +LE L + +KT++ Sbjct: 184 NGKR---------GLAKYADDFVVMCE-SKADAEAAIEDLKPWLLERGLS--FSEEKTQV 231 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 H+ +GF FLG R + G+ R + + +R S+ A L Sbjct: 232 VHLREGFDFLGFNF-RHYATPGKTRTGWKLLTKPSRKSIKSIKARL 276 >UniRef50_C7V8C7 Reverse transcriptase n=1 Tax=Enterococcus faecalis CH188 RepID=C7V8C7_ENTFA Length = 496 Score = 145 bits (366), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 117/368 (31%), Positives = 184/368 (50%), Gaps = 50/368 (13%) Query: 17 IQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVN-KTMLQARLAVELQILRDELLSG 74 ++RL LIT + A A + +S+KG +T G+DGV KT Q + A+E +L Sbjct: 46 VRRLQYLITHSFYAKALAVKKVISNKGKNTAGIDGVIWKTDSQKKQAIE------QLNPN 99 Query: 75 HYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 HY P +R+YI K K RPLGIP + DR +Q L A+EP+ E + SYGFR + Sbjct: 100 HYSPKAVKRIYITKFGKKEKRPLGIPCMLDRAMQALYLQALEPVSECISDSNSYGFRRFK 159 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S A V L C + +W++EGD+ FD + H+ L+ + +L K Sbjct: 160 SAKDAGEKVFKVL--CRQYSAQWILEGDIKGCFDNISHQWLIDNIPLE------KNMLRK 211 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 +K+G+++ + G QGG+ISP L+NI L+ ++ + +Y S K Sbjct: 212 FLKSGYMEKKKLFPTTMGTAQGGIISPTLANITLDGLEKRIKSKYWSNK----------- 260 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LR 310 +G + VR N K V + RYADDF IV G + I + + ++ LK L Sbjct: 261 --KG-TIGVRYN---KHKVNFVRYADDF--IVTGDSPE---ILLKIKNMINEFLKERGLS 309 Query: 311 LNMDKTKIPHVNDGFIFLG--------HRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 L+ +KT I H+N GF FLG ++LI + S+ R+ T+ Q ++++S Sbjct: 310 LSEEKTLITHINQGFDFLGWNFRKYKRYKLIVQPSKKSIKRMKQTLKQVVKVHYSSSQDL 369 Query: 363 LLWKVRIS 370 L+ + ++ Sbjct: 370 LIQNLNLT 377 >UniRef50_B7KM76 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM76_CYAP7 Length = 309 Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 95/290 (32%), Positives = 141/290 (48%), Gaps = 41/290 (14%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G+DG L L L + + + +Y P P ++V IPKS K R L IP +RDRIV Sbjct: 29 GIDGETIEHFALNLDFNLTFLLNSVTNSNYIPQPLKQVLIPKSQEKWRELRIPTVRDRIV 88 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q+A+L + P+ E F S+ +RP RS A++ D G +WV++ D+ YF Sbjct: 89 QQALLNVLYPVMEERFSDASFAYRPNRSYLDAVKRAAY-WRDLGY---QWVLDADIVEYF 144 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNI 225 D + H LL+K VR+ + ++ + L+ I AG D G+ +GVPQG VISP+L+NI Sbjct: 145 DNISHSLLLKEVRKTVDNSGILCLIKAWISAGVSTDKGII-FPEKGVPQGAVISPMLANI 203 Query: 226 MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 L+EFD + + L RYADDF+++ Sbjct: 204 YLDEFDHRITQSDLK---------------------------------LVRYADDFLVLS 230 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK 335 + A + + L L+L+ +KT+I H GF FLGH +RK Sbjct: 231 DTEDGIMRAYSQVVQ--LLHFWGLKLHEEKTQITHFKKGFQFLGHGFLRK 278 >UniRef50_B7HM08 Group II intron reverse transcriptase/maturase n=30 Tax=Firmicutes RepID=B7HM08_BACC7 Length = 610 Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 106/337 (31%), Positives = 177/337 (52%), Gaps = 20/337 (5%) Query: 1 MQRKL-ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN-KTMLQA 58 MQRK ++ + +L+ +I E + A R ++KG++T GVD + K + Sbjct: 15 MQRKYDELYSNSLNGNNFYKLIDIIGSEENIRLAYRNIKTNKGSNTAGVDNLTIKDIWHL 74 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPI 117 + +R L +YQP +RV IPK + K RPLGIP + DR+VQ+++L +EPI Sbjct: 75 NDTKIIHEVRKRL--NNYQPQAVKRVLIPKEGSDKKRPLGIPTIWDRLVQQSILQVLEPI 132 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E+ FH SYGFRP RS HHA+ V + L + G + ++ D+ +FD V H+ L++ Sbjct: 133 CEAKFHNHSYGFRPNRSTHHALSRV-VSLINIGHQ--HYCVDIDIKGFFDNVCHKKLLRQ 189 Query: 178 V-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + I D + ++ K +K+ G+ ++G PQGG+ISPLLSNI+LNE D ++ Sbjct: 190 MWTLGIRDKSLLCVISKILKSEIEGEGI---PNKGTPQGGIISPLLSNIVLNELDWWISS 246 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 ++ + K + ++ G R+ K RYADDF ++ + T + + Sbjct: 247 QWETYKPHRI-----STRHLGFRQYARKYTNLKCGYV-VRYADDFKIMCR-TYDEAQRFY 299 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRL 332 L+ L L +N K+K+ ++ + +FLG ++ Sbjct: 300 HATVDFLKSRLGLEINPKKSKVVNLKKNSSVFLGFKI 336 >UniRef50_B2JXR4 RNA-directed DNA polymerase n=10 Tax=Bacteria RepID=B2JXR4_BURP8 Length = 503 Score = 143 bits (361), Expect = 8e-33, Method: Compositional matrix adjust. 
Identities = 113/330 (34%), Positives = 157/330 (47%), Gaps = 56/330 (16%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 D + +QRLL + LA R+T ++G TPGVDG + A+ L L Sbjct: 50 DKAKVLQRLLTRSHSAKMLA-VKRVT-ENRGKRTPGVDGRVWSSSAAKWKGML-----SL 102 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 Y+ +P RR+YIPKSNGK RPLGIP +R R +Q +A+EPI E+ SYGFRP Sbjct: 103 RHRGYRAMPLRRIYIPKSNGKKRPLGIPCMRCRSMQALWKLALEPIAETLADANSYGFRP 162 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 ERS AI L WV+EGD+ FD H +K + +L Sbjct: 163 ERSTADAIEQCFTVLAR--RISPEWVLEGDIRGCFDNFSHSWFLKHI------PMDKVIL 214 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY-LSGKARKDRWYW 250 K ++AG+ID G + G PQGG+ISP+++N+ L+ + +H S +ARK Sbjct: 215 RKWLEAGYIDEGTLFESRAGTPQGGIISPVIANMALDGLEAAVHASVGTSARARK----- 269 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQV------EAIRE--ECRGV 302 + ++ RYADDFV V G V A+R+ RG Sbjct: 270 ------------------RAQLSVIRYADDFV--VTGVSKDVLELKVLPAVRQFMAVRG- 308 Query: 303 LEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 L L+ +KT+I H+ GF FLG + Sbjct: 309 ------LELSEEKTRITHIAAGFDFLGQNV 332 >UniRef50_C9BNF1 Group II intron reverse transcriptase/maturase n=6 Tax=Bacilli RepID=C9BNF1_ENTFC Length = 600 Score = 143 bits (361), Expect = 9e-33, Method: Compositional matrix adjust. 
Identities = 95/308 (30%), Positives = 163/308 (52%), Gaps = 24/308 (7%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVN----KTMLQARLAVELQILRDEL 71 + +L LI + A R ++KG+ TPG D K M QA + ++ L Sbjct: 29 KFYQLYELIISENNILLAYRTIKANKGSSTPGTDSFTIDNYKEMNQAEF---IHLILSHL 85 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 +Y+P +RV IPK NG+ RPLGIP + DR++Q+ +EPI E+ F+ SYGFRP Sbjct: 86 --ENYKPKSIKRVMIPKPNGEKRPLGIPCMIDRMIQQMFKQILEPICEAKFYEHSYGFRP 143 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTL 190 RS HA+ + + ++ + ++ D+ +FD V+HRLL+K + I D + + + Sbjct: 144 LRSAKHALGRIMYLI---NISKMHYAVDIDIKGFFDNVNHRLLIKQLWNIGICDKQVLAI 200 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 L K++K+ G+ S+G QGG+ISPLLSN++LN+ D ++ +++ + + + Sbjct: 201 LSKSLKSPIQGEGI---PSKGTIQGGIISPLLSNVVLNDLDHWVSKQWHTFETKYPYTKG 257 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR 310 N + R T +++ + RYADDF ++ ++ + L+ LKL Sbjct: 258 YNKFRALRDTNLKQGY-------IVRYADDFKIMTNDYPTALKWF-HAVKLYLKDRLKLD 309 Query: 311 LNMDKTKI 318 ++ +K+KI Sbjct: 310 ISNEKSKI 317 >UniRef50_Q0AW97 RNA-directed DNA polymerase (Reverse transcriptase) n=24 Tax=cellular organisms RepID=Q0AW97_SYNWW Length = 443 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 109/368 (29%), Positives = 168/368 (45%), Gaps = 50/368 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++A A +P R L+ I E L E L G+ GVD V K + L Sbjct: 7 RIAEIARQNPKERFTALIHHINH-ETLKECH---LEISGSKASGVDQVTKQAYEENLEAN 62 Query: 64 LQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + L + Y+P P RRVYIPK + K RPLGIP+ D++VQ+ + + I+E DF Sbjct: 63 IADLIGRMKRQAYKPQPVRRVYIPKEGSNKRRPLGIPSYEDKLVQKGLARILNTIYEQDF 122 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP R H A++ + + + ++++ D+ +FD V H +MK + RI Sbjct: 123 LDCSFGFRPGRGCHDALKVLNHIIE---RKKVNYIVDADIRGFFDHVDHEWMMKFLELRI 179 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 +D + L+ + +KAG ++ G+ +G PQGG++SP+L+NI YLH Y+ Sbjct: 180 ADPNLLRLIKRFLKAGVMEAGIVYDTPKGTPQGGIVSPILANI-------YLH--YVL-- 228 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 D W+ V++ Q + + RYADDFV + K+ E R Sbjct: 229 ---DLWF---------EKVVKKRCQGEAYLV--RYADDFVCCFQ-NKSDAEWFYANLRER 273 Query: 303 LEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRVVST 347 L L + +KT+I D F LG +S+ G RV Sbjct: 274 L-NKFNLEVAEEKTRIIAFGRFADKESKKQGRKKPDTFDLLGFTHYCSKSKKGWFRVKRK 332 Query: 348 IPQEKARN 355 Q+K R+ Sbjct: 333 TSQKKYRS 340 >UniRef50_B0URY2 RNA-directed DNA polymerase n=25 Tax=cellular organisms RepID=B0URY2_HAES2 Length = 575 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 106/339 (31%), Positives = 168/339 (49%), Gaps = 49/339 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDG-VNKTMLQA 58 MQ ++A +++ L R++T + A A R + G T G+D + T Sbjct: 40 MQVRIAKATQESNWRKVKNLQRMLTHSFYAKALAVRRVTENTGKRTAGIDKRIWDTPESK 99 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 +A++ +L S YQP P RRV+IPKSNGK RPLGIP ++DR +Q L+A++PI Sbjct: 100 WIAIQ------DLSSKGYQPKPLRRVFIPKSNGKKRPLGIPTMKDRAMQMLYLLALQPIA 153 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR---GRWVIEGDLSSYFDTVHHRLLM 175 E+ SYGFR RS AI + + G WV++ D+ FD ++H L+ Sbjct: 154 ETTADNNSYGFRLNRSTADAISHIHSIFSTKGNQSRQMAEWVLDADIHGCFDFINHDWLL 213 Query: 176 KAV--RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY 233 K + +RI L K +K+G ++ G + +EG PQG +ISP L+N+ L+ ++ Sbjct: 214 KHIPMNKRI--------LKKWLKSGVVEFGQLKPTTEGTPQGDIISPTLANMALDGLEKE 265 Query: 234 LHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVE 293 L + + + + K I + R+ V RYADDF++ + E Sbjct: 266 LIKHFGAKNSLK--------IAKHRTYLV-------------RYADDFII----SGISKE 300 Query: 294 AIREECRGVLEGSLK---LRLNMDKTKIPHVNDGFIFLG 329 + E+ +++ L L L+ KTK+ H+ GF FLG Sbjct: 301 LLEEQVIPMVKNFLAERGLSLSESKTKVVHIEHGFDFLG 339 >UniRef50_C1L365 Group II intron-encoded protein n=1 Tax=Bacillus thuringiensis RepID=C1L365_BACTU Length = 598 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 98/337 (29%), Positives = 180/337 (53%), Gaps = 24/337 (7%) Query: 39 SSKGAHTPGVDGVNKTMLQARLAVEL-QILRDELLSGHYQPLPARRVYIPKSNGKLRPLG 97 S++G+ TPG D + + E+ ++R++L++ ++P RRV+IPK++GK RPLG Sbjct: 52 SNQGSKTPGTDQITINEYKGFSDDEIIHLVREKLIN--FKPDSVRRVFIPKADGKSRPLG 109 Query: 98 IPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWV 157 IP + DRI+Q+ +EPI E+ F+ SYGFRP RS HHAI + + + Sbjct: 110 IPTMLDRIIQQCFKQILEPIVEAKFYEHSYGFRPLRSTHHAIARTNFLINI---NKLHYC 166 Query: 158 IEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGG 216 ++ D+ +FD ++H L+K + + D + + ++ K +K + G A +G PQGG Sbjct: 167 VDIDIKGFFDNINHNKLIKQLWNIGVRDKQVLAIIKKMLKCEIVGEG---KAEKGTPQGG 223 Query: 217 VISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCR 276 ++SPLL+N++LN+ D ++ ++ + ++ ++ R R V++ + K R Sbjct: 224 ILSPLLANVVLNDLDHWVASQWHNFPTNT---HYEDNRPRYR---VQKKTKLKQGFI-VR 276 Query: 277 YADDFVLIVKGTKAQVEAIR--EECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLI 333 YADDF + T + A R + +E +LKL ++ K+KI ++ +FLG Sbjct: 277 YADDFKIF---TNSYNSAKRWFHAVKNYIEKNLKLEISESKSKITNLRKRKSLFLGIEFK 333 Query: 334 RKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS 370 + + S IP++ ++ A++ + +RI+ Sbjct: 334 AVAKK-KKFVAQSYIPKDSMKSIQANIKKKVKSIRIN 369 >UniRef50_B3PDY2 Putative maturase n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PDY2_CELJU Length = 402 Score = 142 bits (359), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 49/315 (15%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPAL 101 A G D V M + L EL L + + SG Y P +RV + K++GKLRPLGIP + Sbjct: 21 NAGVAGADNVCIDMFEHNLENELYKLWNRMSSGSYMAPPVKRVEMAKADGKLRPLGIPTV 80 Query: 102 RDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGD 161 DR+ Q + M +EP W+S FH S+G+RP RS HHA++ K+ +C + WVI+ D Sbjct: 81 ADRVAQMVVKMTLEPEWDSKFHASSFGYRPRRSAHHAVQAAKI---NCWKY--SWVIDLD 135 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISP 220 + +FD ++H L K V + D + + I AG + G ++G PQGGVISP Sbjct: 136 IKGFFDNLNHDQLQKFVAQATDDPWCKLYIKRWITAGVQMPGGELHKTAKGTPQGGVISP 195 Query: 221 LLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYAD 279 LL+N+ L++ FD ++ + + P + RYAD Sbjct: 196 LLANLYLHKVFDSWMQKYF-------------------------------PQNPFERYAD 224 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN---------DGFIFLGH 330 D V + T+ + E + ++ L L+ +KTKI + F FLG Sbjct: 225 DIVCHCR-TEHEAEQLLSAISRRMQ-RFDLTLHPEKTKIVYCGRRKIERTKAQSFDFLGF 282 Query: 331 RLIRKRSRYGEMRVV 345 R+ + + ++V Sbjct: 283 TFRRRTVKRKDGKLV 297 >UniRef50_P03875 Putative COX1/OXI3 intron 1 protein n=3 Tax=Fungi/Metazoa group RepID=AI1M_YEAST Length = 834 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 100/306 (32%), Positives = 150/306 (49%), Gaps = 40/306 (13%) Query: 39 SSKGAHTPG-----VDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKL 93 S G TPG +DG+N + L L +EL +G ++ P R V IPK G + Sbjct: 271 SKPGNMTPGTTLETLDGMN--------MMYLNKLSNELGTGKFKFKPMRMVNIPKPKGGM 322 Query: 94 RPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR 153 RPL + RD+IVQ M M ++ I++ T S+GFR S AI V+ Sbjct: 323 RPLSVGNPRDKIVQEVMRMILDTIFDKKMSTHSHGFRKNMSCQTAIWEVRNMFGG----- 377 Query: 154 GRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VGLFRAASEGV 212 W IE DL FDT+ H L++K ++R ISD F+ L++K ++AG+ID G + G+ Sbjct: 378 SNWFIEVDLKKCFDTISHDLIIKELKRYISDKGFIDLVYKLLRAGYIDEKGTYHKPMLGL 437 Query: 213 PQGGVISPLLSNIMLNEFDQYLHER---YLSGKARKDRWYWNN---SIQRGRSTAVRENW 266 PQG +ISP+L NI++ D +L + Y GK +K + I + + + R Sbjct: 438 PQGSLISPILCNIVMTLVDNWLEDYINLYNKGKVKKQHPTYKKLSRMIAKAKMFSTRLKL 497 Query: 267 QWKPA--------------VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 + A + Y RYADD ++ V G+K + I+ + L SL L +N Sbjct: 498 HKERAKGPTFIYNDPNFKRMKYVRYADDILIGVLGSKNDCKMIKRDLNNFLN-SLGLTMN 556 Query: 313 MDKTKI 318 +KT I Sbjct: 557 EEKTLI 562 >UniRef50_A4C8M3 RNA-directed DNA polymerase (Reverse transcriptase) n=12 Tax=Pseudoalteromonas tunicata D2 RepID=A4C8M3_9GAMM Length = 429 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 102/334 (30%), Positives = 158/334 (47%), Gaps = 47/334 (14%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G+DG+ Q +L + L D L ++ +RV+IPK+NGK RPLG+P + D++V Sbjct: 46 GIDGITMPAYQQQLVGNITRLSDALKHKRFRANDIKRVFIPKANGKQRPLGLPTVDDKLV 105 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q+ + ++ IWE+DF SYG+RP +S H A+ ++ L L G +++E D+ +F Sbjct: 106 QQGVSQILQSIWEADFLPNSYGYRPNKSAHQALHSLALNLQFKGYG---YIVEADIKGFF 162 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNI 225 + + H LMK +++RI D ++L+ + +KA G+F G PQGG+ISP+L+NI Sbjct: 163 NNLDHNWLMKMLKQRIDDKAMLSLISQWLKARIKSPEGVFEYPKSGTPQGGIISPVLANI 222 Query: 226 MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 YLH D W+ R R A+ RYADDFV Sbjct: 223 -------YLHYAL-------DLWFEKKVKPRMRGRAM-----------LIRYADDFVCAF 257 Query: 286 KGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKI-------PHVNDGFIFLGHRLIRK 335 Q E VL LK L + +KT + P F+FLG Sbjct: 258 -----QYANDAERFYEVLPKRLKKFNLEVAEEKTSLLRFSRFHPSRKRQFVFLGFAFYWA 312 Query: 336 RSRYGEMRVVSTIPQEKARNFAASLTALLWKVRI 369 + G+ R+ EK R AS++ +++ Sbjct: 313 KDAQGKPRLRRRTGAEKHR---ASMSEFYQYIKV 343 >UniRef50_Q10VN2 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Cyanobacteria RepID=Q10VN2_TRIEI Length = 437 Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust. 
Identities = 105/331 (31%), Positives = 165/331 (49%), Gaps = 35/331 (10%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH-YQPLPARRVYIPK 88 L R+T ++G G D A+ +VE L E+L+ +Q PA+RVYIPK Sbjct: 12 LLSVRRVTQENQGIRRMGRDAQT-----AKTSVEKVKLVKEMLTYRLWQAKPAKRVYIPK 66 Query: 89 SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTD 148 +N + PLGIP +++R+ Q + +EPIW+++F T SYGF P RS H + ++L Sbjct: 67 ANRQQGPLGIPTVKNRVAQAVVKNGLEPIWDAEFETNSYGFHPGRSCHDPLEQFWIRLQK 126 Query: 149 CGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAA 208 +T W+++ D+ FD + H ++KA I + L+ + +KAG+++ +F Sbjct: 127 GKDT---WILDVDIKQDFDNITHEYILKA----IGEIPGRELIKQWLKAGYLEAEVFHKT 179 Query: 209 SEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQW 268 G G+ISPLL+NI + ++ L RY + K + Q R T E + Sbjct: 180 EGGTSSRGIISPLLANIAFDGMERLL-ARYKTVK----------TYQCTRPTTDEEYTKK 228 Query: 269 KP--AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI 326 K + RYADDF++ + ++ ++AI L L LN DKT + H+ GF Sbjct: 229 KKLDKYGFIRYADDFIITAR-SEEDIKAIIPTIEKWL-SERGLELNKDKTNLVHIEQGFN 286 Query: 327 FLGHRLIRKRSRYGEMRVVSTIPQ-EKARNF 356 FLG + R G +V PQ EK + F Sbjct: 287 FLGFNV---RQFNGSCFIV---PQKEKVKEF 311 >UniRef50_A7UDN1 Putative reverse transcriptase n=2 Tax=Candida zemplinina RepID=A7UDN1_CANZE Length = 445 Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust. 
Identities = 104/335 (31%), Positives = 167/335 (49%), Gaps = 57/335 (17%) Query: 16 RIQRLLRLITQPEWLAEAARITL-SSKGAHTPGVDG--VNKTMLQARL-AVELQILRDEL 71 +++ L R+ + + E A T+ SS G+ TPGVD + M +A++ V +I + Sbjct: 27 KVRDLQRMTVRSGYARELAVDTIASSPGSKTPGVDNFIIKNEMDKAKMIKVTGKIEQ--- 83 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 Y P P +R+YIPK+NGKLRP GIP + DR +Q A +PI E+ S+GFRP Sbjct: 84 ----YNPKPVKRIYIPKANGKLRPTGIPTMADRAMQCTFSFATQPIAETLGDQHSFGFRP 139 Query: 132 ERSV----HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 RS +H R ++ ++ W ++ D+ +FD + H ++ ++ + R Sbjct: 140 NRSTIDAFNHLYRAQFIKSSNAPV--NNWAVDADIKGFFDNISHEWILNNIK---IEPR- 193 Query: 188 MTLLWKTIKAGHID----VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 +L K KAG I+ V F + GVPQGGV+SP+++N++ + +Q++++ Sbjct: 194 --MLAKFTKAGFIEYNNQVNEFHDTNTGVPQGGVMSPMMANMVTDGLEQHIYD------G 245 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 K R +N +R V + RYADDFV+I E + + ++ Sbjct: 246 TKARNIYNGMTKR---------------VHFIRYADDFVIIT-----PYEWVAQRTMPIV 285 Query: 304 EGSLK---LRLNMDKTKIPHVN-DGFIFLGHRLIR 334 LK L LNMDKT I ++ D FLG+ R Sbjct: 286 NSFLKERSLSLNMDKTHILDISKDNLDFLGYTTKR 320 >UniRef50_A0RHJ0 Reverse transcriptase/endonuclease protein n=6 Tax=Firmicutes RepID=A0RHJ0_BACAH Length = 608 Score = 141 bits (356), Expect = 4e-32, Method: Compositional matrix adjust. 
Identities = 102/322 (31%), Positives = 167/322 (51%), Gaps = 25/322 (7%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 + + LI E + A R S+ G+ T G +G L A +L L + L +Y Sbjct: 34 FKNIYELIISEENIRLAFRNLKSNIGSKTKGTNGHTIKHLNKIDADKLIRLTQKRLE-NY 92 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P RR++I K NGK+RPLGIP + DR++Q+ +EPI E FH SYGFRP+R H Sbjct: 93 MPHAVRRLFISKPNGKMRPLGIPTIEDRLIQQMFQQVLEPIVEGKFHPQSYGFRPKRGTH 152 Query: 137 HAIRTVKLQLTDC----GETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLL 191 A L C + +V++ D+ +FD V+H+ LM+ + I D + ++++ Sbjct: 153 DA-------LARCYHMVNHSHQHFVVDIDIKGFFDNVNHKKLMRQLWTIGIRDKKVLSII 205 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 K +KA G+ +G PQGG++SPLL+N++LNE D ++ ++ + R Sbjct: 206 KKMLKAEVTGEGI---PVKGTPQGGILSPLLANVVLNELDWWVSNQWETKPTRVPY---- 258 Query: 252 NSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 ++R ++ A+++ + KP + RYADDF I + I+ L+ L L + Sbjct: 259 -KLKRNKTDALKKT-RLKP-MYLVRYADDFK-IFTNSYDNARKIKIAVEKWLKERLGLEI 314 Query: 312 NMDKTKIPHV-NDGFIFLGHRL 332 + +K+KI ++ +G FLG R Sbjct: 315 SEEKSKITNLRKNGTDFLGIRF 336 >UniRef50_A6P1G1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P1G1_9BACE Length = 320 Score = 140 bits (353), Expect = 7e-32, Method: Compositional matrix adjust. 
Identities = 73/192 (38%), Positives = 113/192 (58%), Gaps = 8/192 (4%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 A + S + +RL R + P++ A + + G T G DG KT + V + L + Sbjct: 8 ACNQSYKYERLYRNLYNPQFYLLAYQRIQAKPGNMTAGTDG--KT-IDGMGMVRVNALIE 64 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 ++ YQP PARR YIPKSNGK+RPLGIP+ D+++Q + + +E I+E F S+GF Sbjct: 65 KMRDFSYQPNPARRTYIPKSNGKMRPLGIPSFDDKLIQEVVRLILESIYEPTFSDHSHGF 124 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R +S H A++ V+ T +W +EGD+ FD V H +L+ +R+RI+D F+ Sbjct: 125 RMNKSCHTALKYVQKYFTGT-----KWFVEGDIRGCFDNVDHHVLIAILRKRIADEHFIG 179 Query: 190 LLWKTIKAGHID 201 LLWK +KAG+++ Sbjct: 180 LLWKFLKAGYME 191 >UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YLU0_ANASP Length = 539 Score = 140 bits (352), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 115/352 (32%), Positives = 174/352 (49%), Gaps = 41/352 (11%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 ++Q+LL L ++ L ++T + G T GVDG RLA+ ++L Sbjct: 58 KLQKLL-LSSKAAKLLAIRQVTQLNTGRKTAGVDGKKALEPSQRLAL-YEVLVKNWKQWK 115 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 +QPL +RVYIPK++G R LGIP + DR Q + A+EP E+ F+ SYGFRP RS Sbjct: 116 HQPL--KRVYIPKADGTRRGLGIPTISDRAYQCLIKYALEPAAEAMFNARSYGFRPGRSC 173 Query: 136 HHAIRTVKLQLTDCGETRG--RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H + + L + G+ G + ++E D+ FD + H+ LM++V ++ A + W Sbjct: 174 HDVQKLLFSNL-NGGQANGLSKRILELDIERCFDKIDHKFLMQSV--QLPKAAKQGIFW- 229 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE-RYLSGKARKDRWYWNN 252 IKAG G F ++ G PQGGVISPLL+NI+L+ + HE RY Sbjct: 230 AIKAG--VRGEFPSSESGTPQGGVISPLLANIVLHGLENVGHELRY-------------- 273 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 VR + + RYADD V ++K + EA+R+ LE L++ Sbjct: 274 --------KVRSGGRQIDTIKGFRYADDVVFLLK-PEDNPEALRQNIDTFLEAR-GLKVK 323 Query: 313 
MDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 KTKI H D F FLG + K + + +ST Q+ + A + ++ Sbjct: 324 EAKTKIVHSTDSFDFLGWNFVVKPN----GKFISTPSQKATSSIKAKVKEVM 371 >UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcriptase) n=85 Tax=Bacteria RepID=B7K703_CYAP7 Length = 661 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 114/349 (32%), Positives = 174/349 (49%), Gaps = 45/349 (12%) Query: 9 AATDPSLRIQRLLRLITQPEWLAE---AARITLSSKGAHTPGVDGVNKTMLQARLAVELQ 65 AA+ ++R R L+ W A+ R+T ++G T GVDGV + R+ + Q Sbjct: 44 AASRGNVRTVRRLQKTLLRSWSAKMLAVRRVTQDNQGKKTAGVDGVKSLTPKQRMNLVGQ 103 Query: 66 ILRDELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + P RRV+IPK + RP IP + DR +Q + +A+EP WE+ F Sbjct: 104 ------LKLTCKTKPTRRVWIPKPGKDEKRPFLIPCMSDRALQALVKIALEPEWEAKFEP 157 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP R H AI + QL + ++V++ D+S FD ++H L++ + Sbjct: 158 NSYGFRPGRGCHDAIGAIFNQLG----AKAKYVLDADISKCFDKINHEKLLQKL-NTFPT 212 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 R W +KAG +D EG PQGGV+SPLL+NI L+ ++ + + + Sbjct: 213 LRRQIRAW--LKAGVMDGNKLFPTEEGTPQGGVVSPLLANIALHGMEEII--KSFAQNPG 268 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 + R ++N RG+ RE +++ RYADDFVLI + A+ E+ + ++E Sbjct: 269 ELRQEFSN---RGKG---REQ-----SISLIRYADDFVLIHESL-----AVVEKGKEIIE 312 Query: 305 G---SLKLRLNMDKTKIPHVND------GFIFLGHRLIR-KRSRYGEMR 343 L L L +KT+I H D GF FLG + KRSR+ M+ Sbjct: 313 TWLRELGLTLKPEKTQITHTLDKHQGKVGFNFLGFNIRHYKRSRHKSMK 361 >UniRef50_Q47DU4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Dechloromonas aromatica RCB RepID=Q47DU4_DECAR Length = 429 Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 119/342 (34%), Positives = 173/342 (50%), Gaps = 55/342 (16%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQA----RLAVEL 64 A ++ R L I + + LA A S+KGA PGVD + ++A R EL Sbjct: 7 AKSESGYRFYALYDKIYRTDILAHAYAQCRSNKGA--PGVDRQDFEDVEAYGVRRWLEEL 64 Query: 65 QI-LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + L++E Y+P P RRV+IPK+NGKLRPLGI L DR+ A ++ +EPI+E+D Sbjct: 65 ALALKEE----SYRPDPIRRVFIPKANGKLRPLGISTLHDRVCMTAAMLVLEPIFEADLP 120 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 Y +RP R+ A VK +L G+T V++ DLS YF ++ H LMK++ RRI Sbjct: 121 DEQYAYRPGRNAQQAAEEVKNRLY-LGQTD---VVDADLSDYFGSIPHSELMKSLARRIV 176 Query: 184 DARFMTL--LWKTIKAGHIDV-GLFRAASE------GVPQGGVISPLLSNIMLNEFDQYL 234 D R + L +W D G + +E G+PQG ISPLLSN+ + F Sbjct: 177 DRRVLHLIKMWLECAVEETDQRGRKKRTTEAKDQGRGIPQGSPISPLLSNLYMRRF---- 232 Query: 235 HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 L+ K ++R + + YADD V++ K KA+ EA Sbjct: 233 ---VLAWK--------KLGLERSLGSRI------------VTYADDLVILCKCGKAE-EA 268 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRK 335 + + R ++ G LKL +N +KT+I V G F FLG+ R+ Sbjct: 269 L-QWMRTIM-GKLKLTVNEEKTRICQVPAGTFDFLGYSFGRR 308 >UniRef50_P05511 Uncharacterized 91 kDa protein in cob intron n=1 Tax=Schizosaccharomyces pombe RepID=YMC6_SCHPO Length = 807 Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 88/286 (30%), Positives = 143/286 (50%), Gaps = 32/286 (11%) Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 L S + P RR+ I K++G RPL I + RD++VQ + + +E I+E F+T S+GF Sbjct: 284 SLKSEEFNFTPGRRILIDKASGGKRPLTIGSPRDKLVQEILRIVLEAIYEPLFNTASHGF 343 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP RS H A+R++ C W IEGD+ + FD++ H L+ + +I D RF+ Sbjct: 344 RPGRSCHSALRSIFTNFKGC-----TWWIEGDIKACFDSIPHDKLIALLSSKIKDQRFIQ 398 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH----ERYLSGKARK 245 L+ K + AG++ ++ G PQG ++SP+L+NI L++ D+++ E G + Sbjct: 399 LIRKALNAGYLTENRYKYDIVGTPQGSIVSPILANIYLHQLDEFIENLKSEFDYKGPIAR 458 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKP---------------------AVAYCRYADDFVLI 284 R + + + A REN K + Y RYADD+++ Sbjct: 459 KRTSESRHLHYLMAKAKRENADSKTIRKIAIEMRNVPNKIHGIQSNKLMYVRYADDWIVA 518 Query: 285 VKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLG 329 V G+ Q + I + S+ L ++ KTKI + D +FLG Sbjct: 519 VNGSYTQTKEILAKIT-CFCSSIGLTVSPTKTKITNSYTDKILFLG 563 >UniRef50_Q35064 Atp9 intron ORF n=1 Tax=Marchantia polymorpha RepID=Q35064_MARPO Length = 710 Score = 139 bits (349), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 104/349 (29%), Positives = 158/349 (45%), Gaps = 76/349 (21%) Query: 41 KGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA 100 KG PG+DG K + + L+ L EL Y P PA+R+ I K +G RPL I + Sbjct: 277 KGNVAPGIDGRTKADMTDK---ALEKLSKELRRQAYAPKPAKRIIITKPDGGSRPLSIAS 333 Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 D++VQ + +EP +ES F S+GFRP RS H A+R ++ T W+++ Sbjct: 334 TVDKVVQSTLKELVEPHFESLFRDSSHGFRPGRSCHKALRDLRYSWTAL-----TWLVQI 388 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV----GLFRAASEGVPQGG 216 D+ FD +HH LL+K + + L+ K + AG+IDV + +EGV QG Sbjct: 389 DIKKDFDKIHHDLLIKEMESVLRSKALQDLMRKLLNAGYIDVYNLTDRTQYNTEGVTQGS 448 Query: 217 VISPLLSNIMLNEFDQYLHE----RYLSGKAR--------------KDRWY--------- 249 +ISPL +NI L++ D Y+ + Y G R KD+ + Sbjct: 449 IISPLCANIFLHKLDCYVEDILIPNYNVGNMRPASAEYKKRLNIHSKDKAFFKYYTELEQ 508 Query: 250 ---------WNNSIQRGRSTAVRENWQWK-------PAVA-------------------- 273 W N Q+ +S V++ + ++ P V+ Sbjct: 509 AIKNIKHLKWINREQQKKSILVKKKYFFENLFFFRNPKVSCPLGRRTLLEMAEKEGLKRL 568 Query: 274 -YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV 321 Y RYAD+ +L V G+K IR+ + L+ LKL +N K+KI H Sbjct: 569 KYLRYADNIILGVIGSKQDALDIRKAVQNFLQEELKLDINEQKSKILHA 617 >UniRef50_Q119U8 RNA-directed DNA polymerase n=30 Tax=Bacteria RepID=Q119U8_TRIEI Length = 635 Score = 139 bits (349), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 104/329 (31%), Positives = 165/329 (50%), Gaps = 55/329 (16%) Query: 16 RIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 ++++ +L+T+ + L R+T ++G T G+DG+ RL + +++L L Sbjct: 69 KMRKYQKLLTKSYYARLLAVRRVTQDNQGKKTAGIDGIKSLPPMQRLNL-VEMLGSRFLK 127 Query: 74 GHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 P RRV+IPK + RPLGIP + DR +Q + + MEP WE+ F SYGFRP Sbjct: 128 AS----PIRRVWIPKPGREEKRPLGIPTMYDRALQALVKLGMEPEWEALFEPNSYGFRPG 183 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 RS + AI + + + + ++V++ D+S FD ++H L+ +I + + L+ Sbjct: 184 RSTYDAIAAIYVSINH----KPKYVLDADISKCFDRINHDALLG----KIGKSPYRKLVK 235 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE--RYLSGKARKDRWYW 250 + +K+G D F EG PQGGVISPLL+NI L+ ++ L + L G R + Sbjct: 236 QWLKSGVFDNKQFSNTVEGTPQGGVISPLLANIALHGMEKCLEDYAETLPGTKRDN---- 291 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE---GSL 307 QR A++ RYADDFV++ K K ++A + V++ + Sbjct: 292 ----QR--------------ALSLIRYADDFVILHKDIKVLLQA-----KTVIQEWLNQV 328 Query: 308 KLRLNMDKTKIPHV-------NDGFIFLG 329 L L +KTKI H GF FLG Sbjct: 329 GLELKPEKTKIAHTLEEYEGNKPGFDFLG 357 >UniRef50_C6MXE9 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MXE9_9DELT Length = 512 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 94/298 (31%), Positives = 141/298 (47%), Gaps = 42/298 (14%) Query: 33 AARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGK 92 A + S+KG TPGVDGV + R + R Y+P P +R+YIPK NGK Sbjct: 67 AVKRVTSNKGKKTPGVDGVLWKTAKVRWRAACSLRRRG-----YKPQPLKRIYIPKKNGK 121 Query: 93 LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGET 152 RPL IP ++DR +Q +A+ P+ E+ SYGFR RS A L+ Sbjct: 122 KRPLSIPTMQDRAMQALYKLALAPVAETTADGNSYGFREGRSCADATAAAFNALSKPNS- 180 Query: 153 RGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGV 212 WV+E D++ +D + +++ + +L K ++AG+I+ G+ + +G Sbjct: 181 -APWVLEADITGCYDNICQNWMLENI------PMDREVLRKWLEAGYIEDGILYPSHKGT 233 Query: 213 PQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAV 272 PQGG+ISP L+N+ L+ ++ + TAV + V Sbjct: 234 PQGGIISPTLANMTLDGLERVIR------------------------TAVPRRCR----V 265 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH 330 + RYADDF++ K + AIR L G L L+ +KT I H+ DGF FLG Sbjct: 266 NFVRYADDFIVTGKSRRLLETAIRPAIEKFLSGR-GLSLSPEKTAITHIKDGFTFLGQ 322 >UniRef50_B0I1N8 Reverse transcriptase homolog n=2 Tax=Pylaiella littoralis RepID=B0I1N8_PYLLI Length = 796 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 105/326 (32%), Positives = 169/326 (51%), Gaps = 46/326 (14%) Query: 38 LSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGK--LRP 95 +S+KG +DG++ LQA + ++ LSG + P RRVYI K GK LRP Sbjct: 211 ISAKGVDDSSLDGISLRTLQA--------MSNDTLSGRIKFSPVRRVYIKKE-GKTDLRP 261 Query: 96 LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR 155 LGI + R +I+Q+++ M + I+E F S+G R RS H A++ ++L++ + Sbjct: 262 LGISSPRQKIIQKSIEMVLTSIFEEIFLDCSHGSRIGRSCHTALKNLQLKVGNVSTYS-- 319 Query: 156 WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI---DVGLF-RAASE- 210 WV+EGD+ FD + H +MK +++++ + L+ K + AG+I D+ F R ++ Sbjct: 320 WVVEGDIKGCFDNIPHSQIMKRIKQKVDCLPTINLVKKILDAGYILDEDLKKFGRKNAQV 379 Query: 211 -----GVPQGGVISPLLSNIMLNEFDQY----LHERYLSGKARKDRWYWNN---SIQRGR 258 G QG ++SPL SNI+L+E D++ L E + GK RK + I++ Sbjct: 380 FKPDVGTTQGIILSPLFSNIVLHELDKFIEVILKEEFSKGKKRKANLEYRKLRYQIKKED 439 Query: 259 STAVR----ENWQWKPAVA----------YCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 + R E+ + P+ + Y RY DD+V++V G+ + IR+ L Sbjct: 440 NLKKRRKLIEDCKSVPSKSIEDPDFKRLFYVRYVDDWVILVSGSLSDSRLIRDRVSRKLR 499 Query: 305 GSLKLRLNMDKTKIPHVNDGFI-FLG 329 L L LNM+KTKI + G FLG Sbjct: 500 -ELGLELNMEKTKITSLRKGKCRFLG 524 >UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0JX80_MICAN Length = 613 Score = 138 bits (347), Expect = 4e-31, Method: Compositional matrix adjust. 
Identities = 109/349 (31%), Positives = 166/349 (47%), Gaps = 59/349 (16%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ A + +++RL RL+ + + L ++T ++G T GVDG+ + Sbjct: 37 LQKRIYQAAKSGQDAKVRRLQRLLVKSYYARLLAVRKVTQDNQGKKTAGVDGMIAISPEQ 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 RL + E + G + P RRV+IPK + RPLGIP ++DR Q + A+EP Sbjct: 97 RLNLT------EEIKGTLKAKPLRRVWIPKPGRDEKRPLGIPTIKDRARQALIKSALEPE 150 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WES SYGFRP RS H AI + + + +V++ D++ FD ++H L+ Sbjct: 151 WESKMEGTSYGFRPGRSDHDAISRIYITINQSS----YFVLDADIAKCFDRINHDFLLSK 206 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + S R + W +KAG +D G+F G PQGGVISPLL+NI L+ + + E Sbjct: 207 IHCPSSLKRDIKQ-W--LKAGVLDNGVFEETETGTPQGGVISPLLANIALDGMARLI-ET 262 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 K G++ AV RYADDFV+I +E I E Sbjct: 263 LFPKKGN------------GKNQAV-----------LIRYADDFVVI----SPSLEII-E 294 Query: 298 ECRGVLEGSLK---LRLNMDKTKIPHV-----------NDGFIFLGHRL 332 +C+ + LK L L +KT++ H GF FLG + Sbjct: 295 QCKTAISEWLKPIGLELKPEKTRVCHTLKPIEYNGKMEEPGFDFLGFNI 343 >UniRef50_B3GTB4 Putative reverse-transcriptase protein n=1 Tax=Volvox carteri f. nagariensis RepID=B3GTB4_VOLCA Length = 749 Score = 137 bits (346), Expect = 5e-31, Method: Compositional matrix adjust. 
Identities = 99/371 (26%), Positives = 177/371 (47%), Gaps = 40/371 (10%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL---QILRDELLSGHY 76 L+ +I+ L A + S G TPG+ ++ A++L + + +++ +G + Sbjct: 232 LIHIISDTNLLIFAYELLKSKSGYMTPGITE------ESLDAIDLAWYKHISNDIKAGKF 285 Query: 77 QPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 + ARRV IPK +LRPLG+ + RD++VQ+A+ + ++ I++ F S+G+RP +S Sbjct: 286 KFSQARRVMIPKPGKSELRPLGVVSPRDKVVQKALELVLQCIFDPMFLDCSHGYRPGKSQ 345 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A++ + Q + WVI+GD+S FDT+ H +LM + +RIS + + L+ + Sbjct: 346 HTALKMLDQQFKN-----ATWVIKGDISKCFDTIDHEILMHLIGKRISCNKTLALIKSAL 400 Query: 196 KAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDRWYWN 251 KAG++ D L+ + G P V SPLL NI ++EFD ++ + ++ G R+ + Sbjct: 401 KAGNVLDGKLYANEAVGTPHSSVSSPLLCNIYMHEFDLFVKDIIVKFNKGTKRRQNPEYT 460 Query: 252 NSIQRGRSTAVRENWQWKPA--------------------VAYCRYADDFVLIVKGTKAQ 291 + + N+ + Y R+ADDFV+ + G Sbjct: 461 KILNMLYKALEQFNFSKYAKLRKDLRRVRQVNIMDLDYVRIKYVRFADDFVISIIGPYKL 520 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGEMRVVSTIPQ 350 + + L LKL LN KT I + I FL ++ + + +++V + Sbjct: 521 ACDTKVMVKDFLMNKLKLTLNESKTAITKFSKKPIYFLDTEIMNRYPKVKPVKLVKRLGV 580 Query: 351 EKARNFAASLT 361 K N L+ Sbjct: 581 SKLANVTPRLS 591 >UniRef50_Q6TFE1 Putative group II intron-encoded maturase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFE1_CAETA Length = 341 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. 
Identities = 77/243 (31%), Positives = 128/243 (52%), Gaps = 34/243 (13%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G+D V+ +A +++ L L ++P+P+RRVYIPK NG+ RPLGI A+ ++IV Sbjct: 54 GIDKVSWQEYGVDVADKIENLVMRLKRKTFKPMPSRRVYIPKGNGESRPLGISAIENKIV 113 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 + +++ ++ I+E DF SYGFRP R+ H A+ V + ++E D+ +F Sbjct: 114 ESGIMLILQSIYEQDFLECSYGFRPGRNTHQALNEVDKAIM---TQPVNHLVEADIKGFF 170 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIM 226 D V H L V+ R+ D + L+ ++AG+ID G+ +G PQG ++SP+L+NI Sbjct: 171 DNVSHEKLKDFVKIRVKDTSLLHLIDCFLRAGYIDKGVLIDTEKGTPQGSILSPMLANIF 230 Query: 227 LNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC---RYADDFVL 283 L+ Y+ D+W+ E+ + +C RYADDFV Sbjct: 231 LH---------YVL-----DKWF--------------EDTVKQHVEGFCRLVRYADDFVC 262 Query: 284 IVK 286 +++ Sbjct: 263 LIQ 265 >UniRef50_C5ER86 RNA-directed DNA polymerase n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ER86_9FIRM Length = 635 Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust. 
Identities = 100/329 (30%), Positives = 166/329 (50%), Gaps = 28/329 (8%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 + + L+ L+ E + A R + G+ TPG+DG KT+ E +++ EL+ Sbjct: 35 KFKNLMELVLMEENIKLAYRNMKKNDGSTTPGIDG--KTIEHLAKMTEKEVI--ELVRNK 90 Query: 76 ---YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 Y P RRV I K NGK RPLGI ++ DR++Q+ +L +EPI E+ FH S GFRP Sbjct: 91 LEWYTPKAIRRVEIDKGNGKKRPLGIASIEDRLIQQCILQVLEPICEAKFHDRSNGFRPN 150 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLL 191 R V +A+ + + + V++ D+ +FD V H L+K + I D + ++++ Sbjct: 151 RGVENALAQAEKLIQS---NKLYIVVDIDIKGFFDNVSHGKLLKQLWTIGIQDKKLISII 207 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFD-------QYLHERYLSGKAR 244 +K ++G +G QG +ISPLLSN++LNE D +++ R++ +A Sbjct: 208 SAMLKGEIAEIGF---PEKGTAQGSIISPLLSNVVLNELDWWIASQWEFMPTRHVYKEAI 264 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 K N + + + N K RYADDF + + K V + E + L+ Sbjct: 265 K----ANGTQSKSKKYRALRNSTLKECFI-VRYADDFKIFCRKHKDAV-VMFEATKQWLK 318 Query: 305 GSLKLRLNMDKTKIPHVNDGFI-FLGHRL 332 L L ++ +K+KI ++ + FLG R+ Sbjct: 319 TRLGLDISPEKSKIVNLKHSYSEFLGFRI 347 >UniRef50_C5D9G3 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Firmicutes RepID=C5D9G3_GEOSW Length = 605 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 103/323 (31%), Positives = 164/323 (50%), Gaps = 21/323 (6%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDG--VNKTMLQARLAVELQILRDELLS 73 R + L+ + + + A +++G+ T G DG +N + + V + R Sbjct: 16 RFKGLVEIASSDVVIVSAIHKIKANQGSKTAGTDGQTINDILTKNYDEVINFVKR---CF 72 Query: 74 GHYQPLPARRVYIPKSNGKLRP-LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 +Y+P RRVYIPK K + LGIP + DR +Q + M +EPI E+ F SYGFRP Sbjct: 73 KNYKPKLIRRVYIPKPGKKKKRPLGIPTIADRTIQECVRMTIEPILEAQFFQHSYGFRPY 132 Query: 133 RSVHHAI-RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTL 190 R AI R V + C WVIEGD+ +FD V+H +L+K + I D R + + Sbjct: 133 RDTKQAIERCVFI----CNRIGYNWVIEGDIKGFFDNVNHTILIKQLWHMGIRDRRMLMI 188 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 + +KAG + + G PQGG+ISPLL+N+ L++ DQ++ + K R Sbjct: 189 IKAMLKAGVMKET--KVNEIGTPQGGIISPLLANVYLHKLDQWITREWEEKKMRN----- 241 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR 310 +I+ + ++R + Y RYADD+VL ++ E + + L+ +LKL Sbjct: 242 GTAIRTSKFNSLRNHSTITRPEFYVRYADDWVLFT-DSRENAEKWKYRIKKYLKENLKLE 300 Query: 311 LNMDKTKIPHVNDGFI-FLGHRL 332 L+ DKT I ++ + FLG ++ Sbjct: 301 LSDDKTLITNIKKKPMKFLGFKI 323 >UniRef50_D2LU13 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LU13_BACS4 Length = 464 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 91/273 (33%), Positives = 134/273 (49%), Gaps = 38/273 (13%) Query: 46 PGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRI 105 PG+DG++ L LA+E + L + G Y+P P +RV I K +G R LGIP ++DR+ Sbjct: 68 PGIDGMSVDELLPYLALEDRNLILSIKDGSYRPQPVKRVEIKKPDGGKRKLGIPTVKDRL 127 Query: 106 VQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSY 165 VQ+ +L +E + F SYGFRP RS H A+R K Q + G R+V++ D+ Y Sbjct: 128 VQQMILQVIEKKIDPQFSDNSYGFRPNRSAHDAMRKAK-QYYEEG---FRYVVDIDMKQY 183 Query: 166 FDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNI 225 FDTV+ LM V + I D + L+ K +++G + G PQGG +SP+L NI Sbjct: 184 FDTVNQDKLMHHVEQFIDDPTVLILIRKFLRSGISIDEEIEPSEVGTPQGGNLSPILGNI 243 Query: 226 MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 L++ D L R + RYADD + V Sbjct: 244 YLHQLDLELERR---------------------------------GHKFIRYADDCNIYV 270 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 K KA ++ + LE LKL +N DK+++ Sbjct: 271 KSRKAGDRVLKSITK-FLEEELKLTVNKDKSEV 302 >UniRef50_Q94Z24 Orf568 n=1 Tax=Pylaiella littoralis RepID=Q94Z24_PYLLI Length = 568 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 106/351 (30%), Positives = 167/351 (47%), Gaps = 39/351 (11%) Query: 31 AEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSN 90 A A R ++KG +TPG++G +L ++ R +Y P +RVYIPKS Sbjct: 57 ALAVRAITTNKGKNTPGINGEIWDTSIKKLDAIHRLGR----VSNYSCSPVKRVYIPKSG 112 Query: 91 GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG 150 GKLRPLGIP + DR +Q +A++PI E SYGFR RS + L L+ Sbjct: 113 GKLRPLGIPNMYDRGLQYLWKLALDPIAECRADRHSYGFRKGRSTQDVHTILHLLLSP-- 170 Query: 151 ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGL--FRAA 208 ++R WV+E D+ +FD ++H +++ + +L + +KAG ++ F Sbjct: 171 KSRCDWVLEADIRGFFDNINHDWIIQNI------PMDKNILREWLKAGALETTTQEFHKG 224 Query: 209 SEGVPQGGVISPLLSNIMLNEFDQYLHE--RYLSGKARKDRWYWNNSIQRGRSTAVRENW 266 GVPQGG ISPL++N+ L+ + ++ ++L K+++ Sbjct: 225 IAGVPQGGPISPLIANMTLDGLEVWVANSVKHLYKKSKET-------------------- 264 Query: 267 QWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI 326 W P V RYADDFV + TK +E I + S L LN +KT I V GF Sbjct: 265 SWSPKVNVVRYADDFV-VTAATKRILEDIVKPSIQDFLASRGLVLNQEKTCITSVKKGFD 323 Query: 327 FLG--HRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEILL 375 F+G R+ +S + + +E R + + + + SGEI++ Sbjct: 324 FVGFNFRVYPDKSGPKGAKSIVKPTKEGKRRLRSKIRNAVKTNKSSGEIIV 374 >UniRef50_Q6EI10 Reverse transcriptase/HNH endonuclease n=2 Tax=Eukaryota RepID=Q6EI10_9CHLO Length = 600 Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust. 
Identities = 96/333 (28%), Positives = 160/333 (48%), Gaps = 47/333 (14%) Query: 35 RITLSSKGAHTPGVDGVNKTMLQARL--AVELQILRDELLSGHYQPLPARRVYIPKSNGK 92 R T + G T GVD + RL A L+I Q P RRV+IPK Sbjct: 70 RATQDNTGKKTAGVDKARALTPKQRLELASSLRI--------PTQSSPLRRVWIPKPGTD 121 Query: 93 -LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGE 151 +RPLGIP ++DR +Q + +EP WE+ F S+GFRP RS A+ ++ + + Sbjct: 122 VMRPLGIPTIKDRCLQALFKLMLEPEWEAKFEPSSFGFRPGRSCRDALAAIQANI----Q 177 Query: 152 TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEG 211 R ++V++ D++ FD ++H+ L+ + R + L W +KAG +D F G Sbjct: 178 KRSKYVLDADIAKCFDRINHKALLDKIGMTGGFGRQL-LAW--LKAGVLDGSTFSETDLG 234 Query: 212 VPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA 271 PQGG++SP+L+NI L+ + +L ++ + T ++++ + + Sbjct: 235 TPQGGIVSPVLANIALHRMEDHL-----------KKFVCQFPMTYASGTVIKKSRRGE-T 282 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-------- 323 V RYADDFV++ K + E R + G + L + KT++ H + Sbjct: 283 VTLIRYADDFVVLHHDKKILLACKAELVRWL--GEMGLEFSPTKTRLTHTLELQSDDVEA 340 Query: 324 -------GFIFLGHRLIRKRSRYGEMRVVSTIP 349 GF+FLG+++ + S+YG + + IP Sbjct: 341 EGFDGTVGFVFLGYQIKQFASKYGSAKSTAGIP 373 >UniRef50_A8ZN56 RNA-directed DNA polymerase n=2 Tax=Cyanobacteria RepID=A8ZN56_ACAM1 Length = 432 Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 110/350 (31%), Positives = 157/350 (44%), Gaps = 56/350 (16%) Query: 39 SSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGI 98 S G GVDGV K Q L LQ L +L Y+P P R+V IPK +G +RPLGI Sbjct: 39 SLDGNKALGVDGVTKAEYQENLETNLQNLHLKLRQMSYRPQPVRQVEIPKEDGSMRPLGI 98 Query: 99 PALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVI 158 D++VQ +E I+E F SYGFRP+RS H A+R + ++ WV Sbjct: 99 SCTEDKVVQEMTRRILEAIYEPVFIDTSYGFRPKRSCHDALRQLNREVM---RKPVNWVA 155 Query: 159 EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVI 218 + DL+ +FDT+ H+ ++ + RI D + L+ + +KAG G G PQG ++ Sbjct: 156 DIDLAKFFDTMPHQEILSVLSIRIKDGNLLRLIARMLKAGIQTPGGVVYDELGSPQGSIV 215 Query: 219 SPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC--- 275 SP+++NI L D L D+W+ N VR + + YC Sbjct: 216 SPVIANIFL---DYVL-----------DQWFTN---------VVRHHCR-----GYCAII 247 Query: 276 RYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHVN---------- 322 RYADD + + + + +R VL L+ LRLN KT + Sbjct: 248 RYADDVAAVFEHEEDAIRFMR-----VLPRRLEKYGLRLNTKKTHLLAFGKRNARRCFQT 302 Query: 323 ----DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 F FLG RSR G +R+ +++ R L L KVR Sbjct: 303 GQRPSTFDFLGLTHYWGRSRKGYVRMKRKTSKKRLRRSLKQLKMWLRKVR 352 >UniRef50_Q3B1V7 RNA-directed DNA polymerase n=31 Tax=Bacteria RepID=Q3B1V7_PELLD Length = 495 Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 103/304 (33%), Positives = 147/304 (48%), Gaps = 52/304 (17%) Query: 33 AARITLSSKGAHTPGVDG-VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNG 91 A + ++G +TPGVDG V KT A L Y+PLP RR YIPK NG Sbjct: 63 AVKRVTENRGKNTPGVDGDVWKTSKAKANAAA------SLRRRGYKPLPLRRTYIPKKNG 116 Query: 92 KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGE 151 K RPLGIP ++DR +Q +A+EP+ E+ SYGFRP RS V Q C Sbjct: 117 KQRPLGIPTMKDRAMQALYWLALEPVAETTADGNSYGFRPWRSTA----DVAEQCFICLA 172 Query: 152 TR--GRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI-DVGLFRAA 208 R +W++E D++ FD + H+ L+ + +L K +KAG + + LF A Sbjct: 173 RRDSAQWILEADIAGCFDAISHQWLVDNI------PMDTPILRKWLKAGFVFNNELFPTA 226 Query: 209 SEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQW 268 S G PQGG+ISP L+N+ L+ +Q L + + R + Sbjct: 227 S-GTPQGGIISPGLANMSLDGLEQALATAFPQARRRGLK--------------------- 264 Query: 269 KPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR---LNMDKTKIPHVNDGF 325 + RYADDF++ T E + E V+ LK R L+ +KT++ H+ +GF Sbjct: 265 ---MHMVRYADDFII----TGNSKEWLEHEIMPVVVDFLKKRGLWLSEEKTRVTHITEGF 317 Query: 326 IFLG 329 FLG Sbjct: 318 DFLG 321 >UniRef50_C6MS68 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MS68_9DELT Length = 439 Score = 134 bits (337), Expect = 6e-30, Method: Compositional matrix adjust. 
Identities = 114/360 (31%), Positives = 158/360 (43%), Gaps = 46/360 (12%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 R +A A D R + L R +T L + S+ GVD V L Sbjct: 16 RGIADKAKADKQHRFRNLYRELTAEYLLNCWPDLNKSA----ASGVDKVTAEAYAEELHG 71 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + L + L Y+ RR +IPK N K RPLGIPAL D++VQ A + I+E DF Sbjct: 72 NILNLAERLKDKKYRTKLVRRCWIPKENEKERPLGIPALEDKLVQLACAKLLIAIYEQDF 131 Query: 123 HTLSYGFRPERSVHHAIR--TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 SYG+RP RS HA++ T LQ G +++E D+ +FD + H L+K + Sbjct: 132 LDHSYGYRPGRSAKHAVQDLTFDLQYGSYG-----YIVEADIKGFFDRMDHDWLLKMLSL 186 Query: 181 RISDARFMTLLWKTIKAGHIDV-GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RI+D F+ L+ K +KAG ++ G G PQGG++SP+L+N+ YLH Sbjct: 187 RINDRAFLHLIEKWLKAGILETDGTVTNPYTGTPQGGIVSPVLANV-------YLH---- 235 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 D W+ R + A CRYADD+V + K + E Sbjct: 236 ---FALDLWFEEVVKPRCKGDA-----------RICRYADDWVCAFQ-LKDDAQRFYWEL 280 Query: 300 RGVLEGSLKLRLNMDKTKI-------PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEK 352 LE L +KT I P + F FLG R R GE RV ++K Sbjct: 281 PNRLE-KFHLETAPEKTNIVRFSRFHPGMERRFTFLGFEFFWLRDRQGEPRVKRRTSRKK 339 >UniRef50_C5ZZQ5 Putative uncharacterized protein n=1 Tax=Escherichia coli Vir68 RepID=C5ZZQ5_ECOLX Length = 255 Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 65/95 (68%), Positives = 74/95 (77%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 DPS RI +LLRLITQ E LAEAAR+T SSKGAH+P V+GVN LQA + ELQ L Sbjct: 145 MGGQDPSRRISQLLRLITQSEGLAEAARLTFSSKGAHSPCVEGVNTAKLQAGRSAELQRL 204 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALR 102 +EL+ GHYQP+PAR YIPKSNGKLRPLGIPA+R Sbjct: 205 SEELIFGHYQPMPARWGYIPKSNGKLRPLGIPAMR 239 >UniRef50_C6I8L1 CRISPR-associated protein n=1 Tax=Bacteroides sp. 3_2_5 RepID=C6I8L1_9BACE Length = 756 Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust. 
Identities = 90/285 (31%), Positives = 140/285 (49%), Gaps = 42/285 (14%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G+DG + + RL L L+ EL+S + P P R+ I K+ + R LG+ ++D+IV Sbjct: 29 GIDGFTLSHFEKRLNDNLIELQHELISQTWNPEPYLRIEITKNETEKRKLGLLCIKDKIV 88 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q+A+ A+EP E F LSYG+RP + AI+ V + D + + +V + D+ +YF Sbjct: 89 QQAIKTAIEPQLEKTFLNLSYGYRPNKGPERAIKRV---VHDLKKLKSGYVAKLDIDNYF 145 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGL-FRAASEGVPQGGVISPLLSNI 225 DT++H L + + D + L+ I+ G + L ++ ++GVPQG ++SPLL+N Sbjct: 146 DTINHERLFTRLANWLKDDETLRLIRLCIQTGIVTPQLQWQEINKGVPQGAILSPLLANF 205 Query: 226 MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 L+ FDQ+ + Y RYADDF LI Sbjct: 206 YLHPFDQFAANK---------------------------------VPMYIRYADDF-LIA 231 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLG 329 T+ Q++ E + LE L+LN T I H +DG FLG Sbjct: 232 TSTEKQIKEAVELVKEELESQFYLQLN---TPIIHNFHDGIEFLG 273 >UniRef50_A9ENQ0 Integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Bacteria RepID=A9ENQ0_SORC5 Length = 439 Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust. 
Identities = 95/282 (33%), Positives = 136/282 (48%), Gaps = 37/282 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 K+ A DP + L LI + E L A R S + GVDG+ K L Sbjct: 16 KVRERAERDPEGVLLALAHLIDE-EALQRAYR---SLRNEAAVGVDGITKEQYGQDLEHN 71 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ L + S Y+ P RRV+IPK GK RP+GI D+IVQ A+ +E I+E F Sbjct: 72 VRDLHARMKSMRYRHQPIRRVHIPKERGKTRPIGISCTEDKIVQAAVREMLEVIYEPVFR 131 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 +SYGFRP RS H A+R + L E W++E D+ S+FD++ LM+ ++ R++ Sbjct: 132 DVSYGFRPGRSAHDALRALNRMLLGGVE----WILEADIESFFDSIDRTKLMEMLQARVA 187 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D + L+ K + G +D F A +G QG V+SPLL N+ YLH Sbjct: 188 DKSLLRLVGKCLHVGVLDGAEFYAPEDGTVQGSVLSPLLGNV-------YLHHVL----- 235 Query: 244 RKDRWYWNNSIQR--GRSTAVRENWQWKPAVAYCRYADDFVL 283 D W R G++T + RYADDF++ Sbjct: 236 --DLWIEREVQPRLVGKATLI-------------RYADDFII 262 >UniRef50_D0VMZ3 Putative reverse-transcriptase protein n=1 Tax=Volvox carteri f. nagariensis RepID=D0VMZ3_VOLCA Length = 611 Score = 131 bits (329), Expect = 5e-29, Method: Compositional matrix adjust. 
Identities = 111/351 (31%), Positives = 174/351 (49%), Gaps = 46/351 (13%) Query: 30 LAEAARITLSSKGAHTPGVDG--VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIP 87 L ++T +KG TPGVD V + R+A EL+ L G +P+ RRV++P Sbjct: 62 LLSVLQVTALNKGRKTPGVDKQIVIAGPDKLRMAKELR------LDGTAKPI--RRVWLP 113 Query: 88 K-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 K + RPLGIP + DR Q +A+EP WE+ F SYGFRP RS H AI + L L Sbjct: 114 KPGKDEKRPLGIPTIEDRAKQNLAKLALEPEWEAIFEPNSYGFRPGRSCHDAIEAIFLNL 173 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFR 206 + +++ + D+ FD + H L+ + + + ++ +KAG ++ G Sbjct: 174 R---HKKTKFIYDADIRKCFDRIDHGALIAKLN---TFPQMERQIYAWLKAGIME-GYAN 226 Query: 207 A-------ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 A ++ G PQGG+ISPLL+NI L+ + +L E + + K + + N Q + Sbjct: 227 APKSYEPESNLGTPQGGIISPLLANIALHGLENHLKE-FCATKVSNEIFQTRNRSQESK- 284 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + A RYADDFV I+ K +E E + L+ + L +N +K+K+ Sbjct: 285 ---------RKACGVIRYADDFV-IIHENKQVIELCVTETKNWLQ-HIGLDINNEKSKLR 333 Query: 320 HVNDGFIFLGHR--LIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 +GF FLG + LIR+ G+ + P +K + ++LL KVR Sbjct: 334 DSREGFKFLGLQIILIRRGKMDGQGYRIKITPSKKNQ------SSLLEKVR 378 >UniRef50_Q7GEU5 Putative uncharacterized protein (Fragment) n=3 Tax=Candida parapsilosis RepID=Q7GEU5_CANPA Length = 933 Score = 130 bits (328), Expect = 6e-29, Method: Compositional matrix adjust. 
Identities = 102/357 (28%), Positives = 168/357 (47%), Gaps = 45/357 (12%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 LA L ++R + L+ + A S+G+ T GVD V + A+ + Sbjct: 308 LANIHGAKSELVMKRQMMLVNSIIFRLHAVDKLSHSRGSLTSGVDNVCIEGVDKDRALLV 367 Query: 65 QILRDELLSGH------YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 +I+ E L Y+ P +RV+IPK N KLRP+GIP L+DR +Q + + +EP+ Sbjct: 368 EIV--EWLGTTVKHPKLYKSDPVKRVWIPKGNEKLRPIGIPTLKDRGLQYLINLVVEPLV 425 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR---------------------GRWV 157 E +YGFRP RS +AI ++ L ++ +W+ Sbjct: 426 EMTSDPHNYGFRPYRSTKNAIAYLRSHLHTIDSSKKGNHFTTASNVENNLLRLLPENKWI 485 Query: 158 IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGV 217 ++ D+ +FD ++H L+ + + + ++ +K+G ID +F+ G PQGGV Sbjct: 486 LDADIKGFFDNINHDWLLNNLTLH---PKLLLIIKAWLKSGVIDGKIFQLTESGTPQGGV 542 Query: 218 ISPLLSNIMLNEFDQYLHER-YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCR 276 ISP L N LN ++ + E Y K+++ R ++ G T V ++AY R Sbjct: 543 ISPTLVNFTLNGLEKVVMEALYPLTKSKEQR--IRIKLKDGTYTCV------ASSLAYVR 594 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND---GFIFLGH 330 YADDFV++V+ + I+ L L LN +KTK+ ++D FLG+ Sbjct: 595 YADDFVVLVRSKHIMLTFIKPAIEKFL-AERGLNLNEEKTKLFRLSDPGCQLDFLGY 650 >UniRef50_A5IEI2 Reverse transcriptase n=7 Tax=Bacteria RepID=A5IEI2_LEGPC Length = 454 Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust. 
Identities = 102/313 (32%), Positives = 146/313 (46%), Gaps = 54/313 (17%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPAL 101 A + G+D + L L L + + SG Y P + V IPK G +R LGIP + Sbjct: 62 NAGSAGIDNQSIDEFSQDLKGNLYKLWNRMSSGSYFPPAVKEVAIPKKQGGVRKLGIPTV 121 Query: 102 RDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGD 161 DRI Q + + MEP+ E F SYG+RP +S A+ V + C E WV+E D Sbjct: 122 ADRIAQMTVKLMMEPLLEPHFLDDSYGYRPNKS---ALDAVGVTRKRCWEYD--WVVEFD 176 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV-GLFRAASEGVPQGGVISP 220 + FD + H LLMKAV+ ISD + + + + A D G + G PQGGVISP Sbjct: 177 IKGLFDNLSHELLMKAVKHHISDRWILLYVERWLTAPIQDQHGGCLPRTAGTPQGGVISP 236 Query: 221 LLSNIMLN-EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYAD 279 LLSN+ L+ FD ++ + + P +CRYAD Sbjct: 237 LLSNLFLHYAFDHWMTKHH-------------------------------PDNPWCRYAD 265 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG----------FIFLG 329 D + + T+ + E + +E + SL L ++ DKTKI + DG F FLG Sbjct: 266 DGLAHCR-TEKEAEQMLKEIDKRFK-SLGLEIHPDKTKIVYCKDGARKGKYKNKSFDFLG 323 Query: 330 H----RLIRKRSR 338 + R ++ RSR Sbjct: 324 YTFKARRVKVRSR 336 >UniRef50_Q1D1V6 Group II intron, maturase n=2 Tax=Myxococcales RepID=Q1D1V6_MYXXD Length = 436 Score = 130 bits (326), Expect = 9e-29, Method: Compositional matrix adjust. 
Identities = 111/340 (32%), Positives = 165/340 (48%), Gaps = 62/340 (18%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR-LAVELQIL 67 A + P+ R L + +PE L EAA + G PG DG+ ++ R A L + Sbjct: 19 AKSAPTHRFWGLYVHVLKPEVL-EAAYLEARRNGG-APGQDGITFEHIEERGRAGFLGAV 76 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 +EL +G Y+P P RR IPK GK+R + IP++RDR+VQ A+ + +EPI+E+DF S+ Sbjct: 77 AEELRTGTYRPRPYRRREIPKEGGKVRVISIPSIRDRVVQGALRLVLEPIFEADFSGSSF 136 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 G RP RS H AI TV+ L V++ DL +YFD++ H L++ V RR+ D Sbjct: 137 GARPGRSAHEAIDTVRQGLRRRRHR----VVDVDLKAYFDSIRHAPLLERVARRVQDGEV 192 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDR 247 + L+ + +++ G+PQG +SPLL+NI LN+ D L Sbjct: 193 LALVKQFLRS---------TGDRGIPQGSPLSPLLANIALNDLDHVL------------- 230 Query: 248 WYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKA-------QVEAIREECR 300 RGR + Y RY DD V++ ++ +E IR+E Sbjct: 231 -------DRGRGF-----------LTYARYLDDMVVLAPDSEKGRRWAARALERIRQEAE 272 Query: 301 GVLEGSLKLRLNMDKTKIPHVND---GFIFLGHRLIRKRS 337 +L + LN +KT+ + D F FLG KRS Sbjct: 273 -----ALGVSLNKEKTRTVTMTDRNASFAFLGFDFRWKRS 307 >UniRef50_Q1Q3I7 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3I7_9BACT Length = 316 Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 69/188 (36%), Positives = 106/188 (56%), Gaps = 4/188 (2%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G DGV + L + L+I+R EL Y PLP R+ + K NG+ R L IP++RDRIV Sbjct: 28 GADGVTIERYEGNLDLNLRIMRKELTEQTYFPLPLLRILVDKGNGEARALCIPSVRDRIV 87 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q A+L +EP+ E +F S+ +R RSV A+ V+ + E +WV++ D+ ++F Sbjct: 88 QAAVLQLIEPVLEKEFEECSFAYRKGRSVKQAVYKVR----EYYEQGYQWVVDADIDAFF 143 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIM 226 D+V + LL+ + I D L+ +K D +G+PQG ISP+L+N+ Sbjct: 144 DSVDYSLLLLKFKCYIHDPCIQNLVGLWLKGEVWDGKTVTTLKKGIPQGSPISPILANLY 203 Query: 227 LNEFDQYL 234 L+EFD+ L Sbjct: 204 LDEFDEEL 211 >UniRef50_UPI00016C4F75 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4F75 Length = 257 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 81/214 (37%), Positives = 119/214 (55%), Gaps = 9/214 (4%) Query: 9 AATDPSLRI-QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 AA +PS + RL+ + Q L +A ++KGA PG+DG+ +A Q L Sbjct: 45 AALEPSRALTDRLMEEVCQRGNLNQAYSRVKANKGA--PGIDGMTVEDSLRWIAEHKQEL 102 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 LL G Y+P P R V IPK G R LGIP + DR+VQ+A+L + + + F SY Sbjct: 103 LSSLLDGSYRPSPVRGVLIPKPGGGERQLGIPTVVDRLVQQAILQVLTRLLDPTFSESSY 162 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRW-VIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 GFRP +S H A+ K + D GR V++ DL +FD V+H +LM + RR+SD R Sbjct: 163 GFRPGKSAHQALLKAKEYVAD-----GRAIVVDVDLEKFFDRVNHDILMARLARRVSDTR 217 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISP 220 + ++ + ++AG + G+ A EG PQGG++ P Sbjct: 218 LLRIVRRFLEAGLMQDGVCVARHEGTPQGGIVDP 251 >UniRef50_Q94Z25 Orf557 n=2 Tax=Pylaiella littoralis RepID=Q94Z25_PYLLI Length = 557 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 96/285 (33%), Positives = 140/285 (49%), Gaps = 39/285 (13%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL-- 72 R+Q+ +L+ E A A R +S++G+ T G DG Q + + +RD LL Sbjct: 40 FRLQQ--QLMHSFEGRATAVRKVVSNEGSKTKGPDGKTWKKSQDKYRA-IADIRDHLLTK 96 Query: 73 SGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 SG Y+ RRV+IPKS+ G+LRPLGIP + DR +Q +L ++PI E + + SYGFR Sbjct: 97 SGSYKAGAVRRVWIPKSSPGELRPLGIPNMIDRALQALVLSCLDPIVEENSDSCSYGFRK 156 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 RS + AI+ ++ L G R W + D+S FD + H L K VR + + L+ Sbjct: 157 YRSTNDAIQRIRFILDKAGAPRYIW--DADISKCFDNISHTFLNKVVRENLC-RKGCELV 213 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE-----RYLSGKARKD 246 +KA I+ G S G PQGGV+SPLL N+ LN + + + +G+ K Sbjct: 214 EAWLKAPIIEKGSKSYPSRGTPQGGVLSPLLCNMTLNGLENVIRDGLPSSSSTAGRKLKG 273 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 RW RYADDF++ K+Q Sbjct: 274 RW-------------------------VVRYADDFIITNPIGKSQ 293 >UniRef50_O99479 Reverse transcriptase homolog n=2 Tax=Eukaryota RepID=O99479_PAVLU Length = 636 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 97/330 (29%), Positives = 156/330 (47%), Gaps = 49/330 (14%) Query: 39 SSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGI 98 ++K + PG+DG K V LQ EL+S YQP +RV IPK NG +R LGI Sbjct: 67 ATKSSAAPGLDGDRKANFSEAKLVALQA---ELISQKYQPKTTKRVAIPKPNGGIRYLGI 123 Query: 99 PALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVI 158 + RD+IVQ ++ A++ + F S+GFRP H A++ V+ + W+I Sbjct: 124 SSQRDKIVQASIQNALQSKYGKHFSPDSFGFRPGLGCHDALKHVRNTWQNI-----TWII 178 Query: 159 EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI---DVGLF--RAASEGVP 213 D+ FDT++H +L++ + R + D + L+ K IK G++ D F ++ G P Sbjct: 179 SIDIEKCFDTINHTILLQ-ILRPLVDQPTLELISKLIKVGYVEMFDTTCFPISESTIGTP 237 Query: 214 QGGVISPLLSNIMLNEFDQYLHE------------RYLSGKARKDRWYWNN--------- 252 QG +ISPLL N ++ D +L + Y+ G + N+ Sbjct: 238 QGSLISPLLCNFYMHILDTFLQKVLIPQWNVGDERSYVKGCQNRKAMDVNDKAIVEAYPE 297 Query: 253 ---SIQR---------GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 IQR G+ + + ++ + Y RYADD V+ G ++ I ++ Sbjct: 298 LEGQIQRIKHNRWVTEGKGSRDPNDANFR-RLRYVRYADDIVIGFTGPYSEALVILDQVV 356 Query: 301 GVLEGSLKLRLNMDKTKIPHV-NDGFIFLG 329 LE +L ++N +K+ I H +G FLG Sbjct: 357 KFLEKTLCFKVNKEKSSINHSETNGIKFLG 386 >UniRef50_Q8TJY1 Reverse transcriptase n=5 Tax=Methanosarcina RepID=Q8TJY1_METAC Length = 512 Score = 129 bits (323), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 95/333 (28%), Positives = 165/333 (49%), Gaps = 38/333 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAE-AARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A+ A + + +L RL+T+ + + R ++KG+ TPG+DG+ + + Sbjct: 53 LQSRIASAAKNGKWITVNKLSRLLTRSLYAKLLSVRKVTTNKGSRTPGIDGIIWSSSADK 112 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + LQ L + Y+ P R YI K NGKLRPL IP + DR +Q + + PI Sbjct: 113 MRSALQ-----LTNKGYRAKPLTRKYIRKKNGKLRPLSIPTMYDRAMQTLHSLVLGPIES 167 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + S+GF+P RS A + + L+ + W++EGD+ + FD ++H ++ + Sbjct: 168 AIGDKTSFGFKPYRSTKDAYAYLHICLSK--KIAPEWIVEGDIKACFDEINHTWILDNIP 225 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 D R +L + +KAG+++ +G PQG ISP++ N+ LN + L R+ Sbjct: 226 M---DKR---ILKEFLKAGYVENYHLFPTEKGTPQGSPISPIIGNMALNGLENALAMRFY 279 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 S R D + ++ Q + V R+ADDFV + +E I Sbjct: 280 S---RSD-------------GTIDKSHQNRHKVNCARFADDFVATADSPETALEII---- 319 Query: 300 RGVLEGSLK---LRLNMDKTKIPHVNDGFIFLG 329 V++ L L+L+ +KT + ++++GF FLG Sbjct: 320 -DVIQEFLDPRGLKLSEEKTLVTNISEGFNFLG 351 >UniRef50_Q8A4I4 Reverse transcriptase n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8A4I4_BACTN Length = 430 Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 100/330 (30%), Positives = 149/330 (45%), Gaps = 75/330 (22%) Query: 45 TPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDR 104 + G+D V + L L L + + SG Y P + V IPKS G RPLGIP + DR Sbjct: 29 SAGIDKVTLEDYEKNLRGNLYKLWNRMSSGSYFPPSVKLVEIPKSTGGKRPLGIPTVSDR 88 Query: 105 IVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR-----WVIE 159 + Q A++M + P E FH SY +RP RS H A+ G+ R R WV++ Sbjct: 89 VAQMAVVMLITPSIEPCFHEDSYAYRPHRSAHDAV----------GKARERCWKYAWVLD 138 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDA-------RFMTLLWKTIKAGHIDVGLFRAASEGV 212 D+S +FDT+ H LL+KA++R + R++ + ++ +D L GV Sbjct: 139 MDISKFFDTIDHELLLKALKRHTQEKWVLMYIERWLKVPYEKSDGSQVDRAL------GV 192 Query: 213 PQGGVISPLLSNIMLN-EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA 271 PQG VI P+L+N+ L+ FD+++ + + P Sbjct: 193 PQGSVIGPVLANLFLHYTFDKWMEKNF-------------------------------PR 221 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV---------- 321 V + RYADD + K Q E ++ + E +LRLN +KTKI + Sbjct: 222 VPFERYADDTICHCHSLK-QAEYMQAMIQQRFE-CCRLRLNEEKTKIVYCKSSRQKECYP 279 Query: 322 NDGFIFLGHRLIRKRS--RYGEMRVVSTIP 349 N F FLG + S +YG R +P Sbjct: 280 NVTFDFLGFTFQPRESVDKYGN-RFTGFLP 308 >UniRef50_Q2FUJ3 RNA-directed DNA polymerase n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FUJ3_METHJ Length = 487 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. 
Identities = 94/327 (28%), Positives = 153/327 (46%), Gaps = 53/327 (16%) Query: 9 AATDPSLRIQRLLRLITQPEWLAE--AARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 A + R+ R L+ + + A+ A + ++G + GVDG T + ++ L Sbjct: 45 AVKNGQYRLARRLQYLLTHSFYAKMLAVQRVTKNRGKRSAGVDGEKWTTPEQKMKAAL-- 102 Query: 67 LRDELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 L Y+ P RR+YIPK + K+RPL IP + DR +Q MA+ P E+ Sbjct: 103 ---TLSDKGYRAKPLRRIYIPKPQSSKMRPLSIPTMYDRAMQALYAMALMPWAETTADKT 159 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 S+GFR +R+ A L+ +T G+W++EGD+ FD H+ ++ + D Sbjct: 160 SFGFRMKRNAQDAASYTFQCLSR--KTSGQWILEGDIRGCFDNFAHQWMLDNI---PLDQ 214 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 R + + +KAG+I G+ G PQGG+ISPLL+N+ L+ ++ L E + K Sbjct: 215 RILN---QFLKAGYIYDGILYRNKSGTPQGGLISPLLANMALDGMERMLKEHFPGNK--- 268 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 V R+ADDF++ + + +C+ ++ Sbjct: 269 --------------------------VHLIRFADDFLVTADSQETAL-----QCKELITE 297 Query: 306 SLK---LRLNMDKTKIPHVNDGFIFLG 329 L L L+ +KTKI H+N+GF FLG Sbjct: 298 FLHERGLELSEEKTKIVHINEGFDFLG 324 >UniRef50_B3JM52 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JM52_9BACE Length = 431 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. 
Identities = 87/295 (29%), Positives = 142/295 (48%), Gaps = 40/295 (13%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR 83 + P L A +++KG+ GVDGV+ L+ + + L + + G+YQ P Sbjct: 5 VVHPFNLQRALEHVIANKGS--AGVDGVSIRELRKVFSEKKLQLIEAIKQGNYQVQPILG 62 Query: 84 VYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVK 143 + IPK NGK R LG+P +R++Q+A+ + P++E +F SYGFRP ++ A+ Sbjct: 63 IEIPKGNGKTRLLGVPTTTERVLQQALAQTIAPLFEPEFSNYSYGFRPHKNARQAVG--- 119 Query: 144 LQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVG 203 Q D + +++ DL ++FD V H LL+ + +++ M L+ K ++A G Sbjct: 120 -QSRDYIHSGLNHIVDIDLKNFFDEVDHCLLLNLIYQKVKCKATMQLIRKWLRAPIKING 178 Query: 204 LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 R +GVPQG +SPLLSNI+L++ D+ + R Sbjct: 179 KLRKRRKGVPQGSPLSPLLSNILLHQLDKEMTRR-------------------------- 212 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF + K + Q +A R L+ LKL +N +K+ I Sbjct: 213 -------GHKFVRYADDFSIYCK-SHNQAKATRVVIEKFLKNKLKLTINEEKSGI 259 >UniRef50_A6DE66 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DE66_9PROT Length = 377 Score = 127 bits (320), Expect = 5e-28, Method: Compositional matrix adjust. 
Identities = 75/271 (27%), Positives = 133/271 (49%), Gaps = 42/271 (15%) Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++ ++++ L DELL G Y P P + + KSN K R + I +D++VQ+ + ++ + Sbjct: 26 KIDIDVKKLSDELLRGKYIPSPLQSFELKKSNNKTREIKILTDKDKLVQKVLYESINEFF 85 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 + F SYG+R +S AI+ K D + + +V + D+ ++F+ ++H L+ + Sbjct: 86 DKQFSNRSYGYRIGKSTIKAIKRCK----DFIKRKYFYVFKSDIKNFFENINHNKLISLL 141 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 R I D R + L+ + IK+G + F + GV QG ++SPLLSNI LNEFD++L + Sbjct: 142 DRNIEDKRIIRLIVQFIKSGILKKEYF-SHEIGVHQGDILSPLLSNIYLNEFDKFLESK- 199 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + + RYADDFV+ +K ++ E Sbjct: 200 --------------------------------NIEFVRYADDFVIFMKKNNKEI----PE 223 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 + ++ L ++ +K+ + GF FLG Sbjct: 224 ILNIFLKNIDLEISEEKSYFSDIYKGFSFLG 254 >UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular organisms RepID=Q12UG1_METBU Length = 592 Score = 127 bits (318), Expect = 8e-28, Method: Compositional matrix adjust. 
Identities = 93/318 (29%), Positives = 154/318 (48%), Gaps = 39/318 (12%) Query: 17 IQRLLRLITQPEWLAE-AARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 +++L L+T + A R + +KG T G+DG + ++ R L Sbjct: 80 VKKLSYLLTHSHYAKLLAVRKVIRNKGRRTAGIDG----EFWSTPVSKVNAAR-SLSDKR 134 Query: 76 YQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+ P +R++I K + K RPLGIP + DR +Q +A++PI E S+GFR RS Sbjct: 135 YKAKPLKRIFIEKYGSDKKRPLGIPTMYDRAMQALYALALDPIAEVTADKRSFGFRKFRS 194 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A + ++ + + ++EGD+ FD + H+ L+ + S +L + Sbjct: 195 THDACSQIFGTISK--KDSAQCILEGDIKGCFDNISHQWLIDNIPMDKS------ILKQF 246 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +KAG + G PQGG+ISP+L+N+ L+ + L ++Y Sbjct: 247 LKAGFVYENSLFPTKAGTPQGGIISPILANMTLDGIEGVLADKY---------------- 290 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRL 311 RG S + + K V + RYADDF++ A+ + I EE + +++ L L L Sbjct: 291 HRGVSGKITTRQRAKHKVNFVRYADDFIVT-----AKTKEIAEEAKELIKNFLTDRGLEL 345 Query: 312 NMDKTKIPHVNDGFIFLG 329 + +KT I H++DGF FLG Sbjct: 346 SDEKTLITHIDDGFDFLG 363 >UniRef50_Q67M30 Group II intron-encoding maturase n=1 Tax=Symbiobacterium thermophilum RepID=Q67M30_SYMTH Length = 248 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 65/156 (41%), Positives = 94/156 (60%), Gaps = 4/156 (2%) Query: 40 SKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIP 99 + PGVDGV L+ ++ VE + +R+ELL G Y+P P RRV IPK G R LGIP Sbjct: 18 ERNGGAPGVDGVPTERLRDQIRVEWERIREELLRGTYRPQPVRRVEIPKPGGGKRMLGIP 77 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 + DR++Q+A+L + PI++ F SYGFRP R H A+R + Q + G WV++ Sbjct: 78 TVMDRLIQQALLQVLTPIFDPTFSESSYGFRPGRRGHDAVRKAR-QYVEEGYD---WVVD 133 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 DL +FD V+H +LM V RR++D R + L+ + Sbjct: 134 MDLEKFFDRVNHDVLMARVARRVTDKRVLRLIRGVV 169 >UniRef50_A8LGE6 RNA-directed DNA polymerase n=1 Tax=Frankia sp. 
EAN1pec RepID=A8LGE6_FRASN Length = 351 Score = 124 bits (311), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 85/246 (34%), Positives = 132/246 (53%), Gaps = 17/246 (6%) Query: 14 SLRIQRLLR-LITQPEWLAEAARITLSSKGAHTPG--VDGVNKTMLQARLAVELQILRDE 70 L ++R+ R L +L R+ S+KGA TPG VDG++ LA +I+ D Sbjct: 38 GLPLERVYRQLFNAALYLVAYGRL-YSNKGAMTPGETVDGMS-------LATIDRII-DA 88 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + Y+ P +RV+IPK NGK RPLG+P D++V + + +E +E F S+GFR Sbjct: 89 MRHERYRWKPVKRVHIPKKNGKKRPLGLPTWSDKLVAEVVRLLLEAYYEPTFSDHSHGFR 148 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P R+ H A+ V D + W IEGD++ F+ + H++++ V RI D RF+ L Sbjct: 149 PGRACHTALGEV----VDVWKGT-HWFIEGDIARCFEELDHQVMLDTVGERIHDNRFLGL 203 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 L ++AG+++ + A G QGG SP+LSNI L+ D ++ L R +R Sbjct: 204 LKAMLRAGYLEDWKWGATLSGTVQGGPASPILSNIYLDRLDSFVVTHLLPDYNRGERRAS 263 Query: 251 NNSIQR 256 N + Q+ Sbjct: 264 NPAYQK 269 >UniRef50_Q8RSV8 Maturase n=1 Tax=uncultured marine bacterium RepID=Q8RSV8_9BACT Length = 386 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. 
Identities = 91/276 (32%), Positives = 135/276 (48%), Gaps = 44/276 (15%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 GVD V+ + + L L + L SG Y P P + V IPK +GK R LGIP + DR+ Sbjct: 2 GVDHVSMEAIASNPRKYLYPLWNRLSSGSYFPPPVKLVPIPKGDGKERMLGIPTIIDRVA 61 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR-GRW-VIEGDLSS 164 Q + +E I E FH S+G+RP +S H A L C + RW V++ D+ Sbjct: 62 QEVIKAELEVIVEPRFHPSSFGYRPHKSAHEA-------LEQCAKNSWERWYVVDLDIKG 114 Query: 165 YFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLS 223 +FD + H +M +R+ + + + +K D VG +A +G PQGGVISPLL+ Sbjct: 115 FFDNIDHEKMMGILRKHTNKKHILLYCDRWLKTPMQDRVGGVQARMKGTPQGGVISPLLA 174 Query: 224 NIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFV 282 N+ L+E FDQ++ ST +P + + RYADD V Sbjct: 175 NLYLHEAFDQWI------------------------STT-------QPRIVFERYADDIV 203 Query: 283 LIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + + + Q I ++ + L+ S L L+ DKTKI Sbjct: 204 IHTRSME-QSHFILDKLKARLK-SYSLELHPDKTKI 237 >UniRef50_P19593 Probable reverse transcriptase n=2 Tax=Scenedesmus obliquus RepID=RDPO_SCEOB Length = 608 Score = 122 bits (305), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 103/349 (29%), Positives = 165/349 (47%), Gaps = 58/349 (16%) Query: 31 AEAARITLSSKGAHTPGVD----GVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 A A + SSKG+ +PG+ NK + A +A QI + Y+ P R+YI Sbjct: 69 AVAVQTVASSKGSRSPGLSRESFKTNKNYV-AMMATLEQITSNP---HKYKATPLSRIYI 124 Query: 87 PKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 PK +G RPL IP+ DR +Q +A+EP+ E SYGFRP R+V A+ V L Sbjct: 125 PKRDGSARPLSIPSYTDRCLQALYKLAIEPMAEEVADLSSYGFRPMRNVSWAVGRV-LNG 183 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI--DVGL 204 + ++V+E D+ D ++H+ + +++ +LW +K G+I + Sbjct: 184 LNNPLANYQYVVEIDIKGCVDNINHQFI-----SQVTPFIPKKILWAWLKCGYIERNSNT 238 Query: 205 FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRE 264 + + GVPQGG+ISPL+ N+ L+ + +++++ IQ+ S + Sbjct: 239 LQPTTTGVPQGGIISPLIMNLTLDGLEFHIYKK----------------IQKSSSQS--- 279 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKA---QVEAIRE--ECRGVLEGSLKLRLNMDKTKIP 319 YCRYADD V++ + + A++E RG L + + KT I Sbjct: 280 -----KGNTYCRYADDMVILTTTEETALIALPAVKEFLAVRG-------LEVKLAKTTIK 327 Query: 320 H-VND--GFIFLGHRLIRKRSRYGEMRVVST--IPQEKARNFAASLTAL 363 + +ND GF FL R RK R R+ S IP +NF ++ A+ Sbjct: 328 NIINDRNGFEFLSFRF-RKVYRRNRKRLTSQVGIPISAIKNFRKNIKAI 375 >UniRef50_Q47277 Orf protein n=63 Tax=cellular organisms RepID=Q47277_ECOLX Length = 416 Score = 121 bits (304), Expect = 4e-26, Method: Compositional matrix adjust. 
Identities = 105/325 (32%), Positives = 147/325 (45%), Gaps = 66/325 (20%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 + A R +S GA G+D + RL L + + L SG Y P + V IPK Sbjct: 14 VVSAYRRVKTSAGA--AGIDKQSLADFDKRLVDNLYKIWNRLSSGSYFPPAVKAVAIPKK 71 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 G R LGIP + DRI Q + +A EP E F SYG+RP +S AI Sbjct: 72 LGGERILGIPTVSDRIAQTVVKLAFEPQVEPHFLADSYGYRPNKSALDAI---------- 121 Query: 150 GETRGR-----WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL---WKTIKAGHID 201 G TR R WV+E D+ FD + H L+MKAV + + AR++ L W T D Sbjct: 122 GVTRKRCWYYDWVLEFDIKGLFDNIPHELIMKAVDKH-NPARWVKLYIQRWLTAPMVMSD 180 Query: 202 VGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRST 260 G RA + G PQGGVISPLL+N+ ++ FD++L + Y Sbjct: 181 -GEVRARTMGTPQGGVISPLLANLFMHYVFDKWLAKYY---------------------- 217 Query: 261 AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH 320 P V + RYADD +L ++A+ +RE R L ++ +KT++ + Sbjct: 218 ---------PKVPWYRYADDGILHCH-SEAEATEMREVLRKRF-SECGLEMHPEKTRVIY 266 Query: 321 VNDG----------FIFLGHRLIRK 335 DG F FLG+ R+ Sbjct: 267 CKDGSRKGDYEHTMFDFLGYTFRRR 291 >UniRef50_B4WV39 Group II intron, maturase-specific domain family n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WV39_9SYNE Length = 586 Score = 121 bits (303), Expect = 5e-26, Method: Compositional matrix adjust. 
Identities = 100/324 (30%), Positives = 143/324 (44%), Gaps = 60/324 (18%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 +QRLL LA R+T +KG T GVDG+ R + + D+ Sbjct: 46 LQRLLSTSWSARMLA-VRRVTQENKGKKTAGVDGIASLKAPERTELAKNLTLDKNAD--- 101 Query: 77 QPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 P RRV IPK + R LGIP +RDR Q +A+EP WE+ F SYGFRP RS Sbjct: 102 ---PVRRVLIPKPGKSEYRKLGIPTMRDRAKQALAKLALEPQWEALFAPNSYGFRPGRSP 158 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 A+ Q+ C + +WV++ D+++ FD + H L+ + + W + Sbjct: 159 QDALE----QVHRCISQKPKWVLDADIAACFDQISHGPLVARLSQSHPSIARQCKAW--L 212 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 KAG +D G + G PQGG+ SPLL+NI L+ + + Sbjct: 213 KAGVLDNGQIQLTERGTPQGGIASPLLANIALHGLETLV--------------------- 251 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA---IREECRGVLEGSLKLRLN 312 T +R R+ADDFV+ + +A +A IR+ S L+L Sbjct: 252 ---KTTIR-------GAHLVRFADDFVVFHQDREAIFKAQTLIRQWL-----ASKGLKLR 296 Query: 313 MDKTKIPHV-------NDGFIFLG 329 DKT+I H + GF FLG Sbjct: 297 ADKTRIVHTLNGGAEYSTGFDFLG 320 >UniRef50_Q8GAR1 Reverse transcriptase n=20 Tax=Enterobacteriaceae RepID=Q8GAR1_ECOLX Length = 410 Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 98/304 (32%), Positives = 139/304 (45%), Gaps = 55/304 (18%) Query: 40 SKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIP 99 +KGA PG DG M + L + + L SG + P P IPKSNGK R LGIP Sbjct: 23 NKGA--PGCDGQTLKMFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIP 80 Query: 100 ALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 + DRI Q A+ + ME + FH SYG+RP +S H A++ ++ R W++E Sbjct: 81 TVSDRIAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRC-----WRYSWILE 135 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLL---WKTIKAGHIDVGLFRAASEGVPQGG 216 D+S++FD V H L++KA+ +++ L W + G + G PQGG Sbjct: 136 VDISAFFDHVRHDLVLKALEHH-GMPKWVILYCRRWMEAPMQSCENGELITRTRGTPQGG 194 Query: 217 VISPLLSNIMLN-EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC 275 VISPLL+N+ L+ FD ++ Y RG V + Sbjct: 195 VISPLLANLFLHYAFDLWMEREY-----------------RG--------------VPFE 223 Query: 276 RYADDFVLIVKGTKAQVEAIREECRGVLEGS-LKLRLNMDKTKIPHVN--------DGFI 326 RYADD IV +A R + R S + L LN KT I +++ F Sbjct: 224 RYADD---IVVHCSRMSDATRLKNRLSERFSEVGLVLNAGKTNIAYIDTFKRRNVATSFT 280 Query: 327 FLGH 330 FLG+ Sbjct: 281 FLGY 284 >UniRef50_D1RME6 Reverse transcriptase family protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RME6_LEGLO Length = 444 Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 9/207 (4%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPAL 101 G+ G+DGV K + +L LQ L + Y P +R V IPK +G RPL I Sbjct: 51 GSKAIGIDGVTKEVYGKKLEDNLQDLLARIRRHAYTPQASRLVEIPKEDGSTRPLAISCF 110 Query: 102 RDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGD 161 D+IVQ A+ + I+E F SYG+R ++ H A+R + + E R +E D Sbjct: 111 EDKIVQMAVTKLLTAIYEPLFLPCSYGYREGKNGHEALRAL---MKYSNEFRKGATLEID 167 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPL 221 L YF+T+ H L++ + ++I+D RF+ L+ K I++ + G G PQG +ISP+ Sbjct: 168 LRKYFNTIPHGKLLEILEKKITDRRFLKLIRKLIRSPVVANGKAELNELGCPQGSIISPI 227 Query: 222 LSNIMLNE-----FDQYLHERYLSGKA 243 LSNI L+ FD+ + + +L GK Sbjct: 228 LSNIYLHSVVDSWFDE-ISKSHLIGKT 253 >UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8YKQ2_ANASP Length = 562 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 98/334 (29%), Positives = 164/334 (49%), Gaps = 42/334 (12%) Query: 35 RITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLR 94 +I+ + G T G+DGV R +E+ + + SG++ R + IPK +G R Sbjct: 101 QISQLNAGKKTAGIDGVKSLDFNGRFELEITLKQS---SGNWHHQELREIPIPKKDGTTR 157 Query: 95 PLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG 154 L IP + DR Q A+EP E+ FH SYGFR R+ H A + + L+ + Sbjct: 158 MLKIPTIADRCWQCLAKYALEPAHEATFHARSYGFRTGRAAHDAQQFLFSNLSSKAKRIS 217 Query: 155 RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQ 214 + VIE D+ FD ++H +M+ + I+ +++ +KAG I+ +G PQ Sbjct: 218 KRVIELDIEKCFDRINHSTIMENL---IAPKGIKLGIYRCLKAG-INPEF---PEQGTPQ 270 Query: 215 GGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAY 274 GGV+SPLL+NI LN + +H R++ +N QR + + ++ P+V Sbjct: 271 GGVVSPLLANIALNGIES-IH-----------RYHKDN--QRITNKTPESDIRY-PSV-- 313 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPHVNDGFIFLG-H 330 RYADD V+++ + Q +A E +E L ++++ KTKI DGF FLG H Sbjct: 314 -RYADDMVIVL---RPQDDA--NEILAKIEDFLNARGMKVSAKKTKITATTDGFDFLGWH 367 Query: 331 
RLIRKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 +++ ++ T +E + F + A++ Sbjct: 368 IIVQSNGKFN-----CTPSEENFKKFRQKVKAIV 396 >UniRef50_B6FPD0 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FPD0_9CLOT Length = 252 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 72/206 (34%), Positives = 116/206 (56%), Gaps = 13/206 (6%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE--LQILRDELLS 73 + RL+ LI + + A R + G+ TPG DG T +Q+ L +E ++ +R++L Sbjct: 47 KFDRLMPLIVSEQNIILAYRNICKNNGSKTPGTDGKTITEIQS-LPIETVIKTVRNKL-- 103 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 YQP RRV IPK NGK RPLGIP++ DR++Q+ +L +EPI E+ FH + GFRP R Sbjct: 104 NFYQPKKVRRVEIPKDNGKTRPLGIPSIWDRLIQQCILQILEPICEAKFHERNNGFRPYR 163 Query: 134 SVHHAI-RTVKL-QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTL 190 S +AI + K+ QL + +V++ D+ +FD + H L++ + I D + + + Sbjct: 164 STQNAIAQCYKMAQLQNL-----HFVVDVDIVGFFDNIDHNKLIRQLWGLGIQDRKLIMI 218 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGG 216 + + +KA + + G PQGG Sbjct: 219 IKQMLKAEILFNDIVITPETGTPQGG 244 >UniRef50_B1I2T7 Retron-type reverse transcriptase-like protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I2T7_DESAP Length = 309 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 64/158 (40%), Positives = 92/158 (58%), Gaps = 5/158 (3%) Query: 28 EWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIP 87 E L +A R ++ GA PGVDG L L L EL +G Y+P P RV IP Sbjct: 15 ENLIQAYRAVRANNGA--PGVDGETVEAFGRNLDERLDQLHHELKTGTYEPQPVLRVEIP 72 Query: 88 KSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT 147 K +G RPLGIP +RDR+VQ+A+L ++PI++ DFH SYG+R RS H A+ + + Sbjct: 73 KPDGSNRPLGIPTVRDRVVQQALLNILQPIFDPDFHPSSYGYRLGRSCHQAVAKAERFMN 132 Query: 148 DCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 G V++ DLS FD ++H L+++ + R++SD Sbjct: 133 RYGLEH---VVDMDLSKCFDRLNHELILEGINRKVSDG 167 >UniRef50_Q8YWX6 Alr1468 protein n=4 Tax=Cyanobacteria RepID=Q8YWX6_ANASP Length = 668 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 85/291 (29%), Positives = 139/291 (47%), Gaps = 39/291 (13%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPAL 101 G+ + GVDG++ + ++ +LQ + +L Y PA+ YIPK NG R +GI + Sbjct: 17 GSKSAGVDGISVDLFESMATEQLQNIAYQLKEETYTANPAKGFYIPKKNGTKRLIGIHTV 76 Query: 102 RDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGD 161 RDRI+QR +L + E F SY +RP S+ A++ L + + +W+I+ D Sbjct: 77 RDRIIQRLLLDELYFPLEDTFLDCSYAYRPGHSIQQAVQ----HLYGYYQYQPKWIIKAD 132 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPL 221 ++ +FD + LL+ + + + LL + +K+G I G +R +GV QGG++S Sbjct: 133 VADFFDNLSWALLLTYLEELSLEPSLLQLLEQQLKSGIIIAGQYRNFGKGVLQGGILSGA 192 Query: 222 LSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDF 281 L+N+ L FD+ + +G + RY DDF Sbjct: 193 LANLYLTSFDR-------------------KCLSQG--------------INLVRYGDDF 219 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 V I + + I ++ G L G + L L +KT+I ND F FLG+R Sbjct: 220 V-IACNSWLEANRILDKITGWL-GEVYLTLQPEKTQIFTPNDEFTFLGYRF 268 >UniRef50_A6YE98 Putative reverse transcriptase and intron maturase n=1 Tax=Chlorokybus atmophyticus RepID=A6YE98_CHLAT Length = 845 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 94/335 (28%), Positives = 156/335 (46%), Gaps = 42/335 (12%) Query: 29 WLAEAARITLSSKGAHTPGVDG-VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIP 87 W+A ++ + G G ++ T L++ L+ LRD + G +Q + +VY Sbjct: 311 WIAAYKKLAPNPGSLTKSGAGGKIDGTSLKS-----LEWLRDRVSEGKFQFGRSEKVYTL 365 Query: 88 KSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 K G PL IP +DR+VQ + +E ++E F S+GFRP +S H A+ V+ + Sbjct: 366 KPKVGNGIPLDIPEFQDRLVQEVVRTILEVLYEPQFLESSHGFRPNKSQHTAMVDVRQKF 425 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI-DVGLF 205 W I+GD+S FDT+ + L+ +R+++ D +F L++K +K+ + G+ Sbjct: 426 KGV-----VWCIKGDISKSFDTIDKKKLITQMRKKVKDEKFCHLIYKGLKSRLLMPEGIM 480 Query: 206 RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY-----WNNSIQRGRST 260 +G+ Q G+ SPLL NI L++ D ++ ER + D + + + + R Sbjct: 481 EVLKKGISQKGICSPLLCNIALHQLDLFI-ERLKKIVNKVDSSHIVSQPYQSQMVPQRER 539 Query: 261 AVRENWQWKPAV----------------------AYCRYADDFVLIVKGTKAQVEAIREE 298 A W+ A+ Y RYADDF++ V G K E I E Sbjct: 540 AAIGTGDWRGAIKAIKMARKMGYGDHQDPNLRRLTYVRYADDFLIGVTGPKKLAERIGEL 599 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRL 332 L+ L L LN +K I ++ I FLG ++ Sbjct: 600 VSRFLKIRLNLTLNQEKIVISKLSGKKIPFLGFQI 634 >UniRef50_C8PKY6 Putative CRISPR-associated protein Cas1 n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PKY6_9PROT Length = 731 Score = 117 bits (293), Expect = 7e-25, Method: Compositional matrix adjust. 
Identities = 80/264 (30%), Positives = 128/264 (48%), Gaps = 41/264 (15%) Query: 67 LRDELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 L+ E+ S Y P P +R +IPK + +LR L +P+L+D+ VQ + + ++ F Sbjct: 46 LKSEIFSLSYSPQPLKRAFIPKEAKDELRKLAVPSLKDKFVQNILTRELSGYFDKSFSNR 105 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 SY +R +S +AI + D + + ++ D+ +F+ + H L++ +R I DA Sbjct: 106 SYAYRNGKSYANAIYRAR----DFFQIFS-FAVKTDIKDFFENIDHEKLLEILRANIRDA 160 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 R + L+ IK G + +RA ++GV QG V+SPLLSNI LN+ D++L Sbjct: 161 RIIRLIELWIKNGIFERFDYRAHTKGVHQGDVLSPLLSNIYLNQMDKFL----------- 209 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 EN V + RYADDFV+ +A E + L+ Sbjct: 210 ------------------EN----SGVEFVRYADDFVMFFASYEA-AEMRLARLKDFLK- 245 Query: 306 SLKLRLNMDKTKIPHVNDGFIFLG 329 ++ L LN KT I + F+FLG Sbjct: 246 TISLSLNEAKTSIHGKDSEFVFLG 269 >UniRef50_D2CMF7 Reverse transcriptase n=1 Tax=Agaricus bisporus RepID=D2CMF7_AGABI Length = 402 Score = 117 bits (293), Expect = 7e-25, Method: Compositional matrix adjust. 
Identities = 80/247 (32%), Positives = 128/247 (51%), Gaps = 38/247 (15%) Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 AR++ PK N L I + R++IVQ+A+ + M I++ F LSY FRP RS + ++ Sbjct: 2 ARQILNPKPNKPGEALLIASPREQIVQKALQVVMNAIFDPYFSKLSYLFRPGRSSINGLK 61 Query: 141 TVKLQLTDCGETRG---RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 + TRG W I GD+S FD V H ++ +V+ RIS AR + L+ + +KA Sbjct: 62 RI--------HTRGGPMSWGINGDISKCFDRVPHDTIISSVKERISCARTLALIERGLKA 113 Query: 198 GHIDV-GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE-------------------- 236 G++DV G G PQG ++SPLLSNI+LN+ D+Y+ Sbjct: 114 GYVDVKGQIIKTKIGTPQGSILSPLLSNIVLNKLDKYIESLDSELNVEKKGKTFRVPANV 173 Query: 237 ---RYLSGKARKDRWYWNNSIQRGR--STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 S + RK N +++ R S + N + + A+ Y RYADDFV+++ ++ Sbjct: 174 RVVALRSYQKRKQLNLANKYLKKMRLISKFDKHNEELRRAI-YIRYADDFVILLASSRKF 232 Query: 292 VEAIREE 298 +++E+ Sbjct: 233 AISLKEK 239 >UniRef50_Q9G8T4 Orf621 n=1 Tax=Rhodomonas salina RepID=Q9G8T4_RHDSA Length = 621 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 103/371 (27%), Positives = 165/371 (44%), Gaps = 70/371 (18%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPG-----------VDGVNKTMLQARLAVELQILR 68 L++ + L A +KGA T G +DG+N +++ L Sbjct: 37 LMKFLYDEGMLWNAVEKLKKNKGAATFGPTNNKDRKSIEIDGLN--------LGQIKRLS 88 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH-TLSY 127 +EL ++ P RR+ +PK N K RPLGI + DRIVQ + + I+E F ++ Sbjct: 89 EELREETFKWSPTRRIEVPKKNNKRRPLGIFSFEDRIVQEGIRTILNAIYEPTFSGNNNH 148 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFRP S A+ +K G+T IEGD+ ++ ++H +LMK ++++I+D +F Sbjct: 149 GFRPRLSSETALELLKRNRK--GKTHA---IEGDIKKAYEGINHNILMKILKKKINDKKF 203 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ----YLHERYL---- 239 + ++ + +K G + VPQG + SP+L NI +NEFD+ + E + Sbjct: 204 LRIIEEGLKCGIEKNRKIYNSITVVPQGSICSPILFNIYMNEFDEAIKTIIEEIFTRLNE 263 Query: 240 SGKARKDRWYWNNS--IQRGRS----TAVRE-------------------NWQWKPAVA- 273 S +RK N I + RS +RE W+ K A Sbjct: 264 SRDSRKSSSTMNKEYKILKSRSDEMNKKIRERLKQESPFIKTLLKSHKKIRWKLKRKKAI 323 Query: 274 ----------YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-N 322 Y RYADD+++I G E I+ ++ LKL L KT+I ++ Sbjct: 324 DYGKRTLEYFYLRYADDWIIITNGNVRVCEEIKIRISTWIKEELKLELEQSKTRITNMEK 383 Query: 323 DGFIFLGHRLI 333 D FLG ++ Sbjct: 384 DPIKFLGFSIM 394 >UniRef50_D2R8Z2 CRISPR-associated protein Cas1 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R8Z2_9PLAN Length = 942 Score = 114 bits (286), Expect = 5e-24, Method: Compositional matrix adjust. 
Identities = 85/269 (31%), Positives = 129/269 (47%), Gaps = 49/269 (18%) Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 E+Q +L+ G YQP P R+ IPKSNG R L IP+ DR++QR++L + P E F Sbjct: 341 EVQSAARDLVKGTYQPQPCFRLDIPKSNGDRRQLAIPSRLDRVLQRSILDVIAPALELFF 400 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+ +R H A R + TD RW + D +FDT+ H+LL + + + Sbjct: 401 EESSFAYRRGLGRHTAARHLSQAFTDG----YRWALHADFFDFFDTIDHKLLRRRLAAYL 456 Query: 183 SDARFMTLL--WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240 +D + ++ W A H D G+P G +SP+L+N+ L++FD+ +H Sbjct: 457 ADPSLVEVIMRWVETGAPHPD--------HGIPTGAPLSPILANLFLDQFDEAMH----- 503 Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 S+ R RYADDFV++ + +++ +A+ E R Sbjct: 504 ------------SVGR----------------RLVRYADDFVVLFR-DQSEAQAVISEVR 534 Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 E SL+L LN DKT H+ F FLG Sbjct: 535 QAAE-SLRLELNRDKTHTLHLATSFDFLG 562 >UniRef50_C0JWS6 Putative reverse transcriptase and intron maturase n=1 Tax=Pycnococcus provasolii RepID=C0JWS6_9CHLO Length = 583 Score = 114 bits (284), Expect = 9e-24, Method: Compositional matrix adjust. 
Identities = 104/339 (30%), Positives = 152/339 (44%), Gaps = 53/339 (15%) Query: 35 RITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKL- 93 R+ +++G + GVD K +L + +L + ++ L G +P+ RR K+ GK+ Sbjct: 57 RVAQTNRGKRSAGVD--RKRVLTSE--QKLNLAQNLKLKGKAKPI--RRTCSTKA-GKVD 109 Query: 94 -RPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGET 152 RPLGIP L DR VQ+ + A+EP WE+ F SYGFRP R+ AI + L Sbjct: 110 KRPLGIPTLEDRAVQQLVKFALEPEWEAKFEPNSYGFRPGRACQDAIMALFQHL------ 163 Query: 153 RGR--WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASE 210 RGR +V++ D+ FD + H L+ + LL IK + G+ S Sbjct: 164 RGRSLYVLDADIKKCFDRIDHDKLLAKLNT-------FPLLENQIKVW-LKAGVIEGYSN 215 Query: 211 -------------GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G PQGGVISPLL+NI L + L + Y+ N + +G Sbjct: 216 SYKNYNKVTPNLLGTPQGGVISPLLANIALTGLEDEL------------KHYYANHLYKG 263 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 S + VA RY DDFV++ K + +R+ L ++ L L +KTK Sbjct: 264 SSRIGLSDKL--TQVAVIRYVDDFVVLHKDENV-IRQLRDHTAKWLYTTMGLELLPEKTK 320 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 I GF FLG +I S + + I E N Sbjct: 321 ILDTKQGFTFLGFHIISIYSGENKYKCKIHISHESKNNL 359 >UniRef50_B4UZZ3 RNA-directed DNA polymerase n=1 Tax=Streptomyces sp. Mg1 RepID=B4UZZ3_9ACTO Length = 317 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. 
Identities = 74/199 (37%), Positives = 106/199 (53%), Gaps = 15/199 (7%) Query: 156 WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQ 214 W++EGD+ + FD + H LM+ R R+ D R + L+ +KAG + + GL R G PQ Sbjct: 14 WIVEGDIKACFDEISHTALMERARARVGDKRVLALVKAFLKAGILSEDGLLRDNDTGTPQ 73 Query: 215 GGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAY 274 G ++SPLLSN+ L+ D+++ + G A D N +R R + P Sbjct: 74 GSILSPLLSNVALSVLDEHIAQ--APGGAGTDL----NERRRRRRRGL-------PNYRL 120 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR 334 RYADD+ L+V GTKA E +REE VL ++ LRL+ +KT I H+ G FLG + R Sbjct: 121 VRYADDWCLMVHGTKADAETLREEIAEVLT-TMGLRLSPEKTLITHIEQGLDFLGWHIQR 179 Query: 335 KRSRYGEMRVVSTIPQEKA 353 R V T P +KA Sbjct: 180 HRKPGTNRYYVYTYPAKKA 198 >UniRef50_D2LF37 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LF37_RHOVA Length = 293 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 89/313 (28%), Positives = 145/313 (46%), Gaps = 54/313 (17%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 L + P L +A +KG PG DGV + VEL+ LR E L+G Y+P Sbjct: 22 LEKVVAPACLQQAWTRVRKNKGG--PGGDGVTIEIFAQNAEVELEKLRAETLAGIYRPRK 79 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 R +PK G R L IP++ DRI+Q A ++++ + F + S+ +R R V A+ Sbjct: 80 VRHAIVPKPKGGERKLTIPSVVDRILQTATMLSLGQTVDHHFSSASWAYREGRGVDDALA 139 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 ++ +L + G W + D+ YFD + H+ L+ + + D R + L+ +++ Sbjct: 140 DLR-RLRNSGLF---WTFDADIMQYFDRILHKRLIDDLFIWVDDLRIVRLIQLWLRS--- 192 Query: 201 DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRST 260 F G+ QG ISPLL+N+ L+ D+ L L G Sbjct: 193 ----FSYWGRGIAQGAPISPLLANLFLHPMDRLLE---LEG------------------- 226 Query: 261 AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTK 317 +A RYADDFV++ + +A+ ++ + ++ L L+LNM KT+ Sbjct: 227 
-----------LASVRYADDFVVLCRS-----KALAQKAQLIVASHLAARGLKLNMSKTR 270 Query: 318 IPHVNDGFIFLGH 330 I ++ FIFLG Sbjct: 271 ILAPSEAFIFLGQ 283 >UniRef50_A1BI39 CRISPR-associated protein Cas1 n=5 Tax=Chlorobiaceae RepID=A1BI39_CHLPD Length = 731 Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 89/317 (28%), Positives = 147/317 (46%), Gaps = 44/317 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L + PE + +A S+ G PG D + RL L+ L LL+G Y+ Sbjct: 4 LYNQMAMPETIFQAWYKVASNDG--RPGWDNTSIQDYSLRLEENLKSLSHALLTGTYRQS 61 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P ++ + K +GK R L IP + DR+ Q A + + PI E++ ++ +RP S A Sbjct: 62 PLLKLVMLKPDGKERVLLIPGVIDRVAQTAASIVLSPIIEAELGNCTFAYRPGISREGAA 121 Query: 140 RTVKLQLTDCGETRG-RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 R + D G +WV++ D+ ++FD V H LL + + + D ++LL + + A Sbjct: 122 REI-----DRLHREGYQWVLDADIRNFFDNVRHDLLFQRLVELVDDKEMISLLHRWLTAE 176 Query: 199 HIDVGLFRA-ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 +D R + G+PQG ISP L+N+ L+ FD+ + ++ Sbjct: 177 IVDGLNPRTRNTMGLPQGCPISPALANLYLDRFDETMEQQ-------------------- 216 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 R+ADD++++ K T+ + EA + L LKL L+ DKT+ Sbjct: 217 -------------GFKLVRFADDYLVLCK-TRPKAEAALKLSESAL-AELKLELHSDKTR 261 Query: 318 IPHVNDGFIFLGHRLIR 334 I +GF +LG+ IR Sbjct: 262 ITTFAEGFKYLGYLFIR 278 >UniRef50_A1KEP0 Possible maturase n=10 Tax=Mycobacterium tuberculosis complex RepID=A1KEP0_MYCBP Length = 250 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 65/155 (41%), Positives = 92/155 (59%), Gaps = 9/155 (5%) Query: 47 GVDGVNKTMLQARL-AVE-LQILRDELLSGHYQPLPARRVYIPKSNG--KLRPLGIPALR 102 G DG+ +++ + A+E L LR EL SG ++P P R IPK G K+R LGIP + Sbjct: 45 GSDGLTVARIESEIGALEFLNELRTELKSGQFRPQPVRERKIPKPGGLGKVRRLGIPTVA 104 Query: 103 DRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDL 162 DR+VQ A+ + +EPI+E+DF +SYGFRP R H I + L G RWV++ D+ Sbjct: 105 DRVVQAALKLVLEPIFETDFEPVSYGFRPARRAHDTIAEIHL----FGTQEYRWVLDADI 160 Query: 163 SSYFDTVHHRLLMKAVRRRISDARFMTLL-WKTIK 196 + FD + H LM VR RI D R + L+ W+ I+ Sbjct: 161 KACFDRIDHADLMDRVRHRIKDKRVLRLVNWQRIR 195 >UniRef50_B3CUR8 Reverse transcriptase n=24 Tax=Orientia tsutsugamushi RepID=B3CUR8_ORITI Length = 379 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 66/191 (34%), Positives = 103/191 (53%), Gaps = 5/191 (2%) Query: 38 LSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLG 97 L SK A G+DG+ K +L L L + YQ PAR V IPK +G RPL Sbjct: 49 LDSKKA--IGIDGITKEDYGKKLKANLLSLLTRIRKWQYQAKPARIVKIPKEDGGKRPLV 106 Query: 98 IPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWV 157 I D+I++ A+ + ++E F SYGF P+ + H A+R +L +G + Sbjct: 107 ISCFEDKIIESAVSKILNSVFEPIFLKYSYGFGPKLNAHDALR--ELNRLTYNFNKGA-I 163 Query: 158 IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGV 217 +E D++ F+T+ H LM+ +R+RISD +F+ L+ K I+ I+ EG QG + Sbjct: 164 VEIDITKCFNTIKHCELMEFLRKRISDKKFLRLVMKLIETPIIENDTIVTNKEGCRQGSI 223 Query: 218 ISPLLSNIMLN 228 +SP+L+N+ L+ Sbjct: 224 VSPILANVFLH 234 >UniRef50_Q8HQ84 ORF786 n=1 Tax=Schizosaccharomyces octosporus RepID=Q8HQ84_SCHOT Length = 786 Score = 110 bits (275), Expect = 8e-23, Method: Compositional matrix adjust. 
Identities = 79/303 (26%), Positives = 149/303 (49%), Gaps = 25/303 (8%) Query: 48 VDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQ 107 + +NK + ++ L++ +L+G ++ + + K+ + L ++ D++VQ Sbjct: 203 IKNLNKLNINPLTKTKIMKLKESVLNGTFEWTNTKHQLLHKTPN--QNLNQTSINDKLVQ 260 Query: 108 RAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFD 167 + +EPI+E +F +S+GFRP R+ H A++ + ++ D W IEG++ + + Sbjct: 261 EVLKNILEPIFELNFLEVSHGFRPNRNCHIALKYLNTKMKD-----SIWFIEGNIEN--N 313 Query: 168 TVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIM 226 T++ LL++ + +R+ D ++LL +K+ + L G+ G + LL+NI Sbjct: 314 TLNTTLLIELISKRVKDKLILSLLRSALKSNVLMSKELLFIPEVGIEHGSTLKTLLTNIY 373 Query: 227 LNEFDQYLH---ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK-PA----------V 272 +E D YL + Y +R +++ R E ++ + P+ V Sbjct: 374 YHELDNYLQNLSQNYEGSIKASNRRKNPANLRLLREGKKSEAYKLRLPSRDPFEKEYRNV 433 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RY D F++ V G++ IRE+ G L L + LN D TKI H+++G FLG+ Sbjct: 434 KYIRYGDKFLIGVLGSRKLTLEIREKVSGFLNDKLNITLNPD-TKINHISNGISFLGYIF 492 Query: 333 IRK 335 RK Sbjct: 493 SRK 495 >UniRef50_Q9MR93 Reverse transcriptase homologue ND5 i4 grp II protein (Fragment) n=2 Tax=Podospora anserina RepID=Q9MR93_PODAN Length = 779 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. 
Identities = 77/265 (29%), Positives = 123/265 (46%), Gaps = 65/265 (24%) Query: 94 RPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR 153 RPL R+++VQ M M +E I+E F ++GFRP R H A++ +K Q + G + Sbjct: 296 RPL--TTFRNKLVQEVMRMILEVIFEPTFSEHNHGFRPGRGCHSALKEIKAQAINFGVST 353 Query: 154 GRWVIEGDLSSYF----DTVHH------------------------------RLLMKAVR 179 W EGD+S Y D V H R+LM + Sbjct: 354 --WYFEGDISKYLVTCADEVMHASSSFVTSRRGGDGGGKAAAAAALVSSQIERVLMNIIE 411 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE--- 236 +I D RF L+WK +KAG+ + +F+ ++ G QG ++SP+LSNI LNE D+++ + Sbjct: 412 NKIKDRRFTDLIWKALKAGYFEFKIFKDSNTGTTQGDILSPILSNIYLNELDRFISKLKL 471 Query: 237 RYLSGKARKDRWYWNNSIQ-RGRSTAVRENWQ-------------WKPA---VAYCRYAD 279 Y G K Y+N + ++ V+ + P+ + Y RYA+ Sbjct: 472 EYDKGIKPKVNPYYNKLCNMKNKTLDVQTRIRIHKLHLKTPYYKTLDPSFKKLVYVRYAN 531 Query: 280 DFVLIVKGTKAQVEAIREECRGVLE 304 +V+ V+G+K E+C +LE Sbjct: 532 VWVIGVRGSK-------EDCNILLE 549 >UniRef50_C6PFC6 RNA-directed DNA polymerase (Reverse transcriptase) (Fragment) n=6 Tax=Thermoanaerobacterales RepID=C6PFC6_CLOTS Length = 209 Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust. 
Identities = 64/179 (35%), Positives = 98/179 (54%), Gaps = 12/179 (6%) Query: 9 AATDPSLRIQR----LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 A+ D +QR LL +I E + EA + +++KG+H GVDG+ L L Sbjct: 37 ASKDRRNNVQRYTSNLLEMILDRENMKEAYKRVVANKGSH--GVDGMEVDELLPYLKENW 94 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 ++ +LL G Y+P P RV IPK +G R LGIP + DR++Q+A+ + I++ F Sbjct: 95 LTIKQQLLEGKYKPQPVLRVEIPKPDGGTRLLGIPTVLDRLIQQAIAQILSGIYDHTFSE 154 Query: 125 LSYGFRPERSVHHAIRTVKLQLTD-CGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 SYGFRP RS A+ + + + C WV++ DL +FD V+H +LM + +RI Sbjct: 155 NSYGFRPRRSAKDAVIAAETYINEGCT-----WVVDIDLEKFFDRVNHDILMSKLEKRI 208 >UniRef50_Q7YAJ6 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ6_CHAVU Length = 550 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 76/263 (28%), Positives = 132/263 (50%), Gaps = 42/263 (15%) Query: 91 GKLRP-LGIPALRDRIVQRAMLMAMEPIWE-SDF----------HTLSYGFRPERSVHHA 138 GK P + + R++IVQ+A+ + ++ I++ ++F H +R++H Sbjct: 190 GKFEPRFALISPREKIVQKALTVVLDSIYDPAEFRIYDPALDCSHAPKEARGAKRAIHKV 249 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 RT K WVIEGD++ ++ H++++ + I+ +FM+L+ K++ G Sbjct: 250 DRTFK---------SATWVIEGDITKCSASLPHKVILGILEEEIACRKFMSLVRKSLSVG 300 Query: 199 HIDVGLFRAASEGVPQGGVI--SPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 ++D R PQ +PLL NI L++ D+Y++E + K + D+ + + +R Sbjct: 301 YVDEKGKRHHPNRPPQALTFLRAPLLCNITLHQLDKYIYE---TLKEQYDKHEVDPNFRR 357 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 ++Y RYADDFV+ + G K IR+ L +L L LN +KT Sbjct: 358 ---------------LSYVRYADDFVIGITGPKTDAIEIRDLISTFL-STLGLELNKEKT 401 Query: 317 KIPHVNDGFIFLGHRLIRKRSRY 339 KI H++ GF FLG ++ R R RY Sbjct: 402 KISHIDSGFFFLGTQISRGRRRY 424 >UniRef50_Q0QIN8 Putative reverse transcriptase n=1 Tax=Oltmannsiellopsis viridis RepID=Q0QIN8_OLTVI Length = 606 Score = 107 bits (267), Expect = 8e-22, Method: Compositional matrix adjust. 
Identities = 81/309 (26%), Positives = 134/309 (43%), Gaps = 78/309 (25%) Query: 94 RPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR 153 + L I +LRD++VQ+A+ + + PI+ES F S+GFRP R H AI+ ++ Sbjct: 155 KTLTIGSLRDKVVQKAIELVLSPIYESIFLENSHGFRPARGCHTAIKDIRKWFHKIS--- 211 Query: 154 GRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVP 213 WVIE D+ + F +V+H +L+ +R ++ + + L+ +++G+I +F + GVP Sbjct: 212 --WVIESDIGNCFPSVYHTVLLSIIREKVKCDKTVALIRNLLESGYISPKVFCESKMGVP 269 Query: 214 QGGVISPLLSNIMLNEFDQYLHE-----RYLSGKARKDR---------WYWNNSIQRGRS 259 QG +S LL NI L++ D ++ Y GK+ ++ + N S+Q R+ Sbjct: 270 QGSRLSTLLCNIYLHKLDVFMANLRQEFAYTGGKSLQNTRATASCHRLTFSNFSLQNTRA 329 Query: 260 TAV-----------RENWQWKP------------------------------------AV 272 TA R N ++K Sbjct: 330 TASCHRLTFSNFSRRTNPEYKDLQRRIQKTFDPVERSKLIQKLRKTSSKDPFDPDYLQNE 389 Query: 273 AYC---------RYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-- 321 +C RYAD FV+ + G++ + I E L LK+ L KT+I H Sbjct: 390 TFCSFARRLFYTRYADGFVIGITGSRKEATEILERVNIFLSEELKMNLKESKTRIVHFKK 449 Query: 322 -NDGFIFLG 329 +FLG Sbjct: 450 KKQSILFLG 458 >UniRef50_C1DF40 Group II intron-encoding maturase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DF40_AZOVD Length = 225 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. 
Identities = 66/185 (35%), Positives = 100/185 (54%), Gaps = 10/185 (5%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 W +P ++R+L P L A + + ++GA PG DG+ L + I Sbjct: 46 AWTNAEPDTLMERVL----APANLKRAYQQVVRNRGA--PGADGMTVADLAGYVKQYWPI 99 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L+ LL+G Y P R V IPK G R LGIP++ DR++Q+A+L + PI++ F S Sbjct: 100 LKARLLAGEYHPQAVRAVEIPKPQGGTRQLGIPSVVDRLIQQALLQQLVPIFDPLFSDYS 159 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 YGFRP RS H A+ + + G+ RW +E D+ +FD V+H +LM V RR+ D + Sbjct: 160 YGFRPGRSAHQAVEMARAHVA-AGQ---RWCVELDVEKFFDRVNHDVLMACVERRVEDKQ 215 Query: 187 FMTLL 191 + L+ Sbjct: 216 VLRLI 220 >UniRef50_B8R160 Reverse transcriptase n=2 Tax=Volvox carteri RepID=B8R160_VOLCA Length = 607 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 105/366 (28%), Positives = 164/366 (44%), Gaps = 60/366 (16%) Query: 36 ITLSSKGAHTPGVDGVNKTMLQARL--AVELQILRDELLSGHYQPLPARRVYIPK-SNGK 92 +T +KG +T G+DG T + +L A +LQI L RR +IPK + Sbjct: 78 VTTLNKGKNTAGIDGYKATTSEEKLLLAKKLQINGKANL--------VRRTWIPKPGKTE 129 Query: 93 LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGET 152 +PLGI ++DR +Q +A+EP WE+ F SYGFRP R AI + + Sbjct: 130 KQPLGIYTIQDRALQALCKLALEPEWEAKFEPNSYGFRPGRRAQDAIEAI---FQNLHHD 186 Query: 153 RGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASE-- 210 ++V + D+ FDT+ H L+ S + L+ K I A + G+F + Sbjct: 187 ADKYVFDADIRKCFDTIDHAALL-------SKLKTFPLMEKQISAW-LKAGIFDQYANTP 238 Query: 211 -------GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 G PQGG+ISPLL+NI L+ +++L L+ +RK+ + A R Sbjct: 239 KVSTPEMGTPQGGIISPLLANIALHGLEEHL----LNMVSRKE-------FPKPHPKAAR 287 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND 323 + A+ RYADDFV+I + I E + + + L ++ +K+ + + Sbjct: 288 GAKAKRAALGIIRYADDFVIIHRNLDIMKTVITETKTWLAQ--MGLAISEEKSALRLASK 345 Query: 324 GFIFLGHRLIRKRSRYGEMRVVSTIPQ--------EKARNF-----AASLTALLWKVRIS 370 F FLG 
++ R + V P K RN A+S L+ K+R Sbjct: 346 SFKFLGFQVAYVRDKIQNKYRVRITPSRENVKLIISKTRNIIQKNKASSAYELIGKLR-- 403 Query: 371 GEILLG 376 +LLG Sbjct: 404 -PVLLG 408 >UniRef50_D2MKC4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKC4_9BACT Length = 345 Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 76/222 (34%), Positives = 116/222 (52%), Gaps = 36/222 (16%) Query: 101 LRDRIVQRAMLMA-MEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 + D+IVQ AM+ +EPI+E +F SYGFRP RS H + + + + +V++ Sbjct: 1 MEDKIVQCAMVKCILEPIYEMEFCGFSYGFRPGRSAHDGLDALAYVIE---RRKVNYVVD 57 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 D+ +FD V LM+ +R RI D R + ++ K +KAG ++ G +R + +G PQG VIS Sbjct: 58 ADIRKFFDEVDQEWLMRFLRHRIGDERVLRIIVKFLKAGVMEDGSWRESEQGTPQGAVIS 117 Query: 220 PLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR---GRSTAVRENWQWKPAVAYCR 276 P+L+N+ YLH Y+ D W+ + R G S V R Sbjct: 118 PILANL-------YLH--YVL-----DLWFKSQRKSRNIGGESYMV-------------R 150 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 YADDFV+ + +A E ++ R LE LRL+ DKT++ Sbjct: 151 YADDFVVCFQHKEA-AERFLKDLRKRLE-KFGLRLHPDKTRL 190 >UniRef50_B9VL91 Putative maturase/reverse transcriptase n=1 Tax=Caulerpa filiformis RepID=B9VL91_9CHLO Length = 180 Score = 105 bits (261), Expect = 4e-21, Method: Compositional matrix adjust. 
Identities = 57/154 (37%), Positives = 93/154 (60%), Gaps = 11/154 (7%) Query: 86 IPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 IPK K +RPL I + +IVQ + +E I+E F +S+GFRP +S H A+ ++ Sbjct: 2 IPKPGKKVMRPLDI---QGKIVQEIVRRILEAIFEPVFLDVSHGFRPGKSCHSALSQIEK 58 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVG- 203 + W+IEGD++ +D + H L+ +R+RI D RF++L+WK ++AG++ G Sbjct: 59 RFQGM-----YWIIEGDITGCYDNIPHSTLVNILRKRIRDERFISLIWKFLRAGYLYSGK 113 Query: 204 -LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + G P+G +ISPLL+NI LNE D+++ + Sbjct: 114 IITHPTLIGTPRGSIISPLLANIYLNELDRWVSD 147 >UniRef50_P38456 Uncharacterized mitochondrial protein ymf11 n=1 Tax=Marchantia polymorpha RepID=YMF11_MARPO Length = 732 Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 76/241 (31%), Positives = 116/241 (48%), Gaps = 39/241 (16%) Query: 102 RDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGD 161 +D +VQ + +E ++E F + S+GFRP RS H ++ ++ W IEG+ Sbjct: 243 KDILVQEVIRSILETLYEPYFLSCSHGFRPGRSQHTCLKQIRRDFVGT-----VWFIEGE 297 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPL 221 S YF+ + ++L+ +RRRI D RF+ L+ K IK RA +EG G PL Sbjct: 298 TSQYFNKIDKQVLIGLMRRRIRDNRFLNLVQKEIKTS------LRAGAEGTGVG----PL 347 Query: 222 LSNIMLNEFDQYLH---------ERYLSGKARKDRWYWNNSIQRGRSTAVRE-------- 264 L NI+L+E D ++ R K+ W + ++ R+TA R Sbjct: 348 LCNIVLHELDLFVMRLKRIVDRGRRRAVNPESKELWRQSAALI-DRTTAHRARVPFPSGA 406 Query: 265 ------NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + Q + Y R+ADDF++ V G +A E IR +E LKLRL +DKT+ Sbjct: 407 FGRGLGHPQETRQINYVRFADDFLIGVIGPRALAERIRGLVTRFIEVRLKLRLTLDKTRK 466 Query: 319 P 319 P Sbjct: 467 P 467 >UniRef50_Q7M1J5 Reverse transcription like protein 2, intron-encoded (Fragment) n=1 Tax=Pylaiella littoralis RepID=Q7M1J5_PYLLI Length = 308 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 80/250 (32%), Positives = 115/250 (46%), Gaps = 62/250 (24%) Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P +RVYIPKS GKLRPLGIP + DR +Q +A++PI E SYGFR RS Sbjct: 4 PVKRVYIPKSGGKLRPLGIPNMYDRGLQYLWKLALDPIAECRADRHSYGFRKGRSTQDVH 63 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + L L+ ++R WV+E D+ +FD ++H +++ + +L + +KAG Sbjct: 64 TILHLLLSP--KSRCDWVLEADIRGFFDNINHDWIIQNIPMD------KNILREWLKAGA 115 Query: 200 IDVGL--FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 ++ F GVPQGG ISPL++N+ L+ + + Sbjct: 116 LETTTQEFHKGIAGVPQGGPISPLIANMTLDGLEVW------------------------ 151 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 VA RYADDFV + TK +E RG++ LN +KT Sbjct: 152 --------------VAVVRYADDFV-VTAATKRILE------RGLV-------LNQEKTC 183 Query: 318 IPHVNDGFIF 327 I GF F Sbjct: 184 ITFDFVGFNF 193 >UniRef50_UPI0001C15D3C hypothetical protein CRC_00192 n=2 Tax=Nostocaceae RepID=UPI0001C15D3C Length = 566 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 80/264 (30%), Positives = 126/264 (47%), Gaps = 34/264 (12%) Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 + +P R+ IPKSNG+ R LGI + DR Q MA+EP WE+ F +YGFRP RS H Sbjct: 88 KAIPLTRMEIPKSNGESRNLGISKMEDRAKQALAKMALEPEWEAKFEPNNYGFRPGRSCH 147 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT-I 195 AI ++ Q+ R +V+ D+S FD V H +A+ + + M + + Sbjct: 148 DAISAIESQV----RRRTSYVLSVDISGCFDKVKH----EAIVEKCNTFPIMERQIRAWL 199 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 K+G + +F +G P GVISPLL+NI L+ + ++ ++ S S Sbjct: 200 KSGVMIGEVFHPLEKGEPVEGVISPLLANIALHGLETHISHKFPSVSP---------SEA 250 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 +G+ + E R A DF L++ + ++A++ E L G + L LN K Sbjct: 251 QGKVGEIGE-------ARLIRCAHDF-LVLHWEEKTIKAVKTEVETWL-GEIGLNLNQQK 301 Query: 316 TKIPHVND-------GFIFLGHRL 332 ++ H + G FLG + Sbjct: 302 IRMCHTMEEYNGEKPGLDFLGFNI 325 >UniRef50_A8KXN1 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Frankia sp. EAN1pec RepID=A8KXN1_FRASN Length = 417 Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust. 
Identities = 90/301 (29%), Positives = 130/301 (43%), Gaps = 43/301 (14%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 P+ L A + + G PG DGV +A + L +L + + SG Y P P V I Sbjct: 14 PKQLVWDAWLKVKENGG-APGPDGVTVEQFEANVKDRLYVLWNRMSSGSYFPGPVGAVEI 72 Query: 87 PKSN--GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 PK G R LGIP + DR+ Q + +A+EP E FH SYG+RP RS A+ + Sbjct: 73 PKKGVKGGARTLGIPNVVDRVAQTVLKLALEPKVEPVFHRDSYGYRPGRSQRQALEVCRK 132 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA--GHIDV 202 + WV++ D+ +FDTV L+KAV + + + +KA H D Sbjct: 133 RCWSHD-----WVVDLDVRKFFDTVPWEKLLKAVAYHTDQKWVLMYVERCLKAPTKHAD- 186 Query: 203 GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAV 262 G + + G QGG SPL +NI YLH + AR+ Sbjct: 187 GTLQERTMGTVQGGPFSPLAANI-------YLHWGLDAWMARE----------------- 222 Query: 263 RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN 322 P V + R+ADD V + Q +R+ L + L + DKT+I + Sbjct: 223 ------FPTVPFERWADDVVFHCVSLE-QAREVRDAVVARLV-EVGLEAHPDKTRIVYCK 274 Query: 323 D 323 D Sbjct: 275 D 275 >UniRef50_A7NMI6 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Chloroflexaceae RepID=A7NMI6_ROSCS Length = 447 Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust. 
Identities = 54/154 (35%), Positives = 90/154 (58%), Gaps = 4/154 (2%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G D V +A ++ L DEL G Y+PLPA+R+ IPK++G R + I ++RDR+ Sbjct: 43 GPDAVTLRDFEADWTRQMAQLADELQQGTYRPLPAKRIAIPKASGGERAIAILSVRDRVA 102 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 QRA+ ++P+++ F SYG RP V A+ V+ + D G WV++ D++ YF Sbjct: 103 QRAVQQVLDPLFDPCFLDCSYGCRPHVGVPEAVARVQ-RYADQGLG---WVVDADIAGYF 158 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 D + R+L+ VR+RI + + L+ + ++AG + Sbjct: 159 DAIDQRVLLGLVRQRIDELPVLKLIAQWLEAGML 192 >UniRef50_C0FSR2 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSR2_9FIRM Length = 273 Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 73/299 (24%), Positives = 124/299 (41%), Gaps = 40/299 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 +L + E L EA +H G+DGV + L+A + +++ + +G Y+ Sbjct: 2 VLEDVFSDENLEEAFESFADKHDSH--GLDGVKLSELRAYWETNGKKIKESIFNGTYKVG 59 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 + I GK R + + DR + RA+ M WE F SY ++ + V A+ Sbjct: 60 AVEQRQIVNRKGKKRTISLMNSIDRFIFRALYQKMASEWEKQFSQYSYAYQNNKGVLTAV 119 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 Q E W +E D+ ++FD ++H +++ ++ I D R + LL + Sbjct: 120 E----QAAKYMEEGKDWSVELDIQNFFDNINHSIIISKLKAGIEDVRVLDLLIAYLTCTL 175 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 +D +F +GV QGG +SPLL+N+ +NE D Y+ Sbjct: 176 LDDHVFHQMEQGVLQGGPLSPLLANVYMNELDHYME------------------------ 211 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 K ++CR+ DD + I T + + +E +L LN KT I Sbjct: 212 ---------KQGYSFCRFGDD-INIYCSTYEEATVAFSDVTARMEKIEQLPLNHGKTGI 260 >UniRef50_Q4C6J4 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C6J4_CROWT Length = 489 Score = 100 bits (250), Expect = 6e-20, 
Method: Compositional matrix adjust. Identities = 80/250 (32%), Positives = 129/250 (51%), Gaps = 46/250 (18%) Query: 98 IPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWV 157 I A++DR +Q + +A+EP WE+ F SYGFRP RSVH AI + LQ+ + ++V Sbjct: 87 ICAIKDRAMQALVKLALEPYWEAQFEETSYGFRPGRSVHDAIGRI-LQVIG---NKPKYV 142 Query: 158 IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGV 217 ++ D++ +FD ++H L+ V + ++ + +K+G +D G+F Q V Sbjct: 143 LDADIAEWFDKINHDYLLSKVD---CPSNIKKIIKQWLKSGVMDNGVFAETESTTSQKDV 199 Query: 218 ISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR-GRSTAVRENWQWKPAVAYCR 276 ISPLL+NI L+ G ++ R + NSI R G++ N+Q P + R Sbjct: 200 ISPLLANIALH------------GMMKEVRDNFPNSITREGKAI---NNFQ--PTII--R 240 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLK---LRLNMDKTKIPH------VND---- 323 YA +FV+ + E I+ +C+ ++ LK L + +KTKI H +ND Sbjct: 241 YAHNFVIFHR----DYEVIK-QCKILIRKCLKKLGLEIETEKTKICHSLNEIEINDQKVE 295 Query: 324 -GFIFLGHRL 332 GF FLG + Sbjct: 296 PGFDFLGFNI 305 >UniRef50_C0A8Z3 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A8Z3_9BACT Length = 358 Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust. 
Identities = 82/286 (28%), Positives = 128/286 (44%), Gaps = 58/286 (20%) Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 A L + LRDELL+G +QP R Y ++ K R + RDR+V A++ +EPI Sbjct: 40 AELEKTVVTLRDELLAGTWQP--GRYYYFTITDPKEREVAAAPFRDRVVHHALVRVLEPI 97 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E F S+ RP + H A+ + + T R R+ ++ D+ YF + H LL++ Sbjct: 98 FEPRFIADSFACRPGKGTHAALARAR-EFT----RRHRYCLKCDIKKYFPNIDHALLLRE 152 Query: 178 VRRRISDARFMTLLWKTIKAGHID---------VGLFRAAS--EGVPQGGVISPLLSNIM 226 V R + DAR + L+ + I A H D GLF G+P G + S L+N+ Sbjct: 153 VGRAVDDARVLELIGR-ILASHADGAAQEWRAGAGLFDVEQRPRGLPIGNLTSQFLANVH 211 Query: 227 LNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK 286 L+ D + V++ + K Y RY DDF+L Sbjct: 212 LHPLDLF----------------------------VKQTLRVK---GYVRYVDDFLLFGD 240 Query: 287 ---GTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 KA + +RE R +L+LR++ DK ++ G F+G Sbjct: 241 DRAALKAHGQRVREFVR-----TLRLRVHPDKFRLSRTEQGVDFVG 281 >UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU Length = 546 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. 
Identities = 92/314 (29%), Positives = 133/314 (42%), Gaps = 67/314 (21%) Query: 35 RITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLR 94 ++T + G T GVD ++ R+ + I D + RRV I K NGK R Sbjct: 55 KVTQDNLGKRTAGVDRISNLTPDERMELVQNIQIDN------KSDKIRRVTILKPNGKER 108 Query: 95 PLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG 154 LGIP +RDR Q + A+EP +E+ F SYGFRP RS + A + + C + Sbjct: 109 HLGIPTIRDRAKQCLVKFALEPQYEAIFEPNSYGFRPGRSSNDARQAI----VKCLQQLP 164 Query: 155 RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA---GHIDVGLFRAASE- 210 + V++ D+ FD + H L+ + LL + ++A I G E Sbjct: 165 KHVLDADIERCFDNIDHSKLIHGINT-------FPLLREQVRAWLKACILTGFKENIKEV 217 Query: 211 ----GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENW 266 G PQGG+ISPLL+NI L+ G AV Sbjct: 218 IPEAGTPQGGIISPLLANIALH----------------------------GMEKAV---- 245 Query: 267 QWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND--- 323 K V RYADDF+++ K EA + + L+ +L L+L+ KTKI + Sbjct: 246 -CKSGVYLIRYADDFLILCNEEKELSEA-KNKIEIFLQ-NLGLKLSESKTKITYTGSSEY 302 Query: 324 ----GFIFLGHRLI 333 G FLG + Sbjct: 303 SRTKGVDFLGFNFV 316 >UniRef50_B0VI85 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VI85_9BACT Length = 343 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. 
Identities = 70/288 (24%), Positives = 128/288 (44%), Gaps = 41/288 (14%) Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L+ EL++G+Y+P P R Y K R + + RDR+V +++ +EP +ES F S Sbjct: 49 LQKELINGNYRPQPYR--YFTIKEPKERLISVAVFRDRLVHHSLINVIEPYFESIFIKDS 106 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 Y R + +H A+ V+ + W ++ D+ +F+ + H +L+K + +I D Sbjct: 107 YATRKGKGLHLAVLAVQKY-----SRQYPWFLKLDIEKFFNNIDHNILLKLISSKIKDPM 161 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + L +K ++ + G+P G + S +NI LN+ D Y Sbjct: 162 IINLCSIILKNQNLSMN--HNEEIGLPVGNLTSQFFANIYLNQLDHY------------- 206 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 +++N +K Y RY DDF+L + K ++++ + L Sbjct: 207 ---------------IKQNLGYK---GYVRYMDDFILFSEN-KDKLKSDLLLIKYFLSNI 247 Query: 307 LKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 LKL++ ++ VN G FLG+R+ K R + + + + R Sbjct: 248 LKLKIKDKSIQMNKVNQGIPFLGYRVFPKLIRVSNINLKRCLQNMQKR 295 >UniRef50_Q8TIC7 Reverse transcriptase n=1 Tax=Methanosarcina acetivorans RepID=Q8TIC7_METAC Length = 313 Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust. 
Identities = 70/226 (30%), Positives = 113/226 (50%), Gaps = 25/226 (11%) Query: 17 IQRLLRLITQPEWLAE--AARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 ++RL L+T + A+ A RI + G T GVDG T ++ L L Sbjct: 54 VKRLTYLLTH-SYSAKLLAVRIVTQNHGKRTAGVDGQLWTTASDKMQAAL-----SLSDR 107 Query: 75 HYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR--- 130 HY+ P RR+YIPK R L IP + DR +Q +A++P+ E+ T S+GFR Sbjct: 108 HYRAHPLRRIYIPKPGKSTKRHLSIPTMSDRAMQALYSLALQPVAETTADTRSFGFRLFK 167 Query: 131 -PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 + + +A R + +T W++EGD+ FD + H L + +D+ + Sbjct: 168 CAQDASSYAFRCL------WRDTSNPWILEGDIKGCFDNISHSWLKNNIP---TDS---S 215 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 +L + +K+G I F +G PQG +ISPLL+N+ L+ ++ L+ Sbjct: 216 ILSQFLKSGFIFDDTFHHTDKGAPQGSIISPLLANMTLDGIEKLLN 261 >UniRef50_Q5P2A1 Reverse transcriptase/retron type n=2 Tax=Proteobacteria RepID=Q5P2A1_AZOSE Length = 299 Score = 98.2 bits (243), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 63/192 (32%), Positives = 98/192 (51%), Gaps = 10/192 (5%) Query: 32 EAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNG 91 EA + ++ GA GVD + + L L + + + SG Y P P + V IPK NG Sbjct: 17 EAYQAVKANAGA--AGVDQQSIEAFEQDLKGNLYKIWNRMSSGSYFPPPVKAVAIPKKNG 74 Query: 92 KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGE 151 +R LG+P + DR+ Q + +EP E+ F SYG+RP RS A+ V + C + Sbjct: 75 GVRILGVPTVADRVAQMVVKRVIEPELEARFLPDSYGYRPGRS---ALEAVAVTRQRCWQ 131 Query: 152 TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL--WKTIKAGHIDVGLFRAAS 209 WV+E D+ FD + H LL++A+++ + M + W T H D G + Sbjct: 132 Y--PWVLEFDIKGLFDNIDHVLLLRALKKHVKCEWAMLYIKRWLTAPLQHAD-GTLEERT 188 Query: 210 EGVPQGGVISPL 221 +G PQGGV++ + Sbjct: 189 KGTPQGGVVTAM 200 >UniRef50_C3AUZ6 Reverse transcriptase n=3 Tax=Firmicutes RepID=C3AUZ6_BACMY Length = 174 Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust. 
Identities = 58/151 (38%), Positives = 79/151 (52%), Gaps = 5/151 (3%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 G+DGV+ T L L L + + SG Y P P + V I K NGK LGIP + DRI Sbjct: 5 GIDGVDFTEFDKDLKNNLYKLWNRMSSGSYFPNPVKGVEISKKNGKKGLLGIPTISDRIA 64 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q + M EP+ E F SYG+RP RS AI V + C + W+IE D+ F Sbjct: 65 QMIVRMNFEPLVEPIFCADSYGYRPNRS---AIDAVGMTRQRCWKM--AWLIEFDIKGLF 119 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 D + H L+MKAV R ++ + + + +KA Sbjct: 120 DNIDHELMMKAVHRHTNNNWVIHYIGRFLKA 150 >UniRef50_Q9FJR9 Similarity to maturase-related protein n=4 Tax=Magnoliophyta RepID=Q9FJR9_ARATH Length = 735 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 94/403 (23%), Positives = 162/403 (40%), Gaps = 81/403 (20%) Query: 29 WLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG-------------- 74 W+ ++ GA+ P + ++ L+ LA+ +L D G Sbjct: 106 WVLAYQKVCCDELGAYVPR-SSIQRSALENLLALRNSVLDDRFKWGSRLDFYIKSPRDKT 164 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+ L R++ + + P +DRIVQ +LM +EPI+ES F S+ FRP R+ Sbjct: 165 DYESLSKRKIKAILTTTQPTPF-----QDRIVQEVLLMILEPIYESRFSQKSFAFRPGRT 219 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL---- 190 H +R ++ W ++GDLS D + ++ ++ R + D + + L Sbjct: 220 AHTVLRVIRRNFAGY-----LWYVKGDLSVVLDGMKVGFVISSLMRDVRDKKVIDLIKSA 274 Query: 191 LWKTIKAGHIDVG-----------LFRAASEGVPQ------------------------- 214 L + ++ G R +E P+ Sbjct: 275 LVTPVVTSKVEDGEKKKTKKRKYQKKRVLAEDEPKPDPYWLETFFGFAPEEAGKSPQWGH 334 Query: 215 GGVISPLLSNIMLNEFDQYLHERYLSG-KARKDRWYWNNSIQRGRSTAVRENW-QWKPA- 271 G++SPLL N+ L+E D+++ + + K WNN G + +W ++ P Sbjct: 335 CGILSPLLVNVCLDELDRWMETKVKDFYRPSKSDVIWNNP--EGEADQGNTSWPEFVPTS 392 Query: 272 -------VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG 324 + Y RY ++ V+G +A +R+E ++ LRL+ + I H+ G Sbjct: 393 GPDKTRKMDYVRYGGHILIGVRGPRADAATLRKELIEFVDQKYMLRLDNENLPIEHITKG 452 Query: 325 
FIFLGHRLIRKRSRYGEMRVVST---IPQEKARNFAASLTALL 364 +FL H L R R Y +R +T I EK S+TA L Sbjct: 453 IMFLDHVLCR-RVVYPTLRYTATGGKIISEKGVGTLLSVTASL 494 >UniRef50_A5N448 Predicted reverse transcriptase/maturase family protein n=2 Tax=Clostridium kluyveri RepID=A5N448_CLOK5 Length = 462 Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 73/349 (20%), Positives = 153/349 (43%), Gaps = 63/349 (18%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLS---------SKGAHTPGVDGVNKTM 55 L + +P+ + QR+ R + ++ A S ++ H+ +GV+K + Sbjct: 11 LKDKSNNNPNYKFQRIYRYLFNIDFYFRAYSQVYSAGENIGNVVTEKVHSFNNEGVHKII 70 Query: 56 LQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAME 115 ++L + Y P + K N K I L D ++Q+ ++ ++ Sbjct: 71 -------------EKLKNESYCPESLEKS--DKQNKKHSQ--IKGLYDNLIQQIIVEILQ 113 Query: 116 PIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM 175 I+ +F S+ F P ++ H A+ +K T C + RW ++G++ S F +++ ++ Sbjct: 114 SIYNVNFSVNSHAFIPNKNCHTALYKIK---TTC--SGARWAVKGNIESCFYNINYDFVI 168 Query: 176 KAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 K++ +ISD RF+ L+ K + AG+ G+ Q ++ +L NI L++FD+Y++ Sbjct: 169 KSLCEKISDGRFINLIRKFLAAGYTKEKKNCDTWSGISQRESLANILINIYLDKFDKYIN 228 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 + + V Y RY D+F++ + GTK E + Sbjct: 229 KEF-------------------------------GQVKYTRYLDNFIIFISGTKDLAEYM 257 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGEMR 343 E+ + L+ L + ++ I +N + FLG+ + + + + + + Sbjct: 258 IEKIKVFLKDKLNIETTEEEIFIIDLNKQRVKFLGYEITKLKHNFKDNK 306 >UniRef50_Q4E9Y9 Reverse transcriptase n=1 Tax=Wolbachia endosymbiont of Drosophila ananassae RepID=Q4E9Y9_9RICK Length = 142 Score = 94.4 bits (233), Expect = 7e-18, Method: Compositional matrix adjust. 
Identities = 47/102 (46%), Positives = 63/102 (61%), Gaps = 2/102 (1%) Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 Q P +R+YI KSNGK RPLGIP ++DR +Q L A+EPI E+ SYGFRP+RS Sbjct: 18 QTSPLKRIYISKSNGKRRPLGIPTIKDRAMQALYLFALEPIAETISDRHSYGFRPKRSCA 77 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 A TV L + +W++EGD+ FD ++H LMK + Sbjct: 78 DA--TVACHLLLASRNQLQWILEGDIKGCFDNINHEWLMKHI 117 >UniRef50_D1I8B4 Whole genome shotgun sequence of line PN40024, scaffold_35.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1I8B4_VITVI Length = 730 Score = 94.0 bits (232), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 80/309 (25%), Positives = 136/309 (44%), Gaps = 61/309 (19%) Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 +D+IVQ + M +EPI+E+ F S+ FRP R+ H +R ++ W I+G Sbjct: 194 FQDKIVQEVLFMILEPIYEARFSEKSFAFRPGRNAHSVLRVIRRSFAGY-----LWYIKG 248 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID------------------- 201 DLS+ D + L++ A+ R + D + + LL + I Sbjct: 249 DLSTILDGMKVGLVISALMRDVRDKKVIDLLKAALVTPVITSQKRVLAEDEPKPDPYWLE 308 Query: 202 --VGLFRAASEGVP---QGGVISPLLSNIMLNEFDQYLHERYLSGKAR------KDRWYW 250 G +E +P G++SPLL+N+ L+E D R++ GK + K W Sbjct: 309 TFFGFAPEEAEKLPSWGHCGILSPLLANVCLDELD-----RWMEGKIKEFYRPSKSDVIW 363 Query: 251 NN---SIQRGRSTAVRENW-QWKPA--------VAYCRYADDFVLIVKGTKAQVEAIREE 298 N+ +++G ++ W ++ P + Y RY ++ V+G +A +R++ Sbjct: 364 NSPDGEVEQGNTS-----WPEFVPTSGPDKTRKMDYIRYGGHILIGVRGPRADAAILRKQ 418 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVST---IPQEKARN 355 + L+L+ + I H+ G +FL H L R R Y +R +T I EK Sbjct: 419 LIEFCDQKYMLKLDSESLPIEHITKGIMFLDHVLCR-RVVYPTLRYTATGGKIISEKGVG 477 Query: 356 FAASLTALL 364 S+TA L Sbjct: 478 TLLSVTASL 486 >UniRef50_Q7YAJ4 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ4_CHAVU Length = 576 Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 93/335 (27%), Positives = 152/335 (45%), Gaps = 60/335 (17%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG-----VDGVNKTMLQARLAVELQILR 68 S R L+ L+ P++L S G TPG +DG+ K LA LQ Sbjct: 103 SGRYTDLIGLLADPKFLIYCYETIKSKPGNMTPGKARSALDGLTKEWF-THLATLLQ--- 158 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 G + P K+ +R I +IVQ+AM + +E I+E F S+ Sbjct: 159 ----QGRFAP--------DKATDIVRGRFI----FKIVQKAMQVILEMIYEEKFIDCSHA 202 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 F+P R A+ L + + W IEGD+++ FD++ H ++M+ +++ I+ +F+ Sbjct: 203 FQPGRG-SAALTLASLHVGI--KKHQTWAIEGDITNCFDSIDHNVIMQIIKKEIACEKFL 259 Query: 189 TLLWKTIKAGH-IDVGLFRAASEGVPQG---GVISPLLSNIMLNEFDQYLHE-------- 236 L+ ++KAG+ +VG + G PQ +PLL ++L+E D+++ Sbjct: 260 ALVKGSLKAGYKTNVG--KVHIPGTPQALTSLCFAPLLCKVVLHELDKFVSSTLNKAKFD 317 Query: 237 -----RYLSGKARKDRWYWNNSIQRGRSTAVRENWQ-----WKPA---VAYCRYADDFVL 283 R LSG+A +++Q ++ R W+ P + Y RYADDFV+ Sbjct: 318 TETPRRRLSGEA----IVRASALQTTKTGNRRTLWKTSSDPMDPGFKRLFYVRYADDFVI 373 Query: 284 IVK-GTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 I+ G++A I+ L LKL LN TK Sbjct: 374 IITAGSRADAVEIKRLVTQFLSEELKLELNQRSTK 408 >UniRef50_A9AYP7 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AYP7_HERA2 Length = 431 Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 58/163 (35%), Positives = 88/163 (53%), Gaps = 5/163 (3%) Query: 36 ITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRP 95 I +S +G + G D V +A +Q L EL S Y+PLP RR+++ K +G R Sbjct: 30 IRISQRG-RSHGPDAVTILDFEAAWVDHMQQLAMELQSQIYRPLPPRRLFLDKRDGGKRS 88 Query: 96 LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR 155 + I A+RDRI QRA+L +EP E F SYGFRP V HA+ ++ + + Sbjct: 89 IAILAVRDRIAQRAVLQILEPEIEPTFLDCSYGFRPYVGVPHALTRIE----RYRQQGLQ 144 Query: 156 WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 WV D+S F T+ H++L+ + +RISD + L+ + + G Sbjct: 145 WVAHADISDCFGTIDHQILLSQLHQRISDRAVVELIGQWLSVG 187 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 50/169 (29%), Positives = 67/169 (39%), Gaps = 50/169 (29%) Query: 179 RRRISDARFMTLLWKTIKAGHIDVG---------------LFRAASEGVPQGGVISPLLS 223 +R IS R + L+K + G + G L R G QGG ISP+L+ Sbjct: 268 KRVISGLRSLAPLFKQVPGGSLTWGAAGIATLALIPLSQRLLRQHERGTLQGGAISPMLA 327 Query: 224 NIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVL 283 NI L+ FD+ + ER R+ADDFVL Sbjct: 328 NIYLDSFDRAMTER---------------------------------GHILVRFADDFVL 354 Query: 284 IVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 + +A VE + VL+ L+L KT + H NDG FLGHR Sbjct: 355 L-GAHQAAVEQALADATNVLK-RLRLATKESKTGVQHFNDGLTFLGHRF 401 >UniRef50_B3JKP4 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JKP4_9BACE Length = 184 Score = 90.9 bits (224), Expect = 7e-17, Method: Compositional matrix adjust. 
Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 6/137 (4%) Query: 8 WAATDPSLRI----QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 W D + + + L I P L + + + +KG G+D ++ L L Sbjct: 34 WICEDNIVEVPFDKEHLFEQILSPANLNRSYKAVIGNKGCG--GIDNMSCEQLFLWLLAN 91 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 L L+ G Y+P P +RV IPK NGK+R LGIP + DR+VQ+A+ + PI+E F Sbjct: 92 KDALIRSLMDGSYRPNPVKRVEIPKDNGKMRLLGIPTVVDRLVQQAINQVLSPIYEKQFS 151 Query: 124 TLSYGFRPERSVHHAIR 140 SYGFRP R H A+R Sbjct: 152 RRSYGFRPRRGCHDAVR 168 >UniRef50_Q35063 CoxI intron1 ORF n=2 Tax=Eukaryota RepID=Q35063_MARPO Length = 902 Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 103/379 (27%), Positives = 160/379 (42%), Gaps = 67/379 (17%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 +RKL T + S + + + + T P +L A S TP + + ++A Sbjct: 179 RRKLETLKRNEKSGKFENIYSICTDPNFLIAAYEQIKSHTSNMTPEGGEERENLFLRQVA 238 Query: 62 VELQIL-------RDELL-SGHYQPLPARRVYIPKSNG--KLRPLGIPALRDRIVQRAML 111 L+ L ELL S ++ ARR+ IP N + RPL I + D IVQ+AM Sbjct: 239 SPLESLDRAWFERTAELLRSEQFRFKLARRIMIPTPNKPREFRPLTIGS--DNIVQQAMK 296 Query: 112 MAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHH 171 + ME I+E F S+GFRP R H + + L+ T W +E D+ F+++ Sbjct: 297 IVMEHIYEPKFLDTSHGFRPGRGCHSGLEQICLKWTGAS-----WFLEFDIKRCFNSMDR 351 Query: 172 RLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVP---QGGVIS--------- 219 L+ +++ I D R+M L+ K AG + L G P QG V+S Sbjct: 352 HKLVFILQKDIEDQRWMDLVHKLFTAGLVGGEL------GGPDPLQGSVLSPWSSPPWAL 405 Query: 220 -PLLSNIMLNEFDQYL----HERYLSGKARKDRWYW-----------------------N 251 PL NI L++ DQ + +E S K R D+ Sbjct: 406 APLFCNIYLHDLDQEVAKMANELSRSRKRRVDKRTTAATRTPRTKAFRALTPQAEIMRVR 465 Query: 252 NSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 RG S R++ + A Y RYA +F+L + G + V ++ R V + +L L Sbjct: 466 RKAARGLSPTDRKDPNYARAF-YVRYAGNFLLGIAGPRELVATVKS--RIVQFVNSELHL 522 Query: 312 NMDKTKIPHVN-DGFIFLG 329 + I H++ + FLG Sbjct: 523 
ELTGGSISHISAESVKFLG 541 >UniRef50_B6IMH7 Phage-encoded reverse transcriptase, putative n=2 Tax=Proteobacteria RepID=B6IMH7_RHOCS Length = 356 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 78/273 (28%), Positives = 126/273 (46%), Gaps = 42/273 (15%) Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L +L L+ ++++G ++P R + + K R + P DR+V A++ +EP+ E Sbjct: 42 LEEKLFDLQGQMVNGVWRPGRPREFMV--RDPKPRLISAPPFADRVVHHAVVRVIEPVLE 99 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAV 178 F SY R R VH A+ ++ L + G+ WV++ D+S YF +++H LM + Sbjct: 100 RRFIFDSYACRKGRGVHTAVDRLQRHLREASCEGGKVWVLKADISKYFASINHGRLMAIL 159 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 R ISD + + L +K D G+ G+P G + S L +NI L++ D ++ + Sbjct: 160 GRSISDKKVLWLCRTNLKGYGFDEGV------GIPVGALTSQLFANIYLDQLDHWIKDEL 213 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 I+R Y RY DDFV IV +KA + A+ + Sbjct: 214 --------------GIKR-----------------YVRYMDDFV-IVGHSKADLWALYDA 241 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHR 331 L L LRLN T +P + G F G+R Sbjct: 242 IADFLATKLALRLNRKTTVLP-ASGGIDFCGYR 273 >UniRef50_Q7MTH7 CRISPR-associated protein Cas1 n=3 Tax=Porphyromonas gingivalis RepID=Q7MTH7_PORGI Length = 1031 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 89/334 (26%), Positives = 140/334 (41%), Gaps = 60/334 (17%) Query: 45 TPGVDGVNKTMLQARLA--------VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPL 96 TPG D + + L + A E Q L + L Y P P V IPK +G R L Sbjct: 299 TPGNDTLYRKWLSSLAADRDLPMAEAERQDLLEALRICSYIPQPYHSVNIPKGDGSYRQL 358 Query: 97 GIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRW 156 IP+ D +QR++ + PI ES SY +R + A+R V+ L E Sbjct: 359 HIPSAVDLHLQRSLAGILYPITESLSIAQSYAYRKGKGAVAAVRRVQHLLDSLDENHT-- 416 Query: 157 VIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VGLFRAASEGVPQG 215 V+ D+ ++FD++ L++ V+R D +L +K+G +D + AS G+PQG Sbjct: 417 VVRCDIDNFFDSIPVPSLLQKVQRTTEDPFLTRMLSLWMKSGVVDRKQQYARASSGIPQG 476 Query: 216 GVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC 275 ++PLLSN+ L + D RY++G + Sbjct: 477 SPLAPLLSNLYLEDTD-----RYIAGHI---------------------------TTEFI 504 Query: 276 RYADDFVLIVKGTKAQVEAIREECRGVLEGSLK----LRLNMDKTKIPHVNDGFIFL--- 328 RYADD +L + + A+++ L LK L+LN D + + F FL Sbjct: 505 RYADDLLLFLPEKVDPLNALQD-----LSEHLKYRKGLKLNRDFV-VSSIKSSFSFLGIT 558 Query: 329 ----GHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 G R + + + G R ++ NF+A Sbjct: 559 FCADGSRSMSRDKKEGLKRKITLALHRDTENFSA 592 >UniRef50_C8VZI0 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VZI0_DESAS Length = 369 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 77/285 (27%), Positives = 129/285 (45%), Gaps = 45/285 (15%) Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 A L L +EL+S Y+ P R+ ++ + +L + +P DRIVQ ++ + P+ Sbjct: 39 ANLGENLIQAEEELISKSYRVSPYRKSFVYEPKKRL-VMALP-FGDRIVQWSVYRTLNPL 96 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMK 176 + + SY R H A++ ++ L GR +V++ D++ YF V H ++M Sbjct: 97 LNKRYISHSYACRTGYGSHRAVKQLQYWLRYLERRHGRIYVLKADMTKYFYRVDHDIIMN 156 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGL------FRA---ASEGVPQGGVISPLLSNIML 227 + R I D + LL + ++ H GL F G+P G + S +++N+ L Sbjct: 157 ILERIIGDYDLIWLLEEIVRCEHTWFGLPLDAEGFECELTGEVGIPIGNLTSQMIANLYL 216 Query: 228 NEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKG 287 NE DQY + N Q K Y RY DD VLI+ Sbjct: 217 NELDQY----------------------------AKHNLQIK---YYMRYMDD-VLILHN 244 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 K + I+EE L+ +L+L+LN +KT + G ++G+R+ Sbjct: 245 DKKYLWHIKEEIEEFLDRNLRLKLN-NKTCVRTNTQGIDWIGYRV 288 >UniRef50_Q4FUJ8 Possible RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Proteobacteria RepID=Q4FUJ8_PSYA2 Length = 362 Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 82/300 (27%), Positives = 126/300 (42%), Gaps = 60/300 (20%) Query: 56 LQARLAVELQILRDELLSGHYQPLPARR--VYIPKSNGKLRPLGIPALRDRIVQRAMLMA 113 + L EL L++EL + Y+P P + VY PK R + PA RD +VQ A+ + Sbjct: 44 FEKSLGRELNELQEELANNTYKPRPYFKFIVYEPKK----REIYAPAFRDCVVQYAIYLR 99 Query: 114 MEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRL 173 + PI++ F S+ R H A + L G + ++ D+ +F ++ Sbjct: 100 VMPIFDKTFIDQSFACRTGLGTHKAAEYAQDALRRAGPN--TYTLQLDIKKFFYSIDRPT 157 Query: 174 LMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASE--GVPQGGVISPLLSNIMLNEFD 231 L K + R+I D R + L+ LF E G+P G ++S + + I +N D Sbjct: 158 LRKLLERKIKDKRLVDLMM-----------LFADYPEPKGIPIGNLLSQMFALIYMNPVD 206 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 Y +T V KPA YCRY DDF L+ T+AQ Sbjct: 207 HY-------------------------ATRV-----LKPAAGYCRYVDDF-LLFGLTRAQ 235 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH------RLIRKRSRYGEMRVV 345 R+ +E KL+L + ++ I + G F G+ R IRK S Y + V Sbjct: 236 ALTYRKLLTDFVEQ--KLKLTLSRSTIANTKRGANFCGYRTWRSGRFIRKHSLYKTRKAV 293 >UniRef50_A4BSH6 Transposase n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BSH6_9GAMM Length = 258 Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 53/157 (33%), Positives = 83/157 (52%), Gaps = 35/157 (22%) Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 ++D +F+TLL + +KAG + G+F ++G PQG +SP+LSNI+LNE D L R L Sbjct: 1 MADEQFVTLLARLLKAGVVVKGVFEKTTKGCPQGSPLSPILSNIVLNELDHPLEARNL-- 58 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 YCR+ADDFV++V+ +A + + E Sbjct: 59 -------------------------------GYCRWADDFVIVVRSHRA-AQRVMEITVA 86 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR 338 LEG L ++++ DK+++ + D IFLG +L+R R R Sbjct: 87 YLEGGLGVKVSRDKSQVAPIKD-VIFLGFQLLRGRIR 122 >UniRef50_UPI0001C388AF RNA-directed DNA polymerase n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C388AF Length = 498 Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust. 
Identities = 96/349 (27%), Positives = 161/349 (46%), Gaps = 49/349 (14%) Query: 17 IQRLLRLITQP-EWLAEAARITLSSKGAH-TPGVDGVNKTMLQARLAVELQILRDELLSG 74 +++L RL+T + A R + G+ T GVDGV + Q +L + L+ L Sbjct: 41 VRKLQRLLTNSRDAKILAVREVIPENGSQKTAGVDGVKRLRNQEKLDL-ANCLK---LGR 96 Query: 75 HYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Q L RRV IP+ + R +GI + ++ Q + +A+EP WE+ F SYGFRP R Sbjct: 97 KTQGL--RRVSIPEPGRDEKRAVGILMMMEKAKQGLVKLALEPEWEARFDRNSYGFRPRR 154 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S AI + + + ++V++ + F+ ++H+ L+ + + R + W Sbjct: 155 SAQDAIAAIFNGMKE----DHKYVLDAHIEKCFEGIYHQKLLAKLNTYPTLRREIK-AW- 208 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 +K+G +D PQGG++ PLL+NI L+ + L +++ G A + Sbjct: 209 -LKSGVMDGKELFPTETDTPQGGLM-PLLANIALDGLESLLEDKFQGGVA---------N 257 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 G++T V RYADD V ++ G + A +E L + L+L Sbjct: 258 CGNGKATVV-------------RYADDLV-VLDGELEVILAAKETIEAWLM-EMGLKLKD 302 Query: 314 DKTKIPHV------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 T+I H N G+ FLG+ + + +R E R +T E+ R F Sbjct: 303 GNTRISHTFIEHEGNIGWDFLGYNIRQYPTR--EKRSRATGNTEQKRGF 349 >UniRef50_UPI00016C45C3 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C45C3 Length = 282 Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust. 
Identities = 68/221 (30%), Positives = 107/221 (48%), Gaps = 50/221 (22%) Query: 96 LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR 155 +GIP + DR+VQ+A+L + P+ + F SYGFRP+R H A+ K R Sbjct: 1 MGIPTVVDRLVQQAILQVLTPLLDPTFSNSSYGFRPKRGAHDALAAAK-----------R 49 Query: 156 WVIEG-----DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASE 210 +V EG +L + D V+H +LM RR++D + ++ + ++AG + G+ A E Sbjct: 50 YVEEGRTIVVNLERFSDRVNHDILMARRARRVADKHLLRVVRRFLEAGLMQDGVCIARHE 109 Query: 211 GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKP 270 G PQGG +SPLL+N++L+++D+ L R Sbjct: 110 GTPQGGPLSPLLANLLLDDWDRELERR--------------------------------- 136 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 +CRYADD + V+ A + R LEG L+LR+ Sbjct: 137 GHRFCRYADDCNVDVQSPGAGERVMASLIR-FLEGKLRLRV 176

Searching..................................................done

Results from round 2

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Sequences used in model and found again:

UniRef50_Q47688 Putative uncharacterized protein ykfC n=18 Tax=P...  516  e-145
UniRef50_B0EYP4 Reverse transcriptase-like protein n=48 Tax=cell...  500  e-140
UniRef50_A4XCB1 RNA-directed DNA polymerase n=7 Tax=Actinomyceta...  345  1e-93
UniRef50_D2QVU8 RNA-directed DNA polymerase (Reverse transcripta...  338  2e-91
UniRef50_Q92Y56 Reverse transcriptase n=3 Tax=Alphaproteobacteri...  331  2e-89
UniRef50_A7VUZ6 Putative uncharacterized protein n=1 Tax=Clostri...  322  1e-86
UniRef50_B4D9Y9 RNA-directed DNA polymerase (Reverse transcripta...  312  1e-83
UniRef50_C3QDF8 RNA-directed DNA polymerase n=9 Tax=Bacteroidale...  308  2e-82
UniRef50_Q1J4S7 Reverse transcriptase / RNA maturase / Endonucle...  307  4e-82
UniRef50_C3B599 Group II intron-encoded protein LtrA n=3 Tax=cel...  307  4e-82
UniRef50_B8FP60 RNA-directed DNA polymerase n=3 Tax=Firmicutes R...  307  6e-82
UniRef50_UPI00019088D4 mobile mitochondrial group II intron of C...  305  1e-81
UniRef50_Q2G689 RNA-directed DNA polymerase n=5 Tax=Bacteria Rep...  305  1e-81
UniRef50_Q56VE2 MatR n=6 Tax=Bacteria RepID=Q56VE2_BACFR  304  3e-81
UniRef50_A9B955 RNA-directed DNA polymerase n=1 Tax=Herpetosipho...  304  4e-81
UniRef50_Q3CZ44 Prophage LambdaSa1, reverse transcriptase/matura...  302  1e-80
UniRef50_A9IAV6 Mobile mitochondrial group II intron of COX1 whi...  302  2e-80
UniRef50_Q02718 Reverse transcriptase homologue COI iA grp II pr...  301  3e-80
UniRef50_B9IYU4 Reverse transcriptase n=21 Tax=Bacteria RepID=B9...  299  1e-79
UniRef50_Q3A299 Prophage LambdaSa1, reverse transcriptase/matura...  298  2e-79
UniRef50_C0YLQ1 RNA-directed DNA polymerase n=40 Tax=Bacteria Re...  297  4e-79
UniRef50_C3FAV3 RNA-directed DNA polymerase n=11 Tax=Bacteria Re...  297  6e-79
UniRef50_C8R0Q7 RNA-directed DNA polymerase (Reverse transcripta...  295  2e-78
UniRef50_B0TA92 Reverse transcriptase (RNA-dependent DNA polymer...  294  3e-78
UniRef50_A5CZB8 Retron-type reverse transcriptase n=20 Tax=Bacte...  294  4e-78
UniRef50_Q3ESS6 Reverse transcriptase / RNA maturase / Endonucle...  293  6e-78
UniRef50_P0A3U1 DNA endonuclease n=8 Tax=Firmicutes RepID=LTRA_L...  293  8e-78
UniRef50_D2M2V6 RNA-directed DNA polymerase (Reverse transcripta...  292  1e-77
UniRef50_B8I7I5 RNA-directed DNA polymerase (Reverse transcripta...  290  5e-77
UniRef50_A5CZJ0 Retron-type reverse transcriptase n=1 Tax=Peloto...  290  5e-77
UniRef50_C6CA58 RNA-directed DNA polymerase n=3 Tax=Enterobacter...  289  1e-76
UniRef50_C5RJ16 RNA-directed DNA polymerase (Reverse transcripta...  289  1e-76
UniRef50_B0I1N9 Reverse transcriptase homolog n=7 Tax=cellular o...  288  2e-76
UniRef50_Q024N3 RNA-directed DNA polymerase n=6 Tax=Bacteria Rep...  288  3e-76
UniRef50_A3WXE2 Putative reverse transcriptase n=1 Tax=Nitrobact...  287  5e-76
UniRef50_A9AUN7 RNA-directed DNA polymerase n=4 Tax=Bacteria Rep...  286  7e-76
UniRef50_Q0S063 RNA-directed DNA polymerase (Reverse transcripta...  286  9e-76
UniRef50_C6J6N9 RNA-directed DNA polymerase n=1 Tax=Paenibacillu...  286  1e-75
UniRef50_A9IAY4 Reverse transcriptase n=38 Tax=Bacteria RepID=A9...  285  1e-75
UniRef50_C5CID2 RNA-directed DNA polymerase (Reverse transcripta...  285  3e-75
UniRef50_O47500 RT-like protein n=1 Tax=Venturia inaequalis RepI...  284  4e-75
UniRef50_D2CJC8 Putative uncharacterized protein orf2 (Fragment)...  283  6e-75
UniRef50_C9B0U1 RNA directed DNA polymerase n=2 Tax=Enterococcus...  283  6e-75
UniRef50_B9M2H7 RNA-directed DNA polymerase (Reverse transcripta...  283  6e-75
UniRef50_C7RV41 RNA-directed DNA polymerase (Reverse transcripta...  283  1e-74
UniRef50_B1N1A3 NicA n=1 Tax=Pseudomonas putida RepID=B1N1A3_PSEPU  281  2e-74
UniRef50_Q3A4Z2 Group II intron-encoding maturase n=98 Tax=Bacte...  280  5e-74
UniRef50_Q02717 Reverse transcriptase homologue COI ialpha grp I...  280  7e-74
UniRef50_A6YEC9 Putative reverse transcriptase and intron matura...  278  2e-73
UniRef50_B7C9E4 Putative uncharacterized protein n=1 Tax=Eubacte...  278  2e-73
UniRef50_Q74P60 Group II intron reverse transcriptase/maturase n...  278  2e-73
UniRef50_Q9MD87 Putative maturase n=1 Tax=Cryphonectria parasiti...  278  2e-73
UniRef50_A1T776 RNA-directed DNA polymerase n=1 Tax=Mycobacteriu...  278  3e-73
UniRef50_A4KVN1 Probable reverse transcriptase n=2 Tax=Sinorhizo...  278  3e-73
UniRef50_A7GTD4 RNA-directed DNA polymerase n=1 Tax=Bacillus cyt...  278  3e-73
UniRef50_B4D301 RNA-directed DNA polymerase (Reverse transcripta...  277  3e-73
UniRef50_Q9T654 Cox1I1a maturase (Fragment) n=2 Tax=cellular org...  277  4e-73
UniRef50_Q188V0 Group II intron reverse transcriptase/maturase n...  277  5e-73
UniRef50_C0JX29 Putative reverse transcriptase and intron matura...  277  5e-73
UniRef50_B1HW67 Possible group II intron reverse transcriptase/m...  277  6e-73
UniRef50_B8R181 Putative intron-encoded reverse transcriptase n=...  276  7e-73
UniRef50_B4WUH1 Group II intron, maturase-specific domain family...  275  2e-72
UniRef50_C2KES2 Reverse transcriptase/maturase n=14 Tax=Firmicut...  275  2e-72
UniRef50_A5VLF2 RNA-directed DNA polymerase (Reverse transcripta...  275  2e-72
UniRef50_C3FJT8 RNA-directed DNA polymerase (Reverse transcripta...  273  5e-72
UniRef50_A8MI91 RNA-directed DNA polymerase (Reverse transcripta...  273  8e-72
UniRef50_D2CK02 Putative uncharacterized protein orf3 (Fragment)...  273  8e-72
UniRef50_C9BNF1 Group II intron reverse transcriptase/maturase n...  272  1e-71
UniRef50_A5ZWA2 Putative uncharacterized protein n=2 Tax=Clostri...  272  1e-71
UniRef50_B4CYA7 RNA-directed DNA polymerase (Reverse transcripta...  272  1e-71
UniRef50_Q11ZP4 RNA-directed DNA polymerase n=33 Tax=Bacteria Re...  272  1e-71
UniRef50_Q01P79 RNA-directed DNA polymerase (Reverse transcripta...  272  1e-71
UniRef50_C1PA09 RNA-directed DNA polymerase n=3 Tax=Firmicutes R...  272  2e-71
UniRef50_B7HM08 Group II intron reverse transcriptase/maturase n...  272  2e-71
UniRef50_UPI0001C42942 reverse transcriptase n=1 Tax=Bacillus ps...  271  2e-71
UniRef50_C1L365 Group II intron-encoded protein n=1 Tax=Bacillus...  271  2e-71
UniRef50_C1BDP2 Putative RNA-directed DNA polymerase n=2 Tax=Rho...  270  5e-71
UniRef50_B2A0J9 RNA-directed DNA polymerase (Reverse transcripta...  270  5e-71
UniRef50_Q1QGR6 RNA-directed DNA polymerase (Reverse transcripta...  270  6e-71
UniRef50_C3LL08 Group II intron reverse transcriptase/maturase n...  270  7e-71
UniRef50_C3B585 Reverse transcriptase/endonuclease protein n=1 T...  270  9e-71
UniRef50_B2AJV8 RNA-directed DNA polymerase, retrotranscriptase ...  270  9e-71
UniRef50_A0RHJ0 Reverse transcriptase/endonuclease protein n=6 T...  269  9e-71
UniRef50_Q1Q0X4 Similar to Group II intron encoded reverse trans...  269  9e-71
UniRef50_C2XKK9 D-alanine--D-alanine ligase A (D-alanylalanine s...  269  1e-70
UniRef50_C5ER86 RNA-directed DNA polymerase n=1 Tax=Clostridiale...  269  1e-70
UniRef50_B0URY2 RNA-directed DNA polymerase n=25 Tax=cellular or...  268  2e-70
UniRef50_C4ZES6 RNA-directed DNA polymerase n=27 Tax=Bacteria Re...  268  2e-70
UniRef50_B4D379 RNA-directed DNA polymerase (Reverse transcripta...  268  2e-70
UniRef50_A2TD24 Intron encoded protein n=2 Tax=Bacillaceae RepID...  268  3e-70
UniRef50_Q24QQ9 Putative uncharacterized protein n=1 Tax=Desulfi...  268  3e-70
UniRef50_B7JTB6 Group II intron reverse transcriptase/maturase n...  268  3e-70
UniRef50_C3BJV7 D-alanine--D-alanine ligase A (D-alanylalanine s...  268  3e-70
UniRef50_B9J6F8 Reverse transcriptase n=7 Tax=Bacillus cereus gr...  268  3e-70
UniRef50_Q64E53 Prophage LambdaSa1 transcriptase/maturase family...  268  3e-70
UniRef50_Q0AW97 RNA-directed DNA polymerase (Reverse transcripta...  267  4e-70
UniRef50_C8VXL4 RNA-directed DNA polymerase (Reverse transcripta...  267  4e-70
UniRef50_C3KST3 Group II intron reverse transcriptase/maturase n...  266  7e-70
UniRef50_B3GTB4 Putative reverse-transcriptase protein n=1 Tax=V...  266  8e-70
UniRef50_A6DJK4 Reverse transcriptase/maturase n=21 Tax=Chlamydi...  266  1e-69
UniRef50_C6IQ61 Putative uncharacterized protein n=10 Tax=Bacter...  266  1e-69
UniRef50_C4ZCX5 RNA-directed DNA polymerase n=24 Tax=Bacteria Re...  265  1e-69
UniRef50_Q7UY81 Reverse transcriptase/maturase n=1 Tax=Rhodopire...  265  2e-69
UniRef50_P38478 Uncharacterized mitochondrial protein ymf40 n=1 ...  265  2e-69
UniRef50_P05511 Uncharacterized 91 kDa protein in cob intron n=1...  263  6e-69
UniRef50_A8VT23 S-layer domain protein n=12 Tax=Bacilli RepID=A8...  263  7e-69
UniRef50_B9K440 18S rRNA intron 1 protein n=1 Tax=Agrobacterium ...  263  7e-69
UniRef50_Q82RB7 Putative reverse transcriptase homolog; similar ...  263  8e-69
UniRef50_C9P0Q5 Retron-type reverse transcriptase n=3 Tax=Vibrio...  263  9e-69
UniRef50_B7I148 Reverse transcriptase n=9 Tax=Bacillus RepID=B7I...  263  1e-68
UniRef50_Q5ZTU1 Reverse transcriptase n=1 Tax=Legionella pneumop...  263  1e-68
UniRef50_Q1PUN9 Strong similarity to group II intron-encoded pro...  263  1e-68
UniRef50_C6MS68 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...  262  1e-68
UniRef50_A9BGC0 RNA-directed DNA polymerase (Reverse transcripta...  262  2e-68
UniRef50_Q94Z00 Orf757 n=4 Tax=stramenopiles RepID=Q94Z00_PYLLI  261  2e-68
UniRef50_UPI0001C42A66 RNA-directed DNA polymerase (Reverse tran...  261  3e-68
UniRef50_Q5U7I7 Maturase-related protein n=20 Tax=Gammaproteobac...  261  4e-68
UniRef50_B2JXR4 RNA-directed DNA polymerase n=10 Tax=Bacteria Re...  260  4e-68
UniRef50_A0L945 RNA-directed DNA polymerase (Reverse transcripta...  260  6e-68
UniRef50_C9S0G0 RNA-directed DNA polymerase n=2 Tax=Geobacillus ...  260  7e-68
UniRef50_A5VH22 RNA-directed DNA polymerase n=1 Tax=Sphingomonas...  260  8e-68
UniRef50_A1ZX33 Group II intron-encoded protein LtrA n=2 Tax=Bac...  259  1e-67
UniRef50_Q3S275 ORF718 n=2 Tax=Eukaryota RepID=Q3S275_THAPS  258  2e-67
UniRef50_B7CEC9 Putative uncharacterized protein n=1 Tax=Eubacte...  258  3e-67
UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0...  258  3e-67
UniRef50_B1L2I7 Reverse transcriptase/endonuclease protein n=37 ...  257  5e-67
UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=...  257  6e-67
UniRef50_Q93PB4 MS117, putative maturase n=1 Tax=Microscilla sp....  257  6e-67
UniRef50_B1I9Z1 GBSi1, group II intron, maturase n=33 Tax=Firmic...  256  6e-67
UniRef50_B0K6R3 RNA-directed DNA polymerase (Reverse transcripta...  256  7e-67
UniRef50_Q35062 CoxI intron2 ORF n=2 Tax=Marchantia polymorpha R...  256  9e-67
UniRef50_P03876 Putative COX1/OXI3 intron 2 protein n=3 Tax=Sacc...  256  1e-66
UniRef50_B3PDY2 Putative maturase n=1 Tax=Cellvibrio japonicus U...  255  2e-66
UniRef50_Q35056 CoxII intron2 ORF n=4 Tax=Embryophyta RepID=Q350...  255  2e-66
UniRef50_A8ZN56 RNA-directed DNA polymerase n=2 Tax=Cyanobacteri...  255  3e-66
UniRef50_D0LS09 RNA-directed DNA polymerase n=1 Tax=Haliangium o...  254  3e-66
UniRef50_C5D9G3 RNA-directed DNA polymerase (Reverse transcripta...  253  9e-66
UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcripta...  252  1e-65
UniRef50_Q47DU4 RNA-directed DNA polymerase (Reverse transcripta...  252  2e-65
UniRef50_C7V8C7 Reverse transcriptase n=1 Tax=Enterococcus faeca...  251  3e-65
UniRef50_P03875 Putative COX1/OXI3 intron 1 protein n=3 Tax=Fung...  251  3e-65
UniRef50_B7KM76 RNA-directed DNA polymerase (Reverse transcripta...  251  4e-65
UniRef50_Q1VQM5 Prophage LambdaSa1, reverse transcriptase/matura...  250  5e-65
UniRef50_Q8A4I4 Reverse transcriptase n=1 Tax=Bacteroides thetai...  250  5e-65
UniRef50_Q08WW1 Prophage LambdaSa1, reverse transcriptase/matura...  250  8e-65
UniRef50_Q119U8 RNA-directed DNA polymerase n=30 Tax=Bacteria Re...  250  9e-65
UniRef50_A5IEI2 Reverse transcriptase n=7 Tax=Bacteria RepID=A5I...  250  9e-65
UniRef50_A9ENQ0 Integron/retron-type RNA-directed DNA polymerase...  250  1e-64
UniRef50_Q3B1V7 RNA-directed DNA polymerase n=31 Tax=Bacteria Re...  249  1e-64
UniRef50_Q94Z24 Orf568 n=1 Tax=Pylaiella littoralis RepID=Q94Z24...  248  3e-64
UniRef50_A7BUU9 RNA-directed DNA polymerase n=1 Tax=Beggiatoa sp...  248  3e-64
UniRef50_Q6EI10 Reverse transcriptase/HNH endonuclease n=2 Tax=E...  247  4e-64
UniRef50_A4C8M3 RNA-directed DNA polymerase (Reverse transcripta...  246  7e-64
UniRef50_C6MRB5 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...  246  8e-64
UniRef50_B1VA32 Retron-type reverse transcriptase n=7 Tax=Candid...  245  2e-63
UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular or...  245  3e-63
UniRef50_Q8TJY1 Reverse transcriptase n=5 Tax=Methanosarcina Rep...  244  3e-63
UniRef50_A7MS60 Putative uncharacterized protein n=21 Tax=Vibrio...  244  4e-63
UniRef50_B8FP59 RNA-directed DNA polymerase n=9 Tax=Firmicutes R...  244  5e-63
UniRef50_C6MXE9 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...  244  5e-63
UniRef50_Q9G8T2 Orf762 n=2 Tax=Eukaryota RepID=Q9G8T2_RHDSA  243  6e-63
UniRef50_Q2FUJ3 RNA-directed DNA polymerase n=1 Tax=Methanospiri...  242  1e-62
UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepI...  242  2e-62
UniRef50_Q47277 Orf protein n=63 Tax=cellular organisms RepID=Q4...  241  3e-62
UniRef50_A6LY84 RNA-directed DNA polymerase (Reverse transcripta...  239  1e-61
UniRef50_A7BYN3 RNA-directed DNA polymerase n=2 Tax=Beggiatoa sp...  239  2e-61
UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8Y...  237  4e-61
UniRef50_D2LU13 RNA-directed DNA polymerase (Reverse transcripta...  235  2e-60
UniRef50_B0I1N8 Reverse transcriptase homolog n=2 Tax=Pylaiella ...  234  3e-60
UniRef50_B4WW73 Group II intron, maturase-specific domain family...  234  5e-60
UniRef50_B3JM52 Putative uncharacterized protein n=1 Tax=Bactero...  234  5e-60
UniRef50_Q8HQ89 ORF777 (Fragment) n=1 Tax=Schizosaccharomyces oc...  233  1e-59
UniRef50_Q7GEU5 Putative uncharacterized protein (Fragment) n=3 ...  233  1e-59
UniRef50_D0VMZ3 Putative reverse-transcriptase protein n=1 Tax=V...  233  1e-59
UniRef50_Q7YAJ3 Putative reverse transcriptase and intron matura...  232  1e-59
UniRef50_B4WV39 Group II intron, maturase-specific domain family...  227  6e-58
UniRef50_O99479 Reverse transcriptase homolog n=2 Tax=Eukaryota ...  227  6e-58
UniRef50_A8KXN1 RNA-directed DNA polymerase (Reverse transcripta...  226  9e-58
UniRef50_C3EEI5 Group II intron reverse transcriptase/maturase n...  226  1e-57
UniRef50_Q94Z25 Orf557 n=2 Tax=Pylaiella littoralis RepID=Q94Z25...  226  1e-57
UniRef50_D2FQY0 Regulatory protein GntR n=2 Tax=Staphylococcus a...  225  2e-57
UniRef50_Q1D1V6 Group II intron, maturase n=2 Tax=Myxococcales R...  224  3e-57
UniRef50_C0JWS6 Putative reverse transcriptase and intron matura...  224  3e-57
UniRef50_Q6TFE1 Putative group II intron-encoded maturase n=1 Ta...  224  5e-57
UniRef50_B8R160 Reverse transcriptase n=2 Tax=Volvox carteri Rep...  223  7e-57
UniRef50_A1BI39 CRISPR-associated protein Cas1 n=5 Tax=Chlorobia...  223  8e-57
UniRef50_A7UDN1 Putative reverse transcriptase n=2 Tax=Candida z...  223  1e-56
UniRef50_Q8GAR1 Reverse transcriptase n=20 Tax=Enterobacteriacea...  223  1e-56
UniRef50_A8LGE6 RNA-directed DNA polymerase n=1 Tax=Frankia sp. ...  222  1e-56
UniRef50_A6YE98 Putative reverse transcriptase and intron matura...  221  4e-56
UniRef50_UPI00016C4F75 RNA-directed DNA polymerase (Reverse tran...  219  8e-56
UniRef50_Q9G8T4 Orf621 n=1 Tax=Rhodomonas salina RepID=Q9G8T4_RHDSA  219  1e-55
UniRef50_Q10VN2 RNA-directed DNA polymerase (Reverse transcripta...  219  1e-55
UniRef50_D2LF37 RNA-directed DNA polymerase (Reverse transcripta...  216  1e-54
UniRef50_Q8YWX6 Alr1468 protein n=4 Tax=Cyanobacteria RepID=Q8YW...  216  1e-54
UniRef50_Q8RSV8 Maturase n=1 Tax=uncultured marine bacterium Rep...  213  6e-54
UniRef50_Q35064 Atp9 intron ORF n=1 Tax=Marchantia polymorpha Re...  213  7e-54
UniRef50_B1C301 Putative uncharacterized protein n=6 Tax=Clostri...  213  1e-53
UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU  211  3e-53
UniRef50_A6P1G1 Putative uncharacterized protein n=1 Tax=Bactero...  211  4e-53
UniRef50_D1RME6 Reverse transcriptase family protein n=1 Tax=Leg...  211  4e-53
UniRef50_Q35063 CoxI intron1 ORF n=2 Tax=Eukaryota RepID=Q35063_...  208  3e-52
UniRef50_C6I8L1 CRISPR-associated protein n=1 Tax=Bacteroides sp...  206  1e-51
UniRef50_UPI0001C15D3C hypothetical protein CRC_00192 n=2 Tax=No...  205  2e-51
UniRef50_UPI0001C388AF RNA-directed DNA polymerase n=1 Tax=Arthr...  204  4e-51
UniRef50_P19593 Probable reverse transcriptase n=2 Tax=Scenedesm...  204  4e-51
UniRef50_Q1Q3I7 Putative uncharacterized protein n=1 Tax=Candida...  203  1e-50
UniRef50_Q67M30 Group II intron-encoding maturase n=1 Tax=Symbio...  201  4e-50
UniRef50_B3CUR8 Reverse transcriptase n=24 Tax=Orientia tsutsuga...  199  1e-49
UniRef50_B5W904 Group II intron maturase-specific domain protein...  197  5e-49
UniRef50_Q7YAJ6 Putative reverse transcriptase and intron matura...  193  7e-48
UniRef50_A6TR85 RNA-directed DNA polymerase (Reverse transcripta...  193  8e-48
UniRef50_A4WS58 RNA-directed DNA polymerase (Reverse transcripta...  192  2e-47
UniRef50_B6IMH7 Phage-encoded reverse transcriptase, putative n=...  190  6e-47
UniRef50_C0FSR2 Putative uncharacterized protein n=1 Tax=Rosebur...  190  8e-47
UniRef50_D2LKK7 RNA-directed DNA polymerase (Reverse transcripta...  189  9e-47
UniRef50_B8GRZ4 RNA-directed DNA polymerase (Reverse transcripta...  189  2e-46
UniRef50_B6FPD0 Putative uncharacterized protein n=1 Tax=Clostri...  187  4e-46
UniRef50_B0VI85 RNA-directed DNA polymerase (Reverse transcripta...  187  4e-46
UniRef50_B1I2T7 Retron-type reverse transcriptase-like protein n...  187  6e-46
UniRef50_A1RKU4 RNA-directed DNA polymerase (Reverse transcripta...  187  7e-46
UniRef50_C0A8Z3 RNA-directed DNA polymerase (Reverse transcripta...  186  1e-45
UniRef50_C8PKY6 Putative CRISPR-associated protein Cas1 n=1 Tax=...  184  6e-45
UniRef50_C9L4G0 Reverse transcriptase family protein n=3 Tax=Clo...  184  6e-45
UniRef50_A5ZQ10 Putative uncharacterized protein n=1 Tax=Ruminoc...  182  1e-44
UniRef50_D2MKC4 RNA-directed DNA polymerase (Reverse transcripta...  181  3e-44
UniRef50_Q7YAJ4 Putative reverse transcriptase and intron matura...  181  4e-44
UniRef50_Q5P2A1 Reverse transcriptase/retron type n=2 Tax=Proteo...  181  4e-44
UniRef50_A5N448 Predicted reverse transcriptase/maturase family ...  180  6e-44
UniRef50_Q2W777 Retron-type reverse transcriptase n=2 Tax=Magnet...  179  1e-43
UniRef50_C1DF40 Group II intron-encoding maturase n=1 Tax=Azotob...  179  2e-43
UniRef50_UPI000198600A PREDICTED: similar to intron maturase, ty...  178  3e-43
UniRef50_Q4FUJ8 Possible RNA-directed DNA polymerase (Reverse tr...  177  4e-43
UniRef50_B4SDM1 RNA-directed DNA polymerase (Reverse transcripta...  177  4e-43
UniRef50_D2R8Z2 CRISPR-associated protein Cas1 n=1 Tax=Pirellula...  176  1e-42
UniRef50_C6PFC6 RNA-directed DNA polymerase (Reverse transcripta...  176  1e-42
UniRef50_C8VZI0 RNA-directed DNA polymerase (Reverse transcripta...  176  2e-42
UniRef50_B4UZZ3 RNA-directed DNA polymerase n=1 Tax=Streptomyces...  175  2e-42
UniRef50_Q775D8 Reverse transcriptase n=1 Tax=Bordetella phage B...  174  4e-42
UniRef50_A4VK83 Reverse transcriptase n=1 Tax=Pseudomonas stutze...  174  4e-42
UniRef50_Q1XGD2 Group II intron-associated open reading frame n=...  174  6e-42
UniRef50_Q8TIC7 Reverse transcriptase n=1 Tax=Methanosarcina ace...  173  8e-42
UniRef50_B4S9Q0 RNA-directed DNA polymerase (Reverse transcripta...  173  9e-42
UniRef50_Q7M1J5 Reverse transcription like protein 2, intron-enc...  173  1e-41
UniRef50_Q9FJR9 Similarity to maturase-related protein n=4 Tax=M...  172  1e-41
UniRef50_A6DE66 Putative uncharacterized protein n=1 Tax=Caminib...  172  1e-41
UniRef50_Q8HQ84 ORF786 n=1 Tax=Schizosaccharomyces octosporus Re...  171  3e-41
UniRef50_D1I8B4 Whole genome shotgun sequence of line PN40024, s...  171  3e-41
UniRef50_Q8YRF1 Alr3497 protein n=10 Tax=Cyanobacteria RepID=Q8Y...  171  6e-41
UniRef50_C7RQ26 RNA-directed DNA polymerase (Reverse transcripta...  170  7e-41

Sequences not found previously or not previously below threshold:

>UniRef50_Q47688 Putative uncharacterized protein ykfC n=18 Tax=Proteobacteria RepID=YKFC_ECOLI
          Length = 376

 Score = 516 bits (1330), Expect = e-145, Method: Composition-based stats.
 Identities = 376/376 (100%), Positives = 376/376 (100%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL
Sbjct: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES
Sbjct: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180
           DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR
Sbjct: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180

Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240
           RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS
Sbjct: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240

Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300
           GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR
Sbjct: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300

Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360
           GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL
Sbjct: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360

Query: 361 TALLWKVRISGEILLG 376
           TALLWKVRISGEILLG
Sbjct: 361 TALLWKVRISGEILLG 376

>UniRef50_B0EYP4 Reverse transcriptase-like protein n=48 Tax=cellular organisms RepID=B0EYP4_ECOLX
          Length = 507

 Score = 500 bits (1287), Expect = e-140, Method: Composition-based stats.
 Identities = 361/364 (99%), Positives = 362/364 (99%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL
Sbjct: 6   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 65

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES
Sbjct: 66  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 125

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180
           DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR
Sbjct: 126 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 185

Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240
           RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS
Sbjct: 186 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 245

Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300
           GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ EAIREECR
Sbjct: 246 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQAEAIREECR 305

Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360
           GVLEGSLKLRLNMDKTKI HVNDGFIFLGHR+IRKRSRYGEMRVVSTIPQEKARNFAASL
Sbjct: 306 GVLEGSLKLRLNMDKTKITHVNDGFIFLGHRIIRKRSRYGEMRVVSTIPQEKARNFAASL 365

Query: 361 TALL 364
           TALL
Sbjct: 366 TALL 369

>UniRef50_A4XCB1 RNA-directed DNA polymerase n=7 Tax=Actinomycetales RepID=A4XCB1_SALTO
          Length = 488

 Score = 345 bits (885), Expect = 1e-93, Method: Composition-based stats.
Identities = 134/372 (36%), Positives = 197/372 (52%), Gaps = 22/372 (5%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 MQ KL WAA P R L L+ P L A + GA T GVDG+ ++ + Sbjct: 25 MQAKLHRWAAAGPGRRFDDLFNLVHDPATLLVAYSRVAGNLGARTAGVDGMTVADVERHI 84 Query: 61 AV--ELQILRDELLSGHYQPLPARRVYIPKSNG--KLRPLGIPALRDRIVQRAMLMAMEP 116 V L LR ++ +G ++PLP R IPK G K+R LGIP + DR+VQ A+ + +EP Sbjct: 85 GVPGFLDDLRVQVKTGTFRPLPVRERKIPKPGGSGKVRRLGIPTVADRVVQAALKLVLEP 144 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I+E+DF +SYGFRP+R AI + G RWV++ D+ + FD++ H LM Sbjct: 145 IFEADFLPVSYGFRPKRRAQDAIAEIHY----YGTHGYRWVLDADIEACFDSIDHVALMD 200 Query: 177 AVRRRISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 VR RI D R +TL+ +KAG + ++G R G PQGG++SPLL+NI L D++L Sbjct: 201 RVRTRIKDKRVLTLVKAFLKAGILTELGDRRDTHTGTPQGGILSPLLANIALTVLDEHLM 260 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 + + +RG +T RYADDFV++V G + +A+ Sbjct: 261 AGWRPDATMASEYRRAQLRKRGEATW-----------RLVRYADDFVVLVHGGEDHAQAL 309 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRY-GEMRVVSTIPQEKAR 354 RE+ +L L LRL+ KT++ H++DGF FLG + +R R + V + I + R Sbjct: 310 REDVATML-APLGLRLSPAKTRVVHLSDGFDFLGFHIQWRRKRGTNKWHVYTFIAKRPIR 368 Query: 355 NFAASLTALLWK 366 + A + AL + Sbjct: 369 SLKAKVRALTRR 380 >UniRef50_D2QVU8 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QVU8_9SPHI Length = 507 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. 
Identities = 132/380 (34%), Positives = 208/380 (54%), Gaps = 16/380 (4%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +QRKL W+ T P+ + L +T L EA R +KG TPG+DG+ ++ R+ Sbjct: 15 VQRKLYQWSQTHPTEAYRELWNWLTDLRNLREAWRRVAQNKGKRTPGIDGMTVGSIRQRI 74 Query: 61 --AVELQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRDRIVQRAMLMAMEP 116 A L L+ +L +G Y+P P RR IPK+ GK RPLGIP + DR+VQ A+ +EP Sbjct: 75 GEAPFLATLQQQLRTGSYKPSPCRRKLIPKAGKPGKFRPLGIPTIADRVVQSAIKQVLEP 134 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLT---------DCGETRGRWVIEGDLSSYFD 167 I E+ F +SYGFRP R H A+ +++ + E +WVIEGD+ S FD Sbjct: 135 ILEARFWPVSYGFRPGRGCHGALEHIRMSMRPRKVNKQDNKRHEMPYQWVIEGDIQSCFD 194 Query: 168 TVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIML 227 + H LM +R+ +D R LL + +KAG + F G PQGG++SPLL+N+ L Sbjct: 195 HIDHHQLMDRIRQHSADRRVNQLLVQFLKAGILSEEQFLRTDAGTPQGGIVSPLLANVAL 254 Query: 228 NEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKG 287 ++ +ER+++ + ++ + + I+ + + + RYADDFV++V G Sbjct: 255 GLIEER-YERWVNHQTKRRQSRQCDGIKAAMWSRSVDRQAGRAVYFPFRYADDFVILVSG 313 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL-IRKRSRYGEMRVVS 346 T+ +A R+ + +L+ + L L+ +KTKI + +GF FLGHR+ +R RYG + Sbjct: 314 TQENAQAERKVLQTLLQEKMGLTLSPEKTKITPLTEGFQFLGHRVSMRWDYRYGWTPRLE 373 Query: 347 TIPQEKARNFAASLTALLWK 366 IP++KA + + L + Sbjct: 374 -IPKQKAADLRYRIKQLTGR 392 >UniRef50_Q92Y56 Reverse transcriptase n=3 Tax=Alphaproteobacteria RepID=Q92Y56_RHIME Length = 505 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. 
Identities = 135/391 (34%), Positives = 201/391 (51%), Gaps = 17/391 (4%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +QRKL W+ +P + + + +T L A + S+KG T GVDG+ ++ R Sbjct: 15 IQRKLYQWSKANPDDQWRDMWGWLTDLRVLRHAWQRVASNKGGRTAGVDGMTVGRIRNRS 74 Query: 61 AVE-LQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRDRIVQRAMLMAMEPI 117 L L+ +L SG Y+P PARR IPK+ G+ RPLGIP +RDR+VQ A + +EPI Sbjct: 75 EHRFLVDLQADLRSGAYRPSPARRKLIPKAGKPGQFRPLGIPTIRDRVVQGAAKILLEPI 134 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQL--------TDCGETRGRWVIEGDLSSYFDTV 169 +E+ F +SYGFRP R+ H A+ ++ T WVIEGD+ FD + Sbjct: 135 FEAQFWHVSYGFRPGRNTHGALEYIRRAALPQKRDEDTRRNRLPYPWVIEGDIKGCFDNI 194 Query: 170 HHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE 229 +H L++ +R+RI D R + L+ +KAG + F G PQGG+ISPLL+NI L+ Sbjct: 195 NHHHLLERMRKRIGDRRVVRLVGLFLKAGVLTEDQFLRTDAGTPQGGIISPLLANIALSA 254 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 ++ +ER+ + + +N + S + + RYADDFV++V G+ Sbjct: 255 IEER-YERWTYHRKKTQARRKSNGVAAAASARDSDRIAGRCVYLPVRYADDFVVLVSGSL 313 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL-IRKRSRYGEMRVVSTI 348 + A + L + L L +KTK+ + +GF FLG R + RYG V I Sbjct: 314 EEAMAEKSALADYLIKTTGLTLLPEKTKVTAMTEGFEFLGFRFSVHWDKRYGYGPRVE-I 372 Query: 349 PQEKARNFAASLTALLWKVRIS---GEILLG 376 P+ KA N + L + IS GE L G Sbjct: 373 PKAKAANLRHKVKQLTQRDSISVSLGEKLRG 403 >UniRef50_A7VUZ6 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VUZ6_9CLOT Length = 605 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. 
Identities = 124/372 (33%), Positives = 190/372 (51%), Gaps = 34/372 (9%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + D S R R+ R + ++ A + + +G TPG DG ++ Sbjct: 16 LKQKSKQDESHRYDRIYRNLFNEDFFLRAYQKIHAKQGNMTPGTDGTTIDGFS---RKQI 72 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + L YQP P RR YIPK NGK+RPLGIPA D++VQ + +E I+E F Sbjct: 73 SQLIELLKWERYQPKPVRRTYIPKKNGKMRPLGIPAFADKLVQEVVRQILEAIYEPIFSD 132 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 S+GFRP RS H A+ +K WVIEGD++ FD + H +L+K + ++I D Sbjct: 133 NSHGFRPNRSCHTALYQIKSTCR-----GTNWVIEGDITGCFDHIDHEILLKILLKKIDD 187 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSG 241 RF+ L+WK +KAG+++ + G PQGG+ISP+L+NI L+EFD+++ Y G Sbjct: 188 GRFLELIWKFLKAGYLEFNQKYNSLSGTPQGGIISPILANIYLHEFDKFMEGISAEYTKG 247 Query: 242 KARKDRWYWNNSIQRGRSTAVREN--------------------WQWKPAVAYCRYADDF 281 K R+ + + + N + V Y RYADDF Sbjct: 248 KQRRPYREYQILQYKRNRAKKKGNQEQADEYLRQMQNIPALDPMDKNYQRVKYVRYADDF 307 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYG 340 V+ + G+KA I+E G ++ L L L+ +KTKI +++D FLG+ + +S+ Sbjct: 308 VVCIIGSKATANEIKERIAGFMQKELHLELSREKTKITNLSDKRVRFLGYEIT--KSQEN 365 Query: 341 EMRVVSTIPQEK 352 +VV +I ++K Sbjct: 366 TKQVVDSIGRKK 377 >UniRef50_B4D9Y9 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Chthoniobacter flavus Ellin428 RepID=B4D9Y9_9BACT Length = 495 Score = 312 bits (799), Expect = 1e-83, Method: Composition-based stats. 
Identities = 129/386 (33%), Positives = 189/386 (48%), Gaps = 51/386 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +Q L A + P R L + + + LA A R ++ GA GVDGV L A Sbjct: 77 LQITLYRKAQSKPEYRFWSLYGEVQRADVLAAAWRRVKANAGA--AGVDGVTIEKLAADA 134 Query: 61 AVE---LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 VE L LR+EL Y+P P RRV IPK++G R LGIP L+DR+VQ A+ + + PI Sbjct: 135 QVEAAWLNGLREELHGKTYRPAPVRRVKIPKASGGYRGLGIPTLKDRVVQMAVYLVLMPI 194 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E+DFH SYGFRP R+ H A+ ++ L V++ DL+ YFDT+ HRLLM+ Sbjct: 195 FEADFHPRSYGFRPGRNAHQAVEEIREAL----RMGKTEVVDADLAQYFDTIPHRLLMRQ 250 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGL-----FRAASEGVPQGGVISPLLSNIMLNEFDQ 232 V RR+SD + L+ ++A ++ +A G PQGGVISPLL+NI L+ D Sbjct: 251 VARRVSDGMILKLIKAWLRAPILEEEEGGGRRMKANPCGTPQGGVISPLLANIYLHPLD- 309 Query: 233 YLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQV 292 +V ++ Q KP + RYADD V++ + + Sbjct: 310 ---------------------------DSVNDHCQQKPRM--IRYADDLVILCRPGEG-- 338 Query: 293 EAIREECRGVLEGSLKLRLNMDKTKIP-HVNDGFIFLGHRLIRKRSRYGEMRVVSTIP-- 349 ++E L+ L LN KT++ GF FLG ++S+ G V + Sbjct: 339 RGMKERLARWLQSR-GLTLNETKTRVVQSCESGFEFLGFTFRWQQSKKGTPYVHTEPSPA 397 Query: 350 -QEKARNFAASLTALLWKVRISGEIL 374 ++ RN LT R++G+ + Sbjct: 398 AKQSLRNRVRELTRRSTTWRVTGQTV 423 >UniRef50_C3QDF8 RNA-directed DNA polymerase n=9 Tax=Bacteroidales RepID=C3QDF8_9BACE Length = 606 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. 
Identities = 121/406 (29%), Positives = 191/406 (47%), Gaps = 54/406 (13%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + +RL R++ E A + + G TPG DG + + + L Sbjct: 23 DYKFERLYRILFNEEMFHVAYQRIYAKPGNMTPGTDGKTINRMS---LQRINKVIASLRD 79 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y+P PA+R +IPK NGK RPLGIP+ D++VQ + M +E I+E F S+GFRP R Sbjct: 80 ESYKPNPAKRTHIPKKNGKKRPLGIPSFEDKLVQEVVRMILEAIYEEVFANTSHGFRPNR 139 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S H A+ ++ T +W +EGD+ +FD + H +L+ +R+RI D RF+ L+ K Sbjct: 140 SCHTALTHIQKTF-----TGTKWFVEGDIKGFFDNIDHNVLIATLRKRIDDNRFLRLIRK 194 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDRWYW 250 + AG+I+ F ++G PQGG ISP+L+NI L+ FD+Y+ E R+ GK R + Sbjct: 195 LLNAGYIEDWRFHNTNKGTPQGGNISPILANIYLDNFDKYMEEYALRFNKGKERHITKEY 254 Query: 251 NN-------------------------------SIQRGRSTAVRENWQWKPAVAYCRYAD 279 +R + + + Y RYAD Sbjct: 255 KQLSGKMQGILKSIKNIKDADARLQLRDEYVKLGRERQKIESRDSMDETYRRFRYVRYAD 314 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS-- 337 DF++ V G+KA I+ + +E +LKL L+ +KT I + FLG + ++S Sbjct: 315 DFLIGVIGSKADCVKIKSDITNYMEENLKLELSQEKTLITNAQTPAKFLGFEVSVRKSDV 374 Query: 338 ----------RYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 RY ++V + E RN +A+ +KV ++ Sbjct: 375 VKRNKNNVSARYYNGKIVLKVAIETVRNKLEEYSAIRYKVENGRQV 420 >UniRef50_Q1J4S7 Reverse transcriptase / RNA maturase / Endonuclease n=2 Tax=Bacteria RepID=Q1J4S7_STRPF Length = 625 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. 
Identities = 110/389 (28%), Positives = 183/389 (47%), Gaps = 45/389 (11%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + + + +RL R + + +A + ++ G T GVD + A + Sbjct: 32 RKHSKEENYTYKRLYRNLYNIDLFLQAYQNIYANAGNMTKGVDNQTIS---AMSLERINK 88 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + D L Y P P +RVYIPK NGKLRPLGIP++ D++VQ M + I++ F S Sbjct: 89 IIDSLKDESYSPTPTKRVYIPKKNGKLRPLGIPSIGDKLVQEVCRMLLNSIYDESFEDTS 148 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFR RS H A+R ++ + C +W +EGD+ +FD + H +++ + +RI D R Sbjct: 149 HGFRDNRSCHTALRQIQNRFVRC-----KWFVEGDIKGFFDNIDHNIMIDILSKRIDDER 203 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERYLSGKA 243 F+ L+ K +K+G+++ + G+PQG +ISP+LSNI L++FD+Y+ E + G Sbjct: 204 FLRLIRKFLKSGYMEQNQYHNTYSGMPQGSIISPILSNIYLDKFDKYMQNYKESFDKGNK 263 Query: 244 RKDRWYWNNSIQRGRS-------------------------------TAVRENWQWKPAV 272 RK + R + + + + Sbjct: 264 RKQNKEYKALYDRRKRLENKLSKTTNKTEIDDIKSEIEEINKRYFNIPCLNPMDENFKRI 323 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RYADDF++ + G+KA E ++++ ++ L L L+ +KT + D FLG + Sbjct: 324 QYVRYADDFIIGIIGSKADAEMVKQDIGQFIKSELNLELSDEKTLVTKSTDRAKFLGFDI 383 Query: 333 IRKRSRYGEMRVVSTIPQEKARNFAASLT 361 R + I KARNF + Sbjct: 384 RVTPGSNHTKRTKAGI---KARNFGGHVR 409 >UniRef50_C3B599 Group II intron-encoded protein LtrA n=3 Tax=cellular organisms RepID=C3B599_BACMY Length = 623 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. 
Identities = 123/403 (30%), Positives = 191/403 (47%), Gaps = 59/403 (14%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 + + + RL R + P + +A S++G T G DG N ++ IL + Sbjct: 16 NNKNYKYNRLYRNLYNPAFYLKAYTNISSNQGNMTKGTDGKNIDGFS---LEKINILIES 72 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 L YQP P++RV+IPK NG RPLGIP +D+++Q + M +E I+E F S+GFR Sbjct: 73 LKDESYQPHPSKRVFIPKKNGSKRPLGIPTFKDKLLQEVIRMILEAIYEMSFKESSHGFR 132 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P+RS H A+ V+ T +W +EGD+S +FD + H L+ +RRRI+D +F+ L Sbjct: 133 PKRSCHTALHKVRKTF-----TGVKWFVEGDISGFFDNIDHHTLIALLRRRITDEKFIRL 187 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 +WK ++AG+++ +R + G PQGG+ISPLLSN+ L E D ++ E Y ++ + Sbjct: 188 IWKFLRAGYLEEWKYRGSYSGTPQGGIISPLLSNVYLTELDTFMEE-YQKEFSKGAKRKT 246 Query: 251 NNSIQRGRSTAVRENWQWK--------------------------------------PAV 272 + +R + Q K + Sbjct: 247 TKAYKRQEYLTYKHRKQLKENWGQLTEQEKKQGIIKYKSLKEELLKTPFGDPMDDSYKRI 306 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RYADDF++ + G KA ++E+ L LKL L+ +KT I H + FLG+ + Sbjct: 307 QYVRYADDFLVGIIGNKADAVKVKEDLTNFLRDKLKLELSQEKTLITHSSKKARFLGYNI 366 Query: 333 IRKR------------SRYGEMRVVSTIPQEKARNFAASLTAL 363 R +R+ MR +P EK AL Sbjct: 367 TVSRMPVTKRDKNGFLTRHQIMRCKLYLPSEKWIEKLKQYGAL 409 >UniRef50_B8FP60 RNA-directed DNA polymerase n=3 Tax=Firmicutes RepID=B8FP60_DESHD Length = 607 Score = 307 bits (785), Expect = 6e-82, Method: Composition-based stats. 
Identities = 121/402 (30%), Positives = 188/402 (46%), Gaps = 57/402 (14%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + R +RL R + P++ A + ++ GA TPGVD L ++ Sbjct: 12 QRQSQKQDYRFERLYRNLYNPDFYLLAYQKLYANNGAMTPGVDRTT---LDGTGMERIES 68 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 L L + YQP PARR YIPK +GK RPLGI A D++VQ + M +E I+E F Sbjct: 69 LIQSLKNRSYQPQPARRRYIPKKSGKGQRPLGIQAANDKLVQEVVRMLLESIYEPTFLDS 128 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 S+GFRP RS H A+ ++ +W +EGD+ +YFDT+ H +L+ +RRRI D Sbjct: 129 SHGFRPNRSCHTALARMQRSF-----NGVKWFVEGDIKAYFDTIDHHMLVNILRRRIQDE 183 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGK 242 F++L+WK ++AG+++ + A G PQG SPLL+N+ L+E D Y+ E R+ G Sbjct: 184 NFISLIWKFLRAGYLEDWQYNATYSGSPQGSGASPLLANLYLHELDLYMEEYKQRFDKGN 243 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQW---------------------------------- 268 R+ + + + + V+ + W Sbjct: 244 RRQAGKDYGRAQRTHQYRKVKYDRLWLTLTDEEKKAAQREIRALRKRMLECPANDPMDGT 303 Query: 269 KPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFL 328 + Y RY DDF++ + G+K E ++ + L LKL L+ +KT I H D FL Sbjct: 304 YRRIQYVRYCDDFLVGIIGSKTDAEKVKADICRFLSDKLKLTLSPEKTLITHGQDKARFL 363 Query: 329 GHRL-----------IRKRSRYGEMRVVSTIPQEKARNFAAS 359 G+ + R +SR +V +P++K Sbjct: 364 GYDIAVCQDNTTRRTSRGQSRVHSGKVKLYVPKDKWVGKLRE 405 >UniRef50_UPI00019088D4 mobile mitochondrial group II intron of COX1 which IS involved in pre-mRNA splicing and in deletion of introns from n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI00019088D4 Length = 533 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. 
Identities = 123/413 (29%), Positives = 195/413 (47%), Gaps = 62/413 (15%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 +R ++ A + RI L RL+ P A+A + GA TPG+D + L Sbjct: 8 RRLISLPALSRQGKRINGLHRLLDCPNIWAQAYEAIARNSGALTPGID--PRNTLDGFSL 65 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 L + + G Y+ P RR YIPK+NGKLRPLGIP D++VQ A+ + +E I+E + Sbjct: 66 DRLNGIMRRVKEGSYRFKPVRRHYIPKANGKLRPLGIPDADDKLVQAAVKLVLEQIYEPN 125 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP+RS H A+ +++ W++E D++ +FD + H +LM +R+R Sbjct: 126 FSRRSHGFRPKRSCHTALASIQKTWG-----GTVWLVEADIAGFFDNIDHDILMNLLRKR 180 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERY 238 I D RF+ L+ + AG+++ + A+ G PQGGVISPLL+N+ L+EFD+++ R+ Sbjct: 181 IDDERFLKLIRGMLTAGYMEDWKWHASYSGTPQGGVISPLLANVYLHEFDEFMDSLKTRF 240 Query: 239 LSGKARKDRWYWNNSIQRG----------------------------------RSTAVRE 264 G R + + +G + + Sbjct: 241 DRGIERPTNPEYQKLLSKGAHCRQRIAKLRSSGREAEAERLRAQLQPLIVAARKLPSKDF 300 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND- 323 + + + Y RYADDF++ V GTK + I E L+ L L + +K+ I +D Sbjct: 301 HDTFFRRLQYVRYADDFLISVIGTKQEAADILNEVTSFLQDQLHLEVAPEKSGITKADDG 360 Query: 324 GFIFLGHRLIRKRSRYGEMRVVS-----------------TIPQEKARNFAAS 359 G FLG+ + + + E RV +P+EK FA Sbjct: 361 GVTFLGYAVRSTKRAFREKRVTRKGRRAMVKRTPVRGIRLHLPREKLAAFAKR 413 >UniRef50_Q2G689 RNA-directed DNA polymerase n=5 Tax=Bacteria RepID=Q2G689_NOVAD Length = 633 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. 
Identities = 118/399 (29%), Positives = 198/399 (49%), Gaps = 48/399 (12%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 ++ L RL+ P A + +KGA TPGVDG +++ + + L + Sbjct: 20 GRKVNGLYRLLKSPLLWEHAYQRIAPNKGAMTPGVDGQTFDGFSP---DKVRSIIERLAN 76 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G Y+P PARRVYIPK+NG+ RPLG+P D++VQ + +E I+E F S+GFRP+R Sbjct: 77 GTYRPQPARRVYIPKANGQKRPLGVPTTEDKLVQEVVRTILEQIYEPLFSRHSHGFRPKR 136 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S H A+ +++ T +W+I+ D+ +FD + H +L+ + +RI+D RF+ L+ Sbjct: 137 SCHTALESIRAIW-----TGVKWLIDVDVVGFFDNIDHDVLVSLLEKRIADRRFVRLIRG 191 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER---YLSGKARKDRW-- 248 +KAG+++ +F G PQGGV+SP+L+NI L+E D ++ + + GK R Sbjct: 192 LLKAGYVEDWVFHKTYSGTPQGGVVSPMLANIYLHELDMFMQAKMAGFDKGKQRSPSPDA 251 Query: 249 --------YWNNSIQRGRSTAVRENWQWK--------------------------PAVAY 274 Y ++ + R+ ++ + + Y Sbjct: 252 RRIRNRLSYVRRTVDQLRAKGRGDDPRVTSFLEEIGRLKAERLAVPASDAFDPNYRRLRY 311 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR 334 CRYADDF++ V G+K++ I EE R L LKL ++ +K+ I +DG FLG+ + Sbjct: 312 CRYADDFIIGVTGSKSEARQIMEEVRTYLSDHLKLAVSAEKSGIHKASDGARFLGYEVRT 371 Query: 335 KRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 + + P + R A + L+ + R+ + Sbjct: 372 MTNPNPHKAIFDGRPAVR-RGLADRMKLLVPRDRVVRFV 409 >UniRef50_Q56VE2 MatR n=6 Tax=Bacteria RepID=Q56VE2_BACFR Length = 599 Score = 304 bits (779), Expect = 3e-81, Method: Composition-based stats. 
Identities = 115/366 (31%), Positives = 183/366 (50%), Gaps = 42/366 (11%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 +P+ + +RL RL+ A A ++ G T G DG + + + +Q + D+ Sbjct: 16 QNPNYKFERLYRLLFNENLYALAYQMMSKKTGNMTKGTDGQTISGMSIK---RIQSIIDK 72 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 L YQP PA+R+YIPK NGK RPLGIP+ D++VQ+ + M +E I+E F S+GFR Sbjct: 73 LRDESYQPHPAKRIYIPKKNGKQRPLGIPSFEDKLVQKVIQMILESIYEGSFEKCSHGFR 132 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P R+ H A+ ++ G RW IEGD+ +FD + H +++ + RI+D RF+ L Sbjct: 133 PHRNCHTAMASIME-----GFDGTRWFIEGDIKGFFDNIDHDIMITILSERIADERFLRL 187 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDR 247 + K + AG+++ F G PQGG+ISP+L+NI L++ D+Y+ E ++ GK RK Sbjct: 188 IRKFLNAGYLEKWKFHKTFSGTPQGGIISPILANIYLDQLDKYVVEYISQFNRGKMRKRN 247 Query: 248 WYWNNSIQR-------------------------------GRSTAVRENWQWKPAVAYCR 276 + R + A + + + Y R Sbjct: 248 PEYKRIASRKDKRVKKLKTETDEQKRAALRSEIVELHREMQKHPATLDMDEDFRRMRYVR 307 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR 336 YADDF++ + G+K I+ + + L LKL L+ +KT I H +D FLG + ++ Sbjct: 308 YADDFLIGIIGSKDDCVNIKADIKRFLCEKLKLELSDEKTLITHGHDHAKFLGFEVTIRK 367 Query: 337 SRYGEM 342 S Sbjct: 368 SEKTRK 373 >UniRef50_A9B955 RNA-directed DNA polymerase n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B955_HERA2 Length = 587 Score = 304 bits (778), Expect = 4e-81, Method: Composition-based stats. 
Identities = 115/352 (32%), Positives = 183/352 (51%), Gaps = 24/352 (6%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE--LQILRDELLSG 74 R+ L+ A LS+ GA T G+DG+ K + + +Q + +L + Sbjct: 34 FHRVFNLMRTRRLATVALNRVLSNTGARTAGIDGMTKKHIATDTEQQALVQEIWHDLTTH 93 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P RRVYIPK+NG+ RPLGIP ++DR+VQ + + ++PI+ES F+ SYGFRP R+ Sbjct: 94 QYRPAPVRRVYIPKANGQQRPLGIPTIKDRVVQEMVRLILDPIYESTFYRHSYGFRPYRA 153 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 HHA+ ++ + G + +EGD+ + FD +HH L++ +RR I D R +T++ + Sbjct: 154 THHAVVRLRDLI---GRRGYQMALEGDIRACFDRIHHTTLIRILRRTIKDERLITVIHQM 210 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +KAG +D G +R +G PQGG++SPLL+NI LNE DQ++ R+ + + ++ Sbjct: 211 LKAGVMDDGQWRVTEDGTPQGGIVSPLLANIYLNELDQWVANRWDTYTPLERYYHRKAGT 270 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 RYADDFV+++ GT A+ ++ L L L L+ + Sbjct: 271 GY--------------PCQITRYADDFVVLLHGTHAEATTLKTALATFLADHLHLELSAE 316 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 KT I V GF FLG + + + T ++ F + K Sbjct: 317 KTLITPVEQGFDFLGFHIRKYQDS-----TRITPSRKAIATFKREAADRIGK 363 >UniRef50_Q3CZ44 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=2 Tax=Streptococcus agalactiae RepID=Q3CZ44_STRAG Length = 439 Score = 302 bits (773), Expect = 1e-80, Method: Composition-based stats. 
Identities = 120/372 (32%), Positives = 183/372 (49%), Gaps = 40/372 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRK+ D + L + + + L A +KG+ G+D ++A Sbjct: 16 FQRKIYLSTKADNKRKFGVLYDKVYRKDILKVAWFYVKRNKGS--AGIDDFTIEEIEAYG 73 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L + D+L + YQP +RVYIPK+NGK RPLGIP +RDR+VQ A+ + +EPI+E Sbjct: 74 VQKFLDEIEDQLRNKKYQPKAVKRVYIPKANGKKRPLGIPTVRDRVVQTAVKIVIEPIFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS + AIR + L E WVI+ DL YFDT+ H L+ V+ Sbjct: 134 ADFQKFSYGFRPKRSANQAIREIYKYLNYGCE----WVIDADLKGYFDTIPHDKLLLLVK 189 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R++D + LL ++AG ++ R+ G PQGGVISPLL+NI LN D+Y L Sbjct: 190 ERVTDKSIIKLLSLWLEAGIMEDNQVRSNILGTPQGGVISPLLANIYLNALDRYWKNNRL 249 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK-GTKAQVEAIREE 298 G+ RYADDFV++ K + ++ Sbjct: 250 EGRGHDA--------------------------HLIRYADDFVILCSNNPKKYYQYAKQR 283 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS-RYGEMRVVSTIPQEKARNFA 357 L L LN +KT+I H +GF FLG+ L + +S + G+ + ++ ++ Sbjct: 284 I-----DKLGLTLNEEKTRIVHATEGFDFLGYTLRKSKSHKSGKYKTYYYPSRKSMKSIK 338 Query: 358 ASLTALLWKVRI 369 + ++ + Sbjct: 339 GKVKDVIQTGQH 350 >UniRef50_A9IAV6 Mobile mitochondrial group II intron of COX1 which is involved in pre-mRNA splicing and in deletion of introns from mitochondrial DNA n=1 Tax=Bordetella petrii DSM 12804 RepID=A9IAV6_BORPD Length = 606 Score = 302 bits (772), Expect = 2e-80, Method: Composition-based stats. 
Identities = 115/401 (28%), Positives = 179/401 (44%), Gaps = 60/401 (14%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 + RI L RL+ P +A ++ GA T GVDG + L L L Sbjct: 17 SQQGKRINGLSRLMENPILWKQAYVNIYANSGATTAGVDG---SSLDGMSYERLAGLMAA 73 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + SG+Y+ P RRV IPKSNGK RPLGIP D++VQ + M + I+E F S+GFR Sbjct: 74 VKSGNYRFKPVRRVLIPKSNGKTRPLGIPTGDDKLVQEVVRMLLVKIYEPVFSDDSHGFR 133 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 RS H A+ V+ + T +W++ D+ YFD + H +L+ + +RI D RF+ L Sbjct: 134 NGRSCHTALMQVRQKWTGM-----KWIVNMDIKGYFDNIDHEVLVDVLAKRIDDKRFLGL 188 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LHERYLSGKARKDR 247 + +KAG+++ F G PQGGV+SP+L+NI L+E D+Y L + G R Sbjct: 189 IHSMLKAGYMEDWKFHDTFSGTPQGGVVSPVLANIYLHELDEYVAGLKAEFNRGNRRASN 248 Query: 248 WYWNN----------------------------------SIQRGRSTAVRENWQWKPAVA 273 + ++R ++ + Sbjct: 249 REYKRISGAIERLMKRIDAYKADGDSPKVEEAKRELAELYLRRKALSSSDPMDANYRRLV 308 Query: 274 YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLI 333 Y RYADDF++ + G++ + + + G + L L + +K+ + H +DG FLG+ + Sbjct: 309 YVRYADDFLIGIIGSRDEAVTVMQRVAGFISDKLHLEIAEEKSGVVHASDGVRFLGYDVR 368 Query: 334 RKRS---------------RYGEMRVVSTIPQEKARNFAAS 359 R R+ +P EK R+F Sbjct: 369 TYSGDRNVRTVRSGRSITARSVSERMQLHVPAEKLRSFCQR 409 >UniRef50_Q02718 Reverse transcriptase homologue COI iA grp II protein (Fragment) n=2 Tax=Fungi RepID=Q02718_PODAN Length = 790 Score = 301 bits (770), Expect = 3e-80, Method: Composition-based stats. 
Identities = 131/379 (34%), Positives = 194/379 (51%), Gaps = 43/379 (11%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL 72 + L +LI + L +A R S+ G TP +D + + L+ L EL Sbjct: 190 KDKKFVNLYQLICSKDLLIQAYRNVRSNPGGMTPSIDNITYDGIN---DEFLEKLILELK 246 Query: 73 SGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 S ++ +RVYIPK+NGK RPLGIP +D+IVQ AM + +E I+E F +S+GFRP+ Sbjct: 247 SERFKFTSVKRVYIPKANGKTRPLGIPTSKDKIVQEAMKILLELIYEPIFLDVSHGFRPK 306 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 RS H A+ + W++EGD+ +F+ V H++L+K + ++I D RF LLW Sbjct: 307 RSCHTALHQISKW------NGTTWMLEGDIKGFFNEVDHQVLIKILEKKIKDQRFFDLLW 360 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD-RWYWN 251 K +AG+ID G+ GVPQGGVISP+LSNI L+EFD ++ + KD N Sbjct: 361 KLFRAGYIDDGVKYNTYTGVPQGGVISPVLSNIYLHEFDLFVETLIKKYSSEKDFISKVN 420 Query: 252 NSIQRGRSTAVRENWQWKP----------------------------AVAYCRYADDFVL 283 I + S R N +++ V Y RYADD+V+ Sbjct: 421 PVIVKYSSKLSRLNDEYQTTKDKEILKEIIKLRAERNKLPSRIRNGIRVRYTRYADDWVI 480 Query: 284 IVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEM 342 + G + V I+EEC+ L LKL L+ +KTKI ++ + FLG + RK S GE Sbjct: 481 GIIGDQELVAKIKEECKAFLRDILKLELSEEKTKITNITEKEVRFLGVDIKRKDS--GES 538 Query: 343 RVVSTIPQEKARNFAASLT 361 +++ Q K R + + Sbjct: 539 KIIQR--QVKGRLIKSRIN 555 >UniRef50_B9IYU4 Reverse transcriptase n=21 Tax=Bacteria RepID=B9IYU4_BACCQ Length = 607 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. 
Identities = 115/402 (28%), Positives = 184/402 (45%), Gaps = 57/402 (14%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 T A + RL R + ++ +A G T G D ++ Sbjct: 16 TKNAVKENYIFTRLYRNLYNKKFFLDAYGNIYHKPGNMTQGTDKETIDGFSMDW---IEN 72 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + L Y+P P+RRVYIPK + K RPLGIP+++D+I+Q + + ++E F S Sbjct: 73 IISSLKDESYKPNPSRRVYIPKKDDKQRPLGIPSIKDKIIQEVVKEILVSMYEPIFSKAS 132 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFRP +S H A+ +K+ +W IEGD+ +FD + H +L+ +R+RI D + Sbjct: 133 HGFRPNKSCHSALNDIKMTFGGI-----KWWIEGDIKGFFDNIDHHVLIGILRKRIKDEK 187 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKA 243 F+ L+WK +KAG+++ F G PQGG+ISP+L+NI L+E D ++ + ++ GK Sbjct: 188 FIKLIWKFLKAGYMEDWKFNKTFSGTPQGGIISPVLANIYLHELDAFMEKQIIKFDEGKR 247 Query: 244 RKDRWYWNNS----------------------------------IQRGRSTAVRENWQWK 269 R+D + +R + +AV Sbjct: 248 RRDNPVYKKYNTAIWYRKNKLKEKWNTLNDDERKELQSEISTLEKEREKHSAVDNMDASF 307 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLG 329 + Y RYADDFV+ V G+K + I+EE L SLKL L+ +KT I + FLG Sbjct: 308 KRLKYVRYADDFVVGVIGSKEDSKRIKEEITEFLHTSLKLELSQEKTLITSNKNLIKFLG 367 Query: 330 HRLIRK------------RSRYGEMRVVSTIPQEKARNFAAS 359 + + + R+ + + +P RNF Sbjct: 368 YEIGIDGGHDSKTNTNGIKKRHLSGKPMLYLPYNNMRNFLLK 409 >UniRef50_Q3A299 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=5 Tax=Proteobacteria RepID=Q3A299_PELCD Length = 446 Score = 298 bits (764), Expect = 2e-79, Method: Composition-based stats. 
Identities = 124/384 (32%), Positives = 182/384 (47%), Gaps = 52/384 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +QRKL A +P+ R L + + + L+ A + ++KG+ G+DGV ++ R Sbjct: 11 LQRKLYRKAKQEPACRFHALYDKVYRADILSHAYALVRANKGS--AGIDGVTFAAIEERE 68 Query: 61 AV--ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 V + L + L S Y+P P +RV IPK++G RPLGIP +RDR+ Q A+ + +EPI+ Sbjct: 69 GVSALIAELEEALRSKTYKPDPVKRVMIPKADGSQRPLGIPTIRDRVAQMAVKLVVEPIF 128 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP++S H A+ V + VI+ D+S YFDT+ H LM V Sbjct: 129 EADFCDTSYGFRPKKSAHDAVDDVAYAMN----IGYTEVIDADISKYFDTIPHTNLMAVV 184 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGL---------FRAASEGVPQGGVISPLLSNIMLNE 229 RI D + L+ +K+ ++VG + G PQGGVISPLL+N+ L+ Sbjct: 185 AERICDGAILHLIQMWLKSSVMEVGKDGKKRNVGGGKGNRRGTPQGGVISPLLANLYLHI 244 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 D+ R N Q + RYADD VL+ + K Sbjct: 245 LDRIWE---------------------------RRNLQQRLNARIVRYADDTVLLCRRNK 277 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTI 348 + R +LE L L LN KTK+ + GF FLG + +SR Sbjct: 278 SD--EAMAVLRQILE-RLGLTLNEAKTKVVNGYKGGFDFLGFSIRMGKSRRTGNYYPHVQ 334 Query: 349 PQEK----ARNFAASLTALLWKVR 368 P +K ++ LT + VR Sbjct: 335 PSKKSLQVIKDRVTKLTNRVRTVR 358 >UniRef50_C0YLQ1 RNA-directed DNA polymerase n=40 Tax=Bacteria RepID=C0YLQ1_9FLAO Length = 624 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. 
Identities = 116/378 (30%), Positives = 187/378 (49%), Gaps = 46/378 (12%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 + + +RL +++ E A + S G T GVDG ++ L L Sbjct: 37 NTDYKFERLYKVLFNEEMYFIAYQKIYSKVGNMTAGVDGKTI---DGMSISRIERLIASL 93 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 + YQP P++R YIPK NGK RPLGIP+ D++VQ + M +E I+E F S+GFRP Sbjct: 94 RNETYQPNPSKRTYIPKKNGKKRPLGIPSFDDKLVQEVIRMILEAIYEGSFEHTSHGFRP 153 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 RS H A+ +V+ T RW IEGD+ +FD ++H +L+ ++ RI+D RF+ L+ Sbjct: 154 NRSCHTALLSVQQSFTAV-----RWFIEGDIKGFFDNINHEILIGILKERIADDRFIRLI 208 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY------------------ 233 K + AG+I+ ++ G PQGG++SP+L+NI L++ D+Y Sbjct: 209 RKFLNAGYIEDWVYHKTYSGTPQGGIVSPILANIYLDKLDKYVKDYIKDFDKGKRTTATR 268 Query: 234 ---LHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWK-------------PAVAYCRY 277 LHE+ A+K + + +++ ++E Q + + Y RY Sbjct: 269 QYRLHEQRRYRLAKKLKCETDETVREQMIKDIKELRQERNKYPAYDKMDGSFRKLKYVRY 328 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS 337 ADDF++ V G+K + I+E+ + L+ LKL L+ +KT + + FLG + + S Sbjct: 329 ADDFLIGVIGSKEDCKKIKEDIKVYLDEKLKLELSDEKTLVTNAKKPAKFLGFDVSVRNS 388 Query: 338 ----RYGEMRVVSTIPQE 351 R R V + Sbjct: 389 DESKRDKHGRTVRCFGDK 406 >UniRef50_C3FAV3 RNA-directed DNA polymerase n=11 Tax=Bacteria RepID=C3FAV3_BACTU Length = 652 Score = 297 bits (759), Expect = 6e-79, Method: Composition-based stats. 
Identities = 117/413 (28%), Positives = 193/413 (46%), Gaps = 61/413 (14%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + + + +RL R + PE+ + + ++ G T G + Sbjct: 33 SKNTIKENYKFKRLYRNLYNPEFYYKGYQEIYANPGNMTRGTINNTVDGFSKN---RVSK 89 Query: 67 LRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 + + + +G+Y+P P +RVYI K K RPLG+P D++VQ + +E I+E +F Sbjct: 90 IINNIKNGNYKPTPVKRVYIDKKGSKKKRPLGVPTFDDKLVQLVIKYILEAIYEPNFSEN 149 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 S+GFR R H A++ +K +W IEGD+ +FD + H +L+ +R+RI+D Sbjct: 150 SHGFRKNRGCHTALKQIKK-----SGNGTKWFIEGDIQGFFDNIDHHILINLLRKRINDE 204 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERYLSGK 242 + L+WK ++AG+++ F G PQGG++SPLL+NI LNE D Y+ +++ G+ Sbjct: 205 TLIGLIWKFLRAGYMEDWQFHKTFSGTPQGGILSPLLANIYLNELDIYMGKYAKKFGKGQ 264 Query: 243 AR-----KDRWYWNNSIQRGRSTAVRENWQWK---------------------------- 269 + K Y + I+RGR A Q K Sbjct: 265 PKDREVDKRYQYLHLKIKRGRKKADLLREQGKHNEAQELIEQVNEWVKERGQRPYYNPMS 324 Query: 270 ---PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI 326 ++ Y RYADDF++++ G+K +AI+ + L LKL L+ +KT I H + Sbjct: 325 DKFKSLKYVRYADDFIVMIIGSKDDAKAIKSDIAQFLNEELKLTLSEEKTLITHSSKKAT 384 Query: 327 FLGHRLIRKRS-------------RYGEMRVVSTIPQEKARNFAASLTALLWK 366 FLG+ + R+ R+ ++V IP E RN +L L K Sbjct: 385 FLGYNVNITRNELFTKYSVKGVKRRHHNLKVRLEIPHEAWRNKLLALNVLEMK 437 >UniRef50_C8R0Q7 RNA-directed DNA polymerase (Reverse transcriptase) n=6 Tax=Bacteria RepID=C8R0Q7_9DELT Length = 508 Score = 295 bits (754), Expect = 2e-78, Method: Composition-based stats. 
Identities = 119/352 (33%), Positives = 174/352 (49%), Gaps = 44/352 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 LL I + +A ++KGA PG+D + A + +R LL+G YQP Sbjct: 89 NLLERILSRANMLKAWERVKANKGA--PGMDNMPIADFMAFAREHWEEIRASLLAGTYQP 146 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 LP +RV IPK G RPLGIP + DR++Q+AM + PI++ DF SYGFRP RS H A Sbjct: 147 LPVKRVEIPKPTGGTRPLGIPTVLDRLIQQAMAQVLLPIFDPDFSEASYGFRPGRSAHDA 206 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I V+ + R ++ DLS +FDTV H LLM V R++ D R + L+ K ++AG Sbjct: 207 IHRVRDYI----RQGYRVAVDADLSKFFDTVDHDLLMNRVGRKVRDQRVLRLVGKYLRAG 262 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + G R +GVPQGG +SPLLSNI+L++ D+ L R Sbjct: 263 VMIDGRRRETRKGVPQGGPLSPLLSNILLDDLDKELERR--------------------- 301 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF+++VK +A E + LE LKL +N +K+K+ Sbjct: 302 ------------GHRFARYADDFIILVKSRRAG-ERVMTGITRFLESKLKLVVNQEKSKV 348 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEM---RVVSTIPQEKARNFAASLTALLWKV 367 N+ FLG + R+ + + + R++ S+ L K+ Sbjct: 349 APTNES-GFLGFIFKGAKIRWSDKAFAEFKRRVKKLTGRSWGVSMAFRLAKL 399 >UniRef50_B0TA92 Reverse transcriptase (RNA-dependent DNA polymerase) n=44 Tax=Bacteria RepID=B0TA92_HELMI Length = 475 Score = 294 bits (753), Expect = 3e-78, Method: Composition-based stats. 
Identities = 115/366 (31%), Positives = 182/366 (49%), Gaps = 42/366 (11%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L A P L+ + + + EA + +++KGA G+DG+ L+ L E Sbjct: 42 LPAQEAKQPREETYDLMEKVVERGNMTEAYKRVMANKGA--AGIDGMGLESLRPYLKEEW 99 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 ++ ELL G Y+P P RRV IPK G R LGIP + DR++Q+A+ + PI++ DF T Sbjct: 100 SRIKQELLEGTYRPQPVRRVEIPKPQGGTRKLGIPTVVDRLIQQALNQILMPIFDPDFST 159 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP +S H A++ K + D RWV++ DL+ +FD V+H +LM V R++ D Sbjct: 160 NSYGFRPGKSAHQAVKKAKEYIAD----GYRWVVDMDLAQFFDRVNHDILMARVARKVKD 215 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 R + L+ + +KAG + G+ + EG PQGG +SPLL+NI+L++ D+ L R Sbjct: 216 KRILKLIREYLKAGVMLNGIRVKSEEGTPQGGPLSPLLANIILDDLDKALESR------- 268 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 +CRYADD + V+ +A + + E LE Sbjct: 269 --------------------------GHRFCRYADDCNVYVRSRRAG-QRVMEGMAKFLE 301 Query: 305 GSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR-SRYGEMRVVSTIPQEKARNFAASLTAL 363 G LKL++N +K+ + + FLG + ++ +EK R F ++ Sbjct: 302 GRLKLQVNWEKSAVDRPWNR-KFLGFSFTWHKAAKIRLAPQTVKRVKEKIRQFTGRNRSI 360 Query: 364 LWKVRI 369 + R+ Sbjct: 361 AMEDRL 366 >UniRef50_A5CZB8 Retron-type reverse transcriptase n=20 Tax=Bacteria RepID=A5CZB8_PELTS Length = 423 Score = 294 bits (752), Expect = 4e-78, Method: Composition-based stats. 
Identities = 116/365 (31%), Positives = 181/365 (49%), Gaps = 45/365 (12%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ + + + L +A + +++GA PG+DG L L L EL +G Sbjct: 2 KKWYSLIDKVYRLDNLEKAYQAVRANRGA--PGIDGETVEAFGQNLGQRLIQLHHELKTG 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P +RV IPK +G RPLGIP +RDR+VQ+A+L ++PI+E FH SYG+RP RS Sbjct: 60 TYEPQPVKRVEIPKPDGSTRPLGIPTVRDRVVQQALLNILQPIFEPGFHPSSYGYRPGRS 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + + G +V++ DLS FD + H L+++ V R+ISD + L+ K Sbjct: 120 CHQAVAKAERFMNKYG---LEYVVDMDLSKCFDRLDHELILEEVNRKISDGSVLKLIKKF 176 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 + AG + G + G PQGGVISPLL+NI L+ FDQ + R Sbjct: 177 LTAGVMKDGQWDEIDTGSPQGGVISPLLANIYLDRFDQAMKSR----------------- 219 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 + RYADD ++ + T+ + R+ +LEG LKL +N + Sbjct: 220 ----------------GIRIVRYADDILVFAR-TRKEAGNYRQVATQILEGELKLEVNKE 262 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSR---YGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 KT + V++G +LG + +K + I + RN SL ++ ++ Sbjct: 263 KTHLTSVHEGVAYLGFIIHKKHVSIHPKKIKKFKDRIRELTPRNHGMSLKEMIKRL---N 319 Query: 372 EILLG 376 +L G Sbjct: 320 PVLRG 324 >UniRef50_Q3ESS6 Reverse transcriptase / RNA maturase / Endonuclease n=13 Tax=Firmicutes RepID=Q3ESS6_BACTI Length = 607 Score = 293 bits (751), Expect = 6e-78, Method: Composition-based stats. 
Identities = 121/407 (29%), Positives = 186/407 (45%), Gaps = 58/407 (14%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + + +RL R + PE+ A + GA T GVD + + EL Sbjct: 12 QKQSQKEDYKFRRLYRNLYNPEFYFTAYDNLSKNDGALTMGVDKRSIDGFSIEIIEELIE 71 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L YQP P++RVYIPK NGK RPLGIP+ D++VQ + M +E I+E F S Sbjct: 72 T---LKQRTYQPFPSKRVYIPKKNGKKRPLGIPSFADKLVQEVVRMILEAIYEPTFSISS 128 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 + ++ +S H A++ ++ T +W IEGD+ +FD ++H L+ +R+RI D Sbjct: 129 HAYQKGKSCHTALQEIQRTF-----TGSKWFIEGDIKGFFDNINHHTLIGILRKRIEDEA 183 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKA 243 F+ L+WK ++AG+++ F G PQGG+ISPLLSNI LNE D+Y+ + ++ GK Sbjct: 184 FIELIWKFLRAGYMEEWKFHNTFSGAPQGGIISPLLSNIYLNELDKYMMDFIQKFNQGKK 243 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWK---------------------------------- 269 RK + + R + + Sbjct: 244 RKINPDYERKYTQMRKAIRKYKIALENEQMGEAEQHLEQAKALKKELSSIPYSNPMDSNY 303 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFL 328 + Y RYADDF++ V G+K I+E L +L+L L+ +KT I H ++ FL Sbjct: 304 KRLTYVRYADDFLIGVIGSKYDARNIKETLTEYLMETLQLELSQEKTLITHASENHAGFL 363 Query: 329 GHRLIRKRS------------RYGEMRVVSTIPQEKARNFAASLTAL 363 G+ + R RY +V +P E N A+ Sbjct: 364 GYNIRVFRGSEPRKDAIGRVCRYLNGKVQLKMPHEAWVNKLKKYQAI 410 >UniRef50_P0A3U1 DNA endonuclease n=8 Tax=Firmicutes RepID=LTRA_LACLM Length = 599 Score = 293 bits (750), Expect = 8e-78, Method: Composition-based stats. 
Identities = 112/396 (28%), Positives = 187/396 (47%), Gaps = 41/396 (10%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 +++ + + RL R + +P+ A + S+KGA T G+ + Sbjct: 10 RISKNSQENIDEVFTRLYRYLLRPDIYYVAYQNLYSNKGASTKGILDDTADGFS---EEK 66 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKL-RPLGIPALRDRIVQRAMLMAMEPIWESDF 122 ++ + L G Y P P RR+YI K N K RPLGIP D+++Q A+ + +E I+E F Sbjct: 67 IKKIIQSLKDGTYYPQPVRRMYIAKKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVF 126 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 +S+GFRP+RS H A++T+K + RW +EGD+ FD + H L+ + +I Sbjct: 127 EDVSHGFRPQRSCHTALKTIKREFGGA-----RWFVEGDIKGCFDNIDHVTLIGLINLKI 181 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------- 234 D + L++K +KAG+++ + G PQGG++SPLL+NI L+E D+++ Sbjct: 182 KDMKMSQLIYKFLKAGYLENWQYHKTYSGTPQGGILSPLLANIYLHELDKFVLQLKMKFD 241 Query: 235 ----------------------HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAV 272 H K + +R R + Q + Sbjct: 242 RESPERITPEYRELHNEIKRISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVL 301 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RYADDF++ VKG+K + I+E+ + + LK+ L+ +KT I H + FLG+ + Sbjct: 302 KYVRYADDFIISVKGSKEDCQWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDI 361 Query: 333 IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 +RS G ++ + + L L K+R Sbjct: 362 RVRRS--GTIKRSGKVKKRTLNGSVELLIPLQDKIR 395 >UniRef50_D2M2V6 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M2V6_BACS4 Length = 441 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. 
Identities = 110/371 (29%), Positives = 176/371 (47%), Gaps = 49/371 (13%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 Q+ + + L+ + + EA + ++G+ GVDGV+ + + Sbjct: 7 QKSFRDGVKSTQKRKWYSLMDKVWAMSNMEEAFKEVKRNRGS--AGVDGVSIRTFEHGVE 64 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 +Q+L+ EL Y+P P +RV+IPK++G RPLGIP +RDR+VQ A+ +EPI+E Sbjct: 65 DNVQVLQRELKEKAYRPRPVKRVFIPKTDGTKRPLGIPTVRDRVVQAAVRRIIEPIFEDK 124 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP +S H A+ ++ L D +VI+ DL +YFDT+ L++AVR Sbjct: 125 FLDCSFGFRPNKSAHMALEKIRKDLMD----GYVYVIDADLKAYFDTIPQDKLIQAVREE 180 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 + D + L+ ++AG +D G F +G PQGGVISPLL+NI L+ D+ + +R Sbjct: 181 VVDGSVIRLIQSFLQAGVMDGGSFHLTEKGTPQGGVISPLLANIYLHPLDELMTKR---- 236 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 RYADDFV+ K K E + + Sbjct: 237 -----------------------------GHRITRYADDFVICCKSQK-GAERVLKSVTR 266 Query: 302 VLEGSLKLRLNMDKTKIP-HVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360 L L L ++ +KTK+ ++ + F+FLGH VS P+ + F + Sbjct: 267 FLNEELGLTVHPEKTKVVNNLEEPFLFLGHEF-------KGGYYVSASPKA-LKKFKEKV 318 Query: 361 TALLWKVRISG 371 + + + Sbjct: 319 KEITRRNQTVN 329 >UniRef50_B8I7I5 RNA-directed DNA polymerase (Reverse transcriptase) n=29 Tax=Bacteria RepID=B8I7I5_CLOCE Length = 618 Score = 290 bits (743), Expect = 5e-77, Method: Composition-based stats. 
Identities = 109/368 (29%), Positives = 186/368 (50%), Gaps = 13/368 (3%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 +A + L+++I+ E + A R S+ G+HT G D +N ++ +L + Sbjct: 23 YAQSKQGKSFTNLMKVISSEENIRLAYRNIKSNSGSHTSGTDTLNIKDIEKLSVEKLVEM 82 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 L+ YQP P +RV IPK NGK RPLGIP + DR+VQ+ +L +EPI E+ F+ S Sbjct: 83 MQRKLA-WYQPKPVKRVEIPKPNGKTRPLGIPTIVDRLVQQCILQVLEPICEAKFYERSN 141 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM-KAVRRRISDAR 186 GFRP RS HA+ + + +V++ D+ +FD V+H L+ + I D + Sbjct: 142 GFRPNRSAEHAMAQCYRMVQ---KQNLYFVVDVDIKGFFDNVNHSKLIRQMWAMGIRDKQ 198 Query: 187 FMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 + ++ + +KA + G ++G PQGG++SPLL+NI+LNE D ++ ++ + Sbjct: 199 LICIIKQMLKAPVVMPDGETLYPTKGTPQGGILSPLLANIVLNELDWWISSQWEDMLTHR 258 Query: 246 DRWY-WNNSIQRGRSTAVRENWQWKPAVAY-CRYADDFVLIVKGTKAQVEAIREECRGVL 303 + + NN+ +S R + Y RYADDF + + ++ I + L Sbjct: 259 EYYVSVNNNGSLNKSGVFRTLRRSALKEMYIVRYADDFKIFCR-KRSDANKIFVAVKKWL 317 Query: 304 EGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVSTIPQEKA-RNFAASLT 361 + LKL ++ +K+K+ ++ + FLG + K R G VV + EKA + L Sbjct: 318 KDRLKLEISEEKSKVVNLKKHYSEFLGFQF--KTVRKGRKFVVRSHMSEKAIKRETEKLK 375 Query: 362 ALLWKVRI 369 + ++ Sbjct: 376 EQIKEIAH 383 >UniRef50_A5CZJ0 Retron-type reverse transcriptase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5CZJ0_PELTS Length = 428 Score = 290 bits (742), Expect = 5e-77, Method: Composition-based stats. 
Identities = 120/376 (31%), Positives = 191/376 (50%), Gaps = 45/376 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 QRKL A + + R L + + + L A + ++KGA PG DG + ++ ++ Sbjct: 15 FQRKLYVKAKQEKTFRFYSLYDKLYREDVLQYAWQQCRANKGA--PGADGQSFKDIEEKV 72 Query: 61 AVE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 VE L+ + +EL +G Y+P+P RRVYI K +G RPLGIP ++DRI Q A L ++PI+ Sbjct: 73 GVERFLKEIAEELRNGTYRPMPVRRVYILKPDGSQRPLGIPTIKDRIAQMACLTVIQPIF 132 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP+R+ H AI + + + V + DL+ FD++ HRL+M ++ Sbjct: 133 EADFLDCSYGFRPKRNAHQAIGAITENI----KQGFTAVYDADLTKCFDSIQHRLIMDSL 188 Query: 179 RRRISDARFMTLLWKTIKAGHIDVG---LFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 RI+D + + L+ ++A ++ G R +G PQGGVISPLL+NI+LN D+ H Sbjct: 189 AERITDGKVLRLIKGWLEAPIVEPGGPKQGRKNYQGTPQGGVISPLLANIVLNRLDRLWH 248 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 R RE + + RYADDFV++ + E I Sbjct: 249 ----------------------RPGGPRERYNAR----LVRYADDFVVLA---RFIGEPI 279 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 + E ++ S+ L LN KT+I +N G FLG+ + R + R+ + Sbjct: 280 KNELESIIT-SMGLNLNEKKTRILDLNKGDILNFLGYSIRISRDK--NRRITIKPSDKAI 336 Query: 354 RNFAASLTALLWKVRI 369 + ++ + R+ Sbjct: 337 ARLRDKIREIISRERL 352 >UniRef50_C6CA58 RNA-directed DNA polymerase n=3 Tax=Enterobacteriaceae RepID=C6CA58_DICDC Length = 626 Score = 289 bits (740), Expect = 1e-76, Method: Composition-based stats. 
Identities = 116/410 (28%), Positives = 198/410 (48%), Gaps = 59/410 (14%) Query: 2 QRKLATWAATDP----SLRIQRLLRLIT-QPEWLAEAARITLSSKGAHTPGVDGVNKTML 56 QR L + + +I++L +++ + A+A S++GA T G++ + Sbjct: 4 QRTLNALSGINKASTQGYKIKKLHKIMCSNKDLWAQAYANIYSNQGAMTRGINNNTMDEM 63 Query: 57 QARLAVELQILRDELLSGHYQPLPARRVYIPK----SNGKLRPLGIPALRDRIVQRAMLM 112 + L L + S Y+P P RR +IPK NGK RPLGIP D+++Q M M Sbjct: 64 SVDRIINLIQLIN---SDSYKPKPCRRTHIPKDARKPNGKKRPLGIPTGDDKLIQEVMRM 120 Query: 113 AMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHR 172 +E I+E F SYGFRP+RS H A++ ++ G +WV + D+ YFD + H Sbjct: 121 LLEEIYEPVFSDWSYGFRPKRSCHSALKEIRN-----GWKGTKWVCDVDIKGYFDNIDHD 175 Query: 173 LLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ 232 LL+K + +RI+D +F+ LL K +K G++D + G PQGG+ISP+L+N+ L+E D+ Sbjct: 176 LLLKFLSKRIADNKFLALLKKFLKTGYLDNWRYFGTHSGTPQGGIISPILANVFLHELDE 235 Query: 233 YLHER---YLSGKARKDRWYWNNSIQRGRS------------------------------ 259 ++ R + +G RK + ++Q + Sbjct: 236 FMKNRISEFGTGGRRKPNPIYKRALQNRANRIKWIRQGFGASGMPADEQKIQKWKYEADE 295 Query: 260 --------TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 ++V + + Y RYADDF++ V G+K++ + I +E +E L L + Sbjct: 296 LEKQLRTLSSVIMDDTEFKRMRYVRYADDFLIGVIGSKSEAKKIMKEVVDFVENELHLEI 355 Query: 312 NMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLT 361 + +K+ I GF FLG+ + + R ++ V + + ++ A T Sbjct: 356 SKEKSGIIDPKKGFTFLGYEI-KTRRESKRVKCVVGLNTDGSKTHAVKRT 404 >UniRef50_C5RJ16 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=C5RJ16_CLOCL Length = 626 Score = 289 bits (739), Expect = 1e-76, Method: Composition-based stats. 
Identities = 115/377 (30%), Positives = 188/377 (49%), Gaps = 26/377 (6%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 MQ KL + L+ + + A S+ G+ TPG D Sbjct: 22 MQSKL--------NKTFTGLMEVAFNEVTIITAVHNIKSNSGSKTPGTDRNTIDKYLQMS 73 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 E+ L + + +Y+P PARR YIPKSNGK RPLGIP + DRI+ + + +EPI E+ Sbjct: 74 KEEVISLVKK-SASNYKPKPARREYIPKSNGKKRPLGIPTVIDRIILECIRIVIEPICEA 132 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-R 179 F+ SYGFRP R+ HAI ++ ++ + +VIEGD+ SYFD ++H++L+ + + Sbjct: 133 KFYPHSYGFRPYRACSHAIASIVHVISSTSKDIPHYVIEGDIKSYFDNINHKVLINKLWK 192 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 + D R + L+ +KAG+I+ LF G PQGG+ISPLL+N+ LN FD + Y Sbjct: 193 MGVHDKRMLCLIKLMLKAGYIERDLFYLTEAGTPQGGIISPLLANVYLNSFDWMIGRMYQ 252 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 K + + ++ ++ R T ++ + RYADD+V++ ++ + E + Sbjct: 253 EPKGIETKNDRSHCREKLRRTGIKPKY-------LVRYADDWVILTT-SRQEAERLLHYI 304 Query: 300 RGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRL-----IRKRSRYGEMRVVST--IPQE 351 R + LKL L+ +KT I + + FLG + + S+ +VV E Sbjct: 305 RRYFKHKLKLELSEEKTVITDIKCEKVKFLGFDVLAELPRKTPSKPNPKKVVGKAIPNTE 364 Query: 352 KARNFAASLTALLWKVR 368 K + + + K++ Sbjct: 365 KVKQQVKDICREIKKLK 381 >UniRef50_B0I1N9 Reverse transcriptase homolog n=7 Tax=cellular organisms RepID=B0I1N9_PYLLI Length = 749 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. 
Identities = 123/360 (34%), Positives = 187/360 (51%), Gaps = 29/360 (8%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 +P+ +R+L L++ + L A S G T GVDG + L+ L ++ Sbjct: 182 NPNHLNERILSLVSSYDMLEAAYIKIKSKPGNMTKGVDGKTLDGVNVGW---LKSLSRDV 238 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 SG Y+PLP+RRV IPK G RPLGIP+ RD+IVQ ++ ++ I+E F S+GFRP Sbjct: 239 GSGSYKPLPSRRVMIPKPQGGERPLGIPSPRDKIVQESIRTVLQAIYEPSFIACSHGFRP 298 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 RS H A++ KL + W IEGD+ FD++ HR+L + RRI D FM L Sbjct: 299 GRSCHTALKEAKLTFAN-----TTWFIEGDIEKCFDSIDHRVLSTLLERRIKDKGFMDLY 353 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERYLSGKARKDRW 248 WK +K G++ +G + +G PQG V+SPLLSNI L+E D+++ E + G RK Sbjct: 354 WKMVKVGYMSLGKINQSDKGTPQGSVVSPLLSNIYLHELDKWMTRKKESFDKGTRRKANP 413 Query: 249 YWNNSI---------QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + +R + + Y RYA DF++ + G+K + +E Sbjct: 414 VYTKYVRVAGGASAARRLNIPSADPLDPNFKRLRYVRYAGDFLIGIIGSKTDGICLIKEL 473 Query: 300 RGVLEGSLKLRLNMDKTKIPH-VNDGFIFLGH--------RLIRKRSRYGEMRVVSTIPQ 350 + L LKL LN+ KTK+ H +++ FLG K+++ G++ VS+ PQ Sbjct: 474 KEFLHDILKLDLNLTKTKLTHTMSEKAYFLGTWVKIIPVSGFQIKKNKSGKITRVSSRPQ 533 >UniRef50_Q024N3 RNA-directed DNA polymerase n=6 Tax=Bacteria RepID=Q024N3_SOLUE Length = 435 Score = 288 bits (736), Expect = 3e-76, Method: Composition-based stats. 
Identities = 119/387 (30%), Positives = 176/387 (45%), Gaps = 49/387 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML---Q 57 +++KL A +P R L I + + L A + GA PGVDGV + Sbjct: 8 LRQKLGQKAKQEPKFRFYALYDRIWRKDVLETAWERVRQNDGA--PGVDGVTIEEIMKTD 65 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 +A L+ + + L Y+P +RVYI K NGKLRPLGIP +RDR+VQ A L+ +EPI Sbjct: 66 QGVAGFLEGIENSLRRKTYRPEAVQRVYIEKENGKLRPLGIPTVRDRVVQMATLLILEPI 125 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E+DF SYGFRP RS H A+ ++ + E + V + DL YFD++ H L+ Sbjct: 126 FEADFLDCSYGFRPGRSAHQALEEIRGHV----EAGYQAVYDADLKGYFDSIPHTQLLAC 181 Query: 178 VRRRISDARFMTLLWKTIKAGHID------VGLFRAASEGVPQGGVISPLLSNIMLNEFD 231 VR R+ D + L+ ++A ++ + +G PQGGV SPLL+N+ L+ FD Sbjct: 182 VRMRVVDRSVLKLIRMWLEAPVVEREEGGGGSKWSRPEKGTPQGGVASPLLANLYLHWFD 241 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 + G K RYADDFV++ K Sbjct: 242 ALFYGPEGPG--------------------------GKADAKLVRYADDFVVMA---KQM 272 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVND---GFIFLGHRLIRKRSRYG-EMRVVST 347 E LEG +L +N +KT++ + + FL H R R G + + ++ Sbjct: 273 GTETIEFIESRLEGKFQLEINREKTRVVDLREEGASLDFLSHTFRRDRDLKGRDRKYLNV 332 Query: 348 IPQEKA-RNFAASLTALLWKVRISGEI 373 P KA R L + + I Sbjct: 333 FPSAKAVRKERRKLHEMTDSHQCFKPI 359 >UniRef50_A3WXE2 Putative reverse transcriptase n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3WXE2_9BRAD Length = 486 Score = 287 bits (734), Expect = 5e-76, Method: Composition-based stats. 
Identities = 118/333 (35%), Positives = 176/333 (52%), Gaps = 15/333 (4%) Query: 49 DGVNKTMLQARLAVE--LQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRDR 104 D V ++ R+ VE L R LL G Y+P RRV IPK G RPLG+P ++DR Sbjct: 2 DKVTVKHIETRIGVERFLTTTRTMLLDGSYRPQAVRRVMIPKRGRPGLFRPLGVPTVQDR 61 Query: 105 IVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT--------DCGETRGRW 156 +VQ A+L +EPI+E+ F T+SYGFRP+R+ A+ ++ + D +W Sbjct: 62 VVQAALLQLLEPIFEAVFLTVSYGFRPKRACRDALEHIRNAIRPTGQKTETDWPRPPYQW 121 Query: 157 VIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGG 216 VIEGD+ FD + H +M +RRR+SD R L+ +KAG + G G PQGG Sbjct: 122 VIEGDIKRCFDNIDHHHVMTCLRRRVSDRRVTRLVRAFLKAGVLSEGSLVRTKAGTPQGG 181 Query: 217 VISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCR 276 V+SPLL+N+ML+ ++ + +Y+ + +D + R E + R Sbjct: 182 VLSPLLANVMLDGIERR-YAKYVVPRLTRDGKPYARPGNELRKFRHYERKAGRVVFLPIR 240 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLI-RK 335 YADDFV++V GT+ Q A +E L +KL L +KT + + +GF FLGHR+ R Sbjct: 241 YADDFVVLVNGTEEQARAEKEALAVFLREEMKLTLAPEKTHVTSLTEGFEFLGHRVRLRW 300 Query: 336 RSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 R+G V IP+ + ++F + L + R Sbjct: 301 DDRWGYWPRVE-IPKARVKDFHHRIKQLTTRGR 332 >UniRef50_A9AUN7 RNA-directed DNA polymerase n=4 Tax=Bacteria RepID=A9AUN7_HERA2 Length = 595 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. 
Identities = 108/354 (30%), Positives = 173/354 (48%), Gaps = 35/354 (9%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 R+ R++ + A + +GA T G + ++ + +L Y+ Sbjct: 23 RVYRILFNEDLYLRAYGRLATKQGALTKG---STDETIDGMSMAKIHRIIADLRRETYRW 79 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRVYIPK+ GK RPLG+P D++VQ + ++ ++ S+GFRP R H A Sbjct: 80 TPVRRVYIPKATGKTRPLGVPTWSDKLVQEVLRSILDAYYDPQMSDHSHGFRPNRGCHTA 139 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 ++ ++ T RW IEGD++ YFDT++H L+ + +RI D RF+ L+ ++AG Sbjct: 140 LKAIQR-----CWTGTRWFIEGDIAQYFDTINHTTLLTILAKRIHDGRFLRLIQTLLQAG 194 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE----RYLSGKARKDRWYWNNSI 254 ++ ++ G PQGGVISPLL+NI L+EFDQ++ Y G+ RK + Sbjct: 195 YLHDWVYHPTLSGTPQGGVISPLLANIYLHEFDQFVEHTLIPAYTKGQRRKVNPAYAQME 254 Query: 255 QRGRSTAVRENWQW--------------------KPAVAYCRYADDFVLIVKGTKAQVEA 294 QR + + + Y RYADDF+L GTK + EA Sbjct: 255 QRISKLRRQREYASVTPLLKELRTLPSRDVHDPDYRRLRYVRYADDFLLGFAGTKVEAEA 314 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLI--RKRSRYGEMRVV 345 I+++ L L+L+L+ KT I H +D FLG+ ++ + S+ R + Sbjct: 315 IKQQINVWLYDHLQLKLSTQKTLITHASSDPAHFLGYDIVTQQANSKQTGNRRI 368 >UniRef50_Q0S063 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Actinomycetales RepID=Q0S063_RHOSR Length = 459 Score = 286 bits (732), Expect = 9e-76, Method: Composition-based stats. 
Identities = 113/379 (29%), Positives = 170/379 (44%), Gaps = 50/379 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q L A DP R L+ +++ + L + GA PG+D + ++ Sbjct: 22 LQHALYRAAKVDPGRRFHALMDKVSRRDVLWRGWVAVRRNNGA--PGIDRITLEEVEEYG 79 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIW 118 +A L L EL G Y+PLPARRV+IPK + RPL IP++RDRIVQ A + EP++ Sbjct: 80 VARLLDELAVELKEGSYRPLPARRVFIPKPGTVEQRPLSIPSVRDRIVQAAWKLVAEPVF 139 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF S+GFRP R H A++ L D RWV+E D+++ F+ + LM+AV Sbjct: 140 EADFLPCSFGFRPRRGAHDALQV----LIDESWRGCRWVVETDIANCFEAIPIEKLMQAV 195 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 R+ D F+ LL ++AG ++ G R G PQGGV S LL N+ L+ D+ Sbjct: 196 EERVCDQPFLKLLRVMLRAGVMEEGQVRRPVTGTPQGGVASALLCNVYLHRLDR------ 249 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 W RYADD +++ + ++ Q EA Sbjct: 250 --------------------------AWDVDEHGVLVRYADDALVMCR-SRRQAEAALTR 282 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVN---DGFIFLGHRLI-----RKRSRYGEMRVVSTIPQ 350 R +L L L KT+I H+ +G FLG + R + Sbjct: 283 LRELLAD-LGLEPKEAKTRIVHLRVGGEGVDFLGFHHRLVNAPARPGRRPFPFLARWPAD 341 Query: 351 EKARNFAASLTALLWKVRI 369 + R+ + L + R+ Sbjct: 342 KAVRHARERIRELTDRSRL 360 >UniRef50_C6J6N9 RNA-directed DNA polymerase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J6N9_9BACL Length = 430 Score = 286 bits (731), Expect = 1e-75, Method: Composition-based stats. 
Identities = 121/366 (33%), Positives = 177/366 (48%), Gaps = 42/366 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +Q KL A + R L + + + L EA R +++G+ GVDG ++ + Sbjct: 16 LQGKLGHAAKENKKRRFHALYDKVYRVDILWEAWRRVRANEGS--AGVDGETLADIEKQG 73 Query: 61 AVE-LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + + + L G Y P P RR YIPK +GKLRPLGIP +RDR++Q A + MEPI+E Sbjct: 74 EMRFVLECQRLLKEGKYHPQPVRRHYIPKKDGKLRPLGIPTVRDRVIQMATKLVMEPIFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF S+GFRP+RS A+ ++ +G WV++ D+ YFD ++ LMK + Sbjct: 134 ADFQDTSFGFRPKRSAKQALERIRKACNR----KGNWVVDVDIQGYFDNINQEKLMKLIE 189 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RISD R + L+ K + AG ++ G R + G PQGGVISPLL+NI LN FD Sbjct: 190 MRISDRRILKLVRKWLGAGVMEEGNIRRSDLGTPQGGVISPLLANIYLNYFDLLWERH-- 247 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 RYADD V+I K TK + E Sbjct: 248 ----------------------------GGKLGELTRYADDLVIICK-TKKDAQRAYELI 278 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 R ++E L+L L+ KT+I + +GF FLG + ++ + +V T Q R Sbjct: 279 RAIME-RLELTLHPTKTRIVGLWTGEEGFDFLGMHHRKTKAETSKGQVYYTTQQWLCRKA 337 Query: 357 AASLTA 362 + Sbjct: 338 EERIRE 343 >UniRef50_A9IAY4 Reverse transcriptase n=38 Tax=Bacteria RepID=A9IAY4_BORPD Length = 572 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. 
Identities = 105/373 (28%), Positives = 177/373 (47%), Gaps = 39/373 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++ L ++T A A R ++G TPGVDG+ + +A+ Sbjct: 34 LQARIVKATREGKHGKVKALQWILTHSFSGKALAVRRVTENQGKKTPGVDGITWSTPEAK 93 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L + Y+P P +RVYIPK+NGK+RPLGIP ++DR +Q L+A+EP+ E Sbjct: 94 SQAML-----SIKRRGYRPQPLKRVYIPKANGKMRPLGIPTMKDRAMQALYLLALEPVAE 148 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + S+GFRPERS AI QL + +W++EGD+ FD + H LM + Sbjct: 149 TTADRSSFGFRPERSTADAIGLCFTQL--ALKRSPKWILEGDIKGCFDNISHDWLMGHI- 205 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +KAG+++ G PQGG+ISP L+N++L+ + L + Sbjct: 206 -----PTDREILSKWLKAGYMEDRQLFPTEAGTPQGGIISPTLANLVLDGLEAKLEAVF- 259 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 GR+ + Q + AV Y RYADDF++ + + + + Sbjct: 260 -----------------GRARYINGK-QTRLAVNYVRYADDFIVTARSKELLEQEVMPLV 301 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L L+ +KTKI H+++GF FLG + + + +++ + F Sbjct: 302 EEFMRER-GLTLSPEKTKITHIDEGFDFLGQNIRKY-----DGKLLIKPSKANVATFLGK 355 Query: 360 LTALLWKVRISGE 372 + A + + + Sbjct: 356 VRAAIKGNKAVNQ 368 >UniRef50_C5CID2 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CID2_KOSOT Length = 423 Score = 285 bits (728), Expect = 3e-75, Method: Composition-based stats. 
Identities = 118/362 (32%), Positives = 183/362 (50%), Gaps = 51/362 (14%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ + LA+A + GA PG+DGV L ++ L ++L G Sbjct: 2 RKYYSLIDKVYLESNLAKAYHKVRRNNGA--PGIDGVTVQEYGENLLERIKKLSEKLRKG 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P +RV IPK NGK R LGIP + DRIVQ+++ MEPI+E FH SYG+R R+ Sbjct: 60 EYRPSPVKRVEIPKGNGKTRMLGIPTVEDRIVQQSLKEIMEPIFEEGFHPSSYGYRKGRN 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + + ++V++ DLS FDT+ H ++ AV RISD + + L+ Sbjct: 120 PHQAVEKAYAF---ACKYKMKYVVQLDLSQCFDTLDHEKMIDAVAERISDGKILRLIRSF 176 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +K+G I ++ + G PQGGVISPLL+NI LN+FDQ + R Sbjct: 177 LKSGVITD-QYQPSEMGSPQGGVISPLLANIYLNKFDQKMMAR----------------- 218 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 + RYADD ++ K K+ + ++ R +LE LKL++N + Sbjct: 219 ----------------GIRIVRYADDILIFAKSYKSAEKYLKIAIR-ILEKELKLKVNKE 261 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS--GE 372 KT+I ++DG FLG + + + R I ++K + F A + L + + + GE Sbjct: 262 KTRITTIDDGIEFLGFTIQKGKIR---------IQEKKIKRFKAKVKTLTRRNQCTPIGE 312 Query: 373 IL 374 I+ Sbjct: 313 II 314 >UniRef50_O47500 RT-like protein n=1 Tax=Venturia inaequalis RepID=O47500_VENIN Length = 760 Score = 284 bits (726), Expect = 4e-75, Method: Composition-based stats. 
Identities = 119/396 (30%), Positives = 199/396 (50%), Gaps = 38/396 (9%) Query: 3 RKLATWAATDPSLRIQR-LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 KL+ + ++P+ I R L L T + L A S G T GV L Sbjct: 245 NKLSIRSKSNPNSIIDRELYTLATSVDTLIYAYENIKSKPGNMTQGV---LPETLDGISR 301 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 +L L D L S + P+RR+ IPK++G RPL I + D+IVQ AM + +E I++ Sbjct: 302 EKLTKLSDSLRSEKFSFSPSRRIQIPKASGGSRPLSIASPMDKIVQEAMRLVLEAIYDPV 361 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP +S H A+++V + +WVIEGDL+ +FD++ H LMK V + Sbjct: 362 FLDCSHGFRPNKSCHTALKSVSQEF-----QPVQWVIEGDLAKFFDSISHSKLMKLVESK 416 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFD---QYLHERY 238 I+D RF L+WK + AG+ + ++++ G PQG ++SP+L+NI L++ D L + Sbjct: 417 ITDRRFTNLIWKALTAGYFEFKIYKSNIVGTPQGSIVSPILANIFLHQLDLFVNCLKRDF 476 Query: 239 LSG----KARKDRWYWNNSIQ----------------RGRSTAVRENWQWKPAVAYCRYA 278 G +++ R+Y ++++ R ++ ++ + + Y RYA Sbjct: 477 DKGTRAPRSKSSRYYEYHTLKARKAGDTLQLQKLIAERSQNPSIDFGSESFKRLVYVRYA 536 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIR-KR 336 DD+++ ++GT+ Q + I + R S+ L L+ KTK+ + + +FLG + R Sbjct: 537 DDWIIGIRGTREQAKYILTKVREFCT-SIDLELSEHKTKLTSLHSQPILFLGTSISRSSH 595 Query: 337 SRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGE 372 RY + V I + K L A L +++ E Sbjct: 596 VRYSRIGSVRRIRRNK---LGLRLEAPLDRIKKKLE 628 >UniRef50_D2CJC8 Putative uncharacterized protein orf2 (Fragment) n=1 Tax=Candida sojae RepID=D2CJC8_9ASCO Length = 773 Score = 283 bits (725), Expect = 6e-75, Method: Composition-based stats. 
Identities = 118/402 (29%), Positives = 187/402 (46%), Gaps = 45/402 (11%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 KL P + + +I+ PE+L A + S G TPG NK L + Sbjct: 182 KLLKEGLNKPDEVYKNIRPIISDPEFLMYAYSLIKSKPGNMTPGT---NKETLDGITSET 238 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 +++ E+ SG Y+ P RR+ IPK+ G +RPL I + RD+IVQ AM + +E I+E Sbjct: 239 FKVMGREIGSGAYKFRPNRRIEIPKAKGGIRPLSIASPRDKIVQMAMKIILEAIFEPHMS 298 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S+GFRP RS H A+ ++ + W IE D++ FDT+ L++K V +RI Sbjct: 299 DFSHGFRPNRSTHTALYQLRGIFHEV-----SWFIEADITKCFDTLPQDLIIKEVEKRIK 353 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLS 240 D F+ L+ K AG+I+ F+ S G PQG +ISP+L NI+L D++L ER+ Sbjct: 354 DQVFLDLIHKCFNAGYIENN-FKIPSAGTPQGSIISPILCNILLTVMDEWLMEYSERFSV 412 Query: 241 GKARKDRWYWNN-------------------SIQRGRSTAVRENWQWKPAVAYCRYADDF 281 G R+ + I R + ++ N + + RYADDF Sbjct: 413 GTRRRANPVYTKLVRGINKASLLNQKINIRAQIHRDKKRSLLGNDPNFKRMRFVRYADDF 472 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRL-------- 332 ++ V G+ I+++ L+ L++ L+ DKT I D FLG + Sbjct: 473 IIGVIGSYQDSCKIKQDLTNFLKDRLRVELSQDKTLITSATKDKAHFLGFDIAITPYEKR 532 Query: 333 --IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGE 372 + + G R+V+ + A +T ++ K+ G Sbjct: 533 QLMWVKRADGSTRLVAQTSRP---QILAPITKIVAKLGDKGF 571 >UniRef50_C9B0U1 RNA directed DNA polymerase n=2 Tax=Enterococcus casseliflavus RepID=C9B0U1_ENTCA Length = 620 Score = 283 bits (725), Expect = 6e-75, Method: Composition-based stats. 
Identities = 113/365 (30%), Positives = 186/365 (50%), Gaps = 12/365 (3%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 + + + L+ ++T P + A R + G+ +PGVD N L+A +E Sbjct: 26 YKESSENKIFSNLMEIVTSPNNILLAFRNVKGNSGSTSPGVDKKNIDDLKAIPNIEFIKT 85 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 + Y+P P +RV IPK NGK RPLGIP + DRI+Q+ +L +EPI E+ FH +Y Sbjct: 86 V-QTKFSEYKPQPVKRVDIPKPNGKTRPLGIPTIWDRIIQQCLLQVLEPIMEAKFHDKNY 144 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHH-RLLMKAVRRRISDAR 186 GFRP RS HHA + ++ +V++ D+ +FD V+H +L+ + + D Sbjct: 145 GFRPNRSAHHAFAQA---VRMAQLSKLTFVVDIDIEGFFDNVNHSKLIKQFWTLGVRDKW 201 Query: 187 FMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 + ++ +KA I G +G PQGGV+SPLL+N++LNE D ++ ++ + RK Sbjct: 202 LLGVIRAMLKAPIIHRDGRIEHPKKGTPQGGVLSPLLANVVLNELDWWISSQWETHPTRK 261 Query: 246 DRWYWNNS-IQRGRSTAVRENWQWKPAVAY-CRYADDFVLIVKGTKAQVEAIREECRGVL 303 + ++ + QR +S R K Y RYADDF + + ++ + I + L Sbjct: 262 NYDCYHQTRKQRIKSNKYRALRASKLKEIYIVRYADDFKIFCR-KRSDADKIFLATKLWL 320 Query: 304 EGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 + LKL ++ +K+K+ ++ FLG + R R V+ T KA + A++ A Sbjct: 321 KDRLKLDISAEKSKVVNLKKQKSDFLGFTMKLVRKRKS--FVIETHMCAKAMSAASNKLA 378 Query: 363 LLWKV 367 KV Sbjct: 379 KQIKV 383 >UniRef50_B9M2H7 RNA-directed DNA polymerase (Reverse transcriptase) n=7 Tax=Bacteria RepID=B9M2H7_GEOSF Length = 446 Score = 283 bits (724), Expect = 6e-75, Method: Composition-based stats. 
Identities = 119/362 (32%), Positives = 175/362 (48%), Gaps = 46/362 (12%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 DP LL I PE + A + ++KGA PGVDGVN +R L Sbjct: 20 DPQTPENHLLERILSPENMELAWKRVRANKGA--PGVDGVNIDDFPDITRPLWGDIRASL 77 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 +G Y P P RV IPK G RPLGIP + DR++Q+++ + PI++ F S+GFRP Sbjct: 78 ATGSYLPKPVLRVEIPKPTGGNRPLGIPTVLDRLIQQSIAQVLTPIFDPGFSESSFGFRP 137 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 RS H A+R ++ L R ++ DL+ +FDTV+H LLM V R++ D R + L+ Sbjct: 138 GRSAHDAVRQLREYL----RQGYRIAVDIDLAKFFDTVNHDLLMTFVGRKVRDKRVLALI 193 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 + ++AG G GVPQGG +SPLL+NI+L+ D+ L +R Sbjct: 194 GRYLRAGVEVDGRLEKTRMGVPQGGPLSPLLANILLDHLDKELEKR-------------- 239 Query: 252 NSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 + RYADDFV++VK +A E + R L LKL + Sbjct: 240 -------------------GHKFVRYADDFVILVKSERAG-ERVMGSVRKHLTTKLKLTV 279 Query: 312 NMDKTKIPHVND----GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 N DK+K+ + GF+F G +++ Y E R + + R++ S+ L K+ Sbjct: 280 NEDKSKVAKSDQISFLGFVFKGTKILWSDKAYKEFR--RRVRKYTGRSWFVSMEYRLNKL 337 Query: 368 RI 369 Sbjct: 338 ST 339 >UniRef50_C7RV41 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RV41_9PROT Length = 453 Score = 283 bits (723), Expect = 1e-74, Method: Composition-based stats. 
Identities = 106/349 (30%), Positives = 168/349 (48%), Gaps = 50/349 (14%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 + L+ + P L +A R S++GA PG+DG+ A +R L G YQ Sbjct: 36 KDLMEAVLSPANLKQAWRRVKSNRGA--PGIDGLRIEDFPAYACEHWPAIRQTLSEGRYQ 93 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P RRV IPK NG R LGIP + DR+VQ+A+ M PI++ +F SYGFRP RS H Sbjct: 94 PQAVRRVIIPKPNGGERALGIPTVVDRVVQQAIAQIMTPIFDPEFSESSYGFRPRRSAHG 153 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A++ V+ L + R ++ DL+ +FD V H +LM V R++SD R + L+ + ++A Sbjct: 154 ALKQVRADL----KAGYRIAVDLDLAKFFDNVDHDILMARVARKVSDKRLLALIGRYLRA 209 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G + + + G PQGG +SPLL+NI+L++ D+ L R Sbjct: 210 GVMIGSTLQPSELGTPQGGPLSPLLANILLDDLDRTLEGR-------------------- 249 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD +++VK +A + ++ L LKL +N K++ Sbjct: 250 -------------GHRFARYADDLMVLVKSERAG-QRVKASLTAYLGRQLKLPVNEKKSQ 295 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 + + + +FLG + + R+ + +F L L + Sbjct: 296 VAKI-EQCVFLGFTFRKNKLRWSD---------AAFADFKHRLRELTGR 334 >UniRef50_B1N1A3 NicA n=1 Tax=Pseudomonas putida RepID=B1N1A3_PSEPU Length = 618 Score = 281 bits (720), Expect = 2e-74, Method: Composition-based stats. 
Identities = 110/384 (28%), Positives = 176/384 (45%), Gaps = 45/384 (11%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 A +DPS R+ RL+ + + A S G TPG DG R ++ Sbjct: 15 RKANSDPSYVNDRIYRLMYKEDLYIAAYEKIKSKPGNMTPGQDGTTLDEFSIRT---IRN 71 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + +++ + ARRV IPK+NGK RPL + D++VQ + +E I+E F S Sbjct: 72 IINKMKDESFTFRGARRVLIPKANGKTRPLSVAPPTDKVVQEVIRSILEAIYEPTFSKNS 131 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFR +S H A++ V+ + WVIEGD+ FD + H L+ +R RI D R Sbjct: 132 HGFRAGKSCHTALKQVRESWSGV-----TWVIEGDIKGCFDNISHSKLIDQLRLRIKDER 186 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFD----QYLHERYLSGK 242 F+ L+ K + AG+ + G F +A+ G PQG +ISP+L+N+ L++ D Q + + + + Sbjct: 187 FINLIRKALNAGYFENGAFFSATLGTPQGSIISPILANVFLDQLDRKVEQLIKDHHQGEE 246 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPA------------------------------- 271 K +QR +++ ++ + + A Sbjct: 247 GDKITDPAYRKLQRQKTSLRKKAEKQEGAERDATLSLAREANSKLLSMSPYLTRNNGFIR 306 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGH 330 V Y RYADD+++ V G K E +R LE + L L+++KT I H + FLG Sbjct: 307 VKYVRYADDWIIGVNGPKLLAEELRSVVGEFLENA-GLELSIEKTHIRHAKSETAKFLGT 365 Query: 331 RLIRKRSRYGEMRVVSTIPQEKAR 354 L M+V+ + R Sbjct: 366 NLRIGSENSKIMKVLRNGKKFPKR 389 >UniRef50_Q3A4Z2 Group II intron-encoding maturase n=98 Tax=Bacteria RepID=Q3A4Z2_PELCD Length = 529 Score = 280 bits (717), Expect = 5e-74, Method: Composition-based stats. 
Identities = 104/346 (30%), Positives = 168/346 (48%), Gaps = 47/346 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 RL+ + + A + + +KGA G+DG+ L+ L + +++ELL+G YQP Sbjct: 110 RLMEEVVSRGNMMAAYQRVVRNKGA--AGIDGMPVGDLKTYLQEQWPRIKEELLTGTYQP 167 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P R+V IPK G +R LGIP + DR++Q+A+ + ++E +F SYGFRP RS H A Sbjct: 168 QPVRKVEIPKPGGGMRMLGIPTVLDRLIQQALHQELMRLFEPEFSEHSYGFRPGRSAHQA 227 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 +++ + + + RW ++ DL +FD + H +LM V R++ D R + L+ + + G Sbjct: 228 VQSARRHVA----SGRRWAVDIDLEKFFDRMGHDILMSRVARKVKDRRVLGLIRRYLTVG 283 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 ++ G+ +G PQGG +SPLLSNI+L+EFD+ L R Sbjct: 284 VLEGGIISPRVQGTPQGGPLSPLLSNILLDEFDKELERR--------------------- 322 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 A+CRYADD + V +A E + LE LKL++N K+ + Sbjct: 323 ------------GHAFCRYADDCNIYVHSRRA-AERVMTSLTRFLEQQLKLKVNRVKSAV 369 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 + FLG+ S + + + F SL + Sbjct: 370 GRPWER-TFLGY------SMTSHKKPRLKVAGSSVKRFKTSLREIF 408 >UniRef50_Q02717 Reverse transcriptase homologue COI ialpha grp II protein (Fragment) n=3 Tax=Podospora anserina RepID=Q02717_PODAN Length = 788 Score = 280 bits (715), Expect = 7e-74, Method: Composition-based stats. 
Identities = 117/369 (31%), Positives = 180/369 (48%), Gaps = 49/369 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 +I + E L A S G TP VD L + + ++L S ++ P Sbjct: 186 YEVICKLEALYTAYMNIKSEPGNMTPRVD---SETLDGISKEWFEKISEQLKSEQFRFRP 242 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 RRVYIPK+NGK+RPLGI + RD+IVQ +E + E FH+ S+GFRP R H A+ Sbjct: 243 TRRVYIPKANGKMRPLGIASPRDKIVQEVFRAILEQVLEPRFHSSSHGFRPGRGCHSALA 302 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 T++ +W IEGD+ +FD + H +L K + + D RF+ L WK +KAG++ Sbjct: 303 TIRYW------NGIKWFIEGDIKGFFDNIDHHILEKLLVKHFQDQRFIDLYWKMVKAGYV 356 Query: 201 DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------HERYLSGK-ARKDRWYWNN 252 + +++ GVPQGG+ SP+LSN++LNE D+++ +E+ GK K+ Y Sbjct: 357 EFDKDKSSIIGVPQGGIASPILSNLVLNELDEFVQNIVDEFNEKLKGGKHTSKNPAYVVI 416 Query: 253 SIQRGRSTAVRENWQWKP-------------------------------AVAYCRYADDF 281 + G+ T + + K + Y RYADD+ Sbjct: 417 DSRIGKITRLERKLKSKGQELDSGRKLERMKLIKVRATMPSMIPNPDLAKIYYVRYADDW 476 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYG 340 ++ V G+ AI+E L+ LKL L+M+KT I + ++ FLG + R S G Sbjct: 477 LIGVAGSSETARAIKERIAAYLKDILKLELSMEKTLITNASEDKAYFLGTEIQRISSVKG 536 Query: 341 EMRVVSTIP 349 E++ I Sbjct: 537 EIKRFKNIK 545 >UniRef50_A6YEC9 Putative reverse transcriptase and intron maturase n=1 Tax=Chlorokybus atmophyticus RepID=A6YEC9_CHLAT Length = 755 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. 
Identities = 110/390 (28%), Positives = 185/390 (47%), Gaps = 45/390 (11%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +L+ I P+ L A + S G T GV + L V + EL +G Y+ Sbjct: 157 KLIHAIAHPDLLWLAYELIKSKPGNMTRGV---STETLDGLSRVWIDKTSSELRAGKYRF 213 Query: 79 LPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 ARR+ IPK RPL + + R+++VQ+A+ M ++ ++E F S+GFRP R H Sbjct: 214 GLARRIMIPKVGKPGERPLTMASFREKVVQKAIQMVLQELFEPRFLNTSHGFRPGRGCHT 273 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A++ V G+WVIE D++ FDT+ H L+ +RR I+ ++ + L+ +KA Sbjct: 274 ALQMVDQHFR-----GGKWVIEADITKCFDTIPHDKLLAVLRRHITCSKTLALIHSGLKA 328 Query: 198 GHIDVGLFRAASE-GVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDRWYWNN- 252 G++ +G G PQG ++SPLL+NI L+ DQ++ ++ GK R+ + Sbjct: 329 GYVVLGKASQEQMVGTPQGSILSPLLNNIFLHLLDQFMERLSAKHTLGKTRRKNPEYRKL 388 Query: 253 ----SIQRGRSTAVRENWQWK--------------------PAVAYCRYADDFVLIVKGT 288 S +G + A+ + + K +AY RYAD F++ V G Sbjct: 389 QSELSKNKGDADAMSKLRRRKLPRSGRIWLMQSKDQMDPGFRRLAYVRYADHFLICVTGP 448 Query: 289 KAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTI 348 + E+ R L+ L L LN KT I +DG FLG + + +++++ Sbjct: 449 HQLAVDVMEQVRTFLDKELGLELNQSKTLITKFSDGINFLGGIITNRTVSEKPIKLMAAG 508 Query: 349 PQE------KAR-NFAASLTALLWKVRISG 371 P + R +F A + L+ ++++ G Sbjct: 509 PAKGHLVRVSPRLSFHAPIAKLIDRLQLRG 538 >UniRef50_B7C9E4 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C9E4_9FIRM Length = 623 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. 
Identities = 114/389 (29%), Positives = 184/389 (47%), Gaps = 25/389 (6%) Query: 1 MQRKL-ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ+ + +A + L+ +I+ P + A R + G+HT G DG L Sbjct: 19 MQKTFDSLYADSKSGEVFGHLMDIISAPSNIKLAFRNIKGNDGSHTAGTDGRTIESLAVM 78 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L + Y+P +RV IPK NGK+RPLGIP + DRIVQ+ +L MEPI E Sbjct: 79 PEDKFVKLIQK-QFRRYEPKAVKRVEIPKPNGKMRPLGIPCIIDRIVQQCILQVMEPICE 137 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + F+ SYGFRP RS +AI + +V++ D+ +FD V HR L+K + Sbjct: 138 AKFYEHSYGFRPCRSAENAISYAY---GLAQRNKLHYVVDVDVKGFFDNVDHRKLLKQIW 194 Query: 180 R-RISDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 I D + + ++ +KA + G S+G PQGG++SPLL+NI+LNE D ++ + Sbjct: 195 TLGIRDTKLIQIIKAMLKAPIEMPDGENVLPSKGTPQGGILSPLLANIVLNELDWWIASQ 254 Query: 238 YLSGKAR-----KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQV 292 + K +Y N + ++ S + K + RYADDF + + TK Sbjct: 255 WDEMVRHMKHPCKVTYYPNGAEKKCNSYTALKKSNLK-EMRIVRYADDFKIFCR-TKEDA 312 Query: 293 EAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEM-------RV 344 E + L LKL ++ +K+K+ ++ FLG R+ +R + + Sbjct: 313 EKTYYAVKDWLWKRLKLEVSDEKSKVTNLRKRDSEFLGFRIKLRRKSNSWVITSNVCDKA 372 Query: 345 VSTIPQE---KARNFAASLTALLWKVRIS 370 + I +E + +S TA + + Sbjct: 373 IHRISKEMADSVKAIQSSKTAEEISLNVG 401 >UniRef50_Q74P60 Group II intron reverse transcriptase/maturase n=13 Tax=Bacilli RepID=Q74P60_BACC1 Length = 627 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. 
Identities = 103/371 (27%), Positives = 187/371 (50%), Gaps = 12/371 (3%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 +A + ++L+++IT + A R + G+ T G+DGV ++ + + Sbjct: 30 YAKSLNGGNFKQLMKVITSESNILLAFRNIKRNSGSITEGIDGVTIKDVEKLSQEDFIKI 89 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 + +Y P RRV IPK NGK RPLGIP++ DRI Q+ + +EPI E+ F+ S+ Sbjct: 90 VQK-RFSNYTPRKVRRVEIPKPNGKTRPLGIPSMWDRIAQQCIKQVLEPICEAKFNKHSH 148 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR-RRISDAR 186 GFRP RS A+ L++ + ++V+ D+ +FD V+H+ LM+ + I D + Sbjct: 149 GFRPNRSPETAMADATLRVN---RSHMQYVVNVDIQGFFDEVNHKKLMRQLWTMGIRDKQ 205 Query: 187 FMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 + ++ K +KA + G + ++G PQGG++SPLL+NI LNEFD ++ ++ ++ Sbjct: 206 LLVIIRKMLKAPIVLPNGEMQYPNKGTPQGGILSPLLANINLNEFDWWITNQWEDRLLKE 265 Query: 246 DRWYWNNSIQRGRSTAVRENWQWK--PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + + + + RYADDF + TK+ + I + C L Sbjct: 266 LSLTIKKGGHVDKYPHYSKMRKTTALKEMYIVRYADDFKIFTA-TKSNAQKIFKACEMWL 324 Query: 304 EGSLKLRLNMDKTKIPHV-NDGFIFLGHRL--IRKRSRYGEMRVVSTIPQEKARNFAASL 360 + LKL ++ +K+KI ++ + FLG + ++K S+ +S ++K + Sbjct: 325 QERLKLPISKEKSKITNLRKESSEFLGFEIKMVKKGSKLIARTHISNKTKKKIQKQFEDQ 384 Query: 361 TALLWKVRISG 371 A++ + + G Sbjct: 385 IAVIQRSKNEG 395 >UniRef50_Q9MD87 Putative maturase n=1 Tax=Cryphonectria parasitica RepID=Q9MD87_CRYPA Length = 778 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. 
Identities = 119/375 (31%), Positives = 189/375 (50%), Gaps = 38/375 (10%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 RKL + + + +L ++ P +L +G T G D L Sbjct: 197 RKLLDQSKI-ANQKYYNILNVLADPNFLIACYDEIKGKQGNMTRGYDKATLDGLDYNW-- 253 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 EL +G Y P+RRV IPK+NGK RPLG+ + RD+IVQ+A+ +E I+E F Sbjct: 254 -FVKTAGELKAGKYNFKPSRRVEIPKANGKTRPLGVGSPRDKIVQKALHAILEAIFEPLF 312 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP RS H A+ V L + WVI+GD++ FD++ H +++K + +I Sbjct: 313 LPSSHGFRPNRSTHSALLKVYL-----SGNKHNWVIQGDITKCFDSIPHSIILKRIGAQI 367 Query: 183 SDARFMTLLWKTIKAGHIDV--GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ER 237 D +++ L+ K ++AGHID G + G PQGG++SP+LSNI+L+EFD+Y+ E Sbjct: 368 GDKKYLNLISKYLEAGHIDPKTGTKVVLNYGTPQGGILSPILSNIVLHEFDKYMAKLSES 427 Query: 238 YLSGKARKDRWYWNNSI-QRGRSTAVRENWQW----------------KPAVAYCRYADD 280 + GK R+ + + +RGR+ ++ E + Y RYADD Sbjct: 428 FHKGKKRRWNPAYKRLLARRGRTKSLEEKQTLLKQMRTMRSIDAFDPNFRRLDYVRYADD 487 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKR--- 336 FV+ + G+ IR + L+ + L LN+DKT I ++ + + FLG L + + Sbjct: 488 FVVFISGSSKDALFIRNNLKDYLKVNCGLELNVDKTAISNLATEKWKFLGAELSKIKLNA 547 Query: 337 ---SRYGEMRVVSTI 348 +G R++ T Sbjct: 548 NWLVSHGRKRIIGTP 562 >UniRef50_A1T776 RNA-directed DNA polymerase n=1 Tax=Mycobacterium vanbaalenii PYR-1 RepID=A1T776_MYCVP Length = 454 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. 
Identities = 118/384 (30%), Positives = 174/384 (45%), Gaps = 51/384 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +QR L A R L + + + L EA ++GA GVD V ++ Sbjct: 30 LQRTLWAAAKQSQGRRFHALYDRVYRGDVLWEAWERVRKNRGA--AGVDRVTLVAVEEYG 87 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L+ LR +L G Y P PARRV IPK G RPLGIP +RDR+ Q A + +EPI+E Sbjct: 88 VDRMLRELRHDLREGVYCPAPARRVEIPKPRGGTRPLGIPTVRDRVAQAAAKIVLEPIFE 147 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF + SYGFRP+RS A+ +++ + ++V+E D++++F + H L+ V Sbjct: 148 ADFMSCSYGFRPKRSATQAMERLRVGFIE----GSQFVVEFDIANFFGEIDHDRLLAEVS 203 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RR+SD R + LL ++AG + G+ G PQGGVISPLL+NI L+ D L R + Sbjct: 204 RRVSDRRVLKLLRLWLQAGVMVDGVVSRTVAGTPQGGVISPLLANIYLHVLDTELARRNV 263 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 RYADD V++ + A+ Sbjct: 264 G--------------------------------ELVRYADDGVVLCRSAAQAEHALAAVG 291 Query: 300 RGVLEGSLKLRLNMDKTKIPHVN---DGFIFLGHRLI-RKRSRYGEMRVVST------IP 349 + SL LRL+ DKTK+ + +G FLG R R E R + Sbjct: 292 E--ILASLGLRLHPDKTKVVDLREGGEGLDFLGCHFRARMSGRLWEQRRIVRYYLHRWPS 349 Query: 350 QEKARNFAASLTALLWKVRISGEI 373 Q + + R+ +I Sbjct: 350 QTAMVRLREKVRERTGRNRVGFDI 373 >UniRef50_A4KVN1 Probable reverse transcriptase n=2 Tax=Sinorhizobium meliloti RepID=A4KVN1_RHIME Length = 490 Score = 278 bits (710), Expect = 3e-73, Method: Composition-based stats. 
Identities = 118/357 (33%), Positives = 182/357 (50%), Gaps = 43/357 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +LL + LA A + +KGA PG DG M +A+ + LR ELL+G Y+P Sbjct: 56 QLLEEVASEANLATALLNVVRNKGA--PGRDGQTVDMAEAKATSIIGRLRRELLNGKYRP 113 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 RRV++PK+ G R LGIP + DR+VQ+A+L +EPI+E FH S+GFRP+R H A Sbjct: 114 GDVRRVWLPKAGGGRRGLGIPNIVDRVVQQAVLQVLEPIFEPVFHDSSHGFRPKRGAHTA 173 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I L + +T +++ DL+S+FD VHH+ L+ + +R+ D R +TL+ +KA Sbjct: 174 IAEASKYLKEGYQT----IVDLDLASFFDRVHHQRLLARIAQRVKDQRIITLINLMLKAA 229 Query: 199 -HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 + G A EG PQGG +SPLLSNI+L+E D+ L R Sbjct: 230 VVMPDGTRVAPQEGTPQGGPLSPLLSNIVLDELDRELARR-------------------- 269 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + + RYADD + V+ +A + + R LE ++L++N +K+ Sbjct: 270 -------------RLRFVRYADDSNIFVRSERAG-QRVMSSIRDFLERRMRLQVNEEKSG 315 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRV-VSTIPQEKARNFAASLTALLWKVRISGEI 373 + N+ FLG R + G++ V +S +++ R +T W I+ I Sbjct: 316 MRTPNE-VHFLGFRFRCPKGEGGDVVVLLSRKAEQRLRAKVREMTPPTWGRSIASCI 371 >UniRef50_A7GTD4 RNA-directed DNA polymerase n=1 Tax=Bacillus cytotoxicus NVH 391-98 RepID=A7GTD4_BACCN Length = 543 Score = 278 bits (710), Expect = 3e-73, Method: Composition-based stats. 
Identities = 109/405 (26%), Positives = 180/405 (44%), Gaps = 61/405 (15%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 A+ I RL+ + E +A + G T G ++ + ++ + Sbjct: 13 RKASQNGKIITDCYRLMYKRELWIKAYVKLYPNAGNLTKGT---SEETIDGFYLQKIDEI 69 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 D+L G ++ P RR YI K+NGK RPLG+P +D++VQ M M +E ++E F S+ Sbjct: 70 IDQLKKGMFRFAPVRRAYISKANGKKRPLGVPNFKDKLVQEVMRMILENVYEPTFSDNSH 129 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFR RS H A+ +K W IEG + +FD + H +L+ + +R++D RF Sbjct: 130 GFREGRSCHTALSQIKNTWK-----GLTWCIEGAIKGFFDHIDHSVLINLISKRMNDHRF 184 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDR 247 + L+ + +G ++ ++ G PQGG++SPLL+NI L+EFD +L ++ K R Sbjct: 185 LLLIHNALASGVMENWTYQKTYSGTPQGGILSPLLANIYLHEFDIFLEKQIEKFDKEKLR 244 Query: 248 ------------------------WYWNNSIQRGRSTAVRENWQWKP------------- 270 + + +GR + + K Sbjct: 245 ARNKEYTKIHSEIRSLSRKVKSLDDRTGHRLWKGREKVIETIAELKRKQIGISSVNPMDN 304 Query: 271 ---AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIF 327 + Y RYADDFV+ + G+K I+E + L+ L L L+ +KT I H+ + F Sbjct: 305 DYQKMKYVRYADDFVIGIAGSKDCAVNIKETIKNFLKQELHLELSEEKTLINHLENPISF 364 Query: 328 LGHRLI------RKRSRYGEMR-------VVSTIPQEKARNFAAS 359 LG+ R R Y + + IP++K + FA Sbjct: 365 LGYEFRKWNEIKRTRVLYKNHKQRALSRAIKLEIPKKKMKEFAIK 409 >UniRef50_B4D301 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Chthoniobacter flavus Ellin428 RepID=B4D301_9BACT Length = 415 Score = 277 bits (709), Expect = 3e-73, Method: Composition-based stats. 
Identities = 111/355 (31%), Positives = 164/355 (46%), Gaps = 49/355 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 I++L+ PE A + +S+ GA PG+DG+ + L L + +R +LL+G Y Sbjct: 2 IEQLMEEAVSPENWHTAWKAVVSNGGA--PGIDGMRCSELVEHLQRHGEAIRAKLLAGRY 59 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P R IPK G R LGIP + DR VQ+ +L + PI+E F SYGFRP RS H Sbjct: 60 TPSPVLRTKIPKPGGGERDLGIPTVLDRFVQQLLLQVLTPIYEPRFSARSYGFRPGRSTH 119 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A+R + + + +VI+ D+ +FD V+H LLM +R +SD R TL+ + +K Sbjct: 120 DAVRQAQAYVKE----GKSYVIDLDIEKFFDRVNHNLLMHRLRETVSDVRVRTLIGRYLK 175 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 AG + G+ + EG PQGG +SPLL+NI L+ D L R Sbjct: 176 AGVMVNGVVQDNEEGTPQGGPLSPLLANIYLDPLDWELEGR------------------- 216 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 +AY RYADD + V + A + + G +E L+L++N K+ Sbjct: 217 --------------GLAYVRYADDCNIYV-SSAAAAQRVLSSLIGWIEKKLRLKVNQTKS 261 Query: 317 KIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 G LG + + V I Q+ + + A R G Sbjct: 262 GTGPTT-GRRLLGFSI--------NAQGVIEIAQKSLTHLQEKVRAFWRPQRHRG 307 >UniRef50_Q9T654 Cox1I1a maturase (Fragment) n=2 Tax=cellular organisms RepID=Q9T654_SCHPO Length = 787 Score = 277 bits (709), Expect = 4e-73, Method: Composition-based stats. 
Identities = 124/403 (30%), Positives = 203/403 (50%), Gaps = 50/403 (12%) Query: 5 LATWAATDPSLRIQRLLRL-ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 L+ + + + + + + A + S+KG+ T G+D + L E Sbjct: 183 LSDIQKESRNNLVSNIYKRCLLNEDLFIAAYQKISSNKGSVTAGIDKIT---LDGYSINE 239 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ ++L +Q PARR YIPK+NGKLRPLGIP+ RD+IVQ+ M+ +E I+E F Sbjct: 240 IKKTIEQLKDHSFQFKPARREYIPKANGKLRPLGIPSPRDKIVQQVMVFVLESIFEQKFL 299 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S GFRP + H A++++ G WVIEGD+ SYFD + H+ L+ + I+ Sbjct: 300 DCSNGFRPNKGTHTALKSI------AGWKALDWVIEGDIKSYFDLIDHQTLISLLSNVIN 353 Query: 184 DARFMTLLWKTIKAGHIDVGLFRA--ASEGVPQGGVISPLLSNIMLNEFDQYLHER---- 237 D F+ L WK I+AG+++V + + G PQG V+SP+L+NI L+EFD+++ E+ Sbjct: 354 DKEFIDLCWKAIRAGYVEVKMNKKIDTIIGTPQGSVLSPILANIYLHEFDKFMMEKVNLS 413 Query: 238 ---------------------YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKP------ 270 Y+ RK+ N ++ + + N Sbjct: 414 LDSGSTSKRFKPYRLLEAKINYIYQLERKNGSLTNEQVKSLKKLTIERNKLPSTIGGPGY 473 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGF-IFLG 329 + Y RYADDF++ + G + ++ E L +LKL ++++KTK+ ++ D + +FLG Sbjct: 474 RIYYVRYADDFLIGINGKRTLALQLKSEINEFLTNTLKLTMSVEKTKVTNIKDDYALFLG 533 Query: 330 HRLIRKRSRYGEMRVVSTIPQEKARNF----AASLTALLWKVR 368 + R SR +V+S Q + NF A S T+LL V+ Sbjct: 534 AEIHRLTSRNNNSKVISK--QYSSGNFNSRVANSRTSLLIPVK 574 >UniRef50_Q188V0 Group II intron reverse transcriptase/maturase n=7 Tax=Firmicutes RepID=Q188V0_CLOD6 Length = 609 Score = 277 bits (708), Expect = 5e-73, Method: Composition-based stats. 
Identities = 110/370 (29%), Positives = 190/370 (51%), Gaps = 20/370 (5%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML-QAR 59 +Q +L +A + S + L+ ITQ E + A R S+KG+ T G + + + Sbjct: 22 IQDELYQLSA-EGSHVFRDLMSYITQEENILLAYRNIKSNKGSKTAGTNKRTIIDVGEEN 80 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +Q +++ ++QP RRV IPK NGK RPLGIP + DR+VQ+ + +EPI E Sbjct: 81 PYQLVQYVQNRF--NNFQPHSIRRVEIPKPNGKTRPLGIPTIEDRLVQQCIKQILEPILE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + FH SYGFRPERS HHAI + +V++ D+ +FD V+H L+K + Sbjct: 139 AKFHKHSYGFRPERSSHHAIAIFQQW----TFKGFHYVVDIDIKGFFDNVNHGKLLKQLW 194 Query: 180 -RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +I D F+++L + +KA +G +++G PQGG++SPLL+N++LNE D ++ ++ Sbjct: 195 TMKIRDKTFISILSRMLKAEVKGIG---KSTKGTPQGGILSPLLANVVLNELDWWIDSQW 251 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 ++ S ++ ++R+ K + RYADDF ++ K + I Sbjct: 252 DGFPTKR-----KYSSLLSKTQSIRKYSNLK-EIKIVRYADDFKIMCKDYHT-AQKIFLA 304 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 + L+ L L ++ +K+K+ ++ + FLG +L K+ + S + + N Sbjct: 305 TKQWLKVRLDLDISPEKSKVTNLRKNYSDFLGFKLKVKKGKANGYTNRSRMCDKAKINAV 364 Query: 358 ASLTALLWKV 367 L + + Sbjct: 365 DKLKNNIKTI 374 >UniRef50_C0JX29 Putative reverse transcriptase and intron maturase n=1 Tax=Pyramimonas parkeae RepID=C0JX29_9CHLO Length = 608 Score = 277 bits (708), Expect = 5e-73, Method: Composition-based stats. 
Identities = 118/408 (28%), Positives = 191/408 (46%), Gaps = 63/408 (15%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + + L RL+ + L A + S G TPG D + + RL + Sbjct: 21 LKKRNCENKEAVNKDLYRLLCNKQLLTLAYNLIKSKPGNMTPGTDKLTLDKMSERL---I 77 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 +L ++ P RRV+IPK N K +RPLG+P+ RD++VQ+AML+ M+ I+E+ F Sbjct: 78 DKTSRQLRDQTFKFKPVRRVFIPKGNSKDIRPLGVPSSRDKVVQKAMLLIMDNIYETTFS 137 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 T S+GFRP RS H A++ ++ + + +W IEGD+ +D V+H++L+ +R +I Sbjct: 138 THSHGFRPGRSCHSALKEIRSEWSGI-----KWAIEGDIKGCYDNVNHQILINILREKIK 192 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE-----RY 238 D RF+ LLWK ++AG + G PQGG++SPLL+NI LNEFD+++ Sbjct: 193 DERFIQLLWKLLRAGIEVNRTIERSKIGTPQGGILSPLLANIYLNEFDKFVSNLSQKIGL 252 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENW-------------------------------- 266 K R+D ++ RG+ +R Sbjct: 253 TYNKTRRDNPEYHKI--RGKIYRLRTKRTVSGTVNVQPSKSDLKQIQILSKTQRTLPSKD 310 Query: 267 ---QWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-N 322 + + RYADD+++ V G + I+E+ + L+ L+L L+ +KTKI + Sbjct: 311 PFDPQYRKILFIRYADDWIVGVIGGHEFAQGIKEQIQTFLKEQLELTLSPEKTKITPFSS 370 Query: 323 DGFIFLGHRLI-----------RKRSRYGEMRVVSTIPQEKARNFAAS 359 FLG++L R + R + IP +K + Sbjct: 371 KKVTFLGYKLQISARSSYSSSGRHKKRTVGWQPKLYIPMDKIVRKLSE 418 >UniRef50_B1HW67 Possible group II intron reverse transcriptase/maturase n=23 Tax=Firmicutes RepID=B1HW67_LYSSC Length = 601 Score = 277 bits (708), Expect = 6e-73, Method: Composition-based stats. 
Identities = 107/368 (29%), Positives = 183/368 (49%), Gaps = 22/368 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ-ARLAVE 63 L + + + + L I + A R ++ G+ T G DG+ + + Sbjct: 20 LYERSKNNATKGL-NLYEHIISKNNILLAYRNIKANTGSKTAGTDGITIEQYKIEDVETF 78 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + +R L + Y+P RRV IPK NGK RPLGIP +RDR++Q+ +EPI E+ F+ Sbjct: 79 VDEIRATLKN--YKPQTVRRVEIPKPNGKTRPLGIPTMRDRLIQQMFKQILEPICEARFY 136 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRI 182 SYGFRP RS HHA+ + V++ D+ +FD V H L+K + I Sbjct: 137 NHSYGFRPNRSTHHAMGRCQFLANIALNQH---VVDIDIQGFFDNVSHSKLLKQMYSIGI 193 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D R ++++ K +KA +G+ ++G PQGG++SPLLSNI+LN+ D ++ ++ + K Sbjct: 194 CDKRVLSVVSKMLKAPIKGIGI---PTKGTPQGGILSPLLSNIVLNDLDWWISNQWENMK 250 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + + N + + T + + RYADDF + K K ++ + +G Sbjct: 251 TKFNYKERKNKVLMIKRTTTLKE------MYIVRYADDFKIFTKSHKNAIK-LYHAVKGY 303 Query: 303 LEGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVST-IPQEKARNFAASL 360 L+ L L ++ +K+KI ++ FLG L K + G+ V +T + +K + Sbjct: 304 LKNHLNLDISNEKSKITNLRKRASEFLGFSL--KVIKKGKRYVANTHVMDDKVKTILQKA 361 Query: 361 TALLWKVR 368 L+ +++ Sbjct: 362 RKLIHEIK 369 >UniRef50_B8R181 Putative intron-encoded reverse transcriptase n=2 Tax=Volvox carteri RepID=B8R181_VOLCA Length = 598 Score = 276 bits (707), Expect = 7e-73, Method: Composition-based stats. 
Identities = 107/361 (29%), Positives = 187/361 (51%), Gaps = 18/361 (4%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR-LAVELQILRD 69 + ++ L P +A A R +KG+ TPGVD V+ L+++ L + ++ ++ Sbjct: 29 SKRGGSLKNLYDTAFSPRNIALAFRNLKFNKGSKTPGVDDVHIGNLKSKPLNIFIRDIQK 88 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 + +Y+P RRV+IPK NGK RP+GIP L DR+ Q+ + +EPI E+ FH SY F Sbjct: 89 M--AKNYKPSLVRRVWIPKPNGKKRPIGIPTLADRLFQQCIKQVIEPICEAKFHPHSYVF 146 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHH-RLLMKAVRRRISDARFM 188 RP RS A+ L + +V++ D+ S+FDT+ H +LL + I D + + Sbjct: 147 RPNRSTSDALARA---LFLMNQNELHYVVDIDIQSFFDTIDHGKLLKQCWAIGIRDKKIL 203 Query: 189 TLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 +++ K +KA + G+ +++G PQGG++SPLLSNI LNE D + ++ + + Sbjct: 204 SIMSKMLKAEVVGEGI---STKGTPQGGILSPLLSNICLNELDWWYTSQWATFPTKYPYK 260 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 +++ + A+R N + K RYADDF + + K I R L+ L Sbjct: 261 RTSHA-----ARALRGNSKLK-EFHSVRYADDFKIFCRDYKT-AVKIFAATRLWLKDRLN 313 Query: 309 LRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 L+++ +K+ I ++ FLG L ++ G+ S + + +++ A + K+ Sbjct: 314 LQISSEKSSITNLRKKSSPFLGISLRACHNKNGKFSCRSRVLDKAMETMKSTIKAAIIKL 373 Query: 368 R 368 + Sbjct: 374 Q 374 >UniRef50_B4WUH1 Group II intron, maturase-specific domain family n=2 Tax=Cyanobacteria RepID=B4WUH1_9SYNE Length = 621 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. 
Identities = 121/369 (32%), Positives = 186/369 (50%), Gaps = 30/369 (8%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 ++R++ +++ L++L+ + L +T ++G T G+DG + Sbjct: 66 LRRRIYRATQNGQWNQVRSLMKLMLRSYSNLLLSVRHVTQENQGRQTAGLDGQTALTAEK 125 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R+ + + L+D L +Q LP +RVYIPK+NGKLRPLGIPAL +R+ Q M A+EP W Sbjct: 126 RVQL-VNRLQDHSL---WQVLPTKRVYIPKANGKLRPLGIPALENRVAQTIMKNALEPHW 181 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP RS H AI L+L +T WV++ DL FD + H ++ + Sbjct: 182 EARFEGHSYGFRPGRSCHDAIEQCFLRLRHGCDT---WVLDADLKGAFDNLSHSFILDTI 238 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 L+ + +KAG+++ +F A +G PQGG ISPLL NI LN ++ L Sbjct: 239 GLVPG----RELIKQWLKAGYVEAEMFHATPKGAPQGGSISPLLLNIALNGMEKLL---- 290 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 ++ + + P YCRYADDFV+ K TKA +EA+ Sbjct: 291 ------LSFTTTRTYQPSSKAKSQSSYKRTSPTYGYCRYADDFVVTAK-TKADIEAVVPI 343 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 + L+ L LNM+KT+I +V GF FLG + R+ + + + +EK F Sbjct: 344 LQAWLKPR-GLTLNMEKTQIVNVQQGFPFLGFSI-----RHHKGKCLCKPQKEKILAFLK 397 Query: 359 SLTALLWKV 367 + + L Sbjct: 398 RIRSWLKHN 406 >UniRef50_C2KES2 Reverse transcriptase/maturase n=14 Tax=Firmicutes RepID=C2KES2_9LACO Length = 432 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. 
Identities = 110/365 (30%), Positives = 171/365 (46%), Gaps = 51/365 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L+ I L EA R +KGA PGVD L + ++D ++ Y+P Sbjct: 3 LIEQILSQNNLKEAIRRVKINKGA--PGVDRRTVDELDSYFKKHQVEIKDAIMKMKYRPQ 60 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 RRVYIPK+NGK RPLGIP + DR++Q+A+ + I++ F SYGFRP RS H AI Sbjct: 61 AVRRVYIPKANGKKRPLGIPTVVDRVIQQAIAQVLMKIYDPHFSEHSYGFRPGRSAHDAI 120 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V L + +WV++ D+ YFDTV+H L+ +R +++D + L+ +KAG Sbjct: 121 EQVLEYLNE----GYQWVVDLDIEKYFDTVNHDKLISIIREQVNDKTTLHLIRAFLKAGV 176 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 ++ G + GVPQGG +SP+LSNI L++ D+ L +R Sbjct: 177 MEDGWVKPNKLGVPQGGPLSPILSNIYLDKMDKELEQR---------------------- 214 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + + RYADD + VK K+ + + LE L L++N KTK+ Sbjct: 215 -----------GLRFVRYADDCNIFVKSGKS-AKRVMNSISSWLERKLFLKVNATKTKVV 262 Query: 320 HVNDGFIFLGHRLIRK----RSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS----G 371 FLG + +++ G+ R + +K R A + + Sbjct: 263 RPTKS-NFLGFTFWKSGEYWQAKPGDDRKIKLY--DKMRELLCRRKAAAQPLSLVFTKIN 319 Query: 372 EILLG 376 +++ G Sbjct: 320 QVVRG 324 >UniRef50_A5VLF2 RNA-directed DNA polymerase (Reverse transcriptase) n=40 Tax=Lactobacillus RepID=A5VLF2_LACRD Length = 460 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. 
Identities = 113/356 (31%), Positives = 166/356 (46%), Gaps = 48/356 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 + L+ L +A +KGA G+D + L L L L G Y+P Sbjct: 42 IQDLVLDRNNLNQAYLRVKRNKGA--AGIDDMTVNDLLPYLRENKTELIASLREGKYKPA 99 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P +RV IPK NG +R LGIP + DR+VQ+A+ + PI+E F S+GFRP R H AI Sbjct: 100 PVKRVEIPKPNGGVRKLGIPTVVDRMVQQAVAQILTPIFERVFSDNSFGFRPHRGAHDAI 159 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V D R V++ DL +YFD V+H L++K +++ I D + L+ K + +G Sbjct: 160 AKV----VDLYNQGYRRVVDLDLKAYFDNVNHDLMIKYLQQYIDDPWTLRLIRKFLTSGV 215 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 +D GLF + +G PQGG +SPLL+NI LNE D+ L R Sbjct: 216 LDHGLFAKSEKGTPQGGPLSPLLANIYLNELDKELTRR---------------------- 253 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + RYADD + VK +A E + LE LK+++N DKTK+ Sbjct: 254 -----------GHHFVRYADDCNIYVKSQRAG-ERVMRSITQFLEKRLKVKVNSDKTKVG 301 Query: 320 HVNDGFIFLGHRL-------IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 FLG L + ++ + R+ + RN SLT + +++ Sbjct: 302 SPLR-LKFLGFSLGVDHNGAYARPAKQSQQRIKKALKLLTKRNRGISLTRMFEEIQ 356 >UniRef50_C3FJT8 RNA-directed DNA polymerase (Reverse transcriptase) n=11 Tax=Bacteria RepID=C3FJT8_BACTB Length = 443 Score = 273 bits (699), Expect = 5e-72, Method: Composition-based stats. 
Identities = 125/372 (33%), Positives = 192/372 (51%), Gaps = 44/372 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +++++ A ++P+ R L IT+ L EA + + GA PG+DG + ++ Sbjct: 19 LRQRIYRKAKSEPTHRFWGLFTHITKMTTLHEAYQQARKNNGA--PGIDGKSFADIELEG 76 Query: 61 AV-ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L +++EL +G Y+P R+V IPK+NGK+R L IP +RDR+VQ A+ + +E I+E Sbjct: 77 VIPFLTGIQEELQAGIYRPQSNRKVEIPKANGKMRTLQIPCIRDRVVQGALKLILEAIFE 136 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS H A+ V+ + R +I+ DLS YFDT+ H +L++ + Sbjct: 137 ADFCPNSYGFRPKRSPHQALAEVRRSILR----RMTIIIDVDLSRYFDTIRHNILLEKIA 192 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +R+ D + M L+ + IKA A GVPQGG SPL +NI LNE D Sbjct: 193 KRVQDPQVMHLVKQVIKA---------AGKIGVPQGGPFSPLAANIYLNEVDW------- 236 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + R N++ AV Y R+ADD V+ V G ++ Sbjct: 237 -------------TFDAIRRKTAEGNYE---AVNYHRFADDIVIAVSGHSSKSGWAELAL 280 Query: 300 RGVLE--GSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKARN 355 R + E L + LN++KT++ +V G F FLG L R +R V IP++KAR Sbjct: 281 RRLWEQLKPLGVELNLEKTQMINVLKGESFGFLGFDLRRIPNRNKNGFFVFMIPKKKART 340 Query: 356 F-AASLTALLWK 366 A + L+ Sbjct: 341 TVKAKIRELIQN 352 >UniRef50_A8MI91 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=A8MI91_ALKOO Length = 421 Score = 273 bits (698), Expect = 8e-72, Method: Composition-based stats. 
Identities = 118/365 (32%), Positives = 176/365 (48%), Gaps = 45/365 (12%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ I + E L A + + GA PG+DG L + ++ L D+L + Sbjct: 2 RKWYSLIDKIYRKENLELAFKYVKKNNGA--PGIDGETVFNFHLNLELNIEFLHDKLKTN 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P RRV I K +G +R LGIP ++DR+VQ+A++ +EPI++ FH SYG+RP S Sbjct: 60 GYEPSPVRRVEIQKPDGGVRLLGIPTVKDRVVQQAIVNIIEPIFDKTFHPSSYGYRPNHS 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + + G V++ DLS FDT+ H ++MKAV RISD R + L+ K Sbjct: 120 QHGAVAKAERFMNKYGLEH---VVDMDLSKCFDTLDHEIMMKAVSERISDGRVLKLIEKF 176 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +KAG + F G PQGGVISPLLSNI LN+FDQ + + Sbjct: 177 LKAGVMHSDNFSRTEVGSPQGGVISPLLSNIYLNQFDQRMMSK----------------- 219 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 + R+ADD ++ K K + VLE LKL++N + Sbjct: 220 ----------------GIRIVRFADDILIFAKDKKT-AGNYKAYATQVLENELKLKVNNE 262 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEM---RVVSTIPQEKARNFAASLTALLWKVRISG 371 KTK+ +VN+G FLG + K R + + RN L ++ + Sbjct: 263 KTKLTNVNEGVEFLGFVIKDKWLGVNPKRIERFKDKVRSKTKRNAGRKLEDIIKDL---N 319 Query: 372 EILLG 376 ++ G Sbjct: 320 PVIRG 324 >UniRef50_D2CK02 Putative uncharacterized protein orf3 (Fragment) n=1 Tax=Candida viswanathii RepID=D2CK02_9ASCO Length = 801 Score = 273 bits (698), Expect = 8e-72, Method: Composition-based stats. 
Identities = 111/402 (27%), Positives = 182/402 (45%), Gaps = 43/402 (10%) Query: 5 LATWAATDPSLRIQR-LLRL-ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 L + + P+ I R L + I + L A S G TPG+ N T L Sbjct: 196 LKLRSKSHPNEIIDRDLYKTFILNKDLLRTAYEKLKSRPGMMTPGI---NPTTLDGMSEE 252 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 L + ++L + ++ P +R+ IPKSNGK RPL + + D++VQ M M +E I+E F Sbjct: 253 RLDNIINKLRNKSFKFTPGKRIIIPKSNGKRRPLTLGSPEDKLVQEVMRMVLEAIYEPLF 312 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 +S+G+RP+RS H A+R + + C W IEGD+ S FD + H LMK + +I Sbjct: 313 LDVSHGYRPKRSCHSALRAIFTKFKGC-----TWWIEGDIKSCFDDIPHDKLMKVLSNKI 367 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------- 234 D F+ L+ K +KAG++ + GVPQG VISP+L+NI L++ D ++ Sbjct: 368 KDQSFLELIRKCLKAGYMYQYTNKTDIIGVPQGSVISPILANIYLHQLDLFIMNIKDSFD 427 Query: 235 ----------------HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYA 278 ++ + KA D+ + R+ + + + Y RY Sbjct: 428 WKGPRYKHDIGHAKLQYQLRKAKKAGSDKRVLHKMAVELRNHKMNFKGERTNKLTYVRYV 487 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRK-- 335 DD+++ + G+ Q I L L ++ +KTKI + D +FLG + Sbjct: 488 DDWIVAINGSHKQAVEILSSISEYCMNELGLTISPEKTKITNSYKDHILFLGTLIKHSIH 547 Query: 336 ---RSRYGEMRVVST---IPQEKARNFAASLTALLWKVRISG 371 R G ++ ++ I ++ L + R +G Sbjct: 548 PTFSIRNGHLKQRNSGLLILSAPMKSIQDKLINSGFLWRRTG 589 >UniRef50_C9BNF1 Group II intron reverse transcriptase/maturase n=6 Tax=Bacilli RepID=C9BNF1_ENTFC Length = 600 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. 
Identities = 101/365 (27%), Positives = 180/365 (49%), Gaps = 18/365 (4%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 + + + +L LI + A R ++KG+ TPG D + E L Sbjct: 21 FTQSRNGKKFYQLYELIISENNILLAYRTIKANKGSSTPGTDSFTIDNYKEMNQAEFIHL 80 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 L +Y+P +RV IPK NG+ RPLGIP + DR++Q+ +EPI E+ F+ SY Sbjct: 81 ILSHLE-NYKPKSIKRVMIPKPNGEKRPLGIPCMIDRMIQQMFKQILEPICEAKFYEHSY 139 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDAR 186 GFRP RS HA+ + + ++ + ++ D+ +FD V+HRLL+K + I D + Sbjct: 140 GFRPLRSAKHALGRI---MYLINISKMHYAVDIDIKGFFDNVNHRLLIKQLWNIGICDKQ 196 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + +L K++K+ G+ S+G QGG+ISPLLSN++LN+ D ++ +++ + + + Sbjct: 197 VLAILSKSLKSPIQGEGI---PSKGTIQGGIISPLLSNVVLNDLDHWVSKQWHTFETKYP 253 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 N + R T +++ + RYADDF ++ ++ + L+ Sbjct: 254 YTKGYNKFRALRDTNLKQGY-------IVRYADDFKIMTNDYPTALK-WFHAVKLYLKDR 305 Query: 307 LKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLW 365 LKL ++ +K+KI ++ FLG + K+ + + S I +K + + Sbjct: 306 LKLDISNEKSKIVNLRKCKSEFLGFAICVKQ-KGKKWVCNSRISNKKKDQIKEEIKQRIK 364 Query: 366 KVRIS 370 ++ S Sbjct: 365 DIQKS 369 >UniRef50_A5ZWA2 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5ZWA2_9FIRM Length = 428 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. 
Identities = 123/386 (31%), Positives = 188/386 (48%), Gaps = 50/386 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q KL A R L + + + L EA + ++KG+ GVDG+ ++ Sbjct: 13 LQNKLYLTAKKCQKRRFHALYDKVYRDDVLIEAWKRVKANKGS--SGVDGIRIEDIEKMG 70 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L+ L+ EL+ G Y P P +RV IPK +G RPLGIP +RDRIVQ A +A+EP++E Sbjct: 71 IEKYLKELKKELIEGKYIPSPVKRVMIPKPDGSERPLGIPTVRDRIVQMAAKIAIEPVFE 130 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS A+ V+ + +G +V++ D+ +FD V+ LM + Sbjct: 131 ADFRECSYGFRPKRSAKQALEVVRKACNN----KGYYVVDADIEKFFDNVNQDKLMILIE 186 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +RISD R + L+ + +++G + + + G QG VISPLL+NI LN D+ Sbjct: 187 QRISDRRILKLIRQWMRSGILYGSILTISELGTSQGSVISPLLANIYLNTLDRLWE---- 242 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + GR+ + RYAD+ V+I K K+ A + Sbjct: 243 ---------------KYGRTHGI-----------LVRYADNTVIICKNKKSVNHA--QSL 274 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGHRLIR------KRSRYGEMRVVSTIPQ 350 + G L LRL+ KTKI ++ +GF FLG R K +RYGE + Sbjct: 275 LQYIMGKLDLRLHPVKTKIVNMWDGTEGFDFLGLHHRRFLKINKKGNRYGETYQYPSKKA 334 Query: 351 EK--ARNFAASLTALLWKVRISGEIL 374 K R S+ V+ E++ Sbjct: 335 MKKMKRTVKESINQRYLLVKTEEELI 360 >UniRef50_B4CYA7 RNA-directed DNA polymerase (Reverse transcriptase) n=6 Tax=Bacteria RepID=B4CYA7_9BACT Length = 482 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. 
Identities = 120/398 (30%), Positives = 181/398 (45%), Gaps = 58/398 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q L A T+PS R L + + ++L EA R + G+ GVDG ++A Sbjct: 17 LQSSLQAKAKTEPSYRFYSLWDKVCRGDFLVEAYRRCRRNGGS--AGVDGETFEQIEAAG 74 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L L L++EL + Y+ P RV+IPKSNG RPLGIP +RDR+VQ A +M + PI+E Sbjct: 75 LDAWLGKLQEELRTKQYRTQPLLRVWIPKSNGGQRPLGIPTVRDRVVQMATVMVLGPIFE 134 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +D GFRP R A+R V Q+ G V++ DLS YF+TV H LMK++ Sbjct: 135 TDLCEEQMGFRPGRDAKTAVRLVYYQVRQKGRQE---VVDADLSDYFNTVPHGALMKSLS 191 Query: 180 RRISDARFMTLLWKTIKAGHI----DVGLFRAAS-----EGVPQGGVISPLLSNIMLNEF 230 RRI+D + ++++ + ++A D L R+ G PQGGVISPLL+N+ F Sbjct: 192 RRIADGQVLSVIARWLEAPVEECTPDGRLVRSTPAKDAGRGTPQGGVISPLLANVYFRRF 251 Query: 231 DQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK-GTK 289 L + L + D N YADDFV+ + G Sbjct: 252 --VLAWKQLGYEQAFDSVIVN-------------------------YADDFVICCRPGNG 284 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTI 348 G + L +N KTK+ V + F FLG+ + + G+ + + Sbjct: 285 NDARKAMTRVM----GKIGLTVNEQKTKVVRVPGESFDFLGYTIGGFYGQGGKPYIGTRP 340 Query: 349 PQEKARNFAASL----------TALLWKVRISGEILLG 376 ++ + + +V++ E L G Sbjct: 341 SKKAILRIMGEIHEQTSSQWNASEPETRVKVLNEKLRG 378 >UniRef50_Q11ZP4 RNA-directed DNA polymerase n=33 Tax=Bacteria RepID=Q11ZP4_POLSJ Length = 461 Score = 272 bits (695), Expect = 1e-71, Method: Composition-based stats. 
Identities = 107/348 (30%), Positives = 171/348 (49%), Gaps = 46/348 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 LL E L A + ++KGA GVDG++ + +R ELL+G Y+P Sbjct: 52 GLLEAALTRENLQVAWKRVKANKGA--AGVDGLDIEHTAQTIRNHWSQIRQELLAGTYRP 109 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRV IPK +G R LGIP + DR++Q+A+L ++P+ + F S+GFRP R H A Sbjct: 110 SPVRRVMIPKPDGSQRELGIPTVLDRLIQQALLQVLQPLIDPTFSEHSHGFRPGRRAHDA 169 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 ++ + + + R V++ DLS +FD V+H +L+ +R+R+ DA + L+ + AG Sbjct: 170 VKAARAHVQ----SGKRVVVDVDLSKFFDRVNHDILIDRLRKRVDDAGVIRLIRAYLNAG 225 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 +D G+ +G PQGG +SPLL+N++L+E D+ L R Sbjct: 226 IMDGGVVMDRQQGTPQGGPLSPLLANVLLDEVDKVLEAR--------------------- 264 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 ++ RYADD + V KA + + E R L LKL++N K+ + Sbjct: 265 ------------GYSFARYADDCNVYVGSVKAG-QRVMELLRK-LYAGLKLQINEAKSAV 310 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 G FLG+ L + + V + ++ R+F A + L + Sbjct: 311 ASAF-GRKFLGYALWVAKGKE----VKCKVAEKPLRDFKARIRQLTSR 353 >UniRef50_Q01P79 RNA-directed DNA polymerase (Reverse transcriptase) n=16 Tax=Bacteria RepID=Q01P79_SOLUE Length = 462 Score = 272 bits (695), Expect = 1e-71, Method: Composition-based stats. 
Identities = 101/351 (28%), Positives = 160/351 (45%), Gaps = 47/351 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 RL+ + + E L A + ++KG+ PGVDG+ ++ L +R +LLSG Y+ Sbjct: 49 NRLMEEVCERENLKAALQRVKANKGS--PGVDGMTVIGIKDYLKQHWPAIRGQLLSGTYE 106 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P RRV I K +G +R LGIP + DR +Q+A++ ++ W+ F SYGFRP RS Sbjct: 107 PKPVRRVEIAKPDGGVRKLGIPTVLDRFIQQAVMQVLQRRWDRTFSDYSYGFRPGRSA-- 164 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 + Q W ++ DL +FD V+H LM + +RI+D R + L+ + A Sbjct: 165 --QQAVAQAQQYIAEGHGWCVDLDLEKFFDRVNHDKLMGQIAKRIADKRLLKLIRAFLNA 222 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G ++ GL + EG PQGG +SPLLSN++L+EFD+ L R Sbjct: 223 GVMENGLVSPSVEGTPQGGPLSPLLSNLVLDEFDRELERR-------------------- 262 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD + V+ +A + + E + LKL++N K+ Sbjct: 263 -------------GHRFVRYADDCNIYVRSERAG-QRVMESITQFITQKLKLKVNETKSA 308 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 + + FLG I + F + + + + Sbjct: 309 VARPQER-KFLGFSFT------AGPEAKRVIAPKALDRFKRRIREITGRAK 352 >UniRef50_C1PA09 RNA-directed DNA polymerase n=3 Tax=Firmicutes RepID=C1PA09_BACCO Length = 432 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. 
Identities = 113/370 (30%), Positives = 183/370 (49%), Gaps = 45/370 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q +L A D + + I + + L EA + + GA GVD V+ ++A Sbjct: 16 FQNRLYLAAKADRKRKFYAIYDKIYRKDILEEAWKRVKQNGGAG--GVDKVSIEDVKAYG 73 Query: 61 AVEL-QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +L + +EL + Y+ P RR YIPK +G+ R LGIP ++DR+VQ A + +EP++E Sbjct: 74 EEKLLNEIAEELRTEKYRCKPVRRTYIPKQDGRKRALGIPTIKDRVVQMATKIVIEPVFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 ++F SYGFRP+RS A+ + + WVI+ D+ YF +++H L+ V+ Sbjct: 134 ANFQPCSYGFRPKRSAKQAMDRIFEV---ADKGGALWVIDADIKDYFGSINHDKLLLLVK 190 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +RI+D R + L+ +KAG ++ + ++ G PQGGVISPLLSN+ LN FD Y ++ + Sbjct: 191 QRITDRRVLKLIKGWLKAGVLEDSQYSESTVGAPQGGVISPLLSNVYLNYFDIYWNKAFG 250 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 RYADDFV++ K EA+R Sbjct: 251 ------------------------------HLGELVRYADDFVILCKRLSHAEEALRAV- 279 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGH--RLIRKRSRYGEMR-VVSTIPQEKA 353 + L+L L+ +KT++ + D F FLG R R R++ + + +P +KA Sbjct: 280 -KWIMRKLELTLHSEKTRLVDMYFGKDSFDFLGFNNRFQRFRNKSWQWYWTLQQVPSKKA 338 Query: 354 -RNFAASLTA 362 + A++ Sbjct: 339 MKKMRANIKE 348 >UniRef50_B7HM08 Group II intron reverse transcriptase/maturase n=30 Tax=Firmicutes RepID=B7HM08_BACC7 Length = 610 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. 
Identities = 107/376 (28%), Positives = 183/376 (48%), Gaps = 24/376 (6%) Query: 1 MQRKL-ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 MQRK ++ + +L+ +I E + A R ++KG++T GVD + + Sbjct: 15 MQRKYDELYSNSLNGNNFYKLIDIIGSEENIRLAYRNIKTNKGSNTAGVDNLTIKDI--- 71 Query: 60 LAVELQILRDELLS--GHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 + + E+ +YQP +RV IPK K RPLGIP + DR+VQ+++L +EP Sbjct: 72 WHLNDTKIIHEVRKRLNNYQPQAVKRVLIPKEGSDKKRPLGIPTIWDRLVQQSILQVLEP 131 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I E+ FH SYGFRP RS HHA+ V + + + ++ D+ +FD V H+ L++ Sbjct: 132 ICEAKFHNHSYGFRPNRSTHHALSRVVSLINIGHQ---HYCVDIDIKGFFDNVCHKKLLR 188 Query: 177 AVRR-RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 + I D + ++ K +K+ G+ ++G PQGG+ISPLLSNI+LNE D ++ Sbjct: 189 QMWTLGIRDKSLLCVISKILKSEIEGEGI---PNKGTPQGGIISPLLSNIVLNELDWWIS 245 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 ++ + K + Q R + RYADDF ++ + T + + Sbjct: 246 SQWETYKPHRISTRHLGFRQYARKYTNLKCG------YVVRYADDFKIMCR-TYDEAQRF 298 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRL-IRKRSRYGEMRVVSTIPQEKA 353 L+ L L +N K+K+ ++ +FLG ++ + K+ + + T +KA Sbjct: 299 YHATVDFLKSRLGLEINPKKSKVVNLKKNSSVFLGFKIKVVKQGKAKFGYIAKTSMSDKA 358 Query: 354 RNFAAS-LTALLWKVR 368 A L + ++ Sbjct: 359 ITKAKRQLKERIKGIQ 374 >UniRef50_UPI0001C42942 reverse transcriptase n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42942 Length = 644 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. 
Identities = 120/368 (32%), Positives = 184/368 (50%), Gaps = 23/368 (6%) Query: 1 MQRKL---ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 +Q L A + + L+ L+ + + A S+KG+ T G+D Sbjct: 14 LQNTLDEMYDLAKNN-NEPFYNLIELMKNQQTIMTALHNIKSNKGSKTVGIDNKTIDYY- 71 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 L E + + + Y P P RR YIPK N KLRPLGIP + DRI+Q + +EP Sbjct: 72 LHLPYEDLVSQVQTCIEDYNPEPVRRKYIPKENSDKLRPLGIPTMIDRIIQEITRLVIEP 131 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I E+ F+ SYGFRP RS HA+ + L +++ WVIEGD+ YFD ++H L+ Sbjct: 132 IAEAKFYKFSYGFRPMRSAEHAMAEI---LEKARKSKTYWVIEGDIKGYFDNINHNKLIT 188 Query: 177 AVRR-RISDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL 234 + + I D R ++++ K +K+G ++ G + G PQGG+ISPLL+NI LN FD + Sbjct: 189 MLWKIGIKDKRVLSIIKKMLKSGIVEEDGEIYPSDLGSPQGGIISPLLANIYLNFFDWMI 248 Query: 235 HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 E + D+ ++ N+ +R R +R + V RYADD+V++ +K Q + Sbjct: 249 AEEF-------DQHHYINNYER-RDKGLRAIRRDHKPVYSIRYADDWVVLC-SSKKQADT 299 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLGHRLI--RKRSRYGEMRVVSTIPQE 351 + + R L+ L L L+ +KTKI + V + FLG ++ G V IP Sbjct: 300 LLIKIRKYLKHQLSLELSEEKTKITNLVEEKASFLGFEFFVEPRKKGKGNKMVAKMIPDR 359 Query: 352 KARNFAAS 359 K N Sbjct: 360 KKSNKKVR 367 >UniRef50_C1L365 Group II intron-encoded protein n=1 Tax=Bacillus thuringiensis RepID=C1L365_BACTU Length = 598 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. 
Identities = 96/364 (26%), Positives = 180/364 (49%), Gaps = 20/364 (5%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL-R 68 + + L I + A S++G+ TPG D + + E+ L R Sbjct: 23 KSKNGGNFKDLYSFIIDERNIKLAFGTIKSNQGSKTPGTDQITINEYKGFSDDEIIHLVR 82 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 ++L++ ++P RRV+IPK++GK RPLGIP + DRI+Q+ +EPI E+ F+ SYG Sbjct: 83 EKLIN--FKPDSVRRVFIPKADGKSRPLGIPTMLDRIIQQCFKQILEPIVEAKFYEHSYG 140 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARF 187 FRP RS HHAI + + + ++ D+ +FD ++H L+K + + D + Sbjct: 141 FRPLRSTHHAIARTNFLINI---NKLHYCVDIDIKGFFDNINHNKLIKQLWNIGVRDKQV 197 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDR 247 + ++ K +K + G A +G PQGG++SPLL+N++LN+ D ++ ++ + Sbjct: 198 LAIIKKMLKCEIVGEG---KAEKGTPQGGILSPLLANVVLNDLDHWVASQWHNFPTNTHY 254 Query: 248 WYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSL 307 + + T +++ RYADDF + + + + +E +L Sbjct: 255 EDNRPRYRVQKKTKLKQG-------FIVRYADDFKI-FTNSYNSAKRWFHAVKNYIEKNL 306 Query: 308 KLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 KL ++ K+KI ++ +FLG + ++ + S IP++ ++ A++ + Sbjct: 307 KLEISESKSKITNLRKRKSLFLGIE-FKAVAKKKKFVAQSYIPKDSMKSIQANIKKKVKS 365 Query: 367 VRIS 370 +RI+ Sbjct: 366 IRIN 369 >UniRef50_C1BDP2 Putative RNA-directed DNA polymerase n=2 Tax=Rhodococcus opacus B4 RepID=C1BDP2_RHOOB Length = 605 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. 
Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 31/374 (8%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 ++R++ ++ L +L+ L ++T + G T G+DG +A Sbjct: 37 LRRRIFKATREQDWAAVRSLQKLMLGSWSNTLVSVRQVTQRNAGRRTAGIDGETALSPEA 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R + +++ ++P+P RRVYIPK+ GK RPLGIP + DR Q + A+EP W Sbjct: 97 RANMAVRVHESR---SSWEPVPVRRVYIPKAGGKRRPLGIPVVMDRCHQARVRTALEPEW 153 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP RS AI + L + + W+++ DLS+ FD + H L+ + Sbjct: 154 EARFEARSYGFRPGRSCADAIGALYSTL-NGSRAKRVWILDADLSAAFDRIDHPRLLDTL 212 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 L+ K + AG I+ G F ++ EG PQGGVISPLL N+ L+ ++ RY Sbjct: 213 ----GSFPARELIGKWLTAGVIENGRFASSEEGTPQGGVISPLLLNVALHGLEEAAGVRY 268 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 L D + RYADD V ++ Q E +++ Sbjct: 269 LKAADTLDARSVPGTP------------------VLVRYADDLVACCH-SRQQAELVKDR 309 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 G L L N DKT I H+ +GF FLG+ + R R +++ + + Sbjct: 310 LAGWLAPR-GLAFNEDKTHIVHLEEEGFDFLGYNIRRYRRGTRPAKLLIKPNSDAVKRIH 368 Query: 358 ASLTALLWKVRISG 371 L + ++R S Sbjct: 369 RRLANEMRRMRGSN 382 >UniRef50_B2A0J9 RNA-directed DNA polymerase (Reverse transcriptase) n=43 Tax=Bacteria RepID=B2A0J9_NATTJ Length = 475 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. 
Identities = 111/359 (30%), Positives = 177/359 (49%), Gaps = 45/359 (12%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 ++ +L LL I + + +A + S+KG+H G+DG+ L L LR Sbjct: 49 SNANLSKGNLLEEILDRDNMNKAFKKIKSNKGSH--GIDGMGVDELLQYLKENGDHLRQR 106 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 +L G Y+P P RRV IPK +GK R LGIP + DR++Q+A+ + PI+E F SYGFR Sbjct: 107 VLDGKYRPNPVRRVEIPKEDGKKRKLGIPTVVDRVIQQAIAQVLSPIYEEQFSDNSYGFR 166 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P RS H AI+ + + + ++V++ DL YFDTV+ L++ + + I D R ++L Sbjct: 167 PGRSTHDAIKKSQQNINE----GYKYVVDMDLEKYFDTVNQSKLIEVLSKTIKDGRVISL 222 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 + K ++AG + ++ GVPQGG +SP+LSNIML+E D+ L +R Sbjct: 223 INKYLRAGVMIKHTYKDTEVGVPQGGPLSPILSNIMLHELDKELEKR------------- 269 Query: 251 NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLR 310 + RYADD ++ K ++ ++ +E L L+ Sbjct: 270 --------------------GHEFVRYADDLLIFCKSRRSAGRTLKN-ILPFIENKLFLK 308 Query: 311 LNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQE-KARNFAASLTALLWKVR 368 +N DKT + +V FLG R + G+ R+ + K R LT+ + + Sbjct: 309 VNKDKTVVAYVGK-VRFLGFGFYRHK---GKARLRVHLKSVTKMRTRIKELTSRSYGIS 363 >UniRef50_Q1QGR6 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Bradyrhizobiaceae RepID=Q1QGR6_NITHX Length = 440 Score = 270 bits (690), Expect = 6e-71, Method: Composition-based stats. 
Identities = 114/375 (30%), Positives = 175/375 (46%), Gaps = 48/375 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ-AR 59 +QRKL A +P+ R L I + + L A + ++ GA PGVDG+ ++ A Sbjct: 12 LQRKLYRKAKAEPAFRFYILYDKICREDVLLRAYTLARANAGA--PGVDGMTFGQIEGAG 69 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L LR++L+S YQP P RRV IPK G RPLGIP +RDR+VQ A + +EPI+E Sbjct: 70 VDAWLAGLREDLVSKTYQPDPVRRVMIPKPGGGERPLGIPTIRDRVVQAAAKIVLEPIFE 129 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + F +YG+RP RS A++ L V++ DLS YFDT+ H L+++V Sbjct: 130 AGFEDGAYGYRPRRSAIDAVKETHRLL----CRGYTDVVDADLSKYFDTIPHADLLRSVA 185 Query: 180 RRISDARFMTLLWKTIKAGHIDVGL--------FRAASEGVPQGGVISPLLSNIMLNEFD 231 RR+ D + L+ ++ + ++++ G PQGGV+SPLLS I +N F Sbjct: 186 RRVLDRNVLRLIKLWLQVPVEERDGDGKRHMSGGKSSTRGTPQGGVVSPLLSVIYMNRF- 244 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 L L+G+ YADDFV++ +G + Sbjct: 245 --LKHWRLTGRGEVFH------------------------AHVISYADDFVILSRGHAEE 278 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKR-SRYGEMRVVSTIP 349 L L LN KT + + +GF FLG+ L + G + ++ Sbjct: 279 ALTWTRAVMT----KLGLTLNEAKTSVKNARREGFDFLGYTLGPRHLPNGGRWYLGASPS 334 Query: 350 QEKARNFAASLTALL 364 ++ + + LL Sbjct: 335 KKSMQRVKVKIGELL 349 >UniRef50_C3LL08 Group II intron reverse transcriptase/maturase n=34 Tax=Firmicutes RepID=C3LL08_BACAC Length = 643 Score = 270 bits (689), Expect = 7e-71, Method: Composition-based stats. 
Identities = 114/382 (29%), Positives = 185/382 (48%), Gaps = 19/382 (4%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L A S + L+ +I E + A R +KG+ T D VN ++ Sbjct: 29 LYQKATKGNS--FKNLMSIIISDENILLAYRNIKGNKGSRTAACDNVNIKNIEGMEQSYF 86 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 +YQP RR I K NG+ RPLGIPA+ DRI+Q+ +L MEPI E+ F Sbjct: 87 LNEVKR-RFQNYQPQKVRRKEISKPNGQTRPLGIPAMWDRIIQQCILQVMEPICEAHFSN 145 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR-RIS 183 SYGFRP RS HA+ +++ + +V++ D+ +FD V+H LM+ + I Sbjct: 146 RSYGFRPNRSAEHALADASVRVN---KQNLTYVVDVDIKGFFDEVNHVKLMRQLWTLGIR 202 Query: 184 DARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D + + ++ K +KA + G ++G PQGG++SP+L+N+ LNEFD ++ ++ + K Sbjct: 203 DKQLLVIIRKILKAPVQMPDGTTMFPTKGTPQGGILSPILANVNLNEFDWWISRQWETFK 262 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAY-CRYADDFVLIVKGTKAQVEAIREECRG 301 A+K + I + K Y RYADDF + T++ E I + + Sbjct: 263 AKKVKPRCMRGIWCNDVVTTQLTKTSKMKPMYIVRYADDFKI-FTNTRSNAEKIFKATQM 321 Query: 302 VLEGSLKLRLNMDKTKIPHVND-GFIFLGHRL---IRKRSRYGEMRVVSTI---PQ--EK 352 LE LKL ++ +K+K+ ++ FLG L + + + G+ R ++ P+ EK Sbjct: 322 WLEERLKLSISAEKSKVTNLTKQQSEFLGFTLKAVKKGKKKNGDTRYIAVTHVSPKALEK 381 Query: 353 ARNFAASLTALLWKVRISGEIL 374 + A + K S E + Sbjct: 382 TKQDLAKQVRRIQKTPNSNETI 403 >UniRef50_C3B585 Reverse transcriptase/endonuclease protein n=1 Tax=Bacillus mycoides Rock3-17 RepID=C3B585_BACMY Length = 614 Score = 270 bits (689), Expect = 9e-71, Method: Composition-based stats. 
Identities = 108/378 (28%), Positives = 192/378 (50%), Gaps = 31/378 (8%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPE--WLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ +++ L RL+ + E L ++T +KG T GVDG Sbjct: 34 IQQRIYRAEQLGQRRKVKGLQRLLMRSEATLLISIRQVTQLNKGKRTAGVDGFKAIKPTE 93 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPK--SNGKLRPLGIPALRDRIVQRAMLMAMEP 116 R+ + ++ L ++P P +R+ IPK + KLRPLGIP + DR+ Q + +A+EP Sbjct: 94 RIKLFHKMKAMNL--ATHKPSPVKRIEIPKDTAGKKLRPLGIPIIIDRVYQNVVKLALEP 151 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 WE F SYGFRP+R AI ++ L+L ++ RWV EGD FD ++H +++ Sbjct: 152 QWEVHFEPTSYGFRPKRGCQDAITSIFLKLKT--TSKKRWVFEGDFKGCFDNLNHDYILE 209 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 ++ D + ++ K ++AG + G+F + G PQGG+ISPLL+NI L+ ++ + Sbjct: 210 QIK----DLPYKEIVKKWLRAGFVHNGVFNLTNNGTPQGGIISPLLANIALHGMEEEIGV 265 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 +Y++ + + ++Q +S RYADDFV++ TK + E++ Sbjct: 266 KYINRTHPRKKGERYWTVQDTKS--------------VVRYADDFVIMT-DTKEEAESMY 310 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR---KRSRYGEMRVVSTIPQEKA 353 E+ + L L L +KTK+ HV++GF FLG + + + + ++ + ++ Sbjct: 311 EKLKPYLVKR-GLELAPEKTKVVHVSEGFDFLGFTIRQFPTAKEKGRLWKLFTKPSKKSI 369 Query: 354 RNFAASLTALLWKVRISG 371 + + A K + S Sbjct: 370 KKAVTKIKACFEKYKGSN 387 >UniRef50_B2AJV8 RNA-directed DNA polymerase, retrotranscriptase n=45 Tax=root RepID=B2AJV8_CUPTR Length = 607 Score = 270 bits (689), Expect = 9e-71, Method: Composition-based stats. 
Identities = 118/384 (30%), Positives = 173/384 (45%), Gaps = 52/384 (13%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 A+ ++ L+ IT P L ++ + GVDGV + L + L D Sbjct: 182 ASGKKVQFTALMHHIT-PRLLIDSFMHLKK---SAAAGVDGVTWHDYEECLVERIGKLWD 237 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 + +G Y+ LP+RRVYIPK++GK RPLGI AL D+IVQ+A++ + PI+ESDF SYGF Sbjct: 238 AVQAGRYRALPSRRVYIPKADGKQRPLGIAALEDKIVQQAVVTVLTPIYESDFLGFSYGF 297 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP R H A+ + + L + WV++ D+ S+FDTV H +M+ + RI+D R + Sbjct: 298 RPGRGQHQALDALWVGL---HWKKVNWVLDADIRSFFDTVDHGWMMRFLEHRIADKRLLR 354 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRW 248 L+ K + AG I+ G G PQG VISPLL+NI L+ FD +L Sbjct: 355 LIRKWLTAGVIENGAKTEIRVGTPQGAVISPLLANIYLHYVFDLWLQRW----------- 403 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 K V RYADD V+ + +A E + Sbjct: 404 ---------------RRRDAKGDVIVVRYADDSVVGFEA-EADASRFLEALKARF-AQFG 446 Query: 309 LRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRVVSTI-PQEK 352 L LN KT++ F FLG I R +V + ++ Sbjct: 447 LSLNEQKTRVLQFGRYAASLRKRAGLGRPQTFDFLGFTHICATKRSNGGFIVRRLTSSKR 506 Query: 353 ARNFAASLTALLWKVRISGEILLG 376 R +L L++ R ++G Sbjct: 507 MRATLKALRQALYRRRHEPIAVVG 530 >UniRef50_A0RHJ0 Reverse transcriptase/endonuclease protein n=6 Tax=Firmicutes RepID=A0RHJ0_BACAH Length = 608 Score = 269 bits (688), Expect = 9e-71, Method: Composition-based stats. 
Identities = 102/359 (28%), Positives = 173/359 (48%), Gaps = 19/359 (5%) Query: 1 MQRKLATWAATDPSLR-IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q R + + LI E + A R S+ G+ T G +G L Sbjct: 17 LQSTFDNLYEESKKGRYFKNIYELIISEENIRLAFRNLKSNIGSKTKGTNGHTIKHLNKI 76 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 A +L L + L +Y P RR++I K NGK+RPLGIP + DR++Q+ +EPI E Sbjct: 77 DADKLIRLTQKRLE-NYMPHAVRRLFISKPNGKMRPLGIPTIEDRLIQQMFQQVLEPIVE 135 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 FH SYGFRP+R H A+ + + +V++ D+ +FD V+H+ LM+ + Sbjct: 136 GKFHPQSYGFRPKRGTHDALARCYHMVN---HSHQHFVVDIDIKGFFDNVNHKKLMRQLW 192 Query: 180 -RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I D + ++++ K +KA G+ +G PQGG++SPLL+N++LNE D ++ ++ Sbjct: 193 TIGIRDKKVLSIIKKMLKAEVTGEGI---PVKGTPQGGILSPLLANVVLNELDWWVSNQW 249 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + R N + T ++ + RYADDF + + I+ Sbjct: 250 ETKPTRVPYKLKRNKTDALKKTRLKP-------MYLVRYADDFKI-FTNSYDNARKIKIA 301 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLI-RKRSRYGEMRVVSTIPQEKARN 355 L+ L L ++ +K+KI ++ +G FLG R ++ +V++ KA++ Sbjct: 302 VEKWLKERLGLEISEEKSKITNLRKNGTDFLGIRFRAVQKGNAKTGYIVNSKMDPKAKD 360 >UniRef50_Q1Q0X4 Similar to Group II intron encoded reverse transcriptase n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q0X4_9BACT Length = 298 Score = 269 bits (688), Expect = 9e-71, Method: Composition-based stats. 
Identities = 103/341 (30%), Positives = 171/341 (50%), Gaps = 46/341 (13%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 + L + + L A + +KG G+D V+ ++ L V + + EL + Sbjct: 3 KYHSLRDKVFSLKNLYAAFKHVKKNKGK--AGLDRVSIKQFESNLDVNIMSIHQELKTAI 60 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y P P RVYIPK RPLGIP ++DRIVQ+A +EPI+E F S+GFRP+R Sbjct: 61 YNPAPVLRVYIPKGRHDKRPLGIPIVKDRIVQQAFRQIIEPIFEKGFSDNSFGFRPDRCC 120 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H AI+ ++ + V++ D+ +++DT+ H+L+M ++R +I+D + + + Sbjct: 121 HDAIKRLEQ----YKQEGYTSVLDADIMAFYDTIPHKLIMDSLREKIADGWVLNSIENML 176 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 KAG ++ G+ ++G PQGGVISPLL+N++ + D+ L Sbjct: 177 KAGVMEDGIVHETNKGTPQGGVISPLLANLIGDIIDKELE-------------------- 216 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 K + RYADDFV++ K TK ++ A + ++ G L ++L+ DK Sbjct: 217 -------------KAGYKFVRYADDFVVMTK-TKDELPAALSYVKEIIAGKLGMKLSEDK 262 Query: 316 TKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 TK+ + GF FLG+ G+ + +ST K ++ Sbjct: 263 TKLTNFERGFRFLGYDF------KGKYKGISTKSLNKLKSL 297 >UniRef50_C2XKK9 D-alanine--D-alanine ligase A (D-alanylalanine synthetaseA) n=1 Tax=Bacillus cereus F65185 RepID=C2XKK9_BACCE Length = 647 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. 
Identities = 108/351 (30%), Positives = 177/351 (50%), Gaps = 15/351 (4%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + A + S ++ L+ + + A S++G+ T G+D + A +L Sbjct: 44 IYQKAKEENSC-FHGIIELMKNKQTIKTAIHNIKSNRGSMTVGIDKKDVNYYLQMEAKQL 102 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + + +Y+P P RR YI K NGK RPLGIP + DRI+Q + +EPI E+ F Sbjct: 103 IKLIRQHID-NYKPNPVRREYINKGNGKKRPLGIPTMIDRIIQEIARIVLEPIAEAKFFN 161 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRIS 183 SYGFRP RS H+AI V L ++ IEGD+ S+FD ++H L++ + I Sbjct: 162 HSYGFRPYRSCHYAIGRV---LNTISRSKTYIAIEGDIKSFFDHINHNKLVEMMWNMGIK 218 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D RF+ ++ K ++AG ++ + G PQGG+ISPLL+NI LN FD + + + +A Sbjct: 219 DKRFLIIIKKMLRAGVLEDKVILPTEIGTPQGGIISPLLANIYLNNFDWMVAKEFEEHRA 278 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 R + ++ + G + R + RYADD++++ + T Q + + Sbjct: 279 R---YTVKHAFRSGLTKVGRRH----KKCFLIRYADDWIILCEDT-VQARILLTKIDKYY 330 Query: 304 EGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 + LKL L+ +KT I + + FLG + ++ R + IP +K Sbjct: 331 KHILKLELSKEKTFITDLREKPARFLGFDIKAEKMRLKDRIAGKAIPNKKK 381 >UniRef50_C5ER86 RNA-directed DNA polymerase n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ER86_9FIRM Length = 635 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. 
Identities = 102/380 (26%), Positives = 179/380 (47%), Gaps = 22/380 (5%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + + + + L+ L+ E + A R + G+ TPG+DG L E+ L Sbjct: 29 KSLENRKFKNLMELVLMEENIKLAYRNMKKNDGSTTPGIDGKTIEHLAKMTEKEVIELVR 88 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 L Y P RRV I K NGK RPLGI ++ DR++Q+ +L +EPI E+ FH S GF Sbjct: 89 NKLE-WYTPKAIRRVEIDKGNGKKRPLGIASIEDRLIQQCILQVLEPICEAKFHDRSNGF 147 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR-RRISDARFM 188 RP R V +A+ + + + V++ D+ +FD V H L+K + I D + + Sbjct: 148 RPNRGVENALAQAEKLIQS---NKLYIVVDIDIKGFFDNVSHGKLLKQLWTIGIQDKKLI 204 Query: 189 TLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR---K 245 +++ +K ++G +G QG +ISPLLSN++LNE D ++ ++ R K Sbjct: 205 SIISAMLKGEIAEIGF---PEKGTAQGSIISPLLSNVVLNELDWWIASQWEFMPTRHVYK 261 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 + N + + + N K RYADDF + + K + E + L+ Sbjct: 262 EAIKANGTQSKSKKYRALRNSTLK-ECFIVRYADDFKIFCRKHK-DAVVMFEATKQWLKT 319 Query: 306 SLKLRLNMDKTKIPHVNDGF-IFLGHRL-IRKRSRYGEMR-------VVSTIPQEKARNF 356 L L ++ +K+KI ++ + FLG R+ + K+ + + + V S I ++ + Sbjct: 320 RLGLDISPEKSKIVNLKHSYSEFLGFRIKVHKKGKDTKCKPPVDKYVVKSHISEKALKKI 379 Query: 357 AASLTALLWKVRISGEILLG 376 + + ++ + +G Sbjct: 380 KTNAKERIIAIQKTNGSRVG 399 >UniRef50_B0URY2 RNA-directed DNA polymerase n=25 Tax=cellular organisms RepID=B0URY2_HAES2 Length = 575 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. 
Identities = 103/376 (27%), Positives = 175/376 (46%), Gaps = 42/376 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ ++A +++ L R++T + A A R + G T G+D +++ Sbjct: 40 MQVRIAKATQESNWRKVKNLQRMLTHSFYAKALAVRRVTENTGKRTAGIDKRIWDTPESK 99 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 I +L S YQP P RRV+IPKSNGK RPLGIP ++DR +Q L+A++PI E Sbjct: 100 W-----IAIQDLSSKGYQPKPLRRVFIPKSNGKKRPLGIPTMKDRAMQMLYLLALQPIAE 154 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLT---DCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 + SYGFR RS AI + + + WV++ D+ FD ++H L+K Sbjct: 155 TTADNNSYGFRLNRSTADAISHIHSIFSTKGNQSRQMAEWVLDADIHGCFDFINHDWLLK 214 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + +L K +K+G ++ G + +EG PQG +ISP L+N+ L+ ++ L + Sbjct: 215 HI------PMNKRILKKWLKSGVVEFGQLKPTTEGTPQGDIISPTLANMALDGLEKELIK 268 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + + + K K RYADDF++ + E + Sbjct: 269 HFGAKNSLKI---------------------AKHRTYLVRYADDFIISGISKELLEEQVI 307 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 + L L L+ KTK+ H+ GF FLG + R + +++ ++ A+ F Sbjct: 308 PMVKNFLAER-GLSLSESKTKVVHIEHGFDFLGWTVKRF-----DKKLIIKPSKKNAKAF 361 Query: 357 AASLTALLWKVRISGE 372 + + K++++ + Sbjct: 362 YDKVKQSISKMKMAKQ 377 >UniRef50_C4ZES6 RNA-directed DNA polymerase n=27 Tax=Bacteria RepID=C4ZES6_EUBR3 Length = 554 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. 
Identities = 102/376 (27%), Positives = 171/376 (45%), Gaps = 39/376 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++ L L+T + A A + S+KG +T GVD + + Sbjct: 30 LQMRIVKAQKDGHYNKVKTLQWLLTHSFYAKALAVKRVTSNKGKNTAGVDHELWKTPKGK 89 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 ++L Y+P P RRVYIPK NGKLRPL IP + DR +Q A+EP+ E Sbjct: 90 FEA-----IEKLKRRGYKPQPLRRVYIPKKNGKLRPLSIPTMTDRAMQTLYKFALEPLAE 144 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFR RS H AI L C +W++EGD+ FD + H L+ + Sbjct: 145 TLADPNSYGFRIGRSTHDAIGQCFNDL--CRAGSPQWILEGDIKGCFDHISHNWLLANIP 202 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +K G ++ EG PQGG ISP+L N+ L+ ++ L ER+ Sbjct: 203 MD------KKMLGKWLKCGFVETKKLFPTEEGTPQGGTISPVLMNMTLDGLERILKERFP 256 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + ++ + + RYADDF++ K + + Sbjct: 257 MRRTVAGKTVYDQ-------------------INFVRYADDFIVTGKSPETLRNEVMPLI 297 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L L+L+ +KT I H++DGF FLG + + +++ + ++F Sbjct: 298 KDFLAER-GLQLSEEKTVITHISDGFDFLGQNVRKY-----NGKLLIKPSKNAIKSFLKK 351 Query: 360 LTALLWKVRISGEILL 375 + ++ + + + + LL Sbjct: 352 VRTIVRENKTATQDLL 367 >UniRef50_B4D379 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D379_9BACT Length = 441 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 182/385 (47%), Gaps = 52/385 (13%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 QR++A A + ++ L + EW+ EA KG+ PGVDG + L Sbjct: 13 QRRIAELAEYHGAEGLRTLGHHL-DLEWMREAYGRVR--KGS-APGVDGKSVADYGRELD 68 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWES 120 L L D SG YQ P RRV+IPK+NGK RP+G+P + D+I+QRA++M +EP++E Sbjct: 69 KNLGGLIDRAKSGSYQAPPVRRVHIPKANGKETRPIGMPTVEDKILQRAVVMLLEPMYER 128 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 +F SYGFRP RS H A++ + T+ WV++ D+ ++FDT+ H +LM +++ Sbjct: 129 EFGDFSYGFRPGRSAHQALKAI---WQGINRTQAGWVVDVDIRAFFDTLDHGVLMGILQK 185 Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYL 239 R+ D + L+ K +KAG ++ G G PQGGVISPLLSNI L+E D++ + Sbjct: 186 RVKDGVILKLVAKWLKAGVMEAGALSYPEAGTPQGGVISPLLSNIYLHEVLDEWFEAAVI 245 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + RYADDFV+ + + E + + Sbjct: 246 P--------------------------RLQGRGFMVRYADDFVMGFE-CREDAERVMKAL 278 Query: 300 RGVLEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRV 344 G G L+L+ KT++ + F FLG +SR G + Sbjct: 279 PGRF-GRYGLKLHEGKTRLVRFGKPEDGSGGGGGSGKPETFDFLGFTHHWAKSRKGRWYI 337 Query: 345 VSTIPQEKARNFAASLTALLWKVRI 369 +++ R ++ + + Sbjct: 338 QRKTARKRLRRALKTIHQWCRQNQH 362 >UniRef50_A2TD24 Intron encoded protein n=2 Tax=Bacillaceae RepID=A2TD24_BACSO Length = 604 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. 
Identities = 105/358 (29%), Positives = 175/358 (48%), Gaps = 18/358 (5%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNK-TMLQARLAVELQILRDELLSG 74 R + L+ + + + A S++G T G DG +L + ++ Sbjct: 16 RFKGLVEIASSDVVIVSAIHKIKSNQGNSTAGTDGKTISDILTLNYDEAINFVKRCFK-- 73 Query: 75 HYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y P P RRV+IPK K RPLGI + DRI+Q + M +EPI E+ F SYGFRP R Sbjct: 74 KYTPNPIRRVHIPKPGKKEKRPLGILTIADRIIQECVRMVIEPILEAQFFQHSYGFRPYR 133 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLW 192 A + ++ + C WVIEGD+ +FD V+H +L+K + I D R + ++ Sbjct: 134 ---DAKQAIERCVFICNRIGYNWVIEGDIKGFFDNVNHTILIKQLWHMGIRDRRMLMIIK 190 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 +KAG I + G PQGG+ISPLL+N+ L++ DQ++ + K R Sbjct: 191 AMLKAGVI--KETKINEMGTPQGGIISPLLANVYLHKLDQWITREWEEKKMRN-----GT 243 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 +I+ + ++R++ Y RYADD+ ++ ++ E + + L+ +LKL L+ Sbjct: 244 TIRTAKYKSLRDHSTITKPEFYVRYADDW-ILFTNSRGNAEKWKYRIKKYLKENLKLELS 302 Query: 313 MDKTKIPHVNDGF-IFLGHRL-IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 DKT I ++ FLG ++ + + G+ ++ EK + + L K++ Sbjct: 303 DDKTLITNIKKKPMKFLGFKIKMIPHGKGGKYIGYASADTEKIKGKVEQIRKDLRKLK 360 >UniRef50_Q24QQ9 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense Y51 RepID=Q24QQ9_DESHY Length = 591 Score = 268 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 113/363 (31%), Positives = 169/363 (46%), Gaps = 35/363 (9%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 T+P + L RL + A S+ GA T G DG T L + Sbjct: 15 RRNTTNPGYVNEDLYRLFYSRDLYIIAYNSVKSNDGAETSGADG---TSLHGFCEEWITQ 71 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L + YQP P R IPK +GKLR L P +D++VQ A+ + +E I+E F LS Sbjct: 72 LITSMRDESYQPQPNRTTMIPKKSGKLRKLSFPNGKDKLVQEAIRIILECIYEPTFSNLS 131 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFRP+RS AI V+ W IEGD+S+ FD + HR L +R RI D R Sbjct: 132 HGFRPKRSTQSAIAEVQTW------RGTIWFIEGDISACFDDIDHRTLETILRERIRDER 185 Query: 187 FMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE--------- 236 F+ L+ K +KAG+ D L++ G QG SPLL NI L++ D+++ Sbjct: 186 FIRLVNKVLKAGYFDMQHLYQKTKTGNAQGSCCSPLLCNIYLDKLDKFMENVMEQDTMGG 245 Query: 237 -RYLSGKARKDRWYWNNSIQRGRSTAVRENWQ--------------WKPAVAYCRYADDF 281 R + K R+ + +++ G ++ + V Y RYADDF Sbjct: 246 YRRQNPDYAKARYLYKKALKSGSDPQTVQHLKRTMEHLPTTDRYDPNFRRVNYVRYADDF 305 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYG 340 ++ V +K ++ + L+ L LRL+ +KTKI H D FLG+ L + ++ Sbjct: 306 LIGVIASKKYALDLKLNLKEFLQNELSLRLSDEKTKITHAADKHVSFLGYILRKGSVKHS 365 Query: 341 EMR 343 + + Sbjct: 366 KFQ 368 >UniRef50_B7JTB6 Group II intron reverse transcriptase/maturase n=6 Tax=Bacillus cereus RepID=B7JTB6_BACC0 Length = 599 Score = 268 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 108/372 (29%), Positives = 187/372 (50%), Gaps = 21/372 (5%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML-QAR 59 +Q L + + + + LL ++ E + A R S+ G+ TPG D L Sbjct: 17 IQNDLFQRSR-EGTKNFKNLLEIVISDENILLAYRQVKSNTGSKTPGTDDKTILDLANTN 75 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + +R+ +L+ Y+P RRV+I K+ + RPLGIP ++DRIVQ+ L +EPI Sbjct: 76 QDEFIHYMRELVLN--YKPKSVRRVWIDKNYSKGKRPLGIPCIQDRIVQQMFLNVLEPIC 133 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E F+ SYGFRP R+ HA+ V+ + + + ++ D+ +FD V+H +L+K V Sbjct: 134 EGKFYNHSYGFRPTRTTRHAVARVQTLVNI---NKYHYTVDIDIKGFFDNVNHSILLKQV 190 Query: 179 -RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 I D R + ++ K +KA G+ ++GVPQGG++SPLLSNI+LN+ DQ++ ++ Sbjct: 191 WNIGIRDKRVIAVISKMLKAPIKGEGI---PTKGVPQGGILSPLLSNIVLNDLDQWVADQ 247 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + + R + S+ + +R N + K RYADDF ++ T Sbjct: 248 WECFETR-----YQYSVNYSKYVNLRRNSKLKEGF-LVRYADDFRIMT-NTHDSAVKWFH 300 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 L LKL ++ +K+KI ++ FLG++ + + + S I +K + Sbjct: 301 AVVDFLNKRLKLEISPNKSKIINLRKKSSSFLGYKF-KSTIKGNKRVFFSHIDDDKQKQI 359 Query: 357 AASLTALLWKVR 368 L +++++ Sbjct: 360 ITKLKERIYEIQ 371 >UniRef50_C3BJV7 D-alanine--D-alanine ligase A (D-alanylalanine synthetaseA) n=2 Tax=Bacillus RepID=C3BJV7_9BACI Length = 620 Score = 268 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 111/372 (29%), Positives = 178/372 (47%), Gaps = 19/372 (5%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 Q +L + + R L + + A +++GA+TPGVDG + Sbjct: 17 QDQLYELSK--KNTRFHSLYEMAFNETTIITAIHKIKANRGANTPGVDGHDIRRYLQMDK 74 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 + L + + +Y+ PARRVYI K++G RPLGIP + DRI+Q + +EPI E+ Sbjct: 75 NNVIKLITK-AARNYKSKPARRVYIEKADGSQRPLGIPTVVDRIIQECIRTILEPIVEAK 133 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F+ SYGFRP RS HA+R V + T+ + IEGD+ YFD ++HR L+K + R Sbjct: 134 FYDHSYGFRPYRSSKHAVRQVNHFIN---TTKSYYAIEGDIKGYFDNINHRFLIKKLWRL 190 Query: 182 ISDARFMTLLWK-TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240 + + + + +KAG+++ +G PQGG+ISPLL+N+ LN+FD + R+ Sbjct: 191 GIRDKRIIKIIQIMLKAGYMEYDFKFTTEKGTPQGGIISPLLANVYLNDFDWMVARRFYK 250 Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 K + R R Q + RYADD++++ + T + E R Sbjct: 251 AKPTGIS-------KEPRKQRERLVRQGRNKCYLVRYADDWIILTQ-TYQEARRYLEYLR 302 Query: 301 GVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLI---RKRSRYGEMRVVSTIPQEKARNF 356 LKL L+ +KT I + + +FLG + RS G + + K Sbjct: 303 KYFRIKLKLELSKEKTVITDLREKPALFLGFDIYAETPLRSNSGNIVGKNKPNHRKVSGQ 362 Query: 357 AASLTALLWKVR 368 + + + K+R Sbjct: 363 ISKVCKEIRKMR 374 >UniRef50_B9J6F8 Reverse transcriptase n=7 Tax=Bacillus cereus group RepID=B9J6F8_BACCQ Length = 624 Score = 268 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 108/351 (30%), Positives = 177/351 (50%), Gaps = 15/351 (4%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + A + S ++ L+ + + A S++G+ T G+D + A +L Sbjct: 21 IYQKAKEENSC-FHGIIELMKNKQTIKTAIHNIKSNRGSMTVGIDKKDVNYYLQMEAKQL 79 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + + +Y+P P RR YI K NGK RPLGIP + DRI+Q + +EPI E+ F Sbjct: 80 IKLIRQHID-NYKPNPVRREYINKGNGKKRPLGIPTMIDRIIQEIARIVLEPIAEAKFFN 138 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRIS 183 SYGFRP RS H+AI V L ++ IEGD+ S+FD ++H L++ + I Sbjct: 139 HSYGFRPYRSCHYAIGRV---LNTISRSKTYIAIEGDIKSFFDHINHNKLVEMMWNMGIK 195 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D RF+ ++ K ++AG ++ + G PQGG+ISPLL+NI LN FD + + + +A Sbjct: 196 DKRFLIIIKKMLRAGVLEDKVILPTEIGTPQGGIISPLLANIYLNNFDWMVAKEFEEHRA 255 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 R + ++ + G + R + RYADD++++ + T Q + + Sbjct: 256 R---YTVKHAFRSGLTKVGRRH----KKCFLIRYADDWIILCEDT-VQARILLTKIDKYY 307 Query: 304 EGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 + LKL L+ +KT I + + FLG + ++ R + IP +K Sbjct: 308 KHILKLELSKEKTFITDLREKPARFLGFDIKAEKMRLKDRIAGKAIPNKKK 358 >UniRef50_Q64E53 Prophage LambdaSa1 transcriptase/maturase family protein n=1 Tax=uncultured archaeon GZfos14B8 RepID=Q64E53_9ARCH Length = 430 Score = 268 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 107/363 (29%), Positives = 169/363 (46%), Gaps = 48/363 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 L+ + L EA ++GA G+D V + L L ++ L Y Sbjct: 50 HSLIDKVWNWRNLNEAWEKVKQNRGAG--GIDDVTIDEFERNLEQNLNEIQRLLRQDRYV 107 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P +RVYIPK +GK RPLGIP +RDR+VQ+A+ +EPI+E++F S+G+RP +S Sbjct: 108 PKPVKRVYIPKPDGKQRPLGIPTIRDRVVQQALKNVIEPIFEAEFLDSSFGYRPGKSAKQ 167 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 AI ++ + WV++ D+ ++FDTV+H L+ AV RISD R + L+ ++A Sbjct: 168 AIEQIE----TVRDEGHEWVVDADIKAFFDTVNHEKLIDAVAERISDGRVLGLIRAFLEA 223 Query: 198 GHIDVGLFR-AASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 ++ G R G PQGGVISPLL+NI L+ FD+ + E Sbjct: 224 DIMEQGQGRAKNVVGTPQGGVISPLLANIYLHYFDERMAEL------------------- 264 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 RYADD +++ + EAI + E L L+ KT Sbjct: 265 --------------GFEVVRYADDVLVLCGSEEEAEEAISHVKEILEELEL--TLHPQKT 308 Query: 317 KIPHVNDGFIFLGHRLIRKRS---RYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 KI + ++G FLG + + + + + RN +L ++ + + Sbjct: 309 KIKNFSEGVDFLGFTVYVSHKVPRKEAVRKYKGAVRRATRRNLPINLEMVIQGL---NPV 365 Query: 374 LLG 376 ++G Sbjct: 366 VIG 368 >UniRef50_Q0AW97 RNA-directed DNA polymerase (Reverse transcriptase) n=24 Tax=cellular organisms RepID=Q0AW97_SYNWW Length = 443 Score = 267 bits (683), Expect = 4e-70, Method: Composition-based stats. 
Identities = 107/379 (28%), Positives = 163/379 (43%), Gaps = 50/379 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++A A +P R L+ I E L E G+ GVD V K + L Sbjct: 7 RIAEIARQNPKERFTALIHHI-NHETLKECHLEI---SGSKASGVDQVTKQAYEENLEAN 62 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + L + Y+P P RRVYIPK K RPLGIP+ D++VQ+ + + I+E DF Sbjct: 63 IADLIGRMKRQAYKPQPVRRVYIPKEGSNKRRPLGIPSYEDKLVQKGLARILNTIYEQDF 122 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP R H A++ + + + ++++ D+ +FD V H +MK + RI Sbjct: 123 LDCSFGFRPGRGCHDALKVLNHIIE---RKKVNYIVDADIRGFFDHVDHEWMMKFLELRI 179 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 +D + L+ + +KAG ++ G+ +G PQGG++SP+L+NI L+ Sbjct: 180 ADPNLLRLIKRFLKAGVMEAGIVYDTPKGTPQGGIVSPILANIYLHYV------------ 227 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 D W+ +R + A RYADDFV + K+ E R Sbjct: 228 --LDLWFEKVVKKRCQGEA-----------YLVRYADDFVCCFQ-NKSDAEWFYANLRER 273 Query: 303 LEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRVVST 347 L L + +KT+I D F LG +S+ G RV Sbjct: 274 L-NKFNLEVAEEKTRIIAFGRFADKESKKQGRKKPDTFDLLGFTHYCSKSKKGWFRVKRK 332 Query: 348 IPQEKARNFAASLTALLWK 366 Q+K R+ L K Sbjct: 333 TSQKKYRSSLLKCKTWLRK 351 >UniRef50_C8VXL4 RNA-directed DNA polymerase (Reverse transcriptase) n=5 Tax=Bacteria RepID=C8VXL4_DESAS Length = 434 Score = 267 bits (683), Expect = 4e-70, Method: Composition-based stats. 
Identities = 131/374 (35%), Positives = 196/374 (52%), Gaps = 45/374 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 ++RK+ A D + R L + + E L EA ++ + GA PG+DG+ ++A Sbjct: 11 LRRKIYIKAKADKTWRFWGLYVHVCKIETLQEAYKMAKNKNGA--PGIDGITFDNIEASG 68 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + + LQ ++ EL+SG Y P RR IPK +GK R LGIP +RDR+VQ A+ + +EPI+E Sbjct: 69 IEIFLQQIQKELISGTYWPTQNRRKEIPKGDGKYRILGIPTIRDRVVQGALKLILEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYG+RP+R+ H AI V + + VI+ DL SYFDTV H LL+K V Sbjct: 129 ADFQEGSYGYRPKRNPHQAIDRVAKAVVENKTR----VIDLDLRSYFDTVRHDLLLKKVA 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +R++D M LL +KA GVPQGGVISPLL+N+ LNE D+ L Sbjct: 185 KRVNDENVMRLLKLILKASG---------KRGVPQGGVISPLLANLYLNEVDKMLE---- 231 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 ++ V + Q+ + Y R+ADD V+++ + Sbjct: 232 ------------------KAKEVTRHEQYTH-IEYARFADDIVILIDAYPKWNWLEKAVY 272 Query: 300 RGVLEG--SLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKAR- 354 + +LE L ++LN +KT+I ++ +G F FLG R R+R G+ V+ T P+ KAR Sbjct: 273 QRLLEELTKLDVQLNEEKTRIVNLANGESFGFLGFDFRRSRTRKGKWGVLFT-PKMKART 331 Query: 355 NFAASLTALLWKVR 368 L + + Sbjct: 332 KILTELKETFRRFQ 345 >UniRef50_C3KST3 Group II intron reverse transcriptase/maturase n=10 Tax=Firmicutes RepID=C3KST3_CLOB6 Length = 626 Score = 266 bits (681), Expect = 7e-70, Method: Composition-based stats. 
Identities = 110/376 (29%), Positives = 183/376 (48%), Gaps = 14/376 (3%) Query: 1 MQRKLATWAATDPSLR-IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ + + L++ IT E + A R + G+HT G + + Sbjct: 26 MQSIFDELYKQSKDGKQFKNLIKTITSKENILLAYRNIKKNDGSHTKGTNHKTINDIAGE 85 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 E+ + L+ Y P +R+YIPK+NG RPLGIP + DR++QR++L +EPI E Sbjct: 86 SEDEIIEYVRKRLNKFY-PHSVKRIYIPKNNGDKRPLGIPTIEDRLIQRSILQVLEPICE 144 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + FH SYGFRP RS HAI +T + +V++ D+ +FD V+H L+K + Sbjct: 145 AKFHPHSYGFRPNRSTEHAIARA---MTLINMNKLHYVVDVDIKGFFDNVNHGKLLKQLW 201 Query: 180 R-RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I D + + ++ +KA D + +G PQGG+ISPLL+N++LNE D ++ ++ Sbjct: 202 TLGIKDKKLIKIISLMLKAQIKDGSMITNPVKGTPQGGIISPLLANVVLNELDWWISSQW 261 Query: 239 LSGKAR----KDRWYWNNSIQRGRSTAVRENWQWKPAVAY-CRYADDFVLIVKGTKAQVE 293 + + + K R + N + +S R K Y RYADDF + K K E Sbjct: 262 ETFETKHNYSKLRTFKNGTTTIDKSHKYRALRNGKLKEIYIVRYADDFKVFCKNPK-DAE 320 Query: 294 AIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGEMRVVSTIPQEK 352 I + L+ L L + +K+K+ ++ FLG L ++ R + S + ++ Sbjct: 321 KIFIAIKLWLKERLDLETSPEKSKVTNLRKHPTEFLGFELKAEKKRKKYV-CQSHVSKKA 379 Query: 353 ARNFAASLTALLWKVR 368 R + A + +++ Sbjct: 380 KRLIQEKIKAKIKELQ 395 >UniRef50_B3GTB4 Putative reverse-transcriptase protein n=1 Tax=Volvox carteri f. nagariensis RepID=B3GTB4_VOLCA Length = 749 Score = 266 bits (681), Expect = 8e-70, Method: Composition-based stats. 
Identities = 100/383 (26%), Positives = 179/383 (46%), Gaps = 34/383 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + D + L+ +I+ L A + S G TPG+ + L A Sbjct: 217 LKKANSIDLARVNNGLIHIISDTNLLIFAYELLKSKSGYMTPGI---TEESLDAIDLAWY 273 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + + +++ +G ++ ARRV IPK +LRPLG+ + RD++VQ+A+ + ++ I++ F Sbjct: 274 KHISNDIKAGKFKFSQARRVMIPKPGKSELRPLGVVSPRDKVVQKALELVLQCIFDPMFL 333 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S+G+RP +S H A++ + Q + WVI+GD+S FDT+ H +LM + +RIS Sbjct: 334 DCSHGYRPGKSQHTALKMLDQQFKNA-----TWVIKGDISKCFDTIDHEILMHLIGKRIS 388 Query: 184 DARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYL 239 + + L+ +KAG +D L+ + G P V SPLL NI ++EFD ++ + ++ Sbjct: 389 CNKTLALIKSALKAGNVLDGKLYANEAVGTPHSSVSSPLLCNIYMHEFDLFVKDIIVKFN 448 Query: 240 SGKARKDRWYWNNSIQR----------GRSTAVRENWQWKP----------AVAYCRYAD 279 G R+ + + + +R++ + + Y R+AD Sbjct: 449 KGTKRRQNPEYTKILNMLYKALEQFNFSKYAKLRKDLRRVRQVNIMDLDYVRIKYVRFAD 508 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSR 338 DFV+ + G + + L LKL LN KT I + FL ++ + + Sbjct: 509 DFVISIIGPYKLACDTKVMVKDFLMNKLKLTLNESKTAITKFSKKPIYFLDTEIMNRYPK 568 Query: 339 YGEMRVVSTIPQEKARNFAASLT 361 +++V + K N L+ Sbjct: 569 VKPVKLVKRLGVSKLANVTPRLS 591 >UniRef50_A6DJK4 Reverse transcriptase/maturase n=21 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DJK4_9BACT Length = 446 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. 
Identities = 110/366 (30%), Positives = 172/366 (46%), Gaps = 54/366 (14%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 + L + + + EA S+KG H GVD V+ ++ L L +EL G Sbjct: 43 KWYSLSDKLMRKNNIMEAWEKVCSNKGKH--GVDMVSIERYESELEYNNAKLLEELQDGR 100 Query: 76 YQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y P RRV IPK +G K RPLGIP +RDR+VQ A+ +EPI++ DF S+GFRP+ Sbjct: 101 YDPSAVRRVEIPKGDGRKTRPLGIPTVRDRVVQTALKHVIEPIFDIDFSPYSFGFRPKLG 160 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A+R V L + +V++ D+ SYFDT+ H LM V+ +I D + + L+ + Sbjct: 161 CKDALRRVNELL----KQGYLYVMDADIQSYFDTIPHEKLMSRVKEKIIDGKILDLIEQF 216 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +KA D EG PQGG+ISPLL+NI L+ FD + E Sbjct: 217 LKANIFDGLKHWEPEEGTPQGGIISPLLANIYLDLFDHKMTEA----------------- 259 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 RYADDF+++ K +K + + R ++ + L+L+ + Sbjct: 260 ----------------GFEIVRYADDFLIMCK-SKESAKRALRKTRRWMKAN-GLKLHPE 301 Query: 315 KTKIPHVNDG---FIFLGHRLIRKRS---------RYGEMRVVSTIPQEKARNFAASLTA 362 KT+I + + F FLG+ R R+ + + I ++ R+ S+ Sbjct: 302 KTRIVDMTEKCEYFEFLGYHFERTRNTHRIKRWPRKQSLKKCKDAIRKKTRRSNKDSIED 361 Query: 363 LLWKVR 368 ++ +R Sbjct: 362 IIAYLR 367 >UniRef50_C6IQ61 Putative uncharacterized protein n=10 Tax=Bacteroidales RepID=C6IQ61_9BACE Length = 560 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. 
Identities = 106/365 (29%), Positives = 170/365 (46%), Gaps = 38/365 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++ L L+T A A + S+KG +T GVD V + A+ Sbjct: 33 LQARIVKVQKEGRYGKVKALQWLLTHSFAAKALAVKRVTSNKGKNTSGVDKVLWSTPIAK 92 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 EL Y P+P +RV I KSNGKLRPLGIP ++DR +Q LMA++P+ E Sbjct: 93 ANA-----ITELKRRDYNPMPLKRVNIRKSNGKLRPLGIPTMKDRAMQALYLMALDPVAE 147 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFR ER AI + L+ E+ +W++EGD+ FD ++H L+ + Sbjct: 148 TTADNHSYGFRKERCTGDAIHQCYINLSK--ESSPQWILEGDIKGCFDHINHEWLLNNIP 205 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +K+G I EG PQGG+ISP L+N+ L+ L ++ Sbjct: 206 MD------KVMLRKWLKSGFIFNKQLFPTEEGTPQGGIISPTLANMALDGLQTMLEAKFH 259 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 R + P V RYADDF++ + + I Sbjct: 260 ------------------RVDLYSPKRSYYPKVHLIRYADDFIITSISKEMLEQEIMPMV 301 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L+ L L+ +KTKI H+++GF FLG + + + ++ + T +E + F Sbjct: 302 KEFLQAR-GLTLSEEKTKITHIDEGFDFLGFNIRKYKGKF-----LITPSKESQKKFQRK 355 Query: 360 LTALL 364 + ++ Sbjct: 356 INEIV 360 >UniRef50_C4ZCX5 RNA-directed DNA polymerase n=24 Tax=Bacteria RepID=C4ZCX5_EUBR3 Length = 464 Score = 265 bits (678), Expect = 1e-69, Method: Composition-based stats. 
Identities = 117/360 (32%), Positives = 169/360 (46%), Gaps = 49/360 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 RLL I + A + ++KGA PG+DG+ L Q + D + G Y P Sbjct: 41 RLLETILYKDNFNRAYKRVKANKGA--PGIDGMTIEEALPYLKEHQQEITDRIYRGKYTP 98 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRV IPK +G +R LGIP + DR +Q+A+ + PI+E F SYG+RP RS A Sbjct: 99 SPVRRVEIPKPDGGVRKLGIPTVIDRTLQQAITQQLVPIYEPLFADGSYGYRPNRSAKDA 158 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I VK + E + + DLS YFDT++H +L+ +R+ + D R + L+ + +K+G Sbjct: 159 ILKVK----EYAEQGYTFAVVLDLSKYFDTLNHEILINLLRKNVKDERVVQLIKRYLKSG 214 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 ++ G+ EG PQGG +SPLL+NI LNEFDQ Sbjct: 215 VMENGVVIDTEEGSPQGGNLSPLLANIYLNEFDQEY------------------------ 250 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 K V RYADD VL+ K +K E + E LE LKL +N +K++ Sbjct: 251 ---------LKRGVPCIRYADDIVLLAK-SKRASERLLESSTKYLEERLKLTVNREKSRT 300 Query: 319 PHVN--DGFIFLGHRLIRKRSR-------YGEMRVVSTIPQEKARNFAASLTALLWKVRI 369 V F FLG L R + S + + +R S+ L K+++ Sbjct: 301 VSVFAIRNFKFLGFALGRNGKGIYVRVHPKSWKKFKSRLKELSSRKRCQSIKPSLEKIKV 360 >UniRef50_Q7UY81 Reverse transcriptase/maturase n=1 Tax=Rhodopirellula baltica RepID=Q7UY81_RHOBA Length = 459 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. 
Identities = 111/360 (30%), Positives = 171/360 (47%), Gaps = 46/360 (12%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + L+ + + L +AR + KGA GVD + + E++ L ++L + Sbjct: 63 GGKWHALIDKVYRELNLFVSARKVVGKKGA--AGVDRQSTEDFSEKEIAEIKQLYEQLRT 120 Query: 74 GHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 G Y+P RRV IPK K RPLGIP +RDR+VQ A++ +EPI++++FH S+GFR Sbjct: 121 GTYRPQAVRRVQIPKPGSKQTRPLGIPTVRDRVVQTALVNVIEPIFDNEFHERSFGFRHG 180 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 RS H A+R V+ L ET +V++ DL YFDT+ L+ V +ISD R + L+ Sbjct: 181 RSCHDALRVVEELL----ETDHVFVVDADLQGYFDTIPKDRLLALVSEKISDRRVLDLVK 236 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 + + ++ GVPQG V+SPLLSN+ LNE D + + Sbjct: 237 RFLDQSILEELREWTPESGVPQGAVLSPLLSNLYLNELDHRMADL--------------- 281 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 RYADDFV++ + ++ Q E EE + + + L L+ Sbjct: 282 ------------------GYEMVRYADDFVILCR-SQEQAELALEEVKRFVCEA-GLTLH 321 Query: 313 MDKTKIPHVN-DGFIFLGHRLI---RKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 +KT I + F FLG+ R ++V TI + R SL A + ++ Sbjct: 322 PEKTHIVDSRVNSFDFLGYSFRGKLRFPRAKSHQKMVDTIRRLTPRKSGQSLEATIVQIN 381 >UniRef50_P38478 Uncharacterized mitochondrial protein ymf40 n=1 Tax=Marchantia polymorpha RepID=YMF40_MARPO Length = 502 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. 
Identities = 113/344 (32%), Positives = 168/344 (48%), Gaps = 34/344 (9%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 + PE A + S G PG D ++ + +L +Q P Sbjct: 6 YEQLLDPEIFRLAYELKKSKSGNMKPGADKETLDGFSQ---AYVEKVVRQLKDESFQFRP 62 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 +RR +IPK++GKLR LGIP+ RD+IVQ M +EP++E F S+GFRP RS H A+R Sbjct: 63 SRREFIPKADGKLRSLGIPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRSPHTALR 122 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 ++ T W+IEGD+ YFD + H LL + + D R + L WK ++AG++ Sbjct: 123 QIRRW------TGTSWMIEGDIKGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAGYV 176 Query: 201 DVGLFRAA-SEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG--------------KARK 245 + G GVPQG ++SPLLSNI L++FD ++ E + KAR Sbjct: 177 NQGKAEPHLLTGVPQGRILSPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYLKARN 236 Query: 246 DRWYWNNSIQRGRSTAVRENW---------QWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + S++ + +R Q V Y RYADD+V+ V G KA I+ Sbjct: 237 KYYKLVKSLKASSAEIIRARRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKALAVQIK 296 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRY 339 EE L+ LKL L +KT+I +++ +FLG + +Y Sbjct: 297 EEVSTFLQEKLKLSLQAEKTRITNLSRSEALFLGTLISITTRKY 340 >UniRef50_P05511 Uncharacterized 91 kDa protein in cob intron n=1 Tax=Schizosaccharomyces pombe RepID=YMC6_SCHPO Length = 807 Score = 263 bits (673), Expect = 6e-69, Method: Composition-based stats. 
Identities = 105/398 (26%), Positives = 178/398 (44%), Gaps = 37/398 (9%) Query: 5 LATWAATDPSLRIQR-LL-RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 L+ + P+L I R L + + A S+ G T G + L Sbjct: 220 LSKRSKNYPNLVIDRNLYKDFLLNRDMFLIAYNKLKSNPGMMTHG---LKPDTLDGMSID 276 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + + L S + P RR+ I K++G RPL I + RD++VQ + + +E I+E F Sbjct: 277 VIDKIIQSLKSEEFNFTPGRRILIDKASGGKRPLTIGSPRDKLVQEILRIVLEAIYEPLF 336 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 +T S+GFRP RS H A+R++ C W IEGD+ + FD++ H L+ + +I Sbjct: 337 NTASHGFRPGRSCHSALRSIFTNFKGC-----TWWIEGDIKACFDSIPHDKLIALLSSKI 391 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH----ERY 238 D RF+ L+ K + AG++ ++ G PQG ++SP+L+NI L++ D+++ E Sbjct: 392 KDQRFIQLIRKALNAGYLTENRYKYDIVGTPQGSIVSPILANIYLHQLDEFIENLKSEFD 451 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKP---------------------AVAYCRY 277 G + R + + + A REN K + Y RY Sbjct: 452 YKGPIARKRTSESRHLHYLMAKAKRENADSKTIRKIAIEMRNVPNKIHGIQSNKLMYVRY 511 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKR 336 ADD+++ V G+ Q + I + S+ L ++ KTKI + D +FLG + + Sbjct: 512 ADDWIVAVNGSYTQTKEILAKITCFC-SSIGLTVSPTKTKITNSYTDKILFLGTNISHSK 570 Query: 337 SRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 + +A + + K+R +G +L Sbjct: 571 NVTFSRHFGILQRNSGFILLSAPMDRIAKKLRETGLML 608 >UniRef50_A8VT23 S-layer domain protein n=12 Tax=Bacilli RepID=A8VT23_9BACI Length = 422 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. 
Identities = 114/354 (32%), Positives = 170/354 (48%), Gaps = 46/354 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +L+ + P+ L A +S+KG PGVDG+ L+A + + L ++ G YQP Sbjct: 2 QLIDRVVCPDNLNLAMNRVISNKGN--PGVDGMTVDQLEAHVRQYAKPLIAKIQKGTYQP 59 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 LP +RV IPK NGK R LGIPA+RDR+VQ+A+ +EPI + F SYGFRP ++ A Sbjct: 60 LPVKRVEIPKENGKKRKLGIPAVRDRMVQQAIFQVIEPIIDPHFSPNSYGFRPGKNAKQA 119 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I+ + + V++ DL SYFDT+ H+ LM + + I D + L+WK +K+G Sbjct: 120 IKQA----AKYYDEGFKMVVDIDLKSYFDTIPHQKLMNYLEQYIQDPIILKLIWKFLKSG 175 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + + ++ G PQGG +SP+LSN+ L+E D+ L R Sbjct: 176 IMIGDNWESSRNGAPQGGNLSPILSNVYLHELDKELERR--------------------- 214 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF + VK +A E + LEG+LKL +N +K+ I Sbjct: 215 ------------GHRFVRYADDFCIYVKSRRA-AERVLLNTTTFLEGTLKLSVNQEKSAI 261 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGE 372 FLG + E R + F A L L + + + Sbjct: 262 GSPTKR-KFLGFCI---HKSNNETRCRPHHASK--AKFKAKLKYLTRRNQANSF 309 >UniRef50_B9K440 18S rRNA intron 1 protein n=1 Tax=Agrobacterium vitis S4 RepID=B9K440_AGRVS Length = 257 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. 
Identities = 154/245 (62%), Positives = 182/245 (74%), Gaps = 7/245 (2%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 MQ KLATWA +DP+ R RLLRLI EWLAE AR+ L+S GA TPG+DG++K LQ +L Sbjct: 6 MQHKLATWAESDPNRRFDRLLRLIANREWLAETARMVLASSGARTPGIDGMDKQRLQVKL 65 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 L LR LL Y+P P +R+YIPK NGKLRPL IP L DRIVQRAMLMAM PIWES Sbjct: 66 DQHLDDLRTSLLEESYRPQPVKRIYIPKPNGKLRPLDIPTLTDRIVQRAMLMAMGPIWES 125 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTD-CGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 DFH LSYGFR ER+VHHA+RTV++QL D TRGRW+IEGDL+SYFDTVHHRLL++ VR Sbjct: 126 DFHRLSYGFRSERNVHHAVRTVRIQLQDGADTTRGRWIIEGDLASYFDTVHHRLLLRCVR 185 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ--YLHER 237 RR+ D RF+ LLW+ +KAGHID GLF A+SEGVPQGG L S +DQ +H Sbjct: 186 RRVQDGRFVDLLWRFLKAGHIDRGLFTASSEGVPQGG----LWSADHKPPYDQRCKMHSA 241 Query: 238 YLSGK 242 +L G+ Sbjct: 242 WLQGR 246 >UniRef50_Q82RB7 Putative reverse transcriptase homolog; similar to GII intron n=1 Tax=Streptomyces avermitilis RepID=Q82RB7_STRAW Length = 588 Score = 263 bits (672), Expect = 8e-69, Method: Composition-based stats. 
Identities = 112/379 (29%), Positives = 171/379 (45%), Gaps = 48/379 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLI--TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +++++ A +++ L +L+ ++ L R+ S G T G+DG + Sbjct: 36 LRQRIFRAAREGDMKQVRNLQKLMRRSRANTLTSVRRVCQVSTGKKTAGIDGQKALSPEK 95 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R QIL D + P P RRVYIPK+NGK RPLGIP +RDR+ Q A+EP W Sbjct: 96 RGKTARQILADPMSH----PQPVRRVYIPKANGKRRPLGIPVIRDRVDQARFKNALEPEW 151 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP R AI + + + WV++ DLS+ FD + H+ LM +V Sbjct: 152 EARFEARSYGFRPGRGAWDAIEMIF-NVAGRRTAKRLWVLDADLSAAFDHISHQHLMDSV 210 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 + + ++AG ++ G F + EG PQGGVISPLL NI L+ + + Sbjct: 211 GLFPG----RRQIQQWLRAGVMEDGRFVSTPEGTPQGGVISPLLMNIALHGMGEVI---- 262 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 A R + RYADDFV+ T+ + +++ Sbjct: 263 ---------------------GANRPWNAKTTSPTLVRYADDFVVFCT-TENEAIKAKQD 300 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR----------YGEMRVVSTI 348 LE L N +KT++ H++ G FLG + R R + + +ST Sbjct: 301 LAAWLEPR-GLSFNEEKTRVVHLSSGVDFLGFNVGRFRQKLIIKPSRDALQRARKRISTT 359 Query: 349 PQEKARNFAASLTALLWKV 367 +E + + SL L Sbjct: 360 ARENSGSPTESLVRALSPF 378 >UniRef50_C9P0Q5 Retron-type reverse transcriptase n=3 Tax=Vibrio RepID=C9P0Q5_VIBME Length = 436 Score = 263 bits (672), Expect = 9e-69, Method: Composition-based stats. 
Identities = 116/360 (32%), Positives = 172/360 (47%), Gaps = 51/360 (14%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA--VELQILRDELLSGHY 76 RLL + P L A + +KG GVD + T +L Q LR LL G Y Sbjct: 5 RLLEQMFSPGNLNAATKQVKRNKGCG--GVDRLTITATLEKLRQLDNGQQLRQSLLDGSY 62 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 QP P V IPK G +R LGIP ++DRIVQ+AM + ++E F S+GFRP RS H Sbjct: 63 QPSPVLGVEIPKPKGGVRQLGIPTVQDRIVQQAMAQLLTQLYEPKFSKSSFGFRPRRSAH 122 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 HA+ + + +V++ DL YFDTV+H LM + + I+D R + L+ K ++ Sbjct: 123 HALSKASEYIREGR----GYVVDIDLEKYFDTVNHDRLMYRLSQDIADKRVLKLIRKYLQ 178 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 +G + G+ G PQGG +SPLLSNI+L+E D+ L R Sbjct: 179 SGLMRNGVIERRQRGTPQGGPLSPLLSNIVLDELDKELERR------------------- 219 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 +CRYADD + V G++A + ++ LE +LKLR+N +K+ Sbjct: 220 --------------GHKFCRYADDCQIYV-GSEAAAQRVKTSVTEFLEQTLKLRVNREKS 264 Query: 317 KIPHVNDGFIFLGHRL-------IRKRSRYGEMRVVSTIPQE-KARNFAASLTALLWKVR 368 V++ +LGHR I + + + V + + + R F A + L +R Sbjct: 265 AATRVSER-SYLGHRFNDDGVIGISDDAMHQMKKRVRQVTKRNRGRTFPAIIKELSTYLR 323 >UniRef50_B7I148 Reverse transcriptase n=9 Tax=Bacillus RepID=B7I148_BACC7 Length = 632 Score = 263 bits (671), Expect = 1e-68, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 197/385 (51%), Gaps = 33/385 (8%) Query: 4 KLATWAAT--DPSLRIQR--LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 KL + + RI+ LL + + A S+KG+ TPGVDG Sbjct: 20 KLYSKTKEHMEKKTRIKHTSLLEIAMSKPNIVTAIHSLKSNKGSMTPGVDGKTIQDYLRL 79 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +L L L+ +++ +RV+IPK+NG RPLGIP + DRI+Q+ M +EP+ E Sbjct: 80 SEEKLIELIRGRLT-NFKAHLIKRVFIPKANGGQRPLGIPTIEDRIIQQMMKQVLEPVLE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV- 178 + F S+GFRPER+ +HA+ VK+ + + T W++EGD+ +FD V+HR+L+K + Sbjct: 139 AQFFKYSFGFRPERTTYHALERVKVLVHN---TGYHWIVEGDIRQFFDKVNHRILIKKLW 195 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I D R + L+ + +KAG G PQGG++SPLL+N+ L+ FD+++ +++ Sbjct: 196 SMGIKDRRILCLITEFLKAGIF--KNIIRNDNGTPQGGILSPLLANVYLHSFDKWVAKQF 253 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 R + ++ ++ +S+ ++ + RYADD+VL V K+ + Sbjct: 254 EEFTTRHEYSKHDHKLRGLKSSNLKPGY-------LIRYADDWVL-VTNNKSHAYRWKTV 305 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRL-------IRKRSRYGEMRVVSTIP- 349 + L+ LKL L+ +KT+I ++ FLG + K+ + + R +S I Sbjct: 306 IKNFLQKELKLELSEEKTRITNIRHKPIEFLGFKYKVVLKGVKGKKKKDKKTRYISQITP 365 Query: 350 -----QEKARNFAASLTALLWKVRI 369 + K + A+LT+L ++ Sbjct: 366 SDKKIKRKVKELRATLTSLGKRLSH 390 >UniRef50_Q5ZTU1 Reverse transcriptase n=1 Tax=Legionella pneumophila subsp. pneumophila str. Philadelphia 1 RepID=Q5ZTU1_LEGPH Length = 506 Score = 263 bits (671), Expect = 1e-68, Method: Composition-based stats. 
Identities = 102/367 (27%), Positives = 171/367 (46%), Gaps = 48/367 (13%) Query: 1 MQRKLATWAATDPSLRIQRLL-RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A + +++ L L+ A R ++KG+ TPG+DGV T + + Sbjct: 34 LQVRIAKAVSNKQHGKVKSLQWLLVNSISAKLLAVRRVTTAKGSKTPGIDGVVWTTSEEK 93 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L + Y+ P RR+YIPK NGK RPL IP L+DR +Q L+A+EP+ E Sbjct: 94 CEAV-----RNLKARGYKATPLRRIYIPKKNGKERPLSIPTLKDRAMQALYLLALEPVGE 148 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFRP+RS H AI L + +W++EGD+ + FD + H L + Sbjct: 149 TTADLNSYGFRPKRSTHDAIYQCYATLAR--KNCAQWILEGDIKACFDEIDHGWLKSNI- 205 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 I D R +T + ++AG+++ + G PQGG SPLL+N++L+ ++ +H Sbjct: 206 --IIDQRVLT---QWLQAGYMEKNQLFETARGTPQGGPASPLLANMVLDGLEREIHSGCG 260 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 G + Y R+ADDF++ E + Sbjct: 261 QGN----------------------------KINYIRFADDFIVTANSPDILKEKVMPII 292 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L L L+ +KTKI H+ +GF FLG + + + ++ ++T ++ ++ Sbjct: 293 SNFLAQR-GLSLSQEKTKIVHIEEGFDFLGFNVRKYKGKF-----LTTPSKDSIKSVQMK 346 Query: 360 LTALLWK 366 + + K Sbjct: 347 IKETVKK 353 >UniRef50_Q1PUN9 Strong similarity to group II intron-encoded protein LtrA n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUN9_9BACT Length = 432 Score = 263 bits (671), Expect = 1e-68, Method: Composition-based stats. 
Identities = 122/374 (32%), Positives = 178/374 (47%), Gaps = 51/374 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRKL A + R L + +L EA + ++ G+ G DG+ +++ Sbjct: 21 FQRKLYRKAKQEEGFRFYVLYDKVRMLHFLREAYKRCKANGGS--AGADGITFEDVESYG 78 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L + +EL + Y+P P RVYIPK+NGK RPLGIP ++DR+VQ ++ + +EPI+E Sbjct: 79 VEKFLGEIIEELENKTYEPQPVLRVYIPKTNGKTRPLGIPVIKDRVVQMSVKLVIEPIFE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP RS A+R +K +L + V + DLSSYFDT+ H+ L+ + Sbjct: 139 ADFEDSSYGFRPGRSAGDAVRKIKEKLREGKTE----VFDADLSSYFDTIPHKELLLLIG 194 Query: 180 RRISDARFMTLLWKTIKAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 RISD + L+ +KA I+ G R G PQG VISPLL+NI L+ D+ ++ Sbjct: 195 MRISDKNVLHLIKMWLKAPVIEEGKPGGGRKNKIGTPQGSVISPLLANIYLHMLDKAVNR 254 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 +K + RYADD+VL+ K + Sbjct: 255 --------------------------ENGVFYKYGITIIRYADDWVLMAKRIPREALDYL 288 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHR-------LIRKRSRYGEM---RVV 345 L+ SL N DK+KI + F FLGH RK +Y + R Sbjct: 289 NRLLKKLKLSL----NEDKSKIVKAEEESFDFLGHTISFSEDLFGRKHKKYWNIEPSRKS 344 Query: 346 STIPQEKARNFAAS 359 +EK N+ S Sbjct: 345 QKKVREKIGNYLKS 358 >UniRef50_C6MS68 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MS68_9DELT Length = 439 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. 
Identities = 108/376 (28%), Positives = 157/376 (41%), Gaps = 44/376 (11%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 R +A A D R + L R +T L + S GVD V L Sbjct: 16 RGIADKAKADKQHRFRNLYRELTAEYLLNCWPDLNKS----AASGVDKVTAEAYAEELHG 71 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + L + L Y+ RR +IPK N K RPLGIPAL D++VQ A + I+E DF Sbjct: 72 NILNLAERLKDKKYRTKLVRRCWIPKENEKERPLGIPALEDKLVQLACAKLLIAIYEQDF 131 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 SYG+RP RS HA++ + L +++E D+ +FD + H L+K + RI Sbjct: 132 LDHSYGYRPGRSAKHAVQDLTFDLQ---YGSYGYIVEADIKGFFDRMDHDWLLKMLSLRI 188 Query: 183 SDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLN-EFDQYLHERYLS 240 +D F+ L+ K +KAG ++ G G PQGG++SP+L+N+ L+ D + E Sbjct: 189 NDRAFLHLIEKWLKAGILETDGTVTNPYTGTPQGGIVSPVLANVYLHFALDLWFEEVVKP 248 Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 + K CRYADD+V + K + E Sbjct: 249 --------------------------RCKGDARICRYADDWVCAFQ-LKDDAQRFYWELP 281 Query: 301 GVLEGSLKLRLNMDKTKIPH-------VNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 LE L +KT I + F FLG R R GE RV ++K Sbjct: 282 NRLE-KFHLETAPEKTNIVRFSRFHPGMERRFTFLGFEFFWLRDRQGEPRVKRRTSRKKL 340 Query: 354 RNFAASLTALLWKVRI 369 + + + + R Sbjct: 341 QGACKRIKKWIKENRH 356 >UniRef50_A9BGC0 RNA-directed DNA polymerase (Reverse transcriptase) n=49 Tax=Bacteria RepID=A9BGC0_PETMO Length = 472 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. 
Identities = 115/375 (30%), Positives = 173/375 (46%), Gaps = 45/375 (12%) Query: 6 ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQ 65 + D + +L I + + +A + ++KGA PG+DG+ L L + Sbjct: 39 SERGRNDDKGCSEGMLEKILSKDNMNKAYKKVKANKGA--PGIDGMKVEELFGYLRQHGE 96 Query: 66 ILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 LR ELL G Y P RR IPK +G R LGIP DR++Q+++ + PI+E F Sbjct: 97 ELRQELLEGRYTPKSVRRKEIPKPDGGKRLLGIPTSIDRVIQQSIAQVLTPIYEKKFVDN 156 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 SYGFRP R AIR K L WV++ DL YFDTV+H LM+ + + + D Sbjct: 157 SYGFRPLRDAKQAIRKSKEYLNK----GHTWVVDIDLERYFDTVNHDKLMRIISKDVKDG 212 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 R ++L+ K +K+G + G+ EG PQGG +SPLLSNIML+E D L +R Sbjct: 213 RVISLIRKYLKSGVMVNGVVIETEEGTPQGGPLSPLLSNIMLHELDVELTKR-------- 264 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 +CRYADD + VK K+ + E +E Sbjct: 265 -------------------------GHKFCRYADDCNIYVKSEKS-AYRVMESITKYIEK 298 Query: 306 SLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRY----GEMRVVSTIPQEKARNFAASLT 361 LKL++N K+K+ D +LG K +Y + + K +S Sbjct: 299 KLKLKVNSKKSKVVRPWD-LKYLGFSFYVKEEKYEIRVHGKSIKEFKKKLKGETKRSSGR 357 Query: 362 ALLWKVRISGEILLG 376 ++ +++ +I+ G Sbjct: 358 SMAYRLSRIKQIITG 372 >UniRef50_Q94Z00 Orf757 n=4 Tax=stramenopiles RepID=Q94Z00_PYLLI Length = 757 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. 
Identities = 104/357 (29%), Positives = 169/357 (47%), Gaps = 31/357 (8%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + R+ +++ I + + A S G TP N L + + L + Sbjct: 171 NDRLTKVIHDIASLKNITRAYESIKSKPGNMTP---SANSETLDGFGLAWVVKASNNLKA 227 Query: 74 GHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 G ++ ARRV+IPK KLRPLG+ + RD+++ A+L +EP +E F +S+ FRP Sbjct: 228 GKFKFSNARRVHIPKPGSSKLRPLGVVSPRDKVILTAVLQVLEPFYEKKFLDISHAFRPG 287 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 R H A+ ++L+ + W IEGD++ FD + H +L+ +RR I + + L+ Sbjct: 288 RGCHTALNFIQLRFGN-----SNWAIEGDIARCFDDIDHDILLGILRRDIKCDKTIALIK 342 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKARKDRWY 249 K++K ++ G+ +G QG +SP L NI L+E D ++ E Y+SG R+ Sbjct: 343 KSLKNPFVEDGVTVKPQKGTFQGSPLSPFLCNIYLHEMDLFIKGLSEDYISGTHRRKSPQ 402 Query: 250 WNNSIQRGRSTAVRENWQW------------------KPAVAYCRYADDFVLIVKGTKAQ 291 + +++ + +Y RYADDFV+ + G K Sbjct: 403 YRKIQYELSKPSLKVTERKLLNKKLRAIPSKDPVDPDFRRFSYVRYADDFVIGITGPKKD 462 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVST 347 E +R + R L L L L+MDKT I H N +G FLG R+ + R +R +S Sbjct: 463 CEEVRNKLREFLTKILALELSMDKTIISHFNQEGITFLGTRISGNKEREKVIRKISK 519 >UniRef50_UPI0001C42A66 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42A66 Length = 624 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. 
Identities = 114/374 (30%), Positives = 181/374 (48%), Gaps = 28/374 (7%) Query: 4 KLATWAATDPSLR----IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTM-LQA 58 KL + + + LL +I E + A ++KG++T G DG LQ Sbjct: 20 KLYLISKEYKDSKKHPCFKGLLEIIQSDEVILTAIHKIKANKGSNTKGTDGETIDDILQD 79 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPI 117 + +R L+ Y P RRV+I K K RPLGIPA+ DRI+Q + M +EPI Sbjct: 80 GYESVISRVRKCFLA--YNPKLLRRVHIDKQVSKDKRPLGIPAIIDRIIQECIRMIIEPI 137 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E+ F + SYGFRP RS HA+ V +T WV+EGD+ +FD V+H +L+K Sbjct: 138 LEAQFFSHSYGFRPYRSAEHALSKVT---NTAYDTNYCWVVEGDIKKFFDNVNHTILIKK 194 Query: 178 V-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + I D R + ++ ++ G + G + G PQGG+ISPLL+N L+ D ++ Sbjct: 195 LYSMGIRDRRVLMIIKAMLQCGVL--GEAEQTTVGTPQGGIISPLLANAYLDSLDHWITR 252 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + + + + + S G+ A++ KPA + RYADD+VLI +KA + Sbjct: 253 EWENKETKHEY-----SRLDGKYRALKNASNLKPA-HFVRYADDWVLIT-NSKANAIKWK 305 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIR---KRSRYGEMRVVSTIPQE- 351 + L+ LKL L+ +KT I ++ F+G + + + G + P+ Sbjct: 306 QRIAKHLKEQLKLELSEEKTLITNIKKKAIKFVGFHFKQVKNGKGKNGWVTRTEPDPKRL 365 Query: 352 --KARNFAASLTAL 363 K + +L A+ Sbjct: 366 EIKIQTIRKNLKAI 379 >UniRef50_Q5U7I7 Maturase-related protein n=20 Tax=Gammaproteobacteria RepID=Q5U7I7_ECOLX Length = 451 Score = 261 bits (666), Expect = 4e-68, Method: Composition-based stats. 
Identities = 112/374 (29%), Positives = 186/374 (49%), Gaps = 49/374 (13%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 QR + + + L+ + ++ A + +KGA G+D ++ Sbjct: 16 QRSTVNNTSNEYNQIDHDLMAKVLSNHNISAAWQHVKRNKGA--AGIDNMSIEEFNDFAK 73 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 + ++ +LL+G YQPLP +RV IPK +G R LGIPA+ DR++Q+A+ + P +E Sbjct: 74 LHWLGIKQQLLNGSYQPLPVKRVMIPKPDGGERMLGIPAVIDRVIQQAIAQVISPYFEPQ 133 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F SYG+RP + A+ V+ C + + ++ DLS +FD V H +LM V R+ Sbjct: 134 FSPHSYGYRPHKRASQAVNHVQ----SCVKQGYKTAVDIDLSKFFDEVDHDMLMNRVSRK 189 Query: 182 ISDARFMTLLWKTIKAGH--IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 I D M LL K ++AG + GL+ +++GVPQGG +SPLLSNI+L+E D+ L ++L Sbjct: 190 IKDKALMRLLGKYLRAGIAERETGLWFESTKGVPQGGPLSPLLSNILLDELDKKLTYKHL 249 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + RYADD +++VK TK++ I+ E Sbjct: 250 K---------------------------------FARYADDIIILVK-TKSEGLIIQREI 275 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + LKL++N K+++ V+ G FLG RYG+++ + +K + Sbjct: 276 TAFITKRLKLKVNESKSRVGPVS-GSKFLGFTF-----RYGQVQ-IHEQALKKFKANVRE 328 Query: 360 LTALLWKVRISGEI 373 LT W + ++ +I Sbjct: 329 LTNRNWGISMTLQI 342 >UniRef50_B2JXR4 RNA-directed DNA polymerase n=10 Tax=Bacteria RepID=B2JXR4_BURP8 Length = 503 Score = 260 bits (665), Expect = 4e-68, Method: Composition-based stats. 
Identities = 108/388 (27%), Positives = 171/388 (44%), Gaps = 55/388 (14%) Query: 1 MQRKLATWAAT---DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 +Q ++A A D + +QRLL + L A + ++G TPGVDG + Sbjct: 36 LQARIAKAAREGRWDKAKVLQRLLTRSHSAKML--AVKRVTENRGKRTPGVDGRVWSSSA 93 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 A+ L L Y+ +P RR+YIPKSNGK RPLGIP +R R +Q +A+EPI Sbjct: 94 AKWKGML-----SLRHRGYRAMPLRRIYIPKSNGKKRPLGIPCMRCRSMQALWKLALEPI 148 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E+ SYGFRPERS AI L WV+EGD+ FD H +K Sbjct: 149 AETLADANSYGFRPERSTADAIEQCFTVLAR--RISPEWVLEGDIRGCFDNFSHSWFLKH 206 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + +L K ++AG+ID G + G PQGG+ISP+++N+ L+ + +H Sbjct: 207 IPMD------KVILRKWLEAGYIDEGTLFESRAGTPQGGIISPVIANMALDGLEAAVHA- 259 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 S + + ++ RYADDFV+ + Sbjct: 260 ---------------------SVGTSARARKRAQLSVIRYADDFVVTGVSKDVLELKVLP 298 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 R + L L+ +KT+I H+ GF FLG + + + +++ ++ ++ Sbjct: 299 AVRQFMAVR-GLELSEEKTRITHIAAGFDFLGQNVRKY-----DGKLLIKPAKKSIKSLT 352 Query: 358 ASLTALLWK---------VRISGEILLG 376 + A++ +R ++ G Sbjct: 353 DKVGAIIKGNASATQEALIRQLNPVIRG 380 >UniRef50_A0L945 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L945_MAGSM Length = 433 Score = 260 bits (664), Expect = 6e-68, Method: Composition-based stats. 
Identities = 129/375 (34%), Positives = 187/375 (49%), Gaps = 47/375 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 ++RK+ A D S R L + +PE L A + +KGA PG+DGV ++ Sbjct: 11 LRRKIYRKAKIDKSWRFWGLYHHVCKPETLNTAYEMARKNKGA--PGIDGVTFEAIEESG 68 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L +R EL+SG Y+PL RR IPK +GK R LGIP++RDR+VQ A+ + +EPI+E Sbjct: 69 VEQFLGEVRKELVSGSYRPLKNRRKAIPKGDGKERVLGIPSIRDRVVQGALKLILEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF + SYG+RP+R H A+ V + + VI+ DL SYFDTV H L ++ V Sbjct: 129 ADFQSGSYGYRPKRMAHQAVNRVAIAIAQGKTQ----VIDADLKSYFDTVQHDLALRKVS 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R+ D + M LL K GVPQGGVISPL+SN+ LNE D+ L Sbjct: 185 ERVDDDQVMHLLKLIFKTSG---------KRGVPQGGVISPLISNLYLNEVDKMLERAKE 235 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKA---QVEAIR 296 + K + Y R+ADD V++V G + Sbjct: 236 VTRKGK-----------------------YTHIEYARFADDLVILVDGHHRWNGLARKVY 272 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 + L LK++LN++KT++ + G F FLG + R+R + V IP+ KAR Sbjct: 273 QRLGEEL-AKLKVQLNLEKTRVVDLTRGEDFTFLGFNI-RQRMTLQGKQGVLCIPRMKAR 330 Query: 355 -NFAASLTALLWKVR 368 + L + +R Sbjct: 331 TSLLGRLKEVFKHLR 345 >UniRef50_C9S0G0 RNA-directed DNA polymerase n=2 Tax=Geobacillus RepID=C9S0G0_GEOSY Length = 635 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. 
Identities = 115/383 (30%), Positives = 184/383 (48%), Gaps = 32/383 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L A +L L+ + EA R S+KG+ T G+D ++ Sbjct: 21 LYQEAK--KGKHFYGMLELLQNDVVILEAIRNIKSNKGSKTAGIDQKIVDDYLLMPTEKV 78 Query: 65 QILRDELLSGHYQPLPARRVYIPKSN--------------GKLRPLGIPALRDRIVQRAM 110 + L+ Y+P+P RR PK N G+ RPLGI A+ DRI+Q + Sbjct: 79 FGMIKAKLN-DYKPIPVRRCNKPKGNAKSSKRKGNSPNEEGETRPLGISAVTDRIIQEML 137 Query: 111 LMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVH 170 + +EPI+E+ F+ SYGFRP RS HA+ + + ++ WV++GD+ SYFD ++ Sbjct: 138 RIVLEPIFEAQFYPHSYGFRPYRSTEHALAWMLKIING---SKLYWVVKGDIESYFDHIN 194 Query: 171 HRLLMKAV-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE 229 H+ L+ + + D R + ++ K +KAG + G F ++G+PQGG+ISPLL+N+ LN Sbjct: 195 HKKLLNIMWNMGVRDKRVLCIVKKMLKAGQVIQGKFYPTAKGIPQGGIISPLLANVYLNS 254 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 FD + + Y + N++ R+ + V Y RYADD+V++ TK Sbjct: 255 FDWMVGQEYEYHPNNANYREKKNALAALRN-------KGHHPVFYIRYADDWVILT-DTK 306 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRY-GEMRVVST 347 E IRE+C+ L L L L+ +KT I + + FLG + + R+ + Sbjct: 307 EYAEKIREQCKQYLACELHLTLSDEKTFIADIREQRVKFLGFCIEAGKRRFHKKGFAARM 366 Query: 348 IPQ-EKARNFAASLTALLWKVRI 369 IP EK + + +R Sbjct: 367 IPDMEKVNAKVKEIKRDIRLLRT 389 >UniRef50_A5VH22 RNA-directed DNA polymerase n=1 Tax=Sphingomonas wittichii RW1 RepID=A5VH22_SPHWW Length = 572 Score = 260 bits (663), Expect = 8e-68, Method: Composition-based stats. 
Identities = 111/327 (33%), Positives = 155/327 (47%), Gaps = 46/327 (14%) Query: 44 HTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRD 103 TPGVDG L L + G Y+P P RRVYIPK NGK+RPLGIP D Sbjct: 1 MTPGVDGQT---FDGMTLARLDRLTQGVAEGRYRPRPVRRVYIPKGNGKMRPLGIPTADD 57 Query: 104 RIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLS 163 RIVQ A M + I+E F S+GFR RS H A+ ++ T +W+IE D+ Sbjct: 58 RIVQEAARMILAAIYEPVFSKHSHGFRAGRSCHTALEEIRRTW-----TGAKWLIEVDVR 112 Query: 164 SYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLS 223 +FD + H +L+ + RRI D F+ L+ +KAG +D F G PQGGVISPLL+ Sbjct: 113 GFFDNIDHDILLSLLARRIDDPVFIDLIGTMLKAGCMDEWKFERTYSGTPQGGVISPLLA 172 Query: 224 NIMLNEFDQYLHE---RYLSGKARKDRWYWNNSIQ------------------------- 255 NI L+E D ++ E R+ G R+ + Q Sbjct: 173 NIYLHELDLFMEEMRARFDKGVKRRANPVYVVQSQKIAALRKEIDAIRAVGADEAEVRTR 232 Query: 256 ----------RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 R + ++V + + YCRYADDF++ V G+KA I + + L Sbjct: 233 LARIEAINRDRRKISSVDQMDPNFRRLRYCRYADDFLVGVIGSKADAVRIMADIQHFLAD 292 Query: 306 SLKLRLNMDKTKIPHVNDGFIFLGHRL 332 L L ++ +KT + + G FLG + Sbjct: 293 RLNLTVSPEKTGVRDASRGSPFLGFHV 319 >UniRef50_A1ZX33 Group II intron-encoded protein LtrA n=2 Tax=Bacteria RepID=A1ZX33_9SPHI Length = 594 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. 
Identities = 97/347 (27%), Positives = 164/347 (47%), Gaps = 33/347 (9%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 + L I+R+ RL+ P A +KGA T G+ ++Q + +L Sbjct: 16 ERKLPIERVYRLLYNPNLYLLAYSNLYGNKGALTSGI---TPETADGMSLDKIQDIICKL 72 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 Y+ P++R++IPK NG+ RPL IP D+++Q + + +E +E F S+GFR Sbjct: 73 KQESYRWKPSKRIFIPKKNGQPRPLSIPCWSDKLLQEVIRLILEAYFEPQFCESSHGFRT 132 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 R H A++ ++L+ +W IEGD+ FD ++H+L++K + ++ D RF+ L+ Sbjct: 133 GRGCHSALKQMRLK-----GKGSKWFIEGDIQGCFDNINHQLIIKLLSDKLYDPRFIRLI 187 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 + +K G+I+ + GVPQG +I P+L+NI+LNE D+++ + + + R Sbjct: 188 SQLLKTGYIEGWKYNKTYSGVPQGSIIGPILTNIVLNELDKFVENKLIPANTKGKRRRSC 247 Query: 252 NSIQRGRSTAVRENWQWK------------------------PAVAYCRYADDFVLIVKG 287 + A + Q + Y RYA+D +L G Sbjct: 248 PKYALIKRQASKARKQGDMDKCRELNKQAQKIPSRDTNDPKYRRLWYIRYANDTLLGYIG 307 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLI 333 K + I+E+ L L L LN DKT I H + FLG+ + Sbjct: 308 KKEEAIKIKEQIADFLANELHLTLNSDKTLITHAQSQKASFLGYHIR 354 >UniRef50_Q3S275 ORF718 n=2 Tax=Eukaryota RepID=Q3S275_THAPS Length = 718 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. 
Identities = 105/358 (29%), Positives = 188/358 (52%), Gaps = 21/358 (5%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + + L +++ P +L A S+ G+ T ++K L L+ + + + Sbjct: 157 NKKCVNLSSIMSDPNFLIAAWARIRSNSGSLTF---ALSKETLDGIALSWLEETANTMRN 213 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G +Q P+RR YI KS+G RPL IP+ RD+IVQ AM + ++E DF S+G+ R Sbjct: 214 GIFQFSPSRRTYISKSDGGKRPLTIPSPRDKIVQEAMRFLLMLVFEGDFSKNSHGWVSGR 273 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H A+ +K++ W+IEGD+ F +++H++L+ ++ +I D F+ L++K Sbjct: 274 GCHTALNQIKMEFA-----HDNWLIEGDIDQQFPSLNHQVLVNLLKTKIDDQAFIDLIYK 328 Query: 194 TIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE----RYLSGKARKDRW 248 ++ G+ + G QGGV+SP+L+NI + FD+++ +Y GK RK Sbjct: 329 YLRVGYGESPDKIVKMRIGTSQGGVLSPVLANIYMTPFDKWVERDLIPKYTKGKRRKANP 388 Query: 249 YWNNSIQRGRST-----AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + I+ G+ T ++ + + + Y RYADDF++ + G K + I +EC+ L Sbjct: 389 VYTKMIRSGKVTDHSIPSLYAHDRNFIRLHYVRYADDFIMGLNGPKVYCKQIVDECKTFL 448 Query: 304 EGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360 LKL LN++KTKI H D FLG+R+ +++ +M++ + + +R ++ Sbjct: 449 FEQLKLTLNIEKTKITHSQLDSATFLGYRVY--KTKLSKMKIAHNLKGQLSRRTTNTV 504 >UniRef50_B7CEC9 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CEC9_9FIRM Length = 458 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. 
Identities = 102/364 (28%), Positives = 165/364 (45%), Gaps = 39/364 (10%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 S R +L L+ + + R K PG+D V K + Q L + L L S Sbjct: 40 SKRYPKLETLMYRVDK-ESLIRQHRLQKKDKAPGIDMVTKEVYQENLNENIDDLMHRLKS 98 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y+P P RRV I K NGK RPLGIP DR+ Q AM + ++E F SYGFRP R Sbjct: 99 FSYKPQPVRRVEIDKGNGKKRPLGIPVYEDRLFQGAMADILSDVYEPRFLDCSYGFRPNR 158 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H AI+ + + + ++++ D+ +FD V+H LMK +R I+D ++ + + Sbjct: 159 KAHDAIKVINDTV---MHKKINYILDCDIKGFFDNVNHEWLMKFLRNDIADPNYLKYIAR 215 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNN 252 +K+G + G + S G PQGG+ISP+L+N+ L+ D + + Sbjct: 216 MLKSGVMIEGKYEDTSVGTPQGGLISPILANVYLHYVLDLWFEKCIKK------------ 263 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 Q R+ADDF+++ + + + + E +E L L+ Sbjct: 264 --------------QLCGEAYLVRFADDFLIMFQYER-DAQRVYEAVINRME-LFGLELS 307 Query: 313 MDKTKI------PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 +KT+I + F FLG ++R G V I ++K + F ++L + + Sbjct: 308 KEKTRILPFGRYSDSRETFDFLGFTHFNSKTRKGYYSVGHKISRKKKKQFKSNLKKWVKE 367 Query: 367 VRIS 370 R + Sbjct: 368 NRNT 371 >UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0JX80_MICAN Length = 613 Score = 258 bits (658), Expect = 3e-67, Method: Composition-based stats. 
Identities = 104/395 (26%), Positives = 176/395 (44%), Gaps = 64/395 (16%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ A + +++RL RL+ + + L ++T ++G T GVDG+ + Sbjct: 37 LQKRIYQAAKSGQDAKVRRLQRLLVKSYYARLLAVRKVTQDNQGKKTAGVDGMIAISPEQ 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 RL L +E+ G + P RRV+IPK + RPLGIP ++DR Q + A+EP Sbjct: 97 RLN-----LTEEIK-GTLKAKPLRRVWIPKPGRDEKRPLGIPTIKDRARQALIKSALEPE 150 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WES SYGFRP RS H AI + + + +V++ D++ FD ++H L+ Sbjct: 151 WESKMEGTSYGFRPGRSDHDAISRIYITIN----QSSYFVLDADIAKCFDRINHDFLLSK 206 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + + + +KAG +D G+F G PQGGVISPLL+NI L+ + + Sbjct: 207 I---HCPSSLKRDIKQWLKAGVLDNGVFEETETGTPQGGVISPLLANIALDGMARLIETL 263 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + + K RYADDFV+I + +E + Sbjct: 264 FP------------------------KKGNGKNQAVLIRYADDFVVISPSLE-IIEQCKT 298 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND-----------GFIFLGHRLIR------KRSRYG 340 L+ + L L +KT++ H GF FLG + + K + G Sbjct: 299 AISEWLK-PIGLELKPEKTRVCHTLKPIEYNGKMEEPGFDFLGFNIRQYPVGKYKSGKDG 357 Query: 341 EMRVVST-----IPQEKARNFAASLTALLWKVRIS 370 R++ Q+ + ++ ++ K + + Sbjct: 358 AKRLIGHKTHIKPSQKAVKTHTEAIKGVIKKHKTA 392 >UniRef50_B1L2I7 Reverse transcriptase/endonuclease protein n=37 Tax=Firmicutes RepID=B1L2I7_CLOBM Length = 607 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. 
Identities = 114/378 (30%), Positives = 187/378 (49%), Gaps = 31/378 (8%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +++++ ++++L RL+ + L R+T +KG T G+DG Sbjct: 34 LRQRIFRAEQLGQKRKVKKLQRLMLRSKANLLISIKRVTQINKGKRTAGIDGFKVIT--E 91 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++L + + + PA+R YIPK NGKLRPLGIP ++DRI Q + A+EP W Sbjct: 92 WDRIKLFNSLKDYSIKNIKSQPAKRTYIPKKNGKLRPLGIPIIKDRIYQNIVKNALEPQW 151 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 ES F +++YGFRP+RS H AI + L+L + + W+ EGD FD ++H +M+ + Sbjct: 152 ESKFESIAYGFRPKRSTHDAIEQLYLKLRKGSKRQ--WIFEGDFKGCFDNLNHEYIMECI 209 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +D +++ +KAG+ID +FR +EG PQGG+ISPLL+NI L+ ++ L +Y Sbjct: 210 ----NDFPAKEAVYRWLKAGYIDNNVFRNTNEGTPQGGIISPLLANIALHGMEEELGVKY 265 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 K + ++ +YADDFV++ K TK + E + E Sbjct: 266 QFTKR-------------------QGYCLRDNSIGIVKYADDFVILCK-TKEEAETMYER 305 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 L+ L L DKT I H++ GF FLG + + + G ++ + + Sbjct: 306 LSPYLKKR-GLELAEDKTGITHISKGFDFLGFNIRQYKKIKGMTLLIKPSKASMKKAKKS 364 Query: 359 SLTALLWKVRISGEILLG 376 S E+++G Sbjct: 365 IKEVFERYRGNSVEVIIG 382 >UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=10 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N9_HAMD5 Length = 570 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. 
Identities = 106/369 (28%), Positives = 176/369 (47%), Gaps = 37/369 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLI--TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +++++ +AT +++ L +L+ ++ L ++T ++G HT GVD N+ + Sbjct: 32 LRQRIYRASATGDLKKVRNLQKLMMKSRANHLLAIRKVTQVNRGKHTAGVD--NQVINDH 89 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + L L + S + P +RVYI K NGK RPLGIP + DR Q + A+EP W Sbjct: 90 KGREHLYKLLSQTTSE--KVYPVKRVYIAKKNGKKRPLGIPTILDRCRQAIVKSALEPYW 147 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F +SYGFRP RS H AI+ + G WV++ D+ FD + H L+K + Sbjct: 148 EAKFEPVSYGFRPGRSAHDAIQKIFCIARARGTRH--WVLDADIKGAFDNIDHNFLIKKI 205 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 ++ + ++AG ++ G + G PQGG+ISPLL+NI L+ + L +Y Sbjct: 206 ----GGFPERNMIKQWLQAGVLEHGNYIPNVAGTPQGGIISPLLANIALHGMETLLGIQY 261 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 K + A RYADDFV+ K ++ + E + + Sbjct: 262 WKNGTPKQGQPY----------------------AVVRYADDFVVFGK-SREECETAKIK 298 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRY-GEMRVVSTIPQEKARNFA 357 + L L L+ +KT I H+ +GF FLG + +R+ ++ T P +++ Sbjct: 299 LQIWLAQR-GLALSEEKTSIKHLKEGFDFLGFNIRHYDNRHRKRGYILLTKPSKESMKRY 357 Query: 358 ASLTALLWK 366 + WK Sbjct: 358 KQQMRMTWK 366 >UniRef50_Q93PB4 MS117, putative maturase n=1 Tax=Microscilla sp. PRE1 RepID=Q93PB4_9SPHI Length = 462 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. 
Identities = 105/363 (28%), Positives = 170/363 (46%), Gaps = 48/363 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 + L+ + L + + + + G+ PGVDG+ L+ + Q L ++L G+Y+ Sbjct: 43 RGLMYKVCDLSNLTASLKQVVKNGGS--PGVDGMQVKELRYWFSNNHQKLIEQLKEGNYR 100 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P+ + IPK G +R LGIP ++DR+VQ+A+ + ++ F SYGFR R+ H Sbjct: 101 PMTIKGQEIPKPGGGVRQLGIPTVQDRLVQQAIAQQLSKRYDPTFSQYSYGFRKGRNAHQ 160 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+R + + +V++ DL +FD V+H LM + RRISD R + L+ K +++ Sbjct: 161 ALRQAGAYVKE----GFNYVVDLDLEKFFDKVNHDRLMWLLGRRISDKRVLKLIGKFLRS 216 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G + GL G PQG +SPLLSNI+L+E D+ L R Sbjct: 217 GILIGGLENQRISGTPQGSPLSPLLSNIVLDELDKELERR-------------------- 256 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD +L+V+ +A E +E L L++N DK++ Sbjct: 257 -------------GHRFVRYADDMILLVRSQEA-AERAYSSITSFIENRLLLKVNKDKSR 302 Query: 318 IPHVNDGFIFLGHRLIRKR----SRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 I FLGH ++ SR E R + + RN SL ++ ++ + Sbjct: 303 ICRPYQ-LNFLGHSIMWDGKLGLSRQSEQRFKEKVKKVTRRNRGISLEQMVKEL---NRV 358 Query: 374 LLG 376 L G Sbjct: 359 LRG 361 >UniRef50_B1I9Z1 GBSi1, group II intron, maturase n=33 Tax=Firmicutes RepID=B1I9Z1_STRPI Length = 425 Score = 256 bits (655), Expect = 6e-67, Method: Composition-based stats. 
Identities = 106/350 (30%), Positives = 159/350 (45%), Gaps = 46/350 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 + +LL I E + EA S+KG+ G+DG+ + L ++ ++ + Y Sbjct: 1 MSKLLDKILSRENMLEAYNQVKSNKGS--AGIDGMTIEEMDNYLRQNWRLTKELIKQRKY 58 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 +P P +V IPK +G +R LGIP + DR++Q+A++ + PI E F +SYGFRP RS Sbjct: 59 KPQPVLKVEIPKPDGGIRQLGIPTVMDRMIQQAIVQVISPICEPHFSDMSYGFRPNRSCE 118 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 AI L D W+++ DL +FDTV LM V I D +L+ K + Sbjct: 119 KAIMKFLEYLND----GYEWIVDIDLEKFFDTVPQDRLMSLVHNIIEDGDTESLIRKYLH 174 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 +G I G G PQGG +SPLLSN+MLNE D+ L +R Sbjct: 175 SGVIINGQRHKTLVGTPQGGNLSPLLSNVMLNELDKELEKR------------------- 215 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 + + RYADD V+ V G++A + + +E L L++NM K Sbjct: 216 --------------GLRFVRYADDCVITV-GSEAAAKRVMYSASRFIEKRLGLKVNMTKA 260 Query: 317 KIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 KI + +LG + + S Q+ R F L L + Sbjct: 261 KITRPRE-LKYLGFGFWKSSDGWK-----SRPHQDSVRRFKLKLKKLTQR 304 >UniRef50_B0K6R3 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Thermoanaerobacter RepID=B0K6R3_THEPX Length = 427 Score = 256 bits (655), Expect = 7e-67, Method: Composition-based stats. 
Identities = 101/368 (27%), Positives = 165/368 (44%), Gaps = 47/368 (12%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 ++Q L+ + PE L K GVD V + ++ L ++ Sbjct: 17 KVQNLISYV-NPETLKAKHEEMPKKK---ASGVDKVTWEEYDVNVDENVETLIAKMKRFS 72 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y+P PARRVYIPK+NGKLRPLGIP D++V M + ++E+ F SYGFRP RS Sbjct: 73 YRPQPARRVYIPKANGKLRPLGIPCYEDKLVAAVMADILNEVYENIFLDTSYGFRPGRSC 132 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H AI+ + + C + +V+E D+ +FD V + LM+ + I D F + + + Sbjct: 133 HDAIKELNRIIGRC---KISYVLEADIKGFFDNVDQKQLMEFIAHDIDDKNFSRYIVRFL 189 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSI 254 K+G ++ G + + +G QG +SP+L+NI L+ D + Sbjct: 190 KSGIMEEGKYHESDKGTAQGSPLSPILANIYLHYTLDVWFAY------------------ 231 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 ++ N +++ RYADDFV++ + K+ + + E + L L MD Sbjct: 232 -------LKRNGKFRGEAYIVRYADDFVMLFQ-YKSDADKMYEALPKRM-AKFGLELAMD 282 Query: 315 KTKIPHV------------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 KTKI + F FLG +R G+ R ++K + + A Sbjct: 283 KTKILPFGRFAKQNSKDGKTETFDFLGFTFSNGTTRNGKYRAHIQTNKKKLKAKRQVVKA 342 Query: 363 LLWKVRIS 370 L + + + Sbjct: 343 WLKEQQHA 350 >UniRef50_Q35062 CoxI intron2 ORF n=2 Tax=Marchantia polymorpha RepID=Q35062_MARPO Length = 802 Score = 256 bits (654), Expect = 9e-67, Method: Composition-based stats. 
Identities = 116/400 (29%), Positives = 196/400 (49%), Gaps = 50/400 (12%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 A P R L+ +I+ P +LA G T G D + + Sbjct: 299 AYLGPDNRYNGLIHIISDPTFLALCYESIRGKPG--TSGSDAKPLDGPE-----WFVQVG 351 Query: 69 DELLSGHYQPLPARRVYIPKSNGK-LRPLGI-------PALRDRIVQRAMLMAMEPIWES 120 ++L G ++ PARR I K K RPLGI ++IVQ+A+ + +E I+E Sbjct: 352 EKLKKGQFEFSPARR--ITKPGKKEKRPLGINSPVKQKKCYGEKIVQKALQLVLEAIYEP 409 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 F S+GFR RS H A++ + L+ WV+EG++ +FD++ H++++ + + Sbjct: 410 IFLDCSHGFRIHRSCHTALKRLCLE-----GGHYPWVVEGNIRKFFDSIPHKVILHKISQ 464 Query: 181 RISDARFMTLLWKTIKAGHIDV--GLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LH 235 ++ R + LL + ++AG+ D G + EG QG V+SPLL NI+L+ D++ L Sbjct: 465 KVKCHRTLELLQRALRAGYKDPTSGQVISLDEGTSQGSVLSPLLCNIILHYLDEFVMKLR 524 Query: 236 ERYLSGKARKDRWYWN------NSIQRGRSTAVRENW--------QWKPAVAYCRYADDF 281 +R+ GK+R+ + N+ ++ RS ++ + + Y RYADDF Sbjct: 525 DRFNKGKSRRINPEYKLLTRHMNANRQDRSLLIKRRLIPSKDPLDPYFRRILYVRYADDF 584 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYG 340 V++V GT+ + AI+ + L SL+L L+++KT + H+ N GF FLG R RSR+ Sbjct: 585 VILVSGTRLETFAIQASLQNFLHRSLRLELSLEKTVVSHLANKGFHFLGTYCKRTRSRHR 644 Query: 341 -------EMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 + + E+ R A +T L +K++ G + Sbjct: 645 IFHVRTVRGKTIKQRSTERLR-VCAPITKLFYKLKEKGFV 683 >UniRef50_P03876 Putative COX1/OXI3 intron 2 protein n=3 Tax=Saccharomycetaceae RepID=AI2M_YEAST Length = 854 Score = 256 bits (653), Expect = 1e-66, Method: Composition-based stats. 
Identities = 98/353 (27%), Positives = 169/353 (47%), Gaps = 32/353 (9%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 R+L+L++ L A S KG + G + + + L L ++ + ++ Sbjct: 284 RILKLMSDIRMLLIAYNKIKSKKGNMSKGSNNITLDGINI---SYLNKLSKDINTNMFKF 340 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRV IPK++G RPL + R++IVQ +M M +E I+ + F S+GFRP S A Sbjct: 341 SPVRRVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSFSYYSHGFRPNLSCLTA 400 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I K + C W I+ DL+ FDT+ H +L+ + RI D FM LL+K ++AG Sbjct: 401 IIQCKNYMQYCN-----WFIKVDLNKCFDTIPHNMLINVLNERIKDKGFMDLLYKLLRAG 455 Query: 199 HID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD----------- 246 ++D + + G+PQG V+SP+L NI L++ D+YL ++ + + Sbjct: 456 YVDKNNNYHNTTLGIPQGSVVSPILCNIFLDKLDKYLENKFENEFNTGNMSNRGRNPIYN 515 Query: 247 -----RWYWNNSIQRGRSTAVRENWQ-------WKPAVAYCRYADDFVLIVKGTKAQVEA 294 + ++ + +R+++Q + RYADD ++ V G+ + Sbjct: 516 SLSSKIYRCKLLSEKLKLIRLRDHYQRNMGSDKSFKRAYFVRYADDIIIGVMGSHNDCKN 575 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVST 347 I + L+ +L + +NMDK+ I H +G FLG+ + R++ Sbjct: 576 ILNDINNFLKENLGMSINMDKSVIKHSKEGVSFLGYDVKVTPWEKRPYRMIKK 628 >UniRef50_B3PDY2 Putative maturase n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PDY2_CELJU Length = 402 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. 
Identities = 98/356 (27%), Positives = 158/356 (44%), Gaps = 55/356 (15%) Query: 26 QPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVY 85 + + A ++ G G D V M + L EL L + + SG Y P +RV Sbjct: 7 SKQLVYAAWLKVKANAG--VAGADNVCIDMFEHNLENELYKLWNRMSSGSYMAPPVKRVE 64 Query: 86 IPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQ 145 + K++GKLRPLGIP + DR+ Q + M +EP W+S FH S+G+RP RS HHA++ K+ Sbjct: 65 MAKADGKLRPLGIPTVADRVAQMVVKMTLEPEWDSKFHASSFGYRPRRSAHHAVQAAKIN 124 Query: 146 LTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH-IDVGL 204 + WVI+ D+ +FD ++H L K V + D + + I AG + G Sbjct: 125 CW-----KYSWVIDLDIKGFFDNLNHDQLQKFVAQATDDPWCKLYIKRWITAGVQMPGGE 179 Query: 205 FRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 ++G PQGGVISPLL+N+ L++ FD ++ + + Sbjct: 180 LHKTAKGTPQGGVISPLLANLYLHKVFDSWMQKYF------------------------- 214 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN- 322 P + RYADD V + T+ + E + ++ L L+ +KTKI + Sbjct: 215 ------PQNPFERYADDIVCHCR-TEHEAEQLLSAISRRMQ-RFDLTLHPEKTKIVYCGR 266 Query: 323 --------DGFIFLGHRLIRKRSRYGEMR----VVSTIPQEKARNFAASLTALLWK 366 F FLG R+ + + + V I + + ++ + Sbjct: 267 RKIERTKAQSFDFLGFTFRRRTVKRKDGKLVPGFVPAISNKAKKAIVKTMREWNVR 322 >UniRef50_Q35056 CoxII intron2 ORF n=4 Tax=Embryophyta RepID=Q35056_MARPO Length = 827 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. 
Identities = 113/361 (31%), Positives = 179/361 (49%), Gaps = 21/361 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + L + + + A + + G+ PG G+ R +L Sbjct: 319 LWISSFRRRDWIYHDLSNYLKSMDIWSIAYQKLRPNPGSMNPGTHGLTIDGTSFR---KL 375 Query: 65 QILRDELLSGH--YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 Q LRD +L Y+ + + P K+ LGIP +DRIVQ + M +EPI+ES F Sbjct: 376 QALRDAVLDSESPYEWGGTKIITKPGKREKI-SLGIPCFQDRIVQEVLKMLLEPIYESIF 434 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+G+RP RS H A+RT++ + W++ G+++ FD V+H +L +RR+I Sbjct: 435 SRRSHGWRPGRSAHTALRTIRSDFK-----KTNWIVPGNINKLFDIVNHGILCHIMRRKI 489 Query: 183 SDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER---Y 238 D + + L+ +KA H+ G ++ G PQG ++SP+LSNI L+EFD ++ ER Y Sbjct: 490 RDKKLLKLIAGGLKAKIHMPYGNIEESNLGTPQGRILSPILSNIYLHEFDIWIEERIQQY 549 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENW--QWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 G+ W + R +R + + Y RY DDF++ ++G + +AIR Sbjct: 550 NLGRKETRSWVLLRKQGKMRKARLRSDPFNPLYRRMEYRRYGDDFLIAIRGPLSDAKAIR 609 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK----RSRYGEMRVVSTIPQEK 352 +EC L LKL LNM+KT I H++ G FLGHR+ R+ + RY ++K Sbjct: 610 QECETFLREKLKLLLNMEKTHIKHISVGIPFLGHRIGRRVVHTKQRYQTQEGWRWRIKKK 669 Query: 353 A 353 Sbjct: 670 V 670 >UniRef50_A8ZN56 RNA-directed DNA polymerase n=2 Tax=Cyanobacteria RepID=A8ZN56_ACAM1 Length = 432 Score = 255 bits (650), Expect = 3e-66, Method: Composition-based stats. 
Identities = 108/380 (28%), Positives = 159/380 (41%), Gaps = 50/380 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++A+ A P L+ E L G GVDGV K Q L Sbjct: 8 RIASRARNHPEEPFTALMHH-YSVENLRACFESL---DGNKALGVDGVTKAEYQENLETN 63 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 LQ L +L Y+P P R+V IPK +G +RPLGI D++VQ +E I+E F Sbjct: 64 LQNLHLKLRQMSYRPQPVRQVEIPKEDGSMRPLGISCTEDKVVQEMTRRILEAIYEPVFI 123 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGFRP+RS H A+R + + WV + DL+ +FDT+ H+ ++ + RI Sbjct: 124 DTSYGFRPKRSCHDALRQLN---REVMRKPVNWVADIDLAKFFDTMPHQEILSVLSIRIK 180 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D + L+ + +KAG G G PQG ++SP+++NI L+ DQ+ Sbjct: 181 DGNLLRLIARMLKAGIQTPGGVVYDELGSPQGSIVSPVIANIFLDYVLDQWF-------- 232 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + VR + + A+ RYADD + + + + +R Sbjct: 233 ----------------TNVVRHHCRGYCAI--IRYADDVAAVFEHEEDAIRFMR--VLPR 272 Query: 303 LEGSLKLRLNMDKTKIPHVND--------------GFIFLGHRLIRKRSRYGEMRVVSTI 348 LRLN KT + F FLG RSR G +R+ Sbjct: 273 RLEKYGLRLNTKKTHLLAFGKRNARRCFQTGQRPSTFDFLGLTHYWGRSRKGYVRMKRKT 332 Query: 349 PQEKARNFAASLTALLWKVR 368 +++ R L L KVR Sbjct: 333 SKKRLRRSLKQLKMWLRKVR 352 >UniRef50_D0LS09 RNA-directed DNA polymerase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LS09_HALO1 Length = 449 Score = 254 bits (649), Expect = 3e-66, Method: Composition-based stats. 
Identities = 113/377 (29%), Positives = 167/377 (44%), Gaps = 53/377 (14%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 +A A P + + L I WL EA R T + PG+DG L L Sbjct: 17 IAKRAREMPEVALTTLAHHI-DLVWLREAYRRTRKN---AAPGIDGQTGRAYAEALESNL 72 Query: 65 QILRDELLSG-HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + L + G Y+ +P RRV IPK +G++RPLGIP D+++QRA++M +E ++E DF Sbjct: 73 ESLLERAKDGDRYRAMPVRRVAIPKGDGRMRPLGIPTFEDKVLQRAVVMVLEAVYEQDFL 132 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYG+R RS H A++ V+ + RG WV+E D+ ++FDTV H L + +R+R+ Sbjct: 133 DCSYGYRRGRSAHDAVKAVRAH---TMKLRGGWVLEADIEAFFDTVDHAKLREILRQRVR 189 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D + + K + AG ++ G G PQGGVISP+L+NI LNE DQ+ Sbjct: 190 DGVLLRWIGKWLNAGVMEEGNVYYPEGGTPQGGVISPVLANIFLNEVIDQWFEHVVRP-- 247 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + K RYADD V+I + + + Sbjct: 248 ------------------------RLKGQGYLVRYADDLVMIFE-REDDARRVETVLPKR 282 Query: 303 LEGSLKLRLNMDKTK-IPHVNDG---------------FIFLGHRLIRKRSRYGEMRVVS 346 L LR++ +KT+ I + G F FLG RSR G V+ Sbjct: 283 L-SKYGLRIHPEKTRLIQFLRPGYGTRPTRRDGNRPGTFDFLGFTHYWARSRKGSWVVMQ 341 Query: 347 TIPQEKARNFAASLTAL 363 ++ R Sbjct: 342 KTAAKRLRRALGRFVEW 358 >UniRef50_C5D9G3 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Firmicutes RepID=C5D9G3_GEOSW Length = 605 Score = 253 bits (645), Expect = 9e-66, Method: Composition-based stats. 
Identities = 104/358 (29%), Positives = 173/358 (48%), Gaps = 18/358 (5%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNK-TMLQARLAVELQILRDELLSG 74 R + L+ + + + A +++G+ T G DG +L + ++ + Sbjct: 16 RFKGLVEIASSDVVIVSAIHKIKANQGSKTAGTDGQTINDILTKNYDEVINFVKRCFKN- 74 Query: 75 HYQPLPARRVYIPKSNGKLRP-LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y+P RRVYIPK K + LGIP + DR +Q + M +EPI E+ F SYGFRP R Sbjct: 75 -YKPKLIRRVYIPKPGKKKKRPLGIPTIADRTIQECVRMTIEPILEAQFFQHSYGFRPYR 133 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTLLW 192 AI + C WVIEGD+ +FD V+H +L+K + I D R + ++ Sbjct: 134 DTKQAIERC---VFICNRIGYNWVIEGDIKGFFDNVNHTILIKQLWHMGIRDRRMLMIIK 190 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 +KAG + + G PQGG+ISPLL+N+ L++ DQ++ + K R Sbjct: 191 AMLKAGVM--KETKVNEIGTPQGGIISPLLANVYLHKLDQWITREWEEKKMRN-----GT 243 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 +I+ + ++R + Y RYADD+VL ++ E + + L+ +LKL L+ Sbjct: 244 AIRTSKFNSLRNHSTITRPEFYVRYADDWVL-FTDSRENAEKWKYRIKKYLKENLKLELS 302 Query: 313 MDKTKIPHVNDGF-IFLGHRL-IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 DKT I ++ FLG ++ + + G+ ++ EK + + L K++ Sbjct: 303 DDKTLITNIKKKPMKFLGFKIKMLPHGKNGKYVGYASADTEKVKRKVEQIKKDLRKLK 360 >UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcriptase) n=85 Tax=Bacteria RepID=B7K703_CYAP7 Length = 661 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. 
Identities = 106/353 (30%), Positives = 167/353 (47%), Gaps = 38/353 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ A+ ++RL + + + + R+T ++G T GVDGV + Sbjct: 37 LQKRIYQAASRGNVRTVRRLQKTLLRSWSAKMLAVRRVTQDNQGKKTAGVDGVKSLTPKQ 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 R+ + Q L + P RRV+IPK + RP IP + DR +Q + +A+EP Sbjct: 97 RMNLVGQ------LKLTCKTKPTRRVWIPKPGKDEKRPFLIPCMSDRALQALVKIALEPE 150 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP R H AI + QL + ++V++ D+S FD ++H L++ Sbjct: 151 WEAKFEPNSYGFRPGRGCHDAIGAIFNQLG----AKAKYVLDADISKCFDKINHEKLLQK 206 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + + +KAG +D EG PQGGV+SPLL+NI L+ ++ + Sbjct: 207 LN---TFPTLRRQIRAWLKAGVMDGNKLFPTEEGTPQGGVVSPLLANIALHGMEEIIKS- 262 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + + R + +++ RYADDFVLI + A VE +E Sbjct: 263 ------------FAQNPGELRQEFSNRGKGREQSISLIRYADDFVLIHESL-AVVEKGKE 309 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND------GFIFLGHRLIR-KRSRYGEMR 343 L L L L +KT+I H D GF FLG + KRSR+ M+ Sbjct: 310 IIETWLR-ELGLTLKPEKTQITHTLDKHQGKVGFNFLGFNIRHYKRSRHKSMK 361 >UniRef50_Q47DU4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Dechloromonas aromatica RCB RepID=Q47DU4_DECAR Length = 429 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. 
Identities = 115/374 (30%), Positives = 172/374 (45%), Gaps = 48/374 (12%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR-LAVE 63 L A ++ R L I + + LA A S+KGA PGVD + ++A + Sbjct: 3 LHAKAKSESGYRFYALYDKIYRTDILAHAYAQCRSNKGA--PGVDRQDFEDVEAYGVRRW 60 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 L+ L L Y+P P RRV+IPK+NGKLRPLGI L DR+ A ++ +EPI+E+D Sbjct: 61 LEELALALKEESYRPDPIRRVFIPKANGKLRPLGISTLHDRVCMTAAMLVLEPIFEADLP 120 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 Y +RP R+ A VK +L V++ DLS YF ++ H LMK++ RRI Sbjct: 121 DEQYAYRPGRNAQQAAEEVKNRL----YLGQTDVVDADLSDYFGSIPHSELMKSLARRIV 176 Query: 184 DARFMTLLWKTIKAGHIDV---GLFRAASE------GVPQGGVISPLLSNIMLNEFDQYL 234 D R + L+ ++ + G + +E G+PQG ISPLLSN+ + F Sbjct: 177 DRRVLHLIKMWLECAVEETDQRGRKKRTTEAKDQGRGIPQGSPISPLLSNLYMRRF---- 232 Query: 235 HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 + RS R YADD V++ K K E Sbjct: 233 -------------VLAWKKLGLERSLGSR----------IVTYADDLVILCKCGK--AEE 267 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRS-RYGEMRVVSTIPQEK 352 + R ++ G LKL +N +KT+I V G F FLG+ R+ R G+ ++ ++ Sbjct: 268 ALQWMRTIM-GKLKLTVNEEKTRICQVPAGTFDFLGYSFGRRYVPRTGKPQIALWPSKKS 326 Query: 353 ARNFAASLTALLWK 366 R + + + Sbjct: 327 IRRMVEKIHDMTER 340 >UniRef50_C7V8C7 Reverse transcriptase n=1 Tax=Enterococcus faecalis CH188 RepID=C7V8C7_ENTFA Length = 496 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. 
Identities = 100/367 (27%), Positives = 168/367 (45%), Gaps = 38/367 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ ++RL LIT + A A + +S+KG +T G+DGV + Sbjct: 30 LQLRIVKATQQGKWRLVRRLQYLITHSFYAKALAVKKVISNKGKNTAGIDGVIWKTDSQK 89 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIW 118 + ++L HY P +R+YI K K RPLGIP + DR +Q L A+EP+ Sbjct: 90 -----KQAIEQLNPNHYSPKAVKRIYITKFGKKEKRPLGIPCMLDRAMQALYLQALEPVS 144 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E + SYGFR +S A V L C + +W++EGD+ FD + H+ L+ + Sbjct: 145 ECISDSNSYGFRRFKSAKDAGEKVFKVL--CRQYSAQWILEGDIKGCFDNISHQWLIDNI 202 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +L K +K+G+++ + G QGG+ISP L+NI L+ ++ + +Y Sbjct: 203 ------PLEKNMLRKFLKSGYMEKKKLFPTTMGTAQGGIISPTLANITLDGLEKRIKSKY 256 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 S K +N K V + RYADDF++ + ++ I+ Sbjct: 257 WSNKKGTIGVRYN-----------------KHKVNFVRYADDFIVTGDSPEILLK-IKNM 298 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 L+ L L+ +KT I H+N GF FLG +Y +++ ++ + Sbjct: 299 INEFLKER-GLSLSEEKTLITHINQGFDFLGWNFR----KYKRYKLIVQPSKKSIKRMKQ 353 Query: 359 SLTALLW 365 +L ++ Sbjct: 354 TLKQVVK 360 >UniRef50_P03875 Putative COX1/OXI3 intron 1 protein n=3 Tax=Fungi/Metazoa group RepID=AI1M_YEAST Length = 834 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. 
Identities = 106/356 (29%), Positives = 161/356 (45%), Gaps = 32/356 (8%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPAR 82 ++ + L A S G TPG L + L L +EL +G ++ P R Sbjct: 255 IMKNVDMLMLAYNRIKSKPGNMTPGT---TLETLDGMNMMYLNKLSNELGTGKFKFKPMR 311 Query: 83 RVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTV 142 V IPK G +RPL + RD+IVQ M M ++ I++ T S+GFR S AI V Sbjct: 312 MVNIPKPKGGMRPLSVGNPRDKIVQEVMRMILDTIFDKKMSTHSHGFRKNMSCQTAIWEV 371 Query: 143 KLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV 202 + W IE DL FDT+ H L++K ++R ISD F+ L++K ++AG+ID Sbjct: 372 RNMFG-----GSNWFIEVDLKKCFDTISHDLIIKELKRYISDKGFIDLVYKLLRAGYIDE 426 Query: 203 -GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER---YLSGKARKDRWYWNN------ 252 G + G+PQG +ISP+L NI++ D +L + Y GK +K + Sbjct: 427 KGTYHKPMLGLPQGSLISPILCNIVMTLVDNWLEDYINLYNKGKVKKQHPTYKKLSRMIA 486 Query: 253 -----------SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 +R + N + Y RYADD ++ V G+K + I+ + Sbjct: 487 KAKMFSTRLKLHKERAKGPTFIYNDPNFKRMKYVRYADDILIGVLGSKNDCKMIKRDLNN 546 Query: 302 VLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 L SL L +N +KT I + FLG+ + + V TI + R+ Sbjct: 547 FL-NSLGLTMNEEKTLITCATETPARFLGYNISITPLKR-MPTVTKTIRGKTIRSR 600 >UniRef50_B7KM76 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM76_CYAP7 Length = 309 Score = 251 bits (640), Expect = 4e-65, Method: Composition-based stats. 
Identities = 94/319 (29%), Positives = 145/319 (45%), Gaps = 41/319 (12%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 + L+ +A +KG G+DG L L L + + + +Y Sbjct: 1 MTNLIDEFLSLPNFRQAWFKVADNKGC--AGIDGETIEHFALNLDFNLTFLLNSVTNSNY 58 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P ++V IPKS K R L IP +RDRIVQ+A+L + P+ E F S+ +RP RS Sbjct: 59 IPQPLKQVLIPKSQEKWRELRIPTVRDRIVQQALLNVLYPVMEERFSDASFAYRPNRSYL 118 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A++ + +WV++ D+ YFD + H LL+K VR+ + ++ + L+ I Sbjct: 119 DAVKRAAYW----RDLGYQWVLDADIVEYFDNISHSLLLKEVRKTVDNSGILCLIKAWIS 174 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 AG +GVPQG VISP+L+NI L+EFD + Sbjct: 175 AGVSTDKGIIFPEKGVPQGAVISPMLANIYLDEFDHRI---------------------- 212 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 + + RYADDF+++ + A + + L L+L+ +KT Sbjct: 213 -----------TQSDLKLVRYADDFLVLSDTEDGIMRAYSQVVQ--LLHFWGLKLHEEKT 259 Query: 317 KIPHVNDGFIFLGHRLIRK 335 +I H GF FLGH +RK Sbjct: 260 QITHFKKGFQFLGHGFLRK 278 >UniRef50_Q1VQM5 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=7 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VQM5_9FLAO Length = 456 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. 
Identities = 114/376 (30%), Positives = 183/376 (48%), Gaps = 41/376 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRKL A D + L + + L EA R S+ + GVD + ++ + Sbjct: 26 FQRKLYIRAKQDKGFKAYSLYGKLCEDHTLIEAYRRVRSNY-SKGVGVDNQSSDAIEKQG 84 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++V L ++ +L Y+ ++ IPK G R LGIP +RDR+VQ A+ M +EP+W Sbjct: 85 ISVFLGEIQQDLQGHTYRSQAVKQKLIPKEKEGDFRVLGIPTIRDRVVQMAVKMLIEPLW 144 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF S+GFRP+R AI+ VK + D R ++V + DLS YFDT+ H L + Sbjct: 145 EADFEHTSFGFRPKRGAKDAIKQVKQNIYD----RHQFVYDADLSKYFDTIPHTKLFILL 200 Query: 179 RRRISDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 ++R+ D ++L+ + + A + G A+++G PQGGVISPLLSNI L+ FDQ ++ Sbjct: 201 KKRLVDHSILSLIHQWLTAPVRLPNGKLVASTKGSPQGGVISPLLSNIYLHAFDQIVNN- 259 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + K + RYADDF+L+ G + I + Sbjct: 260 -------------------------PKGKFAKANIRIVRYADDFLLM--GKWYFSKEILD 292 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLGHRLIRKRSRYGE--MRVVSTIPQEKAR 354 +++ + L LN +KTK+ H + + FLG +S++G + P K+R Sbjct: 293 YITSIMDN-MGLTLNKEKTKLLHSSKSSLFFLGFEFRSIKSKFGWNAKNYTNVRPSMKSR 351 Query: 355 NFA-ASLTALLWKVRI 369 + + L L + Sbjct: 352 SKLFSKLRELFANRKH 367 >UniRef50_Q8A4I4 Reverse transcriptase n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8A4I4_BACTN Length = 430 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. 
Identities = 96/354 (27%), Positives = 159/354 (44%), Gaps = 56/354 (15%) Query: 26 QPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVY 85 + + +A +++G+ G+D V + L L L + + SG Y P + V Sbjct: 12 SKQLVYDAFLRVKANRGS--AGIDKVTLEDYEKNLRGNLYKLWNRMSSGSYFPPSVKLVE 69 Query: 86 IPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQ 145 IPKS G RPLGIP + DR+ Q A++M + P E FH SY +RP RS H A+ + + Sbjct: 70 IPKSTGGKRPLGIPTVSDRVAQMAVVMLITPSIEPCFHEDSYAYRPHRSAHDAVGKARER 129 Query: 146 LTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI-DVGL 204 + WV++ D+S +FDT+ H LL+KA++R + + + + +K + G Sbjct: 130 CW-----KYAWVLDMDISKFFDTIDHELLLKALKRHTQEKWVLMYIERWLKVPYEKSDGS 184 Query: 205 FRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 + GVPQG VI P+L+N+ L+ FD+++ + + Sbjct: 185 QVDRALGVPQGSVIGPVLANLFLHYTFDKWMEKNF------------------------- 219 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND 323 P V + RYADD + K Q E ++ + E +LRLN +KTKI + Sbjct: 220 ------PRVPFERYADDTICHCHSLK-QAEYMQAMIQQRFEC-CRLRLNEEKTKIVYCKS 271 Query: 324 G----------FIFLGHRLIRKRS--RYGEMR--VVSTIPQEKARNFAASLTAL 363 F FLG + S +YG + I ++ + ++ + Sbjct: 272 SRQKECYPNVTFDFLGFTFQPRESVDKYGNRFTGFLPAISRKSMKRINETMRSW 325 >UniRef50_Q08WW1 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=2 Tax=Bacteria RepID=Q08WW1_STIAU Length = 421 Score = 250 bits (637), Expect = 8e-65, Method: Composition-based stats. 
Identities = 107/359 (29%), Positives = 164/359 (45%), Gaps = 49/359 (13%) Query: 26 QPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVY 85 EWL A T GA G+D +A L V L+ L + + SG Y+ P RR Y Sbjct: 11 DEEWLRYAYEQTR-KDGA--AGIDRQTAKDYEANLEVNLKSLLERIKSGRYKAPPVRRTY 67 Query: 86 IPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQ 145 IPK++G RPLGIP D++ QRA+++ +EPI+E DF S+GFRP RS H A+R ++ Sbjct: 68 IPKADGSQRPLGIPTFEDKVAQRAIVLLLEPIYEQDFRPFSFGFRPGRSAHQALRELRSS 127 Query: 146 LTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLF 205 + E GRWV++ DL YFDT+ H L + + RR++D ++ K +KAG ++ G Sbjct: 128 IL---ERNGRWVLDVDLRRYFDTIEHGKLREVLARRVADGVVRRMIDKWLKAGVLEEGPL 184 Query: 206 RAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRE 264 +G PQGGVISPLL+N+ L+ D++ + Sbjct: 185 LRLEQGTPQGGVISPLLANVYLHYVLDEWYEREVVP------------------------ 220 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-- 322 + K + RYADD V++ + + E L L L+ KT++ Sbjct: 221 --RMKGKCSLIRYADDLVMVFEDF-LDCRRVLEVLGKRL-AKYGLTLHPGKTRMVDFRFK 276 Query: 323 ------------DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRI 369 F FLG + +S+ G+ V + + ++ + R Sbjct: 277 RPGGGQHPATQATTFDFLGFTHVWGKSQRGKNVVYQVTAKSRYARAVKAVWEWCKRNRH 335 >UniRef50_Q119U8 RNA-directed DNA polymerase n=30 Tax=Bacteria RepID=Q119U8_TRIEI Length = 635 Score = 250 bits (637), Expect = 9e-65, Method: Composition-based stats. 
Identities = 105/373 (28%), Positives = 178/373 (47%), Gaps = 46/373 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+ + ++ ++++ +L+T+ + L R+T ++G T G+DG+ Sbjct: 54 LQKLIYRASSRGEIRKMRKYQKLLTKSYYARLLAVRRVTQDNQGKKTAGIDGIKSLPPMQ 113 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 RL L + L S + P RRV+IPK + RPLGIP + DR +Q + + MEP Sbjct: 114 RLN-----LVEMLGSRFLKASPIRRVWIPKPGREEKRPLGIPTMYDRALQALVKLGMEPE 168 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS + AI + + + + ++V++ D+S FD ++H L+ Sbjct: 169 WEALFEPNSYGFRPGRSTYDAIAAIYVSINH----KPKYVLDADISKCFDRINHDALLGK 224 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + + + L+ + +K+G D F EG PQGGVISPLL+NI L+ ++ L + Sbjct: 225 IGK----SPYRKLVKQWLKSGVFDNKQFSNTVEGTPQGGVISPLLANIALHGMEKCLEDY 280 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + K + A++ RYADDFV++ K K ++A + Sbjct: 281 AETLPGTK--------------------RDNQRALSLIRYADDFVILHKDIKVLLQA-KT 319 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND-------GFIFLGHRLIRKRSRYGEMRVVSTI-P 349 + L + L L +KTKI H + GF FLG + + + + + + I P Sbjct: 320 VIQEWL-NQVGLELKPEKTKIAHTLEEYEGNKPGFDFLGFTIRQWKGKTTKQGFKTLIKP 378 Query: 350 QEKARNFAASLTA 362 K+ A Sbjct: 379 SSKSIKTHYRKLA 391 >UniRef50_A5IEI2 Reverse transcriptase n=7 Tax=Bacteria RepID=A5IEI2_LEGPC Length = 454 Score = 250 bits (637), Expect = 9e-65, Method: Composition-based stats. 
Identities = 101/359 (28%), Positives = 157/359 (43%), Gaps = 58/359 (16%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 + + +A ++ ++ G+ G+D + L L L + + SG Y P + V I Sbjct: 49 KQLVWQAYKLVRANAGS--AGIDNQSIDEFSQDLKGNLYKLWNRMSSGSYFPPAVKEVAI 106 Query: 87 PKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 PK G +R LGIP + DRI Q + + MEP+ E F SYG+RP +S A+ + + Sbjct: 107 PKKQGGVRKLGIPTVADRIAQMTVKLMMEPLLEPHFLDDSYGYRPNKSALDAVGVTRKRC 166 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV-GLF 205 + WV+E D+ FD + H LLMKAV+ ISD + + + + A D G Sbjct: 167 WE-----YDWVVEFDIKGLFDNLSHELLMKAVKHHISDRWILLYVERWLTAPIQDQHGGC 221 Query: 206 RAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRE 264 + G PQGGVISPLLSN+ L+ FD ++ + + Sbjct: 222 LPRTAGTPQGGVISPLLSNLFLHYAFDHWMTKHH-------------------------- 255 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG 324 P +CRYADD + + K + ++E + SL L ++ DKTKI + DG Sbjct: 256 -----PDNPWCRYADDGLAHCRTEKEAEQMLKEIDKRF--KSLGLEIHPDKTKIVYCKDG 308 Query: 325 ----------FIFLGHRL--IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 F FLG+ R + R + P + + A+ K+R Sbjct: 309 ARKGKYKNKSFDFLGYTFKARRVKVRSRNSFFIGFTPVVSLKA----VKAMTLKLRRGN 363 >UniRef50_A9ENQ0 Integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Bacteria RepID=A9ENQ0_SORC5 Length = 439 Score = 250 bits (637), Expect = 1e-64, Method: Composition-based stats. 
Identities = 106/370 (28%), Positives = 157/370 (42%), Gaps = 54/370 (14%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 K+ A DP + L LI E L A R + GVDG+ K L Sbjct: 16 KVRERAERDPEGVLLALAHLI-DEEALQRAYRSL---RNEAAVGVDGITKEQYGQDLEHN 71 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ L + S Y+ P RRV+IPK GK RP+GI D+IVQ A+ +E I+E F Sbjct: 72 VRDLHARMKSMRYRHQPIRRVHIPKERGKTRPIGISCTEDKIVQAAVREMLEVIYEPVFR 131 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 +SYGFRP RS H A+R + L E W++E D+ S+FD++ LM+ ++ R++ Sbjct: 132 DVSYGFRPGRSAHDALRALNRMLLGGVE----WILEADIESFFDSIDRTKLMEMLQARVA 187 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D + L+ K + G +D F A +G QG V+SPLL N+ L+ D ++ Sbjct: 188 DKSLLRLVGKCLHVGVLDGAEFYAPEDGTVQGSVLSPLLGNVYLHHVLDLWIEREVQP-- 245 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + RYADDF++ + + + + E Sbjct: 246 ------------------------RLVGKATLIRYADDFIIGFE-REDDAKRVTEVLPRR 280 Query: 303 LEGSLKLRLNMDKTKIPHVND------------GFIFLGHRLIRKRSRYGEMRVVSTIPQ 350 E L+L+ DKT++ F FLG +RSR G +P Sbjct: 281 FE-RYGLKLHPDKTRLLPFGRPDNGQPGGKGPATFDFLGFTHYWRRSRAGRW-----MPS 334 Query: 351 EKARNFAASL 360 K R Sbjct: 335 MKTRKARLRR 344 >UniRef50_Q3B1V7 RNA-directed DNA polymerase n=31 Tax=Bacteria RepID=Q3B1V7_PELLD Length = 495 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. 
Identities = 95/374 (25%), Positives = 162/374 (43%), Gaps = 44/374 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A R++ L L+T A + ++G +TPGVDG +A+ Sbjct: 30 LQARIAKATKEGRHGRVKALQWLLTHSHSGKVLAVKRVTENRGKNTPGVDGDVWKTSKAK 89 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L Y+PLP RR YIPK NGK RPLGIP ++DR +Q +A+EP+ E Sbjct: 90 ANAAAS-----LRRRGYKPLPLRRTYIPKKNGKQRPLGIPTMKDRAMQALYWLALEPVAE 144 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFRP RS + L +W++E D++ FD + H+ L+ + Sbjct: 145 TTADGNSYGFRPWRSTADVAEQCFICLAR--RDSAQWILEADIAGCFDAISHQWLVDNIP 202 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +KAG + + G PQGG+ISP L+N+ L+ +Q L + Sbjct: 203 MDTP------ILRKWLKAGFVFNNELFPTASGTPQGGIISPGLANMSLDGLEQALATAFP 256 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + R + + RYADDF++ + I Sbjct: 257 QARRRGLK------------------------MHMVRYADDFIITGNSKEWLEHEIMPVV 292 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L+ L L+ +KT++ H+ +GF FLG + + + +++ + + Sbjct: 293 VDFLKKR-GLWLSEEKTRVTHITEGFDFLGWNMRKY-----DGKLLIKPSKANIKAHLTK 346 Query: 360 LTALLWKVRISGEI 373 + ++ + ++ Sbjct: 347 VRGIIKAKKTIKQV 360 >UniRef50_Q94Z24 Orf568 n=1 Tax=Pylaiella littoralis RepID=Q94Z24_PYLLI Length = 568 Score = 248 bits (632), Expect = 3e-64, Method: Composition-based stats. 
Identities = 105/379 (27%), Positives = 170/379 (44%), Gaps = 36/379 (9%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q LA S + +L R + A A R ++KG +TPG++G +L Sbjct: 27 QNNLAVAELKGDSGLVTKLQRNLVNSFAGRALAVRAITTNKGKNTPGINGEIWDTSIKKL 86 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 + +Y P +RVYIPKS GKLRPLGIP + DR +Q +A++PI E Sbjct: 87 DA----IHRLGRVSNYSCSPVKRVYIPKSGGKLRPLGIPNMYDRGLQYLWKLALDPIAEC 142 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 SYGFR RS + L L+ ++R WV+E D+ +FD ++H +++ + Sbjct: 143 RADRHSYGFRKGRSTQDVHTILHLLLS--PKSRCDWVLEADIRGFFDNINHDWIIQNIPM 200 Query: 181 RISDARFMTLLWKTIKAGHIDVGL--FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +L + +KAG ++ F GVPQGG ISPL++N+ L+ + ++ Sbjct: 201 D------KNILREWLKAGALETTTQEFHKGIAGVPQGGPISPLIANMTLDGLEVWVAN-- 252 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + ++ + T+ W P V RYADDFV+ + + ++ Sbjct: 253 ----------SVKHLYKKSKETS------WSPKVNVVRYADDFVVTAATKRILEDIVKPS 296 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEM--RVVSTIPQEKARNF 356 + L L LN +KT I V GF F+G + G + + +E R Sbjct: 297 IQDFLASR-GLVLNQEKTCITSVKKGFDFVGFNFRVYPDKSGPKGAKSIVKPTKEGKRRL 355 Query: 357 AASLTALLWKVRISGEILL 375 + + + + SGEI++ Sbjct: 356 RSKIRNAVKTNKSSGEIIV 374 >UniRef50_A7BUU9 RNA-directed DNA polymerase n=1 Tax=Beggiatoa sp. PS RepID=A7BUU9_9GAMM Length = 585 Score = 248 bits (632), Expect = 3e-64, Method: Composition-based stats. 
Identities = 119/375 (31%), Positives = 182/375 (48%), Gaps = 38/375 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDG--VNKTML 56 ++R +A ++ +L R+ + L ++T ++G T GVDG + Sbjct: 34 LERSIAKAVEHHDFAKVAKLRRIFRTSKNVRLLSVRKVTQDNRGKKTAGVDGKVITSEKD 93 Query: 57 QARLAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAME 115 + RLA ++ + G + P RRV+IPKSN K LRPLGIP + DR+ Q + + +E Sbjct: 94 RWRLASNVR------IDG--KSNPLRRVWIPKSNSKELRPLGIPTIEDRVKQMMLKLEIE 145 Query: 116 PIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM 175 PI+E YGFRP RSVH AI + + + G WV+EGD S +FD ++ L+ Sbjct: 146 PIYEVQAEPNVYGFRPARSVHDAIEACFIAI--GCKKEGAWVLEGDFSKFFDNINKEHLL 203 Query: 176 KAVR-RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL 234 ++ + I+D + + IK+G ID +F +G PQGGVISPLL+NI L+ + L Sbjct: 204 NMMKSKGITDKETLQQVQAWIKSGVIDKEVFTKTDKGTPQGGVISPLLANIALHGMENML 263 Query: 235 HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 H+ + K K + + RYADDFV+I K KA +E Sbjct: 264 HDWVDTWKGTK--------------------RSNHQSFSVIRYADDFVVIHKD-KAVIEE 302 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQE-KA 353 + L+ + ++LN KTKI H +GF FLG + + R G T P K Sbjct: 303 AKLCIEEWLDKGVGVKLNQTKTKITHTTEGFDFLGFNVRQYRVNNGSQLKFLTKPSMDKV 362 Query: 354 RNFAASLTALLWKVR 368 + S+ + +R Sbjct: 363 KAHMESIRQVTKTMR 377 >UniRef50_Q6EI10 Reverse transcriptase/HNH endonuclease n=2 Tax=Eukaryota RepID=Q6EI10_9CHLO Length = 600 Score = 247 bits (631), Expect = 4e-64, Method: Composition-based stats. 
Identities = 99/367 (26%), Positives = 163/367 (44%), Gaps = 45/367 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQ--PEWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q + + + R+++ L+ L R T + G T GVD + Sbjct: 34 LQGFIYSASKAGDIKRVRKFQHLLVNSYEAKLLAIRRATQDNTGKKTAGVDKARALTPKQ 93 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPI 117 RL L L Q P RRV+IPK +RPLGIP ++DR +Q + +EP Sbjct: 94 RLE-----LASSLRIPT-QSSPLRRVWIPKPGTDVMRPLGIPTIKDRCLQALFKLMLEPE 147 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F S+GFRP RS A+ ++ + R ++V++ D++ FD ++H+ L+ Sbjct: 148 WEAKFEPSSFGFRPGRSCRDALAAIQANIQK----RSKYVLDADIAKCFDRINHKALLDK 203 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + R + +KAG +D F G PQGG++SP+L+NI L+ + +L + Sbjct: 204 IGMTGGFGRQLL---AWLKAGVLDGSTFSETDLGTPQGGIVSPVLANIALHRMEDHLKKF 260 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 +RG + V RYADDFV++ K + A + Sbjct: 261 VCQFPMTYASGTVIKKSRRGET------------VTLIRYADDFVVL-HHDKKILLACKA 307 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND---------------GFIFLGHRLIRKRSRYGEM 342 E L G + L + KT++ H + GF+FLG+++ + S+YG Sbjct: 308 ELVRWL-GEMGLEFSPTKTRLTHTLELQSDDVEAEGFDGTVGFVFLGYQIKQFASKYGSA 366 Query: 343 RVVSTIP 349 + + IP Sbjct: 367 KSTAGIP 373 >UniRef50_A4C8M3 RNA-directed DNA polymerase (Reverse transcriptase) n=12 Tax=Pseudoalteromonas tunicata D2 RepID=A4C8M3_9GAMM Length = 429 Score = 246 bits (629), Expect = 7e-64, Method: Composition-based stats. 
Identities = 103/371 (27%), Positives = 170/371 (45%), Gaps = 45/371 (12%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 + P R Q L L+ + ++L ++ +K A G+DG+ Q +L + L Sbjct: 11 KSQRHPKHRFQNLYGLL-REDFLYQSWGQL--NKQA-AAGIDGITMPAYQQQLVGNITRL 66 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 D L ++ +RV+IPK+NGK RPLG+P + D++VQ+ + ++ IWE+DF SY Sbjct: 67 SDALKHKRFRANDIKRVFIPKANGKQRPLGLPTVDDKLVQQGVSQILQSIWEADFLPNSY 126 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 G+RP +S H A+ ++ L L +++E D+ +F+ + H LMK +++RI D Sbjct: 127 GYRPNKSAHQALHSLALNLQ---FKGYGYIVEADIKGFFNNLDHNWLMKMLKQRIDDKAM 183 Query: 188 MTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 ++L+ + +KA G+F G PQGG+ISP+L+NI L+ D Sbjct: 184 LSLISQWLKARIKSPEGVFEYPKSGTPQGGIISPVLANIYLHY--------------ALD 229 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 W+ R R A+ RYADDFV + E E L+ Sbjct: 230 LWFEKKVKPRMRGRAM-----------LIRYADDFVCAFQ-YANDAERFYEVLPKRLK-K 276 Query: 307 LKLRLNMDKTKIPHV-------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L + +KT + F+FLG + G+ R+ EK R AS Sbjct: 277 FNLEVAEEKTSLLRFSRFHPSRKRQFVFLGFAFYWAKDAQGKPRLRRRTGAEKHR---AS 333 Query: 360 LTALLWKVRIS 370 ++ +++ Sbjct: 334 MSEFYQYIKVK 344 >UniRef50_C6MRB5 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MRB5_9DELT Length = 433 Score = 246 bits (629), Expect = 8e-64, Method: Composition-based stats. 
Identities = 91/259 (35%), Positives = 138/259 (53%), Gaps = 9/259 (3%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 +RL+ I + A + +KGA PGVD + T L+ L +++ELL+G Y Sbjct: 171 ERLMEEIVSRGNMMAAYSKVVGNKGA--PGVDNMPVTELKGYLQEHWPRIKEELLAGKYI 228 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P R+V IPK +G R LGIP + DR++Q+A+ + ++ F SYG+ P RS H Sbjct: 229 PQPVRKVEIPKPDGGKRMLGIPTVLDRLIQQAVSQVLGRLFIPCFSKHSYGYIPGRSTHQ 288 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 AI+ + + + RW ++ DL +FD V+H +LM V+R++ D + + L+ +KA Sbjct: 289 AIQAARQYVAEGR----RWAVDIDLEKFFDRVNHDILMSLVKRKVKDRQVLKLIDSYLKA 344 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G GL EG PQG +SPLLSNIML+E D+ L +R + S+ Sbjct: 345 GMFIGGLVSPRQEGTPQGSPLSPLLSNIMLDELDKELEKRGHAFCRYAVMPTIATSMLPP 404 Query: 258 RSTAVRENWQWKPAVAYCR 276 R N W+P+ A+CR Sbjct: 405 RRAV---NGSWQPSPAFCR 420 >UniRef50_B1VA32 Retron-type reverse transcriptase n=7 Tax=Candidatus Phytoplasma RepID=B1VA32_PHYAS Length = 521 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. 
Identities = 104/365 (28%), Positives = 177/365 (48%), Gaps = 24/365 (6%) Query: 11 TDPSLRIQR-LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + + +++ L R + E +A ++KG+ T GV ++ + ++ Sbjct: 22 SKNNYSLKKVLKREMNNIENTYKAFNSIATNKGSGTEGVGNKTIDGIKLEM---IKKYHK 78 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 E ++ Y P P ++V IPK K RPLGIP ++DRI+Q+AM + +E+ F S+GF Sbjct: 79 EYVNNQYNPQPVKKVLIPKGKNKTRPLGIPTIKDRIIQKAMEQLLTLYFENIFLEWSFGF 138 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R ++S H A++ VK + + YF+T++H +L K + + I A+ + Sbjct: 139 RSKKSCHDAVKRVKQRFKGIDYIIKIDI-----KGYFETINHDILNKMLNKYIRKAKTLK 193 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA--RKDR 247 + + +KAG ++ G+ + G PQGG+ISPLLSNI L+ D+ + E +GK + + Sbjct: 194 TINQWLKAGIMENGIKYESLSGTPQGGIISPLLSNIYLHYIDKKMEELIRNGKPIMKANP 253 Query: 248 WYWNNSIQRGR---STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 YW ++ S N P + Y RYADDF++ +KG + E I+ LE Sbjct: 254 EYWKAYTKKQHHNLSIDSEINLNPNPRIEYIRYADDFIIGIKGEHHEAERIKTHVLKWLE 313 Query: 305 GSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYG----------EMRVVSTIPQEKAR 354 LKL +N DK+KI G FL + + + +V IP++ R Sbjct: 314 QDLKLVVNRDKSKIVKTTKGTRFLSYMVKVNPTNKTRKKKTTKNSLNGQVQIQIPKDTTR 373 Query: 355 NFAAS 359 ++ Sbjct: 374 DYGKE 378 >UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular organisms RepID=Q12UG1_METBU Length = 592 Score = 245 bits (624), Expect = 3e-63, Method: Composition-based stats. 
Identities = 93/340 (27%), Positives = 159/340 (46%), Gaps = 33/340 (9%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++L L+T + A R + +KG T G+DG + ++ Sbjct: 64 IQVRITKAVINKNWNLVKKLSYLLTHSHYAKLLAVRKVIRNKGRRTAGIDGEFWSTPVSK 123 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + L Y+ P +R++I K + K RPLGIP + DR +Q +A++PI Sbjct: 124 VNA-----ARSLSDKRYKAKPLKRIFIEKYGSDKKRPLGIPTMYDRAMQALYALALDPIA 178 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E S+GFR RS H A + ++ + + ++EGD+ FD + H+ L+ + Sbjct: 179 EVTADKRSFGFRKFRSTHDACSQIFGTISK--KDSAQCILEGDIKGCFDNISHQWLIDNI 236 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 ++L + +KAG + G PQGG+ISP+L+N+ L+ + L ++Y Sbjct: 237 PMD------KSILKQFLKAGFVYENSLFPTKAGTPQGGIISPILANMTLDGIEGVLADKY 290 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 RG S + + K V + RYADDF++ K TK E +E Sbjct: 291 ----------------HRGVSGKITTRQRAKHKVNFVRYADDFIVTAK-TKEIAEEAKEL 333 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR 338 + L L L+ +KT I H++DGF FLG + + + + Sbjct: 334 IKNFLTDR-GLELSDEKTLITHIDDGFDFLGWNVRKYKGK 372 >UniRef50_Q8TJY1 Reverse transcriptase n=5 Tax=Methanosarcina RepID=Q8TJY1_METAC Length = 512 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. 
Identities = 96/386 (24%), Positives = 180/386 (46%), Gaps = 46/386 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A+ A + + +L RL+T+ + + R ++KG+ TPG+DG+ + + Sbjct: 53 LQSRIASAAKNGKWITVNKLSRLLTRSLYAKLLSVRKVTTNKGSRTPGIDGIIWSSSADK 112 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + LQ L + Y+ P R YI K NGKLRPL IP + DR +Q + + PI Sbjct: 113 MRSALQ-----LTNKGYRAKPLTRKYIRKKNGKLRPLSIPTMYDRAMQTLHSLVLGPIES 167 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + S+GF+P RS A + + L+ + W++EGD+ + FD ++H ++ + Sbjct: 168 AIGDKTSFGFKPYRSTKDAYAYLHICLSK--KIAPEWIVEGDIKACFDEINHTWILDNIP 225 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L + +KAG+++ +G PQG ISP++ N+ LN + L R+ Sbjct: 226 MD------KRILKEFLKAGYVENYHLFPTEKGTPQGSPISPIIGNMALNGLENALAMRFY 279 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 S + ++ Q + V R+ADDFV + +E I + Sbjct: 280 SR----------------SDGTIDKSHQNRHKVNCARFADDFVATADSPETALE-IIDVI 322 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L+ L+L+ +KT + ++++GF FLG + + + ++ ++ R Sbjct: 323 QEFLDPR-GLKLSEEKTLVTNISEGFNFLGWNFRKYKGK-----LLPKPSKDSQREIIKK 376 Query: 360 LTALLWK---------VRISGEILLG 376 ++ ++ K +RI I+ G Sbjct: 377 ISDVIHKAKAWDQDRLIRILNPIIRG 402 >UniRef50_A7MS60 Putative uncharacterized protein n=21 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7MS60_VIBHB Length = 430 Score = 244 bits (622), Expect = 4e-63, Method: Composition-based stats. 
Identities = 112/353 (31%), Positives = 168/353 (47%), Gaps = 47/353 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDG--VNKTMLQARLAVELQILRDELLSGHYQP 78 + I L +A R +KG GVD ++ T+ + R A Q LR LL G Y+P Sbjct: 1 MEQICSSTNLNQALRRVKKNKGC--AGVDKLDIDATIFKLRQASNGQALRQSLLDGSYRP 58 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P V IPK +G +R LGIP + DRIVQ+A+ + I+E+ F SYGFRP RS HHA Sbjct: 59 QPVLGVGIPKPSGGVRQLGIPTVLDRIVQQAITSVLSDIYEAKFSNSSYGFRPNRSAHHA 118 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + + + +V++ DL+ YFDTV+H LM + I+D R + L+ ++AG Sbjct: 119 LAAASRYIREGR----GYVVDIDLAKYFDTVNHDRLMHRLSEDIADKRVLKLIRSYLQAG 174 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + GL G PQGG +SPLLSNI+L+E D+ L R Sbjct: 175 IMRNGLVEQRQRGTPQGGPLSPLLSNIVLDELDKELERR--------------------- 213 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 +CRYADD + V G++ ++E LE LKL +N +K+ Sbjct: 214 ------------GHKFCRYADDCQIYV-GSEEAAYRVKESITEYLEQKLKLTVNREKSAA 260 Query: 319 PHVNDGFIFLGHRLIRKR----SRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 V + +L HR S+ + ++ + + Q RN L ++ ++ Sbjct: 261 TRVTER-TYLSHRFGIDGTIHISKPAQTQMKTRVRQITKRNRGRELKVVIAEL 312 >UniRef50_B8FP59 RNA-directed DNA polymerase n=9 Tax=Firmicutes RepID=B8FP59_DESHD Length = 320 Score = 244 bits (622), Expect = 5e-63, Method: Composition-based stats. 
Identities = 104/316 (32%), Positives = 156/316 (49%), Gaps = 48/316 (15%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 K T + D S + QRL R + PE+ A + ++KG+ TPG+DG + + Sbjct: 9 KSLTEKSKDESYKFQRLYRNLYNPEFYWLAYQNIYANKGSMTPGMDGTTISGIN---EER 65 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 ++ + L YQP PARRVYI K N K RPLGI D++VQ + M +E I+E F Sbjct: 66 IRQIIASLKDQSYQPHPARRVYIEKKNSQKKRPLGISTANDKLVQEVVRMILESIFEPTF 125 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP RS A+ ++ T W +EGD+ + FD+ H +L++ ++RRI Sbjct: 126 SDKSHGFRPVRSCQTALLQIQGNFTGVN-----WFVEGDIEACFDSFDHHVLIELLQRRI 180 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------- 234 DA F++L+WK +KAG+++ + +GVPQG ISP+LSNI L+E D ++ Sbjct: 181 DDASFISLMWKFLKAGYMEQWEYNMTYDGVPQGSGISPILSNIYLSELDTFMSEYKAAFD 240 Query: 235 --------------HERYLSGKARK-DRWYWNNSIQRGRSTAVRENWQW----------- 268 H RY++ K RK R W N + + ++E Sbjct: 241 IRKTHGRKPSSNYCHARYMASKYRKESRAIWGNLTPSEKKSRMKEQRALSVKQRETPAHD 300 Query: 269 -----KPAVAYCRYAD 279 + Y RYAD Sbjct: 301 VFDDTFKIIQYARYAD 316 >UniRef50_C6MXE9 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MXE9_9DELT Length = 512 Score = 244 bits (622), Expect = 5e-63, Method: Composition-based stats. 
Identities = 97/335 (28%), Positives = 153/335 (45%), Gaps = 43/335 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A + +++ L L+T+ A + S+KG TPGVDGV + R Sbjct: 34 LQIRIAKAVLENRWNKVKTLQHLLTRSFHAKLLAVKRVTSNKGKKTPGVDGVLWKTAKVR 93 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L Y+P P +R+YIPK NGK RPL IP ++DR +Q +A+ P+ E Sbjct: 94 WRAAC-----SLRRRGYKPQPLKRIYIPKKNGKKRPLSIPTMQDRAMQALYKLALAPVAE 148 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFR RS A L+ WV+E D++ +D + +++ + Sbjct: 149 TTADGNSYGFREGRSCADATAAAFNALSK--PNSAPWVLEADITGCYDNICQNWMLENIP 206 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K ++AG+I+ G+ + +G PQGG+ISP L+N+ L+ ++ + Sbjct: 207 MDR------EVLRKWLEAGYIEDGILYPSHKGTPQGGIISPTLANMTLDGLERVI----- 255 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 R + V + RYADDF++ K + AIR Sbjct: 256 -----------------------RTAVPRRCRVNFVRYADDFIVTGKSRRLLETAIRPAI 292 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR 334 L G L L+ +KT I H+ DGF FLG + Sbjct: 293 EKFLSGR-GLSLSPEKTAITHIKDGFTFLGQTYRK 326 >UniRef50_Q9G8T2 Orf762 n=2 Tax=Eukaryota RepID=Q9G8T2_RHDSA Length = 762 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. 
Identities = 99/367 (26%), Positives = 172/367 (46%), Gaps = 34/367 (9%) Query: 6 ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG-VDGVNKTMLQARLAVEL 64 D + +RL+ +I + L A S G G +D + L + Sbjct: 165 HRKFNKDKNFVNKRLINIIGDVQTLIVAYEFVKSKPGQMVKGSID----STLDDIDLAWI 220 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLR-PLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + + + +G ++ +P+RR+Y+ K+ K R P+ RD++VQ+A+ + +EPI+E+ F Sbjct: 221 KSISKVIKAGKFKFIPSRRIYVSKTGCKERRPIMTGFPRDKLVQKAIQLVLEPIYENVFL 280 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S+GFRP R H A++++K G WVIE D++S F +V+H +L+ ++ RI Sbjct: 281 ENSHGFRPARGCHTALKSIKQ-----GFHGVTWVIESDIASCFSSVNHEVLLSIIKERIK 335 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE------- 236 + + L+ +++G++D+G F + G+PQG +SPLL NI L++FD +++E Sbjct: 336 CVKTLALIRNLLESGYVDLGAFCKSKLGIPQGSSLSPLLCNIYLHKFDTFMYELKQRFVY 395 Query: 237 ------RYLSGKARKDRWYWNNSIQRGRSTAVRENWQ---------WKPAVAYCRYADDF 281 R R R N +S ++E + + Y RYADDF Sbjct: 396 TSSKDPRINPAYKRLQRQIQNTPGLVEKSKFIQELRKTPSKDLFDPKYRRLFYIRYADDF 455 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYG 340 + + G K I ++ + L LK+ L K ++ H+ FLG + Sbjct: 456 SIGITGQKKDAVEILDQAKIFLSEELKMDLKESKIRVVHLKKQSIFFLGTTIYGISCVEK 515 Query: 341 EMRVVST 347 MR V Sbjct: 516 PMRTVKH 522 >UniRef50_Q2FUJ3 RNA-directed DNA polymerase n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FUJ3_METHJ Length = 487 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. 
Identities = 100/382 (26%), Positives = 167/382 (43%), Gaps = 50/382 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLA-EAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A +RL L+T + A + ++G + GVDG T + + Sbjct: 38 LQTRIAKAVKNGQYRLARRLQYLLTHSFYAKMLAVQRVTKNRGKRSAGVDGEKWTTPEQK 97 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + L L Y+ P RR+YIPK + K+RPL IP + DR +Q MA+ P Sbjct: 98 MKAALT-----LSDKGYRAKPLRRIYIPKPQSSKMRPLSIPTMYDRAMQALYAMALMPWA 152 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ S+GFR +R+ A L+ +T G+W++EGD+ FD H+ ++ + Sbjct: 153 ETTADKTSFGFRMKRNAQDAASYTFQCLSR--KTSGQWILEGDIRGCFDNFAHQWMLDNI 210 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +L + +KAG+I G+ G PQGG+ISPLL+N+ L+ ++ L E + Sbjct: 211 ------PLDQRILNQFLKAGYIYDGILYRNKSGTPQGGLISPLLANMALDGMERMLKEHF 264 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 K V R+ADDF++ + ++ +E Sbjct: 265 PGNK-----------------------------VHLIRFADDFLVTADSQETALQ-CKEL 294 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQ--EKARNF 356 L L L+ +KTKI H+N+GF FLG + + ++ I +K R Sbjct: 295 ITEFLHER-GLELSEEKTKIVHINEGFDFLGWNFRKFKGKFLIQPSKKAIAAIIDKVRVI 353 Query: 357 AASLTALLWK--VRISGEILLG 376 S A + ++ ++ G Sbjct: 354 IKSAKAWKQEDLIKALNPVIKG 375 >UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YLU0_ANASP Length = 539 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. 
Identities = 108/387 (27%), Positives = 175/387 (45%), Gaps = 46/387 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q ++ + +++L +L+ + L ++T + G T GVDG Sbjct: 40 LQVRIFKAQKNGNTRLVRKLQKLLLSSKAAKLLAIRQVTQLNTGRKTAGVDGKKALEPSQ 99 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 RLA+ ++++ ++ P +RVYIPK++G R LGIP + DR Q + A+EP Sbjct: 100 RLALYEVLVKN---WKQWKHQPLKRVYIPKADGTRRGLGIPTISDRAYQCLIKYALEPAA 156 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR-GRWVIEGDLSSYFDTVHHRLLMKA 177 E+ F+ SYGFRP RS H + + L + ++E D+ FD + H+ LM++ Sbjct: 157 EAMFNARSYGFRPGRSCHDVQKLLFSNLNGGQANGLSKRILELDIERCFDKIDHKFLMQS 216 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 V+ ++ IKAG G F ++ G PQGGVISPLL+NI+L+ + HE Sbjct: 217 VQ---LPKAAKQGIFWAIKAGVR--GEFPSSESGTPQGGVISPLLANIVLHGLENVGHEL 271 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 VR + + RYADD V ++K + EA+R+ Sbjct: 272 ---------------------RYKVRSGGRQIDTIKGFRYADDVVFLLK-PEDNPEALRQ 309 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 LE L++ KTKI H D F FLG + K + +ST Q+ + Sbjct: 310 NIDTFLEAR-GLKVKEAKTKIVHSTDSFDFLGWNFVVK----PNGKFISTPSQKATSSIK 364 Query: 358 ASLTALLW--------KVRISGEILLG 376 A + ++ ++ G I+ G Sbjct: 365 AKVKEVMKDSCFTLEERIDKCGAIVRG 391 >UniRef50_Q47277 Orf protein n=63 Tax=cellular organisms RepID=Q47277_ECOLX Length = 416 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. 
Identities = 102/360 (28%), Positives = 156/360 (43%), Gaps = 58/360 (16%) Query: 26 QPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVY 85 + A R +S GA G+D + RL L + + L SG Y P + V Sbjct: 10 DKSLVVSAYRRVKTSAGA--AGIDKQSLADFDKRLVDNLYKIWNRLSSGSYFPPAVKAVA 67 Query: 86 IPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQ 145 IPK G R LGIP + DRI Q + +A EP E F SYG+RP +S AI + + Sbjct: 68 IPKKLGGERILGIPTVSDRIAQTVVKLAFEPQVEPHFLADSYGYRPNKSALDAIGVTRKR 127 Query: 146 LTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGL 204 WV+E D+ FD + H L+MKAV + + + + A + G Sbjct: 128 CWY-----YDWVLEFDIKGLFDNIPHELIMKAVDKHNPARWVKLYIQRWLTAPMVMSDGE 182 Query: 205 FRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 RA + G PQGGVISPLL+N+ ++ FD++L + Y Sbjct: 183 VRARTMGTPQGGVISPLLANLFMHYVFDKWLAKYY------------------------- 217 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND 323 P V + RYADD +L ++A+ +RE R L ++ +KT++ + D Sbjct: 218 ------PKVPWYRYADDGILHCH-SEAEATEMREVLRKRF-SECGLEMHPEKTRVIYCKD 269 Query: 324 G----------FIFLGHRLIRK--RSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 G F FLG+ R+ ++ VS P ++L A+ +++ +G Sbjct: 270 GSRKGDYEHTMFDFLGYTFRRRVVKNVKRNSLFVSFTPAAS----KSALKAMRREIKATG 325 >UniRef50_A6LY84 RNA-directed DNA polymerase (Reverse transcriptase) n=17 Tax=Firmicutes RepID=A6LY84_CLOB8 Length = 433 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. 
Identities = 100/386 (25%), Positives = 165/386 (42%), Gaps = 51/386 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 K+A A + L I E L + + L G G+D V K L Sbjct: 7 KIAEIARQKTKEKFTSLYHYI-NKEMLLKCHKQLL---GNKATGIDDVTKQEYSKELDNN 62 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ L +L + Y+P +RVYIPK +GK RPLGIP+ D++VQ A+ ++ I+E++F Sbjct: 63 IENLIVKLRNHSYKPQAVKRVYIPKGDGKTRPLGIPSYEDKLVQMALNKILQSIYEAEFK 122 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGFRP+R+ H AI+ + + + R +V++ D+ +F+ V+H ++K + RI Sbjct: 123 DFSYGFRPKRNCHSAIKALNKVIENG---RINYVVDADIKGFFNNVNHEWMIKFLEVRIG 179 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D ++L+ K +KAG +D G+ + G PQG ++SP L+NI L+ D + + Sbjct: 180 DPNIISLVKKFLKAGLMDNGIIKTTEIGTPQGSIVSPTLANIYLHYSLDLWFEKVI---- 235 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 ++ RYADDFV + + R + Sbjct: 236 ----------------------KRNFRGQSEITRYADDFVCCFQYESEARQFCRLLVSRL 273 Query: 303 LEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRVVST 347 L + K+K+ + F FLG +S+ G RV Sbjct: 274 --NKFNLEVERTKSKLILFGRFAEEIRKSRGFKNAETFDFLGFTHYCAKSKRGYFRVKRK 331 Query: 348 IPQEKARNFAASLTALLWKVRISGEI 373 ++K + + VR I Sbjct: 332 TSKKKFKAKIKDFNQWIKSVRNKLHI 357 >UniRef50_A7BYN3 RNA-directed DNA polymerase n=2 Tax=Beggiatoa sp. PS RepID=A7BYN3_9GAMM Length = 497 Score = 239 bits (609), Expect = 2e-61, Method: Composition-based stats. 
Identities = 96/328 (29%), Positives = 154/328 (46%), Gaps = 32/328 (9%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELL-SGHYQPLPARRVYIPKSNGKLRPLGIPA 100 G + G+DG+ +AR + +++ L +Y+ PA+R YIPK+NGKLRPLGIP Sbjct: 2 GKRSSGIDGLKYLTPKARERLAKRLMDWALKGWDNYKAKPAKRKYIPKANGKLRPLGIPT 61 Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 DR++Q + A+EP +E+ F + SYGFRP + H AI + +WV++ Sbjct: 62 QEDRVIQHVIKSALEPFYEAQFESNSYGFRPAQGCHDAIEAIFK----ITSHEPKWVLDA 117 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISP 220 D+ FD + H L + + L+ + +KAG+ D G G PQGG+ISP Sbjct: 118 DIKGCFDNIDHNYLTECITHGQ-----KKLVKEWLKAGYTDDGHIHPTKNGTPQGGIISP 172 Query: 221 LLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADD 280 LL+NI L+ + L ++ G + + + + RYADD Sbjct: 173 LLANIALDGLETNLRQKLQIG--------------------IYQTQFNQSRLTVVRYADD 212 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYG 340 FV+I K K E+ + L+ L L+ +KTKI GF FLG + +++ Sbjct: 213 FVIIHKDKKVIKES-QLIISQWLKKR-GLELSPEKTKIVKTPQGFDFLGFNIKWCKNKAK 270 Query: 341 EMRVVSTIPQEKARNFAASLTALLWKVR 368 I + K + + +T ++ Sbjct: 271 GHYKRHLIKEGKYKEYGIRITPSSKSLK 298 >UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8YKQ2_ANASP Length = 562 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. 
Identities = 92/366 (25%), Positives = 157/366 (42%), Gaps = 36/366 (9%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +QR++ + + L +LI + + +I+ + G T G+DGV Sbjct: 65 LQRRVFKAVQAGNKRKARFLQKLILKSKAGRFLAIRQISQLNAGKKTAGIDGVKSLDFNG 124 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R +E+ + + SG++ R + IPK +G R L IP + DR Q A+EP Sbjct: 125 RFELEITLKQS---SGNWHHQELREIPIPKKDGTTRMLKIPTIADRCWQCLAKYALEPAH 181 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ FH SYGFR R+ H A + + L+ + + VIE D+ FD ++H +M+ + Sbjct: 182 EATFHARSYGFRTGRAAHDAQQFLFSNLSSKAKRISKRVIELDIEKCFDRINHSTIMENL 241 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I+ +++ +KAG +G PQGGV+SPLL+NI LN + Sbjct: 242 ---IAPKGIKLGIYRCLKAGI----NPEFPEQGTPQGGVVSPLLANIALNGIES------ 288 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + + + + RYADD V++++ + I + Sbjct: 289 ------------IHRYHKDNQRITNKTPESDIRYPSVRYADDMVIVLR-PQDDANEILAK 335 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 L ++++ KTKI DGF FLG +I + T +E + F Sbjct: 336 IEDFLNAR-GMKVSAKKTKITATTDGFDFLGWHIIV----QSNGKFNCTPSEENFKKFRQ 390 Query: 359 SLTALL 364 + A++ Sbjct: 391 KVKAIV 396 >UniRef50_D2LU13 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LU13_BACS4 Length = 464 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. 
Identities = 103/361 (28%), Positives = 158/361 (43%), Gaps = 52/361 (14%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 ++R I + L A + +K PG+DG++ L LA+E + L + G Y+P Sbjct: 43 DIIRRIINQDNLQRAYKKVKKNK--GKPGIDGMSVDELLPYLALEDRNLILSIKDGSYRP 100 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P +RV I K +G R LGIP ++DR+VQ+ +L +E + F SYGFRP RS H A Sbjct: 101 QPVKRVEIKKPDGGKRKLGIPTVKDRLVQQMILQVIEKKIDPQFSDNSYGFRPNRSAHDA 160 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 +R K E R+V++ D+ YFDTV+ LM V + I D + L+ K +++G Sbjct: 161 MRKAKQ----YYEEGFRYVVDIDMKQYFDTVNQDKLMHHVEQFIDDPTVLILIRKFLRSG 216 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + G PQGG +SP+L NI L++ D L R Sbjct: 217 ISIDEEIEPSEVGTPQGGNLSPILGNIYLHQLDLELERR--------------------- 255 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADD + VK KA + + + LE LKL +N DK+++ Sbjct: 256 ------------GHKFIRYADDCNIYVKSRKAG-DRVLKSITKFLEEELKLTVNKDKSEV 302 Query: 319 PHVNDGFIFLGHRLIR-----------KRSRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 FLG + K + E ++ +++ F + L Sbjct: 303 GRPTKR-KFLGFCIHSTKAGVGFRPHIKSKKRLEQKIRYLTSRKRPGEFREIIKELNQTT 361 Query: 368 R 368 R Sbjct: 362 R 362 >UniRef50_B0I1N8 Reverse transcriptase homolog n=2 Tax=Pylaiella littoralis RepID=B0I1N8_PYLLI Length = 796 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. 
Identities = 102/388 (26%), Positives = 176/388 (45%), Gaps = 39/388 (10%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 + R L+++I + A + +++G GVD + L LQ + ++ Sbjct: 179 KNKDGRYGNLIQIIGSLSTIKLAYLMIKNNRGISAKGVD---DSSLDGISLRTLQAMSND 235 Query: 71 LLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 LSG + P RRVYI K LRPLGI + R +I+Q+++ M + I+E F S+G Sbjct: 236 TLSGRIKFSPVRRVYIKKEGKTDLRPLGISSPRQKIIQKSIEMVLTSIFEEIFLDCSHGS 295 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R RS H A++ ++L++ + + WV+EGD+ FD + H +MK +++++ + Sbjct: 296 RIGRSCHTALKNLQLKVGNV--STYSWVVEGDIKGCFDNIPHSQIMKRIKQKVDCLPTIN 353 Query: 190 LLWKTIKAGHIDVGLFR----------AASEGVPQGGVISPLLSNIMLNEFDQYLH---- 235 L+ K + AG+I + G QG ++SPL SNI+L+E D+++ Sbjct: 354 LVKKILDAGYILDEDLKKFGRKNAQVFKPDVGTTQGIILSPLFSNIVLHELDKFIEVILK 413 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQW-----------------KPAVAYCRYA 278 E + GK RK + + + + + + Y RY Sbjct: 414 EEFSKGKKRKANLEYRKLRYQIKKEDNLKKRRKLIEDCKSVPSKSIEDPDFKRLFYVRYV 473 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRS 337 DD+V++V G+ + IR+ L L L LNM+KTKI + G FLG +++ Sbjct: 474 DDWVILVSGSLSDSRLIRDRVSRKLR-ELGLELNMEKTKITSLRKGKCRFLGIDFFIRKN 532 Query: 338 RYGEMRVVSTIPQEKARNFAASLTALLW 365 + VS + + + T L Sbjct: 533 TNQHYKPVSLVKKNTNISIRQRFTPRLI 560 >UniRef50_B4WW73 Group II intron, maturase-specific domain family n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WW73_9SYNE Length = 479 Score = 234 bits (596), Expect = 5e-60, Method: Composition-based stats. 
Identities = 94/291 (32%), Positives = 147/291 (50%), Gaps = 33/291 (11%) Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 ++ P RRVYIPKSNGKLRPLGI + DR +Q + A+EP WE+ +YG+RP RS Sbjct: 21 WRVQPTRRVYIPKSNGKLRPLGISTIADRCLQMVVKTALEPEWEAKLEGSTYGYRPGRSC 80 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A++ KL + + WV++ DL FD + L+ + R L+ + + Sbjct: 81 HDAVK--KLYFMSLPKNKKHWVVDADLQGCFDNIDQSFLLAKLER----FPARGLVEQWL 134 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 ++G+++ G ++ + G PQG ISPLL+NI L+ ++ L Y S Sbjct: 135 QSGYVEYGRWQPTTAGTPQGNCISPLLANIALHGMEEALGITYDS--------------- 179 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 RGR R +YADDFV++ + +KA EA E+ + L L + +K Sbjct: 180 RGRINGKR---------GLAKYADDFVVMCE-SKADAEAAIEDLKPWLLER-GLSFSEEK 228 Query: 316 TKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 T++ H+ +GF FLG R + G+ R + + +R S+ A L + Sbjct: 229 TQVVHLREGFDFLGFNF-RHYATPGKTRTGWKLLTKPSRKSIKSIKARLKR 278 >UniRef50_B3JM52 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JM52_9BACE Length = 431 Score = 234 bits (596), Expect = 5e-60, Method: Composition-based stats. 
Identities = 90/349 (25%), Positives = 158/349 (45%), Gaps = 45/349 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 ++ + P L A +++KG+ GVDGV+ L+ + + L + + G+YQ Sbjct: 1 MITRVVHPFNLQRALEHVIANKGS--AGVDGVSIRELRKVFSEKKLQLIEAIKQGNYQVQ 58 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P + IPK NGK R LG+P +R++Q+A+ + P++E +F SYGFRP ++ A+ Sbjct: 59 PILGIEIPKGNGKTRLLGVPTTTERVLQQALAQTIAPLFEPEFSNYSYGFRPHKNARQAV 118 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + + + +++ DL ++FD V H LL+ + +++ M L+ K ++A Sbjct: 119 GQSRDYI----HSGLNHIVDIDLKNFFDEVDHCLLLNLIYQKVKCKATMQLIRKWLRAPI 174 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 G R +GVPQG +SPLLSNI+L++ D+ + R Sbjct: 175 KINGKLRKRRKGVPQGSPLSPLLSNILLHQLDKEMTRR---------------------- 212 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + RYADDF + K + Q +A R L+ LKL +N +K+ I Sbjct: 213 -----------GHKFVRYADDFSIYCK-SHNQAKATRVVIEKFLKNKLKLTINEEKSGI- 259 Query: 320 HVNDGFIF--LGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 F LG + + + + + ++ + L + K Sbjct: 260 --RKPIHFTILGFGFVPTYEKGSKNQYQLIVSEKAWKKLKERLKTITRK 306 >UniRef50_Q8HQ89 ORF777 (Fragment) n=1 Tax=Schizosaccharomyces octosporus RepID=Q8HQ89_SCHOT Length = 777 Score = 233 bits (593), Expect = 1e-59, Method: Composition-based stats. 
Identities = 92/372 (24%), Positives = 160/372 (43%), Gaps = 37/372 (9%) Query: 12 DPSLRIQRL-LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 + + L R + + + A S+ N L + Sbjct: 192 NDLNKFYELNYRQLYNLDLIKTAYLDLKSNSDNFEK---NNNNNSLDKISNEWAEKTCKS 248 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 L + P+R++ + N + R + I +DR++Q+A M ++ +E F S+GFR Sbjct: 249 LKDRSFIFKPSRKIMVTCRNNQKREISIANGKDRVIQQAFKMILQSAYEPIFLNYSHGFR 308 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 P RS H AI V+ TR W+I+GD+ +FD V H +L + ++I D + Sbjct: 309 PGRSPHSAIFEVRKW------TRITWMIKGDIKGFFDNVDHHILANLLSKKIKDKNLIDF 362 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW 250 WK ++AG+++ G ++ ++ G+PQGG++SPLL+NI L+E D Y+ + K Sbjct: 363 YWKLVRAGYVNNGNYKVSNLGIPQGGILSPLLANIYLHELDVYMEQLIQKYTVNKPVSKK 422 Query: 251 NNSIQRGRSTAVRENWQWKP--------------------------AVAYCRYADDFVLI 284 N R S ++ + P + Y RY DD+V+ Sbjct: 423 NKEHTRLFSEITAKSKKKFPDFELIKRMRKELRRIPSTIRDSSTGTRIYYNRYGDDYVIG 482 Query: 285 VKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMR 343 V G K E I+ L+ L + LN +K++I H+ +LG+ + R+ +Y E + Sbjct: 483 VVGPKNLAETIQNLVSDFLKNELLIDLNKEKSQITHLTSKSLKYLGYEIFRRNRKYSESQ 542 Query: 344 VVSTIPQEKARN 355 + R Sbjct: 543 LTYNSKTNTYRK 554 >UniRef50_Q7GEU5 Putative uncharacterized protein (Fragment) n=3 Tax=Candida parapsilosis RepID=Q7GEU5_CANPA Length = 933 Score = 233 bits (593), Expect = 1e-59, Method: Composition-based stats. 
Identities = 103/399 (25%), Positives = 178/399 (44%), Gaps = 47/399 (11%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 LA L ++R + L+ + A S+G+ T GVD V + L Sbjct: 308 LANIHGAKSELVMKRQMMLVNSIIFRLHAVDKLSHSRGSLTSGVDNVCIEGVDKD-RALL 366 Query: 65 QILRDELLS-----GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + + L + Y+ P +RV+IPK N KLRP+GIP L+DR +Q + + +EP+ E Sbjct: 367 VEIVEWLGTTVKHPKLYKSDPVKRVWIPKGNEKLRPIGIPTLKDRGLQYLINLVVEPLVE 426 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRG---------------------RWVI 158 +YGFRP RS +AI ++ L ++ +W++ Sbjct: 427 MTSDPHNYGFRPYRSTKNAIAYLRSHLHTIDSSKKGNHFTTASNVENNLLRLLPENKWIL 486 Query: 159 EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVI 218 + D+ +FD ++H L+ + + + ++ +K+G ID +F+ G PQGGVI Sbjct: 487 DADIKGFFDNINHDWLLNNLTLH---PKLLLIIKAWLKSGVIDGKIFQLTESGTPQGGVI 543 Query: 219 SPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYA 278 SP L N LN ++ + E K++ ++ G T V ++AY RYA Sbjct: 544 SPTLVNFTLNGLEKVVMEALYPLTKSKEQRI-RIKLKDGTYTCV------ASSLAYVRYA 596 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND---GFIFLGHRLIRK 335 DDFV++V+ + I+ L L LN +KTK+ ++D FLG+ + Sbjct: 597 DDFVVLVRSKHIMLTFIKPAIEKFLAER-GLNLNEEKTKLFRLSDPGCQLDFLGYTF-KY 654 Query: 336 RSRYGEMRVVSTIPQEKARNFAA-----SLTALLWKVRI 369 + ++ R V +R A + + + K+++ Sbjct: 655 QDKWSIKRHVFYHNHAGSRGIALYPNKTKVLSFIEKLKL 693 >UniRef50_D0VMZ3 Putative reverse-transcriptase protein n=1 Tax=Volvox carteri f. nagariensis RepID=D0VMZ3_VOLCA Length = 611 Score = 233 bits (593), Expect = 1e-59, Method: Composition-based stats. 
Identities = 106/380 (27%), Positives = 176/380 (46%), Gaps = 37/380 (9%) Query: 1 MQRKLATWAATDPSLRIQRL-LRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 MQ ++ + L +RLI + L ++T +KG TPGVD Sbjct: 31 MQHRIFKAKRNGKQKTVTGLQIRLINSLDAKLLSVLQVTALNKGRKTPGVDKQIVIAGPD 90 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 +L + ++ L G P RRV++PK + RPLGIP + DR Q +A+EP Sbjct: 91 KLRMA----KELRLDGT--AKPIRRVWLPKPGKDEKRPLGIPTIEDRAKQNLAKLALEPE 144 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS H AI + L L + +++ + D+ FD + H L+ Sbjct: 145 WEAIFEPNSYGFRPGRSCHDAIEAIFLNLR---HKKTKFIYDADIRKCFDRIDHGALIAK 201 Query: 178 VRRRISDARFMTLLWKTIKAGHIDV------GLFRAASEGVPQGGVISPLLSNIMLNEFD 231 + + + ++ +KAG ++ ++ G PQGG+ISPLL+NI L+ + Sbjct: 202 LN---TFPQMERQIYAWLKAGIMEGYANAPKSYEPESNLGTPQGGIISPLLANIALHGLE 258 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 +L E + + K + + N Q + + A RYADDFV+I K Sbjct: 259 NHLKE-FCATKVSNEIFQTRNRSQESK----------RKACGVIRYADDFVII-HENKQV 306 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL--IRKRSRYGEMRVVSTIP 349 +E E + L+ + L +N +K+K+ +GF FLG ++ IR+ G+ + P Sbjct: 307 IELCVTETKNWLQ-HIGLDINNEKSKLRDSREGFKFLGLQIILIRRGKMDGQGYRIKITP 365 Query: 350 QEKAR-NFAASLTALLWKVR 368 +K + + + ++ R Sbjct: 366 SKKNQSSLLEKVRKVIQNNR 385 >UniRef50_Q7YAJ3 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ3_CHAVU Length = 760 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. 
Identities = 104/334 (31%), Positives = 155/334 (46%), Gaps = 30/334 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + P L + A +++ TPG D +T + +L Sbjct: 287 LWIHSYKKPFRVYYDLGGFLRNIGIWIIAYKLSQ----KVTPGPD---QTTIDGTSLEKL 339 Query: 65 QILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 LRD ++ G ++ R IPK + RPLG+ +DRIVQ + M +EPI+E F Sbjct: 340 LKLRDAIVKGEFEWGATR---IPKPGKNEKRPLGVSCFQDRIVQEVLRMILEPIYEPRFS 396 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDT-VHHRLLMKAVRRRI 182 T S+GFRP RS H A+ + +W IEG L + V+H L K +RR I Sbjct: 397 TYSHGFRPGRSAHTALNVIMGTFHGA-----QWYIEGSLEAEGPGAVNHGTLYKIIRRTI 451 Query: 183 SDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 D R + L+ ++A H+ G A+ G + +SPLLSNI LNE D ++ Sbjct: 452 RDKRILKLIRSGLQAFFHMPHGEIEEATIGAARPFGLSPLLSNIYLNELDHFIETTIREY 511 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 + R+ A R N + + Y R+ADDF++ + G +++ +R E Sbjct: 512 NKAR------------RADASRLNPTNRRRMHYIRFADDFLVAISGPRSEAVKLRSELES 559 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK 335 L L+L L+MDKT I HV+ G FLGH + R Sbjct: 560 FLRDKLQLTLSMDKTHITHVSKGVPFLGHNIFRS 593 >UniRef50_B4WV39 Group II intron, maturase-specific domain family n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WV39_9SYNE Length = 586 Score = 227 bits (578), Expect = 6e-58, Method: Composition-based stats. 
Identities = 102/378 (26%), Positives = 160/378 (42%), Gaps = 55/378 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ + ++ L RL++ + R+T +KG T GVDG+ Sbjct: 27 LQQRIYRASQRGDDRTVKSLQRLLSTSWSARMLAVRRVTQENKGKKTAGVDGIASLKAPE 86 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 R + + D+ P RRV IPK + R LGIP +RDR Q +A+EP Sbjct: 87 RTELAKNLTLDK------NADPVRRVLIPKPGKSEYRKLGIPTMRDRAKQALAKLALEPQ 140 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS A+ V C + +WV++ D+++ FD + H L+ Sbjct: 141 WEALFAPNSYGFRPGRSPQDALEQVH----RCISQKPKWVLDADIAACFDQISHGPLVAR 196 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + +KAG +D G + G PQGG+ SPLL+NI L+ + + Sbjct: 197 LSQSHPSIARQC--KAWLKAGVLDNGQIQLTERGTPQGGIASPLLANIALHGLETLVKT- 253 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 R+ADDFV+ + +A +A + Sbjct: 254 ------------------------------TIRGAHLVRFADDFVVFHQDREAIFKA-QT 282 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV-------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQ 350 R L L+L DKT+I H + GF FLG + + R+R + + Sbjct: 283 LIRQWLASK-GLKLRADKTRIVHTLNGGAEYSTGFDFLGCHVRQYRTRRRRINKSQRPYK 341 Query: 351 EKARNFAASLTALLWKVR 368 + ASL L+ K+R Sbjct: 342 TLIKPSKASLKKLVTKLR 359 >UniRef50_O99479 Reverse transcriptase homolog n=2 Tax=Eukaryota RepID=O99479_PAVLU Length = 636 Score = 227 bits (578), Expect = 6e-58, Method: Composition-based stats. 
Identities = 94/356 (26%), Positives = 156/356 (43%), Gaps = 50/356 (14%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 I + L T S + PG+DG K +L L+ EL+S YQP Sbjct: 52 YTEIYDLQSLRLGLEATKS---SAAPGLDGDRKANFS---EAKLVALQAELISQKYQPKT 105 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 +RV IPK NG +R LGI + RD+IVQ ++ A++ + F S+GFRP H A++ Sbjct: 106 TKRVAIPKPNGGIRYLGISSQRDKIVQASIQNALQSKYGKHFSPDSFGFRPGLGCHDALK 165 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 V+ + W+I D+ FDT++H +L++ + R + D + L+ K IK G++ Sbjct: 166 HVRNTWQNI-----TWIISIDIEKCFDTINHTILLQIL-RPLVDQPTLELISKLIKVGYV 219 Query: 201 DVGL-----FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 ++ ++ G PQG +ISPLL N ++ D +L + + D + Q Sbjct: 220 EMFDTTCFPISESTIGTPQGSLISPLLCNFYMHILDTFLQKVLIPQWNVGDERSYVKGCQ 279 Query: 256 RGRSTAVRE--------------------------------NWQWKPAVAYCRYADDFVL 283 ++ V + N + Y RYADD V+ Sbjct: 280 NRKAMDVNDKAIVEAYPELEGQIQRIKHNRWVTEGKGSRDPNDANFRRLRYVRYADDIVI 339 Query: 284 IVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSR 338 G ++ I ++ LE +L ++N +K+ I H +G FLG + ++ Sbjct: 340 GFTGPYSEALVILDQVVKFLEKTLCFKVNKEKSSINHSETNGIKFLGTFIKYLPNK 395 >UniRef50_A8KXN1 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Frankia sp. EAN1pec RepID=A8KXN1_FRASN Length = 417 Score = 226 bits (577), Expect = 9e-58, Method: Composition-based stats. 
Identities = 95/363 (26%), Positives = 148/363 (40%), Gaps = 59/363 (16%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 + + +A + GA PG DGV +A + L +L + + SG Y P P V I Sbjct: 15 KQLVWDAWLKVKENGGA--PGPDGVTVEQFEANVKDRLYVLWNRMSSGSYFPGPVGAVEI 72 Query: 87 PKS--NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 PK G R LGIP + DR+ Q + +A+EP E FH SYG+RP RS A+ + Sbjct: 73 PKKGVKGGARTLGIPNVVDRVAQTVLKLALEPKVEPVFHRDSYGYRPGRSQRQALEVCRK 132 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VG 203 + WV++ D+ +FDTV L+KAV + + + +KA G Sbjct: 133 RCWS-----HDWVVDLDVRKFFDTVPWEKLLKAVAYHTDQKWVLMYVERCLKAPTKHADG 187 Query: 204 LFRAASEGVPQGGVISPLLSNIMLN-EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAV 262 + + G QGG SPL +NI L+ D ++ + Sbjct: 188 TLQERTMGTVQGGPFSPLAANIYLHWGLDAWMAREF------------------------ 223 Query: 263 RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN 322 P V + R+ADD V + Q +R+ L + L + DKT+I + Sbjct: 224 -------PTVPFERWADDVVFHCVSLE-QAREVRDAVVARL-VEVGLEAHPDKTRIVYCK 274 Query: 323 D----------GFIFLGHRLIRKRSRYG--EMRVVSTIPQ---EKARNFAASLTALLWKV 367 D F FL + + + G + R S IP ++ +F+ + L Sbjct: 275 DSNRGGDYENTSFTFLSYTFRPRVAWNGTQKKRFTSFIPGAAPDRVASFSREMRDLRLHR 334 Query: 368 RIS 370 R + Sbjct: 335 RTN 337 >UniRef50_C3EEI5 Group II intron reverse transcriptase/maturase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EEI5_BACTU Length = 539 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. 
Identities = 97/302 (32%), Positives = 160/302 (52%), Gaps = 18/302 (5%) Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P RRV+IPK NG RPLGIP + DRI Q+ +L ++PI E+ FH SYGFR RS Sbjct: 24 WYEPQTVRRVFIPKPNGDQRPLGIPTIWDRIFQQCILQVLDPICEAKFHKHSYGFRSNRS 83 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHH-RLLMKAVRRRISDARFMTLLWK 193 HHA+ K + G ++ I D+ +FD V+H +LL + I D + +++L + Sbjct: 84 THHALGRFKNLINIAGFSQ---CIAIDIKGFFDNVNHGKLLKQIWSLGIRDKKLLSILSR 140 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 +K+ G+ +G PQGG++SPLLSNI LNE D ++ ++ + +++ + ++ Sbjct: 141 LLKSNIGGEGI---LEKGTPQGGILSPLLSNIALNELDWWVSNQWETFRSKYKYFMTSSM 197 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 + + T ++E RYADDF ++ + T++Q + + L+ LKL +NM Sbjct: 198 YKALKKTKLKE-------CYIIRYADDFKILCR-TRSQANRMFLAVKQFLKERLKLDINM 249 Query: 314 DKTKIPHVNDG-FIFLGHRL-IRKRSRYGEMRVVSTIPQEKA-RNFAASLTALLWKVRIS 370 +K+KI + FLG R+ KR V + +KA +N + + K++ Sbjct: 250 EKSKIISLKRKETEFLGFRIKFIKRGGTKHGYVAHSNMTDKAFKNAHFKIRESIKKIQKR 309 Query: 371 GE 372 Sbjct: 310 AC 311 >UniRef50_Q94Z25 Orf557 n=2 Tax=Pylaiella littoralis RepID=Q94Z25_PYLLI Length = 557 Score = 226 bits (575), Expect = 1e-57, Method: Composition-based stats. 
Identities = 107/387 (27%), Positives = 180/387 (46%), Gaps = 38/387 (9%) Query: 1 MQRKLATWAATDPSLRIQRLLR-LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q+K+ + + RL + L+ E A A R +S++G+ T G DG Q + Sbjct: 23 LQQKMVVAYKNNNWSEVFRLQQQLMHSFEGRATAVRKVVSNEGSKTKGPDGKTWKKSQDK 82 Query: 60 LAVELQILRDELL--SGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 + +RD LL SG Y+ RRV+IPKS+ G+LRPLGIP + DR +Q +L ++P Sbjct: 83 YRA-IADIRDHLLTKSGSYKAGAVRRVWIPKSSPGELRPLGIPNMIDRALQALVLSCLDP 141 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I E + + SYGFR RS + AI+ ++ L G R W + D+S FD + H L K Sbjct: 142 IVEENSDSCSYGFRKYRSTNDAIQRIRFILDKAGAPRYIW--DADISKCFDNISHTFLNK 199 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 VR + + L+ +KA I+ G S G PQGGV+SPLL N+ LN + + + Sbjct: 200 VVRENL-CRKGCELVEAWLKAPIIEKGSKSYPSRGTPQGGVLSPLLCNMTLNGLENVIRD 258 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ-VEAI 295 S + R + RYADDF++ K+Q ++ Sbjct: 259 GLPSSSSTAGRKLKGRWV--------------------VRYADDFIITNPIGKSQFIDND 298 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKR-----SRYGEMRVVSTIP 349 + L ++ K++I + + F FLG ++ +++ ++ + R+V + Sbjct: 299 IPLINKFMADR-GLEISEKKSRIIDLEKESFNFLGWKISQRKRNISVNKASDSRLVLVVE 357 Query: 350 --QEKARNFAASLTALLWKVRISGEIL 374 +E + + + + G ++ Sbjct: 358 PTKESIKRLKSRIKLEFRSNKPIGALI 384 >UniRef50_D2FQY0 Regulatory protein GntR n=2 Tax=Staphylococcus aureus subsp. aureus RepID=D2FQY0_STAAU Length = 431 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. 
Identities = 106/350 (30%), Positives = 163/350 (46%), Gaps = 47/350 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 ++ L+ + + +A + +KGA PG+DG+ + +Q A ++ +LL G Y+P Sbjct: 11 MMELVVREHNIQKAIKKVKKNKGA--PGIDGMKVSEIQGHFAQYFPEIKQKLLEGTYKPQ 68 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 ++V IPK+NGK R LGIP +RDR++Q+A+ +EP + F S+GFRP RS A+ Sbjct: 69 AVKKVEIPKANGKKRVLGIPVVRDRVIQQAIKQVIEPSIDRTFSKHSHGFRPNRSTGTAL 128 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + E ++ DL FD ++H LM R I D T + ++++ G Sbjct: 129 KEC----ASYYEAGYTIAVDCDLKQCFDNINHDKLMYLFERHIKDKAVSTFIRRSLQVGA 184 Query: 200 IDV-GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 ID+ G G PQGGVISPLL NI L+E D+ L +R+ Sbjct: 185 IDLSGEVAERKIGAPQGGVISPLLCNIYLHELDKELEKRH-------------------- 224 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDFV+ VK TK E + + + + +LKL +N DK+K+ Sbjct: 225 -------------HRFVRYADDFVIFVK-TKRAGERVMDSIKTFIHKTLKLEVNNDKSKV 270 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 FL S+ + K RN +L L + R Sbjct: 271 GSPTR-LKFL----SCLMSKVNGTYRFRPTTEAK-RNLKRNLKWLTRRSR 314 >UniRef50_Q1D1V6 Group II intron, maturase n=2 Tax=Myxococcales RepID=Q1D1V6_MYXXD Length = 436 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. 
Identities = 106/375 (28%), Positives = 170/375 (45%), Gaps = 58/375 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 ++ ++ A + P+ R L + +PE L A + GA PG DG+ ++ R Sbjct: 11 LRARIGHRAKSAPTHRFWGLYVHVLKPEVLEAAYLEARRNGGA--PGQDGITFEHIEERG 68 Query: 61 AV-ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L + +EL +G Y+P P RR IPK GK+R + IP++RDR+VQ A+ + +EPI+E Sbjct: 69 RAGFLGAVAEELRTGTYRPRPYRRREIPKEGGKVRVISIPSIRDRVVQGALRLVLEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF S+G RP RS H AI TV+ L ++ YFD++ H L++ V Sbjct: 129 ADFSGSSFGARPGRSAHEAIDTVRQGLRRRRHRVVDVDLKA----YFDSIRHAPLLERVA 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RR+ D + L+ + +++ G+PQG +SPLL+NI LN+ D L Sbjct: 185 RRVQDGEVLALVKQFLRS---------TGDRGIPQGSPLSPLLANIALNDLDHVLD---- 231 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + + Y RY DD V++ ++ Sbjct: 232 ---------------------------RGRGFLTYARYLDDMVVLAPDSEKGRRWAARAL 264 Query: 300 RGVLE--GSLKLRLNMDKTKIPHVND---GFIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 + + +L + LN +KT+ + D F FLG KRS +T P+ K Sbjct: 265 ERIRQEAEALGVSLNKEKTRTVTMTDRNASFAFLGFDFRWKRSPKTGTWYPNTNPRRK-- 322 Query: 355 NFAASLTALLWKVRI 369 +T +L +VR Sbjct: 323 ----KVTEVLRRVRH 333 >UniRef50_C0JWS6 Putative reverse transcriptase and intron maturase n=1 Tax=Pycnococcus provasolii RepID=C0JWS6_9CHLO Length = 583 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. 
Identities = 101/382 (26%), Positives = 161/382 (42%), Gaps = 39/382 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLIT-QPEWLAEAARIT-LSSKGAHTPGVDGVNKTMLQA 58 +Q ++ + +++ L + I + A R +++G + GVD + Sbjct: 21 LQVRIFKASQKQQQNKVRYLQKKILRSIDAKVLAVRRVAQTNRGKRSAGVDRKRVLTSEQ 80 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 +L + ++ L G + P RR K+ RPLGIP L DR VQ+ + A+EP Sbjct: 81 KLNLA----QNLKLKG--KAKPIRRTCSTKAGKVDKRPLGIPTLEDRAVQQLVKFALEPE 134 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP R+ AI + L R +V++ D+ FD + H L+ Sbjct: 135 WEAKFEPNSYGFRPGRACQDAIMALFQHLRG----RSLYVLDADIKKCFDRIDHDKLLAK 190 Query: 178 VRRRISDARFMTLLWKTIKAGHIDV--------GLFRAASEGVPQGGVISPLLSNIMLNE 229 + + + +KAG I+ G PQGGVISPLL+NI L Sbjct: 191 LN---TFPLLENQIKVWLKAGVIEGYSNSYKNYNKVTPNLLGTPQGGVISPLLANIALTG 247 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 + L + Y+ N + +G S + + VA RY DDFV++ K + Sbjct: 248 LEDEL------------KHYYANHLYKGSSRIGLSDKLTQ--VAVIRYVDDFVVLHKD-E 292 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIP 349 + +R+ L ++ L L +KTKI GF FLG +I S + + I Sbjct: 293 NVIRQLRDHTAKWLYTTMGLELLPEKTKILDTKQGFTFLGFHIISIYSGENKYKCKIHIS 352 Query: 350 QEKARNFAASLTALLWKVRISG 371 E N + + R + Sbjct: 353 HESKNNLLSKTREIFRSNRSAS 374 >UniRef50_Q6TFE1 Putative group II intron-encoded maturase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFE1_CAETA Length = 341 Score = 224 bits (570), Expect = 5e-57, Method: Composition-based stats. 
Identities = 95/354 (26%), Positives = 165/354 (46%), Gaps = 50/354 (14%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 +A A + S L L+ E+L+ +K G+D V+ +A ++ Sbjct: 16 IAECARGNRSFEFTSLAHLL-DAEFLSYCYYGLDRNK---AVGIDKVSWQEYGVDVADKI 71 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 + L L ++P+P+RRVYIPK NG+ RPLGI A+ ++IV+ +++ ++ I+E DF Sbjct: 72 ENLVMRLKRKTFKPMPSRRVYIPKGNGESRPLGISAIENKIVESGIMLILQSIYEQDFLE 131 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP R+ H A+ V + ++E D+ +FD V H L V+ R+ D Sbjct: 132 CSYGFRPGRNTHQALNEVDKAIMTQPVNH---LVEADIKGFFDNVSHEKLKDFVKIRVKD 188 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKA 243 + L+ ++AG+ID G+ +G PQG ++SP+L+NI L+ D++ + Sbjct: 189 TSLLHLIDCFLRAGYIDKGVLIDTEKGTPQGSILSPMLANIFLHYVLDKWFED------- 241 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 V+++ + + RYADDFV +++ + I Sbjct: 242 -----------------TVKQHVEGFCRL--VRYADDFVCLIQ-YQEDARKIERALANRF 281 Query: 304 EGSLKLRLNMDKT---------KIPHVNDG-----FIFLGHRLIRKRSRYGEMR 343 +L+L+ +K+ K+ +N G F FLG ++R G + Sbjct: 282 -NKHELQLHPEKSRNISFGRFEKLNALNAGRKANTFDFLGFTHFCDKTRKGYFQ 334 >UniRef50_B8R160 Reverse transcriptase n=2 Tax=Volvox carteri RepID=B8R160_VOLCA Length = 607 Score = 223 bits (569), Expect = 7e-57, Method: Composition-based stats. 
Identities = 96/380 (25%), Positives = 169/380 (44%), Gaps = 35/380 (9%) Query: 1 MQRKLATWAATDPSLRIQRLLRLIT-QPEWLAEAARITLS-SKGAHTPGVDGVNKTMLQA 58 +Q+++ + + R+ L +L+ P A +I + +KG +T G+DG T + Sbjct: 41 IQKRIFKASLAGDTKRVWFLQKLLLRNPHAKLIAVQIVTTLNKGKNTAGIDGYKATTSEE 100 Query: 59 RLAVELQILRDEL-LSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEP 116 +L +L +L ++G + RR +IPK + +PLGI ++DR +Q +A+EP Sbjct: 101 KL-----LLAKKLQING--KANLVRRTWIPKPGKTEKQPLGIYTIQDRALQALCKLALEP 153 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 WE+ F SYGFRP R AI + L ++V + D+ FDT+ H L+ Sbjct: 154 EWEAKFEPNSYGFRPGRRAQDAIEAIFQNL---HHDADKYVFDADIRKCFDTIDHAALLS 210 Query: 177 AVRRRISDARFMTLLWKTIKAGHID----VGLFRAASEGVPQGGVISPLLSNIMLNEFDQ 232 ++ + + +KAG D G PQGG+ISPLL+NI L+ ++ Sbjct: 211 KLK---TFPLMEKQISAWLKAGIFDQYANTPKVSTPEMGTPQGGIISPLLANIALHGLEE 267 Query: 233 YLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQV 292 +L + A R + A+ RYADDFV+I + + Sbjct: 268 HL-----------LNMVSRKEFPKPHPKAARGAKAKRAALGIIRYADDFVIIHRNL-DIM 315 Query: 293 EAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIP-QE 351 + + E + L + L ++ +K+ + + F FLG ++ R + V P +E Sbjct: 316 KTVITETKTWL-AQMGLAISEEKSALRLASKSFKFLGFQVAYVRDKIQNKYRVRITPSRE 374 Query: 352 KARNFAASLTALLWKVRISG 371 + + ++ K + S Sbjct: 375 NVKLIISKTRNIIQKNKASS 394 >UniRef50_A1BI39 CRISPR-associated protein Cas1 n=5 Tax=Chlorobiaceae RepID=A1BI39_CHLPD Length = 731 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. 
Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 42/317 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L + PE + +A S+ G PG D + RL L+ L LL+G Y+ Sbjct: 4 LYNQMAMPETIFQAWYKVASNDGR--PGWDNTSIQDYSLRLEENLKSLSHALLTGTYRQS 61 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P ++ + K +GK R L IP + DR+ Q A + + PI E++ ++ +RP S A Sbjct: 62 PLLKLVMLKPDGKERVLLIPGVIDRVAQTAASIVLSPIIEAELGNCTFAYRPGISREGAA 121 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 R + +WV++ D+ ++FD V H LL + + + D ++LL + + A Sbjct: 122 REI----DRLHREGYQWVLDADIRNFFDNVRHDLLFQRLVELVDDKEMISLLHRWLTAEI 177 Query: 200 IDVGLFRA-ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 +D R + G+PQG ISP L+N+ L+ FD+ + ++ Sbjct: 178 VDGLNPRTRNTMGLPQGCPISPALANLYLDRFDETMEQQ--------------------- 216 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 R+ADD++++ K T+ + EA + L LKL L+ DKT+I Sbjct: 217 ------------GFKLVRFADDYLVLCK-TRPKAEAALKLSESAL-AELKLELHSDKTRI 262 Query: 319 PHVNDGFIFLGHRLIRK 335 +GF +LG+ IR Sbjct: 263 TTFAEGFKYLGYLFIRS 279 >UniRef50_A7UDN1 Putative reverse transcriptase n=2 Tax=Candida zemplinina RepID=A7UDN1_CANZE Length = 445 Score = 223 bits (568), Expect = 1e-56, Method: Composition-based stats. 
Identities = 101/390 (25%), Positives = 171/390 (43%), Gaps = 52/390 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAE-AARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ ++ +++ L R+ + + E A SS G+ TPGVD Sbjct: 12 MQSRIIAAVKDQNWTKVRDLQRMTVRSGYARELAVDTIASSPGSKTPGVDNFII----KN 67 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + ++++ Y P P +R+YIPK+NGKLRP GIP + DR +Q A +PI E Sbjct: 68 EMDKAKMIKVTGKIEQYNPKPVKRIYIPKANGKLRPTGIPTMADRAMQCTFSFATQPIAE 127 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQ--LTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 + S+GFRP RS A + + W ++ D+ +FD + H ++ Sbjct: 128 TLGDQHSFGFRPNRSTIDAFNHLYRAQFIKSSNAPVNNWAVDADIKGFFDNISHEWILNN 187 Query: 178 VRRRISDARFMTLLWKTIKAGHID----VGLFRAASEGVPQGGVISPLLSNIMLNEFDQY 233 ++ +L K KAG I+ V F + GVPQGGV+SP+++N++ + +Q+ Sbjct: 188 IKIEP------RMLAKFTKAGFIEYNNQVNEFHDTNTGVPQGGVMSPMMANMVTDGLEQH 241 Query: 234 LHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVE 293 +++ K R +N +R V + RYADDFV+I + + Sbjct: 242 IYD------GTKARNIYNGMTKR---------------VHFIRYADDFVIITP-YEWVAQ 279 Query: 294 AIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEK 352 L+ L LNMDKT I + D FLG+ ++ + + ++ EK Sbjct: 280 RTMPIVNSFLKER-SLSLNMDKTHILDISKDNLDFLGYT-----TKRVDGKSMTMPNPEK 333 Query: 353 ARNFAASLTALLWKVRI------SGEILLG 376 + F ++ + K + ++ G Sbjct: 334 VKLFTKNIRTKIKKCNSREDMMATNSVIRG 363 >UniRef50_Q8GAR1 Reverse transcriptase n=20 Tax=Enterobacteriaceae RepID=Q8GAR1_ECOLX Length = 410 Score = 223 bits (567), Expect = 1e-56, Method: Composition-based stats. 
Identities = 97/352 (27%), Positives = 148/352 (42%), Gaps = 55/352 (15%) Query: 31 AEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSN 90 + +KGA PG DG M + L + + L SG + P P IPKSN Sbjct: 14 WASYLDVRRNKGA--PGCDGQTLKMFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSN 71 Query: 91 GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG 150 GK R LGIP + DRI Q A+ + ME + FH SYG+RP +S H A++ ++ Sbjct: 72 GKERILGIPTVSDRIAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCW--- 128 Query: 151 ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH--IDVGLFRAA 208 R W++E D+S++FD V H L++KA+ + + ++A + G Sbjct: 129 --RYSWILEVDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGELITR 186 Query: 209 SEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQ 267 + G PQGGVISPLL+N+ L+ FD ++ Y Sbjct: 187 TRGTPQGGVISPLLANLFLHYAFDLWMEREY----------------------------- 217 Query: 268 WKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND---- 323 V + RYADD V+ + ++ + L LN KT I +++ Sbjct: 218 --RGVPFERYADDIVVHC-SRMSDATRLKNRLSERF-SEVGLVLNAGKTNIAYIDTFKRR 273 Query: 324 ----GFIFLGHRLIRK--RSRYGEMRVVSTIPQEKARNFAASLTALLWKVRI 369 F FLG+ + ++ GE+ A +T + K RI Sbjct: 274 NVATSFTFLGYDFKVRTLKNFKGELYRKCMPGASNAA--MRKITETIKKWRI 323 >UniRef50_A8LGE6 RNA-directed DNA polymerase n=1 Tax=Frankia sp. EAN1pec RepID=A8LGE6_FRASN Length = 351 Score = 222 bits (566), Expect = 1e-56, Method: Composition-based stats. 
Identities = 85/292 (29%), Positives = 132/292 (45%), Gaps = 39/292 (13%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG--VDGVNKTMLQARLAVELQILRDEL 71 L ++R+ R + A S+KGA TPG VDG++ + + D + Sbjct: 38 GLPLERVYRQLFNAALYLVAYGRLYSNKGAMTPGETVDGMSLATID--------RIIDAM 89 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 Y+ P +RV+IPK NGK RPLG+P D++V + + +E +E F S+GFRP Sbjct: 90 RHERYRWKPVKRVHIPKKNGKKRPLGLPTWSDKLVAEVVRLLLEAYYEPTFSDHSHGFRP 149 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 R+ H A+ V W IEGD++ F+ + H++++ V RI D RF+ LL Sbjct: 150 GRACHTALGEVVDVWK-----GTHWFIEGDIARCFEELDHQVMLDTVGERIHDNRFLGLL 204 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 ++AG+++ + A G QGG SP+LSNI L+ D ++ L R +R N Sbjct: 205 KAMLRAGYLEDWKWGATLSGTVQGGPASPILSNIYLDRLDSFVVTHLLPDYNRGERRASN 264 Query: 252 NSIQRGRSTAVRE------------------------NWQWKPAVAYCRYAD 279 + Q+ R + + Y RYAD Sbjct: 265 PAYQKIEYAIARARRHGDRPALRRLRQQRRQLPSQDPHDPSYRRLRYVRYAD 316 >UniRef50_A6YE98 Putative reverse transcriptase and intron maturase n=1 Tax=Chlorokybus atmophyticus RepID=A6YE98_CHLAT Length = 845 Score = 221 bits (562), Expect = 4e-56, Method: Composition-based stats. 
Identities = 93/362 (25%), Positives = 156/362 (43%), Gaps = 47/362 (12%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHT---PG--VDGVNKTMLQARLAVELQ 65 + P + L +L+ A + + G+ T G +DG + L+ Sbjct: 292 SSPHFQPSGLWKLVRDINLWIAAYKKLAPNPGSLTKSGAGGKIDGTSLKSLE-------- 343 Query: 66 ILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 LRD + G +Q + +VY K G PL IP +DR+VQ + +E ++E F Sbjct: 344 WLRDRVSEGKFQFGRSEKVYTLKPKVGNGIPLDIPEFQDRLVQEVVRTILEVLYEPQFLE 403 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 S+GFRP +S H A+ V+ + W I+GD+S FDT+ + L+ +R+++ D Sbjct: 404 SSHGFRPNKSQHTAMVDVRQKFKGV-----VWCIKGDISKSFDTIDKKKLITQMRKKVKD 458 Query: 185 ARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 +F L++K +K+ + G+ +G+ Q G+ SPLL NI L++ D ++ Sbjct: 459 EKFCHLIYKGLKSRLLMPEGIMEVLKKGISQKGICSPLLCNIALHQLDLFIERLKKIVNK 518 Query: 244 RKDRWYWNNSIQ-----------------RGRSTAVRENWQWK---------PAVAYCRY 277 + Q RG A++ + + Y RY Sbjct: 519 VDSSHIVSQPYQSQMVPQRERAAIGTGDWRGAIKAIKMARKMGYGDHQDPNLRRLTYVRY 578 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKR 336 ADDF++ V G K E I E L+ L L LN +K I ++ FLG ++ + Sbjct: 579 ADDFLIGVTGPKKLAERIGELVSRFLKIRLNLTLNQEKIVISKLSGKKIPFLGFQIYQPP 638 Query: 337 SR 338 + Sbjct: 639 LK 640 >UniRef50_UPI00016C4F75 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4F75 Length = 257 Score = 219 bits (559), Expect = 8e-56, Method: Composition-based stats. 
Identities = 78/211 (36%), Positives = 116/211 (54%), Gaps = 7/211 (3%) Query: 9 AATDPSLRI-QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 AA +PS + RL+ + Q L +A ++KGA PG+DG+ +A Q L Sbjct: 45 AALEPSRALTDRLMEEVCQRGNLNQAYSRVKANKGA--PGIDGMTVEDSLRWIAEHKQEL 102 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 LL G Y+P P R V IPK G R LGIP + DR+VQ+A+L + + + F SY Sbjct: 103 LSSLLDGSYRPSPVRGVLIPKPGGGERQLGIPTVVDRLVQQAILQVLTRLLDPTFSESSY 162 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFRP +S H A+ K + D V++ DL +FD V+H +LM + RR+SD R Sbjct: 163 GFRPGKSAHQALLKAKEYVADGRAI----VVDVDLEKFFDRVNHDILMARLARRVSDTRL 218 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVI 218 + ++ + ++AG + G+ A EG PQGG++ Sbjct: 219 LRIVRRFLEAGLMQDGVCVARHEGTPQGGIV 249 >UniRef50_Q9G8T4 Orf621 n=1 Tax=Rhodomonas salina RepID=Q9G8T4_RHDSA Length = 621 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 99/409 (24%), Positives = 168/409 (41%), Gaps = 57/409 (13%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGV----DGVNKTMLQA 58 +KL + + L++ + L A +KGA T G D + + Sbjct: 20 KKLYELNKENTNKSNDNLMKFLYDEGMLWNAVEKLKKNKGAATFGPTNNKDRKSIEIDGL 79 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 L ++ L +EL ++ P RR+ +PK N K RPLGI + DRIVQ + + I+ Sbjct: 80 NLGQ-IKRLSEELREETFKWSPTRRIEVPKKNNKRRPLGIFSFEDRIVQEGIRTILNAIY 138 Query: 119 ESDFH-TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E F ++GFRP S A+ +K IEGD+ ++ ++H +LMK Sbjct: 139 EPTFSGNNNHGFRPRLSSETALELLKRN-----RKGKTHAIEGDIKKAYEGINHNILMKI 193 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH-- 235 ++++I+D +F+ ++ + +K G + VPQG + SP+L NI +NEFD+ + Sbjct: 194 LKKKINDKKFLRIIEEGLKCGIEKNRKIYNSITVVPQGSICSPILFNIYMNEFDEAIKTI 253 Query: 236 ------ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAV----------------- 272 S +RK N + +S + N + + + Sbjct: 254 IEEIFTRLNESRDSRKSSSTMNKEYKILKSRSDEMNKKIRERLKQESPFIKTLLKSHKKI 313 Query: 273 
-------------------AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 Y RYADD+++I G E I+ ++ LKL L Sbjct: 314 RWKLKRKKAIDYGKRTLEYFYLRYADDWIIITNGNVRVCEEIKIRISTWIKEELKLELEQ 373 Query: 314 DKTKIPHV-NDGFIFLGHRLI-RKRSRYGEMRVVSTIPQEKARNFAASL 360 KT+I ++ D FLG ++ +R G + I + N + Sbjct: 374 SKTRITNMEKDPIKFLGFSIMTPINTRIGTIEKKIGIRTTRRTNLGPRI 422 >UniRef50_Q10VN2 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Cyanobacteria RepID=Q10VN2_TRIEI Length = 437 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 95/349 (27%), Positives = 162/349 (46%), Gaps = 33/349 (9%) Query: 21 LRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 ++L+ + L R+T ++G G D ++ + ++L L +Q Sbjct: 1 MKLMLRSYSNLLLSVRRVTQENQGIRRMGRDAQTAKTSVEKVKLVKEMLTYRL----WQA 56 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 PA+RVYIPK+N + PLGIP +++R+ Q + +EPIW+++F T SYGF P RS H Sbjct: 57 KPAKRVYIPKANRQQGPLGIPTVKNRVAQAVVKNGLEPIWDAEFETNSYGFHPGRSCHDP 116 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + ++L + + W+++ D+ FD + H ++KA+ L+ + +KAG Sbjct: 117 LEQFWIRLQ---KGKDTWILDVDIKQDFDNITHEYILKAIGEIPG----RELIKQWLKAG 169 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 +++ +F G G+ISPLL+NI + ++ L R+ + Q R Sbjct: 170 YLEAEVFHKTEGGTSSRGIISPLLANIAFDGMERLLA-----------RYKTVKTYQCTR 218 Query: 259 STAVRENWQWKP--AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 T E + K + RYADDF++ + ++ ++AI L L LN DKT Sbjct: 219 PTTDEEYTKKKKLDKYGFIRYADDFIITAR-SEEDIKAIIPTIEKWLSER-GLELNKDKT 276 Query: 317 KIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLW 365 + H+ GF FLG + R +EK + F + L Sbjct: 277 NLVHIEQGFNFLGFNV-----RQFNGSCFIVPQKEKVKEFLTLIRGWLK 320 >UniRef50_D2LF37 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LF37_RHOVA Length = 293 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 86/312 (27%), Positives = 138/312 (44%), Gaps = 48/312 (15%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 L + P L +A +KG PG DGV + VEL+ LR E L+G Y+P Sbjct: 22 LEKVVAPACLQQAWTRVRKNKGG--PGGDGVTIEIFAQNAEVELEKLRAETLAGIYRPRK 79 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 R +PK G R L IP++ DRI+Q A ++++ + F + S+ +R R V A+ Sbjct: 80 VRHAIVPKPKGGERKLTIPSVVDRILQTATMLSLGQTVDHHFSSASWAYREGRGVDDALA 139 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 ++ + W + D+ YFD + H+ L+ + + D R + L+ +++ Sbjct: 140 DLR----RLRNSGLFWTFDADIMQYFDRILHKRLIDDLFIWVDDLRIVRLIQLWLRS--- 192 Query: 201 DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRST 260 F G+ QG ISPLL+N+ L+ D+ L Sbjct: 193 ----FSYWGRGIAQGAPISPLLANLFLHPMDRLLE------------------------- 223 Query: 261 AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH 320 +A RYADDFV++ + +KA + + L L+LNM KT+I Sbjct: 224 --------LEGLASVRYADDFVVLCR-SKALAQKAQLIVASHLAAR-GLKLNMSKTRILA 273 Query: 321 VNDGFIFLGHRL 332 ++ FIFLG + Sbjct: 274 PSEAFIFLGQTV 285 >UniRef50_Q8YWX6 Alr1468 protein n=4 Tax=Cyanobacteria RepID=Q8YWX6_ANASP Length = 668 Score = 216 bits (549), Expect = 1e-54, Method: Composition-based stats. 
Identities = 86/309 (27%), Positives = 142/309 (45%), Gaps = 41/309 (13%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR 83 + E L A + G+ + GVDG++ + ++ +LQ + +L Y PA+ Sbjct: 1 MFTIEHLNFAWLQVRA--GSKSAGVDGISVDLFESMATEQLQNIAYQLKEETYTANPAKG 58 Query: 84 VYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVK 143 YIPK NG R +GI +RDRI+QR +L + E F SY +RP S+ A++ Sbjct: 59 FYIPKKNGTKRLIGIHTVRDRIIQRLLLDELYFPLEDTFLDCSYAYRPGHSIQQAVQ--- 115 Query: 144 LQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVG 203 L + + +W+I+ D++ +FD + LL+ + + + LL + +K+G I G Sbjct: 116 -HLYGYYQYQPKWIIKADVADFFDNLSWALLLTYLEELSLEPSLLQLLEQQLKSGIIIAG 174 Query: 204 LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 +R +GV QGG++S L+N+ L FD+ + Sbjct: 175 QYRNFGKGVLQGGILSGALANLYLTSFDRKCLSQ-------------------------- 208 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND 323 + RY DDFV+ + + I ++ G L G + L L +KT+I ND Sbjct: 209 -------GINLVRYGDDFVIAC-NSWLEANRILDKITGWL-GEVYLTLQPEKTQIFTPND 259 Query: 324 GFIFLGHRL 332 F FLG+R Sbjct: 260 EFTFLGYRF 268 >UniRef50_Q8RSV8 Maturase n=1 Tax=uncultured marine bacterium RepID=Q8RSV8_9BACT Length = 386 Score = 213 bits (543), Expect = 6e-54, Method: Composition-based stats. 
Identities = 93/336 (27%), Positives = 144/336 (42%), Gaps = 57/336 (16%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 GVD V+ + + L L + L SG Y P P + V IPK +GK R LGIP + DR+ Sbjct: 2 GVDHVSMEAIASNPRKYLYPLWNRLSSGSYFPPPVKLVPIPKGDGKERMLGIPTIIDRVA 61 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q + +E I E FH S+G+RP +S H A+ + +V++ D+ +F Sbjct: 62 QEVIKAELEVIVEPRFHPSSFGYRPHKSAHEALEQCAKNSWERW-----YVVDLDIKGFF 116 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNI 225 D + H +M +R+ + + + +K D VG +A +G PQGGVISPLL+N+ Sbjct: 117 DNIDHEKMMGILRKHTNKKHILLYCDRWLKTPMQDRVGGVQARMKGTPQGGVISPLLANL 176 Query: 226 MLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLI 284 L+E FDQ++ +P + + RYADD V+ Sbjct: 177 YLHEAFDQWI-------------------------------STTQPRIVFERYADDIVIH 205 Query: 285 VKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-------------GFIFLGHR 331 + + Q I ++ + L+ S L L+ DKTKI + F FLG Sbjct: 206 TRSME-QSHFILDKLKARLK-SYSLELHPDKTKIVYCYRTARFHKEGKEIPVSFDFLGFT 263 Query: 332 LIR----KRSRYGEMRVVSTIPQEKARNFAASLTAL 363 K + I ++ + L L Sbjct: 264 FKPRLCLKSNGEKFWGFRPAISKKSEKRILGELRKL 299 >UniRef50_Q35064 Atp9 intron ORF n=1 Tax=Marchantia polymorpha RepID=Q35064_MARPO Length = 710 Score = 213 bits (543), Expect = 7e-54, Method: Composition-based stats. 
Identities = 98/378 (25%), Positives = 155/378 (41%), Gaps = 80/378 (21%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 I + L + KG PG+DG K + L+ L EL Y P P Sbjct: 260 YEDIYNIDNLRAGYKRL---KGNVAPGIDGRTKADM---TDKALEKLSKELRRQAYAPKP 313 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 A+R+ I K +G RPL I + D++VQ + +EP +ES F S+GFRP RS H A+R Sbjct: 314 AKRIIITKPDGGSRPLSIASTVDKVVQSTLKELVEPHFESLFRDSSHGFRPGRSCHKALR 373 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 ++ T W+++ D+ FD +HH LL+K + + L+ K + AG+I Sbjct: 374 DLR-----YSWTALTWLVQIDIKKDFDKIHHDLLIKEMESVLRSKALQDLMRKLLNAGYI 428 Query: 201 DV----GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE----RYLSGKARKDRWYWNN 252 DV + +EGV QG +ISPL +NI L++ D Y+ + Y G R + Sbjct: 429 DVYNLTDRTQYNTEGVTQGSIISPLCANIFLHKLDCYVEDILIPNYNVGNMRPASAEYKK 488 Query: 253 SI-------------------------------QRGRSTAVRENWQWKPAVAYCR----- 276 + ++ + + + + + + + R Sbjct: 489 RLNIHSKDKAFFKYYTELEQAIKNIKHLKWINREQQKKSILVKKKYFFENLFFFRNPKVS 548 Query: 277 ------------------------YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 YAD+ +L V G+K IR+ + L+ LKL +N Sbjct: 549 CPLGRRTLLEMAEKEGLKRLKYLRYADNIILGVIGSKQDALDIRKAVQNFLQEELKLDIN 608 Query: 313 MDKTKIPHVNDG-FIFLG 329 K+KI H +LG Sbjct: 609 EQKSKILHAKSEMAKYLG 626 >UniRef50_B1C301 Putative uncharacterized protein n=6 Tax=Clostridium spiroforme DSM 1552 RepID=B1C301_9FIRM Length = 270 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. 
Identities = 81/221 (36%), Positives = 127/221 (57%), Gaps = 6/221 (2%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 ++ T+ + ++LL I + + +A + +S+KG+ GVD + ++ A Sbjct: 39 ISHTKNTNRFVVHEKLLETIMEDANIEKAIQRVMSNKGSG--GVDKMQVAEVRTHFAQHW 96 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L+ ++ GHY P +RV IPK NGK R LGIP + DR++Q+A++ + PI+E F Sbjct: 97 SYLKKLIMEGHYSPQAVKRVEIPKDNGKKRELGIPTVTDRVIQQAIVQVLTPIFEPQFSD 156 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP R+ H A+R V + R+ ++ DL YFDTV+H L++ + + I D Sbjct: 157 NSYGFRPRRNAHQAVRKV----VEYANEGYRYTVDLDLEKYFDTVNHSRLIQILSQTIKD 212 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNI 225 R ++L+ K + AG I F ++GVPQGG +SPLLSNI Sbjct: 213 GRVISLIHKYLNAGVIVKHKFEETTKGVPQGGPLSPLLSNI 253 >UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU Length = 546 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 89/346 (25%), Positives = 143/346 (41%), Gaps = 55/346 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q K+ ++ + + + I + ++ ++T + G T GVD ++ Sbjct: 19 LQCKIFKFSKEGDMNSVFLIQKQIIKHDFSKFLAVRKVTQDNLGKRTAGVDRISNLTPDE 78 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R+ + I D + RRV I K NGK R LGIP +RDR Q + A+EP + Sbjct: 79 RMELVQNIQIDN------KSDKIRRVTILKPNGKERHLGIPTIRDRAKQCLVKFALEPQY 132 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP RS + A + + C + + V++ D+ FD + H L+ + Sbjct: 133 EAIFEPNSYGFRPGRSSNDA----RQAIVKCLQQLPKHVLDADIERCFDNIDHSKLIHGI 188 Query: 179 RR-RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + L I G + G PQGG+ISPLL+NI L+ ++ + Sbjct: 189 NTFPLLREQVRAWLKACILTGFKENIKEVIPEAGTPQGGIISPLLANIALHGMEKAV--- 245 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 K V RYADDF+++ K EA + Sbjct: 246 ------------------------------CKSGVYLIRYADDFLILCNEEKELSEA-KN 274 Query: 298 
ECRGVLEGSLKLRLNMDKTKIPHV-------NDGFIFLGHRLIRKR 336 + L+ L L+L+ KTKI + G FLG + + Sbjct: 275 KIEIFLQN-LGLKLSESKTKITYTGSSEYSRTKGVDFLGFNFVNYK 319 >UniRef50_A6P1G1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P1G1_9BACE Length = 320 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 71/199 (35%), Positives = 111/199 (55%), Gaps = 8/199 (4%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 A + S + +RL R + P++ A + + G T G DG V + L + Sbjct: 8 ACNQSYKYERLYRNLYNPQFYLLAYQRIQAKPGNMTAGTDGKTI---DGMGMVRVNALIE 64 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 ++ YQP PARR YIPKSNGK+RPLGIP+ D+++Q + + +E I+E F S+GF Sbjct: 65 KMRDFSYQPNPARRTYIPKSNGKMRPLGIPSFDDKLIQEVVRLILESIYEPTFSDHSHGF 124 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R +S H A++ V+ T +W +EGD+ FD V H +L+ +R+RI+D F+ Sbjct: 125 RMNKSCHTALKYVQKYF-----TGTKWFVEGDIRGCFDNVDHHVLIAILRKRIADEHFIG 179 Query: 190 LLWKTIKAGHIDVGLFRAA 208 LLWK +KAG+++ + Sbjct: 180 LLWKFLKAGYMEDWNYHKR 198 >UniRef50_D1RME6 Reverse transcriptase family protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RME6_LEGLO Length = 444 Score = 211 bits (536), Expect = 4e-53, Method: Composition-based stats. 
Identities = 96/381 (25%), Positives = 153/381 (40%), Gaps = 52/381 (13%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 ++ A + E L E + G+ G+DGV K + +L L Sbjct: 18 ISKRARLQKDTVFNN-IGHALDTELLRECYQEL---DGSKAIGIDGVTKEVYGKKLEDNL 73 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 Q L + Y P +R V IPK +G RPL I D+IVQ A+ + I+E F Sbjct: 74 QDLLARIRRHAYTPQASRLVEIPKEDGSTRPLAISCFEDKIVQMAVTKLLTAIYEPLFLP 133 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYG+R ++ H A+R + + E R +E DL YF+T+ H L++ + ++I+D Sbjct: 134 CSYGYREGKNGHEALRAL---MKYSNEFRKGATLEIDLRKYFNTIPHGKLLEILEKKITD 190 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKA 243 RF+ L+ K I++ + G G PQG +ISP+LSNI L+ D + E Sbjct: 191 RRFLKLIRKLIRSPVVANGKAELNELGCPQGSIISPILSNIYLHSVVDSWFDEI------ 244 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 A R+ADD V + + ++ E + L Sbjct: 245 --------------------SKSHLIGKTAMVRFADDMVFLFQRSE-DAEKFYKVLPKRL 283 Query: 304 EGSLKLRLNMDKTKIPHVNDG--------------FIFLGHRLIRKRSRYGEMRVVSTIP 349 E L+L++DK+ + + FLG +S G+ + Sbjct: 284 E-KYGLQLHVDKSSLLKSGSKEAEEADTRGERLQTYKFLGFTCYWGKSLDGKSWRLKFKS 342 Query: 350 QEKARNFAASLTALLWKVRIS 370 + F A L L ++ S Sbjct: 343 RSD--RFTAKLRGLREYLKKS 361 >UniRef50_Q35063 CoxI intron1 ORF n=2 Tax=Eukaryota RepID=Q35063_MARPO Length = 902 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. 
Identities = 102/407 (25%), Positives = 162/407 (39%), Gaps = 63/407 (15%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTP--GVDGVN------K 53 +RKL T + S + + + + T P +L A S TP G + N Sbjct: 179 RRKLETLKRNEKSGKFENIYSICTDPNFLIAAYEQIKSHTSNMTPEGGEERENLFLRQVA 238 Query: 54 TMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNG--KLRPLGIPALRDRIVQRAML 111 + L++ + + L S ++ ARR+ IP N + RPL I + D IVQ+AM Sbjct: 239 SPLESLDRAWFERTAELLRSEQFRFKLARRIMIPTPNKPREFRPLTIGS--DNIVQQAMK 296 Query: 112 MAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHH 171 + ME I+E F S+GFRP R H + + L+ T W +E D+ F+++ Sbjct: 297 IVMEHIYEPKFLDTSHGFRPGRGCHSGLEQICLKW-----TGASWFLEFDIKRCFNSMDR 351 Query: 172 RLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPL---------- 221 L+ +++ I D R+M L+ K AG + L QG V+SP Sbjct: 352 HKLVFILQKDIEDQRWMDLVHKLFTAGLVGGELGGPDPL---QGSVLSPWSSPPWALAPL 408 Query: 222 LSNIMLNEFDQYL----HERYLSGKARKDRW-----------------------YWNNSI 254 NI L++ DQ + +E S K R D+ Sbjct: 409 FCNIYLHDLDQEVAKMANELSRSRKRRVDKRTTAATRTPRTKAFRALTPQAEIMRVRRKA 468 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 RG S R++ + A Y RYA +F+L + G + V ++ + L L L Sbjct: 469 ARGLSPTDRKDPNYARAF-YVRYAGNFLLGIAGPRELVATVKSRIVQFVNSELHLELTGG 527 Query: 315 KTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360 I H++ + FLG + S ++R EK R + Sbjct: 528 --SISHISAESVKFLGMEIKVVPS--SKLRRRFGKAMEKRRRVRNRI 570 >UniRef50_C6I8L1 CRISPR-associated protein n=1 Tax=Bacteroides sp. 3_2_5 RepID=C6I8L1_9BACE Length = 756 Score = 206 bits (523), Expect = 1e-51, Method: Composition-based stats. 
Identities = 98/361 (27%), Positives = 161/361 (44%), Gaps = 56/361 (15%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 IT L A R + A G+DG + + RL L L+ EL+S + P P Sbjct: 5 YHSITTLHALQNAWRAVRAKNAAG--GIDGFTLSHFEKRLNDNLIELQHELISQTWNPEP 62 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 R+ I K+ + R LG+ ++D+IVQ+A+ A+EP E F LSYG+RP + AI+ Sbjct: 63 YLRIEITKNETEKRKLGLLCIKDKIVQQAIKTAIEPQLEKTFLNLSYGYRPNKGPERAIK 122 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 V L + + +V + D+ +YFDT++H L + + D + L+ I+ G + Sbjct: 123 RVVHDLK---KLKSGYVAKLDIDNYFDTINHERLFTRLANWLKDDETLRLIRLCIQTGIV 179 Query: 201 DVGL-FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 L ++ ++GVPQG ++SPLL+N L+ FDQ+ + Sbjct: 180 TPQLQWQEINKGVPQGAILSPLLANFYLHPFDQFAANKVP-------------------- 219 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 Y RYADDF++ K EA+ E + LE L+LN T I Sbjct: 220 -------------MYIRYADDFLIATSTEKQIKEAV-ELVKEELESQFYLQLN---TPII 262 Query: 320 H-VNDGFIFLGHRL---------IRKRS---RYGEMRVVSTIPQEKARNFAASLTALLWK 366 H +DG FLG + +K++ R ++ + + +++ + K Sbjct: 263 HNFHDGIEFLGITISDTGLSITEKKKKTLQERINSIKFIKSSLSSQSKETLQGIKNYYAK 322 Query: 367 V 367 + Sbjct: 323 L 323 >UniRef50_UPI0001C15D3C hypothetical protein CRC_00192 n=2 Tax=Nostocaceae RepID=UPI0001C15D3C Length = 566 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. 
Identities = 95/386 (24%), Positives = 166/386 (43%), Gaps = 58/386 (15%) Query: 1 MQRKLATWAATDPS---LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 +Q K+ + T ++ Q++L + L ++T +KG T DG + L Sbjct: 24 LQSKIYQASLTGDKSSVVKYQKILINSYSAK-LLAVKKVTQDNKGKKTA--DGDQRIDLA 80 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 L ++ + + P R+ IPKSNG+ R LGI + DR Q MA+EP Sbjct: 81 KNLELDGKAI------------PLTRMEIPKSNGESRNLGISKMEDRAKQALAKMALEPE 128 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F +YGFRP RS H AI ++ Q+ R +V+ D+S FD V H +++ Sbjct: 129 WEAKFEPNNYGFRPGRSCHDAISAIESQVRR----RTSYVLSVDISGCFDKVKHEAIVEK 184 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + +K+G + +F +G P GVISPLL+NI L+ + ++ + Sbjct: 185 CN---TFPIMERQIRAWLKSGVMIGEVFHPLEKGEPVEGVISPLLANIALHGLETHISHK 241 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + + S +G+ + E R A DF+++ + ++A++ Sbjct: 242 F---------PSVSPSEAQGKVGEIGEA-------RLIRCAHDFLVL-HWEEKTIKAVKT 284 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND-------GFIFLGHRLIRKRSRYGEMRVVSTIPQ 350 E L G + L LN K ++ H + G FLG + R G+ + + Sbjct: 285 EVETWL-GEIGLNLNQQKIRMCHTMEEYNGEKPGLDFLGFNIRTYRI--GKYKSNEKVNG 341 Query: 351 E------KARNFAASLTALLWKVRIS 370 E K + S+ + ++ + Sbjct: 342 ESPGMLTKIKPSEKSVERFMTDIKET 367 >UniRef50_UPI0001C388AF RNA-directed DNA polymerase n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C388AF Length = 498 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. 
Identities = 90/377 (23%), Positives = 159/377 (42%), Gaps = 49/377 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP-EWLAEAARITLSSKGA-HTPGVDGVNKTMLQA 58 +Q+++ + +++L RL+T + A R + G+ T GVDGV + Q Sbjct: 25 LQKRIYRASKRGDVKAVRKLQRLLTNSRDAKILAVREVIPENGSQKTAGVDGVKRLRNQE 84 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 +L L + L G + RRV IP+ + R +GI + ++ Q + +A+EP Sbjct: 85 KLD-----LANCLKLGR-KTQGLRRVSIPEPGRDEKRAVGILMMMEKAKQGLVKLALEPE 138 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS AI + + + ++V++ + F+ ++H+ L+ Sbjct: 139 WEARFDRNSYGFRPRRSAQDAIAAIF----NGMKEDHKYVLDAHIEKCFEGIYHQKLLAK 194 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + R + +K+G +D PQGG++ PLL+NI L+ + L ++ Sbjct: 195 LNTYPTLRR---EIKAWLKSGVMDGKELFPTETDTPQGGLM-PLLANIALDGLESLLEDK 250 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + G A RYADD V++ G + A +E Sbjct: 251 FQGGVANC----------------------GNGKATVVRYADDLVVL-DGELEVILAAKE 287 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQE 351 L + L+L T+I H N G+ FLG+ + + +R R Q+ Sbjct: 288 TIEAWLM-EMGLKLKDGNTRISHTFIEHEGNIGWDFLGYNIRQYPTREKRSRATGNTEQK 346 Query: 352 KARNFAASLTALLWKVR 368 R F + ++ Sbjct: 347 --RGFQTIIKPSQGAIK 361 >UniRef50_P19593 Probable reverse transcriptase n=2 Tax=Scenedesmus obliquus RepID=RDPO_SCEOB Length = 608 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. 
Identities = 94/375 (25%), Positives = 170/375 (45%), Gaps = 41/375 (10%) Query: 2 QRKLATWAATDPSLRIQRL-LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q +A + +++L ++ A A + SSKG+ +PG+ + + + Sbjct: 39 QESIACAKREGNIVLVEKLAQEIVNSSFGRAVAVQTVASSKGSRSPGLSRESFKTNKNYV 98 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 A+ + + Y+ P R+YIPK +G RPL IP+ DR +Q +A+EP+ E Sbjct: 99 AMMATLEQITSNPHKYKATPLSRIYIPKRDGSARPLSIPSYTDRCLQALYKLAIEPMAEE 158 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 SYGFRP R+V A+ V L + ++V+E D+ D ++H+ + Sbjct: 159 VADLSSYGFRPMRNVSWAVGRVLNGLNN-PLANYQYVVEIDIKGCVDNINHQFI-----S 212 Query: 181 RISDARFMTLLWKTIKAGHID--VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +++ +LW +K G+I+ + + GVPQGG+ISPL+ N+ L+ + +++++ Sbjct: 213 QVTPFIPKKILWAWLKCGYIERNSNTLQPTTTGVPQGGIISPLIMNLTLDGLEFHIYKK- 271 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 ++++ YCRYADD V++ + + A+ Sbjct: 272 -----------------------IQKSSSQSKGNTYCRYADDMVILTTTEETALIAL-PA 307 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGHRLIRKRSRYGEMRVVST--IPQEKA 353 + L L + + KT I ++ +GF FL R RK R R+ S IP Sbjct: 308 VKEFLAVR-GLEVKLAKTTIKNIINDRNGFEFLSFRF-RKVYRRNRKRLTSQVGIPISAI 365 Query: 354 RNFAASLTALLWKVR 368 +NF ++ A+ + Sbjct: 366 KNFRKNIKAISKTRK 380 >UniRef50_Q1Q3I7 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3I7_9BACT Length = 316 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. 
Identities = 89/336 (26%), Positives = 152/336 (45%), Gaps = 48/336 (14%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 + L A + G G DGV + L + L+I+R EL Y PL Sbjct: 3 MKEFALNLSALYSAFDAVKENHGC--AGADGVTIERYEGNLDLNLRIMRKELTEQTYFPL 60 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P R+ + K NG+ R L IP++RDRIVQ A+L +EP+ E +F S+ +R RSV A+ Sbjct: 61 PLLRILVDKGNGEARALCIPSVRDRIVQAAVLQLIEPVLEKEFEECSFAYRKGRSVKQAV 120 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V+ + E +WV++ D+ ++FD+V + LL+ + I D L+ +K Sbjct: 121 YKVR----EYYEQGYQWVVDADIDAFFDSVDYSLLLLKFKCYIHDPCIQNLVGLWLKGEV 176 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 D +G+PQG ISP+L+N+ L+EFD+ L Sbjct: 177 WDGKTVTTLKKGIPQGSPISPILANLYLDEFDEEL------------------------- 211 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + R++DDF+++ K + E+++ + + + L+ +D+ ++ Sbjct: 212 --------TRNGYKLVRFSDDFIILCKNSGMAKESLKLTKKILEKLLLE----LDEEQVI 259 Query: 320 HVNDGFIFLGHRLIRK-----RSRYGEMRVVSTIPQ 350 + + GF FLG ++ R + R V P+ Sbjct: 260 NFDQGFKFLGVIFVKSMIMVPFDRPKKERKVLFFPK 295 >UniRef50_Q67M30 Group II intron-encoding maturase n=1 Tax=Symbiobacterium thermophilum RepID=Q67M30_SYMTH Length = 248 Score = 201 bits (510), Expect = 4e-50, Method: Composition-based stats. 
Identities = 68/172 (39%), Positives = 100/172 (58%), Gaps = 6/172 (3%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 + + E + A + + GA PGVDGV L+ ++ VE + +R+ELL G Y+P P Sbjct: 1 MEQVVARENMLAALKRVERNGGA--PGVDGVPTERLRDQIRVEWERIREELLRGTYRPQP 58 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 RRV IPK G R LGIP + DR++Q+A+L + PI++ F SYGFRP R H A+R Sbjct: 59 VRRVEIPKPGGGKRMLGIPTVMDRLIQQALLQVLTPIFDPTFSESSYGFRPGRRGHDAVR 118 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 + + E WV++ DL +FD V+H +LM V RR++D R + L+ Sbjct: 119 KARQYV----EEGYDWVVDMDLEKFFDRVNHDVLMARVARRVTDKRVLRLIR 166 >UniRef50_B3CUR8 Reverse transcriptase n=24 Tax=Orientia tsutsugamushi RepID=B3CUR8_ORITI Length = 379 Score = 199 bits (507), Expect = 1e-49, Method: Composition-based stats. Identities = 87/379 (22%), Positives = 158/379 (41%), Gaps = 50/379 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++ ++ + ++ L +I + L E + S K G+DG+ K +L Sbjct: 17 RIKLLSSKNQDIKFNNLGHII-DLKMLEEQYKELDSKK---AIGIDGITKEDYGKKLKAN 72 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 L L + YQ PAR V IPK +G RPL I D+I++ A+ + ++E F Sbjct: 73 LLSLLTRIRKWQYQAKPARIVKIPKEDGGKRPLVISCFEDKIIESAVSKILNSVFEPIFL 132 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGF P+ + H A+R + + + ++E D++ F+T+ H LM+ +R+RIS Sbjct: 133 KYSYGFGPKLNAHDALRELNRLTYNFNKG---AIVEIDITKCFNTIKHCELMEFLRKRIS 189 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D +F+ L+ K I+ I+ EG QG ++SP+L+N+ L+ Y+ + + + + Sbjct: 190 DKKFLRLVMKLIETPIIENDTIVTNKEGCRQGSIVSPILANVFLH----YVIDSWFAKIS 245 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 ++ RY DD V + + ++A + + L Sbjct: 246 EEN---------------------LIGQTGMVRYCDDMVFVFE-SEADAKRFYDVLPKRL 283 Query: 304 EGSLKLRLNMDKTKIPHVND--------------GFIFLGHRLIRKRSRYGEMRVVSTIP 349 L +N K+++ + FLG +SR+G + Sbjct: 284 
-NKYGLNINEAKSQMIKSGRDHAANLAKQGKKIASYNFLGFTCYWSKSRFGTTWRLKYTS 342 Query: 350 QEKARNFAASLTALLWKVR 368 + F L L +R Sbjct: 343 RRD--RFTEKLKGLRKYLR 359 >UniRef50_B5W904 Group II intron maturase-specific domain protein n=1 Tax=Arthrospira maxima CS-328 RepID=B5W904_SPIMA Length = 502 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 86/393 (21%), Positives = 162/393 (41%), Gaps = 59/393 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQ--PEWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ + ++RL RL+T + + + GVDGVN+ Q Sbjct: 25 LQKRIYRASERGDVKAVRRLQRLLTNAMDAKILAVRELIQDNGTEKRAGVDGVNRLRNQE 84 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRP-LGIPALRDRIVQRAMLMAMEPI 117 +L L + L G + R+V IP+ + +P GI + ++ Q + +A+EP Sbjct: 85 KLD-----LANCLKLGK-KTQTRRQVSIPEPGKEEKPAFGILMMMEKAKQGLVKLALEPE 138 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS AI + + + + +V++ + + ++H+ L+ Sbjct: 139 WEAKFDRNSYGFRPGRSAQDAIAAIFNGIKEDHK----YVLDAHIEKCCEGIYHQKLLAK 194 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + R + +K+G +D PQGG++ PLL+NI L+ + L + Sbjct: 195 LNTY---PRLRRPIKAWLKSGVMDGKELFPTETDTPQGGLM-PLLANIALDGLESLLEDT 250 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + G A + Q G++T RYADD V++ + + A +E Sbjct: 251 FKGGVA---------NCQNGKAT-------------VVRYADDLVVLDEELAVILTA-QE 287 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV------NDGFIFLGHRLIRKRSRYGE---------- 341 L G + L+L T+I H N G+ FLG+ + + + Sbjct: 288 TIEAWL-GEMGLKLKDSNTRITHTFTEHNGNLGWDFLGYNIRQYPTSKKRSLATGNTEQK 346 Query: 342 --MRVVSTIPQEKARNFAASLTALLWKVRISGE 372 + + +E + + ++ + S + Sbjct: 347 IGCQTIIKPSKEAIKRHLQKIDEIIGSHKNSSQ 379 >UniRef50_Q7YAJ6 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ6_CHAVU Length = 550 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. 
Identities = 89/358 (24%), Positives = 160/358 (44%), Gaps = 60/358 (16%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPG--VDGVNKTMLQARLAVELQILRDELLS 73 + L+ +++ +L + + G TPG +DG+ L +L + Sbjct: 138 KYCNLIEVVSDVSFLIYCYELIRGNPGNMTPGATLDGLTINWFS--------KLSQQLQA 189 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES-DF------HTLS 126 G ++P A + + R++IVQ+A+ + ++ I++ +F S Sbjct: 190 GKFEPRFA----------------LISPREKIVQKALTVVLDSIYDPAEFRIYDPALDCS 233 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 + + R AI V WVIEGD++ ++ H++++ + I+ + Sbjct: 234 HAPKEARGAKRAIHKVDRTFKSA-----TWVIEGDITKCSASLPHKVILGILEEEIACRK 288 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVI--SPLLSNIMLNEFDQYLHERYLSGKAR 244 FM+L+ K++ G++D R PQ +PLL NI L++ D+Y+ Y + K + Sbjct: 289 FMSLVRKSLSVGYVDEKGKRHHPNRPPQALTFLRAPLLCNITLHQLDKYI---YETLKEQ 345 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 D+ + + +R ++Y RYADDFV+ + G K IR+ L Sbjct: 346 YDKHEVDPNFRR---------------LSYVRYADDFVIGITGPKTDAIEIRDLISTFL- 389 Query: 305 GSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 +L L LN +KTKI H++ GF FLG ++ R R RY E+R + R ++L Sbjct: 390 STLGLELNKEKTKISHIDSGFFFLGTQISRGRRRY-EVRSPRLVLHAPIRKLLSTLRE 446 >UniRef50_A6TR85 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=A6TR85_ALKMQ Length = 360 Score = 193 bits (491), Expect = 8e-48, Method: Composition-based stats. 
Identities = 78/335 (23%), Positives = 128/335 (38%), Gaps = 54/335 (16%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R L I E L A K A L L +++EL+ Sbjct: 2 KRYGYLYEQIYDFENLYFAYLEARKDK------RFRDEILKFSANLEENLIQIQNELIWK 55 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+ R Y+ + K R + +DR+VQ A+ + P++E + SY R R Sbjct: 56 AYKVGRYREFYVHEP--KKRLIMALPFKDRVVQWAIYRVLNPLFEKTYTEHSYACRIGRG 113 Query: 135 VHHAIRTVKLQLTDCGET-RGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H A + ++ L + + ++ D+S YF V H + +K +R++I D + L+ + Sbjct: 114 THQAAKKLQYWLRQIDRKPQKYYYLKMDISKYFYRVDHSIALKILRKKIKDKDVLWLMEE 173 Query: 194 TIKAGHIDVGL------------FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 I++ + GL R +G+P G + S LL+NI LNE DQ+ + Sbjct: 174 IIQSEDMAFGLPLGMEPGDCPKYMRLHDKGMPIGNLTSQLLANIYLNELDQFCKHKLQIK 233 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 + RY DDF+++ K + ++ E Sbjct: 234 -------------------------------YFIRYMDDFIVL-HHDKKYLHRLKVEIEN 261 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR 336 L L+L LN KT I G F+G R+ Sbjct: 262 FLNSELELHLNR-KTCIRPTPVGIEFVGFRIWPTH 295 >UniRef50_A4WS58 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WS58_RHOS5 Length = 366 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 84/344 (24%), Positives = 143/344 (41%), Gaps = 54/344 (15%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 L I E L EA +SKG H + +A L L +++ L+ Y Sbjct: 5 YNHLWPQIIAFETLVEAW--ARTSKGRHR----QRDVIAFEADLEPNLFAIQESLIQKTY 58 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 + P R ++ + K R + L+DR+VQ A++ +EPI+E+ F S+ R + H Sbjct: 59 RTGPYHRFFVYEP--KKREIASLPLKDRVVQHALVSVIEPIFEARFIDQSFACRVGKGAH 116 Query: 137 HAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 TV+ + + +G+ + ++ D+S YF +V H L + +RRRI+ + L+ + Sbjct: 117 KGADTVQRYMREVLREQGQVFALKADISKYFPSVCHDALRRIIRRRIACPDTLWLIDSIL 176 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 ++ L G+P G + S + +NI L+E D ++ + Sbjct: 177 ESSAEPGAL---TPRGIPIGNLTSQMFANIYLHELDHFVKHTLRERR------------- 220 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 Y RY DDF +I KA + +R C L L LR N K Sbjct: 221 ------------------YVRYMDDFAVI-HHDKAHLHEVRRACEDFLWAELGLRTNA-K 260 Query: 316 TKIPHVNDG---FIFLGHRL------IRKRSRYGEMRVVSTIPQ 350 T++ + + FLG+R+ +RK S R + + Sbjct: 261 TQVFPIGEPGRALDFLGYRIWPTHRALRKDSVNRMKRKMRRMAS 304 >UniRef50_B6IMH7 Phage-encoded reverse transcriptase, putative n=2 Tax=Proteobacteria RepID=B6IMH7_RHOCS Length = 356 Score = 190 bits (483), Expect = 6e-47, Method: Composition-based stats. 
Identities = 92/353 (26%), Positives = 146/353 (41%), Gaps = 54/353 (15%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 L +T E L A + KG D V L +L L+ ++++G ++P Sbjct: 7 GLWDSVTAFENLYGAY--LAARKGKRYS--DEV--LEFGFGLEEKLFDLQGQMVNGVWRP 60 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 R + + K R + P DR+V A++ +EP+ E F SY R R VH A Sbjct: 61 GRPREFMV--RDPKPRLISAPPFADRVVHHAVVRVIEPVLERRFIFDSYACRKGRGVHTA 118 Query: 139 IRTVKLQLTDC-GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 + ++ L + E WV++ D+S YF +++H LM + R ISD + + L +K Sbjct: 119 VDRLQRHLREASCEGGKVWVLKADISKYFASINHGRLMAILGRSISDKKVLWLCRTNLKG 178 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 D G+ G+P G + S L +NI L++ D ++ + + Sbjct: 179 YGFDEGV------GIPVGALTSQLFANIYLDQLDHWIKDELGIKR--------------- 217 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 Y RY DDFV+ V +KA + A+ + L L LRLN KT Sbjct: 218 ----------------YVRYMDDFVI-VGHSKADLWALYDAIADFLATKLALRLNR-KTT 259 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS 370 + + G F G+R + V +KAR L AL + +S Sbjct: 260 VLPASGGIDFCGYRTWTTHLLPRKRNV------KKARATFRELAALYRRGEVS 306 >UniRef50_C0FSR2 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSR2_9FIRM Length = 273 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. 
Identities = 75/315 (23%), Positives = 129/315 (40%), Gaps = 45/315 (14%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 +L + E L EA +H G+DGV + L+A + +++ + +G Y+ Sbjct: 2 VLEDVFSDENLEEAFESFADKHDSH--GLDGVKLSELRAYWETNGKKIKESIFNGTYKVG 59 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 + I GK R + + DR + RA+ M WE F SY ++ + V A+ Sbjct: 60 AVEQRQIVNRKGKKRTISLMNSIDRFIFRALYQKMASEWEKQFSQYSYAYQNNKGVLTAV 119 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + E W +E D+ ++FD ++H +++ ++ I D R + LL + Sbjct: 120 EQAAKYM----EEGKDWSVELDIQNFFDNINHSIIISKLKAGIEDVRVLDLLIAYLTCTL 175 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 +D +F +GV QGG +SPLL+N+ +NE D Y+ Sbjct: 176 LDDHVFHQMEQGVLQGGPLSPLLANVYMNELDHYME------------------------ 211 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 K ++CR+ DD + T + + +E +L LN KT I Sbjct: 212 ---------KQGYSFCRFGDDINIYC-STYEEATVAFSDVTARMEKIEQLPLNHGKTGIF 261 Query: 320 HVNDGFI--FLGHRL 332 G +LG+R Sbjct: 262 ---KGINRKYLGYRF 273 >UniRef50_D2LKK7 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LKK7_RHOVA Length = 365 Score = 189 bits (481), Expect = 9e-47, Method: Composition-based stats. 
Identities = 87/366 (23%), Positives = 131/366 (35%), Gaps = 59/366 (16%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R + L I + L AAR + + PG A L E+ L EL G Sbjct: 3 KRHEGLFERIASFKALRAAARTAI-NGKRKKPG-----AAAFMANLEREILRLERELRDG 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P R V I + K R + RDR+V A+ + P++E+ F ++ R + Sbjct: 57 SYRPG--RYVEILVKDPKERLISAAPFRDRVVHHALCAVVCPLFEAGFTDHTFANRTGKG 114 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H AIR L + +V+ D+ YF + H +L RR+I+ R + L+ Sbjct: 115 THKAIR-----LYERYRDNHSYVLRADIFRYFPAIDHEILKAEFRRKIACERTLWLMDLI 169 Query: 195 -----------IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + D+ G+P G + S +N+ LN FD ++ E+ Sbjct: 170 VDCSNSQEPVELHFPGDDLFTPYTRRRGLPIGNLTSQFFANLYLNRFDHWVIEKL----- 224 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 Y RY DDF L RE+ L Sbjct: 225 ---------------------------GAPYVRYVDDFALFHDDPGILA-TWREKIERCL 256 Query: 304 EGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGE-MRVVSTIPQEKARNFAASLTA 362 EG +L+L+ KT I V + FLG L R + R + F L Sbjct: 257 EGR-RLKLHPRKTLILPVAEPSPFLGFELHPGPRRTAKGGRGRRKLLDGNVARFRNRLRG 315 Query: 363 LLWKVR 368 L + R Sbjct: 316 LRDRWR 321 >UniRef50_B8GRZ4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GRZ4_THISH Length = 377 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. 
Identities = 87/359 (24%), Positives = 142/359 (39%), Gaps = 56/359 (15%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 R +RL+ I + L EA R+ +G D +A L EL L+ E+L Sbjct: 32 GKRHKRLIEAIVDWDNLQEAHRLAR--RGKR----DRHEVATFEANLWEELGALQMEMLW 85 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G YQP R + + K R + RDR+ Q A+ PIW++ SY RP + Sbjct: 86 GSYQPGRYRSFLVYEP--KRREILAAPYRDRVAQHAICTLCGPIWDAAMIDDSYACRPGK 143 Query: 134 SVHHAIRTVKLQLTDCGETRG-RWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 H V+ L G WV++ D+S YF ++ H L VR +IS + L+ Sbjct: 144 GTHVGATRVEQWLRGMTAAGGAVWVVKMDVSKYFASIRHDLAKAVVRDKISCPATLQLID 203 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 I + G+P G ++S ++N++ N DQ+ + Sbjct: 204 AIIDST---ADPADPDPVGIPVGNLLSQWIANLVGNRIDQWAKRELRLKR---------- 250 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 Y RY DD V++V+ TK + IR++ L S+ +R + Sbjct: 251 ---------------------YARYMDDMVVLVR-TKQEALTIRDQFDDKL-ASMGMRFS 287 Query: 313 MDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 K + + G FLG+R+ + + ++ R +L A+ W+ G Sbjct: 288 --KASVLPASRGVNFLGYRIWAHK---------RLLRRDSVRRIKRNLKAMRWQYARGG 335 >UniRef50_B6FPD0 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FPD0_9CLOT Length = 252 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. 
Identities = 67/206 (32%), Positives = 108/206 (52%), Gaps = 7/206 (3%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR-LAVELQILRDEL 71 + RL+ LI + + A R + G+ TPG DG T +Q+ + ++ +R++L Sbjct: 44 KGNKFDRLMPLIVSEQNIILAYRNICKNNGSKTPGTDGKTITEIQSLPIETVIKTVRNKL 103 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 YQP RRV IPK NGK RPLGIP++ DR++Q+ +L +EPI E+ FH + GFRP Sbjct: 104 NF--YQPKKVRRVEIPKDNGKTRPLGIPSIWDRLIQQCILQILEPICEAKFHERNNGFRP 161 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRISDARFMTL 190 RS +AI +V++ D+ +FD + H L++ + I D + + + Sbjct: 162 YRSTQNAIAQCYKM---AQLQNLHFVVDVDIVGFFDNIDHNKLIRQLWGLGIQDRKLIMI 218 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGG 216 + + +KA + + G PQGG Sbjct: 219 IKQMLKAEILFNDIVITPETGTPQGG 244 >UniRef50_B0VI85 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VI85_9BACT Length = 343 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. Identities = 74/340 (21%), Positives = 136/340 (40%), Gaps = 47/340 (13%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R+ L +T + L A + K + L L+ EL++G Sbjct: 3 KRVGYLWEKLTSWQNLYLAYKNACKHKKSKYE------TAEWMFYCEKNLWELQKELING 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +Y+P P R I + K R + + RDR+V +++ +EP +ES F SY R + Sbjct: 57 NYRPQPYRYFTIKEP--KERLISVAVFRDRLVHHSLINVIEPYFESIFIKDSYATRKGKG 114 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 +H A+ V+ + W ++ D+ +F+ + H +L+K + +I D + L Sbjct: 115 LHLAVLAVQKY-----SRQYPWFLKLDIEKFFNNIDHNILLKLISSKIKDPMIINLCSII 169 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +K ++ + G+P G + S +NI LN+ D Y+ + Sbjct: 170 LKNQNLSMN--HNEEIGLPVGNLTSQFFANIYLNQLDHYIKQNLGYK------------- 214 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 Y RY DDF ++ K ++++ + L LKL++ Sbjct: 215 ------------------GYVRYMDDF-ILFSENKDKLKSDLLLIKYFLSNILKLKIKDK 255 
Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 ++ VN G FLG+R+ K R + + + + R Sbjct: 256 SIQMNKVNQGIPFLGYRVFPKLIRVSNINLKRCLQNMQKR 295 >UniRef50_B1I2T7 Retron-type reverse transcriptase-like protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I2T7_DESAP Length = 309 Score = 187 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 65/173 (37%), Positives = 97/173 (56%), Gaps = 5/173 (2%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ + + E L +A R ++ GA PGVDG L L L EL +G Sbjct: 2 KKWYSLIDKVYRLENLIQAYRAVRANNGA--PGVDGETVEAFGRNLDERLDQLHHELKTG 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P RV IPK +G RPLGIP +RDR+VQ+A+L ++PI++ DFH SYG+R RS Sbjct: 60 TYEPQPVLRVEIPKPDGSNRPLGIPTVRDRVVQQALLNILQPIFDPDFHPSSYGYRLGRS 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 H A+ + + G V++ DLS FD ++H L+++ + R++SD Sbjct: 120 CHQAVAKAERFMNRYGLEH---VVDMDLSKCFDRLNHELILEGINRKVSDGSV 169 >UniRef50_A1RKU4 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=root RepID=A1RKU4_SHESW Length = 424 Score = 187 bits (474), Expect = 7e-46, Method: Composition-based stats. 
Identities = 80/361 (22%), Positives = 139/361 (38%), Gaps = 55/361 (15%) Query: 20 LLRLITQPEWLA-EAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 L I Q E L A + ++ + + L + +++EL+ G Y Sbjct: 83 LFEQIYQFENLLNAAYQCRKGKTKSN-------STLVFFNNLEENIIQIQNELIWGMYLS 135 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P Y+ + K R + P+ RDR+V RA+ +EPI + + SY R + H Sbjct: 136 SPYHHFYVFEP--KRRLISAPSFRDRVVHRAIYNVIEPILDRQYIYDSYACRRGKGTHRG 193 Query: 139 IRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 +L + +T + + ++ D+S YF ++ H +L V +I R LL+ I + Sbjct: 194 ADRAQLFIRRVEKTHSKAYALKADISRYFSSIDHHILKSLVSAKIQCERTKCLLFYIIDS 253 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 D A G+P G + S + +N+ LNE D++ Sbjct: 254 SPSD-----AHGVGIPLGNLTSQVFANLYLNELDRFAKHTL------------------- 289 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 A Y RY DDFV+I K Q+ R + L+L+ N KT+ Sbjct: 290 ------------KAKNYVRYMDDFVII-HHDKQQLHQWRVMIERFINCQLRLKTN-SKTQ 335 Query: 318 IPHV----NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 + V FLG+R+ + + V + K + F +A ++ + Sbjct: 336 VFPVAASAGRSLDFLGYRIYANKKLLRKSSVKRI--KAKLKIFRKKYSAGEIDIKDINQT 393 Query: 374 L 374 + Sbjct: 394 I 394 >UniRef50_C0A8Z3 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A8Z3_9BACT Length = 358 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. 
Identities = 80/326 (24%), Positives = 133/326 (40%), Gaps = 56/326 (17%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + + L + E L AA + P G A L + LRDELL+G Sbjct: 3 RKHRHLFEKVITLENLFAAAENASRGRSGKVPVARGF------AELEKTVVTLRDELLAG 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +QP R Y ++ K R + RDR+V A++ +EPI+E F S+ RP + Sbjct: 57 TWQPG--RYYYFTITDPKEREVAAAPFRDRVVHHALVRVLEPIFEPRFIADSFACRPGKG 114 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + R R+ ++ D+ YF + H LL++ V R + DAR + L+ + Sbjct: 115 THAALARARE-----FTRRHRYCLKCDIKKYFPNIDHALLLREVGRAVDDARVLELIGRI 169 Query: 195 IKA--------GHIDVGLF--RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 + + GLF G+P G + S L+N+ L+ D ++ + Sbjct: 170 LASHADGAAQEWRAGAGLFDVEQRPRGLPIGNLTSQFLANVHLHPLDLFVKQTLRVK--- 226 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 Y RY DDF L+ +A ++A + R + Sbjct: 227 ----------------------------GYVRYVDDF-LLFGDDRAALKAHGQRVREFVR 257 Query: 305 GSLKLRLNMDKTKIPHVNDGFIFLGH 330 +L+LR++ DK ++ G F+G Sbjct: 258 -TLRLRVHPDKFRLSRTEQGVDFVGF 282 >UniRef50_C8PKY6 Putative CRISPR-associated protein Cas1 n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PKY6_9PROT Length = 731 Score = 184 bits (466), Expect = 6e-45, Method: Composition-based stats. 
Identities = 86/350 (24%), Positives = 147/350 (42%), Gaps = 43/350 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 + L + + A A L G DG++ + L+ E+ S Y P Sbjct: 1 MFDLSLEDIFTAGAFEYALKRLKRTALGFDGLSADDI--CSGEFYAELKSEIFSLSYSPQ 58 Query: 80 PARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P +R +IPK +LR L +P+L+D+ VQ + + ++ F SY +R +S +A Sbjct: 59 PLKRAFIPKEAKDELRKLAVPSLKDKFVQNILTRELSGYFDKSFSNRSYAYRNGKSYANA 118 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I + + ++ D+ +F+ + H L++ +R I DAR + L+ IK G Sbjct: 119 IYRARDFF-----QIFSFAVKTDIKDFFENIDHEKLLEILRANIRDARIIRLIELWIKNG 173 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + +RA ++GV QG V+SPLLSNI LN+ D++L Sbjct: 174 IFERFDYRAHTKGVHQGDVLSPLLSNIYLNQMDKFLE----------------------- 210 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 V + RYADDFV+ +A E + L+ ++ L LN KT I Sbjct: 211 ----------NSGVEFVRYADDFVMFFASYEA-AEMRLARLKDFLK-TISLSLNEAKTSI 258 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 + F+FLG GE + + + + +++ + + Sbjct: 259 HGKDSEFVFLGVSFKGTNLSIGEEKFKRILAKLASSAKKQAISQSVENLN 308 >UniRef50_C9L4G0 Reverse transcriptase family protein n=3 Tax=Clostridiales RepID=C9L4G0_RUMHA Length = 437 Score = 184 bits (466), Expect = 6e-45, Method: Composition-based stats. 
Identities = 88/363 (24%), Positives = 131/363 (36%), Gaps = 57/363 (15%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 LL I E L +A S K + T A L L ++++L+ Y+ Sbjct: 65 LLERIYSWENLLDAYHEAASEKWYRN------DVTAFAANLEENLISIQNDLIWHAYKVG 118 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 R+ Y+ + K R + RDR+VQ A+ + ++ SYG R + A Sbjct: 119 RYRQFYVHEP--KKRLVMALGFRDRVVQWAIYLQTNQHLDNGMIYHSYGCRVGKGTTRAA 176 Query: 140 RTVKLQLTDCGETRGRWV-IEGDLSSYFDTVHHRLLMKAVRRRISDAR-FMTLLWKTIKA 197 ++ T G W ++ D+S YF V HR+L+ +RR+ + ++ L+ I Sbjct: 177 DRLQYWCTLVDRKPGNWYYLKLDVSKYFYRVDHRVLLDILRRKFPNEDGYLWLMETIINC 236 Query: 198 GHIDVGLF------------RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 H GL R G+P G + S LL+N+ LNE DQY+ Sbjct: 237 DHTPFGLPPGKSADEIPPSERLFEVGMPIGNLTSQLLANVCLNELDQYIKHEL------- 289 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 A Y RY DD L+ A + R L Sbjct: 290 ------------------------KAHFYDRYMDDMALLYPDA-ATLNRWRAAIEKYLNE 324 Query: 306 SLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLW 365 L L LN KT I V G F+G R+ + V + + R A A L Sbjct: 325 VLHLELN-SKTTIGLVKHGITFVGCRIYPGYRKPTAQSVKKM--KARMRYIAKEYEAGLI 381 Query: 366 KVR 368 Sbjct: 382 DFD 384 >UniRef50_A5ZQ10 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZQ10_9FIRM Length = 379 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. 
Identities = 77/364 (21%), Positives = 146/364 (40%), Gaps = 56/364 (15%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 ++I+ + LI + L EA + +SKG + +Q + ++ ++ ++ SG Sbjct: 1 MKIKHVFDLIFSDDNLYEAIQ--DASKGRRY----NKDVLRVQHDIWNVIEQIQQDVRSG 54 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y Y+ + K R + RIVQ A+ + P+ + +YG P R Sbjct: 55 KYTIDKYYIFYVYEP--KKRMIMSITFYHRIVQWAIYRVINPLLVKGYIKDTYGCIPGRG 112 Query: 135 VHHAIRTVKLQLTDCGETRGRWV-IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 A++ ++ + G W ++ D+S YF + H +L + + R+I D + + +L+ Sbjct: 113 SLAAMQRLRYWIKSVEHKPGTWYYLKLDISKYFYRISHEVLKEILARKIKDQQLLQVLYN 172 Query: 194 TIKAGHIDVGLF------------RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 I + GL R G+P G ++S + +NI L+ DQ+ Sbjct: 173 IIDCQYTPFGLPPGKGPGEVPLEERLYDVGMPVGNLLSQVFANIYLDALDQFCKRTLCI- 231 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 Y RY DD ++I+ +K Q+ ++E + Sbjct: 232 ------------------------------HFYVRYMDD-IIILSDSKEQLHMWKDEIQK 260 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR--SRYGEMRVVSTIPQEKARNFAAS 359 +E +L+L LN KT I ++ G F+G+R+ R + + K + A Sbjct: 261 FVETTLRLSLNQ-KTCIRPISQGIEFVGYRIWPHYVTIRKSTTLEMKRHLRRKVEEYNAG 319 Query: 360 LTAL 363 L + Sbjct: 320 LIEM 323 >UniRef50_D2MKC4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKC4_9BACT Length = 345 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. 
Identities = 77/283 (27%), Positives = 126/283 (44%), Gaps = 47/283 (16%) Query: 101 LRDRIVQRAM-LMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIE 159 + D+IVQ AM +EPI+E +F SYGFRP RS H + + + + +V++ Sbjct: 1 MEDKIVQCAMVKCILEPIYEMEFCGFSYGFRPGRSAHDGLDALAYVIE---RRKVNYVVD 57 Query: 160 GDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 D+ +FD V LM+ +R RI D R + ++ K +KAG ++ G +R + +G PQG VIS Sbjct: 58 ADIRKFFDEVDQEWLMRFLRHRIGDERVLRIIVKFLKAGVMEDGSWRESEQGTPQGAVIS 117 Query: 220 PLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYA 278 P+L+N+ L+ D + + R++ RYA Sbjct: 118 PILANLYLHYVLDLWFKSQ-------------------------RKSRNIGGESYMVRYA 152 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP---------------HVND 323 DDFV+ + K E ++ R LE LRL+ DKT++ Sbjct: 153 DDFVVCFQH-KEAAERFLKDLRKRLE-KFGLRLHPDKTRLIEFGRFAERDRKKRGDSSPK 210 Query: 324 GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 F FLG +++R G+ + ++ F + L + Sbjct: 211 TFDFLGFTHYCRKTRKGQFGLGRKPIAKRMVRFVKRIYTKLRQ 253 >UniRef50_Q7YAJ4 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ4_CHAVU Length = 576 Score = 181 bits (458), Expect = 4e-44, Method: Composition-based stats. 
Identities = 93/398 (23%), Positives = 167/398 (41%), Gaps = 67/398 (16%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG-----VDGVNKTMLQARLAVELQILR 68 S R L+ L+ P++L S G TPG +DG+ K L Sbjct: 103 SGRYTDLIGLLADPKFLIYCYETIKSKPGNMTPGKARSALDGLTKEWF--------THLA 154 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 L G + P K+ +R I +IVQ+AM + +E I+E F S+ Sbjct: 155 TLLQQGRFAP--------DKATDIVRGRFIF----KIVQKAMQVILEMIYEEKFIDCSHA 202 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 F+P R A+ L + + W IEGD+++ FD++ H ++M+ +++ I+ +F+ Sbjct: 203 FQPGRGS-AALTLASLHV--GIKKHQTWAIEGDITNCFDSIDHNVIMQIIKKEIACEKFL 259 Query: 189 TLLWKTIKAGHIDVGLFRAASEGVPQ---GGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 L+ ++KAG+ + + G PQ +PLL ++L+E D+++ K Sbjct: 260 ALVKGSLKAGY-KTNVGKVHIPGTPQALTSLCFAPLLCKVVLHELDKFVSSTLNKAKFDT 318 Query: 246 DRWYWN---------NSIQRGRSTAVRENWQ--------WKPAVAYCRYADDFVLIVK-G 287 + +++Q ++ R W+ + Y RYADDFV+I+ G Sbjct: 319 ETPRRRLSGEAIVRASALQTTKTGNRRTLWKTSSDPMDPGFKRLFYVRYADDFVIIITAG 378 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL------------IRK 335 ++A I+ L LKL LN TK ++G L ++ + Sbjct: 379 SRADAVEIKRLVTQFLSEELKLELNQRSTK---SHNGISLLSTQIHAVKWKAQNQVSWVR 435 Query: 336 RSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 R +G++ P+ R A + L+ +++ G + Sbjct: 436 REAHGKVYRRRIHPRLILR--RAPIKELVERLKKYGFV 471 >UniRef50_Q5P2A1 Reverse transcriptase/retron type n=2 Tax=Proteobacteria RepID=Q5P2A1_AZOSE Length = 299 Score = 181 bits (458), Expect = 4e-44, Method: Composition-based stats. 
Identities = 60/189 (31%), Positives = 95/189 (50%), Gaps = 8/189 (4%) Query: 31 AEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSN 90 EA + ++ GA GVD + + L L + + + SG Y P P + V IPK N Sbjct: 16 YEAYQAVKANAGA--AGVDQQSIEAFEQDLKGNLYKIWNRMSSGSYFPPPVKAVAIPKKN 73 Query: 91 GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG 150 G +R LG+P + DR+ Q + +EP E+ F SYG+RP RS A+ V + C Sbjct: 74 GGVRILGVPTVADRVAQMVVKRVIEPELEARFLPDSYGYRPGRS---ALEAVAVTRQRCW 130 Query: 151 ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VGLFRAAS 209 + WV+E D+ FD + H LL++A+++ + M + + + A G + Sbjct: 131 --QYPWVLEFDIKGLFDNIDHVLLLRALKKHVKCEWAMLYIKRWLTAPLQHADGTLEERT 188 Query: 210 EGVPQGGVI 218 +G PQGGV+ Sbjct: 189 KGTPQGGVV 197 >UniRef50_A5N448 Predicted reverse transcriptase/maturase family protein n=2 Tax=Clostridium kluyveri RepID=A5N448_CLOK5 Length = 462 Score = 180 bits (457), Expect = 6e-44, Method: Composition-based stats. Identities = 75/375 (20%), Positives = 158/375 (42%), Gaps = 49/375 (13%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG--VDGVNKTMLQARLAV 62 L + +P+ + QR+ R + ++ A S G + V + + Sbjct: 11 LKDKSNNNPNYKFQRIYRYLFNIDFYFRAYSQVYS------AGENIGNVVTEKVHSFNNE 64 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + + ++L + Y P + K N K I L D ++Q+ ++ ++ I+ +F Sbjct: 65 GVHKIIEKLKNESYCPESLEKS--DKQNKKHSQ--IKGLYDNLIQQIIVEILQSIYNVNF 120 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+ F P ++ H A+ +K + RW ++G++ S F +++ ++K++ +I Sbjct: 121 SVNSHAFIPNKNCHTALYKIKTTCSGA-----RWAVKGNIESCFYNINYDFVIKSLCEKI 175 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 SD RF+ L+ K + AG+ G+ Q ++ +L NI L++FD+Y+++ + Sbjct: 176 SDGRFINLIRKFLAAGYTKEKKNCDTWSGISQRESLANILINIYLDKFDKYINKEF---- 231 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 V Y RY D+F++ + GTK E + E+ + Sbjct: 232 ---------------------------GQVKYTRYLDNFIIFISGTKDLAEYMIEKIKVF 264 Query: 303 
LEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLT 361 L+ L + ++ I +N FLG+ + + + + + + + + + Sbjct: 265 LKDKLNIETTEEEIFIIDLNKQRVKFLGYEITKLKHNFKDNKTDKSKEKMDNEIIQLLIP 324 Query: 362 ALLWKVRISGEILLG 376 A + + RI IL G Sbjct: 325 AEVIRKRIKPFILNG 339 >UniRef50_Q2W777 Retron-type reverse transcriptase n=2 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W777_MAGSA Length = 470 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 82/369 (22%), Positives = 129/369 (34%), Gaps = 70/369 (18%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKS 89 L EA K +D + L L L EL +G + P PA I + Sbjct: 109 LFEAYYDCRRHKRNTASALD------FEMVLENNLMDLLAELQAGTWMPGPATVFAITRP 162 Query: 90 NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDC 149 + R + RDRIV + A+ P++E F S R + + L Sbjct: 163 --RPREVWAAQFRDRIVHHLVYRAINPLFEPAFIADSCACIKGRGTLYGAERLHRHLRSA 220 Query: 150 --GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRA 207 ++ + ++ D++++F ++ H L + RRI D + L K + + + Sbjct: 221 TENWSKPAFYLKADIANFFGSIRHADLFAMLARRIKDPTMLELCRKLVFQDVRRDAIVKD 280 Query: 208 ------------------ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 G+P G + S +N+ L+ DQ + R Sbjct: 281 GAGTLALVPPHKSLFQALPGIGLPIGNLSSQFFANVYLDGLDQMIKRRLGMR-------- 332 Query: 250 WNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKL 309 Y RY DD VLI +KA + A E R L G + L Sbjct: 333 -----------------------HYVRYVDDMVLIHPESKALLSAA-EAIRDHLSG-IGL 367 Query: 310 RLNMDKTKIPHVNDGFIFLGHRLI------RKRSRYGEMRVVSTIPQEKARNFAASLTAL 363 +L KT + V G F+GH + R R+ +R + IP E +F S + Sbjct: 368 KLAEHKTFVAPVTKGVDFVGHVIRPHRRQGRPRTHRAALRRLMEIPAE---DFMPSCNSY 424 Query: 364 LWKVRISGE 372 L R G Sbjct: 425 LGLFRHGGS 433 >UniRef50_C1DF40 Group II intron-encoding maturase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DF40_AZOVD Length = 225 Score = 179 bits (454), Expect = 2e-43, Method: Composition-based stats. 
Identities = 64/190 (33%), Positives = 99/190 (52%), Gaps = 10/190 (5%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 W +P L+ + P L A + + ++GA PG DG+ L + I Sbjct: 46 AWTNAEPDT----LMERVLAPANLKRAYQQVVRNRGA--PGADGMTVADLAGYVKQYWPI 99 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L+ LL+G Y P R V IPK G R LGIP++ DR++Q+A+L + PI++ F S Sbjct: 100 LKARLLAGEYHPQAVRAVEIPKPQGGTRQLGIPSVVDRLIQQALLQQLVPIFDPLFSDYS 159 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 YGFRP RS H A+ + + RW +E D+ +FD V+H +LM V RR+ D + Sbjct: 160 YGFRPGRSAHQAVEMARAHVA----AGQRWCVELDVEKFFDRVNHDVLMACVERRVEDKQ 215 Query: 187 FMTLLWKTIK 196 + L+ + ++ Sbjct: 216 VLRLIRRYLE 225 >UniRef50_UPI000198600A PREDICTED: similar to intron maturase, type II family protein n=1 Tax=Vitis vinifera RepID=UPI000198600A Length = 1155 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 75/361 (20%), Positives = 144/361 (39%), Gaps = 34/361 (9%) Query: 14 SLRIQRLL-RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL 72 + + Q L+ ++I P+ L +A + +DG N + + + +ELL Sbjct: 459 NGKFQDLMVKVIANPQTLEDAYNCIRINSNVDLA-LDGDNIS---------FKSMAEELL 508 Query: 73 SGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 G + + I + + L +P+L+ ++VQ A+ + +E ++ F +S+G R Sbjct: 509 GGSFN-VNVNTFSISTKSARKEVLILPSLKLKVVQEAIRIVLEIVYRPYFSKISHGCRSG 567 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 R A++ + ++++ W ++ D V L+ ++ +I D ++ Sbjct: 568 RGHSTALKYISKEISN-----PDWWFILHVNKKLDAVVLAKLISTMQDKIEDPNLFVMIQ 622 Query: 193 KTIKAGH--IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG--------- 241 A ++ G F G+PQ GV+SP+L NI L+ FD + + Sbjct: 623 NMFHAQVLNLEFGGFPKGH-GLPQEGVLSPILMNIYLDLFDHEFYRMSMRYEALDPGMCI 681 Query: 242 ---KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 K+ W +G V CR+ D+ + G+K + E Sbjct: 682 DHDKSHSKLRSWFRRQLKGNDVKYTGRESSNFRVHSCRFMDEIFFAISGSKDIAIEFKSE 741 Query: 299 
CRGVLEGSLKLRLNMDKTKIP-HVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 ++ SL L ++ +P H G FLG L+++ R +EK R FA Sbjct: 742 ILNYMQNSLHLDVSNQSELLPCHGPHGIQFLG-TLVKRSVRESPTVRAVHKLKEKVRLFA 800 Query: 358 A 358 + Sbjct: 801 S 801 >UniRef50_Q4FUJ8 Possible RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Proteobacteria RepID=Q4FUJ8_PSYA2 Length = 362 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. Identities = 93/363 (25%), Positives = 141/363 (38%), Gaps = 64/363 (17%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 RI L + E L E SKG + L EL L++EL + Sbjct: 9 KRIGNLYESVVSGESLWEGYLGAKKSKGGRR------GCFQFEKSLGRELNELQEELANN 62 Query: 75 HYQPLPARR--VYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 Y+P P + VY PK R + PA RD +VQ A+ + + PI++ F S+ R Sbjct: 63 TYKPRPYFKFIVYEPKK----REIYAPAFRDCVVQYAIYLRVMPIFDKTFIDQSFACRTG 118 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 H A + L G + ++ D+ +F ++ L K + R+I D R + L+ Sbjct: 119 LGTHKAAEYAQDALRRAGPN--TYTLQLDIKKFFYSIDRPTLRKLLERKIKDKRLVDLM- 175 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 + A + + +G+P G ++S + + I +N D Y Sbjct: 176 -MLFADYPE-------PKGIPIGNLLSQMFALIYMNPVDHYATRV--------------- 212 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 KPA YCRY DDF L+ T+AQ R+ +E LKL L+ Sbjct: 213 ---------------LKPAAGYCRYVDDF-LLFGLTRAQALTYRKLLTDFVEQKLKLTLS 256 Query: 313 MDKTKIPHVNDGFIFLGH------RLIRKRSRYGEMRVVSTIPQEKARNF--AASLTALL 364 + I + G F G+ R IRK S Y + V E + AS T L Sbjct: 257 R--STIANTKRGANFCGYRTWRSGRFIRKHSLYKTRKAVRANKLESVISHLAHASKTHSL 314 Query: 365 WKV 367 + Sbjct: 315 QHL 317 >UniRef50_B4SDM1 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Chlorobium/Pelodictyon group RepID=B4SDM1_PELPB Length = 343 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. 
Identities = 76/368 (20%), Positives = 136/368 (36%), Gaps = 63/368 (17%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 + L + I E + AA+ ++KG + + L L + EL + +Q Sbjct: 5 KNLFQSIVTFENVLSAAQ--KAAKGKR----ENQSVLHFFTFLEENLWQILSELRTKTWQ 58 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P + I K K R + +DR+V A++ + P+ E F +Y R + H Sbjct: 59 PGSYKTFSIYKP--KPRMISAAPFKDRVVHHALITIVGPLLERSFIFDTYANRTAKGTHK 116 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 AI + L + +V++ D+ YF ++ H +L +RR+I+ A + L+ I Sbjct: 117 AIERYQHYLK-----KYAYVLKCDIRKYFPSIDHEILKSLLRRKIACADTLWLIDTIIDN 171 Query: 198 GHIDVGLF-----------RAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 +I F +G+P G + S +N L+ D Y+ E Sbjct: 172 SNIQAEHFHYFPGDTLFTPHERRKGLPIGNLTSQFFANYYLSFLDHYVKEVLRCK----- 226 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 Y RY DD+VL +K ++ ++ L+ Sbjct: 227 --------------------------GYVRYVDDYVL-FSDSKDELWEWKKAIEEFLQ-Q 258 Query: 307 LKLRLNMDKTKIPHVNDGFIFLG------HRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360 +L LN +T++ +G FLG +RL+ + + + K SL Sbjct: 259 FRLTLNSGRTELYPATEGKCFLGQKVFQSYRLLPSANVRRAKKRIQCTLLAKPETLQKSL 318 Query: 361 TALLWKVR 368 + R Sbjct: 319 AGWVGHAR 326 >UniRef50_D2R8Z2 CRISPR-associated protein Cas1 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R8Z2_9PLAN Length = 942 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 45/273 (16%) Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 E+Q +L+ G YQP P R+ IPKSNG R L IP+ DR++QR++L + P E F Sbjct: 341 EVQSAARDLVKGTYQPQPCFRLDIPKSNGDRRQLAIPSRLDRVLQRSILDVIAPALELFF 400 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+ +R H A R + TD RW + D +FDT+ H+LL + + + Sbjct: 401 EESSFAYRRGLGRHTAARHLSQAFTD----GYRWALHADFFDFFDTIDHKLLRRRLAAYL 456 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 +D + ++ + ++ G G+P G +SP+L+N+ L++FD+ +H Sbjct: 457 ADPSLVEVIMRWVETG------APHPDHGIPTGAPLSPILANLFLDQFDEAMHSV----- 505 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 RYADDFV++ + +++ +A+ E R Sbjct: 506 ----------------------------GRRLVRYADDFVVLFRD-QSEAQAVISEVRQA 536 Query: 303 LEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK 335 SL+L LN DKT H+ F FLG + Sbjct: 537 -AESLRLELNRDKTHTLHLATSFDFLGLHFEPR 568 >UniRef50_C6PFC6 RNA-directed DNA polymerase (Reverse transcriptase) (Fragment) n=6 Tax=Thermoanaerobacterales RepID=C6PFC6_CLOTS Length = 209 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 59/165 (35%), Positives = 91/165 (55%), Gaps = 6/165 (3%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 LL +I E + EA + +++KG+H GVDG+ L L ++ +LL G Y+P Sbjct: 51 NLLEMILDRENMKEAYKRVVANKGSH--GVDGMEVDELLPYLKENWLTIKQQLLEGKYKP 108 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RV IPK +G R LGIP + DR++Q+A+ + I++ F SYGFRP RS A Sbjct: 109 QPVLRVEIPKPDGGTRLLGIPTVLDRLIQQAIAQILSGIYDHTFSENSYGFRPRRSAKDA 168 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 + + + + WV++ DL +FD V+H +LM + +RI Sbjct: 169 VIAAETYINE----GCTWVVDIDLEKFFDRVNHDILMSKLEKRIG 209 >UniRef50_C8VZI0 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VZI0_DESAS Length = 369 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 83/348 (23%), Positives = 135/348 (38%), Gaps = 55/348 (15%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 L I E L ++ K A L L +EL+S Y Sbjct: 4 YSNLYSSICSFEGLYQSYLKARKRKRYRNE------VLKYTANLGENLIQAEEELISKSY 57 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 + P R+ ++ + K R + DRIVQ ++ + P+ + + SY R H Sbjct: 58 RVSPYRKSFVYEP--KKRLVMALPFGDRIVQWSVYRTLNPLLNKRYISHSYACRTGYGSH 115 Query: 137 HAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 A++ ++ L GR +V++ D++ YF V H ++M + R I D + LL + + Sbjct: 116 RAVKQLQYWLRYLERRHGRIYVLKADMTKYFYRVDHDIIMNILERIIGDYDLIWLLEEIV 175 Query: 196 KAGHIDVGLFRA---------ASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + H GL G+P G + S +++N+ LNE DQY Sbjct: 176 RCEHTWFGLPLDAEGFECELTGEVGIPIGNLTSQMIANLYLNELDQYAKHNLQIK----- 230 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 Y RY DD VLI+ K + I+EE L+ + Sbjct: 231 --------------------------YYMRYMDD-VLILHNDKKYLWHIKEEIEEFLDRN 263 Query: 307 LKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 L+L+LN KT + G ++G+R+ + ST + KAR Sbjct: 264 
LRLKLNN-KTCVRTNTQGIDWIGYRVWPTHVKL----RKSTAQRMKAR 306 >UniRef50_B4UZZ3 RNA-directed DNA polymerase n=1 Tax=Streptomyces sp. Mg1 RepID=B4UZZ3_9ACTO Length = 317 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 77/224 (34%), Positives = 113/224 (50%), Gaps = 16/224 (7%) Query: 152 TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI-DVGLFRAASE 210 + W++EGD+ + FD + H LM+ R R+ D R + L+ +KAG + + GL R Sbjct: 10 KKYEWIVEGDIKACFDEISHTALMERARARVGDKRVLALVKAFLKAGILSEDGLLRDNDT 69 Query: 211 GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKP 270 G PQG ++SPLLSN+ L+ D+++ + G A D +RG P Sbjct: 70 GTPQGSILSPLLSNVALSVLDEHIAQA--PGGAGTDLNERRRRRRRGL-----------P 116 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH 330 RYADD+ L+V GTKA E +REE VL ++ LRL+ +KT I H+ G FLG Sbjct: 117 NYRLVRYADDWCLMVHGTKADAETLREEIAEVLT-TMGLRLSPEKTLITHIEQGLDFLGW 175 Query: 331 RLIRKRSRYGEMRVVSTIPQEKA-RNFAASLTALLWKVRISGEI 373 + R R V T P +KA R A + L +V + + Sbjct: 176 HIQRHRKPGTNRYYVYTYPAKKALRAVMAKVKTLCREVGTNQPL 219 >UniRef50_Q775D8 Reverse transcriptase n=1 Tax=Bordetella phage BPP-1 RepID=Q775D8_9CAUD Length = 328 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. 
Identities = 74/346 (21%), Positives = 130/346 (37%), Gaps = 57/346 (16%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 R + L+ IT E L +A R T S T G + L L+ EL + Sbjct: 2 GKRHRNLIDQITTWENLLDAYRKT-SHGKRRTWG-----YLEFKEYDLANLLALQAELKA 55 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G+Y+ P R + + K R + +DR+VQ A+ + PI+E+ +Y RP++ Sbjct: 56 GNYERGPYREFLVYEP--KPRLISALEFKDRLVQHALCNIVAPIFEAGLLPYTYACRPDK 113 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H + V+ +L TR ++ D S +F ++ L + ++I A LL Sbjct: 114 GTHAGVCHVQAELR---RTRATHFLKSDFSKFFPSIDRAALYAMIDKKIHCAATRRLLRV 170 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 + + G+P G + S L +N+ D+ LH+ Sbjct: 171 VLPDEGV----------GIPIGSLTSQLFANVYGGAVDRLLHDELKQR------------ 208 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 + RY DD V++ + ++ A+ R L L+++ Sbjct: 209 -------------------HWARYMDDIVVLGDDPE-ELRAVFYRLRDFASERLGLKISH 248 Query: 314 DKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 ++ V+ G FLG+R+ + V + K NF Sbjct: 249 --WQVAPVSRGINFLGYRIWPTHKLLRKSSVKR--AKRKVANFIKH 290 >UniRef50_A4VK83 Reverse transcriptase n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VK83_PSEU5 Length = 342 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. 
Identities = 68/290 (23%), Positives = 121/290 (41%), Gaps = 50/290 (17%) Query: 96 LGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR 155 + P + DR+ Q + + +EP +S FH SYG+RP RS A+ + + R Sbjct: 1 MDYPTVSDRVAQTVVKLLIEPELDSIFHPDSYGYRPGRSAKQAVAITRERCW-----RYD 55 Query: 156 WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQ 214 WV+E D+ + FD + H LLMKAVR I + + + + + A G+ G PQ Sbjct: 56 WVVEFDIKAAFDQIDHGLLMKAVRAHIKEDWILLYIERWLVAPFETKDGVCVPRDRGTPQ 115 Query: 215 GGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVA 273 GGV+SP+L N+ ++ FD ++ + Sbjct: 116 GGVVSPILMNLFMHYAFDMWMQ-------------------------------RTSANCP 144 Query: 274 YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND---------- 323 + RYADD V+ + ++ Q E + L L ++ +K+KI + D Sbjct: 145 FARYADDAVVHCR-SRKQAEYMMRTIASRLAD-CGLTMHPEKSKIVYCKDSNRTEQHLHV 202 Query: 324 GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 F FLG +++ E R+ ++ + + + + R++ + Sbjct: 203 SFTFLGFMFRPRKALSKEGRLFTSFLPGASEGALKRMRQTVRRWRLNSQT 252 >UniRef50_Q1XGD2 Group II intron-associated open reading frame n=1 Tax=Physcomitrella patens RepID=Q1XGD2_PHYPA Length = 622 Score = 174 bits (440), Expect = 6e-42, Method: Composition-based stats. 
Identities = 86/367 (23%), Positives = 151/367 (41%), Gaps = 39/367 (10%) Query: 2 QRKLATWAAT----DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 QR L+ AA+ +P LR +L + TQ A + + TPGV+G +L+ Sbjct: 24 QRILSLLAASLGFGNPLLR-DCMLEICTQLTSRIIAVDNLYYTSRSRTPGVNG---KILE 79 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 A + L + +Y+ P R V+I + R L IP + DR VQ +A+EP+ Sbjct: 80 ASSRIGLVRRISWINLKNYKSSPVRHVFIFEPQNGERLLRIPTIFDRTVQHLFKLAIEPV 139 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGR----------------WVIEGD 161 E S+G R + H A+ + L + G+ WVI+ + Sbjct: 140 TEPFADKYSFGSRRGKCAHMAVGEIAYILDTRRQRVGKVSNNKMEQNSKNFVSKWVIDAN 199 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPL 221 + +F + HR +++ S + + +K+ G + G VISPL Sbjct: 200 IKGFFGRIFHRWILENFPMPSSTEIVL---EEWLKSPIEYPGELEVSFRG-----VISPL 251 Query: 222 LSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDF 281 ++N L+ + + + + R W ++G + + + +V RYADDF Sbjct: 252 IANFALDGLENRITSGHRTTMTDPKRTEWTQ--KKGHRYLFNQTRKTE-SVRLIRYADDF 308 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH--VNDGFIFLGHRLIRKRSRY 339 V+I + VE + E+ + L L+L+ +KT+I + + F+G Sbjct: 309 VIITND-ETLVERLFEKAKSFLAER-GLQLSSEKTRIFPWKIGERLNFIGFIFHNIDRAR 366 Query: 340 GEMRVVS 346 +V S Sbjct: 367 SHSQVTS 373 >UniRef50_Q8TIC7 Reverse transcriptase n=1 Tax=Methanosarcina acetivorans RepID=Q8TIC7_METAC Length = 313 Score = 173 bits (439), Expect = 8e-42, Method: Composition-based stats. 
Identities = 77/287 (26%), Positives = 122/287 (42%), Gaps = 23/287 (8%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWL-AEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A ++RL L+T A RI + G T GVDG T + Sbjct: 38 LQTRIAKATHNKNWNLVKRLTYLLTHSYSAKLLAVRIVTQNHGKRTAGVDGQLWTTASDK 97 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIW 118 + L L HY+ P RR+YIPK R L IP + DR +Q +A++P+ Sbjct: 98 MQAAL-----SLSDRHYRAHPLRRIYIPKPGKSTKRHLSIPTMSDRAMQALYSLALQPVA 152 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ T S+GFR + A L +T W++EGD+ FD + H L + Sbjct: 153 ETTADTRSFGFRLFKCAQDASSYAFRCL--WRDTSNPWILEGDIKGCFDNISHSWLKNNI 210 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 S +L + +K+G I F +G PQG +ISPLL+N+ L+ ++ L+ Sbjct: 211 PTDSS------ILSQFLKSGFIFDDTFHHTDKGAPQGSIISPLLANMTLDGIEKLLNAGC 264 Query: 239 LS--------GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRY 277 L + R ++ + + VR +V R+ Sbjct: 265 LKTILYHTFGSETRNLYLFYKTNSSVSHVSKVRSKAAVFTSVRPGRF 311 >UniRef50_B4S9Q0 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S9Q0_PROA2 Length = 337 Score = 173 bits (439), Expect = 9e-42, Method: Composition-based stats. 
Identities = 72/326 (22%), Positives = 118/326 (36%), Gaps = 49/326 (15%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R+ L + E L A R K + + E L+ EL G Sbjct: 2 KRVGLLFERVVAFENLLHATRQAARGKKSQ------LRVAHFLFHQEKECLRLQTELKQG 55 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +QP R I + K R + +DR+VQ A+ + P+ E ++ R + Sbjct: 56 IWQPSGFRVFEIREP--KPRRISAADFQDRVVQHALCNILGPLCERRLIFDTWACRRGKG 113 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A++ + R + ++ D+ YFD+V H +L + + R I D + LL + Sbjct: 114 SHLAMKRAQ-----AFSRRFPYFLKCDIRRYFDSVDHTILKRLLWRLIKDKPVLNLLDRI 168 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 I +G+P G + S +N+ L E D L +R Sbjct: 169 IDHPLPGA----LPGKGLPIGNLTSQHFANLYLGELDHQLKDRMGVK------------- 211 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 AY RY DD LI K+++ + ++ L+L L Sbjct: 212 ------------------AYLRYMDDM-LIFADDKSRLHELVTGIEDFVKQHLQLSLRPS 252 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYG 340 T + V++G FLG R+ R Sbjct: 253 ATLVAPVSEGVPFLGFRIFPGLVRVN 278 >UniRef50_Q7M1J5 Reverse transcription like protein 2, intron-encoded (Fragment) n=1 Tax=Pylaiella littoralis RepID=Q7M1J5_PYLLI Length = 308 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. 
Identities = 78/257 (30%), Positives = 113/257 (43%), Gaps = 67/257 (26%) Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P +RVYIPKS GKLRPLGIP + DR +Q +A++PI E SYGFR RS Sbjct: 3 SPVKRVYIPKSGGKLRPLGIPNMYDRGLQYLWKLALDPIAECRADRHSYGFRKGRSTQDV 62 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + L L+ ++R WV+E D+ +FD ++H +++ + +L + +KAG Sbjct: 63 HTILHLLLS--PKSRCDWVLEADIRGFFDNINHDWIIQNIPMD------KNILREWLKAG 114 Query: 199 HIDVGL--FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 ++ F GVPQGG ISPL++N+ L+ + + Sbjct: 115 ALETTTQEFHKGIAGVPQGGPISPLIANMTLDGLEVW----------------------- 151 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 VA RYADDFV+ + +LE L LN +KT Sbjct: 152 ---------------VAVVRYADDFVVT------------AATKRILER--GLVLNQEKT 182 Query: 317 KIPHVNDGFIFLGHRLI 333 I F F+G Sbjct: 183 CIT-----FDFVGFNFR 194 >UniRef50_Q9FJR9 Similarity to maturase-related protein n=4 Tax=Magnoliophyta RepID=Q9FJR9_ARATH Length = 735 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. 
Identities = 93/423 (21%), Positives = 155/423 (36%), Gaps = 78/423 (18%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITL-SSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 +P L + + E A + GA+ P ++ +Q L LR+ Sbjct: 87 KEPDKTATNLTSYLRRFELWVLAYQKVCCDELGAYVP------RSSIQRSALENLLALRN 140 Query: 70 ELLSGHYQPLPARRVYIPKSNGK-------LRPL-GIPA------LRDRIVQRAMLMAME 115 +L ++ YI K R + I +DRIVQ +LM +E Sbjct: 141 SVLDDRFKWGSRLDFYIKSPRDKTDYESLSKRKIKAILTTTQPTPFQDRIVQEVLLMILE 200 Query: 116 PIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM 175 PI+ES F S+ FRP R+ H +R ++ W ++GDLS D + ++ Sbjct: 201 PIYESRFSQKSFAFRPGRTAHTVLRVIRRNFA-----GYLWYVKGDLSVVLDGMKVGFVI 255 Query: 176 KAVRRRISDARFMTLLWKTIKAGHIDVGL------------------------------- 204 ++ R + D + + L+ + + + Sbjct: 256 SSLMRDVRDKKVIDLIKSALVTPVVTSKVEDGEKKKTKKRKYQKKRVLAEDEPKPDPYWL 315 Query: 205 -----FRAASEG-VPQ---GGVISPLLSNIMLNEFDQYLHERYLSG-KARKDRWYWNNSI 254 F G PQ G++SPLL N+ L+E D+++ + + K WNN Sbjct: 316 ETFFGFAPEEAGKSPQWGHCGILSPLLVNVCLDELDRWMETKVKDFYRPSKSDVIWNNPE 375 Query: 255 QRGRSTAV-------RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSL 307 + Y RY ++ V+G +A +R+E ++ Sbjct: 376 GEADQGNTSWPEFVPTSGPDKTRKMDYVRYGGHILIGVRGPRADAATLRKELIEFVDQKY 435 Query: 308 KLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVST---IPQEKARNFAASLTALL 364 LRL+ + I H+ G +FL H L R R Y +R +T I EK S+TA L Sbjct: 436 MLRLDNENLPIEHITKGIMFLDHVLCR-RVVYPTLRYTATGGKIISEKGVGTLLSVTASL 494 Query: 365 WKV 367 + Sbjct: 495 KQC 497 >UniRef50_A6DE66 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DE66_9PROT Length = 377 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. 
Identities = 76/294 (25%), Positives = 138/294 (46%), Gaps = 42/294 (14%) Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++ ++++ L DELL G Y P P + + KSN K R + I +D++VQ+ + ++ + Sbjct: 26 KIDIDVKKLSDELLRGKYIPSPLQSFELKKSNNKTREIKILTDKDKLVQKVLYESINEFF 85 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 + F SYG+R +S AI+ K + + + +V + D+ ++F+ ++H L+ + Sbjct: 86 DKQFSNRSYGYRIGKSTIKAIKRCKDFI----KRKYFYVFKSDIKNFFENINHNKLISLL 141 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 R I D R + L+ + IK+G + F + GV QG ++SPLLSNI LNEFD++L + Sbjct: 142 DRNIEDKRIIRLIVQFIKSGILKKEYF-SHEIGVHQGDILSPLLSNIYLNEFDKFLESK- 199 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + + RYADDFV+ +K ++ E Sbjct: 200 --------------------------------NIEFVRYADDFVIFMKKNNKEI----PE 223 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEK 352 + ++ L ++ +K+ + GF FLG + ++ I K Sbjct: 224 ILNIFLKNIDLEISEEKSYFSDIYKGFSFLGCFFRNNDVYIDKKKLYFHIESVK 277 >UniRef50_Q8HQ84 ORF786 n=1 Tax=Schizosaccharomyces octosporus RepID=Q8HQ84_SCHOT Length = 786 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. 
Identities = 79/338 (23%), Positives = 151/338 (44%), Gaps = 33/338 (9%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL 72 L + + + + +NK + ++ L++ +L Sbjct: 176 KERIYYDLKGYLKLDDLWYTSYLKLRK--------IKNLNKLNINPLTKTKIMKLKESVL 227 Query: 73 SGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 +G ++ + + K+ + L ++ D++VQ + +EPI+E +F +S+GFRP Sbjct: 228 NGTFEWTNTKHQLLHKTPNQ--NLNQTSINDKLVQEVLKNILEPIFELNFLEVSHGFRPN 285 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 R+ H A++ + ++ D W IEG++ + +T++ LL++ + +R+ D ++LL Sbjct: 286 RNCHIALKYLNTKMKD-----SIWFIEGNIEN--NTLNTTLLIELISKRVKDKLILSLLR 338 Query: 193 KTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKARKDRW 248 +K+ + L G+ G + LL+NI +E D YL + Y +R Sbjct: 339 SALKSNVLMSKELLFIPEVGIEHGSTLKTLLTNIYYHELDNYLQNLSQNYEGSIKASNRR 398 Query: 249 YWNNSIQRGRSTAVRENWQWK-----------PAVAYCRYADDFVLIVKGTKAQVEAIRE 297 +++ R E ++ + V Y RY D F++ V G++ IRE Sbjct: 399 KNPANLRLLREGKKSEAYKLRLPSRDPFEKEYRNVKYIRYGDKFLIGVLGSRKLTLEIRE 458 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK 335 + G L L + LN D TKI H+++G FLG+ RK Sbjct: 459 KVSGFLNDKLNITLNPD-TKINHISNGISFLGYIFSRK 495 >UniRef50_D1I8B4 Whole genome shotgun sequence of line PN40024, scaffold_35.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1I8B4_VITVI Length = 730 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. 
Identities = 91/405 (22%), Positives = 164/405 (40%), Gaps = 62/405 (15%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSK-GAHTPGVDGVNKTMLQARLAVELQILRDEL 71 P + L + + E A + + + GA+ P ++ +Q +L LR+ + Sbjct: 97 PDATVTNLTSFLRRFELWVLAYQKVCADEMGAYMP------RSAIQRSALEDLLGLRNAV 150 Query: 72 LSGHYQPLPARRVYIPKSNGK-------LRPL-GIPA------LRDRIVQRAMLMAMEPI 117 L ++ +I K R + I +D+IVQ + M +EPI Sbjct: 151 LDSRFKWGARLEFFIKSPKDKTEYESLSKRKIRAILTTTQPAPFQDKIVQEVLFMILEPI 210 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E+ F S+ FRP R+ H +R ++ W I+GDLS+ D + L++ A Sbjct: 211 YEARFSEKSFAFRPGRNAHSVLRVIRRSFA-----GYLWYIKGDLSTILDGMKVGLVISA 265 Query: 178 VRRRISDARFMTLLWKTIKAGHID---------------------VGLFRAASEGVP--- 213 + R + D + + LL + I G +E +P Sbjct: 266 LMRDVRDKKVIDLLKAALVTPVITSQKRVLAEDEPKPDPYWLETFFGFAPEEAEKLPSWG 325 Query: 214 QGGVISPLLSNIMLNEFDQYLH----ERYLSGKARKDRWYWNNSIQRGRST----AVREN 265 G++SPLL+N+ L+E D+++ E Y K+ + +++G ++ Sbjct: 326 HCGILSPLLANVCLDELDRWMEGKIKEFYRPSKSDVIWNSPDGEVEQGNTSWPEFVPTSG 385 Query: 266 WQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGF 325 + Y RY ++ V+G +A +R++ + L+L+ + I H+ G Sbjct: 386 PDKTRKMDYIRYGGHILIGVRGPRADAAILRKQLIEFCDQKYMLKLDSESLPIEHITKGI 445 Query: 326 IFLGHRLIRKRSRYGEMRVVST---IPQEKARNFAASLTALLWKV 367 +FL H L R R Y +R +T I EK S+TA L + Sbjct: 446 MFLDHVLCR-RVVYPTLRYTATGGKIISEKGVGTLLSVTASLKQC 489 >UniRef50_Q8YRF1 Alr3497 protein n=10 Tax=Cyanobacteria RepID=Q8YRF1_ANASP Length = 352 Score = 171 bits (432), Expect = 6e-41, Method: Composition-based stats. 
Identities = 76/338 (22%), Positives = 121/338 (35%), Gaps = 57/338 (16%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R L + I E + A+R SK N L EL L+ L Sbjct: 2 KRYGNLYQEIINFENILIASRQAQKSK------RFRDNVLDFNYHLETELIKLQKHLTDK 55 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 YQP R + +N K R + RDR+V A+ + PI+E F SY R Sbjct: 56 TYQPGAYRTFRL--TNPKSRLISAAPYRDRVVHHALCNIIVPIFERAFVADSYANRIGFG 113 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A++ + +V++ D+ YF ++ H +L + +RR+I + L+ Sbjct: 114 THRALKKFTHSARNSP-----YVLQCDIRKYFPSIDHIILKELIRRKIKCPDTLWLIDTI 168 Query: 195 IKAGHIDVGLFR-----------AASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 I + + +G+P G + S +NI LN FD+ + E K Sbjct: 169 IDNSNEQETVIDYFAGDDLLSPITRRKGLPIGNLTSQFFANIYLNGFDKIIKEELKISK- 227 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 Y RY DDF L + + R L Sbjct: 228 ------------------------------YVRYVDDFAL-FSDDRELLADARLAIEAYL 256 Query: 304 EGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGE 341 L+L+++ K+++ G FLG R+ + R Sbjct: 257 -AELRLKIHPIKSQLFETKIGATFLGFRIFSHKIRVRN 293 >UniRef50_C7RQ26 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RQ26_9PROT Length = 364 Score = 170 bits (431), Expect = 7e-41, Method: Composition-based stats. 
Identities = 74/312 (23%), Positives = 122/312 (39%), Gaps = 53/312 (16%) Query: 56 LQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAME 115 + LA L L+ EL +G Y+P +I + K R + RDR+V + +E Sbjct: 37 FEFALADRLLELKRELETGQYRPGGYLNFFIHEP--KRRKISAAPFRDRVVHHPLCNVIE 94 Query: 116 PIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM 175 P +E F SY R + H AI ++ R R+V+ D+ +F ++ H++L Sbjct: 95 PRFERLFIADSYANRRGKGTHRAIDRLQ-----HFAQRHRYVLRADIVKHFPSIDHQVLH 149 Query: 176 KAVRRRISDARFMTLLWKTIKAGHI-------------DVGLFRAASEGVPQGGVISPLL 222 + R + +A M L+ + I +G D L G+P G + S Sbjct: 150 AILARVVPEADLMALIDRIIASGAGVLDEEYATVYFPGDDLLAACRPRGLPIGNLTSQFW 209 Query: 223 SNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFV 282 SN L+ FDQ++ R+ RW AY RY DDF Sbjct: 210 SNCYLHPFDQFV--------TRELRWA-----------------------AYLRYVDDFA 238 Query: 283 LIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEM 342 L +K ++ A + L L+L ++ ++ V +G +LG + R Sbjct: 239 L-FSDSKRELWAWKRAIVERL-ARLRLTIHEGPAQVVPVENGIPWLGFVVFPGYRRVKAR 296 Query: 343 RVVSTIPQEKAR 354 +V + R Sbjct: 297 KVRGATSRLSGR 308 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q47688 Putative uncharacterized protein ykfC n=18 Tax=P... 397 e-109 UniRef50_B0EYP4 Reverse transcriptase-like protein n=48 Tax=cell... 385 e-105 UniRef50_D2QVU8 RNA-directed DNA polymerase (Reverse transcripta... 313 1e-83 UniRef50_A4XCB1 RNA-directed DNA polymerase n=7 Tax=Actinomyceta... 311 2e-83 UniRef50_Q92Y56 Reverse transcriptase n=3 Tax=Alphaproteobacteri... 306 6e-82 UniRef50_B4D9Y9 RNA-directed DNA polymerase (Reverse transcripta... 289 1e-76 UniRef50_Q2G689 RNA-directed DNA polymerase n=5 Tax=Bacteria Rep... 286 1e-75 UniRef50_C8R0Q7 RNA-directed DNA polymerase (Reverse transcripta... 284 2e-75 UniRef50_A9IAV6 Mobile mitochondrial group II intron of COX1 whi... 
283   7e-75
UniRef50_C3QDF8 RNA-directed DNA polymerase n=9 Tax=Bacteroidale...  283   7e-75
UniRef50_UPI00019088D4 mobile mitochondrial group II intron of C...  283   7e-75
UniRef50_Q3CZ44 Prophage LambdaSa1, reverse transcriptase/matura...  283   8e-75
UniRef50_A9B955 RNA-directed DNA polymerase n=1 Tax=Herpetosipho...  282   1e-74
UniRef50_A7VUZ6 Putative uncharacterized protein n=1 Tax=Clostri...  282   1e-74
UniRef50_B8FP60 RNA-directed DNA polymerase n=3 Tax=Firmicutes R...  281   2e-74
UniRef50_A5CZB8 Retron-type reverse transcriptase n=20 Tax=Bacte...  279   1e-73
UniRef50_Q3A299 Prophage LambdaSa1, reverse transcriptase/matura...  276   9e-73
UniRef50_A9IAY4 Reverse transcriptase n=38 Tax=Bacteria RepID=A9...  276   1e-72
UniRef50_Q1J4S7 Reverse transcriptase / RNA maturase / Endonucle...  274   3e-72
UniRef50_B8I7I5 RNA-directed DNA polymerase (Reverse transcripta...  274   5e-72
UniRef50_Q024N3 RNA-directed DNA polymerase n=6 Tax=Bacteria Rep...  272   1e-71
UniRef50_C3B599 Group II intron-encoded protein LtrA n=3 Tax=cel...  272   2e-71
UniRef50_Q56VE2 MatR n=6 Tax=Bacteria RepID=Q56VE2_BACFR  271   2e-71
UniRef50_B9IYU4 Reverse transcriptase n=21 Tax=Bacteria RepID=B9...  271   2e-71
UniRef50_C0YLQ1 RNA-directed DNA polymerase n=40 Tax=Bacteria Re...  271   3e-71
UniRef50_Q02718 Reverse transcriptase homologue COI iA grp II pr...  271   3e-71
UniRef50_D2M2V6 RNA-directed DNA polymerase (Reverse transcripta...  271   4e-71
UniRef50_B0TA92 Reverse transcriptase (RNA-dependent DNA polymer...  271   4e-71
UniRef50_Q0S063 RNA-directed DNA polymerase (Reverse transcripta...  269   1e-70
UniRef50_A5CZJ0 Retron-type reverse transcriptase n=1 Tax=Peloto...  268   2e-70
UniRef50_B9M2H7 RNA-directed DNA polymerase (Reverse transcripta...  268   2e-70
UniRef50_A6YEC9 Putative reverse transcriptase and intron matura...  267   5e-70
UniRef50_P0A3U1 DNA endonuclease n=8 Tax=Firmicutes RepID=LTRA_L...  267   5e-70
UniRef50_B0I1N9 Reverse transcriptase homolog n=7 Tax=cellular o...
267   6e-70
UniRef50_A1T776 RNA-directed DNA polymerase n=1 Tax=Mycobacteriu...  267   6e-70
UniRef50_C9B0U1 RNA directed DNA polymerase n=2 Tax=Enterococcus...  265   2e-69
UniRef50_C7RV41 RNA-directed DNA polymerase (Reverse transcripta...  265   2e-69
UniRef50_Q3ESS6 Reverse transcriptase / RNA maturase / Endonucle...  264   3e-69
UniRef50_C6J6N9 RNA-directed DNA polymerase n=1 Tax=Paenibacillu...  264   4e-69
UniRef50_C6CA58 RNA-directed DNA polymerase n=3 Tax=Enterobacter...  263   6e-69
UniRef50_A9AUN7 RNA-directed DNA polymerase n=4 Tax=Bacteria Rep...  263   6e-69
UniRef50_C3FAV3 RNA-directed DNA polymerase n=11 Tax=Bacteria Re...  263   7e-69
UniRef50_B7C9E4 Putative uncharacterized protein n=1 Tax=Eubacte...  263   9e-69
UniRef50_B4WUH1 Group II intron, maturase-specific domain family...  263   1e-68
UniRef50_C1BDP2 Putative RNA-directed DNA polymerase n=2 Tax=Rho...  263   1e-68
UniRef50_C5CID2 RNA-directed DNA polymerase (Reverse transcripta...  262   1e-68
UniRef50_O47500 RT-like protein n=1 Tax=Venturia inaequalis RepI...  262   2e-68
UniRef50_C5RJ16 RNA-directed DNA polymerase (Reverse transcripta...  262   2e-68
UniRef50_B1N1A3 NicA n=1 Tax=Pseudomonas putida RepID=B1N1A3_PSEPU  261   2e-68
UniRef50_Q74P60 Group II intron reverse transcriptase/maturase n...  261   2e-68
UniRef50_B4D379 RNA-directed DNA polymerase (Reverse transcripta...  261   3e-68
UniRef50_Q3A4Z2 Group II intron-encoding maturase n=98 Tax=Bacte...  261   3e-68
UniRef50_B4CYA7 RNA-directed DNA polymerase (Reverse transcripta...  261   4e-68
UniRef50_B2AJV8 RNA-directed DNA polymerase, retrotranscriptase ...  260   5e-68
UniRef50_B1HW67 Possible group II intron reverse transcriptase/m...  260   5e-68
UniRef50_A3WXE2 Putative reverse transcriptase n=1 Tax=Nitrobact...  259   9e-68
UniRef50_B4D301 RNA-directed DNA polymerase (Reverse transcripta...  259   9e-68
UniRef50_Q01P79 RNA-directed DNA polymerase (Reverse transcripta...  259   1e-67
UniRef50_A4KVN1 Probable reverse transcriptase n=2 Tax=Sinorhizo...
259   2e-67
UniRef50_Q188V0 Group II intron reverse transcriptase/maturase n...  258   2e-67
UniRef50_B0K6R3 RNA-directed DNA polymerase (Reverse transcripta...  258   3e-67
UniRef50_C3FJT8 RNA-directed DNA polymerase (Reverse transcripta...  256   8e-67
UniRef50_C2KES2 Reverse transcriptase/maturase n=14 Tax=Firmicut...  256   1e-66
UniRef50_Q1QGR6 RNA-directed DNA polymerase (Reverse transcripta...  256   1e-66
UniRef50_C9BNF1 Group II intron reverse transcriptase/maturase n...  256   1e-66
UniRef50_B8R181 Putative intron-encoded reverse transcriptase n=...  256   1e-66
UniRef50_C4ZES6 RNA-directed DNA polymerase n=27 Tax=Bacteria Re...  255   2e-66
UniRef50_B2A0J9 RNA-directed DNA polymerase (Reverse transcripta...  255   2e-66
UniRef50_D2CJC8 Putative uncharacterized protein orf2 (Fragment)...  254   3e-66
UniRef50_A8MI91 RNA-directed DNA polymerase (Reverse transcripta...  253   7e-66
UniRef50_C5ER86 RNA-directed DNA polymerase n=1 Tax=Clostridiale...  253   7e-66
UniRef50_B0URY2 RNA-directed DNA polymerase n=25 Tax=cellular or...  253   8e-66
UniRef50_C3B585 Reverse transcriptase/endonuclease protein n=1 T...  253   8e-66
UniRef50_C6IQ61 Putative uncharacterized protein n=10 Tax=Bacter...  253   1e-65
UniRef50_C3LL08 Group II intron reverse transcriptase/maturase n...  252   1e-65
UniRef50_A0RHJ0 Reverse transcriptase/endonuclease protein n=6 T...  252   1e-65
UniRef50_Q1Q0X4 Similar to Group II intron encoded reverse trans...  252   2e-65
UniRef50_Q82RB7 Putative reverse transcriptase homolog; similar ...  252   2e-65
UniRef50_Q9MD87 Putative maturase n=1 Tax=Cryphonectria parasiti...  252   2e-65
UniRef50_B7HM08 Group II intron reverse transcriptase/maturase n...  251   2e-65
UniRef50_A8ZN56 RNA-directed DNA polymerase n=2 Tax=Cyanobacteri...  251   3e-65
UniRef50_C3KST3 Group II intron reverse transcriptase/maturase n...  251   3e-65
UniRef50_A5VLF2 RNA-directed DNA polymerase (Reverse transcripta...  251   3e-65
UniRef50_D2CK02 Putative uncharacterized protein orf3 (Fragment)...
251   3e-65
UniRef50_C1PA09 RNA-directed DNA polymerase n=3 Tax=Firmicutes R...  251   3e-65
UniRef50_D0LS09 RNA-directed DNA polymerase n=1 Tax=Haliangium o...  251   4e-65
UniRef50_UPI0001C42942 reverse transcriptase n=1 Tax=Bacillus ps...  251   4e-65
UniRef50_Q5ZTU1 Reverse transcriptase n=1 Tax=Legionella pneumop...  251   5e-65
UniRef50_C0JX29 Putative reverse transcriptase and intron matura...  250   5e-65
UniRef50_B7CEC9 Putative uncharacterized protein n=1 Tax=Eubacte...  250   7e-65
UniRef50_B7JTB6 Group II intron reverse transcriptase/maturase n...  250   7e-65
UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0...  250   7e-65
UniRef50_C8VXL4 RNA-directed DNA polymerase (Reverse transcripta...  249   9e-65
UniRef50_C1L365 Group II intron-encoded protein n=1 Tax=Bacillus...  249   9e-65
UniRef50_Q24QQ9 Putative uncharacterized protein n=1 Tax=Desulfi...  249   1e-64
UniRef50_Q9T654 Cox1I1a maturase (Fragment) n=2 Tax=cellular org...  249   1e-64
UniRef50_A9BGC0 RNA-directed DNA polymerase (Reverse transcripta...  249   1e-64
UniRef50_Q64E53 Prophage LambdaSa1 transcriptase/maturase family...  249   1e-64
UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=...  249   2e-64
UniRef50_Q02717 Reverse transcriptase homologue COI ialpha grp I...  248   2e-64
UniRef50_Q11ZP4 RNA-directed DNA polymerase n=33 Tax=Bacteria Re...  248   2e-64
UniRef50_A7GTD4 RNA-directed DNA polymerase n=1 Tax=Bacillus cyt...  248   2e-64
UniRef50_C3BJV7 D-alanine--D-alanine ligase A (D-alanylalanine s...  248   2e-64
UniRef50_B3GTB4 Putative reverse-transcriptase protein n=1 Tax=V...  248   3e-64
UniRef50_Q0AW97 RNA-directed DNA polymerase (Reverse transcripta...  248   3e-64
UniRef50_Q08WW1 Prophage LambdaSa1, reverse transcriptase/matura...  247   4e-64
UniRef50_A5VH22 RNA-directed DNA polymerase n=1 Tax=Sphingomonas...  247   4e-64
UniRef50_A5ZWA2 Putative uncharacterized protein n=2 Tax=Clostri...  247   5e-64
UniRef50_A6DJK4 Reverse transcriptase/maturase n=21 Tax=Chlamydi...
247   6e-64
UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcripta...  246   9e-64
UniRef50_Q3B1V7 RNA-directed DNA polymerase n=31 Tax=Bacteria Re...  245   2e-63
UniRef50_B2JXR4 RNA-directed DNA polymerase n=10 Tax=Bacteria Re...  245   2e-63
UniRef50_A9ENQ0 Integron/retron-type RNA-directed DNA polymerase...  245   2e-63
UniRef50_P03876 Putative COX1/OXI3 intron 2 protein n=3 Tax=Sacc...  244   3e-63
UniRef50_A1ZX33 Group II intron-encoded protein LtrA n=2 Tax=Bac...  244   4e-63
UniRef50_A2TD24 Intron encoded protein n=2 Tax=Bacillaceae RepID...  244   5e-63
UniRef50_Q119U8 RNA-directed DNA polymerase n=30 Tax=Bacteria Re...  243   7e-63
UniRef50_Q7UY81 Reverse transcriptase/maturase n=1 Tax=Rhodopire...  243   8e-63
UniRef50_P03875 Putative COX1/OXI3 intron 1 protein n=3 Tax=Fung...  243   8e-63
UniRef50_C9S0G0 RNA-directed DNA polymerase n=2 Tax=Geobacillus ...  242   1e-62
UniRef50_Q5U7I7 Maturase-related protein n=20 Tax=Gammaproteobac...  242   2e-62
UniRef50_C2XKK9 D-alanine--D-alanine ligase A (D-alanylalanine s...  242   2e-62
UniRef50_Q1PUN9 Strong similarity to group II intron-encoded pro...  242   2e-62
UniRef50_P38478 Uncharacterized mitochondrial protein ymf40 n=1 ...  241   2e-62
UniRef50_C4ZCX5 RNA-directed DNA polymerase n=24 Tax=Bacteria Re...  241   3e-62
UniRef50_P05511 Uncharacterized 91 kDa protein in cob intron n=1...  241   3e-62
UniRef50_Q47DU4 RNA-directed DNA polymerase (Reverse transcripta...  241   3e-62
UniRef50_C9P0Q5 Retron-type reverse transcriptase n=3 Tax=Vibrio...  241   4e-62
UniRef50_B7I148 Reverse transcriptase n=9 Tax=Bacillus RepID=B7I...  241   5e-62
UniRef50_B9J6F8 Reverse transcriptase n=7 Tax=Bacillus cereus gr...  240   6e-62
UniRef50_A0L945 RNA-directed DNA polymerase (Reverse transcripta...  240   6e-62
UniRef50_Q35062 CoxI intron2 ORF n=2 Tax=Marchantia polymorpha R...  240   6e-62
UniRef50_Q94Z00 Orf757 n=4 Tax=stramenopiles RepID=Q94Z00_PYLLI  240   7e-62
UniRef50_B1L2I7 Reverse transcriptase/endonuclease protein n=37 ...
240   7e-62
UniRef50_UPI0001C42A66 RNA-directed DNA polymerase (Reverse tran...  239   1e-61
UniRef50_C6MS68 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...  238   2e-61
UniRef50_A8VT23 S-layer domain protein n=12 Tax=Bacilli RepID=A8...  238   2e-61
UniRef50_Q93PB4 MS117, putative maturase n=1 Tax=Microscilla sp....  237   6e-61
UniRef50_Q3S275 ORF718 n=2 Tax=Eukaryota RepID=Q3S275_THAPS  236   1e-60
UniRef50_Q6EI10 Reverse transcriptase/HNH endonuclease n=2 Tax=E...  235   2e-60
UniRef50_Q35056 CoxII intron2 ORF n=4 Tax=Embryophyta RepID=Q350...  235   2e-60
UniRef50_C6MRB5 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...  235   2e-60
UniRef50_B1I9Z1 GBSi1, group II intron, maturase n=33 Tax=Firmic...  235   2e-60
UniRef50_C7V8C7 Reverse transcriptase n=1 Tax=Enterococcus faeca...  234   4e-60
UniRef50_B7KM76 RNA-directed DNA polymerase (Reverse transcripta...  233   9e-60
UniRef50_B4WV39 Group II intron, maturase-specific domain family...  232   2e-59
UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepI...  232   2e-59
UniRef50_Q94Z24 Orf568 n=1 Tax=Pylaiella littoralis RepID=Q94Z24...  232   2e-59
UniRef50_Q8A4I4 Reverse transcriptase n=1 Tax=Bacteroides thetai...  232   2e-59
UniRef50_C6MXE9 RNA-directed DNA polymerase n=1 Tax=Geobacter sp...  232   2e-59
UniRef50_B3PDY2 Putative maturase n=1 Tax=Cellvibrio japonicus U...  231   4e-59
UniRef50_A7BUU9 RNA-directed DNA polymerase n=1 Tax=Beggiatoa sp...  230   7e-59
UniRef50_C5D9G3 RNA-directed DNA polymerase (Reverse transcripta...  229   1e-58
UniRef50_Q8TJY1 Reverse transcriptase n=5 Tax=Methanosarcina Rep...  228   3e-58
UniRef50_Q2FUJ3 RNA-directed DNA polymerase n=1 Tax=Methanospiri...  228   3e-58
UniRef50_A5IEI2 Reverse transcriptase n=7 Tax=Bacteria RepID=A5I...  227   6e-58
UniRef50_B1VA32 Retron-type reverse transcriptase n=7 Tax=Candid...  226   8e-58
UniRef50_Q1D1V6 Group II intron, maturase n=2 Tax=Myxococcales R...  226   2e-57
UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8Y...
224   3e-57
UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular or...  224   5e-57
UniRef50_Q1VQM5 Prophage LambdaSa1, reverse transcriptase/matura...  224   6e-57
UniRef50_A7MS60 Putative uncharacterized protein n=21 Tax=Vibrio...  224   6e-57
UniRef50_B8FP59 RNA-directed DNA polymerase n=9 Tax=Firmicutes R...  224   6e-57
UniRef50_Q8HQ89 ORF777 (Fragment) n=1 Tax=Schizosaccharomyces oc...  223   1e-56
UniRef50_B0I1N8 Reverse transcriptase homolog n=2 Tax=Pylaiella ...  222   1e-56
UniRef50_Q9G8T2 Orf762 n=2 Tax=Eukaryota RepID=Q9G8T2_RHDSA  221   4e-56
UniRef50_Q7YAJ3 Putative reverse transcriptase and intron matura...  220   8e-56
UniRef50_A4C8M3 RNA-directed DNA polymerase (Reverse transcripta...  220   8e-56
UniRef50_D0VMZ3 Putative reverse-transcriptase protein n=1 Tax=V...  219   1e-55
UniRef50_Q47277 Orf protein n=63 Tax=cellular organisms RepID=Q4...  218   2e-55
UniRef50_B3JM52 Putative uncharacterized protein n=1 Tax=Bactero...  218   3e-55
UniRef50_A7BYN3 RNA-directed DNA polymerase n=2 Tax=Beggiatoa sp...  217   5e-55
UniRef50_A6LY84 RNA-directed DNA polymerase (Reverse transcripta...  216   1e-54
UniRef50_D2LU13 RNA-directed DNA polymerase (Reverse transcripta...  216   1e-54
UniRef50_A8KXN1 RNA-directed DNA polymerase (Reverse transcripta...  216   1e-54
UniRef50_C0JWS6 Putative reverse transcriptase and intron matura...  216   1e-54
UniRef50_O99479 Reverse transcriptase homolog n=2 Tax=Eukaryota ...  215   2e-54
UniRef50_B4WW73 Group II intron, maturase-specific domain family...  215   2e-54
UniRef50_Q94Z25 Orf557 n=2 Tax=Pylaiella littoralis RepID=Q94Z25...  214   3e-54
UniRef50_Q8GAR1 Reverse transcriptase n=20 Tax=Enterobacteriacea...  212   1e-53
UniRef50_Q7XXA4 OSJNBa0019G23.12 protein n=2 Tax=Oryza sativa Re...  212   2e-53
UniRef50_A7UDN1 Putative reverse transcriptase n=2 Tax=Candida z...  212   2e-53
UniRef50_Q6TFE1 Putative group II intron-encoded maturase n=1 Ta...  211   5e-53
UniRef50_Q7GEU5 Putative uncharacterized protein (Fragment) n=3 ...
210   6e-53
UniRef50_A1BI39 CRISPR-associated protein Cas1 n=5 Tax=Chlorobia...  210   7e-53
UniRef50_UPI00016C4F75 RNA-directed DNA polymerase (Reverse tran...  209   2e-52
UniRef50_B8R160 Reverse transcriptase n=2 Tax=Volvox carteri Rep...  208   2e-52
UniRef50_B5W904 Group II intron maturase-specific domain protein...  208   3e-52
UniRef50_D2FQY0 Regulatory protein GntR n=2 Tax=Staphylococcus a...  208   3e-52
UniRef50_A6YE98 Putative reverse transcriptase and intron matura...  207   6e-52
UniRef50_A8LGE6 RNA-directed DNA polymerase n=1 Tax=Frankia sp. ...  207   7e-52
UniRef50_Q8YWX6 Alr1468 protein n=4 Tax=Cyanobacteria RepID=Q8YW...  207   7e-52
UniRef50_D2LF37 RNA-directed DNA polymerase (Reverse transcripta...  207   8e-52
UniRef50_UPI0001C388AF RNA-directed DNA polymerase n=1 Tax=Arthr...  206   8e-52
UniRef50_B9K440 18S rRNA intron 1 protein n=1 Tax=Agrobacterium ...  206   1e-51
UniRef50_D1RME6 Reverse transcriptase family protein n=1 Tax=Leg...  206   1e-51
UniRef50_C3EEI5 Group II intron reverse transcriptase/maturase n...  204   4e-51
UniRef50_A5B8L7 Putative uncharacterized protein n=2 Tax=Vitis v...  204   6e-51
UniRef50_Q9G8T4 Orf621 n=1 Tax=Rhodomonas salina RepID=Q9G8T4_RHDSA  203   7e-51
UniRef50_Q10VN2 RNA-directed DNA polymerase (Reverse transcripta...  203   7e-51
UniRef50_UPI0001982DF4 PREDICTED: hypothetical protein n=1 Tax=V...  202   2e-50
UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU  201   3e-50
UniRef50_Q35063 CoxI intron1 ORF n=2 Tax=Eukaryota RepID=Q35063_...  201   4e-50
UniRef50_B1C301 Putative uncharacterized protein n=6 Tax=Clostri...  200   8e-50
UniRef50_A5B6Q1 Putative uncharacterized protein n=4 Tax=Vitis v...  199   2e-49
UniRef50_Q1KSC2 Putative non-LTR retroelement reverse transcript...  199   2e-49
UniRef50_C5NNP0 Putative unclassified retrotransposon protein n=...  199   2e-49
UniRef50_UPI0001C15D3C hypothetical protein CRC_00192 n=2 Tax=No...  198   3e-49
UniRef50_Q53NG0 Retrotransposon protein, putative, unclassified ...
198   3e-49
UniRef50_A6P1G1 Putative uncharacterized protein n=1 Tax=Bactero...  196   1e-48
UniRef50_C6I8L1 CRISPR-associated protein n=1 Tax=Bacteroides sp...  195   2e-48
UniRef50_A4WS58 RNA-directed DNA polymerase (Reverse transcripta...  194   5e-48
UniRef50_B3CUR8 Reverse transcriptase n=24 Tax=Orientia tsutsuga...  194   5e-48
UniRef50_Q1Q3I7 Putative uncharacterized protein n=1 Tax=Candida...  194   5e-48
UniRef50_A5B242 Putative uncharacterized protein n=1 Tax=Vitis v...  193   7e-48
UniRef50_UPI00019859F4 PREDICTED: hypothetical protein n=1 Tax=V...  193   1e-47
UniRef50_Q8RSV8 Maturase n=1 Tax=uncultured marine bacterium Rep...  192   1e-47
UniRef50_A5BQN8 Putative uncharacterized protein n=3 Tax=Vitis v...  192   2e-47
UniRef50_A5BBH2 Putative uncharacterized protein (Fragment) n=1 ...  192   2e-47
UniRef50_A5BN31 Putative uncharacterized protein n=2 Tax=Vitis v...  192   2e-47
UniRef50_P19593 Probable reverse transcriptase n=2 Tax=Scenedesm...  191   3e-47
UniRef50_UPI00019844C7 PREDICTED: hypothetical protein n=1 Tax=V...  191   4e-47
UniRef50_UPI0000DF064A Os02g0261600 n=1 Tax=Oryza sativa Japonic...  191   5e-47
UniRef50_Q2R4V1 Retrotransposon protein, putative, unclassified ...  191   5e-47
UniRef50_A5BKT2 Putative uncharacterized protein n=3 Tax=Vitis v...  190   7e-47
UniRef50_Q7XPE7 OSJNBa0060N03.14 protein n=6 Tax=Oryza sativa Re...  189   2e-46
UniRef50_D2LKK7 RNA-directed DNA polymerase (Reverse transcripta...  188   2e-46
UniRef50_Q67M30 Group II intron-encoding maturase n=1 Tax=Symbio...  188   3e-46
UniRef50_B6IMH7 Phage-encoded reverse transcriptase, putative n=...  188   3e-46
UniRef50_Q7XUD8 OSJNBa0088A01.7 protein n=2 Tax=Poaceae RepID=Q7...  187   6e-46
UniRef50_A1RKU4 RNA-directed DNA polymerase (Reverse transcripta...  186   1e-45
UniRef50_Q7XE51 Retrotransposon protein, putative, unclassified ...  186   2e-45
UniRef50_A5B2E0 Putative uncharacterized protein n=7 Tax=Vitis v...  185   2e-45
UniRef50_Q7YAJ6 Putative reverse transcriptase and intron matura...
185 2e-45 UniRef50_A6TR85 RNA-directed DNA polymerase (Reverse transcripta... 184 3e-45 UniRef50_B0VI85 RNA-directed DNA polymerase (Reverse transcripta... 184 6e-45 UniRef50_Q35064 Atp9 intron ORF n=1 Tax=Marchantia polymorpha Re... 183 9e-45 UniRef50_Q2QNF1 Retrotransposon protein, putative, unclassified ... 183 9e-45 UniRef50_B8GRZ4 RNA-directed DNA polymerase (Reverse transcripta... 183 1e-44 UniRef50_A5AV27 Putative uncharacterized protein n=9 Tax=Vitis v... 182 1e-44 UniRef50_C0FSR2 Putative uncharacterized protein n=1 Tax=Rosebur... 182 1e-44 UniRef50_Q01HH7 OSIGBa0142I02-OSIGBa0101B20.12 protein n=9 Tax=O... 182 2e-44 UniRef50_A5ZQ10 Putative uncharacterized protein n=1 Tax=Ruminoc... 181 4e-44 UniRef50_A5BR45 Putative uncharacterized protein n=1 Tax=Vitis v... 180 5e-44 UniRef50_A5ARA6 Putative uncharacterized protein n=1 Tax=Vitis v... 180 6e-44 UniRef50_C0A8Z3 RNA-directed DNA polymerase (Reverse transcripta... 180 6e-44 UniRef50_UPI0001983C14 PREDICTED: hypothetical protein n=1 Tax=V... 179 2e-43 Sequences not found previously or not previously below threshold: UniRef50_A5ALM2 Putative uncharacterized protein n=1 Tax=Vitis v... 182 2e-44 UniRef50_A5CBN4 Putative uncharacterized protein n=2 Tax=Vitis v... 182 2e-44 UniRef50_A5ATF6 Putative uncharacterized protein n=5 Tax=Vitis v... 180 8e-44 >UniRef50_Q47688 Putative uncharacterized protein ykfC n=18 Tax=Proteobacteria RepID=YKFC_ECOLI Length = 376 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. 
 Identities = 376/376 (100%), Positives = 376/376 (100%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL
Sbjct: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES
Sbjct: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180
           DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR
Sbjct: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180

Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240
           RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS
Sbjct: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240

Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300
           GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR
Sbjct: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300

Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360
           GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL
Sbjct: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360

Query: 361 TALLWKVRISGEILLG 376
           TALLWKVRISGEILLG
Sbjct: 361 TALLWKVRISGEILLG 376

>UniRef50_B0EYP4 Reverse transcriptase-like protein n=48 Tax=cellular organisms RepID=B0EYP4_ECOLX
          Length = 507

 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats.
 Identities = 361/364 (99%), Positives = 362/364 (99%)

Query: 1   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60
           MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL
Sbjct: 6   MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 65

Query: 61  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120
           AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES
Sbjct: 66  AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 125

Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180
           DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR
Sbjct: 126 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 185

Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240
           RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS
Sbjct: 186 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 245

Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300
           GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ EAIREECR
Sbjct: 246 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQAEAIREECR 305

Query: 301 GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360
           GVLEGSLKLRLNMDKTKI HVNDGFIFLGHR+IRKRSRYGEMRVVSTIPQEKARNFAASL
Sbjct: 306 GVLEGSLKLRLNMDKTKITHVNDGFIFLGHRIIRKRSRYGEMRVVSTIPQEKARNFAASL 365

Query: 361 TALL 364
           TALL
Sbjct: 366 TALL 369

>UniRef50_D2QVU8 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QVU8_9SPHI
          Length = 507

 Score = 313 bits (801), Expect = 1e-83, Method: Composition-based stats.
Identities = 128/398 (32%), Positives = 202/398 (50%), Gaps = 24/398 (6%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 QRKL W+ T P+ + L +T L EA R +KG TPG+DG+ ++ R+ Sbjct: 16 QRKLYQWSQTHPTEAYRELWNWLTDLRNLREAWRRVAQNKGKRTPGIDGMTVGSIRQRIG 75 Query: 62 V--ELQILRDELLSGHYQPLPARRVYIPK--SNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 L L+ +L +G Y+P P RR IPK GK RPLGIP + DR+VQ A+ +EPI Sbjct: 76 EAPFLATLQQQLRTGSYKPSPCRRKLIPKAGKPGKFRPLGIPTIADRVVQSAIKQVLEPI 135 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLT---------DCGETRGRWVIEGDLSSYFDT 168 E+ F +SYGFRP R H A+ +++ + E +WVIEGD+ S FD Sbjct: 136 LEARFWPVSYGFRPGRGCHGALEHIRMSMRPRKVNKQDNKRHEMPYQWVIEGDIQSCFDH 195 Query: 169 VHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLN 228 + H LM +R+ +D R LL + +KAG + F G PQGG++SPLL+N+ L Sbjct: 196 IDHHQLMDRIRQHSADRRVNQLLVQFLKAGILSEEQFLRTDAGTPQGGIVSPLLANVALG 255 Query: 229 EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGT 288 ++ +++ + ++ + + I+ + + + RYADDFV++V GT Sbjct: 256 LIEERYER-WVNHQTKRRQSRQCDGIKAAMWSRSVDRQAGRAVYFPFRYADDFVILVSGT 314 Query: 289 KAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTI 348 + +A R+ + +L+ + L L+ +KTKI + +GF FLGHR+ + I Sbjct: 315 QENAQAERKVLQTLLQEKMGLTLSPEKTKITPLTEGFQFLGHRVSMRWDYRYGWTPRLEI 374 Query: 349 PQEKARNFAASLTALLWK----------VRISGEILLG 376 P++KA + + L + ++ IL G Sbjct: 375 PKQKAADLRYRIKQLTGRATLGWSLDELLQKLNPILRG 412 >UniRef50_A4XCB1 RNA-directed DNA polymerase n=7 Tax=Actinomycetales RepID=A4XCB1_SALTO Length = 488 Score = 311 bits (798), Expect = 2e-83, Method: Composition-based stats. 
Identities = 134/372 (36%), Positives = 197/372 (52%), Gaps = 22/372 (5%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 MQ KL WAA P R L L+ P L A + GA T GVDG+ ++ + Sbjct: 25 MQAKLHRWAAAGPGRRFDDLFNLVHDPATLLVAYSRVAGNLGARTAGVDGMTVADVERHI 84 Query: 61 AV--ELQILRDELLSGHYQPLPARRVYIPKSNG--KLRPLGIPALRDRIVQRAMLMAMEP 116 V L LR ++ +G ++PLP R IPK G K+R LGIP + DR+VQ A+ + +EP Sbjct: 85 GVPGFLDDLRVQVKTGTFRPLPVRERKIPKPGGSGKVRRLGIPTVADRVVQAALKLVLEP 144 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I+E+DF +SYGFRP+R AI + G RWV++ D+ + FD++ H LM Sbjct: 145 IFEADFLPVSYGFRPKRRAQDAIAEIH----YYGTHGYRWVLDADIEACFDSIDHVALMD 200 Query: 177 AVRRRISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 VR RI D R +TL+ +KAG + ++G R G PQGG++SPLL+NI L D++L Sbjct: 201 RVRTRIKDKRVLTLVKAFLKAGILTELGDRRDTHTGTPQGGILSPLLANIALTVLDEHLM 260 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 + + +RG +T RYADDFV++V G + +A+ Sbjct: 261 AGWRPDATMASEYRRAQLRKRGEATW-----------RLVRYADDFVVLVHGGEDHAQAL 309 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR-YGEMRVVSTIPQEKAR 354 RE+ +L L LRL+ KT++ H++DGF FLG + +R R + V + I + R Sbjct: 310 REDVATML-APLGLRLSPAKTRVVHLSDGFDFLGFHIQWRRKRGTNKWHVYTFIAKRPIR 368 Query: 355 NFAASLTALLWK 366 + A + AL + Sbjct: 369 SLKAKVRALTRR 380 >UniRef50_Q92Y56 Reverse transcriptase n=3 Tax=Alphaproteobacteria RepID=Q92Y56_RHIME Length = 505 Score = 306 bits (785), Expect = 6e-82, Method: Composition-based stats. 
Identities = 129/390 (33%), Positives = 193/390 (49%), Gaps = 15/390 (3%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +QRKL W+ +P + + + +T L A + S+KG T GVDG+ ++ R Sbjct: 15 IQRKLYQWSKANPDDQWRDMWGWLTDLRVLRHAWQRVASNKGGRTAGVDGMTVGRIRNRS 74 Query: 61 A-VELQILRDELLSGHYQPLPARRVYIPK--SNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 L L+ +L SG Y+P PARR IPK G+ RPLGIP +RDR+VQ A + +EPI Sbjct: 75 EHRFLVDLQADLRSGAYRPSPARRKLIPKAGKPGQFRPLGIPTIRDRVVQGAAKILLEPI 134 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQL--------TDCGETRGRWVIEGDLSSYFDTV 169 +E+ F +SYGFRP R+ H A+ ++ T WVIEGD+ FD + Sbjct: 135 FEAQFWHVSYGFRPGRNTHGALEYIRRAALPQKRDEDTRRNRLPYPWVIEGDIKGCFDNI 194 Query: 170 HHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE 229 +H L++ +R+RI D R + L+ +KAG + F G PQGG+ISPLL+NI L+ Sbjct: 195 NHHHLLERMRKRIGDRRVVRLVGLFLKAGVLTEDQFLRTDAGTPQGGIISPLLANIALSA 254 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 ++ + + + +N + S + + RYADDFV++V G+ Sbjct: 255 IEERYER-WTYHRKKTQARRKSNGVAAAASARDSDRIAGRCVYLPVRYADDFVVLVSGSL 313 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIP 349 + A + L + L L +KTK+ + +GF FLG R + IP Sbjct: 314 EEAMAEKSALADYLIKTTGLTLLPEKTKVTAMTEGFEFLGFRFSVHWDKRYGYGPRVEIP 373 Query: 350 QEKARNFAASLTALLWKVRIS---GEILLG 376 + KA N + L + IS GE L G Sbjct: 374 KAKAANLRHKVKQLTQRDSISVSLGEKLRG 403 >UniRef50_B4D9Y9 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Chthoniobacter flavus Ellin428 RepID=B4D9Y9_9BACT Length = 495 Score = 289 bits (740), Expect = 1e-76, Method: Composition-based stats. 
Identities = 127/386 (32%), Positives = 187/386 (48%), Gaps = 51/386 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +Q L A + P R L + + + LA A R ++ G GVDGV L A Sbjct: 77 LQITLYRKAQSKPEYRFWSLYGEVQRADVLAAAWRRVKANAG--AAGVDGVTIEKLAADA 134 Query: 61 AV---ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 V L LR+EL Y+P P RRV IPK++G R LGIP L+DR+VQ A+ + + PI Sbjct: 135 QVEAAWLNGLREELHGKTYRPAPVRRVKIPKASGGYRGLGIPTLKDRVVQMAVYLVLMPI 194 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E+DFH SYGFRP R+ H A+ ++ L V++ DL+ YFDT+ HRLLM+ Sbjct: 195 FEADFHPRSYGFRPGRNAHQAVEEIREAL----RMGKTEVVDADLAQYFDTIPHRLLMRQ 250 Query: 178 VRRRISDARFMTLLWKTIKAGHID-----VGLFRAASEGVPQGGVISPLLSNIMLNEFDQ 232 V RR+SD + L+ ++A ++ +A G PQGGVISPLL+NI L+ D Sbjct: 251 VARRVSDGMILKLIKAWLRAPILEEEEGGGRRMKANPCGTPQGGVISPLLANIYLHPLD- 309 Query: 233 YLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQV 292 +V ++ Q KP + RYADD V++ + + Sbjct: 310 ---------------------------DSVNDHCQQKPRM--IRYADDLVILCRPGE--G 338 Query: 293 EAIREECRGVLEGSLKLRLNMDKTKIP-HVNDGFIFLGHRLIRKRSRYGEMRVVSTIP-- 349 ++E L+ L LN KT++ GF FLG ++S+ G V + Sbjct: 339 RGMKERLARWLQSR-GLTLNETKTRVVQSCESGFEFLGFTFRWQQSKKGTPYVHTEPSPA 397 Query: 350 -QEKARNFAASLTALLWKVRISGEIL 374 ++ RN LT R++G+ + Sbjct: 398 AKQSLRNRVRELTRRSTTWRVTGQTV 423 >UniRef50_Q2G689 RNA-directed DNA polymerase n=5 Tax=Bacteria RepID=Q2G689_NOVAD Length = 633 Score = 286 bits (731), Expect = 1e-75, Method: Composition-based stats. 
Identities = 116/399 (29%), Positives = 190/399 (47%), Gaps = 48/399 (12%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 ++ L RL+ P A + +KGA TPGVDG +++ + + L + Sbjct: 20 GRKVNGLYRLLKSPLLWEHAYQRIAPNKGAMTPGVDGQT---FDGFSPDKVRSIIERLAN 76 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G Y+P PARRVYIPK+NG+ RPLG+P D++VQ + +E I+E F S+GFRP+R Sbjct: 77 GTYRPQPARRVYIPKANGQKRPLGVPTTEDKLVQEVVRTILEQIYEPLFSRHSHGFRPKR 136 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S H A+ +++ T +W+I+ D+ +FD + H +L+ + +RI+D RF+ L+ Sbjct: 137 SCHTALESIR-----AIWTGVKWLIDVDVVGFFDNIDHDVLVSLLEKRIADRRFVRLIRG 191 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER---YLSGKARKDRWYW 250 +KAG+++ +F G PQGGV+SP+L+NI L+E D ++ + + GK R Sbjct: 192 LLKAGYVEDWVFHKTYSGTPQGGVVSPMLANIYLHELDMFMQAKMAGFDKGKQRSPSPDA 251 Query: 251 NNSIQRGRSTAVREN------------------------------------WQWKPAVAY 274 R + + Y Sbjct: 252 RRIRNRLSYVRRTVDQLRAKGRGDDPRVTSFLEEIGRLKAERLAVPASDAFDPNYRRLRY 311 Query: 275 CRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIR 334 CRYADDF++ V G+K++ I EE R L LKL ++ +K+ I +DG FLG+ + Sbjct: 312 CRYADDFIIGVTGSKSEARQIMEEVRTYLSDHLKLAVSAEKSGIHKASDGARFLGYEVRT 371 Query: 335 KRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 + + + R A + L+ + R+ + Sbjct: 372 MTN-PNPHKAIFDGRPAVRRGLADRMKLLVPRDRVVRFV 409 >UniRef50_C8R0Q7 RNA-directed DNA polymerase (Reverse transcriptase) n=6 Tax=Bacteria RepID=C8R0Q7_9DELT Length = 508 Score = 284 bits (728), Expect = 2e-75, Method: Composition-based stats. 
Identities = 117/352 (33%), Positives = 172/352 (48%), Gaps = 44/352 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 LL I + +A ++KG PG+D + A + +R LL+G YQP Sbjct: 89 NLLERILSRANMLKAWERVKANKG--APGMDNMPIADFMAFAREHWEEIRASLLAGTYQP 146 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 LP +RV IPK G RPLGIP + DR++Q+AM + PI++ DF SYGFRP RS H A Sbjct: 147 LPVKRVEIPKPTGGTRPLGIPTVLDRLIQQAMAQVLLPIFDPDFSEASYGFRPGRSAHDA 206 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I V+ + R ++ DLS +FDTV H LLM V R++ D R + L+ K ++AG Sbjct: 207 IHRVRDYI----RQGYRVAVDADLSKFFDTVDHDLLMNRVGRKVRDQRVLRLVGKYLRAG 262 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + G R +GVPQGG +SPLLSNI+L++ D+ L Sbjct: 263 VMIDGRRRETRKGVPQGGPLSPLLSNILLDDLDKELER---------------------- 300 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF+++VK +A E + LE LKL +N +K+K+ Sbjct: 301 -----------RGHRFARYADDFIILVKSRRAG-ERVMTGITRFLESKLKLVVNQEKSKV 348 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEM---RVVSTIPQEKARNFAASLTALLWKV 367 N+ FLG + R+ + + + R++ S+ L K+ Sbjct: 349 APTNES-GFLGFIFKGAKIRWSDKAFAEFKRRVKKLTGRSWGVSMAFRLAKL 399 >UniRef50_A9IAV6 Mobile mitochondrial group II intron of COX1 which is involved in pre-mRNA splicing and in deletion of introns from mitochondrial DNA n=1 Tax=Bordetella petrii DSM 12804 RepID=A9IAV6_BORPD Length = 606 Score = 283 bits (724), Expect = 7e-75, Method: Composition-based stats. 
Identities = 115/401 (28%), Positives = 178/401 (44%), Gaps = 60/401 (14%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 + RI L RL+ P +A ++ GA T GVDG + L L L Sbjct: 17 SQQGKRINGLSRLMENPILWKQAYVNIYANSGATTAGVDG---SSLDGMSYERLAGLMAA 73 Query: 71 LLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + SG+Y+ P RRV IPKSNGK RPLGIP D++VQ + M + I+E F S+GFR Sbjct: 74 VKSGNYRFKPVRRVLIPKSNGKTRPLGIPTGDDKLVQEVVRMLLVKIYEPVFSDDSHGFR 133 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 RS H A+ V+ T +W++ D+ YFD + H +L+ + +RI D RF+ L Sbjct: 134 NGRSCHTALMQVRQ-----KWTGMKWIVNMDIKGYFDNIDHEVLVDVLAKRIDDKRFLGL 188 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY---LHERYLSGKARKDR 247 + +KAG+++ F G PQGGV+SP+L+NI L+E D+Y L + G R Sbjct: 189 IHSMLKAGYMEDWKFHDTFSGTPQGGVVSPVLANIYLHELDEYVAGLKAEFNRGNRRASN 248 Query: 248 WYWNN----------------------------------SIQRGRSTAVRENWQWKPAVA 273 + ++R ++ + Sbjct: 249 REYKRISGAIERLMKRIDAYKADGDSPKVEEAKRELAELYLRRKALSSSDPMDANYRRLV 308 Query: 274 YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLI 333 Y RYADDF++ + G++ + + + G + L L + +K+ + H +DG FLG+ + Sbjct: 309 YVRYADDFLIGIIGSRDEAVTVMQRVAGFISDKLHLEIAEEKSGVVHASDGVRFLGYDVR 368 Query: 334 RKRS---------------RYGEMRVVSTIPQEKARNFAAS 359 R R+ +P EK R+F Sbjct: 369 TYSGDRNVRTVRSGRSITARSVSERMQLHVPAEKLRSFCQR 409 >UniRef50_C3QDF8 RNA-directed DNA polymerase n=9 Tax=Bacteroidales RepID=C3QDF8_9BACE Length = 606 Score = 283 bits (724), Expect = 7e-75, Method: Composition-based stats. 
Identities = 121/406 (29%), Positives = 190/406 (46%), Gaps = 54/406 (13%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + +RL R++ E A + + G TPG DG + + L Sbjct: 23 DYKFERLYRILFNEEMFHVAYQRIYAKPGNMTPGTDGKTI---NRMSLQRINKVIASLRD 79 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y+P PA+R +IPK NGK RPLGIP+ D++VQ + M +E I+E F S+GFRP R Sbjct: 80 ESYKPNPAKRTHIPKKNGKKRPLGIPSFEDKLVQEVVRMILEAIYEEVFANTSHGFRPNR 139 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 S H A+ ++ T +W +EGD+ +FD + H +L+ +R+RI D RF+ L+ K Sbjct: 140 SCHTALTHIQKTF-----TGTKWFVEGDIKGFFDNIDHNVLIATLRKRIDDNRFLRLIRK 194 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDRWYW 250 + AG+I+ F ++G PQGG ISP+L+NI L+ FD+Y+ E R+ GK R + Sbjct: 195 LLNAGYIEDWRFHNTNKGTPQGGNISPILANIYLDNFDKYMEEYALRFNKGKERHITKEY 254 Query: 251 NN-------------------------------SIQRGRSTAVRENWQWKPAVAYCRYAD 279 +R + + + Y RYAD Sbjct: 255 KQLSGKMQGILKSIKNIKDADARLQLRDEYVKLGRERQKIESRDSMDETYRRFRYVRYAD 314 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS-- 337 DF++ V G+KA I+ + +E +LKL L+ +KT I + FLG + ++S Sbjct: 315 DFLIGVIGSKADCVKIKSDITNYMEENLKLELSQEKTLITNAQTPAKFLGFEVSVRKSDV 374 Query: 338 ----------RYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 RY ++V + E RN +A+ +KV ++ Sbjct: 375 VKRNKNNVSARYYNGKIVLKVAIETVRNKLEEYSAIRYKVENGRQV 420 >UniRef50_UPI00019088D4 mobile mitochondrial group II intron of COX1 which IS involved in pre-mRNA splicing and in deletion of introns from n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI00019088D4 Length = 533 Score = 283 bits (724), Expect = 7e-75, Method: Composition-based stats. 
Identities = 122/424 (28%), Positives = 197/424 (46%), Gaps = 56/424 (13%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 +R ++ A + RI L RL+ P A+A + GA TPG+D + L Sbjct: 8 RRLISLPALSRQGKRINGLHRLLDCPNIWAQAYEAIARNSGALTPGID--PRNTLDGFSL 65 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 L + + G Y+ P RR YIPK+NGKLRPLGIP D++VQ A+ + +E I+E + Sbjct: 66 DRLNGIMRRVKEGSYRFKPVRRHYIPKANGKLRPLGIPDADDKLVQAAVKLVLEQIYEPN 125 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP+RS H A+ +++ W++E D++ +FD + H +LM +R+R Sbjct: 126 FSRRSHGFRPKRSCHTALASIQKT-----WGGTVWLVEADIAGFFDNIDHDILMNLLRKR 180 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERY 238 I D RF+ L+ + AG+++ + A+ G PQGGVISPLL+N+ L+EFD+++ R+ Sbjct: 181 IDDERFLKLIRGMLTAGYMEDWKWHASYSGTPQGGVISPLLANVYLHEFDEFMDSLKTRF 240 Query: 239 LSGKARKDRWYWNNSIQRG----------------------------------RSTAVRE 264 G R + + +G + + Sbjct: 241 DRGIERPTNPEYQKLLSKGAHCRQRIAKLRSSGREAEAERLRAQLQPLIVAARKLPSKDF 300 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND- 323 + + + Y RYADDF++ V GTK + I E L+ L L + +K+ I +D Sbjct: 301 HDTFFRRLQYVRYADDFLISVIGTKQEAADILNEVTSFLQDQLHLEVAPEKSGITKADDG 360 Query: 324 GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF----------AASLTALLWKVRISG-E 372 G FLG+ + + + E RV + + L A + R+ Sbjct: 361 GVTFLGYAVRSTKRAFREKRVTRKGRRAMVKRTPVRGIRLHLPREKLAAFAKRNRLGSYH 420 Query: 373 ILLG 376 + G Sbjct: 421 AIRG 424 >UniRef50_Q3CZ44 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=2 Tax=Streptococcus agalactiae RepID=Q3CZ44_STRAG Length = 439 Score = 283 bits (724), Expect = 8e-75, Method: Composition-based stats. 
Identities = 119/374 (31%), Positives = 182/374 (48%), Gaps = 40/374 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRK+ D + L + + + L A +KG+ G+D ++A Sbjct: 16 FQRKIYLSTKADNKRKFGVLYDKVYRKDILKVAWFYVKRNKGS--AGIDDFTIEEIEAYG 73 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L + D+L + YQP +RVYIPK+NGK RPLGIP +RDR+VQ A+ + +EPI+E Sbjct: 74 VQKFLDEIEDQLRNKKYQPKAVKRVYIPKANGKKRPLGIPTVRDRVVQTAVKIVIEPIFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS + AIR + L WVI+ DL YFDT+ H L+ V+ Sbjct: 134 ADFQKFSYGFRPKRSANQAIREIYKYLNY----GCEWVIDADLKGYFDTIPHDKLLLLVK 189 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R++D + LL ++AG ++ R+ G PQGGVISPLL+NI LN D+Y L Sbjct: 190 ERVTDKSIIKLLSLWLEAGIMEDNQVRSNILGTPQGGVISPLLANIYLNALDRYWKNNRL 249 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIV-KGTKAQVEAIREE 298 G+ RYADDFV++ K + ++ Sbjct: 250 EGRGH--------------------------DAHLIRYADDFVILCSNNPKKYYQYAKQR 283 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS-RYGEMRVVSTIPQEKARNFA 357 L L LN +KT+I H +GF FLG+ L + +S + G+ + ++ ++ Sbjct: 284 I-----DKLGLTLNEEKTRIVHATEGFDFLGYTLRKSKSHKSGKYKTYYYPSRKSMKSIK 338 Query: 358 ASLTALLWKVRISG 371 + ++ + Sbjct: 339 GKVKDVIQTGQHLN 352 >UniRef50_A9B955 RNA-directed DNA polymerase n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B955_HERA2 Length = 587 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. 
Identities = 118/367 (32%), Positives = 187/367 (50%), Gaps = 26/367 (7%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 Q LA + R+ L+ A LS+ GA T G+DG+ K + Sbjct: 21 QEVLAHRSLNQQ--PFHRVFNLMRTRRLATVALNRVLSNTGARTAGIDGMTKKHIATDTE 78 Query: 62 VE--LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + +Q + +L + Y+P P RRVYIPK+NG+ RPLGIP ++DR+VQ + + ++PI+E Sbjct: 79 QQALVQEIWHDLTTHQYRPAPVRRVYIPKANGQQRPLGIPTIKDRVVQEMVRLILDPIYE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 S F+ SYGFRP R+ HHA+ ++ + G + +EGD+ + FD +HH L++ +R Sbjct: 139 STFYRHSYGFRPYRATHHAVVRLRDLI---GRRGYQMALEGDIRACFDRIHHTTLIRILR 195 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R I D R +T++ + +KAG +D G +R +G PQGG++SPLL+NI LNE DQ++ R+ Sbjct: 196 RTIKDERLITVIHQMLKAGVMDDGQWRVTEDGTPQGGIVSPLLANIYLNELDQWVANRWD 255 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + ++ RYADDFV+++ GT A+ ++ Sbjct: 256 TYTPLERYYHRKAGTGY--------------PCQITRYADDFVVLLHGTHAEATTLKTAL 301 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L L L L+ +KT I V GF FLG + + + T ++ F Sbjct: 302 ATFLADHLHLELSAEKTLITPVEQGFDFLGFHIRKYQDS-----TRITPSRKAIATFKRE 356 Query: 360 LTALLWK 366 + K Sbjct: 357 AADRIGK 363 >UniRef50_A7VUZ6 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VUZ6_9CLOT Length = 605 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. 
Identities = 125/373 (33%), Positives = 191/373 (51%), Gaps = 34/373 (9%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + D S R R+ R + ++ A + + +G TPG DG + ++ Sbjct: 16 LKQKSKQDESHRYDRIYRNLFNEDFFLRAYQKIHAKQGNMTPGTDGTTIDGFSRK---QI 72 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + L YQP P RR YIPK NGK+RPLGIPA D++VQ + +E I+E F Sbjct: 73 SQLIELLKWERYQPKPVRRTYIPKKNGKMRPLGIPAFADKLVQEVVRQILEAIYEPIFSD 132 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 S+GFRP RS H A+ +K WVIEGD++ FD + H +L+K + ++I D Sbjct: 133 NSHGFRPNRSCHTALYQIKSTCR-----GTNWVIEGDITGCFDHIDHEILLKILLKKIDD 187 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSG 241 RF+ L+WK +KAG+++ + G PQGG+ISP+L+NI L+EFD+++ Y G Sbjct: 188 GRFLELIWKFLKAGYLEFNQKYNSLSGTPQGGIISPILANIYLHEFDKFMEGISAEYTKG 247 Query: 242 KARKDRWYWN--------------------NSIQRGRSTAVRENWQWKPAVAYCRYADDF 281 K R+ + Q A+ + V Y RYADDF Sbjct: 248 KQRRPYREYQILQYKRNRAKKKGNQEQADEYLRQMQNIPALDPMDKNYQRVKYVRYADDF 307 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYG 340 V+ + G+KA I+E G ++ L L L+ +KTKI +++D FLG+ + +S+ Sbjct: 308 VVCIIGSKATANEIKERIAGFMQKELHLELSREKTKITNLSDKRVRFLGYEI--TKSQEN 365 Query: 341 EMRVVSTIPQEKA 353 +VV +I ++K Sbjct: 366 TKQVVDSIGRKKR 378 >UniRef50_B8FP60 RNA-directed DNA polymerase n=3 Tax=Firmicutes RepID=B8FP60_DESHD Length = 607 Score = 281 bits (720), Expect = 2e-74, Method: Composition-based stats. 
Identities = 121/403 (30%), Positives = 188/403 (46%), Gaps = 57/403 (14%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + R +RL R + P++ A + ++ GA TPGVD L ++ Sbjct: 12 QRQSQKQDYRFERLYRNLYNPDFYLLAYQKLYANNGAMTPGVDRTT---LDGTGMERIES 68 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 L L + YQP PARR YIPK +GK RPLGI A D++VQ + M +E I+E F Sbjct: 69 LIQSLKNRSYQPQPARRRYIPKKSGKGQRPLGIQAANDKLVQEVVRMLLESIYEPTFLDS 128 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 S+GFRP RS H A+ ++ +W +EGD+ +YFDT+ H +L+ +RRRI D Sbjct: 129 SHGFRPNRSCHTALARMQRSF-----NGVKWFVEGDIKAYFDTIDHHMLVNILRRRIQDE 183 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGK 242 F++L+WK ++AG+++ + A G PQG SPLL+N+ L+E D Y+ E R+ G Sbjct: 184 NFISLIWKFLRAGYLEDWQYNATYSGSPQGSGASPLLANLYLHELDLYMEEYKQRFDKGN 243 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQW---------------------------------- 268 R+ + + + + V+ + W Sbjct: 244 RRQAGKDYGRAQRTHQYRKVKYDRLWLTLTDEEKKAAQREIRALRKRMLECPANDPMDGT 303 Query: 269 KPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFL 328 + Y RY DDF++ + G+K E ++ + L LKL L+ +KT I H D FL Sbjct: 304 YRRIQYVRYCDDFLVGIIGSKTDAEKVKADICRFLSDKLKLTLSPEKTLITHGQDKARFL 363 Query: 329 GHRL-----------IRKRSRYGEMRVVSTIPQEKARNFAASL 360 G+ + R +SR +V +P++K Sbjct: 364 GYDIAVCQDNTTRRTSRGQSRVHSGKVKLYVPKDKWVGKLREY 406 >UniRef50_A5CZB8 Retron-type reverse transcriptase n=20 Tax=Bacteria RepID=A5CZB8_PELTS Length = 423 Score = 279 bits (713), Expect = 1e-73, Method: Composition-based stats. 
Identities = 113/365 (30%), Positives = 178/365 (48%), Gaps = 45/365 (12%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ + + + L +A + +++G PG+DG L L L EL +G Sbjct: 2 KKWYSLIDKVYRLDNLEKAYQAVRANRG--APGIDGETVEAFGQNLGQRLIQLHHELKTG 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P +RV IPK +G RPLGIP +RDR+VQ+A+L ++PI+E FH SYG+RP RS Sbjct: 60 TYEPQPVKRVEIPKPDGSTRPLGIPTVRDRVVQQALLNILQPIFEPGFHPSSYGYRPGRS 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + + +V++ DLS FD + H L+++ V R+ISD + L+ K Sbjct: 120 CHQAVAKAERFMNKY---GLEYVVDMDLSKCFDRLDHELILEEVNRKISDGSVLKLIKKF 176 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 + AG + G + G PQGGVISPLL+NI L+ FDQ + Sbjct: 177 LTAGVMKDGQWDEIDTGSPQGGVISPLLANIYLDRFDQAMKS------------------ 218 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 + RYADD ++ + T+ + R+ +LEG LKL +N + Sbjct: 219 ---------------RGIRIVRYADDILVFAR-TRKEAGNYRQVATQILEGELKLEVNKE 262 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSR---YGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 KT + V++G +LG + +K + I + RN SL ++ ++ Sbjct: 263 KTHLTSVHEGVAYLGFIIHKKHVSIHPKKIKKFKDRIRELTPRNHGMSLKEMIKRL---N 319 Query: 372 EILLG 376 +L G Sbjct: 320 PVLRG 324 >UniRef50_Q3A299 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=5 Tax=Proteobacteria RepID=Q3A299_PELCD Length = 446 Score = 276 bits (706), Expect = 9e-73, Method: Composition-based stats. 
Identities = 122/386 (31%), Positives = 184/386 (47%), Gaps = 49/386 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +QRKL A +P+ R L + + + L+ A + ++KG+ G+DGV ++ R Sbjct: 11 LQRKLYRKAKQEPACRFHALYDKVYRADILSHAYALVRANKGS--AGIDGVTFAAIEERE 68 Query: 60 -LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++ + L + L S Y+P P +RV IPK++G RPLGIP +RDR+ Q A+ + +EPI+ Sbjct: 69 GVSALIAELEEALRSKTYKPDPVKRVMIPKADGSQRPLGIPTIRDRVAQMAVKLVVEPIF 128 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP++S H A+ V + VI+ D+S YFDT+ H LM V Sbjct: 129 EADFCDTSYGFRPKKSAHDAVDDVAYAMN----IGYTEVIDADISKYFDTIPHTNLMAVV 184 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGL---------FRAASEGVPQGGVISPLLSNIMLNE 229 RI D + L+ +K+ ++VG + G PQGGVISPLL+N+ L+ Sbjct: 185 AERICDGAILHLIQMWLKSSVMEVGKDGKKRNVGGGKGNRRGTPQGGVISPLLANLYLHI 244 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 D+ R N Q + RYADD VL+ + K Sbjct: 245 LDRIWERR---------------------------NLQQRLNARIVRYADDTVLLCRRNK 277 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRY-GEMRVVST 347 + R +LE L L LN KTK+ + GF FLG + +SR G Sbjct: 278 SD--EAMAVLRQILE-RLGLTLNEAKTKVVNGYKGGFDFLGFSIRMGKSRRTGNYYPHVQ 334 Query: 348 IPQEKARNFAASLTALLWKVRISGEI 373 ++ + +T L +VR + Sbjct: 335 PSKKSLQVIKDRVTKLTNRVRTVRPL 360 >UniRef50_A9IAY4 Reverse transcriptase n=38 Tax=Bacteria RepID=A9IAY4_BORPD Length = 572 Score = 276 bits (705), Expect = 1e-72, Method: Composition-based stats. 
Identities = 103/373 (27%), Positives = 173/373 (46%), Gaps = 39/373 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++ L ++T A A R ++G TPGVDG+ + +A+ Sbjct: 34 LQARIVKATREGKHGKVKALQWILTHSFSGKALAVRRVTENQGKKTPGVDGITWSTPEAK 93 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L + Y+P P +RVYIPK+NGK+RPLGIP ++DR +Q L+A+EP+ E Sbjct: 94 SQAML-----SIKRRGYRPQPLKRVYIPKANGKMRPLGIPTMKDRAMQALYLLALEPVAE 148 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + S+GFRPERS AI QL + +W++EGD+ FD + H LM + Sbjct: 149 TTADRSSFGFRPERSTADAIGLCFTQL--ALKRSPKWILEGDIKGCFDNISHDWLMGHI- 205 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +KAG+++ G PQGG+ISP L+N++L+ + L + Sbjct: 206 -----PTDREILSKWLKAGYMEDRQLFPTEAGTPQGGIISPTLANLVLDGLEAKLEAVFG 260 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 Q + AV Y RYADDF++ + + + + Sbjct: 261 -------------------RARYINGKQTRLAVNYVRYADDFIVTARSKELLEQEVMPLV 301 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L L+ +KTKI H+++GF FLG + + + +++ + F Sbjct: 302 EEFMRER-GLTLSPEKTKITHIDEGFDFLGQNIRKY-----DGKLLIKPSKANVATFLGK 355 Query: 360 LTALLWKVRISGE 372 + A + + + Sbjct: 356 VRAAIKGNKAVNQ 368 >UniRef50_Q1J4S7 Reverse transcriptase / RNA maturase / Endonuclease n=2 Tax=Bacteria RepID=Q1J4S7_STRPF Length = 625 Score = 274 bits (702), Expect = 3e-72, Method: Composition-based stats. 
Identities = 110/389 (28%), Positives = 183/389 (47%), Gaps = 45/389 (11%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + + + +RL R + + +A + ++ G T GVD + A + Sbjct: 32 RKHSKEENYTYKRLYRNLYNIDLFLQAYQNIYANAGNMTKGVDNQTIS---AMSLERINK 88 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + D L Y P P +RVYIPK NGKLRPLGIP++ D++VQ M + I++ F S Sbjct: 89 IIDSLKDESYSPTPTKRVYIPKKNGKLRPLGIPSIGDKLVQEVCRMLLNSIYDESFEDTS 148 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFR RS H A+R ++ + C +W +EGD+ +FD + H +++ + +RI D R Sbjct: 149 HGFRDNRSCHTALRQIQNRFVRC-----KWFVEGDIKGFFDNIDHNIMIDILSKRIDDER 203 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HERYLSGKA 243 F+ L+ K +K+G+++ + G+PQG +ISP+LSNI L++FD+Y+ E + G Sbjct: 204 FLRLIRKFLKSGYMEQNQYHNTYSGMPQGSIISPILSNIYLDKFDKYMQNYKESFDKGNK 263 Query: 244 RKDRWYWNNSIQRGRS-------------------------------TAVRENWQWKPAV 272 RK + R + + + + Sbjct: 264 RKQNKEYKALYDRRKRLENKLSKTTNKTEIDDIKSEIEEINKRYFNIPCLNPMDENFKRI 323 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RYADDF++ + G+KA E ++++ ++ L L L+ +KT + D FLG + Sbjct: 324 QYVRYADDFIIGIIGSKADAEMVKQDIGQFIKSELNLELSDEKTLVTKSTDRAKFLGFDI 383 Query: 333 IRKRSRYGEMRVVSTIPQEKARNFAASLT 361 R + I KARNF + Sbjct: 384 RVTPGSNHTKRTKAGI---KARNFGGHVR 409 >UniRef50_B8I7I5 RNA-directed DNA polymerase (Reverse transcriptase) n=29 Tax=Bacteria RepID=B8I7I5_CLOCE Length = 618 Score = 274 bits (700), Expect = 5e-72, Method: Composition-based stats. 
Identities = 101/366 (27%), Positives = 182/366 (49%), Gaps = 11/366 (3%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 A + L+++I+ E + A R S+ G+HT G D +N ++ +L + Sbjct: 24 AQSKQGKSFTNLMKVISSEENIRLAYRNIKSNSGSHTSGTDTLNIKDIEKLSVEKLVEMM 83 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 L+ YQP P +RV IPK NGK RPLGIP + DR+VQ+ +L +EPI E+ F+ S G Sbjct: 84 QRKLA-WYQPKPVKRVEIPKPNGKTRPLGIPTIVDRLVQQCILQVLEPICEAKFYERSNG 142 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR-RRISDARF 187 FRP RS HA+ + + +V++ D+ +FD V+H L++ + I D + Sbjct: 143 FRPNRSAEHAMAQCYRMVQ---KQNLYFVVDVDIKGFFDNVNHSKLIRQMWAMGIRDKQL 199 Query: 188 MTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + ++ + +KA + G ++G PQGG++SPLL+NI+LNE D ++ ++ ++ Sbjct: 200 ICIIKQMLKAPVVMPDGETLYPTKGTPQGGILSPLLANIVLNELDWWISSQWEDMLTHRE 259 Query: 247 RWYWNNSIQRGRSTAV--RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 + N+ + V + RYADDF + + ++ I + L+ Sbjct: 260 YYVSVNNNGSLNKSGVFRTLRRSALKEMYIVRYADDFKIFCR-KRSDANKIFVAVKKWLK 318 Query: 305 GSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTAL 363 LKL ++ +K+K+ ++ + FLG + R + + V S + ++ + L Sbjct: 319 DRLKLEISEEKSKVVNLKKHYSEFLGFQFKTVR-KGRKFVVRSHMSEKAIKRETEKLKEQ 377 Query: 364 LWKVRI 369 + ++ Sbjct: 378 IKEIAH 383 >UniRef50_Q024N3 RNA-directed DNA polymerase n=6 Tax=Bacteria RepID=Q024N3_SOLUE Length = 435 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. 
Identities = 115/387 (29%), Positives = 170/387 (43%), Gaps = 49/387 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML---Q 57 +++KL A +P R L I + + L A + G PGVDGV + Sbjct: 8 LRQKLGQKAKQEPKFRFYALYDRIWRKDVLETAWERVRQNDG--APGVDGVTIEEIMKTD 65 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 +A L+ + + L Y+P +RVYI K NGKLRPLGIP +RDR+VQ A L+ +EPI Sbjct: 66 QGVAGFLEGIENSLRRKTYRPEAVQRVYIEKENGKLRPLGIPTVRDRVVQMATLLILEPI 125 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E+DF SYGFRP RS H A+ ++ + E + V + DL YFD++ H L+ Sbjct: 126 FEADFLDCSYGFRPGRSAHQALEEIRGHV----EAGYQAVYDADLKGYFDSIPHTQLLAC 181 Query: 178 VRRRISDARFMTLLWKTIKAGHID------VGLFRAASEGVPQGGVISPLLSNIMLNEFD 231 VR R+ D + L+ ++A ++ + +G PQGGV SPLL+N+ L+ FD Sbjct: 182 VRMRVVDRSVLKLIRMWLEAPVVEREEGGGGSKWSRPEKGTPQGGVASPLLANLYLHWFD 241 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 + G K RYADDFV++ K Sbjct: 242 ALFYGPEGPG--------------------------GKADAKLVRYADDFVVMA---KQM 272 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVND---GFIFLGHRLIRKRSRYG--EMRVVS 346 E LEG +L +N +KT++ + + FL H R R G + Sbjct: 273 GTETIEFIESRLEGKFQLEINREKTRVVDLREEGASLDFLSHTFRRDRDLKGRDRKYLNV 332 Query: 347 TIPQEKARNFAASLTALLWKVRISGEI 373 + R L + + I Sbjct: 333 FPSAKAVRKERRKLHEMTDSHQCFKPI 359 >UniRef50_C3B599 Group II intron-encoded protein LtrA n=3 Tax=cellular organisms RepID=C3B599_BACMY Length = 623 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. 
Identities = 123/405 (30%), Positives = 192/405 (47%), Gaps = 59/405 (14%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 + + + + RL R + P + +A S++G T G DG N ++ IL Sbjct: 14 KSNNKNYKYNRLYRNLYNPAFYLKAYTNISSNQGNMTKGTDGKNIDGFS---LEKINILI 70 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 + L YQP P++RV+IPK NG RPLGIP +D+++Q + M +E I+E F S+G Sbjct: 71 ESLKDESYQPHPSKRVFIPKKNGSKRPLGIPTFKDKLLQEVIRMILEAIYEMSFKESSHG 130 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 FRP+RS H A+ V+ T +W +EGD+S +FD + H L+ +RRRI+D +F+ Sbjct: 131 FRPKRSCHTALHKVRKTF-----TGVKWFVEGDISGFFDNIDHHTLIALLRRRITDEKFI 185 Query: 189 TLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 L+WK ++AG+++ +R + G PQGG+ISPLLSN+ L E D ++ E Y ++ + Sbjct: 186 RLIWKFLRAGYLEEWKYRGSYSGTPQGGIISPLLSNVYLTELDTFMEE-YQKEFSKGAKR 244 Query: 249 YWNNSIQRGRSTAVRENWQWK--------------------------------------P 270 + +R + Q K Sbjct: 245 KTTKAYKRQEYLTYKHRKQLKENWGQLTEQEKKQGIIKYKSLKEELLKTPFGDPMDDSYK 304 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH 330 + Y RYADDF++ + G KA ++E+ L LKL L+ +KT I H + FLG+ Sbjct: 305 RIQYVRYADDFLVGIIGNKADAVKVKEDLTNFLRDKLKLELSQEKTLITHSSKKARFLGY 364 Query: 331 RLIRKR------------SRYGEMRVVSTIPQEKARNFAASLTAL 363 + R +R+ MR +P EK AL Sbjct: 365 NITVSRMPVTKRDKNGFLTRHQIMRCKLYLPSEKWIEKLKQYGAL 409 >UniRef50_Q56VE2 MatR n=6 Tax=Bacteria RepID=Q56VE2_BACFR Length = 599 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. 
Identities = 114/367 (31%), Positives = 181/367 (49%), Gaps = 42/367 (11%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + +P+ + +RL RL+ A A ++ G T G DG + +Q + D Sbjct: 15 SQNPNYKFERLYRLLFNENLYALAYQMMSKKTGNMTKGTDGQTIS---GMSIKRIQSIID 71 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 +L YQP PA+R+YIPK NGK RPLGIP+ D++VQ+ + M +E I+E F S+GF Sbjct: 72 KLRDESYQPHPAKRIYIPKKNGKQRPLGIPSFEDKLVQKVIQMILESIYEGSFEKCSHGF 131 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP R+ H A+ ++ RW IEGD+ +FD + H +++ + RI+D RF+ Sbjct: 132 RPHRNCHTAMASIMEGFD-----GTRWFIEGDIKGFFDNIDHDIMITILSERIADERFLR 186 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKD 246 L+ K + AG+++ F G PQGG+ISP+L+NI L++ D+Y+ E ++ GK RK Sbjct: 187 LIRKFLNAGYLEKWKFHKTFSGTPQGGIISPILANIYLDQLDKYVVEYISQFNRGKMRKR 246 Query: 247 RWYWNNSIQR-------------------------------GRSTAVRENWQWKPAVAYC 275 + R + A + + + Y Sbjct: 247 NPEYKRIASRKDKRVKKLKTETDEQKRAALRSEIVELHREMQKHPATLDMDEDFRRMRYV 306 Query: 276 RYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK 335 RYADDF++ + G+K I+ + + L LKL L+ +KT I H +D FLG + + Sbjct: 307 RYADDFLIGIIGSKDDCVNIKADIKRFLCEKLKLELSDEKTLITHGHDHAKFLGFEVTIR 366 Query: 336 RSRYGEM 342 +S Sbjct: 367 KSEKTRK 373 >UniRef50_B9IYU4 Reverse transcriptase n=21 Tax=Bacteria RepID=B9IYU4_BACCQ Length = 607 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. 
Identities = 114/401 (28%), Positives = 183/401 (45%), Gaps = 57/401 (14%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 A + RL R + ++ +A G T G D ++ + Sbjct: 17 KNAVKENYIFTRLYRNLYNKKFFLDAYGNIYHKPGNMTQGTDKETIDGFSMDW---IENI 73 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 L Y+P P+RRVYIPK + K RPLGIP+++D+I+Q + + ++E F S+ Sbjct: 74 ISSLKDESYKPNPSRRVYIPKKDDKQRPLGIPSIKDKIIQEVVKEILVSMYEPIFSKASH 133 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFRP +S H A+ +K+ +W IEGD+ +FD + H +L+ +R+RI D +F Sbjct: 134 GFRPNKSCHSALNDIKMTF-----GGIKWWIEGDIKGFFDNIDHHVLIGILRKRIKDEKF 188 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKAR 244 + L+WK +KAG+++ F G PQGG+ISP+L+NI L+E D ++ + ++ GK R Sbjct: 189 IKLIWKFLKAGYMEDWKFNKTFSGTPQGGIISPVLANIYLHELDAFMEKQIIKFDEGKRR 248 Query: 245 KDRWYWNN----------------------------------SIQRGRSTAVRENWQWKP 270 +D + +R + +AV Sbjct: 249 RDNPVYKKYNTAIWYRKNKLKEKWNTLNDDERKELQSEISTLEKEREKHSAVDNMDASFK 308 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGH 330 + Y RYADDFV+ V G+K + I+EE L SLKL L+ +KT I + FLG+ Sbjct: 309 RLKYVRYADDFVVGVIGSKEDSKRIKEEITEFLHTSLKLELSQEKTLITSNKNLIKFLGY 368 Query: 331 RLIRK------------RSRYGEMRVVSTIPQEKARNFAAS 359 + + R+ + + +P RNF Sbjct: 369 EIGIDGGHDSKTNTNGIKKRHLSGKPMLYLPYNNMRNFLLK 409 >UniRef50_C0YLQ1 RNA-directed DNA polymerase n=40 Tax=Bacteria RepID=C0YLQ1_9FLAO Length = 624 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. 
Identities = 109/365 (29%), Positives = 175/365 (47%), Gaps = 42/365 (11%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 + + +RL +++ E A + S G T GVDG ++ L L Sbjct: 37 NTDYKFERLYKVLFNEEMYFIAYQKIYSKVGNMTAGVDGKTI---DGMSISRIERLIASL 93 Query: 72 LSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 + YQP P++R YIPK NGK RPLGIP+ D++VQ + M +E I+E F S+GFRP Sbjct: 94 RNETYQPNPSKRTYIPKKNGKKRPLGIPSFDDKLVQEVIRMILEAIYEGSFEHTSHGFRP 153 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 RS H A+ +V+ T RW IEGD+ +FD ++H +L+ ++ RI+D RF+ L+ Sbjct: 154 NRSCHTALLSVQQSFTAV-----RWFIEGDIKGFFDNINHEILIGILKERIADDRFIRLI 208 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 K + AG+I+ ++ G PQGG++SP+L+NI L++ D+Y+ + K Sbjct: 209 RKFLNAGYIEDWVYHKTYSGTPQGGIVSPILANIYLDKLDKYVKDYIKDFDKGKRTTATR 268 Query: 252 N----------------------------------SIQRGRSTAVRENWQWKPAVAYCRY 277 +R + A + + Y RY Sbjct: 269 QYRLHEQRRYRLAKKLKCETDETVREQMIKDIKELRQERNKYPAYDKMDGSFRKLKYVRY 328 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS 337 ADDF++ V G+K + I+E+ + L+ LKL L+ +KT + + FLG + + S Sbjct: 329 ADDFLIGVIGSKEDCKKIKEDIKVYLDEKLKLELSDEKTLVTNAKKPAKFLGFDVSVRNS 388 Query: 338 RYGEM 342 + Sbjct: 389 DESKR 393 >UniRef50_Q02718 Reverse transcriptase homologue COI iA grp II protein (Fragment) n=2 Tax=Fungi RepID=Q02718_PODAN Length = 790 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. 
Identities = 129/398 (32%), Positives = 194/398 (48%), Gaps = 46/398 (11%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL 72 + L +LI + L +A R S+ G TP +D + L+ L EL Sbjct: 190 KDKKFVNLYQLICSKDLLIQAYRNVRSNPGGMTPSIDNIT---YDGINDEFLEKLILELK 246 Query: 73 SGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 S ++ +RVYIPK+NGK RPLGIP +D+IVQ AM + +E I+E F +S+GFRP+ Sbjct: 247 SERFKFTSVKRVYIPKANGKTRPLGIPTSKDKIVQEAMKILLELIYEPIFLDVSHGFRPK 306 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 RS H A+ + W++EGD+ +F+ V H++L+K + ++I D RF LLW Sbjct: 307 RSCHTALHQISKW------NGTTWMLEGDIKGFFNEVDHQVLIKILEKKIKDQRFFDLLW 360 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 K +AG+ID G+ GVPQGGVISP+LSNI L+EFD ++ + KD N Sbjct: 361 KLFRAGYIDDGVKYNTYTGVPQGGVISPVLSNIYLHEFDLFVETLIKKYSSEKDFISKVN 420 Query: 253 SI-----------------------------QRGRSTAVRENWQWKPAVAYCRYADDFVL 283 + R + + V Y RYADD+V+ Sbjct: 421 PVIVKYSSKLSRLNDEYQTTKDKEILKEIIKLRAERNKLPSRIRNGIRVRYTRYADDWVI 480 Query: 284 IVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRS----- 337 + G + V I+EEC+ L LKL L+ +KTKI ++ + FLG + RK S Sbjct: 481 GIIGDQELVAKIKEECKAFLRDILKLELSEEKTKITNITEKEVRFLGVDIKRKDSGESKI 540 Query: 338 --RYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 R + R++ + F + ++ K+ +G I Sbjct: 541 IQRQVKGRLIKSRINNNRLYFYVPVRDIINKLEKAGFI 578 >UniRef50_D2M2V6 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M2V6_BACS4 Length = 441 Score = 271 bits (692), Expect = 4e-71, Method: Composition-based stats. 
Identities = 110/371 (29%), Positives = 175/371 (47%), Gaps = 49/371 (13%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 Q+ + + L+ + + EA + ++G+ GVDGV+ + + Sbjct: 7 QKSFRDGVKSTQKRKWYSLMDKVWAMSNMEEAFKEVKRNRGS--AGVDGVSIRTFEHGVE 64 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 +Q+L+ EL Y+P P +RV+IPK++G RPLGIP +RDR+VQ A+ +EPI+E Sbjct: 65 DNVQVLQRELKEKAYRPRPVKRVFIPKTDGTKRPLGIPTVRDRVVQAAVRRIIEPIFEDK 124 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP +S H A+ ++ L D +VI+ DL +YFDT+ L++AVR Sbjct: 125 FLDCSFGFRPNKSAHMALEKIRKDLMD----GYVYVIDADLKAYFDTIPQDKLIQAVREE 180 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 + D + L+ ++AG +D G F +G PQGGVISPLL+NI L+ D+ + Sbjct: 181 VVDGSVIRLIQSFLQAGVMDGGSFHLTEKGTPQGGVISPLLANIYLHPLDELM------- 233 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 K RYADDFV+ K K E + + Sbjct: 234 --------------------------TKRGHRITRYADDFVICCKSQK-GAERVLKSVTR 266 Query: 302 VLEGSLKLRLNMDKTKIP-HVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360 L L L ++ +KTK+ ++ + F+FLGH VS P+ + F + Sbjct: 267 FLNEELGLTVHPEKTKVVNNLEEPFLFLGHEF-------KGGYYVSASPK-ALKKFKEKV 318 Query: 361 TALLWKVRISG 371 + + + Sbjct: 319 KEITRRNQTVN 329 >UniRef50_B0TA92 Reverse transcriptase (RNA-dependent DNA polymerase) n=44 Tax=Bacteria RepID=B0TA92_HELMI Length = 475 Score = 271 bits (692), Expect = 4e-71, Method: Composition-based stats. 
Identities = 106/353 (30%), Positives = 172/353 (48%), Gaps = 47/353 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 L+ + + + EA + +++KG G+DG+ L+ L E ++ ELL G Y+ Sbjct: 55 YDLMEKVVERGNMTEAYKRVMANKG--AAGIDGMGLESLRPYLKEEWSRIKQELLEGTYR 112 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P RRV IPK G R LGIP + DR++Q+A+ + PI++ DF T SYGFRP +S H Sbjct: 113 PQPVRRVEIPKPQGGTRKLGIPTVVDRLIQQALNQILMPIFDPDFSTNSYGFRPGKSAHQ 172 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A++ K + D RWV++ DL+ +FD V+H +LM V R++ D R + L+ + +KA Sbjct: 173 AVKKAKEYIAD----GYRWVVDMDLAQFFDRVNHDILMARVARKVKDKRILKLIREYLKA 228 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G + G+ + EG PQGG +SPLL+NI+L++ D+ L Sbjct: 229 GVMLNGIRVKSEEGTPQGGPLSPLLANIILDDLDKALES--------------------- 267 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 +CRYADD + V+ +A + + E LEG LKL++N +K+ Sbjct: 268 ------------RGHRFCRYADDCNVYVRSRRAG-QRVMEGMAKFLEGRLKLQVNWEKSA 314 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS 370 + + FLG ++ + + + + + R Sbjct: 315 VDRPWNR-KFLGFSFTWHKA------AKIRLAPQTVKRVKEKIRQFTGRNRSI 360 >UniRef50_Q0S063 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Actinomycetales RepID=Q0S063_RHOSR Length = 459 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. 
Identities = 109/379 (28%), Positives = 167/379 (44%), Gaps = 50/379 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q L A DP R L+ +++ + L + G PG+D + ++ Sbjct: 22 LQHALYRAAKVDPGRRFHALMDKVSRRDVLWRGWVAVRRNNG--APGIDRITLEEVEEYG 79 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIW 118 +A L L EL G Y+PLPARRV+IPK + RPL IP++RDRIVQ A + EP++ Sbjct: 80 VARLLDELAVELKEGSYRPLPARRVFIPKPGTVEQRPLSIPSVRDRIVQAAWKLVAEPVF 139 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF S+GFRP R H A++ + + RWV+E D+++ F+ + LM+AV Sbjct: 140 EADFLPCSFGFRPRRGAHDALQVLIDE----SWRGCRWVVETDIANCFEAIPIEKLMQAV 195 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 R+ D F+ LL ++AG ++ G R G PQGGV S LL N+ L+ D+ Sbjct: 196 EERVCDQPFLKLLRVMLRAGVMEEGQVRRPVTGTPQGGVASALLCNVYLHRLDRAWDVDE 255 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 RYADD +++ + + Q EA Sbjct: 256 HG--------------------------------VLVRYADDALVMCRSRR-QAEAALTR 282 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVN---DGFIFLGHRLI-----RKRSRYGEMRVVSTIPQ 350 R +L L L KT+I H+ +G FLG + R + Sbjct: 283 LRELLAD-LGLEPKEAKTRIVHLRVGGEGVDFLGFHHRLVNAPARPGRRPFPFLARWPAD 341 Query: 351 EKARNFAASLTALLWKVRI 369 + R+ + L + R+ Sbjct: 342 KAVRHARERIRELTDRSRL 360 >UniRef50_A5CZJ0 Retron-type reverse transcriptase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5CZJ0_PELTS Length = 428 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. 
Identities = 115/393 (29%), Positives = 185/393 (47%), Gaps = 55/393 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 QRKL A + + R L + + + L A + ++KG PG DG + ++ ++ Sbjct: 15 FQRKLYVKAKQEKTFRFYSLYDKLYREDVLQYAWQQCRANKG--APGADGQSFKDIEEKV 72 Query: 61 --AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 L+ + +EL +G Y+P+P RRVYI K +G RPLGIP ++DRI Q A L ++PI+ Sbjct: 73 GVERFLKEIAEELRNGTYRPMPVRRVYILKPDGSQRPLGIPTIKDRIAQMACLTVIQPIF 132 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF SYGFRP+R+ H AI + + + V + DL+ FD++ HRL+M ++ Sbjct: 133 EADFLDCSYGFRPKRNAHQAIGAITENI----KQGFTAVYDADLTKCFDSIQHRLIMDSL 188 Query: 179 RRRISDARFMTLLWKTIKAGHIDVG---LFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 RI+D + + L+ ++A ++ G R +G PQGGVISPLL+NI+LN D+ H Sbjct: 189 AERITDGKVLRLIKGWLEAPIVEPGGPKQGRKNYQGTPQGGVISPLLANIVLNRLDRLWH 248 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 + + RYADDFV++ + ++ Sbjct: 249 R--------------------------PGGPRERYNARLVRYADDFVVLARFIGEPIKNE 282 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 E S+ L LN KT+I +N G FLG+ + SR R+ + Sbjct: 283 LESIIT----SMGLNLNEKKTRILDLNKGDILNFLGYSIRI--SRDKNRRITIKPSDKAI 336 Query: 354 RNFAASLTALLWKVRI----------SGEILLG 376 + ++ + R+ +L G Sbjct: 337 ARLRDKIREIISRERLYHGLKGIIAEINPVLRG 369 >UniRef50_B9M2H7 RNA-directed DNA polymerase (Reverse transcriptase) n=7 Tax=Bacteria RepID=B9M2H7_GEOSF Length = 446 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. 
Identities = 114/362 (31%), Positives = 169/362 (46%), Gaps = 47/362 (12%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 LL I PE + A + ++KG PGVDGVN +R L +G Y Sbjct: 26 NHLLERILSPENMELAWKRVRANKG--APGVDGVNIDDFPDITRPLWGDIRASLATGSYL 83 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P RV IPK G RPLGIP + DR++Q+++ + PI++ F S+GFRP RS H Sbjct: 84 PKPVLRVEIPKPTGGNRPLGIPTVLDRLIQQSIAQVLTPIFDPGFSESSFGFRPGRSAHD 143 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+R ++ L R ++ DL+ +FDTV+H LLM V R++ D R + L+ + ++A Sbjct: 144 AVRQLREYL----RQGYRIAVDIDLAKFFDTVNHDLLMTFVGRKVRDKRVLALIGRYLRA 199 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G G GVPQGG +SPLL+NI+L+ D+ L Sbjct: 200 GVEVDGRLEKTRMGVPQGGPLSPLLANILLDHLDKELE---------------------- 237 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 K + RYADDFV++VK +A E + R L LKL +N DK+K Sbjct: 238 -----------KRGHKFVRYADDFVILVKSERAG-ERVMGSVRKHLTTKLKLTVNEDKSK 285 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGE---MRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 + +D FLG + + + + + R++ S+ L K+ + Sbjct: 286 VAK-SDQISFLGFVFKGTKILWSDKAYKEFRRRVRKYTGRSWFVSMEYRLNKLSTY---I 341 Query: 375 LG 376 G Sbjct: 342 RG 343 >UniRef50_A6YEC9 Putative reverse transcriptase and intron maturase n=1 Tax=Chlorokybus atmophyticus RepID=A6YEC9_CHLAT Length = 755 Score = 267 bits (682), Expect = 5e-70, Method: Composition-based stats. 
Identities = 105/391 (26%), Positives = 177/391 (45%), Gaps = 45/391 (11%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 +L+ I P+ L A + S G T GV + L V + EL +G Y+ Sbjct: 156 GKLIHAIAHPDLLWLAYELIKSKPGNMTRGV---STETLDGLSRVWIDKTSSELRAGKYR 212 Query: 78 PLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 ARR+ IPK RPL + + R+++VQ+A+ M ++ ++E F S+GFRP R H Sbjct: 213 FGLARRIMIPKVGKPGERPLTMASFREKVVQKAIQMVLQELFEPRFLNTSHGFRPGRGCH 272 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A++ V G+WVIE D++ FDT+ H L+ +RR I+ ++ + L+ +K Sbjct: 273 TALQMVDQHFR-----GGKWVIEADITKCFDTIPHDKLLAVLRRHITCSKTLALIHSGLK 327 Query: 197 AGHIDVGLF-RAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKARKDRWYWNN 252 AG++ +G + G PQG ++SPLL+NI L+ DQ++ ++ GK R+ + Sbjct: 328 AGYVVLGKASQEQMVGTPQGSILSPLLNNIFLHLLDQFMERLSAKHTLGKTRRKNPEYRK 387 Query: 253 SIQRGRSTA-------------------------VRENWQWKPAVAYCRYADDFVLIVKG 287 + +AY RYAD F++ V G Sbjct: 388 LQSELSKNKGDADAMSKLRRRKLPRSGRIWLMQSKDQMDPGFRRLAYVRYADHFLICVTG 447 Query: 288 TKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVST 347 + E+ R L+ L L LN KT I +DG FLG + + +++++ Sbjct: 448 PHQLAVDVMEQVRTFLDKELGLELNQSKTLITKFSDGINFLGGIITNRTVSEKPIKLMAA 507 Query: 348 IPQE-------KARNFAASLTALLWKVRISG 371 P + +F A + L+ ++++ G Sbjct: 508 GPAKGHLVRVSPRLSFHAPIAKLIDRLQLRG 538 >UniRef50_P0A3U1 DNA endonuclease n=8 Tax=Firmicutes RepID=LTRA_LACLM Length = 599 Score = 267 bits (682), Expect = 5e-70, Method: Composition-based stats. 
Identities = 112/396 (28%), Positives = 188/396 (47%), Gaps = 41/396 (10%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 +++ + + RL R + +P+ A + S+KGA T G+ + Sbjct: 10 RISKNSQENIDEVFTRLYRYLLRPDIYYVAYQNLYSNKGASTKGILDDTADGFS---EEK 66 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 ++ + L G Y P P RR+YI K N K+RPLGIP D+++Q A+ + +E I+E F Sbjct: 67 IKKIIQSLKDGTYYPQPVRRMYIAKKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVF 126 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 +S+GFRP+RS H A++T+K + RW +EGD+ FD + H L+ + +I Sbjct: 127 EDVSHGFRPQRSCHTALKTIKREF-----GGARWFVEGDIKGCFDNIDHVTLIGLINLKI 181 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------- 234 D + L++K +KAG+++ + G PQGG++SPLL+NI L+E D+++ Sbjct: 182 KDMKMSQLIYKFLKAGYLENWQYHKTYSGTPQGGILSPLLANIYLHELDKFVLQLKMKFD 241 Query: 235 ----------------------HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAV 272 H K + +R R + Q + Sbjct: 242 RESPERITPEYRELHNEIKRISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVL 301 Query: 273 AYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRL 332 Y RYADDF++ VKG+K + I+E+ + + LK+ L+ +KT I H + FLG+ + Sbjct: 302 KYVRYADDFIISVKGSKEDCQWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDI 361 Query: 333 IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 +RS G ++ + + L L K+R Sbjct: 362 RVRRS--GTIKRSGKVKKRTLNGSVELLIPLQDKIR 395 >UniRef50_B0I1N9 Reverse transcriptase homolog n=7 Tax=cellular organisms RepID=B0I1N9_PYLLI Length = 749 Score = 267 bits (682), Expect = 6e-70, Method: Composition-based stats. 
Identities = 125/390 (32%), Positives = 195/390 (50%), Gaps = 34/390 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + +P+ +R+L L++ + L A S G T GVDG + L Sbjct: 175 IIKINIENPNHLNERILSLVSSYDMLEAAYIKIKSKPGNMTKGVDGKTLDGVNVGW---L 231 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 + L ++ SG Y+PLP+RRV IPK G RPLGIP+ RD+IVQ ++ ++ I+E F Sbjct: 232 KSLSRDVGSGSYKPLPSRRVMIPKPQGGERPLGIPSPRDKIVQESIRTVLQAIYEPSFIA 291 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 S+GFRP RS H A++ KL + W IEGD+ FD++ HR+L + RRI D Sbjct: 292 CSHGFRPGRSCHTALKEAKLTFAN-----TTWFIEGDIEKCFDSIDHRVLSTLLERRIKD 346 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER---YLSG 241 FM L WK +K G++ +G + +G PQG V+SPLLSNI L+E D+++ + + G Sbjct: 347 KGFMDLYWKMVKVGYMSLGKINQSDKGTPQGSVVSPLLSNIYLHELDKWMTRKKESFDKG 406 Query: 242 KARKDRWYWNNSI---------QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQV 292 RK + + +R + + Y RYA DF++ + G+K Sbjct: 407 TRRKANPVYTKYVRVAGGASAARRLNIPSADPLDPNFKRLRYVRYAGDFLIGIIGSKTDG 466 Query: 293 EAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGH--------RLIRKRSRYGEMR 343 + +E + L LKL LN+ KTK+ H ++ FLG K+++ G++ Sbjct: 467 ICLIKELKEFLHDILKLDLNLTKTKLTHTMSEKAYFLGTWVKIIPVSGFQIKKNKSGKIT 526 Query: 344 VVSTIPQEKARNFAASLTALLWKVRISGEI 373 VS+ PQ + L L+ ++ G + Sbjct: 527 RVSSRPQLRV-----PLKLLVDRLERKGFL 551 >UniRef50_A1T776 RNA-directed DNA polymerase n=1 Tax=Mycobacterium vanbaalenii PYR-1 RepID=A1T776_MYCVP Length = 454 Score = 267 bits (682), Expect = 6e-70, Method: Composition-based stats. 
Identities = 116/397 (29%), Positives = 173/397 (43%), Gaps = 61/397 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +QR L A R L + + + L EA ++G GVD V ++ Sbjct: 30 LQRTLWAAAKQSQGRRFHALYDRVYRGDVLWEAWERVRKNRG--AAGVDRVTLVAVEEYG 87 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L+ LR +L G Y P PARRV IPK G RPLGIP +RDR+ Q A + +EPI+E Sbjct: 88 VDRMLRELRHDLREGVYCPAPARRVEIPKPRGGTRPLGIPTVRDRVAQAAAKIVLEPIFE 147 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF + SYGFRP+RS A+ +++ + ++V+E D++++F + H L+ V Sbjct: 148 ADFMSCSYGFRPKRSATQAMERLRVGFIE----GSQFVVEFDIANFFGEIDHDRLLAEVS 203 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RR+SD R + LL ++AG + G+ G PQGGVISPLL+NI L+ D L R + Sbjct: 204 RRVSDRRVLKLLRLWLQAGVMVDGVVSRTVAGTPQGGVISPLLANIYLHVLDTELARRNV 263 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 RYADD V++ + A+ Sbjct: 264 G--------------------------------ELVRYADDGVVLCRSAAQAEHALAAVG 291 Query: 300 RGVLEGSLKLRLNMDKTKIPHVN---DGFIFLGHRLI-------RKRSRYGEMRVVSTIP 349 + SL LRL+ DKTK+ + +G FLG ++ R + Sbjct: 292 E--ILASLGLRLHPDKTKVVDLREGGEGLDFLGCHFRARMSGRLWEQRRIVRYYLHRWPS 349 Query: 350 QEKARNFAASLTALLWKVRIS----------GEILLG 376 Q + + R+ IL G Sbjct: 350 QTAMVRLREKVRERTGRNRVGFDIRDVIAVLNPILRG 386 >UniRef50_C9B0U1 RNA directed DNA polymerase n=2 Tax=Enterococcus casseliflavus RepID=C9B0U1_ENTCA Length = 620 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. 
Identities = 108/367 (29%), Positives = 181/367 (49%), Gaps = 11/367 (2%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 + + L+ ++T P + A R + G+ +PGVD N L+A +E Sbjct: 27 KESSENKIFSNLMEIVTSPNNILLAFRNVKGNSGSTSPGVDKKNIDDLKAIPNIEFIKTV 86 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 S Y+P P +RV IPK NGK RPLGIP + DRI+Q+ +L +EPI E+ FH +YG Sbjct: 87 QTKFSE-YKPQPVKRVDIPKPNGKTRPLGIPTIWDRIIQQCLLQVLEPIMEAKFHDKNYG 145 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI-SDARF 187 FRP RS HHA + ++ +V++ D+ +FD V+H L+K D Sbjct: 146 FRPNRSAHHAFAQA---VRMAQLSKLTFVVDIDIEGFFDNVNHSKLIKQFWTLGVRDKWL 202 Query: 188 MTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + ++ +KA I G +G PQGGV+SPLL+N++LNE D ++ ++ + RK+ Sbjct: 203 LGVIRAMLKAPIIHRDGRIEHPKKGTPQGGVLSPLLANVVLNELDWWISSQWETHPTRKN 262 Query: 247 RWYWNNS-IQRGRSTAVRENWQWKPA-VAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 ++ + QR +S R K + RYADDF + + ++ + I + L+ Sbjct: 263 YDCYHQTRKQRIKSNKYRALRASKLKEIYIVRYADDFKIFCR-KRSDADKIFLATKLWLK 321 Query: 305 GSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTAL 363 LKL ++ +K+K+ ++ FLG + R R + + + + + L Sbjct: 322 DRLKLDISAEKSKVVNLKKQKSDFLGFTMKLVRKRKS-FVIETHMCAKAMSAASNKLAKQ 380 Query: 364 LWKVRIS 370 + ++ S Sbjct: 381 IKVIQHS 387 >UniRef50_C7RV41 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RV41_9PROT Length = 453 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. 
Identities = 107/362 (29%), Positives = 172/362 (47%), Gaps = 47/362 (12%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 + L+ + P L +A R S++G PG+DG+ A +R L G YQ Sbjct: 36 KDLMEAVLSPANLKQAWRRVKSNRG--APGIDGLRIEDFPAYACEHWPAIRQTLSEGRYQ 93 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P RRV IPK NG R LGIP + DR+VQ+A+ M PI++ +F SYGFRP RS H Sbjct: 94 PQAVRRVIIPKPNGGERALGIPTVVDRVVQQAIAQIMTPIFDPEFSESSYGFRPRRSAHG 153 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A++ V+ L + R ++ DL+ +FD V H +LM V R++SD R + L+ + ++A Sbjct: 154 ALKQVRADL----KAGYRIAVDLDLAKFFDNVDHDILMARVARKVSDKRLLALIGRYLRA 209 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G + + + G PQGG +SPLL+NI+L++ D+ L Sbjct: 210 GVMIGSTLQPSELGTPQGGPLSPLLANILLDDLDRTLE---------------------- 247 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD +++VK +A + ++ L LKL +N K++ Sbjct: 248 -----------GRGHRFARYADDLMVLVKSERAG-QRVKASLTAYLGRQLKLPVNEKKSQ 295 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMR---VVSTIPQEKARNFAASLTALLWKVRISGEIL 374 + + FLG + + R+ + + + R++ S+ K+ G+ L Sbjct: 296 VAKIEQCV-FLGFTFRKNKLRWSDAAFADFKHRLRELTGRSWGVSMPHRFEKL---GQYL 351 Query: 375 LG 376 G Sbjct: 352 RG 353 >UniRef50_Q3ESS6 Reverse transcriptase / RNA maturase / Endonuclease n=13 Tax=Firmicutes RepID=Q3ESS6_BACTI Length = 607 Score = 264 bits (675), Expect = 3e-69, Method: Composition-based stats. 
Identities = 121/407 (29%), Positives = 185/407 (45%), Gaps = 58/407 (14%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 + + +RL R + PE+ A + GA T GVD + + EL Sbjct: 12 QKQSQKEDYKFRRLYRNLYNPEFYFTAYDNLSKNDGALTMGVDKRSIDGFSIEIIEELIE 71 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L YQP P++RVYIPK NGK RPLGIP+ D++VQ + M +E I+E F S Sbjct: 72 T---LKQRTYQPFPSKRVYIPKKNGKKRPLGIPSFADKLVQEVVRMILEAIYEPTFSISS 128 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 + ++ +S H A++ ++ T +W IEGD+ +FD ++H L+ +R+RI D Sbjct: 129 HAYQKGKSCHTALQEIQRTF-----TGSKWFIEGDIKGFFDNINHHTLIGILRKRIEDEA 183 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKA 243 F+ L+WK ++AG+++ F G PQGG+ISPLLSNI LNE D+Y+ +++ GK Sbjct: 184 FIELIWKFLRAGYMEEWKFHNTFSGAPQGGIISPLLSNIYLNELDKYMMDFIQKFNQGKK 243 Query: 244 RKDRWYWNNSIQRGRSTAVR----------------------------------ENWQWK 269 RK + + R + Sbjct: 244 RKINPDYERKYTQMRKAIRKYKIALENEQMGEAEQHLEQAKALKKELSSIPYSNPMDSNY 303 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFL 328 + Y RYADDF++ V G+K I+E L +L+L L+ +KT I H ++ FL Sbjct: 304 KRLTYVRYADDFLIGVIGSKYDARNIKETLTEYLMETLQLELSQEKTLITHASENHAGFL 363 Query: 329 GHRLIRKRS------------RYGEMRVVSTIPQEKARNFAASLTAL 363 G+ + R RY +V +P E N A+ Sbjct: 364 GYNIRVFRGSEPRKDAIGRVCRYLNGKVQLKMPHEAWVNKLKKYQAI 410 >UniRef50_C6J6N9 RNA-directed DNA polymerase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J6N9_9BACL Length = 430 Score = 264 bits (675), Expect = 4e-69, Method: Composition-based stats. 
Identities = 121/370 (32%), Positives = 178/370 (48%), Gaps = 42/370 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 +Q KL A + R L + + + L EA R +++G+ GVDG ++ + Sbjct: 16 LQGKLGHAAKENKKRRFHALYDKVYRVDILWEAWRRVRANEGS--AGVDGETLADIEKQG 73 Query: 61 A-VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + + L G Y P P RR YIPK +GKLRPLGIP +RDR++Q A + MEPI+E Sbjct: 74 EMRFVLECQRLLKEGKYHPQPVRRHYIPKKDGKLRPLGIPTVRDRVIQMATKLVMEPIFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF S+GFRP+RS A+ ++ +G WV++ D+ YFD ++ LMK + Sbjct: 134 ADFQDTSFGFRPKRSAKQALERIRKACNR----KGNWVVDVDIQGYFDNINQEKLMKLIE 189 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RISD R + L+ K + AG ++ G R + G PQGGVISPLL+NI LN FD Sbjct: 190 MRISDRRILKLVRKWLGAGVMEEGNIRRSDLGTPQGGVISPLLANIYLNYFDLLWERHGG 249 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 RYADD V+I K TK + E Sbjct: 250 KLG------------------------------ELTRYADDLVIICK-TKKDAQRAYELI 278 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 R ++E L+L L+ KT+I + +GF FLG + ++ + +V T Q R Sbjct: 279 RAIME-RLELTLHPTKTRIVGLWTGEEGFDFLGMHHRKTKAETSKGQVYYTTQQWLCRKA 337 Query: 357 AASLTALLWK 366 + + + Sbjct: 338 EERIREGVKE 347 >UniRef50_C6CA58 RNA-directed DNA polymerase n=3 Tax=Enterobacteriaceae RepID=C6CA58_DICDC Length = 626 Score = 263 bits (673), Expect = 6e-69, Method: Composition-based stats. 
Identities = 117/412 (28%), Positives = 197/412 (47%), Gaps = 59/412 (14%) Query: 2 QRKLATWAATDP----SLRIQRLLRLITQ-PEWLAEAARITLSSKGAHTPGVDGVNKTML 56 QR L + + +I++L +++ + A+A S++GA T G++ + Sbjct: 4 QRTLNALSGINKASTQGYKIKKLHKIMCSNKDLWAQAYANIYSNQGAMTRGINNNTMDEM 63 Query: 57 QARLAVELQILRDELLSGHYQPLPARRVYIPK----SNGKLRPLGIPALRDRIVQRAMLM 112 + L L + S Y+P P RR +IPK NGK RPLGIP D+++Q M M Sbjct: 64 SVDRIINLIQLIN---SDSYKPKPCRRTHIPKDARKPNGKKRPLGIPTGDDKLIQEVMRM 120 Query: 113 AMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHR 172 +E I+E F SYGFRP+RS H A++ ++ G +WV + D+ YFD + H Sbjct: 121 LLEEIYEPVFSDWSYGFRPKRSCHSALKEIRN-----GWKGTKWVCDVDIKGYFDNIDHD 175 Query: 173 LLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ 232 LL+K + +RI+D +F+ LL K +K G++D + G PQGG+ISP+L+N+ L+E D+ Sbjct: 176 LLLKFLSKRIADNKFLALLKKFLKTGYLDNWRYFGTHSGTPQGGIISPILANVFLHELDE 235 Query: 233 YLHER---YLSGKARKDRWYWNNSIQRGRS------------------------------ 259 ++ R + +G RK + ++Q + Sbjct: 236 FMKNRISEFGTGGRRKPNPIYKRALQNRANRIKWIRQGFGASGMPADEQKIQKWKYEADE 295 Query: 260 --------TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 ++V + + Y RYADDF++ V G+K++ + I +E +E L L + Sbjct: 296 LEKQLRTLSSVIMDDTEFKRMRYVRYADDFLIGVIGSKSEAKKIMKEVVDFVENELHLEI 355 Query: 312 NMDKTKIPHVNDGFIFLGHRLIRKR-SRYGEMRVVSTIPQEKARNFAASLTA 362 + +K+ I GF FLG+ + +R S+ + V K ++T Sbjct: 356 SKEKSGIIDPKKGFTFLGYEIKTRRESKRVKCVVGLNTDGSKTHAVKRTITE 407 >UniRef50_A9AUN7 RNA-directed DNA polymerase n=4 Tax=Bacteria RepID=A9AUN7_HERA2 Length = 595 Score = 263 bits (673), Expect = 6e-69, Method: Composition-based stats. 
Identities = 108/354 (30%), Positives = 173/354 (48%), Gaps = 35/354 (9%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 R+ R++ + A + +GA T G + ++ + +L Y+ Sbjct: 23 RVYRILFNEDLYLRAYGRLATKQGALTKGS---TDETIDGMSMAKIHRIIADLRRETYRW 79 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRVYIPK+ GK RPLG+P D++VQ + ++ ++ S+GFRP R H A Sbjct: 80 TPVRRVYIPKATGKTRPLGVPTWSDKLVQEVLRSILDAYYDPQMSDHSHGFRPNRGCHTA 139 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 ++ ++ T RW IEGD++ YFDT++H L+ + +RI D RF+ L+ ++AG Sbjct: 140 LKAIQRC-----WTGTRWFIEGDIAQYFDTINHTTLLTILAKRIHDGRFLRLIQTLLQAG 194 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE----RYLSGKARKDRWYWNNSI 254 ++ ++ G PQGGVISPLL+NI L+EFDQ++ Y G+ RK + Sbjct: 195 YLHDWVYHPTLSGTPQGGVISPLLANIYLHEFDQFVEHTLIPAYTKGQRRKVNPAYAQME 254 Query: 255 QRGRSTAVRE--------------------NWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 QR + + + Y RYADDF+L GTK + EA Sbjct: 255 QRISKLRRQREYASVTPLLKELRTLPSRDVHDPDYRRLRYVRYADDFLLGFAGTKVEAEA 314 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLI--RKRSRYGEMRVV 345 I+++ L L+L+L+ KT I H +D FLG+ ++ + S+ R + Sbjct: 315 IKQQINVWLYDHLQLKLSTQKTLITHASSDPAHFLGYDIVTQQANSKQTGNRRI 368 >UniRef50_C3FAV3 RNA-directed DNA polymerase n=11 Tax=Bacteria RepID=C3FAV3_BACTU Length = 652 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. 
Identities = 117/415 (28%), Positives = 189/415 (45%), Gaps = 61/415 (14%) Query: 12 DPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL 71 + + +RL R + PE+ + + ++ G T G + + + + Sbjct: 38 KENYKFKRLYRNLYNPEFYYKGYQEIYANPGNMTRGTINNTVDGFSKN---RVSKIINNI 94 Query: 72 LSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 +G+Y+P P +RVYI K K RPLG+P D++VQ + +E I+E +F S+GFR Sbjct: 95 KNGNYKPTPVKRVYIDKKGSKKKRPLGVPTFDDKLVQLVIKYILEAIYEPNFSENSHGFR 154 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 R H A++ +K +W IEGD+ +FD + H +L+ +R+RI+D + L Sbjct: 155 KNRGCHTALKQIKK-----SGNGTKWFIEGDIQGFFDNIDHHILINLLRKRINDETLIGL 209 Query: 191 LWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG--------K 242 +WK ++AG+++ F G PQGG++SPLL+NI LNE D Y+ + + Sbjct: 210 IWKFLRAGYMEDWQFHKTFSGTPQGGILSPLLANIYLNELDIYMGKYAKKFGKGQPKDRE 269 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWK-------------------------------PA 271 K Y + I+RGR A Q K + Sbjct: 270 VDKRYQYLHLKIKRGRKKADLLREQGKHNEAQELIEQVNEWVKERGQRPYYNPMSDKFKS 329 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHR 331 + Y RYADDF++++ G+K +AI+ + L LKL L+ +KT I H + FLG+ Sbjct: 330 LKYVRYADDFIVMIIGSKDDAKAIKSDIAQFLNEELKLTLSEEKTLITHSSKKATFLGYN 389 Query: 332 LIRKRS-------------RYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 + R+ R+ ++V IP E RN +L L K E Sbjct: 390 VNITRNELFTKYSVKGVKRRHHNLKVRLEIPHEAWRNKLLALNVLEMKYVNGKET 444 >UniRef50_B7C9E4 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C9E4_9FIRM Length = 623 Score = 263 bits (671), Expect = 9e-69, Method: Composition-based stats. 
Identities = 108/381 (28%), Positives = 174/381 (45%), Gaps = 18/381 (4%) Query: 1 MQRK---LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 MQ+ L + + L+ +I+ P + A R + G+HT G DG L Sbjct: 19 MQKTFDSLYADSKSGE--VFGHLMDIISAPSNIKLAFRNIKGNDGSHTAGTDGRTIESLA 76 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 + L + Y+P +RV IPK NGK+RPLGIP + DRIVQ+ +L MEPI Sbjct: 77 VMPEDKFVKLIQK-QFRRYEPKAVKRVEIPKPNGKMRPLGIPCIIDRIVQQCILQVMEPI 135 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E+ F+ SYGFRP RS +AI + +V++ D+ +FD V HR L+K Sbjct: 136 CEAKFYEHSYGFRPCRSAENAISYAYGL---AQRNKLHYVVDVDVKGFFDNVDHRKLLKQ 192 Query: 178 VRRR-ISDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 + I D + + ++ +KA + G S+G PQGG++SPLL+NI+LNE D ++ Sbjct: 193 IWTLGIRDTKLIQIIKAMLKAPIEMPDGENVLPSKGTPQGGILSPLLANIVLNELDWWIA 252 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPA----VAYCRYADDFVLIVKGTKAQ 291 ++ G K + + RYADDF + + TK Sbjct: 253 SQWDEMVRHMKHPCKVTYYPNGAEKKCNSYTALKKSNLKEMRIVRYADDFKIFCR-TKED 311 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQ 350 E + L LKL ++ +K+K+ ++ FLG R+ +R + + S + Sbjct: 312 AEKTYYAVKDWLWKRLKLEVSDEKSKVTNLRKRDSEFLGFRIKLRR-KSNSWVITSNVCD 370 Query: 351 EKARNFAASLTALLWKVRISG 371 + + + + ++ S Sbjct: 371 KAIHRISKEMADSVKAIQSSK 391 >UniRef50_B4WUH1 Group II intron, maturase-specific domain family n=2 Tax=Cyanobacteria RepID=B4WUH1_9SYNE Length = 621 Score = 263 bits (671), Expect = 1e-68, Method: Composition-based stats. 
Identities = 121/388 (31%), Positives = 182/388 (46%), Gaps = 41/388 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 ++R++ +++ L++L+ + L +T ++G T G+DG + Sbjct: 66 LRRRIYRATQNGQWNQVRSLMKLMLRSYSNLLLSVRHVTQENQGRQTAGLDGQTALTAEK 125 Query: 59 RLAVELQILRDELLSGH-YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPI 117 R L + L +Q LP +RVYIPK+NGKLRPLGIPAL +R+ Q M A+EP Sbjct: 126 R-----VQLVNRLQDHSLWQVLPTKRVYIPKANGKLRPLGIPALENRVAQTIMKNALEPH 180 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS H AI L+L WV++ DL FD + H ++ Sbjct: 181 WEARFEGHSYGFRPGRSCHDAIEQCFLRLR---HGCDTWVLDADLKGAFDNLSHSFILDT 237 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + L+ + +KAG+++ +F A +G PQGG ISPLL NI LN ++ L Sbjct: 238 IGLVPG----RELIKQWLKAGYVEAEMFHATPKGAPQGGSISPLLLNIALNGMEKLLLSF 293 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 ++ + + P YCRYADDFV+ K TKA +EA+ Sbjct: 294 T----------TTRTYQPSSKAKSQSSYKRTSPTYGYCRYADDFVVTAK-TKADIEAVVP 342 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 + L+ L LNM+KT+I +V GF FLG + + + + +EK F Sbjct: 343 ILQAWLKPR-GLTLNMEKTQIVNVQQGFPFLGFSIRHHK-----GKCLCKPQKEKILAFL 396 Query: 358 ASLTALLWK---------VRISGEILLG 376 + + L + IL G Sbjct: 397 KRIRSWLKHNVSISPAAVIHHLNPILRG 424 >UniRef50_C1BDP2 Putative RNA-directed DNA polymerase n=2 Tax=Rhodococcus opacus B4 RepID=C1BDP2_RHOOB Length = 605 Score = 263 bits (671), Expect = 1e-68, Method: Composition-based stats. 
Identities = 110/388 (28%), Positives = 174/388 (44%), Gaps = 40/388 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 ++R++ ++ L +L+ L ++T + G T G+DG +A Sbjct: 37 LRRRIFKATREQDWAAVRSLQKLMLGSWSNTLVSVRQVTQRNAGRRTAGIDGETALSPEA 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R + ++ E S ++P+P RRVYIPK+ GK RPLGIP + DR Q + A+EP W Sbjct: 97 RANMAVR--VHESRS-SWEPVPVRRVYIPKAGGKRRPLGIPVVMDRCHQARVRTALEPEW 153 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP RS AI + L + + W+++ DLS+ FD + H L+ + Sbjct: 154 EARFEARSYGFRPGRSCADAIGALYSTL-NGSRAKRVWILDADLSAAFDRIDHPRLLDTL 212 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 + L+ K + AG I+ G F ++ EG PQGGVISPLL N+ L+ ++ RY Sbjct: 213 GSFPA----RELIGKWLTAGVIENGRFASSEEGTPQGGVISPLLLNVALHGLEEAAGVRY 268 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 ++ + RYADD V + Q E +++ Sbjct: 269 ------------------LKAADTLDARSVPGTPVLVRYADDLVACCHS-RQQAELVKDR 309 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 G L L N DKT I H+ +GF FLG+ + R R +++ + + Sbjct: 310 LAGWLAPR-GLAFNEDKTHIVHLEEEGFDFLGYNIRRYRRGTRPAKLLIKPNSDAVKRIH 368 Query: 358 ASLTALLWKVR---------ISGEILLG 376 L + ++R I+ G Sbjct: 369 RRLANEMRRMRGSNALAVIARLNPIIRG 396 >UniRef50_C5CID2 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CID2_KOSOT Length = 423 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. 
Identities = 113/362 (31%), Positives = 178/362 (49%), Gaps = 40/362 (11%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ + LA+A + G PG+DGV L ++ L ++L G Sbjct: 2 RKYYSLIDKVYLESNLAKAYHKVRRNNG--APGIDGVTVQEYGENLLERIKKLSEKLRKG 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P +RV IPK NGK R LGIP + DRIVQ+++ MEPI+E FH SYG+R R+ Sbjct: 60 EYRPSPVKRVEIPKGNGKTRMLGIPTVEDRIVQQSLKEIMEPIFEEGFHPSSYGYRKGRN 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + + ++V++ DLS FDT+ H ++ AV RISD + + L+ Sbjct: 120 PHQAVEKAYAF---ACKYKMKYVVQLDLSQCFDTLDHEKMIDAVAERISDGKILRLIRSF 176 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +K+G I ++ + G PQGGVISPLL+NI LN+FDQ + Sbjct: 177 LKSGVITD-QYQPSEMGSPQGGVISPLLANIYLNKFDQKMMA------------------ 217 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 + RYADD ++ K K+ + ++ R +LE LKL++N + Sbjct: 218 ---------------RGIRIVRYADDILIFAKSYKSAEKYLKIAIR-ILEKELKLKVNKE 261 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 KT+I ++DG FLG + + + R E ++ + K T + ++ ++L Sbjct: 262 KTRITTIDDGIEFLGFTIQKGKIRIQEKKIKRFKAKVKTLTRRNQCTPIGEIIKRLNQLL 321 Query: 375 LG 376 G Sbjct: 322 RG 323 >UniRef50_O47500 RT-like protein n=1 Tax=Venturia inaequalis RepID=O47500_VENIN Length = 760 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. 
Identities = 120/401 (29%), Positives = 194/401 (48%), Gaps = 36/401 (8%) Query: 3 RKLATWAATDPSLRIQR-LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 KL+ + ++P+ I R L L T + L A S G T GV L Sbjct: 245 NKLSIRSKSNPNSIIDRELYTLATSVDTLIYAYENIKSKPGNMTQGV---LPETLDGISR 301 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 +L L D L S + P+RR+ IPK++G RPL I + D+IVQ AM + +E I++ Sbjct: 302 EKLTKLSDSLRSEKFSFSPSRRIQIPKASGGSRPLSIASPMDKIVQEAMRLVLEAIYDPV 361 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F S+GFRP +S H A+++V + +WVIEGDL+ +FD++ H LMK V + Sbjct: 362 FLDCSHGFRPNKSCHTALKSVSQEF-----QPVQWVIEGDLAKFFDSISHSKLMKLVESK 416 Query: 182 ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFD---------- 231 I+D RF L+WK + AG+ + ++++ G PQG ++SP+L+NI L++ D Sbjct: 417 ITDRRFTNLIWKALTAGYFEFKIYKSNIVGTPQGSIVSPILANIFLHQLDLFVNCLKRDF 476 Query: 232 ------------QYLHERYLSGKARKDRWYWNNSI-QRGRSTAVRENWQWKPAVAYCRYA 278 +Y L + D I +R ++ ++ + + Y RYA Sbjct: 477 DKGTRAPRSKSSRYYEYHTLKARKAGDTLQLQKLIAERSQNPSIDFGSESFKRLVYVRYA 536 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIR-KR 336 DD+++ ++GT+ Q + I + R S+ L L+ KTK+ + + +FLG + R Sbjct: 537 DDWIIGIRGTREQAKYILTKVREFCT-SIDLELSEHKTKLTSLHSQPILFLGTSISRSSH 595 Query: 337 SRYGEMRVVSTIPQEKAR-NFAASLTALLWKVRISGEILLG 376 RY + V I + K A L + K+ S + G Sbjct: 596 VRYSRIGSVRRIRRNKLGLRLEAPLDRIKKKLENSNFMSKG 636 >UniRef50_C5RJ16 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=C5RJ16_CLOCL Length = 626 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. 
Identities = 115/379 (30%), Positives = 185/379 (48%), Gaps = 26/379 (6%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 MQ KL + L+ + + A S+ G+ TPG D Sbjct: 22 MQSKL--------NKTFTGLMEVAFNEVTIITAVHNIKSNSGSKTPGTDRNTIDKYLQMS 73 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 E+ L + + +Y+P PARR YIPKSNGK RPLGIP + DRI+ + + +EPI E+ Sbjct: 74 KEEVISLVKK-SASNYKPKPARREYIPKSNGKKRPLGIPTVIDRIILECIRIVIEPICEA 132 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 F+ SYGFRP R+ HAI ++ ++ + +VIEGD+ SYFD ++H++L+ + + Sbjct: 133 KFYPHSYGFRPYRACSHAIASIVHVISSTSKDIPHYVIEGDIKSYFDNINHKVLINKLWK 192 Query: 181 RI-SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 D R + L+ +KAG+I+ LF G PQGG+ISPLL+N+ LN FD + Y Sbjct: 193 MGVHDKRMLCLIKLMLKAGYIERDLFYLTEAGTPQGGIISPLLANVYLNSFDWMIGRMYQ 252 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 K + + ++ ++ R T ++ RYADD+V++ + + E + Sbjct: 253 EPKGIETKNDRSHCREKLRRTGIKPK-------YLVRYADDWVILTTS-RQEAERLLHYI 304 Query: 300 RGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRL-----IRKRSRYGEMRVVST--IPQE 351 R + LKL L+ +KT I + + FLG + + S+ +VV E Sbjct: 305 RRYFKHKLKLELSEEKTVITDIKCEKVKFLGFDVLAELPRKTPSKPNPKKVVGKAIPNTE 364 Query: 352 KARNFAASLTALLWKVRIS 370 K + + + K++ Sbjct: 365 KVKQQVKDICREIKKLKAI 383 >UniRef50_B1N1A3 NicA n=1 Tax=Pseudomonas putida RepID=B1N1A3_PSEPU Length = 618 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. 
Identities = 108/394 (27%), Positives = 174/394 (44%), Gaps = 50/394 (12%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 A +DPS R+ RL+ + + A S G TPG DG R ++ Sbjct: 15 RKANSDPSYVNDRIYRLMYKEDLYIAAYEKIKSKPGNMTPGQDGTTLDEFSIRT---IRN 71 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 + +++ + ARRV IPK+NGK RPL + D++VQ + +E I+E F S Sbjct: 72 IINKMKDESFTFRGARRVLIPKANGKTRPLSVAPPTDKVVQEVIRSILEAIYEPTFSKNS 131 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFR +S H A++ V+ + WVIEGD+ FD + H L+ +R RI D R Sbjct: 132 HGFRAGKSCHTALKQVRE-----SWSGVTWVIEGDIKGCFDNISHSKLIDQLRLRIKDER 186 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ----YLHERYLSGK 242 F+ L+ K + AG+ + G F +A+ G PQG +ISP+L+N+ L++ D+ + + + + Sbjct: 187 FINLIRKALNAGYFENGAFFSATLGTPQGSIISPILANVFLDQLDRKVEQLIKDHHQGEE 246 Query: 243 ARKDRWYWNNSIQRGRSTAVRE-------------------------------NWQWKPA 271 K +QR +++ ++ Sbjct: 247 GDKITDPAYRKLQRQKTSLRKKAEKQEGAERDATLSLAREANSKLLSMSPYLTRNNGFIR 306 Query: 272 VAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGH 330 V Y RYADD+++ V G K E +R LE + L L+++KT I H + FLG Sbjct: 307 VKYVRYADDWIIGVNGPKLLAEELRSVVGEFLENA-GLELSIEKTHIRHAKSETAKFLGT 365 Query: 331 RLIRKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 L M+V+ + F + Sbjct: 366 NLRIGSENSKIMKVLR-----NGKKFPKRVAGWT 394 >UniRef50_Q74P60 Group II intron reverse transcriptase/maturase n=13 Tax=Bacilli RepID=Q74P60_BACC1 Length = 627 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. 
Identities = 104/370 (28%), Positives = 186/370 (50%), Gaps = 12/370 (3%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 A + ++L+++IT + A R + G+ T G+DGV ++ + + Sbjct: 31 AKSLNGGNFKQLMKVITSESNILLAFRNIKRNSGSITEGIDGVTIKDVEKLSQEDFIKIV 90 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 + S +Y P RRV IPK NGK RPLGIP++ DRI Q+ + +EPI E+ F+ S+G Sbjct: 91 QKRFS-NYTPRKVRRVEIPKPNGKTRPLGIPSMWDRIAQQCIKQVLEPICEAKFNKHSHG 149 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR-RRISDARF 187 FRP RS A+ L++ + ++V+ D+ +FD V+H+ LM+ + I D + Sbjct: 150 FRPNRSPETAMADATLRV---NRSHMQYVVNVDIQGFFDEVNHKKLMRQLWTMGIRDKQL 206 Query: 188 MTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + ++ K +KA + G + ++G PQGG++SPLL+NI LNEFD ++ ++ ++ Sbjct: 207 LVIIRKMLKAPIVLPNGEMQYPNKGTPQGGILSPLLANINLNEFDWWITNQWEDRLLKEL 266 Query: 247 RWYWNNSIQRGRSTAVRENWQWK--PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 + + + + RYADDF + TK+ + I + C L+ Sbjct: 267 SLTIKKGGHVDKYPHYSKMRKTTALKEMYIVRYADDFKIFT-ATKSNAQKIFKACEMWLQ 325 Query: 305 GSLKLRLNMDKTKIPHV-NDGFIFLGHRLI--RKRSRYGEMRVVSTIPQEKARNFAASLT 361 LKL ++ +K+KI ++ + FLG + +K S+ +S ++K + Sbjct: 326 ERLKLPISKEKSKITNLRKESSEFLGFEIKMVKKGSKLIARTHISNKTKKKIQKQFEDQI 385 Query: 362 ALLWKVRISG 371 A++ + + G Sbjct: 386 AVIQRSKNEG 395 >UniRef50_B4D379 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D379_9BACT Length = 441 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 182/385 (47%), Gaps = 52/385 (13%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 QR++A A + ++ L + EW+ EA KG+ PGVDG + L Sbjct: 13 QRRIAELAEYHGAEGLRTLGHHL-DLEWMREAYGRVR--KGS-APGVDGKSVADYGRELD 68 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKL-RPLGIPALRDRIVQRAMLMAMEPIWES 120 L L D SG YQ P RRV+IPK+NGK RP+G+P + D+I+QRA++M +EP++E Sbjct: 69 KNLGGLIDRAKSGSYQAPPVRRVHIPKANGKETRPIGMPTVEDKILQRAVVMLLEPMYER 128 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 +F SYGFRP RS H A++ + T+ WV++ D+ ++FDT+ H +LM +++ Sbjct: 129 EFGDFSYGFRPGRSAHQALKAI---WQGINRTQAGWVVDVDIRAFFDTLDHGVLMGILQK 185 Query: 181 RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYL 239 R+ D + L+ K +KAG ++ G G PQGGVISPLLSNI L+E D++ + Sbjct: 186 RVKDGVILKLVAKWLKAGVMEAGALSYPEAGTPQGGVISPLLSNIYLHEVLDEWFEAAVI 245 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + RYADDFV+ + + E + + Sbjct: 246 P--------------------------RLQGRGFMVRYADDFVMGFEC-REDAERVMKAL 278 Query: 300 RGVLEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRV 344 G G L+L+ KT++ + F FLG +SR G + Sbjct: 279 PGRF-GRYGLKLHEGKTRLVRFGKPEDGSGGGGGSGKPETFDFLGFTHHWAKSRKGRWYI 337 Query: 345 VSTIPQEKARNFAASLTALLWKVRI 369 +++ R ++ + + Sbjct: 338 QRKTARKRLRRALKTIHQWCRQNQH 362 >UniRef50_Q3A4Z2 Group II intron-encoding maturase n=98 Tax=Bacteria RepID=Q3A4Z2_PELCD Length = 529 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. 
Identities = 100/347 (28%), Positives = 165/347 (47%), Gaps = 47/347 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 RL+ + + A + + +KG G+DG+ L+ L + +++ELL+G YQ Sbjct: 109 TRLMEEVVSRGNMMAAYQRVVRNKG--AAGIDGMPVGDLKTYLQEQWPRIKEELLTGTYQ 166 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P R+V IPK G +R LGIP + DR++Q+A+ + ++E +F SYGFRP RS H Sbjct: 167 PQPVRKVEIPKPGGGMRMLGIPTVLDRLIQQALHQELMRLFEPEFSEHSYGFRPGRSAHQ 226 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+++ + + W ++ DL +FD + H +LM V R++ D R + L+ + + Sbjct: 227 AVQSARRHVASGRR----WAVDIDLEKFFDRMGHDILMSRVARKVKDRRVLGLIRRYLTV 282 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G ++ G+ +G PQGG +SPLLSNI+L+EFD+ L Sbjct: 283 GVLEGGIISPRVQGTPQGGPLSPLLSNILLDEFDKELER--------------------- 321 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 A+CRYADD + V +A E + LE LKL++N K+ Sbjct: 322 ------------RGHAFCRYADDCNIYVHSRRA-AERVMTSLTRFLEQQLKLKVNRVKSA 368 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 + + FLG+ + + + + + F SL + Sbjct: 369 VGRPWER-TFLGYSMTSHK------KPRLKVAGSSVKRFKTSLREIF 408 >UniRef50_B4CYA7 RNA-directed DNA polymerase (Reverse transcriptase) n=6 Tax=Bacteria RepID=B4CYA7_9BACT Length = 482 Score = 261 bits (666), Expect = 4e-68, Method: Composition-based stats. 
Identities = 115/398 (28%), Positives = 177/398 (44%), Gaps = 58/398 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q L A T+PS R L + + ++L EA R + G+ GVDG ++A Sbjct: 17 LQSSLQAKAKTEPSYRFYSLWDKVCRGDFLVEAYRRCRRNGGS--AGVDGETFEQIEAAG 74 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L L L++EL + Y+ P RV+IPKSNG RPLGIP +RDR+VQ A +M + PI+E Sbjct: 75 LDAWLGKLQEELRTKQYRTQPLLRVWIPKSNGGQRPLGIPTVRDRVVQMATVMVLGPIFE 134 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +D GFRP R A+R V Q+ G V++ DLS YF+TV H LMK++ Sbjct: 135 TDLCEEQMGFRPGRDAKTAVRLVYYQVRQKGRQE---VVDADLSDYFNTVPHGALMKSLS 191 Query: 180 RRISDARFMTLLWKTIKAGHID---VGL------FRAASEGVPQGGVISPLLSNIMLNEF 230 RRI+D + ++++ + ++A + G + A G PQGGVISPLL+N+ F Sbjct: 192 RRIADGQVLSVIARWLEAPVEECTPDGRLVRSTPAKDAGRGTPQGGVISPLLANVYFRRF 251 Query: 231 DQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK-GTK 289 + ++ YADDFV+ + G Sbjct: 252 VLAWK---------------------------QLGYEQAFDSVIVNYADDFVICCRPGNG 284 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTI 348 G + L +N KTK+ V + F FLG+ + + G+ + + Sbjct: 285 NDARKAMTRV----MGKIGLTVNEQKTKVVRVPGESFDFLGYTIGGFYGQGGKPYIGTRP 340 Query: 349 PQEKARNFAASLTALLW----------KVRISGEILLG 376 ++ + +V++ E L G Sbjct: 341 SKKAILRIMGEIHEQTSSQWNASEPETRVKVLNEKLRG 378 >UniRef50_B2AJV8 RNA-directed DNA polymerase, retrotranscriptase n=45 Tax=root RepID=B2AJV8_CUPTR Length = 607 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. 
Identities = 117/385 (30%), Positives = 170/385 (44%), Gaps = 52/385 (13%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 A+ ++ L+ IT P L ++ + GVDGV + L + L Sbjct: 181 KASGKKVQFTALMHHIT-PRLLIDSFMHLKK---SAAAGVDGVTWHDYEECLVERIGKLW 236 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 D + +G Y+ LP+RRVYIPK++GK RPLGI AL D+IVQ+A++ + PI+ESDF SYG Sbjct: 237 DAVQAGRYRALPSRRVYIPKADGKQRPLGIAALEDKIVQQAVVTVLTPIYESDFLGFSYG 296 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 FRP R H A+ + + WV++ D+ S+FDTV H +M+ + RI+D R + Sbjct: 297 FRPGRGQHQALDAL---WVGLHWKKVNWVLDADIRSFFDTVDHGWMMRFLEHRIADKRLL 353 Query: 189 TLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDR 247 L+ K + AG I+ G G PQG VISPLL+NI L+ FD +L Sbjct: 354 RLIRKWLTAGVIENGAKTEIRVGTPQGAVISPLLANIYLHYVFDLWLQRW---------- 403 Query: 248 WYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSL 307 K V RYADD V+ + +A E + Sbjct: 404 ----------------RRRDAKGDVIVVRYADDSVVGFEA-EADASRFLEALKARF-AQF 445 Query: 308 KLRLNMDKTKIPHVND---------------GFIFLGHRLIRKRSRYGEMRVVST-IPQE 351 L LN KT++ F FLG I R +V + Sbjct: 446 GLSLNEQKTRVLQFGRYAASLRKRAGLGRPQTFDFLGFTHICATKRSNGGFIVRRLTSSK 505 Query: 352 KARNFAASLTALLWKVRISGEILLG 376 + R +L L++ R ++G Sbjct: 506 RMRATLKALRQALYRRRHEPIAVVG 530 >UniRef50_B1HW67 Possible group II intron reverse transcriptase/maturase n=23 Tax=Firmicutes RepID=B1HW67_LYSSC Length = 601 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. 
Identities = 103/369 (27%), Positives = 177/369 (47%), Gaps = 20/369 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ-ARLAVE 63 L + + + + L I + A R ++ G+ T G DG+ + + Sbjct: 20 LYERSKNNATKGL-NLYEHIISKNNILLAYRNIKANTGSKTAGTDGITIEQYKIEDVETF 78 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + +R L + Y+P RRV IPK NGK RPLGIP +RDR++Q+ +EPI E+ F+ Sbjct: 79 VDEIRATLKN--YKPQTVRRVEIPKPNGKTRPLGIPTMRDRLIQQMFKQILEPICEARFY 136 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRI 182 SYGFRP RS HHA+ + V++ D+ +FD V H L+K + I Sbjct: 137 NHSYGFRPNRSTHHAMGRCQFLANIALNQH---VVDIDIQGFFDNVSHSKLLKQMYSIGI 193 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D R ++++ K +KA +G+ ++G PQGG++SPLLSNI+LN+ D ++ ++ + K Sbjct: 194 CDKRVLSVVSKMLKAPIKGIGI---PTKGTPQGGILSPLLSNIVLNDLDWWISNQWENMK 250 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + + N + + T + RYADDF + K K + +G Sbjct: 251 TKFNYKERKNKVLMIKRTTTL------KEMYIVRYADDFKIFTKSHKN-AIKLYHAVKGY 303 Query: 303 LEGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLT 361 L+ L L ++ +K+KI ++ FLG L + + + + +K + Sbjct: 304 LKNHLNLDISNEKSKITNLRKRASEFLGFSLKVIK-KGKRYVANTHVMDDKVKTILQKAR 362 Query: 362 ALLWKVRIS 370 L+ +++ + Sbjct: 363 KLIHEIKKN 371 >UniRef50_A3WXE2 Putative reverse transcriptase n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3WXE2_9BRAD Length = 486 Score = 259 bits (663), Expect = 9e-68, Method: Composition-based stats. 
Identities = 114/350 (32%), Positives = 171/350 (48%), Gaps = 23/350 (6%) Query: 49 DGVNKTMLQARL--AVELQILRDELLSGHYQPLPARRVYIPKSN--GKLRPLGIPALRDR 104 D V ++ R+ L R LL G Y+P RRV IPK G RPLG+P ++DR Sbjct: 2 DKVTVKHIETRIGVERFLTTTRTMLLDGSYRPQAVRRVMIPKRGRPGLFRPLGVPTVQDR 61 Query: 105 IVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLT--------DCGETRGRW 156 +VQ A+L +EPI+E+ F T+SYGFRP+R+ A+ ++ + D +W Sbjct: 62 VVQAALLQLLEPIFEAVFLTVSYGFRPKRACRDALEHIRNAIRPTGQKTETDWPRPPYQW 121 Query: 157 VIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGG 216 VIEGD+ FD + H +M +RRR+SD R L+ +KAG + G G PQGG Sbjct: 122 VIEGDIKRCFDNIDHHHVMTCLRRRVSDRRVTRLVRAFLKAGVLSEGSLVRTKAGTPQGG 181 Query: 217 VISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCR 276 V+SPLL+N+ML+ ++ + + R + + R E + R Sbjct: 182 VLSPLLANVMLDGIERRYAKYVVPRLTRDGKP-YARPGNELRKFRHYERKAGRVVFLPIR 240 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR 336 YADDFV++V GT+ Q A +E L +KL L +KT + + +GF FLGHR+ + Sbjct: 241 YADDFVVLVNGTEEQARAEKEALAVFLREEMKLTLAPEKTHVTSLTEGFEFLGHRVRLRW 300 Query: 337 SRYGEMRVVSTIPQEKARNFAASLTALLWKVRI----------SGEILLG 376 IP+ + ++F + L + R +LLG Sbjct: 301 DDRWGYWPRVEIPKARVKDFHHRIKQLTTRGRSRLSFQEVIDALNPVLLG 350 >UniRef50_B4D301 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=Chthoniobacter flavus Ellin428 RepID=B4D301_9BACT Length = 415 Score = 259 bits (663), Expect = 9e-68, Method: Composition-based stats. 
Identities = 109/355 (30%), Positives = 162/355 (45%), Gaps = 49/355 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 I++L+ PE A + +S+ G PG+DG+ + L L + +R +LL+G Y Sbjct: 2 IEQLMEEAVSPENWHTAWKAVVSNGG--APGIDGMRCSELVEHLQRHGEAIRAKLLAGRY 59 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P R IPK G R LGIP + DR VQ+ +L + PI+E F SYGFRP RS H Sbjct: 60 TPSPVLRTKIPKPGGGERDLGIPTVLDRFVQQLLLQVLTPIYEPRFSARSYGFRPGRSTH 119 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A+R + + + +VI+ D+ +FD V+H LLM +R +SD R TL+ + +K Sbjct: 120 DAVRQAQAYV----KEGKSYVIDLDIEKFFDRVNHNLLMHRLRETVSDVRVRTLIGRYLK 175 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 AG + G+ + EG PQGG +SPLL+NI L+ D L Sbjct: 176 AGVMVNGVVQDNEEGTPQGGPLSPLLANIYLDPLDWELE--------------------- 214 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 +AY RYADD + V + A + + G +E L+L++N K+ Sbjct: 215 ------------GRGLAYVRYADDCNIYV-SSAAAAQRVLSSLIGWIEKKLRLKVNQTKS 261 Query: 317 KIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 G LG + + V I Q+ + + A R G Sbjct: 262 GTGPTT-GRRLLGFSI--------NAQGVIEIAQKSLTHLQEKVRAFWRPQRHRG 307 >UniRef50_Q01P79 RNA-directed DNA polymerase (Reverse transcriptase) n=16 Tax=Bacteria RepID=Q01P79_SOLUE Length = 462 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. 
Identities = 100/351 (28%), Positives = 159/351 (45%), Gaps = 47/351 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 RL+ + + E L A + ++KG+ PGVDG+ ++ L +R +LLSG Y+ Sbjct: 49 NRLMEEVCERENLKAALQRVKANKGS--PGVDGMTVIGIKDYLKQHWPAIRGQLLSGTYE 106 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P RRV I K +G +R LGIP + DR +Q+A++ ++ W+ F SYGFRP RS Sbjct: 107 PKPVRRVEIAKPDGGVRKLGIPTVLDRFIQQAVMQVLQRRWDRTFSDYSYGFRPGRS--- 163 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 + Q W ++ DL +FD V+H LM + +RI+D R + L+ + A Sbjct: 164 -AQQAVAQAQQYIAEGHGWCVDLDLEKFFDRVNHDKLMGQIAKRIADKRLLKLIRAFLNA 222 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G ++ GL + EG PQGG +SPLLSN++L+EFD+ L Sbjct: 223 GVMENGLVSPSVEGTPQGGPLSPLLSNLVLDEFDRELER--------------------- 261 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD + V+ +A + + E + LKL++N K+ Sbjct: 262 ------------RGHRFVRYADDCNIYVRSERAG-QRVMESITQFITQKLKLKVNETKSA 308 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 + + FLG I + F + + + + Sbjct: 309 VARPQER-KFLGFSFTAGPE------AKRVIAPKALDRFKRRIREITGRAK 352 >UniRef50_A4KVN1 Probable reverse transcriptase n=2 Tax=Sinorhizobium meliloti RepID=A4KVN1_RHIME Length = 490 Score = 259 bits (661), Expect = 2e-67, Method: Composition-based stats. 
Identities = 115/357 (32%), Positives = 179/357 (50%), Gaps = 43/357 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +LL + LA A + +KG PG DG M +A+ + LR ELL+G Y+P Sbjct: 56 QLLEEVASEANLATALLNVVRNKG--APGRDGQTVDMAEAKATSIIGRLRRELLNGKYRP 113 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 RRV++PK+ G R LGIP + DR+VQ+A+L +EPI+E FH S+GFRP+R H A Sbjct: 114 GDVRRVWLPKAGGGRRGLGIPNIVDRVVQQAVLQVLEPIFEPVFHDSSHGFRPKRGAHTA 173 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK-A 197 I L + + +++ DL+S+FD VHH+ L+ + +R+ D R +TL+ +K A Sbjct: 174 IAEASKYL----KEGYQTIVDLDLASFFDRVHHQRLLARIAQRVKDQRIITLINLMLKAA 229 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 + G A EG PQGG +SPLLSNI+L+E D+ L Sbjct: 230 VVMPDGTRVAPQEGTPQGGPLSPLLSNIVLDELDRELAR--------------------- 268 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + + RYADD + V+ +A + + R LE ++L++N +K+ Sbjct: 269 ------------RRLRFVRYADDSNIFVRSERAG-QRVMSSIRDFLERRMRLQVNEEKSG 315 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRV-VSTIPQEKARNFAASLTALLWKVRISGEI 373 + N+ FLG R + G++ V +S +++ R +T W I+ I Sbjct: 316 MRTPNE-VHFLGFRFRCPKGEGGDVVVLLSRKAEQRLRAKVREMTPPTWGRSIASCI 371 >UniRef50_Q188V0 Group II intron reverse transcriptase/maturase n=7 Tax=Firmicutes RepID=Q188V0_CLOD6 Length = 609 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. 
Identities = 112/388 (28%), Positives = 189/388 (48%), Gaps = 29/388 (7%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML-QAR 59 +Q +L +A S + L+ ITQ E + A R S+KG+ T G + + + Sbjct: 22 IQDELYQLSAEG-SHVFRDLMSYITQEENILLAYRNIKSNKGSKTAGTNKRTIIDVGEEN 80 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +Q +++ + +QP RRV IPK NGK RPLGIP + DR+VQ+ + +EPI E Sbjct: 81 PYQLVQYVQNRFNN--FQPHSIRRVEIPKPNGKTRPLGIPTIEDRLVQQCIKQILEPILE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + FH SYGFRPERS HHAI + +V++ D+ +FD V+H L+K + Sbjct: 139 AKFHKHSYGFRPERSSHHAIAIFQQW----TFKGFHYVVDIDIKGFFDNVNHGKLLKQLW 194 Query: 180 -RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +I D F+++L + +KA +V +++G PQGG++SPLL+N++LNE D ++ ++ Sbjct: 195 TMKIRDKTFISILSRMLKA---EVKGIGKSTKGTPQGGILSPLLANVVLNELDWWIDSQW 251 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 ++ + Q R + + RYADDF ++ K + I Sbjct: 252 DGFPTKRKYSSLLSKTQSIRKYSNL------KEIKIVRYADDFKIMCKDYHT-AQKIFLA 304 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 + L+ L L ++ +K+K+ ++ + FLG +L K+ + S + + N Sbjct: 305 TKQWLKVRLDLDISPEKSKVTNLRKNYSDFLGFKLKVKKGKANGYTNRSRMCDKAKINAV 364 Query: 358 ASLTALLW---------KVRISGEILLG 376 L + V ++LG Sbjct: 365 DKLKNNIKTIAANPTVDNVNKYNSVVLG 392 >UniRef50_B0K6R3 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Thermoanaerobacter RepID=B0K6R3_THEPX Length = 427 Score = 258 bits (658), Expect = 3e-67, Method: Composition-based stats. 
Identities = 102/371 (27%), Positives = 163/371 (43%), Gaps = 47/371 (12%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL 72 ++Q L+ + PE L GVD V + ++ L ++ Sbjct: 14 KYGKVQNLISYV-NPETLKAKHEE---MPKKKASGVDKVTWEEYDVNVDENVETLIAKMK 69 Query: 73 SGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 Y+P PARRVYIPK+NGKLRPLGIP D++V M + ++E+ F SYGFRP Sbjct: 70 RFSYRPQPARRVYIPKANGKLRPLGIPCYEDKLVAAVMADILNEVYENIFLDTSYGFRPG 129 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 RS H AI+ + + C + +V+E D+ +FD V + LM+ + I D F + Sbjct: 130 RSCHDAIKELNRIIGRC---KISYVLEADIKGFFDNVDQKQLMEFIAHDIDDKNFSRYIV 186 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWN 251 + +K+G ++ G + + +G QG +SP+L+NI L+ D + +GK R + Sbjct: 187 RFLKSGIMEEGKYHESDKGTAQGSPLSPILANIYLHYTLDVWFAYLKRNGKFRGE----- 241 Query: 252 NSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 RYADDFV++ + K+ + + E + L L Sbjct: 242 --------------------AYIVRYADDFVMLFQ-YKSDADKMYEALPKRM-AKFGLEL 279 Query: 312 NMDKTKIPHV------------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 MDKTKI + F FLG +R G+ R ++K + Sbjct: 280 AMDKTKILPFGRFAKQNSKDGKTETFDFLGFTFSNGTTRNGKYRAHIQTNKKKLKAKRQV 339 Query: 360 LTALLWKVRIS 370 + A L + + + Sbjct: 340 VKAWLKEQQHA 350 >UniRef50_C3FJT8 RNA-directed DNA polymerase (Reverse transcriptase) n=11 Tax=Bacteria RepID=C3FJT8_BACTB Length = 443 Score = 256 bits (655), Expect = 8e-67, Method: Composition-based stats. 
Identities = 123/391 (31%), Positives = 193/391 (49%), Gaps = 53/391 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ-AR 59 +++++ A ++P+ R L IT+ L EA + + G PG+DG + ++ Sbjct: 19 LRQRIYRKAKSEPTHRFWGLFTHITKMTTLHEAYQQARKNNG--APGIDGKSFADIELEG 76 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L +++EL +G Y+P R+V IPK+NGK+R L IP +RDR+VQ A+ + +E I+E Sbjct: 77 VIPFLTGIQEELQAGIYRPQSNRKVEIPKANGKMRTLQIPCIRDRVVQGALKLILEAIFE 136 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS H A+ V+ + R +I+ DLS YFDT+ H +L++ + Sbjct: 137 ADFCPNSYGFRPKRSPHQALAEVRRSILR----RMTIIIDVDLSRYFDTIRHNILLEKIA 192 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +R+ D + M L+ + IKA A GVPQGG SPL +NI LNE D Sbjct: 193 KRVQDPQVMHLVKQVIKA---------AGKIGVPQGGPFSPLAANIYLNEVDWTFD---- 239 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + AV Y R+ADD V+ V G ++ Sbjct: 240 -------------------AIRRKTAEGNYEAVNYHRFADDIVIAVSGHSSKSGWAELAL 280 Query: 300 RGVLE--GSLKLRLNMDKTKIPHV--NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKAR- 354 R + E L + LN++KT++ +V + F FLG L R +R V IP++KAR Sbjct: 281 RRLWEQLKPLGVELNLEKTQMINVLKGESFGFLGFDLRRIPNRNKNGFFVFMIPKKKART 340 Query: 355 NFAASLTALLWK---------VRISGEILLG 376 A + L+ ++ +L G Sbjct: 341 TVKAKIRELIQNAGAKPAQDLIKQINAVLTG 371 >UniRef50_C2KES2 Reverse transcriptase/maturase n=14 Tax=Firmicutes RepID=C2KES2_9LACO Length = 432 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. 
Identities = 106/363 (29%), Positives = 164/363 (45%), Gaps = 47/363 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L+ I L EA R +KG PGVD L + ++D ++ Y+P Sbjct: 3 LIEQILSQNNLKEAIRRVKINKG--APGVDRRTVDELDSYFKKHQVEIKDAIMKMKYRPQ 60 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 RRVYIPK+NGK RPLGIP + DR++Q+A+ + I++ F SYGFRP RS H AI Sbjct: 61 AVRRVYIPKANGKKRPLGIPTVVDRVIQQAIAQVLMKIYDPHFSEHSYGFRPGRSAHDAI 120 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V L + +WV++ D+ YFDTV+H L+ +R +++D + L+ +KAG Sbjct: 121 EQVLEYLNE----GYQWVVDLDIEKYFDTVNHDKLISIIREQVNDKTTLHLIRAFLKAGV 176 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 ++ G + GVPQGG +SP+LSNI L++ D+ L + Sbjct: 177 MEDGWVKPNKLGVPQGGPLSPILSNIYLDKMDKELEQ----------------------- 213 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + + RYADD + VK K+ + + LE L L++N KTK+ Sbjct: 214 ----------RGLRFVRYADDCNIFVKSGKS-AKRVMNSISSWLERKLFLKVNATKTKVV 262 Query: 320 HVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQ--EKARNFAASLTALLWKVRI----SGEI 373 FLG + + + +K R A + + ++ Sbjct: 263 RPTKS-NFLGFTFWKSGEYWQAKPGDDRKIKLYDKMRELLCRRKAAAQPLSLVFTKINQV 321 Query: 374 LLG 376 + G Sbjct: 322 VRG 324 >UniRef50_Q1QGR6 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Bradyrhizobiaceae RepID=Q1QGR6_NITHX Length = 440 Score = 256 bits (653), Expect = 1e-66, Method: Composition-based stats. 
Identities = 111/375 (29%), Positives = 171/375 (45%), Gaps = 48/375 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ-AR 59 +QRKL A +P+ R L I + + L A + ++ G PGVDG+ ++ A Sbjct: 12 LQRKLYRKAKAEPAFRFYILYDKICREDVLLRAYTLARANAG--APGVDGMTFGQIEGAG 69 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L LR++L+S YQP P RRV IPK G RPLGIP +RDR+VQ A + +EPI+E Sbjct: 70 VDAWLAGLREDLVSKTYQPDPVRRVMIPKPGGGERPLGIPTIRDRVVQAAAKIVLEPIFE 129 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + F +YG+RP RS A++ L V++ DLS YFDT+ H L+++V Sbjct: 130 AGFEDGAYGYRPRRSAIDAVKETHRLLCR----GYTDVVDADLSKYFDTIPHADLLRSVA 185 Query: 180 RRISDARFMTLLWKTIKAGHID---VGLFR-----AASEGVPQGGVISPLLSNIMLNEFD 231 RR+ D + L+ ++ + G +++ G PQGGV+SPLLS I +N F Sbjct: 186 RRVLDRNVLRLIKLWLQVPVEERDGDGKRHMSGGKSSTRGTPQGGVVSPLLSVIYMNRFL 245 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 ++ YADDFV++ +G + Sbjct: 246 KHWR---------------------------LTGRGEVFHAHVISYADDFVILSRGHAEE 278 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKR-SRYGEMRVVSTIP 349 L L LN KT + + +GF FLG+ L + G + ++ Sbjct: 279 ALTWTRAVMT----KLGLTLNEAKTSVKNARREGFDFLGYTLGPRHLPNGGRWYLGASPS 334 Query: 350 QEKARNFAASLTALL 364 ++ + + LL Sbjct: 335 KKSMQRVKVKIGELL 349 >UniRef50_C9BNF1 Group II intron reverse transcriptase/maturase n=6 Tax=Bacilli RepID=C9BNF1_ENTFC Length = 600 Score = 256 bits (653), Expect = 1e-66, Method: Composition-based stats. 
Identities = 100/364 (27%), Positives = 177/364 (48%), Gaps = 18/364 (4%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 + + +L LI + A R ++KG+ TPG D + E L Sbjct: 22 TQSRNGKKFYQLYELIISENNILLAYRTIKANKGSSTPGTDSFTIDNYKEMNQAEFIHLI 81 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYG 128 L +Y+P +RV IPK NG+ RPLGIP + DR++Q+ +EPI E+ F+ SYG Sbjct: 82 LSHLE-NYKPKSIKRVMIPKPNGEKRPLGIPCMIDRMIQQMFKQILEPICEAKFYEHSYG 140 Query: 129 FRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR-RISDARF 187 FRP RS HA+ + + ++ + ++ D+ +FD V+HRLL+K + I D + Sbjct: 141 FRPLRSAKHALGRI---MYLINISKMHYAVDIDIKGFFDNVNHRLLIKQLWNIGICDKQV 197 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDR 247 + +L K++K+ G+ S+G QGG+ISPLLSN++LN+ D ++ +++ + + + Sbjct: 198 LAILSKSLKSPIQGEGI---PSKGTIQGGIISPLLSNVVLNDLDHWVSKQWHTFETKYPY 254 Query: 248 WYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSL 307 N + R T +++ RYADDF ++ ++ + L+ L Sbjct: 255 TKGYNKFRALRDTNLKQG-------YIVRYADDFKIMTNDYPTALKWF-HAVKLYLKDRL 306 Query: 308 KLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 KL ++ +K+KI ++ FLG + + + + S I +K + + Sbjct: 307 KLDISNEKSKIVNLRKCKSEFLGFAI-CVKQKGKKWVCNSRISNKKKDQIKEEIKQRIKD 365 Query: 367 VRIS 370 ++ S Sbjct: 366 IQKS 369 >UniRef50_B8R181 Putative intron-encoded reverse transcriptase n=2 Tax=Volvox carteri RepID=B8R181_VOLCA Length = 598 Score = 256 bits (653), Expect = 1e-66, Method: Composition-based stats. 
Identities = 105/368 (28%), Positives = 184/368 (50%), Gaps = 20/368 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR-LAVE 63 + + SL+ L P +A A R +KG+ TPGVD V+ L+++ L + Sbjct: 25 IYHVSKRGGSLK--NLYDTAFSPRNIALAFRNLKFNKGSKTPGVDDVHIGNLKSKPLNIF 82 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ ++ + +Y+P RRV+IPK NGK RP+GIP L DR+ Q+ + +EPI E+ FH Sbjct: 83 IRDIQKM--AKNYKPSLVRRVWIPKPNGKKRPIGIPTLADRLFQQCIKQVIEPICEAKFH 140 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK-AVRRRI 182 SY FRP RS A+ + + +V++ D+ S+FDT+ H L+K I Sbjct: 141 PHSYVFRPNRSTSDALARALFLM---NQNELHYVVDIDIQSFFDTIDHGKLLKQCWAIGI 197 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D + ++++ K +KA + G+ +++G PQGG++SPLLSNI LNE D + ++ + Sbjct: 198 RDKKILSIMSKMLKAEVVGEGI---STKGTPQGGILSPLLSNICLNELDWWYTSQWATFP 254 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + +++ + R + RYADDF + + K I R Sbjct: 255 TKYPYKRTSHAARALRGNS------KLKEFHSVRYADDFKIFCRDYKT-AVKIFAATRLW 307 Query: 303 LEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLT 361 L+ L L+++ +K+ I ++ FLG L ++ G+ S + + +++ Sbjct: 308 LKDRLNLQISSEKSSITNLRKKSSPFLGISLRACHNKNGKFSCRSRVLDKAMETMKSTIK 367 Query: 362 ALLWKVRI 369 A + K++ Sbjct: 368 AAIIKLQK 375 >UniRef50_C4ZES6 RNA-directed DNA polymerase n=27 Tax=Bacteria RepID=C4ZES6_EUBR3 Length = 554 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. 
Identities = 100/386 (25%), Positives = 170/386 (44%), Gaps = 48/386 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAE-AARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++ L L+T + A + S+KG +T GVD + + Sbjct: 30 LQMRIVKAQKDGHYNKVKTLQWLLTHSFYAKALAVKRVTSNKGKNTAGVDHELWKTPKGK 89 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 ++L Y+P P RRVYIPK NGKLRPL IP + DR +Q A+EP+ E Sbjct: 90 FEA-----IEKLKRRGYKPQPLRRVYIPKKNGKLRPLSIPTMTDRAMQTLYKFALEPLAE 144 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFR RS H AI L C +W++EGD+ FD + H L+ + Sbjct: 145 TLADPNSYGFRIGRSTHDAIGQCFNDL--CRAGSPQWILEGDIKGCFDHISHNWLLANIP 202 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +K G ++ EG PQGG ISP+L N+ L+ ++ L ER+ Sbjct: 203 MD------KKMLGKWLKCGFVETKKLFPTEEGTPQGGTISPVLMNMTLDGLERILKERFP 256 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + ++ + + RYADDF++ K + + Sbjct: 257 MRRTVAGKTVYDQ-------------------INFVRYADDFIVTGKSPETLRNEVMPLI 297 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L L+L+ +KT I H++DGF FLG + + +++ + ++F Sbjct: 298 KDFLAER-GLQLSEEKTVITHISDGFDFLGQNVRKY-----NGKLLIKPSKNAIKSFLKK 351 Query: 360 LTALLWKVRIS---------GEILLG 376 + ++ + + + ++ G Sbjct: 352 VRTIVRENKTATQDLLIRKLNPVIRG 377 >UniRef50_B2A0J9 RNA-directed DNA polymerase (Reverse transcriptase) n=43 Tax=Bacteria RepID=B2A0J9_NATTJ Length = 475 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. 
Identities = 106/363 (29%), Positives = 172/363 (47%), Gaps = 46/363 (12%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++ ++ +L LL I + + +A + S+KG+H G+DG+ L L Sbjct: 42 RITENNISNANLSKGNLLEEILDRDNMNKAFKKIKSNKGSH--GIDGMGVDELLQYLKEN 99 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 LR +L G Y+P P RRV IPK +GK R LGIP + DR++Q+A+ + PI+E F Sbjct: 100 GDHLRQRVLDGKYRPNPVRRVEIPKEDGKKRKLGIPTVVDRVIQQAIAQVLSPIYEEQFS 159 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGFRP RS H AI+ + + + ++V++ DL YFDTV+ L++ + + I Sbjct: 160 DNSYGFRPGRSTHDAIKKSQQNINE----GYKYVVDMDLEKYFDTVNQSKLIEVLSKTIK 215 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D R ++L+ K ++AG + ++ GVPQGG +SP+LSNIML+E D+ L Sbjct: 216 DGRVISLINKYLRAGVMIKHTYKDTEVGVPQGGPLSPILSNIMLHELDKELE-------- 267 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 K + RYADD ++ K ++ ++ + Sbjct: 268 -------------------------KRGHEFVRYADDLLIFCKSRRSAGRTLKN-ILPFI 301 Query: 304 EGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTAL 363 E L L++N DKT + +V FLG R + + + + + L Sbjct: 302 ENKLFLKVNKDKTVVAYVGK-VRFLGFGFYRHK-----GKARLRVHLKSVTKMRTRIKEL 355 Query: 364 LWK 366 + Sbjct: 356 TSR 358 >UniRef50_D2CJC8 Putative uncharacterized protein orf2 (Fragment) n=1 Tax=Candida sojae RepID=D2CJC8_9ASCO Length = 773 Score = 254 bits (649), Expect = 3e-66, Method: Composition-based stats. 
Identities = 117/407 (28%), Positives = 186/407 (45%), Gaps = 55/407 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGV-----DGVNKTMLQA 58 KL P + + +I+ PE+L A + S G TPG DG+ + Sbjct: 182 KLLKEGLNKPDEVYKNIRPIISDPEFLMYAYSLIKSKPGNMTPGTNKETLDGITSETFK- 240 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++ E+ SG Y+ P RR+ IPK+ G +RPL I + RD+IVQ AM + +E I+ Sbjct: 241 -------VMGREIGSGAYKFRPNRRIEIPKAKGGIRPLSIASPRDKIVQMAMKIILEAIF 293 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E S+GFRP RS H A+ ++ + W IE D++ FDT+ L++K V Sbjct: 294 EPHMSDFSHGFRPNRSTHTALYQLRGIFHEVS-----WFIEADITKCFDTLPQDLIIKEV 348 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH--- 235 +RI D F+ L+ K AG+I+ F+ S G PQG +ISP+L NI+L D++L Sbjct: 349 EKRIKDQVFLDLIHKCFNAGYIENN-FKIPSAGTPQGSIISPILCNILLTVMDEWLMEYS 407 Query: 236 ERYLSGKARKDRWYWNN-------------------SIQRGRSTAVRENWQWKPAVAYCR 276 ER+ G R+ + I R + ++ N + + R Sbjct: 408 ERFSVGTRRRANPVYTKLVRGINKASLLNQKINIRAQIHRDKKRSLLGNDPNFKRMRFVR 467 Query: 277 YADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLGHRL--- 332 YADDF++ V G+ I+++ L+ L++ L+ DKT I D FLG + Sbjct: 468 YADDFIIGVIGSYQDSCKIKQDLTNFLKDRLRVELSQDKTLITSATKDKAHFLGFDIAIT 527 Query: 333 -------IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGE 372 + + G R+V+ + A +T ++ K+ G Sbjct: 528 PYEKRQLMWVKRADGSTRLVAQTSRP---QILAPITKIVAKLGDKGF 571 >UniRef50_A8MI91 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=A8MI91_ALKOO Length = 421 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. 
Identities = 114/362 (31%), Positives = 174/362 (48%), Gaps = 39/362 (10%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L+ I + E L A + + G PG+DG L + ++ L D+L + Sbjct: 2 RKWYSLIDKIYRKENLELAFKYVKKNNG--APGIDGETVFNFHLNLELNIEFLHDKLKTN 59 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P P RRV I K +G +R LGIP ++DR+VQ+A++ +EPI++ FH SYG+RP S Sbjct: 60 GYEPSPVRRVEIQKPDGGVRLLGIPTVKDRVVQQAIVNIIEPIFDKTFHPSSYGYRPNHS 119 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + + V++ DLS FDT+ H ++MKAV RISD R + L+ K Sbjct: 120 QHGAVAKAERFMNKY---GLEHVVDMDLSKCFDTLDHEIMMKAVSERISDGRVLKLIEKF 176 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +KAG + F G PQGGVISPLLSNI LN+FDQ + Sbjct: 177 LKAGVMHSDNFSRTEVGSPQGGVISPLLSNIYLNQFDQRMMS------------------ 218 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 + R+ADD ++ K K + VLE LKL++N + Sbjct: 219 ---------------KGIRIVRFADDILIFAKDKKT-AGNYKAYATQVLENELKLKVNNE 262 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 KTK+ +VN+G FLG + K R+ + +++ + L ++ ++ Sbjct: 263 KTKLTNVNEGVEFLGFVIKDKWLGVNPKRIERFKDKVRSKTKRNAGRKLEDIIKDLNPVI 322 Query: 375 LG 376 G Sbjct: 323 RG 324 >UniRef50_C5ER86 RNA-directed DNA polymerase n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ER86_9FIRM Length = 635 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. 
Identities = 98/384 (25%), Positives = 171/384 (44%), Gaps = 22/384 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + + + + L+ L+ E + A R + G+ TPG+DG L E+ Sbjct: 26 LYQKSLEN--RKFKNLMELVLMEENIKLAYRNMKKNDGSTTPGIDGKTIEHLAKMTEKEV 83 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L L Y P RRV I K NGK RPLGI ++ DR++Q+ +L +EPI E+ FH Sbjct: 84 IELVRNKLE-WYTPKAIRRVEIDKGNGKKRPLGIASIEDRLIQQCILQVLEPICEAKFHD 142 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR-RIS 183 S GFRP R V +A+ + + + V++ D+ +FD V H L+K + I Sbjct: 143 RSNGFRPNRGVENALAQAEKLIQS---NKLYIVVDIDIKGFFDNVSHGKLLKQLWTIGIQ 199 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D + ++++ +K ++G +G QG +ISPLLSN++LNE D ++ ++ Sbjct: 200 DKKLISIISAMLKGEIAEIG---FPEKGTAQGSIISPLLSNVVLNELDWWIASQWEFMPT 256 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWK--PAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 R + + + RYADDF + + K + E + Sbjct: 257 RHVYKEAIKANGTQSKSKKYRALRNSTLKECFIVRYADDFKIFCRKHK-DAVVMFEATKQ 315 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGF-IFLGHRLIRKRSRY--------GEMRVVSTIPQEK 352 L+ L L ++ +K+KI ++ + FLG R+ + + V S I ++ Sbjct: 316 WLKTRLGLDISPEKSKIVNLKHSYSEFLGFRIKVHKKGKDTKCKPPVDKYVVKSHISEKA 375 Query: 353 ARNFAASLTALLWKVRISGEILLG 376 + + + ++ + +G Sbjct: 376 LKKIKTNAKERIIAIQKTNGSRVG 399 >UniRef50_B0URY2 RNA-directed DNA polymerase n=25 Tax=cellular organisms RepID=B0URY2_HAES2 Length = 575 Score = 253 bits (646), Expect = 8e-66, Method: Composition-based stats. 
Identities = 101/376 (26%), Positives = 173/376 (46%), Gaps = 42/376 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAE-AARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ ++A +++ L R++T + A R + G T G+D +++ Sbjct: 40 MQVRIAKATQESNWRKVKNLQRMLTHSFYAKALAVRRVTENTGKRTAGIDKRIWDTPESK 99 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +L S YQP P RRV+IPKSNGK RPLGIP ++DR +Q L+A++PI E Sbjct: 100 WIA-----IQDLSSKGYQPKPLRRVFIPKSNGKKRPLGIPTMKDRAMQMLYLLALQPIAE 154 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLT---DCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 + SYGFR RS AI + + + WV++ D+ FD ++H L+K Sbjct: 155 TTADNNSYGFRLNRSTADAISHIHSIFSTKGNQSRQMAEWVLDADIHGCFDFINHDWLLK 214 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + +L K +K+G ++ G + +EG PQG +ISP L+N+ L+ ++ L + Sbjct: 215 HIPMN------KRILKKWLKSGVVEFGQLKPTTEGTPQGDIISPTLANMALDGLEKELIK 268 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + + + K K RYADDF++ + E + Sbjct: 269 HFGAKNSLKI---------------------AKHRTYLVRYADDFIISGISKELLEEQVI 307 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 + L L L+ KTK+ H+ GF FLG + R + +++ ++ A+ F Sbjct: 308 PMVKNFLAER-GLSLSESKTKVVHIEHGFDFLGWTVKRF-----DKKLIIKPSKKNAKAF 361 Query: 357 AASLTALLWKVRISGE 372 + + K++++ + Sbjct: 362 YDKVKQSISKMKMAKQ 377 >UniRef50_C3B585 Reverse transcriptase/endonuclease protein n=1 Tax=Bacillus mycoides Rock3-17 RepID=C3B585_BACMY Length = 614 Score = 253 bits (646), Expect = 8e-66, Method: Composition-based stats. 
Identities = 110/392 (28%), Positives = 193/392 (49%), Gaps = 40/392 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPE--WLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ +++ L RL+ + E L ++T +KG T GVDG ++ Sbjct: 34 IQQRIYRAEQLGQRRKVKGLQRLLMRSEATLLISIRQVTQLNKGKRTAGVDGF--KAIKP 91 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKS--NGKLRPLGIPALRDRIVQRAMLMAMEP 116 ++L + ++P P +R+ IPK KLRPLGIP + DR+ Q + +A+EP Sbjct: 92 TERIKLFHKMKAMNLATHKPSPVKRIEIPKDTAGKKLRPLGIPIIIDRVYQNVVKLALEP 151 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 WE F SYGFRP+R AI ++ L+L + R WV EGD FD ++H ++ Sbjct: 152 QWEVHFEPTSYGFRPKRGCQDAITSIFLKLKTTSKKR--WVFEGDFKGCFDNLNHDYIL- 208 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 +I D + ++ K ++AG + G+F + G PQGG+ISPLL+NI L+ ++ + Sbjct: 209 ---EQIKDLPYKEIVKKWLRAGFVHNGVFNLTNNGTPQGGIISPLLANIALHGMEEEIGV 265 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 +Y++ + + ++Q +S RYADDFV++ TK + E++ Sbjct: 266 KYINRTHPRKKGERYWTVQDTKS--------------VVRYADDFVIMT-DTKEEAESMY 310 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRS---RYGEMRVVSTIPQEKA 353 E+ + L L L +KTK+ HV++GF FLG + + + + ++ + ++ Sbjct: 311 EKLKPYLVKR-GLELAPEKTKVVHVSEGFDFLGFTIRQFPTAKEKGRLWKLFTKPSKKSI 369 Query: 354 RNFAASLTALLWK---------VRISGEILLG 376 + + A K +R+ I+ G Sbjct: 370 KKAVTKIKACFEKYKGSNISALIRVLNSIIRG 401 >UniRef50_C6IQ61 Putative uncharacterized protein n=10 Tax=Bacteroidales RepID=C6IQ61_9BACE Length = 560 Score = 253 bits (645), Expect = 1e-65, Method: Composition-based stats. 
Identities = 105/373 (28%), Positives = 170/373 (45%), Gaps = 38/373 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++ L L+T A A + S+KG +T GVD V + A+ Sbjct: 33 LQARIVKVQKEGRYGKVKALQWLLTHSFAAKALAVKRVTSNKGKNTSGVDKVLWSTPIAK 92 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 EL Y P+P +RV I KSNGKLRPLGIP ++DR +Q LMA++P+ E Sbjct: 93 ANA-----ITELKRRDYNPMPLKRVNIRKSNGKLRPLGIPTMKDRAMQALYLMALDPVAE 147 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFR ER AI + + E+ +W++EGD+ FD ++H L+ + Sbjct: 148 TTADNHSYGFRKERCTGDAIH--QCYINLSKESSPQWILEGDIKGCFDHINHEWLLNNIP 205 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +K+G I EG PQGG+ISP L+N+ L+ L ++ Sbjct: 206 MD------KVMLRKWLKSGFIFNKQLFPTEEGTPQGGIISPTLANMALDGLQTMLEAKFH 259 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 R + P V RYADDF++ + + I Sbjct: 260 ------------------RVDLYSPKRSYYPKVHLIRYADDFIITSISKEMLEQEIMPMV 301 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L+ L L+ +KTKI H+++GF FLG + + + + + T +E + F Sbjct: 302 KEFLQAR-GLTLSEEKTKITHIDEGFDFLGFNIRKYK-----GKFLITPSKESQKKFQRK 355 Query: 360 LTALLWKVRISGE 372 + ++ + + Sbjct: 356 INEIVNSHKTIPQ 368 >UniRef50_C3LL08 Group II intron reverse transcriptase/maturase n=34 Tax=Firmicutes RepID=C3LL08_BACAC Length = 643 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. 
Identities = 115/384 (29%), Positives = 184/384 (47%), Gaps = 21/384 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV-E 63 L A S + L+ +I E + A R +KG+ T D VN ++ Sbjct: 29 LYQKATKGNS--FKNLMSIIISDENILLAYRNIKGNKGSRTAACDNVNIKNIEGMEQSYF 86 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 L ++ + YQP RR I K NG+ RPLGIPA+ DRI+Q+ +L MEPI E+ F Sbjct: 87 LNEVKRRFQN--YQPQKVRRKEISKPNGQTRPLGIPAMWDRIIQQCILQVMEPICEAHFS 144 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR-I 182 SYGFRP RS HA+ +++ + +V++ D+ +FD V+H LM+ + I Sbjct: 145 NRSYGFRPNRSAEHALADASVRV---NKQNLTYVVDVDIKGFFDEVNHVKLMRQLWTLGI 201 Query: 183 SDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 D + + ++ K +KA + G ++G PQGG++SP+L+N+ LNEFD ++ ++ + Sbjct: 202 RDKQLLVIIRKILKAPVQMPDGTTMFPTKGTPQGGILSPILANVNLNEFDWWISRQWETF 261 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWK-PAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 KA+K + I + K + RYADDF + T++ E I + + Sbjct: 262 KAKKVKPRCMRGIWCNDVVTTQLTKTSKMKPMYIVRYADDFKIFTN-TRSNAEKIFKATQ 320 Query: 301 GVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRL--------IRKRSRYGEMRVVSTIPQE 351 LE LKL ++ +K+K+ ++ FLG L +RY + VS E Sbjct: 321 MWLEERLKLSISAEKSKVTNLTKQQSEFLGFTLKAVKKGKKKNGDTRYIAVTHVSPKALE 380 Query: 352 KARNFAASLTALLWKVRISGEILL 375 K + A + K S E + Sbjct: 381 KTKQDLAKQVRRIQKTPNSNETIK 404 >UniRef50_A0RHJ0 Reverse transcriptase/endonuclease protein n=6 Tax=Firmicutes RepID=A0RHJ0_BACAH Length = 608 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. 
Identities = 106/375 (28%), Positives = 176/375 (46%), Gaps = 20/375 (5%) Query: 1 MQRKLATWAATDPSLR-IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q R + + LI E + A R S+ G+ T G +G L Sbjct: 17 LQSTFDNLYEESKKGRYFKNIYELIISEENIRLAFRNLKSNIGSKTKGTNGHTIKHLNKI 76 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 A +L L + L +Y P RR++I K NGK+RPLGIP + DR++Q+ +EPI E Sbjct: 77 DADKLIRLTQKRLE-NYMPHAVRRLFISKPNGKMRPLGIPTIEDRLIQQMFQQVLEPIVE 135 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 FH SYGFRP+R H A+ + + +V++ D+ +FD V+H+ LM+ + Sbjct: 136 GKFHPQSYGFRPKRGTHDALARCYHMV---NHSHQHFVVDIDIKGFFDNVNHKKLMRQLW 192 Query: 180 R-RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I D + ++++ K +KA G+ +G PQGG++SPLL+N++LNE D ++ ++ Sbjct: 193 TIGIRDKKVLSIIKKMLKAEVTGEGIPV---KGTPQGGILSPLLANVVLNELDWWVSNQW 249 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + R N + T ++ + RYADDF + I+ Sbjct: 250 ETKPTRVPYKLKRNKTDALKKTRLKP-------MYLVRYADDFKIFTNSY-DNARKIKIA 301 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLI-RKRSRYGEMRVVSTIPQEKAR-N 355 L+ L L ++ +K+KI ++ G FLG R ++ +V++ KA+ Sbjct: 302 VEKWLKERLGLEISEEKSKITNLRKNGTDFLGIRFRAVQKGNAKTGYIVNSKMDPKAKDK 361 Query: 356 FAASLTALLWKVRIS 370 + L K+R S Sbjct: 362 VLGVIRHQLVKLRKS 376 >UniRef50_Q1Q0X4 Similar to Group II intron encoded reverse transcriptase n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q0X4_9BACT Length = 298 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. 
Identities = 102/341 (29%), Positives = 171/341 (50%), Gaps = 46/341 (13%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 + L + + L A + +KG G+D V+ ++ L V + + EL + Sbjct: 3 KYHSLRDKVFSLKNLYAAFKHVKKNKGK--AGLDRVSIKQFESNLDVNIMSIHQELKTAI 60 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y P P RVYIPK RPLGIP ++DRIVQ+A +EPI+E F S+GFRP+R Sbjct: 61 YNPAPVLRVYIPKGRHDKRPLGIPIVKDRIVQQAFRQIIEPIFEKGFSDNSFGFRPDRCC 120 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H AI+ ++ + V++ D+ +++DT+ H+L+M ++R +I+D + + + Sbjct: 121 HDAIKRLEQ----YKQEGYTSVLDADIMAFYDTIPHKLIMDSLREKIADGWVLNSIENML 176 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 KAG ++ G+ ++G PQGGVISPLL+N++ + D+ L + Sbjct: 177 KAGVMEDGIVHETNKGTPQGGVISPLLANLIGDIIDKELEKA------------------ 218 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 + RYADDFV++ K TK ++ A + ++ G L ++L+ DK Sbjct: 219 ---------------GYKFVRYADDFVVMTK-TKDELPAALSYVKEIIAGKLGMKLSEDK 262 Query: 316 TKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 TK+ + GF FLG+ G+ + +ST K ++ Sbjct: 263 TKLTNFERGFRFLGYDF------KGKYKGISTKSLNKLKSL 297 >UniRef50_Q82RB7 Putative reverse transcriptase homolog; similar to GII intron n=1 Tax=Streptomyces avermitilis RepID=Q82RB7_STRAW Length = 588 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. 
Identities = 106/370 (28%), Positives = 167/370 (45%), Gaps = 43/370 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +++++ A +++ L +L+ + L R+ S G T G+DG + Sbjct: 36 LRQRIFRAAREGDMKQVRNLQKLMRRSRANTLTSVRRVCQVSTGKKTAGIDGQKALSPEK 95 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R QIL D + P P RRVYIPK+NGK RPLGIP +RDR+ Q A+EP W Sbjct: 96 RGKTARQILADPMSH----PQPVRRVYIPKANGKRRPLGIPVIRDRVDQARFKNALEPEW 151 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP R AI + + + WV++ DLS+ FD + H+ LM +V Sbjct: 152 EARFEARSYGFRPGRGAWDAIEMIFN-VAGRRTAKRLWVLDADLSAAFDHISHQHLMDSV 210 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 + + ++AG ++ G F + EG PQGGVISPLL NI L+ + + Sbjct: 211 GLFPG----RRQIQQWLRAGVMEDGRFVSTPEGTPQGGVISPLLMNIALHGMGEVI---- 262 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 A R + RYADDFV+ T+ + +++ Sbjct: 263 ---------------------GANRPWNAKTTSPTLVRYADDFVVFCT-TENEAIKAKQD 300 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 LE L N +KT++ H++ G FLG + R R +++ ++ + Sbjct: 301 LAAWLEPR-GLSFNEEKTRVVHLSSGVDFLGFNVGRFR-----QKLIIKPSRDALQRARK 354 Query: 359 SLTALLWKVR 368 ++ + Sbjct: 355 RISTTARENS 364 >UniRef50_Q9MD87 Putative maturase n=1 Tax=Cryphonectria parasitica RepID=Q9MD87_CRYPA Length = 778 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. 
Identities = 119/396 (30%), Positives = 193/396 (48%), Gaps = 34/396 (8%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 RKL + + + +L ++ P +L +G T G D L Sbjct: 197 RKLLDQSKI-ANQKYYNILNVLADPNFLIACYDEIKGKQGNMTRGYDKATLDGLDYNW-- 253 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 EL +G Y P+RRV IPK+NGK RPLG+ + RD+IVQ+A+ +E I+E F Sbjct: 254 -FVKTAGELKAGKYNFKPSRRVEIPKANGKTRPLGVGSPRDKIVQKALHAILEAIFEPLF 312 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP RS H A+ V + WVI+GD++ FD++ H +++K + +I Sbjct: 313 LPSSHGFRPNRSTHSALLKV-----YLSGNKHNWVIQGDITKCFDSIPHSIILKRIGAQI 367 Query: 183 SDARFMTLLWKTIKAGHIDV--GLFRAASEGVPQGGVISPLLSNIMLNEFDQYL---HER 237 D +++ L+ K ++AGHID G + G PQGG++SP+LSNI+L+EFD+Y+ E Sbjct: 368 GDKKYLNLISKYLEAGHIDPKTGTKVVLNYGTPQGGILSPILSNIVLHEFDKYMAKLSES 427 Query: 238 YLSGKARKDRWYWNNSI-QRGRSTAVREN----------------WQWKPAVAYCRYADD 280 + GK R+ + + +RGR+ ++ E + Y RYADD Sbjct: 428 FHKGKKRRWNPAYKRLLARRGRTKSLEEKQTLLKQMRTMRSIDAFDPNFRRLDYVRYADD 487 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRY 339 FV+ + G+ IR + L+ + L LN+DKT I ++ + + FLG L + + Sbjct: 488 FVVFISGSSKDALFIRNNLKDYLKVNCGLELNVDKTAISNLATEKWKFLGAELSKIKLNA 547 Query: 340 GEMRV--VSTIPQEKARNFAASLTALLWKVRISGEI 373 + I A ++ L+ ++ G + Sbjct: 548 NWLVSHGRKRIIGTPMLLVNAPISGLISSLKKVGIV 583 >UniRef50_B7HM08 Group II intron reverse transcriptase/maturase n=30 Tax=Firmicutes RepID=B7HM08_BACC7 Length = 610 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. 
Identities = 103/379 (27%), Positives = 179/379 (47%), Gaps = 28/379 (7%) Query: 1 MQRK---LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 MQRK L + + + +L+ +I E + A R ++KG++T GVD + + Sbjct: 15 MQRKYDELYSNSLNGNN--FYKLIDIIGSEENIRLAYRNIKTNKGSNTAGVDNLTIKDI- 71 Query: 58 ARLAVELQILRDELLS--GHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAM 114 + + E+ +YQP +RV IPK K RPLGIP + DR+VQ+++L + Sbjct: 72 --WHLNDTKIIHEVRKRLNNYQPQAVKRVLIPKEGSDKKRPLGIPTIWDRLVQQSILQVL 129 Query: 115 EPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLL 174 EPI E+ FH SYGFRP RS HHA+ V + + + ++ D+ +FD V H+ L Sbjct: 130 EPICEAKFHNHSYGFRPNRSTHHALSRVVSLINIGHQ---HYCVDIDIKGFFDNVCHKKL 186 Query: 175 MKAVRRR-ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQY 233 ++ + I D + ++ K +K+ G+ ++G PQGG+ISPLLSNI+LNE D + Sbjct: 187 LRQMWTLGIRDKSLLCVISKILKSEIEGEGI---PNKGTPQGGIISPLLSNIVLNELDWW 243 Query: 234 LHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVE 293 + ++ + K + Q R + RYADDF ++ + T + + Sbjct: 244 ISSQWETYKPHRISTRHLGFRQYARKYTNLKCG------YVVRYADDFKIMCR-TYDEAQ 296 Query: 294 AIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTI--PQ 350 L+ L L +N K+K+ ++ +FLG ++ + + ++ Sbjct: 297 RFYHATVDFLKSRLGLEINPKKSKVVNLKKNSSVFLGFKIKVVKQGKAKFGYIAKTSMSD 356 Query: 351 EKARNFAASLTALLWKVRI 369 + L + ++ Sbjct: 357 KAITKAKRQLKERIKGIQK 375 >UniRef50_A8ZN56 RNA-directed DNA polymerase n=2 Tax=Cyanobacteria RepID=A8ZN56_ACAM1 Length = 432 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. 
Identities = 110/382 (28%), Positives = 157/382 (41%), Gaps = 50/382 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++A+ A P L+ E L S G GVDGV K Q L Sbjct: 8 RIASRARNHPEEPFTALMHH-YSVENLRACFE---SLDGNKALGVDGVTKAEYQENLETN 63 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 LQ L +L Y+P P R+V IPK +G +RPLGI D++VQ +E I+E F Sbjct: 64 LQNLHLKLRQMSYRPQPVRQVEIPKEDGSMRPLGISCTEDKVVQEMTRRILEAIYEPVFI 123 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGFRP+RS H A+R + + WV + DL+ +FDT+ H+ ++ + RI Sbjct: 124 DTSYGFRPKRSCHDALRQLN---REVMRKPVNWVADIDLAKFFDTMPHQEILSVLSIRIK 180 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D + L+ + +KAG G G PQG ++SP+++NI L+ DQ+ Sbjct: 181 DGNLLRLIARMLKAGIQTPGGVVYDELGSPQGSIVSPVIANIFLDYVLDQWFTN------ 234 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 VR + + A RYADD + + + Sbjct: 235 ------------------VVRHHCRGY--CAIIRYADDVAAVFEH-EEDAIRFMRVLPRR 273 Query: 303 LEGSLKLRLNMDKTKIPHVND--------------GFIFLGHRLIRKRSRYGEMRVVSTI 348 LE LRLN KT + F FLG RSR G +R+ Sbjct: 274 LE-KYGLRLNTKKTHLLAFGKRNARRCFQTGQRPSTFDFLGLTHYWGRSRKGYVRMKRKT 332 Query: 349 PQEKARNFAASLTALLWKVRIS 370 +++ R L L KVR Sbjct: 333 SKKRLRRSLKQLKMWLRKVRNI 354 >UniRef50_C3KST3 Group II intron reverse transcriptase/maturase n=10 Tax=Firmicutes RepID=C3KST3_CLOB6 Length = 626 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. 
Identities = 107/377 (28%), Positives = 180/377 (47%), Gaps = 14/377 (3%) Query: 1 MQRKLATWAATDPSLR-IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ + + L++ IT E + A R + G+HT G + + Sbjct: 26 MQSIFDELYKQSKDGKQFKNLIKTITSKENILLAYRNIKKNDGSHTKGTNHKTINDIAGE 85 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 E+ + L+ Y P +R+YIPK+NG RPLGIP + DR++QR++L +EPI E Sbjct: 86 SEDEIIEYVRKRLNKFY-PHSVKRIYIPKNNGDKRPLGIPTIEDRLIQRSILQVLEPICE 144 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + FH SYGFRP RS HAI +T + +V++ D+ +FD V+H L+K + Sbjct: 145 AKFHPHSYGFRPNRSTEHAIARA---MTLINMNKLHYVVDVDIKGFFDNVNHGKLLKQLW 201 Query: 180 RRISDARFMTLLWK-TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 + + + +KA D + +G PQGG+ISPLL+N++LNE D ++ ++ Sbjct: 202 TLGIKDKKLIKIISLMLKAQIKDGSMITNPVKGTPQGGIISPLLANVVLNELDWWISSQW 261 Query: 239 LSGKAR----KDRWYWNNSIQRGRSTAVRENWQWKPA-VAYCRYADDFVLIVKGTKAQVE 293 + + + K R + N + +S R K + RYADDF + K K E Sbjct: 262 ETFETKHNYSKLRTFKNGTTTIDKSHKYRALRNGKLKEIYIVRYADDFKVFCKNPK-DAE 320 Query: 294 AIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEK 352 I + L+ L L + +K+K+ ++ FLG L ++ R + S + ++ Sbjct: 321 KIFIAIKLWLKERLDLETSPEKSKVTNLRKHPTEFLGFELKAEKKRKKYV-CQSHVSKKA 379 Query: 353 ARNFAASLTALLWKVRI 369 R + A + +++ Sbjct: 380 KRLIQEKIKAKIKELQK 396 >UniRef50_A5VLF2 RNA-directed DNA polymerase (Reverse transcriptase) n=40 Tax=Lactobacillus RepID=A5VLF2_LACRD Length = 460 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. 
Identities = 111/358 (31%), Positives = 164/358 (45%), Gaps = 48/358 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 + L+ L +A +KG G+D + L L L L G Y+P Sbjct: 42 IQDLVLDRNNLNQAYLRVKRNKG--AAGIDDMTVNDLLPYLRENKTELIASLREGKYKPA 99 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P +RV IPK NG +R LGIP + DR+VQ+A+ + PI+E F S+GFRP R H AI Sbjct: 100 PVKRVEIPKPNGGVRKLGIPTVVDRMVQQAVAQILTPIFERVFSDNSFGFRPHRGAHDAI 159 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 V D R V++ DL +YFD V+H L++K +++ I D + L+ K + +G Sbjct: 160 AKV----VDLYNQGYRRVVDLDLKAYFDNVNHDLMIKYLQQYIDDPWTLRLIRKFLTSGV 215 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 +D GLF + +G PQGG +SPLL+NI LNE D+ L Sbjct: 216 LDHGLFAKSEKGTPQGGPLSPLLANIYLNELDKELTR----------------------- 252 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + RYADD + VK +A E + LE LK+++N DKTK+ Sbjct: 253 ----------RGHHFVRYADDCNIYVKSQRAG-ERVMRSITQFLEKRLKVKVNSDKTKVG 301 Query: 320 HVNDGFIFLGHRL-------IRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS 370 FLG L + ++ + R+ + RN SLT + +++ Sbjct: 302 SPLR-LKFLGFSLGVDHNGAYARPAKQSQQRIKKALKLLTKRNRGISLTRMFEEIQRK 358 >UniRef50_D2CK02 Putative uncharacterized protein orf3 (Fragment) n=1 Tax=Candida viswanathii RepID=D2CK02_9ASCO Length = 801 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. 
Identities = 106/402 (26%), Positives = 173/402 (43%), Gaps = 43/402 (10%) Query: 5 LATWAATDPSLRI-QRLLR-LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 L + + P+ I + L + I + L A S G TPG+ N T L Sbjct: 196 LKLRSKSHPNEIIDRDLYKTFILNKDLLRTAYEKLKSRPGMMTPGI---NPTTLDGMSEE 252 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 L + ++L + ++ P +R+ IPKSNGK RPL + + D++VQ M M +E I+E F Sbjct: 253 RLDNIINKLRNKSFKFTPGKRIIIPKSNGKRRPLTLGSPEDKLVQEVMRMVLEAIYEPLF 312 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 +S+G+RP+RS H A+R + + W IEGD+ S FD + H LMK + +I Sbjct: 313 LDVSHGYRPKRSCHSALRAIFTKF-----KGCTWWIEGDIKSCFDDIPHDKLMKVLSNKI 367 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D F+ L+ K +KAG++ + GVPQG VISP+L+NI L++ D ++ S Sbjct: 368 KDQSFLELIRKCLKAGYMYQYTNKTDIIGVPQGSVISPILANIYLHQLDLFIMNIKDSFD 427 Query: 243 ARK------------------------DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYA 278 + D+ + R+ + + + Y RY Sbjct: 428 WKGPRYKHDIGHAKLQYQLRKAKKAGSDKRVLHKMAVELRNHKMNFKGERTNKLTYVRYV 487 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGH------- 330 DD+++ + G+ Q I L L ++ +KTKI + D +FLG Sbjct: 488 DDWIVAINGSHKQAVEILSSISEYCMNELGLTISPEKTKITNSYKDHILFLGTLIKHSIH 547 Query: 331 -RLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 + + I ++ L + R +G Sbjct: 548 PTFSIRNGHLKQRNSGLLILSAPMKSIQDKLINSGFLWRRTG 589 >UniRef50_C1PA09 RNA-directed DNA polymerase n=3 Tax=Firmicutes RepID=C1PA09_BACCO Length = 432 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. 
Identities = 112/395 (28%), Positives = 187/395 (47%), Gaps = 56/395 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q +L A D + + I + + L EA + + GA GVD V+ ++A Sbjct: 16 FQNRLYLAAKADRKRKFYAIYDKIYRKDILEEAWKRVKQNGGAG--GVDKVSIEDVKAYG 73 Query: 61 AVELQ-ILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +L + +EL + Y+ P RR YIPK +G+ R LGIP ++DR+VQ A + +EP++E Sbjct: 74 EEKLLNEIAEELRTEKYRCKPVRRTYIPKQDGRKRALGIPTIKDRVVQMATKIVIEPVFE 133 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 ++F SYGFRP+RS A+ + + WVI+ D+ YF +++H L+ V+ Sbjct: 134 ANFQPCSYGFRPKRSAKQAMDRIFEV---ADKGGALWVIDADIKDYFGSINHDKLLLLVK 190 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +RI+D R + L+ +KAG ++ + ++ G PQGGVISPLLSN+ LN FD Y ++ + Sbjct: 191 QRITDRRVLKLIKGWLKAGVLEDSQYSESTVGAPQGGVISPLLSNVYLNYFDIYWNKAFG 250 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 RYADDFV++ K EA+R Sbjct: 251 ------------------------------HLGELVRYADDFVILCKRLSHAEEALRAV- 279 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGH--RLIRKRSRYGEMR--VVSTIPQEK 352 + L+L L+ +KT++ + D F FLG R R R++ + + ++ Sbjct: 280 -KWIMRKLELTLHSEKTRLVDMYFGKDSFDFLGFNNRFQRFRNKSWQWYWTLQQVPSKKA 338 Query: 353 ARNFAASLTALL-----------WKVRISGEILLG 376 + A++ + V++ ++G Sbjct: 339 MKKMRANIKEVFASPSKLLLSMEKMVKLLNPKIIG 373 >UniRef50_D0LS09 RNA-directed DNA polymerase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LS09_HALO1 Length = 449 Score = 251 bits (640), Expect = 4e-65, Method: Composition-based stats. 
Identities = 112/383 (29%), Positives = 166/383 (43%), Gaps = 53/383 (13%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 +A A P + + L I WL EA R T + PG+DG L L Sbjct: 17 IAKRAREMPEVALTTLAHHI-DLVWLREAYRRTRKN---AAPGIDGQTGRAYAEALESNL 72 Query: 65 QILRDELLSG-HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + L + G Y+ +P RRV IPK +G++RPLGIP D+++QRA++M +E ++E DF Sbjct: 73 ESLLERAKDGDRYRAMPVRRVAIPKGDGRMRPLGIPTFEDKVLQRAVVMVLEAVYEQDFL 132 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYG+R RS H A++ V+ + RG WV+E D+ ++FDTV H L + +R+R+ Sbjct: 133 DCSYGYRRGRSAHDAVKAVRAH---TMKLRGGWVLEADIEAFFDTVDHAKLREILRQRVR 189 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D + + K + AG ++ G G PQGGVISP+L+NI LNE DQ+ Sbjct: 190 DGVLLRWIGKWLNAGVMEEGNVYYPEGGTPQGGVISPVLANIFLNEVIDQWFEHVVRP-- 247 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + K RYADD V+I + + + Sbjct: 248 ------------------------RLKGQGYLVRYADDLVMIFE-REDDARRVETVLPKR 282 Query: 303 LEGSLKLRLNMDKTKIPHVNDG----------------FIFLGHRLIRKRSRYGEMRVVS 346 L LR++ +KT++ F FLG RSR G V+ Sbjct: 283 L-SKYGLRIHPEKTRLIQFLRPGYGTRPTRRDGNRPGTFDFLGFTHYWARSRKGSWVVMQ 341 Query: 347 TIPQEKARNFAASLTALLWKVRI 369 ++ R R Sbjct: 342 KTAAKRLRRALGRFVEWCRDHRH 364 >UniRef50_UPI0001C42942 reverse transcriptase n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42942 Length = 644 Score = 251 bits (640), Expect = 4e-65, Method: Composition-based stats. 
Identities = 111/378 (29%), Positives = 179/378 (47%), Gaps = 21/378 (5%) Query: 1 MQRKL---ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQ 57 +Q L A + + L+ L+ + + A S+KG+ T G+D Sbjct: 14 LQNTLDEMYDLAKNN-NEPFYNLIELMKNQQTIMTALHNIKSNKGSKTVGIDNKTIDYYL 72 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 +L + Y P P RR YIPK N KLRPLGIP + DRI+Q + +EP Sbjct: 73 HLPYEDLVSQVQTCIE-DYNPEPVRRKYIPKENSDKLRPLGIPTMIDRIIQEITRLVIEP 131 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I E+ F+ SYGFRP RS HA+ + L +++ WVIEGD+ YFD ++H L+ Sbjct: 132 IAEAKFYKFSYGFRPMRSAEHAMAEI---LEKARKSKTYWVIEGDIKGYFDNINHNKLIT 188 Query: 177 AVRR-RISDARFMTLLWKTIKAGHI-DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL 234 + + I D R ++++ K +K+G + + G + G PQGG+ISPLL+NI LN FD + Sbjct: 189 MLWKIGIKDKRVLSIIKKMLKSGIVEEDGEIYPSDLGSPQGGIISPLLANIYLNFFDWMI 248 Query: 235 HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 E + + + ++ R + V RYADD+V++ +K Q + Sbjct: 249 AEEFDQHHYINNYERRDKGLRAIR--------RDHKPVYSIRYADDWVVLC-SSKKQADT 299 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPH-VNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 + + R L+ L L L+ +KTKI + V + FLG + + G+ + Sbjct: 300 LLIKIRKYLKHQLSLELSEEKTKITNLVEEKASFLGFEFFVEPRKKGKGNKMVAKMIPDR 359 Query: 354 RNFAASLTALLWKVRISG 371 + + + ++R Sbjct: 360 KKSNKKVREINREIRAIN 377 >UniRef50_Q5ZTU1 Reverse transcriptase n=1 Tax=Legionella pneumophila subsp. pneumophila str. Philadelphia 1 RepID=Q5ZTU1_LEGPH Length = 506 Score = 251 bits (640), Expect = 5e-65, Method: Composition-based stats. 
Identities = 103/386 (26%), Positives = 173/386 (44%), Gaps = 57/386 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP-EWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A + +++ L L+ A R ++KG+ TPG+DGV T + + Sbjct: 34 LQVRIAKAVSNKQHGKVKSLQWLLVNSISAKLLAVRRVTTAKGSKTPGIDGVVWTTSEEK 93 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L + Y+ P RR+YIPK NGK RPL IP L+DR +Q L+A+EP+ E Sbjct: 94 CEAV-----RNLKARGYKATPLRRIYIPKKNGKERPLSIPTLKDRAMQALYLLALEPVGE 148 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFRP+RS H AI L + +W++EGD+ + FD + H L + Sbjct: 149 TTADLNSYGFRPKRSTHDAIYQCYATL--ARKNCAQWILEGDIKACFDEIDHGWLKSNI- 205 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 I D R + + ++AG+++ + G PQGG SPLL+N++L+ ++ +H Sbjct: 206 --IIDQRVL---TQWLQAGYMEKNQLFETARGTPQGGPASPLLANMVLDGLEREIHSGCG 260 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 G + Y R+ADDF++ E + Sbjct: 261 QGN----------------------------KINYIRFADDFIVTANSPDILKEKVMPII 292 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L L L+ +KTKI H+ +GF FLG + + + + ++T ++ ++ Sbjct: 293 SNFLAQR-GLSLSQEKTKIVHIEEGFDFLGFNVRKYK-----GKFLTTPSKDSIKSVQMK 346 Query: 360 LTALLWK---------VRISGEILLG 376 + + K + I+ G Sbjct: 347 IKETVKKGYGWKGSELISALNPIIKG 372 >UniRef50_C0JX29 Putative reverse transcriptase and intron maturase n=1 Tax=Pyramimonas parkeae RepID=C0JX29_9CHLO Length = 608 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. 
Identities = 114/406 (28%), Positives = 186/406 (45%), Gaps = 59/406 (14%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + + L RL+ + L A + S G TPG D + L + Sbjct: 21 LKKRNCENKEAVNKDLYRLLCNKQLLTLAYNLIKSKPGNMTPGTDKLT---LDKMSERLI 77 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 +L ++ P RRV+IPK N K +RPLG+P+ RD++VQ+AML+ M+ I+E+ F Sbjct: 78 DKTSRQLRDQTFKFKPVRRVFIPKGNSKDIRPLGVPSSRDKVVQKAMLLIMDNIYETTFS 137 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 T S+GFRP RS H A++ ++ + + +W IEGD+ +D V+H++L+ +R +I Sbjct: 138 THSHGFRPGRSCHSALKEIRSE-----WSGIKWAIEGDIKGCYDNVNHQILINILREKIK 192 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS--- 240 D RF+ LLWK ++AG + G PQGG++SPLL+NI LNEFD+++ Sbjct: 193 DERFIQLLWKLLRAGIEVNRTIERSKIGTPQGGILSPLLANIYLNEFDKFVSNLSQKIGL 252 Query: 241 --GKARKDRWYWNNSIQRGRSTAVREN--------------------------------- 265 K R+D ++ + + Sbjct: 253 TYNKTRRDNPEYHKIRGKIYRLRTKRTVSGTVNVQPSKSDLKQIQILSKTQRTLPSKDPF 312 Query: 266 WQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDG 324 + + RYADD+++ V G + I+E+ + L+ L+L L+ +KTKI + Sbjct: 313 DPQYRKILFIRYADDWIVGVIGGHEFAQGIKEQIQTFLKEQLELTLSPEKTKITPFSSKK 372 Query: 325 FIFLGHRLI-----------RKRSRYGEMRVVSTIPQEKARNFAAS 359 FLG++L R + R + IP +K + Sbjct: 373 VTFLGYKLQISARSSYSSSGRHKKRTVGWQPKLYIPMDKIVRKLSE 418 >UniRef50_B7CEC9 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CEC9_9FIRM Length = 458 Score = 250 bits (638), Expect = 7e-65, Method: Composition-based stats. 
Identities = 101/366 (27%), Positives = 166/366 (45%), Gaps = 39/366 (10%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 S R +L L+ + + + + K PG+D V K + Q L + L L S Sbjct: 40 SKRYPKLETLMYRVDKESLIRQH-RLQKKDKAPGIDMVTKEVYQENLNENIDDLMHRLKS 98 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y+P P RRV I K NGK RPLGIP DR+ Q AM + ++E F SYGFRP R Sbjct: 99 FSYKPQPVRRVEIDKGNGKKRPLGIPVYEDRLFQGAMADILSDVYEPRFLDCSYGFRPNR 158 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H AI+ + + + ++++ D+ +FD V+H LMK +R I+D ++ + + Sbjct: 159 KAHDAIKVINDTV---MHKKINYILDCDIKGFFDNVNHEWLMKFLRNDIADPNYLKYIAR 215 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNN 252 +K+G + G + S G PQGG+ISP+L+N+ L+ D + + Sbjct: 216 MLKSGVMIEGKYEDTSVGTPQGGLISPILANVYLHYVLDLWFEKCIKK------------ 263 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 Q R+ADDF+++ + + + + E +E L L+ Sbjct: 264 --------------QLCGEAYLVRFADDFLIMFQ-YERDAQRVYEAVINRME-LFGLELS 307 Query: 313 MDKTKIPHV------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 +KT+I + F FLG ++R G V I ++K + F ++L + + Sbjct: 308 KEKTRILPFGRYSDSRETFDFLGFTHFNSKTRKGYYSVGHKISRKKKKQFKSNLKKWVKE 367 Query: 367 VRISGE 372 R + Sbjct: 368 NRNTQF 373 >UniRef50_B7JTB6 Group II intron reverse transcriptase/maturase n=6 Tax=Bacillus cereus RepID=B7JTB6_BACC0 Length = 599 Score = 250 bits (638), Expect = 7e-65, Method: Composition-based stats. 
Identities = 110/389 (28%), Positives = 190/389 (48%), Gaps = 30/389 (7%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTML-QAR 59 +Q L + + + LL ++ E + A R S+ G+ TPG D L Sbjct: 17 IQNDLFQRSREG-TKNFKNLLEIVISDENILLAYRQVKSNTGSKTPGTDDKTILDLANTN 75 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + +R+ +L+ Y+P RRV+I K+ + RPLGIP ++DRIVQ+ L +EPI Sbjct: 76 QDEFIHYMRELVLN--YKPKSVRRVWIDKNYSKGKRPLGIPCIQDRIVQQMFLNVLEPIC 133 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E F+ SYGFRP R+ HA+ V+ + + + ++ D+ +FD V+H +L+K V Sbjct: 134 EGKFYNHSYGFRPTRTTRHAVARVQTLV---NINKYHYTVDIDIKGFFDNVNHSILLKQV 190 Query: 179 RR-RISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 I D R + ++ K +KA G+ ++GVPQGG++SPLLSNI+LN+ DQ++ ++ Sbjct: 191 WNIGIRDKRVIAVISKMLKAPIKGEGI---PTKGVPQGGILSPLLSNIVLNDLDQWVADQ 247 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + + R S+ + +R N + K RYADDF ++ T Sbjct: 248 WECFETRYQY-----SVNYSKYVNLRRNSKLKEG-FLVRYADDFRIMTN-THDSAVKWFH 300 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 L LKL ++ +K+KI ++ FLG++ + + + S I +K + Sbjct: 301 AVVDFLNKRLKLEISPNKSKIINLRKKSSSFLGYKF-KSTIKGNKRVFFSHIDDDKQKQI 359 Query: 357 AASLTALLWKVR---------ISGEILLG 376 L +++++ + ++LG Sbjct: 360 ITKLKERIYEIQKHPSAGNANLYNSVVLG 388 >UniRef50_B0JX80 Reverse transcriptase n=82 Tax=Bacteria RepID=B0JX80_MICAN Length = 613 Score = 250 bits (638), Expect = 7e-65, Method: Composition-based stats. 
Identities = 104/410 (25%), Positives = 177/410 (43%), Gaps = 73/410 (17%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ A + +++RL RL+ + + L ++T ++G T GVDG+ + Sbjct: 37 LQKRIYQAAKSGQDAKVRRLQRLLVKSYYARLLAVRKVTQDNQGKKTAGVDGMIAISPEQ 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 RL L +E+ G + P RRV+IPK + RPLGIP ++DR Q + A+EP Sbjct: 97 RLN-----LTEEIK-GTLKAKPLRRVWIPKPGRDEKRPLGIPTIKDRARQALIKSALEPE 150 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WES SYGFRP RS H AI + + + +V++ D++ FD ++H L+ Sbjct: 151 WESKMEGTSYGFRPGRSDHDAISRIYITIN----QSSYFVLDADIAKCFDRINHDFLLSK 206 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + + + +KAG +D G+F G PQGGVISPLL+NI L+ + + Sbjct: 207 IH---CPSSLKRDIKQWLKAGVLDNGVFEETETGTPQGGVISPLLANIALDGMARLIETL 263 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + K RYADDFV+ + + +E + Sbjct: 264 FPK------------------------KGNGKNQAVLIRYADDFVV-ISPSLEIIEQCKT 298 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND-----------GFIFLGHRLIRKRS------RYG 340 L+ + L L +KT++ H GF FLG + + + G Sbjct: 299 AISEWLK-PIGLELKPEKTRVCHTLKPIEYNGKMEEPGFDFLGFNIRQYPVGKYKSGKDG 357 Query: 341 EMRVV-----STIPQEKARNFAASLTALLWKVRIS---------GEILLG 376 R++ Q+ + ++ ++ K + + I+ G Sbjct: 358 AKRLIGHKTHIKPSQKAVKTHTEAIKGVIKKHKTAPQSALISRLNPIIRG 407 >UniRef50_C8VXL4 RNA-directed DNA polymerase (Reverse transcriptase) n=5 Tax=Bacteria RepID=C8VXL4_DESAS Length = 434 Score = 249 bits (637), Expect = 9e-65, Method: Composition-based stats. 
Identities = 125/390 (32%), Positives = 190/390 (48%), Gaps = 52/390 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 ++RK+ A D + R L + + E L EA ++ + G PG+DG+ ++A Sbjct: 11 LRRKIYIKAKADKTWRFWGLYVHVCKIETLQEAYKMAKNKNG--APGIDGITFDNIEASG 68 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + + LQ ++ EL+SG Y P RR IPK +GK R LGIP +RDR+VQ A+ + +EPI+E Sbjct: 69 IEIFLQQIQKELISGTYWPTQNRRKEIPKGDGKYRILGIPTIRDRVVQGALKLILEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYG+RP+R+ H AI V + + VI+ DL SYFDTV H LL+K V Sbjct: 129 ADFQEGSYGYRPKRNPHQAIDRVAKAVVENKTR----VIDLDLRSYFDTVRHDLLLKKVA 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +R++D M LL +KA GVPQGGVISPLL+N+ LNE D+ L + Sbjct: 185 KRVNDENVMRLLKLILKAS---------GKRGVPQGGVISPLLANLYLNEVDKMLEKAKE 235 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + Y R+ADD V+++ + Sbjct: 236 -----------------------VTRHEQYTHIEYARFADDIVILIDAYPKWNWLEKAVY 272 Query: 300 RGVLEG--SLKLRLNMDKTKIPHV--NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARN 355 + +LE L ++LN +KT+I ++ + F FLG R R+R G+ V+ T + Sbjct: 273 QRLLEELTKLDVQLNEEKTRIVNLANGESFGFLGFDFRRSRTRKGKWGVLFTPKMKARTK 332 Query: 356 FAASLTALLWKVR---------ISGEILLG 376 L + + + +L G Sbjct: 333 ILTELKETFRRFQSQPVSRVIELINPVLRG 362 >UniRef50_C1L365 Group II intron-encoded protein n=1 Tax=Bacillus thuringiensis RepID=C1L365_BACTU Length = 598 Score = 249 bits (637), Expect = 9e-65, Method: Composition-based stats. 
Identities = 96/363 (26%), Positives = 177/363 (48%), Gaps = 18/363 (4%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + + L I + A S++G+ TPG D + + E+ L Sbjct: 23 KSKNGGNFKDLYSFIIDERNIKLAFGTIKSNQGSKTPGTDQITINEYKGFSDDEIIHLVR 82 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 E L +++P RRV+IPK++GK RPLGIP + DRI+Q+ +EPI E+ F+ SYGF Sbjct: 83 EKLI-NFKPDSVRRVFIPKADGKSRPLGIPTMLDRIIQQCFKQILEPIVEAKFYEHSYGF 141 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR-RISDARFM 188 RP RS HHAI + + + ++ D+ +FD ++H L+K + + D + + Sbjct: 142 RPLRSTHHAIARTNFLI---NINKLHYCVDIDIKGFFDNINHNKLIKQLWNIGVRDKQVL 198 Query: 189 TLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 ++ K +K + G A +G PQGG++SPLL+N++LN+ D ++ ++ + Sbjct: 199 AIIKKMLKCEIVGEG---KAEKGTPQGGILSPLLANVVLNDLDHWVASQWHNFPTNTHYE 255 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 + + T +++ RYADDF + + + + +E +LK Sbjct: 256 DNRPRYRVQKKTKLKQG-------FIVRYADDFKIFTNSYNS-AKRWFHAVKNYIEKNLK 307 Query: 309 LRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 L ++ K+KI ++ +FLG + ++ + S IP++ ++ A++ + + Sbjct: 308 LEISESKSKITNLRKRKSLFLGIE-FKAVAKKKKFVAQSYIPKDSMKSIQANIKKKVKSI 366 Query: 368 RIS 370 RI+ Sbjct: 367 RIN 369 >UniRef50_Q24QQ9 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense Y51 RepID=Q24QQ9_DESHY Length = 591 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. 
Identities = 111/363 (30%), Positives = 166/363 (45%), Gaps = 35/363 (9%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 T+P + L RL + A S+ GA T G DG + L + Sbjct: 15 RRNTTNPGYVNEDLYRLFYSRDLYIIAYNSVKSNDGAETSGADGTS---LHGFCEEWITQ 71 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L + YQP P R IPK +GKLR L P +D++VQ A+ + +E I+E F LS Sbjct: 72 LITSMRDESYQPQPNRTTMIPKKSGKLRKLSFPNGKDKLVQEAIRIILECIYEPTFSNLS 131 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFRP+RS AI V+ W IEGD+S+ FD + HR L +R RI D R Sbjct: 132 HGFRPKRSTQSAIAEVQTW------RGTIWFIEGDISACFDDIDHRTLETILRERIRDER 185 Query: 187 FMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA-- 243 F+ L+ K +KAG+ D L++ G QG SPLL NI L++ D+++ Sbjct: 186 FIRLVNKVLKAGYFDMQHLYQKTKTGNAQGSCCSPLLCNIYLDKLDKFMENVMEQDTMGG 245 Query: 244 --------RKDRWYWNNSIQRGRSTAVREN--------------WQWKPAVAYCRYADDF 281 K R+ + +++ G ++ V Y RYADDF Sbjct: 246 YRRQNPDYAKARYLYKKALKSGSDPQTVQHLKRTMEHLPTTDRYDPNFRRVNYVRYADDF 305 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYG 340 ++ V +K ++ + L+ L LRL+ +KTKI H D FLG+ L + ++ Sbjct: 306 LIGVIASKKYALDLKLNLKEFLQNELSLRLSDEKTKITHAADKHVSFLGYILRKGSVKHS 365 Query: 341 EMR 343 + + Sbjct: 366 KFQ 368 >UniRef50_Q9T654 Cox1I1a maturase (Fragment) n=2 Tax=cellular organisms RepID=Q9T654_SCHPO Length = 787 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. 
Identities = 115/378 (30%), Positives = 190/378 (50%), Gaps = 44/378 (11%) Query: 5 LATWAATDPSLRIQRLLRL-ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 L+ + + + + + + A + S+KG+ T G+D + L E Sbjct: 183 LSDIQKESRNNLVSNIYKRCLLNEDLFIAAYQKISSNKGSVTAGIDKIT---LDGYSINE 239 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ ++L +Q PARR YIPK+NGKLRPLGIP+ RD+IVQ+ M+ +E I+E F Sbjct: 240 IKKTIEQLKDHSFQFKPARREYIPKANGKLRPLGIPSPRDKIVQQVMVFVLESIFEQKFL 299 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S GFRP + H A++++ G WVIEGD+ SYFD + H+ L+ + I+ Sbjct: 300 DCSNGFRPNKGTHTALKSI------AGWKALDWVIEGDIKSYFDLIDHQTLISLLSNVIN 353 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASE--GVPQGGVISPLLSNIMLNEFDQYLHER---- 237 D F+ L WK I+AG+++V + + G PQG V+SP+L+NI L+EFD+++ E+ Sbjct: 354 DKEFIDLCWKAIRAGYVEVKMNKKIDTIIGTPQGSVLSPILANIYLHEFDKFMMEKVNLS 413 Query: 238 ---------------------YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKP------ 270 Y+ RK+ N ++ + + N Sbjct: 414 LDSGSTSKRFKPYRLLEAKINYIYQLERKNGSLTNEQVKSLKKLTIERNKLPSTIGGPGY 473 Query: 271 AVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFI-FLG 329 + Y RYADDF++ + G + ++ E L +LKL ++++KTK+ ++ D + FLG Sbjct: 474 RIYYVRYADDFLIGINGKRTLALQLKSEINEFLTNTLKLTMSVEKTKVTNIKDDYALFLG 533 Query: 330 HRLIRKRSRYGEMRVVST 347 + R SR +V+S Sbjct: 534 AEIHRLTSRNNNSKVISK 551 >UniRef50_A9BGC0 RNA-directed DNA polymerase (Reverse transcriptase) n=49 Tax=Bacteria RepID=A9BGC0_PETMO Length = 472 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. 
Identities = 113/375 (30%), Positives = 170/375 (45%), Gaps = 45/375 (12%) Query: 6 ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQ 65 + D + +L I + + +A + ++KG PG+DG+ L L + Sbjct: 39 SERGRNDDKGCSEGMLEKILSKDNMNKAYKKVKANKG--APGIDGMKVEELFGYLRQHGE 96 Query: 66 ILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTL 125 LR ELL G Y P RR IPK +G R LGIP DR++Q+++ + PI+E F Sbjct: 97 ELRQELLEGRYTPKSVRRKEIPKPDGGKRLLGIPTSIDRVIQQSIAQVLTPIYEKKFVDN 156 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 SYGFRP R AIR K L WV++ DL YFDTV+H LM+ + + + D Sbjct: 157 SYGFRPLRDAKQAIRKSKEYLN----KGHTWVVDIDLERYFDTVNHDKLMRIISKDVKDG 212 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 R ++L+ K +K+G + G+ EG PQGG +SPLLSNIML+E D L Sbjct: 213 RVISLIRKYLKSGVMVNGVVIETEEGTPQGGPLSPLLSNIMLHELDVEL----------- 261 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 K +CRYADD + VK K+ + E +E Sbjct: 262 ----------------------TKRGHKFCRYADDCNIYVKSEKS-AYRVMESITKYIEK 298 Query: 306 SLKLRLNMDKTKIPHVNDGFIFLGHRLI----RKRSRYGEMRVVSTIPQEKARNFAASLT 361 LKL++N K+K+ D +LG + R + + K +S Sbjct: 299 KLKLKVNSKKSKVVRPWD-LKYLGFSFYVKEEKYEIRVHGKSIKEFKKKLKGETKRSSGR 357 Query: 362 ALLWKVRISGEILLG 376 ++ +++ +I+ G Sbjct: 358 SMAYRLSRIKQIITG 372 >UniRef50_Q64E53 Prophage LambdaSa1 transcriptase/maturase family protein n=1 Tax=uncultured archaeon GZfos14B8 RepID=Q64E53_9ARCH Length = 430 Score = 249 bits (635), Expect = 1e-64, Method: Composition-based stats. 
Identities = 107/364 (29%), Positives = 169/364 (46%), Gaps = 48/364 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 L+ + L EA ++GA G+D V + L L ++ L Y Sbjct: 49 WHSLIDKVWNWRNLNEAWEKVKQNRGAG--GIDDVTIDEFERNLEQNLNEIQRLLRQDRY 106 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P +RVYIPK +GK RPLGIP +RDR+VQ+A+ +EPI+E++F S+G+RP +S Sbjct: 107 VPKPVKRVYIPKPDGKQRPLGIPTIRDRVVQQALKNVIEPIFEAEFLDSSFGYRPGKSAK 166 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 AI ++ + WV++ D+ ++FDTV+H L+ AV RISD R + L+ ++ Sbjct: 167 QAIEQIE----TVRDEGHEWVVDADIKAFFDTVNHEKLIDAVAERISDGRVLGLIRAFLE 222 Query: 197 AGHIDVGLFR-AASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 A ++ G R G PQGGVISPLL+NI L+ FD+ + E Sbjct: 223 ADIMEQGQGRAKNVVGTPQGGVISPLLANIYLHYFDERMAE------------------- 263 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 RYADD +++ + EAI + E L L+ K Sbjct: 264 --------------LGFEVVRYADDVLVLCGSEEEAEEAISHVKEILEELEL--TLHPQK 307 Query: 316 TKIPHVNDGFIFLGHRLIRKRS---RYGEMRVVSTIPQEKARNFAASLTALLWKVRISGE 372 TKI + ++G FLG + + + + + RN +L ++ + Sbjct: 308 TKIKNFSEGVDFLGFTVYVSHKVPRKEAVRKYKGAVRRATRRNLPINLEMVIQGL---NP 364 Query: 373 ILLG 376 +++G Sbjct: 365 VVIG 368 >UniRef50_C4K5N9 Group II intron encoded reverse transcriptase n=10 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N9_HAMD5 Length = 570 Score = 249 bits (635), Expect = 2e-64, Method: Composition-based stats. 
Identities = 103/371 (27%), Positives = 170/371 (45%), Gaps = 38/371 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +++++ +AT +++ L +L+ + L ++T ++G HT GVD Sbjct: 32 LRQRIYRASATGDLKKVRNLQKLMMKSRANHLLAIRKVTQVNRGKHTAGVDNQVIN--DH 89 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + L L + S + P +RVYI K NGK RPLGIP + DR Q + A+EP W Sbjct: 90 KGREHLYKLLSQTTSE--KVYPVKRVYIAKKNGKKRPLGIPTILDRCRQAIVKSALEPYW 147 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F +SYGFRP RS H AI+ + G WV++ D+ FD + H L+K + Sbjct: 148 EAKFEPVSYGFRPGRSAHDAIQKIFCIARARGTRH--WVLDADIKGAFDNIDHNFLIKKI 205 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 R ++ + ++AG ++ G + G PQGG+ISPLL+NI L+ + L +Y Sbjct: 206 --GGFPER--NMIKQWLQAGVLEHGNYIPNVAGTPQGGIISPLLANIALHGMETLLGIQY 261 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 K + A RYADDFV+ K + + E + + Sbjct: 262 WKNGTPKQGQPY----------------------AVVRYADDFVVFGKS-REECETAKIK 298 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVS--TIPQEKARNF 356 + L L L+ +KT I H+ +GF FLG + +R+ + + +E + + Sbjct: 299 LQIWLAQR-GLALSEEKTSIKHLKEGFDFLGFNIRHYDNRHRKRGYILLTKPSKESMKRY 357 Query: 357 AASLTALLWKV 367 + + Sbjct: 358 KQQMRMTWKGI 368 >UniRef50_Q02717 Reverse transcriptase homologue COI ialpha grp II protein (Fragment) n=3 Tax=Podospora anserina RepID=Q02717_PODAN Length = 788 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. 
Identities = 115/369 (31%), Positives = 177/369 (47%), Gaps = 49/369 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 +I + E L A S G TP VD L + + ++L S ++ P Sbjct: 186 YEVICKLEALYTAYMNIKSEPGNMTPRVD---SETLDGISKEWFEKISEQLKSEQFRFRP 242 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 RRVYIPK+NGK+RPLGI + RD+IVQ +E + E FH+ S+GFRP R H A+ Sbjct: 243 TRRVYIPKANGKMRPLGIASPRDKIVQEVFRAILEQVLEPRFHSSSHGFRPGRGCHSALA 302 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 T++ +W IEGD+ +FD + H +L K + + D RF+ L WK +KAG++ Sbjct: 303 TIRYW------NGIKWFIEGDIKGFFDNIDHHILEKLLVKHFQDQRFIDLYWKMVKAGYV 356 Query: 201 DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYL-------HERYLSGKARKDRWYW--- 250 + +++ GVPQGG+ SP+LSN++LNE D+++ +E+ GK + Sbjct: 357 EFDKDKSSIIGVPQGGIASPILSNLVLNELDEFVQNIVDEFNEKLKGGKHTSKNPAYVVI 416 Query: 251 -----------------------NNSIQRGRSTAVRENWQW------KPAVAYCRYADDF 281 ++R + VR + Y RYADD+ Sbjct: 417 DSRIGKITRLERKLKSKGQELDSGRKLERMKLIKVRATMPSMIPNPDLAKIYYVRYADDW 476 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYG 340 ++ V G+ AI+E L+ LKL L+M+KT I + ++ FLG + R S G Sbjct: 477 LIGVAGSSETARAIKERIAAYLKDILKLELSMEKTLITNASEDKAYFLGTEIQRISSVKG 536 Query: 341 EMRVVSTIP 349 E++ I Sbjct: 537 EIKRFKNIK 545 >UniRef50_Q11ZP4 RNA-directed DNA polymerase n=33 Tax=Bacteria RepID=Q11ZP4_POLSJ Length = 461 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. 
Identities = 105/349 (30%), Positives = 168/349 (48%), Gaps = 46/349 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 LL E L A + ++KG GVDG++ + +R ELL+G Y+ Sbjct: 51 GGLLEAALTRENLQVAWKRVKANKG--AAGVDGLDIEHTAQTIRNHWSQIRQELLAGTYR 108 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P RRV IPK +G R LGIP + DR++Q+A+L ++P+ + F S+GFRP R H Sbjct: 109 PSPVRRVMIPKPDGSQRELGIPTVLDRLIQQALLQVLQPLIDPTFSEHSHGFRPGRRAHD 168 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A++ + + + R V++ DLS +FD V+H +L+ +R+R+ DA + L+ + A Sbjct: 169 AVKAARAHVQ----SGKRVVVDVDLSKFFDRVNHDILIDRLRKRVDDAGVIRLIRAYLNA 224 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G +D G+ +G PQGG +SPLL+N++L+E D+ L Sbjct: 225 GIMDGGVVMDRQQGTPQGGPLSPLLANVLLDEVDKVLEA--------------------- 263 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 ++ RYADD + V KA + + E R L LKL++N K+ Sbjct: 264 ------------RGYSFARYADDCNVYVGSVKAG-QRVMELLRK-LYAGLKLQINEAKSA 309 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 + G FLG+ L + + + +V R+F A + L + Sbjct: 310 VASAF-GRKFLGYALWVAKGKEVKCKVAEKP----LRDFKARIRQLTSR 353 >UniRef50_A7GTD4 RNA-directed DNA polymerase n=1 Tax=Bacillus cytotoxicus NVH 391-98 RepID=A7GTD4_BACCN Length = 543 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. 
Identities = 110/419 (26%), Positives = 178/419 (42%), Gaps = 63/419 (15%) Query: 8 WAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL 67 A+ I RL+ + E +A + G T G ++ + Sbjct: 13 RKASQNGKIITDCYRLMYKRELWIKAYVKLYPNAGNLTKGTSEETIDGF---YLQKIDEI 69 Query: 68 RDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 D+L G ++ P RR YI K+NGK RPLG+P +D++VQ M M +E ++E F S+ Sbjct: 70 IDQLKKGMFRFAPVRRAYISKANGKKRPLGVPNFKDKLVQEVMRMILENVYEPTFSDNSH 129 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFR RS H A+ +K W IEG + +FD + H +L+ + +R++D RF Sbjct: 130 GFREGRSCHTALSQIKNT-----WKGLTWCIEGAIKGFFDHIDHSVLINLISKRMNDHRF 184 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG---KAR 244 + L+ + +G ++ ++ G PQGG++SPLL+NI L+EFD +L ++ K R Sbjct: 185 LLLIHNALASGVMENWTYQKTYSGTPQGGILSPLLANIYLHEFDIFLEKQIEKFDKEKLR 244 Query: 245 KDRWYWNNSIQRGRS-------------------------------------TAVRENWQ 267 + RS ++V Sbjct: 245 ARNKEYTKIHSEIRSLSRKVKSLDDRTGHRLWKGREKVIETIAELKRKQIGISSVNPMDN 304 Query: 268 WKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIF 327 + Y RYADDFV+ + G+K I+E + L+ L L L+ +KT I H+ + F Sbjct: 305 DYQKMKYVRYADDFVIGIAGSKDCAVNIKETIKNFLKQELHLELSEEKTLINHLENPISF 364 Query: 328 LGHRLI------RKRSRYGEMR-------VVSTIPQEKARNFAASLTALLWKVRISGEI 373 LG+ R R Y + + IP++K + FA + + I Sbjct: 365 LGYEFRKWNEIKRTRVLYKNHKQRALSRAIKLEIPKKKMKEFA--IKNGYGNLDNFKSI 421 >UniRef50_C3BJV7 D-alanine--D-alanine ligase A (D-alanylalanine synthetaseA) n=2 Tax=Bacillus RepID=C3BJV7_9BACI Length = 620 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. 
Identities = 111/373 (29%), Positives = 178/373 (47%), Gaps = 19/373 (5%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 Q +L + + R L + + A +++GA+TPGVDG + Sbjct: 17 QDQLYELSK--KNTRFHSLYEMAFNETTIITAIHKIKANRGANTPGVDGHDIRRYLQMDK 74 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 + L + + +Y+ PARRVYI K++G RPLGIP + DRI+Q + +EPI E+ Sbjct: 75 NNVIKLITK-AARNYKSKPARRVYIEKADGSQRPLGIPTVVDRIIQECIRTILEPIVEAK 133 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F+ SYGFRP RS HA+R V + T+ + IEGD+ YFD ++HR L+K + R Sbjct: 134 FYDHSYGFRPYRSSKHAVRQVNHFINT---TKSYYAIEGDIKGYFDNINHRFLIKKLWRL 190 Query: 182 ISDARFMTLLWKT-IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLS 240 + + + + +KAG+++ +G PQGG+ISPLL+N+ LN+FD + R+ Sbjct: 191 GIRDKRIIKIIQIMLKAGYMEYDFKFTTEKGTPQGGIISPLLANVYLNDFDWMVARRFYK 250 Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 K + R R Q + RYADD++++ + T + E R Sbjct: 251 AKPTGIS-------KEPRKQRERLVRQGRNKCYLVRYADDWIILTQ-TYQEARRYLEYLR 302 Query: 301 GVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLI---RKRSRYGEMRVVSTIPQEKARNF 356 LKL L+ +KT I + + +FLG + RS G + + K Sbjct: 303 KYFRIKLKLELSKEKTVITDLREKPALFLGFDIYAETPLRSNSGNIVGKNKPNHRKVSGQ 362 Query: 357 AASLTALLWKVRI 369 + + + K+R Sbjct: 363 ISKVCKEIRKMRK 375 >UniRef50_B3GTB4 Putative reverse-transcriptase protein n=1 Tax=Volvox carteri f. nagariensis RepID=B3GTB4_VOLCA Length = 749 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. 
Identities = 100/382 (26%), Positives = 172/382 (45%), Gaps = 34/382 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + D + L+ +I+ L A + S G TPG+ + L A Sbjct: 217 LKKANSIDLARVNNGLIHIISDTNLLIFAYELLKSKSGYMTPGI---TEESLDAIDLAWY 273 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKL-RPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + + +++ +G ++ ARRV IPK RPLG+ + RD++VQ+A+ + ++ I++ F Sbjct: 274 KHISNDIKAGKFKFSQARRVMIPKPGKSELRPLGVVSPRDKVVQKALELVLQCIFDPMFL 333 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S+G+RP +S H A++ + Q + WVI+GD+S FDT+ H +LM + +RIS Sbjct: 334 DCSHGYRPGKSQHTALKMLDQQFKNA-----TWVIKGDISKCFDTIDHEILMHLIGKRIS 388 Query: 184 DARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYL 239 + + L+ +KAG +D L+ + G P V SPLL NI ++EFD ++ + ++ Sbjct: 389 CNKTLALIKSALKAGNVLDGKLYANEAVGTPHSSVSSPLLCNIYMHEFDLFVKDIIVKFN 448 Query: 240 SGKARKDRWYWNNSIQRG--------------------RSTAVRENWQWKPAVAYCRYAD 279 G R+ + + R V + Y R+AD Sbjct: 449 KGTKRRQNPEYTKILNMLYKALEQFNFSKYAKLRKDLRRVRQVNIMDLDYVRIKYVRFAD 508 Query: 280 DFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSR 338 DFV+ + G + + L LKL LN KT I + FL ++ + + Sbjct: 509 DFVISIIGPYKLACDTKVMVKDFLMNKLKLTLNESKTAITKFSKKPIYFLDTEIMNRYPK 568 Query: 339 YGEMRVVSTIPQEKARNFAASL 360 +++V + K N L Sbjct: 569 VKPVKLVKRLGVSKLANVTPRL 590 >UniRef50_Q0AW97 RNA-directed DNA polymerase (Reverse transcriptase) n=24 Tax=cellular organisms RepID=Q0AW97_SYNWW Length = 443 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. 
Identities = 104/380 (27%), Positives = 161/380 (42%), Gaps = 52/380 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++A A +P R L+ I E L E G+ GVD V K + L Sbjct: 7 RIAEIARQNPKERFTALIHHI-NHETLKECHLEI---SGSKASGVDQVTKQAYEENLEAN 62 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + L + Y+P P RRVYIPK K RPLGIP+ D++VQ+ + + I+E DF Sbjct: 63 IADLIGRMKRQAYKPQPVRRVYIPKEGSNKRRPLGIPSYEDKLVQKGLARILNTIYEQDF 122 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP R H A++ + + + ++++ D+ +FD V H +MK + RI Sbjct: 123 LDCSFGFRPGRGCHDALKVLNHIIE---RKKVNYIVDADIRGFFDHVDHEWMMKFLELRI 179 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSG 241 +D + L+ + +KAG ++ G+ +G PQGG++SP+L+NI L+ D + + Sbjct: 180 ADPNLLRLIKRFLKAGVMEAGIVYDTPKGTPQGGIVSPILANIYLHYVLDLWFEKVVKK- 238 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 + + RYADDFV + K+ E R Sbjct: 239 -------------------------RCQGEAYLVRYADDFVCCFQN-KSDAEWFYANLRE 272 Query: 302 VLEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRVVS 346 L L + +KT+I D F LG +S+ G RV Sbjct: 273 RL-NKFNLEVAEEKTRIIAFGRFADKESKKQGRKKPDTFDLLGFTHYCSKSKKGWFRVKR 331 Query: 347 TIPQEKARNFAASLTALLWK 366 Q+K R+ L K Sbjct: 332 KTSQKKYRSSLLKCKTWLRK 351 >UniRef50_Q08WW1 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=2 Tax=Bacteria RepID=Q08WW1_STIAU Length = 421 Score = 247 bits (631), Expect = 4e-64, Method: Composition-based stats. 
Identities = 104/360 (28%), Positives = 162/360 (45%), Gaps = 49/360 (13%) Query: 25 TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRV 84 EWL A T G+D +A L V L+ L + + SG Y+ P RR Sbjct: 10 LDEEWLRYAYEQTRK---DGAAGIDRQTAKDYEANLEVNLKSLLERIKSGRYKAPPVRRT 66 Query: 85 YIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 YIPK++G RPLGIP D++ QRA+++ +EPI+E DF S+GFRP RS H A+R ++ Sbjct: 67 YIPKADGSQRPLGIPTFEDKVAQRAIVLLLEPIYEQDFRPFSFGFRPGRSAHQALRELRS 126 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGL 204 + + GRWV++ DL YFDT+ H L + + RR++D ++ K +KAG ++ G Sbjct: 127 SILERN---GRWVLDVDLRRYFDTIEHGKLREVLARRVADGVVRRMIDKWLKAGVLEEGP 183 Query: 205 FRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 +G PQGGVISPLL+N+ L+ D++ + Sbjct: 184 LLRLEQGTPQGGVISPLLANVYLHYVLDEWYEREVVP----------------------- 220 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN- 322 + K + RYADD V++ + + E L L L+ KT++ Sbjct: 221 ---RMKGKCSLIRYADDLVMVFEDF-LDCRRVLEVLGKRL-AKYGLTLHPGKTRMVDFRF 275 Query: 323 -------------DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRI 369 F FLG + +S+ G+ V + + ++ + R Sbjct: 276 KRPGGGQHPATQATTFDFLGFTHVWGKSQRGKNVVYQVTAKSRYARAVKAVWEWCKRNRH 335 >UniRef50_A5VH22 RNA-directed DNA polymerase n=1 Tax=Sphingomonas wittichii RW1 RepID=A5VH22_SPHWW Length = 572 Score = 247 bits (631), Expect = 4e-64, Method: Composition-based stats. 
Identities = 115/384 (29%), Positives = 169/384 (44%), Gaps = 60/384 (15%) Query: 44 HTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRD 103 TPGVDG L L + G Y+P P RRVYIPK NGK+RPLGIP D Sbjct: 1 MTPGVDGQT---FDGMTLARLDRLTQGVAEGRYRPRPVRRVYIPKGNGKMRPLGIPTADD 57 Query: 104 RIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLS 163 RIVQ A M + I+E F S+GFR RS H A+ ++ T +W+IE D+ Sbjct: 58 RIVQEAARMILAAIYEPVFSKHSHGFRAGRSCHTALEEIRRT-----WTGAKWLIEVDVR 112 Query: 164 SYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLS 223 +FD + H +L+ + RRI D F+ L+ +KAG +D F G PQGGVISPLL+ Sbjct: 113 GFFDNIDHDILLSLLARRIDDPVFIDLIGTMLKAGCMDEWKFERTYSGTPQGGVISPLLA 172 Query: 224 NIMLNEFDQYLHE---RYLSGKARKDRWYWNNSIQ------------------------- 255 NI L+E D ++ E R+ G R+ + Q Sbjct: 173 NIYLHELDLFMEEMRARFDKGVKRRANPVYVVQSQKIAALRKEIDAIRAVGADEAEVRTR 232 Query: 256 ----------RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 R + ++V + + YCRYADDF++ V G+KA I + + L Sbjct: 233 LARIEAINRDRRKISSVDQMDPNFRRLRYCRYADDFLVGVIGSKADAVRIMADIQHFLAD 292 Query: 306 SLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRY------------GEMRVVSTIPQEKA 353 L L ++ +KT + + G FLG + R G R++ + Sbjct: 293 RLNLTVSPEKTGVRDASRGSPFLGFHVCAFTLRSPGTMAGRQAVGGGMRRILRRPTRGNI 352 Query: 354 RNF--AASLTALLWKVRISGEILL 375 + + + A + ++ + Sbjct: 353 KLWVPRDRVYAFCRRKKLGNLDMR 376 >UniRef50_A5ZWA2 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5ZWA2_9FIRM Length = 428 Score = 247 bits (630), Expect = 5e-64, Method: Composition-based stats. 
Identities = 112/370 (30%), Positives = 176/370 (47%), Gaps = 42/370 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 +Q KL A R L + + + L EA + ++KG+ GVDG+ ++ Sbjct: 13 LQNKLYLTAKKCQKRRFHALYDKVYRDDVLIEAWKRVKANKGSS--GVDGIRIEDIEKMG 70 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L+ L+ EL+ G Y P P +RV IPK +G RPLGIP +RDRIVQ A +A+EP++E Sbjct: 71 IEKYLKELKKELIEGKYIPSPVKRVMIPKPDGSERPLGIPTVRDRIVQMAAKIAIEPVFE 130 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP+RS A+ V+ + +G +V++ D+ +FD V+ LM + Sbjct: 131 ADFRECSYGFRPKRSAKQALEVVRKACNN----KGYYVVDADIEKFFDNVNQDKLMILIE 186 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +RISD R + L+ + +++G + + + G QG VISPLL+NI LN D+ + Sbjct: 187 QRISDRRILKLIRQWMRSGILYGSILTISELGTSQGSVISPLLANIYLNTLDRLWEKYGR 246 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + RYAD+ V+I K K+ A + Sbjct: 247 THGI------------------------------LVRYADNTVIICKNKKSVNHA--QSL 274 Query: 300 RGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 + G L LRL+ KTKI ++ +GF FLG R + Q ++ Sbjct: 275 LQYIMGKLDLRLHPVKTKIVNMWDGTEGFDFLGLHHRRFLKINKKGNRYGETYQYPSKKA 334 Query: 357 AASLTALLWK 366 + + + Sbjct: 335 MKKMKRTVKE 344 >UniRef50_A6DJK4 Reverse transcriptase/maturase n=21 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DJK4_9BACT Length = 446 Score = 247 bits (630), Expect = 6e-64, Method: Composition-based stats. 
Identities = 110/367 (29%), Positives = 171/367 (46%), Gaps = 54/367 (14%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + L + + + EA S+KG H GVD V+ ++ L L +EL G Sbjct: 42 GKWYSLSDKLMRKNNIMEAWEKVCSNKGKH--GVDMVSIERYESELEYNNAKLLEELQDG 99 Query: 75 HYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 Y P RRV IPK +G K RPLGIP +RDR+VQ A+ +EPI++ DF S+GFRP+ Sbjct: 100 RYDPSAVRRVEIPKGDGRKTRPLGIPTVRDRVVQTALKHVIEPIFDIDFSPYSFGFRPKL 159 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 A+R V L + +V++ D+ SYFDT+ H LM V+ +I D + + L+ + Sbjct: 160 GCKDALRRVNELL----KQGYLYVMDADIQSYFDTIPHEKLMSRVKEKIIDGKILDLIEQ 215 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 +KA D EG PQGG+ISPLL+NI L+ FD + E Sbjct: 216 FLKANIFDGLKHWEPEEGTPQGGIISPLLANIYLDLFDHKMTEA---------------- 259 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 RYADDF+++ K K + + R ++ + L+L+ Sbjct: 260 -----------------GFEIVRYADDFLIMCKS-KESAKRALRKTRRWMKAN-GLKLHP 300 Query: 314 DKTKIPHVNDG---FIFLGHRLIRKRS---------RYGEMRVVSTIPQEKARNFAASLT 361 +KT+I + + F FLG+ R R+ + + I ++ R+ S+ Sbjct: 301 EKTRIVDMTEKCEYFEFLGYHFERTRNTHRIKRWPRKQSLKKCKDAIRKKTRRSNKDSIE 360 Query: 362 ALLWKVR 368 ++ +R Sbjct: 361 DIIAYLR 367 >UniRef50_B7K703 RNA-directed DNA polymerase (Reverse transcriptase) n=85 Tax=Bacteria RepID=B7K703_CYAP7 Length = 661 Score = 246 bits (628), Expect = 9e-64, Method: Composition-based stats. 
Identities = 111/395 (28%), Positives = 178/395 (45%), Gaps = 48/395 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ A+ ++RL + + + + R+T ++G T GVDGV + Sbjct: 37 LQKRIYQAASRGNVRTVRRLQKTLLRSWSAKMLAVRRVTQDNQGKKTAGVDGVKSLTPKQ 96 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPI 117 R+ + Q L + P RRV+IPK RP IP + DR +Q + +A+EP Sbjct: 97 RMNLVGQ------LKLTCKTKPTRRVWIPKPGKDEKRPFLIPCMSDRALQALVKIALEPE 150 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP R H AI + QL + ++V++ D+S FD ++H L++ Sbjct: 151 WEAKFEPNSYGFRPGRGCHDAIGAIFNQLGA----KAKYVLDADISKCFDKINHEKLLQK 206 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + + +KAG +D EG PQGGV+SPLL+NI L+ ++ + Sbjct: 207 LN---TFPTLRRQIRAWLKAGVMDGNKLFPTEEGTPQGGVVSPLLANIALHGMEEIIKS- 262 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + + R + +++ RYADDFVLI + A VE +E Sbjct: 263 ------------FAQNPGELRQEFSNRGKGREQSISLIRYADDFVLIHESL-AVVEKGKE 309 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND------GFIFLGHRLIR-KRSRYGEMR------- 343 L L L L +KT+I H D GF FLG + KRSR+ M+ Sbjct: 310 IIETWLR-ELGLTLKPEKTQITHTLDKHQGKVGFNFLGFNIRHYKRSRHKSMKNNKGISL 368 Query: 344 ---VVSTIPQEKARNFAASLTALLWKVRISGEILL 375 + +EK L ++ + +I+L Sbjct: 369 GFTLTIKPTREKVLEHYKKLRDIVDAHKAISQIVL 403 >UniRef50_Q3B1V7 RNA-directed DNA polymerase n=31 Tax=Bacteria RepID=Q3B1V7_PELLD Length = 495 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. 
Identities = 95/374 (25%), Positives = 161/374 (43%), Gaps = 44/374 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A R++ L L+T A + ++G +TPGVDG +A+ Sbjct: 30 LQARIAKATKEGRHGRVKALQWLLTHSHSGKVLAVKRVTENRGKNTPGVDGDVWKTSKAK 89 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L Y+PLP RR YIPK NGK RPLGIP ++DR +Q +A+EP+ E Sbjct: 90 ANA-----AASLRRRGYKPLPLRRTYIPKKNGKQRPLGIPTMKDRAMQALYWLALEPVAE 144 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFRP RS + L +W++E D++ FD + H+ L+ + Sbjct: 145 TTADGNSYGFRPWRSTADVAEQCFICL--ARRDSAQWILEADIAGCFDAISHQWLVDNIP 202 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K +KAG + + G PQGG+ISP L+N+ L+ +Q L + Sbjct: 203 MDTP------ILRKWLKAGFVFNNELFPTASGTPQGGIISPGLANMSLDGLEQALATAFP 256 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + R + RYADDF++ + I Sbjct: 257 QARRRGL------------------------KMHMVRYADDFIITGNSKEWLEHEIMPVV 292 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L+ L L+ +KT++ H+ +GF FLG + + + +++ + + Sbjct: 293 VDFLKKR-GLWLSEEKTRVTHITEGFDFLGWNMRKY-----DGKLLIKPSKANIKAHLTK 346 Query: 360 LTALLWKVRISGEI 373 + ++ + ++ Sbjct: 347 VRGIIKAKKTIKQV 360 >UniRef50_B2JXR4 RNA-directed DNA polymerase n=10 Tax=Bacteria RepID=B2JXR4_BURP8 Length = 503 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. 
Identities = 106/386 (27%), Positives = 170/386 (44%), Gaps = 51/386 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A A + + L RL+T+ A + ++G TPGVDG + A+ Sbjct: 36 LQARIAKAAREGRWDKAKVLQRLLTRSHSAKMLAVKRVTENRGKRTPGVDGRVWSSSAAK 95 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L L Y+ +P RR+YIPKSNGK RPLGIP +R R +Q +A+EPI E Sbjct: 96 WKGML-----SLRHRGYRAMPLRRIYIPKSNGKKRPLGIPCMRCRSMQALWKLALEPIAE 150 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFRPERS AI L WV+EGD+ FD H +K + Sbjct: 151 TLADANSYGFRPERSTADAIEQCFTVL--ARRISPEWVLEGDIRGCFDNFSHSWFLKHIP 208 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 D + K ++AG+ID G + G PQGG+ISP+++N+ L+ + +H Sbjct: 209 M---DKVILR---KWLEAGYIDEGTLFESRAGTPQGGIISPVIANMALDGLEAAVHA--- 259 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 S + + ++ RYADDFV+ + Sbjct: 260 -------------------SVGTSARARKRAQLSVIRYADDFVVTGVSKDVLELKVLPAV 300 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 R + L L+ +KT+I H+ GF FLG + + + +++ ++ ++ Sbjct: 301 RQFMAVR-GLELSEEKTRITHIAAGFDFLGQNVRKY-----DGKLLIKPAKKSIKSLTDK 354 Query: 360 LTALLWK---------VRISGEILLG 376 + A++ +R ++ G Sbjct: 355 VGAIIKGNASATQEALIRQLNPVIRG 380 >UniRef50_A9ENQ0 Integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Bacteria RepID=A9ENQ0_SORC5 Length = 439 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. 
Identities = 105/379 (27%), Positives = 160/379 (42%), Gaps = 49/379 (12%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 K+ A DP + L LI E L A R S + GVDG+ K L Sbjct: 16 KVRERAERDPEGVLLALAHLI-DEEALQRAYR---SLRNEAAVGVDGITKEQYGQDLEHN 71 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ L + S Y+ P RRV+IPK GK RP+GI D+IVQ A+ +E I+E F Sbjct: 72 VRDLHARMKSMRYRHQPIRRVHIPKERGKTRPIGISCTEDKIVQAAVREMLEVIYEPVFR 131 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 +SYGFRP RS H A+R + L W++E D+ S+FD++ LM+ ++ R++ Sbjct: 132 DVSYGFRPGRSAHDALRALNRMLL----GGVEWILEADIESFFDSIDRTKLMEMLQARVA 187 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D + L+ K + G +D F A +G QG V+SPLL N+ L+ D ++ Sbjct: 188 DKSLLRLVGKCLHVGVLDGAEFYAPEDGTVQGSVLSPLLGNVYLHHVLDLWIEREVQP-- 245 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + RYADDF++ + + + + E Sbjct: 246 ------------------------RLVGKATLIRYADDFIIGFE-REDDAKRVTEVLPRR 280 Query: 303 LEGSLKLRLNMDKTKIPHVNDG------------FIFLGHRLIRKRSRYGEMRVVSTIPQ 350 E L+L+ DKT++ F FLG +RSR G + Sbjct: 281 FE-RYGLKLHPDKTRLLPFGRPDNGQPGGKGPATFDFLGFTHYWRRSRAGRWMPSMKTRK 339 Query: 351 EKARNFAASLTALLWKVRI 369 + R ++ + R Sbjct: 340 ARLRRAITAVADFCRRHRH 358 >UniRef50_P03876 Putative COX1/OXI3 intron 2 protein n=3 Tax=Saccharomycetaceae RepID=AI2M_YEAST Length = 854 Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats. 
Identities = 102/369 (27%), Positives = 167/369 (45%), Gaps = 32/369 (8%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 KL R+L+L++ L A S KG + G + + L Sbjct: 268 NKLMENNHNKTETINTRILKLMSDIRMLLIAYNKIKSKKGNMSKGSNNIT---LDGINIS 324 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 L L ++ + ++ P RRV IPK++G RPL + R++IVQ +M M +E I+ + F Sbjct: 325 YLNKLSKDINTNMFKFSPVRRVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSF 384 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+GFRP S AI K + C W I+ DL+ FDT+ H +L+ + RI Sbjct: 385 SYYSHGFRPNLSCLTAIIQCKNYMQYCN-----WFIKVDLNKCFDTIPHNMLINVLNERI 439 Query: 183 SDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL-- 239 D FM LL+K ++AG++D + + G+PQG V+SP+L NI L++ D+YL ++ Sbjct: 440 KDKGFMDLLYKLLRAGYVDKNNNYHNTTLGIPQGSVVSPILCNIFLDKLDKYLENKFENE 499 Query: 240 ----SGKARKDRWYWNNSIQ-----------------RGRSTAVRENWQWKPAVAYCRYA 278 + R +N+ R + + + RYA Sbjct: 500 FNTGNMSNRGRNPIYNSLSSKIYRCKLLSEKLKLIRLRDHYQRNMGSDKSFKRAYFVRYA 559 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR 338 DD ++ V G+ + I + L+ +L + +NMDK+ I H +G FLG+ + Sbjct: 560 DDIIIGVMGSHNDCKNILNDINNFLKENLGMSINMDKSVIKHSKEGVSFLGYDVKVTPWE 619 Query: 339 YGEMRVVST 347 R++ Sbjct: 620 KRPYRMIKK 628 >UniRef50_A1ZX33 Group II intron-encoded protein LtrA n=2 Tax=Bacteria RepID=A1ZX33_9SPHI Length = 594 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. 
Identities = 96/349 (27%), Positives = 162/349 (46%), Gaps = 33/349 (9%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + L I+R+ RL+ P A +KGA T G+ ++Q + Sbjct: 14 RGERKLPIERVYRLLYNPNLYLLAYSNLYGNKGALTSGI---TPETADGMSLDKIQDIIC 70 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 +L Y+ P++R++IPK NG+ RPL IP D+++Q + + +E +E F S+GF Sbjct: 71 KLKQESYRWKPSKRIFIPKKNGQPRPLSIPCWSDKLLQEVIRLILEAYFEPQFCESSHGF 130 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R R H A++ ++ +W IEGD+ FD ++H+L++K + ++ D RF+ Sbjct: 131 RTGRGCHSALKQMR-----LKGKGSKWFIEGDIQGCFDNINHQLIIKLLSDKLYDPRFIR 185 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 L+ + +K G+I+ + GVPQG +I P+L+NI+LNE D+++ + + + R Sbjct: 186 LISQLLKTGYIEGWKYNKTYSGVPQGSIIGPILTNIVLNELDKFVENKLIPANTKGKRRR 245 Query: 250 WNNSI------------------------QRGRSTAVRENWQWKPAVAYCRYADDFVLIV 285 Q + + N + Y RYA+D +L Sbjct: 246 SCPKYALIKRQASKARKQGDMDKCRELNKQAQKIPSRDTNDPKYRRLWYIRYANDTLLGY 305 Query: 286 KGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLI 333 G K + I+E+ L L L LN DKT I H + FLG+ + Sbjct: 306 IGKKEEAIKIKEQIADFLANELHLTLNSDKTLITHAQSQKASFLGYHIR 354 >UniRef50_A2TD24 Intron encoded protein n=2 Tax=Bacillaceae RepID=A2TD24_BACSO Length = 604 Score = 244 bits (622), Expect = 5e-63, Method: Composition-based stats. 
Identities = 104/364 (28%), Positives = 173/364 (47%), Gaps = 18/364 (4%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQA-RLAVELQILR 68 R + L+ + + + A S++G T G DG + + + ++ Sbjct: 10 EKGDVPRFKGLVEIASSDVVIVSAIHKIKSNQGNSTAGTDGKTISDILTLNYDEAINFVK 69 Query: 69 DELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 Y P P RRV+IPK K RPLGI + DRI+Q + M +EPI E+ F SY Sbjct: 70 RCFK--KYTPNPIRRVHIPKPGKKEKRPLGILTIADRIIQECVRMVIEPILEAQFFQHSY 127 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR-ISDAR 186 GFRP R A + ++ + C WVIEGD+ +FD V+H +L+K + I D R Sbjct: 128 GFRPYR---DAKQAIERCVFICNRIGYNWVIEGDIKGFFDNVNHTILIKQLWHMGIRDRR 184 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + ++ +KAG I G PQGG+ISPLL+N+ L++ DQ++ + K R Sbjct: 185 MLMIIKAMLKAGVIKETKINEM--GTPQGGIISPLLANVYLHKLDQWITREWEEKKMRN- 241 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 +I+ + ++R++ Y RYADD+ ++ ++ E + + L+ + Sbjct: 242 ----GTTIRTAKYKSLRDHSTITKPEFYVRYADDW-ILFTNSRGNAEKWKYRIKKYLKEN 296 Query: 307 LKLRLNMDKTKIPHVNDGF-IFLGHRLI-RKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 LKL L+ DKT I ++ FLG ++ + G+ ++ EK + + L Sbjct: 297 LKLELSDDKTLITNIKKKPMKFLGFKIKMIPHGKGGKYIGYASADTEKIKGKVEQIRKDL 356 Query: 365 WKVR 368 K++ Sbjct: 357 RKLK 360 >UniRef50_Q119U8 RNA-directed DNA polymerase n=30 Tax=Bacteria RepID=Q119U8_TRIEI Length = 635 Score = 243 bits (621), Expect = 7e-63, Method: Composition-based stats. 
Identities = 103/397 (25%), Positives = 181/397 (45%), Gaps = 56/397 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+ + ++ ++++ +L+T+ + L R+T ++G T G+DG+ Sbjct: 54 LQKLIYRASSRGEIRKMRKYQKLLTKSYYARLLAVRRVTQDNQGKKTAGIDGIKSLPPMQ 113 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 RL L + L S + P RRV+IPK + RPLGIP + DR +Q + + MEP Sbjct: 114 RLN-----LVEMLGSRFLKASPIRRVWIPKPGREEKRPLGIPTMYDRALQALVKLGMEPE 168 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS + AI + + + + ++V++ D+S FD ++H L+ Sbjct: 169 WEALFEPNSYGFRPGRSTYDAIAAIYVSIN----HKPKYVLDADISKCFDRINHDALLGK 224 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + + + L+ + +K+G D F EG PQGGVISPLL+NI L+ ++ L + Sbjct: 225 IGK----SPYRKLVKQWLKSGVFDNKQFSNTVEGTPQGGVISPLLANIALHGMEKCLEDY 280 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + K + A++ RYADDFV++ K K ++A + Sbjct: 281 AETLPGTK--------------------RDNQRALSLIRYADDFVILHKDIKVLLQA-KT 319 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND-------GFIFLGHRLIRKRSR--YGEMRVVSTI 348 + L + L L +KTKI H + GF FLG + + + + + + Sbjct: 320 VIQEWL-NQVGLELKPEKTKIAHTLEEYEGNKPGFDFLGFTIRQWKGKTTKQGFKTLIKP 378 Query: 349 PQEKARNFAASLTALLWKVRI---------SGEILLG 376 + + L + + ++ G Sbjct: 379 SSKSIKTHYRKLADIGDTYKTVPTKALIAKLNPVIRG 415 >UniRef50_Q7UY81 Reverse transcriptase/maturase n=1 Tax=Rhodopirellula baltica RepID=Q7UY81_RHOBA Length = 459 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. 
Identities = 110/360 (30%), Positives = 168/360 (46%), Gaps = 46/360 (12%) Query: 13 PSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELL 72 + L+ + + L +AR + KG GVD + + E++ L ++L Sbjct: 62 RGGKWHALIDKVYRELNLFVSARKVVGKKG--AAGVDRQSTEDFSEKEIAEIKQLYEQLR 119 Query: 73 SGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 +G Y+P RRV IPK K RPLGIP +RDR+VQ A++ +EPI++++FH S+GFR Sbjct: 120 TGTYRPQAVRRVQIPKPGSKQTRPLGIPTVRDRVVQTALVNVIEPIFDNEFHERSFGFRH 179 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 RS H A+R V+ L ET +V++ DL YFDT+ L+ V +ISD R + L+ Sbjct: 180 GRSCHDALRVVEELL----ETDHVFVVDADLQGYFDTIPKDRLLALVSEKISDRRVLDLV 235 Query: 192 WKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 + + ++ GVPQG V+SPLLSN+ LNE D + + Sbjct: 236 KRFLDQSILEELREWTPESGVPQGAVLSPLLSNLYLNELDHRMAD--------------- 280 Query: 252 NSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 RYADDFV++ + + Q E EE + + L L Sbjct: 281 ------------------LGYEMVRYADDFVILCRS-QEQAELALEEVKRFV-CEAGLTL 320 Query: 312 NMDKTKIPHVN-DGFIFLGHRLI---RKRSRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 + +KT I + F FLG+ R ++V TI + R SL A + ++ Sbjct: 321 HPEKTHIVDSRVNSFDFLGYSFRGKLRFPRAKSHQKMVDTIRRLTPRKSGQSLEATIVQI 380 >UniRef50_P03875 Putative COX1/OXI3 intron 1 protein n=3 Tax=Fungi/Metazoa group RepID=AI1M_YEAST Length = 834 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. 
Identities = 109/378 (28%), Positives = 168/378 (44%), Gaps = 39/378 (10%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPAR 82 ++ + L A S G TPG L + L L +EL +G ++ P R Sbjct: 255 IMKNVDMLMLAYNRIKSKPGNMTPGT---TLETLDGMNMMYLNKLSNELGTGKFKFKPMR 311 Query: 83 RVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTV 142 V IPK G +RPL + RD+IVQ M M ++ I++ T S+GFR S AI V Sbjct: 312 MVNIPKPKGGMRPLSVGNPRDKIVQEVMRMILDTIFDKKMSTHSHGFRKNMSCQTAIWEV 371 Query: 143 KLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV 202 + W IE DL FDT+ H L++K ++R ISD F+ L++K ++AG+ID Sbjct: 372 RNMF-----GGSNWFIEVDLKKCFDTISHDLIIKELKRYISDKGFIDLVYKLLRAGYIDE 426 Query: 203 -GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKARKDRWYWNNSIQ--- 255 G + G+PQG +ISP+L NI++ D +L Y GK +K + + Sbjct: 427 KGTYHKPMLGLPQGSLISPILCNIVMTLVDNWLEDYINLYNKGKVKKQHPTYKKLSRMIA 486 Query: 256 --------------RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 R + N + Y RYADD ++ V G+K + I+ + Sbjct: 487 KAKMFSTRLKLHKERAKGPTFIYNDPNFKRMKYVRYADDILIGVLGSKNDCKMIKRDLNN 546 Query: 302 VLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVSTIPQEKARN----- 355 L SL L +N +KT I + FLG+ + + V TI + R+ Sbjct: 547 FL-NSLGLTMNEEKTLITCATETPARFLGYNISITPLK-RMPTVTKTIRGKTIRSRNTTR 604 Query: 356 --FAASLTALLWKVRISG 371 A + ++ K+ +G Sbjct: 605 PIINAPIRDIINKLATNG 622 >UniRef50_C9S0G0 RNA-directed DNA polymerase n=2 Tax=Geobacillus RepID=C9S0G0_GEOSY Length = 635 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. 
Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 32/388 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L A +L L+ + EA R S+KG+ T G+D ++ Sbjct: 21 LYQEAKKGKH--FYGMLELLQNDVVILEAIRNIKSNKGSKTAGIDQKIVDDYLLMPTEKV 78 Query: 65 QILRDELLSGHYQPLPARRVYIPKSN--------------GKLRPLGIPALRDRIVQRAM 110 + L+ Y+P+P RR PK N G+ RPLGI A+ DRI+Q + Sbjct: 79 FGMIKAKLN-DYKPIPVRRCNKPKGNAKSSKRKGNSPNEEGETRPLGISAVTDRIIQEML 137 Query: 111 LMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVH 170 + +EPI+E+ F+ SYGFRP RS HA+ L ++ WV++GD+ SYFD ++ Sbjct: 138 RIVLEPIFEAQFYPHSYGFRPYRSTEHALA---WMLKIINGSKLYWVVKGDIESYFDHIN 194 Query: 171 HRLLMKAV-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE 229 H+ L+ + + D R + ++ K +KAG + G F ++G+PQGG+ISPLL+N+ LN Sbjct: 195 HKKLLNIMWNMGVRDKRVLCIVKKMLKAGQVIQGKFYPTAKGIPQGGIISPLLANVYLNS 254 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 FD + + Y + N++ R+ + V Y RYADD+V++ TK Sbjct: 255 FDWMVGQEYEYHPNNANYREKKNALAALRN-------KGHHPVFYIRYADDWVILT-DTK 306 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYGEMRVVST- 347 E IRE+C+ L L L L+ +KT I + + FLG + + R+ + + Sbjct: 307 EYAEKIREQCKQYLACELHLTLSDEKTFIADIREQRVKFLGFCIEAGKRRFHKKGFAARM 366 Query: 348 -IPQEKARNFAASLTALLWKVRISGEIL 374 EK + + +R L Sbjct: 367 IPDMEKVNAKVKEIKRDIRLLRTRKSEL 394 >UniRef50_Q5U7I7 Maturase-related protein n=20 Tax=Gammaproteobacteria RepID=Q5U7I7_ECOLX Length = 451 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. 
Identities = 105/368 (28%), Positives = 175/368 (47%), Gaps = 52/368 (14%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA 61 QR + + + L+ + ++ A + +KG G+D ++ Sbjct: 16 QRSTVNNTSNEYNQIDHDLMAKVLSNHNISAAWQHVKRNKG--AAGIDNMSIEEFNDFAK 73 Query: 62 VELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD 121 + ++ +LL+G YQPLP +RV IPK +G R LGIPA+ DR++Q+A+ + P +E Sbjct: 74 LHWLGIKQQLLNGSYQPLPVKRVMIPKPDGGERMLGIPAVIDRVIQQAIAQVISPYFEPQ 133 Query: 122 FHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR 181 F SYG+RP + A+ + C + + ++ DLS +FD V H +LM V R+ Sbjct: 134 FSPHSYGYRPHKRASQAV----NHVQSCVKQGYKTAVDIDLSKFFDEVDHDMLMNRVSRK 189 Query: 182 ISDARFMTLLWKTIKAGH--IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 I D M LL K ++AG + GL+ +++GVPQGG +SPLLSNI+L+E D+ L Sbjct: 190 IKDKALMRLLGKYLRAGIAERETGLWFESTKGVPQGGPLSPLLSNILLDELDKKL----- 244 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + RYADD +++VK TK++ I+ E Sbjct: 245 ----------------------------TYKHLKFARYADDIIILVK-TKSEGLIIQREI 275 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + LKL++N K+++ V+ G FLG + I ++ + F A+ Sbjct: 276 TAFITKRLKLKVNESKSRVGPVS-GSKFLGFTFRYGQV---------QIHEQALKKFKAN 325 Query: 360 LTALLWKV 367 + L + Sbjct: 326 VRELTNRN 333 >UniRef50_C2XKK9 D-alanine--D-alanine ligase A (D-alanylalanine synthetaseA) n=1 Tax=Bacillus cereus F65185 RepID=C2XKK9_BACCE Length = 647 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. 
Identities = 107/363 (29%), Positives = 180/363 (49%), Gaps = 17/363 (4%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + A + S ++ L+ + + A S++G+ T G+D + A +L Sbjct: 44 IYQKAKEENSC-FHGIIELMKNKQTIKTAIHNIKSNRGSMTVGIDKKDVNYYLQMEAKQL 102 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + + +Y+P P RR YI K NGK RPLGIP + DRI+Q + +EPI E+ F Sbjct: 103 IKLIRQHID-NYKPNPVRREYINKGNGKKRPLGIPTMIDRIIQEIARIVLEPIAEAKFFN 161 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRIS 183 SYGFRP RS H+AI V L ++ IEGD+ S+FD ++H L++ + I Sbjct: 162 HSYGFRPYRSCHYAIGRV---LNTISRSKTYIAIEGDIKSFFDHINHNKLVEMMWNMGIK 218 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D RF+ ++ K ++AG ++ + G PQGG+ISPLL+NI LN FD + + + +A Sbjct: 219 DKRFLIIIKKMLRAGVLEDKVILPTEIGTPQGGIISPLLANIYLNNFDWMVAKEFEEHRA 278 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 R +++ + + + + RYADD++++ + T Q + + Sbjct: 279 RY-------TVKHAFRSGLTKVGRRHKKCFLIRYADDWIILCEDT-VQARILLTKIDKYY 330 Query: 304 EGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 + LKL L+ +KT I + + FLG + ++ R + IP +K + + Sbjct: 331 KHILKLELSKEKTFITDLREKPARFLGFDIKAEKMRLKDRIAGKAIPNKK--KLTSKMRE 388 Query: 363 LLW 365 +L Sbjct: 389 VLR 391 >UniRef50_Q1PUN9 Strong similarity to group II intron-encoded protein LtrA n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUN9_9BACT Length = 432 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. 
Identities = 116/378 (30%), Positives = 174/378 (46%), Gaps = 44/378 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRKL A + R L + +L EA + ++ G+ G DG+ +++ Sbjct: 21 FQRKLYRKAKQEEGFRFYVLYDKVRMLHFLREAYKRCKANGGS--AGADGITFEDVESYG 78 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L + +EL + Y+P P RVYIPK+NGK RPLGIP ++DR+VQ ++ + +EPI+E Sbjct: 79 VEKFLGEIIEELENKTYEPQPVLRVYIPKTNGKTRPLGIPVIKDRVVQMSVKLVIEPIFE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF SYGFRP RS A+R +K +L + V + DLSSYFDT+ H+ L+ + Sbjct: 139 ADFEDSSYGFRPGRSAGDAVRKIKEKLREGKTE----VFDADLSSYFDTIPHKELLLLIG 194 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLF---RAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 RISD + L+ +KA I+ G R G PQG VISPLL+NI L+ D+ ++ Sbjct: 195 MRISDKNVLHLIKMWLKAPVIEEGKPGGGRKNKIGTPQGSVISPLLANIYLHMLDKAVNR 254 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 +K + RYADD+VL+ K + Sbjct: 255 ENGVF--------------------------YKYGITIIRYADDWVLMAKRIPREALDYL 288 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRK---RSRYGEMRVVSTIPQEK 352 L+ SL N DK+KI + F FLGH + R + ++ Sbjct: 289 NRLLKKLKLSL----NEDKSKIVKAEEESFDFLGHTISFSEDLFGRKHKKYWNIEPSRKS 344 Query: 353 ARNFAASLTALLWKVRIS 370 + + L Sbjct: 345 QKKVREKIGNYLKSNGHK 362 >UniRef50_P38478 Uncharacterized mitochondrial protein ymf40 n=1 Tax=Marchantia polymorpha RepID=YMF40_MARPO Length = 502 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. 
Identities = 113/379 (29%), Positives = 176/379 (46%), Gaps = 37/379 (9%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 + PE A + S G PG D ++ + +L +Q P Sbjct: 6 YEQLLDPEIFRLAYELKKSKSGNMKPGADKETLDGFSQ---AYVEKVVRQLKDESFQFRP 62 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 +RR +IPK++GKLR LGIP+ RD+IVQ M +EP++E F S+GFRP RS H A+R Sbjct: 63 SRREFIPKADGKLRSLGIPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRSPHTALR 122 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 ++ T W+IEGD+ YFD + H LL + + D R + L WK ++AG++ Sbjct: 123 QIRRW------TGTSWMIEGDIKGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAGYV 176 Query: 201 DVGLFRAA-SEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR-----------KDRW 248 + G GVPQG ++SPLLSNI L++FD ++ E + K R Sbjct: 177 NQGKAEPHLLTGVPQGRILSPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYLKARN 236 Query: 249 YWNNSIQRGRSTAVRENWQWK------------PAVAYCRYADDFVLIVKGTKAQVEAIR 296 + ++ ++++ + V Y RYADD+V+ V G KA I+ Sbjct: 237 KYYKLVKSLKASSAEIIRARRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKALAVQIK 296 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVV---STIPQEK 352 EE L+ LKL L +KT+I +++ +FLG + +Y + + V Sbjct: 297 EEVSTFLQEKLKLSLQAEKTRITNLSRSEALFLGTLISITTRKYVQSQKVGGGHRRASLG 356 Query: 353 ARNFAASLTALLWKVRISG 371 + L+ K+ G Sbjct: 357 RIRLCIPIDILIGKLSQMG 375 >UniRef50_C4ZCX5 RNA-directed DNA polymerase n=24 Tax=Bacteria RepID=C4ZCX5_EUBR3 Length = 464 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. 
Identities = 114/360 (31%), Positives = 168/360 (46%), Gaps = 49/360 (13%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 RLL I + A + ++KG PG+DG+ L Q + D + G Y P Sbjct: 41 RLLETILYKDNFNRAYKRVKANKG--APGIDGMTIEEALPYLKEHQQEITDRIYRGKYTP 98 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P RRV IPK +G +R LGIP + DR +Q+A+ + PI+E F SYG+RP RS A Sbjct: 99 SPVRRVEIPKPDGGVRKLGIPTVIDRTLQQAITQQLVPIYEPLFADGSYGYRPNRSAKDA 158 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I VK + E + + DLS YFDT++H +L+ +R+ + D R + L+ + +K+G Sbjct: 159 ILKVK----EYAEQGYTFAVVLDLSKYFDTLNHEILINLLRKNVKDERVVQLIKRYLKSG 214 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 ++ G+ EG PQGG +SPLL+NI LNEFDQ Sbjct: 215 VMENGVVIDTEEGSPQGGNLSPLLANIYLNEFDQE------------------------- 249 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 K V RYADD VL+ K +A E + E LE LKL +N +K++ Sbjct: 250 --------YLKRGVPCIRYADDIVLLAKSKRAS-ERLLESSTKYLEERLKLTVNREKSRT 300 Query: 319 PHVN--DGFIFLGHR-------LIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRI 369 V F FLG + + + S + + +R S+ L K+++ Sbjct: 301 VSVFAIRNFKFLGFALGRNGKGIYVRVHPKSWKKFKSRLKELSSRKRCQSIKPSLEKIKV 360 >UniRef50_P05511 Uncharacterized 91 kDa protein in cob intron n=1 Tax=Schizosaccharomyces pombe RepID=YMC6_SCHPO Length = 807 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. 
Identities = 98/399 (24%), Positives = 173/399 (43%), Gaps = 37/399 (9%) Query: 5 LATWAATDPSLRI-QRLLR-LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 L+ + P+L I + L + + + A S+ G T G + L Sbjct: 220 LSKRSKNYPNLVIDRNLYKDFLLNRDMFLIAYNKLKSNPGMMTHG---LKPDTLDGMSID 276 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + + L S + P RR+ I K++G RPL I + RD++VQ + + +E I+E F Sbjct: 277 VIDKIIQSLKSEEFNFTPGRRILIDKASGGKRPLTIGSPRDKLVQEILRIVLEAIYEPLF 336 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 +T S+GFRP RS H A+R++ W IEGD+ + FD++ H L+ + +I Sbjct: 337 NTASHGFRPGRSCHSALRSIFTNF-----KGCTWWIEGDIKACFDSIPHDKLIALLSSKI 391 Query: 183 SDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 D RF+ L+ K + AG++ ++ G PQG ++SP+L+NI L++ D+++ Sbjct: 392 KDQRFIQLIRKALNAGYLTENRYKYDIVGTPQGSIVSPILANIYLHQLDEFIENLKSEFD 451 Query: 243 ARKDRWYWNNSIQR-------------------------GRSTAVRENWQWKPAVAYCRY 277 + S R R+ + + + Y RY Sbjct: 452 YKGPIARKRTSESRHLHYLMAKAKRENADSKTIRKIAIEMRNVPNKIHGIQSNKLMYVRY 511 Query: 278 ADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKR 336 ADD+++ V G+ Q + I + S+ L ++ KTKI + D +FLG + + Sbjct: 512 ADDWIVAVNGSYTQTKEILAKITCFC-SSIGLTVSPTKTKITNSYTDKILFLGTNISHSK 570 Query: 337 SRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEILL 375 + +A + + K+R +G +L Sbjct: 571 NVTFSRHFGILQRNSGFILLSAPMDRIAKKLRETGLMLN 609 >UniRef50_Q47DU4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Dechloromonas aromatica RCB RepID=Q47DU4_DECAR Length = 429 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. 
Identities = 109/374 (29%), Positives = 165/374 (44%), Gaps = 48/374 (12%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR-LAVE 63 L A ++ R L I + + LA A S+KG PGVD + ++A + Sbjct: 3 LHAKAKSESGYRFYALYDKIYRTDILAHAYAQCRSNKG--APGVDRQDFEDVEAYGVRRW 60 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 L+ L L Y+P P RRV+IPK+NGKLRPLGI L DR+ A ++ +EPI+E+D Sbjct: 61 LEELALALKEESYRPDPIRRVFIPKANGKLRPLGISTLHDRVCMTAAMLVLEPIFEADLP 120 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 Y +RP R+ A VK +L V++ DLS YF ++ H LMK++ RRI Sbjct: 121 DEQYAYRPGRNAQQAAEEVKNRLYL----GQTDVVDADLSDYFGSIPHSELMKSLARRIV 176 Query: 184 DARFMTLLWKTIKAGHIDVGL---------FRAASEGVPQGGVISPLLSNIMLNEFDQYL 234 D R + L+ ++ + + G+PQG ISPLLSN+ + F Sbjct: 177 DRRVLHLIKMWLECAVEETDQRGRKKRTTEAKDQGRGIPQGSPISPLLSNLYMRRFVLAW 236 Query: 235 HERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEA 294 + + YADD V++ K K E Sbjct: 237 K---------------------------KLGLERSLGSRIVTYADDLVILCKCGK--AEE 267 Query: 295 IREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRL-IRKRSRYGEMRVVSTIPQEK 352 + R ++ G LKL +N +KT+I V G F FLG+ R R G+ ++ ++ Sbjct: 268 ALQWMRTIM-GKLKLTVNEEKTRICQVPAGTFDFLGYSFGRRYVPRTGKPQIALWPSKKS 326 Query: 353 ARNFAASLTALLWK 366 R + + + Sbjct: 327 IRRMVEKIHDMTER 340 >UniRef50_C9P0Q5 Retron-type reverse transcriptase n=3 Tax=Vibrio RepID=C9P0Q5_VIBME Length = 436 Score = 241 bits (614), Expect = 4e-62, Method: Composition-based stats. 
Identities = 115/361 (31%), Positives = 168/361 (46%), Gaps = 51/361 (14%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA--VELQILRDELLSGH 75 RLL + P L A + +KG GVD + T +L Q LR LL G Sbjct: 4 TRLLEQMFSPGNLNAATKQVKRNKGCG--GVDRLTITATLEKLRQLDNGQQLRQSLLDGS 61 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 YQP P V IPK G +R LGIP ++DRIVQ+AM + ++E F S+GFRP RS Sbjct: 62 YQPSPVLGVEIPKPKGGVRQLGIPTVQDRIVQQAMAQLLTQLYEPKFSKSSFGFRPRRSA 121 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 HHA+ + + +V++ DL YFDTV+H LM + + I+D R + L+ K + Sbjct: 122 HHALSKASEYIRE----GRGYVVDIDLEKYFDTVNHDRLMYRLSQDIADKRVLKLIRKYL 177 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 ++G + G+ G PQGG +SPLLSNI+L+E D+ L Sbjct: 178 QSGLMRNGVIERRQRGTPQGGPLSPLLSNIVLDELDKELER------------------- 218 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 +CRYADD + V G++A + ++ LE +LKLR+N +K Sbjct: 219 --------------RGHKFCRYADDCQIYV-GSEAAAQRVKTSVTEFLEQTLKLRVNREK 263 Query: 316 TKIPHVNDGFIFLGHRLIRKRS--------RYGEMRVVSTIPQEKARNFAASLTALLWKV 367 + V++ +LGHR + RV + + R F A + L + Sbjct: 264 SAATRVSER-SYLGHRFNDDGVIGISDDAMHQMKKRVRQVTKRNRGRTFPAIIKELSTYL 322 Query: 368 R 368 R Sbjct: 323 R 323 >UniRef50_B7I148 Reverse transcriptase n=9 Tax=Bacillus RepID=B7I148_BACC7 Length = 632 Score = 241 bits (614), Expect = 5e-62, Method: Composition-based stats. 
Identities = 108/385 (28%), Positives = 186/385 (48%), Gaps = 33/385 (8%) Query: 4 KLATWAATDPSLR----IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 KL + + LL + + A S+KG+ TPGVDG Sbjct: 20 KLYSKTKEHMEKKTRIKHTSLLEIAMSKPNIVTAIHSLKSNKGSMTPGVDGKTIQDYLRL 79 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 +L L L+ +++ +RV+IPK+NG RPLGIP + DRI+Q+ M +EP+ E Sbjct: 80 SEEKLIELIRGRLT-NFKAHLIKRVFIPKANGGQRPLGIPTIEDRIIQQMMKQVLEPVLE 138 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + F S+GFRPER+ +HA+ VK+ + + T W++EGD+ +FD V+HR+L+K + Sbjct: 139 AQFFKYSFGFRPERTTYHALERVKVLVHN---TGYHWIVEGDIRQFFDKVNHRILIKKLW 195 Query: 180 -RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I D R + L+ + +KAG G PQGG++SPLL+N+ L+ FD+++ +++ Sbjct: 196 SMGIKDRRILCLITEFLKAGIFKN--IIRNDNGTPQGGILSPLLANVYLHSFDKWVAKQF 253 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 R + ++ ++ +S+ ++ RYADD+V +V K+ + Sbjct: 254 EEFTTRHEYSKHDHKLRGLKSSNLKPG-------YLIRYADDWV-LVTNNKSHAYRWKTV 305 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVS----------- 346 + L+ LKL L+ +KT+I ++ FLG + + Sbjct: 306 IKNFLQKELKLELSEEKTRITNIRHKPIEFLGFKYKVVLKGVKGKKKKDKKTRYISQITP 365 Query: 347 --TIPQEKARNFAASLTALLWKVRI 369 + K + A+LT+L ++ Sbjct: 366 SDKKIKRKVKELRATLTSLGKRLSH 390 >UniRef50_B9J6F8 Reverse transcriptase n=7 Tax=Bacillus cereus group RepID=B9J6F8_BACCQ Length = 624 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. 
Identities = 107/363 (29%), Positives = 180/363 (49%), Gaps = 17/363 (4%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + A + S ++ L+ + + A S++G+ T G+D + A +L Sbjct: 21 IYQKAKEENSC-FHGIIELMKNKQTIKTAIHNIKSNRGSMTVGIDKKDVNYYLQMEAKQL 79 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L + + +Y+P P RR YI K NGK RPLGIP + DRI+Q + +EPI E+ F Sbjct: 80 IKLIRQHID-NYKPNPVRREYINKGNGKKRPLGIPTMIDRIIQEIARIVLEPIAEAKFFN 138 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV-RRRIS 183 SYGFRP RS H+AI V L ++ IEGD+ S+FD ++H L++ + I Sbjct: 139 HSYGFRPYRSCHYAIGRV---LNTISRSKTYIAIEGDIKSFFDHINHNKLVEMMWNMGIK 195 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 D RF+ ++ K ++AG ++ + G PQGG+ISPLL+NI LN FD + + + +A Sbjct: 196 DKRFLIIIKKMLRAGVLEDKVILPTEIGTPQGGIISPLLANIYLNNFDWMVAKEFEEHRA 255 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 R +++ + + + + RYADD++++ + T Q + + Sbjct: 256 RY-------TVKHAFRSGLTKVGRRHKKCFLIRYADDWIILCEDT-VQARILLTKIDKYY 307 Query: 304 EGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 + LKL L+ +KT I + + FLG + ++ R + IP +K + + Sbjct: 308 KHILKLELSKEKTFITDLREKPARFLGFDIKAEKMRLKDRIAGKAIPNKK--KLTSKMRE 365 Query: 363 LLW 365 +L Sbjct: 366 VLR 368 >UniRef50_A0L945 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L945_MAGSM Length = 433 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. 
Identities = 125/391 (31%), Positives = 186/391 (47%), Gaps = 54/391 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 ++RK+ A D S R L + +PE L A + +KG PG+DGV ++ Sbjct: 11 LRRKIYRKAKIDKSWRFWGLYHHVCKPETLNTAYEMARKNKG--APGIDGVTFEAIEESG 68 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L +R EL+SG Y+PL RR IPK +GK R LGIP++RDR+VQ A+ + +EPI+E Sbjct: 69 VEQFLGEVRKELVSGSYRPLKNRRKAIPKGDGKERVLGIPSIRDRVVQGALKLILEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF + SYG+RP+R H A+ V + + VI+ DL SYFDTV H L ++ V Sbjct: 129 ADFQSGSYGYRPKRMAHQAVNRVAIAIAQGKTQ----VIDADLKSYFDTVQHDLALRKVS 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R+ D + M LL K GVPQGGVISPL+SN+ LNE D+ L Sbjct: 185 ERVDDDQVMHLLKLIFKTS---------GKRGVPQGGVISPLISNLYLNEVDKMLERAKE 235 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKA---QVEAIR 296 + K + Y R+ADD V++V G + Sbjct: 236 VTRKGK-----------------------YTHIEYARFADDLVILVDGHHRWNGLARKVY 272 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDG--FIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 + L LK++LN++KT++ + G F FLG + ++ + G+ V+ + Sbjct: 273 QRLGEEL-AKLKVQLNLEKTRVVDLTRGEDFTFLGFNIRQRMTLQGKQGVLCIPRMKART 331 Query: 355 NFAASLTALLWKVR---------ISGEILLG 376 + L + +R IL G Sbjct: 332 SLLGRLKEVFKHLRSQPTEWVVNTINPILRG 362 >UniRef50_Q35062 CoxI intron2 ORF n=2 Tax=Marchantia polymorpha RepID=Q35062_MARPO Length = 802 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. 
Identities = 113/402 (28%), Positives = 190/402 (47%), Gaps = 50/402 (12%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 A P R L+ +I+ P +LA G T G D + + Sbjct: 299 AYLGPDNRYNGLIHIISDPTFLALCYESIRGKPG--TSGSDAKPLDGPE-----WFVQVG 351 Query: 69 DELLSGHYQPLPARRVYIPKSNGK-LRPLGI-------PALRDRIVQRAMLMAMEPIWES 120 ++L G ++ PARR I K K RPLGI ++IVQ+A+ + +E I+E Sbjct: 352 EKLKKGQFEFSPARR--ITKPGKKEKRPLGINSPVKQKKCYGEKIVQKALQLVLEAIYEP 409 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 F S+GFR RS H A++ + WV+EG++ +FD++ H++++ + + Sbjct: 410 IFLDCSHGFRIHRSCHTALKRLC-----LEGGHYPWVVEGNIRKFFDSIPHKVILHKISQ 464 Query: 181 RISDARFMTLLWKTIKAGHIDV--GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE-- 236 ++ R + LL + ++AG+ D G + EG QG V+SPLL NI+L+ D+++ + Sbjct: 465 KVKCHRTLELLQRALRAGYKDPTSGQVISLDEGTSQGSVLSPLLCNIILHYLDEFVMKLR 524 Query: 237 -RYLSGKARKDRWYWNNS--------------IQRGRSTAVRENWQWKPAVAYCRYADDF 281 R+ GK+R+ + I+R + + + Y RYADDF Sbjct: 525 DRFNKGKSRRINPEYKLLTRHMNANRQDRSLLIKRRLIPSKDPLDPYFRRILYVRYADDF 584 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYG 340 V++V GT+ + AI+ + L SL+L L+++KT + H+ N GF FLG R RSR+ Sbjct: 585 VILVSGTRLETFAIQASLQNFLHRSLRLELSLEKTVVSHLANKGFHFLGTYCKRTRSRHR 644 Query: 341 -------EMRVVSTIPQEKARNFAASLTALLWKVRISGEILL 375 + + E+ R A +T L +K++ G + Sbjct: 645 IFHVRTVRGKTIKQRSTERLR-VCAPITKLFYKLKEKGFVKR 685 >UniRef50_Q94Z00 Orf757 n=4 Tax=stramenopiles RepID=Q94Z00_PYLLI Length = 757 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. 
Identities = 104/357 (29%), Positives = 169/357 (47%), Gaps = 31/357 (8%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + R+ +++ I + + A S G TP N L + + L + Sbjct: 171 NDRLTKVIHDIASLKNITRAYESIKSKPGNMTPSA---NSETLDGFGLAWVVKASNNLKA 227 Query: 74 GHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 G ++ ARRV+IPK KLRPLG+ + RD+++ A+L +EP +E F +S+ FRP Sbjct: 228 GKFKFSNARRVHIPKPGSSKLRPLGVVSPRDKVILTAVLQVLEPFYEKKFLDISHAFRPG 287 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 R H A+ ++L+ + W IEGD++ FD + H +L+ +RR I + + L+ Sbjct: 288 RGCHTALNFIQLRFGN-----SNWAIEGDIARCFDDIDHDILLGILRRDIKCDKTIALIK 342 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---ERYLSGKARKDRWY 249 K++K ++ G+ +G QG +SP L NI L+E D ++ E Y+SG R+ Sbjct: 343 KSLKNPFVEDGVTVKPQKGTFQGSPLSPFLCNIYLHEMDLFIKGLSEDYISGTHRRKSPQ 402 Query: 250 WNNSIQRGRSTAVRENWQW------------------KPAVAYCRYADDFVLIVKGTKAQ 291 + +++ + +Y RYADDFV+ + G K Sbjct: 403 YRKIQYELSKPSLKVTERKLLNKKLRAIPSKDPVDPDFRRFSYVRYADDFVIGITGPKKD 462 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVST 347 E +R + R L L L L+MDKT I H N +G FLG R+ + R +R +S Sbjct: 463 CEEVRNKLREFLTKILALELSMDKTIISHFNQEGITFLGTRISGNKEREKVIRKISK 519 >UniRef50_B1L2I7 Reverse transcriptase/endonuclease protein n=37 Tax=Firmicutes RepID=B1L2I7_CLOBM Length = 607 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. 
Identities = 117/390 (30%), Positives = 189/390 (48%), Gaps = 47/390 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +++++ ++++L RL+ + L R+T +KG T G+DG Sbjct: 34 LRQRIFRAEQLGQKRKVKKLQRLMLRSKANLLISIKRVTQINKGKRTAGIDGFKVIT--- 90 Query: 59 RLAVELQILRDELLS---GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAME 115 L + L + + PA+R YIPK NGKLRPLGIP ++DRI Q + A+E Sbjct: 91 EWDRI--KLFNSLKDYSIKNIKSQPAKRTYIPKKNGKLRPLGIPIIKDRIYQNIVKNALE 148 Query: 116 PIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLM 175 P WES F +++YGFRP+RS H AI + L+L + +W+ EGD FD ++H +M Sbjct: 149 PQWESKFESIAYGFRPKRSTHDAIEQLYLKLRKGSKR--QWIFEGDFKGCFDNLNHEYIM 206 Query: 176 KAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 + I+D +++ +KAG+ID +FR +EG PQGG+ISPLL+NI L+ ++ L Sbjct: 207 EC----INDFPAKEAVYRWLKAGYIDNNVFRNTNEGTPQGGIISPLLANIALHGMEEELG 262 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAI 295 +Y T + ++ +YADDFV++ K TK + E + Sbjct: 263 VKYQ-------------------FTKRQGYCLRDNSIGIVKYADDFVILCK-TKEEAETM 302 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARN 355 E L+ L L DKT I H++ GF FLG + + + + M ++ + + Sbjct: 303 YERLSPYLKKR-GLELAEDKTGITHISKGFDFLGFNIRQYK-KIKGMTLLIKPSKASMKK 360 Query: 356 FAASLTALLWKVR---------ISGEILLG 376 S+ + + R I+ G Sbjct: 361 AKKSIKEVFERYRGNSVEVIIGKINPIIRG 390 >UniRef50_UPI0001C42A66 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42A66 Length = 624 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. 
Identities = 102/377 (27%), Positives = 176/377 (46%), Gaps = 24/377 (6%) Query: 4 KLATWAATDPSLR----IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 KL + + + LL +I E + A ++KG++T G DG + Sbjct: 20 KLYLISKEYKDSKKHPCFKGLLEIIQSDEVILTAIHKIKANKGSNTKGTDGETIDDILQD 79 Query: 60 -LAVELQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPI 117 + +R L+ Y P RRV+I K + RPLGIPA+ DRI+Q + M +EPI Sbjct: 80 GYESVISRVRKCFLA--YNPKLLRRVHIDKQVSKDKRPLGIPAIIDRIIQECIRMIIEPI 137 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E+ F + SYGFRP RS HA+ V +T WV+EGD+ +FD V+H +L+K Sbjct: 138 LEAQFFSHSYGFRPYRSAEHALSKVT---NTAYDTNYCWVVEGDIKKFFDNVNHTILIKK 194 Query: 178 V-RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 + I D R + ++ ++ G + G + G PQGG+ISPLL+N L+ D ++ Sbjct: 195 LYSMGIRDRRVLMIIKAMLQCGVL--GEAEQTTVGTPQGGIISPLLANAYLDSLDHWITR 252 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + + + + + + + ++ + + RYADD+VLI KA + Sbjct: 253 EWENKETKHEYSRLDGKYRALKNASNL------KPAHFVRYADDWVLITNS-KANAIKWK 305 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRSRYGE--MRVVSTIPQEKA 353 + L+ LKL L+ +KT I ++ F+G + ++ G+ + ++ Sbjct: 306 QRIAKHLKEQLKLELSEEKTLITNIKKKAIKFVGFHFKQVKNGKGKNGWVTRTEPDPKRL 365 Query: 354 RNFAASLTALLWKVRIS 370 ++ L ++ + Sbjct: 366 EIKIQTIRKNLKAIKRT 382 >UniRef50_C6MS68 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MS68_9DELT Length = 439 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 108/376 (28%), Positives = 158/376 (42%), Gaps = 44/376 (11%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAV 62 R +A A D R + L R +T E+L + GVD V L Sbjct: 16 RGIADKAKADKQHRFRNLYRELTA-EYLLNCW---PDLNKSAASGVDKVTAEAYAEELHG 71 Query: 63 ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 + L + L Y+ RR +IPK N K RPLGIPAL D++VQ A + I+E DF Sbjct: 72 NILNLAERLKDKKYRTKLVRRCWIPKENEKERPLGIPALEDKLVQLACAKLLIAIYEQDF 131 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 SYG+RP RS HA++ + L +++E D+ +FD + H L+K + RI Sbjct: 132 LDHSYGYRPGRSAKHAVQDLTFDLQYGS---YGYIVEADIKGFFDRMDHDWLLKMLSLRI 188 Query: 183 SDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLN-EFDQYLHERYLS 240 +D F+ L+ K +KAG ++ G G PQGG++SP+L+N+ L+ D + E Sbjct: 189 NDRAFLHLIEKWLKAGILETDGTVTNPYTGTPQGGIVSPVLANVYLHFALDLWFEEVVKP 248 Query: 241 GKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR 300 + K CRYADD+V + K + E Sbjct: 249 --------------------------RCKGDARICRYADDWVCAFQ-LKDDAQRFYWELP 281 Query: 301 GVLEGSLKLRLNMDKTKIPHVN-------DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 LE L +KT I + F FLG R R GE RV ++K Sbjct: 282 NRLE-KFHLETAPEKTNIVRFSRFHPGMERRFTFLGFEFFWLRDRQGEPRVKRRTSRKKL 340 Query: 354 RNFAASLTALLWKVRI 369 + + + + R Sbjct: 341 QGACKRIKKWIKENRH 356 >UniRef50_A8VT23 S-layer domain protein n=12 Tax=Bacilli RepID=A8VT23_9BACI Length = 422 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 46/354 (12%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 +L+ + P+ L A +S+KG PGVDG+ L+A + + L ++ G YQP Sbjct: 2 QLIDRVVCPDNLNLAMNRVISNKGN--PGVDGMTVDQLEAHVRQYAKPLIAKIQKGTYQP 59 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 LP +RV IPK NGK R LGIPA+RDR+VQ+A+ +EPI + F SYGFRP ++ A Sbjct: 60 LPVKRVEIPKENGKKRKLGIPAVRDRMVQQAIFQVIEPIIDPHFSPNSYGFRPGKNAKQA 119 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 I+ + + V++ DL SYFDT+ H+ LM + + I D + L+WK +K+G Sbjct: 120 IKQA----AKYYDEGFKMVVDIDLKSYFDTIPHQKLMNYLEQYIQDPIILKLIWKFLKSG 175 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + + ++ G PQGG +SP+LSN+ L+E D+ L Sbjct: 176 IMIGDNWESSRNGAPQGGNLSPILSNVYLHELDKELER---------------------- 213 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF + VK +A E + LEG+LKL +N +K+ I Sbjct: 214 -----------RGHRFVRYADDFCIYVKSRRA-AERVLLNTTTFLEGTLKLSVNQEKSAI 261 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGE 372 FLG + E R + F A L L + + + Sbjct: 262 GSPTKR-KFLGFCI---HKSNNETRCRPHHASKA--KFKAKLKYLTRRNQANSF 309 >UniRef50_Q93PB4 MS117, putative maturase n=1 Tax=Microscilla sp. PRE1 RepID=Q93PB4_9SPHI Length = 462 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. 
Identities = 103/363 (28%), Positives = 168/363 (46%), Gaps = 48/363 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 + L+ + L + + + + G+ PGVDG+ L+ + Q L ++L G+Y+ Sbjct: 43 RGLMYKVCDLSNLTASLKQVVKNGGS--PGVDGMQVKELRYWFSNNHQKLIEQLKEGNYR 100 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P+ + IPK G +R LGIP ++DR+VQ+A+ + ++ F SYGFR R+ H Sbjct: 101 PMTIKGQEIPKPGGGVRQLGIPTVQDRLVQQAIAQQLSKRYDPTFSQYSYGFRKGRNAHQ 160 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+R + + +V++ DL +FD V+H LM + RRISD R + L+ K +++ Sbjct: 161 ALRQAGAYV----KEGFNYVVDLDLEKFFDKVNHDRLMWLLGRRISDKRVLKLIGKFLRS 216 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G + GL G PQG +SPLLSNI+L+E D+ L Sbjct: 217 GILIGGLENQRISGTPQGSPLSPLLSNIVLDELDKELER--------------------- 255 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD +L+V+ + E +E L L++N DK++ Sbjct: 256 ------------RGHRFVRYADDMILLVRS-QEAAERAYSSITSFIENRLLLKVNKDKSR 302 Query: 318 IPHVNDGFIFLGHRLIRKR----SRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 I FLGH ++ SR E R + + RN SL ++ ++ + Sbjct: 303 ICRPYQ-LNFLGHSIMWDGKLGLSRQSEQRFKEKVKKVTRRNRGISLEQMVKEL---NRV 358 Query: 374 LLG 376 L G Sbjct: 359 LRG 361 >UniRef50_Q3S275 ORF718 n=2 Tax=Eukaryota RepID=Q3S275_THAPS Length = 718 Score = 236 bits (601), Expect = 1e-60, Method: Composition-based stats. 
Identities = 104/358 (29%), Positives = 187/358 (52%), Gaps = 21/358 (5%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 + + L +++ P +L A S+ G+ T ++K L L+ + + + Sbjct: 157 NKKCVNLSSIMSDPNFLIAAWARIRSNSGSLTFA---LSKETLDGIALSWLEETANTMRN 213 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G +Q P+RR YI KS+G RPL IP+ RD+IVQ AM + ++E DF S+G+ R Sbjct: 214 GIFQFSPSRRTYISKSDGGKRPLTIPSPRDKIVQEAMRFLLMLVFEGDFSKNSHGWVSGR 273 Query: 134 SVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H A+ +K++ W+IEGD+ F +++H++L+ ++ +I D F+ L++K Sbjct: 274 GCHTALNQIKMEFA-----HDNWLIEGDIDQQFPSLNHQVLVNLLKTKIDDQAFIDLIYK 328 Query: 194 TIKAGHID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE----RYLSGKARKDRW 248 ++ G+ + G QGGV+SP+L+NI + FD+++ +Y GK RK Sbjct: 329 YLRVGYGESPDKIVKMRIGTSQGGVLSPVLANIYMTPFDKWVERDLIPKYTKGKRRKANP 388 Query: 249 YWNNSIQRGR-----STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + I+ G+ ++ + + + Y RYADDF++ + G K + I +EC+ L Sbjct: 389 VYTKMIRSGKVTDHSIPSLYAHDRNFIRLHYVRYADDFIMGLNGPKVYCKQIVDECKTFL 448 Query: 304 EGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASL 360 LKL LN++KTKI H D FLG+R+ +++ +M++ + + +R ++ Sbjct: 449 FEQLKLTLNIEKTKITHSQLDSATFLGYRV--YKTKLSKMKIAHNLKGQLSRRTTNTV 504 >UniRef50_Q6EI10 Reverse transcriptase/HNH endonuclease n=2 Tax=Eukaryota RepID=Q6EI10_9CHLO Length = 600 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. 
Identities = 103/414 (24%), Positives = 174/414 (42%), Gaps = 65/414 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQ--PEWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q + + + R+++ L+ L R T + G T GVD + Sbjct: 34 LQGFIYSASKAGDIKRVRKFQHLLVNSYEAKLLAIRRATQDNTGKKTAGVDKARALTPKQ 93 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPI 117 RL L L Q P RRV+IPK +RPLGIP ++DR +Q + +EP Sbjct: 94 RLE-----LASSLRIPT-QSSPLRRVWIPKPGTDVMRPLGIPTIKDRCLQALFKLMLEPE 147 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F S+GFRP RS A+ ++ + R ++V++ D++ FD ++H+ L+ Sbjct: 148 WEAKFEPSSFGFRPGRSCRDALAAIQANIQ----KRSKYVLDADIAKCFDRINHKALLDK 203 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + R L +KAG +D F G PQGG++SP+L+NI L+ + +L + Sbjct: 204 IGMTGGFGR---QLLAWLKAGVLDGSTFSETDLGTPQGGIVSPVLANIALHRMEDHLKKF 260 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 +RG + V RYADDFV++ K + A + Sbjct: 261 VCQFPMTYASGTVIKKSRRGET------------VTLIRYADDFVVLHHD-KKILLACKA 307 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND---------------GFIFLGHRLIRKRSRYGEM 342 E L G + L + KT++ H + GF+FLG+++ + S+YG Sbjct: 308 ELVRWL-GEMGLEFSPTKTRLTHTLELQSDDVEAEGFDGTVGFVFLGYQIKQFASKYGSA 366 Query: 343 RVVSTIP----------QEKARNFAASLTALLW----------KVRISGEILLG 376 + + IP + + L L+ +++ ++ G Sbjct: 367 KSTAGIPLGYKTLIFASAKARKKHQDKLHELIIATGKGLGQLALIKVLNPVIRG 420 >UniRef50_Q35056 CoxII intron2 ORF n=4 Tax=Embryophyta RepID=Q35056_MARPO Length = 827 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. 
Identities = 112/361 (31%), Positives = 178/361 (49%), Gaps = 21/361 (5%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + L + + + A + + G+ PG G+ R +L Sbjct: 319 LWISSFRRRDWIYHDLSNYLKSMDIWSIAYQKLRPNPGSMNPGTHGLTIDGTSFR---KL 375 Query: 65 QILRDELLSGH--YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDF 122 Q LRD +L Y+ + + P K+ LGIP +DRIVQ + M +EPI+ES F Sbjct: 376 QALRDAVLDSESPYEWGGTKIITKPGKREKI-SLGIPCFQDRIVQEVLKMLLEPIYESIF 434 Query: 123 HTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRI 182 S+G+RP RS H A+RT++ + W++ G+++ FD V+H +L +RR+I Sbjct: 435 SRRSHGWRPGRSAHTALRTIRSDF-----KKTNWIVPGNINKLFDIVNHGILCHIMRRKI 489 Query: 183 SDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 D + + L+ +KA + G ++ G PQG ++SP+LSNI L+EFD ++ ER Sbjct: 490 RDKKLLKLIAGGLKAKIHMPYGNIEESNLGTPQGRILSPILSNIYLHEFDIWIEERIQQY 549 Query: 242 KA-RKDRWYWNNSIQRGRSTAVRENW----QWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 RK+ W ++G+ R + Y RY DDF++ ++G + +AIR Sbjct: 550 NLGRKETRSWVLLRKQGKMRKARLRSDPFNPLYRRMEYRRYGDDFLIAIRGPLSDAKAIR 609 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRK----RSRYGEMRVVSTIPQEK 352 +EC L LKL LNM+KT I H++ G FLGHR+ R+ + RY ++K Sbjct: 610 QECETFLREKLKLLLNMEKTHIKHISVGIPFLGHRIGRRVVHTKQRYQTQEGWRWRIKKK 669 Query: 353 A 353 Sbjct: 670 V 670 >UniRef50_C6MRB5 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MRB5_9DELT Length = 433 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. 
Identities = 83/251 (33%), Positives = 127/251 (50%), Gaps = 6/251 (2%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 +RL+ I + A + +KG PGVD + T L+ L +++ELL+G Y Sbjct: 171 ERLMEEIVSRGNMMAAYSKVVGNKG--APGVDNMPVTELKGYLQEHWPRIKEELLAGKYI 228 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P R+V IPK +G R LGIP + DR++Q+A+ + ++ F SYG+ P RS H Sbjct: 229 PQPVRKVEIPKPDGGKRMLGIPTVLDRLIQQAVSQVLGRLFIPCFSKHSYGYIPGRSTHQ 288 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 AI+ + + + W ++ DL +FD V+H +LM V+R++ D + + L+ +KA Sbjct: 289 AIQAARQYVAEGRR----WAVDIDLEKFFDRVNHDILMSLVKRKVKDRQVLKLIDSYLKA 344 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G GL EG PQG +SPLLSNIML+E D+ L +R + S+ Sbjct: 345 GMFIGGLVSPRQEGTPQGSPLSPLLSNIMLDELDKELEKRGHAFCRYAVMPTIATSMLPP 404 Query: 258 RSTAVRENWQW 268 R Sbjct: 405 RRAVNGSWQPS 415 >UniRef50_B1I9Z1 GBSi1, group II intron, maturase n=33 Tax=Firmicutes RepID=B1I9Z1_STRPI Length = 425 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. 
Identities = 106/350 (30%), Positives = 158/350 (45%), Gaps = 46/350 (13%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 + +LL I E + EA S+KG+ G+DG+ + L ++ ++ + Y Sbjct: 1 MSKLLDKILSRENMLEAYNQVKSNKGS--AGIDGMTIEEMDNYLRQNWRLTKELIKQRKY 58 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 +P P +V IPK +G +R LGIP + DR++Q+A++ + PI E F +SYGFRP RS Sbjct: 59 KPQPVLKVEIPKPDGGIRQLGIPTVMDRMIQQAIVQVISPICEPHFSDMSYGFRPNRSCE 118 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 AI L D W+++ DL +FDTV LM V I D +L+ K + Sbjct: 119 KAIMKFLEYLND----GYEWIVDIDLEKFFDTVPQDRLMSLVHNIIEDGDTESLIRKYLH 174 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 +G I G G PQGG +SPLLSN+MLNE D+ L Sbjct: 175 SGVIINGQRHKTLVGTPQGGNLSPLLSNVMLNELDKELE--------------------- 213 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 K + + RYADD V+ V G++A + + +E L L++NM K Sbjct: 214 ------------KRGLRFVRYADDCVITV-GSEAAAKRVMYSASRFIEKRLGLKVNMTKA 260 Query: 317 KIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 KI + +LG + + S Q+ R F L L + Sbjct: 261 KITRPRE-LKYLGFGFWKSSDGWK-----SRPHQDSVRRFKLKLKKLTQR 304 >UniRef50_C7V8C7 Reverse transcriptase n=1 Tax=Enterococcus faecalis CH188 RepID=C7V8C7_ENTFA Length = 496 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. 
Identities = 100/367 (27%), Positives = 168/367 (45%), Gaps = 38/367 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ ++RL LIT + A A + +S+KG +T G+DGV Sbjct: 30 LQLRIVKATQQGKWRLVRRLQYLITHSFYAKALAVKKVISNKGKNTAGIDGVIWKT---- 85 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIW 118 + + ++L HY P +R+YI K K RPLGIP + DR +Q L A+EP+ Sbjct: 86 -DSQKKQAIEQLNPNHYSPKAVKRIYITKFGKKEKRPLGIPCMLDRAMQALYLQALEPVS 144 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E + SYGFR +S A V L C + +W++EGD+ FD + H+ L+ + Sbjct: 145 ECISDSNSYGFRRFKSAKDAGEKVFKVL--CRQYSAQWILEGDIKGCFDNISHQWLIDNI 202 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +L K +K+G+++ + G QGG+ISP L+NI L+ ++ + +Y Sbjct: 203 ------PLEKNMLRKFLKSGYMEKKKLFPTTMGTAQGGIISPTLANITLDGLEKRIKSKY 256 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 S K +N K V + RYADDF++ + ++ I+ Sbjct: 257 WSNKKGTIGVRYN-----------------KHKVNFVRYADDFIVTGDSPEILLK-IKNM 298 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 L+ L L+ +KT I H+N GF FLG +Y +++ ++ + Sbjct: 299 INEFLKER-GLSLSEEKTLITHINQGFDFLGWNFR----KYKRYKLIVQPSKKSIKRMKQ 353 Query: 359 SLTALLW 365 +L ++ Sbjct: 354 TLKQVVK 360 >UniRef50_B7KM76 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM76_CYAP7 Length = 309 Score = 233 bits (594), Expect = 9e-60, Method: Composition-based stats. 
Identities = 94/319 (29%), Positives = 146/319 (45%), Gaps = 41/319 (12%) Query: 17 IQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY 76 + L+ +A +KG G+DG L L L + + + +Y Sbjct: 1 MTNLIDEFLSLPNFRQAWFKVADNKGC--AGIDGETIEHFALNLDFNLTFLLNSVTNSNY 58 Query: 77 QPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVH 136 P P ++V IPKS K R L IP +RDRIVQ+A+L + P+ E F S+ +RP RS Sbjct: 59 IPQPLKQVLIPKSQEKWRELRIPTVRDRIVQQALLNVLYPVMEERFSDASFAYRPNRSYL 118 Query: 137 HAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A++ + +WV++ D+ YFD + H LL+K VR+ + ++ + L+ I Sbjct: 119 DAVKRA----AYWRDLGYQWVLDADIVEYFDNISHSLLLKEVRKTVDNSGILCLIKAWIS 174 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 AG +GVPQG VISP+L+NI L+EFD + Sbjct: 175 AGVSTDKGIIFPEKGVPQGAVISPMLANIYLDEFDHRI---------------------- 212 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 + + RYADDF+++ T+ + + +L L+L+ +KT Sbjct: 213 -----------TQSDLKLVRYADDFLVL-SDTEDGIMRAYSQVVQLLH-FWGLKLHEEKT 259 Query: 317 KIPHVNDGFIFLGHRLIRK 335 +I H GF FLGH +RK Sbjct: 260 QITHFKKGFQFLGHGFLRK 278 >UniRef50_B4WV39 Group II intron, maturase-specific domain family n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WV39_9SYNE Length = 586 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. 
Identities = 95/402 (23%), Positives = 159/402 (39%), Gaps = 71/402 (17%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ + ++ L RL++ + R+T +KG T GVDG+ Sbjct: 27 LQQRIYRASQRGDDRTVKSLQRLLSTSWSARMLAVRRVTQENKGKKTAGVDGIASLKAPE 86 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 R + + D+ P RRV IPK + R LGIP +RDR Q +A+EP Sbjct: 87 RTELAKNLTLDK------NADPVRRVLIPKPGKSEYRKLGIPTMRDRAKQALAKLALEPQ 140 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS A+ V ++ + +WV++ D+++ FD + H L+ Sbjct: 141 WEALFAPNSYGFRPGRSPQDALEQVHRCIS----QKPKWVLDADIAACFDQISHGPLVAR 196 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + S +KAG +D G + G PQGG+ SPLL+NI L+ + + Sbjct: 197 LSQ--SHPSIARQCKAWLKAGVLDNGQIQLTERGTPQGGIASPLLANIALHGLETLVKTT 254 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 R+ADDFV+ + + + + Sbjct: 255 I-------------------------------RGAHLVRFADDFVVFHQD-REAIFKAQT 282 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV-------NDGFIFLGHRLIRK-------RSRYGEMR 343 R L L+L DKT+I H + GF FLG + + + Sbjct: 283 LIRQWLASK-GLKLRADKTRIVHTLNGGAEYSTGFDFLGCHVRQYRTRRRRINKSQRPYK 341 Query: 344 VVSTIPQEKARNFAASLTALLWKVR---------ISGEILLG 376 + + + L ++ + R +++G Sbjct: 342 TLIKPSKASLKKLVTKLREIIKQHRGCSQAALIEALNPVIVG 383 >UniRef50_Q8YLU0 All5206 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YLU0_ANASP Length = 539 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. 
Identities = 108/387 (27%), Positives = 175/387 (45%), Gaps = 46/387 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q ++ + +++L +L+ + L ++T + G T GVDG Sbjct: 40 LQVRIFKAQKNGNTRLVRKLQKLLLSSKAAKLLAIRQVTQLNTGRKTAGVDGKKALEPSQ 99 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 RLA+ ++++ ++ P +RVYIPK++G R LGIP + DR Q + A+EP Sbjct: 100 RLALYEVLVKN---WKQWKHQPLKRVYIPKADGTRRGLGIPTISDRAYQCLIKYALEPAA 156 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETR-GRWVIEGDLSSYFDTVHHRLLMKA 177 E+ F+ SYGFRP RS H + + L + ++E D+ FD + H+ LM++ Sbjct: 157 EAMFNARSYGFRPGRSCHDVQKLLFSNLNGGQANGLSKRILELDIERCFDKIDHKFLMQS 216 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 V+ ++ IKAG G F ++ G PQGGVISPLL+NI+L+ + HE Sbjct: 217 VQ---LPKAAKQGIFWAIKAGVR--GEFPSSESGTPQGGVISPLLANIVLHGLENVGHEL 271 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 VR + + RYADD V ++K + EA+R+ Sbjct: 272 ---------------------RYKVRSGGRQIDTIKGFRYADDVVFLLK-PEDNPEALRQ 309 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFA 357 LE L++ KTKI H D F FLG + K + +ST Q+ + Sbjct: 310 NIDTFLEAR-GLKVKEAKTKIVHSTDSFDFLGWNFVVKP----NGKFISTPSQKATSSIK 364 Query: 358 ASLTALLW--------KVRISGEILLG 376 A + ++ ++ G I+ G Sbjct: 365 AKVKEVMKDSCFTLEERIDKCGAIVRG 391 >UniRef50_Q94Z24 Orf568 n=1 Tax=Pylaiella littoralis RepID=Q94Z24_PYLLI Length = 568 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. 
Identities = 104/379 (27%), Positives = 166/379 (43%), Gaps = 36/379 (9%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q LA S + +L R + A A R ++KG +TPG++G +L Sbjct: 27 QNNLAVAELKGDSGLVTKLQRNLVNSFAGRALAVRAITTNKGKNTPGINGEIWDTSIKKL 86 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 + +Y P +RVYIPKS GKLRPLGIP + DR +Q +A++PI E Sbjct: 87 DA----IHRLGRVSNYSCSPVKRVYIPKSGGKLRPLGIPNMYDRGLQYLWKLALDPIAEC 142 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 SYGFR RS + L L+ ++R WV+E D+ +FD ++H +++ + Sbjct: 143 RADRHSYGFRKGRSTQDVHTILHLLLS--PKSRCDWVLEADIRGFFDNINHDWIIQNIPM 200 Query: 181 RISDARFMTLLWKTIKAGHID--VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +L + +KAG ++ F GVPQGG ISPL++N+ L+ + ++ Sbjct: 201 D------KNILREWLKAGALETTTQEFHKGIAGVPQGGPISPLIANMTLDGLEVWVANSV 254 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + W+ P V RYADDFV+ + + ++ Sbjct: 255 KHLYKKSKETSWS------------------PKVNVVRYADDFVVTAATKRILEDIVKPS 296 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEM--RVVSTIPQEKARNF 356 + L L LN +KT I V GF F+G + G + + +E R Sbjct: 297 IQDFLASR-GLVLNQEKTCITSVKKGFDFVGFNFRVYPDKSGPKGAKSIVKPTKEGKRRL 355 Query: 357 AASLTALLWKVRISGEILL 375 + + + + SGEI++ Sbjct: 356 RSKIRNAVKTNKSSGEIIV 374 >UniRef50_Q8A4I4 Reverse transcriptase n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8A4I4_BACTN Length = 430 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. 
Identities = 97/366 (26%), Positives = 159/366 (43%), Gaps = 57/366 (15%) Query: 25 TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRV 84 + + +A +++G+ G+D V + L L L + + SG Y P + V Sbjct: 11 ISKQLVYDAFLRVKANRGS--AGIDKVTLEDYEKNLRGNLYKLWNRMSSGSYFPPSVKLV 68 Query: 85 YIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 IPKS G RPLGIP + DR+ Q A++M + P E FH SY +RP RS H A+ + Sbjct: 69 EIPKSTGGKRPLGIPTVSDRVAQMAVVMLITPSIEPCFHEDSYAYRPHRSAHDAVGKARE 128 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VG 203 + + WV++ D+S +FDT+ H LL+KA++R + + + + +K + G Sbjct: 129 RCW-----KYAWVLDMDISKFFDTIDHELLLKALKRHTQEKWVLMYIERWLKVPYEKSDG 183 Query: 204 LFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAV 262 + GVPQG VI P+L+N+ L+ FD+++ + + Sbjct: 184 SQVDRALGVPQGSVIGPVLANLFLHYTFDKWMEKNF------------------------ 219 Query: 263 RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN 322 P V + RYADD + K Q E ++ + E +LRLN +KTKI + Sbjct: 220 -------PRVPFERYADDTICHCHSLK-QAEYMQAMIQQRFEC-CRLRLNEEKTKIVYCK 270 Query: 323 DG----------FIFLGHRLIRKRSRYGEMRVVS----TIPQEKARNFAASLTALLWKVR 368 F FLG + S + I ++ + ++ + R Sbjct: 271 SSRQKECYPNVTFDFLGFTFQPRESVDKYGNRFTGFLPAISRKSMKRINETMRSW-HLNR 329 Query: 369 ISGEIL 374 S L Sbjct: 330 HSNLTL 335 >UniRef50_C6MXE9 RNA-directed DNA polymerase n=1 Tax=Geobacter sp. M18 RepID=C6MXE9_9DELT Length = 512 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. 
Identities = 104/386 (26%), Positives = 167/386 (43%), Gaps = 57/386 (14%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A + +++ L L+T+ A + S+KG TPGVDGV + R Sbjct: 34 LQIRIAKAVLENRWNKVKTLQHLLTRSFHAKLLAVKRVTSNKGKKTPGVDGVLWKTAKVR 93 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L Y+P P +R+YIPK NGK RPL IP ++DR +Q +A+ P+ E Sbjct: 94 WRAAC-----SLRRRGYKPQPLKRIYIPKKNGKKRPLSIPTMQDRAMQALYKLALAPVAE 148 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + SYGFR RS A L+ WV+E D++ +D + +++ + Sbjct: 149 TTADGNSYGFREGRSCADATAAAFNALSKPN--SAPWVLEADITGCYDNICQNWMLENIP 206 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 +L K ++AG+I+ G+ + +G PQGG+ISP L+N+ L+ ++ + Sbjct: 207 MD------REVLRKWLEAGYIEDGILYPSHKGTPQGGIISPTLANMTLDGLERVI----- 255 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 R + V + RYADDF++ K + AIR Sbjct: 256 -----------------------RTAVPRRCRVNFVRYADDFIVTGKSRRLLETAIRPAI 292 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 L G L L+ +KT I H+ DGF FLG + + + T +E Sbjct: 293 EKFLSGR-GLSLSPEKTAITHIKDGFTFLGQTYRKTGNV-----LHITPAKEGVLALRRK 346 Query: 360 LTALLWK---------VRISGEILLG 376 + L+ + V+ E L G Sbjct: 347 VGTLIRRHVSAPMPILVKKLNETLRG 372 >UniRef50_B3PDY2 Putative maturase n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PDY2_CELJU Length = 402 Score = 231 bits (588), Expect = 4e-59, Method: Composition-based stats. 
Identities = 97/377 (25%), Positives = 158/377 (41%), Gaps = 65/377 (17%) Query: 25 TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRV 84 + + A ++ G G D V M + L EL L + + SG Y P +RV Sbjct: 6 ISKQLVYAAWLKVKANAG--VAGADNVCIDMFEHNLENELYKLWNRMSSGSYMAPPVKRV 63 Query: 85 YIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 + K++GKLRPLGIP + DR+ Q + M +EP W+S FH S+G+RP RS HHA++ K+ Sbjct: 64 EMAKADGKLRPLGIPTVADRVAQMVVKMTLEPEWDSKFHASSFGYRPRRSAHHAVQAAKI 123 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH-IDVG 203 + WVI+ D+ +FD ++H L K V + D + + I AG + G Sbjct: 124 NCW-----KYSWVIDLDIKGFFDNLNHDQLQKFVAQATDDPWCKLYIKRWITAGVQMPGG 178 Query: 204 LFRAASEGVPQGGVISPLLSNIMLN-EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAV 262 ++G PQGGVISPLL+N+ L+ FD ++ + + + Sbjct: 179 ELHKTAKGTPQGGVISPLLANLYLHKVFDSWMQKYFPQNPFER----------------- 221 Query: 263 RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN 322 YADD V + T+ + E + ++ L L+ +KTKI + Sbjct: 222 --------------YADDIVCHCR-TEHEAEQLLSAISRRMQ-RFDLTLHPEKTKIVYCG 265 Query: 323 D---------GFIFLGHRLIRKRSRYGEMR----VVSTIPQEKARNFAASLTALLWKV-- 367 F FLG R+ + + + V I + + ++ + Sbjct: 266 RRKIERTKAQSFDFLGFTFRRRTVKRKDGKLVPGFVPAISNKAKKAIVKTMREWNVRRLS 325 Query: 368 --------RISGEILLG 376 + + G Sbjct: 326 RLSIVTLSKKLNPQIRG 342 >UniRef50_A7BUU9 RNA-directed DNA polymerase n=1 Tax=Beggiatoa sp. PS RepID=A7BUU9_9GAMM Length = 585 Score = 230 bits (586), Expect = 7e-59, Method: Composition-based stats. 
Identities = 117/390 (30%), Positives = 187/390 (47%), Gaps = 43/390 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 ++R +A ++ +L R+ + L ++T ++G T GVDG T + Sbjct: 34 LERSIAKAVEHHDFAKVAKLRRIFRTSKNVRLLSVRKVTQDNRGKKTAGVDGKVITSEKD 93 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKL-RPLGIPALRDRIVQRAMLMAMEPI 117 R + + + G + P RRV+IPKSN K RPLGIP + DR+ Q + + +EPI Sbjct: 94 RWRLA----SNVRIDG--KSNPLRRVWIPKSNSKELRPLGIPTIEDRVKQMMLKLEIEPI 147 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 +E YGFRP RSVH AI + + + G WV+EGD S +FD ++ L+ Sbjct: 148 YEVQAEPNVYGFRPARSVHDAIEACFIAI--GCKKEGAWVLEGDFSKFFDNINKEHLLNM 205 Query: 178 VR-RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 ++ + I+D + + IK+G ID +F +G PQGGVISPLL+NI L+ + LH+ Sbjct: 206 MKSKGITDKETLQQVQAWIKSGVIDKEVFTKTDKGTPQGGVISPLLANIALHGMENMLHD 265 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIR 296 + K K + + RYADDFV+I K KA +E + Sbjct: 266 WVDTWKGTK--------------------RSNHQSFSVIRYADDFVVIHKD-KAVIEEAK 304 Query: 297 EECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYG-EMRVVSTIPQEKARN 355 L+ + ++LN KTKI H +GF FLG + + R G +++ ++ +K + Sbjct: 305 LCIEEWLDKGVGVKLNQTKTKITHTTEGFDFLGFNVRQYRVNNGSQLKFLTKPSMDKVKA 364 Query: 356 FAASLTALLWKVR---------ISGEILLG 376 S+ + +R I++G Sbjct: 365 HMESIRQVTKTMRAVSTQTLIDKLNPIIIG 394 >UniRef50_C5D9G3 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Firmicutes RepID=C5D9G3_GEOSW Length = 605 Score = 229 bits (583), Expect = 1e-58, Method: Composition-based stats. 
Identities = 104/364 (28%), Positives = 172/364 (47%), Gaps = 18/364 (4%) Query: 10 ATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNK-TMLQARLAVELQILR 68 R + L+ + + + A +++G+ T G DG +L + ++ Sbjct: 10 EKGDIPRFKGLVEIASSDVVIVSAIHKIKANQGSKTAGTDGQTINDILTKNYDEVINFVK 69 Query: 69 DELLSGHYQPLPARRVYIPKSNGKLRP-LGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 + Y+P RRVYIPK K + LGIP + DR +Q + M +EPI E+ F SY Sbjct: 70 RCFKN--YKPKLIRRVYIPKPGKKKKRPLGIPTIADRTIQECVRMTIEPILEAQFFQHSY 127 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR-ISDAR 186 GFRP R AI + C WVIEGD+ +FD V+H +L+K + I D R Sbjct: 128 GFRPYRDTKQAIER---CVFICNRIGYNWVIEGDIKGFFDNVNHTILIKQLWHMGIRDRR 184 Query: 187 FMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKD 246 + ++ +KAG + + G PQGG+ISPLL+N+ L++ DQ++ + K R Sbjct: 185 MLMIIKAMLKAGVM--KETKVNEIGTPQGGIISPLLANVYLHKLDQWITREWEEKKMRN- 241 Query: 247 RWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGS 306 +I+ + ++R + Y RYADD+VL ++ E + + L+ + Sbjct: 242 ----GTAIRTSKFNSLRNHSTITRPEFYVRYADDWVLFT-DSRENAEKWKYRIKKYLKEN 296 Query: 307 LKLRLNMDKTKIPHVNDGF-IFLGHRLI-RKRSRYGEMRVVSTIPQEKARNFAASLTALL 364 LKL L+ DKT I ++ FLG ++ + G+ ++ EK + + L Sbjct: 297 LKLELSDDKTLITNIKKKPMKFLGFKIKMLPHGKNGKYVGYASADTEKVKRKVEQIKKDL 356 Query: 365 WKVR 368 K++ Sbjct: 357 RKLK 360 >UniRef50_Q8TJY1 Reverse transcriptase n=5 Tax=Methanosarcina RepID=Q8TJY1_METAC Length = 512 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. 
Identities = 96/386 (24%), Positives = 179/386 (46%), Gaps = 46/386 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLIT-QPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A+ A + + +L RL+T + R ++KG+ TPG+DG+ + + Sbjct: 53 LQSRIASAAKNGKWITVNKLSRLLTRSLYAKLLSVRKVTTNKGSRTPGIDGIIWSSSADK 112 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + L +L + Y+ P R YI K NGKLRPL IP + DR +Q + + PI Sbjct: 113 MRSAL-----QLTNKGYRAKPLTRKYIRKKNGKLRPLSIPTMYDRAMQTLHSLVLGPIES 167 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 + S+GF+P RS A + + L+ + W++EGD+ + FD ++H ++ + Sbjct: 168 AIGDKTSFGFKPYRSTKDAYAYLHICLSK--KIAPEWIVEGDIKACFDEINHTWILDNIP 225 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 D R + + +KAG+++ +G PQG ISP++ N+ LN + L R+ Sbjct: 226 M---DKRIL---KEFLKAGYVENYHLFPTEKGTPQGSPISPIIGNMALNGLENALAMRFY 279 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 S + ++ Q + V R+ADDFV + +E I + Sbjct: 280 SR----------------SDGTIDKSHQNRHKVNCARFADDFVATADSPETALE-IIDVI 322 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 + L+ L+L+ +KT + ++++GF FLG + + +++ ++ R Sbjct: 323 QEFLDPR-GLKLSEEKTLVTNISEGFNFLGWNFRKYK-----GKLLPKPSKDSQREIIKK 376 Query: 360 LTALLWK---------VRISGEILLG 376 ++ ++ K +RI I+ G Sbjct: 377 ISDVIHKAKAWDQDRLIRILNPIIRG 402 >UniRef50_Q2FUJ3 RNA-directed DNA polymerase n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FUJ3_METHJ Length = 487 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. 
Identities = 100/382 (26%), Positives = 167/382 (43%), Gaps = 50/382 (13%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++A +RL L+T + A + ++G + GVDG T + + Sbjct: 38 LQTRIAKAVKNGQYRLARRLQYLLTHSFYAKMLAVQRVTKNRGKRSAGVDGEKWTTPEQK 97 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKS-NGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + L L Y+ P RR+YIPK + K+RPL IP + DR +Q MA+ P Sbjct: 98 MKAALT-----LSDKGYRAKPLRRIYIPKPQSSKMRPLSIPTMYDRAMQALYAMALMPWA 152 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ S+GFR +R+ A L+ +T G+W++EGD+ FD H+ ++ + Sbjct: 153 ETTADKTSFGFRMKRNAQDAASYTFQCLSR--KTSGQWILEGDIRGCFDNFAHQWMLDNI 210 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 D R + + +KAG+I G+ G PQGG+ISPLL+N+ L+ ++ L E + Sbjct: 211 P---LDQRILN---QFLKAGYIYDGILYRNKSGTPQGGLISPLLANMALDGMERMLKEHF 264 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 K V R+ADDF++ + ++ +E Sbjct: 265 PGNK-----------------------------VHLIRFADDFLVTADSQETALQ-CKEL 294 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR--YGEMRVVSTIPQEKARNF 356 L L L+ +KTKI H+N+GF FLG + + + + +K R Sbjct: 295 ITEFLHER-GLELSEEKTKIVHINEGFDFLGWNFRKFKGKFLIQPSKKAIAAIIDKVRVI 353 Query: 357 AASLTALLWK--VRISGEILLG 376 S A + ++ ++ G Sbjct: 354 IKSAKAWKQEDLIKALNPVIKG 375 >UniRef50_A5IEI2 Reverse transcriptase n=7 Tax=Bacteria RepID=A5IEI2_LEGPC Length = 454 Score = 227 bits (578), Expect = 6e-58, Method: Composition-based stats. 
Identities = 101/362 (27%), Positives = 157/362 (43%), Gaps = 55/362 (15%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 + + +A ++ ++ G+ G+D + L L L + + SG Y P + V I Sbjct: 49 KQLVWQAYKLVRANAGS--AGIDNQSIDEFSQDLKGNLYKLWNRMSSGSYFPPAVKEVAI 106 Query: 87 PKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQL 146 PK G +R LGIP + DRI Q + + MEP+ E F SYG+RP +S A+ + + Sbjct: 107 PKKQGGVRKLGIPTVADRIAQMTVKLMMEPLLEPHFLDDSYGYRPNKSALDAVGVTRKRC 166 Query: 147 TDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDV-GLF 205 + WV+E D+ FD + H LLMKAV+ ISD + + + + A D G Sbjct: 167 WE-----YDWVVEFDIKGLFDNLSHELLMKAVKHHISDRWILLYVERWLTAPIQDQHGGC 221 Query: 206 RAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRE 264 + G PQGGVISPLLSN+ L+ FD ++ + + Sbjct: 222 LPRTAGTPQGGVISPLLSNLFLHYAFDHWMTKHH-------------------------- 255 Query: 265 NWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG 324 P +CRYADD + + K + ++E + SL L ++ DKTKI + DG Sbjct: 256 -----PDNPWCRYADDGLAHCRTEKEAEQMLKEIDKRF--KSLGLEIHPDKTKIVYCKDG 308 Query: 325 ----------FIFLGHRL--IRKRSRYGEMRVVSTIPQEKARNFAA-SLTALLWKVRISG 371 F FLG+ R + R + P + A +L R + Sbjct: 309 ARKGKYKNKSFDFLGYTFKARRVKVRSRNSFFIGFTPVVSLKAVKAMTLKLRRGNWRNNT 368 Query: 372 EI 373 + Sbjct: 369 SL 370 >UniRef50_B1VA32 Retron-type reverse transcriptase n=7 Tax=Candidatus Phytoplasma RepID=B1VA32_PHYAS Length = 521 Score = 226 bits (577), Expect = 8e-58, Method: Composition-based stats. 
Identities = 105/371 (28%), Positives = 177/371 (47%), Gaps = 24/371 (6%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + + + SL+ + L R + E +A ++KG+ T GV ++ + Sbjct: 18 ILNHSKNNYSLK-KVLKREMNNIENTYKAFNSIATNKGSGTEGVGNKTIDGIK---LEMI 73 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 + E ++ Y P P ++V IPK K RPLGIP ++DRI+Q+AM + +E+ F Sbjct: 74 KKYHKEYVNNQYNPQPVKKVLIPKGKNKTRPLGIPTIKDRIIQKAMEQLLTLYFENIFLE 133 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 S+GFR ++S H A++ VK + + YF+T++H +L K + + I Sbjct: 134 WSFGFRSKKSCHDAVKRVKQRFKGIDYIIKID-----IKGYFETINHDILNKMLNKYIRK 188 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK-A 243 A+ + + + +KAG ++ G+ + G PQGG+ISPLLSNI L+ D+ + E +GK Sbjct: 189 AKTLKTINQWLKAGIMENGIKYESLSGTPQGGIISPLLSNIYLHYIDKKMEELIRNGKPI 248 Query: 244 RKDRWYWNNSIQRGR----STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 K + + + + S N P + Y RYADDF++ +KG + E I+ Sbjct: 249 MKANPEYWKAYTKKQHHNLSIDSEINLNPNPRIEYIRYADDFIIGIKGEHHEAERIKTHV 308 Query: 300 RGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYG----------EMRVVSTIP 349 LE LKL +N DK+KI G FL + + + +V IP Sbjct: 309 LKWLEQDLKLVVNRDKSKIVKTTKGTRFLSYMVKVNPTNKTRKKKTTKNSLNGQVQIQIP 368 Query: 350 QEKARNFAASL 360 ++ R++ Sbjct: 369 KDTTRDYGKEY 379 >UniRef50_Q1D1V6 Group II intron, maturase n=2 Tax=Myxococcales RepID=Q1D1V6_MYXXD Length = 436 Score = 226 bits (575), Expect = 2e-57, Method: Composition-based stats. 
Identities = 104/392 (26%), Positives = 169/392 (43%), Gaps = 62/392 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 ++ ++ A + P+ R L + +PE L A + G PG DG+ ++ R Sbjct: 11 LRARIGHRAKSAPTHRFWGLYVHVLKPEVLEAAYLEARRNGG--APGQDGITFEHIEERG 68 Query: 61 AV-ELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 L + +EL +G Y+P P RR IPK GK+R + IP++RDR+VQ A+ + +EPI+E Sbjct: 69 RAGFLGAVAEELRTGTYRPRPYRRREIPKEGGKVRVISIPSIRDRVVQGALRLVLEPIFE 128 Query: 120 SDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 +DF S+G RP RS H AI TV+ L ++ YFD++ H L++ V Sbjct: 129 ADFSGSSFGARPGRSAHEAIDTVRQGLRRRRHRVVDVDLKA----YFDSIRHAPLLERVA 184 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 RR+ D + L+ + +++ G+PQG +SPLL+NI LN+ D L Sbjct: 185 RRVQDGEVLALVKQFLRS---------TGDRGIPQGSPLSPLLANIALNDLDHVLDR--- 232 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 + + Y RY DD V++ ++ Sbjct: 233 ----------------------------GRGFLTYARYLDDMVVLAPDSEKGRRWAARAL 264 Query: 300 RGVLE--GSLKLRLNMDKTKIPHVND---GFIFLGHRLIRKRS-RYGEMRVVSTIPQEKA 353 + + +L + LN +KT+ + D F FLG KRS + G + ++K Sbjct: 265 ERIRQEAEALGVSLNKEKTRTVTMTDRNASFAFLGFDFRWKRSPKTGTWYPNTNPRRKKV 324 Query: 354 RNFAASLTALLWKVRIS---------GEILLG 376 + +L R I+ G Sbjct: 325 TEVLRRVRHVLRDSRALAVQSAVVEVNAIVRG 356 >UniRef50_Q8YKQ2 Alr7241 protein n=14 Tax=Cyanobacteria RepID=Q8YKQ2_ANASP Length = 562 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. 
Identities = 92/377 (24%), Positives = 158/377 (41%), Gaps = 37/377 (9%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +QR++ + + L +LI + + +I+ + G T G+DGV Sbjct: 65 LQRRVFKAVQAGNKRKARFLQKLILKSKAGRFLAIRQISQLNAGKKTAGIDGVKSLDFNG 124 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R +E+ + + SG++ R + IPK +G R L IP + DR Q A+EP Sbjct: 125 RFELEITLKQS---SGNWHHQELREIPIPKKDGTTRMLKIPTIADRCWQCLAKYALEPAH 181 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ FH SYGFR R+ H A + + L+ + + VIE D+ FD ++H +M+ + Sbjct: 182 EATFHARSYGFRTGRAAHDAQQFLFSNLSSKAKRISKRVIELDIEKCFDRINHSTIMENL 241 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 I+ +++ +KAG +G PQGGV+SPLL+NI LN + Sbjct: 242 ---IAPKGIKLGIYRCLKAGI----NPEFPEQGTPQGGVVSPLLANIALNGIESI----- 289 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + + + + RYADD V++++ + I + Sbjct: 290 -------------HRYHKDNQRITNKTPESDIRYPSVRYADDMVIVLR-PQDDANEILAK 335 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAA 358 L ++++ KTKI DGF FLG + + T +E + F Sbjct: 336 IEDFLNAR-GMKVSAKKTKITATTDGFDFLGWHI----IVQSNGKFNCTPSEENFKKFRQ 390 Query: 359 SLTALLWKVRISGEILL 375 + A++ G + Sbjct: 391 KVKAIV-NCSNYGSSVK 406 >UniRef50_Q12UG1 RNA-directed DNA polymerase n=53 Tax=cellular organisms RepID=Q12UG1_METBU Length = 592 Score = 224 bits (570), Expect = 5e-57, Method: Composition-based stats. 
Identities = 98/382 (25%), Positives = 168/382 (43%), Gaps = 37/382 (9%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q ++ +++L L+T + A R + +KG T G+DG + ++ Sbjct: 64 IQVRITKAVINKNWNLVKKLSYLLTHSHYAKLLAVRKVIRNKGRRTAGIDGEFWSTPVSK 123 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 + L Y+ P +R++I K + K RPLGIP + DR +Q +A++PI Sbjct: 124 VNA-----ARSLSDKRYKAKPLKRIFIEKYGSDKKRPLGIPTMYDRAMQALYALALDPIA 178 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E S+GFR RS H A + ++ + + ++EGD+ FD + H+ L+ + Sbjct: 179 EVTADKRSFGFRKFRSTHDACSQIFGTISK--KDSAQCILEGDIKGCFDNISHQWLIDNI 236 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 ++L + +KAG + G PQGG+ISP+L+N+ L+ + L ++Y Sbjct: 237 PMD------KSILKQFLKAGFVYENSLFPTKAGTPQGGIISPILANMTLDGIEGVLADKY 290 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 RG S + + K V + RYADDF++ K TK E +E Sbjct: 291 ----------------HRGVSGKITTRQRAKHKVNFVRYADDFIVTAK-TKEIAEEAKEL 333 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR--YGEMRVVSTIPQEKARNF 356 + L L L+ +KT I H++DGF FLG + + + + + EK N Sbjct: 334 IKNFLTDR-GLELSDEKTLITHIDDGFDFLGWNVRKYKGKLLIKPSKKSIKKVTEKISNT 392 Query: 357 AASLTALLWK--VRISGEILLG 376 + + I+ G Sbjct: 393 IKDGKTWTQEDLISKLNPIITG 414 >UniRef50_Q1VQM5 Prophage LambdaSa1, reverse transcriptase/maturase family protein n=7 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VQM5_9FLAO Length = 456 Score = 224 bits (570), Expect = 6e-57, Method: Composition-based stats. 
Identities = 112/392 (28%), Positives = 180/392 (45%), Gaps = 50/392 (12%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR- 59 QRKL A D + L + + L EA R S+ + GVD + ++ + Sbjct: 26 FQRKLYIRAKQDKGFKAYSLYGKLCEDHTLIEAYRRVRSNY-SKGVGVDNQSSDAIEKQG 84 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPK-SNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 ++V L ++ +L Y+ ++ IPK G R LGIP +RDR+VQ A+ M +EP+W Sbjct: 85 ISVFLGEIQQDLQGHTYRSQAVKQKLIPKEKEGDFRVLGIPTIRDRVVQMAVKMLIEPLW 144 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+DF S+GFRP+R AI+ VK + D + +V + DLS YFDT+ H L + Sbjct: 145 EADFEHTSFGFRPKRGAKDAIKQVKQNIYDRHQ----FVYDADLSKYFDTIPHTKLFILL 200 Query: 179 RRRISDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 ++R+ D ++L+ + + A + G A+++G PQGGVISPLLSNI L+ FDQ ++ Sbjct: 201 KKRLVDHSILSLIHQWLTAPVRLPNGKLVASTKGSPQGGVISPLLSNIYLHAFDQIVNNP 260 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 K + RYADDF+L+ K + I + Sbjct: 261 KGKF--------------------------AKANIRIVRYADDFLLMGKWY--FSKEILD 292 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYG---EMRVVSTIPQEKA 353 +++ + L LN +KTK+ H + FLG +S++G + + Sbjct: 293 YITSIMDN-MGLTLNKEKTKLLHSSKSSLFFLGFEFRSIKSKFGWNAKNYTNVRPSMKSR 351 Query: 354 RNFAASLTALLWKVRI---------SGEILLG 376 + L L + ++L G Sbjct: 352 SKLFSKLRELFANRKHWTIEWIVWKVNQLLRG 383 >UniRef50_A7MS60 Putative uncharacterized protein n=21 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7MS60_VIBHB Length = 430 Score = 224 bits (570), Expect = 6e-57, Method: Composition-based stats. 
Identities = 110/353 (31%), Positives = 165/353 (46%), Gaps = 47/353 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTM--LQARLAVELQILRDELLSGHYQP 78 + I L +A R +KG GVD ++ + R A Q LR LL G Y+P Sbjct: 1 MEQICSSTNLNQALRRVKKNKGC--AGVDKLDIDATIFKLRQASNGQALRQSLLDGSYRP 58 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P V IPK +G +R LGIP + DRIVQ+A+ + I+E+ F SYGFRP RS HHA Sbjct: 59 QPVLGVGIPKPSGGVRQLGIPTVLDRIVQQAITSVLSDIYEAKFSNSSYGFRPNRSAHHA 118 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + + + +V++ DL+ YFDTV+H LM + I+D R + L+ ++AG Sbjct: 119 LAAASRYIRE----GRGYVVDIDLAKYFDTVNHDRLMHRLSEDIADKRVLKLIRSYLQAG 174 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 + GL G PQGG +SPLLSNI+L+E D+ L Sbjct: 175 IMRNGLVEQRQRGTPQGGPLSPLLSNIVLDELDKELER---------------------- 212 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 +CRYADD + V G++ ++E LE LKL +N +K+ Sbjct: 213 -----------RGHKFCRYADDCQIYV-GSEEAAYRVKESITEYLEQKLKLTVNREKSAA 260 Query: 319 PHVNDGFIFLGHRLIRKR----SRYGEMRVVSTIPQEKARNFAASLTALLWKV 367 V + +L HR S+ + ++ + + Q RN L ++ ++ Sbjct: 261 TRVTER-TYLSHRFGIDGTIHISKPAQTQMKTRVRQITKRNRGRELKVVIAEL 312 >UniRef50_B8FP59 RNA-directed DNA polymerase n=9 Tax=Firmicutes RepID=B8FP59_DESHD Length = 320 Score = 224 bits (570), Expect = 6e-57, Method: Composition-based stats. 
Identities = 92/263 (34%), Positives = 143/263 (54%), Gaps = 12/263 (4%) Query: 9 AATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILR 68 + D S + QRL R + PE+ A + ++KG+ TPG+DG + + ++ + Sbjct: 14 KSKDESYKFQRLYRNLYNPEFYWLAYQNIYANKGSMTPGMDGTTISGIN---EERIRQII 70 Query: 69 DELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSY 127 L YQP PARRVYI K N K RPLGI D++VQ + M +E I+E F S+ Sbjct: 71 ASLKDQSYQPHPARRVYIEKKNSQKKRPLGISTANDKLVQEVVRMILESIFEPTFSDKSH 130 Query: 128 GFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARF 187 GFRP RS A+ ++ T W +EGD+ + FD+ H +L++ ++RRI DA F Sbjct: 131 GFRPVRSCQTALLQIQGNF-----TGVNWFVEGDIEACFDSFDHHVLIELLQRRIDDASF 185 Query: 188 MTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE---RYLSGKAR 244 ++L+WK +KAG+++ + +GVPQG ISP+LSNI L+E D ++ E + K Sbjct: 186 ISLMWKFLKAGYMEQWEYNMTYDGVPQGSGISPILSNIYLSELDTFMSEYKAAFDIRKTH 245 Query: 245 KDRWYWNNSIQRGRSTAVRENWQ 267 + N R ++ R+ + Sbjct: 246 GRKPSSNYCHARYMASKYRKESR 268 >UniRef50_Q8HQ89 ORF777 (Fragment) n=1 Tax=Schizosaccharomyces octosporus RepID=Q8HQ89_SCHOT Length = 777 Score = 223 bits (568), Expect = 1e-56, Method: Composition-based stats. 
Identities = 91/373 (24%), Positives = 158/373 (42%), Gaps = 37/373 (9%) Query: 11 TDPSLRIQRL-LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD 69 + + L R + + + A S+ N L + Sbjct: 191 ENDLNKFYELNYRQLYNLDLIKTAYLDLKSNSDNFEK---NNNNNSLDKISNEWAEKTCK 247 Query: 70 ELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 L + P+R++ + N + R + I +DR++Q+A M ++ +E F S+GF Sbjct: 248 SLKDRSFIFKPSRKIMVTCRNNQKREISIANGKDRVIQQAFKMILQSAYEPIFLNYSHGF 307 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RP RS H AI V+ TR W+I+GD+ +FD V H +L + ++I D + Sbjct: 308 RPGRSPHSAIFEVRKW------TRITWMIKGDIKGFFDNVDHHILANLLSKKIKDKNLID 361 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 WK ++AG+++ G ++ ++ G+PQGG++SPLL+NI L+E D Y+ + K Sbjct: 362 FYWKLVRAGYVNNGNYKVSNLGIPQGGILSPLLANIYLHELDVYMEQLIQKYTVNKPVSK 421 Query: 250 WNNSIQRG--------------------------RSTAVRENWQWKPAVAYCRYADDFVL 283 N R R + + + Y RY DD+V+ Sbjct: 422 KNKEHTRLFSEITAKSKKKFPDFELIKRMRKELRRIPSTIRDSSTGTRIYYNRYGDDYVI 481 Query: 284 IVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFLGHRLIRKRSRYGEM 342 V G K E I+ L+ L + LN +K++I H+ +LG+ + R+ +Y E Sbjct: 482 GVVGPKNLAETIQNLVSDFLKNELLIDLNKEKSQITHLTSKSLKYLGYEIFRRNRKYSES 541 Query: 343 RVVSTIPQEKARN 355 ++ R Sbjct: 542 QLTYNSKTNTYRK 554 >UniRef50_B0I1N8 Reverse transcriptase homolog n=2 Tax=Pylaiella littoralis RepID=B0I1N8_PYLLI Length = 796 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. 
Identities = 102/387 (26%), Positives = 175/387 (45%), Gaps = 39/387 (10%) Query: 11 TDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDE 70 + R L+++I + A + +++G GVD + L LQ + ++ Sbjct: 179 KNKDGRYGNLIQIIGSLSTIKLAYLMIKNNRGISAKGVDD---SSLDGISLRTLQAMSND 235 Query: 71 LLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 LSG + P RRVYI K LRPLGI + R +I+Q+++ M + I+E F S+G Sbjct: 236 TLSGRIKFSPVRRVYIKKEGKTDLRPLGISSPRQKIIQKSIEMVLTSIFEEIFLDCSHGS 295 Query: 130 RPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R RS H A++ ++L++ + + WV+EGD+ FD + H +MK +++++ + Sbjct: 296 RIGRSCHTALKNLQLKVGNV--STYSWVVEGDIKGCFDNIPHSQIMKRIKQKVDCLPTIN 353 Query: 190 LLWKTIKAGHI----------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH---- 235 L+ K + AG+I G QG ++SPL SNI+L+E D+++ Sbjct: 354 LVKKILDAGYILDEDLKKFGRKNAQVFKPDVGTTQGIILSPLFSNIVLHELDKFIEVILK 413 Query: 236 ERYLSGKARKDRWYWNNSIQRGRSTAVRENWQW-----------------KPAVAYCRYA 278 E + GK RK + + + + + + Y RY Sbjct: 414 EEFSKGKKRKANLEYRKLRYQIKKEDNLKKRRKLIEDCKSVPSKSIEDPDFKRLFYVRYV 473 Query: 279 DDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG-FIFLGHRLIRKRS 337 DD+V++V G+ + IR+ L L L LNM+KTKI + G FLG +++ Sbjct: 474 DDWVILVSGSLSDSRLIRDRVSRKLR-ELGLELNMEKTKITSLRKGKCRFLGIDFFIRKN 532 Query: 338 RYGEMRVVSTIPQEKARNFAASLTALL 364 + VS + + + T L Sbjct: 533 TNQHYKPVSLVKKNTNISIRQRFTPRL 559 >UniRef50_Q9G8T2 Orf762 n=2 Tax=Eukaryota RepID=Q9G8T2_RHDSA Length = 762 Score = 221 bits (562), Expect = 4e-56, Method: Composition-based stats. 
Identities = 95/377 (25%), Positives = 173/377 (45%), Gaps = 34/377 (9%) Query: 6 ATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPG-VDGVNKTMLQARLAVEL 64 D + +RL+ +I + L A S G G +D + L + Sbjct: 165 HRKFNKDKNFVNKRLINIIGDVQTLIVAYEFVKSKPGQMVKGSID----STLDDIDLAWI 220 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLR-PLGIPALRDRIVQRAMLMAMEPIWESDFH 123 + + + +G ++ +P+RR+Y+ K+ K R P+ RD++VQ+A+ + +EPI+E+ F Sbjct: 221 KSISKVIKAGKFKFIPSRRIYVSKTGCKERRPIMTGFPRDKLVQKAIQLVLEPIYENVFL 280 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 S+GFRP R H A++++K WVIE D++S F +V+H +L+ ++ RI Sbjct: 281 ENSHGFRPARGCHTALKSIKQGFH-----GVTWVIESDIASCFSSVNHEVLLSIIKERIK 335 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE-----RY 238 + + L+ +++G++D+G F + G+PQG +SPLL NI L++FD +++E Y Sbjct: 336 CVKTLALIRNLLESGYVDLGAFCKSKLGIPQGSSLSPLLCNIYLHKFDTFMYELKQRFVY 395 Query: 239 LSGKARKDRWYWNNSIQR-----------------GRSTAVRENWQWKPAVAYCRYADDF 281 S K + + ++ ++ + + Y RYADDF Sbjct: 396 TSSKDPRINPAYKRLQRQIQNTPGLVEKSKFIQELRKTPSKDLFDPKYRRLFYIRYADDF 455 Query: 282 VLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-GFIFLGHRLIRKRSRYG 340 + + G K I ++ + L LK+ L K ++ H+ FLG + Sbjct: 456 SIGITGQKKDAVEILDQAKIFLSEELKMDLKESKIRVVHLKKQSIFFLGTTIYGISCVEK 515 Query: 341 EMRVVSTIPQEKARNFA 357 MR V + + Sbjct: 516 PMRTVKHSNWKTSIKIR 532 >UniRef50_Q7YAJ3 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ3_CHAVU Length = 760 Score = 220 bits (560), Expect = 8e-56, Method: Composition-based stats. 
Identities = 104/351 (29%), Positives = 152/351 (43%), Gaps = 30/351 (8%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 L + P L + A +++ TPG D +L Sbjct: 287 LWIHSYKKPFRVYYDLGGFLRNIGIWIIAYKLSQ----KVTPGPDQTTI---DGTSLEKL 339 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 LRD ++ G ++ R IPK RPLG+ +DRIVQ + M +EPI+E F Sbjct: 340 LKLRDAIVKGEFEWGATR---IPKPGKNEKRPLGVSCFQDRIVQEVLRMILEPIYEPRFS 396 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDT-VHHRLLMKAVRRRI 182 T S+GFRP RS H A+ + +W IEG L + V+H L K +RR I Sbjct: 397 TYSHGFRPGRSAHTALNVIMGTFH-----GAQWYIEGSLEAEGPGAVNHGTLYKIIRRTI 451 Query: 183 SDARFMTLLWKTIKAG-HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 D R + L+ ++A H+ G A+ G + +SPLLSNI LNE D ++ Sbjct: 452 RDKRILKLIRSGLQAFFHMPHGEIEEATIGAARPFGLSPLLSNIYLNELDHFIETTIREY 511 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 + R+ A R N + + Y R+ADDF++ + G +++ +R E Sbjct: 512 NKAR------------RADASRLNPTNRRRMHYIRFADDFLVAISGPRSEAVKLRSELES 559 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEK 352 L L+L L+MDKT I HV+ G FLGH + R E Sbjct: 560 FLRDKLQLTLSMDKTHITHVSKGVPFLGHNIFRSMRWQPYGSKSRRFASES 610 >UniRef50_A4C8M3 RNA-directed DNA polymerase (Reverse transcriptase) n=12 Tax=Pseudoalteromonas tunicata D2 RepID=A4C8M3_9GAMM Length = 429 Score = 220 bits (560), Expect = 8e-56, Method: Composition-based stats. 
Identities = 97/370 (26%), Positives = 165/370 (44%), Gaps = 44/370 (11%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 + + P R Q L L+ + ++L ++ +K A G+DG+ Q +L + Sbjct: 8 ITFKSQRHPKHRFQNLYGLL-REDFLYQSWGQL--NKQA-AAGIDGITMPAYQQQLVGNI 63 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L D L ++ +RV+IPK+NGK RPLG+P + D++VQ+ + ++ IWE+DF Sbjct: 64 TRLSDALKHKRFRANDIKRVFIPKANGKQRPLGLPTVDDKLVQQGVSQILQSIWEADFLP 123 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYG+RP +S H A+ ++ L L +++E D+ +F+ + H LMK +++RI D Sbjct: 124 NSYGYRPNKSAHQALHSLALNLQF---KGYGYIVEADIKGFFNNLDHNWLMKMLKQRIDD 180 Query: 185 ARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 ++L+ + +KA G+F G PQGG+ISP+L+NI L+ D + ++ Sbjct: 181 KAMLSLISQWLKARIKSPEGVFEYPKSGTPQGGIISPVLANIYLHYALDLWFEKKVKP-- 238 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 + + RYADDFV + E E Sbjct: 239 ------------------------RMRGRAMLIRYADDFVCAFQ-YANDAERFYEVLPKR 273 Query: 303 LEGSLKLRLNMDKTKIPHV-------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARN 355 L+ L + +KT + F+FLG + G+ R+ EK R Sbjct: 274 LK-KFNLEVAEEKTSLLRFSRFHPSRKRQFVFLGFAFYWAKDAQGKPRLRRRTGAEKHRA 332 Query: 356 FAASLTALLW 365 + + Sbjct: 333 SMSEFYQYIK 342 >UniRef50_D0VMZ3 Putative reverse-transcriptase protein n=1 Tax=Volvox carteri f. nagariensis RepID=D0VMZ3_VOLCA Length = 611 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. 
Identities = 104/384 (27%), Positives = 171/384 (44%), Gaps = 37/384 (9%) Query: 1 MQRKLATWAATDPSLRIQRLL-RLITQPEW-LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 MQ ++ + L RLI + L ++T +KG TPGVD Sbjct: 31 MQHRIFKAKRNGKQKTVTGLQIRLINSLDAKLLSVLQVTALNKGRKTPGVDKQIVIA--- 87 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGK-LRPLGIPALRDRIVQRAMLMAMEPI 117 +L++ ++ L G P RRV++PK RPLGIP + DR Q +A+EP Sbjct: 88 -GPDKLRMAKELRLDGT--AKPIRRVWLPKPGKDEKRPLGIPTIEDRAKQNLAKLALEPE 144 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS H AI + L L + +++ + D+ FD + H L+ Sbjct: 145 WEAIFEPNSYGFRPGRSCHDAIEAIFLNLR---HKKTKFIYDADIRKCFDRIDHGALIAK 201 Query: 178 VRRRISDARFMTLLWKTIKAGHIDV------GLFRAASEGVPQGGVISPLLSNIMLNEFD 231 + + + ++ +KAG ++ ++ G PQGG+ISPLL+NI L+ + Sbjct: 202 LN---TFPQMERQIYAWLKAGIMEGYANAPKSYEPESNLGTPQGGIISPLLANIALHGLE 258 Query: 232 QYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQ 291 +L E + + + N S + R A RYADDFV+I + K Sbjct: 259 NHLKEFCATKVSNEIFQTRNRSQESKR-----------KACGVIRYADDFVIIHEN-KQV 306 Query: 292 VEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHR---LIRKRSRYGEMRVVSTI 348 +E E + L+ + L +N +K+K+ +GF FLG + + R + R+ T Sbjct: 307 IELCVTETKNWLQ-HIGLDINNEKSKLRDSREGFKFLGLQIILIRRGKMDGQGYRIKITP 365 Query: 349 PQEKARNFAASLTALLWKVRISGE 372 ++ + + ++ R Sbjct: 366 SKKNQSSLLEKVRKVIQNNRAISS 389 >UniRef50_Q47277 Orf protein n=63 Tax=cellular organisms RepID=Q47277_ECOLX Length = 416 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. 
Identities = 99/366 (27%), Positives = 149/366 (40%), Gaps = 57/366 (15%) Query: 25 TQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRV 84 + A R +S G G+D + RL L + + L SG Y P + V Sbjct: 9 IDKSLVVSAYRRVKTSAG--AAGIDKQSLADFDKRLVDNLYKIWNRLSSGSYFPPAVKAV 66 Query: 85 YIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 IPK G R LGIP + DRI Q + +A EP E F SYG+RP +S AI Sbjct: 67 AIPKKLGGERILGIPTVSDRIAQTVVKLAFEPQVEPHFLADSYGYRPNKSALDAI----- 121 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG-HIDVG 203 +T WV+E D+ FD + H L+MKAV + + + + A + G Sbjct: 122 GVTRKRCWYYDWVLEFDIKGLFDNIPHELIMKAVDKHNPARWVKLYIQRWLTAPMVMSDG 181 Query: 204 LFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAV 262 RA + G PQGGVISPLL+N+ ++ FD++L + Y Sbjct: 182 EVRARTMGTPQGGVISPLLANLFMHYVFDKWLAKYY------------------------ 217 Query: 263 RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN 322 P V + RYADD +L +A+ +RE R L ++ +KT++ + Sbjct: 218 -------PKVPWYRYADDGILHCHS-EAEATEMREVLRKRF-SECGLEMHPEKTRVIYCK 268 Query: 323 DG----------FIFLGHRLIRKRSRYGEMR-----VVSTIPQEKARNFAASLTALLWKV 367 DG F FLG+ R+ + + + + + A + Sbjct: 269 DGSRKGDYEHTMFDFLGYTFRRRVVKNVKRNSLFVSFTPAASKSALKAMRREIKATGIRK 328 Query: 368 RISGEI 373 R+ I Sbjct: 329 RVDLSI 334 >UniRef50_B3JM52 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JM52_9BACE Length = 431 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. 
Identities = 89/349 (25%), Positives = 155/349 (44%), Gaps = 45/349 (12%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 ++ + P L A +++KG+ GVDGV+ L+ + + L + + G+YQ Sbjct: 1 MITRVVHPFNLQRALEHVIANKGS--AGVDGVSIRELRKVFSEKKLQLIEAIKQGNYQVQ 58 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P + IPK NGK R LG+P +R++Q+A+ + P++E +F SYGFRP ++ A+ Sbjct: 59 PILGIEIPKGNGKTRLLGVPTTTERVLQQALAQTIAPLFEPEFSNYSYGFRPHKNARQAV 118 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + + +++ DL ++FD V H LL+ + +++ M L+ K ++A Sbjct: 119 GQSRDYIHS----GLNHIVDIDLKNFFDEVDHCLLLNLIYQKVKCKATMQLIRKWLRAPI 174 Query: 200 IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 G R +GVPQG +SPLLSNI+L++ D+ + Sbjct: 175 KINGKLRKRRKGVPQGSPLSPLLSNILLHQLDKEMTR----------------------- 211 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 + RYADDF + K Q +A R L+ LKL +N +K+ I Sbjct: 212 ----------RGHKFVRYADDFSIYCKS-HNQAKATRVVIEKFLKNKLKLTINEEKSGI- 259 Query: 320 HVNDGFIF--LGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 F LG + + + + + ++ + L + K Sbjct: 260 --RKPIHFTILGFGFVPTYEKGSKNQYQLIVSEKAWKKLKERLKTITRK 306 >UniRef50_A7BYN3 RNA-directed DNA polymerase n=2 Tax=Beggiatoa sp. PS RepID=A7BYN3_9GAMM Length = 497 Score = 217 bits (553), Expect = 5e-55, Method: Composition-based stats. 
Identities = 95/329 (28%), Positives = 154/329 (46%), Gaps = 32/329 (9%) Query: 42 GAHTPGVDGVNKTMLQARLAVELQILRDELL-SGHYQPLPARRVYIPKSNGKLRPLGIPA 100 G + G+DG+ +AR + +++ L +Y+ PA+R YIPK+NGKLRPLGIP Sbjct: 2 GKRSSGIDGLKYLTPKARERLAKRLMDWALKGWDNYKAKPAKRKYIPKANGKLRPLGIPT 61 Query: 101 LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEG 160 DR++Q + A+EP +E+ F + SYGFRP + H AI + +WV++ Sbjct: 62 QEDRVIQHVIKSALEPFYEAQFESNSYGFRPAQGCHDAIEAIF----KITSHEPKWVLDA 117 Query: 161 DLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISP 220 D+ FD + H L + + L+ + +KAG+ D G G PQGG+ISP Sbjct: 118 DIKGCFDNIDHNYLTECITHGQ-----KKLVKEWLKAGYTDDGHIHPTKNGTPQGGIISP 172 Query: 221 LLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADD 280 LL+NI L+ + L ++ G + + + + RYADD Sbjct: 173 LLANIALDGLETNLRQKLQIG--------------------IYQTQFNQSRLTVVRYADD 212 Query: 281 FVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYG 340 FV+I K K ++ + L+ L L+ +KTKI GF FLG + +++ Sbjct: 213 FVIIHKDKKV-IKESQLIISQWLKKR-GLELSPEKTKIVKTPQGFDFLGFNIKWCKNKAK 270 Query: 341 EMRVVSTIPQEKARNFAASLTALLWKVRI 369 I + K + + +T ++ Sbjct: 271 GHYKRHLIKEGKYKEYGIRITPSSKSLKK 299 >UniRef50_A6LY84 RNA-directed DNA polymerase (Reverse transcriptase) n=17 Tax=Firmicutes RepID=A6LY84_CLOB8 Length = 433 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 100/386 (25%), Positives = 165/386 (42%), Gaps = 51/386 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 K+A A + L I E L + + L G G+D V K L Sbjct: 7 KIAEIARQKTKEKFTSLYHYI-NKEMLLKCHKQLL---GNKATGIDDVTKQEYSKELDNN 62 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 ++ L +L + Y+P +RVYIPK +GK RPLGIP+ D++VQ A+ ++ I+E++F Sbjct: 63 IENLIVKLRNHSYKPQAVKRVYIPKGDGKTRPLGIPSYEDKLVQMALNKILQSIYEAEFK 122 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGFRP+R+ H AI+ + + + R +V++ D+ +F+ V+H ++K + RI Sbjct: 123 DFSYGFRPKRNCHSAIKALNKVIEN---GRINYVVDADIKGFFNNVNHEWMIKFLEVRIG 179 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D ++L+ K +KAG +D G+ + G PQG ++SP L+NI L+ D + + Sbjct: 180 DPNIISLVKKFLKAGLMDNGIIKTTEIGTPQGSIVSPTLANIYLHYSLDLWFEKVI---- 235 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 ++ RYADDFV +++ Sbjct: 236 ----------------------KRNFRGQSEITRYADDFV-CCFQYESEARQFCRLLVSR 272 Query: 303 LEGSLKLRLNMDKTKIPHVN---------------DGFIFLGHRLIRKRSRYGEMRVVST 347 L L + K+K+ + F FLG +S+ G RV Sbjct: 273 L-NKFNLEVERTKSKLILFGRFAEEIRKSRGFKNAETFDFLGFTHYCAKSKRGYFRVKRK 331 Query: 348 IPQEKARNFAASLTALLWKVRISGEI 373 ++K + + VR I Sbjct: 332 TSKKKFKAKIKDFNQWIKSVRNKLHI 357 >UniRef50_D2LU13 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LU13_BACS4 Length = 464 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 100/351 (28%), Positives = 152/351 (43%), Gaps = 46/351 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 ++R I + L A + K PG+DG++ L LA+E + L + G Y+ Sbjct: 42 TDIIRRIINQDNLQRAYKK--VKKNKGKPGIDGMSVDELLPYLALEDRNLILSIKDGSYR 99 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P P +RV I K +G R LGIP ++DR+VQ+ +L +E + F SYGFRP RS H Sbjct: 100 PQPVKRVEIKKPDGGKRKLGIPTVKDRLVQQMILQVIEKKIDPQFSDNSYGFRPNRSAHD 159 Query: 138 AIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 A+R K E R+V++ D+ YFDTV+ LM V + I D + L+ K +++ Sbjct: 160 AMRKAKQ----YYEEGFRYVVDIDMKQYFDTVNQDKLMHHVEQFIDDPTVLILIRKFLRS 215 Query: 198 GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 G + G PQGG +SP+L NI L++ D L Sbjct: 216 GISIDEEIEPSEVGTPQGGNLSPILGNIYLHQLDLELER--------------------- 254 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADD + VK KA ++ LE LKL +N DK++ Sbjct: 255 ------------RGHKFIRYADDCNIYVKSRKAGDRVLKS-ITKFLEEELKLTVNKDKSE 301 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR 368 + FLG + ++ G + + + L + R Sbjct: 302 VGRPTKR-KFLGFCIHSTKAGVGF-----RPHIKSKKRLEQKIRYLTSRKR 346 >UniRef50_A8KXN1 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Frankia sp. EAN1pec RepID=A8KXN1_FRASN Length = 417 Score = 216 bits (549), Expect = 1e-54, Method: Composition-based stats. 
Identities = 94/363 (25%), Positives = 146/363 (40%), Gaps = 59/363 (16%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYI 86 + + +A + G PG DGV +A + L +L + + SG Y P P V I Sbjct: 15 KQLVWDAWLKVKENGG--APGPDGVTVEQFEANVKDRLYVLWNRMSSGSYFPGPVGAVEI 72 Query: 87 PKSN--GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKL 144 PK G R LGIP + DR+ Q + +A+EP E FH SYG+RP RS A+ + Sbjct: 73 PKKGVKGGARTLGIPNVVDRVAQTVLKLALEPKVEPVFHRDSYGYRPGRSQRQALEVCRK 132 Query: 145 QLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VG 203 + WV++ D+ +FDTV L+KAV + + + +KA G Sbjct: 133 RCWS-----HDWVVDLDVRKFFDTVPWEKLLKAVAYHTDQKWVLMYVERCLKAPTKHADG 187 Query: 204 LFRAASEGVPQGGVISPLLSNIMLN-EFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAV 262 + + G QGG SPL +NI L+ D ++ + Sbjct: 188 TLQERTMGTVQGGPFSPLAANIYLHWGLDAWMAREF------------------------ 223 Query: 263 RENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN 322 P V + R+ADD V Q +R+ L + L + DKT+I + Sbjct: 224 -------PTVPFERWADDVVFHCVSL-EQAREVRDAVVARL-VEVGLEAHPDKTRIVYCK 274 Query: 323 D----------GFIFLGHRLIRKRSRYG--EMRVVSTIP---QEKARNFAASLTALLWKV 367 D F FL + + + G + R S IP ++ +F+ + L Sbjct: 275 DSNRGGDYENTSFTFLSYTFRPRVAWNGTQKKRFTSFIPGAAPDRVASFSREMRDLRLHR 334 Query: 368 RIS 370 R + Sbjct: 335 RTN 337 >UniRef50_C0JWS6 Putative reverse transcriptase and intron maturase n=1 Tax=Pycnococcus provasolii RepID=C0JWS6_9CHLO Length = 583 Score = 216 bits (549), Expect = 1e-54, Method: Composition-based stats. 
Identities = 97/382 (25%), Positives = 152/382 (39%), Gaps = 39/382 (10%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q ++ + +++ L + I + + R+ +++G + GVD + Sbjct: 21 LQVRIFKASQKQQQNKVRYLQKKILRSIDAKVLAVRRVAQTNRGKRSAGVDRKRVLTSEQ 80 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 +L L L + P RR K+ RPLGIP L DR VQ+ + A+EP Sbjct: 81 KLN-----LAQNLKLKG-KAKPIRRTCSTKAGKVDKRPLGIPTLEDRAVQQLVKFALEPE 134 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP R+ AI + L R +V++ D+ FD + H L+ Sbjct: 135 WEAKFEPNSYGFRPGRACQDAIMALFQHL----RGRSLYVLDADIKKCFDRIDHDKLLAK 190 Query: 178 VRRRISDARFMTLLWKTIKAGHIDV--------GLFRAASEGVPQGGVISPLLSNIMLNE 229 + + + +KAG I+ G PQGGVISPLL+NI L Sbjct: 191 LN---TFPLLENQIKVWLKAGVIEGYSNSYKNYNKVTPNLLGTPQGGVISPLLANIALTG 247 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 + L Y + + VA RY DDFV++ K + Sbjct: 248 LEDELKHYYANHLYKGSSRI--------------GLSDKLTQVAVIRYVDDFVVLHKD-E 292 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIP 349 + +R+ L ++ L L +KTKI GF FLG +I S + + I Sbjct: 293 NVIRQLRDHTAKWLYTTMGLELLPEKTKILDTKQGFTFLGFHIISIYSGENKYKCKIHIS 352 Query: 350 QEKARNFAASLTALLWKVRISG 371 E N + + R + Sbjct: 353 HESKNNLLSKTREIFRSNRSAS 374 >UniRef50_O99479 Reverse transcriptase homolog n=2 Tax=Eukaryota RepID=O99479_PAVLU Length = 636 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. 
Identities = 93/356 (26%), Positives = 154/356 (43%), Gaps = 50/356 (14%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 I + L T S + PG+DG K +L L+ EL+S YQP Sbjct: 52 YTEIYDLQSLRLGLEATKS---SAAPGLDGDRKANFS---EAKLVALQAELISQKYQPKT 105 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 +RV IPK NG +R LGI + RD+IVQ ++ A++ + F S+GFRP H A++ Sbjct: 106 TKRVAIPKPNGGIRYLGISSQRDKIVQASIQNALQSKYGKHFSPDSFGFRPGLGCHDALK 165 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 V+ + W+I D+ FDT++H +L++ +R + D + L+ K IK G++ Sbjct: 166 HVRNTWQNI-----TWIISIDIEKCFDTINHTILLQILRPLV-DQPTLELISKLIKVGYV 219 Query: 201 DVGL-----FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 ++ ++ G PQG +ISPLL N ++ D +L + + D + Q Sbjct: 220 EMFDTTCFPISESTIGTPQGSLISPLLCNFYMHILDTFLQKVLIPQWNVGDERSYVKGCQ 279 Query: 256 RGR--------------------------------STAVRENWQWKPAVAYCRYADDFVL 283 + + N + Y RYADD V+ Sbjct: 280 NRKAMDVNDKAIVEAYPELEGQIQRIKHNRWVTEGKGSRDPNDANFRRLRYVRYADDIVI 339 Query: 284 IVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSR 338 G ++ I ++ LE +L ++N +K+ I H +G FLG + ++ Sbjct: 340 GFTGPYSEALVILDQVVKFLEKTLCFKVNKEKSSINHSETNGIKFLGTFIKYLPNK 395 >UniRef50_B4WW73 Group II intron, maturase-specific domain family n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WW73_9SYNE Length = 479 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. 
Identities = 90/314 (28%), Positives = 150/314 (47%), Gaps = 46/314 (14%) Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 ++ P RRVYIPKSNGKLRPLGI + DR +Q + A+EP WE+ +YG+RP RS Sbjct: 21 WRVQPTRRVYIPKSNGKLRPLGISTIADRCLQMVVKTALEPEWEAKLEGSTYGYRPGRSC 80 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A++ + + + WV++ DL FD + L+ + R + L+ + + Sbjct: 81 HDAVKKL--YFMSLPKNKKHWVVDADLQGCFDNIDQSFLLAKLERFPA----RGLVEQWL 134 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 ++G+++ G ++ + G PQG ISPLL+NI L+ ++ L Y S Sbjct: 135 QSGYVEYGRWQPTTAGTPQGNCISPLLANIALHGMEEALGITYDS--------------- 179 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 R K +A +YADDFV++ + KA EA E+ + L L + +K Sbjct: 180 -------RGRINGKRGLA--KYADDFVVMCES-KADAEAAIEDLKPWLLER-GLSFSEEK 228 Query: 316 TKIPHVNDGFIFLGHRLI----RKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR--- 368 T++ H+ +GF FLG ++R G ++++ ++ ++ A L + Sbjct: 229 TQVVHLREGFDFLGFNFRHYATPGKTRTG-WKLLTKPSRKSIKSIKARLKREWLNLSGKP 287 Query: 369 ------ISGEILLG 376 I+ G Sbjct: 288 VKEVVMRLNPIIRG 301 >UniRef50_Q94Z25 Orf557 n=2 Tax=Pylaiella littoralis RepID=Q94Z25_PYLLI Length = 557 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 105/396 (26%), Positives = 175/396 (44%), Gaps = 45/396 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLR-LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQAR 59 +Q+K+ + + RL + L+ E A A R +S++G+ T G DG Q + Sbjct: 23 LQQKMVVAYKNNNWSEVFRLQQQLMHSFEGRATAVRKVVSNEGSKTKGPDGKTWKKSQDK 82 Query: 60 LAVELQILRDELL--SGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 + +RD LL SG Y+ RRV+IPKS+ G+LRPLGIP + DR +Q +L ++P Sbjct: 83 YRA-IADIRDHLLTKSGSYKAGAVRRVWIPKSSPGELRPLGIPNMIDRALQALVLSCLDP 141 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 I E + + SYGFR RS + AI+ ++ L G R W + D+S FD + H L K Sbjct: 142 IVEENSDSCSYGFRKYRSTNDAIQRIRFILDKAGAPRYIW--DADISKCFDNISHTFLNK 199 Query: 177 AVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHE 236 VR + + L+ +KA I+ G S G PQGGV+SPLL N+ LN + + Sbjct: 200 VVRENL-CRKGCELVEAWLKAPIIEKGSKSYPSRGTPQGGVLSPLLCNMTLNGLENVI-- 256 Query: 237 RYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVK-GTKAQVEAI 295 + G ++ + RYADDF++ G ++ Sbjct: 257 ------------------RDGLPSSSSTAGRKLKGRWVVRYADDFIITNPIGKSQFIDND 298 Query: 296 REECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMR-------VVST 347 + L ++ K++I + + F FLG ++ +++ + +V Sbjct: 299 IPLINKFMADR-GLEISEKKSRIIDLEKESFNFLGWKISQRKRNISVNKASDSRLVLVVE 357 Query: 348 IPQEKARNFAASLTALLWK-------VRISGEILLG 376 +E + + + ++ +L G Sbjct: 358 PTKESIKRLKSRIKLEFRSNKPIGALIKDLNPVLRG 393 >UniRef50_Q8GAR1 Reverse transcriptase n=20 Tax=Enterobacteriaceae RepID=Q8GAR1_ECOLX Length = 410 Score = 212 bits (541), Expect = 1e-53, Method: Composition-based stats. 
Identities = 93/356 (26%), Positives = 146/356 (41%), Gaps = 51/356 (14%) Query: 29 WLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPK 88 + + +KG PG DG M + L + + L SG + P P IPK Sbjct: 12 LVWASYLDVRRNKG--APGCDGQTLKMFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPK 69 Query: 89 SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTD 148 SNGK R LGIP + DRI Q A+ + ME + FH SYG+RP +S H A++ ++ Sbjct: 70 SNGKERILGIPTVSDRIAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCW- 128 Query: 149 CGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG--HIDVGLFR 206 R W++E D+S++FD V H L++KA+ + + ++A + G Sbjct: 129 ----RYSWILEVDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGELI 184 Query: 207 AASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVREN 265 + G PQGGVISPLL+N+ L+ FD ++ Y Sbjct: 185 TRTRGTPQGGVISPLLANLFLHYAFDLWMEREY--------------------------- 217 Query: 266 WQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN--- 322 V + RYADD V+ + ++ + L LN KT I +++ Sbjct: 218 ----RGVPFERYADDIVVHC-SRMSDATRLKNRLSERF-SEVGLVLNAGKTNIAYIDTFK 271 Query: 323 -----DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEI 373 F FLG+ + + + + + +T + K RI Sbjct: 272 RRNVATSFTFLGYDFKVRTLKNFKGELYRKCMPGASNAAMRKITETIKKWRIHRST 327 >UniRef50_Q7XXA4 OSJNBa0019G23.12 protein n=2 Tax=Oryza sativa RepID=Q7XXA4_ORYSJ Length = 1140 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. 
Identities = 61/382 (15%), Positives = 133/382 (34%), Gaps = 55/382 (14%) Query: 30 LAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE---LQILRDELLSGH---YQPLPARR 83 L E + PG DG Q V L L +E SG + Sbjct: 666 LREIKEAIFAMDHNKAPGPDGFPVEFYQKFWEVIKHDLLNLFNEFHSGSLPIFGLNFGVI 725 Query: 84 VYIPK-----SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 IPK + RP+ + + +I + + + ++ + F R++ Sbjct: 726 TLIPKVEEANRIQQYRPICLLNVSYKIFTKVATNRISSVADNLVNPTQTAFMRGRNILDG 785 Query: 139 IRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKA 197 + + + + + VI + D +D V L++ +R + ++++ + I Sbjct: 786 VAIIHETVHELHRKKLNGVIFKIDFEKAYDKVKWPFLLQTLRMKGFSPKWISWIKSFIVG 845 Query: 198 GHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWN 251 G + DVG F +G+ QG +SP+L N +++ ++ G+ + Sbjct: 846 GSVAVKVNDDVGPFFQTKKGLQQGDPLSPILFNFIVDMLATLINRAKTQGQVDGLIPHL- 904 Query: 252 NSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRL 311 ++ +Y DD VL + + + ++ E L++ Sbjct: 905 ----------------IDGGLSILQYVDDIVLFMNHDLEKAQNMKSVLLAF-EQLSGLKI 947 Query: 312 NMDKTKIPHVND-------------------GFIFLGHRLIRKRSRYGEMRVVSTIPQEK 352 N K+++ + F +LG + ++ R E + V ++K Sbjct: 948 NFHKSELYCFGEALEYRDQYAQLFGCQVGNFPFRYLGIPIHCRKLRNAEWKEVVERFEKK 1007 Query: 353 ARNFAASLTALLWKVRISGEIL 374 ++ L +L ++ + +L Sbjct: 1008 LSSWKGKLLSLGGRLTLINSVL 1029 >UniRef50_A7UDN1 Putative reverse transcriptase n=2 Tax=Candida zemplinina RepID=A7UDN1_CANZE Length = 445 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. 
Identities = 94/386 (24%), Positives = 163/386 (42%), Gaps = 44/386 (11%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAE-AARITLSSKGAHTPGVDGVNKTMLQAR 59 MQ ++ +++ L R+ + + E A SS G+ TPGVD Sbjct: 12 MQSRIIAAVKDQNWTKVRDLQRMTVRSGYARELAVDTIASSPGSKTPGVDNFIIK----N 67 Query: 60 LAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWE 119 + ++++ Y P P +R+YIPK+NGKLRP GIP + DR +Q A +PI E Sbjct: 68 EMDKAKMIKVTGKIEQYNPKPVKRIYIPKANGKLRPTGIPTMADRAMQCTFSFATQPIAE 127 Query: 120 SDFHTLSYGFRPERSVHHAIRTVK--LQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 + S+GFRP RS A + + W ++ D+ +FD + H ++ Sbjct: 128 TLGDQHSFGFRPNRSTIDAFNHLYRAQFIKSSNAPVNNWAVDADIKGFFDNISHEWILNN 187 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + +I + V F + GVPQGGV+SP+++N++ + +Q++++ Sbjct: 188 I--KIEPRMLAKFTKAGFIEYNNQVNEFHDTNTGVPQGGVMSPMMANMVTDGLEQHIYD- 244 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 K R +N V + RYADDFV+ + + + Sbjct: 245 -----GTKARNIYNG---------------MTKRVHFIRYADDFVI-ITPYEWVAQRTMP 283 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV-NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 L+ L LNMDKT I + D FLG+ ++ + + ++ EK + F Sbjct: 284 IVNSFLKERS-LSLNMDKTHILDISKDNLDFLGYT-----TKRVDGKSMTMPNPEKVKLF 337 Query: 357 AASLTALLWKVRI------SGEILLG 376 ++ + K + ++ G Sbjct: 338 TKNIRTKIKKCNSREDMMATNSVIRG 363 >UniRef50_Q6TFE1 Putative group II intron-encoded maturase n=1 Tax=Caedibacter taeniospiralis RepID=Q6TFE1_CAETA Length = 341 Score = 211 bits (536), Expect = 5e-53, Method: Composition-based stats. 
Identities = 92/354 (25%), Positives = 161/354 (45%), Gaps = 50/354 (14%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 +A A + S L L+ E+L+ +K G+D V+ +A ++ Sbjct: 16 IAECARGNRSFEFTSLAHLL-DAEFLSYCYYGLDRNK---AVGIDKVSWQEYGVDVADKI 71 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 + L L ++P+P+RRVYIPK NG+ RPLGI A+ ++IV+ +++ ++ I+E DF Sbjct: 72 ENLVMRLKRKTFKPMPSRRVYIPKGNGESRPLGISAIENKIVESGIMLILQSIYEQDFLE 131 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP R+ H A+ V + ++E D+ +FD V H L V+ R+ D Sbjct: 132 CSYGFRPGRNTHQALNEVDKAIMT---QPVNHLVEADIKGFFDNVSHEKLKDFVKIRVKD 188 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGKA 243 + L+ ++AG+ID G+ +G PQG ++SP+L+NI L+ D++ + Sbjct: 189 TSLLHLIDCFLRAGYIDKGVLIDTEKGTPQGSILSPMLANIFLHYVLDKWFED------- 241 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 V+++ + + RYADDFV +++ + I Sbjct: 242 -----------------TVKQHVEGFCRL--VRYADDFVCLIQ-YQEDARKIERALANRF 281 Query: 304 EGSLKLRLNMDKTKIPHVND--------------GFIFLGHRLIRKRSRYGEMR 343 +L+L+ +K++ F FLG ++R G + Sbjct: 282 -NKHELQLHPEKSRNISFGRFEKLNALNAGRKANTFDFLGFTHFCDKTRKGYFQ 334 >UniRef50_Q7GEU5 Putative uncharacterized protein (Fragment) n=3 Tax=Candida parapsilosis RepID=Q7GEU5_CANPA Length = 933 Score = 210 bits (535), Expect = 6e-53, Method: Composition-based stats. 
Identities = 100/410 (24%), Positives = 168/410 (40%), Gaps = 61/410 (14%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS-- 73 ++R + L+ + A S+G+ T GVD V + L + + L + Sbjct: 319 VMKRQMMLVNSIIFRLHAVDKLSHSRGSLTSGVDNVCIEGVDKD-RALLVEIVEWLGTTV 377 Query: 74 ---GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 Y+ P +RV+IPK N KLRP+GIP L+DR +Q + + +EP+ E +YGFR Sbjct: 378 KHPKLYKSDPVKRVWIPKGNEKLRPIGIPTLKDRGLQYLINLVVEPLVEMTSDPHNYGFR 437 Query: 131 PERSVHHAIRTVKLQLTDCGETRG---------------------RWVIEGDLSSYFDTV 169 P RS +AI ++ L ++ +W+++ D+ +FD + Sbjct: 438 PYRSTKNAIAYLRSHLHTIDSSKKGNHFTTASNVENNLLRLLPENKWILDADIKGFFDNI 497 Query: 170 HHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE 229 +H L+ + + + ++ +K+G ID +F+ G PQGGVISP L N LN Sbjct: 498 NHDWLLNNLTLH---PKLLLIIKAWLKSGVIDGKIFQLTESGTPQGGVISPTLVNFTLNG 554 Query: 230 FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTK 289 ++ + E K++ + +AY RYADDFV++V+ Sbjct: 555 LEKVVMEALYPLTKSKEQRIRIKLKDGTYTCVASS-------LAYVRYADDFVVLVRSKH 607 Query: 290 AQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG---FIFLGHRLIR-----------K 335 + I+ L L LN +KTK+ ++D FLG+ Sbjct: 608 IMLTFIKPAIEKFLAER-GLNLNEEKTKLFRLSDPGCQLDFLGYTFKYQDKWSIKRHVFY 666 Query: 336 RSRYGEMRVVSTIPQEKARNFAASLTALLWK---------VRISGEILLG 376 + G + + K +F L + K + IL G Sbjct: 667 HNHAGSRGIALYPNKTKVLSFIEKLKLIFKKSMNLDAYNLITKLNPILRG 716 >UniRef50_A1BI39 CRISPR-associated protein Cas1 n=5 Tax=Chlorobiaceae RepID=A1BI39_CHLPD Length = 731 Score = 210 bits (534), Expect = 7e-53, Method: Composition-based stats. 
Identities = 84/316 (26%), Positives = 141/316 (44%), Gaps = 42/316 (13%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L + PE + +A S+ G PG D + RL L+ L LL+G Y+ Sbjct: 4 LYNQMAMPETIFQAWYKVASNDGR--PGWDNTSIQDYSLRLEENLKSLSHALLTGTYRQS 61 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P ++ + K +GK R L IP + DR+ Q A + + PI E++ ++ +RP S A Sbjct: 62 PLLKLVMLKPDGKERVLLIPGVIDRVAQTAASIVLSPIIEAELGNCTFAYRPGISREGAA 121 Query: 140 RTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 R + +WV++ D+ ++FD V H LL + + + D ++LL + + A Sbjct: 122 REI----DRLHREGYQWVLDADIRNFFDNVRHDLLFQRLVELVDDKEMISLLHRWLTAEI 177 Query: 200 IDVGLFRAASE-GVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 +D R + G+PQG ISP L+N+ L+ FD+ + ++ Sbjct: 178 VDGLNPRTRNTMGLPQGCPISPALANLYLDRFDETMEQQ--------------------- 216 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 R+ADD++++ K A++ LKL L+ DKT+I Sbjct: 217 ------------GFKLVRFADDYLVLCKTRPKAEAALK--LSESALAELKLELHSDKTRI 262 Query: 319 PHVNDGFIFLGHRLIR 334 +GF +LG+ IR Sbjct: 263 TTFAEGFKYLGYLFIR 278 >UniRef50_UPI00016C4F75 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4F75 Length = 257 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. 
Identities = 74/202 (36%), Positives = 110/202 (54%), Gaps = 6/202 (2%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 RL+ + Q L +A ++KG PG+DG+ +A Q L LL G Y+P Sbjct: 56 RLMEEVCQRGNLNQAYSRVKANKG--APGIDGMTVEDSLRWIAEHKQELLSSLLDGSYRP 113 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 P R V IPK G R LGIP + DR+VQ+A+L + + + F SYGFRP +S H A Sbjct: 114 SPVRGVLIPKPGGGERQLGIPTVVDRLVQQAILQVLTRLLDPTFSESSYGFRPGKSAHQA 173 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + K + D V++ DL +FD V+H +LM + RR+SD R + ++ + ++AG Sbjct: 174 LLKAKEYVAD----GRAIVVDVDLEKFFDRVNHDILMARLARRVSDTRLLRIVRRFLEAG 229 Query: 199 HIDVGLFRAASEGVPQGGVISP 220 + G+ A EG PQGG++ P Sbjct: 230 LMQDGVCVARHEGTPQGGIVDP 251 >UniRef50_B8R160 Reverse transcriptase n=2 Tax=Volvox carteri RepID=B8R160_VOLCA Length = 607 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 97/379 (25%), Positives = 169/379 (44%), Gaps = 33/379 (8%) Query: 1 MQRKLATWAATDPSLRIQRLLRLIT-QPEWLAEAARITLS-SKGAHTPGVDGVNKTMLQA 58 +Q+++ + + R+ L +L+ P A +I + +KG +T G+DG +A Sbjct: 41 IQKRIFKASLAGDTKRVWFLQKLLLRNPHAKLIAVQIVTTLNKGKNTAGIDG-----YKA 95 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNG-KLRPLGIPALRDRIVQRAMLMAMEPI 117 + E +L +L + RR +IPK + +PLGI ++DR +Q +A+EP Sbjct: 96 TTSEEKLLLAKKLQING-KANLVRRTWIPKPGKTEKQPLGIYTIQDRALQALCKLALEPE 154 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP R AI + L ++V + D+ FDT+ H L+ Sbjct: 155 WEAKFEPNSYGFRPGRRAQDAIEAIFQNL---HHDADKYVFDADIRKCFDTIDHAALLSK 211 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVG----LFRAASEGVPQGGVISPLLSNIMLNEFDQY 233 ++ + + +KAG D G PQGG+ISPLL+NI L+ +++ Sbjct: 212 LK---TFPLMEKQISAWLKAGIFDQYANTPKVSTPEMGTPQGGIISPLLANIALHGLEEH 268 Query: 234 LHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVE 293 L + K A R + A+ RYADDFV+I + ++ Sbjct: 269 LLNMVSRKEFPKPHP-----------KAARGAKAKRAALGIIRYADDFVIIHRNL-DIMK 316 Query: 294 
AIREECRGVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSR-YGEMRVVSTIPQEK 352 + E + L + L ++ +K+ + + F FLG ++ R + + RV T +E Sbjct: 317 TVITETKTWL-AQMGLAISEEKSALRLASKSFKFLGFQVAYVRDKIQNKYRVRITPSREN 375 Query: 353 ARNFAASLTALLWKVRISG 371 + + ++ K + S Sbjct: 376 VKLIISKTRNIIQKNKASS 394 >UniRef50_B5W904 Group II intron maturase-specific domain protein n=1 Tax=Arthrospira maxima CS-328 RepID=B5W904_SPIMA Length = 502 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 85/406 (20%), Positives = 158/406 (38%), Gaps = 68/406 (16%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQ--PEWLAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ + ++RL RL+T + + + GVDGVN+ Q Sbjct: 25 LQKRIYRASERGDVKAVRRLQRLLTNAMDAKILAVRELIQDNGTEKRAGVDGVNRLRNQE 84 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRP-LGIPALRDRIVQRAMLMAMEPI 117 +L L + L G + R+V IP+ + +P GI + ++ Q + +A+EP Sbjct: 85 KLD-----LANCLKLGK-KTQTRRQVSIPEPGKEEKPAFGILMMMEKAKQGLVKLALEPE 138 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS AI + + + + +V++ + + ++H+ L+ Sbjct: 139 WEAKFDRNSYGFRPGRSAQDAIAAIFNGIKEDHK----YVLDAHIEKCCEGIYHQKLLAK 194 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + R + +K+G +D PQGG++ PLL+NI L+ + L + Sbjct: 195 LNTY---PRLRRPIKAWLKSGVMDGKELFPTETDTPQGGLM-PLLANIALDGLESLLEDT 250 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + G A RYADD V++ + + A +E Sbjct: 251 FKGGVANCQN----------------------GKATVVRYADDLVVLDEELAVILTA-QE 287 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV------NDGFIFLGHRLIRKRSRYGE---------- 341 L G + L+L T+I H N G+ FLG+ + + + Sbjct: 288 TIEAWL-GEMGLKLKDSNTRITHTFTEHNGNLGWDFLGYNIRQYPTSKKRSLATGNTEQK 346 Query: 342 --MRVVSTIPQEKARNFAASLTALLWKVRIS---------GEILLG 376 + + +E + + ++ + S I+ G Sbjct: 347 IGCQTIIKPSKEAIKRHLQKIDEIIGSHKNSSQEQLINALNPIIRG 392 >UniRef50_D2FQY0 Regulatory protein GntR n=2 Tax=Staphylococcus aureus subsp. 
aureus RepID=D2FQY0_STAAU Length = 431 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 106/368 (28%), Positives = 163/368 (44%), Gaps = 56/368 (15%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 ++ L+ + + +A + +KG PG+DG+ + +Q A ++ +LL G Y+P Sbjct: 10 SMMELVVREHNIQKAIKKVKKNKG--APGIDGMKVSEIQGHFAQYFPEIKQKLLEGTYKP 67 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 ++V IPK+NGK R LGIP +RDR++Q+A+ +EP + F S+GFRP RS A Sbjct: 68 QAVKKVEIPKANGKKRVLGIPVVRDRVIQQAIKQVIEPSIDRTFSKHSHGFRPNRSTGTA 127 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 ++ E ++ DL FD ++H LM R I D T + ++++ G Sbjct: 128 LKE----CASYYEAGYTIAVDCDLKQCFDNINHDKLMYLFERHIKDKAVSTFIRRSLQVG 183 Query: 199 HID-VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRG 257 ID G G PQGGVISPLL NI L+E D+ L +R+ Sbjct: 184 AIDLSGEVAERKIGAPQGGVISPLLCNIYLHELDKELEKRHH------------------ 225 Query: 258 RSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTK 317 + RYADDFV+ VK TK E + + + + +LKL +N DK+K Sbjct: 226 ---------------RFVRYADDFVIFVK-TKRAGERVMDSIKTFIHKTLKLEVNNDKSK 269 Query: 318 IPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVR--------- 368 + FL + + Y E RN +L L + R Sbjct: 270 VGSPTR-LKFLSCLMSKVNGTYRF-----RPTTEAKRNLKRNLKWLTRRSRPGSFQDIIT 323 Query: 369 ISGEILLG 376 + G Sbjct: 324 EINCVTRG 331 >UniRef50_A6YE98 Putative reverse transcriptase and intron maturase n=1 Tax=Chlorokybus atmophyticus RepID=A6YE98_CHLAT Length = 845 Score = 207 bits (526), Expect = 6e-52, Method: Composition-based stats. 
Identities = 91/370 (24%), Positives = 154/370 (41%), Gaps = 48/370 (12%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHT---PG--VDGVNKTMLQ 57 L +++ P + L +L+ A + + G+ T G +DG + Sbjct: 285 HTLWKLSSS-PHFQPSGLWKLVRDINLWIAAYKKLAPNPGSLTKSGAGGKIDGTSLKS-- 341 Query: 58 ARLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEP 116 L+ LRD + G +Q + +VY K G PL IP +DR+VQ + +E Sbjct: 342 ------LEWLRDRVSEGKFQFGRSEKVYTLKPKVGNGIPLDIPEFQDRLVQEVVRTILEV 395 Query: 117 IWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMK 176 ++E F S+GFRP +S H A+ V+ + W I+GD+S FDT+ + L+ Sbjct: 396 LYEPQFLESSHGFRPNKSQHTAMVDVRQKF-----KGVVWCIKGDISKSFDTIDKKKLIT 450 Query: 177 AVRRRISDARFMTLLWKTIKAGH-IDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLH 235 +R+++ D +F L++K +K+ + G+ +G+ Q G+ SPLL NI L++ D ++ Sbjct: 451 QMRKKVKDEKFCHLIYKGLKSRLLMPEGIMEVLKKGISQKGICSPLLCNIALHQLDLFIE 510 Query: 236 ERYLSGKARKDRWYWNNSIQRG--------------------------RSTAVRENWQWK 269 + Q + Sbjct: 511 RLKKIVNKVDSSHIVSQPYQSQMVPQRERAAIGTGDWRGAIKAIKMARKMGYGDHQDPNL 570 Query: 270 PAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVN-DGFIFL 328 + Y RYADDF++ V G K E I E L+ L L LN +K I ++ FL Sbjct: 571 RRLTYVRYADDFLIGVTGPKKLAERIGELVSRFLKIRLNLTLNQEKIVISKLSGKKIPFL 630 Query: 329 GHRLIRKRSR 338 G ++ + + Sbjct: 631 GFQIYQPPLK 640 >UniRef50_A8LGE6 RNA-directed DNA polymerase n=1 Tax=Frankia sp. EAN1pec RepID=A8LGE6_FRASN Length = 351 Score = 207 bits (526), Expect = 7e-52, Method: Composition-based stats. 
Identities = 84/288 (29%), Positives = 130/288 (45%), Gaps = 39/288 (13%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPG--VDGVNKTMLQARLAVELQILRDELLSGH 75 +R+ R + A S+KGA TPG VDG++ + + D + Sbjct: 42 ERVYRQLFNAALYLVAYGRLYSNKGAMTPGETVDGMSLATID--------RIIDAMRHER 93 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y+ P +RV+IPK NGK RPLG+P D++V + + +E +E F S+GFRP R+ Sbjct: 94 YRWKPVKRVHIPKKNGKKRPLGLPTWSDKLVAEVVRLLLEAYYEPTFSDHSHGFRPGRAC 153 Query: 136 HHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI 195 H A+ V W IEGD++ F+ + H++++ V RI D RF+ LL + Sbjct: 154 HTALGEVVDV-----WKGTHWFIEGDIARCFEELDHQVMLDTVGERIHDNRFLGLLKAML 208 Query: 196 KAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 +AG+++ + A G QGG SP+LSNI L+ D ++ L R +R N + Q Sbjct: 209 RAGYLEDWKWGATLSGTVQGGPASPILSNIYLDRLDSFVVTHLLPDYNRGERRASNPAYQ 268 Query: 256 RGRSTAVR------------------------ENWQWKPAVAYCRYAD 279 + R + + Y RYAD Sbjct: 269 KIEYAIARARRHGDRPALRRLRQQRRQLPSQDPHDPSYRRLRYVRYAD 316 >UniRef50_Q8YWX6 Alr1468 protein n=4 Tax=Cyanobacteria RepID=Q8YWX6_ANASP Length = 668 Score = 207 bits (526), Expect = 7e-52, Method: Composition-based stats. 
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 41/309 (13%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR 83 + E L A + G+ + GVDG++ + ++ +LQ + +L Y PA+ Sbjct: 1 MFTIEHLNFAWLQVRA--GSKSAGVDGISVDLFESMATEQLQNIAYQLKEETYTANPAKG 58 Query: 84 VYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVK 143 YIPK NG R +GI +RDRI+QR +L + E F SY +RP S+ A++ Sbjct: 59 FYIPKKNGTKRLIGIHTVRDRIIQRLLLDELYFPLEDTFLDCSYAYRPGHSIQQAVQ--- 115 Query: 144 LQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVG 203 L + + +W+I+ D++ +FD + LL+ + + + LL + +K+G I G Sbjct: 116 -HLYGYYQYQPKWIIKADVADFFDNLSWALLLTYLEELSLEPSLLQLLEQQLKSGIIIAG 174 Query: 204 LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVR 263 +R +GV QGG++S L+N+ L FD+ + Sbjct: 175 QYRNFGKGVLQGGILSGALANLYLTSFDRKCLSQ-------------------------- 208 Query: 264 ENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND 323 + RY DDFV+ + I ++ G L G + L L +KT+I ND Sbjct: 209 -------GINLVRYGDDFVIACNS-WLEANRILDKITGWL-GEVYLTLQPEKTQIFTPND 259 Query: 324 GFIFLGHRL 332 F FLG+R Sbjct: 260 EFTFLGYRF 268 >UniRef50_D2LF37 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LF37_RHOVA Length = 293 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. 
Identities = 87/314 (27%), Positives = 136/314 (43%), Gaps = 48/314 (15%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 L + P L +A +KG PG DGV + VEL+ LR E L+G Y+P Sbjct: 22 LEKVVAPACLQQAWTRVRKNKGG--PGGDGVTIEIFAQNAEVELEKLRAETLAGIYRPRK 79 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 R +PK G R L IP++ DRI+Q A ++++ + F + S+ +R R V A+ Sbjct: 80 VRHAIVPKPKGGERKLTIPSVVDRILQTATMLSLGQTVDHHFSSASWAYREGRGVDDALA 139 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 L + W + D+ YFD + H+ L+ + + D R + L+ +++ Sbjct: 140 ----DLRRLRNSGLFWTFDADIMQYFDRILHKRLIDDLFIWVDDLRIVRLIQLWLRS--- 192 Query: 201 DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRST 260 F G+ QG ISPLL+N+ L+ D+ L Sbjct: 193 ----FSYWGRGIAQGAPISPLLANLFLHPMDRLLE------------------------- 223 Query: 261 AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH 320 +A RYADDFV++ + KA + + L L+LNM KT+I Sbjct: 224 --------LEGLASVRYADDFVVLCRS-KALAQKAQLIVASHLAAR-GLKLNMSKTRILA 273 Query: 321 VNDGFIFLGHRLIR 334 ++ FIFLG + Sbjct: 274 PSEAFIFLGQTVEP 287 >UniRef50_UPI0001C388AF RNA-directed DNA polymerase n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C388AF Length = 498 Score = 206 bits (525), Expect = 8e-52, Method: Composition-based stats. 
Identities = 86/406 (21%), Positives = 157/406 (38%), Gaps = 68/406 (16%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q+++ + +++L RL+T + + + T GVDGV + Sbjct: 25 LQKRIYRASKRGDVKAVRKLQRLLTNSRDAKILAVREVIPENGSQKTAGVDGV-----KR 79 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSN-GKLRPLGIPALRDRIVQRAMLMAMEPI 117 E L + L G + RRV IP+ + R +GI + ++ Q + +A+EP Sbjct: 80 LRNQEKLDLANCLKLGR-KTQGLRRVSIPEPGRDEKRAVGILMMMEKAKQGLVKLALEPE 138 Query: 118 WESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 WE+ F SYGFRP RS AI + + + + +V++ + F+ ++H+ L+ Sbjct: 139 WEARFDRNSYGFRPRRSAQDAIAAIFNGMKEDHK----YVLDAHIEKCFEGIYHQKLLAK 194 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + +K+G +D PQGG++ PLL+NI L+ + L ++ Sbjct: 195 LNTY---PTLRREIKAWLKSGVMDGKELFPTETDTPQGGLM-PLLANIALDGLESLLEDK 250 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 + G A RYADD V++ G + A +E Sbjct: 251 FQGGVANC----------------------GNGKATVVRYADDLVVL-DGELEVILAAKE 287 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV------NDGFIFLGHRLIRKRSRYGEMRVVSTIPQE 351 L + L+L T+I H N G+ FLG+ + + +R R Q+ Sbjct: 288 TIEAWLM-EMGLKLKDGNTRISHTFIEHEGNIGWDFLGYNIRQYPTREKRSRATGNTEQK 346 Query: 352 K------------ARNFAASLTALLWKVRIS---------GEILLG 376 + + + ++ + S I+ G Sbjct: 347 RGFQTIIKPSQGAIKRHLQKVDEIIRSHKNSSQEQLINALNPIIRG 392 >UniRef50_B9K440 18S rRNA intron 1 protein n=1 Tax=Agrobacterium vitis S4 RepID=B9K440_AGRVS Length = 257 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. 
Identities = 148/220 (67%), Positives = 173/220 (78%), Gaps = 1/220 (0%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 MQ KLATWA +DP+ R RLLRLI EWLAE AR+ L+S GA TPG+DG++K LQ +L Sbjct: 6 MQHKLATWAESDPNRRFDRLLRLIANREWLAETARMVLASSGARTPGIDGMDKQRLQVKL 65 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 L LR LL Y+P P +R+YIPK NGKLRPL IP L DRIVQRAMLMAM PIWES Sbjct: 66 DQHLDDLRTSLLEESYRPQPVKRIYIPKPNGKLRPLDIPTLTDRIVQRAMLMAMGPIWES 125 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTD-CGETRGRWVIEGDLSSYFDTVHHRLLMKAVR 179 DFH LSYGFR ER+VHHA+RTV++QL D TRGRW+IEGDL+SYFDTVHHRLL++ VR Sbjct: 126 DFHRLSYGFRSERNVHHAVRTVRIQLQDGADTTRGRWIIEGDLASYFDTVHHRLLLRCVR 185 Query: 180 RRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS 219 RR+ D RF+ LLW+ +KAGHID GLF A+SEGVPQGG+ S Sbjct: 186 RRVQDGRFVDLLWRFLKAGHIDRGLFTASSEGVPQGGLWS 225 >UniRef50_D1RME6 Reverse transcriptase family protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RME6_LEGLO Length = 444 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. 
Identities = 97/381 (25%), Positives = 154/381 (40%), Gaps = 52/381 (13%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 ++ A + + E L E + G+ G+DGV K + +L L Sbjct: 18 ISKRARLQKDTVFNNIGHAL-DTELLRECYQEL---DGSKAIGIDGVTKEVYGKKLEDNL 73 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 Q L + Y P +R V IPK +G RPL I D+IVQ A+ + I+E F Sbjct: 74 QDLLARIRRHAYTPQASRLVEIPKEDGSTRPLAISCFEDKIVQMAVTKLLTAIYEPLFLP 133 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYG+R ++ H A+R + + E R +E DL YF+T+ H L++ + ++I+D Sbjct: 134 CSYGYREGKNGHEALRAL---MKYSNEFRKGATLEIDLRKYFNTIPHGKLLEILEKKITD 190 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLN-EFDQYLHERYLSGKA 243 RF+ L+ K I++ + G G PQG +ISP+LSNI L+ D + E S Sbjct: 191 RRFLKLIRKLIRSPVVANGKAELNELGCPQGSIISPILSNIYLHSVVDSWFDEISKSHLI 250 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 A R+ADD V + + + E + L Sbjct: 251 --------------------------GKTAMVRFADDMVFLFQRS-EDAEKFYKVLPKRL 283 Query: 304 EGSLKLRLNMDKTKIPHVN--------------DGFIFLGHRLIRKRSRYGEMRVVSTIP 349 E L+L++DK+ + + FLG +S G+ + Sbjct: 284 E-KYGLQLHVDKSSLLKSGSKEAEEADTRGERLQTYKFLGFTCYWGKSLDGKSWRLKFKS 342 Query: 350 QEKARNFAASLTALLWKVRIS 370 + F A L L ++ S Sbjct: 343 RSD--RFTAKLRGLREYLKKS 361 >UniRef50_C3EEI5 Group II intron reverse transcriptase/maturase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EEI5_BACTU Length = 539 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. 
Identities = 93/302 (30%), Positives = 159/302 (52%), Gaps = 18/302 (5%) Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P RRV+IPK NG RPLGIP + DRI Q+ +L ++PI E+ FH SYGFR RS Sbjct: 24 WYEPQTVRRVFIPKPNGDQRPLGIPTIWDRIFQQCILQVLDPICEAKFHKHSYGFRSNRS 83 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRR-ISDARFMTLLWK 193 HHA+ K + G ++ I D+ +FD V+H L+K + I D + +++L + Sbjct: 84 THHALGRFKNLINIAGFSQ---CIAIDIKGFFDNVNHGKLLKQIWSLGIRDKKLLSILSR 140 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 +K+ G+ +G PQGG++SPLLSNI LNE D ++ ++ + +++ + ++ Sbjct: 141 LLKSNIGGEGIL---EKGTPQGGILSPLLSNIALNELDWWVSNQWETFRSKYKYFMTSSM 197 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 + + T ++E RYADDF ++ + T++Q + + L+ LKL +NM Sbjct: 198 YKALKKTKLKE-------CYIIRYADDFKILCR-TRSQANRMFLAVKQFLKERLKLDINM 249 Query: 314 DKTKIPHVNDG-FIFLGHRLIRKRSRYGEMRVVST--IPQEKARNFAASLTALLWKVRIS 370 +K+KI + FLG R+ + + V+ + + +N + + K++ Sbjct: 250 EKSKIISLKRKETEFLGFRIKFIKRGGTKHGYVAHSNMTDKAFKNAHFKIRESIKKIQKR 309 Query: 371 GE 372 Sbjct: 310 AC 311 >UniRef50_A5B8L7 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B8L7_VITVI Length = 1832 Score = 204 bits (518), Expect = 6e-51, Method: Composition-based stats. 
Identities = 49/342 (14%), Positives = 122/342 (35%), Gaps = 39/342 (11%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPA 81 ++ +P E + G PG DG Q+ + + + + Sbjct: 1294 EILERPFTEEEIHGALMEMNGDKAPGPDGFTLAFWQSCWEFIKEKIIEMFKEFYDHSSFL 1353 Query: 82 RR------VYIPKS-----NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + V IPK G RP+ + +++ + + ++ + F Sbjct: 1354 KSLNNTFLVLIPKKCGAEDLGDFRPISLLGGLYKLLAKVLANRLKRVVGKVVSNSQNAFV 1413 Query: 131 PERSVHHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R + A + + + + + + D+ +D+++ + L+K +++ ++++ Sbjct: 1414 RGRQILDASLIANEVIDSWQKRKEKGLICKLDIEKAYDSINWKFLLKVLQKMGFGSKWVG 1473 Query: 190 LLWKTIKA---GHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 +W + + + G+ F + +G+ QG +SP L + + D + G Sbjct: 1474 WMWSCLSSAKFSVMVNGVPAGFFPSXKGLRQGDPLSPYLFVMGMEVLDVLIRRAVEGGFL 1533 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 +R + +++ +ADD ++ + K + + Sbjct: 1534 SGCN--------------IRGGSEPPLYISHLFFADDTIIFCEARKDXLTHL-SWILFWF 1578 Query: 304 EGSLKLRLNMDKTKIPHVNDGFIF------LGHRLIRKRSRY 339 E + L++N+ K++I V + LG R+ S+Y Sbjct: 1579 EAASGLKINLAKSEIIPVGEVVEMEELAVELGCRVGSLPSQY 1620 >UniRef50_Q9G8T4 Orf621 n=1 Tax=Rhodomonas salina RepID=Q9G8T4_RHDSA Length = 621 Score = 203 bits (517), Expect = 7e-51, Method: Composition-based stats. 
Identities = 97/409 (23%), Positives = 166/409 (40%), Gaps = 57/409 (13%) Query: 3 RKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGV----DGVNKTMLQA 58 +KL + + L++ + L A +KGA T G D + + Sbjct: 20 KKLYELNKENTNKSNDNLMKFLYDEGMLWNAVEKLKKNKGAATFGPTNNKDRKSIE-IDG 78 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 +++ L +EL ++ P RR+ +PK N K RPLGI + DRIVQ + + I+ Sbjct: 79 LNLGQIKRLSEELREETFKWSPTRRIEVPKKNNKRRPLGIFSFEDRIVQEGIRTILNAIY 138 Query: 119 ESDFH-TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKA 177 E F ++GFRP S A+ +K IEGD+ ++ ++H +LMK Sbjct: 139 EPTFSGNNNHGFRPRLSSETALELLKR-----NRKGKTHAIEGDIKKAYEGINHNILMKI 193 Query: 178 VRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 ++++I+D +F+ ++ + +K G + VPQG + SP+L NI +NEFD+ + Sbjct: 194 LKKKINDKKFLRIIEEGLKCGIEKNRKIYNSITVVPQGSICSPILFNIYMNEFDEAIKTI 253 Query: 238 YLSGKA--------RKDRWYWNNSIQRGRSTAVRENWQWKPAVA---------------- 273 RK N + +S + N + + + Sbjct: 254 IEEIFTRLNESRDSRKSSSTMNKEYKILKSRSDEMNKKIRERLKQESPFIKTLLKSHKKI 313 Query: 274 --------------------YCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 Y RYADD+++I G E I+ ++ LKL L Sbjct: 314 RWKLKRKKAIDYGKRTLEYFYLRYADDWIIITNGNVRVCEEIKIRISTWIKEELKLELEQ 373 Query: 314 DKTKIPHV-NDGFIFLGHRLI-RKRSRYGEMRVVSTIPQEKARNFAASL 360 KT+I ++ D FLG ++ +R G + I + N + Sbjct: 374 SKTRITNMEKDPIKFLGFSIMTPINTRIGTIEKKIGIRTTRRTNLGPRI 422 >UniRef50_Q10VN2 RNA-directed DNA polymerase (Reverse transcriptase) n=3 Tax=Cyanobacteria RepID=Q10VN2_TRIEI Length = 437 Score = 203 bits (517), Expect = 7e-51, Method: Composition-based stats. 
Identities = 90/367 (24%), Positives = 158/367 (43%), Gaps = 38/367 (10%) Query: 21 LRLITQP--EWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQP 78 ++L+ + L R+T ++G G D ++ + ++L L +Q Sbjct: 1 MKLMLRSYSNLLLSVRRVTQENQGIRRMGRDAQTAKTSVEKVKLVKEMLTYRL----WQA 56 Query: 79 LPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHA 138 PA+RVYIPK+N + PLGIP +++R+ Q + +EPIW+++F T SYGF P RS H Sbjct: 57 KPAKRVYIPKANRQQGPLGIPTVKNRVAQAVVKNGLEPIWDAEFETNSYGFHPGRSCHDP 116 Query: 139 IRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 + + + W+++ D+ FD + H ++KA+ L+ + +KAG Sbjct: 117 LEQ---FWIRLQKGKDTWILDVDIKQDFDNITHEYILKAIGEIPG----RELIKQWLKAG 169 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 +++ +F G G+ISPLL+NI + ++ L + R + + + Sbjct: 170 YLEAEVFHKTEGGTSSRGIISPLLANIAFDGMERLLARYKTVKTYQCTRPTTDEEYTKKK 229 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 + RYADDF++ + + ++AI L L LN DKT + Sbjct: 230 KLD---------KYGFIRYADDFIITARS-EEDIKAIIPTIEKWLSER-GLELNKDKTNL 278 Query: 319 PHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK---------VRI 369 H+ GF FLG + + +EK + F + L + Sbjct: 279 VHIEQGFNFLGFNVRQF-----NGSCFIVPQKEKVKEFLTLIRGWLKAHPTSTQEAVIAN 333 Query: 370 SGEILLG 376 I+ G Sbjct: 334 LNSIIRG 340 >UniRef50_UPI0001982DF4 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001982DF4 Length = 1473 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. 
Identities = 56/384 (14%), Positives = 127/384 (33%), Gaps = 51/384 (13%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR--- 83 P + E + G PG DG Q + + D + Q + Sbjct: 980 PFTMEEIHSALMDMNGDKAPGPDGFTGAFWQNCWEFVKEEIMDLFKEFYVQKSFEKSLNT 1039 Query: 84 ---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 V IPK G + RP+ + ++V + + ++ + F R + Sbjct: 1040 TFLVLIPKKGGAEDLGEFRPISLLGGLYKLVAKVLANRLKKVLGKVVSMDQNAFVRGRQI 1099 Query: 136 HHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A + + + + + + D+ +D+++ + LMK +++ R+M +W Sbjct: 1100 LDAALIANEVVDFWYKRKEKGLICKLDIEKAYDSINWKFLMKVLQKMGFGTRWMEWIWWC 1159 Query: 195 I---KAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 I K + G+ F +S+G+ QG +SP L + + + G Sbjct: 1160 ISTAKFSILVNGVPTGFFPSSKGLRQGDPLSPYLFVMGMEVLSALIRRAVGGGFVSGC-- 1217 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 +++ V++ +ADD ++ + K + ++ E + Sbjct: 1218 ------------SLKGRGGLVMEVSHLLFADDTIIFCEAKKEYLTSLC-WFLAWFEAASG 1264 Query: 309 LRLNMDKTKIPHVNDGFI------------------FLGHRLIRKRSRYGEMRVVSTIPQ 350 LR+N+ K+++ + + +LG L V + Sbjct: 1265 LRINLAKSELIPIGEVEDIEEMAVELGCKVGALPSVYLGLPLGAHHKAISMWDGVEERMR 1324 Query: 351 EKARNFAASLTALLWKVRISGEIL 374 + + + ++ + L Sbjct: 1325 RRLALWKRQYISKGGRITLIKSTL 1348 >UniRef50_O99970 Orf546 n=2 Tax=Porphyra purpurea RepID=O99970_PORPU Length = 546 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. 
Identities = 91/396 (22%), Positives = 152/396 (38%), Gaps = 66/396 (16%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q K+ ++ + + + I + ++ ++T + G T GVD ++ Sbjct: 19 LQCKIFKFSKEGDMNSVFLIQKQIIKHDFSKFLAVRKVTQDNLGKRTAGVDRISNLTPDE 78 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 R+ + I D + RRV I K NGK R LGIP +RDR Q + A+EP + Sbjct: 79 RMELVQNIQIDN------KSDKIRRVTILKPNGKERHLGIPTIRDRAKQCLVKFALEPQY 132 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F SYGFRP RS + A + + L + V++ D+ FD + H L+ + Sbjct: 133 EAIFEPNSYGFRPGRSSNDARQAIVKCLQQL----PKHVLDADIERCFDNIDHSKLIHGI 188 Query: 179 RRR-ISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER 237 + + L I G + G PQGG+ISPLL+NI L+ ++ + Sbjct: 189 NTFPLLREQVRAWLKACILTGFKENIKEVIPEAGTPQGGIISPLLANIALHGMEKAVC-- 246 Query: 238 YLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIRE 297 K V RYADDF+++ K EA + Sbjct: 247 -------------------------------KSGVYLIRYADDFLILCNEEKELSEA-KN 274 Query: 298 ECRGVLEGSLKLRLNMDKTKIPHV-------NDGFIFLGHRLIRKRSRYGE--------- 341 + L+ L L+L+ KTKI + G FLG + + Sbjct: 275 KIEIFLQN-LGLKLSESKTKITYTGSSEYSRTKGVDFLGFNFVNYKVGKHTSAKNNQGIA 333 Query: 342 --MRVVSTIPQEKARNFAASLTALLWKVRISGEILL 375 + + + ++ + K + +L Sbjct: 334 TGWKSRCQPSYKSIESHLDNIKDITKKSTGLSQKVL 369 >UniRef50_Q35063 CoxI intron1 ORF n=2 Tax=Eukaryota RepID=Q35063_MARPO Length = 902 Score = 201 bits (511), Expect = 4e-50, Method: Composition-based stats. 
Identities = 99/422 (23%), Positives = 161/422 (38%), Gaps = 64/422 (15%) Query: 2 QRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTP--GVDGVN------K 53 +RKL T + S + + + + T P +L A S TP G + N Sbjct: 179 RRKLETLKRNEKSGKFENIYSICTDPNFLIAAYEQIKSHTSNMTPEGGEERENLFLRQVA 238 Query: 54 TMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNG--KLRPLGIPALRDRIVQRAML 111 + L++ + + L S ++ ARR+ IP N + RPL I + D IVQ+AM Sbjct: 239 SPLESLDRAWFERTAELLRSEQFRFKLARRIMIPTPNKPREFRPLTIGS--DNIVQQAMK 296 Query: 112 MAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHH 171 + ME I+E F S+GFRP R H + + T W +E D+ F+++ Sbjct: 297 IVMEHIYEPKFLDTSHGFRPGRGCHSGLEQIC-----LKWTGASWFLEFDIKRCFNSMDR 351 Query: 172 RLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVIS----------PL 221 L+ +++ I D R+M L+ K AG + L QG V+S PL Sbjct: 352 HKLVFILQKDIEDQRWMDLVHKLFTAGLVGGELGGPDPL---QGSVLSPWSSPPWALAPL 408 Query: 222 LSNIMLNEFDQYLHERYLSGKARKDRW---------------------------YWNNSI 254 NI L++ DQ + + + R Sbjct: 409 FCNIYLHDLDQEVAKMANELSRSRKRRVDKRTTAATRTPRTKAFRALTPQAEIMRVRRKA 468 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 RG S R++ Y RYA +F+L + G + V ++ + L L L Sbjct: 469 ARGLSPTDRKD-PNYARAFYVRYAGNFLLGIAGPRELVATVKSRIVQFVNSELHLELTGG 527 Query: 315 KTKIPHVN-DGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTAL-LWKVRISGE 372 I H++ + FLG + S ++R EK R + L + K + Sbjct: 528 --SISHISAESVKFLGMEIKVVPS--SKLRRRFGKAMEKRRRVRNRIFTLKVQKRKRQDS 583 Query: 373 IL 374 ++ Sbjct: 584 LV 585 >UniRef50_B1C301 Putative uncharacterized protein n=6 Tax=Clostridium spiroforme DSM 1552 RepID=B1C301_9FIRM Length = 270 Score = 200 bits (508), Expect = 8e-50, Method: Composition-based stats. 
Identities = 81/221 (36%), Positives = 127/221 (57%), Gaps = 6/221 (2%) Query: 5 LATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVEL 64 ++ T+ + ++LL I + + +A + +S+KG+ GVD + ++ A Sbjct: 39 ISHTKNTNRFVVHEKLLETIMEDANIEKAIQRVMSNKGSG--GVDKMQVAEVRTHFAQHW 96 Query: 65 QILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHT 124 L+ ++ GHY P +RV IPK NGK R LGIP + DR++Q+A++ + PI+E F Sbjct: 97 SYLKKLIMEGHYSPQAVKRVEIPKDNGKKRELGIPTVTDRVIQQAIVQVLTPIFEPQFSD 156 Query: 125 LSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISD 184 SYGFRP R+ H A+R V + R+ ++ DL YFDTV+H L++ + + I D Sbjct: 157 NSYGFRPRRNAHQAVRKVVEYANE----GYRYTVDLDLEKYFDTVNHSRLIQILSQTIKD 212 Query: 185 ARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNI 225 R ++L+ K + AG I F ++GVPQGG +SPLLSNI Sbjct: 213 GRVISLIHKYLNAGVIVKHKFEETTKGVPQGGPLSPLLSNI 253 >UniRef50_A5B6Q1 Putative uncharacterized protein n=4 Tax=Vitis vinifera RepID=A5B6Q1_VITVI Length = 1936 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 48/315 (15%), Positives = 109/315 (34%), Gaps = 33/315 (10%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR--- 83 P AE + G PG DG Q + + + D + Q + Sbjct: 1145 PFSEAEIYAALMGMNGDKAPGPDGFTVAFWQNCWEIVKEDVLDMFKEFYDQNSFIKSLNH 1204 Query: 84 ---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 V IPK G RP+ + +++ + + ++ I + F R + Sbjct: 1205 TFLVLIPKKGGAEDLGDYRPISLLGGLYKLLAKVLANRLKKIIDKVISPDQNAFIKGRQI 1264 Query: 136 HHAIRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 + + + +I + D+ FD ++ + L+K + + ++++ +W Sbjct: 1265 LDGSLIANEVIDSWQKRGEKGLIXKLDIEKAFDNINWQFLLKVMHKMGFGSKWIGWMWSC 1324 Query: 195 IKA------GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 I + F ++S+G+ QG +SP L + + + G R Sbjct: 1325 ISTIKYSMLVNGVPAGFFSSSKGLRQGDPLSPYLFIMGMEVLSALISRAVEGGFIYGCR- 1383 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 + + + + +ADD ++ + K + + E + Sbjct: 1384 -------------IWKGRGQPVNITHLLFADDTIVFCEAKKESLLYL-SWILLWFEAASG 1429 
Query: 309 LRLNMDKTKIPHVND 323 L++N++K+ + V + Sbjct: 1430 LKINLEKSMVIPVGE 1444 >UniRef50_Q1KSC2 Putative non-LTR retroelement reverse transcriptase n=11 Tax=Sorghum bicolor RepID=Q1KSC2_SORBI Length = 1505 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 58/390 (14%), Positives = 128/390 (32%), Gaps = 55/390 (14%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH------ 75 ++ P +E + PG DG + Q V L H Sbjct: 942 DILISPFTESEVREAIFQMEHNKAPGPDGFPAELYQVFWKVIKDDLLSLFSELHREALDL 1001 Query: 76 YQPLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 Y IPK N + RP+ + + +I + + + ++ F Sbjct: 1002 YSLNFGIITLIPKVNNAIRIQQYRPICVLNVSFKIFTKVGTNWLNMVAKTVVTPTQMAFM 1061 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 P R++ + + + + + VI + D +D V L + +R + ++ Sbjct: 1062 PGRNIMEGVVILHETVHELHTKKRNGVIFKIDFEKAYDKVKWSFLQQTLRMKGFSPKWCR 1121 Query: 190 LLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + + G + D+G + G+ QG +SP+L NI+ + ++ G+ Sbjct: 1122 WVQTMVTGGSVGIKVNDDIGPYFQTKRGLRQGDPMSPILFNIVADMLALLINRAKADGQI 1181 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 R + ++ +YADD ++ + Q + ++ Sbjct: 1182 RGVIPHL-----------------IDDGLSILQYADDTIIFLDHDPEQAKNLKLLLCAF- 1223 Query: 304 EGSLKLRLNMDKTKI------------------PHVND-GFIFLGHRLIRKRSRYGEMRV 344 E L++N K++I + + F +LG + ++ E Sbjct: 1224 EQLSGLKINFHKSEIFCYGAAKEMESFYTNLLGCNAGEYPFCYLGIPMHHRQLLNSEWSK 1283 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 V ++K + + ++ + +L Sbjct: 1284 VEDRFKQKLSCWKVKYLSYGGRLVLLNSVL 1313 >UniRef50_C5NNP0 Putative unclassified retrotransposon protein n=1 Tax=Oryza sativa Indica Group RepID=C5NNP0_ORYSI Length = 1283 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. 
Identities = 59/390 (15%), Positives = 133/390 (34%), Gaps = 55/390 (14%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRD---ELLSGH--- 75 +T P E ++ PG DG Q V L + EL +G Sbjct: 791 EFLTSPFLEKEIRDAVFDTEHNKAPGPDGFPAEFYQKFWEVIKHDLMNLFHELHTGKLPL 850 Query: 76 YQPLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + +PK + RP+ + + ++ + + + + F Sbjct: 851 FNLNFGVITLLPKVKEANRIQQYRPICLFNVSFKLFTKVATNRINSVADHVVSPTQTAFM 910 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R++ + + L + + VI + D +D V LM+A+R + ++++ Sbjct: 911 RGRNILEGVVILHETLHELHRKKLNGVIVKIDFEKAYDKVKWPFLMQALRMKGFSTKWIS 970 Query: 190 LLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + + G + DVG F +G+ QG +SP+L NI+ + ++ + G+ Sbjct: 971 WIESFVSGGSVSIKVNNDVGPFFQTKKGLRQGDPLSPMLFNIVADMLVILINRAKVDGQI 1030 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + ++ +YADD +L + + ++ Sbjct: 1031 CGVVPHL-----------------VDDGISILQYADDTILFMDHDLEKACNMKLLLCAF- 1072 Query: 304 EGSLKLRLNMDKTKIPHVND-------------------GFIFLGHRLIRKRSRYGEMRV 344 E L++N K+++ D +LG + ++ R + Sbjct: 1073 EQLSGLKINFHKSELFCFGDARAMEDQYTDLFGCSSGEFPLCYLGIPIHYRKLRNADWTG 1132 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 V +++ ++ L + ++ + +L Sbjct: 1133 VEERFEKRLSSWKGKLLSTGGRLTLINSVL 1162 >UniRef50_UPI0001C15D3C hypothetical protein CRC_00192 n=2 Tax=Nostocaceae RepID=UPI0001C15D3C Length = 566 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. 
Identities = 88/384 (22%), Positives = 152/384 (39%), Gaps = 59/384 (15%) Query: 1 MQRKLATWAATDPSLRIQRLLRLITQPEW--LAEAARITLSSKGAHTPGVDGVNKTMLQA 58 +Q K+ + T + + +++ L ++T +KG T DG + L Sbjct: 24 LQSKIYQASLTGDKSSVVKYQKILINSYSAKLLAVKKVTQDNKGKKTA--DGDQRIDLAK 81 Query: 59 RLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIW 118 L ++ + + P R+ IPKSNG+ R LGI + DR Q MA+EP W Sbjct: 82 NLELDGKAI------------PLTRMEIPKSNGESRNLGISKMEDRAKQALAKMALEPEW 129 Query: 119 ESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAV 178 E+ F +YGFRP RS H AI ++ Q+ R +V+ D+S FD V H +++ Sbjct: 130 EAKFEPNNYGFRPGRSCHDAISAIESQVRR----RTSYVLSVDISGCFDKVKHEAIVEKC 185 Query: 179 RRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 + + +K+G + +F +G P GVISPLL+NI L+ + ++ ++ Sbjct: 186 N---TFPIMERQIRAWLKSGVMIGEVFHPLEKGEPVEGVISPLLANIALHGLETHISHKF 242 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 + + R A DF+++ + ++A++ E Sbjct: 243 ----------------PSVSPSEAQGKVGEIGEARLIRCAHDFLVL-HWEEKTIKAVKTE 285 Query: 299 CRGVLEGSLKLRLNMDKTKIPHVND-------GFIFLGHRLIRKRSRYGEMR-------- 343 L G + L LN K ++ H + G FLG + R + Sbjct: 286 VETWL-GEIGLNLNQQKIRMCHTMEEYNGEKPGLDFLGFNIRTYRIGKYKSNEKVNGESP 344 Query: 344 ---VVSTIPQEKARNFAASLTALL 364 ++ F + L Sbjct: 345 GMLTKIKPSEKSVERFMTDIKETL 368 >UniRef50_Q53NG0 Retrotransposon protein, putative, unclassified n=1 Tax=Oryza sativa Japonica Group RepID=Q53NG0_ORYSJ Length = 885 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. 
Identities = 59/380 (15%), Positives = 132/380 (34%), Gaps = 55/380 (14%) Query: 32 EAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHY--QPLPARRVYI--- 86 + + PG DG Q V L + H PL + I Sbjct: 433 KIRGAVFEMEHNKAPGPDGFPAEFYQNFWEVIKDDLMNLFRDFHVGDLPLFSLNFEIITL 492 Query: 87 -PKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 PK + + RP+ + + +I + + + + + F R++ + Sbjct: 493 LPKVHEANHIQQYRPICLLNVSFKIFTKVATNRINFVADHVVSSSQIAFMRGRNILEGVV 552 Query: 141 TVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGH 199 + + + + VI + D +D V+ L++ + + ++++ + I G Sbjct: 553 VLHETVHELHRKKLNGVIFKVDFEKAYDKVNWPFLLQTLHMKGFSPKWISWVESFISGGS 612 Query: 200 I------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 + +VG F +GV QG +SP+L NI+ + ++ G+ + N Sbjct: 613 VVVKVNDEVGHFFQTKKGVRQGDPLSPILFNIIADMLTVLINRAKEDGQITGVVPHLIN- 671 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 ++ +YADD VL + + ++ E L++N Sbjct: 672 ----------------DDLSILQYADDTVLFMGHDLQKARNMKLLLFVF-EQLSGLKINF 714 Query: 314 DKTKIPHVNDG-------------------FIFLGHRLIRKRSRYGEMRVVSTIPQEKAR 354 K+++ +G F +LG + +R R + + V +++ Sbjct: 715 HKSELYCFGEGKDFENEYMELFGCSTGQFPFRYLGIPIHYRRLRNADWKEVVECFEKRLS 774 Query: 355 NFAASLTALLWKVRISGEIL 374 ++ L + ++ + +L Sbjct: 775 SWKGKLLSTGGRLTLINSVL 794 >UniRef50_A6P1G1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P1G1_9BACE Length = 320 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. 
Identities = 71/201 (35%), Positives = 111/201 (55%), Gaps = 8/201 (3%) Query: 7 TWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI 66 A + S + +RL R + P++ A + + G T G DG V + Sbjct: 5 KSKACNQSYKYERLYRNLYNPQFYLLAYQRIQAKPGNMTAGTDGKTI---DGMGMVRVNA 61 Query: 67 LRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 L +++ YQP PARR YIPKSNGK+RPLGIP+ D+++Q + + +E I+E F S Sbjct: 62 LIEKMRDFSYQPNPARRTYIPKSNGKMRPLGIPSFDDKLIQEVVRLILESIYEPTFSDHS 121 Query: 127 YGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDAR 186 +GFR +S H A++ V+ T +W +EGD+ FD V H +L+ +R+RI+D Sbjct: 122 HGFRMNKSCHTALKYVQKYF-----TGTKWFVEGDIRGCFDNVDHHVLIAILRKRIADEH 176 Query: 187 FMTLLWKTIKAGHIDVGLFRA 207 F+ LLWK +KAG+++ + Sbjct: 177 FIGLLWKFLKAGYMEDWNYHK 197 >UniRef50_C6I8L1 CRISPR-associated protein n=1 Tax=Bacteroides sp. 3_2_5 RepID=C6I8L1_9BACE Length = 756 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 95/341 (27%), Positives = 153/341 (44%), Gaps = 44/341 (12%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 IT L A R + G+DG + + RL L L+ EL+S + P P Sbjct: 5 YHSITTLHALQNAWRAVRAK--NAAGGIDGFTLSHFEKRLNDNLIELQHELISQTWNPEP 62 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 R+ I K+ + R LG+ ++D+IVQ+A+ A+EP E F LSYG+RP + AI+ Sbjct: 63 YLRIEITKNETEKRKLGLLCIKDKIVQQAIKTAIEPQLEKTFLNLSYGYRPNKGPERAIK 122 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 V + D + + +V + D+ +YFDT++H L + + D + L+ I+ G + Sbjct: 123 RV---VHDLKKLKSGYVAKLDIDNYFDTINHERLFTRLANWLKDDETLRLIRLCIQTGIV 179 Query: 201 DVG-LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRS 259 ++ ++GVPQG ++SPLL+N L+ FDQ+ + Sbjct: 180 TPQLQWQEINKGVPQGAILSPLLANFYLHPFDQFAANKVPM------------------- 220 Query: 260 TAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIP 319 Y RYADDF++ T+ Q++ E + LE L+LN T I Sbjct: 221 --------------YIRYADDFLI-ATSTEKQIKEAVELVKEELESQFYLQLN---TPII 262 Query: 320 
H-VNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAAS 359 H +DG FLG + E + + + + F S Sbjct: 263 HNFHDGIEFLGITISDTGLSITEKKKKTLQERINSIKFIKS 303 >UniRef50_A4WS58 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WS58_RHOS5 Length = 366 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. Identities = 84/364 (23%), Positives = 142/364 (39%), Gaps = 50/364 (13%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 L I E L EA +SKG H + +A L L +++ L+ Sbjct: 3 KTYNHLWPQIIAFETLVEAW--ARTSKGRHR----QRDVIAFEADLEPNLFAIQESLIQK 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+ P R ++ + K R + L+DR+VQ A++ +EPI+E+ F S+ R + Sbjct: 57 TYRTGPYHRFFVYEP--KKREIASLPLKDRVVQHALVSVIEPIFEARFIDQSFACRVGKG 114 Query: 135 VHHAIRTVKLQLTD-CGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H TV+ + + E + ++ D+S YF +V H L + +RRRI+ + L+ Sbjct: 115 AHKGADTVQRYMREVLREQGQVFALKADISKYFPSVCHDALRRIIRRRIACPDTLWLIDS 174 Query: 194 TIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNS 253 +++ L G+P G + S + +NI L+E D ++ Sbjct: 175 ILESSAEPGAL---TPRGIPIGNLTSQMFANIYLHELDHFVKHTLRER------------ 219 Query: 254 IQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNM 313 Y RY DDF +I KA + +R C L L LR N Sbjct: 220 -------------------RYVRYMDDFAVIHHD-KAHLHEVRRACEDFLWAELGLRTN- 258 Query: 314 DKTKIPHVNDG---FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRIS 370 KT++ + + FLG+R+ + V + K R A+ Sbjct: 259 AKTQVFPIGEPGRALDFLGYRIWPTHRALRKDSVNRM--KRKMRRMASLYHRGEITWDDI 316 Query: 371 GEIL 374 ++ Sbjct: 317 DPVI 320 >UniRef50_B3CUR8 Reverse transcriptase n=24 Tax=Orientia tsutsugamushi RepID=B3CUR8_ORITI Length = 379 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. 
Identities = 87/380 (22%), Positives = 151/380 (39%), Gaps = 52/380 (13%) Query: 4 KLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE 63 ++ ++ + ++ L +I + L E + S G+DG+ K +L Sbjct: 17 RIKLLSSKNQDIKFNNLGHII-DLKMLEEQYKELDS---KKAIGIDGITKEDYGKKLKAN 72 Query: 64 LQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFH 123 L L + YQ PAR V IPK +G RPL I D+I++ A+ + ++E F Sbjct: 73 LLSLLTRIRKWQYQAKPARIVKIPKEDGGKRPLVISCFEDKIIESAVSKILNSVFEPIFL 132 Query: 124 TLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRIS 183 SYGF P+ + H A+R + + + ++E D++ F+T+ H LM+ +R+RIS Sbjct: 133 KYSYGFGPKLNAHDALRELNRLTYNFNKG---AIVEIDITKCFNTIKHCELMEFLRKRIS 189 Query: 184 DARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNE-FDQYLHERYLSGK 242 D +F+ L+ K I+ I+ EG QG ++SP+L+N+ L+ D + + Sbjct: 190 DKKFLRLVMKLIETPIIENDTIVTNKEGCRQGSIVSPILANVFLHYVIDSWFAKISEENL 249 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 RY DD V V ++A + + Sbjct: 250 I--------------------------GQTGMVRYCDDMV-FVFESEADAKRFYDVLPKR 282 Query: 303 LEGSLKLRLNMDKTKIPHVND--------------GFIFLGHRLIRKRSRYGEMRVVSTI 348 L L +N K+++ + FLG +SR+G + Sbjct: 283 L-NKYGLNINEAKSQMIKSGRDHAANLAKQGKKIASYNFLGFTCYWSKSRFGTTWRLKYT 341 Query: 349 PQEKARNFAASLTALLWKVR 368 + F L L +R Sbjct: 342 SRRD--RFTEKLKGLRKYLR 359 >UniRef50_Q1Q3I7 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3I7_9BACT Length = 316 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. 
Identities = 89/334 (26%), Positives = 150/334 (44%), Gaps = 48/334 (14%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPA 81 L A + G G DGV + L + L+I+R EL Y PLP Sbjct: 5 EFALNLSALYSAFDAVKENHGC--AGADGVTIERYEGNLDLNLRIMRKELTEQTYFPLPL 62 Query: 82 RRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRT 141 R+ + K NG+ R L IP++RDRIVQ A+L +EP+ E +F S+ +R RSV A+ Sbjct: 63 LRILVDKGNGEARALCIPSVRDRIVQAAVLQLIEPVLEKEFEECSFAYRKGRSVKQAVYK 122 Query: 142 VKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID 201 V+ + E +WV++ D+ ++FD+V + LL+ + I D L+ +K D Sbjct: 123 VR----EYYEQGYQWVVDADIDAFFDSVDYSLLLLKFKCYIHDPCIQNLVGLWLKGEVWD 178 Query: 202 VGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTA 261 +G+PQG ISP+L+N+ L+EFD+ L Sbjct: 179 GKTVTTLKKGIPQGSPISPILANLYLDEFDEELTRN------------------------ 214 Query: 262 VRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHV 321 R++DDF+++ K + E+++ + + + L+ +D+ ++ + Sbjct: 215 ---------GYKLVRFSDDFIILCKNSGMAKESLKLTKKILEKLLLE----LDEEQVINF 261 Query: 322 NDGFIFLGHRLIR-----KRSRYGEMRVVSTIPQ 350 + GF FLG ++ R + R V P+ Sbjct: 262 DQGFKFLGVIFVKSMIMVPFDRPKKERKVLFFPK 295 >UniRef50_A5B242 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5B242_VITVI Length = 1300 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. 
Identities = 53/384 (13%), Positives = 124/384 (32%), Gaps = 51/384 (13%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR--- 83 P E L G DG Q + + D + Q A+ Sbjct: 737 PFTEEEIHXAXLEMNSDKAXGPDGFTVAFWQFCWDFVKEEIVDLFKXFYEQRSFAKSLNT 796 Query: 84 ---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 + IPK G RP+ + +++ + + ++ + + F R + Sbjct: 797 TFLILIPKKGGTEDLGDFRPISLLGGLYKLLAKVLANRLKKVLGNVVSVDQNAFVRGRXI 856 Query: 136 HHAIRTVKLQLTDCGETRGRW-VIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A + + +W + + D+ +D+++ LMK + + + +M +W Sbjct: 857 LDATLIANEVNDFWHKRKEKWLICKLDIEKAYDSINWNFLMKVLHKMGFGSWWMEWIWWC 916 Query: 195 IKA------GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 I + F + ++G+ QG +SP L + + + + G Sbjct: 917 ISTAKFFVLVNGVPAGFFSITKGLRQGDPLSPYLFVLGMEVLSALIRRVVVGGFISGC-- 974 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 ++R + + V++ +ADD ++ + K + ++ E + Sbjct: 975 ------------SLRGRGRMEMDVSHLLFADDTIIFCEARKEYLTSL-SWILAWFEAASG 1021 Query: 309 LRLNMDKTKIPHVND------------------GFIFLGHRLIRKRSRYGEMRVVSTIPQ 350 LR+N+ K+++ V + ++LG L V + Sbjct: 1022 LRINLAKSELILVGEVEEIEEMAMELGCKVGSLPSVYLGLPLGAHHKAISMWDGVEERMR 1081 Query: 351 EKARNFAASLTALLWKVRISGEIL 374 + + + ++ + L Sbjct: 1082 RRLALWKRQYISKGRRITLIKSTL 1105 >UniRef50_UPI00019859F4 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI00019859F4 Length = 1482 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. 
Identities = 57/384 (14%), Positives = 124/384 (32%), Gaps = 51/384 (13%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR--- 83 P E L G PG DG Q + + + + Q + Sbjct: 804 PFSEDEIHSALLEMDGDKAPGPDGFTVAFWQECWDFVKEEILELFKEFYEQRSFVKSLNT 863 Query: 84 ---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 V IPK G RP+ + +++ + + ++ + F R + Sbjct: 864 TFLVLIPKKGGAEDLGDFRPISLLGGLYKLLAKVLANRLKKVLGRVVSLDQNAFVKGRQI 923 Query: 136 HHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A + + + + + + D+ +D++ LMK +R+ +R+M +W Sbjct: 924 LDASLIANEVIDAWQKRKEKGLICKLDIEKAYDSISWDFLMKILRKLGFGSRWMEWIWWC 983 Query: 195 I---KAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 I K + G+ F ++++G+ Q +SP L + + L + G R+ Sbjct: 984 ISTAKFSVLVNGVPTGFFSSTKGLRQEDPLSPYLFVLGMEVLSALLRRAAVGGFFSGCRF 1043 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 + + +++ +ADD ++ + K E + Sbjct: 1044 --------------WGRGRMELNISHLLFADDTIIFCEARKEN-MTFLSWILAWFEAASG 1088 Query: 309 LRLNMDKTKIPHVNDGFI------------------FLGHRLIRKRSRYGEMRVVSTIPQ 350 LR+N+ K+++ V + +LG L V + Sbjct: 1089 LRINLAKSELIPVGEVEEIEEMAVELGCRVGSLPNVYLGLPLGVPHKASSMWDGVEEKMR 1148 Query: 351 EKARNFAASLTALLWKVRISGEIL 374 + + + +V + +L Sbjct: 1149 RRLALWKRQYISKGGRVTLIKSML 1172 >UniRef50_Q8RSV8 Maturase n=1 Tax=uncultured marine bacterium RepID=Q8RSV8_9BACT Length = 386 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. 
Identities = 93/339 (27%), Positives = 143/339 (42%), Gaps = 57/339 (16%) Query: 47 GVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIV 106 GVD V+ + + L L + L SG Y P P + V IPK +GK R LGIP + DR+ Sbjct: 2 GVDHVSMEAIASNPRKYLYPLWNRLSSGSYFPPPVKLVPIPKGDGKERMLGIPTIIDRVA 61 Query: 107 QRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYF 166 Q + +E I E FH S+G+RP +S H A+ + +V++ D+ +F Sbjct: 62 QEVIKAELEVIVEPRFHPSSFGYRPHKSAHEALEQCAKNSWERW-----YVVDLDIKGFF 116 Query: 167 DTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHID-VGLFRAASEGVPQGGVISPLLSNI 225 D + H +M +R+ + + + +K D VG +A +G PQGGVISPLL+N+ Sbjct: 117 DNIDHEKMMGILRKHTNKKHILLYCDRWLKTPMQDRVGGVQARMKGTPQGGVISPLLANL 176 Query: 226 MLNE-FDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLI 284 L+E FDQ++ +P + + RYADD V+ Sbjct: 177 YLHEAFDQWI-------------------------------STTQPRIVFERYADDIVIH 205 Query: 285 VKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVND-------------GFIFLGHR 331 + Q I ++ + L+ S L L+ DKTKI + F FLG Sbjct: 206 TRSM-EQSHFILDKLKARLK-SYSLELHPDKTKIVYCYRTARFHKEGKEIPVSFDFLGFT 263 Query: 332 LIR----KRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 K + I ++ + L L Sbjct: 264 FKPRLCLKSNGEKFWGFRPAISKKSEKRILGELRKLKIH 302 >UniRef50_A5BQN8 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BQN8_VITVI Length = 1680 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. 
Identities = 57/315 (18%), Positives = 116/315 (36%), Gaps = 33/315 (10%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLA---VELQILRDELLSGHYQPLPARR 83 P + E + +G PG DG + Q E+ L E + Sbjct: 1048 PFSVEEIHFALMEMRGDKAPGSDGFSVAFWQGCWDFVKEEVVDLFKEFFAHGSFAKSLNT 1107 Query: 84 ---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 V IPK G RP+ + +++ + + ++ + + F R + Sbjct: 1108 TFLVLIPKKGGAEDLRDFRPISLLGGLYKLLAKVLANRLKKVLDRVVSVDQNAFVRGRQI 1167 Query: 136 HHAIRTVKLQLTDCGETRGRWV-IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A + + + + + + D+ +D+++ + LMK +R+ +R+M +W Sbjct: 1168 LDASLVANEMIDYWYKRKEKGLKCKLDIEKAYDSINWKFLMKVLRKMGFGSRWMDWMWWC 1227 Query: 195 I---KAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 I K + G+ F + S+G+ QG +SP L + + + G R Sbjct: 1228 ISXVKFSILINGVPAGFFSNSKGLRQGDPLSPYLFVLGMEVLSTLIRRXGEXGFISGCR- 1286 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 +R + AV++ +ADD ++ K Q+ + E + Sbjct: 1287 -------------LRGRGGEELAVSHLLFADDTLIFCKARXEQLTNL-SWILAWFEAASG 1332 Query: 309 LRLNMDKTKIPHVND 323 LR+N+ K+ + V + Sbjct: 1333 LRINLAKSVLIPVGE 1347 >UniRef50_A5BBH2 Putative uncharacterized protein (Fragment) n=1 Tax=Vitis vinifera RepID=A5BBH2_VITVI Length = 1331 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 56/320 (17%), Positives = 118/320 (36%), Gaps = 33/320 (10%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPA 81 ++ P E + G PG+DG + Q + + + H Q Sbjct: 948 EILEAPFTEGEVQSALMEMNGDKAPGLDGFSVFFWQCYWDFVKEEIMEMFKELHVQNTFL 1007 Query: 82 RR------VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + V IPK G RP+ + +++ + + ++ + F Sbjct: 1008 KSINNTFLVLIPKKGGAEDLGDFRPISLLGGLYKLMAKVLANRLKKVIGKVVSHDQNTFV 1067 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R + A + + + VI + D+ +D+++ + L+K + + A+++ Sbjct: 1068 TGRQILDASLIANEAIDFWNKKGDKGVICKLDIEKAYDSINWQFLLKVMEKMGFGAKWLR 1127 Query: 190 LLWKTIKA------GHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 +W I + F ++S+G+ QG +SP L + + + G Sbjct: 1128 WMWWCISTAKFSIMVNRTPTGFFSSSKGLRQGDPLSPYLFVMGMEVLSILIRRAMEGGFI 1187 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + IQR R AV +A+ +ADD ++ + K + + Sbjct: 1188 SGCK------IQRDRGRAVH--------IAHLLFADDTIVFYEAKKEYLTNL-NWILFWF 1232 Query: 304 EGSLKLRLNMDKTKIPHVND 323 E + +LR+N+ K++I V + Sbjct: 1233 EAASRLRINLAKSEIIPVGE 1252 >UniRef50_A5BN31 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5BN31_VITVI Length = 1302 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 61/378 (16%), Positives = 125/378 (33%), Gaps = 53/378 (14%) Query: 34 ARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS----GHY--QPLPARRVYIP 87 + +G PG DG Q + + D G + V IP Sbjct: 624 WVLLEEMRGDKAPGPDGFTMAFWQECWEFVKEEVVDLFKEFFEHGSFSKCLNTTFLVLIP 683 Query: 88 KSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTV 142 K G RP+ + +++ + + ++ + + F R + A Sbjct: 684 KKGGAEDLGDFRPISLLGGLYKLLAKVLANRLKKVLDRVVSVDQNAFVRGRQILDASLVA 743 Query: 143 KLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTI---KAG 198 + + + + + + D+ +D+++ + LMK +R+ +R+M +W I K Sbjct: 744 NEVIDYXHKRKEKGLICKLDIEKAYDSINWKXLMKVLRKMGFGSRWMDWMWWCISTAKFS 803 Query: 199 HIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQ 255 + G+ F + S+G+ QG +SP L + + L G R Sbjct: 804 ILINGVPAGFFSNSKGLRQGDPLSPYLFVLGMEVLSTLLRRAGEGGFLSGCR-------- 855 Query: 256 RGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDK 315 +R + V++ +ADD ++ K + QV + LE + LR+N+ K Sbjct: 856 ------LRGRGGVELIVSHLLFADDTIIFCKAKREQVTNL-SWILVWLEAASGLRINLAK 908 Query: 316 TKIPHVNDGFI-------------------FLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 + + V +LG L V + + + Sbjct: 909 SALIPVGQ-VDELEELAAELGCRLGVLPTVYLGLPLGAHHKTSSXWDGVEERMRRRLAQW 967 Query: 357 AASLTALLWKVRISGEIL 374 + ++ + L Sbjct: 968 KRQYISKGGRITLIKSTL 985 >UniRef50_P19593 Probable reverse transcriptase n=2 Tax=Scenedesmus obliquus RepID=RDPO_SCEOB Length = 608 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. 
Identities = 90/374 (24%), Positives = 165/374 (44%), Gaps = 39/374 (10%) Query: 2 QRKLATWAATDPSLRIQRL-LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARL 60 Q +A + +++L ++ A A + SSKG+ +PG+ + + + Sbjct: 39 QESIACAKREGNIVLVEKLAQEIVNSSFGRAVAVQTVASSKGSRSPGLSRESFKTNKNYV 98 Query: 61 AVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWES 120 A+ + + Y+ P R+YIPK +G RPL IP+ DR +Q +A+EP+ E Sbjct: 99 AMMATLEQITSNPHKYKATPLSRIYIPKRDGSARPLSIPSYTDRCLQALYKLAIEPMAEE 158 Query: 121 DFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRR 180 SYGFRP R+V A+ V L + ++V+E D+ D ++H+ + Sbjct: 159 VADLSSYGFRPMRNVSWAVGRVLNGLNNPLAN-YQYVVEIDIKGCVDNINHQFI-----S 212 Query: 181 RISDARFMTLLWKTIKAGHIDV--GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERY 238 +++ +LW +K G+I+ + + GVPQGG+ISPL+ N+ L+ + +++++ Sbjct: 213 QVTPFIPKKILWAWLKCGYIERNSNTLQPTTTGVPQGGIISPLIMNLTLDGLEFHIYKK- 271 Query: 239 LSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREE 298 ++++ YCRYADD V++ T+ Sbjct: 272 -----------------------IQKSSSQSKGNTYCRYADDMVILTT-TEETALIALPA 307 Query: 299 CRGVLEGSLKLRLNMDKTKIPHV---NDGFIFLGHRLIRKRSRYGEM-RVVSTIPQEKAR 354 + L L + + KT I ++ +GF FL R + R + IP + Sbjct: 308 VKEFLAVR-GLEVKLAKTTIKNIINDRNGFEFLSFRFRKVYRRNRKRLTSQVGIPISAIK 366 Query: 355 NFAASLTALLWKVR 368 NF ++ A+ + Sbjct: 367 NFRKNIKAISKTRK 380 >UniRef50_UPI00019844C7 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI00019844C7 Length = 1476 Score = 191 bits (485), Expect = 4e-47, Method: Composition-based stats. 
Identities = 58/390 (14%), Positives = 116/390 (29%), Gaps = 54/390 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS----GHY 76 L ++ E + A G PG DG V + G + Sbjct: 680 LEVMFSEEEIFAALSSFC---GDKAPGPDGFTMAFWLFCWDVVKPEILGLFREFYLHGTF 736 Query: 77 Q--PLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 Q + IPK G RP+ + +++ + + ++ + + F Sbjct: 737 QRSLNSTFLLLIPKKEGTEDLKDFRPISLVGSVYKLLAKVLANRLKTVMGEVISDSQHAF 796 Query: 130 RPERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 R + A+ L + +++ D+ FD V+ LM+ + + R++ Sbjct: 797 VHGRQILDAVLIANEALDSRLKDNIPGLLLKMDIEKAFDHVNWNFLMEVMSKMGFGHRWI 856 Query: 189 TLLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 + F +S G+ QG +SP L + + Q L Sbjct: 857 NWIKWCCSTASFSILINGSPSGFFRSSRGLRQGDPLSPYLFLLAMEALSQLLSRARNGNF 916 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 R S V++ +ADD ++ Q++ + Sbjct: 917 ISCFRVGGRGSEGLF--------------VSHLLFADDTLIFCDADADQLQYL-SWTFMW 961 Query: 303 LEGSLKLRLNMDKTKIPHVNDGF------------------IFLGHRLIRKRSRYGEMRV 344 E L++N++KTK V +G +LG L Sbjct: 962 FEAISGLKVNLNKTKAIPVGEGIPMETLAVVLGCKIGSLPTSYLGLPLGAPYKSIRVWDA 1021 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 V +++ + + ++ + L Sbjct: 1022 VEERFRKRLSLWKRQYLSKGGRLTLLKSTL 1051 >UniRef50_UPI0000DF064A Os02g0261600 n=1 Tax=Oryza sativa Japonica Group RepID=UPI0000DF064A Length = 958 Score = 191 bits (484), Expect = 5e-47, Method: Composition-based stats. 
Identities = 62/389 (15%), Positives = 122/389 (31%), Gaps = 58/389 (14%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL-------RDELLSGHY 76 + +P E R PG DG + + + + + Sbjct: 437 LGEPFTEEEVHRAIKEMPADKAPGPDGFTGAFFKVCWEIIKDDILLVFSSIYNLRCAHLN 496 Query: 77 QPLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 A V IPK +G RP+ + +I + + + + P F Sbjct: 497 LLNSANIVLIPKKDGAESVSDYRPISLIHSIAKIFAKMLALWLRPHMHELISVNQSAFIK 556 Query: 132 ERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 RS+H V+ R + + D++ FD+V L+ ++R+ R++ Sbjct: 557 GRSIHDNYLFVRNMTRRYHRLRRAMLLFKLDITKAFDSVRWDYLLALLQRKGFSTRWIDW 616 Query: 191 LWKTIKAG------HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 L + + + G G+ QG +SPLL + ++ + L + G Sbjct: 617 LGALLSSSTSQVLLNGSPGQRIKHGRGLRQGDPLSPLLFILAIDPLQRILSKATELGAIS 676 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 K R + YADD V+ + T+ V Sbjct: 677 KLRGRTT-------------------RLQISMYADDAVIFINPTRNDVATFANILHRFGT 717 Query: 305 GSLKLRLNMDKTKIPHVNDG-------------------FIFLGHRLIRKRSRYGEMRVV 345 + L N+ K+++ + G +LG L+ R R +++ V Sbjct: 718 AT-GLVTNLQKSQVAAIRCGNIDLEEVLQGVPAKRANFPLKYLGLPLVLGRLRKTDLQPV 776 Query: 346 STIPQEKARNFAASLTALLWKVRISGEIL 374 + ++ A+ + + +L Sbjct: 777 FDKISGRVASWRGKNMAVAGRTTLVKSVL 805 >UniRef50_Q2R4V1 Retrotransposon protein, putative, unclassified n=8 Tax=Oryza sativa RepID=Q2R4V1_ORYSJ Length = 1296 Score = 191 bits (484), Expect = 5e-47, Method: Composition-based stats. 
Identities = 54/390 (13%), Positives = 128/390 (32%), Gaps = 55/390 (14%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVE---LQILRDELLSGH--- 75 +TQ E + PG DG Q V L L E +G Sbjct: 838 EALTQDFTGKEIKEAIFQMEHNKAPGPDGFPTEFYQVFWNVIKNDLLELFKEFHNGSLPL 897 Query: 76 YQPLPARRVYIPK-----SNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + + +PK + RP+ + + +I + + + + F Sbjct: 898 FSLNFGTIILLPKCVEAMKIQQYRPICLLNVSFKIFTKVATNRVMSVAQKVISPTQTAFI 957 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R++ + + L + VI + D +D V + + +R + ++ Sbjct: 958 LGRNIMEGVVILHETLHELHRKNNSGVILKIDFEKAYDKVKWSFVQQTLRMKGFSPKWCE 1017 Query: 190 LLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + I+ GH+ +G +G+ QG +SP+L NI+ + + +G Sbjct: 1018 WIVSFIQGGHVGIKVNDQIGDNFQTHKGLRQGDPLSPILFNIVADMLALLIKRAKDNGLL 1077 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + ++ +YAD+ ++ ++ Q + ++ V Sbjct: 1078 NGVIPHL-----------------VDDGLSILQYADNTIIFLEHDLQQAKNLK-LILSVF 1119 Query: 304 EGSLKLRLNMDKTKIPHVND-------------------GFIFLGHRLIRKRSRYGEMRV 344 E L++N K+++ + +LG + ++ + +V Sbjct: 1120 EKLSGLKINFHKSELFCFSQAKDCYDQYSSIFGCKLGSFPVKYLGIPMHFRKLSNNDWKV 1179 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 + + K ++ ++ ++ + +L Sbjct: 1180 IEQRIERKLSSWKGKHMSVGGRLVLINSVL 1209 >UniRef50_A5BKT2 Putative uncharacterized protein n=3 Tax=Vitis vinifera RepID=A5BKT2_VITVI Length = 1429 Score = 190 bits (483), Expect = 7e-47, Method: Composition-based stats. 
Identities = 43/319 (13%), Positives = 112/319 (35%), Gaps = 33/319 (10%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPAR 82 ++ +P E + G PG +G Q+ + + + + + Sbjct: 786 ILERPFTEDEIHGALMEMNGDKAPGPNGFTLAFWQSCWEFIKEEIIEMFKEFYDHSSFLK 845 Query: 83 R------VYIPKS-----NGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 V IPK G RP+ + ++ + + ++ + F Sbjct: 846 SLNNTFLVLIPKKCGAEDLGDFRPISLLGGLYKLPAKVLANRLKIVVGKVVSNSQNAFVR 905 Query: 132 ERSVHHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 R + A + + + + + + D+ +D+++ + L+K +++ ++++ Sbjct: 906 GRQILDASLIANEVIDSWQKRKEKGLICKLDIEKAYDSINWKFLLKVMQKMGFGSKWVGW 965 Query: 191 LWKTIKA---GHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 +W + + + G+ F S+G+ QG +SP L + + D + Sbjct: 966 VWSCLSSAKFSVMVNGVPAGFFPGSKGLRQGDPLSPYLFVMGMEVLDVLIRRAVEGSFLS 1025 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 +R + +++ + DD ++ + K + + E Sbjct: 1026 GCN--------------IRGGSEPPLNISHLFFVDDTIIFYEAKKDHLTHL-SWILFWFE 1070 Query: 305 GSLKLRLNMDKTKIPHVND 323 + LR+N+ K++I V + Sbjct: 1071 ATSGLRINLAKSEIIPVGE 1089 >UniRef50_Q7XPE7 OSJNBa0060N03.14 protein n=6 Tax=Oryza sativa RepID=Q7XPE7_ORYSJ Length = 1784 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. 
Identities = 63/394 (15%), Positives = 129/394 (32%), Gaps = 59/394 (14%) Query: 19 RLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDEL---LSGH 75 L L P E ++ PG DG Q V + L SG+ Sbjct: 1387 NLQELEV-PFSEEEVWKVIKEMPNEKAPGPDGFTGLFYQRCWQVIKGEVLAALTKFHSGN 1445 Query: 76 YQP----LPARRVYIPKSN-----GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLS 126 +Q A +PK + RP+ + ++ + + + P Sbjct: 1446 HQNLDNLNTAVITLLPKKDAPTLIKDYRPISLIHSFSKLATKILASRLAPRMGDLVAENQ 1505 Query: 127 YGFRPERSVHHAIRTVK-LQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 F RS+H V+ L L + +++ D++ FDTV L+ +R R + Sbjct: 1506 TAFIRGRSIHENFIFVRGLALQFHRRKKPMILLKLDITKAFDTVSWCFLLNLLRNRGFGS 1565 Query: 186 RFMTLLWKTIKAG------HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYL 239 R+ + + + + + G+ QG +SPLL ++++ L + Sbjct: 1566 RWRSWIAALLLTSETRILLNGHESDSFKPARGLRQGNPLSPLLFVLVMDALQGLLAKATS 1625 Query: 240 SGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREEC 299 G K + YADD ++ ++ + + A+ Sbjct: 1626 WGLLAKLDTRRSIPNTS-------------------IYADDTIVFLQPIEREATAVNAIL 1666 Query: 300 RGVLEGSLKLRLNMDKTKIPHVND-------------------GFIFLGHRLIRKRSRYG 340 + + + L+ N+ K+ + + +LG L +R Sbjct: 1667 QLFGKAT-GLKTNLSKSALTPIRCDDDVLVGVQQLLGCRVENFPITYLGLPLSLRRPTKA 1725 Query: 341 EMRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 E++ + +K + L + ++R+ +L Sbjct: 1726 EVQPILDQLSKKVAGWKPKLLSPDGRLRLIKSVL 1759 >UniRef50_D2LKK7 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LKK7_RHOVA Length = 365 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. 
Identities = 88/366 (24%), Positives = 133/366 (36%), Gaps = 59/366 (16%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R + L I + L AAR ++ K PG A L E+ L EL G Sbjct: 3 KRHEGLFERIASFKALRAAARTAINGKRKK-PG-----AAAFMANLEREILRLERELRDG 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+P R V I + K R + RDR+V A+ + P++E+ F ++ R + Sbjct: 57 SYRPG--RYVEILVKDPKERLISAAPFRDRVVHHALCAVVCPLFEAGFTDHTFANRTGKG 114 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H AIR L + +V+ D+ YF + H +L RR+I+ R + L+ Sbjct: 115 THKAIR-----LYERYRDNHSYVLRADIFRYFPAIDHEILKAEFRRKIACERTLWLMDLI 169 Query: 195 IKAGHI-----------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + + D+ G+P G + S +N+ LN FD ++ E+ Sbjct: 170 VDCSNSQEPVELHFPGDDLFTPYTRRRGLPIGNLTSQFFANLYLNRFDHWVIEKL----- 224 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 Y RY DDF L RE+ L Sbjct: 225 ---------------------------GAPYVRYVDDFALFHDDPGILAT-WREKIERCL 256 Query: 304 EGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGE-MRVVSTIPQEKARNFAASLTA 362 EG +L+L+ KT I V + FLG L R + R + F L Sbjct: 257 EGR-RLKLHPRKTLILPVAEPSPFLGFELHPGPRRTAKGGRGRRKLLDGNVARFRNRLRG 315 Query: 363 LLWKVR 368 L + R Sbjct: 316 LRDRWR 321 >UniRef50_Q67M30 Group II intron-encoding maturase n=1 Tax=Symbiobacterium thermophilum RepID=Q67M30_SYMTH Length = 248 Score = 188 bits (478), Expect = 3e-46, Method: Composition-based stats. 
Identities = 70/194 (36%), Positives = 103/194 (53%), Gaps = 13/194 (6%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 + + E + A + + G PGVDGV L+ ++ VE + +R+ELL G Y+P P Sbjct: 1 MEQVVARENMLAALKRVERNGG--APGVDGVPTERLRDQIRVEWERIREELLRGTYRPQP 58 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 RRV IPK G R LGIP + DR++Q+A+L + PI++ F SYGFRP R H A+R Sbjct: 59 VRRVEIPKPGGGKRMLGIPTVMDRLIQQALLQVLTPIFDPTFSESSYGFRPGRRGHDAVR 118 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 + + + WV++ DL +FD V+H +LM V RR++D R + L+ Sbjct: 119 KARQYVEE----GYDWVVDMDLEKFFDRVNHDVLMARVARRVTDKRVLRLIR-------G 167 Query: 201 DVGLFRAASEGVPQ 214 V G PQ Sbjct: 168 VVPEIVEIQSGGPQ 181 >UniRef50_B6IMH7 Phage-encoded reverse transcriptase, putative n=2 Tax=Proteobacteria RepID=B6IMH7_RHOCS Length = 356 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. Identities = 88/350 (25%), Positives = 139/350 (39%), Gaps = 54/350 (15%) Query: 18 QRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ 77 L +T E L A K L +L L+ ++++G ++ Sbjct: 6 GGLWDSVTAFENLYGAYLAARKGKRYS------DEVLEFGFGLEEKLFDLQGQMVNGVWR 59 Query: 78 PLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHH 137 P R + + K R + P DR+V A++ +EP+ E F SY R R VH Sbjct: 60 PGRPREFMV--RDPKPRLISAPPFADRVVHHAVVRVIEPVLERRFIFDSYACRKGRGVHT 117 Query: 138 AIRTVKLQLTDCG-ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIK 196 A+ ++ L + E WV++ D+S YF +++H LM + R ISD + + L +K Sbjct: 118 AVDRLQRHLREASCEGGKVWVLKADISKYFASINHGRLMAILGRSISDKKVLWLCRTNLK 177 Query: 197 AGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQR 256 D G+ G+P G + S L +NI L++ D ++ + Sbjct: 178 GYGFDEGV------GIPVGALTSQLFANIYLDQLDHWIKDELGIK--------------- 216 Query: 257 GRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKT 316 Y RY DDFV+ V +KA + A+ + L L LRLN KT Sbjct: 217 ----------------RYVRYMDDFVI-VGHSKADLWALYDAIADFLATKLALRLNR-KT 258 Query: 317 
KIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWK 366 + + G F G+R + V +KAR L AL + Sbjct: 259 TVLPASGGIDFCGYRTWTTHLLPRKRNV------KKARATFRELAALYRR 302 >UniRef50_Q7XUD8 OSJNBa0088A01.7 protein n=2 Tax=Poaceae RepID=Q7XUD8_ORYSJ Length = 1324 Score = 187 bits (475), Expect = 6e-46, Method: Composition-based stats. Identities = 68/390 (17%), Positives = 131/390 (33%), Gaps = 58/390 (14%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQ----- 77 + +P AE + G PG DG ++ + + + H Sbjct: 886 RLEEPFSEAEIRKAINEMPGDKAPGPDGFTGKFFKSCWDIIQVDVIAAFNALHNMRSVHL 945 Query: 78 --PLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 A V IPK +G RP+ + ++ + + + + P+ +S F Sbjct: 946 NLLNSANVVLIPKKDGAEGINDFRPISLIHGFAKLSSKVLAIRLRPLMQSLISPNQSAFI 1005 Query: 131 PERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 RS+H V+ + + R + + D+S FD+V L+ ++ R ++ Sbjct: 1006 RGRSIHDNFMYVRNMVRRYHKTRRPILLFKLDISKAFDSVRWDYLLALLQNRGLPQKWRD 1065 Query: 190 LLWKTIKAG------HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + + + G +G+ QG ISPLL + ++ + L + G Sbjct: 1066 WISGLLSTSTSKITLNGIPGETIRHWKGLRQGDPISPLLFILAIDPLQRLLDKATDIGIL 1125 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 K RGR+ R + YADD + + TK V A + Sbjct: 1126 SKL---------RGRAVRFRTS----------MYADDAAVFINPTKEDVSAFADLLNRFG 1166 Query: 304 EGSLKLRLNMDKTKIPHVN-DGFI------------------FLGHRLIRKRSRYGEMRV 344 + S L N+ K+++ V D +LG L R +++ Sbjct: 1167 KVS-GLCTNLQKSQVAPVRCDNLDLDDILHDTPATRASFPMKYLGLPLSTGRLCKVDLQP 1225 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 + + ++ L + + +L Sbjct: 1226 LYDKSMSRVASWRGRHIGLAGRSTLVKSVL 1255 >UniRef50_A1RKU4 RNA-directed DNA polymerase (Reverse transcriptase) n=4 Tax=root RepID=A1RKU4_SHESW Length = 424 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. 
Identities = 81/360 (22%), Positives = 138/360 (38%), Gaps = 53/360 (14%) Query: 20 LLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPL 79 L I Q E L AA K + + L + +++EL+ G Y Sbjct: 83 LFEQIYQFENLLNAAYQCRKGKTKSN------STLVFFNNLEENIIQIQNELIWGMYLSS 136 Query: 80 PARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAI 139 P Y+ + K R + P+ RDR+V RA+ +EPI + + SY R + H Sbjct: 137 PYHHFYVFEP--KRRLISAPSFRDRVVHRAIYNVIEPILDRQYIYDSYACRRGKGTHRGA 194 Query: 140 RTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAG 198 +L + +T + + ++ D+S YF ++ H +L V +I R LL+ I + Sbjct: 195 DRAQLFIRRVEKTHSKAYALKADISRYFSSIDHHILKSLVSAKIQCERTKCLLFYIIDSS 254 Query: 199 HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGR 258 D A G+P G + S + +N+ LNE D++ + Sbjct: 255 PSD-----AHGVGIPLGNLTSQVFANLYLNELDRFAKHTLKAK----------------- 292 Query: 259 STAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKI 318 Y RY DDFV+I K Q+ R + L+L+ N KT++ Sbjct: 293 --------------NYVRYMDDFVIIHHD-KQQLHQWRVMIERFINCQLRLKTN-SKTQV 336 Query: 319 PHV----NDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 V FLG+R+ + + V + K + F +A ++ + + Sbjct: 337 FPVAASAGRSLDFLGYRIYANKKLLRKSSVKR--IKAKLKIFRKKYSAGEIDIKDINQTI 394 >UniRef50_Q7XE51 Retrotransposon protein, putative, unclassified n=6 Tax=Poaceae RepID=Q7XE51_ORYSJ Length = 1652 Score = 186 bits (471), Expect = 2e-45, Method: Composition-based stats. 
Identities = 58/390 (14%), Positives = 134/390 (34%), Gaps = 55/390 (14%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPA 81 L+++ E + PG DG Q V L H LP Sbjct: 1141 ELLSESFKELEVKEAIFQMEHNKAPGPDGFPAEFYQVFWDVIKDDLMAVFCDFHEGTLPL 1200 Query: 82 RR------VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 R +PK + RP+ + + +I + M + + + F Sbjct: 1201 HRLKFGIITLLPKQKDASRIQQYRPICLLNVSFKIFTKVMANRIALVAQKVIKPSQTTFL 1260 Query: 131 PERSVHHAIRTVKLQLTDCGETRGRWVI-EGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R++ + + L + + + VI + D +D V + L +++R + +++ Sbjct: 1261 SGRNIMEGVVILHETLHELHKKKKNGVILKLDFEKAYDKVDWKFLQQSLRMKGFSSKWCD 1320 Query: 190 LLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + ++ G + ++G + +G+ QG +SP+L N++++ + G+ Sbjct: 1321 WIDSIVRGGSVAVKVNDEIGSYFQTRKGLRQGDPLSPILFNLVVDMLAILIQRAKDQGRF 1380 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 + + ++ +YADD +L + + ++ Sbjct: 1381 KGVVPHL-----------------VDNGLSILQYADDTILFMDHDLDEARDLKLVLSTF- 1422 Query: 304 EGSLKLRLNMDKTKIPHVND-------------------GFIFLGHRLIRKRSRYGEMRV 344 E L++N K+++ F +LG R+ KR + + Sbjct: 1423 EKLSSLKINFYKSELFCYGKAKDVEHEYVKLFGCDTEDYPFKYLGIRMHHKRINNKDWQG 1482 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 V Q+K ++ ++ ++ + +L Sbjct: 1483 VEERIQKKLSSWKGKFLSVGGRLVLINSVL 1512 >UniRef50_A5B2E0 Putative uncharacterized protein n=7 Tax=Vitis vinifera RepID=A5B2E0_VITVI Length = 1875 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 49/302 (16%), Positives = 104/302 (34%), Gaps = 33/302 (10%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR--- 83 P + E + G PG DG Q + + D Q A+ Sbjct: 1170 PFTMEEIHSALMDMNGDKAPGPDGFTGAFWQTCWEFVKEEIMDLFKEFFVQKSFAKSLNT 1229 Query: 84 ---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 V IPK G + RP+ + ++V + + ++ + F R + Sbjct: 1230 TFLVLIPKKGGAEDLGEFRPISLLGGLYKLVAKVLANRLKKVLGKVVSMDQNAFVRGRQI 1289 Query: 136 HHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A + + + + + + D+ +D+++ LMK +++ +R+M +W Sbjct: 1290 LDASLIANEVVDFWYKRKEKGLICKLDIEKAYDSINWNFLMKVLQKMGFGSRWMEWIWWC 1349 Query: 195 I---KAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 I K + G+ F +S+G+ QG +SP L + + + G Sbjct: 1350 ISTAKFSILVNGVPAGFFPSSKGLRQGDPLSPYLFVMGMEVLSALIRRAVGGGFVSGC-- 1407 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 +++ V++ +ADD ++ + K + ++ E + Sbjct: 1408 ------------SLKGRGGLVMEVSHLLFADDTIIFCEAKKEYLTSL-SWILAWFEAASG 1454 Query: 309 LR 310 L Sbjct: 1455 LS 1456 >UniRef50_Q7YAJ6 Putative reverse transcriptase and intron maturase n=1 Tax=Chara vulgaris RepID=Q7YAJ6_CHAVU Length = 550 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 86/359 (23%), Positives = 151/359 (42%), Gaps = 60/359 (16%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPG--VDGVNKTMLQARLAVELQILRDELL 72 + L+ +++ +L + + G TPG +DG+ L +L Sbjct: 137 GKYCNLIEVVSDVSFLIYCYELIRGNPGNMTPGATLDGLTINWFSK--------LSQQLQ 188 Query: 73 SGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESD-------FHTL 125 +G ++P A + + R++IVQ+A+ + ++ I++ Sbjct: 189 AGKFEPRFA----------------LISPREKIVQKALTVVLDSIYDPAEFRIYDPALDC 232 Query: 126 SYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDA 185 S+ + R AI V WVIEGD++ ++ H++++ + I+ Sbjct: 233 SHAPKEARGAKRAIHKVDRTFKSA-----TWVIEGDITKCSASLPHKVILGILEEEIACR 287 Query: 186 RFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVI--SPLLSNIMLNEFDQYLHERYLSGKA 243 +FM+L+ K++ G++D R PQ +PLL NI L++ D+Y++E Sbjct: 288 KFMSLVRKSLSVGYVDEKGKRHHPNRPPQALTFLRAPLLCNITLHQLDKYIYETLKE--- 344 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 E ++Y RYADDFV+ + G K IR+ L Sbjct: 345 ---------------QYDKHEVDPNFRRLSYVRYADDFVIGITGPKTDAIEIRDLISTFL 389 Query: 304 EGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTA 362 +L L LN +KTKI H++ GF FLG ++ R R RY E+R + R ++L Sbjct: 390 -STLGLELNKEKTKISHIDSGFFFLGTQISRGRRRY-EVRSPRLVLHAPIRKLLSTLRE 446 >UniRef50_A6TR85 RNA-directed DNA polymerase (Reverse transcriptase) n=2 Tax=Clostridiales RepID=A6TR85_ALKMQ Length = 360 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. 
Identities = 79/352 (22%), Positives = 132/352 (37%), Gaps = 54/352 (15%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R L I E L A K A L L +++EL+ Sbjct: 2 KRYGYLYEQIYDFENLYFAYLEARKDKRFR------DEILKFSANLEENLIQIQNELIWK 55 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 Y+ R Y+ + K R + +DR+VQ A+ + P++E + SY R R Sbjct: 56 AYKVGRYREFYVHEP--KKRLIMALPFKDRVVQWAIYRVLNPLFEKTYTEHSYACRIGRG 113 Query: 135 VHHAIRTVKLQLTDCGET-RGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWK 193 H A + ++ L + + ++ D+S YF V H + +K +R++I D + L+ + Sbjct: 114 THQAAKKLQYWLRQIDRKPQKYYYLKMDISKYFYRVDHSIALKILRKKIKDKDVLWLMEE 173 Query: 194 TIKAGHIDVGL------------FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSG 241 I++ + GL R +G+P G + S LL+NI LNE DQ+ + Sbjct: 174 IIQSEDMAFGLPLGMEPGDCPKYMRLHDKGMPIGNLTSQLLANIYLNELDQFCKHKLQIK 233 Query: 242 KARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRG 301 + RY DDF+++ K + ++ E Sbjct: 234 -------------------------------YFIRYMDDFIVLHHD-KKYLHRLKVEIEN 261 Query: 302 VLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKA 353 L L+L LN KT I G F+G R+ + + + K Sbjct: 262 FLNSELELHLNR-KTCIRPTPVGIEFVGFRIWPTHMKLKKKTAKKMKRRLKY 312 >UniRef50_B0VI85 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VI85_9BACT Length = 343 Score = 184 bits (466), Expect = 6e-45, Method: Composition-based stats. 
Identities = 74/342 (21%), Positives = 135/342 (39%), Gaps = 47/342 (13%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 R+ L +T + L A + K + L L+ EL++G Sbjct: 3 KRVGYLWEKLTSWQNLYLAYKNACKHKKSK------YETAEWMFYCEKNLWELQKELING 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +Y+P P R I K R + + RDR+V +++ +EP +ES F SY R + Sbjct: 57 NYRPQPYRYFTI--KEPKERLISVAVFRDRLVHHSLINVIEPYFESIFIKDSYATRKGKG 114 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 +H A+ V+ + W ++ D+ +F+ + H +L+K + +I D + L Sbjct: 115 LHLAVLAVQKY-----SRQYPWFLKLDIEKFFNNIDHNILLKLISSKIKDPMIINLCSII 169 Query: 195 IKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSI 254 +K ++ + G+P G + S +NI LN+ D Y+ + Sbjct: 170 LKNQNLSMN--HNEEIGLPVGNLTSQFFANIYLNQLDHYIKQNLGYKG------------ 215 Query: 255 QRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMD 314 Y RY DDF ++ K ++++ + L LKL++ Sbjct: 216 -------------------YVRYMDDF-ILFSENKDKLKSDLLLIKYFLSNILKLKIKDK 255 Query: 315 KTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNF 356 ++ VN G FLG+R+ K R + + + + R Sbjct: 256 SIQMNKVNQGIPFLGYRVFPKLIRVSNINLKRCLQNMQKREK 297 >UniRef50_Q35064 Atp9 intron ORF n=1 Tax=Marchantia polymorpha RepID=Q35064_MARPO Length = 710 Score = 183 bits (464), Expect = 9e-45, Method: Composition-based stats. 
Identities = 93/378 (24%), Positives = 146/378 (38%), Gaps = 80/378 (21%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 I + L + KG PG+DG K L+ L EL Y P P Sbjct: 260 YEDIYNIDNLRAGYKRL---KGNVAPGIDGRTKAD---MTDKALEKLSKELRRQAYAPKP 313 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 A+R+ I K +G RPL I + D++VQ + +EP +ES F S+GFRP RS H A+R Sbjct: 314 AKRIIITKPDGGSRPLSIASTVDKVVQSTLKELVEPHFESLFRDSSHGFRPGRSCHKALR 373 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 ++ T W+++ D+ FD +HH LL+K + + L+ K + AG+I Sbjct: 374 DLR-----YSWTALTWLVQIDIKKDFDKIHHDLLIKEMESVLRSKALQDLMRKLLNAGYI 428 Query: 201 DV----GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHER----YLSGKARKDRWYWNN 252 DV + +EGV QG +ISPL +NI L++ D Y+ + Y G R + Sbjct: 429 DVYNLTDRTQYNTEGVTQGSIISPLCANIFLHKLDCYVEDILIPNYNVGNMRPASAEYKK 488 Query: 253 SIQRGRSTA-----VRENWQWKPAVAYCRYAD----------------DFVLIVKGTK-- 289 + E Q + + ++ + + + + K Sbjct: 489 RLNIHSKDKAFFKYYTELEQAIKNIKHLKWINREQQKKSILVKKKYFFENLFFFRNPKVS 548 Query: 290 -------------------------------------AQVEAIREECRGVLEGSLKLRLN 312 IR+ + L+ LKL +N Sbjct: 549 CPLGRRTLLEMAEKEGLKRLKYLRYADNIILGVIGSKQDALDIRKAVQNFLQEELKLDIN 608 Query: 313 MDKTKIPHVN-DGFIFLG 329 K+KI H + +LG Sbjct: 609 EQKSKILHAKSEMAKYLG 626 >UniRef50_Q2QNF1 Retrotransposon protein, putative, unclassified n=6 Tax=Oryza sativa Japonica Group RepID=Q2QNF1_ORYSJ Length = 1432 Score = 183 bits (464), Expect = 9e-45, Method: Composition-based stats. 
Identities = 69/418 (16%), Positives = 134/418 (32%), Gaps = 64/418 (15%) Query: 1 MQRKLATWAATDPSLRIQRL------LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKT 54 Q ++ + Q L L ++ E S +PG DG Sbjct: 676 FQMIMSEPNSDSRQFNFQHLKLNTADLSMLDDQFSENEVWEAIKSLPNEKSPGPDGYTAL 735 Query: 55 MLQARLAVELQIL---RDELLSGHYQP----LPARRVYIPKSN-----GKLRPLGIPALR 102 Q + + ++ G+ Q A IPK + RP+ + Sbjct: 736 FYQKCWDIIKGDIMKAFEKFCRGNSQNLEMLNTAVITLIPKKDSPTLLKDYRPISLIHSF 795 Query: 103 DRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWV-IEGD 161 ++ + M + P F RS+H VK + + + ++ D Sbjct: 796 AKLAAKVMAQRLAPRMNELVPYTQNAFIRGRSIHENFIFVKGLVQQYHKQHKEMILLKLD 855 Query: 162 LSSYFDTVHHRLLMKAVRRRISDARFMTLLWK-TIKAG-----HIDVGLFRAASEGVPQG 215 +S FDTV L+ ++ R A++ L + A + + + G+ QG Sbjct: 856 ISKAFDTVSWCFLLDMLKWRGFGAKWRLWLVSLFLTAETNILINGNESNPFKPARGLRQG 915 Query: 216 GVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC 275 +SPLL + ++ + + SG + +P + Sbjct: 916 DPLSPLLFVLAMDALQAVVAQAKASGLLSEPAPR-------------------RPVPSIS 956 Query: 276 RYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPHVNDG----------- 324 YADD VL K ++ + + ++ + S L N +KT I + Sbjct: 957 IYADDAVLFFKPSQQEAKVVKAILQIFGAAS-GLMTNYNKTAITPIQCSQEQLQVVADEL 1015 Query: 325 --------FIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISGEIL 374 I+LG L ++ E++ + K + L + ++ + +L Sbjct: 1016 QCNIQLFPIIYLGLPLSTRKPTKAEVQPILDKLANKVAGWKPKLLSPDGRLCLIKSVL 1073 >UniRef50_B8GRZ4 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GRZ4_THISH Length = 377 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. 
Identities = 86/359 (23%), Positives = 139/359 (38%), Gaps = 56/359 (15%) Query: 14 SLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS 73 R +RL+ I + L EA R+ K D +A L EL L+ E+L Sbjct: 32 GKRHKRLIEAIVDWDNLQEAHRLARRGKR------DRHEVATFEANLWEELGALQMEMLW 85 Query: 74 GHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPER 133 G YQP R + + K R + RDR+ Q A+ PIW++ SY RP + Sbjct: 86 GSYQPGRYRSFLVYEP--KRREILAAPYRDRVAQHAICTLCGPIWDAAMIDDSYACRPGK 143 Query: 134 SVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLW 192 H V+ L WV++ D+S YF ++ H L VR +IS + L+ Sbjct: 144 GTHVGATRVEQWLRGMTAAGGAVWVVKMDVSKYFASIRHDLAKAVVRDKISCPATLQLID 203 Query: 193 KTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNN 252 I + G+P G ++S ++N++ N DQ+ Sbjct: 204 AIIDS---TADPADPDPVGIPVGNLLSQWIANLVGNRIDQWAKRELRLK----------- 249 Query: 253 SIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLN 312 Y RY DD V++V+ TK + IR++ L S+ +R + Sbjct: 250 --------------------RYARYMDDMVVLVR-TKQEALTIRDQFDDKL-ASMGMRFS 287 Query: 313 MDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQEKARNFAASLTALLWKVRISG 371 K + + G FLG+R+ + + ++ R +L A+ W+ G Sbjct: 288 --KASVLPASRGVNFLGYRIWAHK---------RLLRRDSVRRIKRNLKAMRWQYARGG 335 >UniRef50_A5AV27 Putative uncharacterized protein n=9 Tax=Vitis vinifera RepID=A5AV27_VITVI Length = 1887 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. 
Identities = 57/389 (14%), Positives = 119/389 (30%), Gaps = 51/389 (13%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPA 81 + +P E R G PG DG + Q + + + H Sbjct: 1228 EGLEKPFTEEEVFRALSGCCGEKAPGPDGFSMAFWQFSWDFVKEEVMNFFRQFHETGSFV 1287 Query: 82 RR------VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 R V IPK G RP+ + + + + + M+ + T F Sbjct: 1288 RSLNATFLVLIPKKGGAEDLKDFRPISLVGGLYKWLAKVLANRMKGVLAKVISTSQNAFV 1347 Query: 131 PERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R + A+ + + RG + + D+ +D V L+ + + R+ Sbjct: 1348 EGRQIMDAVLVANEAIDSIVKSNRGAILCKLDIEKAYDHVDXDFLLAVMEKMGFGERWCR 1407 Query: 190 LLWKTIKA---GHIDVG---LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKA 243 + + + G F +S G+ QG +SP L +++ F + + G Sbjct: 1408 WIKWCLSTVRYSVMVNGSPTGFFQSSRGLRQGDPLSPYLFVVVMEAFSVLIKKAVAGGFL 1467 Query: 244 RKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVL 303 +R +++ +ADD ++ + + Q+ Sbjct: 1468 APC--------------LIRGRRGEGVQISHLLFADDTLIFCEAKEDQLLY-MGWLLMWF 1512 Query: 304 EGSLKLRLNMDKTKIPHVND------------------GFIFLGHRLIRKRSRYGEMRVV 345 E LR+N++K+++ V +LG L + Sbjct: 1513 EAISGLRVNLEKSELIPVGRVENVDELADEFGYRVGKLPSTYLGMPLGAPFKSVAAWDGI 1572 Query: 346 STIPQEKARNFAASLTALLWKVRISGEIL 374 ++K + + ++ + L Sbjct: 1573 EERFRKKLAMWKRQYISKGGRITLVRSTL 1601 >UniRef50_C0FSR2 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSR2_9FIRM Length = 273 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. 
Identities = 74/314 (23%), Positives = 128/314 (40%), Gaps = 45/314 (14%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLP 80 L + E L EA +H G+DGV + L+A + +++ + +G Y+ Sbjct: 3 LEDVFSDENLEEAFESFADKHDSH--GLDGVKLSELRAYWETNGKKIKESIFNGTYKVGA 60 Query: 81 ARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIR 140 + I GK R + + DR + RA+ M WE F SY ++ + V A+ Sbjct: 61 VEQRQIVNRKGKKRTISLMNSIDRFIFRALYQKMASEWEKQFSQYSYAYQNNKGVLTAVE 120 Query: 141 TVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI 200 + + W +E D+ ++FD ++H +++ ++ I D R + LL + + Sbjct: 121 QAAKYMEE----GKDWSVELDIQNFFDNINHSIIISKLKAGIEDVRVLDLLIAYLTCTLL 176 Query: 201 DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRST 260 D +F +GV QGG +SPLL+N+ +NE D Y+ Sbjct: 177 DDHVFHQMEQGVLQGGPLSPLLANVYMNELDHYME------------------------- 211 Query: 261 AVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKLRLNMDKTKIPH 320 K ++CR+ DD + T + + +E +L LN KT I Sbjct: 212 --------KQGYSFCRFGDDINIYC-STYEEATVAFSDVTARMEKIEQLPLNHGKTGIF- 261 Query: 321 VNDGFI--FLGHRL 332 G +LG+R Sbjct: 262 --KGINRKYLGYRF 273 >UniRef50_A5ALM2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5ALM2_VITVI Length = 954 Score = 182 bits (463), Expect = 2e-44, Method: Composition-based stats. 
Identities = 56/389 (14%), Positives = 117/389 (30%), Gaps = 51/389 (13%) Query: 23 LITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPAR 82 + + E G PG DG + + Q + H + R Sbjct: 568 RLEEAFIEEEVFSALSDMNGDKAPGPDGFSLSFWQFNWEFVKVEVMGFFKEFHERGRFVR 627 Query: 83 R------VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 V IPK G + + +++ + + ++ + + F Sbjct: 628 SLNSTFLVLIPKKAGAEDLRDFXQISLVGGLYKLLAKVLANRLKKVMGKVVSSAQNAFVE 687 Query: 132 ERSVHHAIRTVKLQLTDCGETRGRWV-IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 R + A + +++ V + D+ +D ++ L+ +RR R++ Sbjct: 688 GRQILDAALIANEAIDSMLKSKENGVLCKLDIEKAYDHLNWNFLLSVLRRMGFGERWIGW 747 Query: 191 LWKTIKAGHIDV------GLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 + I V F S + QG +SP L I + F + +H G Sbjct: 748 ISWCISTATFSVLINDTPEGFFNKSRXLRQGDPLSPYLFVIGMEAFSRLIHRAMRGGFLS 807 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 + V++ +ADD ++ ++ Q+ + E Sbjct: 808 GCKINGRRGDGTL--------------VSHLLFADDTLVFCDSSQDQMTYL-SWLLMWFE 852 Query: 305 GSLKLRLNMDKTKIPHVND------------------GFIFLGHRLIRKRSRYGEMRVVS 346 LR+N+DK++I V +LG L V Sbjct: 853 ALSGLRINLDKSEILPVGRVENLELLALEIGCKVGRLPTSYLGIPLGANHKSVAVWDGVE 912 Query: 347 TIPQEKARNFAASLTALLWKVRISGEILL 375 +++ + + ++ + E L+ Sbjct: 913 ERFRKRLAKWKRQFISKGGRMTLIPEYLI 941 >UniRef50_A5CBN4 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5CBN4_VITVI Length = 1420 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. 
Identities = 54/312 (17%), Positives = 109/312 (34%), Gaps = 33/312 (10%) Query: 27 PEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARR--- 83 P E G PG DG QA + + + + Q A+ Sbjct: 1120 PFXEEEIFSALXDMNGDKAPGPDGFTVAFWQACWDFVKEEVVELFKXFYDQKSFAKSLNS 1179 Query: 84 ---VYIPKSN-----GKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 V IPK G+ R + ++ + + ++ + + F R + Sbjct: 1180 TFLVIIPKKGGAEDLGEFRLINXLGGLYKLXAKVLANRLKMVXDXVVSABQNAFVRGRQI 1239 Query: 136 HHAIRTVKLQLTDCGETRGR-WVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A + + + + V + D+ +D++ LMK +++ +R+M +W Sbjct: 1240 LDASLIANEVVDYWQKRKEKGLVCKLDIEKAYDSISWNFLMKVLKKMGFGSRWMEWMWWC 1299 Query: 195 I---KAGHIDVGL---FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRW 248 K + G+ F ++S+G+ QG ISP L + + R Sbjct: 1300 FSTAKFSVLINGVPEGFFSSSKGLRQGDPISPYLFILGKEVLSALIRRAVQGNFISGCR- 1358 Query: 249 YWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLK 308 +R + V++ +ADD +L + +K Q+ + E + Sbjct: 1359 -------------LRGRGDAEIMVSHLLFADDTILFCEASKDQLTHL-GWILAWFEAASG 1404 Query: 309 LRLNMDKTKIPH 320 LR+N+ K+++ Sbjct: 1405 LRINLAKSELIS 1416 >UniRef50_Q01HH7 OSIGBa0142I02-OSIGBa0101B20.12 protein n=9 Tax=Oryza sativa RepID=Q01HH7_ORYSA Length = 1230 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. 
Identities = 58/318 (18%), Positives = 107/318 (33%), Gaps = 39/318 (12%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQI-------LRDELLSGHY 76 + P AE PG DG ++ + L D + Sbjct: 692 LDDPFTKAEIHEAIKEMPIDKAPGPDGFTGKFFKSCWDIIKNDVVAAFNTLHDSRNTHFN 751 Query: 77 QPLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRP 131 A V IPK G R + + +I + + + + P+ +S T F Sbjct: 752 LLNSANVVLIPKKEGAEGIGDYRLISLVHGFGKIFSKVLAIRLRPLMQSLIPTNQSAFIC 811 Query: 132 ERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTL 190 RS+H V+ + R + + D+S FD+V L+ ++ R ++ Sbjct: 812 GRSIHDNFMYVRNMVRKYHRTRRPILLFKLDISKAFDSVRWDYLLSLLQNRGFPPKWREW 871 Query: 191 LWKTIKAG------HIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 + + A + G +G+ QG +SPLL + ++ + L + G Sbjct: 872 ITGLLSASTSKIILNGTPGEAIKHGKGLRQGDPLSPLLFILAIDPLQRLLDKATELGAIS 931 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 K RG++ R + YADD + + K V+A + Sbjct: 932 KL---------RGKAVRFRTS----------LYADDAAIFINPNKEDVKAFTDLLGRFGH 972 Query: 305 GSLKLRLNMDKTKIPHVN 322 + L N+ K+ + + Sbjct: 973 AT-GLCTNLQKSHVAPIR 989 >UniRef50_A5ZQ10 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZQ10_9FIRM Length = 379 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. 
Identities = 75/363 (20%), Positives = 143/363 (39%), Gaps = 56/363 (15%) Query: 16 RIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGH 75 +I+ + LI + L EA + +SKG + +Q + ++ ++ ++ SG Sbjct: 2 KIKHVFDLIFSDDNLYEAIQD--ASKGRRY----NKDVLRVQHDIWNVIEQIQQDVRSGK 55 Query: 76 YQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSV 135 Y Y+ + K R + RIVQ A+ + P+ + +YG P R Sbjct: 56 YTIDKYYIFYVYEP--KKRMIMSITFYHRIVQWAIYRVINPLLVKGYIKDTYGCIPGRGS 113 Query: 136 HHAIRTVKLQLTDCGETRGRWV-IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 A++ ++ + G W ++ D+S YF + H +L + + R+I D + + +L+ Sbjct: 114 LAAMQRLRYWIKSVEHKPGTWYYLKLDISKYFYRISHEVLKEILARKIKDQQLLQVLYNI 173 Query: 195 IKAGHIDVG------------LFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 I + G R G+P G ++S + +NI L+ DQ+ Sbjct: 174 IDCQYTPFGLPPGKGPGEVPLEERLYDVGMPVGNLLSQVFANIYLDALDQFCKRTLCIHF 233 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 Y RY DD +++ +K Q+ ++E + Sbjct: 234 -------------------------------YVRYMDDIIIL-SDSKEQLHMWKDEIQKF 261 Query: 303 LEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKR--SRYGEMRVVSTIPQEKARNFAASL 360 +E +L+L LN KT I ++ G F+G+R+ R + + K + A L Sbjct: 262 VETTLRLSLN-QKTCIRPISQGIEFVGYRIWPHYVTIRKSTTLEMKRHLRRKVEEYNAGL 320 Query: 361 TAL 363 + Sbjct: 321 IEM 323 >UniRef50_A5BR45 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BR45_VITVI Length = 1670 Score = 180 bits (458), Expect = 5e-44, Method: Composition-based stats. 
Identities = 55/390 (14%), Positives = 109/390 (27%), Gaps = 54/390 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS----GHY 76 L ++ E + A G PG DG V + G + Sbjct: 985 LEVMFSEEEIFAALSSFC---GDKAPGPDGFTMAFWLFCWDVIKPEILGLFREFYLHGTF 1041 Query: 77 Q--PLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 Q + IPK G RP+ + +++ + + ++ + + F Sbjct: 1042 QRSLNSTFLLLIPKKEGTEDLRDFRPISLVGSVYKLLAKVLANXLKSVMGEVISDXQHAF 1101 Query: 130 RPERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 R + + L + +++ D+ FD V LM + + R++ Sbjct: 1102 VHGRQILDXVLIANEALDSRLKGNNPGLLLKMDIEKAFDHVKWDFLMDVMSKMGFGHRWI 1161 Query: 189 TLLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 + F +S G+ QG +SP L + Q L G Sbjct: 1162 KWMNWCCSTATFSILINGSPSGFFRSSRGLRQGDPLSPYLFLFAMEALSQLLSCARNGGF 1221 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 S + +ADD ++ Q++ + Sbjct: 1222 ISGFXVGGRGRXGLLVS--------------HLLFADDTLIFCDAEADQLQYL-SWTFMW 1266 Query: 303 LEGSLKLRLNMDKTKIPHVNDGFI------------------FLGHRLIRKRSRYGEMRV 344 E L++N+ KT+ V +G +LG L Sbjct: 1267 FEAISGLKVNLSKTEAIPVGEGIPMETLASVLGCKIGSLPTSYLGLPLGAPYKSTRVWDA 1326 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 V +++ + + ++ + L Sbjct: 1327 VEERFRKRLSLWKRQYLSKGGRLTLLKSTL 1356 >UniRef50_A5ARA6 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5ARA6_VITVI Length = 1308 Score = 180 bits (458), Expect = 6e-44, Method: Composition-based stats. 
Identities = 51/314 (16%), Positives = 112/314 (35%), Gaps = 32/314 (10%) Query: 22 RLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPA 81 ++ P E + G PG DG + Q + + + H Q Sbjct: 864 EMLEAPFSEGEVQSALMEMNGDKAPGPDGFSVFFWQCCWDFVKEEILEMFKEFHDQNTFL 923 Query: 82 RR------VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFR 130 + V IPK G RP+ + +++ + + ++ + + F Sbjct: 924 KSINNTFLVLIPKKGGAEDFGDFRPISLLGGLYKLLAKVLANRLKKVIDKVVSHDQNAFV 983 Query: 131 PERSVHHAIRTVKLQLTDC-GETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMT 189 R + A + + + + + D+ +D ++ + L+ +++ ++ Sbjct: 984 KGRQILDASLIANEVIDNWXKKGBKGXICKLDIEKAYDNINWQFLL-----KVNGKKWDL 1038 Query: 190 LLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWY 249 L K + F ++S+G+ QG +SP L + + + G + Sbjct: 1039 ALTKFSVMVNGTPAGFFSSSKGLRQGDPLSPYLFVMGMEVLSVLIRRXMXGGFISGCK-- 1096 Query: 250 WNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEGSLKL 309 IQR R AV +A+ +ADD ++ + K + + E + L Sbjct: 1097 ----IQRDRGRAVH--------IAHLLFADDTIVFXEAKKEYLTNL-SWILFWFEAASGL 1143 Query: 310 RLNMDKTKIPHVND 323 R+N+ K+++ V + Sbjct: 1144 RINLAKSEVIPVGE 1157 >UniRef50_C0A8Z3 RNA-directed DNA polymerase (Reverse transcriptase) n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A8Z3_9BACT Length = 358 Score = 180 bits (458), Expect = 6e-44, Method: Composition-based stats. 
Identities = 80/341 (23%), Positives = 134/341 (39%), Gaps = 57/341 (16%) Query: 15 LRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSG 74 + + L + E L AA + P G A L + LRDELL+G Sbjct: 3 RKHRHLFEKVITLENLFAAAENASRGRSGKVPVARGF------AELEKTVVTLRDELLAG 56 Query: 75 HYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERS 134 +QP I ++ K R + RDR+V A++ +EPI+E F S+ RP + Sbjct: 57 TWQPGRYYYFTI--TDPKEREVAAAPFRDRVVHHALVRVLEPIFEPRFIADSFACRPGKG 114 Query: 135 VHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKT 194 H A+ + R R+ ++ D+ YF + H LL++ V R + DAR + L+ + Sbjct: 115 THAALARAREFTR-----RHRYCLKCDIKKYFPNIDHALLLREVGRAVDDARVLELIGRI 169 Query: 195 IKA---GHIDVGL-------FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKAR 244 + + G G+P G + S L+N+ L+ D ++ + Sbjct: 170 LASHADGAAQEWRAGAGLFDVEQRPRGLPIGNLTSQFLANVHLHPLDLFVKQTLRVKG-- 227 Query: 245 KDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLE 304 Y RY DDF+L +A ++A + R + Sbjct: 228 -----------------------------YVRYVDDFLLFG-DDRAALKAHGQRVREFVR 257 Query: 305 GSLKLRLNMDKTKIPHVNDGFIFLGHR-LIRKRSRYGEMRV 344 +L+LR++ DK ++ G F+G R R + V Sbjct: 258 -TLRLRVHPDKFRLSRTEQGVDFVGFVAFPDGRIRVRDSNV 297 >UniRef50_A5ATF6 Putative uncharacterized protein n=5 Tax=Vitis vinifera RepID=A5ATF6_VITVI Length = 1742 Score = 180 bits (456), Expect = 8e-44, Method: Composition-based stats. 
Identities = 50/318 (15%), Positives = 105/318 (33%), Gaps = 33/318 (10%) Query: 24 ITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQIL---RDELLSGHYQPLP 80 + P E LS G PG DG++ Q + E Sbjct: 831 LESPFTEEEVFNALLSCNGDKAPGPDGLSMAFWQFAWDFVKADVLCFFKEFYENGKFVKS 890 Query: 81 ARR---VYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPE 132 V IPK G RP+ + + + + + ++ + F Sbjct: 891 LNATFLVLIPKKVGAEDLGDFRPISLVGSLYKWLAKVLANRLKKVVGKVISKAQGAFVEG 950 Query: 133 RSVHHAIRTVKLQLTDCGETRGRWV-IEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLL 191 R + A+ + + + + D+ +D V ++ +++ +++ + Sbjct: 951 RQILDAVLIANEAIDSTLKNNESAILCKLDIEKAYDNVDWTFILTVMQKMGFGEKWIRWI 1010 Query: 192 WKTIKAGHIDVGL------FRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARK 245 I V + F +S+G+ QG +S L I + F +L G Sbjct: 1011 KWCISTASFSVLVNGTPTGFFQSSKGLRQGDPLSXYLFVIAMEVFSAFLQRAVEGGYLSG 1070 Query: 246 DRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGVLEG 305 R V+ + +++ +ADD ++ K ++ + + E Sbjct: 1071 CR--------------VKGRSEEGALISHLLFADDTLVFCKPSQDHLTHL-SWLLMWFEA 1115 Query: 306 SLKLRLNMDKTKIPHVND 323 + LR+N+DK+++ V Sbjct: 1116 ASGLRINLDKSELIPVGR 1133 >UniRef50_UPI0001983C14 PREDICTED: hypothetical protein n=1 Tax=Vitis vinifera RepID=UPI0001983C14 Length = 1169 Score = 179 bits (453), Expect = 2e-43, Method: Composition-based stats. 
Identities = 57/390 (14%), Positives = 116/390 (29%), Gaps = 54/390 (13%) Query: 21 LRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLS----GHY 76 L ++ E + A G PG DG V + G + Sbjct: 412 LEVMFSEEEIFAALSSFC---GDKAPGPDGFTMAFWLFCWDVVKPEIIGLFREFYLHGTF 468 Query: 77 Q--PLPARRVYIPKSNG-----KLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGF 129 Q + IPK G RP+ + +++ + + ++ + + F Sbjct: 469 QRSLNSTFLLLIPKKEGTEDLKDFRPISLVGSVYKLLAKVLANRLKTVMGEVISDSQHAF 528 Query: 130 RPERSVHHAIRTVKLQLTDCGE-TRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFM 188 R + A+ L + +++ D+ FD V+ LM+ + + R++ Sbjct: 529 VHGRQILDAVLIANEALDSRLKDNIPGLLLKMDIEKAFDHVNWNFLMEVMSKMGFGHRWI 588 Query: 189 TLLWKTIKAGHI------DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGK 242 + A F +S G+ QG +SP L + + Q L Sbjct: 589 NWIKWCCSATSFSILINGSPSGFFRSSRGLRQGDPLSPYLFLLAMEALSQLLSRARNGNF 648 Query: 243 ARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECRGV 302 R V V++ +ADD ++ Q++ + Sbjct: 649 ISGFR--------------VGGRGSEGLVVSHLLFADDTLIFCDADADQLQYL-SWTFMW 693 Query: 303 LEGSLKLRLNMDKTKIPHVNDGF------------------IFLGHRLIRKRSRYGEMRV 344 E L++N++KT+ V + +LG L Sbjct: 694 FEAISGLKVNLNKTEAIPVGEDIPMETLAAVLGCKIGSLPTSYLGLPLGAPYKSIRVWDA 753 Query: 345 VSTIPQEKARNFAASLTALLWKVRISGEIL 374 V +++ + + ++ + L Sbjct: 754 VEERFRKRLSLWKRQYLSKGGRLTLLKSTL 783 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.137 0.354 Lambda K H 0.267 0.0420 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,877,334,762 Number of Sequences: 3077464 Number of extensions: 65422368 Number of successful extensions: 227647 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 1885 Number of HSP's successfully gapped in prelim test: 2525 Number of HSP's that attempted gapping in prelim test: 214937 Number of HSP's gapped (non-prelim): 6093 length of query: 376 length of database: 1,040,396,356 effective HSP length: 
130
effective length of query: 246
effective length of database: 640,326,036
effective search space: 157520204856
effective search space used: 157520204856
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 94 (40.8 bits)