BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (199 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46897 Uncharacterized protein ygcH n=10 Tax=Enterobact... 412 e-114 UniRef50_D0FPP5 CRISPR-associated protein, Cse3 family n=2 Tax=E... 191 2e-47 UniRef50_Q74DC7 CRISPR-associated protein, CT1974 family n=2 Tax... 160 2e-38 UniRef50_C5SD47 CRISPR-associated protein, Cse3 family n=1 Tax=A... 139 8e-32 UniRef50_B2N0R4 Putative uncharacterized protein n=1 Tax=Escheri... 105 6e-22 UniRef50_A1SV70 CRISPR-associated protein, Cse3 family n=2 Tax=G... 103 3e-21 UniRef50_Q2JWC6 CRISPR-associated protein, Cse3 family n=2 Tax=C... 89 1e-16 UniRef50_A5GBK0 CRISPR-associated protein, Cse3 family n=1 Tax=G... 83 5e-15 UniRef50_B8IZA5 CRISPR-associated protein, Cse3 family n=1 Tax=D... 80 3e-14 UniRef50_Q2FNT6 CRISPR-associated protein, CT1974 n=1 Tax=Methan... 79 1e-13 UniRef50_D0Y917 CRISPR-associated protein, Cse3 family n=2 Tax=D... 79 1e-13 UniRef50_B6WQ61 Putative uncharacterized protein n=1 Tax=Desulfo... 79 1e-13 UniRef50_A6W167 CRISPR-associated protein, Cse3 family n=1 Tax=M... 78 2e-13 UniRef50_Q53WG9 Putative uncharacterized protein TTHB192 n=1 Tax... 75 9e-13 UniRef50_Q1J366 CRISPR-associated protein, CT1974 n=2 Tax=Deinoc... 74 2e-12 UniRef50_B7KJ27 CRISPR-associated protein, Cse3 family n=1 Tax=C... 74 2e-12 UniRef50_Q314I5 CRISPR-associated protein, CT1974 n=2 Tax=Desulf... 74 3e-12 UniRef50_D1A6Q6 CRISPR-associated protein, Cse3 family n=5 Tax=A... 72 1e-11 UniRef50_Q0W583 Predicted CRISPR-associated protein n=1 Tax=uncu... 69 1e-10 UniRef50_A0LM55 CRISPR-associated protein, Cse3 family n=1 Tax=S... 68 2e-10 UniRef50_Q12YA7 CRISPR-associated protein n=1 Tax=Methanococcoid... 68 2e-10 UniRef50_B8IMR1 CRISPR-associated protein, Cse3 family n=3 Tax=A... 68 2e-10 UniRef50_D1CAI9 CRISPR-associated protein, Cse3 family n=1 Tax=S... 67 2e-10 UniRef50_B8GIV2 CRISPR-associated protein, Cse3 family n=1 Tax=M... 67 2e-10 UniRef50_B4TTX1 Crispr-associated protein, Cse3 family n=15 Tax=... 66 7e-10 UniRef50_C6C417 CRISPR-associated protein, Cse3 family n=4 Tax=E... 65 1e-09 UniRef50_A9GV72 Putative uncharacterized protein ygcH n=1 Tax=So... 64 4e-09 UniRef50_C1DSI0 CRISPR-associated protein, CT1974 n=3 Tax=Pseudo... 63 5e-09 UniRef50_B4RSK4 CRISPR-associated protein, Cse3 family n=5 Tax=G... 63 7e-09 UniRef50_A5UR13 CRISPR-associated protein, Cse3 family n=1 Tax=R... 62 1e-08 UniRef50_Q67RN9 Putative uncharacterized protein n=1 Tax=Symbiob... 62 1e-08 UniRef50_Q2RY20 CRISPR-associated protein, CT1974 n=1 Tax=Rhodos... 60 3e-08 UniRef50_C1XG03 CRISPR-associated protein, Cse3 family n=1 Tax=M... 60 5e-08 UniRef50_Q04QB6 Putative uncharacterized protein n=2 Tax=Leptosp... 59 7e-08 UniRef50_C6WMQ8 CRISPR-associated protein, Cse3 family n=1 Tax=A... 58 2e-07 UniRef50_B5GAA2 Crispr-associated protein n=1 Tax=Streptomyces s... 58 2e-07 UniRef50_B6IWM2 CRISPR-associated protein, CT1974 family n=1 Tax... 57 3e-07 UniRef50_D1CGD5 CRISPR-associated protein, Cse3 family n=1 Tax=T... 57 3e-07 UniRef50_Q03C59 CRISPR-associated protein n=3 Tax=Lactobacillus ... 57 4e-07 UniRef50_B1LQ79 CRISPR-associated protein, Cse3 family n=54 Tax=... 57 5e-07 UniRef50_D2RB03 CRISPR system CASCADE complex protein CasE n=4 T... 56 6e-07 UniRef50_B8FDH8 CRISPR-associated protein, Cse3 family n=1 Tax=D... 56 7e-07 UniRef50_D1NTI2 CRISPR-associated protein, Cse3 family n=1 Tax=B... 56 8e-07 UniRef50_C7MQD7 CRISPR-associated protein, Cse3 family n=1 Tax=S... 55 1e-06 UniRef50_A1ARH5 CRISPR-associated protein, Cse3 family n=3 Tax=B... 53 6e-06 UniRef50_B6B784 CRISPR-associated protein, Cse3 family n=1 Tax=R... 53 7e-06 UniRef50_B6XT65 Putative uncharacterized protein n=2 Tax=Bifidob... 52 9e-06 UniRef50_C2BET7 CRISPR-associated protein n=1 Tax=Anaerococcus l... 52 1e-05 UniRef50_A9HLC4 CRISPR-associated protein, Cse3 family n=1 Tax=G... 52 1e-05 UniRef50_C0VRW4 CRISPR-associated protein n=1 Tax=Corynebacteriu... 51 2e-05 UniRef50_C2KP48 Putative uncharacterized protein n=1 Tax=Mobilun... 51 3e-05 UniRef50_A8LYZ8 CRISPR-associated protein, Cse3 family n=1 Tax=S... 51 3e-05 UniRef50_C7MTM6 CRISPR-associated protein, Cse3 family n=1 Tax=S... 50 4e-05 UniRef50_B1VIX9 CRISPR-associated protein n=6 Tax=Actinomycetale... 50 4e-05 UniRef50_Q0BSC8 Putative uncharacterized protein n=1 Tax=Granuli... 50 5e-05 UniRef50_Q47PI8 CRISPR-associated protein, Cse3 family n=1 Tax=T... 49 8e-05 UniRef50_C2CRP4 Putative uncharacterized protein n=1 Tax=Coryneb... 48 2e-04 UniRef50_A8M405 CRISPR-associated protein, Cse3 family n=3 Tax=A... 48 2e-04 UniRef50_D1A5U1 CRISPR-associated protein, Cse3 family n=2 Tax=A... 48 2e-04 UniRef50_C7MTL5 CRISPR-associated protein, Cse3 family n=1 Tax=S... 48 2e-04 UniRef50_D1YEE5 CRISPR system CASCADE complex protein CasE n=1 T... 47 3e-04 UniRef50_C9M2Y7 CRISPR-associated protein n=3 Tax=Lactobacillus ... 47 3e-04 UniRef50_C5V9N5 CRISPR-associated protein, Cse3 family n=1 Tax=C... 47 3e-04 UniRef50_Q0RTG6 Putative uncharacterized protein n=1 Tax=Frankia... 47 4e-04 UniRef50_Q2JH26 Putative uncharacterized protein n=1 Tax=Frankia... 46 6e-04 UniRef50_B0LU87 CRISPR-associated protein Cas3 n=2 Tax=Streptomy... 46 7e-04 UniRef50_A8SDR6 Putative uncharacterized protein n=1 Tax=Faecali... 46 7e-04 UniRef50_C7QEM3 CRISPR-associated protein, Cse3 family n=9 Tax=A... 44 0.003 UniRef50_Q3A5Z3 CRISPR-associated protein, Cse3 family n=2 Tax=D... 44 0.003 UniRef50_C7JIG8 CRISPR-associated protein Cse3 n=8 Tax=Acetobact... 44 0.003 UniRef50_D0WFC7 CRISPR-associated protein, Cse3 family n=1 Tax=S... 44 0.004 UniRef50_B5GY64 Putative uncharacterized protein n=1 Tax=Strepto... 44 0.004 UniRef50_D0MET7 CRISPR-associated protein, Cse3 family n=1 Tax=R... 44 0.005 UniRef50_A8LMM7 CRISPR-associated protein n=2 Tax=Alphaproteobac... 43 0.005 UniRef50_Q6NEQ5 Putative uncharacterized protein n=1 Tax=Coryneb... 42 0.010 UniRef50_Q4JWK1 Putative uncharacterized protein n=2 Tax=Coryneb... 42 0.011 UniRef50_Q47PJ5 CRISPR-associated protein, Cse3 family n=1 Tax=T... 40 0.043 UniRef50_C4ZJY2 CRISPR-associated protein, Cse3 family n=1 Tax=T... 40 0.048 >UniRef50_Q46897 Uncharacterized protein ygcH n=10 Tax=Enterobacteriaceae RepID=YGCH_ECOLI Length = 199 Score = 412 bits (1060), Expect = e-114, Method: Compositional matrix adjust. Identities = 199/199 (100%), Positives = 199/199 (100%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP Sbjct: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA Sbjct: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ Sbjct: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 Query: 181 QGIGPAKSMGCGLLSLAPL 199 QGIGPAKSMGCGLLSLAPL Sbjct: 181 QGIGPAKSMGCGLLSLAPL 199 >UniRef50_D0FPP5 CRISPR-associated protein, Cse3 family n=2 Tax=Erwinia pyrifoliae RepID=D0FPP5_ERWPY Length = 200 Score = 191 bits (484), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 93/197 (47%), Positives = 126/197 (63%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 +YLS++ + +W++D YQ+H+ LW LFP+RP RDFLF VE R+ G VLLQS Q+P Sbjct: 2 IYLSQIDVPWSWAKDPYQMHRALWQLFPDRPSDRRDFLFRVETRHAGSGQRVLLQSPQLP 61 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 + A A V+ +K + L G L+FRLRANP+KTI D + RL+S+G +K CRVPLI + Sbjct: 62 QNCAAAKVLASKVMHLNLSPGQRLHFRLRANPVKTIKDKRGRLNSRGEVKSCRVPLIDDN 121 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 + + WL RKL AA + E F+ +GKIQ VCFEG+L + + Sbjct: 122 QLMQWLVRKLEGAAVLNSASVSKEPALCFNKQAVAGKIQPVCFEGILQVTSETHFYQCMA 181 Query: 181 QGIGPAKSMGCGLLSLA 197 GIGPAKSMGCG+LS+A Sbjct: 182 DGIGPAKSMGCGMLSIA 198 >UniRef50_Q74DC7 CRISPR-associated protein, CT1974 family n=2 Tax=Desulfuromonadales RepID=Q74DC7_GEOSL Length = 202 Score = 160 bits (406), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 96/202 (47%), Positives = 122/202 (60%), Gaps = 7/202 (3%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLSKV+I R+ Y++H+ LW LFP DA RDFLF VE R+ + VLLQS + P Sbjct: 1 MYLSKVLINGTACRNPYEIHRVLWKLFPEDADAERDFLFRVE-RSGQQSVEVLLQSRREP 59 Query: 61 VSTAVATVI--KTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 A V+ +K LQ L F L ANPIKTI D RL+S IK+CRVPLI+ Sbjct: 60 TMAASREVLLMGSKPYLLSLQQDQQLRFMLVANPIKTINDESARLNSANEIKKCRVPLIR 119 Query: 119 EAEQIAWLQRKLGNAARVEDVHPISERPQYF---SGDGKSGKIQTVCFEGVLTINDAPAL 175 E + AWL+RKL A +E V + +RP + + + GK+Q V F GVL++ D L Sbjct: 120 EEDLRAWLKRKLEGVAVIEAVE-VEKRPAMNFRKAREKRVGKVQAVSFHGVLSVTDPVGL 178 Query: 176 IDLVQQGIGPAKSMGCGLLSLA 197 I L+ GIGPAK+ GCGLLSLA Sbjct: 179 ISLINTGIGPAKAFGCGLLSLA 200 >UniRef50_C5SD47 CRISPR-associated protein, Cse3 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD47_CHRVI Length = 209 Score = 139 bits (349), Expect = 8e-32, Method: Compositional matrix adjust. Identities = 81/202 (40%), Positives = 107/202 (52%), Gaps = 8/202 (3%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNR-------PDAAR-DFLFHVEKRNTPEGCHV 52 M LS+ I + +R+ Y +H+ +W LFP PD R FLF VE V Sbjct: 1 MILSRAEIPWSEARNPYDMHRAIWRLFPGEAAESRRTPDQPRRGFLFRVEDHRPGRPAQV 60 Query: 53 LLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRC 112 L+QS MP A +I ++++ Q G L F L ANPIKTI D Q + C Sbjct: 61 LIQSRCMPQPEATLNLIGSREINPQPSQGQRLAFILTANPIKTIKDRQADTKPRKTRDTC 120 Query: 113 RVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDA 172 RVPLI E Q +WL ++L + A VE V P YF + GKI FEG+LT+ D Sbjct: 121 RVPLITEETQKSWLIQRLKDVAEVEAVAVTPHPPLYFRKANRGGKILCATFEGLLTVLDP 180 Query: 173 PALIDLVQQGIGPAKSMGCGLL 194 AL+ L++ G+GPAK+ GCGLL Sbjct: 181 NALVALLENGLGPAKAFGCGLL 202 >UniRef50_B2N0R4 Putative uncharacterized protein n=1 Tax=Escherichia coli 53638 RepID=B2N0R4_ECOLX Length = 58 Score = 105 bits (263), Expect = 6e-22, Method: Composition-based stats. Identities = 48/51 (94%), Positives = 51/51 (100%) Query: 149 FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 FSG+GK+GKIQTVCFEGVLTINDAPALIDL+QQGIGPAKSMGCGLLSLAPL Sbjct: 8 FSGEGKNGKIQTVCFEGVLTINDAPALIDLLQQGIGPAKSMGCGLLSLAPL 58 >UniRef50_A1SV70 CRISPR-associated protein, Cse3 family n=2 Tax=Gammaproteobacteria RepID=A1SV70_PSYIN Length = 180 Score = 103 bits (258), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 74/199 (37%), Positives = 104/199 (52%), Gaps = 24/199 (12%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLS+V++ + D+Y+ HQ +W LF N D RD LF VE + + C VLLQS+ P Sbjct: 1 MYLSQVMLN---THDIYEQHQAIWSLFENVADRKRDHLFRVEVAD-RQSCKVLLQSSTEP 56 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 S+ A V+ +K +++ F+L A P K + S+G +V IKEA Sbjct: 57 KSSEQAKVLASKSFLAEIKQDAFYKFKLLAYPTKCL--------SQGK----KVIEIKEA 104 Query: 121 -EQIAWLQRKLGNA-ARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDL 178 EQ+ WLQRKL A V + + R + KS + VCFEG+L + D+ + Sbjct: 105 NEQVQWLQRKLSGANVTVTAMDDLMVRSK------KSYNSRFVCFEGILQVTDSEQIQRA 158 Query: 179 VQQGIGPAKSMGCGLLSLA 197 + GIG K G GLLSLA Sbjct: 159 LVMGIGRKKHAGAGLLSLA 177 >UniRef50_Q2JWC6 CRISPR-associated protein, Cse3 family n=2 Tax=Chroococcales RepID=Q2JWC6_SYNJA Length = 210 Score = 88.6 bits (218), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 72/218 (33%), Positives = 104/218 (47%), Gaps = 32/218 (14%) Query: 1 MYLSKVI-------IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVL 53 MYLS++I + R S + + LHQ + H FP++P +H+ R P+GC +L Sbjct: 1 MYLSRLILNERQLLVQRELS-NAHALHQRIMHGFPDQPTKTPRSDWHILYRQEPDGCTIL 59 Query: 54 LQSAQMPVSTAVATVIKTKQVEFQ--------LQVGVPLYFRLRANPIKTILDNQKRLDS 105 +QS P + + + E + L G FRLRANP K D Sbjct: 60 VQSVIQPDWSRLPQGYVQRDPEVKIFDLRPEVLSKGRCFQFRLRANPSKR--------DK 111 Query: 106 KGNIKRCRVPLIKEAEQIAWLQRK-LGNAARVEDVHPISERPQYFS-GDGKSG--KIQTV 161 K R V + +Q+ WL+R+ + V I PQ F G SG +I TV Sbjct: 112 K---TRKIVGFFRSEDQLEWLRRQGFQHGFEVLAAEGIPS-PQIFGIKKGLSGPVRIHTV 167 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+G+L + D+ A + VQQGIG +S GCGLLSL+ + Sbjct: 168 LFQGILRVTDSEAFVKAVQQGIGRGRSYGCGLLSLSKI 205 >UniRef50_A5GBK0 CRISPR-associated protein, Cse3 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK0_GEOUR Length = 229 Score = 83.2 bits (204), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 69/223 (30%), Positives = 100/223 (44%), Gaps = 35/223 (15%) Query: 4 SKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQM---- 59 ++ + A S D+Y H+ LW +P++P+A RDFL +++ VL + + Sbjct: 11 AETVRAAGISEDVYAWHKLLWECYPDQPEAERDFLTRIDQLEGAYRFWVLAKRKPVMPRW 70 Query: 60 -PVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTIL----DNQKRLDSKGNIKRC-R 113 PV I + Q F LRANP++ + + ++ LD+ G +R R Sbjct: 71 CPVDGFGLNEISPSFLSRQYYA-----FDLRANPVRAAVQRDANGEQVLDANGKRRRGKR 125 Query: 114 VPLIKEAEQIAWLQRKL--------------GNAARVED----VHPISERPQYFSGDGKS 155 VPL+K E AWL RK G VE+ + P+ E +F G+S Sbjct: 126 VPLVKPDELRAWLVRKGEVRCRDKETGLDVPGGFRLVEERSLEISPMVE--SHFRKKGQS 183 Query: 156 GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G V F G L + D I+ Q GIG AK G GLL LAP Sbjct: 184 GYHGGVQFRGTLEVTDRAKFIESYQSGIGSAKGFGFGLLLLAP 226 >UniRef50_B8IZA5 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA5_DESDA Length = 207 Score = 80.5 bits (197), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 61/195 (31%), Positives = 91/195 (46%), Gaps = 19/195 (9%) Query: 15 DLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQV 74 D Y H+ +W FPNRPDA+RDFLF +++ P G V + S P T + Sbjct: 21 DCYAWHKAIWQCFPNRPDASRDFLFRLDE--VPAGTLVHVLSPHEPQRPDFCT-----ED 73 Query: 75 EFQLQVGVPLYFRLRANPIKTILDNQKRLD---SKGNIKRC--RVPLIKEAEQIAWLQRK 129 +Q++ P + + I + ++++ S G K+ R +IK EQ AWL RK Sbjct: 74 HWQIKAVPPCFLKYNCYRFDVICNPGRKVEAFTSDGQRKKNSRREAIIKPDEQNAWLDRK 133 Query: 130 LGNAARVEDVHP----ISERPQY-FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIG 184 AA +V P I +Y F D +SG V F GVL + +G+G Sbjct: 134 A--AANGFEVLPGMRSIDPSTRYSFRKDHRSGTHIGVRFSGVLRVTQRDEFCRAFHKGLG 191 Query: 185 PAKSMGCGLLSLAPL 199 A+ G G+L L+P+ Sbjct: 192 SARGFGFGMLLLSPV 206 >UniRef50_Q2FNT6 CRISPR-associated protein, CT1974 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNT6_METHJ Length = 228 Score = 79.0 bits (193), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 68/225 (30%), Positives = 105/225 (46%), Gaps = 35/225 (15%) Query: 1 MYLSKVIIARAWS-----RDL----YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCH 51 M+ SK+ + R + RDL YQ+H+ +W LF + PD RDFL+ E T Sbjct: 1 MFFSKMTLDREAAISGRFRDLVTGPYQVHEVIWDLFADHPDRKRDFLYRAEL--TGRDPV 58 Query: 52 VLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT---------------- 95 V L SA+ PV I +K LQ L FR+R NP+ T Sbjct: 59 VYLLSARKPVYEGNVWNILSKPFHPVLQKDDLLNFRIRVNPVVTKTEPDPDRKRIRHRHD 118 Query: 96 -ILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKL--GNAARVED--VHPISERPQYFS 150 I+D ++RL+ + N L+++ E + WL+++ G + ED + + Q+ Sbjct: 119 VIMDAKRRLN-EANSSFSMSDLVQQ-ESVRWLRQRSEKGGFSLYEDRVIAGGYRKMQFSQ 176 Query: 151 GDGKSG-KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 G K+ I V +GVL + D + ++ G+GPAK GCGL+ Sbjct: 177 GRKKNTISISVVDCDGVLRVTDPDLFLQMICNGLGPAKGFGCGLM 221 >UniRef50_D0Y917 CRISPR-associated protein, Cse3 family n=2 Tax=Dehalococcoides RepID=D0Y917_9CHLR Length = 209 Score = 78.6 bits (192), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 69/218 (31%), Positives = 100/218 (45%), Gaps = 30/218 (13%) Query: 1 MYLSKVIIARAWSRDL------YQLHQGLWHLFPNRPDAA-RDFLFHVEKRNTPEGCHVL 53 MYLS + + R L Y+LH+ L FP++ D LF ++ G VL Sbjct: 1 MYLSLLRLNPRSKRALTESSRPYELHRSLLKAFPDKADGGPGRVLFRLDMNEQTGGISVL 60 Query: 54 LQSAQMPV------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKG 107 +QS + P T T K K+ + L G L FRLRANP K R S G Sbjct: 61 IQSEKKPFWTNLNGYTEFVTECKCKEFKPALAPGQVLRFRLRANPTK-------RSKSTG 113 Query: 108 NIKRCRVPLIKEAEQIAWLQRKLGNAA-RVEDVHPISE---RPQYFSGD--GKSGKIQTV 161 R ++K EQ+ WL++K N V +V + E + + D G + +V Sbjct: 114 K----REGILKTEEQVEWLRKKGMNGGFEVCEVFTVDEGFAKDKMTDTDNAGHHTNMLSV 169 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+G+L + D+ A ++ GIG AK G GLLS+A + Sbjct: 170 RFDGLLRVTDSDAFQSTLRDGIGSAKGFGFGLLSVASV 207 >UniRef50_B6WQ61 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ61_9DELT Length = 206 Score = 78.6 bits (192), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 67/208 (32%), Positives = 89/208 (42%), Gaps = 18/208 (8%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 M L + +AR RD Y HQ LW FP PDA RDFL + P+GC + L + P Sbjct: 7 MLLDRQALARCRFRDSYAWHQALWECFPAMPDAGRDFLTRTDW--LPQGCRIYLLCRREP 64 Query: 61 VSTAV----ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRC--RV 114 V + +K F LQ G F L ANP + + D+ G R R+ Sbjct: 65 VRPDWCPPGSWAVKNIAPAF-LQHGT-YAFDLLANPTRKV----AAFDAGGQRTRNGKRL 118 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPIS---ERPQYFSGDGKSGKIQTVCFEGVLTIND 171 L+ E + AW++ K G D P++ F +G V F G L + D Sbjct: 119 ALLDETSRQAWMEAKAGQHGFCLD-GPLALDDAGASIFWRRACAGTHIGVRFRGRLQVTD 177 Query: 172 APALIDLVQQGIGPAKSMGCGLLSLAPL 199 I GIG AK+ G G+L L PL Sbjct: 178 RERFIHAFYHGIGSAKAFGFGMLLLQPL 205 >UniRef50_A6W167 CRISPR-associated protein, Cse3 family n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W167_MARMS Length = 224 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 70/222 (31%), Positives = 99/222 (44%), Gaps = 31/222 (13%) Query: 1 MYLSKVII-ARAWSRDL---------YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLSKV A +R L Y HQ LW LF + R FLF E+ Sbjct: 1 MYLSKVSFQASQQARQLLLGFGGKGVYSTHQMLWQLFTEEDE--RSFLFREEQSADGSKA 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ---KRLDSKG 107 +L S + P S +KTK +LQ G L F LRANP D + KR D Sbjct: 59 FFVLSSVK-PESDESTFNVKTKTFMPKLQSGQRLGFTLRANPTVCTTDEKGKSKRHDVMM 117 Query: 108 NIKRC----------RVPLIKEAEQIAWLQ--RKLGNAARVEDVHP-ISERPQYFSGDGK 154 + K+ + LI E W+ ++L N D P + Q+ S + Sbjct: 118 HAKKAAKESGVSDSEEIRLIMEQAAQEWIANPKRLENWGFTLDFLPEVQTYMQHRSDKNR 177 Query: 155 SGKIQ--TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 KI+ +V ++GVLT+ D ++ +++G G AKS+GCGL+ Sbjct: 178 EDKIRFSSVDYQGVLTVQDPEKFLEQLEKGFGRAKSLGCGLM 219 >UniRef50_Q53WG9 Putative uncharacterized protein TTHB192 n=1 Tax=Thermus thermophilus HB8 RepID=Q53WG9_THET8 Length = 211 Score = 75.5 bits (184), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 68/222 (30%), Positives = 104/222 (46%), Gaps = 35/222 (15%) Query: 1 MYLSKVII---ARAWSRDL---YQLHQGLWHLFPNRPDAARD-FLFHVEKRNTPEGCHVL 53 M+L+K+++ +RA RDL Y++H+ L + R+ L+ +E E VL Sbjct: 1 MWLTKLVLNPASRAARRDLANPYEMHRTLSKAVSRALEEGRERLLWRLEPARGLEPPVVL 60 Query: 54 LQSAQMP----VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNI 109 +Q+ P + A V K L+ G L FRLRANP K RL + G Sbjct: 61 VQTLTEPDWSVLDEGYAQVFPPKPFHPALKPGQRLRFRLRANPAK-------RLAATGK- 112 Query: 110 KRCRVPLIKEAEQIAWLQRKLGNAAR-------------VEDVHPISERPQYFSGDGKSG 156 RV L AE++AWL+R+L ++D R + GK Sbjct: 113 ---RVALKTPAEKVAWLERRLEEGGFRLLEGERGPWVQILQDTFLEVRRKKDGEEAGKLL 169 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 ++Q V FEG L + D + +++G+GP K++G GLLS+AP Sbjct: 170 QVQAVLFEGRLEVVDPERALATLRRGVGPGKALGLGLLSVAP 211 >UniRef50_Q1J366 CRISPR-associated protein, CT1974 n=2 Tax=Deinococci RepID=Q1J366_DEIGD Length = 211 Score = 74.3 bits (181), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 76/224 (33%), Positives = 104/224 (46%), Gaps = 43/224 (19%) Query: 1 MYLSKVIIA---RAWSRDL---YQLHQGLWHLFPNR-------PDAARDFLFHVEKRNTP 47 +YLS++ R +RDL Y LHQ L F PD R L+ E R T Sbjct: 4 LYLSRLRFEDRDRRTARDLASPYALHQTLRWAFAGAGVEGAPLPDGER-ALWRQEDRAT- 61 Query: 48 EGCHVLLQSAQMPVSTAVAT---------VIKTKQVEFQLQVGVPLYFRLRANPIKTILD 98 +L+QS P A+ +KT + L G PL FRLRAN Sbjct: 62 ----LLVQSLTAPDWEALNARHPGSLRGWEVKTVDLAPALTPGRPLRFRLRANV------ 111 Query: 99 NQKRLDSKGNIKRCRVPLIKEAEQIAWLQR---KLGNAARVED-VHPISERPQYFSGDGK 154 ++LD KG +R V EQ+ WL R + G A D VH + + + S Sbjct: 112 TVRKLDEKGRSRRHAVR--GPHEQLEWLSRQGERCGFAVLAADIVHSGTVKTRKGSA--- 166 Query: 155 SGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + TV FEG+L + D AL++ V+ G+G AK++GCGLLSL P Sbjct: 167 TITLHTVTFEGILRVTDPAALLEAVRGGLGHAKALGCGLLSLGP 210 >UniRef50_B7KJ27 CRISPR-associated protein, Cse3 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ27_CYAP7 Length = 219 Score = 74.3 bits (181), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 67/224 (29%), Positives = 103/224 (45%), Gaps = 33/224 (14%) Query: 1 MYLSKV---IIARAWSRDL---YQLHQGLWHLFPNRPD----AARDFLFHVEKRNTPEGC 50 MYLSK+ I + A S DL ++LHQ + FPN + + L+ +E G Sbjct: 1 MYLSKIELNIRSSAVSTDLSDCHKLHQRVMQGFPNENNPEYRSEAKILYRLE------GS 54 Query: 51 HVLLQSAQMPVSTAV---ATVIKTKQVEFQ-LQVGVPLYFRLRANPIKTILDNQKRLDSK 106 + +QS P T + T + +++++ ++ G LYFRL NP++ + R D Sbjct: 55 ILFVQSKNKPDWTQLPKGYTAEEITEMDYEKIKKGDYLYFRLLGNPVQQT--TKLRTDDS 112 Query: 107 GNI-----------KRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKS 155 GNI K R L + QI WL L E S + K Sbjct: 113 GNIIMKNNSEKPQKKTVRRFLSNKDAQIQWLMNHLKGTILQECYVSASSDIRGQCKQSKR 172 Query: 156 GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 ++TV F+GVL + D+ + I +++GIG +S GCGLLS+A Sbjct: 173 IFLKTVLFDGVLQVTDSESFIKALREGIGRGRSYGCGLLSIAKF 216 >UniRef50_Q314I5 CRISPR-associated protein, CT1974 n=2 Tax=Desulfovibrio RepID=Q314I5_DESDG Length = 219 Score = 73.9 bits (180), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 67/218 (30%), Positives = 102/218 (46%), Gaps = 28/218 (12%) Query: 1 MYLSKVII--ARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ 58 M++SK+++ RA ++LY H+ LW+LF + PD RDFLF R E L S + Sbjct: 1 MWMSKLVLDPRRAVGKNLYDTHRLLWNLFADAPDRTRDFLF----REQDEPYTFLTVSRR 56 Query: 59 MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN--QKRLD--------SKGN 108 P T I+ K +LQ G + F LR N + +N Q+R D K Sbjct: 57 QPEDTTGWWSIQIKPYAPKLQAGDAVAFSLRVNAVVKRNENGKQRRFDIVQDACLRMKEL 116 Query: 109 IKRCRVPLIKEAEQIA---WL---QRKLG----NAARVEDVHPISERPQYFSGDGKSGKI 158 + ++P E Q A WL Q+ LG +AA + + + + + D +SG + Sbjct: 117 NQNAQMPTRAEIAQEAGTRWLLARQQALGLSIESAAILVEGCKVERFVKRATRDTRSGVV 176 Query: 159 Q--TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 + +G + D L+ + QG+GPAK GCGLL Sbjct: 177 SLGIMDLQGTAEVKDPQLLLQALFQGVGPAKGFGCGLL 214 >UniRef50_D1A6Q6 CRISPR-associated protein, Cse3 family n=5 Tax=Actinomycetales RepID=D1A6Q6_THECD Length = 214 Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 26/199 (13%) Query: 15 DLYQLHQGLWHLFPNR--PDAARD--FLFHVEKRNTPEGCHVLLQSA------QMPVSTA 64 D+ +LH+ + LFP+ P+A R LF +E+R P G +L+QS+ ++P S Sbjct: 25 DVVRLHRRIMSLFPDGLGPEARRRAAVLFRLEER--PTGTSILMQSSIEPALEKLPASYG 82 Query: 65 VATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIA 124 A + L+ GV +++R+ AN + + N + G K+ VPL AE Sbjct: 83 KARCKSLAPLLNGLREGVNVHYRIVANATRKLGRNT----TAGRPKQV-VPL-HGAEADE 136 Query: 125 WLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC-----FEGVLTINDAPALIDLV 179 W +R+ A V + + R Q +G G+ V F+G T+ D ALID + Sbjct: 137 WWRRQADAAGLV--LRSLHSR-QLDTGTGRRSDNNRVTHARTQFDGTATVTDPKALIDRI 193 Query: 180 QQGIGPAKSMGCGLLSLAP 198 GIG K+ GCGLL++AP Sbjct: 194 HAGIGRGKAYGCGLLTIAP 212 >UniRef50_Q0W583 Predicted CRISPR-associated protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W583_UNCMA Length = 250 Score = 68.6 bits (166), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 75/250 (30%), Positives = 111/250 (44%), Gaps = 54/250 (21%) Query: 1 MYLSKVII---ARAWSRDL---YQLHQGLWHLFPN---RPDAARDFL--FHVEKRNTPEG 49 MYLS++I+ RA RDL ++LH+ + FP+ + AR+ H + G Sbjct: 1 MYLSRLILNPRTRAVRRDLADCHELHRTILGGFPDLNGKGGEARETFGVLHRIDIHPRSG 60 Query: 50 CHVLL-QSAQMP-------------VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT 95 VLL QS + P T + +++ G FRLRANP K Sbjct: 61 AIVLLVQSQEKPDWSKLPEGYLLENTGTENPACKAIDEQYGKIKAGDVYAFRLRANPTKK 120 Query: 96 I----LDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK-------LGNAARVEDVHP--I 142 I +++ K K N +R VP+ E++QI WL+RK L + R ++ I Sbjct: 121 IGTSRIEDIKAGKPKNNGRR--VPIRNESDQILWLKRKGAAGGFELMSTKRFSELSDVLI 178 Query: 143 SERPQ---YFSGDGKSGKIQ-----------TVCFEGVLTINDAPALIDLVQQGIGPAKS 188 SE Y G K+Q +V FEG L + +A ++ ++ GIG K+ Sbjct: 179 SEEGHQKIYTFDTGIKAKVQKNARENRLTFGSVLFEGTLKVTNAEKFLETLKSGIGSGKA 238 Query: 189 MGCGLLSLAP 198 G GLLSLAP Sbjct: 239 YGFGLLSLAP 248 >UniRef50_A0LM55 CRISPR-associated protein, Cse3 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM55_SYNFM Length = 202 Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 69/218 (31%), Positives = 96/218 (44%), Gaps = 39/218 (17%) Query: 1 MYLSKVIIARAWS------RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS + + R D+Y LH+G+ F D R LF VE N +++ Sbjct: 1 MYLSLLSLDRLHRGTMRLLSDIYLLHKGIMSGFTRCGDGLR-VLFRVEPENDDRIVRIMV 59 Query: 55 QSAQMPV------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGN 108 QS P ++TK L+ G FRLRANP + N KR Sbjct: 60 QSDGSPSWELFTERHPCVIDMRTKVFSPALRAGHSYRFRLRANP--AVKRNGKRYG---- 113 Query: 109 IKRCRVPLIKEAEQIAWLQRK---LGNAARVEDVHPISERPQYFSGDGKSG------KIQ 159 LI++ WL+RK LG R V + E Y +G + I+ Sbjct: 114 -------LIRDETLEEWLRRKEPALGLQFR--SVLALDE--GYVTGHKEGSGHPQRINIK 162 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 T FEG+LT+++ + + + GIGPAK+ GCGLLSLA Sbjct: 163 TARFEGILTVSEPHLVQNALCCGIGPAKAFGCGLLSLA 200 >UniRef50_Q12YA7 CRISPR-associated protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YA7_METBU Length = 224 Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 58/203 (28%), Positives = 91/203 (44%), Gaps = 23/203 (11%) Query: 17 YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEF 76 + +H+ +W LFP D R F++ + + +++ S P+ I KQ + Sbjct: 24 HNVHRLVWSLFPVNEDDKRKFIYRQDSMGSLPSFYLV--SENEPIDELNVWDIDVKQYDP 81 Query: 77 QLQVGVPLYFRLRANPI-------------KTILDNQKRL--DSKGNIKRCRVPLIKEAE 121 L+ G L F LRANPI ++D + RL ++ G+I+ +P I + + Sbjct: 82 ILKSGQKLAFSLRANPIVSKRDENDKQHRHDVVMDEKFRLKMENGGDIE-PNMPDIVQRK 140 Query: 122 QIAWLQRK---LGNAARVEDVH-PISERPQYFSGDGKSG-KIQTVCFEGVLTINDAPALI 176 WL RK G + E + + + F GK TV G LT+ D Sbjct: 141 GSEWLLRKGDMNGFSINAEQIRVDAYQNHKLFKPKGKHHVSFSTVDIVGTLTVTDPDIFR 200 Query: 177 DLVQQGIGPAKSMGCGLLSLAPL 199 D + +GIGPAK GCG+L + PL Sbjct: 201 DALFKGIGPAKGFGCGMLLVRPL 223 >UniRef50_B8IMR1 CRISPR-associated protein, Cse3 family n=3 Tax=Alphaproteobacteria RepID=B8IMR1_METNO Length = 243 Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 58/205 (28%), Positives = 86/205 (41%), Gaps = 26/205 (12%) Query: 20 HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQ 79 H+ LW LF + PD ARDFL+ + T + L+ S + P T I+TK L Sbjct: 36 HRLLWSLFADSPDRARDFLWCEDAGGTWQRATFLILSRRRPQDTRGLFEIETKPFAPVLA 95 Query: 80 VGVPLYFRLRANPIKT-----ILDNQKRLDSKGNIKRCRVPLIKEAEQ--------IAWL 126 G L FRLRA+P + + KR+D R P ++ + WL Sbjct: 96 PGQRLGFRLRASPAASDTPTAVGRRGKRIDPVARALRDLPPEVRAERRHSVLQEVGAGWL 155 Query: 127 QRKLGNAARV--EDVHPISERPQYFSGDGKSGKI-----------QTVCFEGVLTINDAP 173 R+ A + P R S DG+ + ++ FEGVL + D Sbjct: 156 ARQGARAGFTLCDAEAPSGTRQPCLSVDGERWNVLPREGAAPVRFSSLDFEGVLRVEDPS 215 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAP 198 + + +G G AK+ GCGL+ + P Sbjct: 216 LFLAALAEGFGRAKAFGCGLMLIRP 240 >UniRef50_D1CAI9 CRISPR-associated protein, Cse3 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAI9_SPHTD Length = 257 Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 77/255 (30%), Positives = 103/255 (40%), Gaps = 59/255 (23%) Query: 1 MYLSKVII---ARAWSRDLY---QLHQGLWHLFPN--RPDAAR---DFLFHVEKRNTPEG 49 MYLS++I+ +R RDL QLH+ + FPN P AR L+ +E Sbjct: 1 MYLSRLILNPRSREVRRDLADCQQLHRSVMSGFPNLAAPGDARARLGILYRLETHPRTGM 60 Query: 50 CHVLLQSA------QMPVSTAVATV-------IKTKQVEFQLQVGVPLYFRLRANPIKTI 96 +L+QSA Q+P + T + L G+ L FRLRANP K I Sbjct: 61 PTLLVQSAIEPTWSQLPADYLLNTAGVPNPDCKPVGPIYDALDAGMVLTFRLRANPTKRI 120 Query: 97 LDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAA-RVEDVHPISERPQY------- 148 + + N RV L EA+Q+AWL+RK V V +E Y Sbjct: 121 KPDTD--PGRSNRLGKRVELRTEADQLAWLRRKGEQCGFEVLSVRATTEHEAYRWERAAA 178 Query: 149 -------------------------FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGI 183 DG+ V F+G+L I DA + QGI Sbjct: 179 IFGLEADKPEPVPDVRAVRGSKVYGRRADGERMTFAAVTFDGLLRIVDADRFRAALVQGI 238 Query: 184 GPAKSMGCGLLSLAP 198 G AK+ G GLLS+AP Sbjct: 239 GSAKAYGFGLLSIAP 253 >UniRef50_B8GIV2 CRISPR-associated protein, Cse3 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV2_METPE Length = 225 Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 65/204 (31%), Positives = 95/204 (46%), Gaps = 27/204 (13%) Query: 17 YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQ-SAQMPVSTAVATVIKTKQVE 75 YQ H +W LF + P+ RDFLF ++ +G V S ++P IK+K Sbjct: 26 YQAHSLIWDLFSDGPERERDFLF---RQEVHQGMPVFWTVSERVPSDRNETWNIKSKPYA 82 Query: 76 FQLQVGVPLYFRLRANPIKTILDN---QKR----LDSKGNIKRCRV--------PLIKEA 120 L+ G+ L F LRANPI++ D+ Q R +D K +K + +I+EA Sbjct: 83 PILRQGMHLSFVLRANPIRSRRDDLGKQHRHDVVMDMKTALKDSKPGDQWPAEDQIIQEA 142 Query: 121 EQIAWLQRKLGNAA--RVED--VHPISERPQYFSGDGKSGKIQ--TVCFEGVLTINDAPA 174 + WL + GNA ++D V F K +Q T+ F G+LT+ D Sbjct: 143 G-LVWLANQ-GNAKGFSLQDGAVRVDGYTQHRFVKPKKKQMVQISTLDFTGLLTVTDPER 200 Query: 175 LIDLVQQGIGPAKSMGCGLLSLAP 198 + GIGPAK GCGL+ + P Sbjct: 201 FTTALFNGIGPAKGFGCGLMMVRP 224 >UniRef50_B4TTX1 Crispr-associated protein, Cse3 family n=15 Tax=Enterobacteriaceae RepID=B4TTX1_SALSV Length = 235 Score = 65.9 bits (159), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 66/236 (27%), Positives = 102/236 (43%), Gaps = 50/236 (21%) Query: 1 MYLSKV----------IIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLS++ ++A+ S Y HQ LW LFP + R FLF R G Sbjct: 1 MYLSRIQLRFNNLRPEMLAKWNSARPYASHQWLWQLFPEQ--ELRQFLF----REEAHGG 54 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT-------ILDNQKRL 103 +L SA P+S +I+TK QL G+ L F+LRANP+ T ++ N K Sbjct: 55 FFML-SAIPPLSQHSLFLIETKPFNPQLTNGLELDFQLRANPVITRNGKRSDVMMNAKHQ 113 Query: 104 DSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKS------GK 157 +++ R +++ AWL+++ G + P + ++GD S G Sbjct: 114 AKANGVEKERWWELQQQAAQAWLEQQ-GQQHGFRLIAPEPDDFAMWAGDEYSELQAHCGC 172 Query: 158 IQ-------------------TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 +Q +V F G L I DA + G+G +K++GCG+L Sbjct: 173 VQAYQQYRFVRKDQQKPITFSSVDFSGALCITDAALFKQALFSGLGKSKALGCGML 228 >UniRef50_C6C417 CRISPR-associated protein, Cse3 family n=4 Tax=Enterobacteriaceae RepID=C6C417_DICDC Length = 215 Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 59/209 (28%), Positives = 92/209 (44%), Gaps = 26/209 (12%) Query: 1 MYLSKV---------IIARAW-SRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 M+ S+V ++A W + +Y HQ LW LFP + +R FLF + T Sbjct: 1 MFFSRVTLQPAALPSVMAEKWQTTPVYASHQWLWQLFPQ--EGSRGFLFRQDDHATLSRY 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT--------ILDNQKR 102 ++L SA P V++TK + QL G+PL F LRANP+ T ++D + Sbjct: 59 YLL--SACAPRQDHNLFVVETKPWQPQLNAGMPLAFSLRANPVVTRRQKRCDVLMDAKYH 116 Query: 103 LDSKGNIKRCRVPLIKEAEQIAWLQR---KLGNAARVEDVHPISERPQYFSGDGKSGKIQ 159 ++G P ++A + WL R + G A V Y Sbjct: 117 AKAQGADSAEIWPRQQQAA-VDWLVRQGERGGFAVHACHVDGYQRHRLYKPQQSGPVSFS 175 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKS 188 +V F+G+L I DA + V QG+G +++ Sbjct: 176 SVDFDGLLRITDAKRFAETVSQGLGKSRA 204 >UniRef50_A9GV72 Putative uncharacterized protein ygcH n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GV72_SORC5 Length = 246 Score = 63.5 bits (153), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 68/245 (27%), Positives = 101/245 (41%), Gaps = 49/245 (20%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGC 50 MYLS+ ++ RA D+ LH+ + FP+ P A LF V++ Sbjct: 1 MYLSRALLNPISRAVRADIADIEGLHRTIMRAFPDGAGPHPRRAHGVLFRVDEAVLRGRF 60 Query: 51 HVLLQSAQMPVSTAVA----------------------TVIKTKQVEFQLQVGVPLYFRL 88 +L+QSA P T + + + +++ G F L Sbjct: 61 VLLVQSATRPDFTRLPEDYFLDIQEDLGLTEPSPIENPAIREVGSERARIRAGDFFRFSL 120 Query: 89 RANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKL----------GNAARVED 138 RA+P + I D + D K R RV L +A ++ WL+RK + A V Sbjct: 121 RASPTRRI-DTKSGDDGKRRNGR-RVELRDDASRLDWLRRKAMAGGFELCGAEDGAGVGG 178 Query: 139 VHPISERPQYFSGDGKSGKIQT-----VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGL 193 V + E G G S + Q V FEG L + DA + + G+GPAK+ G GL Sbjct: 179 VSAVEEPKLTGRGSGASEQRQQLTLAPVLFEGRLRVTDADRFREALAAGVGPAKAYGFGL 238 Query: 194 LSLAP 198 LS+AP Sbjct: 239 LSIAP 243 >UniRef50_C1DSI0 CRISPR-associated protein, CT1974 n=3 Tax=Pseudomonadaceae RepID=C1DSI0_AZOVD Length = 205 Score = 63.2 bits (152), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 67/217 (30%), Positives = 94/217 (43%), Gaps = 34/217 (15%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLF-PNRPDAARDFLFHVEKRNTPEGCHVL 53 MYL+++ + AR D Y +H+ L F + DA FL+ +E L Sbjct: 1 MYLTRLTLDPRSAQARRDLADAYDMHRTLVRAFVRDERDAPGRFLWRLEPGADAWASPTL 60 Query: 54 L-QSAQ---------MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRL 103 L QS + +P K +E ++ FRL ANP T ++ Sbjct: 61 LVQSCESGDWDVLQGLPGYLQRPAECKALDLEALIRPQWRYRFRLLANPTVTRAGKRR-- 118 Query: 104 DSKGNIKRCRVPLIKEAEQIAWLQR---KLGNAARVEDVHPISERPQYFSGDGKSGKIQT 160 L+ EAEQ+AWLQR + G A + V G G +Q Sbjct: 119 -----------GLLGEAEQLAWLQRQGERHGFAVKAVLVSASDLLDSRRKG-GAPIVLQR 166 Query: 161 VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 VCFEG+L + +A AL + GIGPAK+ GCGLLS+A Sbjct: 167 VCFEGLLQVVEADALRRALASGIGPAKAFGCGLLSVA 203 >UniRef50_B4RSK4 CRISPR-associated protein, Cse3 family n=5 Tax=Gammaproteobacteria RepID=B4RSK4_ALTMD Length = 222 Score = 62.8 bits (151), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 67/227 (29%), Positives = 103/227 (45%), Gaps = 39/227 (17%) Query: 1 MYLSKVI----------IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNT--PE 48 M+LSKV +A+ +Y HQ +W LF N + R FL+ E T PE Sbjct: 1 MFLSKVTMVSSPQTAQELAKLQRNGVYASHQLIWQLFSNVTE--RSFLYREEMGITGMPE 58 Query: 49 GCHVLLQS---AQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANP---IKTILDNQKR 102 +VL ++ A +P+ + V V + K L+ G L F+LR NP +K Q+R Sbjct: 59 -FYVLSKTEPQASLPIFSCVTKVFEPK-----LKKGQRLSFKLRVNPTVCVKGEDGKQRR 112 Query: 103 LD----SKGNIKR-----CRVPLIKEAEQIAWL--QRKLGNAARVEDVHPISERPQYFSG 151 D +K N+K + + E I WL +++L D P + Sbjct: 113 HDVMMQAKYNVKDELPDAQTLKMHMEQAAINWLNNEKRLDEWGITLDFQPSIDGYTQHKV 172 Query: 152 DGKSGKIQ--TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 K +IQ +V ++G+LT+ D I+ +G G AK MGCGL+ + Sbjct: 173 QKKRHQIQFSSVDYQGMLTVQDPLKFINQYAKGFGRAKGMGCGLMMI 219 >UniRef50_A5UR13 CRISPR-associated protein, Cse3 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR13_ROSS1 Length = 238 Score = 62.0 bits (149), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 65/238 (27%), Positives = 104/238 (43%), Gaps = 43/238 (18%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLFPNRPD-----AARDFLFHVEK-RNTPE 48 MYLS++I+ R D+Y+LH+ + FP PD A L+ +E + P Sbjct: 1 MYLSRLILDVRQPRVRRDLSDVYRLHRTILSAFPQAPDNVPARAHFGILYRIEPISDMPW 60 Query: 49 GCHVLLQSAQMPVSTAVAT-------------VIKTKQVEF-QLQVGVPLYFRLRANPIK 94 +L+QS + P + + ++ E+ +++ + FRL ANP + Sbjct: 61 LVRLLVQSREQPDWSHIPDRMFGPALDERGNPALRRIDDEYARIRSDMQFLFRLLANPTR 120 Query: 95 TILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK-------LGNAARVEDVHPISERPQ 147 + + D + + RV L++E EQIAWL K L + + DV + Q Sbjct: 121 RLSNRSSERDDR--LLGKRVALLREEEQIAWLAHKGEQHGFRLLSTSVNPDVPAVQAAKQ 178 Query: 148 YFS-GDGKSGKIQT-------VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 G K+ + QT V F G L + DA ++ GIG K+ G GLLS+A Sbjct: 179 ADEHGWRKATQTQTMHLTFGAVLFTGYLKVTDADRFRTALEHGIGSGKAFGFGLLSIA 236 >UniRef50_Q67RN9 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RN9_SYMTH Length = 224 Score = 61.6 bits (148), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 61/213 (28%), Positives = 94/213 (44%), Gaps = 37/213 (17%) Query: 14 RDLYQLHQGLWHLFPNRPD---AARDF---LFHVEKRNTPEGCHVLLQS------AQMPV 61 RD+ LHQ + FP+ D AR + L+ +E + +QS ++P Sbjct: 20 RDVQALHQRVMSAFPDVLDPEVEARAYFGVLYRLELNRYSGQVLLYVQSRVEPDWGRLPA 79 Query: 62 STAVAT-------VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 V + + +++ G L FRLRANP + I K N +R V Sbjct: 80 GYLTPADGLPNPAVKRVDEAYARIREGRVLRFRLRANPTRKIDTKSGPNGEKRNGRR--V 137 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPI---------SERPQYFSGDGKSGKIQTVCFEG 165 PL Q+ W++RK AR + SER + ++ G++ Q V FEG Sbjct: 138 PLSGLDAQLGWMERK----AREHGFELLEATVAAAGASERVRSYT-TGRT--FQGVLFEG 190 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 L + DA + +++GIGP K+ G GLLS+ P Sbjct: 191 RLVVRDAGRFREALERGIGPGKAYGYGLLSVGP 223 >UniRef50_Q2RY20 CRISPR-associated protein, CT1974 n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY20_RHORT Length = 220 Score = 60.5 bits (145), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 58/191 (30%), Positives = 86/191 (45%), Gaps = 16/191 (8%) Query: 20 HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQ 79 H+ +W LF + P A+RDF+F E L+ SA+ P ++TK + Sbjct: 33 HRLVWTLFADDPKASRDFVFR-----EAEPGRYLIVSARPPGDGQGLWRLETKPYAPAFR 87 Query: 80 VGVPLYFRLRANPIKTILD----NQKRLDSKGNIK-RCRVPL-IKEAEQIA--WL-QRKL 130 G F LRANP + KR+D+ + K R PL +++ E++A WL R+ Sbjct: 88 EGQRFGFTLRANPATAVKQAGETRGKRVDAIMHAKTRSATPLTVEDRERVALDWLLDRQQ 147 Query: 131 GNAARVEDV--HPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKS 188 G E R GK+ + +EGV T+ D L + +GIG AK+ Sbjct: 148 GFGVLFERALCSAGGYRQVRVPRGGKAITFSVIDYEGVFTVRDPGLLGQALVRGIGKAKA 207 Query: 189 MGCGLLSLAPL 199 GCGL+ L L Sbjct: 208 YGCGLMLLRRL 218 >UniRef50_C1XG03 CRISPR-associated protein, Cse3 family n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XG03_MEIRU Length = 156 Score = 59.7 bits (143), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 51/125 (40%), Positives = 65/125 (52%), Gaps = 9/125 (7%) Query: 77 QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK--LGNAA 134 LQ L FRLRANP T D DSK KR R L EQ+ WL R+ G + Sbjct: 35 HLQPAQVLRFRLRANPTVTKKDPNNP-DSK---KRKRHGLKTLEEQLEWLHRQGAKGGFS 90 Query: 135 RVEDVHPISERPQYFSGDGKSGKI--QTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCG 192 + + SER + + DG SG I Q+V +EG L I D A + G+G AK++G G Sbjct: 91 VLGAMVVQSERVRMYKHDG-SGPIVLQSVLYEGHLKITDLEAFKHTLAAGLGHAKALGFG 149 Query: 193 LLSLA 197 LLS+A Sbjct: 150 LLSIA 154 >UniRef50_Q04QB6 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB6_LEPBJ Length = 266 Score = 59.3 bits (142), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 66/250 (26%), Positives = 100/250 (40%), Gaps = 62/250 (24%) Query: 8 IARAWSRDLYQLHQGLWHLFPNRPDAARD----FLFHVEKRNTPEGC--HVLLQSAQMP- 60 I W ++ Y +HQ LW F + FLF ++ + P +L+ S ++P Sbjct: 17 IVFNWIQNPYNIHQRLWMAFSEYSSKDKPQNSPFLFQLDYNSDPGKISPRILVFSEKLPN 76 Query: 61 ---------VSTAVATVIKTKQVE-FQLQVGVPLYFRLRANPIKTI-------------- 96 V T + + KQ+ +Q G L F L ANP K + Sbjct: 77 WERAFQEFKVLTEIPVGNQIKQISPTFIQAGAVLRFSLTANPTKKLKDYRSLFQEELEGF 136 Query: 97 ------------------LDNQKRLDSKGNIKRC---RVPLIKEAEQIAWLQRKLGNAAR 135 L++ K+ +K I++ RV + E E + WL +K G+ Sbjct: 137 PDKFDPSDRVSFLEGKSKLEDLKKTLTKDQIQKLKSKRVGIYHEKELLNWLSKK-GSDNG 195 Query: 136 VEDVHPISERPQYFSGDGKSG-------KIQTVCFEGVLTINDAPALIDLV-QQGIGPAK 187 + + E FS + G KI TV F G+L I D PAL + +GIG K Sbjct: 196 FSLLDAVVEFQSDFSANKIKGSLSPSIPKIHTVSFSGILKIMD-PALFKIAYTKGIGTGK 254 Query: 188 SMGCGLLSLA 197 + GCG+L LA Sbjct: 255 AFGCGMLLLA 264 >UniRef50_C6WMQ8 CRISPR-associated protein, Cse3 family n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WMQ8_ACTMD Length = 230 Score = 58.2 bits (139), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 65/227 (28%), Positives = 102/227 (44%), Gaps = 40/227 (17%) Query: 1 MYLSKVII---ARAWSRDL---YQLHQGLWHLFPNRPDAA-----RDFLFHVEKRNTPEG 49 M+L+K+ + +R + RDL +++H+ + +P D + L+ ++ TP G Sbjct: 1 MFLTKLTVDVRSREFRRDLANLHEMHRTVMSGYPRVEDGSPARQTHGVLWRLDA--TPAG 58 Query: 50 CHVLLQSAQMPVSTAVATVIKTKQVEFQ--------LQVGVPLYFRLRANPIKTILDNQK 101 +QS P T + + T E + ++ G L FRL AN K Sbjct: 59 YTQYVQSLTRPDWTGLPETLLTSPAEVRSLDPLLDAIEPGRVLAFRLLANATK------D 112 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRK---LGNAAR-----VEDVHPISERPQYFSGDG 153 + ++ + RV Q++WL RK G A R V DV S +G Sbjct: 113 SVPAEPGGRGLRVAHRTPEAQVSWLARKGQRHGFALRDRPDGVPDVTLWSA--PRMTGRK 170 Query: 154 KSGK---IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 K+G+ + V F+G L + DA L + V GIG AK+ GCG+LSLA Sbjct: 171 KAGRPITVDAVRFDGHLVVTDADELREAVGSGIGRAKAYGCGMLSLA 217 >UniRef50_B5GAA2 Crispr-associated protein n=1 Tax=Streptomyces sp. SPB74 RepID=B5GAA2_9ACTO Length = 217 Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 56/196 (28%), Positives = 83/196 (42%), Gaps = 17/196 (8%) Query: 14 RDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVST---AVA 66 R LH+ + LFP+ R LF E+ T G +L+QS P T A Sbjct: 26 RSAVNLHKRVMSLFPDDLGERARQQTGALFRFEEDAT-RGSRLLVQSVVTPDPTRLPARY 84 Query: 67 TVIKTKQVEFQLQV---GVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQI 123 +++ ++ LQ GV + +RL N +T+ R + G + +PL + Sbjct: 85 GAVRSTEITPLLQRLRPGVRVNYRLTGNATRTL----SRDTTAGRPNQV-IPLHGADAEE 139 Query: 124 AWLQRKLGNAARVEDVHPIS-ERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQG 182 WL+R + +H + D + + F+G T+ D AL V G Sbjct: 140 WWLRRAASAGLDIHKIHTTELDDAAGNRHDKQRIRHARTRFDGTATVTDPDALRTCVTTG 199 Query: 183 IGPAKSMGCGLLSLAP 198 IG KS GCGLLSLAP Sbjct: 200 IGRGKSYGCGLLSLAP 215 >UniRef50_B6IWM2 CRISPR-associated protein, CT1974 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM2_RHOCS Length = 262 Score = 57.4 bits (137), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 63/205 (30%), Positives = 92/205 (44%), Gaps = 30/205 (14%) Query: 20 HQGLWHLFPNRPDAARD--FLFHVEKRNTPEGCHVLLQSAQMPVSTAVATV--IKTKQVE 75 H+ LW LFP+RP A R+ FLFHVE P V +++P + + I T+ + Sbjct: 60 HRMLWTLFPDRPTARREGLFLFHVEG-TRPFSAIV---RSRVPPEDGLGGIWTITTRPFD 115 Query: 76 FQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNI---------KRCRVP--LIKEAEQIA 124 L G+ L F LRA + +R + ++ + R P L K AE A Sbjct: 116 PALAPGLTLRFHLRAVASRWQPRPGERRGRRQDVIVAAWRDLPEEQRTPENLEKTAEHAA 175 Query: 125 --WLQR--KLGNAARVE------DVHPISERP-QYFSGDGKSGKIQTVCFEGVLTINDAP 173 WL R + G A VE D S R G +S + V +EG+LT+ D Sbjct: 176 LDWLARQGRRGGFAPVEGAVDVLDYDRASLRAGAKLGGRDRSIRFGAVTYEGLLTVTDPQ 235 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAP 198 A + QG+G ++ G GL+ +AP Sbjct: 236 AFRATLVQGLGAGRAYGNGLMQIAP 260 >UniRef50_D1CGD5 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD5_THET1 Length = 240 Score = 57.4 bits (137), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 44/143 (30%), Positives = 67/143 (46%), Gaps = 29/143 (20%) Query: 77 QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRC------RVPLIKEAEQIAWLQRK- 129 ++ G FRLRANP + I ++GN ++ RV L +E +QI WL RK Sbjct: 95 RINQGDRFIFRLRANPTRRI--------ARGNTEQAERWRGKRVELQREEDQIDWLIRKG 146 Query: 130 -------LGNAARVEDVHPISERP-------QYFSGDGKSGKIQTVCFEGVLTINDAPAL 175 L R + V + P + +G + +V FEGVL + D + Sbjct: 147 DQHGFKLLSITVRQQAVPNLRVLPNNKTHGWRRDAGGNRRLTFGSVQFEGVLEVTDRESF 206 Query: 176 IDLVQQGIGPAKSMGCGLLSLAP 198 + ++QG+G K+ G GLLS+AP Sbjct: 207 MQALEQGVGSGKAFGFGLLSIAP 229 >UniRef50_Q03C59 CRISPR-associated protein n=3 Tax=Lactobacillus RepID=Q03C59_LACC3 Length = 215 Score = 57.0 bits (136), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 65/226 (28%), Positives = 101/226 (44%), Gaps = 42/226 (18%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHL----------FPNRPDAARDFLFHVEKRNTPEGC 50 MYLS+V + + + +Q+ + L HL FP R AA L H+ + ++ G Sbjct: 1 MYLSRVQV----NTNDHQIFKHLTHLGAYHDWVKRSFP-REIAAGTRLRHLWRLDSLNGR 55 Query: 51 HVLL-------QSAQMPVSTAVATVIKTKQVE---FQLQVGVPLYFRLRANPIKTILDNQ 100 LL + AQ+ VA +TK + L+ G L FRL ANP + I Sbjct: 56 DYLLVLSPDAPELAQL-ARYGVAGTAQTKDYDPFVTALRQGQRLRFRLTANPTRAIATPG 114 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRK---LGNAARVEDVHPI-----SERPQYFSGD 152 +R G++ P + A+Q+AWL + LG ++D P + P Sbjct: 115 QR----GHV----APHVTVAQQMAWLSERAAALGFELPIDDDGPQFQIVGRDYPALRRAQ 166 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 GK ++ V FEG L ++D + + GIG K+ G GLL++ P Sbjct: 167 GKPVRLSRVSFEGTLVVSDLVRFKETLATGIGREKAFGMGLLTVIP 212 >UniRef50_B1LQ79 CRISPR-associated protein, Cse3 family n=54 Tax=Enterobacteriaceae RepID=B1LQ79_ECOSM Length = 216 Score = 56.6 bits (135), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 58/219 (26%), Positives = 92/219 (42%), Gaps = 28/219 (12%) Query: 1 MYLSKVIIARAW----------SRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLS++ + R Y +HQ LW LFP + R FL+ E+ Sbjct: 1 MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKE--RQFLYRREELQGAFRF 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD------ 104 VL S + P + T I+ + +L G L F LRANP T+ KR D Sbjct: 59 FVL--SQERPAESETFT-IECRSFAPELHTGQSLCFNLRANP--TVCKAGKRHDLLMEAK 113 Query: 105 --SKGNIKRCRVPLIKEAEQIAWLQRKLGNAA-RVEDVHPISERPQYFSGDGKSGKIQ-- 159 +G + V L ++ + WL + + + D + R Q + IQ Sbjct: 114 RQVRGQAEGRNVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFS 173 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 +V + G+LT+ D + + G G +++ GCGL+ + P Sbjct: 174 SVDYTGMLTVTDPGLFLQRLCLGYGKSRAFGCGLMLIKP 212 >UniRef50_D2RB03 CRISPR system CASCADE complex protein CasE n=4 Tax=Bacteria RepID=D2RB03_GARVA Length = 215 Score = 56.2 bits (134), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 37/149 (24%), Positives = 73/149 (48%), Gaps = 7/149 (4%) Query: 54 LQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCR 113 LQ +M + A+ + L G+ + FR+ NP+ +I DN + ++G + Sbjct: 69 LQRLEMYGVSGTASSKTYDKFLGSLMNGMRMQFRVTLNPVVSISDNAETHTARGRV---- 124 Query: 114 VPLIKEAEQIAWL---QRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTIN 170 VP + +Q+ +L +KLG + + + F+ K ++ ++G+LTI+ Sbjct: 125 VPHVTYDQQMNFLLNRAQKLGFSLNENEFAIVERGYSLFTKSEKPIRLSKAVYQGILTIS 184 Query: 171 DAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 DA + + +GIG K+ G G++++ PL Sbjct: 185 DADIMRKTLLEGIGKKKAYGFGMMTVIPL 213 >UniRef50_B8FDH8 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDH8_DESAA Length = 199 Score = 55.8 bits (133), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 49/189 (25%), Positives = 76/189 (40%), Gaps = 16/189 (8%) Query: 14 RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQ 73 +D+Y +H+ +++LFP RDFLF +K G +L+ S + P+ I ++ Sbjct: 19 KDVYGVHKAVYNLFPENNGQGRDFLF-ADKGGDWNGRKILILSHREPIQPRHGA-IDCRE 76 Query: 74 VEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK---L 130 V F + NP++ + N R +P+ W +K L Sbjct: 77 VPAAFLDWDYYGFEVVLNPVR-----------RDNASRKLIPVRGRENLHEWFLKKAPGL 125 Query: 131 GNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMG 190 G + + ++ DG T F G L + D A QGIG AK+ G Sbjct: 126 GFEVEPHSLQVSRMGVEAYAKDGTMRTHNTATFIGKLRVIDPNAFKKSFAQGIGRAKAFG 185 Query: 191 CGLLSLAPL 199 GLL L PL Sbjct: 186 FGLLQLVPL 194 >UniRef50_D1NTI2 CRISPR-associated protein, Cse3 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI2_9BIFI Length = 236 Score = 55.8 bits (133), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 54/200 (27%), Positives = 89/200 (44%), Gaps = 27/200 (13%) Query: 23 LWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGV 82 LW L NR D + +V + P+ H++ Q A P T T ++ +L G Sbjct: 38 LWRLDHNRQDHS--VWLYVVSPSQPDLLHIVEQ-AGWPGYAEWETKDYTPFLD-RLAQGQ 93 Query: 83 PLYFRLRANPIK---TILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLG-NAARVED 138 ++R+ ANP++ T L+ L + +K R + +QI W +R+ N + + Sbjct: 94 QWHYRVCANPVRNAATDLNLHNSLATFDKMKGSRQAYVTVRQQIDWFERRAAANGFSLPE 153 Query: 139 VHPIS------------------ERPQYFSGDGKSG-KIQTVCFEGVLTINDAPALIDLV 179 P+S +R ++ D K+ + T FEG L + D +L + Sbjct: 154 RDPVSGFDEQVKDPLLLSSVRVIDRQRHKFRDRKNQVTLSTAVFEGTLQVEDPQSLRHAL 213 Query: 180 QQGIGPAKSMGCGLLSLAPL 199 GIG AK GCGL++LAP+ Sbjct: 214 CFGIGKAKGFGCGLMTLAPI 233 >UniRef50_C7MQD7 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD7_SACVD Length = 197 Score = 55.1 bits (131), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 54/210 (25%), Positives = 88/210 (41%), Gaps = 29/210 (13%) Query: 2 YLSKVIIARAWSRDLYQLHQGLWHLF--PN--RPDAARDFLFHVEKRNTPEGCHVLLQSA 57 YL+K+ ++ +RD+++ H+ L PN P L H +R G +L QSA Sbjct: 4 YLTKITTPKSVTRDIHRTHKILTTAVCPPNITTPGRVATRLLHRVERG---GREILAQSA 60 Query: 58 ------QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKR 111 ++ +A + L G + +++ ANP+ R Sbjct: 61 TPLDPTRLEGGCVIAGTKLLDPLLDHLDNGTVVRYKITANPVHAP-------------NR 107 Query: 112 CRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKI--QTVCFEGVLTI 169 R P+ +AW R V D + + + SG + ++ QT EGV TI Sbjct: 108 VRRPITDPDRILAWWHRTADRIGLVLDSTALLDTAK-TSGMRRDQRVVVQTATMEGVATI 166 Query: 170 NDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 D + D + G+G A++ GCGLLS+ PL Sbjct: 167 RDVDTVRDAIVLGVGHARAYGCGLLSVVPL 196 >UniRef50_A1ARH5 CRISPR-associated protein, Cse3 family n=3 Tax=Bacteria RepID=A1ARH5_PELPD Length = 224 Score = 52.8 bits (125), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 67/236 (28%), Positives = 97/236 (41%), Gaps = 50/236 (21%) Query: 1 MYLSKVII---ARAWSRDL---YQLHQGLWHLF--PNRPDAARDFLFHVEKRNTPEGC-H 51 M+LS++ + R RDL YQLH L F P +FL+ +E G Sbjct: 1 MFLSRLRLNLRCREARRDLSNPYQLHSTLCRAFSPPETKCPKGEFLWRLEPETDSSGYPR 60 Query: 52 VLLQSAQMPVSTAV-----------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 +++QS +P V A +K + L+ FRLRANP T N Sbjct: 61 IIVQSRNIPDWGGVGVNGWIQQADPAIDLKERLKLDLLKAEQRFRFRLRANPCVT--KNG 118 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV------EDVHPISE--------RP 146 KRL L+K+ EQ WL+RK D + SE + Sbjct: 119 KRLG-----------LLKQDEQEKWLKRKGAQHGFCLPEFLSFDYYESSEDRIDVRISQE 167 Query: 147 QYFSGDGKSG---KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 Q S S ++ +V ++G+LTI + ++ GIG K MG GLLS+ P+ Sbjct: 168 QMLSDKQHSDNSIRVFSVLYDGILTITEPEMFKIALKTGIGHGKVMGLGLLSVVPI 223 >UniRef50_B6B784 CRISPR-associated protein, Cse3 family n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B784_9RHOB Length = 223 Score = 52.8 bits (125), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 56/226 (24%), Positives = 93/226 (41%), Gaps = 41/226 (18%) Query: 1 MYLSKVIIARAWSRDLYQL--------------HQGLWHLFPNRPDAARDFLFHVEKRNT 46 MYLS++ +AR S H+ +W F P A RDFL+ E R Sbjct: 2 MYLSRLTLARDPSVAALNALLDPDEKGAGADAHHRLIWSAFAGDPLAPRDFLWRAEGRG- 60 Query: 47 PEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTIL-DNQKRLDS 105 L+QS + PV + +++ L+ G + F LRAN K + + ++R+D Sbjct: 61 ----RFLVQSPEPPVGGPFFDPPEVRELAPDLRRGDQVSFLLRANATKDLRGEKRRRVDV 116 Query: 106 KGNI--------KRCRVPLIKEAEQIAWLQRKLGNAAR---------VEDVHPISERPQY 148 N+ ++ R + + W+ G AAR V+D ++ P + Sbjct: 117 VMNLLHDVPKAERQIRRMALAQQAAGEWMA---GQAARAGFCADHLEVQDYSTLTL-PGH 172 Query: 149 FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 S + + + G +T+ D + + QG G AK GCGL+ Sbjct: 173 RSRRRGAPRFGILDLTGRITVTDPQVFLAKLAQGFGRAKGFGCGLM 218 >UniRef50_B6XT65 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT65_9BIFI Length = 233 Score = 52.4 bits (124), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 61/231 (26%), Positives = 97/231 (41%), Gaps = 38/231 (16%) Query: 1 MYLSKVII--ARAWSRDL----YQLHQGLWHLFPNRPDAARD------------------ 36 M++S++ + AR +R L Y+LH + FP P+A R+ Sbjct: 1 MFISRIPLNKARYGARQLIGSPYKLHAAVECAFP--PNAVRNNDEGRILWRLDTSVNDNA 58 Query: 37 FLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPI-KT 95 +V P+ H++ Q A P T +E ++ G +F+LRANP K Sbjct: 59 VWLYVVSPEKPDFMHIVEQ-AGWPTHVEWETKNYEPLLE-RIAKGQQWHFKLRANPARKA 116 Query: 96 ILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK--------LGNAARVEDVHPISERPQ 147 D +R S G + + + + + +Q+ WL + L + DV + Sbjct: 117 KEDKGRRHRSDGIVGKVQGHITVD-QQLQWLIDRSASHGFTILNDQNDQPDVVVKERHKE 175 Query: 148 YFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 F + + T FEG L + DA + QGIG AK GCGLL++AP Sbjct: 176 NFKRADATVTLVTAVFEGRLEVTDAELFRKALCQGIGRAKGFGCGLLTIAP 226 >UniRef50_C2BET7 CRISPR-associated protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BET7_9FIRM Length = 215 Score = 52.0 bits (123), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 60/221 (27%), Positives = 97/221 (43%), Gaps = 31/221 (14%) Query: 1 MYLSKV---IIARAWSRDLYQL---HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS+V I R +DL L H + H FP D L+ ++ N + ++++ Sbjct: 1 MYLSRVEIDINNRRKMKDLTHLGCYHGWVEHSFPQENDIRTRKLWRID--NIGDKYYLII 58 Query: 55 QSAQMPVSTAVAT--VIKTKQV----EF--QLQVGVPLYFRLRANPIKTILDNQKRLDSK 106 S +P + V T +V EF L+ G+ FR++ N + ++D + Sbjct: 59 LSEYIPDKEKLEKYGVESTTEVKDYDEFLASLKEGIRAKFRIKLNTVIA------KIDKE 112 Query: 107 GNIKRCRV-PLIKEAEQIAWLQRKLGNAARVE-DVHPISE-RPQYFSGDGKSGK------ 157 + KR R+ P+ E + + N V+ D IS+ +YF K K Sbjct: 113 NSTKRGRIMPVPNEKLNGFLVDKAQRNGFEVKTDEFGISKIDKEYFMNFDKEDKKKSRKN 172 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 I + +EG+LTI D + GIG K+ GCG L++ P Sbjct: 173 IVSATYEGMLTITDLEKFKVALVNGIGKKKAYGCGFLTIIP 213 >UniRef50_A9HLC4 CRISPR-associated protein, Cse3 family n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HLC4_GLUDA Length = 228 Score = 51.6 bits (122), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 54/211 (25%), Positives = 90/211 (42%), Gaps = 27/211 (12%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+++++ R H LW LF + PD RDFL+ E ++ SA+ PV Sbjct: 20 LARLLVPDGEGRQHAAAHHLLWALFGDDPDRTRDFLW-----RQMEAGRFMVLSAREPVD 74 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRAN-------PIKT-------ILDNQKRLDSKGN 108 + ++T+ + L+ G L F LRAN P +T ++D R + Sbjct: 75 SHGLFDVETRPFDPLLKEGDRLRFLLRANATVDRKTPGRTRSQRHDVVMDALHRRSQREG 134 Query: 109 IKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCF----- 163 + R +I +A + W+ R+ G A P+ + +SG V F Sbjct: 135 AE-ARDSMIADALET-WMGRQ-GVRAGFAPASPLVIEGRDVLRIPRSGGRGIVSFGVVNL 191 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 G + + A +D + QG G A++ GCGL+ Sbjct: 192 TGEVRVTAPDAFLDSLMQGFGRARAFGCGLM 222 >UniRef50_C0VRW4 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51867 RepID=C0VRW4_9CORY Length = 220 Score = 51.2 bits (121), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 55/199 (27%), Positives = 80/199 (40%), Gaps = 40/199 (20%) Query: 12 WSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKT 71 W D + L+ + P RPD A + ++ ST A Sbjct: 47 WRLDQHDNEHILYIVGPERPDTAE-------------------LADRLGWSTRPAQTADY 87 Query: 72 KQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL-QRKL 130 ++ L G F L ANP ++ KR S VPL + +QI WL QR Sbjct: 88 DKLLSSLAKGQQWCFELLANPSISLKTGGKRGKS--------VPLARIDQQIDWLLQRSE 139 Query: 131 GNAARV--------EDVHPISERPQYFSGDGKSGK----IQTVCFEGVLTINDAPALIDL 178 N +V D+ + + FS + + K + TV FEG L + DA AL Sbjct: 140 KNGFKVLPQGDSAEPDLRIANRKVMRFSKNPRDHKRTVALTTVRFEGTLEVTDAEALRAT 199 Query: 179 VQQGIGPAKSMGCGLLSLA 197 + QGIG ++ G GL++LA Sbjct: 200 LTQGIGKGRAYGLGLMTLA 218 >UniRef50_C2KP48 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP48_9ACTO Length = 212 Score = 50.8 bits (120), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 57/200 (28%), Positives = 89/200 (44%), Gaps = 29/200 (14%) Query: 20 HQGLWHLFPN-----RPDAARDFLFHVEKRNTPEGC--HVLLQSAQMPVSTAVATVIKTK 72 H+ + LFP P AA LF +E T G ++QS P + ++ Sbjct: 22 HRAVMDLFPEFEGEQNPRAAASILFRLE---TLPGLAPRFVVQSDISPAVDKLPKGVEPL 78 Query: 73 QVEF-QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIA-----WL 126 F +L G P+ FRL NP+ + + + D + P KE + A WL Sbjct: 79 GYTFPELGEGTPVSFRLAVNPV---IRHSQGKDGQPARTTTVAPFGKEPAESAASLETWL 135 Query: 127 QRKLGNAARVEDVHPISERPQYFSGDG-------KSGKIQTVCFEGVLTINDAPALIDLV 179 +KL + + +V+ I+ + + GDG K +I +GV + DA L ++ Sbjct: 136 SQKL--SPGLAEVNIINAQREII-GDGYPNQDISKIKRIVIDLVDGVACVGDAKTLNKML 192 Query: 180 QQGIGPAKSMGCGLLSLAPL 199 + G+G AKS GCGLLS+ L Sbjct: 193 RSGVGRAKSYGCGLLSVKQL 212 >UniRef50_A8LYZ8 CRISPR-associated protein, Cse3 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ8_SALAI Length = 206 Score = 50.8 bits (120), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 58/198 (29%), Positives = 86/198 (43%), Gaps = 26/198 (13%) Query: 14 RDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCHVLLQSA------QMPVST 63 RD LH+ + L P+ +P LF ++ +T G +L+Q+ ++P Sbjct: 22 RDTTALHRRVMSLVPDGLGEQPRHHAGVLFRLD--HTTTGPMLLVQTTLPPDPNRLPDGY 79 Query: 64 AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLI-KEAEQ 122 A + L G+ +++R+ AN K KGN V L ++AEQ Sbjct: 80 AAVDTRDVSPLLKALTNGMAMHYRIAANASKRAW--------KGNSAGKVVALSGQQAEQ 131 Query: 123 IAWLQRKLGNAARVEDVHPISERPQYFS-GDGKSGKIQTVCFEGVLTINDAPALIDLVQQ 181 W QRK A D+ + +PQ + G + FEG I DA + V Sbjct: 132 --WWQRK--AEATGLDLRHLRAQPQPAARGRAIPVRHAITLFEGQAVITDADQVRAAVLA 187 Query: 182 GIGPAKSMGCGLLSLAPL 199 GIG +S GCGLLSLAP+ Sbjct: 188 GIGRGRSFGCGLLSLAPM 205 >UniRef50_C7MTM6 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTM6_SACVD Length = 241 Score = 50.4 bits (119), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 43/145 (29%), Positives = 67/145 (46%), Gaps = 25/145 (17%) Query: 77 QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK--LGNAA 134 QL G FR+ ANP++ + + + + K R +++ + A Q++ L A Sbjct: 94 QLGEGREFAFRVTANPVQNVPAPAQESTPEPSAKPGRA--VRKGHRTAAHQQRWFLERAE 151 Query: 135 R-----------------VEDVHPISERPQYFSGDGKSGK---IQTVCFEGVLTINDAPA 174 R V D+ I++R + K GK + T FEG L I D+ Sbjct: 152 RWGFQVPPALLDDPEADDVPDMR-ITQRQRLSFAKRKGGKPVILTTATFEGRLRITDSEL 210 Query: 175 LIDLVQQGIGPAKSMGCGLLSLAPL 199 + +G+GPAK+ GCGLL+LAPL Sbjct: 211 FTRTLLRGLGPAKAYGCGLLTLAPL 235 >UniRef50_B1VIX9 CRISPR-associated protein n=6 Tax=Actinomycetales RepID=B1VIX9_CORU7 Length = 234 Score = 50.1 bits (118), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 54/205 (26%), Positives = 78/205 (38%), Gaps = 45/205 (21%) Query: 23 LWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSA--QMPVSTAVATVIKTKQVEFQLQV 80 LW + P + + +V P G ++ Q+ +P TA + K L Sbjct: 46 LWRVDPGE----HEHVLYVVGPEKPTGAVLVEQAGWDTLPAQTADYSRFLGK-----LTR 96 Query: 81 GVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRK-----LGNAAR 135 G F L ANP + R +G +K + QI WL RK G A R Sbjct: 97 GQRWRFELVANPTYA----EPRKGGRGKVK----AHVSVRHQIGWLYRKADAAGFGLAPR 148 Query: 136 VEDVHPISERPQYF---------------------SGDGKSGKIQTVCFEGVLTINDAPA 174 ++D ER ++ G G+ +I F G L + D Sbjct: 149 LDDEVSDEERSRWSEFDAPQVTERWTDVFHRNKAGGGRGRPVRIAKARFTGTLEVTDPEL 208 Query: 175 LIDLVQQGIGPAKSMGCGLLSLAPL 199 L + QGIG A+ GCGLL+LAP+ Sbjct: 209 LRQALAQGIGRARGYGCGLLTLAPI 233 >UniRef50_Q0BSC8 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC8_GRABC Length = 227 Score = 50.1 bits (118), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 47/193 (24%), Positives = 85/193 (44%), Gaps = 29/193 (15%) Query: 20 HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQ 79 H LW +F + + RDFL+ E+ +G + L SA+ P+ + + + K L Sbjct: 41 HHLLWSVFADSEERKRDFLWREER----DGSFLTL-SARPPLQSDLFQPHRIKSYAPDLA 95 Query: 80 VGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQI------------AWLQ 127 G L F LRAN + KR + ++ + + ++++E+ AWL+ Sbjct: 96 PGARLEFLLRANATRM-----KRGGKREDVVKAPIDALEQSERAERRMEIASSAGKAWLE 150 Query: 128 RKLGNAARVEDVHPISER------PQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQ 181 ++ G + + I+E P+ + D + + + G L + D + + Q Sbjct: 151 QQ-GEKSGFRVITAIAEDYRQLSLPRLGAIDRNAMTLGILDLSGHLEMTDPALFLTNLAQ 209 Query: 182 GIGPAKSMGCGLL 194 G G AKS GCGL+ Sbjct: 210 GFGRAKSFGCGLM 222 >UniRef50_Q47PI8 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobifida fusca YX RepID=Q47PI8_THEFY Length = 207 Score = 49.3 bits (116), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 55/195 (28%), Positives = 83/195 (42%), Gaps = 29/195 (14%) Query: 23 LWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGV 82 LW + +R A FL+ V P+ H L++ A P + T + +L G Sbjct: 23 LWRI--DRTSRAEVFLYIVSP-PKPDLTH-LVEQAGWPTQPTWESYDYTPFLS-RLAKGD 77 Query: 83 PLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV------ 136 FRL ANP+ +I G + L + ++ LQR+ RV Sbjct: 78 VWAFRLTANPVHSIRRKA------GEPTKLTAHLTQRYQKKWLLQRQDAAGFRVVEKPAE 131 Query: 137 ------EDVHPI---SERPQYFSGDGKSGK---IQTVCFEGVLTINDAPALIDLVQQGIG 184 D H + + R FS + G+ + TV F+G L + D AL + GIG Sbjct: 132 KRRLPEGDEHELIVHNRRDWNFSKGARKGRPVSLVTVTFDGRLEVTDPDALRRALISGIG 191 Query: 185 PAKSMGCGLLSLAPL 199 AK+ GCGL++LAP+ Sbjct: 192 RAKAYGCGLMTLAPV 206 >UniRef50_C2CRP4 Putative uncharacterized protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CRP4_CORST Length = 185 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 46/156 (29%), Positives = 71/156 (45%), Gaps = 20/156 (12%) Query: 46 TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQV---GVPLYFRLRANPIKTILDNQKR 102 +P+ H++++ + PV A T+ V Q+ + + L NPI L + Sbjct: 38 SPDTKHLVVRH-ETPVDWIKAIRGVTQAVTLPTQIPAASARINYALIGNPI---LSQYQG 93 Query: 103 LDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTV- 161 + +G K+ P K E WLQR++GNA + + P GK +QT+ Sbjct: 94 PNKRG--KKTPAPPEKWNE---WLQRRVGNALNLHSIDGTRLPP----AKGKKPDMQTIH 144 Query: 162 ---CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 F G T+ D AL L++ GIG K+ GCGLL Sbjct: 145 HRILFTGRATVKDQDALQTLMESGIGSGKAYGCGLL 180 >UniRef50_A8M405 CRISPR-associated protein, Cse3 family n=3 Tax=Actinomycetales RepID=A8M405_SALAI Length = 227 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 52/229 (22%), Positives = 96/229 (41%), Gaps = 38/229 (16%) Query: 1 MYLSKVII--ARAWSRDLYQ----LHQGLWHLFPNRPDAARD---------------FLF 39 MYL++ ++ AR +R L +H + FP D RD + Sbjct: 1 MYLTRFLVNPARRGARKLLASPQAMHAAVLSGFPRPEDHTRDGARTLWRLDHRQDRQVVL 60 Query: 40 HVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN 99 +V P+ H++ Q+ + AT ++ ++ L G FRL ANP + N Sbjct: 61 YVVSPTAPDLTHMVEQAGWPSNAETWATRPYSRLLD-SLDKGQRWAFRLTANPARAGRRN 119 Query: 100 QKRLDSKGNIKRCRVPLIKEAEQIAWLQRK-----LGNAARVE---DVHPISERPQYFSG 151 Q ++ R + +Q+ WL R+ G + + ++ + R F+ Sbjct: 120 QDTPTTQ------RYGHVTPVQQVEWLTRRAERNGFGVVRQTDGELNLITYNRRVHRFTR 173 Query: 152 DGKSGKIQ--TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + T ++GVL +++ ++ +GIG A++ GCGLL++AP Sbjct: 174 GHTQRPVTLVTATYDGVLEVDEPTLFRGVLTRGIGHARAYGCGLLTVAP 222 >UniRef50_D1A5U1 CRISPR-associated protein, Cse3 family n=2 Tax=Actinomycetales RepID=D1A5U1_THECD Length = 229 Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 42/139 (30%), Positives = 63/139 (45%), Gaps = 25/139 (17%) Query: 77 QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL-QRKLGNAAR 135 +L G FRL ANP+ T +R D++ V + Q+ WL QR+ R Sbjct: 96 RLAKGEEWAFRLTANPVHTA----RRNDTEPTKITAHVGM---RHQMQWLLQRQEAAGFR 148 Query: 136 VE------------DVHPISERPQYF-----SGDGKSGKIQTVCFEGVLTINDAPALIDL 178 V DVH + R + G+ + + TV F+G L + D AL Sbjct: 149 VVEKPRERQLIPGVDVHELVIRERRHLEFRKRGNSRPVTLVTVTFDGRLEVTDPDALRRT 208 Query: 179 VQQGIGPAKSMGCGLLSLA 197 + +G+G AK+ GCGL++LA Sbjct: 209 LTRGLGRAKAYGCGLMTLA 227 >UniRef50_C7MTL5 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTL5_SACVD Length = 207 Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 51/210 (24%), Positives = 83/210 (39%), Gaps = 31/210 (14%) Query: 12 WSR---DLYQLHQGLWHLFP----NRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTA 64 WSR D LH+ + L P N+ + LF E +T G VL Q + P Sbjct: 2 WSRGTLDGGALHRDIMRLAPDALGNQARKEANVLFRAE--HTQRGLQVLAQLSCAPRVDN 59 Query: 65 VA-----TVIKTKQVEF---QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPL 116 +A + + +E + G + +R+ ANP K RL + K+ R+ + Sbjct: 60 LAPDFAHGTPECRNIESLVSSMHSGTRVRYRIDANPTK-------RLGNSAGDKKGRLAV 112 Query: 117 IKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC-------FEGVLTI 169 + A+ W R+ + S P + + + FEG + Sbjct: 113 LHGADAAEWWHRRAAESGLELLSATASAMPDILGSRNRDRRGRCRATSHGVTRFEGFAVV 172 Query: 170 NDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 D + V +GIG A++ GCGLLS+ P+ Sbjct: 173 ADPGKVRSAVVEGIGRARTYGCGLLSIVPV 202 >UniRef50_D1YEE5 CRISPR system CASCADE complex protein CasE n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE5_PROAC Length = 221 Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 21/132 (15%) Query: 78 LQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVE 137 L+ G FR NP + +K S+G RV + +Q+ W +G R Sbjct: 97 LRSGSTWRFRCTINPTTAV---RKSAGSRGQ----RVAEVTAEQQLTWF---IGRVERHG 146 Query: 138 DVHPISERPQ-----------YFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPA 186 P++++ F + + +GV+ I DA A + QGIGPA Sbjct: 147 YTVPVNDQGAPSAQVTRREILRFRRQRSTVTLAVTQVDGVIQIQDADAARLALVQGIGPA 206 Query: 187 KSMGCGLLSLAP 198 KS GCGL++LAP Sbjct: 207 KSYGCGLMTLAP 218 >UniRef50_C9M2Y7 CRISPR-associated protein n=3 Tax=Lactobacillus RepID=C9M2Y7_LACHE Length = 217 Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 45/132 (34%), Positives = 59/132 (44%), Gaps = 16/132 (12%) Query: 77 QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL-QRKLGNAAR 135 QL G FRL ANP I D SK R VP I +Q WL +R + Sbjct: 91 QLVEGKKYRFRLTANPTYRITD------SKSGKSRV-VPHITILQQTNWLLERTKKHGFE 143 Query: 136 V----EDVHP--ISER--PQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAK 187 + E+V+ ISER P+ K+ V F+G+L I D + GIG K Sbjct: 144 IVRDSEEVYKLNISERDWPRLRRKGNHLIKLSRVTFDGILQITDLSKFKLALINGIGREK 203 Query: 188 SMGCGLLSLAPL 199 + G GLL++ PL Sbjct: 204 AYGMGLLTVIPL 215 >UniRef50_C5V9N5 CRISPR-associated protein, Cse3 family n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N5_9CORY Length = 220 Score = 47.0 bits (110), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 50/193 (25%), Positives = 91/193 (47%), Gaps = 29/193 (15%) Query: 20 HQGLWHLFPNRPD----AARDFLFHVEKRNTP-EGCHVLLQSAQMPVSTAVATVIKTKQV 74 H+ + LFP+ D + + LF E P + + L+QS V+ + VI+TKQV Sbjct: 35 HRAVMGLFPDFEDNQARSRNNILFRYEF--IPGQAPYFLVQSDCDVVAPDLEGVIETKQV 92 Query: 75 EF-QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPL----------IKEAEQ- 122 E+ + G P+ FRL N + ++ +++ G + P+ + AE+ Sbjct: 93 EYPSYENGTPIIFRLALNTV-----TRRTIETNGRKREVITPVALQPLDAETGLNPAEKH 147 Query: 123 IAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQG 182 +A+ KL A ++ + ++ Q S +Q F+ + + ++ AL ++ G Sbjct: 148 VAY---KLSTA--LQGIEFLNHNRQVLQVPKVSRALQIDTFDCMGVVTNSQALEHIMHAG 202 Query: 183 IGPAKSMGCGLLS 195 IG AK+ GCGLL+ Sbjct: 203 IGRAKAYGCGLLT 215 >UniRef50_Q0RTG6 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RTG6_FRAAA Length = 278 Score = 47.0 bits (110), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 55/201 (27%), Positives = 79/201 (39%), Gaps = 27/201 (13%) Query: 23 LWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGV 82 LW L RP A + L E R P H++ Q+ A V + + ++Q G Sbjct: 70 LWRLETGRPHRA-EVLILTESR--PSWEHLIEQAGWPNAEDPQALVRDYQPLLDRIQAGR 126 Query: 83 PLYFRLRANPIKTILD------NQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKL---GNA 133 FRLRANP+ QK + + RV +Q+AW ++ G Sbjct: 127 EFAFRLRANPVAATRQPTSPSVAQKERLAGPRPRGVRVAHRTAGQQLAWFTDRVDRWGFT 186 Query: 134 ARVEDVHP-----------ISERPQYFSGDGKSGKIQ----TVCFEGVLTINDAPALIDL 178 + P +RP GK+ Q T F+G L + D Sbjct: 187 PLTTETGPAVQLNARERLTFRKRPPDGGNGGKNKGHQVVLSTATFDGALRVVDPDLARRA 246 Query: 179 VQQGIGPAKSMGCGLLSLAPL 199 + G+G AK+ GCGLL+LAPL Sbjct: 247 LLSGVGAAKAYGCGLLTLAPL 267 >UniRef50_Q2JH26 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JH26_FRASC Length = 275 Score = 46.2 bits (108), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 3/57 (5%) Query: 146 PQYFSGDGKSGK---IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 P+ K+G+ + T FEG L + D + G+GPAK+ GCGL++LAPL Sbjct: 212 PKKAKNTEKTGRRVVLNTATFEGALRVTDPARARATLLHGVGPAKAYGCGLITLAPL 268 >UniRef50_B0LU87 CRISPR-associated protein Cas3 n=2 Tax=Streptomyces RepID=B0LU87_9ACTO Length = 270 Score = 46.2 bits (108), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 54/225 (24%), Positives = 82/225 (36%), Gaps = 51/225 (22%) Query: 23 LWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGV 82 LW + P+ P R LF V P+ H++ A V + QL VG Sbjct: 44 LWRMDPDNPH--RPHLF-VLSPTRPDWTHIIQDCGWPDADGDHAAVRDYTPLLSQLAVGR 100 Query: 83 PLYFRLRANPIKTILDNQK-------RLDSKGN----IKRCRVPLIKEAEQIAWLQRKLG 131 FRL A+P++ K RL + I+ R+ A Q+ W + Sbjct: 101 EFAFRLTASPVQNTATPTKATPAQAARLTAHAEDGKRIRGFRMGHRTAAAQLDWFLTRTD 160 Query: 132 N------AARVEDVHP----------------------------ISERPQY-FSGDGKSG 156 A R + P I+ R ++ F +G Sbjct: 161 RWGFDIPATRSDPTAPGIHAPTPPTAPRPTSPPRPDPNPPYEVRITARHRHSFQKNGHGA 220 Query: 157 KI--QTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + ++ FEG L I D + G+GP+++ GCGLL+LAPL Sbjct: 221 HVVFRSATFEGRLRITDTDRFTTSLLTGLGPSRAYGCGLLTLAPL 265 >UniRef50_A8SDR6 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDR6_9FIRM Length = 195 Score = 45.8 bits (107), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 9/115 (7%) Query: 86 FRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL---QRKLGNAARVEDVHPI 142 FRL ANP K+ D Q C K+ WL K G A R E Sbjct: 81 FRLTANPTKSCKDTQNPAARGTVAAHCTTQYQKQ-----WLLERAAKRGFALREEGFTVT 135 Query: 143 SERPQYFSGDG-KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 + Q+F+ G + + V +EGVL + DA L+ QG+G K+ G GL+++ Sbjct: 136 RVQWQHFAKHGTRPVTLLAVTYEGVLQVTDAEQFRALLCQGMGRGKAYGLGLMTV 190 >UniRef50_C7QEM3 CRISPR-associated protein, Cse3 family n=9 Tax=Actinomycetales RepID=C7QEM3_CATAD Length = 236 Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust. Identities = 50/200 (25%), Positives = 81/200 (40%), Gaps = 37/200 (18%) Query: 23 LWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGV 82 LW L N A L ++ + P+ H++ Q A P + + + ++ +L G Sbjct: 48 LWRLDRN---ANNQVLLYIVSPDRPDLTHIVEQ-AGWPTTGSWDSFAYAPFLD-KLTAGD 102 Query: 83 PLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPI 142 FRL ANP+ +I ++ R I Q+ WL ++ A P Sbjct: 103 IWTFRLTANPVHSIR-------TRDGEPTKRTAHITVRHQLGWLLKQQERAGFTICEQP- 154 Query: 143 SERPQYFSGD-------------------GKSGKIQ-----TVCFEGVLTINDAPALIDL 178 E P+ D +S KI TV ++G L I+D + + Sbjct: 155 KELPRPTDMDEYQVVVHDRRSLDFTKKDPARSSKINNVQILTVTYDGRLRIDDPDKVRAV 214 Query: 179 VQQGIGPAKSMGCGLLSLAP 198 + G+G AK+ GCGL++LAP Sbjct: 215 LTTGLGKAKAYGCGLMTLAP 234 >UniRef50_Q3A5Z3 CRISPR-associated protein, Cse3 family n=2 Tax=Desulfuromonadales RepID=Q3A5Z3_PELCD Length = 299 Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust. Identities = 17/42 (40%), Positives = 27/42 (64%) Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 G+ +V F+GVL +N++ ++ GIGPAK+ GCGL+ Sbjct: 253 GRDAGFSSVDFDGVLQVNNSELFQAMLFNGIGPAKAFGCGLM 294 >UniRef50_C7JIG8 CRISPR-associated protein Cse3 n=8 Tax=Acetobacter pasteurianus RepID=C7JIG8_ACEP3 Length = 229 Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust. Identities = 58/217 (26%), Positives = 89/217 (41%), Gaps = 38/217 (17%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+ +++ + R H LW LF + RDFL+ R T G H ++ SA+ PV Sbjct: 20 LAGLLVPQGEGRQHGAAHHLLWVLFGDDSSRIRDFLW----RQTEPG-HFMILSARKPVD 74 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPI--------------KTILDNQKRLDSKGN 108 + I++++ +L+ G L F LR N ++D +L +K Sbjct: 75 SHRLFEIESREFTPKLREGNRLRFLLRVNATVDRKVPGRKRSQRHDVVMDALYKLPAKER 134 Query: 109 IKRCRVPLIKEAEQIAWLQRKLGNAARVE-----------DVHPISERPQYFSGDGKSGK 157 R L+ A + AWL R+ G+ E DV I R Q G GK+ Sbjct: 135 AA-ARESLVPTAME-AWLARQ-GHRTGFELKEGKLAIESCDVLHIP-RAQ---GQGKA-T 186 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 V G L + + QG G A++ GCGL+ Sbjct: 187 FGVVDVTGELCVRTPDLFTQALMQGFGRARAFGCGLM 223 >UniRef50_D0WFC7 CRISPR-associated protein, Cse3 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC7_9ACTN Length = 255 Score = 43.5 bits (101), Expect = 0.004, Method: Compositional matrix adjust. Identities = 47/157 (29%), Positives = 68/157 (43%), Gaps = 36/157 (22%) Query: 77 QLQVGVPLYFRLRANPIKTILDNQKRL---DSKGNIKRCRVPLIKEAEQIAWLQRK---L 130 ++++G FRL ANP+ + R + KG KR + + +Q AWL K L Sbjct: 98 RIEIGQEYAFRLFANPVLSRSTRGGRTVPRNEKGKPKR--IGHLTVLQQAAWLIGKDAYL 155 Query: 131 GNAARVEDV--HPISERPQ----------------YFSGDGK----SGK------IQTVC 162 G+ V ++ H R Q S GK SG+ + T Sbjct: 156 GSGLEVPELFAHQEWNRAQRNGFEVLTNLDGTARLIVSHSGKQKLRSGRESCPITLSTAQ 215 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+G L ++D L + GIG AK GCGLL+LAP+ Sbjct: 216 FDGFLRVSDPDLLRSALVNGIGHAKGFGCGLLTLAPM 252 >UniRef50_B5GY64 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY64_STRCL Length = 312 Score = 43.5 bits (101), Expect = 0.004, Method: Compositional matrix adjust. Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 3/60 (5%) Query: 142 ISERPQYFSGDGKSGK---IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 I+ R ++ G+ G + +EG+L + D L + GIGP+K+ GCGLL+LAP Sbjct: 246 ITARQRHTFSKGRRGTQVTFHSATYEGLLRVTDPELLAARLLGGIGPSKAYGCGLLTLAP 305 >UniRef50_D0MET7 CRISPR-associated protein, Cse3 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET7_RHOM4 Length = 265 Score = 43.5 bits (101), Expect = 0.005, Method: Compositional matrix adjust. Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 11/92 (11%) Query: 116 LIKEAEQIAWLQRKLG----NAARVEDVH----PISERPQYFSGDGKSGKI---QTVCFE 164 L +EA WL+R++ ARVE V I + +G +S + V E Sbjct: 171 LDREAVYAEWLRRQMARPEKGGARVETVRMTRFSIERMTRRTNGSSRSVTVIQRPDVTLE 230 Query: 165 GVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 GVLT+ D+ A + ++++G+G S G G+L L Sbjct: 231 GVLTVTDSAAFMRMLRRGVGRHTSFGYGMLKL 262 >UniRef50_A8LMM7 CRISPR-associated protein n=2 Tax=Alphaproteobacteria RepID=A8LMM7_DINSH Length = 263 Score = 43.1 bits (100), Expect = 0.005, Method: Compositional matrix adjust. Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 1/82 (1%) Query: 118 KEAEQIAWLQRKLGNAARVEDVHPISERPQYFS-GDGKSGKIQTVCFEGVLTINDAPALI 176 +E AWL + G AA +E V + R + + DG+ + G LT+ DA A Sbjct: 174 RETVYAAWLADRFGPAAELEQVTLAAFRRSFAARKDGRGCEGPDATLHGTLTVGDAKAFA 233 Query: 177 DLVQQGIGPAKSMGCGLLSLAP 198 + + +G+G K+ G G+L + P Sbjct: 234 ERLHRGVGRHKAYGYGMLLIRP 255 >UniRef50_Q6NEQ5 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ5_CORDI Length = 228 Score = 42.4 bits (98), Expect = 0.010, Method: Compositional matrix adjust. Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 3/76 (3%) Query: 125 WLQRKLGNAAR-VEDVHPISE--RPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQ 181 W+Q+KL A R VE ++ E ++ G S IQ +G + D L +L+ Sbjct: 151 WVQKKLNGALRNVEILNHQREVIGTKHRGGKAASMTIQIDTVDGFGIVEDPELLNELILH 210 Query: 182 GIGPAKSMGCGLLSLA 197 G+G AK+ GCGLLS++ Sbjct: 211 GVGRAKAYGCGLLSVS 226 >UniRef50_Q4JWK1 Putative uncharacterized protein n=2 Tax=Corynebacterium jeikeium RepID=Q4JWK1_CORJK Length = 224 Score = 42.0 bits (97), Expect = 0.011, Method: Compositional matrix adjust. Identities = 31/88 (35%), Positives = 41/88 (46%), Gaps = 6/88 (6%) Query: 116 LIKEAEQIAWLQRKLGNAARVEDVHPISERPQY-FSGDGKSGK-----IQTVCFEGVLTI 169 L+ EA Q+ W K + I ER FS K+ K I TV + G L I Sbjct: 129 LVGEAAQLEWFNTKAKSCGFTPLETLIVERKTLRFSKLAKNPKGRQVVIGTVRYRGTLQI 188 Query: 170 NDAPALIDLVQQGIGPAKSMGCGLLSLA 197 +D + +GIG K+ GCGLL+LA Sbjct: 189 DDVETFKKSLVEGIGRGKAYGCGLLTLA 216 >UniRef50_Q47PJ5 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ5_THEFY Length = 232 Score = 40.0 bits (92), Expect = 0.043, Method: Compositional matrix adjust. Identities = 41/130 (31%), Positives = 59/130 (45%), Gaps = 9/130 (6%) Query: 77 QLQVGVPLYFRLRANPIKTI---LDNQKRLDSKGNIKRCRV---PLIKEAEQIAWLQRKL 130 +L G + +R+ A+P K + +N +RL K K+ R L A + W R Sbjct: 95 RLDKGSRVRYRIVASPTKRLGRSENNTQRLGLKEPPKKPREYTWALRGAAAEEWWHSRAA 154 Query: 131 GNAARVEDVHPISERPQYFSGDG-KSGKIQ--TVCFEGVLTINDAPALIDLVQQGIGPAK 187 N + + + G +S KI+ V F+G I+D A+ V GIG K Sbjct: 155 ANGLELLSTYAQTLDDVRDPGTADRSRKIRHPAVRFDGEAVISDVDAVRHAVLNGIGRGK 214 Query: 188 SMGCGLLSLA 197 S GCGLLSLA Sbjct: 215 SYGCGLLSLA 224 >UniRef50_C4ZJY2 CRISPR-associated protein, Cse3 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY2_THASP Length = 238 Score = 40.0 bits (92), Expect = 0.048, Method: Compositional matrix adjust. Identities = 39/133 (29%), Positives = 55/133 (41%), Gaps = 14/133 (10%) Query: 80 VGVPLYFRLRANPIKTILDNQKR---LDSKGNIKRCRVPLIKEAEQIAWLQRKL--GNAA 134 G L F LR P+ D ++R L L +EA + WLQR+L G+AA Sbjct: 102 AGRRLGFELRVRPVLRTKDGRERDVFLSQAEKRGVAEKELSREAVYLEWLQRELARGDAA 161 Query: 135 RVEDVHPISER--PQYFSGDGKSGK--IQTVC-----FEGVLTINDAPALIDLVQQGIGP 185 V+ R G G+ Q V F G LT+ D L+ +G+G Sbjct: 162 NVDRAQLDGFRLTSSLRKGSAVVGRRPAQRVTGPDALFSGELTVRDPAGFAALIARGVGR 221 Query: 186 AKSMGCGLLSLAP 198 ++ G G+L L P Sbjct: 222 HRAFGFGMLLLRP 234 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46897 Uncharacterized protein ygcH n=10 Tax=Enterobact... 297 2e-79 UniRef50_D0FPP5 CRISPR-associated protein, Cse3 family n=2 Tax=E... 239 4e-62 UniRef50_C5SD47 CRISPR-associated protein, Cse3 family n=1 Tax=A... 202 8e-51 UniRef50_Q74DC7 CRISPR-associated protein, CT1974 family n=2 Tax... 198 6e-50 UniRef50_D0Y917 CRISPR-associated protein, Cse3 family n=2 Tax=D... 179 5e-44 UniRef50_Q12YA7 CRISPR-associated protein n=1 Tax=Methanococcoid... 173 3e-42 UniRef50_A1SV70 CRISPR-associated protein, Cse3 family n=2 Tax=G... 172 6e-42 UniRef50_B6XT65 Putative uncharacterized protein n=2 Tax=Bifidob... 171 1e-41 UniRef50_B7KJ27 CRISPR-associated protein, Cse3 family n=1 Tax=C... 170 2e-41 UniRef50_A6W167 CRISPR-associated protein, Cse3 family n=1 Tax=M... 170 3e-41 UniRef50_Q2FNT6 CRISPR-associated protein, CT1974 n=1 Tax=Methan... 169 5e-41 UniRef50_B4RSK4 CRISPR-associated protein, Cse3 family n=5 Tax=G... 167 2e-40 UniRef50_Q0W583 Predicted CRISPR-associated protein n=1 Tax=uncu... 167 2e-40 UniRef50_Q2JWC6 CRISPR-associated protein, Cse3 family n=2 Tax=C... 167 2e-40 UniRef50_B1LQ79 CRISPR-associated protein, Cse3 family n=54 Tax=... 166 4e-40 UniRef50_B8IMR1 CRISPR-associated protein, Cse3 family n=3 Tax=A... 165 9e-40 UniRef50_D1CAI9 CRISPR-associated protein, Cse3 family n=1 Tax=S... 165 9e-40 UniRef50_B6WQ61 Putative uncharacterized protein n=1 Tax=Desulfo... 164 1e-39 UniRef50_B8GIV2 CRISPR-associated protein, Cse3 family n=1 Tax=M... 163 2e-39 UniRef50_B4TTX1 Crispr-associated protein, Cse3 family n=15 Tax=... 161 1e-38 UniRef50_B8IZA5 CRISPR-associated protein, Cse3 family n=1 Tax=D... 158 7e-38 UniRef50_Q314I5 CRISPR-associated protein, CT1974 n=2 Tax=Desulf... 157 2e-37 UniRef50_C6WMQ8 CRISPR-associated protein, Cse3 family n=1 Tax=A... 157 2e-37 UniRef50_A0LM55 CRISPR-associated protein, Cse3 family n=1 Tax=S... 156 4e-37 UniRef50_A5UR13 CRISPR-associated protein, Cse3 family n=1 Tax=R... 156 5e-37 UniRef50_A5GBK0 CRISPR-associated protein, Cse3 family n=1 Tax=G... 155 1e-36 UniRef50_A9GV72 Putative uncharacterized protein ygcH n=1 Tax=So... 154 1e-36 UniRef50_Q67RN9 Putative uncharacterized protein n=1 Tax=Symbiob... 154 1e-36 UniRef50_Q53WG9 Putative uncharacterized protein TTHB192 n=1 Tax... 154 2e-36 UniRef50_A1ARH5 CRISPR-associated protein, Cse3 family n=3 Tax=B... 153 3e-36 UniRef50_C6C417 CRISPR-associated protein, Cse3 family n=4 Tax=E... 153 4e-36 UniRef50_D2RB03 CRISPR system CASCADE complex protein CasE n=4 T... 152 8e-36 UniRef50_B6B784 CRISPR-associated protein, Cse3 family n=1 Tax=R... 151 1e-35 UniRef50_D1CGD5 CRISPR-associated protein, Cse3 family n=1 Tax=T... 151 2e-35 UniRef50_B8FDH8 CRISPR-associated protein, Cse3 family n=1 Tax=D... 150 2e-35 UniRef50_C1DSI0 CRISPR-associated protein, CT1974 n=3 Tax=Pseudo... 150 3e-35 UniRef50_Q2RY20 CRISPR-associated protein, CT1974 n=1 Tax=Rhodos... 149 6e-35 UniRef50_D1NTI2 CRISPR-associated protein, Cse3 family n=1 Tax=B... 147 2e-34 UniRef50_A8M405 CRISPR-associated protein, Cse3 family n=3 Tax=A... 146 5e-34 UniRef50_Q04QB6 Putative uncharacterized protein n=2 Tax=Leptosp... 145 1e-33 UniRef50_C2BET7 CRISPR-associated protein n=1 Tax=Anaerococcus l... 145 1e-33 UniRef50_B5GAA2 Crispr-associated protein n=1 Tax=Streptomyces s... 144 2e-33 UniRef50_A9HLC4 CRISPR-associated protein, Cse3 family n=1 Tax=G... 144 2e-33 UniRef50_Q1J366 CRISPR-associated protein, CT1974 n=2 Tax=Deinoc... 143 4e-33 UniRef50_C9M2Y7 CRISPR-associated protein n=3 Tax=Lactobacillus ... 143 5e-33 UniRef50_D1A5U1 CRISPR-associated protein, Cse3 family n=2 Tax=A... 142 6e-33 UniRef50_Q47PI8 CRISPR-associated protein, Cse3 family n=1 Tax=T... 140 2e-32 UniRef50_Q03C59 CRISPR-associated protein n=3 Tax=Lactobacillus ... 140 2e-32 UniRef50_D1A6Q6 CRISPR-associated protein, Cse3 family n=5 Tax=A... 138 1e-31 UniRef50_C7MTM6 CRISPR-associated protein, Cse3 family n=1 Tax=S... 138 1e-31 UniRef50_Q0RTG6 Putative uncharacterized protein n=1 Tax=Frankia... 137 2e-31 UniRef50_A8LYZ8 CRISPR-associated protein, Cse3 family n=1 Tax=S... 137 3e-31 UniRef50_Q0BSC8 Putative uncharacterized protein n=1 Tax=Granuli... 135 9e-31 UniRef50_C0VRW4 CRISPR-associated protein n=1 Tax=Corynebacteriu... 131 1e-29 UniRef50_B1VIX9 CRISPR-associated protein n=6 Tax=Actinomycetale... 131 1e-29 UniRef50_B6IWM2 CRISPR-associated protein, CT1974 family n=1 Tax... 131 2e-29 UniRef50_C7MTL5 CRISPR-associated protein, Cse3 family n=1 Tax=S... 129 7e-29 UniRef50_C2KP48 Putative uncharacterized protein n=1 Tax=Mobilun... 128 1e-28 UniRef50_C1XG03 CRISPR-associated protein, Cse3 family n=1 Tax=M... 128 1e-28 UniRef50_B0LU87 CRISPR-associated protein Cas3 n=2 Tax=Streptomy... 126 3e-28 UniRef50_C5V9N5 CRISPR-associated protein, Cse3 family n=1 Tax=C... 125 8e-28 UniRef50_Q2JH26 Putative uncharacterized protein n=1 Tax=Frankia... 124 2e-27 UniRef50_C7MQD7 CRISPR-associated protein, Cse3 family n=1 Tax=S... 123 3e-27 UniRef50_D1YEE5 CRISPR system CASCADE complex protein CasE n=1 T... 120 4e-26 UniRef50_A8SDR6 Putative uncharacterized protein n=1 Tax=Faecali... 116 5e-25 UniRef50_C2CRP4 Putative uncharacterized protein n=1 Tax=Coryneb... 99 8e-20 UniRef50_B2N0R4 Putative uncharacterized protein n=1 Tax=Escheri... 88 1e-16 Sequences not found previously or not previously below threshold: UniRef50_Q1R113 CRISPR-associated protein, CT1974 n=1 Tax=Chromo... 150 2e-35 UniRef50_C7QEM3 CRISPR-associated protein, Cse3 family n=9 Tax=A... 135 8e-31 UniRef50_D0WFC7 CRISPR-associated protein, Cse3 family n=1 Tax=S... 131 2e-29 UniRef50_C7JIG8 CRISPR-associated protein Cse3 n=8 Tax=Acetobact... 131 2e-29 UniRef50_A7BA62 Putative uncharacterized protein n=1 Tax=Actinom... 126 3e-28 UniRef50_Q4JWK1 Putative uncharacterized protein n=2 Tax=Coryneb... 125 8e-28 UniRef50_C7LYW5 CRISPR-associated protein, Cse3 family n=1 Tax=A... 120 2e-26 UniRef50_B5GY64 Putative uncharacterized protein n=1 Tax=Strepto... 119 5e-26 UniRef50_C2GEY9 Putative uncharacterized protein n=1 Tax=Coryneb... 117 2e-25 UniRef50_C4X9I8 Crispr-associated Cse3 family protein n=6 Tax=Ga... 117 3e-25 UniRef50_Q47PJ5 CRISPR-associated protein, Cse3 family n=1 Tax=T... 114 2e-24 UniRef50_C6HV94 CRISPR-associated protein, Cas3 n=1 Tax=Leptospi... 113 4e-24 UniRef50_B3ENH7 CRISPR-associated protein, Cse3 family n=3 Tax=C... 110 2e-23 UniRef50_B0S4B5 Putative uncharacterized protein n=1 Tax=Finegol... 106 4e-22 UniRef50_UPI0001B51C2A CRISPR-associated protein, Cse3 family n=... 103 3e-21 UniRef50_Q6NEQ5 Putative uncharacterized protein n=1 Tax=Coryneb... 101 2e-20 UniRef50_C0W6T9 Possible CRISPR-associated protein n=1 Tax=Actin... 92 9e-18 UniRef50_Q3A5Z3 CRISPR-associated protein, Cse3 family n=2 Tax=D... 89 8e-17 UniRef50_C7RP63 CRISPR-associated protein, Cse3 family n=1 Tax=C... 72 1e-11 UniRef50_A8ZZ18 CRISPR-associated protein, Cse3 family n=1 Tax=D... 72 1e-11 UniRef50_Q60AD3 CRISPR-associated protein, CT1974 family n=1 Tax... 70 4e-11 UniRef50_B4UE72 CRISPR-associated CT1974 family protein n=2 Tax=... 69 7e-11 UniRef50_C4ZJY2 CRISPR-associated protein, Cse3 family n=1 Tax=T... 69 1e-10 UniRef50_A8LMM7 CRISPR-associated protein n=2 Tax=Alphaproteobac... 67 4e-10 UniRef50_D0MET7 CRISPR-associated protein, Cse3 family n=1 Tax=R... 65 2e-09 UniRef50_C1XXW2 CRISPR associated protein n=1 Tax=Meiothermus si... 63 4e-09 UniRef50_Q0BRF7 Putative uncharacterized protein n=1 Tax=Granuli... 63 8e-09 UniRef50_B4V4N6 Putative uncharacterized protein n=1 Tax=Strepto... 58 2e-07 UniRef50_D1Y485 Crispr-associated family protein n=1 Tax=Pyramid... 56 1e-06 UniRef50_C9M9R8 CRISPR-associated protein, CT1974 family n=1 Tax... 54 3e-06 UniRef50_C1YTK2 Putative uncharacterized protein n=1 Tax=Nocardi... 51 2e-05 UniRef50_C6NY67 Putative uncharacterized protein n=1 Tax=Acidith... 51 3e-05 UniRef50_D1BYL2 Putative uncharacterized protein n=1 Tax=Xylanim... 44 0.003 UniRef50_Q21QB0 Putative uncharacterized protein n=1 Tax=Rhodofe... 43 0.007 UniRef50_C4XCX4 Putative uncharacterized protein n=1 Tax=Klebsie... 41 0.027 >UniRef50_Q46897 Uncharacterized protein ygcH n=10 Tax=Enterobacteriaceae RepID=YGCH_ECOLI Length = 199 Score = 297 bits (760), Expect = 2e-79, Method: Composition-based stats. Identities = 199/199 (100%), Positives = 199/199 (100%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP Sbjct: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA Sbjct: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ Sbjct: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 Query: 181 QGIGPAKSMGCGLLSLAPL 199 QGIGPAKSMGCGLLSLAPL Sbjct: 181 QGIGPAKSMGCGLLSLAPL 199 >UniRef50_D0FPP5 CRISPR-associated protein, Cse3 family n=2 Tax=Erwinia pyrifoliae RepID=D0FPP5_ERWPY Length = 200 Score = 239 bits (610), Expect = 4e-62, Method: Composition-based stats. Identities = 93/196 (47%), Positives = 125/196 (63%) Query: 2 YLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPV 61 YLS++ + +W++D YQ+H+ LW LFP+RP RDFLF VE R+ G VLLQS Q+P Sbjct: 3 YLSQIDVPWSWAKDPYQMHRALWQLFPDRPSDRRDFLFRVETRHAGSGQRVLLQSPQLPQ 62 Query: 62 STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAE 121 + A A V+ +K + L G L+FRLRANP+KTI D + RL+S+G +K CRVPLI + + Sbjct: 63 NCAAAKVLASKVMHLNLSPGQRLHFRLRANPVKTIKDKRGRLNSRGEVKSCRVPLIDDNQ 122 Query: 122 QIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQ 181 + WL RKL AA + E F+ +GKIQ VCFEG+L + + Sbjct: 123 LMQWLVRKLEGAAVLNSASVSKEPALCFNKQAVAGKIQPVCFEGILQVTSETHFYQCMAD 182 Query: 182 GIGPAKSMGCGLLSLA 197 GIGPAKSMGCG+LS+A Sbjct: 183 GIGPAKSMGCGMLSIA 198 >UniRef50_C5SD47 CRISPR-associated protein, Cse3 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD47_CHRVI Length = 209 Score = 202 bits (513), Expect = 8e-51, Method: Composition-based stats. Identities = 81/205 (39%), Positives = 108/205 (52%), Gaps = 8/205 (3%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNR-------PDA-ARDFLFHVEKRNTPEGCHV 52 M LS+ I + +R+ Y +H+ +W LFP PD R FLF VE V Sbjct: 1 MILSRAEIPWSEARNPYDMHRAIWRLFPGEAAESRRTPDQPRRGFLFRVEDHRPGRPAQV 60 Query: 53 LLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRC 112 L+QS MP A +I ++++ Q G L F L ANPIKTI D Q + C Sbjct: 61 LIQSRCMPQPEATLNLIGSREINPQPSQGQRLAFILTANPIKTIKDRQADTKPRKTRDTC 120 Query: 113 RVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDA 172 RVPLI E Q +WL ++L + A VE V P YF + GKI FEG+LT+ D Sbjct: 121 RVPLITEETQKSWLIQRLKDVAEVEAVAVTPHPPLYFRKANRGGKILCATFEGLLTVLDP 180 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLA 197 AL+ L++ G+GPAK+ GCGLL + Sbjct: 181 NALVALLENGLGPAKAFGCGLLLVR 205 >UniRef50_Q74DC7 CRISPR-associated protein, CT1974 family n=2 Tax=Desulfuromonadales RepID=Q74DC7_GEOSL Length = 202 Score = 198 bits (505), Expect = 6e-50, Method: Composition-based stats. Identities = 95/201 (47%), Positives = 118/201 (58%), Gaps = 5/201 (2%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLSKV+I R+ Y++H+ LW LFP DA RDFLF VE R+ + VLLQS + P Sbjct: 1 MYLSKVLINGTACRNPYEIHRVLWKLFPEDADAERDFLFRVE-RSGQQSVEVLLQSRREP 59 Query: 61 VSTAVATVIK--TKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 A V+ +K LQ L F L ANPIKTI D RL+S IK+CRVPLI+ Sbjct: 60 TMAASREVLLMGSKPYLLSLQQDQQLRFMLVANPIKTINDESARLNSANEIKKCRVPLIR 119 Query: 119 EAEQIAWLQRKLGNAARVEDVHPISERPQYFS--GDGKSGKIQTVCFEGVLTINDAPALI 176 E + AWL+RKL A +E V F + + GK+Q V F GVL++ D LI Sbjct: 120 EEDLRAWLKRKLEGVAVIEAVEVEKRPAMNFRKAREKRVGKVQAVSFHGVLSVTDPVGLI 179 Query: 177 DLVQQGIGPAKSMGCGLLSLA 197 L+ GIGPAK+ GCGLLSLA Sbjct: 180 SLINTGIGPAKAFGCGLLSLA 200 >UniRef50_D0Y917 CRISPR-associated protein, Cse3 family n=2 Tax=Dehalococcoides RepID=D0Y917_9CHLR Length = 209 Score = 179 bits (454), Expect = 5e-44, Method: Composition-based stats. Identities = 64/218 (29%), Positives = 93/218 (42%), Gaps = 30/218 (13%) Query: 1 MYLSKVIIARAWSRDL------YQLHQGLWHLFPNRPDA-ARDFLFHVEKRNTPEGCHVL 53 MYLS + + R L Y+LH+ L FP++ D LF ++ G VL Sbjct: 1 MYLSLLRLNPRSKRALTESSRPYELHRSLLKAFPDKADGGPGRVLFRLDMNEQTGGISVL 60 Query: 54 LQSAQMPVST------AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKG 107 +QS + P T T K K+ + L G L FRLRANP K K Sbjct: 61 IQSEKKPFWTNLNGYTEFVTECKCKEFKPALAPGQVLRFRLRANPTKRSKSTGK------ 114 Query: 108 NIKRCRVPLIKEAEQIAWLQRKLGNAAR------VEDVHPISERPQYFSGDGKSGKIQTV 161 R ++K EQ+ WL++K N D ++ G + +V Sbjct: 115 -----REGILKTEEQVEWLRKKGMNGGFEVCEVFTVDEGFAKDKMTDTDNAGHHTNMLSV 169 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+G+L + D+ A ++ GIG AK G GLLS+A + Sbjct: 170 RFDGLLRVTDSDAFQSTLRDGIGSAKGFGFGLLSVASV 207 >UniRef50_Q12YA7 CRISPR-associated protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YA7_METBU Length = 224 Score = 173 bits (439), Expect = 3e-42, Method: Composition-based stats. Identities = 56/209 (26%), Positives = 84/209 (40%), Gaps = 21/209 (10%) Query: 10 RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVI 69 R + + +H+ +W LFP D R F++ + + +++ S P+ I Sbjct: 17 RGNMGNEHNVHRLVWSLFPVNEDDKRKFIYRQDSMGSLPSFYLV--SENEPIDELNVWDI 74 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLD-----------SKGNIKRCRVP 115 KQ + L+ G L F LRANPI + D Q R D G +P Sbjct: 75 DVKQYDPILKSGQKLAFSLRANPIVSKRDENDKQHRHDVVMDEKFRLKMENGGDIEPNMP 134 Query: 116 LIKEAEQIAWLQRKL---GNAARVEDVHPISERPQYF--SGDGKSGKIQTVCFEGVLTIN 170 I + + WL RK G + E + + + TV G LT+ Sbjct: 135 DIVQRKGSEWLLRKGDMNGFSINAEQIRVDAYQNHKLFKPKGKHHVSFSTVDIVGTLTVT 194 Query: 171 DAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 D D + +GIGPAK GCG+L + PL Sbjct: 195 DPDIFRDALFKGIGPAKGFGCGMLLVRPL 223 >UniRef50_A1SV70 CRISPR-associated protein, Cse3 family n=2 Tax=Gammaproteobacteria RepID=A1SV70_PSYIN Length = 180 Score = 172 bits (436), Expect = 6e-42, Method: Composition-based stats. Identities = 66/197 (33%), Positives = 96/197 (48%), Gaps = 20/197 (10%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLS+V++ + D+Y+ HQ +W LF N D RD LF VE + C VLLQS+ P Sbjct: 1 MYLSQVMLN---THDIYEQHQAIWSLFENVADRKRDHLFRVEV-ADRQSCKVLLQSSTEP 56 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 S+ A V+ +K +++ F+L A P K + +K + + + Sbjct: 57 KSSEQAKVLASKSFLAEIKQDAFYKFKLLAYPTKCLSQGKK-----------VIEIKEAN 105 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 EQ+ WLQRKL A ++ KS + VCFEG+L + D+ + + Sbjct: 106 EQVQWLQRKLSGAN-----VTVTAMDDLMVRSKKSYNSRFVCFEGILQVTDSEQIQRALV 160 Query: 181 QGIGPAKSMGCGLLSLA 197 GIG K G GLLSLA Sbjct: 161 MGIGRKKHAGAGLLSLA 177 >UniRef50_B6XT65 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT65_9BIFI Length = 233 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 55/226 (24%), Positives = 91/226 (40%), Gaps = 28/226 (12%) Query: 1 MYLSKVII--ARAWSRDL----YQLHQGLWHLFP---NRPDAARDFLFHVEKRNTPEGCH 51 M++S++ + AR +R L Y+LH + FP R + L+ ++ Sbjct: 1 MFISRIPLNKARYGARQLIGSPYKLHAAVECAFPPNAVRNNDEGRILWRLDTSVNDNAVW 60 Query: 52 VLLQSAQMPV------STAVATVIK--TKQVEF---QLQVGVPLYFRLRANPIKTILDNQ 100 + + S + P T ++ TK E ++ G +F+LRANP + +++ Sbjct: 61 LYVVSPEKPDFMHIVEQAGWPTHVEWETKNYEPLLERIAKGQQWHFKLRANPARKAKEDK 120 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV--------EDVHPISERPQYFSGD 152 R I I +Q+ WL + + DV + F Sbjct: 121 GRRHRSDGIVGKVQGHITVDQQLQWLIDRSASHGFTILNDQNDQPDVVVKERHKENFKRA 180 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + T FEG L + DA + QGIG AK GCGLL++AP Sbjct: 181 DATVTLVTAVFEGRLEVTDAELFRKALCQGIGRAKGFGCGLLTIAP 226 >UniRef50_B7KJ27 CRISPR-associated protein, Cse3 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ27_CYAP7 Length = 219 Score = 170 bits (431), Expect = 2e-41, Method: Composition-based stats. Identities = 61/222 (27%), Positives = 96/222 (43%), Gaps = 29/222 (13%) Query: 1 MYLSKVIIARAWSR------DLYQLHQGLWHLFPNRP----DAARDFLFHVEKRNTPEGC 50 MYLSK+ + S D ++LHQ + FPN + L+ +E G Sbjct: 1 MYLSKIELNIRSSAVSTDLSDCHKLHQRVMQGFPNENNPEYRSEAKILYRLE------GS 54 Query: 51 HVLLQSAQMPVSTAVA---TVIKTKQVEFQ-LQVGVPLYFRLRANPIKTILDNQ------ 100 + +QS P T + T + +++++ ++ G LYFRL NP++ + Sbjct: 55 ILFVQSKNKPDWTQLPKGYTAEEITEMDYEKIKKGDYLYFRLLGNPVQQTTKLRTDDSGN 114 Query: 101 ---KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGK 157 K K K R L + QI WL L E S + K Sbjct: 115 IIMKNNSEKPQKKTVRRFLSNKDAQIQWLMNHLKGTILQECYVSASSDIRGQCKQSKRIF 174 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 ++TV F+GVL + D+ + I +++GIG +S GCGLLS+A Sbjct: 175 LKTVLFDGVLQVTDSESFIKALREGIGRGRSYGCGLLSIAKF 216 >UniRef50_A6W167 CRISPR-associated protein, Cse3 family n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W167_MARMS Length = 224 Score = 170 bits (430), Expect = 3e-41, Method: Composition-based stats. Identities = 67/227 (29%), Positives = 98/227 (43%), Gaps = 31/227 (13%) Query: 1 MYLSKVII-ARAWSRDL---------YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLSKV A +R L Y HQ LW LF + R FLF E+ Sbjct: 1 MYLSKVSFQASQQARQLLLGFGGKGVYSTHQMLWQLFTE--EDERSFLFREEQSADGSKA 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ---KRLDSKG 107 + S+ P S +KTK +LQ G L F LRANP D + KR D Sbjct: 59 FF-VLSSVKPESDESTFNVKTKTFMPKLQSGQRLGFTLRANPTVCTTDEKGKSKRHDVMM 117 Query: 108 NIKRC----------RVPLIKEAEQIAWLQ--RKLGNAARVEDVHP-ISERPQYFSGDGK 154 + K+ + LI E W+ ++L N D P + Q+ S + Sbjct: 118 HAKKAAKESGVSDSEEIRLIMEQAAQEWIANPKRLENWGFTLDFLPEVQTYMQHRSDKNR 177 Query: 155 --SGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + +V ++GVLT+ D ++ +++G G AKS+GCGL+ + + Sbjct: 178 EDKIRFSSVDYQGVLTVQDPEKFLEQLEKGFGRAKSLGCGLMLIKSI 224 >UniRef50_Q2FNT6 CRISPR-associated protein, CT1974 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNT6_METHJ Length = 228 Score = 169 bits (428), Expect = 5e-41, Method: Composition-based stats. Identities = 67/225 (29%), Positives = 99/225 (44%), Gaps = 31/225 (13%) Query: 1 MYLSKVIIARAWS-----RDL----YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCH 51 M+ SK+ + R + RDL YQ+H+ +W LF + PD RDFL+ E T Sbjct: 1 MFFSKMTLDREAAISGRFRDLVTGPYQVHEVIWDLFADHPDRKRDFLYRAEL--TGRDPV 58 Query: 52 VLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN-------QKRLD 104 V L SA+ PV I +K LQ L FR+R NP+ T + + R D Sbjct: 59 VYLLSARKPVYEGNVWNILSKPFHPVLQKDDLLNFRIRVNPVVTKTEPDPDRKRIRHRHD 118 Query: 105 SKGNIKRCR--------VPLIKEAEQIAWLQR---KLGNAARVEDVHPISERPQYFSGDG 153 + KR + + + E + WL++ K G + + V R FS Sbjct: 119 VIMDAKRRLNEANSSFSMSDLVQQESVRWLRQRSEKGGFSLYEDRVIAGGYRKMQFSQGR 178 Query: 154 KSGKIQT--VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 K I V +GVL + D + ++ G+GPAK GCGL+ + Sbjct: 179 KKNTISISVVDCDGVLRVTDPDLFLQMICNGLGPAKGFGCGLMMV 223 >UniRef50_B4RSK4 CRISPR-associated protein, Cse3 family n=5 Tax=Gammaproteobacteria RepID=B4RSK4_ALTMD Length = 222 Score = 167 bits (423), Expect = 2e-40, Method: Composition-based stats. Identities = 60/222 (27%), Positives = 91/222 (40%), Gaps = 29/222 (13%) Query: 1 MYLSKVI----------IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 M+LSKV +A+ +Y HQ +W LF N + R FL+ E T Sbjct: 1 MFLSKVTMVSSPQTAQELAKLQRNGVYASHQLIWQLFSNVTE--RSFLYREEMGITG-MP 57 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKG 107 + S P ++ TK E +L+ G L F+LR NP + Q+R D Sbjct: 58 EFYVLSKTEPQASLPIFSCVTKVFEPKLKKGQRLSFKLRVNPTVCVKGEDGKQRRHDVMM 117 Query: 108 NIK---------RCRVPLIKEAEQIAWL--QRKLGNAARVEDVHPISERPQYFSGDGKSG 156 K + + E I WL +++L D P + K Sbjct: 118 QAKYNVKDELPDAQTLKMHMEQAAINWLNNEKRLDEWGITLDFQPSIDGYTQHKVQKKRH 177 Query: 157 KIQ--TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 +IQ +V ++G+LT+ D I+ +G G AK MGCGL+ + Sbjct: 178 QIQFSSVDYQGMLTVQDPLKFINQYAKGFGRAKGMGCGLMMI 219 >UniRef50_Q0W583 Predicted CRISPR-associated protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W583_UNCMA Length = 250 Score = 167 bits (423), Expect = 2e-40, Method: Composition-based stats. Identities = 61/248 (24%), Positives = 94/248 (37%), Gaps = 50/248 (20%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN------RPDAARDFLFHVEKRNTPE 48 MYLS++I+ R D ++LH+ + FP+ L ++ Sbjct: 1 MYLSRLILNPRTRAVRRDLADCHELHRTILGGFPDLNGKGGEARETFGVLHRIDIHPRSG 60 Query: 49 GCHVLLQSAQMPVS-------------TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT 95 +L+QS + P T + +++ G FRLRANP K Sbjct: 61 AIVLLVQSQEKPDWSKLPEGYLLENTGTENPACKAIDEQYGKIKAGDVYAFRLRANPTKK 120 Query: 96 ILDNQKRLDSKGNIK--RCRVPLIKEAEQIAWLQRKLGNAA----------RVEDVHPIS 143 I ++ G K RVP+ E++QI WL+RK + DV Sbjct: 121 IGTSRIEDIKAGKPKNNGRRVPIRNESDQILWLKRKGAAGGFELMSTKRFSELSDVLISE 180 Query: 144 ERPQYF-------------SGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMG 190 E Q + +V FEG L + +A ++ ++ GIG K+ G Sbjct: 181 EGHQKIYTFDTGIKAKVQKNARENRLTFGSVLFEGTLKVTNAEKFLETLKSGIGSGKAYG 240 Query: 191 CGLLSLAP 198 GLLSLAP Sbjct: 241 FGLLSLAP 248 >UniRef50_Q2JWC6 CRISPR-associated protein, Cse3 family n=2 Tax=Chroococcales RepID=Q2JWC6_SYNJA Length = 210 Score = 167 bits (422), Expect = 2e-40, Method: Composition-based stats. Identities = 63/216 (29%), Positives = 96/216 (44%), Gaps = 28/216 (12%) Query: 1 MYLSKVIIARAWS------RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS++I+ + + LHQ + H FP++P +H+ R P+GC +L+ Sbjct: 1 MYLSRLILNERQLLVQRELSNAHALHQRIMHGFPDQPTKTPRSDWHILYRQEPDGCTILV 60 Query: 55 QSAQMPVSTAVAT-----VIKTKQVEFQ---LQVGVPLYFRLRANPIKTILDNQKRLDSK 106 QS P + + + K + + L G FRLRANP K + Sbjct: 61 QSVIQPDWSRLPQGYVQRDPEVKIFDLRPEVLSKGRCFQFRLRANPSK-----------R 109 Query: 107 GNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGK---SGKIQTVCF 163 R V + +Q+ WL+R+ PQ F +I TV F Sbjct: 110 DKKTRKIVGFFRSEDQLEWLRRQGFQHGFEVLAAEGIPSPQIFGIKKGLSGPVRIHTVLF 169 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +G+L + D+ A + VQQGIG +S GCGLLSL+ + Sbjct: 170 QGILRVTDSEAFVKAVQQGIGRGRSYGCGLLSLSKI 205 >UniRef50_B1LQ79 CRISPR-associated protein, Cse3 family n=54 Tax=Enterobacteriaceae RepID=B1LQ79_ECOSM Length = 216 Score = 166 bits (420), Expect = 4e-40, Method: Composition-based stats. Identities = 51/217 (23%), Positives = 86/217 (39%), Gaps = 24/217 (11%) Query: 1 MYLSKVIIARAWS----------RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLS++ + R Y +HQ LW LFP R FL+ E+ Sbjct: 1 MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPG--GKERQFLYRREELQGA--F 56 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK------RLD 104 + S + P + I+ + +L G L F LRANP + + Sbjct: 57 RFFVLSQERPAESET-FTIECRSFAPELHTGQSLCFNLRANPTVCKAGKRHDLLMEAKRQ 115 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR-VEDVHPISERPQYFSGDGKSGKIQ--TV 161 +G + V L ++ + WL + + + D + R Q + IQ +V Sbjct: 116 VRGQAEGRNVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSV 175 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + G+LT+ D + + G G +++ GCGL+ + P Sbjct: 176 DYTGMLTVTDPGLFLQRLCLGYGKSRAFGCGLMLIKP 212 >UniRef50_B8IMR1 CRISPR-associated protein, Cse3 family n=3 Tax=Alphaproteobacteria RepID=B8IMR1_METNO Length = 243 Score = 165 bits (418), Expect = 9e-40, Method: Composition-based stats. Identities = 59/222 (26%), Positives = 92/222 (41%), Gaps = 26/222 (11%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+ +++ + + H+ LW LF + PD ARDFL+ + T + L+ S + P Sbjct: 19 LAHLLLPKTEGARVAAGHRLLWSLFADSPDRARDFLWCEDAGGTWQRATFLILSRRRPQD 78 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPI-----KTILDNQKRLDSKGNIKRCRVPLI 117 T I+TK L G L FRLRA+P + KR+D R P + Sbjct: 79 TRGLFEIETKPFAPVLAPGQRLGFRLRASPAASDTPTAVGRRGKRIDPVARALRDLPPEV 138 Query: 118 KEAEQ--------IAWLQRKLGNAAR--VEDVHPISERPQYFSGDGK-----------SG 156 + + WL R+ A + P R S DG+ Sbjct: 139 RAERRHSVLQEVGAGWLARQGARAGFTLCDAEAPSGTRQPCLSVDGERWNVLPREGAAPV 198 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + ++ FEGVL + D + + +G G AK+ GCGL+ + P Sbjct: 199 RFSSLDFEGVLRVEDPSLFLAALAEGFGRAKAFGCGLMLIRP 240 >UniRef50_D1CAI9 CRISPR-associated protein, Cse3 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAI9_SPHTD Length = 257 Score = 165 bits (418), Expect = 9e-40, Method: Composition-based stats. Identities = 75/255 (29%), Positives = 99/255 (38%), Gaps = 59/255 (23%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN--RPDAAR---DFLFHVEKRNTPEG 49 MYLS++I+ R D QLH+ + FPN P AR L+ +E Sbjct: 1 MYLSRLILNPRSREVRRDLADCQQLHRSVMSGFPNLAAPGDARARLGILYRLETHPRTGM 60 Query: 50 CHVLLQSAQMPVSTAVATV----------IKTKQVEF---QLQVGVPLYFRLRANPIKTI 96 +L+QSA P + + K V L G+ L FRLRANP K I Sbjct: 61 PTLLVQSAIEPTWSQLPADYLLNTAGVPNPDCKPVGPIYDALDAGMVLTFRLRANPTKRI 120 Query: 97 LDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGN-AARVEDVHPISERPQYF------ 149 + + N RV L EA+Q+AWL+RK V V +E Y Sbjct: 121 KPDTDP--GRSNRLGKRVELRTEADQLAWLRRKGEQCGFEVLSVRATTEHEAYRWERAAA 178 Query: 150 --------------------------SGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGI 183 DG+ V F+G+L I DA + QGI Sbjct: 179 IFGLEADKPEPVPDVRAVRGSKVYGRRADGERMTFAAVTFDGLLRIVDADRFRAALVQGI 238 Query: 184 GPAKSMGCGLLSLAP 198 G AK+ G GLLS+AP Sbjct: 239 GSAKAYGFGLLSIAP 253 >UniRef50_B6WQ61 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ61_9DELT Length = 206 Score = 164 bits (416), Expect = 1e-39, Method: Composition-based stats. Identities = 59/203 (29%), Positives = 79/203 (38%), Gaps = 8/203 (3%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 M L + +AR RD Y HQ LW FP PDA RDFL + P+GC + L + P Sbjct: 7 MLLDRQALARCRFRDSYAWHQALWECFPAMPDAGRDFLTRTDWL--PQGCRIYLLCRREP 64 Query: 61 VSTAV--ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 V K + F L ANP + + + R+ L+ Sbjct: 65 VRPDWCPPGSWAVKNIAPAFLQHGTYAFDLLANPTRKV--AAFDAGGQRTRNGKRLALLD 122 Query: 119 EAEQIAWLQRKLGNAARVED--VHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 E + AW++ K G D + F +G V F G L + D I Sbjct: 123 ETSRQAWMEAKAGQHGFCLDGPLALDDAGASIFWRRACAGTHIGVRFRGRLQVTDRERFI 182 Query: 177 DLVQQGIGPAKSMGCGLLSLAPL 199 GIG AK+ G G+L L PL Sbjct: 183 HAFYHGIGSAKAFGFGMLLLQPL 205 >UniRef50_B8GIV2 CRISPR-associated protein, Cse3 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV2_METPE Length = 225 Score = 163 bits (414), Expect = 2e-39, Method: Composition-based stats. Identities = 63/226 (27%), Positives = 91/226 (40%), Gaps = 30/226 (13%) Query: 1 MYLSKVIIA---------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCH 51 M +SK+ + YQ H +W LF + P+ RDFLF E Sbjct: 1 MQISKIQLNADASDHPAFWEHVGGAYQAHSLIWDLFSDGPERERDFLFRQEVHQGMPVFW 60 Query: 52 VLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKGN 108 + S ++P IK+K L+ G+ L F LRANPI++ D Q R D + Sbjct: 61 TV--SERVPSDRNETWNIKSKPYAPILRQGMHLSFVLRANPIRSRRDDLGKQHRHDVVMD 118 Query: 109 IKRC--------RVPLIKEAEQIA---WLQRKL---GNAARVEDVHPISERPQYFSGDGK 154 +K + P + Q A WL + G + + V F K Sbjct: 119 MKTALKDSKPGDQWPAEDQIIQEAGLVWLANQGNAKGFSLQDGAVRVDGYTQHRFVKPKK 178 Query: 155 SG--KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 +I T+ F G+LT+ D + GIGPAK GCGL+ + P Sbjct: 179 KQMVQISTLDFTGLLTVTDPERFTTALFNGIGPAKGFGCGLMMVRP 224 >UniRef50_B4TTX1 Crispr-associated protein, Cse3 family n=15 Tax=Enterobacteriaceae RepID=B4TTX1_SALSV Length = 235 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 62/238 (26%), Positives = 98/238 (41%), Gaps = 50/238 (21%) Query: 1 MYLSKV----------IIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLS++ ++A+ S Y HQ LW LFP + R FLF E Sbjct: 1 MYLSRIQLRFNNLRPEMLAKWNSARPYASHQWLWQLFPEQE--LRQFLFREEAHGG---- 54 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT-------ILDNQKRL 103 + SA P+S +I+TK QL G+ L F+LRANP+ T ++ N K Sbjct: 55 -FFMLSAIPPLSQHSLFLIETKPFNPQLTNGLELDFQLRANPVITRNGKRSDVMMNAKHQ 113 Query: 104 DSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDG---------- 153 +++ R +++ AWL+++ G + P + ++GD Sbjct: 114 AKANGVEKERWWELQQQAAQAWLEQQ-GQQHGFRLIAPEPDDFAMWAGDEYSELQAHCGC 172 Query: 154 ---------------KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 K +V F G L I DA + G+G +K++GCG+L + Sbjct: 173 VQAYQQYRFVRKDQQKPITFSSVDFSGALCITDAALFKQALFSGLGKSKALGCGMLMV 230 >UniRef50_B8IZA5 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA5_DESDA Length = 207 Score = 158 bits (401), Expect = 7e-38, Method: Composition-based stats. Identities = 58/208 (27%), Positives = 87/208 (41%), Gaps = 14/208 (6%) Query: 2 YLSKVI-----IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQS 56 ++++ + + R D Y H+ +W FPNRPDA+RDFLF ++ P G V + S Sbjct: 3 WMTRFMVELPALHRNRLSDCYAWHKAIWQCFPNRPDASRDFLFRLD--EVPAGTLVHVLS 60 Query: 57 AQMPVSTAVATVI--KTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 P T + K V F + NP + + D + R Sbjct: 61 PHEPQRPDFCTEDHWQIKAVPPCFLKYNCYRFDVICNPGRKV--EAFTSDGQRKKNSRRE 118 Query: 115 PLIKEAEQIAWLQRKLGNAAR--VEDVHPISERPQY-FSGDGKSGKIQTVCFEGVLTIND 171 +IK EQ AWL RK + + I +Y F D +SG V F GVL + Sbjct: 119 AIIKPDEQNAWLDRKAAANGFEVLPGMRSIDPSTRYSFRKDHRSGTHIGVRFSGVLRVTQ 178 Query: 172 APALIDLVQQGIGPAKSMGCGLLSLAPL 199 +G+G A+ G G+L L+P+ Sbjct: 179 RDEFCRAFHKGLGSARGFGFGMLLLSPV 206 >UniRef50_Q314I5 CRISPR-associated protein, CT1974 n=2 Tax=Desulfovibrio RepID=Q314I5_DESDG Length = 219 Score = 157 bits (398), Expect = 2e-37, Method: Composition-based stats. Identities = 62/223 (27%), Positives = 99/223 (44%), Gaps = 28/223 (12%) Query: 1 MYLSKVIIA--RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ 58 M++SK+++ RA ++LY H+ LW+LF + PD RDFLF E L S + Sbjct: 1 MWMSKLVLDPRRAVGKNLYDTHRLLWNLFADAPDRTRDFLFR----EQDEPYTFLTVSRR 56 Query: 59 MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKR----------LDSKGN 108 P T I+ K +LQ G + F LR N + +N K+ L K Sbjct: 57 QPEDTTGWWSIQIKPYAPKLQAGDAVAFSLRVNAVVKRNENGKQRRFDIVQDACLRMKEL 116 Query: 109 IKRCRVPLIKEAEQIA---WLQRKL-------GNAARVEDVHPISERPQYFSGDGKSG-- 156 + ++P E Q A WL + +AA + + + + + D +SG Sbjct: 117 NQNAQMPTRAEIAQEAGTRWLLARQQALGLSIESAAILVEGCKVERFVKRATRDTRSGVV 176 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + + +G + D L+ + QG+GPAK GCGLL + + Sbjct: 177 SLGIMDLQGTAEVKDPQLLLQALFQGVGPAKGFGCGLLLIRRV 219 >UniRef50_C6WMQ8 CRISPR-associated protein, Cse3 family n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WMQ8_ACTMD Length = 230 Score = 157 bits (398), Expect = 2e-37, Method: Composition-based stats. Identities = 57/225 (25%), Positives = 90/225 (40%), Gaps = 36/225 (16%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPD-----AARDFLFHVEKRNTPEG 49 M+L+K+ + R +L+++H+ + +P D L+ ++ TP G Sbjct: 1 MFLTKLTVDVRSREFRRDLANLHEMHRTVMSGYPRVEDGSPARQTHGVLWRLD--ATPAG 58 Query: 50 CHVLLQSAQMPVSTAVATVIKT--------KQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 +QS P T + + T + ++ G L FRL AN K Sbjct: 59 YTQYVQSLTRPDWTGLPETLLTSPAEVRSLDPLLDAIEPGRVLAFRLLANATK------D 112 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--------VEDVHPISERPQYFSGD- 152 + ++ + RV Q++WL RK V DV S Sbjct: 113 SVPAEPGGRGLRVAHRTPEAQVSWLARKGQRHGFALRDRPDGVPDVTLWSAPRMTGRKKA 172 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 G+ + V F+G L + DA L + V GIG AK+ GCG+LSLA Sbjct: 173 GRPITVDAVRFDGHLVVTDADELREAVGSGIGRAKAYGCGMLSLA 217 >UniRef50_A0LM55 CRISPR-associated protein, Cse3 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM55_SYNFM Length = 202 Score = 156 bits (395), Expect = 4e-37, Method: Composition-based stats. Identities = 63/216 (29%), Positives = 91/216 (42%), Gaps = 31/216 (14%) Query: 1 MYLSKVIIARAWS------RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS + + R D+Y LH+G+ F D R LF VE N +++ Sbjct: 1 MYLSLLSLDRLHRGTMRLLSDIYLLHKGIMSGFTRCGDGLR-VLFRVEPENDDRIVRIMV 59 Query: 55 QSAQMPVST------AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGN 108 QS P ++TK L+ G FRLRANP Sbjct: 60 QSDGSPSWELFTERHPCVIDMRTKVFSPALRAGHSYRFRLRANPAV-------------K 106 Query: 109 IKRCRVPLIKEAEQIAWLQRK-----LGNAARVEDVHPISERPQYFSGDGKSGKIQTVCF 163 R LI++ WL+RK L + + + SG + I+T F Sbjct: 107 RNGKRYGLIRDETLEEWLRRKEPALGLQFRSVLALDEGYVTGHKEGSGHPQRINIKTARF 166 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 EG+LT+++ + + + GIGPAK+ GCGLLSLA + Sbjct: 167 EGILTVSEPHLVQNALCCGIGPAKAFGCGLLSLARV 202 >UniRef50_A5UR13 CRISPR-associated protein, Cse3 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR13_ROSS1 Length = 238 Score = 156 bits (394), Expect = 5e-37, Method: Composition-based stats. Identities = 56/238 (23%), Positives = 92/238 (38%), Gaps = 43/238 (18%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPD-----AARDFLFHVEKRNT-PE 48 MYLS++I+ R D+Y+LH+ + FP PD A L+ +E + P Sbjct: 1 MYLSRLILDVRQPRVRRDLSDVYRLHRTILSAFPQAPDNVPARAHFGILYRIEPISDMPW 60 Query: 49 GCHVLLQSAQMPVSTAVATV--------------IKTKQVEFQLQVGVPLYFRLRANPIK 94 +L+QS + P + + + +++ + FRL ANP + Sbjct: 61 LVRLLVQSREQPDWSHIPDRMFGPALDERGNPALRRIDDEYARIRSDMQFLFRLLANPTR 120 Query: 95 TILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGK 154 + + D + + RV L++E EQIAWL K ++ + Sbjct: 121 RLSNRSSERDDR--LLGKRVALLREEEQIAWLAHKGEQHGFRLLSTSVNPDVPAVQAAKQ 178 Query: 155 SG---------------KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 + V F G L + DA ++ GIG K+ G GLLS+A Sbjct: 179 ADEHGWRKATQTQTMHLTFGAVLFTGYLKVTDADRFRTALEHGIGSGKAFGFGLLSIA 236 >UniRef50_A5GBK0 CRISPR-associated protein, Cse3 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK0_GEOUR Length = 229 Score = 155 bits (391), Expect = 1e-36, Method: Composition-based stats. Identities = 64/226 (28%), Positives = 97/226 (42%), Gaps = 31/226 (13%) Query: 2 YLSKVIIARAWSR------DLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQ 55 +L+++ + R D+Y H+ LW +P++P+A RDFL +++ VL Sbjct: 3 WLARLEVDAETVRAAGISEDVYAWHKLLWECYPDQPEAERDFLTRIDQLEGAYRFWVL-- 60 Query: 56 SAQMPVSTAVA--TVIKTKQVEFQLQVGVPLYFRLRANPIKTI----LDNQKRLDSKGNI 109 + + PV ++ F LRANP++ + ++ LD+ G Sbjct: 61 AKRKPVMPRWCPVDGFGLNEISPSFLSRQYYAFDLRANPVRAAVQRDANGEQVLDANGKR 120 Query: 110 KR-CRVPLIKEAEQIAWLQRKLG-------------NAARVEDVHPISERP---QYFSGD 152 +R RVPL+K E AWL RK R+ + + P +F Sbjct: 121 RRGKRVPLVKPDELRAWLVRKGEVRCRDKETGLDVPGGFRLVEERSLEISPMVESHFRKK 180 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G+SG V F G L + D I+ Q GIG AK G GLL LAP Sbjct: 181 GQSGYHGGVQFRGTLEVTDRAKFIESYQSGIGSAKGFGFGLLLLAP 226 >UniRef50_A9GV72 Putative uncharacterized protein ygcH n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GV72_SORC5 Length = 246 Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats. Identities = 65/245 (26%), Positives = 98/245 (40%), Gaps = 49/245 (20%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGC 50 MYLS+ ++ RA D+ LH+ + FP+ P A LF V++ Sbjct: 1 MYLSRALLNPISRAVRADIADIEGLHRTIMRAFPDGAGPHPRRAHGVLFRVDEAVLRGRF 60 Query: 51 HVLLQSAQMPVSTAVA----------------------TVIKTKQVEFQLQVGVPLYFRL 88 +L+QSA P T + + + +++ G F L Sbjct: 61 VLLVQSATRPDFTRLPEDYFLDIQEDLGLTEPSPIENPAIREVGSERARIRAGDFFRFSL 120 Query: 89 RANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKL----------GNAARVED 138 RA+P + I + K D RV L +A ++ WL+RK + A V Sbjct: 121 RASPTRRI--DTKSGDDGKRRNGRRVELRDDASRLDWLRRKAMAGGFELCGAEDGAGVGG 178 Query: 139 VHPISERPQYFSGDGKSG-----KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGL 193 V + E G G S + V FEG L + DA + + G+GPAK+ G GL Sbjct: 179 VSAVEEPKLTGRGSGASEQRQQLTLAPVLFEGRLRVTDADRFREALAAGVGPAKAYGFGL 238 Query: 194 LSLAP 198 LS+AP Sbjct: 239 LSIAP 243 >UniRef50_Q67RN9 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RN9_SYMTH Length = 224 Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats. Identities = 58/225 (25%), Positives = 89/225 (39%), Gaps = 29/225 (12%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN------RPDAARDFLFHVEKRNTPE 48 MYLS + + + RD+ LHQ + FP+ A L+ +E Sbjct: 1 MYLSLLRLNPASAAVQRDLRDVQALHQRVMSAFPDVLDPEVEARAYFGVLYRLELNRYSG 60 Query: 49 GCHVLLQSAQMPVSTAVAT-------------VIKTKQVEFQLQVGVPLYFRLRANPIKT 95 + +QS P + V + + +++ G L FRLRANP + Sbjct: 61 QVLLYVQSRVEPDWGRLPAGYLTPADGLPNPAVKRVDEAYARIREGRVLRFRLRANPTRK 120 Query: 96 ILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--VEDVHPISERPQYFSGDG 153 I K N + RVPL Q+ W++RK +E + + Sbjct: 121 IDTKSGPNGEKRNGR--RVPLSGLDAQLGWMERKAREHGFELLEATVAAAGASERVRSYT 178 Query: 154 KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 Q V FEG L + DA + +++GIGP K+ G GLLS+ P Sbjct: 179 TGRTFQGVLFEGRLVVRDAGRFREALERGIGPGKAYGYGLLSVGP 223 >UniRef50_Q53WG9 Putative uncharacterized protein TTHB192 n=1 Tax=Thermus thermophilus HB8 RepID=Q53WG9_THET8 Length = 211 Score = 154 bits (389), Expect = 2e-36, Method: Composition-based stats. Identities = 63/222 (28%), Positives = 99/222 (44%), Gaps = 35/222 (15%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLFPNRPDAARDFL-FHVEKRNTPEGCHVL 53 M+L+K+++ AR + Y++H+ L + R+ L + +E E VL Sbjct: 1 MWLTKLVLNPASRAARRDLANPYEMHRTLSKAVSRALEEGRERLLWRLEPARGLEPPVVL 60 Query: 54 LQSAQMPVST----AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNI 109 +Q+ P + A V K L+ G L FRLRANP K + K Sbjct: 61 VQTLTEPDWSVLDEGYAQVFPPKPFHPALKPGQRLRFRLRANPAKRLAATGK-------- 112 Query: 110 KRCRVPLIKEAEQIAWLQRKLGNAAR-------------VEDVHPISERPQYFSGDGKSG 156 RV L AE++AWL+R+L ++D R + GK Sbjct: 113 ---RVALKTPAEKVAWLERRLEEGGFRLLEGERGPWVQILQDTFLEVRRKKDGEEAGKLL 169 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 ++Q V FEG L + D + +++G+GP K++G GLLS+AP Sbjct: 170 QVQAVLFEGRLEVVDPERALATLRRGVGPGKALGLGLLSVAP 211 >UniRef50_A1ARH5 CRISPR-associated protein, Cse3 family n=3 Tax=Bacteria RepID=A1ARH5_PELPD Length = 224 Score = 153 bits (387), Expect = 3e-36, Method: Composition-based stats. Identities = 62/236 (26%), Positives = 92/236 (38%), Gaps = 50/236 (21%) Query: 1 MYLSKVIIA---RAWSRDL---YQLHQGLWHLF--PNRPDAARDFLFHVEKRNTPEGC-H 51 M+LS++ + R RDL YQLH L F P +FL+ +E G Sbjct: 1 MFLSRLRLNLRCREARRDLSNPYQLHSTLCRAFSPPETKCPKGEFLWRLEPETDSSGYPR 60 Query: 52 VLLQSAQMPVSTAV-----------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 +++QS +P V A +K + L+ FRLRANP T Sbjct: 61 IIVQSRNIPDWGGVGVNGWIQQADPAIDLKERLKLDLLKAEQRFRFRLRANPCVT----- 115 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVE----------------DVHPISE 144 R+ L+K+ EQ WL+RK DV E Sbjct: 116 --------KNGKRLGLLKQDEQEKWLKRKGAQHGFCLPEFLSFDYYESSEDRIDVRISQE 167 Query: 145 RP-QYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + S ++ +V ++G+LTI + ++ GIG K MG GLLS+ P+ Sbjct: 168 QMLSDKQHSDNSIRVFSVLYDGILTITEPEMFKIALKTGIGHGKVMGLGLLSVVPI 223 >UniRef50_C6C417 CRISPR-associated protein, Cse3 family n=4 Tax=Enterobacteriaceae RepID=C6C417_DICDC Length = 215 Score = 153 bits (386), Expect = 4e-36, Method: Composition-based stats. Identities = 56/209 (26%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 1 MYLSKVIIA----------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 M+ S+V + + + +Y HQ LW LFP + +R FLF + T Sbjct: 1 MFFSRVTLQPAALPSVMAEKWQTTPVYASHQWLWQLFPQ--EGSRGFLFRQDDHATLSRY 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKT--------ILDNQKR 102 ++L SA P V++TK + QL G+PL F LRANP+ T ++D + Sbjct: 59 YLL--SACAPRQDHNLFVVETKPWQPQLNAGMPLAFSLRANPVVTRRQKRCDVLMDAKYH 116 Query: 103 LDSKGNIKRCRVPLIKEAEQIAWLQR---KLGNAARVEDVHPISERPQYFSGDGKSGKIQ 159 ++G P ++ + WL R + G A V Y Sbjct: 117 AKAQGADSAEIWP-RQQQAAVDWLVRQGERGGFAVHACHVDGYQRHRLYKPQQSGPVSFS 175 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKS 188 +V F+G+L I DA + V QG+G +++ Sbjct: 176 SVDFDGLLRITDAKRFAETVSQGLGKSRA 204 >UniRef50_D2RB03 CRISPR system CASCADE complex protein CasE n=4 Tax=Bacteria RepID=D2RB03_GARVA Length = 215 Score = 152 bits (383), Expect = 8e-36, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 93/217 (42%), Gaps = 25/217 (11%) Query: 2 YLSKVIIAR------AWSRDLYQLHQGLWHLFPNR--PDAARDFLFHVEKRNTPEGCHVL 53 YLS+V I + + H + FP+ L+ V+ + ++L Sbjct: 3 YLSRVEIDYKKPSSLRDLKSVGAFHNWVEQSFPDEWENHERSRKLWRVDVLH--GKHYLL 60 Query: 54 LQSAQMPV--------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDS 105 + S P + A+ + L G+ + FR+ NP+ +I DN + + Sbjct: 61 IVSDSKPDLQRLEMYGVSGTASSKTYDKFLGSLMNGMRMQFRVTLNPVVSISDNAETHTA 120 Query: 106 KGNIKRCRVPLIKEAEQIAWL---QRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC 162 +G + VP + +Q+ +L +KLG + + + F+ K ++ Sbjct: 121 RGRV----VPHVTYDQQMNFLLNRAQKLGFSLNENEFAIVERGYSLFTKSEKPIRLSKAV 176 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 ++G+LTI+DA + + +GIG K+ G G++++ PL Sbjct: 177 YQGILTISDADIMRKTLLEGIGKKKAYGFGMMTVIPL 213 >UniRef50_B6B784 CRISPR-associated protein, Cse3 family n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B784_9RHOB Length = 223 Score = 151 bits (382), Expect = 1e-35, Method: Composition-based stats. Identities = 51/227 (22%), Positives = 87/227 (38%), Gaps = 33/227 (14%) Query: 1 MYLSKVIIARAWSRDLYQL--------------HQGLWHLFPNRPDAARDFLFHVEKRNT 46 MYLS++ +AR S H+ +W F P A RDFL+ E R Sbjct: 2 MYLSRLTLARDPSVAALNALLDPDEKGAGADAHHRLIWSAFAGDPLAPRDFLWRAEGRG- 60 Query: 47 PEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKR---- 102 L+QS + PV + +++ L+ G + F LRAN K + ++R Sbjct: 61 ----RFLVQSPEPPVGGPFFDPPEVRELAPDLRRGDQVSFLLRANATKDLRGEKRRRVDV 116 Query: 103 -----LDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISER-----PQYFSGD 152 D ++ R + + W+ + A D + + P + S Sbjct: 117 VMNLLHDVPKAERQIRRMALAQQAAGEWMAGQAARAGFCADHLEVQDYSTLTLPGHRSRR 176 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + + + G +T+ D + + QG G AK GCGL+ + + Sbjct: 177 RGAPRFGILDLTGRITVTDPQVFLAKLAQGFGRAKGFGCGLMLIRRV 223 >UniRef50_D1CGD5 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD5_THET1 Length = 240 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 53/232 (22%), Positives = 91/232 (39%), Gaps = 38/232 (16%) Query: 1 MYLSKVIIARAWS------RDLYQLHQGLWHLFP-----NRPDAARDFLFHVEKRNTPEG 49 +YLS++ + + + LH + FP + P A L+ +E+ Sbjct: 2 LYLSRLRLQPRHRDVQKDLSNCHALHSRILSAFPLLPTPSSPRAEMGVLYRLEEAG--RF 59 Query: 50 CHVLLQSAQMPVSTAVAT--------VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 V++QS P + + + ++ G FRLRANP + I Sbjct: 60 PTVIVQSRLEPDWSRLPEGYLAFPAECKRVDDKYSRINQGDRFIFRLRANPTRRIARGN- 118 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERP--------------- 146 + + RV L +E +QI WL RK + ++ Sbjct: 119 -TEQAERWRGKRVELQREEDQIDWLIRKGDQHGFKLLSITVRQQAVPNLRVLPNNKTHGW 177 Query: 147 QYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + +G + +V FEGVL + D + + ++QG+G K+ G GLLS+AP Sbjct: 178 RRDAGGNRRLTFGSVQFEGVLEVTDRESFMQALEQGVGSGKAFGFGLLSIAP 229 >UniRef50_B8FDH8 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDH8_DESAA Length = 199 Score = 150 bits (380), Expect = 2e-35, Method: Composition-based stats. Identities = 52/203 (25%), Positives = 80/203 (39%), Gaps = 17/203 (8%) Query: 1 MY-LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQM 59 MY L + +D+Y +H+ +++LFP RDFLF +K G +L+ S + Sbjct: 5 MYTLDRKDCKALGLKDVYGVHKAVYNLFPENNGQGRDFLF-ADKGGDWNGRKILILSHRE 63 Query: 60 PVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKE 119 P+ I ++V F + NP++ + N R +P+ Sbjct: 64 PIQPRH-GAIDCREVPAAFLDWDYYGFEVVLNPVR-----------RDNASRKLIPVRGR 111 Query: 120 AEQIAWLQRK---LGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 W +K LG + + ++ DG T F G L + D A Sbjct: 112 ENLHEWFLKKAPGLGFEVEPHSLQVSRMGVEAYAKDGTMRTHNTATFIGKLRVIDPNAFK 171 Query: 177 DLVQQGIGPAKSMGCGLLSLAPL 199 QGIG AK+ G GLL L PL Sbjct: 172 KSFAQGIGRAKAFGFGLLQLVPL 194 >UniRef50_Q1R113 CRISPR-associated protein, CT1974 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R113_CHRSD Length = 230 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. Identities = 60/229 (26%), Positives = 85/229 (37%), Gaps = 33/229 (14%) Query: 1 MYLS--KVIIARAWSRDL--------YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEG- 49 MYLS +V + L Y HQ LW LF + + R FLF E G Sbjct: 1 MYLSSVRVDLNALTREQLFDVLEGGAYTAHQLLWTLFADTSEGERPFLFRQEMEEAANGK 60 Query: 50 ----CHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTIL----DNQK 101 + S + P + A ++ K QL G L FRLRANP Sbjct: 61 SQGLPRFYVYSTRRPEAVAGL-DVQCKPFAPQLAKGERLAFRLRANPTVAKSAGEGQRSH 119 Query: 102 RLDSKGNIKRCRVPLIK---------EAEQIAWLQRKLGNAARVEDVHP----ISERPQY 148 R D N ++ P + E WL + V P + Sbjct: 120 RADVLMNARKPFSPGERTSQACVDAMETAARDWLAERAPRFGFELPVAPEMGAYRQHELK 179 Query: 149 FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 S + + +V +EG+L + D LI+ + G+G AK+ GCGL+ L Sbjct: 180 KSDRREPIRFSSVDYEGLLEVTDPRRLIETLAHGVGRAKAFGCGLMLLR 228 >UniRef50_C1DSI0 CRISPR-associated protein, CT1974 n=3 Tax=Pseudomonadaceae RepID=C1DSI0_AZOVD Length = 205 Score = 150 bits (379), Expect = 3e-35, Method: Composition-based stats. Identities = 65/216 (30%), Positives = 92/216 (42%), Gaps = 32/216 (14%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLF-PNRPDAARDFLFHVEKRNTPE-GCHV 52 MYL+++ + AR D Y +H+ L F + DA FL+ +E + Sbjct: 1 MYLTRLTLDPRSAQARRDLADAYDMHRTLVRAFVRDERDAPGRFLWRLEPGADAWASPTL 60 Query: 53 LLQSAQ---------MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRL 103 L+QS + +P K +E ++ FRL ANP T Sbjct: 61 LVQSCESGDWDVLQGLPGYLQRPAECKALDLEALIRPQWRYRFRLLANPTVT-------- 112 Query: 104 DSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGD--GKSGKIQTV 161 R L+ EAEQ+AWLQR+ +S S G +Q V Sbjct: 113 -----RAGKRRGLLGEAEQLAWLQRQGERHGFAVKAVLVSASDLLDSRRKGGAPIVLQRV 167 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 CFEG+L + +A AL + GIGPAK+ GCGLLS+A Sbjct: 168 CFEGLLQVVEADALRRALASGIGPAKAFGCGLLSVA 203 >UniRef50_Q2RY20 CRISPR-associated protein, CT1974 n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY20_RHORT Length = 220 Score = 149 bits (376), Expect = 6e-35, Method: Composition-based stats. Identities = 57/191 (29%), Positives = 81/191 (42%), Gaps = 16/191 (8%) Query: 20 HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQ 79 H+ +W LF + P A+RDF+F E L+ SA+ P ++TK + Sbjct: 33 HRLVWTLFADDPKASRDFVFR-----EAEPGRYLIVSARPPGDGQGLWRLETKPYAPAFR 87 Query: 80 VGVPLYFRLRANPIKTILD----NQKRLDSKGNIK-RCRVPLIKEAE---QIAWL-QRKL 130 G F LRANP + KR+D+ + K R PL E + WL R+ Sbjct: 88 EGQRFGFTLRANPATAVKQAGETRGKRVDAIMHAKTRSATPLTVEDRERVALDWLLDRQQ 147 Query: 131 GNAARVEDV--HPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKS 188 G E R GK+ + +EGV T+ D L + +GIG AK+ Sbjct: 148 GFGVLFERALCSAGGYRQVRVPRGGKAITFSVIDYEGVFTVRDPGLLGQALVRGIGKAKA 207 Query: 189 MGCGLLSLAPL 199 GCGL+ L L Sbjct: 208 YGCGLMLLRRL 218 >UniRef50_D1NTI2 CRISPR-associated protein, Cse3 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI2_9BIFI Length = 236 Score = 147 bits (372), Expect = 2e-34, Method: Composition-based stats. Identities = 51/230 (22%), Positives = 81/230 (35%), Gaps = 41/230 (17%) Query: 9 ARAWSRDLYQLHQGLWHLFPNRPDAAR-----DFLFHVEKRNTPEGCHVLLQSAQMPV-- 61 AR + Y+LH + FP P A R L+ ++ + + S P Sbjct: 6 ARQLAASPYKLHAAVEASFP--PHAPRATDEGRILWRLDHNRQDHSVWLYVVSPSQPDLL 63 Query: 62 ---------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIK---TILDNQKRLDSKGNI 109 A +L G ++R+ ANP++ T L+ L + + Sbjct: 64 HIVEQAGWPGYAEWETKDYTPFLDRLAQGQQWHYRVCANPVRNAATDLNLHNSLATFDKM 123 Query: 110 KRCRVPLIKEAEQIAWLQRKLGNAAR--------------------VEDVHPISERPQYF 149 K R + +QI W +R+ + V I + F Sbjct: 124 KGSRQAYVTVRQQIDWFERRAAANGFSLPERDPVSGFDEQVKDPLLLSSVRVIDRQRHKF 183 Query: 150 SGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + T FEG L + D +L + GIG AK GCGL++LAP+ Sbjct: 184 RDRKNQVTLSTAVFEGTLQVEDPQSLRHALCFGIGKAKGFGCGLMTLAPI 233 >UniRef50_A8M405 CRISPR-associated protein, Cse3 family n=3 Tax=Actinomycetales RepID=A8M405_SALAI Length = 227 Score = 146 bits (368), Expect = 5e-34, Method: Composition-based stats. Identities = 51/229 (22%), Positives = 94/229 (41%), Gaps = 38/229 (16%) Query: 1 MYLSKVII--ARAWSRDLYQ----LHQGLWHLFPNRPDAARD---FLFHVE--------- 42 MYL++ ++ AR +R L +H + FP D RD L+ ++ Sbjct: 1 MYLTRFLVNPARRGARKLLASPQAMHAAVLSGFPRPEDHTRDGARTLWRLDHRQDRQVVL 60 Query: 43 ---KRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN 99 P+ H++ Q+ + AT ++ + L G FRL ANP + N Sbjct: 61 YVVSPTAPDLTHMVEQAGWPSNAETWATRPYSR-LLDSLDKGQRWAFRLTANPARAGRRN 119 Query: 100 QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--------VEDVHPISERPQYFSG 151 Q ++ R + +Q+ WL R+ ++ + R F+ Sbjct: 120 QDTPTTQ------RYGHVTPVQQVEWLTRRAERNGFGVVRQTDGELNLITYNRRVHRFTR 173 Query: 152 DG--KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + T ++GVL +++ ++ +GIG A++ GCGLL++AP Sbjct: 174 GHTQRPVTLVTATYDGVLEVDEPTLFRGVLTRGIGHARAYGCGLLTVAP 222 >UniRef50_Q04QB6 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB6_LEPBJ Length = 266 Score = 145 bits (365), Expect = 1e-33, Method: Composition-based stats. Identities = 63/265 (23%), Positives = 96/265 (36%), Gaps = 69/265 (26%) Query: 1 MYLSKVIIA---------RAWSRDLYQLHQGLWHLFPNRPDAARD----FLFHVEKRNTP 47 M+LS++ + W ++ Y +HQ LW F + FLF ++ + P Sbjct: 1 MFLSQLKLDTHNTNNKIVFNWIQNPYNIHQRLWMAFSEYSSKDKPQNSPFLFQLDYNSDP 60 Query: 48 E--GCHVLLQSAQMPVS----------TAVATVIKTKQVEFQ-LQVGVPLYFRLRANPIK 94 +L+ S ++P T + + KQ+ +Q G L F L ANP K Sbjct: 61 GKISPRILVFSEKLPNWERAFQEFKVLTEIPVGNQIKQISPTFIQAGAVLRFSLTANPTK 120 Query: 95 TILDNQK-----------------------------------RLDSKGNIKRCRVPLIKE 119 + D + D +K RV + E Sbjct: 121 KLKDYRSLFQEELEGFPDKFDPSDRVSFLEGKSKLEDLKKTLTKDQIQKLKSKRVGIYHE 180 Query: 120 AEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSG-------KIQTVCFEGVLTINDA 172 E + WL +K G+ + + E FS + G KI TV F G+L I D Sbjct: 181 KELLNWLSKK-GSDNGFSLLDAVVEFQSDFSANKIKGSLSPSIPKIHTVSFSGILKIMDP 239 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLA 197 +GIG K+ GCG+L LA Sbjct: 240 ALFKIAYTKGIGTGKAFGCGMLLLA 264 >UniRef50_C2BET7 CRISPR-associated protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BET7_9FIRM Length = 215 Score = 145 bits (365), Expect = 1e-33, Method: Composition-based stats. Identities = 49/221 (22%), Positives = 89/221 (40%), Gaps = 31/221 (14%) Query: 1 MYLSKV---IIARAWSRDLYQL---HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS+V I R +DL L H + H FP D L+ ++ N + ++++ Sbjct: 1 MYLSRVEIDINNRRKMKDLTHLGCYHGWVEHSFPQENDIRTRKLWRID--NIGDKYYLII 58 Query: 55 QSAQMPVSTAVAT--------VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK 106 S +P + V + L+ G+ FR++ N + ++D + Sbjct: 59 LSEYIPDKEKLEKYGVESTTEVKDYDEFLASLKEGIRAKFRIKLNTVIA------KIDKE 112 Query: 107 GNIKRCRVPLIKEAEQIAWL---QRKLGNAARVEDVHPISERPQYFS------GDGKSGK 157 + KR R+ + + +L ++ G + ++ +YF Sbjct: 113 NSTKRGRIMPVPNEKLNGFLVDKAQRNGFEVKTDEFGISKIDKEYFMNFDKEDKKKSRKN 172 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 I + +EG+LTI D + GIG K+ GCG L++ P Sbjct: 173 IVSATYEGMLTITDLEKFKVALVNGIGKKKAYGCGFLTIIP 213 >UniRef50_B5GAA2 Crispr-associated protein n=1 Tax=Streptomyces sp. SPB74 RepID=B5GAA2_9ACTO Length = 217 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 54/201 (26%), Positives = 80/201 (39%), Gaps = 17/201 (8%) Query: 9 ARAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTA 64 A R LH+ + LFP+ R LF E+ T G +L+QS P T Sbjct: 21 ATRDLRSAVNLHKRVMSLFPDDLGERARQQTGALFRFEEDAT-RGSRLLVQSVVTPDPTR 79 Query: 65 VATVI------KTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 + + + +L+ GV + +RL N +T+ D+ +PL Sbjct: 80 LPARYGAVRSTEITPLLQRLRPGVRVNYRLTGNATRTLSR-----DTTAGRPNQVIPLHG 134 Query: 119 EAEQIAWLQRKLGNAARVEDVHPIS-ERPQYFSGDGKSGKIQTVCFEGVLTINDAPALID 177 + WL+R + +H + D + + F+G T+ D AL Sbjct: 135 ADAEEWWLRRAASAGLDIHKIHTTELDDAAGNRHDKQRIRHARTRFDGTATVTDPDALRT 194 Query: 178 LVQQGIGPAKSMGCGLLSLAP 198 V GIG KS GCGLLSLAP Sbjct: 195 CVTTGIGRGKSYGCGLLSLAP 215 >UniRef50_A9HLC4 CRISPR-associated protein, Cse3 family n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HLC4_GLUDA Length = 228 Score = 144 bits (363), Expect = 2e-33, Method: Composition-based stats. Identities = 48/212 (22%), Positives = 83/212 (39%), Gaps = 23/212 (10%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+++++ R H LW LF + PD RDFL+ E ++ SA+ PV Sbjct: 20 LARLLVPDGEGRQHAAAHHLLWALFGDDPDRTRDFLWR-----QMEAGRFMVLSAREPVD 74 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQ 122 + ++T+ + L+ G L F LRAN + + ++ + + E Sbjct: 75 SHGLFDVETRPFDPLLKEGDRLRFLLRANATVDRKTPGRTRSQRHDVVMDALHRRSQREG 134 Query: 123 IA------------WLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCF-----EG 165 W+ R+ G A P+ + +SG V F G Sbjct: 135 AEARDSMIADALETWMGRQ-GVRAGFAPASPLVIEGRDVLRIPRSGGRGIVSFGVVNLTG 193 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 + + A +D + QG G A++ GCGL+ + Sbjct: 194 EVRVTAPDAFLDSLMQGFGRARAFGCGLMLIR 225 >UniRef50_Q1J366 CRISPR-associated protein, CT1974 n=2 Tax=Deinococci RepID=Q1J366_DEIGD Length = 211 Score = 143 bits (360), Expect = 4e-33, Method: Composition-based stats. Identities = 70/221 (31%), Positives = 95/221 (42%), Gaps = 37/221 (16%) Query: 1 MYLSKVII---ARAWSRDL---YQLHQGLWHLFPNR-------PDAARDFLFHVEKRNTP 47 +YLS++ R +RDL Y LHQ L F PD R L+ E R T Sbjct: 4 LYLSRLRFEDRDRRTARDLASPYALHQTLRWAFAGAGVEGAPLPDGERA-LWRQEDRAT- 61 Query: 48 EGCHVLLQSAQMPVSTAV---------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD 98 +L+QS P A+ +KT + L G PL FRLRAN LD Sbjct: 62 ----LLVQSLTAPDWEALNARHPGSLRGWEVKTVDLAPALTPGRPLRFRLRANVTVRKLD 117 Query: 99 NQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKS-GK 157 + R R + EQ+ WL R+ I + G + Sbjct: 118 EKGRS--------RRHAVRGPHEQLEWLSRQGERCGFAVLAADIVHSGTVKTRKGSATIT 169 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + TV FEG+L + D AL++ V+ G+G AK++GCGLLSL P Sbjct: 170 LHTVTFEGILRVTDPAALLEAVRGGLGHAKALGCGLLSLGP 210 >UniRef50_C9M2Y7 CRISPR-associated protein n=3 Tax=Lactobacillus RepID=C9M2Y7_LACHE Length = 217 Score = 143 bits (360), Expect = 5e-33, Method: Composition-based stats. Identities = 56/224 (25%), Positives = 81/224 (36%), Gaps = 34/224 (15%) Query: 1 MYLSKV---IIARAWSRDLYQL---HQGLWHLFPNRPDAARD--FLFHVEKRNTPEGCHV 52 MYLS+V R DL L H + FP+ L+ +++ + ++ Sbjct: 1 MYLSRVEIDTNDRQKISDLTHLGSYHNWVEQSFPDEVKQGTRLRHLWRIDEFS--NKKYL 58 Query: 53 LLQSAQMP--------VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 LL S P A + QL G FRL ANP I D++ Sbjct: 59 LLVSKNKPKLNNLERYGVPYTAATKDYDRFLNQLVEGKKYRFRLTANPTYRITDSKSG-- 116 Query: 105 SKGNIKRCRVPLIKEAEQIAWL---QRKLGNAARVEDVHPI------SERPQYFSGDGKS 155 K VP I +Q WL +K G + + P+ Sbjct: 117 -----KSRVVPHITILQQTNWLLERTKKHGFEIVRDSEEVYKLNISERDWPRLRRKGNHL 171 Query: 156 GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 K+ V F+G+L I D + GIG K+ G GLL++ PL Sbjct: 172 IKLSRVTFDGILQITDLSKFKLALINGIGREKAYGMGLLTVIPL 215 >UniRef50_D1A5U1 CRISPR-associated protein, Cse3 family n=2 Tax=Actinomycetales RepID=D1A5U1_THECD Length = 229 Score = 142 bits (359), Expect = 6e-33, Method: Composition-based stats. Identities = 52/236 (22%), Positives = 86/236 (36%), Gaps = 48/236 (20%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLF---PNRPDAARDFLFHVEKRNT----- 46 MYL++ AR LH + F P + D L+ +++ Sbjct: 1 MYLTRFRFNTARVTARRILSSPQMLHAAVMSSFATPPVQEDDGPRVLWRIDRNGKSETYL 60 Query: 47 -------PEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN 99 P+ H++ Q+ +T +L G FRL ANP+ T Sbjct: 61 YIVSPLKPDLTHLVEQAGWP--TTGTWQTYDYGPFLSRLAKGEEWAFRLTANPVHT---- 114 Query: 100 QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR----------------VEDVHPIS 143 +R D++ + Q+ WL ++ A V ++ Sbjct: 115 ARRNDTEP---TKITAHVGMRHQMQWLLQRQEAAGFRVVEKPRERQLIPGVDVHELVIRE 171 Query: 144 ERPQYFSGDG--KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 R F G + + TV F+G L + D AL + +G+G AK+ GCGL++LA Sbjct: 172 RRHLEFRKRGNSRPVTLVTVTFDGRLEVTDPDALRRTLTRGLGRAKAYGCGLMTLA 227 >UniRef50_Q47PI8 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobifida fusca YX RepID=Q47PI8_THEFY Length = 207 Score = 140 bits (354), Expect = 2e-32, Method: Composition-based stats. Identities = 48/216 (22%), Positives = 75/216 (34%), Gaps = 45/216 (20%) Query: 19 LHQGLWHLFP----------------NRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 +H + FP +R A FL+ + P+ H++ Q A P Sbjct: 1 MHAAVMSSFPTLLPSDTDGPRVLWRIDRTSRAEVFLY-IVSPPKPDLTHLVEQ-AGWPTQ 58 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQ 122 + +L G FRL ANP+ +I + + Q Sbjct: 59 PTWES-YDYTPFLSRLAKGDVWAFRLTANPVHSIRRKAGEPTK-------LTAHLTQRYQ 110 Query: 123 IAWLQRKLGNAARVEDVHPISERP-------------------QYFSGDGKSGKIQTVCF 163 WL ++ A P +R + G+ + TV F Sbjct: 111 KKWLLQRQDAAGFRVVEKPAEKRRLPEGDEHELIVHNRRDWNFSKGARKGRPVSLVTVTF 170 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +G L + D AL + GIG AK+ GCGL++LAP+ Sbjct: 171 DGRLEVTDPDALRRALISGIGRAKAYGCGLMTLAPV 206 >UniRef50_Q03C59 CRISPR-associated protein n=3 Tax=Lactobacillus RepID=Q03C59_LACC3 Length = 215 Score = 140 bits (354), Expect = 2e-32, Method: Composition-based stats. Identities = 53/222 (23%), Positives = 83/222 (37%), Gaps = 34/222 (15%) Query: 1 MYLSKVIIARAW------SRDLYQLHQGLWHLFPNR--PDAARDFLFHVEKRNTPEGCHV 52 MYLS+V + L H + FP L+ ++ N + ++ Sbjct: 1 MYLSRVQVNTNDHQIFKHLTHLGAYHDWVKRSFPREIAAGTRLRHLWRLDSLNGRD--YL 58 Query: 53 LLQSAQMP-----VSTAVATVIKTKQVEF---QLQVGVPLYFRLRANPIKTILDNQKRLD 104 L+ S P VA +TK + L+ G L FRL ANP + I +R Sbjct: 59 LVLSPDAPELAQLARYGVAGTAQTKDYDPFVTALRQGQRLRFRLTANPTRAIATPGQR-- 116 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVH--------PISERPQYFSGDGKSG 156 P + A+Q+AWL + + + P GK Sbjct: 117 ------GHVAPHVTVAQQMAWLSERAAALGFELPIDDDGPQFQIVGRDYPALRRAQGKPV 170 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 ++ V FEG L ++D + + GIG K+ G GLL++ P Sbjct: 171 RLSRVSFEGTLVVSDLVRFKETLATGIGREKAFGMGLLTVIP 212 >UniRef50_D1A6Q6 CRISPR-associated protein, Cse3 family n=5 Tax=Actinomycetales RepID=D1A6Q6_THECD Length = 214 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 61/202 (30%), Positives = 92/202 (45%), Gaps = 20/202 (9%) Query: 9 ARAWSRDLYQLHQGLWHLFPNR--PDAARD--FLFHVEKRNTPEGCHVLLQS------AQ 58 AR D+ +LH+ + LFP+ P+A R LF +E+R P G +L+QS + Sbjct: 19 ARDDLGDVVRLHRRIMSLFPDGLGPEARRRAAVLFRLEER--PTGTSILMQSSIEPALEK 76 Query: 59 MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 +P S A + L+ GV +++R+ AN + + R + G K+ VPL Sbjct: 77 LPASYGKARCKSLAPLLNGLREGVNVHYRIVANATRKLG----RNTTAGRPKQV-VPLHG 131 Query: 119 EAEQIAWLQRKLGNAARVEDVHPIS--ERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 AE W +R+ A V + D F+G T+ D ALI Sbjct: 132 -AEADEWWRRQADAAGLVLRSLHSRQLDTGTGRRSDNNRVTHARTQFDGTATVTDPKALI 190 Query: 177 DLVQQGIGPAKSMGCGLLSLAP 198 D + GIG K+ GCGLL++AP Sbjct: 191 DRIHAGIGRGKAYGCGLLTIAP 212 >UniRef50_C7MTM6 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTM6_SACVD Length = 241 Score = 138 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 51/235 (21%), Positives = 84/235 (35%), Gaps = 39/235 (16%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEK------------ 43 +LS++ I R + ++ H + P+ P+A R L+ + Sbjct: 3 FLSRIRINPFRQKSRELLANPHKTHGAVLAGLPD-PEAERP-LWRWDTGRERRPYLLVLT 60 Query: 44 RNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRL 103 + H++ Q + QL G FR+ ANP++ + + Sbjct: 61 HTVADWTHLVEQCGWPAADGDHVITRDYTPLIRQLGEGREFAFRVTANPVQNVPAPAQES 120 Query: 104 DSKGNIKRCRV---PLIKEAEQIAWLQRKLGNAAR--------------VEDVHPISERP 146 + + K R A Q W + V D+ + Sbjct: 121 TPEPSAKPGRAVRKGHRTAAHQQRWFLERAERWGFQVPPALLDDPEADDVPDMRITQRQR 180 Query: 147 QYFSGD--GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+ GK + T FEG L I D+ + +G+GPAK+ GCGLL+LAPL Sbjct: 181 LSFAKRKGGKPVILTTATFEGRLRITDSELFTRTLLRGLGPAKAYGCGLLTLAPL 235 >UniRef50_Q0RTG6 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RTG6_FRAAA Length = 278 Score = 137 bits (346), Expect = 2e-31, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 85/240 (35%), Gaps = 43/240 (17%) Query: 2 YLSKVII------ARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRN---------- 45 YLS+V + A++ R+ ++H + +P R L+ +E Sbjct: 29 YLSRVWLNPLRTGAQSLLRNPERMHAAVLGGLTRQPVTER-VLWRLETGRPHRAEVLILT 87 Query: 46 --TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN---- 99 P H++ Q+ A V + + ++Q G FRLRANP+ Sbjct: 88 ESRPSWEHLIEQAGWPNAEDPQALVRDYQPLLDRIQAGREFAFRLRANPVAATRQPTSPS 147 Query: 100 --QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDG---- 153 QK + + RV +Q+AW ++ Q + + Sbjct: 148 VAQKERLAGPRPRGVRVAHRTAGQQLAWFTDRVDRWGFTPLTTETGPAVQLNARERLTFR 207 Query: 154 --------------KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + T F+G L + D + G+G AK+ GCGLL+LAPL Sbjct: 208 KRPPDGGNGGKNKGHQVVLSTATFDGALRVVDPDLARRALLSGVGAAKAYGCGLLTLAPL 267 >UniRef50_A8LYZ8 CRISPR-associated protein, Cse3 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ8_SALAI Length = 206 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 57/215 (26%), Positives = 89/215 (41%), Gaps = 30/215 (13%) Query: 2 YLSKVII------ARAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCH 51 +L+++ + AR RD LH+ + L P+ +P LF ++ +T G Sbjct: 4 WLTRIALDLRHSAARRDLRDTTALHRRVMSLVPDGLGEQPRHHAGVLFRLD--HTTTGPM 61 Query: 52 VLLQSA------QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDS 105 +L+Q+ ++P A + L G+ +++R+ AN K Sbjct: 62 LLVQTTLPPDPNRLPDGYAAVDTRDVSPLLKALTNGMAMHYRIAANASKRAW-------- 113 Query: 106 KGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQ-YFSGDGKSGKIQTVCFE 164 KGN V L + + W QRK D+ + +PQ G + FE Sbjct: 114 KGNSAGKVVALSGQQAE-QWWQRKAEATGL--DLRHLRAQPQPAARGRAIPVRHAITLFE 170 Query: 165 GVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 G I DA + V GIG +S GCGLLSLAP+ Sbjct: 171 GQAVITDADQVRAAVLAGIGRGRSFGCGLLSLAPM 205 >UniRef50_C7QEM3 CRISPR-associated protein, Cse3 family n=9 Tax=Actinomycetales RepID=C7QEM3_CATAD Length = 236 Score = 135 bits (340), Expect = 8e-31, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 85/243 (34%), Gaps = 54/243 (22%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLFPNRP----DAARDFLFHVE-------- 42 MYL++ AR LH + F N P + L+ ++ Sbjct: 1 MYLTRFRFNTARTGARRLLTSPQILHAAVMQSFANVPALPDGNSPRVLWRLDRNANNQVL 60 Query: 43 ----KRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD 98 + P+ H++ Q+ +T +L G FRL ANP+ +I Sbjct: 61 LYIVSPDRPDLTHIVEQAGWP--TTGSWDSFAYAPFLDKLTAGDIWTFRLTANPVHSIR- 117 Query: 99 NQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPIS--------------- 143 ++ R I Q+ WL ++ A P Sbjct: 118 ------TRDGEPTKRTAHITVRHQLGWLLKQQERAGFTICEQPKELPRPTDMDEYQVVVH 171 Query: 144 -ERPQYFSGDG-------KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLS 195 R F+ + +I TV ++G L I+D + ++ G+G AK+ GCGL++ Sbjct: 172 DRRSLDFTKKDPARSSKINNVQILTVTYDGRLRIDDPDKVRAVLTTGLGKAKAYGCGLMT 231 Query: 196 LAP 198 LAP Sbjct: 232 LAP 234 >UniRef50_Q0BSC8 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC8_GRABC Length = 227 Score = 135 bits (340), Expect = 9e-31, Method: Composition-based stats. Identities = 46/200 (23%), Positives = 80/200 (40%), Gaps = 17/200 (8%) Query: 10 RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVI 69 RA + H LW +F + + RDFL+ E+ + L SA+ P+ + + Sbjct: 31 RASGERVSAQHHLLWSVFADSEERKRDFLWREERDGS-----FLTLSARPPLQSDLFQPH 85 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILDNQK------RLDSKGNIKRC-RVPLIKEAEQ 122 + K L G L F LRAN + ++ +D+ +R R I + Sbjct: 86 RIKSYAPDLAPGARLEFLLRANATRMKRGGKREDVVKAPIDALEQSERAERRMEIASSAG 145 Query: 123 IAWLQRKLGNAARVEDVHPISER-----PQYFSGDGKSGKIQTVCFEGVLTINDAPALID 177 AWL+++ + + P+ + D + + + G L + D + Sbjct: 146 KAWLEQQGEKSGFRVITAIAEDYRQLSLPRLGAIDRNAMTLGILDLSGHLEMTDPALFLT 205 Query: 178 LVQQGIGPAKSMGCGLLSLA 197 + QG G AKS GCGL+ + Sbjct: 206 NLAQGFGRAKSFGCGLMIIR 225 >UniRef50_C0VRW4 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51867 RepID=C0VRW4_9CORY Length = 220 Score = 131 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 52/210 (24%), Positives = 84/210 (40%), Gaps = 30/210 (14%) Query: 10 RAWSRDLYQLHQGLWHLFP-NRPDAARDFLFHVEKRN--------TPEGCHVLLQSAQMP 60 R + +H + LFP + P L+ +++ + PE + ++ Sbjct: 17 RKVLTNPEAMHAEVRGLFPPDLPSDNGRVLWRLDQHDNEHILYIVGPERPDTAELADRLG 76 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 ST A ++ L G F L ANP ++ KR S VPL + Sbjct: 77 WSTRPAQTADYDKLLSSLAKGQQWCFELLANPSISLKTGGKRGKS--------VPLARID 128 Query: 121 EQIAWLQRKLGNAAR---------VEDVHPISERPQYFSGDGKSGK----IQTVCFEGVL 167 +QI WL ++ D+ + + FS + + K + TV FEG L Sbjct: 129 QQIDWLLQRSEKNGFKVLPQGDSAEPDLRIANRKVMRFSKNPRDHKRTVALTTVRFEGTL 188 Query: 168 TINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 + DA AL + QGIG ++ G GL++LA Sbjct: 189 EVTDAEALRATLTQGIGKGRAYGLGLMTLA 218 >UniRef50_B1VIX9 CRISPR-associated protein n=6 Tax=Actinomycetales RepID=B1VIX9_CORU7 Length = 234 Score = 131 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 55/228 (24%), Positives = 78/228 (34%), Gaps = 49/228 (21%) Query: 10 RAWSRDLYQLHQGLWHLFPNRPDAA-RDFLFHVEK-----------RNTPEGCHVLLQSA 57 R D ++H + FP D + L+ V+ P G ++ Q+ Sbjct: 17 RKLLSDPQRMHAAVRAAFPPELDESDARVLWRVDPGEHEHVLYVVGPEKPTGAVLVEQAG 76 Query: 58 QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLI 117 T A + +L G F L ANP R K + Sbjct: 77 W---DTLPAQTADYSRFLGKLTRGQRWRFELVANPTYAEPRKGGRGKVK--------AHV 125 Query: 118 KEAEQIAWLQRKL-----GNAARVEDVHPISERPQY---------------------FSG 151 QI WL RK G A R++D ER ++ G Sbjct: 126 SVRHQIGWLYRKADAAGFGLAPRLDDEVSDEERSRWSEFDAPQVTERWTDVFHRNKAGGG 185 Query: 152 DGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 G+ +I F G L + D L + QGIG A+ GCGLL+LAP+ Sbjct: 186 RGRPVRIAKARFTGTLEVTDPELLRQALAQGIGRARGYGCGLLTLAPI 233 >UniRef50_D0WFC7 CRISPR-associated protein, Cse3 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC7_9ACTN Length = 255 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 52/251 (20%), Positives = 83/251 (33%), Gaps = 54/251 (21%) Query: 2 YLSKVII------ARAWSRDLYQLHQGLWHLFPNR---PDAARDFLFHVEKRNTPEGCHV 52 YL++ I AR YQ+H + FP + L+ V+ + Sbjct: 3 YLTRFPINKTRRDARRLLASPYQMHAAIAGSFPVIHCLDSGKKRVLWRVDASEDGS-ARL 61 Query: 53 LLQSA------------QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 + S P ++++G FRL ANP+ + Sbjct: 62 YIVSPDKPSLVGLDEQIGWPDLPQQWETRSYDTFLSRIEIGQEYAFRLFANPVLSRSTRG 121 Query: 101 KRLDSKGNI-KRCRVPLIKEAEQIAWLQRK---LGNAARVED------------------ 138 R + K R+ + +Q AWL K LG+ V + Sbjct: 122 GRTVPRNEKGKPKRIGHLTVLQQAAWLIGKDAYLGSGLEVPELFAHQEWNRAQRNGFEVL 181 Query: 139 ----------VHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKS 188 V ++ + + T F+G L ++D L + GIG AK Sbjct: 182 TNLDGTARLIVSHSGKQKLRSGRESCPITLSTAQFDGFLRVSDPDLLRSALVNGIGHAKG 241 Query: 189 MGCGLLSLAPL 199 GCGLL+LAP+ Sbjct: 242 FGCGLLTLAPM 252 >UniRef50_C7JIG8 CRISPR-associated protein Cse3 n=8 Tax=Acetobacter pasteurianus RepID=C7JIG8_ACEP3 Length = 229 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 74/213 (34%), Gaps = 24/213 (11%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+ +++ + R H LW LF + RDFL+ E H ++ SA+ PV Sbjct: 20 LAGLLVPQGEGRQHGAAHHLLWVLFGDDSSRIRDFLWR-----QTEPGHFMILSARKPVD 74 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK------------GNIK 110 + I++++ +L+ G L F LR N ++ + + Sbjct: 75 SHRLFEIESREFTPKLREGNRLRFLLRVNATVDRKVPGRKRSQRHDVVMDALYKLPAKER 134 Query: 111 RCRVPLIKEAEQIAWLQRKLGNAARVEDVHPI------SERPQYFSGDGKSGKIQTVCFE 164 + AWL R+ + G GK+ V Sbjct: 135 AAARESLVPTAMEAWLARQGHRTGFELKEGKLAIESCDVLHIPRAQGQGKA-TFGVVDVT 193 Query: 165 GVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 G L + + QG G A++ GCGL+ + Sbjct: 194 GELCVRTPDLFTQALMQGFGRARAFGCGLMLVR 226 >UniRef50_B6IWM2 CRISPR-associated protein, CT1974 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM2_RHOCS Length = 262 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 53/211 (25%), Positives = 81/211 (38%), Gaps = 28/211 (13%) Query: 13 SRDLYQLHQGLWHLFPNRPDAARD--FLFHVEKRNTPEGCHVLLQSAQMPVST-AVATVI 69 +D H+ LW LFP+RP A R+ FLFHVE +++S P I Sbjct: 53 RQDGQFAHRMLWTLFPDRPTARREGLFLFHVE---GTRPFSAIVRSRVPPEDGLGGIWTI 109 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILDNQK-------------RLDSKGNIKRCRVPL 116 T+ + L G+ L F LRA + + R + + Sbjct: 110 TTRPFDPALAPGLTLRFHLRAVASRWQPRPGERRGRRQDVIVAAWRDLPEEQRTPENLEK 169 Query: 117 IKEAEQIAWLQR---KLGNAARVEDVHPISERPQYFS------GDGKSGKIQTVCFEGVL 167 E + WL R + G A V + G +S + V +EG+L Sbjct: 170 TAEHAALDWLARQGRRGGFAPVEGAVDVLDYDRASLRAGAKLGGRDRSIRFGAVTYEGLL 229 Query: 168 TINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 T+ D A + QG+G ++ G GL+ +AP Sbjct: 230 TVTDPQAFRATLVQGLGAGRAYGNGLMQIAP 260 >UniRef50_C7MTL5 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTL5_SACVD Length = 207 Score = 129 bits (324), Expect = 7e-29, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 81/208 (38%), Gaps = 28/208 (13%) Query: 11 AWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVA 66 + D LH+ + L P+ + + LF E +T G VL Q + P +A Sbjct: 4 RGTLDGGALHRDIMRLAPDALGNQARKEANVLFRAE--HTQRGLQVLAQLSCAPRVDNLA 61 Query: 67 -----TVIKTKQVEF---QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 + + +E + G + +R+ ANP K RL + K+ R+ ++ Sbjct: 62 PDFAHGTPECRNIESLVSSMHSGTRVRYRIDANPTK-------RLGNSAGDKKGRLAVLH 114 Query: 119 EAEQIAWLQRKLGNAARVEDVHPISERPQYFS-------GDGKSGKIQTVCFEGVLTIND 171 A+ W R+ + S P G ++ FEG + D Sbjct: 115 GADAAEWWHRRAAESGLELLSATASAMPDILGSRNRDRRGRCRATSHGVTRFEGFAVVAD 174 Query: 172 APALIDLVQQGIGPAKSMGCGLLSLAPL 199 + V +GIG A++ GCGLLS+ P+ Sbjct: 175 PGKVRSAVVEGIGRARTYGCGLLSIVPV 202 >UniRef50_C2KP48 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP48_9ACTO Length = 212 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 51/197 (25%), Positives = 82/197 (41%), Gaps = 23/197 (11%) Query: 20 HQGLWHLFPN-----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQV 74 H+ + LFP P AA LF +E ++QS P + ++ Sbjct: 22 HRAVMDLFPEFEGEQNPRAAASILFRLETLPGL-APRFVVQSDISPAVDKLPKGVEPLGY 80 Query: 75 E-FQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIA-----WLQR 128 +L G P+ FRL NP+ + + D + P KE + A WL + Sbjct: 81 TFPELGEGTPVSFRLAVNPVI---RHSQGKDGQPARTTTVAPFGKEPAESAASLETWLSQ 137 Query: 129 KLGNAARVEDVHPISERPQYFSGD------GKSGKIQTVCFEGVLTINDAPALIDLVQQG 182 KL + + +V+ I+ + + K +I +GV + DA L +++ G Sbjct: 138 KL--SPGLAEVNIINAQREIIGDGYPNQDISKIKRIVIDLVDGVACVGDAKTLNKMLRSG 195 Query: 183 IGPAKSMGCGLLSLAPL 199 +G AKS GCGLLS+ L Sbjct: 196 VGRAKSYGCGLLSVKQL 212 >UniRef50_C1XG03 CRISPR-associated protein, Cse3 family n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XG03_MEIRU Length = 156 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 53/160 (33%), Positives = 75/160 (46%), Gaps = 16/160 (10%) Query: 52 VLLQSAQMPVSTAV--------ATVIKTKQV-EFQLQVGVPLYFRLRANPIKTILDNQKR 102 +L+QSA MP + A +K + LQ L FRLRANP T D Sbjct: 1 MLVQSAGMPDWEKLVQRFPGYFAQPPASKPIPLEHLQPAQVLRFRLRANPTVTKKDPNNP 60 Query: 103 LDSKGNIKRCRVPLIKEAEQIAWLQRKL--GNAARVEDVHPISERPQYFSGDGK-SGKIQ 159 + KR R L EQ+ WL R+ G + + + SER + + DG +Q Sbjct: 61 ----DSKKRKRHGLKTLEEQLEWLHRQGAKGGFSVLGAMVVQSERVRMYKHDGSGPIVLQ 116 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +V +EG L I D A + G+G AK++G GLLS+A + Sbjct: 117 SVLYEGHLKITDLEAFKHTLAAGLGHAKALGFGLLSIAKV 156 >UniRef50_B0LU87 CRISPR-associated protein Cas3 n=2 Tax=Streptomyces RepID=B0LU87_9ACTO Length = 270 Score = 126 bits (318), Expect = 3e-28, Method: Composition-based stats. Identities = 54/266 (20%), Positives = 85/266 (31%), Gaps = 71/266 (26%) Query: 2 YLSKVIIA------RAWSRDLYQLH--------------QGLWHLFPNRPDAARDFLFHV 41 YLS++ I R + +H + LW + P+ P R LF V Sbjct: 3 YLSRIRINPLRKDSRKLLSNPRAVHGAVMGGLPNHKPDDRVLWRMDPDNPH--RPHLF-V 59 Query: 42 EKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 P+ H++ A V + QL VG FRL A+P++ K Sbjct: 60 LSPTRPDWTHIIQDCGWPDADGDHAAVRDYTPLLSQLAVGREFAFRLTASPVQNTATPTK 119 Query: 102 RLDSKG-----------NIKRCRVPLIKEAEQIAWLQRKLGNAAR--------------- 135 ++ I+ R+ A Q+ W + Sbjct: 120 ATPAQAARLTAHAEDGKRIRGFRMGHRTAAAQLDWFLTRTDRWGFDIPATRSDPTAPGIH 179 Query: 136 --------------------VEDVHPISERPQYFSGDGK--SGKIQTVCFEGVLTINDAP 173 +V + F +G ++ FEG L I D Sbjct: 180 APTPPTAPRPTSPPRPDPNPPYEVRITARHRHSFQKNGHGAHVVFRSATFEGRLRITDTD 239 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAPL 199 + G+GP+++ GCGLL+LAPL Sbjct: 240 RFTTSLLTGLGPSRAYGCGLLTLAPL 265 >UniRef50_A7BA62 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA62_9ACTO Length = 249 Score = 126 bits (318), Expect = 3e-28, Method: Composition-based stats. Identities = 50/253 (19%), Positives = 89/253 (35%), Gaps = 62/253 (24%) Query: 1 MYLSKVIIA--RAWSRDLYQ----LHQGLWHLFP----NRPDAARDFLFHVE-------- 42 MYL+++ + R ++ L + LH + + FP + P A R L+ ++ Sbjct: 1 MYLTRIYLNPHRRGAKQLMRSRQTLHAAVLNCFPPSVLDDPGAPR-VLWRLDRPPAVRGA 59 Query: 43 -KRNTPEGCHVLLQSAQMPVSTAVATV-----------IKTKQVEFQLQVGVPLYFRLRA 90 R C + + S P + + L G FRL Sbjct: 60 APRQGSPSCSLYISSPVAPDPSHIVEEAGYATEGGVVIRDMSSFLEGLWAGQRWGFRLCV 119 Query: 91 NPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAW-LQRKLGNAARVE------------ 137 NP +++++G K + + + +Q W L+R RV Sbjct: 120 NPT---FREGSQVNARGRKK--VLAHVTQDQQTQWVLERAEKCGFRVLTSAELGGELPVL 174 Query: 138 -------------DVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIG 184 ++ + F + + FEGVL + D A+ ++ GIG Sbjct: 175 EDSDGQRVDGANLLINGVERSIAEFKRGERRVTLGVATFEGVLEVTDPDAMRRVLTHGIG 234 Query: 185 PAKSMGCGLLSLA 197 K+ GCGL++LA Sbjct: 235 RGKAYGCGLMTLA 247 >UniRef50_C5V9N5 CRISPR-associated protein, Cse3 family n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N5_9CORY Length = 220 Score = 125 bits (315), Expect = 8e-28, Method: Composition-based stats. Identities = 53/220 (24%), Positives = 96/220 (43%), Gaps = 36/220 (16%) Query: 2 YLSKVIIARAWSRDLYQL-----------HQGLWHLFPNRPD----AARDFLFHVEKRNT 46 YL+K + A +R + H+ + LFP+ D + + LF E Sbjct: 6 YLTKFPVHVALARKPEKTQRWRVDDPEFRHRAVMGLFPDFEDNQARSRNNILFRYEFIP- 64 Query: 47 PEGCHVLLQSAQMPVSTAVATVIKTKQVE-FQLQVGVPLYFRLRANPIKTILDNQKRLDS 105 + + L+QS V+ + VI+TKQVE + G P+ FRL N + ++ +++ Sbjct: 65 GQAPYFLVQSDCDVVAPDLEGVIETKQVEYPSYENGTPIIFRLALNTV-----TRRTIET 119 Query: 106 KGNIKRCRVPL----------IKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKS 155 G + P+ + AE+ + KL A ++ + ++ Q S Sbjct: 120 NGRKREVITPVALQPLDAETGLNPAEKH--VAYKLSTA--LQGIEFLNHNRQVLQVPKVS 175 Query: 156 GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLS 195 +Q F+ + + ++ AL ++ GIG AK+ GCGLL+ Sbjct: 176 RALQIDTFDCMGVVTNSQALEHIMHAGIGRAKAYGCGLLT 215 >UniRef50_Q4JWK1 Putative uncharacterized protein n=2 Tax=Corynebacterium jeikeium RepID=Q4JWK1_CORJK Length = 224 Score = 125 bits (314), Expect = 8e-28, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 74/205 (36%), Gaps = 20/205 (9%) Query: 9 ARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVE---KRNTPEGCHVLLQSAQMPVSTAV 65 AR + +H + FP L+ + + +++ + P A Sbjct: 16 ARKLLGNPQAMHAAVLSCFPKEVSEKERILWRHDGKVRGADEHFVYIVGPDSCDPTKIAE 75 Query: 66 ATVIKTKQ-------VEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 T ++ + L G ++ + NP+ ++G + L+ Sbjct: 76 QTGSESDPQKASYNRLLEALADGQQWHYEVVLNPVAAKKAPGSPRGTRGKL----TALVG 131 Query: 119 EAEQIAWLQRKLGNAARVE-DVHPISERPQYFSG-----DGKSGKIQTVCFEGVLTINDA 172 EA Q+ W K + + + + FS G+ I TV + G L I+D Sbjct: 132 EAAQLEWFNTKAKSCGFTPLETLIVERKTLRFSKLAKNPKGRQVVIGTVRYRGTLQIDDV 191 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLA 197 + +GIG K+ GCGLL+LA Sbjct: 192 ETFKKSLVEGIGRGKAYGCGLLTLA 216 >UniRef50_Q2JH26 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JH26_FRASC Length = 275 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 86/267 (32%), Gaps = 70/267 (26%) Query: 2 YLSKVII------ARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRN---------- 45 YLS++ + A+A ++ ++H + +P R L+ +E Sbjct: 3 YLSRIWLNPLRTGAQALLKNPQRMHAAVLGGLSRQPVTER-VLWRLETGEGLRGADRPHR 61 Query: 46 ---------TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTI 96 TP H++ Q+ + V + + +L G FRLRAN + Sbjct: 62 AEVLVLTESTPSWEHLIEQAGWIHTDEPQVLVRDYQPLLDRLHTGREFRFRLRANTVSAT 121 Query: 97 LDN------QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQ--- 147 QK + + RV A Q WL ++ + P+ Sbjct: 122 RTPDNPSPAQKEHLAAPRPRGVRVGHRTAAHQTTWLTDRIDRWGFTLLTTADLDGPRNQP 181 Query: 148 -----------------------------------YFSGDGKSGKIQTVCFEGVLTINDA 172 G+ + T FEG L + D Sbjct: 182 DGPRNQPDGPGDTDEPAPALRLTARERLTFPKKAKNTEKTGRRVVLNTATFEGALRVTDP 241 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLAPL 199 + G+GPAK+ GCGL++LAPL Sbjct: 242 ARARATLLHGVGPAKAYGCGLITLAPL 268 >UniRef50_C7MQD7 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD7_SACVD Length = 197 Score = 123 bits (309), Expect = 3e-27, Method: Composition-based stats. Identities = 48/206 (23%), Positives = 81/206 (39%), Gaps = 21/206 (10%) Query: 2 YLSKVIIARAWSRDLYQLHQGLWHLF-PNRPDAARDFLFHVEKRNTPEGCHVLLQSA--- 57 YL+K+ ++ +RD+++ H+ L P + R G +L QSA Sbjct: 4 YLTKITTPKSVTRDIHRTHKILTTAVCPPNITTPGRVATRLLHRVERGGREILAQSATPL 63 Query: 58 ---QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 ++ +A + L G + +++ ANP+ R R Sbjct: 64 DPTRLEGGCVIAGTKLLDPLLDHLDNGTVVRYKITANPVHA-------------PNRVRR 110 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFS-GDGKSGKIQTVCFEGVLTINDAP 173 P+ +AW R V D + + + + +QT EGV TI D Sbjct: 111 PITDPDRILAWWHRTADRIGLVLDSTALLDTAKTSGMRRDQRVVVQTATMEGVATIRDVD 170 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAPL 199 + D + G+G A++ GCGLLS+ PL Sbjct: 171 TVRDAIVLGVGHARAYGCGLLSVVPL 196 >UniRef50_C7LYW5 CRISPR-associated protein, Cse3 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW5_ACIFD Length = 226 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 51/227 (22%), Positives = 85/227 (37%), Gaps = 37/227 (16%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHL----FPNRPDAARDFLFHVEKRNTPEGC 50 M+L+++ I A + R+ ++H + P ++ L+ V+ + P Sbjct: 1 MFLTRLYIDPQKQAALSVLRNPQRMHAIIAQATSASVPQEANSIGRTLWRVD-GDDPRVP 59 Query: 51 HVLLQSAQMPVSTAVATVI---------KTKQVEF---QLQVGVPLYFRLRANPIKTILD 98 + + SA P A + TK +L+ G FRL AN +++ Sbjct: 60 ILYVVSAVQPQFAHFAASVGQVVRGTDYDTKPYGPLLDRLETGQVYAFRLAANAVRSGRS 119 Query: 99 NQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV--------EDVHPISERPQYFS 150 + D+K R + +Q+ WL + DV R F Sbjct: 120 SSGSADTK------RHGHVTITQQLGWLLARSEQHGFTIRTGSTGEPDVAVTGGRRMVFR 173 Query: 151 GDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 G+ I F G L + D L + GIG A++ GCGLL+LA Sbjct: 174 RQGQRVTIALTEFMGHLEVLDRELLRRSLVTGIGHARAYGCGLLTLA 220 >UniRef50_D1YEE5 CRISPR system CASCADE complex protein CasE n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE5_PROAC Length = 221 Score = 120 bits (300), Expect = 4e-26, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 76/225 (33%), Gaps = 34/225 (15%) Query: 1 MYLSK--VIIARAWSRDLYQ----LHQGLWHLFP--NRPDAARDFLFHVEKRNTPEGCHV 52 M+L++ + +AR + L LH + FP L+ +++ + Sbjct: 1 MFLTQFDINVARRDAMRLLASPERLHAAVLGAFPPGQSVSNGARTLWRLDRGPARHDARL 60 Query: 53 LLQSAQMPV---------STAVATVIK--TKQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 ++ S P + A+ L+ G FR NP + Sbjct: 61 MIVSPLRPDLTALNEQAGWSNGASSRSANYDPFLQALRSGSTWRFRCTINPTTAVR---- 116 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAA--------RVEDVHPISERPQYFSGDG 153 + RV + +Q+ W ++ F Sbjct: 117 ---KSAGSRGQRVAEVTAEQQLTWFIGRVERHGYTVPVNDQGAPSAQVTRREILRFRRQR 173 Query: 154 KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + +GV+ I DA A + QGIGPAKS GCGL++LAP Sbjct: 174 STVTLAVTQVDGVIQIQDADAARLALVQGIGPAKSYGCGLMTLAP 218 >UniRef50_B5GY64 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY64_STRCL Length = 312 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 46/240 (19%), Positives = 78/240 (32%), Gaps = 65/240 (27%) Query: 21 QGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQV 80 + LW L + P + +V P+ HV+ ++ A + + +L V Sbjct: 69 RVLWRLDADDPHRPQ---LYVLTPGRPDWSHVVERAGWPDADGEHAVIRDCAPLIERLAV 125 Query: 81 GVPLYFRLRANPIKTILDNQKRLDSK----------GNIKRCRVPLIKEAEQIAWLQRKL 130 G FRL ANP++T + ++ + R+ A Q+ W R+ Sbjct: 126 GQEYAFRLTANPVQTTATPVRPTSAQEKRIAERVEGERPRGFRLAHRTAAHQLNWFLRRT 185 Query: 131 GNAAR--------------------------------------------------VEDVH 140 V +V Sbjct: 186 DGWGFAVPPSRTDPAAPGLDAASGLDAASGLDGASGGDGASGGDGGPDSVGARDPVREVR 245 Query: 141 PISERPQYFSGDGKS--GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + FS + + +EG+L + D L + GIGP+K+ GCGLL+LAP Sbjct: 246 ITARQRHTFSKGRRGTQVTFHSATYEGLLRVTDPELLAARLLGGIGPSKAYGCGLLTLAP 305 >UniRef50_C2GEY9 Putative uncharacterized protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY9_9CORY Length = 225 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 49/220 (22%), Positives = 87/220 (39%), Gaps = 23/220 (10%) Query: 2 YLSKVIIARAWSRDL----YQL------HQGLWHLFPN----RPDAARDFLFHVEKRNTP 47 +LSKV + +++ H+ + LF + +P + LF +E T Sbjct: 6 FLSKVPLHSLLMESPGTTYHRIASPTFRHRAVMGLFEDVDSVKPREKLNVLFRLETPTT- 64 Query: 48 EGCHVLLQSAQMPVSTAVATV--IKTKQVEF-QLQVGVPLYFRLRANPIKTILDNQKRLD 104 E ++L+QSA P A+ + ++ K++E G P+ FR+ N I+ Sbjct: 65 ETPYLLIQSAVSPSDEALMNISGLQCKEIELKAPTSGTPVAFRIAVNAIRRTTITIDPHK 124 Query: 105 SKGNIKRCRVPLIKEAE--QIAWLQRKLGNAARVEDVHPISERP---QYFSGDGKSGKIQ 159 + +K + W+ KL A V ++ +Q Sbjct: 125 RRTLVKPVELDGTDSPNPTISEWIAAKLEPALTELSVTNHLREVITDPRTKKKPRTMTVQ 184 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +GV + D +L ++ GIG K+ GCGLL++ PL Sbjct: 185 VDTIDGVARVADPVSLEKILSDGIGREKAYGCGLLTIRPL 224 >UniRef50_C4X9I8 Crispr-associated Cse3 family protein n=6 Tax=Gammaproteobacteria RepID=C4X9I8_KLEPN Length = 215 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 79/206 (38%), Gaps = 16/206 (7%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPN-RPDAAR-----DFLFHVEKRNTPEGCHVLLQS 56 L + + D Y LH+ ++ LF + R D + + + ++ G +L+ S Sbjct: 11 LDRAAVKALKISDAYSLHRVVYSLFADARTDREKCSHISSGIAYADQGGDFHGRKILIVS 70 Query: 57 AQMPVS--TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 ++P + + + +K + F+++ NP++ KR+ KG + Sbjct: 71 DRLPAAKVDGLYGEVISKSIPAAFLSHSRYRFQVQVNPVRKDKQTGKRVAVKGRADIAQW 130 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFS-GDGKSGKIQTVCFEGVLTINDAP 173 + + W G + + + F G+ + +G+LT+ D Sbjct: 131 FI--QRAASRW-----GFDVDLPGLQVEAMEVLQFKDKGGRQVTLGKATVQGLLTVTDRQ 183 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAPL 199 GIG ++ GCGLL + P+ Sbjct: 184 KFQHSFHHGIGKGRAFGCGLLQIVPV 209 >UniRef50_A8SDR6 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDR6_9FIRM Length = 195 Score = 116 bits (290), Expect = 5e-25, Method: Composition-based stats. Identities = 48/194 (24%), Positives = 71/194 (36%), Gaps = 23/194 (11%) Query: 16 LYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVA------TVI 69 +LH + F E ++LL S P + V Sbjct: 7 PQKLHGAVESAFAGERRRRL-----WRLDRLGERLYLLLLSEDAPELSGVVEQFGTGAAA 61 Query: 70 KTKQVEFQLQ---VGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL 126 +T+ + LQ G FRL ANP K+ D Q + Q WL Sbjct: 62 ETRSYDPLLQRVEPGSCWQFRLTANPTKSCKDTQN-----PAARGTVAAHCTTQYQKQWL 116 Query: 127 ---QRKLGNAARVEDVHPISERPQYFSGDG-KSGKIQTVCFEGVLTINDAPALIDLVQQG 182 K G A R E + Q+F+ G + + V +EGVL + DA L+ QG Sbjct: 117 LERAAKRGFALREEGFTVTRVQWQHFAKHGTRPVTLLAVTYEGVLQVTDAEQFRALLCQG 176 Query: 183 IGPAKSMGCGLLSL 196 +G K+ G GL+++ Sbjct: 177 MGRGKAYGLGLMTV 190 >UniRef50_Q47PJ5 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ5_THEFY Length = 232 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 53/215 (24%), Positives = 80/215 (37%), Gaps = 24/215 (11%) Query: 5 KVIIARAWSRDLYQLHQGLWHLFPN-------RPDAARDFLFHVEKRNTPEGCHVLLQS- 56 + RA R LH+ L L + P LF +E T ++L+QS Sbjct: 12 RYRQTRADFRTAGNLHRKLIRLSSDLGEERIANPRQQSGLLFRIE--ETRNELYLLVQSH 69 Query: 57 -----AQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKGN 108 ++ + +L G + +R+ A+P K + N +RL K Sbjct: 70 SPLRVDRLGPGYHGVQMRNLDPFLARLDKGSRVRYRIVASPTKRLGRSENNTQRLGLKEP 129 Query: 109 IKRCR---VPLIKEAEQIAWLQRKLGNAARVEDVHP---ISERPQYFSGDGKSGKIQTVC 162 K+ R L A + W R N + + R + + + V Sbjct: 130 PKKPREYTWALRGAAAEEWWHSRAAANGLELLSTYAQTLDDVRDPGTADRSRKIRHPAVR 189 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 F+G I+D A+ V GIG KS GCGLLSLA Sbjct: 190 FDGEAVISDVDAVRHAVLNGIGRGKSYGCGLLSLA 224 >UniRef50_C6HV94 CRISPR-associated protein, Cas3 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV94_9BACT Length = 221 Score = 113 bits (282), Expect = 4e-24, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 76/207 (36%), Gaps = 21/207 (10%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFL-----FHVEKRNTPEGCHVLLQSA 57 LS+ + RD Y LH+ ++ LF +R L + +K +L+ S Sbjct: 23 LSREDVRVLKIRDAYSLHKVVYGLFEDRRSKEEKSLVSSGILYADKGGDIHFRKLLILSD 82 Query: 58 QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLI 117 + P T I+T+ V F + NP K + N +P+I Sbjct: 83 RRPHQTPQFGKIETRPVFSSFLNCDHYLFEVIVNPSK-----------RDNHSGKIMPVI 131 Query: 118 KEAEQIAWLQRKLGNAARV----EDVHPISERPQYFSGD-GKSGKIQTVCFEGVLTINDA 172 W + G++ + + + ++ Q F G+ + +G + D Sbjct: 132 GRENIRQWFLDRAGDSWGLSVSPDSLEVVNAGVQKFEKQNGQFITHGSATLKGEFHVVDR 191 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLAPL 199 + + GIG K+ G GLL + P+ Sbjct: 192 ERFVKSFKNGIGRGKAFGFGLLQIVPV 218 >UniRef50_B3ENH7 CRISPR-associated protein, Cse3 family n=3 Tax=Chlorobiaceae RepID=B3ENH7_CHLPB Length = 208 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 40/197 (20%), Positives = 71/197 (36%), Gaps = 24/197 (12%) Query: 14 RDLYQLHQGLWHLFPNRPDAAR-------DFLFHVEKRNTPEGCHVLLQSAQMPVSTAVA 66 D Y LH+ ++ LF +R A FL+ +K G +L+ S + P Sbjct: 19 TDDYSLHRVVYSLFEDRRSEAEKNASIPSGFLY-ADKGGDSNGRLILMLSDREPRKPEH- 76 Query: 67 TVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL 126 +++K ++ + F + NP + + + R + + E W Sbjct: 77 GRLESKPIDETFLMFDRYRFSVVINPSR-----------RESKSRKIIAIRDRNEIAQWF 125 Query: 127 QRKL----GNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQG 182 +K G + + + + F G L + D I +QG Sbjct: 126 SQKAPASWGFTVNPVTLEVRTLQAKQFVKKEHCVTQNGAELTGELDVVDRTLFIKSFKQG 185 Query: 183 IGPAKSMGCGLLSLAPL 199 IG ++ G GLL +APL Sbjct: 186 IGRGRAFGFGLLQIAPL 202 >UniRef50_B0S4B5 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B5_FINM2 Length = 211 Score = 106 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 1 MYLSKVIIARAWS------RDLYQLHQGLWHLFPNR--PDAARDFLFHVEKRNTPEGCHV 52 MYLS+V++ + +L H+ + FPN + L+ ++ N ++ Sbjct: 1 MYLSRVMLKDNQNYRNYVYTNLQYFHKWVEESFPNEFKENIRTRKLWRLDSFN--NKNYL 58 Query: 53 LLQSAQMPVST--------AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 ++ S Q P A + Q + F+++ NP+ ++ R Sbjct: 59 VMLSEQKPDIEMFERNGIKGTAKITNYDQFLDDISENKLYRFKIKYNPVSSV---YVRNS 115 Query: 105 SKGNIKRCRVPLIKEAEQIAWL-QRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCF 163 +G+ CR + ++I +L R N V + I D + + Sbjct: 116 KRGDNFICR----NDEDKIKYLIDRSEKNGFEVLECTLIQSGYDKLVKDNQKAPVNKAVV 171 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 EGVL + D +++ G G K+ G GL+++ P+ Sbjct: 172 EGVLAVKDVDKFKEILINGFGKRKAYGYGLMTILPI 207 >UniRef50_UPI0001B51C2A CRISPR-associated protein, Cse3 family n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2A Length = 252 Score = 103 bits (257), Expect = 3e-21, Method: Composition-based stats. Identities = 47/211 (22%), Positives = 73/211 (34%), Gaps = 31/211 (14%) Query: 15 DLYQLHQGLWHLF----PNRPDAAR---DFLFHVEKRNTPEGCHVLLQSAQMPVSTAVA- 66 D + +H+ + F P+ DA R L + +++QS P TA+ Sbjct: 28 DAHHMHRIVMGGFKGWVPDGADAPRAQVGVLSTWSADLATQTLLIIVQSRVRPDWTAIPR 87 Query: 67 ----TVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQ 122 I + V+ + +G FR +P KT D L KG R+P + A Sbjct: 88 AALCAPIDVRAVDETISIGDRFTFRTVVSPTKTRAD----LKQKGKPVIKRLPHVLPAHV 143 Query: 123 IAWLQRKLGNAAR---------------VEDVHPISERPQYFSGDGKSGKIQTVCFEGVL 167 W + +L A I P + K KI G L Sbjct: 144 RTWFEDRLQPAGTPATALSGIPRLGADAERTTLAIRMLPPVSTDHHKGLKITRAEIRGTL 203 Query: 168 TINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 T+ D + G+G A++ CGL+ P Sbjct: 204 TVTDPATFTKTITTGLGRARAYSCGLILTRP 234 >UniRef50_Q6NEQ5 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ5_CORDI Length = 228 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 74/197 (37%), Gaps = 28/197 (14%) Query: 20 HQGLWHLFPNR----PDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVAT------VI 69 H+ + LFP+ P + D LF E+ + L+QS P + Sbjct: 37 HRAVMALFPDTDSPLPRKSVDILFRFEQLA-GQPPFFLIQSTVAPKQVDNLDSEVQHRTV 95 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQ------- 122 ++ + + FR+ N I + + I VP + + Sbjct: 96 SLRRFSPK----SAVRFRISIN---GIRRQTTEHNGRKRITTSPVPFDSDEKAPSHITRM 148 Query: 123 IAWLQRKLGNAARVEDVHPISERP---QYFSGDGKSGKIQTVCFEGVLTINDAPALIDLV 179 W+Q+KL A R ++ ++ G S IQ +G + D L +L+ Sbjct: 149 TPWVQKKLNGALRNVEILNHQREVIGTKHRGGKAASMTIQIDTVDGFGIVEDPELLNELI 208 Query: 180 QQGIGPAKSMGCGLLSL 196 G+G AK+ GCGLLS+ Sbjct: 209 LHGVGRAKAYGCGLLSV 225 >UniRef50_C2CRP4 Putative uncharacterized protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CRP4_CORST Length = 185 Score = 98.8 bits (245), Expect = 8e-20, Method: Composition-based stats. Identities = 49/201 (24%), Positives = 80/201 (39%), Gaps = 24/201 (11%) Query: 1 MYLSKVIIAR--AWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ 58 M + + + R + D +H+ L H + L+ +P+ H++++ Sbjct: 1 MLTTTLSLNRTTRIAFDSQAVHRTLLHA-----TDGKPVLW-----ASPDTKHLVVRHET 50 Query: 59 MPVSTAVATVIKTKQVEFQLQ---VGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVP 115 PV A T+ V Q + + L NP IL + + +G K+ P Sbjct: 51 -PVDWIKAIRGVTQAVTLPTQIPAASARINYALIGNP---ILSQYQGPNKRG--KKTPAP 104 Query: 116 LIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPAL 175 K E WLQR++GNA + + P + F G T+ D AL Sbjct: 105 PEKWNE---WLQRRVGNALNLHSIDGTRLPPAKGKKPDMQTIHHRILFTGRATVKDQDAL 161 Query: 176 IDLVQQGIGPAKSMGCGLLSL 196 L++ GIG K+ GCGLL + Sbjct: 162 QTLMESGIGSGKAYGCGLLIV 182 >UniRef50_C0W6T9 Possible CRISPR-associated protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6T9_9ACTO Length = 129 Score = 92.3 bits (228), Expect = 9e-18, Method: Composition-based stats. Identities = 33/134 (24%), Positives = 49/134 (36%), Gaps = 25/134 (18%) Query: 84 LYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKL-----------GN 132 FRL ANP + I ++ R + +Q WL + G Sbjct: 2 WAFRLAANPSRAISQGI-------GVRGKRQGHVTLEQQRQWLLSRAAAHGFRMLPVNGA 54 Query: 133 AARVEDVHPISERPQYF-------SGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGP 185 A V + R + G I FEG+L + D L + GIG Sbjct: 55 AESVGSSLTVVRRARPVFGRSNPEQGRRDRVTINRTVFEGLLQVTDPDLLRTALISGIGR 114 Query: 186 AKSMGCGLLSLAPL 199 +K+ GCGL++LA + Sbjct: 115 SKAYGCGLMTLAKV 128 >UniRef50_Q3A5Z3 CRISPR-associated protein, Cse3 family n=2 Tax=Desulfuromonadales RepID=Q3A5Z3_PELCD Length = 299 Score = 89.2 bits (220), Expect = 8e-17, Method: Composition-based stats. Identities = 44/182 (24%), Positives = 67/182 (36%), Gaps = 24/182 (13%) Query: 1 MYLSKVIIARAWSR----------DLYQLHQGLWHLFPNRPDAARDFLFHVE-------- 42 MY S+V + R + Y LHQ LW LFP + R FLF E Sbjct: 1 MYFSRVQLQPEVQRSSQLSQVLTSNSYGLHQLLWDLFP--AEEKRSFLFREEIAKEQLKN 58 Query: 43 KRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK- 101 +R T + S P + +++K + G F+LRANPI K Sbjct: 59 QRRTKGESLFYIVSRHDPQTETPIFRVESKVYAPVISQGQQFAFKLRANPIVAKKKPGKK 118 Query: 102 ---RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKI 158 R D N +R + + I +QR+ + R + + + F + I Sbjct: 119 NSVRHDVVMNAQRRLLEELASCLGILDVQRQKKSVLRHRILTAWKDGEKRFCSERLREDI 178 Query: 159 QT 160 +T Sbjct: 179 RT 180 >UniRef50_B2N0R4 Putative uncharacterized protein n=1 Tax=Escherichia coli 53638 RepID=B2N0R4_ECOLX Length = 58 Score = 88.1 bits (217), Expect = 1e-16, Method: Composition-based stats. Identities = 48/51 (94%), Positives = 51/51 (100%) Query: 149 FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 FSG+GK+GKIQTVCFEGVLTINDAPALIDL+QQGIGPAKSMGCGLLSLAPL Sbjct: 8 FSGEGKNGKIQTVCFEGVLTINDAPALIDLLQQGIGPAKSMGCGLLSLAPL 58 >UniRef50_C7RP63 CRISPR-associated protein, Cse3 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP63_9PROT Length = 245 Score = 72.3 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 50/213 (23%), Positives = 78/213 (36%), Gaps = 35/213 (16%) Query: 17 YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHV--------LLQSAQMPVSTAVATV 68 Y LH L F + FH + L S P + Sbjct: 33 YALHALLSEAFGDLAPKP----FHYLGGRQGLLAYTAADLEMLRLNASLAPPDVARALGL 88 Query: 69 --IKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKR----LDSKGNIKRCRVPL---IKE 119 + + + G L F R P+ D ++R +G + +V + I E Sbjct: 89 DHLDARPFPTAWRTGQRLGFEARVRPVVRGKDGRERDAYLHAVEGTVDTGQVGVDGSIAE 148 Query: 120 AEQI--AWLQRKLGN--AARVEDVHPIS---ERPQYFSGDGKSGKIQT-------VCFEG 165 I WL + AA++ + H S R +G G++GK +T V F+G Sbjct: 149 RTAIYSDWLAAQFAFDGAAQIAEAHLDSFRLTRVLRKAGSGENGKRKTTNNAGPDVVFKG 208 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 L + D PA L+ +GIG ++ G G+L L P Sbjct: 209 HLQVRDPPAFNRLLGRGIGRHRAFGFGMLLLRP 241 >UniRef50_A8ZZ18 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZ18_DESOH Length = 273 Score = 71.9 bits (175), Expect = 1e-11, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 61/174 (35%), Gaps = 25/174 (14%) Query: 46 TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQL--QVGVPLYFRLRANPIKTILDNQKRL 103 + + L ++ + + + + K+ + L G L RL +LD R Sbjct: 105 DAQQTFLKLLCEELGLLSHLQGTPEKKEYKNVLLTHGGQRLDSRLT-----DLLDGDYRY 159 Query: 104 DSKGN-----IKRCRVPLIKEAEQI--AWLQRKLGNAARV--------EDVHPISERPQY 148 + + ++ L E + W+ ++ + + + Sbjct: 160 AERLDQKLTPREKLEWALRAEIDNTLDEWMAKQGKQNGFTIVKDTHGNLKLQNSAYQWHA 219 Query: 149 FSGD---GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +G GK V F G L ++D A + GIG +K+ GCGL+ + + Sbjct: 220 LTGKAAKGKKSGFSAVDFTGDLVVSDVEAFKKSLFNGIGRSKAFGCGLMLVKRI 273 Score = 69.6 bits (169), Expect = 5e-11, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 12/121 (9%) Query: 8 IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCH--------VLLQSAQM 59 +A+ + +Y +H+ LW LFP + R+FL+ E G L S+ Sbjct: 1 MAKVLADSVYNIHRLLWDLFPGQ--KQRNFLYREEIAREQLGYQGGARGESLYYLVSSSA 58 Query: 60 PVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKE 119 P S + ++T++ E QLQ L F LRANP+ T N K+ D + ++ + L+ E Sbjct: 59 PSSQSPFFAVETRRYEPQLQPDEALRFELRANPVVT--KNGKKHDVVMDAQQTFLKLLCE 116 Query: 120 A 120 Sbjct: 117 E 117 >UniRef50_Q60AD3 CRISPR-associated protein, CT1974 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD3_METCA Length = 239 Score = 70.0 bits (170), Expect = 4e-11, Method: Composition-based stats. Identities = 35/145 (24%), Positives = 57/145 (39%), Gaps = 7/145 (4%) Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK---RLDSKGNIKRCRVPLI 117 ++A I K + G L F + A PI + N+ R + R + P Sbjct: 93 QVCSLADGIAFKPMPESWPNGRKLGFEVMACPISRLGRNEDDVYRRHLRDCDARAQSPDS 152 Query: 118 KEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQT----VCFEGVLTINDAP 173 +E WL R+ G+AA ++D R + + F G L++ D Sbjct: 153 REMVYRRWLTRQFGSAATLDDFSLDGFRYLRLLRKARGTRSGFLAPQALFRGTLSVRDGA 212 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAP 198 L+ +GIG ++ G G+L L P Sbjct: 213 GFGALLARGIGRHRAFGFGMLLLRP 237 >UniRef50_B4UE72 CRISPR-associated CT1974 family protein n=2 Tax=Anaeromyxobacter RepID=B4UE72_ANASK Length = 243 Score = 69.2 bits (168), Expect = 7e-11, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 52/153 (33%), Gaps = 16/153 (10%) Query: 62 STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIK-------RCRV 114 T K + Q + G + F +R P+ + + R + + R Sbjct: 89 GTCDWNAFDEKPMPGQWRTGERVGFEVRCCPVVRMSGDGPRWRAGAEVDAFLARCWRTEG 148 Query: 115 PLIKEAEQIAWLQR---KLGNAARVEDVHPISERPQYFSGDGKSGKIQT------VCFEG 165 + +EA WL + G A V +R D + + F G Sbjct: 149 TVEREAVYREWLADELGRRGGARIVSARVLGHQRAHLVRRDHRPERKAIGGERPEAVFSG 208 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 L + D A L+ +G+G + G G+L L P Sbjct: 209 ELDVTDPEAFAALLARGVGRHRGFGFGMLLLRP 241 >UniRef50_C4ZJY2 CRISPR-associated protein, Cse3 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY2_THASP Length = 238 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 63/202 (31%), Gaps = 20/202 (9%) Query: 17 YQLHQGLWHLFPNRPDAARDFL-----FHVEKRNTPEGCHVLLQSAQMPVSTAV-ATVIK 70 Y LH L F + + H Q A V + Sbjct: 33 YALHTLLAAAFGDLAPKPFRHFGDVRGLLAYSGQGADRIHTAAQMAAPDVHAVLGLERFA 92 Query: 71 TKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVP---LIKEAEQIAWLQ 127 + G L F LR P+ D ++R ++ V L +EA + WLQ Sbjct: 93 ARSFPTDWAAGRRLGFELRVRPVLRTKDGRERDVFLSQAEKRGVAEKELSREAVYLEWLQ 152 Query: 128 RKLGNAARVEDVHPISERPQYFSGDGKSGKIQT-----------VCFEGVLTINDAPALI 176 R+L + + S K + F G LT+ D Sbjct: 153 RELARGDAANVDRAQLDGFRLTSSLRKGSAVVGRRPAQRVTGPDALFSGELTVRDPAGFA 212 Query: 177 DLVQQGIGPAKSMGCGLLSLAP 198 L+ +G+G ++ G G+L L P Sbjct: 213 ALIARGVGRHRAFGFGMLLLRP 234 >UniRef50_A8LMM7 CRISPR-associated protein n=2 Tax=Alphaproteobacteria RepID=A8LMM7_DINSH Length = 263 Score = 66.9 bits (162), Expect = 4e-10, Method: Composition-based stats. Identities = 34/159 (21%), Positives = 55/159 (34%), Gaps = 26/159 (16%) Query: 66 ATVIKTKQVEFQLQVGVPLYFRLRANPIK----TILDNQKRLDSKGN------------- 108 I+TK + G L F +R P+ I R + + Sbjct: 97 PDRIETKPMPELAVPGRRLGFDIRLRPVVRLASAIPAPADRAAGRDHGFKAGAEVDAFLA 156 Query: 109 ---IKRCRVPLIKEAEQ-----IAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQT 160 + R + AWL + G AA +E V + R + + G Sbjct: 157 EALRQPDREAMHTAERSRETVYAAWLADRFGPAAELEQVTLAAFRRSFAARKDGRGCEGP 216 Query: 161 -VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G LT+ DA A + + +G+G K+ G G+L + P Sbjct: 217 DATLHGTLTVGDAKAFAERLHRGVGRHKAYGYGMLLIRP 255 >UniRef50_D0MET7 CRISPR-associated protein, Cse3 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET7_RHOM4 Length = 265 Score = 65.0 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 72/199 (36%), Gaps = 28/199 (14%) Query: 27 FPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ--MPVSTAVAT--VIKTKQVEFQLQVGV 82 F + R V + + P A+ +K + ++ G+ Sbjct: 65 FAVEGENRRGPWVRVLGYADVPWETLQELARGFASPAVYAICGWDRGASKPMPTEIPRGM 124 Query: 83 PLYFRLRANPIKTILDNQKRLDSKGNIKRCRVP-------------LIKEAEQIAWLQRK 129 L F +R P+ + + K + L +EA WL+R+ Sbjct: 125 RLAFSVRVCPVVRKASAGQSPRGRRWQKGQELDVFLDAAWSQPEAVLDREAVYAEWLRRQ 184 Query: 130 LG----NAARVEDVH----PISERPQYFSGDGKSGKIQT---VCFEGVLTINDAPALIDL 178 + ARVE V I + +G +S + V EGVLT+ D+ A + + Sbjct: 185 MARPEKGGARVETVRMTRFSIERMTRRTNGSSRSVTVIQRPDVTLEGVLTVTDSAAFMRM 244 Query: 179 VQQGIGPAKSMGCGLLSLA 197 +++G+G S G G+L L Sbjct: 245 LRRGVGRHTSFGYGMLKLR 263 >UniRef50_C1XXW2 CRISPR associated protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XXW2_9DEIN Length = 116 Score = 63.4 bits (153), Expect = 4e-09, Method: Composition-based stats. Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 18/100 (18%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS++++ AR + YQ+H L H F FL+ E+ TP VL+ Sbjct: 1 MYLSRLLLDPRHKQARTDLANPYQMHATLCHAFAEPEQTPPRFLWRAEEGKTP---TVLV 57 Query: 55 QSAQMPVSTAVA--------TVIKTKQV-EFQLQVGVPLY 85 QS + P + ++K + LQ G L Sbjct: 58 QSIETPNWEKLTQRFPGYFSQRPESKPIPLEHLQSGQVLR 97 >UniRef50_Q0BRF7 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF7_GRABC Length = 247 Score = 62.6 bits (151), Expect = 8e-09, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 68/209 (32%), Gaps = 33/209 (15%) Query: 19 LHQGLWHLFPNRPDAARDFL------FHVEKRNTPEGCHVLLQSA--QMPVSTAVATVIK 70 LH L LF + + + + ++ Q+ P T V ++ + Sbjct: 35 LHHLLTQLFGRQMLQPFRVFTPEQANWSLYAYANQDATTLVEQARFSITPDMTEVISLER 94 Query: 71 TKQVE-FQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQ------- 122 + + G + F +R P++ + K+ D + + R + EA Sbjct: 95 LRSKAMPDAKPGQRIGFDVRIRPVR---RSAKQHDQESEKMQERDAFLAEALHNHADDKT 151 Query: 123 -------------IAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTI 169 WL + A +E + + +G + G + I Sbjct: 152 GMKSANRTREMVYREWLAER-MPWATLETARLAHFQRRRVLRNGNGIEGPDATIHGTMII 210 Query: 170 NDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 D + +++GIG + G G++ L P Sbjct: 211 GDPAQFSEALRKGIGRHSAYGYGMMMLRP 239 >UniRef50_B4V4N6 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V4N6_9ACTO Length = 114 Score = 58.0 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 32/110 (29%), Gaps = 16/110 (14%) Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQR----------------KLGNAARVEDVHPISER 145 R + VP W R ++G A + Sbjct: 3 RAQVLESPSGHPVPHSTPDHVKNWFVRCLQAEDEPATGEGGVARVGATADPAALGVRMLP 62 Query: 146 PQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLS 195 K +I G LT+ D L+ + G+G A++ CGL+ Sbjct: 63 TVSSPAPHKGLRIARAEIRGSLTVTDPETLVTALSNGLGHARAYSCGLIL 112 >UniRef50_D1Y485 Crispr-associated family protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y485_9BACT Length = 254 Score = 55.7 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 42/146 (28%), Gaps = 29/146 (19%) Query: 80 VGVPLYFRLRANPIK----------------TILDNQKRLDSKGNIKRCRVPLIKEAEQI 123 G F + P + + G I R E E Sbjct: 105 KGSRYRFSVYCRPTIRRGKVESDVWLMKNYFACEEARGNGTFDGTIHEFRQLHKGEIEGT 164 Query: 124 --AWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQT-----------VCFEGVLTIN 170 WLQR+ AA + DV R Y + F G L + Sbjct: 165 YRQWLQRRFVPAAELRDVVITGSRSSYLTTRSAKDHCGAPTHSERRSYPETTFVGELCVT 224 Query: 171 DAPALIDLVQQGIGPAKSMGCGLLSL 196 + A LV+ G+G + G G+L L Sbjct: 225 EPQAFERLVRHGVGRHCAFGFGMLLL 250 >UniRef50_C9M9R8 CRISPR-associated protein, CT1974 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R8_9BACT Length = 262 Score = 54.2 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 29/83 (34%), Gaps = 10/83 (12%) Query: 124 AWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC----------FEGVLTINDAP 173 WL + + + + + + G V F G L + + Sbjct: 176 EWLTEQFARSGGARVLFSNIKGSRTIRVARRPGSGAPVLQAKRSTPEVLFRGCLQVENQD 235 Query: 174 ALIDLVQQGIGPAKSMGCGLLSL 196 A ++ +G+G ++ G G+L L Sbjct: 236 AFSQILARGVGRHRAFGFGMLLL 258 >UniRef50_C1YTK2 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YTK2_NOCDA Length = 221 Score = 50.7 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 57/181 (31%), Gaps = 22/181 (12%) Query: 1 MYLSKVIIA--RAWSRDLYQ-LHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSA 57 +YLS++ + R+ D + + Q + P++ L+ +++ S Sbjct: 34 LYLSRINLDPKRSARMDQWAVMGQAVRRAVDPDPESDARVLW-----ARTSPSTLVVSSD 88 Query: 58 QMPVSTAVATVIKT-KQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPL 116 P V + G + + L P ++ KR +P Sbjct: 89 TAPAWGKVPGATSAAIHPMPRYSEGETVRWELITAPTAPRGAGAAGEGARPRGKRAPLP- 147 Query: 117 IKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 E E WL K A + S R ++ G F G + D+ AL Sbjct: 148 --EEEFEGWLDVKFSGA-----LDVTSVRWKHLGGRPARYH-----FTGEAVVRDSEALQ 195 Query: 177 D 177 + Sbjct: 196 E 196 >UniRef50_C6NY67 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NY67_9GAMM Length = 129 Score = 50.7 bits (120), Expect = 3e-05, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 2/86 (2%) Query: 109 IKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLT 168 + R ++K E WL L V E + G+ + F Sbjct: 42 RREHRERVVKPREFRGWLASLLERHGWVLRSIEKVESMEMTIRHGRRLTVVDTVF--TAQ 99 Query: 169 INDAPALIDLVQQGIGPAKSMGCGLL 194 + D + GIG K+ GCG+L Sbjct: 100 VVDRENADQSYRSGIGRYKAFGCGML 125 >UniRef50_D1BYL2 Putative uncharacterized protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BYL2_XYLCX Length = 225 Score = 44.1 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 72/217 (33%), Gaps = 32/217 (14%) Query: 7 IIARAWSRDLYQLHQGLWHL-------FPNRPDAARDFLFHVEKRNT-PEGCHVLLQSAQ 58 +A A D H+ L A L+ V + + +L++S+ Sbjct: 14 TMATALLSDRTTGHRMTMQLWDQIESTVHRGARAHVGCLWRVTGIDPVAQTGTLLVRSST 73 Query: 59 MPVSTAVATVIKTKQVEFQL-QVGVPLYFRLRA---------NPIKTILDNQKRLDSKGN 108 P V I+ +L + G + + P++ + + D Sbjct: 74 APTR-KVPWAIQQDAAVTELPETGATVDLTVTIAAMYTPMYDVPVEWRENLKAGADGTAR 132 Query: 109 I-------KRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTV 161 + +VP+ + Q W KL DV + G + +V Sbjct: 133 PPGEGLSYRSKQVPVPSDRLQ-EWSVTKLKRLGVDGDVVAHAAPVVRIKGALVATAHLSV 191 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G T+ND L V+ GIG +S G GL+++ P Sbjct: 192 T--G-ATVND--GLEQCVRTGIGKGRSYGLGLVAVTP 223 >UniRef50_Q21QB0 Putative uncharacterized protein n=1 Tax=Rhodoferax ferrireducens T118 RepID=Q21QB0_RHOFD Length = 180 Score = 42.6 bits (99), Expect = 0.007, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 41/190 (21%) Query: 14 RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQ 73 DLY++HQ +W ++ F E +G + + +P K K Sbjct: 21 TDLYRIHQLVWQHVARAVESQGRFA-RPEFIYRIDGGMIRV-RGNLP---------KNKT 69 Query: 74 VEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNA 133 + P++ L A G+ VP EA W K+ +A Sbjct: 70 SVSAFRANAPVHLDLAA--------------VWGSEHENAVP---EAHLADWCAEKIESA 112 Query: 134 AR-VEDVHPISERPQYF------SGDGKSGKIQTVCFEGVLTINDAPALIDLV--QQGIG 184 V + + + + + + +V T+ L + +QGIG Sbjct: 113 GFKVASLAVTNFQYRCGVKHATDNRQNIRIPVASVT----TTVTAGDTLACALTWRQGIG 168 Query: 185 PAKSMGCGLL 194 K G G+L Sbjct: 169 RGKRFGLGML 178 >UniRef50_C4XCX4 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4XCX4_KLEPN Length = 188 Score = 40.7 bits (94), Expect = 0.027, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 16 LYQLHQGL---WHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTK 72 +YQ+HQ L R + F + T + +L+++A + K Sbjct: 17 VYQIHQHLDFFMQ---ERKGEKLPYSFKI-FPGTGDDSLLLVRTATA------LELPGEK 66 Query: 73 QVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGN 132 + E L G + F I + K S+G +R P + + Q+A KL Sbjct: 67 KRELILSEGHEIKF------ITLMAIFHKGTKSEGRGRRQFAPSEEASYQLA--LTKLAK 118 Query: 133 AARVEDVHPISERPQYFSGDGKSGKIQTV---CFEGVLTINDAPALIDLVQQGIGPAKSM 189 A +S G +G+ T+ +G I++ + G+GP + Sbjct: 119 AGFKPGQIVVSGPKFVHIDKGNAGRGFTLPVFTVQGTAIISNQQEAEVGIVYGVGPKRVF 178 Query: 190 GCGLLSLA 197 GCG + LA Sbjct: 179 GCGFMHLA 186 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46897 Uncharacterized protein ygcH n=10 Tax=Enterobact... 252 4e-66 UniRef50_D0FPP5 CRISPR-associated protein, Cse3 family n=2 Tax=E... 225 6e-58 UniRef50_B6XT65 Putative uncharacterized protein n=2 Tax=Bifidob... 203 3e-51 UniRef50_C5SD47 CRISPR-associated protein, Cse3 family n=1 Tax=A... 200 2e-50 UniRef50_Q0W583 Predicted CRISPR-associated protein n=1 Tax=uncu... 190 2e-47 UniRef50_D0Y917 CRISPR-associated protein, Cse3 family n=2 Tax=D... 189 5e-47 UniRef50_Q74DC7 CRISPR-associated protein, CT1974 family n=2 Tax... 189 6e-47 UniRef50_Q2JWC6 CRISPR-associated protein, Cse3 family n=2 Tax=C... 184 2e-45 UniRef50_B7KJ27 CRISPR-associated protein, Cse3 family n=1 Tax=C... 183 4e-45 UniRef50_Q12YA7 CRISPR-associated protein n=1 Tax=Methanococcoid... 182 5e-45 UniRef50_D1CAI9 CRISPR-associated protein, Cse3 family n=1 Tax=S... 180 2e-44 UniRef50_D2RB03 CRISPR system CASCADE complex protein CasE n=4 T... 178 7e-44 UniRef50_A6W167 CRISPR-associated protein, Cse3 family n=1 Tax=M... 177 1e-43 UniRef50_D1NTI2 CRISPR-associated protein, Cse3 family n=1 Tax=B... 176 4e-43 UniRef50_B6WQ61 Putative uncharacterized protein n=1 Tax=Desulfo... 176 4e-43 UniRef50_D1CGD5 CRISPR-associated protein, Cse3 family n=1 Tax=T... 175 6e-43 UniRef50_B1LQ79 CRISPR-associated protein, Cse3 family n=54 Tax=... 175 1e-42 UniRef50_D1A5U1 CRISPR-associated protein, Cse3 family n=2 Tax=A... 174 1e-42 UniRef50_B8GIV2 CRISPR-associated protein, Cse3 family n=1 Tax=M... 174 2e-42 UniRef50_Q2FNT6 CRISPR-associated protein, CT1974 n=1 Tax=Methan... 174 2e-42 UniRef50_B4RSK4 CRISPR-associated protein, Cse3 family n=5 Tax=G... 173 3e-42 UniRef50_A5UR13 CRISPR-associated protein, Cse3 family n=1 Tax=R... 172 6e-42 UniRef50_Q67RN9 Putative uncharacterized protein n=1 Tax=Symbiob... 172 6e-42 UniRef50_B8IZA5 CRISPR-associated protein, Cse3 family n=1 Tax=D... 171 1e-41 UniRef50_C6WMQ8 CRISPR-associated protein, Cse3 family n=1 Tax=A... 171 1e-41 UniRef50_B8FDH8 CRISPR-associated protein, Cse3 family n=1 Tax=D... 170 3e-41 UniRef50_C9M2Y7 CRISPR-associated protein n=3 Tax=Lactobacillus ... 170 4e-41 UniRef50_C7QEM3 CRISPR-associated protein, Cse3 family n=9 Tax=A... 169 4e-41 UniRef50_B8IMR1 CRISPR-associated protein, Cse3 family n=3 Tax=A... 168 1e-40 UniRef50_B4TTX1 Crispr-associated protein, Cse3 family n=15 Tax=... 168 1e-40 UniRef50_Q4JWK1 Putative uncharacterized protein n=2 Tax=Coryneb... 168 1e-40 UniRef50_A1SV70 CRISPR-associated protein, Cse3 family n=2 Tax=G... 167 2e-40 UniRef50_C1DSI0 CRISPR-associated protein, CT1974 n=3 Tax=Pseudo... 167 2e-40 UniRef50_A8M405 CRISPR-associated protein, Cse3 family n=3 Tax=A... 167 2e-40 UniRef50_A1ARH5 CRISPR-associated protein, Cse3 family n=3 Tax=B... 166 5e-40 UniRef50_Q1R113 CRISPR-associated protein, CT1974 n=1 Tax=Chromo... 165 6e-40 UniRef50_C7MTM6 CRISPR-associated protein, Cse3 family n=1 Tax=S... 165 7e-40 UniRef50_Q0RTG6 Putative uncharacterized protein n=1 Tax=Frankia... 164 2e-39 UniRef50_A5GBK0 CRISPR-associated protein, Cse3 family n=1 Tax=G... 163 2e-39 UniRef50_D0WFC7 CRISPR-associated protein, Cse3 family n=1 Tax=S... 163 4e-39 UniRef50_C2BET7 CRISPR-associated protein n=1 Tax=Anaerococcus l... 162 7e-39 UniRef50_A0LM55 CRISPR-associated protein, Cse3 family n=1 Tax=S... 161 1e-38 UniRef50_Q47PI8 CRISPR-associated protein, Cse3 family n=1 Tax=T... 161 1e-38 UniRef50_Q03C59 CRISPR-associated protein n=3 Tax=Lactobacillus ... 161 2e-38 UniRef50_Q314I5 CRISPR-associated protein, CT1974 n=2 Tax=Desulf... 160 2e-38 UniRef50_A7BA62 Putative uncharacterized protein n=1 Tax=Actinom... 159 5e-38 UniRef50_Q53WG9 Putative uncharacterized protein TTHB192 n=1 Tax... 159 6e-38 UniRef50_B6B784 CRISPR-associated protein, Cse3 family n=1 Tax=R... 158 7e-38 UniRef50_Q2JH26 Putative uncharacterized protein n=1 Tax=Frankia... 158 1e-37 UniRef50_A9GV72 Putative uncharacterized protein ygcH n=1 Tax=So... 157 2e-37 UniRef50_C0VRW4 CRISPR-associated protein n=1 Tax=Corynebacteriu... 156 3e-37 UniRef50_B1VIX9 CRISPR-associated protein n=6 Tax=Actinomycetale... 154 1e-36 UniRef50_C6HV94 CRISPR-associated protein, Cas3 n=1 Tax=Leptospi... 154 2e-36 UniRef50_C6C417 CRISPR-associated protein, Cse3 family n=4 Tax=E... 152 5e-36 UniRef50_B0LU87 CRISPR-associated protein Cas3 n=2 Tax=Streptomy... 151 8e-36 UniRef50_D1YEE5 CRISPR system CASCADE complex protein CasE n=1 T... 151 9e-36 UniRef50_C7LYW5 CRISPR-associated protein, Cse3 family n=1 Tax=A... 151 1e-35 UniRef50_B5GAA2 Crispr-associated protein n=1 Tax=Streptomyces s... 151 2e-35 UniRef50_Q1J366 CRISPR-associated protein, CT1974 n=2 Tax=Deinoc... 150 2e-35 UniRef50_C4X9I8 Crispr-associated Cse3 family protein n=6 Tax=Ga... 150 4e-35 UniRef50_A9HLC4 CRISPR-associated protein, Cse3 family n=1 Tax=G... 149 5e-35 UniRef50_Q2RY20 CRISPR-associated protein, CT1974 n=1 Tax=Rhodos... 148 7e-35 UniRef50_C7JIG8 CRISPR-associated protein Cse3 n=8 Tax=Acetobact... 148 9e-35 UniRef50_D1A6Q6 CRISPR-associated protein, Cse3 family n=5 Tax=A... 146 3e-34 UniRef50_Q0BSC8 Putative uncharacterized protein n=1 Tax=Granuli... 146 3e-34 UniRef50_Q04QB6 Putative uncharacterized protein n=2 Tax=Leptosp... 145 6e-34 UniRef50_C2GEY9 Putative uncharacterized protein n=1 Tax=Coryneb... 145 8e-34 UniRef50_B3ENH7 CRISPR-associated protein, Cse3 family n=3 Tax=C... 143 2e-33 UniRef50_B0S4B5 Putative uncharacterized protein n=1 Tax=Finegol... 143 4e-33 UniRef50_A8LYZ8 CRISPR-associated protein, Cse3 family n=1 Tax=S... 142 5e-33 UniRef50_B5GY64 Putative uncharacterized protein n=1 Tax=Strepto... 136 3e-31 UniRef50_A8SDR6 Putative uncharacterized protein n=1 Tax=Faecali... 134 2e-30 UniRef50_UPI0001B51C2A CRISPR-associated protein, Cse3 family n=... 134 2e-30 UniRef50_C1XG03 CRISPR-associated protein, Cse3 family n=1 Tax=M... 134 2e-30 UniRef50_B6IWM2 CRISPR-associated protein, CT1974 family n=1 Tax... 132 5e-30 UniRef50_Q47PJ5 CRISPR-associated protein, Cse3 family n=1 Tax=T... 131 2e-29 UniRef50_C2KP48 Putative uncharacterized protein n=1 Tax=Mobilun... 130 2e-29 UniRef50_C7MQD7 CRISPR-associated protein, Cse3 family n=1 Tax=S... 126 5e-28 UniRef50_C5V9N5 CRISPR-associated protein, Cse3 family n=1 Tax=C... 125 7e-28 UniRef50_C7MTL5 CRISPR-associated protein, Cse3 family n=1 Tax=S... 124 2e-27 UniRef50_Q6NEQ5 Putative uncharacterized protein n=1 Tax=Coryneb... 123 5e-27 UniRef50_A8ZZ18 CRISPR-associated protein, Cse3 family n=1 Tax=D... 118 2e-25 UniRef50_C4ZJY2 CRISPR-associated protein, Cse3 family n=1 Tax=T... 115 1e-24 UniRef50_C2CRP4 Putative uncharacterized protein n=1 Tax=Coryneb... 114 2e-24 UniRef50_Q0BRF7 Putative uncharacterized protein n=1 Tax=Granuli... 111 2e-23 UniRef50_Q60AD3 CRISPR-associated protein, CT1974 family n=1 Tax... 109 4e-23 UniRef50_C7RP63 CRISPR-associated protein, Cse3 family n=1 Tax=C... 108 1e-22 UniRef50_Q3A5Z3 CRISPR-associated protein, Cse3 family n=2 Tax=D... 107 3e-22 UniRef50_C0W6T9 Possible CRISPR-associated protein n=1 Tax=Actin... 105 9e-22 UniRef50_D0MET7 CRISPR-associated protein, Cse3 family n=1 Tax=R... 101 2e-20 UniRef50_B4UE72 CRISPR-associated CT1974 family protein n=2 Tax=... 100 6e-20 UniRef50_A8LMM7 CRISPR-associated protein n=2 Tax=Alphaproteobac... 96 7e-19 UniRef50_D1Y485 Crispr-associated family protein n=1 Tax=Pyramid... 93 8e-18 UniRef50_C1YTK2 Putative uncharacterized protein n=1 Tax=Nocardi... 85 1e-15 UniRef50_B2N0R4 Putative uncharacterized protein n=1 Tax=Escheri... 81 3e-14 UniRef50_C9M9R8 CRISPR-associated protein, CT1974 family n=1 Tax... 80 4e-14 UniRef50_C1XXW2 CRISPR associated protein n=1 Tax=Meiothermus si... 76 8e-13 UniRef50_B4V4N6 Putative uncharacterized protein n=1 Tax=Strepto... 71 2e-11 UniRef50_C6NY67 Putative uncharacterized protein n=1 Tax=Acidith... 71 3e-11 Sequences not found previously or not previously below threshold: UniRef50_D1BYL2 Putative uncharacterized protein n=1 Tax=Xylanim... 51 3e-05 UniRef50_Q21QB0 Putative uncharacterized protein n=1 Tax=Rhodofe... 49 1e-04 UniRef50_C5B4T7 Putative uncharacterized protein n=1 Tax=Methylo... 45 0.001 UniRef50_C1MF15 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 43 0.007 UniRef50_C4XCX4 Putative uncharacterized protein n=1 Tax=Klebsie... 42 0.010 UniRef50_Q1YSZ9 ATP-dependent helicase HrpA n=2 Tax=unclassified... 41 0.025 >UniRef50_Q46897 Uncharacterized protein ygcH n=10 Tax=Enterobacteriaceae RepID=YGCH_ECOLI Length = 199 Score = 252 bits (645), Expect = 4e-66, Method: Composition-based stats. Identities = 199/199 (100%), Positives = 199/199 (100%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP Sbjct: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA Sbjct: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ Sbjct: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 Query: 181 QGIGPAKSMGCGLLSLAPL 199 QGIGPAKSMGCGLLSLAPL Sbjct: 181 QGIGPAKSMGCGLLSLAPL 199 >UniRef50_D0FPP5 CRISPR-associated protein, Cse3 family n=2 Tax=Erwinia pyrifoliae RepID=D0FPP5_ERWPY Length = 200 Score = 225 bits (574), Expect = 6e-58, Method: Composition-based stats. Identities = 93/197 (47%), Positives = 125/197 (63%) Query: 2 YLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPV 61 YLS++ + +W++D YQ+H+ LW LFP+RP RDFLF VE R+ G VLLQS Q+P Sbjct: 3 YLSQIDVPWSWAKDPYQMHRALWQLFPDRPSDRRDFLFRVETRHAGSGQRVLLQSPQLPQ 62 Query: 62 STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAE 121 + A A V+ +K + L G L+FRLRANP+KTI D + RL+S+G +K CRVPLI + + Sbjct: 63 NCAAAKVLASKVMHLNLSPGQRLHFRLRANPVKTIKDKRGRLNSRGEVKSCRVPLIDDNQ 122 Query: 122 QIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQ 181 + WL RKL AA + E F+ +GKIQ VCFEG+L + + Sbjct: 123 LMQWLVRKLEGAAVLNSASVSKEPALCFNKQAVAGKIQPVCFEGILQVTSETHFYQCMAD 182 Query: 182 GIGPAKSMGCGLLSLAP 198 GIGPAKSMGCG+LS+A Sbjct: 183 GIGPAKSMGCGMLSIAR 199 >UniRef50_B6XT65 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT65_9BIFI Length = 233 Score = 203 bits (516), Expect = 3e-51, Method: Composition-based stats. Identities = 47/226 (20%), Positives = 82/226 (36%), Gaps = 28/226 (12%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRP---DAARDFLFHVEKRNTPEGCH 51 M++S++ + R Y+LH + FP + L+ ++ Sbjct: 1 MFISRIPLNKARYGARQLIGSPYKLHAAVECAFPPNAVRNNDEGRILWRLDTSVNDNAVW 60 Query: 52 VLLQSAQMPV---------STAV--ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 + + S + P + + ++ G +F+LRANP + +++ Sbjct: 61 LYVVSPEKPDFMHIVEQAGWPTHVEWETKNYEPLLERIAKGQQWHFKLRANPARKAKEDK 120 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV--------EDVHPISERPQYFSGD 152 R I I +Q+ WL + + DV + F Sbjct: 121 GRRHRSDGIVGKVQGHITVDQQLQWLIDRSASHGFTILNDQNDQPDVVVKERHKENFKRA 180 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + T FEG L + DA + QGIG AK GCGLL++AP Sbjct: 181 DATVTLVTAVFEGRLEVTDAELFRKALCQGIGRAKGFGCGLLTIAP 226 >UniRef50_C5SD47 CRISPR-associated protein, Cse3 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD47_CHRVI Length = 209 Score = 200 bits (509), Expect = 2e-50, Method: Composition-based stats. Identities = 81/206 (39%), Positives = 108/206 (52%), Gaps = 8/206 (3%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNR-------PDAARD-FLFHVEKRNTPEGCHV 52 M LS+ I + +R+ Y +H+ +W LFP PD R FLF VE V Sbjct: 1 MILSRAEIPWSEARNPYDMHRAIWRLFPGEAAESRRTPDQPRRGFLFRVEDHRPGRPAQV 60 Query: 53 LLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRC 112 L+QS MP A +I ++++ Q G L F L ANPIKTI D Q + C Sbjct: 61 LIQSRCMPQPEATLNLIGSREINPQPSQGQRLAFILTANPIKTIKDRQADTKPRKTRDTC 120 Query: 113 RVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDA 172 RVPLI E Q +WL ++L + A VE V P YF + GKI FEG+LT+ D Sbjct: 121 RVPLITEETQKSWLIQRLKDVAEVEAVAVTPHPPLYFRKANRGGKILCATFEGLLTVLDP 180 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLAP 198 AL+ L++ G+GPAK+ GCGLL + Sbjct: 181 NALVALLENGLGPAKAFGCGLLLVRR 206 >UniRef50_Q0W583 Predicted CRISPR-associated protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W583_UNCMA Length = 250 Score = 190 bits (483), Expect = 2e-47, Method: Composition-based stats. Identities = 60/248 (24%), Positives = 94/248 (37%), Gaps = 50/248 (20%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN------RPDAARDFLFHVEKRNTPE 48 MYLS++I+ R D ++LH+ + FP+ L ++ Sbjct: 1 MYLSRLILNPRTRAVRRDLADCHELHRTILGGFPDLNGKGGEARETFGVLHRIDIHPRSG 60 Query: 49 GCHVLLQSAQMPVSTAVA-------------TVIKTKQVEFQLQVGVPLYFRLRANPIKT 95 +L+QS + P + + + +++ G FRLRANP K Sbjct: 61 AIVLLVQSQEKPDWSKLPEGYLLENTGTENPACKAIDEQYGKIKAGDVYAFRLRANPTKK 120 Query: 96 ILDNQKRLDSKGNIK--RCRVPLIKEAEQIAWLQRKLGNAARVE----------DVHPIS 143 I ++ G K RVP+ E++QI WL+RK DV Sbjct: 121 IGTSRIEDIKAGKPKNNGRRVPIRNESDQILWLKRKGAAGGFELMSTKRFSELSDVLISE 180 Query: 144 ERPQYF-------------SGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMG 190 E Q + +V FEG L + +A ++ ++ GIG K+ G Sbjct: 181 EGHQKIYTFDTGIKAKVQKNARENRLTFGSVLFEGTLKVTNAEKFLETLKSGIGSGKAYG 240 Query: 191 CGLLSLAP 198 GLLSLAP Sbjct: 241 FGLLSLAP 248 >UniRef50_D0Y917 CRISPR-associated protein, Cse3 family n=2 Tax=Dehalococcoides RepID=D0Y917_9CHLR Length = 209 Score = 189 bits (480), Expect = 5e-47, Method: Composition-based stats. Identities = 63/218 (28%), Positives = 93/218 (42%), Gaps = 30/218 (13%) Query: 1 MYLSKVIIARAWSR------DLYQLHQGLWHLFPNRPDA-ARDFLFHVEKRNTPEGCHVL 53 MYLS + + R Y+LH+ L FP++ D LF ++ G VL Sbjct: 1 MYLSLLRLNPRSKRALTESSRPYELHRSLLKAFPDKADGGPGRVLFRLDMNEQTGGISVL 60 Query: 54 LQSAQMPVSTAV------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKG 107 +QS + P T + T K K+ + L G L FRLRANP K K Sbjct: 61 IQSEKKPFWTNLNGYTEFVTECKCKEFKPALAPGQVLRFRLRANPTKRSKSTGK------ 114 Query: 108 NIKRCRVPLIKEAEQIAWLQRKLGNAAR------VEDVHPISERPQYFSGDGKSGKIQTV 161 R ++K EQ+ WL++K N D ++ G + +V Sbjct: 115 -----REGILKTEEQVEWLRKKGMNGGFEVCEVFTVDEGFAKDKMTDTDNAGHHTNMLSV 169 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+G+L + D+ A ++ GIG AK G GLLS+A + Sbjct: 170 RFDGLLRVTDSDAFQSTLRDGIGSAKGFGFGLLSVASV 207 >UniRef50_Q74DC7 CRISPR-associated protein, CT1974 family n=2 Tax=Desulfuromonadales RepID=Q74DC7_GEOSL Length = 202 Score = 189 bits (480), Expect = 6e-47, Method: Composition-based stats. Identities = 93/202 (46%), Positives = 118/202 (58%), Gaps = 5/202 (2%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLSKV+I R+ Y++H+ LW LFP DA RDFLF VE R+ + VLLQS + P Sbjct: 1 MYLSKVLINGTACRNPYEIHRVLWKLFPEDADAERDFLFRVE-RSGQQSVEVLLQSRREP 59 Query: 61 VS--TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 + ++ +K LQ L F L ANPIKTI D RL+S IK+CRVPLI+ Sbjct: 60 TMAASREVLLMGSKPYLLSLQQDQQLRFMLVANPIKTINDESARLNSANEIKKCRVPLIR 119 Query: 119 EAEQIAWLQRKLGNAARVEDVHPISERPQYFS--GDGKSGKIQTVCFEGVLTINDAPALI 176 E + AWL+RKL A +E V F + + GK+Q V F GVL++ D LI Sbjct: 120 EEDLRAWLKRKLEGVAVIEAVEVEKRPAMNFRKAREKRVGKVQAVSFHGVLSVTDPVGLI 179 Query: 177 DLVQQGIGPAKSMGCGLLSLAP 198 L+ GIGPAK+ GCGLLSLA Sbjct: 180 SLINTGIGPAKAFGCGLLSLAR 201 >UniRef50_Q2JWC6 CRISPR-associated protein, Cse3 family n=2 Tax=Chroococcales RepID=Q2JWC6_SYNJA Length = 210 Score = 184 bits (467), Expect = 2e-45, Method: Composition-based stats. Identities = 63/216 (29%), Positives = 97/216 (44%), Gaps = 28/216 (12%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS++I+ + + + LHQ + H FP++P +H+ R P+GC +L+ Sbjct: 1 MYLSRLILNERQLLVQRELSNAHALHQRIMHGFPDQPTKTPRSDWHILYRQEPDGCTILV 60 Query: 55 QSAQMPVSTAVAT-----VIKTKQVEFQ---LQVGVPLYFRLRANPIKTILDNQKRLDSK 106 QS P + + + K + + L G FRLRANP K + Sbjct: 61 QSVIQPDWSRLPQGYVQRDPEVKIFDLRPEVLSKGRCFQFRLRANPSK-----------R 109 Query: 107 GNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGK---SGKIQTVCF 163 R V + +Q+ WL+R+ PQ F +I TV F Sbjct: 110 DKKTRKIVGFFRSEDQLEWLRRQGFQHGFEVLAAEGIPSPQIFGIKKGLSGPVRIHTVLF 169 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +G+L + D+ A + VQQGIG +S GCGLLSL+ + Sbjct: 170 QGILRVTDSEAFVKAVQQGIGRGRSYGCGLLSLSKI 205 >UniRef50_B7KJ27 CRISPR-associated protein, Cse3 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ27_CYAP7 Length = 219 Score = 183 bits (464), Expect = 4e-45, Method: Composition-based stats. Identities = 59/222 (26%), Positives = 92/222 (41%), Gaps = 29/222 (13%) Query: 1 MYLSKVIIARA------WSRDLYQLHQGLWHLFPNRP----DAARDFLFHVEKRNTPEGC 50 MYLSK+ + D ++LHQ + FPN + L+ +E G Sbjct: 1 MYLSKIELNIRSSAVSTDLSDCHKLHQRVMQGFPNENNPEYRSEAKILYRLE------GS 54 Query: 51 HVLLQSAQMPVSTAVATVIKTKQV----EFQLQVGVPLYFRLRANPIKTILDNQ------ 100 + +QS P T + +++ +++ G LYFRL NP++ + Sbjct: 55 ILFVQSKNKPDWTQLPKGYTAEEITEMDYEKIKKGDYLYFRLLGNPVQQTTKLRTDDSGN 114 Query: 101 ---KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGK 157 K K K R L + QI WL L E S + K Sbjct: 115 IIMKNNSEKPQKKTVRRFLSNKDAQIQWLMNHLKGTILQECYVSASSDIRGQCKQSKRIF 174 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 ++TV F+GVL + D+ + I +++GIG +S GCGLLS+A Sbjct: 175 LKTVLFDGVLQVTDSESFIKALREGIGRGRSYGCGLLSIAKF 216 >UniRef50_Q12YA7 CRISPR-associated protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YA7_METBU Length = 224 Score = 182 bits (463), Expect = 5e-45, Method: Composition-based stats. Identities = 55/209 (26%), Positives = 81/209 (38%), Gaps = 21/209 (10%) Query: 10 RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVI 69 R + + +H+ +W LFP D R F++ + + +++ S P+ I Sbjct: 17 RGNMGNEHNVHRLVWSLFPVNEDDKRKFIYRQDSMGSLPSFYLV--SENEPIDELNVWDI 74 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKGNIK-----------RCRVP 115 KQ + L+ G L F LRANPI + D Q R D + K +P Sbjct: 75 DVKQYDPILKSGQKLAFSLRANPIVSKRDENDKQHRHDVVMDEKFRLKMENGGDIEPNMP 134 Query: 116 LIKEAEQIAWLQRKLGNAAR-----VEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTIN 170 I + + WL RK V + TV G LT+ Sbjct: 135 DIVQRKGSEWLLRKGDMNGFSINAEQIRVDAYQNHKLFKPKGKHHVSFSTVDIVGTLTVT 194 Query: 171 DAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 D D + +GIGPAK GCG+L + PL Sbjct: 195 DPDIFRDALFKGIGPAKGFGCGMLLVRPL 223 >UniRef50_D1CAI9 CRISPR-associated protein, Cse3 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAI9_SPHTD Length = 257 Score = 180 bits (458), Expect = 2e-44, Method: Composition-based stats. Identities = 73/255 (28%), Positives = 97/255 (38%), Gaps = 59/255 (23%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN-----RPDAARDFLFHVEKRNTPEG 49 MYLS++I+ R D QLH+ + FPN A L+ +E Sbjct: 1 MYLSRLILNPRSREVRRDLADCQQLHRSVMSGFPNLAAPGDARARLGILYRLETHPRTGM 60 Query: 50 CHVLLQSAQMPVSTAVATV----------IKTKQVEF---QLQVGVPLYFRLRANPIKTI 96 +L+QSA P + + K V L G+ L FRLRANP K I Sbjct: 61 PTLLVQSAIEPTWSQLPADYLLNTAGVPNPDCKPVGPIYDALDAGMVLTFRLRANPTKRI 120 Query: 97 LDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--------------------- 135 + + N RV L EA+Q+AWL+RK Sbjct: 121 KPDTDP--GRSNRLGKRVELRTEADQLAWLRRKGEQCGFEVLSVRATTEHEAYRWERAAA 178 Query: 136 -----------VEDVHPISERPQYFSG-DGKSGKIQTVCFEGVLTINDAPALIDLVQQGI 183 V DV + Y DG+ V F+G+L I DA + QGI Sbjct: 179 IFGLEADKPEPVPDVRAVRGSKVYGRRADGERMTFAAVTFDGLLRIVDADRFRAALVQGI 238 Query: 184 GPAKSMGCGLLSLAP 198 G AK+ G GLLS+AP Sbjct: 239 GSAKAYGFGLLSIAP 253 >UniRef50_D2RB03 CRISPR system CASCADE complex protein CasE n=4 Tax=Bacteria RepID=D2RB03_GARVA Length = 215 Score = 178 bits (453), Expect = 7e-44, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 92/217 (42%), Gaps = 25/217 (11%) Query: 2 YLSKVIIAR------AWSRDLYQLHQGLWHLFPNR--PDAARDFLFHVEKRNTPEGCHVL 53 YLS+V I + + H + FP+ L+ V+ + ++L Sbjct: 3 YLSRVEIDYKKPSSLRDLKSVGAFHNWVEQSFPDEWENHERSRKLWRVDVLHGKH--YLL 60 Query: 54 LQSAQMPV--------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDS 105 + S P + A+ + L G+ + FR+ NP+ +I DN + + Sbjct: 61 IVSDSKPDLQRLEMYGVSGTASSKTYDKFLGSLMNGMRMQFRVTLNPVVSISDNAETHTA 120 Query: 106 KGNIKRCRVPLIKEAEQIAWL---QRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC 162 +G VP + +Q+ +L +KLG + + + F+ K ++ Sbjct: 121 RGR----VVPHVTYDQQMNFLLNRAQKLGFSLNENEFAIVERGYSLFTKSEKPIRLSKAV 176 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 ++G+LTI+DA + + +GIG K+ G G++++ PL Sbjct: 177 YQGILTISDADIMRKTLLEGIGKKKAYGFGMMTVIPL 213 >UniRef50_A6W167 CRISPR-associated protein, Cse3 family n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W167_MARMS Length = 224 Score = 177 bits (450), Expect = 1e-43, Method: Composition-based stats. Identities = 65/227 (28%), Positives = 95/227 (41%), Gaps = 31/227 (13%) Query: 1 MYLSKVII-----ARAWS-----RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLSKV AR + +Y HQ LW LF + R FLF E+ Sbjct: 1 MYLSKVSFQASQQARQLLLGFGGKGVYSTHQMLWQLF--TEEDERSFLFREEQSADGSKA 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ---KRLDSKG 107 + S+ P S +KTK +LQ G L F LRANP D + KR D Sbjct: 59 FF-VLSSVKPESDESTFNVKTKTFMPKLQSGQRLGFTLRANPTVCTTDEKGKSKRHDVMM 117 Query: 108 NIKRC----------RVPLIKEAEQIAWLQ--RKLGNAARVEDVHPISERPQYFSGDGKS 155 + K+ + LI E W+ ++L N D P + D Sbjct: 118 HAKKAAKESGVSDSEEIRLIMEQAAQEWIANPKRLENWGFTLDFLPEVQTYMQHRSDKNR 177 Query: 156 ---GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + +V ++GVLT+ D ++ +++G G AKS+GCGL+ + + Sbjct: 178 EDKIRFSSVDYQGVLTVQDPEKFLEQLEKGFGRAKSLGCGLMLIKSI 224 >UniRef50_D1NTI2 CRISPR-associated protein, Cse3 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI2_9BIFI Length = 236 Score = 176 bits (447), Expect = 4e-43, Method: Composition-based stats. Identities = 47/228 (20%), Positives = 77/228 (33%), Gaps = 37/228 (16%) Query: 9 ARAWSRDLYQLHQGLWHLFPN---RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPV---- 61 AR + Y+LH + FP R L+ ++ + + S P Sbjct: 6 ARQLAASPYKLHAAVEASFPPHAPRATDEGRILWRLDHNRQDHSVWLYVVSPSQPDLLHI 65 Query: 62 -------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK---GNIKR 111 A +L G ++R+ ANP++ + +S +K Sbjct: 66 VEQAGWPGYAEWETKDYTPFLDRLAQGQQWHYRVCANPVRNAATDLNLHNSLATFDKMKG 125 Query: 112 CRVPLIKEAEQIAWLQRKLGNAAR--------------------VEDVHPISERPQYFSG 151 R + +QI W +R+ + V I + F Sbjct: 126 SRQAYVTVRQQIDWFERRAAANGFSLPERDPVSGFDEQVKDPLLLSSVRVIDRQRHKFRD 185 Query: 152 DGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + T FEG L + D +L + GIG AK GCGL++LAP+ Sbjct: 186 RKNQVTLSTAVFEGTLQVEDPQSLRHALCFGIGKAKGFGCGLMTLAPI 233 >UniRef50_B6WQ61 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ61_9DELT Length = 206 Score = 176 bits (447), Expect = 4e-43, Method: Composition-based stats. Identities = 59/203 (29%), Positives = 79/203 (38%), Gaps = 8/203 (3%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 M L + +AR RD Y HQ LW FP PDA RDFL + P+GC + L + P Sbjct: 7 MLLDRQALARCRFRDSYAWHQALWECFPAMPDAGRDFLTRTDWL--PQGCRIYLLCRREP 64 Query: 61 VSTAV--ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 V K + F L ANP + + + R+ L+ Sbjct: 65 VRPDWCPPGSWAVKNIAPAFLQHGTYAFDLLANPTRKVA--AFDAGGQRTRNGKRLALLD 122 Query: 119 EAEQIAWLQRKLGNAARVED--VHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 E + AW++ K G D + F +G V F G L + D I Sbjct: 123 ETSRQAWMEAKAGQHGFCLDGPLALDDAGASIFWRRACAGTHIGVRFRGRLQVTDRERFI 182 Query: 177 DLVQQGIGPAKSMGCGLLSLAPL 199 GIG AK+ G G+L L PL Sbjct: 183 HAFYHGIGSAKAFGFGMLLLQPL 205 >UniRef50_D1CGD5 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD5_THET1 Length = 240 Score = 175 bits (445), Expect = 6e-43, Method: Composition-based stats. Identities = 53/232 (22%), Positives = 88/232 (37%), Gaps = 38/232 (16%) Query: 1 MYLSKVIIARA------WSRDLYQLHQGLWHLFP-----NRPDAARDFLFHVEKRNTPEG 49 +YLS++ + + + LH + FP + P A L+ +E Sbjct: 2 LYLSRLRLQPRHRDVQKDLSNCHALHSRILSAFPLLPTPSSPRAEMGVLYRLE--EAGRF 59 Query: 50 CHVLLQSAQMPVSTAVAT--------VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 V++QS P + + + ++ G FRLRANP + I Sbjct: 60 PTVIVQSRLEPDWSRLPEGYLAFPAECKRVDDKYSRINQGDRFIFRLRANPTRRIARG-- 117 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPIS---------------ERP 146 + + RV L +E +QI WL RK + Sbjct: 118 NTEQAERWRGKRVELQREEDQIDWLIRKGDQHGFKLLSITVRQQAVPNLRVLPNNKTHGW 177 Query: 147 QYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + +G + +V FEGVL + D + + ++QG+G K+ G GLLS+AP Sbjct: 178 RRDAGGNRRLTFGSVQFEGVLEVTDRESFMQALEQGVGSGKAFGFGLLSIAP 229 >UniRef50_B1LQ79 CRISPR-associated protein, Cse3 family n=54 Tax=Enterobacteriaceae RepID=B1LQ79_ECOSM Length = 216 Score = 175 bits (443), Expect = 1e-42, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 85/217 (39%), Gaps = 24/217 (11%) Query: 1 MYLSKVIIARAWS----------RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLS++ + R Y +HQ LW LFP R FL+ E+ Sbjct: 1 MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPG--GKERQFLYRREELQGA--F 56 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK------RLD 104 + S + P + I+ + +L G L F LRANP + + Sbjct: 57 RFFVLSQERPAESET-FTIECRSFAPELHTGQSLCFNLRANPTVCKAGKRHDLLMEAKRQ 115 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR-VEDVHPISERPQYFSGDGKS--GKIQTV 161 +G + V L ++ + WL + + + D + R Q + + +V Sbjct: 116 VRGQAEGRNVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSV 175 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + G+LT+ D + + G G +++ GCGL+ + P Sbjct: 176 DYTGMLTVTDPGLFLQRLCLGYGKSRAFGCGLMLIKP 212 >UniRef50_D1A5U1 CRISPR-associated protein, Cse3 family n=2 Tax=Actinomycetales RepID=D1A5U1_THECD Length = 229 Score = 174 bits (442), Expect = 1e-42, Method: Composition-based stats. Identities = 50/235 (21%), Positives = 80/235 (34%), Gaps = 46/235 (19%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLF---PNRPDAARDFLFHVEKRNTPEGCH 51 MYL++ R LH + F P + D L+ +++ E + Sbjct: 1 MYLTRFRFNTARVTARRILSSPQMLHAAVMSSFATPPVQEDDGPRVLWRIDRNGKSE-TY 59 Query: 52 VLLQSAQMPVSTAV-----------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 + + S P T + +L G FRL ANP+ T N Sbjct: 60 LYIVSPLKPDLTHLVEQAGWPTTGTWQTYDYGPFLSRLAKGEEWAFRLTANPVHTARRN- 118 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR----------------VEDVHPISE 144 + Q+ WL ++ A V ++ Sbjct: 119 ------DTEPTKITAHVGMRHQMQWLLQRQEAAGFRVVEKPRERQLIPGVDVHELVIRER 172 Query: 145 RPQYFSGDG--KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 R F G + + TV F+G L + D AL + +G+G AK+ GCGL++LA Sbjct: 173 RHLEFRKRGNSRPVTLVTVTFDGRLEVTDPDALRRTLTRGLGRAKAYGCGLMTLA 227 >UniRef50_B8GIV2 CRISPR-associated protein, Cse3 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV2_METPE Length = 225 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 62/226 (27%), Positives = 88/226 (38%), Gaps = 30/226 (13%) Query: 1 MYLSKVIIA---------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCH 51 M +SK+ + YQ H +W LF + P+ RDFLF E Sbjct: 1 MQISKIQLNADASDHPAFWEHVGGAYQAHSLIWDLFSDGPERERDFLFRQEVHQGMPVFW 60 Query: 52 VLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKGN 108 + S ++P IK+K L+ G+ L F LRANPI++ D Q R D + Sbjct: 61 TV--SERVPSDRNETWNIKSKPYAPILRQGMHLSFVLRANPIRSRRDDLGKQHRHDVVMD 118 Query: 109 IK--------RCRVPLIKEAEQIA---WLQRKLGNAARVE---DVHPISERPQYFSGDGK 154 +K + P + Q A WL + V F K Sbjct: 119 MKTALKDSKPGDQWPAEDQIIQEAGLVWLANQGNAKGFSLQDGAVRVDGYTQHRFVKPKK 178 Query: 155 S--GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 +I T+ F G+LT+ D + GIGPAK GCGL+ + P Sbjct: 179 KQMVQISTLDFTGLLTVTDPERFTTALFNGIGPAKGFGCGLMMVRP 224 >UniRef50_Q2FNT6 CRISPR-associated protein, CT1974 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNT6_METHJ Length = 228 Score = 174 bits (441), Expect = 2e-42, Method: Composition-based stats. Identities = 61/227 (26%), Positives = 94/227 (41%), Gaps = 31/227 (13%) Query: 1 MYLSKVIIARAWS---------RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCH 51 M+ SK+ + R + YQ+H+ +W LF + PD RDFL+ E T Sbjct: 1 MFFSKMTLDREAAISGRFRDLVTGPYQVHEVIWDLFADHPDRKRDFLYRAEL--TGRDPV 58 Query: 52 VLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN-------QKRLD 104 V L SA+ PV I +K LQ L FR+R NP+ T + + R D Sbjct: 59 VYLLSARKPVYEGNVWNILSKPFHPVLQKDDLLNFRIRVNPVVTKTEPDPDRKRIRHRHD 118 Query: 105 SKGNIKRCR--------VPLIKEAEQIAWLQRKLGNAAR---VEDVHPISERPQYFS--G 151 + KR + + + E + WL+++ + V R FS Sbjct: 119 VIMDAKRRLNEANSSFSMSDLVQQESVRWLRQRSEKGGFSLYEDRVIAGGYRKMQFSQGR 178 Query: 152 DGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + I V +GVL + D + ++ G+GPAK GCGL+ + Sbjct: 179 KKNTISISVVDCDGVLRVTDPDLFLQMICNGLGPAKGFGCGLMMVKR 225 >UniRef50_B4RSK4 CRISPR-associated protein, Cse3 family n=5 Tax=Gammaproteobacteria RepID=B4RSK4_ALTMD Length = 222 Score = 173 bits (439), Expect = 3e-42, Method: Composition-based stats. Identities = 58/224 (25%), Positives = 89/224 (39%), Gaps = 29/224 (12%) Query: 1 MYLSKVI----------IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 M+LSKV +A+ +Y HQ +W LF N R FL+ E T Sbjct: 1 MFLSKVTMVSSPQTAQELAKLQRNGVYASHQLIWQLFSNV--TERSFLYREEMGITG-MP 57 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKG 107 + S P ++ TK E +L+ G L F+LR NP + Q+R D Sbjct: 58 EFYVLSKTEPQASLPIFSCVTKVFEPKLKKGQRLSFKLRVNPTVCVKGEDGKQRRHDVMM 117 Query: 108 NIK---RCRVP------LIKEAEQIAWL--QRKLGNAARVEDVHPISERP--QYFSGDGK 154 K + +P + E I WL +++L D P + Sbjct: 118 QAKYNVKDELPDAQTLKMHMEQAAINWLNNEKRLDEWGITLDFQPSIDGYTQHKVQKKRH 177 Query: 155 SGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + +V ++G+LT+ D I+ +G G AK MGCGL+ + Sbjct: 178 QIQFSSVDYQGMLTVQDPLKFINQYAKGFGRAKGMGCGLMMIKR 221 >UniRef50_A5UR13 CRISPR-associated protein, Cse3 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR13_ROSS1 Length = 238 Score = 172 bits (437), Expect = 6e-42, Method: Composition-based stats. Identities = 56/238 (23%), Positives = 90/238 (37%), Gaps = 43/238 (18%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPD-----AARDFLFHVEKRNT-PE 48 MYLS++I+ R D+Y+LH+ + FP PD A L+ +E + P Sbjct: 1 MYLSRLILDVRQPRVRRDLSDVYRLHRTILSAFPQAPDNVPARAHFGILYRIEPISDMPW 60 Query: 49 GCHVLLQSAQMPVSTAVATV--------------IKTKQVEFQLQVGVPLYFRLRANPIK 94 +L+QS + P + + + +++ + FRL ANP + Sbjct: 61 LVRLLVQSREQPDWSHIPDRMFGPALDERGNPALRRIDDEYARIRSDMQFLFRLLANPTR 120 Query: 95 TILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGK 154 + + D + RV L++E EQIAWL K ++ + Sbjct: 121 RLSNRSSERD--DRLLGKRVALLREEEQIAWLAHKGEQHGFRLLSTSVNPDVPAVQAAKQ 178 Query: 155 ---------------SGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 V F G L + DA ++ GIG K+ G GLLS+A Sbjct: 179 ADEHGWRKATQTQTMHLTFGAVLFTGYLKVTDADRFRTALEHGIGSGKAFGFGLLSIA 236 >UniRef50_Q67RN9 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RN9_SYMTH Length = 224 Score = 172 bits (436), Expect = 6e-42, Method: Composition-based stats. Identities = 56/225 (24%), Positives = 86/225 (38%), Gaps = 29/225 (12%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN------RPDAARDFLFHVEKRNTPE 48 MYLS + + + RD+ LHQ + FP+ A L+ +E Sbjct: 1 MYLSLLRLNPASAAVQRDLRDVQALHQRVMSAFPDVLDPEVEARAYFGVLYRLELNRYSG 60 Query: 49 GCHVLLQSAQMPVSTAVAT-------------VIKTKQVEFQLQVGVPLYFRLRANPIKT 95 + +QS P + V + + +++ G L FRLRANP + Sbjct: 61 QVLLYVQSRVEPDWGRLPAGYLTPADGLPNPAVKRVDEAYARIREGRVLRFRLRANPTRK 120 Query: 96 ILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--VEDVHPISERPQYFSGDG 153 I RVPL Q+ W++RK +E + + Sbjct: 121 IDTKSGPNG--EKRNGRRVPLSGLDAQLGWMERKAREHGFELLEATVAAAGASERVRSYT 178 Query: 154 KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 Q V FEG L + DA + +++GIGP K+ G GLLS+ P Sbjct: 179 TGRTFQGVLFEGRLVVRDAGRFREALERGIGPGKAYGYGLLSVGP 223 >UniRef50_B8IZA5 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA5_DESDA Length = 207 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 58/208 (27%), Positives = 87/208 (41%), Gaps = 14/208 (6%) Query: 2 YLSKVI-----IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQS 56 ++++ + + R D Y H+ +W FPNRPDA+RDFLF ++ P G V + S Sbjct: 3 WMTRFMVELPALHRNRLSDCYAWHKAIWQCFPNRPDASRDFLFRLD--EVPAGTLVHVLS 60 Query: 57 AQMPVSTAVATV--IKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 P T + K V F + NP + + D + R Sbjct: 61 PHEPQRPDFCTEDHWQIKAVPPCFLKYNCYRFDVICNPGRKV--EAFTSDGQRKKNSRRE 118 Query: 115 PLIKEAEQIAWLQRKLGNAAR--VEDVHPISERPQY-FSGDGKSGKIQTVCFEGVLTIND 171 +IK EQ AWL RK + + I +Y F D +SG V F GVL + Sbjct: 119 AIIKPDEQNAWLDRKAAANGFEVLPGMRSIDPSTRYSFRKDHRSGTHIGVRFSGVLRVTQ 178 Query: 172 APALIDLVQQGIGPAKSMGCGLLSLAPL 199 +G+G A+ G G+L L+P+ Sbjct: 179 RDEFCRAFHKGLGSARGFGFGMLLLSPV 206 >UniRef50_C6WMQ8 CRISPR-associated protein, Cse3 family n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WMQ8_ACTMD Length = 230 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 56/226 (24%), Positives = 88/226 (38%), Gaps = 36/226 (15%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPD-----AARDFLFHVEKRNTPEG 49 M+L+K+ + R +L+++H+ + +P D L+ ++ TP G Sbjct: 1 MFLTKLTVDVRSREFRRDLANLHEMHRTVMSGYPRVEDGSPARQTHGVLWRLD--ATPAG 58 Query: 50 CHVLLQSAQMPVSTAVATV--------IKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 +QS P T + + ++ G L FRL AN K Sbjct: 59 YTQYVQSLTRPDWTGLPETLLTSPAEVRSLDPLLDAIEPGRVLAFRLLANATK------D 112 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--------VEDVHPISERPQYFSGD- 152 + ++ + RV Q++WL RK V DV S Sbjct: 113 SVPAEPGGRGLRVAHRTPEAQVSWLARKGQRHGFALRDRPDGVPDVTLWSAPRMTGRKKA 172 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G+ + V F+G L + DA L + V GIG AK+ GCG+LSLA Sbjct: 173 GRPITVDAVRFDGHLVVTDADELREAVGSGIGRAKAYGCGMLSLAR 218 >UniRef50_B8FDH8 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDH8_DESAA Length = 199 Score = 170 bits (431), Expect = 3e-41, Method: Composition-based stats. Identities = 52/203 (25%), Positives = 80/203 (39%), Gaps = 17/203 (8%) Query: 1 MY-LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQM 59 MY L + +D+Y +H+ +++LFP RDFLF +K G +L+ S + Sbjct: 5 MYTLDRKDCKALGLKDVYGVHKAVYNLFPENNGQGRDFLF-ADKGGDWNGRKILILSHRE 63 Query: 60 PVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKE 119 P+ I ++V F + NP++ + N R +P+ Sbjct: 64 PIQPRH-GAIDCREVPAAFLDWDYYGFEVVLNPVR-----------RDNASRKLIPVRGR 111 Query: 120 AEQIAWLQRK---LGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 W +K LG + + ++ DG T F G L + D A Sbjct: 112 ENLHEWFLKKAPGLGFEVEPHSLQVSRMGVEAYAKDGTMRTHNTATFIGKLRVIDPNAFK 171 Query: 177 DLVQQGIGPAKSMGCGLLSLAPL 199 QGIG AK+ G GLL L PL Sbjct: 172 KSFAQGIGRAKAFGFGLLQLVPL 194 >UniRef50_C9M2Y7 CRISPR-associated protein n=3 Tax=Lactobacillus RepID=C9M2Y7_LACHE Length = 217 Score = 170 bits (430), Expect = 4e-41, Method: Composition-based stats. Identities = 56/224 (25%), Positives = 79/224 (35%), Gaps = 34/224 (15%) Query: 1 MYLSKVIIA---RAWSRDLYQL---HQGLWHLFPNRPDAARD--FLFHVEKRNTPEGCHV 52 MYLS+V I R DL L H + FP+ L+ +++ ++ Sbjct: 1 MYLSRVEIDTNDRQKISDLTHLGSYHNWVEQSFPDEVKQGTRLRHLWRIDEF--SNKKYL 58 Query: 53 LLQSAQMPV--------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 LL S P A + QL G FRL ANP I D++ Sbjct: 59 LLVSKNKPKLNNLERYGVPYTAATKDYDRFLNQLVEGKKYRFRLTANPTYRITDSKSG-- 116 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--------VEDVHPISERPQYFSGDGKS- 155 K VP I +Q WL + V ++ G Sbjct: 117 -----KSRVVPHITILQQTNWLLERTKKHGFEIVRDSEEVYKLNISERDWPRLRRKGNHL 171 Query: 156 GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 K+ V F+G+L I D + GIG K+ G GLL++ PL Sbjct: 172 IKLSRVTFDGILQITDLSKFKLALINGIGREKAYGMGLLTVIPL 215 >UniRef50_C7QEM3 CRISPR-associated protein, Cse3 family n=9 Tax=Actinomycetales RepID=C7QEM3_CATAD Length = 236 Score = 169 bits (429), Expect = 4e-41, Method: Composition-based stats. Identities = 50/243 (20%), Positives = 83/243 (34%), Gaps = 54/243 (22%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRP----DAARDFLFHVEKRN----- 45 MYL++ R LH + F N P + L+ +++ Sbjct: 1 MYLTRFRFNTARTGARRLLTSPQILHAAVMQSFANVPALPDGNSPRVLWRLDRNANNQVL 60 Query: 46 -------TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD 98 P+ H++ Q+ +T +L G FRL ANP+ +I Sbjct: 61 LYIVSPDRPDLTHIVEQAGWP--TTGSWDSFAYAPFLDKLTAGDIWTFRLTANPVHSIRT 118 Query: 99 NQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVED----------------VHPI 142 + R I Q+ WL ++ A V Sbjct: 119 -------RDGEPTKRTAHITVRHQLGWLLKQQERAGFTICEQPKELPRPTDMDEYQVVVH 171 Query: 143 SERPQYFSGDG-------KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLS 195 R F+ + +I TV ++G L I+D + ++ G+G AK+ GCGL++ Sbjct: 172 DRRSLDFTKKDPARSSKINNVQILTVTYDGRLRIDDPDKVRAVLTTGLGKAKAYGCGLMT 231 Query: 196 LAP 198 LAP Sbjct: 232 LAP 234 >UniRef50_B8IMR1 CRISPR-associated protein, Cse3 family n=3 Tax=Alphaproteobacteria RepID=B8IMR1_METNO Length = 243 Score = 168 bits (426), Expect = 1e-40, Method: Composition-based stats. Identities = 57/222 (25%), Positives = 90/222 (40%), Gaps = 26/222 (11%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+ +++ + + H+ LW LF + PD ARDFL+ + T + L+ S + P Sbjct: 19 LAHLLLPKTEGARVAAGHRLLWSLFADSPDRARDFLWCEDAGGTWQRATFLILSRRRPQD 78 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPI-----KTILDNQKRLDSKGNIKRCRVPLI 117 T I+TK L G L FRLRA+P + KR+D R P + Sbjct: 79 TRGLFEIETKPFAPVLAPGQRLGFRLRASPAASDTPTAVGRRGKRIDPVARALRDLPPEV 138 Query: 118 K--------EAEQIAWLQRKLGNAARV------------EDVHPISERPQYFSGDG-KSG 156 + + WL R+ A + ER +G Sbjct: 139 RAERRHSVLQEVGAGWLARQGARAGFTLCDAEAPSGTRQPCLSVDGERWNVLPREGAAPV 198 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + ++ FEGVL + D + + +G G AK+ GCGL+ + P Sbjct: 199 RFSSLDFEGVLRVEDPSLFLAALAEGFGRAKAFGCGLMLIRP 240 >UniRef50_B4TTX1 Crispr-associated protein, Cse3 family n=15 Tax=Enterobacteriaceae RepID=B4TTX1_SALSV Length = 235 Score = 168 bits (425), Expect = 1e-40, Method: Composition-based stats. Identities = 63/241 (26%), Positives = 92/241 (38%), Gaps = 52/241 (21%) Query: 1 MYLSKV----------IIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 MYLS++ ++A+ S Y HQ LW LFP + R FLF E Sbjct: 1 MYLSRIQLRFNNLRPEMLAKWNSARPYASHQWLWQLFPEQ--ELRQFLFREEAHGG---- 54 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIK 110 + SA P+S +I+TK QL G+ L F+LRANP+ T N KR D N K Sbjct: 55 -FFMLSAIPPLSQHSLFLIETKPFNPQLTNGLELDFQLRANPVIT--RNGKRSDVMMNAK 111 Query: 111 ---------RCRVPLIKEAEQIAWLQRKLGNAARVE------------------------ 137 + R +++ AWL+++ Sbjct: 112 HQAKANGVEKERWWELQQQAAQAWLEQQGQQHGFRLIAPEPDDFAMWAGDEYSELQAHCG 171 Query: 138 DVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 V + K +V F G L I DA + G+G +K++GCG+L + Sbjct: 172 CVQAYQQYRFVRKDQQKPITFSSVDFSGALCITDAALFKQALFSGLGKSKALGCGMLMVK 231 Query: 198 P 198 Sbjct: 232 R 232 >UniRef50_Q4JWK1 Putative uncharacterized protein n=2 Tax=Corynebacterium jeikeium RepID=Q4JWK1_CORJK Length = 224 Score = 168 bits (425), Expect = 1e-40, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 76/217 (35%), Gaps = 26/217 (11%) Query: 3 LSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVE---KRNTPEGCHVL 53 +++ + R + +H + FP L+ + + +++ Sbjct: 4 FTRLHLNPSSRQARKLLGNPQAMHAAVLSCFPKEVSEKERILWRHDGKVRGADEHFVYIV 63 Query: 54 LQSAQMPVSTAVATVIKTKQ-------VEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK 106 + P A T ++ + L G ++ + NP+ ++ Sbjct: 64 GPDSCDPTKIAEQTGSESDPQKASYNRLLEALADGQQWHYEVVLNPVAAKKAPGSPRGTR 123 Query: 107 GNIKRCRVPLIKEAEQIAWLQRKLGNAARVE-DVHPISERPQYFSG-----DGKSGKIQT 160 G L+ EA Q+ W K + + + + FS G+ I T Sbjct: 124 G----KLTALVGEAAQLEWFNTKAKSCGFTPLETLIVERKTLRFSKLAKNPKGRQVVIGT 179 Query: 161 VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 V + G L I+D + +GIG K+ GCGLL+LA Sbjct: 180 VRYRGTLQIDDVETFKKSLVEGIGRGKAYGCGLLTLA 216 >UniRef50_A1SV70 CRISPR-associated protein, Cse3 family n=2 Tax=Gammaproteobacteria RepID=A1SV70_PSYIN Length = 180 Score = 167 bits (424), Expect = 2e-40, Method: Composition-based stats. Identities = 65/198 (32%), Positives = 95/198 (47%), Gaps = 20/198 (10%) Query: 1 MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 MYLS+V++ + D+Y+ HQ +W LF N D RD LF VE + C VLLQS+ P Sbjct: 1 MYLSQVMLN---THDIYEQHQAIWSLFENVADRKRDHLFRVEV-ADRQSCKVLLQSSTEP 56 Query: 61 VSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 S+ A V+ +K +++ F+L A P K + + + + + Sbjct: 57 KSSEQAKVLASKSFLAEIKQDAFYKFKLLAYPTKCLSQG-----------KKVIEIKEAN 105 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQ 180 EQ+ WLQRKL A ++ KS + VCFEG+L + D+ + + Sbjct: 106 EQVQWLQRKLSGAN-----VTVTAMDDLMVRSKKSYNSRFVCFEGILQVTDSEQIQRALV 160 Query: 181 QGIGPAKSMGCGLLSLAP 198 GIG K G GLLSLA Sbjct: 161 MGIGRKKHAGAGLLSLAR 178 >UniRef50_C1DSI0 CRISPR-associated protein, CT1974 n=3 Tax=Pseudomonadaceae RepID=C1DSI0_AZOVD Length = 205 Score = 167 bits (423), Expect = 2e-40, Method: Composition-based stats. Identities = 63/217 (29%), Positives = 89/217 (41%), Gaps = 32/217 (14%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLF-PNRPDAARDFLFHVEKRNTPE-GCHV 52 MYL+++ + AR D Y +H+ L F + DA FL+ +E + Sbjct: 1 MYLTRLTLDPRSAQARRDLADAYDMHRTLVRAFVRDERDAPGRFLWRLEPGADAWASPTL 60 Query: 53 LLQSAQMPVST---------AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRL 103 L+QS + K +E ++ FRL ANP T Sbjct: 61 LVQSCESGDWDVLQGLPGYLQRPAECKALDLEALIRPQWRYRFRLLANPTVTRA------ 114 Query: 104 DSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQY--FSGDGKSGKIQTV 161 R L+ EAEQ+AWLQR+ +S G +Q V Sbjct: 115 -------GKRRGLLGEAEQLAWLQRQGERHGFAVKAVLVSASDLLDSRRKGGAPIVLQRV 167 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 CFEG+L + +A AL + GIGPAK+ GCGLLS+A Sbjct: 168 CFEGLLQVVEADALRRALASGIGPAKAFGCGLLSVAR 204 >UniRef50_A8M405 CRISPR-associated protein, Cse3 family n=3 Tax=Actinomycetales RepID=A8M405_SALAI Length = 227 Score = 167 bits (423), Expect = 2e-40, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 84/229 (36%), Gaps = 38/229 (16%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAAR---DFLFHVEKRNTPEGCH 51 MYL++ ++ R +H + FP D R L+ ++ R + Sbjct: 1 MYLTRFLVNPARRGARKLLASPQAMHAAVLSGFPRPEDHTRDGARTLWRLDHRQDRQ-VV 59 Query: 52 VLLQSAQMPV---------STAVATVIKTKQV---EFQLQVGVPLYFRLRANPIKTILDN 99 + + S P + A T+ L G FRL ANP + N Sbjct: 60 LYVVSPTAPDLTHMVEQAGWPSNAETWATRPYSRLLDSLDKGQRWAFRLTANPARAGRRN 119 Query: 100 QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--------VEDVHPISERPQYFSG 151 Q R + +Q+ WL R+ ++ + R F+ Sbjct: 120 QDT------PTTQRYGHVTPVQQVEWLTRRAERNGFGVVRQTDGELNLITYNRRVHRFTR 173 Query: 152 DG--KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + T ++GVL +++ ++ +GIG A++ GCGLL++AP Sbjct: 174 GHTQRPVTLVTATYDGVLEVDEPTLFRGVLTRGIGHARAYGCGLLTVAP 222 >UniRef50_A1ARH5 CRISPR-associated protein, Cse3 family n=3 Tax=Bacteria RepID=A1ARH5_PELPD Length = 224 Score = 166 bits (420), Expect = 5e-40, Method: Composition-based stats. Identities = 59/236 (25%), Positives = 90/236 (38%), Gaps = 50/236 (21%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLF--PNRPDAARDFLFHVEKRNTPEG-CH 51 M+LS++ + R + YQLH L F P +FL+ +E G Sbjct: 1 MFLSRLRLNLRCREARRDLSNPYQLHSTLCRAFSPPETKCPKGEFLWRLEPETDSSGYPR 60 Query: 52 VLLQSAQMPVSTAV-----------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 +++QS +P V A +K + L+ FRLRANP T Sbjct: 61 IIVQSRNIPDWGGVGVNGWIQQADPAIDLKERLKLDLLKAEQRFRFRLRANPCVT----- 115 Query: 101 KRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVE----------------DVHPISE 144 R+ L+K+ EQ WL+RK DV E Sbjct: 116 --------KNGKRLGLLKQDEQEKWLKRKGAQHGFCLPEFLSFDYYESSEDRIDVRISQE 167 Query: 145 RPQ-YFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + S ++ +V ++G+LTI + ++ GIG K MG GLLS+ P+ Sbjct: 168 QMLSDKQHSDNSIRVFSVLYDGILTITEPEMFKIALKTGIGHGKVMGLGLLSVVPI 223 >UniRef50_Q1R113 CRISPR-associated protein, CT1974 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R113_CHRSD Length = 230 Score = 165 bits (419), Expect = 6e-40, Method: Composition-based stats. Identities = 61/230 (26%), Positives = 87/230 (37%), Gaps = 33/230 (14%) Query: 1 MYLS--KVIIA---RAWSRDL-----YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEG- 49 MYLS +V + R D+ Y HQ LW LF + + R FLF E G Sbjct: 1 MYLSSVRVDLNALTREQLFDVLEGGAYTAHQLLWTLFADTSEGERPFLFRQEMEEAANGK 60 Query: 50 ----CHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTIL----DNQK 101 + S + P + A ++ K QL G L FRLRANP Sbjct: 61 SQGLPRFYVYSTRRPEAVAGLD-VQCKPFAPQLAKGERLAFRLRANPTVAKSAGEGQRSH 119 Query: 102 RLDSKGNIKRCRVPLIK---------EAEQIAWLQRKLGNAARVEDVHP----ISERPQY 148 R D N ++ P + E WL + V P + Sbjct: 120 RADVLMNARKPFSPGERTSQACVDAMETAARDWLAERAPRFGFELPVAPEMGAYRQHELK 179 Query: 149 FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 S + + +V +EG+L + D LI+ + G+G AK+ GCGL+ L Sbjct: 180 KSDRREPIRFSSVDYEGLLEVTDPRRLIETLAHGVGRAKAFGCGLMLLRR 229 >UniRef50_C7MTM6 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTM6_SACVD Length = 241 Score = 165 bits (419), Expect = 7e-40, Method: Composition-based stats. Identities = 51/235 (21%), Positives = 84/235 (35%), Gaps = 39/235 (16%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNT--------- 46 +LS++ I R + ++ H + P+ P+A R L+ + Sbjct: 3 FLSRIRINPFRQKSRELLANPHKTHGAVLAGLPD-PEAERP-LWRWDTGRERRPYLLVLT 60 Query: 47 ---PEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRL 103 + H++ Q + QL G FR+ ANP++ + + Sbjct: 61 HTVADWTHLVEQCGWPAADGDHVITRDYTPLIRQLGEGREFAFRVTANPVQNVPAPAQES 120 Query: 104 DSKGNIKRCRV---PLIKEAEQIAWLQRKLGNAAR--------------VEDVHPISERP 146 + + K R A Q W + V D+ + Sbjct: 121 TPEPSAKPGRAVRKGHRTAAHQQRWFLERAERWGFQVPPALLDDPEADDVPDMRITQRQR 180 Query: 147 QYFSGD--GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F+ GK + T FEG L I D+ + +G+GPAK+ GCGLL+LAPL Sbjct: 181 LSFAKRKGGKPVILTTATFEGRLRITDSELFTRTLLRGLGPAKAYGCGLLTLAPL 235 >UniRef50_Q0RTG6 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RTG6_FRAAA Length = 278 Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 84/240 (35%), Gaps = 43/240 (17%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRN---------- 45 YLS+V + ++ R+ ++H + +P R L+ +E Sbjct: 29 YLSRVWLNPLRTGAQSLLRNPERMHAAVLGGLTRQPVTER-VLWRLETGRPHRAEVLILT 87 Query: 46 --TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN---- 99 P H++ Q+ A V + + ++Q G FRLRANP+ Sbjct: 88 ESRPSWEHLIEQAGWPNAEDPQALVRDYQPLLDRIQAGREFAFRLRANPVAATRQPTSPS 147 Query: 100 --QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVE-------DVHPISERPQYFS 150 QK + + RV +Q+AW ++ V + F Sbjct: 148 VAQKERLAGPRPRGVRVAHRTAGQQLAWFTDRVDRWGFTPLTTETGPAVQLNARERLTFR 207 Query: 151 GDG-----------KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + T F+G L + D + G+G AK+ GCGLL+LAPL Sbjct: 208 KRPPDGGNGGKNKGHQVVLSTATFDGALRVVDPDLARRALLSGVGAAKAYGCGLLTLAPL 267 >UniRef50_A5GBK0 CRISPR-associated protein, Cse3 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK0_GEOUR Length = 229 Score = 163 bits (414), Expect = 2e-39, Method: Composition-based stats. Identities = 58/226 (25%), Positives = 90/226 (39%), Gaps = 31/226 (13%) Query: 2 YLSKVIIARAWSR------DLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQ 55 +L+++ + R D+Y H+ LW +P++P+A RDFL +++ + Sbjct: 3 WLARLEVDAETVRAAGISEDVYAWHKLLWECYPDQPEAERDFLTRIDQLEGAYRFW--VL 60 Query: 56 SAQMPVSTAVA--TVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK-----GN 108 + + PV ++ F LRANP++ + + Sbjct: 61 AKRKPVMPRWCPVDGFGLNEISPSFLSRQYYAFDLRANPVRAAVQRDANGEQVLDANGKR 120 Query: 109 IKRCRVPLIKEAEQIAWLQRKLG-------------NAARVEDVHPISERPQ---YFSGD 152 + RVPL+K E AWL RK R+ + + P +F Sbjct: 121 RRGKRVPLVKPDELRAWLVRKGEVRCRDKETGLDVPGGFRLVEERSLEISPMVESHFRKK 180 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G+SG V F G L + D I+ Q GIG AK G GLL LAP Sbjct: 181 GQSGYHGGVQFRGTLEVTDRAKFIESYQSGIGSAKGFGFGLLLLAP 226 >UniRef50_D0WFC7 CRISPR-associated protein, Cse3 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC7_9ACTN Length = 255 Score = 163 bits (412), Expect = 4e-39, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 77/251 (30%), Gaps = 54/251 (21%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLFPN---RPDAARDFLFHVEKRNTPEGCHV 52 YL++ I R YQ+H + FP + L+ V+ + Sbjct: 3 YLTRFPINKTRRDARRLLASPYQMHAAIAGSFPVIHCLDSGKKRVLWRVDASEDGS-ARL 61 Query: 53 LLQSA------------QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ 100 + S P ++++G FRL ANP+ + Sbjct: 62 YIVSPDKPSLVGLDEQIGWPDLPQQWETRSYDTFLSRIEIGQEYAFRLFANPVLSRSTRG 121 Query: 101 KRLDSKGNI-KRCRVPLIKEAEQIAWLQ---------------------RKLGNAARV-- 136 R + K R+ + +Q AWL + Sbjct: 122 GRTVPRNEKGKPKRIGHLTVLQQAAWLIGKDAYLGSGLEVPELFAHQEWNRAQRNGFEVL 181 Query: 137 --------EDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKS 188 V ++ + + T F+G L ++D L + GIG AK Sbjct: 182 TNLDGTARLIVSHSGKQKLRSGRESCPITLSTAQFDGFLRVSDPDLLRSALVNGIGHAKG 241 Query: 189 MGCGLLSLAPL 199 GCGLL+LAP+ Sbjct: 242 FGCGLLTLAPM 252 >UniRef50_C2BET7 CRISPR-associated protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BET7_9FIRM Length = 215 Score = 162 bits (410), Expect = 7e-39, Method: Composition-based stats. Identities = 49/221 (22%), Positives = 85/221 (38%), Gaps = 31/221 (14%) Query: 1 MYLSKVII---ARAWSRDLYQL---HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS+V I R +DL L H + H FP D L+ ++ N + ++++ Sbjct: 1 MYLSRVEIDINNRRKMKDLTHLGCYHGWVEHSFPQENDIRTRKLWRID--NIGDKYYLII 58 Query: 55 QSAQMPVSTAVA--------TVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK 106 S +P + V + L+ G+ FR++ N + +D + Sbjct: 59 LSEYIPDKEKLEKYGVESTTEVKDYDEFLASLKEGIRAKFRIKLNTVIAK------IDKE 112 Query: 107 GNIKRCRVPLIKEAEQIAWLQRKLGNAAR---VEDVHPISERPQYF------SGDGKSGK 157 + KR R+ + + +L K ++ +YF Sbjct: 113 NSTKRGRIMPVPNEKLNGFLVDKAQRNGFEVKTDEFGISKIDKEYFMNFDKEDKKKSRKN 172 Query: 158 IQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 I + +EG+LTI D + GIG K+ GCG L++ P Sbjct: 173 IVSATYEGMLTITDLEKFKVALVNGIGKKKAYGCGFLTIIP 213 >UniRef50_A0LM55 CRISPR-associated protein, Cse3 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM55_SYNFM Length = 202 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 63/216 (29%), Positives = 91/216 (42%), Gaps = 31/216 (14%) Query: 1 MYLSKVIIAR------AWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS + + R D+Y LH+G+ F D R LF VE N +++ Sbjct: 1 MYLSLLSLDRLHRGTMRLLSDIYLLHKGIMSGFTRCGDGLR-VLFRVEPENDDRIVRIMV 59 Query: 55 QSAQMPVST------AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGN 108 QS P ++TK L+ G FRLRANP Sbjct: 60 QSDGSPSWELFTERHPCVIDMRTKVFSPALRAGHSYRFRLRANPAVK------------- 106 Query: 109 IKRCRVPLIKEAEQIAWLQRK-----LGNAARVEDVHPISERPQYFSGDGKSGKIQTVCF 163 R LI++ WL+RK L + + + SG + I+T F Sbjct: 107 RNGKRYGLIRDETLEEWLRRKEPALGLQFRSVLALDEGYVTGHKEGSGHPQRINIKTARF 166 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 EG+LT+++ + + + GIGPAK+ GCGLLSLA + Sbjct: 167 EGILTVSEPHLVQNALCCGIGPAKAFGCGLLSLARV 202 >UniRef50_Q47PI8 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobifida fusca YX RepID=Q47PI8_THEFY Length = 207 Score = 161 bits (407), Expect = 1e-38, Method: Composition-based stats. Identities = 44/215 (20%), Positives = 71/215 (33%), Gaps = 43/215 (20%) Query: 19 LHQGLWHLFP---NRPDAARDFLFHVE------------KRNTPEGCHVLLQSAQMPVST 63 +H + FP L+ ++ P+ H++ Q+ T Sbjct: 1 MHAAVMSSFPTLLPSDTDGPRVLWRIDRTSRAEVFLYIVSPPKPDLTHLVEQAGWPTQPT 60 Query: 64 AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQI 123 +L G FRL ANP+ +I + + Q Sbjct: 61 --WESYDYTPFLSRLAKGDVWAFRLTANPVHSIRRKAGEP-------TKLTAHLTQRYQK 111 Query: 124 AWLQRKLGNAARVEDVHPISERPQ-------------------YFSGDGKSGKIQTVCFE 164 WL ++ A P +R + G+ + TV F+ Sbjct: 112 KWLLQRQDAAGFRVVEKPAEKRRLPEGDEHELIVHNRRDWNFSKGARKGRPVSLVTVTFD 171 Query: 165 GVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 G L + D AL + GIG AK+ GCGL++LAP+ Sbjct: 172 GRLEVTDPDALRRALISGIGRAKAYGCGLMTLAPV 206 >UniRef50_Q03C59 CRISPR-associated protein n=3 Tax=Lactobacillus RepID=Q03C59_LACC3 Length = 215 Score = 161 bits (407), Expect = 2e-38, Method: Composition-based stats. Identities = 50/222 (22%), Positives = 78/222 (35%), Gaps = 34/222 (15%) Query: 1 MYLSKVIIARAW------SRDLYQLHQGLWHLFPNR--PDAARDFLFHVEKRNTPEGCHV 52 MYLS+V + L H + FP L+ ++ N + ++ Sbjct: 1 MYLSRVQVNTNDHQIFKHLTHLGAYHDWVKRSFPREIAAGTRLRHLWRLDSLNGRD--YL 58 Query: 53 LLQSAQMPV--------STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 L+ S P A L+ G L FRL ANP + I +R Sbjct: 59 LVLSPDAPELAQLARYGVAGTAQTKDYDPFVTALRQGQRLRFRLTANPTRAIATPGQR-- 116 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVH--------PISERPQYFSGDGKSG 156 P + A+Q+AWL + + + P GK Sbjct: 117 ------GHVAPHVTVAQQMAWLSERAAALGFELPIDDDGPQFQIVGRDYPALRRAQGKPV 170 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 ++ V FEG L ++D + + GIG K+ G GLL++ P Sbjct: 171 RLSRVSFEGTLVVSDLVRFKETLATGIGREKAFGMGLLTVIP 212 >UniRef50_Q314I5 CRISPR-associated protein, CT1974 n=2 Tax=Desulfovibrio RepID=Q314I5_DESDG Length = 219 Score = 160 bits (405), Expect = 2e-38, Method: Composition-based stats. Identities = 57/223 (25%), Positives = 93/223 (41%), Gaps = 28/223 (12%) Query: 1 MYLSKVIIA--RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ 58 M++SK+++ RA ++LY H+ LW+LF + PD RDFLF E L S + Sbjct: 1 MWMSKLVLDPRRAVGKNLYDTHRLLWNLFADAPDRTRDFLFR----EQDEPYTFLTVSRR 56 Query: 59 MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDS------------- 105 P T I+ K +LQ G + F LR N + +N K+ Sbjct: 57 QPEDTTGWWSIQIKPYAPKLQAGDAVAFSLRVNAVVKRNENGKQRRFDIVQDACLRMKEL 116 Query: 106 KGNIKRCRVPLIKEAEQIAWLQRKL-------GNAARVEDVHPISERPQYFSGDGKS--G 156 N + I + WL + +AA + + + + + D +S Sbjct: 117 NQNAQMPTRAEIAQEAGTRWLLARQQALGLSIESAAILVEGCKVERFVKRATRDTRSGVV 176 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + + +G + D L+ + QG+GPAK GCGLL + + Sbjct: 177 SLGIMDLQGTAEVKDPQLLLQALFQGVGPAKGFGCGLLLIRRV 219 >UniRef50_A7BA62 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA62_9ACTO Length = 249 Score = 159 bits (403), Expect = 5e-38, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 76/254 (29%), Gaps = 62/254 (24%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFP----NRPDAARDFLFHVE-------- 42 MYL+++ + + R LH + + FP + P A R L+ ++ Sbjct: 1 MYLTRIYLNPHRRGAKQLMRSRQTLHAAVLNCFPPSVLDDPGAPR-VLWRLDRPPAVRGA 59 Query: 43 -KRNTPEGCHVLLQSAQMPVSTAVATV-----------IKTKQVEFQLQVGVPLYFRLRA 90 R C + + S P + + L G FRL Sbjct: 60 APRQGSPSCSLYISSPVAPDPSHIVEEAGYATEGGVVIRDMSSFLEGLWAGQRWGFRLCV 119 Query: 91 NPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHP--------- 141 NP ++ + + + +Q W+ + Sbjct: 120 NPTFREGSQ-----VNARGRKKVLAHVTQDQQTQWVLERAEKCGFRVLTSAELGGELPVL 174 Query: 142 -----------------ISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIG 184 + F + + FEGVL + D A+ ++ GIG Sbjct: 175 EDSDGQRVDGANLLINGVERSIAEFKRGERRVTLGVATFEGVLEVTDPDAMRRVLTHGIG 234 Query: 185 PAKSMGCGLLSLAP 198 K+ GCGL++LA Sbjct: 235 RGKAYGCGLMTLAR 248 >UniRef50_Q53WG9 Putative uncharacterized protein TTHB192 n=1 Tax=Thermus thermophilus HB8 RepID=Q53WG9_THET8 Length = 211 Score = 159 bits (402), Expect = 6e-38, Method: Composition-based stats. Identities = 61/222 (27%), Positives = 96/222 (43%), Gaps = 35/222 (15%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAAR-DFLFHVEKRNTPEGCHVL 53 M+L+K+++ R + Y++H+ L + R L+ +E E VL Sbjct: 1 MWLTKLVLNPASRAARRDLANPYEMHRTLSKAVSRALEEGRERLLWRLEPARGLEPPVVL 60 Query: 54 LQSAQMPVST----AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNI 109 +Q+ P + A V K L+ G L FRLRANP K + Sbjct: 61 VQTLTEPDWSVLDEGYAQVFPPKPFHPALKPGQRLRFRLRANPAKRLA-----------A 109 Query: 110 KRCRVPLIKEAEQIAWLQRKLGNAAR-------------VEDVHPISERPQYFSGDGKSG 156 RV L AE++AWL+R+L ++D R + GK Sbjct: 110 TGKRVALKTPAEKVAWLERRLEEGGFRLLEGERGPWVQILQDTFLEVRRKKDGEEAGKLL 169 Query: 157 KIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 ++Q V FEG L + D + +++G+GP K++G GLLS+AP Sbjct: 170 QVQAVLFEGRLEVVDPERALATLRRGVGPGKALGLGLLSVAP 211 >UniRef50_B6B784 CRISPR-associated protein, Cse3 family n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B784_9RHOB Length = 223 Score = 158 bits (401), Expect = 7e-38, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 86/227 (37%), Gaps = 33/227 (14%) Query: 1 MYLSKVIIAR--------------AWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNT 46 MYLS++ +AR H+ +W F P A RDFL+ E R Sbjct: 2 MYLSRLTLARDPSVAALNALLDPDEKGAGADAHHRLIWSAFAGDPLAPRDFLWRAEGRG- 60 Query: 47 PEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKR---- 102 L+QS + PV + +++ L+ G + F LRAN K + ++R Sbjct: 61 ----RFLVQSPEPPVGGPFFDPPEVRELAPDLRRGDQVSFLLRANATKDLRGEKRRRVDV 116 Query: 103 -----LDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISER-----PQYFSGD 152 D ++ R + + W+ + A D + + P + S Sbjct: 117 VMNLLHDVPKAERQIRRMALAQQAAGEWMAGQAARAGFCADHLEVQDYSTLTLPGHRSRR 176 Query: 153 GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 + + + G +T+ D + + QG G AK GCGL+ + + Sbjct: 177 RGAPRFGILDLTGRITVTDPQVFLAKLAQGFGRAKGFGCGLMLIRRV 223 >UniRef50_Q2JH26 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JH26_FRASC Length = 275 Score = 158 bits (399), Expect = 1e-37, Method: Composition-based stats. Identities = 51/267 (19%), Positives = 85/267 (31%), Gaps = 70/267 (26%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRN---------- 45 YLS++ + +A ++ ++H + +P R L+ +E Sbjct: 3 YLSRIWLNPLRTGAQALLKNPQRMHAAVLGGLSRQPVTER-VLWRLETGEGLRGADRPHR 61 Query: 46 ---------TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTI 96 TP H++ Q+ + V + + +L G FRLRAN + Sbjct: 62 AEVLVLTESTPSWEHLIEQAGWIHTDEPQVLVRDYQPLLDRLHTGREFRFRLRANTVSAT 121 Query: 97 LDN------QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQ--- 147 QK + + RV A Q WL ++ + P+ Sbjct: 122 RTPDNPSPAQKEHLAAPRPRGVRVGHRTAAHQTTWLTDRIDRWGFTLLTTADLDGPRNQP 181 Query: 148 -----------------------------------YFSGDGKSGKIQTVCFEGVLTINDA 172 G+ + T FEG L + D Sbjct: 182 DGPRNQPDGPGDTDEPAPALRLTARERLTFPKKAKNTEKTGRRVVLNTATFEGALRVTDP 241 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLAPL 199 + G+GPAK+ GCGL++LAPL Sbjct: 242 ARARATLLHGVGPAKAYGCGLITLAPL 268 >UniRef50_A9GV72 Putative uncharacterized protein ygcH n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GV72_SORC5 Length = 246 Score = 157 bits (398), Expect = 2e-37, Method: Composition-based stats. Identities = 61/245 (24%), Positives = 93/245 (37%), Gaps = 49/245 (20%) Query: 1 MYLSKVIIA------RAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGC 50 MYLS+ ++ RA D+ LH+ + FP+ P A LF V++ Sbjct: 1 MYLSRALLNPISRAVRADIADIEGLHRTIMRAFPDGAGPHPRRAHGVLFRVDEAVLRGRF 60 Query: 51 HVLLQSAQMPVSTAVATV----------------------IKTKQVEFQLQVGVPLYFRL 88 +L+QSA P T + + +++ G F L Sbjct: 61 VLLVQSATRPDFTRLPEDYFLDIQEDLGLTEPSPIENPAIREVGSERARIRAGDFFRFSL 120 Query: 89 RANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVED---------- 138 RA+P + I + K D RV L +A ++ WL+RK Sbjct: 121 RASPTRRI--DTKSGDDGKRRNGRRVELRDDASRLDWLRRKAMAGGFELCGAEDGAGVGG 178 Query: 139 VHPISERPQYFSG-----DGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGL 193 V + E G + + V FEG L + DA + + G+GPAK+ G GL Sbjct: 179 VSAVEEPKLTGRGSGASEQRQQLTLAPVLFEGRLRVTDADRFREALAAGVGPAKAYGFGL 238 Query: 194 LSLAP 198 LS+AP Sbjct: 239 LSIAP 243 >UniRef50_C0VRW4 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51867 RepID=C0VRW4_9CORY Length = 220 Score = 156 bits (396), Expect = 3e-37, Method: Composition-based stats. Identities = 49/224 (21%), Positives = 87/224 (38%), Gaps = 36/224 (16%) Query: 3 LSKVIIA------RAWSRDLYQLHQGLWHLFP-NRPDAARDFLFHVEKRNTPEGCHVL-- 53 ++V + R + +H + LFP + P L+ +++ + +++ Sbjct: 4 FTRVFVNPQKRHGRKVLTNPEAMHAEVRGLFPPDLPSDNGRVLWRLDQHDNEHILYIVGP 63 Query: 54 ------LQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKG 107 + ++ ST A ++ L G F L ANP ++ KR Sbjct: 64 ERPDTAELADRLGWSTRPAQTADYDKLLSSLAKGQQWCFELLANPSISLKTGGKR----- 118 Query: 108 NIKRCRVPLIKEAEQIAWLQRKLGNAAR---------VEDVHPISERPQYFSG----DGK 154 VPL + +QI WL ++ D+ + + FS + Sbjct: 119 ---GKSVPLARIDQQIDWLLQRSEKNGFKVLPQGDSAEPDLRIANRKVMRFSKNPRDHKR 175 Query: 155 SGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + TV FEG L + DA AL + QGIG ++ G GL++LA Sbjct: 176 TVALTTVRFEGTLEVTDAEALRATLTQGIGKGRAYGLGLMTLAR 219 >UniRef50_B1VIX9 CRISPR-associated protein n=6 Tax=Actinomycetales RepID=B1VIX9_CORU7 Length = 234 Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats. Identities = 54/241 (22%), Positives = 80/241 (33%), Gaps = 55/241 (22%) Query: 3 LSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAA-RDFLFHVEKRN---------- 45 +K+ + R D ++H + FP D + L+ V+ Sbjct: 4 FTKIQLNPHRREGRKLLSDPQRMHAAVRAAFPPELDESDARVLWRVDPGEHEHVLYVVGP 63 Query: 46 -TPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 P G ++ Q+ T A + +L G F L ANP R Sbjct: 64 EKPTGAVLVEQAGW---DTLPAQTADYSRFLGKLTRGQRWRFELVANPTYAEPRKGGRGK 120 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR-----VEDVHPISERPQY----------- 148 K + QI WL RK A ++D ER ++ Sbjct: 121 VK--------AHVSVRHQIGWLYRKADAAGFGLAPRLDDEVSDEERSRWSEFDAPQVTER 172 Query: 149 ----------FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G G+ +I F G L + D L + QGIG A+ GCGLL+LAP Sbjct: 173 WTDVFHRNKAGGGRGRPVRIAKARFTGTLEVTDPELLRQALAQGIGRARGYGCGLLTLAP 232 Query: 199 L 199 + Sbjct: 233 I 233 >UniRef50_C6HV94 CRISPR-associated protein, Cas3 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV94_9BACT Length = 221 Score = 154 bits (390), Expect = 2e-36, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 76/207 (36%), Gaps = 21/207 (10%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFL-----FHVEKRNTPEGCHVLLQSA 57 LS+ + RD Y LH+ ++ LF +R L + +K +L+ S Sbjct: 23 LSREDVRVLKIRDAYSLHKVVYGLFEDRRSKEEKSLVSSGILYADKGGDIHFRKLLILSD 82 Query: 58 QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLI 117 + P T I+T+ V F + NP K + N +P+I Sbjct: 83 RRPHQTPQFGKIETRPVFSSFLNCDHYLFEVIVNPSK-----------RDNHSGKIMPVI 131 Query: 118 KEAEQIAWLQRKLGNAARV----EDVHPISERPQYFSGD-GKSGKIQTVCFEGVLTINDA 172 W + G++ + + + ++ Q F G+ + +G + D Sbjct: 132 GRENIRQWFLDRAGDSWGLSVSPDSLEVVNAGVQKFEKQNGQFITHGSATLKGEFHVVDR 191 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLAPL 199 + + GIG K+ G GLL + P+ Sbjct: 192 ERFVKSFKNGIGRGKAFGFGLLQIVPV 218 >UniRef50_C6C417 CRISPR-associated protein, Cse3 family n=4 Tax=Enterobacteriaceae RepID=C6C417_DICDC Length = 215 Score = 152 bits (385), Expect = 5e-36, Method: Composition-based stats. Identities = 52/208 (25%), Positives = 81/208 (38%), Gaps = 24/208 (11%) Query: 1 MYLSKVIIA----------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGC 50 M+ S+V + + + +Y HQ LW LFP +R FLF + T Sbjct: 1 MFFSRVTLQPAALPSVMAEKWQTTPVYASHQWLWQLFPQE--GSRGFLFRQDDHATLSRY 58 Query: 51 HVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQ-------KRL 103 ++L SA P V++TK + QL G+PL F LRANP+ T + K Sbjct: 59 YLL--SACAPRQDHNLFVVETKPWQPQLNAGMPLAFSLRANPVVTRRQKRCDVLMDAKYH 116 Query: 104 DSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVE---DVHPISERPQYFSGDGKSGKIQT 160 + ++ + WL R+ V Y + Sbjct: 117 AKAQGADSAEIWPRQQQAAVDWLVRQGERGGFAVHACHVDGYQRHRLYKPQQSGPVSFSS 176 Query: 161 VCFEGVLTINDAPALIDLVQQGIGPAKS 188 V F+G+L I DA + V QG+G +++ Sbjct: 177 VDFDGLLRITDAKRFAETVSQGLGKSRA 204 >UniRef50_B0LU87 CRISPR-associated protein Cas3 n=2 Tax=Streptomyces RepID=B0LU87_9ACTO Length = 270 Score = 151 bits (383), Expect = 8e-36, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 85/265 (32%), Gaps = 69/265 (26%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQ 55 YLS++ I R + +H + PN R L+ ++ N P H+ + Sbjct: 3 YLSRIRINPLRKDSRKLLSNPRAVHGAVMGGLPNHKPDDR-VLWRMDPDN-PHRPHLFVL 60 Query: 56 SAQMPVSTA-------------VATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKR 102 S P T A V + QL VG FRL A+P++ K Sbjct: 61 SPTRPDWTHIIQDCGWPDADGDHAAVRDYTPLLSQLAVGREFAFRLTASPVQNTATPTKA 120 Query: 103 LDSKG-----------NIKRCRVPLIKEAEQIAWLQRKLGNAAR---------------- 135 ++ I+ R+ A Q+ W + Sbjct: 121 TPAQAARLTAHAEDGKRIRGFRMGHRTAAAQLDWFLTRTDRWGFDIPATRSDPTAPGIHA 180 Query: 136 -------------------VEDVHPISERPQYFSGDGK--SGKIQTVCFEGVLTINDAPA 174 +V + F +G ++ FEG L I D Sbjct: 181 PTPPTAPRPTSPPRPDPNPPYEVRITARHRHSFQKNGHGAHVVFRSATFEGRLRITDTDR 240 Query: 175 LIDLVQQGIGPAKSMGCGLLSLAPL 199 + G+GP+++ GCGLL+LAPL Sbjct: 241 FTTSLLTGLGPSRAYGCGLLTLAPL 265 >UniRef50_D1YEE5 CRISPR system CASCADE complex protein CasE n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE5_PROAC Length = 221 Score = 151 bits (383), Expect = 9e-36, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 73/225 (32%), Gaps = 34/225 (15%) Query: 1 MYLSKVIIAR------AWSRDLYQLHQGLWHLFP--NRPDAARDFLFHVEKRNTPEGCHV 52 M+L++ I +LH + FP L+ +++ + Sbjct: 1 MFLTQFDINVARRDAMRLLASPERLHAAVLGAFPPGQSVSNGARTLWRLDRGPARHDARL 60 Query: 53 LLQSAQMPV---------STAVATVIK--TKQVEFQLQVGVPLYFRLRANPIKTILDNQK 101 ++ S P + A+ L+ G FR NP + + Sbjct: 61 MIVSPLRPDLTALNEQAGWSNGASSRSANYDPFLQALRSGSTWRFRCTINPTTAVRKSAG 120 Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV--------EDVHPISERPQYFSGDG 153 + RV + +Q+ W ++ F Sbjct: 121 S-------RGQRVAEVTAEQQLTWFIGRVERHGYTVPVNDQGAPSAQVTRREILRFRRQR 173 Query: 154 KSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + +GV+ I DA A + QGIGPAKS GCGL++LAP Sbjct: 174 STVTLAVTQVDGVIQIQDADAARLALVQGIGPAKSYGCGLMTLAP 218 >UniRef50_C7LYW5 CRISPR-associated protein, Cse3 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW5_ACIFD Length = 226 Score = 151 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 79/227 (34%), Gaps = 37/227 (16%) Query: 1 MYLSKVIIARAW------SRDLYQLHQGLWHL----FPNRPDAARDFLFHVEKRNTPEGC 50 M+L+++ I R+ ++H + P ++ L+ V+ + P Sbjct: 1 MFLTRLYIDPQKQAALSVLRNPQRMHAIIAQATSASVPQEANSIGRTLWRVD-GDDPRVP 59 Query: 51 HVLLQSAQMPVSTAVA------------TVIKTKQVEFQLQVGVPLYFRLRANPIKTILD 98 + + SA P A + +L+ G FRL AN +++ Sbjct: 60 ILYVVSAVQPQFAHFAASVGQVVRGTDYDTKPYGPLLDRLETGQVYAFRLAANAVRSGRS 119 Query: 99 NQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV--------EDVHPISERPQYFS 150 + D R + +Q+ WL + DV R F Sbjct: 120 SSGSAD------TKRHGHVTITQQLGWLLARSEQHGFTIRTGSTGEPDVAVTGGRRMVFR 173 Query: 151 GDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 G+ I F G L + D L + GIG A++ GCGLL+LA Sbjct: 174 RQGQRVTIALTEFMGHLEVLDRELLRRSLVTGIGHARAYGCGLLTLA 220 >UniRef50_B5GAA2 Crispr-associated protein n=1 Tax=Streptomyces sp. SPB74 RepID=B5GAA2_9ACTO Length = 217 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 54/201 (26%), Positives = 80/201 (39%), Gaps = 17/201 (8%) Query: 9 ARAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTA 64 A R LH+ + LFP+ R LF E+ T G +L+QS P T Sbjct: 21 ATRDLRSAVNLHKRVMSLFPDDLGERARQQTGALFRFEEDAT-RGSRLLVQSVVTPDPTR 79 Query: 65 VATVI------KTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 + + + +L+ GV + +RL N +T+ D+ +PL Sbjct: 80 LPARYGAVRSTEITPLLQRLRPGVRVNYRLTGNATRTLSR-----DTTAGRPNQVIPLHG 134 Query: 119 EAEQIAWLQRKLGNAARVEDVHPIS-ERPQYFSGDGKSGKIQTVCFEGVLTINDAPALID 177 + WL+R + +H + D + + F+G T+ D AL Sbjct: 135 ADAEEWWLRRAASAGLDIHKIHTTELDDAAGNRHDKQRIRHARTRFDGTATVTDPDALRT 194 Query: 178 LVQQGIGPAKSMGCGLLSLAP 198 V GIG KS GCGLLSLAP Sbjct: 195 CVTTGIGRGKSYGCGLLSLAP 215 >UniRef50_Q1J366 CRISPR-associated protein, CT1974 n=2 Tax=Deinococci RepID=Q1J366_DEIGD Length = 211 Score = 150 bits (380), Expect = 2e-35, Method: Composition-based stats. Identities = 63/220 (28%), Positives = 89/220 (40%), Gaps = 35/220 (15%) Query: 1 MYLSKVII---ARAWSRD---LYQLHQGLWHLFPNRP------DAARDFLFHVEKRNTPE 48 +YLS++ R +RD Y LHQ L F L+ E + Sbjct: 4 LYLSRLRFEDRDRRTARDLASPYALHQTLRWAFAGAGVEGAPLPDGERALWRQE-----D 58 Query: 49 GCHVLLQSAQMPVSTAV---------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDN 99 +L+QS P A+ +KT + L G PL FRLRAN LD Sbjct: 59 RATLLVQSLTAPDWEALNARHPGSLRGWEVKTVDLAPALTPGRPLRFRLRANVTVRKLDE 118 Query: 100 QKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR-VEDVHPISERPQYFSGDGKSGKI 158 + R R + EQ+ WL R+ V + + + Sbjct: 119 KGR--------SRRHAVRGPHEQLEWLSRQGERCGFAVLAADIVHSGTVKTRKGSATITL 170 Query: 159 QTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 TV FEG+L + D AL++ V+ G+G AK++GCGLLSL P Sbjct: 171 HTVTFEGILRVTDPAALLEAVRGGLGHAKALGCGLLSLGP 210 >UniRef50_C4X9I8 Crispr-associated Cse3 family protein n=6 Tax=Gammaproteobacteria RepID=C4X9I8_KLEPN Length = 215 Score = 150 bits (378), Expect = 4e-35, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 75/206 (36%), Gaps = 16/206 (7%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARD------FLFHVEKRNTPEGCHVLLQS 56 L + + D Y LH+ ++ LF + + + ++ G +L+ S Sbjct: 11 LDRAAVKALKISDAYSLHRVVYSLFADARTDREKCSHISSGIAYADQGGDFHGRKILIVS 70 Query: 57 AQMPV--STAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 ++P + + +K + F+++ NP++ KR+ KG + Sbjct: 71 DRLPAAKVDGLYGEVISKSIPAAFLSHSRYRFQVQVNPVRKDKQTGKRVAVKGRADIAQW 130 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFS-GDGKSGKIQTVCFEGVLTINDAP 173 + + W G + + + F G+ + +G+LT+ D Sbjct: 131 FI--QRAASRW-----GFDVDLPGLQVEAMEVLQFKDKGGRQVTLGKATVQGLLTVTDRQ 183 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAPL 199 GIG ++ GCGLL + P+ Sbjct: 184 KFQHSFHHGIGKGRAFGCGLLQIVPV 209 >UniRef50_A9HLC4 CRISPR-associated protein, Cse3 family n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HLC4_GLUDA Length = 228 Score = 149 bits (377), Expect = 5e-35, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 77/212 (36%), Gaps = 21/212 (9%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+++++ R H LW LF + PD RDFL+ E ++ SA+ PV Sbjct: 20 LARLLVPDGEGRQHAAAHHLLWALFGDDPDRTRDFLWR-----QMEAGRFMVLSAREPVD 74 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQ 122 + ++T+ + L+ G L F LRAN + + ++ + + E Sbjct: 75 SHGLFDVETRPFDPLLKEGDRLRFLLRANATVDRKTPGRTRSQRHDVVMDALHRRSQREG 134 Query: 123 IA------------WLQRKLGNAARVED--VHPISERPQYFSGDGKS--GKIQTVCFEGV 166 W+ R+ A + G V G Sbjct: 135 AEARDSMIADALETWMGRQGVRAGFAPASPLVIEGRDVLRIPRSGGRGIVSFGVVNLTGE 194 Query: 167 LTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + + A +D + QG G A++ GCGL+ + Sbjct: 195 VRVTAPDAFLDSLMQGFGRARAFGCGLMLIRR 226 >UniRef50_Q2RY20 CRISPR-associated protein, CT1974 n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY20_RHORT Length = 220 Score = 148 bits (375), Expect = 7e-35, Method: Composition-based stats. Identities = 51/191 (26%), Positives = 77/191 (40%), Gaps = 16/191 (8%) Query: 20 HQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQ 79 H+ +W LF + P A+RDF+F E L+ SA+ P ++TK + Sbjct: 33 HRLVWTLFADDPKASRDFVFRE-----AEPGRYLIVSARPPGDGQGLWRLETKPYAPAFR 87 Query: 80 VGVPLYFRLRANPIKTILD----NQKRLDSKGNIKRCRVPLIK----EAEQIAWLQRKLG 131 G F LRANP + KR+D+ + K + E + WL + Sbjct: 88 EGQRFGFTLRANPATAVKQAGETRGKRVDAIMHAKTRSATPLTVEDRERVALDWLLDRQQ 147 Query: 132 NAARV---EDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKS 188 + R GK+ + +EGV T+ D L + +GIG AK+ Sbjct: 148 GFGVLFERALCSAGGYRQVRVPRGGKAITFSVIDYEGVFTVRDPGLLGQALVRGIGKAKA 207 Query: 189 MGCGLLSLAPL 199 GCGL+ L L Sbjct: 208 YGCGLMLLRRL 218 >UniRef50_C7JIG8 CRISPR-associated protein Cse3 n=8 Tax=Acetobacter pasteurianus RepID=C7JIG8_ACEP3 Length = 229 Score = 148 bits (374), Expect = 9e-35, Method: Composition-based stats. Identities = 43/214 (20%), Positives = 73/214 (34%), Gaps = 24/214 (11%) Query: 3 LSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVS 62 L+ +++ + R H LW LF + RDFL+ E H ++ SA+ PV Sbjct: 20 LAGLLVPQGEGRQHGAAHHLLWVLFGDDSSRIRDFLWR-----QTEPGHFMILSARKPVD 74 Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSK------------GNIK 110 + I++++ +L+ G L F LR N ++ + + Sbjct: 75 SHRLFEIESREFTPKLREGNRLRFLLRVNATVDRKVPGRKRSQRHDVVMDALYKLPAKER 134 Query: 111 RCRVPLIKEAEQIAWLQRKLGNAARVEDVHP------ISERPQYFSGDGKSGKIQTVCFE 164 + AWL R+ G GK+ V Sbjct: 135 AAARESLVPTAMEAWLARQGHRTGFELKEGKLAIESCDVLHIPRAQGQGKA-TFGVVDVT 193 Query: 165 GVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G L + + QG G A++ GCGL+ + Sbjct: 194 GELCVRTPDLFTQALMQGFGRARAFGCGLMLVRR 227 >UniRef50_D1A6Q6 CRISPR-associated protein, Cse3 family n=5 Tax=Actinomycetales RepID=D1A6Q6_THECD Length = 214 Score = 146 bits (370), Expect = 3e-34, Method: Composition-based stats. Identities = 55/206 (26%), Positives = 86/206 (41%), Gaps = 20/206 (9%) Query: 5 KVIIARAWSRDLYQLHQGLWHLFPNR--PDAARD--FLFHVEKRNTPEGCHVLLQSAQMP 60 + AR D+ +LH+ + LFP+ P+A R LF +E P G +L+QS+ P Sbjct: 15 RDRAARDDLGDVVRLHRRIMSLFPDGLGPEARRRAAVLFRLE--ERPTGTSILMQSSIEP 72 Query: 61 VSTAVAT------VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 + + L+ GV +++R+ AN + + N + + V Sbjct: 73 ALEKLPASYGKARCKSLAPLLNGLREGVNVHYRIVANATRKLGRN-----TTAGRPKQVV 127 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGD--GKSGKIQTVCFEGVLTINDA 172 PL AE W +R+ A V + F+G T+ D Sbjct: 128 PLHG-AEADEWWRRQADAAGLVLRSLHSRQLDTGTGRRSDNNRVTHARTQFDGTATVTDP 186 Query: 173 PALIDLVQQGIGPAKSMGCGLLSLAP 198 ALID + GIG K+ GCGLL++AP Sbjct: 187 KALIDRIHAGIGRGKAYGCGLLTIAP 212 >UniRef50_Q0BSC8 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC8_GRABC Length = 227 Score = 146 bits (370), Expect = 3e-34, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 78/201 (38%), Gaps = 17/201 (8%) Query: 10 RAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVI 69 RA + H LW +F + + RDFL+ E+ + L SA+ P+ + + Sbjct: 31 RASGERVSAQHHLLWSVFADSEERKRDFLWREERDGS-----FLTLSARPPLQSDLFQPH 85 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKG-------NIKRCRVPLIKEAEQ 122 + K L G L F LRAN + ++ K + + R I + Sbjct: 86 RIKSYAPDLAPGARLEFLLRANATRMKRGGKREDVVKAPIDALEQSERAERRMEIASSAG 145 Query: 123 IAWLQRKLGNAARVEDVHPISER-----PQYFSGDGKSGKIQTVCFEGVLTINDAPALID 177 AWL+++ + + P+ + D + + + G L + D + Sbjct: 146 KAWLEQQGEKSGFRVITAIAEDYRQLSLPRLGAIDRNAMTLGILDLSGHLEMTDPALFLT 205 Query: 178 LVQQGIGPAKSMGCGLLSLAP 198 + QG G AKS GCGL+ + Sbjct: 206 NLAQGFGRAKSFGCGLMIIRR 226 >UniRef50_Q04QB6 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB6_LEPBJ Length = 266 Score = 145 bits (367), Expect = 6e-34, Method: Composition-based stats. Identities = 58/265 (21%), Positives = 91/265 (34%), Gaps = 67/265 (25%) Query: 1 MYLSKVIIAR---------AWSRDLYQLHQGLWHLFPNRPDAARD----FLFHVEKRNTP 47 M+LS++ + W ++ Y +HQ LW F + FLF ++ + P Sbjct: 1 MFLSQLKLDTHNTNNKIVFNWIQNPYNIHQRLWMAFSEYSSKDKPQNSPFLFQLDYNSDP 60 Query: 48 E--GCHVLLQSAQMPVSTAVATVI----------KTKQVEFQ-LQVGVPLYFRLRANPIK 94 +L+ S ++P + KQ+ +Q G L F L ANP K Sbjct: 61 GKISPRILVFSEKLPNWERAFQEFKVLTEIPVGNQIKQISPTFIQAGAVLRFSLTANPTK 120 Query: 95 TILDNQK-----------------------------------RLDSKGNIKRCRVPLIKE 119 + D + D +K RV + E Sbjct: 121 KLKDYRSLFQEELEGFPDKFDPSDRVSFLEGKSKLEDLKKTLTKDQIQKLKSKRVGIYHE 180 Query: 120 AEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSG------KIQTVCFEGVLTINDAP 173 E + WL +K + + + + + K KI TV F G+L I D Sbjct: 181 KELLNWLSKKGSDNGFSLLDAVVEFQSDFSANKIKGSLSPSIPKIHTVSFSGILKIMDPA 240 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAP 198 +GIG K+ GCG+L LA Sbjct: 241 LFKIAYTKGIGTGKAFGCGMLLLAR 265 >UniRef50_C2GEY9 Putative uncharacterized protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY9_9CORY Length = 225 Score = 145 bits (366), Expect = 8e-34, Method: Composition-based stats. Identities = 48/220 (21%), Positives = 83/220 (37%), Gaps = 23/220 (10%) Query: 2 YLSKVIIARAWSRDLYQL----------HQGLWHLFPN----RPDAARDFLFHVEKRNTP 47 +LSKV + H+ + LF + +P + LF +E T Sbjct: 6 FLSKVPLHSLLMESPGTTYHRIASPTFRHRAVMGLFEDVDSVKPREKLNVLFRLETP-TT 64 Query: 48 EGCHVLLQSAQMPVSTAV--ATVIKTKQV-EFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 E ++L+QSA P A+ + ++ K++ G P+ FR+ N I+ Sbjct: 65 ETPYLLIQSAVSPSDEALMNISGLQCKEIELKAPTSGTPVAFRIAVNAIRRTTITIDPHK 124 Query: 105 SKGNIKRCRVPLIKEAE--QIAWLQRKLGNAARVEDVHPISERP---QYFSGDGKSGKIQ 159 + +K + W+ KL A V ++ +Q Sbjct: 125 RRTLVKPVELDGTDSPNPTISEWIAAKLEPALTELSVTNHLREVITDPRTKKKPRTMTVQ 184 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +GV + D +L ++ GIG K+ GCGLL++ PL Sbjct: 185 VDTIDGVARVADPVSLEKILSDGIGREKAYGCGLLTIRPL 224 >UniRef50_B3ENH7 CRISPR-associated protein, Cse3 family n=3 Tax=Chlorobiaceae RepID=B3ENH7_CHLPB Length = 208 Score = 143 bits (362), Expect = 2e-33, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 71/204 (34%), Gaps = 24/204 (11%) Query: 7 IIARAWSRDLYQLHQGLWHLFPNRPDA-------ARDFLFHVEKRNTPEGCHVLLQSAQM 59 I D Y LH+ ++ LF +R FL+ +K G +L+ S + Sbjct: 12 EIKALRITDDYSLHRVVYSLFEDRRSEAEKNASIPSGFLY-ADKGGDSNGRLILMLSDRE 70 Query: 60 PVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKE 119 P +++K ++ + F + NP + + + R + + Sbjct: 71 PRKPEH-GRLESKPIDETFLMFDRYRFSVVINPSR-----------RESKSRKIIAIRDR 118 Query: 120 AEQIAWLQRKL----GNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPAL 175 E W +K G + + + + F G L + D Sbjct: 119 NEIAQWFSQKAPASWGFTVNPVTLEVRTLQAKQFVKKEHCVTQNGAELTGELDVVDRTLF 178 Query: 176 IDLVQQGIGPAKSMGCGLLSLAPL 199 I +QGIG ++ G GLL +APL Sbjct: 179 IKSFKQGIGRGRAFGFGLLQIAPL 202 >UniRef50_B0S4B5 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B5_FINM2 Length = 211 Score = 143 bits (361), Expect = 4e-33, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 83/216 (38%), Gaps = 26/216 (12%) Query: 1 MYLSKVIIARAWS------RDLYQLHQGLWHLFPNR--PDAARDFLFHVEKRNTPEGCHV 52 MYLS+V++ + +L H+ + FPN + L+ ++ N ++ Sbjct: 1 MYLSRVMLKDNQNYRNYVYTNLQYFHKWVEESFPNEFKENIRTRKLWRLDSFN--NKNYL 58 Query: 53 LLQSAQMPVST--------AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD 104 ++ S Q P A + Q + F+++ NP+ ++ R Sbjct: 59 VMLSEQKPDIEMFERNGIKGTAKITNYDQFLDDISENKLYRFKIKYNPVSSV---YVRNS 115 Query: 105 SKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR-VEDVHPISERPQYFSGDGKSGKIQTVCF 163 +G+ CR + ++I +L + V + I D + + Sbjct: 116 KRGDNFICR----NDEDKIKYLIDRSEKNGFEVLECTLIQSGYDKLVKDNQKAPVNKAVV 171 Query: 164 EGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 EGVL + D +++ G G K+ G GL+++ P+ Sbjct: 172 EGVLAVKDVDKFKEILINGFGKRKAYGYGLMTILPI 207 >UniRef50_A8LYZ8 CRISPR-associated protein, Cse3 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ8_SALAI Length = 206 Score = 142 bits (359), Expect = 5e-33, Method: Composition-based stats. Identities = 55/214 (25%), Positives = 87/214 (40%), Gaps = 28/214 (13%) Query: 2 YLSKVII------ARAWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCH 51 +L+++ + AR RD LH+ + L P+ +P LF ++ +T G Sbjct: 4 WLTRIALDLRHSAARRDLRDTTALHRRVMSLVPDGLGEQPRHHAGVLFRLD--HTTTGPM 61 Query: 52 VLLQSA------QMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDS 105 +L+Q+ ++P A + L G+ +++R+ AN K Sbjct: 62 LLVQTTLPPDPNRLPDGYAAVDTRDVSPLLKALTNGMAMHYRIAANASKRAW-------- 113 Query: 106 KGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEG 165 KGN V L + W QRK ++ H ++ G + FEG Sbjct: 114 KGNSAGKVVALSGQQA-EQWWQRKAEATG-LDLRHLRAQPQPAARGRAIPVRHAITLFEG 171 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 I DA + V GIG +S GCGLLSLAP+ Sbjct: 172 QAVITDADQVRAAVLAGIGRGRSFGCGLLSLAPM 205 >UniRef50_B5GY64 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY64_STRCL Length = 312 Score = 136 bits (344), Expect = 3e-31, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 85/303 (28%), Gaps = 106/303 (34%) Query: 2 YLSKVIIA------RAWSRDLYQLHQGLWHLF--------------------------PN 29 YLS++ I R +H + Sbjct: 3 YLSRIRINPLRAESRKLLASPRAMHGAVLGGVPGGAGAPGMPGVPGVPGVPGVSEGTEGE 62 Query: 30 RPDAARDFLFHVE------------KRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQ 77 + A L+ ++ P+ HV+ ++ A + + + Sbjct: 63 QAKGAPRVLWRLDADDPHRPQLYVLTPGRPDWSHVVERAGWPDADGEHAVIRDCAPLIER 122 Query: 78 LQVGVPLYFRLRANPIKTILDNQKRLDSK----------GNIKRCRVPLIKEAEQIAWLQ 127 L VG FRL ANP++T + ++ + R+ A Q+ W Sbjct: 123 LAVGQEYAFRLTANPVQTTATPVRPTSAQEKRIAERVEGERPRGFRLAHRTAAHQLNWFL 182 Query: 128 RKLGNAAR--------------------------------------------------VE 137 R+ V Sbjct: 183 RRTDGWGFAVPPSRTDPAAPGLDAASGLDAASGLDGASGGDGASGGDGGPDSVGARDPVR 242 Query: 138 DVHPISERPQYFSGDGKS--GKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLS 195 +V + + FS + + +EG+L + D L + GIGP+K+ GCGLL+ Sbjct: 243 EVRITARQRHTFSKGRRGTQVTFHSATYEGLLRVTDPELLAARLLGGIGPSKAYGCGLLT 302 Query: 196 LAP 198 LAP Sbjct: 303 LAP 305 >UniRef50_A8SDR6 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDR6_9FIRM Length = 195 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 70/197 (35%), Gaps = 23/197 (11%) Query: 13 SRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAV------- 65 +LH + F E ++LL S P + V Sbjct: 4 LAAPQKLHGAVESAFAGERRRRLWR-----LDRLGERLYLLLLSEDAPELSGVVEQFGTG 58 Query: 66 --ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQI 123 A + +++ G FRL ANP K+ D Q + Q Sbjct: 59 AAAETRSYDPLLQRVEPGSCWQFRLTANPTKSCKDTQ-----NPAARGTVAAHCTTQYQK 113 Query: 124 AWLQRKL---GNAARVEDVHPISERPQYFSGDG-KSGKIQTVCFEGVLTINDAPALIDLV 179 WL + G A R E + Q+F+ G + + V +EGVL + DA L+ Sbjct: 114 QWLLERAAKRGFALREEGFTVTRVQWQHFAKHGTRPVTLLAVTYEGVLQVTDAEQFRALL 173 Query: 180 QQGIGPAKSMGCGLLSL 196 QG+G K+ G GL+++ Sbjct: 174 CQGMGRGKAYGLGLMTV 190 >UniRef50_UPI0001B51C2A CRISPR-associated protein, Cse3 family n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2A Length = 252 Score = 134 bits (337), Expect = 2e-30, Method: Composition-based stats. Identities = 47/213 (22%), Positives = 73/213 (34%), Gaps = 31/213 (14%) Query: 13 SRDLYQLHQGLWHLF----PNRPDAAR---DFLFHVEKRNTPEGCHVLLQSAQMPVSTAV 65 D + +H+ + F P+ DA R L + +++QS P TA+ Sbjct: 26 LMDAHHMHRIVMGGFKGWVPDGADAPRAQVGVLSTWSADLATQTLLIIVQSRVRPDWTAI 85 Query: 66 A-----TVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 I + V+ + +G FR +P KT D L KG R+P + A Sbjct: 86 PRAALCAPIDVRAVDETISIGDRFTFRTVVSPTKTRAD----LKQKGKPVIKRLPHVLPA 141 Query: 121 EQIAWLQRKLGNAAR---------------VEDVHPISERPQYFSGDGKSGKIQTVCFEG 165 W + +L A I P + K KI G Sbjct: 142 HVRTWFEDRLQPAGTPATALSGIPRLGADAERTTLAIRMLPPVSTDHHKGLKITRAEIRG 201 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 LT+ D + G+G A++ CGL+ P Sbjct: 202 TLTVTDPATFTKTITTGLGRARAYSCGLILTRP 234 >UniRef50_C1XG03 CRISPR-associated protein, Cse3 family n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XG03_MEIRU Length = 156 Score = 134 bits (337), Expect = 2e-30, Method: Composition-based stats. Identities = 52/160 (32%), Positives = 73/160 (45%), Gaps = 16/160 (10%) Query: 52 VLLQSAQMPVSTAV--------ATVIKTKQV-EFQLQVGVPLYFRLRANPIKTILDNQKR 102 +L+QSA MP + A +K + LQ L FRLRANP T D Sbjct: 1 MLVQSAGMPDWEKLVQRFPGYFAQPPASKPIPLEHLQPAQVLRFRLRANPTVTKKDPNNP 60 Query: 103 LDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAAR--VEDVHPISERPQYFSGDGK-SGKIQ 159 + KR R L EQ+ WL R+ + + SER + + DG +Q Sbjct: 61 ----DSKKRKRHGLKTLEEQLEWLHRQGAKGGFSVLGAMVVQSERVRMYKHDGSGPIVLQ 116 Query: 160 TVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 +V +EG L I D A + G+G AK++G GLLS+A + Sbjct: 117 SVLYEGHLKITDLEAFKHTLAAGLGHAKALGFGLLSIAKV 156 >UniRef50_B6IWM2 CRISPR-associated protein, CT1974 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM2_RHOCS Length = 262 Score = 132 bits (333), Expect = 5e-30, Method: Composition-based stats. Identities = 50/211 (23%), Positives = 77/211 (36%), Gaps = 28/211 (13%) Query: 13 SRDLYQLHQGLWHLFPNRPDAARD--FLFHVEKRNTPEGCHVLLQSAQMPVST-AVATVI 69 +D H+ LW LFP+RP A R+ FLFHVE +++S P I Sbjct: 53 RQDGQFAHRMLWTLFPDRPTARREGLFLFHVE---GTRPFSAIVRSRVPPEDGLGGIWTI 109 Query: 70 KTKQVEFQLQVGVPLYFRLRANPIKTILDNQK-------------RLDSKGNIKRCRVPL 116 T+ + L G+ L F LRA + + R + + Sbjct: 110 TTRPFDPALAPGLTLRFHLRAVASRWQPRPGERRGRRQDVIVAAWRDLPEEQRTPENLEK 169 Query: 117 IKEAEQIAWLQRKLGNAARVEDV---------HPISERPQYFSGDGKSGKIQTVCFEGVL 167 E + WL R+ G +S + V +EG+L Sbjct: 170 TAEHAALDWLARQGRRGGFAPVEGAVDVLDYDRASLRAGAKLGGRDRSIRFGAVTYEGLL 229 Query: 168 TINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 T+ D A + QG+G ++ G GL+ +AP Sbjct: 230 TVTDPQAFRATLVQGLGAGRAYGNGLMQIAP 260 >UniRef50_Q47PJ5 CRISPR-associated protein, Cse3 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ5_THEFY Length = 232 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 53/215 (24%), Positives = 79/215 (36%), Gaps = 24/215 (11%) Query: 5 KVIIARAWSRDLYQLHQGLWHLFPN-------RPDAARDFLFHVEKRNTPEGCHVLLQS- 56 + RA R LH+ L L + P LF +E T ++L+QS Sbjct: 12 RYRQTRADFRTAGNLHRKLIRLSSDLGEERIANPRQQSGLLFRIE--ETRNELYLLVQSH 69 Query: 57 -----AQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILD---NQKRLDSKGN 108 ++ + +L G + +R+ A+P K + N +RL K Sbjct: 70 SPLRVDRLGPGYHGVQMRNLDPFLARLDKGSRVRYRIVASPTKRLGRSENNTQRLGLKEP 129 Query: 109 IKRCR---VPLIKEAEQIAWLQRKLGNAARVEDV---HPISERPQYFSGDGKSGKIQTVC 162 K+ R L A + W R N + R + + + V Sbjct: 130 PKKPREYTWALRGAAAEEWWHSRAAANGLELLSTYAQTLDDVRDPGTADRSRKIRHPAVR 189 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 F+G I+D A+ V GIG KS GCGLLSLA Sbjct: 190 FDGEAVISDVDAVRHAVLNGIGRGKSYGCGLLSLA 224 >UniRef50_C2KP48 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP48_9ACTO Length = 212 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 51/197 (25%), Positives = 80/197 (40%), Gaps = 23/197 (11%) Query: 20 HQGLWHLFPN-----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQV 74 H+ + LFP P AA LF +E ++QS P + ++ Sbjct: 22 HRAVMDLFPEFEGEQNPRAAASILFRLETLPGL-APRFVVQSDISPAVDKLPKGVEPLGY 80 Query: 75 E-FQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIA-----WLQR 128 +L G P+ FRL NP+ + D + P KE + A WL + Sbjct: 81 TFPELGEGTPVSFRLAVNPVIRHSQGK---DGQPARTTTVAPFGKEPAESAASLETWLSQ 137 Query: 129 KLGNAARVEDVHPISERPQYFSGD------GKSGKIQTVCFEGVLTINDAPALIDLVQQG 182 KL + +V+ I+ + + K +I +GV + DA L +++ G Sbjct: 138 KLSPG--LAEVNIINAQREIIGDGYPNQDISKIKRIVIDLVDGVACVGDAKTLNKMLRSG 195 Query: 183 IGPAKSMGCGLLSLAPL 199 +G AKS GCGLLS+ L Sbjct: 196 VGRAKSYGCGLLSVKQL 212 >UniRef50_C7MQD7 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD7_SACVD Length = 197 Score = 126 bits (316), Expect = 5e-28, Method: Composition-based stats. Identities = 49/206 (23%), Positives = 80/206 (38%), Gaps = 21/206 (10%) Query: 2 YLSKVIIARAWSRDLYQLHQGLWHLF-PNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMP 60 YL+K+ ++ +RD+++ H+ L P + R G +L QSA Sbjct: 4 YLTKITTPKSVTRDIHRTHKILTTAVCPPNITTPGRVATRLLHRVERGGREILAQSATPL 63 Query: 61 VSTAV------ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRV 114 T + A + L G + +++ ANP+ R R Sbjct: 64 DPTRLEGGCVIAGTKLLDPLLDHLDNGTVVRYKITANPVHA-------------PNRVRR 110 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFS-GDGKSGKIQTVCFEGVLTINDAP 173 P+ +AW R V D + + + + +QT EGV TI D Sbjct: 111 PITDPDRILAWWHRTADRIGLVLDSTALLDTAKTSGMRRDQRVVVQTATMEGVATIRDVD 170 Query: 174 ALIDLVQQGIGPAKSMGCGLLSLAPL 199 + D + G+G A++ GCGLLS+ PL Sbjct: 171 TVRDAIVLGVGHARAYGCGLLSVVPL 196 >UniRef50_C5V9N5 CRISPR-associated protein, Cse3 family n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N5_9CORY Length = 220 Score = 125 bits (315), Expect = 7e-28, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 85/216 (39%), Gaps = 22/216 (10%) Query: 2 YLSKVIIARAWSRDLYQL-----------HQGLWHLFPNRPDAARD----FLFHVEKRNT 46 YL+K + A +R + H+ + LFP+ D LF E Sbjct: 6 YLTKFPVHVALARKPEKTQRWRVDDPEFRHRAVMGLFPDFEDNQARSRNNILFRYEFIPG 65 Query: 47 PEGCHVLLQSAQMPVSTAVATVIKTKQVE-FQLQVGVPLYFRLRANPIKTIL---DNQKR 102 + + L+QS V+ + VI+TKQVE + G P+ FRL N + + +KR Sbjct: 66 -QAPYFLVQSDCDVVAPDLEGVIETKQVEYPSYENGTPIIFRLALNTVTRRTIETNGRKR 124 Query: 103 LDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC 162 + + KL A + + + + ++ +Q Sbjct: 125 EVITPVALQPLDAETGLNPAEKHVAYKLSTALQGIEFLNHNRQVLQVPKVSRA--LQIDT 182 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 F+ + + ++ AL ++ GIG AK+ GCGLL+ Sbjct: 183 FDCMGVVTNSQALEHIMHAGIGRAKAYGCGLLTARR 218 >UniRef50_C7MTL5 CRISPR-associated protein, Cse3 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTL5_SACVD Length = 207 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 45/208 (21%), Positives = 80/208 (38%), Gaps = 28/208 (13%) Query: 11 AWSRDLYQLHQGLWHLFPN----RPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVA 66 + D LH+ + L P+ + + LF E +T G VL Q + P +A Sbjct: 4 RGTLDGGALHRDIMRLAPDALGNQARKEANVLFRAE--HTQRGLQVLAQLSCAPRVDNLA 61 Query: 67 -----TVIKTKQVEF---QLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIK 118 + + +E + G + +R+ ANP K + ++ K+ R+ ++ Sbjct: 62 PDFAHGTPECRNIESLVSSMHSGTRVRYRIDANPTKRLGNSA-------GDKKGRLAVLH 114 Query: 119 EAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGK-------SGKIQTVCFEGVLTIND 171 A+ W R+ + S P + + FEG + D Sbjct: 115 GADAAEWWHRRAAESGLELLSATASAMPDILGSRNRDRRGRCRATSHGVTRFEGFAVVAD 174 Query: 172 APALIDLVQQGIGPAKSMGCGLLSLAPL 199 + V +GIG A++ GCGLLS+ P+ Sbjct: 175 PGKVRSAVVEGIGRARTYGCGLLSIVPV 202 >UniRef50_Q6NEQ5 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ5_CORDI Length = 228 Score = 123 bits (308), Expect = 5e-27, Method: Composition-based stats. Identities = 45/206 (21%), Positives = 75/206 (36%), Gaps = 28/206 (13%) Query: 11 AWSRDLYQLHQGLWHLFPNR----PDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVA 66 D H+ + LFP+ P + D LF E+ + L+QS P Sbjct: 28 WELTDPSFRHRAVMALFPDTDSPLPRKSVDILFRFEQLAG-QPPFFLIQSTVAPKQVDNL 86 Query: 67 T------VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA 120 + ++ + + FR+ N I + + I VP + Sbjct: 87 DSEVQHRTVSLRRFSPK----SAVRFRISIN---GIRRQTTEHNGRKRITTSPVPFDSDE 139 Query: 121 EQ-------IAWLQRKLGNAARVEDVHPISERP---QYFSGDGKSGKIQTVCFEGVLTIN 170 + W+Q+KL A R ++ ++ G S IQ +G + Sbjct: 140 KAPSHITRMTPWVQKKLNGALRNVEILNHQREVIGTKHRGGKAASMTIQIDTVDGFGIVE 199 Query: 171 DAPALIDLVQQGIGPAKSMGCGLLSL 196 D L +L+ G+G AK+ GCGLLS+ Sbjct: 200 DPELLNELILHGVGRAKAYGCGLLSV 225 >UniRef50_A8ZZ18 CRISPR-associated protein, Cse3 family n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZ18_DESOH Length = 273 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 52/278 (18%), Positives = 93/278 (33%), Gaps = 91/278 (32%) Query: 8 IARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVE---------KRNTPEGCHVLLQSAQ 58 +A+ + +Y +H+ LW LFP + R+FL+ E E + L+ S+ Sbjct: 1 MAKVLADSVYNIHRLLWDLFPGQ--KQRNFLYREEIAREQLGYQGGARGESLYYLVSSSA 58 Query: 59 MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPL-- 116 S + ++T++ E QLQ L F LRANP+ T N K+ D + ++ + L Sbjct: 59 P-SSQSPFFAVETRRYEPQLQPDEALRFELRANPVVTK--NGKKHDVVMDAQQTFLKLLC 115 Query: 117 ---------------------------------------------------IKEAEQIAW 125 + E++ W Sbjct: 116 EELGLLSHLQGTPEKKEYKNVLLTHGGQRLDSRLTDLLDGDYRYAERLDQKLTPREKLEW 175 Query: 126 -------------LQRKLGNAARV--------EDVHPISERPQYFSG---DGKSGKIQTV 161 + ++ + + + +G GK V Sbjct: 176 ALRAEIDNTLDEWMAKQGKQNGFTIVKDTHGNLKLQNSAYQWHALTGKAAKGKKSGFSAV 235 Query: 162 CFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 F G L ++D A + GIG +K+ GCGL+ + + Sbjct: 236 DFTGDLVVSDVEAFKKSLFNGIGRSKAFGCGLMLVKRI 273 >UniRef50_C4ZJY2 CRISPR-associated protein, Cse3 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY2_THASP Length = 238 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 59/202 (29%), Gaps = 20/202 (9%) Query: 17 YQLHQGLWHLFPNRPDAARDFL-----FHVEKRNTPEGCHVLLQSAQMPVSTAV-ATVIK 70 Y LH L F + + H Q A V + Sbjct: 33 YALHTLLAAAFGDLAPKPFRHFGDVRGLLAYSGQGADRIHTAAQMAAPDVHAVLGLERFA 92 Query: 71 TKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVP---LIKEAEQIAWLQ 127 + G L F LR P+ D ++R ++ V L +EA + WLQ Sbjct: 93 ARSFPTDWAAGRRLGFELRVRPVLRTKDGRERDVFLSQAEKRGVAEKELSREAVYLEWLQ 152 Query: 128 RKLGNA-----------ARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 R+L + F G LT+ D Sbjct: 153 RELARGDAANVDRAQLDGFRLTSSLRKGSAVVGRRPAQRVTGPDALFSGELTVRDPAGFA 212 Query: 177 DLVQQGIGPAKSMGCGLLSLAP 198 L+ +G+G ++ G G+L L P Sbjct: 213 ALIARGVGRHRAFGFGMLLLRP 234 >UniRef50_C2CRP4 Putative uncharacterized protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CRP4_CORST Length = 185 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 70/200 (35%), Gaps = 22/200 (11%) Query: 1 MYLSKVIIAR--AWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ 58 M + + + R + D +H+ L H + L+ +P+ H++++ Sbjct: 1 MLTTTLSLNRTTRIAFDSQAVHRTLLHA-----TDGKPVLW-----ASPDTKHLVVRHET 50 Query: 59 MPVSTAVATVIKTKQVEFQ--LQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPL 116 + + + L NP IL + + +G Sbjct: 51 PVDWIKAIRGVTQAVTLPTQIPAASARINYALIGNP---ILSQYQGPNKRGKKTPAP--- 104 Query: 117 IKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 + WLQR++GNA + + P + F G T+ D AL Sbjct: 105 --PEKWNEWLQRRVGNALNLHSIDGTRLPPAKGKKPDMQTIHHRILFTGRATVKDQDALQ 162 Query: 177 DLVQQGIGPAKSMGCGLLSL 196 L++ GIG K+ GCGLL + Sbjct: 163 TLMESGIGSGKAYGCGLLIV 182 >UniRef50_Q0BRF7 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF7_GRABC Length = 247 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 68/209 (32%), Gaps = 33/209 (15%) Query: 19 LHQGLWHLFPNRPDAARDFL------FHVEKRNTPEGCHVLLQSA--QMPVSTAVATVIK 70 LH L LF + + + + ++ Q+ P T V ++ + Sbjct: 35 LHHLLTQLFGRQMLQPFRVFTPEQANWSLYAYANQDATTLVEQARFSITPDMTEVISLER 94 Query: 71 TKQV-EFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEA--------- 120 + + G + F +R P++ + K+ D + + R + EA Sbjct: 95 LRSKAMPDAKPGQRIGFDVRIRPVRR---SAKQHDQESEKMQERDAFLAEALHNHADDKT 151 Query: 121 -----------EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTI 169 WL + A +E + + +G + G + I Sbjct: 152 GMKSANRTREMVYREWLAER-MPWATLETARLAHFQRRRVLRNGNGIEGPDATIHGTMII 210 Query: 170 NDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 D + +++GIG + G G++ L P Sbjct: 211 GDPAQFSEALRKGIGRHSAYGYGMMMLRP 239 >UniRef50_Q60AD3 CRISPR-associated protein, CT1974 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD3_METCA Length = 239 Score = 109 bits (274), Expect = 4e-23, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 68/202 (33%), Gaps = 20/202 (9%) Query: 17 YQLHQGLWHLFPNRPDAARDFLF--------HVEKRNTPEGCHVLL-----QSAQMPVST 63 Y H L L L + + +G + S Sbjct: 36 YATHAWLKALCGELAPKPFRLLQDGRNLRPPRLLGFSAHDGTRLTEHARAFASPLAAQVC 95 Query: 64 AVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK---RLDSKGNIKRCRVPLIKEA 120 ++A I K + G L F + A PI + N+ R + R + P +E Sbjct: 96 SLADGIAFKPMPESWPNGRKLGFEVMACPISRLGRNEDDVYRRHLRDCDARAQSPDSREM 155 Query: 121 EQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQT----VCFEGVLTINDAPALI 176 WL R+ G+AA ++D R + + F G L++ D Sbjct: 156 VYRRWLTRQFGSAATLDDFSLDGFRYLRLLRKARGTRSGFLAPQALFRGTLSVRDGAGFG 215 Query: 177 DLVQQGIGPAKSMGCGLLSLAP 198 L+ +GIG ++ G G+L L P Sbjct: 216 ALLARGIGRHRAFGFGMLLLRP 237 >UniRef50_C7RP63 CRISPR-associated protein, Cse3 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP63_9PROT Length = 245 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 65/213 (30%), Gaps = 35/213 (16%) Query: 17 YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHV--------LLQSAQMPVSTAVATV 68 Y LH L F + FH + L S P + Sbjct: 33 YALHALLSEAFGDLAPKP----FHYLGGRQGLLAYTAADLEMLRLNASLAPPDVARALGL 88 Query: 69 --IKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKR----LDSKGNIKRCRVPLIKEAEQ 122 + + + G L F R P+ D ++R +G + +V + + Sbjct: 89 DHLDARPFPTAWRTGQRLGFEARVRPVVRGKDGRERDAYLHAVEGTVDTGQVGVDGSIAE 148 Query: 123 I-----AWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKI------------QTVCFEG 165 WL + + + + K+G V F+G Sbjct: 149 RTAIYSDWLAAQFAFDGAAQIAEAHLDSFRLTRVLRKAGSGENGKRKTTNNAGPDVVFKG 208 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 L + D PA L+ +GIG ++ G G+L L P Sbjct: 209 HLQVRDPPAFNRLLGRGIGRHRAFGFGMLLLRP 241 >UniRef50_Q3A5Z3 CRISPR-associated protein, Cse3 family n=2 Tax=Desulfuromonadales RepID=Q3A5Z3_PELCD Length = 299 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 44/182 (24%), Positives = 67/182 (36%), Gaps = 24/182 (13%) Query: 1 MYLSKVIIARAWSR----------DLYQLHQGLWHLFPNRPDAARDFLFHVE-------- 42 MY S+V + R + Y LHQ LW LFP + R FLF E Sbjct: 1 MYFSRVQLQPEVQRSSQLSQVLTSNSYGLHQLLWDLFP--AEEKRSFLFREEIAKEQLKN 58 Query: 43 KRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQK- 101 +R T + S P + +++K + G F+LRANPI K Sbjct: 59 QRRTKGESLFYIVSRHDPQTETPIFRVESKVYAPVISQGQQFAFKLRANPIVAKKKPGKK 118 Query: 102 ---RLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKI 158 R D N +R + + I +QR+ + R + + + F + I Sbjct: 119 NSVRHDVVMNAQRRLLEELASCLGILDVQRQKKSVLRHRILTAWKDGEKRFCSERLREDI 178 Query: 159 QT 160 +T Sbjct: 179 RT 180 >UniRef50_C0W6T9 Possible CRISPR-associated protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6T9_9ACTO Length = 129 Score = 105 bits (262), Expect = 9e-22, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 45/134 (33%), Gaps = 25/134 (18%) Query: 84 LYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARV------- 136 FRL ANP + I ++ R + +Q WL + Sbjct: 2 WAFRLAANPSRAISQGI-------GVRGKRQGHVTLEQQRQWLLSRAAAHGFRMLPVNGA 54 Query: 137 -----EDVHPISERPQYFSG------DGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGP 185 + + F I FEG+L + D L + GIG Sbjct: 55 AESVGSSLTVVRRARPVFGRSNPEQGRRDRVTINRTVFEGLLQVTDPDLLRTALISGIGR 114 Query: 186 AKSMGCGLLSLAPL 199 +K+ GCGL++LA + Sbjct: 115 SKAYGCGLMTLAKV 128 >UniRef50_D0MET7 CRISPR-associated protein, Cse3 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET7_RHOM4 Length = 265 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 47/226 (20%), Positives = 79/226 (34%), Gaps = 36/226 (15%) Query: 9 ARAWSRDLYQLHQGLWHLF--------PNRPDAARDFLFHVEKRNTPEGCHVLLQSAQ-- 58 AR Y +H L LF + R V + + Sbjct: 39 ARRPVSLSYLVHCALGELFQAQAPRPFAVEGENRRGPWVRVLGYADVPWETLQELARGFA 98 Query: 59 MPVSTAV--ATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVP- 115 P A+ +K + ++ G+ L F +R P+ + + K + Sbjct: 99 SPAVYAICGWDRGASKPMPTEIPRGMRLAFSVRVCPVVRKASAGQSPRGRRWQKGQELDV 158 Query: 116 ------------LIKEAEQIAWLQRKLGN----AARVEDVH----PISERPQYFSGDGKS 155 L +EA WL+R++ ARVE V I + +G +S Sbjct: 159 FLDAAWSQPEAVLDREAVYAEWLRRQMARPEKGGARVETVRMTRFSIERMTRRTNGSSRS 218 Query: 156 GKI---QTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 + V EGVLT+ D+ A + ++++G+G S G G+L L Sbjct: 219 VTVIQRPDVTLEGVLTVTDSAAFMRMLRRGVGRHTSFGYGMLKLRR 264 >UniRef50_B4UE72 CRISPR-associated CT1974 family protein n=2 Tax=Anaeromyxobacter RepID=B4UE72_ANASK Length = 243 Score = 99.6 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 67/213 (31%), Gaps = 36/213 (16%) Query: 17 YQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQM-------------PVST 63 Y LH L +F P++ F + VL S + P Sbjct: 34 YLLHCQLREMFG--PESPTPFAVR---DGSGRSVAVLAYSTRAAAELQRHAQQFAQPDVY 88 Query: 64 AVAT--VIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIK-------RCRV 114 K + Q + G + F +R P+ + + R + + R Sbjct: 89 GTCDWNAFDEKPMPGQWRTGERVGFEVRCCPVVRMSGDGPRWRAGAEVDAFLARCWRTEG 148 Query: 115 PLIKEAEQIAWLQRKLGNAARVEDV---HPISERPQYFSGDGKSGKIQT------VCFEG 165 + +EA WL +LG V +R D + + F G Sbjct: 149 TVEREAVYREWLADELGRRGGARIVSARVLGHQRAHLVRRDHRPERKAIGGERPEAVFSG 208 Query: 166 VLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 L + D A L+ +G+G + G G+L L P Sbjct: 209 ELDVTDPEAFAALLARGVGRHRGFGFGMLLLRP 241 >UniRef50_A8LMM7 CRISPR-associated protein n=2 Tax=Alphaproteobacteria RepID=A8LMM7_DINSH Length = 263 Score = 95.7 bits (237), Expect = 7e-19, Method: Composition-based stats. Identities = 48/256 (18%), Positives = 81/256 (31%), Gaps = 61/256 (23%) Query: 1 MYLSKVIIA----------------RAWSRDLY-QLHQGLWHLFPN----------RPDA 33 +YL+++ + R + D LH L F P A Sbjct: 3 LYLARLPVDLPALARAAGERGWTRGRRAAFDEGRALHHLLAETFGPGALQPFRLVVAPRA 62 Query: 34 ARDFLFHVEKRNTPEGCHVLLQSAQMPVSTA-----VATVIKTKQVEFQLQVGVPLYFRL 88 L+ + + +A +P+S A I+TK + G L F + Sbjct: 63 KSGTLW---AYTDVDATALREIAAPVPLSEAMVTALSPDRIETKPMPELAVPGRRLGFDI 119 Query: 89 RANPIKT----ILDNQKRLDSKGN----------------IKRCRVPLIKEAEQIA---- 124 R P+ I R + + + R + Sbjct: 120 RLRPVVRLASAIPAPADRAAGRDHGFKAGAEVDAFLAEALRQPDREAMHTAERSRETVYA 179 Query: 125 -WLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQT-VCFEGVLTINDAPALIDLVQQG 182 WL + G AA +E V + R + + G G LT+ DA A + + +G Sbjct: 180 AWLADRFGPAAELEQVTLAAFRRSFAARKDGRGCEGPDATLHGTLTVGDAKAFAERLHRG 239 Query: 183 IGPAKSMGCGLLSLAP 198 +G K+ G G+L + P Sbjct: 240 VGRHKAYGYGMLLIRP 255 >UniRef50_D1Y485 Crispr-associated family protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y485_9BACT Length = 254 Score = 92.6 bits (229), Expect = 8e-18, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 68/234 (29%), Gaps = 48/234 (20%) Query: 9 ARAWSRDL-YQLHQGLWHLFPNRPDAARDFLFHVEKRN----TPEGCHVLLQSAQ----- 58 R D Y +H +F + FL E+ TP L +S + Sbjct: 21 QRRLGDDPGYLVHAATRKIFGEL--GPQPFLVQSERSRVLGYTPADETFLRRSLENFRSD 78 Query: 59 ------MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTIL--------------- 97 +P + + ++ + Q G F + P Sbjct: 79 EGSDSLLPAVFNLPEIC-SRVMPEQWSKGSRYRFSVYCRPTIRRGKVESDVWLMKNYFAC 137 Query: 98 -DNQKRLDSKGNIKRCRVPLIKEAE--QIAWLQRKLGNAARVEDVHPISERPQYFSGDGK 154 + + G I R E E WLQR+ AA + DV R Y + Sbjct: 138 EEARGNGTFDGTIHEFRQLHKGEIEGTYRQWLQRRFVPAAELRDVVITGSRSSYLTTRSA 197 Query: 155 SGKIQT-----------VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 F G L + + A LV+ G+G + G G+L L Sbjct: 198 KDHCGAPTHSERRSYPETTFVGELCVTEPQAFERLVRHGVGRHCAFGFGMLLLK 251 >UniRef50_C1YTK2 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YTK2_NOCDA Length = 221 Score = 84.9 bits (209), Expect = 1e-15, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 58/182 (31%), Gaps = 22/182 (12%) Query: 1 MYLSKVIIA--RAWSRDLYQ-LHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSA 57 +YLS++ + R+ D + + Q + P++ L+ +++ S Sbjct: 34 LYLSRINLDPKRSARMDQWAVMGQAVRRAVDPDPESDARVLW-----ARTSPSTLVVSSD 88 Query: 58 QMPVSTAVATVIKTK-QVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPL 116 P V + G + + L P ++ KR +P Sbjct: 89 TAPAWGKVPGATSAAIHPMPRYSEGETVRWELITAPTAPRGAGAAGEGARPRGKRAPLP- 147 Query: 117 IKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALI 176 E E WL K A + S R ++ G F G + D+ AL Sbjct: 148 --EEEFEGWLDVKFSGA-----LDVTSVRWKHLGGRPARYH-----FTGEAVVRDSEALQ 195 Query: 177 DL 178 +L Sbjct: 196 EL 197 >UniRef50_B2N0R4 Putative uncharacterized protein n=1 Tax=Escherichia coli 53638 RepID=B2N0R4_ECOLX Length = 58 Score = 80.7 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 48/51 (94%), Positives = 51/51 (100%) Query: 149 FSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL 199 FSG+GK+GKIQTVCFEGVLTINDAPALIDL+QQGIGPAKSMGCGLLSLAPL Sbjct: 8 FSGEGKNGKIQTVCFEGVLTINDAPALIDLLQQGIGPAKSMGCGLLSLAPL 58 >UniRef50_C9M9R8 CRISPR-associated protein, CT1974 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R8_9BACT Length = 262 Score = 80.3 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 66/233 (28%), Gaps = 58/233 (24%) Query: 17 YQLH---QGLWHLFPNRPDAARDFLFHVEKR-----NTPEGCHVLLQ----SAQMPVSTA 64 Y +H + LW A + F++ EK +G + S + Sbjct: 31 YTIHAATRALWG-----EIAPQPFVWQEEKGQILGYAASDGETLREVRNSCSREDAELLR 85 Query: 65 VATVIK---TKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLD---------------SK 106 A + TK++ G F++ PI+ K Sbjct: 86 KAFCLPEFCTKKMPEVFPAGQKFNFQVLCCPIRRQTSPTSGKKCQSDAWLGAVYNLYRGK 145 Query: 107 GNIKRCRVPLI-------------KEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDG 153 G + P + E WL + + + + + Sbjct: 146 GGAEGTGCPTVISFYRQHPDECPSPETVYKEWLTEQFARSGGARVLFSNIKGSRTIRVAR 205 Query: 154 KSGKIQTVC----------FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSL 196 + G V F G L + + A ++ +G+G ++ G G+L L Sbjct: 206 RPGSGAPVLQAKRSTPEVLFRGCLQVENQDAFSQILARGVGRHRAFGFGMLLL 258 >UniRef50_C1XXW2 CRISPR associated protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XXW2_9DEIN Length = 116 Score = 75.7 bits (185), Expect = 8e-13, Method: Composition-based stats. Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 18/100 (18%) Query: 1 MYLSKVII------ARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLL 54 MYLS++++ AR + YQ+H L H F FL+ E+ TP VL+ Sbjct: 1 MYLSRLLLDPRHKQARTDLANPYQMHATLCHAFAEPEQTPPRFLWRAEEGKTP---TVLV 57 Query: 55 QSAQMPVSTAVA--------TVIKTKQV-EFQLQVGVPLY 85 QS + P + ++K + LQ G L Sbjct: 58 QSIETPNWEKLTQRFPGYFSQRPESKPIPLEHLQSGQVLR 97 >UniRef50_B4V4N6 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V4N6_9ACTO Length = 114 Score = 71.1 bits (173), Expect = 2e-11, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 32/112 (28%), Gaps = 16/112 (14%) Query: 102 RLDSKGNIKRCRVPLIKEAEQIAWLQR----------------KLGNAARVEDVHPISER 145 R + VP W R ++G A + Sbjct: 3 RAQVLESPSGHPVPHSTPDHVKNWFVRCLQAEDEPATGEGGVARVGATADPAALGVRMLP 62 Query: 146 PQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLA 197 K +I G LT+ D L+ + G+G A++ CGL+ Sbjct: 63 TVSSPAPHKGLRIARAEIRGSLTVTDPETLVTALSNGLGHARAYSCGLILTR 114 >UniRef50_C6NY67 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NY67_9GAMM Length = 129 Score = 70.7 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 2/86 (2%) Query: 109 IKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLT 168 + R ++K E WL L V E + G+ + F Sbjct: 42 RREHRERVVKPREFRGWLASLLERHGWVLRSIEKVESMEMTIRHGRRLTVVDTVF--TAQ 99 Query: 169 INDAPALIDLVQQGIGPAKSMGCGLL 194 + D + GIG K+ GCG+L Sbjct: 100 VVDRENADQSYRSGIGRYKAFGCGML 125 >UniRef50_D1BYL2 Putative uncharacterized protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BYL2_XYLCX Length = 225 Score = 50.7 bits (120), Expect = 3e-05, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 69/216 (31%), Gaps = 30/216 (13%) Query: 7 IIARAWSRDLYQLHQGLWHL-------FPNRPDAARDFLFHVEK-RNTPEGCHVLLQSAQ 58 +A A D H+ L A L+ V + +L++S+ Sbjct: 14 TMATALLSDRTTGHRMTMQLWDQIESTVHRGARAHVGCLWRVTGIDPVAQTGTLLVRSST 73 Query: 59 MPVSTAVATVIKTKQVEFQLQVGVPLYFRLRAN---------PIKTILDNQKRLDSKGNI 109 P + + V + G + + P++ + + D Sbjct: 74 APTRKVPWAIQQDAAVTELPETGATVDLTVTIAAMYTPMYDVPVEWRENLKAGADGTARP 133 Query: 110 -------KRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVC 162 + +VP+ + Q W KL DV + G + +V Sbjct: 134 PGEGLSYRSKQVPVPSDRLQ-EWSVTKLKRLGVDGDVVAHAAPVVRIKGALVATAHLSVT 192 Query: 163 FEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAP 198 G T+ND L V+ GIG +S G GL+++ P Sbjct: 193 --G-ATVND--GLEQCVRTGIGKGRSYGLGLVAVTP 223 >UniRef50_Q21QB0 Putative uncharacterized protein n=1 Tax=Rhodoferax ferrireducens T118 RepID=Q21QB0_RHOFD Length = 180 Score = 48.7 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 60/188 (31%), Gaps = 37/188 (19%) Query: 14 RDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQ 73 DLY++HQ +W ++ F E +G + ++ + + Sbjct: 21 TDLYRIHQLVWQHVARAVESQGRFA-RPEFIYRIDGGMIRVR-------GNLPKNKTS-- 70 Query: 74 VEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNA 133 + P++ L A G+ VP EA W K+ +A Sbjct: 71 -VSAFRANAPVHLDLAA--------------VWGSEHENAVP---EAHLADWCAEKIESA 112 Query: 134 AR-VEDVHPISERPQYFSGD----GKSGKIQTVCFEGVLTINDAPALIDLV--QQGIGPA 186 V + + + + ++ +I T+ L + +QGIG Sbjct: 113 GFKVASLAVTNFQYRCGVKHATDNRQNIRIPVASV--TTTVTAGDTLACALTWRQGIGRG 170 Query: 187 KSMGCGLL 194 K G G+L Sbjct: 171 KRFGLGML 178 >UniRef50_C5B4T7 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B4T7_METEA Length = 225 Score = 45.3 bits (106), Expect = 0.001, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 47/141 (33%), Gaps = 14/141 (9%) Query: 63 TAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKG--NIKRCRVPLIKEA 120 + A+ I+ V ++ + G + L P + I R + R L A Sbjct: 76 SRFASEIEVGFVPYEARKGDEVLLDLIVAPTQKIELPGGRFREVDVAEAAKDRGAL---A 132 Query: 121 EQIAWLQRKLG-------NAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGV--LTIND 171 WL+ + V +R + GK+ I+ + + I + Sbjct: 133 VYADWLKLQFSKPDTGCLPIGIPSFVETSQDRGVRKAIGGKTTTIRFPRVHALQKVRIVN 192 Query: 172 APALIDLVQQGIGPAKSMGCG 192 A ++ G+G ++ G G Sbjct: 193 QRAFERMLVAGLGRQRAFGYG 213 >UniRef50_C1MF15 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MF15_9ENTR Length = 199 Score = 42.9 bits (100), Expect = 0.007, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 27/103 (26%), Gaps = 3/103 (2%) Query: 95 TILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISER---PQYFSG 151 ++ ++ + C L+R R+ V P Sbjct: 90 SLATIRREEAGGRKREICPPAHELPDYIHYHLERAGVICGRLRIVRSTKFHIIKPGSVGR 149 Query: 152 DGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLL 194 + ++ I D AL GIG + G G + Sbjct: 150 TSHKIIVPMSQYDAECVIEDVGALEQAYAFGIGRKRIFGFGYM 192 >UniRef50_C4XCX4 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae NTUH-K2044 RepID=C4XCX4_KLEPN Length = 188 Score = 42.2 bits (98), Expect = 0.010, Method: Composition-based stats. Identities = 23/122 (18%), Positives = 44/122 (36%), Gaps = 9/122 (7%) Query: 78 LQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVE 137 L G + F I + K S+G +R P + + Q+A L + + Sbjct: 72 LSEGHEIKF------ITLMAIFHKGTKSEGRGRRQFAPSEEASYQLA-LTKLAKAGFKPG 124 Query: 138 DVHPISERPQYFSGD--GKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLS 195 + + + G+ + +G I++ + G+GP + GCG + Sbjct: 125 QIVVSGPKFVHIDKGNAGRGFTLPVFTVQGTAIISNQQEAEVGIVYGVGPKRVFGCGFMH 184 Query: 196 LA 197 LA Sbjct: 185 LA 186 >UniRef50_Q1YSZ9 ATP-dependent helicase HrpA n=2 Tax=unclassified Gammaproteobacteria (miscellaneous) RepID=Q1YSZ9_9GAMM Length = 1309 Score = 41.0 bits (95), Expect = 0.025, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 37/117 (31%), Gaps = 8/117 (6%) Query: 67 TVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWL 126 I K G + + NP++ +++ + R+ L++ EQ+ +L Sbjct: 1033 GKISIKAWPALRDCGDSVSLEVMDNPLQAEKVSRQG--------QLRLALLRGREQVKYL 1084 Query: 127 QRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGI 183 ++ L + Q S Q F G + D QGI Sbjct: 1085 EKNLLRGKDLALKAANVGARQRLIDALISASFQQAVFSGHSVVRDQQQFDQAYDQGI 1141 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.152 0.471 Lambda K H 0.267 0.0469 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,252,029,752 Number of Sequences: 3077464 Number of extensions: 48710727 Number of successful extensions: 157187 Number of sequences better than 1.0e-01: 115 Number of HSP's better than 0.1 without gapping: 243 Number of HSP's successfully gapped in prelim test: 59 Number of HSP's that attempted gapping in prelim test: 156239 Number of HSP's gapped (non-prelim): 337 length of query: 199 length of database: 1,040,396,356 effective HSP length: 122 effective length of query: 77 effective length of database: 664,945,748 effective search space: 51200822596 effective search space used: 51200822596 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 90 (39.1 bits)