BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (363 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobact... 753 0.0 UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=A... 296 8e-79 UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=G... 249 8e-65 UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Ac... 168 2e-40 UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=S... 165 3e-39 UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes Rep... 160 1e-37 UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 159 2e-37 UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=B... 157 5e-37 UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=A... 155 3e-36 UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putati... 153 9e-36 UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 147 5e-34 UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID... 147 8e-34 UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=S... 142 3e-32 UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actino... 141 3e-32 UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 T... 141 4e-32 UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=A... 140 5e-32 UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospi... 140 7e-32 UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=R... 140 9e-32 UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=B... 139 2e-31 UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces R... 138 3e-31 UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=S... 137 4e-31 UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=A... 137 7e-31 UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=P... 136 1e-30 UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=A... 135 2e-30 UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellul... 135 3e-30 UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacteriu... 134 4e-30 UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=D... 134 4e-30 UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria ... 134 6e-30 UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=T... 133 1e-29 UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinom... 133 1e-29 UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 133 1e-29 UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidob... 132 1e-29 UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfo... 132 2e-29 UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=R... 132 2e-29 UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granuli... 131 4e-29 UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=... 130 7e-29 UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Strepto... 130 7e-29 UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacteriu... 130 8e-29 UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=A... 129 2e-28 UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=D... 125 2e-27 UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=A... 125 3e-27 UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=... 124 4e-27 UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 T... 124 6e-27 UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=P... 123 1e-26 UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=J... 123 1e-26 UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=... 123 1e-26 UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=D... 123 1e-26 UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidob... 121 3e-26 UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax... 120 6e-26 UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=... 120 7e-26 UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=C... 118 3e-25 UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiob... 117 7e-25 UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=S... 116 1e-24 UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinoc... 115 2e-24 UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=G... 114 5e-24 UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=A... 114 8e-24 UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax... 113 1e-23 UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus ... 109 2e-22 UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter ro... 107 1e-21 UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=S... 106 1e-21 UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=B... 103 7e-21 UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=C... 98 4e-19 UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=C... 97 1e-18 UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=B... 91 6e-17 UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 88 5e-16 UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella bo... 86 1e-15 UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Ac... 85 4e-15 UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax... 59 2e-07 UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 ... 58 7e-07 UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax... 54 1e-05 >UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobacteria RepID=YGCJ_ECOLI Length = 363 Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust. Identities = 363/363 (100%), Positives = 363/363 (100%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE Sbjct: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA Sbjct: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 Query: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG Sbjct: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG Sbjct: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA Sbjct: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 Query: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN Sbjct: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Query: 361 GEA 363 GEA Sbjct: 361 GEA 363 >UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD49_CHRVI Length = 393 Score = 296 bits (758), Expect = 8e-79, Method: Compositional matrix adjust. Identities = 182/395 (46%), Positives = 243/395 (61%), Gaps = 40/395 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 NF+N HVLISHSPSCLNRDDMNMQK AIFGGK RVRISSQSLKRA+R S YYA+ S Sbjct: 5 NFVNFHVLISHSPSCLNRDDMNMQKTAIFGGKTRVRISSQSLKRAIRYSDYYARYFISKS 64 Query: 63 LRTIHLA-QLRDVLRQKLGERFDQKIIDKT----LALLSGKS-VDEAEKISADAVTPWVV 116 RT L ++ D L I+K A+ GK+ +DE K D + + Sbjct: 65 QRTRRLFDKMADELSASAESAEQTTAIEKCALYAAAIFEGKTKIDEIGKYERDKKSDHIE 124 Query: 117 GEIAWF-CEQVAKA-----EADNLDDKKLLKVLKEDIAAIRVNLQQ----GVDIALSGRM 166 +I F C ++ EA +K ++ +K +I R+ +Q +D+ALSGRM Sbjct: 125 TQIIPFSCAEIEGIKQILLEAAGKPEKGRIEYMKAEIQ--RLEREQRTRIDLDVALSGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDS-DIDWFTAVDDLQ----EQGSAHLGTQEFSS 221 A S ++ VDGA+++AHAITTH V+ DIDWFTAVDDL E G+ HL TQ+FS+ Sbjct: 183 ANSELIY---PVDGALAVAHAITTHTVEPQDIDWFTAVDDLTLDAGETGAGHLNTQQFSA 239 Query: 222 GVFYRYANINLAQLQENLG----------GASREQALEIATHVVHMLATEVPGAKQRTYA 271 GVFYRYA++NL QLQ NLG SR +AL+IA HV+H+LAT VP AKQ+++A Sbjct: 240 GVFYRYASLNLRQLQFNLGLLANINAEQTTESRARALDIARHVLHLLATVVPSAKQQSFA 299 Query: 272 AFNPADMVMVNFSDMPLSMANAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGA 328 A N AD V+V+ +D P+S+ANAFE+ ++ + GFLQPSI A YW RV + YGL+ Sbjct: 300 AHNLADFVIVSLADQPVSLANAFEEPIERERKIGGFLQPSITALADYWSRVNSAYGLDEQ 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 A F+L + Q + + ++ L+ W+ N+G A Sbjct: 360 ARAFALRGGIKLGDQ-EVLTSIADLEQWLANDGRA 393 >UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=Gammaproteobacteria RepID=A1SV72_PSYIN Length = 337 Score = 249 bits (637), Expect = 8e-65, Method: Compositional matrix adjust. Identities = 145/356 (40%), Positives = 209/356 (58%), Gaps = 27/356 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ FINIH LISH S +NRDD MQK A+FGG R RISSQ LKRA+R+S Y + + E Sbjct: 1 MTTFINIHTLISHPSSMMNRDDSGMQKTAVFGGSVRSRISSQCLKRAIRQSDIYGEAVAE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGK-SVDEAEKI-SADAVTPWVVGE 118 S+RT +L D+ ++ + E D K+I+ L + K + D+ +I + DAV P+ +G Sbjct: 61 KSIRTNKFDELLDLCKEAMPET-DIKLIEDVLLNMGSKVTKDKKTEIRNFDAVQPYAIGS 119 Query: 119 IAWFCEQVAKAEADNLDD-KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 I V E L D KK++++ +D+ALSGRM S Sbjct: 120 IREAINMVN--EGTELKDLKKIVQI-------------PTIDVALSGRMDAS---CPPRN 161 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQE 237 V+ AMS+AH++TTH D ++DWFTA DDL EQGS H+GT EFSSGVFYRYA+IN+ L + Sbjct: 162 VEAAMSVAHSLTTHSADIEVDWFTACDDLAEQGSGHIGTTEFSSGVFYRYASINVDLLAK 221 Query: 238 NLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKA 297 N+ E I T ++ A P AKQ+ +AA+N AD VM S+ P+S+ANAF K Sbjct: 222 NVKSTVSEVTPIINT-MIRCFAQVSPSAKQKVFAAYNQADFVMATHSNQPISLANAFRKP 280 Query: 298 VKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQL 353 ++ ++ SI A ++++++ N Y L+ A L+D +AQ KQ+ + ++ Sbjct: 281 IENNGDVMENSIAALVKHYEKLTNAYELDSKAIALDLTD----SAQSKQINLVNKI 332 >UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Actinomycetales RepID=C0W6U1_9ACTO Length = 374 Score = 168 bits (426), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 133/387 (34%), Positives = 195/387 (50%), Gaps = 56/387 (14%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IH++ S PSC+NRDD K A++GG RR+R+SSQS KRA R Y+ +++ Sbjct: 1 MSTFVDIHLIQSLPPSCVNRDDSGSPKSALYGGVRRLRVSSQSWKRATRL--YFNEHLDA 58 Query: 61 S--SLRTIHLAQLRDVLRQKLGERFD---QKIIDKTLALLSGKSVDEAEKIS-------- 107 + +RT + +L L +R + D LAL + V A KI Sbjct: 59 TDVGIRTKRVVEL-------LADRISAIAPDLADSALAL--AEQVFSAAKIKVAPPRGKK 109 Query: 108 -ADAVTPWVV----GEIAWFCEQVAKA--EADNLDDKKLLKVLKEDIAAIRVNLQQGVDI 160 A A + +++ +I E +A + +D K+ K+ KE + VDI Sbjct: 110 DAPAESGYLLFLSTSQINRLAEMATRAAHAGEKIDPKETKKIFKE---------EHAVDI 160 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD------LQEQGSAHL 214 AL GRM +L VD A +AHAI+TH +++ D+FTAVDD ++ G+ + Sbjct: 161 ALFGRMVADD--ADL-NVDAACQVAHAISTHAAENEYDFFTAVDDEKSRAMEEDAGAGMM 217 Query: 215 GTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN 274 GT EFSS YRYA +NL L ENLG R+ AL + + +P KQ T+A Sbjct: 218 GTVEFSSATMYRYATVNLDMLVENLG--DRDAALRALSVFLEGFCLSMPTGKQNTFANRT 275 Query: 275 PADMVMVNF-SDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAA- 330 D V+V+ D P+S+ AFEK V+ DGFL S++A +Y + +GL A+ Sbjct: 276 LPDSVVVSVRDDQPVSLVGAFEKPVRTTESDGFLTRSVEALARYEHTIEENFGLKPQASF 335 Query: 331 QFSLSDVDPITAQVKQMPTLEQLKSWV 357 SL+DV P A + + T L V Sbjct: 336 VVSLADV-PELASLGERITFADLPGKV 361 >UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM53_SYNFM Length = 384 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 133/397 (33%), Positives = 209/397 (52%), Gaps = 54/397 (13%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI----G 59 F++IH++ + +PS LNRDD N KD FGG RR RISSQ +KR +R ++Q + G Sbjct: 2 FVDIHIIQNFAPSNLNRDDTNSPKDCEFGGYRRARISSQCIKRVVRSHRSFSQAVVHAGG 61 Query: 60 ESSLRTIHL-AQLRDVLRQKLG--ERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ +RT + ++L D+ +K G E + + + +T+ L G + + EK T +++ Sbjct: 62 DTGVRTKRIKSRLMDLFAKKYGKPEIVETEKVAETVIELLGLKLKDEEK------TEYLL 115 Query: 117 GEIAWFCEQVAKAEADNLD---------DKKL--------LKVLKEDIAAIRVNLQQ--- 156 Q+A+ D+ D DKK LK +E++ I ++ Sbjct: 116 YLGENEAAQLARLAVDSWDALLAIEPEQDKKKKKGTGQESLKEFQEELKGIVGKRRKEAR 175 Query: 157 --GVDIALSGRM-ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQG 210 DIAL GRM A + M VD A +AHA++T++V+ ++D+FTAVDDL +E G Sbjct: 176 SYAADIALFGRMIADNKNMN----VDAACQVAHAVSTNKVEMEMDYFTAVDDLLPGEETG 231 Query: 211 SAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTY 270 S +G EF+S FYRY+N+N+++L ENLG + + +V + + VP KQ + Sbjct: 232 SDMIGVVEFNSSCFYRYSNVNVSKLAENLGFNNDLTTAALLGYVEASVKS-VPTGKQNSM 290 Query: 271 AAFNPADM--VMVNFSDMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLN 326 AA NPA V+V P S+ANAF+K V+ + SI A +Y++R+ YG Sbjct: 291 AAQNPAGYARVIVRRDGFPWSLANAFQKPVRPSLDKSLEEASIDALERYFERLKAVYGTE 350 Query: 327 G--AAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 G A F+L D +++M L+ LK+ V G Sbjct: 351 GIVCDASFNLHRDD--GGSLRKM--LDALKACVAGEG 383 >UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes RepID=Q03C61_LACC3 Length = 361 Score = 160 bits (404), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 108/320 (33%), Positives = 174/320 (54%), Gaps = 27/320 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 +I+IHVL + + +NRDD K A++GG R R+SSQS KRAMR + ++ ++ L Sbjct: 7 YIDIHVLQTVPSANINRDDTGAPKKALYGGVTRARVSSQSWKRAMRLR-FNQEDHDDAGL 65 Query: 64 RTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 RT + Q LR L+ D++I K A+ S + KI+ D T ++ Sbjct: 66 RTKEVPQLLRQALKAAAPALTDEEIAAKVDAVFSTAKI----KITKDGQTGALMLISTGQ 121 Query: 123 CEQVAKAEADN--LDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 +++A+ DN LD K+L K+ K +Q +D+AL GRM EL V+G Sbjct: 122 LKKLAQYALDNEALDKKELTKLFKG---------EQSLDLALFGRMVADN--PEL-NVEG 169 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFYRYANINLAQLQE 237 + +AHAI+TH++ + D+FTA+DD + + G+A LGT E++S YRYAN+N + + Sbjct: 170 SAQVAHAISTHEIVPEFDYFTALDDFKPEDNAGAAMLGTVEYNSSTLYRYANLNFQEFEA 229 Query: 238 NLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMANAFEK 296 N+GG + A+ A + +P KQ T+A + VMV D P+++ +AFE Sbjct: 230 NIGGRA---AVSGALSYIKEFLLSMPNGKQNTFANKTLPNYVMVTLRPDTPVNLVSAFED 286 Query: 297 AVKAKDGFLQPSIQAFNQYW 316 VK+ G+++ S++ Q + Sbjct: 287 PVKSNHGYVEASVKRLEQEY 306 >UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD3_THET1 Length = 382 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 119/350 (34%), Positives = 184/350 (52%), Gaps = 40/350 (11%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 + +H++ + +PS LNRDD KD FGG RR RISSQ +KRA+R+ + QN L Sbjct: 2 LVELHMIQNFAPSNLNRDDTGSPKDCEFGGVRRARISSQCIKRAIRRE--FKQN---GLL 56 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV--TPWVV----G 117 + +A+ ++ Q++ +R + D+ A + A K+ D T +++ G Sbjct: 57 DSERIAERTRLVTQEIADRLARLGRDREQATRVAGFLLSAAKLKVDNSQRTEYLLFLGRG 116 Query: 118 E---IAWFC----EQVAKAEADNLDD-----KKLLKVLKEDIAA---IRVNLQQGVDIAL 162 E I C +Q+A +L D KK + + D++ R++ + D+AL Sbjct: 117 EIDAITALCNERWDQLAPLADQSLSDQSNDKKKAAQQVPADMSRELLARLDGGKAADLAL 176 Query: 163 SGRMATSGMMTELG--KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQ 217 GR M+ +L +D A +AHAI+TH+V + D++TAVDDLQ E G+ +GT Sbjct: 177 FGR-----MLADLPDKNIDAASQVAHAISTHRVSIEFDFYTAVDDLQPESETGAGMMGTV 231 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 EF+S FYRY+N+++ QL NL G RE AL+ +H +P KQ + AA NP Sbjct: 232 EFNSACFYRYSNVSMEQLITNLQG-DRELALKTLEAFIHASVRAIPTGKQNSMAAHNPPS 290 Query: 278 MVM-VNFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYG 324 MV V P S+ANAF + V ++ + SIQA + YW ++ + YG Sbjct: 291 MVFAVVREGAPWSLANAFARPVAPGREEDLVGRSIQALDSYWGKLVSVYG 340 >UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=Bacteria RepID=B4S8P9_PROA2 Length = 347 Score = 157 bits (398), Expect = 5e-37, Method: Compositional matrix adjust. Identities = 120/327 (36%), Positives = 172/327 (52%), Gaps = 44/327 (13%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 I H+L S +CLNRDD+ K AI GG R R+SSQ KR +R S Q+ G Sbjct: 12 IEYHILQSFPVTCLNRDDVGAPKTAIVGGSTRARVSSQCWKRQVRLS---MQDFG----- 63 Query: 65 TIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCE 124 I L +R K F K L G S ++A + + + +F E Sbjct: 64 -IKLG-----IRSKKVSEFVAKA-----CLQKGASEEQAAECGKVISDSFSKDTLFFFSE 112 Query: 125 QVAKAEAD----------NLDDKKLLKVLKEDI-AAIRVNLQQGVDIALSGRMATSGMMT 173 A+A AD NL+DK++ KV K+ + AI G+DIAL GRM T Sbjct: 113 SEAQAFADYAREKNFDSKNLNDKEIRKVAKKALNPAI-----DGLDIALFGRMVAQA--T 165 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-QEQGSAHLGTQEFSSGVFYRYANINL 232 +L ++ A S +HAI+TH+V +++++FTA+DDL +E GSAH+G+ EF+S +YRY +++L Sbjct: 166 DLN-IEAAASFSHAISTHKVSNEVEFFTALDDLAEEPGSAHMGSLEFNSATYYRYISLDL 224 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMAN 292 QL E++GG +A+E T L VP A+Q T + +P + + F + Sbjct: 225 GQLWESIGGEHLAEAVESLTKA---LFVAVPSARQTTQSGASPWEFAKI-FIRKGQRLQV 280 Query: 293 AFEKAVKAKD-GFLQPSIQAFNQYWDR 318 FE AVKAKD G+LQPSI A Y + Sbjct: 281 PFETAVKAKDGGYLQPSITALTDYLTK 307 >UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW7_ACIFD Length = 386 Score = 155 bits (391), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 122/362 (33%), Positives = 179/362 (49%), Gaps = 56/362 (15%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN---IGES 61 I++HVL + PSCLNRDD N K A++GG RR R+SSQS KRA R+ Y+ +N IG Sbjct: 9 IDVHVLQTLPPSCLNRDDTNAPKTALYGGARRARVSSQSWKRATRR--YFNENLATIGTD 66 Query: 62 SLRT----IHLAQLRDVLRQKLGERF-DQKIIDKTLALLSGKSV-------DEAEKISAD 109 LR+ I +L +L +++ R D + + +A L + +E K A Sbjct: 67 WLRSRGGGIRTRKLAGLLHERVQARVRDLDVREDDVARLVNLAAGALLGLKEEKLKKRAQ 126 Query: 110 AVTPWVVGEIAWFCEQVAKAEA-----------DNLDDKKLLKVLKEDIAAIRVNLQQGV 158 P + E A F + A A D+LD L + D++ + Sbjct: 127 ETQPADL-EYALFVSESAIDAAVGELERSLRAGDDLDLDVLTTAMGRDLS---------L 176 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLG 215 D+AL GRM T VD A +AHAI+TH+V S+ D++T VDDL E G+A +G Sbjct: 177 DVALFGRMIAD---TPNLNVDAACQVAHAISTHRVTSEFDFYTTVDDLAGDDETGAAMMG 233 Query: 216 TQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHV---VHMLATEVPGAKQRTYAA 272 EF+S YR+A ++L +L +NLG + T V + A +P Q T+AA Sbjct: 234 FIEFNSATVYRFATVSLGRLADNLGDPD-----AVPTGVRAFIEAFAKSLPTGHQNTFAA 288 Query: 273 FNPADMVMVNF-SDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYG---LNGA 328 D+V V+ D P+S+ AFE V++ G++ S + Y D + YG LNG Sbjct: 289 LTVPDLVFVSMRGDQPVSLVGAFEAPVESDRGYVHASAERLATYADDIDGLYGVPRLNGW 348 Query: 329 AA 330 A+ Sbjct: 349 AS 350 >UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putative n=2 Tax=cellular organisms RepID=B0TDU0_HELMI Length = 385 Score = 153 bits (387), Expect = 9e-36, Method: Compositional matrix adjust. Identities = 118/356 (33%), Positives = 181/356 (50%), Gaps = 46/356 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL- 63 + IHVL +H+P+ LNRD+ KD +FGG RR RISSQ KR +R S + +IGES L Sbjct: 2 VEIHVLQNHAPANLNRDESGSPKDCMFGGVRRGRISSQCQKRTIRCSPLFQDSIGESRLG 61 Query: 64 -RTIHLAQL--RDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 RT L L +++R L E + K L + + ++I+A A+ ++ E Sbjct: 62 MRTRKLPFLVKEELMRLGLSEELAKIGARKASGLGNKDGKERDDEITAQAI--FLTQEDV 119 Query: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ----------GVDIALSGRMATSG 170 +A+ +L D K +K+ A LQ+ VD+AL GRM TS Sbjct: 120 SV---IARCLFRHLKD----KTVKQAKAIKAQELQKDPELVGWRPVTVDVALFGRMTTS- 171 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRY 227 T V+ ++ + HAI+TH+VDS+ D+FTAVDDL + G+ +G EF+S +Y+Y Sbjct: 172 --TAFNDVEASVQVGHAISTHRVDSEFDYFTAVDDLMGDGDSGADMIGDTEFNSCCYYKY 229 Query: 228 ANINLAQLQENLGGASR------EQALEIATHVVHMLATEV-------PGAKQRTYAA-- 272 N+++ +L+ NL G R E+ ++A H++ + P KQ ++AA Sbjct: 230 FNVDMDELKRNLAGPDRLKKLTAEERQDLARDAAHIVKAFIESLVFCSPDGKQNSFAARQ 289 Query: 273 FNPADMVMVNFSDMPLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLN 326 A +V V +P+S ANAF K V A+ +Q S+ AF + +GL Sbjct: 290 LPSAVLVEVKKRKIPVSYANAFVKPVTARGEMDLVQASVNAFLDHVKETEKCFGLT 345 >UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ3_THEFY Length = 373 Score = 147 bits (372), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 111/335 (33%), Positives = 164/335 (48%), Gaps = 29/335 (8%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 F++IH + + S +NRDD+ K ++GGK R R+SSQS KRA+R +G+ + Sbjct: 2 TFVDIHAIQTLPYSNINRDDLGSPKTVVYGGKERTRVSSQSWKRAVRHE--VEARLGDKA 59 Query: 63 LRTIHLAQLRDVLRQKLGER-FDQKIID---KTLALLSGKSVD---EAEKISADAVTPWV 115 +RT + + ++L ER +D + D + + L GK E EK S T + Sbjct: 60 VRTRRIIS---EIAKRLRERGWDADLADAGARQVVLSVGKKSGIKLEKEKDSEAPATSVL 116 Query: 116 -------VGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMAT 168 + E+A ++ A A K +L D V + V + L GRM Sbjct: 117 FYLPVPAIDELAAIADEHRDAVAKEAAKKTPKGILPAD-RITEVLKSRNVSVNLFGRMLA 175 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFY 225 TE VDGA+ AHA T H ++D+FTAVDD+ ++ GS H+ +FS+G FY Sbjct: 176 ELPSTE---VDGAVQFAHAFTVHGTTVEVDFFTAVDDIPKENDHGSGHMNAGQFSAGTFY 232 Query: 226 RYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS- 284 RYAN+NL +L EN G A + A + + VP KQ AA D+V + Sbjct: 233 RYANVNLDRLVENTGDA--QTARTAVAEFLRAFLSTVPSGKQNATAAMTLPDLVHIAVRF 290 Query: 285 DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRV 319 D P+S A AFE A+ DG+ + Q N Y +R+ Sbjct: 291 DRPISFAPAFETALYGSDGYTLRACQELNNYAERL 325 >UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID=C2BET9_9FIRM Length = 359 Score = 147 bits (370), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 107/330 (32%), Positives = 178/330 (53%), Gaps = 38/330 (11%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY-----AQNI 58 F++IH + + P+ +NRDD K A +GG R R+SSQS KRA+RK Y+ +N+ Sbjct: 10 FLDIHAIQTVPPANINRDDTGSPKTAQYGGVTRARVSSQSWKRAIRK--YFNENGDVENV 67 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 G SL + + V QK G ++ ++ ++ K+++ A+ + D + Sbjct: 68 GIRSLEIVRYVANKIV--QKDGSISIEEAME-----MADKTINNAKISTKDQKAKALFFM 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLK-VLKEDIAAIRVNLQQGVDIALSGRM-ATSGMMTELG 176 E++A+A D ++DKK+L+ +LK D + +D+AL GRM A + E Sbjct: 121 SDKQAEELAQASIDKVNDKKILQEILKNDTS---------IDVALFGRMVADDASLNE-- 169 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFYRYANINLA 233 D + +AHAI+TH + S+ D+FTAVDDL + G+ LGT E++S YRYANI L Sbjct: 170 --DASSQVAHAISTHAIQSEFDFFTAVDDLAPEDNAGAGMLGTVEYNSSTLYRYANIALH 227 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMAN 292 L A +E+ ++ V +P K T+A ++V+ SD PL+M + Sbjct: 228 DFYRQL--ADKEETIKATKLFVKSFVESMPTGKINTFANQTLPQAIVVSLRSDRPLNMVS 285 Query: 293 AFEKAVKAKDGFLQPSIQA-FNQY--WDRV 319 AFE+ +K+ +G++ SI+ F++Y +D++ Sbjct: 286 AFEEPIKSDNGYVDKSIEKLFSEYTKYDKI 315 >UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ6_SALAI Length = 380 Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 115/354 (32%), Positives = 183/354 (51%), Gaps = 42/354 (11%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 + +++IHVL + + LNRDD+ K FG R R+SSQS KRA+R+ ++ G+ Sbjct: 3 ARYVDIHVLQTVPYANLNRDDLGSPKTVRFGYADRTRVSSQSWKRAVRRE--LEESSGDK 60 Query: 62 SLRTIHLAQLRDVLRQKL-GERFDQKI-------IDKTLALLSGKSVD-EAEKISADAVT 112 + RT L Q ++ +L G +D ++ + TLA ++ K+ + +K + +A Sbjct: 61 AKRTRRLPQ---AIQARLTGPDWDSELAAFAATQVMATLATIAVKADGFKVDKATGEAQV 117 Query: 113 PWVVGEIAW-----FCEQ----VAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVD-IAL 162 + + E A+ C Q + + L KK L D A+R ++ D I L Sbjct: 118 LFYLPERAFDMLADVCVQQRDRLIGLRSGALKLKKGEAPLPAD--AVRAAMEHRSDVINL 175 Query: 163 SGRMATSGMMTEL--GKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ----GSAHLGT 216 GRM + EL VDGA+ +AHA TTH D +D+FTAVDDL++ GS H+ + Sbjct: 176 FGRM-----LAELPGSNVDGAVQVAHAFTTHGTDPQVDFFTAVDDLKQDADQAGSGHMNS 230 Query: 217 QEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPA 276 EFS+G FYRYA++NL L NLG + A+E+ + T +P AK+ A F Sbjct: 231 AEFSTGTFYRYASVNLEDLAHNLGDPA--TAVELTRVFLSAFITAMPQAKKNATAPFTVP 288 Query: 277 DMVMVNF-SDMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNG 327 ++ + +D P+S+A+AFE V+A G+ +PS + +Y ++ G G Sbjct: 289 ELAYIAVRTDRPVSLASAFETPVRATFDSGYAEPSRRQLAEYAGQIYRLIGDQG 342 >UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actinomycetales RepID=Q2JH28_FRASC Length = 384 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 105/338 (31%), Positives = 170/338 (50%), Gaps = 22/338 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M +I++H+L + PS LNRDD K A++GG +R R+SSQ+ KRA R + +A +I + Sbjct: 1 MRCYIDVHILQTVPPSNLNRDDAGTPKQAVYGGVKRARVSSQAWKRATRTA--FADHIDQ 58 Query: 61 SSLRTIHLAQLRDVLRQKLGER--FDQKIIDK-TLALLSGKSVDEAEKISADAVT----- 112 + L T ++ +L ++L R D + + +LL+ + +K + A Sbjct: 59 AQLGT-RTKRISALLAERLATRCALDAETSTRIATSLLTALKISAGKKAAETAYLLFFGR 117 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 P + I E V + +L D LL +K+ + +D+AL GRM + Sbjct: 118 PQLERLIDLIVEDVPRLA--DLSDGDLLAAVKDVPVLATLGSDHPIDVALFGRMVAD--L 173 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYAN 229 L VD A +AHA++TH VD + D++TAVDD E G+ +GT EF S YR+A Sbjct: 174 ASL-NVDAATQVAHALSTHAVDVEFDYYTAVDDQNAKDETGAGMIGTVEFQSATLYRFAT 232 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPL 288 + L QL ENLGG E +E + T +P Q ++A +++ + D P+ Sbjct: 233 VGLHQLAENLGG-DIEATVEALRVFLTAFTTSMPTGHQNSFAHRTVPNLLTIAIRPDQPV 291 Query: 289 SMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGL 325 ++ +AFEK V + G L S++ F + + +GL Sbjct: 292 NLVSAFEKPVLPRGRGVLTGSLEQFAIELNSASTLWGL 329 >UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 Tax=Gardnerella vaginalis RepID=D2RB01_GARVA Length = 362 Score = 141 bits (355), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 104/327 (31%), Positives = 171/327 (52%), Gaps = 28/327 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I + S P +NRDD K A +GG R R+SSQ K +MR+ Y+ ++ G+S++ Sbjct: 6 FLDIQAIQSVPPCNINRDDAGSPKTAQYGGVTRARVSSQCWKHSMRE--YFKEHSGDSNV 63 Query: 64 ----RTI--HLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 + I ++A L+ +L E+ + +KTL K+ + KI + +G Sbjct: 64 GMRSKNIVKYVADKIITLKPELSEQEALDLANKTLNNAGFKTKTDKGKIIPVVNVLFFLG 123 Query: 118 EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRM-ATSGMMTELG 176 E +A+A +N+ DKK L+ + +D I DIAL GRM A + + E Sbjct: 124 ENQ--ANSLAQAAINNVTDKKQLEEILKDNPPI--------DIALFGRMLADNPSLNE-- 171 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 D + +AHAI+TH V ++ D++TAVDDL G+ LGT E++S YRYAN+ + Sbjct: 172 --DASSQVAHAISTHAVRAEFDYYTAVDDLSVDDNAGAGMLGTIEYNSSTLYRYANVAIH 229 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMAN 292 + L ++E + + A +P K T+A M++V D P+++ + Sbjct: 230 EFSHQLSD-NKESTINALKLFIEAFANAMPTGKVNTFANQTLPQMLVVTLREDRPVNLVS 288 Query: 293 AFEKAVKAKDGFLQPSIQAFNQYWDRV 319 AFE VKAKDG++ SI+ +Q +++V Sbjct: 289 AFEDPVKAKDGYVSKSIEKLSQEYEKV 315 >UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA32_ALHEH Length = 385 Score = 140 bits (354), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 107/356 (30%), Positives = 174/356 (48%), Gaps = 46/356 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R++ ++ S + Sbjct: 2 FLQIHTLTSYHAALLNRDDAGLAKRIPFGSAERMRVSSQCLKRHWRQALKDVISL-PSGI 60 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-------EKISADAVTPWVV 116 RT H + R+V R+ + E + D+ L+GK +D EK S P + Sbjct: 61 RTRHFFE-REVCRRVIAE----GVEDEKARELTGKLIDAVMHSKEAREKDSLFLKQPVLF 115 Query: 117 G--EIAWF------CEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALS 163 G E +F C + + L D KK + L + AA +L+ G++ AL Sbjct: 116 GRPEADYFVSLITECARSGEDPGSTLKDRVKAEKKNFRALLQ--AAGGSDLESGIEGALF 173 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ----GSAHLGTQEF 219 GR TS + L + D ++ +AHA T H +++++D+FT VDDL+E G+AH G E Sbjct: 174 GRFVTSDI---LARTDASVHVAHAFTVHSLNNEVDYFTVVDDLKEPGEDAGAAHAGDMEL 230 Query: 220 SSGVFYRYANINLAQLQENLGGASRE----------QALEIATHVVHMLATEVPGAKQRT 269 +G+FY Y +++ L NL G R+ A ++ +VH +AT PGAK Sbjct: 231 GAGLFYGYVVVDVPLLVSNLSGCERQAWREQTEACADARDVLAALVHSIATVSPGAKLGA 290 Query: 270 YAAFNPADMVMVNF-SDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 A + D ++ + P ++ANA+ + + A+ +Q S+ Y + + +G Sbjct: 291 TAPYARTDCALLETGTTQPRALANAYLEPLPARGDLMQQSVNTMGHYLKSLDDMFG 346 >UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV95_9BACT Length = 393 Score = 140 bits (353), Expect = 7e-32, Method: Compositional matrix adjust. Identities = 119/383 (31%), Positives = 189/383 (49%), Gaps = 49/383 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES-SL 63 H+L S +CLNRDD+ K A+ GG +R R+SSQS KRA+R + + ++G + + Sbjct: 13 FEFHILQSFPVTCLNRDDVGSPKTAMIGGSQRARVSSQSWKRAVRLAMH---DLGVTHGV 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQK--IIDKTLALL--------------SGKSVDEAEKIS 107 RT ++ L + LG +Q DK A+ G S + E++ Sbjct: 70 RTKLISPLIAEACRSLGATPEQARACGDKVEAVFIKKDEKGKKKSAKTKGDSDTQDEEVG 129 Query: 108 ADAVTPWV-------VGEIAWFCEQVAKAEAD------NLDDKKLLKVLKEDIAAIRVNL 154 +D+ + EI+ + K E D D KK K + + I + + Sbjct: 130 SDSSSEKTDTLLFLSPKEISVLANEFKKQEFDPGKVIVQSDPKKQAKEIADMIGKVPEGI 189 Query: 155 QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAH 213 VDIAL GRM EL V+ A S AHAI+TH+V +++++FTA+DD + G+AH Sbjct: 190 D-AVDIALFGRMVAQA--AEL-NVEAAASFAHAISTHKVANEVEFFTALDDCAVDPGAAH 245 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G+ EF+S +YRY +++L QL + L G + +E V L VP A+Q T + Sbjct: 246 MGSLEFNSATYYRYVSLDLGQLSQTLAGQHIPETIEA---FVKALFVSVPAARQSTQSGA 302 Query: 274 NPADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVANGYG-LNGAAAQ 331 +P D + + FE A+K+KD GFL+PSI+ Y +R +G L G A+ Sbjct: 303 SPWDFAKI-LVRTGHRIQIPFETAIKSKDGGFLKPSIEEMKAYLNRQEKLHGSLFGKKAE 361 Query: 332 FSLSD-----VDPITAQVKQMPT 349 ++ + +D + + +KQ T Sbjct: 362 YTYGEDENFTIDDLISALKQQAT 384 >UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR15_ROSS1 Length = 402 Score = 140 bits (352), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 122/399 (30%), Positives = 191/399 (47%), Gaps = 52/399 (13%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 I +H+L +H+PS LNRDD N KDAIFGG RR RISSQ++KR++R S ++ L Sbjct: 2 LIALHLLQNHAPSNLNRDDNNEPKDAIFGGVRRARISSQAIKRSIRWSDHFRAPFETQGL 61 Query: 64 RTIHLAQLRDVLRQKLGERF----DQKIIDKTLALL------SGKSVDEAEKISADAVTP 113 I L + +R L DQ+ I + A L S EA D P Sbjct: 62 LAIRTQLLPEKVRHHLVNAGLNDDDQRAIVEAAARLGKGEQRSPSGEGEAGDERGDQNQP 121 Query: 114 WVV---------GEIAWFCEQVAKAEADNLDD-------KKLLKVLKEDIAAIRVNLQQG 157 E++ A+ L + ++L+++++E A +N QG Sbjct: 122 RSSSRSRRSSRQSNTTGDAERIKTAQLMFLTENEIQQLAQRLIEIVREK-GAKHLNELQG 180 Query: 158 --------------VDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 VDIA+ GRM TS + V+ A+ +AHAI+TH V+ + D++TAV Sbjct: 181 DTLVREIGEYEPHSVDIAMFGRMTTSSPFKD---VEAAVQVAHAISTHAVEMEFDFYTAV 237 Query: 204 DDLQ-EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEV 262 DD+ E G+ +G F+S +Y+Y +I+ L +NL G + A + ++ + Sbjct: 238 DDISGEAGAGFIGDTTFNSATYYKYFSIDWDGLLKNLHG-EQNVARQSVEALIRAALFAI 296 Query: 263 PGAKQRTYAAFNPADMVMVNF--SDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDR 318 P KQ ++AA N D+ +V ++ LS ANAF K V+A K ++ S +A +Y Sbjct: 297 PSGKQNSFAAHNLPDLALVEVRKENIALSYANAFVKPVRATGKLSLIEASAKALEEYIPA 356 Query: 319 VANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWV 357 + Y L +A + LS V + + LE+L +W+ Sbjct: 357 INERYNL--SAQRAFLSTVPFTLSGAECCSDLEKLITWL 393 >UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=Bacteria RepID=A4XYU0_PSEMY Length = 384 Score = 139 bits (349), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 122/388 (31%), Positives = 188/388 (48%), Gaps = 43/388 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F+ H++ + +PS LNRDD KDA+FGG RR R+SSQ KRA+R + + + Sbjct: 1 MSLFVEFHLIQNFAPSNLNRDDTGAPKDALFGGHRRARVSSQCFKRAIRLAAQEHELVA- 59 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV---- 116 R + +L+ +L ++L R + K L+ + K+ D T +++ Sbjct: 60 PEFRGVRTKKLKTLLLERLAGRDPLEAEGKIEVALAAAGL----KLKDDGKTEYLLFLGE 115 Query: 117 GEIAWFC-------EQVAKAEADNLDDKKLL-----------KVLKEDIAAIRVNLQQGV 158 EIA F +++A A A + +V+K+ A ++ + V Sbjct: 116 AEIAGFATLIEQHWDELAGAPAGGEKKGEKKGKKEAKASAPAEVVKK--AKALLDGGKAV 173 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD---LQEQGSAHLG 215 D+AL GRM M E+ + D A +AHAI+TH+V+ + D+FTAVDD E G+ +G Sbjct: 174 DVALFGRMLAD--MPEVNQ-DAACQVAHAISTHRVEREFDYFTAVDDKGGPDETGAGMIG 230 Query: 216 TQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 EF+S YRYA ++ +L NL RE L + +P KQ T+AA N Sbjct: 231 QVEFNSATLYRYAVVDAGKLLGNL-QQDRELTLSALEAFTQAMVRAIPTGKQNTFAAHNL 289 Query: 276 ADMVMVNFSDM-PLSMANAFEKAVKAK-DGFLQP-SIQAFNQYWDRVANGYGLNGAAAQF 332 V V PL++ANAFEK + A+ D L S+ ++ ++A Y A+ Q+ Sbjct: 290 PSFVGVCLRHAGPLNLANAFEKPIAARQDAALSSLSVTELAKHEGKLAAVYA--DASDQW 347 Query: 333 SLSDVDPITAQVKQMPT--LEQLKSWVR 358 + D+ Q K L +L SWVR Sbjct: 348 AYLDLSEAWPQQKGFAVQNLGELASWVR 375 >UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS8_STRKN Length = 393 Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 112/333 (33%), Positives = 162/333 (48%), Gaps = 52/333 (15%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 + FI++H++ S + LNRDD N K +G R R+SSQS KRA R+ + + IG++ Sbjct: 5 ARFIDVHIVQSVPFANLNRDDTNSVKTVQYGNTLRTRVSSQSWKRATRE--VFQERIGQA 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDK--TLALLSGKSVD---------EAEKISAD- 109 +LRT +++GER Q++ + AL E K AD Sbjct: 63 ALRT-----------RRIGERVTQELEGRGWPPALAQRAGGHAAAASSIKFELAKDPADN 111 Query: 110 --------------AVTPWVVGEIAWFCEQVAKAEADNLDDKKLL--KVLKEDIAAIRVN 153 V V E+A EQ + D KK VL +D + Sbjct: 112 KQFLPNTVLTNAMVYVPEAAVAELADLAEQHRQELESAKDIKKPADKSVLPKDAVEAVLR 171 Query: 154 LQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-----QE 208 + GV I L GRM + VDGA+ +AHA+TTH+ D ++D+F+AVDD+ Sbjct: 172 SRNGV-INLFGRMLAE---VDDAGVDGAVQVAHAMTTHETDVELDYFSAVDDITAAWKDS 227 Query: 209 QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQR 268 GS H+G EFS+G FYRYA ++L L N+GG R IA + + + +P AK+ Sbjct: 228 TGSGHMGHTEFSAGTFYRYATVDLRDLATNIGGEVRAARELIAAFLASYIES-LPQAKKN 286 Query: 269 TYAAFNPADMVMVNF-SDMPLSMANAFEKAVKA 300 + A D+V ++ SD PLS A AFEK V+A Sbjct: 287 STAPHTIPDLVHISVRSDRPLSYAAAFEKPVRA 319 >UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC4_SYNJA Length = 380 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 112/340 (32%), Positives = 172/340 (50%), Gaps = 32/340 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 + IH++ S P+ LNRD+ M K IFGG+ R RISSQ KRA+RK YY Q + L Sbjct: 3 LEIHLIQSFPPANLNRDENGMPKSTIFGGRPRARISSQCQKRAVRK--YYHQY---AELD 57 Query: 65 TIHL-AQLRDVLRQ------KLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 H A+ R+ L + K G +Q + LAL G + +K A + Sbjct: 58 PAHFAARSRNWLPELKSKLVKAGIPDEQAGMAARLALEQGLKLKFNDKNEATTIVFLGKT 117 Query: 118 EIAWFCEQVAK----AEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 E+ E + K E+ ++K KL + + + I V+ + D+AL GRM S Sbjct: 118 ELDAIAEILIKNWSAIESGLREEKPKLPQKIAKAIEKALVDTGKPGDVALFGRMMASLPT 177 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYAN 229 VD A+ +AHAI+ + + + D+FTAVDDL ++ G+ H+G ++S +YR+A Sbjct: 178 V---NVDAAVQVAHAISINALQQEFDFFTAVDDLGSSEDTGADHMGETGYNSSTYYRFAV 234 Query: 230 INLAQLQENLGGASREQAL--EIATHVVHMLATEVPGAKQRTYAAFN-PADMVMVNFSDM 286 ++ QL ENLGG ++ AT +H VP Q +AA PA ++ V Sbjct: 235 LDKKQLVENLGGTEHLGSIIKAFATAFIHA----VPSGHQNGFAAHTRPALVMAVVREGQ 290 Query: 287 PLSMANAFEKAVKAKDGF--LQPSIQAFNQYWDRVANGYG 324 P+S+ +AFE V GF L+ +++A ++YW + YG Sbjct: 291 PISLVDAFENPVAPSGGFSLLENAVKALDEYWGSLVKMYG 330 >UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=B6B782_9RHOB Length = 353 Score = 137 bits (344), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 108/330 (32%), Positives = 164/330 (49%), Gaps = 19/330 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ H+L ++ S NRDD K A+ GG R+RISSQSLKRA+R+S Y+AQ++ G Sbjct: 1 MTTFVQFHLLTTYPLSNPNRDDQGRPKQAMIGGSPRLRISSQSLKRALRESSYFAQDLAG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 + RT LA ++ + +G+ + D+T + G + EK S +A T + Sbjct: 61 HTGTRTRRLAT--ELKAELIGQGVEDAHADETATKI-GAVFSKTEKGSTNATTLAFISPD 117 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 W +A+ A + L K+ AI VDIA+ GRM + D Sbjct: 118 EW---ALARELAARDVAGEPLPAEKDLKKAILRRADGAVDIAMFGRMLAD---SPDYNRD 171 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQL 235 A+ +AHA TTH+ + DWF+AVDDL+ + G+ H+G F SG++Y YA +N+ L Sbjct: 172 AAVQVAHAFTTHRAQAQDDWFSAVDDLKTREVDAGAGHIGEHGFGSGIYYLYACVNVDLL 231 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ENL G R A + + LAT P KQ ++A A + V P ++ AF Sbjct: 232 VENLAG-DRALAAKGMEALARALATATPKGKQNSHAHHPRAGFIRVERGQQQPRDLSGAF 290 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 K A + + S++A ++ YG Sbjct: 291 HKPTAADE---RASVEALQGMAAKIDRAYG 317 >UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ARH7_PELPD Length = 374 Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 112/316 (35%), Positives = 168/316 (53%), Gaps = 20/316 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M + IHVL + +PS LNRDD KDA+FGG RR R+SSQ LKR++R+ + QN G Sbjct: 1 MKTIVEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARVSSQCLKRSVREY-FKDQNKGW 59 Query: 61 SSLRTIHLA-QLRDVLRQKLGERFD------QKIIDKTLALL-SGKSVDEAEKISADAVT 112 + RT + L++ + L + D K I+ ++ L S K V ++ +D + Sbjct: 60 VADRTKRVVYALKERISPVLESQKDFSEDNLLKAIEVAVSNLGSNKKVKVDKEKKSDVLL 119 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAI--RVNLQQGVDIALSGRMATSG 170 EI + VA++ AD L K +V++ AI + VD+AL GRM Sbjct: 120 FLSPKEIDALAQVVAESYADLLKTKLSDQVVRNLNDAIDGENKSRLSVDVALFGRML--A 177 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFYRY 227 +M E + + A +AHAI+TH V+ + D++TAVDDL+ + G+ +GT EF+S FYRY Sbjct: 178 VMPEKNQ-NAACQVAHAISTHAVEREFDFYTAVDDLKPEDTAGADMMGTVEFNSACFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF--SD 285 A ++ +L NL A A + + P KQ T+AA NP + V V + Sbjct: 237 AVVDWEKLLVNL-QADEALATKGLRAFLEGFVVAEPTGKQNTFAAHNPPEFVAVTVRRNA 295 Query: 286 MPLSMANAFEKAVKAK 301 P ++ANAFE AV+ + Sbjct: 296 APRNLANAFETAVRVR 311 >UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RXJ6_RHORT Length = 381 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 116/382 (30%), Positives = 193/382 (50%), Gaps = 40/382 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKS---GYYAQNI 58 S F+ IH L S++ + LNRDD + K +GG R RISSQ LKR R + + Q + Sbjct: 4 SRFLQIHSLHSYTAALLNRDDSGLAKRLTYGGSNRTRISSQCLKRHWRMAEHDPHALQTL 63 Query: 59 GE--SSLRTIHLAQLRDVLRQKLGERFDQKIID----KTLALLSGKSVDEAEKISADAVT 112 G S R+ L + D++ + L R+ Q I+D + L+ G D+ +K + Sbjct: 64 GGYVGSFRSREL--VTDLVIKPLEGRYPQDILDVLEPEFQKLVYGDKADKGKK----SRQ 117 Query: 113 PWVVG--EIAWFCEQVAKAEADNLDDKKLLKVLKE-----DIAAIRVN--LQQGVDIALS 163 ++G E+AW + + A D K L K + + + A+ N L G+ AL Sbjct: 118 TLLLGQPELAWLARRAEELAAGANDAKALQKAVADWRKDANFKAMSENAALPGGLVAALF 177 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEF 219 GRM TS +D + +AHA T H +++ D+FTAVDDL+ + G+ + E Sbjct: 178 GRMVTS---DPAANIDAPVHVAHAFTVHAEEAEGDYFTAVDDLKKDESDSGADTIQETEL 234 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 +SG+FY Y I+L L N GG +E A ++ ++V+++A PGAK + A + AD++ Sbjct: 235 TSGLFYGYVVIDLPGLIGNCGG-DKEIAAQVVNNLVYLIAEVSPGAKLGSTAPYGRADLM 293 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD 338 ++ D P S+A A+ KA+ A D + ++ A + ++ Y A SL++ Sbjct: 294 LIEAGDRQPRSLATAYRKAI-APD--REQAVAALDGCLAKLDATYETGEARRYLSLAETP 350 Query: 339 ---PITAQVKQMPTLEQLKSWV 357 P T+ ++++ +L+ L W Sbjct: 351 LTGPATSGLEKL-SLKALADWT 371 >UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellular organisms RepID=Q2FNL3_METHJ Length = 382 Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 115/382 (30%), Positives = 178/382 (46%), Gaps = 59/382 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS FI IH+L S+ PS LNRDD+ K A GG +R+R+SSQSLKR+ R S ++ + G Sbjct: 1 MSEFIQIHMLASYPPSNLNRDDLGRPKTATVGGTQRIRVSSQSLKRSWRTSEAFSDALKG 60 Query: 60 ESSLRTIHLA-----------QLRDVLRQKLGERFDQKIIDKTLA-------------LL 95 +RT + L D+L K ++I D+ A + Sbjct: 61 AIGIRTRDMGVKIKKALVEGRLLSDILEGKESGVTRERIKDEKKAHEWAVKISSHFGKIE 120 Query: 96 SGKSVDEAEK------------ISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVL 143 GK D +K +S + + EIA + + + KV Sbjct: 121 GGKEKDSDKKSEKTDEKSNKNPLSHKQMVHYSPEEIAGIDDLLGRISGGE-------KVS 173 Query: 144 KEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 +D +R + + VDIAL GRM + A+ ++HAIT H + D+FTAV Sbjct: 174 DDDCIRLRSD-HKAVDIALFGRMLADNAAY---NTEAAVQVSHAITVHDTPVEDDYFTAV 229 Query: 204 DDLQE----QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASR--EQALEIATHVVHM 257 DDL + G+ H+G EF +G+FY Y IN L+ENL G + +A+E M Sbjct: 230 DDLNQLDDTAGAGHIGEAEFGAGLFYTYICINRDLLKENLQGDNELSNRAIEALIRAASM 289 Query: 258 LATEVPGAKQRTYAAFNPADMVMV-NFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYW 316 ++ P KQ ++A+ + A ++V ++ P S+A AF K V KD +++ Sbjct: 290 VS---PSGKQNSFASRSYASYLLVEKGTEQPRSLAAAFFKPVSGKD-IYGDAVKNLEGLR 345 Query: 317 DRVANGYGLNGAAAQFSLSDVD 338 DR+ N YG + + S++ +D Sbjct: 346 DRMDNAYGTSFKQSSRSMNVID 367 >UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=B1VIY1_CORU7 Length = 376 Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 110/353 (31%), Positives = 164/353 (46%), Gaps = 39/353 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK---SGYYAQN 57 MS I+I+ L S PS +NRDD + K+AIFGG R R+SSQS KRA+R+ + A N Sbjct: 1 MSKIIDIYALQSLPPSLINRDDTGVPKNAIFGGVPRQRVSSQSWKRAIRRYFFENFDAAN 60 Query: 58 IGESSLRTIHLAQLRDVLRQKLGERFDQK------IIDKTLALLSGKSVDEA------EK 105 IG+ S R L +K+ + +++ I++T L + A +K Sbjct: 61 IGDRSKR----------LPEKIARQLEEQGMEQGTAIERTEQLFKAAGIKTAVEKKPKDK 110 Query: 106 ISADAVTPWV-VGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSG 164 DA + G + + + ++ K + AI ++ + VDIA+ G Sbjct: 111 DETDAEVAYPQTGYLLFLSAHQIDNAVKAIQERDGKNFTKREAQAI-LDQEHSVDIAMFG 169 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFS 220 RM VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + Sbjct: 170 RMVADDAAY---NVDAAVQVAHALGIHDSAPEFDYFTAVDDLAEEGEETGAGMIGTVQMM 226 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 S YRYA +NL L ENL S + A + A V +P K T+A ++V Sbjct: 227 SSTLYRYATVNLEGLAENLD--SEDAAKQAAVEFVEAFIASMPTGKINTFANQTLPELVY 284 Query: 281 VNFSDM-PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAA 330 V D P+S+ NAFE V+A + G + + Q V N YG A+ Sbjct: 285 VAVRDTRPVSLVNAFEAPVEATEDKGRREVGAEVLAQEARDVENVYGFKPQAS 337 >UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA6_DESDA Length = 350 Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 104/341 (30%), Positives = 166/341 (48%), Gaps = 35/341 (10%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK--SGYYAQNIGESS 62 + +H+L S +CLNRDD+ K A+FGG +R R+SSQ KRA+R+ Q+ Sbjct: 4 LELHILQSVPVACLNRDDLGSPKTAVFGGVQRARVSSQCWKRAIREYCGELLPQHFKGER 63 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-EKISADAVTPWVVGEIAW 121 R I + LRD+ G +D+ AL+ + E + DA + Sbjct: 64 TRLI-VEPLRDIFINTYG-------LDEATALVKANDLAEGLATLDKDAAKKNKLQTKTL 115 Query: 122 F------CEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTEL 175 F E +A +N + KK K + + DIAL GRM S EL Sbjct: 116 FFTSRSELEALAAIAVNNENIKKHAKTFAQSLCT------DAADIALFGRMVASA--PEL 167 Query: 176 GKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINL 232 ++GA +HA++TH+ D++ID+F+A+DDL +E G+ GT EF++ +YR+ +NL Sbjct: 168 -TLEGAAMFSHALSTHKADNEIDFFSALDDLLPSEETGAGMTGTLEFNAAAYYRFCALNL 226 Query: 233 AQL--QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPL 288 L ++LG S ++ I V +P A++ + A V+ D P+ Sbjct: 227 DMLADADHLGALSPDERQGIVAAFVEATLKAMPVARKNSMNANTMPAYVLCVLRDSGQPV 286 Query: 289 SMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNG 327 + NAFEKAV + D G+++ SI+ + + R+ N +GL Sbjct: 287 QLVNAFEKAVYSPDGRGYVEASIKRMEEEYQRLENTWGLTA 327 >UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria RepID=A3EQA5_9BACT Length = 398 Score = 134 bits (337), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 116/355 (32%), Positives = 174/355 (49%), Gaps = 70/355 (19%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M I IHVL + +PS LNRDD KDA+FGG RR RISSQ +KR++R + + G Sbjct: 1 MKTLIEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARISSQCIKRSVRDFFCHKREDGI 60 Query: 60 ----ESSLRTIHLAQ-LRDVLRQK--LGERFDQ--------KIIDKT-----LALLSGKS 99 E +RT + Q + D+L++K + + + KI K L LS K Sbjct: 61 FSPDEIGVRTKRIYQAIADLLKEKRDISDTITKAKTALSYLKIKPKNEKTQYLLFLSPKE 120 Query: 100 VDEAEKISADAVTPW---VVGEIAWFCEQVAKAEADN--LDDKKLLKV------------ 142 + K A+A+ + +VGE E DN LD++ V Sbjct: 121 I----KDFANAIDEYWDQIVGE---------PIETDNSELDEETPDTVSLEEQKPKKGKK 167 Query: 143 ---------LKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQV 193 +E + ++ +N + +DIAL GRM + E + + A +AHAI+TH V Sbjct: 168 NKKPNIPKEFQEKLESV-LNGGKSIDIALFGRMLAD--IPEKNQ-NAACQVAHAISTHAV 223 Query: 194 DSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEI 250 + + D++TA+DDL+ GS +GT EF+S FYRYA ++L L +NL S I Sbjct: 224 EREFDYYTAIDDLKPDDTAGSDMIGTVEFNSACFYRYAVVDLEALNKNLHDDSELTNKSI 283 Query: 251 ATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKDG 303 + + +E P KQ ++AA NP + + ++ P ++ANAFE AV K G Sbjct: 284 RAFLEAFIISE-PTGKQNSFAAHNPPEFIAISVRHNAGPRNLANAFETAVFPKKG 337 >UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY0_THASP Length = 394 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 125/405 (30%), Positives = 185/405 (45%), Gaps = 70/405 (17%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKS--GYYAQNI 58 + FI IH L ++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 3 LPRFIQIHTLHTYPAALLNRDDAGLAKRLPYGGAIRTRISSQCLKRHWRVADDAFSLAKL 62 Query: 59 G-ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA---EKI--SADAVT 112 G + RT ++A+L +RQ+L E+ ID+ A + +++ EA EK + V Sbjct: 63 GVPMATRTRYVAEL---IRQRLIEQG----IDEARAYATAEALLEALFGEKADKKKEGVK 115 Query: 113 PWVVG--------EIAWFCEQVAKAEADNLDD-------KKLLKVLKEDIAAIRVNLQQG 157 G EIA+ + D D K LK K++I A++ L G Sbjct: 116 ALQTGQAVLFGNEEIAYLARRCRDITGDFSDPVALKAEVAKFLKEEKKNIEAMK--LGSG 173 Query: 158 VDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAH 213 ++ AL GRM TS + L D ++S+AHA T H+ + D+FT VDD + GSA Sbjct: 174 LESALFGRMVTSDL---LANRDASVSVAHAFTVHEAQVENDYFTVVDDFAQAEDGAGSAG 230 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIAT-----------HVVHMLATEV 262 + E +SG++Y Y I++ QL NL G E I H++H++AT Sbjct: 231 IFDTELASGLYYGYVVIDVPQLVANLEGIKVEDVFTIGADKRGLAGKVVQHLLHLIATVS 290 Query: 263 PGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKD---------GFLQPSIQAF 312 PGAK+ + A ++ A V+V D P S+A AF + K L I AF Sbjct: 291 PGAKRGSTAPYDWAKFVLVEAGDWQPRSLAAAFHDPIPLKGDSSIRGRAASKLAKEIAAF 350 Query: 313 NQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWV 357 + YG+ A SL D + + TL QL W+ Sbjct: 351 DA-------AYGMPTARRFLSL---DELAVPAAERATLSQLGEWI 385 >UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA64_9ACTO Length = 374 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 114/346 (32%), Positives = 165/346 (47%), Gaps = 34/346 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IHVL + PS NRDD K A FGG +R+RISSQ++KRA R+ G Sbjct: 1 MSVFVDIHVLQTLPPSNPNRDDTGAPKSATFGGVQRMRISSQAIKRATRQDFEGKIADGN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFD------------QKIIDKTLALLSGKSVDEAEKISA 108 +RT + +L V R +R D K I LA G D K S Sbjct: 61 YGVRTKKIVEL--VARTITEKRPDLEAASIELAEMGLKAIGFKLAEPRGNKSDNELKESG 118 Query: 109 DAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMAT 168 ++V A E V+ A + KE V+ +DIAL GRM Sbjct: 119 -----FLVFLSAKQIEHVSDALISVAHEDDPAAAFKELKPRSLVDTDHSIDIALFGRMVA 173 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ------EQGSAHLGTQEFSSG 222 VD A +AHAI V+ + D++TAVDD + ++G+ +GT EF+S Sbjct: 174 EPNAL---NVDAACQVAHAIGVGAVEREYDYYTAVDDAKKRNDEADEGAGMIGTIEFASA 230 Query: 223 VFYRYANINLAQLQENLG-GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV 281 YRYA IN+ L+ENLG A ++A+E+ V +P K T+A + V+V Sbjct: 231 TVYRYATINVDLLRENLGDDAVADRAVEL---FVDSFVRSMPTGKVTTFANRTLPEAVLV 287 Query: 282 NF-SDMPLSMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGYGL 325 D P++M+ AFE+ + A + GF +P+I F ++ ++ GL Sbjct: 288 QVRDDQPINMSGAFEEPIIAGQHGFAEPAIARFVEFESQLRELTGL 333 >UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTA9_SACVD Length = 390 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 115/349 (32%), Positives = 172/349 (49%), Gaps = 42/349 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 +I+IHV+ + S +NRDD K FGG R R+SSQS KR +R+ A GE+ Sbjct: 6 YIDIHVIQTLPFSNVNRDDTGSPKTVEFGGVERTRVSSQSWKRVVRQHVEEAVG-GETVR 64 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVD-EAEKISADAVTPWV------- 115 + + + L ++ E+ + + +AL +GK + + EK +D V Sbjct: 65 TRRVVVGVAERLIKQGWEKSEAEAAGVQIALSAGKKISLKQEKDESDEVVLTTNVLLLLP 124 Query: 116 ---VGEIAWFCEQ---VAKAEADNLDDKKLLKVLKEDIAAIRVN---LQQGVDIALSGRM 166 + E+A ++ V AEA K L +K + + R+N ++ I L GRM Sbjct: 125 ESGIDELAALADEHREVILAEAKK---AKKLTGMKPKLPSERINEILSRRSATINLFGRM 181 Query: 167 ATSGMMTEL--GKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ----GSAHLGTQEFS 220 + EL VDGA+ +AHA TTH + D+FTAVDD++++ GS ++ T FS Sbjct: 182 -----VAELPGANVDGAVQVAHAFTTHGTAVEYDFFTAVDDIEQKLDLPGSGYMDTALFS 236 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHM----LATEVPGAKQRTYAAFNPA 276 +G FYRYAN+NL L NL +Q ++A +V T VP KQ AA Sbjct: 237 AGTFYRYANVNLTDLLRNL-----DQDTDLARVLVKTFLDGFITTVPSGKQNATAAVTLP 291 Query: 277 DMVMVNF-SDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 D+V V D P+S+ANAFE V DGF++ S + + +A G Sbjct: 292 DLVHVTVRDDRPVSLANAFEAPVGGGDGFVRKSAHRLDSHAGAIAELLG 340 >UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT63_9BIFI Length = 371 Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 118/369 (31%), Positives = 183/369 (49%), Gaps = 31/369 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L PS +NRDD K A GG R R+SSQS KRAMR+ ++ + S L Sbjct: 2 FVDIHCLQQVPPSNINRDDTGSPKTAYVGGALRARVSSQSWKRAMRE--MFSSKLDSSKL 59 Query: 64 --RTIH-LAQLRDVLRQKLGERFDQ--KIIDKTLALLSGKSVDEAEKISAD---AVTPWV 115 RT +A + V+ +K + ++ + +K LA +G V +++ AD + T ++ Sbjct: 60 GKRTKSAVALISSVIAEKRPDLVEESKSLAEKVLA-ATGVKVKASDRAGADKGSSATEYL 118 Query: 116 VGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTEL 175 + EQ+A D+ K +K+++AA+ + +Q +DIA GRM +L Sbjct: 119 IFIANREVEQLADIAITAFDEGKDPSKMKKEVAAV-FHGEQAIDIACFGRMLADA--PDL 175 Query: 176 GKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFYRYANINL 232 D + +AHA + Q+ + D+FTAVDD G+A + T F+S YRYA +N+ Sbjct: 176 -NTDASAQVAHAFSIDQITPEYDYFTAVDDCASDDNAGAAMIDTIGFNSSTLYRYATVNV 234 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN-PADMVMVNFSDMPLSMA 291 L++ L A+ A+E V +P KQ T+A P D+V+V P+S A Sbjct: 235 DALKDQLQDAN--AAVEGVAAFVDAFIKSMPSGKQNTFANHTLPEDIVIVLRDSQPISAA 292 Query: 292 NAFEKAVKAKDGFLQPSIQAFNQYWDRVAN---GYGLNGAAAQF-----SLSDVDPITAQ 343 +AFE +K KDG + S Q + DR+ YG A S+ +D + Q Sbjct: 293 DAFEDPIKRKDG-ISVSRQGIERLGDRLNEIRINYGEEPVKAWHVVSGGSVHSLDEWSEQ 351 Query: 344 VKQMPTLEQ 352 V +P LEQ Sbjct: 352 V-TLPELEQ 359 >UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ62_9DELT Length = 341 Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 105/335 (31%), Positives = 165/335 (49%), Gaps = 32/335 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK--SGYYAQNIGESS 62 + +H+L S +CLNRDD K A+FG +R R+SSQ KRA+R+ G Sbjct: 4 LELHILQSVPVACLNRDDFGSPKTALFGNVQRARVSSQCWKRAVRELMQEEVPALFGGQR 63 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 R I L L +L ++ G ++ G +V + + TP V + +F Sbjct: 64 TRLI-LDPLCRILHEQHGLAEEEARKKAEEL---GAAVSKLD-------TPPVRVKTLFF 112 Query: 123 C-----EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 E +A A + KK +K L + L+ DIAL GRM S L Sbjct: 113 TSPLELEALAAAYVATGNAKKAVKELAKH------PLKDAADIALFGRMVASDHSLTL-- 164 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQ 234 +GA +HA++TH+V ++ID+F AVDDLQ E G+ GT EF+S +YR+A +NL Sbjct: 165 -EGAAMFSHALSTHKVSNEIDFFAAVDDLQPEDEAGAGMTGTLEFNSATYYRFAALNLDL 223 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTY-AAFNPADMV-MVNFSDMPLSMAN 292 L+++L S E+ E+ + V VPGA++ + AA P+ ++ +V P+ + N Sbjct: 224 LEQHLSALSAEERREVVCNFVTATLRAVPGARKNSMNAATLPSHVLAVVREKGHPVQLVN 283 Query: 293 AFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNG 327 AFEK V + G ++ S+ + + + +GL Sbjct: 284 AFEKPVWTRGGLMEESVSQLEREYTHLKETWGLEA 318 >UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET5_RHOM4 Length = 423 Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 117/382 (30%), Positives = 183/382 (47%), Gaps = 61/382 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR----KSGYYAQ 56 +S F+ IH L ++ + LNRDD K FGG R R+SSQ LK R + Y Sbjct: 2 VSAFVQIHTLTAYPAALLNRDDAGFAKRLPFGGAIRTRVSSQCLKYHWRNFSGEHALYGL 61 Query: 57 NIGES--SLRTIHLAQLRDVLRQKLGERFD-------QKII--DKTLAL-----LSGKSV 100 ++ S S T R ++ + R QK+I D++L+ L V Sbjct: 62 DVPRSLRSRETFKRCIARPLVEEGYPLRLVVAFALHLQKLIVSDESLSKTDFKKLMSDEV 121 Query: 101 DEA---EKISADAVTPWVVGEIAWFCEQVAKA----------EADNLDDKKLLKVLKEDI 147 D+A +++ ++ V E+ + ++ + A L D++L +V +E Sbjct: 122 DDATLLDQLKSNQVIILGRPEVDYLTRRIRERLDALREVWADAAAPLSDEQLERVYQELQ 181 Query: 148 AAIRVNLQQ---------GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDID 198 A + L++ G+D AL GRMATS + L + D A+ +AHA TTH +S+ D Sbjct: 182 AIGKGELKKNLKGLYLAAGLDAALFGRMATSDV---LARGDAAIHVAHAFTTHAEESESD 238 Query: 199 WFTAVDDLQEQ------GSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASR 244 +FTAVD+L Q GS HL QE +SG+FY Y +++ L NL G A R Sbjct: 239 YFTAVDELVAQEGEGELGSGHLNNQELTSGLFYGYVVVDVPLLVSNLEGVPPAAWQEADR 298 Query: 245 EQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVKAK-D 302 A E+ ++H++AT PGAK + A A ++V + P ++ANAF + V + Sbjct: 299 TLAAEVVRRLLHLIATVSPGAKLGSTAPHAYAQFMLVEWGRSQPRTLANAFHRPVSLDGE 358 Query: 303 GFLQPSIQAFNQYWDRVANGYG 324 G L S +A +Y +++ YG Sbjct: 359 GVLVNSYRALGRYVEQMDRMYG 380 >UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF9_GRABC Length = 386 Score = 131 bits (330), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 120/393 (30%), Positives = 180/393 (45%), Gaps = 53/393 (13%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKR--AMRKSGYYAQNI--G 59 F+ IH L S++ S LNRDD + K +G R RISSQ LKR M + + I Sbjct: 6 FLQIHSLHSYTASLLNRDDSGLAKRLPYGSAVRTRISSQCLKRHWRMDEGTFSLHRIEGA 65 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG-- 117 E ++R+ L R LR+ L D I++ + + P + G Sbjct: 66 EEAVRSRDLVTKR--LREPLQGTVDVNILNAIEPAFQAAVYGKKGADDKSSRQPLLFGAP 123 Query: 118 EIAWFCEQV------------AKAEADNLDDKKLLKVLKEDIAAIR--VNLQQGVDIALS 163 E+ + EQ AKA A++ KL + + A+R V+L G+ AL Sbjct: 124 ELRYLAEQFTRIATSATDPKSAKAAAEDFTKDKL---FQNTMKAMRDSVSLPGGLTSALF 180 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFS 220 GRM TS +D + +AHA TTH ++ D+F VDDL ++ G+ H+G+ E + Sbjct: 181 GRMVTSD---PEANIDAPVHVAHAFTTHAEQTESDYFAVVDDLAGVEDTGADHIGSTELT 237 Query: 221 SGVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGAKQRTYAA 272 SG+FY Y I++ L NL G A R+ A E+ ++ +AT PGAK + A Sbjct: 238 SGLFYGYVVIDVPTLVSNLTGVAASNWLAADRKMAAEVTACLIGQIATVSPGAKLGSTAP 297 Query: 273 FNPADMVMVNFSD-MPLSMANAF----EKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNG 327 + A ++V D P S+A AF E VK + L ++AF++ Y Sbjct: 298 YGYATTMLVEAGDRQPRSLAEAFRDPAEPTVKDAEDKLHQKLKAFDE-------AYQTGE 350 Query: 328 AAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 SLS+ DP V + +L +L WVR+ Sbjct: 351 DRRLLSLSN-DPGIKNVSRT-SLPELMQWVRDT 381 >UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=Acetobacteraceae RepID=A5FTJ7_ACICJ Length = 370 Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 111/356 (31%), Positives = 169/356 (47%), Gaps = 46/356 (12%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKS--------- 51 M+ F+ +H+L PS +NRDD K A+ GG R+R+SSQ+LKRA R S Sbjct: 1 MTQFLQVHLLTFFPPSNMNRDDTGRPKTAMVGGAMRLRLSSQALKRAWRTSTIFSEALKG 60 Query: 52 --GYYAQNIGESSLRTIHLAQLRDV----LRQKLGERFDQKIIDKTLAL---LSGKSVDE 102 G Q +GE L+T+ + +V + + + +F + D+T A L+ S DE Sbjct: 61 YMGERTQRLGEEILKTLQAEGVSEVQALAVARAVAGQFGKLNEDETPARIQQLAFISPDE 120 Query: 103 AEKISADAVTPWVVGEIAWFCEQVAKAEADN-------LDDKKLLKVLKEDIAAIRVNLQ 155 K + D + GE+ + K N ++ ++L + + D AA Sbjct: 121 -RKAAFDLARRYAAGELPLPEKAKGKRGKANKTEGEEEVEAPEILLLRESDTAA------ 173 Query: 156 QGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGS 211 DIAL GRM + A +AHAITTH++ D D++TAVDDL ++ G+ Sbjct: 174 ---DIALFGRMLAD---KPAFNREAAAQVAHAITTHRISVDDDYYTAVDDLKRPSEDAGA 227 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGA--SREQALEIATHVVHMLATEVPGAKQRT 269 +G F SGVFY Y +IN+ L NLGG +R+ A +V AT P KQ + Sbjct: 228 GFIGETGFGSGVFYTYMSINIDLLIRNLGGGDQARDLAATAIAALVEAAATTAPSGKQNS 287 Query: 270 YAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 +AA A ++ P ++A AF K V+ D + SI ++ + + YG Sbjct: 288 FAAHGRAGYILAERGKAQPRTLAGAFAKPVEGGD-IMDASIGRLEEFREAIDKAYG 342 >UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ0_STRMN Length = 359 Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 104/350 (29%), Positives = 175/350 (50%), Gaps = 40/350 (11%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I+ + + PS +NRDD K +GG RR R+SSQS K+AMR Y+ ++ E L Sbjct: 11 FLDIYAIQTLPPSNINRDDTGSPKTTQYGGVRRARVSSQSWKKAMR--DYFYEHAEEEQL 68 Query: 64 --RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT +K+ +KII + + L +S A I A P G++ + Sbjct: 69 GKRT-----------RKVVNYVAEKIIHQKIDLNEKESSKLATDILKLAGVP-TDGKVLF 116 Query: 122 F-----CEQVAKAEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTEL 175 F E++A A + DK + K+++ ++A +D+AL GRM + TE Sbjct: 117 FIGNTEAEKLATAAVKGVKDKEEARKIMQSNLA---------LDVALFGRMVANDKETE- 166 Query: 176 GKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAH---LGTQEFSSGVFYRYANINL 232 D + AH I+TH V ++ D++TAVDDL A LGT EF+S YRYAN+ + Sbjct: 167 --ADASSQFAHPISTHAVQTEFDFYTAVDDLASDDDAKAGMLGTVEFNSSTLYRYANVAI 224 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN-PADMVMVNFSDMPLSMA 291 + G +RE ++ + A +P K ++A P +++ SD P+++ Sbjct: 225 HEFLVQRG--NREDLVDSLQLFIKAFAESMPRGKINSFANQTIPQTLIITVRSDRPVNLV 282 Query: 292 NAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPIT 341 +AFE+ VK+ +G++ SI+ ++ + +V + SL +V+ +T Sbjct: 283 SAFEEPVKSSNGYVTKSIEKLSKEFVKVEKMVKKPVLSFYVSLEEVEALT 332 >UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacterium RepID=C3PF94_CORA7 Length = 384 Score = 130 bits (327), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 104/330 (31%), Positives = 158/330 (47%), Gaps = 41/330 (12%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN--- 57 MS I+IH L + PS +NRDD K AIFGG R R+SSQS KRA+R Y+ +N Sbjct: 1 MSLVIDIHALQTLPPSLINRDDTGAPKSAIFGGVPRQRVSSQSWKRAIR--NYFEKNVDP 58 Query: 58 --IGESSLRTIH-----------------LAQLRDVLRQK-LGERFDQKIIDKTLALLSG 97 +G+ S R + Q+ D+ + + D K I + L Sbjct: 59 EFVGDRSKRLPEKIAKLVENHDGWDSERAIKQVSDLFKAAGISTEVDSKRIKE----LEK 114 Query: 98 KSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQG 157 ++ E++ +A P I +Q+ +A +D + +K+ A + ++ Q Sbjct: 115 SDAEDKEELIKEASYPRTKYLIFLSPQQIDRAVRAIVDADG--EKIKKAEAKVILDTQHS 172 Query: 158 VDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAH 213 VD+A+ GRM VD A+ +AHA+ H + D+FTAVDDL +E G+ Sbjct: 173 VDMAMFGRMIADDAAF---NVDAAVQVAHALGIHSSAPEFDYFTAVDDLAEDGEETGAGM 229 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +GT + S YR+A +N+A L +NL AS E A + A V +P K T+A Sbjct: 230 IGTVQMMSSTLYRFATVNVAGLTKNL--ASEENAKQAAVQFVDAFIKSMPTGKINTFANH 287 Query: 274 NPADMVMVNFSDM-PLSMANAFEKAVKAKD 302 ++V V D P+S+ AFE+ V+A D Sbjct: 288 TLPELVYVTVRDTRPVSLVTAFEEPVQATD 317 >UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=Anaeromyxobacter RepID=B4UE70_ANASK Length = 413 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 118/410 (28%), Positives = 183/410 (44%), Gaps = 57/410 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK-------SGY 53 M+ F+ IH L S+ S LNRDD K FGG R R+SSQ LKR R SG Sbjct: 1 MNRFVQIHTLTSYPASLLNRDDAGFAKRIPFGGVTRTRVSSQCLKRHWRTFEGEGALSG- 59 Query: 54 YAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA---------- 103 Q + S T ++ ++ + + +++ ++ + GKS A Sbjct: 60 LGQPMSVRSRYTFDELVVQPLVGEGVPAELAREVTRALMSEVLGKSAKAAKADARADEKE 119 Query: 104 ----------EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAA-IRV 152 + +T E+A+ E D K+ K + + + A R Sbjct: 120 EEEDKDAKTESTLQTGQITVLGRPEVAYLLELARTVCRKKPDPAKIAKAVSDHLGADGRK 179 Query: 153 NLQQ-----GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ 207 NL++ G+D A+ GRM TS + L + D A+ +AHA T H ++ D+F+AVDDL Sbjct: 180 NLRELRLGAGLDAAMFGRMVTSDI---LARGDAALHVAHAFTVHGEATETDYFSAVDDLP 236 Query: 208 ------EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATH 253 QGS H+G E +SG+FY Y I++ L NL G A R+ A ++A Sbjct: 237 MARTEDGQGSGHIGNAELTSGLFYGYVVIDVPLLVSNLEGVDRKAWEKADRKLAAQLAER 296 Query: 254 VVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAF 312 +V ++AT PGAK + A A +V+ + P ++ANAF + V P A+ Sbjct: 297 MVKLVATVSPGAKLGSTAPHAYAHLVLAESGNAQPRTLANAFLEPVVTGPRQPDPVAAAY 356 Query: 313 NQYWDRVANGYGLNGAAAQFSLSDVDPI--TAQVKQMP---TLEQLKSWV 357 A+ + G A Q L+ + P A V + P +L ++ +WV Sbjct: 357 RALARHSADLDRMYGPAFQRRLAAIGPADGLADVLRAPANASLAEVATWV 406 >UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=Dehalococcoides RepID=D0Y919_9CHLR Length = 427 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 109/379 (28%), Positives = 179/379 (47%), Gaps = 60/379 (15%) Query: 7 IHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-GESSLRT 65 IH++ + +PS LNRDD K A FGG RR RISSQ KR+ R G A+ + + ++RT Sbjct: 10 IHLIQNFAPSNLNRDDTGQPKSATFGGFRRARISSQCSKRSTRLQGPLAELLENQGAVRT 69 Query: 66 IHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT--PWVVGE---- 118 L ++ + K E D++ I+ + ++ K S + +GE Sbjct: 70 RQLIMEIAKAIDTK--EEPDERTIEIVAGVFEAGGLERPAKRSGKVKSQAAEAIGEDGEI 127 Query: 119 -------------IAWFCEQVAKAE-----ADNLDD-----KKLLKVLKEDIAAIRVNLQ 155 I F +++A + +N DD K++ + + + + Sbjct: 128 NGNEGFESGNKTKILLFLDKMAFPKLIDVFKENWDDLAKGNKEVKEKACDKVGRLLFEAV 187 Query: 156 QGVDIALSGRMATSGMMTELGK----VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---E 208 + DIAL GRM T GK V+ A +AH I+TH++D ++D++TAVDDL E Sbjct: 188 KAPDIALFGRMLEVKNNTPFGKYNMSVEAACQVAHPISTHKIDMEMDFYTAVDDLNPDGE 247 Query: 209 QGSAHLGTQEFSSGVFYRYANINLAQLQENL--------GGASR-------EQALEIATH 253 G+ +G F+S +YRYA ++ QL NL GG ++ E+A ++ Sbjct: 248 TGAGMMGVVGFNSACYYRYALVDRDQLARNLARKTERKNGGWAQGLETQDYEEADKVVKA 307 Query: 254 VVHMLATEVPGAKQRTYAAFN-PADMVMVNF-SDMPLSMANAFE---KAVKAKDGFLQPS 308 + + +P KQ ++AA N P+ + V +P+S+ANAF + V+ D + S Sbjct: 308 FLEAMIYAIPTGKQNSFAAQNLPSFGLFVKRKGGVPVSLANAFSTPIRPVRDDDDLVGLS 367 Query: 309 IQAFNQYWDRVANGYGLNG 327 + A ++WD + YG G Sbjct: 368 VNALTKHWDAIKELYGDQG 386 >UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RY18_RHORT Length = 359 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 116/366 (31%), Positives = 176/366 (48%), Gaps = 40/366 (10%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ +HVL +++ S LNRDD K FGG R+R+SSQSLKRA R+S + + G Sbjct: 1 MSRFLQLHVLTAYAASNLNRDDTGRPKTLNFGGAERLRVSSQSLKRAFRQSELFQSRLPG 60 Query: 60 ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA-----VTP 113 E R+ A+ L L + E + + I + AL+ + + +K A + P Sbjct: 61 ELGTRSQDFAKALVSALVARGVE--EAEAITRAEALIDHDKLGKVKKGKAQTEQLVHLGP 118 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRM--ATSGM 171 + I E++A + LDDK +L VLK + VDIA+ GRM G Sbjct: 119 DELAAIDALAERLATSA--TLDDKAML-VLKSK--------PRAVDIAMFGRMLAGNPGF 167 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL------QEQGSAHLGTQEFSSGVFY 225 V+ A+ +AHA TTH+ + D++T VDD+ +++G+ LG E+ SG+FY Sbjct: 168 -----NVEAAVQVAHAFTTHRATPEDDYYTTVDDIKNADQEEDRGAGFLGILEYGSGLFY 222 Query: 226 RYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS- 284 Y IN L +NL G + A E A ++ T P KQ T+A+ ++ Sbjct: 223 LYICINADLLVDNLAG-DQALAAEAAALLIEAACTISPTGKQNTFASRARGLYALLEIGE 281 Query: 285 DMPLSMANAFEKAVKAKDG---FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPIT 341 + P S+A AF+ AV ++ L SIQ + + YG N +L DP T Sbjct: 282 ETPRSLAAAFQYAVGSRATEADHLAASIQRLTALREGFSKAYGEN--LRSVALDVTDPAT 339 Query: 342 AQVKQM 347 +K + Sbjct: 340 PGLKAL 345 >UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=Proteobacteria RepID=B3E5V0_GEOLS Length = 356 Score = 124 bits (312), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 105/318 (33%), Positives = 155/318 (48%), Gaps = 33/318 (10%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F+ IH+L S+ P+ LNRDD K A GG R+R+SSQSLKRA R S + Q + E Sbjct: 1 MSRFVQIHLLTSYPPANLNRDDQGRPKTAKMGGYDRLRVSSQSLKRAWRTSDLFQQALTE 60 Query: 61 S-SLRTIHLAQL------RDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA--- 110 RT L + +++K + QKI AL K D + + Sbjct: 61 HVGTRTKLLGVMAYEKLVAGGVKEKQAKESAQKIAGVFGALKKAKEKDSLVDLEIEQLVH 120 Query: 111 VTPWVVGEIAWFCEQ-VAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATS 169 V+P + I E +++ A + LL++ Q DIA+ GRM S Sbjct: 121 VSPSEIQAIESLLETLISQGRAPEDTELDLLRIQG-----------QSADIAMFGRMLAS 169 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFY 225 + V+ A +AHAI+ H V + D+FTAVDDL ++ G+AH+G F++G+FY Sbjct: 170 ---SPSYNVEAACQVAHAISVHPVVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAAGLFY 226 Query: 226 RYANINLAQLQENLGGASREQALEIATHVVHMLATEV-PGAKQRTYAAFNPADMVMVNFS 284 Y IN L ENLGG E ++ + + A +V P KQ ++ + A V+ Sbjct: 227 SYICINRTLLVENLGG--DEALVQKSIQALIEAAVKVPPNGKQNSFGSRAYASYVLAEKG 284 Query: 285 D-MPLSMANAFEKAVKAK 301 D P S++ AF K V ++ Sbjct: 285 DQQPRSLSVAFLKPVTSQ 302 >UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE3_PROAC Length = 374 Score = 124 bits (311), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 115/376 (30%), Positives = 182/376 (48%), Gaps = 32/376 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR---KSGYYAQNI 58 S +++IHV+ S PS +NRDD K A++GG RR R+SSQ+ K+A+R K A Sbjct: 3 SYYVDIHVIQSVPPSNVNRDDTGSPKSALYGGVRRARVSSQAWKKAVRTSFKEFLPANQT 62 Query: 59 GESSLRTIHLAQLRDVLRQKLG---ERFDQKIIDKTLAL-LSGKSVDEAEKISADAV--T 112 G +LR + L R + G E QK ++ AL L + + ++ A+ + T Sbjct: 63 GSRTLRVVELLMNR-LTAAPYGLPEEDARQKALEVVKALGLKAEKPRKKDESGAEGIERT 121 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 ++V +++A+ A D K+ + A + G+++AL GRM Sbjct: 122 QYLVFYSNQQLDRLAQLAATT--DGKITATDAKKAA----DSDHGIEVALFGRMVAD--- 172 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYA 228 ++ VD A+ +AHA++TH V+ + D+FTAVDD + + G+ +GT EF+S YR+A Sbjct: 173 SKDLNVDSAVQVAHALSTHAVEIESDYFTAVDDYKLDEDDAGAGMIGTVEFTSETLYRFA 232 Query: 229 NINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMP 287 + ++ L++NLG + + A+ V +P KQ T+A D V+V Sbjct: 233 TVAVSTLKDNLGDV--DLTAQAASAFVRGFIMSMPTGKQNTFANNTIPDAVVVQVRKGRS 290 Query: 288 LSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL---SDVDPITAQ 343 S AFE V + D GF+ S QA Y + L A F S + I Sbjct: 291 ASFIGAFEDPVTSDDGGFVAASCQAVAAYAHDCEEAF-LGAPEASFVTRVGSRTEAIGTM 349 Query: 344 VKQMPTLEQLKSWVRN 359 QMP ++ L S VR+ Sbjct: 350 GTQMP-IDDLVSSVRD 364 >UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y487_9BACT Length = 408 Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 113/373 (30%), Positives = 177/373 (47%), Gaps = 56/373 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKS-GYYA-QNI 58 + FI I L ++ S LNRDD + K FGG R R+SSQ LKR R + G ++ QN+ Sbjct: 4 LPRFIQISTLTTYPASLLNRDDSGLSKRIPFGGVSRTRVSSQCLKRHWRMADGLWSLQNV 63 Query: 59 GE---SSLRT--IHLAQLRDVLRQKLGERFD-QKIIDKTLAL---LSGKSVDEAEKIS-- 107 + +S+R+ I ++ L +K G D +K++ + AL L G EA + Sbjct: 64 DKDIATSIRSRRIFPEKIEKPLIEKEG--LDAEKVVAASQALQSELYGAKGTEAAAKNKK 121 Query: 108 -----ADAVTP-------------WVVG--EIAWFCE---QVAKAEADNLDDKK----LL 140 ADA+ P V+G EI + + ++A A+ D K Sbjct: 122 TAKDDADALNPSIDAQLSAERSELVVLGHPEIQFLSKIVREMASADGSAADVGKKTGEWF 181 Query: 141 KVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWF 200 K K+D A++ G+D A+ GR + +V A+ +AHA T H +S+ D+F Sbjct: 182 KKHKKDFQALKCG--AGLDAAMFGRFISGDTD---ARVSAAVHVAHAFTVHAEESETDYF 236 Query: 201 TAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIAT 252 TAVDDL GSAH+ E +SG+FY Y +++ QL N+ G A R+ A + Sbjct: 237 TAVDDLNNSGSAHINAAELTSGIFYNYVVVDVPQLVSNIEGCPSKQWQTAQRDVAGRLVK 296 Query: 253 HVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQA 311 H++H++AT PGAK + A + VM + P ++A+AF V + +++ Sbjct: 297 HLLHLIATVTPGAKLGSTAPYARPWFVMAEAGESQPHTLADAFYLPVPLRGDMRAQALRQ 356 Query: 312 FNQYWDRVANGYG 324 Y + YG Sbjct: 357 LEDYVGKSDEMYG 369 >UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R6_9BACT Length = 400 Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 102/334 (30%), Positives = 168/334 (50%), Gaps = 42/334 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNIGES 61 F+ I L ++S S LNRDD + K FG R RISSQ LKR R +G Y G++ Sbjct: 7 FVQISTLTTYSASLLNRDDSGLAKRIPFGDSVRTRISSQCLKRHWRNAGGPYGLDKAGDA 66 Query: 62 -SLRTIHLAQLRDVLRQKL-GERFDQKII----DKTLALL-SGK-------------SVD 101 SL +++ + L E +QK++ K LL +G+ +D Sbjct: 67 LSLSVRSRFSFPELIEKPLVAEGLEQKLVVSGSQKLQQLLYNGEEKGDTKKDKKKKIELD 126 Query: 102 EAEKISADAVTPWVVG--EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ--- 156 E + SA V+G E+ + + + A + + + K++ +K+ + NL Sbjct: 127 E-DGYSAKRNELVVLGRPELEYLKQIIRDAISSSSNIKEIDNAVKDFYTKRKSNLLALRA 185 Query: 157 --GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHL 214 GVD A+ GR + + KV A+ +AH+ T H S+ D+FTAVDDL EQG+ H+ Sbjct: 186 GCGVDAAMFGRFVSGDVD---AKVTAAVHVAHSFTIHGEQSETDYFTAVDDLVEQGTGHI 242 Query: 215 GTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGAK 266 E ++G++Y Y +++ QL NL G A R A ++ ++++H++AT PGAK Sbjct: 243 NAAELNTGIYYGYVVVDVPQLISNLCGCDSKNSADADRTLAAQVTSNLIHLMATVTPGAK 302 Query: 267 QRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVK 299 A + + +V+ +SD P ++A+AF + +K Sbjct: 303 LSGTAPYAASWLVLAEWSDSQPRTLADAFFEGLK 336 >UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=Actinomycetales RepID=C7QEM5_CATAD Length = 399 Score = 123 bits (308), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 101/353 (28%), Positives = 170/353 (48%), Gaps = 35/353 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 ++IH+L + PS LNRDD K A++GG RR R+SSQ+ KRA R++ + E + Sbjct: 5 ILDIHILQTVPPSNLNRDDTGSPKTAVYGGVRRARVSSQAWKRATRQAFGDLLDPSELGV 64 Query: 64 RTIHLAQ--------LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWV 115 RT +A+ L L ++I S ++ + +D Sbjct: 65 RTKRVAEQIANRMTALEPSLSPGDAVAVAVEVIKAATGAKSEVPKRKSAAVKSDQDATAA 124 Query: 116 VGEIAWFCEQVAKAEADNL------DDKKLLKVLKEDIAAIRV----NLQQGVDIALSGR 165 + E + +++++ +NL K + LK+ RV + + VDIAL GR Sbjct: 125 LPETGYLM-FLSESQLNNLARLGVEGSKDITAFLKDKDFKNRVRQAADTRHSVDIALFGR 183 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSG 222 M T++ VD A +AHAI+ H V+++ D+FTAVDD E G+ +G +F++ Sbjct: 184 MVADA--TDI-NVDAAAQVAHAISVHAVENESDYFTAVDDRSTEAEPGAGMIGIVDFNAA 240 Query: 223 VFYRYANINLAQLQENLG-----GASREQALEIATHV-VHMLATEVPGAKQRTYAAFNPA 276 YRYA +++ +L +NLG G S+ + + A + A +P K T+ Sbjct: 241 TLYRYAAVDVNRLADNLGAGLLEGESQTEPVRRAVEAFIRGFALSMPTGKVNTFGNHTVP 300 Query: 277 DMVMVNF-SDMPLSMANAFEKAVKAKD---GFLQPSIQAFNQYWDRVANGYGL 325 D+V+V + P+S A AFE+A+ A + G+L+ + + Y ++ Y L Sbjct: 301 DVVLVKLRASRPISFAAAFEEAISAGEHQGGYLKGACERLASYIPKLEQAYDL 353 >UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X7_9DELT Length = 385 Score = 123 bits (308), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 116/351 (33%), Positives = 169/351 (48%), Gaps = 58/351 (16%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS FI +H+L S+ + LNRDD+ K FG R+R+SSQSLKRA R S + +G Sbjct: 1 MSRFIQLHILTSYPAANLNRDDLGAPKSMRFGEANRLRVSSQSLKRAWRTSDVFKATLGA 60 Query: 61 SSL--RTIHLA-QLRDVLRQ--KLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWV 115 L RT L ++ L Q L +D TLA L K+ E + A V Sbjct: 61 DHLGVRTKELGRKVFCALTQGASLDAVWDAPDATGTLAALKEKTAAEIARTIA-----GV 115 Query: 116 VGEIAWFCEQVAKAEADNLDDKK----------LLKVLKED---IAAIR----------- 151 G+I + A+ +AD + +K L V +E+ +AA+ Sbjct: 116 FGKIKKEADAKAEKDADPVKKRKELLDSLEIEQLAHVSQEERRAVAALTEACRDAGKAPD 175 Query: 152 ---VNL----QQGVDIALSGRM-ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 +NL + DIA+ GRM A S V+ A+ +AHA+T H+ ++ D+FTAV Sbjct: 176 ANALNLLRSDAKAADIAMFGRMLAASARFN----VEAAVQVAHAVTVHRAVAEDDFFTAV 231 Query: 204 DDLQ--EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQAL--EIATHVVHMLA 259 DDL + G+ H+G EF +GV+Y Y I+ A L ENLGG ++AL + T + Sbjct: 232 DDLNRDDAGAGHMGVSEFGAGVYYLYLCIDRALLAENLGG---DEALVQKALTALTTAAC 288 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAV----KAKDGFL 305 T P KQ +YA+ A + D P +++ AF K V + +DG L Sbjct: 289 TVAPTGKQASYASRAYACFALAEKGDDTPRNLSLAFLKPVGEREEERDGHL 339 >UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG89_9BIFI Length = 387 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 107/333 (32%), Positives = 159/333 (47%), Gaps = 47/333 (14%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY-----AQNI 58 F++IH + PS +NRDD K A GG R R+SSQ+ KRAMR G + + + Sbjct: 2 FMDIHCIQQVPPSNINRDDTGSPKTAYVGGALRSRVSSQAWKRAMR--GVFDDMLDSDKL 59 Query: 59 GESSLRTIHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 G+ + + L A R L E ++ + + + L G V + + +D T +V Sbjct: 60 GKRTKGVVALIASSITAKRPDLAESAEE--LGQRVLALEGIGVKASNRAGSDKGT--LVT 115 Query: 118 EIAWFCEQVAKAEADNLDD-----------------KKLLKVLKEDIAAIRVNLQ----- 155 + F +A E D L D K L K K D+A ++ + Sbjct: 116 DYLIF---IANNEIDKLADWAIAASDKGRDFSKVGKKGLSKAEKTDLAKMKNEVSEIFHG 172 Query: 156 -QGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GS 211 Q +DIAL GRM + +L D + +AHA + Q+ + D+FTAVDD + G+ Sbjct: 173 PQAIDIALFGRMLANA--PDL-NTDASAQVAHAFSIDQITPEYDYFTAVDDCASEDNAGA 229 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYA 271 A L T F+S YRYA +N+ L+E L AS A+E A V +P KQ T+A Sbjct: 230 AMLDTVGFNSSTLYRYAAVNIDALKEQLQDAS--AAVEGAVAFVEAFIKSMPSGKQNTFA 287 Query: 272 AFN-PADMVMVNFSDMPLSMANAFEKAVKAKDG 303 P D+V+V P+S A+AFE+ V+ K+G Sbjct: 288 NHTLPEDVVVVLRDSQPISAADAFEEPVRRKEG 320 >UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD1_METCA Length = 414 Score = 120 bits (302), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 107/383 (27%), Positives = 174/383 (45%), Gaps = 73/383 (19%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESS 62 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R+S Q I + Sbjct: 2 FLQIHSLTSYHATLLNRDDAGLAKRIPFGDAVRLRVSSQCLKRHWRES--LKQTIPLPTG 59 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLA----------------------------- 93 LRT H V +++ R Q+ ++ +LA Sbjct: 60 LRTRH------VFEREIYPRLKQEGVEDSLAKQLTLSLMGLLLQKSDKTAKPEKAKKGKN 113 Query: 94 ---------LLSGKSVDEAEKISADAVTPWVVG--EIAWFCEQV-AKAEADNLDDKKLLK 141 G +E+ P + G E+ + + A AE + +K L Sbjct: 114 GHEEQAEFDFEEGAGTEESSAGDLRVKQPILFGRPEVDYLISLLKACAEEGSGAEKALQA 173 Query: 142 VLKEDIAAIRV--------NLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQV 193 LK D A + +L G++ AL GR TS +++ + D A+ +AH+ T H + Sbjct: 174 KLKGDKANFKAMLKAAGHGDLYAGLEGALFGRFVTSDVLS---RSDAAVHVAHSFTVHGL 230 Query: 194 DSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG------ASR 244 D+++D+FT VDDL +E G+AH G E +G+FY Y +++ L NL G A + Sbjct: 231 DTEVDYFTVVDDLNREEETGAAHAGDMELGAGLFYGYVAVDIPLLVSNLTGCDTTRWAEQ 290 Query: 245 EQA--LEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAK 301 E A ++ T ++ +AT PGAK A + ++ V++ P +++NA+ +A+ + Sbjct: 291 EPADVRKVLTGLIRAIATVSPGAKLGATAPYAFSEFVLLETGKQQPRALSNAYLQALPMR 350 Query: 302 DGFLQPSIQAFNQYWDRVANGYG 324 LQ +I A +Y + YG Sbjct: 351 GDPLQAAIDALAKYLRALDAMYG 373 >UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=Bacteria RepID=Q3A5Z5_PELCD Length = 373 Score = 120 bits (301), Expect = 7e-26, Method: Compositional matrix adjust. Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 44/328 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS FI +H+L S+ P+ LNRDD+ K A GG R+R+SSQSLKRA R S + + + Sbjct: 1 MSRFIQLHLLTSYPPANLNRDDLGRPKTAKMGGVDRLRVSSQSLKRAWRTSDLFGKTVKN 60 Query: 61 S-SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLAL---------------LSGKSVDEAE 104 RT + +K+ ER +K I AL L+ K + Sbjct: 61 GLGTRTKEMG-------RKVYERLVEKGIGHKDALSWAGAIAGVFGKLKKLTDKEKTALK 113 Query: 105 KISADAVTPWVVGEIAWFCEQVA----KAEADNLDDKKLLKVLKEDIAAIRVNLQQ---- 156 K++ + + E+ EQ+A + E LD + KE +NL + Sbjct: 114 KLATEERREKELVEVE--IEQLAFFDLEEEQAVLDLTNSIAERKEGPQPEELNLLRQKMT 171 Query: 157 GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSA 212 VDIAL GRM S + V+ A +AHAI+ H + + D+FTAVDDL ++ G+A Sbjct: 172 SVDIALFGRMLAS---SPAFNVEAACQVAHAISVHPIVIEDDYFTAVDDLNDGSEDAGAA 228 Query: 213 HLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEV-PGAKQRTYA 271 H+G F++G+FY Y IN L ENLGG E + A + A +V P KQ ++A Sbjct: 229 HIGETGFAAGLFYSYICINRDLLAENLGG--DEDLAQRAIAALTEAAVKVPPNGKQNSFA 286 Query: 272 AFNPADMVMVNFSD-MPLSMANAFEKAV 298 + A V+ + P S++ AF K + Sbjct: 287 SRAYASYVLAEKGEQQPRSLSVAFLKPI 314 >UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP61_9PROT Length = 400 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 100/336 (29%), Positives = 162/336 (48%), Gaps = 49/336 (14%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 FI IH L ++ + LNRDD + K G R RISSQ LKR R S Sbjct: 5 RFIQIHTLHTYPAALLNRDDAGLAKRLPLGNAVRTRISSQCLKRHWR---VVEDRFALSC 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA--------EKISADAVTPW 114 L + R L + + +R + + +T+A + +++ +A EK DA+ Sbjct: 62 LDVPMAIRSRGTL-ELISKRIQESGVSETMAQAAAEAMRDAGLLDKGGKEKKGDDALKTG 120 Query: 115 ---VVG--EIAWFCEQVAKAEADNLDDKKLLKVL---------KEDIAAIRVNLQQGVDI 160 ++G EI + + +D +++K +++ K +I A++ G++ Sbjct: 121 QAVLLGKPEIDYLVRRCVDLASDGVEEKGFKELITLWLKGKDEKRNIEALKHG--SGLES 178 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGT 216 AL GRM TS ++T + A+ +AHA T HQ + D+FT VDDL E GSA + Sbjct: 179 ALFGRMVTSDVLTSR---EAAVYVAHAFTVHQAQVENDYFTVVDDLLQDAGELGSAGIFD 235 Query: 217 QEFSSGVFYRYANINLAQLQENLGGAS-------------REQALEIATHVVHMLATEVP 263 E +SG++Y Y +++ QL +NL G R A ++ H++H++AT P Sbjct: 236 TELASGLYYGYVVVDVPQLVQNLEGEDFNECFASGTPADRRVLAGQVVQHLLHLIATVSP 295 Query: 264 GAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAV 298 GAK+ + A F+ A ++V D P S+A AF A+ Sbjct: 296 GAKRGSTAPFDWAKFMLVEAGDWQPRSLAGAFHDAL 331 >UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP1_SYMTH Length = 379 Score = 117 bits (293), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 108/345 (31%), Positives = 174/345 (50%), Gaps = 33/345 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ +H+L + + S LNRDD K +FGG RR RISSQ LKRA+R + E +L Sbjct: 2 FVEMHLLQNFALSNLNRDDTGAPKSCVFGGTRRARISSQCLKRAVRTY------VREQAL 55 Query: 64 RTIHLAQLRDV-LRQKLGERFDQKIIDKTLA-LLSGKSVDEAEKISADAVTPWV--VGE- 118 L R L+++L R ++ A ++ ++++ E + T ++ VGE Sbjct: 56 VPSELLSYRTKWLQRELANRLAAGGVEAEQAGQVAARALELLEFRLKNGRTEYLLMVGER 115 Query: 119 ----IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQG---VDIALSGRMATSGM 171 IA C + A A D + K +++A + + G VDIAL GRM + Sbjct: 116 EIARIADLCREHAAALQGG-DGGRKSKKEGDNLAGLFLKALDGGDAVDIALFGRM----I 170 Query: 172 MTELGK-VDGAMSIAHAITTHQVDSDIDWFTAV------DDLQEQGSAHLGTQEFSSGVF 224 T K VD A+ +AHA +T+ + ++ D+++AV DD + G+ LGT ++S + Sbjct: 171 ATHPEKNVDAAVQMAHAFSTNAIANEFDFYSAVDDLQQQDDDEGAGAGMLGTVLYNSSCY 230 Query: 225 YRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS 284 YRYAN++L QL NLGG ++AL + + VP K+ A NP ++M Sbjct: 231 YRYANVDLRQLLTNLGG-DPDRALTAVRAFLLGMVHAVPTGKRTNSAPQNPPALIMAVVR 289 Query: 285 DMPL-SMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNG 327 + L S+ANAF V A+ ++ S + +W++++ YG G Sbjct: 290 EHGLWSLANAFVVPVSGARGNLMELSAKEMLAHWNQLSELYGQEG 334 >UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD5_SACVD Length = 368 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 95/333 (28%), Positives = 155/333 (46%), Gaps = 24/333 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L + S +NRD++ K +GG R+R+SSQ+ KRA+RK+ Q++ + + Sbjct: 2 FVDIHALHTLPYSNVNRDNLGAPKSCWYGGTERIRVSSQAWKRAIRKA--VEQDLEQPTE 59 Query: 64 RTIHLAQL-RDVLRQKLGERFDQKIIDKTLALLSG-KSVDEAEKISADAVTPWVVGEIAW 121 RT +A L +L ++ D + + + G + + + TP +A Sbjct: 60 RTRRIASLVAGILTERGWGAEDARRAGRAVIYAYGLEPAADDDDTDTLLWTPPAAEALAG 119 Query: 122 FCEQ---------VAKAE---ADNLDDKKLLKVLKEDIAAIRVNLQQGVD-IALSGRMAT 168 E+ + K E A N K + +K ++ L + IAL GRM Sbjct: 120 VVEKHRDTVVTLPLPKGEGKKAKNPPAKDITDAVKPMAGEVKSILNRTTPTIALLGRMLA 179 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD-LQEQGSAHLGTQEFSSGVFYRY 227 + G IAHA T H+ + D+FTAVDD G+ H+ T +F++G FYRY Sbjct: 180 D---RPDHTIYGLAEIAHAFTVHEAAPEFDYFTAVDDRAANTGAGHVNTAQFTTGTFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 ++IN+ +L + +G + A + T P KQ AA AD+ + + P Sbjct: 237 SSINITRLVDVVG---EQDARAVLLAWARRFITVTPAGKQTATAARTAADLAHIVVRNAP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVA 320 S A AFE + + G+L P+ +A Y R+A Sbjct: 294 QSYAPAFETPIVSTGGYLDPAARALGDYATRLA 326 >UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J368_DEIGD Length = 385 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 119/345 (34%), Positives = 166/345 (48%), Gaps = 46/345 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK--SGYYAQNI 58 M + +H L + +PS LNRDD KDA FGG RR+RISSQ+ KRAMR+ G Sbjct: 1 MKALLELHYLQNFAPSNLNRDDTGSPKDAFFGGTRRLRISSQAFKRAMRQDFGGRELLRP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLAL-------LSGKS-------VDEAE 104 E +RT + L G +Q LAL GK+ DE Sbjct: 61 EEIGVRTKRAHEAIAELLAGEGRTEEQCRAAAELALGGLGLPVKDGKNQYLLFLGRDELR 120 Query: 105 KISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKV----LKEDIA---AIRVNLQQG 157 ++ AD ++G W Q A E ++ D KK L D+ A ++ + Sbjct: 121 RV-AD-----IIG-ANWAEFQAAAPEPESTDGKKKKASKKAALSGDLGKQLAGALDGSKA 173 Query: 158 VDIALSGRMATSGMMTELGKVDGAMSIAHA--ITTHQV-DSDIDWFTAVDDLQEQ---GS 211 VD+AL GRM + +L + + A I+TH + + D++TAVDDL+ G+ Sbjct: 174 VDVALFGRM-----LADLPDKNADAAAQVAHAISTHALRERQYDFYTAVDDLKPDDNAGA 228 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYA 271 LGT EF+S YRYA I+L +L ENL G RE ++ P KQ T+A Sbjct: 229 DMLGTVEFASATVYRYACIDLGKLLENLQG-DRELLERGLRAFLYASVYAAPTGKQNTFA 287 Query: 272 AFN-PADMV-MVNFSDMPLSMANAFEKAVKAK--DGFLQPSIQAF 312 A N P MV +V + P ++ANAFEK V+A+ G+L PS+ A Sbjct: 288 AHNLPGLMVQVVRRNASPRNLANAFEKGVRAEGGQGYLAPSVAAL 332 >UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK1_GEOUR Length = 408 Score = 114 bits (286), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 108/380 (28%), Positives = 179/380 (47%), Gaps = 63/380 (16%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR----------KSGYY 54 + +H++ S +CLNRDD+N K A+FGG +R R+SSQS KRA+R KS + Sbjct: 4 LELHIIQSVPVACLNRDDLNSPKTAVFGGVQRARVSSQSWKRAIREMAKEIAAEEKSDLF 63 Query: 55 AQNIGESSLRTIHLAQLRDVLRQK----------------LGERFDQKIIDKTLALLSGK 98 + G+ + R ++ R L +K + E D K+ + + K Sbjct: 64 S---GDRTRRMVYTLSTR--LAEKGITSQAAIAIAEQVADVVETLDSKVDSEGYKKI--K 116 Query: 99 SVDEAEKISADAVTPWVVG--EIAWFCEQVAKAEADNLDD------KKLLKVLKEDIAAI 150 +V K DA+ + E+ E + KA + D K ++K+L++ + Sbjct: 117 TVMFFSKAEYDAIAEAIATSDEVKNSVEALEKAAVEGNDREREKALKAMVKILEKGAISK 176 Query: 151 RVN---LQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ 207 + L+ DIAL GRM + KVDGA AH ++TH+ D++ID+F AVDDL Sbjct: 177 TIKSAQLKDAADIALFGRMVANDPSL---KVDGASMFAHILSTHKADNEIDFFAAVDDLN 233 Query: 208 --EQGSAHLGTQEFSSGVFYRYANINLAQL--QENLGGASRE-----QALEIATHVVHM- 257 E G+ T EF+S +YR+A +NL L ++LG + + +++E VV Sbjct: 234 KDESGAGMTSTLEFNSATYYRFAALNLDALANDDHLGDITLKDGTVVRSVETRKQVVKTF 293 Query: 258 ---LATEVPGAKQRTYAAFNPADMVM--VNFSDMPLSMANAFEKAV-KAKDGFLQPSIQA 311 + +P A++ T V+ V P+ + NAFE V +++ GF+ SI Sbjct: 294 LKAIIQSIPSARKTTMNGNTLPVYVLGVVREKGHPIQLINAFETPVRRSEKGFVTESINR 353 Query: 312 FNQYWDRVANGYGLNGAAAQ 331 N + + +G++ A+ Sbjct: 354 MNIEYADLKETWGVDSLFAK 373 >UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=Actinomycetales RepID=D1A6Q4_THECD Length = 399 Score = 114 bits (284), Expect = 8e-24, Method: Compositional matrix adjust. Identities = 123/384 (32%), Positives = 183/384 (47%), Gaps = 44/384 (11%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESS 62 FI H++ + + LNRDD N K +GGK R R+SSQ KRAMR Y ++G E++ Sbjct: 8 FIEAHIIQAIPFANLNRDDTNAVKTVTWGGKERTRVSSQCWKRAMRL--YLQTSLGQEAA 65 Query: 63 LRTIHLAQ-LRDVLRQK------LGERFDQKII-----------DKTLALLSGKSVDEAE 104 LRT L + L L + L ER + I+ KT +G + + Sbjct: 66 LRTRRLPEYLARHLEEHHGWPADLAERAGRHIVVASSVGGEAPKKKTDGEETGGTGEHWS 125 Query: 105 KISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKED--IAAIRVN----LQQGV 158 + + V E+A Q +A + + K K ++D I +V+ + GV Sbjct: 126 TAAMVYIPSSAVPELAELAIQYREALENAKEPKDPAKFGRKDSVIPTGKVDEILRRRNGV 185 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE-----QGSAH 213 I L GRM + +VDGA+ +AHA TTH ++ID+F+AVDD+ + GSAH Sbjct: 186 -INLFGRMLA---QVDDAEVDGAVQVAHAFTTHATTTEIDYFSAVDDVTDIWGDTTGSAH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G E S+GV YRY ++L L NLGG E E+A ++ +P AK+ + A Sbjct: 242 MGQAEHSAGVLYRYIVLDLNDLHANLGG-DLEATRELAAGLLKAALLSLPRAKKNSTAPH 300 Query: 274 NPADMVMVNF-SDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNG--- 327 + + +D P+S A AFEK V A G +PS+ A N+Y V G +G Sbjct: 301 TIPHLAHLTVRTDRPVSYAGAFEKPVPADRHGGHSEPSVAALNEYAAAVQKLLGTSGCRY 360 Query: 328 -AAAQFSLSDVDPITAQVKQMPTL 350 A A S +D + +V+ L Sbjct: 361 AAHATLSQEKIDALGERVESFDKL 384 >UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4B Length = 383 Score = 113 bits (283), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 107/348 (30%), Positives = 165/348 (47%), Gaps = 34/348 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 I +H+L S S LNRDD+ K A FGG R RISSQSLKRA R AQ+ + S Sbjct: 2 LIELHLLQSFPVSNLNRDDLGQPKTARFGGHTRARISSQSLKRAART--LLAQHGLDPSE 59 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWV--VGE--- 118 + +LRD L ER +K + + + A +A +T ++ VG+ Sbjct: 60 LGVRTKRLRDAAASLLAERGREKEQAVEVCQAGLEEIGFAAH-TATGLTKYLLYVGKPAQ 118 Query: 119 --IAWFCEQ----VAKAEADNLDDKKLLKVLKEDIAAIRVNLQ------------QGVDI 160 +A +C++ +AK A+ K+ + AA + Q + DI Sbjct: 119 TLLADYCDERWDTLAKTVAEAKKRKEKQEKTPRKTAAKKPTKQAQEQAKRILDGTRAADI 178 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQ 217 AL GRM T+ V+ A +AHA++TH V ++ D++TA+DDL+ E + +GT Sbjct: 179 ALFGRMIADN--TDFN-VNAASQVAHALSTHAVVNEFDYYTALDDLRPDAEPAADMIGTV 235 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN-PA 276 +F++ FYRYAN++L QL NL + A +H VPG KQ + +A P Sbjct: 236 DFNAACFYRYANLDLEQLATNLPD-DPDLVARSARAWLHSFIHAVPGGKQNSMSARTMPQ 294 Query: 277 DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 ++ V ++ANAF V + S Q ++ ++ + YG Sbjct: 295 TLLGVVRETGAWNLANAFLSPVTDVPDLMAASTQRLVDHFQQLRSFYG 342 >UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I6_9LACO Length = 311 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 70/204 (34%), Positives = 110/204 (53%), Gaps = 20/204 (9%) Query: 117 GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELG 176 G+IA E V + D LD K + + L+ D +D+AL GRM Sbjct: 72 GQIAKLAEYVR--QNDELDSKAVKEALQGD---------HSLDMALFGRMVADDPSL--- 117 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 VD A +AHAI+TH++ + D++TAVDD + E GSA +GT E+ S YRYAN+N+ Sbjct: 118 NVDAACQVAHAISTHEIVPEYDYYTAVDDEKADDESGSAMIGTIEYDSATLYRYANVNVN 177 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMAN 292 +L ++LG + A++ V +P KQ ++A V+V D P+++ + Sbjct: 178 ELVQSLGDV--DTAVKGLQLFVKDFVLSMPTGKQNSFANKTVPQYVLVTVREDTPVNLVS 235 Query: 293 AFEKAVKAKDGFLQPSIQAFNQYW 316 AFE+AVK++ G+LQPS+ + + Sbjct: 236 AFEEAVKSRHGYLQPSVAKLEKEY 259 >UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK6_CITRO Length = 363 Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 103/327 (31%), Positives = 151/327 (46%), Gaps = 48/327 (14%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K + GG R+RISSQSLKRA R S + Q + G Sbjct: 13 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGTTRLRISSQSLKRAWRTSELFEQALAG 72 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-EKISAD--AVTPWVV 116 +R+ +A+ + K G ID+ A+ +++ K+ AD A P Sbjct: 73 NIGIRSGRIAREAAEILIKSG-------IDEKKAVAYVEAIARCFGKVKADKKAKEPLTN 125 Query: 117 GEIAWFCE------QVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSG 170 E KA A L ++K + KE+ A+ + + VDIA+ GRM Sbjct: 126 SETEQLVHISPAEFDAVKALAHRLAEEK--RAPKEEELALLRHDRMAVDIAMFGRMLAD- 182 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYR 226 E V+ A +AHA + + D+FTAVDDL+ + G+ HLG F S +FY Sbjct: 183 -KPEFN-VEAACQVAHAFGVSETIVEDDFFTAVDDLRANSDDAGAGHLGYTGFGSALFYT 240 Query: 227 YANINLAQLQENLGG----------ASREQALEIATHVVHMLATEVPGAKQRTYAAFNPA 276 Y IN L +NL G A E AL+++ P KQ ++A+ A Sbjct: 241 YICINKDLLIKNLNGNVDLANQTLRAFTEAALKVS-----------PTGKQNSFASRAYA 289 Query: 277 DMVMV-NFSDMPLSMANAFEKAVKAKD 302 M +D P S+A AF K + D Sbjct: 290 CWAMAEKGTDQPRSLAAAFYKPIVGSD 316 >UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ1_SPHTD Length = 397 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 72/183 (39%), Positives = 100/183 (54%), Gaps = 26/183 (14%) Query: 156 QGVDIALSGRMATSGMMTELGK--VDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---G 210 + D+AL GRM + +L + +D A +AHAI+TH+V ++ D++TAVDDL+ G Sbjct: 179 RSADVALFGRM-----LADLPEKNIDAASQVAHAISTHRVATEFDFYTAVDDLKPDDTAG 233 Query: 211 SAHLGTQEFSSGVFYRYANINLAQLQENLGG------ASREQALEIATHVVHMLATEVPG 264 + LGT EF+S FYRY+NI++ QL ENLGG + E L + H +P Sbjct: 234 ADMLGTVEFNSACFYRYSNIDVDQLIENLGGDVDLARTTVEAFLWASIHA-------IPT 286 Query: 265 AKQRTYAAFNPADMVMVNFSDMPL-SMANAFEKAV-KAKDG-FLQPSIQAFNQYWDRVAN 321 KQ + AA NP VM D L S+ANAF V A DG ++ S+ A YW + Sbjct: 287 GKQNSMAAQNPPSFVMAVVRDRGLWSLANAFVNPVAPAHDGDLIERSVDALEAYWSNLVR 346 Query: 322 GYG 324 YG Sbjct: 347 VYG 349 >UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=Bacteria RepID=B8FDH9_DESAA Length = 383 Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust. Identities = 65/164 (39%), Positives = 93/164 (56%), Gaps = 9/164 (5%) Query: 157 GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAHLG 215 GVDIAL GRM V+GA S +HAI+TH+V +++++FTA+DDLQ E GSAH+G Sbjct: 185 GVDIALFGRMVAQAAAL---NVEGAASFSHAISTHKVTNEVEFFTALDDLQTEPGSAHMG 241 Query: 216 TQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 EF+S +YRY +++ QL +NL G QALE V L +P A+Q T + Sbjct: 242 ALEFNSATYYRYVCLDMGQLWKNLAGQHLPQALE---GFVKALYLALPSARQATQSGACW 298 Query: 276 ADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDR 318 + V F + F+ AVK ++ G L+PS A Y ++ Sbjct: 299 WEFAKV-FVRKGQRLQAPFDTAVKPRNGGLLEPSKDALCAYLEK 341 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 21/45 (46%), Positives = 28/45 (62%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR 49 + H+L S +CLNRDD+ K A+ GG R R+SSQ KR +R Sbjct: 11 VEFHILQSFPVTCLNRDDVGAPKTAVVGGATRARVSSQCWKRNIR 55 >UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ25_CYAP7 Length = 480 Score = 98.2 bits (243), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 61/200 (30%), Positives = 95/200 (47%), Gaps = 13/200 (6%) Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE------QGSA 212 D+AL GRM S VD ++S+AHAI+T+ + + D++TA D Q+ QG+ Sbjct: 243 DVALFGRMLAS---FSDASVDASVSVAHAISTNSIKREFDYWTAARDFQKNNSDESQGAG 299 Query: 213 HLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAA 272 H+G + F+SGVFYRY+ ++ QL ENLG +E + + P + Sbjct: 300 HIGDRPFASGVFYRYSCLDSNQLSENLGEIYQEDIQYLVEQYLDAFLHSRPSGYSHQFGH 359 Query: 273 FN-PADMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ 331 P + V P+S+ NAF+ +K D F + S +W+ + YG + Sbjct: 360 DTLPFAGIFVIRQSQPISLVNAFDIPIKKYDSFCRQSWNKLVDHWNEIQQAYGKRLPVKE 419 Query: 332 ---FSLSDVDPITAQVKQMP 348 FSL I+ VK +P Sbjct: 420 VHVFSLESFKDISELVKAVP 439 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 35/94 (37%), Positives = 50/94 (53%), Gaps = 7/94 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+NF+ IH+L S PS +NRD K A FGG R+R+SSQS K A+R+ YY + + + Sbjct: 1 MTNFLEIHLLQSTPPSNMNRDQNGSPKTAHFGGVERLRVSSQSWKHAVRQ--YYKKTLPD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLAL 94 H +L +R Q+ D+ L L Sbjct: 59 D-----HKTYRDKGWPTELAKRLKQEKFDEELNL 87 >UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH9_CYAP4 Length = 501 Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 66/208 (31%), Positives = 110/208 (52%), Gaps = 21/208 (10%) Query: 159 DIALSGRMATSGMMTEL--GKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAH 213 DIAL GRM M L KVD ++ +AHAI+ +++ + D+FTAV+DL E GS H Sbjct: 289 DIALFGRM-----MANLPNAKVDASVQVAHAISVNKLQQEFDFFTAVEDLAEPDSLGSGH 343 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G ++S +YR+ ++ QL++NLG + + A IA +P Q +AA Sbjct: 344 MGETGYNSSTYYRFTTLDTEQLKQNLG--NEDNAATIAHAFAEAFVRAIPTGHQNGFAAH 401 Query: 274 N-PADMVMVNFSDMPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYG-----L 325 + PA ++ V P+S+ +AFE V K G L+ ++ +++W ++ YG Sbjct: 402 SLPAAVMAVVRKGQPVSLVDAFENPVAPKAGKSLLENAVSKLDEHWAELSKMYGEKTVVF 461 Query: 326 NGAAAQFSLSDVDPITAQVKQMPTLEQL 353 G A+ L+ A V++ P++E+L Sbjct: 462 KGIVARAQLAQQLEYLAAVEK-PSVEEL 488 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 2/91 (2%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 + IH+L S P+ LNRD+ M K +FGG R RISSQ KR R+ YY + E + Sbjct: 3 LEIHILQSFPPANLNRDENGMPKSTVFGGYPRARISSQCQKRRTRE--YYHEYCKELGVD 60 Query: 65 TIHLAQLRDVLRQKLGERFDQKIIDKTLALL 95 H A ++L E+ Q+ + + A L Sbjct: 61 LKHFANRSRNWIKQLKEKLTQRGVSEAQAEL 91 >UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI0_9BIFI Length = 381 Score = 90.9 bits (224), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 92/318 (28%), Positives = 147/318 (46%), Gaps = 26/318 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ + I+ + + PS +NRDD K AI+GG R R+SSQ+ KRAMR++ + + Sbjct: 1 MTTIVEIYAIQNVPPSNINRDDTGNPKTAIYGGVLRARVSSQAWKRAMREAFPEMLDADQ 60 Query: 61 SSLRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 +RT + LAQ+ + K + D + + K + + EK ++ Sbjct: 61 LGIRTKNALAQIEQSIVAKRPD-IDVETVHKAATAALTATGAKVEKSKRKGSMEG--ADL 117 Query: 120 AWFCEQVAKAEADNLDD------------KKLLKVLKEDIAAIRVNLQQGVDIALSGRMA 167 + +A E D L D K K +K ++A+ + Q VDIAL GRM Sbjct: 118 TQYLIFIANREIDKLADLAIAWIDADEDLDKPSKEMKGQVSAV-FHGPQAVDIALFGRML 176 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAH---LGTQEFSSGVF 224 EL D + +AHAI+ +V + D+FTA+DD +A L T F+S Sbjct: 177 ADA--PELN-TDASAQVAHAISVDEVTPEYDYFTAIDDDAADDNAGAAMLDTVGFNSSTL 233 Query: 225 YRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN-PADMVMVNF 283 YRYA + + L E L A + ++ V+ +P KQ T+A P ++V Sbjct: 234 YRYATVAVDSLYEQLQSA--DMTVKAVDAFVNAFLRSMPTGKQNTFANRTLPTAALVVVR 291 Query: 284 SDMPLSMANAFEKAVKAK 301 + P++ AFE+ V A+ Sbjct: 292 NSQPINPVEAFERPVHAE 309 >UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC9_9ACTN Length = 310 Score = 87.8 bits (216), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 61/189 (32%), Positives = 102/189 (53%), Gaps = 14/189 (7%) Query: 116 VGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTEL 175 +G++A Q + + + +D K+ K+L D+ R VDIA+ GRM +L Sbjct: 44 IGKLAELAIQALR-DGEKVDKKEAKKIL--DVK--RSPALNAVDIAMFGRMVADA--PDL 96 Query: 176 GKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQEFSSGVFYRYANINL 232 VD ++ +AHAI+ +++ D+FTA+DD + G+A + T EF+S +FYRYAN+++ Sbjct: 97 N-VDASVQVAHAISVSSAETEFDYFTALDDKAPEDNAGAAMIETTEFTSAMFYRYANVDV 155 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMA 291 L ENLG S + A + + +P KQ ++A V++ D P+S+ Sbjct: 156 FHLCENLG--SPDAATKGINAFLQSFVKSMPTGKQNSFANRTLPSAVVIQLRDSQPVSLV 213 Query: 292 NAFEKAVKA 300 N+FE+ V A Sbjct: 214 NSFERPVVA 222 >UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella boydii Sb227 RepID=Q31XC0_SHIBS Length = 245 Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 77/245 (31%), Positives = 124/245 (50%), Gaps = 28/245 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K I GG R+R+SSQSLKRA R S + Q + G Sbjct: 1 MTTFIQLHLLTAYPAANLNRDDSGSPKTVILGGATRLRVSSQSLKRAWRTSELFEQALAG 60 Query: 60 ESSLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLAL---LSGKSVDEAEKISADAVTPWV 115 +R+ +A + +L +K E D+K I+ + + L D+ +K P Sbjct: 61 HIGVRSGRIAREAATILIEKGIE--DKKAIEWAVEIADYLGKAKKDKKQKNDKKPKDPLT 118 Query: 116 VGEIAWFCEQVAKAEADNL--------DDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMA 167 E ++ AE D + ++K+ K ++D+A +R + + VDIA+ GRM Sbjct: 119 SAETEQLV-HISPAEFDAVKALAHQLAEEKRAPK--EKDLALLRKD-RMAVDIAMFGRM- 173 Query: 168 TSGMMTELG-KVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSG 222 + + G V+ A +AHA + + D+FTAVDDL ++ G+ H+ F S Sbjct: 174 ---LAKKPGFNVEAACQVAHAFGVSETIVENDFFTAVDDLRQASEDAGAGHVDETGFGSA 230 Query: 223 VFYRY 227 +FY Y Sbjct: 231 LFYTY 235 >UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Actinomycetales RepID=C2GEY7_9CORY Length = 356 Score = 85.1 bits (209), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 99/360 (27%), Positives = 156/360 (43%), Gaps = 38/360 (10%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSN + +H L S S LNRDD + K + GG R SSQS+KR R Y + Sbjct: 1 MSNQLTLHFLCSIPYSNLNRDDTGVPKRVMQGGALRALHSSQSIKRGSRV--LYENASQD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGK-SVDEAEKISADAV-TPWVVGE 118 S+R+ L + ++ D+K K A L G + EA+ DA + W+ E Sbjct: 59 LSIRSGRLDEEVAEKAMEMNPDLDEKTALKQAAKLIGNLTKGEAKSGEGDAKRSTWLSSE 118 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 A A++ D ++ I N + IA GRM + T+L Sbjct: 119 EIL---TAATYVANSTDPREKF---------IDGNTTGSLAIAAFGRMFANA--TDL-NT 163 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANINLAQ 234 + A++++ AITTHQ + D+F+ DD+ + + +L ++SG FYR I+ Q Sbjct: 164 EAAVAVSPAITTHQATIETDYFSTADDINLRDHKANATYLDVSLYTSGTFYRTVTIDRNQ 223 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAF 294 L+ + G E V L P K+ + A F +++ + +A F Sbjct: 224 LRTSWSGFESNSVRENLEAFVRSLVYGQPRGKKNSTAPFTMPSLILAE--EQQYRVAYDF 281 Query: 295 EKAVKA-KD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLE 351 E+ V+A KD GF++ SI+ ++A Y L A F + P+ A P L+ Sbjct: 282 ERPVEADKDGGGFMKSSIE-------KLAKQYTL---ARSFDPGNFGPVEALSGTYPDLD 331 >UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2C Length = 461 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 89/381 (23%), Positives = 145/381 (38%), Gaps = 91/381 (23%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 + ++H+L + + + RD+ M K +FGG R I++Q+ +RA R N G+ L Sbjct: 17 YFSLHLLETFTAALPVRDENGMPKQFVFGGDPRTMITAQARRRAERTHSRERANAGQGPL 76 Query: 64 -------RTIHLAQLR-------------------DVLRQKLGERFDQKIIDKTLALLSG 97 RT A+L L + +G +F K + L + Sbjct: 77 AGYTMGIRTREWAKLTAKALADRYGWDRADALATAKALLEGVGLKFGAKPTTRDLTQVLL 136 Query: 98 KSVDEAEKISADAVTPWVVGEIAWF-------------------------------CEQV 126 + ++A +I AD + AW + + Sbjct: 137 FAPEDAGQIIADWIQEHRAEVAAWTSDYLKAKEAGAAAAAAKKAAAAAARKAKKSGTDAL 196 Query: 127 AKAEADNL--DDKKLLKVLKEDIAAIRVNL--QQGVDIALSGRMATSGMMTELGKVDGAM 182 A A DN ++++L V ++ AI L + +DIAL GR + + VDGA+ Sbjct: 197 ASAADDNQPNNEEQLPPVPRKIREAILSALAPRDAIDIALYGRFLAE--IADSPNVDGAI 254 Query: 183 SIAHAITTHQVD------------------SDIDWFTAVDDLQEQGSAHLGTQEFSSGVF 224 AHA T H + +D+ A DD G+ G Q SG F Sbjct: 255 QTAHAFTVHAAEHIDDFYAAADDAKLHRKAHALDYIDAADD---SGAGMTGYQSLISGTF 311 Query: 225 YRYANINLAQLQENL--GGASREQ----ALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 YR+A ++ +L+ NL G +Q A V +P AK+ T AA Sbjct: 312 YRHAVLDRYKLRINLLASGMKPDQVQAAAEAAELEFVEAFTNAIPQAKKNTTAATGILPK 371 Query: 279 VMVNFSDM-PLSMANAFEKAV 298 +++ F+ P + A FEK + Sbjct: 372 LVMAFTGARPFNYAGIFEKPI 392 >UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190E665 Length = 139 Score = 57.8 bits (138), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 26/54 (48%), Positives = 37/54 (68%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY 54 M+ FI +H+L +++P+ LNRD+ K A GG R+R+SSQSLKRA R S + Sbjct: 1 MTTFIQLHLLTAYAPANLNRDESGRPKTAFMGGVERLRVSSQSLKRAWRVSETF 54 >UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM4_RHOCS Length = 435 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 56/220 (25%), Positives = 97/220 (44%), Gaps = 21/220 (9%) Query: 158 VDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHL 214 +D AL GRM + V+ A ++ HA TTH+ + D+F+A ++L+ G+ Sbjct: 220 LDTALFGRMVAANANF---NVEAACAVGHAFTTHRFALEGDYFSAGEELKVLGGTGAVIT 276 Query: 215 GTQEFSSGVFYRYANINLAQLQENLG-GASREQALEIATHVVHMLATEV----PGAKQRT 269 G F GV+Y++A ++ L+ L G S E+A + V T + P K + Sbjct: 277 GYAFFGGGVYYQHAVLDRGHLRTTLSRGRSAEEAERLTVQAVDTFLTGLLFSQPRGKCNS 336 Query: 270 YAAFNPADMVMVNFSDMP-LSMANAFEKAVKAKD---GFLQPSIQAFNQYWDRVANGYGL 325 +A+ A V+ P L++ AF VKA + + SI+ + + YGL Sbjct: 337 HASDVAASYVLATRGGDPALNLGLAFLDPVKATEDVTDLMCASIRRLTDFHRALTAAYGL 396 Query: 326 NGAAAQFSLSDVDPI----TAQVKQMPTLEQLKSWVRNNG 361 A L+ P + ++ T+E + +V+ G Sbjct: 397 GNAVC--VLNAYPPARGNDAPRAPEVWTVEDFRRFVQGRG 434 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 2/69 (2%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY--AQNI 58 +++ I HVL + P +NRD+ K GG R RISSQ+ KRA+R + ++ AQ Sbjct: 11 VADMIQFHVLTAFPPHNVNRDEDGRPKTCQLGGVTRGRISSQAKKRALRLAPHFPTAQRA 70 Query: 59 GESSLRTIH 67 + IH Sbjct: 71 TRTRKAGIH 79 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobact... 486 e-136 UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Ac... 343 5e-93 UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=S... 334 2e-90 UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=G... 334 3e-90 UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 T... 330 6e-89 UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=A... 322 1e-86 UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID... 320 3e-86 UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidob... 319 1e-85 UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 318 2e-85 UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=S... 318 3e-85 UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=B... 315 2e-84 UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinom... 312 1e-83 UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putati... 312 2e-83 UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacteriu... 310 4e-83 UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actino... 309 1e-82 UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellul... 307 3e-82 UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Strepto... 307 4e-82 UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes Rep... 307 5e-82 UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 307 6e-82 UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=P... 305 2e-81 UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=... 304 4e-81 UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=R... 303 5e-81 UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 303 6e-81 UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacteriu... 303 7e-81 UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 T... 302 1e-80 UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=S... 302 1e-80 UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=D... 302 1e-80 UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter ro... 302 1e-80 UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=A... 301 3e-80 UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=... 300 7e-80 UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria ... 299 8e-80 UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=... 298 3e-79 UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=A... 294 3e-78 UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=D... 294 4e-78 UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidob... 293 5e-78 UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=A... 293 5e-78 UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfo... 292 1e-77 UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=T... 292 2e-77 UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=A... 291 3e-77 UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax... 290 5e-77 UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granuli... 289 1e-76 UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=S... 288 2e-76 UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces R... 287 4e-76 UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=R... 286 7e-76 UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=A... 285 1e-75 UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=P... 285 2e-75 UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiob... 284 3e-75 UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=... 280 5e-74 UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=A... 280 7e-74 UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=B... 276 1e-72 UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=C... 274 3e-72 UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospi... 274 4e-72 UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=J... 271 2e-71 UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=S... 269 1e-70 UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=B... 269 1e-70 UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinoc... 266 9e-70 UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=B... 264 3e-69 UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=D... 261 2e-68 UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=G... 261 3e-68 UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=A... 261 4e-68 UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Ac... 259 1e-67 UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax... 257 4e-67 UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax... 238 2e-61 UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus ... 229 1e-58 UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=C... 220 8e-56 UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 212 1e-53 UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=C... 209 2e-52 UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella bo... 206 8e-52 UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax... 204 4e-51 UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 ... 109 2e-22 Sequences not found previously or not previously below threshold: UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1... 75 3e-12 UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobil... 52 3e-05 UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O8703... 44 0.007 UniRef50_Q4PFD0 Putative uncharacterized protein n=3 Tax=Basidio... 41 0.053 >UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobacteria RepID=YGCJ_ECOLI Length = 363 Score = 486 bits (1252), Expect = e-136, Method: Composition-based stats. Identities = 363/363 (100%), Positives = 363/363 (100%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE Sbjct: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA Sbjct: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 Query: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG Sbjct: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG Sbjct: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA Sbjct: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 Query: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN Sbjct: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Query: 361 GEA 363 GEA Sbjct: 361 GEA 363 >UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Actinomycetales RepID=C0W6U1_9ACTO Length = 374 Score = 343 bits (880), Expect = 5e-93, Method: Composition-based stats. Identities = 118/375 (31%), Positives = 176/375 (46%), Gaps = 32/375 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IH++ S PSC+NRDD K A++GG RR+R+SSQS KRA R + + Sbjct: 1 MSTFVDIHLIQSLPPSCVNRDDSGSPKSALYGGVRRLRVSSQSWKRATRLYFNEHLDATD 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA---EKISADAVTPWVVG 117 +RT + +L + + + S + A K A A + +++ Sbjct: 61 VGIRTKRVVELLADRISAIAPDLADSALALAEQVFSAAKIKVAPPRGKKDAPAESGYLLF 120 Query: 118 EIAWFCEQVAKAE------ADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGM 171 ++A+ + +D K+ K+ KE+ VDIAL GRM Sbjct: 121 LSTSQINRLAEMATRAAHAGEKIDPKETKKIFKEE---------HAVDIALFGRMVADDA 171 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD------LQEQGSAHLGTQEFSSGVFY 225 VD A +AHAI+TH +++ D+FTAVDD ++ G+ +GT EFSS Y Sbjct: 172 ---DLNVDAACQVAHAISTHAAENEYDFFTAVDDEKSRAMEEDAGAGMMGTVEFSSATMY 228 Query: 226 RYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFS 284 RYA +NL L ENLG R+ AL + + +P KQ T+A D V+V Sbjct: 229 RYATVNLDMLVENLG--DRDAALRALSVFLEGFCLSMPTGKQNTFANRTLPDSVVVSVRD 286 Query: 285 DMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITA 342 D P+S+ AFEK V+ DGFL S++A +Y + +GL A+ P A Sbjct: 287 DQPVSLVGAFEKPVRTTESDGFLTRSVEALARYEHTIEENFGLKPQASFVVSLADVPELA 346 Query: 343 QVKQMPTLEQLKSWV 357 + + T L V Sbjct: 347 SLGERITFADLPGKV 361 >UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM53_SYNFM Length = 384 Score = 334 bits (857), Expect = 2e-90, Method: Composition-based stats. Identities = 125/394 (31%), Positives = 195/394 (49%), Gaps = 48/394 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI----G 59 F++IH++ + +PS LNRDD N KD FGG RR RISSQ +KR +R ++Q + G Sbjct: 2 FVDIHIIQNFAPSNLNRDDTNSPKDCEFGGYRRARISSQCIKRVVRSHRSFSQAVVHAGG 61 Query: 60 ESSLRTIHLA-QLRDVLRQKLG--ERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ +RT + +L D+ +K G E + + + +T+ L G + + EK T +++ Sbjct: 62 DTGVRTKRIKSRLMDLFAKKYGKPEIVETEKVAETVIELLGLKLKDEEK------TEYLL 115 Query: 117 GEIAWFCEQVAKAEADNLDD---------------------KKLLKVLKEDIAAIRVNLQ 155 Q+A+ D+ D K+ + LK + R + Sbjct: 116 YLGENEAAQLARLAVDSWDALLAIEPEQDKKKKKGTGQESLKEFQEELKGIVGKRRKEAR 175 Query: 156 Q-GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGS 211 DIAL GRM VD A +AHA++T++V+ ++D+FTAVDDL +E GS Sbjct: 176 SYAADIALFGRMIADNKN---MNVDAACQVAHAVSTNKVEMEMDYFTAVDDLLPGEETGS 232 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYA 271 +G EF+S FYRY+N+N+++L ENL G + + V VP KQ + A Sbjct: 233 DMIGVVEFNSSCFYRYSNVNVSKLAENL-GFNNDLTTAALLGYVEASVKSVPTGKQNSMA 291 Query: 272 AFNPADM--VMVNFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNG 327 A NPA V+V P S+ANAF+K V+ + SI A +Y++R+ YG G Sbjct: 292 AQNPAGYARVIVRRDGFPWSLANAFQKPVRPSLDKSLEEASIDALERYFERLKAVYGTEG 351 Query: 328 AAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 S + +++M L+ LK+ V G Sbjct: 352 IVCDASFNLHRDDGGSLRKM--LDALKACVAGEG 383 >UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=Gammaproteobacteria RepID=A1SV72_PSYIN Length = 337 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 140/356 (39%), Positives = 205/356 (57%), Gaps = 23/356 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ FINIH LISH S +NRDD MQK A+FGG R RISSQ LKRA+R+S Y + + E Sbjct: 1 MTTFINIHTLISHPSSMMNRDDSGMQKTAVFGGSVRSRISSQCLKRAIRQSDIYGEAVAE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKI-SADAVTPWVVGEI 119 S+RT +L D+ ++ + E + I D L + S + D+ +I + DAV P+ +G I Sbjct: 61 KSIRTNKFDELLDLCKEAMPETDIKLIEDVLLNMGSKVTKDKKTEIRNFDAVQPYAIGSI 120 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 + + + K L K+++ +D+ALSGRM S V+ Sbjct: 121 ----REAINMVNEGTELKDLKKIVQ----------IPTIDVALSGRMDAS---CPPRNVE 163 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENL 239 AMS+AH++TTH D ++DWFTA DDL EQGS H+GT EFSSGVFYRYA+IN+ L +N+ Sbjct: 164 AAMSVAHSLTTHSADIEVDWFTACDDLAEQGSGHIGTTEFSSGVFYRYASINVDLLAKNV 223 Query: 240 GGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVK 299 E I ++ A P AKQ+ +AA+N AD VM S+ P+S+ANAF K ++ Sbjct: 224 KSTVSEVTP-IINTMIRCFAQVSPSAKQKVFAAYNQADFVMATHSNQPISLANAFRKPIE 282 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 ++ SI A ++++++ N Y L+ A L+D +AQ KQ+ + ++ Sbjct: 283 NNGDVMENSIAALVKHYEKLTNAYELDSKAIALDLTD----SAQSKQINLVNKISD 334 >UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 Tax=Gardnerella vaginalis RepID=D2RB01_GARVA Length = 362 Score = 330 bits (845), Expect = 6e-89, Method: Composition-based stats. Identities = 103/360 (28%), Positives = 177/360 (49%), Gaps = 26/360 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES-- 61 F++I + S P +NRDD K A +GG R R+SSQ K +MR+ Y+ ++ G+S Sbjct: 6 FLDIQAIQSVPPCNINRDDAGSPKTAQYGGVTRARVSSQCWKHSMRE--YFKEHSGDSNV 63 Query: 62 SLRTIHLAQLRD----VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 +R+ ++ + L+ +L E+ + +KTL K+ + KI + +G Sbjct: 64 GMRSKNIVKYVADKIITLKPELSEQEALDLANKTLNNAGFKTKTDKGKIIPVVNVLFFLG 123 Query: 118 EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 E +A+A +N+ DKK L+ + +D +DIAL GRM Sbjct: 124 E--NQANSLAQAAINNVTDKKQLEEILKDNPP--------IDIALFGRMLADN---PSLN 170 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQ 234 D + +AHAI+TH V ++ D++TAVDDL G+ LGT E++S YRYAN+ + + Sbjct: 171 EDASSQVAHAISTHAVRAEFDYYTAVDDLSVDDNAGAGMLGTIEYNSSTLYRYANVAIHE 230 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDMPLSMANA 293 L ++E + + A +P K T+A M++V D P+++ +A Sbjct: 231 FSHQLSD-NKESTINALKLFIEAFANAMPTGKVNTFANQTLPQMLVVTLREDRPVNLVSA 289 Query: 294 FEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQL 353 FE VKAKDG++ SI+ +Q +++V A+ ++ + + +++QL Sbjct: 290 FEDPVKAKDGYVSKSIEKLSQEYEKVQKFVHKPLASFYVTMDSSNKEIKLGVEEQSMQQL 349 >UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD49_CHRVI Length = 393 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 179/393 (45%), Positives = 239/393 (60%), Gaps = 36/393 (9%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 NF+N HVLISHSPSCLNRDDMNMQK AIFGGK RVRISSQSLKRA+R S YYA+ S Sbjct: 5 NFVNFHVLISHSPSCLNRDDMNMQKTAIFGGKTRVRISSQSLKRAIRYSDYYARYFISKS 64 Query: 63 LRTIHL-AQLRDVLRQKLGERFDQKIIDK----TLALLSGK-SVDEAEKISADAVTPWVV 116 RT L ++ D L I+K A+ GK +DE K D + + Sbjct: 65 QRTRRLFDKMADELSASAESAEQTTAIEKCALYAAAIFEGKTKIDEIGKYERDKKSDHIE 124 Query: 117 GEIAWF-CEQVAKA-----EADNLDDKKLLKVLKEDIAAIRVNLQQ--GVDIALSGRMAT 168 +I F C ++ EA +K ++ +K +I + + +D+ALSGRMA Sbjct: 125 TQIIPFSCAEIEGIKQILLEAAGKPEKGRIEYMKAEIQRLEREQRTRIDLDVALSGRMAN 184 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSD-IDWFTAVDDLQ----EQGSAHLGTQEFSSGV 223 S ++ VDGA+++AHAITTH V+ IDWFTAVDDL E G+ HL TQ+FS+GV Sbjct: 185 SELIYP---VDGALAVAHAITTHTVEPQDIDWFTAVDDLTLDAGETGAGHLNTQQFSAGV 241 Query: 224 FYRYANINLAQLQENLG----------GASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 FYRYA++NL QLQ NLG SR +AL+IA HV+H+LAT VP AKQ+++AA Sbjct: 242 FYRYASLNLRQLQFNLGLLANINAEQTTESRARALDIARHVLHLLATVVPSAKQQSFAAH 301 Query: 274 NPADMVMVNFSDMPLSMANAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAA 330 N AD V+V+ +D P+S+ANAFE+ ++ + GFLQPSI A YW RV + YGL+ A Sbjct: 302 NLADFVIVSLADQPVSLANAFEEPIERERKIGGFLQPSITALADYWSRVNSAYGLDEQAR 361 Query: 331 QFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 F+L + Q + + ++ L+ W+ N+G A Sbjct: 362 AFALRGGIKLGDQ-EVLTSIADLEQWLANDGRA 393 >UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID=C2BET9_9FIRM Length = 359 Score = 320 bits (821), Expect = 3e-86, Method: Composition-based stats. Identities = 98/356 (27%), Positives = 173/356 (48%), Gaps = 22/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + + P+ +NRDD K A +GG R R+SSQS KRA+RK ++ + Sbjct: 10 FLDIHAIQTVPPANINRDDTGSPKTAQYGGVTRARVSSQSWKRAIRKYFNENGDVENVGI 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R++ + + + K+ ++ I++ + ++ K+++ A+ + D + Sbjct: 70 RSLEIVRY---VANKIVQKDGSISIEEAM-EMADKTINNAKISTKDQKAKALFFMSDKQA 125 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A+A D ++DKK+L+ + ++ +D+AL GRM D + Sbjct: 126 EELAQASIDKVNDKKILQEILKN--------DTSIDVALFGRMVADDASL---NEDASSQ 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 +AHAI+TH + S+ D+FTAVDDL G+ LGT E++S YRYANI L L Sbjct: 175 VAHAISTHAIQSEFDFFTAVDDLAPEDNAGAGMLGTVEYNSSTLYRYANIALHDFYRQL- 233 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFEKAVK 299 A +E+ ++ V +P K T+A ++V+ SD PL+M +AFE+ +K Sbjct: 234 -ADKEETIKATKLFVKSFVESMPTGKINTFANQTLPQAIVVSLRSDRPLNMVSAFEEPIK 292 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ + + A L + + K +L L Sbjct: 293 SDNGYVDKSIEKLFSEYTKYDKILDKPIFTAYLILGNT-EVNEIGKSEASLNDLLE 347 >UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT63_9BIFI Length = 371 Score = 319 bits (817), Expect = 1e-85, Method: Composition-based stats. Identities = 109/373 (29%), Positives = 176/373 (47%), Gaps = 26/373 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L PS +NRDD K A GG R R+SSQS KRAMR+ + + Sbjct: 2 FVDIHCLQQVPPSNINRDDTGSPKTAYVGGALRARVSSQSWKRAMREMFSSKLDSSKLGK 61 Query: 64 RTIH-LAQLRDVLRQKLGERFDQ-KIIDKTLALLSGKSVDEAEKISAD---AVTPWVVGE 118 RT +A + V+ +K + ++ K + + + +G V +++ AD + T +++ Sbjct: 62 RTKSAVALISSVIAEKRPDLVEESKSLAEKVLAATGVKVKASDRAGADKGSSATEYLIFI 121 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 EQ+A D+ K +K+++AA+ + +Q +DIA GRM Sbjct: 122 ANREVEQLADIAITAFDEGKDPSKMKKEVAAV-FHGEQAIDIACFGRMLADA---PDLNT 177 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQL 235 D + +AHA + Q+ + D+FTAVDD G+A + T F+S YRYA +N+ L Sbjct: 178 DASAQVAHAFSIDQITPEYDYFTAVDDCASDDNAGAAMIDTIGFNSSTLYRYATVNVDAL 237 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPA-DMVMVNFSDMPLSMANAF 294 ++ L A+ A+E V +P KQ T+A D+V+V P+S A+AF Sbjct: 238 KDQLQDAN--AAVEGVAAFVDAFIKSMPSGKQNTFANHTLPEDIVIVLRDSQPISAADAF 295 Query: 295 EKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQF-----SLSDVDPITAQVKQM 347 E +K KDG + I+ + + YG A S+ +D + QV Sbjct: 296 EDPIKRKDGISVSRQGIERLGDRLNEIRINYGEEPVKAWHVVSGGSVHSLDEWSEQV--- 352 Query: 348 PTLEQLKSWVRNN 360 TL +L+ +R Sbjct: 353 -TLPELEQGLRET 364 >UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD3_THET1 Length = 382 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 124/383 (32%), Positives = 193/383 (50%), Gaps = 38/383 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 + +H++ + +PS LNRDD KD FGG RR RISSQ +KRA+R+ + QN L Sbjct: 2 LVELHMIQNFAPSNLNRDDTGSPKDCEFGGVRRARISSQCIKRAIRRE--FKQN---GLL 56 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD--AVTPWVV----G 117 + +A+ ++ Q++ +R + D+ A + A K+ D T +++ G Sbjct: 57 DSERIAERTRLVTQEIADRLARLGRDREQATRVAGFLLSAAKLKVDNSQRTEYLLFLGRG 116 Query: 118 EIAWFC-------EQVAKAEADNL-----DDKKLLKVLKEDIAA---IRVNLQQGVDIAL 162 EI +Q+A +L D KK + + D++ R++ + D+AL Sbjct: 117 EIDAITALCNERWDQLAPLADQSLSDQSNDKKKAAQQVPADMSRELLARLDGGKAADLAL 176 Query: 163 SGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEF 219 GRM +D A +AHAI+TH+V + D++TAVDDLQ E G+ +GT EF Sbjct: 177 FGRMLAD---LPDKNIDAASQVAHAISTHRVSIEFDFYTAVDDLQPESETGAGMMGTVEF 233 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 +S FYRY+N+++ QL NL G RE AL+ +H +P KQ + AA NP MV Sbjct: 234 NSACFYRYSNVSMEQLITNLQG-DRELALKTLEAFIHASVRAIPTGKQNSMAAHNPPSMV 292 Query: 280 M-VNFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNG--AAAQFSL 334 V P S+ANAF + V ++ + SIQA + YW ++ + YG + A +L Sbjct: 293 FAVVREGAPWSLANAFARPVAPGREEDLVGRSIQALDSYWGKLVSVYGGDDIRKKALITL 352 Query: 335 SDVDPITAQVKQMPTLEQLKSWV 357 DV ++ T++ L V Sbjct: 353 EDVPLQHLGDARVETVKALVEQV 375 >UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC4_SYNJA Length = 380 Score = 318 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 105/333 (31%), Positives = 161/333 (48%), Gaps = 18/333 (5%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK--SGYYAQNIGESS 62 + IH++ S P+ LNRD+ M K IFGG+ R RISSQ KRA+RK Y + + Sbjct: 3 LEIHLIQSFPPANLNRDENGMPKSTIFGGRPRARISSQCQKRAVRKYYHQYAELDPAHFA 62 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 R+ + K G +Q + LAL G + +K A + E+ Sbjct: 63 ARSRNWLPELKSKLVKAGIPDEQAGMAARLALEQGLKLKFNDKNEATTIVFLGKTELDAI 122 Query: 123 CEQVAK----AEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 E + K E+ ++K KL + + + I V+ + D+AL GRM S Sbjct: 123 AEILIKNWSAIESGLREEKPKLPQKIAKAIEKALVDTGKPGDVALFGRMMAS---LPTVN 179 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A+ +AHAI+ + + + D+FTAVDDL ++ G+ H+G ++S +YR+A ++ Q Sbjct: 180 VDAAVQVAHAISINALQQEFDFFTAVDDLGSSEDTGADHMGETGYNSSTYYRFAVLDKKQ 239 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM-VNFSDMPLSMANA 293 L ENLGG E I VP Q +AA +VM V P+S+ +A Sbjct: 240 LVENLGGT--EHLGSIIKAFATAFIHAVPSGHQNGFAAHTRPALVMAVVREGQPISLVDA 297 Query: 294 FEKAVKAKDGF--LQPSIQAFNQYWDRVANGYG 324 FE V GF L+ +++A ++YW + YG Sbjct: 298 FENPVAPSGGFSLLENAVKALDEYWGSLVKMYG 330 >UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=Bacteria RepID=A4XYU0_PSEMY Length = 384 Score = 315 bits (806), Expect = 2e-84, Method: Composition-based stats. Identities = 115/384 (29%), Positives = 177/384 (46%), Gaps = 35/384 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F+ H++ + +PS LNRDD KDA+FGG RR R+SSQ KRA+R + + + Sbjct: 1 MSLFVEFHLIQNFAPSNLNRDDTGAPKDALFGGHRRARVSSQCFKRAIRLAAQEHELVAP 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 R + +L+ +L ++L R + K L+ + + + + EIA Sbjct: 61 E-FRGVRTKKLKTLLLERLAGRDPLEAEGKIEVALAAAGLKLKDDGKTEYLLFLGEAEIA 119 Query: 121 WFC-------EQVAKAEADNLDDK-----------KLLKVLKEDIAAIRVNLQQGVDIAL 162 F +++A A A +V+K+ A ++ + VD+AL Sbjct: 120 GFATLIEQHWDELAGAPAGGEKKGEKKGKKEAKASAPAEVVKK--AKALLDGGKAVDVAL 177 Query: 163 SGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEF 219 GRM D A +AHAI+TH+V+ + D+FTAVDD E G+ +G EF Sbjct: 178 FGRMLAD---MPEVNQDAACQVAHAISTHRVEREFDYFTAVDDKGGPDETGAGMIGQVEF 234 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 +S YRYA ++ +L NL RE L + +P KQ T+AA N V Sbjct: 235 NSATLYRYAVVDAGKLLGNLQ-QDRELTLSALEAFTQAMVRAIPTGKQNTFAAHNLPSFV 293 Query: 280 MVNFSDM-PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSD 336 V PL++ANAFEK + A+ S+ ++ ++A Y A+ Q++ D Sbjct: 294 GVCLRHAGPLNLANAFEKPIAARQDAALSSLSVTELAKHEGKLAAVYA--DASDQWAYLD 351 Query: 337 VDPITAQVKQMP--TLEQLKSWVR 358 + Q K L +L SWVR Sbjct: 352 LSEAWPQQKGFAVQNLGELASWVR 375 >UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA64_9ACTO Length = 374 Score = 312 bits (799), Expect = 1e-83, Method: Composition-based stats. Identities = 111/380 (29%), Positives = 169/380 (44%), Gaps = 27/380 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IHVL + PS NRDD K A FGG +R+RISSQ++KRA R+ G Sbjct: 1 MSVFVDIHVLQTLPPSNPNRDDTGAPKSATFGGVQRMRISSQAIKRATRQDFEGKIADGN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-------EKISADAVTP 113 +RT + +L + +R D + LA + K++ + + + Sbjct: 61 YGVRTKKIVELVARTITE--KRPDLEAASIELAEMGLKAIGFKLAEPRGNKSDNELKESG 118 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V A E V+ A + KE V+ +DIAL GRM Sbjct: 119 FLVFLSAKQIEHVSDALISVAHEDDPAAAFKELKPRSLVDTDHSIDIALFGRMVAE---P 175 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ------EQGSAHLGTQEFSSGVFYRY 227 VD A +AHAI V+ + D++TAVDD + ++G+ +GT EF+S YRY Sbjct: 176 NALNVDAACQVAHAIGVGAVEREYDYYTAVDDAKKRNDEADEGAGMIGTIEFASATVYRY 235 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDM 286 A IN+ L+ENLG A V +P K T+A + V+V D Sbjct: 236 ATINVDLLRENLG--DDAVADRAVELFVDSFVRSMPTGKVTTFANRTLPEAVLVQVRDDQ 293 Query: 287 PLSMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL-----SDVDPI 340 P++M+ AFE+ + A + GF +P+I F ++ ++ GL + S + Sbjct: 294 PINMSGAFEEPIIAGQHGFAEPAIARFVEFESQLRELTGLEAVESLVSWTTPRGESFSEL 353 Query: 341 TAQVKQMPTLEQLKSWVRNN 360 QV+ E VR Sbjct: 354 GKQVRLASLGETAAEAVRGT 373 >UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putative n=2 Tax=cellular organisms RepID=B0TDU0_HELMI Length = 385 Score = 312 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 119/389 (30%), Positives = 174/389 (44%), Gaps = 37/389 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES--S 62 + IHVL +H+P+ LNRD+ KD +FGG RR RISSQ KR +R S + +IGES Sbjct: 2 VEIHVLQNHAPANLNRDESGSPKDCMFGGVRRGRISSQCQKRTIRCSPLFQDSIGESRLG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 +RT L L +LG + I A G + K D +T + Sbjct: 62 MRTRKLPFLVKEELMRLGLSEELAKIGARKASGLGN---KDGKERDDEITAQAIFLTQED 118 Query: 123 CEQVAKAEADNLDDKKLLKVLKEDIAAIRVN------LQQGVDIALSGRMATSGMMTELG 176 +A+ +L DK + + ++ + VD+AL GRM TS T Sbjct: 119 VSVIARCLFRHLKDKTVKQAKAIKAQELQKDPELVGWRPVTVDVALFGRMTTS---TAFN 175 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+ ++ + HAI+TH+VDS+ D+FTAVDDL + G+ +G EF+S +Y+Y N+++ Sbjct: 176 DVEASVQVGHAISTHRVDSEFDYFTAVDDLMGDGDSGADMIGDTEFNSCCYYKYFNVDMD 235 Query: 234 QLQENLGGASR-------------EQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +L+ NL G R A I + L P KQ ++AA V+ Sbjct: 236 ELKRNLAGPDRLKKLTAEERQDLARDAAHIVKAFIESLVFCSPDGKQNSFAARQLPSAVL 295 Query: 281 VNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL-- 334 V P+S ANAF K V A+ +Q S+ AF + +GL L Sbjct: 296 VEVKKRKIPVSYANAFVKPVTARGEMDLVQASVNAFLDHVKETEKCFGLTPNRRWLLLMG 355 Query: 335 -SDVDPITAQVKQMPTLEQLKSWVRNNGE 362 T QV P L + + GE Sbjct: 356 CESPKMTTDQVSTFPALVEELTAALQQGE 384 >UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=B1VIY1_CORU7 Length = 376 Score = 310 bits (795), Expect = 4e-83, Method: Composition-based stats. Identities = 110/374 (29%), Positives = 166/374 (44%), Gaps = 22/374 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+I+ L S PS +NRDD + K+AIFGG R R+SSQS KRA+R+ + + Sbjct: 1 MSKIIDIYALQSLPPSLINRDDTGVPKNAIFGGVPRQRVSSQSWKRAIRRYFFENFDAAN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA------EKISADAVTPW 114 R+ L + ++ G I++T L + A +K DA + Sbjct: 61 IGDRSKRLPEKIARQLEEQGMEQGT-AIERTEQLFKAAGIKTAVEKKPKDKDETDAEVAY 119 Query: 115 -VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 G + + + ++ K + AI ++ + VDIA+ GRM Sbjct: 120 PQTGYLLFLSAHQIDNAVKAIQERDGKNFTKREAQAI-LDQEHSVDIAMFGRMVADDAAY 178 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYAN 229 VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + S YRYA Sbjct: 179 ---NVDAAVQVAHALGIHDSAPEFDYFTAVDDLAEEGEETGAGMIGTVQMMSSTLYRYAT 235 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPL 288 +NL L ENL S + A + A V +P K T+A ++V V D P+ Sbjct: 236 VNLEGLAENL--DSEDAAKQAAVEFVEAFIASMPTGKINTFANQTLPELVYVAVRDTRPV 293 Query: 289 SMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQF-SLSDVDPITAQVK 345 S+ NAFE V+A + G + + Q V N YG A+ L + + Sbjct: 294 SLVNAFEAPVEATEDKGRREVGAEVLAQEARDVENVYGFKPQASFVMGLGQLAEPFTDIA 353 Query: 346 QMPTLEQLKSWVRN 359 TL +LK + Sbjct: 354 TQVTLPELKEQLAG 367 >UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actinomycetales RepID=Q2JH28_FRASC Length = 384 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 102/340 (30%), Positives = 159/340 (46%), Gaps = 22/340 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M +I++H+L + PS LNRDD K A++GG +R R+SSQ+ KRA R + + + Sbjct: 1 MRCYIDVHILQTVPPSNLNRDDAGTPKQAVYGGVKRARVSSQAWKRATRTAFADHIDQAQ 60 Query: 61 SSLRTIHLAQLRDVLRQKLGER--FDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG- 117 RT + +L ++L R D + + L A K +A+ G Sbjct: 61 LGTRTKRI---SALLAERLATRCALDAETSTRIATSLLTALKISAGKKAAETAYLLFFGR 117 Query: 118 -----EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 I E V + +L D LL +K+ + +D+AL GRM Sbjct: 118 PQLERLIDLIVEDVPRLA--DLSDGDLLAAVKDVPVLATLGSDHPIDVALFGRMVADLAS 175 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYAN 229 VD A +AHA++TH VD + D++TAVDD E G+ +GT EF S YR+A Sbjct: 176 L---NVDAATQVAHALSTHAVDVEFDYYTAVDDQNAKDETGAGMIGTVEFQSATLYRFAT 232 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV-MVNFSDMPL 288 + L QL ENLGG E +E + T +P Q ++A +++ + D P+ Sbjct: 233 VGLHQLAENLGG-DIEATVEALRVFLTAFTTSMPTGHQNSFAHRTVPNLLTIAIRPDQPV 291 Query: 289 SMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGLNG 327 ++ +AFEK V + G L S++ F + + +GL Sbjct: 292 NLVSAFEKPVLPRGRGVLTGSLEQFAIELNSASTLWGLQP 331 >UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellular organisms RepID=Q2FNL3_METHJ Length = 382 Score = 307 bits (787), Expect = 3e-82, Method: Composition-based stats. Identities = 112/399 (28%), Positives = 177/399 (44%), Gaps = 62/399 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS FI IH+L S+ PS LNRDD+ K A GG +R+R+SSQSLKR+ R S ++ + G Sbjct: 1 MSEFIQIHMLASYPPSNLNRDDLGRPKTATVGGTQRIRVSSQSLKRSWRTSEAFSDALKG 60 Query: 60 ESSLRTIHLA-----------QLRDVLRQKLGERFDQKIIDKTLA-------------LL 95 +RT + L D+L K ++I D+ A + Sbjct: 61 AIGIRTRDMGVKIKKALVEGRLLSDILEGKESGVTRERIKDEKKAHEWAVKISSHFGKIE 120 Query: 96 SGKSVDEAEKISA------------DAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVL 143 GK D +K + + EIA + + + + + Sbjct: 121 GGKEKDSDKKSEKTDEKSNKNPLSHKQMVHYSPEEIAGIDDLLGRISGG--------EKV 172 Query: 144 KEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 +D + + VDIAL GRM + A+ ++HAIT H + D+FTAV Sbjct: 173 SDDDCIRLRSDHKAVDIALFGRMLADNAAY---NTEAAVQVSHAITVHDTPVEDDYFTAV 229 Query: 204 DDLQE----QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLA 259 DDL + G+ H+G EF +G+FY Y IN L+ENL G E + ++ + Sbjct: 230 DDLNQLDDTAGAGHIGEAEFGAGLFYTYICINRDLLKENLQG-DNELSNRAIEALIRAAS 288 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 P KQ ++A+ + A ++V + P S+A AF K V KD + +++ DR Sbjct: 289 MVSPSGKQNSFASRSYASYLLVEKGTEQPRSLAAAFFKPVSGKDIY-GDAVKNLEGLRDR 347 Query: 319 VANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWV 357 + N YG + + S++ +D +L + S+V Sbjct: 348 MDNAYGTSFKQSSRSMNVIDGTG-------SLTDIISFV 379 >UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ0_STRMN Length = 359 Score = 307 bits (786), Expect = 4e-82, Method: Composition-based stats. Identities = 91/356 (25%), Positives = 163/356 (45%), Gaps = 24/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I+ + + PS +NRDD K +GG RR R+SSQS K+AMR Y + Sbjct: 11 FLDIYAIQTLPPSNINRDDTGSPKTTQYGGVRRARVSSQSWKKAMRDYFYEHAEEEQLGK 70 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 RT + ++K K + + + D + +G Sbjct: 71 RTRKVVNYVAEKIIHQKIDLNEKESSKLAT-----DILKLAGVPTDGKVLFFIGNTE--A 123 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A A + DK+ + + + +D+AL GRM + TE D + Sbjct: 124 EKLATAAVKGVKDKEEARKI--------MQSNLALDVALFGRMVANDKETEA---DASSQ 172 Query: 184 IAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AH I+TH V ++ D++TAVDDL + + LGT EF+S YRYAN+ + + G Sbjct: 173 FAHPISTHAVQTEFDFYTAVDDLASDDDAKAGMLGTVEFNSSTLYRYANVAIHEFLVQRG 232 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD-MVMVNFSDMPLSMANAFEKAVK 299 +RE ++ + A +P K ++A +++ SD P+++ +AFE+ VK Sbjct: 233 --NREDLVDSLQLFIKAFAESMPRGKINSFANQTIPQTLIITVRSDRPVNLVSAFEEPVK 290 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ ++ + +V + SL +V+ +T + ++ +L Sbjct: 291 SSNGYVTKSIEKLSKEFVKVEKMVKKPVLSFYVSLEEVEALTKVGIEKNSITELVE 346 >UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes RepID=Q03C61_LACC3 Length = 361 Score = 307 bits (786), Expect = 5e-82, Method: Composition-based stats. Identities = 106/322 (32%), Positives = 170/322 (52%), Gaps = 27/322 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 +I+IHVL + + +NRDD K A++GG R R+SSQS KRAMR + ++ ++ L Sbjct: 7 YIDIHVLQTVPSANINRDDTGAPKKALYGGVTRARVSSQSWKRAMRLR-FNQEDHDDAGL 65 Query: 64 RTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 RT + Q LR L+ D++I K A+ S + KI+ D T ++ Sbjct: 66 RTKEVPQLLRQALKAAAPALTDEEIAAKVDAVFSTAKI----KITKDGQTGALMLISTGQ 121 Query: 123 CEQVAKAEADN--LDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 +++A+ DN LD K+L K+ K +Q +D+AL GRM V+G Sbjct: 122 LKKLAQYALDNEALDKKELTKLFK---------GEQSLDLALFGRMVADN---PELNVEG 169 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRYANINLAQLQE 237 + +AHAI+TH++ + D+FTA+DD + G+A LGT E++S YRYAN+N + + Sbjct: 170 SAQVAHAISTHEIVPEFDYFTALDDFKPEDNAGAAMLGTVEYNSSTLYRYANLNFQEFEA 229 Query: 238 NLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEK 296 N+GG A+ A + +P KQ T+A + VMV D P+++ +AFE Sbjct: 230 NIGGR---AAVSGALSYIKEFLLSMPNGKQNTFANKTLPNYVMVTLRPDTPVNLVSAFED 286 Query: 297 AVKAKDGFLQPSIQAFNQYWDR 318 VK+ G+++ S++ Q + Sbjct: 287 PVKSNHGYVEASVKRLEQEYQD 308 >UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTA9_SACVD Length = 390 Score = 307 bits (785), Expect = 6e-82, Method: Composition-based stats. Identities = 105/344 (30%), Positives = 162/344 (47%), Gaps = 24/344 (6%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 +I+IHV+ + S +NRDD K FGG R R+SSQS KR +R+ GE+ Sbjct: 4 PKYIDIHVIQTLPFSNVNRDDTGSPKTVEFGGVERTRVSSQSWKRVVRQH-VEEAVGGET 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVD-EAEKISAD----------A 110 + + + L ++ E+ + + +AL +GK + + EK +D Sbjct: 63 VRTRRVVVGVAERLIKQGWEKSEAEAAGVQIALSAGKKISLKQEKDESDEVVLTTNVLLL 122 Query: 111 VTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN---LQQGVDIALSGRMA 167 + + E+A ++ + K L +K + + R+N ++ I L GRM Sbjct: 123 LPESGIDELAALADEHREVILAEAKKAKKLTGMKPKLPSERINEILSRRSATINLFGRMV 182 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGV 223 VDGA+ +AHA TTH + D+FTAVDD+++ GS ++ T FS+G Sbjct: 183 AE---LPGANVDGAVQVAHAFTTHGTAVEYDFFTAVDDIEQKLDLPGSGYMDTALFSAGT 239 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRYAN+NL L NL + A + + T VP KQ AA D+V V Sbjct: 240 FYRYANVNLTDLLRNL-DQDTDLARVLVKTFLDGFITTVPSGKQNATAAVTLPDLVHVTV 298 Query: 284 S-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLN 326 D P+S+ANAFE V DGF++ S + + +A G + Sbjct: 299 RDDRPVSLANAFEAPVGGGDGFVRKSAHRLDSHAGAIAELLGES 342 >UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ARH7_PELPD Length = 374 Score = 305 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 116/370 (31%), Positives = 179/370 (48%), Gaps = 23/370 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M + IHVL + +PS LNRDD KDA+FGG RR R+SSQ LKR++R+ + QN G Sbjct: 1 MKTIVEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARVSSQCLKRSVREY-FKDQNKGW 59 Query: 61 SSLRTIHLA-QLRDVLRQKLGERFD------QKIIDKTLALLSGK-SVDEAEKISADAVT 112 + RT + L++ + L + D K I+ ++ L V ++ +D + Sbjct: 60 VADRTKRVVYALKERISPVLESQKDFSEDNLLKAIEVAVSNLGSNKKVKVDKEKKSDVLL 119 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN--LQQGVDIALSGRMATSG 170 EI + VA++ AD L K +V++ AI + VD+AL GRM Sbjct: 120 FLSPKEIDALAQVVAESYADLLKTKLSDQVVRNLNDAIDGENKSRLSVDVALFGRMLA-- 177 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRY 227 + + A +AHAI+TH V+ + D++TAVDDL+ G+ +GT EF+S FYRY Sbjct: 178 -VMPEKNQNAACQVAHAISTHAVEREFDFYTAVDDLKPEDTAGADMMGTVEFNSACFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-- 285 A ++ +L NL A A + + P KQ T+AA NP + V V Sbjct: 237 AVVDWEKLLVNLQ-ADEALATKGLRAFLEGFVVAEPTGKQNTFAAHNPPEFVAVTVRRNA 295 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ 343 P ++ANAFE AV+ + + S + + + +G + +L++ Sbjct: 296 APRNLANAFETAVRVRKDESLTRKSAEGLANKAKALQSAFGGDEKTFVLNLAEATIDGFG 355 Query: 344 VKQMPTLEQL 353 + MPTL L Sbjct: 356 I-VMPTLNDL 364 >UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=Proteobacteria RepID=B3E5V0_GEOLS Length = 356 Score = 304 bits (778), Expect = 4e-81, Method: Composition-based stats. Identities = 101/340 (29%), Positives = 151/340 (44%), Gaps = 27/340 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F+ IH+L S+ P+ LNRDD K A GG R+R+SSQSLKRA R S + Q + E Sbjct: 1 MSRFVQIHLLTSYPPANLNRDDQGRPKTAKMGGYDRLRVSSQSLKRAWRTSDLFQQALTE 60 Query: 61 S-SLRTIHLAQL------RDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 RT L + +++K + QKI AL K D + + + Sbjct: 61 HVGTRTKLLGVMAYEKLVAGGVKEKQAKESAQKIAGVFGALKKAKEKDSLVDLEIEQLVH 120 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 EI + + + ++ + Q DIA+ GRM S + Sbjct: 121 VSPSEIQAIESLLETLISQG-------RAPEDTELDLLRIQGQSADIAMFGRMLAS---S 170 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYAN 229 V+ A +AHAI+ H V + D+FTAVDDL ++ G+AH+G F++G+FY Y Sbjct: 171 PSYNVEAACQVAHAISVHPVVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAAGLFYSYIC 230 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPL 288 IN L ENLGG + ++ P KQ ++ + A V+ D P Sbjct: 231 INRTLLVENLGG-DEALVQKSIQALIEAAVKVPPNGKQNSFGSRAYASYVLAEKGDQQPR 289 Query: 289 SMANAFEKAVKAKD----GFLQPSIQAFNQYWDRVANGYG 324 S++ AF K V ++ F ++ A + YG Sbjct: 290 SLSVAFLKPVTSQGIEGTDFGTAAVDALTTQRQNMDAVYG 329 >UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR15_ROSS1 Length = 402 Score = 303 bits (777), Expect = 5e-81, Method: Composition-based stats. Identities = 116/398 (29%), Positives = 182/398 (45%), Gaps = 50/398 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 I +H+L +H+PS LNRDD N KDAIFGG RR RISSQ++KR++R S ++ L Sbjct: 2 LIALHLLQNHAPSNLNRDDNNEPKDAIFGGVRRARISSQAIKRSIRWSDHFRAPFETQGL 61 Query: 64 RTIHLAQLRDVLRQKLG----ERFDQKIIDKTLALLSG------KSVDEAEKISADAVTP 113 I L + +R L DQ+ I + A L EA D P Sbjct: 62 LAIRTQLLPEKVRHHLVNAGLNDDDQRAIVEAAARLGKGEQRSPSGEGEAGDERGDQNQP 121 Query: 114 WVV---------GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNL---------- 154 E++ A+ L + ++ ++ + I +R Sbjct: 122 RSSSRSRRSSRQSNTTGDAERIKTAQLMFLTENEIQQLAQRLIEIVREKGAKHLNELQGD 181 Query: 155 ----------QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVD 204 VDIA+ GRM TS + V+ A+ +AHAI+TH V+ + D++TAVD Sbjct: 182 TLVREIGEYEPHSVDIAMFGRMTTSSPFKD---VEAAVQVAHAISTHAVEMEFDFYTAVD 238 Query: 205 DLQ-EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVP 263 D+ E G+ +G F+S +Y+Y +I+ L +NL G + A + ++ +P Sbjct: 239 DISGEAGAGFIGDTTFNSATYYKYFSIDWDGLLKNLHG-EQNVARQSVEALIRAALFAIP 297 Query: 264 GAKQRTYAAFNPADMVMVNFS--DMPLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRV 319 KQ ++AA N D+ +V ++ LS ANAF K V+A ++ S +A +Y + Sbjct: 298 SGKQNSFAAHNLPDLALVEVRKENIALSYANAFVKPVRATGKLSLIEASAKALEEYIPAI 357 Query: 320 ANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWV 357 Y L+ A LS V + + LE+L +W+ Sbjct: 358 NERYNLSAQRAF--LSTVPFTLSGAECCSDLEKLITWL 393 >UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ3_THEFY Length = 373 Score = 303 bits (776), Expect = 6e-81, Method: Composition-based stats. Identities = 106/368 (28%), Positives = 170/368 (46%), Gaps = 23/368 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 F++IH + + S +NRDD+ K ++GGK R R+SSQS KRA+R +G+ + Sbjct: 2 TFVDIHAIQTLPYSNINRDDLGSPKTVVYGGKERTRVSSQSWKRAVRHE--VEARLGDKA 59 Query: 63 LRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPW------- 114 +RT ++++ LR++ + + + L GK + D+ P Sbjct: 60 VRTRRIISEIAKRLRERGWDADLADAGARQVVLSVGKKSGIKLEKEKDSEAPATSVLFYL 119 Query: 115 ---VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGM 171 + E+A ++ A A K +L D + + V + L GRM Sbjct: 120 PVPAIDELAAIADEHRDAVAKEAAKKTPKGILPADRITEVLKSRN-VSVNLFGRMLAELP 178 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYA 228 TE VDGA+ AHA T H ++D+FTAVDD+ + GS H+ +FS+G FYRYA Sbjct: 179 STE---VDGAVQFAHAFTVHGTTVEVDFFTAVDDIPKENDHGSGHMNAGQFSAGTFYRYA 235 Query: 229 NINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMP 287 N+NL +L EN G A + A + + VP KQ AA D+V + D P Sbjct: 236 NVNLDRLVENTGDA--QTARTAVAEFLRAFLSTVPSGKQNATAAMTLPDLVHIAVRFDRP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +S A AFE A+ DG+ + Q N Y +R+ + + ++ + + A ++ Sbjct: 294 ISFAPAFETALYGSDGYTLRACQELNNYAERLREVWPDDAIRGYATVENKTDLAALGERY 353 Query: 348 PTLEQLKS 355 + L Sbjct: 354 DSYPALID 361 >UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacterium RepID=C3PF94_CORA7 Length = 384 Score = 303 bits (775), Expect = 7e-81, Method: Composition-based stats. Identities = 109/380 (28%), Positives = 168/380 (44%), Gaps = 32/380 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+IH L + PS +NRDD K AIFGG R R+SSQS KRA+R + Sbjct: 1 MSLVIDIHALQTLPPSLINRDDTGAPKSAIFGGVPRQRVSSQSWKRAIRNYFEKNVDPEF 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSV-----------------DEA 103 R+ L + L + ++ I + L + ++ Sbjct: 61 VGDRSKRLPEKIAKLVENHDGWDSERAIKQVSDLFKAAGISTEVDSKRIKELEKSDAEDK 120 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALS 163 E++ +A P I +Q+ +A +D + +K+ A + ++ Q VD+A+ Sbjct: 121 EELIKEASYPRTKYLIFLSPQQIDRAVRAIVDADG--EKIKKAEAKVILDTQHSVDMAMF 178 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEF 219 GRM VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + Sbjct: 179 GRMIADDAAF---NVDAAVQVAHALGIHSSAPEFDYFTAVDDLAEDGEETGAGMIGTVQM 235 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S YR+A +N+A L +NL AS E A + A V +P K T+A ++V Sbjct: 236 MSSTLYRFATVNVAGLTKNL--ASEENAKQAAVQFVDAFIKSMPTGKINTFANHTLPELV 293 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFS-LS 335 V D P+S+ AFE+ V+A D +A + YGL AA +S Sbjct: 294 YVTVRDTRPVSLVTAFEEPVQATDDKNLRLAGAEALAKEEREFEENYGLKPLAAFAVGVS 353 Query: 336 DVDPITAQVKQMPTLEQLKS 355 + A + + TL +L Sbjct: 354 EARAPFADIAETVTLPELSE 373 >UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE3_PROAC Length = 374 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 112/376 (29%), Positives = 182/376 (48%), Gaps = 30/376 (7%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S +++IHV+ S PS +NRDD K A++GG RR R+SSQ+ K+A+R S ++ Sbjct: 3 SYYVDIHVIQSVPPSNVNRDDTGSPKSALYGGVRRARVSSQAWKKAVRTSFKEFLPANQT 62 Query: 62 SLRTIHLAQLR--DVLRQKLG---ERFDQKIIDKTLAL-LSGKSVDEAEKISADAV--TP 113 RT+ + +L + G E QK ++ AL L + + ++ A+ + T Sbjct: 63 GSRTLRVVELLMNRLTAAPYGLPEEDARQKALEVVKALGLKAEKPRKKDESGAEGIERTQ 122 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V +++A+ A D K+ + A + G+++AL GRM + Sbjct: 123 YLVFYSNQQLDRLAQLAATT--DGKITATDAKKAA----DSDHGIEVALFGRMVAD---S 173 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYAN 229 + VD A+ +AHA++TH V+ + D+FTAVDD + + G+ +GT EF+S YR+A Sbjct: 174 KDLNVDSAVQVAHALSTHAVEIESDYFTAVDDYKLDEDDAGAGMIGTVEFTSETLYRFAT 233 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDMPL 288 + ++ L++NLG + + A+ V +P KQ T+A D V+V Sbjct: 234 VAVSTLKDNLG--DVDLTAQAASAFVRGFIMSMPTGKQNTFANNTIPDAVVVQVRKGRSA 291 Query: 289 SMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL---SDVDPITAQV 344 S AFE V + D GF+ S QA Y + L A F S + I Sbjct: 292 SFIGAFEDPVTSDDGGFVAASCQAVAAYAHDCEEAF-LGAPEASFVTRVGSRTEAIGTMG 350 Query: 345 KQMPTLEQLKSWVRNN 360 QMP ++ L S VR+ Sbjct: 351 TQMP-IDDLVSSVRDQ 365 >UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ1_SPHTD Length = 397 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 108/388 (27%), Positives = 171/388 (44%), Gaps = 40/388 (10%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ +H++ + +PS LNRDD KD FGG RR RISSQ+LKRA+R + + E S Sbjct: 2 FVELHIIQNFAPSNLNRDDTGAPKDCQFGGYRRARISSQALKRAIRMTFGEENLLPEES- 60 Query: 64 RTIHLAQLRDVLRQKLG----ERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 R ++ L ++L + + + G S ++ ++ + T +++ Sbjct: 61 RARRTKRIAGALVERLVASGKDAVAAAAVVEAAIQGIGLSFEKPKEGDTEKKTQYLLFLG 120 Query: 120 AWFCEQVAKAEADNLD--------------------DKKLLKVLKEDIAAIRVNLQQG-- 157 +A + D K L + + ++ G Sbjct: 121 QREINALADVCLAHWDTLVDVAPNADAASERDAKKAKKANKAALPKQVQLALLDALDGRS 180 Query: 158 VDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHL 214 D+AL GRM +D A +AHAI+TH+V ++ D++TAVDDL+ G+ L Sbjct: 181 ADVALFGRMLAD---LPEKNIDAASQVAHAISTHRVATEFDFYTAVDDLKPDDTAGADML 237 Query: 215 GTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN 274 GT EF+S FYRY+NI++ QL ENLGG + A + +P KQ + AA N Sbjct: 238 GTVEFNSACFYRYSNIDVDQLIENLGG-DVDLARTTVEAFLWASIHAIPTGKQNSMAAQN 296 Query: 275 PADMVMVNFSDMP-LSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ 331 P VM D S+ANAF V ++ S+ A YW + YG Sbjct: 297 PPSFVMAVVRDRGLWSLANAFVNPVAPAHDGDLIERSVDALEAYWSNLVRVYG-GELRGT 355 Query: 332 FSLSDVDPITAQVKQM--PTLEQLKSWV 357 + ++ ++++ T E+L V Sbjct: 356 WCVNVNPRELGPLEELHVDTFEELVDAV 383 >UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=Dehalococcoides RepID=D0Y919_9CHLR Length = 427 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 103/382 (26%), Positives = 169/382 (44%), Gaps = 60/382 (15%) Query: 6 NIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSLR 64 IH++ + +PS LNRDD K A FGG RR RISSQ KR+ R G A+ + + ++R Sbjct: 9 EIHLIQNFAPSNLNRDDTGQPKSATFGGFRRARISSQCSKRSTRLQGPLAELLENQGAVR 68 Query: 65 TIHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWV-------- 115 T L ++ + K E D++ I+ + ++ K S + Sbjct: 69 TRQLIMEIAKAIDTK--EEPDERTIEIVAGVFEAGGLERPAKRSGKVKSQAAEAIGEDGE 126 Query: 116 -----------VGEIAWFCEQVA-----KAEADNLDD-----KKLLKVLKEDIAAIRVNL 154 +I F +++A +N DD K++ + + + + Sbjct: 127 INGNEGFESGNKTKILLFLDKMAFPKLIDVFKENWDDLAKGNKEVKEKACDKVGRLLFEA 186 Query: 155 QQGVDIALSGRMATSGMMTELGK----VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--- 207 + DIAL GRM T GK V+ A +AH I+TH++D ++D++TAVDDL Sbjct: 187 VKAPDIALFGRMLEVKNNTPFGKYNMSVEAACQVAHPISTHKIDMEMDFYTAVDDLNPDG 246 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG---------------GASREQALEIAT 252 E G+ +G F+S +YRYA ++ QL NL E+A ++ Sbjct: 247 ETGAGMMGVVGFNSACYYRYALVDRDQLARNLARKTERKNGGWAQGLETQDYEEADKVVK 306 Query: 253 HVVHMLATEVPGAKQRTYAAFNPADM-VMVNF-SDMPLSMANAFEKAVKA---KDGFLQP 307 + + +P KQ ++AA N + V +P+S+ANAF ++ D + Sbjct: 307 AFLEAMIYAIPTGKQNSFAAQNLPSFGLFVKRKGGVPVSLANAFSTPIRPVRDDDDLVGL 366 Query: 308 SIQAFNQYWDRVANGYGLNGAA 329 S+ A ++WD + YG G Sbjct: 367 SVNALTKHWDAIKELYGDQGIK 388 >UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK6_CITRO Length = 363 Score = 302 bits (773), Expect = 1e-80, Method: Composition-based stats. Identities = 99/361 (27%), Positives = 157/361 (43%), Gaps = 27/361 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K + GG R+RISSQSLKRA R S + Q + G Sbjct: 13 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGTTRLRISSQSLKRAWRTSELFEQALAG 72 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDE---AEKISADAVTPWVV 116 +R+ +A ++ E + ID+ A+ +++ K A P Sbjct: 73 NIGIRSGRIA-------REAAEILIKSGIDEKKAVAYVEAIARCFGKVKADKKAKEPLTN 125 Query: 117 GEIAWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGM 171 E ++ AE D + + + KE+ A+ + + VDIA+ GRM Sbjct: 126 SETEQLV-HISPAEFDAVKALAHRLAEEKRAPKEEELALLRHDRMAVDIAMFGRMLADKP 184 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRY 227 V+ A +AHA + + D+FTAVDDL + G+ HLG F S +FY Y Sbjct: 185 EF---NVEAACQVAHAFGVSETIVEDDFFTAVDDLRANSDDAGAGHLGYTGFGSALFYTY 241 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DM 286 IN L +NL G + + A + P KQ ++A+ A M D Sbjct: 242 ICINKDLLIKNLNG-NVDLANQTLRAFTEAALKVSPTGKQNSFASRAYACWAMAEKGTDQ 300 Query: 287 PLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQ 346 P S+A AF K + D L ++Q + + + Y F++ + + V + Sbjct: 301 PRSLAAAFYKPIVGSD-HLNVAVQRVTELRENMNAVYEQQTEFVGFNVMNKEGSIKDVLE 359 Query: 347 M 347 Sbjct: 360 F 360 >UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW7_ACIFD Length = 386 Score = 301 bits (771), Expect = 3e-80, Method: Composition-based stats. Identities = 109/348 (31%), Positives = 164/348 (47%), Gaps = 23/348 (6%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA-QNIGESSL 63 I++HVL + PSCLNRDD N K A++GG RR R+SSQS KRA R+ IG L Sbjct: 9 IDVHVLQTLPPSCLNRDDTNAPKTALYGGARRARVSSQSWKRATRRYFNENLATIGTDWL 68 Query: 64 RTI----HLAQLRDVLRQKLGERF-DQKIIDKTLALLSGKS-------VDEAEKISADAV 111 R+ +L +L +++ R D + + +A L + +E K A Sbjct: 69 RSRGGGIRTRKLAGLLHERVQARVRDLDVREDDVARLVNLAAGALLGLKEEKLKKRAQET 128 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSG 170 P + + E A L+ + L D+ + +D+AL GRM Sbjct: 129 QPADLEYALFVSESAIDAAVGELERSLRAGDDLDLDVLTTAMGRDLSLDVALFGRMIAD- 187 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRY 227 T VD A +AHAI+TH+V S+ D++T VDDL E G+A +G EF+S YR+ Sbjct: 188 --TPNLNVDAACQVAHAISTHRVTSEFDFYTTVDDLAGDDETGAAMMGFIEFNSATVYRF 245 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDM 286 A ++L +L +NLG + + A +P Q T+AA D+V V+ D Sbjct: 246 ATVSLGRLADNLG--DPDAVPTGVRAFIEAFAKSLPTGHQNTFAALTVPDLVFVSMRGDQ 303 Query: 287 PLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 P+S+ AFE V++ G++ S + Y D + YG+ S Sbjct: 304 PVSLVGAFEAPVESDRGYVHASAERLATYADDIDGLYGVPRLNGWASY 351 >UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=Bacteria RepID=Q3A5Z5_PELCD Length = 373 Score = 300 bits (767), Expect = 7e-80, Method: Composition-based stats. Identities = 108/357 (30%), Positives = 160/357 (44%), Gaps = 44/357 (12%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS FI +H+L S+ P+ LNRDD+ K A GG R+R+SSQSLKRA R S + + + Sbjct: 1 MSRFIQLHLLTSYPPANLNRDDLGRPKTAKMGGVDRLRVSSQSLKRAWRTSDLFGKTVKN 60 Query: 61 S-SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLAL---------------LSGKSVDEAE 104 RT + +K+ ER +K I AL L+ K + Sbjct: 61 GLGTRTKEMG-------RKVYERLVEKGIGHKDALSWAGAIAGVFGKLKKLTDKEKTALK 113 Query: 105 KISADAV--TPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ----GV 158 K++ + V EI + E LD + KE +NL + V Sbjct: 114 KLATEERREKELVEVEIEQLAFFDLEEEQAVLDLTNSIAERKEGPQPEELNLLRQKMTSV 173 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHL 214 DIAL GRM S V+ A +AHAI+ H + + D+FTAVDDL ++ G+AH+ Sbjct: 174 DIALFGRMLASSPAF---NVEAACQVAHAISVHPIVIEDDYFTAVDDLNDGSEDAGAAHI 230 Query: 215 GTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN 274 G F++G+FY Y IN L ENLGG + A + P KQ ++A+ Sbjct: 231 GETGFAAGLFYSYICINRDLLAENLGG-DEDLAQRAIAALTEAAVKVPPNGKQNSFASRA 289 Query: 275 PADMVMVNFSD-MPLSMANAFEKAV------KAKDGFLQPSIQAFNQYWDRVANGYG 324 A V+ + P S++ AF K + + F +++A + + YG Sbjct: 290 YASYVLAEKGEQQPRSLSVAFLKPIDNRTLYRDDQDFGTAAVEALEAHRQNMNKVYG 346 >UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria RepID=A3EQA5_9BACT Length = 398 Score = 299 bits (766), Expect = 8e-80, Method: Composition-based stats. Identities = 113/394 (28%), Positives = 181/394 (45%), Gaps = 47/394 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M I IHVL + +PS LNRDD KDA+FGG RR RISSQ +KR++R + + G Sbjct: 1 MKTLIEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARISSQCIKRSVRDFFCHKREDGI 60 Query: 60 ----ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE---------- 104 E +RT + Q + D+L++K L+ L K +E Sbjct: 61 FSPDEIGVRTKRIYQAIADLLKEKRDISDTITKAKTALSYLKIKPKNEKTQYLLFLSPKE 120 Query: 105 -KISADAVTPW---VVGE-IAWFCEQVAKAEADNLDDKK-------------LLKVLKED 146 K A+A+ + +VGE I ++ + D + ++ + K +E Sbjct: 121 IKDFANAIDEYWDQIVGEPIETDNSELDEETPDTVSLEEQKPKKGKKNKKPNIPKEFQEK 180 Query: 147 IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL 206 + ++ +N + +DIAL GRM + A +AHAI+TH V+ + D++TA+DDL Sbjct: 181 LESV-LNGGKSIDIALFGRMLAD---IPEKNQNAACQVAHAISTHAVEREFDYYTAIDDL 236 Query: 207 QE---QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVP 263 + GS +GT EF+S FYRYA ++L L +NL E + + P Sbjct: 237 KPDDTAGSDMIGTVEFNSACFYRYAVVDLEALNKNLHD-DSELTNKSIRAFLEAFIISEP 295 Query: 264 GAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRV 319 KQ ++AA NP + + ++ P ++ANAFE AV K G + S + + Sbjct: 296 TGKQNSFAAHNPPEFIAISVRHNAGPRNLANAFETAVFPKKGESLTRKSADELVKKAKSL 355 Query: 320 ANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQL 353 + +G +L + + + +LE L Sbjct: 356 QSAFGGEDKTFLINLVGTN-VNGYGTVVASLEDL 388 >UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=Actinomycetales RepID=C7QEM5_CATAD Length = 399 Score = 298 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 103/391 (26%), Positives = 173/391 (44%), Gaps = 38/391 (9%) Query: 1 MSN-FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ ++IH+L + PS LNRDD K A++GG RR R+SSQ+ KRA R++ + Sbjct: 1 MTRVILDIHILQTVPPSNLNRDDTGSPKTAVYGGVRRARVSSQAWKRATRQAFGDLLDPS 60 Query: 60 ESSLRTIHLAQ--------LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV 111 E +RT +A+ L L ++I S ++ + +D Sbjct: 61 ELGVRTKRVAEQIANRMTALEPSLSPGDAVAVAVEVIKAATGAKSEVPKRKSAAVKSDQD 120 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDD------KKLLKVLKEDIAAIRV----NLQQGVDIA 161 + E + +++++ +NL K + LK+ RV + + VDIA Sbjct: 121 ATAALPETGYLM-FLSESQLNNLARLGVEGSKDITAFLKDKDFKNRVRQAADTRHSVDIA 179 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD---LQEQGSAHLGTQE 218 L GRM VD A +AHAI+ H V+++ D+FTAVDD E G+ +G + Sbjct: 180 LFGRMVADAT---DINVDAAAQVAHAISVHAVENESDYFTAVDDRSTEAEPGAGMIGIVD 236 Query: 219 FSSGVFYRYANINLAQLQENLGG------ASREQALEIATHVVHMLATEVPGAKQRTYAA 272 F++ YRYA +++ +L +NLG + E + A +P K T+ Sbjct: 237 FNAATLYRYAAVDVNRLADNLGAGLLEGESQTEPVRRAVEAFIRGFALSMPTGKVNTFGN 296 Query: 273 FNPADMVMVNFS-DMPLSMANAFEKAVKA---KDGFLQPSIQAFNQYWDRVANGYGLNGA 328 D+V+V P+S A AFE+A+ A + G+L+ + + Y ++ Y L Sbjct: 297 HTVPDVVLVKLRASRPISFAAAFEEAISAGEHQGGYLKGACERLASYIPKLEQAYDLQEG 356 Query: 329 AAQFSL--SDVDPITAQVKQMPTLEQLKSWV 357 + + Q ++ QL + V Sbjct: 357 TDSWVVCAGSATEALEQAGDPVSISQLVAAV 387 >UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RXJ6_RHORT Length = 381 Score = 294 bits (753), Expect = 3e-78, Method: Composition-based stats. Identities = 106/381 (27%), Positives = 180/381 (47%), Gaps = 32/381 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE- 60 S F+ IH L S++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 4 SRFLQIHSLHSYTAALLNRDDSGLAKRLTYGGSNRTRISSQCLKRHWRMAEHDPHALQTL 63 Query: 61 ----SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 S R+ L + D++ + L R+ Q I+D + + ++ Sbjct: 64 GGYVGSFRSREL--VTDLVIKPLEGRYPQDILDVLEPEFQKLVYGDKADKGKKSRQTLLL 121 Query: 117 G--EIAWFCEQVAKAEADNLDDKKLLKVLKE-------DIAAIRVNLQQGVDIALSGRMA 167 G E+AW + + A D K L K + + + L G+ AL GRM Sbjct: 122 GQPELAWLARRAEELAAGANDAKALQKAVADWRKDANFKAMSENAALPGGLVAALFGRMV 181 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGV 223 TS +D + +AHA T H +++ D+FTAVDDL+ + G+ + E +SG+ Sbjct: 182 TSD---PAANIDAPVHVAHAFTVHAEEAEGDYFTAVDDLKKDESDSGADTIQETELTSGL 238 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FY Y I+L L N GG +E A ++ ++V+++A PGAK + A + AD++++ Sbjct: 239 FYGYVVIDLPGLIGNCGG-DKEIAAQVVNNLVYLIAEVSPGAKLGSTAPYGRADLMLIEA 297 Query: 284 SD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD---P 339 D P S+A A+ KA+ + ++ A + ++ Y A SL++ P Sbjct: 298 GDRQPRSLATAYRKAIAPD---REQAVAALDGCLAKLDATYETGEARRYLSLAETPLTGP 354 Query: 340 ITAQVKQMPTLEQLKSWVRNN 360 T+ ++++ +L+ L W + Sbjct: 355 ATSGLEKL-SLKALADWTASR 374 >UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA6_DESDA Length = 350 Score = 294 bits (752), Expect = 4e-78, Method: Composition-based stats. Identities = 99/355 (27%), Positives = 165/355 (46%), Gaps = 20/355 (5%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD+ K A+FGG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDLGSPKTAVFGGVQRARVSSQCWKRAIREYCGELLPQHFKG 61 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + + LRD+ G ++ ++D+ T + Sbjct: 62 ERTRLIVEPLRDIFINTYGLDEATALVKANDLAEGLATLDKDAAKKNKLQTKTLFFTSRS 121 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A +N + KK K + + DIAL GRM S L +GA Sbjct: 122 ELEALAAIAVNNENIKKHAKTFAQSLCT------DAADIALFGRMVASAPELTL---EGA 172 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQL--Q 236 +HA++TH+ D++ID+F+A+DDL +E G+ GT EF++ +YR+ +NL L Sbjct: 173 AMFSHALSTHKADNEIDFFSALDDLLPSEETGAGMTGTLEFNAAAYYRFCALNLDMLADA 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAF 294 ++LG S ++ I V +P A++ + A V+ D P+ + NAF Sbjct: 233 DHLGALSPDERQGIVAAFVEATLKAMPVARKNSMNANTMPAYVLCVLRDSGQPVQLVNAF 292 Query: 295 EKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAA-QFSLSDVDPITAQVKQ 346 EKAV + D G+++ SI+ + + R+ N +GL + L + + V++ Sbjct: 293 EKAVYSPDGRGYVEASIKRMEEEYQRLENTWGLTAVETIRMPLQSLGELLQGVRR 347 >UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG89_9BIFI Length = 387 Score = 293 bits (751), Expect = 5e-78, Method: Composition-based stats. Identities = 108/385 (28%), Positives = 172/385 (44%), Gaps = 34/385 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + PS +NRDD K A GG R R+SSQ+ KRAMR + + Sbjct: 2 FMDIHCIQQVPPSNINRDDTGSPKTAYVGGALRSRVSSQAWKRAMRGVFDDMLDSDKLGK 61 Query: 64 RTIHLAQLRD---VLRQKLGERFDQKIIDKTLAL--LSGKSVDEAEKISADAVTPWVVGE 118 RT + L ++ +++ + LAL + K+ + A VT +++ Sbjct: 62 RTKGVVALIASSITAKRPDLAESAEELGQRVLALEGIGVKASNRAGSDKGTLVTDYLIFI 121 Query: 119 IAWFCEQVAKAEADNLD---------DKKLLKVLKEDIAAIR------VNLQQGVDIALS 163 +++A D K L K K D+A ++ + Q +DIAL Sbjct: 122 ANNEIDKLADWAIAASDKGRDFSKVGKKGLSKAEKTDLAKMKNEVSEIFHGPQAIDIALF 181 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFS 220 GRM + D + +AHA + Q+ + D+FTAVDD G+A L T F+ Sbjct: 182 GRMLANA---PDLNTDASAQVAHAFSIDQITPEYDYFTAVDDCASEDNAGAAMLDTVGFN 238 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPA-DMV 279 S YRYA +N+ L+E L AS A+E A V +P KQ T+A D+V Sbjct: 239 SSTLYRYAAVNIDALKEQLQDAS--AAVEGAVAFVEAFIKSMPSGKQNTFANHTLPEDVV 296 Query: 280 MVNFSDMPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQF--SLS 335 +V P+S A+AFE+ V+ K+G + I+ + + + Y A + S Sbjct: 297 VVLRDSQPISAADAFEEPVRRKEGVSVSRQGIERLGKRLNEIRVNYSEEPVKAWYIASGG 356 Query: 336 DVDPITAQVKQMPTLEQLKSWVRNN 360 +VD + +Q+ +L L+ +R Sbjct: 357 EVDSLKEWSEQV-SLPDLEHGLRET 380 >UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=B6B782_9RHOB Length = 353 Score = 293 bits (751), Expect = 5e-78, Method: Composition-based stats. Identities = 108/330 (32%), Positives = 164/330 (49%), Gaps = 19/330 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ H+L ++ S NRDD K A+ GG R+RISSQSLKRA+R+S Y+AQ++ G Sbjct: 1 MTTFVQFHLLTTYPLSNPNRDDQGRPKQAMIGGSPRLRISSQSLKRALRESSYFAQDLAG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 + RT LA ++ + +G+ + D+T + G + EK S +A T + Sbjct: 61 HTGTRTRRLA--TELKAELIGQGVEDAHADETATKI-GAVFSKTEKGSTNATTLAFISPD 117 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 W +A+ A + L K+ AI VDIA+ GRM + D Sbjct: 118 EW---ALARELAARDVAGEPLPAEKDLKKAILRRADGAVDIAMFGRMLAD---SPDYNRD 171 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQL 235 A+ +AHA TTH+ + DWF+AVDDL+ + G+ H+G F SG++Y YA +N+ L Sbjct: 172 AAVQVAHAFTTHRAQAQDDWFSAVDDLKTREVDAGAGHIGEHGFGSGIYYLYACVNVDLL 231 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ENL G R A + + LAT P KQ ++A A + V P ++ AF Sbjct: 232 VENLAG-DRALAAKGMEALARALATATPKGKQNSHAHHPRAGFIRVERGQQQPRDLSGAF 290 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 K A + + S++A ++ YG Sbjct: 291 HKPTAADE---RASVEALQGMAAKIDRAYG 317 >UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ62_9DELT Length = 341 Score = 292 bits (747), Expect = 1e-77, Method: Composition-based stats. Identities = 100/331 (30%), Positives = 156/331 (47%), Gaps = 20/331 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD K A+FG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDFGSPKTALFGNVQRARVSSQCWKRAVRELMQEEVPALFGG 61 Query: 63 LRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT L L +L ++ G + A G +V + + T + + Sbjct: 62 QRTRLILDPLCRILHEQHGLAE---EEARKKAEELGAAVSKLDTPPVRVKTLFFTSPLE- 117 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A A + KK +K L + L+ DIAL GRM S L +GA Sbjct: 118 -LEALAAAYVATGNAKKAVKELAKHP------LKDAADIALFGRMVASDHSLTL---EGA 167 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQEN 238 +HA++TH+V ++ID+F AVDDLQ E G+ GT EF+S +YR+A +NL L+++ Sbjct: 168 AMFSHALSTHKVSNEIDFFAAVDDLQPEDEAGAGMTGTLEFNSATYYRFAALNLDLLEQH 227 Query: 239 LGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS--DMPLSMANAFEK 296 L S E+ E+ + V VPGA++ + A V+ P+ + NAFEK Sbjct: 228 LSALSAEERREVVCNFVTATLRAVPGARKNSMNAATLPSHVLAVVREKGHPVQLVNAFEK 287 Query: 297 AVKAKDGFLQPSIQAFNQYWDRVANGYGLNG 327 V + G ++ S+ + + + +GL Sbjct: 288 PVWTRGGLMEESVSQLEREYTHLKETWGLEA 318 >UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY0_THASP Length = 394 Score = 292 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 109/394 (27%), Positives = 171/394 (43%), Gaps = 38/394 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI IH L ++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 3 LPRFIQIHTLHTYPAALLNRDDAGLAKRLPYGGAIRTRISSQCLKRHWRVADDAFSLAKL 62 Query: 59 G-ESSLRTIHLAQL-RDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--- 113 G + RT ++A+L R L ++ + + L + +K A+ Sbjct: 63 GVPMATRTRYVAELIRQRLIEQGIDEARAYATAEALLEALFGEKADKKKEGVKALQTGQA 122 Query: 114 --WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN-----LQQGVDIALSGRM 166 + EIA+ + D D L + + + + N L G++ AL GRM Sbjct: 123 VLFGNEEIAYLARRCRDITGDFSDPVALKAEVAKFLKEEKKNIEAMKLGSGLESALFGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSG 222 TS + L D ++S+AHA T H+ + D+FT VDD + GSA + E +SG Sbjct: 183 VTSDL---LANRDASVSVAHAFTVHEAQVENDYFTVVDDFAQAEDGAGSAGIFDTELASG 239 Query: 223 VFYRYANINLAQLQENLGGASREQ-----------ALEIATHVVHMLATEVPGAKQRTYA 271 ++Y Y I++ QL NL G E A ++ H++H++AT PGAK+ + A Sbjct: 240 LYYGYVVIDVPQLVANLEGIKVEDVFTIGADKRGLAGKVVQHLLHLIATVSPGAKRGSTA 299 Query: 272 AFNPADMVMVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGA 328 ++ A V+V D P S+A AF + K + + YG+ A Sbjct: 300 PYDWAKFVLVEAGDWQPRSLAAAFHDPIPLKGDSSIRGRAASKLAKEIAAFDAAYGMPTA 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 SL D + + TL QL W+ Sbjct: 360 RRFLSL---DELAVPAAERATLSQLGEWIAQTVR 390 >UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=Anaeromyxobacter RepID=B4UE70_ANASK Length = 413 Score = 291 bits (744), Expect = 3e-77, Method: Composition-based stats. Identities = 107/415 (25%), Positives = 179/415 (43%), Gaps = 55/415 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M+ F+ IH L S+ S LNRDD K FGG R R+SSQ LKR R G Sbjct: 1 MNRFVQIHTLTSYPASLLNRDDAGFAKRIPFGGVTRTRVSSQCLKRHWRTFEGEGALSGL 60 Query: 60 --ESSLRTIHLAQ---LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA----------- 103 S+R+ + ++ ++ + + +++ ++ + GKS A Sbjct: 61 GQPMSVRSRYTFDELVVQPLVGEGVPAELAREVTRALMSEVLGKSAKAAKADARADEKEE 120 Query: 104 ---------EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAA----- 149 + +T E+A+ E D K+ K + + + A Sbjct: 121 EEDKDAKTESTLQTGQITVLGRPEVAYLLELARTVCRKKPDPAKIAKAVSDHLGADGRKN 180 Query: 150 -IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-- 206 + L G+D A+ GRM TS + L + D A+ +AHA T H ++ D+F+AVDDL Sbjct: 181 LRELRLGAGLDAAMFGRMVTSDI---LARGDAALHVAHAFTVHGEATETDYFSAVDDLPM 237 Query: 207 ----QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 QGS H+G E +SG+FY Y I++ L NL G A R+ A ++A + Sbjct: 238 ARTEDGQGSGHIGNAELTSGLFYGYVVIDVPLLVSNLEGVDRKAWEKADRKLAAQLAERM 297 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAV---KAKDGFLQPSIQ 310 V ++AT PGAK + A A +V+ + P ++ANAF + V + + + + Sbjct: 298 VKLVATVSPGAKLGSTAPHAYAHLVLAESGNAQPRTLANAFLEPVVTGPRQPDPVAAAYR 357 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDPITA--QVKQMPTLEQLKSWVRNNGEA 363 A ++ + YG ++ D + + +L ++ +WV + Sbjct: 358 ALARHSADLDRMYGPAFQRRLAAIGPADGLADVLRAPANASLAEVATWVADQVRG 412 >UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4B Length = 383 Score = 290 bits (742), Expect = 5e-77, Method: Composition-based stats. Identities = 103/380 (27%), Positives = 159/380 (41%), Gaps = 34/380 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 I +H+L S S LNRDD+ K A FGG R RISSQSLKRA R AQ+ + S Sbjct: 2 LIELHLLQSFPVSNLNRDDLGQPKTARFGGHTRARISSQSLKRAART--LLAQHGLDPSE 59 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKIS----------ADAVTP 113 + +LRD L ER +K + + + A + Sbjct: 60 LGVRTKRLRDAAASLLAERGREKEQAVEVCQAGLEEIGFAAHTATGLTKYLLYVGKPAQT 119 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIR------------VNLQQGVDIA 161 + + +AK A+ K+ + AA + ++ + DIA Sbjct: 120 LLADYCDERWDTLAKTVAEAKKRKEKQEKTPRKTAAKKPTKQAQEQAKRILDGTRAADIA 179 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQE 218 L GRM V+ A +AHA++TH V ++ D++TA+DDL E + +GT + Sbjct: 180 LFGRMIADNTDF---NVNAASQVAHALSTHAVVNEFDYYTALDDLRPDAEPAADMIGTVD 236 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F++ FYRYAN++L QL NL + A +H VPG KQ + +A Sbjct: 237 FNAACFYRYANLDLEQLATNLPD-DPDLVARSARAWLHSFIHAVPGGKQNSMSARTMPQT 295 Query: 279 VM-VNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ--FSLS 335 ++ V ++ANAF V + S Q ++ ++ + YG S+ Sbjct: 296 LLGVVRETGAWNLANAFLSPVTDVPDLMAASTQRLVDHFQQLRSFYGDTQLRHTTIASIG 355 Query: 336 DVDPITAQVKQMPTLEQLKS 355 + + PTL+ S Sbjct: 356 SDPAGMPENEIAPTLDDFVS 375 >UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF9_GRABC Length = 386 Score = 289 bits (740), Expect = 1e-76, Method: Composition-based stats. Identities = 110/388 (28%), Positives = 170/388 (43%), Gaps = 39/388 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR----KSGYYAQN 57 F+ IH L S++ S LNRDD + K +G R RISSQ LKR R + Sbjct: 4 PRFLQIHSLHSYTASLLNRDDSGLAKRLPYGSAVRTRISSQCLKRHWRMDEGTFSLHRIE 63 Query: 58 IGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 E ++R+ L + LR+ L D I++ + + P + G Sbjct: 64 GAEEAVRSRDL--VTKRLREPLQGTVDVNILNAIEPAFQAAVYGKKGADDKSSRQPLLFG 121 Query: 118 --EIAWFCEQVAKAEADNLDDK---------KLLKVLKEDIAAIR--VNLQQGVDIALSG 164 E+ + EQ + D K K+ + + A+R V+L G+ AL G Sbjct: 122 APELRYLAEQFTRIATSATDPKSAKAAAEDFTKDKLFQNTMKAMRDSVSLPGGLTSALFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSS 221 RM TS +D + +AHA TTH ++ D+F VDDL ++ G+ H+G+ E +S Sbjct: 182 RMVTSD---PEANIDAPVHVAHAFTTHAEQTESDYFAVVDDLAGVEDTGADHIGSTELTS 238 Query: 222 GVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 G+FY Y I++ L NL G A R+ A E+ ++ +AT PGAK + A + Sbjct: 239 GLFYGYVVIDVPTLVSNLTGVAASNWLAADRKMAAEVTACLIGQIATVSPGAKLGSTAPY 298 Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 A ++V D P S+A AF + ++ + +Q Y Sbjct: 299 GYATTMLVEAGDRQPRSLAEAFRDPAEPT---VKDAEDKLHQKLKAFDEAYQTGEDRRLL 355 Query: 333 SLSDVDPITAQVKQMPTLEQLKSWVRNN 360 SLS+ DP V + +L +L WVR+ Sbjct: 356 SLSN-DPGIKNVSR-TSLPELMQWVRDT 381 >UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ6_SALAI Length = 380 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 108/385 (28%), Positives = 176/385 (45%), Gaps = 31/385 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 + +++IHVL + + LNRDD+ K FG R R+SSQS KRA+R+ ++ G+ Sbjct: 3 ARYVDIHVLQTVPYANLNRDDLGSPKTVRFGYADRTRVSSQSWKRAVRRE--LEESSGDK 60 Query: 62 SLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWV 115 + RT L Q +L +++ + + +K + +A + Sbjct: 61 AKRTRRLPQAIQARLTGPDWDSELAAFAATQVMATLATIAVKADGFKVDKATGEAQVLFY 120 Query: 116 VGEIAWFC---------EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRM 166 + E A+ +++ + L KK L D + + V I L GRM Sbjct: 121 LPERAFDMLADVCVQQRDRLIGLRSGALKLKKGEAPLPADAVRAAMEHRSDV-INLFGRM 179 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSG 222 VDGA+ +AHA TTH D +D+FTAVDDL+ + GS H+ + EFS+G Sbjct: 180 LAE---LPGSNVDGAVQVAHAFTTHGTDPQVDFFTAVDDLKQDADQAGSGHMNSAEFSTG 236 Query: 223 VFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV- 281 FYRYA++NL L NLG A+E+ + T +P AK+ A F ++ + Sbjct: 237 TFYRYASVNLEDLAHNLG--DPATAVELTRVFLSAFITAMPQAKKNATAPFTVPELAYIA 294 Query: 282 NFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDP 339 +D P+S+A+AFE V+A G+ +PS + +Y ++ G G S D Sbjct: 295 VRTDRPVSLASAFETPVRATFDSGYAEPSRRQLAEYAGQIYRLIGDQGMVYHGCASVDDK 354 Query: 340 ITAQV-KQMPTLEQLKSWVRNNGEA 363 Q+ + + + L + + A Sbjct: 355 GLEQLGETRQSFDNLIATAVDKLRA 379 >UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS8_STRKN Length = 393 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 114/380 (30%), Positives = 169/380 (44%), Gaps = 33/380 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 + FI++H++ S + LNRDD N K +G R R+SSQS KRA R+ + + IG++ Sbjct: 5 ARFIDVHIVQSVPFANLNRDDTNSVKTVQYGNTLRTRVSSQSWKRATRE--VFQERIGQA 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT--------- 112 +LRT + + + G A + E K AD Sbjct: 63 ALRTRRIGERVTQELEGRGWPPALAQRAGGHAAAASSIKFELAKDPADNKQFLPNTVLTN 122 Query: 113 ------PWVVGEIAWFCEQVAKAEADNLDDKKL--LKVLKEDIAAIRVNLQQGVDIALSG 164 V E+A EQ + D KK VL +D + + GV I L G Sbjct: 123 AMVYVPEAAVAELADLAEQHRQELESAKDIKKPADKSVLPKDAVEAVLRSRNGV-INLFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-----QEQGSAHLGTQEF 219 RM + VDGA+ +AHA+TTH+ D ++D+F+AVDD+ GS H+G EF Sbjct: 182 RMLAE---VDDAGVDGAVQVAHAMTTHETDVELDYFSAVDDITAAWKDSTGSGHMGHTEF 238 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S+G FYRYA ++L L N+GG R A E+ + +P AK+ + A D+V Sbjct: 239 SAGTFYRYATVDLRDLATNIGGEVR-AARELIAAFLASYIESLPQAKKNSTAPHTIPDLV 297 Query: 280 MV-NFSDMPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ-FSLS 335 + SD PLS A AFEK V+ A GF + S Y G ++ Sbjct: 298 HISVRSDRPLSYAAAFEKPVRAGAPGGFGEVSRAELATYAQAANTLLGTGRIVTSGWASL 357 Query: 336 DVDPITAQVKQMPTLEQLKS 355 + +T + + + L + Sbjct: 358 ETKDLTGLGTRHESFDDLIT 377 >UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET5_RHOM4 Length = 423 Score = 286 bits (732), Expect = 7e-76, Method: Composition-based stats. Identities = 116/420 (27%), Positives = 188/420 (44%), Gaps = 63/420 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 +S F+ IH L ++ + LNRDD K FGG R R+SSQ LK R G Sbjct: 2 VSAFVQIHTLTAYPAALLNRDDAGFAKRLPFGGAIRTRVSSQCLKYHWRNFSGEHALYGL 61 Query: 60 --ESSLRTIHLAQL---RDVLRQKLGERF--------------DQKIIDKTLALLSGKSV 100 SLR+ + R ++ + R D+ + L V Sbjct: 62 DVPRSLRSRETFKRCIARPLVEEGYPLRLVVAFALHLQKLIVSDESLSKTDFKKLMSDEV 121 Query: 101 DEA---EKISADAVTPWVVGEIAWFCEQVAKA----------EADNLDDKKLLKVLKEDI 147 D+A +++ ++ V E+ + ++ + A L D++L +V +E Sbjct: 122 DDATLLDQLKSNQVIILGRPEVDYLTRRIRERLDALREVWADAAAPLSDEQLERVYQELQ 181 Query: 148 AAIRVNLQQ---------GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDID 198 A + L++ G+D AL GRMATS + L + D A+ +AHA TTH +S+ D Sbjct: 182 AIGKGELKKNLKGLYLAAGLDAALFGRMATSDV---LARGDAAIHVAHAFTTHAEESESD 238 Query: 199 WFTAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASR 244 +FTAVD+L E GS HL QE +SG+FY Y +++ L NL G A R Sbjct: 239 YFTAVDELVAQEGEGELGSGHLNNQELTSGLFYGYVVVDVPLLVSNLEGVPPAAWQEADR 298 Query: 245 EQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVKAKD- 302 A E+ ++H++AT PGAK + A A ++V + P ++ANAF + V Sbjct: 299 TLAAEVVRRLLHLIATVSPGAKLGSTAPHAYAQFMLVEWGRSQPRTLANAFHRPVSLDGE 358 Query: 303 GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVK--QMPTLEQLKSWVRNN 360 G L S +A +Y +++ YG ++ + + Q++ + + ++ WV Sbjct: 359 GVLVNSYRALGRYVEQMDRMYGKLTERRLAAIDLPEAVQRQLQVDTLNAVPEIADWVAEK 418 >UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA32_ALHEH Length = 385 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. Identities = 101/363 (27%), Positives = 171/363 (47%), Gaps = 34/363 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R++ ++ S + Sbjct: 2 FLQIHTLTSYHAALLNRDDAGLAKRIPFGSAERMRVSSQCLKRHWRQALKDVISLP-SGI 60 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA---VTPWVVG--E 118 RT H + R+V R+ + E + + + L + E D+ P + G E Sbjct: 61 RTRHFFE-REVCRRVIAEGVEDEKARELTGKLIDAVMHSKEAREKDSLFLKQPVLFGRPE 119 Query: 119 IAWFCEQVAKAEADNLDD----KKLLKVLKEDI-----AAIRVNLQQGVDIALSGRMATS 169 +F + + D K +K K++ AA +L+ G++ AL GR TS Sbjct: 120 ADYFVSLITECARSGEDPGSTLKDRVKAEKKNFRALLQAAGGSDLESGIEGALFGRFVTS 179 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGVFY 225 + L + D ++ +AHA T H +++++D+FT VDDL+E G+AH G E +G+FY Sbjct: 180 DI---LARTDASVHVAHAFTVHSLNNEVDYFTVVDDLKEPGEDAGAAHAGDMELGAGLFY 236 Query: 226 RYANINLAQLQENLGGASRE----------QALEIATHVVHMLATEVPGAKQRTYAAFNP 275 Y +++ L NL G R+ A ++ +VH +AT PGAK A + Sbjct: 237 GYVVVDVPLLVSNLSGCERQAWREQTEACADARDVLAALVHSIATVSPGAKLGATAPYAR 296 Query: 276 ADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 D ++ P ++ANA+ + + A+ +Q S+ Y + + +G + + Sbjct: 297 TDCALLETGTTQPRALANAYLEPLPARGDLMQQSVNTMGHYLKSLDDMFGEETSRFVSAT 356 Query: 335 SDV 337 D Sbjct: 357 RDT 359 >UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y487_9BACT Length = 408 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 109/385 (28%), Positives = 175/385 (45%), Gaps = 54/385 (14%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI I L ++ S LNRDD + K FGG R R+SSQ LKR R + + QN+ Sbjct: 4 LPRFIQISTLTTYPASLLNRDDSGLSKRIPFGGVSRTRVSSQCLKRHWRMADGLWSLQNV 63 Query: 59 GE---SSLRTIHLA--QLRDVLRQKLGERFDQKIIDKTLAL---LSGKSVDEAEK----- 105 + +S+R+ + ++ L +K G +K++ + AL L G EA Sbjct: 64 DKDIATSIRSRRIFPEKIEKPLIEKEGLD-AEKVVAASQALQSELYGAKGTEAAAKNKKT 122 Query: 106 --ISADAVTPWVVGEIAW------------------FCEQVAKAEADNLDDKKLL----K 141 ADA+ P + +++ ++A A+ D K K Sbjct: 123 AKDDADALNPSIDAQLSAERSELVVLGHPEIQFLSKIVREMASADGSAADVGKKTGEWFK 182 Query: 142 VLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFT 201 K+D A++ G+D A+ GR + +V A+ +AHA T H +S+ D+FT Sbjct: 183 KHKKDFQALKCGA--GLDAAMFGRFISGDT---DARVSAAVHVAHAFTVHAEESETDYFT 237 Query: 202 AVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATH 253 AVDDL GSAH+ E +SG+FY Y +++ QL N+ G A R+ A + H Sbjct: 238 AVDDLNNSGSAHINAAELTSGIFYNYVVVDVPQLVSNIEGCPSKQWQTAQRDVAGRLVKH 297 Query: 254 VVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAF 312 ++H++AT PGAK + A + VM + P ++A+AF V + +++ Sbjct: 298 LLHLIATVTPGAKLGSTAPYARPWFVMAEAGESQPHTLADAFYLPVPLRGDMRAQALRQL 357 Query: 313 NQYWDRVANGYGLNGAAAQFSLSDV 337 Y + YG + S+ DV Sbjct: 358 EDYVGKSDEMYGSDERRWIASMYDV 382 >UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP1_SYMTH Length = 379 Score = 284 bits (727), Expect = 3e-75, Method: Composition-based stats. Identities = 97/344 (28%), Positives = 167/344 (48%), Gaps = 27/344 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ +H+L + + S LNRDD K +FGG RR RISSQ LKRA+R + Sbjct: 2 FVEMHLLQNFALSNLNRDDTGAPKSCVFGGTRRARISSQCLKRAVRTYVREQALVP---- 57 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLA-LLSGKSVDEAEKISADAVTPWVV----GE 118 + L+ L+++L R ++ A ++ ++++ E + T +++ E Sbjct: 58 -SELLSYRTKWLQRELANRLAAGGVEAEQAGQVAARALELLEFRLKNGRTEYLLMVGERE 116 Query: 119 IAWFCEQVAK--AEADNLDDKKLLKVLKEDIAAI---RVNLQQGVDIALSGRMATSGMMT 173 IA + + A D + K +++A + ++ VDIAL GRM + Sbjct: 117 IARIADLCREHAAALQGGDGGRKSKKEGDNLAGLFLKALDGGDAVDIALFGRMIATH--- 173 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAV------DDLQEQGSAHLGTQEFSSGVFYRY 227 VD A+ +AHA +T+ + ++ D+++AV DD + G+ LGT ++S +YRY Sbjct: 174 PEKNVDAAVQMAHAFSTNAIANEFDFYSAVDDLQQQDDDEGAGAGMLGTVLYNSSCYYRY 233 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 AN++L QL NLGG ++AL + + VP K+ A NP ++M + Sbjct: 234 ANVDLRQLLTNLGG-DPDRALTAVRAFLLGMVHAVPTGKRTNSAPQNPPALIMAVVREHG 292 Query: 288 -LSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGAA 329 S+ANAF V A+ ++ S + +W++++ YG G Sbjct: 293 LWSLANAFVVPVSGARGNLMELSAKEMLAHWNQLSELYGQEGVH 336 >UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=Acetobacteraceae RepID=A5FTJ7_ACICJ Length = 370 Score = 280 bits (717), Expect = 5e-74, Method: Composition-based stats. Identities = 98/354 (27%), Positives = 153/354 (43%), Gaps = 26/354 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ +H+L PS +NRDD K A+ GG R+R+SSQ+LKRA R S +++ + G Sbjct: 1 MTQFLQVHLLTFFPPSNMNRDDTGRPKTAMVGGAMRLRLSSQALKRAWRTSTIFSEALKG 60 Query: 60 ESSLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 RT L ++ L+ + + + +A GK ++ + E Sbjct: 61 YMGERTQRLGEEILKTLQAEGVSEVQALAVARAVAGQFGKLNEDETPARIQQLAFISPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAA-------------IRVNLQQGVDIALSGR 165 + + A L + K + + DIAL GR Sbjct: 121 RKAAFDLARRYAAGELPLPEKAKGKRGKANKTEGEEEVEAPEILLLRESDTAADIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M + A +AHAITTH++ D D++TAVDDL ++ G+ +G F S Sbjct: 181 MLADKPAF---NREAAAQVAHAITTHRISVDDDYYTAVDDLKRPSEDAGAGFIGETGFGS 237 Query: 222 GVFYRYANINLAQLQENLGGAS--REQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 GVFY Y +IN+ L NLGG R+ A +V AT P KQ ++AA A + Sbjct: 238 GVFYTYMSINIDLLIRNLGGGDQARDLAATAIAALVEAAATTAPSGKQNSFAAHGRAGYI 297 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 + P ++A AF K V+ + SI ++ + + YG A + Sbjct: 298 LAERGKAQPRTLAGAFAKPVEG-GDIMDASIGRLEEFREAIDKAYGPTADATKV 350 >UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RY18_RHORT Length = 359 Score = 280 bits (715), Expect = 7e-74, Method: Composition-based stats. Identities = 112/364 (30%), Positives = 171/364 (46%), Gaps = 36/364 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ +HVL +++ S LNRDD K FGG R+R+SSQSLKRA R+S + + G Sbjct: 1 MSRFLQLHVLTAYAASNLNRDDTGRPKTLNFGGAERLRVSSQSLKRAFRQSELFQSRLPG 60 Query: 60 ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT-----P 113 E R+ A+ L L + E + + I + AL+ + + +K A P Sbjct: 61 ELGTRSQDFAKALVSALVARGVE--EAEAITRAEALIDHDKLGKVKKGKAQTEQLVHLGP 118 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 + I E++A + LDDK +L + + + VDIA+ GRM Sbjct: 119 DELAAIDALAERLATSA--TLDDKAML---------VLKSKPRAVDIAMFGRMLAGNPGF 167 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ------GSAHLGTQEFSSGVFYRY 227 V+ A+ +AHA TTH+ + D++T VDD++ G+ LG E+ SG+FY Y Sbjct: 168 ---NVEAAVQVAHAFTTHRATPEDDYYTTVDDIKNADQEEDRGAGFLGILEYGSGLFYLY 224 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DM 286 IN L +NL G + A E A ++ T P KQ T+A+ ++ + Sbjct: 225 ICINADLLVDNLAG-DQALAAEAAALLIEAACTISPTGKQNTFASRARGLYALLEIGEET 283 Query: 287 PLSMANAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ 343 P S+A AF+ AV ++ L SIQ + + YG N +L DP T Sbjct: 284 PRSLAAAFQYAVGSRATEADHLAASIQRLTALREGFSKAYGEN--LRSVALDVTDPATPG 341 Query: 344 VKQM 347 +K + Sbjct: 342 LKAL 345 >UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI0_9BIFI Length = 381 Score = 276 bits (705), Expect = 1e-72, Method: Composition-based stats. Identities = 92/375 (24%), Positives = 158/375 (42%), Gaps = 25/375 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ + I+ + + PS +NRDD K AI+GG R R+SSQ+ KRAMR++ + + Sbjct: 1 MTTIVEIYAIQNVPPSNINRDDTGNPKTAIYGGVLRARVSSQAWKRAMREAFPEMLDADQ 60 Query: 61 SSLRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV------TP 113 +RT + LAQ+ + K + D + + K + + EK T Sbjct: 61 LGIRTKNALAQIEQSIVAKRPD-IDVETVHKAATAALTATGAKVEKSKRKGSMEGADLTQ 119 Query: 114 WVVG----EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATS 169 +++ EI + + D K K +K ++A+ + Q VDIAL GRM Sbjct: 120 YLIFIANREIDKLADLAIAWIDADEDLDKPSKEMKGQVSAV-FHGPQAVDIALFGRMLAD 178 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFT---AVDDLQEQGSAHLGTQEFSSGVFYR 226 D + +AHAI+ +V + D+FT G+A L T F+S YR Sbjct: 179 A---PELNTDASAQVAHAISVDEVTPEYDYFTAIDDDAADDNAGAAMLDTVGFNSSTLYR 235 Query: 227 YANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD- 285 YA + + L E L A ++ V+ +P KQ T+A +V + Sbjct: 236 YATVAVDSLYEQLQSAD--MTVKAVDAFVNAFLRSMPTGKQNTFANRTLPTAALVVVRNS 293 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITA 342 P++ AFE+ V A+ + + + + + + YG AA ++ + Sbjct: 294 QPINPVEAFERPVHAERDKSISRVAAERLGRKLQDIQDTYGETPIAAWNIVAGQPVELLD 353 Query: 343 QVKQMPTLEQLKSWV 357 + + TL + + Sbjct: 354 SLSEHVTLPVMVESL 368 >UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP61_9PROT Length = 400 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. Identities = 99/396 (25%), Positives = 173/396 (43%), Gaps = 46/396 (11%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR-KSGYYAQNIGE 60 FI IH L ++ + LNRDD + K G R RISSQ LKR R +A + + Sbjct: 4 PRFIQIHTLHTYPAALLNRDDAGLAKRLPLGNAVRTRISSQCLKRHWRVVEDRFALSCLD 63 Query: 61 --SSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT 112 ++R+ +L + + + + + + D L GK + + Sbjct: 64 VPMAIRSRGTLELISKRIQESGVSETMAQAAAEAMRDAGLLDKGGKEKKGDDALKTGQAV 123 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVL---------KEDIAAIRVNLQQGVDIALS 163 EI + + +D +++K +++ K +I A++ G++ AL Sbjct: 124 LLGKPEIDYLVRRCVDLASDGVEEKGFKELITLWLKGKDEKRNIEALKHGS--GLESALF 181 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEF 219 GRM TS ++T + A+ +AHA T HQ + D+FT VDDL E GSA + E Sbjct: 182 GRMVTSDVLTSR---EAAVYVAHAFTVHQAQVENDYFTVVDDLLQDAGELGSAGIFDTEL 238 Query: 220 SSGVFYRYANINLAQLQENLGGAS-------------REQALEIATHVVHMLATEVPGAK 266 +SG++Y Y +++ QL +NL G R A ++ H++H++AT PGAK Sbjct: 239 ASGLYYGYVVVDVPQLVQNLEGEDFNECFASGTPADRRVLAGQVVQHLLHLIATVSPGAK 298 Query: 267 QRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVK---AKDGFLQPSIQAFNQYWDRVANG 322 + + A F+ A ++V D P S+A AF A+ + + ++ + + + Sbjct: 299 RGSTAPFDWAKFMLVEAGDWQPRSLAGAFHDALPLSGSGGTIRERTVDRLTKEIAAMDDA 358 Query: 323 YGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVR 358 YG + ++ + + L L W + Sbjct: 359 YGAPLSRRFLAIDQ--EVQVPGAERLNLASLADWAK 392 >UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV95_9BACT Length = 393 Score = 274 bits (700), Expect = 4e-72, Method: Composition-based stats. Identities = 117/383 (30%), Positives = 186/383 (48%), Gaps = 49/383 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES-SL 63 H+L S +CLNRDD+ K A+ GG +R R+SSQS KRA+R + + ++G + + Sbjct: 13 FEFHILQSFPVTCLNRDDVGSPKTAMIGGSQRARVSSQSWKRAVRLAMH---DLGVTHGV 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKII--DKTLALL--------------SGKSVDEAEKIS 107 RT ++ L + LG +Q DK A+ G S + E++ Sbjct: 70 RTKLISPLIAEACRSLGATPEQARACGDKVEAVFIKKDEKGKKKSAKTKGDSDTQDEEVG 129 Query: 108 ADAVTP-------WVVGEIAWFCEQVAKAEAD------NLDDKKLLKVLKEDIAAIRVNL 154 +D+ + EI+ + K E D D KK K + + I + Sbjct: 130 SDSSSEKTDTLLFLSPKEISVLANEFKKQEFDPGKVIVQSDPKKQAKEIADMIGKVP-EG 188 Query: 155 QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAH 213 VDIAL GRM V+ A S AHAI+TH+V +++++FTA+DD + G+AH Sbjct: 189 IDAVDIALFGRMVAQAAEL---NVEAAASFAHAISTHKVANEVEFFTALDDCAVDPGAAH 245 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G+ EF+S +YRY +++L QL + L G + +E V L VP A+Q T + Sbjct: 246 MGSLEFNSATYYRYVSLDLGQLSQTLAGQHIPETIE---AFVKALFVSVPAARQSTQSGA 302 Query: 274 NPADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVANGYG-LNGAAAQ 331 +P D + + FE A+K+KD GFL+PSI+ Y +R +G L G A+ Sbjct: 303 SPWDFAKILVR-TGHRIQIPFETAIKSKDGGFLKPSIEEMKAYLNRQEKLHGSLFGKKAE 361 Query: 332 FSLSD-----VDPITAQVKQMPT 349 ++ + +D + + +KQ T Sbjct: 362 YTYGEDENFTIDDLISALKQQAT 384 >UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R6_9BACT Length = 400 Score = 271 bits (693), Expect = 2e-71, Method: Composition-based stats. Identities = 100/396 (25%), Positives = 180/396 (45%), Gaps = 45/396 (11%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNIG 59 F+ I L ++S S LNRDD + K FG R RISSQ LKR R +G Y G Sbjct: 5 PRFVQISTLTTYSASLLNRDDSGLAKRIPFGDSVRTRISSQCLKRHWRNAGGPYGLDKAG 64 Query: 60 ES---SLRTI-HLAQLRD--VLRQKLGERFDQKIIDKTLALLSGKSV------------- 100 ++ S+R+ +L + ++ + L ++ K LL Sbjct: 65 DALSLSVRSRFSFPELIEKPLVAEGLEQKLVVSGSQKLQQLLYNGEEKGDTKKDKKKKIE 124 Query: 101 --DEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ-- 156 ++ + + E+ + + + A + + + K++ +K+ + NL Sbjct: 125 LDEDGYSAKRNELVVLGRPELEYLKQIIRDAISSSSNIKEIDNAVKDFYTKRKSNLLALR 184 Query: 157 ---GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAH 213 GVD A+ GR + + KV A+ +AH+ T H S+ D+FTAVDDL EQG+ H Sbjct: 185 AGCGVDAAMFGRFVSGDV---DAKVTAAVHVAHSFTIHGEQSETDYFTAVDDLVEQGTGH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGA 265 + E ++G++Y Y +++ QL NL G A R A ++ ++++H++AT PGA Sbjct: 242 INAAELNTGIYYGYVVVDVPQLISNLCGCDSKNSADADRTLAAQVTSNLIHLMATVTPGA 301 Query: 266 KQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANG 322 K A + + +V+ +SD P ++A+AF + +K + ++Q +Y + Sbjct: 302 KLSGTAPYAASWLVLAEWSDSQPRTLADAFFEGLKLGSDGSARSLAVQMLAEYIRKYDAM 361 Query: 323 YGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVR 358 Y + ++P + +L++L V+ Sbjct: 362 Y---TPQLTRRCASIEPCQIPGAENGSLDELCEAVK 394 >UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD5_SACVD Length = 368 Score = 269 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 97/367 (26%), Positives = 162/367 (44%), Gaps = 24/367 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L + S +NRD++ K +GG R+R+SSQ+ KRA+RK+ Q++ + + Sbjct: 2 FVDIHALHTLPYSNVNRDNLGAPKSCWYGGTERIRVSSQAWKRAIRKA--VEQDLEQPTE 59 Query: 64 RTIHLAQL-RDVLRQKLGERFDQKIIDKTLALLSG-KSVDEAEKISADAVTPWVVGEIAW 121 RT +A L +L ++ D + + + G + + + TP +A Sbjct: 60 RTRRIASLVAGILTERGWGAEDARRAGRAVIYAYGLEPAADDDDTDTLLWTPPAAEALAG 119 Query: 122 FCEQVAKAE------------ADNLDDKKLLKVLKEDIAAIRVNLQQGVD-IALSGRMAT 168 E+ A N K + +K ++ L + IAL GRM Sbjct: 120 VVEKHRDTVVTLPLPKGEGKKAKNPPAKDITDAVKPMAGEVKSILNRTTPTIALLGRMLA 179 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD-LQEQGSAHLGTQEFSSGVFYRY 227 + G IAHA T H+ + D+FTAVDD G+ H+ T +F++G FYRY Sbjct: 180 D---RPDHTIYGLAEIAHAFTVHEAAPEFDYFTAVDDRAANTGAGHVNTAQFTTGTFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 ++IN+ +L + +G + A + T P KQ AA AD+ + + P Sbjct: 237 SSINITRLVDVVG---EQDARAVLLAWARRFITVTPAGKQTATAARTAADLAHIVVRNAP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 S A AFE + + G+L P+ +A Y R+A G ++ + + + Sbjct: 294 QSYAPAFETPIVSTGGYLDPAARALGDYATRLAAYLGDTPVEHGYATTLPTNVDGLGGRF 353 Query: 348 PTLEQLK 354 TL+ L Sbjct: 354 DTLDTLI 360 >UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=Bacteria RepID=B4S8P9_PROA2 Length = 347 Score = 269 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 115/352 (32%), Positives = 182/352 (51%), Gaps = 34/352 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 I H+L S +CLNRDD+ K AI GG R R+SSQ KR +R S Q+ G + + Sbjct: 12 IEYHILQSFPVTCLNRDDVGAPKTAIVGGSTRARVSSQCWKRQVRLS---MQDFGIKLGI 68 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R+ +++ + + QK + A GK + ++ S D + + E F Sbjct: 69 RSKKVSEF-------VAKACLQKGASEEQAAECGKVISDS--FSKDTLFFFSESEAQAFA 119 Query: 124 EQVAKA--EADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 + + ++ NL+DK++ KV K+ + G+DIAL GRM ++ A Sbjct: 120 DYAREKNFDSKNLNDKEIRKVAKKALNPAI----DGLDIALFGRMVAQAT---DLNIEAA 172 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL-QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 S +HAI+TH+V +++++FTA+DDL +E GSAH+G+ EF+S +YRY +++L QL E++G Sbjct: 173 ASFSHAISTHKVSNEVEFFTALDDLAEEPGSAHMGSLEFNSATYYRYISLDLGQLWESIG 232 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 G +A+E T L VP A+Q T + +P + + + FE AVKA Sbjct: 233 GEHLAEAVESLT---KALFVAVPSARQTTQSGASPWEFAKIFIR-KGQRLQVPFETAVKA 288 Query: 301 KD-GFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSD-----VDPITAQVK 345 KD G+LQPSI A Y + G L G +F+ + +D + +K Sbjct: 289 KDGGYLQPSITALTDYLTKKEALAGSLFGKEKEFTFGEDVNFSIDDLIKGLK 340 >UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J368_DEIGD Length = 385 Score = 266 bits (680), Expect = 9e-70, Method: Composition-based stats. Identities = 119/379 (31%), Positives = 169/379 (44%), Gaps = 30/379 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK--SGYYAQNI 58 M + +H L + +PS LNRDD KDA FGG RR+RISSQ+ KRAMR+ G Sbjct: 1 MKALLELHYLQNFAPSNLNRDDTGSPKDAFFGGTRRLRISSQAFKRAMRQDFGGRELLRP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEK-------ISADAV 111 E +RT + L G +Q LAL + K + Sbjct: 61 EEIGVRTKRAHEAIAELLAGEGRTEEQCRAAAELALGGLGLPVKDGKNQYLLFLGRDELR 120 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIA-------AIRVNLQQGVDIALSG 164 + W Q A E ++ D KK K ++ A ++ + VD+AL G Sbjct: 121 RVADIIGANWAEFQAAAPEPESTDGKKKKASKKAALSGDLGKQLAGALDGSKAVDVALFG 180 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQV-DSDIDWFTAVDDLQE---QGSAHLGTQEFS 220 RM D A +AHAI+TH + + D++TAVDDL+ G+ LGT EF+ Sbjct: 181 RMLAD---LPDKNADAAAQVAHAISTHALRERQYDFYTAVDDLKPDDNAGADMLGTVEFA 237 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV- 279 S YRYA I+L +L ENL G RE ++ P KQ T+AA N ++ Sbjct: 238 SATVYRYACIDLGKLLENLQG-DRELLERGLRAFLYASVYAAPTGKQNTFAAHNLPGLMV 296 Query: 280 -MVNFSDMPLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSD 336 +V + P ++ANAFEK V+A+ G+L PS+ A +G G A + Sbjct: 297 QVVRRNASPRNLANAFEKGVRAEGGQGYLAPSVAALADEMRWQNGVFGDAGTARFVAREG 356 Query: 337 VDPITAQVKQMPTLEQLKS 355 D + + MP + L Sbjct: 357 GDAVFG--EAMPNVAALID 373 >UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=Bacteria RepID=B8FDH9_DESAA Length = 383 Score = 264 bits (675), Expect = 3e-69, Method: Composition-based stats. Identities = 111/383 (28%), Positives = 176/383 (45%), Gaps = 46/383 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 + H+L S +CLNRDD+ K A+ GG R R+SSQ KR +R + +++G Sbjct: 11 VEFHILQSFPVTCLNRDDVGAPKTAVVGGATRARVSSQCWKRNIRLT---MKDLGVPIGS 67 Query: 64 RTIHLAQLRDVLRQKLGERFDQK-----------IIDKTLALLSGKSVDEAEKISADAVT 112 RT + Q+ + +LG DQ I +K G + +DA+ Sbjct: 68 RTKLIHQMIEDACAELGADTDQAQACAAQVASVFIKEKKGKKDDGDDSEGNGSDKSDALI 127 Query: 113 PWVVGEIAWFCEQVAKA------EADNLDDKKLLKVLKEDIAAIRVNL-------QQGVD 159 E+ + + + + ++ K KV K + NL + GVD Sbjct: 128 FLSREEVKKIALALRENNFSTEFQEEKVNKKGDAKVEKIKLEKKIQNLLGKPDFSRDGVD 187 Query: 160 IALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAHLGTQE 218 IAL GRM V+GA S +HAI+TH+V +++++FTA+DDLQ E GSAH+G E Sbjct: 188 IALFGRMVAQAAAL---NVEGAASFSHAISTHKVTNEVEFFTALDDLQTEPGSAHMGALE 244 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F+S +YRY +++ QL +NL G QALE V L +P A+Q T + + Sbjct: 245 FNSATYYRYVCLDMGQLWKNLAGQHLPQALEG---FVKALYLALPSARQATQSGACWWEF 301 Query: 279 VMVNFSDMPLSMANAFEKAVKAK-DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSD 336 V + F+ AVK + G L+PS A Y ++ G L A+F+ + Sbjct: 302 AKVFVR-KGQRLQAPFDTAVKPRNGGLLEPSKDALCAYLEKKEQQAGSLFRKIAEFTFGE 360 Query: 337 VDPITAQVKQMPTLEQLKSWVRN 359 + P+++ L +++ Sbjct: 361 DNG--------PSIDDLVLSIQD 375 >UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X7_9DELT Length = 385 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 103/386 (26%), Positives = 167/386 (43%), Gaps = 43/386 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 MS FI +H+L S+ + LNRDD+ K FG R+R+SSQSLKRA R S + +G Sbjct: 1 MSRFIQLHILTSYPAANLNRDDLGAPKSMRFGEANRLRVSSQSLKRAWRTSDVFKATLGA 60 Query: 60 -ESSLRTIHLAQLR-----------------------DVLRQKLGERFDQKIID-----K 90 +RT L + L++K + I K Sbjct: 61 DHLGVRTKELGRKVFCALTQGASLDAVWDAPDATGTLAALKEKTAAEIARTIAGVFGKIK 120 Query: 91 TLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDD-KKLLKVLKEDIAA 149 A + + K + + + ++A ++ +A A + + K + Sbjct: 121 KEADAKAEKDADPVKKRKELLDSLEIEQLAHVSQEERRAVAALTEACRDAGKAPDANALN 180 Query: 150 IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-- 207 + + + DIA+ GRM + V+ A+ +AHA+T H+ ++ D+FTAVDDL Sbjct: 181 LLRSDAKAADIAMFGRMLAASARF---NVEAAVQVAHAVTVHRAVAEDDFFTAVDDLNRD 237 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQ 267 + G+ H+G EF +GV+Y Y I+ A L ENLGG + T + T P KQ Sbjct: 238 DAGAGHMGVSEFGAGVYYLYLCIDRALLAENLGG-DEALVQKALTALTTAACTVAPTGKQ 296 Query: 268 RTYAAFNPADMVMVNFS-DMPLSMANAFEKAV----KAKDGFLQP-SIQAFNQYWDRVAN 321 +YA+ A + D P +++ AF K V + +DG L +I + ++ Sbjct: 297 ASYASRAYACFALAEKGDDTPRNLSLAFLKPVGEREEERDGHLGKTAIAELLKTKAKMDK 356 Query: 322 GYGLNGAAAQFSLSDVDPITAQVKQM 347 YG A F++ D A++ Sbjct: 357 VYGQTLADTSFNVFDGKGTLAELAAF 382 >UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK1_GEOUR Length = 408 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 108/401 (26%), Positives = 175/401 (43%), Gaps = 55/401 (13%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE---- 60 + +H++ S +CLNRDD+N K A+FGG +R R+SSQS KRA+R+ + Sbjct: 4 LELHIIQSVPVACLNRDDLNSPKTAVFGGVQRARVSSQSWKRAIREMAKEIAAEEKSDLF 63 Query: 61 SSLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSG--------------KSVDEAEK 105 S RT + L L +K I + +A + K+V K Sbjct: 64 SGDRTRRMVYTLSTRLAEKGITSQAAIAIAEQVADVVETLDSKVDSEGYKKIKTVMFFSK 123 Query: 106 ISADAVTPWVVG--EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN---------- 153 DA+ + E+ E + KA + +D++ K LK + + Sbjct: 124 AEYDAIAEAIATSDEVKNSVEALEKAAVEG-NDREREKALKAMVKILEKGAISKTIKSAQ 182 Query: 154 LQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--EQGS 211 L+ DIAL GRM + KVDGA AH ++TH+ D++ID+F AVDDL E G+ Sbjct: 183 LKDAADIALFGRMVAND---PSLKVDGASMFAHILSTHKADNEIDFFAAVDDLNKDESGA 239 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQ--ENLGG---------ASREQALEIATHVVHMLAT 260 T EF+S +YR+A +NL L ++LG S E ++ + + Sbjct: 240 GMTSTLEFNSATYYRFAALNLDALANDDHLGDITLKDGTVVRSVETRKQVVKTFLKAIIQ 299 Query: 261 EVPGAKQRTYAAFNPADMVM--VNFSDMPLSMANAFEKAV-KAKDGFLQPSIQAFNQYWD 317 +P A++ T V+ V P+ + NAFE V +++ GF+ SI N + Sbjct: 300 SIPSARKTTMNGNTLPVYVLGVVREKGHPIQLINAFETPVRRSEKGFVTESINRMNIEYA 359 Query: 318 RVANGYGLNGAAAQF----SLSDVDPITAQVKQMPTLEQLK 354 + +G++ A+ SL + + + + L Sbjct: 360 DLKETWGVDSLFAKAVVKGSLKEQIKANQGSIETCSQDDLI 400 >UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=Actinomycetales RepID=D1A6Q4_THECD Length = 399 Score = 261 bits (666), Expect = 4e-68, Method: Composition-based stats. Identities = 120/384 (31%), Positives = 180/384 (46%), Gaps = 42/384 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ES 61 FI H++ + + LNRDD N K +GGK R R+SSQ KRAMR Y ++G E+ Sbjct: 7 RFIEAHIIQAIPFANLNRDDTNAVKTVTWGGKERTRVSSQCWKRAMRL--YLQTSLGQEA 64 Query: 62 SLRTIHLAQ-LRDVLRQK------LGERFDQKII-----------DKTLALLSGKSVDEA 103 +LRT L + L L + L ER + I+ KT +G + + Sbjct: 65 ALRTRRLPEYLARHLEEHHGWPADLAERAGRHIVVASSVGGEAPKKKTDGEETGGTGEHW 124 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKED--IAAIRVN---LQQGV 158 + + V E+A Q +A + + K K ++D I +V+ ++ Sbjct: 125 STAAMVYIPSSAVPELAELAIQYREALENAKEPKDPAKFGRKDSVIPTGKVDEILRRRNG 184 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE-----QGSAH 213 I L GRM + +VDGA+ +AHA TTH ++ID+F+AVDD+ + GSAH Sbjct: 185 VINLFGRMLAQ---VDDAEVDGAVQVAHAFTTHATTTEIDYFSAVDDVTDIWGDTTGSAH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G E S+GV YRY ++L L NLGG E E+A ++ +P AK+ + A Sbjct: 242 MGQAEHSAGVLYRYIVLDLNDLHANLGG-DLEATRELAAGLLKAALLSLPRAKKNSTAPH 300 Query: 274 NPADMVMVNFS-DMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAA- 329 + + D P+S A AFEK V A G +PS+ A N+Y V G +G Sbjct: 301 TIPHLAHLTVRTDRPVSYAGAFEKPVPADRHGGHSEPSVAALNEYAAAVQKLLGTSGCRY 360 Query: 330 ---AQFSLSDVDPITAQVKQMPTL 350 A S +D + +V+ L Sbjct: 361 AAHATLSQEKIDALGERVESFDKL 384 >UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Actinomycetales RepID=C2GEY7_9CORY Length = 356 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. Identities = 92/360 (25%), Positives = 148/360 (41%), Gaps = 38/360 (10%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSN + +H L S S LNRDD + K + GG R SSQS+KR R Y + Sbjct: 1 MSNQLTLHFLCSIPYSNLNRDDTGVPKRVMQGGALRALHSSQSIKRGSRV--LYENASQD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGK-SVDEAEKISADAV-TPWVVGE 118 S+R+ L + ++ D+K K A L G + EA+ DA + W+ E Sbjct: 59 LSIRSGRLDEEVAEKAMEMNPDLDEKTALKQAAKLIGNLTKGEAKSGEGDAKRSTWLSSE 118 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 A A++ D ++ I N + IA GRM + Sbjct: 119 EILTA---ATYVANSTDPREKF---------IDGNTTGSLAIAAFGRMFANAT---DLNT 163 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQ 234 + A++++ AITTHQ + D+F+ DD+ + + +L ++SG FYR I+ Q Sbjct: 164 EAAVAVSPAITTHQATIETDYFSTADDINLRDHKANATYLDVSLYTSGTFYRTVTIDRNQ 223 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAF 294 L+ + G E V L P K+ + A F +++ + +A F Sbjct: 224 LRTSWSGFESNSVRENLEAFVRSLVYGQPRGKKNSTAPFTMPSLILAE--EQQYRVAYDF 281 Query: 295 EKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLE 351 E+ V+A GF++ SI+ + + A F + P+ A P L+ Sbjct: 282 ERPVEADKDGGGFMKSSIEKLAKQYT----------LARSFDPGNFGPVEALSGTYPDLD 331 >UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD1_METCA Length = 414 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 103/391 (26%), Positives = 171/391 (43%), Gaps = 71/391 (18%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R+S + + L Sbjct: 2 FLQIHSLTSYHATLLNRDDAGLAKRIPFGDAVRLRVSSQCLKRHWRESLKQTIPLP-TGL 60 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLAL----------------------------- 94 RT H+ + +++ R Q+ ++ +LA Sbjct: 61 RTRHVFE------REIYPRLKQEGVEDSLAKQLTLSLMGLLLQKSDKTAKPEKAKKGKNG 114 Query: 95 ---------LSGKSVDEAEKISADAVTPWVVG--EIAWFCEQV-AKAEADNLDDKKLLKV 142 G +E+ P + G E+ + + A AE + +K L Sbjct: 115 HEEQAEFDFEEGAGTEESSAGDLRVKQPILFGRPEVDYLISLLKACAEEGSGAEKALQAK 174 Query: 143 LKED--------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVD 194 LK D AA +L G++ AL GR TS + L + D A+ +AH+ T H +D Sbjct: 175 LKGDKANFKAMLKAAGHGDLYAGLEGALFGRFVTSDV---LSRSDAAVHVAHSFTVHGLD 231 Query: 195 SDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGAS-------- 243 +++D+FT VDDL +E G+AH G E +G+FY Y +++ L NL G Sbjct: 232 TEVDYFTVVDDLNREEETGAAHAGDMELGAGLFYGYVAVDIPLLVSNLTGCDTTRWAEQE 291 Query: 244 REQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKD 302 ++ T ++ +AT PGAK A + ++ V++ P +++NA+ +A+ + Sbjct: 292 PADVRKVLTGLIRAIATVSPGAKLGATAPYAFSEFVLLETGKQQPRALSNAYLQALPMRG 351 Query: 303 GFLQPSIQAFNQYWDRVANGYGLNGAAAQFS 333 LQ +I A +Y + YG + + Sbjct: 352 DPLQAAIDALAKYLRALDAMYGRTSDSRSVA 382 >UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM4_RHOCS Length = 435 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 98/427 (22%), Positives = 160/427 (37%), Gaps = 77/427 (18%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 I HVL + P +NRD+ K GG R RISSQ+ KRA+R + ++ + + R Sbjct: 15 IQFHVLTAFPPHNVNRDEDGRPKTCQLGGVTRGRISSQAKKRALRLAPHFPTA--QRATR 72 Query: 65 TIH--------------------LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE 104 T A L G + + + +A K ++A Sbjct: 73 TRKAGIHTFLKLTAAGIDTTSAVWAALAVNHATGGGGKPPKAEDAQAIAAPDPKKQEDAY 132 Query: 105 KISADAVT---------------PWVVG-------------EIAWFCEQVAKAEADNLDD 136 K AVT W+ G E A E +A A D Sbjct: 133 KKKEKAVTDMMEKRGLDRAAAEQEWLTGQVGTEQGLVISTREFARIEEGIAHLTAAWAAD 192 Query: 137 KKLLKVLKED------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITT 190 + + E ++ +D AL GRM + V+ A ++ HA TT Sbjct: 193 RDGFPAVLEGWVRQVCKESLLTKADHDLDTALFGRMVAANANF---NVEAACAVGHAFTT 249 Query: 191 HQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG-GASREQ 246 H+ + D+F+A ++L+ G+ G F GV+Y++A ++ L+ L G S E+ Sbjct: 250 HRFALEGDYFSAGEELKVLGGTGAVITGYAFFGGGVYYQHAVLDRGHLRTTLSRGRSAEE 309 Query: 247 ALEIATHVVHMLAT----EVPGAKQRTYAAFNPADMVMVNFSDMP-LSMANAFEKAVKAK 301 A + V T P K ++A+ A V+ P L++ AF VKA Sbjct: 310 AERLTVQAVDTFLTGLLFSQPRGKCNSHASDVAASYVLATRGGDPALNLGLAFLDPVKAT 369 Query: 302 D---GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPI----TAQVKQMPTLEQLK 354 + + SI+ + + YGL A L+ P + ++ T+E + Sbjct: 370 EDVTDLMCASIRRLTDFHRALTAAYGLGNAVC--VLNAYPPARGNDAPRAPEVWTVEDFR 427 Query: 355 SWVRNNG 361 +V+ G Sbjct: 428 RFVQGRG 434 >UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I6_9LACO Length = 311 Score = 229 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 80/303 (26%), Positives = 144/303 (47%), Gaps = 26/303 (8%) Query: 61 SSLRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKIS-ADAVTPWVVGE 118 + +RT+ L + L+++ + + + + + + + +K + A+ G+ Sbjct: 14 AGIRTMRGPLLLANELQKQDSNLSSDEAMAQAVDVFNKAKIKLDKKTNQTKALLMLSHGQ 73 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 IA E V + D LD K + + L+ D +D+AL GRM V Sbjct: 74 IAKLAEYVRQN--DELDSKAVKEALQGD---------HSLDMALFGRMVADD---PSLNV 119 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQL 235 D A +AHAI+TH++ + D++TAVDD + E GSA +GT E+ S YRYAN+N+ +L Sbjct: 120 DAACQVAHAISTHEIVPEYDYYTAVDDEKADDESGSAMIGTIEYDSATLYRYANVNVNEL 179 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ++LG + A++ V +P KQ ++A V+V D P+++ +AF Sbjct: 180 VQSLG--DVDTAVKGLQLFVKDFVLSMPTGKQNSFANKTVPQYVLVTVREDTPVNLVSAF 237 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLK 354 E+AVK++ G+LQPS+ + + A S+ + + + ++ L Sbjct: 238 EEAVKSRHGYLQPSVAKLEKEYQDTQQFVQTPLA----SVVVTNKESKISTKAADVDDLV 293 Query: 355 SWV 357 S + Sbjct: 294 SKI 296 >UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH9_CYAP4 Length = 501 Score = 220 bits (559), Expect = 8e-56, Method: Composition-based stats. Identities = 74/323 (22%), Positives = 143/323 (44%), Gaps = 25/323 (7%) Query: 48 MRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLA----LLSGKSVDEA 103 R E R L VL + ++ + + ++ A + + ++ Sbjct: 174 WRTK--LQSEFAEMPERVDDQVSLWSVLSIQALQKSQEDLANEDEADDEKVDTSNTMFFV 231 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDD--KKLLKVLKEDIAAIRVNLQQGVDIA 161 + + + +++ + + ++ + K++ +K V + DIA Sbjct: 232 GDVEIENLAGFLLNNLQVVQQDISASVPSFSKAVVDKIIDTIKHKDEKGNVIFPKPGDIA 291 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQE 218 L GRM + KVD ++ +AHAI+ +++ + D+FTAV+DL E GS H+G Sbjct: 292 LFGRMMAN---LPNAKVDASVQVAHAISVNKLQQEFDFFTAVEDLAEPDSLGSGHMGETG 348 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 ++S +YR+ ++ QL++NLG + + A IA +P Q +AA + Sbjct: 349 YNSSTYYRFTTLDTEQLKQNLG--NEDNAATIAHAFAEAFVRAIPTGHQNGFAAHSLPAA 406 Query: 279 VM-VNFSDMPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYG-----LNGAAA 330 VM V P+S+ +AFE V K G L+ ++ +++W ++ YG G A Sbjct: 407 VMAVVRKGQPVSLVDAFENPVAPKAGKSLLENAVSKLDEHWAELSKMYGEKTVVFKGIVA 466 Query: 331 QFSLSDVDPITAQVKQMPTLEQL 353 + L+ A V++ P++E+L Sbjct: 467 RAQLAQQLEYLAAVEK-PSVEEL 488 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 10/111 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 + IH+L S P+ LNRD+ M K +FGG R RISSQ KR R+ YY + E + Sbjct: 3 LEIHILQSFPPANLNRDENGMPKSTVFGGYPRARISSQCQKRRTRE--YYHEYCKELGVD 60 Query: 65 TIHLAQLRDVLRQKLGERFDQKIIDKTLALLSG--------KSVDEAEKIS 107 H A ++L E+ Q+ + + A L + D+ K+ Sbjct: 61 LKHFANRSRNWIKQLKEKLTQRGVSEAQAELMASLTISVLSEKPDKKGKLK 111 >UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC9_9ACTN Length = 310 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. Identities = 66/269 (24%), Positives = 118/269 (43%), Gaps = 14/269 (5%) Query: 79 LGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKK 138 + E + I+K +L + +K + + +++ ++A+ L D + Sbjct: 1 MPEVSEGDAIEKAKEVLVALGF-KLKKEENEYLNEYLIFIGTLQIGKLAELAIQALRDGE 59 Query: 139 L--LKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSD 196 K K+ + R VDIA+ GRM VD ++ +AHAI+ +++ Sbjct: 60 KVDKKEAKKILDVKRSPALNAVDIAMFGRMVADA---PDLNVDASVQVAHAISVSSAETE 116 Query: 197 IDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATH 253 D+FTA+DD G+A + T EF+S +FYRYAN+++ L ENLG S + A + Sbjct: 117 FDYFTALDDKAPEDNAGAAMIETTEFTSAMFYRYANVDVFHLCENLG--SPDAATKGINA 174 Query: 254 VVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKA--KDGFLQPSIQ 310 + +P KQ ++A V++ D P+S+ N+FE+ V A L + + Sbjct: 175 FLQSFVKSMPTGKQNSFANRTLPSAVVIQLRDSQPVSLVNSFERPVVALRDKSQLTNAAE 234 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDP 339 A + +G+ + D Sbjct: 235 ALVAQEKALDEAFGVTPQHTFVVAASPDA 263 >UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ25_CYAP7 Length = 480 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 20/261 (7%) Query: 100 VDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGV- 158 ++ I D T + +A E + KKL +K + + + V Sbjct: 187 LELPGAIQGDLKTSYKDNPLAKVVN-----EEEFNQLKKLCNEIKGILYDEKNKRIKPVP 241 Query: 159 -DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL------QEQGS 211 D+AL GRM S VD ++S+AHAI+T+ + + D++TA D + QG+ Sbjct: 242 GDVALFGRMLAS---FSDASVDASVSVAHAISTNSIKREFDYWTAARDFQKNNSDESQGA 298 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYA 271 H+G + F+SGVFYRY+ ++ QL ENLG +E + + P + Sbjct: 299 GHIGDRPFASGVFYRYSCLDSNQLSENLGEIYQEDIQYLVEQYLDAFLHSRPSGYSHQFG 358 Query: 272 AFNPA-DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAA 330 + V P+S+ NAF+ +K D F + S +W+ + YG Sbjct: 359 HDTLPFAGIFVIRQSQPISLVNAFDIPIKKYDSFCRQSWNKLVDHWNEIQQAYGKRLPVK 418 Query: 331 Q---FSLSDVDPITAQVKQMP 348 + FSL I+ VK +P Sbjct: 419 EVHVFSLESFKDISELVKAVP 439 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 42/135 (31%), Positives = 59/135 (43%), Gaps = 22/135 (16%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+NF+ IH+L S PS +NRD K A FGG R+R+SSQS K A+R+ YY + + + Sbjct: 1 MTNFLEIHLLQSTPPSNMNRDQNGSPKTAHFGGVERLRVSSQSWKHAVRQ--YYKKTLPD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLA---------------LLSGKSVDEAEK 105 H +L +R Q+ D+ L LLS +K Sbjct: 59 D-----HKTYRDKGWPTELAKRLKQEKFDEELNLKDSDFSVVLPIAFMLLSAIGAKRDDK 113 Query: 106 ISADAVTPWVVGEIA 120 D T +GE Sbjct: 114 KEGDIDTMLFLGEAE 128 >UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella boydii Sb227 RepID=Q31XC0_SHIBS Length = 245 Score = 206 bits (525), Expect = 8e-52, Method: Composition-based stats. Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 16/240 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K I GG R+R+SSQSLKRA R S + Q + G Sbjct: 1 MTTFIQLHLLTAYPAANLNRDDSGSPKTVILGGATRLRVSSQSLKRAWRTSELFEQALAG 60 Query: 60 ESSLRTIHLAQLRD--VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 +R+ +A+ ++ + + ++ + + L D+ +K P Sbjct: 61 HIGVRSGRIAREAATILIEKGIEDKKAIEWAVEIADYLGKAKKDKKQKNDKKPKDPLTSA 120 Query: 118 EIAWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 E ++ AE D + + + KE A+ + VDIA+ GRM Sbjct: 121 ETEQLV-HISPAEFDAVKALAHQLAEEKRAPKEKDLALLRKDRMAVDIAMFGRMLAKKPG 179 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYA 228 V+ A +AHA + + D+FTAVDDL ++ G+ H+ F S +FY Y Sbjct: 180 F---NVEAACQVAHAFGVSETIVENDFFTAVDDLRQASEDAGAGHVDETGFGSALFYTYI 236 >UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2C Length = 461 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 90/402 (22%), Positives = 151/402 (37%), Gaps = 92/402 (22%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES- 61 + ++H+L + + + RD+ M K +FGG R I++Q+ +RA R N G+ Sbjct: 16 QYFSLHLLETFTAALPVRDENGMPKQFVFGGDPRTMITAQARRRAERTHSRERANAGQGP 75 Query: 62 ------SLRTIHLAQL-------------------RDVLRQKLGERFDQKIIDKTLALLS 96 +RT A+L L + +G +F K + L + Sbjct: 76 LAGYTMGIRTREWAKLTAKALADRYGWDRADALATAKALLEGVGLKFGAKPTTRDLTQVL 135 Query: 97 GKSVDEAEKISADAVTPWVVGEIAWF-------------------------------CEQ 125 + ++A +I AD + AW + Sbjct: 136 LFAPEDAGQIIADWIQEHRAEVAAWTSDYLKAKEAGAAAAAAKKAAAAAARKAKKSGTDA 195 Query: 126 VAKAEADNL--DDKKLLKVLKEDIAAIRVNL--QQGVDIALSGRMATSGMMTELGKVDGA 181 +A A DN ++++L V ++ AI L + +DIAL GR + + VDGA Sbjct: 196 LASAADDNQPNNEEQLPPVPRKIREAILSALAPRDAIDIALYGRFLAE--IADSPNVDGA 253 Query: 182 MSIAHAITTHQVD------------------SDIDWFTAVDDLQEQGSAHLGTQEFSSGV 223 + AHA T H + +D+ A DD G+ G Q SG Sbjct: 254 IQTAHAFTVHAAEHIDDFYAAADDAKLHRKAHALDYIDAADD---SGAGMTGYQSLISGT 310 Query: 224 FYRYANINLAQLQENL--GGASREQAL----EIATHVVHMLATEVPGAKQRTYAAFN-PA 276 FYR+A ++ +L+ NL G +Q V +P AK+ T AA Sbjct: 311 FYRHAVLDRYKLRINLLASGMKPDQVQAAAEAAELEFVEAFTNAIPQAKKNTTAATGILP 370 Query: 277 DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 +VM P + A FEK + + + S+ A ++ ++ Sbjct: 371 KLVMAFTGARPFNYAGIFEKPIAEETDGV-ASVAAADRLLNQ 411 >UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190E665 Length = 139 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 37/114 (32%), Positives = 59/114 (51%), Gaps = 4/114 (3%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L +++P+ LNRD+ K A GG R+R+SSQSLKRA R S + + G Sbjct: 1 MTTFIQLHLLTAYAPANLNRDESGRPKTAFMGGVERLRVSSQSLKRAWRVSETFEAAMDG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 RT + D + + + + ++ I K+ + L K + K DA Sbjct: 61 FMGKRTRRIG--VDYVYRPMKDAGIEEKIAKSSSELIAKQFGKL-KSDKDAKPE 111 >UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1 Tax=Streptomyces sp. C RepID=UPI0001B58196 Length = 91 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 4/84 (4%) Query: 262 VPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFEKAV---KAKDGFLQPSIQAFNQYWD 317 +P K T+ D+V+V S P+S AFEK V + +G ++ + +A ++ Sbjct: 1 MPTGKANTFGNHTLPDVVIVKLRSSRPVSFVGAFEKPVIQHETGEGHVRAAWKALAEHIP 60 Query: 318 RVANGYGLNGAAAQFSLSDVDPIT 341 + +G A P T Sbjct: 61 AIEKTFGATADATWILRVGEPPTT 84 >UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BS05_9ACTO Length = 435 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 36/87 (41%), Gaps = 3/87 (3%) Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ 331 + ++V V D +S+ NAFE+ V + +Q +++ + + YG+ AA Sbjct: 339 SLPELVYVAVRDTRSVSLVNAFEEPVACERGSRVQAAVEVLANEETAIEDAYGMKPLAAF 398 Query: 332 -FSLSDVDPITAQVKQMPTLEQLKSWV 357 D + T+ +L S + Sbjct: 399 VVDPKDYAAKLEDIAHKVTVPELTSLI 425 >UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O87037_VIBCH Length = 96 Score = 44.4 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 262 VPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVA 320 +P A+Q T + P + V + +FE+ V+A G+L P+ +A + ++ Sbjct: 1 MPNARQTTQSGACPWEYARVLVR-KGQRLQASFEQPVRAAGEGYLLPNKKALQNWLEQRE 59 Query: 321 NGYG 324 G Sbjct: 60 KLSG 63 >UniRef50_Q4PFD0 Putative uncharacterized protein n=3 Tax=Basidiomycota RepID=Q4PFD0_USTMA Length = 1692 Score = 41.3 bits (95), Expect = 0.053, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 51/149 (34%), Gaps = 7/149 (4%) Query: 47 AMRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKI 106 R + + + R ++ ++ ++I + L L + K+ Sbjct: 626 HGRDLVAAHTRMNDQARRFGRAMLKFHADSEREEQKRVERIAKERLNALKADDEEAYLKL 685 Query: 107 ---SADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ---GVDI 160 + D ++ + + + +A+A +D + + A Q+ VD Sbjct: 686 IDTAKDTRITHLLRQTDGYLDSLAQAVQAQQNDDVHADAIAAERAVEESANQEVGVAVDE 745 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAIT 189 + G + GKVD S+AH IT Sbjct: 746 TMFGATRQDDPSEDRGKVD-YYSVAHRIT 773 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobact... 462 e-129 UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Ac... 411 e-113 UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID... 389 e-107 UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 T... 387 e-106 UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=S... 384 e-105 UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidob... 379 e-104 UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=S... 378 e-103 UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacteriu... 377 e-103 UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=... 376 e-103 UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putati... 375 e-102 UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 375 e-102 UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria ... 374 e-102 UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacteriu... 374 e-102 UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Strepto... 373 e-102 UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinom... 371 e-101 UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=P... 370 e-101 UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=S... 370 e-101 UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=B... 369 e-101 UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidob... 367 e-100 UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter ro... 364 3e-99 UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=... 363 5e-99 UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=G... 363 6e-99 UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=... 363 9e-99 UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actino... 362 1e-98 UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellul... 361 2e-98 UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 T... 361 3e-98 UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 360 4e-98 UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=A... 358 3e-97 UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes Rep... 355 2e-96 UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 352 8e-96 UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax... 352 1e-95 UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=D... 352 1e-95 UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=T... 350 5e-95 UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=S... 349 9e-95 UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=A... 349 1e-94 UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=R... 347 5e-94 UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=D... 346 5e-94 UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfo... 345 2e-93 UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=B... 345 2e-93 UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=A... 344 2e-93 UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=... 344 2e-93 UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces R... 344 2e-93 UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=A... 341 3e-92 UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granuli... 340 5e-92 UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=A... 340 6e-92 UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=C... 338 2e-91 UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiob... 335 2e-90 UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=R... 333 8e-90 UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=A... 332 1e-89 UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=J... 329 7e-89 UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=P... 329 1e-88 UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=B... 327 3e-88 UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=D... 326 1e-87 UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=A... 324 2e-87 UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=G... 324 4e-87 UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospi... 323 7e-87 UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinoc... 322 1e-86 UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=B... 322 1e-86 UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=S... 312 2e-83 UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax... 304 5e-81 UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=A... 293 7e-78 UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Ac... 285 1e-75 UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus ... 276 9e-73 UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax... 272 1e-71 UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=C... 259 9e-68 UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 259 1e-67 UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella bo... 252 2e-65 UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=C... 232 1e-59 UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax... 220 4e-56 UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 ... 138 3e-31 UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1... 83 1e-14 UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobil... 66 2e-09 Sequences not found previously or not previously below threshold: UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O8703... 46 0.003 CONVERGED! >UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobacteria RepID=YGCJ_ECOLI Length = 363 Score = 462 bits (1189), Expect = e-129, Method: Composition-based stats. Identities = 363/363 (100%), Positives = 363/363 (100%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE Sbjct: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA Sbjct: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 Query: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG Sbjct: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG Sbjct: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA Sbjct: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 Query: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN Sbjct: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Query: 361 GEA 363 GEA Sbjct: 361 GEA 363 >UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Actinomycetales RepID=C0W6U1_9ACTO Length = 374 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 113/369 (30%), Positives = 169/369 (45%), Gaps = 20/369 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IH++ S PSC+NRDD K A++GG RR+R+SSQS KRA R + + Sbjct: 1 MSTFVDIHLIQSLPPSCVNRDDSGSPKSALYGGVRRLRVSSQSWKRATRLYFNEHLDATD 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA---EKISADAVTPWVVG 117 +RT + +L + + + S + A K A A + +++ Sbjct: 61 VGIRTKRVVELLADRISAIAPDLADSALALAEQVFSAAKIKVAPPRGKKDAPAESGYLLF 120 Query: 118 EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 ++A+ + + + VDIAL GRM Sbjct: 121 LSTSQINRLAEMATRAAHAGE---KIDPKETKKIFKEEHAVDIALFGRMVADDA---DLN 174 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDD------LQEQGSAHLGTQEFSSGVFYRYANIN 231 VD A +AHAI+TH +++ D+FTAVDD ++ G+ +GT EFSS YRYA +N Sbjct: 175 VDAACQVAHAISTHAAENEYDFFTAVDDEKSRAMEEDAGAGMMGTVEFSSATMYRYATVN 234 Query: 232 LAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDMPLSM 290 L L ENLG R+ AL + + +P KQ T+A D V+V D P+S+ Sbjct: 235 LDMLVENLG--DRDAALRALSVFLEGFCLSMPTGKQNTFANRTLPDSVVVSVRDDQPVSL 292 Query: 291 ANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMP 348 AFEK V+ DGFL S++A +Y + +GL A+ P A + + Sbjct: 293 VGAFEKPVRTTESDGFLTRSVEALARYEHTIEENFGLKPQASFVVSLADVPELASLGERI 352 Query: 349 TLEQLKSWV 357 T L V Sbjct: 353 TFADLPGKV 361 >UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID=C2BET9_9FIRM Length = 359 Score = 389 bits (1000), Expect = e-107, Method: Composition-based stats. Identities = 98/356 (27%), Positives = 173/356 (48%), Gaps = 22/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + + P+ +NRDD K A +GG R R+SSQS KRA+RK ++ + Sbjct: 10 FLDIHAIQTVPPANINRDDTGSPKTAQYGGVTRARVSSQSWKRAIRKYFNENGDVENVGI 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R++ + + + K+ ++ I++ + ++ K+++ A+ + D + Sbjct: 70 RSLEIVRY---VANKIVQKDGSISIEEAM-EMADKTINNAKISTKDQKAKALFFMSDKQA 125 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A+A D ++DKK+L+ + ++ +D+AL GRM D + Sbjct: 126 EELAQASIDKVNDKKILQEILKN--------DTSIDVALFGRMVADDA---SLNEDASSQ 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 +AHAI+TH + S+ D+FTAVDDL G+ LGT E++S YRYANI L L Sbjct: 175 VAHAISTHAIQSEFDFFTAVDDLAPEDNAGAGMLGTVEYNSSTLYRYANIALHDFYRQL- 233 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFEKAVK 299 A +E+ ++ V +P K T+A ++V+ SD PL+M +AFE+ +K Sbjct: 234 -ADKEETIKATKLFVKSFVESMPTGKINTFANQTLPQAIVVSLRSDRPLNMVSAFEEPIK 292 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ + + A L + + K +L L Sbjct: 293 SDNGYVDKSIEKLFSEYTKYDKILDKPIFTAYLILGNT-EVNEIGKSEASLNDLLE 347 >UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 Tax=Gardnerella vaginalis RepID=D2RB01_GARVA Length = 362 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 100/360 (27%), Positives = 170/360 (47%), Gaps = 22/360 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I + S P +NRDD K A +GG R R+SSQ K +MR+ + Sbjct: 6 FLDIQAIQSVPPCNINRDDAGSPKTAQYGGVTRARVSSQCWKHSMREYFKEHSGDSNVGM 65 Query: 64 RTIHLAQLRD----VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 R+ ++ + L+ +L E+ + +KTL K+ + KI + +GE Sbjct: 66 RSKNIVKYVADKIITLKPELSEQEALDLANKTLNNAGFKTKTDKGKIIPVVNVLFFLGE- 124 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 +A+A +N+ DKK L+ + +D +DIAL GRM D Sbjct: 125 -NQANSLAQAAINNVTDKKQLEEILKDNP--------PIDIALFGRMLADN---PSLNED 172 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQ 236 + +AHAI+TH V ++ D++TAVDDL G+ LGT E++S YRYAN+ + + Sbjct: 173 ASSQVAHAISTHAVRAEFDYYTAVDDLSVDDNAGAGMLGTIEYNSSTLYRYANVAIHEFS 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFE 295 L ++E + + A +P K T+A M++V D P+++ +AFE Sbjct: 233 HQLSD-NKESTINALKLFIEAFANAMPTGKVNTFANQTLPQMLVVTLREDRPVNLVSAFE 291 Query: 296 KAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 VKAKDG++ SI+ +Q +++V A+ ++ + + +++QL Sbjct: 292 DPVKAKDGYVSKSIEKLSQEYEKVQKFVHKPLASFYVTMDSSNKEIKLGVEEQSMQQLLD 351 >UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM53_SYNFM Length = 384 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 125/394 (31%), Positives = 196/394 (49%), Gaps = 48/394 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI----G 59 F++IH++ + +PS LNRDD N KD FGG RR RISSQ +KR +R ++Q + G Sbjct: 2 FVDIHIIQNFAPSNLNRDDTNSPKDCEFGGYRRARISSQCIKRVVRSHRSFSQAVVHAGG 61 Query: 60 ESSLRTIHL-AQLRDVLRQKLG--ERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ +RT + ++L D+ +K G E + + + +T+ L G + + EK T +++ Sbjct: 62 DTGVRTKRIKSRLMDLFAKKYGKPEIVETEKVAETVIELLGLKLKDEEK------TEYLL 115 Query: 117 GEIAWFCEQVAKAEADNLDD---------------------KKLLKVLKEDIAAIRVNLQ 155 Q+A+ D+ D K+ + LK + R + Sbjct: 116 YLGENEAAQLARLAVDSWDALLAIEPEQDKKKKKGTGQESLKEFQEELKGIVGKRRKEAR 175 Query: 156 Q-GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGS 211 DIAL GRM VD A +AHA++T++V+ ++D+FTAVDDL +E GS Sbjct: 176 SYAADIALFGRMIADNKNM---NVDAACQVAHAVSTNKVEMEMDYFTAVDDLLPGEETGS 232 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYA 271 +G EF+S FYRY+N+N+++L ENL G + + V VP KQ + A Sbjct: 233 DMIGVVEFNSSCFYRYSNVNVSKLAENL-GFNNDLTTAALLGYVEASVKSVPTGKQNSMA 291 Query: 272 AFNPADM--VMVNFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNG 327 A NPA V+V P S+ANAF+K V+ + SI A +Y++R+ YG G Sbjct: 292 AQNPAGYARVIVRRDGFPWSLANAFQKPVRPSLDKSLEEASIDALERYFERLKAVYGTEG 351 Query: 328 AAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 S + +++M L+ LK+ V G Sbjct: 352 IVCDASFNLHRDDGGSLRKM--LDALKACVAGEG 383 >UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT63_9BIFI Length = 371 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 103/372 (27%), Positives = 162/372 (43%), Gaps = 18/372 (4%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L PS +NRDD K A GG R R+SSQS KRAMR+ + + Sbjct: 2 FVDIHCLQQVPPSNINRDDTGSPKTAYVGGALRARVSSQSWKRAMREMFSSKLDSSKLGK 61 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSG-----KSVDEAEKISADAVTPWVVGE 118 RT L + + ++ +L+ K+ D A + T +++ Sbjct: 62 RTKSAVALISSVIAEKRPDLVEESKSLAEKVLAATGVKVKASDRAGADKGSSATEYLIFI 121 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 EQ+A D+ K +K+++AA+ + +Q +DIA GRM Sbjct: 122 ANREVEQLADIAITAFDEGKDPSKMKKEVAAV-FHGEQAIDIACFGRMLADA---PDLNT 177 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQL 235 D + +AHA + Q+ + D+FTAVDD G+A + T F+S YRYA +N+ L Sbjct: 178 DASAQVAHAFSIDQITPEYDYFTAVDDCASDDNAGAAMIDTIGFNSSTLYRYATVNVDAL 237 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPA-DMVMVNFSDMPLSMANAF 294 ++ L A+E V +P KQ T+A D+V+V P+S A+AF Sbjct: 238 KDQLQ--DANAAVEGVAAFVDAFIKSMPSGKQNTFANHTLPEDIVIVLRDSQPISAADAF 295 Query: 295 EKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITAQVKQMPTLE 351 E +K KDG + I+ + + YG A +S + + TL Sbjct: 296 EDPIKRKDGISVSRQGIERLGDRLNEIRINYGEEPVKAWHVVSGGSVHSLDEWSEQVTLP 355 Query: 352 QLKSWVRNNGEA 363 +L+ +R A Sbjct: 356 ELEQGLRETLSA 367 >UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ1_SPHTD Length = 397 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 108/387 (27%), Positives = 171/387 (44%), Gaps = 38/387 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE--S 61 F+ +H++ + +PS LNRDD KD FGG RR RISSQ+LKRA+R + + E Sbjct: 2 FVELHIIQNFAPSNLNRDDTGAPKDCQFGGYRRARISSQALKRAIRMTFGEENLLPEESR 61 Query: 62 SLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 + RT +A L + L + + + G S ++ ++ + T +++ Sbjct: 62 ARRTKRIAGALVERLVASGKDAVAAAAVVEAAIQGIGLSFEKPKEGDTEKKTQYLLFLGQ 121 Query: 121 WFCEQVAKAEADNLD--------------------DKKLLKVLKEDIAAIRVNL--QQGV 158 +A + D K L + + ++ + Sbjct: 122 REINALADVCLAHWDTLVDVAPNADAASERDAKKAKKANKAALPKQVQLALLDALDGRSA 181 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLG 215 D+AL GRM +D A +AHAI+TH+V ++ D++TAVDDL+ G+ LG Sbjct: 182 DVALFGRMLAD---LPEKNIDAASQVAHAISTHRVATEFDFYTAVDDLKPDDTAGADMLG 238 Query: 216 TQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 T EF+S FYRY+NI++ QL ENLGG + A + +P KQ + AA NP Sbjct: 239 TVEFNSACFYRYSNIDVDQLIENLGG-DVDLARTTVEAFLWASIHAIPTGKQNSMAAQNP 297 Query: 276 ADMVMVNFSDMP-LSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 VM D S+ANAF V ++ S+ A YW + YG + Sbjct: 298 PSFVMAVVRDRGLWSLANAFVNPVAPAHDGDLIERSVDALEAYWSNLVRVYG-GELRGTW 356 Query: 333 SLSDVDPITAQVKQ--MPTLEQLKSWV 357 ++ +++ + T E+L V Sbjct: 357 CVNVNPRELGPLEELHVDTFEELVDAV 383 >UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=B1VIY1_CORU7 Length = 376 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 108/373 (28%), Positives = 161/373 (43%), Gaps = 20/373 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+I+ L S PS +NRDD + K+AIFGG R R+SSQS KRA+R+ + + Sbjct: 1 MSKIIDIYALQSLPPSLINRDDTGVPKNAIFGGVPRQRVSSQSWKRAIRRYFFENFDAAN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIID-----KTLALLSGKSVDEAEKISADAVTPW- 114 R+ L + ++ G I K + + +K DA + Sbjct: 61 IGDRSKRLPEKIARQLEEQGMEQGTAIERTEQLFKAAGIKTAVEKKPKDKDETDAEVAYP 120 Query: 115 VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTE 174 G + + + ++ K + AI + + VDIA+ GRM Sbjct: 121 QTGYLLFLSAHQIDNAVKAIQERDGKNFTKREAQAIL-DQEHSVDIAMFGRMVADDAAY- 178 Query: 175 LGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANI 230 VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + S YRYA + Sbjct: 179 --NVDAAVQVAHALGIHDSAPEFDYFTAVDDLAEEGEETGAGMIGTVQMMSSTLYRYATV 236 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLS 289 NL L ENL S + A + A V +P K T+A ++V V D P+S Sbjct: 237 NLEGLAENL--DSEDAAKQAAVEFVEAFIASMPTGKINTFANQTLPELVYVAVRDTRPVS 294 Query: 290 MANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF-SLSDVDPITAQVKQ 346 + NAFE V+A G + + Q V N YG A+ L + + Sbjct: 295 LVNAFEAPVEATEDKGRREVGAEVLAQEARDVENVYGFKPQASFVMGLGQLAEPFTDIAT 354 Query: 347 MPTLEQLKSWVRN 359 TL +LK + Sbjct: 355 QVTLPELKEQLAG 367 >UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=Proteobacteria RepID=B3E5V0_GEOLS Length = 356 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 105/375 (28%), Positives = 158/375 (42%), Gaps = 35/375 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ IH+L S+ P+ LNRDD K A GG R+R+SSQSLKRA R S + Q + Sbjct: 1 MSRFVQIHLLTSYPPANLNRDDQGRPKTAKMGGYDRLRVSSQSLKRAWRTSDLFQQALTE 60 Query: 60 ESSLRTIHLA------QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 RT L + +++K + QKI AL K D + + + Sbjct: 61 HVGTRTKLLGVMAYEKLVAGGVKEKQAKESAQKIAGVFGALKKAKEKDSLVDLEIEQLVH 120 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 EI + + + ++ + Q DIA+ GRM S Sbjct: 121 VSPSEIQAIESLLETLISQG-------RAPEDTELDLLRIQGQSADIAMFGRMLASS--- 170 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYAN 229 V+ A +AHAI+ H V + D+FTAVDDL ++ G+AH+G F++G+FY Y Sbjct: 171 PSYNVEAACQVAHAISVHPVVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAAGLFYSYIC 230 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPL 288 IN L ENLGG + ++ P KQ ++ + A V+ D P Sbjct: 231 INRTLLVENLGG-DEALVQKSIQALIEAAVKVPPNGKQNSFGSRAYASYVLAEKGDQQPR 289 Query: 289 SMANAFEKAVKAKD----GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQV 344 S++ AF K V ++ F ++ A + YG +D Sbjct: 290 SLSVAFLKPVTSQGIEGTDFGTAAVDALTTQRQNMDAVYGP--------CADASCEINVF 341 Query: 345 KQMPTLEQLKSWVRN 359 + TL +L +V Sbjct: 342 EGKGTLAELLKFVAE 356 >UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putative n=2 Tax=cellular organisms RepID=B0TDU0_HELMI Length = 385 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 117/389 (30%), Positives = 174/389 (44%), Gaps = 37/389 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE--SS 62 + IHVL +H+P+ LNRD+ KD +FGG RR RISSQ KR +R S + +IGE Sbjct: 2 VEIHVLQNHAPANLNRDESGSPKDCMFGGVRRGRISSQCQKRTIRCSPLFQDSIGESRLG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 +RT L L +LG + I A G ++ K D +T + Sbjct: 62 MRTRKLPFLVKEELMRLGLSEELAKIGARKASGLG---NKDGKERDDEITAQAIFLTQED 118 Query: 123 CEQVAKAEADNLDDKKLLKVLKEDIAAIRVN------LQQGVDIALSGRMATSGMMTELG 176 +A+ +L DK + + ++ + VD+AL GRM TS + Sbjct: 119 VSVIARCLFRHLKDKTVKQAKAIKAQELQKDPELVGWRPVTVDVALFGRMTTSTAFND-- 176 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+ ++ + HAI+TH+VDS+ D+FTAVDDL + G+ +G EF+S +Y+Y N+++ Sbjct: 177 -VEASVQVGHAISTHRVDSEFDYFTAVDDLMGDGDSGADMIGDTEFNSCCYYKYFNVDMD 235 Query: 234 QLQENLGGASR-------------EQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +L+ NL G R A I + L P KQ ++AA V+ Sbjct: 236 ELKRNLAGPDRLKKLTAEERQDLARDAAHIVKAFIESLVFCSPDGKQNSFAARQLPSAVL 295 Query: 281 VNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL-- 334 V P+S ANAF K V A+ +Q S+ AF + +GL L Sbjct: 296 VEVKKRKIPVSYANAFVKPVTARGEMDLVQASVNAFLDHVKETEKCFGLTPNRRWLLLMG 355 Query: 335 -SDVDPITAQVKQMPTLEQLKSWVRNNGE 362 T QV P L + + GE Sbjct: 356 CESPKMTTDQVSTFPALVEELTAALQQGE 384 >UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD3_THET1 Length = 382 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 120/379 (31%), Positives = 183/379 (48%), Gaps = 30/379 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA--QNIGES 61 + +H++ + +PS LNRDD KD FGG RR RISSQ +KRA+R+ + Sbjct: 2 LVELHMIQNFAPSNLNRDDTGSPKDCEFGGVRRARISSQCIKRAIRREFKQNGLLDSERI 61 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 + RT + Q +LG R ++ LLS + + + GEI Sbjct: 62 AERTRLVTQEIADRLARLG-RDREQATRVAGFLLSAAKLKVDNSQRTEYLLFLGRGEIDA 120 Query: 122 F-------CEQVAKAEADNL-----DDKKLLKVLKEDIAA---IRVNLQQGVDIALSGRM 166 +Q+A +L D KK + + D++ R++ + D+AL GRM Sbjct: 121 ITALCNERWDQLAPLADQSLSDQSNDKKKAAQQVPADMSRELLARLDGGKAADLALFGRM 180 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGV 223 +D A +AHAI+TH+V + D++TAVDDLQ E G+ +GT EF+S Sbjct: 181 LAD---LPDKNIDAASQVAHAISTHRVSIEFDFYTAVDDLQPESETGAGMMGTVEFNSAC 237 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRY+N+++ QL NL G RE AL+ +H +P KQ + AA NP MV Sbjct: 238 FYRYSNVSMEQLITNLQG-DRELALKTLEAFIHASVRAIPTGKQNSMAAHNPPSMVFAVV 296 Query: 284 SD-MPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAA--AQFSLSDVD 338 + P S+ANAF + V ++ + SIQA + YW ++ + YG + A +L DV Sbjct: 297 REGAPWSLANAFARPVAPGREEDLVGRSIQALDSYWGKLVSVYGGDDIRKKALITLEDVP 356 Query: 339 PITAQVKQMPTLEQLKSWV 357 ++ T++ L V Sbjct: 357 LQHLGDARVETVKALVEQV 375 >UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria RepID=A3EQA5_9BACT Length = 398 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 113/395 (28%), Positives = 180/395 (45%), Gaps = 47/395 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M I IHVL + +PS LNRDD KDA+FGG RR RISSQ +KR++R + + G Sbjct: 1 MKTLIEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARISSQCIKRSVRDFFCHKREDGI 60 Query: 60 ----ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE---------- 104 E +RT + Q + D+L++K L+ L K +E Sbjct: 61 FSPDEIGVRTKRIYQAIADLLKEKRDISDTITKAKTALSYLKIKPKNEKTQYLLFLSPKE 120 Query: 105 -KISADAVTPW---VVGE-IAWFCEQVAKAEADNLDDKK-------------LLKVLKED 146 K A+A+ + +VGE I ++ + D + ++ + K +E Sbjct: 121 IKDFANAIDEYWDQIVGEPIETDNSELDEETPDTVSLEEQKPKKGKKNKKPNIPKEFQEK 180 Query: 147 IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL 206 + ++ N + +DIAL GRM + A +AHAI+TH V+ + D++TA+DDL Sbjct: 181 LESVL-NGGKSIDIALFGRMLAD---IPEKNQNAACQVAHAISTHAVEREFDYYTAIDDL 236 Query: 207 QE---QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVP 263 + GS +GT EF+S FYRYA ++L L +NL E + + P Sbjct: 237 KPDDTAGSDMIGTVEFNSACFYRYAVVDLEALNKNLHD-DSELTNKSIRAFLEAFIISEP 295 Query: 264 GAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRV 319 KQ ++AA NP + + ++ P ++ANAFE AV K G + S + + Sbjct: 296 TGKQNSFAAHNPPEFIAISVRHNAGPRNLANAFETAVFPKKGESLTRKSADELVKKAKSL 355 Query: 320 ANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLK 354 + +G +L + + + +LE L Sbjct: 356 QSAFGGEDKTFLINLVGTN-VNGYGTVVASLEDLL 389 >UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacterium RepID=C3PF94_CORA7 Length = 384 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 109/380 (28%), Positives = 168/380 (44%), Gaps = 32/380 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+IH L + PS +NRDD K AIFGG R R+SSQS KRA+R + Sbjct: 1 MSLVIDIHALQTLPPSLINRDDTGAPKSAIFGGVPRQRVSSQSWKRAIRNYFEKNVDPEF 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSV-----------------DEA 103 R+ L + L + ++ I + L + ++ Sbjct: 61 VGDRSKRLPEKIAKLVENHDGWDSERAIKQVSDLFKAAGISTEVDSKRIKELEKSDAEDK 120 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALS 163 E++ +A P I +Q+ +A +D + +K+ A + ++ Q VD+A+ Sbjct: 121 EELIKEASYPRTKYLIFLSPQQIDRAVRAIVDADG--EKIKKAEAKVILDTQHSVDMAMF 178 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEF 219 GRM VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + Sbjct: 179 GRMIADDAAF---NVDAAVQVAHALGIHSSAPEFDYFTAVDDLAEDGEETGAGMIGTVQM 235 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S YR+A +N+A L +NL AS E A + A V +P K T+A ++V Sbjct: 236 MSSTLYRFATVNVAGLTKNL--ASEENAKQAAVQFVDAFIKSMPTGKINTFANHTLPELV 293 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFS-LS 335 V D P+S+ AFE+ V+A D +A + YGL AA +S Sbjct: 294 YVTVRDTRPVSLVTAFEEPVQATDDKNLRLAGAEALAKEEREFEENYGLKPLAAFAVGVS 353 Query: 336 DVDPITAQVKQMPTLEQLKS 355 + A + + TL +L Sbjct: 354 EARAPFADIAETVTLPELSE 373 >UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ0_STRMN Length = 359 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 91/356 (25%), Positives = 163/356 (45%), Gaps = 24/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I+ + + PS +NRDD K +GG RR R+SSQS K+AMR Y + Sbjct: 11 FLDIYAIQTLPPSNINRDDTGSPKTTQYGGVRRARVSSQSWKKAMRDYFYEHAEEEQLGK 70 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 RT + ++K K + + + D + +G Sbjct: 71 RTRKVVNYVAEKIIHQKIDLNEKESSKLAT-----DILKLAGVPTDGKVLFFIGNTE--A 123 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A A + DK+ + + + +D+AL GRM + TE D + Sbjct: 124 EKLATAAVKGVKDKEEARKI--------MQSNLALDVALFGRMVANDKETEA---DASSQ 172 Query: 184 IAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AH I+TH V ++ D++TAVDDL + + LGT EF+S YRYAN+ + + G Sbjct: 173 FAHPISTHAVQTEFDFYTAVDDLASDDDAKAGMLGTVEFNSSTLYRYANVAIHEFLVQRG 232 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD-MVMVNFSDMPLSMANAFEKAVK 299 +RE ++ + A +P K ++A +++ SD P+++ +AFE+ VK Sbjct: 233 --NREDLVDSLQLFIKAFAESMPRGKINSFANQTIPQTLIITVRSDRPVNLVSAFEEPVK 290 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ ++ + +V + SL +V+ +T + ++ +L Sbjct: 291 SSNGYVTKSIEKLSKEFVKVEKMVKKPVLSFYVSLEEVEALTKVGIEKNSITELVE 346 >UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA64_9ACTO Length = 374 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 107/379 (28%), Positives = 168/379 (44%), Gaps = 23/379 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IHVL + PS NRDD K A FGG +R+RISSQ++KRA R+ G Sbjct: 1 MSVFVDIHVLQTLPPSNPNRDDTGAPKSATFGGVQRMRISSQAIKRATRQDFEGKIADGN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-------EKISADAVTP 113 +RT + +L + +R D + LA + K++ + + + Sbjct: 61 YGVRTKKIVELVARTITE--KRPDLEAASIELAEMGLKAIGFKLAEPRGNKSDNELKESG 118 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V A E V+ A + KE V+ +DIAL GRM Sbjct: 119 FLVFLSAKQIEHVSDALISVAHEDDPAAAFKELKPRSLVDTDHSIDIALFGRMVAEPNA- 177 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ------EQGSAHLGTQEFSSGVFYRY 227 VD A +AHAI V+ + D++TAVDD + ++G+ +GT EF+S YRY Sbjct: 178 --LNVDAACQVAHAIGVGAVEREYDYYTAVDDAKKRNDEADEGAGMIGTIEFASATVYRY 235 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDM 286 A IN+ L+ENLG A V +P K T+A + V+V D Sbjct: 236 ATINVDLLRENLG--DDAVADRAVELFVDSFVRSMPTGKVTTFANRTLPEAVLVQVRDDQ 293 Query: 287 PLSMANAFEKA-VKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV-DPITAQV 344 P++M+ AFE+ + + GF +P+I F ++ ++ GL + S + +++ Sbjct: 294 PINMSGAFEEPIIAGQHGFAEPAIARFVEFESQLRELTGLEAVESLVSWTTPRGESFSEL 353 Query: 345 KQMPTLEQLKSWVRNNGEA 363 + L L Sbjct: 354 GKQVRLASLGETAAEAVRG 372 >UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ARH7_PELPD Length = 374 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 115/374 (30%), Positives = 173/374 (46%), Gaps = 23/374 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M + IHVL + +PS LNRDD KDA+FGG RR R+SSQ LKR++R+ QN G Sbjct: 1 MKTIVEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARVSSQCLKRSVREYFKD-QNKGW 59 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQ-------KIIDKTLALLSGK-SVDEAEKISADAVT 112 + RT + + E K I+ ++ L V ++ +D + Sbjct: 60 VADRTKRVVYALKERISPVLESQKDFSEDNLLKAIEVAVSNLGSNKKVKVDKEKKSDVLL 119 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN--LQQGVDIALSGRMATSG 170 EI + VA++ AD L K +V++ AI + VD+AL GRM Sbjct: 120 FLSPKEIDALAQVVAESYADLLKTKLSDQVVRNLNDAIDGENKSRLSVDVALFGRMLA-- 177 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRY 227 + + A +AHAI+TH V+ + D++TAVDDL+ G+ +GT EF+S FYRY Sbjct: 178 -VMPEKNQNAACQVAHAISTHAVEREFDFYTAVDDLKPEDTAGADMMGTVEFNSACFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS--D 285 A ++ +L NL A A + + P KQ T+AA NP + V V Sbjct: 237 AVVDWEKLLVNLQ-ADEALATKGLRAFLEGFVVAEPTGKQNTFAAHNPPEFVAVTVRRNA 295 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ 343 P ++ANAFE AV+ + + S + + + +G + +L++ I Sbjct: 296 APRNLANAFETAVRVRKDESLTRKSAEGLANKAKALQSAFGGDEKTFVLNLAEAT-IDGF 354 Query: 344 VKQMPTLEQLKSWV 357 MPTL L Sbjct: 355 GIVMPTLNDLLDKA 368 >UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC4_SYNJA Length = 380 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 104/370 (28%), Positives = 167/370 (45%), Gaps = 22/370 (5%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY--AQNIGESS 62 + IH++ S P+ LNRD+ M K IFGG+ R RISSQ KRA+RK + + + Sbjct: 3 LEIHLIQSFPPANLNRDENGMPKSTIFGGRPRARISSQCQKRAVRKYYHQYAELDPAHFA 62 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 R+ + K G +Q + LAL G + +K A + E+ Sbjct: 63 ARSRNWLPELKSKLVKAGIPDEQAGMAARLALEQGLKLKFNDKNEATTIVFLGKTELDAI 122 Query: 123 CEQVAK---AEADNLDDKKLL--KVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 E + K A L ++K + + + I V+ + D+AL GRM S Sbjct: 123 AEILIKNWSAIESGLREEKPKLPQKIAKAIEKALVDTGKPGDVALFGRMMAS---LPTVN 179 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A+ +AHAI+ + + + D+FTAVDDL ++ G+ H+G ++S +YR+A ++ Q Sbjct: 180 VDAAVQVAHAISINALQQEFDFFTAVDDLGSSEDTGADHMGETGYNSSTYYRFAVLDKKQ 239 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANA 293 L ENLGG E I VP Q +AA +VM + P+S+ +A Sbjct: 240 LVENLGGT--EHLGSIIKAFATAFIHAVPSGHQNGFAAHTRPALVMAVVREGQPISLVDA 297 Query: 294 FEKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ----VKQM 347 FE V GF L+ +++A ++YW + YG + + + Sbjct: 298 FENPVAPSGGFSLLENAVKALDEYWGSLVKMYGEADVQYKGVVVLDRLAARLNVLKSSKK 357 Query: 348 PTLEQLKSWV 357 ++E+L Sbjct: 358 DSVEELLKSA 367 >UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=Bacteria RepID=A4XYU0_PSEMY Length = 384 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 115/391 (29%), Positives = 172/391 (43%), Gaps = 39/391 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQ--NI 58 MS F+ H++ + +PS LNRDD KDA+FGG RR R+SSQ KRA+R + + Sbjct: 1 MSLFVEFHLIQNFAPSNLNRDDTGAPKDALFGGHRRARVSSQCFKRAIRLAAQEHELVAP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 +RT L L L ++L R + K L+ + + + + E Sbjct: 61 EFRGVRTKKLKTL---LLERLAGRDPLEAEGKIEVALAAAGLKLKDDGKTEYLLFLGEAE 117 Query: 119 IAWFC-------EQVAKA-----------EADNLDDKKLLKVLKEDIAAIRVNLQQGVDI 160 IA F +++A A +V+K+ A ++ + VD+ Sbjct: 118 IAGFATLIEQHWDELAGAPAGGEKKGEKKGKKEAKASAPAEVVKK--AKALLDGGKAVDV 175 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQ 217 AL GRM D A +AHAI+TH+V+ + D+FTAVDD E G+ +G Sbjct: 176 ALFGRMLAD---MPEVNQDAACQVAHAISTHRVEREFDYFTAVDDKGGPDETGAGMIGQV 232 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 EF+S YRYA ++ +L NL RE L + +P KQ T+AA N Sbjct: 233 EFNSATLYRYAVVDAGKLLGNLQ-QDRELTLSALEAFTQAMVRAIPTGKQNTFAAHNLPS 291 Query: 278 MVMVNFSDM-PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 V V PL++ANAFEK + A+ S+ ++ ++A Y ++ Sbjct: 292 FVGVCLRHAGPLNLANAFEKPIAARQDAALSSLSVTELAKHEGKLAAVYADASDQ--WAY 349 Query: 335 SDVDPITAQVKQMP--TLEQLKSWVRNNGEA 363 D+ Q K L +L SWVR A Sbjct: 350 LDLSEAWPQQKGFAVQNLGELASWVRMQVAA 380 >UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG89_9BIFI Length = 387 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 106/387 (27%), Positives = 168/387 (43%), Gaps = 32/387 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + PS +NRDD K A GG R R+SSQ+ KRAMR + + Sbjct: 2 FMDIHCIQQVPPSNINRDDTGSPKTAYVGGALRSRVSSQAWKRAMRGVFDDMLDSDKLGK 61 Query: 64 RTIHLAQLRD---VLRQKLGERFDQKIIDKTLALL--SGKSVDEAEKISADAVTPWVVGE 118 RT + L ++ +++ + LAL K+ + A VT +++ Sbjct: 62 RTKGVVALIASSITAKRPDLAESAEELGQRVLALEGIGVKASNRAGSDKGTLVTDYLIFI 121 Query: 119 IAWFCEQVAKAEADNLD---------DKKLLKVLKEDIAAIR------VNLQQGVDIALS 163 +++A D K L K K D+A ++ + Q +DIAL Sbjct: 122 ANNEIDKLADWAIAASDKGRDFSKVGKKGLSKAEKTDLAKMKNEVSEIFHGPQAIDIALF 181 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFS 220 GRM + D + +AHA + Q+ + D+FTAVDD G+A L T F+ Sbjct: 182 GRMLANA---PDLNTDASAQVAHAFSIDQITPEYDYFTAVDDCASEDNAGAAMLDTVGFN 238 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 S YRYA +N+ L+E L AS A+E A V +P KQ T+A + V+ Sbjct: 239 SSTLYRYAAVNIDALKEQLQDAS--AAVEGAVAFVEAFIKSMPSGKQNTFANHTLPEDVV 296 Query: 281 VNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV 337 V D P+S A+AFE+ V+ K+G + I+ + + + Y A + S Sbjct: 297 VVLRDSQPISAADAFEEPVRRKEGVSVSRQGIERLGKRLNEIRVNYSEEPVKAWYIASGG 356 Query: 338 D-PITAQVKQMPTLEQLKSWVRNNGEA 363 + + + +L L+ +R A Sbjct: 357 EVDSLKEWSEQVSLPDLEHGLRETLNA 383 >UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK6_CITRO Length = 363 Score = 364 bits (935), Expect = 3e-99, Method: Composition-based stats. Identities = 97/358 (27%), Positives = 151/358 (42%), Gaps = 21/358 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K + GG R+RISSQSLKRA R S + Q + G Sbjct: 13 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGTTRLRISSQSLKRAWRTSELFEQALAG 72 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 +R+ +A+ + K G + + V K A P E Sbjct: 73 NIGIRSGRIAREAAEILIKSGIDEKKAVAYVEAIARCFGKV----KADKKAKEPLTNSET 128 Query: 120 AWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTE 174 ++ AE D + + + KE+ A+ + + VDIA+ GRM Sbjct: 129 EQLV-HISPAEFDAVKALAHRLAEEKRAPKEEELALLRHDRMAVDIAMFGRMLADK---P 184 Query: 175 LGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANI 230 V+ A +AHA + + D+FTAVDDL + G+ HLG F S +FY Y I Sbjct: 185 EFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRANSDDAGAGHLGYTGFGSALFYTYICI 244 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLS 289 N L +NL G + + A + P KQ ++A+ A M D P S Sbjct: 245 NKDLLIKNLNG-NVDLANQTLRAFTEAALKVSPTGKQNSFASRAYACWAMAEKGTDQPRS 303 Query: 290 MANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +A AF K + L ++Q + + + Y F++ + + V + Sbjct: 304 LAAAFYKPIVGS-DHLNVAVQRVTELRENMNAVYEQQTEFVGFNVMNKEGSIKDVLEF 360 >UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=Actinomycetales RepID=C7QEM5_CATAD Length = 399 Score = 363 bits (933), Expect = 5e-99, Method: Composition-based stats. Identities = 103/391 (26%), Positives = 173/391 (44%), Gaps = 38/391 (9%) Query: 1 MSN-FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ ++IH+L + PS LNRDD K A++GG RR R+SSQ+ KRA R++ + Sbjct: 1 MTRVILDIHILQTVPPSNLNRDDTGSPKTAVYGGVRRARVSSQAWKRATRQAFGDLLDPS 60 Query: 60 ESSLRTIHLAQ--------LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV 111 E +RT +A+ L L ++I S ++ + +D Sbjct: 61 ELGVRTKRVAEQIANRMTALEPSLSPGDAVAVAVEVIKAATGAKSEVPKRKSAAVKSDQD 120 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDD------KKLLKVLKEDIAAIRV----NLQQGVDIA 161 + E + +++++ +NL K + LK+ RV + + VDIA Sbjct: 121 ATAALPETGYLM-FLSESQLNNLARLGVEGSKDITAFLKDKDFKNRVRQAADTRHSVDIA 179 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD---LQEQGSAHLGTQE 218 L GRM VD A +AHAI+ H V+++ D+FTAVDD E G+ +G + Sbjct: 180 LFGRMVADAT---DINVDAAAQVAHAISVHAVENESDYFTAVDDRSTEAEPGAGMIGIVD 236 Query: 219 FSSGVFYRYANINLAQLQENLGG------ASREQALEIATHVVHMLATEVPGAKQRTYAA 272 F++ YRYA +++ +L +NLG + E + A +P K T+ Sbjct: 237 FNAATLYRYAAVDVNRLADNLGAGLLEGESQTEPVRRAVEAFIRGFALSMPTGKVNTFGN 296 Query: 273 FNPADMVMVNFS-DMPLSMANAFEKAVKA---KDGFLQPSIQAFNQYWDRVANGYGLNGA 328 D+V+V P+S A AFE+A+ A + G+L+ + + Y ++ Y L Sbjct: 297 HTVPDVVLVKLRASRPISFAAAFEEAISAGEHQGGYLKGACERLASYIPKLEQAYDLQEG 356 Query: 329 AAQFSL--SDVDPITAQVKQMPTLEQLKSWV 357 + + Q ++ QL + V Sbjct: 357 TDSWVVCAGSATEALEQAGDPVSISQLVAAV 387 >UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=Gammaproteobacteria RepID=A1SV72_PSYIN Length = 337 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 140/356 (39%), Positives = 204/356 (57%), Gaps = 23/356 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ FINIH LISH S +NRDD MQK A+FGG R RISSQ LKRA+R+S Y + + E Sbjct: 1 MTTFINIHTLISHPSSMMNRDDSGMQKTAVFGGSVRSRISSQCLKRAIRQSDIYGEAVAE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA-DAVTPWVVGEI 119 S+RT +L D+ ++ + E + I D L + S + D+ +I DAV P+ +G I Sbjct: 61 KSIRTNKFDELLDLCKEAMPETDIKLIEDVLLNMGSKVTKDKKTEIRNFDAVQPYAIGSI 120 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 + + + K L K+++ +D+ALSGRM S V+ Sbjct: 121 ----REAINMVNEGTELKDLKKIVQ----------IPTIDVALSGRMDAS---CPPRNVE 163 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENL 239 AMS+AH++TTH D ++DWFTA DDL EQGS H+GT EFSSGVFYRYA+IN+ L +N+ Sbjct: 164 AAMSVAHSLTTHSADIEVDWFTACDDLAEQGSGHIGTTEFSSGVFYRYASINVDLLAKNV 223 Query: 240 GGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVK 299 E I ++ A P AKQ+ +AA+N AD VM S+ P+S+ANAF K ++ Sbjct: 224 KSTVSEVTP-IINTMIRCFAQVSPSAKQKVFAAYNQADFVMATHSNQPISLANAFRKPIE 282 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 ++ SI A ++++++ N Y L+ A L+D +AQ KQ+ + ++ Sbjct: 283 NNGDVMENSIAALVKHYEKLTNAYELDSKAIALDLTD----SAQSKQINLVNKISD 334 >UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=Bacteria RepID=Q3A5Z5_PELCD Length = 373 Score = 363 bits (931), Expect = 9e-99, Method: Composition-based stats. Identities = 102/358 (28%), Positives = 157/358 (43%), Gaps = 30/358 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS FI +H+L S+ P+ LNRDD+ K A GG R+R+SSQSLKRA R S + + + Sbjct: 1 MSRFIQLHLLTSYPPANLNRDDLGRPKTAKMGGVDRLRVSSQSLKRAWRTSDLFGKTVKN 60 Query: 61 -SSLRTIHLAQLR--DVLRQKLGERFDQKIIDKTLALLSG-KSVDEAEKISADAVT---- 112 RT + + ++ + +G + + K + + EK + + Sbjct: 61 GLGTRTKEMGRKVYERLVEKGIGHKDALSWAGAIAGVFGKLKKLTDKEKTALKKLATEER 120 Query: 113 ---PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ----GVDIALSGR 165 V EI + E LD + KE +NL + VDIAL GR Sbjct: 121 REKELVEVEIEQLAFFDLEEEQAVLDLTNSIAERKEGPQPEELNLLRQKMTSVDIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M S V+ A +AHAI+ H + + D+FTAVDDL ++ G+AH+G F++ Sbjct: 181 MLASSPAF---NVEAACQVAHAISVHPIVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAA 237 Query: 222 GVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV 281 G+FY Y IN L ENLGG + A + P KQ ++A+ A V+ Sbjct: 238 GLFYSYICINRDLLAENLGG-DEDLAQRAIAALTEAAVKVPPNGKQNSFASRAYASYVLA 296 Query: 282 NFSD-MPLSMANAFEKAV------KAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 + P S++ AF K + + F +++A + + YG Sbjct: 297 EKGEQQPRSLSVAFLKPIDNRTLYRDDQDFGTAAVEALEAHRQNMNKVYGDCADELYA 354 >UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actinomycetales RepID=Q2JH28_FRASC Length = 384 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 94/336 (27%), Positives = 156/336 (46%), Gaps = 12/336 (3%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M +I++H+L + PS LNRDD K A++GG +R R+SSQ+ KRA R + + + Sbjct: 1 MRCYIDVHILQTVPPSNLNRDDAGTPKQAVYGGVKRARVSSQAWKRATRTAFADHIDQAQ 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEK-ISADAVTPWVVGEI 119 RT ++ L + +LL+ + +K + + ++ Sbjct: 61 LGTRTKRISALLAERLATRCALDAETSTRIATSLLTALKISAGKKAAETAYLLFFGRPQL 120 Query: 120 AWFCEQVAKAEA--DNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 + + + +L D LL +K+ + +D+AL GRM Sbjct: 121 ERLIDLIVEDVPRLADLSDGDLLAAVKDVPVLATLGSDHPIDVALFGRMVAD---LASLN 177 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A +AHA++TH VD + D++TAVDD E G+ +GT EF S YR+A + L Q Sbjct: 178 VDAATQVAHALSTHAVDVEFDYYTAVDDQNAKDETGAGMIGTVEFQSATLYRFATVGLHQ 237 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV-MVNFSDMPLSMANA 293 L ENLGG E +E + T +P Q ++A +++ + D P+++ +A Sbjct: 238 LAENLGG-DIEATVEALRVFLTAFTTSMPTGHQNSFAHRTVPNLLTIAIRPDQPVNLVSA 296 Query: 294 FEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGLNGA 328 FEK V + G L S++ F + + +GL Sbjct: 297 FEKPVLPRGRGVLTGSLEQFAIELNSASTLWGLQPD 332 >UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellular organisms RepID=Q2FNL3_METHJ Length = 382 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 62/401 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS FI IH+L S+ PS LNRDD+ K A GG +R+R+SSQSLKR+ R S ++ + G Sbjct: 1 MSEFIQIHMLASYPPSNLNRDDLGRPKTATVGGTQRIRVSSQSLKRSWRTSEAFSDALKG 60 Query: 60 ESSLRTIHLA-----------QLRDVLRQKLGERFDQKIIDKTLA-------------LL 95 +RT + L D+L K ++I D+ A + Sbjct: 61 AIGIRTRDMGVKIKKALVEGRLLSDILEGKESGVTRERIKDEKKAHEWAVKISSHFGKIE 120 Query: 96 SGKSVDEAEKISA------------DAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVL 143 GK D +K + + EIA + + + + + Sbjct: 121 GGKEKDSDKKSEKTDEKSNKNPLSHKQMVHYSPEEIAGIDDLLGRISGG--------EKV 172 Query: 144 KEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 +D + + VDIAL GRM + A+ ++HAIT H + D+FTAV Sbjct: 173 SDDDCIRLRSDHKAVDIALFGRMLADNAAY---NTEAAVQVSHAITVHDTPVEDDYFTAV 229 Query: 204 DDLQE----QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLA 259 DDL + G+ H+G EF +G+FY Y IN L+ENL G E + ++ + Sbjct: 230 DDLNQLDDTAGAGHIGEAEFGAGLFYTYICINRDLLKENLQG-DNELSNRAIEALIRAAS 288 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 P KQ ++A+ + A ++V + P S+A AF K V KD + +++ DR Sbjct: 289 MVSPSGKQNSFASRSYASYLLVEKGTEQPRSLAAAFFKPVSGKDIY-GDAVKNLEGLRDR 347 Query: 319 VANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 + N YG + + S++ +D +L + S+V Sbjct: 348 MDNAYGTSFKQSSRSMNVIDGTG-------SLTDIISFVLE 381 >UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE3_PROAC Length = 374 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 100/375 (26%), Positives = 166/375 (44%), Gaps = 26/375 (6%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S +++IHV+ S PS +NRDD K A++GG RR R+SSQ+ K+A+R S ++ Sbjct: 3 SYYVDIHVIQSVPPSNVNRDDTGSPKSALYGGVRRARVSSQAWKKAVRTSFKEFLPANQT 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD--------AVTP 113 RT+ + +L ++ + + +AEK T Sbjct: 63 GSRTLRVVELLMNRLTAAPYGLPEEDARQKALEVVKALGLKAEKPRKKDESGAEGIERTQ 122 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V +++A+ A D K+ + A + G+++AL GRM Sbjct: 123 YLVFYSNQQLDRLAQLAA--TTDGKITATDAKKAA----DSDHGIEVALFGRMVADSK-- 174 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYAN 229 VD A+ +AHA++TH V+ + D+FTAVDD + + G+ +GT EF+S YR+A Sbjct: 175 -DLNVDSAVQVAHALSTHAVEIESDYFTAVDDYKLDEDDAGAGMIGTVEFTSETLYRFAT 233 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDMPL 288 + ++ L++NLG + + A+ V +P KQ T+A D V+V Sbjct: 234 VAVSTLKDNLG--DVDLTAQAASAFVRGFIMSMPTGKQNTFANNTIPDAVVVQVRKGRSA 291 Query: 289 SMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGY-GLNGAAAQFSLSDVDPITAQVKQ 346 S AFE V + GF+ S QA Y + G A+ + + Sbjct: 292 SFIGAFEDPVTSDDGGFVAASCQAVAAYAHDCEEAFLGAPEASFVTRVGSRTEAIGTMGT 351 Query: 347 MPTLEQLKSWVRNNG 361 ++ L S VR+ Sbjct: 352 QMPIDDLVSSVRDQV 366 >UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ3_THEFY Length = 373 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. Identities = 103/369 (27%), Positives = 168/369 (45%), Gaps = 23/369 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 F++IH + + S +NRDD+ K ++GGK R R+SSQS KRA+R +G+ + Sbjct: 2 TFVDIHAIQTLPYSNINRDDLGSPKTVVYGGKERTRVSSQSWKRAVRHEV--EARLGDKA 59 Query: 63 LRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPW------- 114 +RT ++++ LR++ + + + L GK + D+ P Sbjct: 60 VRTRRIISEIAKRLRERGWDADLADAGARQVVLSVGKKSGIKLEKEKDSEAPATSVLFYL 119 Query: 115 ---VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGM 171 + E+A ++ A A K +L D + + V + L GRM Sbjct: 120 PVPAIDELAAIADEHRDAVAKEAAKKTPKGILPADRITEVLKSRN-VSVNLFGRMLAE-- 176 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYA 228 +VDGA+ AHA T H ++D+FTAVDD+ + GS H+ +FS+G FYRYA Sbjct: 177 -LPSTEVDGAVQFAHAFTVHGTTVEVDFFTAVDDIPKENDHGSGHMNAGQFSAGTFYRYA 235 Query: 229 NINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMP 287 N+NL +L EN G + A + + VP KQ AA D+V + D P Sbjct: 236 NVNLDRLVENTG--DAQTARTAVAEFLRAFLSTVPSGKQNATAAMTLPDLVHIAVRFDRP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +S A AFE A+ DG+ + Q N Y +R+ + + ++ + + A ++ Sbjct: 294 ISFAPAFETALYGSDGYTLRACQELNNYAERLREVWPDDAIRGYATVENKTDLAALGERY 353 Query: 348 PTLEQLKSW 356 + L Sbjct: 354 DSYPALIDA 362 >UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW7_ACIFD Length = 386 Score = 358 bits (918), Expect = 3e-97, Method: Composition-based stats. Identities = 111/373 (29%), Positives = 169/373 (45%), Gaps = 25/373 (6%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN-IGESSL 63 I++HVL + PSCLNRDD N K A++GG RR R+SSQS KRA R+ IG L Sbjct: 9 IDVHVLQTLPPSCLNRDDTNAPKTALYGGARRARVSSQSWKRATRRYFNENLATIGTDWL 68 Query: 64 RTI----HLAQLRDVLRQKLGER-----FDQKIIDKTLALLSGK---SVDEAEKISADAV 111 R+ +L +L +++ R + + + + L +G +E K A Sbjct: 69 RSRGGGIRTRKLAGLLHERVQARVRDLDVREDDVARLVNLAAGALLGLKEEKLKKRAQET 128 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSG 170 P + + E A L+ + L D+ + +D+AL GRM Sbjct: 129 QPADLEYALFVSESAIDAAVGELERSLRAGDDLDLDVLTTAMGRDLSLDVALFGRMIAD- 187 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRY 227 T VD A +AHAI+TH+V S+ D++T VDDL E G+A +G EF+S YR+ Sbjct: 188 --TPNLNVDAACQVAHAISTHRVTSEFDFYTTVDDLAGDDETGAAMMGFIEFNSATVYRF 245 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDM 286 A ++L +L +NLG + + A +P Q T+AA D+V V+ D Sbjct: 246 ATVSLGRLADNLG--DPDAVPTGVRAFIEAFAKSLPTGHQNTFAALTVPDLVFVSMRGDQ 303 Query: 287 PLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL--SDVDPITAQV 344 P+S+ AFE V++ G++ S + Y D + YG+ S + + Sbjct: 304 PVSLVGAFEAPVESDRGYVHASAERLATYADDIDGLYGVPRLNGWASYVPKLEQAVATHL 363 Query: 345 KQMPTLEQLKSWV 357 QL V Sbjct: 364 GDSIAFPQLLDAV 376 >UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes RepID=Q03C61_LACC3 Length = 361 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. Identities = 108/365 (29%), Positives = 176/365 (48%), Gaps = 30/365 (8%) Query: 1 MSN---FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN 57 M+N +I+IHVL + + +NRDD K A++GG R R+SSQS KRAMR ++ Sbjct: 1 MTNKNLYIDIHVLQTVPSANINRDDTGAPKKALYGGVTRARVSSQSWKRAMRLRFN-QED 59 Query: 58 IGESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ LRT + Q LR L+ D++I K A+ S + + A+ Sbjct: 60 HDDAGLRTKEVPQLLRQALKAAAPALTDEEIAAKVDAVFSTAKIKITKDGQTGALMLIST 119 Query: 117 GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELG 176 G++ + EA LD K+L K+ K +Q +D+AL GRM Sbjct: 120 GQLKKLAQYALDNEA--LDKKELTKLFK---------GEQSLDLALFGRMVADN---PEL 165 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+G+ +AHAI+TH++ + D+FTA+DD + G+A LGT E++S YRYAN+N Sbjct: 166 NVEGSAQVAHAISTHEIVPEFDYFTALDDFKPEDNAGAAMLGTVEYNSSTLYRYANLNFQ 225 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMAN 292 + + N+GG A+ A + +P KQ T+A + VMV D P+++ + Sbjct: 226 EFEANIGGR---AAVSGALSYIKEFLLSMPNGKQNTFANKTLPNYVMVTLRPDTPVNLVS 282 Query: 293 AFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQ 352 AFE VK+ G+++ S++ Q + ++ + + +Q + Sbjct: 283 AFEDPVKSNHGYVEASVKRLEQEYQ--DALQFVDAPLFTAVVGKTN--GEVGEQQANVNG 338 Query: 353 LKSWV 357 L V Sbjct: 339 LLDAV 343 >UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTA9_SACVD Length = 390 Score = 352 bits (905), Expect = 8e-96, Method: Composition-based stats. Identities = 105/350 (30%), Positives = 162/350 (46%), Gaps = 24/350 (6%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 +I+IHV+ + S +NRDD K FGG R R+SSQS KR +R+ GE+ Sbjct: 4 PKYIDIHVIQTLPFSNVNRDDTGSPKTVEFGGVERTRVSSQSWKRVVRQH-VEEAVGGET 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVD-EAEKISADA---------- 110 + + + L ++ E+ + + +AL +GK + + EK +D Sbjct: 63 VRTRRVVVGVAERLIKQGWEKSEAEAAGVQIALSAGKKISLKQEKDESDEVVLTTNVLLL 122 Query: 111 VTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN---LQQGVDIALSGRMA 167 + + E+A ++ + K L +K + + R+N ++ I L GRM Sbjct: 123 LPESGIDELAALADEHREVILAEAKKAKKLTGMKPKLPSERINEILSRRSATINLFGRMV 182 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGV 223 VDGA+ +AHA TTH + D+FTAVDD+++ GS ++ T FS+G Sbjct: 183 AE---LPGANVDGAVQVAHAFTTHGTAVEYDFFTAVDDIEQKLDLPGSGYMDTALFSAGT 239 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRYAN+NL L NL + A + + T VP KQ AA D+V V Sbjct: 240 FYRYANVNLTDLLRNL-DQDTDLARVLVKTFLDGFITTVPSGKQNATAAVTLPDLVHVTV 298 Query: 284 S-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 D P+S+ANAFE V DGF++ S + + +A G + Sbjct: 299 RDDRPVSLANAFEAPVGGGDGFVRKSAHRLDSHAGAIAELLGESHVLFSA 348 >UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4B Length = 383 Score = 352 bits (904), Expect = 1e-95, Method: Composition-based stats. Identities = 102/381 (26%), Positives = 156/381 (40%), Gaps = 36/381 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA-QNIGESS 62 I +H+L S S LNRDD+ K A FGG R RISSQSLKRA R + E Sbjct: 2 LIELHLLQSFPVSNLNRDDLGQPKTARFGGHTRARISSQSLKRAARTLLAQHGLDPSELG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA----------DAVT 112 +RT L L L ER +K + + + A + Sbjct: 62 VRTKRLRDAAASL---LAERGREKEQAVEVCQAGLEEIGFAAHTATGLTKYLLYVGKPAQ 118 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIR------------VNLQQGVDI 160 + + +AK A+ K+ + AA + ++ + DI Sbjct: 119 TLLADYCDERWDTLAKTVAEAKKRKEKQEKTPRKTAAKKPTKQAQEQAKRILDGTRAADI 178 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQ 217 AL GRM V+ A +AHA++TH V ++ D++TA+DDL E + +GT Sbjct: 179 ALFGRMIADNTDF---NVNAASQVAHALSTHAVVNEFDYYTALDDLRPDAEPAADMIGTV 235 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 +F++ FYRYAN++L QL NL + A +H VPG KQ + +A Sbjct: 236 DFNAACFYRYANLDLEQLATNLPD-DPDLVARSARAWLHSFIHAVPGGKQNSMSARTMPQ 294 Query: 278 MVM-VNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ--FSL 334 ++ V ++ANAF V + S Q ++ ++ + YG S+ Sbjct: 295 TLLGVVRETGAWNLANAFLSPVTDVPDLMAASTQRLVDHFQQLRSFYGDTQLRHTTIASI 354 Query: 335 SDVDPITAQVKQMPTLEQLKS 355 + + PTL+ S Sbjct: 355 GSDPAGMPENEIAPTLDDFVS 375 >UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA6_DESDA Length = 350 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 100/370 (27%), Positives = 166/370 (44%), Gaps = 31/370 (8%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD+ K A+FGG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDLGSPKTAVFGGVQRARVSSQCWKRAIREYCGELLPQHFKG 61 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + + LRD+ G ++ ++D+ T + Sbjct: 62 ERTRLIVEPLRDIFINTYGLDEATALVKANDLAEGLATLDKDAAKKNKLQTKTLFFTSRS 121 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A +N + KK K + + DIAL GRM S ++GA Sbjct: 122 ELEALAAIAVNNENIKKHAKTFAQSLCT------DAADIALFGRMVASA---PELTLEGA 172 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQL--Q 236 +HA++TH+ D++ID+F+A+DDL +E G+ GT EF++ +YR+ +NL L Sbjct: 173 AMFSHALSTHKADNEIDFFSALDDLLPSEETGAGMTGTLEFNAAAYYRFCALNLDMLADA 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAF 294 ++LG S ++ I V +P A++ + A V+ D P+ + NAF Sbjct: 233 DHLGALSPDERQGIVAAFVEATLKAMPVARKNSMNANTMPAYVLCVLRDSGQPVQLVNAF 292 Query: 295 EKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQ 352 EKAV + D G+++ SI+ + + R+ N +GL + +L + Sbjct: 293 EKAVYSPDGRGYVEASIKRMEEEYQRLENTWGLTAVETIRMP------------LQSLGE 340 Query: 353 LKSWVRNNGE 362 L VR + Sbjct: 341 LLQGVRRHVR 350 >UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY0_THASP Length = 394 Score = 350 bits (898), Expect = 5e-95, Method: Composition-based stats. Identities = 108/394 (27%), Positives = 170/394 (43%), Gaps = 38/394 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI IH L ++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 3 LPRFIQIHTLHTYPAALLNRDDAGLAKRLPYGGAIRTRISSQCLKRHWRVADDAFSLAKL 62 Query: 59 G-ESSLRTIHLAQLR-DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--- 113 G + RT ++A+L L ++ + + L + +K A+ Sbjct: 63 GVPMATRTRYVAELIRQRLIEQGIDEARAYATAEALLEALFGEKADKKKEGVKALQTGQA 122 Query: 114 --WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN-----LQQGVDIALSGRM 166 + EIA+ + D D L + + + + N L G++ AL GRM Sbjct: 123 VLFGNEEIAYLARRCRDITGDFSDPVALKAEVAKFLKEEKKNIEAMKLGSGLESALFGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSG 222 TS + L D ++S+AHA T H+ + D+FT VDD + GSA + E +SG Sbjct: 183 VTSDL---LANRDASVSVAHAFTVHEAQVENDYFTVVDDFAQAEDGAGSAGIFDTELASG 239 Query: 223 VFYRYANINLAQLQENLGGASRE-----------QALEIATHVVHMLATEVPGAKQRTYA 271 ++Y Y I++ QL NL G E A ++ H++H++AT PGAK+ + A Sbjct: 240 LYYGYVVIDVPQLVANLEGIKVEDVFTIGADKRGLAGKVVQHLLHLIATVSPGAKRGSTA 299 Query: 272 AFNPADMVMVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGA 328 ++ A V+V D P S+A AF + K + + YG+ A Sbjct: 300 PYDWAKFVLVEAGDWQPRSLAAAFHDPIPLKGDSSIRGRAASKLAKEIAAFDAAYGMPTA 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 SL D + + TL QL W+ Sbjct: 360 RRFLSL---DELAVPAAERATLSQLGEWIAQTVR 390 >UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ6_SALAI Length = 380 Score = 349 bits (896), Expect = 9e-95, Method: Composition-based stats. Identities = 109/387 (28%), Positives = 175/387 (45%), Gaps = 32/387 (8%) Query: 1 MS-NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ +++IHVL + + LNRDD+ K FG R R+SSQS KRA+R+ ++ G Sbjct: 1 MTARYVDIHVLQTVPYANLNRDDLGSPKTVRFGYADRTRVSSQSWKRAVRRE--LEESSG 58 Query: 60 ESSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 + + RT L Q +L +++ + + +K + +A Sbjct: 59 DKAKRTRRLPQAIQARLTGPDWDSELAAFAATQVMATLATIAVKADGFKVDKATGEAQVL 118 Query: 114 WVVGEIAWFC---------EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSG 164 + + E A+ +++ + L KK L D + + V I L G Sbjct: 119 FYLPERAFDMLADVCVQQRDRLIGLRSGALKLKKGEAPLPADAVRAAMEHRSDV-INLFG 177 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFS 220 RM VDGA+ +AHA TTH D +D+FTAVDDL+ + GS H+ + EFS Sbjct: 178 RMLAE---LPGSNVDGAVQVAHAFTTHGTDPQVDFFTAVDDLKQDADQAGSGHMNSAEFS 234 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +G FYRYA++NL L NLG A+E+ + T +P AK+ A F ++ Sbjct: 235 TGTFYRYASVNLEDLAHNLG--DPATAVELTRVFLSAFITAMPQAKKNATAPFTVPELAY 292 Query: 281 VNFS-DMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV 337 + D P+S+A+AFE V+A G+ +PS + +Y ++ G G S Sbjct: 293 IAVRTDRPVSLASAFETPVRATFDSGYAEPSRRQLAEYAGQIYRLIGDQGMVYHGCASVD 352 Query: 338 DPITAQ-VKQMPTLEQLKSWVRNNGEA 363 D Q + + + L + + A Sbjct: 353 DKGLEQLGETRQSFDNLIATAVDKLRA 379 >UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD49_CHRVI Length = 393 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 178/395 (45%), Positives = 237/395 (60%), Gaps = 40/395 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 NF+N HVLISHSPSCLNRDDMNMQK AIFGGK RVRISSQSLKRA+R S YYA+ S Sbjct: 5 NFVNFHVLISHSPSCLNRDDMNMQKTAIFGGKTRVRISSQSLKRAIRYSDYYARYFISKS 64 Query: 63 LRTIHLA-QLRDVLRQKLGERFDQKIIDK----TLALLSGK-SVDEAEKISAD------- 109 RT L ++ D L I+K A+ GK +DE K D Sbjct: 65 QRTRRLFDKMADELSASAESAEQTTAIEKCALYAAAIFEGKTKIDEIGKYERDKKSDHIE 124 Query: 110 -AVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ--GVDIALSGRM 166 + P+ EI + + EA +K ++ +K +I + + +D+ALSGRM Sbjct: 125 TQIIPFSCAEIEGIKQIL--LEAAGKPEKGRIEYMKAEIQRLEREQRTRIDLDVALSGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDI-DWFTAVDDLQ----EQGSAHLGTQEFSS 221 A S ++ VDGA+++AHAITTH V+ DWFTAVDDL E G+ HL TQ+FS+ Sbjct: 183 ANSELIYP---VDGALAVAHAITTHTVEPQDIDWFTAVDDLTLDAGETGAGHLNTQQFSA 239 Query: 222 GVFYRYANINLAQLQENLG----------GASREQALEIATHVVHMLATEVPGAKQRTYA 271 GVFYRYA++NL QLQ NLG SR +AL+IA HV+H+LAT VP AKQ+++A Sbjct: 240 GVFYRYASLNLRQLQFNLGLLANINAEQTTESRARALDIARHVLHLLATVVPSAKQQSFA 299 Query: 272 AFNPADMVMVNFSDMPLSMANAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGA 328 A N AD V+V+ +D P+S+ANAFE+ ++ + GFLQPSI A YW RV + YGL+ Sbjct: 300 AHNLADFVIVSLADQPVSLANAFEEPIERERKIGGFLQPSITALADYWSRVNSAYGLDEQ 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 A F+L + Q + + ++ L+ W+ N+G A Sbjct: 360 ARAFALRGGIKLGDQ-EVLTSIADLEQWLANDGRA 393 >UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR15_ROSS1 Length = 402 Score = 347 bits (890), Expect = 5e-94, Method: Composition-based stats. Identities = 114/401 (28%), Positives = 177/401 (44%), Gaps = 50/401 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS- 62 I +H+L +H+PS LNRDD N KDAIFGG RR RISSQ++KR++R S ++ Sbjct: 2 LIALHLLQNHAPSNLNRDDNNEPKDAIFGGVRRARISSQAIKRSIRWSDHFRAPFETQGL 61 Query: 63 --LRTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSV------DEAEKISADAVTP 113 +RT L + L DQ+ I + A L EA D P Sbjct: 62 LAIRTQLLPEKVRHHLVNAGLNDDDQRAIVEAAARLGKGEQRSPSGEGEAGDERGDQNQP 121 Query: 114 WVV---------GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNL---------- 154 E++ A+ L + ++ ++ + I +R Sbjct: 122 RSSSRSRRSSRQSNTTGDAERIKTAQLMFLTENEIQQLAQRLIEIVREKGAKHLNELQGD 181 Query: 155 ----------QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVD 204 VDIA+ GRM TS V+ A+ +AHAI+TH V+ + D++TAVD Sbjct: 182 TLVREIGEYEPHSVDIAMFGRMTTSS---PFKDVEAAVQVAHAISTHAVEMEFDFYTAVD 238 Query: 205 DLQ-EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVP 263 D+ E G+ +G F+S +Y+Y +I+ L +NL G + A + ++ +P Sbjct: 239 DISGEAGAGFIGDTTFNSATYYKYFSIDWDGLLKNLHG-EQNVARQSVEALIRAALFAIP 297 Query: 264 GAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRV 319 KQ ++AA N D+ +V LS ANAF K V+A ++ S +A +Y + Sbjct: 298 SGKQNSFAAHNLPDLALVEVRKENIALSYANAFVKPVRATGKLSLIEASAKALEEYIPAI 357 Query: 320 ANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Y L+ A S V + + LE+L +W+ Sbjct: 358 NERYNLSAQRAFLST--VPFTLSGAECCSDLEKLITWLSKQ 396 >UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=Dehalococcoides RepID=D0Y919_9CHLR Length = 427 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 101/387 (26%), Positives = 165/387 (42%), Gaps = 60/387 (15%) Query: 6 NIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS-LR 64 IH++ + +PS LNRDD K A FGG RR RISSQ KR+ R G A+ + +R Sbjct: 9 EIHLIQNFAPSNLNRDDTGQPKSATFGGFRRARISSQCSKRSTRLQGPLAELLENQGAVR 68 Query: 65 TI-HLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE----- 118 T + ++ + K E D++ I+ + ++ K S + Sbjct: 69 TRQLIMEIAKAIDTK--EEPDERTIEIVAGVFEAGGLERPAKRSGKVKSQAAEAIGEDGE 126 Query: 119 --------------IAWFCEQVA-----KAEADNLDD-----KKLLKVLKEDIAAIRVNL 154 I F +++A +N DD K++ + + + + Sbjct: 127 INGNEGFESGNKTKILLFLDKMAFPKLIDVFKENWDDLAKGNKEVKEKACDKVGRLLFEA 186 Query: 155 QQGVDIALSGRMATSGMMTELGK----VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--- 207 + DIAL GRM T GK V+ A +AH I+TH++D ++D++TAVDDL Sbjct: 187 VKAPDIALFGRMLEVKNNTPFGKYNMSVEAACQVAHPISTHKIDMEMDFYTAVDDLNPDG 246 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG---------------GASREQALEIAT 252 E G+ +G F+S +YRYA ++ QL NL E+A ++ Sbjct: 247 ETGAGMMGVVGFNSACYYRYALVDRDQLARNLARKTERKNGGWAQGLETQDYEEADKVVK 306 Query: 253 HVVHMLATEVPGAKQRTYAAFNPADMVMVNFS--DMPLSMANAFEKAVKA---KDGFLQP 307 + + +P KQ ++AA N + +P+S+ANAF ++ D + Sbjct: 307 AFLEAMIYAIPTGKQNSFAAQNLPSFGLFVKRKGGVPVSLANAFSTPIRPVRDDDDLVGL 366 Query: 308 SIQAFNQYWDRVANGYGLNGAAAQFSL 334 S+ A ++WD + YG G Sbjct: 367 SVNALTKHWDAIKELYGDQGIKVTSCF 393 >UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ62_9DELT Length = 341 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 98/331 (29%), Positives = 157/331 (47%), Gaps = 20/331 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD K A+FG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDFGSPKTALFGNVQRARVSSQCWKRAVRELMQEEVPALFGG 61 Query: 63 LRTIHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + L +L ++ G ++ A G +V + + T + + Sbjct: 62 QRTRLILDPLCRILHEQHGLAEEEAR---KKAEELGAAVSKLDTPPVRVKTLFFTSPLE- 117 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A A + KK +K L + L+ DIAL GRM S L +GA Sbjct: 118 -LEALAAAYVATGNAKKAVKELAKHP------LKDAADIALFGRMVASDHSLTL---EGA 167 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQEN 238 +HA++TH+V ++ID+F AVDDL E G+ GT EF+S +YR+A +NL L+++ Sbjct: 168 AMFSHALSTHKVSNEIDFFAAVDDLQPEDEAGAGMTGTLEFNSATYYRFAALNLDLLEQH 227 Query: 239 LGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAFEK 296 L S E+ E+ + V VPGA++ + A V+ + P+ + NAFEK Sbjct: 228 LSALSAEERREVVCNFVTATLRAVPGARKNSMNAATLPSHVLAVVREKGHPVQLVNAFEK 287 Query: 297 AVKAKDGFLQPSIQAFNQYWDRVANGYGLNG 327 V + G ++ S+ + + + +GL Sbjct: 288 PVWTRGGLMEESVSQLEREYTHLKETWGLEA 318 >UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI0_9BIFI Length = 381 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 92/375 (24%), Positives = 159/375 (42%), Gaps = 25/375 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ + I+ + + PS +NRDD K AI+GG R R+SSQ+ KRAMR++ + + Sbjct: 1 MTTIVEIYAIQNVPPSNINRDDTGNPKTAIYGGVLRARVSSQAWKRAMREAFPEMLDADQ 60 Query: 61 SSLRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA------VTP 113 +RT + LAQ+ + K + D + + K + + EK +T Sbjct: 61 LGIRTKNALAQIEQSIVAKRPD-IDVETVHKAATAALTATGAKVEKSKRKGSMEGADLTQ 119 Query: 114 WVVG----EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATS 169 +++ EI + + D K K +K ++A+ + Q VDIAL GRM Sbjct: 120 YLIFIANREIDKLADLAIAWIDADEDLDKPSKEMKGQVSAV-FHGPQAVDIALFGRMLAD 178 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFT---AVDDLQEQGSAHLGTQEFSSGVFYR 226 D + +AHAI+ +V + D+FT G+A L T F+S YR Sbjct: 179 A---PELNTDASAQVAHAISVDEVTPEYDYFTAIDDDAADDNAGAAMLDTVGFNSSTLYR 235 Query: 227 YANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD- 285 YA + + L E L A ++ V+ +P KQ T+A +V + Sbjct: 236 YATVAVDSLYEQLQSAD--MTVKAVDAFVNAFLRSMPTGKQNTFANRTLPTAALVVVRNS 293 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITA 342 P++ AFE+ V A+ + + + + + + YG AA ++ + Sbjct: 294 QPINPVEAFERPVHAERDKSISRVAAERLGRKLQDIQDTYGETPIAAWNIVAGQPVELLD 353 Query: 343 QVKQMPTLEQLKSWV 357 + + TL + + Sbjct: 354 SLSEHVTLPVMVESL 368 >UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=Anaeromyxobacter RepID=B4UE70_ANASK Length = 413 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 107/415 (25%), Positives = 179/415 (43%), Gaps = 55/415 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M+ F+ IH L S+ S LNRDD K FGG R R+SSQ LKR R G Sbjct: 1 MNRFVQIHTLTSYPASLLNRDDAGFAKRIPFGGVTRTRVSSQCLKRHWRTFEGEGALSGL 60 Query: 60 --ESSLRTIHLAQ---LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA----------- 103 S+R+ + ++ ++ + + +++ ++ + GKS A Sbjct: 61 GQPMSVRSRYTFDELVVQPLVGEGVPAELAREVTRALMSEVLGKSAKAAKADARADEKEE 120 Query: 104 ---------EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAA----- 149 + +T E+A+ E D K+ K + + + A Sbjct: 121 EEDKDAKTESTLQTGQITVLGRPEVAYLLELARTVCRKKPDPAKIAKAVSDHLGADGRKN 180 Query: 150 -IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-- 206 + L G+D A+ GRM TS + L + D A+ +AHA T H ++ D+F+AVDDL Sbjct: 181 LRELRLGAGLDAAMFGRMVTSDI---LARGDAALHVAHAFTVHGEATETDYFSAVDDLPM 237 Query: 207 ----QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 QGS H+G E +SG+FY Y I++ L NL G A R+ A ++A + Sbjct: 238 ARTEDGQGSGHIGNAELTSGLFYGYVVIDVPLLVSNLEGVDRKAWEKADRKLAAQLAERM 297 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAV---KAKDGFLQPSIQ 310 V ++AT PGAK + A A +V+ + P ++ANAF + V + + + + Sbjct: 298 VKLVATVSPGAKLGSTAPHAYAHLVLAESGNAQPRTLANAFLEPVVTGPRQPDPVAAAYR 357 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDPITA--QVKQMPTLEQLKSWVRNNGEA 363 A ++ + YG ++ D + + +L ++ +WV + Sbjct: 358 ALARHSADLDRMYGPAFQRRLAAIGPADGLADVLRAPANASLAEVATWVADQVRG 412 >UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=Acetobacteraceae RepID=A5FTJ7_ACICJ Length = 370 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 99/370 (26%), Positives = 156/370 (42%), Gaps = 26/370 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ +H+L PS +NRDD K A+ GG R+R+SSQ+LKRA R S +++ + G Sbjct: 1 MTQFLQVHLLTFFPPSNMNRDDTGRPKTAMVGGAMRLRLSSQALKRAWRTSTIFSEALKG 60 Query: 60 ESSLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 RT L ++ L+ + + + +A GK ++ + E Sbjct: 61 YMGERTQRLGEEILKTLQAEGVSEVQALAVARAVAGQFGKLNEDETPARIQQLAFISPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAA-------------IRVNLQQGVDIALSGR 165 + + A L + K + + DIAL GR Sbjct: 121 RKAAFDLARRYAAGELPLPEKAKGKRGKANKTEGEEEVEAPEILLLRESDTAADIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M + A +AHAITTH++ D D++TAVDDL ++ G+ +G F S Sbjct: 181 MLADKPAF---NREAAAQVAHAITTHRISVDDDYYTAVDDLKRPSEDAGAGFIGETGFGS 237 Query: 222 GVFYRYANINLAQLQENLGGAS--REQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 GVFY Y +IN+ L NLGG R+ A +V AT P KQ ++AA A + Sbjct: 238 GVFYTYMSINIDLLIRNLGGGDQARDLAATAIAALVEAAATTAPSGKQNSFAAHGRAGYI 297 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD 338 + P ++A AF K V+ + SI ++ + + YG A + + Sbjct: 298 LAERGKAQPRTLAGAFAKPVEG-GDIMDASIGRLEEFREAIDKAYGPTADATKVMRVGGE 356 Query: 339 PITAQVKQMP 348 A + Sbjct: 357 GSLADIIVFA 366 >UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS8_STRKN Length = 393 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 113/384 (29%), Positives = 169/384 (44%), Gaps = 33/384 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 + FI++H++ S + LNRDD N K +G R R+SSQS KRA R+ + + IG++ Sbjct: 5 ARFIDVHIVQSVPFANLNRDDTNSVKTVQYGNTLRTRVSSQSWKRATRE--VFQERIGQA 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT--------- 112 +LRT + + + G A + E K AD Sbjct: 63 ALRTRRIGERVTQELEGRGWPPALAQRAGGHAAAASSIKFELAKDPADNKQFLPNTVLTN 122 Query: 113 ------PWVVGEIAWFCEQVAKAEADNLDDKKLLKV--LKEDIAAIRVNLQQGVDIALSG 164 V E+A EQ + D KK L +D + + GV I L G Sbjct: 123 AMVYVPEAAVAELADLAEQHRQELESAKDIKKPADKSVLPKDAVEAVLRSRNGV-INLFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-----QEQGSAHLGTQEF 219 RM + VDGA+ +AHA+TTH+ D ++D+F+AVDD+ GS H+G EF Sbjct: 182 RMLAE---VDDAGVDGAVQVAHAMTTHETDVELDYFSAVDDITAAWKDSTGSGHMGHTEF 238 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S+G FYRYA ++L L N+GG R A E+ + +P AK+ + A D+V Sbjct: 239 SAGTFYRYATVDLRDLATNIGGEVRA-ARELIAAFLASYIESLPQAKKNSTAPHTIPDLV 297 Query: 280 MV-NFSDMPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ-FSLS 335 + SD PLS A AFEK V+ A GF + S Y G ++ Sbjct: 298 HISVRSDRPLSYAAAFEKPVRAGAPGGFGEVSRAELATYAQAANTLLGTGRIVTSGWASL 357 Query: 336 DVDPITAQVKQMPTLEQLKSWVRN 359 + +T + + + L + + Sbjct: 358 ETKDLTGLGTRHESFDDLITAALD 381 >UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RXJ6_RHORT Length = 381 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. Identities = 103/382 (26%), Positives = 173/382 (45%), Gaps = 30/382 (7%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S F+ IH L S++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 4 SRFLQIHSLHSYTAALLNRDDSGLAKRLTYGGSNRTRISSQCLKRHWRMAEHDPHALQTL 63 Query: 62 S-----LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 R+ L D++ + L R+ Q I+D + + ++ Sbjct: 64 GGYVGSFRSRELV--TDLVIKPLEGRYPQDILDVLEPEFQKLVYGDKADKGKKSRQTLLL 121 Query: 117 G--EIAWFCEQVAKAEADNLDDKKLLKVLKE-------DIAAIRVNLQQGVDIALSGRMA 167 G E+AW + + A D K L K + + + L G+ AL GRM Sbjct: 122 GQPELAWLARRAEELAAGANDAKALQKAVADWRKDANFKAMSENAALPGGLVAALFGRMV 181 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGV 223 TS +D + +AHA T H +++ D+FTAVDDL+ + G+ + E +SG+ Sbjct: 182 TSD---PAANIDAPVHVAHAFTVHAEEAEGDYFTAVDDLKKDESDSGADTIQETELTSGL 238 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FY Y I+L L N GG +E A ++ ++V+++A PGAK + A + AD++++ Sbjct: 239 FYGYVVIDLPGLIGNCGG-DKEIAAQVVNNLVYLIAEVSPGAKLGSTAPYGRADLMLIEA 297 Query: 284 SD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITA 342 D P S+A A+ KA+ + ++ A + ++ Y A SL++ Sbjct: 298 GDRQPRSLATAYRKAIAPD---REQAVAALDGCLAKLDATYETGEARRYLSLAETPLTGP 354 Query: 343 --QVKQMPTLEQLKSWVRNNGE 362 + +L+ L W + + Sbjct: 355 ATSGLEKLSLKALADWTASRVK 376 >UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF9_GRABC Length = 386 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 105/388 (27%), Positives = 162/388 (41%), Gaps = 39/388 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR----KSGYYAQN 57 F+ IH L S++ S LNRDD + K +G R RISSQ LKR R + Sbjct: 4 PRFLQIHSLHSYTASLLNRDDSGLAKRLPYGSAVRTRISSQCLKRHWRMDEGTFSLHRIE 63 Query: 58 IGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 E ++R+ L LR+ L D I++ + + P + G Sbjct: 64 GAEEAVRSRDLV--TKRLREPLQGTVDVNILNAIEPAFQAAVYGKKGADDKSSRQPLLFG 121 Query: 118 --EIAWFCEQVAKAEADNLDDKKLLKVLKE-----------DIAAIRVNLQQGVDIALSG 164 E+ + EQ + D K ++ V+L G+ AL G Sbjct: 122 APELRYLAEQFTRIATSATDPKSAKAAAEDFTKDKLFQNTMKAMRDSVSLPGGLTSALFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSS 221 RM TS +D + +AHA TTH ++ D+F VDDL ++ G+ H+G+ E +S Sbjct: 182 RMVTSD---PEANIDAPVHVAHAFTTHAEQTESDYFAVVDDLAGVEDTGADHIGSTELTS 238 Query: 222 GVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 G+FY Y I++ L NL G A R+ A E+ ++ +AT PGAK + A + Sbjct: 239 GLFYGYVVIDVPTLVSNLTGVAASNWLAADRKMAAEVTACLIGQIATVSPGAKLGSTAPY 298 Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 A ++V D P S+A AF + ++ + +Q Y Sbjct: 299 GYATTMLVEAGDRQPRSLAEAFRDPAEPT---VKDAEDKLHQKLKAFDEAYQTGEDRRLL 355 Query: 333 SLSDVDPITAQVKQMPTLEQLKSWVRNN 360 SLS+ I + +L +L WVR+ Sbjct: 356 SLSNDPGIKNVSRT--SLPELMQWVRDT 381 >UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=B6B782_9RHOB Length = 353 Score = 340 bits (872), Expect = 6e-92, Method: Composition-based stats. Identities = 107/330 (32%), Positives = 158/330 (47%), Gaps = 19/330 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ H+L ++ S NRDD K A+ GG R+RISSQSLKRA+R+S Y+AQ++ G Sbjct: 1 MTTFVQFHLLTTYPLSNPNRDDQGRPKQAMIGGSPRLRISSQSLKRALRESSYFAQDLAG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 + RT L L+ +L + + A G + EK S +A T + Sbjct: 61 HTGTRTRR---LATELKAELIGQGVEDAHADETATKIGAVFSKTEKGSTNATTLAFISPD 117 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 W +A+ A + L K+ AI VDIA+ GRM D Sbjct: 118 EW---ALARELAARDVAGEPLPAEKDLKKAILRRADGAVDIAMFGRMLADS---PDYNRD 171 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQL 235 A+ +AHA TTH+ + DWF+AVDDL+ + G+ H+G F SG++Y YA +N+ L Sbjct: 172 AAVQVAHAFTTHRAQAQDDWFSAVDDLKTREVDAGAGHIGEHGFGSGIYYLYACVNVDLL 231 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ENL G R A + + LAT P KQ ++A A + V P ++ AF Sbjct: 232 VENLAG-DRALAAKGMEALARALATATPKGKQNSHAHHPRAGFIRVERGQQQPRDLSGAF 290 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 K A + + S++A ++ YG Sbjct: 291 HKPTAADE---RASVEALQGMAAKIDRAYG 317 >UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP61_9PROT Length = 400 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 97/395 (24%), Positives = 169/395 (42%), Gaps = 42/395 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR-KSGYYAQNIGE 60 FI IH L ++ + LNRDD + K G R RISSQ LKR R +A + + Sbjct: 4 PRFIQIHTLHTYPAALLNRDDAGLAKRLPLGNAVRTRISSQCLKRHWRVVEDRFALSCLD 63 Query: 61 --SSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT 112 ++R+ +L + + + + + + D L GK + + Sbjct: 64 VPMAIRSRGTLELISKRIQESGVSETMAQAAAEAMRDAGLLDKGGKEKKGDDALKTGQAV 123 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVL-------KEDIAAIRVNLQQGVDIALSGR 165 EI + + +D +++K +++ E + G++ AL GR Sbjct: 124 LLGKPEIDYLVRRCVDLASDGVEEKGFKELITLWLKGKDEKRNIEALKHGSGLESALFGR 183 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M TS ++T + A+ +AHA T HQ + D+FT VDDL E GSA + E +S Sbjct: 184 MVTSDVLTS---REAAVYVAHAFTVHQAQVENDYFTVVDDLLQDAGELGSAGIFDTELAS 240 Query: 222 GVFYRYANINLAQLQENLGGAS-------------REQALEIATHVVHMLATEVPGAKQR 268 G++Y Y +++ QL +NL G R A ++ H++H++AT PGAK+ Sbjct: 241 GLYYGYVVVDVPQLVQNLEGEDFNECFASGTPADRRVLAGQVVQHLLHLIATVSPGAKRG 300 Query: 269 TYAAFNPADMVMVNFSD-MPLSMANAFEKAVK---AKDGFLQPSIQAFNQYWDRVANGYG 324 + A F+ A ++V D P S+A AF A+ + + ++ + + + YG Sbjct: 301 STAPFDWAKFMLVEAGDWQPRSLAGAFHDALPLSGSGGTIRERTVDRLTKEIAAMDDAYG 360 Query: 325 LNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 + ++ + + L L W + Sbjct: 361 APLSRRFLAIDQ--EVQVPGAERLNLASLADWAKE 393 >UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP1_SYMTH Length = 379 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. Identities = 104/383 (27%), Positives = 177/383 (46%), Gaps = 33/383 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ +H+L + + S LNRDD K +FGG RR RISSQ LKRA+R + Sbjct: 2 FVEMHLLQNFALSNLNRDDTGAPKSCVFGGTRRARISSQCLKRAVRTYVREQALVP---- 57 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLA-LLSGKSVDEAEKISADAVTPWVV----GE 118 + L+ L+++L R ++ A ++ ++++ E + T +++ E Sbjct: 58 -SELLSYRTKWLQRELANRLAAGGVEAEQAGQVAARALELLEFRLKNGRTEYLLMVGERE 116 Query: 119 IAWFCEQVAK--AEADNLDDKKLLKVLKEDIAAI---RVNLQQGVDIALSGRMATSGMMT 173 IA + + A D + K +++A + ++ VDIAL GRM + Sbjct: 117 IARIADLCREHAAALQGGDGGRKSKKEGDNLAGLFLKALDGGDAVDIALFGRMIATH--- 173 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAV------DDLQEQGSAHLGTQEFSSGVFYRY 227 VD A+ +AHA +T+ + ++ D+++AV DD + G+ LGT ++S +YRY Sbjct: 174 PEKNVDAAVQMAHAFSTNAIANEFDFYSAVDDLQQQDDDEGAGAGMLGTVLYNSSCYYRY 233 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 AN++L QL NLGG ++AL + + VP K+ A NP ++M + Sbjct: 234 ANVDLRQLLTNLGG-DPDRALTAVRAFLLGMVHAVPTGKRTNSAPQNPPALIMAVVREHG 292 Query: 288 -LSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGA--AAQFSLSDVDPITA- 342 S+ANAF V A+ ++ S + +W++++ YG G A + D I A Sbjct: 293 LWSLANAFVVPVSGARGNLMELSAKEMLAHWNQLSELYGQEGVHYAGLATYLSSDAIGAS 352 Query: 343 ---QVKQMPTLEQLKSWVRNNGE 362 + L L V + Sbjct: 353 NAVGIAVEKRLADLVDRVLAEVQ 375 >UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET5_RHOM4 Length = 423 Score = 333 bits (854), Expect = 8e-90, Method: Composition-based stats. Identities = 116/422 (27%), Positives = 188/422 (44%), Gaps = 63/422 (14%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-- 59 S F+ IH L ++ + LNRDD K FGG R R+SSQ LK R G Sbjct: 3 SAFVQIHTLTAYPAALLNRDDAGFAKRLPFGGAIRTRVSSQCLKYHWRNFSGEHALYGLD 62 Query: 60 -ESSLRTIHLAQL---RDVLRQKLGERF--------------DQKIIDKTLALLSGKSVD 101 SLR+ + R ++ + R D+ + L VD Sbjct: 63 VPRSLRSRETFKRCIARPLVEEGYPLRLVVAFALHLQKLIVSDESLSKTDFKKLMSDEVD 122 Query: 102 EA---EKISADAVTPWVVGEIAWFCEQVAKA----------EADNLDDKKLLKVLKEDIA 148 +A +++ ++ V E+ + ++ + A L D++L +V +E A Sbjct: 123 DATLLDQLKSNQVIILGRPEVDYLTRRIRERLDALREVWADAAAPLSDEQLERVYQELQA 182 Query: 149 AIRVNLQQ---------GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDW 199 + L++ G+D AL GRMATS + L + D A+ +AHA TTH +S+ D+ Sbjct: 183 IGKGELKKNLKGLYLAAGLDAALFGRMATSDV---LARGDAAIHVAHAFTTHAEESESDY 239 Query: 200 FTAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASRE 245 FTAVD+L E GS HL QE +SG+FY Y +++ L NL G A R Sbjct: 240 FTAVDELVAQEGEGELGSGHLNNQELTSGLFYGYVVVDVPLLVSNLEGVPPAAWQEADRT 299 Query: 246 QALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMANAFEKAVKAKD-G 303 A E+ ++H++AT PGAK + A A ++V + P ++ANAF + V G Sbjct: 300 LAAEVVRRLLHLIATVSPGAKLGSTAPHAYAQFMLVEWGRSQPRTLANAFHRPVSLDGEG 359 Query: 304 FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVK--QMPTLEQLKSWVRNNG 361 L S +A +Y +++ YG ++ + + Q++ + + ++ WV Sbjct: 360 VLVNSYRALGRYVEQMDRMYGKLTERRLAAIDLPEAVQRQLQVDTLNAVPEIADWVAEKI 419 Query: 362 EA 363 + Sbjct: 420 QG 421 >UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA32_ALHEH Length = 385 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 99/385 (25%), Positives = 174/385 (45%), Gaps = 36/385 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R++ ++ S + Sbjct: 2 FLQIHTLTSYHAALLNRDDAGLAKRIPFGSAERMRVSSQCLKRHWRQALKDVISLP-SGI 60 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV-----TPWVVGE 118 RT H + +V R+ + E + + + L + E D++ + E Sbjct: 61 RTRHFFER-EVCRRVIAEGVEDEKARELTGKLIDAVMHSKEAREKDSLFLKQPVLFGRPE 119 Query: 119 IAWFCEQVAKAEADNLDD----KKLLKVLKEDIAAIR-----VNLQQGVDIALSGRMATS 169 +F + + D K +K K++ A+ +L+ G++ AL GR TS Sbjct: 120 ADYFVSLITECARSGEDPGSTLKDRVKAEKKNFRALLQAAGGSDLESGIEGALFGRFVTS 179 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGVFY 225 + L + D ++ +AHA T H +++++D+FT VDDL+E G+AH G E +G+FY Sbjct: 180 DI---LARTDASVHVAHAFTVHSLNNEVDYFTVVDDLKEPGEDAGAAHAGDMELGAGLFY 236 Query: 226 RYANINLAQLQENLGGASR----------EQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 Y +++ L NL G R A ++ +VH +AT PGAK A + Sbjct: 237 GYVVVDVPLLVSNLSGCERQAWREQTEACADARDVLAALVHSIATVSPGAKLGATAPYAR 296 Query: 276 ADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 D ++ P ++ANA+ + + A+ +Q S+ Y + + +G + + Sbjct: 297 TDCALLETGTTQPRALANAYLEPLPARGDLMQQSVNTMGHYLKSLDDMFGEETSRFVSAT 356 Query: 335 SDVD--PITAQVKQMPTLEQLKSWV 357 D P + T++ + Sbjct: 357 RDTTSLPCAHRGPLSETIDGALDSI 381 >UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R6_9BACT Length = 400 Score = 329 bits (845), Expect = 7e-89, Method: Composition-based stats. Identities = 100/401 (24%), Positives = 179/401 (44%), Gaps = 45/401 (11%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGY-----YAQ 56 F+ I L ++S S LNRDD + K FG R RISSQ LKR R +G A Sbjct: 5 PRFVQISTLTTYSASLLNRDDSGLAKRIPFGDSVRTRISSQCLKRHWRNAGGPYGLDKAG 64 Query: 57 NIGESSLRTI-HLAQLRD--VLRQKLGERFDQKIIDKTLALLSGKSV------------- 100 + S+R+ +L + ++ + L ++ K LL Sbjct: 65 DALSLSVRSRFSFPELIEKPLVAEGLEQKLVVSGSQKLQQLLYNGEEKGDTKKDKKKKIE 124 Query: 101 --DEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ-- 156 ++ + + E+ + + + A + + + K++ +K+ + NL Sbjct: 125 LDEDGYSAKRNELVVLGRPELEYLKQIIRDAISSSSNIKEIDNAVKDFYTKRKSNLLALR 184 Query: 157 ---GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAH 213 GVD A+ GR + + KV A+ +AH+ T H S+ D+FTAVDDL EQG+ H Sbjct: 185 AGCGVDAAMFGRFVSGDV---DAKVTAAVHVAHSFTIHGEQSETDYFTAVDDLVEQGTGH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGA 265 + E ++G++Y Y +++ QL NL G A R A ++ ++++H++AT PGA Sbjct: 242 INAAELNTGIYYGYVVVDVPQLISNLCGCDSKNSADADRTLAAQVTSNLIHLMATVTPGA 301 Query: 266 KQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANG 322 K A + + +V+ +SD P ++A+AF + +K + ++Q +Y + Sbjct: 302 KLSGTAPYAASWLVLAEWSDSQPRTLADAFFEGLKLGSDGSARSLAVQMLAEYIRKYDAM 361 Query: 323 YGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 Y + ++P + +L++L V+ E Sbjct: 362 YTPQLTRR---CASIEPCQIPGAENGSLDELCEAVKLAIEG 399 >UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y487_9BACT Length = 408 Score = 329 bits (844), Expect = 1e-88, Method: Composition-based stats. Identities = 105/404 (25%), Positives = 176/404 (43%), Gaps = 55/404 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI I L ++ S LNRDD + K FGG R R+SSQ LKR R + + QN+ Sbjct: 4 LPRFIQISTLTTYPASLLNRDDSGLSKRIPFGGVSRTRVSSQCLKRHWRMADGLWSLQNV 63 Query: 59 GE---SSLRTIHLA--QLRDVLRQKLGERFDQ--KIIDKTLALLSGKSVDEAEK------ 105 + +S+R+ + ++ L +K G ++ + L G EA Sbjct: 64 DKDIATSIRSRRIFPEKIEKPLIEKEGLDAEKVVAASQALQSELYGAKGTEAAAKNKKTA 123 Query: 106 -ISADAVTP---------------WVVGEIAWFCEQVAKAEAD-------NLDDKKLLKV 142 ADA+ P EI + + V + + + K Sbjct: 124 KDDADALNPSIDAQLSAERSELVVLGHPEIQFLSKIVREMASADGSAADVGKKTGEWFKK 183 Query: 143 LKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTA 202 K+D A++ G+D A+ GR + +V A+ +AHA T H +S+ D+FTA Sbjct: 184 HKKDFQALKCGA--GLDAAMFGRFISGDT---DARVSAAVHVAHAFTVHAEESETDYFTA 238 Query: 203 VDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 VDDL GSAH+ E +SG+FY Y +++ QL N+ G A R+ A + H+ Sbjct: 239 VDDLNNSGSAHINAAELTSGIFYNYVVVDVPQLVSNIEGCPSKQWQTAQRDVAGRLVKHL 298 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFN 313 +H++AT PGAK + A + VM + P ++A+AF V + +++ Sbjct: 299 LHLIATVTPGAKLGSTAPYARPWFVMAEAGESQPHTLADAFYLPVPLRGDMRAQALRQLE 358 Query: 314 QYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWV 357 Y + YG + S+ D ++ + +L+++ + Sbjct: 359 DYVGKSDEMYGSDERRWIASMYD---VSIPRGENSSLDRMGESL 399 >UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=Bacteria RepID=B8FDH9_DESAA Length = 383 Score = 327 bits (840), Expect = 3e-88, Method: Composition-based stats. Identities = 111/383 (28%), Positives = 176/383 (45%), Gaps = 46/383 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 + H+L S +CLNRDD+ K A+ GG R R+SSQ KR +R + +++G Sbjct: 11 VEFHILQSFPVTCLNRDDVGAPKTAVVGGATRARVSSQCWKRNIRLT---MKDLGVPIGS 67 Query: 64 RTIHLAQLRDVLRQKLGERFDQK-----------IIDKTLALLSGKSVDEAEKISADAVT 112 RT + Q+ + +LG DQ I +K G + +DA+ Sbjct: 68 RTKLIHQMIEDACAELGADTDQAQACAAQVASVFIKEKKGKKDDGDDSEGNGSDKSDALI 127 Query: 113 PWVVGEIAWFCEQVAKA------EADNLDDKKLLKVLKEDIAAIRVNL-------QQGVD 159 E+ + + + + ++ K KV K + NL + GVD Sbjct: 128 FLSREEVKKIALALRENNFSTEFQEEKVNKKGDAKVEKIKLEKKIQNLLGKPDFSRDGVD 187 Query: 160 IALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAHLGTQE 218 IAL GRM V+GA S +HAI+TH+V +++++FTA+DDLQ E GSAH+G E Sbjct: 188 IALFGRMVAQAAA---LNVEGAASFSHAISTHKVTNEVEFFTALDDLQTEPGSAHMGALE 244 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F+S +YRY +++ QL +NL G QALE V L +P A+Q T + + Sbjct: 245 FNSATYYRYVCLDMGQLWKNLAGQHLPQALEG---FVKALYLALPSARQATQSGACWWEF 301 Query: 279 VMVNFSDMPLSMANAFEKAVKAK-DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSD 336 V + F+ AVK + G L+PS A Y ++ G L A+F+ + Sbjct: 302 AKVFVR-KGQRLQAPFDTAVKPRNGGLLEPSKDALCAYLEKKEQQAGSLFRKIAEFTFGE 360 Query: 337 VDPITAQVKQMPTLEQLKSWVRN 359 + P+++ L +++ Sbjct: 361 DNG--------PSIDDLVLSIQD 375 >UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X7_9DELT Length = 385 Score = 326 bits (835), Expect = 1e-87, Method: Composition-based stats. Identities = 103/387 (26%), Positives = 167/387 (43%), Gaps = 43/387 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 MS FI +H+L S+ + LNRDD+ K FG R+R+SSQSLKRA R S + +G Sbjct: 1 MSRFIQLHILTSYPAANLNRDDLGAPKSMRFGEANRLRVSSQSLKRAWRTSDVFKATLGA 60 Query: 60 -ESSLRTIHLAQLR-----------------------DVLRQKLGERFDQKIID-----K 90 +RT L + L++K + I K Sbjct: 61 DHLGVRTKELGRKVFCALTQGASLDAVWDAPDATGTLAALKEKTAAEIARTIAGVFGKIK 120 Query: 91 TLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDD-KKLLKVLKEDIAA 149 A + + K + + + ++A ++ +A A + + K + Sbjct: 121 KEADAKAEKDADPVKKRKELLDSLEIEQLAHVSQEERRAVAALTEACRDAGKAPDANALN 180 Query: 150 IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-- 207 + + + DIA+ GRM + V+ A+ +AHA+T H+ ++ D+FTAVDDL Sbjct: 181 LLRSDAKAADIAMFGRMLAASARF---NVEAAVQVAHAVTVHRAVAEDDFFTAVDDLNRD 237 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQ 267 + G+ H+G EF +GV+Y Y I+ A L ENLGG + T + T P KQ Sbjct: 238 DAGAGHMGVSEFGAGVYYLYLCIDRALLAENLGG-DEALVQKALTALTTAACTVAPTGKQ 296 Query: 268 RTYAAFNPADMVMVNFS-DMPLSMANAFEKAV----KAKDGFLQP-SIQAFNQYWDRVAN 321 +YA+ A + D P +++ AF K V + +DG L +I + ++ Sbjct: 297 ASYASRAYACFALAEKGDDTPRNLSLAFLKPVGEREEERDGHLGKTAIAELLKTKAKMDK 356 Query: 322 GYGLNGAAAQFSLSDVDPITAQVKQMP 348 YG A F++ D A++ Sbjct: 357 VYGQTLADTSFNVFDGKGTLAELAAFV 383 >UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RY18_RHORT Length = 359 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 105/373 (28%), Positives = 161/373 (43%), Gaps = 32/373 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ +HVL +++ S LNRDD K FGG R+R+SSQSLKRA R+S + + G Sbjct: 1 MSRFLQLHVLTAYAASNLNRDDTGRPKTLNFGGAERLRVSSQSLKRAFRQSELFQSRLPG 60 Query: 60 ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 E R+ A+ L L + E + + L + K + + E Sbjct: 61 ELGTRSQDFAKALVSALVARGVEEAEAITRAEALIDHDKLGKVKKGKAQTEQLVHLGPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 +A + L + + + + VDIA+ GRM V Sbjct: 121 LAAIDALAERLATSAT--------LDDKAMLVLKSKPRAVDIAMFGRMLAGNPGF---NV 169 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ------GSAHLGTQEFSSGVFYRYANINL 232 + A+ +AHA TTH+ + D++T VDD++ G+ LG E+ SG+FY Y IN Sbjct: 170 EAAVQVAHAFTTHRATPEDDYYTTVDDIKNADQEEDRGAGFLGILEYGSGLFYLYICINA 229 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMA 291 L +NL G + A E A ++ T P KQ T+A+ ++ + P S+A Sbjct: 230 DLLVDNLAG-DQALAAEAAALLIEAACTISPTGKQNTFASRARGLYALLEIGEETPRSLA 288 Query: 292 NAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMP 348 AF+ AV ++ L SIQ + + YG N +L DP T Sbjct: 289 AAFQYAVGSRATEADHLAASIQRLTALREGFSKAYGENL--RSVALDVTDPATPG----- 341 Query: 349 TLEQLKSWVRNNG 361 L+ L + R+ Sbjct: 342 -LKALIAAARDAV 353 >UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK1_GEOUR Length = 408 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 105/402 (26%), Positives = 170/402 (42%), Gaps = 53/402 (13%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE-- 60 + +H++ S +CLNRDD+N K A+FGG +R R+SSQS KRA+R+ + Sbjct: 2 KHLELHIIQSVPVACLNRDDLNSPKTAVFGGVQRARVSSQSWKRAIREMAKEIAAEEKSD 61 Query: 61 --SSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALL---SGKSVDEAEKISADAVTPW 114 S RT + L L +K I + +A + VD V + Sbjct: 62 LFSGDRTRRMVYTLSTRLAEKGITSQAAIAIAEQVADVVETLDSKVDSEGYKKIKTVMFF 121 Query: 115 VVGEIAWFCEQVA------------KAEADNLDDKKLLKVLKEDIAAIRVN--------- 153 E E +A + A +D++ K LK + + Sbjct: 122 SKAEYDAIAEAIATSDEVKNSVEALEKAAVEGNDREREKALKAMVKILEKGAISKTIKSA 181 Query: 154 -LQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--EQG 210 L+ DIAL GRM + KVDGA AH ++TH+ D++ID+F AVDDL E G Sbjct: 182 QLKDAADIALFGRMVAND---PSLKVDGASMFAHILSTHKADNEIDFFAAVDDLNKDESG 238 Query: 211 SAHLGTQEFSSGVFYRYANINLAQLQ--ENLGG---------ASREQALEIATHVVHMLA 259 + T EF+S +YR+A +NL L ++LG S E ++ + + Sbjct: 239 AGMTSTLEFNSATYYRFAALNLDALANDDHLGDITLKDGTVVRSVETRKQVVKTFLKAII 298 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAFEKAV-KAKDGFLQPSIQAFNQYW 316 +P A++ T V+ + P+ + NAFE V +++ GF+ SI N + Sbjct: 299 QSIPSARKTTMNGNTLPVYVLGVVREKGHPIQLINAFETPVRRSEKGFVTESINRMNIEY 358 Query: 317 DRVANGYGLNGAAAQF----SLSDVDPITAQVKQMPTLEQLK 354 + +G++ A+ SL + + + + L Sbjct: 359 ADLKETWGVDSLFAKAVVKGSLKEQIKANQGSIETCSQDDLI 400 >UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV95_9BACT Length = 393 Score = 323 bits (828), Expect = 7e-87, Method: Composition-based stats. Identities = 110/389 (28%), Positives = 176/389 (45%), Gaps = 52/389 (13%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 H+L S +CLNRDD+ K A+ GG +R R+SSQS KRA+R + + ++G + Sbjct: 13 FEFHILQSFPVTCLNRDDVGSPKTAMIGGSQRARVSSQSWKRAVRLAMH---DLGVTHGV 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKII--DKTLALLSG---------------------KSV 100 RT ++ L + LG +Q DK A+ + Sbjct: 70 RTKLISPLIAEACRSLGATPEQARACGDKVEAVFIKKDEKGKKKSAKTKGDSDTQDEEVG 129 Query: 101 DEAEKISADAVTPWVVGEIAWFCEQVAKAEAD------NLDDKKLLKVLKEDIAAIRVNL 154 ++ D + EI+ + K E D D KK K + + I + Sbjct: 130 SDSSSEKTDTLLFLSPKEISVLANEFKKQEFDPGKVIVQSDPKKQAKEIADMIGKVP-EG 188 Query: 155 QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAH 213 VDIAL GRM V+ A S AHAI+TH+V +++++FTA+DD + G+AH Sbjct: 189 IDAVDIALFGRMVAQAA---ELNVEAAASFAHAISTHKVANEVEFFTALDDCAVDPGAAH 245 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G+ EF+S +YRY +++L QL + L G + V L VP A+Q T + Sbjct: 246 MGSLEFNSATYYRYVSLDLGQLSQTLAGQHIPET---IEAFVKALFVSVPAARQSTQSGA 302 Query: 274 NPADMVMVNFSDMPLSMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGYG-LNGAAAQ 331 +P D + + FE A+K+ GFL+PSI+ Y +R +G L G A+ Sbjct: 303 SPWDFAKILVR-TGHRIQIPFETAIKSKDGGFLKPSIEEMKAYLNRQEKLHGSLFGKKAE 361 Query: 332 FSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 ++ + + T++ L S ++ Sbjct: 362 YTYGED--------ENFTIDDLISALKQQ 382 >UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J368_DEIGD Length = 385 Score = 322 bits (827), Expect = 1e-86, Method: Composition-based stats. Identities = 116/390 (29%), Positives = 169/390 (43%), Gaps = 35/390 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 M + +H L + +PS LNRDD KDA FGG RR+RISSQ+ KRAMR+ Sbjct: 1 MKALLELHYLQNFAPSNLNRDDTGSPKDAFFGGTRRLRISSQAFKRAMRQDFGGRELLRP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 E +RT + L G +Q LAL + K + E Sbjct: 61 EEIGVRTKRAHEAIAELLAGEGRTEEQCRAAAELALGGLGLPVKDGKN--QYLLFLGRDE 118 Query: 119 IAWFCEQV----AKAEADNLDDKKLLKVLK------------EDIAAIRVNLQQGVDIAL 162 + + + A+ +A + + K A ++ + VD+AL Sbjct: 119 LRRVADIIGANWAEFQAAAPEPESTDGKKKKASKKAALSGDLGKQLAGALDGSKAVDVAL 178 Query: 163 SGRMATSGMMTELGKVDGAMSIAHAITTHQV-DSDIDWFTAVDDLQ---EQGSAHLGTQE 218 GRM D A +AHAI+TH + + D++TAVDDL+ G+ LGT E Sbjct: 179 FGRMLAD---LPDKNADAAAQVAHAISTHALRERQYDFYTAVDDLKPDDNAGADMLGTVE 235 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F+S YRYA I+L +L ENL G RE ++ P KQ T+AA N + Sbjct: 236 FASATVYRYACIDLGKLLENLQG-DRELLERGLRAFLYASVYAAPTGKQNTFAAHNLPGL 294 Query: 279 V--MVNFSDMPLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 + +V + P ++ANAFEK V+A+ G+L PS+ A +G G A + Sbjct: 295 MVQVVRRNASPRNLANAFEKGVRAEGGQGYLAPSVAALADEMRWQNGVFGDAGTARFVAR 354 Query: 335 SDVDPITAQVKQMPTLEQLKSW-VRNNGEA 363 D + + MP + L V + A Sbjct: 355 EGGDAVF--GEAMPNVAALIDATVADALSA 382 >UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=Bacteria RepID=B4S8P9_PROA2 Length = 347 Score = 322 bits (825), Expect = 1e-86, Method: Composition-based stats. Identities = 106/361 (29%), Positives = 170/361 (47%), Gaps = 33/361 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 I H+L S +CLNRDD+ K AI GG R R+SSQ KR +R S Q+ G + + Sbjct: 12 IEYHILQSFPVTCLNRDDVGAPKTAIVGGSTRARVSSQCWKRQVRLS---MQDFGIKLGI 68 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R+ +++ + QK + A GK + + S D + + E F Sbjct: 69 RSKKVSEFV-------AKACLQKGASEEQAAECGKVISD--SFSKDTLFFFSESEAQAFA 119 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 + + D+ + K +++ G+DIAL GRM ++ A S Sbjct: 120 DYAREKNFDSKNLND--KEIRKVAKKALNPAIDGLDIALFGRMVAQAT---DLNIEAAAS 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDL-QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGA 242 +HAI+TH+V +++++FTA+DDL +E GSAH+G+ EF+S +YRY +++L QL E++GG Sbjct: 175 FSHAISTHKVSNEVEFFTALDDLAEEPGSAHMGSLEFNSATYYRYISLDLGQLWESIGG- 233 Query: 243 SREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA-K 301 E E + L VP A+Q T + +P + + + FE AVKA Sbjct: 234 --EHLAEAVESLTKALFVAVPSARQTTQSGASPWEFAKIFIR-KGQRLQVPFETAVKAKD 290 Query: 302 DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 G+LQPSI A Y + G L G +F+ + +++ L ++ Sbjct: 291 GGYLQPSITALTDYLTKKEALAGSLFGKEKEFTFGED--------VNFSIDDLIKGLKLT 342 Query: 361 G 361 Sbjct: 343 V 343 >UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD5_SACVD Length = 368 Score = 312 bits (799), Expect = 2e-83, Method: Composition-based stats. Identities = 97/374 (25%), Positives = 161/374 (43%), Gaps = 24/374 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L + S +NRD++ K +GG R+R+SSQ+ KRA+RK+ Q++ + + Sbjct: 2 FVDIHALHTLPYSNVNRDNLGAPKSCWYGGTERIRVSSQAWKRAIRKAV--EQDLEQPTE 59 Query: 64 RTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA-VTPWVVGEIAW 121 RT +A L +L ++ D + + + G + + TP +A Sbjct: 60 RTRRIASLVAGILTERGWGAEDARRAGRAVIYAYGLEPAADDDDTDTLLWTPPAAEALAG 119 Query: 122 FCEQVAKAE------------ADNLDDKKLLKVLKEDIAAIRVNLQQGVD-IALSGRMAT 168 E+ A N K + +K ++ L + IAL GRM Sbjct: 120 VVEKHRDTVVTLPLPKGEGKKAKNPPAKDITDAVKPMAGEVKSILNRTTPTIALLGRMLA 179 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD-LQEQGSAHLGTQEFSSGVFYRY 227 + G IAHA T H+ + D+FTAVDD G+ H+ T +F++G FYRY Sbjct: 180 D---RPDHTIYGLAEIAHAFTVHEAAPEFDYFTAVDDRAANTGAGHVNTAQFTTGTFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 ++IN+ +L + +G A + T P KQ AA AD+ + + P Sbjct: 237 SSINITRLVDVVGEQD---ARAVLLAWARRFITVTPAGKQTATAARTAADLAHIVVRNAP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 S A AFE + + G+L P+ +A Y R+A G ++ + + + Sbjct: 294 QSYAPAFETPIVSTGGYLDPAARALGDYATRLAAYLGDTPVEHGYATTLPTNVDGLGGRF 353 Query: 348 PTLEQLKSWVRNNG 361 TL+ L + Sbjct: 354 DTLDTLINATVGAV 367 >UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD1_METCA Length = 414 Score = 304 bits (778), Expect = 5e-81, Method: Composition-based stats. Identities = 106/411 (25%), Positives = 176/411 (42%), Gaps = 61/411 (14%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R+S + + L Sbjct: 2 FLQIHSLTSYHATLLNRDDAGLAKRIPFGDAVRLRVSSQCLKRHWRESLKQTIPLP-TGL 60 Query: 64 RTIHLAQLR--DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA------------- 108 RT H+ + L+Q+ E K + +L L + D+ K Sbjct: 61 RTRHVFEREIYPRLKQEGVEDSLAKQLTLSLMGLLLQKSDKTAKPEKAKKGKNGHEEQAE 120 Query: 109 -------------------DAVTPWVVGEIAWFCEQV-AKAEADNLDDKKLLKVLKED-- 146 + E+ + + A AE + +K L LK D Sbjct: 121 FDFEEGAGTEESSAGDLRVKQPILFGRPEVDYLISLLKACAEEGSGAEKALQAKLKGDKA 180 Query: 147 ------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWF 200 AA +L G++ AL GR TS + L + D A+ +AH+ T H +D+++D+F Sbjct: 181 NFKAMLKAAGHGDLYAGLEGALFGRFVTSDV---LSRSDAAVHVAHSFTVHGLDTEVDYF 237 Query: 201 TAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGAS--------REQALE 249 T VDDL +E G+AH G E +G+FY Y +++ L NL G + Sbjct: 238 TVVDDLNREEETGAAHAGDMELGAGLFYGYVAVDIPLLVSNLTGCDTTRWAEQEPADVRK 297 Query: 250 IATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPS 308 + T ++ +AT PGAK A + ++ V++ P +++NA+ +A+ + LQ + Sbjct: 298 VLTGLIRAIATVSPGAKLGATAPYAFSEFVLLETGKQQPRALSNAYLQALPMRGDPLQAA 357 Query: 309 IQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 I A +Y + YG + S++ A + +L+ + Sbjct: 358 IDALAKYLRALDAMYGRTSDSR--SVASTRAFDADLAPTNSLDASIGAALD 406 >UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=Actinomycetales RepID=D1A6Q4_THECD Length = 399 Score = 293 bits (750), Expect = 7e-78, Method: Composition-based stats. Identities = 116/388 (29%), Positives = 173/388 (44%), Gaps = 39/388 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 FI H++ + + LNRDD N K +GGK R R+SSQ KRAMR E++ Sbjct: 7 RFIEAHIIQAIPFANLNRDDTNAVKTVTWGGKERTRVSSQCWKRAMRLY-LQTSLGQEAA 65 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD------------ 109 LRT L + L L + G D +++ EA K D Sbjct: 66 LRTRRLPEYLARHLEEHHGWPADLAERAGRHIVVASSVGGEAPKKKTDGEETGGTGEHWS 125 Query: 110 -----AVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKED------IAAIRVNLQQGV 158 + V E+A Q +A + + K K ++D + + GV Sbjct: 126 TAAMVYIPSSAVPELAELAIQYREALENAKEPKDPAKFGRKDSVIPTGKVDEILRRRNGV 185 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE-----QGSAH 213 I L GRM + +VDGA+ +AHA TTH ++ID+F+AVDD+ + GSAH Sbjct: 186 -INLFGRMLAQ---VDDAEVDGAVQVAHAFTTHATTTEIDYFSAVDDVTDIWGDTTGSAH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G E S+GV YRY ++L L NLGG E E+A ++ +P AK+ + A Sbjct: 242 MGQAEHSAGVLYRYIVLDLNDLHANLGG-DLEATRELAAGLLKAALLSLPRAKKNSTAPH 300 Query: 274 NPADMVMVNFS-DMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAA- 329 + + D P+S A AFEK V A G +PS+ A N+Y V G +G Sbjct: 301 TIPHLAHLTVRTDRPVSYAGAFEKPVPADRHGGHSEPSVAALNEYAAAVQKLLGTSGCRY 360 Query: 330 AQFSLSDVDPITAQVKQMPTLEQLKSWV 357 A + + I A +++ + ++L Sbjct: 361 AAHATLSQEKIDALGERVESFDKLIEGA 388 >UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Actinomycetales RepID=C2GEY7_9CORY Length = 356 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 92/360 (25%), Positives = 148/360 (41%), Gaps = 38/360 (10%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSN + +H L S S LNRDD + K + GG R SSQS+KR R Y + Sbjct: 1 MSNQLTLHFLCSIPYSNLNRDDTGVPKRVMQGGALRALHSSQSIKRGSRV--LYENASQD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGK-SVDEAEKISADAV-TPWVVGE 118 S+R+ L + ++ D+K K A L G + EA+ DA + W+ E Sbjct: 59 LSIRSGRLDEEVAEKAMEMNPDLDEKTALKQAAKLIGNLTKGEAKSGEGDAKRSTWLSSE 118 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 A A++ D ++ I N + IA GRM + Sbjct: 119 EILTA---ATYVANSTDPREKF---------IDGNTTGSLAIAAFGRMFANAT---DLNT 163 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQ 234 + A++++ AITTHQ + D+F+ DD+ + + +L ++SG FYR I+ Q Sbjct: 164 EAAVAVSPAITTHQATIETDYFSTADDINLRDHKANATYLDVSLYTSGTFYRTVTIDRNQ 223 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAF 294 L+ + G E V L P K+ + A F +++ + +A F Sbjct: 224 LRTSWSGFESNSVRENLEAFVRSLVYGQPRGKKNSTAPFTMPSLILAE--EQQYRVAYDF 281 Query: 295 EKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLE 351 E+ V+A GF++ SI+ + + A F + P+ A P L+ Sbjct: 282 ERPVEADKDGGGFMKSSIEKLAKQYT----------LARSFDPGNFGPVEALSGTYPDLD 331 >UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I6_9LACO Length = 311 Score = 276 bits (706), Expect = 9e-73, Method: Composition-based stats. Identities = 79/309 (25%), Positives = 143/309 (46%), Gaps = 26/309 (8%) Query: 61 SSLRTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKIS-ADAVTPWVVGE 118 + +RT+ L L+++ + + + + + + + +K + A+ G+ Sbjct: 14 AGIRTMRGPLLLANELQKQDSNLSSDEAMAQAVDVFNKAKIKLDKKTNQTKALLMLSHGQ 73 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 IA E V + D LD K + + L+ +D+AL GRM V Sbjct: 74 IAKLAEYVRQN--DELDSKAVKEALQ---------GDHSLDMALFGRMVADD---PSLNV 119 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQL 235 D A +AHAI+TH++ + D++TAVDD + E GSA +GT E+ S YRYAN+N+ +L Sbjct: 120 DAACQVAHAISTHEIVPEYDYYTAVDDEKADDESGSAMIGTIEYDSATLYRYANVNVNEL 179 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ++LG + A++ V +P KQ ++A V+V D P+++ +AF Sbjct: 180 VQSLG--DVDTAVKGLQLFVKDFVLSMPTGKQNSFANKTVPQYVLVTVREDTPVNLVSAF 237 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLK 354 E+AVK++ G+LQPS+ + + S+ + + + ++ L Sbjct: 238 EEAVKSRHGYLQPSVAKLEKEYQDTQQFVQTP----LASVVVTNKESKISTKAADVDDLV 293 Query: 355 SWVRNNGEA 363 S + E+ Sbjct: 294 SKITEVIES 302 >UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM4_RHOCS Length = 435 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 95/425 (22%), Positives = 159/425 (37%), Gaps = 73/425 (17%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 I HVL + P +NRD+ K GG R RISSQ+ KRA+R + ++ + + R Sbjct: 15 IQFHVLTAFPPHNVNRDEDGRPKTCQLGGVTRGRISSQAKKRALRLAPHFPTA--QRATR 72 Query: 65 TIH--------------------LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE 104 T A L G + + + +A K ++A Sbjct: 73 TRKAGIHTFLKLTAAGIDTTSAVWAALAVNHATGGGGKPPKAEDAQAIAAPDPKKQEDAY 132 Query: 105 KISADAVT---------------PWVVG-------------EIAWFCEQVAKAEADNLDD 136 K AVT W+ G E A E +A A D Sbjct: 133 KKKEKAVTDMMEKRGLDRAAAEQEWLTGQVGTEQGLVISTREFARIEEGIAHLTAAWAAD 192 Query: 137 KKLLKVLKED------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITT 190 + + E ++ +D AL GRM + V+ A ++ HA TT Sbjct: 193 RDGFPAVLEGWVRQVCKESLLTKADHDLDTALFGRMVAANANF---NVEAACAVGHAFTT 249 Query: 191 HQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG-GASREQ 246 H+ + D+F+A ++L+ G+ G F GV+Y++A ++ L+ L G S E+ Sbjct: 250 HRFALEGDYFSAGEELKVLGGTGAVITGYAFFGGGVYYQHAVLDRGHLRTTLSRGRSAEE 309 Query: 247 A----LEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP-LSMANAFEKAVKAK 301 A ++ + L P K ++A+ A V+ P L++ AF VKA Sbjct: 310 AERLTVQAVDTFLTGLLFSQPRGKCNSHASDVAASYVLATRGGDPALNLGLAFLDPVKAT 369 Query: 302 D---GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPIT--AQVKQMPTLEQLKSW 356 + + SI+ + + YGL A + + ++ T+E + + Sbjct: 370 EDVTDLMCASIRRLTDFHRALTAAYGLGNAVCVLNAYPPARGNDAPRAPEVWTVEDFRRF 429 Query: 357 VRNNG 361 V+ G Sbjct: 430 VQGRG 434 >UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH9_CYAP4 Length = 501 Score = 259 bits (663), Expect = 9e-68, Method: Composition-based stats. Identities = 74/324 (22%), Positives = 143/324 (44%), Gaps = 25/324 (7%) Query: 48 MRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLA----LLSGKSVDEA 103 R E R L VL + ++ + + ++ A + + ++ Sbjct: 174 WRT--KLQSEFAEMPERVDDQVSLWSVLSIQALQKSQEDLANEDEADDEKVDTSNTMFFV 231 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDD--KKLLKVLKEDIAAIRVNLQQGVDIA 161 + + + +++ + + ++ + K++ +K V + DIA Sbjct: 232 GDVEIENLAGFLLNNLQVVQQDISASVPSFSKAVVDKIIDTIKHKDEKGNVIFPKPGDIA 291 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQE 218 L GRM + KVD ++ +AHAI+ +++ + D+FTAV+DL E GS H+G Sbjct: 292 LFGRMMAN---LPNAKVDASVQVAHAISVNKLQQEFDFFTAVEDLAEPDSLGSGHMGETG 348 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 ++S +YR+ ++ QL++NLG + + A IA +P Q +AA + Sbjct: 349 YNSSTYYRFTTLDTEQLKQNLG--NEDNAATIAHAFAEAFVRAIPTGHQNGFAAHSLPAA 406 Query: 279 VM-VNFSDMPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGL-----NGAAA 330 VM V P+S+ +AFE V K G L+ ++ +++W ++ YG G A Sbjct: 407 VMAVVRKGQPVSLVDAFENPVAPKAGKSLLENAVSKLDEHWAELSKMYGEKTVVFKGIVA 466 Query: 331 QFSLSDVDPITAQVKQMPTLEQLK 354 + L+ A V++ P++E+L Sbjct: 467 RAQLAQQLEYLAAVEK-PSVEELL 489 Score = 120 bits (301), Expect = 8e-26, Method: Composition-based stats. Identities = 36/147 (24%), Positives = 55/147 (37%), Gaps = 2/147 (1%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 + IH+L S P+ LNRD+ M K +FGG R RISSQ KR R YY + E + Sbjct: 3 LEIHILQSFPPANLNRDENGMPKSTVFGGYPRARISSQCQKR--RTREYYHEYCKELGVD 60 Query: 65 TIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCE 124 H A ++L E+ Q+ + + A L D + Sbjct: 61 LKHFANRSRNWIKQLKEKLTQRGVSEAQAELMASLTISVLSEKPDKKGKLKYKPEDVIKK 120 Query: 125 QVAKAEADNLDDKKLLKVLKEDIAAIR 151 V + +K ++ + Sbjct: 121 LVGVWQKALKSPRKKNELEQAITEQTL 147 >UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC9_9ACTN Length = 310 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 66/269 (24%), Positives = 118/269 (43%), Gaps = 14/269 (5%) Query: 79 LGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKK 138 + E + I+K +L + +K + + +++ ++A+ L D + Sbjct: 1 MPEVSEGDAIEKAKEVLVALGF-KLKKEENEYLNEYLIFIGTLQIGKLAELAIQALRDGE 59 Query: 139 L--LKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSD 196 K K+ + R VDIA+ GRM VD ++ +AHAI+ +++ Sbjct: 60 KVDKKEAKKILDVKRSPALNAVDIAMFGRMVADA---PDLNVDASVQVAHAISVSSAETE 116 Query: 197 IDWFTAVDD---LQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATH 253 D+FTA+DD G+A + T EF+S +FYRYAN+++ L ENLG S + A + Sbjct: 117 FDYFTALDDKAPEDNAGAAMIETTEFTSAMFYRYANVDVFHLCENLG--SPDAATKGINA 174 Query: 254 VVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKA--KDGFLQPSIQ 310 + +P KQ ++A V++ D P+S+ N+FE+ V A L + + Sbjct: 175 FLQSFVKSMPTGKQNSFANRTLPSAVVIQLRDSQPVSLVNSFERPVVALRDKSQLTNAAE 234 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDP 339 A + +G+ + D Sbjct: 235 ALVAQEKALDEAFGVTPQHTFVVAASPDA 263 >UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella boydii Sb227 RepID=Q31XC0_SHIBS Length = 245 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 16/240 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K I GG R+R+SSQSLKRA R S + Q + G Sbjct: 1 MTTFIQLHLLTAYPAANLNRDDSGSPKTVILGGATRLRVSSQSLKRAWRTSELFEQALAG 60 Query: 60 ESSLRTIHLAQLRD--VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 +R+ +A+ ++ + + ++ + + L D+ +K P Sbjct: 61 HIGVRSGRIAREAATILIEKGIEDKKAIEWAVEIADYLGKAKKDKKQKNDKKPKDPLTSA 120 Query: 118 EIAWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 E ++ AE D + + + KE A+ + VDIA+ GRM Sbjct: 121 ETEQLV-HISPAEFDAVKALAHQLAEEKRAPKEKDLALLRKDRMAVDIAMFGRMLAKKPG 179 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYA 228 V+ A +AHA + + D+FTAVDDL ++ G+ H+ F S +FY Y Sbjct: 180 F---NVEAACQVAHAFGVSETIVENDFFTAVDDLRQASEDAGAGHVDETGFGSALFYTYI 236 >UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ25_CYAP7 Length = 480 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 69/279 (24%), Positives = 114/279 (40%), Gaps = 20/279 (7%) Query: 83 FDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKV 142 D + ++ I D T + +A + + KKL Sbjct: 170 SDDDTSTPEETESTITILELPGAIQGDLKTSYKDNPLAKVVNE-----EEFNQLKKLCNE 224 Query: 143 LKEDIAAIRVNLQQGV--DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWF 200 +K + + + V D+AL GRM S VD ++S+AHAI+T+ + + D++ Sbjct: 225 IKGILYDEKNKRIKPVPGDVALFGRMLAS---FSDASVDASVSVAHAISTNSIKREFDYW 281 Query: 201 TAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHV 254 TA D + QG+ H+G + F+SGVFYRY+ ++ QL ENLG +E + Sbjct: 282 TAARDFQKNNSDESQGAGHIGDRPFASGVFYRYSCLDSNQLSENLGEIYQEDIQYLVEQY 341 Query: 255 VHMLATEVPGAKQRTYAAFNPA-DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFN 313 + P + + V P+S+ NAF+ +K D F + S Sbjct: 342 LDAFLHSRPSGYSHQFGHDTLPFAGIFVIRQSQPISLVNAFDIPIKKYDSFCRQSWNKLV 401 Query: 314 QYWDRVANGYGLNGAAAQ---FSLSDVDPITAQVKQMPT 349 +W+ + YG + FSL I+ VK +P Sbjct: 402 DHWNEIQQAYGKRLPVKEVHVFSLESFKDISELVKAVPN 440 Score = 127 bits (319), Expect = 8e-28, Method: Composition-based stats. Identities = 38/134 (28%), Positives = 56/134 (41%), Gaps = 10/134 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+NF+ IH+L S PS +NRD K A FGG R+R+SSQS K A+R+ Sbjct: 1 MTNFLEIHLLQSTPPSNMNRDQNGSPKTAHFGGVERLRVSSQSWKHAVRQYYKKTLPDDH 60 Query: 61 SSLRTIHLA-QLRDVLR-QKLGERFDQK--------IIDKTLALLSGKSVDEAEKISADA 110 + R +L L+ +K E + K I L G D+ ++ D Sbjct: 61 KTYRDKGWPTELAKRLKQEKFDEELNLKDSDFSVVLPIAFMLLSAIGAKRDDKKEGDIDT 120 Query: 111 VTPWVVGEIAWFCE 124 + E+ Sbjct: 121 MLFLGEAEVREIIN 134 >UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2C Length = 461 Score = 220 bits (562), Expect = 4e-56, Method: Composition-based stats. Identities = 90/402 (22%), Positives = 150/402 (37%), Gaps = 92/402 (22%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE-- 60 + ++H+L + + + RD+ M K +FGG R I++Q+ +RA R N G+ Sbjct: 16 QYFSLHLLETFTAALPVRDENGMPKQFVFGGDPRTMITAQARRRAERTHSRERANAGQGP 75 Query: 61 -----SSLRTIHLAQL-------------------RDVLRQKLGERFDQKIIDKTLALLS 96 +RT A+L L + +G +F K + L + Sbjct: 76 LAGYTMGIRTREWAKLTAKALADRYGWDRADALATAKALLEGVGLKFGAKPTTRDLTQVL 135 Query: 97 GKSVDEAEKISADAVTPWVVGEIAWF-------------------------------CEQ 125 + ++A +I AD + AW + Sbjct: 136 LFAPEDAGQIIADWIQEHRAEVAAWTSDYLKAKEAGAAAAAAKKAAAAAARKAKKSGTDA 195 Query: 126 VAKAEADNL--DDKKLLKVLKEDIAAIRVNL--QQGVDIALSGRMATSGMMTELGKVDGA 181 +A A DN ++++L V ++ AI L + +DIAL GR + VDGA Sbjct: 196 LASAADDNQPNNEEQLPPVPRKIREAILSALAPRDAIDIALYGRFLAEIADSP--NVDGA 253 Query: 182 MSIAHAITTHQVD------------------SDIDWFTAVDDLQEQGSAHLGTQEFSSGV 223 + AHA T H + +D+ A DD G+ G Q SG Sbjct: 254 IQTAHAFTVHAAEHIDDFYAAADDAKLHRKAHALDYIDAADD---SGAGMTGYQSLISGT 310 Query: 224 FYRYANINLAQLQENL--GGASREQAL----EIATHVVHMLATEVPGAKQRTYAAFN-PA 276 FYR+A ++ +L+ NL G +Q V +P AK+ T AA Sbjct: 311 FYRHAVLDRYKLRINLLASGMKPDQVQAAAEAAELEFVEAFTNAIPQAKKNTTAATGILP 370 Query: 277 DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 +VM P + A FEK + + + S+ A ++ ++ Sbjct: 371 KLVMAFTGARPFNYAGIFEKPIAEETDGV-ASVAAADRLLNQ 411 >UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190E665 Length = 139 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 39/138 (28%), Positives = 64/138 (46%), Gaps = 6/138 (4%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L +++P+ LNRD+ K A GG R+R+SSQSLKRA R S + + G Sbjct: 1 MTTFIQLHLLTAYAPANLNRDESGRPKTAFMGGVERLRVSSQSLKRAWRVSETFEAAMDG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--WVVG 117 RT + D + + + + ++ I K+ + L K + K DA + Sbjct: 61 FMGKRTRRIG--VDYVYRPMKDAGIEEKIAKSSSELIAKQFGKL-KSDKDAKPEKNLEIE 117 Query: 118 EIAWFCEQVAKAEADNLD 135 +I +D Sbjct: 118 QIVHVSNHEISLIKQLVD 135 >UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1 Tax=Streptomyces sp. C RepID=UPI0001B58196 Length = 91 Score = 83.4 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 4/84 (4%) Query: 262 VPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFEKAV---KAKDGFLQPSIQAFNQYWD 317 +P K T+ D+V+V S P+S AFEK V + +G ++ + +A ++ Sbjct: 1 MPTGKANTFGNHTLPDVVIVKLRSSRPVSFVGAFEKPVIQHETGEGHVRAAWKALAEHIP 60 Query: 318 RVANGYGLNGAAAQFSLSDVDPIT 341 + +G A P T Sbjct: 61 AIEKTFGATADATWILRVGEPPTT 84 >UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BS05_9ACTO Length = 435 Score = 66.0 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 36/89 (40%), Gaps = 3/89 (3%) Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ 331 + ++V V D +S+ NAFE+ V + +Q +++ + + YG+ AA Sbjct: 339 SLPELVYVAVRDTRSVSLVNAFEEPVACERGSRVQAAVEVLANEETAIEDAYGMKPLAAF 398 Query: 332 -FSLSDVDPITAQVKQMPTLEQLKSWVRN 359 D + T+ +L S + Sbjct: 399 VVDPKDYAAKLEDIAHKVTVPELTSLIVE 427 >UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O87037_VIBCH Length = 96 Score = 45.6 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 262 VPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVA 320 +P A+Q T + P + V + +FE+ V+A G+L P+ +A + ++ Sbjct: 1 MPNARQTTQSGACPWEYARVLVR-KGQRLQASFEQPVRAAGEGYLLPNKKALQNWLEQRE 59 Query: 321 NGYG 324 G Sbjct: 60 KLSG 63 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.156 0.498 Lambda K H 0.267 0.0482 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,107,918,916 Number of Sequences: 3077464 Number of extensions: 100636640 Number of successful extensions: 249008 Number of sequences better than 1.0e-01: 116 Number of HSP's better than 0.1 without gapping: 212 Number of HSP's successfully gapped in prelim test: 64 Number of HSP's that attempted gapping in prelim test: 247765 Number of HSP's gapped (non-prelim): 384 length of query: 363 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 233 effective length of database: 640,326,036 effective search space: 149195966388 effective search space used: 149195966388 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 94 (40.6 bits)