BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (363 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobact... 462 e-129 UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Ac... 411 e-113 UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID... 389 e-107 UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 T... 387 e-106 UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=S... 384 e-105 UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidob... 379 e-104 UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=S... 378 e-103 UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacteriu... 377 e-103 UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=... 376 e-103 UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putati... 375 e-102 UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 375 e-102 UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria ... 374 e-102 UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacteriu... 374 e-102 UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Strepto... 373 e-102 UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinom... 371 e-101 UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=P... 370 e-101 UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=S... 370 e-101 UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=B... 369 e-101 UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidob... 367 e-100 UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter ro... 364 3e-99 UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=... 363 5e-99 UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=G... 363 6e-99 UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=... 363 9e-99 UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actino... 362 1e-98 UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellul... 361 2e-98 UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 T... 361 3e-98 UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 360 4e-98 UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=A... 358 3e-97 UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes Rep... 355 2e-96 UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 352 8e-96 UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax... 352 1e-95 UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=D... 352 1e-95 UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=T... 350 5e-95 UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=S... 349 9e-95 UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=A... 349 1e-94 UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=R... 347 5e-94 UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=D... 346 5e-94 UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfo... 345 2e-93 UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=B... 345 2e-93 UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=A... 344 2e-93 UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=... 344 2e-93 UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces R... 344 2e-93 UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=A... 341 3e-92 UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granuli... 340 5e-92 UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=A... 340 6e-92 UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=C... 338 2e-91 UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiob... 335 2e-90 UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=R... 333 8e-90 UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=A... 332 1e-89 UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=J... 329 7e-89 UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=P... 329 1e-88 UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=B... 327 3e-88 UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=D... 326 1e-87 UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=A... 324 2e-87 UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=G... 324 4e-87 UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospi... 323 7e-87 UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinoc... 322 1e-86 UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=B... 322 1e-86 UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=S... 312 2e-83 UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax... 304 5e-81 UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=A... 293 7e-78 UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Ac... 285 1e-75 UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus ... 276 9e-73 UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax... 272 1e-71 UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=C... 259 9e-68 UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 259 1e-67 UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella bo... 252 2e-65 UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=C... 232 1e-59 UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax... 220 4e-56 UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 ... 138 3e-31 UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1... 83 1e-14 UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobil... 66 2e-09 UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O8703... 46 0.003 >UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobacteria RepID=YGCJ_ECOLI Length = 363 Score = 462 bits (1189), Expect = e-129, Method: Composition-based stats. Identities = 363/363 (100%), Positives = 363/363 (100%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE Sbjct: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA Sbjct: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 Query: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG Sbjct: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG Sbjct: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA Sbjct: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 Query: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN Sbjct: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Query: 361 GEA 363 GEA Sbjct: 361 GEA 363 >UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Actinomycetales RepID=C0W6U1_9ACTO Length = 374 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 113/369 (30%), Positives = 169/369 (45%), Gaps = 20/369 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IH++ S PSC+NRDD K A++GG RR+R+SSQS KRA R + + Sbjct: 1 MSTFVDIHLIQSLPPSCVNRDDSGSPKSALYGGVRRLRVSSQSWKRATRLYFNEHLDATD 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA---EKISADAVTPWVVG 117 +RT + +L + + + S + A K A A + +++ Sbjct: 61 VGIRTKRVVELLADRISAIAPDLADSALALAEQVFSAAKIKVAPPRGKKDAPAESGYLLF 120 Query: 118 EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 ++A+ + + + VDIAL GRM Sbjct: 121 LSTSQINRLAEMATRAAHAGE---KIDPKETKKIFKEEHAVDIALFGRMVADDA---DLN 174 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDD------LQEQGSAHLGTQEFSSGVFYRYANIN 231 VD A +AHAI+TH +++ D+FTAVDD ++ G+ +GT EFSS YRYA +N Sbjct: 175 VDAACQVAHAISTHAAENEYDFFTAVDDEKSRAMEEDAGAGMMGTVEFSSATMYRYATVN 234 Query: 232 LAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDMPLSM 290 L L ENLG R+ AL + + +P KQ T+A D V+V D P+S+ Sbjct: 235 LDMLVENLG--DRDAALRALSVFLEGFCLSMPTGKQNTFANRTLPDSVVVSVRDDQPVSL 292 Query: 291 ANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMP 348 AFEK V+ DGFL S++A +Y + +GL A+ P A + + Sbjct: 293 VGAFEKPVRTTESDGFLTRSVEALARYEHTIEENFGLKPQASFVVSLADVPELASLGERI 352 Query: 349 TLEQLKSWV 357 T L V Sbjct: 353 TFADLPGKV 361 >UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID=C2BET9_9FIRM Length = 359 Score = 389 bits (1000), Expect = e-107, Method: Composition-based stats. Identities = 98/356 (27%), Positives = 173/356 (48%), Gaps = 22/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + + P+ +NRDD K A +GG R R+SSQS KRA+RK ++ + Sbjct: 10 FLDIHAIQTVPPANINRDDTGSPKTAQYGGVTRARVSSQSWKRAIRKYFNENGDVENVGI 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R++ + + + K+ ++ I++ + ++ K+++ A+ + D + Sbjct: 70 RSLEIVRY---VANKIVQKDGSISIEEAM-EMADKTINNAKISTKDQKAKALFFMSDKQA 125 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A+A D ++DKK+L+ + ++ +D+AL GRM D + Sbjct: 126 EELAQASIDKVNDKKILQEILKN--------DTSIDVALFGRMVADDA---SLNEDASSQ 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 +AHAI+TH + S+ D+FTAVDDL G+ LGT E++S YRYANI L L Sbjct: 175 VAHAISTHAIQSEFDFFTAVDDLAPEDNAGAGMLGTVEYNSSTLYRYANIALHDFYRQL- 233 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFEKAVK 299 A +E+ ++ V +P K T+A ++V+ SD PL+M +AFE+ +K Sbjct: 234 -ADKEETIKATKLFVKSFVESMPTGKINTFANQTLPQAIVVSLRSDRPLNMVSAFEEPIK 292 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ + + A L + + K +L L Sbjct: 293 SDNGYVDKSIEKLFSEYTKYDKILDKPIFTAYLILGNT-EVNEIGKSEASLNDLLE 347 >UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 Tax=Gardnerella vaginalis RepID=D2RB01_GARVA Length = 362 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 100/360 (27%), Positives = 170/360 (47%), Gaps = 22/360 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I + S P +NRDD K A +GG R R+SSQ K +MR+ + Sbjct: 6 FLDIQAIQSVPPCNINRDDAGSPKTAQYGGVTRARVSSQCWKHSMREYFKEHSGDSNVGM 65 Query: 64 RTIHLAQLRD----VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 R+ ++ + L+ +L E+ + +KTL K+ + KI + +GE Sbjct: 66 RSKNIVKYVADKIITLKPELSEQEALDLANKTLNNAGFKTKTDKGKIIPVVNVLFFLGE- 124 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 +A+A +N+ DKK L+ + +D +DIAL GRM D Sbjct: 125 -NQANSLAQAAINNVTDKKQLEEILKDNP--------PIDIALFGRMLADN---PSLNED 172 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQ 236 + +AHAI+TH V ++ D++TAVDDL G+ LGT E++S YRYAN+ + + Sbjct: 173 ASSQVAHAISTHAVRAEFDYYTAVDDLSVDDNAGAGMLGTIEYNSSTLYRYANVAIHEFS 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFE 295 L ++E + + A +P K T+A M++V D P+++ +AFE Sbjct: 233 HQLSD-NKESTINALKLFIEAFANAMPTGKVNTFANQTLPQMLVVTLREDRPVNLVSAFE 291 Query: 296 KAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 VKAKDG++ SI+ +Q +++V A+ ++ + + +++QL Sbjct: 292 DPVKAKDGYVSKSIEKLSQEYEKVQKFVHKPLASFYVTMDSSNKEIKLGVEEQSMQQLLD 351 >UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM53_SYNFM Length = 384 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 125/394 (31%), Positives = 196/394 (49%), Gaps = 48/394 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI----G 59 F++IH++ + +PS LNRDD N KD FGG RR RISSQ +KR +R ++Q + G Sbjct: 2 FVDIHIIQNFAPSNLNRDDTNSPKDCEFGGYRRARISSQCIKRVVRSHRSFSQAVVHAGG 61 Query: 60 ESSLRTIHL-AQLRDVLRQKLG--ERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ +RT + ++L D+ +K G E + + + +T+ L G + + EK T +++ Sbjct: 62 DTGVRTKRIKSRLMDLFAKKYGKPEIVETEKVAETVIELLGLKLKDEEK------TEYLL 115 Query: 117 GEIAWFCEQVAKAEADNLDD---------------------KKLLKVLKEDIAAIRVNLQ 155 Q+A+ D+ D K+ + LK + R + Sbjct: 116 YLGENEAAQLARLAVDSWDALLAIEPEQDKKKKKGTGQESLKEFQEELKGIVGKRRKEAR 175 Query: 156 Q-GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGS 211 DIAL GRM VD A +AHA++T++V+ ++D+FTAVDDL +E GS Sbjct: 176 SYAADIALFGRMIADNKNM---NVDAACQVAHAVSTNKVEMEMDYFTAVDDLLPGEETGS 232 Query: 212 AHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYA 271 +G EF+S FYRY+N+N+++L ENL G + + V VP KQ + A Sbjct: 233 DMIGVVEFNSSCFYRYSNVNVSKLAENL-GFNNDLTTAALLGYVEASVKSVPTGKQNSMA 291 Query: 272 AFNPADM--VMVNFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNG 327 A NPA V+V P S+ANAF+K V+ + SI A +Y++R+ YG G Sbjct: 292 AQNPAGYARVIVRRDGFPWSLANAFQKPVRPSLDKSLEEASIDALERYFERLKAVYGTEG 351 Query: 328 AAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 S + +++M L+ LK+ V G Sbjct: 352 IVCDASFNLHRDDGGSLRKM--LDALKACVAGEG 383 >UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT63_9BIFI Length = 371 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 103/372 (27%), Positives = 162/372 (43%), Gaps = 18/372 (4%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L PS +NRDD K A GG R R+SSQS KRAMR+ + + Sbjct: 2 FVDIHCLQQVPPSNINRDDTGSPKTAYVGGALRARVSSQSWKRAMREMFSSKLDSSKLGK 61 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSG-----KSVDEAEKISADAVTPWVVGE 118 RT L + + ++ +L+ K+ D A + T +++ Sbjct: 62 RTKSAVALISSVIAEKRPDLVEESKSLAEKVLAATGVKVKASDRAGADKGSSATEYLIFI 121 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 EQ+A D+ K +K+++AA+ + +Q +DIA GRM Sbjct: 122 ANREVEQLADIAITAFDEGKDPSKMKKEVAAV-FHGEQAIDIACFGRMLADA---PDLNT 177 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQL 235 D + +AHA + Q+ + D+FTAVDD G+A + T F+S YRYA +N+ L Sbjct: 178 DASAQVAHAFSIDQITPEYDYFTAVDDCASDDNAGAAMIDTIGFNSSTLYRYATVNVDAL 237 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPA-DMVMVNFSDMPLSMANAF 294 ++ L A+E V +P KQ T+A D+V+V P+S A+AF Sbjct: 238 KDQLQ--DANAAVEGVAAFVDAFIKSMPSGKQNTFANHTLPEDIVIVLRDSQPISAADAF 295 Query: 295 EKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITAQVKQMPTLE 351 E +K KDG + I+ + + YG A +S + + TL Sbjct: 296 EDPIKRKDGISVSRQGIERLGDRLNEIRINYGEEPVKAWHVVSGGSVHSLDEWSEQVTLP 355 Query: 352 QLKSWVRNNGEA 363 +L+ +R A Sbjct: 356 ELEQGLRETLSA 367 >UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ1_SPHTD Length = 397 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 108/387 (27%), Positives = 171/387 (44%), Gaps = 38/387 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE--S 61 F+ +H++ + +PS LNRDD KD FGG RR RISSQ+LKRA+R + + E Sbjct: 2 FVELHIIQNFAPSNLNRDDTGAPKDCQFGGYRRARISSQALKRAIRMTFGEENLLPEESR 61 Query: 62 SLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 + RT +A L + L + + + G S ++ ++ + T +++ Sbjct: 62 ARRTKRIAGALVERLVASGKDAVAAAAVVEAAIQGIGLSFEKPKEGDTEKKTQYLLFLGQ 121 Query: 121 WFCEQVAKAEADNLD--------------------DKKLLKVLKEDIAAIRVNL--QQGV 158 +A + D K L + + ++ + Sbjct: 122 REINALADVCLAHWDTLVDVAPNADAASERDAKKAKKANKAALPKQVQLALLDALDGRSA 181 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLG 215 D+AL GRM +D A +AHAI+TH+V ++ D++TAVDDL+ G+ LG Sbjct: 182 DVALFGRMLAD---LPEKNIDAASQVAHAISTHRVATEFDFYTAVDDLKPDDTAGADMLG 238 Query: 216 TQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 T EF+S FYRY+NI++ QL ENLGG + A + +P KQ + AA NP Sbjct: 239 TVEFNSACFYRYSNIDVDQLIENLGG-DVDLARTTVEAFLWASIHAIPTGKQNSMAAQNP 297 Query: 276 ADMVMVNFSDMP-LSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 VM D S+ANAF V ++ S+ A YW + YG + Sbjct: 298 PSFVMAVVRDRGLWSLANAFVNPVAPAHDGDLIERSVDALEAYWSNLVRVYG-GELRGTW 356 Query: 333 SLSDVDPITAQVKQ--MPTLEQLKSWV 357 ++ +++ + T E+L V Sbjct: 357 CVNVNPRELGPLEELHVDTFEELVDAV 383 >UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=B1VIY1_CORU7 Length = 376 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 108/373 (28%), Positives = 161/373 (43%), Gaps = 20/373 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+I+ L S PS +NRDD + K+AIFGG R R+SSQS KRA+R+ + + Sbjct: 1 MSKIIDIYALQSLPPSLINRDDTGVPKNAIFGGVPRQRVSSQSWKRAIRRYFFENFDAAN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIID-----KTLALLSGKSVDEAEKISADAVTPW- 114 R+ L + ++ G I K + + +K DA + Sbjct: 61 IGDRSKRLPEKIARQLEEQGMEQGTAIERTEQLFKAAGIKTAVEKKPKDKDETDAEVAYP 120 Query: 115 VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTE 174 G + + + ++ K + AI + + VDIA+ GRM Sbjct: 121 QTGYLLFLSAHQIDNAVKAIQERDGKNFTKREAQAIL-DQEHSVDIAMFGRMVADDAAY- 178 Query: 175 LGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANI 230 VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + S YRYA + Sbjct: 179 --NVDAAVQVAHALGIHDSAPEFDYFTAVDDLAEEGEETGAGMIGTVQMMSSTLYRYATV 236 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLS 289 NL L ENL S + A + A V +P K T+A ++V V D P+S Sbjct: 237 NLEGLAENL--DSEDAAKQAAVEFVEAFIASMPTGKINTFANQTLPELVYVAVRDTRPVS 294 Query: 290 MANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF-SLSDVDPITAQVKQ 346 + NAFE V+A G + + Q V N YG A+ L + + Sbjct: 295 LVNAFEAPVEATEDKGRREVGAEVLAQEARDVENVYGFKPQASFVMGLGQLAEPFTDIAT 354 Query: 347 MPTLEQLKSWVRN 359 TL +LK + Sbjct: 355 QVTLPELKEQLAG 367 >UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=Proteobacteria RepID=B3E5V0_GEOLS Length = 356 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 105/375 (28%), Positives = 158/375 (42%), Gaps = 35/375 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ IH+L S+ P+ LNRDD K A GG R+R+SSQSLKRA R S + Q + Sbjct: 1 MSRFVQIHLLTSYPPANLNRDDQGRPKTAKMGGYDRLRVSSQSLKRAWRTSDLFQQALTE 60 Query: 60 ESSLRTIHLA------QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 RT L + +++K + QKI AL K D + + + Sbjct: 61 HVGTRTKLLGVMAYEKLVAGGVKEKQAKESAQKIAGVFGALKKAKEKDSLVDLEIEQLVH 120 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 EI + + + ++ + Q DIA+ GRM S Sbjct: 121 VSPSEIQAIESLLETLISQG-------RAPEDTELDLLRIQGQSADIAMFGRMLASS--- 170 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYAN 229 V+ A +AHAI+ H V + D+FTAVDDL ++ G+AH+G F++G+FY Y Sbjct: 171 PSYNVEAACQVAHAISVHPVVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAAGLFYSYIC 230 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPL 288 IN L ENLGG + ++ P KQ ++ + A V+ D P Sbjct: 231 INRTLLVENLGG-DEALVQKSIQALIEAAVKVPPNGKQNSFGSRAYASYVLAEKGDQQPR 289 Query: 289 SMANAFEKAVKAKD----GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQV 344 S++ AF K V ++ F ++ A + YG +D Sbjct: 290 SLSVAFLKPVTSQGIEGTDFGTAAVDALTTQRQNMDAVYGP--------CADASCEINVF 341 Query: 345 KQMPTLEQLKSWVRN 359 + TL +L +V Sbjct: 342 EGKGTLAELLKFVAE 356 >UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putative n=2 Tax=cellular organisms RepID=B0TDU0_HELMI Length = 385 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 117/389 (30%), Positives = 174/389 (44%), Gaps = 37/389 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE--SS 62 + IHVL +H+P+ LNRD+ KD +FGG RR RISSQ KR +R S + +IGE Sbjct: 2 VEIHVLQNHAPANLNRDESGSPKDCMFGGVRRGRISSQCQKRTIRCSPLFQDSIGESRLG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 +RT L L +LG + I A G ++ K D +T + Sbjct: 62 MRTRKLPFLVKEELMRLGLSEELAKIGARKASGLG---NKDGKERDDEITAQAIFLTQED 118 Query: 123 CEQVAKAEADNLDDKKLLKVLKEDIAAIRVN------LQQGVDIALSGRMATSGMMTELG 176 +A+ +L DK + + ++ + VD+AL GRM TS + Sbjct: 119 VSVIARCLFRHLKDKTVKQAKAIKAQELQKDPELVGWRPVTVDVALFGRMTTSTAFND-- 176 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+ ++ + HAI+TH+VDS+ D+FTAVDDL + G+ +G EF+S +Y+Y N+++ Sbjct: 177 -VEASVQVGHAISTHRVDSEFDYFTAVDDLMGDGDSGADMIGDTEFNSCCYYKYFNVDMD 235 Query: 234 QLQENLGGASR-------------EQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +L+ NL G R A I + L P KQ ++AA V+ Sbjct: 236 ELKRNLAGPDRLKKLTAEERQDLARDAAHIVKAFIESLVFCSPDGKQNSFAARQLPSAVL 295 Query: 281 VNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL-- 334 V P+S ANAF K V A+ +Q S+ AF + +GL L Sbjct: 296 VEVKKRKIPVSYANAFVKPVTARGEMDLVQASVNAFLDHVKETEKCFGLTPNRRWLLLMG 355 Query: 335 -SDVDPITAQVKQMPTLEQLKSWVRNNGE 362 T QV P L + + GE Sbjct: 356 CESPKMTTDQVSTFPALVEELTAALQQGE 384 >UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD3_THET1 Length = 382 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 120/379 (31%), Positives = 183/379 (48%), Gaps = 30/379 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA--QNIGES 61 + +H++ + +PS LNRDD KD FGG RR RISSQ +KRA+R+ + Sbjct: 2 LVELHMIQNFAPSNLNRDDTGSPKDCEFGGVRRARISSQCIKRAIRREFKQNGLLDSERI 61 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 + RT + Q +LG R ++ LLS + + + GEI Sbjct: 62 AERTRLVTQEIADRLARLG-RDREQATRVAGFLLSAAKLKVDNSQRTEYLLFLGRGEIDA 120 Query: 122 F-------CEQVAKAEADNL-----DDKKLLKVLKEDIAA---IRVNLQQGVDIALSGRM 166 +Q+A +L D KK + + D++ R++ + D+AL GRM Sbjct: 121 ITALCNERWDQLAPLADQSLSDQSNDKKKAAQQVPADMSRELLARLDGGKAADLALFGRM 180 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGV 223 +D A +AHAI+TH+V + D++TAVDDLQ E G+ +GT EF+S Sbjct: 181 LAD---LPDKNIDAASQVAHAISTHRVSIEFDFYTAVDDLQPESETGAGMMGTVEFNSAC 237 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRY+N+++ QL NL G RE AL+ +H +P KQ + AA NP MV Sbjct: 238 FYRYSNVSMEQLITNLQG-DRELALKTLEAFIHASVRAIPTGKQNSMAAHNPPSMVFAVV 296 Query: 284 SD-MPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAA--AQFSLSDVD 338 + P S+ANAF + V ++ + SIQA + YW ++ + YG + A +L DV Sbjct: 297 REGAPWSLANAFARPVAPGREEDLVGRSIQALDSYWGKLVSVYGGDDIRKKALITLEDVP 356 Query: 339 PITAQVKQMPTLEQLKSWV 357 ++ T++ L V Sbjct: 357 LQHLGDARVETVKALVEQV 375 >UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria RepID=A3EQA5_9BACT Length = 398 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 113/395 (28%), Positives = 180/395 (45%), Gaps = 47/395 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M I IHVL + +PS LNRDD KDA+FGG RR RISSQ +KR++R + + G Sbjct: 1 MKTLIEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARISSQCIKRSVRDFFCHKREDGI 60 Query: 60 ----ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE---------- 104 E +RT + Q + D+L++K L+ L K +E Sbjct: 61 FSPDEIGVRTKRIYQAIADLLKEKRDISDTITKAKTALSYLKIKPKNEKTQYLLFLSPKE 120 Query: 105 -KISADAVTPW---VVGE-IAWFCEQVAKAEADNLDDKK-------------LLKVLKED 146 K A+A+ + +VGE I ++ + D + ++ + K +E Sbjct: 121 IKDFANAIDEYWDQIVGEPIETDNSELDEETPDTVSLEEQKPKKGKKNKKPNIPKEFQEK 180 Query: 147 IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL 206 + ++ N + +DIAL GRM + A +AHAI+TH V+ + D++TA+DDL Sbjct: 181 LESVL-NGGKSIDIALFGRMLAD---IPEKNQNAACQVAHAISTHAVEREFDYYTAIDDL 236 Query: 207 QE---QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVP 263 + GS +GT EF+S FYRYA ++L L +NL E + + P Sbjct: 237 KPDDTAGSDMIGTVEFNSACFYRYAVVDLEALNKNLHD-DSELTNKSIRAFLEAFIISEP 295 Query: 264 GAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRV 319 KQ ++AA NP + + ++ P ++ANAFE AV K G + S + + Sbjct: 296 TGKQNSFAAHNPPEFIAISVRHNAGPRNLANAFETAVFPKKGESLTRKSADELVKKAKSL 355 Query: 320 ANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLK 354 + +G +L + + + +LE L Sbjct: 356 QSAFGGEDKTFLINLVGTN-VNGYGTVVASLEDLL 389 >UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacterium RepID=C3PF94_CORA7 Length = 384 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 109/380 (28%), Positives = 168/380 (44%), Gaps = 32/380 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+IH L + PS +NRDD K AIFGG R R+SSQS KRA+R + Sbjct: 1 MSLVIDIHALQTLPPSLINRDDTGAPKSAIFGGVPRQRVSSQSWKRAIRNYFEKNVDPEF 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSV-----------------DEA 103 R+ L + L + ++ I + L + ++ Sbjct: 61 VGDRSKRLPEKIAKLVENHDGWDSERAIKQVSDLFKAAGISTEVDSKRIKELEKSDAEDK 120 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALS 163 E++ +A P I +Q+ +A +D + +K+ A + ++ Q VD+A+ Sbjct: 121 EELIKEASYPRTKYLIFLSPQQIDRAVRAIVDADG--EKIKKAEAKVILDTQHSVDMAMF 178 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEF 219 GRM VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + Sbjct: 179 GRMIADDAAF---NVDAAVQVAHALGIHSSAPEFDYFTAVDDLAEDGEETGAGMIGTVQM 235 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S YR+A +N+A L +NL AS E A + A V +P K T+A ++V Sbjct: 236 MSSTLYRFATVNVAGLTKNL--ASEENAKQAAVQFVDAFIKSMPTGKINTFANHTLPELV 293 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFS-LS 335 V D P+S+ AFE+ V+A D +A + YGL AA +S Sbjct: 294 YVTVRDTRPVSLVTAFEEPVQATDDKNLRLAGAEALAKEEREFEENYGLKPLAAFAVGVS 353 Query: 336 DVDPITAQVKQMPTLEQLKS 355 + A + + TL +L Sbjct: 354 EARAPFADIAETVTLPELSE 373 >UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ0_STRMN Length = 359 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 91/356 (25%), Positives = 163/356 (45%), Gaps = 24/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I+ + + PS +NRDD K +GG RR R+SSQS K+AMR Y + Sbjct: 11 FLDIYAIQTLPPSNINRDDTGSPKTTQYGGVRRARVSSQSWKKAMRDYFYEHAEEEQLGK 70 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 RT + ++K K + + + D + +G Sbjct: 71 RTRKVVNYVAEKIIHQKIDLNEKESSKLAT-----DILKLAGVPTDGKVLFFIGNTE--A 123 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A A + DK+ + + + +D+AL GRM + TE D + Sbjct: 124 EKLATAAVKGVKDKEEARKI--------MQSNLALDVALFGRMVANDKETEA---DASSQ 172 Query: 184 IAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AH I+TH V ++ D++TAVDDL + + LGT EF+S YRYAN+ + + G Sbjct: 173 FAHPISTHAVQTEFDFYTAVDDLASDDDAKAGMLGTVEFNSSTLYRYANVAIHEFLVQRG 232 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD-MVMVNFSDMPLSMANAFEKAVK 299 +RE ++ + A +P K ++A +++ SD P+++ +AFE+ VK Sbjct: 233 --NREDLVDSLQLFIKAFAESMPRGKINSFANQTIPQTLIITVRSDRPVNLVSAFEEPVK 290 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ ++ + +V + SL +V+ +T + ++ +L Sbjct: 291 SSNGYVTKSIEKLSKEFVKVEKMVKKPVLSFYVSLEEVEALTKVGIEKNSITELVE 346 >UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA64_9ACTO Length = 374 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 107/379 (28%), Positives = 168/379 (44%), Gaps = 23/379 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IHVL + PS NRDD K A FGG +R+RISSQ++KRA R+ G Sbjct: 1 MSVFVDIHVLQTLPPSNPNRDDTGAPKSATFGGVQRMRISSQAIKRATRQDFEGKIADGN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-------EKISADAVTP 113 +RT + +L + +R D + LA + K++ + + + Sbjct: 61 YGVRTKKIVELVARTITE--KRPDLEAASIELAEMGLKAIGFKLAEPRGNKSDNELKESG 118 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V A E V+ A + KE V+ +DIAL GRM Sbjct: 119 FLVFLSAKQIEHVSDALISVAHEDDPAAAFKELKPRSLVDTDHSIDIALFGRMVAEPNA- 177 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ------EQGSAHLGTQEFSSGVFYRY 227 VD A +AHAI V+ + D++TAVDD + ++G+ +GT EF+S YRY Sbjct: 178 --LNVDAACQVAHAIGVGAVEREYDYYTAVDDAKKRNDEADEGAGMIGTIEFASATVYRY 235 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDM 286 A IN+ L+ENLG A V +P K T+A + V+V D Sbjct: 236 ATINVDLLRENLG--DDAVADRAVELFVDSFVRSMPTGKVTTFANRTLPEAVLVQVRDDQ 293 Query: 287 PLSMANAFEKA-VKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV-DPITAQV 344 P++M+ AFE+ + + GF +P+I F ++ ++ GL + S + +++ Sbjct: 294 PINMSGAFEEPIIAGQHGFAEPAIARFVEFESQLRELTGLEAVESLVSWTTPRGESFSEL 353 Query: 345 KQMPTLEQLKSWVRNNGEA 363 + L L Sbjct: 354 GKQVRLASLGETAAEAVRG 372 >UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ARH7_PELPD Length = 374 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 115/374 (30%), Positives = 173/374 (46%), Gaps = 23/374 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M + IHVL + +PS LNRDD KDA+FGG RR R+SSQ LKR++R+ QN G Sbjct: 1 MKTIVEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARVSSQCLKRSVREYFKD-QNKGW 59 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQ-------KIIDKTLALLSGK-SVDEAEKISADAVT 112 + RT + + E K I+ ++ L V ++ +D + Sbjct: 60 VADRTKRVVYALKERISPVLESQKDFSEDNLLKAIEVAVSNLGSNKKVKVDKEKKSDVLL 119 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN--LQQGVDIALSGRMATSG 170 EI + VA++ AD L K +V++ AI + VD+AL GRM Sbjct: 120 FLSPKEIDALAQVVAESYADLLKTKLSDQVVRNLNDAIDGENKSRLSVDVALFGRMLA-- 177 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRY 227 + + A +AHAI+TH V+ + D++TAVDDL+ G+ +GT EF+S FYRY Sbjct: 178 -VMPEKNQNAACQVAHAISTHAVEREFDFYTAVDDLKPEDTAGADMMGTVEFNSACFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS--D 285 A ++ +L NL A A + + P KQ T+AA NP + V V Sbjct: 237 AVVDWEKLLVNLQ-ADEALATKGLRAFLEGFVVAEPTGKQNTFAAHNPPEFVAVTVRRNA 295 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ 343 P ++ANAFE AV+ + + S + + + +G + +L++ I Sbjct: 296 APRNLANAFETAVRVRKDESLTRKSAEGLANKAKALQSAFGGDEKTFVLNLAEAT-IDGF 354 Query: 344 VKQMPTLEQLKSWV 357 MPTL L Sbjct: 355 GIVMPTLNDLLDKA 368 >UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC4_SYNJA Length = 380 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 104/370 (28%), Positives = 167/370 (45%), Gaps = 22/370 (5%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY--AQNIGESS 62 + IH++ S P+ LNRD+ M K IFGG+ R RISSQ KRA+RK + + + Sbjct: 3 LEIHLIQSFPPANLNRDENGMPKSTIFGGRPRARISSQCQKRAVRKYYHQYAELDPAHFA 62 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 R+ + K G +Q + LAL G + +K A + E+ Sbjct: 63 ARSRNWLPELKSKLVKAGIPDEQAGMAARLALEQGLKLKFNDKNEATTIVFLGKTELDAI 122 Query: 123 CEQVAK---AEADNLDDKKLL--KVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 E + K A L ++K + + + I V+ + D+AL GRM S Sbjct: 123 AEILIKNWSAIESGLREEKPKLPQKIAKAIEKALVDTGKPGDVALFGRMMAS---LPTVN 179 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A+ +AHAI+ + + + D+FTAVDDL ++ G+ H+G ++S +YR+A ++ Q Sbjct: 180 VDAAVQVAHAISINALQQEFDFFTAVDDLGSSEDTGADHMGETGYNSSTYYRFAVLDKKQ 239 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANA 293 L ENLGG E I VP Q +AA +VM + P+S+ +A Sbjct: 240 LVENLGGT--EHLGSIIKAFATAFIHAVPSGHQNGFAAHTRPALVMAVVREGQPISLVDA 297 Query: 294 FEKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ----VKQM 347 FE V GF L+ +++A ++YW + YG + + + Sbjct: 298 FENPVAPSGGFSLLENAVKALDEYWGSLVKMYGEADVQYKGVVVLDRLAARLNVLKSSKK 357 Query: 348 PTLEQLKSWV 357 ++E+L Sbjct: 358 DSVEELLKSA 367 >UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=Bacteria RepID=A4XYU0_PSEMY Length = 384 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 115/391 (29%), Positives = 172/391 (43%), Gaps = 39/391 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQ--NI 58 MS F+ H++ + +PS LNRDD KDA+FGG RR R+SSQ KRA+R + + Sbjct: 1 MSLFVEFHLIQNFAPSNLNRDDTGAPKDALFGGHRRARVSSQCFKRAIRLAAQEHELVAP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 +RT L L L ++L R + K L+ + + + + E Sbjct: 61 EFRGVRTKKLKTL---LLERLAGRDPLEAEGKIEVALAAAGLKLKDDGKTEYLLFLGEAE 117 Query: 119 IAWFC-------EQVAKA-----------EADNLDDKKLLKVLKEDIAAIRVNLQQGVDI 160 IA F +++A A +V+K+ A ++ + VD+ Sbjct: 118 IAGFATLIEQHWDELAGAPAGGEKKGEKKGKKEAKASAPAEVVKK--AKALLDGGKAVDV 175 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQ 217 AL GRM D A +AHAI+TH+V+ + D+FTAVDD E G+ +G Sbjct: 176 ALFGRMLAD---MPEVNQDAACQVAHAISTHRVEREFDYFTAVDDKGGPDETGAGMIGQV 232 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 EF+S YRYA ++ +L NL RE L + +P KQ T+AA N Sbjct: 233 EFNSATLYRYAVVDAGKLLGNLQ-QDRELTLSALEAFTQAMVRAIPTGKQNTFAAHNLPS 291 Query: 278 MVMVNFSDM-PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 V V PL++ANAFEK + A+ S+ ++ ++A Y ++ Sbjct: 292 FVGVCLRHAGPLNLANAFEKPIAARQDAALSSLSVTELAKHEGKLAAVYADASDQ--WAY 349 Query: 335 SDVDPITAQVKQMP--TLEQLKSWVRNNGEA 363 D+ Q K L +L SWVR A Sbjct: 350 LDLSEAWPQQKGFAVQNLGELASWVRMQVAA 380 >UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG89_9BIFI Length = 387 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 106/387 (27%), Positives = 168/387 (43%), Gaps = 32/387 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + PS +NRDD K A GG R R+SSQ+ KRAMR + + Sbjct: 2 FMDIHCIQQVPPSNINRDDTGSPKTAYVGGALRSRVSSQAWKRAMRGVFDDMLDSDKLGK 61 Query: 64 RTIHLAQLRD---VLRQKLGERFDQKIIDKTLALL--SGKSVDEAEKISADAVTPWVVGE 118 RT + L ++ +++ + LAL K+ + A VT +++ Sbjct: 62 RTKGVVALIASSITAKRPDLAESAEELGQRVLALEGIGVKASNRAGSDKGTLVTDYLIFI 121 Query: 119 IAWFCEQVAKAEADNLD---------DKKLLKVLKEDIAAIR------VNLQQGVDIALS 163 +++A D K L K K D+A ++ + Q +DIAL Sbjct: 122 ANNEIDKLADWAIAASDKGRDFSKVGKKGLSKAEKTDLAKMKNEVSEIFHGPQAIDIALF 181 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFS 220 GRM + D + +AHA + Q+ + D+FTAVDD G+A L T F+ Sbjct: 182 GRMLANA---PDLNTDASAQVAHAFSIDQITPEYDYFTAVDDCASEDNAGAAMLDTVGFN 238 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 S YRYA +N+ L+E L AS A+E A V +P KQ T+A + V+ Sbjct: 239 SSTLYRYAAVNIDALKEQLQDAS--AAVEGAVAFVEAFIKSMPSGKQNTFANHTLPEDVV 296 Query: 281 VNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV 337 V D P+S A+AFE+ V+ K+G + I+ + + + Y A + S Sbjct: 297 VVLRDSQPISAADAFEEPVRRKEGVSVSRQGIERLGKRLNEIRVNYSEEPVKAWYIASGG 356 Query: 338 D-PITAQVKQMPTLEQLKSWVRNNGEA 363 + + + +L L+ +R A Sbjct: 357 EVDSLKEWSEQVSLPDLEHGLRETLNA 383 >UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK6_CITRO Length = 363 Score = 364 bits (935), Expect = 3e-99, Method: Composition-based stats. Identities = 97/358 (27%), Positives = 151/358 (42%), Gaps = 21/358 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K + GG R+RISSQSLKRA R S + Q + G Sbjct: 13 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGTTRLRISSQSLKRAWRTSELFEQALAG 72 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 +R+ +A+ + K G + + V K A P E Sbjct: 73 NIGIRSGRIAREAAEILIKSGIDEKKAVAYVEAIARCFGKV----KADKKAKEPLTNSET 128 Query: 120 AWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTE 174 ++ AE D + + + KE+ A+ + + VDIA+ GRM Sbjct: 129 EQLV-HISPAEFDAVKALAHRLAEEKRAPKEEELALLRHDRMAVDIAMFGRMLADK---P 184 Query: 175 LGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANI 230 V+ A +AHA + + D+FTAVDDL + G+ HLG F S +FY Y I Sbjct: 185 EFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRANSDDAGAGHLGYTGFGSALFYTYICI 244 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLS 289 N L +NL G + + A + P KQ ++A+ A M D P S Sbjct: 245 NKDLLIKNLNG-NVDLANQTLRAFTEAALKVSPTGKQNSFASRAYACWAMAEKGTDQPRS 303 Query: 290 MANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +A AF K + L ++Q + + + Y F++ + + V + Sbjct: 304 LAAAFYKPIVGS-DHLNVAVQRVTELRENMNAVYEQQTEFVGFNVMNKEGSIKDVLEF 360 >UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=Actinomycetales RepID=C7QEM5_CATAD Length = 399 Score = 363 bits (933), Expect = 5e-99, Method: Composition-based stats. Identities = 103/391 (26%), Positives = 173/391 (44%), Gaps = 38/391 (9%) Query: 1 MSN-FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ ++IH+L + PS LNRDD K A++GG RR R+SSQ+ KRA R++ + Sbjct: 1 MTRVILDIHILQTVPPSNLNRDDTGSPKTAVYGGVRRARVSSQAWKRATRQAFGDLLDPS 60 Query: 60 ESSLRTIHLAQ--------LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV 111 E +RT +A+ L L ++I S ++ + +D Sbjct: 61 ELGVRTKRVAEQIANRMTALEPSLSPGDAVAVAVEVIKAATGAKSEVPKRKSAAVKSDQD 120 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDD------KKLLKVLKEDIAAIRV----NLQQGVDIA 161 + E + +++++ +NL K + LK+ RV + + VDIA Sbjct: 121 ATAALPETGYLM-FLSESQLNNLARLGVEGSKDITAFLKDKDFKNRVRQAADTRHSVDIA 179 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD---LQEQGSAHLGTQE 218 L GRM VD A +AHAI+ H V+++ D+FTAVDD E G+ +G + Sbjct: 180 LFGRMVADAT---DINVDAAAQVAHAISVHAVENESDYFTAVDDRSTEAEPGAGMIGIVD 236 Query: 219 FSSGVFYRYANINLAQLQENLGG------ASREQALEIATHVVHMLATEVPGAKQRTYAA 272 F++ YRYA +++ +L +NLG + E + A +P K T+ Sbjct: 237 FNAATLYRYAAVDVNRLADNLGAGLLEGESQTEPVRRAVEAFIRGFALSMPTGKVNTFGN 296 Query: 273 FNPADMVMVNFS-DMPLSMANAFEKAVKA---KDGFLQPSIQAFNQYWDRVANGYGLNGA 328 D+V+V P+S A AFE+A+ A + G+L+ + + Y ++ Y L Sbjct: 297 HTVPDVVLVKLRASRPISFAAAFEEAISAGEHQGGYLKGACERLASYIPKLEQAYDLQEG 356 Query: 329 AAQFSL--SDVDPITAQVKQMPTLEQLKSWV 357 + + Q ++ QL + V Sbjct: 357 TDSWVVCAGSATEALEQAGDPVSISQLVAAV 387 >UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=Gammaproteobacteria RepID=A1SV72_PSYIN Length = 337 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 140/356 (39%), Positives = 204/356 (57%), Gaps = 23/356 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ FINIH LISH S +NRDD MQK A+FGG R RISSQ LKRA+R+S Y + + E Sbjct: 1 MTTFINIHTLISHPSSMMNRDDSGMQKTAVFGGSVRSRISSQCLKRAIRQSDIYGEAVAE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA-DAVTPWVVGEI 119 S+RT +L D+ ++ + E + I D L + S + D+ +I DAV P+ +G I Sbjct: 61 KSIRTNKFDELLDLCKEAMPETDIKLIEDVLLNMGSKVTKDKKTEIRNFDAVQPYAIGSI 120 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 + + + K L K+++ +D+ALSGRM S V+ Sbjct: 121 ----REAINMVNEGTELKDLKKIVQ----------IPTIDVALSGRMDAS---CPPRNVE 163 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENL 239 AMS+AH++TTH D ++DWFTA DDL EQGS H+GT EFSSGVFYRYA+IN+ L +N+ Sbjct: 164 AAMSVAHSLTTHSADIEVDWFTACDDLAEQGSGHIGTTEFSSGVFYRYASINVDLLAKNV 223 Query: 240 GGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVK 299 E I ++ A P AKQ+ +AA+N AD VM S+ P+S+ANAF K ++ Sbjct: 224 KSTVSEVTP-IINTMIRCFAQVSPSAKQKVFAAYNQADFVMATHSNQPISLANAFRKPIE 282 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 ++ SI A ++++++ N Y L+ A L+D +AQ KQ+ + ++ Sbjct: 283 NNGDVMENSIAALVKHYEKLTNAYELDSKAIALDLTD----SAQSKQINLVNKISD 334 >UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=Bacteria RepID=Q3A5Z5_PELCD Length = 373 Score = 363 bits (931), Expect = 9e-99, Method: Composition-based stats. Identities = 102/358 (28%), Positives = 157/358 (43%), Gaps = 30/358 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS FI +H+L S+ P+ LNRDD+ K A GG R+R+SSQSLKRA R S + + + Sbjct: 1 MSRFIQLHLLTSYPPANLNRDDLGRPKTAKMGGVDRLRVSSQSLKRAWRTSDLFGKTVKN 60 Query: 61 -SSLRTIHLAQLR--DVLRQKLGERFDQKIIDKTLALLSG-KSVDEAEKISADAVT---- 112 RT + + ++ + +G + + K + + EK + + Sbjct: 61 GLGTRTKEMGRKVYERLVEKGIGHKDALSWAGAIAGVFGKLKKLTDKEKTALKKLATEER 120 Query: 113 ---PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ----GVDIALSGR 165 V EI + E LD + KE +NL + VDIAL GR Sbjct: 121 REKELVEVEIEQLAFFDLEEEQAVLDLTNSIAERKEGPQPEELNLLRQKMTSVDIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M S V+ A +AHAI+ H + + D+FTAVDDL ++ G+AH+G F++ Sbjct: 181 MLASSPAF---NVEAACQVAHAISVHPIVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAA 237 Query: 222 GVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV 281 G+FY Y IN L ENLGG + A + P KQ ++A+ A V+ Sbjct: 238 GLFYSYICINRDLLAENLGG-DEDLAQRAIAALTEAAVKVPPNGKQNSFASRAYASYVLA 296 Query: 282 NFSD-MPLSMANAFEKAV------KAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 + P S++ AF K + + F +++A + + YG Sbjct: 297 EKGEQQPRSLSVAFLKPIDNRTLYRDDQDFGTAAVEALEAHRQNMNKVYGDCADELYA 354 >UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actinomycetales RepID=Q2JH28_FRASC Length = 384 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 94/336 (27%), Positives = 156/336 (46%), Gaps = 12/336 (3%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M +I++H+L + PS LNRDD K A++GG +R R+SSQ+ KRA R + + + Sbjct: 1 MRCYIDVHILQTVPPSNLNRDDAGTPKQAVYGGVKRARVSSQAWKRATRTAFADHIDQAQ 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEK-ISADAVTPWVVGEI 119 RT ++ L + +LL+ + +K + + ++ Sbjct: 61 LGTRTKRISALLAERLATRCALDAETSTRIATSLLTALKISAGKKAAETAYLLFFGRPQL 120 Query: 120 AWFCEQVAKAEA--DNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 + + + +L D LL +K+ + +D+AL GRM Sbjct: 121 ERLIDLIVEDVPRLADLSDGDLLAAVKDVPVLATLGSDHPIDVALFGRMVAD---LASLN 177 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A +AHA++TH VD + D++TAVDD E G+ +GT EF S YR+A + L Q Sbjct: 178 VDAATQVAHALSTHAVDVEFDYYTAVDDQNAKDETGAGMIGTVEFQSATLYRFATVGLHQ 237 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV-MVNFSDMPLSMANA 293 L ENLGG E +E + T +P Q ++A +++ + D P+++ +A Sbjct: 238 LAENLGG-DIEATVEALRVFLTAFTTSMPTGHQNSFAHRTVPNLLTIAIRPDQPVNLVSA 296 Query: 294 FEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGLNGA 328 FEK V + G L S++ F + + +GL Sbjct: 297 FEKPVLPRGRGVLTGSLEQFAIELNSASTLWGLQPD 332 >UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellular organisms RepID=Q2FNL3_METHJ Length = 382 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 62/401 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS FI IH+L S+ PS LNRDD+ K A GG +R+R+SSQSLKR+ R S ++ + G Sbjct: 1 MSEFIQIHMLASYPPSNLNRDDLGRPKTATVGGTQRIRVSSQSLKRSWRTSEAFSDALKG 60 Query: 60 ESSLRTIHLA-----------QLRDVLRQKLGERFDQKIIDKTLA-------------LL 95 +RT + L D+L K ++I D+ A + Sbjct: 61 AIGIRTRDMGVKIKKALVEGRLLSDILEGKESGVTRERIKDEKKAHEWAVKISSHFGKIE 120 Query: 96 SGKSVDEAEKISA------------DAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVL 143 GK D +K + + EIA + + + + + Sbjct: 121 GGKEKDSDKKSEKTDEKSNKNPLSHKQMVHYSPEEIAGIDDLLGRISGG--------EKV 172 Query: 144 KEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 +D + + VDIAL GRM + A+ ++HAIT H + D+FTAV Sbjct: 173 SDDDCIRLRSDHKAVDIALFGRMLADNAAY---NTEAAVQVSHAITVHDTPVEDDYFTAV 229 Query: 204 DDLQE----QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLA 259 DDL + G+ H+G EF +G+FY Y IN L+ENL G E + ++ + Sbjct: 230 DDLNQLDDTAGAGHIGEAEFGAGLFYTYICINRDLLKENLQG-DNELSNRAIEALIRAAS 288 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 P KQ ++A+ + A ++V + P S+A AF K V KD + +++ DR Sbjct: 289 MVSPSGKQNSFASRSYASYLLVEKGTEQPRSLAAAFFKPVSGKDIY-GDAVKNLEGLRDR 347 Query: 319 VANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 + N YG + + S++ +D +L + S+V Sbjct: 348 MDNAYGTSFKQSSRSMNVIDGTG-------SLTDIISFVLE 381 >UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE3_PROAC Length = 374 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 100/375 (26%), Positives = 166/375 (44%), Gaps = 26/375 (6%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S +++IHV+ S PS +NRDD K A++GG RR R+SSQ+ K+A+R S ++ Sbjct: 3 SYYVDIHVIQSVPPSNVNRDDTGSPKSALYGGVRRARVSSQAWKKAVRTSFKEFLPANQT 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD--------AVTP 113 RT+ + +L ++ + + +AEK T Sbjct: 63 GSRTLRVVELLMNRLTAAPYGLPEEDARQKALEVVKALGLKAEKPRKKDESGAEGIERTQ 122 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V +++A+ A D K+ + A + G+++AL GRM Sbjct: 123 YLVFYSNQQLDRLAQLAA--TTDGKITATDAKKAA----DSDHGIEVALFGRMVADSK-- 174 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYAN 229 VD A+ +AHA++TH V+ + D+FTAVDD + + G+ +GT EF+S YR+A Sbjct: 175 -DLNVDSAVQVAHALSTHAVEIESDYFTAVDDYKLDEDDAGAGMIGTVEFTSETLYRFAT 233 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV-NFSDMPL 288 + ++ L++NLG + + A+ V +P KQ T+A D V+V Sbjct: 234 VAVSTLKDNLG--DVDLTAQAASAFVRGFIMSMPTGKQNTFANNTIPDAVVVQVRKGRSA 291 Query: 289 SMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGY-GLNGAAAQFSLSDVDPITAQVKQ 346 S AFE V + GF+ S QA Y + G A+ + + Sbjct: 292 SFIGAFEDPVTSDDGGFVAASCQAVAAYAHDCEEAFLGAPEASFVTRVGSRTEAIGTMGT 351 Query: 347 MPTLEQLKSWVRNNG 361 ++ L S VR+ Sbjct: 352 QMPIDDLVSSVRDQV 366 >UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ3_THEFY Length = 373 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. Identities = 103/369 (27%), Positives = 168/369 (45%), Gaps = 23/369 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 F++IH + + S +NRDD+ K ++GGK R R+SSQS KRA+R +G+ + Sbjct: 2 TFVDIHAIQTLPYSNINRDDLGSPKTVVYGGKERTRVSSQSWKRAVRHEV--EARLGDKA 59 Query: 63 LRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPW------- 114 +RT ++++ LR++ + + + L GK + D+ P Sbjct: 60 VRTRRIISEIAKRLRERGWDADLADAGARQVVLSVGKKSGIKLEKEKDSEAPATSVLFYL 119 Query: 115 ---VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGM 171 + E+A ++ A A K +L D + + V + L GRM Sbjct: 120 PVPAIDELAAIADEHRDAVAKEAAKKTPKGILPADRITEVLKSRN-VSVNLFGRMLAE-- 176 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYA 228 +VDGA+ AHA T H ++D+FTAVDD+ + GS H+ +FS+G FYRYA Sbjct: 177 -LPSTEVDGAVQFAHAFTVHGTTVEVDFFTAVDDIPKENDHGSGHMNAGQFSAGTFYRYA 235 Query: 229 NINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMP 287 N+NL +L EN G + A + + VP KQ AA D+V + D P Sbjct: 236 NVNLDRLVENTG--DAQTARTAVAEFLRAFLSTVPSGKQNATAAMTLPDLVHIAVRFDRP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +S A AFE A+ DG+ + Q N Y +R+ + + ++ + + A ++ Sbjct: 294 ISFAPAFETALYGSDGYTLRACQELNNYAERLREVWPDDAIRGYATVENKTDLAALGERY 353 Query: 348 PTLEQLKSW 356 + L Sbjct: 354 DSYPALIDA 362 >UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW7_ACIFD Length = 386 Score = 358 bits (918), Expect = 3e-97, Method: Composition-based stats. Identities = 111/373 (29%), Positives = 169/373 (45%), Gaps = 25/373 (6%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN-IGESSL 63 I++HVL + PSCLNRDD N K A++GG RR R+SSQS KRA R+ IG L Sbjct: 9 IDVHVLQTLPPSCLNRDDTNAPKTALYGGARRARVSSQSWKRATRRYFNENLATIGTDWL 68 Query: 64 RTI----HLAQLRDVLRQKLGER-----FDQKIIDKTLALLSGK---SVDEAEKISADAV 111 R+ +L +L +++ R + + + + L +G +E K A Sbjct: 69 RSRGGGIRTRKLAGLLHERVQARVRDLDVREDDVARLVNLAAGALLGLKEEKLKKRAQET 128 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSG 170 P + + E A L+ + L D+ + +D+AL GRM Sbjct: 129 QPADLEYALFVSESAIDAAVGELERSLRAGDDLDLDVLTTAMGRDLSLDVALFGRMIAD- 187 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRY 227 T VD A +AHAI+TH+V S+ D++T VDDL E G+A +G EF+S YR+ Sbjct: 188 --TPNLNVDAACQVAHAISTHRVTSEFDFYTTVDDLAGDDETGAAMMGFIEFNSATVYRF 245 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDM 286 A ++L +L +NLG + + A +P Q T+AA D+V V+ D Sbjct: 246 ATVSLGRLADNLG--DPDAVPTGVRAFIEAFAKSLPTGHQNTFAALTVPDLVFVSMRGDQ 303 Query: 287 PLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL--SDVDPITAQV 344 P+S+ AFE V++ G++ S + Y D + YG+ S + + Sbjct: 304 PVSLVGAFEAPVESDRGYVHASAERLATYADDIDGLYGVPRLNGWASYVPKLEQAVATHL 363 Query: 345 KQMPTLEQLKSWV 357 QL V Sbjct: 364 GDSIAFPQLLDAV 376 >UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes RepID=Q03C61_LACC3 Length = 361 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. Identities = 108/365 (29%), Positives = 176/365 (48%), Gaps = 30/365 (8%) Query: 1 MSN---FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN 57 M+N +I+IHVL + + +NRDD K A++GG R R+SSQS KRAMR ++ Sbjct: 1 MTNKNLYIDIHVLQTVPSANINRDDTGAPKKALYGGVTRARVSSQSWKRAMRLRFN-QED 59 Query: 58 IGESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ LRT + Q LR L+ D++I K A+ S + + A+ Sbjct: 60 HDDAGLRTKEVPQLLRQALKAAAPALTDEEIAAKVDAVFSTAKIKITKDGQTGALMLIST 119 Query: 117 GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELG 176 G++ + EA LD K+L K+ K +Q +D+AL GRM Sbjct: 120 GQLKKLAQYALDNEA--LDKKELTKLFK---------GEQSLDLALFGRMVADN---PEL 165 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+G+ +AHAI+TH++ + D+FTA+DD + G+A LGT E++S YRYAN+N Sbjct: 166 NVEGSAQVAHAISTHEIVPEFDYFTALDDFKPEDNAGAAMLGTVEYNSSTLYRYANLNFQ 225 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMAN 292 + + N+GG A+ A + +P KQ T+A + VMV D P+++ + Sbjct: 226 EFEANIGGR---AAVSGALSYIKEFLLSMPNGKQNTFANKTLPNYVMVTLRPDTPVNLVS 282 Query: 293 AFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQ 352 AFE VK+ G+++ S++ Q + ++ + + +Q + Sbjct: 283 AFEDPVKSNHGYVEASVKRLEQEYQ--DALQFVDAPLFTAVVGKTN--GEVGEQQANVNG 338 Query: 353 LKSWV 357 L V Sbjct: 339 LLDAV 343 >UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTA9_SACVD Length = 390 Score = 352 bits (905), Expect = 8e-96, Method: Composition-based stats. Identities = 105/350 (30%), Positives = 162/350 (46%), Gaps = 24/350 (6%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 +I+IHV+ + S +NRDD K FGG R R+SSQS KR +R+ GE+ Sbjct: 4 PKYIDIHVIQTLPFSNVNRDDTGSPKTVEFGGVERTRVSSQSWKRVVRQH-VEEAVGGET 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVD-EAEKISADA---------- 110 + + + L ++ E+ + + +AL +GK + + EK +D Sbjct: 63 VRTRRVVVGVAERLIKQGWEKSEAEAAGVQIALSAGKKISLKQEKDESDEVVLTTNVLLL 122 Query: 111 VTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN---LQQGVDIALSGRMA 167 + + E+A ++ + K L +K + + R+N ++ I L GRM Sbjct: 123 LPESGIDELAALADEHREVILAEAKKAKKLTGMKPKLPSERINEILSRRSATINLFGRMV 182 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGV 223 VDGA+ +AHA TTH + D+FTAVDD+++ GS ++ T FS+G Sbjct: 183 AE---LPGANVDGAVQVAHAFTTHGTAVEYDFFTAVDDIEQKLDLPGSGYMDTALFSAGT 239 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRYAN+NL L NL + A + + T VP KQ AA D+V V Sbjct: 240 FYRYANVNLTDLLRNL-DQDTDLARVLVKTFLDGFITTVPSGKQNATAAVTLPDLVHVTV 298 Query: 284 S-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 D P+S+ANAFE V DGF++ S + + +A G + Sbjct: 299 RDDRPVSLANAFEAPVGGGDGFVRKSAHRLDSHAGAIAELLGESHVLFSA 348 >UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4B Length = 383 Score = 352 bits (904), Expect = 1e-95, Method: Composition-based stats. Identities = 102/381 (26%), Positives = 156/381 (40%), Gaps = 36/381 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA-QNIGESS 62 I +H+L S S LNRDD+ K A FGG R RISSQSLKRA R + E Sbjct: 2 LIELHLLQSFPVSNLNRDDLGQPKTARFGGHTRARISSQSLKRAARTLLAQHGLDPSELG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA----------DAVT 112 +RT L L L ER +K + + + A + Sbjct: 62 VRTKRLRDAAASL---LAERGREKEQAVEVCQAGLEEIGFAAHTATGLTKYLLYVGKPAQ 118 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIR------------VNLQQGVDI 160 + + +AK A+ K+ + AA + ++ + DI Sbjct: 119 TLLADYCDERWDTLAKTVAEAKKRKEKQEKTPRKTAAKKPTKQAQEQAKRILDGTRAADI 178 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQ 217 AL GRM V+ A +AHA++TH V ++ D++TA+DDL E + +GT Sbjct: 179 ALFGRMIADNTDF---NVNAASQVAHALSTHAVVNEFDYYTALDDLRPDAEPAADMIGTV 235 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 +F++ FYRYAN++L QL NL + A +H VPG KQ + +A Sbjct: 236 DFNAACFYRYANLDLEQLATNLPD-DPDLVARSARAWLHSFIHAVPGGKQNSMSARTMPQ 294 Query: 278 MVM-VNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ--FSL 334 ++ V ++ANAF V + S Q ++ ++ + YG S+ Sbjct: 295 TLLGVVRETGAWNLANAFLSPVTDVPDLMAASTQRLVDHFQQLRSFYGDTQLRHTTIASI 354 Query: 335 SDVDPITAQVKQMPTLEQLKS 355 + + PTL+ S Sbjct: 355 GSDPAGMPENEIAPTLDDFVS 375 >UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA6_DESDA Length = 350 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 100/370 (27%), Positives = 166/370 (44%), Gaps = 31/370 (8%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD+ K A+FGG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDLGSPKTAVFGGVQRARVSSQCWKRAIREYCGELLPQHFKG 61 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + + LRD+ G ++ ++D+ T + Sbjct: 62 ERTRLIVEPLRDIFINTYGLDEATALVKANDLAEGLATLDKDAAKKNKLQTKTLFFTSRS 121 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A +N + KK K + + DIAL GRM S ++GA Sbjct: 122 ELEALAAIAVNNENIKKHAKTFAQSLCT------DAADIALFGRMVASA---PELTLEGA 172 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQL--Q 236 +HA++TH+ D++ID+F+A+DDL +E G+ GT EF++ +YR+ +NL L Sbjct: 173 AMFSHALSTHKADNEIDFFSALDDLLPSEETGAGMTGTLEFNAAAYYRFCALNLDMLADA 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAF 294 ++LG S ++ I V +P A++ + A V+ D P+ + NAF Sbjct: 233 DHLGALSPDERQGIVAAFVEATLKAMPVARKNSMNANTMPAYVLCVLRDSGQPVQLVNAF 292 Query: 295 EKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQ 352 EKAV + D G+++ SI+ + + R+ N +GL + +L + Sbjct: 293 EKAVYSPDGRGYVEASIKRMEEEYQRLENTWGLTAVETIRMP------------LQSLGE 340 Query: 353 LKSWVRNNGE 362 L VR + Sbjct: 341 LLQGVRRHVR 350 >UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY0_THASP Length = 394 Score = 350 bits (898), Expect = 5e-95, Method: Composition-based stats. Identities = 108/394 (27%), Positives = 170/394 (43%), Gaps = 38/394 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI IH L ++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 3 LPRFIQIHTLHTYPAALLNRDDAGLAKRLPYGGAIRTRISSQCLKRHWRVADDAFSLAKL 62 Query: 59 G-ESSLRTIHLAQLR-DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--- 113 G + RT ++A+L L ++ + + L + +K A+ Sbjct: 63 GVPMATRTRYVAELIRQRLIEQGIDEARAYATAEALLEALFGEKADKKKEGVKALQTGQA 122 Query: 114 --WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN-----LQQGVDIALSGRM 166 + EIA+ + D D L + + + + N L G++ AL GRM Sbjct: 123 VLFGNEEIAYLARRCRDITGDFSDPVALKAEVAKFLKEEKKNIEAMKLGSGLESALFGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSG 222 TS + L D ++S+AHA T H+ + D+FT VDD + GSA + E +SG Sbjct: 183 VTSDL---LANRDASVSVAHAFTVHEAQVENDYFTVVDDFAQAEDGAGSAGIFDTELASG 239 Query: 223 VFYRYANINLAQLQENLGGASRE-----------QALEIATHVVHMLATEVPGAKQRTYA 271 ++Y Y I++ QL NL G E A ++ H++H++AT PGAK+ + A Sbjct: 240 LYYGYVVIDVPQLVANLEGIKVEDVFTIGADKRGLAGKVVQHLLHLIATVSPGAKRGSTA 299 Query: 272 AFNPADMVMVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGA 328 ++ A V+V D P S+A AF + K + + YG+ A Sbjct: 300 PYDWAKFVLVEAGDWQPRSLAAAFHDPIPLKGDSSIRGRAASKLAKEIAAFDAAYGMPTA 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 SL D + + TL QL W+ Sbjct: 360 RRFLSL---DELAVPAAERATLSQLGEWIAQTVR 390 >UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ6_SALAI Length = 380 Score = 349 bits (896), Expect = 9e-95, Method: Composition-based stats. Identities = 109/387 (28%), Positives = 175/387 (45%), Gaps = 32/387 (8%) Query: 1 MS-NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ +++IHVL + + LNRDD+ K FG R R+SSQS KRA+R+ ++ G Sbjct: 1 MTARYVDIHVLQTVPYANLNRDDLGSPKTVRFGYADRTRVSSQSWKRAVRRE--LEESSG 58 Query: 60 ESSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 + + RT L Q +L +++ + + +K + +A Sbjct: 59 DKAKRTRRLPQAIQARLTGPDWDSELAAFAATQVMATLATIAVKADGFKVDKATGEAQVL 118 Query: 114 WVVGEIAWFC---------EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSG 164 + + E A+ +++ + L KK L D + + V I L G Sbjct: 119 FYLPERAFDMLADVCVQQRDRLIGLRSGALKLKKGEAPLPADAVRAAMEHRSDV-INLFG 177 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFS 220 RM VDGA+ +AHA TTH D +D+FTAVDDL+ + GS H+ + EFS Sbjct: 178 RMLAE---LPGSNVDGAVQVAHAFTTHGTDPQVDFFTAVDDLKQDADQAGSGHMNSAEFS 234 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +G FYRYA++NL L NLG A+E+ + T +P AK+ A F ++ Sbjct: 235 TGTFYRYASVNLEDLAHNLG--DPATAVELTRVFLSAFITAMPQAKKNATAPFTVPELAY 292 Query: 281 VNFS-DMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV 337 + D P+S+A+AFE V+A G+ +PS + +Y ++ G G S Sbjct: 293 IAVRTDRPVSLASAFETPVRATFDSGYAEPSRRQLAEYAGQIYRLIGDQGMVYHGCASVD 352 Query: 338 DPITAQ-VKQMPTLEQLKSWVRNNGEA 363 D Q + + + L + + A Sbjct: 353 DKGLEQLGETRQSFDNLIATAVDKLRA 379 >UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD49_CHRVI Length = 393 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 178/395 (45%), Positives = 237/395 (60%), Gaps = 40/395 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 NF+N HVLISHSPSCLNRDDMNMQK AIFGGK RVRISSQSLKRA+R S YYA+ S Sbjct: 5 NFVNFHVLISHSPSCLNRDDMNMQKTAIFGGKTRVRISSQSLKRAIRYSDYYARYFISKS 64 Query: 63 LRTIHLA-QLRDVLRQKLGERFDQKIIDK----TLALLSGK-SVDEAEKISAD------- 109 RT L ++ D L I+K A+ GK +DE K D Sbjct: 65 QRTRRLFDKMADELSASAESAEQTTAIEKCALYAAAIFEGKTKIDEIGKYERDKKSDHIE 124 Query: 110 -AVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ--GVDIALSGRM 166 + P+ EI + + EA +K ++ +K +I + + +D+ALSGRM Sbjct: 125 TQIIPFSCAEIEGIKQIL--LEAAGKPEKGRIEYMKAEIQRLEREQRTRIDLDVALSGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDI-DWFTAVDDLQ----EQGSAHLGTQEFSS 221 A S ++ VDGA+++AHAITTH V+ DWFTAVDDL E G+ HL TQ+FS+ Sbjct: 183 ANSELIYP---VDGALAVAHAITTHTVEPQDIDWFTAVDDLTLDAGETGAGHLNTQQFSA 239 Query: 222 GVFYRYANINLAQLQENLG----------GASREQALEIATHVVHMLATEVPGAKQRTYA 271 GVFYRYA++NL QLQ NLG SR +AL+IA HV+H+LAT VP AKQ+++A Sbjct: 240 GVFYRYASLNLRQLQFNLGLLANINAEQTTESRARALDIARHVLHLLATVVPSAKQQSFA 299 Query: 272 AFNPADMVMVNFSDMPLSMANAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGA 328 A N AD V+V+ +D P+S+ANAFE+ ++ + GFLQPSI A YW RV + YGL+ Sbjct: 300 AHNLADFVIVSLADQPVSLANAFEEPIERERKIGGFLQPSITALADYWSRVNSAYGLDEQ 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 A F+L + Q + + ++ L+ W+ N+G A Sbjct: 360 ARAFALRGGIKLGDQ-EVLTSIADLEQWLANDGRA 393 >UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR15_ROSS1 Length = 402 Score = 347 bits (890), Expect = 5e-94, Method: Composition-based stats. Identities = 114/401 (28%), Positives = 177/401 (44%), Gaps = 50/401 (12%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS- 62 I +H+L +H+PS LNRDD N KDAIFGG RR RISSQ++KR++R S ++ Sbjct: 2 LIALHLLQNHAPSNLNRDDNNEPKDAIFGGVRRARISSQAIKRSIRWSDHFRAPFETQGL 61 Query: 63 --LRTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSV------DEAEKISADAVTP 113 +RT L + L DQ+ I + A L EA D P Sbjct: 62 LAIRTQLLPEKVRHHLVNAGLNDDDQRAIVEAAARLGKGEQRSPSGEGEAGDERGDQNQP 121 Query: 114 WVV---------GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNL---------- 154 E++ A+ L + ++ ++ + I +R Sbjct: 122 RSSSRSRRSSRQSNTTGDAERIKTAQLMFLTENEIQQLAQRLIEIVREKGAKHLNELQGD 181 Query: 155 ----------QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVD 204 VDIA+ GRM TS V+ A+ +AHAI+TH V+ + D++TAVD Sbjct: 182 TLVREIGEYEPHSVDIAMFGRMTTSS---PFKDVEAAVQVAHAISTHAVEMEFDFYTAVD 238 Query: 205 DLQ-EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVP 263 D+ E G+ +G F+S +Y+Y +I+ L +NL G + A + ++ +P Sbjct: 239 DISGEAGAGFIGDTTFNSATYYKYFSIDWDGLLKNLHG-EQNVARQSVEALIRAALFAIP 297 Query: 264 GAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRV 319 KQ ++AA N D+ +V LS ANAF K V+A ++ S +A +Y + Sbjct: 298 SGKQNSFAAHNLPDLALVEVRKENIALSYANAFVKPVRATGKLSLIEASAKALEEYIPAI 357 Query: 320 ANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Y L+ A S V + + LE+L +W+ Sbjct: 358 NERYNLSAQRAFLST--VPFTLSGAECCSDLEKLITWLSKQ 396 >UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=Dehalococcoides RepID=D0Y919_9CHLR Length = 427 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 101/387 (26%), Positives = 165/387 (42%), Gaps = 60/387 (15%) Query: 6 NIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS-LR 64 IH++ + +PS LNRDD K A FGG RR RISSQ KR+ R G A+ + +R Sbjct: 9 EIHLIQNFAPSNLNRDDTGQPKSATFGGFRRARISSQCSKRSTRLQGPLAELLENQGAVR 68 Query: 65 TI-HLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE----- 118 T + ++ + K E D++ I+ + ++ K S + Sbjct: 69 TRQLIMEIAKAIDTK--EEPDERTIEIVAGVFEAGGLERPAKRSGKVKSQAAEAIGEDGE 126 Query: 119 --------------IAWFCEQVA-----KAEADNLDD-----KKLLKVLKEDIAAIRVNL 154 I F +++A +N DD K++ + + + + Sbjct: 127 INGNEGFESGNKTKILLFLDKMAFPKLIDVFKENWDDLAKGNKEVKEKACDKVGRLLFEA 186 Query: 155 QQGVDIALSGRMATSGMMTELGK----VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--- 207 + DIAL GRM T GK V+ A +AH I+TH++D ++D++TAVDDL Sbjct: 187 VKAPDIALFGRMLEVKNNTPFGKYNMSVEAACQVAHPISTHKIDMEMDFYTAVDDLNPDG 246 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG---------------GASREQALEIAT 252 E G+ +G F+S +YRYA ++ QL NL E+A ++ Sbjct: 247 ETGAGMMGVVGFNSACYYRYALVDRDQLARNLARKTERKNGGWAQGLETQDYEEADKVVK 306 Query: 253 HVVHMLATEVPGAKQRTYAAFNPADMVMVNFS--DMPLSMANAFEKAVKA---KDGFLQP 307 + + +P KQ ++AA N + +P+S+ANAF ++ D + Sbjct: 307 AFLEAMIYAIPTGKQNSFAAQNLPSFGLFVKRKGGVPVSLANAFSTPIRPVRDDDDLVGL 366 Query: 308 SIQAFNQYWDRVANGYGLNGAAAQFSL 334 S+ A ++WD + YG G Sbjct: 367 SVNALTKHWDAIKELYGDQGIKVTSCF 393 >UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ62_9DELT Length = 341 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 98/331 (29%), Positives = 157/331 (47%), Gaps = 20/331 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD K A+FG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDFGSPKTALFGNVQRARVSSQCWKRAVRELMQEEVPALFGG 61 Query: 63 LRTIHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + L +L ++ G ++ A G +V + + T + + Sbjct: 62 QRTRLILDPLCRILHEQHGLAEEEAR---KKAEELGAAVSKLDTPPVRVKTLFFTSPLE- 117 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A A + KK +K L + L+ DIAL GRM S L +GA Sbjct: 118 -LEALAAAYVATGNAKKAVKELAKHP------LKDAADIALFGRMVASDHSLTL---EGA 167 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQEN 238 +HA++TH+V ++ID+F AVDDL E G+ GT EF+S +YR+A +NL L+++ Sbjct: 168 AMFSHALSTHKVSNEIDFFAAVDDLQPEDEAGAGMTGTLEFNSATYYRFAALNLDLLEQH 227 Query: 239 LGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAFEK 296 L S E+ E+ + V VPGA++ + A V+ + P+ + NAFEK Sbjct: 228 LSALSAEERREVVCNFVTATLRAVPGARKNSMNAATLPSHVLAVVREKGHPVQLVNAFEK 287 Query: 297 AVKAKDGFLQPSIQAFNQYWDRVANGYGLNG 327 V + G ++ S+ + + + +GL Sbjct: 288 PVWTRGGLMEESVSQLEREYTHLKETWGLEA 318 >UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI0_9BIFI Length = 381 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 92/375 (24%), Positives = 159/375 (42%), Gaps = 25/375 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ + I+ + + PS +NRDD K AI+GG R R+SSQ+ KRAMR++ + + Sbjct: 1 MTTIVEIYAIQNVPPSNINRDDTGNPKTAIYGGVLRARVSSQAWKRAMREAFPEMLDADQ 60 Query: 61 SSLRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA------VTP 113 +RT + LAQ+ + K + D + + K + + EK +T Sbjct: 61 LGIRTKNALAQIEQSIVAKRPD-IDVETVHKAATAALTATGAKVEKSKRKGSMEGADLTQ 119 Query: 114 WVVG----EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATS 169 +++ EI + + D K K +K ++A+ + Q VDIAL GRM Sbjct: 120 YLIFIANREIDKLADLAIAWIDADEDLDKPSKEMKGQVSAV-FHGPQAVDIALFGRMLAD 178 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFT---AVDDLQEQGSAHLGTQEFSSGVFYR 226 D + +AHAI+ +V + D+FT G+A L T F+S YR Sbjct: 179 A---PELNTDASAQVAHAISVDEVTPEYDYFTAIDDDAADDNAGAAMLDTVGFNSSTLYR 235 Query: 227 YANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD- 285 YA + + L E L A ++ V+ +P KQ T+A +V + Sbjct: 236 YATVAVDSLYEQLQSAD--MTVKAVDAFVNAFLRSMPTGKQNTFANRTLPTAALVVVRNS 293 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITA 342 P++ AFE+ V A+ + + + + + + YG AA ++ + Sbjct: 294 QPINPVEAFERPVHAERDKSISRVAAERLGRKLQDIQDTYGETPIAAWNIVAGQPVELLD 353 Query: 343 QVKQMPTLEQLKSWV 357 + + TL + + Sbjct: 354 SLSEHVTLPVMVESL 368 >UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=Anaeromyxobacter RepID=B4UE70_ANASK Length = 413 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 107/415 (25%), Positives = 179/415 (43%), Gaps = 55/415 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M+ F+ IH L S+ S LNRDD K FGG R R+SSQ LKR R G Sbjct: 1 MNRFVQIHTLTSYPASLLNRDDAGFAKRIPFGGVTRTRVSSQCLKRHWRTFEGEGALSGL 60 Query: 60 --ESSLRTIHLAQ---LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA----------- 103 S+R+ + ++ ++ + + +++ ++ + GKS A Sbjct: 61 GQPMSVRSRYTFDELVVQPLVGEGVPAELAREVTRALMSEVLGKSAKAAKADARADEKEE 120 Query: 104 ---------EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAA----- 149 + +T E+A+ E D K+ K + + + A Sbjct: 121 EEDKDAKTESTLQTGQITVLGRPEVAYLLELARTVCRKKPDPAKIAKAVSDHLGADGRKN 180 Query: 150 -IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-- 206 + L G+D A+ GRM TS + L + D A+ +AHA T H ++ D+F+AVDDL Sbjct: 181 LRELRLGAGLDAAMFGRMVTSDI---LARGDAALHVAHAFTVHGEATETDYFSAVDDLPM 237 Query: 207 ----QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 QGS H+G E +SG+FY Y I++ L NL G A R+ A ++A + Sbjct: 238 ARTEDGQGSGHIGNAELTSGLFYGYVVIDVPLLVSNLEGVDRKAWEKADRKLAAQLAERM 297 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAV---KAKDGFLQPSIQ 310 V ++AT PGAK + A A +V+ + P ++ANAF + V + + + + Sbjct: 298 VKLVATVSPGAKLGSTAPHAYAHLVLAESGNAQPRTLANAFLEPVVTGPRQPDPVAAAYR 357 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDPITA--QVKQMPTLEQLKSWVRNNGEA 363 A ++ + YG ++ D + + +L ++ +WV + Sbjct: 358 ALARHSADLDRMYGPAFQRRLAAIGPADGLADVLRAPANASLAEVATWVADQVRG 412 >UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=Acetobacteraceae RepID=A5FTJ7_ACICJ Length = 370 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 99/370 (26%), Positives = 156/370 (42%), Gaps = 26/370 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ +H+L PS +NRDD K A+ GG R+R+SSQ+LKRA R S +++ + G Sbjct: 1 MTQFLQVHLLTFFPPSNMNRDDTGRPKTAMVGGAMRLRLSSQALKRAWRTSTIFSEALKG 60 Query: 60 ESSLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 RT L ++ L+ + + + +A GK ++ + E Sbjct: 61 YMGERTQRLGEEILKTLQAEGVSEVQALAVARAVAGQFGKLNEDETPARIQQLAFISPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAA-------------IRVNLQQGVDIALSGR 165 + + A L + K + + DIAL GR Sbjct: 121 RKAAFDLARRYAAGELPLPEKAKGKRGKANKTEGEEEVEAPEILLLRESDTAADIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M + A +AHAITTH++ D D++TAVDDL ++ G+ +G F S Sbjct: 181 MLADKPAF---NREAAAQVAHAITTHRISVDDDYYTAVDDLKRPSEDAGAGFIGETGFGS 237 Query: 222 GVFYRYANINLAQLQENLGGAS--REQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 GVFY Y +IN+ L NLGG R+ A +V AT P KQ ++AA A + Sbjct: 238 GVFYTYMSINIDLLIRNLGGGDQARDLAATAIAALVEAAATTAPSGKQNSFAAHGRAGYI 297 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD 338 + P ++A AF K V+ + SI ++ + + YG A + + Sbjct: 298 LAERGKAQPRTLAGAFAKPVEG-GDIMDASIGRLEEFREAIDKAYGPTADATKVMRVGGE 356 Query: 339 PITAQVKQMP 348 A + Sbjct: 357 GSLADIIVFA 366 >UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS8_STRKN Length = 393 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 113/384 (29%), Positives = 169/384 (44%), Gaps = 33/384 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 + FI++H++ S + LNRDD N K +G R R+SSQS KRA R+ + + IG++ Sbjct: 5 ARFIDVHIVQSVPFANLNRDDTNSVKTVQYGNTLRTRVSSQSWKRATRE--VFQERIGQA 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT--------- 112 +LRT + + + G A + E K AD Sbjct: 63 ALRTRRIGERVTQELEGRGWPPALAQRAGGHAAAASSIKFELAKDPADNKQFLPNTVLTN 122 Query: 113 ------PWVVGEIAWFCEQVAKAEADNLDDKKLLKV--LKEDIAAIRVNLQQGVDIALSG 164 V E+A EQ + D KK L +D + + GV I L G Sbjct: 123 AMVYVPEAAVAELADLAEQHRQELESAKDIKKPADKSVLPKDAVEAVLRSRNGV-INLFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-----QEQGSAHLGTQEF 219 RM + VDGA+ +AHA+TTH+ D ++D+F+AVDD+ GS H+G EF Sbjct: 182 RMLAE---VDDAGVDGAVQVAHAMTTHETDVELDYFSAVDDITAAWKDSTGSGHMGHTEF 238 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S+G FYRYA ++L L N+GG R A E+ + +P AK+ + A D+V Sbjct: 239 SAGTFYRYATVDLRDLATNIGGEVRA-ARELIAAFLASYIESLPQAKKNSTAPHTIPDLV 297 Query: 280 MV-NFSDMPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ-FSLS 335 + SD PLS A AFEK V+ A GF + S Y G ++ Sbjct: 298 HISVRSDRPLSYAAAFEKPVRAGAPGGFGEVSRAELATYAQAANTLLGTGRIVTSGWASL 357 Query: 336 DVDPITAQVKQMPTLEQLKSWVRN 359 + +T + + + L + + Sbjct: 358 ETKDLTGLGTRHESFDDLITAALD 381 >UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RXJ6_RHORT Length = 381 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. Identities = 103/382 (26%), Positives = 173/382 (45%), Gaps = 30/382 (7%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S F+ IH L S++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 4 SRFLQIHSLHSYTAALLNRDDSGLAKRLTYGGSNRTRISSQCLKRHWRMAEHDPHALQTL 63 Query: 62 S-----LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 R+ L D++ + L R+ Q I+D + + ++ Sbjct: 64 GGYVGSFRSRELV--TDLVIKPLEGRYPQDILDVLEPEFQKLVYGDKADKGKKSRQTLLL 121 Query: 117 G--EIAWFCEQVAKAEADNLDDKKLLKVLKE-------DIAAIRVNLQQGVDIALSGRMA 167 G E+AW + + A D K L K + + + L G+ AL GRM Sbjct: 122 GQPELAWLARRAEELAAGANDAKALQKAVADWRKDANFKAMSENAALPGGLVAALFGRMV 181 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGV 223 TS +D + +AHA T H +++ D+FTAVDDL+ + G+ + E +SG+ Sbjct: 182 TSD---PAANIDAPVHVAHAFTVHAEEAEGDYFTAVDDLKKDESDSGADTIQETELTSGL 238 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FY Y I+L L N GG +E A ++ ++V+++A PGAK + A + AD++++ Sbjct: 239 FYGYVVIDLPGLIGNCGG-DKEIAAQVVNNLVYLIAEVSPGAKLGSTAPYGRADLMLIEA 297 Query: 284 SD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITA 342 D P S+A A+ KA+ + ++ A + ++ Y A SL++ Sbjct: 298 GDRQPRSLATAYRKAIAPD---REQAVAALDGCLAKLDATYETGEARRYLSLAETPLTGP 354 Query: 343 --QVKQMPTLEQLKSWVRNNGE 362 + +L+ L W + + Sbjct: 355 ATSGLEKLSLKALADWTASRVK 376 >UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF9_GRABC Length = 386 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 105/388 (27%), Positives = 162/388 (41%), Gaps = 39/388 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR----KSGYYAQN 57 F+ IH L S++ S LNRDD + K +G R RISSQ LKR R + Sbjct: 4 PRFLQIHSLHSYTASLLNRDDSGLAKRLPYGSAVRTRISSQCLKRHWRMDEGTFSLHRIE 63 Query: 58 IGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 E ++R+ L LR+ L D I++ + + P + G Sbjct: 64 GAEEAVRSRDLV--TKRLREPLQGTVDVNILNAIEPAFQAAVYGKKGADDKSSRQPLLFG 121 Query: 118 --EIAWFCEQVAKAEADNLDDKKLLKVLKE-----------DIAAIRVNLQQGVDIALSG 164 E+ + EQ + D K ++ V+L G+ AL G Sbjct: 122 APELRYLAEQFTRIATSATDPKSAKAAAEDFTKDKLFQNTMKAMRDSVSLPGGLTSALFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSS 221 RM TS +D + +AHA TTH ++ D+F VDDL ++ G+ H+G+ E +S Sbjct: 182 RMVTSD---PEANIDAPVHVAHAFTTHAEQTESDYFAVVDDLAGVEDTGADHIGSTELTS 238 Query: 222 GVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 G+FY Y I++ L NL G A R+ A E+ ++ +AT PGAK + A + Sbjct: 239 GLFYGYVVIDVPTLVSNLTGVAASNWLAADRKMAAEVTACLIGQIATVSPGAKLGSTAPY 298 Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 A ++V D P S+A AF + ++ + +Q Y Sbjct: 299 GYATTMLVEAGDRQPRSLAEAFRDPAEPT---VKDAEDKLHQKLKAFDEAYQTGEDRRLL 355 Query: 333 SLSDVDPITAQVKQMPTLEQLKSWVRNN 360 SLS+ I + +L +L WVR+ Sbjct: 356 SLSNDPGIKNVSRT--SLPELMQWVRDT 381 >UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=B6B782_9RHOB Length = 353 Score = 340 bits (872), Expect = 6e-92, Method: Composition-based stats. Identities = 107/330 (32%), Positives = 158/330 (47%), Gaps = 19/330 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ H+L ++ S NRDD K A+ GG R+RISSQSLKRA+R+S Y+AQ++ G Sbjct: 1 MTTFVQFHLLTTYPLSNPNRDDQGRPKQAMIGGSPRLRISSQSLKRALRESSYFAQDLAG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 + RT L L+ +L + + A G + EK S +A T + Sbjct: 61 HTGTRTRR---LATELKAELIGQGVEDAHADETATKIGAVFSKTEKGSTNATTLAFISPD 117 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 W +A+ A + L K+ AI VDIA+ GRM D Sbjct: 118 EW---ALARELAARDVAGEPLPAEKDLKKAILRRADGAVDIAMFGRMLADS---PDYNRD 171 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQL 235 A+ +AHA TTH+ + DWF+AVDDL+ + G+ H+G F SG++Y YA +N+ L Sbjct: 172 AAVQVAHAFTTHRAQAQDDWFSAVDDLKTREVDAGAGHIGEHGFGSGIYYLYACVNVDLL 231 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ENL G R A + + LAT P KQ ++A A + V P ++ AF Sbjct: 232 VENLAG-DRALAAKGMEALARALATATPKGKQNSHAHHPRAGFIRVERGQQQPRDLSGAF 290 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 K A + + S++A ++ YG Sbjct: 291 HKPTAADE---RASVEALQGMAAKIDRAYG 317 >UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP61_9PROT Length = 400 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 97/395 (24%), Positives = 169/395 (42%), Gaps = 42/395 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR-KSGYYAQNIGE 60 FI IH L ++ + LNRDD + K G R RISSQ LKR R +A + + Sbjct: 4 PRFIQIHTLHTYPAALLNRDDAGLAKRLPLGNAVRTRISSQCLKRHWRVVEDRFALSCLD 63 Query: 61 --SSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT 112 ++R+ +L + + + + + + D L GK + + Sbjct: 64 VPMAIRSRGTLELISKRIQESGVSETMAQAAAEAMRDAGLLDKGGKEKKGDDALKTGQAV 123 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVL-------KEDIAAIRVNLQQGVDIALSGR 165 EI + + +D +++K +++ E + G++ AL GR Sbjct: 124 LLGKPEIDYLVRRCVDLASDGVEEKGFKELITLWLKGKDEKRNIEALKHGSGLESALFGR 183 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M TS ++T + A+ +AHA T HQ + D+FT VDDL E GSA + E +S Sbjct: 184 MVTSDVLTS---REAAVYVAHAFTVHQAQVENDYFTVVDDLLQDAGELGSAGIFDTELAS 240 Query: 222 GVFYRYANINLAQLQENLGGAS-------------REQALEIATHVVHMLATEVPGAKQR 268 G++Y Y +++ QL +NL G R A ++ H++H++AT PGAK+ Sbjct: 241 GLYYGYVVVDVPQLVQNLEGEDFNECFASGTPADRRVLAGQVVQHLLHLIATVSPGAKRG 300 Query: 269 TYAAFNPADMVMVNFSD-MPLSMANAFEKAVK---AKDGFLQPSIQAFNQYWDRVANGYG 324 + A F+ A ++V D P S+A AF A+ + + ++ + + + YG Sbjct: 301 STAPFDWAKFMLVEAGDWQPRSLAGAFHDALPLSGSGGTIRERTVDRLTKEIAAMDDAYG 360 Query: 325 LNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 + ++ + + L L W + Sbjct: 361 APLSRRFLAIDQ--EVQVPGAERLNLASLADWAKE 393 >UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP1_SYMTH Length = 379 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. Identities = 104/383 (27%), Positives = 177/383 (46%), Gaps = 33/383 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ +H+L + + S LNRDD K +FGG RR RISSQ LKRA+R + Sbjct: 2 FVEMHLLQNFALSNLNRDDTGAPKSCVFGGTRRARISSQCLKRAVRTYVREQALVP---- 57 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLA-LLSGKSVDEAEKISADAVTPWVV----GE 118 + L+ L+++L R ++ A ++ ++++ E + T +++ E Sbjct: 58 -SELLSYRTKWLQRELANRLAAGGVEAEQAGQVAARALELLEFRLKNGRTEYLLMVGERE 116 Query: 119 IAWFCEQVAK--AEADNLDDKKLLKVLKEDIAAI---RVNLQQGVDIALSGRMATSGMMT 173 IA + + A D + K +++A + ++ VDIAL GRM + Sbjct: 117 IARIADLCREHAAALQGGDGGRKSKKEGDNLAGLFLKALDGGDAVDIALFGRMIATH--- 173 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAV------DDLQEQGSAHLGTQEFSSGVFYRY 227 VD A+ +AHA +T+ + ++ D+++AV DD + G+ LGT ++S +YRY Sbjct: 174 PEKNVDAAVQMAHAFSTNAIANEFDFYSAVDDLQQQDDDEGAGAGMLGTVLYNSSCYYRY 233 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 AN++L QL NLGG ++AL + + VP K+ A NP ++M + Sbjct: 234 ANVDLRQLLTNLGG-DPDRALTAVRAFLLGMVHAVPTGKRTNSAPQNPPALIMAVVREHG 292 Query: 288 -LSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGA--AAQFSLSDVDPITA- 342 S+ANAF V A+ ++ S + +W++++ YG G A + D I A Sbjct: 293 LWSLANAFVVPVSGARGNLMELSAKEMLAHWNQLSELYGQEGVHYAGLATYLSSDAIGAS 352 Query: 343 ---QVKQMPTLEQLKSWVRNNGE 362 + L L V + Sbjct: 353 NAVGIAVEKRLADLVDRVLAEVQ 375 >UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET5_RHOM4 Length = 423 Score = 333 bits (854), Expect = 8e-90, Method: Composition-based stats. Identities = 116/422 (27%), Positives = 188/422 (44%), Gaps = 63/422 (14%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-- 59 S F+ IH L ++ + LNRDD K FGG R R+SSQ LK R G Sbjct: 3 SAFVQIHTLTAYPAALLNRDDAGFAKRLPFGGAIRTRVSSQCLKYHWRNFSGEHALYGLD 62 Query: 60 -ESSLRTIHLAQL---RDVLRQKLGERF--------------DQKIIDKTLALLSGKSVD 101 SLR+ + R ++ + R D+ + L VD Sbjct: 63 VPRSLRSRETFKRCIARPLVEEGYPLRLVVAFALHLQKLIVSDESLSKTDFKKLMSDEVD 122 Query: 102 EA---EKISADAVTPWVVGEIAWFCEQVAKA----------EADNLDDKKLLKVLKEDIA 148 +A +++ ++ V E+ + ++ + A L D++L +V +E A Sbjct: 123 DATLLDQLKSNQVIILGRPEVDYLTRRIRERLDALREVWADAAAPLSDEQLERVYQELQA 182 Query: 149 AIRVNLQQ---------GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDW 199 + L++ G+D AL GRMATS + L + D A+ +AHA TTH +S+ D+ Sbjct: 183 IGKGELKKNLKGLYLAAGLDAALFGRMATSDV---LARGDAAIHVAHAFTTHAEESESDY 239 Query: 200 FTAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASRE 245 FTAVD+L E GS HL QE +SG+FY Y +++ L NL G A R Sbjct: 240 FTAVDELVAQEGEGELGSGHLNNQELTSGLFYGYVVVDVPLLVSNLEGVPPAAWQEADRT 299 Query: 246 QALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMANAFEKAVKAKD-G 303 A E+ ++H++AT PGAK + A A ++V + P ++ANAF + V G Sbjct: 300 LAAEVVRRLLHLIATVSPGAKLGSTAPHAYAQFMLVEWGRSQPRTLANAFHRPVSLDGEG 359 Query: 304 FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVK--QMPTLEQLKSWVRNNG 361 L S +A +Y +++ YG ++ + + Q++ + + ++ WV Sbjct: 360 VLVNSYRALGRYVEQMDRMYGKLTERRLAAIDLPEAVQRQLQVDTLNAVPEIADWVAEKI 419 Query: 362 EA 363 + Sbjct: 420 QG 421 >UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA32_ALHEH Length = 385 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 99/385 (25%), Positives = 174/385 (45%), Gaps = 36/385 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R++ ++ S + Sbjct: 2 FLQIHTLTSYHAALLNRDDAGLAKRIPFGSAERMRVSSQCLKRHWRQALKDVISLP-SGI 60 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV-----TPWVVGE 118 RT H + +V R+ + E + + + L + E D++ + E Sbjct: 61 RTRHFFER-EVCRRVIAEGVEDEKARELTGKLIDAVMHSKEAREKDSLFLKQPVLFGRPE 119 Query: 119 IAWFCEQVAKAEADNLDD----KKLLKVLKEDIAAIR-----VNLQQGVDIALSGRMATS 169 +F + + D K +K K++ A+ +L+ G++ AL GR TS Sbjct: 120 ADYFVSLITECARSGEDPGSTLKDRVKAEKKNFRALLQAAGGSDLESGIEGALFGRFVTS 179 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGVFY 225 + L + D ++ +AHA T H +++++D+FT VDDL+E G+AH G E +G+FY Sbjct: 180 DI---LARTDASVHVAHAFTVHSLNNEVDYFTVVDDLKEPGEDAGAAHAGDMELGAGLFY 236 Query: 226 RYANINLAQLQENLGGASR----------EQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 Y +++ L NL G R A ++ +VH +AT PGAK A + Sbjct: 237 GYVVVDVPLLVSNLSGCERQAWREQTEACADARDVLAALVHSIATVSPGAKLGATAPYAR 296 Query: 276 ADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 D ++ P ++ANA+ + + A+ +Q S+ Y + + +G + + Sbjct: 297 TDCALLETGTTQPRALANAYLEPLPARGDLMQQSVNTMGHYLKSLDDMFGEETSRFVSAT 356 Query: 335 SDVD--PITAQVKQMPTLEQLKSWV 357 D P + T++ + Sbjct: 357 RDTTSLPCAHRGPLSETIDGALDSI 381 >UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R6_9BACT Length = 400 Score = 329 bits (845), Expect = 7e-89, Method: Composition-based stats. Identities = 100/401 (24%), Positives = 179/401 (44%), Gaps = 45/401 (11%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGY-----YAQ 56 F+ I L ++S S LNRDD + K FG R RISSQ LKR R +G A Sbjct: 5 PRFVQISTLTTYSASLLNRDDSGLAKRIPFGDSVRTRISSQCLKRHWRNAGGPYGLDKAG 64 Query: 57 NIGESSLRTI-HLAQLRD--VLRQKLGERFDQKIIDKTLALLSGKSV------------- 100 + S+R+ +L + ++ + L ++ K LL Sbjct: 65 DALSLSVRSRFSFPELIEKPLVAEGLEQKLVVSGSQKLQQLLYNGEEKGDTKKDKKKKIE 124 Query: 101 --DEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ-- 156 ++ + + E+ + + + A + + + K++ +K+ + NL Sbjct: 125 LDEDGYSAKRNELVVLGRPELEYLKQIIRDAISSSSNIKEIDNAVKDFYTKRKSNLLALR 184 Query: 157 ---GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAH 213 GVD A+ GR + + KV A+ +AH+ T H S+ D+FTAVDDL EQG+ H Sbjct: 185 AGCGVDAAMFGRFVSGDV---DAKVTAAVHVAHSFTIHGEQSETDYFTAVDDLVEQGTGH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGA 265 + E ++G++Y Y +++ QL NL G A R A ++ ++++H++AT PGA Sbjct: 242 INAAELNTGIYYGYVVVDVPQLISNLCGCDSKNSADADRTLAAQVTSNLIHLMATVTPGA 301 Query: 266 KQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANG 322 K A + + +V+ +SD P ++A+AF + +K + ++Q +Y + Sbjct: 302 KLSGTAPYAASWLVLAEWSDSQPRTLADAFFEGLKLGSDGSARSLAVQMLAEYIRKYDAM 361 Query: 323 YGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 Y + ++P + +L++L V+ E Sbjct: 362 YTPQLTRR---CASIEPCQIPGAENGSLDELCEAVKLAIEG 399 >UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y487_9BACT Length = 408 Score = 329 bits (844), Expect = 1e-88, Method: Composition-based stats. Identities = 105/404 (25%), Positives = 176/404 (43%), Gaps = 55/404 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI I L ++ S LNRDD + K FGG R R+SSQ LKR R + + QN+ Sbjct: 4 LPRFIQISTLTTYPASLLNRDDSGLSKRIPFGGVSRTRVSSQCLKRHWRMADGLWSLQNV 63 Query: 59 GE---SSLRTIHLA--QLRDVLRQKLGERFDQ--KIIDKTLALLSGKSVDEAEK------ 105 + +S+R+ + ++ L +K G ++ + L G EA Sbjct: 64 DKDIATSIRSRRIFPEKIEKPLIEKEGLDAEKVVAASQALQSELYGAKGTEAAAKNKKTA 123 Query: 106 -ISADAVTP---------------WVVGEIAWFCEQVAKAEAD-------NLDDKKLLKV 142 ADA+ P EI + + V + + + K Sbjct: 124 KDDADALNPSIDAQLSAERSELVVLGHPEIQFLSKIVREMASADGSAADVGKKTGEWFKK 183 Query: 143 LKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTA 202 K+D A++ G+D A+ GR + +V A+ +AHA T H +S+ D+FTA Sbjct: 184 HKKDFQALKCGA--GLDAAMFGRFISGDT---DARVSAAVHVAHAFTVHAEESETDYFTA 238 Query: 203 VDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 VDDL GSAH+ E +SG+FY Y +++ QL N+ G A R+ A + H+ Sbjct: 239 VDDLNNSGSAHINAAELTSGIFYNYVVVDVPQLVSNIEGCPSKQWQTAQRDVAGRLVKHL 298 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFN 313 +H++AT PGAK + A + VM + P ++A+AF V + +++ Sbjct: 299 LHLIATVTPGAKLGSTAPYARPWFVMAEAGESQPHTLADAFYLPVPLRGDMRAQALRQLE 358 Query: 314 QYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWV 357 Y + YG + S+ D ++ + +L+++ + Sbjct: 359 DYVGKSDEMYGSDERRWIASMYD---VSIPRGENSSLDRMGESL 399 >UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=Bacteria RepID=B8FDH9_DESAA Length = 383 Score = 327 bits (840), Expect = 3e-88, Method: Composition-based stats. Identities = 111/383 (28%), Positives = 176/383 (45%), Gaps = 46/383 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 + H+L S +CLNRDD+ K A+ GG R R+SSQ KR +R + +++G Sbjct: 11 VEFHILQSFPVTCLNRDDVGAPKTAVVGGATRARVSSQCWKRNIRLT---MKDLGVPIGS 67 Query: 64 RTIHLAQLRDVLRQKLGERFDQK-----------IIDKTLALLSGKSVDEAEKISADAVT 112 RT + Q+ + +LG DQ I +K G + +DA+ Sbjct: 68 RTKLIHQMIEDACAELGADTDQAQACAAQVASVFIKEKKGKKDDGDDSEGNGSDKSDALI 127 Query: 113 PWVVGEIAWFCEQVAKA------EADNLDDKKLLKVLKEDIAAIRVNL-------QQGVD 159 E+ + + + + ++ K KV K + NL + GVD Sbjct: 128 FLSREEVKKIALALRENNFSTEFQEEKVNKKGDAKVEKIKLEKKIQNLLGKPDFSRDGVD 187 Query: 160 IALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAHLGTQE 218 IAL GRM V+GA S +HAI+TH+V +++++FTA+DDLQ E GSAH+G E Sbjct: 188 IALFGRMVAQAAA---LNVEGAASFSHAISTHKVTNEVEFFTALDDLQTEPGSAHMGALE 244 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F+S +YRY +++ QL +NL G QALE V L +P A+Q T + + Sbjct: 245 FNSATYYRYVCLDMGQLWKNLAGQHLPQALEG---FVKALYLALPSARQATQSGACWWEF 301 Query: 279 VMVNFSDMPLSMANAFEKAVKAK-DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSD 336 V + F+ AVK + G L+PS A Y ++ G L A+F+ + Sbjct: 302 AKVFVR-KGQRLQAPFDTAVKPRNGGLLEPSKDALCAYLEKKEQQAGSLFRKIAEFTFGE 360 Query: 337 VDPITAQVKQMPTLEQLKSWVRN 359 + P+++ L +++ Sbjct: 361 DNG--------PSIDDLVLSIQD 375 >UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X7_9DELT Length = 385 Score = 326 bits (835), Expect = 1e-87, Method: Composition-based stats. Identities = 103/387 (26%), Positives = 167/387 (43%), Gaps = 43/387 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 MS FI +H+L S+ + LNRDD+ K FG R+R+SSQSLKRA R S + +G Sbjct: 1 MSRFIQLHILTSYPAANLNRDDLGAPKSMRFGEANRLRVSSQSLKRAWRTSDVFKATLGA 60 Query: 60 -ESSLRTIHLAQLR-----------------------DVLRQKLGERFDQKIID-----K 90 +RT L + L++K + I K Sbjct: 61 DHLGVRTKELGRKVFCALTQGASLDAVWDAPDATGTLAALKEKTAAEIARTIAGVFGKIK 120 Query: 91 TLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDD-KKLLKVLKEDIAA 149 A + + K + + + ++A ++ +A A + + K + Sbjct: 121 KEADAKAEKDADPVKKRKELLDSLEIEQLAHVSQEERRAVAALTEACRDAGKAPDANALN 180 Query: 150 IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-- 207 + + + DIA+ GRM + V+ A+ +AHA+T H+ ++ D+FTAVDDL Sbjct: 181 LLRSDAKAADIAMFGRMLAASARF---NVEAAVQVAHAVTVHRAVAEDDFFTAVDDLNRD 237 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQ 267 + G+ H+G EF +GV+Y Y I+ A L ENLGG + T + T P KQ Sbjct: 238 DAGAGHMGVSEFGAGVYYLYLCIDRALLAENLGG-DEALVQKALTALTTAACTVAPTGKQ 296 Query: 268 RTYAAFNPADMVMVNFS-DMPLSMANAFEKAV----KAKDGFLQP-SIQAFNQYWDRVAN 321 +YA+ A + D P +++ AF K V + +DG L +I + ++ Sbjct: 297 ASYASRAYACFALAEKGDDTPRNLSLAFLKPVGEREEERDGHLGKTAIAELLKTKAKMDK 356 Query: 322 GYGLNGAAAQFSLSDVDPITAQVKQMP 348 YG A F++ D A++ Sbjct: 357 VYGQTLADTSFNVFDGKGTLAELAAFV 383 >UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RY18_RHORT Length = 359 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 105/373 (28%), Positives = 161/373 (43%), Gaps = 32/373 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ +HVL +++ S LNRDD K FGG R+R+SSQSLKRA R+S + + G Sbjct: 1 MSRFLQLHVLTAYAASNLNRDDTGRPKTLNFGGAERLRVSSQSLKRAFRQSELFQSRLPG 60 Query: 60 ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 E R+ A+ L L + E + + L + K + + E Sbjct: 61 ELGTRSQDFAKALVSALVARGVEEAEAITRAEALIDHDKLGKVKKGKAQTEQLVHLGPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 +A + L + + + + VDIA+ GRM V Sbjct: 121 LAAIDALAERLATSAT--------LDDKAMLVLKSKPRAVDIAMFGRMLAGNPGF---NV 169 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ------GSAHLGTQEFSSGVFYRYANINL 232 + A+ +AHA TTH+ + D++T VDD++ G+ LG E+ SG+FY Y IN Sbjct: 170 EAAVQVAHAFTTHRATPEDDYYTTVDDIKNADQEEDRGAGFLGILEYGSGLFYLYICINA 229 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMA 291 L +NL G + A E A ++ T P KQ T+A+ ++ + P S+A Sbjct: 230 DLLVDNLAG-DQALAAEAAALLIEAACTISPTGKQNTFASRARGLYALLEIGEETPRSLA 288 Query: 292 NAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMP 348 AF+ AV ++ L SIQ + + YG N +L DP T Sbjct: 289 AAFQYAVGSRATEADHLAASIQRLTALREGFSKAYGENL--RSVALDVTDPATPG----- 341 Query: 349 TLEQLKSWVRNNG 361 L+ L + R+ Sbjct: 342 -LKALIAAARDAV 353 >UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK1_GEOUR Length = 408 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 105/402 (26%), Positives = 170/402 (42%), Gaps = 53/402 (13%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE-- 60 + +H++ S +CLNRDD+N K A+FGG +R R+SSQS KRA+R+ + Sbjct: 2 KHLELHIIQSVPVACLNRDDLNSPKTAVFGGVQRARVSSQSWKRAIREMAKEIAAEEKSD 61 Query: 61 --SSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALL---SGKSVDEAEKISADAVTPW 114 S RT + L L +K I + +A + VD V + Sbjct: 62 LFSGDRTRRMVYTLSTRLAEKGITSQAAIAIAEQVADVVETLDSKVDSEGYKKIKTVMFF 121 Query: 115 VVGEIAWFCEQVA------------KAEADNLDDKKLLKVLKEDIAAIRVN--------- 153 E E +A + A +D++ K LK + + Sbjct: 122 SKAEYDAIAEAIATSDEVKNSVEALEKAAVEGNDREREKALKAMVKILEKGAISKTIKSA 181 Query: 154 -LQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--EQG 210 L+ DIAL GRM + KVDGA AH ++TH+ D++ID+F AVDDL E G Sbjct: 182 QLKDAADIALFGRMVAND---PSLKVDGASMFAHILSTHKADNEIDFFAAVDDLNKDESG 238 Query: 211 SAHLGTQEFSSGVFYRYANINLAQLQ--ENLGG---------ASREQALEIATHVVHMLA 259 + T EF+S +YR+A +NL L ++LG S E ++ + + Sbjct: 239 AGMTSTLEFNSATYYRFAALNLDALANDDHLGDITLKDGTVVRSVETRKQVVKTFLKAII 298 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAFEKAV-KAKDGFLQPSIQAFNQYW 316 +P A++ T V+ + P+ + NAFE V +++ GF+ SI N + Sbjct: 299 QSIPSARKTTMNGNTLPVYVLGVVREKGHPIQLINAFETPVRRSEKGFVTESINRMNIEY 358 Query: 317 DRVANGYGLNGAAAQF----SLSDVDPITAQVKQMPTLEQLK 354 + +G++ A+ SL + + + + L Sbjct: 359 ADLKETWGVDSLFAKAVVKGSLKEQIKANQGSIETCSQDDLI 400 >UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV95_9BACT Length = 393 Score = 323 bits (828), Expect = 7e-87, Method: Composition-based stats. Identities = 110/389 (28%), Positives = 176/389 (45%), Gaps = 52/389 (13%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 H+L S +CLNRDD+ K A+ GG +R R+SSQS KRA+R + + ++G + Sbjct: 13 FEFHILQSFPVTCLNRDDVGSPKTAMIGGSQRARVSSQSWKRAVRLAMH---DLGVTHGV 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKII--DKTLALLSG---------------------KSV 100 RT ++ L + LG +Q DK A+ + Sbjct: 70 RTKLISPLIAEACRSLGATPEQARACGDKVEAVFIKKDEKGKKKSAKTKGDSDTQDEEVG 129 Query: 101 DEAEKISADAVTPWVVGEIAWFCEQVAKAEAD------NLDDKKLLKVLKEDIAAIRVNL 154 ++ D + EI+ + K E D D KK K + + I + Sbjct: 130 SDSSSEKTDTLLFLSPKEISVLANEFKKQEFDPGKVIVQSDPKKQAKEIADMIGKVP-EG 188 Query: 155 QQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAH 213 VDIAL GRM V+ A S AHAI+TH+V +++++FTA+DD + G+AH Sbjct: 189 IDAVDIALFGRMVAQAA---ELNVEAAASFAHAISTHKVANEVEFFTALDDCAVDPGAAH 245 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G+ EF+S +YRY +++L QL + L G + V L VP A+Q T + Sbjct: 246 MGSLEFNSATYYRYVSLDLGQLSQTLAGQHIPET---IEAFVKALFVSVPAARQSTQSGA 302 Query: 274 NPADMVMVNFSDMPLSMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGYG-LNGAAAQ 331 +P D + + FE A+K+ GFL+PSI+ Y +R +G L G A+ Sbjct: 303 SPWDFAKILVR-TGHRIQIPFETAIKSKDGGFLKPSIEEMKAYLNRQEKLHGSLFGKKAE 361 Query: 332 FSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 ++ + + T++ L S ++ Sbjct: 362 YTYGED--------ENFTIDDLISALKQQ 382 >UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J368_DEIGD Length = 385 Score = 322 bits (827), Expect = 1e-86, Method: Composition-based stats. Identities = 116/390 (29%), Positives = 169/390 (43%), Gaps = 35/390 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 M + +H L + +PS LNRDD KDA FGG RR+RISSQ+ KRAMR+ Sbjct: 1 MKALLELHYLQNFAPSNLNRDDTGSPKDAFFGGTRRLRISSQAFKRAMRQDFGGRELLRP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 E +RT + L G +Q LAL + K + E Sbjct: 61 EEIGVRTKRAHEAIAELLAGEGRTEEQCRAAAELALGGLGLPVKDGKN--QYLLFLGRDE 118 Query: 119 IAWFCEQV----AKAEADNLDDKKLLKVLK------------EDIAAIRVNLQQGVDIAL 162 + + + A+ +A + + K A ++ + VD+AL Sbjct: 119 LRRVADIIGANWAEFQAAAPEPESTDGKKKKASKKAALSGDLGKQLAGALDGSKAVDVAL 178 Query: 163 SGRMATSGMMTELGKVDGAMSIAHAITTHQV-DSDIDWFTAVDDLQ---EQGSAHLGTQE 218 GRM D A +AHAI+TH + + D++TAVDDL+ G+ LGT E Sbjct: 179 FGRMLAD---LPDKNADAAAQVAHAISTHALRERQYDFYTAVDDLKPDDNAGADMLGTVE 235 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F+S YRYA I+L +L ENL G RE ++ P KQ T+AA N + Sbjct: 236 FASATVYRYACIDLGKLLENLQG-DRELLERGLRAFLYASVYAAPTGKQNTFAAHNLPGL 294 Query: 279 V--MVNFSDMPLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 + +V + P ++ANAFEK V+A+ G+L PS+ A +G G A + Sbjct: 295 MVQVVRRNASPRNLANAFEKGVRAEGGQGYLAPSVAALADEMRWQNGVFGDAGTARFVAR 354 Query: 335 SDVDPITAQVKQMPTLEQLKSW-VRNNGEA 363 D + + MP + L V + A Sbjct: 355 EGGDAVF--GEAMPNVAALIDATVADALSA 382 >UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=Bacteria RepID=B4S8P9_PROA2 Length = 347 Score = 322 bits (825), Expect = 1e-86, Method: Composition-based stats. Identities = 106/361 (29%), Positives = 170/361 (47%), Gaps = 33/361 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 I H+L S +CLNRDD+ K AI GG R R+SSQ KR +R S Q+ G + + Sbjct: 12 IEYHILQSFPVTCLNRDDVGAPKTAIVGGSTRARVSSQCWKRQVRLS---MQDFGIKLGI 68 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R+ +++ + QK + A GK + + S D + + E F Sbjct: 69 RSKKVSEFV-------AKACLQKGASEEQAAECGKVISD--SFSKDTLFFFSESEAQAFA 119 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 + + D+ + K +++ G+DIAL GRM ++ A S Sbjct: 120 DYAREKNFDSKNLND--KEIRKVAKKALNPAIDGLDIALFGRMVAQAT---DLNIEAAAS 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDL-QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGA 242 +HAI+TH+V +++++FTA+DDL +E GSAH+G+ EF+S +YRY +++L QL E++GG Sbjct: 175 FSHAISTHKVSNEVEFFTALDDLAEEPGSAHMGSLEFNSATYYRYISLDLGQLWESIGG- 233 Query: 243 SREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA-K 301 E E + L VP A+Q T + +P + + + FE AVKA Sbjct: 234 --EHLAEAVESLTKALFVAVPSARQTTQSGASPWEFAKIFIR-KGQRLQVPFETAVKAKD 290 Query: 302 DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 G+LQPSI A Y + G L G +F+ + +++ L ++ Sbjct: 291 GGYLQPSITALTDYLTKKEALAGSLFGKEKEFTFGED--------VNFSIDDLIKGLKLT 342 Query: 361 G 361 Sbjct: 343 V 343 >UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD5_SACVD Length = 368 Score = 312 bits (799), Expect = 2e-83, Method: Composition-based stats. Identities = 97/374 (25%), Positives = 161/374 (43%), Gaps = 24/374 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L + S +NRD++ K +GG R+R+SSQ+ KRA+RK+ Q++ + + Sbjct: 2 FVDIHALHTLPYSNVNRDNLGAPKSCWYGGTERIRVSSQAWKRAIRKAV--EQDLEQPTE 59 Query: 64 RTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA-VTPWVVGEIAW 121 RT +A L +L ++ D + + + G + + TP +A Sbjct: 60 RTRRIASLVAGILTERGWGAEDARRAGRAVIYAYGLEPAADDDDTDTLLWTPPAAEALAG 119 Query: 122 FCEQVAKAE------------ADNLDDKKLLKVLKEDIAAIRVNLQQGVD-IALSGRMAT 168 E+ A N K + +K ++ L + IAL GRM Sbjct: 120 VVEKHRDTVVTLPLPKGEGKKAKNPPAKDITDAVKPMAGEVKSILNRTTPTIALLGRMLA 179 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD-LQEQGSAHLGTQEFSSGVFYRY 227 + G IAHA T H+ + D+FTAVDD G+ H+ T +F++G FYRY Sbjct: 180 D---RPDHTIYGLAEIAHAFTVHEAAPEFDYFTAVDDRAANTGAGHVNTAQFTTGTFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 ++IN+ +L + +G A + T P KQ AA AD+ + + P Sbjct: 237 SSINITRLVDVVGEQD---ARAVLLAWARRFITVTPAGKQTATAARTAADLAHIVVRNAP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 S A AFE + + G+L P+ +A Y R+A G ++ + + + Sbjct: 294 QSYAPAFETPIVSTGGYLDPAARALGDYATRLAAYLGDTPVEHGYATTLPTNVDGLGGRF 353 Query: 348 PTLEQLKSWVRNNG 361 TL+ L + Sbjct: 354 DTLDTLINATVGAV 367 >UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD1_METCA Length = 414 Score = 304 bits (778), Expect = 5e-81, Method: Composition-based stats. Identities = 106/411 (25%), Positives = 176/411 (42%), Gaps = 61/411 (14%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R+S + + L Sbjct: 2 FLQIHSLTSYHATLLNRDDAGLAKRIPFGDAVRLRVSSQCLKRHWRESLKQTIPLP-TGL 60 Query: 64 RTIHLAQLR--DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA------------- 108 RT H+ + L+Q+ E K + +L L + D+ K Sbjct: 61 RTRHVFEREIYPRLKQEGVEDSLAKQLTLSLMGLLLQKSDKTAKPEKAKKGKNGHEEQAE 120 Query: 109 -------------------DAVTPWVVGEIAWFCEQV-AKAEADNLDDKKLLKVLKED-- 146 + E+ + + A AE + +K L LK D Sbjct: 121 FDFEEGAGTEESSAGDLRVKQPILFGRPEVDYLISLLKACAEEGSGAEKALQAKLKGDKA 180 Query: 147 ------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWF 200 AA +L G++ AL GR TS + L + D A+ +AH+ T H +D+++D+F Sbjct: 181 NFKAMLKAAGHGDLYAGLEGALFGRFVTSDV---LSRSDAAVHVAHSFTVHGLDTEVDYF 237 Query: 201 TAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGAS--------REQALE 249 T VDDL +E G+AH G E +G+FY Y +++ L NL G + Sbjct: 238 TVVDDLNREEETGAAHAGDMELGAGLFYGYVAVDIPLLVSNLTGCDTTRWAEQEPADVRK 297 Query: 250 IATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPS 308 + T ++ +AT PGAK A + ++ V++ P +++NA+ +A+ + LQ + Sbjct: 298 VLTGLIRAIATVSPGAKLGATAPYAFSEFVLLETGKQQPRALSNAYLQALPMRGDPLQAA 357 Query: 309 IQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 I A +Y + YG + S++ A + +L+ + Sbjct: 358 IDALAKYLRALDAMYGRTSDSR--SVASTRAFDADLAPTNSLDASIGAALD 406 >UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=Actinomycetales RepID=D1A6Q4_THECD Length = 399 Score = 293 bits (750), Expect = 7e-78, Method: Composition-based stats. Identities = 116/388 (29%), Positives = 173/388 (44%), Gaps = 39/388 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 FI H++ + + LNRDD N K +GGK R R+SSQ KRAMR E++ Sbjct: 7 RFIEAHIIQAIPFANLNRDDTNAVKTVTWGGKERTRVSSQCWKRAMRLY-LQTSLGQEAA 65 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD------------ 109 LRT L + L L + G D +++ EA K D Sbjct: 66 LRTRRLPEYLARHLEEHHGWPADLAERAGRHIVVASSVGGEAPKKKTDGEETGGTGEHWS 125 Query: 110 -----AVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKED------IAAIRVNLQQGV 158 + V E+A Q +A + + K K ++D + + GV Sbjct: 126 TAAMVYIPSSAVPELAELAIQYREALENAKEPKDPAKFGRKDSVIPTGKVDEILRRRNGV 185 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE-----QGSAH 213 I L GRM + +VDGA+ +AHA TTH ++ID+F+AVDD+ + GSAH Sbjct: 186 -INLFGRMLAQ---VDDAEVDGAVQVAHAFTTHATTTEIDYFSAVDDVTDIWGDTTGSAH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G E S+GV YRY ++L L NLGG E E+A ++ +P AK+ + A Sbjct: 242 MGQAEHSAGVLYRYIVLDLNDLHANLGG-DLEATRELAAGLLKAALLSLPRAKKNSTAPH 300 Query: 274 NPADMVMVNFS-DMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAA- 329 + + D P+S A AFEK V A G +PS+ A N+Y V G +G Sbjct: 301 TIPHLAHLTVRTDRPVSYAGAFEKPVPADRHGGHSEPSVAALNEYAAAVQKLLGTSGCRY 360 Query: 330 AQFSLSDVDPITAQVKQMPTLEQLKSWV 357 A + + I A +++ + ++L Sbjct: 361 AAHATLSQEKIDALGERVESFDKLIEGA 388 >UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Actinomycetales RepID=C2GEY7_9CORY Length = 356 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 92/360 (25%), Positives = 148/360 (41%), Gaps = 38/360 (10%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSN + +H L S S LNRDD + K + GG R SSQS+KR R Y + Sbjct: 1 MSNQLTLHFLCSIPYSNLNRDDTGVPKRVMQGGALRALHSSQSIKRGSRV--LYENASQD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGK-SVDEAEKISADAV-TPWVVGE 118 S+R+ L + ++ D+K K A L G + EA+ DA + W+ E Sbjct: 59 LSIRSGRLDEEVAEKAMEMNPDLDEKTALKQAAKLIGNLTKGEAKSGEGDAKRSTWLSSE 118 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 A A++ D ++ I N + IA GRM + Sbjct: 119 EILTA---ATYVANSTDPREKF---------IDGNTTGSLAIAAFGRMFANAT---DLNT 163 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQ 234 + A++++ AITTHQ + D+F+ DD+ + + +L ++SG FYR I+ Q Sbjct: 164 EAAVAVSPAITTHQATIETDYFSTADDINLRDHKANATYLDVSLYTSGTFYRTVTIDRNQ 223 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAF 294 L+ + G E V L P K+ + A F +++ + +A F Sbjct: 224 LRTSWSGFESNSVRENLEAFVRSLVYGQPRGKKNSTAPFTMPSLILAE--EQQYRVAYDF 281 Query: 295 EKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLE 351 E+ V+A GF++ SI+ + + A F + P+ A P L+ Sbjct: 282 ERPVEADKDGGGFMKSSIEKLAKQYT----------LARSFDPGNFGPVEALSGTYPDLD 331 >UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I6_9LACO Length = 311 Score = 276 bits (706), Expect = 9e-73, Method: Composition-based stats. Identities = 79/309 (25%), Positives = 143/309 (46%), Gaps = 26/309 (8%) Query: 61 SSLRTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKIS-ADAVTPWVVGE 118 + +RT+ L L+++ + + + + + + + +K + A+ G+ Sbjct: 14 AGIRTMRGPLLLANELQKQDSNLSSDEAMAQAVDVFNKAKIKLDKKTNQTKALLMLSHGQ 73 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 IA E V + D LD K + + L+ +D+AL GRM V Sbjct: 74 IAKLAEYVRQN--DELDSKAVKEALQ---------GDHSLDMALFGRMVADD---PSLNV 119 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQL 235 D A +AHAI+TH++ + D++TAVDD + E GSA +GT E+ S YRYAN+N+ +L Sbjct: 120 DAACQVAHAISTHEIVPEYDYYTAVDDEKADDESGSAMIGTIEYDSATLYRYANVNVNEL 179 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ++LG + A++ V +P KQ ++A V+V D P+++ +AF Sbjct: 180 VQSLG--DVDTAVKGLQLFVKDFVLSMPTGKQNSFANKTVPQYVLVTVREDTPVNLVSAF 237 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLK 354 E+AVK++ G+LQPS+ + + S+ + + + ++ L Sbjct: 238 EEAVKSRHGYLQPSVAKLEKEYQDTQQFVQTP----LASVVVTNKESKISTKAADVDDLV 293 Query: 355 SWVRNNGEA 363 S + E+ Sbjct: 294 SKITEVIES 302 >UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM4_RHOCS Length = 435 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 95/425 (22%), Positives = 159/425 (37%), Gaps = 73/425 (17%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 I HVL + P +NRD+ K GG R RISSQ+ KRA+R + ++ + + R Sbjct: 15 IQFHVLTAFPPHNVNRDEDGRPKTCQLGGVTRGRISSQAKKRALRLAPHFPTA--QRATR 72 Query: 65 TIH--------------------LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE 104 T A L G + + + +A K ++A Sbjct: 73 TRKAGIHTFLKLTAAGIDTTSAVWAALAVNHATGGGGKPPKAEDAQAIAAPDPKKQEDAY 132 Query: 105 KISADAVT---------------PWVVG-------------EIAWFCEQVAKAEADNLDD 136 K AVT W+ G E A E +A A D Sbjct: 133 KKKEKAVTDMMEKRGLDRAAAEQEWLTGQVGTEQGLVISTREFARIEEGIAHLTAAWAAD 192 Query: 137 KKLLKVLKED------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITT 190 + + E ++ +D AL GRM + V+ A ++ HA TT Sbjct: 193 RDGFPAVLEGWVRQVCKESLLTKADHDLDTALFGRMVAANANF---NVEAACAVGHAFTT 249 Query: 191 HQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG-GASREQ 246 H+ + D+F+A ++L+ G+ G F GV+Y++A ++ L+ L G S E+ Sbjct: 250 HRFALEGDYFSAGEELKVLGGTGAVITGYAFFGGGVYYQHAVLDRGHLRTTLSRGRSAEE 309 Query: 247 A----LEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP-LSMANAFEKAVKAK 301 A ++ + L P K ++A+ A V+ P L++ AF VKA Sbjct: 310 AERLTVQAVDTFLTGLLFSQPRGKCNSHASDVAASYVLATRGGDPALNLGLAFLDPVKAT 369 Query: 302 D---GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPIT--AQVKQMPTLEQLKSW 356 + + SI+ + + YGL A + + ++ T+E + + Sbjct: 370 EDVTDLMCASIRRLTDFHRALTAAYGLGNAVCVLNAYPPARGNDAPRAPEVWTVEDFRRF 429 Query: 357 VRNNG 361 V+ G Sbjct: 430 VQGRG 434 >UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH9_CYAP4 Length = 501 Score = 259 bits (663), Expect = 9e-68, Method: Composition-based stats. Identities = 74/324 (22%), Positives = 143/324 (44%), Gaps = 25/324 (7%) Query: 48 MRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLA----LLSGKSVDEA 103 R E R L VL + ++ + + ++ A + + ++ Sbjct: 174 WRT--KLQSEFAEMPERVDDQVSLWSVLSIQALQKSQEDLANEDEADDEKVDTSNTMFFV 231 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDD--KKLLKVLKEDIAAIRVNLQQGVDIA 161 + + + +++ + + ++ + K++ +K V + DIA Sbjct: 232 GDVEIENLAGFLLNNLQVVQQDISASVPSFSKAVVDKIIDTIKHKDEKGNVIFPKPGDIA 291 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQE 218 L GRM + KVD ++ +AHAI+ +++ + D+FTAV+DL E GS H+G Sbjct: 292 LFGRMMAN---LPNAKVDASVQVAHAISVNKLQQEFDFFTAVEDLAEPDSLGSGHMGETG 348 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 ++S +YR+ ++ QL++NLG + + A IA +P Q +AA + Sbjct: 349 YNSSTYYRFTTLDTEQLKQNLG--NEDNAATIAHAFAEAFVRAIPTGHQNGFAAHSLPAA 406 Query: 279 VM-VNFSDMPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGL-----NGAAA 330 VM V P+S+ +AFE V K G L+ ++ +++W ++ YG G A Sbjct: 407 VMAVVRKGQPVSLVDAFENPVAPKAGKSLLENAVSKLDEHWAELSKMYGEKTVVFKGIVA 466 Query: 331 QFSLSDVDPITAQVKQMPTLEQLK 354 + L+ A V++ P++E+L Sbjct: 467 RAQLAQQLEYLAAVEK-PSVEELL 489 Score = 120 bits (301), Expect = 8e-26, Method: Composition-based stats. Identities = 36/147 (24%), Positives = 55/147 (37%), Gaps = 2/147 (1%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 + IH+L S P+ LNRD+ M K +FGG R RISSQ KR R YY + E + Sbjct: 3 LEIHILQSFPPANLNRDENGMPKSTVFGGYPRARISSQCQKR--RTREYYHEYCKELGVD 60 Query: 65 TIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCE 124 H A ++L E+ Q+ + + A L D + Sbjct: 61 LKHFANRSRNWIKQLKEKLTQRGVSEAQAELMASLTISVLSEKPDKKGKLKYKPEDVIKK 120 Query: 125 QVAKAEADNLDDKKLLKVLKEDIAAIR 151 V + +K ++ + Sbjct: 121 LVGVWQKALKSPRKKNELEQAITEQTL 147 >UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC9_9ACTN Length = 310 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 66/269 (24%), Positives = 118/269 (43%), Gaps = 14/269 (5%) Query: 79 LGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKK 138 + E + I+K +L + +K + + +++ ++A+ L D + Sbjct: 1 MPEVSEGDAIEKAKEVLVALGF-KLKKEENEYLNEYLIFIGTLQIGKLAELAIQALRDGE 59 Query: 139 L--LKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSD 196 K K+ + R VDIA+ GRM VD ++ +AHAI+ +++ Sbjct: 60 KVDKKEAKKILDVKRSPALNAVDIAMFGRMVADA---PDLNVDASVQVAHAISVSSAETE 116 Query: 197 IDWFTAVDD---LQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATH 253 D+FTA+DD G+A + T EF+S +FYRYAN+++ L ENLG S + A + Sbjct: 117 FDYFTALDDKAPEDNAGAAMIETTEFTSAMFYRYANVDVFHLCENLG--SPDAATKGINA 174 Query: 254 VVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKA--KDGFLQPSIQ 310 + +P KQ ++A V++ D P+S+ N+FE+ V A L + + Sbjct: 175 FLQSFVKSMPTGKQNSFANRTLPSAVVIQLRDSQPVSLVNSFERPVVALRDKSQLTNAAE 234 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDP 339 A + +G+ + D Sbjct: 235 ALVAQEKALDEAFGVTPQHTFVVAASPDA 263 >UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella boydii Sb227 RepID=Q31XC0_SHIBS Length = 245 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 16/240 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K I GG R+R+SSQSLKRA R S + Q + G Sbjct: 1 MTTFIQLHLLTAYPAANLNRDDSGSPKTVILGGATRLRVSSQSLKRAWRTSELFEQALAG 60 Query: 60 ESSLRTIHLAQLRD--VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 +R+ +A+ ++ + + ++ + + L D+ +K P Sbjct: 61 HIGVRSGRIAREAATILIEKGIEDKKAIEWAVEIADYLGKAKKDKKQKNDKKPKDPLTSA 120 Query: 118 EIAWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 E ++ AE D + + + KE A+ + VDIA+ GRM Sbjct: 121 ETEQLV-HISPAEFDAVKALAHQLAEEKRAPKEKDLALLRKDRMAVDIAMFGRMLAKKPG 179 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYA 228 V+ A +AHA + + D+FTAVDDL ++ G+ H+ F S +FY Y Sbjct: 180 F---NVEAACQVAHAFGVSETIVENDFFTAVDDLRQASEDAGAGHVDETGFGSALFYTYI 236 >UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ25_CYAP7 Length = 480 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 69/279 (24%), Positives = 114/279 (40%), Gaps = 20/279 (7%) Query: 83 FDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKV 142 D + ++ I D T + +A + + KKL Sbjct: 170 SDDDTSTPEETESTITILELPGAIQGDLKTSYKDNPLAKVVNE-----EEFNQLKKLCNE 224 Query: 143 LKEDIAAIRVNLQQGV--DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWF 200 +K + + + V D+AL GRM S VD ++S+AHAI+T+ + + D++ Sbjct: 225 IKGILYDEKNKRIKPVPGDVALFGRMLAS---FSDASVDASVSVAHAISTNSIKREFDYW 281 Query: 201 TAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHV 254 TA D + QG+ H+G + F+SGVFYRY+ ++ QL ENLG +E + Sbjct: 282 TAARDFQKNNSDESQGAGHIGDRPFASGVFYRYSCLDSNQLSENLGEIYQEDIQYLVEQY 341 Query: 255 VHMLATEVPGAKQRTYAAFNPA-DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFN 313 + P + + V P+S+ NAF+ +K D F + S Sbjct: 342 LDAFLHSRPSGYSHQFGHDTLPFAGIFVIRQSQPISLVNAFDIPIKKYDSFCRQSWNKLV 401 Query: 314 QYWDRVANGYGLNGAAAQ---FSLSDVDPITAQVKQMPT 349 +W+ + YG + FSL I+ VK +P Sbjct: 402 DHWNEIQQAYGKRLPVKEVHVFSLESFKDISELVKAVPN 440 Score = 127 bits (319), Expect = 8e-28, Method: Composition-based stats. Identities = 38/134 (28%), Positives = 56/134 (41%), Gaps = 10/134 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+NF+ IH+L S PS +NRD K A FGG R+R+SSQS K A+R+ Sbjct: 1 MTNFLEIHLLQSTPPSNMNRDQNGSPKTAHFGGVERLRVSSQSWKHAVRQYYKKTLPDDH 60 Query: 61 SSLRTIHLA-QLRDVLR-QKLGERFDQK--------IIDKTLALLSGKSVDEAEKISADA 110 + R +L L+ +K E + K I L G D+ ++ D Sbjct: 61 KTYRDKGWPTELAKRLKQEKFDEELNLKDSDFSVVLPIAFMLLSAIGAKRDDKKEGDIDT 120 Query: 111 VTPWVVGEIAWFCE 124 + E+ Sbjct: 121 MLFLGEAEVREIIN 134 >UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2C Length = 461 Score = 220 bits (562), Expect = 4e-56, Method: Composition-based stats. Identities = 90/402 (22%), Positives = 150/402 (37%), Gaps = 92/402 (22%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE-- 60 + ++H+L + + + RD+ M K +FGG R I++Q+ +RA R N G+ Sbjct: 16 QYFSLHLLETFTAALPVRDENGMPKQFVFGGDPRTMITAQARRRAERTHSRERANAGQGP 75 Query: 61 -----SSLRTIHLAQL-------------------RDVLRQKLGERFDQKIIDKTLALLS 96 +RT A+L L + +G +F K + L + Sbjct: 76 LAGYTMGIRTREWAKLTAKALADRYGWDRADALATAKALLEGVGLKFGAKPTTRDLTQVL 135 Query: 97 GKSVDEAEKISADAVTPWVVGEIAWF-------------------------------CEQ 125 + ++A +I AD + AW + Sbjct: 136 LFAPEDAGQIIADWIQEHRAEVAAWTSDYLKAKEAGAAAAAAKKAAAAAARKAKKSGTDA 195 Query: 126 VAKAEADNL--DDKKLLKVLKEDIAAIRVNL--QQGVDIALSGRMATSGMMTELGKVDGA 181 +A A DN ++++L V ++ AI L + +DIAL GR + VDGA Sbjct: 196 LASAADDNQPNNEEQLPPVPRKIREAILSALAPRDAIDIALYGRFLAEIADSP--NVDGA 253 Query: 182 MSIAHAITTHQVD------------------SDIDWFTAVDDLQEQGSAHLGTQEFSSGV 223 + AHA T H + +D+ A DD G+ G Q SG Sbjct: 254 IQTAHAFTVHAAEHIDDFYAAADDAKLHRKAHALDYIDAADD---SGAGMTGYQSLISGT 310 Query: 224 FYRYANINLAQLQENL--GGASREQAL----EIATHVVHMLATEVPGAKQRTYAAFN-PA 276 FYR+A ++ +L+ NL G +Q V +P AK+ T AA Sbjct: 311 FYRHAVLDRYKLRINLLASGMKPDQVQAAAEAAELEFVEAFTNAIPQAKKNTTAATGILP 370 Query: 277 DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 +VM P + A FEK + + + S+ A ++ ++ Sbjct: 371 KLVMAFTGARPFNYAGIFEKPIAEETDGV-ASVAAADRLLNQ 411 >UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190E665 Length = 139 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 39/138 (28%), Positives = 64/138 (46%), Gaps = 6/138 (4%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L +++P+ LNRD+ K A GG R+R+SSQSLKRA R S + + G Sbjct: 1 MTTFIQLHLLTAYAPANLNRDESGRPKTAFMGGVERLRVSSQSLKRAWRVSETFEAAMDG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--WVVG 117 RT + D + + + + ++ I K+ + L K + K DA + Sbjct: 61 FMGKRTRRIG--VDYVYRPMKDAGIEEKIAKSSSELIAKQFGKL-KSDKDAKPEKNLEIE 117 Query: 118 EIAWFCEQVAKAEADNLD 135 +I +D Sbjct: 118 QIVHVSNHEISLIKQLVD 135 >UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1 Tax=Streptomyces sp. C RepID=UPI0001B58196 Length = 91 Score = 83.4 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 4/84 (4%) Query: 262 VPGAKQRTYAAFNPADMVMVN-FSDMPLSMANAFEKAV---KAKDGFLQPSIQAFNQYWD 317 +P K T+ D+V+V S P+S AFEK V + +G ++ + +A ++ Sbjct: 1 MPTGKANTFGNHTLPDVVIVKLRSSRPVSFVGAFEKPVIQHETGEGHVRAAWKALAEHIP 60 Query: 318 RVANGYGLNGAAAQFSLSDVDPIT 341 + +G A P T Sbjct: 61 AIEKTFGATADATWILRVGEPPTT 84 >UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BS05_9ACTO Length = 435 Score = 66.0 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 36/89 (40%), Gaps = 3/89 (3%) Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ 331 + ++V V D +S+ NAFE+ V + +Q +++ + + YG+ AA Sbjct: 339 SLPELVYVAVRDTRSVSLVNAFEEPVACERGSRVQAAVEVLANEETAIEDAYGMKPLAAF 398 Query: 332 -FSLSDVDPITAQVKQMPTLEQLKSWVRN 359 D + T+ +L S + Sbjct: 399 VVDPKDYAAKLEDIAHKVTVPELTSLIVE 427 >UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O87037_VIBCH Length = 96 Score = 45.6 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 262 VPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVA 320 +P A+Q T + P + V + +FE+ V+A G+L P+ +A + ++ Sbjct: 1 MPNARQTTQSGACPWEYARVLVR-KGQRLQASFEQPVRAAGEGYLLPNKKALQNWLEQRE 59 Query: 321 NGYG 324 G Sbjct: 60 KLSG 63 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobact... 440 e-122 UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Ac... 420 e-116 UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 T... 393 e-108 UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID... 391 e-107 UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidob... 390 e-107 UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 386 e-106 UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacteriu... 385 e-105 UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=... 384 e-105 UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria ... 381 e-104 UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=S... 381 e-104 UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=S... 381 e-104 UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=P... 381 e-104 UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacteriu... 380 e-104 UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinom... 379 e-104 UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actino... 378 e-103 UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=S... 378 e-103 UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=B... 377 e-103 UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putati... 376 e-103 UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidob... 375 e-102 UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 T... 373 e-102 UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter ro... 373 e-102 UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Strepto... 372 e-101 UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=... 368 e-100 UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=... 368 e-100 UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes Rep... 364 2e-99 UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=G... 362 1e-98 UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=A... 361 2e-98 UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellul... 361 2e-98 UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=T... 360 5e-98 UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax... 359 8e-98 UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=D... 357 5e-97 UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 354 4e-96 UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=S... 351 2e-95 UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=R... 351 3e-95 UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=B... 350 4e-95 UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=... 350 5e-95 UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=D... 349 7e-95 UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=T... 349 7e-95 UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=A... 346 7e-94 UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfo... 346 7e-94 UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=A... 346 8e-94 UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=A... 345 1e-93 UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces R... 343 6e-93 UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiob... 343 7e-93 UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=C... 342 1e-92 UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=A... 341 2e-92 UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granuli... 341 3e-92 UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=P... 339 1e-91 UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=A... 336 8e-91 UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=J... 336 8e-91 UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinoc... 335 2e-90 UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=B... 334 4e-90 UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=G... 332 9e-90 UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=B... 332 1e-89 UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=A... 332 1e-89 UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=R... 332 2e-89 UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospi... 330 6e-89 UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=D... 329 8e-89 UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax... 316 9e-85 UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=S... 316 1e-84 UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=A... 302 1e-80 UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus ... 284 3e-75 UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Ac... 284 3e-75 UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax... 273 6e-72 UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=C... 262 1e-68 UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=S... 260 8e-68 UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella bo... 255 2e-66 UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=C... 232 1e-59 UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax... 218 2e-55 UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 ... 141 4e-32 UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1... 83 1e-14 UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobil... 66 2e-09 Sequences not found previously or not previously below threshold: UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O8703... 46 0.003 CONVERGED! >UniRef50_Q46899 Uncharacterized protein ygcJ n=13 Tax=Proteobacteria RepID=YGCJ_ECOLI Length = 363 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 363/363 (100%), Positives = 363/363 (100%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE Sbjct: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA Sbjct: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 Query: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG Sbjct: 121 WFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDG 180 Query: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG Sbjct: 181 AMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA Sbjct: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA 300 Query: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN Sbjct: 301 KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 Query: 361 GEA 363 GEA Sbjct: 361 GEA 363 >UniRef50_C0W6U1 CRISPR-associated Cse4 family protein n=2 Tax=Actinomycetales RepID=C0W6U1_9ACTO Length = 374 Score = 420 bits (1079), Expect = e-116, Method: Composition-based stats. Identities = 113/372 (30%), Positives = 170/372 (45%), Gaps = 20/372 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IH++ S PSC+NRDD K A++GG RR+R+SSQS KRA R + + Sbjct: 1 MSTFVDIHLIQSLPPSCVNRDDSGSPKSALYGGVRRLRVSSQSWKRATRLYFNEHLDATD 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA---EKISADAVTPWVVG 117 +RT + +L + + + S + A K A A + +++ Sbjct: 61 VGIRTKRVVELLADRISAIAPDLADSALALAEQVFSAAKIKVAPPRGKKDAPAESGYLLF 120 Query: 118 EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 ++A+ + + + VDIAL GRM Sbjct: 121 LSTSQINRLAEMATRAAHAGE---KIDPKETKKIFKEEHAVDIALFGRMVADDA---DLN 174 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDD------LQEQGSAHLGTQEFSSGVFYRYANIN 231 VD A +AHAI+TH +++ D+FTAVDD ++ G+ +GT EFSS YRYA +N Sbjct: 175 VDAACQVAHAISTHAAENEYDFFTAVDDEKSRAMEEDAGAGMMGTVEFSSATMYRYATVN 234 Query: 232 LAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSM 290 L L ENLG R+ AL + + +P KQ T+A D V+V+ D P+S+ Sbjct: 235 LDMLVENLG--DRDAALRALSVFLEGFCLSMPTGKQNTFANRTLPDSVVVSVRDDQPVSL 292 Query: 291 ANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMP 348 AFEK V+ DGFL S++A +Y + +GL A+ P A + + Sbjct: 293 VGAFEKPVRTTESDGFLTRSVEALARYEHTIEENFGLKPQASFVVSLADVPELASLGERI 352 Query: 349 TLEQLKSWVRNN 360 T L V Sbjct: 353 TFADLPGKVCEA 364 >UniRef50_D2RB01 CRISPR system CASCADE complex protein CasC n=2 Tax=Gardnerella vaginalis RepID=D2RB01_GARVA Length = 362 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. Identities = 100/360 (27%), Positives = 170/360 (47%), Gaps = 22/360 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I + S P +NRDD K A +GG R R+SSQ K +MR+ + Sbjct: 6 FLDIQAIQSVPPCNINRDDAGSPKTAQYGGVTRARVSSQCWKHSMREYFKEHSGDSNVGM 65 Query: 64 RTIHLAQLRDV----LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 R+ ++ + L+ +L E+ + +KTL K+ + KI + +GE Sbjct: 66 RSKNIVKYVADKIITLKPELSEQEALDLANKTLNNAGFKTKTDKGKIIPVVNVLFFLGE- 124 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 +A+A +N+ DKK L+ + +D +DIAL GRM D Sbjct: 125 -NQANSLAQAAINNVTDKKQLEEILKDNP--------PIDIALFGRMLADN---PSLNED 172 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQ 236 + +AHAI+TH V ++ D++TAVDDL G+ LGT E++S YRYAN+ + + Sbjct: 173 ASSQVAHAISTHAVRAEFDYYTAVDDLSVDDNAGAGMLGTIEYNSSTLYRYANVAIHEFS 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFE 295 L ++E + + A +P K T+A M++V D P+++ +AFE Sbjct: 233 HQLSD-NKESTINALKLFIEAFANAMPTGKVNTFANQTLPQMLVVTLREDRPVNLVSAFE 291 Query: 296 KAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 VKAKDG++ SI+ +Q +++V A+ ++ + + +++QL Sbjct: 292 DPVKAKDGYVSKSIEKLSQEYEKVQKFVHKPLASFYVTMDSSNKEIKLGVEEQSMQQLLD 351 >UniRef50_C2BET9 CRISPR-associated protein n=3 Tax=Bacteria RepID=C2BET9_9FIRM Length = 359 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 97/363 (26%), Positives = 168/363 (46%), Gaps = 22/363 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + + P+ +NRDD K A +GG R R+SSQS KRA+RK ++ + Sbjct: 10 FLDIHAIQTVPPANINRDDTGSPKTAQYGGVTRARVSSQSWKRAIRKYFNENGDVENVGI 69 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R++ + + + I + ++ K+++ A+ + D + Sbjct: 70 RSLEIVRYVANKIVQKDG----SISIEEAMEMADKTINNAKISTKDQKAKALFFMSDKQA 125 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A+A D ++DKK+L+ + ++ +D+AL GRM D + Sbjct: 126 EELAQASIDKVNDKKILQEILKN--------DTSIDVALFGRMVADDA---SLNEDASSQ 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 +AHAI+TH + S+ D+FTAVDDL G+ LGT E++S YRYANI L L Sbjct: 175 VAHAISTHAIQSEFDFFTAVDDLAPEDNAGAGMLGTVEYNSSTLYRYANIALHDFYRQL- 233 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVK 299 A +E+ ++ V +P K T+A ++V+ D PL+M +AFE+ +K Sbjct: 234 -ADKEETIKATKLFVKSFVESMPTGKINTFANQTLPQAIVVSLRSDRPLNMVSAFEEPIK 292 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 + +G++ SI+ + + A L + + K +L L + Sbjct: 293 SDNGYVDKSIEKLFSEYTKYDKILDKPIFTAYLILGNT-EVNEIGKSEASLNDLLEDLGK 351 Query: 360 NGE 362 E Sbjct: 352 EIE 354 >UniRef50_B6XT63 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT63_9BIFI Length = 371 Score = 390 bits (1003), Expect = e-107, Method: Composition-based stats. Identities = 101/372 (27%), Positives = 162/372 (43%), Gaps = 18/372 (4%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L PS +NRDD K A GG R R+SSQS KRAMR+ + + Sbjct: 2 FVDIHCLQQVPPSNINRDDTGSPKTAYVGGALRARVSSQSWKRAMREMFSSKLDSSKLGK 61 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSG-----KSVDEAEKISADAVTPWVVGE 118 RT L + + ++ +L+ K+ D A + T +++ Sbjct: 62 RTKSAVALISSVIAEKRPDLVEESKSLAEKVLAATGVKVKASDRAGADKGSSATEYLIFI 121 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 EQ+A D+ K +K+++AA+ + +Q +DIA GRM Sbjct: 122 ANREVEQLADIAITAFDEGKDPSKMKKEVAAV-FHGEQAIDIACFGRMLADA---PDLNT 177 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQL 235 D + +AHA + Q+ + D+FTAVDD G+A + T F+S YRYA +N+ L Sbjct: 178 DASAQVAHAFSIDQITPEYDYFTAVDDCASDDNAGAAMIDTIGFNSSTLYRYATVNVDAL 237 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAF 294 ++ L A+E V +P KQ T+A + +++ D P+S A+AF Sbjct: 238 KDQLQ--DANAAVEGVAAFVDAFIKSMPSGKQNTFANHTLPEDIVIVLRDSQPISAADAF 295 Query: 295 EKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITAQVKQMPTLE 351 E +K KDG + I+ + + YG A +S + + TL Sbjct: 296 EDPIKRKDGISVSRQGIERLGDRLNEIRINYGEEPVKAWHVVSGGSVHSLDEWSEQVTLP 355 Query: 352 QLKSWVRNNGEA 363 +L+ +R A Sbjct: 356 ELEQGLRETLSA 367 >UniRef50_D1CGD3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD3_THET1 Length = 382 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 119/382 (31%), Positives = 182/382 (47%), Gaps = 30/382 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA--QNIGES 61 + +H++ + +PS LNRDD KD FGG RR RISSQ +KRA+R+ + Sbjct: 2 LVELHMIQNFAPSNLNRDDTGSPKDCEFGGVRRARISSQCIKRAIRREFKQNGLLDSERI 61 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 + RT + Q +LG R ++ LLS + + + GEI Sbjct: 62 AERTRLVTQEIADRLARLG-RDREQATRVAGFLLSAAKLKVDNSQRTEYLLFLGRGEIDA 120 Query: 122 F-------CEQVAKAEADNL-----DDKKLLKVLKEDIAAIR---VNLQQGVDIALSGRM 166 +Q+A +L D KK + + D++ ++ + D+AL GRM Sbjct: 121 ITALCNERWDQLAPLADQSLSDQSNDKKKAAQQVPADMSRELLARLDGGKAADLALFGRM 180 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGV 223 +D A +AHAI+TH+V + D++TAVDDLQ E G+ +GT EF+S Sbjct: 181 LAD---LPDKNIDAASQVAHAISTHRVSIEFDFYTAVDDLQPESETGAGMMGTVEFNSAC 237 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRY+N+++ QL NL G RE AL+ +H +P KQ + AA NP MV Sbjct: 238 FYRYSNVSMEQLITNLQG-DRELALKTLEAFIHASVRAIPTGKQNSMAAHNPPSMVFAVV 296 Query: 284 SD-MPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAA--AQFSLSDVD 338 + P S+ANAF + V ++ + SIQA + YW ++ + YG + A +L DV Sbjct: 297 REGAPWSLANAFARPVAPGREEDLVGRSIQALDSYWGKLVSVYGGDDIRKKALITLEDVP 356 Query: 339 PITAQVKQMPTLEQLKSWVRNN 360 ++ T++ L V Sbjct: 357 LQHLGDARVETVKALVEQVVAA 378 >UniRef50_B1VIY1 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=B1VIY1_CORU7 Length = 376 Score = 385 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 108/375 (28%), Positives = 161/375 (42%), Gaps = 20/375 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+I+ L S PS +NRDD + K+AIFGG R R+SSQS KRA+R+ + + Sbjct: 1 MSKIIDIYALQSLPPSLINRDDTGVPKNAIFGGVPRQRVSSQSWKRAIRRYFFENFDAAN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIID-----KTLALLSGKSVDEAEKISADAVTPW- 114 R+ L + ++ G I K + + +K DA + Sbjct: 61 IGDRSKRLPEKIARQLEEQGMEQGTAIERTEQLFKAAGIKTAVEKKPKDKDETDAEVAYP 120 Query: 115 VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTE 174 G + + + ++ K + AI + + VDIA+ GRM Sbjct: 121 QTGYLLFLSAHQIDNAVKAIQERDGKNFTKREAQAIL-DQEHSVDIAMFGRMVADDAAY- 178 Query: 175 LGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANI 230 VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + S YRYA + Sbjct: 179 --NVDAAVQVAHALGIHDSAPEFDYFTAVDDLAEEGEETGAGMIGTVQMMSSTLYRYATV 236 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLS 289 NL L ENL S + A + A V +P K T+A ++V V D P+S Sbjct: 237 NLEGLAENL--DSEDAAKQAAVEFVEAFIASMPTGKINTFANQTLPELVYVAVRDTRPVS 294 Query: 290 MANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFS-LSDVDPITAQVKQ 346 + NAFE V+A G + + Q V N YG A+ L + + Sbjct: 295 LVNAFEAPVEATEDKGRREVGAEVLAQEARDVENVYGFKPQASFVMGLGQLAEPFTDIAT 354 Query: 347 MPTLEQLKSWVRNNG 361 TL +LK + Sbjct: 355 QVTLPELKEQLAGAI 369 >UniRef50_B3E5V0 CRISPR-associated protein, Cse4 family n=56 Tax=Proteobacteria RepID=B3E5V0_GEOLS Length = 356 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. Identities = 105/375 (28%), Positives = 158/375 (42%), Gaps = 35/375 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ IH+L S+ P+ LNRDD K A GG R+R+SSQSLKRA R S + Q + Sbjct: 1 MSRFVQIHLLTSYPPANLNRDDQGRPKTAKMGGYDRLRVSSQSLKRAWRTSDLFQQALTE 60 Query: 60 ESSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP 113 RT L + +++K + QKI AL K D + + + Sbjct: 61 HVGTRTKLLGVMAYEKLVAGGVKEKQAKESAQKIAGVFGALKKAKEKDSLVDLEIEQLVH 120 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 EI + + + ++ + Q DIA+ GRM S Sbjct: 121 VSPSEIQAIESLLETLISQG-------RAPEDTELDLLRIQGQSADIAMFGRMLASS--- 170 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYAN 229 V+ A +AHAI+ H V + D+FTAVDDL ++ G+AH+G F++G+FY Y Sbjct: 171 PSYNVEAACQVAHAISVHPVVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAAGLFYSYIC 230 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPL 288 IN L ENLGG + ++ P KQ ++ + A V+ D P Sbjct: 231 INRTLLVENLGG-DEALVQKSIQALIEAAVKVPPNGKQNSFGSRAYASYVLAEKGDQQPR 289 Query: 289 SMANAFEKAVKAKD----GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQV 344 S++ AF K V ++ F ++ A + YG +D Sbjct: 290 SLSVAFLKPVTSQGIEGTDFGTAAVDALTTQRQNMDAVYGP--------CADASCEINVF 341 Query: 345 KQMPTLEQLKSWVRN 359 + TL +L +V Sbjct: 342 EGKGTLAELLKFVAE 356 >UniRef50_A3EQA5 CRISPR-ssociated protein, Cas4 n=4 Tax=Bacteria RepID=A3EQA5_9BACT Length = 398 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 109/405 (26%), Positives = 172/405 (42%), Gaps = 51/405 (12%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M I IHVL + +PS LNRDD KDA+FGG RR RISSQ +KR++R + + G Sbjct: 1 MKTLIEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARISSQCIKRSVRDFFCHKREDGI 60 Query: 60 ----ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWV 115 E +RT + Q L ++ +R I K LS + + + + Sbjct: 61 FSPDEIGVRTKRIYQAIADLLKE--KRDISDTITKAKTALSYLKI-KPKNEKTQYLLFLS 117 Query: 116 VGEIAWFCEQVAKA------EADNLDDKKLLKVLKE------------------------ 145 EI F + + E D+ +L + + Sbjct: 118 PKEIKDFANAIDEYWDQIVGEPIETDNSELDEETPDTVSLEEQKPKKGKKNKKPNIPKEF 177 Query: 146 -DIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVD 204 + +N + +DIAL GRM + A +AHAI+TH V+ + D++TA+D Sbjct: 178 QEKLESVLNGGKSIDIALFGRMLAD---IPEKNQNAACQVAHAISTHAVEREFDYYTAID 234 Query: 205 DLQE---QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATE 261 DL+ GS +GT EF+S FYRYA ++L L +NL E + + Sbjct: 235 DLKPDDTAGSDMIGTVEFNSACFYRYAVVDLEALNKNLHD-DSELTNKSIRAFLEAFIIS 293 Query: 262 VPGAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWD 317 P KQ ++AA NP + + ++ P ++ANAFE AV K G + S + Sbjct: 294 EPTGKQNSFAAHNPPEFIAISVRHNAGPRNLANAFETAVFPKKGESLTRKSADELVKKAK 353 Query: 318 RVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 + + +G +L + + + +LE L + + E Sbjct: 354 SLQSAFGGEDKTFLINLVGTN-VNGYGTVVASLEDLLNKTLSAIE 397 >UniRef50_D1CAJ1 CRISPR-associated protein, Cse4 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ1_SPHTD Length = 397 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 108/388 (27%), Positives = 171/388 (44%), Gaps = 38/388 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE--S 61 F+ +H++ + +PS LNRDD KD FGG RR RISSQ+LKRA+R + + E Sbjct: 2 FVELHIIQNFAPSNLNRDDTGAPKDCQFGGYRRARISSQALKRAIRMTFGEENLLPEESR 61 Query: 62 SLRTIHLA-QLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIA 120 + RT +A L + L + + + G S ++ ++ + T +++ Sbjct: 62 ARRTKRIAGALVERLVASGKDAVAAAAVVEAAIQGIGLSFEKPKEGDTEKKTQYLLFLGQ 121 Query: 121 WFCEQVAKAEADNLD--------------------DKKLLKVLKEDIAAIRVNL--QQGV 158 +A + D K L + + ++ + Sbjct: 122 REINALADVCLAHWDTLVDVAPNADAASERDAKKAKKANKAALPKQVQLALLDALDGRSA 181 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLG 215 D+AL GRM +D A +AHAI+TH+V ++ D++TAVDDL+ G+ LG Sbjct: 182 DVALFGRMLAD---LPEKNIDAASQVAHAISTHRVATEFDFYTAVDDLKPDDTAGADMLG 238 Query: 216 TQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 T EF+S FYRY+NI++ QL ENLGG + A + +P KQ + AA NP Sbjct: 239 TVEFNSACFYRYSNIDVDQLIENLGG-DVDLARTTVEAFLWASIHAIPTGKQNSMAAQNP 297 Query: 276 ADMVMVNFSDMP-LSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 VM D S+ANAF V ++ S+ A YW + YG + Sbjct: 298 PSFVMAVVRDRGLWSLANAFVNPVAPAHDGDLIERSVDALEAYWSNLVRVYG-GELRGTW 356 Query: 333 SLSDVDPITAQVKQ--MPTLEQLKSWVR 358 ++ +++ + T E+L V Sbjct: 357 CVNVNPRELGPLEELHVDTFEELVDAVV 384 >UniRef50_Q2JWC4 CRISPR-associated protein, Cse4 family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC4_SYNJA Length = 380 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 104/373 (27%), Positives = 167/373 (44%), Gaps = 22/373 (5%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY--AQNIGESS 62 + IH++ S P+ LNRD+ M K IFGG+ R RISSQ KRA+RK + + + Sbjct: 3 LEIHLIQSFPPANLNRDENGMPKSTIFGGRPRARISSQCQKRAVRKYYHQYAELDPAHFA 62 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 R+ + K G +Q + LAL G + +K A + E+ Sbjct: 63 ARSRNWLPELKSKLVKAGIPDEQAGMAARLALEQGLKLKFNDKNEATTIVFLGKTELDAI 122 Query: 123 CEQVAK---AEADNLDDKKLL--KVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 E + K A L ++K + + + I V+ + D+AL GRM S Sbjct: 123 AEILIKNWSAIESGLREEKPKLPQKIAKAIEKALVDTGKPGDVALFGRMMAS---LPTVN 179 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A+ +AHAI+ + + + D+FTAVDDL ++ G+ H+G ++S +YR+A ++ Q Sbjct: 180 VDAAVQVAHAISINALQQEFDFFTAVDDLGSSEDTGADHMGETGYNSSTYYRFAVLDKKQ 239 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANA 293 L ENLGG E I VP Q +AA +VM + P+S+ +A Sbjct: 240 LVENLGGT--EHLGSIIKAFATAFIHAVPSGHQNGFAAHTRPALVMAVVREGQPISLVDA 297 Query: 294 FEKAVKAKDGF--LQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ----VKQM 347 FE V GF L+ +++A ++YW + YG + + + Sbjct: 298 FENPVAPSGGFSLLENAVKALDEYWGSLVKMYGEADVQYKGVVVLDRLAARLNVLKSSKK 357 Query: 348 PTLEQLKSWVRNN 360 ++E+L Sbjct: 358 DSVEELLKSALKA 370 >UniRef50_A1ARH7 CRISPR-associated protein, Cse4 family n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1ARH7_PELPD Length = 374 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 114/379 (30%), Positives = 173/379 (45%), Gaps = 23/379 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M + IHVL + +PS LNRDD KDA+FGG RR R+SSQ LKR++R+ QN G Sbjct: 1 MKTIVEIHVLQNFAPSNLNRDDTGAPKDALFGGTRRARVSSQCLKRSVREYFKD-QNKGW 59 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQ-------KIIDKTLALLSGK-SVDEAEKISADAVT 112 + RT + + E K I+ ++ L V ++ +D + Sbjct: 60 VADRTKRVVYALKERISPVLESQKDFSEDNLLKAIEVAVSNLGSNKKVKVDKEKKSDVLL 119 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLK--EDIAAIRVNLQQGVDIALSGRMATSG 170 EI + VA++ AD L K +V++ D + VD+AL GRM Sbjct: 120 FLSPKEIDALAQVVAESYADLLKTKLSDQVVRNLNDAIDGENKSRLSVDVALFGRMLA-- 177 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE---QGSAHLGTQEFSSGVFYRY 227 + + A +AHAI+TH V+ + D++TAVDDL+ G+ +GT EF+S FYRY Sbjct: 178 -VMPEKNQNAACQVAHAISTHAVEREFDFYTAVDDLKPEDTAGADMMGTVEFNSACFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS--D 285 A ++ +L NL A A + + P KQ T+AA NP + V V Sbjct: 237 AVVDWEKLLVNLQ-ADEALATKGLRAFLEGFVVAEPTGKQNTFAAHNPPEFVAVTVRRNA 295 Query: 286 MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQ 343 P ++ANAFE AV+ + + S + + + +G + +L++ I Sbjct: 296 APRNLANAFETAVRVRKDESLTRKSAEGLANKAKALQSAFGGDEKTFVLNLAEAT-IDGF 354 Query: 344 VKQMPTLEQLKSWVRNNGE 362 MPTL L + Sbjct: 355 GIVMPTLNDLLDKALLAVQ 373 >UniRef50_C3PF94 CRISPR-associated protein n=5 Tax=Corynebacterium RepID=C3PF94_CORA7 Length = 384 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 109/388 (28%), Positives = 171/388 (44%), Gaps = 32/388 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS I+IH L + PS +NRDD K AIFGG R R+SSQS KRA+R + Sbjct: 1 MSLVIDIHALQTLPPSLINRDDTGAPKSAIFGGVPRQRVSSQSWKRAIRNYFEKNVDPEF 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSV-----------------DEA 103 R+ L + L + ++ I + L + ++ Sbjct: 61 VGDRSKRLPEKIAKLVENHDGWDSERAIKQVSDLFKAAGISTEVDSKRIKELEKSDAEDK 120 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALS 163 E++ +A P I +Q+ +A +D + +K+ A + ++ Q VD+A+ Sbjct: 121 EELIKEASYPRTKYLIFLSPQQIDRAVRAIVDADG--EKIKKAEAKVILDTQHSVDMAMF 178 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEF 219 GRM VD A+ +AHA+ H + D+FTAVDDL +E G+ +GT + Sbjct: 179 GRMIADDAAF---NVDAAVQVAHALGIHSSAPEFDYFTAVDDLAEDGEETGAGMIGTVQM 235 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 S YR+A +N+A L +NL AS E A + A V +P K T+A ++V Sbjct: 236 MSSTLYRFATVNVAGLTKNL--ASEENAKQAAVQFVDAFIKSMPTGKINTFANHTLPELV 293 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFS-LS 335 V D P+S+ AFE+ V+A D +A + YGL AA +S Sbjct: 294 YVTVRDTRPVSLVTAFEEPVQATDDKNLRLAGAEALAKEEREFEENYGLKPLAAFAVGVS 353 Query: 336 DVDPITAQVKQMPTLEQLKSWVRNNGEA 363 + A + + TL +L + ++ Sbjct: 354 EARAPFADIAETVTLPELSERLTAALDS 381 >UniRef50_A7BA64 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA64_9ACTO Length = 374 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 107/379 (28%), Positives = 168/379 (44%), Gaps = 23/379 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS F++IHVL + PS NRDD K A FGG +R+RISSQ++KRA R+ G Sbjct: 1 MSVFVDIHVLQTLPPSNPNRDDTGAPKSATFGGVQRMRISSQAIKRATRQDFEGKIADGN 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEA-------EKISADAVTP 113 +RT + +L + +R D + LA + K++ + + + Sbjct: 61 YGVRTKKIVELVARTITE--KRPDLEAASIELAEMGLKAIGFKLAEPRGNKSDNELKESG 118 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V A E V+ A + KE V+ +DIAL GRM Sbjct: 119 FLVFLSAKQIEHVSDALISVAHEDDPAAAFKELKPRSLVDTDHSIDIALFGRMVAEPNA- 177 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ------EQGSAHLGTQEFSSGVFYRY 227 VD A +AHAI V+ + D++TAVDD + ++G+ +GT EF+S YRY Sbjct: 178 --LNVDAACQVAHAIGVGAVEREYDYYTAVDDAKKRNDEADEGAGMIGTIEFASATVYRY 235 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DM 286 A IN+ L+ENLG A V +P K T+A + V+V D Sbjct: 236 ATINVDLLRENLG--DDAVADRAVELFVDSFVRSMPTGKVTTFANRTLPEAVLVQVRDDQ 293 Query: 287 PLSMANAFEKA-VKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITAQV 344 P++M+ AFE+ + + GF +P+I F ++ ++ GL + S + +++ Sbjct: 294 PINMSGAFEEPIIAGQHGFAEPAIARFVEFESQLRELTGLEAVESLVSWTTPRGESFSEL 353 Query: 345 KQMPTLEQLKSWVRNNGEA 363 + L L Sbjct: 354 GKQVRLASLGETAAEAVRG 372 >UniRef50_Q2JH28 CRISPR-associated protein, CT1975 n=6 Tax=Actinomycetales RepID=Q2JH28_FRASC Length = 384 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 97/380 (25%), Positives = 165/380 (43%), Gaps = 22/380 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M +I++H+L + PS LNRDD K A++GG +R R+SSQ+ KRA R + + + Sbjct: 1 MRCYIDVHILQTVPPSNLNRDDAGTPKQAVYGGVKRARVSSQAWKRATRTAFADHIDQAQ 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEK-ISADAVTPWVVGEI 119 RT ++ L + +LL+ + +K + + ++ Sbjct: 61 LGTRTKRISALLAERLATRCALDAETSTRIATSLLTALKISAGKKAAETAYLLFFGRPQL 120 Query: 120 AWFCEQVAKAEA--DNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 + + + +L D LL +K+ + +D+AL GRM Sbjct: 121 ERLIDLIVEDVPRLADLSDGDLLAAVKDVPVLATLGSDHPIDVALFGRMVAD---LASLN 177 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQ 234 VD A +AHA++TH VD + D++TAVDD E G+ +GT EF S YR+A + L Q Sbjct: 178 VDAATQVAHALSTHAVDVEFDYYTAVDDQNAKDETGAGMIGTVEFQSATLYRFATVGLHQ 237 Query: 235 LQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV-MVNFSDMPLSMANA 293 L ENLGG E +E + T +P Q ++A +++ + D P+++ +A Sbjct: 238 LAENLGG-DIEATVEALRVFLTAFTTSMPTGHQNSFAHRTVPNLLTIAIRPDQPVNLVSA 296 Query: 294 FEKAVKAKD-GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDP----------ITA 342 FEK V + G L S++ F + + +GL + D I Sbjct: 297 FEKPVLPRGRGVLTGSLEQFAIELNSASTLWGLQPDILASTYRAPDDTNTNTDTTAMIVK 356 Query: 343 QVKQMPTLEQLKSWVRNNGE 362 + + +++ V Sbjct: 357 ALGEPKPFDEVLDTVVAAAR 376 >UniRef50_A0LM53 CRISPR-associated protein, Cse4 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM53_SYNFM Length = 384 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 119/389 (30%), Positives = 185/389 (47%), Gaps = 36/389 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI----G 59 F++IH++ + +PS LNRDD N KD FGG RR RISSQ +KR +R ++Q + G Sbjct: 2 FVDIHIIQNFAPSNLNRDDTNSPKDCEFGGYRRARISSQCIKRVVRSHRSFSQAVVHAGG 61 Query: 60 ESSLRTIHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 ++ +RT + ++L D+ +K G+ + ++ + ++ + + E Sbjct: 62 DTGVRTKRIKSRLMDLFAKKYGKPEIVETEKVAETVIELLGLKLKDEEKTEYLLYLGENE 121 Query: 119 IAWFCEQVAK-----AEADNLDDKKLLK------------VLKEDIAAIRVNLQQ-GVDI 160 A + DKK K LK + R + DI Sbjct: 122 AAQLARLAVDSWDALLAIEPEQDKKKKKGTGQESLKEFQEELKGIVGKRRKEARSYAADI 181 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQ 217 AL GRM VD A +AHA++T++V+ ++D+FTAVDDL +E GS +G Sbjct: 182 ALFGRMIADNKNM---NVDAACQVAHAVSTNKVEMEMDYFTAVDDLLPGEETGSDMIGVV 238 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 EF+S FYRY+N+N+++L ENL G + + V VP KQ + AA NPA Sbjct: 239 EFNSSCFYRYSNVNVSKLAENL-GFNNDLTTAALLGYVEASVKSVPTGKQNSMAAQNPAG 297 Query: 278 M--VMVNFSDMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFS 333 V+V P S+ANAF+K V+ + SI A +Y++R+ YG G S Sbjct: 298 YARVIVRRDGFPWSLANAFQKPVRPSLDKSLEEASIDALERYFERLKAVYGTEGIVCDAS 357 Query: 334 LSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 + +++M L+ LK+ V G Sbjct: 358 FNLHRDDGGSLRKM--LDALKACVAGEGS 384 >UniRef50_A4XYU0 CRISPR-associated protein, Cse4 family n=5 Tax=Bacteria RepID=A4XYU0_PSEMY Length = 384 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 109/387 (28%), Positives = 164/387 (42%), Gaps = 31/387 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQ--NI 58 MS F+ H++ + +PS LNRDD KDA+FGG RR R+SSQ KRA+R + + Sbjct: 1 MSLFVEFHLIQNFAPSNLNRDDTGAPKDALFGGHRRARVSSQCFKRAIRLAAQEHELVAP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 +RT L L L ++L R + K L+ + + + + E Sbjct: 61 EFRGVRTKKLKTL---LLERLAGRDPLEAEGKIEVALAAAGLKLKDDGKTEYLLFLGEAE 117 Query: 119 IAWFCEQVAK----------------AEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIAL 162 IA F + + + + K A ++ + VD+AL Sbjct: 118 IAGFATLIEQHWDELAGAPAGGEKKGEKKGKKEAKASAPAEVVKKAKALLDGGKAVDVAL 177 Query: 163 SGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEF 219 GRM D A +AHAI+TH+V+ + D+FTAVDD E G+ +G EF Sbjct: 178 FGRMLAD---MPEVNQDAACQVAHAISTHRVEREFDYFTAVDDKGGPDETGAGMIGQVEF 234 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 +S YRYA ++ +L NL RE L + +P KQ T+AA N V Sbjct: 235 NSATLYRYAVVDAGKLLGNLQ-QDRELTLSALEAFTQAMVRAIPTGKQNTFAAHNLPSFV 293 Query: 280 MVNFSDM-PLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSD 336 V PL++ANAFEK + A+ S+ ++ ++A Y + Sbjct: 294 GVCLRHAGPLNLANAFEKPIAARQDAALSSLSVTELAKHEGKLAAVYADASDQWAYLDLS 353 Query: 337 VDPITAQVKQMPTLEQLKSWVRNNGEA 363 + + L +L SWVR A Sbjct: 354 EAWPQQKGFAVQNLGELASWVRMQVAA 380 >UniRef50_B0TDU0 Crispr-associated protein, ct1973 family, putative n=2 Tax=cellular organisms RepID=B0TDU0_HELMI Length = 385 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 117/389 (30%), Positives = 174/389 (44%), Gaps = 37/389 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE--SS 62 + IHVL +H+P+ LNRD+ KD +FGG RR RISSQ KR +R S + +IGE Sbjct: 2 VEIHVLQNHAPANLNRDESGSPKDCMFGGVRRGRISSQCQKRTIRCSPLFQDSIGESRLG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWF 122 +RT L L +LG + I A G ++ K D +T + Sbjct: 62 MRTRKLPFLVKEELMRLGLSEELAKIGARKASGLG---NKDGKERDDEITAQAIFLTQED 118 Query: 123 CEQVAKAEADNLDDKKLLKVLKEDIAAIRVN------LQQGVDIALSGRMATSGMMTELG 176 +A+ +L DK + + ++ + VD+AL GRM TS + Sbjct: 119 VSVIARCLFRHLKDKTVKQAKAIKAQELQKDPELVGWRPVTVDVALFGRMTTSTAFND-- 176 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+ ++ + HAI+TH+VDS+ D+FTAVDDL + G+ +G EF+S +Y+Y N+++ Sbjct: 177 -VEASVQVGHAISTHRVDSEFDYFTAVDDLMGDGDSGADMIGDTEFNSCCYYKYFNVDMD 235 Query: 234 QLQENLGGASR-------------EQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +L+ NL G R A I + L P KQ ++AA V+ Sbjct: 236 ELKRNLAGPDRLKKLTAEERQDLARDAAHIVKAFIESLVFCSPDGKQNSFAARQLPSAVL 295 Query: 281 VNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL-- 334 V P+S ANAF K V A+ +Q S+ AF + +GL L Sbjct: 296 VEVKKRKIPVSYANAFVKPVTARGEMDLVQASVNAFLDHVKETEKCFGLTPNRRWLLLMG 355 Query: 335 -SDVDPITAQVKQMPTLEQLKSWVRNNGE 362 T QV P L + + GE Sbjct: 356 CESPKMTTDQVSTFPALVEELTAALQQGE 384 >UniRef50_C4FG89 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG89_9BIFI Length = 387 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 104/387 (26%), Positives = 166/387 (42%), Gaps = 32/387 (8%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH + PS +NRDD K A GG R R+SSQ+ KRAMR + + Sbjct: 2 FMDIHCIQQVPPSNINRDDTGSPKTAYVGGALRSRVSSQAWKRAMRGVFDDMLDSDKLGK 61 Query: 64 RTIHLAQLRDV---LRQKLGERFDQKIIDKTLALLSG--KSVDEAEKISADAVTPWVVGE 118 RT + L ++ +++ + LAL K+ + A VT +++ Sbjct: 62 RTKGVVALIASSITAKRPDLAESAEELGQRVLALEGIGVKASNRAGSDKGTLVTDYLIFI 121 Query: 119 IAWFCEQVAKAEADNLD---------DKKLLKVLKEDIAAIR------VNLQQGVDIALS 163 +++A D K L K K D+A ++ + Q +DIAL Sbjct: 122 ANNEIDKLADWAIAASDKGRDFSKVGKKGLSKAEKTDLAKMKNEVSEIFHGPQAIDIALF 181 Query: 164 GRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFS 220 GRM + D + +AHA + Q+ + D+FTAVDD G+A L T F+ Sbjct: 182 GRMLANA---PDLNTDASAQVAHAFSIDQITPEYDYFTAVDDCASEDNAGAAMLDTVGFN 238 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 S YRYA +N+ L+E L A+E A V +P KQ T+A + V+ Sbjct: 239 SSTLYRYAAVNIDALKEQLQ--DASAAVEGAVAFVEAFIKSMPSGKQNTFANHTLPEDVV 296 Query: 281 VNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV 337 V D P+S A+AFE+ V+ K+G + I+ + + + Y A + S Sbjct: 297 VVLRDSQPISAADAFEEPVRRKEGVSVSRQGIERLGKRLNEIRVNYSEEPVKAWYIASGG 356 Query: 338 D-PITAQVKQMPTLEQLKSWVRNNGEA 363 + + + +L L+ +R A Sbjct: 357 EVDSLKEWSEQVSLPDLEHGLRETLNA 383 >UniRef50_D1YEE3 CRISPR system CASCADE complex protein CasC n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE3_PROAC Length = 374 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 100/375 (26%), Positives = 166/375 (44%), Gaps = 26/375 (6%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S +++IHV+ S PS +NRDD K A++GG RR R+SSQ+ K+A+R S ++ Sbjct: 3 SYYVDIHVIQSVPPSNVNRDDTGSPKSALYGGVRRARVSSQAWKKAVRTSFKEFLPANQT 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD--------AVTP 113 RT+ + +L ++ + + +AEK T Sbjct: 63 GSRTLRVVELLMNRLTAAPYGLPEEDARQKALEVVKALGLKAEKPRKKDESGAEGIERTQ 122 Query: 114 WVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMT 173 ++V +++A+ A D K+ + A + G+++AL GRM Sbjct: 123 YLVFYSNQQLDRLAQLAA--TTDGKITATDAKKAA----DSDHGIEVALFGRMVADSK-- 174 Query: 174 ELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYAN 229 VD A+ +AHA++TH V+ + D+FTAVDD + + G+ +GT EF+S YR+A Sbjct: 175 -DLNVDSAVQVAHALSTHAVEIESDYFTAVDDYKLDEDDAGAGMIGTVEFTSETLYRFAT 233 Query: 230 INLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPL 288 + ++ L++NLG + + A+ V +P KQ T+A D V+V Sbjct: 234 VAVSTLKDNLG--DVDLTAQAASAFVRGFIMSMPTGKQNTFANNTIPDAVVVQVRKGRSA 291 Query: 289 SMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGY-GLNGAAAQFSLSDVDPITAQVKQ 346 S AFE V + GF+ S QA Y + G A+ + + Sbjct: 292 SFIGAFEDPVTSDDGGFVAASCQAVAAYAHDCEEAFLGAPEASFVTRVGSRTEAIGTMGT 351 Query: 347 MPTLEQLKSWVRNNG 361 ++ L S VR+ Sbjct: 352 QMPIDDLVSSVRDQV 366 >UniRef50_D2TKK6 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK6_CITRO Length = 363 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 97/358 (27%), Positives = 151/358 (42%), Gaps = 21/358 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K + GG R+RISSQSLKRA R S + Q + G Sbjct: 13 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGTTRLRISSQSLKRAWRTSELFEQALAG 72 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 +R+ +A+ + K G + + V K A P E Sbjct: 73 NIGIRSGRIAREAAEILIKSGIDEKKAVAYVEAIARCFGKV----KADKKAKEPLTNSET 128 Query: 120 AWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTE 174 ++ AE D + + + KE+ A+ + + VDIA+ GRM Sbjct: 129 EQLV-HISPAEFDAVKALAHRLAEEKRAPKEEELALLRHDRMAVDIAMFGRMLADK---P 184 Query: 175 LGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYANI 230 V+ A +AHA + + D+FTAVDDL + G+ HLG F S +FY Y I Sbjct: 185 EFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRANSDDAGAGHLGYTGFGSALFYTYICI 244 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLS 289 N L +NL G + + A + P KQ ++A+ A M D P S Sbjct: 245 NKDLLIKNLNG-NVDLANQTLRAFTEAALKVSPTGKQNSFASRAYACWAMAEKGTDQPRS 303 Query: 290 MANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +A AF K + L ++Q + + + Y F++ + + V + Sbjct: 304 LAAAFYKPIVGS-DHLNVAVQRVTELRENMNAVYEQQTEFVGFNVMNKEGSIKDVLEF 360 >UniRef50_C6SPJ0 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ0_STRMN Length = 359 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 90/356 (25%), Positives = 162/356 (45%), Gaps = 24/356 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++I+ + + PS +NRDD K +GG RR R+SSQS K+AMR Y + Sbjct: 11 FLDIYAIQTLPPSNINRDDTGSPKTTQYGGVRRARVSSQSWKKAMRDYFYEHAEEEQLGK 70 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 RT + ++K K + + + D + +G Sbjct: 71 RTRKVVNYVAEKIIHQKIDLNEKESSKLATD-----ILKLAGVPTDGKVLFFIGNTE--A 123 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 E++A A + DK+ + + + +D+AL GRM + TE D + Sbjct: 124 EKLATAAVKGVKDKEEARKI--------MQSNLALDVALFGRMVANDKETEA---DASSQ 172 Query: 184 IAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG 240 AH I+TH V ++ D++TAVDDL + + LGT EF+S YRYAN+ + + G Sbjct: 173 FAHPISTHAVQTEFDFYTAVDDLASDDDAKAGMLGTVEFNSSTLYRYANVAIHEFLVQRG 232 Query: 241 GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVK 299 +RE ++ + A +P K ++A +++ D P+++ +AFE+ VK Sbjct: 233 --NREDLVDSLQLFIKAFAESMPRGKINSFANQTIPQTLIITVRSDRPVNLVSAFEEPVK 290 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKS 355 + +G++ SI+ ++ + +V + SL +V+ +T + ++ +L Sbjct: 291 SSNGYVTKSIEKLSKEFVKVEKMVKKPVLSFYVSLEEVEALTKVGIEKNSITELVE 346 >UniRef50_Q3A5Z5 CRISPR-associated protein, Cse4 family n=23 Tax=Bacteria RepID=Q3A5Z5_PELCD Length = 373 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 105/376 (27%), Positives = 163/376 (43%), Gaps = 31/376 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MS FI +H+L S+ P+ LNRDD+ K A GG R+R+SSQSLKRA R S + + + Sbjct: 1 MSRFIQLHLLTSYPPANLNRDDLGRPKTAKMGGVDRLRVSSQSLKRAWRTSDLFGKTVKN 60 Query: 61 -SSLRTIHLAQLR--DVLRQKLGERFDQKIIDKTLALLSG-KSVDEAEKISADAVT---- 112 RT + + ++ + +G + + K + + EK + + Sbjct: 61 GLGTRTKEMGRKVYERLVEKGIGHKDALSWAGAIAGVFGKLKKLTDKEKTALKKLATEER 120 Query: 113 ---PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ----GVDIALSGR 165 V EI + E LD + KE +NL + VDIAL GR Sbjct: 121 REKELVEVEIEQLAFFDLEEEQAVLDLTNSIAERKEGPQPEELNLLRQKMTSVDIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M S V+ A +AHAI+ H + + D+FTAVDDL ++ G+AH+G F++ Sbjct: 181 MLASS---PAFNVEAACQVAHAISVHPIVIEDDYFTAVDDLNDGSEDAGAAHIGETGFAA 237 Query: 222 GVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMV 281 G+FY Y IN L ENLGG + A + P KQ ++A+ A V+ Sbjct: 238 GLFYSYICINRDLLAENLGG-DEDLAQRAIAALTEAAVKVPPNGKQNSFASRAYASYVLA 296 Query: 282 NFSD-MPLSMANAFEKAV------KAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF-S 333 + P S++ AF K + + F +++A + + YG + Sbjct: 297 EKGEQQPRSLSVAFLKPIDNRTLYRDDQDFGTAAVEALEAHRQNMNKVYGDCADELYAIN 356 Query: 334 LSDVDPITAQVKQMPT 349 D A++ T Sbjct: 357 ALKGDGAMAELLDFVT 372 >UniRef50_C7QEM5 CRISPR-associated protein, Cse4 family n=13 Tax=Actinomycetales RepID=C7QEM5_CATAD Length = 399 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 103/395 (26%), Positives = 172/395 (43%), Gaps = 38/395 (9%) Query: 1 MSN-FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ ++IH+L + PS LNRDD K A++GG RR R+SSQ+ KRA R++ + Sbjct: 1 MTRVILDIHILQTVPPSNLNRDDTGSPKTAVYGGVRRARVSSQAWKRATRQAFGDLLDPS 60 Query: 60 ESSLRTIHLAQ--------LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV 111 E +RT +A+ L L ++I S ++ + +D Sbjct: 61 ELGVRTKRVAEQIANRMTALEPSLSPGDAVAVAVEVIKAATGAKSEVPKRKSAAVKSDQD 120 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDD------KKLLKVLKEDIAAIRV----NLQQGVDIA 161 + E + +++++ +NL K + LK+ RV + + VDIA Sbjct: 121 ATAALPETGYLM-FLSESQLNNLARLGVEGSKDITAFLKDKDFKNRVRQAADTRHSVDIA 179 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD---LQEQGSAHLGTQE 218 L GRM VD A +AHAI+ H V+++ D+FTAVDD E G+ +G + Sbjct: 180 LFGRMVADAT---DINVDAAAQVAHAISVHAVENESDYFTAVDDRSTEAEPGAGMIGIVD 236 Query: 219 FSSGVFYRYANINLAQLQENLGG------ASREQALEIATHVVHMLATEVPGAKQRTYAA 272 F++ YRYA +++ +L +NLG + E + A +P K T+ Sbjct: 237 FNAATLYRYAAVDVNRLADNLGAGLLEGESQTEPVRRAVEAFIRGFALSMPTGKVNTFGN 296 Query: 273 FNPADMVMVNFS-DMPLSMANAFEKAVKA---KDGFLQPSIQAFNQYWDRVANGYGLNG- 327 D+V+V P+S A AFE+A+ A + G+L+ + + Y ++ Y L Sbjct: 297 HTVPDVVLVKLRASRPISFAAAFEEAISAGEHQGGYLKGACERLASYIPKLEQAYDLQEG 356 Query: 328 -AAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 + Q ++ QL + V Sbjct: 357 TDSWVVCAGSATEALEQAGDPVSISQLVAAVGAAV 391 >UniRef50_Q03C61 CRISPR-associated protein n=6 Tax=Firmicutes RepID=Q03C61_LACC3 Length = 361 Score = 364 bits (936), Expect = 2e-99, Method: Composition-based stats. Identities = 100/365 (27%), Positives = 167/365 (45%), Gaps = 30/365 (8%) Query: 1 MSN---FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN 57 M+N +I+IHVL + + +NRDD K A++GG R R+SSQS KRAMR + Sbjct: 1 MTNKNLYIDIHVLQTVPSANINRDDTGAPKKALYGGVTRARVSSQSWKRAMRLRFNQE-D 59 Query: 58 IGESSLRTIHLAQLR-DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 ++ LRT + QL L+ D++I K A+ S + + A+ Sbjct: 60 HDDAGLRTKEVPQLLRQALKAAAPALTDEEIAAKVDAVFSTAKIKITKDGQTGALMLIST 119 Query: 117 GEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELG 176 G++ + + L + +Q +D+AL GRM Sbjct: 120 GQLKKLAQYALDN-----------EALDKKELTKLFKGEQSLDLALFGRMVADN---PEL 165 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLA 233 V+G+ +AHAI+TH++ + D+FTA+DD + G+A LGT E++S YRYAN+N Sbjct: 166 NVEGSAQVAHAISTHEIVPEFDYFTALDDFKPEDNAGAAMLGTVEYNSSTLYRYANLNFQ 225 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMAN 292 + + N+GG A+ A + +P KQ T+A + VMV D P+++ + Sbjct: 226 EFEANIGGR---AAVSGALSYIKEFLLSMPNGKQNTFANKTLPNYVMVTLRPDTPVNLVS 282 Query: 293 AFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQ 352 AFE VK+ G+++ S++ Q + ++ + + +Q + Sbjct: 283 AFEDPVKSNHGYVEASVKRLEQEYQ--DALQFVDAPLFTAVVGKTNG--EVGEQQANVNG 338 Query: 353 LKSWV 357 L V Sbjct: 339 LLDAV 343 >UniRef50_A1SV72 CRISPR-associated protein, Cse4 family n=2 Tax=Gammaproteobacteria RepID=A1SV72_PSYIN Length = 337 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 136/337 (40%), Positives = 195/337 (57%), Gaps = 19/337 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ FINIH LISH S +NRDD MQK A+FGG R RISSQ LKRA+R+S Y + + E Sbjct: 1 MTTFINIHTLISHPSSMMNRDDSGMQKTAVFGGSVRSRISSQCLKRAIRQSDIYGEAVAE 60 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA-DAVTPWVVGEI 119 S+RT +L D+ ++ + E + I D L + S + D+ +I DAV P+ +G I Sbjct: 61 KSIRTNKFDELLDLCKEAMPETDIKLIEDVLLNMGSKVTKDKKTEIRNFDAVQPYAIGSI 120 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 + + + K L K+++ +D+ALSGRM S V+ Sbjct: 121 ----REAINMVNEGTELKDLKKIVQ----------IPTIDVALSGRMDAS---CPPRNVE 163 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENL 239 AMS+AH++TTH D ++DWFTA DDL EQGS H+GT EFSSGVFYRYA+IN+ L +N+ Sbjct: 164 AAMSVAHSLTTHSADIEVDWFTACDDLAEQGSGHIGTTEFSSGVFYRYASINVDLLAKNV 223 Query: 240 GGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVK 299 E I ++ A P AKQ+ +AA+N AD VM S+ P+S+ANAF K ++ Sbjct: 224 KSTVSEVTP-IINTMIRCFAQVSPSAKQKVFAAYNQADFVMATHSNQPISLANAFRKPIE 282 Query: 300 AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSD 336 ++ SI A ++++++ N Y L+ A L+D Sbjct: 283 NNGDVMENSIAALVKHYEKLTNAYELDSKAIALDLTD 319 >UniRef50_C7LYW7 CRISPR-associated protein, Cse4 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW7_ACIFD Length = 386 Score = 361 bits (928), Expect = 2e-98, Method: Composition-based stats. Identities = 111/373 (29%), Positives = 161/373 (43%), Gaps = 25/373 (6%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES--- 61 I++HVL + PSCLNRDD N K A++GG RR R+SSQS KRA R+ + Sbjct: 9 IDVHVLQTLPPSCLNRDDTNAPKTALYGGARRARVSSQSWKRATRRYFNENLATIGTDWL 68 Query: 62 -----SLRTIHLAQLRDVLRQKLGERFDQKIIDKT-----LALLSGKSVDEAEKISADAV 111 +RT LA L Q D + D A +E K A Sbjct: 69 RSRGGGIRTRKLAGLLHERVQARVRDLDVREDDVARLVNLAAGALLGLKEEKLKKRAQET 128 Query: 112 TPWVVGEIAWFCEQVAKAEADNLDDK-KLLKVLKEDIAAIRVNLQQGVDIALSGRMATSG 170 P + + E A L+ + L D+ + +D+AL GRM Sbjct: 129 QPADLEYALFVSESAIDAAVGELERSLRAGDDLDLDVLTTAMGRDLSLDVALFGRMIAD- 187 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRY 227 T VD A +AHAI+TH+V S+ D++T VDDL E G+A +G EF+S YR+ Sbjct: 188 --TPNLNVDAACQVAHAISTHRVTSEFDFYTTVDDLAGDDETGAAMMGFIEFNSATVYRF 245 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN-FSDM 286 A ++L +L +NLG + + A +P Q T+AA D+V V+ D Sbjct: 246 ATVSLGRLADNLG--DPDAVPTGVRAFIEAFAKSLPTGHQNTFAALTVPDLVFVSMRGDQ 303 Query: 287 PLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSD--VDPITAQV 344 P+S+ AFE V++ G++ S + Y D + YG+ S + + Sbjct: 304 PVSLVGAFEAPVESDRGYVHASAERLATYADDIDGLYGVPRLNGWASYVPKLEQAVATHL 363 Query: 345 KQMPTLEQLKSWV 357 QL V Sbjct: 364 GDSIAFPQLLDAV 376 >UniRef50_Q2FNL3 CRISPR-associated protein, CT1975 n=8 Tax=cellular organisms RepID=Q2FNL3_METHJ Length = 382 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 62/401 (15%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS FI IH+L S+ PS LNRDD+ K A GG +R+R+SSQSLKR+ R S ++ + G Sbjct: 1 MSEFIQIHMLASYPPSNLNRDDLGRPKTATVGGTQRIRVSSQSLKRSWRTSEAFSDALKG 60 Query: 60 ESSLRTIHLA-----------QLRDVLRQKLGERFDQKIIDKTLA-------------LL 95 +RT + L D+L K ++I D+ A + Sbjct: 61 AIGIRTRDMGVKIKKALVEGRLLSDILEGKESGVTRERIKDEKKAHEWAVKISSHFGKIE 120 Query: 96 SGKSVDEAEKISA------------DAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVL 143 GK D +K + + EIA + + + + + Sbjct: 121 GGKEKDSDKKSEKTDEKSNKNPLSHKQMVHYSPEEIAGIDDLLGRISGG--------EKV 172 Query: 144 KEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAV 203 +D + + VDIAL GRM + A+ ++HAIT H + D+FTAV Sbjct: 173 SDDDCIRLRSDHKAVDIALFGRMLADNAAY---NTEAAVQVSHAITVHDTPVEDDYFTAV 229 Query: 204 DDLQE----QGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLA 259 DDL + G+ H+G EF +G+FY Y IN L+ENL G E + ++ + Sbjct: 230 DDLNQLDDTAGAGHIGEAEFGAGLFYTYICINRDLLKENLQG-DNELSNRAIEALIRAAS 288 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 P KQ ++A+ + A ++V + P S+A AF K V KD + +++ DR Sbjct: 289 MVSPSGKQNSFASRSYASYLLVEKGTEQPRSLAAAFFKPVSGKDIY-GDAVKNLEGLRDR 347 Query: 319 VANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRN 359 + N YG + + S++ +D +L + S+V Sbjct: 348 MDNAYGTSFKQSSRSMNVIDGTG-------SLTDIISFVLE 381 >UniRef50_Q47PJ3 CRISPR-associated protein, Cse4 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ3_THEFY Length = 373 Score = 360 bits (924), Expect = 5e-98, Method: Composition-based stats. Identities = 103/373 (27%), Positives = 169/373 (45%), Gaps = 23/373 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 F++IH + + S +NRDD+ K ++GGK R R+SSQS KRA+R +G+ + Sbjct: 2 TFVDIHAIQTLPYSNINRDDLGSPKTVVYGGKERTRVSSQSWKRAVRHEV--EARLGDKA 59 Query: 63 LRTIH-LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPW------- 114 +RT ++++ LR++ + + + L GK + D+ P Sbjct: 60 VRTRRIISEIAKRLRERGWDADLADAGARQVVLSVGKKSGIKLEKEKDSEAPATSVLFYL 119 Query: 115 ---VVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGM 171 + E+A ++ A A K +L D + + V + L GRM Sbjct: 120 PVPAIDELAAIADEHRDAVAKEAAKKTPKGILPADRITEVLKSRN-VSVNLFGRMLAE-- 176 Query: 172 MTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYA 228 +VDGA+ AHA T H ++D+FTAVDD+ + GS H+ +FS+G FYRYA Sbjct: 177 -LPSTEVDGAVQFAHAFTVHGTTVEVDFFTAVDDIPKENDHGSGHMNAGQFSAGTFYRYA 235 Query: 229 NINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMP 287 N+NL +L EN G + A + + VP KQ AA D+V + D P Sbjct: 236 NVNLDRLVENTG--DAQTARTAVAEFLRAFLSTVPSGKQNATAAMTLPDLVHIAVRFDRP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 +S A AFE A+ DG+ + Q N Y +R+ + + ++ + + A ++ Sbjct: 294 ISFAPAFETALYGSDGYTLRACQELNNYAERLREVWPDDAIRGYATVENKTDLAALGERY 353 Query: 348 PTLEQLKSWVRNN 360 + L + Sbjct: 354 DSYPALIDAMVAA 366 >UniRef50_UPI0001AF1D4B hypothetical protein SghaA1_37372 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4B Length = 383 Score = 359 bits (922), Expect = 8e-98, Method: Composition-based stats. Identities = 99/389 (25%), Positives = 157/389 (40%), Gaps = 36/389 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYA-QNIGESS 62 I +H+L S S LNRDD+ K A FGG R RISSQSLKRA R + E Sbjct: 2 LIELHLLQSFPVSNLNRDDLGQPKTARFGGHTRARISSQSLKRAARTLLAQHGLDPSELG 61 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA----------DAVT 112 +RT L L + G +K + + + A + Sbjct: 62 VRTKRLRDAAASLLAERG---REKEQAVEVCQAGLEEIGFAAHTATGLTKYLLYVGKPAQ 118 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIR------------VNLQQGVDI 160 + + +AK A+ K+ + AA + ++ + DI Sbjct: 119 TLLADYCDERWDTLAKTVAEAKKRKEKQEKTPRKTAAKKPTKQAQEQAKRILDGTRAADI 178 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQ 217 AL GRM V+ A +AHA++TH V ++ D++TA+DDL E + +GT Sbjct: 179 ALFGRMIADNTDF---NVNAASQVAHALSTHAVVNEFDYYTALDDLRPDAEPAADMIGTV 235 Query: 218 EFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPAD 277 +F++ FYRYAN++L QL NL + A +H VPG KQ + +A Sbjct: 236 DFNAACFYRYANLDLEQLATNLPD-DPDLVARSARAWLHSFIHAVPGGKQNSMSARTMPQ 294 Query: 278 MVMVNFSDMP-LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ--FSL 334 ++ + ++ANAF V + S Q ++ ++ + YG S+ Sbjct: 295 TLLGVVRETGAWNLANAFLSPVTDVPDLMAASTQRLVDHFQQLRSFYGDTQLRHTTIASI 354 Query: 335 SDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 + + PTL+ S + + Sbjct: 355 GSDPAGMPENEIAPTLDDFVSRLLTATKG 383 >UniRef50_B8IZA6 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA6_DESDA Length = 350 Score = 357 bits (916), Expect = 5e-97, Method: Composition-based stats. Identities = 100/370 (27%), Positives = 165/370 (44%), Gaps = 31/370 (8%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD+ K A+FGG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDLGSPKTAVFGGVQRARVSSQCWKRAIREYCGELLPQHFKG 61 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + + LRD+ G ++ ++D+ T + Sbjct: 62 ERTRLIVEPLRDIFINTYGLDEATALVKANDLAEGLATLDKDAAKKNKLQTKTLFFTSRS 121 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A +N + KK K + + DIAL GRM S ++GA Sbjct: 122 ELEALAAIAVNNENIKKHAKTFAQSLCT------DAADIALFGRMVASA---PELTLEGA 172 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQL--Q 236 +HA++TH+ D++ID+F+A+DDL +E G+ GT EF++ +YR+ +NL L Sbjct: 173 AMFSHALSTHKADNEIDFFSALDDLLPSEETGAGMTGTLEFNAAAYYRFCALNLDMLADA 232 Query: 237 ENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAF 294 ++LG S ++ I V +P A++ + A V+ D P+ + NAF Sbjct: 233 DHLGALSPDERQGIVAAFVEATLKAMPVARKNSMNANTMPAYVLCVLRDSGQPVQLVNAF 292 Query: 295 EKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQ 352 EKAV + D G+++ SI+ + + R+ N +GL +L + Sbjct: 293 EKAVYSPDGRGYVEASIKRMEEEYQRLENTWGLTAVETIRMPLQ------------SLGE 340 Query: 353 LKSWVRNNGE 362 L VR + Sbjct: 341 LLQGVRRHVR 350 >UniRef50_C7MTA9 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTA9_SACVD Length = 390 Score = 354 bits (908), Expect = 4e-96, Method: Composition-based stats. Identities = 107/384 (27%), Positives = 168/384 (43%), Gaps = 31/384 (8%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 +I+IHV+ + S +NRDD K FGG R R+SSQS KR +R+ GE+ Sbjct: 4 PKYIDIHVIQTLPFSNVNRDDTGSPKTVEFGGVERTRVSSQSWKRVVRQH-VEEAVGGET 62 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVD-EAEKISADA---------- 110 + + + L ++ E+ + + +AL +GK + + EK +D Sbjct: 63 VRTRRVVVGVAERLIKQGWEKSEAEAAGVQIALSAGKKISLKQEKDESDEVVLTTNVLLL 122 Query: 111 VTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVN---LQQGVDIALSGRMA 167 + + E+A ++ + K L +K + + R+N ++ I L GRM Sbjct: 123 LPESGIDELAALADEHREVILAEAKKAKKLTGMKPKLPSERINEILSRRSATINLFGRMV 182 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSGV 223 VDGA+ +AHA TTH + D+FTAVDD+++ GS ++ T FS+G Sbjct: 183 AE---LPGANVDGAVQVAHAFTTHGTAVEYDFFTAVDDIEQKLDLPGSGYMDTALFSAGT 239 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FYRYAN+NL L NL + A + + T VP KQ AA D+V V Sbjct: 240 FYRYANVNLTDLLRNL-DQDTDLARVLVKTFLDGFITTVPSGKQNATAAVTLPDLVHVTV 298 Query: 284 S-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ-------FSLS 335 D P+S+ANAFE V DGF++ S + + +A G + Sbjct: 299 RDDRPVSLANAFEAPVGGGDGFVRKSAHRLDSHAGAIAELLGESHVLFSAHTTTPGAMPK 358 Query: 336 DVDPITAQVKQMPTLEQLKSWVRN 359 + + + E+L Sbjct: 359 NAEGWGHLGVNARSFEKLIDDAVG 382 >UniRef50_A8LYZ6 CRISPR-associated protein, Cse4 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ6_SALAI Length = 380 Score = 351 bits (902), Expect = 2e-95, Method: Composition-based stats. Identities = 108/387 (27%), Positives = 171/387 (44%), Gaps = 32/387 (8%) Query: 1 MS-NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG 59 M+ +++IHVL + + LNRDD+ K FG R R+SSQS KRA+R+ + G Sbjct: 1 MTARYVDIHVLQTVPYANLNRDDLGSPKTVRFGYADRTRVSSQSWKRAVRRELEE--SSG 58 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKI------IDKTLALLSGKSVDEAEKISADAVTP 113 + + RT L Q + + + + +K + +A Sbjct: 59 DKAKRTRRLPQAIQARLTGPDWDSELAAFAATQVMATLATIAVKADGFKVDKATGEAQVL 118 Query: 114 WVVGEIAWFC---------EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSG 164 + + E A+ +++ + L KK L D + + V I L G Sbjct: 119 FYLPERAFDMLADVCVQQRDRLIGLRSGALKLKKGEAPLPADAVRAAMEHRSDV-INLFG 177 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFS 220 RM VDGA+ +AHA TTH D +D+FTAVDDL+ + GS H+ + EFS Sbjct: 178 RMLAE---LPGSNVDGAVQVAHAFTTHGTDPQVDFFTAVDDLKQDADQAGSGHMNSAEFS 234 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +G FYRYA++NL L NLG A+E+ + T +P AK+ A F ++ Sbjct: 235 TGTFYRYASVNLEDLAHNLG--DPATAVELTRVFLSAFITAMPQAKKNATAPFTVPELAY 292 Query: 281 VNFS-DMPLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDV 337 + D P+S+A+AFE V+A G+ +PS + +Y ++ G G S Sbjct: 293 IAVRTDRPVSLASAFETPVRATFDSGYAEPSRRQLAEYAGQIYRLIGDQGMVYHGCASVD 352 Query: 338 DPITAQ-VKQMPTLEQLKSWVRNNGEA 363 D Q + + + L + + A Sbjct: 353 DKGLEQLGETRQSFDNLIATAVDKLRA 379 >UniRef50_A5UR15 CRISPR-associated protein, Cse4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR15_ROSS1 Length = 402 Score = 351 bits (901), Expect = 3e-95, Method: Composition-based stats. Identities = 114/406 (28%), Positives = 177/406 (43%), Gaps = 54/406 (13%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS- 62 I +H+L +H+PS LNRDD N KDAIFGG RR RISSQ++KR++R S ++ Sbjct: 2 LIALHLLQNHAPSNLNRDDNNEPKDAIFGGVRRARISSQAIKRSIRWSDHFRAPFETQGL 61 Query: 63 --LRTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSV------------------- 100 +RT L + L DQ+ I + A L Sbjct: 62 LAIRTQLLPEKVRHHLVNAGLNDDDQRAIVEAAARLGKGEQRSPSGEGEAGDERGDQNQP 121 Query: 101 ----------------DEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLK 144 +AE+I + EI +++ + + K L L+ Sbjct: 122 RSSSRSRRSSRQSNTTGDAERIKTAQLMFLTENEIQQLAQRLIEIVRE--KGAKHLNELQ 179 Query: 145 EDIAAIRVN--LQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTA 202 D + VDIA+ GRM TS V+ A+ +AHAI+TH V+ + D++TA Sbjct: 180 GDTLVREIGEYEPHSVDIAMFGRMTTSS---PFKDVEAAVQVAHAISTHAVEMEFDFYTA 236 Query: 203 VDDLQ-EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATE 261 VDD+ E G+ +G F+S +Y+Y +I+ L +NL G + A + ++ Sbjct: 237 VDDISGEAGAGFIGDTTFNSATYYKYFSIDWDGLLKNLHG-EQNVARQSVEALIRAALFA 295 Query: 262 VPGAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKAKD--GFLQPSIQAFNQYWD 317 +P KQ ++AA N D+ +V LS ANAF K V+A ++ S +A +Y Sbjct: 296 IPSGKQNSFAAHNLPDLALVEVRKENIALSYANAFVKPVRATGKLSLIEASAKALEEYIP 355 Query: 318 RVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 + Y L+ A S V + + LE+L +W+ Sbjct: 356 AINERYNLSAQRAFLST--VPFTLSGAECCSDLEKLITWLSKQIGG 399 >UniRef50_D1NTI0 CRISPR-associated protein, Cse4 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI0_9BIFI Length = 381 Score = 350 bits (899), Expect = 4e-95, Method: Composition-based stats. Identities = 89/374 (23%), Positives = 151/374 (40%), Gaps = 23/374 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+ + I+ + + PS +NRDD K AI+GG R R+SSQ+ KRAMR++ + + Sbjct: 1 MTTIVEIYAIQNVPPSNINRDDTGNPKTAIYGGVLRARVSSQAWKRAMREAFPEMLDADQ 60 Query: 61 SSLRTIH-LAQLRDVLRQKLGERFDQKIIDKTLA---------LLSGKSVDEAEKISADA 110 +RT + LAQ+ + K + + + A S + Sbjct: 61 LGIRTKNALAQIEQSIVAKRPDIDVETVHKAATAALTATGAKVEKSKRKGSMEGADLTQY 120 Query: 111 VTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSG 170 + EI + + D K K +K ++A+ + Q VDIAL GRM Sbjct: 121 LIFIANREIDKLADLAIAWIDADEDLDKPSKEMKGQVSAV-FHGPQAVDIALFGRMLADA 179 Query: 171 MMTELGKVDGAMSIAHAITTHQVDSDIDWFT---AVDDLQEQGSAHLGTQEFSSGVFYRY 227 D + +AHAI+ +V + D+FT G+A L T F+S YRY Sbjct: 180 ---PELNTDASAQVAHAISVDEVTPEYDYFTAIDDDAADDNAGAAMLDTVGFNSSTLYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-M 286 A + + L E L A ++ V+ +P KQ T+A +V + Sbjct: 237 ATVAVDSLYEQLQSAD--MTVKAVDAFVNAFLRSMPTGKQNTFANRTLPTAALVVVRNSQ 294 Query: 287 PLSMANAFEKAVKA--KDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD-PITAQ 343 P++ AFE+ V A + + + + + + YG AA ++ + Sbjct: 295 PINPVEAFERPVHAERDKSISRVAAERLGRKLQDIQDTYGETPIAAWNIVAGQPVELLDS 354 Query: 344 VKQMPTLEQLKSWV 357 + + TL + + Sbjct: 355 LSEHVTLPVMVESL 368 >UniRef50_A5FTJ7 CRISPR-associated protein, Cse4 family n=11 Tax=Acetobacteraceae RepID=A5FTJ7_ACICJ Length = 370 Score = 350 bits (898), Expect = 5e-95, Method: Composition-based stats. Identities = 99/370 (26%), Positives = 155/370 (41%), Gaps = 26/370 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ +H+L PS +NRDD K A+ GG R+R+SSQ+LKRA R S +++ + G Sbjct: 1 MTQFLQVHLLTFFPPSNMNRDDTGRPKTAMVGGAMRLRLSSQALKRAWRTSTIFSEALKG 60 Query: 60 ESSLRTIHLAQLR-DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 RT L + L+ + + + +A GK ++ + E Sbjct: 61 YMGERTQRLGEEILKTLQAEGVSEVQALAVARAVAGQFGKLNEDETPARIQQLAFISPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAA-------------IRVNLQQGVDIALSGR 165 + + A L + K + + DIAL GR Sbjct: 121 RKAAFDLARRYAAGELPLPEKAKGKRGKANKTEGEEEVEAPEILLLRESDTAADIALFGR 180 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M + A +AHAITTH++ D D++TAVDDL ++ G+ +G F S Sbjct: 181 MLADK---PAFNREAAAQVAHAITTHRISVDDDYYTAVDDLKRPSEDAGAGFIGETGFGS 237 Query: 222 GVFYRYANINLAQLQENLGGAS--REQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 GVFY Y +IN+ L NLGG R+ A +V AT P KQ ++AA A + Sbjct: 238 GVFYTYMSINIDLLIRNLGGGDQARDLAATAIAALVEAAATTAPSGKQNSFAAHGRAGYI 297 Query: 280 MVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVD 338 + P ++A AF K V+ + SI ++ + + YG A + + Sbjct: 298 LAERGKAQPRTLAGAFAKPVEG-GDIMDASIGRLEEFREAIDKAYGPTADATKVMRVGGE 356 Query: 339 PITAQVKQMP 348 A + Sbjct: 357 GSLADIIVFA 366 >UniRef50_D0Y919 CRISPR-associated protein, Cse4 family n=2 Tax=Dehalococcoides RepID=D0Y919_9CHLR Length = 427 Score = 349 bits (897), Expect = 7e-95, Method: Composition-based stats. Identities = 100/414 (24%), Positives = 167/414 (40%), Gaps = 63/414 (15%) Query: 6 NIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS-LR 64 IH++ + +PS LNRDD K A FGG RR RISSQ KR+ R G A+ + +R Sbjct: 9 EIHLIQNFAPSNLNRDDTGQPKSATFGGFRRARISSQCSKRSTRLQGPLAELLENQGAVR 68 Query: 65 TI-HLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA------------- 110 T + ++ + K E D++ I+ + ++ K S Sbjct: 69 TRQLIMEIAKAIDTK--EEPDERTIEIVAGVFEAGGLERPAKRSGKVKSQAAEAIGEDGE 126 Query: 111 -----------VTPWVVGEIAWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNL 154 T ++ ++ +N DD K++ + + + + Sbjct: 127 INGNEGFESGNKTKILLFLDKMAFPKLIDVFKENWDDLAKGNKEVKEKACDKVGRLLFEA 186 Query: 155 QQGVDIALSGRMATSGMMTELGK----VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--- 207 + DIAL GRM T GK V+ A +AH I+TH++D ++D++TAVDDL Sbjct: 187 VKAPDIALFGRMLEVKNNTPFGKYNMSVEAACQVAHPISTHKIDMEMDFYTAVDDLNPDG 246 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG---------------GASREQALEIAT 252 E G+ +G F+S +YRYA ++ QL NL E+A ++ Sbjct: 247 ETGAGMMGVVGFNSACYYRYALVDRDQLARNLARKTERKNGGWAQGLETQDYEEADKVVK 306 Query: 253 HVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDM--PLSMANAFEKAVKA---KDGFLQP 307 + + +P KQ ++AA N + P+S+ANAF ++ D + Sbjct: 307 AFLEAMIYAIPTGKQNSFAAQNLPSFGLFVKRKGGVPVSLANAFSTPIRPVRDDDDLVGL 366 Query: 308 SIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPI---TAQVKQMPTLEQLKSWVR 358 S+ A ++WD + YG G ++++ V Sbjct: 367 SVNALTKHWDAIKELYGDQGIKVTSCFHLQQEKRLNGLAGSVKTSVDKAIHEVL 420 >UniRef50_C4ZJY0 CRISPR-associated protein, Cse4 family n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY0_THASP Length = 394 Score = 349 bits (897), Expect = 7e-95, Method: Composition-based stats. Identities = 107/394 (27%), Positives = 167/394 (42%), Gaps = 38/394 (9%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRK---SGYYAQN 57 + FI IH L ++ + LNRDD + K +GG R RISSQ LKR R + A+ Sbjct: 3 LPRFIQIHTLHTYPAALLNRDDAGLAKRLPYGGAIRTRISSQCLKRHWRVADDAFSLAKL 62 Query: 58 IGESSLRTIHLAQLR-DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT---- 112 + RT ++A+L L ++ + + L + +K A+ Sbjct: 63 GVPMATRTRYVAELIRQRLIEQGIDEARAYATAEALLEALFGEKADKKKEGVKALQTGQA 122 Query: 113 -PWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKE-----DIAAIRVNLQQGVDIALSGRM 166 + EIA+ + D D L + + + L G++ AL GRM Sbjct: 123 VLFGNEEIAYLARRCRDITGDFSDPVALKAEVAKFLKEEKKNIEAMKLGSGLESALFGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE----QGSAHLGTQEFSSG 222 TS + L D ++S+AHA T H+ + D+FT VDD + GSA + E +SG Sbjct: 183 VTSDL---LANRDASVSVAHAFTVHEAQVENDYFTVVDDFAQAEDGAGSAGIFDTELASG 239 Query: 223 VFYRYANINLAQLQENLGGASRE-----------QALEIATHVVHMLATEVPGAKQRTYA 271 ++Y Y I++ QL NL G E A ++ H++H++AT PGAK+ + A Sbjct: 240 LYYGYVVIDVPQLVANLEGIKVEDVFTIGADKRGLAGKVVQHLLHLIATVSPGAKRGSTA 299 Query: 272 AFNPADMVMVNFSD-MPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGA 328 ++ A V+V D P S+A AF + K + + YG+ A Sbjct: 300 PYDWAKFVLVEAGDWQPRSLAAAFHDPIPLKGDSSIRGRAASKLAKEIAAFDAAYGMPTA 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 SL D + + TL QL W+ Sbjct: 360 RRFLSL---DELAVPAAERATLSQLGEWIAQTVR 390 >UniRef50_B6B782 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=B6B782_9RHOB Length = 353 Score = 346 bits (889), Expect = 7e-94, Method: Composition-based stats. Identities = 107/330 (32%), Positives = 158/330 (47%), Gaps = 19/330 (5%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ F+ H+L ++ S NRDD K A+ GG R+RISSQSLKRA+R+S Y+AQ++ G Sbjct: 1 MTTFVQFHLLTTYPLSNPNRDDQGRPKQAMIGGSPRLRISSQSLKRALRESSYFAQDLAG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEI 119 + RT L L+ +L + + A G + EK S +A T + Sbjct: 61 HTGTRTRR---LATELKAELIGQGVEDAHADETATKIGAVFSKTEKGSTNATTLAFISPD 117 Query: 120 AWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVD 179 W +A+ A + L K+ AI VDIA+ GRM D Sbjct: 118 EW---ALARELAARDVAGEPLPAEKDLKKAILRRADGAVDIAMFGRMLADS---PDYNRD 171 Query: 180 GAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLAQL 235 A+ +AHA TTH+ + DWF+AVDDL+ + G+ H+G F SG++Y YA +N+ L Sbjct: 172 AAVQVAHAFTTHRAQAQDDWFSAVDDLKTREVDAGAGHIGEHGFGSGIYYLYACVNVDLL 231 Query: 236 QENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAF 294 ENL G R A + + LAT P KQ ++A A + V P ++ AF Sbjct: 232 VENLAG-DRALAAKGMEALARALATATPKGKQNSHAHHPRAGFIRVERGQQQPRDLSGAF 290 Query: 295 EKAVKAKDGFLQPSIQAFNQYWDRVANGYG 324 K A + + S++A ++ YG Sbjct: 291 HKPTAADE---RASVEALQGMAAKIDRAYG 317 >UniRef50_B6WQ62 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ62_9DELT Length = 341 Score = 346 bits (888), Expect = 7e-94, Method: Composition-based stats. Identities = 97/331 (29%), Positives = 156/331 (47%), Gaps = 20/331 (6%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 + +H+L S +CLNRDD K A+FG +R R+SSQ KRA+R+ Sbjct: 2 RHLELHILQSVPVACLNRDDFGSPKTALFGNVQRARVSSQCWKRAVRELMQEEVPALFGG 61 Query: 63 LRTIHL-AQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 RT + L +L ++ G + A G +V + + T + + Sbjct: 62 QRTRLILDPLCRILHEQHGLAE---EEARKKAEELGAAVSKLDTPPVRVKTLFFTSPLE- 117 Query: 122 FCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGA 181 E +A A + KK +K L + L+ DIAL GRM S ++GA Sbjct: 118 -LEALAAAYVATGNAKKAVKELAKHP------LKDAADIALFGRMVASDH---SLTLEGA 167 Query: 182 MSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQEN 238 +HA++TH+V ++ID+F AVDDL E G+ GT EF+S +YR+A +NL L+++ Sbjct: 168 AMFSHALSTHKVSNEIDFFAAVDDLQPEDEAGAGMTGTLEFNSATYYRFAALNLDLLEQH 227 Query: 239 LGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAFEK 296 L S E+ E+ + V VPGA++ + A V+ + P+ + NAFEK Sbjct: 228 LSALSAEERREVVCNFVTATLRAVPGARKNSMNAATLPSHVLAVVREKGHPVQLVNAFEK 287 Query: 297 AVKAKDGFLQPSIQAFNQYWDRVANGYGLNG 327 V + G ++ S+ + + + +GL Sbjct: 288 PVWTRGGLMEESVSQLEREYTHLKETWGLEA 318 >UniRef50_C5SD49 CRISPR-associated protein, Cse4 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD49_CHRVI Length = 393 Score = 346 bits (888), Expect = 8e-94, Method: Composition-based stats. Identities = 174/395 (44%), Positives = 233/395 (58%), Gaps = 40/395 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 NF+N HVLISHSPSCLNRDDMNMQK AIFGGK RVRISSQSLKRA+R S YYA+ S Sbjct: 5 NFVNFHVLISHSPSCLNRDDMNMQKTAIFGGKTRVRISSQSLKRAIRYSDYYARYFISKS 64 Query: 63 LRTIHLA-QLRDVLRQKLGERFDQKIIDK-----TLALLSGKSVDEAEKISAD------- 109 RT L ++ D L I+K +DE K D Sbjct: 65 QRTRRLFDKMADELSASAESAEQTTAIEKCALYAAAIFEGKTKIDEIGKYERDKKSDHIE 124 Query: 110 -AVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ--GVDIALSGRM 166 + P+ EI + + +A +K ++ +K +I + + +D+ALSGRM Sbjct: 125 TQIIPFSCAEIEGIKQILLEAA--GKPEKGRIEYMKAEIQRLEREQRTRIDLDVALSGRM 182 Query: 167 ATSGMMTELGKVDGAMSIAHAITTHQVDSDI-DWFTAVDDLQ----EQGSAHLGTQEFSS 221 A S ++ VDGA+++AHAITTH V+ DWFTAVDDL E G+ HL TQ+FS+ Sbjct: 183 ANSELIYP---VDGALAVAHAITTHTVEPQDIDWFTAVDDLTLDAGETGAGHLNTQQFSA 239 Query: 222 GVFYRYANINLAQLQENLG----------GASREQALEIATHVVHMLATEVPGAKQRTYA 271 GVFYRYA++NL QLQ NLG SR +AL+IA HV+H+LAT VP AKQ+++A Sbjct: 240 GVFYRYASLNLRQLQFNLGLLANINAEQTTESRARALDIARHVLHLLATVVPSAKQQSFA 299 Query: 272 AFNPADMVMVNFSDMPLSMANAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGA 328 A N AD V+V+ +D P+S+ANAFE+ ++ + GFLQPSI A YW RV + YGL+ Sbjct: 300 AHNLADFVIVSLADQPVSLANAFEEPIERERKIGGFLQPSITALADYWSRVNSAYGLDEQ 359 Query: 329 AAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 A F+L + Q + + ++ L+ W+ N+G A Sbjct: 360 ARAFALRGGIKLGDQ-EVLTSIADLEQWLANDGRA 393 >UniRef50_B4UE70 CRISPR-associated protein, Cse4 family n=2 Tax=Anaeromyxobacter RepID=B4UE70_ANASK Length = 413 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 107/415 (25%), Positives = 177/415 (42%), Gaps = 55/415 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 M+ F+ IH L S+ S LNRDD K FGG R R+SSQ LKR R G Sbjct: 1 MNRFVQIHTLTSYPASLLNRDDAGFAKRIPFGGVTRTRVSSQCLKRHWRTFEGEGALSGL 60 Query: 60 --ESSLRTIHLAQLR---DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKI-------- 106 S+R+ + ++ + + +++ ++ + GKS A+ Sbjct: 61 GQPMSVRSRYTFDELVVQPLVGEGVPAELAREVTRALMSEVLGKSAKAAKADARADEKEE 120 Query: 107 ------------SADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAA----- 149 +T E+A+ E D K+ K + + + A Sbjct: 121 EEDKDAKTESTLQTGQITVLGRPEVAYLLELARTVCRKKPDPAKIAKAVSDHLGADGRKN 180 Query: 150 -IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-- 206 + L G+D A+ GRM TS + L + D A+ +AHA T H ++ D+F+AVDDL Sbjct: 181 LRELRLGAGLDAAMFGRMVTSDI---LARGDAALHVAHAFTVHGEATETDYFSAVDDLPM 237 Query: 207 ----QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 QGS H+G E +SG+FY Y I++ L NL G A R+ A ++A + Sbjct: 238 ARTEDGQGSGHIGNAELTSGLFYGYVVIDVPLLVSNLEGVDRKAWEKADRKLAAQLAERM 297 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAV---KAKDGFLQPSIQ 310 V ++AT PGAK + A A +V+ + P ++ANAF + V + + + + Sbjct: 298 VKLVATVSPGAKLGSTAPHAYAHLVLAESGNAQPRTLANAFLEPVVTGPRQPDPVAAAYR 357 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDPITA--QVKQMPTLEQLKSWVRNNGEA 363 A ++ + YG ++ D + + +L ++ +WV + Sbjct: 358 ALARHSADLDRMYGPAFQRRLAAIGPADGLADVLRAPANASLAEVATWVADQVRG 412 >UniRef50_Q1EQS8 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS8_STRKN Length = 393 Score = 343 bits (880), Expect = 6e-93, Method: Composition-based stats. Identities = 112/384 (29%), Positives = 166/384 (43%), Gaps = 33/384 (8%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 FI++H++ S + LNRDD N K +G R R+SSQS KRA R+ IG+++ Sbjct: 6 RFIDVHIVQSVPFANLNRDDTNSVKTVQYGNTLRTRVSSQSWKRATREVFQER--IGQAA 63 Query: 63 LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPW-------- 114 LRT + + + G A + E K AD Sbjct: 64 LRTRRIGERVTQELEGRGWPPALAQRAGGHAAAASSIKFELAKDPADNKQFLPNTVLTNA 123 Query: 115 -------VVGEIAWFCEQVAKAEADNLDDKKLLKV--LKEDIAAIRVNLQQGVDIALSGR 165 V E+A EQ + D KK L +D + + GV I L GR Sbjct: 124 MVYVPEAAVAELADLAEQHRQELESAKDIKKPADKSVLPKDAVEAVLRSRNGV-INLFGR 182 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL-----QEQGSAHLGTQEFS 220 M + VDGA+ +AHA+TTH+ D ++D+F+AVDD+ GS H+G EFS Sbjct: 183 MLAE---VDDAGVDGAVQVAHAMTTHETDVELDYFSAVDDITAAWKDSTGSGHMGHTEFS 239 Query: 221 SGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVM 280 +G FYRYA ++L L N+GG R A E+ + +P AK+ + A D+V Sbjct: 240 AGTFYRYATVDLRDLATNIGGEVRA-ARELIAAFLASYIESLPQAKKNSTAPHTIPDLVH 298 Query: 281 VNFS-DMPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ-FSLSD 336 ++ D PLS A AFEK V+ A GF + S Y G ++ + Sbjct: 299 ISVRSDRPLSYAAAFEKPVRAGAPGGFGEVSRAELATYAQAANTLLGTGRIVTSGWASLE 358 Query: 337 VDPITAQVKQMPTLEQLKSWVRNN 360 +T + + + L + + Sbjct: 359 TKDLTGLGTRHESFDDLITAALDA 382 >UniRef50_Q67RP1 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP1_SYMTH Length = 379 Score = 343 bits (880), Expect = 7e-93, Method: Composition-based stats. Identities = 103/380 (27%), Positives = 164/380 (43%), Gaps = 27/380 (7%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQ--NIGES 61 F+ +H+L + + S LNRDD K +FGG RR RISSQ LKRA+R Sbjct: 2 FVEMHLLQNFALSNLNRDDTGAPKSCVFGGTRRARISSQCLKRAVRTYVREQALVPSELL 61 Query: 62 SLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAW 121 S RT L + G + + + A + + + EIA Sbjct: 62 SYRTKWLQRELANRLAAGG--VEAEQAGQVAARALELLEFRLKNGRTEYLLMVGEREIAR 119 Query: 122 FCEQVAK--AEADNLDDKKLLKVLKEDIAAI---RVNLQQGVDIALSGRMATSGMMTELG 176 + + A D + K +++A + ++ VDIAL GRM Sbjct: 120 IADLCREHAAALQGGDGGRKSKKEGDNLAGLFLKALDGGDAVDIALFGRMIA---THPEK 176 Query: 177 KVDGAMSIAHAITTHQVDSDIDWFTAV------DDLQEQGSAHLGTQEFSSGVFYRYANI 230 VD A+ +AHA +T+ + ++ D+++AV DD + G+ LGT ++S +YRYAN+ Sbjct: 177 NVDAAVQMAHAFSTNAIANEFDFYSAVDDLQQQDDDEGAGAGMLGTVLYNSSCYYRYANV 236 Query: 231 NLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP-LS 289 +L QL NLGG ++AL + + VP K+ A NP ++M + S Sbjct: 237 DLRQLLTNLGG-DPDRALTAVRAFLLGMVHAVPTGKRTNSAPQNPPALIMAVVREHGLWS 295 Query: 290 MANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGA--AAQFSLSDVDPITA---- 342 +ANAF V A+ ++ S + +W++++ YG G A + D I A Sbjct: 296 LANAFVVPVSGARGNLMELSAKEMLAHWNQLSELYGQEGVHYAGLATYLSSDAIGASNAV 355 Query: 343 QVKQMPTLEQLKSWVRNNGE 362 + L L V + Sbjct: 356 GIAVEKRLADLVDRVLAEVQ 375 >UniRef50_C7RP61 CRISPR-associated protein, Cse4 family n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RP61_9PROT Length = 400 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 98/398 (24%), Positives = 170/398 (42%), Gaps = 42/398 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR-KSGYYAQNIGE 60 FI IH L ++ + LNRDD + K G R RISSQ LKR R +A + + Sbjct: 4 PRFIQIHTLHTYPAALLNRDDAGLAKRLPLGNAVRTRISSQCLKRHWRVVEDRFALSCLD 63 Query: 61 --SSLRTIHLAQLRDV------LRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVT 112 ++R+ +L + + + + + + D L GK + + Sbjct: 64 VPMAIRSRGTLELISKRIQESGVSETMAQAAAEAMRDAGLLDKGGKEKKGDDALKTGQAV 123 Query: 113 PWVVGEIAWFCEQVAKAEADNLDDKKLLKVL-------KEDIAAIRVNLQQGVDIALSGR 165 EI + + +D +++K +++ E + G++ AL GR Sbjct: 124 LLGKPEIDYLVRRCVDLASDGVEEKGFKELITLWLKGKDEKRNIEALKHGSGLESALFGR 183 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSS 221 M TS ++T + A+ +AHA T HQ + D+FT VDDL E GSA + E +S Sbjct: 184 MVTSDVLTS---REAAVYVAHAFTVHQAQVENDYFTVVDDLLQDAGELGSAGIFDTELAS 240 Query: 222 GVFYRYANINLAQLQENLGGAS-------------REQALEIATHVVHMLATEVPGAKQR 268 G++Y Y +++ QL +NL G R A ++ H++H++AT PGAK+ Sbjct: 241 GLYYGYVVVDVPQLVQNLEGEDFNECFASGTPADRRVLAGQVVQHLLHLIATVSPGAKRG 300 Query: 269 TYAAFNPADMVMVNFSD-MPLSMANAFEKAVK---AKDGFLQPSIQAFNQYWDRVANGYG 324 + A F+ A ++V D P S+A AF A+ + + ++ + + + YG Sbjct: 301 STAPFDWAKFMLVEAGDWQPRSLAGAFHDALPLSGSGGTIRERTVDRLTKEIAAMDDAYG 360 Query: 325 LNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGE 362 + ++ + + L L W + E Sbjct: 361 APLSRRFLAI--DQEVQVPGAERLNLASLADWAKEIIE 396 >UniRef50_Q2RXJ6 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RXJ6_RHORT Length = 381 Score = 341 bits (875), Expect = 2e-92, Method: Composition-based stats. Identities = 102/382 (26%), Positives = 172/382 (45%), Gaps = 30/382 (7%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGES 61 S F+ IH L S++ + LNRDD + K +GG R RISSQ LKR R + + + Sbjct: 4 SRFLQIHSLHSYTAALLNRDDSGLAKRLTYGGSNRTRISSQCLKRHWRMAEHDPHALQTL 63 Query: 62 S-----LRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVV 116 R+ L D++ + L R+ Q I+D + + ++ Sbjct: 64 GGYVGSFRSRELV--TDLVIKPLEGRYPQDILDVLEPEFQKLVYGDKADKGKKSRQTLLL 121 Query: 117 --GEIAWFCEQVAKAEADNLDDKKLLKVLKE-------DIAAIRVNLQQGVDIALSGRMA 167 E+AW + + A D K L K + + + L G+ AL GRM Sbjct: 122 GQPELAWLARRAEELAAGANDAKALQKAVADWRKDANFKAMSENAALPGGLVAALFGRMV 181 Query: 168 TSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGV 223 TS +D + +AHA T H +++ D+FTAVDDL+ + G+ + E +SG+ Sbjct: 182 TSD---PAANIDAPVHVAHAFTVHAEEAEGDYFTAVDDLKKDESDSGADTIQETELTSGL 238 Query: 224 FYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF 283 FY Y I+L L N GG +E A ++ ++V+++A PGAK + A + AD++++ Sbjct: 239 FYGYVVIDLPGLIGNCGG-DKEIAAQVVNNLVYLIAEVSPGAKLGSTAPYGRADLMLIEA 297 Query: 284 SD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITA 342 D P S+A A+ KA+ + ++ A + ++ Y A SL++ Sbjct: 298 GDRQPRSLATAYRKAIAPD---REQAVAALDGCLAKLDATYETGEARRYLSLAETPLTGP 354 Query: 343 --QVKQMPTLEQLKSWVRNNGE 362 + +L+ L W + + Sbjct: 355 ATSGLEKLSLKALADWTASRVK 376 >UniRef50_Q0BRF9 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF9_GRABC Length = 386 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. Identities = 104/389 (26%), Positives = 161/389 (41%), Gaps = 39/389 (10%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMR----KSGYYAQN 57 F+ IH L S++ S LNRDD + K +G R RISSQ LKR R + Sbjct: 4 PRFLQIHSLHSYTASLLNRDDSGLAKRLPYGSAVRTRISSQCLKRHWRMDEGTFSLHRIE 63 Query: 58 IGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--WV 115 E ++R+ L LR+ L D I++ + + P + Sbjct: 64 GAEEAVRSRDLV--TKRLREPLQGTVDVNILNAIEPAFQAAVYGKKGADDKSSRQPLLFG 121 Query: 116 VGEIAWFCEQVAKAEADNLDDKKLLKVLKE-----------DIAAIRVNLQQGVDIALSG 164 E+ + EQ + D K ++ V+L G+ AL G Sbjct: 122 APELRYLAEQFTRIATSATDPKSAKAAAEDFTKDKLFQNTMKAMRDSVSLPGGLTSALFG 181 Query: 165 RMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL---QEQGSAHLGTQEFSS 221 RM TS +D + +AHA TTH ++ D+F VDDL ++ G+ H+G+ E +S Sbjct: 182 RMVTSD---PEANIDAPVHVAHAFTTHAEQTESDYFAVVDDLAGVEDTGADHIGSTELTS 238 Query: 222 GVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 G+FY Y I++ L NL G A R+ A E+ ++ +AT PGAK + A + Sbjct: 239 GLFYGYVVIDVPTLVSNLTGVAASNWLAADRKMAAEVTACLIGQIATVSPGAKLGSTAPY 298 Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQF 332 A ++V D P S+A AF + ++ + +Q Y Sbjct: 299 GYATTMLVEAGDRQPRSLAEAFRDPAEPT---VKDAEDKLHQKLKAFDEAYQTGEDRRLL 355 Query: 333 SLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 SLS+ I + +L +L WVR+ Sbjct: 356 SLSNDPGIKNVSRT--SLPELMQWVRDTI 382 >UniRef50_D1Y487 CRISPR-associated protein, Cse4 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y487_9BACT Length = 408 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 101/408 (24%), Positives = 172/408 (42%), Gaps = 55/408 (13%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 + FI I L ++ S LNRDD + K FGG R R+SSQ LKR R + + QN+ Sbjct: 4 LPRFIQISTLTTYPASLLNRDDSGLSKRIPFGGVSRTRVSSQCLKRHWRMADGLWSLQNV 63 Query: 59 GE---SSLRTIHLA--QLRDVLRQKLGERFDQ--KIIDKTLALLSGKSVDEAE------- 104 + +S+R+ + ++ L +K G ++ + L G EA Sbjct: 64 DKDIATSIRSRRIFPEKIEKPLIEKEGLDAEKVVAASQALQSELYGAKGTEAAAKNKKTA 123 Query: 105 ---------------KISADAVTPWVVGEIAWFCEQVAKAEAD-------NLDDKKLLKV 142 + EI + + V + + + K Sbjct: 124 KDDADALNPSIDAQLSAERSELVVLGHPEIQFLSKIVREMASADGSAADVGKKTGEWFKK 183 Query: 143 LKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTA 202 K+D A++ G+D A+ GR + +V A+ +AHA T H +S+ D+FTA Sbjct: 184 HKKDFQALKCGA--GLDAAMFGRFISGDT---DARVSAAVHVAHAFTVHAEESETDYFTA 238 Query: 203 VDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHV 254 VDDL GSAH+ E +SG+FY Y +++ QL N+ G A R+ A + H+ Sbjct: 239 VDDLNNSGSAHINAAELTSGIFYNYVVVDVPQLVSNIEGCPSKQWQTAQRDVAGRLVKHL 298 Query: 255 VHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPSIQAFN 313 +H++AT PGAK + A + VM + P ++A+AF V + +++ Sbjct: 299 LHLIATVTPGAKLGSTAPYARPWFVMAEAGESQPHTLADAFYLPVPLRGDMRAQALRQLE 358 Query: 314 QYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 Y + YG + S+ D ++ + +L+++ + Sbjct: 359 DYVGKSDEMYGSDERRWIASMYD---VSIPRGENSSLDRMGESLERAV 403 >UniRef50_Q0AA32 CRISPR-associated protein, Cse4 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA32_ALHEH Length = 385 Score = 336 bits (862), Expect = 8e-91, Method: Composition-based stats. Identities = 97/385 (25%), Positives = 173/385 (44%), Gaps = 36/385 (9%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R++ ++ + Sbjct: 2 FLQIHTLTSYHAALLNRDDAGLAKRIPFGSAERMRVSSQCLKRHWRQALKDVISLPS-GI 60 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV-----TPWVVGE 118 RT H + +V R+ + E + + + L + E D++ + E Sbjct: 61 RTRHFFER-EVCRRVIAEGVEDEKARELTGKLIDAVMHSKEAREKDSLFLKQPVLFGRPE 119 Query: 119 IAWFCEQVAKAEADNLDD----KKLLKVLKEDIAAIR-----VNLQQGVDIALSGRMATS 169 +F + + D K +K K++ A+ +L+ G++ AL GR TS Sbjct: 120 ADYFVSLITECARSGEDPGSTLKDRVKAEKKNFRALLQAAGGSDLESGIEGALFGRFVTS 179 Query: 170 GMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFY 225 + L + D ++ +AHA T H +++++D+FT VDDL ++ G+AH G E +G+FY Sbjct: 180 DI---LARTDASVHVAHAFTVHSLNNEVDYFTVVDDLKEPGEDAGAAHAGDMELGAGLFY 236 Query: 226 RYANINLAQLQENLGGASR----------EQALEIATHVVHMLATEVPGAKQRTYAAFNP 275 Y +++ L NL G R A ++ +VH +AT PGAK A + Sbjct: 237 GYVVVDVPLLVSNLSGCERQAWREQTEACADARDVLAALVHSIATVSPGAKLGATAPYAR 296 Query: 276 ADMVMVNFS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 D ++ P ++ANA+ + + A+ +Q S+ Y + + +G + + Sbjct: 297 TDCALLETGTTQPRALANAYLEPLPARGDLMQQSVNTMGHYLKSLDDMFGEETSRFVSAT 356 Query: 335 SDVD--PITAQVKQMPTLEQLKSWV 357 D P + T++ + Sbjct: 357 RDTTSLPCAHRGPLSETIDGALDSI 381 >UniRef50_C9M9R6 CRISPR-associated protein, Cse4 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R6_9BACT Length = 400 Score = 336 bits (862), Expect = 8e-91, Method: Composition-based stats. Identities = 100/401 (24%), Positives = 178/401 (44%), Gaps = 45/401 (11%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGY-----YAQ 56 F+ I L ++S S LNRDD + K FG R RISSQ LKR R +G A Sbjct: 5 PRFVQISTLTTYSASLLNRDDSGLAKRIPFGDSVRTRISSQCLKRHWRNAGGPYGLDKAG 64 Query: 57 NIGESSLRTI-HLAQLR--DVLRQKLGERFDQKIIDKTLALLSGKSV------------- 100 + S+R+ +L ++ + L ++ K LL Sbjct: 65 DALSLSVRSRFSFPELIEKPLVAEGLEQKLVVSGSQKLQQLLYNGEEKGDTKKDKKKKIE 124 Query: 101 --DEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQ-- 156 ++ + + E+ + + + A + + + K++ +K+ + NL Sbjct: 125 LDEDGYSAKRNELVVLGRPELEYLKQIIRDAISSSSNIKEIDNAVKDFYTKRKSNLLALR 184 Query: 157 ---GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAH 213 GVD A+ GR + + KV A+ +AH+ T H S+ D+FTAVDDL EQG+ H Sbjct: 185 AGCGVDAAMFGRFVSGDV---DAKVTAAVHVAHSFTIHGEQSETDYFTAVDDLVEQGTGH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGG--------ASREQALEIATHVVHMLATEVPGA 265 + E ++G++Y Y +++ QL NL G A R A ++ ++++H++AT PGA Sbjct: 242 INAAELNTGIYYGYVVVDVPQLISNLCGCDSKNSADADRTLAAQVTSNLIHLMATVTPGA 301 Query: 266 KQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVK--AKDGFLQPSIQAFNQYWDRVANG 322 K A + + +V+ +SD P ++A+AF + +K + ++Q +Y + Sbjct: 302 KLSGTAPYAASWLVLAEWSDSQPRTLADAFFEGLKLGSDGSARSLAVQMLAEYIRKYDAM 361 Query: 323 YGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 Y + ++P + +L++L V+ E Sbjct: 362 YTPQLTRR---CASIEPCQIPGAENGSLDELCEAVKLAIEG 399 >UniRef50_Q1J368 CRISPR-associated protein, CT1975 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J368_DEIGD Length = 385 Score = 335 bits (860), Expect = 2e-90, Method: Composition-based stats. Identities = 116/390 (29%), Positives = 169/390 (43%), Gaps = 35/390 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSG--YYAQNI 58 M + +H L + +PS LNRDD KDA FGG RR+RISSQ+ KRAMR+ Sbjct: 1 MKALLELHYLQNFAPSNLNRDDTGSPKDAFFGGTRRLRISSQAFKRAMRQDFGGRELLRP 60 Query: 59 GESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 E +RT + L G +Q LAL + K + E Sbjct: 61 EEIGVRTKRAHEAIAELLAGEGRTEEQCRAAAELALGGLGLPVKDGKNQ--YLLFLGRDE 118 Query: 119 IAWFCEQV----AKAEADNLDDKKLLKVLK------------EDIAAIRVNLQQGVDIAL 162 + + + A+ +A + + K A ++ + VD+AL Sbjct: 119 LRRVADIIGANWAEFQAAAPEPESTDGKKKKASKKAALSGDLGKQLAGALDGSKAVDVAL 178 Query: 163 SGRMATSGMMTELGKVDGAMSIAHAITTHQV-DSDIDWFTAVDDLQ---EQGSAHLGTQE 218 GRM D A +AHAI+TH + + D++TAVDDL+ G+ LGT E Sbjct: 179 FGRMLAD---LPDKNADAAAQVAHAISTHALRERQYDFYTAVDDLKPDDNAGADMLGTVE 235 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 F+S YRYA I+L +L ENL G RE ++ P KQ T+AA N + Sbjct: 236 FASATVYRYACIDLGKLLENLQG-DRELLERGLRAFLYASVYAAPTGKQNTFAAHNLPGL 294 Query: 279 V--MVNFSDMPLSMANAFEKAVKAKD--GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSL 334 + +V + P ++ANAFEK V+A+ G+L PS+ A +G G A + Sbjct: 295 MVQVVRRNASPRNLANAFEKGVRAEGGQGYLAPSVAALADEMRWQNGVFGDAGTARFVAR 354 Query: 335 SDVDPITAQVKQMPTLEQLKSW-VRNNGEA 363 D + + MP + L V + A Sbjct: 355 EGGDAVF--GEAMPNVAALIDATVADALSA 382 >UniRef50_B8FDH9 CRISPR-associated protein, Cse4 family n=2 Tax=Bacteria RepID=B8FDH9_DESAA Length = 383 Score = 334 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 106/385 (27%), Positives = 170/385 (44%), Gaps = 44/385 (11%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 + H+L S +CLNRDD+ K A+ GG R R+SSQ KR +R + R Sbjct: 11 VEFHILQSFPVTCLNRDDVGAPKTAVVGGATRARVSSQCWKRNIRLTMKDLGVP--IGSR 68 Query: 65 TIHLAQLRDVLRQKLGERFDQK-----------IIDKTLALLSGKSVDEAEKISADAVTP 113 T + Q+ + +LG DQ I +K G + +DA+ Sbjct: 69 TKLIHQMIEDACAELGADTDQAQACAAQVASVFIKEKKGKKDDGDDSEGNGSDKSDALIF 128 Query: 114 WVVGEIAWFCEQVAKA------EADNLDDKKLLKVLKEDIAAIRVNL-------QQGVDI 160 E+ + + + + ++ K KV K + NL + GVDI Sbjct: 129 LSREEVKKIALALRENNFSTEFQEEKVNKKGDAKVEKIKLEKKIQNLLGKPDFSRDGVDI 188 Query: 161 ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAHLGTQEF 219 AL GRM V+GA S +HAI+TH+V +++++FTA+DDLQ E GSAH+G EF Sbjct: 189 ALFGRMVAQAAA---LNVEGAASFSHAISTHKVTNEVEFFTALDDLQTEPGSAHMGALEF 245 Query: 220 SSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMV 279 +S +YRY +++ QL +NL G + V L +P A+Q T + + Sbjct: 246 NSATYYRYVCLDMGQLWKNLAGQH---LPQALEGFVKALYLALPSARQATQSGACWWEFA 302 Query: 280 MVNFSDMPLSMANAFEKAVKAK-DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSDV 337 V + F+ AVK + G L+PS A Y ++ G L A+F+ + Sbjct: 303 KVFVR-KGQRLQAPFDTAVKPRNGGLLEPSKDALCAYLEKKEQQAGSLFRKIAEFTFGED 361 Query: 338 DPITAQVKQMPTLEQLKSWVRNNGE 362 + P+++ L +++ + Sbjct: 362 NG--------PSIDDLVLSIQDAIQ 378 >UniRef50_A5GBK1 CRISPR-associated protein, Cse4 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBK1_GEOUR Length = 408 Score = 332 bits (853), Expect = 9e-90, Method: Composition-based stats. Identities = 105/409 (25%), Positives = 172/409 (42%), Gaps = 53/409 (12%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG--- 59 + +H++ S +CLNRDD+N K A+FGG +R R+SSQS KRA+R+ Sbjct: 2 KHLELHIIQSVPVACLNRDDLNSPKTAVFGGVQRARVSSQSWKRAIREMAKEIAAEEKSD 61 Query: 60 -ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALL---SGKSVDEAEKISADAVTPW 114 S RT + L L +K I + +A + VD V + Sbjct: 62 LFSGDRTRRMVYTLSTRLAEKGITSQAAIAIAEQVADVVETLDSKVDSEGYKKIKTVMFF 121 Query: 115 VVGEIAWFCEQVA------------KAEADNLDDKKLLKVLKEDIAAIRV---------- 152 E E +A + A +D++ K LK + + Sbjct: 122 SKAEYDAIAEAIATSDEVKNSVEALEKAAVEGNDREREKALKAMVKILEKGAISKTIKSA 181 Query: 153 NLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ--EQG 210 L+ DIAL GRM + KVDGA AH ++TH+ D++ID+F AVDDL E G Sbjct: 182 QLKDAADIALFGRMVAND---PSLKVDGASMFAHILSTHKADNEIDFFAAVDDLNKDESG 238 Query: 211 SAHLGTQEFSSGVFYRYANINLAQLQ--ENLGG---------ASREQALEIATHVVHMLA 259 + T EF+S +YR+A +NL L ++LG S E ++ + + Sbjct: 239 AGMTSTLEFNSATYYRFAALNLDALANDDHLGDITLKDGTVVRSVETRKQVVKTFLKAII 298 Query: 260 TEVPGAKQRTYAAFNPADMVMVNFSD--MPLSMANAFEKAV-KAKDGFLQPSIQAFNQYW 316 +P A++ T V+ + P+ + NAFE V +++ GF+ SI N + Sbjct: 299 QSIPSARKTTMNGNTLPVYVLGVVREKGHPIQLINAFETPVRRSEKGFVTESINRMNIEY 358 Query: 317 DRVANGYGLNGAAAQF----SLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 + +G++ A+ SL + + + + L + + + Sbjct: 359 ADLKETWGVDSLFAKAVVKGSLKEQIKANQGSIETCSQDDLINGMVAHV 407 >UniRef50_B4S8P9 CRISPR-associated protein, Cse4 family n=9 Tax=Bacteria RepID=B4S8P9_PROA2 Length = 347 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 106/361 (29%), Positives = 169/361 (46%), Gaps = 33/361 (9%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-ESSL 63 I H+L S +CLNRDD+ K AI GG R R+SSQ KR +R S Q+ G + + Sbjct: 12 IEYHILQSFPVTCLNRDDVGAPKTAIVGGSTRARVSSQCWKRQVRLS---MQDFGIKLGI 68 Query: 64 RTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFC 123 R+ +++ QK + A GK + + S D + + E F Sbjct: 69 RSKKVSEFVAK-------ACLQKGASEEQAAECGKVISD--SFSKDTLFFFSESEAQAFA 119 Query: 124 EQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMS 183 + + D+ + K +++ G+DIAL GRM ++ A S Sbjct: 120 DYAREKNFDSKNLND--KEIRKVAKKALNPAIDGLDIALFGRMVAQAT---DLNIEAAAS 174 Query: 184 IAHAITTHQVDSDIDWFTAVDDL-QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGA 242 +HAI+TH+V +++++FTA+DDL +E GSAH+G+ EF+S +YRY +++L QL E++GG Sbjct: 175 FSHAISTHKVSNEVEFFTALDDLAEEPGSAHMGSLEFNSATYYRYISLDLGQLWESIGG- 233 Query: 243 SREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKA-K 301 E E + L VP A+Q T + +P + + + FE AVKA Sbjct: 234 --EHLAEAVESLTKALFVAVPSARQTTQSGASPWEFAKIFIR-KGQRLQVPFETAVKAKD 290 Query: 302 DGFLQPSIQAFNQYWDRVANGYG-LNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNN 360 G+LQPSI A Y + G L G +F+ + +++ L ++ Sbjct: 291 GGYLQPSITALTDYLTKKEALAGSLFGKEKEFTFGED--------VNFSIDDLIKGLKLT 342 Query: 361 G 361 Sbjct: 343 V 343 >UniRef50_Q2RY18 CRISPR-associated protein, Cse4 family n=2 Tax=Alphaproteobacteria RepID=Q2RY18_RHORT Length = 359 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 105/373 (28%), Positives = 161/373 (43%), Gaps = 32/373 (8%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 MS F+ +HVL +++ S LNRDD K FGG R+R+SSQSLKRA R+S + + G Sbjct: 1 MSRFLQLHVLTAYAASNLNRDDTGRPKTLNFGGAERLRVSSQSLKRAFRQSELFQSRLPG 60 Query: 60 ESSLRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGE 118 E R+ A+ L L + E + + L + K + + E Sbjct: 61 ELGTRSQDFAKALVSALVARGVEEAEAITRAEALIDHDKLGKVKKGKAQTEQLVHLGPDE 120 Query: 119 IAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKV 178 +A + L + + + + VDIA+ GRM V Sbjct: 121 LAAIDALAERLATSAT--------LDDKAMLVLKSKPRAVDIAMFGRMLAGN---PGFNV 169 Query: 179 DGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ------GSAHLGTQEFSSGVFYRYANINL 232 + A+ +AHA TTH+ + D++T VDD++ G+ LG E+ SG+FY Y IN Sbjct: 170 EAAVQVAHAFTTHRATPEDDYYTTVDDIKNADQEEDRGAGFLGILEYGSGLFYLYICINA 229 Query: 233 AQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFS-DMPLSMA 291 L +NL G + A E A ++ T P KQ T+A+ ++ + P S+A Sbjct: 230 DLLVDNLAG-DQALAAEAAALLIEAACTISPTGKQNTFASRARGLYALLEIGEETPRSLA 288 Query: 292 NAFEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMP 348 AF+ AV ++ L SIQ + + YG N +L DP T Sbjct: 289 AAFQYAVGSRATEADHLAASIQRLTALREGFSKAYGENL--RSVALDVTDPATPG----- 341 Query: 349 TLEQLKSWVRNNG 361 L+ L + R+ Sbjct: 342 -LKALIAAARDAV 353 >UniRef50_D0MET5 CRISPR-associated protein, Cse4 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET5_RHOM4 Length = 423 Score = 332 bits (851), Expect = 2e-89, Method: Composition-based stats. Identities = 115/422 (27%), Positives = 187/422 (44%), Gaps = 63/422 (14%) Query: 2 SNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG-- 59 S F+ IH L ++ + LNRDD K FGG R R+SSQ LK R G Sbjct: 3 SAFVQIHTLTAYPAALLNRDDAGFAKRLPFGGAIRTRVSSQCLKYHWRNFSGEHALYGLD 62 Query: 60 -ESSLRTIHLAQLR---DVLRQKLGER--------------FDQKIIDKTLALLSGKSVD 101 SLR+ + ++ + R D+ + L VD Sbjct: 63 VPRSLRSRETFKRCIARPLVEEGYPLRLVVAFALHLQKLIVSDESLSKTDFKKLMSDEVD 122 Query: 102 EA---EKISADAVTPWVVGEIAWFCEQVAKA----------EADNLDDKKLLKVLKEDIA 148 +A +++ ++ V E+ + ++ + A L D++L +V +E A Sbjct: 123 DATLLDQLKSNQVIILGRPEVDYLTRRIRERLDALREVWADAAAPLSDEQLERVYQELQA 182 Query: 149 AIRVNLQQ---------GVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDW 199 + L++ G+D AL GRMATS + L + D A+ +AHA TTH +S+ D+ Sbjct: 183 IGKGELKKNLKGLYLAAGLDAALFGRMATSDV---LARGDAAIHVAHAFTTHAEESESDY 239 Query: 200 FTAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGG--------ASRE 245 FTAVD+L E GS HL QE +SG+FY Y +++ L NL G A R Sbjct: 240 FTAVDELVAQEGEGELGSGHLNNQELTSGLFYGYVVVDVPLLVSNLEGVPPAAWQEADRT 299 Query: 246 QALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNF-SDMPLSMANAFEKAVKAKD-G 303 A E+ ++H++AT PGAK + A A ++V + P ++ANAF + V G Sbjct: 300 LAAEVVRRLLHLIATVSPGAKLGSTAPHAYAQFMLVEWGRSQPRTLANAFHRPVSLDGEG 359 Query: 304 FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVK--QMPTLEQLKSWVRNNG 361 L S +A +Y +++ YG ++ + + Q++ + + ++ WV Sbjct: 360 VLVNSYRALGRYVEQMDRMYGKLTERRLAAIDLPEAVQRQLQVDTLNAVPEIADWVAEKI 419 Query: 362 EA 363 + Sbjct: 420 QG 421 >UniRef50_C6HV95 CRISPR-associated protein, Cas4 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV95_9BACT Length = 393 Score = 330 bits (846), Expect = 6e-89, Method: Composition-based stats. Identities = 109/388 (28%), Positives = 173/388 (44%), Gaps = 50/388 (12%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 H+L S +CLNRDD+ K A+ GG +R R+SSQS KRA+R + + +R Sbjct: 13 FEFHILQSFPVTCLNRDDVGSPKTAMIGGSQRARVSSQSWKRAVRLAMHDLGV--THGVR 70 Query: 65 TIHLAQLRDVLRQKLGERFDQKII--DKTLALLSG---------------------KSVD 101 T ++ L + LG +Q DK A+ + Sbjct: 71 TKLISPLIAEACRSLGATPEQARACGDKVEAVFIKKDEKGKKKSAKTKGDSDTQDEEVGS 130 Query: 102 EAEKISADAVTPWVVGEIAWFCEQVAKAEAD------NLDDKKLLKVLKEDIAAIRVNLQ 155 ++ D + EI+ + K E D D KK K + + I + Sbjct: 131 DSSSEKTDTLLFLSPKEISVLANEFKKQEFDPGKVIVQSDPKKQAKEIADMIGKVP-EGI 189 Query: 156 QGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-EQGSAHL 214 VDIAL GRM V+ A S AHAI+TH+V +++++FTA+DD + G+AH+ Sbjct: 190 DAVDIALFGRMVAQAA---ELNVEAAASFAHAISTHKVANEVEFFTALDDCAVDPGAAHM 246 Query: 215 GTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFN 274 G+ EF+S +YRY +++L QL + L G + V L VP A+Q T + + Sbjct: 247 GSLEFNSATYYRYVSLDLGQLSQTLAGQHIPET---IEAFVKALFVSVPAARQSTQSGAS 303 Query: 275 PADMVMVNFSDMPLSMANAFEKAVKA-KDGFLQPSIQAFNQYWDRVANGYG-LNGAAAQF 332 P D + + FE A+K+ GFL+PSI+ Y +R +G L G A++ Sbjct: 304 PWDFAKILVR-TGHRIQIPFETAIKSKDGGFLKPSIEEMKAYLNRQEKLHGSLFGKKAEY 362 Query: 333 SLSDVDPITAQVKQMPTLEQLKSWVRNN 360 + + + T++ L S ++ Sbjct: 363 TYGED--------ENFTIDDLISALKQQ 382 >UniRef50_D2L2X7 CRISPR-associated protein, Cse4 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X7_9DELT Length = 385 Score = 329 bits (845), Expect = 8e-89, Method: Composition-based stats. Identities = 100/387 (25%), Positives = 164/387 (42%), Gaps = 43/387 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIG- 59 MS FI +H+L S+ + LNRDD+ K FG R+R+SSQSLKRA R S + +G Sbjct: 1 MSRFIQLHILTSYPAANLNRDDLGAPKSMRFGEANRLRVSSQSLKRAWRTSDVFKATLGA 60 Query: 60 -ESSLRTIHLAQLR-----------------------DVLRQKLGERFDQKIID-----K 90 +RT L + L++K + I K Sbjct: 61 DHLGVRTKELGRKVFCALTQGASLDAVWDAPDATGTLAALKEKTAAEIARTIAGVFGKIK 120 Query: 91 TLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDD-KKLLKVLKEDIAA 149 A + + K + + + ++A ++ +A A + + K + Sbjct: 121 KEADAKAEKDADPVKKRKELLDSLEIEQLAHVSQEERRAVAALTEACRDAGKAPDANALN 180 Query: 150 IRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ-- 207 + + + DIA+ GRM + V+ A+ +AHA+T H+ ++ D+FTAVDDL Sbjct: 181 LLRSDAKAADIAMFGRMLAASARF---NVEAAVQVAHAVTVHRAVAEDDFFTAVDDLNRD 237 Query: 208 EQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQ 267 + G+ H+G EF +GV+Y Y I+ A L ENLGG + T + T P KQ Sbjct: 238 DAGAGHMGVSEFGAGVYYLYLCIDRALLAENLGG-DEALVQKALTALTTAACTVAPTGKQ 296 Query: 268 RTYAAFNPADMVMVNFS-DMPLSMANAFEKAV-----KAKDGFLQPSIQAFNQYWDRVAN 321 +YA+ A + D P +++ AF K V + + +I + ++ Sbjct: 297 ASYASRAYACFALAEKGDDTPRNLSLAFLKPVGEREEERDGHLGKTAIAELLKTKAKMDK 356 Query: 322 GYGLNGAAAQFSLSDVDPITAQVKQMP 348 YG A F++ D A++ Sbjct: 357 VYGQTLADTSFNVFDGKGTLAELAAFV 383 >UniRef50_Q60AD1 CRISPR-associated protein, CT1975 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD1_METCA Length = 414 Score = 316 bits (810), Expect = 9e-85, Method: Composition-based stats. Identities = 106/413 (25%), Positives = 176/413 (42%), Gaps = 61/413 (14%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F+ IH L S+ + LNRDD + K FG R+R+SSQ LKR R+S + + L Sbjct: 2 FLQIHSLTSYHATLLNRDDAGLAKRIPFGDAVRLRVSSQCLKRHWRESLKQTIPLP-TGL 60 Query: 64 RTIHLAQLR--DVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISA------------- 108 RT H+ + L+Q+ E K + +L L + D+ K Sbjct: 61 RTRHVFEREIYPRLKQEGVEDSLAKQLTLSLMGLLLQKSDKTAKPEKAKKGKNGHEEQAE 120 Query: 109 -------------------DAVTPWVVGEIAWFCEQV-AKAEADNLDDKKLLKVLKEDIA 148 + E+ + + A AE + +K L LK D A Sbjct: 121 FDFEEGAGTEESSAGDLRVKQPILFGRPEVDYLISLLKACAEEGSGAEKALQAKLKGDKA 180 Query: 149 --------AIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWF 200 A +L G++ AL GR TS + L + D A+ +AH+ T H +D+++D+F Sbjct: 181 NFKAMLKAAGHGDLYAGLEGALFGRFVTSDV---LSRSDAAVHVAHSFTVHGLDTEVDYF 237 Query: 201 TAVDDL---QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGAS--------REQALE 249 T VDDL +E G+AH G E +G+FY Y +++ L NL G + Sbjct: 238 TVVDDLNREEETGAAHAGDMELGAGLFYGYVAVDIPLLVSNLTGCDTTRWAEQEPADVRK 297 Query: 250 IATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKAKDGFLQPS 308 + T ++ +AT PGAK A + ++ V++ P +++NA+ +A+ + LQ + Sbjct: 298 VLTGLIRAIATVSPGAKLGATAPYAFSEFVLLETGKQQPRALSNAYLQALPMRGDPLQAA 357 Query: 309 IQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNG 361 I A +Y + YG + S++ A + +L+ + Sbjct: 358 IDALAKYLRALDAMYGRTSDSR--SVASTRAFDADLAPTNSLDASIGAALDAI 408 >UniRef50_C7MQD5 CRISPR-associated protein, Cse4 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD5_SACVD Length = 368 Score = 316 bits (809), Expect = 1e-84, Method: Composition-based stats. Identities = 97/374 (25%), Positives = 161/374 (43%), Gaps = 24/374 (6%) Query: 4 FINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSL 63 F++IH L + S +NRD++ K +GG R+R+SSQ+ KRA+RK+ Q++ + + Sbjct: 2 FVDIHALHTLPYSNVNRDNLGAPKSCWYGGTERIRVSSQAWKRAIRKAV--EQDLEQPTE 59 Query: 64 RTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADA-VTPWVVGEIAW 121 RT +A L +L ++ D + + + G + + TP +A Sbjct: 60 RTRRIASLVAGILTERGWGAEDARRAGRAVIYAYGLEPAADDDDTDTLLWTPPAAEALAG 119 Query: 122 FCEQVAKAE------------ADNLDDKKLLKVLKEDIAAIRVNLQQGVD-IALSGRMAT 168 E+ A N K + +K ++ L + IAL GRM Sbjct: 120 VVEKHRDTVVTLPLPKGEGKKAKNPPAKDITDAVKPMAGEVKSILNRTTPTIALLGRMLA 179 Query: 169 SGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDD-LQEQGSAHLGTQEFSSGVFYRY 227 + G IAHA T H+ + D+FTAVDD G+ H+ T +F++G FYRY Sbjct: 180 D---RPDHTIYGLAEIAHAFTVHEAAPEFDYFTAVDDRAANTGAGHVNTAQFTTGTFYRY 236 Query: 228 ANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP 287 ++IN+ +L + +G A + T P KQ AA AD+ + + P Sbjct: 237 SSINITRLVDVVGEQD---ARAVLLAWARRFITVTPAGKQTATAARTAADLAHIVVRNAP 293 Query: 288 LSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQM 347 S A AFE + + G+L P+ +A Y R+A G ++ + + + Sbjct: 294 QSYAPAFETPIVSTGGYLDPAARALGDYATRLAAYLGDTPVEHGYATTLPTNVDGLGGRF 353 Query: 348 PTLEQLKSWVRNNG 361 TL+ L + Sbjct: 354 DTLDTLINATVGAV 367 >UniRef50_D1A6Q4 CRISPR-associated protein, Cse4 family n=2 Tax=Actinomycetales RepID=D1A6Q4_THECD Length = 399 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 115/389 (29%), Positives = 172/389 (44%), Gaps = 39/389 (10%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESS 62 FI H++ + + LNRDD N K +GGK R R+SSQ KRAMR E++ Sbjct: 7 RFIEAHIIQAIPFANLNRDDTNAVKTVTWGGKERTRVSSQCWKRAMRLY-LQTSLGQEAA 65 Query: 63 LRTIHLAQ-LRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISAD------------ 109 LRT L + L L + G D +++ EA K D Sbjct: 66 LRTRRLPEYLARHLEEHHGWPADLAERAGRHIVVASSVGGEAPKKKTDGEETGGTGEHWS 125 Query: 110 -----AVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKED------IAAIRVNLQQGV 158 + V E+A Q +A + + K K ++D + + GV Sbjct: 126 TAAMVYIPSSAVPELAELAIQYREALENAKEPKDPAKFGRKDSVIPTGKVDEILRRRNGV 185 Query: 159 DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQE-----QGSAH 213 I L GRM + +VDGA+ +AHA TTH ++ID+F+AVDD+ + GSAH Sbjct: 186 -INLFGRMLAQ---VDDAEVDGAVQVAHAFTTHATTTEIDYFSAVDDVTDIWGDTTGSAH 241 Query: 214 LGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAF 273 +G E S+GV YRY ++L L NLGG E E+A ++ +P AK+ + A Sbjct: 242 MGQAEHSAGVLYRYIVLDLNDLHANLGG-DLEATRELAAGLLKAALLSLPRAKKNSTAPH 300 Query: 274 NPADMVMVNFS-DMPLSMANAFEKAVKAK--DGFLQPSIQAFNQYWDRVANGYGLNGAAA 330 + + D P+S A AFEK V A G +PS+ A N+Y V G +G Sbjct: 301 TIPHLAHLTVRTDRPVSYAGAFEKPVPADRHGGHSEPSVAALNEYAAAVQKLLGTSGCRY 360 Query: 331 QF-SLSDVDPITAQVKQMPTLEQLKSWVR 358 + + I A +++ + ++L Sbjct: 361 AAHATLSQEKIDALGERVESFDKLIEGAL 389 >UniRef50_C8P6I6 CRISPR-associated protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I6_9LACO Length = 311 Score = 284 bits (728), Expect = 3e-75, Method: Composition-based stats. Identities = 80/322 (24%), Positives = 143/322 (44%), Gaps = 28/322 (8%) Query: 50 KSGYYAQNIGE--SSLRTIHLAQLRD-VLRQKLGERFDQKIIDKTLALLSGKSVDEAEK- 105 E + +RT+ L L+++ + + + + + + + +K Sbjct: 1 MMFKEQSTDAEWLAGIRTMRGPLLLANELQKQDSNLSSDEAMAQAVDVFNKAKIKLDKKT 60 Query: 106 ISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGR 165 A+ G+IA E V + D LD K + + L+ +D+AL GR Sbjct: 61 NQTKALLMLSHGQIAKLAEYVRQN--DELDSKAVKEALQ---------GDHSLDMALFGR 109 Query: 166 MATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSG 222 M VD A +AHAI+TH++ + D++TAVDD + E GSA +GT E+ S Sbjct: 110 MVADD---PSLNVDAACQVAHAISTHEIVPEYDYYTAVDDEKADDESGSAMIGTIEYDSA 166 Query: 223 VFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVN 282 YRYAN+N+ +L ++LG + A++ V +P KQ ++A V+V Sbjct: 167 TLYRYANVNVNELVQSLG--DVDTAVKGLQLFVKDFVLSMPTGKQNSFANKTVPQYVLVT 224 Query: 283 FS-DMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPIT 341 D P+++ +AFE+AVK++ G+LQPS+ + + S+ + + Sbjct: 225 VREDTPVNLVSAFEEAVKSRHGYLQPSVAKLEKEYQDTQQFVQTP----LASVVVTNKES 280 Query: 342 AQVKQMPTLEQLKSWVRNNGEA 363 + ++ L S + E+ Sbjct: 281 KISTKAADVDDLVSKITEVIES 302 >UniRef50_C2GEY7 CRISPR-associated Cse4 family protein n=6 Tax=Actinomycetales RepID=C2GEY7_9CORY Length = 356 Score = 284 bits (728), Expect = 3e-75, Method: Composition-based stats. Identities = 95/377 (25%), Positives = 151/377 (40%), Gaps = 44/377 (11%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 MSN + +H L S S LNRDD + K + GG R SSQS+KR R Y + Sbjct: 1 MSNQLTLHFLCSIPYSNLNRDDTGVPKRVMQGGALRALHSSQSIKRGSRV--LYENASQD 58 Query: 61 SSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGK-SVDEAEKISADAV--TPWVVG 117 S+R+ L + ++ D+K K A L G + EA+ DA T Sbjct: 59 LSIRSGRLDEEVAEKAMEMNPDLDEKTALKQAAKLIGNLTKGEAKSGEGDAKRSTWLSSE 118 Query: 118 EIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGK 177 EI A A++ D ++ I N + IA GRM + Sbjct: 119 EILT----AATYVANSTDPREKF---------IDGNTTGSLAIAAFGRMFANAT---DLN 162 Query: 178 VDGAMSIAHAITTHQVDSDIDWFTAVDDLQ----EQGSAHLGTQEFSSGVFYRYANINLA 233 + A++++ AITTHQ + D+F+ DD+ + + +L ++SG FYR I+ Sbjct: 163 TEAAVAVSPAITTHQATIETDYFSTADDINLRDHKANATYLDVSLYTSGTFYRTVTIDRN 222 Query: 234 QLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANA 293 QL+ + G E V L P K+ + A F +++ + +A Sbjct: 223 QLRTSWSGFESNSVRENLEAFVRSLVYGQPRGKKNSTAPFTMPSLILAE--EQQYRVAYD 280 Query: 294 FEKAVKAK---DGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTL 350 FE+ V+A GF++ SI+ + + A F + P+ A P L Sbjct: 281 FERPVEADKDGGGFMKSSIEKLAKQYT----------LARSFDPGNFGPVEALSGTYPDL 330 Query: 351 E----QLKSWVRNNGEA 363 + LK ++ Sbjct: 331 DGHFGDLKKASLDSLIG 347 >UniRef50_B6IWM4 CRISPR-associated protein, CT1975 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM4_RHOCS Length = 435 Score = 273 bits (699), Expect = 6e-72, Method: Composition-based stats. Identities = 95/425 (22%), Positives = 159/425 (37%), Gaps = 73/425 (17%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLR 64 I HVL + P +NRD+ K GG R RISSQ+ KRA+R + ++ + + R Sbjct: 15 IQFHVLTAFPPHNVNRDEDGRPKTCQLGGVTRGRISSQAKKRALRLAPHF--PTAQRATR 72 Query: 65 TIH--------------------LAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAE 104 T A L G + + + +A K ++A Sbjct: 73 TRKAGIHTFLKLTAAGIDTTSAVWAALAVNHATGGGGKPPKAEDAQAIAAPDPKKQEDAY 132 Query: 105 KISADAVT---------------PWVVG-------------EIAWFCEQVAKAEADNLDD 136 K AVT W+ G E A E +A A D Sbjct: 133 KKKEKAVTDMMEKRGLDRAAAEQEWLTGQVGTEQGLVISTREFARIEEGIAHLTAAWAAD 192 Query: 137 KKLLKVLKED------IAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITT 190 + + E ++ +D AL GRM + V+ A ++ HA TT Sbjct: 193 RDGFPAVLEGWVRQVCKESLLTKADHDLDTALFGRMVAANANF---NVEAACAVGHAFTT 249 Query: 191 HQVDSDIDWFTAVDDLQ---EQGSAHLGTQEFSSGVFYRYANINLAQLQENLG-GASREQ 246 H+ + D+F+A ++L+ G+ G F GV+Y++A ++ L+ L G S E+ Sbjct: 250 HRFALEGDYFSAGEELKVLGGTGAVITGYAFFGGGVYYQHAVLDRGHLRTTLSRGRSAEE 309 Query: 247 A----LEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMP-LSMANAFEKAVKAK 301 A ++ + L P K ++A+ A V+ P L++ AF VKA Sbjct: 310 AERLTVQAVDTFLTGLLFSQPRGKCNSHASDVAASYVLATRGGDPALNLGLAFLDPVKAT 369 Query: 302 D---GFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPIT--AQVKQMPTLEQLKSW 356 + + SI+ + + YGL A + + ++ T+E + + Sbjct: 370 EDVTDLMCASIRRLTDFHRALTAAYGLGNAVCVLNAYPPARGNDAPRAPEVWTVEDFRRF 429 Query: 357 VRNNG 361 V+ G Sbjct: 430 VQGRG 434 >UniRef50_B8HWH9 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH9_CYAP4 Length = 501 Score = 262 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 68/329 (20%), Positives = 138/329 (41%), Gaps = 23/329 (6%) Query: 48 MRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLA----LLSGKSVDEA 103 R E R L VL + ++ + + ++ A + + ++ Sbjct: 174 WRT--KLQSEFAEMPERVDDQVSLWSVLSIQALQKSQEDLANEDEADDEKVDTSNTMFFV 231 Query: 104 EKISADAVTPWVVGEIAWFCEQVAKAEADNLDD--KKLLKVLKEDIAAIRVNLQQGVDIA 161 + + + +++ + + ++ + K++ +K V + DIA Sbjct: 232 GDVEIENLAGFLLNNLQVVQQDISASVPSFSKAVVDKIIDTIKHKDEKGNVIFPKPGDIA 291 Query: 162 LSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQ---GSAHLGTQE 218 L GRM + KVD ++ +AHAI+ +++ + D+FTAV+DL E GS H+G Sbjct: 292 LFGRMMAN---LPNAKVDASVQVAHAISVNKLQQEFDFFTAVEDLAEPDSLGSGHMGETG 348 Query: 219 FSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADM 278 ++S +YR+ ++ QL++NLG + + A IA +P Q +AA + Sbjct: 349 YNSSTYYRFTTLDTEQLKQNLG--NEDNAATIAHAFAEAFVRAIPTGHQNGFAAHSLPAA 406 Query: 279 VMVNFS-DMPLSMANAFEKAVKAKDG--FLQPSIQAFNQYWDRVANGYGLNGAAAQFSLS 335 VM P+S+ +AFE V K G L+ ++ +++W ++ YG + ++ Sbjct: 407 VMAVVRKGQPVSLVDAFENPVAPKAGKSLLENAVSKLDEHWAELSKMYGEKTVVFKGIVA 466 Query: 336 DVDPITAQ----VKQMPTLEQLKSWVRNN 360 + P++E+L + Sbjct: 467 RAQLAQQLEYLAAVEKPSVEELLKDAIDA 495 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 51/112 (45%), Gaps = 8/112 (7%) Query: 5 INIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYY-----AQNIG 59 + IH+L S P+ LNRD+ M K +FGG R RISSQ KR R+ + ++ Sbjct: 3 LEIHILQSFPPANLNRDENGMPKSTVFGGYPRARISSQCQKRRTREYYHEYCKELGVDLK 62 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAV 111 + R+ + L++KL +R + + +A L+ + E Sbjct: 63 HFANRSRNW---IKQLKEKLTQRGVSEAQAELMASLTISVLSEKPDKKGKLK 111 >UniRef50_D0WFC9 CRISPR-associated protein, Cse4 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC9_9ACTN Length = 310 Score = 260 bits (664), Expect = 8e-68, Method: Composition-based stats. Identities = 66/270 (24%), Positives = 118/270 (43%), Gaps = 14/270 (5%) Query: 79 LGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKK 138 + E + I+K +L + +K + + +++ ++A+ L D + Sbjct: 1 MPEVSEGDAIEKAKEVLVALGF-KLKKEENEYLNEYLIFIGTLQIGKLAELAIQALRDGE 59 Query: 139 L--LKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSD 196 K K+ + R VDIA+ GRM VD ++ +AHAI+ +++ Sbjct: 60 KVDKKEAKKILDVKRSPALNAVDIAMFGRMVADA---PDLNVDASVQVAHAISVSSAETE 116 Query: 197 IDWFTAVDD---LQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATH 253 D+FTA+DD G+A + T EF+S +FYRYAN+++ L ENLG S + A + Sbjct: 117 FDYFTALDDKAPEDNAGAAMIETTEFTSAMFYRYANVDVFHLCENLG--SPDAATKGINA 174 Query: 254 VVHMLATEVPGAKQRTYAAFNPADMVMVNFSD-MPLSMANAFEKAVKA--KDGFLQPSIQ 310 + +P KQ ++A V++ D P+S+ N+FE+ V A L + + Sbjct: 175 FLQSFVKSMPTGKQNSFANRTLPSAVVIQLRDSQPVSLVNSFERPVVALRDKSQLTNAAE 234 Query: 311 AFNQYWDRVANGYGLNGAAAQFSLSDVDPI 340 A + +G+ + D Sbjct: 235 ALVAQEKALDEAFGVTPQHTFVVAASPDAS 264 >UniRef50_Q31XC0 Putative cytoplasmic protein n=1 Tax=Shigella boydii Sb227 RepID=Q31XC0_SHIBS Length = 245 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 69/240 (28%), Positives = 107/240 (44%), Gaps = 16/240 (6%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNI-G 59 M+ FI +H+L ++ + LNRDD K I GG R+R+SSQSLKRA R S + Q + G Sbjct: 1 MTTFIQLHLLTAYPAANLNRDDSGSPKTVILGGATRLRVSSQSLKRAWRTSELFEQALAG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERF--DQKIIDKTLALLSGKSVDEAEKISADAVTPWVVG 117 +R+ +A+ + + G + + L D+ +K P Sbjct: 61 HIGVRSGRIAREAATILIEKGIEDKKAIEWAVEIADYLGKAKKDKKQKNDKKPKDPLTSA 120 Query: 118 EIAWFCEQVAKAEADNLDD-----KKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMM 172 E ++ AE D + + + KE A+ + VDIA+ GRM Sbjct: 121 ETEQLV-HISPAEFDAVKALAHQLAEEKRAPKEKDLALLRKDRMAVDIAMFGRMLAKK-- 177 Query: 173 TELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDL----QEQGSAHLGTQEFSSGVFYRYA 228 V+ A +AHA + + D+FTAVDDL ++ G+ H+ F S +FY Y Sbjct: 178 -PGFNVEAACQVAHAFGVSETIVENDFFTAVDDLRQASEDAGAGHVDETGFGSALFYTYI 236 >UniRef50_B7KJ25 CRISPR-associated protein, Cse4 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ25_CYAP7 Length = 480 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 69/280 (24%), Positives = 113/280 (40%), Gaps = 20/280 (7%) Query: 82 RFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLK 141 D + ++ I D T + +A + + KKL Sbjct: 169 SSDDDTSTPEETESTITILELPGAIQGDLKTSYKDNPLAKVVNE-----EEFNQLKKLCN 223 Query: 142 VLKEDIAAIRVNLQQGV--DIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDW 199 +K + + + V D+AL GRM S VD ++S+AHAI+T+ + + D+ Sbjct: 224 EIKGILYDEKNKRIKPVPGDVALFGRMLAS---FSDASVDASVSVAHAISTNSIKREFDY 280 Query: 200 FTAVDDL------QEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATH 253 +TA D + QG+ H+G + F+SGVFYRY+ ++ QL ENLG +E + Sbjct: 281 WTAARDFQKNNSDESQGAGHIGDRPFASGVFYRYSCLDSNQLSENLGEIYQEDIQYLVEQ 340 Query: 254 VVHMLATEVPGAKQRTYAAFNPA-DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAF 312 + P + + V P+S+ NAF+ +K D F + S Sbjct: 341 YLDAFLHSRPSGYSHQFGHDTLPFAGIFVIRQSQPISLVNAFDIPIKKYDSFCRQSWNKL 400 Query: 313 NQYWDRVANGYGLNG---AAAQFSLSDVDPITAQVKQMPT 349 +W+ + YG FSL I+ VK +P Sbjct: 401 VDHWNEIQQAYGKRLPVKEVHVFSLESFKDISELVKAVPN 440 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 38/134 (28%), Positives = 56/134 (41%), Gaps = 10/134 (7%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE 60 M+NF+ IH+L S PS +NRD K A FGG R+R+SSQS K A+R+ Sbjct: 1 MTNFLEIHLLQSTPPSNMNRDQNGSPKTAHFGGVERLRVSSQSWKHAVRQYYKKTLPDDH 60 Query: 61 SSLRTIHLA-QLRDVLR-QKLGERFDQK--------IIDKTLALLSGKSVDEAEKISADA 110 + R +L L+ +K E + K I L G D+ ++ D Sbjct: 61 KTYRDKGWPTELAKRLKQEKFDEELNLKDSDFSVVLPIAFMLLSAIGAKRDDKKEGDIDT 120 Query: 111 VTPWVVGEIAWFCE 124 + E+ Sbjct: 121 MLFLGEAEVREIIN 134 >UniRef50_UPI0001B51C2C hypothetical protein SvirD4_12600 n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2C Length = 461 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 90/402 (22%), Positives = 148/402 (36%), Gaps = 92/402 (22%) Query: 3 NFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGE-- 60 + ++H+L + + + RD+ M K +FGG R I++Q+ +RA R N G+ Sbjct: 16 QYFSLHLLETFTAALPVRDENGMPKQFVFGGDPRTMITAQARRRAERTHSRERANAGQGP 75 Query: 61 -----SSLRTIHLAQL-------------------RDVLRQKLGERFDQKIIDKTLALLS 96 +RT A+L L + +G +F K + L + Sbjct: 76 LAGYTMGIRTREWAKLTAKALADRYGWDRADALATAKALLEGVGLKFGAKPTTRDLTQVL 135 Query: 97 GKSVDEAEKISADAVTPWVVGEIAWFCEQVA---------------------------KA 129 + ++A +I AD + AW + + A Sbjct: 136 LFAPEDAGQIIADWIQEHRAEVAAWTSDYLKAKEAGAAAAAAKKAAAAAARKAKKSGTDA 195 Query: 130 EADNLDD------KKLLKVLKEDIAAIRVNL--QQGVDIALSGRMATSGMMTELGKVDGA 181 A DD ++L V ++ AI L + +DIAL GR + VDGA Sbjct: 196 LASAADDNQPNNEEQLPPVPRKIREAILSALAPRDAIDIALYGRFLAEIADSP--NVDGA 253 Query: 182 MSIAHAITTHQVD------------------SDIDWFTAVDDLQEQGSAHLGTQEFSSGV 223 + AHA T H + +D+ A DD G+ G Q SG Sbjct: 254 IQTAHAFTVHAAEHIDDFYAAADDAKLHRKAHALDYIDAADD---SGAGMTGYQSLISGT 310 Query: 224 FYRYANINLAQLQENL--GGASREQAL----EIATHVVHMLATEVPGAKQRTYAAFN-PA 276 FYR+A ++ +L+ NL G +Q V +P AK+ T AA Sbjct: 311 FYRHAVLDRYKLRINLLASGMKPDQVQAAAEAAELEFVEAFTNAIPQAKKNTTAATGILP 370 Query: 277 DMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDR 318 +VM P + A FEK + + + S+ A ++ ++ Sbjct: 371 KLVMAFTGARPFNYAGIFEKPIAEETDGV-ASVAAADRLLNQ 411 >UniRef50_UPI000190E665 hypothetical protein SentesTyp_08452 n=3 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190E665 Length = 139 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 39/138 (28%), Positives = 63/138 (45%), Gaps = 6/138 (4%) Query: 1 MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQN-IG 59 M+ FI +H+L +++P+ LNRD+ K A GG R+R+SSQSLKRA R S + G Sbjct: 1 MTTFIQLHLLTAYAPANLNRDESGRPKTAFMGGVERLRVSSQSLKRAWRVSETFEAAMDG 60 Query: 60 ESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTP--WVVG 117 RT + D + + + + ++ I K+ + L K + K DA + Sbjct: 61 FMGKRTRRIG--VDYVYRPMKDAGIEEKIAKSSSELIAKQFGKL-KSDKDAKPEKNLEIE 117 Query: 118 EIAWFCEQVAKAEADNLD 135 +I +D Sbjct: 118 QIVHVSNHEISLIKQLVD 135 >UniRef50_UPI0001B58196 CRISPR-associated Cse4 family protein n=1 Tax=Streptomyces sp. C RepID=UPI0001B58196 Length = 91 Score = 83.3 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 34/84 (40%), Gaps = 4/84 (4%) Query: 262 VPGAKQRTYAAFNPADMVMVNFS-DMPLSMANAFEKAV---KAKDGFLQPSIQAFNQYWD 317 +P K T+ D+V+V P+S AFEK V + +G ++ + +A ++ Sbjct: 1 MPTGKANTFGNHTLPDVVIVKLRSSRPVSFVGAFEKPVIQHETGEGHVRAAWKALAEHIP 60 Query: 318 RVANGYGLNGAAAQFSLSDVDPIT 341 + +G A P T Sbjct: 61 AIEKTFGATADATWILRVGEPPTT 84 >UniRef50_C2BS05 Possible CRISPR-associated protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BS05_9ACTO Length = 435 Score = 66.4 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 3/93 (3%) Query: 274 NPADMVMVNFSD-MPLSMANAFEKAVK-AKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQ 331 + ++V V D +S+ NAFE+ V + +Q +++ + + YG+ AA Sbjct: 339 SLPELVYVAVRDTRSVSLVNAFEEPVACERGSRVQAAVEVLANEETAIEDAYGMKPLAAF 398 Query: 332 FS-LSDVDPITAQVKQMPTLEQLKSWVRNNGEA 363 D + T+ +L S + + Sbjct: 399 VVDPKDYAAKLEDIAHKVTVPELTSLIVEVLAS 431 >UniRef50_O87037 Z35f protein n=1 Tax=Vibrio cholerae RepID=O87037_VIBCH Length = 96 Score = 45.6 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 262 VPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKD-GFLQPSIQAFNQYWDRVA 320 +P A+Q T + P + V + +FE+ V+A G+L P+ +A + ++ Sbjct: 1 MPNARQTTQSGACPWEYARVLVR-KGQRLQASFEQPVRAAGEGYLLPNKKALQNWLEQRE 59 Query: 321 NGYG 324 G Sbjct: 60 KLSG 63 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.157 0.504 Lambda K H 0.267 0.0486 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,688,984,506 Number of Sequences: 3077464 Number of extensions: 102254629 Number of successful extensions: 216389 Number of sequences better than 1.0e-01: 89 Number of HSP's better than 0.1 without gapping: 147 Number of HSP's successfully gapped in prelim test: 58 Number of HSP's that attempted gapping in prelim test: 215375 Number of HSP's gapped (non-prelim): 288 length of query: 363 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 233 effective length of database: 640,326,036 effective search space: 149195966388 effective search space used: 149195966388 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 94 (40.6 bits)