BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (502 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46901 Uncharacterized protein ygcL n=11 Tax=Proteobact... 1045 0.0 UniRef50_D0FPP1 CRISPR-associated protein, Cse1 family n=2 Tax=E... 540 e-152 UniRef50_C5SD51 CRISPR-associated protein, Cse1 family n=1 Tax=A... 409 e-112 UniRef50_A1SV74 CRISPR-associated protein, Cse1 family n=2 Tax=G... 127 8e-28 UniRef50_Q12YB1 CRISPR-associated protein, Cse1 family n=1 Tax=M... 94 2e-17 UniRef50_Q314I1 Putative uncharacterized protein n=1 Tax=Desulfo... 93 3e-17 UniRef50_Q1R117 Putative uncharacterized protein n=1 Tax=Chromoh... 92 4e-17 UniRef50_B8GIV6 CRISPR-associated protein, Cse1 family n=1 Tax=M... 91 8e-17 UniRef50_Q2FNL5 Putative uncharacterized protein n=1 Tax=Methano... 85 5e-15 UniRef50_D2TKK4 CRISPR-associated protein n=1 Tax=Citrobacter ro... 81 8e-14 UniRef50_A7ZQK3 CRISPR-associated protein, Cse1 family n=55 Tax=... 79 3e-13 UniRef50_B3E5V2 CRISPR-associated protein, Cse1 family n=3 Tax=D... 79 5e-13 UniRef50_B4TTX5 Crispr-associated protein, Cse1 family n=9 Tax=S... 79 6e-13 UniRef50_Q054L1 Putative uncharacterized protein n=2 Tax=Leptosp... 71 1e-10 UniRef50_A5FZI3 CRISPR-associated protein, Cse1 family n=2 Tax=A... 69 4e-10 UniRef50_D0KFE0 CRISPR-associated protein, Cse1 family n=4 Tax=E... 69 6e-10 UniRef50_B4RSK0 CRISPR-associated protein, Cse1 family n=6 Tax=G... 66 4e-09 UniRef50_B5ZCF9 CRISPR-associated protein, Cse1 family n=10 Tax=... 63 3e-08 UniRef50_B8IZA3 CRISPR-associated protein, Cse1 family n=1 Tax=D... 62 4e-08 UniRef50_Q0W587 Putative uncharacterized protein n=1 Tax=uncultu... 62 4e-08 UniRef50_Q2RY16 CRISPR-associated protein, Cse1 family n=1 Tax=R... 55 5e-06 UniRef50_D1CGD1 CRISPR-associated protein, Cse1 family n=1 Tax=T... 50 1e-04 UniRef50_B6WQ59 Putative uncharacterized protein n=1 Tax=Desulfo... 49 4e-04 UniRef50_A7BA67 Putative uncharacterized protein n=1 Tax=Actinom... 47 0.001 UniRef50_Q0BSC4 Putative uncharacterized protein n=1 Tax=Granuli... 47 0.002 UniRef50_A5UR17 CRISPR-associated protein, Cse1 family n=1 Tax=R... 46 0.003 UniRef50_D2L2X5 CRISPR-associated protein, Cse1 family n=1 Tax=D... 46 0.004 UniRef50_C6C421 CRISPR-associated protein, Cse1 family n=3 Tax=E... 45 0.005 UniRef50_A5GBL8 CRISPR-associated protein, Cse1 family n=1 Tax=G... 42 0.039 UniRef50_B0TDT8 Crispr-associated protein, ct1972 family, putati... 41 0.090 >UniRef50_Q46901 Uncharacterized protein ygcL n=11 Tax=Proteobacteria RepID=YGCL_ECOLI Length = 502 Score = 1045 bits (2702), Expect = 0.0, Method: Compositional matrix adjust. Identities = 502/502 (100%), Positives = 502/502 (100%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP Sbjct: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG Sbjct: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID Sbjct: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE Sbjct: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 Query: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF Sbjct: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 Query: 301 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER 360 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER Sbjct: 301 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER 360 Query: 361 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE Sbjct: 361 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 Query: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA Sbjct: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 Query: 481 LARATLYKHLRELKPQGGPSNG 502 LARATLYKHLRELKPQGGPSNG Sbjct: 481 LARATLYKHLRELKPQGGPSNG 502 >UniRef50_D0FPP1 CRISPR-associated protein, Cse1 family n=2 Tax=Erwinia pyrifoliae RepID=D0FPP1_ERWPY Length = 507 Score = 540 bits (1392), Expect = e-152, Method: Compositional matrix adjust. Identities = 271/499 (54%), Positives = 348/499 (69%), Gaps = 2/499 (0%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D+WIPVRP +GG+ Q I LQ+L C +W ++LPRDDME+A LLVC+ Q + Sbjct: 1 MNLLTDDWIPVRPLSGGEGQQITLQTLLCDDRRWLVALPRDDMEMATFQLLVCLLQTLWM 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D + RI PL+ EF +A W F LNH + PFMQ +GV A +VT M+KLL G Sbjct: 61 PSDAQQLVQRIRQPLSAREFADGVAGWQQAFDLNHPQQPFMQVRGVAAKEVTGMDKLLVG 120 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 ++G+T+ AFVNQ GQG+ALC GCTAIALFNQA APGFGGGFKSGLRGG+PVTT V+G Sbjct: 121 LTGSTSGAFVNQSGQGKALCSGCTAIALFNQACNAPGFGGGFKSGLRGGSPVTTLVQGDC 180 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQP-TWIKPIKSNESIPASSIGFVRGLFWQPAHI 239 LR+T+ NVL+ L + P+ QP TW +PIK +++I SSIG RGLFWQPAHI Sbjct: 181 LRTTLWFNVLSETTLDEFCPDWREQRAQPFTWQQPIKKDQAIAGSSIGLARGLFWQPAHI 240 Query: 240 ELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLA 299 EL P G G+CS CG+ ++ RY FLKEKF FTVNGLW HPHSP + +KKG+VE +++A Sbjct: 241 ELSPPDGAGQCSACGRMASQRYRSFLKEKFNFTVNGLWLHPHSPLIQQIKKGQVEWRYMA 300 Query: 300 FTTSAPSWTQISRVVVDKII-QNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASIL 358 F+T APSWTQI R+++++ + + + G RVA V Q R ++ L L++GGYRNNQASI+ Sbjct: 301 FSTPAPSWTQIGRLLIEQQVNKQQEGRRVATTVEQARMLSRGRALRLMIGGYRNNQASII 360 Query: 359 ERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHET 418 ERRH+VL FNQGWQ VINEIV +GL Y+ ALR AL+ FAEG K D KGAGV++HE Sbjct: 361 ERRHEVLQFNQGWQHAMPVINEIVNLGLEYRKALRTALWIFAEGAKESDIKGAGVALHEK 420 Query: 419 AERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLIST 478 + +YRQS + ++LA +++ + + + QLC+ LFN APYAHHPKLI + Sbjct: 421 VDPQYYRQSHARVLNLLAQIDYQSPLPQLEQFQTQQQQLCQQLFNDLTAPYAHHPKLICS 480 Query: 479 LALARATLYKHLRELKPQG 497 LA AR L L +LKPQG Sbjct: 481 LAKARRYLMSSLAKLKPQG 499 >UniRef50_C5SD51 CRISPR-associated protein, Cse1 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD51_CHRVI Length = 507 Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust. Identities = 223/506 (44%), Positives = 305/506 (60%), Gaps = 14/506 (2%) Query: 1 MNLLIDNWIPVRPRNG-GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII- 58 M+LL WIPVR G G +++ Q L C + W++SLPRDD+ELA L LL+C+ QI+ Sbjct: 1 MDLLKTPWIPVRAHGGSGTFRLLTYQELLCEDEDWQISLPRDDLELACLQLLICMTQIMF 60 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P +DDV RI PLT DEF + I+P ++ F L+H PFMQT+GV A DVTP++KLL Sbjct: 61 LPPEDDV-LLDRIDIPLTPDEFTEGISPCLEWFDLDHPTQPFMQTRGVVAKDVTPIQKLL 119 Query: 119 AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRG 178 G+ TN AF N PG+ L AIALF+QA P FGGGFK LRG P+TT V G Sbjct: 120 IGLPEGTNHAFFNAPGEVSVLSAPVAAIALFHQATNCPSFGGGFKGSLRGIAPITTLVDG 179 Query: 179 IDLRSTVLLNVLTLPRLQKQFPNESH--TENQPTWIKPIKSNESIPASSIGFVRGLFWQP 236 +LR + NVLT ++ FP+ H +++ PTWI+PI+S E+I A IG RGLFWQP Sbjct: 180 RNLRKRIWCNVLTPEFIRTDFPDWQHDLSQDLPTWIEPIRSKETIHAHQIGLARGLFWQP 239 Query: 237 AHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK 296 AH+EL G C G E+ YTGF KEKF FT+ G WPHPH ++KKG +E K Sbjct: 240 AHVELVGSRESGPCDLLGIEAGPLYTGFRKEKFNFTLEGTWPHPHGVLQSSLKKGALEMK 299 Query: 297 FLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA------PQSPLELIMGGY 350 F +FTT AP+WT+++ +V+ G+R A V Q +++A PQ PL LI+GGY Sbjct: 300 FASFTTEAPAWTRLTEMVLRINGPKGEGSRPATPVAQAKSMAVTALEKPQ-PLTLIIGGY 358 Query: 351 RNNQASILERRHDVLMFNQGW-QQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFK 409 RNN+AS+ ERRH++L GW ++ G+ + ++V +G+ K +L+ L ++G K K Sbjct: 359 RNNKASVTERRHEMLSLAAGWSEEDGSRLKDLVALGIKAKESLKDKLSFASKGHKKKMLP 418 Query: 410 GAGVSVHETAERHFYRQSELLIPDVLAN-VNFSQADEVIADLRDKLHQLCEMLFNQSVAP 468 G G + + ER FY ++E I + L+ F Q E A D L C +F+ P Sbjct: 419 GIGSPIQDVGERIFYSRTEGKIIETLSRPTTFMQWKENRAAYIDALAADCRDIFDAMTEP 478 Query: 469 YAHHPKLISTLALARATLYKHLRELK 494 Y P+LI +A AR +L L++LK Sbjct: 479 YTMKPELIPIIAWARRSLNADLKKLK 504 >UniRef50_A1SV74 CRISPR-associated protein, Cse1 family n=2 Tax=Gammaproteobacteria RepID=A1SV74_PSYIN Length = 488 Score = 127 bits (320), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 125/507 (24%), Positives = 214/507 (42%), Gaps = 39/507 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D++I + GK I+L+++ ++L D+++LA L LL + ++ Sbjct: 1 MNLLKDDFIST---SQGK---ISLKTILTGEQNYQLQYYFDEIQLAMLQLLSSLSTVVLQ 54 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTK---GVKANDVTPMEKL 117 E + + N LT ++++ + ++ + FMQ+K K D P+ KL Sbjct: 55 PTVQ-ELKDYLKNGLTPEQYEAALDKVESQWFESDC---FMQSKPPTNAKWPDA-PITKL 109 Query: 118 LAGVSGATNC---AFVNQPGQGEALCGGCTAIALFNQANQAPG--FGGGFKSGLRGGTPV 172 L+G+ T+ ++ Q E C C +N G FG +G+RGG + Sbjct: 110 LSGIECGTSANAMGLFSEIEQAEISCTDCMHALNYNLHMNIKGECFGPTGATGIRGGGAI 169 Query: 173 TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGL 232 +T + G +L+ T+L N + +S E +P W+ P+ S AS IG RGL Sbjct: 170 STLIAGENLKQTLLNNTIAKDYFNDYAQLDSDAEQRPMWVAPL-SGSVYQASKIGINRGL 228 Query: 233 FWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVN----------GLWPHPHS 282 F HI C CG ES F +EK+ G WPHP++ Sbjct: 229 FALAYHIGFNIEDKPCLCDVCGSESEQSVKTFNREKYKGNYGSTKNGREAGAGWWPHPYT 288 Query: 283 PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSP 342 P + K E A + SW ++ +V K ++ A ++ QF+ + Sbjct: 289 PRTI---KEEGAFAVCARDQNWQSWQELGSYIVGKET-DKATLEPAYIIKQFQYMKTPRQ 344 Query: 343 LELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKAL-YTFAE 401 L++GG +Q I R +D+ ++ + + +++ GL K L +A F Sbjct: 345 TNLLVGGNIADQGGITGRVYDLYSMPSSLNKHLSKVTQVLDSGLDQKNRLSQAFNKMFGA 404 Query: 402 GFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEML 461 G+ +K+F G + E A F ++ +I L +V +A E+ +L Q + + Sbjct: 405 GY-DKNFVGG---IKENAMYRFTANAQQIIQRTLLDVERKEATELRKTAVIELKQEAQRI 460 Query: 462 FNQSVAPYAHHPKLISTLALARATLYK 488 F Y H L L + LY+ Sbjct: 461 FMGVQRKYQHDLPLFKALVKGESALYR 487 >UniRef50_Q12YB1 CRISPR-associated protein, Cse1 family n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YB1_METBU Length = 528 Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 122/533 (22%), Positives = 212/533 (39%), Gaps = 57/533 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQ--SLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NL+ + WI V+ ++G + I Q S L PR D + + L+ + Q Sbjct: 3 FNLIHEKWIWVQRQDGTRSMIAPWQITDEIGSNPIISLDEPRPDFNGSMIQFLIGLVQTT 62 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 K D ++R R ++P T +E + +F L+ + FMQ ++ LL Sbjct: 63 MSPKSDGKWRKRFISPPTPEELLETFEKVAHVFDLDGDDERFMQDHEHIEGAKNRVDALL 122 Query: 119 AGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + G N + +C C ALFN AP G G ++ LRGG P+TT Sbjct: 123 MEMPGVQTLKHNADHFQKRDTVTQMCLPCCVTALFNLQLNAPAGGQGHRTSLRGGGPLTT 182 Query: 175 FVRGIDLRSTVLLNVLTLPRLQ--KQFPNESHTENQPTWIKPIKSNESIPASSIGFV--- 229 V G +L T+ LNV++ + N ++ P W+ P +++E A + Sbjct: 183 LVLGSNLWQTIWLNVISDENFKGLGDVDNCEISDIYP-WMGPTRTSEKKNAMTTPMDVNP 241 Query: 230 RGLFW-QPAHIEL-CDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVT 287 + ++W P I L D + G C CG ES+ + ++ + + + +G W H SP Sbjct: 242 KQMYWGMPRRIRLDLDDLIEGACDVCGCESDKLVSNYVTKNYGYNYDGGWCHVLSPHNEN 301 Query: 288 VK---------KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 G +L P S + ++ +Q + ++ + + F+N Sbjct: 302 KNGLLPRHPQPGGITYRHWLGLVQHDPDKGLYSSLAFERFVQKQKD--LSDLGDVFKN-T 358 Query: 339 PQSPLELIMGGYRNNQASILERRHDVL-MFN---QGWQQYGNVINEIVT----VGLGYKT 390 PQ L GY + + + +FN + Q Y +++ +V + ++ Sbjct: 359 PQ----LWAFGYDFDNMKVRCWYESTMPLFNVADELRQSYEQIVSRLVKTAEIIAYNTRS 414 Query: 391 ALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADL 450 ++KAL F + D E FY I + L +V +A ++ +L Sbjct: 415 CVKKAL--FGDNTPRGDLSFIDSRFWHDTESEFYN-----ILNQLTDVVNDEA--MVLEL 465 Query: 451 RDKLHQ----LCEMLFNQSVAPYAHHPKLISTLALARATL-YKHLRELKPQGG 498 + K H+ + E LF+ S S + RA L +K LR+ GG Sbjct: 466 KMKWHKELSRISEKLFDDSSQSMQ-----FSVIDPERAALAHKDLRKFNSDGG 513 >UniRef50_Q314I1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I1_DESDG Length = 534 Score = 92.8 bits (229), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 98/370 (26%), Positives = 149/370 (40%), Gaps = 30/370 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +N+L D W+PV +G +++I + L L +PR D A L LV Q + P Sbjct: 4 LNILSDQWLPVILADGKRIRIAPWE-LTADPRPVALDIPRPDFGGAMLEFLVGCMQTMCP 62 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPMEKLL 118 + ++R P + + P+I F+L FMQ +K + V + LL Sbjct: 63 PQSRKDWRSWRKTPPQPQTLRTAMEPFIPHFHLLGERPLFMQDLTLKQEEENVMGVAALL 122 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G + F + G+ E LC C A+AL+ AP G G+++ LRGG P++T Sbjct: 123 IDSPGENAIKNDTDFFVKRGRIETLCPACAAMALYTMQAFAPSGGAGYRTSLRGGGPLST 182 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQ------PTWIKPIKSNESIPASSIGF 228 V G L TV NVL + P+ + PT K +PA+ Sbjct: 183 LVLGETLWETVWNNVLVAESTDWRIPDGHDPLGRILPWTVPTRDSKKKGTAILPATGHNL 242 Query: 229 VRGLFWQPA-----HIE-LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHS 282 + FW H E L DP C CG S + + G W HP + Sbjct: 243 LH--FWAMPRRFRLHPENLSDP---AACDICGTPSTTVIRQIGAKNYGNNYEGAWQHPLT 297 Query: 283 PCLVTVK-KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA-VVNQFRNIAPQ 340 P K K + K + T+ W + + + ++N VAA + QFR + P Sbjct: 298 PYREQGKGKLALSVKGASECTAYHQWLGL----LYGPLGSKNKTLVAAQCIRQFRELLPA 353 Query: 341 SPLELIMGGY 350 S + + GY Sbjct: 354 SAVRVRAFGY 363 >UniRef50_Q1R117 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R117_CHRSD Length = 564 Score = 92.4 bits (228), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 85/310 (27%), Positives = 126/310 (40%), Gaps = 39/310 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D W+P R R+ G + + S D L+LPR D + AA L+ + Q Sbjct: 3 MNLLTDPWLPFR-RSDGSL-LYRPPSALADPDILDLALPRADFQGAAWQFLIALLQTAMT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQT-KGVKANDVTPMEKLLA 119 K+ + R P + +EF+ +AP+ F L+ FMQ ++ P+ LL Sbjct: 61 PKNTDAWLDRYQTPPSVEEFEAALAPFSRAFELDGEGPRFMQDLDPLEDVKDAPVAGLLI 120 Query: 120 GVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 GA N F + G+ EA+C C A+AL+ AP G G + GLRGG P+TT Sbjct: 121 DSPGANGIKNNTDFFVKRGRVEAVCPDCAALALYTMQINAPAGGAGIRVGLRGGGPLTTL 180 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPN--ESHTENQP--TWIKPI---------------- 215 + D ++ ++ +PN + QP TW P Sbjct: 181 ILPEDETKSL---------WERLWPNVMPADAVGQPGQTWRPPTVDDADLFFWMSDTRVS 231 Query: 216 --KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV 273 K E P + + L+ P L G C CG+E + +K Sbjct: 232 DKKGTEVFP-DQVHPLHALWSMPRRYRLLFEDESGCCDLCGRECSRLVRRLRSKKQGANY 290 Query: 274 NGLWPHPHSP 283 +G W HP +P Sbjct: 291 DGPWRHPLTP 300 >UniRef50_B8GIV6 CRISPR-associated protein, Cse1 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV6_METPE Length = 534 Score = 91.3 bits (225), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 96/351 (27%), Positives = 149/351 (42%), Gaps = 26/351 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQII--NLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +NL+ WIPV ++G + I L S Y L PR D A + L+ I Q Sbjct: 2 LNLIEQAWIPVIRKDGERSTIAPWELTSDYQENPIVELDAPRPDFNGALVQFLIGIVQTE 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P + V ++ P + + + I+ F L+ FMQ + + ++KLL Sbjct: 62 LPPTNPVTWKRMFRRPPEPADLKASFSTHIEAFNLDGDGPRFMQDLTLAKGEALAIDKLL 121 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G N + G + LC C A+ALF AP G G ++ LRGG P+TT Sbjct: 122 IERPGEQTVKKNTDHFLKRGGIDHLCMTCAAMALFTLQTNAPSGGRGHRTSLRGGGPLTT 181 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPT--WIKPIKS---NESIPASSIGFV 229 V G L TV LNV++ P+ +++ N + T W+ ++ NE + Sbjct: 182 LVTGRTLWETVWLNVIS-PQELERYGNSALTSAADIFPWMGETRTSNNNEITTPQDVNPA 240 Query: 230 RGLFWQPAHIELCDPIGI---GKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 + + P I L D G G+C CG+ + + + F + G W H SP Sbjct: 241 QMFWGMPRRIRL-DLDGKPEPGECDLCGKTTERQVSTFSAKDSGVNYKGGWCHVLSP-YS 298 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNE--NGNRVAAVVNQFR 335 T KGE+ K + P + ++QN+ N ++ AAVV+ FR Sbjct: 299 TNPKGELLAKH-----AQPGGVTYRNWL--GLVQNDSQNNSQPAAVVSLFR 342 >UniRef50_Q2FNL5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNL5_METHJ Length = 532 Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 74/274 (27%), Positives = 119/274 (43%), Gaps = 20/274 (7%) Query: 1 MNLLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 ++LL D WIPV +NG G + + S Y + L+ R D A + L+ + Q + Sbjct: 2 IHLLHDAWIPVVRKNGDSGLIAPHQITSDYDTNPVIELNASRPDFNGALIQFLIGLIQTV 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P + D E+ R+ N + D + D F L+ FMQ + ++ LL Sbjct: 62 CPPESDKEWTDRLDNVIPSDVLKGHFKQIQDAFSLDGKGPRFMQDISIGDEKKNSVDGLL 121 Query: 119 AGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + G N F + + +C C A+ALF PG G G K+GLRGG P+T+ Sbjct: 122 IEMPGENTVKKNTDFFVKRDTVKQMCPSCAAMALFTLQVNGPGGGAGHKTGLRGGGPLTS 181 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPT------WIKPIKSNESIPASSIGF 228 + G L TV LN+ +P + +F ++ + + + W I+ ++ + + Sbjct: 182 VILGETLWETVWLNI--IPSI--KFFGDAIAKQKKSMDMIFPWFGKIRLSDKKEKTGVID 237 Query: 229 VRGL--FWQPAHIELCD--PIGIGKCSCCGQESN 258 V L FW L D +G C CG S+ Sbjct: 238 VNPLQMFWGMGRRILLDFEDKPVGACDVCGLASS 271 >UniRef50_D2TKK4 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK4_CITRO Length = 519 Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 84/311 (27%), Positives = 131/311 (42%), Gaps = 30/311 (9%) Query: 1 MNLLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +NL+ W+PVR +NG GK+ ++L + ++ R D++ AA L+ + Q Sbjct: 4 VNLIFCQWLPVRFKNGATGKLAPVDL----ADENVVDIAATRADLQGAAWQFLLGLLQSS 59 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 K+ + LT + + +AP F+ FMQ A + + LL Sbjct: 60 IAPKNYSRWEDIWEEGLTGEMLHKALAPLGHAFHFGAESPSFMQDFEPLAGEKVSIASLL 119 Query: 119 AGVSGATNCAFVN----QPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + GA F + G E LC C A+ALF+ AP G G+++GLRGG P+TT Sbjct: 120 PEIPGAQTIKFNKDHFIKRGVTERLCPHCAALALFSLQLNAPSGGKGYRTGLRGGGPLTT 179 Query: 175 FV--------RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI 226 + R L + LNV+ P P W+ +++E PA+++ Sbjct: 180 LIELQEYKGERQTPLWRKLWLNVMPQDTADLPLPAVCDASVFP-WLAATRTSEP-PANTV 237 Query: 227 GFVR-----GLFW-QPAHIEL-CDPIGIGKCSCCGQESNLRYTGFLKEK-FTFTVNGLWP 278 ++W P I L G C CG ES+ GF+ K + +G W Sbjct: 238 TTPEQVNKLQMYWGMPRRIRLDFATTQTGLCDICGVESDA-LLGFMTVKNYGVNYDG-WR 295 Query: 279 HPHSPCLVTVK 289 HP +P VK Sbjct: 296 HPLTPYRAPVK 306 >UniRef50_A7ZQK3 CRISPR-associated protein, Cse1 family n=55 Tax=Enterobacteriaceae RepID=A7ZQK3_ECO24 Length = 520 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 103/516 (19%), Positives = 211/516 (40%), Gaps = 49/516 (9%) Query: 1 MNLLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +LL W+PVR ++G GK+ ++L + ++ PR D++ AA L+ + Q Sbjct: 4 FSLLTTPWLPVRFKDGTTGKLAPVDL----ADENVVDIAAPRADLQGAAWQFLLGLLQTS 59 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 K+ + + L ++ ++ + F FMQ D + LL Sbjct: 60 FAPKNHGRWDDIWEDGLEAEKLREALLSLEHAFQFGADSPSFMQDFEALKGDKVQVASLL 119 Query: 119 AGVSGATNCAFVN----QPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + GA F + G E +C C+A+ALF+ AP G G+++GLRGG P+TT Sbjct: 120 PEIPGAQTTKFNKDHFIKRGVTEHVCPHCSALALFSLQLNAPSGGKGYRTGLRGGGPMTT 179 Query: 175 FVRGIDL---RSTVLLNVL---TLPRLQKQFPNESHTENQP-TWIKPIKSNE----SIPA 223 + + + T L L +P+ + P ++ W+ P +++E + Sbjct: 180 LIELQEYQGNQQTPLWRKLWPNVMPQDEADLPLPKKFDDLVFPWLGPTRTSELAGAVVTH 239 Query: 224 SSIGFVRGLFWQPAHIEL-CDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHS 282 + ++ + P I + + +G C CG++S+ + + + +W HP + Sbjct: 240 DQVNKLQAYWGMPRRIRIDFNTTTVGNCDICGEQSDALLSLMTTKNYGANY-AMWQHPLT 298 Query: 283 PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP--Q 340 P + +K+G +F + + + + ++EN + A+V + N + Q Sbjct: 299 PYRIPLKEG---GEFYSVKPQPGGLIWRDWLGLIETGKSENNTELPALVVKLFNASSLKQ 355 Query: 341 SPLELIMGGY--RNNQASILERRHDVLMFNQGWQQY------GNVINEIVTVGLGYKTAL 392 + + L GY N +A H L+ + Q + I+++ ++AL Sbjct: 356 AKVGLWGFGYDFDNMKARCWYEHHFPLLLKKKEGQIPKLRLAAQTASRILSL---LRSAL 412 Query: 393 RKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ-ADEVIADLR 451 ++A ++ +G + DF + + F R ++ + Q ADE++ + Sbjct: 413 KEAWFSDPKGARG-DFSFVDIDFWNKTQHRFLR--------LVRQIEEGQDADELLGKWQ 463 Query: 452 DKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLY 487 ++ F++ V + P + + AR + Sbjct: 464 KEIWLFARQDFDERVFTNPYEPVDLKRVMTARKKYF 499 >UniRef50_B3E5V2 CRISPR-associated protein, Cse1 family n=3 Tax=Deltaproteobacteria RepID=B3E5V2_GEOLS Length = 539 Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 118/502 (23%), Positives = 188/502 (37%), Gaps = 41/502 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIPV G+ I Q L+ PR D + A LL+ + Q Sbjct: 1 MNLIKDAWIPVIRAKSGRGVIAPWQIAELDDPVMELAAPRPDFQGAMYQLLIGLLQTGFA 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHP-FMQTKGVKANDVTPMEKLL- 118 +D E+ P + + F + + P FMQ + + + LL Sbjct: 61 PEDFDEWLDYWSKPPDATLLRTRLETLAAAFDFDKPDSPAFMQDYAMPDGEKKGIASLLI 120 Query: 119 ---AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G + N + + +C C A+ALF AP G G + GLRGG P+TT Sbjct: 121 ESPGGKTVKDNLDHFIKRDAVQHMCKSCAAMALFTLQTNAPSGGVGHRVGLRGGGPLTTL 180 Query: 176 V---RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA----SSIGF 228 V + L T+ LNVL + + + W+ P +++E A S+ Sbjct: 181 VLPPEQMPLWQTLWLNVLDREDMPEY--RQDRVAGVFPWMGPTRTSEKNGAETTPESVHA 238 Query: 229 VRGLFWQPAHIELCDP--IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 ++ + P I L P +G C C ++ + + G W HP +P Sbjct: 239 LQAYWGMPRRIRLDFPAKASMGDCDVCDVKNVALVEEYRTRNYGVNYVGNWVHPLTPYRF 298 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFR-----NIAPQS 341 KK E+ L+ + + ENG+ A +V ++ + Q Sbjct: 299 DPKK---EKPPLSLKGQQGGLGYRYWLALTLANDTENGDAAAKIVRRYSEQRATELKIQR 355 Query: 342 PLELIMGGYR-NNQASILERRHDVLMFNQGWQQYGNVI---NEIVTVGLGYKTALRK--- 394 L G+ +N + H +FN QQ ++ ++++TV + LRK Sbjct: 356 TARLWCFGFDMDNMKARCWYDHTFPLFNLAPQQRKKLLQWADDLITVANDVSSLLRKQVK 415 Query: 395 -ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA---DEVIADL 450 A + E K D + + +E FY ELL D LA V+ Q E+ + Sbjct: 416 AAWFRRPEDAKG-DMNTVSLDFWQRSEPVFY---ELL--DQLAKVSGEQELPPPELYSQW 469 Query: 451 RDKLHQLCEMLFNQSVAPYAHH 472 L L LF+ V A+ Sbjct: 470 EKMLVSLSLQLFDAWVLEAANE 491 >UniRef50_B4TTX5 Crispr-associated protein, Cse1 family n=9 Tax=Salmonella enterica subsp. enterica RepID=B4TTX5_SALSV Length = 511 Score = 78.6 bits (192), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 80/315 (25%), Positives = 135/315 (42%), Gaps = 25/315 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ + W+PV +G K +I +L+ L +R Q L+ PR D + AA +L+ I Q Sbjct: 1 MNLITEKWLPVIFSSGEKTRI-SLRDLLDNRIQ-DLAYPRPDFQGAAWQMLIGILQCTIA 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 +D E+ + + +++++ + + F+Q+ ++ + LL Sbjct: 59 PEDKEEWADIWHDGIEFEQWEKALNTISLALQFGEQKPSFLQSFDPLDSEYGSIAGLLVD 118 Query: 121 VSGATNCA-----FVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G FV + G E +C C AIALF +P G G++ G+RGG P+TT Sbjct: 119 APGGNTLKLNKDHFVKR-GNVEQICPHCAAIALFAIQTNSPAGGAGYRVGMRGGGPLTTL 177 Query: 176 V-----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES----IPASSI 226 V L + LNVL Q++ PN + W+ P K++E + + Sbjct: 178 VVPQEEDKYPLWKKLWLNVLP----QEEPPNVTQHPLIFPWLAPTKTSEKAGNVVTPDNS 233 Query: 227 GFVRGLFWQPAHIELCDPIGI-GKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 ++ + P IEL + G C CG+ + + + W HP SP Sbjct: 234 HPLQAYWGMPRRIELDFTHTVAGICDLCGEHHESLLLQMRSKNYGVQYDS-WLHPFSPYR 292 Query: 286 VTVKKGEVEEKFLAF 300 +K + +LAF Sbjct: 293 QALK--DPSAPWLAF 305 >UniRef50_Q054L1 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q054L1_LEPBL Length = 533 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 120/526 (22%), Positives = 198/526 (37%), Gaps = 60/526 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYC-----SRDQWRLSLPRDDMELAALALLVCIG 55 MNL+ D WIPV+ R K++ I+ + S LS PR D A L LV + Sbjct: 1 MNLIKDVWIPVQ-RFSEKLEEISPFEITSRIENDSDPVMSLSAPRPDFNGALLQFLVGLL 59 Query: 56 QIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND-VTPM 114 Q + +++ E+ +NP + + ++ + F L FMQ + D V + Sbjct: 60 QAVFSPENETEWEDLFVNPPSPEVLKEAMEKVKSAFELFGDGPRFMQDTNLNEEDTVFDI 119 Query: 115 EKLLAGVSGATNCA----FVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGT 170 L G F + +C C I LF+ +P G G + +RGG Sbjct: 120 SALFIESPGENTIKLKKDFFIKRNSISQICEKCAGIGLFSFQTNSPSGGQGHLTSIRGGG 179 Query: 171 PVTTFV-------RGIDLRSTVLLNVLTLPRL----QKQFPNESHTENQPTWIKPIKSNE 219 P+TTFV + L S + LNVL QK+ P +S I+ ++S Sbjct: 180 PLTTFVTSKLKNPKKNSLWSKLWLNVLPKSYFQVNGQKKIPFQSVFPWTNPKIEDLQSKG 239 Query: 220 SIPASSIGFVRGLFWQ-PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVN-GLW 277 ++W P I L + G C C + S++ + + + + + G W Sbjct: 240 KTTTPQDLHPLSVYWSYPRRILLRKDVENGICDVCNRSSSVLVRSYHTKTYGLSYSEGGW 299 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPS-----WTQISRVVVDKIIQNENGNRVAAVVN 332 HP SP + +E + + W I+ + E ++ A V+ Sbjct: 300 IHPLSPYYKS------KESWFPYHPQPGGILYHYWQTIA-------LGKEQEDQAALVIR 346 Query: 333 QF--RNIAPQSPLELIMGGYRNNQASILERRHDVLMFN---QGWQQYGNVINEIVTVGLG 387 +F R I + L G +N + ++ FN +++ +++I+ Sbjct: 347 RFLNRKIPGEQTSILTFGYDMDNMKARCWYESEIPFFNIPSDKIEKFEEQVSQILNASTE 406 Query: 388 YKTALRKALYTFAEGFKNK---DFKGAGVSVHETAERHFY----RQSELLIPDVLANVNF 440 K LR+A+ KN D VS + E FY E LI DV +F Sbjct: 407 VKKNLRQAVRNAWLDKKNDSKGDLTFLDVSFLKDTENSFYDLIRNVQENLISDV---ADF 463 Query: 441 SQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATL 486 + E L L++ LF+Q + + I + AR L Sbjct: 464 AALKETWLKL---LNESACKLFDQYADSGSFEFENIERIVKARKNL 506 >UniRef50_A5FZI3 CRISPR-associated protein, Cse1 family n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FZI3_ACICJ Length = 529 Score = 68.9 bits (167), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 4/178 (2%) Query: 1 MNLLIDNWIPVRPRNGGKVQII--NLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NLL + W+P+ ++G K I ++ S ++ PR D LA + LLV + Sbjct: 2 FNLLTNPWLPIVRQDGTKSVIAPRDITEDISSNPVIAVNWPRADFRLATMELLVGLIATA 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P D+ ++ P + ++ AP F + FMQ D P+E LL Sbjct: 62 CPPADEDDWLDAWEAPHSPEKLDGAFAPLAHAFSFDGPGPRFMQDLADLDADEEPVENLL 121 Query: 119 AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 V+G N + PG+ + + AI+L+ + +P G G ++GLRGG P+ T V Sbjct: 122 IEVAG--NSGPLVHPGRTKRMGRPAAAISLYTLQSWSPSGGRGNRTGLRGGGPMVTMV 177 >UniRef50_D0KFE0 CRISPR-associated protein, Cse1 family n=4 Tax=Enterobacteriaceae RepID=D0KFE0_PECWW Length = 523 Score = 68.6 bits (166), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 76/304 (25%), Positives = 124/304 (40%), Gaps = 25/304 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQIIA 59 +L+ + W+P +G ++ N+ L D L+ R D + A+ LL+ + Q Sbjct: 3 FSLIEEPWLPAVFADG---RMSNISPLQLPDDNIIDLAWTRADFQGASYQLLIGLLQTAY 59 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 DD ++ + L D F Q +A + FMQ +D TP+ LL Sbjct: 60 APADDDDWDAIWEDGLGTD-FSQALAALAPAMQFGAQKPAFMQDCAPLDSDSTPISGLLI 118 Query: 120 GVSGATNCA-----FVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G F+ + G A+C C A+ALF AP G G ++G+RGG P+TT Sbjct: 119 DAPGGNTLKLNKDHFIKR-GTVNAICPHCAAMALFTLQTNAPSGGQGHRTGMRGGGPITT 177 Query: 175 FVRGIDLR----STVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES----IPASSI 226 + D R + +NVLT QK P W+ +++ + + Sbjct: 178 LLMQEDGRLPLWKKLWMNVLT----QKVMPKGKPDATVFPWLAATPTSDGTHPPVTQENS 233 Query: 227 GFVRGLFWQPAHIEL-CDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 ++ + P IEL + G+C CG ES T + + + + W HP +P Sbjct: 234 HQLQAYWGMPRRIELDFTTLQTGECDLCGTESTALLTQYRTKNYGIQYDS-WRHPLTPYR 292 Query: 286 VTVK 289 +K Sbjct: 293 RALK 296 >UniRef50_B4RSK0 CRISPR-associated protein, Cse1 family n=6 Tax=Gammaproteobacteria RepID=B4RSK0_ALTMD Length = 535 Score = 65.9 bits (159), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 68/275 (24%), Positives = 107/275 (38%), Gaps = 19/275 (6%) Query: 36 LSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNH 95 +LPR D + AA + + Q D+ E++ + P TED + + F Sbjct: 38 FALPRADFQGAAYQFAIGLLQTCFAPDDEFEWKDNYLEPPTEDALRPAFSKAEHAFNATG 97 Query: 96 AEHPFMQT-KGVKANDVTPMEKLLAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFN 150 FMQ + T + LL G N + G GE + +ALF Sbjct: 98 DGPLFMQDFDSLDEAKPTSVSGLLIEAPGGNGLKLNTDHFVKRGIGEVMSLPMAVLALFT 157 Query: 151 QANQAPGFGGGFKSGLRGGTPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTE 206 AP G G ++GLRGG P+TT V L + LNV P ++ + H++ Sbjct: 158 LQINAPAGGQGHRTGLRGGGPLTTLVMPQNENSPLWQKLWLNV--APNDERYSAPDLHSD 215 Query: 207 NQPTWIKPIKSNESIPASSIGFVRG-----LFWQ-PAHIELCDPIGIGKCSCCGQESNLR 260 W+ K+ S S + + +FW P I L G+CS G+ + Sbjct: 216 TVFPWLG--KTRVSAKKGSETYQKDVHPLHMFWSMPRRIRLIVEDVAGECSLTGKSCSQL 273 Query: 261 YTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEE 295 + + + G W HP +P +KK + E+ Sbjct: 274 VKLYKTQNYGANYAGSWSHPLTPYKRDLKKPDQED 308 >UniRef50_B5ZCF9 CRISPR-associated protein, Cse1 family n=10 Tax=Acetobacteraceae RepID=B5ZCF9_GLUDA Length = 546 Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 51/183 (27%), Positives = 85/183 (46%), Gaps = 8/183 (4%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSL--PRDDMELAALALLVCIGQII 58 MNLL +W+P+R ++G I Q + D ++L PR D +A+L L+ + Sbjct: 1 MNLLTASWLPIRRKSGAAETIRPAQIVDRVADDPIMALDWPRADFRIASLEFLIGLLATA 60 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P K++ + +P + + + AP + F+L+ F+Q + P+E+LL Sbjct: 61 FPPKNEDIWCETWEDPPSVEALDEAFAPVAEAFWLDGPGPRFLQDLENLQSGQEPVERLL 120 Query: 119 AGVSGATNCA-----FVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 G + FV++ + AL +ALF + AP G G +GLRGG P+ Sbjct: 121 IDAPGDSTVKKNTDLFVHR-QRIMALGRPAACMALFTLQSWAPSGGAGNMTGLRGGGPLV 179 Query: 174 TFV 176 T V Sbjct: 180 TLV 182 >UniRef50_B8IZA3 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA3_DESDA Length = 540 Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 7/110 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQ--II 58 MNL+ D WIPV N G+ Q++NL+ C QWR R +A + LL+CI + Sbjct: 1 MNLVSDQWIPVLD-NSGQHQLVNLREALCEGAQWRDLAVRPHERVALMRLLLCIAHAALN 59 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA 108 P+++D R+ L D + W D F L H + PF+Q G+KA Sbjct: 60 GPSREDWS---RVPQ-LLPDAVAAYLQKWQDSFDLFHPQKPFLQISGLKA 105 >UniRef50_Q0W587 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W587_UNCMA Length = 533 Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 119/505 (23%), Positives = 191/505 (37%), Gaps = 73/505 (14%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVC-IGQIIAP 60 NLL + WI +G VQ L +L + + P +E LL+ I + P Sbjct: 5 NLLTEPWITSIDLSGNPVQEGILATLKNAHKIDSIFDPAPPVEFGIYRLLIAFITDVFQP 64 Query: 61 A--KD--DVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQ--TKGVKANDVTPM 114 +D D+ R R ++P DE+ A W D F L ++PF+Q GV P+ Sbjct: 65 QGLEDLADLLDRKR-LDPTALDEYA---ARWRDRFDLFDEKYPFLQQAITGVIKKPPEPI 120 Query: 115 EKLLAGVSGATNCAFVNQPGQGE------ALCGGCTAIALFNQANQAPGFGGGFKSGLRG 168 +L+ + TN + + E G IA F A G G + G Sbjct: 121 SRLMQHLPAGTNVSHFHHGRWDENSFSFEQCAKGLVTIAPFMTAG-----GAGLSPSING 175 Query: 169 GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGF 228 P V+G +L T+L NV +P K ++ W + + + Sbjct: 176 SPPWYVLVKGNNLFETLLYNVCQIPMTVKPI-----GDSPVAWRNDKRIDPGDEPKTFSI 230 Query: 229 VRGLFWQPAHIELCDPIGIGKCSCCGQE-----SNLRYTGFLKEKFTFTVNGLWPHPHSP 283 V GL W+P I+L G G C+ G++ S++ Y K GLW P Sbjct: 231 VEGLTWRPRIIQLIPGNGKGTCTYTGEKDVDTVSHMHYYPGQKS----PEPGLWVDPQ-- 284 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNEN----GNRVA----AVVNQFR 335 V KK + + L + W I ++ ++Q+ + +V+ AVV Q++ Sbjct: 285 --VAYKKTKDAIRPLRPDENKALWRDIGPLM---LLQHGDYSGKDGKVSFDRPAVVTQYK 339 Query: 336 N------IAPQSPLELIMGGYRNN-QASILERRHDVLMF--------NQGWQQYGNVINE 380 I PL L + G R + + I E H+ L N G +Q + ++ Sbjct: 340 QMVSNGMIKRSEPLRLEVYGIRTDGKMKIYEWYHEKLALPIEILKKANSG-RQIQDAMDL 398 Query: 381 IVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNF 440 +V + A++KA Y F +SV + H Q E + L Sbjct: 399 ADSVAYILRKAMKKA-YPRNAKSNESGFDNLILSVQSSYWSHLKGQFESIFLKTL----- 452 Query: 441 SQADEVIADLRDKLHQLCEMLFNQS 465 SQ DE D KL + + + + + Sbjct: 453 SQQDENDLDAYTKLMEQWKKILDDT 477 >UniRef50_Q2RY16 CRISPR-associated protein, Cse1 family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY16_RHORT Length = 555 Score = 55.5 bits (132), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 69/184 (37%), Gaps = 9/184 (4%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSL---YCSRDQWRLSLPRDDMELAALALLVCIGQI 57 NLL++ W+PVR R GK + L + L PR D A L L+ + + Sbjct: 3 FNLLLERWLPVR-RVSGKRDWVAPHQLTEGFAEDPIVGLDFPRADFNAAVLEFLIGVVYV 61 Query: 58 IAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMF-YLNHAEHPFMQTKGVKANDVTPMEK 116 P + ++ + P Q ++P F + + T + A D P+ Sbjct: 62 ALPCQKAADWVKGSLTPPAPATLQAALSPLAFAFDFDGDGPRAYQDTSDLAAADCRPITG 121 Query: 117 LLAGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPV 172 L G N + ALC A A AP G G ++ +RGG P+ Sbjct: 122 LFIDFPGENTLKNNADLFIKRRDASALCLPYAAAATITLQTYAPSGGAGHRTSIRGGGPL 181 Query: 173 TTFV 176 TT V Sbjct: 182 TTLV 185 >UniRef50_D1CGD1 CRISPR-associated protein, Cse1 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD1_THET1 Length = 533 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 67/272 (24%), Positives = 110/272 (40%), Gaps = 35/272 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWR-LSLPRDDMELAALALLVCIGQIIA 59 NL+ + WIPVRP Q++ L+ + R L P + ++ LL+ I + Sbjct: 4 FNLVDEPWIPVRPIGASTTQLMGLRDVLLGAHAIRELVDPSPLVTVSLHRLLLAILHRVF 63 Query: 60 PAKDDVEFRHRI------MNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT- 112 +DD E+ PL ED Q+ W D F L H ++PF Q ++ VT Sbjct: 64 GPRDDAEWAELYGGGSFPPQPL-EDYLQR----WHDRFDLFHEKYPFYQKGSIQRQSVTK 118 Query: 113 --PMEKLLAGVSGATNCA--FVNQPGQGEALCGGCTA--IALFNQANQAPGFG--GGFKS 164 P+ +L ++ N F + +G A A + L + FG G K Sbjct: 119 LWPVTRLAPEIASPGNATTLFDHTLPEGVAFTPDRAARYLVLLHPFTVGGLFGLLKGEKD 178 Query: 165 GLRGGTPV----TTFVRGIDLRSTVLLNVLTLPRLQKQF--PNESHTENQPTWIKPIKSN 218 P+ +RG L T++LN++ R +F P S E+ P W + + Sbjct: 179 KAADAGPLAKCAVVLLRGRTLFETLMLNMV---RYDPEFDEPCPSTPEDSPAW---ERDD 232 Query: 219 ESIPASSI--GFVRGLFWQPAHIELCDPIGIG 248 ++ P + G++ L WQ + L + G Sbjct: 233 DTQPVDRLPKGYLDYLTWQSRRVRLFPEVQDG 264 >UniRef50_B6WQ59 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ59_9DELT Length = 516 Score = 49.3 bits (116), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 40/140 (28%), Positives = 66/140 (47%), Gaps = 15/140 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIP R+G V+ +L+ + D L++ R +A + LL+C+ A Sbjct: 1 MNLVDDPWIPCIRRDG-MVRPASLRDCFTCDDIVDLAV-RPHERVALMRLLLCVSYAAAG 58 Query: 61 AKDDVE----FRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK----ANDVT 112 +D + R R+ PL E + W D F L H + PF+Q G++ + D+T Sbjct: 59 IPEDYDGWEDLRERL--PL---EVPVYLDQWRDAFELFHPQKPFLQVVGLRSASASGDLT 113 Query: 113 PMEKLLAGVSGATNCAFVNQ 132 P KL ++ +N + Sbjct: 114 PCSKLDFSLATGSNSTLFDH 133 >UniRef50_A7BA67 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA67_9ACTO Length = 556 Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust. Identities = 62/268 (23%), Positives = 109/268 (40%), Gaps = 28/268 (10%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NLL + WIPVR +G + L+ L + D L+ +A L++ I +A Sbjct: 7 NLLDEPWIPVRLVDGTITDVGLLELLRRTTDIADLACELPTQSIAIQRLILAIMYRVATP 66 Query: 62 KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTPMEKLLA 119 +D ++ + ++ + + W D FYL PFMQ ++ + V+ +EKL+A Sbjct: 67 RDTRDWVRQWDEGAPTEQMIEYLERWRDRFYLFGGRFPFMQVANLRTAKDAVSGLEKLIA 126 Query: 120 GVSGATNCAFVNQPGQGEALCGGCTAIA--LFNQANQAPGFGGGF--KSGLRGGT--PV- 172 V F + G+ A A + QA G G S ++GG P+ Sbjct: 127 DVPNGEQF-FTTRHGRALACIPASEAARWLVHAQAYDPSGIRSGAVGDSQVKGGKGYPIG 185 Query: 173 --------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKP-----IKSNE 219 +++G DL T++LN++ + + + S +W P ++ + Sbjct: 186 PAWCGHLGLVWLKGKDLDETLVLNLIPATTAELRGVDSSTDWGACSWEDPEPETSVRGDY 245 Query: 220 SI--PASS---IGFVRGLFWQPAHIELC 242 S+ PA + + R L W I L Sbjct: 246 SLLDPAGTPKELSIPRLLTWHSRRIRLV 273 >UniRef50_Q0BSC4 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC4_GRABC Length = 505 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 8/180 (4%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +NL+ D WIPV +G + I Q D + PR D+ +A L LL+ + + P Sbjct: 23 LNLIDDQWIPVLCADGSRRVIAPWQ--MAEPDVVQPDWPRPDLNIACLELLIGLVFLADP 80 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL-- 118 D ++ R +P + Q+ +AP+ F L F+Q + ++ L Sbjct: 81 PVDGEDWEAR-RDPDPQ-RLQEKLAPYAPAFNLVGDGPRFLQDLEPFTGKASSVDMLFID 138 Query: 119 -AGVSGA-TNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 A V A N + + + L A+AL+ + AP G G + +RGG P+ T V Sbjct: 139 SAAVETARKNADVMVHRSRYDRLDFPIAAMALYTFQSYAPAGGAGNFTSMRGGGPMVTLV 198 >UniRef50_A5UR17 CRISPR-associated protein, Cse1 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR17_ROSS1 Length = 525 Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust. Identities = 64/288 (22%), Positives = 108/288 (37%), Gaps = 10/288 (3%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL + WI V R+G +I L + + LS P + LL I Q I Sbjct: 7 FNLWTEPWIRVIRRDGRDDEIGIGTCLTDAHELAALSDPSPLVAGGTHRLLTAILQAIHQ 66 Query: 61 AKDDVEFRHRIMNPLTE-DEFQQLIAPWIDMFYLNHAEHPFMQTKGV---KANDVTPMEK 116 +D E + N + + Q F L PF+QT V ++ P+ + Sbjct: 67 PQDIGEIAALLHNAKFDINRLQAFEKNHAGRFDLFDPHAPFLQTGDVPLHSNHNPQPVAR 126 Query: 117 LLAGVSGATN-CAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 L A + AT F + +C C A L A G G + + G P+ Sbjct: 127 LFAEIPVATERVHFTHVTDDRHRICPACCARGLVTAPAFASSGGAGIRPSINGVPPIYVL 186 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW-IKPIKSNESIPASSIGFVRGLFW 234 G L T+ L++++ L + +Q W P ++ S++G++ L + Sbjct: 187 PAGDTLFETLTLSLVSSDYLPPG--ADPKRADQAIWNSDPPVVGKNCEVSAVGYLESLTF 244 Query: 235 QPAHIELCDPIGIGKCSCCGQESNLRYTGFLKE--KFTFTVNGLWPHP 280 + L G C+ CG+++++ L E + G+W P Sbjct: 245 PARRMRLYPQAGSVFCTNCGRQTDIFVATMLFEMGHWLSKQTGVWEDP 292 >UniRef50_D2L2X5 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X5_9DELT Length = 554 Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust. Identities = 80/328 (24%), Positives = 115/328 (35%), Gaps = 56/328 (17%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLS---------------LPRDDMEL 45 NLL +WIPVR +G +++I WR++ PR D + Sbjct: 4 FNLLTQDWIPVRRVDGTRLRI----------PPWRITDPGDGSPGQAIADIDTPRPDFKG 53 Query: 46 AALALLVCIGQIIAPAKDDVEFRHRIMNPLTED-----------EFQQLIAPWIDMFYLN 94 A L LL+ Q P D+ ++R + T + + AP F L Sbjct: 54 ALLELLIGFVQTALPPTDNRKWRLGLSANTTNEPHLAPPDYAPAALKTAFAPLTPFFNLF 113 Query: 95 HAEHPFMQT---KGVKANDVTPMEKLLAGVSGATNCAF-----VNQPGQGEALCGGCTAI 146 F+Q +A + +P+ LL G F + + + LC C A Sbjct: 114 GDRPRFLQDLTLTEAEAKEPSPIAALLMDSPGENATKFNSDFFIKRDQPPDRLCPACAAA 173 Query: 147 ALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID-LRSTVLLNVL--------TLPRLQK 197 AL AP G G + LRGG P+TT V D L TV NVL LP Sbjct: 174 ALHALQTYAPSGGAGHRVSLRGGGPLTTLVMLDDSLWKTVWANVLPLDAANVEALPANPA 233 Query: 198 QFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIG--IGKCSCCGQ 255 P T K +E + + F+ + P I L C CGQ Sbjct: 234 ALPGAVFPWLAVTRDSTAKGSE-VHREGMHFLHHYWAMPRRIVLDAETDETPSACPVCGQ 292 Query: 256 ESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 N+ + + + W HP +P Sbjct: 293 PGNVFVRQYRTKNYGNNYGKGWQHPLTP 320 >UniRef50_C6C421 CRISPR-associated protein, Cse1 family n=3 Tax=Enterobacteriaceae RepID=C6C421_DICDC Length = 511 Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust. Identities = 68/275 (24%), Positives = 110/275 (40%), Gaps = 32/275 (11%) Query: 36 LSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQL--IAPWIDMFYL 93 L+ PR D + AA LL+ + Q D+ + + L + Q L +AP + Sbjct: 35 LACPRPDFQGAAWQLLIGLLQTAYAPSDEEAWEDIWHDGLGDGWIQALDGLAPALQF--- 91 Query: 94 NHAEHP-FMQTKGVKANDVTPMEKLLAGVSGATNCA-----FVNQPGQGEALCGGCTAIA 147 A+ P FMQ D +P+ LL G FV + A+C C A+A Sbjct: 92 -GADKPAFMQDFSSLDADNSPIAGLLIDAPGGNTLKLNKDHFVKRDAV-SAICPHCAALA 149 Query: 148 LFNQANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTEN 207 L+ AP G G + G+RGG P+TT + D + V P +K + N + E Sbjct: 150 LYTLQTNAPSGGVGHRVGVRGGGPITTLLMPYDAHTPV-------PLWRKLWANVTSGER 202 Query: 208 QPT------WIKPIKSNE----SIPASSIGFVRGLFWQPAHIEL-CDPIGIGKCSCCGQE 256 W+ +++E + + ++ + P IEL G+C CG + Sbjct: 203 GRCEADVFPWLAATRTSEGDKDKVTPENAHPLQAFWGMPRRIELDFSHTESGRCDLCGDK 262 Query: 257 SNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKG 291 S+ T + + + W HP +P + K G Sbjct: 263 SDHLLTHYRTKNYGVQYEH-WRHPLTPYRQSNKDG 296 >UniRef50_A5GBL8 CRISPR-associated protein, Cse1 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBL8_GEOUR Length = 545 Score = 42.4 bits (98), Expect = 0.039, Method: Compositional matrix adjust. Identities = 34/143 (23%), Positives = 59/143 (41%), Gaps = 10/143 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MN+ D WIPV GG+ ++ +L S+ D++ R ++ + L +C+ Sbjct: 1 MNVAFDPWIPVVTITGGR-ELASLCSVLTEGDKFADLAVRPHERVSLMRLFLCVTHAALK 59 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK--------ANDVT 112 D + + L Q+ + W D F L H E P++Q G+K + + Sbjct: 60 GPKDYDEWCEVPKRLPVAA-QKYLTEWKDSFELFHKERPWLQVAGLKGVEKEGSDSGKTS 118 Query: 113 PMEKLLAGVSGATNCAFVNQPGQ 135 P+ L +S N + GQ Sbjct: 119 PLSLLDFELSTGNNSTLHDHGGQ 141 >UniRef50_B0TDT8 Crispr-associated protein, ct1972 family, putative n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TDT8_HELMI Length = 523 Score = 41.2 bits (95), Expect = 0.090, Method: Compositional matrix adjust. Identities = 79/392 (20%), Positives = 139/392 (35%), Gaps = 36/392 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 +LL + W+ VR G ++ +++L+ + +W + D+ L L + +I Sbjct: 5 FDLLTEPWVTVRDVKG-RICVVHLRDVLAKAHEWSEVI--DESPLIQFGLYRFLQALIID 61 Query: 60 --PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDM---FYLNHAEHPFMQTKGVKANDVT-- 112 P K R +M DE +L A W F L AE PF+Q + V Sbjct: 62 IFPLKGQ-RGRLELMEEGQFDE-TKLNAYWEKYGVYFDLFDAERPFLQVPPREQEKVKRK 119 Query: 113 PMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGG-GFKSGLRGGTP 171 + +L + TN + Q E + + + GG G + G P Sbjct: 120 SVAELFHQLPTGTNVIHFHHRLQDEYVLAPDVCARIMTTLSPFTTAGGQGLSPSINGNPP 179 Query: 172 VTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRG 231 + +G +L T+LLN P W + + + S + G Sbjct: 180 YYVWRKGDNLFETLLLNYWIT----------DQDRGIPAW-RDRRPSRGETRSEARLLEG 228 Query: 232 LFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKG 291 L WQP + L +G +C+ G+ + E W P+ +V KG Sbjct: 229 LTWQPRRVTLIPEMGPFQCTYSGRSCQWGVRQMVFEAGFQARVDTWRDPNV-AVVNTDKG 287 Query: 292 EVEEKFLAFTTSAPSWTQISRVVV----DKIIQNENGNRVAAVVNQ---FRNIAPQSPLE 344 F+ +W + + + K +Q +N A ++NQ + Q+ Sbjct: 288 ---RSFVRPRWGRQTWRDVGPLALIDGAGKGVQEKNSYERAPILNQASIYLECEQQTTTI 344 Query: 345 LIMGGYRNNQASILERRHDVLMFNQGWQQYGN 376 + G + L+ R++ L G +Q N Sbjct: 345 EVYGLQTDGNMKYLDWRYEELQLPAGLEQVPN 376 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46901 Uncharacterized protein ygcL n=11 Tax=Proteobact... 677 0.0 UniRef50_D0FPP1 CRISPR-associated protein, Cse1 family n=2 Tax=E... 579 e-164 UniRef50_C5SD51 CRISPR-associated protein, Cse1 family n=1 Tax=A... 545 e-153 UniRef50_Q12YB1 CRISPR-associated protein, Cse1 family n=1 Tax=M... 501 e-140 UniRef50_B3E5V2 CRISPR-associated protein, Cse1 family n=3 Tax=D... 497 e-139 UniRef50_A7ZQK3 CRISPR-associated protein, Cse1 family n=55 Tax=... 481 e-134 UniRef50_Q054L1 Putative uncharacterized protein n=2 Tax=Leptosp... 463 e-129 UniRef50_B8GIV6 CRISPR-associated protein, Cse1 family n=1 Tax=M... 439 e-121 UniRef50_D2TKK4 CRISPR-associated protein n=1 Tax=Citrobacter ro... 431 e-119 UniRef50_A1SV74 CRISPR-associated protein, Cse1 family n=2 Tax=G... 431 e-119 UniRef50_B4TTX5 Crispr-associated protein, Cse1 family n=9 Tax=S... 425 e-117 UniRef50_Q1R117 Putative uncharacterized protein n=1 Tax=Chromoh... 421 e-116 UniRef50_Q314I1 Putative uncharacterized protein n=1 Tax=Desulfo... 421 e-116 UniRef50_B4RSK0 CRISPR-associated protein, Cse1 family n=6 Tax=G... 411 e-113 UniRef50_D0KFE0 CRISPR-associated protein, Cse1 family n=4 Tax=E... 409 e-112 UniRef50_Q2FNL5 Putative uncharacterized protein n=1 Tax=Methano... 389 e-106 UniRef50_Q0W587 Putative uncharacterized protein n=1 Tax=uncultu... 369 e-100 UniRef50_B5ZCF9 CRISPR-associated protein, Cse1 family n=10 Tax=... 334 7e-90 UniRef50_A5FZI3 CRISPR-associated protein, Cse1 family n=2 Tax=A... 329 2e-88 UniRef50_Q2RY16 CRISPR-associated protein, Cse1 family n=1 Tax=R... 282 2e-74 UniRef50_Q0BSC4 Putative uncharacterized protein n=1 Tax=Granuli... 274 6e-72 UniRef50_D1CGD1 CRISPR-associated protein, Cse1 family n=1 Tax=T... 220 1e-55 UniRef50_A7BA67 Putative uncharacterized protein n=1 Tax=Actinom... 213 9e-54 UniRef50_B6WQ59 Putative uncharacterized protein n=1 Tax=Desulfo... 150 1e-34 UniRef50_B8IZA3 CRISPR-associated protein, Cse1 family n=1 Tax=D... 143 1e-32 Sequences not found previously or not previously below threshold: UniRef50_C6C421 CRISPR-associated protein, Cse1 family n=3 Tax=E... 364 6e-99 UniRef50_D2L2X5 CRISPR-associated protein, Cse1 family n=1 Tax=D... 303 9e-81 UniRef50_B8IMR5 CRISPR-associated protein, Cse1 family n=1 Tax=M... 261 6e-68 UniRef50_A5UR17 CRISPR-associated protein, Cse1 family n=1 Tax=R... 192 3e-47 UniRef50_B0TDT8 Crispr-associated protein, ct1972 family, putati... 184 8e-45 UniRef50_D0Y921 CRISPR-associated protein, Cse1 family n=2 Tax=D... 161 8e-38 UniRef50_D1CAJ3 CRISPR-associated protein, Cse1 family n=1 Tax=S... 159 2e-37 UniRef50_Q67RP3 Putative uncharacterized protein n=1 Tax=Symbiob... 154 7e-36 UniRef50_A1ARH9 CRISPR-associated protein, Cse1 family n=3 Tax=B... 149 2e-34 UniRef50_C0W6U3 CRISPR-associated Cse1 family protein n=1 Tax=Ac... 141 8e-32 UniRef50_A0LM51 CRISPR-associated protein, Cse1 family n=1 Tax=S... 133 1e-29 UniRef50_C1XYH9 CRISPR-associated protein, Cse1 family n=1 Tax=M... 132 2e-29 UniRef50_C7MTN0 CRISPR-associated protein, Cse1 family n=1 Tax=S... 130 1e-28 UniRef50_D1YEE1 CRISPR system CASCADE complex protein CasA n=1 T... 129 3e-28 UniRef50_Q53VY1 Putative uncharacterized protein TTHB188 n=1 Tax... 128 6e-28 UniRef50_C8XAY7 CRISPR-associated protein, Cse1 family n=1 Tax=N... 127 1e-27 UniRef50_Q2JWC2 CRISPR-associated protein, Cse1 family n=2 Tax=C... 125 3e-27 UniRef50_D1A5T7 CRISPR-associated protein, Cse1 family n=4 Tax=A... 123 2e-26 UniRef50_Q2JH30 Putative uncharacterized protein n=2 Tax=Frankia... 122 2e-26 UniRef50_C1XFZ8 CRISPR-associated protein, Cse1 family n=2 Tax=M... 122 4e-26 UniRef50_B6XT61 Putative uncharacterized protein n=2 Tax=Bifidob... 117 6e-25 UniRef50_C7QEM7 CRISPR-associated protein, Cse1 family n=12 Tax=... 114 6e-24 UniRef50_Q5YRB3 Putative uncharacterized protein n=1 Tax=Nocardi... 113 2e-23 UniRef50_C3PF93 CRISPR-associated protein n=3 Tax=Corynebacteriu... 112 3e-23 UniRef50_Q1J370 CRISPR-associated protein Cse1 n=1 Tax=Deinococc... 112 4e-23 UniRef50_A5GBL8 CRISPR-associated protein, Cse1 family n=1 Tax=G... 112 4e-23 UniRef50_C7LYW9 CRISPR-associated protein, Cse1 family n=1 Tax=A... 111 5e-23 UniRef50_B1VIY3 CRISPR-associated protein n=1 Tax=Corynebacteriu... 111 9e-23 UniRef50_A4XYU2 CRISPR-associated protein, Cse1 family n=3 Tax=P... 109 2e-22 UniRef50_Q4JWJ7 Putative uncharacterized protein n=2 Tax=Coryneb... 109 3e-22 UniRef50_C0VRW0 CRISPR-associated protein n=1 Tax=Corynebacteriu... 108 5e-22 UniRef50_C4FG91 Putative uncharacterized protein n=1 Tax=Bifidob... 107 1e-21 UniRef50_A8M401 CRISPR-associated protein, Cse1 family n=1 Tax=S... 106 2e-21 UniRef50_D1NTH8 CRISPR-associated protein, Cse1 family n=1 Tax=B... 106 3e-21 UniRef50_B6IWM6 CRISPR-associated protein, CT1972 family n=1 Tax... 105 5e-21 UniRef50_D1A6Q3 CRISPR-associated protein, Cse1 family n=1 Tax=T... 103 2e-20 UniRef50_A8LYZ4 CRISPR-associated protein, Cse1 family n=1 Tax=S... 103 2e-20 UniRef50_C6HV92 CRISPR-associated protein, Cas1 n=1 Tax=Leptospi... 102 3e-20 UniRef50_B5GY59 Putative uncharacterized protein n=1 Tax=Strepto... 100 8e-20 UniRef50_Q1EQS6 Putative uncharacterized protein n=2 Tax=Strepto... 98 7e-19 UniRef50_B8FDH6 CRISPR-associated protein, Cse1 family n=1 Tax=D... 98 7e-19 UniRef50_C2CN11 CRISPR-associated protein n=1 Tax=Corynebacteriu... 97 1e-18 UniRef50_B7KJ23 CRISPR-associated protein, Cse1 family n=1 Tax=C... 95 5e-18 UniRef50_C6CML6 CRISPR-associated protein, Cse1 family n=6 Tax=G... 91 9e-17 UniRef50_C7MQD3 CRISPR-associated protein, Cse1 family n=1 Tax=S... 90 2e-16 UniRef50_B0LU91 CRISPR-associated protein Cas1 n=2 Tax=Streptomy... 89 3e-16 UniRef50_Q8KB26 CRISPR-associated protein, CT1972 family n=1 Tax... 84 8e-15 UniRef50_B6ZW55 CRISPR-associated protein, Cse1 family n=2 Tax=E... 84 9e-15 UniRef50_UPI0001AF1D49 CRISPR-associated Cse1 family protein n=1... 84 1e-14 UniRef50_D2RAZ9 CRISPR system CASCADE complex protein CasA n=3 T... 84 2e-14 UniRef50_A8SDS0 Putative uncharacterized protein n=1 Tax=Faecali... 83 2e-14 UniRef50_B5GA97 Crispr-associated protein n=1 Tax=Streptomyces s... 82 6e-14 UniRef50_C6SPI8 Putative uncharacterized protein n=1 Tax=Strepto... 79 4e-13 UniRef50_C2BEU1 CRISPR-associated protein n=1 Tax=Anaerococcus l... 79 4e-13 UniRef50_Q47PJ1 CRISPR-associated protein, Cse1 family n=1 Tax=T... 77 1e-12 UniRef50_UPI0001AEDDCB hypothetical protein SalbJ_26479 n=1 Tax=... 73 2e-11 UniRef50_C8P6I4 Putative uncharacterized protein n=1 Tax=Lactoba... 72 5e-11 UniRef50_D0WFD1 CRISPR-associated protein, Cse1 family n=1 Tax=S... 72 6e-11 UniRef50_Q03C63 CRISPR-associated protein n=1 Tax=Lactobacillus ... 70 2e-10 UniRef50_B3ENH5 CRISPR-associated protein, Cse1 family n=2 Tax=C... 70 2e-10 UniRef50_Q06WG4 Putative uncharacterized protein (Fragment) n=4 ... 68 7e-10 UniRef50_B0S4B8 Putative uncharacterized protein n=1 Tax=Finegol... 67 2e-09 UniRef50_Q60AC9 CRISPR-associated protein, CT1972 family n=1 Tax... 58 8e-07 UniRef50_B5H6V1 Predicted protein n=1 Tax=Streptomyces pristinae... 57 1e-06 UniRef50_D1Y489 CRISPR-associated protein, Cse1 family n=1 Tax=P... 55 5e-06 UniRef50_C5V9N4 Putative uncharacterized protein n=1 Tax=Coryneb... 55 7e-06 UniRef50_UPI000169879C hypothetical protein Epers_00880 n=1 Tax=... 54 1e-05 UniRef50_C2GEY5 Putative uncharacterized protein n=1 Tax=Coryneb... 49 3e-04 UniRef50_A8LM39 CRISPR-associated protein, Cse1 family n=1 Tax=D... 48 0.001 UniRef50_Q0AA30 CRISPR-associated protein, Cse1 family n=1 Tax=A... 44 0.011 >UniRef50_Q46901 Uncharacterized protein ygcL n=11 Tax=Proteobacteria RepID=YGCL_ECOLI Length = 502 Score = 677 bits (1747), Expect = 0.0, Method: Composition-based stats. Identities = 502/502 (100%), Positives = 502/502 (100%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP Sbjct: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG Sbjct: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID Sbjct: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE Sbjct: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 Query: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF Sbjct: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 Query: 301 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER 360 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER Sbjct: 301 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER 360 Query: 361 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE Sbjct: 361 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 Query: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA Sbjct: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 Query: 481 LARATLYKHLRELKPQGGPSNG 502 LARATLYKHLRELKPQGGPSNG Sbjct: 481 LARATLYKHLRELKPQGGPSNG 502 >UniRef50_D0FPP1 CRISPR-associated protein, Cse1 family n=2 Tax=Erwinia pyrifoliae RepID=D0FPP1_ERWPY Length = 507 Score = 579 bits (1493), Expect = e-164, Method: Composition-based stats. Identities = 271/499 (54%), Positives = 348/499 (69%), Gaps = 2/499 (0%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D+WIPVRP +GG+ Q I LQ+L C +W ++LPRDDME+A LLVC+ Q + Sbjct: 1 MNLLTDDWIPVRPLSGGEGQQITLQTLLCDDRRWLVALPRDDMEMATFQLLVCLLQTLWM 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D + RI PL+ EF +A W F LNH + PFMQ +GV A +VT M+KLL G Sbjct: 61 PSDAQQLVQRIRQPLSAREFADGVAGWQQAFDLNHPQQPFMQVRGVAAKEVTGMDKLLVG 120 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 ++G+T+ AFVNQ GQG+ALC GCTAIALFNQA APGFGGGFKSGLRGG+PVTT V+G Sbjct: 121 LTGSTSGAFVNQSGQGKALCSGCTAIALFNQACNAPGFGGGFKSGLRGGSPVTTLVQGDC 180 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQP-TWIKPIKSNESIPASSIGFVRGLFWQPAHI 239 LR+T+ NVL+ L + P+ QP TW +PIK +++I SSIG RGLFWQPAHI Sbjct: 181 LRTTLWFNVLSETTLDEFCPDWREQRAQPFTWQQPIKKDQAIAGSSIGLARGLFWQPAHI 240 Query: 240 ELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLA 299 EL P G G+CS CG+ ++ RY FLKEKF FTVNGLW HPHSP + +KKG+VE +++A Sbjct: 241 ELSPPDGAGQCSACGRMASQRYRSFLKEKFNFTVNGLWLHPHSPLIQQIKKGQVEWRYMA 300 Query: 300 FTTSAPSWTQISRVVVDKII-QNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASIL 358 F+T APSWTQI R+++++ + + + G RVA V Q R ++ L L++GGYRNNQASI+ Sbjct: 301 FSTPAPSWTQIGRLLIEQQVNKQQEGRRVATTVEQARMLSRGRALRLMIGGYRNNQASII 360 Query: 359 ERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHET 418 ERRH+VL FNQGWQ VINEIV +GL Y+ ALR AL+ FAEG K D KGAGV++HE Sbjct: 361 ERRHEVLQFNQGWQHAMPVINEIVNLGLEYRKALRTALWIFAEGAKESDIKGAGVALHEK 420 Query: 419 AERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLIST 478 + +YRQS + ++LA +++ + + + QLC+ LFN APYAHHPKLI + Sbjct: 421 VDPQYYRQSHARVLNLLAQIDYQSPLPQLEQFQTQQQQLCQQLFNDLTAPYAHHPKLICS 480 Query: 479 LALARATLYKHLRELKPQG 497 LA AR L L +LKPQG Sbjct: 481 LAKARRYLMSSLAKLKPQG 499 >UniRef50_C5SD51 CRISPR-associated protein, Cse1 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD51_CHRVI Length = 507 Score = 545 bits (1404), Expect = e-153, Method: Composition-based stats. Identities = 218/504 (43%), Positives = 299/504 (59%), Gaps = 10/504 (1%) Query: 1 MNLLIDNWIPVRPRNG-GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 M+LL WIPVR G G +++ Q L C + W++SLPRDD+ELA L LL+C+ QI+ Sbjct: 1 MDLLKTPWIPVRAHGGSGTFRLLTYQELLCEDEDWQISLPRDDLELACLQLLICMTQIMF 60 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 +D RI PLT DEF + I+P ++ F L+H PFMQT+GV A DVTP++KLL Sbjct: 61 LPPEDDVLLDRIDIPLTPDEFTEGISPCLEWFDLDHPTQPFMQTRGVVAKDVTPIQKLLI 120 Query: 120 GVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGI 179 G+ TN AF N PG+ L AIALF+QA P FGGGFK LRG P+TT V G Sbjct: 121 GLPEGTNHAFFNAPGEVSVLSAPVAAIALFHQATNCPSFGGGFKGSLRGIAPITTLVDGR 180 Query: 180 DLRSTVLLNVLTLPRLQKQFPNESHT--ENQPTWIKPIKSNESIPASSIGFVRGLFWQPA 237 +LR + NVLT ++ FP+ H ++ PTWI+PI+S E+I A IG RGLFWQPA Sbjct: 181 NLRKRIWCNVLTPEFIRTDFPDWQHDLSQDLPTWIEPIRSKETIHAHQIGLARGLFWQPA 240 Query: 238 HIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKF 297 H+EL G C G E+ YTGF KEKF FT+ G WPHPH ++KKG +E KF Sbjct: 241 HVELVGSRESGPCDLLGIEAGPLYTGFRKEKFNFTLEGTWPHPHGVLQSSLKKGALEMKF 300 Query: 298 LAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA-----PQSPLELIMGGYRN 352 +FTT AP+WT+++ +V+ G+R A V Q +++A PL LI+GGYRN Sbjct: 301 ASFTTEAPAWTRLTEMVLRINGPKGEGSRPATPVAQAKSMAVTALEKPQPLTLIIGGYRN 360 Query: 353 NQASILERRHDVLMFNQGW-QQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGA 411 N+AS+ ERRH++L GW ++ G+ + ++V +G+ K +L+ L ++G K K G Sbjct: 361 NKASVTERRHEMLSLAAGWSEEDGSRLKDLVALGIKAKESLKDKLSFASKGHKKKMLPGI 420 Query: 412 GVSVHETAERHFYRQSELLIPDVLAN-VNFSQADEVIADLRDKLHQLCEMLFNQSVAPYA 470 G + + ER FY ++E I + L+ F Q E A D L C +F+ PY Sbjct: 421 GSPIQDVGERIFYSRTEGKIIETLSRPTTFMQWKENRAAYIDALAADCRDIFDAMTEPYT 480 Query: 471 HHPKLISTLALARATLYKHLRELK 494 P+LI +A AR +L L++LK Sbjct: 481 MKPELIPIIAWARRSLNADLKKLK 504 >UniRef50_Q12YB1 CRISPR-associated protein, Cse1 family n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YB1_METBU Length = 528 Score = 501 bits (1289), Expect = e-140, Method: Composition-based stats. Identities = 115/530 (21%), Positives = 203/530 (38%), Gaps = 51/530 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQ--SLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NL+ + WI V+ ++G + I Q S L PR D + + L+ + Q Sbjct: 3 FNLIHEKWIWVQRQDGTRSMIAPWQITDEIGSNPIISLDEPRPDFNGSMIQFLIGLVQTT 62 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 K D ++R R ++P T +E + +F L+ + FMQ ++ LL Sbjct: 63 MSPKSDGKWRKRFISPPTPEELLETFEKVAHVFDLDGDDERFMQDHEHIEGAKNRVDALL 122 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + G N + +C C ALFN AP G G ++ LRGG P+TT Sbjct: 123 MEMPGVQTLKHNADHFQKRDTVTQMCLPCCVTALFNLQLNAPAGGQGHRTSLRGGGPLTT 182 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQF-PNESHTENQPTWIKPIKSNES----IPASSIGFV 229 V G +L T+ LNV++ + + + W+ P +++E + Sbjct: 183 LVLGSNLWQTIWLNVISDENFKGLGDVDNCEISDIYPWMGPTRTSEKKNAMTTPMDVNPK 242 Query: 230 RGLFWQPAHIEL-CDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPC---- 284 + + P I L D + G C CG ES+ + ++ + + + +G W H SP Sbjct: 243 QMYWGMPRRIRLDLDDLIEGACDVCGCESDKLVSNYVTKNYGYNYDGGWCHVLSPHNENK 302 Query: 285 -----LVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP 339 G +L P S + ++ +Q + ++ + + F+N Sbjct: 303 NGLLPRHPQPGGITYRHWLGLVQHDPDKGLYSSLAFERFVQKQKD--LSDLGDVFKNTP- 359 Query: 340 QSPLELIMGGYRNNQASI-LERRHDVLMFN---QGWQQYGNVINEIVT----VGLGYKTA 391 +L GY + + + +FN + Q Y +++ +V + ++ Sbjct: 360 ----QLWAFGYDFDNMKVRCWYESTMPLFNVADELRQSYEQIVSRLVKTAEIIAYNTRSC 415 Query: 392 LRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA--DEVIAD 449 ++KAL F + D E FY I + L +V +A E+ Sbjct: 416 VKKAL--FGDNTPRGDLSFIDSRFWHDTESEFYN-----ILNQLTDVVNDEAMVLELKMK 468 Query: 450 LRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATL-YKHLRELKPQGG 498 +L ++ E LF+ S S + RA L +K LR+ GG Sbjct: 469 WHKELSRISEKLFDDSSQSMQ-----FSVIDPERAALAHKDLRKFNSDGG 513 >UniRef50_B3E5V2 CRISPR-associated protein, Cse1 family n=3 Tax=Deltaproteobacteria RepID=B3E5V2_GEOLS Length = 539 Score = 497 bits (1279), Expect = e-139, Method: Composition-based stats. Identities = 115/518 (22%), Positives = 187/518 (36%), Gaps = 41/518 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIPV G+ I Q L+ PR D + A LL+ + Q Sbjct: 1 MNLIKDAWIPVIRAKSGRGVIAPWQIAELDDPVMELAAPRPDFQGAMYQLLIGLLQTGFA 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAE-HPFMQTKGVKANDVTPMEKLLA 119 +D E+ P + + F + + FMQ + + + LL Sbjct: 61 PEDFDEWLDYWSKPPDATLLRTRLETLAAAFDFDKPDSPAFMQDYAMPDGEKKGIASLLI 120 Query: 120 GVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G N + + +C C A+ALF AP G G + GLRGG P+TT Sbjct: 121 ESPGGKTVKDNLDHFIKRDAVQHMCKSCAAMALFTLQTNAPSGGVGHRVGLRGGGPLTTL 180 Query: 176 V---RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES----IPASSIGF 228 V + L T+ LNVL + + + W+ P +++E S+ Sbjct: 181 VLPPEQMPLWQTLWLNVLDREDMPEYRQD--RVAGVFPWMGPTRTSEKNGAETTPESVHA 238 Query: 229 VRGLFWQPAHIELCDPIGI--GKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 ++ + P I L P G C C ++ + + G W HP +P Sbjct: 239 LQAYWGMPRRIRLDFPAKASMGDCDVCDVKNVALVEEYRTRNYGVNYVGNWVHPLTPYRF 298 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFR-----NIAPQS 341 KK E+ L+ + + ENG+ A +V ++ + Q Sbjct: 299 DPKK---EKPPLSLKGQQGGLGYRYWLALTLANDTENGDAAAKIVRRYSEQRATELKIQR 355 Query: 342 PLELIMGGYR-NNQASILERRHDVLMFNQGWQQYGNVI---NEIVTVGLGYKTALRK--- 394 L G+ +N + H +FN QQ ++ ++++TV + LRK Sbjct: 356 TARLWCFGFDMDNMKARCWYDHTFPLFNLAPQQRKKLLQWADDLITVANDVSSLLRKQVK 415 Query: 395 -ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ---ADEVIADL 450 A + E K D + + +E FY + D LA V+ Q E+ + Sbjct: 416 AAWFRRPEDAKG-DMNTVSLDFWQRSEPVFYE-----LLDQLAKVSGEQELPPPELYSQW 469 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYK 488 L L LF+ V A+ + + R L + Sbjct: 470 EKMLVSLSLQLFDAWVLEAANEDMDMKRIIAERDGLKR 507 >UniRef50_A7ZQK3 CRISPR-associated protein, Cse1 family n=55 Tax=Enterobacteriaceae RepID=A7ZQK3_ECO24 Length = 520 Score = 481 bits (1239), Expect = e-134, Method: Composition-based stats. Identities = 98/523 (18%), Positives = 203/523 (38%), Gaps = 47/523 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL W+PVR ++G ++ + + ++ PR D++ AA L+ + Q Sbjct: 4 FSLLTTPWLPVRFKDGTTGKLAPV--DLADENVVDIAAPRADLQGAAWQFLLGLLQTSFA 61 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 K+ + + L ++ ++ + F FMQ D + LL Sbjct: 62 PKNHGRWDDIWEDGLEAEKLREALLSLEHAFQFGADSPSFMQDFEALKGDKVQVASLLPE 121 Query: 121 VSGATNC----AFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 + GA + G E +C C+A+ALF+ AP G G+++GLRGG P+TT + Sbjct: 122 IPGAQTTKFNKDHFIKRGVTEHVCPHCSALALFSLQLNAPSGGKGYRTGLRGGGPMTTLI 181 Query: 177 R--------GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE----SIPAS 224 L + NV+ P + W+ P +++E + Sbjct: 182 ELQEYQGNQQTPLWRKLWPNVMPQDEADLPLPK-KFDDLVFPWLGPTRTSELAGAVVTHD 240 Query: 225 SIGFVRGLFWQPAHIEL-CDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 + ++ + P I + + +G C CG++S+ + + + +W HP +P Sbjct: 241 QVNKLQAYWGMPRRIRIDFNTTTVGNCDICGEQSDALLSLMTTKNYGANY-AMWQHPLTP 299 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP--QS 341 + +K+G +F + + + + ++EN + A+V + N + Q+ Sbjct: 300 YRIPLKEG---GEFYSVKPQPGGLIWRDWLGLIETGKSENNTELPALVVKLFNASSLKQA 356 Query: 342 PLELIMGGY--RNNQASILERRHDVLMFNQGWQQY------GNVINEIVTVGLGYKTALR 393 + L GY N +A H L+ + Q + I+++ ++AL+ Sbjct: 357 KVGLWGFGYDFDNMKARCWYEHHFPLLLKKKEGQIPKLRLAAQTASRILSL---LRSALK 413 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ-ADEVIADLRD 452 +A ++ +G + DF + + F ++ + Q ADE++ + Sbjct: 414 EAWFSDPKGARG-DFSFVDIDFWNKTQHRF--------LRLVRQIEEGQDADELLGKWQK 464 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 ++ F++ V + P + + AR + E + Sbjct: 465 EIWLFARQDFDERVFTNPYEPVDLKRVMTARKKYFTTSAEKQS 507 >UniRef50_Q054L1 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q054L1_LEPBL Length = 533 Score = 463 bits (1192), Expect = e-129, Method: Composition-based stats. Identities = 111/517 (21%), Positives = 194/517 (37%), Gaps = 42/517 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINL----QSLYCSRDQWRLSLPRDDMELAALALLVCIGQ 56 MNL+ D WIPV+ + +I + S LS PR D A L LV + Q Sbjct: 1 MNLIKDVWIPVQRFSEKLEEISPFEITSRIENDSDPVMSLSAPRPDFNGALLQFLVGLLQ 60 Query: 57 IIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND-VTPME 115 + +++ E+ +NP + + ++ + F L FMQ + D V + Sbjct: 61 AVFSPENETEWEDLFVNPPSPEVLKEAMEKVKSAFELFGDGPRFMQDTNLNEEDTVFDIS 120 Query: 116 KLLAGVSGATNC----AFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTP 171 L G F + +C C I LF+ +P G G + +RGG P Sbjct: 121 ALFIESPGENTIKLKKDFFIKRNSISQICEKCAGIGLFSFQTNSPSGGQGHLTSIRGGGP 180 Query: 172 VTTFV-------RGIDLRSTVLLNVLTLPRL----QKQFPNESHTENQPTWIKPIKSNES 220 +TTFV + L S + LNVL QK+ P +S I+ ++S Sbjct: 181 LTTFVTSKLKNPKKNSLWSKLWLNVLPKSYFQVNGQKKIPFQSVFPWTNPKIEDLQSKGK 240 Query: 221 -IPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV-NGLWP 278 + + + P I L + G C C + S++ + + + + G W Sbjct: 241 TTTPQDLHPLSVYWSYPRRILLRKDVENGICDVCNRSSSVLVRSYHTKTYGLSYSEGGWI 300 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQF--RN 336 HP SP + +E + + + + E ++ A V+ +F R Sbjct: 301 HPLSPYYKS------KESWFPYHPQPGGILYHYWQTI--ALGKEQEDQAALVIRRFLNRK 352 Query: 337 IAPQSPLELIMGGYRNNQASILERRHDVLMFN---QGWQQYGNVINEIVTVGLGYKTALR 393 I + L G +N + ++ FN +++ +++I+ K LR Sbjct: 353 IPGEQTSILTFGYDMDNMKARCWYESEIPFFNIPSDKIEKFEEQVSQILNASTEVKKNLR 412 Query: 394 KALYTFAEGFKNK---DFKGAGVSVHETAERHFYRQSELLIPDVLANV-NFSQADEVIAD 449 +A+ KN D VS + E FY + +++++V +F+ E Sbjct: 413 QAVRNAWLDKKNDSKGDLTFLDVSFLKDTENSFYDLIRNVQENLISDVADFAALKETWLK 472 Query: 450 LRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATL 486 L L++ LF+Q + + I + AR L Sbjct: 473 L---LNESACKLFDQYADSGSFEFENIERIVKARKNL 506 >UniRef50_B8GIV6 CRISPR-associated protein, Cse1 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV6_METPE Length = 534 Score = 439 bits (1129), Expect = e-121, Method: Composition-based stats. Identities = 107/521 (20%), Positives = 186/521 (35%), Gaps = 40/521 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQ--SLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +NL+ WIPV ++G + I + S Y L PR D A + L+ I Q Sbjct: 2 LNLIEQAWIPVIRKDGERSTIAPWELTSDYQENPIVELDAPRPDFNGALVQFLIGIVQTE 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P + V ++ P + + + I+ F L+ FMQ + + ++KLL Sbjct: 62 LPPTNPVTWKRMFRRPPEPADLKASFSTHIEAFNLDGDGPRFMQDLTLAKGEALAIDKLL 121 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G N + G + LC C A+ALF AP G G ++ LRGG P+TT Sbjct: 122 IERPGEQTVKKNTDHFLKRGGIDHLCMTCAAMALFTLQTNAPSGGRGHRTSLRGGGPLTT 181 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQFPNESH-TENQPTWIKPIKSN---ESIPASSIGFVR 230 V G L TV LNV++ L++ + + W+ +++ E + + Sbjct: 182 LVTGRTLWETVWLNVISPQELERYGNSALTSAADIFPWMGETRTSNNNEITTPQDVNPAQ 241 Query: 231 GLFWQPAHIELCDP--IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTV 288 + P I L G+C CG+ + + + F + G W H SP Sbjct: 242 MFWGMPRRIRLDLDGKPEPGECDLCGKTTERQVSTFSAKDSGVNYKGGWCHVLSPYSTNP 301 Query: 289 KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLE---- 344 K + + + P + ++N ++ AAVV+ FR Q L Sbjct: 302 KGELLAKH------AQPGGVTYRNWLGLVQNDSQNNSQPAAVVSLFRE---QRQLGLNGF 352 Query: 345 ---LIMGGYRNNQASILERRHDVL--------MFNQGWQQYGNVINEIVTVGLGYKTALR 393 L GY + + + ++ ++ +G +T+++ Sbjct: 353 QPHLWAFGYDMDNMKARCWYEGKMPLHHIDEGLLPGYEEEIARLVRTAGLIGFSVRTSIK 412 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDK 453 KAL++ E D + E F++ + L L + + + Sbjct: 413 KALFSRPEDATG-DLSFIDARFWQDTEPAFHKTLDELA--TLLK-DGGDRTTLKLNWLKS 468 Query: 454 LHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELK 494 L + LF+ +ALA L + Sbjct: 469 LRDEGKRLFDDYSQADLIDQTDPKRVALAWRDLQRFTSRFN 509 >UniRef50_D2TKK4 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK4_CITRO Length = 519 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. Identities = 99/517 (19%), Positives = 171/517 (33%), Gaps = 36/517 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +NL+ W+PVR +NG ++ + + ++ R D++ AA L+ + Q Sbjct: 4 VNLIFCQWLPVRFKNGATGKLAPV--DLADENVVDIAATRADLQGAAWQFLLGLLQSSIA 61 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 K+ + LT + + +AP F+ FMQ A + + LL Sbjct: 62 PKNYSRWEDIWEEGLTGEMLHKALAPLGHAFHFGAESPSFMQDFEPLAGEKVSIASLLPE 121 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 + GA N + G E LC C A+ALF+ AP G G+++GLRGG P+TT + Sbjct: 122 IPGAQTIKFNKDHFIKRGVTERLCPHCAALALFSLQLNAPSGGKGYRTGLRGGGPLTTLI 181 Query: 177 --------RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE-----SIPA 223 R L + LNV+ P + W+ +++E Sbjct: 182 ELQEYKGERQTPLWRKLWLNVMPQDTADLPLPAVCDA-SVFPWLAATRTSEPPANTVTTP 240 Query: 224 SSIGFVRGLFWQPAHIELCD-PIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHS 282 + ++ + P I L G C CG ES+ + + +G W HP + Sbjct: 241 EQVNKLQMYWGMPRRIRLDFATTQTGLCDICGVESDALLGFMTVKNYGVNYDG-WRHPLT 299 Query: 283 PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNR-VAAVVNQFRNIA-PQ 340 P VK + F + + +++ E A VVN F Sbjct: 300 PYRAPVK---DKSGFFSVKLQPGGLIWRDWLGLNQKNSTEANEEYPAQVVNVFNAHKLAG 356 Query: 341 SPLELIMGGYRNNQASILERR--HDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYT 398 L G + I H L+ + + L + K + Sbjct: 357 VKAGLWGFGADFDNMKIRCWYEHHFPLLMTENLLSDLRKAVQTAARLLSLLRSALKEAWF 416 Query: 399 FAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLC 458 + DF + + F L + DE + + +L Sbjct: 417 ASAKDARGDFSFIDIDFWNLTQGRFLHLIHDL-------ETGQKPDERLNQWQRELWLFT 469 Query: 459 EMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 F+ + + + AR + E + Sbjct: 470 RRYFDDRAFTNPYENNDLQRIMAARRKYFTTSAEKQS 506 >UniRef50_A1SV74 CRISPR-associated protein, Cse1 family n=2 Tax=Gammaproteobacteria RepID=A1SV74_PSYIN Length = 488 Score = 431 bits (1107), Expect = e-119, Method: Composition-based stats. Identities = 125/507 (24%), Positives = 214/507 (42%), Gaps = 39/507 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D++I + GK I+L+++ ++L D+++LA L LL + ++ Sbjct: 1 MNLLKDDFIST---SQGK---ISLKTILTGEQNYQLQYYFDEIQLAMLQLLSSLSTVVLQ 54 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTK---GVKANDVTPMEKL 117 E + + N LT ++++ + ++ + FMQ+K K D P+ KL Sbjct: 55 PTVQ-ELKDYLKNGLTPEQYEAALDKVESQWFESDC---FMQSKPPTNAKWPDA-PITKL 109 Query: 118 LAGVSGATNC---AFVNQPGQGEALCGGCTAIALFNQANQAPG--FGGGFKSGLRGGTPV 172 L+G+ T+ ++ Q E C C +N G FG +G+RGG + Sbjct: 110 LSGIECGTSANAMGLFSEIEQAEISCTDCMHALNYNLHMNIKGECFGPTGATGIRGGGAI 169 Query: 173 TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGL 232 +T + G +L+ T+L N + +S E +P W+ P+ S AS IG RGL Sbjct: 170 STLIAGENLKQTLLNNTIAKDYFNDYAQLDSDAEQRPMWVAPL-SGSVYQASKIGINRGL 228 Query: 233 FWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVN----------GLWPHPHS 282 F HI C CG ES F +EK+ G WPHP++ Sbjct: 229 FALAYHIGFNIEDKPCLCDVCGSESEQSVKTFNREKYKGNYGSTKNGREAGAGWWPHPYT 288 Query: 283 PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSP 342 P + K E A + SW ++ +V K ++ A ++ QF+ + Sbjct: 289 PRTI---KEEGAFAVCARDQNWQSWQELGSYIVGKET-DKATLEPAYIIKQFQYMKTPRQ 344 Query: 343 LELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKAL-YTFAE 401 L++GG +Q I R +D+ ++ + + +++ GL K L +A F Sbjct: 345 TNLLVGGNIADQGGITGRVYDLYSMPSSLNKHLSKVTQVLDSGLDQKNRLSQAFNKMFGA 404 Query: 402 GFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEML 461 G+ +K+F G + E A F ++ +I L +V +A E+ +L Q + + Sbjct: 405 GY-DKNFVGG---IKENAMYRFTANAQQIIQRTLLDVERKEATELRKTAVIELKQEAQRI 460 Query: 462 FNQSVAPYAHHPKLISTLALARATLYK 488 F Y H L L + LY+ Sbjct: 461 FMGVQRKYQHDLPLFKALVKGESALYR 487 >UniRef50_B4TTX5 Crispr-associated protein, Cse1 family n=9 Tax=Salmonella enterica subsp. enterica RepID=B4TTX5_SALSV Length = 511 Score = 425 bits (1093), Expect = e-117, Method: Composition-based stats. Identities = 108/517 (20%), Positives = 200/517 (38%), Gaps = 35/517 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ + W+PV +G K +I +L+ L L+ PR D + AA +L+ I Q Sbjct: 1 MNLITEKWLPVIFSSGEKTRI-SLRDLL-DNRIQDLAYPRPDFQGAAWQMLIGILQCTIA 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 +D E+ + + +++++ + + F+Q+ ++ + LL Sbjct: 59 PEDKEEWADIWHDGIEFEQWEKALNTISLALQFGEQKPSFLQSFDPLDSEYGSIAGLLVD 118 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 G N + G E +C C AIALF +P G G++ G+RGG P+TT V Sbjct: 119 APGGNTLKLNKDHFVKRGNVEQICPHCAAIALFAIQTNSPAGGAGYRVGMRGGGPLTTLV 178 Query: 177 -----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES----IPASSIG 227 L + LNVL Q++ PN + W+ P K++E + + Sbjct: 179 VPQEEDKYPLWKKLWLNVLP----QEEPPNVTQHPLIFPWLAPTKTSEKAGNVVTPDNSH 234 Query: 228 FVRGLFWQPAHIELCDPIGI-GKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 ++ + P IEL + G C CG+ + + + W HP SP Sbjct: 235 PLQAYWGMPRRIELDFTHTVAGICDLCGEHHESLLLQMRSKNYGVQYDS-WLHPFSPYRQ 293 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNE-NGNRVAAVVNQFRNIAPQSPLEL 345 +K + +LAF + + + +++ N + A VV ++ + L Sbjct: 294 ALK--DPSAPWLAFKGQPGGLSYKDWLGLMLNREDKFNKMQPAKVVRAAGQ---RNNMSL 348 Query: 346 IMGGYR-NNQASILERRHDVLMFN-QGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGF 403 + +N + +H + + + +Q+ +N ++ + + LR AL + Sbjct: 349 WCFAFDMDNAKARCWYQHRIPLISVSHEEQFLAALNTVLVLASEALSLLRNALKSAKFDC 408 Query: 404 KNK---DFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEM 460 + DF ++ + E F E L D L +Q ++ +L Sbjct: 409 PKEAKMDFSMVDIAFWQETEPAFRALQEALAVDPLRQ--DTQTRHAVSQWEAELAHYLFH 466 Query: 461 LFNQSVAPYAHHPKLI-STLALARATLYKHLRELKPQ 496 +F++ P I AR L R+ K + Sbjct: 467 VFDRDALTNPDCPDDILQRQLTARQDLASSYRKHKAR 503 >UniRef50_Q1R117 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R117_CHRSD Length = 564 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 106/535 (19%), Positives = 180/535 (33%), Gaps = 50/535 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D W+P R +G + S D L+LPR D + AA L+ + Q Sbjct: 3 MNLLTDPWLPFRRSDGSL--LYRPPSALADPDILDLALPRADFQGAAWQFLIALLQTAMT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN-DVTPMEKLLA 119 K+ + R P + +EF+ +AP+ F L+ FMQ + P+ LL Sbjct: 61 PKNTDAWLDRYQTPPSVEEFEAALAPFSRAFELDGEGPRFMQDLDPLEDVKDAPVAGLLI 120 Query: 120 GVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G N F + G+ EA+C C A+AL+ AP G G + GLRGG P+TT Sbjct: 121 DSPGANGIKNNTDFFVKRGRVEAVCPDCAALALYTMQINAPAGGAGIRVGLRGGGPLTTL 180 Query: 176 VRGID----LRSTVLLNVLTLPRLQKQFPNES----HTENQPTWIKPIKSNESIP----A 223 + D L + NV+ + + + W+ + ++ Sbjct: 181 ILPEDETKSLWERLWPNVMPADAVGQPGQTWRPPTVDDADLFFWMSDTRVSDKKGTEVFP 240 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 + + L+ P L G C CG+E + +K +G W HP +P Sbjct: 241 DQVHPLHALWSMPRRYRLLFEDESGCCDLCGRECSRLVRRLRSKKQGANYDGPWRHPLTP 300 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVN----QFRNIAP 339 K E L+ + + +G A VV ++R + Sbjct: 301 YRRLNPKKTDEL-PLSSKGQPGGLGYRHWPGLVLEDEASSGAMPARVVTHHLHKYRMVES 359 Query: 340 QSP-----------LELIMGGYRNNQASILERRHDVLMFNQGWQQYGNV--------INE 380 L + GY + + + + + ++ Sbjct: 360 ARDDGEAFDAMFRHARLWVFGYDMDNMKPRGWYSVEMPLVGVPEAHQEILRDWVKRFVDL 419 Query: 381 IVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNF 440 V + L++A + E K D E + F+ + + L + Sbjct: 420 ASDVAWQVRNQLKRAWFKRPEDAKG-DMSQIDAQFFEATQLAFFDVLRQM-SETLRDYGD 477 Query: 441 SQA--DEVIADLRDKLHQLCEMLFNQSVAPYA---HHPKLISTLALARATLYKHL 490 + A E+ L + LF+ + + + AR L +L Sbjct: 478 TPALSPEIHQKWHLTLKREALRLFDAQAISGPLEGMKMQQLERITSARRYLLAYL 532 >UniRef50_Q314I1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I1_DESDG Length = 534 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 111/509 (21%), Positives = 189/509 (37%), Gaps = 36/509 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +N+L D W+PV +G +++I L L +PR D A L LV Q + P Sbjct: 4 LNILSDQWLPVILADGKRIRIAPW-ELTADPRPVALDIPRPDFGGAMLEFLVGCMQTMCP 62 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPMEKLL 118 + ++R P + + P+I F+L FMQ +K + V + LL Sbjct: 63 PQSRKDWRSWRKTPPQPQTLRTAMEPFIPHFHLLGERPLFMQDLTLKQEEENVMGVAALL 122 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G + F + G+ E LC C A+AL+ AP G G+++ LRGG P++T Sbjct: 123 IDSPGENAIKNDTDFFVKRGRIETLCPACAAMALYTMQAFAPSGGAGYRTSLRGGGPLST 182 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQFPNESH------TENQPTWIKPIKSNESIPASSIGF 228 V G L TV NVL + P+ PT K +PA+ Sbjct: 183 LVLGETLWETVWNNVLVAESTDWRIPDGHDPLGRILPWTVPTRDSKKKGTAILPATGHNL 242 Query: 229 VRGLFWQPAHIELCDPI--GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 + + P L C CG S + + G W HP +P Sbjct: 243 LH-FWAMPRRFRLHPENLSDPAACDICGTPSTTVIRQIGAKNYGNNYEGAWQHPLTPYRE 301 Query: 287 TVK-KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA-VVNQFRNIAPQSPLE 344 K K + K + T+ W + + + ++N VAA + QFR + P S + Sbjct: 302 QGKGKLALSVKGASECTAYHQWLGL----LYGPLGSKNKTLVAAQCIRQFRELLPASAVR 357 Query: 345 LIMGGYR-NNQASILERRHDVLMFNQGWQQYGNVINEI---VTVGLGYK----TALRKAL 396 + GY +N + ++ ++ + + NEI + + AL++AL Sbjct: 358 VRAFGYDMDNMKARQWCEGEMPLYALDPAETAILQNEIELWLDAADKTRSNLIKALKQAL 417 Query: 397 YT---FAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ---ADEVIADL 450 + A + E FY + + V + Q A + + Sbjct: 418 FADGGRNAKADQTLLANASTAFWSRTESAFYSLAARFVESVQQQDDAQQIALAKMLRNEW 477 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLISTL 479 +++ + +F++ A A + + Sbjct: 478 ANRILAATDAIFSEQAASGAFDERQAPRI 506 >UniRef50_B4RSK0 CRISPR-associated protein, Cse1 family n=6 Tax=Gammaproteobacteria RepID=B4RSK0_ALTMD Length = 535 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 98/523 (18%), Positives = 181/523 (34%), Gaps = 42/523 (8%) Query: 1 MNLLIDNWI--PVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 MNLL + W+ V+ +G + + + +LPR D + AA + + Q Sbjct: 1 MNLLKEPWLLFNVQQPDGSIAEKTLPITAIAKPEVIDFALPRADFQGAAYQFAIGLLQTC 60 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKG-VKANDVTPMEKL 117 D+ E++ + P TED + + F FMQ + T + L Sbjct: 61 FAPDDEFEWKDNYLEPPTEDALRPAFSKAEHAFNATGDGPLFMQDFDSLDEAKPTSVSGL 120 Query: 118 LAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 L G N + G GE + +ALF AP G G ++GLRGG P+T Sbjct: 121 LIEAPGGNGLKLNTDHFVKRGIGEVMSLPMAVLALFTLQINAPAGGQGHRTGLRGGGPLT 180 Query: 174 TFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASS---- 225 T V L + LNV P ++ + H++ W+ + + + + Sbjct: 181 TLVMPQNENSPLWQKLWLNV--APNDERYSAPDLHSDTVFPWLGKTRVSAKKGSETYQKD 238 Query: 226 IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 + + + P I L G+CS G+ + + + + G W HP +P Sbjct: 239 VHPLHMFWSMPRRIRLIVEDVAGECSLTGKSCSQLVKLYKTQNYGANYAGSWSHPLTPYK 298 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKII---QNENGNRVAAVVNQFRNIAP--- 339 +KK + E+ L+ T V+ + A+VV F ++ Sbjct: 299 RDLKKPDQED--LSIKGQPGGITYKIWDVLTLTGSPDGGKTQQMCASVVRSFNHLVNDDI 356 Query: 340 ----QSPLELIMGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGLGY--- 388 + L + GY + + + + Q + I ++ T+ Sbjct: 357 LEDVTAQARLWVFGYDMDNMKARGWYSETMPLFQVPSAKQQHILDYIKQLQTIANDALWH 416 Query: 389 -KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVI 447 ++ ++ A + K DF + + + F+ + L+ + +A Sbjct: 417 CRSQIKSAWFDKPGDAKG-DFSFIETAFWQQTQSAFFAAVQQLMSSDSLYLTSMEAK--- 472 Query: 448 ADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHL 490 L + LF++ + + AR L K L Sbjct: 473 -QWLSTLRNVALSLFDEYALSELGSERTMEKRIGARKNLLKGL 514 >UniRef50_D0KFE0 CRISPR-associated protein, Cse1 family n=4 Tax=Enterobacteriaceae RepID=D0KFE0_PECWW Length = 523 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 104/523 (19%), Positives = 183/523 (34%), Gaps = 46/523 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +L+ + W+P +G I LQ + L+ R D + A+ LL+ + Q Sbjct: 3 FSLIEEPWLPAVFADGRMSNISPLQ--LPDDNIIDLAWTRADFQGASYQLLIGLLQTAYA 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 DD ++ + L D F Q +A + FMQ +D TP+ LL Sbjct: 61 PADDDDWDAIWEDGLGTD-FSQALAALAPAMQFGAQKPAFMQDCAPLDSDSTPISGLLID 119 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 G N + G A+C C A+ALF AP G G ++G+RGG P+TT + Sbjct: 120 APGGNTLKLNKDHFIKRGTVNAICPHCAAMALFTLQTNAPSGGQGHRTGMRGGGPITTLL 179 Query: 177 RGI----DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA----SSIGF 228 L + +NVLT QK P W+ +++ + Sbjct: 180 MQEDGRLPLWKKLWMNVLT----QKVMPKGKPDATVFPWLAATPTSDGTHPPVTQENSHQ 235 Query: 229 VRGLFWQPAHIELCDPI-GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVT 287 ++ + P IEL G+C CG ES T + + + + W HP +P Sbjct: 236 LQAYWGMPRRIELDFTTLQTGECDLCGTESTALLTQYRTKNYGIQYDS-WRHPLTPYRRA 294 Query: 288 VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNE-NGNRVAAVVNQFRNIAPQSPLELI 346 +K + FL+ + + ++++ N AAVV L Sbjct: 295 LKGDD--APFLSVKGQPGGLAYRDWLGMMVSVEDKLNHTYPAAVVQHNAGKRALRNAGLW 352 Query: 347 MGGYR-NNQASILERRHDVLMF---------NQGWQQYGNVINEIVTVGLG----YKTAL 392 GY +N + H V + ++Y + + V + + + Sbjct: 353 CFGYDMDNMKARCWYEHHVPLLFSPARFQSDALSLREYKDHLQLAVELARDSATLLRQMI 412 Query: 393 RKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRD 452 ++A ++ + K DF V+ + + F R L + + Sbjct: 413 KEAWFSRPKDAKG-DFSAIDVAFWQETQPDFMRLCRSL-------AQGDTPATALNIWKK 464 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 L++ ++ V + ++ A R L + K Sbjct: 465 SLYRYLLNNYDARVFSNPDEHRDLAKAAKTRKKLSAFFYKQKA 507 >UniRef50_Q2FNL5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNL5_METHJ Length = 532 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 102/515 (19%), Positives = 187/515 (36%), Gaps = 37/515 (7%) Query: 1 MNLLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 ++LL D WIPV +NG G + + S Y + L+ R D A + L+ + Q + Sbjct: 2 IHLLHDAWIPVVRKNGDSGLIAPHQITSDYDTNPVIELNASRPDFNGALIQFLIGLIQTV 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P + D E+ R+ N + D + D F L+ FMQ + ++ LL Sbjct: 62 CPPESDKEWTDRLDNVIPSDVLKGHFKQIQDAFSLDGKGPRFMQDISIGDEKKNSVDGLL 121 Query: 119 AGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + G N F + + +C C A+ALF PG G G K+GLRGG P+T+ Sbjct: 122 IEMPGENTVKKNTDFFVKRDTVKQMCPSCAAMALFTLQVNGPGGGAGHKTGLRGGGPLTS 181 Query: 175 FVRGIDLRSTVLLNVLTLPRL--QKQFPNESHTENQPTWIKPIKSNESIPASS---IGFV 229 + G L TV LN++ + + + W I+ ++ + + + Sbjct: 182 VILGETLWETVWLNIIPSIKFFGDAIAKQKKSMDMIFPWFGKIRLSDKKEKTGVIDVNPL 241 Query: 230 RGLFWQPAHIELCDPIGI-GKCSCCGQESNLRYTGFLKEKFTFTVNGLW-PHPHSPCLVT 287 + + I L G C CG S+ + + + W H +P Sbjct: 242 QMFWGMGRRILLDFEDKPVGACDVCGLASSATVLTYHTKPHGVDYDETWDNHTLTPYY-- 299 Query: 288 VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN----IAPQS-- 341 ++++ P + I + +V+ V F+N + +S Sbjct: 300 -----LDKEVYRPIHLQPGGITYRNFMGLIIPDSSRNIKVSRTVTNFQNNIIRLKRKSFK 354 Query: 342 PLELIMGGYR-NNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGLGYKTALRKAL 396 + L GY +N + + ++ + + + N+++ ++ +L ++ Sbjct: 355 KVRLWSFGYDMDNAKARCWYESRMPIYLLDDEKKRELFENIVSNLIFTAEYVLQSLAGSI 414 Query: 397 YTFAEGFKNKDFKGAGV--SVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKL 454 G NK + + E FY + L L NV Q D++ + L Sbjct: 415 KDAISGHGNKGKEPVDLRSRFWNETEELFYATLDRLFF-SLGNVE--QIDQIKREWYRML 471 Query: 455 HQLCEMLFNQSVA-PYAHHPKLISTLALARATLYK 488 +LF+ ++ AR K Sbjct: 472 VGHSVLLFDYYTQVSLISDLNDPKSVIDARIKFKK 506 >UniRef50_Q0W587 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W587_UNCMA Length = 533 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. Identities = 107/497 (21%), Positives = 178/497 (35%), Gaps = 57/497 (11%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NLL + WI +G VQ L +L + + P +E LL+ + Sbjct: 5 NLLTEPWITSIDLSGNPVQEGILATLKNAHKIDSIFDPAPPVEFGIYRLLIAFITDVFQP 64 Query: 62 KDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQ--TKGVKANDVTPMEKLL 118 + + + L + A W D F L ++PF+Q GV P+ +L+ Sbjct: 65 QGLEDLADLLDRKRLDPTALDEYAARWRDRFDLFDEKYPFLQQAITGVIKKPPEPISRLM 124 Query: 119 AGVSGATNCAFVNQPGQGE------ALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPV 172 + TN + + E G IA F A G G + G P Sbjct: 125 QHLPAGTNVSHFHHGRWDENSFSFEQCAKGLVTIAPFMTA-----GGAGLSPSINGSPPW 179 Query: 173 TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGL 232 V+G +L T+L NV +P K ++ W + + + V GL Sbjct: 180 YVLVKGNNLFETLLYNVCQIPMTVKPI-----GDSPVAWRNDKRIDPGDEPKTFSIVEGL 234 Query: 233 FWQPAHIELCDPIGIGKCSCCGQE-----SNLRYTGFLKEKFTFTVNGLWPHPHSPCLVT 287 W+P I+L G G C+ G++ S++ Y K GLW P V Sbjct: 235 TWRPRIIQLIPGNGKGTCTYTGEKDVDTVSHMHYYPGQKS----PEPGLWVDPQ----VA 286 Query: 288 VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQ-NENGNRVA----AVVNQFRNI----- 337 KK + + L + W I +++ + + +V+ AVV Q++ + Sbjct: 287 YKKTKDAIRPLRPDENKALWRDIGPLMLLQHGDYSGKDGKVSFDRPAVVTQYKQMVSNGM 346 Query: 338 -APQSPLELIMGGYRNN-QASILERRHDVLMFN-------QGWQQYGNVINEIVTVGLGY 388 PL L + G R + + I E H+ L +Q + ++ +V Sbjct: 347 IKRSEPLRLEVYGIRTDGKMKIYEWYHEKLALPIEILKKANSGRQIQDAMDLADSVAYIL 406 Query: 389 KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIA 448 + A++KA Y F +SV + H Q E + L SQ DE Sbjct: 407 RKAMKKA-YPRNAKSNESGFDNLILSVQSSYWSHLKGQFESIFLKTL-----SQQDENDL 460 Query: 449 DLRDKLHQLCEMLFNQS 465 D KL + + + + + Sbjct: 461 DAYTKLMEQWKKILDDT 477 >UniRef50_C6C421 CRISPR-associated protein, Cse1 family n=3 Tax=Enterobacteriaceae RepID=C6C421_DICDC Length = 511 Score = 364 bits (933), Expect = 6e-99, Method: Composition-based stats. Identities = 100/537 (18%), Positives = 192/537 (35%), Gaps = 62/537 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +L+ W+ V +G + +I Q L+ PR D + AA LL+ + Q Sbjct: 2 FSLIDTPWLSVVGADGHRTRISPRQ--LTDDRIIDLACPRPDFQGAAWQLLIGLLQTAYA 59 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D+ + + L D + Q + + FMQ D +P+ LL Sbjct: 60 PSDEEAWEDIWHDGLG-DGWIQALDGLAPALQFGADKPAFMQDFSSLDADNSPIAGLLID 118 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 G N + A+C C A+AL+ AP G G + G+RGG P+TT + Sbjct: 119 APGGNTLKLNKDHFVKRDAVSAICPHCAALALYTLQTNAPSGGVGHRVGVRGGGPITTLL 178 Query: 177 RGI------DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE----SIPASSI 226 L + NV + R + W+ +++E + + Sbjct: 179 MPYDAHTPVPLWRKLWANVTSGER-------GRCEADVFPWLAATRTSEGDKDKVTPENA 231 Query: 227 GFVRGLFWQPAHIELCDPI-GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 ++ + P IEL G+C CG +S+ T + + + W HP +P Sbjct: 232 HPLQAFWGMPRRIELDFSHTESGRCDLCGDKSDHLLTHYRTKNYGVQYE-HWRHPLTPYR 290 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQF--RNIAPQSPL 343 + K G + L+ + + + ++ + + A V + + + + Sbjct: 291 QSNKDGML----LSVKGQPGGLSYRDWLGLVLGTKDTLNDTLPACVVSLSHQRVPLRQKV 346 Query: 344 ELIMGGYR-NNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLG--------YKTALRK 394 L GY +N + H V + W++ + + + +G+ + +++ Sbjct: 347 GLWCFGYDMDNMKARCWYEHRVPV----WKEITPAVRDYLPLGVQMAHDAQQLLRQSVKA 402 Query: 395 ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKL 454 A ++ + DF VS + E F+RQ I + + R++L Sbjct: 403 AWFSRPKDV-GGDFSVIDVSFWQETE-QFFRQLYRSI------AGGNCPIAALNAWRNQL 454 Query: 455 HQLCEMLFNQSVAPYAHHPKLISTLALARATL---------YKHLRELKPQGGPSNG 502 + F++ ++ AR+ + K L+ L+PQ +NG Sbjct: 455 YMYLIGTFDRLTFGNPDQQGDLTRAVEARSEMVKLFYGQKSMKKLKALQPQEVSANG 511 >UniRef50_B5ZCF9 CRISPR-associated protein, Cse1 family n=10 Tax=Acetobacteraceae RepID=B5ZCF9_GLUDA Length = 546 Score = 334 bits (855), Expect = 7e-90, Method: Composition-based stats. Identities = 91/504 (18%), Positives = 166/504 (32%), Gaps = 52/504 (10%) Query: 1 MNLLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 MNLL +W+P+R ++G ++ + L PR D +A+L L+ + Sbjct: 1 MNLLTASWLPIRRKSGAAETIRPAQIVDRVADDPIMALDWPRADFRIASLEFLIGLLATA 60 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P K++ + +P + + + AP + F+L+ F+Q + P+E+LL Sbjct: 61 FPPKNEDIWCETWEDPPSVEALDEAFAPVAEAFWLDGPGPRFLQDLENLQSGQEPVERLL 120 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G N + AL +ALF + AP G G +GLRGG P+ T Sbjct: 121 IDAPGDSTVKKNTDLFVHRQRIMALGRPAACMALFTLQSWAPSGGAGNMTGLRGGGPLVT 180 Query: 175 FV---RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESI-----PASSI 226 V G L V N P+E+ W+ P + P + Sbjct: 181 LVLPREGASLWEMVWAN-----TPFGVPPSEADLPRVFPWLAPTIGSGKDGTSVRPGHNA 235 Query: 227 GFVRGLFWQPAHIELCDP-IGIGKCSCCGQESNLRYTGFLKEKFTFTV---------NGL 276 ++ + P I L G C GQ + G+ + + + G Sbjct: 236 HPLQCWWGMPRRIRLDFEAAEDGICDLTGQPDAVLVPGWRQRPYGASYADWTGMPYGAGA 295 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVV----N 332 HP +P K E +L+ + + + + V+ + Sbjct: 296 SIHPLTPRYRQKKDAE----WLSVHPQPGGIGYRHWAGIVVNSSDTHRLPASTVLSWRND 351 Query: 333 QFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQ----GWQQYGNVINE-----IVT 383 + RN+A L+ GY + + Q+ + + Sbjct: 352 RARNVAASLTPRLLAAGYDMDNMKARSFVESEMPLPGVVDPVRQEALDALARAYVEAADQ 411 Query: 384 VGLGYKTALRKALYTFAEGFKNKD-FKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ 442 V + +R+AL+ + F G E F+ + + Sbjct: 412 VAGILRQCVREALFGKGTISPDATLFSGLRERFWAQTEGTFFDLLHQAVL-----LGDGD 466 Query: 443 ADEVIADLRDKLHQLCEMLFNQSV 466 ++ L ++ LF+ +V Sbjct: 467 DIDLRRIWLRALRRVALDLFDSAV 490 >UniRef50_A5FZI3 CRISPR-associated protein, Cse1 family n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FZI3_ACICJ Length = 529 Score = 329 bits (842), Expect = 2e-88, Method: Composition-based stats. Identities = 101/525 (19%), Positives = 177/525 (33%), Gaps = 48/525 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIIN--LQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NLL + W+P+ ++G K I + S ++ PR D LA + LLV + Sbjct: 2 FNLLTNPWLPIVRQDGTKSVIAPRDITEDISSNPVIAVNWPRADFRLATMELLVGLIATA 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P D+ ++ P + ++ AP F + FMQ D P+E LL Sbjct: 62 CPPADEDDWLDAWEAPHSPEKLDGAFAPLAHAFSFDGPGPRFMQDLADLDADEEPVENLL 121 Query: 119 AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV-- 176 V+G N + PG+ + + AI+L+ + +P G G ++GLRGG P+ T V Sbjct: 122 IEVAG--NSGPLVHPGRTKRMGRPAAAISLYTLQSWSPSGGRGNRTGLRGGGPMVTMVAP 179 Query: 177 -RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS---NESIPASSIGFVRGL 232 R L + NV + P W+ P + +E + + + Sbjct: 180 GRHRSLWHHIWANV-----PLGRKPEPVDFPRIFPWLSPTITSVNDEVVTPDDVAHPLQV 234 Query: 233 FW-QPAHIELCDPI--GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW-PHPHSPCLVTV 288 +W P I L C G + TG+ + G HP +P Sbjct: 235 WWGMPRRIRLSFVQLPSPAPCDLTGALDSSVVTGWRQRPHGPKYVGWGARHPLTPTYQNK 294 Query: 289 KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVV-----NQFRNIAPQSPL 343 E+ L+ + + + + + R A +V +++R Sbjct: 295 AGEEI----LSVHPNPGGVGYRNWIGLVLRSPDGL-RRPAPIVSTWRNDRYRGTEEAKGA 349 Query: 344 ELIMGGYRNNQASILERRH-DVLMFNQGWQQYGNVINEIVT--------VGLGYKTALRK 394 LI GGY + +V + ++ ++ + T + A+++ Sbjct: 350 RLIAGGYDTDNMKARGFMETEVPLVLASSKEVQERLDALATSLVRASERASYQLRKAVQQ 409 Query: 395 ALYTFAEGFKNK--DFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRD 452 ALY K G V E F+ + ++ + Sbjct: 410 ALYHPGAKVKATAHGIALLGDRVWLETESAFFSALD-------RAMSLDDTAPERVAWQV 462 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQG 497 +L L +F+ +V + AR L L G Sbjct: 463 RLRGLALRIFDDTVPIDPLDRNN-ARQVRARFFLGLGLSGYGKDG 506 >UniRef50_D2L2X5 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X5_9DELT Length = 554 Score = 303 bits (776), Expect = 9e-81, Method: Composition-based stats. Identities = 101/531 (19%), Positives = 170/531 (32%), Gaps = 55/531 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSR-----DQWRLSLPRDDMELAALALLVCIG 55 NLL +WIPVR +G +++I + + PR D + A L LL+ Sbjct: 4 FNLLTQDWIPVRRVDGTRLRIPPWRITDPGDGSPGQAIADIDTPRPDFKGALLELLIGFV 63 Query: 56 QIIAPAKDDVEFRHRI---------MNPL--TEDEFQQLIAPWIDMFYLNHAEHPFMQTK 104 Q P D+ ++R + + P + AP F L F+Q Sbjct: 64 QTALPPTDNRKWRLGLSANTTNEPHLAPPDYAPAALKTAFAPLTPFFNLFGDRPRFLQDL 123 Query: 105 GV---KANDVTPMEKLLAGVSGAT----NCAFVNQPGQG-EALCGGCTAIALFNQANQAP 156 + +A + +P+ LL G N F + Q + LC C A AL AP Sbjct: 124 TLTEAEAKEPSPIAALLMDSPGENATKFNSDFFIKRDQPPDRLCPACAAAALHALQTYAP 183 Query: 157 GFGGGFKSGLRGGTPVTTFVRGID-LRSTVLLNVLTLPRLQ---KQFPNESHTENQPTWI 212 G G + LRGG P+TT V D L TV NVL L + W+ Sbjct: 184 SGGAGHRVSLRGGGPLTTLVMLDDSLWKTVWANVLPLDAANVEALPANPAALPGAVFPWL 243 Query: 213 KPIKSN----ESIPASSIGFVRGLFWQPAHIELCDPI--GIGKCSCCGQESNLRYTGFLK 266 + + + + F+ + P I L C CGQ N+ + Sbjct: 244 AVTRDSTAKGSEVHREGMHFLHHYWAMPRRIVLDAETDETPSACPVCGQPGNVFVRQYRT 303 Query: 267 EKFTFTVNGLWPHPHSPCLVTVK-KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGN 325 + + W HP +P K + K + + W V ++ Sbjct: 304 KNYGNNYGKGWQHPLTPYRDQGPGKEALTIKGESEGRAYNQWLGF----VYGATDDKKPV 359 Query: 326 RVAAVVNQFRNIAPQ---SPLELIMGGYR-NNQASILERRHDVLMFNQG-------WQQY 374 A VV +R +P +P L G+ +N + + + + + Sbjct: 360 IPARVVTHYRTGSPPGQETPARLRTFGWDMDNMKARNWCEGEYPILDLKGREAKRFIGEV 419 Query: 375 GNVINEIVTVGLGYKTALRKALYTFAEGFKNKD---FKGAGVSVHETAERHFYRQSELLI 431 ++ + A+ +AL++ D E FY ++ Sbjct: 420 APLVKAAEEACNNLRKAVHEALFSEKGPKPKPDATLLALVETRFWAETETAFYTSVRSIL 479 Query: 432 PDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALA 482 ++ + + R L +F P+ + A Sbjct: 480 EA--SDDDEEARLGIALGWRRTLLDAVGAIFAAVAEDGGTTPRKTRQIYAA 528 >UniRef50_Q2RY16 CRISPR-associated protein, Cse1 family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY16_RHORT Length = 555 Score = 282 bits (722), Expect = 2e-74, Method: Composition-based stats. Identities = 95/541 (17%), Positives = 159/541 (29%), Gaps = 63/541 (11%) Query: 1 MNLLIDNWIPVRPRNGGK--VQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NLL++ W+PVR +G + V L + L PR D A L L+ + + Sbjct: 3 FNLLLERWLPVRRVSGKRDWVAPHQLTEGFAEDPIVGLDFPRADFNAAVLEFLIGVVYVA 62 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQT-KGVKANDVTPMEKL 117 P + ++ + P Q ++P F + Q + A D P+ L Sbjct: 63 LPCQKAADWVKGSLTPPAPATLQAALSPLAFAFDFDGDGPRAYQDTSDLAAADCRPITGL 122 Query: 118 LAGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 G N + ALC A A AP G G ++ +RGG P+T Sbjct: 123 FIDFPGENTLKNNADLFIKRRDASALCLPYAAAATITLQTYAPSGGAGHRTSIRGGGPLT 182 Query: 174 TFVRGI----------DLRSTVLLNVLTLPRLQKQFPNESHTEN------QPTWIKPIKS 217 T V L + NV + P + W+ + Sbjct: 183 TLVAPRRRLAGGGEVATLWDRIWANV-PDQKWDGSDPIAGDPADHANWPLVFPWLAAAIT 241 Query: 218 N---ESIPASSIGFVRGLFWQPAHIELCDP--IGIGKCSCCGQESNLRYTGFLKEKFTFT 272 + + + + + F P + L C G + GF + + Sbjct: 242 SSHGQIVAPADATKRQSFFGCPRRLRLVFQQASPDHPCVLGGPAGAIMAVGFRTQNYGAN 301 Query: 273 VNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVN 332 G W HP SP K G + + W I + E +R Sbjct: 302 YEG-WTHPLSPYRDDKKAGRLPIHPHGGAATYGDWLAIWGYDGTPAVGVEIWDR-----R 355 Query: 333 QFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYG----NVINEIVTVGLGY 388 + A + + G+ + A + L + + + I +++ Sbjct: 356 RALLGATLAGDAIEAFGFDMDNAKARQWLDIRLPWVGVYGEDAATLRTAIAQMIGATQKA 415 Query: 389 KTALRKALYTFAEGFKNKDFKGA------------------GVSVHETAERHFYRQSELL 430 LR A+ G + D K + + E F R Sbjct: 416 SQRLRLAIRLALWGQRATDPKTGKPGFRLPDELPADAATIDVTPIWQETEGPFRRH---- 471 Query: 431 IPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHL 490 + D++A + A V L F+ +V L AR L + L Sbjct: 472 VQDLIAKPDGHLA--VRKLWLKTLRGQTLRQFDTTVDLDGLTDADPHRLLFARDGLSRAL 529 Query: 491 R 491 Sbjct: 530 A 530 >UniRef50_Q0BSC4 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC4_GRABC Length = 505 Score = 274 bits (700), Expect = 6e-72, Method: Composition-based stats. Identities = 96/510 (18%), Positives = 165/510 (32%), Gaps = 58/510 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +NL+ D WIPV +G + I Q D + PR D+ +A L LL+ + + P Sbjct: 23 LNLIDDQWIPVLCADGSRRVIAPWQ--MAEPDVVQPDWPRPDLNIACLELLIGLVFLADP 80 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D ++ R Q+ +AP+ F L F+Q + ++ L Sbjct: 81 PVDGEDWEARRD--PDPQRLQEKLAPYAPAFNLVGDGPRFLQDLEPFTGKASSVDMLFID 138 Query: 121 VSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 + N + + + L A+AL+ + AP G G + +RGG P+ T V Sbjct: 139 SAAVETARKNADVMVHRSRYDRLDFPIAAMALYTFQSYAPAGGAGNFTSMRGGGPMVTLV 198 Query: 177 RGID-LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN----ESIPASSIGF-VR 230 L V +NV + + W++P + + +++P F Sbjct: 199 DPERMLWDLVWVNVSCGHSAKME---------TLPWMRPTRVSHTGQQTLPPDGELFGAE 249 Query: 231 GLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKK 290 F P + L + TG +++ LW HP SP Sbjct: 250 AFFGMPRRLRL-------------THNEGAVTGVIQKPGGTDY-ALWKHPLSPYYRKKSG 295 Query: 291 GEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGY 350 E +L A + + + V + E G+ ++ + R L+ G Sbjct: 296 EE----WLPKHPRAGHFGYRNWLGV---VVKEKGSDLSELALCLREDRIGGGSILVAGWS 348 Query: 351 RNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKG 410 +N + + I +++ ALR AL Sbjct: 349 MDNMKPRDFILSRQRRLSAIPAEAEYRIVDLIQAADAVAVALRNALTP----------VL 398 Query: 411 AGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAP-- 468 AG E F+RQ+E + + + L + F+ P Sbjct: 399 AGGEAREAEREEFFRQTETKFLTHVQAIERGEDPA--EAWLADLRRQALGQFDAKALPGL 456 Query: 469 YAHHPKLISTLALARATLYKHLRELKPQGG 498 K I + R L L +GG Sbjct: 457 NQRDVKAIGRITEGRRYLGLVLAGYGKEGG 486 >UniRef50_B8IMR5 CRISPR-associated protein, Cse1 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IMR5_METNO Length = 560 Score = 261 bits (666), Expect = 6e-68, Method: Composition-based stats. Identities = 90/487 (18%), Positives = 168/487 (34%), Gaps = 57/487 (11%) Query: 1 MNLLIDNWIPVRPRNGGK--VQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +LL + WIPV +G ++ + + + + R D++ A + + Sbjct: 4 FSLLTEPWIPVLRADGTHACIRPAEITADIAANPVVAPAWGRPDLDAATREYWIALFGTA 63 Query: 59 APA-KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKL 117 + +R + +P + AP F L+ F Q A + P+ +L Sbjct: 64 CGSWAGPGAWREHLRHPPAPEVLDAAFAPLAPAFILDGEGPRFGQDLEDIAGETVPVGQL 123 Query: 118 LAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 L GA N + G+ E L AIAL AP G G + +RGG P+T Sbjct: 124 LIEAPGANTIKRNLDHFVRRGRVETLSRAGAAIALHTLQTYAPSGGAGHRVSVRGGGPLT 183 Query: 174 TFV-----------RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE--- 219 T + R + L T+ L S + W+ P +++E Sbjct: 184 TLLLPGPPRGGDPARPVPLWQTLWL--------ATPACEASSLKRVFPWLAPTRTSEQKR 235 Query: 220 SIPASSIGFVRGLFWQPAHIELCDP-IGIG-KCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 S + ++ + P + L G C G+ + + + G + Sbjct: 236 VTTPSDVDPLQAFWGMPRRVRLVFEANTEGHPCDLTGRIDPVVVRAYRTRPHGTSYVG-F 294 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENG-NRVAAVVNQFRN 336 HP SP G+ +E FL V + Q + R A V + Sbjct: 295 THPLSPHYR----GKADEPFLPVHGQPGRVGYRHWVGLVVSDQAASPLRRPADAVTLGLS 350 Query: 337 I------APQSPLELIMGGYR-NNQASILERRHDVLMFNQGWQQYGN---VINEIVTVGL 386 + L+ GY +N + ++ + ++ + +++++ Sbjct: 351 RLEGVGGPTAAQARLLATGYDMDNMKARAFIESEMPLHLPPPGRFSDLNGAVSDMIKGAY 410 Query: 387 G----YKTALRKALY---TFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVN 439 +T +R AL+ T +GF+N G + + A F+ ++E + LA ++ Sbjct: 411 AAEGLLRTGVRAALFVKATAGDGFQNAPKGGGAIDL---ARARFWERTEAAFGEALAALS 467 Query: 440 FSQADEV 446 AD Sbjct: 468 EDLADPN 474 >UniRef50_D1CGD1 CRISPR-associated protein, Cse1 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD1_THET1 Length = 533 Score = 220 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 96/534 (17%), Positives = 179/534 (33%), Gaps = 45/534 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWR-LSLPRDDMELAALALLVCIGQIIA 59 NL+ + WIPVRP Q++ L+ + R L P + ++ LL+ I + Sbjct: 4 FNLVDEPWIPVRPIGASTTQLMGLRDVLLGAHAIRELVDPSPLVTVSLHRLLLAILHRVF 63 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT---PME 115 +DD E+ + + W D F L H ++PF Q ++ VT P+ Sbjct: 64 GPRDDAEWAELYGGGSFPPQPLEDYLQRWHDRFDLFHEKYPFYQKGSIQRQSVTKLWPVT 123 Query: 116 KLLAGVSGATNCA--FVNQPGQGEALCGGCTA--IALFNQANQAPGFG--GGFKSGLRGG 169 +L ++ N F + +G A A + L + FG G K Sbjct: 124 RLAPEIASPGNATTLFDHTLPEGVAFTPDRAARYLVLLHPFTVGGLFGLLKGEKDKAADA 183 Query: 170 TPV----TTFVRGIDLRSTVLLNVLT-LPRLQKQFPNESHTENQPTWIKPIKSNESIPAS 224 P+ +RG L T++LN++ P + P S E+ P W + +++ P Sbjct: 184 GPLAKCAVVLLRGRTLFETLMLNMVRYDPEFDE--PCPSTPEDSPAWE---RDDDTQPVD 238 Query: 225 SI--GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHS 282 + G++ L WQ + L + G+ E+ + + Sbjct: 239 RLPKGYLDYLTWQSRRVRLFPEVQDGRVVVREVIIVKGCQLRPTEEI-ANYETMVAFRKN 297 Query: 283 PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNR--VAAVVNQFRNIAPQ 340 P KGE + F W + + + +A ++ R + + Sbjct: 298 PR---AGKGENPRPPVGFREDRAMWRDSHVIFQSTDTHTQPRSLRWIAELIAMGR-LQEE 353 Query: 341 SPLELIMGGYRNNQASILERRHDVLMFNQG-------WQQYGNVINEIVTVGLGYKTALR 393 L L + G +QA+I RH+VL + + Q + +VG L Sbjct: 354 HRLPLEIYGIITDQANIKLWRHEVLPVSTRYFSDRNLYSQLQRALAMAESVGQELDRTLE 413 Query: 394 KALYTFAEGFKNKDFKGAGVSVHE--TAERHFYRQSELLIPDVLANVNFSQADEVIA--- 448 + + +++ T + ++ A+ Q + Sbjct: 414 QLARDLLPNPNRDEINNLRKAINATPTYWASLEVPFHHFLLELEADRQLVQGRPIYGYGY 473 Query: 449 ---DLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGP 499 + D + Q + +V+ + + + AR L L Q P Sbjct: 474 AMRNWMDAIKQAGRLALEFAVSGLDGNARNLRAAVNARGRFNGRLNTLLAQEAP 527 >UniRef50_A7BA67 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA67_9ACTO Length = 556 Score = 213 bits (543), Expect = 9e-54, Method: Composition-based stats. Identities = 59/270 (21%), Positives = 103/270 (38%), Gaps = 28/270 (10%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NLL + WIPVR +G + L+ L + D L+ +A L++ I +A Sbjct: 7 NLLDEPWIPVRLVDGTITDVGLLELLRRTTDIADLACELPTQSIAIQRLILAIMYRVATP 66 Query: 62 KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPMEKLLA 119 +D ++ + ++ + + W D FYL PFMQ ++ V+ +EKL+A Sbjct: 67 RDTRDWVRQWDEGAPTEQMIEYLERWRDRFYLFGGRFPFMQVANLRTAKDAVSGLEKLIA 126 Query: 120 GVSGATNCAFVNQPGQGEALCGGCTAIA-LFNQANQAPGF---GGGFKSGLRGG-----T 170 V F + G+ A A L + P G S ++GG Sbjct: 127 DVPNGEQF-FTTRHGRALACIPASEAARWLVHAQAYDPSGIRSGAVGDSQVKGGKGYPIG 185 Query: 171 PV------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKP-----IKSNE 219 P +++G DL T++LN++ + + + S +W P ++ + Sbjct: 186 PAWCGHLGLVWLKGKDLDETLVLNLIPATTAELRGVDSSTDWGACSWEDPEPETSVRGDY 245 Query: 220 SI-----PASSIGFVRGLFWQPAHIELCDP 244 S+ + R L W I L Sbjct: 246 SLLDPAGTPKELSIPRLLTWHSRRIRLVGD 275 >UniRef50_A5UR17 CRISPR-associated protein, Cse1 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR17_ROSS1 Length = 525 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. Identities = 92/523 (17%), Positives = 170/523 (32%), Gaps = 39/523 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL + WI V R+G +I L + + LS P + LL I Q I Sbjct: 7 FNLWTEPWIRVIRRDGRDDEIGIGTCLTDAHELAALSDPSPLVAGGTHRLLTAILQAIHQ 66 Query: 61 AKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---ANDVTPMEK 116 +D E + N + Q F L PF+QT V ++ P+ + Sbjct: 67 PQDIGEIAALLHNAKFDINRLQAFEKNHAGRFDLFDPHAPFLQTGDVPLHSNHNPQPVAR 126 Query: 117 LLAGVSGATNCAFVNQPGQGEA-LCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 L A + AT +C C A L A G G + + G P+ Sbjct: 127 LFAEIPVATERVHFTHVTDDRHRICPACCARGLVTAPAFASSGGAGIRPSINGVPPIYVL 186 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQ 235 G L T+ L++++ L + + P ++ S++G++ L + Sbjct: 187 PAGDTLFETLTLSLVSSDYL-PPGADPKRADQAIWNSDPPVVGKNCEVSAVGYLESLTFP 245 Query: 236 PAHIELCDPIGIGKCSCCGQESNLRYTGFLKE--KFTFTVNGLWPHPHSPCLVTVKKGEV 293 + L G C+ CG+++++ L E + G+W P K+ + Sbjct: 246 ARRMRLYPQAGSVFCTNCGRQTDIFVATMLFEMGHWLSKQTGVWEDPFVAFRKPSKQSKN 305 Query: 294 EE-KFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN------IAPQSPLELI 346 + K + W + + +++D+ + G R +V Q + + L Sbjct: 306 ADLKPIRPEEGKAIWREYAVLLLDE---DAAGLRP-RIVRQLARLIDRGTLTGRQRLRFR 361 Query: 347 MGGYRNN-QASILERRH-------DVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYT 398 G R + +A I E ++L G + + V K+ + Sbjct: 362 CIGIRTDGKAKIFEWLDEALEAPPELLQDPDAAAYVGEALRQSHEVAAILKSTFERHFRP 421 Query: 399 -FAEGFKNKD----FKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDK 453 G N+ FK + R + D+ + Q D+ Sbjct: 422 ERGTGGSNEQKFIRFKTVLERLIADYWRRLGLHFRQFVNDL---SDVWQRDDTARTWVIL 478 Query: 454 LHQLCEMLF----NQSVAPYAHHPKLISTLALARATLYKHLRE 492 + + + F +Q+ + A L+ +E Sbjct: 479 IIKEAQACFRTALDQTGDRADALRIRVEAQAECERQLHARRKE 521 >UniRef50_B0TDT8 Crispr-associated protein, ct1972 family, putative n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TDT8_HELMI Length = 523 Score = 184 bits (466), Expect = 8e-45, Method: Composition-based stats. Identities = 83/518 (16%), Positives = 164/518 (31%), Gaps = 44/518 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQIIA 59 +LL + W+ VR G ++ +++L+ + +W + ++ L + I Sbjct: 5 FDLLTEPWVTVRDVKG-RICVVHLRDVLAKAHEWSEVIDESPLIQFGLYRFLQALIIDIF 63 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT--PMEK 116 P K + E + + F L AE PF+Q + V + + Sbjct: 64 PLKGQRGRLELMEEGQFDETKLNAYWEKYGVYFDLFDAERPFLQVPPREQEKVKRKSVAE 123 Query: 117 LLAGVSGATNCAFVNQPGQGE-ALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 L + TN + Q E L A + + G G + G P + Sbjct: 124 LFHQLPTGTNVIHFHHRLQDEYVLAPDVCARIMTTLSPFTTAGGQGLSPSINGNPPYYVW 183 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQ 235 +G +L T+LLN + P W + + S + GL WQ Sbjct: 184 RKGDNLFETLLLNYWITDQ----------DRGIPAWR-DRRPSRGETRSEARLLEGLTWQ 232 Query: 236 PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEE 295 P + L +G +C+ G+ + E W P V V + Sbjct: 233 PRRVTLIPEMGPFQCTYSGRSCQWGVRQMVFEAGFQARVDTWRDP----NVAVVNTDKGR 288 Query: 296 KFLAFTTSAPSWTQISRVVV----DKIIQNENGNRVAAVVNQFRNIA--PQSPLELIMGG 349 F+ +W + + + K +Q +N A ++NQ Q + + G Sbjct: 289 SFVRPRWGRQTWRDVGPLALIDGAGKGVQEKNSYERAPILNQASIYLECEQQTTTIEVYG 348 Query: 350 YRNN-QASILERRHDVLMFNQGWQQY-------GNVINEIVTVGLGYKTALRKALYTFAE 401 + + L+ R++ L G +Q +N + A+ + + Sbjct: 349 LQTDGNMKYLDWRYEELQLPAGLEQVPNGEEFALQAMNNAEKAAWALRKAVNMCVQIKQK 408 Query: 402 GFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADE--------VIADLRDK 453 K + G + E ++ E L+ + + +E ++ + Sbjct: 409 KGKKEQKIWPG-EWGQRVEDAYWLSLEAPYLAFLSVLAGTAKEEDPDKHLETLMEAWTKE 467 Query: 454 LHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLR 491 + F ++ + + A L + LR Sbjct: 468 IRNKASDYFTEATKENVSDAEAMRRQIQAEQYLRRSLR 505 >UniRef50_D0Y921 CRISPR-associated protein, Cse1 family n=2 Tax=Dehalococcoides RepID=D0Y921_9CHLR Length = 543 Score = 161 bits (406), Expect = 8e-38, Method: Composition-based stats. Identities = 78/542 (14%), Positives = 168/542 (30%), Gaps = 81/542 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ + WIP + ++ +L+ + + + + +A LL+ I Sbjct: 4 FNLIDEPWIPCIGADDNIIEYSIRDTLFKAHELREICDDSPLVTVAIHRLLLAILYRAFE 63 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 E+R N + + ++ + W F L ++PF Q + + +L Sbjct: 64 GPSSMQEWRELYRNGSFNKSKIKEYLEKWCQRFNLLDEDYPFYQMSQFETVKPISVNRLA 123 Query: 119 AGVSGATNCAFVNQPGQGEAL--CGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT--- 173 ++ N + G + A L + A GFG + + G + Sbjct: 124 TEIASGNNATLFDHCGDDIEVEWTPSQVAQRLITCQSFALGFGRSGNAKINGINEILPYS 183 Query: 174 ----------TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA 223 +++G L T+++N+ P + P ++ + E + Sbjct: 184 SDAIALRGMNIWLQGGTLFETLMINL--SPVIDNSLPPWEL-KDSNKYRDRQNGKERVVC 240 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 S G V L WQ I L C S + + + Sbjct: 241 RSSGLVDQLTWQSRLIRLIPN--------CQTISKMYFAQGRSADKSAN------DLMKV 286 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPL 343 ++ +G L+ +++ +W +++ ++ + N+A ++ + Sbjct: 287 YRLSKDEGVSS---LSLSSNKAAWRDAHSILMIPESGSKERRP------ECFNMAEEAII 337 Query: 344 ELIMGGYR-------------NNQASILERRHDVLMFNQGWQQYGNVINEI---VTVGLG 387 ++GG + N + RH+ + + +++ + + Sbjct: 338 SGVIGGSKSFVTHIVGLATAPNKAGKFIFWRHERMPVPAAFLSNIDLLKRLGSCLENAER 397 Query: 388 YKTALR------KALYTFAE-----GFK--NKDFKGAG------VSVHETAERHFYRQSE 428 ALR LY + G + D ++ E HF+ E Sbjct: 398 AAEALRYRIQRVTKLYLSPDCESPGGHRPDKADVDNIIEATDPCLTYWSRMEEHFFALLE 457 Query: 429 LLIPDVLANVNFSQADEV---IADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARAT 485 L D A S+ DE R + + +S+ + + I +A Sbjct: 458 SLPNDWDAATGDSKPDEEQTARLTWRQSVKLEAKRALLESIELFGTTARAIQAIAHVSTD 517 Query: 486 LY 487 Y Sbjct: 518 FY 519 >UniRef50_D1CAJ3 CRISPR-associated protein, Cse1 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ3_SPHTD Length = 555 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 67/420 (15%), Positives = 125/420 (29%), Gaps = 39/420 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NLL WIP G + ++L+ + + + + P + ++ LL+ + Sbjct: 5 FNLLDCPWIPCMRAADGAWEDLSLRDVLVRAHELREIVDPSPLVTVSLHRLLLAFLHRVF 64 Query: 60 PAKDDVEFRHRIMNPL-TEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 E+ D + W + F L +PF QT V A+ P+ ++ Sbjct: 65 GPASIDEWAALWERGSWDPDPIDRYCERWRNRFNLFDPTYPFYQTPAVDASYAKPVAGIV 124 Query: 119 AGVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGF------GGGFKSGLR--- 167 G+ + + AL A L G G ++ + Sbjct: 125 HGMMLGNYLTLFDHSVATDPPALSPAQAARYLVAYQAFDVGGMISYQSRHGEEASVAKYT 184 Query: 168 GGTPVTT----FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA 223 P+TT V+G +L T++LN+ T++ W + Sbjct: 185 KAGPLTTSAVALVKGRNLFQTLMLNLHAYNGADGLPF--HFTDDSAAWERDEHPTPRERR 242 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP----H 279 S G+V L WQ + L G + + W Sbjct: 243 PS-GYVDLLTWQSRRVRLLPESADG--------NAAPVVRYAVIMKGEQFPDGWNPADYE 293 Query: 280 PHSPCLVTVKKGEVEEKFLA--FTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI 337 P ++K + + F W + Q+ + V Sbjct: 294 PMVAFRKSLKPRDGVPPWFPIGFQEDRALWRDSLALFQSVSGQSARPKMLDWVAGLAAEG 353 Query: 338 APQSPLE--LIMGGYRNNQASILERRHDVLMFNQGW---QQYGNVINEIVTVGLGYKTAL 392 S L + G +QA++ RH+ L ++ ++ E + + T L Sbjct: 354 PLGSRARFALDLYGMVTDQANVTLWRHERLPLPAPLLNDRERYELLQEALGLAERVSTLL 413 >UniRef50_Q67RP3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP3_SYMTH Length = 523 Score = 154 bits (389), Expect = 7e-36, Method: Composition-based stats. Identities = 86/528 (16%), Positives = 158/528 (29%), Gaps = 59/528 (11%) Query: 12 RPRNGGKVQIINLQSLYCSRDQWR-LSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHR 70 +G ++ +L+ R + P + +A LL+ + + ++ Sbjct: 2 IRLSGHPDRL-SLRQALAEAHVVREVCDPSPLVVVAIHRLLMALIYRVYRPVTRADWAAL 60 Query: 71 IMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAF 129 A W+D F L H E PF Q + V P+ L+ + N Sbjct: 61 WNAGRFDPGPLDGYGAFWMDRFELFHPERPFYQVPFIDGEKVHPISALVLEAASGNNPTL 120 Query: 130 VNQPGQ--GEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT----TFVRGIDLRS 183 + G AL A L A G G K R P+T +L Sbjct: 121 FDHGRVEGGVALPPDRAACHLLAHQLFALGGGVS-KPFNRMDAPLTKGLVVEALDTNLFR 179 Query: 184 TVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI-GFVRGLFWQPAHIELC 242 T+LLN L L ++ P ++ P W + + G + L WQ + LC Sbjct: 180 TLLLNTLPLEDWERLIPP--TDDDAPFWEGDDPPEPVREGTPVKGPLHYLTWQSRQLHLC 237 Query: 243 DPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTT 302 G + C ++++ +G+ P + K+G Sbjct: 238 TDEESGLVTGCQI----------RQRYALPKDGVRLDPGKVYQQSPKEG---FVPFKLNK 284 Query: 303 SAPSWTQISRVVVDKIIQNENGNRVA---AVVNQFRN---IAPQSPLELIMGGYRNNQAS 356 W + V++ Q+ + + A +++FR+ IA S + L + G + Sbjct: 285 ERAVWQ-YTHVLLQTSGQDYSRPYLTDWLATMHRFRSRYGIAFPSRVILAVTGLTTDPQK 343 Query: 357 I----LERR-------------------HDVLMFNQGWQQYGNVINEIVTVGLGYKTALR 393 L RR ++L + + + + + + AL Sbjct: 344 AAKVELWRRERLPLPMTILDQPELMAEVEEMLAEARRVEGLLSRTAQALVWASAERKALG 403 Query: 394 KALYTFAEGF--KNKDFKGAGVSVHETAE-RHFYRQSELLIPDVLANVNFSQADEVIADL 450 A+ G K ++ Q E + ++ A EV + Sbjct: 404 DAVTYTWTGKLPPGKKLDQVKGLARSLGMVARYWPQLEEPFRRSIEDLAVKSAGEVRSAW 463 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGG 498 R+ + F H L + + L + G Sbjct: 464 REAVMMAARDAFRSGRDGLLHTEASFEVLTCVGSAFHGKLSRIFAAAG 511 >UniRef50_B6WQ59 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ59_9DELT Length = 516 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 55/231 (23%), Positives = 92/231 (39%), Gaps = 23/231 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIP R+G V+ +L+ + D L++ R +A + LL+C+ A Sbjct: 1 MNLVDDPWIPCIRRDG-MVRPASLRDCFTCDDIVDLAV-RPHERVALMRLLLCVSYAAAG 58 Query: 61 -AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK----ANDVTPME 115 +D + E + W D F L H + PF+Q G++ + D+TP Sbjct: 59 IPEDYDGWEDLRER--LPLEVPVYLDQWRDAFELFHPQKPFLQVVGLRSASASGDLTPCS 116 Query: 116 KLLAGVSGATNCAFVNQPGQGE-ALCGGCTAIALFNQANQAPGF--GG---GFKSGLRGG 169 KL ++ +N + E A A+ L + G G G K+ R Sbjct: 117 KLDFSLATGSNSTLFDHAALMERAFTPEWLALNLLTYQMFSLGGLIGSVCWGEKTTGRSS 176 Query: 170 --TP------VTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWI 212 P + TF+R L ++ +N+L+ L + +P W Sbjct: 177 CDGPCAPGSMLHTFLRRDVLLDSIHVNLLSEEELHDYQQLGEGWQGRPLWE 227 >UniRef50_A1ARH9 CRISPR-associated protein, Cse1 family n=3 Tax=Bacteria RepID=A1ARH9_PELPD Length = 506 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 76/515 (14%), Positives = 150/515 (29%), Gaps = 42/515 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ + WIPVR +G + ++ +L S++ + P + A L+ + Sbjct: 4 FNLIDEKWIPVRFPDGAREELGIRDTLLRSKEIAAIEDPSPLVVAALHRFLLAVLYRALE 63 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 D + + ++ L + W + F+L ++PF Q V +++ P KL A Sbjct: 64 GPTDIDQAKILFLSGLPGQRITAYLEKWRERFWLFDEKYPFGQNPNVSRDEIEPWTKLTA 123 Query: 120 GVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVR 177 + +N + A A L + + G G+ + Sbjct: 124 EYNATSNKVLFDHTNTKNPGAREPKECARWLLSTMTFSISGGRGYYPS-PSPNAMMCIPL 182 Query: 178 GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI--GFVRGLFWQ 235 G + T+ ++ P + + W + K+ + G+ WQ Sbjct: 183 GRNFHETLCYCLVPYPNRNVMSGDSTL------WEREPKALPLNTPKQMATGYADLYTWQ 236 Query: 236 PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEE 295 I L G+ S +R+ F P P P + KG + Sbjct: 237 SRMIRL--EEQP-----TGEVSMMRFVAGQ----GFENPSSTPDPMHPYKLEKNKGIL-- 283 Query: 296 KFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQA 355 + F W ++ D + A + + ++ Y A Sbjct: 284 -PVQFRKDRGVWRDFDSLLPDSSELAPITIQNAVKLAGKNMNYLPESVLILGLKYEPPNA 342 Query: 356 SILERRHDVLMFNQ---GWQQYGNVINEIVTVGLGYKTALRKALYTFAEG---------- 402 ++ R + L + G + I + + + L A +FA Sbjct: 343 NLEFWRMECLSLPKALAGDRFIRTDIRQFLADAEEAQKTLWTACNSFARDLISRGDKRPV 402 Query: 403 FKNKDFKGAG-VSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEML 461 K + V ++ D + + Sbjct: 403 SKKDISDFVEQMPVSSVYWSTLESCFHKILSDYNLERDPEDIRCQWLKFVRDAMRTAWKQ 462 Query: 462 FNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 SV+ I L A + + L+EL + Sbjct: 463 HTSSVSTG--DAWAIRALVKAERPVLRKLKELNDE 495 >UniRef50_B8IZA3 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA3_DESDA Length = 540 Score = 143 bits (361), Expect = 1e-32, Method: Composition-based stats. Identities = 90/516 (17%), Positives = 171/516 (33%), Gaps = 67/516 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIPV N G+ Q++NL+ C QWR R +A + LL+CI Sbjct: 1 MNLVSDQWIPVLD-NSGQHQLVNLREALCEGAQWRDLAVRPHERVALMRLLLCIAHAALN 59 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN---------DV 111 ++ R+ L D + W D F L H + PF+Q G+KA D Sbjct: 60 GPSREDW-SRVPQ-LLPDAVAAYLQKWQDSFDLFHPQKPFLQISGLKAASKKTKRTEDDE 117 Query: 112 TPM---EKLLAGVSGATNCAFVNQPGQGEALCGGCT--AIALFNQANQAPGFGGG----- 161 P+ KL ++ + G A+ L + +PG G Sbjct: 118 GPLVKASKLDFTLATGNQSTHFDHEGSLAQRSAEQALPALNLLSYLCFSPGGLIGTVVWN 177 Query: 162 ----FKSGLRG----GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPN-ESHTENQPTWI 212 +S G G+ TF RG ++ T+ LN+ ++++ + +P W Sbjct: 178 NHVTARSSSDGPCAVGSMTHTFWRGANVLQTLHLNMCARDDIERRLASIPEAGWGKPVWE 237 Query: 213 K-PIKSNESIP--ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKF 269 + P+ +++ ++ ++ L + G + + F Sbjct: 238 QMPVSFDDANAWRNATHTYLGRLTPLSRLVLF----QRGASGMT-LGAGPVFPNFN--NA 290 Query: 270 TFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA 329 P S ++ K + L T W ++ + + + N + V Sbjct: 291 KAPYV---EEPTSTIILRGKDNKQTRALLPVTPGKALWRELHAL-----VAHRNKDDVGG 342 Query: 330 VVNQFRNIAPQSPLE-LIMGGYRNNQASILERRHDVLMFNQGWQ------------QYGN 376 A + +++ G +QA +++ V Q Q QY Sbjct: 343 FWAAALASAEGASGRDMVVAGMARDQAEVVDTLESVYHIPQAMQKTPGQLVYGQGVQYAE 402 Query: 377 VINEIVTVGLG-YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVL 435 ++ + + Y+ + +G + + + A ++ +E +P + Sbjct: 403 DMSRKLGWAVDTYREKVDGGWAGRLKGA-GAGKVELLIKLRQKAFTLYWTAAEQSLPLLF 461 Query: 436 ANVN---FSQADEVIADLRDKLHQLCEMLFNQSVAP 468 A V + L ++ AP Sbjct: 462 ACVESLGSDAFPAAQKAWQKALFVAARKAYSSICAP 497 >UniRef50_C0W6U3 CRISPR-associated Cse1 family protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6U3_9ACTO Length = 551 Score = 141 bits (354), Expect = 8e-32, Method: Composition-based stats. Identities = 80/496 (16%), Positives = 153/496 (30%), Gaps = 44/496 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQ-IIA 59 NLL + WI V +G ++ L + + +A L LL+ I + Sbjct: 5 FNLLDEPWIRVTWLSGESEEVSLLTLFRDATQIEGIHGEIASQNIAILRLLLAICHRTMD 64 Query: 60 PAKDDVEFRHRIMNPLTE-DEFQQLIAPWIDMFYLNHAEHPFMQTKGV--KANDVTPMEK 116 +D +R +P + + + + F L E PF Q G+ + + +E Sbjct: 65 GPEDLEVWREYWSSPGSLGQDASTYLERFRSRFDLRDPEQPFFQVAGIHTASGKSSGLES 124 Query: 117 LLAGVSGATNCAFVNQPGQG-EALCGGCTAIALFNQANQAPGF---GGGFKSGLRGGT-- 170 L+A + F + G+G + A L + P G +R G Sbjct: 125 LIADIPNGHPF-FTTRMGEGLSQMTWAEAARWLIHVHAFDPSGIRSGAVGDPQVRNGKGY 183 Query: 171 PV---------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESI 221 P+ T + G +L T+LLN + ++ + + P W +P Sbjct: 184 PIGPGWTGQIGTITLAGDNLEQTLLLNTVVCNCVEG-LQEVDLSRDLPPWERPADGPGGS 242 Query: 222 PASS-IGFVRGLFWQPAHIELCDPIGI-----GKCSCCGQESNLRYTGFLKEKFTFTVNG 275 + G V WQ + L + G ++ +++ + Sbjct: 243 ASKQPTGPVSCYTWQTRRVLLHGKEEVTSLFLGNGDKATPQNRQHVEPLTAWRYSEPQS- 301 Query: 276 LWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFR 335 + + K + T P +S +V K + AV+ ++ Sbjct: 302 --QKAKATVYMPRKLPTDRAMWRGLPTVVP---HLSPMVSTKAGGQVSRFLPPAVITFYQ 356 Query: 336 NIAPQS--------PLELIMGGYRNNQASILERRHDVLMFNQG-WQQYGNVINEIVTVGL 386 + Q P+ + Y +A I E D L + + +V+ + Sbjct: 357 RLMYQRVIPPRKLLPIHAVGMEYGAQEAVITELVEDTLHVPSALLGRDNTRLLTLVSDAI 416 Query: 387 GYKTALRKALYTFAEGFKN--KDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQAD 444 L A + + A FY+ + P LA++ + + Sbjct: 417 EVTEQAAGTLRNLAANLDRAAGGSPDTSSAARQRAGAQFYQAIDERFPRWLADIADADPE 476 Query: 445 EVIADLRDKLHQLCEM 460 V R+ L Sbjct: 477 SVAEQWREVLRSEAHR 492 >UniRef50_A0LM51 CRISPR-associated protein, Cse1 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM51_SYNFM Length = 517 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 87/532 (16%), Positives = 154/532 (28%), Gaps = 67/532 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ WI +G + + + L + + + L E A +L+ I Sbjct: 6 FNLIDRPWISCVELSGRRRTLGLHEVLSRAHELRGIELQSPLAETALFRVLLAAVHRIVE 65 Query: 60 PAKDDVEFRHRIM-NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTP----- 113 K E+R L W F L E PF QT G+ D Sbjct: 66 GPKGTGEWRALYQATKLPGGRIDAYFEKWSHRFDLFSKEEPFYQTPGLAIRDAKGAEAPA 125 Query: 114 -MEKLLAGVSGATNCAFVNQPGQGEALC--GGCTAIALFNQANQAPGFGGGFKSGLRG-- 168 + ++ + N + + C +AL + + L G Sbjct: 126 VIAGIMLERASGNNKTIFDHSMDEDRGCLSPEEAVLALIAAQMYSLRGLNKKTTNLFGYQ 185 Query: 169 ---------GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE 219 G + ++G L ++ +L L P S ++ P W + Sbjct: 186 ESFSDSVMVGG-IFAALQGQSLFESL---LLNLLLYTDNLPIHSSRDDCPVWERHDHGET 241 Query: 220 SIPASSIGFVRGLFWQPAHIELCDPIG-IGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 + G++ L + HI L G G C E F Sbjct: 242 GVRTPR-GYLDYLTCKCRHILLVPEPGLDGPC-------IRHVHIAQGEAF--------Q 285 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV-----VNQ 333 +P + K E + + + W + + E R A + Sbjct: 286 DVDNPGFIKRKNKEGKWLPVQMQPARLVWRDSISLFSFDTGKREGDRRPDAFRLVGDIAL 345 Query: 334 FRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVIN---EIVTVGLGYKT 390 R +A S G N+QA+ L R + L +++ E + + T Sbjct: 346 RRIVALPSKYRCCTYGLANDQANPLAWRKETLNIPTALLSDPDLVACLREAMDLSEKAHT 405 Query: 391 ALRKALYTFAEGF---------KNKDFKGAGVSVHETAERHFYR-----QSELLIPDVLA 436 LR A+ T+ + + + + GA + E HF +++ Sbjct: 406 ILRNAIRTYMDKYLPRNSRDVTEKLNATGASRLFWDRLESHFNAFLLEIENQDKALVAWE 465 Query: 437 NVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYK 488 A E + + F H +L++ LA TL + Sbjct: 466 RNIERAALEAFEACLKQRYADSAKKFRAWTEA---HGQLVARLATLSKTLSR 514 >UniRef50_C1XYH9 CRISPR-associated protein, Cse1 family n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XYH9_9DEIN Length = 419 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 51/259 (19%), Positives = 93/259 (35%), Gaps = 11/259 (4%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQIIA 59 NL+ WIPVR G +++ ++L+ ++ R+ P + +A LL+ I Sbjct: 4 FNLITQPWIPVR--EGNQLKEVSLEQALLEGRRFERIEDPSPLVTVALYRLLLAILHRAL 61 Query: 60 -PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-KANDVTPMEKL 117 ++ E N ++ + +A D F L H E PF Q L Sbjct: 62 QGPENSDEAAKWFSNGFDAEKIRDYLAKHQDRFDLFHPERPFYQVPDFTLERSCRSWTVL 121 Query: 118 LAGVSGATNCAFVNQ--PGQGEALCGGCTAIALFNQANQAPGFGGG---FKSGLRGGTPV 172 ++ N + + L A L A G + T Sbjct: 122 APELNSDNNKVLFDHTVTSRPRPLHPAEAARLLVANQTFALSAGKSVLCHTATAPVATAA 181 Query: 173 TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGL 232 + G +L T+ LN+++ P+ + + + E +P + +K+ E+ A+ G V Sbjct: 182 LALMLGENLHETLCLNLVSYPKSE-YERDFATWEREPLRVSDLKNCEAARATPKGIVHRY 240 Query: 233 FWQPAHIELCDPIGIGKCS 251 W + L G G+ Sbjct: 241 TWLSRAVRLDPEEGNGQAD 259 >UniRef50_C7MTN0 CRISPR-associated protein, Cse1 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTN0_SACVD Length = 567 Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats. Identities = 78/456 (17%), Positives = 133/456 (29%), Gaps = 57/456 (12%) Query: 2 NLLIDNWIPVR-PRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALA--LLVCIGQII 58 NLL W+P+R +G + ++L S + L + A L LL + + Sbjct: 9 NLLTQPWLPIRHRHSGALEAVGIAEALLRSHELADLVVDVPTQVPALLRQVLLPVMVDAL 68 Query: 59 APAKDDVEFRHRIMNPLTEDE----FQQLIAPWIDMFYLNHAEHPFMQTKGVK--ANDVT 112 P + R DE + + D F L H PF Q G++ + Sbjct: 69 GPPTTREGWSKRFAAGRFTDEERDRLSEYFDQYRDRFALFHDTRPFAQVAGLRTPKGETK 128 Query: 113 PMEKLLAGVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQ---APGFGGGFKSGLR 167 L+A + N G+ L G A L + A G ++ Sbjct: 129 GTAVLVATAASGNNVPLFTSRTDGDPFPLTPGEAARWLLHTQCWDTAAIKSGAEGDPKVK 188 Query: 168 GG-------TPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNES-HTENQPTWIKPI 215 G P+ G L T+LLN P P P W Sbjct: 189 AGKTTGNPTGPLGQLGVVVPVGRSLYETLLLNTPVHPEDMWGVPQWKRDPPFGPEW---- 244 Query: 216 KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNG 275 ++ G + WQ + L C L +V Sbjct: 245 ---DTYAPQ--GLLELWTWQSRRVRLSPEQTGDGQRVC--------RVVLTAGDRISVLP 291 Query: 276 LWPHPHSPCLVT--VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ 333 W PH+ K G + W + ++ + + + ++ Q Sbjct: 292 EW-EPHTTWTSAPNPKAGAPARRPRRHAPGKAIWQGMEALLAVEREEKGK-FHTSDLLRQ 349 Query: 334 FRN------IAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVG 385 + IA PL + G Y N A + + HD + + + ++V Sbjct: 350 INSARVDGVIADDYPLRVQTYGLLYGNQSAVVEDILHDAMPLPVAALRAEGEVYDLVAEA 409 Query: 386 LGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAER 421 L +A+ + + + +GA + R Sbjct: 410 TEQAEELAQAVNSLSADLRRA--QGADPIPWDKGHR 443 >UniRef50_D1YEE1 CRISPR system CASCADE complex protein CasA n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE1_PROAC Length = 552 Score = 129 bits (323), Expect = 3e-28, Method: Composition-based stats. Identities = 61/353 (17%), Positives = 108/353 (30%), Gaps = 43/353 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCI-GQIIA 59 NL+ + WI VR + G ++ ++ + + + L+ E A L LL+ I Q A Sbjct: 5 FNLMDEPWISVRTPDNGVTEVSIREAFHRATEFRGLAGEIPTQEAAVLRLLLAIAIQATA 64 Query: 60 PAKDDV----EFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTP 113 + D ++ L DE W++ F L PFMQ + + Sbjct: 65 RFRSDDEKIDDWGQWWEEGLPLDEIDSYSDRWLNRFNLFDDSAPFMQVTDLHTSNGGYSG 124 Query: 114 MEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIA---LFNQANQAPG------------F 158 + K+++ V N F A + A G Sbjct: 125 LTKIISEVP--PNDKFFTTRDGAGTTSLSFAEAARWLVHTHAFDVSGIKSGAVGDPRVKG 182 Query: 159 GGGFKSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS 217 G G+ G P+ V G L T+LLN+ Q+ P + Sbjct: 183 GRGYPIGTGISGPMGIVIVEGKSLAETILLNLFLQDDPQQDVPVWERPPQ-------TAT 235 Query: 218 NESIPASSIGFVRGLFWQPAHIELC-DPIGIGKCSCC-GQESNLRYTGFLKEKFTFTVNG 275 + G WQ + L D + C G + +Y + + Sbjct: 236 PDREHPVPTGCADLFTWQSRRVRLIADGDRVVDVLLCNGDKVEWKYLLHNDSTTAWRYSA 295 Query: 276 LWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVA 328 T GE + ++ W + ++V + ++ + A Sbjct: 296 P---------QTKAAGETVYMPRSHDSTKAMWRGLEPLLVREPAADDRRRKKA 339 >UniRef50_Q53VY1 Putative uncharacterized protein TTHB188 n=1 Tax=Thermus thermophilus HB8 RepID=Q53VY1_THET8 Length = 502 Score = 128 bits (321), Expect = 6e-28, Method: Composition-based stats. Identities = 86/527 (16%), Positives = 156/527 (29%), Gaps = 71/527 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINL-QSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ + WIPV GG+V + + ++L + + R+ P E LL+ + Sbjct: 7 FNLIDEPWIPVLK--GGRVVEVGIGEALLRAHEFARIETPSPLEEAVLHRLLLAVLHRAL 64 Query: 60 -PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKL 117 + + +D + + + D F+L H E PF+Q + + P KL Sbjct: 65 SGPRCPEDVLDWWRKGGFPQDPIRDYLNRFRDRFFLFHPEAPFLQVADLPEENPLPWSKL 124 Query: 118 LAGVSGATNCAFVNQ--PGQGEALCGGCTAIALFNQANQAPGF-----GGGFKSGLRGGT 170 L ++ N + A AL APG G G Sbjct: 125 LPELASGNNPTLFDHTTEENLPKATYAQAARALLVHQAFAPGGLLRRYGVGSAKDAPVAR 184 Query: 171 PVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW-IKPIKSNESIPASS---- 225 P G + L L + ++ P W + P++ + A + Sbjct: 185 PALFLPTGQN----------LLETLLLNLVPYTPEDDAPIWEVPPLRLGDLEGARTKWPL 234 Query: 226 IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 G R W + L D G G + P Sbjct: 235 TGRTRVYTWPARGVRLLDE-GDG------------VRFMGYGPGVEPLEATHRDPMVAQR 281 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP------ 339 + K + L + W S ++ G +VAA + N+ Sbjct: 282 LDAKGNLL---VLRLSEERSFWRDFSAMLPR------QGGKVAATLEHAENLQGELEDEG 332 Query: 340 -QSPLELIMGGYRNNQASILERRHDVLMFNQGW--QQYGNVINEIVTVGLGYKTALRK-A 395 + + L + G ++QA +L+ R +V G + + + + + L+ A Sbjct: 333 LEGRITLRVLGQVSDQAKVLDIRREVYPLPSGLLTPKAEENLEKALKMAEELGQGLKHLA 392 Query: 396 LYTFAEGFKNKDFKGAGVSVHET---------AERHFYRQSELLIPDVLANVNFSQADEV 446 +D E ER ++ + P A V + ++ Sbjct: 393 QEVAKAVVGERDRGHGRSPYLEELTKLANSLPLERLYWHALDGAFPRFFARVEEEASLDL 452 Query: 447 IADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLREL 493 + + + A H + LA + L EL Sbjct: 453 WREALRGAALEAWKATRRFLGTGARH---LKALAQGEQEFGRLLGEL 496 >UniRef50_C8XAY7 CRISPR-associated protein, Cse1 family n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XAY7_NAKMY Length = 555 Score = 127 bits (318), Expect = 1e-27, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 93/273 (34%), Gaps = 31/273 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 N++ + +P +G I Q+L + + M A LL+ I P Sbjct: 5 FNVIDEPVLPAVWLDGTSADISIRQALIDAHRIAAIEGEPASMTFALHRLLLAIVYRALP 64 Query: 61 AKD-DVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV--KANDVTPMEK 116 + E+R P L ++ + W F L P++Q G+ ++ + +EK Sbjct: 65 VERPRQEWRELWDAPELPAEDLNSYLDDWYQRFDLLDPAQPWLQVAGLHTTRSEFSELEK 124 Query: 117 LLAGVSGATNCAFVNQPGQGEALCGGCTAIA-LFNQANQAPGF---GGGFKSGLRGGT-- 170 L+ + F + G A L + P G ++GG Sbjct: 125 LIPDIPNGEQF-FTVRAGLAARSISLAEAARYLIHAQAFDPSGIKSGAVGDPRVKGGKGY 183 Query: 171 PVTT---------FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW-IKPIKSNES 220 P+ T V+G L+ T+LLN+ P S E+QP W +P+ + E Sbjct: 184 PIGTAWAGNLGGVLVQGRTLKETLLLNLTLGSPNDDDRP-WSGEEDQPVWEREPLTAAEE 242 Query: 221 IPASSI---------GFVRGLFWQPAHIELCDP 244 P + G L W + L Sbjct: 243 FPGETTGDIPGRAPRGPADLLTWPSRRMRLRVE 275 >UniRef50_Q2JWC2 CRISPR-associated protein, Cse1 family n=2 Tax=Chroococcales RepID=Q2JWC2_SYNJA Length = 524 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. Identities = 81/533 (15%), Positives = 158/533 (29%), Gaps = 75/533 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCS-RDQWRLSLPRDDMELAALALLVCIGQII- 58 NL + WIPV + ++Q ++L L+ + LA L+ I Sbjct: 6 FNLTKEKWIPVLDPD-FRIQELSLVELFREWESLKEMRGDNPPTTLALYRFLLAIMHRAY 64 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 KD ++ + + + D F L H HPFMQ + P+ + Sbjct: 65 LGPKDTDHWKEIFQD--NGKRVIKYLQDRQDCFDLFHPTHPFMQDPALPIEKAVPVHSI- 121 Query: 119 AGVSGATNCAFVNQPGQ-GEALCGGCTAIALFNQA----NQAPGFGGGFKSGLRGG--TP 171 + +T+ F ++ G ++ A L F G SG TP Sbjct: 122 --HTMSTSEVFFHEHEWSGYSISLPEAARLLVRLQGVDITSLRAFYVGQDSGNHSAVNTP 179 Query: 172 VT----TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI- 226 ++G L+ T++LN++ + + P+ E+ PTW K + Sbjct: 180 TMNVANVLLKGRTLKETLMLNLM-RYSPEDEMPSVVAGEDVPTWE--TKVGYTGQPKKEI 236 Query: 227 --GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPC 284 G++ L + + L G W C Sbjct: 237 PAGYIHYLTFPWRRLRLFSEAG---------RVQQLAITMGNSLPNGVEARQWE-----C 282 Query: 285 LVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLE 344 V K E+K + + W ++ Q +V+ + + + Sbjct: 283 SVAYK----EDKPVRLSLHRQLWRDADSFLLTASKQTR-----PRIVDWLAELKSEELVN 333 Query: 345 ----LIMGGYRNNQASILERRHDVLMFN-------QGWQQYGNVINEIVTVGLGYKTALR 393 + G +QA L + Q + I ++++ Sbjct: 334 NLVVFEVLGMSADQAKPLGWSSARFSVPMQFVTDSELAQSLKSAIGIAENHQQIFRSSKG 393 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQAD--------- 444 ++ AE KN + + ++ E ++ + +L ++ Sbjct: 394 SPYFSLAEVLKNGETEKLSKAL--DGESRYWAILDHAFSMLLHDLPQDNQPGADGIIYYG 451 Query: 445 -EVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 + + F +S+ + A A TL + L EL+ Sbjct: 452 LTTLPAWTKTVQDAARRAFTESIES----IRNYQARAAALRTLERKLAELRAD 500 >UniRef50_D1A5T7 CRISPR-associated protein, Cse1 family n=4 Tax=Actinomycetales RepID=D1A5T7_THECD Length = 561 Score = 123 bits (308), Expect = 2e-26, Method: Composition-based stats. Identities = 72/367 (19%), Positives = 118/367 (32%), Gaps = 58/367 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 +L W+PV NGG+ + + + RL E A L LL+ I Sbjct: 7 FDLTRRPWLPVLYDNGGEGLLSLTEVFQQAHRLRRLVGDVPTQEFALLRLLLAILHDAIE 66 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN--DVTPMEKL 117 D E+ L D + D F L H + PF+QT ++ +V ++++ Sbjct: 67 GPDDIDEWTELWEEGLPTDRITAYLERHRDRFDLLHPQAPFLQTAELRTAGDEVFSLDRI 126 Query: 118 LAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF------------------- 158 +A V T F + E L A L + Sbjct: 127 VADVPNGT-LFFTMRAHGVERLDFAEAARWLVHAHAFDTSGIKSGAVGDPRVKKGKVYPQ 185 Query: 159 GGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN 218 G G+ L G FV G DLR T+LLN++ + + P W + + Sbjct: 186 GVGWAGNLGG-----VFVEGDDLRETLLLNLIAFDTDNLRI---DPARDLPAWRQEPRGP 237 Query: 219 ESIPASSI-----GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV 273 + + + G WQ I L ++ Y L + ++ Sbjct: 238 QQLDEIELSRRPAGLRDLYTWQSRRIRLHFD------------ADGVYGVVLA--YGDSL 283 Query: 274 NGLWPHPHSPCL------VTVKKGEVEEKFLAF--TTSAPSWTQISRVVVDKIIQNENGN 325 + H H P KK + + +L S +W + +V + E Sbjct: 284 SPHNKHVHEPMTAWRRSPAQEKKLRLAQVYLPREHDPSRSAWRGLGALVAGRAEGTEQRE 343 Query: 326 RVAAVVN 332 AA+V Sbjct: 344 EAAAIVR 350 >UniRef50_Q2JH30 Putative uncharacterized protein n=2 Tax=Frankia RepID=Q2JH30_FRASC Length = 550 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 82/515 (15%), Positives = 155/515 (30%), Gaps = 66/515 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALA--LLVCIGQII 58 NL+ WIPV R G ++++ ++L + L+L +A L LL + + Sbjct: 4 FNLIDGQWIPVIKR-GRRLEVGIRKALVDAHTIDGLALDDPLEAVAVLRQVLLPVVLDVF 62 Query: 59 APAKDDVEFRHRIMNP-----------LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 + D E+ R E+ + + F+L H PF Q G++ Sbjct: 63 GAPRTDEEWSQRWEAGCFDRIIRKDRAEDEEGIESYLIRQAARFHLFHPTAPFAQVAGLR 122 Query: 108 --ANDVTPMEKLLAGVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGF---GG 160 ++ P+ L+ ++ N + + + +L A AL G Sbjct: 123 TAKDETKPVSLLVPRLASGNNVPLFSSRTENDPPSLTPAAAARALLAAHCWDTAAIKTGA 182 Query: 161 GFKSGLRGG-------TPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQP 209 ++ G P+ G L T++L++ + +++P Sbjct: 183 ADDPKVKTGKTMGNPTGPLGQFGIVLPLGETLFHTLMLSI-------PVLRHGLRQKDRP 235 Query: 210 TWIKPIK-SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEK 268 W ++ + G + L WQ I L E L Sbjct: 236 QWRSESSATSRWETRAPEGLLDLLTWQSRRIRLVPEA-----DPTAVEDVSVRRVVLTA- 289 Query: 269 FTFTVNGLWPHPHSPCL--VTVKKGEVEEKFLAFTTSAPS---WTQISRVVVDKIIQNEN 323 + G H P V K + +E + P W + ++ + ++ Sbjct: 290 -GDRLTGS-VHALEPHTAWRQVDKPKADEPPVRPVRHQPGRSAWRGLEALLTTTPLSSDK 347 Query: 324 GNRVAAVVNQFR-----NIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGN 376 A+ R + PL+++ G Y A I E D + + Sbjct: 348 VFAPTALSQLARLRDDGYVPDDLPLQVLTVGVKYGTQSAVIDEVMADEIPLPVTALARDS 407 Query: 377 VINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAER---HFYRQSELLIPD 433 + E V +LR A + + + + +R + Sbjct: 408 AVRETVLAVAAQAESLRIAANRLGDDLREAAGATDKLP-WDKGQRLGEILIHSFNPTVHR 466 Query: 434 VLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAP 468 +LA + Q E L L + V P Sbjct: 467 LLAGL--QQHPEDAKRAELAWRILARRLAWEVVDP 499 >UniRef50_C1XFZ8 CRISPR-associated protein, Cse1 family n=2 Tax=Meiothermus RepID=C1XFZ8_MEIRU Length = 494 Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats. Identities = 76/496 (15%), Positives = 140/496 (28%), Gaps = 41/496 (8%) Query: 25 QSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA-PAKDDVEFRHRIMNPLTEDEFQQL 83 Q+L + R+ P + A LL+ + D E + Q Sbjct: 5 QALLEAHKFERIEDPSPLVTAALHRLLLAVLHRALEGPADAYEAAEWFEEGFDRGKIQTY 64 Query: 84 IAPWIDMFYLNHAEHPFMQTKGV-KANDVTPMEKLLAGVSGATNCAFVNQ--PGQGEALC 140 ++ + D F L H E PF Q L ++ N + + L Sbjct: 65 LSKYRDRFDLFHPERPFYQVPDFSLERSCRSWTVLAPELNSDNNKVLFDHTVTSRPRPLL 124 Query: 141 GGCTAIALFNQANQAPGFGGG---FKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQK 197 A L A G + T V G +L T+ LN++ P+ + Sbjct: 125 PAEAARLLVANQTFALSAGKSVLCHTATAPVATAALALVLGDNLHQTLCLNLVAYPKREH 184 Query: 198 QFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQES 257 + + + E +P + + E AS+ G V W + L G Sbjct: 185 EH-DFATWEQEPLKVADLADCERARASAKGIVHRYTWLARAVRLHPEEEDG--------- 234 Query: 258 NLRYTGFLKEKFTFTVNGLW--PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVV 315 ++ P K E + L F+ W + ++ Sbjct: 235 -QTIVRWIAYASGVRYEESQVRRDPMVAFRPDPKDPTRE-RPLGFSEGRALWRDFAALLP 292 Query: 316 DKIIQNENGNRVAAVVN-QFRNIA---PQSPLELIMGGYRNNQASILERRHDVLMFNQGW 371 + G VA +R + Q L +++ G ++QA + R ++ + Sbjct: 293 KP--GSAQGLAVADHARNVYRALGRHFRQRGLPVMVAGQASDQAKVELWRGEIYRLPEAI 350 Query: 372 ----------QQYGNVINEIVTVGLGYKTALRKALYTFAEGFK-NKDFKGAGVSVHET-- 418 +Q N + V G AL L + + D S Sbjct: 351 LGETDLRAFVEQCLNEAEVMGEVLNGAARALAAGLLSMGDRKPHKDDVSKLARSFPHQVA 410 Query: 419 AERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLIST 478 I + + QA + R+ L + + + + + Sbjct: 411 YWSALEGHFADWISQLGPDFEKQQAR-LERAWREILQREALQAWRLAALAAGDDARALRA 469 Query: 479 LALARATLYKHLRELK 494 + L H+ + K Sbjct: 470 VHKGEGILLAHIYKQK 485 >UniRef50_B6XT61 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT61_9BIFI Length = 550 Score = 117 bits (294), Expect = 6e-25, Method: Composition-based stats. Identities = 90/520 (17%), Positives = 166/520 (31%), Gaps = 87/520 (16%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPR---DDMELAALALLVCIGQI 57 +LL D WIPV +G ++L+ L+ D W++ R +A L L + I Sbjct: 5 FSLLDDGWIPVSYVDGHP-DEVSLRRLF--EDAWKIKEIRGDIPQQAIAILRLALGILYR 61 Query: 58 IAPAKDDVE------FRHRIMNPL-TEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK--- 107 ++ E + D + W D F+L PF Q G++ Sbjct: 62 AYYVENPSEEQMRDMWDDIFRIGHFDLDILEDYFDEWGDRFFLFGDR-PFFQVSGLEYVG 120 Query: 108 ANDVTPMEKLLAGVSGATNCAFVNQP-GQGEALCGGCTAIALFNQANQAPGFGGGFKSGL 166 P+ +++A + F + G + L +A L + G K+ + Sbjct: 121 QKPYDPVSEMIADMPKPEKYLFAMRGLGTTDTLSLPESARWLVYLQSFDT---AGIKTPV 177 Query: 167 RGG-----TPVTTF----------------VRGIDLRSTVLLN-VLTLPRLQ-KQFPNES 203 +G + G +L T++LN VL R + + Sbjct: 178 KGNTHINKGKIYPLKGFLGTGWLGGVGGVYAEGANLFETLMLNWVLYDDRYDSEYYRLFG 237 Query: 204 HTENQPTWI-KPIKSNESIPASSI-GFVRGLFWQPAHIELCD----PIGIGKCSCCGQES 257 +T + P W + S + +S G V+ + WQ I L IG +C G Sbjct: 238 NTNDIPVWEKNEVPSADMDDQNSFAGPVQAMTWQSRRIRLVPNEDCTRVIGVVNCYGDA- 296 Query: 258 NLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVV--- 314 T + + F W P + + S W + ++ Sbjct: 297 ---VTQYNTD--GFEKMTAWRRSI-PQQKKLGLPVPPHMPVTHDASKALWRGLEPILCVG 350 Query: 315 ---------------VDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQA-SIL 358 + + + + + V + + + + G + + S++ Sbjct: 351 DDGDFRPGIIRWLEEIRTEVLDSEEHVLNMVTIHAQGMTYGTQSSVFETGIDDKLSLSMV 410 Query: 359 ERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHET 418 RHD + ++V TAL ++ D G + + Sbjct: 411 MFRHDYAGIAA--------VVDVVKSTDKAVTAL--TMFVRNLRSAAGDHSGKTQEIADQ 460 Query: 419 AERHFYRQSELLIPDVLANVNFSQADEVIA-DLRDKLHQL 457 Y +LL D LAN + SQ + D++H+L Sbjct: 461 IRESAYADLDLLFRDRLANFDESQDPVTYSNAWLDEVHRL 500 >UniRef50_C7QEM7 CRISPR-associated protein, Cse1 family n=12 Tax=Actinomycetales RepID=C7QEM7_CATAD Length = 1540 Score = 114 bits (286), Expect = 6e-24, Method: Composition-based stats. Identities = 63/362 (17%), Positives = 109/362 (30%), Gaps = 53/362 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRL---SLPRDDMELAALALLVCIGQI 57 +L W+PV +G + + +L+ ++ + R LP D L L L + Sbjct: 987 FDLTSAPWLPVLYADGMQGVL-SLRDVFAQSNLIRRLVGDLPTQDFALLRLL-LAVLYDA 1044 Query: 58 IAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV--KANDVTPME 115 + +D ++ + + + F L H PF Q G+ +V P+ Sbjct: 1045 VDGPRDGQDWEDLWTSDDPFAAVPAYLDSHRERFDLLHPATPFYQVPGLQTAKGEVGPLN 1104 Query: 116 KLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRG------- 168 K++A V + E L A L + G KSG+ G Sbjct: 1105 KIVADVPDGDPF-LTMRMPGVEQLSFAEAARWLVHTQAFDTS---GIKSGVVGDPKAVNG 1160 Query: 169 -----GTPVT-----TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN 218 G F G LR T+LLN++ Q + ++ P W Sbjct: 1161 KRYPQGVAWLGNLGGVFAEGDTLRQTLLLNLIPADTTNLQV---TSAQDVPAWRGTNGRA 1217 Query: 219 ESIPASSI-----GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV 273 S A + G WQ I L G + E Sbjct: 1218 GSDHADAEPRVPAGLRDLYTWQSRRIRLEYDTRG----VTGA-----VLTYGDELTAHNK 1268 Query: 274 NGLWPHPHSPCLVTV---KKGEVEEKFLA--FTTSAPSWTQISRVVVDKII-QNENGNRV 327 +G P + + KK + ++ + +W I ++ + Sbjct: 1269 HG--VEPMTGWRRSKPQEKKLGLSTVYMPQQHDPTRAAWRGIESLLAGSAGSGSSQTGEP 1326 Query: 328 AA 329 A+ Sbjct: 1327 AS 1328 >UniRef50_Q5YRB3 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5YRB3_NOCFA Length = 552 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 67/390 (17%), Positives = 124/390 (31%), Gaps = 56/390 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRL--SLPRDDMELAALALLVCIGQII 58 +LL + WI V +G ++ Q + + + +P + L L + + Sbjct: 8 FDLLDEPWIIVTDASGKASEVSLRQVFRRADEYVAIGGEVPTQQFAILRLLLAILHRTVA 67 Query: 59 APAKDDVE-FRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPME 115 ++ + D+ + + D F L H PF Q +++ V+ ++ Sbjct: 68 DRPGTAIDVWSRLWREWPA-DDIDRYLLAHRDRFDLFHPSTPFFQVADLRSAKDGVSSLD 126 Query: 116 KLLAGVSGATNCAFVNQPGQG-EALCGGCTAIALFNQANQAPG-------------FGGG 161 KL+A V F + G+G + + G A L + P G G Sbjct: 127 KLIADVPNGDKY-FTTRAGRGLDHIDFGEAARWLVHAHAFDPSGIKTGAVGDARVKGGKG 185 Query: 162 FKSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW-IKPIKSNE 219 + G+ + ++ G DLR T+LLN++ ++P+ + P W P Sbjct: 186 YPIGVAWAGSLGGVYLEGGDLRRTLLLNLVLADPDGDRYPD----HDLPPWERDPDGPAV 241 Query: 220 SIPASSIGFVRGLFWQPAHIELCDP----IGIGKC--SCCGQESNLRYTGFLKEKFTFTV 273 G V WQ + L + G+ C + +++ Sbjct: 242 RDITGPSGPVDLFTWQSRRVRLVNSGAQVTGVVLCNGDALESFNKQLLEPMTGWRYSEN- 300 Query: 274 NGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRV------ 327 + K GE + W + ++ D R Sbjct: 301 ------------QSKKAGETRHYPMVHDPEKSLWRGLRSLLGDVASSEPVAGRAIAPGVV 348 Query: 328 --AAVVNQFRNIAPQSPLELIMGG--YRNN 353 AA + + P P+ L G Y NN Sbjct: 349 EWAATLLDAGALPPDQPIRLHAVGMHYINN 378 >UniRef50_C3PF93 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=C3PF93_CORA7 Length = 581 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. Identities = 85/531 (16%), Positives = 158/531 (29%), Gaps = 96/531 (18%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINL--QSLYCSR-DQWRLSLPRDDMELAALALLVCIGQI 57 NLL + WI G Q ++L + ++ R D +++ + A L +L+ I Sbjct: 7 FNLLDEPWIKCMD---GTNQPVSLSIRDIFSGRGDAYKVVGDSPTQDYAVLRVLLAIFWR 63 Query: 58 IAPAK----------DDVEFRHRIMNPLTE-------DEFQQLIAPWIDMFYLNHAEHPF 100 + +D ++ + D + + + D F L PF Sbjct: 64 AHALELVESYADDNWEDFDWPEWFDELREQLVNEKRDDVVLEYLDGYEDRFDLLSPSAPF 123 Query: 101 MQTKGV--KANDVTPMEKLLAGVSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAP 156 MQ + K+ P+ ++ + F E+L A L + Sbjct: 124 MQVADLHTKSGATRPVSFIVPEAAD----DFFTMRTAEGRESLALDEAARWLIHTQAFDF 179 Query: 157 GF---GGGFKSGLRGGT--PV---------TTFVRGI-DLRSTVLLNVLTLPRLQKQFPN 201 G ++GG P+ T + G + T++LN L Q Sbjct: 180 SGIKSGAEGDPRVKGGKGYPIGTGWTGRTGGTIILGEGGILETLILNTPPSAVLDSQ-EG 238 Query: 202 ESHTENQPTWIKPIKSNESIPASS-------IGFVRGLFWQPAHIELCDPIGIGKCSCCG 254 + + + P W + + P SS G V WQ I L Sbjct: 239 GAVSADTPVWEREPDTAAQRPGSSDDIGAVPHGAVDLATWQARRIRLFFEG--------D 290 Query: 255 QESNLRYTGF-LKEKFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQI 310 + + + V G P +P + KKG + + W + Sbjct: 291 RAVQVLVSNGDRIPDAGKNVMG---DPMTPYRYSPNQSKKGTPAYYARPYDPTRTMWRAL 347 Query: 311 SRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQG 370 ++ + + + A + RN++ + L GY + ++ G Sbjct: 348 DALIALEDDPGFDNGKNKAP-KRPRNLSNLAA--LEADGYLDKSL----LDLALVSMEYG 400 Query: 371 WQQYGNVINEIVTVGL--------GYKTALRKALYTFAEGFKNKDFK------------G 410 Q+ I T+GL +R A+ T AE G Sbjct: 401 PQESSVASTFIATIGLPLVVLRADETGRKVRNAVRTSAEKTGKAAISLGWFAGQLLVAAG 460 Query: 411 AGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEML 461 + FY + E L + + A+E D + ++ + Sbjct: 461 GDYEFGSSTADRFYARLEPLFLTWMTGLISDNAEEWQIDWQKQVREQVLRD 511 >UniRef50_Q1J370 CRISPR-associated protein Cse1 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J370_DEIGD Length = 573 Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats. Identities = 90/575 (15%), Positives = 170/575 (29%), Gaps = 84/575 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 LL WIPV G + + SL + + R+ A L + + Sbjct: 4 FPLLDREWIPVIAGVGERRHVSLRDSLLRAAEFRRIDAGHPLQTAALYRLHLAVLHRALK 63 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDV-----TPM 114 +D + + D+ + + D F+L + PF+Q K + V + Sbjct: 64 GPRDAEQGADWYLAGHFPDDVAHYLDRYADRFHLFGPQ-PFLQVKDLDPALVGENFRSHW 122 Query: 115 EKLLAGVSGATNCAFVN---QPG--QGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGG 169 +L A A N +PG + +AL A+ L N A G + Sbjct: 123 TRLSAEEGSPNTTALYNVEARPGGDRSDALTPAQAALRLLEHQNFALGGLIKRFTTSARA 182 Query: 170 TPVTT----FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS-------- 217 PV T G +L T+ LN++ +P H + P W + + Sbjct: 183 APVATAGLFLAEGANLHQTLCLNLVP-------YPQAMHGPDLPPWEEAPLTVAQIRACY 235 Query: 218 NESIPASSIGFVRGLFWQPAHIELCDPI-------------------GIGKCSCCGQESN 258 + P + G+ W + L G G+ S G + Sbjct: 236 DPEQPRVAAGYASRYTWPSRSVLLLPEETPQGVVVRWVGFGAGVPLAGPGEGSGTGTDPM 295 Query: 259 LRYTGFL--KEKFTFTVN-----GLWPH--PHSPCLVTVKKGEVEEKFLAFTTSAPSWTQ 309 + K + F LW P + K + P Sbjct: 296 VSLRPSRDPKNEQPFPYKLRRERLLWRDLNALLPDPAAQVDENRQGKVKVRPGTPPKTVS 355 Query: 310 ISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSP-----------LELIMGGYRNNQASIL 358 +R V+ + + + + A + +++ G +Q Sbjct: 356 QARAVMRAVAERQRRVQPPVPFQDAPEDAWAEEGTPDARAAHPVIPVVVFGQLTDQGKAF 415 Query: 359 ERRHDVLMFNQG--------WQQYGNVINEIVTVGLGYKTAL----RKALYTFAEGFKNK 406 R + + + + TVG G + ++ L AE +K Sbjct: 416 AMRQETYTLPEAFIENPERFRDHVQAALTDASTVGEGLRRSVHLLAHALLKKDAERDPHK 475 Query: 407 DFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSV 466 D ++ AE ++ + L ++ + D L + + ++ + Sbjct: 476 D-DVGKLANQIPAEPTYWAGLDTPFRAYLLALDADPQAALR-DWHAALRRAALVGWHTAE 533 Query: 467 APYAHHPKLISTLALARATLYKHLRELKPQGGPSN 501 + + + A+ L K L LKP+ P + Sbjct: 534 EAAGMNAAGLRAVEKAQGPLLKALNTLKPKETPHD 568 >UniRef50_A5GBL8 CRISPR-associated protein, Cse1 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBL8_GEOUR Length = 545 Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats. Identities = 94/548 (17%), Positives = 161/548 (29%), Gaps = 84/548 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 MN+ D WIPV GG+ ++ +L S+ D++ R ++ + L +C+ Sbjct: 1 MNVAFDPWIPVVTITGGR-ELASLCSVLTEGDKFADLAVRPHERVSLMRLFLCVTHAALK 59 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK--------ANDV 111 KD E+ Q+ + W D F L H E P++Q G+K + Sbjct: 60 GPKDYDEWCEVPKRLPVAA--QKYLTEWKDSFELFHKERPWLQVAGLKGVEKEGSDSGKT 117 Query: 112 TPMEKLLAGVSGATNCAFVNQPGQ--GEALCGGCTAIALFNQANQAPGFG---------G 160 +P+ L +S N + GQ + + L N + G G Sbjct: 118 SPLSLLDFELSTGNNSTLHDHGGQLIVRQIEPERVVLNLLTFQNFSSGGGSPVAQWMTTK 177 Query: 161 GFKSGLRGGTPVT-----TFVRGIDLRSTVLLNVLTLPRL------------QKQFPNES 203 + G ++ RG L T+ LN+ T K+ Sbjct: 178 TLQVGNPDAPCLSQSMAHCLFRGASLAETIQLNLPTFETARRLYNSFATHKKDKEKQEWE 237 Query: 204 HTE------NQPTWI-----KPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSC 252 E +P W +S+ I A+ R L + L + C Sbjct: 238 RVEITVVEMGKPVWEFFPESPDSQSDSVINATKTYIGR-LVPISRWVLLFNESDQMYC-- 294 Query: 253 CGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISR 312 C + K F V+ K + S W ++S Sbjct: 295 CNG------FKYDTFKDGFPSEPTASVQLVTKRDKNGAESVDRKVVKIEPSKALWRELSA 348 Query: 313 VVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQAS----ILERRHDVLMFN 368 ++V + G +A N S + + +QAS + H F Sbjct: 349 LLVKRSA-FGLGGPLA-----MENAPHDSEFDFHVCAMTRDQASMDIALESVFHVTPAFQ 402 Query: 369 QGWQQYGNVINEIVTVGLG-------YKTALRKALYTFAEGFKNKDFKGAGVSVHETAER 421 + Y I + Y+ + E K K + A Sbjct: 403 FNFPVYQAEIVRAEGISRRLGWTVEVYRKEVDGDWANRVERAKEKWV--LKAKLQSIATI 460 Query: 422 HFYRQSELLIPDVLANVNF---SQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLIST 478 H++ E + ++ ++ A R L + SVA P+ + Sbjct: 461 HYWTTVEKNLALLMTHIESIGTDDAIPTREAWRKMLFATACDAY--SVACGQETPRQMRA 518 Query: 479 LALARATL 486 A L Sbjct: 519 FAKGWQKL 526 >UniRef50_C7LYW9 CRISPR-associated protein, Cse1 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW9_ACIFD Length = 540 Score = 111 bits (278), Expect = 5e-23, Method: Composition-based stats. Identities = 54/266 (20%), Positives = 88/266 (33%), Gaps = 25/266 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 +L + W+PVR R+G + ++L+ ++ + + +E A L L++ + I Sbjct: 7 FDLSSEPWLPVRFRDG-RRSEVSLRDIFVLAHTIVGFDVDFPTLEPALLRLVLALAYRIL 65 Query: 60 -PAKDDVEFRHRIM-NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPM--- 114 KDD E+ + +ED A W F L E PF Q ++ + Sbjct: 66 RGPKDDAEWGRLWEADRFSEDAIDDYFARWRHRFDLFSKEFPFFQVADLEPAGKGGVKTA 125 Query: 115 EKLLAGVSGATNCAFV--NQPGQGEALCGGCTAIALFNQANQAPGF---GGGFKSGLRGG 169 L+A N AL A L + G ++GG Sbjct: 126 NSLVAYAPSGNNVPVFTPITDRTELALSPAEAARWLVERHAFGSASDKTGAKGNPKVKGG 185 Query: 170 T--------PVTTFVR--GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE 219 FV G LR T+LLN++ E ++ P W + Sbjct: 186 KDTPAIGYLAWIGFVAPVGQTLRETLLLNLVPWQYRNLIRGGE---DDVPAWERDPLGPT 242 Query: 220 SIPASSIGFVRGLFWQPAHIELCDPI 245 + + G WQ I L Sbjct: 243 RVMRAPDGVCDLFTWQGRRIRLFPER 268 >UniRef50_B1VIY3 CRISPR-associated protein n=1 Tax=Corynebacterium urealyticum DSM 7109 RepID=B1VIY3_CORU7 Length = 560 Score = 111 bits (276), Expect = 9e-23, Method: Composition-based stats. Identities = 85/544 (15%), Positives = 157/544 (28%), Gaps = 117/544 (21%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYC-SRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ + WI R G Q+++++ ++ S + + A L +L+ I Sbjct: 7 FNLVHEPWIKCRTAEGN--QLLSIRQVFDGSAKPLAVVGDSPTQDYAVLRVLLAIFWRAH 64 Query: 60 ---------PAKDDV--EFRHRIMNPLTE-------DEFQQLIAPWIDMFYLNHAEHPFM 101 + E+ ++ + +A + F L PFM Sbjct: 65 YHDFVRRYPSPRSRKKFEWETWFLDTRETLRETGKDEVVLGYLADVENRFDLLDPTVPFM 124 Query: 102 QTKGVKANDVTP--MEKLLAGVSGATNCAFVNQ--PGQGEALCGGCTAIALFNQANQAPG 157 Q + T + ++L + F + PG+ + QA G Sbjct: 125 QVADLHTAKNTSNEIRRILPDSE---DSYFTMRTGPGRVSISYDEAARWLIHAQAYDYSG 181 Query: 158 ------------FGGGFK-----SGLRGGTPVTTFVRGIDLRSTVLLNV---LTLPRLQK 197 G G+ SGL GG T +RG +L T++LN + Sbjct: 182 IKSGAVGDPRVKGGRGYPIGQGWSGLTGG----TVIRGANLLETLVLNTTESCIPTAAET 237 Query: 198 QFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQES 257 P + P + P G WQ I L G + Sbjct: 238 DKPVWEREPDT---AAPQDLEATQPK---GPADLATWQSRRIRLFTED--------GVVT 283 Query: 258 NLRYTGF-LKEKFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQISRV 313 + + V G P +P + KKG + +W + + Sbjct: 284 RVLVSNGDRIPNAGLNVFG---DPMTPYRFSKNKSKKGFEAYYPRPYDEQRTTWRSLDAL 340 Query: 314 VVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRH----DVLMFNQ 369 V R + P+ P + +N A ILE R ++ Sbjct: 341 VAVD----------GDPGFSSRELPPKRPENV------DNVARILEDREVLDLQIVSMAY 384 Query: 370 GWQQ--YGNVINEIVTVGL------GYKTALRKALYTFAEGFKNKDF------------K 409 G Q YG +++ + + + + A+R + AE Sbjct: 385 GPQSSTYGTIVSSSIGLPVHLLRNSEWSRAVRNDVRNSAEATGRAATAVGAFAGQLYVAA 444 Query: 410 GAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQ----LCEMLFNQS 465 G A Y Q E + L ++ + ++ + + + + + L N + Sbjct: 445 GGEYEFGVDAADRLYAQLEPRFHNWLRGLDPKNMAQEVSSWQHTVREAALGIAQDLLNGA 504 Query: 466 VAPY 469 Sbjct: 505 GQKA 508 >UniRef50_A4XYU2 CRISPR-associated protein, Cse1 family n=3 Tax=Pseudomonadaceae RepID=A4XYU2_PSEMY Length = 525 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. Identities = 84/498 (16%), Positives = 149/498 (29%), Gaps = 76/498 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 LL + W+ VR +G ++ L+ S + L+ +A LL+ I Sbjct: 17 FTLLDEPWLAVRMHDGQVGELGLLELFERSGEIGALAETSPPSLIAQYRLLLAITHRAIT 76 Query: 61 AKDDVEFRH----RIM-NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT--- 112 + R N L + + W + F+L H ++PFMQ + + T Sbjct: 77 QA-QGRWTDAERMRWHQNGLPLAAIRDYLERWRERFWLFHPQYPFMQVAALADAEETRDK 135 Query: 113 --PMEKLLAGVSGATNCAFVNQ--------PGQGEALCGGCTAIALFNQANQAPGFGGGF 162 P ++ + + G +ALC L G Sbjct: 136 LKPWTQISLASANGNAPVVFDHSCDLAPRSIGAADALCT-----LLGFLQFTPGGLVKTL 190 Query: 163 KSGLRGGTPVTT---FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE 219 + + G T G L ++ L + P ++ E+ P W + S Sbjct: 191 RDSDKAGALANTAAVMPMGDSLAQSLCLALHP--------PTQTGHEDLPAWERSAPSIA 242 Query: 220 ---SIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 P + G Q + L + V Sbjct: 243 QLCGEPELATGPNDRYTRQSRAVLLLADD---------ERRVQWIRFAAGLALGDDVQA- 292 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN 336 P P + G L+FT W + ++ D + ++ AAV+ N Sbjct: 293 -PDPMASYRA----GSNSLVRLSFTEGRALWRDLPALLPDAEGKA---SQPAAVLEWAAN 344 Query: 337 IA---PQSPLELIMGGYRNNQASILERRHDVLMFNQGW---QQYGNVINEIVTVGLGYKT 390 + L++ G ++QA +L R + + + N + V + Sbjct: 345 LQFYLGNGVQPLLIAGLASDQAKLLRWRSERIALPAKLLASPDHANELRRYVRDAEELFS 404 Query: 391 ALRK-ALYTFAEGFKNKDFKGAGVSVHETAER---------HFYRQSELLIPDVLANVNF 440 ALRK A AE + A F+ +E + V+A + Sbjct: 405 ALRKLATGMLAETLPDPG----SKDTWARARSLIDAGPASALFFAGAERQLGRVMALLGS 460 Query: 441 SQADEVIADLRDKLHQLC 458 + D+ A R LH+ Sbjct: 461 DELDQAEALWRQSLHKAA 478 >UniRef50_Q4JWJ7 Putative uncharacterized protein n=2 Tax=Corynebacterium jeikeium RepID=Q4JWJ7_CORJK Length = 561 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 52/379 (13%), Positives = 104/379 (27%), Gaps = 59/379 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL WI +G ++ + S + ++A LL+ + Sbjct: 22 FSLLDQPWILTTLTDGSAAELSLREIFDGSHSVASIRGDSPLQDVAIYRLLLTVYWCAHR 81 Query: 61 AK---------DDVEFR-HRIMNPLTED---EFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 + D E+ R+ + + + D F L + PFMQ + Sbjct: 82 QELLSDPGTELDMAEWIPDRLEAAAENEPDNTVLNYLERYADRFDLLDPKQPFMQVADLH 141 Query: 108 AND--VTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIA---LFNQANQAPG----- 157 + T + +++ + AL A ++ QA G Sbjct: 142 TSKNATTDVRRIVPDFED----DYFTLRAGDGALSLTYAEAARWLIYVQAYDYSGIKSGA 197 Query: 158 -------FGGGFKSGLRGGTPVT-TFVRGIDLRSTVLLNVL-TLPRLQKQFPNE-SHTEN 207 G G+ G +T T + G +L+ T+ LN + + P + Sbjct: 198 VGDPRVKGGRGYPIGTGWTGAITATIILGENLQETLALNTTGGALQAKNDHPVWEREPDT 257 Query: 208 QPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGF-LK 266 + P + P G L WQ I L G + + + Sbjct: 258 SAQRLDPANKDGIYPK---GPAEILTWQSRRIRLFPDG--------GLITQVLVSNGDRI 306 Query: 267 EKFTFTVNGLWPHPHSPCLVTVK---KGEVEEKFLAFTTSAPSWTQISRVVVDK--IIQN 321 V P +P + K + W + ++ + + + Sbjct: 307 PNANANVQD---DPMTPYRFSKNKSTKTLDVYYPKPLDSQRTMWRSLEPLIALETDPVYD 363 Query: 322 ENGNRV--AAVVNQFRNIA 338 ++Q + Sbjct: 364 AKNRAPKRPKTIDQLAYLK 382 >UniRef50_C0VRW0 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51867 RepID=C0VRW0_9CORY Length = 553 Score = 108 bits (270), Expect = 5e-22, Method: Composition-based stats. Identities = 89/505 (17%), Positives = 158/505 (31%), Gaps = 76/505 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL + WI +G V++ L R+ + A + LL+ I Sbjct: 8 FSLLDEPWILCESLDGTPVELGLLDVFDGKHPIKRVRGDAPTQDSAIVGLLLPIYWRAHT 67 Query: 61 -----------AKDDV--EFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 E + + + ++ + + D F+L PFMQT ++ Sbjct: 68 GDLTVFNGDNLPFSTWFAEHLEQARSGVADEAVLNYLETYRDRFFLVGGPAPFMQTPTLE 127 Query: 108 AN--DVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIA---LFNQANQAPG----- 157 + P+ +L+ + + + A A + QA G Sbjct: 128 TKNMEFLPLSRLIPEAE----SEYFSMREEDAAETVPLGEAARWIVTTQAYDYSGIKPGA 183 Query: 158 -------FGGGFKSGLRGGTPVT--TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQ 208 G G+ G+ G + +T T + G T++ N S E++ Sbjct: 184 IGDDRVKGGRGYPIGV-GWSGMTGRTLIVGNTFAETLVYNTTAD--------CISSPEDK 234 Query: 209 PTWIKPIKSNESIP-ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLK- 266 P W +P+ + + G L WQ I + + + + T K Sbjct: 235 PCWERPVDTAAVREFPAPKGAADLLTWQTRRIRVRYEG--------DRATGVIVTNGDKI 286 Query: 267 EKFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQISRVV-VDKIIQNE 322 V G P +P + KKG F T+ W + +V +D Q Sbjct: 287 PDAGANVFG---DPLTPYRYSKNKSKKGHTVYYPQCFDTNRTMWRSLVPLVALDSDPQFT 343 Query: 323 NGNRVAA-------VVNQFRNIAPQS--PLELIMGGYRNNQASILERRHDVLMFNQGWQQ 373 +R + F ++ + PLELI Y N ++ H L Q Sbjct: 344 EKDRAPKRPRNLDSLSRVFNDLGIEETIPLELISASYGPNDSTPSTTVHAGLNLPSPILQ 403 Query: 374 YGNV--INEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLI 431 NV ++IV+ TA + + G + E Sbjct: 404 PENVELRDQIVSQATATSTAAVALGSFAGQLLQAA---GGDYEFQPAPTDGALAELEHRF 460 Query: 432 PDVLANVNFSQADEVIADLRDKLHQ 456 L+ V+ Q DE I +D +++ Sbjct: 461 NAWLSTVDEDQLDEQITQWQDIVYE 485 >UniRef50_C4FG91 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG91_9BIFI Length = 573 Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats. Identities = 105/571 (18%), Positives = 193/571 (33%), Gaps = 98/571 (17%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL + W+ V R+G +I Q + D LS +L + L + I Sbjct: 6 FSLLDEPWVQVVYRDGHPGEISLRQIFSDAPDIKELSGDIPQQKLPLIRLFLAILYRAYR 65 Query: 61 A--KDDVEFRHRIMNPLTE-----DEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---AND 110 ++ + R + D + + W D F+L PF Q ++ A Sbjct: 66 VVGVNEEQMRELWKEIFSSKHFDMDIVSRYLDKWEDRFFLIGER-PFFQIPDLEYVGAKP 124 Query: 111 VTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIA--LFNQANQAPGFGGGFKSGLRG 168 +P+ +++A V F + + A +F QA G K+ ++G Sbjct: 125 YSPVSEMIADVPKPDKYLFSMRSMEETDSISFAEASRWLVFMQAYDI----AGIKTPVKG 180 Query: 169 -----GTPVTT---------------FVRGIDLRSTVLLN-VLTLPRLQ-KQFPNESHTE 206 G V + + G +L T++LN VL + +++ + Sbjct: 181 NTYVKGGKVYSPKGMSTGWLGAIGGLYAEGRNLFETLMLNWVLYDTKYDSERYRLFGNER 240 Query: 207 NQPTW-IKPIKSNESIPASSI-GFVRGLFWQPAHIELCD----PIGIGKCSCCGQESNLR 260 + P W I S + S+ G V+ + WQ + L +G C G Sbjct: 241 DVPVWEQNNIPSPDLDNQSTFAGPVQAMTWQSRRLRLVPNEDVTRIVGVVYCYGD----V 296 Query: 261 YTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQ 320 + + F W + KKG K + S W + ++ ++ Sbjct: 297 VSPDDTD--GFEKMTAWR-----SIPQQKKGLPTHKPVMHDASKALWRGLEPILC---VK 346 Query: 321 NENGNRVAAVVNQFRNIAPQ------SPLEL-------IMGGYRNNQASILER-RHDVLM 366 +++ R ++ I + L L ++ G +Q+S+ E D L Sbjct: 347 DDDDCRPG-LIRWLEEIRTEIFDSEDHVLNLVTIHAQGMVYG---SQSSVFETGIDDTLS 402 Query: 367 FNQ-GWQQYGNVINEIVTVGLGYKTALRKALYTF------AEGFKNKDFKGAGVSVHETA 419 N ++ + I ++ V A+ +AL F + G K K K + E Sbjct: 403 LNTIMFRHDYDGIAAVIDVAKSADNAV-QALTQFIRNLQMSAGDKGKSAK-VENRIEERI 460 Query: 420 ERHFYRQSELLIPDVLANVNFSQADEVIA-DLRDKLH----QLCEMLFNQSVAPY--AHH 472 Y + + L D LA + S+ + D +DK+H ++ +QS P H Sbjct: 461 RESAYTELDRLCRDELAAFDKSKDFIKYSNDWKDKIHRRLLEMERDYLDQSSVPVFDEHE 520 Query: 473 PKL-----ISTLALARATLYKHLRELKPQGG 498 S + A +L + G Sbjct: 521 FDNKRKSHTSNMMKATRAQLSFQSKLNRELG 551 >UniRef50_A8M401 CRISPR-associated protein, Cse1 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8M401_SALAI Length = 560 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 66/383 (17%), Positives = 119/383 (31%), Gaps = 38/383 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWR-LSLPRDDMELAALALLVCIGQIIA 59 +L WIPV G+ ++++L L+ + R ++ A L LL+ I Sbjct: 8 FSLSGQPWIPVLDL-AGRRRLVSLAELFAQAAELRAVAGDLPTQTSALLRLLLAILHRAV 66 Query: 60 -PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV---KANDVTPM 114 +D+ ++ P L + + + D F L H PF Q + K NDV + Sbjct: 67 DGPEDERVWQGLWRQPDLPAGDVVDYLDEYRDRFDLLHPVTPFYQVADLRTQKQNDVFGL 126 Query: 115 EKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAP-------------GFGGG 161 ++ +A V + L A+ L + G G Sbjct: 127 QRFIADVPNGAPYLTTRLGPGLQRLTPAEAAVWLVHCQAYDTSGIKSGAVGDPRVSGGKG 186 Query: 162 FKSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW-IKPIKSNE 219 + G G + ++ G LR T+LLN++ L Q + + P W P E Sbjct: 187 YPIGPGAGGSLGLVYLEGRTLRETLLLNLVPLDNAYLQ---QDPERDSPMWERDPHGPAE 243 Query: 220 SIPASS--IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 G + WQ I L G + + +T Sbjct: 244 EAERDRGPHGPLNLYTWQSRRIRLF-GDQTGI-------TGAMIANGDRITWTNLHR--- 292 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI 337 P S + + + + S T+ + ++ A + Sbjct: 293 KEPMSGWRRSPHQEKKLVLPTVYLPSLHDHTRALWRGLTAVLPTSAEKPGADAPTRRPPA 352 Query: 338 APQSPLELIMGGYRNNQASILER 360 Q L + G +++ + R Sbjct: 353 VSQWLAGLRVTGLIDDRYRVTTR 375 >UniRef50_D1NTH8 CRISPR-associated protein, Cse1 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTH8_9BIFI Length = 566 Score = 106 bits (263), Expect = 3e-21, Method: Composition-based stats. Identities = 81/552 (14%), Positives = 166/552 (30%), Gaps = 88/552 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ D WIPV + + + +S + + + L I Sbjct: 8 FNLVNDPWIPVVYDDATRAVVSLRESFEQASHIVAIVTDNPLQKAVLYRLFEAIWMRAYE 67 Query: 60 -------PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---A 108 P++ ++ + + + F L ++ PF Q ++ Sbjct: 68 MEQIDVAPSECYALWQEFWDLGEFDLEIINAYLNKYEAKFELFDSKTPFYQVPDLEYVGK 127 Query: 109 NDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRG 168 +E ++ V T + P E L A L G KS + G Sbjct: 128 KAYDGVETMILDVPKGTGLFSLRNPETLEGLDFAEAARQLLTIMAYDT---AGIKSPVEG 184 Query: 169 GTPV---------------------TTFVRGIDLRSTVLLN-VLTLPRLQKQFPNESHTE 206 + + + + G +L T++LN V++ P + T Sbjct: 185 FSAINKGKAFAPQGVPSVGWLGNIGSVWAEGSNLFETIMLNWVISNPLTSE---LSESTY 241 Query: 207 NQPTWI--KPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGF 264 ++ W P + + + G V L Q I L C+ G + + Sbjct: 242 DRAPWELDTPPEHDLVVRDGFRGMVDALTVQSRRIRLV-------CNEAGTQVIGLVICY 294 Query: 265 L-----KEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK---FLAFTTSAPSWTQISRVVVD 316 ++ W + KKGE F W + ++V Sbjct: 295 GDIIRPAYTQIAEMHTSWR------VSKPKKGEGNAPVVMPRTFEAGKALWRSLGPLLVA 348 Query: 317 KIIQNENGNRVAAV---------VNQFRNIAPQSPLELIMGGYR-NNQASILE--RRHDV 364 +EN R + + + R + +I G Q+S+ E + Sbjct: 349 ---DSENSARPGVLRWLDRLFDEIPELREKHLLQTIGIIAQGMTYGTQSSVFEASYDDSL 405 Query: 365 LMFNQGWQQYGNVINEIVTVGLGYKTALRK----ALYTFAEGFKNKDFKGAGVSVHETAE 420 + ++ + G++I ++ V + +++ A + D K Sbjct: 406 ELSSEMLRSGGDIIGRVLDVVAATEQSVKDLGTFAFRLEVASGADSDSKNRRTDTRMQIA 465 Query: 421 RHFYRQSELLIPDVLANV-NFSQADEVIADLRDKLH----QLCEMLFNQS-VAPYAHHPK 474 Y + + + LA + A +D++H +L + +QS ++ H + Sbjct: 466 EEAYAALDGVFRERLAQYRSDDDALAYCKSWKDEIHRLLLRLAQDYLDQSPTQSFSFHEE 525 Query: 475 LISTLALARATL 486 + +A L Sbjct: 526 NGRRVDAGQAML 537 >UniRef50_B6IWM6 CRISPR-associated protein, CT1972 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM6_RHOCS Length = 600 Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats. Identities = 70/381 (18%), Positives = 118/381 (30%), Gaps = 58/381 (15%) Query: 9 IPVRPRNG-GKVQIINLQSLYCSRDQ--WRLSLPRDDMELAALALLV-CIGQIIAPAKDD 64 IPV R+G G + L S + RL P D L+ I Q +D Sbjct: 8 IPVDRRSGPGVATPLELTSRHGDDPILGVRLGHPLADRG---FEFLMRDILQAALAPEDA 64 Query: 65 VEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKG----VKANDVT-------- 112 +R ++ P + +AP+ + F L+H HP +Q + + D Sbjct: 65 TAWRRMLVEPPGPEALAAALAPYRETFRLDHPTHPALQVRPAPERLAEADAKKPAGSRKP 124 Query: 113 -----------------PMEKLLAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQ 151 + LL + N F + G + G L+ Sbjct: 125 APEAEEDGEEEEEEGPVGIGALLPDLPTKNAEKRNKDFFTRRGSIRTIGAGAVLPILYAN 184 Query: 152 ANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRL---QKQFPNESHTENQ 208 G + S G T + + G L T+ LNVLT +P Sbjct: 185 QVLFIDKKGSYYSLPHGRTCILFQLVGRTLWETIWLNVLTRGTEGGGDAVWPARPDDPTA 244 Query: 209 PTWIKPIKSNESIPASSIGFVRGL---FWQPAHIE-----LCDPIGIGKCSCCGQESNLR 260 W+ + S+ +++ R + PAHI L P I +C G Sbjct: 245 FPWLDSGLRDMSLDSNNARATRSMSRATLHPAHIPMTRRYLLAPPVIDRCDLTG-MDGPA 303 Query: 261 YTGFLKEKFTFTV--NGLWPHPHSPCLVTVKKGEVEEKFLAFTTS--APSWTQISRVVVD 316 + F + W ++ + K E +FL+ W + + Sbjct: 304 FKSFSRWPRGLQYETPDWWF--YAAVRLENPKKPDEPQFLSANGPLRFNDWIETAIFSNA 361 Query: 317 KIIQNENGNRVAAVVNQFRNI 337 K ++ + QFR++ Sbjct: 362 KNKNSKIIIHQPPSLRQFRSV 382 >UniRef50_D1A6Q3 CRISPR-associated protein, Cse1 family n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A6Q3_THECD Length = 722 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 85/529 (16%), Positives = 157/529 (29%), Gaps = 63/529 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQI--INLQSLYC-SRDQWRLSLPRDDMELAALALLVCIGQI 57 +L ++ W VR + G + + L+ L + + L++ A LL + Sbjct: 7 FDLALEPWAEVRWKEAGPDRPSRLGLRDLLVHAHEIEALAITPPPALSAMYRLLYALTAR 66 Query: 58 IAP----AKDDVEFRHR----IMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN 109 + D ++ R PL D A F L H + PF+Q + Sbjct: 67 VTGLDENPDGDGDWLDRRAEIFGEPLAPDAVDAYFAEHEGRFDLFHPQRPFLQDPRLADP 126 Query: 110 DVTP----MEKLLAGVSGATNCAFVNQPGQGEAL--CGGCTAIALFNQANQAPGFGGGFK 163 V P + KL+ G N + + ++L P + Sbjct: 127 AVCPKSAGVNKLVLGRPAGNNSVWFGHHWDASPIPVPTPDAFLSLLVWLYYGPSGRCSTR 186 Query: 164 SG------------LRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQF--PNESHTENQP 209 + LRG ++ G L T+L + P ++ P + P Sbjct: 187 THADVTAADVSAGPLRGS--LSYHPEGDTLLETLLAGLTPPPEGLRRADDPCPWELADLP 244 Query: 210 TWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKF 269 + P ++ P G L H L P G+ ++ T + K Sbjct: 245 DPLAPPRTPNPYP----GPCTRLTGGWQHALLLVPDDTGR-----HVTDAYITWGHRGKL 295 Query: 270 TFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA 329 T + K+G + + W + ++ R A Sbjct: 296 PSTNDAY------VIFQISKQGNLYARPA--DAGRALWRDLDGLLDLPTTATGTQPRRPA 347 Query: 330 VVNQFRNIAPQSPLELIMGGYRNNQASI--LERRHDVLMF--NQGWQQYGNVINEIVTVG 385 V + + + I + L+F N I ++ T G Sbjct: 348 VFGTGLDDLGSFKVRALGFEQDGKTKDIQFISAVTPPLLFRINDEDLATARRIGDMRTAG 407 Query: 386 LGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADE 445 Y L A+ + K + E A ++ ++E + L N ++ D Sbjct: 408 ELYGGRLEYAVKRAWAAVVDDKPK--DCAWAEHAAAAYWPKAEEIFWTRLRNQDY---DR 462 Query: 446 VIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELK 494 R ++ +F+Q +A + + AR LY R+ K Sbjct: 463 HWQSFR----RVAISVFDQITRDHARGARTARAIEEARLELYGGARKAK 507 >UniRef50_A8LYZ4 CRISPR-associated protein, Cse1 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ4_SALAI Length = 495 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 77/529 (14%), Positives = 159/529 (30%), Gaps = 79/529 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 +L WIPV ++ +++++ L+ L+ + + L++P +L I + Sbjct: 5 FDLTDQPWIPVVAKS--ELELVGLRELFVRAAEFDDLAVPVPPAASGLWRILYAITARVT 62 Query: 60 PAK--DDVEFRHRIMN-----PLTEDEFQQLIAPWIDMFYLNHAEHPFMQT--KGVKAND 110 ++R R + A + D F L A P+MQ V+ Sbjct: 63 GLDMLRGPQWRQRQERLLDQGGFAAGDVDAYFAKYSDRFDLFGALRPWMQDPRLAVECPK 122 Query: 111 VTPMEKLLAGVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGFG------GGF 162 + + KL+ + + + AL G A L Q G Sbjct: 123 SSGVNKLVFDRPAGNSQVWFGHHTDADAVALAPGEAAWYLIAQLYYGASGRCSSREVAGQ 182 Query: 163 KSGLRGGTPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN 218 K P+ + G +L ++++ V + + W + + Sbjct: 183 KFANSNAGPLRGVMSYHPLGENLFESLVVGVPPGVSS-----GQDEGLDLCPWERDELPD 237 Query: 219 ESIPASSIGFVRG-LFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 S+ + G L + H L P G+ S+ T W Sbjct: 238 PLGAPWSVSWPCGALTGRARHAVLLVPDAAGE-----AVSDAYVTW------------AW 280 Query: 278 PHPHS----PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ 333 P + P +V + E L S W + ++ E ++ Sbjct: 281 RLPGAASPDPYVVRRQNKEGGWYQLPADDSRALWRDVDALL---GGNTEVKTHRPDIMAV 337 Query: 334 FRNIAPQSPLELIMGGYRNN-QASILERRHDVLMFNQGWQQYGNVIN---------EIVT 383 ++ + G+ + QA + + GW + + + Sbjct: 338 AADL--GLDGRVRAYGFDQDGQAKDRQWFIALTPPVLGWLSERDPVTADGVALLTRAAES 395 Query: 384 VGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA 443 +G ALR+A + T E +++ ++E + + + + F++ Sbjct: 396 IGRRVGAALRQAWRELVSVKDREG------PWAHTGEAYYWTRAEAVFWEHVRDGRFAEG 449 Query: 444 DEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRE 492 +L + + A P+L+ A L LR+ Sbjct: 450 G-------RAFARLGHEAIDHAADGDASSPRLVRATQTAHRLLTTPLRK 491 >UniRef50_C6HV92 CRISPR-associated protein, Cas1 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV92_9BACT Length = 498 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 82/234 (35%), Gaps = 31/234 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV +++L+ ++ L ++A L+ I Q A Sbjct: 7 FNLIDEPWIPVADAG-----LVSLKDVFLRDSLRALGG-NPVQKIAMTKFLLAIAQAAAT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 ++D E+ L E + W D F+L + PF+Q +GV+ + +L Sbjct: 61 PENDDEWATMGPKGLAERCLS-YLEKWHDRFFLFGEQ-PFLQMEGVRTAALQSFGAVLPE 118 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIAL----FNQANQA-------PGFGG----GFKSG 165 +S + + I++ F +A PG+GG K Sbjct: 119 ISTGNTSLLFQSQIEKKLSEADKALISIQLSGFGLGGKADNSLVLTPGYGGKRNPKGKPS 178 Query: 166 LRGGTPVT-------TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWI 212 + +F G +L T+ +N+ + ++ S P W Sbjct: 179 VSKSGAWVGYKGYLHSFFFGDNLLKTLWVNLFSRSQIGLMSVYPS-GVGTPPWE 231 >UniRef50_B5GY59 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY59_STRCL Length = 594 Score = 100 bits (250), Expect = 8e-20, Method: Composition-based stats. Identities = 70/457 (15%), Positives = 132/457 (28%), Gaps = 54/457 (11%) Query: 50 LLVCIGQIIAPAKDDVEFRHRIMNPL-----TEDEFQQLIAPWIDMFYLNHAEHPFMQTK 104 LL + + + E+ P + + + D F L PF Q Sbjct: 2 LLPVVVDALGFPETPEEWAEHFHAPDGFTGQAAERLTEYLDEHRDRFGLFDPVDPFAQVG 61 Query: 105 GVKAN-DVTPMEKLLAG-VSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGF-- 158 G++ D T L+ + N F + GQ L G A L + PG Sbjct: 62 GLRTGKDETRNSALIVATAASGNNVPFWSARTDGQAPRLSPGRAAHWLLHTHCWDPGAIK 121 Query: 159 -GGGFKSGLRGG-------TPV----TTFVRGIDLRSTVLLNV-LTLPRLQKQFPNESHT 205 G R G P+ G L ++ LNV + RL P Sbjct: 122 TGAFGDPRARAGKVMGNPTGPLGALGLVLPMGRTLYESLWLNVPFGVTRLAGDLPQWRRR 181 Query: 206 ENQPTWIKPIKSNESIPASS---IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYT 262 + + ++ +S + S G + WQ I L G+ R Sbjct: 182 DREGP-VEETRSTATPGWDSRPPRGPLDAWTWQARRIRLVPETAPQDADGDGEPEVNRVV 240 Query: 263 GFLKEKFTFTVN-----GLWPHPHSPCLVTVKK--GEVEEKFLAFTTSAPSWTQISRVV- 314 ++ + + + K ++ + +W + ++ Sbjct: 241 VAAGDRLRLQPDHEFHTAWTVDSQTVHRKRLAKDPDALQIRPRRHRAGRAAWRGLDALLA 300 Query: 315 VDKIIQNENGNRV------AAVVNQF----RNIAPQSPLELIMGGYRNNQA--SILERRH 362 V+ ++ V A ++ + + P PL L + G N +I + H Sbjct: 301 VEGSTWQQDATEVGQGFHTAQILVKLAEAGAELPPDYPLRLELTGIAYNSKFSAIEDTFH 360 Query: 363 DVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKN--KDFKGAGVSVHETAE 420 D L + ++ + + L A+ + G E Sbjct: 361 DELPLPVAALRRDGLVRAALIGAVAQAERLADAVNRLVADLRRAAGARPVPGGEWQHPGE 420 Query: 421 RHFYR----QSELLIPDVLANVNFSQADEVIADLRDK 453 + LL ++ +F + DE++ +K Sbjct: 421 SLLHALDPVVRLLLRLLRTSDEDFDRVDELLRAWEEK 457 >UniRef50_Q1EQS6 Putative uncharacterized protein n=2 Tax=Streptomyces RepID=Q1EQS6_STRKN Length = 544 Score = 97.9 bits (242), Expect = 7e-19, Method: Composition-based stats. Identities = 52/282 (18%), Positives = 88/282 (31%), Gaps = 42/282 (14%) Query: 1 MNLLIDNWIPVR-------PRNGGKVQIINLQSLYC-SRDQWRLSLPRDDMELAALALLV 52 NLL + WIPVR G+ I L+ L S + L++ A L +L Sbjct: 6 FNLLDEPWIPVRWTPTELSSAVAGRPDRIGLRELLARSPEIAGLAIAEPPAHSALLRILY 65 Query: 53 CIGQIIAPAKDD--VEFRHRIMNP-----LTEDEFQQLIAPWIDMFYLNHA--EHPFMQT 103 + + + ++ R + L +A + F+L P+MQ Sbjct: 66 ALTARVTGLDEAGPGDWGVRRADVRDAGELPPQGISDYLATYRHRFFLYDPDGGRPWMQD 125 Query: 104 KGVK---ANDVT-PMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQ---ANQAP 156 + D T + KL+ N ++ A + A+ N P Sbjct: 126 ARLAHECDPDNTAGVNKLIVTRPSGNNHSWFEHTSDA-APGLPTASEAVLNLLVWHYYGP 184 Query: 157 GFG------GGFKSGLRGGTPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTE 206 G KS P+ T + G L T+L ++ K + Sbjct: 185 SGRCSSREVNGAKSASAKAGPLRTALSYHPEGETLFETLLAGLVPPKSTVK------SAQ 238 Query: 207 NQPTWI-KPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGI 247 +Q W + +++P + G L H L P Sbjct: 239 DQCPWEWHDLPDPDAVPVAPAGPCARLTACSQHALLLVPQEP 280 >UniRef50_B8FDH6 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDH6_DESAA Length = 494 Score = 97.9 bits (242), Expect = 7e-19, Method: Composition-based stats. Identities = 79/494 (15%), Positives = 148/494 (29%), Gaps = 90/494 (18%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV R +++L+ ++ L ++A L + I Q Sbjct: 5 FNLVDEEWIPVAGRG-----LVSLRDVFTDPSLEALGG-NPLEKIALTKLFLAITQTAHT 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D E+ + ++ + D F+L FMQ VKA D+ + +L Sbjct: 59 PADTDEWLAMGAPGMASRA-REYLEAHKDCFWLYGDRP-FMQMPAVKAADIQNVSAVLPF 116 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF---------GGGFKSGLRGGTP 171 V+ + + L A+ L + A G + + P Sbjct: 117 VATGNTTQVF-ESQKDRDLSDPEKALVLVFLSCFALGGKKVDAKIVLSPSYSEKSKTAKP 175 Query: 172 VTTF---------VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP 222 T + G L+ T+ LN+ TL + + P W E +P Sbjct: 176 GTCLGFQGFLHNFLVGGSLQETIWLNLFTLEEIGRLEQFP-EGLGVPPW-------EKMP 227 Query: 223 ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHP-- 280 + F L C R+ F E F HP Sbjct: 228 EGEDCSLARSFKNSYMGRLLP--------LC------RFALFADENF--HYVEGIFHPGY 271 Query: 281 ----HSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV---VNQ 333 + P + + + W +++ ++ + + + + Sbjct: 272 KDGAYDPSMAVDNTKKPRVLW--VDPEKRPWRELTSLLSFIQADSPRSFDCPQLRSGILK 329 Query: 334 FRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALR 393 RN P S ++ GG R + N G +QY + ++ V + + Sbjct: 330 ARNGGPGS-FKIWSGGLR-------------VSSNAG-EQYVSGADDFVESEIRLSSQWL 374 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDK 453 + + + +G SV+ ++ +F Q + + A + Sbjct: 375 GETWFASLKGEMDALEGLARSVYGSSLAYFKHQK-------------ADGKKQAAQASNL 421 Query: 454 LHQLCEMLFNQSVA 467 QL E F V Sbjct: 422 FWQLSERNFQDLVD 435 >UniRef50_C2CN11 CRISPR-associated protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CN11_CORST Length = 562 Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 63/387 (16%), Positives = 125/387 (32%), Gaps = 55/387 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCI---GQI 57 +LL + WI +G + + Q + D L + A L +L+ I Sbjct: 9 FSLLDEPWIAAVGSHGEPLLVSIRQIFDGTHDIAELRGDSPAQDYAVLRVLLAIFWRAHS 68 Query: 58 IAPAK-----DDVEF----RHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-- 106 +A E+ R + + + + + + F L ++HPFMQ + Sbjct: 69 VAKPAGKTKFSMAEWFVKARADALEGAADIKVLSYLDGYANRFNLFDSDHPFMQVADLHT 128 Query: 107 KANDVTPMEKLLAGVSGATNCAFVNQPGQ-GEALCGGCTAIALFNQANQAPGF---GGGF 162 + V+P+ ++ + A N F + G+ E L G A L G Sbjct: 129 EKGSVSPINRI---IPEAENEFFTMRAGKPLETLSFGEAARWLVYVHAYDYSGIKSGAVG 185 Query: 163 KSGLRGGT--PV---------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW 211 S ++GG P+ T++ G +LR T+ LN + PN+ + Sbjct: 186 DSRVKGGRGYPIGTGWTGMTGGTYLIGANLRETLALNT---TEACLRTPNDKPVWEREPD 242 Query: 212 IKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGF-LKEKFT 270 ++ +I G WQ + L + + + Sbjct: 243 TAAERNGGAIHIG--GPADLATWQTRRVRLHREN--------NEVVAVLVSNGDRIPDAG 292 Query: 271 FTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENG----NR 326 G P +P K + ++ F+AF + ++ +I E+ Sbjct: 293 LNAFG---DPMTPYR--YSKNQSKKDFVAFYPRPYDAGRTMWRSLEPLIAMESDAPYLRS 347 Query: 327 VAAVVNQFRNIAPQSPLELIMGGYRNN 353 ++ + L+ + +R + Sbjct: 348 TSSAGKGAQAPKRPEILDQLAYYWRTD 374 >UniRef50_B7KJ23 CRISPR-associated protein, Cse1 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ23_CYAP7 Length = 525 Score = 95.2 bits (235), Expect = 5e-18, Method: Composition-based stats. Identities = 81/534 (15%), Positives = 176/534 (32%), Gaps = 72/534 (13%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCS-RDQWRLSLPRDDMELAALALLVCIGQ-IIA 59 +LL + WIPV + K + I+LQ L+ + + +A L+ Q I Sbjct: 17 SLLTEPWIPVVYNDSLKYKNISLQDLFLEWENLKTVQGINPPRTIALWRWLIAFTQWSIQ 76 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA----NDVTPME 115 K E++ + + + + L H + PF Q K ++ + +P+ Sbjct: 77 GTKTIDEWKQLWTDENLGSRIIKRLETVKERLDLLHPDFPFGQCKDLREETKGKEPSPVS 136 Query: 116 K-LLAGVSGA--------TNCAFVNQPGQGEALCG-GCTAIALFNQANQAPGFGGGFKSG 165 K L N AF++ + L C + ++ ++G Sbjct: 137 KILFQDKDSGLLWSKYSDQNPAFLSYAEAVQELLRLLCCDLG----GTKSDSQDRSAQTG 192 Query: 166 LRGGTPVTTFVRGIDLRSTVLLNVLT-LPRLQKQFPNESHTENQPTWIKPIKSNESIPAS 224 + + + G +++ T+LLN+ P+ ++ P W + N + Sbjct: 193 ICVMGRIVMPI-GKNVKETLLLNLHQYSPQDDIPSIFPDQDKDLPLWE---RLNIKKQSR 248 Query: 225 SI-GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 +I G + L + + L +S + E+F + W Sbjct: 249 TITGLLDYLTFPNRRVMLIH----------NGKSVTGVYLYKGEEFNQKDSYFWE--LWQ 296 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN---IAPQ 340 + VK + K L + SW + ++ Q+ + ++ + + R+ + Sbjct: 297 AYIQVKDESMPLK-LKLDINKASWRD-AEALLHPTTQDNHKPKIFDWLVKCRHTGCVPDP 354 Query: 341 SPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFA 400 P++++ + ++ L HD + Q + ++V GL Y K F+ Sbjct: 355 IPVQVLGFAHGSDLGKPLHWLHDTMTIPQVYLDSKEAYYKLVE-GLKYAE---KIGRLFS 410 Query: 401 EGFKNKDFKGAGVSVHET-------------AERHFYRQSELLIPDVLANVNFSQADEVI 447 G +S + + + + ++ + + DE Sbjct: 411 SKTYETVANGLKLSKKDKQKFINQLSTTAAIYWSALDSEFQQFMFELAEDKVVDEEDEDD 470 Query: 448 A--------DLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLREL 493 + ++KL + + QS+A + A YK L ++ Sbjct: 471 ITFGEIKIPEWKNKLKTIATECYEQSIAGISS----YEARARGLNAWYKELNKI 520 >UniRef50_C6CML6 CRISPR-associated protein, Cse1 family n=6 Tax=Gammaproteobacteria RepID=C6CML6_DICZE Length = 507 Score = 91.0 bits (224), Expect = 9e-17, Method: Composition-based stats. Identities = 73/493 (14%), Positives = 147/493 (29%), Gaps = 86/493 (17%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV ++L+ ++ S Q R ++A LL+ I Q + Sbjct: 5 FNLIDEPWIPVADIGQ-----VSLKEIF-SNPQLRALGGNPVQKIALTKLLLAIAQSAST 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 DD ++R + E+ + W D FYL + PF+Q ++ +V + L Sbjct: 59 PIDDNDWRQTGWQGMAENCLS-YLEKWHDRFYLYGEK-PFLQMPAIQTAEVKSLGVLSPE 116 Query: 121 VSGATNCAFVNQPGQ------------GEALCGGCTA---------IALFNQANQAPGFG 159 +S + + G + A + G Sbjct: 117 ISTGNTTVLTETQQEQRSYDADKAITVIVQMGFGLSGKKTDNSVVLTAGYQGKQNDKGKP 176 Query: 160 GGFKSGLRGG--TPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIK-PI- 215 K+G+ G + +F G + +V LN+ T + + + W + P Sbjct: 177 ASGKAGIAVGHMGLLHSFWLGDSIVHSVWLNLFTTEDITELVMYPTL--GVAPWEQMPTG 234 Query: 216 KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNG 275 + ++ A + L L D + Y+ + Sbjct: 235 EDDDIAQALKTSLIGRLIPMGKFCLLADD-------------GIHYSDGIAH---AGYLE 278 Query: 276 LWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFR 335 P + KK + S W +++ ++ G + + Sbjct: 279 GKADPTASVDFAQKKPKALW----VNPSKRPWRELTSLLQFIEQGKVGGFDTPQLKRTLK 334 Query: 336 NIAPQ-SPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRK 394 ++ L GG R + N G +QY + ++ V + + L Sbjct: 335 RVSRSAEQFALWSGGLR-------------VSSNAG-EQYASGTDDYVQSEIWLSSHLLG 380 Query: 395 ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA-------DEVI 447 +++ + + + R+F L++++ S Sbjct: 381 SVFLEYLKHEMSQLEAIQKQLWGAVVRYF---------RQLSDIDKSGTGKAQPFVSNQA 431 Query: 448 ADLRDKLHQLCEM 460 QLCE Sbjct: 432 EKATSTFWQLCER 444 >UniRef50_C7MQD3 CRISPR-associated protein, Cse1 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD3_SACVD Length = 484 Score = 90.2 bits (222), Expect = 2e-16, Method: Composition-based stats. Identities = 71/461 (15%), Positives = 126/461 (27%), Gaps = 64/461 (13%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALL---VCIGQII 58 N+ D IPV +G ++ L + L +P E L +L C + Sbjct: 3 NIATDPVIPVTRSDGTTTRLGLRDLLVHAHKIRHLDIPIPPAEAGLLRILYTITCRITGL 62 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQ-LIAPW--IDMFYLNHAEHPFMQTKGVKANDVTPME 115 + + L F I + + L + P+MQ + Sbjct: 63 DTHDNRNTWAEHRNTVLATGRFDADAINAYLGKHCWDLFDEQRPWMQDPRLPDQAERKTA 122 Query: 116 K-LLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGFGGGFKS-------- 164 L G + + A L L G GG ++ Sbjct: 123 NVLDMTRPGDNSAIWWKHTHADYAPPLPAHEAVQWLIVHHYYGSGGAGGKRTVTHNNKTV 182 Query: 165 --GLRGGTPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN 218 P+ + V G L T+L + P+ + T + W + Sbjct: 183 SDQYMSSGPLRSTVTYYPLGATLFETLLAGI--------PAPSHTTTGDAAPWETDLNQP 234 Query: 219 ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 P + L Q H L E + Y + ++ + Sbjct: 235 LGTPPAPTWPAGILTGQSRHALLL--------DHTDNEVDGVYLTWAWKERHTPI----L 282 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 P+ + K G E + TS +W ++ D+ AV+ ++ Sbjct: 283 DPYCIHNIDPKTG--EAQPRQANTSRSAWRDFDALLADRPTHTR-----PAVLGDALDLP 335 Query: 339 P--QSPLELIMGGYRNN-QASILERR--HDVLMFNQGWQQ---YGNVINEIVTVGLGYKT 390 Q L + G+ + QA+ + + + +VT Sbjct: 336 DDLQDTLRVRAIGWHQDRQATNTGWYVSETPPLLRYMDEHDPARAALAETLVTTADKVYG 395 Query: 391 ALRKALYTFAEGFKNKD------FKGAGVSVHETAERHFYR 425 A+RKAL+ + D A AE F+ Sbjct: 396 AMRKALHKAWKDADLGDPKQCPWKDAADHLYWPAAEHIFWA 436 >UniRef50_B0LU91 CRISPR-associated protein Cas1 n=2 Tax=Streptomyces RepID=B0LU91_9ACTO Length = 540 Score = 89.1 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 59/429 (13%), Positives = 112/429 (26%), Gaps = 73/429 (17%) Query: 25 QSLYCSRDQWRLSLPRDDMELAALA--LLVCIGQIIAPAKDDVEFRHRIMNPL-TEDE-- 79 + L + + + A LL + + KD + + ++ Sbjct: 3 ELLLNAEKFADIVVDLPTQRPAVFRQVLLPLVVDALGCPKDAEAWMDMFRAGAFSPEQRQ 62 Query: 80 -FQQLIAPWIDMFYLNHAEHPFMQTKGVK--ANDVTPMEKLLAGVSGATNCAFVNQP--G 134 + +F L PF Q ++ + L+A + N + G Sbjct: 63 LLADYLDKHQHLFGLLDPVEPFGQVADLRTAKGETKGSALLVATAATGNNVPLFSSRTEG 122 Query: 135 QGEALCGGCTAIALFNQANQAPGF---GGGFKSGLRGG-------TPV----TTFVRGID 180 L A L + G ++ G P+ T G Sbjct: 123 DVLELTPAEAARWLLHTHCWDTAAIKTGAVGDPMVKSGKTTGNPTGPLGQLGVTMPVGST 182 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI-------------G 227 L T+LLN+ + +++ P W + +S + ++ G Sbjct: 183 LFETLLLNI--------PYGQAGLSDDVPQWRR--RSTQGDVKDTLSCATPVWQSRPARG 232 Query: 228 FVRGLFWQPAHIELC---DPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPC 284 + WQ I L G + E T W SP Sbjct: 233 LLEAWTWQARRIRLISQDTDRGPRITRVLVSAGDRLEVSPDTEPHTA-----WV-VDSPA 286 Query: 285 ----LVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNEN-------GNRVAAVVNQ 333 + G + T W + ++ + + G + +V Q Sbjct: 287 GRRGKSPARSGVKSARPRRHTAGRAGWRGLDALLAVNAVDQDQQATATRSGAVSSQLVRQ 346 Query: 334 FRNIAPQSPLE------LIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLG 387 I+ + P L Y N A I + D + ++ + + Sbjct: 347 LSTISRRLPSRYPLRVELTGIAYGNQSAVIEDMYFDEIPLPVAALDPEGIVYGALLEVVD 406 Query: 388 YKTALRKAL 396 L KA+ Sbjct: 407 QAEDLAKAV 415 >UniRef50_Q8KB26 CRISPR-associated protein, CT1972 family n=1 Tax=Chlorobaculum tepidum RepID=Q8KB26_CHLTE Length = 530 Score = 84.4 bits (207), Expect = 8e-15, Method: Composition-based stats. Identities = 79/530 (14%), Positives = 153/530 (28%), Gaps = 104/530 (19%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIP + +++L ++ L ++A LL+ IGQ Sbjct: 5 FNLIDEPWIPAIGKG-----LVSLADIFSDPRIPALGG-NPVQKIALTKLLLAIGQAACT 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQ----QLIAPWIDMFYLNHAEHPFMQTKGVK-ANDVTPME 115 + L + F+ + W D F+L + PF+Q + + Sbjct: 59 PETTEALEQ-----LDAETFRRACRAYLEKWRDRFWLFGDK-PFLQMPAILDWMESQRAA 112 Query: 116 KLLAGVSGAT-------------NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGF 162 +L+ A N + ++Q +A A+ + + N A G Sbjct: 113 GILSETENAKQIGPGFYPSLPSENDSILSQFQTLKAQTDAEKALFIVSVMNFAFGGTQIN 172 Query: 163 KS------GLRG-GTP------------VTTFVRGIDLRSTVLLNVLTLPRLQKQFPNES 203 K+ ++G G P + TF+ G + T+++N+L+ + P Sbjct: 173 KNIYPSEEKVKGKGKPAKPGPSLGRNGYLHTFLFGSTIIDTLIMNLLSQEEIDN-LPFWE 231 Query: 204 HTENQPTWIKPIKSNESIPASSI--GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRY 261 P W S E A S+ ++ L + L GI Sbjct: 232 KGIGTPPWENMPVSRECDAALSLKKSYMGTLVSLSRFV-LLHDDGIYYID---------- 280 Query: 262 TGFLKEKFTFTVNG---LWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKI 318 W P +T+ + K + W ++ ++ Sbjct: 281 --------GLPYPSHQEGWLEP----SMTIDNQQNPPKAILVNPEKRPWRELVS-ILAVF 327 Query: 319 IQNENGNRVAAVVNQ------FRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQ 372 N+N V + R P + + GG + + F G + Sbjct: 328 DSNKNNKFVCLFIKYGLSRWPKRYNKPGDKIGVWSGGLQ-------------VSFQTG-E 373 Query: 373 QYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIP 432 QY N+ V + + + + + + + P Sbjct: 374 QYAKATNDFVESSVELDPDM---WNNLWYDKFFGEISILEI-MANKVKNGVINYYDSFEP 429 Query: 433 DVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALA 482 + + + QLCE F + V P + + A Sbjct: 430 KKEKKPKERASTIMGKKAVELFWQLCERRFPELVD-ACGEPDKLPAIHEA 478 >UniRef50_B6ZW55 CRISPR-associated protein, Cse1 family n=2 Tax=Enterobacteriaceae RepID=B6ZW55_ECO57 Length = 109 Score = 84.4 bits (207), Expect = 9e-15, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 39/105 (37%), Gaps = 10/105 (9%) Query: 392 LRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ-ADEVIADL 450 +++A ++ +G + DF + + F ++ + Q ADE++ Sbjct: 1 MKEAWFSDPKGARG-DFSFVDIDFWNKTQHRF--------LRLVRQIEEGQDADELLGKW 51 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 + ++ F++ V + P + + AR + E + Sbjct: 52 QKEIWLFARQDFDERVFTNPYEPVDLERVMTARKKYFTTSAEKQS 96 >UniRef50_UPI0001AF1D49 CRISPR-associated Cse1 family protein n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D49 Length = 531 Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats. Identities = 62/444 (13%), Positives = 120/444 (27%), Gaps = 50/444 (11%) Query: 58 IAPAKDDVEFRHRIM----NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTP 113 + D + L + E + F L PF Q N+ Sbjct: 38 VYRPLDGARWAELWRAREKEGLPQPELATYQDKFWSRFELFDPSRPFFQ-CPALDNEPGS 96 Query: 114 MEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGF-GGGFKSGLRGGT 170 KL+A + +N + + L A L +++ Sbjct: 97 TAKLVAHRATGSNRTLFDHTTADQRPLLQPAEAARWLVTTQAYDTSGTKQPYRTERSAEG 156 Query: 171 PV-----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE-SIPAS 224 + V G L T+LLN+ L + + + P + ++P W + + Sbjct: 157 GLGNRFGCVLVEGASLHETLLLNM-QLYQPEAELPPRTTARDRPVWEASQPPDPHPDARA 215 Query: 225 SIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYT------------GFLKEKFTFT 272 +G+ L W I L + G G + + Sbjct: 216 PLGWTDLLTWPSRRILLSTTVASGATLVDGVVLTPGTRMEGDLIDWEAMAAYRRPWLKGN 275 Query: 273 VNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV-- 330 G + L V + E + SW R Q R AA+ Sbjct: 276 KQGDFRAVTLDELRGVWRHSQELLLSSDPRWWNSWRGRLRAKGPLPAQEPQRQRPAALDH 335 Query: 331 ---VNQFRNIAPQSPLELIMGGYR--NNQASILERRHD-----VLMFNQGWQQYGNVINE 380 + + +IA + L + G + + + V + + G +I Sbjct: 336 IADLVEDDHIAEDTVYTLRIFGQQLGDQGGDTYAWYEEAVPAPVALLRAESARVGYIIGY 395 Query: 381 IVTVGLGYKTALRKALYTFAEGF-----KNKDFKGAGVSVHETAERHFYRQSELLIPDVL 435 V++ L+ ++ F K++D K + +H + ++ Sbjct: 396 AVSLANDLGEQLKLMERQYSADFHRELTKDQDKKPTDLEIH--YWPRLAAPFATFLRELG 453 Query: 436 ANV----NFSQADEVIADLRDKLH 455 V + + E L Sbjct: 454 EAVRLGASETAPAERWGQAVSDLA 477 >UniRef50_D2RAZ9 CRISPR system CASCADE complex protein CasA n=3 Tax=Actinobacteria (class) RepID=D2RAZ9_GARVA Length = 556 Score = 83.7 bits (205), Expect = 2e-14, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 88/261 (33%), Gaps = 59/261 (22%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQIIA 59 NLL + WI V G +++++ ++ ++ L+ + A L +L+ + + Sbjct: 4 FNLLDEPWISVIVDEKGHNKLVSITDVFKHASEYKALAGDMKTQDFALLRILLAVLHTVF 63 Query: 60 P------------AKDDVE----------FRHRIMNPLTEDEFQQLIAPWIDMFYLNHAE 97 + +D E +R + D + + W D FYL + Sbjct: 64 SRYDIQGNSREFDSNEDNEYYFNKETMNIWREVWNSKEFPDAVFKYLEQWHDRFYLFDDK 123 Query: 98 HPFMQTK-----GVKANDVTP-------MEKLLAGVSGATNCAFVNQPGQGE----ALCG 141 +PF+Q K +P + +L++ A + + +L Sbjct: 124 YPFLQVLKQDIDSKKLGGKSPSEISGKNINRLISE--SNNKVAVFSPKDNVDNNKSSLNE 181 Query: 142 GCTAIALFNQANQAPG-----FGGGFKSGLRG-----GTPVTTFVRGIDLRSTVLLNVLT 191 A + + A FG G +G G +V G +L T++LN + Sbjct: 182 AQLARWIITLQSYAGLADKTFFGTGKYKASKGWLFDLGG---IYVEGENLFETLMLNCVL 238 Query: 192 LPRLQKQFPNESHTENQPTWI 212 + +Q +P W Sbjct: 239 VGEMQSP-----EKRQKPCWE 254 >UniRef50_A8SDS0 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDS0_9FIRM Length = 537 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 87/517 (16%), Positives = 147/517 (28%), Gaps = 75/517 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NLL + WI VR R+ ++ ++L ++D L+ A L LL+ + + Sbjct: 6 FNLLTEPWIRVRLRDNTVREVSLTEALVSAQDYVDLAGEMPTQNAAVLRLLLAVLFTVFS 65 Query: 61 AKDD--------------VEFRHRIMNPLTEDE-FQQLIAPWIDMFYLNHAEHPFMQTKG 105 D + + + W D F+L H HPF Q Sbjct: 66 RVDAKGEPRPLMQSDDALERWSALWQLGHFPAAPVRDYLEQWKDRFWLFHPTHPFWQVPQ 125 Query: 106 VKANDVTPMEKLLAGVSGATN-CAFV--NQPGQGEALCGGCTAIALFNQA----NQAPGF 158 K KL +S ++N E L A L A Sbjct: 126 AKIGTEYGAAKLNGEMSESSNKLRLFPLYAGQSKEQLSYPQAARWLLCVNGYDDTSAKPK 185 Query: 159 GGGFKSGLRG--GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW--IKP 214 G G S G G +G +L T++LN+ + E E++P W P Sbjct: 186 GKGLPSVGAGWLGKIGFIQAQGDNLYETLMLNL-----TLLRDGRECWGESKPCWELEAP 240 Query: 215 IKSNESIPASSIGFVRGLFWQPAHIEL--CDPIGIGKCSCCGQESNLRYTGFLKEKFTFT 272 + + + L Q + L G C G F +E Sbjct: 241 KSAERTEICCPDNPAQLLTLQSRRLLLHRTGENVDGFCLLGGD-------FFPRENVFAE 293 Query: 273 VNGLWPHPHSPCLVTVKKGEV-EEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVV 331 +W + +KK E + W + V ++G+R V Sbjct: 294 QMTIWR------TMPIKKNEPVVFVPCRHDPAKQFWREFPAVFC-----QDSGHRPGVVC 342 Query: 332 -------NQFRNIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQG--------WQQY 374 + + + P+ + + G Y + + + D L F G WQ Sbjct: 343 WIEKLQEKRLKLLDPRRKVHFRISGVQYGDKDFFVNDSFSDSLTFQAGILDEIGRPWQSR 402 Query: 375 GNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDV 434 E + L A + +R F + + + Sbjct: 403 IVREIERCEQTAALIGRFAQELAIAAGDRNENAGGAVRAQFYFAVDRPFRQWLQAI---- 458 Query: 435 LANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAH 471 + DE + + + E L Q V + Sbjct: 459 --DQEQDDPDEAALRWQTRARSIAEKLGKQMVMEAGN 493 >UniRef50_B5GA97 Crispr-associated protein n=1 Tax=Streptomyces sp. SPB74 RepID=B5GA97_9ACTO Length = 534 Score = 81.7 bits (200), Expect = 6e-14, Method: Composition-based stats. Identities = 68/474 (14%), Positives = 134/474 (28%), Gaps = 63/474 (13%) Query: 2 NLLIDNWIPVRPRNGGKVQIINL---QSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +L+ + PVR + + + L + L++ + A L +L + + Sbjct: 38 SLVTGEFFPVRLVDAADTVPVKYGLRRLLVEAGSIASLAVTPPPAQAALLRILYVVTARV 97 Query: 59 A----PAKDDVEFRHRIMN----PLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN- 109 + PA + R + L + +A + L A F+Q + Sbjct: 98 SGLDRPAASPSAWLDRRDDVAEEGLDPERVDAYLAEHAERLRLFGARP-FLQDPRLAEEC 156 Query: 110 -DVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRG 168 + KL+ G +N + + AL + G + R Sbjct: 157 SKKAGVNKLVFGRPAGSNQVWFGHHRDADPRPVP-ADEALLHLLMWLYYGAAG-RCSTRT 214 Query: 169 ----------GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN 218 P+ + T+L ++ + + ++ W P + Sbjct: 215 VGSVSAADSRSGPLRGSLSYHPEGPTLLHTLVAG--IPRPGNGTDPATDRCPWELPELPD 272 Query: 219 ESIPAS-SIGFVRGLFWQPAHIELCDP-IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 P++ ++G + L H L G + T ++K Sbjct: 273 PLHPSTANVGPMSQLTAGWQHALLLQEGARPGTVD------DAYITWAARDKL------- 319 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFR- 335 P P P LV ++ W I +++ + R AA Sbjct: 320 -PRPEDPFLVLQLSQAGNIYARRAKSARALWRDIDALLIQEPYGTAKPRRPAAFHGVLEL 378 Query: 336 NIAPQSPLELIMGGY--RNNQASILERRHDVLMFNQGWQQYGNVINEIVTV--------G 385 + PL + G+ + I ++ ++ + G Sbjct: 379 DPEGGGPLRVRALGFEQDDRTKDIQYISGTTPPVLDLIEEREPRLSARLRTMRVAGELYG 438 Query: 386 LGYKTALRKALYTFAEGFKNKDFKG-----AGVSVHETAERHFYRQSELLIPDV 434 A+R+A E + D G A V AER F+R+ D Sbjct: 439 RRLDFAVRQAWR---ELVNDSDAVGPWGELAAVDYWPAAEREFWRRVGARDMDQ 489 >UniRef50_C6SPI8 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPI8_STRMN Length = 572 Score = 79.0 bits (193), Expect = 4e-13, Method: Composition-based stats. Identities = 48/271 (17%), Positives = 80/271 (29%), Gaps = 68/271 (25%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQIIA 59 NLL + WI V G + ++L + + + L+ + A L +L+ + + Sbjct: 4 FNLLDEPWISVVFDEKGSTKEVSLLDFFQNAHHYKDLAGDTKTQDFAVLRVLLAVLHTVF 63 Query: 60 PAKDD---------------------------------VEFRHRIMNPLTEDEFQQLIAP 86 D + N D ++ + Sbjct: 64 SRFDANGNAYGYLEIDEKYRQIEEIEEDDLEEYEDDLYETWLTLWQNRQFPDIIEEYLKK 123 Query: 87 WIDMFYLNHAEHPFMQ----------TKGVKANDVTP--MEKLLAGVSGATNCAFV---- 130 W D FYL E+PF Q A + + +L++ + A Sbjct: 124 WRDRFYLFDEEYPFFQVRKEDIEMVMDLNKDAGKIFGKNINRLVSE--SSNKIALFSPKH 181 Query: 131 NQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRG---------GTPVTTFVRGIDL 181 N E L L + G K G R G F++G +L Sbjct: 182 NYDNNKERLSNSEIVRWLLTYHGYSEIGGRMKKIGKRDYSKGWLYNLGG---LFLKGKNL 238 Query: 182 RSTVLLNVLTLPRLQKQFPNESHTENQPTWI 212 T+LLN LTL + + +P W Sbjct: 239 YETLLLN-LTLFYFEY---DNHLHIQKPCWE 265 >UniRef50_C2BEU1 CRISPR-associated protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BEU1_9FIRM Length = 562 Score = 79.0 bits (193), Expect = 4e-13, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 75/273 (27%), Gaps = 70/273 (25%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ + WI V G +++ L+ + + L+ A + L+ I + Sbjct: 4 FNLIDEPWISVVTDYKGTTKLVGLREFFQNCHNYLELAGEMPTQNFAVMRFLLAILHTVF 63 Query: 60 PAKDD---------------------------------VEFRHRIMNPLTEDEFQQLIAP 86 D + + + + Sbjct: 64 SRYDANGKPYEMVTINEKMQQVENVDEEYEEDYEDALMETWESLWKSGKFPEIVTDYLEC 123 Query: 87 WIDMFYLNHAEHPFMQT-----KGVKANDVTP-------MEKLLAGVSGATNCAFVNQP- 133 W D FYL +PF Q K + P + +L++ A + Sbjct: 124 WHDRFYLFDDNYPFYQVTKEEISESKISKTNPSEILGKNINRLVSE--SGNKIALFSPKY 181 Query: 134 ---GQGEALCGGCTAIALFNQANQAPG------FGGGFKSGLRG-----GTPVTTFVRGI 179 E L L + + A +K+ +G G F+ Sbjct: 182 SSDDNKEILDYDEVVRWLISFQSYASLSDKVRFSNKSYKAS-KGWLFDLGG---VFLSSD 237 Query: 180 DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWI 212 +L T++LN++ + + + P W Sbjct: 238 NLYKTMVLNLVLVNTSNTDY---NTNIQNPVWE 267 >UniRef50_Q47PJ1 CRISPR-associated protein, Cse1 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ1_THEFY Length = 549 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 77/545 (14%), Positives = 163/545 (29%), Gaps = 81/545 (14%) Query: 1 MNLLIDNWIPVRPRN--GGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 ++ I W+ R R+ + L S + + +P +L I I Sbjct: 20 FDVTIAPWLIARSRDVLAAPEMLGLRDVLIRSHELSDVEIPLPPGAAVLWRILALITARI 79 Query: 59 ----APAKD--DVEFRHRIMNPLT-----EDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 P +++ R L+ + A + + F L H E P++Q ++ Sbjct: 80 TGLDQPPNKNPKRKWQARRSQILSKGRLDPEAVDAYFADYSERFDLFHPERPWLQDPRLR 139 Query: 108 AN--DVTPMEKLLAGVSGATNCAFV--NQPGQGEA-LCGGCTAIALFNQANQAPGFGGGF 162 + + KL G + N ++ + L L P Sbjct: 140 EECPKTSGVNKLAWGRTAGENQVWLGGHHHDLDPHPLDSAEAVWHLLATLGYGPSGMCTA 199 Query: 163 KSGLRG-------GTPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW 211 + +RG P+ V G L +++LN+ +P + W Sbjct: 200 RV-VRGRSERNVTAGPLRGTVSYHPLGRTLFESLILNI--------PYPGTGAADLAF-W 249 Query: 212 IKPIKSNE-SIPASSIGFVRGL-FWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKF 269 +P ++ +P S G L H L P G + ++ + Sbjct: 250 EQPELNDPLGLPEESAGLAGILRLDHFRHAVLLHPSPDG---------SHVVDAWVTWAW 300 Query: 270 TFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA 329 P+ + K+G V + W + ++ N + Sbjct: 301 RERNISPELDPYLIYQTS-KEGRVYPRPA--EAERAIWRDLDALLHYGEDGNYRPTILDN 357 Query: 330 VVNQFRNIAPQS---PLELIMGGYRNN-QASILERRHDVLMFNQGW--------QQYGNV 377 + PQ L L G+ + QA + W + + Sbjct: 358 CTPLAQ--VPQEVLDSLRLRAFGFDQDGQARDKQWFTATTPAVLRWLADRETDDNENARI 415 Query: 378 INEIV-------TVGLGYKTALRKALY------TFAEGFKNKDFKGAGVSVHETAERHFY 424 + I +G + A ++A + + G K G G V ++ Sbjct: 416 VRRITLARKAAEALGRRLEKACKEAWKESNSPSSTSSGTNAKTETGVGPWVQH-GMSRYW 474 Query: 425 RQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARA 484 ++E + +++ + +A + + + +++ PY P++ + R+ Sbjct: 475 AKAEPVFWNIVYDRPAQGYTPGMAGPGNAFNLVALAAYDEVTGPYCERPRVAKVVERHRS 534 Query: 485 TLYKH 489 TL+ + Sbjct: 535 TLFSN 539 >UniRef50_UPI0001AEDDCB hypothetical protein SalbJ_26479 n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AEDDCB Length = 509 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 67/519 (12%), Positives = 149/519 (28%), Gaps = 77/519 (14%) Query: 22 INLQSLYCSRDQWR-LSLPRDDMELAALALLVCIGQIIAPAKDDV--EFRHRIMNPLT-- 76 ++L+SL+ + R L +P A L +L + I + + R L Sbjct: 1 MSLRSLFLRAREIRTLLIPEAPTHSALLRVLYALTARITALDEAGPGSWGDRREEVLERG 60 Query: 77 --EDEFQ----------QLIAPWIDMFYLNHAEHPFMQTKGVKA----NDVTPMEKLLAG 120 + F+ W F L +E P++Q + + + KL Sbjct: 61 FCAESFELPDGRKAGIGGYFDGWAHRFDLFDSERPWLQDPRLPDQCDRSQTAGLHKLAMS 120 Query: 121 VSGATNCAFVNQPGQGEALCGGC--TAIALFNQANQAPGFGGGFK------------SGL 166 S N ++ G + + A++L G + S L Sbjct: 121 RSAGNNHSWFGHRGDDKLVLPTVSQAALSLLTWHYWGSPGGLSRRAVGQVSHHYAKASPL 180 Query: 167 RGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP-ASS 225 RG ++ +L T+L + + S + W + + P Sbjct: 181 RGA--LSYHPECDNLFLTLLAGLTPPD------GDVSRQTDLCPWEREDVPDPLAPMPEP 232 Query: 226 IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 +G L H P G+ ++ T + + P L Sbjct: 233 LGPCSRLTACSQHALYLVPADDGE-----HAADAYIT--------WAYHAERLRPEDDYL 279 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLEL 345 + + E +S W + +++ ++ ++ V++ +++ + Sbjct: 280 IWDIGKDGETSPRLARSSRSLWRDVDALLLKQL--DDASPIQPKVMDHAFDVSEYLRVRA 337 Query: 346 IMGGYRNNQASILER-RHDVLMFNQGWQQYGNVINEIVT--------VGLGYKTALRKAL 396 + +Q++ + + ++ + V G + A ++A Sbjct: 338 LGFEQDTSQSANYQYVDSTTPVLLSRVEEDVVTSDLPVRQLRELGELFGGRLEHATKEAW 397 Query: 397 YTFAEGFKNKD---FKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLR-D 452 T+ + KN AE F+ L+ + + D Sbjct: 398 LTYTDDKKNSPGAWLDAVAARYWPAAEDEFWSAF-----RKLSRSDAAVDPAFDFDAACR 452 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLR 491 + + P+ + + A+A + LR Sbjct: 453 AFGRHALDAYEAVTDSVLRTPRGVKAVTGAKAIILAALR 491 >UniRef50_C8P6I4 Putative uncharacterized protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I4_9LACO Length = 584 Score = 72.1 bits (175), Expect = 5e-11, Method: Composition-based stats. Identities = 80/568 (14%), Positives = 160/568 (28%), Gaps = 112/568 (19%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQII- 58 NL+ + WI V + + + ++L+ L+ + + +L+ +LA L L+ I + Sbjct: 11 FNLVTEPWIKVVDED-NRERTVSLEQLFTNAVHYRQLAGEMKSQDLALLRFLLAILTTVY 69 Query: 59 -------------------APA--------KDDVE------FRHRIMNPLTEDEFQQLIA 85 +DD E ++ + + + Sbjct: 70 SRYTADGEPYEWLKIDGQTMQPVPFEGKTFEDDDEDGLRQTWKDLYHAQHFTEIVTRYLQ 129 Query: 86 PWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG--------------VSGATNCAFV- 130 + D F L A+HPF Q + + + P K++A Sbjct: 130 KYADRFDLLDADHPFYQATRAQYDSLVPKNKVVAKGKGTVAVKQINRTISESNNKPDIFS 189 Query: 131 -NQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSG----------LRGGTPVTTFVRGI 179 N L A L N L G P F G Sbjct: 190 PNTSPHKNDLSLASLARWLITYQNFTAVTDKTKVVAKEKFPVSPGWLYGLNP--VFATGS 247 Query: 180 DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPI------KSNESIPASSIGFVRGLF 233 +L T++LN++ +P Q + P W PI + +P ++ + L+ Sbjct: 248 NLFETLMLNLVLIP--QGVNSETESMDQHPAWEVPIEEYIQARLTGIVP-GNLAELYTLW 304 Query: 234 WQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGE- 292 + HIE + F + + P + K E Sbjct: 305 SRVIHIEWDNGRP---------------LIFSAGLPKLNNHEAFLEPMTTWKFNKKDNEW 349 Query: 293 -VEEKFLAFTTSAPSWTQISRVVVDKI--IQNENGNRVAAV-----VNQFRNIAPQSPLE 344 ++L + W + + + Q ++ V + + + L Sbjct: 350 QPNLRWLN-SLGKAMWRNFGQYISVQQDDTQKDSQREPGIVTWLHMLRSTKMLPADLALH 408 Query: 345 LIMGGYRNN-----QASILERRHDV-----LMFNQGWQQYGNVINEIVTVGLGYKTALRK 394 L G N+ Q+ E ++ ++F+ + I + Sbjct: 409 LTTVGLINDGNATSQSPAAEFADEMQINADVLFDPNPLKRLQWPKLIENTVEMTEKVGAL 468 Query: 395 ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLAN-VNFSQADEVIADLRDK 453 Y + + K + FY + + LA N + DE + + Sbjct: 469 VWYFANHIMELRGVKD-DGAFANRVSARFYERLNQPFREWLAGLTNNDERDEKVNLWKQT 527 Query: 454 LHQLCEMLFNQSVAPYAHHPKLISTLAL 481 Q+ ++ + P+ I Sbjct: 528 AKQIAVQTADELLNSAT--PQDIRGRVK 553 >UniRef50_D0WFD1 CRISPR-associated protein, Cse1 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFD1_9ACTN Length = 547 Score = 71.7 bits (174), Expect = 6e-11, Method: Composition-based stats. Identities = 45/292 (15%), Positives = 82/292 (28%), Gaps = 45/292 (15%) Query: 56 QIIAPAKDDVE------FRHRIM-NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-- 106 Q D + + + L +E + W F L PFMQ G+ Sbjct: 43 QRSLSPSLDEDDDPAEVWAKLWEADTLPVEEIHSYLEKWRHRFDLLDNNEPFMQIAGLVR 102 Query: 107 -----KANDVTP-MEKLLAGVSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGF 158 K D P +++++A V N + L A L + + G Sbjct: 103 SNDAIKDEDGEPYLKRVIADVPSRRNRRLFSVRMGEGINRLSYAEAARWLIHVHSFDTGG 162 Query: 159 ----GGGFKS--------GLRGGTPVTTFV-----RGIDLRSTVLLNVLTLPRLQKQFPN 201 G S GGT + G ++ T++LN + L R + Sbjct: 163 PKNAAKGDSSDVIKKEGRSYPGGTGWLGRIGCLYFEGSTIKETLILNFVPLYRNEIDSLF 222 Query: 202 ESHTENQPTWIKPIK-SNESIPASSI--GFVRGLFWQPAHIELCDPIGIGKCSCCGQESN 258 + + P W + + ++ + G WQ + G + + Sbjct: 223 PEN--DLPIWERRQRCVSDGHEPRVLPDGRADLYTWQSRWV--NFSHEDGMITNVVLSAG 278 Query: 259 LRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQI 310 + E +T W K + + F + W + Sbjct: 279 DLLSVEAAELYTVENMTSWKE----GTSKSKPSAPKLIPMHFDSDKALWRGL 326 >UniRef50_Q03C63 CRISPR-associated protein n=1 Tax=Lactobacillus casei ATCC 334 RepID=Q03C63_LACC3 Length = 569 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 62/405 (15%), Positives = 114/405 (28%), Gaps = 83/405 (20%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ D WI V + + ++LQ+L+ S D RL+ +LA + LL+ I + Sbjct: 6 FNLVTDPWIKVIRAADYRSEEVSLQTLFQQSSDYLRLAGETQSQDLAIMRLLLAILHTVY 65 Query: 60 P-------------------------AKDDVE-------FRHRIMNPLTEDEFQQLIAPW 87 +DD E + + +A + Sbjct: 66 SRFDATGEPYEWLTIDLESLQVAEAVEQDDYEPDDLFETWDALHQLGHFSAIVIEYLARY 125 Query: 88 IDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSG--------------ATNCAFVNQP 133 D F PF Q + + + P +K +A SG + A Sbjct: 126 QDRFDFFGER-PFYQATQSEYDVLVPEKKKVATGSGTVAIRQINRTISESGNSPALFAPR 184 Query: 134 GQGEALCGGCTAIA--LFNQANQAPGFGGGFKSGL-------RGGTPVT----TFVRGID 180 G + + N G K+ + + F G Sbjct: 185 SDAGKDTLGMAELVRWVITYQNYT---GVTDKTKIVAQENFSNDSGWLYRLSPVFAVGDT 241 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 L T+LLN++ + + E +P + ++ E + Sbjct: 242 LFDTLLLNLILVQNEDAPYAVERPVWERPNAQRYVQDRERQRQPD-NLAALYTSWSRVLF 300 Query: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 L + F F+ + + P + +KK +V Sbjct: 301 LQWGD------------DRLENIFSAGVPPFSADNAFLEPMTTWRW-IKKEQVYRPHRKA 347 Query: 301 TT--SAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPL 343 T S W V + +++ R VV + + + L Sbjct: 348 MTSVSKAMWRNFGEYV---DLHDDSKRRQPGVVTWLQTLKARKSL 389 >UniRef50_B3ENH5 CRISPR-associated protein, Cse1 family n=2 Tax=Chlorobiaceae RepID=B3ENH5_CHLPB Length = 529 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 81/541 (14%), Positives = 159/541 (29%), Gaps = 107/541 (19%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV + I+L+ ++ D L +LA LL+ I Q Sbjct: 7 FNLIDEPWIPVIDKG-----RISLRQVFSEPDNRALGG-NPLQKLALTKLLLAIAQATCT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-------------- 106 ++D + L + W D F+L + PF+Q + Sbjct: 61 PENDEIHASMESSELARKSID-YLDKWYDRFWLYGEK-PFLQMSAIHGLIEQRKRKYLNA 118 Query: 107 -------KANDVTPMEKLL-----AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQ 154 + +V + K L + N + Q +AL A+ + + N Sbjct: 119 AKKDSVRRTAEVNALPKSLGMGFYPDMPSENNT-ILTQYQIPKALADSDKALFIVSLMNF 177 Query: 155 APGF-------------GGGFKSGLRGGTP--------VTTFVRGIDLRSTVLLNVLTLP 193 A G G K+ P + + + G L T+L+N+L+ Sbjct: 178 ALGGKRVEKNLDNQLMLGYAGKTPSAKSAPSLGNYIGYLHSILVGETLADTLLINLLSHE 237 Query: 194 RLQKQFPNESHTENQPTWIK-PIKSN-ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCS 251 R+Q + W + P ++ L + L G G Sbjct: 238 RIQANV-YWKSGLGKAPWEEMPTGVACPIATNLKSSYMATLVAMSRFVLL---QGDGIYY 293 Query: 252 CCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQIS 311 G + KE W P +++ + W ++ Sbjct: 294 VEGIQ-----YPSHKE--------GWREPSMAVNAQAATPKIKW----IDPNKRPWRELV 336 Query: 312 RVVVDKIIQNENGNRVAAVVNQFRNI-APQSPLELIMGGYRNNQASILERRHDVLMFNQG 370 ++ G + +N + + GG R + N+ Sbjct: 337 SLLAFMDGGGSQGYECQFIKYGLKNFGDRFKRIGVWSGGLR-------------VSTNRC 383 Query: 371 WQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQ---S 427 Q + + ++ + + Y + FK V++++ ++ F Sbjct: 384 DQSVKQDNDFVESLVFLESKIIGQLWY--------QQFKLEMVALNKISDTIFTATVAYY 435 Query: 428 ELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVA--PYAHHPKLISTLALARAT 485 + L+ + + +LCE F + V + K + + A+A Sbjct: 436 QSLVEKMDKKKAIKSFKNIADKATSLFWELCERHFQELVDACEPPYETKK-TRIVFAQAA 494 Query: 486 L 486 L Sbjct: 495 L 495 >UniRef50_Q06WG4 Putative uncharacterized protein (Fragment) n=4 Tax=Salmonella enterica subsp. enterica RepID=Q06WG4_SALNE Length = 55 Score = 68.3 bits (165), Expect = 7e-10, Method: Composition-based stats. Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQI 57 M+L + W+PV NG K +I +L+ L L+ PR D + AA +L+ I Q Sbjct: 1 MDLTKEKWLPVIFSNGDKKKI-SLRDLL-DNRIQDLAYPRADFQGAAWQMLIGILQC 55 >UniRef50_B0S4B8 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B8_FINM2 Length = 519 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 88/305 (28%), Gaps = 76/305 (24%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NLL + WIPV + + + L+ + ++ + ++ L+ I Q + Sbjct: 4 FNLLDEKWIPVIDNDCNNLNVSILELFKNASKYISIAGDTEVQTISITRFLLSILQTVFS 63 Query: 61 AKDD-----------------------------VEFRHRIMNPLT----EDEFQQLIAPW 87 D+ + +N L + + + Sbjct: 64 RFDENGEEYGYFGLNDMFKQKTEIDPNVIEDYVNDLNKAWINLLEMKSFPKIVEIYLNKY 123 Query: 88 IDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCA------------------- 128 D FYL E+PF Q D + G N Sbjct: 124 HDRFYLYDDEYPFYQVSSNIWEDFNVISM------GKQNAKPSNIPFKKINGKFYESNSL 177 Query: 129 --FVNQPGQGEALCGGCTAIA-LFNQANQAPGFGGGFKSGLRGGT-----PVTTF-VRGI 179 F + + + N + G+ + +TT +RG Sbjct: 178 RMFNVNDEKAKNKMSDSELARWIITYQNYSNTSDKAVFDGVSKKSYGWLYKITTMSLRGS 237 Query: 180 DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKP-----IKSNESIPASSIGFVRGLFW 234 ++ T++LN++ + +++ N +P W + K+ + ++ + + Sbjct: 238 NVFETLMLNLVLVHPVKEFVGNS----QKPCWEESSNKIITKNIKGYVPDNLAELYNTYS 293 Query: 235 QPAHI 239 + I Sbjct: 294 RAIRI 298 >UniRef50_Q60AC9 CRISPR-associated protein, CT1972 family n=1 Tax=Methylococcus capsulatus RepID=Q60AC9_METCA Length = 520 Score = 58.2 bits (139), Expect = 8e-07, Method: Composition-based stats. Identities = 76/537 (14%), Positives = 145/537 (27%), Gaps = 70/537 (13%) Query: 1 MNLLIDNWIPVRPRNG-GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 MNLL D V +G ++ + L L + E A L + + Sbjct: 1 MNLLTDPLFRVETPDGIERLSLPQLLEALGQDRVESLLGLQRHQEDAFHIFLCYLAGAVL 60 Query: 60 PAKDDVEFR---HRIMNPLTEDEFQQLIAPWIDMFYLNH-AEHPFMQ-------TKGVKA 108 + E R + + W ++ + FMQ G Sbjct: 61 AREARSEPRQPEDFWREGI--RKLTGRDDDWAWTLIVDDVTQPAFMQAPVPDKKDFGAFK 118 Query: 109 NDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLR- 167 + L + A N + + A AL + + FG G R Sbjct: 119 LKARTADALDI-LPTAKN--HDVKASRSGATSPDGWVYALVSLQTMSGFFGQGNYGIARM 175 Query: 168 ----GGTPVTTFVRGIDL---RSTVLLNVLTLPRLQKQFPNESHTENQP-TWIKPIKSNE 219 G P + + ++ + P W +P Sbjct: 176 NGGFGSRPAVAVYHAERMGMRWHCDVTRLVGIREELLAGPWGYRERGIVLVWEQPWDLES 235 Query: 220 SIPASSIGFVRGLFWQPAH-IELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV----- 273 S+ S+ + + + A + L F + Sbjct: 236 SL---SLNVLDPFYIEIARAVRLMGDGKN-------------VRAFGASTKAARLAAGDA 279 Query: 274 NGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ 333 G+ P +P V KK ++ + +P + + + E+G R A + Sbjct: 280 GGVLGDPWTPVNVADKKKGQSAMTVSASGLSPE--------LIRNVLFEDGFRAARMQCL 331 Query: 334 FRNIAPQ----SPLELIMG-----GYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTV 384 Q S L+ G G+ + + R H + + + ++ + Sbjct: 332 LEENEGQSCLFSATVLVRGQGTTDGFHHVAIPVPARAHRLFRRSSERDRLASISKTALND 391 Query: 385 GLGYKTALRK----ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNF 440 + + K AL N D + + ++E +R SE P + Sbjct: 392 AKEIQNRVLKPSVIALLEAGPDKINFDRREVNLWLNEATQRFSAAWSEDYFPWLWRQAEQ 451 Query: 441 SQADEVIADLRDKLHQLCEMLFNQSVAPYA-HHPKLISTLALARATLYKHLRELKPQ 496 AD + L + +++A Y + A + L + PQ Sbjct: 452 DDADAARLEWLRALRDKAHKVLEEAIARYPSREGRRYRARVKAEGLFHGSLFKTFPQ 508 >UniRef50_B5H6V1 Predicted protein n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5H6V1_STRPR Length = 130 Score = 57.5 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 34/101 (33%), Gaps = 3/101 (2%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQW-RLSLPRDDMELAALALLVCIGQIIA 59 L WIPV +G + ++L+ ++ R+ E A + L++ + Sbjct: 15 FGLTTQPWIPVLRGDGTQ-DELSLREVFAQAAGLRRIVGDLPTQEFALVRLMLAVVHDAL 73 Query: 60 -PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHP 99 +D ++ + + F L A+ P Sbjct: 74 DGPQDIEDWSDLWADERCFAPVDAYLDAHRGRFDLLDAQAP 114 >UniRef50_D1Y489 CRISPR-associated protein, Cse1 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y489_9BACT Length = 536 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 76/543 (13%), Positives = 141/543 (25%), Gaps = 92/543 (16%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLP------RDDMELAALALLVCIG 55 NLL + W+ V G + + + P + + L Sbjct: 4 NLLTERWLSVENPQGAMRRFSLPELFSALERNEVAAFPALLPHQAAPFHVWLVQLGCHAL 63 Query: 56 QII-----APAKDDVE-FRHRIMNPLTE--DEFQQLIAPWIDMFYLNHA---------EH 98 + P D + + + E D + ++ + F L+ + Sbjct: 64 ETAGQVENLPPPDPKKPWAMLGRHSPDEWRDMIRGVVPDYSRKFPLDEPWCLVTDDLNKP 123 Query: 99 PFMQ------TKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQA 152 FMQ + + L +S F + G + AL + Sbjct: 124 AFMQVPAPDGDFADYRGEAQFPDDLDLLISAKN---FDVKSGVMKHPSAEEWIFALISLQ 180 Query: 153 NQAPGFGGGFKSGLRGGT-----PVTTFVR--------GIDLRSTVLLNVLTLPRLQKQF 199 + G G R P+ T G D R V+LN P + Sbjct: 181 TNSGFLGRGNYGVARQNGGWSIRPILTLQSSSSPGARWGRDAR--VILN--CPPDWELYA 236 Query: 200 PNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNL 259 S E + W++P S + + + + G S +++ Sbjct: 237 FCRSEKETRLLWLEPWNGKTSSALRDLHPL--FIEICRRVR---AVRSGN-SVSVKKAAS 290 Query: 260 RYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKII 319 EK + P P + + V L + + +I+ Sbjct: 291 ACARVDIEKTGGNL----RDPWEPVVFDKQGSHVFGSNLNYAN------------LARIL 334 Query: 320 QNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFN--------QGW 371 G + ++ R I + R Q ++ + W Sbjct: 335 AEAEGMQKPLLLRYHRGIDDPVATQAWCSALRKGQGKTEGYEERLIPVSVAGVSDKFTLW 394 Query: 372 QQYGNVINEIVTVGLGY--------KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHF 423 + G +IN + T + A A + + K V E Sbjct: 395 RAAGAMINLVKTAKNTVLGVALARFMQCGKSADGNRAIDWNSSAVKNWIPVVKNKMEDEV 454 Query: 424 YRQSELLIPDVLANVNFSQ-ADEVIA-DLRDKLHQLCEMLFNQSVAPYAHHPKLISTLAL 481 + D + + + DE + L QL + V+ P +S Sbjct: 455 ETLFFRYLWDTCSRMTDGEITDESWLVPWKTCLRQLVRKYYEIGVSSL---PGSVSQSVK 511 Query: 482 ARA 484 ARA Sbjct: 512 ARA 514 >UniRef50_C5V9N4 Putative uncharacterized protein n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N4_9CORY Length = 516 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 86/517 (16%), Positives = 155/517 (29%), Gaps = 81/517 (15%) Query: 29 CSRDQWRLSLPRDDME-LAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPW 87 + +RL+L ME ++ + LL IG + KDD R PL + + +A Sbjct: 32 ATDPDFRLNLDVSGMEFMSIIRLLSHIGARMLQ-KDDSLHRKHRKKPLPDGLIVETLAEL 90 Query: 88 IDMFYLNHAEHPFMQT---KGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCT 144 L + F Q KGV P KL G + A+ ++ Sbjct: 91 EADRPLYGGKQNFFQIPDSKGVVGRGKQPTSKLSPTAPGDNSQAYWDRDKHKPVTLSAEE 150 Query: 145 AIALFNQANQAPGFG---------GGFKSGLR----GGTPVTTFVRGIDLRSTVLLNVLT 191 A+ + G G+R G T V+ +L ++L ++ Sbjct: 151 AMRQILIFSMYSSAGNNKFENRKCQNGSPGIRFLGAGNTATEVMVQSKNLWDSLLCSI-- 208 Query: 192 LPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCS 251 P W P+ +S + + + W P + Sbjct: 209 -------PATWVAGSGMPAWADPM-GEQSKTDTGMHPLWQASWMPNGVSGYWE------- 253 Query: 252 CCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQIS 311 G+E G + ++ + + +W SP K + F T P + I Sbjct: 254 --GRELVGVGVGGVPPQYLGSFSKVW----SPYGDKDAKESYKAWFKQRDTEDPFYLYIR 307 Query: 312 RVVVDKIIQNE---NGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFN 368 + + + + V R L+ G + + +HD L+F Sbjct: 308 DSKTNDPKAKRLDLSKDLIQLAVEWAREGTISKLDSLMAG-----RVAAPNFKHDKLLFA 362 Query: 369 Q---GWQQYGNVINEIVTVGL---------------------GYKTALRK---ALYTFAE 401 + G VI E VT + L++ A + Sbjct: 363 RHQIGGNASTPVIRESVTTNTASSLWCLDQDPEVQARIIGQAEFIDTLKQRVCAPFRRQS 422 Query: 402 GFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEML 461 + F + E F+R+ + +V++ AD + +L +K Sbjct: 423 DKDHPTFDDL-ADLRPMMEAEFWRRITPVYEEVISTA--QAADFNVVELYEKGVAATIAA 479 Query: 462 FNQSVAPYAHHPKLISTLALAR--ATLYKHLRELKPQ 496 + + PY + R LY L++ K Q Sbjct: 480 LDAVIDPYLLQNPKRNINVKERTIRFLYALLKDKKGQ 516 >UniRef50_UPI000169879C hypothetical protein Epers_00880 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI000169879C Length = 102 Score = 54.4 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 30/94 (31%), Gaps = 11/94 (11%) Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI 337 HP SP + + G L + + + + + + A VV+ F + Sbjct: 1 IHPLSPHIANKEGG-----LLPQHAQPGGLSYRHWLGL---VSKQENRQPAQVVSTFLSY 52 Query: 338 A--PQSPLELIMGGYR-NNQASILERRHDVLMFN 368 PQ L GY +N + ++ Sbjct: 53 RKLPQEQFRLHTFGYDMDNMKARCWYETTFPLYP 86 >UniRef50_C2GEY5 Putative uncharacterized protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY5_9CORY Length = 541 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 64/504 (12%), Positives = 128/504 (25%), Gaps = 69/504 (13%) Query: 47 ALALLVCIGQIIAP----AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQ 102 L ++ A + + + L ED + + PFMQ Sbjct: 50 THRFLASTVAVVIQELNIATSPRKIKKLLEKGLPEDAVDAALERLAQGSDVFDPFFPFMQ 109 Query: 103 TKGVKANDVT-----------PMEKLLAGVSGATNCAFVNQPGQGEA-LCGGCTAIALFN 150 + D P++KL + F + G L L Sbjct: 110 QPALNIKDPKNKTTYVGPGIQPVKKLSPSMPPDEAEDFWHLLAAGNTELDLTAALQQLVG 169 Query: 151 QANQAPGFGGGFKS-GLRGGTPVTTFV-RGIDLRSTVLLNVLTLPRLQKQFPN-ESHTEN 207 + + + G P FV + + L + P + + Sbjct: 170 YQYLSLAGNNSYDGRKCQNGAPSMRFVGENRTATEIIWESTSLLASILLMIPLSWAVGQG 229 Query: 208 QPTWIKPIKSNESIPASSIGFVRGLFWQP--AHIELCDPIGIGKCSCCGQESNLRYTGFL 265 P W K S + + W + D + +G G Y + Sbjct: 230 LPAW-ADRKCEHSRGENGPHPLWRSTWSSNAPAVAWKDDVMVGV--RTGGIPENWYLPEM 286 Query: 266 KEKFTFTVNGLW-----PHPHSPCLVTVKK-----------------GEVEEKFLAFTTS 303 E + W P+ +K + ++ A + Sbjct: 287 GET-KESRKKWWDTRNESDPYYLYRSRAQKDGTQELVLQRLDLGTDATALAVEWAAKNKT 345 Query: 304 APSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHD 363 + + + + + + + R S + + Sbjct: 346 KALLAWQTPRLGEHTLDDR------LLFVRHRVEGTASSANIRASEIFAPSREKWSYDLE 399 Query: 364 VLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHF 423 + N Q I + + R + + G ++ A F Sbjct: 400 ENVLN----QISLRAELIQKIHNIVISPFRTRDNSASGGRAPLVIDFLK-TIRPDASTAF 454 Query: 424 YRQSELLIPDVLANV-----NFSQADEVIADLRDKLHQLCEMLFNQSVAPYAH-HPKLIS 477 +R + ++L V N Q + +LR+ L + + + V PY + P LIS Sbjct: 455 WRHINAVFTEMLREVRSDFANGKQLTSISPELRENLIRAADGALEEVVEPYYYKDPALIS 514 Query: 478 TL-----ALARATLYKHLRELKPQ 496 + R T+ K + + Sbjct: 515 YVQNGIRTWVRQTINKAFPKPNTE 538 >UniRef50_A8LM39 CRISPR-associated protein, Cse1 family n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LM39_DINSH Length = 506 Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 49/185 (26%), Gaps = 31/185 (16%) Query: 1 MNLLIDNWIPVRPRNGGKVQIIN-LQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 +NLL D P+ GG+ + L + + R A V + + Sbjct: 3 VNLLSD---PIFSAEGGRRLNLPGLFAALACDEVRGFPRLRAHQRAAWHMFRVQLAALAL 59 Query: 60 -------PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHA---EHPFMQT---KGV 106 P +D+ ++ L + L + F+Q G+ Sbjct: 60 DKAGRAEPPQDEADWHAL---------LVALTEGVAGPWDLTGPDRTKPAFLQPPDPGGL 110 Query: 107 KANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKS-- 164 K V + L ++ + AL + G G Sbjct: 111 KWEPVATPDALDLLITSRN---HDLKSEIAAQAAPEDWVYALISLQTSEGYGGRGNFGIA 167 Query: 165 GLRGG 169 + GG Sbjct: 168 RMNGG 172 >UniRef50_Q0AA30 CRISPR-associated protein, Cse1 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA30_ALHEH Length = 515 Score = 44.4 bits (103), Expect = 0.011, Method: Composition-based stats. Identities = 73/538 (13%), Positives = 145/538 (26%), Gaps = 70/538 (13%) Query: 2 NLLIDNWIPVRPRNGGKVQII--NLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NLL DN + + ++ L + + L+ + A L + Sbjct: 4 NLLTDNVFGILTPDQQHRRLSLPGLLAALARGEVESLTGVQRHQIDAFHIFLCYLSAAAL 63 Query: 60 PAKDDVE---FRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHP-FMQ------TKGVKAN 109 D + R L ++ P FMQ Sbjct: 64 ECADQPDPPQEEDRWKQSL--RLLSDYADDCAWTLAVDGPGKPAFMQPPIPSNDLAGYKP 121 Query: 110 DVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFK--SGLR 167 ++L + + + A+AL + + G G S + Sbjct: 122 KAATPDELDVLQTAKN---HDLKASRLTHASPEDWALALISSQTMSGFLGQGNYGISRMN 178 Query: 168 GGTPVTTFV-RGIDLRSTVLLNVLTLPRLQKQFPN---ESHTENQPTWIKPIKSN----- 218 GG V V L+ R ++ + QP W P + + Sbjct: 179 GGFGVRVCVGVNRTLQ--------PSERWREDLARLNAQRTHLTQPPW--PFRDDGHLLL 228 Query: 219 -----ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV 273 + + + + F + + G + G+ + KE V Sbjct: 229 WTLPWDGQTSIGLETLHPYFIEICRLVRLVSKPTGIAAL-GKPTKAARIAGGKELAG-NV 286 Query: 274 NGLWPHPHSP-CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVD--KIIQNENGNRVAAV 330 W +P + + ++ + + + +G AA+ Sbjct: 287 GDGW----TPIHRKKGSALTPSARGFHPDMLRDLIITQTEYLLAPMQELPSGDG---AAI 339 Query: 331 VNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINE-IVTVGLGYK 389 + + Q + G+ I + +L+ + +E + + Sbjct: 340 FHASALVRGQGTTD----GFHEVNIPIDSKAKRLLLLGGEPADHLGRRSEWAIDAARNLR 395 Query: 390 TA-LRKALYTFAEGFKNK--DFKGAGVSVHETAERHFY--RQSELLIPDVLANVNFSQAD 444 + LR AL+T EG D Y ++ P + +++ + Sbjct: 396 SRVLRPALFTLLEGGPEGWPDTNRREAGQWTAVWLGEYDEGWADAYFPWLWSSIEVNSEP 455 Query: 445 EVIADLRDKLHQLCEMLFN-----QSVAPYAHHPKLISTLALARATLYKHLRELKPQG 497 + AD +L QL E + + + L R LYKH +E G Sbjct: 456 DARADWVARLSQLAENILEHAFGAAPQRTGRRYRAQVRASGLFRGALYKHFQEEMAHG 513 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46901 Uncharacterized protein ygcL n=11 Tax=Proteobact... 480 e-134 UniRef50_D0FPP1 CRISPR-associated protein, Cse1 family n=2 Tax=E... 423 e-117 UniRef50_C5SD51 CRISPR-associated protein, Cse1 family n=1 Tax=A... 405 e-111 UniRef50_B3E5V2 CRISPR-associated protein, Cse1 family n=3 Tax=D... 403 e-111 UniRef50_Q12YB1 CRISPR-associated protein, Cse1 family n=1 Tax=M... 403 e-111 UniRef50_B8GIV6 CRISPR-associated protein, Cse1 family n=1 Tax=M... 400 e-110 UniRef50_Q054L1 Putative uncharacterized protein n=2 Tax=Leptosp... 391 e-107 UniRef50_A7ZQK3 CRISPR-associated protein, Cse1 family n=55 Tax=... 389 e-106 UniRef50_B4RSK0 CRISPR-associated protein, Cse1 family n=6 Tax=G... 388 e-106 UniRef50_Q1R117 Putative uncharacterized protein n=1 Tax=Chromoh... 374 e-102 UniRef50_Q2FNL5 Putative uncharacterized protein n=1 Tax=Methano... 371 e-101 UniRef50_B4TTX5 Crispr-associated protein, Cse1 family n=9 Tax=S... 371 e-101 UniRef50_D2TKK4 CRISPR-associated protein n=1 Tax=Citrobacter ro... 370 e-101 UniRef50_D0KFE0 CRISPR-associated protein, Cse1 family n=4 Tax=E... 361 5e-98 UniRef50_Q314I1 Putative uncharacterized protein n=1 Tax=Desulfo... 356 1e-96 UniRef50_C6C421 CRISPR-associated protein, Cse1 family n=3 Tax=E... 353 1e-95 UniRef50_Q0W587 Putative uncharacterized protein n=1 Tax=uncultu... 351 6e-95 UniRef50_B5ZCF9 CRISPR-associated protein, Cse1 family n=10 Tax=... 325 3e-87 UniRef50_A5FZI3 CRISPR-associated protein, Cse1 family n=2 Tax=A... 319 2e-85 UniRef50_D1CGD1 CRISPR-associated protein, Cse1 family n=1 Tax=T... 317 5e-85 UniRef50_A1ARH9 CRISPR-associated protein, Cse1 family n=3 Tax=B... 315 2e-84 UniRef50_D2L2X5 CRISPR-associated protein, Cse1 family n=1 Tax=D... 304 5e-81 UniRef50_D0Y921 CRISPR-associated protein, Cse1 family n=2 Tax=D... 300 9e-80 UniRef50_B0TDT8 Crispr-associated protein, ct1972 family, putati... 296 2e-78 UniRef50_A5UR17 CRISPR-associated protein, Cse1 family n=1 Tax=R... 295 4e-78 UniRef50_D1CAJ3 CRISPR-associated protein, Cse1 family n=1 Tax=S... 290 6e-77 UniRef50_A1SV74 CRISPR-associated protein, Cse1 family n=2 Tax=G... 289 2e-76 UniRef50_Q2RY16 CRISPR-associated protein, Cse1 family n=1 Tax=R... 284 5e-75 UniRef50_C0W6U3 CRISPR-associated Cse1 family protein n=1 Tax=Ac... 284 8e-75 UniRef50_B8IMR5 CRISPR-associated protein, Cse1 family n=1 Tax=M... 277 7e-73 UniRef50_Q67RP3 Putative uncharacterized protein n=1 Tax=Symbiob... 274 4e-72 UniRef50_A0LM51 CRISPR-associated protein, Cse1 family n=1 Tax=S... 273 8e-72 UniRef50_A7BA67 Putative uncharacterized protein n=1 Tax=Actinom... 273 1e-71 UniRef50_Q0BSC4 Putative uncharacterized protein n=1 Tax=Granuli... 267 8e-70 UniRef50_Q2JWC2 CRISPR-associated protein, Cse1 family n=2 Tax=C... 262 2e-68 UniRef50_C4FG91 Putative uncharacterized protein n=1 Tax=Bifidob... 259 1e-67 UniRef50_B6XT61 Putative uncharacterized protein n=2 Tax=Bifidob... 258 5e-67 UniRef50_Q53VY1 Putative uncharacterized protein TTHB188 n=1 Tax... 256 2e-66 UniRef50_D1YEE1 CRISPR system CASCADE complex protein CasA n=1 T... 255 2e-66 UniRef50_Q1J370 CRISPR-associated protein Cse1 n=1 Tax=Deinococc... 255 3e-66 UniRef50_C1XFZ8 CRISPR-associated protein, Cse1 family n=2 Tax=M... 253 1e-65 UniRef50_D1NTH8 CRISPR-associated protein, Cse1 family n=1 Tax=B... 251 4e-65 UniRef50_B1VIY3 CRISPR-associated protein n=1 Tax=Corynebacteriu... 249 2e-64 UniRef50_C7MTN0 CRISPR-associated protein, Cse1 family n=1 Tax=S... 248 4e-64 UniRef50_C0VRW0 CRISPR-associated protein n=1 Tax=Corynebacteriu... 246 1e-63 UniRef50_D1A5T7 CRISPR-associated protein, Cse1 family n=4 Tax=A... 244 5e-63 UniRef50_Q5YRB3 Putative uncharacterized protein n=1 Tax=Nocardi... 241 4e-62 UniRef50_Q4JWJ7 Putative uncharacterized protein n=2 Tax=Coryneb... 241 5e-62 UniRef50_C3PF93 CRISPR-associated protein n=3 Tax=Corynebacteriu... 240 7e-62 UniRef50_A5GBL8 CRISPR-associated protein, Cse1 family n=1 Tax=G... 239 2e-61 UniRef50_C7QEM7 CRISPR-associated protein, Cse1 family n=12 Tax=... 237 9e-61 UniRef50_A8LYZ4 CRISPR-associated protein, Cse1 family n=1 Tax=S... 236 1e-60 UniRef50_B7KJ23 CRISPR-associated protein, Cse1 family n=1 Tax=C... 236 2e-60 UniRef50_Q2JH30 Putative uncharacterized protein n=2 Tax=Frankia... 233 8e-60 UniRef50_C8XAY7 CRISPR-associated protein, Cse1 family n=1 Tax=N... 233 1e-59 UniRef50_B8IZA3 CRISPR-associated protein, Cse1 family n=1 Tax=D... 232 3e-59 UniRef50_Q1EQS6 Putative uncharacterized protein n=2 Tax=Strepto... 232 3e-59 UniRef50_D1A6Q3 CRISPR-associated protein, Cse1 family n=1 Tax=T... 232 3e-59 UniRef50_C2CN11 CRISPR-associated protein n=1 Tax=Corynebacteriu... 230 9e-59 UniRef50_A8M401 CRISPR-associated protein, Cse1 family n=1 Tax=S... 226 1e-57 UniRef50_A8SDS0 Putative uncharacterized protein n=1 Tax=Faecali... 226 1e-57 UniRef50_C8P6I4 Putative uncharacterized protein n=1 Tax=Lactoba... 225 3e-57 UniRef50_C1XYH9 CRISPR-associated protein, Cse1 family n=1 Tax=M... 223 1e-56 UniRef50_C7MQD3 CRISPR-associated protein, Cse1 family n=1 Tax=S... 222 2e-56 UniRef50_A4XYU2 CRISPR-associated protein, Cse1 family n=3 Tax=P... 221 5e-56 UniRef50_B6WQ59 Putative uncharacterized protein n=1 Tax=Desulfo... 218 4e-55 UniRef50_C7LYW9 CRISPR-associated protein, Cse1 family n=1 Tax=A... 216 1e-54 UniRef50_D2RAZ9 CRISPR system CASCADE complex protein CasA n=3 T... 210 1e-52 UniRef50_B8FDH6 CRISPR-associated protein, Cse1 family n=1 Tax=D... 209 2e-52 UniRef50_C6CML6 CRISPR-associated protein, Cse1 family n=6 Tax=G... 209 2e-52 UniRef50_Q47PJ1 CRISPR-associated protein, Cse1 family n=1 Tax=T... 209 2e-52 UniRef50_UPI0001AEDDCB hypothetical protein SalbJ_26479 n=1 Tax=... 208 5e-52 UniRef50_Q8KB26 CRISPR-associated protein, CT1972 family n=1 Tax... 207 6e-52 UniRef50_UPI0001AF1D49 CRISPR-associated Cse1 family protein n=1... 204 6e-51 UniRef50_D0WFD1 CRISPR-associated protein, Cse1 family n=1 Tax=S... 203 1e-50 UniRef50_B0LU91 CRISPR-associated protein Cas1 n=2 Tax=Streptomy... 203 2e-50 UniRef50_B5GY59 Putative uncharacterized protein n=1 Tax=Strepto... 200 1e-49 UniRef50_C2BEU1 CRISPR-associated protein n=1 Tax=Anaerococcus l... 198 3e-49 UniRef50_C6HV92 CRISPR-associated protein, Cas1 n=1 Tax=Leptospi... 197 7e-49 UniRef50_C6SPI8 Putative uncharacterized protein n=1 Tax=Strepto... 196 2e-48 UniRef50_B3ENH5 CRISPR-associated protein, Cse1 family n=2 Tax=C... 195 4e-48 UniRef50_B5GA97 Crispr-associated protein n=1 Tax=Streptomyces s... 186 1e-45 UniRef50_Q03C63 CRISPR-associated protein n=1 Tax=Lactobacillus ... 184 9e-45 UniRef50_Q60AC9 CRISPR-associated protein, CT1972 family n=1 Tax... 178 5e-43 UniRef50_D1Y489 CRISPR-associated protein, Cse1 family n=1 Tax=P... 166 2e-39 UniRef50_C2GEY5 Putative uncharacterized protein n=1 Tax=Coryneb... 152 3e-35 UniRef50_B0S4B8 Putative uncharacterized protein n=1 Tax=Finegol... 148 5e-34 UniRef50_C5V9N4 Putative uncharacterized protein n=1 Tax=Coryneb... 147 8e-34 UniRef50_B6IWM6 CRISPR-associated protein, CT1972 family n=1 Tax... 144 1e-32 UniRef50_A8LM39 CRISPR-associated protein, Cse1 family n=1 Tax=D... 101 7e-20 UniRef50_B5H6V1 Predicted protein n=1 Tax=Streptomyces pristinae... 98 8e-19 UniRef50_B6ZW55 CRISPR-associated protein, Cse1 family n=2 Tax=E... 71 7e-11 UniRef50_UPI000169879C hypothetical protein Epers_00880 n=1 Tax=... 59 3e-07 UniRef50_Q06WG4 Putative uncharacterized protein (Fragment) n=4 ... 56 2e-06 Sequences not found previously or not previously below threshold: UniRef50_Q04AX2 CRISPR-associated protein n=2 Tax=Lactobacillus ... 178 4e-43 UniRef50_C7MTA7 CRISPR-associated protein, Cse1 family n=1 Tax=S... 170 9e-41 UniRef50_B2GBJ6 Putative uncharacterized protein n=1 Tax=Lactoba... 155 4e-36 UniRef50_Q0AA30 CRISPR-associated protein, Cse1 family n=1 Tax=A... 107 1e-21 UniRef50_Q0BRG1 Putative uncharacterized protein n=1 Tax=Granuli... 95 6e-18 UniRef50_D0MET3 CRISPR-associated protein, Cse1 family n=1 Tax=R... 83 3e-14 UniRef50_B4UE68 CRISPR-associated protein, Cse1 family n=2 Tax=A... 79 4e-13 UniRef50_UPI0001B51C2F CRISPR-associated Cse1 family protein n=1... 77 1e-12 UniRef50_Q6NEQ6 Putative uncharacterized protein n=1 Tax=Coryneb... 73 4e-11 UniRef50_C9M9R4 CRISPR-associated protein, Cse1 family n=1 Tax=J... 72 4e-11 UniRef50_C4ZJX8 CRISPR-associated protein, Cse1 family n=2 Tax=B... 70 2e-10 UniRef50_Q2RXJ9 CRISPR-associated protein, Cse1 family n=1 Tax=R... 67 2e-09 UniRef50_C2KP47 Putative uncharacterized protein n=1 Tax=Mobilun... 64 1e-08 UniRef50_UPI0001B51C2E hypothetical protein SvirD4_12610 n=1 Tax... 55 7e-06 UniRef50_B6B780 Putative uncharacterized protein n=1 Tax=Rhodoba... 46 0.003 UniRef50_UPI000169A1F1 hypothetical protein Epers_00060 n=1 Tax=... 42 0.059 >UniRef50_Q46901 Uncharacterized protein ygcL n=11 Tax=Proteobacteria RepID=YGCL_ECOLI Length = 502 Score = 480 bits (1236), Expect = e-134, Method: Composition-based stats. Identities = 502/502 (100%), Positives = 502/502 (100%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP Sbjct: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG Sbjct: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID Sbjct: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE Sbjct: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE 240 Query: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF Sbjct: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAF 300 Query: 301 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER 360 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER Sbjct: 301 TTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILER 360 Query: 361 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE Sbjct: 361 RHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 Query: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA Sbjct: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 Query: 481 LARATLYKHLRELKPQGGPSNG 502 LARATLYKHLRELKPQGGPSNG Sbjct: 481 LARATLYKHLRELKPQGGPSNG 502 >UniRef50_D0FPP1 CRISPR-associated protein, Cse1 family n=2 Tax=Erwinia pyrifoliae RepID=D0FPP1_ERWPY Length = 507 Score = 423 bits (1088), Expect = e-117, Method: Composition-based stats. Identities = 271/499 (54%), Positives = 347/499 (69%), Gaps = 2/499 (0%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D+WIPVRP +GG+ Q I LQ+L C +W ++LPRDDME+A LLVC+ Q + Sbjct: 1 MNLLTDDWIPVRPLSGGEGQQITLQTLLCDDRRWLVALPRDDMEMATFQLLVCLLQTLWM 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D + RI PL+ EF +A W F LNH + PFMQ +GV A +VT M+KLL G Sbjct: 61 PSDAQQLVQRIRQPLSAREFADGVAGWQQAFDLNHPQQPFMQVRGVAAKEVTGMDKLLVG 120 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGID 180 ++G+T+ AFVNQ GQG+ALC GCTAIALFNQA APGFGGGFKSGLRGG+PVTT V+G Sbjct: 121 LTGSTSGAFVNQSGQGKALCSGCTAIALFNQACNAPGFGGGFKSGLRGGSPVTTLVQGDC 180 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQP-TWIKPIKSNESIPASSIGFVRGLFWQPAHI 239 LR+T+ NVL+ L + P+ QP TW +PIK +++I SSIG RGLFWQPAHI Sbjct: 181 LRTTLWFNVLSETTLDEFCPDWREQRAQPFTWQQPIKKDQAIAGSSIGLARGLFWQPAHI 240 Query: 240 ELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLA 299 EL P G G+CS CG+ ++ RY FLKEKF FTVNGLW HPHSP + +KKG+VE +++A Sbjct: 241 ELSPPDGAGQCSACGRMASQRYRSFLKEKFNFTVNGLWLHPHSPLIQQIKKGQVEWRYMA 300 Query: 300 FTTSAPSWTQISRVVVDKIIQNEN-GNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASIL 358 F+T APSWTQI R+++++ + + G RVA V Q R ++ L L++GGYRNNQASI+ Sbjct: 301 FSTPAPSWTQIGRLLIEQQVNKQQEGRRVATTVEQARMLSRGRALRLMIGGYRNNQASII 360 Query: 359 ERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHET 418 ERRH+VL FNQGWQ VINEIV +GL Y+ ALR AL+ FAEG K D KGAGV++HE Sbjct: 361 ERRHEVLQFNQGWQHAMPVINEIVNLGLEYRKALRTALWIFAEGAKESDIKGAGVALHEK 420 Query: 419 AERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLIST 478 + +YRQS + ++LA +++ + + + QLC+ LFN APYAHHPKLI + Sbjct: 421 VDPQYYRQSHARVLNLLAQIDYQSPLPQLEQFQTQQQQLCQQLFNDLTAPYAHHPKLICS 480 Query: 479 LALARATLYKHLRELKPQG 497 LA AR L L +LKPQG Sbjct: 481 LAKARRYLMSSLAKLKPQG 499 >UniRef50_C5SD51 CRISPR-associated protein, Cse1 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD51_CHRVI Length = 507 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 218/504 (43%), Positives = 299/504 (59%), Gaps = 10/504 (1%) Query: 1 MNLLIDNWIPVRPRNG-GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 M+LL WIPVR G G +++ Q L C + W++SLPRDD+ELA L LL+C+ QI+ Sbjct: 1 MDLLKTPWIPVRAHGGSGTFRLLTYQELLCEDEDWQISLPRDDLELACLQLLICMTQIMF 60 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 +D RI PLT DEF + I+P ++ F L+H PFMQT+GV A DVTP++KLL Sbjct: 61 LPPEDDVLLDRIDIPLTPDEFTEGISPCLEWFDLDHPTQPFMQTRGVVAKDVTPIQKLLI 120 Query: 120 GVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGI 179 G+ TN AF N PG+ L AIALF+QA P FGGGFK LRG P+TT V G Sbjct: 121 GLPEGTNHAFFNAPGEVSVLSAPVAAIALFHQATNCPSFGGGFKGSLRGIAPITTLVDGR 180 Query: 180 DLRSTVLLNVLTLPRLQKQFPNESHT--ENQPTWIKPIKSNESIPASSIGFVRGLFWQPA 237 +LR + NVLT ++ FP+ H ++ PTWI+PI+S E+I A IG RGLFWQPA Sbjct: 181 NLRKRIWCNVLTPEFIRTDFPDWQHDLSQDLPTWIEPIRSKETIHAHQIGLARGLFWQPA 240 Query: 238 HIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKF 297 H+EL G C G E+ YTGF KEKF FT+ G WPHPH ++KKG +E KF Sbjct: 241 HVELVGSRESGPCDLLGIEAGPLYTGFRKEKFNFTLEGTWPHPHGVLQSSLKKGALEMKF 300 Query: 298 LAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA-----PQSPLELIMGGYRN 352 +FTT AP+WT+++ +V+ G+R A V Q +++A PL LI+GGYRN Sbjct: 301 ASFTTEAPAWTRLTEMVLRINGPKGEGSRPATPVAQAKSMAVTALEKPQPLTLIIGGYRN 360 Query: 353 NQASILERRHDVLMFNQGW-QQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGA 411 N+AS+ ERRH++L GW ++ G+ + ++V +G+ K +L+ L ++G K K G Sbjct: 361 NKASVTERRHEMLSLAAGWSEEDGSRLKDLVALGIKAKESLKDKLSFASKGHKKKMLPGI 420 Query: 412 GVSVHETAERHFYRQSELLIPDVLANV-NFSQADEVIADLRDKLHQLCEMLFNQSVAPYA 470 G + + ER FY ++E I + L+ F Q E A D L C +F+ PY Sbjct: 421 GSPIQDVGERIFYSRTEGKIIETLSRPTTFMQWKENRAAYIDALAADCRDIFDAMTEPYT 480 Query: 471 HHPKLISTLALARATLYKHLRELK 494 P+LI +A AR +L L++LK Sbjct: 481 MKPELIPIIAWARRSLNADLKKLK 504 >UniRef50_B3E5V2 CRISPR-associated protein, Cse1 family n=3 Tax=Deltaproteobacteria RepID=B3E5V2_GEOLS Length = 539 Score = 403 bits (1036), Expect = e-111, Method: Composition-based stats. Identities = 108/522 (20%), Positives = 183/522 (35%), Gaps = 35/522 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIPV G+ I Q L+ PR D + A LL+ + Q Sbjct: 1 MNLIKDAWIPVIRAKSGRGVIAPWQIAELDDPVMELAAPRPDFQGAMYQLLIGLLQTGFA 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHP-FMQTKGVKANDVTPMEKLLA 119 +D E+ P + + F + + P FMQ + + + LL Sbjct: 61 PEDFDEWLDYWSKPPDATLLRTRLETLAAAFDFDKPDSPAFMQDYAMPDGEKKGIASLLI 120 Query: 120 GVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G N + + +C C A+ALF AP G G + GLRGG P+TT Sbjct: 121 ESPGGKTVKDNLDHFIKRDAVQHMCKSCAAMALFTLQTNAPSGGVGHRVGLRGGGPLTTL 180 Query: 176 V---RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES----IPASSIGF 228 V + L T+ LNVL + + + W+ P +++E S+ Sbjct: 181 VLPPEQMPLWQTLWLNVLDREDMPEYRQD--RVAGVFPWMGPTRTSEKNGAETTPESVHA 238 Query: 229 VRGLFWQPAHIELCDPIGI--GKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 ++ + P I L P G C C ++ + + G W HP +P Sbjct: 239 LQAYWGMPRRIRLDFPAKASMGDCDVCDVKNVALVEEYRTRNYGVNYVGNWVHPLTPYRF 298 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFR-----NIAPQS 341 KK + L+ + + ENG+ A +V ++ + Q Sbjct: 299 DPKKEKP---PLSLKGQQGGLGYRYWLALTLANDTENGDAAAKIVRRYSEQRATELKIQR 355 Query: 342 PLELIMGGYRNNQAS-ILERRHDVLMFNQGWQQYGNVI---NEIVTVGLG----YKTALR 393 L G+ + H +FN QQ ++ ++++TV + ++ Sbjct: 356 TARLWCFGFDMDNMKARCWYDHTFPLFNLAPQQRKKLLQWADDLITVANDVSSLLRKQVK 415 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDK 453 A + E K D + + +E FY + L ++ E+ + Sbjct: 416 AAWFRRPEDAK-GDMNTVSLDFWQRSEPVFYELLDQL--AKVSGEQELPPPELYSQWEKM 472 Query: 454 LHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 L L LF+ V A+ + + R L + + K Sbjct: 473 LVSLSLQLFDAWVLEAANEDMDMKRIIAERDGLKRLVYGCKS 514 >UniRef50_Q12YB1 CRISPR-associated protein, Cse1 family n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YB1_METBU Length = 528 Score = 403 bits (1035), Expect = e-111, Method: Composition-based stats. Identities = 106/533 (19%), Positives = 190/533 (35%), Gaps = 57/533 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQ--SLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NL+ + WI V+ ++G + I Q S L PR D + + L+ + Q Sbjct: 3 FNLIHEKWIWVQRQDGTRSMIAPWQITDEIGSNPIISLDEPRPDFNGSMIQFLIGLVQTT 62 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 K D ++R R ++P T +E + +F L+ + FMQ ++ LL Sbjct: 63 MSPKSDGKWRKRFISPPTPEELLETFEKVAHVFDLDGDDERFMQDHEHIEGAKNRVDALL 122 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 + G N + +C C ALFN AP G G ++ LRGG P+TT Sbjct: 123 MEMPGVQTLKHNADHFQKRDTVTQMCLPCCVTALFNLQLNAPAGGQGHRTSLRGGGPLTT 182 Query: 175 FVRGIDLRSTVLLNVLTLPRLQKQF-PNESHTENQPTWIKPIKSNES----IPASSIGFV 229 V G +L T+ LNV++ + + + W+ P +++E + Sbjct: 183 LVLGSNLWQTIWLNVISDENFKGLGDVDNCEISDIYPWMGPTRTSEKKNAMTTPMDVNPK 242 Query: 230 RGLFWQPAHIELCDPI-GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTV 288 + + P I L G C CG ES+ + ++ + + + +G W H SP Sbjct: 243 QMYWGMPRRIRLDLDDLIEGACDVCGCESDKLVSNYVTKNYGYNYDGGWCHVLSPHNENK 302 Query: 289 KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQN-------------ENGNRVAAVVNQFR 335 L T + + + + + ++ + + F+ Sbjct: 303 NG------LLPRHPQPGGITYRHWLGLVQHDPDKGLYSSLAFERFVQKQKDLSDLGDVFK 356 Query: 336 NIAPQSPLELIMGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGLG---- 387 N +L GY + + + ++ Q Y +++ +V Sbjct: 357 NTP-----QLWAFGYDFDNMKVRCWYESTMPLFNVADELRQSYEQIVSRLVKTAEIIAYN 411 Query: 388 YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA--DE 445 ++ ++KAL+ + D E FY L +V +A E Sbjct: 412 TRSCVKKALF--GDNTPRGDLSFIDSRFWHDTESEFYNILNQ-----LTDVVNDEAMVLE 464 Query: 446 VIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGG 498 + +L ++ E LF+ S ALA +K LR+ GG Sbjct: 465 LKMKWHKELSRISEKLFDDSSQSMQFSVIDPERAALA----HKDLRKFNSDGG 513 >UniRef50_B8GIV6 CRISPR-associated protein, Cse1 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV6_METPE Length = 534 Score = 400 bits (1027), Expect = e-110, Method: Composition-based stats. Identities = 106/519 (20%), Positives = 186/519 (35%), Gaps = 34/519 (6%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQ--SLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ WIPV ++G + I + S Y L PR D A + L+ I Q Sbjct: 3 NLIEQAWIPVIRKDGERSTIAPWELTSDYQENPIVELDAPRPDFNGALVQFLIGIVQTEL 62 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 P + V ++ P + + + I+ F L+ FMQ + + ++KLL Sbjct: 63 PPTNPVTWKRMFRRPPEPADLKASFSTHIEAFNLDGDGPRFMQDLTLAKGEALAIDKLLI 122 Query: 120 GVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G N + G + LC C A+ALF AP G G ++ LRGG P+TT Sbjct: 123 ERPGEQTVKKNTDHFLKRGGIDHLCMTCAAMALFTLQTNAPSGGRGHRTSLRGGGPLTTL 182 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPNE-SHTENQPTWIKPIKSN---ESIPASSIGFVRG 231 V G L TV LNV++ L++ + + + W+ +++ E + + Sbjct: 183 VTGRTLWETVWLNVISPQELERYGNSALTSAADIFPWMGETRTSNNNEITTPQDVNPAQM 242 Query: 232 LFWQPAHIELCDP--IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVK 289 + P I L G+C CG+ + + + F + G W H SP K Sbjct: 243 FWGMPRRIRLDLDGKPEPGECDLCGKTTERQVSTFSAKDSGVNYKGGWCHVLSPYSTNPK 302 Query: 290 KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN----IAPQSPLEL 345 LA T + + + + ++N ++ AAVV+ FR L Sbjct: 303 GE-----LLAKHAQPGGVTYRNWLGLVQN-DSQNNSQPAAVVSLFREQRQLGLNGFQPHL 356 Query: 346 IMGGYRNNQASILERRHDVLM--------FNQGWQQYGNVINEIVTVGLGYKTALRKALY 397 GY + + ++ ++ +G +T+++KAL+ Sbjct: 357 WAFGYDMDNMKARCWYEGKMPLHHIDEGLLPGYEEEIARLVRTAGLIGFSVRTSIKKALF 416 Query: 398 TFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQL 457 + E D + E F++ + L + + + + L Sbjct: 417 SRPEDAT-GDLSFIDARFWQDTEPAFHKTLDELATLL---KDGGDRTTLKLNWLKSLRDE 472 Query: 458 CEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 + LF+ +ALA L + + Sbjct: 473 GKRLFDDYSQADLIDQTDPKRVALAWRDLQRFTSRFNKK 511 >UniRef50_Q054L1 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q054L1_LEPBL Length = 533 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. Identities = 103/516 (19%), Positives = 184/516 (35%), Gaps = 40/516 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYC----SRDQWRLSLPRDDMELAALALLVCIGQ 56 MNL+ D WIPV+ + +I + S LS PR D A L LV + Q Sbjct: 1 MNLIKDVWIPVQRFSEKLEEISPFEITSRIENDSDPVMSLSAPRPDFNGALLQFLVGLLQ 60 Query: 57 IIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND-VTPME 115 + +++ E+ +NP + + ++ + F L FMQ + D V + Sbjct: 61 AVFSPENETEWEDLFVNPPSPEVLKEAMEKVKSAFELFGDGPRFMQDTNLNEEDTVFDIS 120 Query: 116 KLLAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTP 171 L G F + +C C I LF+ +P G G + +RGG P Sbjct: 121 ALFIESPGENTIKLKKDFFIKRNSISQICEKCAGIGLFSFQTNSPSGGQGHLTSIRGGGP 180 Query: 172 VTTFV-------RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN-----E 219 +TTFV + L S + LNVL Q + ++ W P + + Sbjct: 181 LTTFVTSKLKNPKKNSLWSKLWLNVLPKSYFQVNGQKKIPFQSVFPWTNPKIEDLQSKGK 240 Query: 220 SIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTV-NGLWP 278 + + + + P I L + G C C + S++ + + + + G W Sbjct: 241 TTTPQDLHPLSVYWSYPRRILLRKDVENGICDVCNRSSSVLVRSYHTKTYGLSYSEGGWI 300 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 HP SP + E + + + E ++ A V+ +F N Sbjct: 301 HPLSPYYKSK------ESWFPYHPQPGGILYHYW--QTIALGKEQEDQAALVIRRFLNRK 352 Query: 339 -PQSPLELIMGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGLGYKTALR 393 P ++ GY + + F + +++ +++I+ K LR Sbjct: 353 IPGEQTSILTFGYDMDNMKARCWYESEIPFFNIPSDKIEKFEEQVSQILNASTEVKKNLR 412 Query: 394 KALYTFAEGFKN---KDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADL 450 +A+ KN D VS + E FY + +++++V + Sbjct: 413 QAVRNAWLDKKNDSKGDLTFLDVSFLKDTENSFYDLIRNVQENLISDVA--DFAALKETW 470 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATL 486 L++ LF+Q + + I + AR L Sbjct: 471 LKLLNESACKLFDQYADSGSFEFENIERIVKARKNL 506 >UniRef50_A7ZQK3 CRISPR-associated protein, Cse1 family n=55 Tax=Enterobacteriaceae RepID=A7ZQK3_ECO24 Length = 520 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 95/519 (18%), Positives = 195/519 (37%), Gaps = 39/519 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL W+PVR ++G ++ + + ++ PR D++ AA L+ + Q Sbjct: 4 FSLLTTPWLPVRFKDGTTGKLAPVD--LADENVVDIAAPRADLQGAAWQFLLGLLQTSFA 61 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 K+ + + L ++ ++ + F FMQ D + LL Sbjct: 62 PKNHGRWDDIWEDGLEAEKLREALLSLEHAFQFGADSPSFMQDFEALKGDKVQVASLLPE 121 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 + GA N + G E +C C+A+ALF+ AP G G+++GLRGG P+TT + Sbjct: 122 IPGAQTTKFNKDHFIKRGVTEHVCPHCSALALFSLQLNAPSGGKGYRTGLRGGGPMTTLI 181 Query: 177 R--------GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE----SIPAS 224 L + NV+ P + W+ P +++E + Sbjct: 182 ELQEYQGNQQTPLWRKLWPNVMPQDEADLPLPK-KFDDLVFPWLGPTRTSELAGAVVTHD 240 Query: 225 SIGFVRGLFWQPAHIELCD-PIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 + ++ + P I + +G C CG++S+ + + + +W HP +P Sbjct: 241 QVNKLQAYWGMPRRIRIDFNTTTVGNCDICGEQSDALLSLMTTKNYGANY-AMWQHPLTP 299 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP--QS 341 + +K+G +F + + + + ++EN + A+V + N + Q+ Sbjct: 300 YRIPLKEG---GEFYSVKPQPGGLIWRDWLGLIETGKSENNTELPALVVKLFNASSLKQA 356 Query: 342 PLELIMGGYR-NNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLG----YKTALRKAL 396 + L GY +N + H + + + + ++AL++A Sbjct: 357 KVGLWGFGYDFDNMKARCWYEHHFPLLLKKKEGQIPKLRLAAQTASRILSLLRSALKEAW 416 Query: 397 YTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQ 456 ++ +G DF + + F R + ADE++ + ++ Sbjct: 417 FSDPKGA-RGDFSFVDIDFWNKTQHRFLRLVRQI-------EEGQDADELLGKWQKEIWL 468 Query: 457 LCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 F++ V + P + + AR + E + Sbjct: 469 FARQDFDERVFTNPYEPVDLKRVMTARKKYFTTSAEKQS 507 >UniRef50_B4RSK0 CRISPR-associated protein, Cse1 family n=6 Tax=Gammaproteobacteria RepID=B4RSK0_ALTMD Length = 535 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 97/524 (18%), Positives = 180/524 (34%), Gaps = 42/524 (8%) Query: 1 MNLLIDNWI--PVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 MNLL + W+ V+ +G + + + +LPR D + AA + + Q Sbjct: 1 MNLLKEPWLLFNVQQPDGSIAEKTLPITAIAKPEVIDFALPRADFQGAAYQFAIGLLQTC 60 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKG-VKANDVTPMEKL 117 D+ E++ + P TED + + F FMQ + T + L Sbjct: 61 FAPDDEFEWKDNYLEPPTEDALRPAFSKAEHAFNATGDGPLFMQDFDSLDEAKPTSVSGL 120 Query: 118 LAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 L G N + G GE + +ALF AP G G ++GLRGG P+T Sbjct: 121 LIEAPGGNGLKLNTDHFVKRGIGEVMSLPMAVLALFTLQINAPAGGQGHRTGLRGGGPLT 180 Query: 174 TFVRGID----LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPAS----S 225 T V + L + LNV ++ + H++ W+ + + + Sbjct: 181 TLVMPQNENSPLWQKLWLNVAPND--ERYSAPDLHSDTVFPWLGKTRVSAKKGSETYQKD 238 Query: 226 IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 + + + P I L G+CS G+ + + + + G W HP +P Sbjct: 239 VHPLHMFWSMPRRIRLIVEDVAGECSLTGKSCSQLVKLYKTQNYGANYAGSWSHPLTPYK 298 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDK---IIQNENGNRVAAVVNQFRNIAPQ-- 340 +KK + E+ L+ T V+ + A+VV F ++ Sbjct: 299 RDLKKPDQED--LSIKGQPGGITYKIWDVLTLTGSPDGGKTQQMCASVVRSFNHLVNDDI 356 Query: 341 -----SPLELIMGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGLGY--- 388 + L + GY + + + + Q + I ++ T+ Sbjct: 357 LEDVTAQARLWVFGYDMDNMKARGWYSETMPLFQVPSAKQQHILDYIKQLQTIANDALWH 416 Query: 389 -KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVI 447 ++ ++ A + K DF + + + F+ + L+ + +A Sbjct: 417 CRSQIKSAWFDKPGDAK-GDFSFIETAFWQQTQSAFFAAVQQLMSSDSLYLTSMEA---- 471 Query: 448 ADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLR 491 L + LF++ + + AR L K L Sbjct: 472 KQWLSTLRNVALSLFDEYALSELGSERTMEKRIGARKNLLKGLY 515 >UniRef50_Q1R117 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R117_CHRSD Length = 564 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 105/541 (19%), Positives = 180/541 (33%), Gaps = 50/541 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D W+P R +G + S D L+LPR D + AA L+ + Q Sbjct: 3 MNLLTDPWLPFRRSDGSLLYRPP--SALADPDILDLALPRADFQGAAWQFLIALLQTAMT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKG-VKANDVTPMEKLLA 119 K+ + R P + +EF+ +AP+ F L+ FMQ ++ P+ LL Sbjct: 61 PKNTDAWLDRYQTPPSVEEFEAALAPFSRAFELDGEGPRFMQDLDPLEDVKDAPVAGLLI 120 Query: 120 GVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G N F + G+ EA+C C A+AL+ AP G G + GLRGG P+TT Sbjct: 121 DSPGANGIKNNTDFFVKRGRVEAVCPDCAALALYTMQINAPAGGAGIRVGLRGGGPLTTL 180 Query: 176 VRGI----DLRSTVLLNVLTLPRLQKQFPNESHT----ENQPTWIKPIKSNESIP----A 223 + L + NV+ + + + W+ + ++ Sbjct: 181 ILPEDETKSLWERLWPNVMPADAVGQPGQTWRPPTVDDADLFFWMSDTRVSDKKGTEVFP 240 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 + + L+ P L G C CG+E + +K +G W HP +P Sbjct: 241 DQVHPLHALWSMPRRYRLLFEDESGCCDLCGRECSRLVRRLRSKKQGANYDGPWRHPLTP 300 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ----FRNIAP 339 K E L+ + + +G A VV +R + Sbjct: 301 YRRLNPKKTDEL-PLSSKGQPGGLGYRHWPGLVLEDEASSGAMPARVVTHHLHKYRMVES 359 Query: 340 QSP-----------LELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINE-------- 380 L + GY + + + + ++ + Sbjct: 360 ARDDGEAFDAMFRHARLWVFGYDMDNMKPRGWYSVEMPLVGVPEAHQEILRDWVKRFVDL 419 Query: 381 IVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNF 440 V + L++A + E K D E + F+ + + L + Sbjct: 420 ASDVAWQVRNQLKRAWFKRPEDAK-GDMSQIDAQFFEATQLAFFDVLRQ-MSETLRDYGD 477 Query: 441 SQA--DEVIADLRDKLHQLCEMLFNQSVAPYAHHP---KLISTLALARATLYKHLRELKP 495 + A E+ L + LF+ + + + AR L +L Sbjct: 478 TPALSPEIHQKWHLTLKREALRLFDAQAISGPLEGMKMQQLERITSARRYLLAYLNGSGK 537 Query: 496 Q 496 + Sbjct: 538 K 538 >UniRef50_Q2FNL5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNL5_METHJ Length = 532 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 100/517 (19%), Positives = 179/517 (34%), Gaps = 37/517 (7%) Query: 3 LLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 LL D WIPV +NG G + + S Y + L+ R D A + L+ + Q + P Sbjct: 4 LLHDAWIPVVRKNGDSGLIAPHQITSDYDTNPVIELNASRPDFNGALIQFLIGLIQTVCP 63 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 + D E+ R+ N + D + D F L+ FMQ + ++ LL Sbjct: 64 PESDKEWTDRLDNVIPSDVLKGHFKQIQDAFSLDGKGPRFMQDISIGDEKKNSVDGLLIE 123 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 + G N F + + +C C A+ALF PG G G K+GLRGG P+T+ + Sbjct: 124 MPGENTVKKNTDFFVKRDTVKQMCPSCAAMALFTLQVNGPGGGAGHKTGLRGGGPLTSVI 183 Query: 177 RGIDLRSTVLLNVLTLPRL--QKQFPNESHTENQPTWIKPIKSNESIPAS---SIGFVRG 231 G L TV LN++ + + + W I+ ++ + + ++ Sbjct: 184 LGETLWETVWLNIIPSIKFFGDAIAKQKKSMDMIFPWFGKIRLSDKKEKTGVIDVNPLQM 243 Query: 232 LFWQPAHIELCDPIGI-GKCSCCGQESNLRYTGFLKEKFTFTVNGLW-PHPHSPCLVTVK 289 + I L G C CG S+ + + + W H +P + Sbjct: 244 FWGMGRRILLDFEDKPVGACDVCGLASSATVLTYHTKPHGVDYDETWDNHTLTPYYLDK- 302 Query: 290 KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI------APQSPL 343 E + T + + + + +V+ V F+N + Sbjct: 303 -----EVYRPIHLQPGGITYRNFMGLIIP-DSSRNIKVSRTVTNFQNNIIRLKRKSFKKV 356 Query: 344 ELIMGGYRNNQASILERRHDVLMF-----NQGWQQYGNVINEIVTVGLGYKTALRKALYT 398 L GY + A + + + + N+++ ++ +L ++ Sbjct: 357 RLWSFGYDMDNAKARCWYESRMPIYLLDDEKKRELFENIVSNLIFTAEYVLQSLAGSIKD 416 Query: 399 FAEGFKNKDFKGAGV--SVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQ 456 G NK + + E FY + L + N Q D++ + L Sbjct: 417 AISGHGNKGKEPVDLRSRFWNETEELFYATLDRLFFSLG---NVEQIDQIKREWYRMLVG 473 Query: 457 LCEMLFNQSVA-PYAHHPKLISTLALARATLYKHLRE 492 +LF+ ++ AR K E Sbjct: 474 HSVLLFDYYTQVSLISDLNDPKSVIDARIKFKKFTGE 510 >UniRef50_B4TTX5 Crispr-associated protein, Cse1 family n=9 Tax=Salmonella enterica subsp. enterica RepID=B4TTX5_SALSV Length = 511 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 106/517 (20%), Positives = 192/517 (37%), Gaps = 35/517 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ + W+PV +G K +I L L+ PR D + AA +L+ I Q Sbjct: 1 MNLITEKWLPVIFSSGEKTRISLRDLL--DNRIQDLAYPRPDFQGAAWQMLIGILQCTIA 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 +D E+ + + +++++ + + F+Q+ ++ + LL Sbjct: 59 PEDKEEWADIWHDGIEFEQWEKALNTISLALQFGEQKPSFLQSFDPLDSEYGSIAGLLVD 118 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 G N + G E +C C AIALF +P G G++ G+RGG P+TT V Sbjct: 119 APGGNTLKLNKDHFVKRGNVEQICPHCAAIALFAIQTNSPAGGAGYRVGMRGGGPLTTLV 178 Query: 177 RGI-----DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES----IPASSIG 227 L + LNVL PN + W+ P K++E + + Sbjct: 179 VPQEEDKYPLWKKLWLNVLPQEEP----PNVTQHPLIFPWLAPTKTSEKAGNVVTPDNSH 234 Query: 228 FVRGLFWQPAHIELCDPIGI-GKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 ++ + P IEL + G C CG+ + + + W HP SP Sbjct: 235 PLQAYWGMPRRIELDFTHTVAGICDLCGEHHESLLLQMRSKNYGVQYD-SWLHPFSPYRQ 293 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNE-NGNRVAAVVNQFRNIAPQSPLEL 345 +K +LAF + + + +++ N + A VV ++ + L Sbjct: 294 ALKDPS--APWLAFKGQPGGLSYKDWLGLMLNREDKFNKMQPAKVVRAA---GQRNNMSL 348 Query: 346 IMGGYRNNQAS-ILERRHDVLMFN-QGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGF 403 + + A +H + + + +Q+ +N ++ + + LR AL + Sbjct: 349 WCFAFDMDNAKARCWYQHRIPLISVSHEEQFLAALNTVLVLASEALSLLRNALKSAKFDC 408 Query: 404 KNK---DFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEM 460 + DF ++ + E F E L D L +Q ++ +L Sbjct: 409 PKEAKMDFSMVDIAFWQETEPAFRALQEALAVDPLRQ--DTQTRHAVSQWEAELAHYLFH 466 Query: 461 LFNQSVAPYAHHPKLI-STLALARATLYKHLRELKPQ 496 +F++ P I AR L R+ K + Sbjct: 467 VFDRDALTNPDCPDDILQRQLTARQDLASSYRKHKAR 503 >UniRef50_D2TKK4 CRISPR-associated protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TKK4_CITRO Length = 519 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 98/516 (18%), Positives = 167/516 (32%), Gaps = 36/516 (6%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NL+ W+PVR +NG ++ + + ++ R D++ AA L+ + Q Sbjct: 5 NLIFCQWLPVRFKNGATGKLAPVD--LADENVVDIAATRADLQGAAWQFLLGLLQSSIAP 62 Query: 62 KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGV 121 K+ + LT + + +AP F+ FMQ A + + LL + Sbjct: 63 KNYSRWEDIWEEGLTGEMLHKALAPLGHAFHFGAESPSFMQDFEPLAGEKVSIASLLPEI 122 Query: 122 SGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVR 177 GA N + G E LC C A+ALF+ AP G G+++GLRGG P+TT + Sbjct: 123 PGAQTIKFNKDHFIKRGVTERLCPHCAALALFSLQLNAPSGGKGYRTGLRGGGPLTTLIE 182 Query: 178 --------GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE-----SIPAS 224 L + LNV+ P W+ +++E Sbjct: 183 LQEYKGERQTPLWRKLWLNVMPQDTADLPLPAVCDAS-VFPWLAATRTSEPPANTVTTPE 241 Query: 225 SIGFVRGLFWQPAHIELCD-PIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 + ++ + P I L G C CG ES+ + + +G W HP +P Sbjct: 242 QVNKLQMYWGMPRRIRLDFATTQTGLCDICGVESDALLGFMTVKNYGVNYDG-WRHPLTP 300 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNR-VAAVVNQFRNIA-PQS 341 VK F + + +++ E A VVN F Sbjct: 301 YRAPVKDKSG---FFSVKLQPGGLIWRDWLGLNQKNSTEANEEYPAQVVNVFNAHKLAGV 357 Query: 342 PLELIMGGYRNNQASILERR--HDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTF 399 L G + I H L+ + + L + K + Sbjct: 358 KAGLWGFGADFDNMKIRCWYEHHFPLLMTENLLSDLRKAVQTAARLLSLLRSALKEAWFA 417 Query: 400 AEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCE 459 + DF + + F L + DE + + +L Sbjct: 418 SAKDARGDFSFIDIDFWNLTQGRFLHLIHDL-------ETGQKPDERLNQWQRELWLFTR 470 Query: 460 MLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 F+ + + + AR + E + Sbjct: 471 RYFDDRAFTNPYENNDLQRIMAARRKYFTTSAEKQS 506 >UniRef50_D0KFE0 CRISPR-associated protein, Cse1 family n=4 Tax=Enterobacteriaceae RepID=D0KFE0_PECWW Length = 523 Score = 361 bits (925), Expect = 5e-98, Method: Composition-based stats. Identities = 103/526 (19%), Positives = 181/526 (34%), Gaps = 46/526 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +L+ + W+P +G I LQ + L+ R D + A+ LL+ + Q Sbjct: 3 FSLIEEPWLPAVFADGRMSNISPLQ--LPDDNIIDLAWTRADFQGASYQLLIGLLQTAYA 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 DD ++ + L D F Q +A + FMQ +D TP+ LL Sbjct: 61 PADDDDWDAIWEDGLGTD-FSQALAALAPAMQFGAQKPAFMQDCAPLDSDSTPISGLLID 119 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 G N + G A+C C A+ALF AP G G ++G+RGG P+TT + Sbjct: 120 APGGNTLKLNKDHFIKRGTVNAICPHCAAMALFTLQTNAPSGGQGHRTGMRGGGPITTLL 179 Query: 177 RGI----DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA----SSIGF 228 L + +NVLT QK P W+ +++ + Sbjct: 180 MQEDGRLPLWKKLWMNVLT----QKVMPKGKPDATVFPWLAATPTSDGTHPPVTQENSHQ 235 Query: 229 VRGLFWQPAHIELCDPI-GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVT 287 ++ + P IEL G+C CG ES T + + + + W HP +P Sbjct: 236 LQAYWGMPRRIELDFTTLQTGECDLCGTESTALLTQYRTKNYGIQYD-SWRHPLTPYRRA 294 Query: 288 VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNE-NGNRVAAVVNQFRNIAPQSPLELI 346 +K + FL+ + + ++++ N AAVV L Sbjct: 295 LKG--DDAPFLSVKGQPGGLAYRDWLGMMVSVEDKLNHTYPAAVVQHNAGKRALRNAGLW 352 Query: 347 MGGYRNNQAS-ILERRHDVLMF---------NQGWQQYGNVINEIVTVGLG----YKTAL 392 GY + H V + ++Y + + V + + + Sbjct: 353 CFGYDMDNMKARCWYEHHVPLLFSPARFQSDALSLREYKDHLQLAVELARDSATLLRQMI 412 Query: 393 RKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRD 452 ++A ++ + K DF V+ + + F R L + + Sbjct: 413 KEAWFSRPKDAK-GDFSAIDVAFWQETQPDFMRLCRSL-------AQGDTPATALNIWKK 464 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGG 498 L++ ++ V + ++ A R L + K Sbjct: 465 SLYRYLLNNYDARVFSNPDEHRDLAKAAKTRKKLSAFFYKQKAAET 510 >UniRef50_Q314I1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I1_DESDG Length = 534 Score = 356 bits (914), Expect = 1e-96, Method: Composition-based stats. Identities = 109/534 (20%), Positives = 187/534 (35%), Gaps = 38/534 (7%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 N+L D W+PV +G +++I + L L +PR D A L LV Q + P Sbjct: 5 NILSDQWLPVILADGKRIRIAPWE-LTADPRPVALDIPRPDFGGAMLEFLVGCMQTMCPP 63 Query: 62 KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPMEKLLA 119 + ++R P + + P+I F+L FMQ +K + V + LL Sbjct: 64 QSRKDWRSWRKTPPQPQTLRTAMEPFIPHFHLLGERPLFMQDLTLKQEEENVMGVAALLI 123 Query: 120 GVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 G + F + G+ E LC C A+AL+ AP G G+++ LRGG P++T Sbjct: 124 DSPGENAIKNDTDFFVKRGRIETLCPACAAMALYTMQAFAPSGGAGYRTSLRGGGPLSTL 183 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQ-PTWIKPIKSNESI-----PASSIGFV 229 V G L TV NVL + P+ + W P + ++ PA+ + Sbjct: 184 VLGETLWETVWNNVLVAESTDWRIPDGHDPLGRILPWTVPTRDSKKKGTAILPATGHNLL 243 Query: 230 RGLFWQPAHIELCDPI--GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVT 287 + P L C CG S + + G W HP +P Sbjct: 244 H-FWAMPRRFRLHPENLSDPAACDICGTPSTTVIRQIGAKNYGNNYEGAWQHPLTPYREQ 302 Query: 288 VKKG-EVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELI 346 K + K + T+ W ++ + A + QFR + P S + + Sbjct: 303 GKGKLALSVKGASECTAYHQWLG---LLYGPLGSKNKTLVAAQCIRQFRELLPASAVRVR 359 Query: 347 MGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGLGYK----TALRKALYT 398 GY + + + N I + + AL++AL+ Sbjct: 360 AFGYDMDNMKARQWCEGEMPLYALDPAETAILQNEIELWLDAADKTRSNLIKALKQALFA 419 Query: 399 FAEGFKNKD---FKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ---ADEVIADLRD 452 D A + E FY + + V + Q A + + + Sbjct: 420 DGGRNAKADQTLLANASTAFWSRTESAFYSLAARFVESVQQQDDAQQIALAKMLRNEWAN 479 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALA----RATLYKHLRELKPQGGPSNG 502 ++ + +F++ A A + + A R + ++ G +G Sbjct: 480 RILAATDAIFSEQAASGAFDERQAPRIYGALNQMRRFNRGNCNKVLATGSTGDG 533 >UniRef50_C6C421 CRISPR-associated protein, Cse1 family n=3 Tax=Enterobacteriaceae RepID=C6C421_DICDC Length = 511 Score = 353 bits (905), Expect = 1e-95, Method: Composition-based stats. Identities = 94/536 (17%), Positives = 185/536 (34%), Gaps = 60/536 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +L+ W+ V +G + +I Q L+ PR D + AA LL+ + Q Sbjct: 2 FSLIDTPWLSVVGADGHRTRISPRQ--LTDDRIIDLACPRPDFQGAAWQLLIGLLQTAYA 59 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D+ + + L D + Q + + FMQ D +P+ LL Sbjct: 60 PSDEEAWEDIWHDGL-GDGWIQALDGLAPALQFGADKPAFMQDFSSLDADNSPIAGLLID 118 Query: 121 VSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 G N + A+C C A+AL+ AP G G + G+RGG P+TT + Sbjct: 119 APGGNTLKLNKDHFVKRDAVSAICPHCAALALYTLQTNAPSGGVGHRVGVRGGGPITTLL 178 Query: 177 RGI------DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESI----PASSI 226 L + NV + R + W+ +++E + Sbjct: 179 MPYDAHTPVPLWRKLWANVTSGER-------GRCEADVFPWLAATRTSEGDKDKVTPENA 231 Query: 227 GFVRGLFWQPAHIELCDPI-GIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 ++ + P IEL G+C CG +S+ T + + + W HP +P Sbjct: 232 HPLQAFWGMPRRIELDFSHTESGRCDLCGDKSDHLLTHYRTKNYGVQYE-HWRHPLTPYR 290 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQF--RNIAPQSPL 343 + K G L+ + + + ++ + + A V + + + + Sbjct: 291 QSNKDG----MLLSVKGQPGGLSYRDWLGLVLGTKDTLNDTLPACVVSLSHQRVPLRQKV 346 Query: 344 ELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLG--------YKTALRKA 395 L GY + + W++ + + + +G+ + +++ A Sbjct: 347 GLWCFGYDMDNMKARCWYEHRVPV---WKEITPAVRDYLPLGVQMAHDAQQLLRQSVKAA 403 Query: 396 LYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLH 455 ++ + DF VS + E+ F + + + + R++L+ Sbjct: 404 WFSRPKDV-GGDFSVIDVSFWQETEQFFRQLYRSI-------AGGNCPIAALNAWRNQLY 455 Query: 456 QLCEMLFNQSVAPYAHHPKLISTLALARATL---------YKHLRELKPQGGPSNG 502 F++ ++ AR+ + K L+ L+PQ +NG Sbjct: 456 MYLIGTFDRLTFGNPDQQGDLTRAVEARSEMVKLFYGQKSMKKLKALQPQEVSANG 511 >UniRef50_Q0W587 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W587_UNCMA Length = 533 Score = 351 bits (899), Expect = 6e-95, Method: Composition-based stats. Identities = 99/517 (19%), Positives = 174/517 (33%), Gaps = 37/517 (7%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NLL + WI +G VQ L +L + + P +E LL+ + Sbjct: 5 NLLTEPWITSIDLSGNPVQEGILATLKNAHKIDSIFDPAPPVEFGIYRLLIAFITDVFQP 64 Query: 62 KDDVEFRHRI-MNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQ--TKGVKANDVTPMEKLL 118 + + + L + A W D F L ++PF+Q GV P+ +L+ Sbjct: 65 QGLEDLADLLDRKRLDPTALDEYAARWRDRFDLFDEKYPFLQQAITGVIKKPPEPISRLM 124 Query: 119 AGVSGATNCAFVNQPGQGE-ALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVR 177 + TN + + E + A L A G G + G P V+ Sbjct: 125 QHLPAGTNVSHFHHGRWDENSFSFEQCAKGLVTIAPFMTAGGAGLSPSINGSPPWYVLVK 184 Query: 178 GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPA 237 G +L T+L NV +P K ++ W + + + V GL W+P Sbjct: 185 GNNLFETLLYNVCQIPMTVKPI-----GDSPVAWRNDKRIDPGDEPKTFSIVEGLTWRPR 239 Query: 238 HIELCDPIGIGKCSCCGQESNLRYTGFLKEK-FTFTVNGLWPHPHSPCLVTVKKGEVEEK 296 I+L G G C+ G++ + GLW P KK + + Sbjct: 240 IIQLIPGNGKGTCTYTGEKDVDTVSHMHYYPGQKSPEPGLWVDPQVAY----KKTKDAIR 295 Query: 297 FLAFTTSAPSWTQISRVVVDKII-----QNENGNRVAAVVNQFR------NIAPQSPLEL 345 L + W I +++ + + AVV Q++ I PL L Sbjct: 296 PLRPDENKALWRDIGPLMLLQHGDYSGKDGKVSFDRPAVVTQYKQMVSNGMIKRSEPLRL 355 Query: 346 IMGGYRNN-QASILERRHDVLMFNQGW-------QQYGNVINEIVTVGLGYKTALRKALY 397 + G R + + I E H+ L +Q + ++ +V + A++KA Y Sbjct: 356 EVYGIRTDGKMKIYEWYHEKLALPIEILKKANSGRQIQDAMDLADSVAYILRKAMKKA-Y 414 Query: 398 TFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADE---VIADLRDKL 454 F +SV + H Q E + L+ + + D ++ + L Sbjct: 415 PRNAKSNESGFDNLILSVQSSYWSHLKGQFESIFLKTLSQQDENDLDAYTKLMEQWKKIL 474 Query: 455 HQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLR 491 + ++ + P + A +R Sbjct: 475 DDTGKNALDKGLGPLDTDGDSLRRQVKAMNEYSSGIR 511 >UniRef50_B5ZCF9 CRISPR-associated protein, Cse1 family n=10 Tax=Acetobacteraceae RepID=B5ZCF9_GLUDA Length = 546 Score = 325 bits (832), Expect = 3e-87, Method: Composition-based stats. Identities = 95/536 (17%), Positives = 172/536 (32%), Gaps = 53/536 (9%) Query: 1 MNLLIDNWIPVRPRNG--GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 MNLL +W+P+R ++G ++ + L PR D +A+L L+ + Sbjct: 1 MNLLTASWLPIRRKSGAAETIRPAQIVDRVADDPIMALDWPRADFRIASLEFLIGLLATA 60 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P K++ + +P + + + AP + F+L+ F+Q + P+E+LL Sbjct: 61 FPPKNEDIWCETWEDPPSVEALDEAFAPVAEAFWLDGPGPRFLQDLENLQSGQEPVERLL 120 Query: 119 AGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTT 174 G N + AL +ALF + AP G G +GLRGG P+ T Sbjct: 121 IDAPGDSTVKKNTDLFVHRQRIMALGRPAACMALFTLQSWAPSGGAGNMTGLRGGGPLVT 180 Query: 175 FV---RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESI-----PASSI 226 V G L V N P+E+ W+ P + P + Sbjct: 181 LVLPREGASLWEMVWANT-----PFGVPPSEADLPRVFPWLAPTIGSGKDGTSVRPGHNA 235 Query: 227 GFVRGLFWQPAHIELCDP-IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW-------- 277 ++ + P I L G C GQ + G+ + + + Sbjct: 236 HPLQCWWGMPRRIRLDFEAAEDGICDLTGQPDAVLVPGWRQRPYGASYADWTGMPYGAGA 295 Query: 278 -PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ--- 333 HP +P K ++L+ + + + + V++ Sbjct: 296 SIHPLTPRYRQKKD----AEWLSVHPQPGGIGYRHWAGIVVNSSDTHRLPASTVLSWRND 351 Query: 334 -FRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQ----GWQQYGNVINE-----IVT 383 RN+A L+ GY + + Q+ + + Sbjct: 352 RARNVAASLTPRLLAAGYDMDNMKARSFVESEMPLPGVVDPVRQEALDALARAYVEAADQ 411 Query: 384 VGLGYKTALRKALYTFAEGFKNKD-FKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ 442 V + +R+AL+ + F G E F+ + + Sbjct: 412 VAGILRQCVREALFGKGTISPDATLFSGLRERFWAQTEGTFFDLLHQAVL-----LGDGD 466 Query: 443 ADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKL-ISTLALARATLYKHLRELKPQG 497 ++ L ++ LF+ +V ALAR L + +G Sbjct: 467 DIDLRRIWLRALRRVALDLFDSAVMLTPDTGTTEAQRSALARRRLGAAVAGGGKEG 522 >UniRef50_A5FZI3 CRISPR-associated protein, Cse1 family n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FZI3_ACICJ Length = 529 Score = 319 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 96/524 (18%), Positives = 168/524 (32%), Gaps = 46/524 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINL--QSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NLL + W+P+ ++G K I S ++ PR D LA + LLV + Sbjct: 2 FNLLTNPWLPIVRQDGTKSVIAPRDITEDISSNPVIAVNWPRADFRLATMELLVGLIATA 61 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 P D+ ++ P + ++ AP F + FMQ D P+E LL Sbjct: 62 CPPADEDDWLDAWEAPHSPEKLDGAFAPLAHAFSFDGPGPRFMQDLADLDADEEPVENLL 121 Query: 119 AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV-- 176 V+G N + PG+ + + AI+L+ + +P G G ++GLRGG P+ T V Sbjct: 122 IEVAG--NSGPLVHPGRTKRMGRPAAAISLYTLQSWSPSGGRGNRTGLRGGGPMVTMVAP 179 Query: 177 -RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS---NESIPASSI-GFVRG 231 R L + NV + P W+ P + +E + + ++ Sbjct: 180 GRHRSLWHHIWANVPL-----GRKPEPVDFPRIFPWLSPTITSVNDEVVTPDDVAHPLQV 234 Query: 232 LFWQPAHIELCDPI--GIGKCSCCGQESNLRYTGFLKEKFTFTVNGL-WPHPHSPCLVTV 288 + P I L C G + TG+ + G HP +P Sbjct: 235 WWGMPRRIRLSFVQLPSPAPCDLTGALDSSVVTGWRQRPHGPKYVGWGARHPLTPTYQNK 294 Query: 289 KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ----FRNIAPQSPLE 344 E+ L+ + + + + + V +R Sbjct: 295 AGEEI----LSVHPNPGGVGYRNWIGLVLRSPDGLRRPAPIVSTWRNDRYRGTEEAKGAR 350 Query: 345 LIMGGYRNNQASILERRHDVLMFNQGWQQ---------YGNVINEIVTVGLGYKTALRKA 395 LI GGY + + + +++ + A+++A Sbjct: 351 LIAGGYDTDNMKARGFMETEVPLVLASSKEVQERLDALATSLVRASERASYQLRKAVQQA 410 Query: 396 LYTFAEGFKNK--DFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDK 453 LY K G V E F+ + ++ + + Sbjct: 411 LYHPGAKVKATAHGIALLGDRVWLETESAFFSALD-------RAMSLDDTAPERVAWQVR 463 Query: 454 LHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQG 497 L L +F+ +V + + AR L L G Sbjct: 464 LRGLALRIFDDTVPIDPLD-RNNARQVRARFFLGLGLSGYGKDG 506 >UniRef50_D1CGD1 CRISPR-associated protein, Cse1 family n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CGD1_THET1 Length = 533 Score = 317 bits (813), Expect = 5e-85, Method: Composition-based stats. Identities = 91/531 (17%), Positives = 169/531 (31%), Gaps = 39/531 (7%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQS-LYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ + WIPVRP Q++ L+ L + L P + ++ LL+ I + Sbjct: 4 FNLVDEPWIPVRPIGASTTQLMGLRDVLLGAHAIRELVDPSPLVTVSLHRLLLAILHRVF 63 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT---PME 115 +DD E+ + + W D F L H ++PF Q ++ VT P+ Sbjct: 64 GPRDDAEWAELYGGGSFPPQPLEDYLQRWHDRFDLFHEKYPFYQKGSIQRQSVTKLWPVT 123 Query: 116 KLLAGVSG-ATNCAFVNQP-GQGEALCGGCTAIALFNQANQAPGF----GGGFKSGLRGG 169 +L ++ + +G A A L G G K Sbjct: 124 RLAPEIASPGNATTLFDHTLPEGVAFTPDRAARYLVLLHPFTVGGLFGLLKGEKDKAADA 183 Query: 170 TPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASS 225 P+ +RG L T++LN++ + P S E+ P W + + Sbjct: 184 GPLAKCAVVLLRGRTLFETLMLNMV-RYDPEFDEPCPSTPEDSPAWERDDDTQPVD-RLP 241 Query: 226 IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCL 285 G++ L WQ + L + G+ E+ + +P Sbjct: 242 KGYLDYLTWQSRRVRLFPEVQDGRVVVREVIIVKGCQLRPTEEI-ANYETMVAFRKNP-- 298 Query: 286 VTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNR--VAAVVNQFRNIAPQSPL 343 KGE + F W + + + +A ++ R + + L Sbjct: 299 -RAGKGENPRPPVGFREDRAMWRDSHVIFQSTDTHTQPRSLRWIAELIAMGR-LQEEHRL 356 Query: 344 ELIMGGYRNNQASILERRHDVLMFNQG-------WQQYGNVINEIVTVGLGYKTALRKAL 396 L + G +QA+I RH+VL + + Q + +VG L + Sbjct: 357 PLEIYGIITDQANIKLWRHEVLPVSTRYFSDRNLYSQLQRALAMAESVGQELDRTLEQLA 416 Query: 397 YTFAEGFKNKDFKGAGVSVH--ETAERHFYRQSELLIPDVLANVNFSQADEVI------A 448 + +++ T + ++ A+ Q + Sbjct: 417 RDLLPNPNRDEINNLRKAINATPTYWASLEVPFHHFLLELEADRQLVQGRPIYGYGYAMR 476 Query: 449 DLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGP 499 + D + Q + +V+ + + + AR L L Q P Sbjct: 477 NWMDAIKQAGRLALEFAVSGLDGNARNLRAAVNARGRFNGRLNTLLAQEAP 527 >UniRef50_A1ARH9 CRISPR-associated protein, Cse1 family n=3 Tax=Bacteria RepID=A1ARH9_PELPD Length = 506 Score = 315 bits (808), Expect = 2e-84, Method: Composition-based stats. Identities = 76/515 (14%), Positives = 148/515 (28%), Gaps = 42/515 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ + WIPVR +G + ++ +L S++ + P + A L+ + Sbjct: 4 FNLIDEKWIPVRFPDGAREELGIRDTLLRSKEIAAIEDPSPLVVAALHRFLLAVLYRALE 63 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 D + + ++ L + W + F+L ++PF Q V +++ P KL A Sbjct: 64 GPTDIDQAKILFLSGLPGQRITAYLEKWRERFWLFDEKYPFGQNPNVSRDEIEPWTKLTA 123 Query: 120 GVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVR 177 + +N + A A L + + G G+ + Sbjct: 124 EYNATSNKVLFDHTNTKNPGAREPKECARWLLSTMTFSISGGRGYYPS-PSPNAMMCIPL 182 Query: 178 GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI--GFVRGLFWQ 235 G + T+ ++ P + + W + K+ + G+ WQ Sbjct: 183 GRNFHETLCYCLVPYPNRNVMSGDST------LWEREPKALPLNTPKQMATGYADLYTWQ 236 Query: 236 PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEE 295 I L G+ S +R+ F P P P + KG Sbjct: 237 SRMIRL--EEQP-----TGEVSMMRFVAGQ----GFENPSSTPDPMHPYKLEKNKG---I 282 Query: 296 KFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQA 355 + F W ++ D + A + + ++ Y A Sbjct: 283 LPVQFRKDRGVWRDFDSLLPDSSELAPITIQNAVKLAGKNMNYLPESVLILGLKYEPPNA 342 Query: 356 SILERRHDVLMFN---QGWQQYGNVINEIVTVGLGYKTALRKALYTFAEG---------- 402 ++ R + L G + I + + + L A +FA Sbjct: 343 NLEFWRMECLSLPKALAGDRFIRTDIRQFLADAEEAQKTLWTACNSFARDLISRGDKRPV 402 Query: 403 FKNKDFKGAG-VSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEML 461 K + V ++ D + + Sbjct: 403 SKKDISDFVEQMPVSSVYWSTLESCFHKILSDYNLERDPEDIRCQWLKFVRDAMRTAWKQ 462 Query: 462 FNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 SV+ I L A + + L+EL + Sbjct: 463 HTSSVSTG--DAWAIRALVKAERPVLRKLKELNDE 495 >UniRef50_D2L2X5 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X5_9DELT Length = 554 Score = 304 bits (778), Expect = 5e-81, Method: Composition-based stats. Identities = 99/531 (18%), Positives = 163/531 (30%), Gaps = 55/531 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSR-----DQWRLSLPRDDMELAALALLVCIG 55 NLL +WIPVR +G +++I + + PR D + A L LL+ Sbjct: 4 FNLLTQDWIPVRRVDGTRLRIPPWRITDPGDGSPGQAIADIDTPRPDFKGALLELLIGFV 63 Query: 56 QIIAPAKDDVEFR---------HRIMNPLT--EDEFQQLIAPWIDMFYLNHAEHPFMQTK 104 Q P D+ ++R + P + AP F L F+Q Sbjct: 64 QTALPPTDNRKWRLGLSANTTNEPHLAPPDYAPAALKTAFAPLTPFFNLFGDRPRFLQDL 123 Query: 105 GV---KANDVTPMEKLLAGVSGAT----NCAFVNQPGQG-EALCGGCTAIALFNQANQAP 156 + +A + +P+ LL G N F + Q + LC C A AL AP Sbjct: 124 TLTEAEAKEPSPIAALLMDSPGENATKFNSDFFIKRDQPPDRLCPACAAAALHALQTYAP 183 Query: 157 GFGGGFKSGLRGGTPVTTFVR-GIDLRSTVLLNVLTLPRLQ---KQFPNESHTENQPTWI 212 G G + LRGG P+TT V L TV NVL L + W+ Sbjct: 184 SGGAGHRVSLRGGGPLTTLVMLDDSLWKTVWANVLPLDAANVEALPANPAALPGAVFPWL 243 Query: 213 KPIKSN----ESIPASSIGFVRGLFWQPAHIELCDPIG--IGKCSCCGQESNLRYTGFLK 266 + + + + F+ + P I L C CGQ N+ + Sbjct: 244 AVTRDSTAKGSEVHREGMHFLHHYWAMPRRIVLDAETDETPSACPVCGQPGNVFVRQYRT 303 Query: 267 EKFTFTVNGLWPHPHSPCLVT-VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGN 325 + + W HP +P K + K + + W V ++ Sbjct: 304 KNYGNNYGKGWQHPLTPYRDQGPGKEALTIKGESEGRAYNQWLGF----VYGATDDKKPV 359 Query: 326 RVAAVVNQFRNIAPQ---SPLELIMGGYRNNQASILERRHDVLMFNQG--------WQQY 374 A VV +R +P +P L G+ + + Sbjct: 360 IPARVVTHYRTGSPPGQETPARLRTFGWDMDNMKARNWCEGEYPILDLKGREAKRFIGEV 419 Query: 375 GNVINEIVTVGLGYKTALRKALYTFAEGFKNKD---FKGAGVSVHETAERHFYRQSELLI 431 ++ + A+ +AL++ D E FY ++ Sbjct: 420 APLVKAAEEACNNLRKAVHEALFSEKGPKPKPDATLLALVETRFWAETETAFYTSVRSIL 479 Query: 432 PDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALA 482 ++ + + R L +F P+ + A Sbjct: 480 EA--SDDDEEARLGIALGWRRTLLDAVGAIFAAVAEDGGTTPRKTRQIYAA 528 >UniRef50_D0Y921 CRISPR-associated protein, Cse1 family n=2 Tax=Dehalococcoides RepID=D0Y921_9CHLR Length = 543 Score = 300 bits (767), Expect = 9e-80, Method: Composition-based stats. Identities = 74/538 (13%), Positives = 158/538 (29%), Gaps = 73/538 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ + WIP + ++ +L+ + + + + +A LL+ I Sbjct: 4 FNLIDEPWIPCIGADDNIIEYSIRDTLFKAHELREICDDSPLVTVAIHRLLLAILYRAFE 63 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 E+R N + + ++ + W F L ++PF Q + + +L Sbjct: 64 GPSSMQEWRELYRNGSFNKSKIKEYLEKWCQRFNLLDEDYPFYQMSQFETVKPISVNRLA 123 Query: 119 AGVSGATNCAFVNQPGQGEAL--CGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT--- 173 ++ N + G + A L + A GFG + + G + Sbjct: 124 TEIASGNNATLFDHCGDDIEVEWTPSQVAQRLITCQSFALGFGRSGNAKINGINEILPYS 183 Query: 174 ----------TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA 223 +++G L T+++N+ + + P ++ + E + Sbjct: 184 SDAIALRGMNIWLQGGTLFETLMINLSPV--IDNSLPPWEL-KDSNKYRDRQNGKERVVC 240 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 S G V L WQ I L C S + + + Sbjct: 241 RSSGLVDQLTWQSRLIRLIP--------NCQTISKMYFAQGRSADKSAN------DLMKV 286 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN------I 337 ++ +G L+ +++ +W +++ ++ N I Sbjct: 287 YRLSKDEGVSS---LSLSSNKAAWRDAHSILMIPESGSKERR--PECFNMAEEAIISGVI 341 Query: 338 APQSPLELIMGGYR---NNQASILERRHDVLMFNQGWQQYGNVINEI---VTVGLGYKTA 391 + G N + RH+ + + +++ + + A Sbjct: 342 GGSKSFVTHIVGLATAPNKAGKFIFWRHERMPVPAAFLSNIDLLKRLGSCLENAERAAEA 401 Query: 392 LRKALYT----------FAEGFKNKDFKGAG---------VSVHETAERHFYRQSELLIP 432 LR + + G D ++ E HF+ E L Sbjct: 402 LRYRIQRVTKLYLSPDCESPGGHRPDKADVDNIIEATDPCLTYWSRMEEHFFALLESLPN 461 Query: 433 DVLANVNFSQADE---VIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLY 487 D A S+ DE R + + +S+ + + I +A Y Sbjct: 462 DWDAATGDSKPDEEQTARLTWRQSVKLEAKRALLESIELFGTTARAIQAIAHVSTDFY 519 >UniRef50_B0TDT8 Crispr-associated protein, ct1972 family, putative n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TDT8_HELMI Length = 523 Score = 296 bits (757), Expect = 2e-78, Method: Composition-based stats. Identities = 82/517 (15%), Positives = 160/517 (30%), Gaps = 42/517 (8%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL + W+ VR G + L + + + ++ L + I P Sbjct: 5 FDLLTEPWVTVRDVKGRICVVHLRDVLAKAHEWSEVIDESPLIQFGLYRFLQALIIDIFP 64 Query: 61 AKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT--PMEKL 117 K + E + + F L AE PF+Q + V + +L Sbjct: 65 LKGQRGRLELMEEGQFDETKLNAYWEKYGVYFDLFDAERPFLQVPPREQEKVKRKSVAEL 124 Query: 118 LAGVSGATNCAFVNQPGQGE-ALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFV 176 + TN + Q E L A + + G G + G P + Sbjct: 125 FHQLPTGTNVIHFHHRLQDEYVLAPDVCARIMTTLSPFTTAGGQGLSPSINGNPPYYVWR 184 Query: 177 RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQP 236 +G +L T+LLN + P W + + + S + GL WQP Sbjct: 185 KGDNLFETLLLNYWITDQ----------DRGIPAW-RDRRPSRGETRSEARLLEGLTWQP 233 Query: 237 AHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK 296 + L +G +C+ G+ + E W P+ + T K Sbjct: 234 RRVTLIPEMGPFQCTYSGRSCQWGVRQMVFEAGFQARVDTWRDPNVAVVNTDKG----RS 289 Query: 297 FLAFTTSAPSWTQISRVVVD----KIIQNENGNRVAAVVNQFRNIA--PQSPLELIMGGY 350 F+ +W + + + K +Q +N A ++NQ Q + + G Sbjct: 290 FVRPRWGRQTWRDVGPLALIDGAGKGVQEKNSYERAPILNQASIYLECEQQTTTIEVYGL 349 Query: 351 RNN-QASILERRHDVLMFNQGWQQY-------GNVINEIVTVGLGYKTALRKALYTFAEG 402 + + L+ R++ L G +Q +N + A+ + + Sbjct: 350 QTDGNMKYLDWRYEELQLPAGLEQVPNGEEFALQAMNNAEKAAWALRKAVNMCVQIKQKK 409 Query: 403 FKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADE--------VIADLRDKL 454 K + G + E ++ E L+ + + +E ++ ++ Sbjct: 410 GKKEQKIWPG-EWGQRVEDAYWLSLEAPYLAFLSVLAGTAKEEDPDKHLETLMEAWTKEI 468 Query: 455 HQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLR 491 F ++ + + A L + LR Sbjct: 469 RNKASDYFTEATKENVSDAEAMRRQIQAEQYLRRSLR 505 >UniRef50_A5UR17 CRISPR-associated protein, Cse1 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR17_ROSS1 Length = 525 Score = 295 bits (754), Expect = 4e-78, Method: Composition-based stats. Identities = 89/520 (17%), Positives = 171/520 (32%), Gaps = 29/520 (5%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL + WI V R+G +I L + + LS P + LL I Q I Sbjct: 7 FNLWTEPWIRVIRRDGRDDEIGIGTCLTDAHELAALSDPSPLVAGGTHRLLTAILQAIHQ 66 Query: 61 AKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---ANDVTPMEK 116 +D E + N + Q F L PF+QT V ++ P+ + Sbjct: 67 PQDIGEIAALLHNAKFDINRLQAFEKNHAGRFDLFDPHAPFLQTGDVPLHSNHNPQPVAR 126 Query: 117 LLAGVSGATNCAFVNQPGQGEA-LCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTF 175 L A + AT +C C A L A G G + + G P+ Sbjct: 127 LFAEIPVATERVHFTHVTDDRHRICPACCARGLVTAPAFASSGGAGIRPSINGVPPIYVL 186 Query: 176 VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQ 235 G L T+ L++++ L + + P ++ S++G++ L + Sbjct: 187 PAGDTLFETLTLSLVSSDYL-PPGADPKRADQAIWNSDPPVVGKNCEVSAVGYLESLTFP 245 Query: 236 PAHIELCDPIGIGKCSCCGQESNLRYTGFLKE--KFTFTVNGLWPHPHSPCLVTVKKGEV 293 + L G C+ CG+++++ L E + G+W P K+ + Sbjct: 246 ARRMRLYPQAGSVFCTNCGRQTDIFVATMLFEMGHWLSKQTGVWEDPFVAFRKPSKQSKN 305 Query: 294 E-EKFLAFTTSAPSWTQISRVVVDKIIQNENG---NRVAAVVNQFRNIAPQSPLELIMGG 349 K + W + + +++D+ ++A ++++ + + L G Sbjct: 306 ADLKPIRPEEGKAIWREYAVLLLDEDAAGLRPRIVRQLARLIDRG-TLTGRQRLRFRCIG 364 Query: 350 YRNN-QASILERRHDVLMFNQGWQQY-------GNVINEIVTVGLGYKTALRKALYT-FA 400 R + +A I E + L Q G + + V K+ + Sbjct: 365 IRTDGKAKIFEWLDEALEAPPELLQDPDAAAYVGEALRQSHEVAAILKSTFERHFRPERG 424 Query: 401 EGFKNKD----FKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQ 456 G N+ FK + R + D+ + Q D+ + + Sbjct: 425 TGGSNEQKFIRFKTVLERLIADYWRRLGLHFRQFVNDL---SDVWQRDDTARTWVILIIK 481 Query: 457 LCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 + F ++ + A+A + L + + Sbjct: 482 EAQACFRTALDQTGDRADALRIRVEAQAECERQLHARRKE 521 >UniRef50_D1CAJ3 CRISPR-associated protein, Cse1 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ3_SPHTD Length = 555 Score = 290 bits (743), Expect = 6e-77, Method: Composition-based stats. Identities = 78/552 (14%), Positives = 155/552 (28%), Gaps = 63/552 (11%) Query: 1 MNLLIDNWIPVRPR-NGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NLL WIP +G + L + + + P + ++ LL+ + Sbjct: 5 FNLLDCPWIPCMRAADGAWEDLSLRDVLVRAHELREIVDPSPLVTVSLHRLLLAFLHRVF 64 Query: 60 PAKDDVEFRHRIMNPL-TEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 E+ D + W + F L +PF QT V A+ P+ ++ Sbjct: 65 GPASIDEWAALWERGSWDPDPIDRYCERWRNRFNLFDPTYPFYQTPAVDASYAKPVAGIV 124 Query: 119 AGVSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGFGGGFKS---------GLR 167 G+ + AL A L G ++S Sbjct: 125 HGMMLGNYLTLFDHSVATDPPALSPAQAARYLVAYQAFDVGGMISYQSRHGEEASVAKYT 184 Query: 168 GGTPVTT----FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA 223 P+TT V+G +L T++LN+ T++ W + Sbjct: 185 KAGPLTTSAVALVKGRNLFQTLMLNLHAYNGADGL--PFHFTDDSAAWERDEHPTPR-ER 241 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP----H 279 G+V L WQ + L G + + W Sbjct: 242 RPSGYVDLLTWQSRRVRLLPESADG--------NAAPVVRYAVIMKGEQFPDGWNPADYE 293 Query: 280 PHSPCLVTVK--KGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI 337 P ++K G + F W + Q+ + V Sbjct: 294 PMVAFRKSLKPRDGVPPWFPIGFQEDRALWRDSLALFQSVSGQSARPKMLDWVAGLAAEG 353 Query: 338 APQSPLE--LIMGGYRNNQASILERRHDVLMFNQGW---QQYGNVINEIVTVGLGYKT-A 391 S L + G +QA++ RH+ L ++ ++ E + + T Sbjct: 354 PLGSRARFALDLYGMVTDQANVTLWRHERLPLPAPLLNDRERYELLQEALGLAERVSTLL 413 Query: 392 LRKALYTFAEGFKNKDFKGAGVSVHETAERH-----------------------FYRQSE 428 + + K + +T F+ + Sbjct: 414 VNDVVTVTGADGKTRKLPSPLRVFAQTLAAPDDAVTPSPTTVRSIAKSLAPSAVFWSRLA 473 Query: 429 LLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYK 488 + +++++ A + +A ++ + ++ F + + + A A + Sbjct: 474 VPFSQLVSDLGGDIAGDPLARWTREIQRTAQVAFRGVIHSLDSSARALKAAARAERAFNR 533 Query: 489 HLRELKPQGGPS 500 LRE+ P+ Sbjct: 534 LLREMLTGYLPA 545 >UniRef50_A1SV74 CRISPR-associated protein, Cse1 family n=2 Tax=Gammaproteobacteria RepID=A1SV74_PSYIN Length = 488 Score = 289 bits (739), Expect = 2e-76, Method: Composition-based stats. Identities = 117/506 (23%), Positives = 204/506 (40%), Gaps = 35/506 (6%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNLL D++I I+L+++ ++L D+++LA L LL + ++ Sbjct: 1 MNLLKDDFIS------TSQGKISLKTILTGEQNYQLQYYFDEIQLAMLQLLSSLSTVVLQ 54 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPMEKLL 118 E + + N LT ++++ + ++ + FMQ+K P+ KLL Sbjct: 55 PTV-QELKDYLKNGLTPEQYEAALDKVESQWFESDC---FMQSKPPTNAKWPDAPITKLL 110 Query: 119 AGVSGA---TNCAFVNQPGQGEALCGGCTAIALFNQANQAPG--FGGGFKSGLRGGTPVT 173 +G+ ++ Q E C C +N G FG +G+RGG ++ Sbjct: 111 SGIECGTSANAMGLFSEIEQAEISCTDCMHALNYNLHMNIKGECFGPTGATGIRGGGAIS 170 Query: 174 TFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLF 233 T + G +L+ T+L N + +S E +P W+ P+ + AS IG RGLF Sbjct: 171 TLIAGENLKQTLLNNTIAKDYFNDYAQLDSDAEQRPMWVAPLSGS-VYQASKIGINRGLF 229 Query: 234 WQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVN----------GLWPHPHSP 283 HI C CG ES F +EK+ G WPHP++P Sbjct: 230 ALAYHIGFNIEDKPCLCDVCGSESEQSVKTFNREKYKGNYGSTKNGREAGAGWWPHPYTP 289 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPL 343 + K E A + SW ++ +V K ++ A ++ QF+ + Sbjct: 290 RTI---KEEGAFAVCARDQNWQSWQELGSYIVGKET-DKATLEPAYIIKQFQYMKTPRQT 345 Query: 344 ELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGF 403 L++GG +Q I R +D+ ++ + + +++ GL K L +A Sbjct: 346 NLLVGGNIADQGGITGRVYDLYSMPSSLNKHLSKVTQVLDSGLDQKNRLSQAFNKMFGAG 405 Query: 404 KNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFN 463 +K+F G + E A F ++ +I L +V +A E+ +L Q + +F Sbjct: 406 YDKNFVGG---IKENAMYRFTANAQQIIQRTLLDVERKEATELRKTAVIELKQEAQRIFM 462 Query: 464 QSVAPYAHHPKLISTLALARATLYKH 489 Y H L L + LY+ Sbjct: 463 GVQRKYQHDLPLFKALVKGESALYRK 488 >UniRef50_Q2RY16 CRISPR-associated protein, Cse1 family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY16_RHORT Length = 555 Score = 284 bits (727), Expect = 5e-75, Method: Composition-based stats. Identities = 90/535 (16%), Positives = 156/535 (29%), Gaps = 51/535 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIIN--LQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 NLL++ W+PVR +G + + L + L PR D A L L+ + + Sbjct: 3 FNLLLERWLPVRRVSGKRDWVAPHQLTEGFAEDPIVGLDFPRADFNAAVLEFLIGVVYVA 62 Query: 59 APAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQT-KGVKANDVTPMEKL 117 P + ++ + P Q ++P F + Q + A D P+ L Sbjct: 63 LPCQKAADWVKGSLTPPAPATLQAALSPLAFAFDFDGDGPRAYQDTSDLAAADCRPITGL 122 Query: 118 LAGVSG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 G N + ALC A A AP G G ++ +RGG P+T Sbjct: 123 FIDFPGENTLKNNADLFIKRRDASALCLPYAAAATITLQTYAPSGGAGHRTSIRGGGPLT 182 Query: 174 TFVRGI----------DLRSTVLLNVLTLPRLQKQFPNESHTEN------QPTWIKPIKS 217 T V L + NV + P + W+ + Sbjct: 183 TLVAPRRRLAGGGEVATLWDRIWANV-PDQKWDGSDPIAGDPADHANWPLVFPWLAAAIT 241 Query: 218 N---ESIPASSIGFVRGLFWQPAHIELCDPI--GIGKCSCCGQESNLRYTGFLKEKFTFT 272 + + + + + F P + L C G + GF + + Sbjct: 242 SSHGQIVAPADATKRQSFFGCPRRLRLVFQQASPDHPCVLGGPAGAIMAVGFRTQNYGAN 301 Query: 273 VNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVN 332 G W HP SP K G + + W I + E +R Sbjct: 302 YEG-WTHPLSPYRDDKKAGRLPIHPHGGAATYGDWLAIWGYDGTPAVGVEIWDR-----R 355 Query: 333 QFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYG----NVINEIVTVGLGY 388 + A + + G+ + A + L + + + I +++ Sbjct: 356 RALLGATLAGDAIEAFGFDMDNAKARQWLDIRLPWVGVYGEDAATLRTAIAQMIGATQKA 415 Query: 389 KTALRKALYTFAEGFKNKDFKGAGVSVHETAE----------RHFYRQSELLIPDVLANV 438 LR A+ G + D K E ++++E + ++ Sbjct: 416 SQRLRLAIRLALWGQRATDPKTGKPGFRLPDELPADAATIDVTPIWQETEGPFRRHVQDL 475 Query: 439 --NFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLR 491 V L F+ +V L AR L + L Sbjct: 476 IAKPDGHLAVRKLWLKTLRGQTLRQFDTTVDLDGLTDADPHRLLFARDGLSRALA 530 >UniRef50_C0W6U3 CRISPR-associated Cse1 family protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6U3_9ACTO Length = 551 Score = 284 bits (725), Expect = 8e-75, Method: Composition-based stats. Identities = 83/548 (15%), Positives = 156/548 (28%), Gaps = 63/548 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQI-IA 59 NLL + WI V +G ++ L + + +A L LL+ I + Sbjct: 5 FNLLDEPWIRVTWLSGESEEVSLLTLFRDATQIEGIHGEIASQNIAILRLLLAICHRTMD 64 Query: 60 PAKDDVEFRHRIMNPLT-EDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTPMEK 116 +D +R +P + + + + F L E PF Q G+ + +E Sbjct: 65 GPEDLEVWREYWSSPGSLGQDASTYLERFRSRFDLRDPEQPFFQVAGIHTASGKSSGLES 124 Query: 117 LLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF-------------GGGFK 163 L+A + + A L + P G G+ Sbjct: 125 LIADIPNGHPFFTTRMGEGLSQMTWAEAARWLIHVHAFDPSGIRSGAVGDPQVRNGKGYP 184 Query: 164 SGLRGGTPVTTF-VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP 222 G + T + G +L T+LLN + ++ + + P W +P Sbjct: 185 IGPGWTGQIGTITLAGDNLEQTLLLNTVVCNCVEG-LQEVDLSRDLPPWERPADGPGGSA 243 Query: 223 AS-SIGFVRGLFWQPAHIELCDPIGI-----GKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 + G V WQ + L + G ++ +++ Sbjct: 244 SKQPTGPVSCYTWQTRRVLLHGKEEVTSLFLGNGDKATPQNRQHVEPLTAWRYSEP---- 299 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQ-------ISRVVVDKIIQNENGNRVAA 329 + K T W +S +V K + A Sbjct: 300 ---------QSQKAKATVYMPRKLPTDRAMWRGLPTVVPHLSPMVSTKAGGQVSRFLPPA 350 Query: 330 VVNQFRNIAPQS--------PLELIMGGYRNNQASILERRHDVLMFNQGWQ-QYGNVINE 380 V+ ++ + Q P+ + Y +A I E D L + + Sbjct: 351 VITFYQRLMYQRVIPPRKLLPIHAVGMEYGAQEAVITELVEDTLHVPSALLGRDNTRLLT 410 Query: 381 IVTVGLGYKTALRKALYTFAE--GFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANV 438 +V+ + L A + + A FY+ + P LA++ Sbjct: 411 LVSDAIEVTEQAAGTLRNLAANLDRAAGGSPDTSSAARQRAGAQFYQAIDERFPRWLADI 470 Query: 439 NFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHP-----KLISTLALARATLY--KHLR 491 + + V R+ L S + + RA + + L Sbjct: 471 ADADPESVAEQWREVLRSEAHRQAEHLAQNAPSTAFTGRGDGTSRMDVGRALFFFRRKLA 530 Query: 492 ELKPQGGP 499 E+ P+ G Sbjct: 531 EVLPRPGS 538 >UniRef50_B8IMR5 CRISPR-associated protein, Cse1 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IMR5_METNO Length = 560 Score = 277 bits (708), Expect = 7e-73, Method: Composition-based stats. Identities = 92/550 (16%), Positives = 166/550 (30%), Gaps = 64/550 (11%) Query: 1 MNLLIDNWIPVRPRNGGK--VQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +LL + WIPV +G ++ + + + + R D++ A + + Sbjct: 4 FSLLTEPWIPVLRADGTHACIRPAEITADIAANPVVAPAWGRPDLDAATREYWIALFGTA 63 Query: 59 APA-KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKL 117 + +R + +P + AP F L+ F Q A + P+ +L Sbjct: 64 CGSWAGPGAWREHLRHPPAPEVLDAAFAPLAPAFILDGEGPRFGQDLEDIAGETVPVGQL 123 Query: 118 LAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT 173 L GA N + G+ E L AIAL AP G G + +RGG P+T Sbjct: 124 LIEAPGANTIKRNLDHFVRRGRVETLSRAGAAIALHTLQTYAPSGGAGHRVSVRGGGPLT 183 Query: 174 TFVRGI-----------DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE--- 219 T + L T+ L S + W+ P +++E Sbjct: 184 TLLLPGPPRGGDPARPVPLWQTLWLAT--------PACEASSLKRVFPWLAPTRTSEQKR 235 Query: 220 SIPASSIGFVRGLFWQPAHIELCDP--IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 S + ++ + P + L C G+ + + + G + Sbjct: 236 VTTPSDVDPLQAFWGMPRRVRLVFEANTEGHPCDLTGRIDPVVVRAYRTRPHGTSYVG-F 294 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENG-NRVAAVVNQFRN 336 HP SP G+ +E FL V + Q + R A V + Sbjct: 295 THPLSPHYR----GKADEPFLPVHGQPGRVGYRHWVGLVVSDQAASPLRRPADAVTLGLS 350 Query: 337 ------IAPQSPLELIMGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVGL 386 + L+ GY + + + +++++ Sbjct: 351 RLEGVGGPTAAQARLLATGYDMDNMKARAFIESEMPLHLPPPGRFSDLNGAVSDMIKGAY 410 Query: 387 G----YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ 442 +T +R AL+ A G + A F+ ++E + LA ++ Sbjct: 411 AAEGLLRTGVRAALFVKATAGDGFQNAPKGGGAIDLARARFWERTEAAFGEALAALSEDL 470 Query: 443 ADE----------VIADLRDKLHQLCE---MLFNQSVAPYAHHPKLISTLALARATLYKH 489 AD R+ L + A + AR+ L+ Sbjct: 471 ADPNADALVVTTAAREAWRESLRRAAIDLFDDLVPLDDLDALDLRAGQARIEARSNLHLA 530 Query: 490 LRELKPQGGP 499 L G Sbjct: 531 LHGYGKSGAS 540 >UniRef50_Q67RP3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67RP3_SYMTH Length = 523 Score = 274 bits (701), Expect = 4e-72, Method: Composition-based stats. Identities = 81/526 (15%), Positives = 146/526 (27%), Gaps = 55/526 (10%) Query: 12 RPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRI 71 +G ++ Q+L + + P + +A LL+ + + ++ Sbjct: 2 IRLSGHPDRLSLRQALAEAHVVREVCDPSPLVVVAIHRLLMALIYRVYRPVTRADWAALW 61 Query: 72 MNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFV 130 A W+D F L H E PF Q + V P+ L+ + N Sbjct: 62 NAGRFDPGPLDGYGAFWMDRFELFHPERPFYQVPFIDGEKVHPISALVLEAASGNNPTLF 121 Query: 131 NQP--GQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVT----TFVRGIDLRST 184 + G AL A L A G G K R P+T +L T Sbjct: 122 DHGRVEGGVALPPDRAACHLLAHQLFALGGGVS-KPFNRMDAPLTKGLVVEALDTNLFRT 180 Query: 185 VLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSI-GFVRGLFWQPAHIELCD 243 +LLN L L ++ P ++ P W + + G + L WQ + LC Sbjct: 181 LLLNTLPLEDWERLIPP--TDDDAPFWEGDDPPEPVREGTPVKGPLHYLTWQSRQLHLCT 238 Query: 244 PIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTS 303 G + C ++++ +G+ P + K+G Sbjct: 239 DEESGLVTGCQI----------RQRYALPKDGVRLDPGKVYQQSPKEG---FVPFKLNKE 285 Query: 304 APSWTQISRVVVDKIIQNENGNRVAAVVNQFR-----NIAPQSPLELIMGGYRNN---QA 355 W ++ + R IA S + L + G + A Sbjct: 286 RAVWQYTHVLLQTSGQDYSRPYLTDWLATMHRFRSRYGIAFPSRVILAVTGLTTDPQKAA 345 Query: 356 SILERRHDVLMFN--------------------QGWQQYGNVINEIVTVGLGYKTALRKA 395 + R + L + + + + + + AL A Sbjct: 346 KVELWRRERLPLPMTILDQPELMAEVEEMLAEARRVEGLLSRTAQALVWASAERKALGDA 405 Query: 396 LYTFAEGF--KNKDFKGAGVSVHETAE-RHFYRQSELLIPDVLANVNFSQADEVIADLRD 452 + G K ++ Q E + ++ A EV + R+ Sbjct: 406 VTYTWTGKLPPGKKLDQVKGLARSLGMVARYWPQLEEPFRRSIEDLAVKSAGEVRSAWRE 465 Query: 453 KLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGG 498 + F H L + + L + G Sbjct: 466 AVMMAARDAFRSGRDGLLHTEASFEVLTCVGSAFHGKLSRIFAAAG 511 >UniRef50_A0LM51 CRISPR-associated protein, Cse1 family n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM51_SYNFM Length = 517 Score = 273 bits (699), Expect = 8e-72, Method: Composition-based stats. Identities = 83/524 (15%), Positives = 153/524 (29%), Gaps = 52/524 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ WI +G + + + L + + + L E A +L+ I Sbjct: 6 FNLIDRPWISCVELSGRRRTLGLHEVLSRAHELRGIELQSPLAETALFRVLLAAVHRIVE 65 Query: 60 PAKDDVEFRHRIMN-PLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTP----- 113 K E+R L W F L E PF QT G+ D Sbjct: 66 GPKGTGEWRALYQATKLPGGRIDAYFEKWSHRFDLFSKEEPFYQTPGLAIRDAKGAEAPA 125 Query: 114 -MEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGFGGGFKSGLRG-- 168 + ++ + N + + L +AL + + L G Sbjct: 126 VIAGIMLERASGNNKTIFDHSMDEDRGCLSPEEAVLALIAAQMYSLRGLNKKTTNLFGYQ 185 Query: 169 --------GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES 220 + ++G L ++ +L L P S ++ P W + Sbjct: 186 ESFSDSVMVGGIFAALQGQSLFESL---LLNLLLYTDNLPIHSSRDDCPVWERHDHGETG 242 Query: 221 IPASSIGFVRGLFWQPAHIELCDPIG-IGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPH 279 + + G++ L + HI L G G C + + Sbjct: 243 V-RTPRGYLDYLTCKCRHILLVPEPGLDGPCIRHVHIAQG---------------EAFQD 286 Query: 280 PHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV-----VNQF 334 +P + K E + + + W + + E R A + Sbjct: 287 VDNPGFIKRKNKEGKWLPVQMQPARLVWRDSISLFSFDTGKREGDRRPDAFRLVGDIALR 346 Query: 335 RNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVI---NEIVTVGLGYKTA 391 R +A S G N+QA+ L R + L +++ E + + T Sbjct: 347 RIVALPSKYRCCTYGLANDQANPLAWRKETLNIPTALLSDPDLVACLREAMDLSEKAHTI 406 Query: 392 LRKALYTFAEGFKNKDFKGAGVSVHETAERH-FYRQSELLIPDVLANVNFSQADEVIADL 450 LR A+ T+ + + ++ + ++ T F+ + E L + D+ + Sbjct: 407 LRNAIRTYMDKYLPRNSRDVTEKLNATGASRLFWDRLESHFNAFLLEIENQ--DKALVAW 464 Query: 451 RDKLHQLCEMLFNQ-SVAPYAHHPKLISTLALARATLYKHLREL 493 + + F YA K A L L L Sbjct: 465 ERNIERAALEAFEACLKQRYADSAKKFRAWTEAHGQLVARLATL 508 >UniRef50_A7BA67 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA67_9ACTO Length = 556 Score = 273 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 87/549 (15%), Positives = 166/549 (30%), Gaps = 64/549 (11%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NLL + WIPVR +G + L+ L + D L+ +A L++ I +A Sbjct: 7 NLLDEPWIPVRLVDGTITDVGLLELLRRTTDIADLACELPTQSIAIQRLILAIMYRVATP 66 Query: 62 KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPMEKLLA 119 +D ++ + ++ + + W D FYL PFMQ ++ V+ +EKL+A Sbjct: 67 RDTRDWVRQWDEGAPTEQMIEYLERWRDRFYLFGGRFPFMQVANLRTAKDAVSGLEKLIA 126 Query: 120 GVSGATNCAFVNQPGQGEALCGG-CTAIALFNQANQAPGF---GGGFKSGLRGG-----T 170 V F + G+ A A L + P G S ++GG Sbjct: 127 DVPNGE-QFFTTRHGRALACIPASEAARWLVHAQAYDPSGIRSGAVGDSQVKGGKGYPIG 185 Query: 171 PV------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK-------- 216 P +++G DL T++LN++ + + + S +W P Sbjct: 186 PAWCGHLGLVWLKGKDLDETLVLNLIPATTAELRGVDSSTDWGACSWEDPEPETSVRGDY 245 Query: 217 --SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVN 274 + + + R L W I L G +E + Sbjct: 246 SLLDPAGTPKELSIPRLLTWHSRRIRLVGDSS----GVTGVVLAQGDKLAPQEMRLYEPQ 301 Query: 275 GLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRV------- 327 LW + +P + K F W + + + Sbjct: 302 SLWRYS-TP--QSKKFKTDVYMPRKFEAGRALWRNLPGTLPTVTTVQGVDKQPKREFLPS 358 Query: 328 AAVVNQFR------NIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQG-----WQQY 374 A + ++ + + + G Y +A+ + D L + + Sbjct: 359 ATLSFHYQLDNSSIQTSYPKVMRIQAVGVTYGPQEATFEDIYSDELTLSVAVMRVEREDL 418 Query: 375 GNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDV 434 I+ V + + A + + A+ F+ + Sbjct: 419 SAQIDREVRLTEEVARDVGTLAANLARAAGESGDGAGDGA-RDRAKELFFSAVDNDFRAW 477 Query: 435 LANVNFSQ-ADEVIADLRDKLHQLCEMLFNQSVAPYAHHP-------KLISTLALARATL 486 L V+ + A +V L Q + + V + + + +A Sbjct: 478 LTQVDGHESARDVGCRWECTLRQHALGIQTELVRSASSSAIVGRDTGRGYMNVGIAENYF 537 Query: 487 YKHLRELKP 495 L + P Sbjct: 538 RSALNKRLP 546 >UniRef50_Q0BSC4 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC4_GRABC Length = 505 Score = 267 bits (682), Expect = 8e-70, Method: Composition-based stats. Identities = 92/509 (18%), Positives = 162/509 (31%), Gaps = 58/509 (11%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 NL+ D WIPV +G + I Q D + PR D+ +A L LL+ + + P Sbjct: 24 NLIDDQWIPVLCADGSRRVIAPWQ--MAEPDVVQPDWPRPDLNIACLELLIGLVFLADPP 81 Query: 62 KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGV 121 D ++ R Q+ +AP+ F L F+Q + ++ L Sbjct: 82 VDGEDWEARRD--PDPQRLQEKLAPYAPAFNLVGDGPRFLQDLEPFTGKASSVDMLFIDS 139 Query: 122 SG----ATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVR 177 + N + + + L A+AL+ + AP G G + +RGG P+ T V Sbjct: 140 AAVETARKNADVMVHRSRYDRLDFPIAAMALYTFQSYAPAGGAGNFTSMRGGGPMVTLVD 199 Query: 178 GID-LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPAS--SIGFV---RG 231 L V +NV + + W++P + + + + G + Sbjct: 200 PERMLWDLVWVNVSCGHSAKME---------TLPWMRPTRVSHTGQQTLPPDGELFGAEA 250 Query: 232 LFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKG 291 F P + L + TG +++ LW HP SP Sbjct: 251 FFGMPRRLRL-------------THNEGAVTGVIQKPGGTDY-ALWKHPLSPYYRKKSGE 296 Query: 292 EVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYR 351 E +L A + + + V E G+ ++ + R L+ G Sbjct: 297 E----WLPKHPRAGHFGYRNWLGVVV---KEKGSDLSELALCLREDRIGGGSILVAGWSM 349 Query: 352 NNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGA 411 +N + + I +++ ALR AL G + ++ + Sbjct: 350 DNMKPRDFILSRQRRLSAIPAEAEYRIVDLIQAADAVAVALRNALTPVLAGGEAREAERE 409 Query: 412 GVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAP--Y 469 RQ+E + + + L + F+ P Sbjct: 410 EFF----------RQTETKFLTHVQAIERGEDP--AEAWLADLRRQALGQFDAKALPGLN 457 Query: 470 AHHPKLISTLALARATLYKHLRELKPQGG 498 K I + R L L +GG Sbjct: 458 QRDVKAIGRITEGRRYLGLVLAGYGKEGG 486 >UniRef50_Q2JWC2 CRISPR-associated protein, Cse1 family n=2 Tax=Chroococcales RepID=Q2JWC2_SYNJA Length = 524 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 75/530 (14%), Positives = 154/530 (29%), Gaps = 69/530 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL + WIPV + ++ ++ + LA L+ I Sbjct: 6 FNLTKEKWIPVLDPDFRIQELSLVELFREWESLKEMRGDNPPTTLALYRFLLAIMHRAYL 65 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLA 119 KD ++ + + + D F L H HPFMQ + P+ + Sbjct: 66 GPKDTDHWKEIFQD--NGKRVIKYLQDRQDCFDLFHPTHPFMQDPALPIEKAVPVHSI-- 121 Query: 120 GVSGATNCAFVNQPGQ-GEALCGGCTAIALFNQA--------NQAPGFGGGFKSGLRGGT 170 + +T+ F ++ G ++ A L G G S + T Sbjct: 122 -HTMSTSEVFFHEHEWSGYSISLPEAARLLVRLQGVDITSLRAFYVGQDSGNHSAVNTPT 180 Query: 171 P--VTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK-SNESIPASSIG 227 ++G L+ T++LN++ + + P+ E+ PTW + + + G Sbjct: 181 MNVANVLLKGRTLKETLMLNLM-RYSPEDEMPSVVAGEDVPTWETKVGYTGQPKKEIPAG 239 Query: 228 FVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVT 287 ++ L + + L G W C V Sbjct: 240 YIHYLTFPWRRLRLFSEAG---------RVQQLAITMGNSLPNGVEARQWE-----CSVA 285 Query: 288 VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLE--- 344 K E+K + + W ++ Q +V+ + + + Sbjct: 286 YK----EDKPVRLSLHRQLWRDADSFLLTASKQTR-----PRIVDWLAELKSEELVNNLV 336 Query: 345 -LIMGGYRNNQASILERRHDVLMFN-------QGWQQYGNVINEIVTVGLGYKTALRKAL 396 + G +QA L + Q + I ++++ Sbjct: 337 VFEVLGMSADQAKPLGWSSARFSVPMQFVTDSELAQSLKSAIGIAENHQQIFRSSKGSPY 396 Query: 397 YTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQAD----------EV 446 ++ AE KN + + ++ E ++ + +L ++ Sbjct: 397 FSLAEVLKNGETEKLSKAL--DGESRYWAILDHAFSMLLHDLPQDNQPGADGIIYYGLTT 454 Query: 447 IADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 + + F +S+ + A A TL + L EL+ Sbjct: 455 LPAWTKTVQDAARRAFTESIESI----RNYQARAAALRTLERKLAELRAD 500 >UniRef50_C4FG91 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FG91_9BIFI Length = 573 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 79/559 (14%), Positives = 164/559 (29%), Gaps = 72/559 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL + W+ V R+G +I Q + D LS +L + L + I Sbjct: 6 FSLLDEPWVQVVYRDGHPGEISLRQIFSDAPDIKELSGDIPQQKLPLIRLFLAILYRAYR 65 Query: 61 A--KDDVEFRHRIMNPLTE-----DEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---AND 110 ++ + R + D + + W D F+L PF Q ++ A Sbjct: 66 VVGVNEEQMRELWKEIFSSKHFDMDIVSRYLDKWEDRFFLIGE-RPFFQIPDLEYVGAKP 124 Query: 111 VTPMEKLLAGVSGATNCAFVNQP-GQGEALCGGCTAIALFNQANQAPG------------ 157 +P+ +++A V F + + +++ + L Sbjct: 125 YSPVSEMIADVPKPDKYLFSMRSMEETDSISFAEASRWLVFMQAYDIAGIKTPVKGNTYV 184 Query: 158 -FGGGFKSGLRGGTPVTT----FVRGIDLRSTVLLNVLTLP--RLQKQFPNESHTENQPT 210 G + + + G +L T++LN + +++ + + P Sbjct: 185 KGGKVYSPKGMSTGWLGAIGGLYAEGRNLFETLMLNWVLYDTKYDSERYRLFGNERDVPV 244 Query: 211 WIKPIKSNESIPASSI--GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEK 268 W + + + S G V+ + WQ + L + + + + Sbjct: 245 WEQNNIPSPDLDNQSTFAGPVQAMTWQSRRLRLVPNEDVTRIVGVVYCYGDVVSPDDTD- 303 Query: 269 FTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVA 328 F W + KKG K + S W + ++ K + Sbjct: 304 -GFEKMTAWR-----SIPQQKKGLPTHKPVMHDASKALWRGLEPILCVKDDDDCR----P 353 Query: 329 AVVNQFRNIAPQ---------SPLELIMGG--YRNNQASILERRHDVLMFNQ-GWQQYGN 376 ++ I + + + + G Y + + D L N ++ + Sbjct: 354 GLIRWLEEIRTEIFDSEDHVLNLVTIHAQGMVYGSQSSVFETGIDDTLSLNTIMFRHDYD 413 Query: 377 VINEIVTVGLGYKTALRKAL----YTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIP 432 I ++ V A++ + E Y + + L Sbjct: 414 GIAAVIDVAKSADNAVQALTQFIRNLQMSAGDKGKSAKVENRIEERIRESAYTELDRLCR 473 Query: 433 DVLANVNFSQADEVI-ADLRDKLHQ----LCEMLFNQSVAP--YAHHPKLIST-----LA 480 D LA + S+ D +DK+H+ + +QS P H + Sbjct: 474 DELAAFDKSKDFIKYSNDWKDKIHRRLLEMERDYLDQSSVPVFDEHEFDNKRKSHTSNMM 533 Query: 481 LARATLYKHLRELKPQGGP 499 A +L + G Sbjct: 534 KATRAQLSFQSKLNRELGS 552 >UniRef50_B6XT61 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT61_9BIFI Length = 550 Score = 258 bits (658), Expect = 5e-67, Method: Composition-based stats. Identities = 86/550 (15%), Positives = 162/550 (29%), Gaps = 69/550 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL D WIPV +G ++ + + + +A L L + I Sbjct: 5 FSLLDDGWIPVSYVDGHPDEVSLRRLFEDAWKIKEIRGDIPQQAIAILRLALGILYRAYY 64 Query: 61 AKDDVE------FRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---AND 110 ++ E + D + W D F+L PF Q G++ Sbjct: 65 VENPSEEQMRDMWDDIFRIGHFDLDILEDYFDEWGDRFFLFGD-RPFFQVSGLEYVGQKP 123 Query: 111 VTPMEKLLAGVSGATNCAFVNQP-GQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGG 169 P+ +++A + F + G + L +A L + G K+ ++G Sbjct: 124 YDPVSEMIADMPKPEKYLFAMRGLGTTDTLSLPESARWLVYLQSFDT---AGIKTPVKGN 180 Query: 170 T--------PVTTF-------------VRGIDLRSTVLLNVLTLP--RLQKQFPNESHTE 206 T P+ F G +L T++LN + + + +T Sbjct: 181 THINKGKIYPLKGFLGTGWLGGVGGVYAEGANLFETLMLNWVLYDDRYDSEYYRLFGNTN 240 Query: 207 NQPTWIKPIKSNESIPASS--IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGF 264 + P W K + + + G V+ + WQ I L + T + Sbjct: 241 DIPVWEKNEVPSADMDDQNSFAGPVQAMTWQSRRIRLVPNEDCTRVIGVVNCYGDAVTQY 300 Query: 265 LKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENG 324 + F W P + + S W + ++ + Sbjct: 301 NTD--GFEKMTAWRRSI-PQQKKLGLPVPPHMPVTHDASKALWRGLEPILCVGDDGDFR- 356 Query: 325 NRVAAVVNQFRNIAPQSP---------LELIMGG--YRNNQASILERRHDVLMFN-QGWQ 372 ++ I + + + G Y + D L + ++ Sbjct: 357 ---PGIIRWLEEIRTEVLDSEEHVLNMVTIHAQGMTYGTQSSVFETGIDDKLSLSMVMFR 413 Query: 373 QYGNVINEIVTVGLGYKTAL-RKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLI 431 I +V V A+ ++ D G + + Y +LL Sbjct: 414 HDYAGIAAVVDVVKSTDKAVTALTMFVRNLRSAAGDHSGKTQEIADQIRESAYADLDLLF 473 Query: 432 PDVLANVNFSQADEVI-ADLRDKLHQL----CEMLFNQSVAPYAHHPKLIS----TLALA 482 D LAN + SQ D++H+L +QS P + + ALA Sbjct: 474 RDRLANFDESQDPVTYSNAWLDEVHRLLLTMGRDYLSQSPVPVFEEHESGRFGVMSAALA 533 Query: 483 RATLYKHLRE 492 + L + Sbjct: 534 QLLFRGSLNK 543 >UniRef50_Q53VY1 Putative uncharacterized protein TTHB188 n=1 Tax=Thermus thermophilus HB8 RepID=Q53VY1_THET8 Length = 502 Score = 256 bits (653), Expect = 2e-66, Method: Composition-based stats. Identities = 82/529 (15%), Positives = 151/529 (28%), Gaps = 69/529 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ + WIPV + G V++ ++L + + R+ P E LL+ + Sbjct: 7 FNLIDEPWIPVL-KGGRVVEVGIGEALLRAHEFARIETPSPLEEAVLHRLLLAVLHRALS 65 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLL 118 + + +D + + + D F+L H E PF+Q + + P KLL Sbjct: 66 GPRCPEDVLDWWRKGGFPQDPIRDYLNRFRDRFFLFHPEAPFLQVADLPEENPLPWSKLL 125 Query: 119 AGVSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGF-----GGGFKSGLRGGTP 171 ++ N + A AL APG G G P Sbjct: 126 PELASGNNPTLFDHTTEENLPKATYAQAARALLVHQAFAPGGLLRRYGVGSAKDAPVARP 185 Query: 172 VTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPI-KSNESIPASSIGFV- 229 G L L + ++ P W P + + A + + Sbjct: 186 ALFLPTGQ----------NLLETLLLNLVPYTPEDDAPIWEVPPLRLGDLEGARTKWPLT 235 Query: 230 ---RGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 R W + L D + P + Sbjct: 236 GRTRVYTWPARGVRLLDEGD-------------GVRFMGYGPGVEPLEATHRDPMVAQRL 282 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP------- 339 K L + W S ++ G +VAA + N+ Sbjct: 283 DAKGN---LLVLRLSEERSFWRDFSAMLP------RQGGKVAATLEHAENLQGELEDEGL 333 Query: 340 QSPLELIMGGYRNNQASILERRHDVLMFNQGWQ--QYGNVINEIVTVGLGYKTALRK-AL 396 + + L + G ++QA +L+ R +V G + + + + + L+ A Sbjct: 334 EGRITLRVLGQVSDQAKVLDIRREVYPLPSGLLTPKAEENLEKALKMAEELGQGLKHLAQ 393 Query: 397 YTFAEGFKNKDFKGAGVSVHET---------AERHFYRQSELLIPDVLANVNFSQADEVI 447 +D E ER ++ + P A V + ++ Sbjct: 394 EVAKAVVGERDRGHGRSPYLEELTKLANSLPLERLYWHALDGAFPRFFARVEEEASLDL- 452 Query: 448 ADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 R+ L + + + + LA + L EL + Sbjct: 453 --WREALRGAALEAWKATRRFLGTGARHLKALAQGEQEFGRLLGELGEE 499 >UniRef50_D1YEE1 CRISPR system CASCADE complex protein CasA n=1 Tax=Propionibacterium acnes J139 RepID=D1YEE1_PROAC Length = 552 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 73/519 (14%), Positives = 142/519 (27%), Gaps = 68/519 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WI VR + G ++ ++ + + + L+ E A L LL+ I Sbjct: 5 FNLMDEPWISVRTPDNGVTEVSIREAFHRATEFRGLAGEIPTQEAAVLRLLLAIAIQATA 64 Query: 61 A-----KDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTP 113 + ++ L DE W++ F L PFMQ + + Sbjct: 65 RFRSDDEKIDDWGQWWEEGLPLDEIDSYSDRWLNRFNLFDDSAPFMQVTDLHTSNGGYSG 124 Query: 114 MEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPG-------------F 158 + K+++ V N F L A L + Sbjct: 125 LTKIISEVPP--NDKFFTTRDGAGTTSLSFAEAARWLVHTHAFDVSGIKSGAVGDPRVKG 182 Query: 159 GGGFKSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK- 216 G G+ G P+ V G L T+LLN+ Q ++ P W +P + Sbjct: 183 GRGYPIGTGISGPMGIVIVEGKSLAETILLNLFLQDDPQ---------QDVPVWERPPQT 233 Query: 217 -SNESIPASSIGFVRGLFWQPAHIELCDP-IGIGKCSCC-GQESNLRYTGFLKEKFTFTV 273 + + G WQ + L + C G + +Y + Sbjct: 234 ATPDREHPVPTGCADLFTWQSRRVRLIADGDRVVDVLLCNGDKVEWKYLLHNDSTTAWRY 293 Query: 274 NGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVA----- 328 + T GE + ++ W + ++V + ++ + A Sbjct: 294 SAP---------QTKAAGETVYMPRSHDSTKAMWRGLEPLLVREPAADDRRRKKAGEPDE 344 Query: 329 -----------AVVNQFRNIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYG 375 A + + + G Y + + + D + Sbjct: 345 LWLRPEIFEQLAAFSSDGALPRDHVTRIRTIGMEYGSQSSVVTTTIDDAMPAAMAVIADE 404 Query: 376 NVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVL 435 + V + + + + G V A Y + D Sbjct: 405 QLGRLAVESAQTADSLIFALQNLATDLAAATGSESEG--VRPRASEMGYAALDPTYRDWF 462 Query: 436 ANVNFSQADEVI-ADLRDKLHQLCEMLFNQSVAPYAHHP 473 + ++ E R + Q+ + Q Sbjct: 463 SRLSSRTDVETATLSWRRRARQVVREIGEQLCRDAGPTA 501 >UniRef50_Q1J370 CRISPR-associated protein Cse1 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J370_DEIGD Length = 573 Score = 255 bits (651), Expect = 3e-66, Method: Composition-based stats. Identities = 78/578 (13%), Positives = 160/578 (27%), Gaps = 90/578 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 LL WIPV G + + SL + + R+ A L + + Sbjct: 4 FPLLDREWIPVIAGVGERRHVSLRDSLLRAAEFRRIDAGHPLQTAALYRLHLAVLHRALK 63 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDV-----TPM 114 +D + + D+ + + D F+L + PF+Q K + V + Sbjct: 64 GPRDAEQGADWYLAGHFPDDVAHYLDRYADRFHLFGPQ-PFLQVKDLDPALVGENFRSHW 122 Query: 115 EKLLAGVSGATNCAFVNQ-----PGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGG 169 +L A A N + +AL A+ L N A G + Sbjct: 123 TRLSAEEGSPNTTALYNVEARPGGDRSDALTPAQAALRLLEHQNFALGGLIKRFTTSARA 182 Query: 170 TPVTT----FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS-------- 217 PV T G +L T+ LN++ +P H + P W + + Sbjct: 183 APVATAGLFLAEGANLHQTLCLNLVP-------YPQAMHGPDLPPWEEAPLTVAQIRACY 235 Query: 218 NESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 + P + G+ W + L + +G Sbjct: 236 DPEQPRVAAGYASRYTWPSRSVLLLPEETPQGVVVRWVGFGAGVPLAGPGEG----SGTG 291 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNR---------VA 328 P + + W ++ ++ D Q + + Sbjct: 292 TDPMVSLRPSRDPKNEQPFPYKLRRERLLWRDLNALLPDPAAQVDENRQGKVKVRPGTPP 351 Query: 329 AVVNQFRNIAPQSP----------------------------------LELIMGGYRNNQ 354 V+Q R + + +++ G +Q Sbjct: 352 KTVSQARAVMRAVAERQRRVQPPVPFQDAPEDAWAEEGTPDARAAHPVIPVVVFGQLTDQ 411 Query: 355 ASILERRHDVLMFNQGW--------QQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNK 406 R + + + + + TVG G + ++ + + + Sbjct: 412 GKAFAMRQETYTLPEAFIENPERFRDHVQAALTDASTVGEGLRRSVHLLAHALLKKDAER 471 Query: 407 DF---KGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFN 463 D ++ AE ++ + L ++ + D L + + ++ Sbjct: 472 DPHKDDVGKLANQIPAEPTYWAGLDTPFRAYLLALDADPQAALR-DWHAALRRAALVGWH 530 Query: 464 QSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSN 501 + + + + A+ L K L LKP+ P + Sbjct: 531 TAEEAAGMNAAGLRAVEKAQGPLLKALNTLKPKETPHD 568 >UniRef50_C1XFZ8 CRISPR-associated protein, Cse1 family n=2 Tax=Meiothermus RepID=C1XFZ8_MEIRU Length = 494 Score = 253 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 76/497 (15%), Positives = 141/497 (28%), Gaps = 43/497 (8%) Query: 25 QSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA-PAKDDVEFRHRIMNPLTEDEFQQL 83 Q+L + R+ P + A LL+ + D E + Q Sbjct: 5 QALLEAHKFERIEDPSPLVTAALHRLLLAVLHRALEGPADAYEAAEWFEEGFDRGKIQTY 64 Query: 84 IAPWIDMFYLNHAEHPFMQTKGV-KANDVTPMEKLLAGVSGATNCAFVNQ--PGQGEALC 140 ++ + D F L H E PF Q L ++ N + + L Sbjct: 65 LSKYRDRFDLFHPERPFYQVPDFSLERSCRSWTVLAPELNSDNNKVLFDHTVTSRPRPLL 124 Query: 141 GGCTAIALFNQANQAPGFGGG---FKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQK 197 A L A G + T V G +L T+ LN++ P+ + Sbjct: 125 PAEAARLLVANQTFALSAGKSVLCHTATAPVATAALALVLGDNLHQTLCLNLVAYPKREH 184 Query: 198 QFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQES 257 + + + E +P + + E AS+ G V W + L G Sbjct: 185 EH-DFATWEQEPLKVADLADCERARASAKGIVHRYTWLARAVRLHPEEEDG--------- 234 Query: 258 NLRYTGFLKEKFTFTVNGL--WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVV 315 ++ P K E + L F+ W + ++ Sbjct: 235 -QTIVRWIAYASGVRYEESQVRRDPMVAFRPDPKDPTRE-RPLGFSEGRALWRDFAALLP 292 Query: 316 DKIIQNENGNRVAA----VVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGW 371 + G VA V Q L +++ G ++QA + R ++ + Sbjct: 293 KP--GSAQGLAVADHARNVYRALGRHFRQRGLPVMVAGQASDQAKVELWRGEIYRLPEAI 350 Query: 372 ----------QQYGNVINEIVTVGLGYKTALRKALYTFAEGFK-NKDFKGAGVSVHETAE 420 +Q N + V G AL L + + D S Sbjct: 351 LGETDLRAFVEQCLNEAEVMGEVLNGAARALAAGLLSMGDRKPHKDDVSKLARSFPHQV- 409 Query: 421 RHFYRQSELLIPDVLANVNFS---QADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLIS 477 ++ E D ++ + Q + R+ L + + + + + Sbjct: 410 -AYWSALEGHFADWISQLGPDFEKQQARLERAWREILQREALQAWRLAALAAGDDARALR 468 Query: 478 TLALARATLYKHLRELK 494 + L H+ + K Sbjct: 469 AVHKGEGILLAHIYKQK 485 >UniRef50_D1NTH8 CRISPR-associated protein, Cse1 family n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTH8_9BIFI Length = 566 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. Identities = 77/550 (14%), Positives = 153/550 (27%), Gaps = 74/550 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ D WIPV + + + +S + + + L I Sbjct: 8 FNLVNDPWIPVVYDDATRAVVSLRESFEQASHIVAIVTDNPLQKAVLYRLFEAIWMRAYE 67 Query: 60 -------PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---A 108 P++ ++ + + + F L ++ PF Q ++ Sbjct: 68 MEQIDVAPSECYALWQEFWDLGEFDLEIINAYLNKYEAKFELFDSKTPFYQVPDLEYVGK 127 Query: 109 NDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF---------- 158 +E ++ V T + P E L A L Sbjct: 128 KAYDGVETMILDVPKGTGLFSLRNPETLEGLDFAEAARQLLTIMAYDTAGIKSPVEGFSA 187 Query: 159 ---GGGFKS-GLRGGTPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPT 210 G F G+ + + + G +L T++LN + L + T ++ Sbjct: 188 INKGKAFAPQGVPSVGWLGNIGSVWAEGSNLFETIMLNWVISNPLTSEL--SESTYDRAP 245 Query: 211 WIKPIKSNE--SIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFL--- 265 W + G V L Q I L C+ G + + Sbjct: 246 WELDTPPEHDLVVRDGFRGMVDALTVQSRRIRLV-------CNEAGTQVIGLVICYGDII 298 Query: 266 --KEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK---FLAFTTSAPSWTQISRVVVDKIIQ 320 ++ W + KKGE F W + ++V Sbjct: 299 RPAYTQIAEMHTSWR------VSKPKKGEGNAPVVMPRTFEAGKALWRSLGPLLVADSEN 352 Query: 321 NENGNRVAAVVNQFRNIAPQS------PLELIMGG--YRNNQASILERRHDVLMFNQ-GW 371 + + + F I + +I G Y + D L + Sbjct: 353 SARPGVLRWLDRLFDEIPELREKHLLQTIGIIAQGMTYGTQSSVFEASYDDSLELSSEML 412 Query: 372 QQYGNVINEIVTVGLGYKTALRK----ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQS 427 + G++I ++ V + +++ A + D K Y Sbjct: 413 RSGGDIIGRVLDVVAATEQSVKDLGTFAFRLEVASGADSDSKNRRTDTRMQIAEEAYAAL 472 Query: 428 ELLIPDVLAN-VNFSQADEVIADLRDKLH----QLCEMLFNQS-VAPYAHHPKLISTLAL 481 + + + LA + A +D++H +L + +QS ++ H + + Sbjct: 473 DGVFRERLAQYRSDDDALAYCKSWKDEIHRLLLRLAQDYLDQSPTQSFSFHEENGRRVDA 532 Query: 482 ARATLYKHLR 491 +A L R Sbjct: 533 GQAMLTLKYR 542 >UniRef50_B1VIY3 CRISPR-associated protein n=1 Tax=Corynebacterium urealyticum DSM 7109 RepID=B1VIY3_CORU7 Length = 560 Score = 249 bits (635), Expect = 2e-64, Method: Composition-based stats. Identities = 69/555 (12%), Positives = 144/555 (25%), Gaps = 81/555 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 NL+ + WI R G ++ + Q S + + A L +L+ I Sbjct: 7 FNLVHEPWIKCRTAEGNQL-LSIRQVFDGSAKPLAVVGDSPTQDYAVLRVLLAIFWRAHY 65 Query: 60 --------PAKDDV--EFRHRIMNPLTE-------DEFQQLIAPWIDMFYLNHAEHPFMQ 102 + E+ ++ + +A + F L PFMQ Sbjct: 66 HDFVRRYPSPRSRKKFEWETWFLDTRETLRETGKDEVVLGYLADVENRFDLLDPTVPFMQ 125 Query: 103 TKGVKANDVTP--MEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGF 158 + T + ++L ++ + A L + Sbjct: 126 VADLHTAKNTSNEIRRILPDSED----SYFTMRTGPGRVSISYDEAARWLIHAQAYDYSG 181 Query: 159 ---GGGFKSGLRGGT--PV---------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESH 204 G ++GG P+ T +RG +L T++LN + + Sbjct: 182 IKSGAVGDPRVKGGRGYPIGQGWSGLTGGTVIRGANLLETLVLNTT------ESCIPTAA 235 Query: 205 TENQPTWIKPIKSN---ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRY 261 ++P W + + + G WQ I L G + + Sbjct: 236 ETDKPVWEREPDTAAPQDLEATQPKGPADLATWQSRRIRLFTED--------GVVTRVLV 287 Query: 262 TGF-LKEKFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQISRVVVDK 317 + V G P +P + KKG + +W + +V Sbjct: 288 SNGDRIPNAGLNVFG---DPMTPYRFSKNKSKKGFEAYYPRPYDEQRTTWRSLDALVAVD 344 Query: 318 IIQNENGNRVAA-----VVNQFRNIAPQSPLEL--IMGGYRNNQASILERRHDVLMFNQG 370 + + V N R + + L+L + Y ++ + Sbjct: 345 GDPGFSSRELPPKRPENVDNVARILEDREVLDLQIVSMAYGPQSSTYGTIVSSSIGLPVH 404 Query: 371 WQQYGNVINEIVTVGLGYKTALRKALYTFAE-GFKNKDFKGAGVSVHETAERHFYRQSEL 429 + + A +A + G A Y Q E Sbjct: 405 LLRNSEWSRAVRNDVRNSAEATGRAATAVGAFAGQLYVAAGGEYEFGVDAADRLYAQLEP 464 Query: 430 LIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYA--------HHPKLISTLAL 481 + L ++ + ++ + + + + + + Sbjct: 465 RFHNWLRGLDPKNMAQEVSSWQHTVREAALGIAQDLLNGAGQKALIGRLLDEDSGGRVIN 524 Query: 482 ARATLYKHLRELKPQ 496 A + R+L + Sbjct: 525 AGTAFQQLKRKLNKE 539 >UniRef50_C7MTN0 CRISPR-associated protein, Cse1 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTN0_SACVD Length = 567 Score = 248 bits (632), Expect = 4e-64, Method: Composition-based stats. Identities = 86/558 (15%), Positives = 155/558 (27%), Gaps = 80/558 (14%) Query: 2 NLLIDNWIPVR-PRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALA--LLVCIGQII 58 NLL W+P+R +G + ++L S + L + A L LL + + Sbjct: 9 NLLTQPWLPIRHRHSGALEAVGIAEALLRSHELADLVVDVPTQVPALLRQVLLPVMVDAL 68 Query: 59 APAKDDVEFRHRIMNPLTED----EFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVT 112 P + R D + + D F L H PF Q G++ + Sbjct: 69 GPPTTREGWSKRFAAGRFTDEERDRLSEYFDQYRDRFALFHDTRPFAQVAGLRTPKGETK 128 Query: 113 PMEKLLAGVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGF---GGGFKSGLR 167 L+A + N G+ L G A L + G ++ Sbjct: 129 GTAVLVATAASGNNVPLFTSRTDGDPFPLTPGEAARWLLHTQCWDTAAIKSGAEGDPKVK 188 Query: 168 GG-------TPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW-IKPI 215 G P+ G L T+LLN P P W P Sbjct: 189 AGKTTGNPTGPLGQLGVVVPVGRSLYETLLLNTPVHPE---------DMWGVPQWKRDPP 239 Query: 216 KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNG 275 E + G + WQ + L C L +V Sbjct: 240 FGPEWDTYAPQGLLELWTWQSRRVRLSPEQTGDGQRVC--------RVVLTAGDRISVLP 291 Query: 276 LWPHPHSPCLVT--VKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQ 333 W PH+ K G + W + ++ + + + + ++ Q Sbjct: 292 EW-EPHTTWTSAPNPKAGAPARRPRRHAPGKAIWQGMEALLAVER-EEKGKFHTSDLLRQ 349 Query: 334 FRN------IAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVG 385 + IA PL + G Y N A + + HD + + + ++V Sbjct: 350 INSARVDGVIADDYPLRVQTYGLLYGNQSAVVEDILHDAMPLPVAALRAEGEVYDLVAEA 409 Query: 386 LGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERH---FYRQSELLIPDVLANVNFSQ 442 L +A+ + + + GA + R + L+ +LA ++ + Sbjct: 410 TEQAEELAQAVNSLSADLRRAQ--GADPIPWDKGHRPGELVLHALDPLVRRLLAGISATG 467 Query: 443 ADEVI-----ADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA---------------LA 482 D + K + + + + + +A A Sbjct: 468 TDFDVLSRGQRAWEQKAWAETMRIAERVLGTASAGAFIGREVADKGKKDGKKVLYSLGTA 527 Query: 483 RATLYKHLRELKPQGGPS 500 L + P+ Sbjct: 528 ERDFRARLARILPRAAEH 545 >UniRef50_C0VRW0 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51867 RepID=C0VRW0_9CORY Length = 553 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 80/518 (15%), Positives = 151/518 (29%), Gaps = 74/518 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL + WI +G V++ L R+ + A + LL+ I Sbjct: 8 FSLLDEPWILCESLDGTPVELGLLDVFDGKHPIKRVRGDAPTQDSAIVGLLLPIYWRAHT 67 Query: 61 -----------AKDD--VEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 E + + + ++ + + D F+L PFMQT ++ Sbjct: 68 GDLTVFNGDNLPFSTWFAEHLEQARSGVADEAVLNYLETYRDRFFLVGGPAPFMQTPTLE 127 Query: 108 AN--DVTPMEKLLAGVSGATNCAFVNQPGQG--EALCGGCTAIALFNQANQAPG------ 157 + P+ +L+ + + + E + G A + Sbjct: 128 TKNMEFLPLSRLIPEAESE----YFSMREEDAAETVPLGEAARWIVTTQAYDYSGIKPGA 183 Query: 158 -------FGGGFKSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQP 209 G G+ G+ T + G T++ N S E++P Sbjct: 184 IGDDRVKGGRGYPIGVGWSGMTGRTLIVGNTFAETLVYNTTAD--------CISSPEDKP 235 Query: 210 TWIKPIKSNESIP-ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLK-E 267 W +P+ + + G L WQ I + + + + T K Sbjct: 236 CWERPVDTAAVREFPAPKGAADLLTWQTRRIRVRYEGD--------RATGVIVTNGDKIP 287 Query: 268 KFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQN--- 321 V G P +P + KKG F T+ W + +V Sbjct: 288 DAGANVFG---DPLTPYRYSKNKSKKGHTVYYPQCFDTNRTMWRSLVPLVALDSDPQFTE 344 Query: 322 -----ENGNRVAAVVNQFRNIAPQS--PLELIMGGYRNNQASILERRHDVLMFNQGWQQY 374 + + ++ F ++ + PLELI Y N ++ H L Q Sbjct: 345 KDRAPKRPRNLDSLSRVFNDLGIEETIPLELISASYGPNDSTPSTTVHAGLNLPSPILQP 404 Query: 375 G--NVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIP 432 + ++IV+ TA + + G + E Sbjct: 405 ENVELRDQIVSQATATSTAAVALGSFAGQLLQAA---GGDYEFQPAPTDGALAELEHRFN 461 Query: 433 DVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYA 470 L+ V+ Q DE I +D +++ + Sbjct: 462 AWLSTVDEDQLDEQITQWQDIVYETIIDKAETLLRGAG 499 >UniRef50_D1A5T7 CRISPR-associated protein, Cse1 family n=4 Tax=Actinomycetales RepID=D1A5T7_THECD Length = 561 Score = 244 bits (623), Expect = 5e-63, Method: Composition-based stats. Identities = 81/522 (15%), Positives = 148/522 (28%), Gaps = 60/522 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 +L W+PV NGG+ + + + RL E A L LL+ I Sbjct: 7 FDLTRRPWLPVLYDNGGEGLLSLTEVFQQAHRLRRLVGDVPTQEFALLRLLLAILHDAIE 66 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN--DVTPMEKL 117 D E+ L D + D F L H + PF+QT ++ +V ++++ Sbjct: 67 GPDDIDEWTELWEEGLPTDRITAYLERHRDRFDLLHPQAPFLQTAELRTAGDEVFSLDRI 126 Query: 118 LAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF-------------GGGFKS 164 +A V T F + E L A L + G + Sbjct: 127 VADVPNGT-LFFTMRAHGVERLDFAEAARWLVHAHAFDTSGIKSGAVGDPRVKKGKVYPQ 185 Query: 165 GLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP- 222 G+ + FV G DLR T+LLN++ + + P W + + + + Sbjct: 186 GVGWAGNLGGVFVEGDDLRETLLLNLIAFDTDNLRI---DPARDLPAWRQEPRGPQQLDE 242 Query: 223 ----ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 G WQ I L + + Sbjct: 243 IELSRRPAGLRDLYTWQSRRIRLHFDADGVY---------GVVLAYGDSLSPHNKHVH-- 291 Query: 279 HPHSPCLVTVKKGE-----VEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVN- 332 P + + + + S +W + +V + E AA+V Sbjct: 292 EPMTAWRRSPAQEKKLRLAQVYLPREHDPSRSAWRGLGALVAGRAEGTEQREEAAAIVRP 351 Query: 333 ----QFRNIAPQSPL--------ELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINE 380 + + PL L+ Y Q+ I + D + E Sbjct: 352 RILDWVARLTVEGPLPDDFLIRARLVGAVYGTQQSVINDMVDDAVTMPIVLLH--ERNKE 409 Query: 381 IVTVGLGYKTALRKALYTFAEGFKN-KDFKGAGVSVHETAERHF-YRQSELLIPDVLANV 438 + + KA+ + + + G + A R + + L + Sbjct: 410 LGQTAIDAVADAEKAVTILGDLASDLAEASGTDPAPAVAAARALGFGMLDGPFRQWLTTL 469 Query: 439 N-FSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTL 479 + A E + + +++ L V + Sbjct: 470 TPGADAGERRVAWQRETYKIITRLGRDLVDSAGDPAWEGRIV 511 >UniRef50_Q5YRB3 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5YRB3_NOCFA Length = 552 Score = 241 bits (615), Expect = 4e-62, Method: Composition-based stats. Identities = 84/537 (15%), Positives = 153/537 (28%), Gaps = 58/537 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMEL--AALALLVCIGQII 58 +LL + WI V +G ++ Q + + + + L L + + Sbjct: 8 FDLLDEPWIIVTDASGKASEVSLRQVFRRADEYVAIGGEVPTQQFAILRLLLAILHRTVA 67 Query: 59 APAKDD-VEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAND--VTPME 115 + D+ + + D F L H PF Q +++ V+ ++ Sbjct: 68 DRPGTAIDVWSRLWREWPA-DDIDRYLLAHRDRFDLFHPSTPFFQVADLRSAKDGVSSLD 126 Query: 116 KLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPG-------------FGGGF 162 KL+A V + + G A L + P G G+ Sbjct: 127 KLIADVPNGDKYFTTRAGRGLDHIDFGEAARWLVHAHAFDPSGIKTGAVGDARVKGGKGY 186 Query: 163 KSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE-S 220 G+ + ++ G DLR T+LLN++ ++P+ + P W + Sbjct: 187 PIGVAWAGSLGGVYLEGGDLRRTLLLNLVLADPDGDRYPDH----DLPPWERDPDGPAVR 242 Query: 221 IPASSIGFVRGLFWQPAHIELCDP--IGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 G V WQ + L + G C G L+ + + Sbjct: 243 DITGPSGPVDLFTWQSRRVRLVNSGAQVTGVVLCNGDALESFNKQLLEPMTGWRYSE--- 299 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRV--------AAV 330 + K GE + W + ++ D R AA Sbjct: 300 ------NQSKKAGETRHYPMVHDPEKSLWRGLRSLLGDVASSEPVAGRAIAPGVVEWAAT 353 Query: 331 VNQFRNIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVINE-IVTVGLG 387 + + P P+ L G Y NN + + + D + F + V Sbjct: 354 LLDAGALPPDQPIRLHAVGMHYINNLSIVGDIVDDAIGFRAAMLASDPRLRICAVAAVQI 413 Query: 388 YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA--DE 445 + A+ A+ + G + E F E LA + Sbjct: 414 AEEAVDALANLAADLAAAAGGEPTGARMRAREEGFF--ALEAPYRRWLAGLVPQSTGYAR 471 Query: 446 VIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLAL-------ARATLYKHLRELKP 495 A+ + L + +A + + A YK LR + P Sbjct: 472 QAAEWEQTAFVIVRRLGTEYIAAAGDVAWVGRPVREKWLDSSIAERWFYKKLRAVLP 528 >UniRef50_Q4JWJ7 Putative uncharacterized protein n=2 Tax=Corynebacterium jeikeium RepID=Q4JWJ7_CORJK Length = 561 Score = 241 bits (614), Expect = 5e-62, Method: Composition-based stats. Identities = 66/559 (11%), Positives = 139/559 (24%), Gaps = 80/559 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL WI +G ++ + S + ++A LL+ + Sbjct: 22 FSLLDQPWILTTLTDGSAAELSLREIFDGSHSVASIRGDSPLQDVAIYRLLLTVYWCAHR 81 Query: 61 AK---------DDVEFR-HRIMNPLTEDE---FQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 + D E+ R+ + + + D F L + PFMQ + Sbjct: 82 QELLSDPGTELDMAEWIPDRLEAAAENEPDNTVLNYLERYADRFDLLDPKQPFMQVADLH 141 Query: 108 AND--VTPMEKLLAGVSGATNCAFVNQPGQGEALCG--GCTAIALFNQANQAPG------ 157 + T + +++ + AL A L Sbjct: 142 TSKNATTDVRRIVPDFED----DYFTLRAGDGALSLTYAEAARWLIYVQAYDYSGIKSGA 197 Query: 158 -------FGGGFKSGLRGGTPVT-TFVRGIDLRSTVLLNVLTLP-RLQKQFPNESHTENQ 208 G G+ G +T T + G +L+ T+ LN + + P + Sbjct: 198 VGDPRVKGGRGYPIGTGWTGAITATIILGENLQETLALNTTGGALQAKNDHPVWEREPDT 257 Query: 209 PTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGF-LKE 267 + + I G L WQ I L G + + + Sbjct: 258 SAQRLDPANKDGIYPK--GPAEILTWQSRRIRLFPDG--------GLITQVLVSNGDRIP 307 Query: 268 KFTFTVNGLWPHPHSPCLVTVKKGE---VEEKFLAFTTSAPSWTQISRVVVDKIIQ---- 320 V P +P + K + W + ++ + Sbjct: 308 NANANVQD---DPMTPYRFSKNKSTKTLDVYYPKPLDSQRTMWRSLEPLIALETDPVYDA 364 Query: 321 NENGNRVAAVVNQFRNIAPQS-------PLELIMGGYRNNQASILERRHD--VLMFNQGW 371 + ++Q + + + Y N A + L Sbjct: 365 KNRAPKRPKTIDQLAYLKENEVDLPETLNVSMTSMEYGPNSAVVGISISTQIELPLEILP 424 Query: 372 QQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLI 431 + N ++ + A + F+ G A + E Sbjct: 425 KSALNQRQAVLNLAAATSQAGTMLGSFAGQLFQAA---GGDYEFQPAATDTLLAELEPKF 481 Query: 432 PDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYA--------HHPKLISTLALAR 483 + + +++ + + + + + + A Sbjct: 482 SEWMKSLHGADLEPRLKAWEMTVRNAVLEHADVLLVGAGPKALVGRIIESNDDERFVSAG 541 Query: 484 AT---LYKHLRELKPQGGP 499 L + LRE+ P+ P Sbjct: 542 TVMNWLQRKLREILPRTVP 560 >UniRef50_C3PF93 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=C3PF93_CORA7 Length = 581 Score = 240 bits (613), Expect = 7e-62, Method: Composition-based stats. Identities = 71/542 (13%), Positives = 143/542 (26%), Gaps = 76/542 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NLL + WI V + D +++ + A L +L+ I Sbjct: 7 FNLLDEPWIKCMDGTNQPVSLSIRDIFSGRGDAYKVVGDSPTQDYAVLRVLLAIFWRAHA 66 Query: 61 AK----------DDVEFRHRIMNPLT-------EDEFQQLIAPWIDMFYLNHAEHPFMQT 103 + +D ++ +D + + + D F L PFMQ Sbjct: 67 LELVESYADDNWEDFDWPEWFDELREQLVNEKRDDVVLEYLDGYEDRFDLLSPSAPFMQV 126 Query: 104 KGVKA--NDVTPMEKLLAGVSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGF- 158 + P+ ++ + F E+L A L + Sbjct: 127 ADLHTKSGATRPVSFIVPEAAD----DFFTMRTAEGRESLALDEAARWLIHTQAFDFSGI 182 Query: 159 --GGGFKSGLRGGT--PV---------TTFVRGI-DLRSTVLLNVLTLPRLQKQFPNESH 204 G ++GG P+ T + G + T++LN L Q Sbjct: 183 KSGAEGDPRVKGGKGYPIGTGWTGRTGGTIILGEGGILETLILNTPPSAVLDSQEGGAVS 242 Query: 205 TENQPTWIKPIKSNESIPAS-------SIGFVRGLFWQPAHIELCDPIGIGKCSCCGQES 257 + P W + + P S G V WQ I L + Sbjct: 243 A-DTPVWEREPDTAAQRPGSSDDIGAVPHGAVDLATWQARRIRLFFEGD--------RAV 293 Query: 258 NLRYTGF-LKEKFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQISRV 313 + + V G P +P + KKG + + W + + Sbjct: 294 QVLVSNGDRIPDAGKNVMG---DPMTPYRYSPNQSKKGTPAYYARPYDPTRTMWRALDAL 350 Query: 314 VVDKIIQ--NENGNRVAAVVNQFRNIAPQSP----------LELIMGGYRNNQASILERR 361 + + + N+ N+A L L+ Y ++S+ Sbjct: 351 IALEDDPGFDNGKNKAPKRPRNLSNLAALEADGYLDKSLLDLALVSMEYGPQESSVASTF 410 Query: 362 HDVLMFNQGWQQYGNVINEIVTVGL-GYKTALRKALYTFAEGFKNKDFKGAGVSVHETAE 420 + + ++ + + A+ + G + Sbjct: 411 IATIGLPLVVLRADETGRKVRNAVRTSAEKTGKAAISLGWFAGQLLVAAGGDYEFGSSTA 470 Query: 421 RHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 FY + E L + + A+E D + ++ + + + + + Sbjct: 471 DRFYARLEPLFLTWMTGLISDNAEEWQIDWQKQVREQVLRDARELLRGAGTKAIVGREVD 530 Query: 481 LA 482 Sbjct: 531 AG 532 >UniRef50_A5GBL8 CRISPR-associated protein, Cse1 family n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GBL8_GEOUR Length = 545 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 90/557 (16%), Positives = 162/557 (29%), Gaps = 82/557 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 MN+ D WIPV GG+ ++ +L S+ D++ R ++ + L +C+ Sbjct: 1 MNVAFDPWIPVVTITGGR-ELASLCSVLTEGDKFADLAVRPHERVSLMRLFLCVTHAALK 59 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK--------ANDV 111 KD E+ Q+ + W D F L H E P++Q G+K + Sbjct: 60 GPKDYDEWCEVPKRLPVAA--QKYLTEWKDSFELFHKERPWLQVAGLKGVEKEGSDSGKT 117 Query: 112 TPMEKLLAGVSGATNCAFVNQPGQ--GEALCGGCTAIALFNQANQAPGFG---------G 160 +P+ L +S N + GQ + + L N + G G Sbjct: 118 SPLSLLDFELSTGNNSTLHDHGGQLIVRQIEPERVVLNLLTFQNFSSGGGSPVAQWMTTK 177 Query: 161 GFKSGLRGGTPV-----TTFVRGIDLRSTVLLNVLTLPRL------------QKQFPNES 203 + G + RG L T+ LN+ T K+ Sbjct: 178 TLQVGNPDAPCLSQSMAHCLFRGASLAETIQLNLPTFETARRLYNSFATHKKDKEKQEWE 237 Query: 204 HTE------NQPTW----IKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCC 253 E +P W P ++S+ ++ ++ L + L + C Sbjct: 238 RVEITVVEMGKPVWEFFPESPDSQSDSVINATKTYIGRLVPISRWVLLFNESDQMYCCNG 297 Query: 254 GQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRV 313 + K F V+ K + S W ++S + Sbjct: 298 --------FKYDTFKDGFPSEPTASVQLVTKRDKNGAESVDRKVVKIEPSKALWRELSAL 349 Query: 314 VVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQA----SILERRHDVLMFNQ 369 +V + G +A N S + + +QA ++ H F Sbjct: 350 LVKRS-AFGLGGPLA-----MENAPHDSEFDFHVCAMTRDQASMDIALESVFHVTPAFQF 403 Query: 370 GWQQYGNVINEIVTVGLGY-------KTALRKALYTFAEGFKNKDFKGAGVSVHETAERH 422 + Y I + + + E K K + A H Sbjct: 404 NFPVYQAEIVRAEGISRRLGWTVEVYRKEVDGDWANRVERAKEKWV--LKAKLQSIATIH 461 Query: 423 FYRQSELLIPDVLANVNF---SQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTL 479 ++ E + ++ ++ A R L + SVA P+ + Sbjct: 462 YWTTVEKNLALLMTHIESIGTDDAIPTREAWRKMLFATACDAY--SVACGQETPRQMRAF 519 Query: 480 ALARATLYKHLRELKPQ 496 A L E + Sbjct: 520 AKGWQKLTTKKDEPETD 536 >UniRef50_C7QEM7 CRISPR-associated protein, Cse1 family n=12 Tax=Actinomycetales RepID=C7QEM7_CATAD Length = 1540 Score = 237 bits (604), Expect = 9e-61, Method: Composition-based stats. Identities = 71/529 (13%), Positives = 143/529 (27%), Gaps = 61/529 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII-A 59 +L W+PV +G + + S RL + A L LL+ + Sbjct: 987 FDLTSAPWLPVLYADGMQGVLSLRDVFAQSNLIRRLVGDLPTQDFALLRLLLAVLYDAVD 1046 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTPMEKL 117 +D ++ + + + F L H PF Q G++ +V P+ K+ Sbjct: 1047 GPRDGQDWEDLWTSDDPFAAVPAYLDSHRERFDLLHPATPFYQVPGLQTAKGEVGPLNKI 1106 Query: 118 LAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF-------------GGGFKS 164 +A V + E L A L + G + Sbjct: 1107 VADVPDGD-PFLTMRMPGVEQLSFAEAARWLVHTQAFDTSGIKSGVVGDPKAVNGKRYPQ 1165 Query: 165 GLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKP-----IKSN 218 G+ + F G LR T+LLN++ Q + ++ P W Sbjct: 1166 GVAWLGNLGGVFAEGDTLRQTLLLNLIPADTTNLQV---TSAQDVPAWRGTNGRAGSDHA 1222 Query: 219 ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 ++ P G WQ I L + + E +G+ Sbjct: 1223 DAEPRVPAGLRDLYTWQSRRIRLEYDT---------RGVTGAVLTYGDELTAHNKHGV-- 1271 Query: 279 HPHSPCLVTVKKGE-----VEEKFLAFTTSAPSWTQISRVVVDKII-QNENGNRVAAV-- 330 P + + + + + +W I ++ + A+ Sbjct: 1272 EPMTGWRRSKPQEKKLGLSTVYMPQQHDPTRAAWRGIESLLAGSAGSGSSQTGEPASHYR 1331 Query: 331 ---------VNQFRNIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVI- 378 + N+ + + + G Y Q+ I E D L + Sbjct: 1332 PKIVDWLGELAHHGNLPSRGLIRVRTSGAVYGTQQSIIDEVVSDELTMAVVLLHEDDPRF 1391 Query: 379 -NEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLAN 437 VT +A+ ++ + V TA + + L + Sbjct: 1392 GKAAVTAVKDADSAVAALGDLASDLARAAGLDPEPERV--TARDRAFGALDGPYRRWLLD 1449 Query: 438 VNFSQADEVIAD-LRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARAT 485 + S + + +++ + + + + R Sbjct: 1450 LGNSTDPAAMRAVWQGRVYDIIAVQGQMLLDSAGSAAAQGRMVKTTRGE 1498 >UniRef50_A8LYZ4 CRISPR-associated protein, Cse1 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8LYZ4_SALAI Length = 495 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 69/524 (13%), Positives = 144/524 (27%), Gaps = 69/524 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +L WIPV ++ + + + + L++P +L I + Sbjct: 5 FDLTDQPWIPVVAKS-ELELVGLRELFVRAAEFDDLAVPVPPAASGLWRILYAITARVTG 63 Query: 61 AK--DDVEFRHRIMN-----PLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV--KANDV 111 ++R R + A + D F L A P+MQ + + Sbjct: 64 LDMLRGPQWRQRQERLLDQGGFAAGDVDAYFAKYSDRFDLFGALRPWMQDPRLAVECPKS 123 Query: 112 TPMEKLLAGVSGATNCAFVNQPGQGE--ALCGGCTAIALFNQANQAPGFG------GGFK 163 + + KL+ + + + AL G A L Q G K Sbjct: 124 SGVNKLVFDRPAGNSQVWFGHHTDADAVALAPGEAAWYLIAQLYYGASGRCSSREVAGQK 183 Query: 164 SGLRGGTPVTTF----VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE 219 P+ G +L ++++ V + + W + + Sbjct: 184 FANSNAGPLRGVMSYHPLGENLFESLVVGVPP-----GVSSGQDEGLDLCPWERDELPDP 238 Query: 220 SIPASSIG-FVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 S+ L + H L P G+ + Sbjct: 239 LGAPWSVSWPCGALTGRARHAVLLVPDAAGEAVSDAYVTWAWRLPGAASP---------- 288 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 P +V + E L S W + ++ E ++ ++ Sbjct: 289 ---DPYVVRRQNKEGGWYQLPADDSRALWRDVDALLG---GNTEVKTHRPDIMAVAADLG 342 Query: 339 PQSPLELIMGGYRNN-QASILERRHDVLMFNQGWQQYGNVI---------NEIVTVGLGY 388 + G+ + QA + + GW + + ++G Sbjct: 343 LDG--RVRAYGFDQDGQAKDRQWFIALTPPVLGWLSERDPVTADGVALLTRAAESIGRRV 400 Query: 389 KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIA 448 ALR+A + T E +++ ++E + + + + F++ Sbjct: 401 GAALRQAWRELVSVKDREG------PWAHTGEAYYWTRAEAVFWEHVRDGRFAEGG---- 450 Query: 449 DLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRE 492 +L + + A P+L+ A L LR+ Sbjct: 451 ---RAFARLGHEAIDHAADGDASSPRLVRATQTAHRLLTTPLRK 491 >UniRef50_B7KJ23 CRISPR-associated protein, Cse1 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ23_CYAP7 Length = 525 Score = 236 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 73/528 (13%), Positives = 167/528 (31%), Gaps = 60/528 (11%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCS-RDQWRLSLPRDDMELAALALLVCIG-QIIA 59 +LL + WIPV + K + I+LQ L+ + + +A L+ I Sbjct: 17 SLLTEPWIPVVYNDSLKYKNISLQDLFLEWENLKTVQGINPPRTIALWRWLIAFTQWSIQ 76 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN----DVTPME 115 K E++ + + + + L H + PF Q K ++ + +P+ Sbjct: 77 GTKTIDEWKQLWTDENLGSRIIKRLETVKERLDLLHPDFPFGQCKDLREETKGKEPSPVS 136 Query: 116 KLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGF------GGGFKSGLR 167 K+L + ++ L L G ++G+ Sbjct: 137 KILFQ--DKDSGLLWSKYSDQNPAFLSYAEAVQELLRLLCCDLGGTKSDSQDRSAQTGIC 194 Query: 168 GGTPVTTFVRGIDLRSTVLLNVLTL-PRLQKQFPNESHTENQPTWIKPIKSNESIPASSI 226 + G +++ T+LLN+ P+ ++ P W + + + + Sbjct: 195 VMGRI-VMPIGKNVKETLLLNLHQYSPQDDIPSIFPDQDKDLPLWER--LNIKKQSRTIT 251 Query: 227 GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLV 286 G + L + + L +S + E+F + W + Sbjct: 252 GLLDYLTFPNRRVMLIH----------NGKSVTGVYLYKGEEFNQKDSYFWE--LWQAYI 299 Query: 287 TVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN---IAPQSPL 343 VK + K L + SW ++ Q+ + ++ + + R+ + P+ Sbjct: 300 QVKDESMPLK-LKLDINKASWRDAEALL-HPTTQDNHKPKIFDWLVKCRHTGCVPDPIPV 357 Query: 344 ELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFA--- 400 +++ + ++ L HD + Q + ++V GL Y + + + Sbjct: 358 QVLGFAHGSDLGKPLHWLHDTMTIPQVYLDSKEAYYKLVE-GLKYAEKIGRLFSSKTYET 416 Query: 401 -------EGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVI------ 447 + F + + + + ++ + + DE Sbjct: 417 VANGLKLSKKDKQKFINQLSTTAAIYWSALDSEFQQFMFELAEDKVVDEEDEDDITFGEI 476 Query: 448 --ADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLREL 493 + ++KL + + QS+A + A YK L ++ Sbjct: 477 KIPEWKNKLKTIATECYEQSIAGISS----YEARARGLNAWYKELNKI 520 >UniRef50_Q2JH30 Putative uncharacterized protein n=2 Tax=Frankia RepID=Q2JH30_FRASC Length = 550 Score = 233 bits (595), Expect = 8e-60, Method: Composition-based stats. Identities = 81/509 (15%), Positives = 155/509 (30%), Gaps = 54/509 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALA--LLVCIGQII 58 NL+ WIPV R G ++++ ++L + L+L +A L LL + + Sbjct: 4 FNLIDGQWIPVIKR-GRRLEVGIRKALVDAHTIDGLALDDPLEAVAVLRQVLLPVVLDVF 62 Query: 59 APAKDDVEFRHRIMNP-----------LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 + D E+ R E+ + + F+L H PF Q G++ Sbjct: 63 GAPRTDEEWSQRWEAGCFDRIIRKDRAEDEEGIESYLIRQAARFHLFHPTAPFAQVAGLR 122 Query: 108 AN--DVTPMEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGF---GG 160 + P+ L+ ++ N + + + L A AL G Sbjct: 123 TAKDETKPVSLLVPRLASGNNVPLFSSRTENDPPSLTPAAAARALLAAHCWDTAAIKTGA 182 Query: 161 GFKSGLRGG-------TPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQP 209 ++ G P+ G L T++L++ + +++P Sbjct: 183 ADDPKVKTGKTMGNPTGPLGQFGIVLPLGETLFHTLMLSIP-------VLRHGLRQKDRP 235 Query: 210 TWIKPIK-SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEK 268 W ++ + G + L WQ I L S R ++ Sbjct: 236 QWRSESSATSRWETRAPEGLLDLLTWQSRRIRLVPEADPTAVE---DVSVRRVVLTAGDR 292 Query: 269 FTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVA 328 T +V+ L PH + K E + + +W + ++ + ++ Sbjct: 293 LTGSVHALEPHTAWRQVDKPKADEPPVRPVRHQPGRSAWRGLEALLTTTPLSSDKVFAPT 352 Query: 329 AVVNQFR-----NIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVINEI 381 A+ R + PL+++ G Y A I E D + + + E Sbjct: 353 ALSQLARLRDDGYVPDDLPLQVLTVGVKYGTQSAVIDEVMADEIPLPVTALARDSAVRET 412 Query: 382 VTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSV--HETAERHFYRQSELLIPDVLANVN 439 V +LR A + + + + + +LA + Sbjct: 413 VLAVAAQAESLRIAANRLGDDLREAAGATDKLPWDKGQRLGEILIHSFNPTVHRLLAGL- 471 Query: 440 FSQADEVIADLRDKLHQLCEMLFNQSVAP 468 Q E L L + V P Sbjct: 472 -QQHPEDAKRAELAWRILARRLAWEVVDP 499 >UniRef50_C8XAY7 CRISPR-associated protein, Cse1 family n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XAY7_NAKMY Length = 555 Score = 233 bits (594), Expect = 1e-59, Method: Composition-based stats. Identities = 76/549 (13%), Positives = 151/549 (27%), Gaps = 60/549 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 N++ + +P +G I Q+L + + M A LL+ I P Sbjct: 5 FNVIDEPVLPAVWLDGTSADISIRQALIDAHRIAAIEGEPASMTFALHRLLLAIVYRALP 64 Query: 61 AKDDV-EFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTPMEK 116 + E+R P L ++ + W F L P++Q G+ ++ + +EK Sbjct: 65 VERPRQEWRELWDAPELPAEDLNSYLDDWYQRFDLLDPAQPWLQVAGLHTTRSEFSELEK 124 Query: 117 LLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGF---GGGFKSGLRGGT--P 171 L+ + V ++ A L + P G ++GG P Sbjct: 125 LIPDIPNGEQFFTVRAGLAARSISLAEAARYLIHAQAFDPSGIKSGAVGDPRVKGGKGYP 184 Query: 172 VTT---------FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSN---- 218 + T V+G L+ T+LLN+ P S E+QP W + + Sbjct: 185 IGTAWAGNLGGVLVQGRTLKETLLLNLTLGSPNDDDRP-WSGEEDQPVWEREPLTAAEEF 243 Query: 219 ------ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFT 272 + + G L W + L G + + + + Sbjct: 244 PGETTGDIPGRAPRGPADLLTWPSRRMRLRVEAD----RVTGVLIANGDVLWPQNRHAWE 299 Query: 273 VNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKI----IQNENGNRVA 328 W + K W V+ + A Sbjct: 300 PMTAWRRSD---PQSKKYKTTVYMPRQHDADRSFWRGAGAVLPRADRSHHTVDGETGLPA 356 Query: 329 AVVNQFRNIAPQ-------SPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEI 381 A + + I Y +N + I D ++ + ++ + Sbjct: 357 ASLRWLQGAVDDVLGPNFVLRARAISVIYGSNSSVIDAVYDDTMVVPAAVLDHPDLQQGL 416 Query: 382 VTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLAN-VNF 440 + A+ +AL + G G ++ A + + + + + V+ Sbjct: 417 IDAIAHTDKAV-QALGDLGRDL-EQAAGGVGDALRGRARQLGFHRLDEPFRRWMGTLVSG 474 Query: 441 SQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKL----------ISTLALARATLYKHL 490 S D + + + + + A + +L Sbjct: 475 SNLDAAVTQWFVVARRDLVSIGLALIDAAPPEAWTGRPDPRRPDRRLDVVSAERRFHWNL 534 Query: 491 RELKPQGGP 499 P P Sbjct: 535 AAALPTAPP 543 >UniRef50_B8IZA3 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA3_DESDA Length = 540 Score = 232 bits (591), Expect = 3e-59, Method: Composition-based stats. Identities = 89/538 (16%), Positives = 164/538 (30%), Gaps = 65/538 (12%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIPV +G + Q++NL+ C QWR R +A + LL+CI Sbjct: 1 MNLVSDQWIPVLDNSG-QHQLVNLREALCEGAQWRDLAVRPHERVALMRLLLCIAHAALN 59 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN---------DV 111 ++ L D + W D F L H + PF+Q G+KA D Sbjct: 60 GPSREDWSRVPQ--LLPDAVAAYLQKWQDSFDLFHPQKPFLQISGLKAASKKTKRTEDDE 117 Query: 112 TPM---EKLLAGVSGATNCAFVNQPGQGEALCGGCT--AIALFNQANQAPGFGGG----- 161 P+ KL ++ + G A+ L + +PG G Sbjct: 118 GPLVKASKLDFTLATGNQSTHFDHEGSLAQRSAEQALPALNLLSYLCFSPGGLIGTVVWN 177 Query: 162 --------FKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPN-ESHTENQPTWI 212 G+ TF RG ++ T+ LN+ ++++ + +P W Sbjct: 178 NHVTARSSSDGPCAVGSMTHTFWRGANVLQTLHLNMCARDDIERRLASIPEAGWGKPVWE 237 Query: 213 KPI---KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKF 269 + + ++ ++ L + L G G F Sbjct: 238 QMPVSFDDANAWRNATHTYLGRLTPLSRLV-LFQRGASGMTLGAGPV---------FPNF 287 Query: 270 TFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA 329 P S ++ K + L T W ++ +V + + G Sbjct: 288 NNAKAPYVEEPTSTIILRGKDNKQTRALLPVTPGKALWRELHALVAHRNKDDVGGF---- 343 Query: 330 VVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQ------------QYGNV 377 + S ++++ G +QA +++ V Q Q QY Sbjct: 344 WAAALASAEGASGRDMVVAGMARDQAEVVDTLESVYHIPQAMQKTPGQLVYGQGVQYAED 403 Query: 378 INEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLAN 437 ++ + + + + + + A ++ +E +P + A Sbjct: 404 MSRKLGWAVDTYREKVDGGWAGRLKGAGAGKVELLIKLRQKAFTLYWTAAEQSLPLLFAC 463 Query: 438 VN---FSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRE 492 V + L ++ P+ AL A L K + Sbjct: 464 VESLGSDAFPAAQKAWQKALFVAARKAYSSI--CAPQTPRQHRAFALGLARLCKPVAT 519 >UniRef50_Q1EQS6 Putative uncharacterized protein n=2 Tax=Streptomyces RepID=Q1EQS6_STRKN Length = 544 Score = 232 bits (591), Expect = 3e-59, Method: Composition-based stats. Identities = 69/538 (12%), Positives = 140/538 (26%), Gaps = 62/538 (11%) Query: 1 MNLLIDNWIPVRP--------RNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLV 52 NLL + WIPVR G +I + L S + L++ A L +L Sbjct: 6 FNLLDEPWIPVRWTPTELSSAVAGRPDRIGLRELLARSPEIAGLAIAEPPAHSALLRILY 65 Query: 53 CIGQIIAPAKD--DVEF----RHRIMNP-LTEDEFQQLIAPWIDMFYLNHA--EHPFMQT 103 + + + ++ L +A + F+L P+MQ Sbjct: 66 ALTARVTGLDEAGPGDWGVRRADVRDAGELPPQGISDYLATYRHRFFLYDPDGGRPWMQD 125 Query: 104 KGVKAN----DVTPMEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPG 157 + + + KL+ N ++ + L P Sbjct: 126 ARLAHECDPDNTAGVNKLIVTRPSGNNHSWFEHTSDAAPGLPTASEAVLNLLVWHYYGPS 185 Query: 158 FGG------GFKSGLRGGTPVTT----FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTEN 207 G KS P+ T G L T+L ++ ++ Sbjct: 186 GRCSSREVNGAKSASAKAGPLRTALSYHPEGETLFETLLAGLVPPKST------VKSAQD 239 Query: 208 QPTWI-KPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLK 266 Q W + +++P + G L H L P Y Sbjct: 240 QCPWEWHDLPDPDAVPVAPAGPCARLTACSQHALLLVPQEPDGQWVRDAFITWAYRDGRI 299 Query: 267 EKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNR 326 + L+ + + W + +++ + R Sbjct: 300 PRD------------DSFLIWQVSQQGNRYPRPADSGRALWRDLDALLLKESAGAAQPRR 347 Query: 327 VAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNV----INEIV 382 V ++ + + + ++ + + Sbjct: 348 P-RVFEYACEVSDYLRVRALGFEQEGQAKDTQFVDASTPPVVEFVERETARTALPVATLR 406 Query: 383 TVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ 442 +G Y L +A+ + + AG + ++ +E A ++ + Sbjct: 407 QLGETYGRRLDRAVKRAWAQYVDDAKADAGTWAAQAG-ARYWPGAEAEFWHRFAQLDRTG 465 Query: 443 AD----EVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 AD A R +L ++ + + AR LY R+ K + Sbjct: 466 ADLGAGFDAAAARTAFLRLAMAAYDSVTDSVTRTQRGARAASDARIELYGGPRKKKQE 523 >UniRef50_D1A6Q3 CRISPR-associated protein, Cse1 family n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A6Q3_THECD Length = 722 Score = 232 bits (590), Expect = 3e-59, Method: Composition-based stats. Identities = 74/531 (13%), Positives = 144/531 (27%), Gaps = 63/531 (11%) Query: 1 MNLLIDNWIPVRPRNGGK---VQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQI 57 +L ++ W VR + G ++ L + + L++ A LL + Sbjct: 7 FDLALEPWAEVRWKEAGPDRPSRLGLRDLLVHAHEIEALAITPPPALSAMYRLLYALTAR 66 Query: 58 IAP----AKDDVEFRHR----IMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN 109 + D ++ R PL D A F L H + PF+Q + Sbjct: 67 VTGLDENPDGDGDWLDRRAEIFGEPLAPDAVDAYFAEHEGRFDLFHPQRPFLQDPRLADP 126 Query: 110 DVTP----MEKLLAGVSGATNCAFVNQPGQGEAL--CGGCTAIALFNQANQAPGFGGGFK 163 V P + KL+ G N + + ++L P + Sbjct: 127 AVCPKSAGVNKLVLGRPAGNNSVWFGHHWDASPIPVPTPDAFLSLLVWLYYGPSGRCSTR 186 Query: 164 SGLR------GGTP----VTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIK 213 + P ++ G L T+L + P + ++ W Sbjct: 187 THADVTAADVSAGPLRGSLSYHPEGDTLLETLLAGLTPPPEGLR------RADDPCPWEL 240 Query: 214 PIKSNESIPASS----IGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKF 269 + P + G L H L P G+ ++ T + K Sbjct: 241 ADLPDPLAPPRTPNPYPGPCTRLTGGWQHALLLVPDDTGR-----HVTDAYITWGHRGKL 295 Query: 270 TFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA 329 T + ++ + W + ++ R A Sbjct: 296 PSTNDA--------YVIFQISKQGNLYARPADAGRALWRDLDGLLDLPTTATGTQPRRPA 347 Query: 330 VVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMF----NQGWQQYGNVINEIVTVG 385 V + + + I N I ++ T G Sbjct: 348 VFGTGLDDLGSFKVRALGFEQDGKTKDIQFISAVTPPLLFRINDEDLATARRIGDMRTAG 407 Query: 386 LGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADE 445 Y L A+ + K + E A ++ ++E + L N ++ Sbjct: 408 ELYGGRLEYAVKRAWAAVVDDKPK--DCAWAEHAAAAYWPKAEEIFWTRLRNQDYD---- 461 Query: 446 VIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQ 496 + ++ +F+Q +A + + AR LY R+ K + Sbjct: 462 --RHWQS-FRRVAISVFDQITRDHARGARTARAIEEARLELYGGARKAKRK 509 >UniRef50_C2CN11 CRISPR-associated protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CN11_CORST Length = 562 Score = 230 bits (586), Expect = 9e-59, Method: Composition-based stats. Identities = 76/547 (13%), Positives = 151/547 (27%), Gaps = 82/547 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 +LL + WI +G + + Q + D L + A L +L+ I Sbjct: 9 FSLLDEPWIAAVGSHGEPLLVSIRQIFDGTHDIAELRGDSPAQDYAVLRVLLAIFWRAHS 68 Query: 61 AK--------DDVEF----RHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA 108 E+ R + + + + + + F L ++HPFMQ + Sbjct: 69 VAKPAGKTKFSMAEWFVKARADALEGAADIKVLSYLDGYANRFNLFDSDHPFMQVADLHT 128 Query: 109 ND--VTPMEKLLAGVSGATNCAFVNQPGQG-EALCGGCTAIALFNQANQAPGF---GGGF 162 V+P+ +++ N F + G+ E L G A L G Sbjct: 129 EKGSVSPINRIIPEAE---NEFFTMRAGKPLETLSFGEAARWLVYVHAYDYSGIKSGAVG 185 Query: 163 KSGLRGGT--PV---------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW 211 S ++GG P+ T++ G +LR T+ LN ++P W Sbjct: 186 DSRVKGGRGYPIGTGWTGMTGGTYLIGANLRETLALNTT--------EACLRTPNDKPVW 237 Query: 212 IKPIKSNESIPASSI---GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGF-LKE 267 + + +I G WQ + L + + + Sbjct: 238 EREPDTAAERNGGAIHIGGPADLATWQTRRVRLHREN--------NEVVAVLVSNGDRIP 289 Query: 268 KFTFTVNGLWPHPHSPCLVTV---KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENG 324 G P +P + KK V + W + ++ + Sbjct: 290 DAGLNAFG---DPMTPYRYSKNQSKKDFVAFYPRPYDAGRTMWRSLEPLIAMESDAPYLR 346 Query: 325 NRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNV----INE 380 + ++ + L+ + +R + ++ G Q Sbjct: 347 S-TSSAGKGAQAPKRPEILDQLAYYWRTDVLPPSVAHVGMVTVEYGAQSSSVAASVTART 405 Query: 381 IVTVGLGYKTALRKALYTFAEGFKNKDF------------KGAGVSVHETAERHFYRQSE 428 + + + A+R+A A+ N G Q E Sbjct: 406 TLNLPVLENDAVRRAALNAADVTLNAAIALGQFGGNLLVAAGGTYEFQAEPMDSALAQLE 465 Query: 429 LLIPDVLAN-VNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLY 487 +A + + E + + ++ F S H + R L Sbjct: 466 PAFNRWVAGWSDSPEPKEYAIEWQKQVRS-----FMLSRGEAMLHAAGPRAMI-GRPLLS 519 Query: 488 KHLRELK 494 + + Sbjct: 520 STTEKPQ 526 >UniRef50_A8M401 CRISPR-associated protein, Cse1 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8M401_SALAI Length = 560 Score = 226 bits (577), Expect = 1e-57, Method: Composition-based stats. Identities = 76/512 (14%), Positives = 146/512 (28%), Gaps = 59/512 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII-A 59 +L WIPV G + + + + + ++ A L LL+ I Sbjct: 8 FSLSGQPWIPVLDLAGRRRLVSLAELFAQAAELRAVAGDLPTQTSALLRLLLAILHRAVD 67 Query: 60 PAKDDVEFRHRIMNP-LTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA---NDVTPME 115 +D+ ++ P L + + + D F L H PF Q ++ NDV ++ Sbjct: 68 GPEDERVWQGLWRQPDLPAGDVVDYLDEYRDRFDLLHPVTPFYQVADLRTQKQNDVFGLQ 127 Query: 116 KLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAP-------------GFGGGF 162 + +A V + L A+ L + G G+ Sbjct: 128 RFIADVPNGAPYLTTRLGPGLQRLTPAEAAVWLVHCQAYDTSGIKSGAVGDPRVSGGKGY 187 Query: 163 KSGLRGGTPV-TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESI 221 G G + ++ G LR T+LLN++ L Q + + P W + Sbjct: 188 PIGPGAGGSLGLVYLEGRTLRETLLLNLVPLDNAYLQ---QDPERDSPMWERDPHGPAEE 244 Query: 222 P---ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 G + WQ I L + + +T N Sbjct: 245 AERDRGPHGPLNLYTWQSRRIRLFGDQT--------GITGAMIANGDRITWT---NLHRK 293 Query: 279 HPHSPCLVTVKKGEVEEKFLA-----FTTSAPSWTQISRVV---VDKIIQNENGNRVAAV 330 P S + + + + W ++ V+ +K + R AV Sbjct: 294 EPMSGWRRSPHQEKKLVLPTVYLPSLHDHTRALWRGLTAVLPTSAEKPGADAPTRRPPAV 353 Query: 331 VNQFRNIA------PQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVINEIV 382 + + + G Y N + I E HD + + + + Sbjct: 354 SQWLAGLRVTGLIDDRYRVTTRAVGVIYGNQMSVINEIYHDAVTMPVQAFEPTGPLATTI 413 Query: 383 TVGLGYKTALRKALYT------FAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLA 436 A L A G + + + + A Y + ++L L Sbjct: 414 VDSAADADAAVTTLRGLAVNLCRASGGYGERPEDPPAAAADRAAELAYAELDVLFRQWLG 473 Query: 437 NVN-FSQADEVIADLRDKLHQLCEMLFNQSVA 467 ++ + + + + L + V Sbjct: 474 GLDPADDPAQARVRWQTQARRCVLRLGSDLVD 505 >UniRef50_A8SDS0 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDS0_9FIRM Length = 537 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 85/525 (16%), Positives = 144/525 (27%), Gaps = 73/525 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NLL + WI VR R+ ++ ++L ++D L+ A L LL+ + + Sbjct: 6 FNLLTEPWIRVRLRDNTVREVSLTEALVSAQDYVDLAGEMPTQNAAVLRLLLAVLFTVFS 65 Query: 61 AKDD--------------VEFRHRIMNPLTEDE-FQQLIAPWIDMFYLNHAEHPFMQTKG 105 D + + + W D F+L H HPF Q Sbjct: 66 RVDAKGEPRPLMQSDDALERWSALWQLGHFPAAPVRDYLEQWKDRFWLFHPTHPFWQVPQ 125 Query: 106 VKANDVTPMEKLLAGVS-GATNCAFV--NQPGQGEALCGGCTAIALFNQANQ----APGF 158 K KL +S + E L A L A Sbjct: 126 AKIGTEYGAAKLNGEMSESSNKLRLFPLYAGQSKEQLSYPQAARWLLCVNGYDDTSAKPK 185 Query: 159 GGGFKSGLRG--GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW--IKP 214 G G S G G +G +L T++LN+ + E E++P W P Sbjct: 186 GKGLPSVGAGWLGKIGFIQAQGDNLYETLMLNLTL-----LRDGRECWGESKPCWELEAP 240 Query: 215 IKSNESIPASSIGFVRGLFWQPAHIELC--DPIGIGKCSCCGQESNLRYTGFLKEKFTFT 272 + + + L Q + L G C G F +E Sbjct: 241 KSAERTEICCPDNPAQLLTLQSRRLLLHRTGENVDGFCLLGGD-------FFPRENVFAE 293 Query: 273 VNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVV- 331 +W + K V + W + V ++G+R V Sbjct: 294 QMTIWR-----TMPIKKNEPVVFVPCRHDPAKQFWREFPAVFC-----QDSGHRPGVVCW 343 Query: 332 ------NQFRNIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQG--------WQQYG 375 + + + P+ + + G Y + + + D L F G WQ Sbjct: 344 IEKLQEKRLKLLDPRRKVHFRISGVQYGDKDFFVNDSFSDSLTFQAGILDEIGRPWQSRI 403 Query: 376 NVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVL 435 E + L A + +R F + + + Sbjct: 404 VREIERCEQTAALIGRFAQELAIAAGDRNENAGGAVRAQFYFAVDRPFRQWLQAI----- 458 Query: 436 ANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA 480 + DE + + + E L Q V + + Sbjct: 459 -DQEQDDPDEAALRWQTRARSIAEKLGKQMVMEAGNAALKGRRIV 502 >UniRef50_C8P6I4 Putative uncharacterized protein n=1 Tax=Lactobacillus antri DSM 16041 RepID=C8P6I4_9LACO Length = 584 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. Identities = 73/562 (12%), Positives = 144/562 (25%), Gaps = 100/562 (17%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WI V + + + Q + +L+ +LA L L+ I + Sbjct: 11 FNLVTEPWIKVVDEDNRERTVSLEQLFTNAVHYRQLAGEMKSQDLALLRFLLAILTTVYS 70 Query: 61 A----------------------------KDDVE------FRHRIMNPLTEDEFQQLIAP 86 +DD E ++ + + + Sbjct: 71 RYTADGEPYEWLKIDGQTMQPVPFEGKTFEDDDEDGLRQTWKDLYHAQHFTEIVTRYLQK 130 Query: 87 WIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG--------------VSGATNCAFV-- 130 + D F L A+HPF Q + + + P K++A Sbjct: 131 YADRFDLLDADHPFYQATRAQYDSLVPKNKVVAKGKGTVAVKQINRTISESNNKPDIFSP 190 Query: 131 NQPGQGEALCGGCTAIALFNQANQAPGFGGGF---KSGLRGGTPVT-----TFVRGIDLR 182 N L A L N K F G +L Sbjct: 191 NTSPHKNDLSLASLARWLITYQNFTAVTDKTKVVAKEKFPVSPGWLYGLNPVFATGSNLF 250 Query: 183 STVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK---SNESIPASSIGFVRGLFWQPAHI 239 T++LN++ +P Q + P W PI+ I Sbjct: 251 ETLMLNLVLIP--QGVNSETESMDQHPAWEVPIEEYIQARLTGIVPGNLAELYTLWSRVI 308 Query: 240 ELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGE--VEEKF 297 + G F + + P + K E ++ Sbjct: 309 HIEWDNGRPLI-------------FSAGLPKLNNHEAFLEPMTTWKFNKKDNEWQPNLRW 355 Query: 298 LAFTTSAPSWTQISRVVVDKIIQNENGNRV-AAVVNQF------RNIAPQSPLELIMGGY 350 L + W + + + + ++ +V + + L L G Sbjct: 356 LN-SLGKAMWRNFGQYISVQQDDTQKDSQREPGIVTWLHMLRSTKMLPADLALHLTTVGL 414 Query: 351 RNN-----QASILERRHDV-----LMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFA 400 N+ Q+ E ++ ++F+ + I + Y Sbjct: 415 INDGNATSQSPAAEFADEMQINADVLFDPNPLKRLQWPKLIENTVEMTEKVGALVWYFAN 474 Query: 401 EGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANV-NFSQADEVIADLRDKLHQLCE 459 + + K + FY + + LA + N + DE + + Q+ Sbjct: 475 HIMELRGVKD-DGAFANRVSARFYERLNQPFREWLAGLTNNDERDEKVNLWKQTAKQIAV 533 Query: 460 MLFNQSVAPYAHHPKLISTLAL 481 ++ + P+ I Sbjct: 534 QTADELLNSA--TPQDIRGRVK 553 >UniRef50_C1XYH9 CRISPR-associated protein, Cse1 family n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XYH9_9DEIN Length = 419 Score = 223 bits (568), Expect = 1e-56, Method: Composition-based stats. Identities = 66/395 (16%), Positives = 118/395 (29%), Gaps = 54/395 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINL-QSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ WIPVR G +++ ++L Q+L R R+ P + +A LL+ I Sbjct: 4 FNLITQPWIPVR--EGNQLKEVSLEQALLEGRRFERIEDPSPLVTVALYRLLLAILHRAL 61 Query: 60 -PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-KANDVTPMEKL 117 ++ E N ++ + +A D F L H E PF Q L Sbjct: 62 QGPENSDEAAKWFSNGFDAEKIRDYLAKHQDRFDLFHPERPFYQVPDFTLERSCRSWTVL 121 Query: 118 LAGVSGATNCAFVNQ--PGQGEALCGGCTAIALFNQANQAPGFGGG---FKSGLRGGTPV 172 ++ N + + L A L A G + T Sbjct: 122 APELNSDNNKVLFDHTVTSRPRPLHPAEAARLLVANQTFALSAGKSVLCHTATAPVATAA 181 Query: 173 TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGL 232 + G +L T+ LN+++ P+ + + + E +P + +K+ E+ A+ G V Sbjct: 182 LALMLGENLHETLCLNLVSYPKSE-YERDFATWEREPLRVSDLKNCEAARATPKGIVHRY 240 Query: 233 FWQPAHIELCDPIGIGKCS--CCGQESNLR------------------------------ 260 W + L G G+ G+ +++ Sbjct: 241 TWLSRAVRLDPEEGNGQADDPLSGRFASVHRTDDPLSGQGTHAPVHPGRSPAGQTAHAAV 300 Query: 261 ---YTGFLKEKFTFTVNGL--WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVV 315 ++ P P K ++ L F W + ++ Sbjct: 301 YRTVVRWIAYASGIRYEEAAIRPDPMVAFRPDPKDLS-KQYPLGFREGRALWRDFASLLP 359 Query: 316 DKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGY 350 VV RN+ G Sbjct: 360 RPGSA-----HSPRVVEHARNVYRALGTRFKGRGI 389 >UniRef50_C7MQD3 CRISPR-associated protein, Cse1 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD3_SACVD Length = 484 Score = 222 bits (566), Expect = 2e-56, Method: Composition-based stats. Identities = 77/516 (14%), Positives = 146/516 (28%), Gaps = 75/516 (14%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPA 61 N+ D IPV +G ++ L + L +P E L +L I I Sbjct: 3 NIATDPVIPVTRSDGTTTRLGLRDLLVHAHKIRHLDIPIPPAEAGLLRILYTITCRITGL 62 Query: 62 ---KDDVEFRHRIMNPL-----TEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA-NDVT 112 + + L D + + L + P+MQ + + Sbjct: 63 DTHDNRNTWAEHRNTVLATGRFDADAINAYLGKH--CWDLFDEQRPWMQDPRLPDQAERK 120 Query: 113 PMEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGFGGGFKSGLR--- 167 L G + + A L L G GG ++ Sbjct: 121 TANVLDMTRPGDNSAIWWKHTHADYAPPLPAHEAVQWLIVHHYYGSGGAGGKRTVTHNNK 180 Query: 168 -------GGTPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK 216 P+ T + G L T+L + P+ + T + W + Sbjct: 181 TVSDQYMSSGPLRSTVTYYPLGATLFETLLAGI--------PAPSHTTTGDAAPWETDLN 232 Query: 217 SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 P + L Q H L E + Y + + + Sbjct: 233 QPLGTPPAPTWPAGILTGQSRHALLL--------DHTDNEVDGVYLTWAWK----ERHTP 280 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN 336 P+ + K GE + TS +W ++ D+ AV+ + Sbjct: 281 ILDPYCIHNIDPKTGE--AQPRQANTSRSAWRDFDALLADRPTHTR-----PAVLGDALD 333 Query: 337 IAP--QSPLELIMGGYRNNQASIL--ERRHDVLMFNQGWQQ----YGNVINEIVTVGLGY 388 + Q L + G+ ++ + + + + + +VT Sbjct: 334 LPDDLQDTLRVRAIGWHQDRQATNTGWYVSETPPLLRYMDEHDPARAALAETLVTTADKV 393 Query: 389 KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIA 448 A+RKAL+ + D K + A+ ++ +E + L + + E Sbjct: 394 YGAMRKALHKAWKDADLGDPK--QCPWKDAADHLYWPAAEHIFWAHL---DANTPPE--- 445 Query: 449 DLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARA 484 + + + AP+A +L+ ARA Sbjct: 446 ---REFVDTAVRVIDTVTAPHAD--RLLVARETARA 476 >UniRef50_A4XYU2 CRISPR-associated protein, Cse1 family n=3 Tax=Pseudomonadaceae RepID=A4XYU2_PSEMY Length = 525 Score = 221 bits (562), Expect = 5e-56, Method: Composition-based stats. Identities = 84/512 (16%), Positives = 150/512 (29%), Gaps = 56/512 (10%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 LL + W+ VR +G ++ L+ S + L+ +A LL+ I Sbjct: 17 FTLLDEPWLAVRMHDGQVGELGLLELFERSGEIGALAETSPPSLIAQYRLLLAITHRAIT 76 Query: 61 AK----DDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVT---- 112 D E N L + + W + F+L H ++PFMQ + + T Sbjct: 77 QAQGRWTDAERMRWHQNGLPLAAIRDYLERWRERFWLFHPQYPFMQVAALADAEETRDKL 136 Query: 113 -PMEKLLAGVSGATNCAFVNQPGQ--GEALCGGCTAIALFNQANQAPGFG-GGFKSGLRG 168 P ++ + + ++ L PG + + Sbjct: 137 KPWTQISLASANGNAPVVFDHSCDLAPRSIGAADALCTLLGFLQFTPGGLVKTLRDSDKA 196 Query: 169 GTPVTT---FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNE---SIP 222 G T G L ++ L + P ++ E+ P W + S P Sbjct: 197 GALANTAAVMPMGDSLAQSLCLALHP--------PTQTGHEDLPAWERSAPSIAQLCGEP 248 Query: 223 ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHS 282 + G Q + L V P P + Sbjct: 249 ELATGPNDRYTRQSRAVLLLADDE---------RRVQWIRFAAGLALGDDVQA--PDPMA 297 Query: 283 PCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA---P 339 G L+FT W + ++ ++ AAV+ N+ Sbjct: 298 SYRA----GSNSLVRLSFTEGRALWRDLPALL---PDAEGKASQPAAVLEWAANLQFYLG 350 Query: 340 QSPLELIMGGYRNNQASILERRHDVLMFNQGWQ---QYGNVINEIVTVGLGYKTALRK-A 395 L++ G ++QA +L R + + + N + V +ALRK A Sbjct: 351 NGVQPLLIAGLASDQAKLLRWRSERIALPAKLLASPDHANELRRYVRDAEELFSALRKLA 410 Query: 396 LYTFAEGFKNKDFKGAGVSVHE-----TAERHFYRQSELLIPDVLANVNFSQADEVIADL 450 AE + K A F+ +E + V+A + + D+ A Sbjct: 411 TGMLAETLPDPGSKDTWARARSLIDAGPASALFFAGAERQLGRVMALLGSDELDQAEALW 470 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLISTLALA 482 R LH+ + + K + A Sbjct: 471 RQSLHKAAHEAWQTVLTDLGRGAKALRAEARH 502 >UniRef50_B6WQ59 Putative uncharacterized protein n=1 Tax=Desulfovibrio piger ATCC 29098 RepID=B6WQ59_9DELT Length = 516 Score = 218 bits (555), Expect = 4e-55, Method: Composition-based stats. Identities = 93/535 (17%), Positives = 173/535 (32%), Gaps = 61/535 (11%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 MNL+ D WIP R+G V+ +L+ + D L++ R +A + LL+C+ A Sbjct: 1 MNLVDDPWIPCIRRDG-MVRPASLRDCFTCDDIVDLAV-RPHERVALMRLLLCVSYAAAG 58 Query: 61 -AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK----ANDVTPME 115 +D + E + W D F L H + PF+Q G++ + D+TP Sbjct: 59 IPEDYDGWEDLRERLPLE--VPVYLDQWRDAFELFHPQKPFLQVVGLRSASASGDLTPCS 116 Query: 116 KLLAGVSGATNCAFVNQPGQGE-ALCGGCTAIALFNQANQAPGFGGG---------FKSG 165 KL ++ +N + E A A+ L + G G +S Sbjct: 117 KLDFSLATGSNSTLFDHAALMERAFTPEWLALNLLTYQMFSLGGLIGSVCWGEKTTGRSS 176 Query: 166 LRG----GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS---N 218 G G+ + TF+R L ++ +N+L+ L + +P W + Sbjct: 177 CDGPCAPGSMLHTFLRRDVLLDSIHVNLLSEEELHDYQQLGEGWQGRPLWERFPHGLDDA 236 Query: 219 ESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 +I ++ F+ + + L G G L + + + F Sbjct: 237 PAIRNATETFLGRMVPLTRAVLL---SRDGAGMVLG--DGLAFPSYTSPQRPFP------ 285 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 P V + L W Q++ + V + G AA++ + Sbjct: 286 -PEVTATVIAGGKKDVRFLLGAQPDKAIWRQLAALTVKRQGDGIGG--CAALI----HGH 338 Query: 339 PQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQ------YGNVINEIVTVGLGYKTAL 392 + +L++ G +QA +++ V Q Y + + G A+ Sbjct: 339 EEQGTDLVVCGLSRDQADVVDVLESVFHVPGAMFQTAGHTLYEGEVARAENIAGGLGDAV 398 Query: 393 RK------ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSE---LLIPDVLANVNFSQA 443 + + G + A R ++ E L+ D++ Sbjct: 399 ERYRRLVDGGWEARLKLAGPKKGGELARLKAQAFRCYWTSVETGVSLLWDMVRTCGSEAF 458 Query: 444 DEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGG 498 V L + + A + + R L LR+ G Sbjct: 459 VPVQRAWWAHLEKSARAA--YAAACGKDTERQMRAYVTGRRILAGRLRKYLEHDG 511 >UniRef50_C7LYW9 CRISPR-associated protein, Cse1 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW9_ACIFD Length = 540 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 82/541 (15%), Positives = 140/541 (25%), Gaps = 54/541 (9%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 +L + W+PVR R+G + ++ + + +E A L L++ + I Sbjct: 7 FDLSSEPWLPVRFRDGRRSEVSLRDIFVLAHTIVGFDVDFPTLEPALLRLVLALAYRILR 66 Query: 60 PAKDDVEFRHRIM-NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPM---E 115 KDD E+ + +ED A W F L E PF Q ++ + Sbjct: 67 GPKDDAEWGRLWEADRFSEDAIDDYFARWRHRFDLFSKEFPFFQVADLEPAGKGGVKTAN 126 Query: 116 KLLAGVSGATNCAFVN--QPGQGEALCGGCTAIALFNQANQAPGF---GGGFKSGLRGGT 170 L+A N AL A L + G ++GG Sbjct: 127 SLVAYAPSGNNVPVFTPITDRTELALSPAEAARWLVERHAFGSASDKTGAKGNPKVKGGK 186 Query: 171 --------PVTTFV--RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNES 220 FV G LR T+LLN++ ++ P W + Sbjct: 187 DTPAIGYLAWIGFVAPVGQTLRETLLLNLVPWQYRNLIRGG---EDDVPAWERDPLGPTR 243 Query: 221 IPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHP 280 + + G WQ I L G + P Sbjct: 244 VMRAPDGVCDLFTWQGRRIRLFPERR-------GDAIVVPRVLICAGDEVDRRAARDVDP 296 Query: 281 HSPCLVTVKKG-EVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP 339 H + ++G EV L W +S V+ + G + V ++ Sbjct: 297 HVGWRMESRRGAEVSYVPLRARPGQQVWRGLSSVLALGAEEQRAGVL--SFVEGLQSRGI 354 Query: 340 QSPLELIM---GGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKAL 396 L+ G + L Q+ + A + Sbjct: 355 ALVSLLVTSAKFGNMSTTLDDLAYDRLDTPLAVLNQEDPAAATVAIDAVTFAAHAAQALG 414 Query: 397 YTFAEGFKNKDFKGAGVSVHETA---------------ERHFYRQSELLIPDVLANVNFS 441 Y + + D S Y + + L + Sbjct: 415 YVAEARYLSYDLSFHEESKRHRVPEGKAALAKAARSALAEELYGRLDAPYRHFLTGLANI 474 Query: 442 QADEV-IADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPS 500 E A+ + + L S P + L + + PS Sbjct: 475 DDLERPRAEWAALVEAVARDL--ASRELAQLAPAQAFAGVAGEDRFRRMLARARNEFSPS 532 Query: 501 N 501 + Sbjct: 533 D 533 >UniRef50_D2RAZ9 CRISPR system CASCADE complex protein CasA n=3 Tax=Actinobacteria (class) RepID=D2RAZ9_GARVA Length = 556 Score = 210 bits (534), Expect = 1e-52, Method: Composition-based stats. Identities = 69/573 (12%), Positives = 152/573 (26%), Gaps = 97/573 (16%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NLL + WI V G +++++ ++ + + L+ + A L +L+ + + Sbjct: 4 FNLLDEPWISVIVDEKGHNKLVSITDVFKHASEYKALAGDMKTQDFALLRILLAVLHTVF 63 Query: 60 PAKDDVE----------------------FRHRIMNPLTEDEFQQLIAPWIDMFYLNHAE 97 D +R + D + + W D FYL + Sbjct: 64 SRYDIQGNSREFDSNEDNEYYFNKETMNIWREVWNSKEFPDAVFKYLEQWHDRFYLFDDK 123 Query: 98 HPFMQTK--GVKANDVTP----------MEKLLAGVSGATNCAFVNQPGQGE----ALCG 141 +PF+Q + + + + +L++ A + + +L Sbjct: 124 YPFLQVLKQDIDSKKLGGKSPSEISGKNINRLISE--SNNKVAVFSPKDNVDNNKSSLNE 181 Query: 142 GCTAIALFNQANQA-----PGFGGGFKSGLRG-----GTPVTTFVRGIDLRSTVLLNVLT 191 A + + A FG G +G G +V G +L T++LN + Sbjct: 182 AQLARWIITLQSYAGLADKTFFGTGKYKASKGWLFDLGG---IYVEGENLFETLMLNCVL 238 Query: 192 LPRLQKQFPNESHTENQPTWI---KPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIG 248 + +Q +P W N + I + Sbjct: 239 VGEMQSP-----EKRQKPCWEYSGAENIENSFYETFIDNISQLYTRWSRAIYINPD---- 289 Query: 249 KCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK-FLAFTTSAPSW 307 S S + + P + ++ ++ W Sbjct: 290 -ISIENPISVSIV-----KLPDINHKDAFIEPMTVWQYNKERENKDKYTPRKHKVEESMW 343 Query: 308 TQISRVVVDKIIQNENGNRVAAVVNQFRNIA---PQSPLELIMGGYRNNQASILERRHDV 364 + + + N ++ I+ S + L +++ + D Sbjct: 344 RSFGLLTLQESDDGILKNHKPGIMEWLNKISKDIEGSSISLQAVSMKDDGNATSWVPTDE 403 Query: 365 L-------MFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGF--KNKDFKGAGVSV 415 + + I ++A+ F K Sbjct: 404 ICDTLHIDEVVVTDNSDNGWVGRINNEVEYTRSAIGFIYRQFLLDICEIRNRNKDDATKY 463 Query: 416 HETAERHFYRQSELLIPDVLANV-NFSQADEVIADLRDKLH----QLCEMLFNQSVAPY- 469 + H Y + LAN+ +E A R+ LH + + + Sbjct: 464 ADKCISHVYFLVDQPFRQWLANIKPKDSMNERCAQWRNTLHNILINEAKGMLENATLRDF 523 Query: 470 ------AHHPKLISTLALARATLYKHLRELKPQ 496 + + A + L++L + Sbjct: 524 TGRPVVQSEKESTKNIVTAYSIFTSRLKKLSKK 556 >UniRef50_B8FDH6 CRISPR-associated protein, Cse1 family n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDH6_DESAA Length = 494 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. Identities = 78/522 (14%), Positives = 149/522 (28%), Gaps = 96/522 (18%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV R +++L+ ++ L ++A L + I Q Sbjct: 5 FNLVDEEWIPVAGRG-----LVSLRDVFTDPSLEALGG-NPLEKIALTKLFLAITQTAHT 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 D E+ + ++ + D F+L PFMQ VKA D+ + +L Sbjct: 59 PADTDEWLAMGAPGMASRA-REYLEAHKDCFWLYGD-RPFMQMPAVKAADIQNVSAVLPF 116 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGG---------GFKSGLRGGTP 171 V+ + + L A+ L + A G + + P Sbjct: 117 VATGNTTQVF-ESQKDRDLSDPEKALVLVFLSCFALGGKKVDAKIVLSPSYSEKSKTAKP 175 Query: 172 VTT---------FVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP 222 T F+ G L+ T+ LN+ TL + + P W K + + Sbjct: 176 GTCLGFQGFLHNFLVGGSLQETIWLNLFTLEEIGRLEQFPE-GLGVPPWEKMPEGEDCS- 233 Query: 223 ASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHP-- 280 + F L L F HP Sbjct: 234 ------LARSFKNSYMGRLLP----------------LCRFALFADENFHYVEGIFHPGY 271 Query: 281 ----HSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV---VNQ 333 + P + + ++ W +++ ++ + + + + Sbjct: 272 KDGAYDPSMAVDNTKKPRVLWV--DPEKRPWRELTSLLSFIQADSPRSFDCPQLRSGILK 329 Query: 334 FRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALR 393 RN P S ++ GG R + + +QY + ++ V + + Sbjct: 330 ARNGGPGS-FKIWSGGLR--------------VSSNAGEQYVSGADDFVESEIRLSSQWL 374 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDK 453 + + + +G SV+ ++ +F Q + + A + Sbjct: 375 GETWFASLKGEMDALEGLARSVYGSSLAYFKHQ-------------KADGKKQAAQASNL 421 Query: 454 LHQLCEMLFNQSVA------PYAHHPKLISTLALARATLYKH 489 QL E F V H AR Sbjct: 422 FWQLSERNFQDLVDVCGSEIDGGAHGMRPRFADCARTAFNTF 463 >UniRef50_C6CML6 CRISPR-associated protein, Cse1 family n=6 Tax=Gammaproteobacteria RepID=C6CML6_DICZE Length = 507 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 71/535 (13%), Positives = 151/535 (28%), Gaps = 88/535 (16%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV ++L+ ++ + L ++A LL+ I Q + Sbjct: 5 FNLIDEPWIPVADIGQ-----VSLKEIFSNPQLRALGG-NPVQKIALTKLLLAIAQSAST 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 DD ++R + E+ + W D FYL + PF+Q ++ +V + L Sbjct: 59 PIDDNDWRQTGWQGMAENCLS-YLEKWHDRFYLYGEK-PFLQMPAIQTAEVKSLGVLSPE 116 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFG--------------------- 159 +S + Q + AI + Q Sbjct: 117 ISTGNTTVL-TETQQEQRSYDADKAITVIVQMGFGLSGKKTDNSVVLTAGYQGKQNDKGK 175 Query: 160 -GGFKSGLRGG--TPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPI- 215 K+G+ G + +F G + +V LN+ T + + + W + Sbjct: 176 PASGKAGIAVGHMGLLHSFWLGDSIVHSVWLNLFTTEDITELVMYPTL--GVAPWEQMPT 233 Query: 216 -KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVN 274 + ++ A + L L D + Sbjct: 234 GEDDDIAQALKTSLIGRLIPMGKFCLLADDG----------------IHYSDGIAHAGYL 277 Query: 275 GLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQF 334 P + KK + S W +++ ++ G + Sbjct: 278 EGKADPTASVDFAQKKPKALW----VNPSKRPWRELTSLLQFIEQGKVGGFDTPQLKRTL 333 Query: 335 RNIAP-QSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALR 393 + ++ L GG R + + +QY + ++ V + + L Sbjct: 334 KRVSRSAEQFALWSGGLR--------------VSSNAGEQYASGTDDYVQSEIWLSSHLL 379 Query: 394 KALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQA-------DEV 446 +++ + + + R+F L++++ S Sbjct: 380 GSVFLEYLKHEMSQLEAIQKQLWGAVVRYF---------RQLSDIDKSGTGKAQPFVSNQ 430 Query: 447 IADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSN 501 QLCE + + T + ++ Q P++ Sbjct: 431 AEKATSTFWQLCERRAQVLINACDSTDEAKRQRIQLHKTFAAYAIQVFDQMCPND 485 >UniRef50_Q47PJ1 CRISPR-associated protein, Cse1 family n=1 Tax=Thermobifida fusca YX RepID=Q47PJ1_THEFY Length = 549 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 66/551 (11%), Positives = 149/551 (27%), Gaps = 75/551 (13%) Query: 1 MNLLIDNWIPVRPRN--GGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 ++ I W+ R R+ + L S + + +P +L I I Sbjct: 20 FDVTIAPWLIARSRDVLAAPEMLGLRDVLIRSHELSDVEIPLPPGAAVLWRILALITARI 79 Query: 59 APAKDD------VEFRHRIMNPL-----TEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK 107 +++ R L + A + + F L H E P++Q ++ Sbjct: 80 TGLDQPPNKNPKRKWQARRSQILSKGRLDPEAVDAYFADYSERFDLFHPERPWLQDPRLR 139 Query: 108 AN--DVTPMEKLLAGVSGATNCAFV---NQPGQGEALCGGCTAIALFNQANQAPGFGGGF 162 + + KL G + N ++ + L L P Sbjct: 140 EECPKTSGVNKLAWGRTAGENQVWLGGHHHDLDPHPLDSAEAVWHLLATLGYGPSGMCTA 199 Query: 163 KSGLRG-------GTPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTW 211 + +RG P+ V G L +++LN+ + + W Sbjct: 200 RV-VRGRSERNVTAGPLRGTVSYHPLGRTLFESLILNIPYP---------GTGAADLAFW 249 Query: 212 IKPIKSNE-SIPASSIGFVRGL-FWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKF 269 +P ++ +P S G L H L P G + ++ + Sbjct: 250 EQPELNDPLGLPEESAGLAGILRLDHFRHAVLLHPSPDG---------SHVVDAWVTWAW 300 Query: 270 TFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA 329 P+ + E W + ++ N + Sbjct: 301 RERNISPELDPYLIYQTSK---EGRVYPRPAEAERAIWRDLDALLHYGEDGNYRPTILDN 357 Query: 330 VVNQFRNIAP-QSPLELIMGGYRNN-QASILERRHDVLMFNQGWQQYGN----------- 376 + L L G+ + QA + W Sbjct: 358 CTPLAQVPQEVLDSLRLRAFGFDQDGQARDKQWFTATTPAVLRWLADRETDDNENARIVR 417 Query: 377 ----VINEIVTVGLGYKTALRKALY-----TFAEGFKNKDFKGAGVSVHETAERHFYRQS 427 +G + A ++A + N + + ++ ++ Sbjct: 418 RITLARKAAEALGRRLEKACKEAWKESNSPSSTSSGTNAKTETGVGPWVQHGMSRYWAKA 477 Query: 428 ELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLY 487 E + +++ + +A + + + +++ PY P++ + R+TL+ Sbjct: 478 EPVFWNIVYDRPAQGYTPGMAGPGNAFNLVALAAYDEVTGPYCERPRVAKVVERHRSTLF 537 Query: 488 KHLRELKPQGG 498 + + + Sbjct: 538 SNWTPKQDKEA 548 >UniRef50_UPI0001AEDDCB hypothetical protein SalbJ_26479 n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AEDDCB Length = 509 Score = 208 bits (528), Expect = 5e-52, Method: Composition-based stats. Identities = 57/519 (10%), Positives = 136/519 (26%), Gaps = 70/519 (13%) Query: 22 INLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKD--DVEFRHRIMNPLTE-- 77 +R+ L +P A L +L + I + + R L Sbjct: 2 SLRSLFLRAREIRTLLIPEAPTHSALLRVLYALTARITALDEAGPGSWGDRREEVLERGF 61 Query: 78 ------------DEFQQLIAPWIDMFYLNHAEHPFMQTKGVKA----NDVTPMEKLLAGV 121 W F L +E P++Q + + + KL Sbjct: 62 CAESFELPDGRKAGIGGYFDGWAHRFDLFDSERPWLQDPRLPDQCDRSQTAGLHKLAMSR 121 Query: 122 SGATNCAFVNQPGQGEALCG--GCTAIALFNQANQAPGFGGGFKSGLRGG---------- 169 S N ++ G + + A++L G ++ + Sbjct: 122 SAGNNHSWFGHRGDDKLVLPTVSQAALSLLTWHYWGSPGGLSRRAVGQVSHHYAKASPLR 181 Query: 170 TPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP-ASSIGF 228 ++ +L T+L + + S + W + + P +G Sbjct: 182 GALSYHPECDNLFLTLLAGLTPPD------GDVSRQTDLCPWEREDVPDPLAPMPEPLGP 235 Query: 229 VRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTV 288 L H P G+ ++ T + P L+ Sbjct: 236 CSRLTACSQHALYLVPADDGE-----HAADAYITWA--------YHAERLRPEDDYLIWD 282 Query: 289 KKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMG 348 + E +S W + +++ ++ ++ V++ +++ + + Sbjct: 283 IGKDGETSPRLARSSRSLWRDVDALLLKQL--DDASPIQPKVMDHAFDVSEYLRVRALGF 340 Query: 349 GYRNNQASILERRHDVLMFNQGWQQYGNVINEI---------VTVGLGYKTALRKALYTF 399 +Q++ + + V +++ G + A ++A T+ Sbjct: 341 EQDTSQSANYQYVDSTTPVLLSRVEEDVVTSDLPVRQLRELGELFGGRLEHATKEAWLTY 400 Query: 400 AEGFKN---KDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQ 456 + KN AE F+ + + + Sbjct: 401 TDDKKNSPGAWLDAVAARYWPAAEDEFWSA----FRKLSRSDAAVDPAFDFDAACRAFGR 456 Query: 457 LCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 + P+ + + A+A + LR+ + Sbjct: 457 HALDAYEAVTDSVLRTPRGVKAVTGAKAIILAALRDPRK 495 >UniRef50_Q8KB26 CRISPR-associated protein, CT1972 family n=1 Tax=Chlorobaculum tepidum RepID=Q8KB26_CHLTE Length = 530 Score = 207 bits (527), Expect = 6e-52, Method: Composition-based stats. Identities = 75/522 (14%), Positives = 144/522 (27%), Gaps = 88/522 (16%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIP + +++L ++ L ++A LL+ IGQ Sbjct: 5 FNLIDEPWIPAIGKG-----LVSLADIFSDPRIPALGG-NPVQKIALTKLLLAIGQAACT 58 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKG-VKANDVTPMEKLLA 119 + + + W D F+L + PF+Q + + +L+ Sbjct: 59 PETTEALEQL-DAETFRRACRAYLEKWRDRFWLFGDK-PFLQMPAILDWMESQRAAGILS 116 Query: 120 GVSGAT-------------NCAFVNQPGQGEALCGGCTAIALFNQANQAPGF-------- 158 A N + ++Q +A A+ + + N A G Sbjct: 117 ETENAKQIGPGFYPSLPSENDSILSQFQTLKAQTDAEKALFIVSVMNFAFGGTQINKNIY 176 Query: 159 -------GGGFK----SGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTEN 207 G G L + TF+ G + T+++N+L+ + P Sbjct: 177 PSEEKVKGKGKPAKPGPSLGRNGYLHTFLFGSTIIDTLIMNLLSQEEIDN-LPFWEKGIG 235 Query: 208 QPTWIKPIKSNESIPASSI--GFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFL 265 P W S E A S+ ++ L + L D G G Sbjct: 236 TPPWENMPVSRECDAALSLKKSYMGTLVSLSRFVLLHDD---GIYYIDGLPYPSH----- 287 Query: 266 KEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGN 325 W P +T+ + K + W ++ ++ N Sbjct: 288 --------QEGWLEP----SMTIDNQQNPPKAILVNPEKRPWRELVSILAVFDSNKNNKF 335 Query: 326 RVAAVVNQF-----RNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINE 380 + R P + + GG + + F G +QY N+ Sbjct: 336 VCLFIKYGLSRWPKRYNKPGDKIGVWSGGLQ-------------VSFQTG-EQYAKATND 381 Query: 381 IVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNF 440 V + + + + + + + P Sbjct: 382 FVESSVELDPDM---WNNLWYDKFFGEISILEI-MANKVKNGVINYYDSFEPKKEKKPKE 437 Query: 441 SQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALA 482 + + + QLCE F + V P + + A Sbjct: 438 RASTIMGKKAVELFWQLCERRFPELVD-ACGEPDKLPAIHEA 478 >UniRef50_UPI0001AF1D49 CRISPR-associated Cse1 family protein n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D49 Length = 531 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 67/498 (13%), Positives = 134/498 (26%), Gaps = 47/498 (9%) Query: 22 INLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIM----NPLTE 77 Q+L + RL M A LL+ + + D + L + Sbjct: 2 GLAQALGQAARYRRLVGSTPTMTAALHRLLLALAHRVYRPLDGARWAELWRAREKEGLPQ 61 Query: 78 DEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGE 137 E + F L PF Q + N+ KL+A + +N + + Sbjct: 62 PELATYQDKFWSRFELFDPSRPFFQCPAL-DNEPGSTAKLVAHRATGSNRTLFDHTTADQ 120 Query: 138 A--LCGGCTAIALFNQANQAPGF-GGGFKSGLRGGTPV-----TTFVRGIDLRSTVLLNV 189 L A L +++ + V G L T+LLN+ Sbjct: 121 RPLLQPAEAARWLVTTQAYDTSGTKQPYRTERSAEGGLGNRFGCVLVEGASLHETLLLNM 180 Query: 190 LTLPRLQKQFPNESHTENQPTWIKPIKSNE-SIPASSIGFVRGLFWQPAHIELCDPIGIG 248 L + + + P + ++P W + + +G+ L W I L + G Sbjct: 181 -QLYQPEAELPPRTTARDRPVWEASQPPDPHPDARAPLGWTDLLTWPSRRILLSTTVASG 239 Query: 249 KCSCCGQESNLRYT------------GFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK 296 G + + G + L V + E Sbjct: 240 ATLVDGVVLTPGTRMEGDLIDWEAMAAYRRPWLKGNKQGDFRAVTLDELRGVWRHSQELL 299 Query: 297 FLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV-----VNQFRNIAPQSPLELIMGG-- 349 + SW R Q R AA+ + + +IA + L + G Sbjct: 300 LSSDPRWWNSWRGRLRAKGPLPAQEPQRQRPAALDHIADLVEDDHIAEDTVYTLRIFGQQ 359 Query: 350 YRNNQASILERRHDVLMFNQGWQQYGNV-----INEIVTVGLGYKTALRKALYTFAEG-- 402 + + + + + I V++ L+ ++ Sbjct: 360 LGDQGGDTYAWYEEAVPAPVALLRAESARVGYIIGYAVSLANDLGEQLKLMERQYSADFH 419 Query: 403 ---FKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEV-IADLRDKLHQLC 458 K++D K + +H + ++ V ++ + L Sbjct: 420 RELTKDQDKKPTDLEIH--YWPRLAAPFATFLRELGEAVRLGASETAPAERWGQAVSDLA 477 Query: 459 EMLFNQSVAPYAHHPKLI 476 + Q + + + Sbjct: 478 DKTAWQWLRGAPRRDRSL 495 >UniRef50_D0WFD1 CRISPR-associated protein, Cse1 family n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFD1_9ACTN Length = 547 Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats. Identities = 69/508 (13%), Positives = 142/508 (27%), Gaps = 61/508 (12%) Query: 16 GGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVE------FRH 69 G ++ C+ + L+ ++A L LL+ I Q D + + Sbjct: 3 GISRELSLWDLFTCAGELKCLANDLPTQDIAILRLLLAILQRSLSPSLDEDDDPAEVWAK 62 Query: 70 RIM-NPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-------KANDVTP-MEKLLAG 120 + L +E + W F L PFMQ G+ K D P +++++A Sbjct: 63 LWEADTLPVEEIHSYLEKWRHRFDLLDNNEPFMQIAGLVRSNDAIKDEDGEPYLKRVIAD 122 Query: 121 VSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGF----------------GGGF 162 V N + L A L + + G G + Sbjct: 123 VPSRRNRRLFSVRMGEGINRLSYAEAARWLIHVHSFDTGGPKNAAKGDSSDVIKKEGRSY 182 Query: 163 KSGLRGGTPVTTF-VRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK-SNES 220 G + G ++ T++LN + L R + + + P W + + ++ Sbjct: 183 PGGTGWLGRIGCLYFEGSTIKETLILNFVPLYRNEIDSLFPEN--DLPIWERRQRCVSDG 240 Query: 221 IPAS--SIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 G WQ + G + + + E +T W Sbjct: 241 HEPRVLPDGRADLYTWQSRWV--NFSHEDGMITNVVLSAGDLLSVEAAELYTVENMTSWK 298 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 K + + F + W ++ + ++ R+++ Sbjct: 299 E----GTSKSKPSAPKLIPMHFDSDKALWRGLNAIFAQNCSNKNPACHLSGTAVWLRHLS 354 Query: 339 PQSPLELIM-----------GGYRNNQASIL-ERRHDVLMFNQGWQQY--GNVINEIVTV 384 + I Y ++Q+S D L + ++N Sbjct: 355 SNNGGRAISKEYLLNVHAVDFKYDDSQSSSYHSMVDDKLEMSSYLLSPEGAPLVNFACGC 414 Query: 385 GLGYKTALRKALYTF-AEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFS-Q 442 K A++ + + V E AE + + + L + + Sbjct: 415 VEKTKEAVKTLGDLAVNLYLASGGDSASSSGVREAAESRAFFEIDRLFRVWFSALGSQTN 474 Query: 443 ADEVIADLRDKLHQLCEMLFNQSVAPYA 470 A+ A +L + L + + Sbjct: 475 AEAARAGWYAQLRDVLMRLAQELIGEAG 502 >UniRef50_B0LU91 CRISPR-associated protein Cas1 n=2 Tax=Streptomyces RepID=B0LU91_9ACTO Length = 540 Score = 203 bits (515), Expect = 2e-50, Method: Composition-based stats. Identities = 66/519 (12%), Positives = 135/519 (26%), Gaps = 75/519 (14%) Query: 24 LQSLYCSRDQWRLSLPRDDMELAALA--LLVCIGQIIAPAKDDVEFRHRIMNP-LTEDE- 79 + L + + + A LL + + KD + + ++ Sbjct: 2 RELLLNAEKFADIVVDLPTQRPAVFRQVLLPLVVDALGCPKDAEAWMDMFRAGAFSPEQR 61 Query: 80 --FQQLIAPWIDMFYLNHAEHPFMQTKGVKA--NDVTPMEKLLAGVSGATNCAFVNQP-- 133 + +F L PF Q ++ + L+A + N + Sbjct: 62 QLLADYLDKHQHLFGLLDPVEPFGQVADLRTAKGETKGSALLVATAATGNNVPLFSSRTE 121 Query: 134 GQGEALCGGCTAIALFNQANQAPGF---GGGFKSGLRGG-------TPV----TTFVRGI 179 G L A L + G ++ G P+ T G Sbjct: 122 GDVLELTPAEAARWLLHTHCWDTAAIKTGAVGDPMVKSGKTTGNPTGPLGQLGVTMPVGS 181 Query: 180 DLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIP-----------ASSIGF 228 L T+LLN+ + +++ P W + + + G Sbjct: 182 TLFETLLLNI--------PYGQAGLSDDVPQWRRRSTQGDVKDTLSCATPVWQSRPARGL 233 Query: 229 VRGLFWQPAHIELC---DPIGIGKCSCCGQESNLRYTGFLKEKFTFTV--NGLWPHPHSP 283 + WQ I L G + E T V + SP Sbjct: 234 LEAWTWQARRIRLISQDTDRGPRITRVLVSAGDRLEVSPDTEPHTAWVVDSPAGRRGKSP 293 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNEN-------GNRVAAVVNQF-- 334 VK + T W + ++ + + G + +V Q Sbjct: 294 ARSGVK----SARPRRHTAGRAGWRGLDALLAVNAVDQDQQATATRSGAVSSQLVRQLST 349 Query: 335 --RNIAPQSPLELIMGG--YRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKT 390 R + + PL + + G Y N A I + D + ++ + + Sbjct: 350 ISRRLPSRYPLRVELTGIAYGNQSAVIEDMYFDEIPLPVAALDPEGIVYGALLEVVDQAE 409 Query: 391 ALRKALYTFAEGFKNKDFKGAGVSVHETAER---HFYRQSELLIPDVLANV-----NFSQ 442 L KA+ + + + +R + ++ +LA + +F + Sbjct: 410 DLAKAVNHLSGDLRRAAGSEPIP--WDKGQRPGDTLLHALDPIVRRLLAGLRQAGDDFDR 467 Query: 443 ADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLAL 481 ++ + K Q + + Sbjct: 468 CEQGLEAWEHKAGQATLRVAEGLFNSAPAALFTGRRVKK 506 >UniRef50_B5GY59 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY59_STRCL Length = 594 Score = 200 bits (507), Expect = 1e-49, Method: Composition-based stats. Identities = 66/474 (13%), Positives = 129/474 (27%), Gaps = 52/474 (10%) Query: 49 ALLVCIGQIIAPAKDDVEFRHRIMNPL-----TEDEFQQLIAPWIDMFYLNHAEHPFMQT 103 LL + + + E+ P + + + D F L PF Q Sbjct: 1 MLLPVVVDALGFPETPEEWAEHFHAPDGFTGQAAERLTEYLDEHRDRFGLFDPVDPFAQV 60 Query: 104 KGVKAN--DVTPMEKLLAGVSGATNCAFVNQP--GQGEALCGGCTAIALFNQANQAPGF- 158 G++ + ++A + N F + GQ L G A L + PG Sbjct: 61 GGLRTGKDETRNSALIVATAASGNNVPFWSARTDGQAPRLSPGRAAHWLLHTHCWDPGAI 120 Query: 159 --GGGFKSGLRGG-------TPV----TTFVRGIDLRSTVLLNV-LTLPRLQKQFPNESH 204 G R G P+ G L ++ LNV + RL P Sbjct: 121 KTGAFGDPRARAGKVMGNPTGPLGALGLVLPMGRTLYESLWLNVPFGVTRLAGDLPQWRR 180 Query: 205 TENQPTWI--KPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYT 262 + + + + G + WQ I L G+ R Sbjct: 181 RDREGPVEETRSTATPGWDSRPPRGPLDAWTWQARRIRLVPETAPQDADGDGEPEVNRVV 240 Query: 263 GFLKEKFTFT-----VNGLWPHPHSPCLVTVKK--GEVEEKFLAFTTSAPSWTQISRVVV 315 ++ + + K ++ + +W + ++ Sbjct: 241 VAAGDRLRLQPDHEFHTAWTVDSQTVHRKRLAKDPDALQIRPRRHRAGRAAWRGLDALLA 300 Query: 316 DK-------IIQNENGNRVAAVVNQF----RNIAPQSPLELIMGGYRNNQA--SILERRH 362 + + G A ++ + + P PL L + G N +I + H Sbjct: 301 VEGSTWQQDATEVGQGFHTAQILVKLAEAGAELPPDYPLRLELTGIAYNSKFSAIEDTFH 360 Query: 363 DVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKN--KDFKGAGVSVHETAE 420 D L + ++ + + L A+ + G E Sbjct: 361 DELPLPVAALRRDGLVRAALIGAVAQAERLADAVNRLVADLRRAAGARPVPGGEWQHPGE 420 Query: 421 RHFYRQ----SELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYA 470 + LL ++ +F + DE++ +K + + +A Sbjct: 421 SLLHALDPVVRLLLRLLRTSDEDFDRVDELLRAWEEKAGRETWKVAEHLLAQSP 474 >UniRef50_C2BEU1 CRISPR-associated protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BEU1_9FIRM Length = 562 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 65/576 (11%), Positives = 141/576 (24%), Gaps = 103/576 (17%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYC-SRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ + WI V G +++ L+ + + L+ A + L+ I + Sbjct: 4 FNLIDEPWISVVTDYKGTTKLVGLREFFQNCHNYLELAGEMPTQNFAVMRFLLAILHTVF 63 Query: 60 PAKDD---------------------------------VEFRHRIMNPLTEDEFQQLIAP 86 D + + + + Sbjct: 64 SRYDANGKPYEMVTINEKMQQVENVDEEYEEDYEDALMETWESLWKSGKFPEIVTDYLEC 123 Query: 87 WIDMFYLNHAEHPFMQT-----KGVKANDVTP-------MEKLLAGVSGATNCAFVNQ-- 132 W D FYL +PF Q K + P + +L++ A + Sbjct: 124 WHDRFYLFDDNYPFYQVTKEEISESKISKTNPSEILGKNINRLVSE--SGNKIALFSPKY 181 Query: 133 --PGQGEALCGGCTAIALFNQANQAPGFGG---GFKSGLRGGTPVT----TFVRGIDLRS 183 E L L + + A KS + F+ +L Sbjct: 182 SSDDNKEILDYDEVVRWLISFQSYASLSDKVRFSNKSYKASKGWLFDLGGVFLSSDNLYK 241 Query: 184 TVLLNVLTLPRLQKQFPNESHTENQPTWI-KPIKSNESIPASSI--GFVRGLFWQPAHIE 240 T++LN++ + + P W KP + + + + I Sbjct: 242 TMVLNLVLVNTSNTDYN---TNIQNPVWEYKPSEVVKKYMSDNPINNTAELYTAYSRAI- 297 Query: 241 LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKK-GEVEEKFLA 299 +S + + + + + P + + + Sbjct: 298 ----------YISDFDSEKPFKMGIVKLPEVLHSNNFLEPMTVWRYNKDGTNKGDFTPRK 347 Query: 300 FTTSAPSWTQISRVVVDKIIQNENG---NRVAAVVNQFRNI-----APQSPLELIMGGYR 351 + + + N R +++ ++ + I Sbjct: 348 HQLNKSLRRSFGLITETEDANEGNENTAKRKPGIIDWLNDVNDYIGDEFVKINAISMEDD 407 Query: 352 NNQAS-------ILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFK 404 N S + + + N + I I V K + K +F K Sbjct: 408 GNAMSWVPTNEIVDSIYIEANLVND--LEDKGWIFRINKVVDKTKYIVDKIFRSFVNDTK 465 Query: 405 NKDFKGAGVSVHETAERHFYRQSELLIPDVLANVN-FSQADEVIADLRDKLHQLCEMLFN 463 Y + + D L +++ D+ I D +L ++ Sbjct: 466 K-IRNIESNEYVSRYSESLYYELDKPFRDWLISIDYSDNKDKKIDDWYKELKKISIKQAE 524 Query: 464 QSV-APYAHH------PKLISTLALARATLYKHLRE 492 + V + +A A + + Sbjct: 525 KIVADSGPRDYTGIIENDSVRNIATAFNFFMARINK 560 >UniRef50_C6HV92 CRISPR-associated protein, Cas1 n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HV92_9BACT Length = 498 Score = 197 bits (501), Expect = 7e-49, Method: Composition-based stats. Identities = 71/527 (13%), Positives = 153/527 (29%), Gaps = 71/527 (13%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV +++L+ ++ L ++A L+ I Q A Sbjct: 7 FNLIDEPWIPVADAG-----LVSLKDVFLRDSLRALGG-NPVQKIAMTKFLLAIAQAAAT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG 120 ++D E+ L E + W D F+L + PF+Q +GV+ + +L Sbjct: 61 PENDDEWATMGPKGLAERCLS-YLEKWHDRFFLFGEQ-PFLQMEGVRTAALQSFGAVLPE 118 Query: 121 VSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGG----------------GFKS 164 +S Q + L A+ + G K Sbjct: 119 ISTGNTSLLF-QSQIEKKLSEADKALISIQLSGFGLGGKADNSLVLTPGYGGKRNPKGKP 177 Query: 165 GLRGGTPV-------TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS 217 + +F G +L T+ +N+ + ++ S P W + Sbjct: 178 SVSKSGAWVGYKGYLHSFFFGDNLLKTLWVNLFSRSQIGLMSVYPS-GVGTPPWELMPEG 236 Query: 218 NESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 + A + L + L +E L YT + Sbjct: 237 EDCPVAKRL----KLSLMGRLVPLSG-------FFLFEEDGLHYTEGIAYP---DYKEGG 282 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI 337 P + K + W I+ ++ + G + Sbjct: 283 VDPSVAIDDSGKDPKALW----VDPEKRPWRSITALLGFLAGRASKGFDCWQLRFNLEKA 338 Query: 338 A-PQSPLELIMGGYR--NNQASIL-----ERRHDVLMFNQGWQQYG--NVINEIVTVGLG 387 + + + GG R +N + + ++ ++ + Sbjct: 339 SNHLKTVGIWSGGLRISSNSGKVYIGGSDDFVESLVELPGNLLDKNWFARLSLEIEEMEN 398 Query: 388 YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVI 447 + + + + + + + K A +V F++ E +++ S E Sbjct: 399 LSKIVFQTVSNYFKDQRMTEPKQAKNAVQ-----LFWQLCEQQFNELVEACESS---ESA 450 Query: 448 ADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELK 494 LR ++ E +++ + + + R L +L +K Sbjct: 451 KSLRPTFVRIVEKVYD--LNCPRETARQVEAWVRNRPNLASYLSRMK 495 >UniRef50_C6SPI8 Putative uncharacterized protein n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPI8_STRMN Length = 572 Score = 196 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 78/566 (13%), Positives = 151/566 (26%), Gaps = 117/566 (20%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYC-SRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NLL + WI V G + ++L + + L+ + A L +L+ + + Sbjct: 4 FNLLDEPWISVVFDEKGSTKEVSLLDFFQNAHHYKDLAGDTKTQDFAVLRVLLAVLHTVF 63 Query: 60 PAKDDV---------------------------------EFRHRIMNPLTEDEFQQLIAP 86 D + N D ++ + Sbjct: 64 SRFDANGNAYGYLEIDEKYRQIEEIEEDDLEEYEDDLYETWLTLWQNRQFPDIIEEYLKK 123 Query: 87 WIDMFYLNHAEHPFMQ----------TKGVKANDVTP--MEKLLAGVSGATNCAFV---- 130 W D FYL E+PF Q A + + +L++ + A Sbjct: 124 WRDRFYLFDEEYPFFQVRKEDIEMVMDLNKDAGKIFGKNINRLVSE--SSNKIALFSPKH 181 Query: 131 NQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLR--GGTPVT----TFVRGIDLRST 184 N E L L + G K G R + F++G +L T Sbjct: 182 NYDNNKERLSNSEIVRWLLTYHGYSEIGGRMKKIGKRDYSKGWLYNLGGLFLKGKNLYET 241 Query: 185 VLLNVLTLPRLQKQFPNESHTENQPTWIKP---IKSNESIPASSIGFVRGLFWQPAHIEL 241 +LLN+ + +P W I + + + Sbjct: 242 LLLNLTLFYFEY----DNHLHIQKPCWEFDSAVIIDSYVSGKRIDNLASLYTSWSKEVYI 297 Query: 242 CDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFT 301 + + + + N L+ P + K E + + Sbjct: 298 DP-----------HVVQPEFECRVAKIPAISPNDLFLEPMTLW----KYEENSFQPQSHY 342 Query: 302 TSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA---PQSPLELIMGGYRNN----Q 354 T+ W + + E N + +++ R I + + + G ++ Sbjct: 343 TNQSLWR---SFGLITMGGMEKLNHIPGIIDWQRTIKDNIENASINICSVGMVSDKTANN 399 Query: 355 ASILERRHDVLMFNQGWQ---QYGNVINEIVTVGLGYKTALRKALYTFAEGF-------K 404 I+E D L N+ Q + I K + F + + Sbjct: 400 TPIIEVF-DTLSINEFVLTDIQKDGWVIRINDEVDRVKKVISHTYAYFVKDVIVIRKHIE 458 Query: 405 NKDFKGAGVS------VHETAE-------RHFYRQSELLIPDVLANVN-FSQADEVIADL 450 K+ S + E Y + + L+N+ D I + Sbjct: 459 EKEIPKLKKSILRGSLFRQDYEKQISNRIEELYFKIDQPFRQWLSNIQPGDDKDSKILEW 518 Query: 451 RDKLHQLCEMLFNQSVAPYAHHPKLI 476 R+ L ++ Y +P+ Sbjct: 519 REILEKIVLK--EAKSLLYEGNPRDY 542 >UniRef50_B3ENH5 CRISPR-associated protein, Cse1 family n=2 Tax=Chlorobiaceae RepID=B3ENH5_CHLPB Length = 529 Score = 195 bits (494), Expect = 4e-48, Method: Composition-based stats. Identities = 77/544 (14%), Positives = 155/544 (28%), Gaps = 107/544 (19%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WIPV + I+L+ ++ D L +LA LL+ I Q Sbjct: 7 FNLIDEPWIPVIDKG-----RISLRQVFSEPDNRALGG-NPLQKLALTKLLLAIAQATCT 60 Query: 61 AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGV-------------- 106 ++D + L + W D F+L + PF+Q + Sbjct: 61 PENDEIHASMESSELARKSID-YLDKWYDRFWLYGEK-PFLQMSAIHGLIEQRKRKYLNA 118 Query: 107 -------KANDVTPMEKLL-----AGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQ 154 + +V + K L + N +AL A+ + + N Sbjct: 119 AKKDSVRRTAEVNALPKSLGMGFYPDMPSENNTILTQY-QIPKALADSDKALFIVSLMNF 177 Query: 155 APGF-------------GGGFKSGLRGGTP--------VTTFVRGIDLRSTVLLNVLTLP 193 A G G K+ P + + + G L T+L+N+L+ Sbjct: 178 ALGGKRVEKNLDNQLMLGYAGKTPSAKSAPSLGNYIGYLHSILVGETLADTLLINLLSHE 237 Query: 194 RLQKQFPNESHTENQPTWIKPIKS--NESIPASSIGFVRGLFWQPAHIELCDPIGIGKCS 251 R+Q + W + ++ L + L G G Sbjct: 238 RIQANV-YWKSGLGKAPWEEMPTGVACPIATNLKSSYMATLVAMSRFVLL---QGDGIYY 293 Query: 252 CCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQIS 311 G + W P +++ + W ++ Sbjct: 294 VEGIQYPSH-------------KEGWREPSMAVNAQAATPKIKW----IDPNKRPWRELV 336 Query: 312 RVVVDKIIQNENGNRVAAVVNQFRNIAPQ-SPLELIMGGYRNNQASILERRHDVLMFNQG 370 ++ G + +N + + + GG R + N+ Sbjct: 337 SLLAFMDGGGSQGYECQFIKYGLKNFGDRFKRIGVWSGGLR-------------VSTNRC 383 Query: 371 WQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQ---S 427 Q + + ++ + + Y + FK V++++ ++ F Sbjct: 384 DQSVKQDNDFVESLVFLESKIIGQLWY--------QQFKLEMVALNKISDTIFTATVAYY 435 Query: 428 ELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVA--PYAHHPKLISTLALARAT 485 + L+ + + +LCE F + V + K + + A+A Sbjct: 436 QSLVEKMDKKKAIKSFKNIADKATSLFWELCERHFQELVDACEPPYETKK-TRIVFAQAA 494 Query: 486 LYKH 489 L Sbjct: 495 LKAF 498 >UniRef50_B5GA97 Crispr-associated protein n=1 Tax=Streptomyces sp. SPB74 RepID=B5GA97_9ACTO Length = 534 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. Identities = 68/532 (12%), Positives = 143/532 (26%), Gaps = 74/532 (13%) Query: 2 NLLIDNWIPVRPRNGGKVQII---NLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQII 58 +L+ + PVR + + + L + L++ + A L +L + + Sbjct: 38 SLVTGEFFPVRLVDAADTVPVKYGLRRLLVEAGSIASLAVTPPPAQAALLRILYVVTARV 97 Query: 59 A----PAKDDVEFRHRI----MNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKAN- 109 + PA + R L + +A + L A PF+Q + Sbjct: 98 SGLDRPAASPSAWLDRRDDVAEEGLDPERVDAYLAEHAERLRLFGA-RPFLQDPRLAEEC 156 Query: 110 -DVTPMEKLLAGVSGATNCAFVNQPGQGEA--LCGGCTAIALFNQANQAPGFGGGFKSGL 166 + KL+ G +N + + + + L ++ Sbjct: 157 SKKAGVNKLVFGRPAGSNQVWFGHHRDADPRPVPADEALLHLLMWLYYGAAGRCSTRTVG 216 Query: 167 RGG----------TPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK 216 ++ G L T++ + + ++ W P Sbjct: 217 SVSAADSRSGPLRGSLSYHPEGPTLLHTLVAGIP------RPGNGTDPATDRCPWELPEL 270 Query: 217 SNESIPAS-SIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNG 275 + P++ ++G + L H L G + T ++K Sbjct: 271 PDPLHPSTANVGPMSQLTAGWQHALLLQEGA-----RPGTVDDAYITWAARDKL------ 319 Query: 276 LWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAA---VVN 332 P P P LV ++ W I +++ + R AA V+ Sbjct: 320 --PRPEDPFLVLQLSQAGNIYARRAKSARALWRDIDALLIQEPYGTAKPRRPAAFHGVLE 377 Query: 333 QFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTV-------- 384 + + + I ++ ++ + Sbjct: 378 LDPEGGGPLRVRALGFEQDDRTKDIQYISGTTPPVLDLIEEREPRLSARLRTMRVAGELY 437 Query: 385 GLGYKTALRKALYTFAEGFK--NKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ 442 G A+R+A + A V AER F+R+ Sbjct: 438 GRRLDFAVRQAWRELVNDSDAVGPWGELAAVDYWPAAEREFWRRV--------------G 483 Query: 443 ADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELK 494 A ++ + L F++ A + + AR + R+ K Sbjct: 484 ARDMDQPWQS-FRLLAYRAFDKVTASAPPTMRAARAVQRARLAVAGGRRKKK 534 >UniRef50_Q03C63 CRISPR-associated protein n=1 Tax=Lactobacillus casei ATCC 334 RepID=Q03C63_LACC3 Length = 569 Score = 184 bits (466), Expect = 9e-45, Method: Composition-based stats. Identities = 77/541 (14%), Positives = 144/541 (26%), Gaps = 110/541 (20%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLY-CSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NL+ D WI V + + ++LQ+L+ S D RL+ +LA + LL+ I + Sbjct: 6 FNLVTDPWIKVIRAADYRSEEVSLQTLFQQSSDYLRLAGETQSQDLAIMRLLLAILHTVY 65 Query: 60 PAKDD--------------------------------VEFRHRIMNPLTEDEFQQLIAPW 87 D + + +A + Sbjct: 66 SRFDATGEPYEWLTIDLESLQVAEAVEQDDYEPDDLFETWDALHQLGHFSAIVIEYLARY 125 Query: 88 IDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAG--------------VSGATNCAFVNQP 133 D F PF Q + + + P +K +A + A Sbjct: 126 QDRFDFFGE-RPFYQATQSEYDVLVPEKKKVATGSGTVAIRQINRTISESGNSPALFAPR 184 Query: 134 GQG--EALCGGCTAIALFNQANQAPGFGGGFKSGL-------RGGTPVT----TFVRGID 180 + L + N G K+ + + F G Sbjct: 185 SDAGKDTLGMAELVRWVITYQNYT---GVTDKTKIVAQENFSNDSGWLYRLSPVFAVGDT 241 Query: 181 LRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPI----KSNESIPASSIGFVRGLFWQP 236 L T+LLN++ + + E +P W +P + Sbjct: 242 LFDTLLLNLILVQNEDAPYAVE-----RPVWERPNAQRYVQDRERQRQPDNLAALYTSWS 296 Query: 237 AHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEK 296 + L + F F+ + + P + +KK +V Sbjct: 297 RVLFLQWGD------------DRLENIFSAGVPPFSADNAFLEPMTTWR-WIKKEQVYRP 343 Query: 297 FLAFTT--SAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLE---------- 344 T S W V + +++ R VV + + + L Sbjct: 344 HRKAMTSVSKAMWRNFGEYV---DLHDDSKRRQPGVVTWLQTLKARKSLPGETQITLATV 400 Query: 345 -LIMGGYRNNQASILERRHDVLMFNQGWQQY----GNVINEIVTVGLGYKTALRKALYTF 399 LI G +QA E DV+ N I T + Sbjct: 401 ALISDGNATSQAPAAE-VADVMRVNADVLFDSMGAQYWPKRIEDAIESTDTVAKFYWIFV 459 Query: 400 AEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVN-FSQADEVIADLRDKLHQLC 458 + K ++ G + + FY+ + + L ++ D I ++ ++ Sbjct: 460 STIAKLRNISG--STFAAAEQEKFYQDLTIPFDNWLRTLSINDDRDAKIGQWNAQVKKIV 517 Query: 459 E 459 Sbjct: 518 L 518 >UniRef50_Q04AX2 CRISPR-associated protein n=2 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q04AX2_LACDB Length = 574 Score = 178 bits (451), Expect = 4e-43, Method: Composition-based stats. Identities = 70/562 (12%), Positives = 147/562 (26%), Gaps = 112/562 (19%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYC-SRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NL+ + WI V +++++ L + D +L+ +LA L LL I + Sbjct: 5 NLITEPWIKVLDGKTNSEHMVSIKELLQNASDYRQLAGEMHAQDLAVLRLLEAILTTVYT 64 Query: 61 AKDDVE---------------------------------FRHRIMNPLTEDEFQQLIAPW 87 D + + + + Sbjct: 65 RVDQNDEEYEWVTLDEQMHVQSYDDEIEGSTLLQVLTKTWNALYKEGSFSEAVFDYLDKN 124 Query: 88 IDMFYLNHAEHPFMQTKGVKANDVTP----------------MEKLLAGVSGATNCAFVN 131 +F PF Q + + P + +L++ + A + Sbjct: 125 KGLFDFFGD-RPFYQVTAEQYDSFVPDNKKIAKGSGTVDLMQINRLISQ--SGNSVAIFS 181 Query: 132 Q--PGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGG-------TPVTTF----VRG 178 + L + N G K+ ++ + T +G Sbjct: 182 PKSANRKNKLTLDELVRWVITYQNFT---GVTDKTKVKAKEKMSNSRGWLYTLNPVYAQG 238 Query: 179 IDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK---SNESIPASSIGFVRGLFWQ 235 +L T++LN+L + ++T+ +P W + + F Sbjct: 239 KNLFETLMLNLLLFNPNK-----PAYTQQRPVWEEDLGEYVKRRLSQVKPDNFAETYTVW 293 Query: 236 PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEE 295 + + G G + + E P S + K Sbjct: 294 SRLLHIEWQDGTPTIFSAGLPAFDSANAYDIE------------PMSTWRMNKKDENYYP 341 Query: 296 KFLAFTT-SAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAP------QSPLELIMG 348 ++ W + V D E+ V + Q + L Sbjct: 342 ATRQLSSIGIAMWRNFGQYV-DIEGHTESRE--PLTVAWLNYLKTKNEFLNQQMINLHTS 398 Query: 349 GYRNNQASI----------LERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYT 398 G +N + R ++F++ + I + + RK Sbjct: 399 GIISNGGATSLMPAAEFDDNLRIEADVLFDETEDRKNAWPQRIEEMVDLNQEVGRKYYGF 458 Query: 399 FAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVN-FSQADEVIADLRDKLHQL 457 E + + + FY + LAN++ D+ + +L Q+ Sbjct: 459 LMEVGRIRFGPQGATDFAGRKSQTFYDNLNEPFENWLANLSGGDDRDKQQELWKKQLRQI 518 Query: 458 CEMLFNQSVAPYAHHPKLISTL 479 + + A P+ IS + Sbjct: 519 ALRTLDDFLEIIA--PRDISGI 538 >UniRef50_Q60AC9 CRISPR-associated protein, CT1972 family n=1 Tax=Methylococcus capsulatus RepID=Q60AC9_METCA Length = 520 Score = 178 bits (451), Expect = 5e-43, Method: Composition-based stats. Identities = 73/540 (13%), Positives = 144/540 (26%), Gaps = 62/540 (11%) Query: 1 MNLLIDNWIPVRPRNG-GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 MNLL D V +G ++ + L L + E A L + + Sbjct: 1 MNLLTDPLFRVETPDGIERLSLPQLLEALGQDRVESLLGLQRHQEDAFHIFLCYLAGAVL 60 Query: 60 PAKDDVEFR---HRIMNPLTEDEFQQLIAPWIDMFYLNH-AEHPFMQ-------TKGVKA 108 + E R + + W ++ + FMQ G Sbjct: 61 AREARSEPRQPEDFWREGI--RKLTGRDDDWAWTLIVDDVTQPAFMQAPVPDKKDFGAFK 118 Query: 109 NDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLR- 167 + L + A N + + A AL + + FG G R Sbjct: 119 LKARTADALDI-LPTAKN--HDVKASRSGATSPDGWVYALVSLQTMSGFFGQGNYGIARM 175 Query: 168 ----GGTPVTTFVRGIDL---RSTVLLNVLTLPRLQKQFPNESHTENQP-TWIKPIKSNE 219 G P + + ++ + P W +P Sbjct: 176 NGGFGSRPAVAVYHAERMGMRWHCDVTRLVGIREELLAGPWGYRERGIVLVWEQPWDLES 235 Query: 220 SIPASSIGFVRGLFWQ-PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWP 278 S+ S+ + + + + L + ++ G Sbjct: 236 SL---SLNVLDPFYIEIARAVRLMGDGKNVRAFGASTKAARLAAGDAGGVLG-------- 284 Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 P +P V KK ++ + +P + + + E+G R A + Sbjct: 285 DPWTPVNVADKKKGQSAMTVSASGLSPE--------LIRNVLFEDGFRAARMQCLLEENE 336 Query: 339 PQS---PLELIMGGYRNNQA------SILERRHDVLMFNQGWQQYGNVINEIVTVGLGYK 389 QS +++ G + R H + + + ++ + + Sbjct: 337 GQSCLFSATVLVRGQGTTDGFHHVAIPVPARAHRLFRRSSERDRLASISKTALNDAKEIQ 396 Query: 390 TALRK----ALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADE 445 + K AL N D + + ++E +R SE P + AD Sbjct: 397 NRVLKPSVIALLEAGPDKINFDRREVNLWLNEATQRFSAAWSEDYFPWLWRQAEQDDADA 456 Query: 446 VIADLRDKLHQLCEMLFNQSVAPYA-HHPKLISTLALARATLYKHLRELKPQ--GGPSNG 502 + L + +++A Y + A + L + PQ G + Sbjct: 457 ARLEWLRALRDKAHKVLEEAIARYPSREGRRYRARVKAEGLFHGSLFKTFPQLKEGSHDA 516 >UniRef50_C7MTA7 CRISPR-associated protein, Cse1 family n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTA7_SACVD Length = 525 Score = 170 bits (431), Expect = 9e-41, Method: Composition-based stats. Identities = 73/553 (13%), Positives = 157/553 (28%), Gaps = 83/553 (15%) Query: 1 MNLLIDNWIPVRPRNGGKVQ-------IINLQSLYCSRDQWRLSLPRDDMELAALALLVC 53 +L + PVR R G + + + + L + + + L +L Sbjct: 5 FDLAERGFCPVRWRAGQRPETFSPEASLGLVDLLLHAHRIEDVEISPPPALSGFLRILAV 64 Query: 54 IGQIIAPA---KDDVEFRHRIMNPL-----TEDEFQQLIAPWIDMFYLNHAE--HPFMQT 103 + I + ++ + L E ++ + F L H PF+Q Sbjct: 65 LTGRITGLDRMESFEDWEEAREDLLHAGRFDETAIRRYFDEFSGRFELFHTATARPFLQD 124 Query: 104 KGVK--------ANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGG--CTAIALFNQAN 153 + + + KL+ + + + + +L Sbjct: 125 ARLLEQCRDPQGDQVSSGVNKLVLNRAAGQAFVWQSHTVDADPSPAPVAEAVWSLLTWLY 184 Query: 154 QAPGFGGGFK--SGLRGG----TPVTTFV----RGIDLRSTVLLNVLTLPRLQKQFPNES 203 P + R G P+ V G L T++L + +PR Sbjct: 185 YGPPGCCTARQVGKTRAGDTKVGPLRGTVSYHPVGKSLFHTLVLGLPYVPR--------- 235 Query: 204 HTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTG 263 H + W + + S+P G R L + H L P G + Sbjct: 236 HEHDAAPWEEEPRDPLSVPPPVQGLARKLTGRFRHAVLLTPSESG---------DTVVDA 286 Query: 264 FLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNEN 323 + + + P+ V G + + + W + ++ Sbjct: 287 RITWAWREE-HAPVEDPYVMHRVPTNGGAP--IPESASAARAVWRDLDALL-----GRTE 338 Query: 324 GNRVAAVVNQFRNIAPQSPL--ELIMGGYRNNQASILERRHD---VLMF--------NQG 370 R V+ +A + + G+ +++ + +R++ + Sbjct: 339 QRRRPEVLAHLDELAVDEGVFTAIRALGFDQDRSKVKDRQYFSGITPPVLEAWQARDPRR 398 Query: 371 WQQYGNVINEIVTVGLGYKTALRKAL-YTFAEGFKNKDFKGAGVSVHETAERHFYRQSEL 429 W Q + AL N + GV A +++ +E Sbjct: 399 WAQLREAREAAEKTAWRLQEALSGLWQKLKPSKTTNGKARDRGVPWLHKAMTTYWQSAER 458 Query: 430 LIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKH 489 + + ADE +R+++ ++ ++ + A PK + + R L+ Sbjct: 459 CFWEAVR------ADEASVPVRNRMIEVALRTYDATTADLMRQPKQVKAVEEHRRGLWWG 512 Query: 490 LRELKPQGGPSNG 502 R G +G Sbjct: 513 WRSEAKANGEEDG 525 >UniRef50_D1Y489 CRISPR-associated protein, Cse1 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y489_9BACT Length = 536 Score = 166 bits (420), Expect = 2e-39, Method: Composition-based stats. Identities = 75/551 (13%), Positives = 143/551 (25%), Gaps = 90/551 (16%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLP------RDDMELAALALLVCIG 55 NLL + W+ V G + + + P + + L Sbjct: 4 NLLTERWLSVENPQGAMRRFSLPELFSALERNEVAAFPALLPHQAAPFHVWLVQLGCHAL 63 Query: 56 QII-----APAKDDVE-FRHRIMNPLTE--DEFQQLIAPWIDMFYLNHA---------EH 98 + P D + + + E D + ++ + F L+ + Sbjct: 64 ETAGQVENLPPPDPKKPWAMLGRHSPDEWRDMIRGVVPDYSRKFPLDEPWCLVTDDLNKP 123 Query: 99 PFMQTKGVK------ANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQA 152 FMQ + + L +S F + G + AL + Sbjct: 124 AFMQVPAPDGDFADYRGEAQFPDDLDLLISAKN---FDVKSGVMKHPSAEEWIFALISLQ 180 Query: 153 NQAPGFGGGFKSGLRGGT-----PVTTFVR--------GIDLRSTVLLNVLTLPRLQKQF 199 + G G R P+ T G D R V+LN P + Sbjct: 181 TNSGFLGRGNYGVARQNGGWSIRPILTLQSSSSPGARWGRDAR--VILN--CPPDWELYA 236 Query: 200 PNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNL 259 S E + W++P S + + + S +++ Sbjct: 237 FCRSEKETRLLWLEPWNGKTSSALRDLHPL--FIEICRRVR----AVRSGNSVSVKKAAS 290 Query: 260 RYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKII 319 EK + P P + + V L + + +I+ Sbjct: 291 ACARVDIEKTGGNL----RDPWEPVVFDKQGSHVFGSNLNYAN------------LARIL 334 Query: 320 QNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQG--------W 371 G + ++ R I + R Q ++ + W Sbjct: 335 AEAEGMQKPLLLRYHRGIDDPVATQAWCSALRKGQGKTEGYEERLIPVSVAGVSDKFTLW 394 Query: 372 QQYGNVINEIVTVGL--------GYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHF 423 + G +IN + T + + A A + + K V E Sbjct: 395 RAAGAMINLVKTAKNTVLGVALARFMQCGKSADGNRAIDWNSSAVKNWIPVVKNKMEDEV 454 Query: 424 YRQSELLIPDVLANVNFSQ-ADEVIA-DLRDKLHQLCEMLFNQSVAPYAHH-PKLISTLA 480 + D + + + DE + L QL + V+ + + A Sbjct: 455 ETLFFRYLWDTCSRMTDGEITDESWLVPWKTCLRQLVRKYYEIGVSSLPGSVSQSVKARA 514 Query: 481 LARATLYKHLR 491 L+ TL + L Sbjct: 515 LSELTLDRLLA 525 >UniRef50_B2GBJ6 Putative uncharacterized protein n=1 Tax=Lactobacillus fermentum IFO 3956 RepID=B2GBJ6_LACF3 Length = 525 Score = 155 bits (391), Expect = 4e-36, Method: Composition-based stats. Identities = 56/496 (11%), Positives = 120/496 (24%), Gaps = 104/496 (20%) Query: 36 LSLPRDDMELAALALLVCIGQIIAPAKDDV------------------------------ 65 ++ +LA L L+ I + D Sbjct: 1 MAGEMRSQDLAILRFLLAILTTVYTRFDANGNPYEWLELDSKSWRPIDNSLDDDTDDIQE 60 Query: 66 ----EFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTP-------- 113 + D + + + F + + PF Q +D+T Sbjct: 61 DLLATWEDLFQRGSFSDIVVKYLEKYAARFDVFG-KQPFYQVTAEIYDDLTSKKIASGKG 119 Query: 114 ------MEKLLAGVSGATNCAFVNQ--PGQGEALCGGCTAIALFNQANQAPGFGGGFKSG 165 M +L++ + + Q + L + N G K+ Sbjct: 120 TVAIKQMNRLISE--SNNSPDIFSPKSSSQKNKIGTAEFVRWLISYQNFT---GTTDKTK 174 Query: 166 LRG-------GTPV----TTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKP 214 ++G + T + +G L T++LN++ + ++ +P W Sbjct: 175 IKGVEKYSASAGWLYGLDTVYAQGKTLFETLMLNLVLISDFSGEYG---SIIQKPNWEFS 231 Query: 215 IKSNES----IPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFT 270 +++ + I + S C KF Sbjct: 232 SQADYVKYLLEKKAPDNIAGLYTNWSRMIYVRWDEEPTVFSTC------------LPKFD 279 Query: 271 FTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAV 330 P + V + + W + V +N + + Sbjct: 280 DDAINNEIEPMTTWRVKDRHHNTKHL---SDLGKKMWRNFG-LYVPTEADTQNADMKPGI 335 Query: 331 VNQFRNIAPQS-----------PLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVIN 379 VN + + LI G +Q E D+ N + Sbjct: 336 VNWLSLLKDEQLIPDAHYVNLATANLISDGNATSQLPAAEISDDLY-INADILFDLHWTT 394 Query: 380 EIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVN 439 +I + ++ + + G FY + + L +++ Sbjct: 395 KIEDAIEYTQQIIKYYWGFANGLAELRGI-GDKRDFANRLSEDFYNRLNEPFNNWLIDLD 453 Query: 440 FSQA-DEVIADLRDKL 454 Q I D++ Sbjct: 454 PMQPTTPQILKWEDEV 469 >UniRef50_C2GEY5 Putative uncharacterized protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY5_9CORY Length = 541 Score = 152 bits (383), Expect = 3e-35, Method: Composition-based stats. Identities = 61/502 (12%), Positives = 124/502 (24%), Gaps = 65/502 (12%) Query: 47 ALALLVCIGQIIAP----AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQ 102 L ++ A + + + L ED + + PFMQ Sbjct: 50 THRFLASTVAVVIQELNIATSPRKIKKLLEKGLPEDAVDAALERLAQGSDVFDPFFPFMQ 109 Query: 103 TKGVKANDV-----------TPMEKLLAGVSGATNCAFVNQPGQGEA-LCGGCTAIALFN 150 + D P++KL + F + G L L Sbjct: 110 QPALNIKDPKNKTTYVGPGIQPVKKLSPSMPPDEAEDFWHLLAAGNTELDLTAALQQLVG 169 Query: 151 QANQAPGFGGGFKS-GLRGGTPVTTFV-RGIDLRSTVLLNVLTLPRLQKQFP-NESHTEN 207 + + + G P FV + + L + P + + + Sbjct: 170 YQYLSLAGNNSYDGRKCQNGAPSMRFVGENRTATEIIWESTSLLASILLMIPLSWAVGQG 229 Query: 208 QPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKE 267 P W K S + + W + + G Y + E Sbjct: 230 LPAW-ADRKCEHSRGENGPHPLWRSTWSSNAPAVAWKDDVMVGVRTGGIPENWYLPEMGE 288 Query: 268 KFTFTVNGLW-----PHPHSPCLVTVKK-----------------GEVEEKFLAFTTSAP 305 + W P+ +K + ++ A + Sbjct: 289 T-KESRKKWWDTRNESDPYYLYRSRAQKDGTQELVLQRLDLGTDATALAVEWAAKNKTKA 347 Query: 306 SWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVL 365 + + + + + + + R S + + Sbjct: 348 LLAWQTPRLGEHTLDDR------LLFVRHRVEGTASSANIRASEIFAPSREKWSYDLEEN 401 Query: 366 MFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYR 425 + N Q I + + R + + G + A F+R Sbjct: 402 VLN----QISLRAELIQKIHNIVISPFRTRDNSASGGRAPLVIDFLKT-IRPDASTAFWR 456 Query: 426 QSELLIPDVLANVNFS-----QADEVIADLRDKLHQLCEMLFNQSVAPYAH-HPKLISTL 479 + ++L V Q + +LR+ L + + + V PY + P LIS + Sbjct: 457 HINAVFTEMLREVRSDFANGKQLTSISPELRENLIRAADGALEEVVEPYYYKDPALISYV 516 Query: 480 -----ALARATLYKHLRELKPQ 496 R T+ K + + Sbjct: 517 QNGIRTWVRQTINKAFPKPNTE 538 >UniRef50_B0S4B8 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B8_FINM2 Length = 519 Score = 148 bits (373), Expect = 5e-34, Method: Composition-based stats. Identities = 38/298 (12%), Positives = 84/298 (28%), Gaps = 62/298 (20%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 NLL + WIPV + + + L+ + ++ + ++ L+ I Q + Sbjct: 4 FNLLDEKWIPVIDNDCNNLNVSILELFKNASKYISIAGDTEVQTISITRFLLSILQTVFS 63 Query: 61 AKDDV---------------------------------EFRHRIMNPLTEDEFQQLIAPW 87 D+ + + + + + + Sbjct: 64 RFDENGEEYGYFGLNDMFKQKTEIDPNVIEDYVNDLNKAWINLLEMKSFPKIVEIYLNKY 123 Query: 88 IDMFYLNHAEHPFMQTKGVKANDVT--------------PMEKLLAGVSGATNCAFVNQP 133 D FYL E+PF Q D P +K+ + + N Sbjct: 124 HDRFYLYDDEYPFYQVSSNIWEDFNVISMGKQNAKPSNIPFKKINGKFYESNSLRMFNVN 183 Query: 134 GQ--GEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGT-----PVTTF-VRGIDLRSTV 185 + + A + N + G+ + +TT +RG ++ T+ Sbjct: 184 DEKAKNKMSDSELARWIITYQNYSNTSDKAVFDGVSKKSYGWLYKITTMSLRGSNVFETL 243 Query: 186 LLNVLTLPRLQKQFPNESHTENQPTWIKPIK---SNESIPASSIGFVRGLFWQPAHIE 240 +LN++ + +++ N +P W + + I Sbjct: 244 MLNLVLVHPVKEFVGNS----QKPCWEESSNKIITKNIKGYVPDNLAELYNTYSRAIR 297 >UniRef50_C5V9N4 Putative uncharacterized protein n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N4_9CORY Length = 516 Score = 147 bits (371), Expect = 8e-34, Method: Composition-based stats. Identities = 82/545 (15%), Positives = 146/545 (26%), Gaps = 81/545 (14%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMEL-AALALLVCIGQIIA 59 M L +I R+ + + + +RL+L ME + + LL IG + Sbjct: 4 MPLTEVKFIRTFLRDEPVDMTVEEVLKHATDPDFRLNLDVSGMEFMSIIRLLSHIGARML 63 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKG---VKANDVTPMEK 116 KDD R PL + + +A L + F Q V P K Sbjct: 64 Q-KDDSLHRKHRKKPLPDGLIVETLAELEADRPLYGGKQNFFQIPDSKGVVGRGKQPTSK 122 Query: 117 LLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFG---------GGFKSGLR 167 L G + A+ ++ A+ + G G+R Sbjct: 123 LSPTAPGDNSQAYWDRDKHKPVTLSAEEAMRQILIFSMYSSAGNNKFENRKCQNGSPGIR 182 Query: 168 ----GGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPA 223 G T V+ +L ++L ++ P W P+ +S Sbjct: 183 FLGAGNTATEVMVQSKNLWDSLLCSIPA---------TWVAGSGMPAWADPM-GEQSKTD 232 Query: 224 SSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSP 283 + + + W P + G G + SP Sbjct: 233 TGMHPLWQASWMPNGVSGYWEGRE-------------LVGVGVGGVPPQYLGSFSKVWSP 279 Query: 284 CLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNE---NGNRVAAVVNQFRNIAPQ 340 K + F T P + I + + + + V R Sbjct: 280 YGDKDAKESYKAWFKQRDTEDPFYLYIRDSKTNDPKAKRLDLSKDLIQLAVEWAREGTIS 339 Query: 341 SPLELIMGGYRNNQASILERRHDVLMFNQ-------------------------GWQQYG 375 L+ G + + +HD L+F + Q Sbjct: 340 KLDSLMAG-----RVAAPNFKHDKLLFARHQIGGNASTPVIRESVTTNTASSLWCLDQDP 394 Query: 376 NVINEIVTVGL--GYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPD 433 V I+ A + + F + E F+R+ + + Sbjct: 395 EVQARIIGQAEFIDTLKQRVCAPFRRQSDKDHPTFDDL-ADLRPMMEAEFWRRITPVYEE 453 Query: 434 VLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALAR--ATLYKHLR 491 V++ AD + +L +K + + PY + R LY L+ Sbjct: 454 VIS--TAQAADFNVVELYEKGVAATIAALDAVIDPYLLQNPKRNINVKERTIRFLYALLK 511 Query: 492 ELKPQ 496 + K Q Sbjct: 512 DKKGQ 516 >UniRef50_B6IWM6 CRISPR-associated protein, CT1972 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM6_RHOCS Length = 600 Score = 144 bits (362), Expect = 1e-32, Method: Composition-based stats. Identities = 76/543 (13%), Positives = 152/543 (27%), Gaps = 92/543 (16%) Query: 9 IPVRPRNG-GKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEF 67 IPV R+G G + L S + + L + L+ I Q +D + Sbjct: 8 IPVDRRSGPGVATPLELTSRHGDDPILGVRLGHPLADRGFEFLMRDILQAALAPEDATAW 67 Query: 68 RHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTK----GVKANDVT----------- 112 R ++ P + +AP+ + F L+H HP +Q + + D Sbjct: 68 RRMLVEPPGPEALAAALAPYRETFRLDHPTHPALQVRPAPERLAEADAKKPAGSRKPAPE 127 Query: 113 --------------PMEKLLAGVSGAT----NCAFVNQPGQGEALCGGCTAIALFNQANQ 154 + LL + N F + G + G L+ Sbjct: 128 AEEDGEEEEEEGPVGIGALLPDLPTKNAEKRNKDFFTRRGSIRTIGAGAVLPILYANQVL 187 Query: 155 APGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRL---QKQFPNESHTENQPTW 211 G + S G T + + G L T+ LNVLT +P W Sbjct: 188 FIDKKGSYYSLPHGRTCILFQLVGRTLWETIWLNVLTRGTEGGGDAVWPARPDDPTAFPW 247 Query: 212 IKP-----------IKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLR 260 + ++ S+ +++ L P+ +C G Sbjct: 248 LDSGLRDMSLDSNNARATRSMSRATLHPAHI--PMTRRYLLAPPVID-RCDLTG-MDGPA 303 Query: 261 YTGFLKEKFTFTVNG--LWPHPHSPCLVTVKKGEVEEKFLAFTT--SAPSWTQISRVVVD 316 + F + W ++ + K E +FL+ W + + Sbjct: 304 FKSFSRWPRGLQYETPDWWF--YAAVRLENPKKPDEPQFLSANGPLRFNDWIETAIFSNA 361 Query: 317 KIIQNENGNRVAAVVNQFRN-----------------IAPQSPLELIMGGY----RNNQA 355 K ++ + QFR+ I +++ +N Sbjct: 362 KNKNSKIIIHQPPSLRQFRSVFSASEELSEHGRRTTAIIEDGAIKVRCFAQYCYGSSNIG 421 Query: 356 SILERRHDVLMFNQGWQQ-----YGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKG 410 + V Q ++ +V + K A++ + N Sbjct: 422 GTSQYELPVWHLPDEGQSWLGEIVSEDVDRLVRIAENLKKAVKNLSKDNKKKKANNQSLE 481 Query: 411 AGVSVHETAERHFYRQS---ELLIPDVLANVNFSQ-----ADEVIADLRDKLHQLCEMLF 462 ++++ Q+ + + ++ A E L + +L LF Sbjct: 482 LDNALYDALLAAVDGQATEHAATLAALARDIPDRATRQAGAAESRTKLLKRTGRLALALF 541 Query: 463 NQS 465 +++ Sbjct: 542 DET 544 >UniRef50_Q0AA30 CRISPR-associated protein, Cse1 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA30_ALHEH Length = 515 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 59/535 (11%), Positives = 132/535 (24%), Gaps = 62/535 (11%) Query: 2 NLLIDNWIPVRPRNGGKVQIIN--LQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 NLL DN + + ++ L + + L+ + A L + Sbjct: 4 NLLTDNVFGILTPDQQHRRLSLPGLLAALARGEVESLTGVQRHQIDAFHIFLCYLSAAAL 63 Query: 60 -------PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHA-EHPFMQ------TKG 105 P +++ ++ + ++ + FMQ Sbjct: 64 ECADQPDPPQEEDRWKQSLR------LLSDYADDCAWTLAVDGPGKPAFMQPPIPSNDLA 117 Query: 106 VKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSG 165 ++L + + + A+AL + + G G Sbjct: 118 GYKPKAATPDELDVLQTAKN---HDLKASRLTHASPEDWALALISSQTMSGFLGQGNYGI 174 Query: 166 LRGGTPVTTFV---RGIDL-----RSTVLLNVLTLPRLQKQFPNESHTEN-QPTWIKPIK 216 R V L L + Q P + W P Sbjct: 175 SRMNGGFGVRVCVGVNRTLQPSERWREDLARLNAQRTHLTQPPWPFRDDGHLLLWTLPWD 234 Query: 217 SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 SI ++ + L + G+ + KE V Sbjct: 235 GQTSIGLETLHP--YFIEICRLVRLV--SKPTGIAALGKPTKAARIAGGKELAG-NVGDG 289 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTS--APSWTQISRVVVDKIIQNENGNRVAAVVNQF 334 W +P + K + F + ++ + + +G+ A + Sbjct: 290 W----TP-IHRKKGSALTPSARGFHPDMLRDLIITQTEYLLAPMQELPSGD--GAAIFHA 342 Query: 335 RNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINE-IVTVGLGYKTAL- 392 + G+ I + +L+ + +E + ++ + Sbjct: 343 SALVRGQGT---TDGFHEVNIPIDSKAKRLLLLGGEPADHLGRRSEWAIDAARNLRSRVL 399 Query: 393 RKALYTFAEGFKNKDFKGAGVSVHETAERHFY------RQSELLIPDVLANVNFSQADEV 446 R AL+T EG + + ++ P + +++ + + Sbjct: 400 RPALFTLLEGGPEGWPDTNRREAGQ--WTAVWLGEYDEGWADAYFPWLWSSIEVNSEPDA 457 Query: 447 IADLRDKLHQLCEMLFNQSVAPYA-HHPKLISTLALARATLYKHLRELKPQGGPS 500 AD +L QL E + + + A L + + Sbjct: 458 RADWVARLSQLAENILEHAFGAAPQRTGRRYRAQVRASGLFRGALYKHFQEEMAH 512 >UniRef50_A8LM39 CRISPR-associated protein, Cse1 family n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LM39_DINSH Length = 506 Score = 101 bits (251), Expect = 7e-20, Method: Composition-based stats. Identities = 56/543 (10%), Positives = 120/543 (22%), Gaps = 85/543 (15%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA-- 59 NLL D G ++ + L + + R A V + + Sbjct: 4 NLLSDPIFSAE--GGRRLNLPGLFAALACDEVRGFPRLRAHQRAAWHMFRVQLAALALDK 61 Query: 60 -----PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHA---EHPFMQTKG---VKA 108 P +D+ ++ L + L + F+Q +K Sbjct: 62 AGRAEPPQDEADWHAL---------LVALTEGVAGPWDLTGPDRTKPAFLQPPDPGGLKW 112 Query: 109 NDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRG 168 V + L ++ + AL + G G R Sbjct: 113 EPVATPDALDLLITSRN---HDLKSEIAAQAAPEDWVYALISLQTSEGYGGRGNFGIARM 169 Query: 169 GTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGF 228 ++ +L + + P + ++ + + Sbjct: 170 NGGSSSRA---------MLGLAPAGPDGRPDPASWWRRDLALVLRNRNAPTLLTRGGKAL 220 Query: 229 VRGLFWQ-----------------PAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTF 271 + L W I L G ++ Sbjct: 221 LWTLPWPEGRQIPALEMDPLAIEVCRRIRLVARDGTVVAERAASKAARVEAKA------- 273 Query: 272 TVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVV 331 NG+ P +P V + + ++ ++ G + Sbjct: 274 -FNGVLDDPWAPVNVKDATPKTLTLG---EGGRFHYRRMVDLLTG-------GYWQLPLA 322 Query: 332 NQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGY--- 388 + + L+ + + + + + ++ N I Sbjct: 323 ARLDEGEVAGNMVLVAEALARGNSKTDGLQSRNVPMPKRVRGLADIRNRIARAAQEQMAE 382 Query: 389 ----KTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVL------ANV 438 ALR+A+ +A + A ++ + D L + Sbjct: 383 IAAADAALREAVALYAARGDFETVGKPQRQRAAAARERLDATADRIFFDHLWARIAGMDE 442 Query: 439 NFSQADEVIADLRDKLHQLCEMLFNQSVAPYA-HHPKLISTLALARATLYKHLRELKPQG 497 E A+ R L ++ AR L LR+ Sbjct: 443 GDDALAEARANFRAVLVTTARDELTRAFDAIPCARIHAPRARIRARGRLEGALRKANLLE 502 Query: 498 GPS 500 Sbjct: 503 VTH 505 >UniRef50_B5H6V1 Predicted protein n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5H6V1_STRPR Length = 130 Score = 98.0 bits (242), Expect = 8e-19, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 32/100 (32%), Gaps = 1/100 (1%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA- 59 L WIPV +G + ++ + + R+ E A + L++ + Sbjct: 15 FGLTTQPWIPVLRGDGTQDELSLREVFAQAAGLRRIVGDLPTQEFALVRLMLAVVHDALD 74 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHP 99 +D ++ + + F L A+ P Sbjct: 75 GPQDIEDWSDLWADERCFAPVDAYLDAHRGRFDLLDAQAP 114 >UniRef50_Q0BRG1 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRG1_GRABC Length = 521 Score = 94.9 bits (234), Expect = 6e-18, Method: Composition-based stats. Identities = 56/530 (10%), Positives = 126/530 (23%), Gaps = 69/530 (13%) Query: 3 LLIDNWIPVR-PRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP- 60 LL V + + + + R + A L +G I Sbjct: 4 LLSHPVFRVVTRTETHVLTLPGALAALMLDKVDNFTGLRPHQQHAWHMFLAALGAIALHH 63 Query: 61 ------AKDDVEFRHRIMNP--LTEDEFQQLIAPWIDMFYLNHAEHPFMQTK------GV 106 + + E++ +++ E+ + ++ + F+Q V Sbjct: 64 ADQSAIPETEGEWKDLLLDLTNQAEEPWSLIVEDLS--------KPAFLQPPVPEGKREV 115 Query: 107 KANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGL 166 N V + L ++ F + AL + G Sbjct: 116 LKNTVYTPDALDILITAKN---FDLKAEVAVEAGLDEWVFALVSLQTMQGYSGATKYGIA 172 Query: 167 RGGT-----PVTTFV-----RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK 216 R P G LR + + +L + + W++P Sbjct: 173 RMNGGFSARPFLGLAPPKGGIGAHLRRDIRAMLAGRGKLLDMYRDYDDEGLALLWLEPWD 232 Query: 217 SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 S+ + + L + S E G+ Sbjct: 233 GKTSLSLDQLDP--WFIEICRRVRLIKGADQPIVALAVGSSVA-----RIEAKACN--GI 283 Query: 277 WPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRN 336 +P + K + S V+ +++ E R A + + Sbjct: 284 TGDFWAPVYDSEGKS------FSLDA-----RGFSYRVLCRLLFGEKKQRAARLPQSMQL 332 Query: 337 IAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKAL 396 + ++ + L+ G Q ++ ++ N + + + ++ Sbjct: 333 LPDETDMILVARGMVRGQGKTEGFHERIIPTHRHIVDAMNDETKRLQLAEIADKQEKEIA 392 Query: 397 YTFAE----------GFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEV 446 Y + G NK +++ + F + + Sbjct: 393 YIASALRQGCAIVSIGGANKKPSKDDYGCAAPYTDRLEAEADAYFFTTIQQR-FEEGENT 451 Query: 447 IADLRDKLHQLCEMLF-NQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 L + E L + S + A + L K Sbjct: 452 KIPYLRHLIRYAERLLIDASESIACPVQNRWRARVRAPRAFHGMLWTQKS 501 >UniRef50_D0MET3 CRISPR-associated protein, Cse1 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET3_RHOM4 Length = 518 Score = 82.9 bits (203), Expect = 3e-14, Method: Composition-based stats. Identities = 62/526 (11%), Positives = 129/526 (24%), Gaps = 62/526 (11%) Query: 3 LLIDNWIPVRPRNGGKVQIINLQSLYC--SRDQWRLSLPRDDMELAALALLVCIGQIIAP 60 LL + I VR ++G + + + + + A + LV + + Sbjct: 5 LLNEPLIRVRLKDGTVRDCTLPEVIVALLRDEVLGFEALQPHQQQAWHSFLVQLVAMAVA 64 Query: 61 -------AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTK--------G 105 ++ +R ++ +E + + L + F+Q Sbjct: 65 RITGGAFPEEAEPWRRALVELAGGEEAAWYL----VVSDL--SRPAFLQPPVPEGSLEAA 118 Query: 106 VKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSG 165 DV ++L ++ + + AL G G Sbjct: 119 RYRADVQTPDQLDVLITAKN---HDLKARRIVQPRPEHWMFALVTLQTMEGFLGRGNYGI 175 Query: 166 LR-----GGTPVTTFVR----GIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIK 216 +R G P+ R G R V + + P ++ H W+ P Sbjct: 176 VRMNGGFGNRPLVGLTRALSPGAHFRRDVQVLLDARPSFADRYDLGGHAL---LWVLPWD 232 Query: 217 SNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGL 276 + I + G C + +NG+ Sbjct: 233 GKKQRAIELKECDPYFIEICRRIRFLEEAGRLVCWRTNTDGPRVAAP-------KALNGI 285 Query: 277 WPHPHSPCLVTVKKGEVEEK----FLAFTTSAPSWTQISR--VVVDKIIQNENGNRVAAV 330 P +P + K + + + + + VA Sbjct: 286 TGDPWTPVEKSDKPKALTVSDSGFTYRLLQQVFLSGDYEPPVALAFREDEKDGVYLVART 345 Query: 331 VNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKT 390 + + + ++ +QA+ + ++ + + Sbjct: 346 LVRGQGKTGGLHFRIVPV----SQAAARWLSGSEERRTKLARRAEQRVRLA---ADVQRK 398 Query: 391 ALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADL 450 L AL G ++ + V+ A F R + D L + D Sbjct: 399 VLYPALAALLSGGRDARVEWEQVTPWIEA---FDRAVDAQFFDRLWASVEQDEEIARRDW 455 Query: 451 RDKLHQLCEMLFNQSVAPYA-HHPKLISTLALARATLYKHLRELKP 495 L F + + A A RE+ P Sbjct: 456 ERFLLTEARRQFAVAEHGMPVASAHHWRARSRAHAIFEARAREVLP 501 >UniRef50_B4UE68 CRISPR-associated protein, Cse1 family n=2 Tax=Anaeromyxobacter RepID=B4UE68_ANASK Length = 539 Score = 79.1 bits (193), Expect = 4e-13, Method: Composition-based stats. Identities = 57/503 (11%), Positives = 126/503 (25%), Gaps = 70/503 (13%) Query: 2 NLLIDNWIPVRPRNGGKVQIINL--QSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 +LL + + + +G + ++ + S + + A A LV + I Sbjct: 34 DLLDEPLLGIARGDGSRGRLDLPGVLEALGKDEVEGFSALQAHQQHAWHAFLVQLAAIAL 93 Query: 60 PAKDDVEFRHRIMNPLTEDEFQQLIAPW----IDMFYLNHA---EHPFMQTK------GV 106 +D L +++L+ + + L F+Q Sbjct: 94 QRGED------RSPKLKAARWRELLETLTRGRHEPWTLVVPDVSRPAFLQPPVPEGTLAA 147 Query: 107 KANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGG------ 160 + + ++L V+ + + AL + G Sbjct: 148 FKSRLARSDELDLLVTSKN---HDVKAARAANARPEHWVYALVSLQTMQGFSGRANYGIA 204 Query: 161 ---GFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKS 217 G G G R V + + ++ ++H W++P Sbjct: 205 RMNGGAGSRPGLGLAPGHALGARFRRDVSVLLEVRDSIRAGRGYKAHGGISLLWLEPWDG 264 Query: 218 NESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLW 277 N +I + + + L G G + G L+ Sbjct: 265 NSAITPGELDPL--FIEVCRRVRLDVSDGGIDAHTTGTAAARISAGDLRGNTG------- 315 Query: 278 PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNI 337 P +P K AFT + ++ V+ +++ +E + + Sbjct: 316 -DPWTPVQKAEGK--------AFTATEAGFSYR---VLQRLLGDEYEPGATQLPRRA--- 360 Query: 338 APQSPLELIMGGYRNNQASILERRHDVLMFN----------QGWQQYGNVINEIVTVGLG 387 +EL+ L + G + E V + Sbjct: 361 --DREVELVATVLARGMGKTGGYHERRLPVPGDVLPWLADDERRALLGALARERVQLAAD 418 Query: 388 YKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVI 447 + + K D K + H + + + + L D Sbjct: 419 VQRLVLKPAILAYLQGAPDDLKFKDRRADPWLDAH-DAEVDRIFFERLWADLTRDVDGAR 477 Query: 448 ADLRDKLHQLCEMLFNQSVAPYA 470 D + +L ++ Sbjct: 478 NDWARTVLELARAQLESALEVAP 500 >UniRef50_UPI0001B51C2F CRISPR-associated Cse1 family protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2F Length = 287 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 58/250 (23%), Gaps = 27/250 (10%) Query: 9 IPVRP--RNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP-AKDDV 65 IPV + +V++ + L + + + A + L+ + + Sbjct: 11 IPVLTTGPDLRRVKLNLVDVLCRADELAAVCGDTPGETAALIDWLIGLIHAADQHPETSD 70 Query: 66 EFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVK---ANDVTPMEKLLAGVS 122 E+ + + D+ IA + L E P Q + +L Sbjct: 71 EWLSWVTDRTPLDKVADWIASHPGCWDLFDPERPLGQNPVLHPHLDTYGVGPAQLFLERV 130 Query: 123 GATNCAFVNQPGQGEALCGGCTA-IALFNQANQAPG----------FGGGFKSGLRGGTP 171 G N F A A+ Q G G + L Sbjct: 131 GDYNQFFNLHHLHHPEPVPADAAWRAMLTQHVYGIGMRGRIKAKDMGLPGTFTNLGTNRL 190 Query: 172 VTTF-------VRGIDLRSTVLLNVLTLPRLQKQFP-NESHTENQPTWIKPIKSNESIPA 223 T G L + LN P P + + S Sbjct: 191 ATRLKVIARPAWPGATLGDLLRLNTAPWPDEPGPLNLTWKSGRAVPKRLHTAGTTAS--P 248 Query: 224 SSIGFVRGLF 233 G Sbjct: 249 HFAGPADLHT 258 >UniRef50_Q6NEQ6 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ6_CORDI Length = 518 Score = 72.5 bits (176), Expect = 4e-11, Method: Composition-based stats. Identities = 57/510 (11%), Positives = 126/510 (24%), Gaps = 58/510 (11%) Query: 41 DDMEL-AALALLVCIGQIIAPAK-----DDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLN 94 E+ A L+ + ++ + + + + N + +++ + + Sbjct: 4 PGFEISAQFRFLLSVTALLVREEFGRAPSLGQTQSLLQNGFSSAVVEKVNQNLVSHLNVL 63 Query: 95 HAEHPFMQTKGVKANDVTP-----------MEKLLAGVSGATNCAFVNQP-GQGEALCGG 142 PFM + ++KL + + N ++L Sbjct: 64 DGLQPFMGRPRLHPEGPKDASRRIGPGDQEVKKLSPAMPSEQGEDYWNLLVEFPDSLSIS 123 Query: 143 CTAIALFNQANQAPGFGGGFKS-GLRGGTPVTTFV-RGIDLRSTVLLN---VLTLPRLQK 197 + + + F R G P F +G + + L L Sbjct: 124 EATLKIVTYHYFSMAGNNKFDGDKTRMGAPGIRFPGKGYAATEHIWIREGSSLLRSLLTS 183 Query: 198 QFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQES 257 + E P W + W + G G Sbjct: 184 LPTSWVKGEGLPAWADRTAQKSRTGQQQFHALWEATWSSNTVVSYWEDGYLTGVRVGGIP 243 Query: 258 NLRYTGFLKEKFTFTVNGLWPHP---HSPCLVTVKKGEVEEKFLAFTTSAPS------WT 308 + K LW P +K + E K W Sbjct: 244 PAWWPDIPNTKEGEKALKLWWDQRNEKDPLYFYLKNKKGEPKAQRIDFGRDGIDLAVEWA 303 Query: 309 QISRVV----------------VDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRN 352 ++++ +E ++ V + + S + Sbjct: 304 AEAKMMDLVEKTYKNVLPVDSYPTVDGDDEGIDKYQLVFFRHQVEGTASSPSIRASEVFL 363 Query: 353 NQASILERR---HDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFK 409 S+ + L Q +V ++ V + + A A G+ Sbjct: 364 ADRSVWAFDLSCDEQLRLRDNAQLVRDVYITLLGVFR--RKSAADASREVATGWGAAVLD 421 Query: 410 GAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPY 469 VS A ++R + LA++ + E+ +L + +++ APY Sbjct: 422 SLAVS-QSDASDAYWRGVTDVYQKYLADLRVGE--EITEELYRGIRSAALEAYDEVTAPY 478 Query: 470 AHHPKLISTLALAR--ATLYKHLRELKPQG 497 A ++ + + +G Sbjct: 479 LAQYANEIYYVRASLVRSVNSKINTARDEG 508 >UniRef50_C9M9R4 CRISPR-associated protein, Cse1 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R4_9BACT Length = 540 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 64/557 (11%), Positives = 129/557 (23%), Gaps = 95/557 (17%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSL--YCSRDQWRLSLPRDDMELAAL----------- 48 NLL + I V+ +G K Q L R Sbjct: 6 NLLNEQLITVQDPDGRKTQASLPDIFAMLEENQVESFPLLRPHQSAPWTCLLTQLAALAL 65 Query: 49 ----ALLVCI----GQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNH-AEHP 99 L + +A + ++ + ++ + Sbjct: 66 EESGETLPPLDPEKPWTMAGRHEPEKWAQLLRALTP-----GYTEDEPWCLVVSDVTKPA 120 Query: 100 FMQTKGVKANDVT--------PMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQ 151 F+Q + L + T A +PG A A+ + Sbjct: 121 FLQPPSSTEASAEKCFDKVDLTPDNLDLTI---TRKAHDLKPGLIAAPRIEEWLFAIVSL 177 Query: 152 ANQAPGFGGGFKSGLR---GGTPVTTFVRGIDLRSTV---L---LNVLTLPRLQKQFPNE 202 A G R G G T+ + V+ Sbjct: 178 QTNAGFLGAENYGIARQNGGHGKRICSSLGSS--RTIGGRWGRNVRVILDHIDDLYSELY 235 Query: 203 SHTENQPTWIKPIKSNESIPASSIGFVRGLFWQ-PAHIELCDPIGIGKCSCCGQESNLRY 261 E + +G + LF + I C G C G + R Sbjct: 236 EAENPLRLLWLLPWDGEKGSSIFVGDLHPLFVEVARRIR-CVEKDGGLCVTRGGTKDWRV 294 Query: 262 TGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQN 321 +G P +P + + S+ + +V+ Sbjct: 295 AA-------EDFHGALNDPWTPLIDDEGD-------VKAYGGDMSYRGLQKVLCF----- 335 Query: 322 ENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQ-------- 373 + ++ + ++ G + L + ++ Sbjct: 336 ---SHKPLLLQWHQKYDGSKNQLVLFEGLMKGKGKTEGVAFRALPVSSAVRRLFSSPDLT 392 Query: 374 ------------YGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAER 421 N+++ V L R+ + A + D +V ++ Sbjct: 393 KENEIASAMLNLVETAENKVLKVALATSAQARRPSFNGAIEWNGPDVDDWIPTVIREMDK 452 Query: 422 HFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYA-HHPKLISTLA 480 + + L D+ D+L +L E F + + + +A Sbjct: 453 EVDDLFFPALWNELER-GAGARDDSDKTWTDRLRELVEKYFELGMKTLSLGSEMNLKGIA 511 Query: 481 LARATLYKHLRELKPQG 497 R L+ ++ L P Sbjct: 512 QGRNRLFGLMKNLLPAA 528 >UniRef50_B6ZW55 CRISPR-associated protein, Cse1 family n=2 Tax=Enterobacteriaceae RepID=B6ZW55_ECO57 Length = 109 Score = 71.4 bits (173), Expect = 7e-11, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 36/104 (34%), Gaps = 8/104 (7%) Query: 392 LRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLR 451 +++A ++ +G DF + + F R + ADE++ + Sbjct: 1 MKEAWFSDPKGA-RGDFSFVDIDFWNKTQHRFLRLVRQI-------EEGQDADELLGKWQ 52 Query: 452 DKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKP 495 ++ F++ V + P + + AR + E + Sbjct: 53 KEIWLFARQDFDERVFTNPYEPVDLERVMTARKKYFTTSAEKQS 96 >UniRef50_C4ZJX8 CRISPR-associated protein, Cse1 family n=2 Tax=Betaproteobacteria RepID=C4ZJX8_THASP Length = 526 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 59/532 (11%), Positives = 128/532 (24%), Gaps = 79/532 (14%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSL--YCSRDQWRLSLPRDDMELAALALLVCIGQIIA 59 N+L + I VR G ++ Q L R ALLV + + Sbjct: 7 NILDEPLIRVRDLGGQPQRLTLPQLLVALGRDAVRDYPALRPHQRHPWHALLVQLAALAL 66 Query: 60 PAKDDV-EFRHRIMNPLTEDEFQQLIAPWID------MFYLN--HAEHPFMQTKGVKAND 110 A D+ + +++ + F L H F+Q V + Sbjct: 67 HAADEDCPWDE-------ATDWRAALLALTPNHPDGCAFCLVAPHDRPAFLQ-PPVPSGK 118 Query: 111 VTPM------EKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKS 164 + + L V+ + + AL + Q G G Sbjct: 119 LDGWKYIATPDALDMLVTSKN---HDLKSERMRHAEADDWLFALASLQTQEGFLGKGNYG 175 Query: 165 GLRGGTPVTTFV---------RGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPI 215 R ++ G+ R + + + + + WI P Sbjct: 176 ISRMNGGFSSRPALGVAPAGGVGVRWRRDIGALLTERDDITETIGFQRDGGMALLWIPPW 235 Query: 216 KSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNG 275 +S+ +++ I G+ + G Sbjct: 236 DGTKSLAFTALDP--FYIEICRRIRFVLDSD-------GRLKAHGIGTSVARVDAAARKG 286 Query: 276 LWPHPHSPCLVTVKKGEVEEKFLAFTT-SAPSWTQISRVVVDKIIQNENGNRVAAVVNQF 334 + +P E + L + +++ ++ +A + Sbjct: 287 VTGDAWTPV------EEDKALTLGADGFNYGLAAELAW-------GSKYRKTLAQALRAE 333 Query: 335 RNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGN---------VINEIVTVG 385 IA + L+ G Q + + Y + V Sbjct: 334 DGIAG---ISLLAQGVVRGQGKTEGYHERRIPLTKKAIGYLRGGQTDLPAAIAAARVKAV 390 Query: 386 LGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSE-----LLIPDVLANVNF 440 + ++ + + + + + E F R E D+ + Sbjct: 391 ATMRGSILRTALFYLLEAGTEKIDFDRRTAKQQIE-AFIRAFERTEDARFFEDLNREIET 449 Query: 441 SQADEVIADLRDKLHQLCEMLFN-QSVAPYAHHPKLISTLALARATLYKHLR 491 + L + E++ + + A + + LR Sbjct: 450 EDRNAERLAWMLGLAERAEVVLKGAFTIGPQSGERRYRARSRALSYFHGALR 501 >UniRef50_Q2RXJ9 CRISPR-associated protein, Cse1 family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RXJ9_RHORT Length = 427 Score = 66.8 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 48/407 (11%), Positives = 104/407 (25%), Gaps = 81/407 (19%) Query: 2 NLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAP- 60 NLL+ I V P G + + + + + R A LV + + Sbjct: 3 NLLLQPLIDVTP--CGVLTLPGVMAALARDEVGSFPSLRPHQAPAWHMFLVQLAALALQK 60 Query: 61 ------AKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHA-EHPFMQ---TKGVKAN- 109 + ++ + LT + PW ++ + F+Q G++ Sbjct: 61 AGEKTVPTVEEDWARLLR-GLTPG--FKADEPW--CLVVDDPGKPAFLQPPIPPGLQLGN 115 Query: 110 DVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGG 169 V + L ++ + AL + G G + R Sbjct: 116 PVATPDGLDLLITSRN---HDLKQSVAYRGTAQDWVFALISLQTGEGYGGAGNQGIARMN 172 Query: 170 TPVTTFVRGIDLRSTVLLNVLTLP----RLQKQFPNESHTENQPTWIKPIKSNESIPASS 225 ++ L+++ LP + P + + + +E Sbjct: 173 GGASSRP---------LVSLAPLPPKSEKAMAPRPGAWFRRDVAV-LLETRESEMAHYEP 222 Query: 226 IGFVR----GLFWQ---------------------PAHIELCDPIGIGKCSCCGQESNLR 260 +G+ GL W + L + G G++ + Sbjct: 223 LGYRETGGLGLTWLALWPEDEQLQTKDLDIWFIEVCRRVRLSEESG----RLIGRKGTSK 278 Query: 261 YTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQ 320 T + + G P +P K V+ ++I Sbjct: 279 ATRINAK----PLKGALGDPFAPVDKVENKSFTLND-----------RDFDYRVLTELIL 323 Query: 321 NENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMF 367 + N + + + L L+ + + +L Sbjct: 324 SGNWD-LPLLARPASFEPEGETLALVCAALARGNSKTYGFKTRILPV 369 >UniRef50_C2KP47 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP47_9ACTO Length = 527 Score = 64.1 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 70/541 (12%), Positives = 146/541 (26%), Gaps = 78/541 (14%) Query: 3 LLIDNWIP----------VRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDM-ELAALALL 51 L WIP V + + +L L ++L +L Sbjct: 6 LKDIAWIPTARGRMTARDVLCSSATVPNNSSRNTLSTQD-----LLINPGFSAGSSLRIL 60 Query: 52 VCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDV 111 I + + D + I +A++PF+Q + + + Sbjct: 61 RDIAVLAERIRKQKG-SKVAPEEPDIDAIDEAIESLAPYCNPLNADYPFLQRQVISGQET 119 Query: 112 TP-MEKLLAGVSGATNCAFVNQP-GQGEALCGGCTAIALFNQANQAPGFGGGFK------ 163 +KL ++ + AF L + L + + + Sbjct: 120 DGAPKKLSPAMAPDSAEAFWKMSFTFANQLSLDKALLWLAINHHYSLAGNNQYDGEKCAM 179 Query: 164 --SGLR----------GGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNE------SHT 205 G+R G T F + T+L+N+ FP S Sbjct: 180 GAPGIRFLGVDKKKDVGKTITEVFFIKDSIYETLLMNIPIDWLECNDFPAWALRNPESLG 239 Query: 206 ENQPTWIKPIKSNESIPASSIGFVRGLF-----WQPAHIELCDPIGIGKCSCCGQESNLR 260 ++ P W SN A + L P H G C ++ Sbjct: 240 KDHPLWDASWSSN---TAVCVWEGNMLTSAKPCGIPRH---WYSQAHGVCPSAKKDKAA- 292 Query: 261 YTGFLKEKFTFTVNGLW-----PHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVV 315 KE+ W P K G ++ + L F + + V Sbjct: 293 -----KEQENARKKSWWDNRNTRDPLYLYT-PNKDGILKPQRLDF------GRDATDLAV 340 Query: 316 DKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMF--NQGWQQ 373 + + + + +S L + +S + R +VL+ N W Sbjct: 341 HWNAEQNPQYMEYSSTTRILHHDAESYLCFLRHQLGGTPSSPVIRASEVLIPDDNSLWSP 400 Query: 374 YGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSV-HETAERHFYRQSELLIP 432 + ++ +K ++ F + + + + F+R + Sbjct: 401 HKDLYEIAEMYANVVLDMQKKLVFCFGKPTQPNIPTLSNLGFLRGDVSTAFWRHIRPIFE 460 Query: 433 DVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRE 492 + + + +++ + F++ PYA + + L + L + Sbjct: 461 NSMRENQLED-ENNYSEILKATQKATMSAFDEVTLPYAQSM--VPEVTTTHQYLREKLGK 517 Query: 493 L 493 + Sbjct: 518 I 518 >UniRef50_UPI000169879C hypothetical protein Epers_00880 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI000169879C Length = 102 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 15/91 (16%), Positives = 26/91 (28%), Gaps = 10/91 (10%) Query: 279 HPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIA 338 HP SP + + G L + + + + + + A VV+ F + Sbjct: 2 HPLSPHIANKEGG-----LLPQHAQPGGLSYRHWLGL---VSKQENRQPAQVVSTFLSYR 53 Query: 339 --PQSPLELIMGGYRNNQASILERRHDVLMF 367 PQ L GY + Sbjct: 54 KLPQEQFRLHTFGYDMDNMKARCWYETTFPL 84 >UniRef50_Q06WG4 Putative uncharacterized protein (Fragment) n=4 Tax=Salmonella enterica subsp. enterica RepID=Q06WG4_SALNE Length = 55 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 2/57 (3%) Query: 1 MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQI 57 M+L + W+PV NG K +I L L+ PR D + AA +L+ I Q Sbjct: 1 MDLTKEKWLPVIFSNGDKKKISLRDLL--DNRIQDLAYPRADFQGAAWQMLIGILQC 55 >UniRef50_UPI0001B51C2E hypothetical protein SvirD4_12610 n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2E Length = 228 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 57/210 (27%), Gaps = 11/210 (5%) Query: 291 GEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGY 350 ++ L + + W + + +++ + V + + L G Sbjct: 15 PGGKKTLLKPSPTRELWRESHALYAAVAERDKGTDLYGRV-----AMLHGRRVHLWAIGL 69 Query: 351 RNNQASILERRHDVLMFNQGWQ----QYGNVINEIVTVGLGYKTALRKALYTFAEGFKNK 406 Q + D + G + + I + A A Sbjct: 70 LATQGKLTAWLSDEFPYVPGRETQLRHAAEQGSAICEYTARSLYLVAAATREIAYPNPKP 129 Query: 407 DFKGAGVSVHETAERHFYRQSELLIPDVLANV-NFSQADEVIADLRDKLHQLCEMLFNQS 465 D K A ++ E + + L +L V + A E +A + L M + Sbjct: 130 DDKAAQLA-RFNGEPEMWAGAADLFHHLLDQVADTGAAIEALATFGRDILALAIMSLDGR 188 Query: 466 VAPYAHHPKLISTLALARATLYKHLRELKP 495 + + ARA L L K Sbjct: 189 LTSLPSGGTGLQARVTARARLRALLGHSKA 218 >UniRef50_B6B780 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B780_9RHOB Length = 171 Score = 46.3 bits (108), Expect = 0.003, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 40/157 (25%), Gaps = 14/157 (8%) Query: 345 LIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFK 404 +I GG+ + + L + ++ LR+AL Sbjct: 1 MIAGGWAMDNMKPKDFLWSELPLLTFGTPAQAMAERLIEAANLVAGGLRQAL-------- 52 Query: 405 NKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQ--ADEVIADLRDKLHQLCEMLF 462 A S E HF+ +E LA + + + + F Sbjct: 53 --PVVLAEGSAREAQLEHFWTATEGDFTAALAELAQEDFEPEATARQFLSGIGRQALAQF 110 Query: 463 NQSVAPYAHHPK--LISTLALARATLYKHLRELKPQG 497 + P + + A L + QG Sbjct: 111 QELALPGMSDGRIEQAGRIVAANRNLNALIHGRSTQG 147 >UniRef50_UPI000169A1F1 hypothetical protein Epers_00060 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI000169A1F1 Length = 84 Score = 41.7 bits (96), Expect = 0.059, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 21/71 (29%), Gaps = 3/71 (4%) Query: 424 YRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALAR 483 Y + L + A + EV +L + LF+ + +A A Sbjct: 1 YAHLQQLKQQLKAGKDGKALLEV---WHGELKKAALDLFDYWTSRGDFEAVNPRRIAQAH 57 Query: 484 ATLYKHLRELK 494 L L K Sbjct: 58 RKLNNWLHGKK 68 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.122 0.319 Lambda K H 0.267 0.0375 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,338,362,191 Number of Sequences: 3077464 Number of extensions: 83433140 Number of successful extensions: 258124 Number of sequences better than 1.0e-01: 117 Number of HSP's better than 0.1 without gapping: 156 Number of HSP's successfully gapped in prelim test: 88 Number of HSP's that attempted gapping in prelim test: 256923 Number of HSP's gapped (non-prelim): 268 length of query: 502 length of database: 1,040,396,356 effective HSP length: 133 effective length of query: 369 effective length of database: 631,093,644 effective search space: 232873554636 effective search space used: 232873554636 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 95 (41.3 bits)