BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (224 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_Q46898 Uncharacterized protein ygcI n=13 Tax=Proteobact...   462   e-129
UniRef50_C5SD48 CRISPR-associated protein Cas5 family n=1 Tax=Al...   155   8e-37
UniRef50_A1SV73 CRISPR-associated protein, Cas5e family n=2 Tax=...   119   7e-26
UniRef50_A6W168 CRISPR-associated protein Cas5 family n=6 Tax=Ga...   114   3e-24
UniRef50_Q1R114 CRISPR-associated protein, CT1976 n=1 Tax=Chromo...   112   1e-23
UniRef50_Q314I4 CRISPR-associated protein, CT1976 n=1 Tax=Desulf...   111   2e-23
UniRef50_Q04QB7 Putative uncharacterized protein n=2 Tax=Leptosp...   100   3e-20
UniRef50_C7MTB0 CRISPR-associated protein Cas5 n=1 Tax=Saccharom...   100   4e-20
UniRef50_B7KJ26 CRISPR-associated protein Cas5 family n=1 Tax=Cy...    97   3e-19
UniRef50_A8LYZ7 CRISPR-associated protein Cas5 family n=2 Tax=Ac...    97   4e-19
UniRef50_C7MQD6 CRISPR-associated protein Cas5 n=1 Tax=Saccharom...    95   2e-18
UniRef50_Q1EQS9 CRISPR-associated protein n=3 Tax=Streptomyces R...    95   2e-18
UniRef50_Q2RY19 CRISPR-associated protein, Cas5e family n=1 Tax=...    94   3e-18
UniRef50_C2BET8 CRISPR-associated protein n=2 Tax=Firmicutes Rep...    93   6e-18
UniRef50_A5UR14 CRISPR-associated protein, Cas5e family n=1 Tax=...    92   2e-17
UniRef50_Q12YA8 CRISPR-associated protein, CT1976-like n=1 Tax=M...    92   2e-17
UniRef50_D1A6Q5 CRISPR-associated protein Cas5 family n=2 Tax=Ac...    91   3e-17
UniRef50_A8SDR7 Putative uncharacterized protein n=1 Tax=Faecali...    90   6e-17
UniRef50_A8ZZ17 CRISPR-associated protein Cas5 family n=1 Tax=De...    90   6e-17
UniRef50_B4TTX2 CRISPR-associated protein Cas5 n=15 Tax=Enteroba...    89   1e-16
UniRef50_D0Y918 CRISPR-associated protein Cas5 family n=2 Tax=De...    89   1e-16
UniRef50_D2L2X8 CRISPR-associated protein Cas5 family n=1 Tax=De...    87   6e-16
UniRef50_Q0W584 Putative uncharacterized protein n=1 Tax=uncultu...    87   6e-16
UniRef50_D1CGD4 CRISPR-associated protein Cas5 family n=6 Tax=Ba...    85   2e-15
UniRef50_Q47PI7 CRISPR-associated protein, Cas5e family n=12 Tax...    84   3e-15
UniRef50_A7BA63 Putative uncharacterized protein n=1 Tax=Actinom...    83   7e-15
UniRef50_D1NTI1 CRISPR-associated protein Cas5 n=1 Tax=Bifidobac...    82   1e-14
UniRef50_C8XAY4 CRISPR-associated protein Cas5 family n=2 Tax=Ac...    82   1e-14
UniRef50_Q2JH27 CRISPR-associated protein, CT1976 n=6 Tax=Actino...    82   2e-14
UniRef50_C7LYW6 CRISPR-associated protein Cas5 family n=1 Tax=Ac...    82   2e-14
UniRef50_B0S4B6 Putative uncharacterized protein n=1 Tax=Finegol...    80   5e-14
UniRef50_B2GBJ9 Putative uncharacterized protein n=1 Tax=Lactoba...    80   7e-14
UniRef50_Q5YRB6 Putative uncharacterized protein n=1 Tax=Nocardi...    79   1e-13
UniRef50_B3E5U9 CRISPR-associated protein Cas5 family n=2 Tax=De...    79   1e-13
UniRef50_A3EQA4 CRISPR-ssociated protein, Cas5 n=3 Tax=Bacteria ...    79   1e-13
UniRef50_B0TDU1 Crispr-associated protein cas5 n=1 Tax=Heliobact...    78   2e-13
UniRef50_C9M9R7 CRISPR-associated protein Cas5, Ecoli subtype n=...    77   4e-13
UniRef50_A5GBK2 CRISPR-associated protein Cas5 family n=2 Tax=De...    77   6e-13
UniRef50_D1Y486 Crispr-associated protein Cas5 n=1 Tax=Pyramidob...    76   7e-13
UniRef50_D0WFC8 CRISPR-associated protein Cas5 n=1 Tax=Slackia e...    76   7e-13
UniRef50_B1VIY0 CRISPR-associated protein n=9 Tax=Actinomycetale...    76   8e-13
UniRef50_D2RB02 CRISPR system CASCADE complex protein CasD n=3 T...    76   8e-13
UniRef50_C9M2Y8 CRISPR-associated protein n=1 Tax=Lactobacillus ...    76   1e-12
UniRef50_UPI0001AF1D4C CRISPR-associated protein, CT1976 n=1 Tax...    74   5e-12
UniRef50_A8M404 CRISPR-associated protein Cas5 family n=1 Tax=Sa...    73   7e-12
UniRef50_B4UE71 CRISPR-associated protein Cas5 family n=2 Tax=An...    71   3e-11
UniRef50_D0MET6 CRISPR-associated protein Cas5 family n=1 Tax=Rh...    71   3e-11
UniRef50_Q0BSC7 Putative uncharacterized protein n=1 Tax=Granuli...    70   5e-11
UniRef50_B8GIV3 CRISPR-associated protein Cas5 family n=1 Tax=Me...    70   6e-11
UniRef50_A1ARH6 CRISPR-associated protein, Cas5e family n=2 Tax=...    69   1e-10
UniRef50_B5GY62 Crispr-associated protein (Fragment) n=1 Tax=Str...    69   1e-10
UniRef50_A9HLC6 CRISPR-associated protein Cas5 family n=11 Tax=A...    69   1e-10
UniRef50_B6B783 CRISPR-associated protein Cas5, Ecoli subtype n=...    69   2e-10
UniRef50_Q0AA33 CRISPR-associated protein Cas5 family n=2 Tax=Ga...    67   3e-10
UniRef50_C2KP44 Putative uncharacterized protein n=1 Tax=Mobilun...    67   5e-10
UniRef50_Q1J367 CRISPR-associated protein, CT1976 n=1 Tax=Deinoc...    67   5e-10
UniRef50_Q2FNU0 CRISPR-associated protein, CT1976 n=1 Tax=Methan...    67   6e-10
UniRef50_C2GEY8 CRISPR-associated protein n=1 Tax=Corynebacteriu...    65   2e-09
UniRef50_Q03C60 CRISPR-associated protein n=4 Tax=Lactobacillus ...    65   2e-09
UniRef50_UPI0001B51C2B CRISPR-associated Cas5 family protein n=1...    64   6e-09
UniRef50_B5F422 CRISPR-associated protein Cas5 n=59 Tax=Enteroba...    62   2e-08
UniRef50_B4S8P8 CRISPR-associated protein Cas5 family n=8 Tax=Ba...    60   4e-08
UniRef50_C5V9N1 CRISPR-associated protein Cas5 n=1 Tax=Corynebac...    60   4e-08
UniRef50_B8IZA7 CRISPR-associated protein Cas5 family n=1 Tax=De...    60   4e-08
UniRef50_B6IWM3 CRISPR-associated protein, CT1976 family n=1 Tax...    59   1e-07
UniRef50_B8IMR2 CRISPR-associated protein Cas5 family n=1 Tax=Me...    59   2e-07
UniRef50_Q2JWC5 CRISPR-associated protein Cas5, Ecoli subtype n=...    58   3e-07
UniRef50_D1CAJ0 CRISPR-associated protein Cas5 family n=1 Tax=Sp...    57   4e-07
UniRef50_B8FDI0 CRISPR-associated protein Cas5 family n=3 Tax=Ba...    55   2e-06
UniRef50_B8HWH8 CRISPR-associated protein Cas5 family n=1 Tax=Cy...    54   3e-06
UniRef50_B6XT64 Putative uncharacterized protein n=2 Tax=Bifidob...    50   6e-05
UniRef50_Q2RXJ5 CRISPR-associated protein, Cas5e family n=5 Tax=...    50   7e-05
UniRef50_Q6NEQ9 Putative uncharacterized protein n=1 Tax=Coryneb...    49   1e-04
UniRef50_C0W6U0 CRISPR-associated Cas5 family protein n=1 Tax=Ac...    49   2e-04
UniRef50_B8IJS9 CRISPR-associated protein Cas5 family n=1 Tax=Me...    40   0.046

>UniRef50_Q46898 Uncharacterized protein ygcI n=13 Tax=Proteobacteria RepID=YGCI_ECOLI
          Length = 224

 Score = 462 bits (1190), Expect = e-129, Method: Compositional matrix adjust.
Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES Sbjct: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL Sbjct: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD Sbjct: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 Query: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ 224 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ Sbjct: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ 224 >UniRef50_C5SD48 CRISPR-associated protein Cas5 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD48_CHRVI Length = 227 Score = 155 bits (392), Expect = 8e-37, Method: Compositional matrix adjust. 
Identities = 109/225 (48%), Positives = 137/225 (60%), Gaps = 15/225 (6%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M SYLILRL GPMQAWG TFE RP+ FPTRSGLLGLLGACLG+ R DT SL AL+ES Sbjct: 1 MPSYLILRLDGPMQAWGTHTFEDYRPSNPFPTRSGLLGLLGACLGLDRSDTPSLDALAES 60 Query: 61 VQFAVRCD--------ELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLC 112 V F VR D + ++ R T L DYHTVL AR+ G + IQ+ REYL Sbjct: 61 VAFTVRLDTGAPRPGVDRLMPKRH---TKLSDYHTVLDARK-VDGSTNKFPIQSHREYLF 116 Query: 113 DASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF--LGTCQASDPQKA 170 DA+F VA+ P A+ ++ + +++ +PR+TP LGRRSCPL PL +A D + A Sbjct: 117 DAAFAVAIGSRPDASFSLARIAESLRQPRFTPVLGRRSCPLGRPLLERPDCIEADDAKAA 176 Query: 171 LLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYV 215 L + P GG IYSE+ + + RD P RQFA+R Y+ Sbjct: 177 LAQFPPHGGLIYSEDELVSDQPTWI-RDVPRYGRHRQFATRRLYL 220 >UniRef50_A1SV73 CRISPR-associated protein, Cas5e family n=2 Tax=Gammaproteobacteria RepID=A1SV73_PSYIN Length = 217 Score = 119 bits (298), Expect = 7e-26, Method: Compositional matrix adjust. 
Identities = 81/216 (37%), Positives = 116/216 (53%), Gaps = 25/216 (11%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 LIL+ G M A+G TF+ R FPTRS ++G+LGA +GI R++ + L ALSE ++ A Sbjct: 4 LILKTEG-MSAYGLQTFDVHRRANHFPTRSAIMGILGAAMGITRENFNELYALSEQLKIA 62 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAR----EDYRGLKSHETIQTWREYLCDASFTVAL 120 V+ + +S + DYHTV R + +G+K T+REY CD+ T A+ Sbjct: 63 VQVN--------LSGEKMVDYHTVQHFRSPQGKIQKGVKP-----TYREYWCDSEHTFAI 109 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 H VI +L +V P +T + GR+SCPLT PLF +P AL N+ G Sbjct: 110 SAAEH---VIEKLVNSVKFPEFTLFQGRKSCPLTRPLFEAVTDDDNPANALKNHGE-QGQ 165 Query: 181 IYSEESVTGHHLKFTARDEPMIT-LPRQFASREWYV 215 I+S+ S RD +IT +PR++A R YV Sbjct: 166 IFSDISGDNQLAIVQVRD--LITAIPRKYAMRTVYV 199 >UniRef50_A6W168 CRISPR-associated protein Cas5 family n=6 Tax=Gammaproteobacteria RepID=A6W168_MARMS Length = 258 Score = 114 bits (285), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 70/165 (42%), Positives = 93/165 (56%), Gaps = 23/165 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ RL GPM +WGQP G R T PTRS +LGLLGA LGI+RDD L AL S Sbjct: 1 MKDYLVFRLYGPMASWGQPAVGGDRATAIAPTRSAILGLLGAALGIKRDDAQQLDALHSS 60 Query: 61 VQFAVRCDELILDDRRVSVTG-LRDYHTV-LGARED---YRGLKSH---------ETIQT 106 VQ A ++V+ T LRDYHT + +R + YR K+ TI + Sbjct: 61 VQMAT---------KQVTPTSLLRDYHTSQVPSRNNKYVYRTRKNELLDEHKEKLNTILS 111 Query: 107 WREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSC 151 R+Y CD + VA+ LT + + L++A++KP Y LGR+SC Sbjct: 112 TRDYRCDGIWIVAVSLTQESLFSLERLKQALIKPVYVLSLGRKSC 156 >UniRef50_Q1R114 CRISPR-associated protein, CT1976 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R114_CHRSD Length = 260 Score = 112 bits (279), Expect = 1e-23, Method: Compositional matrix adjust. 
Identities = 68/183 (37%), Positives = 96/183 (52%), Gaps = 20/183 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M +L+ RL PM +WG+ RPT +P R +LGL+GA LGI+RDD L +S Sbjct: 1 MTGHLVFRLYAPMASWGEAAVGEARPTATYPGRGAILGLIGAALGIRRDDDEGQLRLRQS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVL----GAREDYRGLKSH--------ETIQTWR 108 + AV+ +R LRDYHTV ++ +YR + TI + R Sbjct: 61 LGIAVK--------QRSPGWLLRDYHTVQVPPSQSKVNYRSRREELSVPKDALNTILSSR 112 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 +Y CD + VAL L P A + EL+ A+ +PR+T YLGR++CPL PL +A + Sbjct: 113 DYRCDGLWVVALRLMPDAVWTLDELKSALERPRFTLYLGRKACPLAAPLTPAIVEADHWR 172 Query: 169 KAL 171 AL Sbjct: 173 GAL 175 >UniRef50_Q314I4 CRISPR-associated protein, CT1976 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I4_DESDG Length = 245 Score = 111 bits (277), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 84/239 (35%), Positives = 114/239 (47%), Gaps = 19/239 (7%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YL ++ GP+QA+G RPT PTRS +LG+L A +GI+RD+ + L L + Sbjct: 1 MAQYLTFQIYGPLQAYGTVAVGEIRPTSTMPTRSAVLGILAAAIGIRRDEETRLAELRDG 60 Query: 61 VQFAVRCD---ELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFT 117 + AVR D +++LD + G R L R D L TI + REYL DA FT Sbjct: 61 YRVAVREDAPGKVMLDYHTIQTPGARGKRQ-LHCRRDELLLTEPNTILSRREYLMDALFT 119 Query: 118 VALWLTPHAT-MVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEP 176 V LW H + E+ +A+ PR+T LGR+SCP P + PQ+A+ Y Sbjct: 120 VCLWQANHTVPYSLQEIARALRSPRWTIGLGRKSCPPALPFAPKITDHTTPQEAVAAYPA 179 Query: 177 ---VGGDIYSEE------SVTGHH--LKFTARDEPMITLPRQFAS---REWYVIKGGMD 221 V + S + G H + T RD P+ RQFA RE V K D Sbjct: 180 DKLVSAGLRSPQVMRMLLDTEGPHTDTETTVRDVPLHHGRRQFAERKVRELLVRKAATD 238 >UniRef50_Q04QB7 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB7_LEPBJ Length = 247 Score = 100 bits (249), Expect = 3e-20, Method: Compositional matrix adjust. 
Identities = 58/167 (34%), Positives = 85/167 (50%), Gaps = 18/167 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ RL GP+ +WG RP+ FPT+S ++GL+ A G R + + L +S Sbjct: 1 MKDYLVFRLYGPLVSWGNIAVGEYRPSDSFPTKSAIIGLISASFGFDRSEDGKISELVKS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSH----------ETIQTWREY 110 V FA + L+ + LRDYHT+ R L + ETI + R+Y Sbjct: 61 VFFATKT----LNPGNL----LRDYHTIQSPGNVKRSLLTRKDELLDSEYVETILSSRDY 112 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 DA + VAL A + E+ A+L P +TPYLGR+SC + P+ Sbjct: 113 RVDAVYDVALSEKKRAPYSLKEIRNALLSPIHTPYLGRKSCSIALPM 159 >UniRef50_C7MTB0 CRISPR-associated protein Cas5 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTB0_SACVD Length = 255 Score = 100 bits (249), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 73/201 (36%), Positives = 101/201 (50%), Gaps = 24/201 (11%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 S L+LRLAGP+Q+WG+ + R T FPT SGLLGLL +G +R + SL+ L+ ++ Sbjct: 2 SGLLLRLAGPLQSWGERSTFDVRDTAGFPTHSGLLGLLACVMGRRRGE--SLEDLA-ALT 58 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGA--------REDYRGLKSHE-TIQTWREYLCD 113 F +R D T + DY T GA D +G + + T+QTWREYL D Sbjct: 59 FTIRVDR--------PGTRIIDYQTAGGALPPSMKVPTADGKGRPAGKGTVQTWREYLAD 110 Query: 114 ASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLN 173 A F VA+ + V+ ++ A+ P + PYLGRRSCP PL L DP L Sbjct: 111 AVFVVAVQ---GPSEVLDQVRHALRYPHWQPYLGRRSCPPDQPLLL-DVPVEDPVAELCT 166 Query: 174 YEPVGGDIYSEESVTGHHLKF 194 P+ + +E F Sbjct: 167 RVPLARRVGKDEETVPVDFIF 187 >UniRef50_B7KJ26 CRISPR-associated protein Cas5 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ26_CYAP7 Length = 215 Score = 97.4 bits (241), Expect = 3e-19, Method: Compositional matrix adjust. 
Identities = 56/160 (35%), Positives = 96/160 (60%), Gaps = 14/160 (8%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M L+LRLAGP+Q+WG+ + R T PT+SG++GL+ A +GI RD+ L L++ Sbjct: 1 MMKTLLLRLAGPLQSWGRGSRFDFRDTDTIPTKSGVIGLVAAAMGINRDNQVELAKLAQ- 59 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHET-IQTWREYLCDASFTVA 119 +R + + ++ V DYHTV+G + K H+ IQ++R+YLC+A F V Sbjct: 60 ----LRMGVCVEKEGKLVV----DYHTVIGTI--HADGKPHKAPIQSYRQYLCNAEFLVG 109 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL 159 L + + +++E+E + P++ +LGR++CP + P+F+ Sbjct: 110 LESSEYH--LLNEIEHYLCFPKWELFLGRKACPPSKPIFV 147 >UniRef50_A8LYZ7 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=A8LYZ7_SALAI Length = 257 Score = 97.1 bits (240), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 84/242 (34%), Positives = 114/242 (47%), Gaps = 49/242 (20%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRLAGPMQ+WG + R TG PTRS ++G++ A G R + L L+ VQF Sbjct: 4 LLLRLAGPMQSWGDHSTFSVRDTGTVPTRSAMIGIIAAAQGRHRGE--PLGDLAP-VQFT 60 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE---------TIQTWREYLCDAS 115 VR D T + D+HTV G R + + E TI + R YL DA Sbjct: 61 VRVDR--------PGTVMSDFHTVGGGAPPERTVPTAEGKRRTAGAGTIVSRRFYLADAV 112 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKAL---- 171 FTVA+ ++ ++ A+ P + PYLGRRSCP+ HP FL + DP L Sbjct: 113 FTVAVT---GPDDLVGQIHTALNNPVWGPYLGRRSCPVAHP-FLMSGPIPDPVGRLEHLP 168 Query: 172 LNYEPVGGDIYSEESV-----------TGHHLKFTARDEPMITLP-------RQFASREW 213 LN GD EE+V G + T D PM +P R++ +R+ Sbjct: 169 LNRRRPPGD---EETVRVDFVTGAPHGDGSISRMTLNDVPMEPVPGSPDPRRRRYLTRQV 225 Query: 214 YV 215 YV Sbjct: 226 YV 227 >UniRef50_C7MQD6 CRISPR-associated protein Cas5 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD6_SACVD Length = 236 Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 75/217 (34%), Positives = 108/217 (49%), Gaps = 22/217 (10%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+L L GPMQAWG + R T +PTRSG++G++ A LG Q D SL LS ++F Sbjct: 4 LVLHLDGPMQAWGHASQWDHRDTLDYPTRSGVIGMIAAALGKQWGD--SLDDLS-PLRFT 60 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGARE----DYRGLKSHETIQTWREYLCDASFTVAL 120 +R D RR+ DYHT G E +G + + R Y+ DA++TVA+ Sbjct: 61 IRIDR---PGRRIV-----DYHTAGGGYEVGIARVKGGNRAHAVLSDRFYMSDAAYTVAI 112 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY--EPVG 178 ++ ++ A+ P + P+LGRRSCP P LG DP + L + +P Sbjct: 113 ---TGPDTLLYRVDDALRAPVFGPFLGRRSCPPAGPWHLG-LHDGDPLRTLPLHRDKPRD 168 Query: 179 GDIYSEESVTGHHLKF-TARDEPMITLPRQFASREWY 214 GD + E V+ H T R + +T P +F R Y Sbjct: 169 GDTVAVEFVSDHETHGPTDRVDTTLTDPHEFGPRRSY 205 >UniRef50_Q1EQS9 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS9_STRKN Length = 280 Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 63/168 (37%), Positives = 88/168 (52%), Gaps = 20/168 (11%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRL+GP+Q+WG+ + R T RFPTRSG++G+L A LG +R E V Sbjct: 14 LLLRLSGPLQSWGERSHFNERDTARFPTRSGIIGMLAAALGRRR---------GEPVDDL 64 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE---------TIQTWREYLCDAS 115 R + DR + LRD HTV G + + E T+ T R YL DA+ Sbjct: 65 ARLSLTVRTDRPGIL--LRDLHTVGGGLPAKATVTTAEGKKRPGTTGTLLTHRTYLADAA 122 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ 163 FT+AL TP ++ + +A+ P + +LGRRSCP PL LG + Sbjct: 123 FTIALTSTPDDRPLLDQAAQALNTPCWPLFLGRRSCPPEGPLLLGASE 170 >UniRef50_Q2RY19 CRISPR-associated protein, Cas5e family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY19_RHORT Length = 261 Score = 94.0 bits (232), Expect = 3e-18, Method: Compositional matrix adjust. 
Identities = 74/234 (31%), Positives = 102/234 (43%), Gaps = 32/234 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 +R +L+ RL GPM AWG R T P +S +LGLL A LGI R D ++ +AL Sbjct: 3 VRDFLVFRLVGPMAAWGDIAVGERRGTWDVPAKSAILGLLAAGLGIDRADRTAHEALDRG 62 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTV----------LGAREDYRGLKSHETIQTWREY 110 + FAVR D LRDYHT R D T+ + R Y Sbjct: 63 LGFAVRQDR--------PGRLLRDYHTAQAPKARKNARWSTRRDELNDDDLNTVLSDRLY 114 Query: 111 LCDASFTVALWLTPHATM-VISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 +A T A+W + +L +A+L+PR+TPYLGR++CPL P QA Sbjct: 115 RTNAIATPAIWRRQGTEGPTLDQLTQALLRPRFTPYLGRKACPLGWPPRPRLLQADGLLA 174 Query: 170 ALLNYEPVGGDIYSE--ESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 A Y+ D + ++ G T R P+ W+ I G+D Sbjct: 175 AFDAYDSAEWDAARQFHKAYPGGWPGDTDRPTPV-----------WFEIAAGLD 217 >UniRef50_C2BET8 CRISPR-associated protein n=2 Tax=Firmicutes RepID=C2BET8_9FIRM Length = 244 Score = 93.2 bits (230), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 59/183 (32%), Positives = 99/183 (54%), Gaps = 17/183 (9%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 ++L+L GPMQ+WG + TR + +P++SG++G++ A G +RD+ +Q L++ + FA Sbjct: 7 ILLKLTGPMQSWGTSSRFETRTSDYYPSKSGVIGIIAASFGYERDEDEKIQKLND-LDFA 65 Query: 65 VRCD-ELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLT 123 VR D E +L +DYH AR+ + T T R Y+ DA F VA ++ Sbjct: 66 VRVDQEGVLK---------KDYHI---ARKVKPNGELERTYVTNRYYMEDAVFVVA--IS 111 Query: 124 PHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYS 183 + E+ + + P + P++GRRSCPL LGT + P +AL N + D + Sbjct: 112 HEDDKWMEEILQGLKYPYFQPFMGRRSCPLPARFILGTNEEG-PIEALENLDWQAADWFK 170 Query: 184 EES 186 +++ Sbjct: 171 KKN 173 >UniRef50_A5UR14 CRISPR-associated protein, Cas5e family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR14_ROSS1 Length = 262 Score = 91.7 bits (226), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 71/202 (35%), Positives = 96/202 (47%), Gaps = 45/202 (22%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L LRL GP+Q+WG G R T PT+SG++GLLG LG++RDD + L+ LS++++ Sbjct: 4 LFLRLEGPLQSWGLRARWGERDTTDAPTKSGVIGLLGCALGLRRDD-ARLRDLSDNLRMG 62 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAR--------------EDYRG------------- 97 VR D + +RDYHT G R E Y G Sbjct: 63 VRVD--------LPGILMRDYHTTGGGRYSTIASTGGPRYHDEPYIGGVLSAEVTKGRIK 114 Query: 98 ------LKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSC 151 ET + R YL DASF VAL +P I EL A+ P + +LGR++C Sbjct: 115 VKINQKTGEPETDVSERYYLADASFLVALQGSPD---YIGELATAIQSPVWPLFLGRKAC 171 Query: 152 PLTHPLFLGTCQASDPQKALLN 173 + P+F GT Q + AL N Sbjct: 172 VPSTPIFAGTGQFDILEDALKN 193 >UniRef50_Q12YA8 CRISPR-associated protein, CT1976-like n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YA8_METBU Length = 244 Score = 91.7 bits (226), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 63/194 (32%), Positives = 95/194 (48%), Gaps = 28/194 (14%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YLI RL GPM +WG RPT P++S + GL+ A LGI+RD+ LS + Sbjct: 1 MKEYLIFRLYGPMASWGDIAVGQHRPTYDHPSKSAIFGLIAAALGIRRDEEERHLELSNA 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHT-------------VLGAREDYRGLKSHE--TIQ 105 + LI ++ LRDYHT R+D + E T+ Sbjct: 61 YSYGT----LINSAGKL----LRDYHTSQVPSAGTGRNRKTFATRKDELAVPKEELNTVL 112 Query: 106 TWREYLCDASFTVALWL---TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTC 162 + R+Y CD +TV L TP ++ + L ++ +P + YLGR+SCPL P+ Sbjct: 113 STRDYYCDGVYTVILSCKTDTPPYSLEL--LGNSLKEPSFCLYLGRKSCPLALPINPKIV 170 Query: 163 QASDPQKALLNYEP 176 AS+ ++AL + +P Sbjct: 171 SASNIKEALQSVDP 184 >UniRef50_D1A6Q5 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=D1A6Q5_THECD Length = 273 Score = 90.9 bits (224), Expect = 3e-17, Method: Compositional matrix adjust. 
Identities = 67/179 (37%), Positives = 94/179 (52%), Gaps = 25/179 (13%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR-DDTSSLQALSESVQF 63 L+L L+GP+Q+WG+ + R T PTRSGL+G++ A G +R + + L+AL +F Sbjct: 9 LLLHLSGPLQSWGERSRFNQRDTATAPTRSGLIGMIAAAFGRRRTEPVTDLRAL----RF 64 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGA---------REDYRGLKSHETIQTWREYLCDA 114 VR D T LRD+HTV G E R T+ + R YL DA Sbjct: 65 TVRIDR--------PGTLLRDFHTVGGGMPRDLTVITAEGKRRAADTATVTSDRYYLQDA 116 Query: 115 SFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLN 173 +FTVA +T ++ +A+ PR+ YLGRRSCP PL L T +DP AL++ Sbjct: 117 AFTVA--VTADDPALLDRCAQALRAPRWPLYLGRRSCPPNAPLLL-TVLRTDPVTALID 172 >UniRef50_A8SDR7 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDR7_9FIRM Length = 220 Score = 89.7 bits (221), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 68/172 (39%), Positives = 95/172 (55%), Gaps = 20/172 (11%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRLA P+QAWG + TR TGR PT+SG++GLL A LG++RD++ +L L+ ++F Sbjct: 4 LLLRLAAPLQAWGADSKFETRKTGREPTKSGVIGLLAAALGLRRDESEALTRLT-GLRFG 62 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 VR + L DYHT E + T+R YL DA F + T Sbjct: 63 VRVER--------EGQLLVDYHTAKTQDE-------KTSYVTYRHYLQDAVFLAGIEST- 106 Query: 125 HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEP 176 T ++ +L++A+L P + YLGRR CP T PL LG C S + +L EP Sbjct: 107 -DTALLQQLQQALLHPAFPLYLGRRCCPPTLPLCLGVCPGS--LQEVLQAEP 155 >UniRef50_A8ZZ17 CRISPR-associated protein Cas5 family n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZ17_DESOH Length = 259 Score = 89.7 bits (221), Expect = 6e-17, Method: Compositional matrix adjust. 
Identities = 60/166 (36%), Positives = 83/166 (50%), Gaps = 20/166 (12%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL+ RL GPM +WG+ TR T +P RS ++GL+ A LGI+R +T + QAL + Sbjct: 3 YLLFRLYGPMASWGEIAVGETRHTANYPGRSAIIGLMAAALGIKRSETENQQALDQGCLI 62 Query: 64 AVRCDELILDDRRVSVTGLRDYHT----------VLGARED--YRGLKSHETIQTWREYL 111 AV R + LRDYHT V R D G TI + REY Sbjct: 63 AVEA--------RSHGSLLRDYHTTQVPDSVGGFVYRTRRDELIIGKPRLGTILSSREYR 114 Query: 112 CDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 DA A+ + P A + ++ + +PR YLGR+SCPL+ P+ Sbjct: 115 QDALAVSAVRVLPGARYELQTIKTHLEQPRLHVYLGRKSCPLSAPM 160 >UniRef50_B4TTX2 CRISPR-associated protein Cas5 n=15 Tax=Enterobacteriaceae RepID=B4TTX2_SALSV Length = 241 Score = 89.0 bits (219), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 67/169 (39%), Positives = 89/169 (52%), Gaps = 19/169 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ +L P+ +WG+ R + PTRS LLGLL A LGI+RD+ + L + Sbjct: 1 MKEYLVFQLYAPLASWGEEASGEIRHSATVPTRSALLGLLAAALGIRRDEEARLNNFNRH 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGARED--YRGLKSHE----------TIQTWR 108 AV L DR LRDYHTV RE+ YR + T+ + R Sbjct: 61 YHLAVHA--LASQDR-----WLRDYHTVSAPRENKKYRYYTRRDELTLAPDEVGTLISQR 113 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 EY CD + VA+ TP A +SEL +A+L P + YLGR+SCPL PL Sbjct: 114 EYRCDGYWHVAISATPDAPHSLSELREALLTPHFPLYLGRKSCPLALPL 162 >UniRef50_D0Y918 CRISPR-associated protein Cas5 family n=2 Tax=Dehalococcoides RepID=D0Y918_9CHLR Length = 205 Score = 88.6 bits (218), Expect = 1e-16, Method: Compositional matrix adjust. 
Identities = 57/153 (37%), Positives = 89/153 (58%), Gaps = 17/153 (11%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L++RL GPMQ+WG + R T PTRSG++GL+ A +GI RD+ A + ++ Sbjct: 7 LLMRLEGPMQSWGYRSRFDCRDTALEPTRSGVIGLICAAMGIARDEDI---ARFDGIRMG 63 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGA-REDYRGLKSHETIQTWREYLCDASFTVALWLT 123 VR D D +V +DYHT L + D G +T+ ++R+YL DASFTV L + Sbjct: 64 VRVDR----DGKVE----QDYHTALDVIKADGSG---KDTVVSYRDYLTDASFTVGLESS 112 Query: 124 PHATMVISELEKAVLKPRYTPYLGRRSCPLTHP 156 ++ ++ KA++ P++ +LGR++ PLT P Sbjct: 113 DRN--LLEKIAKALVSPQWVLFLGRKAFPLTKP 143 >UniRef50_D2L2X8 CRISPR-associated protein Cas5 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X8_9DELT Length = 266 Score = 86.7 bits (213), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 67/197 (34%), Positives = 91/197 (46%), Gaps = 33/197 (16%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YLI +L G + A+G R + PTRS + GLL ACLGI+R + + L ALS Sbjct: 1 MARYLIFQLYGMLAAYGLVAVGEVRLSAGHPTRSAVFGLLAACLGIRRHEEARLAALSGG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL---KSHE------------TIQ 105 AVR D T L DYHT+ E + + ++ E T+ Sbjct: 61 YALAVRVD--------APGTSLLDYHTIQTPPEKSKRIYRTRADELGGLLGIDEPPYTVL 112 Query: 106 TWREYLCDASFTVALWLTPHAT--------MVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 + R YLCDA FT LTP A + L +A+ +P TPYLGR+SCP + P Sbjct: 113 SRRGYLCDAHFTA--CLTPAAAPPTDATPPHTLEALAEALRRPVLTPYLGRKSCPPSLPF 170 Query: 158 FLGTCQASDPQKALLNY 174 + + AL +Y Sbjct: 171 HPRLGEYDSLEAALADY 187 >UniRef50_Q0W584 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W584_UNCMA Length = 227 Score = 86.7 bits (213), Expect = 6e-16, Method: Compositional matrix adjust. 
Identities = 55/150 (36%), Positives = 74/150 (49%), Gaps = 15/150 (10%) Query: 11 GPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDEL 70 GPMQ+WG R TG PT+SG++GLLG LG R D L ++ + Sbjct: 13 GPMQSWGLKARWDIRDTGDEPTKSGIIGLLGCALGYARKDPRLTDELDSQLRIGI----- 67 Query: 71 ILDDRRVSVTG--LRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATM 128 RV G RDYHTV G G TI ++R+YL DA+F V L Sbjct: 68 -----RVECPGEIARDYHTVSGELRTAEGKLRETTIVSFRDYLQDAAFLVVL---EGPGE 119 Query: 129 VISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 +++ + A+ P + YLGR+SCP T P+F Sbjct: 120 LLTRISNALKDPVWPIYLGRKSCPPTRPVF 149 >UniRef50_D1CGD4 CRISPR-associated protein Cas5 family n=6 Tax=Bacteria RepID=D1CGD4_THET1 Length = 230 Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 68/207 (32%), Positives = 105/207 (50%), Gaps = 34/207 (16%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L++RL+GPMQ+WG + R TGR P++SG++GL+ A LG R T+ + L ++ Sbjct: 4 LLMRLSGPMQSWGTQSRFTVRDTGREPSKSGVIGLICAALG--RPRTAPVDDLVR-LRMG 60 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYR-GLKS----HETIQTWREYLCDASFTVA 119 VR D R + +RDYHT GA R G+ + +Q+ R YL DASF VA Sbjct: 61 VRVD-------REGIV-MRDYHTAGGAPAGERYGVATVTGDQRPVQSSRYYLADASFLVA 112 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT-------CQASDPQKALL 172 L ++ ++++A+ PR+ +LGR+SC + P+ L + D + AL+ Sbjct: 113 LEGGEEDRPLLEQIDEALRAPRWQLFLGRKSCVPSEPIHLPKEPPLGPPIREEDLRTALI 172 Query: 173 NYE-PVGGDIYSEESVTGHHLKFTARD 198 +Y P G H L+F D Sbjct: 173 SYPWPEG----------AHRLRFVFED 189 >UniRef50_Q47PI7 CRISPR-associated protein, Cas5e family n=12 Tax=Actinomycetales RepID=Q47PI7_THEFY Length = 245 Score = 84.0 bits (206), Expect = 3e-15, Method: Compositional matrix adjust. 
Identities = 66/173 (38%), Positives = 89/173 (51%), Gaps = 25/173 (14%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR-DDTSSLQALSESVQF 63 L L LAGP+QAWG + R T PT+SG+LGLL A G +R DD S L AL +F Sbjct: 4 LTLLLAGPLQAWGAASRFTRRTTEHAPTKSGVLGLLAAAQGRERTDDLSDLAAL----RF 59 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQ-TWREYLCDASFTVALWL 122 VR D+ T +RD+ T + L + +++ + R YL DA F A+ Sbjct: 60 GVRVDQ--------RGTRIRDFQTAI-------HLDTGKSMPVSERFYLADAVFVAAVE- 103 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYE 175 +I L +AV P Y PYLGRRSCP + P+ LG + P + +L E Sbjct: 104 --GEDTLIDTLHQAVQHPVYLPYLGRRSCPPSRPINLG-VHSGKPLEQVLAEE 153 >UniRef50_A7BA63 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA63_9ACTO Length = 242 Score = 82.8 bits (203), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 65/165 (39%), Positives = 86/165 (52%), Gaps = 22/165 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+LRLAGPMQ+WG + R T FPT+S L+GLLGA G +R D ++ L+E Sbjct: 1 MSAVLVLRLAGPMQSWGADSRFTRRSTEAFPTKSALVGLLGAAQGRRRSD--PIEDLAE- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 + AVR D+ L D+HT R D SH R Y DA+F A Sbjct: 58 LSVAVRVDQ--------PGQLLHDFHT--AHRGDTSMPLSH------RFYRADAAFG-AF 100 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 P +I L +A+++P + YLGRRSCP T PL L + S Sbjct: 101 IEGPDD--MIDALAQAIVRPVFPLYLGRRSCPPTLPLRLAVREGS 143 >UniRef50_D1NTI1 CRISPR-associated protein Cas5 n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI1_9BIFI Length = 250 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. 
Identities = 62/174 (35%), Positives = 92/174 (52%), Gaps = 15/174 (8%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 S LILRLAGPMQ+WG + R T PT+S ++GLL + G +R+D S++ L ++ Sbjct: 2 SVLILRLAGPMQSWGDSSRFNRRETRTEPTKSAVIGLLASAQGRRRED--SIEDLL-GLR 58 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWL 122 F VR D+ R+ +RD+ T G S T R YL DA F VA+ Sbjct: 59 FGVRSDQ----PGRI----MRDFQTEKSIARKKSGEFSLTMPLTHRYYLADAKFLVAI-- 108 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEP 176 ++ L+ A+ P++ +LGRRSCP P+ LG ++ ++A L+ EP Sbjct: 109 -EGERSLLESLDAALRNPQWPLFLGRRSCPPASPVSLGVKDYANVEEA-LDKEP 160 >UniRef50_C8XAY4 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=C8XAY4_NAKMY Length = 252 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 71/210 (33%), Positives = 98/210 (46%), Gaps = 22/210 (10%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 S L+LRL GPMQ+WG+ + R T P++S ++GLL A LG +R D +++ L+ + Sbjct: 2 SVLVLRLTGPMQSWGERSRYARRETAAEPSKSAIVGLLAAALGRRRTD--AIEDLAGLI- 58 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQ-TWREYLCDASFTVALW 121 F VR D+ T LRD+ T R L T+ + R YL DA F A+ Sbjct: 59 FGVRVDQ--------PGTLLRDFQTA-------RSLDGARTMPLSERYYLSDARFLAAVE 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 P + +I+ L A+ P + YLGRRSCP + P+ C S P A L EP Sbjct: 104 -GPES--LIAGLAGALRDPTFPLYLGRRSCPPSEPIAQQDCIRSGPLLAALFDEPWHATK 160 Query: 182 YSEESVTGHHLKFTARDEPMITLPRQFASR 211 V A D +P Q A R Sbjct: 161 SYRRRVADPARLSIAVDAAATEVPAQLAER 190 >UniRef50_Q2JH27 CRISPR-associated protein, CT1976 n=6 Tax=Actinomycetales RepID=Q2JH27_FRASC Length = 276 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 69/187 (36%), Positives = 89/187 (47%), Gaps = 37/187 (19%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 R L+LRLAGP+Q+WG + R T PT+SG++GLL A G +R T ++ L S+ Sbjct: 7 RHCLVLRLAGPLQSWGSRSMFNRRDTLTEPTKSGIIGLLAAAQGRRR--TDPIEDLL-SL 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL-----------------KSHETI 104 +R D+ T LRDYHTV DYRG + T Sbjct: 64 TLGIRTDQ--------PGTLLRDYHTV----SDYRGRPLPSAAVSAKGLQKPTSPAKHTH 111 Query: 105 QTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 T R YL DA F AL V++ L A+ P + LGRR+CP THPL L Sbjct: 112 VTERFYLQDAVFVAALAA---PEPVLTTLADALRTPAFPLALGRRACPPTHPLLL--VPD 166 Query: 165 SDPQKAL 171 S+P AL Sbjct: 167 SEPDAAL 173 >UniRef50_C7LYW6 CRISPR-associated protein Cas5 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW6_ACIFD Length = 253 Score = 81.6 bits (200), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 67/176 (38%), Positives = 86/176 (48%), Gaps = 31/176 (17%) Query: 3 SYLILRLAGPMQAWGQPT-FEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L LRL GP+QAWG + R T RFPT+SG++GLL A LG R +++L L ++ Sbjct: 2 SVLALRLGGPLQAWGSSQRLDHYRRTERFPTKSGVIGLLAAALGRPR--SAALDDLG-AL 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE----------------TIQ 105 +FAVR D R V LRD+HT+ +D + E T Sbjct: 59 RFAVRID------RPGEV--LRDFHTLSSLFDDKKRFAPGEGRLPTASGGYRSAATSTQV 110 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT 161 T R YL DA F L + EL+ A+ P + YLGRRSCP PL LG Sbjct: 111 TERFYLADACFVAGL---EGDAAQLQELDDALRTPVFPLYLGRRSCPPDKPLRLGV 163 >UniRef50_B0S4B6 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B6_FINM2 Length = 228 Score = 80.1 bits (196), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 66/239 (27%), Positives = 107/239 (44%), Gaps = 35/239 (14%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 S ++L+ A P+Q+WG R T +PT+S ++GL+ A G ++ DT S++ L+ S+ Sbjct: 2 SVILLKFASPLQSWGGLANYEIRNTEYYPTKSAVIGLVAAAFGYKKTDTESIKRLN-SLN 60 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDY-----RGLKSHETIQTWREYLCDASFT 117 F+VR D+ + +RD+ + Y IQ + Y+ DA F Sbjct: 61 FSVRIDQ--------KGSLIRDFQIAMEYNPKYMPNDPNYFVKSNLIQKY--YIQDAKFL 110 Query: 118 VALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ-------------A 164 +A L+ ++ ++ A+ P Y +LGR+S P+ +G A Sbjct: 111 IA--LSSDDETLMEDVYNALESPAYQLFLGRKSNPINADYLIGKFDGNELEIIKDYEWLA 168 Query: 165 SDPQKALLNYEPVGGDIYSEESVTGHHLKFTARD--EPMITLPRQFASR-EW-YVIKGG 219 S K + + V I+S+ TG K RD E R F+SR E+ YV K G Sbjct: 169 SKWYKKSIKKDSVELSIFSDYIDTGSKEKLIRRDLTESFENTKRDFSSRFEYRYVTKVG 227 >UniRef50_B2GBJ9 Putative uncharacterized protein n=1 Tax=Lactobacillus fermentum IFO 3956 RepID=B2GBJ9_LACF3 Length = 235 Score = 79.7 bits (195), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 60/173 (34%), Positives = 85/173 (49%), Gaps = 31/173 (17%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR--DDTSSLQALSESVQ 62 L++R+A P+Q++G P R T R P++S ++G++GA LG +R DD SL L Sbjct: 4 LVIRIAAPLQSYGDPASFEKRTTFRAPSKSAVIGMIGAALGFRRESDDYKSLNDLD---- 59 Query: 63 FAVRCDE--LILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 FAVR D+ +L D +++ L+ G SH R YL DA F VAL Sbjct: 60 FAVRVDQPGEVLSDFQITHYSLKK-----------PGKLSH------RIYLQDAVFMVAL 102 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLN 173 A M E+E A+ P++ Y GRRS P L + C P K +N Sbjct: 103 SSKQDALM--EEIEYALRHPKFQLYFGRRSNPPAGILKMKMC----PDKTAIN 149 >UniRef50_Q5YRB6 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5YRB6_NOCFA Length = 235 Score = 79.0 bits (193), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 55/173 (31%), Positives = 87/173 (50%), Gaps = 19/173 (10%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 + L+LRLA P+Q+WG + R T ++P++SG+LGL+ A G +R D ++ +++ Sbjct: 2 TVLLLRLAAPLQSWGVASRFARRETQQYPSKSGILGLIAAARGHRRTD--PIEEALQNLA 59 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWL 122 F VR D+ +RD+ L K+ + + R YL DA F A+ Sbjct: 60 FGVRVDQ--------PGRLIRDFQVALNID------KTKQFPLSQRYYLADAVFLAAI-- 103 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYE 175 +I + A+ +P + YLGRRSCP+T PL LG + + AL E Sbjct: 104 -QGERGLIEGIGNALRRPEFPLYLGRRSCPVTGPLVLGEPRDVTLEHALHETE 155 >UniRef50_B3E5U9 CRISPR-associated protein Cas5 family n=2 Tax=Desulfuromonadales RepID=B3E5U9_GEOLS Length = 271 Score = 79.0 bits (193), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 81/239 (33%), Positives = 111/239 (46%), Gaps = 40/239 (16%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL+ RL GP+ +WG+ +R + +P +S LLGL+ A LGI+RD+ AL+ +F Sbjct: 3 YLLFRLYGPLASWGEIAVGESRHSAVYPGKSALLGLIAAALGIRRDEEQRQAALASGYRF 62 Query: 64 AVRCDELILDDRRVSVTG--LRDYHT----------VLGARED--YRGLKSHETIQTWRE 109 AV +V TG LRDYHT V R D G + TI + RE Sbjct: 63 AV----------KVISTGHPLRDYHTAQAPDSVGKFVYRTRRDELVLGKERLGTILSSRE 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y CDA VA+ A + E+ +A++KPR+ YLGR+SCP+ PL A Sbjct: 113 YRCDAFSLVAVVAEDDAPYSLDEIREALMKPRFHLYLGRKSCPVAAPLNPLVRDAVGFGD 172 Query: 170 ALLNYEPVGGDIYS--------EESVTGHHL-------KFTARDEPMITLPRQFASREW 213 AL +Y P G S +E V G L KF+ DE + +Q W Sbjct: 173 ALDSY-PYGALFVSSWLMKTAQKEIVEGGKLAEVPSLAKFSREDETVFAYNKQPVRYYW 230 >UniRef50_A3EQA4 CRISPR-ssociated protein, Cas5 n=3 Tax=Bacteria RepID=A3EQA4_9BACT Length = 227 Score = 78.6 bits (192), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 55/177 (31%), Positives = 88/177 (49%), Gaps = 22/177 (12%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L++RL PMQ+WG + R TG+ P++SG++GLL A LGI R++ L+ L+ + Sbjct: 4 LLIRLVSPMQSWGTSSRFDQRDTGKEPSKSGVIGLLAAALGIDRNNWDDLEPLA-GLSMG 62 Query: 65 VRCDELILDDRRVSVTGL--RDYHT---VLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 VR D G+ RDY T ++ A K H T T+REYL DA F V Sbjct: 63 VRHDR----------PGIPRRDYQTASKIISADHS----KIHPTAVTYREYLADAVFLVG 108 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEP 176 ++ ++ A+ P + +LGR+S + P+++ + P + L + P Sbjct: 109 --FESAEVSLLEKINSALKNPVWPLFLGRKSYVPSEPIWIENGLKNVPLREALEHFP 163 >UniRef50_B0TDU1 Crispr-associated protein cas5 n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TDU1_HELMI Length = 230 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 56/159 (35%), Positives = 82/159 (51%), Gaps = 17/159 (10%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L LRL GP+Q+WG + R + PT+SG++GLLG LG R+D L++L +++ Sbjct: 6 LALRLEGPLQSWGSRSRWDYRDSALEPTKSGIIGLLGCALGWSRND-KRLESLDAALRLT 64 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE-----TIQTWREYLCDASFTVA 119 VR D+ T L D+HTV G G + T+ + R YL +ASF Sbjct: 65 VRIDK--------PGTPLIDFHTVQGYLLMAEGKQKKSGNDMYTVVSRRVYLQEASF--- 113 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 L L + + +KA+ P + +LGR+SCP PLF Sbjct: 114 LALLTGEQGALHQCKKALNDPVWPVFLGRKSCPPARPLF 152 >UniRef50_C9M9R7 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R7_9BACT Length = 243 Score = 77.0 bits (188), Expect = 4e-13, Method: Compositional matrix adjust. 
Identities = 56/176 (31%), Positives = 90/176 (51%), Gaps = 11/176 (6%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 +L+LR GPM ++G + RP RFP S + GL+ LG +T LQAL + + Sbjct: 3 FLVLRFRGPMMSFGDVAVDEQRPIDRFPGVSMVTGLVANALGWDWSETEKLQALQDRLVL 62 Query: 64 AVRCD---ELILDDRRVSV---TGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFT 117 AVR D E + + + V++ +GL H++ + R L T+Q + Y ++ T Sbjct: 63 AVREDRAGERLREYQTVALPGKSGLFVTHSIPCS----RNLDKPMTVQKYLSYWANSLIT 118 Query: 118 VALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTC-QASDPQKALL 172 + LT T + E+ A+ KP +LGR++C T P+F G A P++A+L Sbjct: 119 CFIALTGLGTPTLDEIACALKKPARPLFLGRKTCLPTEPVFRGEIFGAESPEEAVL 174 >UniRef50_A5GBK2 CRISPR-associated protein Cas5 family n=2 Tax=Deltaproteobacteria RepID=A5GBK2_GEOUR Length = 231 Score = 76.6 bits (187), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 55/170 (32%), Positives = 87/170 (51%), Gaps = 12/170 (7%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 +S+L LRL GP+Q+WG + R TG PT+S + G+ A LG R + L V Sbjct: 10 KSFLALRLEGPLQSWGFDSQYNRRNTGLMPTKSAIAGMCCAALGFLRGCDKEQEFL---V 66 Query: 62 QF-AVRCDELIL----DDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASF 116 F AVR + + + + V L+DYHTV R G +++ + T R+YL DA+F Sbjct: 67 AFGAVRMTAIAIPRNGAKKELPVRRLQDYHTVQNTRR-ASGAINNDCVLTHRQYLTDAAF 125 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASD 166 V L + ++ ++ A+ P + +LGR++C T P+ G + D Sbjct: 126 GV---LLEGDSTLLKQIAAALENPVWGVWLGRKTCIPTAPVLAGLRENRD 172 >UniRef50_D1Y486 Crispr-associated protein Cas5 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y486_9BACT Length = 269 Score = 76.3 bits (186), Expect = 7e-13, Method: Compositional matrix adjust. 
Identities = 68/226 (30%), Positives = 97/226 (42%), Gaps = 36/226 (15%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 LILRL GP+ A+G + RPT P S + GL+ LG D LQ L E ++ A Sbjct: 4 LILRLRGPLMAFGDVAVDEIRPTDLLPGLSEMTGLIANALGWTFQDVEKLQRLQERLRLA 63 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGARED--YR----------GLKSHETIQTWREYLC 112 R D R V LRDY T + ED +R G K T+Q +R Y Sbjct: 64 SRED-------RTGVP-LRDYQTARLSSEDSLWRTDGIVAERGGGSKGEFTVQRYRHYRA 115 Query: 113 DASFTVALWLTP-HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKAL 171 DA+ TV + L P + E+ A+ P ++GR CP + P+ C Sbjct: 116 DAAVTVLIALDPADEAPALEEIRDALRHPARPLFIGRIGCPPSQPI----C--------- 162 Query: 172 LNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK 217 +EP G +I +S+ + T R R+ ASR +++ Sbjct: 163 --FEPEGREIIHTDSLKDAIMHITPRAPLGAQTKREPASRNKVLVE 206 >UniRef50_D0WFC8 CRISPR-associated protein Cas5 n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC8_9ACTN Length = 267 Score = 76.3 bits (186), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 53/161 (32%), Positives = 89/161 (55%), Gaps = 17/161 (10%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 + L+L+LA P+Q+WG + R T P++SG++GLL A LG +R+D+ A +++ Sbjct: 5 TVLLLKLAAPLQSWGASSRFTERTTRHEPSKSGVIGLLAAALGRRREDSVDDLA---ALR 61 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAR--EDYRGLKSHETIQ-TWREYLCDASFTVA 119 FAVR D+ + +RD+ T + D R +E++ + R+YL DA F A Sbjct: 62 FAVRIDQ--------PGSFMRDFQTEHTRKWDSDTRRFVFNESLSLSKRDYLSDAVFVAA 113 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLG 160 L +++E +A+ P + +LGRRSCP + ++LG Sbjct: 114 L---EGDEDLLAECAEALHHPAFPLFLGRRSCPPSTQVYLG 151 >UniRef50_B1VIY0 CRISPR-associated protein n=9 Tax=Actinomycetales RepID=B1VIY0_CORU7 Length = 240 Score = 76.3 bits (186), Expect = 8e-13, Method: Compositional matrix adjust. 
Identities = 56/175 (32%), Positives = 94/175 (53%), Gaps = 18/175 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M L+L L GPMQ+WG + R T PT+SG++GL+ A G +R T ++ L++ Sbjct: 1 MAHSLLLLLKGPMQSWGDESRFSVRATATTPTKSGIVGLIAAAQGRRR--TDGVEDLAK- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 ++ AVR D+ S + LRDY T A+ + ++ ++ T R +L DA+F A Sbjct: 58 LRMAVRVDQ--------SGSLLRDYQT---AQPWLKNPGANASLVT-RYFLSDAAFVAA- 104 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYE 175 + ++ ++ +A+ +P Y Y+GRRSCP+ L +G D + AL ++ Sbjct: 105 -VESEDRELLDQMAEALRRPAYPLYMGRRSCPVHPGLVIGVVDG-DAESALRAHD 157 >UniRef50_D2RB02 CRISPR system CASCADE complex protein CasD n=3 Tax=Actinobacteria (class) RepID=D2RB02_GARVA Length = 291 Score = 76.3 bits (186), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 58/176 (32%), Positives = 86/176 (48%), Gaps = 16/176 (9%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQALSE 59 M+S L+L+ +GP+Q+WG + TR T +P++S ++G++ A G +R D A Sbjct: 1 MKS-LLLKFSGPLQSWGTDSHFETRHTDYYPSKSAVIGMIAAAFGYRRSTDCDENIAKLN 59 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 + FAVR D+ LRDYH + A+ G + T R YL DA F VA Sbjct: 60 DLDFAVRIDQ--------QGNLLRDYH--IAAKYKANG-DFEKNYVTNRYYLEDAIFLVA 108 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYE 175 + + +I ++ A+ P + LGRRS P T LG Q ALL +E Sbjct: 109 --IGSNNEQLIYDISNALRSPYFQSSLGRRSLPPTADFILGVEDCGVIQ-ALLTHE 161 >UniRef50_C9M2Y8 CRISPR-associated protein n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9M2Y8_LACHE Length = 241 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 47/165 (28%), Positives = 82/165 (49%), Gaps = 16/165 (9%) Query: 7 LRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVR 66 +RL P+Q++G R + +P++S ++G++ A LG +RDD LQ ++ FAVR Sbjct: 6 IRLTAPLQSYGNQASFNQRTSDNYPSKSAVIGIIAAALGYRRDDARILQL--NNLLFAVR 63 Query: 67 CDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHA 126 ++ S + ++ TV + + + T+RE++ DA F VA + Sbjct: 64 IEQ--------SGNMMTEFQTVEYQKSSTKTARKL----TYREFIQDAVFMVA--IGSDN 109 Query: 127 TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKAL 171 I ++ A+ P++ YLGRRS P PL + T +P + L Sbjct: 110 DHEIEKIVSALKHPKFQLYLGRRSNPPAGPLMIETYDEENPLQVL 154 >UniRef50_UPI0001AF1D4C CRISPR-associated protein, CT1976 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4C Length = 244 Score = 73.6 bits (179), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 54/153 (35%), Positives = 78/153 (50%), Gaps = 13/153 (8%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L++ PMQ+WG + +R T PT+SG++GLL A LGI RD +Q L+E ++ Sbjct: 4 LLMCFDAPMQSWGTRSQFASRDTATEPTKSGVVGLLAAALGIPRDADEEIQNLAE-LRMG 62 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 VR DR V D+HTV + G K+H T T R YL DA F V + Sbjct: 63 VRV------DREGVVEA--DFHTVQNV-PNTEG-KNHRTAVTKRFYLADALFLVG--VES 110 Query: 125 HATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 T ++ +L A+ PR+ Y GR++ P+ Sbjct: 111 DDTQLLHQLHTALTAPRWPLYFGRKAFVPARPI 143 >UniRef50_A8M404 CRISPR-associated protein Cas5 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8M404_SALAI Length = 238 Score = 72.8 bits (177), Expect = 7e-12, Method: Compositional matrix adjust. 
Identities = 56/160 (35%), Positives = 82/160 (51%), Gaps = 26/160 (16%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE--SVQ 62 L+LRLAGP+Q+WG + R T PT+SG++G+L A G++R D L+E S+ Sbjct: 2 LLLRLAGPLQSWGATSRFTHRHTQVTPTKSGVIGMLAAASGLRRTD-----PLTELLSLD 56 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQ-TWREYLCDASFTVALW 121 F VR D+ LRD+ R L +++ T R YL DA F VA+ Sbjct: 57 FGVRIDQ--------PGQLLRDFQVA-------RTLDGRDSMPLTNRYYLSDAVFLVAIG 101 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT 161 ++ L ++V +P + YLGRR+CP P+ LG Sbjct: 102 ---GDQALLEGLHESVRRPHFPLYLGRRACPPVAPISLGV 138 >UniRef50_B4UE71 CRISPR-associated protein Cas5 family n=2 Tax=Anaeromyxobacter RepID=B4UE71_ANASK Length = 246 Score = 71.2 bits (173), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 6/186 (3%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M LILR P+ A+G + G FP S + GL+ LG D L+AL Sbjct: 1 MLDALILRFDAPLLAFGGVAVDNHGEVGDFPGLSMVAGLIANALGYDHRDCDRLEALQRR 60 Query: 61 VQFAVRCD---ELILDDRRVSVTG--LRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 ++ AVR D + ++D + V++ L T GA E G S T +R Y DA Sbjct: 61 LRIAVRRDRSGQRLVDFQTVALGQPFLERGWTTRGAVEGRDGAFSDGTHIRYRAYWADAV 120 Query: 116 FTVALWLTPHA-TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY 174 +T+A+ L P A + + +E+A+ +P +LGR++C + P+ G Q AL ++ Sbjct: 121 YTLAVTLDPPAESPGLDAVERALREPERPLFLGRKACLPSVPILAGRLQIPSLLAALASF 180 Query: 175 EPVGGD 180 E V D Sbjct: 181 ERVSKD 186 >UniRef50_D0MET6 CRISPR-associated protein Cas5 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET6_RHOM4 Length = 252 Score = 70.9 bits (172), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 57/184 (30%), Positives = 90/184 (48%), Gaps = 25/184 (13%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LR P+ ++G P + +P S + GLL LG +T+ L+ L E +++A Sbjct: 4 LLLRFDAPLMSFGAPIVDQYGFIQPYPALSMMTGLLANALGYTHAETARLERLQERLRYA 63 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAR---EDYRGLKSHETIQT-------------WR 108 VR +DRR LRD+ TV ++ D R + T++T R Sbjct: 64 VR------EDRRGQ--QLRDFQTVDLSQPFLHDERAWTTRGTLETRQGGTASLGIHIRLR 115 Query: 109 EYLCDASFTVALWLT-PHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 +Y DA +TVAL L P +++LE+A+ P ++GR+ C PLF+G +A+D Sbjct: 116 DYWADAVYTVALTLDPPDEPPTLADLEQALRFPARPLFIGRKPCLPAAPLFIGRVEAADL 175 Query: 168 QKAL 171 AL Sbjct: 176 LDAL 179 >UniRef50_Q0BSC7 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC7_GRABC Length = 225 Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 62/204 (30%), Positives = 94/204 (46%), Gaps = 37/204 (18%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQALSESVQ 62 +L+ LA + + G+ R + +PTRS ++GL+GA LGI+RD D S+L LS Sbjct: 6 FLVFGLAASLGSMGELAGHERRGSLIWPTRSAIIGLMGAALGIERDGDFSALDVLS---- 61 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLG--------------AREDYRGLKSHETIQTWR 108 D I D + LRDYHT+ A D RG + T T R Sbjct: 62 ----IDVAIFD----AGAPLRDYHTIETIPSAAAKNPNSRPEALRDARGRTN--TAITHR 111 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 +Y + +A+ + + A+L+P +T Y+GR+SCPL P +A + Sbjct: 112 DYRTSVFYGIAV-----RGAGLERIVAALLEPHFTLYVGRKSCPLAAPTGAKIVEAVSAE 166 Query: 169 KALLNYEPVGGDIYSEESVTGHHL 192 AL E + ++ +ESV H L Sbjct: 167 AAL---EHLKAPLWRKESVKAHLL 187 >UniRef50_B8GIV3 CRISPR-associated protein Cas5 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV3_METPE Length = 257 Score = 69.7 bits (169), Expect = 6e-11, Method: Compositional matrix adjust. 
Identities = 64/182 (35%), Positives = 86/182 (47%), Gaps = 22/182 (12%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL L G M +WG RPT PTRS +LGLL A LGI+RD+ L AL+ + + Sbjct: 6 YLTFSLYGMMASWGDIAVGEYRPTADHPTRSAVLGLLAAALGIRRDEEERLAALTRAYKV 65 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVL---GAREDYRGLKSHE----------TIQTWREY 110 A+R D LRDYHT A++ + L + TI + R+Y Sbjct: 66 AIRVD--------APGMLLRDYHTTQVPSAAKKGRQYLTRKDELAAPREVLNTILSTRDY 117 Query: 111 LCDASFTVALWLTPHA-TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 CDA + V +W A + L + + P +T YLGR+SCPL P+ A D Sbjct: 118 RCDAVYRVYIWCRDTAPPYSLKTLAEHLQHPVFTLYLGRKSCPLALPVNPEVKTAPDLLT 177 Query: 170 AL 171 AL Sbjct: 178 AL 179 >UniRef50_A1ARH6 CRISPR-associated protein, Cas5e family n=2 Tax=Bacteria RepID=A1ARH6_PELPD Length = 232 Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 50/146 (34%), Positives = 75/146 (51%), Gaps = 6/146 (4%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRL GPMQ+WG + R TG+ P++SG++GLL A LGI R++ L+ L+ + Sbjct: 4 LLLRLVGPMQSWGTTSRFDQRDTGKEPSKSGVVGLLAAALGIDRENWVDLEPLT-CLAMG 62 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 VR D + R G T++ A D K + R YL DA+F V L Sbjct: 63 VRHDRPGVPKRDYQTAGCASTDTIIKA--DGTQAKGGGVVSQ-RFYLADAAFLVGLECDD 119 Query: 125 HATMVISELEKAVLKPRYTPYLGRRS 150 + ++ + A+ P +T LGR+S Sbjct: 120 NC--LLERIHVALHNPFWTLALGRKS 143 >UniRef50_B5GY62 Crispr-associated protein (Fragment) n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY62_STRCL Length = 260 Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 67/175 (38%), Positives = 86/175 (49%), Gaps = 29/175 (16%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDD-TSSLQALSESVQF 63 L+LRLAGP+Q+WG + +R TG PT+SG++GLL A G R L+AL + Sbjct: 86 LLLRLAGPLQSWGSASAFNSRQTGAEPTKSGVIGLLAAADGRARGACIEDLRAL----RL 141 Query: 64 AVRCDELILDDRRVSVTGLRDYHTV------------LGAREDYRGLKSHETIQ-TWREY 110 VR D S T LRDYHT +GA+ R + Q T R Y Sbjct: 142 GVRVDR--------SGTLLRDYHTASDHRGRPLAQAGVGAKGTQRPTSPAKYTQVTTRYY 193 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 L DA F AL P A ++ L++AV P + LGRRSC + PL LG S Sbjct: 194 LQDAVFLAAL-AGPRA--LLDRLDRAVRAPAFPLALGRRSCVPSLPLALGVHPGS 245 >UniRef50_A9HLC6 CRISPR-associated protein Cas5 family n=11 Tax=Acetobacteraceae RepID=A9HLC6_GLUDA Length = 260 Score = 68.6 bits (166), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 53/170 (31%), Positives = 74/170 (43%), Gaps = 21/170 (12%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M +L + PM ++G R P RS +LGL+ ACLG+ RDD + AL+ Sbjct: 1 MGQFLTFAMVAPMASFGAIAVGERRDGWDRPARSAVLGLMAACLGLTRDDEDAQAALAAD 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLG--AREDYRGLKSHE----------TIQTWR 108 A+ C L DYHT AR ++R E TI + R Sbjct: 61 YGLAILC--------HAPGKLLTDYHTAQAAPARRNWRPATRAEELAASPGDLATILSRR 112 Query: 109 EYLCDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 +Y A+W + A + L+ A+ +P +TP LGRRSCP PL Sbjct: 113 DYRMGTWHLGAVWTSGKTARWSLEALQAAMREPVFTPSLGRRSCPAGLPL 162 >UniRef50_B6B783 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B783_9RHOB Length = 232 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 59/185 (31%), Positives = 82/185 (44%), Gaps = 32/185 (17%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQALSE 59 M YLI +L + A G+ R + P RS ++G LGA +G++RD D S L AL Sbjct: 1 MPEYLIFQLVAAIGAMGEFGGHDRRGSLTLPGRSAVIGTLGAAMGLRRDADFSGLDALGV 60 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGA------REDYR-------GLKSHETIQT 106 +V + RDYHTV R R G K + T+ T Sbjct: 61 AVASFGKTAP------------FRDYHTVQTVPSAAVKRPQSRPQALRDAGRKVNTTL-T 107 Query: 107 WREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASD 166 R+Y D F VA+W ++EL A+ P + +LGR+SCPL+ P A+ Sbjct: 108 SRDYRADCVFGVAIW-----GEGLAELASALSAPVFQTFLGRKSCPLSAPFDPQIVAAAT 162 Query: 167 PQKAL 171 P AL Sbjct: 163 PSAAL 167 >UniRef50_Q0AA33 CRISPR-associated protein Cas5 family n=2 Tax=Gammaproteobacteria RepID=Q0AA33_ALHEH Length = 242 Score = 67.4 bits (163), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 59/187 (31%), Positives = 81/187 (43%), Gaps = 25/187 (13%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 LILRL P+ ++G + PT RFP RS L G+L LG DT +L +L + +A Sbjct: 4 LILRLDAPLMSFGGVLVDQHNPTDRFPGRSMLTGMLANALGWHHQDTEALNSLQARISYA 63 Query: 65 VRCDELILDDRRVSVTGLRDYHTV--------------LGAREDYRGLKSHETI-QTWRE 109 R D V LRDY TV GA E G + I Q R Sbjct: 64 ARWD--------VPPEPLRDYQTVDLGQTHLANPGWTTRGAPEHREGGTAKRGIHQRDRH 115 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y + TVA+ + P V + L A+ P ++GR++C P+ L + SD Sbjct: 116 YWANGVMTVAVTVPPGEPNVAT-LAAALRHPARPLFIGRKACLPAAPVLL-RVRESDDAY 173 Query: 170 ALLNYEP 176 +L EP Sbjct: 174 HVLASEP 180 >UniRef50_C2KP44 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP44_9ACTO Length = 245 Score = 67.0 bits (162), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 56/176 (31%), Positives = 78/176 (44%), Gaps = 26/176 (14%) Query: 7 LRLAGPMQAWGQPTFEGT-RPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAV 65 +RLAGP+Q+W G T +PTR L GL+ ACLG R + +QFAV Sbjct: 6 IRLAGPLQSWAGAKVSGNISHTQDYPTRGSLEGLVAACLGCPR---GKYPLWFQDLQFAV 62 Query: 66 RCDE--LILDDRRVSVTGLRDYHT---------VLGAREDYRGL-----KSHETIQTWRE 109 R D I DD + G+RD + G R RGL +T R Sbjct: 63 RVDSPGRICDDYQ--TIGVRDEDMQVATRLLTLLTGKRATNRGLAFIPDAQGKTTIVRRT 120 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 L DA F V + H + +L++A+ P + YLGR++ P +LG + S Sbjct: 121 LLADAEFIVQIQCEGH----LEQLDQAISDPTFVSYLGRKAFAPGFPFYLGIGEDS 172 >UniRef50_Q1J367 CRISPR-associated protein, CT1976 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J367_DEIGD Length = 232 Score = 67.0 bits (162), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 18/158 (11%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 + L+LRL PMQAWG + R T P++SG+LGL A LGI R D S++ L+ + Sbjct: 2 ATLLLRLVAPMQAWGTRSRFDDRDTEAEPSKSGVLGLCAAALGIDRAD--SVEHLAR-LA 58 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWL 122 F VR D R V G DYHT + G T T R YL DA+F L Sbjct: 59 FGVRVD-------REGVAGT-DYHTA----QLRPGNPRTRTDVTRRAYLADAAFWAGL-- 104 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLG 160 ++++L+ A+ P + LGR++ P + P+ G Sbjct: 105 -EGDAGLLTDLDAALHNPHWPLSLGRKAFPPSLPICAG 141 >UniRef50_Q2FNU0 CRISPR-associated protein, CT1976 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNU0_METHJ Length = 225 Score = 66.6 bits (161), Expect = 6e-10, Method: Compositional matrix adjust. 
Identities = 44/148 (29%), Positives = 71/148 (47%), Gaps = 9/148 (6%) Query: 35 GLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDE--LILDD----RRVSVTGLRDYHTV 88 +LG++ A LGI+RDD + L F+V + +++ D + V + L+ + V Sbjct: 15 AVLGMVAAALGIRRDDEEAQNRLQAGYGFSVMVLQPGIMIQDFHTIQSVHSSSLKKMNHV 74 Query: 89 LGAREDYRGLKSHETIQTWREYLCDASFTVALWL--TPHATMVISELEKAVLKPRYTPYL 146 + R D L ETI + REYLCD +W+ A + E+ + P + YL Sbjct: 75 M-TRRDEMNLGDSETILSRREYLCDHVSVACVWIRDAESAQFSLEEIAASFRNPVFCLYL 133 Query: 147 GRRSCPLTHPLFLGTCQASDPQKALLNY 174 GR+SCP P+ QA + AL+ + Sbjct: 134 GRKSCPPALPVHARVIQADSLKSALVQH 161 >UniRef50_C2GEY8 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY8_9CORY Length = 242 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 55/184 (29%), Positives = 84/184 (45%), Gaps = 39/184 (21%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTR-PTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M S +RL+GP+Q+W + G T PT +GL GLL LG +RD+ + Sbjct: 1 MPSSTFIRLSGPIQSWAGQSVSGNFIRTNPIPTLTGLRGLLAGALGARRDE---IPEWIS 57 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGARE---DYR-------GLKSHETIQ---- 105 V+F+VR D+ + + + D+ T+ G+RE D+R G+K+ Q Sbjct: 58 KVRFSVREDQ--------TGSFVDDFQTI-GSREEEWDFRRRIAILQGMKARSIKQLSFK 108 Query: 106 --------TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 R YL +A F V + H E++ A P + YLGR++ P P Sbjct: 109 PAVGANAVVRRTYLSEAEFIVRVTDERHT----EEIDHAFSSPVFATYLGRKAFPAAFPF 164 Query: 158 FLGT 161 +LGT Sbjct: 165 YLGT 168 >UniRef50_Q03C60 CRISPR-associated protein n=4 Tax=Lactobacillus RepID=Q03C60_LACC3 Length = 236 Score = 64.7 bits (156), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 50/161 (31%), Positives = 78/161 (48%), Gaps = 20/161 (12%) Query: 7 LRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVR 66 +RL P+Q++G R TG +P++S ++G+L A LG QRDD ++ AL++ + FAVR Sbjct: 6 IRLTSPLQSYGNEAQFARRTTGDYPSKSAIIGMLAAALGYQRDD-PAINALNDLL-FAVR 63 Query: 67 CDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHA 126 D+ + ++ T K T+R+ L DA F VA+ A Sbjct: 64 VDQ--------PGQVMTEFQTA--------EWKPGTRKLTYRDLLQDAVFVVAIGSEDEA 107 Query: 127 TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 + L +A+ PR+ YLGRR+ L + T DP Sbjct: 108 WL--DRLAEALRHPRFQLYLGRRANVPAGVLKIQTFAGQDP 146 >UniRef50_UPI0001B51C2B CRISPR-associated Cas5 family protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2B Length = 278 Score = 63.5 bits (153), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 62/199 (31%), Positives = 88/199 (44%), Gaps = 52/199 (26%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M S L+LRLAGP+Q+WG R T PT+SG+ GL+ A LG+ R D L AL++ Sbjct: 1 MTSVLLLRLAGPLQSWGALARFDRRDTLNRPTKSGVTGLVAAALGLDRAD--DLGALTD- 57 Query: 61 VQFAVRCD----------------------ELILDDRRV--------SVTG-------LR 83 ++FAVR D +LI D RR + TG R Sbjct: 58 LRFAVRADRPGTAVRDFHIVGSGTYPLRPRDLITDHRRAEKAAAALETSTGPVFGHLAAR 117 Query: 84 DYHTVLGAREDY----------RGLKSHETIQTWREYLCDASFTVALWLTPHATMVISEL 133 GA ++ G + + + T R YL DA+F A+ P + + + Sbjct: 118 SVTKWYGAPKEIAPDPKTGVLLAGNTTRDAMMTTRWYLADAAFVAAV-EHPDQNL-LHRI 175 Query: 134 EKAVLKPRYTPYLGRRSCP 152 AV P+ +LGR+SCP Sbjct: 176 SHAVEHPKRLLWLGRKSCP 194 >UniRef50_B5F422 CRISPR-associated protein Cas5 n=59 Tax=Enterobacteriaceae RepID=B5F422_SALA4 Length = 248 Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 55/161 (34%), Positives = 79/161 (49%), Gaps = 19/161 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YL+ +L GPM +WG R + P+RS LLGLL A LGI+RD+ L A + Sbjct: 1 MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNAFNRH 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGARE--DYRGLKSHETIQ---------TWRE 109 QF + C + RDYHTV +E R E +Q + R+ Sbjct: 61 YQFLL-CAS-------GNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRD 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRS 150 Y DA + +A+ TP A +++L+ A+ P + YLGR+S Sbjct: 113 YYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKS 153 >UniRef50_B4S8P8 CRISPR-associated protein Cas5 family n=8 Tax=Bacteria RepID=B4S8P8_PROA2 Length = 243 Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 53/179 (29%), Positives = 80/179 (44%), Gaps = 28/179 (15%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL+L L P+Q+WG + G R T FPT+SG+LG+L LG + L+ ++ Q Sbjct: 5 YLLLWLEAPLQSWGADSRFGRRGTLEFPTKSGVLGMLCCSLGAGGEQKELLEKMAPLKQS 64 Query: 64 AV--------RCDELILDDRRVSVTGLRDYHTVLGAREDYRGL-------KSHETIQ--- 105 A+ R +E+ DR LRD+H V +D KS T Sbjct: 65 AISFCRTSKFRQEEIKKLDRE---PLLRDFHMVGSGYDDKNPWETLLIPKKSDGTTAVNG 121 Query: 106 ----TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLG 160 T+R YL DA F V + + + ++ A+ P + Y GR+ C T ++ G Sbjct: 122 GSKITYRYYLQDAVFAVIMEVPSEKLTLFAD---ALENPCWDIYFGRKCCAPTDFIYRG 177 >UniRef50_C5V9N1 CRISPR-associated protein Cas5 n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N1_9CORY Length = 223 Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 58/201 (28%), Positives = 85/201 (42%), Gaps = 37/201 (18%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGT--RPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 M L +RLAGP+Q+W P G R R PTRSGL+GLL G R + Sbjct: 1 MTEALYIRLAGPLQSWAGPAITGNFVRTEPR-PTRSGLVGLLAGACGYGRGEYPEWLT-- 57 Query: 59 ESVQFAVRCDE--LILDD---------------RRVSVTGLRDYHTVLGAREDYRGLKSH 101 + F +R D ++DD R + G R +L + D +GL Sbjct: 58 -QLHFQIREDNRGTLVDDFHTINPRDTEEEFRSRLLLAMGQRPTKKLLNSTPDGQGL--- 113 Query: 102 ETIQTWREYLCDASFTVALWLTP--HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL 159 T T R Y+ D F V + H ++ +L+ +P + YLGR++ + P +L Sbjct: 114 -TAITERTYIADGEFIVQIKAGSREHQELLAEKLQ----QPHFVTYLGRKAFAPSFPFYL 168 Query: 160 GTCQASDPQKALLNYEPVGGD 180 G + P L VGG+ Sbjct: 169 G----AGPDDTLARIPTVGGE 185 >UniRef50_B8IZA7 CRISPR-associated protein Cas5 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA7_DESDA Length = 249 Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 60/225 (26%), Positives = 100/225 (44%), Gaps = 42/225 (18%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR---DDTSSLQALSESV 61 L LRL PM ++G + R T P++S + G+L A G+ R ++ + LQ ++ Sbjct: 8 LALRLQAPMLSFGNESRFNRRCTASLPSKSVVAGMLCAAKGLHRGSVEEQAFLQQVAAIP 67 Query: 62 QFAV---RCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 +V RC D ++ D+HTV G R+ G+K T R YL D+SF V Sbjct: 68 MLSVAIPRCLSANGKDWLLAAGRTVDFHTVQGTRKAAGGIKDCHI--TTRHYLHDSSFAV 125 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVG 178 L P+ V+ + +A+ P + ++GR+ C + P+F Sbjct: 126 FLN-GPY--RVLEDAARALQNPVWGLWIGRKCCIPSAPVF-------------------- 162 Query: 179 GDIYSEESVTGHHLKFTARDEPMITLPRQFAS--REWYVIKGGMD 221 G ++S E+V +H M+ P +F + RE + + G D Sbjct: 163 GGLFSSEAVALNH---------MLDAPLEFFTHEREVHSFEDGND 198 >UniRef50_B6IWM3 CRISPR-associated protein, CT1976 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM3_RHOCS Length = 280 Score = 58.9 bits (141), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 57/191 (29%), Positives = 82/191 (42%), Gaps = 28/191 (14%) Query: 3 SYLILRLAGPMQAWGQPTFEGT----RPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 ++L LA P AWG + + T P+RS L GLLGA LG++R + L LS Sbjct: 6 AHLCFTLAAPYGAWGAASQSSATTAWKATELDPSRSALTGLLGAALGLER---AHLGRLS 62 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGA---------------REDYRGLKSHET 103 E+++FAVR D + DYHT+ A R G K Sbjct: 63 EALRFAVRTGIRPTRDPQP------DYHTISRAHRPEGREHWSRFEELRPALAGGKQEGA 116 Query: 104 IQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ 163 + + REY +TVA+ A + + L +A+ P + Y GR++C L P Sbjct: 117 LLSRREYWSLGLWTVAVATLNPAGVPLDRLAQALRTPHWPLYAGRKACTLGLPPDPEVRT 176 Query: 164 ASDPQKALLNY 174 P LL+Y Sbjct: 177 GPGPLSVLLDY 187 >UniRef50_B8IMR2 CRISPR-associated protein Cas5 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IMR2_METNO Length = 273 Score = 58.5 bits (140), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 51/172 (29%), Positives = 68/172 (39%), Gaps = 29/172 (16%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+ L P G R + P RS +LGL+ LGI R D + AL Sbjct: 1 MPAGLVFTLYAPFAGMGDVAVGEERGSFDRPARSAVLGLVAGALGIDRADEAGHAALDRG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTV----------LGAREDYRGLKSHETIQTWREY 110 + A+R R + DYHTV R + + T+ + R Y Sbjct: 61 YRLALRL--------RTPGCLVEDYHTVQAPPVDRKARWATRREALAVAGLNTLVSRRAY 112 Query: 111 LCDASFTVALW-----LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 D V L TP A L A+ +P + PYLGR+SCPL PL Sbjct: 113 RADPIVDVVLIHVDEGPTPEA------LATALRRPTFAPYLGRKSCPLGLPL 158 >UniRef50_Q2JWC5 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC5_SYNJA Length = 207 Score = 57.8 bits (138), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 49/162 (30%), Positives = 78/162 (48%), Gaps = 31/162 (19%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR-DDTSSLQALSESVQF 63 L++RL PM +WG + R + R PT+S ++G+L A LG R + L AL V Sbjct: 4 LLMRLRAPMMSWGDHSQFDYRDSRREPTKSAVIGILCAALGRPRWEPVDDLAALKMGV-- 61 Query: 64 AVRCDELILDDRRVSVTGL--RDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 RV+ G+ +D+HTV + ETI R Y+ D + V L Sbjct: 62 ------------RVNKEGILCKDFHTV----------QIKETISN-RYYVADGDYLVGLE 98 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ 163 P+ ++ L++A+ KP + +LGR+S + PL +G + Sbjct: 99 GDPN---LLRTLDQALQKPYWQVFLGRKSFIPSRPLRVGLVE 137 >UniRef50_D1CAJ0 CRISPR-associated protein Cas5 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ0_SPHTD Length = 245 Score = 57.4 bits (137), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 55/166 (33%), Positives = 77/166 (46%), Gaps = 24/166 (14%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 S L+LRL GPMQAWG + R TG P++SG++GLL A LG R ++ + L+ ++ Sbjct: 2 STLLLRLTGPMQAWGTQSRFSWRDTGLEPSKSGVIGLLCAALGRPR--SAPVDDLAR-LR 58 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLG---------AREDYRGLKSHETIQTWREYLCD 113 VR D T D+HT G D G I R YL D Sbjct: 59 MGVRVDR--------EGTMHVDFHTAGGWHRRAEAGYGVPDPSGTARRPQISR-RFYLAD 109 Query: 114 ASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL 159 A F V L ++ L++A+ PR+ +LGR+S P+ L Sbjct: 110 ADFLVGL---EGDEELLVLLDRALAAPRWQLFLGRKSFVPAAPVRL 152 >UniRef50_B8FDI0 CRISPR-associated protein Cas5 family n=3 Tax=Bacteria RepID=B8FDI0_DESAA Length = 240 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 71/242 (29%), Positives = 102/242 (42%), Gaps = 33/242 (13%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL++ L P+Q+WG + G R T FPTRSG+LGLL LG + L L+ Q Sbjct: 5 YLLMWLEAPLQSWGADSKFGRRDTLPFPTRSGVLGLLLCALGASGEQKELLARLAPYGQT 64 Query: 64 AVRCDELILDDRRVSVTG------LRDYHTVLGARED---YRGLKSHETIQ--------- 105 + C S LRD+H V A D + L +T + Sbjct: 65 VISCAGGRPGRSGGSPEKIPRQPLLRDFHMVGSAYNDKDPWERLHIPKTNEGKPAVGGGA 124 Query: 106 --TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTC- 162 T+R YL DA F V L L P + + +A+ P + YLGR++C T ++ G Sbjct: 125 KLTYRYYLQDARFAVILELPPD---LAEDFAQALQNPVWDIYLGRKNCAPTEFVYQGVFD 181 Query: 163 ---QASDPQKALLNYEPVGGDIYSEESVTGHH--LKFTARDEPMITLP-RQFASREWYVI 216 A D AL+ + + D V G H T D P+ P +++ R VI Sbjct: 182 SQKDAMDRAAALMEEKELMEDF---RVVDGEHPGEPITLNDVPLQFGPMKKYRDRRVTVI 238 Query: 217 KG 218 + Sbjct: 239 RN 240 >UniRef50_B8HWH8 CRISPR-associated protein Cas5 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH8_CYAP4 Length = 216 Score = 54.3 bits (129), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 49/157 (31%), Positives = 74/157 (47%), Gaps = 25/157 (15%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR-DDTSSLQALSESVQF 63 L+LR+ PM +WG + R + R PT+S ++GLL A LG R + + L AL V Sbjct: 4 LLLRMRAPMMSWGDHSRFTIRDSRREPTKSAVIGLLCAALGRPRWEAVADLTALKMGV-- 61 Query: 64 AVRCDELILDDRRVSVTGLR--DYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 R++ GL DYHTV + + G K + T+ + R Y+ DA + V Sbjct: 62 ------------RINQEGLVQCDYHTVQDSIKS-SGSKGN-TVISHRYYIADADYLVG-- 105 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRR----SCPLT 154 L + L+ A+ P + Y GR+ SCP+ Sbjct: 106 LEGSDRHFLESLDSALQSPIWQVYFGRKSFVPSCPVA 142 >UniRef50_B6XT64 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT64_9BIFI Length = 212 Score = 50.1 bits (118), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 29/70 (41%), Positives = 39/70 (55%), Gaps = 3/70 (4%) Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 T+R YL DA F VAL V+ L++A+ P++ YLGRRSCP +PL LG Sbjct: 58 TYRYYLADACFLVALGAD---RSVLEMLDEAIHSPKWPLYLGRRSCPPNYPLSLGIHDEY 114 Query: 166 DPQKALLNYE 175 + + LN E Sbjct: 115 EDIRQALNSE 124 >UniRef50_Q2RXJ5 CRISPR-associated protein, Cas5e family n=5 Tax=Proteobacteria RepID=Q2RXJ5_RHORT Length = 249 Score = 49.7 bits (117), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 44/159 (27%), Positives = 69/159 (43%), Gaps = 5/159 (3%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 +LI+ L P+ A+G + T FP S L GL LG +R + Q L + + F Sbjct: 6 WLIVHLEAPLLAFGGVAIDNVGVTRDFPAASMLTGLFANALGWRRTEWERHQRLQDRLIF 65 Query: 64 AVRCDE-----LILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 A R + ++ D + ++ T G E G + R+Y DAS V Sbjct: 66 AARRERENPTGVLTDTQNAKLSKTERGWTTWGEPEGRDGASYGAPHRRRRDYHGDASVVV 125 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 AL L + +L A+ +P ++GR+SC + PL Sbjct: 126 ALRLDAAEEPALDDLAAALDRPARPLFIGRKSCVPSRPL 164 >UniRef50_Q6NEQ9 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ9_CORDI Length = 242 Score = 48.9 bits (115), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 49/179 (27%), Positives = 77/179 (43%), Gaps = 24/179 (13%) Query: 7 LRLAGPMQAWGQPTFEGT--RPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 +RL+GP+Q+W G R R PT S L GLL LG +R + + + V+F Sbjct: 7 IRLSGPLQSWAGSVVTGNIVRTEPR-PTFSSLRGLLAGALGARRGEWPNWL---DDVEFW 62 Query: 65 VRCD-ELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE-------------TIQTWREY 110 VR D + I+ + ++ L + T +G K++ T R Y Sbjct: 63 VREDRKPIVVNEFQTINPLPEVETFRKRLLIAQGRKANSAKALTFTPDAQGGTSIVNRTY 122 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 L D + V + + H + E+E A P + YLGR++ P +LG A +K Sbjct: 123 LADGEYLVRVTSSTH----MDEIENAFSSPAFVTYLGRKAFYAEFPFYLGRGSADAFEK 177 >UniRef50_C0W6U0 CRISPR-associated Cas5 family protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6U0_9ACTO Length = 201 Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 22/128 (17%) Query: 39 LLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL 98 +L A +G +R T ++ L S++F VR D+ T LRD+HT R L Sbjct: 1 MLAAAVGRRR--TDPIEDLL-SLRFGVRKDQ--------PGTVLRDFHTA-------RTL 42 Query: 99 KSHETIQ-TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 +++ + R YL DA + A+ ++ L+ AV P + YLGRRSCP + PL Sbjct: 43 DGKQSMPLSERYYLADAVYLAAIE---GEKTLLEGLDVAVRHPVFPLYLGRRSCPPSQPL 99 Query: 158 FLGTCQAS 165 LG AS Sbjct: 100 SLGIRHAS 107 >UniRef50_B8IJS9 CRISPR-associated protein Cas5 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IJS9_METNO Length = 253 Score = 40.4 bits (93), Expect = 0.046, Method: Compositional matrix adjust. 
Identities = 42/157 (26%), Positives = 69/157 (43%), Gaps = 6/157 (3%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 MR +L+L L P+QAWG + P FP + + GL+ LG R D L+AL E Sbjct: 1 MREHLLLLLEAPLQAWGGVLVDAYGPVDEFPAATLVGGLVANALGYDRADWQRLEALQER 60 Query: 61 VQFA---VRCDELILDDRRVSV-TGLRDYHTVLGAREDYRGLKSHETI-QTWREYLCDAS 115 + +R I D++ + G + T G +++++ + +R+Y D Sbjct: 61 LVVGAAVLRRGSTITDNQNAKLEKGDVGWTTRGRPEGRGGGAEAYKSPHRRFRDYHADTL 120 Query: 116 FTVALWLTPHATMV-ISELEKAVLKPRYTPYLGRRSC 151 VAL L P + + + P +LGR+ C Sbjct: 121 ALVALRLDPEDEKPDLDAIAHTLEWPERPLFLGRKPC 157 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46898 Uncharacterized protein ygcI n=13 Tax=Proteobact... 320 3e-86 UniRef50_Q314I4 CRISPR-associated protein, CT1976 n=1 Tax=Desulf... 213 4e-54 UniRef50_Q12YA8 CRISPR-associated protein, CT1976-like n=1 Tax=M... 205 7e-52 UniRef50_D1NTI1 CRISPR-associated protein Cas5 n=1 Tax=Bifidobac... 202 6e-51 UniRef50_D1CGD4 CRISPR-associated protein Cas5 family n=6 Tax=Ba... 197 2e-49 UniRef50_C2BET8 CRISPR-associated protein n=2 Tax=Firmicutes Rep... 197 2e-49 UniRef50_B7KJ26 CRISPR-associated protein Cas5 family n=1 Tax=Cy... 196 3e-49 UniRef50_C7MQD6 CRISPR-associated protein Cas5 n=1 Tax=Saccharom... 192 6e-48 UniRef50_Q04QB7 Putative uncharacterized protein n=2 Tax=Leptosp... 191 2e-47 UniRef50_A3EQA4 CRISPR-ssociated protein, Cas5 n=3 Tax=Bacteria ... 190 2e-47 UniRef50_Q1R114 CRISPR-associated protein, CT1976 n=1 Tax=Chromo... 189 7e-47 UniRef50_Q0W584 Putative uncharacterized protein n=1 Tax=uncultu... 187 3e-46 UniRef50_C5SD48 CRISPR-associated protein Cas5 family n=1 Tax=Al... 187 3e-46 UniRef50_Q2RY19 CRISPR-associated protein, Cas5e family n=1 Tax=... 186 4e-46 UniRef50_D2RB02 CRISPR system CASCADE complex protein CasD n=3 T... 
186 4e-46 UniRef50_B0S4B6 Putative uncharacterized protein n=1 Tax=Finegol... 185 1e-45 UniRef50_B0TDU1 Crispr-associated protein cas5 n=1 Tax=Heliobact... 183 4e-45 UniRef50_C8XAY4 CRISPR-associated protein Cas5 family n=2 Tax=Ac... 182 6e-45 UniRef50_A1SV73 CRISPR-associated protein, Cas5e family n=2 Tax=... 182 1e-44 UniRef50_A8LYZ7 CRISPR-associated protein Cas5 family n=2 Tax=Ac... 181 2e-44 UniRef50_D1Y486 Crispr-associated protein Cas5 n=1 Tax=Pyramidob... 181 2e-44 UniRef50_D0WFC8 CRISPR-associated protein Cas5 n=1 Tax=Slackia e... 180 2e-44 UniRef50_B4S8P8 CRISPR-associated protein Cas5 family n=8 Tax=Ba... 180 3e-44 UniRef50_Q1EQS9 CRISPR-associated protein n=3 Tax=Streptomyces R... 180 4e-44 UniRef50_A5UR14 CRISPR-associated protein, Cas5e family n=1 Tax=... 179 6e-44 UniRef50_C9M2Y8 CRISPR-associated protein n=1 Tax=Lactobacillus ... 179 7e-44 UniRef50_D0Y918 CRISPR-associated protein Cas5 family n=2 Tax=De... 179 7e-44 UniRef50_A5GBK2 CRISPR-associated protein Cas5 family n=2 Tax=De... 178 1e-43 UniRef50_C7MTB0 CRISPR-associated protein Cas5 n=1 Tax=Saccharom... 178 1e-43 UniRef50_D1A6Q5 CRISPR-associated protein Cas5 family n=2 Tax=Ac... 178 2e-43 UniRef50_A8ZZ17 CRISPR-associated protein Cas5 family n=1 Tax=De... 177 2e-43 UniRef50_A8SDR7 Putative uncharacterized protein n=1 Tax=Faecali... 177 2e-43 UniRef50_A7BA63 Putative uncharacterized protein n=1 Tax=Actinom... 177 3e-43 UniRef50_A6W168 CRISPR-associated protein Cas5 family n=6 Tax=Ga... 176 5e-43 UniRef50_B8HWH8 CRISPR-associated protein Cas5 family n=1 Tax=Cy... 176 6e-43 UniRef50_B1VIY0 CRISPR-associated protein n=9 Tax=Actinomycetale... 175 7e-43 UniRef50_B3E5U9 CRISPR-associated protein Cas5 family n=2 Tax=De... 175 1e-42 UniRef50_A1ARH6 CRISPR-associated protein, Cas5e family n=2 Tax=... 175 1e-42 UniRef50_UPI0001AF1D4C CRISPR-associated protein, CT1976 n=1 Tax... 174 2e-42 UniRef50_D2L2X8 CRISPR-associated protein Cas5 family n=1 Tax=De... 
174 2e-42 UniRef50_C9M9R7 CRISPR-associated protein Cas5, Ecoli subtype n=... 174 2e-42 UniRef50_Q5YRB6 Putative uncharacterized protein n=1 Tax=Nocardi... 172 9e-42 UniRef50_B2GBJ9 Putative uncharacterized protein n=1 Tax=Lactoba... 172 1e-41 UniRef50_Q2JWC5 CRISPR-associated protein Cas5, Ecoli subtype n=... 170 4e-41 UniRef50_Q47PI7 CRISPR-associated protein, Cas5e family n=12 Tax... 169 6e-41 UniRef50_Q2JH27 CRISPR-associated protein, CT1976 n=6 Tax=Actino... 167 2e-40 UniRef50_B4UE71 CRISPR-associated protein Cas5 family n=2 Tax=An... 167 2e-40 UniRef50_B8IZA7 CRISPR-associated protein Cas5 family n=1 Tax=De... 166 4e-40 UniRef50_C7LYW6 CRISPR-associated protein Cas5 family n=1 Tax=Ac... 165 1e-39 UniRef50_Q03C60 CRISPR-associated protein n=4 Tax=Lactobacillus ... 163 4e-39 UniRef50_B8FDI0 CRISPR-associated protein Cas5 family n=3 Tax=Ba... 163 4e-39 UniRef50_A8M404 CRISPR-associated protein Cas5 family n=1 Tax=Sa... 163 4e-39 UniRef50_D0MET6 CRISPR-associated protein Cas5 family n=1 Tax=Rh... 162 8e-39 UniRef50_B8GIV3 CRISPR-associated protein Cas5 family n=1 Tax=Me... 162 9e-39 UniRef50_D1CAJ0 CRISPR-associated protein Cas5 family n=1 Tax=Sp... 160 4e-38 UniRef50_Q1J367 CRISPR-associated protein, CT1976 n=1 Tax=Deinoc... 159 8e-38 UniRef50_Q0AA33 CRISPR-associated protein Cas5 family n=2 Tax=Ga... 159 9e-38 UniRef50_Q2FNU0 CRISPR-associated protein, CT1976 n=1 Tax=Methan... 153 5e-36 UniRef50_B4TTX2 CRISPR-associated protein Cas5 n=15 Tax=Enteroba... 152 8e-36 UniRef50_A9HLC6 CRISPR-associated protein Cas5 family n=11 Tax=A... 151 2e-35 UniRef50_Q0BSC7 Putative uncharacterized protein n=1 Tax=Granuli... 148 1e-34 UniRef50_UPI0001B51C2B CRISPR-associated Cas5 family protein n=1... 148 2e-34 UniRef50_B8IMR2 CRISPR-associated protein Cas5 family n=1 Tax=Me... 143 4e-33 UniRef50_C5V9N1 CRISPR-associated protein Cas5 n=1 Tax=Corynebac... 141 2e-32 UniRef50_B6B783 CRISPR-associated protein Cas5, Ecoli subtype n=... 
141 2e-32 UniRef50_C2KP44 Putative uncharacterized protein n=1 Tax=Mobilun... 137 2e-31 UniRef50_B6IWM3 CRISPR-associated protein, CT1976 family n=1 Tax... 137 3e-31 UniRef50_B5GY62 Crispr-associated protein (Fragment) n=1 Tax=Str... 135 9e-31 UniRef50_C2GEY8 CRISPR-associated protein n=1 Tax=Corynebacteriu... 132 1e-29 UniRef50_Q2RXJ5 CRISPR-associated protein, Cas5e family n=5 Tax=... 132 1e-29 UniRef50_B5F422 CRISPR-associated protein Cas5 n=59 Tax=Enteroba... 130 3e-29 UniRef50_Q6NEQ9 Putative uncharacterized protein n=1 Tax=Coryneb... 129 8e-29 UniRef50_C0W6U0 CRISPR-associated Cas5 family protein n=1 Tax=Ac... 122 1e-26 UniRef50_B6XT64 Putative uncharacterized protein n=2 Tax=Bifidob... 119 8e-26 Sequences not found previously or not previously below threshold: UniRef50_B8IJS9 CRISPR-associated protein Cas5 family n=1 Tax=Me... 107 3e-22 UniRef50_A0LM54 Putative uncharacterized protein n=1 Tax=Syntrop... 84 4e-15 UniRef50_UPI0000F51765 hypothetical protein Faci_00030 n=1 Tax=F... 55 2e-06 UniRef50_B1I5P1 CRISPR-associated protein Cas5 n=2 Tax=Clostridi... 50 6e-05 UniRef50_Q3AA65 CRISPR-associated protein Cas5, Hmari subtype n=... 49 1e-04 UniRef50_B5IGN1 CRISPR-associated protein Cas5 n=1 Tax=Acidulipr... 49 1e-04 UniRef50_B0K553 CRISPR-associated protein Cas5, Hmari subtype n=... 47 3e-04 UniRef50_Q1AZD3 CRISPR-associated protein, Cas5h family n=1 Tax=... 46 9e-04 UniRef50_D1N0K0 Putative uncharacterized protein n=1 Tax=Victiva... 46 0.001 UniRef50_A3DHS3 CRISPR-associated protein Cas5 n=3 Tax=Clostridi... 45 0.003 UniRef50_D0MJ68 CRISPR-associated protein Cas5 n=1 Tax=Rhodother... 44 0.004 UniRef50_C9RCY2 CRISPR-associated protein Cas5, Hmari subtype n=... 42 0.015 UniRef50_A5D0Y3 Putative uncharacterized protein n=1 Tax=Pelotom... 42 0.021 UniRef50_D1A6P6 Metal dependent phosphohydrolase n=1 Tax=Thermom... 
41 0.038 >UniRef50_Q46898 Uncharacterized protein ygcI n=13 Tax=Proteobacteria RepID=YGCI_ECOLI Length = 224 Score = 320 bits (819), Expect = 3e-86, Method: Composition-based stats. Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES Sbjct: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL Sbjct: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD Sbjct: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 Query: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ 224 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ Sbjct: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ 224 >UniRef50_Q314I4 CRISPR-associated protein, CT1976 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I4_DESDG Length = 245 Score = 213 bits (542), Expect = 4e-54, Method: Composition-based stats. 
Identities = 83/246 (33%), Positives = 111/246 (45%), Gaps = 33/246 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YL ++ GP+QA+G RPT PTRS +LG+L A +GI+RD+ + L L + Sbjct: 1 MAQYLTFQIYGPLQAYGTVAVGEIRPTSTMPTRSAVLGILAAAIGIRRDEETRLAELRDG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGA----------REDYRGLKSHETIQTWREY 110 + AVR D + DYHT+ R D L TI + REY Sbjct: 61 YRVAVRED--------APGKVMLDYHTIQTPGARGKRQLHCRRDELLLTEPNTILSRREY 112 Query: 111 LCDASFTVALWLTPHA-TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 L DA FTV LW H + E+ +A+ PR+T LGR+SCP P + PQ+ Sbjct: 113 LMDALFTVCLWQANHTVPYSLQEIARALRSPRWTIGLGRKSCPPALPFAPKITDHTTPQE 172 Query: 170 ALLNYEP---VGGDIYSEE------SVTGHH--LKFTARDEPMITLPRQFAS---REWYV 215 A+ Y V + S + G H + T RD P+ RQFA RE V Sbjct: 173 AVAAYPADKLVSAGLRSPQVMRMLLDTEGPHTDTETTVRDVPLHHGRRQFAERKVRELLV 232 Query: 216 IKGGMD 221 K D Sbjct: 233 RKAATD 238 >UniRef50_Q12YA8 CRISPR-associated protein, CT1976-like n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YA8_METBU Length = 244 Score = 205 bits (522), Expect = 7e-52, Method: Composition-based stats. 
Identities = 65/244 (26%), Positives = 104/244 (42%), Gaps = 38/244 (15%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YLI RL GPM +WG RPT P++S + GL+ A LGI+RD+ LS + Sbjct: 1 MKEYLIFRLYGPMASWGDIAVGQHRPTYDHPSKSAIFGLIAAALGIRRDEEERHLELSNA 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVL-------------GAREDYRGLKSHE--TIQ 105 + + LRDYHT R+D + E T+ Sbjct: 61 YSYGTLI--------NSAGKLLRDYHTSQVPSAGTGRNRKTFATRKDELAVPKEELNTVL 112 Query: 106 TWREYLCDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 + R+Y CD +TV L + L ++ +P + YLGR+SCPL P+ A Sbjct: 113 STRDYYCDGVYTVILSCKTDTPPYSLELLGNSLKEPSFCLYLGRKSCPLALPINPKIVSA 172 Query: 165 SDPQKALLNYEPVGGDIYSEESVTGHH-------------LKFTARDEPMITLPR-QFAS 210 S+ ++AL + +P + + + + +R + +++ R QF+ Sbjct: 173 SNIKEALQSVDPGEEGLVKKIEMKSPYRLYWDDPKESMTCEHTISRYDKLLSRKRWQFSK 232 Query: 211 REWY 214 R Y Sbjct: 233 RNEY 236 >UniRef50_D1NTI1 CRISPR-associated protein Cas5 n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI1_9BIFI Length = 250 Score = 202 bits (515), Expect = 6e-51, Method: Composition-based stats. 
Identities = 66/226 (29%), Positives = 101/226 (44%), Gaps = 28/226 (12%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S LILRLAGPMQ+WG + R T PT+S ++GLL + G +R+D S++ L + Sbjct: 1 MSVLILRLAGPMQSWGDSSRFNRRETRTEPTKSAVIGLLASAQGRRRED--SIEDLL-GL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 +F VR D+ +RD+ T G S T R YL DA F VA+ Sbjct: 58 RFGVRSDQ--------PGRIMRDFQTEKSIARKKSGEFSLTMPLTHRYYLADAKFLVAIE 109 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 ++ L+ A+ P++ +LGRRSCP P+ LG ++ ++AL + Sbjct: 110 ---GERSLLESLDAALRNPQWPLFLGRRSCPPASPVSLGVKDYANVEEALDKEPWIASPW 166 Query: 182 YSE------------ESVTGHHLKFTARDEPMITL--PRQFASREW 213 Y + ++V D P+ R++A R Sbjct: 167 YRKKVHDSKRLQVVVDAVENGETTGQQSDMPLSFSQKHRRYAQRPV 212 >UniRef50_D1CGD4 CRISPR-associated protein Cas5 family n=6 Tax=Bacteria RepID=D1CGD4_THET1 Length = 230 Score = 197 bits (501), Expect = 2e-49, Method: Composition-based stats. Identities = 69/227 (30%), Positives = 107/227 (47%), Gaps = 26/227 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L++RL+GPMQ+WG + R TGR P++SG++GL+ A LG R T+ + L Sbjct: 1 MPT-LLMRLSGPMQSWGTQSRFTVRDTGREPSKSGVIGLICAALGRPR--TAPVDDLVR- 56 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHET-----IQTWREYLCDAS 115 ++ VR D +RDYHT GA R + T +Q+ R YL DAS Sbjct: 57 LRMGVRVDR--------EGIVMRDYHTAGGAPAGERYGVATVTGDQRPVQSSRYYLADAS 108 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT-------CQASDPQ 168 F VAL ++ ++++A+ PR+ +LGR+SC + P+ L + D + Sbjct: 109 FLVALEGGEEDRPLLEQIDEALRAPRWQLFLGRKSCVPSEPIHLPKEPPLGPPIREEDLR 168 Query: 169 KALLNYEPVGGDIYSEESVTGHHLKFTARDEP--MITLPRQFASREW 213 AL++Y G + D P RQ+A+R Sbjct: 169 TALISYPWPEGAHRLRFVFEDPEGEELRNDVPISFEIGNRQYAARFV 215 >UniRef50_C2BET8 CRISPR-associated protein n=2 Tax=Firmicutes RepID=C2BET8_9FIRM Length = 244 Score = 197 bits (501), Expect = 2e-49, Method: Composition-based stats. 
Identities = 62/223 (27%), Positives = 103/223 (46%), Gaps = 29/223 (13%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 ++L+L GPMQ+WG + TR + +P++SG++G++ A G +RD+ +Q L++ + Sbjct: 5 KTILLKLTGPMQSWGTSSRFETRTSDYYPSKSGVIGIIAASFGYERDEDEKIQKLND-LD 63 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWL 122 FAVR D+ +DYH AR+ + T T R Y+ DA F VA+ Sbjct: 64 FAVRVDQ--------EGVLKKDYHI---ARKVKPNGELERTYVTNRYYMEDAVFVVAI-- 110 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIY 182 + + E+ + + P + P++GRRSCPL LGT + P +AL N + D + Sbjct: 111 SHEDDKWMEEILQGLKYPYFQPFMGRRSCPLPARFILGTNE-EGPIEALENLDWQAADWF 169 Query: 183 SEESV--------------TGHHLKFTARDEPMITLPRQFASR 211 +++ H R R+F R Sbjct: 170 KKKNKNYRADIYADKDLLPENSHTIRNDRVVSFSQKERKFGPR 212 >UniRef50_B7KJ26 CRISPR-associated protein Cas5 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ26_CYAP7 Length = 215 Score = 196 bits (499), Expect = 3e-49, Method: Composition-based stats. Identities = 68/228 (29%), Positives = 113/228 (49%), Gaps = 17/228 (7%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M L+LRLAGP+Q+WG+ + R T PT+SG++GL+ A +GI RD+ L L++ Sbjct: 1 MMKTLLLRLAGPLQSWGRGSRFDFRDTDTIPTKSGVIGLVAAAMGINRDNQVELAKLAQ- 59 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 ++ V ++ + DYHTV+G G IQ++R+YLC+A F V L Sbjct: 60 LRMGVCVEK--------EGKLVVDYHTVIGTIH-ADGKPHKAPIQSYRQYLCNAEFLVGL 110 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY--EPVG 178 + + +++E+E + P++ +LGR++CP + P+F+ S + AL Y + Sbjct: 111 ESSEY--HLLNEIEHYLCFPKWELFLGRKACPPSKPIFVDLLTNS-LEDALYQYAISHLK 167 Query: 179 GDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK--GGMDVSQ 224 Y + D P+ R F++R K DVS+ Sbjct: 168 KGTYRLLIESKEPTGALRLDVPIDFKKRIFSARTVITPKPLEVSDVSE 215 >UniRef50_C7MQD6 CRISPR-associated protein Cas5 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD6_SACVD Length = 236 Score = 192 bits (489), Expect = 6e-48, Method: Composition-based stats. 
Identities = 73/220 (33%), Positives = 107/220 (48%), Gaps = 22/220 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+L L GPMQAWG + R T +PTRSG++G++ A LG Q D SL LS + Sbjct: 1 MTTLVLHLDGPMQAWGHASQWDHRDTLDYPTRSGVIGMIAAALGKQWGD--SLDDLSP-L 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDY----RGLKSHETIQTWREYLCDASFT 117 +F +R D + DYHT G E +G + + R Y+ DA++T Sbjct: 58 RFTIRIDR--------PGRRIVDYHTAGGGYEVGIARVKGGNRAHAVLSDRFYMSDAAYT 109 Query: 118 VALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY--E 175 VA+ ++ ++ A+ P + P+LGRRSCP P LG DP + L + + Sbjct: 110 VAIT---GPDTLLYRVDDALRAPVFGPFLGRRSCPPAGPWHLGLHDG-DPLRTLPLHRDK 165 Query: 176 PVGGDIYSEESVTGHHLKF-TARDEPMITLPRQFASREWY 214 P GD + E V+ H T R + +T P +F R Y Sbjct: 166 PRDGDTVAVEFVSDHETHGPTDRVDTTLTDPHEFGPRRSY 205 >UniRef50_Q04QB7 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB7_LEPBJ Length = 247 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 61/220 (27%), Positives = 93/220 (42%), Gaps = 18/220 (8%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ RL GP+ +WG RP+ FPT+S ++GL+ A G R + + L +S Sbjct: 1 MKDYLVFRLYGPLVSWGNIAVGEYRPSDSFPTKSAIIGLISASFGFDRSEDGKISELVKS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSH----------ETIQTWREY 110 V FA + LRDYHT+ R L + ETI + R+Y Sbjct: 61 VFFATKTLN--------PGNLLRDYHTIQSPGNVKRSLLTRKDELLDSEYVETILSSRDY 112 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKA 170 DA + VAL A + E+ A+L P +TPYLGR+SC + P+ + A Sbjct: 113 RVDAVYDVALSEKKRAPYSLKEIRNALLSPIHTPYLGRKSCSIALPMCPEILSSDSFPNA 172 Query: 171 LLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFAS 210 Y + Y +++ ++ L Sbjct: 173 FEEYNKILMKKYESSDYKDPLADLSSKSSAILYLWEDPTE 212 >UniRef50_A3EQA4 CRISPR-ssociated protein, Cas5 n=3 Tax=Bacteria RepID=A3EQA4_9BACT Length = 227 Score = 190 bits (484), Expect = 2e-47, Method: Composition-based stats. 
Identities = 61/223 (27%), Positives = 96/223 (43%), Gaps = 23/223 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L++RL PMQ+WG + R TG+ P++SG++GLL A LGI R++ L+ L+ Sbjct: 1 MPT-LLIRLVSPMQSWGTSSRFDQRDTGKEPSKSGVIGLLAAALGIDRNNWDDLEPLA-G 58 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 + VR D RDY T K H T T+REYL DA F V Sbjct: 59 LSMGVRHDR--------PGIPRRDYQTASKII-SADHSKIHPTAVTYREYLADAVFLVGF 109 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 ++ ++ A+ P + +LGR+S + P+++ + P + L + P Sbjct: 110 E--SAEVSLLEKINSALKNPVWPLFLGRKSYVPSEPIWIENGLKNVPLREALEHFPWIAC 167 Query: 181 IYSEESVTGHH---------LKFTARDEPM-ITLPRQFASREW 213 E + D+P+ R+F SR Sbjct: 168 RRRNERLPEKLVITFESEDGTGVLKMDQPLSSFAKRRFGSRFV 210 >UniRef50_Q1R114 CRISPR-associated protein, CT1976 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R114_CHRSD Length = 260 Score = 189 bits (480), Expect = 7e-47, Method: Composition-based stats. Identities = 66/183 (36%), Positives = 92/183 (50%), Gaps = 20/183 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M +L+ RL PM +WG+ RPT +P R +LGL+GA LGI+RDD L +S Sbjct: 1 MTGHLVFRLYAPMASWGEAAVGEARPTATYPGRGAILGLIGAALGIRRDDDEGQLRLRQS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYR------------GLKSHETIQTWR 108 + AV+ +R LRDYHTV + + TI + R Sbjct: 61 LGIAVK--------QRSPGWLLRDYHTVQVPPSQSKVNYRSRREELSVPKDALNTILSSR 112 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 +Y CD + VAL L P A + EL+ A+ +PR+T YLGR++CPL PL +A + Sbjct: 113 DYRCDGLWVVALRLMPDAVWTLDELKSALERPRFTLYLGRKACPLAAPLTPAIVEADHWR 172 Query: 169 KAL 171 AL Sbjct: 173 GAL 175 >UniRef50_Q0W584 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W584_UNCMA Length = 227 Score = 187 bits (474), Expect = 3e-46, Method: Composition-based stats. 
Identities = 57/178 (32%), Positives = 83/178 (46%), Gaps = 12/178 (6%) Query: 10 AGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDE 69 GPMQ+WG R TG PT+SG++GLLG LG R D L ++ +R + Sbjct: 12 EGPMQSWGLKARWDIRDTGDEPTKSGIIGLLGCALGYARKDPRLTDELDSQLRIGIRVE- 70 Query: 70 LILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMV 129 RDYHTV G G TI ++R+YL DA+F V L + Sbjct: 71 -------CPGEIARDYHTVSGELRTAEGKLRETTIVSFRDYLQDAAFLVVLE---GPGEL 120 Query: 130 ISELEKAVLKPRYTPYLGRRSCPLTHPLFLG-TCQASDPQKALLNYEPVGGDIYSEES 186 ++ + A+ P + YLGR+SCP T P+F T + AL + G + + ++ Sbjct: 121 LTRISNALKDPVWPIYLGRKSCPPTRPVFETLTTDYASIDDALSRHPWSSGTMEARKA 178 >UniRef50_C5SD48 CRISPR-associated protein Cas5 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD48_CHRVI Length = 227 Score = 187 bits (474), Expect = 3e-46, Method: Composition-based stats. Identities = 108/224 (48%), Positives = 134/224 (59%), Gaps = 9/224 (4%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M SYLILRL GPMQAWG TFE RP+ FPTRSGLLGLLGACLG+ R DT SL AL+ES Sbjct: 1 MPSYLILRLDGPMQAWGTHTFEDYRPSNPFPTRSGLLGLLGACLGLDRSDTPSLDALAES 60 Query: 61 VQFAVRCDELILDDR-----RVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 V F VR D T L DYHTVL AR+ G + IQ+ REYL DA+ Sbjct: 61 VAFTVRLDTGAPRPGVDRLMPKRHTKLSDYHTVLDARK-VDGSTNKFPIQSHREYLFDAA 119 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL--FLGTCQASDPQKALLN 173 F VA+ P A+ ++ + +++ +PR+TP LGRRSCPL PL +A D + AL Sbjct: 120 FAVAIGSRPDASFSLARIAESLRQPRFTPVLGRRSCPLGRPLLERPDCIEADDAKAALAQ 179 Query: 174 YEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK 217 + P GG IYSE+ + + RD P RQFA+R Y+ + Sbjct: 180 FPPHGGLIYSEDELVSDQPTWI-RDVPRYGRHRQFATRRLYLHR 222 >UniRef50_Q2RY19 CRISPR-associated protein, Cas5e family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY19_RHORT Length = 261 Score = 186 bits (473), Expect = 4e-46, Method: Composition-based stats. 
Identities = 74/234 (31%), Positives = 102/234 (43%), Gaps = 32/234 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 +R +L+ RL GPM AWG R T P +S +LGLL A LGI R D ++ +AL Sbjct: 3 VRDFLVFRLVGPMAAWGDIAVGERRGTWDVPAKSAILGLLAAGLGIDRADRTAHEALDRG 62 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTV----------LGAREDYRGLKSHETIQTWREY 110 + FAVR D LRDYHT R D T+ + R Y Sbjct: 63 LGFAVRQDR--------PGRLLRDYHTAQAPKARKNARWSTRRDELNDDDLNTVLSDRLY 114 Query: 111 LCDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 +A T A+W + +L +A+L+PR+TPYLGR++CPL P QA Sbjct: 115 RTNAIATPAIWRRQGTEGPTLDQLTQALLRPRFTPYLGRKACPLGWPPRPRLLQADGLLA 174 Query: 170 ALLNYEPVGGDIYSE--ESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 A Y+ D + ++ G T R P+ W+ I G+D Sbjct: 175 AFDAYDSAEWDAARQFHKAYPGGWPGDTDRPTPV-----------WFEIAAGLD 217 >UniRef50_D2RB02 CRISPR system CASCADE complex protein CasD n=3 Tax=Actinobacteria (class) RepID=D2RB02_GARVA Length = 291 Score = 186 bits (473), Expect = 4e-46, Method: Composition-based stats. Identities = 55/187 (29%), Positives = 89/187 (47%), Gaps = 18/187 (9%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR--DDTSSLQALS 58 M+S L+L+ +GP+Q+WG + TR T +P++S ++G++ A G +R D ++ L+ Sbjct: 1 MKS-LLLKFSGPLQSWGTDSHFETRHTDYYPSKSAVIGMIAAAFGYRRSTDCDENIAKLN 59 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 + + FAVR D+ LRDYH + + T R YL DA F V Sbjct: 60 D-LDFAVRIDQ--------QGNLLRDYHIAAKYK---ANGDFEKNYVTNRYYLEDAIFLV 107 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVG 178 A+ + +I ++ A+ P + LGRRS P T LG +ALL +E + Sbjct: 108 AI--GSNNEQLIYDISNALRSPYFQSSLGRRSLPPTADFILGVEDC-GVIQALLTHEWLA 164 Query: 179 GDIYSEE 185 + Sbjct: 165 NKWSKKR 171 >UniRef50_B0S4B6 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B6_FINM2 Length = 228 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. 
Identities = 63/238 (26%), Positives = 107/238 (44%), Gaps = 31/238 (13%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S ++L+ A P+Q+WG R T +PT+S ++GL+ A G ++ DT S++ L+ S+ Sbjct: 1 MSVILLKFASPLQSWGGLANYEIRNTEYYPTKSAVIGLVAAAFGYKKTDTESIKRLN-SL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTW---REYLCDASFTV 118 F+VR D+ + +RD+ + Y + +++ + Y+ DA F + Sbjct: 60 NFSVRIDQ--------KGSLIRDFQIAMEYNPKYMPNDPNYFVKSNLIQKYYIQDAKFLI 111 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ-------------AS 165 AL + ++ ++ A+ P Y +LGR+S P+ +G AS Sbjct: 112 AL--SSDDETLMEDVYNALESPAYQLFLGRKSNPINADYLIGKFDGNELEIIKDYEWLAS 169 Query: 166 DPQKALLNYEPVGGDIYSEESVTGHHLKFTARD--EPMITLPRQFASR--EWYVIKGG 219 K + + V I+S+ TG K RD E R F+SR YV K G Sbjct: 170 KWYKKSIKKDSVELSIFSDYIDTGSKEKLIRRDLTESFENTKRDFSSRFEYRYVTKVG 227 >UniRef50_B0TDU1 Crispr-associated protein cas5 n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TDU1_HELMI Length = 230 Score = 183 bits (464), Expect = 4e-45, Method: Composition-based stats. 
Identities = 67/228 (29%), Positives = 102/228 (44%), Gaps = 23/228 (10%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L LRL GP+Q+WG + R + PT+SG++GLLG LG R+D L++L +++ Sbjct: 6 LALRLEGPLQSWGSRSRWDYRDSALEPTKSGIIGLLGCALGWSRND-KRLESLDAALRLT 64 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE-----TIQTWREYLCDASFTVA 119 VR D+ T L D+HTV G G + T+ + R YL +ASF Sbjct: 65 VRIDK--------PGTPLIDFHTVQGYLLMAEGKQKKSGNDMYTVVSRRVYLQEASFLAL 116 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS--DPQKALLNYE-- 175 L A + + +KA+ P + +LGR+SCP PLF + D +A+ + Sbjct: 117 LTGEQGA---LHQCKKALNDPVWPVFLGRKSCPPARPLFDSFYEGDFRDVLEAMRSIPWS 173 Query: 176 --PVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 P G I + + +D I R + R V +D Sbjct: 174 SAPAAGPIRLRYVMEDEGGREWRQDVLRINGARMYGRRRVSVGWVDLD 221 >UniRef50_C8XAY4 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=C8XAY4_NAKMY Length = 252 Score = 182 bits (463), Expect = 6e-45, Method: Composition-based stats. Identities = 70/211 (33%), Positives = 96/211 (45%), Gaps = 22/211 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L+LRL GPMQ+WG+ + R T P++S ++GLL A LG +R D +++ L+ + Sbjct: 1 MSVLVLRLTGPMQSWGERSRYARRETAAEPSKSAIVGLLAAALGRRRTD--AIEDLA-GL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETI-QTWREYLCDASFTVAL 120 F VR D+ T LRD+ T R L T+ + R YL DA F A+ Sbjct: 58 IFGVRVDQ--------PGTLLRDFQTA-------RSLDGARTMPLSERYYLSDARFLAAV 102 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 +I+ L A+ P + YLGRRSCP + P+ C S P A L EP Sbjct: 103 E---GPESLIAGLAGALRDPTFPLYLGRRSCPPSEPIAQQDCIRSGPLLAALFDEPWHAT 159 Query: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASR 211 V A D +P Q A R Sbjct: 160 KSYRRRVADPARLSIAVDAAATEVPAQLAER 190 >UniRef50_A1SV73 CRISPR-associated protein, Cas5e family n=2 Tax=Gammaproteobacteria RepID=A1SV73_PSYIN Length = 217 Score = 182 bits (461), Expect = 1e-44, Method: Composition-based stats. 
Identities = 78/214 (36%), Positives = 110/214 (51%), Gaps = 15/214 (7%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LIL+ G M A+G TF+ R FPTRS ++G+LGA +GI R++ + L ALSE + Sbjct: 1 MKTLILKTEG-MSAYGLQTFDVHRRANHFPTRSAIMGILGAAMGITRENFNELYALSEQL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + AV+ +S + DYHTV R +G T+REY CD+ T A+ Sbjct: 60 KIAVQV--------NLSGEKMVDYHTVQHFRSP-QGKIQKGVKPTYREYWCDSEHTFAIS 110 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 H VI +L +V P +T + GR+SCPLT PLF +P AL N+ G I Sbjct: 111 AAEH---VIEKLVNSVKFPEFTLFQGRKSCPLTRPLFEAVTDDDNPANALKNHGE-QGQI 166 Query: 182 YSEESVTGHHLKFTARDEPMITLPRQFASREWYV 215 +S+ S RD + +PR++A R YV Sbjct: 167 FSDISGDNQLAIVQVRD-LITAIPRKYAMRTVYV 199 >UniRef50_A8LYZ7 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=A8LYZ7_SALAI Length = 257 Score = 181 bits (459), Expect = 2e-44, Method: Composition-based stats. Identities = 78/240 (32%), Positives = 109/240 (45%), Gaps = 43/240 (17%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRLAGPMQ+WG + R TG PTRS ++G++ A G R + L L+ VQF Sbjct: 4 LLLRLAGPMQSWGDHSTFSVRDTGTVPTRSAMIGIIAAAQGRHRGE--PLGDLAP-VQFT 60 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE---------TIQTWREYLCDAS 115 VR D T + D+HTV G R + + E TI + R YL DA Sbjct: 61 VRVDR--------PGTVMSDFHTVGGGAPPERTVPTAEGKRRTAGAGTIVSRRFYLADAV 112 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKAL---- 171 FTVA+ ++ ++ A+ P + PYLGRRSCP+ HP L + DP L Sbjct: 113 FTVAVTGPDD---LVGQIHTALNNPVWGPYLGRRSCPVAHPF-LMSGPIPDPVGRLEHLP 168 Query: 172 --LNYEPVGGDIYSEESV------TGHHLKFTARDEPMITLP-------RQFASREWYVI 216 P + + V G + T D PM +P R++ +R+ YV Sbjct: 169 LNRRRPPGDEETVRVDFVTGAPHGDGSISRMTLNDVPMEPVPGSPDPRRRRYLTRQVYVT 228 >UniRef50_D1Y486 Crispr-associated protein Cas5 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y486_9BACT Length = 269 Score = 181 bits (459), Expect = 2e-44, Method: Composition-based stats. 
Identities = 64/229 (27%), Positives = 93/229 (40%), Gaps = 36/229 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LILRL GP+ A+G + RPT P S + GL+ LG D LQ L E + Sbjct: 1 MDALILRLRGPLMAFGDVAVDEIRPTDLLPGLSEMTGLIANALGWTFQDVEKLQRLQERL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYR------------GLKSHETIQTWRE 109 + A R D + LRDY T + ED G K T+Q +R Sbjct: 61 RLASREDR--------TGVPLRDYQTARLSSEDSLWRTDGIVAERGGGSKGEFTVQRYRH 112 Query: 110 YLCDASFTVALWLTP-HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 Y DA+ TV + L P + E+ A+ P ++GR CP + P+ Sbjct: 113 YRADAAVTVLIALDPADEAPALEEIRDALRHPARPLFIGRIGCPPSQPIC---------- 162 Query: 169 KALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK 217 +EP G +I +S+ + T R R+ ASR +++ Sbjct: 163 -----FEPEGREIIHTDSLKDAIMHITPRAPLGAQTKREPASRNKVLVE 206 >UniRef50_D0WFC8 CRISPR-associated protein Cas5 n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC8_9ACTN Length = 267 Score = 180 bits (458), Expect = 2e-44, Method: Composition-based stats. Identities = 57/201 (28%), Positives = 96/201 (47%), Gaps = 20/201 (9%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+L+LA P+Q+WG + R T P++SG++GLL A LG +R+D S+ L+ ++ Sbjct: 4 MTVLLLKLAAPLQSWGASSRFTERTTRHEPSKSGVIGLLAAALGRRRED--SVDDLA-AL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLK---SHETIQTWREYLCDASFTV 118 +FAVR D+ + +RD+ T + D + + + R+YL DA F Sbjct: 61 RFAVRIDQ--------PGSFMRDFQTEHTRKWDSDTRRFVFNESLSLSKRDYLSDAVFVA 112 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVG 178 AL +++E +A+ P + +LGRRSCP + ++LG +AL + Sbjct: 113 ALE---GDEDLLAECAEALHHPAFPLFLGRRSCPPSTQVYLGLVDGP-MMEALADIPWQA 168 Query: 179 GD--IYSEESVTGHHLKFTAR 197 + G K T Sbjct: 169 TERHWNYAYRFRGDKPKETVE 189 >UniRef50_B4S8P8 CRISPR-associated protein Cas5 family n=8 Tax=Bacteria RepID=B4S8P8_PROA2 Length = 243 Score = 180 bits (457), Expect = 3e-44, Method: Composition-based stats. 
Identities = 59/244 (24%), Positives = 93/244 (38%), Gaps = 32/244 (13%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL+L L P+Q+WG + G R T FPT+SG+LG+L LG + L+ ++ Q Sbjct: 5 YLLLWLEAPLQSWGADSRFGRRGTLEFPTKSGVLGMLCCSLGAGGEQKELLEKMAPLKQS 64 Query: 64 AV--------RCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS--------------H 101 A+ R +E+ DR LRD+H V +D ++ Sbjct: 65 AISFCRTSKFRQEEIKKLDRE---PLLRDFHMVGSGYDDKNPWETLLIPKKSDGTTAVNG 121 Query: 102 ETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT 161 + T+R YL DA F V + + + A+ P + Y GR+ C T ++ G Sbjct: 122 GSKITYRYYLQDAVFAVIMEVPSEKLTLF---ADALENPCWDIYFGRKCCAPTDFIYRGC 178 Query: 162 CQASDP-QKALLNYEPVGGDIYSEESVTGHHLK--FTARDEPMITLPRQ-FASREWYVIK 217 L + V G H D P+ ++ + R VI Sbjct: 179 FNTESLAIGKALEIAQEKRLMEDFRVVDGEHEGEAIVLNDVPIQFGEQKLYRERRVTVIS 238 Query: 218 GGMD 221 + Sbjct: 239 CANE 242 >UniRef50_Q1EQS9 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS9_STRKN Length = 280 Score = 180 bits (456), Expect = 4e-44, Method: Composition-based stats. Identities = 71/235 (30%), Positives = 107/235 (45%), Gaps = 39/235 (16%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRL+GP+Q+WG+ + R T RFPTRSG++G+L A LG +R + + L+ + Sbjct: 14 LLLRLSGPLQSWGERSHFNERDTARFPTRSGIIGMLAAALGRRRGE--PVDDLAR-LSLT 70 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE---------TIQTWREYLCDAS 115 VR D LRD HTV G + + E T+ T R YL DA+ Sbjct: 71 VRTDR--------PGILLRDLHTVGGGLPAKATVTTAEGKKRPGTTGTLLTHRTYLADAA 122 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS------DPQK 169 FT+AL TP ++ + +A+ P + +LGRRSCP PL LG + + P Sbjct: 123 FTIALTSTPDDRPLLDQAAQALNTPCWPLFLGRRSCPPEGPLLLGASEDALHHLVHLPLA 182 Query: 170 ALLNYEPVGGDIYSEESV-------------TGHHLKFTARDEPMITLPRQFASR 211 A + ++ + G H D+P+ PR+ + R Sbjct: 183 AHPGRGQQDTEFLADRPLNRLPYGTATPVGADGTHPSGEVNDQPLSFDPRRRSYR 237 >UniRef50_A5UR14 CRISPR-associated protein, Cas5e family n=1 Tax=Roseiflexus sp. 
RS-1 RepID=A5UR14_ROSS1 Length = 262 Score = 179 bits (455), Expect = 6e-44, Method: Composition-based stats. Identities = 68/205 (33%), Positives = 94/205 (45%), Gaps = 45/205 (21%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L LRL GP+Q+WG G R T PT+SG++GLLG LG++RDD + L+ LS+++ Sbjct: 1 MNTLFLRLEGPLQSWGLRARWGERDTTDAPTKSGVIGLLGCALGLRRDD-ARLRDLSDNL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS--------------------- 100 + VR D + +RDYHT G R Sbjct: 60 RMGVRVD--------LPGILMRDYHTTGGGRYSTIASTGGPRYHDEPYIGGVLSAEVTKG 111 Query: 101 ------------HETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGR 148 ET + R YL DASF VAL +P I EL A+ P + +LGR Sbjct: 112 RIKVKINQKTGEPETDVSERYYLADASFLVALQGSPD---YIGELATAIQSPVWPLFLGR 168 Query: 149 RSCPLTHPLFLGTCQASDPQKALLN 173 ++C + P+F GT Q + AL N Sbjct: 169 KACVPSTPIFAGTGQFDILEDALKN 193 >UniRef50_C9M2Y8 CRISPR-associated protein n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9M2Y8_LACHE Length = 241 Score = 179 bits (454), Expect = 7e-44, Method: Composition-based stats. Identities = 51/201 (25%), Positives = 90/201 (44%), Gaps = 17/201 (8%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M++ +RL P+Q++G R + +P++S ++G++ A LG +RDD LQ + Sbjct: 1 MKTA-TIRLTAPLQSYGNQASFNQRTSDNYPSKSAVIGIIAAALGYRRDDARILQ--LNN 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 + FAVR ++ S + ++ TV + + + T+RE++ DA F VA+ Sbjct: 58 LLFAVRIEQ--------SGNMMTEFQTVEYQKSSTKTARK----LTYREFIQDAVFMVAI 105 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 I ++ A+ P++ YLGRRS P PL + T +P + L Sbjct: 106 --GSDNDHEIEKIVSALKHPKFQLYLGRRSNPPAGPLMIETYDEENPLQVLEKLSWQAEP 163 Query: 181 IYSEESVTGHHLKFTARDEPM 201 Y + L D + Sbjct: 164 WYQKRLRAPKFLTRIIADAEL 184 >UniRef50_D0Y918 CRISPR-associated protein Cas5 family n=2 Tax=Dehalococcoides RepID=D0Y918_9CHLR Length = 205 Score = 179 bits (454), Expect = 7e-44, Method: Composition-based stats. 
Identities = 64/213 (30%), Positives = 106/213 (49%), Gaps = 19/213 (8%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 L++RL GPMQ+WG + R T PTRSG++GL+ A +GI RD+ A + ++ Sbjct: 6 TLLMRLEGPMQSWGYRSRFDCRDTALEPTRSGVIGLICAAMGIARDEDI---ARFDGIRM 62 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLT 123 VR D ++ DYHT L + +T+ ++R+YL DASFTV L + Sbjct: 63 GVRVDRDGKVEQ--------DYHTALDVIK--ADGSGKDTVVSYRDYLTDASFTVGLESS 112 Query: 124 PHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYS 183 ++ ++ KA++ P++ +LGR++ PLT P + S+P K E + + Sbjct: 113 --DRNLLEKIAKALVSPQWVLFLGRKAFPLTKPP---IFEFSNPVKPGSLEEHLLCGASA 167 Query: 184 EES-VTGHHLKFTARDEPMITLPRQFASREWYV 215 + + + T D P+ R+F R + V Sbjct: 168 KRVLLESPDGERTQYDWPLCFGERRFKPRRFTV 200 >UniRef50_A5GBK2 CRISPR-associated protein Cas5 family n=2 Tax=Deltaproteobacteria RepID=A5GBK2_GEOUR Length = 231 Score = 178 bits (451), Expect = 1e-43, Method: Composition-based stats. Identities = 61/225 (27%), Positives = 98/225 (43%), Gaps = 10/225 (4%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE-- 59 +S+L LRL GP+Q+WG + R TG PT+S + G+ A LG R + L Sbjct: 10 KSFLALRLEGPLQSWGFDSQYNRRNTGLMPTKSAIAGMCCAALGFLRGCDKEQEFLVAFG 69 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 +V+ + + V L+DYHTV R G +++ + T R+YL DA+F V Sbjct: 70 AVRMTAIAIPRNGAKKELPVRRLQDYHTVQNTRR-ASGAINNDCVLTHRQYLTDAAFGVL 128 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG 179 L + ++ ++ A+ P + +LGR++C T P+ G + D LL + Sbjct: 129 LE---GDSTLLKQIAAALENPVWGVWLGRKTCIPTAPVLAGLRENRDEALKLLLKDKPLE 185 Query: 180 DIYSEESVT---GHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 +E V T R F+ R ++ G D Sbjct: 186 SFARQEDVESFADGRDSLPDMPVSFATERRIFSPRRVRTLQ-GTD 229 >UniRef50_C7MTB0 CRISPR-associated protein Cas5 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTB0_SACVD Length = 255 Score = 178 bits (451), Expect = 1e-43, Method: Composition-based stats. 
Identities = 75/234 (32%), Positives = 107/234 (45%), Gaps = 36/234 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L+LRLAGP+Q+WG+ + R T FPT SGLLGLL +G +R + SL+ L+ ++ Sbjct: 1 MSGLLLRLAGPLQSWGERSTFDVRDTAGFPTHSGLLGLLACVMGRRRGE--SLEDLA-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGARE--------DYRGLK-SHETIQTWREYLC 112 F +R D T + DY T GA D +G T+QTWREYL Sbjct: 58 TFTIRVDR--------PGTRIIDYQTAGGALPPSMKVPTADGKGRPAGKGTVQTWREYLA 109 Query: 113 DASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALL 172 DA F VA+ + V+ ++ A+ P + PYLGRRSCP PL L DP L Sbjct: 110 DAVFVVAV---QGPSEVLDQVRHALRYPHWQPYLGRRSCPPDQPLLLDV-PVEDPVAELC 165 Query: 173 NYEPVGGDIYSEESV------------TGHHLKFTARDEPMITLPRQFASREWY 214 P+ + +E G + ++ R ++ R + Sbjct: 166 TRVPLARRVGKDEETVPVDFIFPVERRDGVRSEIHDVPVAFTSVDRAYSPRPVW 219 >UniRef50_D1A6Q5 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=D1A6Q5_THECD Length = 273 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 70/243 (28%), Positives = 102/243 (41%), Gaps = 44/243 (18%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+L L+GP+Q+WG+ + R T PTRSGL+G++ A G +R T + L ++ Sbjct: 6 TTGLLLHLSGPLQSWGERSRFNQRDTATAPTRSGLIGMIAAAFGRRR--TEPVTDL-RAL 62 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLG---------AREDYRGLKSHETIQTWREYLC 112 +F VR D T LRD+HTV G E R T+ + R YL Sbjct: 63 RFTVRIDR--------PGTLLRDFHTVGGGMPRDLTVITAEGKRRAADTATVTSDRYYLQ 114 Query: 113 DASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALL 172 DA+FTVA+ ++ +A+ PR+ YLGRRSCP PL L T +DP AL+ Sbjct: 115 DAAFTVAVTA--DDPALLDRCAQALRAPRWPLYLGRRSCPPNAPLLL-TVLRTDPVTALI 171 Query: 173 NYEP---------------------VGGDIYSEESVTGHHLKFTARDEPMITLPRQFASR 211 + S + + R+F +R Sbjct: 172 DLPLARTAPRDRGDVLVEFRSDTPFESRAWPSAPEDEQVYTEAQDEPVSFQPHHRRFQTR 231 Query: 212 EWY 214 Y Sbjct: 232 PIY 234 >UniRef50_A8ZZ17 CRISPR-associated protein Cas5 family n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZ17_DESOH Length = 259 Score = 177 bits 
(449), Expect = 2e-43, Method: Composition-based stats. Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 20/170 (11%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL+ RL GPM +WG+ TR T +P RS ++GL+ A LGI+R +T + QAL + Sbjct: 3 YLLFRLYGPMASWGEIAVGETRHTANYPGRSAIIGLMAAALGIKRSETENQQALDQGCLI 62 Query: 64 AVRCDELILDDRRVSVTGLRDYHT----------VLGAREDYR--GLKSHETIQTWREYL 111 AV + R + LRDYHT V R D G TI + REY Sbjct: 63 AV--------EARSHGSLLRDYHTTQVPDSVGGFVYRTRRDELIIGKPRLGTILSSREYR 114 Query: 112 CDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT 161 DA A+ + P A + ++ + +PR YLGR+SCPL+ P+ Sbjct: 115 QDALAVSAVRVLPGARYELQTIKTHLEQPRLHVYLGRKSCPLSAPMNPQI 164 >UniRef50_A8SDR7 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDR7_9FIRM Length = 220 Score = 177 bits (449), Expect = 2e-43, Method: Composition-based stats. Identities = 75/217 (34%), Positives = 102/217 (47%), Gaps = 24/217 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRLA P+QAWG + TR TGR PT+SG++GLL A LG++RD++ +L L+ + Sbjct: 1 MATLLLRLAAPLQAWGADSKFETRKTGREPTKSGVIGLLAAALGLRRDESEALTRLT-GL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 +F VR + L DYHT + + T+R YL DA F + Sbjct: 60 RFGVRVER--------EGQLLVDYHTA-------KTQDEKTSYVTYRHYLQDAVFLAGIE 104 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPV-GGD 180 T A + + L P + YLGRR CP T PL LG C S Q+ L P+ G Sbjct: 105 STDTALLQQLQQAL--LHPAFPLYLGRRCCPPTLPLCLGVCPGS-LQEVLQAEPPLCPGR 161 Query: 181 IYSEESVTGHHLKFTA--RDEPMITLP--RQFASREW 213 TA RD P+ P RQ+ R Sbjct: 162 QSRILLDADPLEPGTAPQRDVPVSFDPHHRQYGYRSV 198 >UniRef50_A7BA63 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA63_9ACTO Length = 242 Score = 177 bits (448), Expect = 3e-43, Method: Composition-based stats. 
Identities = 64/205 (31%), Positives = 94/205 (45%), Gaps = 23/205 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+LRLAGPMQ+WG + R T FPT+S L+GLLGA G +R D ++ L+E Sbjct: 1 MSAVLVLRLAGPMQSWGADSRFTRRSTEAFPTKSALVGLLGAAQGRRRSD--PIEDLAE- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 + AVR D+ L D+HT + R Y DA+F + Sbjct: 58 LSVAVRVDQ--------PGQLLHDFHTAHRG--------DTSMPLSHRFYRADAAFGAFI 101 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 +I L +A+++P + YLGRRSCP T PL L + S A+ + Sbjct: 102 EGPDD---MIDALAQAIVRPVFPLYLGRRSCPPTLPLRLAVREGSAW-DAVRETPWMAST 157 Query: 181 IYSEESVTGHHLKFTARDEPMITLP 205 Y ++ H ++ + I P Sbjct: 158 YYQKKQRHDHFVRMRVVADLGIIPP 182 >UniRef50_A6W168 CRISPR-associated protein Cas5 family n=6 Tax=Gammaproteobacteria RepID=A6W168_MARMS Length = 258 Score = 176 bits (446), Expect = 5e-43, Method: Composition-based stats. Identities = 71/189 (37%), Positives = 97/189 (51%), Gaps = 21/189 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ RL GPM +WGQP G R T PTRS +LGLLGA LGI+RDD L AL S Sbjct: 1 MKDYLVFRLYGPMASWGQPAVGGDRATAIAPTRSAILGLLGAALGIKRDDAQQLDALHSS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL-------------KSHETIQTW 107 VQ A + + + LRDYHT + + + + TI + Sbjct: 61 VQMATK--------QVTPTSLLRDYHTSQVPSRNNKYVYRTRKNELLDEHKEKLNTILST 112 Query: 108 REYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 R+Y CD + VA+ LT + + L++A++KP Y LGR+SCPL PL + Sbjct: 113 RDYRCDGIWIVAVSLTQESLFSLERLKQALIKPVYVLSLGRKSCPLAAPLLPVLLTSVSL 172 Query: 168 QKALLNYEP 176 ++AL P Sbjct: 173 REALDYPFP 181 >UniRef50_B8HWH8 CRISPR-associated protein Cas5 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH8_CYAP4 Length = 216 Score = 176 bits (446), Expect = 6e-43, Method: Composition-based stats. 
Identities = 53/217 (24%), Positives = 92/217 (42%), Gaps = 23/217 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+LR+ PM +WG + R + R PT+S ++GLL A LG R ++ L+ + Sbjct: 1 MPT-LLLRMRAPMMSWGDHSRFTIRDSRREPTKSAVIGLLCAALGRPR--WEAVADLT-A 56 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 ++ VR ++ L DYHTV + + T+ + R Y+ DA + V L Sbjct: 57 LKMGVRINQEGLVQC--------DYHTVQDSIK--SSGSKGNTVISHRYYIADADYLVGL 106 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 + + L+ A+ P + Y GR+S + P+ L +AL + + Sbjct: 107 EGS--DRHFLESLDSALQSPIWQVYFGRKSFVPSCPVALHVSDQP-LAEALKHRITLSKT 163 Query: 181 IYSEE------SVTGHHLKFTARDEPMITLPRQFASR 211 + + + +D P+ R F SR Sbjct: 164 MAHKLPNRLRCVLEVPDSLDVRQDVPLDWQKRHFGSR 200 >UniRef50_B1VIY0 CRISPR-associated protein n=9 Tax=Actinomycetales RepID=B1VIY0_CORU7 Length = 240 Score = 175 bits (445), Expect = 7e-43, Method: Composition-based stats. Identities = 59/218 (27%), Positives = 95/218 (43%), Gaps = 26/218 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M L+L L GPMQ+WG + R T PT+SG++GL+ A G +R D ++ L++ Sbjct: 1 MAHSLLLLLKGPMQSWGDESRFSVRATATTPTKSGIVGLIAAAQGRRRTD--GVEDLAK- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 ++ AVR D+ S + LRDY T + R +L DA+F A+ Sbjct: 58 LRMAVRVDQ--------SGSLLRDYQTAQ----PWLKNPGANASLVTRYFLSDAAFVAAV 105 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 ++ ++ +A+ +P Y Y+GRRSCP+ L +G D + AL ++ Sbjct: 106 E--SEDRELLDQMAEALRRPAYPLYMGRRSCPVHPGLVIGVVDG-DAESALRAHDTWHAT 162 Query: 181 IYSEESVTGHHLKFTARD--------EPMITLPRQFAS 210 + RD P +P F+ Sbjct: 163 AVHRKESPKKVSLAIYRDANPGEGGSVPRQDVPVSFSP 200 >UniRef50_B3E5U9 CRISPR-associated protein Cas5 family n=2 Tax=Desulfuromonadales RepID=B3E5U9_GEOLS Length = 271 Score = 175 bits (443), Expect = 1e-42, Method: Composition-based stats. 
Identities = 81/250 (32%), Positives = 114/250 (45%), Gaps = 37/250 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 YL+ RL GP+ +WG+ +R + +P +S LLGL+ A LGI+RD+ AL+ Sbjct: 1 MKYLLFRLYGPLASWGEIAVGESRHSAVYPGKSALLGLIAAALGIRRDEEQRQAALASGY 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLG----------AREDY--RGLKSHETIQTWRE 109 +FAV+ + LRDYHT R D G + TI + RE Sbjct: 61 RFAVKV--------ISTGHPLRDYHTAQAPDSVGKFVYRTRRDELVLGKERLGTILSSRE 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y CDA VA+ A + E+ +A++KPR+ YLGR+SCP+ PL A Sbjct: 113 YRCDAFSLVAVVAEDDAPYSLDEIREALMKPRFHLYLGRKSCPVAAPLNPLVRDAVGFGD 172 Query: 170 ALLNYEPVGGDIYS--------EESVTGHHL-------KFTARDEPMITLPRQFASREWY 214 AL +Y P G S +E V G L KF+ DE + +Q R ++ Sbjct: 173 ALDSY-PYGALFVSSWLMKTAQKEIVEGGKLAEVPSLAKFSREDETVFAYNKQPV-RYYW 230 Query: 215 VIKGGMDVSQ 224 G VSQ Sbjct: 231 EGDAGDLVSQ 240 >UniRef50_A1ARH6 CRISPR-associated protein, Cas5e family n=2 Tax=Bacteria RepID=A1ARH6_PELPD Length = 232 Score = 175 bits (443), Expect = 1e-42, Method: Composition-based stats. 
Identities = 58/204 (28%), Positives = 86/204 (42%), Gaps = 17/204 (8%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+LRL GPMQ+WG + R TG+ P++SG++GLL A LGI R++ L+ L+ Sbjct: 1 MPT-LLLRLVGPMQSWGTTSRFDQRDTGKEPSKSGVVGLLAAALGIDRENWVDLEPLT-C 58 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDY-----RGLKSHETIQTWREYLCDAS 115 + VR D RDY T A D + + R YL DA+ Sbjct: 59 LAMGVRHDR--------PGVPKRDYQTAGCASTDTIIKADGTQAKGGGVVSQRFYLADAA 110 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYE 175 F V L ++ + A+ P +T LGR+S + +++ P L Sbjct: 111 FLVGLEC--DDNCLLERIHVALHNPFWTLALGRKSYVPSESIWIVDGVRDAPLLETLKRY 168 Query: 176 PVGGDIYSEESVTGHHLKFTARDE 199 P S E L D+ Sbjct: 169 PWIASSRSREEPPERLLVSIESDD 192 >UniRef50_UPI0001AF1D4C CRISPR-associated protein, CT1976 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4C Length = 244 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 54/188 (28%), Positives = 83/188 (44%), Gaps = 20/188 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L++ PMQ+WG + +R T PT+SG++GLL A LGI RD +Q L+E + Sbjct: 1 MATLLMCFDAPMQSWGTRSQFASRDTATEPTKSGVVGLLAAALGIPRDADEEIQNLAE-L 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + VR D + + D+HTV K+H T T R YL DA F V + Sbjct: 60 RMGVRVDREGVVEA--------DFHTVQNVPNTE--GKNHRTAVTKRFYLADALFLVGVE 109 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL-FLGT------CQASDPQKALLNY 174 T ++ +L A+ PR+ Y GR++ P+ G AL + Sbjct: 110 --SDDTQLLHQLHTALTAPRWPLYFGRKAFVPARPIPSPGLAGEHHPVTGQSLDDALRTH 167 Query: 175 EPVGGDIY 182 + + Sbjct: 168 PWLENQLR 175 >UniRef50_D2L2X8 CRISPR-associated protein Cas5 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X8_9DELT Length = 266 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. 
Identities = 70/249 (28%), Positives = 98/249 (39%), Gaps = 45/249 (18%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YLI +L G + A+G R + PTRS + GLL ACLGI+R + + L ALS Sbjct: 1 MARYLIFQLYGMLAAYGLVAVGEVRLSAGHPTRSAVFGLLAACLGIRRHEEARLAALSGG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL---------------KSHETIQ 105 AVR D T L DYHT+ E + + + T+ Sbjct: 61 YALAVRVD--------APGTSLLDYHTIQTPPEKSKRIYRTRADELGGLLGIDEPPYTVL 112 Query: 106 TWREYLCDASFTVALW------LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL 159 + R YLCDA FT L + L +A+ +P TPYLGR+SCP + P Sbjct: 113 SRRGYLCDAHFTACLTPAAAPPTDATPPHTLEALAEALRRPVLTPYLGRKSCPPSLPFHP 172 Query: 160 GTCQASDPQKALLNY------------EPVGGDIYSEESVTGHHLKFT----ARDEPMIT 203 + + AL +Y ++++E T RD + Sbjct: 173 RLGEYDSLEAALADYPLEKLAFPAGLKPHDPAVVFADEDEAITPATVTSRPLVRDRTVQH 232 Query: 204 LPRQFASRE 212 R F R Sbjct: 233 GRRLFEERR 241 >UniRef50_C9M9R7 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R7_9BACT Length = 243 Score = 174 bits (440), Expect = 2e-42, Method: Composition-based stats. Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 19/183 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 +L+LR GPM ++G + RP RFP S + GL+ LG +T LQAL + + Sbjct: 1 MDFLVLRFRGPMMSFGDVAVDEQRPIDRFPGVSMVTGLVANALGWDWSETEKLQALQDRL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHT----------VLGAREDYRGLKSHETIQTWREYL 111 AVR D + LR+Y T V + R L T+Q + Y Sbjct: 61 VLAVREDR--------AGERLREYQTVALPGKSGLFVTHSIPCSRNLDKPMTVQKYLSYW 112 Query: 112 CDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLG-TCQASDPQKA 170 ++ T + LT T + E+ A+ KP +LGR++C T P+F G A P++A Sbjct: 113 ANSLITCFIALTGLGTPTLDEIACALKKPARPLFLGRKTCLPTEPVFRGEIFGAESPEEA 172 Query: 171 LLN 173 +L Sbjct: 173 VLR 175 >UniRef50_Q5YRB6 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5YRB6_NOCFA Length = 235 Score = 172 bits (435), Expect = 9e-42, Method: Composition-based stats. 
Identities = 63/231 (27%), Positives = 99/231 (42%), Gaps = 36/231 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRLA P+Q+WG + R T ++P++SG+LGL+ A G +R D ++ +++ Sbjct: 1 MTVLLLRLAAPLQSWGVASRFARRETQQYPSKSGILGLIAAARGHRRTD--PIEEALQNL 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 F VR D+ +RD+ L K+ + + R YL DA F A+ Sbjct: 59 AFGVRVDQ--------PGRLIRDFQVALNID------KTKQFPLSQRYYLADAVFLAAI- 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 +I + A+ +P + YLGRRSCP+T PL LG + + AL E Sbjct: 104 --QGERGLIEGIGNALRRPEFPLYLGRRSCPVTGPLVLGEPRDVTLEHALHETEWQAATW 161 Query: 182 YSEES---------------VTGHHLKFTARDEPMITLP--RQFASREWYV 215 Y L+ RD P+ P R++ R Sbjct: 162 YRRSQHRRVRLPIYRDLLPGDPVELLREQVRDMPLSFDPVRREYGWRTVVE 212 >UniRef50_B2GBJ9 Putative uncharacterized protein n=1 Tax=Lactobacillus fermentum IFO 3956 RepID=B2GBJ9_LACF3 Length = 235 Score = 172 bits (435), Expect = 1e-41, Method: Composition-based stats. Identities = 54/206 (26%), Positives = 83/206 (40%), Gaps = 23/206 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR--DDTSSLQALSE 59 L++R+A P+Q++G P R T R P++S ++G++GA LG +R DD SL L Sbjct: 1 MKTLVIRIAAPLQSYGDPASFEKRTTFRAPSKSAVIGMIGAALGFRRESDDYKSLNDLD- 59 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 FAVR D+ L D+ + + + R YL DA F VA Sbjct: 60 ---FAVRVDQ--------PGEVLSDFQITHYSLK-------KPGKLSHRIYLQDAVFMVA 101 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG 179 L A ++ E+E A+ P++ Y GRRS P L + C L Sbjct: 102 LSSKQDA--LMEEIEYALRHPKFQLYFGRRSNPPAGILKMKMCPDKTAINVLKELPWQAS 159 Query: 180 DIYSEESVTGHHLKFTARDEPMITLP 205 + + D ++ Sbjct: 160 VWFQRKYKKDVFNARIYADAKLVPDR 185 >UniRef50_Q2JWC5 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC5_SYNJA Length = 207 Score = 170 bits (430), Expect = 4e-41, Method: Composition-based stats. 
Identities = 53/217 (24%), Positives = 92/217 (42%), Gaps = 31/217 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L++RL PM +WG + R + R PT+S ++G+L A LG R + L+ ++ Sbjct: 1 MTTLLMRLRAPMMSWGDHSQFDYRDSRREPTKSAVIGILCAALGRPR--WEPVDDLA-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + VR ++ +D+HTV ETI + R Y+ D + V L Sbjct: 58 KMGVRVNK--------EGILCKDFHTVQ----------IKETI-SNRYYVADGDYLVGLE 98 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 ++ L++A+ KP + +LGR+S + PL +G + +AL + Sbjct: 99 ---GDPNLLRTLDQALQKPYWQVFLGRKSFIPSRPLRVGLVEQP-LLEALRQHPYECSRR 154 Query: 182 YSEES-----VTGHHLKFTARDEPMITLPRQFASREW 213 S + +D P+ PR+F R Sbjct: 155 GKRPSQLRFVLEVSESLDVRQDVPLSWQPRRFGCRAV 191 >UniRef50_Q47PI7 CRISPR-associated protein, Cas5e family n=12 Tax=Actinomycetales RepID=Q47PI7_THEFY Length = 245 Score = 169 bits (428), Expect = 6e-41, Method: Composition-based stats. Identities = 68/228 (29%), Positives = 97/228 (42%), Gaps = 36/228 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L L LAGP+QAWG + R T PT+SG+LGLL A G +R D L L+ ++ Sbjct: 1 MKVLTLLLAGPLQAWGAASRFTRRTTEHAPTKSGVLGLLAAAQGRERTD--DLSDLA-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 +F VR D+ T +RD+ T + + R YL DA F A+ Sbjct: 58 RFGVRVDQR--------GTRIRDFQTAIHLD------TGKSMPVSERFYLADAVFVAAVE 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 +I L +AV P Y PYLGRRSCP + P+ LG ++ L + + Sbjct: 104 ---GEDTLIDTLHQAVQHPVYLPYLGRRSCPPSRPINLGVHSGKPLEQVLAEEKWHAANW 160 Query: 182 YSEESVTGHHLKF--------------TARDEPMITLP--RQFASREW 213 Y + + + RD P+ P R++A R Sbjct: 161 YQRQLRDLPEVPLDLLVDAPPGDPGADSLRDLPISFDPVHRRYALRGV 208 >UniRef50_Q2JH27 CRISPR-associated protein, CT1976 n=6 Tax=Actinomycetales RepID=Q2JH27_FRASC Length = 276 Score = 167 bits (424), Expect = 2e-40, Method: Composition-based stats. 
Identities = 76/253 (30%), Positives = 99/253 (39%), Gaps = 56/253 (22%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 R L+LRLAGP+Q+WG + R T PT+SG++GLL A G +R D ++ L S+ Sbjct: 7 RHCLVLRLAGPLQSWGSRSMFNRRDTLTEPTKSGIIGLLAAAQGRRRTD--PIEDLL-SL 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGARE-------------DYRGLKSHETIQTWR 108 +R D+ T LRDYHTV R + T T R Sbjct: 64 TLGIRTDQ--------PGTLLRDYHTVSDYRGRPLPSAAVSAKGLQKPTSPAKHTHVTER 115 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 YL DA F AL V++ L A+ P + LGRR+CP THPL L S+P Sbjct: 116 FYLQDAVFVAALAAPE---PVLTTLADALRTPAFPLALGRRACPPTHPLLL--VPDSEPD 170 Query: 169 KAL--------LNYEPVGGDIYSEE-----------------SVTGHHLKFTARDEPMIT 203 AL L P + +V D P Sbjct: 171 AALWSGSALEVLRQVPWQARPDHRDALARRRPPRLRRIDLPVTVDDPDGDDVRIDLPTTF 230 Query: 204 LPRQ--FASREWY 214 P Q F SR + Sbjct: 231 DPHQRGFTSRRVH 243 >UniRef50_B4UE71 CRISPR-associated protein Cas5 family n=2 Tax=Anaeromyxobacter RepID=B4UE71_ANASK Length = 246 Score = 167 bits (424), Expect = 2e-40, Method: Composition-based stats. Identities = 58/188 (30%), Positives = 89/188 (47%), Gaps = 6/188 (3%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M LILR P+ A+G + G FP S + GL+ LG D L+AL Sbjct: 1 MLDALILRFDAPLLAFGGVAVDNHGEVGDFPGLSMVAGLIANALGYDHRDCDRLEALQRR 60 Query: 61 VQFAVRCD---ELILDDRRVSV--TGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 ++ AVR D + ++D + V++ L T GA E G S T +R Y DA Sbjct: 61 LRIAVRRDRSGQRLVDFQTVALGQPFLERGWTTRGAVEGRDGAFSDGTHIRYRAYWADAV 120 Query: 116 FTVALWLTPHAT-MVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY 174 +T+A+ L P A + +E+A+ +P +LGR++C + P+ G Q AL ++ Sbjct: 121 YTLAVTLDPPAESPGLDAVERALREPERPLFLGRKACLPSVPILAGRLQIPSLLAALASF 180 Query: 175 EPVGGDIY 182 E V D + Sbjct: 181 ERVSKDRW 188 >UniRef50_B8IZA7 CRISPR-associated protein Cas5 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA7_DESDA Length = 249 Score = 166 bits (421), Expect = 4e-40, Method: Composition-based stats. 
Identities = 58/225 (25%), Positives = 95/225 (42%), Gaps = 42/225 (18%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE----- 59 L LRL PM ++G + R T P++S + G+L A G+ R L + Sbjct: 8 LALRLQAPMLSFGNESRFNRRCTASLPSKSVVAGMLCAAKGLHRGSVEEQAFLQQVAAIP 67 Query: 60 SVQFAV-RCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 + A+ RC D ++ D+HTV G R+ G+K + T R YL D+SF V Sbjct: 68 MLSVAIPRCLSANGKDWLLAAGRTVDFHTVQGTRKAAGGIK--DCHITTRHYLHDSSFAV 125 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVG 178 L V+ + +A+ P + ++GR+ C + P+F Sbjct: 126 FL---NGPYRVLEDAARALQNPVWGLWIGRKCCIPSAPVF-------------------- 162 Query: 179 GDIYSEESVTGHHLKFTARDEPMITLPRQFAS--REWYVIKGGMD 221 G ++S E+V +H M+ P +F + RE + + G D Sbjct: 163 GGLFSSEAVALNH---------MLDAPLEFFTHEREVHSFEDGND 198 >UniRef50_C7LYW6 CRISPR-associated protein Cas5 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW6_ACIFD Length = 253 Score = 165 bits (417), Expect = 1e-39, Method: Composition-based stats. Identities = 74/238 (31%), Positives = 95/238 (39%), Gaps = 41/238 (17%) Query: 2 RSYLILRLAGPMQAWGQPTF-EGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 S L LRL GP+QAWG + R T RFPT+SG++GLL A LG R ++L L + Sbjct: 1 MSVLALRLGGPLQAWGSSQRLDHYRRTERFPTKSGVIGLLAAALGRPRS--AALDDLG-A 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE----------------TI 104 ++FAVR D LRD+HT+ +D + E T Sbjct: 58 LRFAVRIDR--------PGEVLRDFHTLSSLFDDKKRFAPGEGRLPTASGGYRSAATSTQ 109 Query: 105 QTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 T R YL DA F L + EL+ A+ P + YLGRRSCP PL LG Sbjct: 110 VTERFYLADACFVAGLE---GDAAQLQELDDALRTPVFPLYLGRRSCPPDKPLRLGVYDG 166 Query: 165 SDPQKALLNYEPVGGD-------IYSEESVTGHHLKFTARDEPMITLP--RQFASREW 213 L + D I E V D+ P R + R Sbjct: 167 -GLIDVLASIPWQANDPAQSATSIRCELVVENPAGDVELADQARSFDPLTRSYTRRRV 223 >UniRef50_Q03C60 CRISPR-associated protein n=4 Tax=Lactobacillus RepID=Q03C60_LACC3 Length = 236 Score = 163 bits (413), Expect = 4e-39, Method: Composition-based stats. 
Identities = 52/180 (28%), Positives = 80/180 (44%), Gaps = 20/180 (11%) Query: 7 LRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVR 66 +RL P+Q++G R TG +P++S ++G+L A LG QRDD ++ AL++ + FAVR Sbjct: 6 IRLTSPLQSYGNEAQFARRTTGDYPSKSAIIGMLAAALGYQRDD-PAINALNDLL-FAVR 63 Query: 67 CDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHA 126 D+ + ++ T K T+R+ L DA F VA+ Sbjct: 64 VDQ--------PGQVMTEFQTAE--------WKPGTRKLTYRDLLQDAVFVVAI--GSED 105 Query: 127 TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSEES 186 + L +A+ PR+ YLGRR+ L + T DP L Y S Sbjct: 106 EAWLDRLAEALRHPRFQLYLGRRANVPAGVLKIQTFAGQDPVGVLAQLPWQASRWYQRRS 165 >UniRef50_B8FDI0 CRISPR-associated protein Cas5 family n=3 Tax=Bacteria RepID=B8FDI0_DESAA Length = 240 Score = 163 bits (413), Expect = 4e-39, Method: Composition-based stats. Identities = 69/244 (28%), Positives = 98/244 (40%), Gaps = 33/244 (13%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 YL++ L P+Q+WG + G R T FPTRSG+LGLL LG + L L+ Sbjct: 3 SRYLLMWLEAPLQSWGADSKFGRRDTLPFPTRSGVLGLLLCALGASGEQKELLARLAPYG 62 Query: 62 QFAVRCDELILDDRRVSV------TGLRDYHTVLGAREDYRGLKS--------------H 101 Q + C S LRD+H V A D + Sbjct: 63 QTVISCAGGRPGRSGGSPEKIPRQPLLRDFHMVGSAYNDKDPWERLHIPKTNEGKPAVGG 122 Query: 102 ETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT 161 T+R YL DA F V L L P + + +A+ P + YLGR++C T ++ G Sbjct: 123 GAKLTYRYYLQDARFAVILELPPD---LAEDFAQALQNPVWDIYLGRKNCAPTEFVYQGV 179 Query: 162 C----QASDPQKALLNYEPVGGDIYSEESVTGHHL--KFTARDEPMITLP-RQFASREWY 214 A D AL+ + + D V G H T D P+ P +++ R Sbjct: 180 FDSQKDAMDRAAALMEEKELMEDF---RVVDGEHPGEPITLNDVPLQFGPMKKYRDRRVT 236 Query: 215 VIKG 218 VI+ Sbjct: 237 VIRN 240 >UniRef50_A8M404 CRISPR-associated protein Cas5 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8M404_SALAI Length = 238 Score = 163 bits (412), Expect = 4e-39, Method: Composition-based stats. 
Identities = 65/227 (28%), Positives = 98/227 (43%), Gaps = 35/227 (15%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRLAGP+Q+WG + R T PT+SG++G+L A G++R D L L S+ F Sbjct: 2 LLLRLAGPLQSWGATSRFTHRHTQVTPTKSGVIGMLAAASGLRRTD--PLTELL-SLDFG 58 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 VR D+ LRD+ T R YL DA F VA+ Sbjct: 59 VRIDQ--------PGQLLRDFQVARTLD------GRDSMPLTNRYYLSDAVFLVAI---G 101 Query: 125 HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSE 184 ++ L ++V +P + YLGRR+CP P+ LG + +AL ++ E Sbjct: 102 GDQALLEGLHESVRRPHFPLYLGRRACPPVAPISLGVHPGTV-DEALRDWPWQAAKRLRE 160 Query: 185 ------------ESVTGHHLKFTARDEPMITLP--RQFASREWYVIK 217 ++ G + T D+P+ P RQ+ R + Sbjct: 161 RGELTVPLEVVSDAPPGADVTETLPDQPISFDPAHRQYGWRAVVRTR 207 >UniRef50_D0MET6 CRISPR-associated protein Cas5 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET6_RHOM4 Length = 252 Score = 162 bits (410), Expect = 8e-39, Method: Composition-based stats. Identities = 59/234 (25%), Positives = 97/234 (41%), Gaps = 27/234 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L+LR P+ ++G P + +P S + GLL LG +T+ L+ L E + Sbjct: 1 MEILLLRFDAPLMSFGAPIVDQYGFIQPYPALSMMTGLLANALGYTHAETARLERLQERL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGARE---DYRGLKSHETIQTW----------- 107 ++AVR D LRD+ TV ++ D R + T++T Sbjct: 61 RYAVREDRR--------GQQLRDFQTVDLSQPFLHDERAWTTRGTLETRQGGTASLGIHI 112 Query: 108 --REYLCDASFTVALWLT-PHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 R+Y DA +TVAL L P +++LE+A+ P ++GR+ C PLF+G +A Sbjct: 113 RLRDYWADAVYTVALTLDPPDEPPTLADLEQALRFPARPLFIGRKPCLPAAPLFIGRVEA 172 Query: 165 SDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKG 218 +D AL P+ + P+ + R+ R + Sbjct: 173 ADLLDAL-RRAPLDARADRADFYRVWWETGPDDPPPVEGI-RENLRRPVTDRRD 224 >UniRef50_B8GIV3 CRISPR-associated protein Cas5 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV3_METPE Length = 257 Score = 162 bits (410), Expect = 9e-39, Method: Composition-based stats. 
Identities = 63/194 (32%), Positives = 84/194 (43%), Gaps = 22/194 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 YL L G M +WG RPT PTRS +LGLL A LGI+RD+ L AL+ + Sbjct: 4 PEYLTFSLYGMMASWGDIAVGEYRPTADHPTRSAVLGLLAAALGIRRDEEERLAALTRAY 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS-------------HETIQTWR 108 + A+R D LRDYHT +G + TI + R Sbjct: 64 KVAIRVD--------APGMLLRDYHTTQVPSAAKKGRQYLTRKDELAAPREVLNTILSTR 115 Query: 109 EYLCDASFTVALWLTPHATM-VISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 +Y CDA + V +W A + L + + P +T YLGR+SCPL P+ A D Sbjct: 116 DYRCDAVYRVYIWCRDTAPPYSLKTLAEHLQHPVFTLYLGRKSCPLALPVNPEVKTAPDL 175 Query: 168 QKALLNYEPVGGDI 181 AL + Sbjct: 176 LTALSEEREIELRF 189 >UniRef50_D1CAJ0 CRISPR-associated protein Cas5 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ0_SPHTD Length = 245 Score = 160 bits (404), Expect = 4e-38, Method: Composition-based stats. Identities = 65/238 (27%), Positives = 97/238 (40%), Gaps = 40/238 (16%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L+LRL GPMQAWG + R TG P++SG++GLL A LG R + + L+ + Sbjct: 1 MSTLLLRLTGPMQAWGTQSRFSWRDTGLEPSKSGVIGLLCAALGRPRS--APVDDLAR-L 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLG-AREDYRG-------LKSHETIQTWREYLCD 113 + VR D T D+HT G R G + + R YL D Sbjct: 58 RMGVRVDR--------EGTMHVDFHTAGGWHRRAEAGYGVPDPSGTARRPQISRRFYLAD 109 Query: 114 ASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP------ 167 A F V L ++ L++A+ PR+ +LGR+S P+ L P Sbjct: 110 ADFLVGLE---GDEELLVLLDRALAAPRWQLFLGRKSFVPAAPVRLPDTPPWGPGLRPEP 166 Query: 168 -QKALLNYEPVGGDIYSEESVTGHHLKF-----------TARDEPMITLPRQFASREW 213 + AL Y +G + + L+ D P+ R+F++R Sbjct: 167 LETALRTYPWLGYQLPHPRADAPDRLRLVLDAEGDDAADIRMDVPISFAERRFSTRAV 224 >UniRef50_Q1J367 CRISPR-associated protein, CT1976 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J367_DEIGD Length = 232 Score = 159 bits (402), Expect = 8e-38, Method: Composition-based stats. 
Identities = 60/229 (26%), Positives = 91/229 (39%), Gaps = 25/229 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRL PMQAWG + R T P++SG+LGL A LGI R D+ A + Sbjct: 1 MATLLLRLVAPMQAWGTRSRFDDRDTEAEPSKSGVLGLCAAALGIDRADSVEHLA---RL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 F VR D + DYHT G T T R YL DA+F L Sbjct: 58 AFGVRVDREGVAG--------TDYHTAQL----RPGNPRTRTDVTRRAYLADAAFWAGLE 105 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG-- 179 ++++L+ A+ P + LGR++ P + P+ G AL + Sbjct: 106 ---GDAGLLTDLDAALHNPHWPLSLGRKAFPPSLPICAGPPLEVSLWDALRTAPSLRWRD 162 Query: 180 -----DIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVS 223 + + L+ A +P +R Y+ + + V+ Sbjct: 163 DDEPYRLVLDREAVPQPLRAAASPSRRQDVPDGPFARRRYLSRDVLTVT 211 >UniRef50_Q0AA33 CRISPR-associated protein Cas5 family n=2 Tax=Gammaproteobacteria RepID=Q0AA33_ALHEH Length = 242 Score = 159 bits (401), Expect = 9e-38, Method: Composition-based stats. Identities = 56/211 (26%), Positives = 78/211 (36%), Gaps = 24/211 (11%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 LILRL P+ ++G + PT RFP RS L G+L LG DT +L +L + +A Sbjct: 4 LILRLDAPLMSFGGVLVDQHNPTDRFPGRSMLTGMLANALGWHHQDTEALNSLQARISYA 63 Query: 65 VRCDELILDDRRVSVTGLRDYHTV--------------LGAREDYRGLK-SHETIQTWRE 109 R D V LRDY TV GA E G Q R Sbjct: 64 ARWD--------VPPEPLRDYQTVDLGQTHLANPGWTTRGAPEHREGGTAKRGIHQRDRH 115 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y + TVA+ + P V + L A+ P ++GR++C P+ L ++ D Sbjct: 116 YWANGVMTVAVTVPPGEPNV-ATLAAALRHPARPLFIGRKACLPAAPVLLRVRESDDAYH 174 Query: 170 ALLNYEPVGGDIYSEESVTGHHLKFTARDEP 200 L + T Sbjct: 175 VLASEPRDPRAAADTRFFEACWPPGTTAPAT 205 >UniRef50_Q2FNU0 CRISPR-associated protein, CT1976 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNU0_METHJ Length = 225 Score = 153 bits (386), Expect = 5e-36, Method: Composition-based stats. 
Identities = 52/210 (24%), Positives = 81/210 (38%), Gaps = 38/210 (18%) Query: 35 GLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLG---- 90 +LG++ A LGI+RDD + L F+V + ++D+HT+ Sbjct: 15 AVLGMVAAALGIRRDDEEAQNRLQAGYGFSVMVLQ--------PGIMIQDFHTIQSVHSS 66 Query: 91 ---------AREDYRGLKSHETIQTWREYLCDASFTVALWLT--PHATMVISELEKAVLK 139 R D L ETI + REYLCD +W+ A + E+ + Sbjct: 67 SLKKMNHVMTRRDEMNLGDSETILSRREYLCDHVSVACVWIRDAESAQFSLEEIAASFRN 126 Query: 140 PRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY-----------EPVGGDIYSEESV- 187 P + YLGR+SCP P+ QA + AL+ + P +Y E+ + Sbjct: 127 PVFCLYLGRKSCPPALPVHARVIQADSLKSALVQHIEGFDLLNGFRVPDRVSLYFEDGID 186 Query: 188 ---TGHHLKFTARDEPMITLPRQFASREWY 214 + RD + QF+ R Y Sbjct: 187 IGFDDPVMVMKRRDNILSRSRWQFSDRNEY 216 >UniRef50_B4TTX2 CRISPR-associated protein Cas5 n=15 Tax=Enterobacteriaceae RepID=B4TTX2_SALSV Length = 241 Score = 152 bits (384), Expect = 8e-36, Method: Composition-based stats. Identities = 62/177 (35%), Positives = 85/177 (48%), Gaps = 19/177 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ +L P+ +WG+ R + PTRS LLGLL A LGI+RD+ + L + Sbjct: 1 MKEYLVFQLYAPLASWGEEASGEIRHSATVPTRSALLGLLAAALGIRRDEEARLNNFNRH 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL------------KSHETIQTWR 108 AV LRDYHTV RE+ + T+ + R Sbjct: 61 YHLAVHA-------LASQDRWLRDYHTVSAPRENKKYRYYTRRDELTLAPDEVGTLISQR 113 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 EY CD + VA+ TP A +SEL +A+L P + YLGR+SCPL PL + Sbjct: 114 EYRCDGYWHVAISATPDAPHSLSELREALLTPHFPLYLGRKSCPLALPLAARLMTGT 170 >UniRef50_A9HLC6 CRISPR-associated protein Cas5 family n=11 Tax=Acetobacteraceae RepID=A9HLC6_GLUDA Length = 260 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. 
Identities = 56/191 (29%), Positives = 78/191 (40%), Gaps = 21/191 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M +L + PM ++G R P RS +LGL+ ACLG+ RDD + AL+ Sbjct: 1 MGQFLTFAMVAPMASFGAIAVGERRDGWDRPARSAVLGLMAACLGLTRDDEDAQAALAAD 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLG--AREDYRGLKSHE----------TIQTWR 108 A+ C L DYHT AR ++R E TI + R Sbjct: 61 YGLAILC--------HAPGKLLTDYHTAQAAPARRNWRPATRAEELAASPGDLATILSRR 112 Query: 109 EYLCDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 +Y A+W + A + L+ A+ +P +TP LGRRSCP PL Sbjct: 113 DYRMGTWHLGAVWTSGKTARWSLEALQAAMREPVFTPSLGRRSCPAGLPLAPSVTDGVSA 172 Query: 168 QKALLNYEPVG 178 LL+ G Sbjct: 173 AAVLLDRHRNG 183 >UniRef50_Q0BSC7 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC7_GRABC Length = 225 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 65/231 (28%), Positives = 98/231 (42%), Gaps = 43/231 (18%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQALSESVQ 62 +L+ LA + + G+ R + +PTRS ++GL+GA LGI+RD D S+L LS V Sbjct: 6 FLVFGLAASLGSMGELAGHERRGSLIWPTRSAIIGLMGAALGIERDGDFSALDVLSIDVA 65 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLG--------------AREDYRGLKSHETIQTWR 108 + LRDYHT+ A D RG T T R Sbjct: 66 I------------FDAGAPLRDYHTIETIPSAAAKNPNSRPEALRDARGRT--NTAITHR 111 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 +Y + +A+ + + A+L+P +T Y+GR+SCPL P +A + Sbjct: 112 DYRTSVFYGIAVRGAG-----LERIVAALLEPHFTLYVGRKSCPLAAPTGAKIVEAVSAE 166 Query: 169 KALLNYEPVGGDIYSEESVTGHHLKF------TARDEPMITLPRQFASREW 213 AL E + ++ +ESV H L D P+ FA+R Sbjct: 167 AAL---EHLKAPLWRKESVKAHLLVTDDPEGEVVTDVPLDRSSWHFATRRV 214 >UniRef50_UPI0001B51C2B CRISPR-associated Cas5 family protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2B Length = 278 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. 
Identities = 68/272 (25%), Positives = 98/272 (36%), Gaps = 74/272 (27%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M S L+LRLAGP+Q+WG R T PT+SG+ GL+ A LG+ R D L AL++ Sbjct: 1 MTSVLLLRLAGPLQSWGALARFDRRDTLNRPTKSGVTGLVAAALGLDRAD--DLGALTD- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGA-------------REDYRGLKSHET---- 103 ++FAVR D T +RD+H V R + + ET Sbjct: 58 LRFAVRADR--------PGTAVRDFHIVGSGTYPLRPRDLITDHRRAEKAAAALETSTGP 109 Query: 104 --------------------------------------IQTWREYLCDASFTVALWLTPH 125 + T R YL DA+F A+ Sbjct: 110 VFGHLAARSVTKWYGAPKEIAPDPKTGVLLAGNTTRDAMMTTRWYLADAAFVAAVE--HP 167 Query: 126 ATMVISELEKAVLKPRYTPYLGRRSCPL----THPLFLGTCQASDPQKALL--NYEPVGG 179 ++ + AV P+ +LGR+SCP + + GT + ALL P Sbjct: 168 DQNLLHRISHAVEHPKRLLWLGRKSCPPSGTISGGVHPGTAETILTTTALLPNATSPQPW 227 Query: 180 DIYSEESVTGHHLKFTARDEPMITLPRQFASR 211 T + T + R +R Sbjct: 228 AWIEAAPGTPGAAQRTDQPVTYHPEHRTHTAR 259 >UniRef50_B8IMR2 CRISPR-associated protein Cas5 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IMR2_METNO Length = 273 Score = 143 bits (361), Expect = 4e-33, Method: Composition-based stats. Identities = 51/187 (27%), Positives = 70/187 (37%), Gaps = 23/187 (12%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+ L P G R + P RS +LGL+ LGI R D + AL Sbjct: 1 MPAGLVFTLYAPFAGMGDVAVGEERGSFDRPARSAVLGLVAGALGIDRADEAGHAALDRG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTV----------LGAREDYRGLKSHETIQTWREY 110 + A+R R + DYHTV R + + T+ + R Y Sbjct: 61 YRLALRL--------RTPGCLVEDYHTVQAPPVDRKARWATRREALAVAGLNTLVSRRAY 112 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL----GTCQASD 166 D V L + L A+ +P + PYLGR+SCPL PL G + Sbjct: 113 RADPIVDVVL-IHVDEGPTPEALATALRRPTFAPYLGRKSCPLGLPLRPLWAEGVTRVGS 171 Query: 167 PQKALLN 173 AL Sbjct: 172 LLAALDE 178 >UniRef50_C5V9N1 CRISPR-associated protein Cas5 n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N1_9CORY Length = 223 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. 
Identities = 58/234 (24%), Positives = 89/234 (38%), Gaps = 41/234 (17%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTR-PTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M L +RLAGP+Q+W P G T PTRSGL+GLL G R + Sbjct: 1 MTEALYIRLAGPLQSWAGPAITGNFVRTEPRPTRSGLVGLLAGACGYGRGEYPEWLT--- 57 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTV----------------LGAREDYRGLKSHE- 102 + F +R D T + D+HT+ +G R + L S Sbjct: 58 QLHFQIREDNR--------GTLVDDFHTINPRDTEEEFRSRLLLAMGQRPTKKLLNSTPD 109 Query: 103 ----TIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 T T R Y+ D F V + + L + + +P + YLGR++ + P + Sbjct: 110 GQGLTAITERTYIADGEFIVQIKAGSREHQEL--LAEKLQQPHFVTYLGRKAFAPSFPFY 167 Query: 159 LGTCQASDPQKALLNYEPVGGDIYSE--ESVTGHHLKFTARDEPMITLPRQFAS 210 LG + P L VGG+ + +T P++ Q+ + Sbjct: 168 LG----AGPDDTLARIPTVGGEEPKKILRFYALDDYGYTTTTVPVVKDRNQWLT 217 >UniRef50_B6B783 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B783_9RHOB Length = 232 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 59/205 (28%), Positives = 81/205 (39%), Gaps = 32/205 (15%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQALSE 59 M YLI +L + A G+ R + P RS ++G LGA +G++RD D S L AL Sbjct: 1 MPEYLIFQLVAAIGAMGEFGGHDRRGSLTLPGRSAVIGTLGAAMGLRRDADFSGLDALGV 60 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDY------------RGLKSHETIQTW 107 +V RDYHTV + T T Sbjct: 61 AVA------------SFGKTAPFRDYHTVQTVPSAAVKRPQSRPQALRDAGRKVNTTLTS 108 Query: 108 REYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 R+Y D F VA+W ++EL A+ P + +LGR+SCPL+ P A+ P Sbjct: 109 RDYRADCVFGVAIWGEG-----LAELASALSAPVFQTFLGRKSCPLSAPFDPQIVAAATP 163 Query: 168 QKAL--LNYEPVGGDIYSEESVTGH 190 AL L P G + V Sbjct: 164 SAALSQLRLPPWIGAREMDMIVADE 188 >UniRef50_C2KP44 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP44_9ACTO Length = 245 Score = 137 bits (346), Expect = 2e-31, Method: Composition-based stats. 
Identities = 55/184 (29%), Positives = 76/184 (41%), Gaps = 23/184 (12%) Query: 1 MRSYLILRLAGPMQAW-GQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M S +RLAGP+Q+W G T +PTR L GL+ ACLG R + Sbjct: 1 MTSV-YIRLAGPLQSWAGAKVSGNISHTQDYPTRGSLEGLVAACLGCPRGKYPLW---FQ 56 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHT---------VLGAREDYRGLK-----SHETIQ 105 +QFAVR D G+RD + G R RGL +T Sbjct: 57 DLQFAVRVDSPGRICDDYQTIGVRDEDMQVATRLLTLLTGKRATNRGLAFIPDAQGKTTI 116 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 R L DA F V + H + +L++A+ P + YLGR++ P +LG + S Sbjct: 117 VRRTLLADAEFIVQIQCEGH----LEQLDQAISDPTFVSYLGRKAFAPGFPFYLGIGEDS 172 Query: 166 DPQK 169 Sbjct: 173 AIDT 176 >UniRef50_B6IWM3 CRISPR-associated protein, CT1976 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM3_RHOCS Length = 280 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 57/194 (29%), Positives = 82/194 (42%), Gaps = 28/194 (14%) Query: 3 SYLILRLAGPMQAWGQPTFEG----TRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 ++L LA P AWG + + T P+RS L GLLGA LG++R + L LS Sbjct: 6 AHLCFTLAAPYGAWGAASQSSATTAWKATELDPSRSALTGLLGAALGLER---AHLGRLS 62 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGA---------------REDYRGLKSHET 103 E+++FAVR D + DYHT+ A R G K Sbjct: 63 EALRFAVRTGIRPTRDPQP------DYHTISRAHRPEGREHWSRFEELRPALAGGKQEGA 116 Query: 104 IQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ 163 + + REY +TVA+ A + + L +A+ P + Y GR++C L P Sbjct: 117 LLSRREYWSLGLWTVAVATLNPAGVPLDRLAQALRTPHWPLYAGRKACTLGLPPDPEVRT 176 Query: 164 ASDPQKALLNYEPV 177 P LL+Y Sbjct: 177 GPGPLSVLLDYGWP 190 >UniRef50_B5GY62 Crispr-associated protein (Fragment) n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY62_STRCL Length = 260 Score = 135 bits (341), Expect = 9e-31, Method: Composition-based stats. 
Identities = 61/186 (32%), Positives = 84/186 (45%), Gaps = 28/186 (15%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 L+LRLAGP+Q+WG + +R TG PT+SG++GLL A R + ++ L +++ Sbjct: 85 VLLLRLAGPLQSWGSASAFNSRQTGAEPTKSGVIGLLAAA--DGRARGACIEDL-RALRL 141 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGARE-------------DYRGLKSHETIQTWREY 110 VR D S T LRDYHT R + T T R Y Sbjct: 142 GVRVDR--------SGTLLRDYHTASDHRGRPLAQAGVGAKGTQRPTSPAKYTQVTTRYY 193 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKA 170 L DA F AL ++ L++AV P + LGRRSC + PL LG S + Sbjct: 194 LQDAVFLAALA---GPRALLDRLDRAVRAPAFPLALGRRSCVPSLPLALGVHPGS-LGEV 249 Query: 171 LLNYEP 176 L + Sbjct: 250 LSTHPW 255 >UniRef50_C2GEY8 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY8_9CORY Length = 242 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 50/189 (26%), Positives = 80/189 (42%), Gaps = 37/189 (19%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTR-PTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M S +RL+GP+Q+W + G T PT +GL GLL LG +RD+ + Sbjct: 1 MPSSTFIRLSGPIQSWAGQSVSGNFIRTNPIPTLTGLRGLLAGALGARRDE---IPEWIS 57 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDY---------RGLKSHET------- 103 V+F+VR D+ + + + D+ T+ E++ +G+K+ Sbjct: 58 KVRFSVREDQ--------TGSFVDDFQTIGSREEEWDFRRRIAILQGMKARSIKQLSFKP 109 Query: 104 -----IQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 R YL +A F V + H E++ A P + YLGR++ P P + Sbjct: 110 AVGANAVVRRTYLSEAEFIVRVTDERHT----EEIDHAFSSPVFATYLGRKAFPAAFPFY 165 Query: 159 LGTCQASDP 167 LGT Sbjct: 166 LGTGNEDVL 174 >UniRef50_Q2RXJ5 CRISPR-associated protein, Cas5e family n=5 Tax=Proteobacteria RepID=Q2RXJ5_RHORT Length = 249 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. 
Identities = 51/209 (24%), Positives = 83/209 (39%), Gaps = 8/209 (3%) Query: 1 MRS--YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 M +LI+ L P+ A+G + T FP S L GL LG +R + Q L Sbjct: 1 MPEHRWLIVHLEAPLLAFGGVAIDNVGVTRDFPAASMLTGLFANALGWRRTEWERHQRLQ 60 Query: 59 ESVQFAVRCDE-----LILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCD 113 + + FA R + ++ D + ++ T G E G + R+Y D Sbjct: 61 DRLIFAARRERENPTGVLTDTQNAKLSKTERGWTTWGEPEGRDGASYGAPHRRRRDYHGD 120 Query: 114 ASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT-CQASDPQKALL 172 AS VAL L + +L A+ +P ++GR+SC + PL A+ +AL Sbjct: 121 ASVVVALRLDAAEEPALDDLAAALDRPARPLFIGRKSCVPSRPLRGKEFVVAATAYQALQ 180 Query: 173 NYEPVGGDIYSEESVTGHHLKFTARDEPM 201 G D + + + + P+ Sbjct: 181 ALRSDGNDRQRDGAAAERRAVWPVGEGPV 209 >UniRef50_B5F422 CRISPR-associated protein Cas5 n=59 Tax=Enterobacteriaceae RepID=B5F422_SALA4 Length = 248 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 19/161 (11%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YL+ +L GPM +WG R + P+RS LLGLL A LGI+RD+ L A + Sbjct: 1 MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNAFNRH 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAR--EDYRGLKSHE---------TIQTWRE 109 QF L + RDYHTV + R E + + R+ Sbjct: 61 YQF--------LLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRD 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRS 150 Y DA + +A+ TP A +++L+ A+ P + YLGR+S Sbjct: 113 YYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKS 153 >UniRef50_Q6NEQ9 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ9_CORDI Length = 242 Score = 129 bits (324), Expect = 8e-29, Method: Composition-based stats. 
Identities = 49/184 (26%), Positives = 76/184 (41%), Gaps = 22/184 (11%) Query: 1 MRSYLILRLAGPMQAW-GQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M +RL+GP+Q+W G T PT S L GLL LG +R + + + Sbjct: 1 MIESAYIRLSGPLQSWAGSVVTGNIVRTEPRPTFSSLRGLLAGALGARRGEWPNWL---D 57 Query: 60 SVQFAVRCDE-LILDDRRVSVTGLRDYHTVLGAREDYRGLKSH-------------ETIQ 105 V+F VR D I+ + ++ L + T +G K++ T Sbjct: 58 DVEFWVREDRKPIVVNEFQTINPLPEVETFRKRLLIAQGRKANSAKALTFTPDAQGGTSI 117 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 R YL D + V + + H + E+E A P + YLGR++ P +LG A Sbjct: 118 VNRTYLADGEYLVRVTSSTH----MDEIENAFSSPAFVTYLGRKAFYAEFPFYLGRGSAD 173 Query: 166 DPQK 169 +K Sbjct: 174 AFEK 177 >UniRef50_C0W6U0 CRISPR-associated Cas5 family protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6U0_9ACTO Length = 201 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 45/168 (26%), Positives = 67/168 (39%), Gaps = 21/168 (12%) Query: 39 LLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL 98 +L A +G +R D ++ L S++F VR D+ T LRD+HT Sbjct: 1 MLAAAVGRRRTD--PIEDLL-SLRFGVRKDQ--------PGTVLRDFHTARTLD------ 43 Query: 99 KSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 + R YL DA + A+ ++ L+ AV P + YLGRRSCP + PL Sbjct: 44 GKQSMPLSERYYLADAVYLAAIE---GEKTLLEGLDVAVRHPVFPLYLGRRSCPPSQPLS 100 Query: 159 LGTCQASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPR 206 LG AS +AL + D + + + R Sbjct: 101 LGIRHAS-LLQALTDEPWQAADWFRLRQDNSFRAEIVIDAASLAPDER 147 >UniRef50_B6XT64 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT64_9BIFI Length = 212 Score = 119 bits (298), Expect = 8e-26, Method: Composition-based stats. 
Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 28/190 (14%) Query: 39 LLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLG-AREDYRG 97 +L + G R+D ++ L + F VR ++ +RD T R+ Sbjct: 1 MLASAQGRTRED--PIEDLL-GISFGVRVEQR--------GRVIRDLQTEKSLTRKRNSR 49 Query: 98 LKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 E T+R YL DA F VAL V+ L++A+ P++ YLGRRSCP +PL Sbjct: 50 KFDKEMPLTYRYYLADACFLVALGA---DRSVLEMLDEAIHSPKWPLYLGRRSCPPNYPL 106 Query: 158 FLGTCQASDPQKALLNYEPVGGDIYSEESVTGHHLKF-----------TARDEPMITLP- 205 LG + + LN E + L+ T D P+ Sbjct: 107 SLGIHDEYEDIRQALNSETWHASEWYRRRYRYPDLEIVCDAEKGENITTQSDLPLSFSRE 166 Query: 206 -RQFASREWY 214 R++A+R + Sbjct: 167 GRRYANRAVH 176 >UniRef50_B8IJS9 CRISPR-associated protein Cas5 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IJS9_METNO Length = 253 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 47/178 (26%), Positives = 72/178 (40%), Gaps = 7/178 (3%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 MR +L+L L P+QAWG + P FP + + GL+ LG R D L+AL E Sbjct: 1 MREHLLLLLEAPLQAWGGVLVDAYGPVDEFPAATLVGGLVANALGYDRADWQRLEALQER 60 Query: 61 VQFA---VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS--HETIQTWREYLCDAS 115 + +R I D++ + T G E G + +R+Y D Sbjct: 61 LVVGAAVLRRGSTITDNQNAKLEKGDVGWTTRGRPEGRGGGAEAYKSPHRRFRDYHADTL 120 Query: 116 FTVALWLTPHAT-MVISELEKAVLKPRYTPYLGRRSCPLTH-PLFLGTCQASDPQKAL 171 VAL L P + + + P +LGR+ C + ++ +A KAL Sbjct: 121 ALVALRLDPEDEKPDLDAIAHTLEWPERPLFLGRKPCLPSRSIVWPERMRAETLLKAL 178 >UniRef50_A0LM54 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM54_SYNFM Length = 210 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. 
Identities = 39/196 (19%), Positives = 71/196 (36%), Gaps = 39/196 (19%) Query: 46 IQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQ 105 ++R D ++L + VR D +D+ T G + Sbjct: 26 LRRGDLAAL-------RMGVRVDR--------EGLLRKDFQTAQNVIVAGGGSVGD--LV 68 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL--GTCQ 163 + R +L DA+F V L M++ L ++ PR+ +LGR+S + P ++ G + Sbjct: 69 SNRYFLSDAAFLVGLE---GDFMLLHRLHASLAHPRWPVFLGRKSYVPSIPPYIKNGLLE 125 Query: 164 ASDPQKALLNYEP---VGGDIYSEESVTGHH--------------LKFTARDEPMITLPR 206 ++ AL ++ P V + V RD+PM Sbjct: 126 GAELMSALASFTPLISVEELAARRKRVESGRRVERTRFVLESASPTHEIRRDQPMSFALG 185 Query: 207 QFASREWYVIKGGMDV 222 Q + +V+ +DV Sbjct: 186 QRVFHDRFVVTEYLDV 201 >UniRef50_UPI0000F51765 hypothetical protein Faci_00030 n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0000F51765 Length = 237 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 52/149 (34%), Gaps = 21/149 (14%) Query: 1 MRSYLILRL--AGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 M+ ++R+ G + ++ P T P ++ ++G++ A +G RDD +++L Sbjct: 1 MKEIKLIRINAYGIINSFRIPLHMTIHDTLDLPVKTHIIGMIAAAMGYLRDDKEKIESLY 60 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 ++ + + + + + K++ TI Y+ Sbjct: 61 KNTSIGIYGTSYSKFYDLIRIYKYKGKEVEVSLVNRQINYKNNYTI-----YI------- 108 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLG 147 + E+ + P + LG Sbjct: 109 -------ENNNLEEIYNFLKNPVFALSLG 130 >UniRef50_B1I5P1 CRISPR-associated protein Cas5 n=2 Tax=Clostridia RepID=B1I5P1_DESAP Length = 238 Score = 50.0 bits (118), Expect = 6e-05, Method: Composition-based stats. 
Identities = 40/164 (24%), Positives = 62/164 (37%), Gaps = 29/164 (17%) Query: 38 GLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRG 97 GL GA G+ + Q + ++ D L+ RD TV+ + + Sbjct: 26 GLAGAARGL--AEEELWQE-ASPLR-----DLLVATLALQKPGLARDMWTVMKIKNNKLA 77 Query: 98 LKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGR--------- 148 +S +RE L +A F + L +++EL++A L P Y LGR Sbjct: 78 ERSPY----FREILFNARFMI---LYGGPEELLAELQQAFLDPTYPLSLGREDELIVVEE 130 Query: 149 ----RSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSEESVT 188 +CP PLF GT D Q + P G + +V Sbjct: 131 LGRGETCP-GAPLFSGTVIPGDLQGLRFKWVPRPGIAFEPPAVE 173 >UniRef50_Q3AA65 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AA65_CARHZ Length = 248 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 78/235 (33%), Gaps = 33/235 (14%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGR------FPTRSGLLGLLGACLGIQRDDTSSL 54 MR ++ + GP T FP R+ L G++ A LG+ +D SL Sbjct: 1 MRKVIVFEIRGP------AAHFRKFYTNSSSLSYAFPARTTLAGIIAAVLGLPKDSYYSL 54 Query: 55 ---QALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAR--EDYRGLKSHETIQTWRE 109 +++ +++ + V L++ + G L +R Sbjct: 55 LTGNKAHYALRLMTPVRKIMQTVKFVRTKTLKEVNGSGGPTMIPTEIILPVRGRELIYRV 114 Query: 110 YLCDASFTVALWLTPHATMVISELEKAV-LKPRYTPYLGRRSCPLTHPL---FLGTCQAS 165 Y EL K + L P++ YLGR F GT + Sbjct: 115 Y-----------FYHDDPSFQEELAKQLALGPKFPVYLGRSEFLAKIDFLGVFPGTALET 163 Query: 166 DPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLP-RQFASREWYVIKGG 219 + ++N E + + ++ G L++ P P R AS V + G Sbjct: 164 NFVDTVVNLELLKDEELLFSTLHGEDLRYLKEKMPFSFNPDRSIASTASVVYEIG 218 >UniRef50_B5IGN1 CRISPR-associated protein Cas5 n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IGN1_9EURY Length = 236 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. 
Identities = 23/150 (15%), Positives = 50/150 (33%), Gaps = 21/150 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGI-QRDDTSSLQALSES 60 L +L ++ + T P + ++GLLG+ LG+ R+ + L++ Sbjct: 1 MEVLTAKLRAISVSFRRILDFNYHRTYPLPPPTTIVGLLGSALGLSDRELWNEYNGLND- 59 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWR--EYLCDASFTV 118 + FAV +D ++ + + R + + Sbjct: 60 ISFAVLSLR--------KPGFAKDMWSIQKIKNGR---------ISERSPYFRELLFYPE 102 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGR 148 + + + + + A+ P Y LGR Sbjct: 103 YVLIFNGDSKSLECVRDALNNPEYALSLGR 132 >UniRef50_B0K553 CRISPR-associated protein Cas5, Hmari subtype n=4 Tax=Thermoanaerobacter RepID=B0K553_THEPX Length = 236 Score = 47.3 bits (111), Expect = 3e-04, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 57/170 (33%), Gaps = 34/170 (20%) Query: 16 WGQPTFEGTRPTGR------FPTRSGLLGLLGACLGIQRDD-TSSLQALSESVQFAVR-- 66 WG+ T P R+ + G++ A LG +RD L A E++ AVR Sbjct: 7 WGKFAHFRKFYTNSSSLTYSVPPRTTVEGMIAALLGYERDTYYEKLNA--ENLYVAVRKM 64 Query: 67 --CDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 +++ + T L + H + L ++ +R Y+ Sbjct: 65 SKTKKIMQSVNYIKATTLGELHFPKQHTQIPFELLISDSKIRYRFYI-----------IH 113 Query: 125 HATMVISELEKAV--LKPRYTPYLGRRSCPLTHPL--FLGTCQASDPQKA 170 + E+++ + P + Y G + P ++ + A Sbjct: 114 KDENIFREIKERLFKKAPVFPLYFG------SAPFSCYIDYVEEVTWDWA 157 >UniRef50_Q1AZD3 CRISPR-associated protein, Cas5h family n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AZD3_RUBXD Length = 258 Score = 46.1 bits (108), Expect = 9e-04, Method: Composition-based stats. 
Identities = 38/190 (20%), Positives = 59/190 (31%), Gaps = 34/190 (17%) Query: 27 TGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYH 86 + FP R+ L GL+ +G +RD + +L E Q AV + + + + H Sbjct: 29 SYPFPPRTTLAGLIAGMMGCERDSYAEDLSL-ERCQIAVSVITPVRRVMQQVNYVMTEGH 87 Query: 87 TVLGAREDYRGLKSHETI-------------QTWREYLCDASFTVALWLTPHATMVISEL 133 + G + +R Y T + L Sbjct: 88 VWTKNTGGFDGSSGPIQVPVEWVFPEVGHRELRYRVY-----------ATHEDRGWLKRL 136 Query: 134 EKAVLK--PRYTPYLGRRSCP-------LTHPLFLGTCQASDPQKALLNYEPVGGDIYSE 184 + + P Y PYLG CP LG + P + +L E V G E Sbjct: 137 AEILEGGVPIYPPYLGMSECPGRVEHVATLEGWGLGHREDELPVRTVLPSEAVSGPPRLE 196 Query: 185 ESVTGHHLKF 194 E V + Sbjct: 197 EGVQIVKERI 206 >UniRef50_D1N0K0 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0K0_9BACT Length = 239 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 69/242 (28%), Gaps = 36/242 (14%) Query: 2 RSYLILRLAGPMQAWGQP--TFEGTRPTGRFPTRSGLLGLLGACLGI--QRDDTSSLQAL 57 S LI LAG M W + G P L G++GA LG R + Sbjct: 1 MSILIFELAGEMAMWRNVYESMGSYSCLGPAPGN--LAGVIGAALGFASPRSQAAEKPDA 58 Query: 58 SESVQFAVRCDELILDDRRVSVTGLRDYHTV---LGAREDYRGLKSHETIQTWR------ 108 + + + ++ D+H +G + + R Sbjct: 59 KQLKNWDKAGLPWPVSPELLAWEETNDFHVACRWIGKFPKRVPWNINGCKEINRSDNLRL 118 Query: 109 --EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSC------------PLT 154 + + D ++ VA+ L + + A+ KP + LG C Sbjct: 119 QQQVILDPAYEVAVALPAYEA---ERVAAALRKPAFPLCLGASFCRAIVRNVRIEDAVPE 175 Query: 155 HPLFLGTCQASDPQKALLNYEPV--GGDIYSEESVTGHHLKFTARDEPMITLPRQFASRE 212 P + +A + G + G+ + T +I + R Sbjct: 176 SPFWAFRTDGGALGEATPFSRHIVNPGSCFERIRSDGYWIYPTPDQPGVIAA--EPLVRG 233 Query: 213 WY 214 W Sbjct: 234 WV 235 >UniRef50_A3DHS3 CRISPR-associated protein Cas5 n=3 Tax=Clostridium thermocellum RepID=A3DHS3_CLOTH Length = 241 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. 
Identities = 32/143 (22%), Positives = 51/143 (35%), Gaps = 17/143 (11%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 LI+ L ++ P + T P S + G+ GA LG+ +D AL+ + A Sbjct: 4 LIVTLYAKTASFRDPGAQLYHETMSLPPPSTITGIAGAALGLSFED-----ALAFMKENA 58 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 V + R +D + + G ++ I R +L D + Sbjct: 59 VMVGCNGSSEGRG-----KD---LWNYTKIKSGEITNAIII--RNFLADLKVEIFFAC-- 106 Query: 125 HATMVISELEKAVLKPRYTPYLG 147 VI+ L A P Y LG Sbjct: 107 EKREVITRLADAFENPVYAITLG 129 >UniRef50_D0MJ68 CRISPR-associated protein Cas5 n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MJ68_RHOM4 Length = 236 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 36/144 (25%), Positives = 53/144 (36%), Gaps = 21/144 (14%) Query: 9 LAGPMQAWGQPTFE-GTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAV-- 65 LAGP+ ++ P F G +PT P S + GL+ A LG +AL + +F Sbjct: 13 LAGPVASFRYPHFLIGRQPTYPMPPPSTIYGLISAALGR----FPDPEALQFAYRFECAR 68 Query: 66 -RCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 R D++ T R E R + RE+L T+ + Sbjct: 69 HRVDDVETIWFVQPNTATR--------GEAARKNLEATSNILPREWLVHPRLTLYVTGDE 120 Query: 125 HATMVISELEKAVLKPRYTPYLGR 148 + L +A P Y LGR Sbjct: 121 -----LEALYRAFRSPCYILTLGR 139 >UniRef50_C9RCY2 CRISPR-associated protein Cas5, Hmari subtype n=2 Tax=Thermoanaerobacteraceae RepID=C9RCY2_AMMDK Length = 266 Score = 41.9 bits (97), Expect = 0.015, Method: Composition-based stats. 
Identities = 43/235 (18%), Positives = 73/235 (31%), Gaps = 33/235 (14%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + + L GP + + + + FP R+ L+G + A LG +RD LSE+ Sbjct: 1 MVNVAVFDLVGPFAHFRKYYTNSSSLSYAFPPRTALMGTVAAVLGWERDSYYEKLGLSEA 60 Query: 61 VQFAV-------RCDELILDDRRVS--VTGLRDYHTVLGAREDYRGLKSHETIQT--WRE 109 +FAV R + + R + LR V G + L T + +R Sbjct: 61 -RFAVVIKVPVRRLIQTVNYIRTKEEDLNRLRKLEAVKGTQVPLELLLPGGTASSLCFRV 119 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y A + L + + YLG LT + P Sbjct: 120 Y-------FAHRDDQVTRELAERLAAG--RSYFPLYLG-----LTEFIAQARLVDFKPPD 165 Query: 170 ALLN-------YEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK 217 ++ + + D + G R R+ Y+ + Sbjct: 166 EIIPAGQEVELHSVLAADYLRRPVLRGEVALNRERAPQSFGAGRKLMPPRSYIYE 220 >UniRef50_A5D0Y3 Putative uncharacterized protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D0Y3_PELTS Length = 242 Score = 41.5 bits (96), Expect = 0.021, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 68/220 (30%), Gaps = 21/220 (9%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 L+ + G + + QP T T FP R + GLL + LG+ DD + L E Sbjct: 4 QVLVFSIKGSLAHFRQPDTTATHATYPFPPRPTIHGLLASVLGLDFDDEAGAAFLHEEHF 63 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETI--QTWREYLCDASFTVAL 120 + + + +V H TI YL Sbjct: 64 VGLSLLKPVR-----TVCAQMSMHGKGFTGGGGDSFNRLTTIELVVSPHYL-------VY 111 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ----ASDPQKALLNYEP 176 + + + + Y YLG C LT P+F G A ++ L Sbjct: 112 YTGSRLGELAERIRTG--QSVYHTYLGSAYC-LTFPVFHGLYPLLEVAPGEEEPLPCSSV 168 Query: 177 VGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVI 216 V + E V AR P + +F R VI Sbjct: 169 VPQGVIQEILVEPGGNYAVARALPYRHVGGRFFERTLNVI 208 >UniRef50_D1A6P6 Metal dependent phosphohydrolase n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A6P6_THECD Length = 1027 Score = 40.8 bits (94), Expect = 0.038, Method: Composition-based stats. 
Identities = 27/146 (18%), Positives = 42/146 (28%), Gaps = 26/146 (17%) Query: 8 RLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRC 67 +L P+ ++ P F G P S L G+L A G +E V V Sbjct: 14 QLYAPVASFRDPMFPGVTRCLPVPPPSTLRGMLAAATGRP----------AEPVVLGV-- 61 Query: 68 DELILDDRRVSVTGLRDYH--TVLGAREDYRGLKSH---ETIQTWREYLCDASFTVALWL 122 YH G+ G R +L T+ + + Sbjct: 62 ----CAYAEGRGVDTETYHPIAADGSNPAIGGRVRPGKGGMTIRERPFLTGVHITLWVPM 117 Query: 123 TPHATMVISELEKAVLKPRYTPYLGR 148 + A+ +P + LGR Sbjct: 118 PDG-----ERIATALRRPTWGLRLGR 138 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46898 Uncharacterized protein ygcI n=13 Tax=Proteobact... 278 7e-74 UniRef50_B7KJ26 CRISPR-associated protein Cas5 family n=1 Tax=Cy... 223 4e-57 UniRef50_D1NTI1 CRISPR-associated protein Cas5 n=1 Tax=Bifidobac... 220 2e-56 UniRef50_D1CGD4 CRISPR-associated protein Cas5 family n=6 Tax=Ba... 211 2e-53 UniRef50_Q12YA8 CRISPR-associated protein, CT1976-like n=1 Tax=M... 209 5e-53 UniRef50_Q314I4 CRISPR-associated protein, CT1976 n=1 Tax=Desulf... 209 7e-53 UniRef50_C2BET8 CRISPR-associated protein n=2 Tax=Firmicutes Rep... 205 8e-52 UniRef50_A3EQA4 CRISPR-ssociated protein, Cas5 n=3 Tax=Bacteria ... 205 8e-52 UniRef50_B0TDU1 Crispr-associated protein cas5 n=1 Tax=Heliobact... 199 5e-50 UniRef50_B8HWH8 CRISPR-associated protein Cas5 family n=1 Tax=Cy... 199 8e-50 UniRef50_Q04QB7 Putative uncharacterized protein n=2 Tax=Leptosp... 198 1e-49 UniRef50_D0Y918 CRISPR-associated protein Cas5 family n=2 Tax=De... 198 1e-49 UniRef50_Q0W584 Putative uncharacterized protein n=1 Tax=uncultu... 197 3e-49 UniRef50_D0WFC8 CRISPR-associated protein Cas5 n=1 Tax=Slackia e... 197 3e-49 UniRef50_B4S8P8 CRISPR-associated protein Cas5 family n=8 Tax=Ba... 195 8e-49 UniRef50_A1ARH6 CRISPR-associated protein, Cas5e family n=2 Tax=... 
194 3e-48 UniRef50_C9M2Y8 CRISPR-associated protein n=1 Tax=Lactobacillus ... 193 4e-48 UniRef50_Q5YRB6 Putative uncharacterized protein n=1 Tax=Nocardi... 192 9e-48 UniRef50_D2RB02 CRISPR system CASCADE complex protein CasD n=3 T... 192 9e-48 UniRef50_A8SDR7 Putative uncharacterized protein n=1 Tax=Faecali... 191 1e-47 UniRef50_B2GBJ9 Putative uncharacterized protein n=1 Tax=Lactoba... 191 2e-47 UniRef50_A7BA63 Putative uncharacterized protein n=1 Tax=Actinom... 191 2e-47 UniRef50_Q2JWC5 CRISPR-associated protein Cas5, Ecoli subtype n=... 190 3e-47 UniRef50_Q1EQS9 CRISPR-associated protein n=3 Tax=Streptomyces R... 190 3e-47 UniRef50_A5GBK2 CRISPR-associated protein Cas5 family n=2 Tax=De... 189 5e-47 UniRef50_UPI0001AF1D4C CRISPR-associated protein, CT1976 n=1 Tax... 189 7e-47 UniRef50_C7MQD6 CRISPR-associated protein Cas5 n=1 Tax=Saccharom... 189 8e-47 UniRef50_A5UR14 CRISPR-associated protein, Cas5e family n=1 Tax=... 188 1e-46 UniRef50_B1VIY0 CRISPR-associated protein n=9 Tax=Actinomycetale... 186 4e-46 UniRef50_B0S4B6 Putative uncharacterized protein n=1 Tax=Finegol... 185 7e-46 UniRef50_Q1R114 CRISPR-associated protein, CT1976 n=1 Tax=Chromo... 185 8e-46 UniRef50_A6W168 CRISPR-associated protein Cas5 family n=6 Tax=Ga... 185 1e-45 UniRef50_Q47PI7 CRISPR-associated protein, Cas5e family n=12 Tax... 184 2e-45 UniRef50_A8M404 CRISPR-associated protein Cas5 family n=1 Tax=Sa... 184 3e-45 UniRef50_D1A6Q5 CRISPR-associated protein Cas5 family n=2 Tax=Ac... 183 4e-45 UniRef50_C5SD48 CRISPR-associated protein Cas5 family n=1 Tax=Al... 182 6e-45 UniRef50_C7LYW6 CRISPR-associated protein Cas5 family n=1 Tax=Ac... 182 6e-45 UniRef50_A1SV73 CRISPR-associated protein, Cas5e family n=2 Tax=... 182 6e-45 UniRef50_Q03C60 CRISPR-associated protein n=4 Tax=Lactobacillus ... 180 3e-44 UniRef50_C8XAY4 CRISPR-associated protein Cas5 family n=2 Tax=Ac... 180 3e-44 UniRef50_A8LYZ7 CRISPR-associated protein Cas5 family n=2 Tax=Ac... 
180 4e-44
UniRef50_Q2RY19 CRISPR-associated protein, Cas5e family n=1 Tax=...    179   7e-44
UniRef50_C7MTB0 CRISPR-associated protein Cas5 n=1 Tax=Saccharom...    178   1e-43
UniRef50_D2L2X8 CRISPR-associated protein Cas5 family n=1 Tax=De...    178   1e-43
UniRef50_A8ZZ17 CRISPR-associated protein Cas5 family n=1 Tax=De...    178   1e-43
UniRef50_D1CAJ0 CRISPR-associated protein Cas5 family n=1 Tax=Sp...    178   1e-43
UniRef50_Q2JH27 CRISPR-associated protein, CT1976 n=6 Tax=Actino...    176   4e-43
UniRef50_B3E5U9 CRISPR-associated protein Cas5 family n=2 Tax=De...    175   8e-43
UniRef50_D0MET6 CRISPR-associated protein Cas5 family n=1 Tax=Rh...    172   7e-42
UniRef50_Q1J367 CRISPR-associated protein, CT1976 n=1 Tax=Deinoc...    171   2e-41
UniRef50_C9M9R7 CRISPR-associated protein Cas5, Ecoli subtype n=...    170   3e-41
UniRef50_D1Y486 Crispr-associated protein Cas5 n=1 Tax=Pyramidob...    170   4e-41
UniRef50_B8IZA7 CRISPR-associated protein Cas5 family n=1 Tax=De...    169   5e-41
UniRef50_B4UE71 CRISPR-associated protein Cas5 family n=2 Tax=An...    169   6e-41
UniRef50_B8GIV3 CRISPR-associated protein Cas5 family n=1 Tax=Me...    166   4e-40
UniRef50_Q0AA33 CRISPR-associated protein Cas5 family n=2 Tax=Ga...    166   4e-40
UniRef50_B8FDI0 CRISPR-associated protein Cas5 family n=3 Tax=Ba...    166   5e-40
UniRef50_Q2FNU0 CRISPR-associated protein, CT1976 n=1 Tax=Methan...    158   9e-38
UniRef50_UPI0001B51C2B CRISPR-associated Cas5 family protein n=1...    158   2e-37
UniRef50_Q0BSC7 Putative uncharacterized protein n=1 Tax=Granuli...    157   4e-37
UniRef50_C5V9N1 CRISPR-associated protein Cas5 n=1 Tax=Corynebac...    155   9e-37
UniRef50_B4TTX2 CRISPR-associated protein Cas5 n=15 Tax=Enteroba...    153   3e-36
UniRef50_A9HLC6 CRISPR-associated protein Cas5 family n=11 Tax=A...    152   8e-36
UniRef50_B6B783 CRISPR-associated protein Cas5, Ecoli subtype n=...    150   4e-35
UniRef50_B5GY62 Crispr-associated protein (Fragment) n=1 Tax=Str...    145   7e-34
UniRef50_C2KP44 Putative uncharacterized protein n=1 Tax=Mobilun...
145 8e-34
UniRef50_B8IMR2 CRISPR-associated protein Cas5 family n=1 Tax=Me...    142   6e-33
UniRef50_Q2RXJ5 CRISPR-associated protein, Cas5e family n=5 Tax=...    142   1e-32
UniRef50_Q6NEQ9 Putative uncharacterized protein n=1 Tax=Coryneb...    138   1e-31
UniRef50_B5F422 CRISPR-associated protein Cas5 n=59 Tax=Enteroba...    136   6e-31
UniRef50_B6XT64 Putative uncharacterized protein n=2 Tax=Bifidob...    136   7e-31
UniRef50_B6IWM3 CRISPR-associated protein, CT1976 family n=1 Tax...    135   1e-30
UniRef50_C2GEY8 CRISPR-associated protein n=1 Tax=Corynebacteriu...    134   2e-30
UniRef50_C0W6U0 CRISPR-associated Cas5 family protein n=1 Tax=Ac...    133   5e-30
UniRef50_B8IJS9 CRISPR-associated protein Cas5 family n=1 Tax=Me...    129   5e-29
UniRef50_Q3AA65 CRISPR-associated protein Cas5, Hmari subtype n=...    118   1e-25
UniRef50_D1N0K0 Putative uncharacterized protein n=1 Tax=Victiva...    115   2e-24
UniRef50_A0LM54 Putative uncharacterized protein n=1 Tax=Syntrop...    103   5e-21
UniRef50_UPI0000F51765 hypothetical protein Faci_00030 n=1 Tax=F...    103   5e-21
UniRef50_B0K553 CRISPR-associated protein Cas5, Hmari subtype n=...     95   2e-18
UniRef50_B5IGN1 CRISPR-associated protein Cas5 n=1 Tax=Acidulipr...     94   3e-18
UniRef50_Q1AZD3 CRISPR-associated protein, Cas5h family n=1 Tax=...     91   3e-17
UniRef50_B1I5P1 CRISPR-associated protein Cas5 n=2 Tax=Clostridi...     78   3e-13

Sequences not found previously or not previously below threshold:

UniRef50_D1CHV1 CRISPR-associated protein Cas5 n=1 Tax=Thermobac...     63   9e-09
UniRef50_A5D0Y3 Putative uncharacterized protein n=1 Tax=Pelotom...     59   9e-08
UniRef50_A3DHS3 CRISPR-associated protein Cas5 n=3 Tax=Clostridi...     58   3e-07
UniRef50_D1B1G1 CRISPR-associated protein Cas5 n=1 Tax=Sulfurosp...     58   3e-07
UniRef50_A7ZDW7 Crispr-associated protein Cas5 n=3 Tax=Campyloba...     57   4e-07
UniRef50_A7HLK6 CRISPR-associated protein Cas5, Hmari subtype n=...     57   5e-07
UniRef50_C7P9L3 CRISPR-associated protein Cas5 n=2 Tax=Methanoca...
57 6e-07
UniRef50_D0MJ68 CRISPR-associated protein Cas5 n=1 Tax=Rhodother...     55   2e-06
UniRef50_B7R550 CRISPR-associated protein Cas5 n=1 Tax=Thermococ...     55   3e-06
UniRef50_C9RCY2 CRISPR-associated protein Cas5, Hmari subtype n=...     54   3e-06
UniRef50_B5YBH8 CRISPR-associated protein Cas5, Hmari subtype n=...     53   8e-06
UniRef50_C6QNG6 CRISPR-associated protein Cas5, Hmari subtype n=...     53   8e-06
UniRef50_O27159 Putative uncharacterized protein n=1 Tax=Methano...     53   9e-06
UniRef50_D1A6P6 Metal dependent phosphohydrolase n=1 Tax=Thermom...     52   2e-05
UniRef50_A4J1X9 CRISPR-associated protein Cas5, Hmari subtype n=...     52   2e-05
UniRef50_O57910 Putative uncharacterized protein PH0171 n=1 Tax=...     51   3e-05
UniRef50_Q2NH81 Putative uncharacterized protein n=1 Tax=Methano...     50   4e-05
UniRef50_B1LAM2 CRISPR-associated protein Cas5, Hmari subtype n=...     50   5e-05
UniRef50_D2QT47 CRISPR-associated protein Cas5 n=1 Tax=Spirosoma...     49   1e-04
UniRef50_A3XI99 Putative uncharacterized protein n=1 Tax=Leeuwen...     49   1e-04
UniRef50_C7NNV3 CRISPR-associated protein Cas5, Hmari subtype n=...     48   2e-04
UniRef50_C8PE23 CRISPR-associated protein Cas5 n=1 Tax=Campyloba...     48   2e-04
UniRef50_B9MPU1 CRISPR-associated protein Cas5, Hmari subtype n=...     47   4e-04
UniRef50_B1B9K0 Crispr-associated protein Cas5, tneap subtype n=...     47   4e-04
UniRef50_A3DKB9 CRISPR-associated protein Cas5, Hmari subtype n=...     47   5e-04
UniRef50_C8VZL1 CRISPR-associated protein Cas5 n=1 Tax=Desulfoto...     46   8e-04
UniRef50_B1I4M5 CRISPR-associated protein Cas5 family n=1 Tax=Ca...     46   8e-04
UniRef50_Q3M7D6 Fruiting body developmental protein S-like prote...     46   0.001
UniRef50_C1DUR1 Crispr-associated protein Cas5, hmari subtype n=...     46   0.001
UniRef50_A7BQN4 Protein containing DUF522 n=1 Tax=Beggiatoa sp. ...     46   0.001
UniRef50_B8CYA3 CRISPR-associated protein Cas5 n=1 Tax=Halotherm...     45   0.002
UniRef50_Q8PZD2 Putative uncharacterized protein n=1 Tax=Methano...
45 0.002 UniRef50_A1ZVP9 Crispr-associated protein Cas5, tneap subtype n=... 44 0.003 UniRef50_A8F3P0 CRISPR-associated protein Cas5, Hmari subtype n=... 44 0.004 UniRef50_A4XGC7 CRISPR-associated protein Cas5 family n=1 Tax=Ca... 43 0.008 UniRef50_D1QSA9 CRISPR-associated protein Cas5, Tneap subtype n=... 41 0.026 UniRef50_B9LWL0 CRISPR-associated protein Cas5, Hmari subtype n=... 41 0.041 UniRef50_A1HND7 CRISPR-associated protein Cas5 family n=1 Tax=Th... 40 0.063 >UniRef50_Q46898 Uncharacterized protein ygcI n=13 Tax=Proteobacteria RepID=YGCI_ECOLI Length = 224 Score = 278 bits (712), Expect = 7e-74, Method: Composition-based stats. Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES Sbjct: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL Sbjct: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD Sbjct: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 Query: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ 224 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ Sbjct: 181 IYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ 224 >UniRef50_B7KJ26 CRISPR-associated protein Cas5 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ26_CYAP7 Length = 215 Score = 223 bits (568), Expect = 4e-57, Method: Composition-based stats. 
Identities = 67/228 (29%), Positives = 111/228 (48%), Gaps = 17/228 (7%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M L+LRLAGP+Q+WG+ + R T PT+SG++GL+ A +GI RD+ L L++ Sbjct: 1 MMKTLLLRLAGPLQSWGRGSRFDFRDTDTIPTKSGVIGLVAAAMGINRDNQVELAKLAQ- 59 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 ++ V ++ + DYHTV+G G IQ++R+YLC+A F V L Sbjct: 60 LRMGVCVEK--------EGKLVVDYHTVIGTIH-ADGKPHKAPIQSYRQYLCNAEFLVGL 110 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY--EPVG 178 + +++E+E + P++ +LGR++CP + P+F+ + AL Y + Sbjct: 111 ESS--EYHLLNEIEHYLCFPKWELFLGRKACPPSKPIFVDLLTN-SLEDALYQYAISHLK 167 Query: 179 GDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK--GGMDVSQ 224 Y + D P+ R F++R K DVS+ Sbjct: 168 KGTYRLLIESKEPTGALRLDVPIDFKKRIFSARTVITPKPLEVSDVSE 215 >UniRef50_D1NTI1 CRISPR-associated protein Cas5 n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NTI1_9BIFI Length = 250 Score = 220 bits (561), Expect = 2e-56, Method: Composition-based stats. Identities = 66/226 (29%), Positives = 101/226 (44%), Gaps = 28/226 (12%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S LILRLAGPMQ+WG + R T PT+S ++GLL + G +R+D S++ L + Sbjct: 1 MSVLILRLAGPMQSWGDSSRFNRRETRTEPTKSAVIGLLASAQGRRRED--SIEDLL-GL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 +F VR D+ +RD+ T G S T R YL DA F VA+ Sbjct: 58 RFGVRSDQ--------PGRIMRDFQTEKSIARKKSGEFSLTMPLTHRYYLADAKFLVAIE 109 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 ++ L+ A+ P++ +LGRRSCP P+ LG ++ ++AL + Sbjct: 110 ---GERSLLESLDAALRNPQWPLFLGRRSCPPASPVSLGVKDYANVEEALDKEPWIASPW 166 Query: 182 YSE------------ESVTGHHLKFTARDEPMITL--PRQFASREW 213 Y + ++V D P+ R++A R Sbjct: 167 YRKKVHDSKRLQVVVDAVENGETTGQQSDMPLSFSQKHRRYAQRPV 212 >UniRef50_D1CGD4 CRISPR-associated protein Cas5 family n=6 Tax=Bacteria RepID=D1CGD4_THET1 Length = 230 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. 
Identities = 68/226 (30%), Positives = 106/226 (46%), Gaps = 25/226 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L++RL+GPMQ+WG + R TGR P++SG++GL+ A LG R T+ + L + Sbjct: 1 MPTLLMRLSGPMQSWGTQSRFTVRDTGREPSKSGVIGLICAALGRPR--TAPVDDLV-RL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHET-----IQTWREYLCDASF 116 + VR D +RDYHT GA R + T +Q+ R YL DASF Sbjct: 58 RMGVRVDR--------EGIVMRDYHTAGGAPAGERYGVATVTGDQRPVQSSRYYLADASF 109 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT-------CQASDPQK 169 VAL ++ ++++A+ PR+ +LGR+SC + P+ L + D + Sbjct: 110 LVALEGGEEDRPLLEQIDEALRAPRWQLFLGRKSCVPSEPIHLPKEPPLGPPIREEDLRT 169 Query: 170 ALLNYEPVGGDIYSEESVTGHHLKFTARDEPMIT--LPRQFASREW 213 AL++Y G + D P+ RQ+A+R Sbjct: 170 ALISYPWPEGAHRLRFVFEDPEGEELRNDVPISFEIGNRQYAARFV 215 >UniRef50_Q12YA8 CRISPR-associated protein, CT1976-like n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YA8_METBU Length = 244 Score = 209 bits (533), Expect = 5e-53, Method: Composition-based stats. Identities = 63/244 (25%), Positives = 100/244 (40%), Gaps = 38/244 (15%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YLI RL GPM +WG RPT P++S + GL+ A LGI+RD+ LS + Sbjct: 1 MKEYLIFRLYGPMASWGDIAVGQHRPTYDHPSKSAIFGLIAAALGIRRDEEERHLELSNA 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVL-------------GAREDYRGLKSH--ETIQ 105 + + LRDYHT R+D + T+ Sbjct: 61 YSYGTLI--------NSAGKLLRDYHTSQVPSAGTGRNRKTFATRKDELAVPKEELNTVL 112 Query: 106 TWREYLCDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 + R+Y CD +TV L + L ++ +P + YLGR+SCPL P+ A Sbjct: 113 STRDYYCDGVYTVILSCKTDTPPYSLELLGNSLKEPSFCLYLGRKSCPLALPINPKIVSA 172 Query: 165 SDPQKALLNYEPVGGDIYSEESVTGHHL--------------KFTARDEPMITLPRQFAS 210 S+ ++AL + +P + + + + + D+ + QF+ Sbjct: 173 SNIKEALQSVDPGEEGLVKKIEMKSPYRLYWDDPKESMTCEHTISRYDKLLSRKRWQFSK 232 Query: 211 REWY 214 R Y Sbjct: 233 RNEY 236 >UniRef50_Q314I4 CRISPR-associated protein, CT1976 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. 
G20 RepID=Q314I4_DESDG Length = 245 Score = 209 bits (531), Expect = 7e-53, Method: Composition-based stats. Identities = 79/246 (32%), Positives = 108/246 (43%), Gaps = 33/246 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YL ++ GP+QA+G RPT PTRS +LG+L A +GI+RD+ + L L + Sbjct: 1 MAQYLTFQIYGPLQAYGTVAVGEIRPTSTMPTRSAVLGILAAAIGIRRDEETRLAELRDG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGA----------REDYRGLKSHETIQTWREY 110 + AVR D + DYHT+ R D L TI + REY Sbjct: 61 YRVAVRED--------APGKVMLDYHTIQTPGARGKRQLHCRRDELLLTEPNTILSRREY 112 Query: 111 LCDASFTVALWLTPHA-TMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 L DA FTV LW H + E+ +A+ PR+T LGR+SCP P + PQ+ Sbjct: 113 LMDALFTVCLWQANHTVPYSLQEIARALRSPRWTIGLGRKSCPPALPFAPKITDHTTPQE 172 Query: 170 ALLNYE-----------PVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREW---YV 215 A+ Y P + + + T RD P+ RQFA R+ V Sbjct: 173 AVAAYPADKLVSAGLRSPQVMRMLLDTEGPHTDTETTVRDVPLHHGRRQFAERKVRELLV 232 Query: 216 IKGGMD 221 K D Sbjct: 233 RKAATD 238 >UniRef50_C2BET8 CRISPR-associated protein n=2 Tax=Firmicutes RepID=C2BET8_9FIRM Length = 244 Score = 205 bits (522), Expect = 8e-52, Method: Composition-based stats. 
Identities = 61/223 (27%), Positives = 101/223 (45%), Gaps = 29/223 (13%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 ++L+L GPMQ+WG + TR + +P++SG++G++ A G +RD+ +Q L++ + Sbjct: 5 KTILLKLTGPMQSWGTSSRFETRTSDYYPSKSGVIGIIAASFGYERDEDEKIQKLND-LD 63 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWL 122 FAVR D+ +DYH R+ + T T R Y+ DA F VA+ Sbjct: 64 FAVRVDQ--------EGVLKKDYHIA---RKVKPNGELERTYVTNRYYMEDAVFVVAI-- 110 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIY 182 + + E+ + + P + P++GRRSCPL LGT + P +AL N + D + Sbjct: 111 SHEDDKWMEEILQGLKYPYFQPFMGRRSCPLPARFILGTNE-EGPIEALENLDWQAADWF 169 Query: 183 SEE--------------SVTGHHLKFTARDEPMITLPRQFASR 211 ++ H R R+F R Sbjct: 170 KKKNKNYRADIYADKDLLPENSHTIRNDRVVSFSQKERKFGPR 212 >UniRef50_A3EQA4 CRISPR-ssociated protein, Cas5 n=3 Tax=Bacteria RepID=A3EQA4_9BACT Length = 227 Score = 205 bits (522), Expect = 8e-52, Method: Composition-based stats. Identities = 60/222 (27%), Positives = 94/222 (42%), Gaps = 22/222 (9%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L++RL PMQ+WG + R TG+ P++SG++GLL A LGI R++ L+ L+ + Sbjct: 1 MPTLLIRLVSPMQSWGTSSRFDQRDTGKEPSKSGVIGLLAAALGIDRNNWDDLEPLA-GL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 VR D RDY T K H T T+REYL DA F V Sbjct: 60 SMGVRHDR--------PGIPRRDYQTASKII-SADHSKIHPTAVTYREYLADAVFLVGFE 110 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 ++ ++ A+ P + +LGR+S + P+++ + P + L + P Sbjct: 111 SA--EVSLLEKINSALKNPVWPLFLGRKSYVPSEPIWIENGLKNVPLREALEHFPWIACR 168 Query: 182 YSEESVTGHH---------LKFTARDEPM-ITLPRQFASREW 213 E + D+P+ R+F SR Sbjct: 169 RRNERLPEKLVITFESEDGTGVLKMDQPLSSFAKRRFGSRFV 210 >UniRef50_B0TDU1 Crispr-associated protein cas5 n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TDU1_HELMI Length = 230 Score = 199 bits (507), Expect = 5e-50, Method: Composition-based stats. 
Identities = 66/229 (28%), Positives = 101/229 (44%), Gaps = 23/229 (10%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 L LRL GP+Q+WG + R + PT+SG++GLLG LG R+D L++L +++ Sbjct: 5 ILALRLEGPLQSWGSRSRWDYRDSALEPTKSGIIGLLGCALGWSRNDK-RLESLDAALRL 63 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSH-----ETIQTWREYLCDASFTV 118 VR D+ T L D+HTV G G + T+ + R YL +ASF Sbjct: 64 TVRIDK--------PGTPLIDFHTVQGYLLMAEGKQKKSGNDMYTVVSRRVYLQEASFLA 115 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS--DPQKALLNYEP 176 L A + + +KA+ P + +LGR+SCP PLF + D +A+ + Sbjct: 116 LLTGEQGA---LHQCKKALNDPVWPVFLGRKSCPPARPLFDSFYEGDFRDVLEAMRSIPW 172 Query: 177 ----VGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 G I + + +D I R + R V +D Sbjct: 173 SSAPAAGPIRLRYVMEDEGGREWRQDVLRINGARMYGRRRVSVGWVDLD 221 >UniRef50_B8HWH8 CRISPR-associated protein Cas5 family n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HWH8_CYAP4 Length = 216 Score = 199 bits (505), Expect = 8e-50, Method: Composition-based stats. Identities = 52/218 (23%), Positives = 90/218 (41%), Gaps = 22/218 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L+LR+ PM +WG + R + R PT+S ++GLL A LG R ++ L+ ++ Sbjct: 1 MPTLLLRMRAPMMSWGDHSRFTIRDSRREPTKSAVIGLLCAALGRPR--WEAVADLT-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + VR ++ L DYHTV + + T+ + R Y+ DA + V L Sbjct: 58 KMGVRINQEGLVQC--------DYHTVQDSIKSS--GSKGNTVISHRYYIADADYLVGLE 107 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 + + L+ A+ P + Y GR+S + P+ L +AL + + + Sbjct: 108 GS--DRHFLESLDSALQSPIWQVYFGRKSFVPSCPVALHVSDQP-LAEALKHRITLSKTM 164 Query: 182 YSEE------SVTGHHLKFTARDEPMITLPRQFASREW 213 + + +D P+ R F SR Sbjct: 165 AHKLPNRLRCVLEVPDSLDVRQDVPLDWQKRHFGSRCV 202 >UniRef50_Q04QB7 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB7_LEPBJ Length = 247 Score = 198 bits (504), Expect = 1e-49, Method: Composition-based stats. 
Identities = 61/220 (27%), Positives = 93/220 (42%), Gaps = 18/220 (8%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ RL GP+ +WG RP+ FPT+S ++GL+ A G R + + L +S Sbjct: 1 MKDYLVFRLYGPLVSWGNIAVGEYRPSDSFPTKSAIIGLISASFGFDRSEDGKISELVKS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSH----------ETIQTWREY 110 V FA + LRDYHT+ R L + ETI + R+Y Sbjct: 61 VFFATKTLN--------PGNLLRDYHTIQSPGNVKRSLLTRKDELLDSEYVETILSSRDY 112 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKA 170 DA + VAL A + E+ A+L P +TPYLGR+SC + P+ + A Sbjct: 113 RVDAVYDVALSEKKRAPYSLKEIRNALLSPIHTPYLGRKSCSIALPMCPEILSSDSFPNA 172 Query: 171 LLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFAS 210 Y + Y +++ ++ L Sbjct: 173 FEEYNKILMKKYESSDYKDPLADLSSKSSAILYLWEDPTE 212 >UniRef50_D0Y918 CRISPR-associated protein Cas5 family n=2 Tax=Dehalococcoides RepID=D0Y918_9CHLR Length = 205 Score = 198 bits (504), Expect = 1e-49, Method: Composition-based stats. Identities = 64/213 (30%), Positives = 105/213 (49%), Gaps = 19/213 (8%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 L++RL GPMQ+WG + R T PTRSG++GL+ A +GI RD+ A + ++ Sbjct: 6 TLLMRLEGPMQSWGYRSRFDCRDTALEPTRSGVIGLICAAMGIARDEDI---ARFDGIRM 62 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLT 123 VR D ++ DYHT L +T+ ++R+YL DASFTV L + Sbjct: 63 GVRVDRDGKVEQ--------DYHTALDVI--KADGSGKDTVVSYRDYLTDASFTVGLESS 112 Query: 124 PHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYS 183 ++ ++ KA++ P++ +LGR++ PLT P + S+P K E + + Sbjct: 113 --DRNLLEKIAKALVSPQWVLFLGRKAFPLTKPP---IFEFSNPVKPGSLEEHLLCGASA 167 Query: 184 EES-VTGHHLKFTARDEPMITLPRQFASREWYV 215 + + + T D P+ R+F R + V Sbjct: 168 KRVLLESPDGERTQYDWPLCFGERRFKPRRFTV 200 >UniRef50_Q0W584 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W584_UNCMA Length = 227 Score = 197 bits (501), Expect = 3e-49, Method: Composition-based stats. 
Identities = 61/213 (28%), Positives = 90/213 (42%), Gaps = 20/213 (9%) Query: 10 AGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDE 69 GPMQ+WG R TG PT+SG++GLLG LG R D L ++ +R + Sbjct: 12 EGPMQSWGLKARWDIRDTGDEPTKSGIIGLLGCALGYARKDPRLTDELDSQLRIGIRVE- 70 Query: 70 LILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMV 129 RDYHTV G G TI ++R+YL DA+F V L + Sbjct: 71 -------CPGEIARDYHTVSGELRTAEGKLRETTIVSFRDYLQDAAFLVVLEGPGE---L 120 Query: 130 ISELEKAVLKPRYTPYLGRRSCPLTHPLFLG-TCQASDPQKALLNYEPVGGDIYSEES-- 186 ++ + A+ P + YLGR+SCP T P+F T + AL + G + + ++ Sbjct: 121 LTRISNALKDPVWPIYLGRKSCPPTRPVFETLTTDYASIDDALSRHPWSSGTMEARKAHP 180 Query: 187 ------VTGHHLKFTARDEPMITLPRQFASREW 213 V + D + R + R Sbjct: 181 KELKCIVEDLSGPYQRTDRMTKSPARMYGIRHV 213 >UniRef50_D0WFC8 CRISPR-associated protein Cas5 n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC8_9ACTN Length = 267 Score = 197 bits (500), Expect = 3e-49, Method: Composition-based stats. Identities = 62/246 (25%), Positives = 105/246 (42%), Gaps = 43/246 (17%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+L+LA P+Q+WG + R T P++SG++GLL A LG +R+D S+ L+ ++ Sbjct: 4 MTVLLLKLAAPLQSWGASSRFTERTTRHEPSKSGVIGLLAAALGRRRED--SVDDLA-AL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLK---SHETIQTWREYLCDASFTV 118 +FAVR D+ + +RD+ T + D + + + R+YL DA F Sbjct: 61 RFAVRIDQ--------PGSFMRDFQTEHTRKWDSDTRRFVFNESLSLSKRDYLSDAVFVA 112 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVG 178 AL +++E +A+ P + +LGRRSCP + ++LG +AL + Sbjct: 113 ALE---GDEDLLAECAEALHHPAFPLFLGRRSCPPSTQVYLGLVDGP-MMEALADIPWQA 168 Query: 179 GDI-----YSEESVTGHHLK------------------FTARDEPMITL--PRQFASREW 213 + Y T RD P+ R++ R Sbjct: 169 TERHWNYAYRFRGDKPKETVELKVVYEATEEQAACTLSETVRDVPLSFSQLKREYGWRTE 228 Query: 214 YVIKGG 219 I+ Sbjct: 229 TTIRMA 234 >UniRef50_B4S8P8 CRISPR-associated protein Cas5 family n=8 Tax=Bacteria RepID=B4S8P8_PROA2 Length = 243 Score = 195 bits (496), Expect = 8e-49, Method: Composition-based stats. 
Identities = 55/241 (22%), Positives = 88/241 (36%), Gaps = 26/241 (10%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 YL+L L P+Q+WG + G R T FPT+SG+LG+L LG + L+ ++ Q Sbjct: 5 YLLLWLEAPLQSWGADSRFGRRGTLEFPTKSGVLGMLCCSLGAGGEQKELLEKMAPLKQS 64 Query: 64 AVRCDELILDDR-----RVSVTGLRDYHTVLGAREDYRGLK--------------SHETI 104 A+ + LRD+H V +D + + + Sbjct: 65 AISFCRTSKFRQEEIKKLDREPLLRDFHMVGSGYDDKNPWETLLIPKKSDGTTAVNGGSK 124 Query: 105 QTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 T+R YL DA F V + + + A+ P + Y GR+ C T ++ G Sbjct: 125 ITYRYYLQDAVFAVIMEVPSEKLTLF---ADALENPCWDIYFGRKCCAPTDFIYRGCFNT 181 Query: 165 SDP-QKALLNYEPVGGDIYSEESVTGHHLK--FTARDEPMITLPRQ-FASREWYVIKGGM 220 L + V G H D P+ ++ + R VI Sbjct: 182 ESLAIGKALEIAQEKRLMEDFRVVDGEHEGEAIVLNDVPIQFGEQKLYRERRVTVISCAN 241 Query: 221 D 221 + Sbjct: 242 E 242 >UniRef50_A1ARH6 CRISPR-associated protein, Cas5e family n=2 Tax=Bacteria RepID=A1ARH6_PELPD Length = 232 Score = 194 bits (492), Expect = 3e-48, Method: Composition-based stats. Identities = 59/228 (25%), Positives = 92/228 (40%), Gaps = 28/228 (12%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L+LRL GPMQ+WG + R TG+ P++SG++GLL A LGI R++ L+ L+ + Sbjct: 1 MPTLLLRLVGPMQSWGTTSRFDQRDTGKEPSKSGVVGLLAAALGIDRENWVDLEPLT-CL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVL-----GAREDYRGLKSHETIQTWREYLCDASF 116 VR D RDY T + + + R YL DA+F Sbjct: 60 AMGVRHDR--------PGVPKRDYQTAGCASTDTIIKADGTQAKGGGVVSQRFYLADAAF 111 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL--GTCQASDPQKALLNY 174 V L ++ + A+ P +T LGR+S + +++ G A + L Y Sbjct: 112 LVGLEC--DDNCLLERIHVALHNPFWTLALGRKSYVPSESIWIVDGVRDAP-LLETLKRY 168 Query: 175 EPVGGDIYSEESVT--------GHHLKFTARDEPM-ITLPRQFASREW 213 + EE D+P+ R+F +R Sbjct: 169 PWIASSRSREEPPERLLVSIESDDGAGVLKMDQPLSSFAERRFGARFV 216 >UniRef50_C9M2Y8 CRISPR-associated protein n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9M2Y8_LACHE Length = 241 Score = 193 bits (491), Expect = 4e-48, Method: Composition-based stats. 
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 16/200 (8%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 +RL P+Q++G R + +P++S ++G++ A LG +RDD LQ ++ Sbjct: 1 MKTATIRLTAPLQSYGNQASFNQRTSDNYPSKSAVIGIIAAALGYRRDDARILQ--LNNL 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 FAVR ++ S + ++ TV E + T+RE++ DA F VA+ Sbjct: 59 LFAVRIEQ--------SGNMMTEFQTV----EYQKSSTKTARKLTYREFIQDAVFMVAIG 106 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 I ++ A+ P++ YLGRRS P PL + T +P + L Sbjct: 107 SDND--HEIEKIVSALKHPKFQLYLGRRSNPPAGPLMIETYDEENPLQVLEKLSWQAEPW 164 Query: 182 YSEESVTGHHLKFTARDEPM 201 Y + L D + Sbjct: 165 YQKRLRAPKFLTRIIADAEL 184 >UniRef50_Q5YRB6 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5YRB6_NOCFA Length = 235 Score = 192 bits (487), Expect = 9e-48, Method: Composition-based stats. Identities = 62/231 (26%), Positives = 98/231 (42%), Gaps = 36/231 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRLA P+Q+WG + R T ++P++SG+LGL+ A G +R D ++ +++ Sbjct: 1 MTVLLLRLAAPLQSWGVASRFARRETQQYPSKSGILGLIAAARGHRRTD--PIEEALQNL 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 F VR D+ +RD+ L + + + R YL DA F A+ Sbjct: 59 AFGVRVDQ--------PGRLIRDFQVALNIDKTKQ------FPLSQRYYLADAVFLAAI- 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 +I + A+ +P + YLGRRSCP+T PL LG + + AL E Sbjct: 104 --QGERGLIEGIGNALRRPEFPLYLGRRSCPVTGPLVLGEPRDVTLEHALHETEWQAATW 161 Query: 182 YSE---------------ESVTGHHLKFTARDEPMITLP--RQFASREWYV 215 Y L+ RD P+ P R++ R Sbjct: 162 YRRSQHRRVRLPIYRDLLPGDPVELLREQVRDMPLSFDPVRREYGWRTVVE 212 >UniRef50_D2RB02 CRISPR system CASCADE complex protein CasD n=3 Tax=Actinobacteria (class) RepID=D2RB02_GARVA Length = 291 Score = 192 bits (487), Expect = 9e-48, Method: Composition-based stats. 
Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 17/187 (9%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR--DDTSSLQALSE 59 L+L+ +GP+Q+WG + TR T +P++S ++G++ A G +R D ++ L++ Sbjct: 1 MKSLLLKFSGPLQSWGTDSHFETRHTDYYPSKSAVIGMIAAAFGYRRSTDCDENIAKLND 60 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 + FAVR D+ LRDYH + T R YL DA F VA Sbjct: 61 -LDFAVRIDQ--------QGNLLRDYHIAAKY---KANGDFEKNYVTNRYYLEDAIFLVA 108 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG 179 + + +I ++ A+ P + LGRRS P T LG + +ALL +E + Sbjct: 109 I--GSNNEQLIYDISNALRSPYFQSSLGRRSLPPTADFILGV-EDCGVIQALLTHEWLAN 165 Query: 180 DIYSEES 186 + Sbjct: 166 KWSKKRF 172 >UniRef50_A8SDR7 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SDR7_9FIRM Length = 220 Score = 191 bits (486), Expect = 1e-47, Method: Composition-based stats. Identities = 74/221 (33%), Positives = 108/221 (48%), Gaps = 24/221 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRLA P+QAWG + TR TGR PT+SG++GLL A LG++RD++ +L L+ + Sbjct: 1 MATLLLRLAAPLQAWGADSKFETRKTGREPTKSGVIGLLAAALGLRRDESEALTRLT-GL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 +F VR + L DYHT + + T+R YL DA F + Sbjct: 60 RFGVRVER--------EGQLLVDYHTA-------KTQDEKTSYVTYRHYLQDAVFLAGIE 104 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 T T ++ +L++A+L P + YLGRR CP T PL LG C Q+ L P+ Sbjct: 105 ST--DTALLQQLQQALLHPAFPLYLGRRCCPPTLPLCLGVCPG-SLQEVLQAEPPLCPGR 161 Query: 182 YSE---ESVTGHHLKFTARDEPMITLP--RQFASREWYVIK 217 S ++ RD P+ P RQ+ R + Sbjct: 162 QSRILLDADPLEPGTAPQRDVPVSFDPHHRQYGYRSVRELW 202 >UniRef50_B2GBJ9 Putative uncharacterized protein n=1 Tax=Lactobacillus fermentum IFO 3956 RepID=B2GBJ9_LACF3 Length = 235 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. 
Identities = 54/206 (26%), Positives = 81/206 (39%), Gaps = 23/206 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR--DDTSSLQALSE 59 L++R+A P+Q++G P R T R P++S ++G++GA LG +R DD SL L Sbjct: 1 MKTLVIRIAAPLQSYGDPASFEKRTTFRAPSKSAVIGMIGAALGFRRESDDYKSLNDLD- 59 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 FAVR D+ L D+ + R YL DA F VA Sbjct: 60 ---FAVRVDQ--------PGEVLSDFQITH-------YSLKKPGKLSHRIYLQDAVFMVA 101 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG 179 L A ++ E+E A+ P++ Y GRRS P L + C L Sbjct: 102 LSSKQDA--LMEEIEYALRHPKFQLYFGRRSNPPAGILKMKMCPDKTAINVLKELPWQAS 159 Query: 180 DIYSEESVTGHHLKFTARDEPMITLP 205 + + D ++ Sbjct: 160 VWFQRKYKKDVFNARIYADAKLVPDR 185 >UniRef50_A7BA63 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA63_9ACTO Length = 242 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 68/245 (27%), Positives = 103/245 (42%), Gaps = 44/245 (17%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+LRLAGPMQ+WG + R T FPT+S L+GLLGA G +R D ++ L+E Sbjct: 1 MSAVLVLRLAGPMQSWGADSRFTRRSTEAFPTKSALVGLLGAAQGRRRSD--PIEDLAE- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 + AVR D+ L D+HT + R Y DA+F + Sbjct: 58 LSVAVRVDQ--------PGQLLHDFHTAHRG--------DTSMPLSHRFYRADAAFGAFI 101 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 +I L +A+++P + YLGRRSCP T PL L + S A+ + Sbjct: 102 EGPDD---MIDALAQAIVRPVFPLYLGRRSCPPTLPLRLAVREGSAW-DAVRETPWMAST 157 Query: 181 IYSEESVTGHHLKF-------------------TARDEPMITL--PRQFASREWYVIKGG 219 Y ++ H ++ T +D P+ R++ R Sbjct: 158 YYQKKQRHDHFVRMRVVADLGIIPPEVEKVAQQTLQDMPISFDSENRKYTLRTVEETYID 217 Query: 220 MDVSQ 224 ++ Q Sbjct: 218 LENPQ 222 >UniRef50_Q2JWC5 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JWC5_SYNJA Length = 207 Score = 190 bits (483), Expect = 3e-47, Method: Composition-based stats. 
Identities = 51/217 (23%), Positives = 90/217 (41%), Gaps = 31/217 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L++RL PM +WG + R + R PT+S ++G+L A LG R + L+ ++ Sbjct: 1 MTTLLMRLRAPMMSWGDHSQFDYRDSRREPTKSAVIGILCAALGRPR--WEPVDDLA-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + VR ++ +D+HTV ET + R Y+ D + V L Sbjct: 58 KMGVRVNK--------EGILCKDFHTVQ----------IKET-ISNRYYVADGDYLVGLE 98 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 ++ L++A+ KP + +LGR+S + PL +G + +AL + Sbjct: 99 ---GDPNLLRTLDQALQKPYWQVFLGRKSFIPSRPLRVGLVEQP-LLEALRQHPYECSRR 154 Query: 182 YSEE-----SVTGHHLKFTARDEPMITLPRQFASREW 213 + +D P+ PR+F R Sbjct: 155 GKRPSQLRFVLEVSESLDVRQDVPLSWQPRRFGCRAV 191 >UniRef50_Q1EQS9 CRISPR-associated protein n=3 Tax=Streptomyces RepID=Q1EQS9_STRKN Length = 280 Score = 190 bits (482), Expect = 3e-47, Method: Composition-based stats. Identities = 70/241 (29%), Positives = 107/241 (44%), Gaps = 39/241 (16%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRL+GP+Q+WG+ + R T RFPTRSG++G+L A LG +R + L+ + Sbjct: 14 LLLRLSGPLQSWGERSHFNERDTARFPTRSGIIGMLAAALGRRRG--EPVDDLA-RLSLT 70 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS---------HETIQTWREYLCDAS 115 VR D LRD HTV G + + T+ T R YL DA+ Sbjct: 71 VRTDR--------PGILLRDLHTVGGGLPAKATVTTAEGKKRPGTTGTLLTHRTYLADAA 122 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS------DPQK 169 FT+AL TP ++ + +A+ P + +LGRRSCP PL LG + + P Sbjct: 123 FTIALTSTPDDRPLLDQAAQALNTPCWPLFLGRRSCPPEGPLLLGASEDALHHLVHLPLA 182 Query: 170 ALLNYEPVGGDIYSEESV-------------TGHHLKFTARDEPMITLPRQFASREWYVI 216 A + ++ + G H D+P+ PR+ + R + Sbjct: 183 AHPGRGQQDTEFLADRPLNRLPYGTATPVGADGTHPSGEVNDQPLSFDPRRRSYRARPLY 242 Query: 217 K 217 + Sbjct: 243 R 243 >UniRef50_A5GBK2 CRISPR-associated protein Cas5 family n=2 Tax=Deltaproteobacteria RepID=A5GBK2_GEOUR Length = 231 Score = 189 bits (481), Expect = 5e-47, Method: Composition-based stats. 
Identities = 61/225 (27%), Positives = 98/225 (43%), Gaps = 10/225 (4%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE-- 59 +S+L LRL GP+Q+WG + R TG PT+S + G+ A LG R + L Sbjct: 10 KSFLALRLEGPLQSWGFDSQYNRRNTGLMPTKSAIAGMCCAALGFLRGCDKEQEFLVAFG 69 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 +V+ + + V L+DYHTV R G +++ + T R+YL DA+F V Sbjct: 70 AVRMTAIAIPRNGAKKELPVRRLQDYHTVQNTRR-ASGAINNDCVLTHRQYLTDAAFGVL 128 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG 179 L + ++ ++ A+ P + +LGR++C T P+ G + D LL + Sbjct: 129 LE---GDSTLLKQIAAALENPVWGVWLGRKTCIPTAPVLAGLRENRDEALKLLLKDKPLE 185 Query: 180 DIYSEESVT---GHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 +E V T R F+ R ++ G D Sbjct: 186 SFARQEDVESFADGRDSLPDMPVSFATERRIFSPRRVRTLQ-GTD 229 >UniRef50_UPI0001AF1D4C CRISPR-associated protein, CT1976 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF1D4C Length = 244 Score = 189 bits (480), Expect = 7e-47, Method: Composition-based stats. Identities = 59/240 (24%), Positives = 89/240 (37%), Gaps = 41/240 (17%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L++ PMQ+WG + +R T PT+SG++GLL A LGI RD +Q L+E + Sbjct: 1 MATLLMCFDAPMQSWGTRSQFASRDTATEPTKSGVVGLLAAALGIPRDADEEIQNLAE-L 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + VR D + + D+HTV K+H T T R YL DA F V + Sbjct: 60 RMGVRVDREGVVEA--------DFHTVQNVPNTE--GKNHRTAVTKRFYLADALFLVGVE 109 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL-FLGT------CQASDPQKALLNY 174 T ++ +L A+ PR+ Y GR++ P+ G AL + Sbjct: 110 --SDDTQLLHQLHTALTAPRWPLYFGRKAFVPARPIPSPGLAGEHHPVTGQSLDDALRTH 167 Query: 175 EPVGGD-----------------IYSEESVTGHHL--KFTARDEPMITLP--RQFASREW 213 + D P+ R +A R Sbjct: 168 PWLENQLRIHANRRTTAYAPSPAWLRTIVDADPLALDVELRHDHPLSFTQQDRSYAPRAV 227 >UniRef50_C7MQD6 CRISPR-associated protein Cas5 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MQD6_SACVD Length = 236 Score = 189 bits (479), Expect = 8e-47, Method: Composition-based 
stats. Identities = 71/219 (32%), Positives = 104/219 (47%), Gaps = 20/219 (9%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+L L GPMQAWG + R T +PTRSG++G++ A LG Q D SL LS + Sbjct: 1 MTTLVLHLDGPMQAWGHASQWDHRDTLDYPTRSGVIGMIAAALGKQWGD--SLDDLSP-L 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGARE----DYRGLKSHETIQTWREYLCDASFT 117 +F +R D + DYHT G E +G + + R Y+ DA++T Sbjct: 58 RFTIRIDR--------PGRRIVDYHTAGGGYEVGIARVKGGNRAHAVLSDRFYMSDAAYT 109 Query: 118 VALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK-ALLNYEP 176 VA+ ++ ++ A+ P + P+LGRRSCP P LG + L +P Sbjct: 110 VAIT---GPDTLLYRVDDALRAPVFGPFLGRRSCPPAGPWHLGLHDGDPLRTLPLHRDKP 166 Query: 177 VGGDIYSEESVTGHHLKF-TARDEPMITLPRQFASREWY 214 GD + E V+ H T R + +T P +F R Y Sbjct: 167 RDGDTVAVEFVSDHETHGPTDRVDTTLTDPHEFGPRRSY 205 >UniRef50_A5UR14 CRISPR-associated protein, Cas5e family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UR14_ROSS1 Length = 262 Score = 188 bits (478), Expect = 1e-46, Method: Composition-based stats. Identities = 75/259 (28%), Positives = 102/259 (39%), Gaps = 55/259 (21%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L LRL GP+Q+WG G R T PT+SG++GLLG LG++RDD + L+ LS+++ Sbjct: 1 MNTLFLRLEGPLQSWGLRARWGERDTTDAPTKSGVIGLLGCALGLRRDD-ARLRDLSDNL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS--------------------- 100 + VR D + +RDYHT G R Sbjct: 60 RMGVRVD--------LPGILMRDYHTTGGGRYSTIASTGGPRYHDEPYIGGVLSAEVTKG 111 Query: 101 ------------HETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGR 148 ET + R YL DASF VAL +P I EL A+ P + +LGR Sbjct: 112 RIKVKINQKTGEPETDVSERYYLADASFLVALQGSPD---YIGELATAIQSPVWPLFLGR 168 Query: 149 RSCPLTHPLFLGTCQASDPQKALLNY---EPVGGDIYSEESVTGHHLKFT-------ARD 198 ++C + P+F GT Q + AL N V L T D Sbjct: 169 KACVPSTPIFAGTGQFDILEDALKNLLLSPRVETAWQRSRPTQLRLLIETGPGAGNRQYD 228 Query: 199 EPMITLPRQFASREWYVIK 217 R F +R Sbjct: 229 NIGTPSRRVFRARYVRETW 247 >UniRef50_B1VIY0 CRISPR-associated protein n=9 Tax=Actinomycetales RepID=B1VIY0_CORU7 Length = 240 Score = 186 bits 
(473), Expect = 4e-46, Method: Composition-based stats. Identities = 61/234 (26%), Positives = 100/234 (42%), Gaps = 33/234 (14%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M L+L L GPMQ+WG + R T PT+SG++GL+ A G +R D ++ L++ Sbjct: 1 MAHSLLLLLKGPMQSWGDESRFSVRATATTPTKSGIVGLIAAAQGRRRTD--GVEDLAK- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 ++ AVR D+ S + LRDY T + R +L DA+F A+ Sbjct: 58 LRMAVRVDQ--------SGSLLRDYQTAQ----PWLKNPGANASLVTRYFLSDAAFVAAV 105 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGD 180 ++ ++ +A+ +P Y Y+GRRSCP+ L +G D + AL ++ Sbjct: 106 E--SEDRELLDQMAEALRRPAYPLYMGRRSCPVHPGLVIGVVDG-DAESALRAHDTWHAT 162 Query: 181 IYSEESV-------------TGHHLKFTARDEPMITLP--RQFASREWYVIKGG 219 + G +D P+ P R++ RE + Sbjct: 163 AVHRKESPKKVSLAIYRDANPGEGGSVPRQDVPVSFSPEHRKYGWREVIRAEDV 216 >UniRef50_B0S4B6 Putative uncharacterized protein n=1 Tax=Finegoldia magna ATCC 29328 RepID=B0S4B6_FINM2 Length = 228 Score = 185 bits (471), Expect = 7e-46, Method: Composition-based stats. 
Identities = 60/239 (25%), Positives = 107/239 (44%), Gaps = 33/239 (13%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S ++L+ A P+Q+WG R T +PT+S ++GL+ A G ++ DT S++ L+ S+ Sbjct: 1 MSVILLKFASPLQSWGGLANYEIRNTEYYPTKSAVIGLVAAAFGYKKTDTESIKRLN-SL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTW---REYLCDASFTV 118 F+VR D+ + +RD+ + Y + +++ + Y+ DA F + Sbjct: 60 NFSVRIDQ--------KGSLIRDFQIAMEYNPKYMPNDPNYFVKSNLIQKYYIQDAKFLI 111 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVG 178 AL + ++ ++ A+ P Y +LGR+S P+ +G ++ + + +YE + Sbjct: 112 AL--SSDDETLMEDVYNALESPAYQLFLGRKSNPINADYLIGKFDGNEL-EIIKDYEWLA 168 Query: 179 GDIYSE--------------ESVTGHHLKFTARD--EPMITLPRQFASR--EWYVIKGG 219 Y + TG K RD E R F+SR YV K G Sbjct: 169 SKWYKKSIKKDSVELSIFSDYIDTGSKEKLIRRDLTESFENTKRDFSSRFEYRYVTKVG 227 >UniRef50_Q1R114 CRISPR-associated protein, CT1976 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R114_CHRSD Length = 260 Score = 185 bits (470), Expect = 8e-46, Method: Composition-based stats. Identities = 66/184 (35%), Positives = 92/184 (50%), Gaps = 20/184 (10%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M +L+ RL PM +WG+ RPT +P R +LGL+GA LGI+RDD L +S Sbjct: 1 MTGHLVFRLYAPMASWGEAAVGEARPTATYPGRGAILGLIGAALGIRRDDDEGQLRLRQS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYR------------GLKSHETIQTWR 108 + AV+ +R LRDYHTV + + TI + R Sbjct: 61 LGIAVK--------QRSPGWLLRDYHTVQVPPSQSKVNYRSRREELSVPKDALNTILSSR 112 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 +Y CD + VAL L P A + EL+ A+ +PR+T YLGR++CPL PL +A + Sbjct: 113 DYRCDGLWVVALRLMPDAVWTLDELKSALERPRFTLYLGRKACPLAAPLTPAIVEADHWR 172 Query: 169 KALL 172 AL Sbjct: 173 GALD 176 >UniRef50_A6W168 CRISPR-associated protein Cas5 family n=6 Tax=Gammaproteobacteria RepID=A6W168_MARMS Length = 258 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. 
Identities = 78/253 (30%), Positives = 109/253 (43%), Gaps = 43/253 (16%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ RL GPM +WGQP G R T PTRS +LGLLGA LGI+RDD L AL S Sbjct: 1 MKDYLVFRLYGPMASWGQPAVGGDRATAIAPTRSAILGLLGAALGIKRDDAQQLDALHSS 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRG-------------LKSHETIQTW 107 VQ A + + + LRDYHT + + + TI + Sbjct: 61 VQMATK--------QVTPTSLLRDYHTSQVPSRNNKYVYRTRKNELLDEHKEKLNTILST 112 Query: 108 REYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 R+Y CD + VA+ LT + + L++A++KP Y LGR+SCPL PL + Sbjct: 113 RDYRCDGIWIVAVSLTQESLFSLERLKQALIKPVYVLSLGRKSCPLAAPLLPVLLTSVSL 172 Query: 168 QKALL---------NYEPVGGDIYS-------------EESVTGHHLKFTARDEPMITLP 205 ++AL + + +E L D+P+ Sbjct: 173 REALDYPFPSIIDNDKPHLDAVWLRPNKLSTYTWEGSVDEFSGETALTTHPWDDPINRDR 232 Query: 206 RQFASREWYVIKG 218 QF R + I Sbjct: 233 WQFKQRTMHQITV 245 >UniRef50_Q47PI7 CRISPR-associated protein, Cas5e family n=12 Tax=Actinomycetales RepID=Q47PI7_THEFY Length = 245 Score = 184 bits (467), Expect = 2e-45, Method: Composition-based stats. 
Identities = 69/233 (29%), Positives = 99/233 (42%), Gaps = 36/233 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L L LAGP+QAWG + R T PT+SG+LGLL A G +R D L L+ ++ Sbjct: 1 MKVLTLLLAGPLQAWGAASRFTRRTTEHAPTKSGVLGLLAAAQGRERTDD--LSDLA-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 +F VR D+ T +RD+ T + + R YL DA F A+ Sbjct: 58 RFGVRVDQ--------RGTRIRDFQTAIHLD------TGKSMPVSERFYLADAVFVAAVE 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 +I L +AV P Y PYLGRRSCP + P+ LG ++ L + + Sbjct: 104 ---GEDTLIDTLHQAVQHPVYLPYLGRRSCPPSRPINLGVHSGKPLEQVLAEEKWHAANW 160 Query: 182 YSE--------------ESVTGHHLKFTARDEPMITLP--RQFASREWYVIKG 218 Y ++ G + RD P+ P R++A R + Sbjct: 161 YQRQLRDLPEVPLDLLVDAPPGDPGADSLRDLPISFDPVHRRYALRGVRTLTV 213 >UniRef50_A8M404 CRISPR-associated protein Cas5 family n=1 Tax=Salinispora arenicola CNS-205 RepID=A8M404_SALAI Length = 238 Score = 184 bits (466), Expect = 3e-45, Method: Composition-based stats. Identities = 65/229 (28%), Positives = 96/229 (41%), Gaps = 35/229 (15%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 L+LRLAGP+Q+WG + R T PT+SG++G+L A G++R D L L S+ F Sbjct: 2 LLLRLAGPLQSWGATSRFTHRHTQVTPTKSGVIGMLAAASGLRRTD--PLTELL-SLDFG 58 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 VR D+ LRD+ T R YL DA F VA+ Sbjct: 59 VRIDQ--------PGQLLRDFQVARTLDG------RDSMPLTNRYYLSDAVFLVAI---G 101 Query: 125 HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSE 184 ++ L ++V +P + YLGRR+CP P+ LG +AL ++ E Sbjct: 102 GDQALLEGLHESVRRPHFPLYLGRRACPPVAPISLGVHPG-TVDEALRDWPWQAAKRLRE 160 Query: 185 E------------SVTGHHLKFTARDEPMITLP--RQFASREWYVIKGG 219 + G + T D+P+ P RQ+ R + Sbjct: 161 RGELTVPLEVVSDAPPGADVTETLPDQPISFDPAHRQYGWRAVVRTRIV 209 >UniRef50_D1A6Q5 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=D1A6Q5_THECD Length = 273 Score = 183 bits (465), Expect = 4e-45, Method: Composition-based stats. 
Identities = 70/243 (28%), Positives = 102/243 (41%), Gaps = 44/243 (18%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+L L+GP+Q+WG+ + R T PTRSGL+G++ A G +R T + L ++ Sbjct: 6 TTGLLLHLSGPLQSWGERSRFNQRDTATAPTRSGLIGMIAAAFGRRR--TEPVTDL-RAL 62 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLG---------AREDYRGLKSHETIQTWREYLC 112 +F VR D T LRD+HTV G E R T+ + R YL Sbjct: 63 RFTVRIDR--------PGTLLRDFHTVGGGMPRDLTVITAEGKRRAADTATVTSDRYYLQ 114 Query: 113 DASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALL 172 DA+FTVA+ ++ +A+ PR+ YLGRRSCP PL L T +DP AL+ Sbjct: 115 DAAFTVAVTA--DDPALLDRCAQALRAPRWPLYLGRRSCPPNAPLLL-TVLRTDPVTALI 171 Query: 173 NYEP---------------------VGGDIYSEESVTGHHLKFTARDEPMITLPRQFASR 211 + S + + R+F +R Sbjct: 172 DLPLARTAPRDRGDVLVEFRSDTPFESRAWPSAPEDEQVYTEAQDEPVSFQPHHRRFQTR 231 Query: 212 EWY 214 Y Sbjct: 232 PIY 234 >UniRef50_C5SD48 CRISPR-associated protein Cas5 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD48_CHRVI Length = 227 Score = 182 bits (463), Expect = 6e-45, Method: Composition-based stats. 
Identities = 108/225 (48%), Positives = 134/225 (59%), Gaps = 9/225 (4%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M SYLILRL GPMQAWG TFE RP+ FPTRSGLLGLLGACLG+ R DT SL AL+ES Sbjct: 1 MPSYLILRLDGPMQAWGTHTFEDYRPSNPFPTRSGLLGLLGACLGLDRSDTPSLDALAES 60 Query: 61 VQFAVRCDELILDD-----RRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 V F VR D T L DYHTVL AR+ G + IQ+ REYL DA+ Sbjct: 61 VAFTVRLDTGAPRPGVDRLMPKRHTKLSDYHTVLDARKV-DGSTNKFPIQSHREYLFDAA 119 Query: 116 FTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL--FLGTCQASDPQKALLN 173 F VA+ P A+ ++ + +++ +PR+TP LGRRSCPL PL +A D + AL Sbjct: 120 FAVAIGSRPDASFSLARIAESLRQPRFTPVLGRRSCPLGRPLLERPDCIEADDAKAALAQ 179 Query: 174 YEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKG 218 + P GG IYSE+ + + RD P RQFA+R Y+ + Sbjct: 180 FPPHGGLIYSEDELVSDQPTWI-RDVPRYGRHRQFATRRLYLHRE 223 >UniRef50_C7LYW6 CRISPR-associated protein Cas5 family n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LYW6_ACIFD Length = 253 Score = 182 bits (463), Expect = 6e-45, Method: Composition-based stats. 
Identities = 74/238 (31%), Positives = 95/238 (39%), Gaps = 41/238 (17%) Query: 2 RSYLILRLAGPMQAWGQPTF-EGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 S L LRL GP+QAWG + R T RFPT+SG++GLL A LG R ++L L + Sbjct: 1 MSVLALRLGGPLQAWGSSQRLDHYRRTERFPTKSGVIGLLAAALGRPRS--AALDDLG-A 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHE----------------TI 104 ++FAVR D LRD+HT+ +D + E T Sbjct: 58 LRFAVRIDR--------PGEVLRDFHTLSSLFDDKKRFAPGEGRLPTASGGYRSAATSTQ 109 Query: 105 QTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 T R YL DA F L + EL+ A+ P + YLGRRSCP PL LG Sbjct: 110 VTERFYLADACFVAGLE---GDAAQLQELDDALRTPVFPLYLGRRSCPPDKPLRLGVYDG 166 Query: 165 SDPQKALLNYEPVGGD-------IYSEESVTGHHLKFTARDEPMITLP--RQFASREW 213 L + D I E V D+ P R + R Sbjct: 167 -GLIDVLASIPWQANDPAQSATSIRCELVVENPAGDVELADQARSFDPLTRSYTRRRV 223 >UniRef50_A1SV73 CRISPR-associated protein, Cas5e family n=2 Tax=Gammaproteobacteria RepID=A1SV73_PSYIN Length = 217 Score = 182 bits (463), Expect = 6e-45, Method: Composition-based stats. Identities = 78/214 (36%), Positives = 110/214 (51%), Gaps = 15/214 (7%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LIL+ G M A+G TF+ R FPTRS ++G+LGA +GI R++ + L ALSE + Sbjct: 1 MKTLILKTEG-MSAYGLQTFDVHRRANHFPTRSAIMGILGAAMGITRENFNELYALSEQL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + AV+ +S + DYHTV R +G T+REY CD+ T A+ Sbjct: 60 KIAVQV--------NLSGEKMVDYHTVQHFRSP-QGKIQKGVKPTYREYWCDSEHTFAIS 110 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 A VI +L +V P +T + GR+SCPLT PLF +P AL N+ G I Sbjct: 111 A---AEHVIEKLVNSVKFPEFTLFQGRKSCPLTRPLFEAVTDDDNPANALKNHGEQ-GQI 166 Query: 182 YSEESVTGHHLKFTARDEPMITLPRQFASREWYV 215 +S+ S RD + +PR++A R YV Sbjct: 167 FSDISGDNQLAIVQVRDL-ITAIPRKYAMRTVYV 199 >UniRef50_Q03C60 CRISPR-associated protein n=4 Tax=Lactobacillus RepID=Q03C60_LACC3 Length = 236 Score = 180 bits (457), Expect = 3e-44, Method: Composition-based stats. 
Identities = 59/234 (25%), Positives = 90/234 (38%), Gaps = 37/234 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + +RL P+Q++G R TG +P++S ++G+L A LG QRDD ++ AL++ + Sbjct: 1 MKTISIRLTSPLQSYGNEAQFARRTTGDYPSKSAIIGMLAAALGYQRDD-PAINALNDLL 59 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 FAVR D+ + ++ T K T+R+ L DA F VA+ Sbjct: 60 -FAVRVDQ--------PGQVMTEFQTAE--------WKPGTRKLTYRDLLQDAVFVVAI- 101 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 + L +A+ PR+ YLGRR+ L + T DP L Sbjct: 102 -GSEDEAWLDRLAEALRHPRFQLYLGRRANVPAGVLKIQTFAGQDPVGVLAQLPWQASRW 160 Query: 182 YSEESVT---------------GHHLKFTARDE--PMITLPRQFASREWYVIKG 218 Y S RD RQ++ R V K Sbjct: 161 YQRRSRRQASMSVQLIADAVLLPERRADLVRDAVHSFDQKHRQYSFRPIAVTKV 214 >UniRef50_C8XAY4 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=C8XAY4_NAKMY Length = 252 Score = 180 bits (456), Expect = 3e-44, Method: Composition-based stats. Identities = 67/210 (31%), Positives = 92/210 (43%), Gaps = 20/210 (9%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L+LRL GPMQ+WG+ + R T P++S ++GLL A LG +R D +++ L+ + Sbjct: 1 MSVLVLRLTGPMQSWGERSRYARRETAAEPSKSAIVGLLAAALGRRRTD--AIEDLA-GL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 F VR D+ T LRD+ T + R YL DA F A+ Sbjct: 58 IFGVRVDQ--------PGTLLRDFQTARSLDGAR------TMPLSERYYLSDARFLAAVE 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDI 181 +I+ L A+ P + YLGRRSCP + P+ C S P A L EP Sbjct: 104 ---GPESLIAGLAGALRDPTFPLYLGRRSCPPSEPIAQQDCIRSGPLLAALFDEPWHATK 160 Query: 182 YSEESVTGHHLKFTARDEPMITLPRQFASR 211 V A D +P Q A R Sbjct: 161 SYRRRVADPARLSIAVDAAATEVPAQLAER 190 >UniRef50_A8LYZ7 CRISPR-associated protein Cas5 family n=2 Tax=Actinomycetales RepID=A8LYZ7_SALAI Length = 257 Score = 180 bits (456), Expect = 4e-44, Method: Composition-based stats. 
Identities = 77/243 (31%), Positives = 107/243 (44%), Gaps = 43/243 (17%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRLAGPMQ+WG + R TG PTRS ++G++ A G R L L+ V Sbjct: 1 MTGLLLRLAGPMQSWGDHSTFSVRDTGTVPTRSAMIGIIAAAQGRHRG--EPLGDLAP-V 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLK---------SHETIQTWREYLC 112 QF VR D T + D+HTV G R + TI + R YL Sbjct: 58 QFTVRVDR--------PGTVMSDFHTVGGGAPPERTVPTAEGKRRTAGAGTIVSRRFYLA 109 Query: 113 DASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKAL- 171 DA FTVA+ ++ ++ A+ P + PYLGRRSCP+ HP L + DP L Sbjct: 110 DAVFTVAVTGPDD---LVGQIHTALNNPVWGPYLGRRSCPVAHPF-LMSGPIPDPVGRLE 165 Query: 172 -----LNYEPVGGDIYSEESV------TGHHLKFTARDEPMITLP-------RQFASREW 213 P + + V G + T D PM +P R++ +R+ Sbjct: 166 HLPLNRRRPPGDEETVRVDFVTGAPHGDGSISRMTLNDVPMEPVPGSPDPRRRRYLTRQV 225 Query: 214 YVI 216 YV Sbjct: 226 YVT 228 >UniRef50_Q2RY19 CRISPR-associated protein, Cas5e family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY19_RHORT Length = 261 Score = 179 bits (454), Expect = 7e-44, Method: Composition-based stats. 
Identities = 70/216 (32%), Positives = 95/216 (43%), Gaps = 21/216 (9%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 R +L+ RL GPM AWG R T P +S +LGLL A LGI R D ++ +AL + Sbjct: 4 RDFLVFRLVGPMAAWGDIAVGERRGTWDVPAKSAILGLLAAGLGIDRADRTAHEALDRGL 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVL----------GAREDYRGLKSHETIQTWREYL 111 FAVR D LRDYHT R D T+ + R Y Sbjct: 64 GFAVRQDR--------PGRLLRDYHTAQAPKARKNARWSTRRDELNDDDLNTVLSDRLYR 115 Query: 112 CDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKA 170 +A T A+W + +L +A+L+PR+TPYLGR++CPL P QA A Sbjct: 116 TNAIATPAIWRRQGTEGPTLDQLTQALLRPRFTPYLGRKACPLGWPPRPRLLQADGLLAA 175 Query: 171 LLNYEPVGGDIYSE--ESVTGHHLKFTARDEPMITL 204 Y+ D + ++ G T R P+ Sbjct: 176 FDAYDSAEWDAARQFHKAYPGGWPGDTDRPTPVWFE 211 >UniRef50_C7MTB0 CRISPR-associated protein Cas5 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTB0_SACVD Length = 255 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. Identities = 73/234 (31%), Positives = 104/234 (44%), Gaps = 36/234 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L+LRLAGP+Q+WG+ + R T FPT SGLLGLL +G +R SL+ L+ ++ Sbjct: 1 MSGLLLRLAGPLQSWGERSTFDVRDTAGFPTHSGLLGLLACVMGRRRG--ESLEDLA-AL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLK---------SHETIQTWREYLC 112 F +R D T + DY T GA + T+QTWREYL Sbjct: 58 TFTIRVDR--------PGTRIIDYQTAGGALPPSMKVPTADGKGRPAGKGTVQTWREYLA 109 Query: 113 DASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALL 172 DA F VA+ + V+ ++ A+ P + PYLGRRSCP PL L DP L Sbjct: 110 DAVFVVAV---QGPSEVLDQVRHALRYPHWQPYLGRRSCPPDQPLLLDV-PVEDPVAELC 165 Query: 173 NYEPVGGDIYSE------------ESVTGHHLKFTARDEPMITLPRQFASREWY 214 P+ + + E G + ++ R ++ R + Sbjct: 166 TRVPLARRVGKDEETVPVDFIFPVERRDGVRSEIHDVPVAFTSVDRAYSPRPVW 219 >UniRef50_D2L2X8 CRISPR-associated protein Cas5 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X8_9DELT Length = 266 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. 
Identities = 70/249 (28%), Positives = 98/249 (39%), Gaps = 45/249 (18%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YLI +L G + A+G R + PTRS + GLL ACLGI+R + + L ALS Sbjct: 1 MARYLIFQLYGMLAAYGLVAVGEVRLSAGHPTRSAVFGLLAACLGIRRHEEARLAALSGG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL---------------KSHETIQ 105 AVR D T L DYHT+ E + + + T+ Sbjct: 61 YALAVRVD--------APGTSLLDYHTIQTPPEKSKRIYRTRADELGGLLGIDEPPYTVL 112 Query: 106 TWREYLCDASFTVALW------LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL 159 + R YLCDA FT L + L +A+ +P TPYLGR+SCP + P Sbjct: 113 SRRGYLCDAHFTACLTPAAAPPTDATPPHTLEALAEALRRPVLTPYLGRKSCPPSLPFHP 172 Query: 160 GTCQASDPQKALLNY------------EPVGGDIYSEESVTGHHLKFT----ARDEPMIT 203 + + AL +Y ++++E T RD + Sbjct: 173 RLGEYDSLEAALADYPLEKLAFPAGLKPHDPAVVFADEDEAITPATVTSRPLVRDRTVQH 232 Query: 204 LPRQFASRE 212 R F R Sbjct: 233 GRRLFEERR 241 >UniRef50_A8ZZ17 CRISPR-associated protein Cas5 family n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZ17_DESOH Length = 259 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. 
Identities = 67/259 (25%), Positives = 99/259 (38%), Gaps = 54/259 (20%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 YL+ RL GPM +WG+ TR T +P RS ++GL+ A LGI+R +T + QAL + Sbjct: 1 MGYLLFRLYGPMASWGEIAVGETRHTANYPGRSAIIGLMAAALGIKRSETENQQALDQGC 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVL----------GAREDYR--GLKSHETIQTWRE 109 AV + R + LRDYHT R D G TI + RE Sbjct: 61 LIAV--------EARSHGSLLRDYHTTQVPDSVGGFVYRTRRDELIIGKPRLGTILSSRE 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGT-CQASDPQ 168 Y DA A+ + P A + ++ + +PR YLGR+SCPL+ P+ + + Sbjct: 113 YRQDALAVSAVRVLPGARYELQTIKTHLEQPRLHVYLGRKSCPLSAPMNPQIDKTSRNFH 172 Query: 169 KALLNYEPVGGDIYSEESVTG---------------------------------HHLKFT 195 +A Y G Sbjct: 173 EAFQAYAHQPLLPVHHSGKEGLSKRDAYWLGLANDRHYYWEGEPSEFSDTIDLSRVQTRI 232 Query: 196 ARDEPMITLPRQFASREWY 214 D+P+ QF+ R+ + Sbjct: 233 RHDQPLSRTRWQFSPRQEH 251 >UniRef50_D1CAJ0 CRISPR-associated protein Cas5 family n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CAJ0_SPHTD Length = 245 Score = 178 bits (451), Expect = 1e-43, Method: Composition-based stats. 
Identities = 65/245 (26%), Positives = 97/245 (39%), Gaps = 40/245 (16%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 S L+LRL GPMQAWG + R TG P++SG++GLL A LG R + + L+ + Sbjct: 1 MSTLLLRLTGPMQAWGTQSRFSWRDTGLEPSKSGVIGLLCAALGRPRS--APVDDLA-RL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLG-AREDYRG-------LKSHETIQTWREYLCD 113 + VR D T D+HT G R G + + R YL D Sbjct: 58 RMGVRVDR--------EGTMHVDFHTAGGWHRRAEAGYGVPDPSGTARRPQISRRFYLAD 109 Query: 114 ASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP------ 167 A F V L ++ L++A+ PR+ +LGR+S P+ L P Sbjct: 110 ADFLVGLE---GDEELLVLLDRALAAPRWQLFLGRKSFVPAAPVRLPDTPPWGPGLRPEP 166 Query: 168 -QKALLNYEPVGGDIYSEESV-----------TGHHLKFTARDEPMITLPRQFASREWYV 215 + AL Y +G + + G D P+ R+F++R Sbjct: 167 LETALRTYPWLGYQLPHPRADAPDRLRLVLDAEGDDAADIRMDVPISFAERRFSTRAVRT 226 Query: 216 IKGGM 220 + Sbjct: 227 VWIAT 231 >UniRef50_Q2JH27 CRISPR-associated protein, CT1976 n=6 Tax=Actinomycetales RepID=Q2JH27_FRASC Length = 276 Score = 176 bits (447), Expect = 4e-43, Method: Composition-based stats. 
Identities = 72/255 (28%), Positives = 96/255 (37%), Gaps = 52/255 (20%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 R L+LRLAGP+Q+WG + R T PT+SG++GLL A G +R D ++ L S+ Sbjct: 7 RHCLVLRLAGPLQSWGSRSMFNRRDTLTEPTKSGIIGLLAAAQGRRRTD--PIEDLL-SL 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGARED-------------YRGLKSHETIQTWR 108 +R D+ T LRDYHTV R + T T R Sbjct: 64 TLGIRTDQ--------PGTLLRDYHTVSDYRGRPLPSAAVSAKGLQKPTSPAKHTHVTER 115 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASD-- 166 YL DA F AL V++ L A+ P + LGRR+CP THPL L D Sbjct: 116 FYLQDAVFVAALAAP---EPVLTTLADALRTPAFPLALGRRACPPTHPLLLVPDSEPDAA 172 Query: 167 -----PQKALLNYEPVGGDIYSEE----------------SVTGHHLKFTARDEPMITLP 205 + L + + +V D P P Sbjct: 173 LWSGSALEVLRQVPWQARPDHRDALARRRPPRLRRIDLPVTVDDPDGDDVRIDLPTTFDP 232 Query: 206 RQ--FASREWYVIKG 218 Q F SR + Sbjct: 233 HQRGFTSRRVHQSWV 247 >UniRef50_B3E5U9 CRISPR-associated protein Cas5 family n=2 Tax=Desulfuromonadales RepID=B3E5U9_GEOLS Length = 271 Score = 175 bits (445), Expect = 8e-43, Method: Composition-based stats. 
Identities = 78/249 (31%), Positives = 111/249 (44%), Gaps = 35/249 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 YL+ RL GP+ +WG+ +R + +P +S LLGL+ A LGI+RD+ AL+ Sbjct: 1 MKYLLFRLYGPLASWGEIAVGESRHSAVYPGKSALLGLIAAALGIRRDEEQRQAALASGY 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGA----------REDY--RGLKSHETIQTWRE 109 +FAV+ + LRDYHT R D G + TI + RE Sbjct: 61 RFAVKV--------ISTGHPLRDYHTAQAPDSVGKFVYRTRRDELVLGKERLGTILSSRE 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y CDA VA+ A + E+ +A++KPR+ YLGR+SCP+ PL A Sbjct: 113 YRCDAFSLVAVVAEDDAPYSLDEIREALMKPRFHLYLGRKSCPVAAPLNPLVRDAVGFGD 172 Query: 170 ALLNYE-------PVGGDIYSEESVTGHHL-------KFTARDEPMITLPRQFASREWYV 215 AL +Y +E V G L KF+ DE + +Q R ++ Sbjct: 173 ALDSYPYGALFVSSWLMKTAQKEIVEGGKLAEVPSLAKFSREDETVFAYNKQPV-RYYWE 231 Query: 216 IKGGMDVSQ 224 G VSQ Sbjct: 232 GDAGDLVSQ 240 >UniRef50_D0MET6 CRISPR-associated protein Cas5 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET6_RHOM4 Length = 252 Score = 172 bits (436), Expect = 7e-42, Method: Composition-based stats. 
Identities = 59/234 (25%), Positives = 95/234 (40%), Gaps = 27/234 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L+LR P+ ++G P + +P S + GLL LG +T+ L+ L E + Sbjct: 1 MEILLLRFDAPLMSFGAPIVDQYGFIQPYPALSMMTGLLANALGYTHAETARLERLQERL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTV---LGAREDYRGLKSHETIQTW----------- 107 ++AVR D LRD+ TV D R + T++T Sbjct: 61 RYAVREDRR--------GQQLRDFQTVDLSQPFLHDERAWTTRGTLETRQGGTASLGIHI 112 Query: 108 --REYLCDASFTVALWL-TPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA 164 R+Y DA +TVAL L P +++LE+A+ P ++GR+ C PLF+G +A Sbjct: 113 RLRDYWADAVYTVALTLDPPDEPPTLADLEQALRFPARPLFIGRKPCLPAAPLFIGRVEA 172 Query: 165 SDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKG 218 +D AL P+ + P+ + R+ R + Sbjct: 173 ADLLDALRR-APLDARADRADFYRVWWETGPDDPPPVEGI-RENLRRPVTDRRD 224 >UniRef50_Q1J367 CRISPR-associated protein, CT1976 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J367_DEIGD Length = 232 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 60/229 (26%), Positives = 90/229 (39%), Gaps = 25/229 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+LRL PMQAWG + R T P++SG+LGL A LGI R D+ A + Sbjct: 1 MATLLLRLVAPMQAWGTRSRFDDRDTEAEPSKSGVLGLCAAALGIDRADSVEHLA---RL 57 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 F VR D DYHT G T T R YL DA+F L Sbjct: 58 AFGVRVDR--------EGVAGTDYHTAQL----RPGNPRTRTDVTRRAYLADAAFWAGLE 105 Query: 122 LTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGG-- 179 ++++L+ A+ P + LGR++ P + P+ G AL + Sbjct: 106 ---GDAGLLTDLDAALHNPHWPLSLGRKAFPPSLPICAGPPLEVSLWDALRTAPSLRWRD 162 Query: 180 -----DIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVS 223 + + L+ A +P +R Y+ + + V+ Sbjct: 163 DDEPYRLVLDREAVPQPLRAAASPSRRQDVPDGPFARRRYLSRDVLTVT 211 >UniRef50_C9M9R7 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R7_9BACT Length = 243 Score = 170 bits (431), Expect = 3e-41, Method: Composition-based stats. 
Identities = 58/222 (26%), Positives = 90/222 (40%), Gaps = 19/222 (8%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 +L+LR GPM ++G + RP RFP S + GL+ LG +T LQAL + + Sbjct: 1 MDFLVLRFRGPMMSFGDVAVDEQRPIDRFPGVSMVTGLVANALGWDWSETEKLQALQDRL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHT----------VLGAREDYRGLKSHETIQTWREYL 111 AVR D + LR+Y T V + R L T+Q + Y Sbjct: 61 VLAVREDR--------AGERLREYQTVALPGKSGLFVTHSIPCSRNLDKPMTVQKYLSYW 112 Query: 112 CDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLG-TCQASDPQKA 170 ++ T + LT T + E+ A+ KP +LGR++C T P+F G A P++A Sbjct: 113 ANSLITCFIALTGLGTPTLDEIACALKKPARPLFLGRKTCLPTEPVFRGEIFGAESPEEA 172 Query: 171 LLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASRE 212 +L + + D + + R Sbjct: 173 VLRSCQLDKRPLKFVESPHSCVVEWPYDGRTVVSQSEIFERR 214 >UniRef50_D1Y486 Crispr-associated protein Cas5 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y486_9BACT Length = 269 Score = 170 bits (430), Expect = 4e-41, Method: Composition-based stats. Identities = 64/229 (27%), Positives = 93/229 (40%), Gaps = 36/229 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LILRL GP+ A+G + RPT P S + GL+ LG D LQ L E + Sbjct: 1 MDALILRLRGPLMAFGDVAVDEIRPTDLLPGLSEMTGLIANALGWTFQDVEKLQRLQERL 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYR------------GLKSHETIQTWRE 109 + A R D + LRDY T + ED G K T+Q +R Sbjct: 61 RLASREDR--------TGVPLRDYQTARLSSEDSLWRTDGIVAERGGGSKGEFTVQRYRH 112 Query: 110 YLCDASFTVALWLTP-HATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQ 168 Y DA+ TV + L P + E+ A+ P ++GR CP + P+ Sbjct: 113 YRADAAVTVLIALDPADEAPALEEIRDALRHPARPLFIGRIGCPPSQPIC---------- 162 Query: 169 KALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK 217 +EP G +I +S+ + T R R+ ASR +++ Sbjct: 163 -----FEPEGREIIHTDSLKDAIMHITPRAPLGAQTKREPASRNKVLVE 206 >UniRef50_B8IZA7 CRISPR-associated protein Cas5 family n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA7_DESDA Length = 249 Score = 169 bits (429), Expect = 5e-41, Method: Composition-based stats. 
Identities = 53/214 (24%), Positives = 86/214 (40%), Gaps = 13/214 (6%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE----- 59 L LRL PM ++G + R T P++S + G+L A G+ R L + Sbjct: 8 LALRLQAPMLSFGNESRFNRRCTASLPSKSVVAGMLCAAKGLHRGSVEEQAFLQQVAAIP 67 Query: 60 SVQFAV-RCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 + A+ RC D ++ D+HTV G R+ G + T R YL D+SF V Sbjct: 68 MLSVAIPRCLSANGKDWLLAAGRTVDFHTVQGTRKAAGG--IKDCHITTRHYLHDSSFAV 125 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTC--QASDPQKALLNYEP 176 L V+ + +A+ P + ++GR+ C + P+F G +A L Sbjct: 126 FL---NGPYRVLEDAARALQNPVWGLWIGRKCCIPSAPVFGGLFSSEAVALNHMLDAPLE 182 Query: 177 VGGDIYSEESVTGHHLKFTARDEPMITLPRQFAS 210 S + + E ++ R+FA Sbjct: 183 FFTHEREVHSFEDGNDTVPDQAESFLSAARRFAP 216 >UniRef50_B4UE71 CRISPR-associated protein Cas5 family n=2 Tax=Anaeromyxobacter RepID=B4UE71_ANASK Length = 246 Score = 169 bits (428), Expect = 6e-41, Method: Composition-based stats. Identities = 56/189 (29%), Positives = 81/189 (42%), Gaps = 6/189 (3%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M LILR P+ A+G + G FP S + GL+ LG D L+AL Sbjct: 1 MLDALILRFDAPLLAFGGVAVDNHGEVGDFPGLSMVAGLIANALGYDHRDCDRLEALQRR 60 Query: 61 VQFAVRCDELILDDRRVSV-----TGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 ++ AVR D L T GA E G S T +R Y DA Sbjct: 61 LRIAVRRDRSGQRLVDFQTVALGQPFLERGWTTRGAVEGRDGAFSDGTHIRYRAYWADAV 120 Query: 116 FTVALWLTPHAT-MVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY 174 +T+A+ L P A + +E+A+ +P +LGR++C + P+ G Q AL ++ Sbjct: 121 YTLAVTLDPPAESPGLDAVERALREPERPLFLGRKACLPSVPILAGRLQIPSLLAALASF 180 Query: 175 EPVGGDIYS 183 E V D + Sbjct: 181 ERVSKDRWE 189 >UniRef50_B8GIV3 CRISPR-associated protein Cas5 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV3_METPE Length = 257 Score = 166 bits (421), Expect = 4e-40, Method: Composition-based stats. 
Identities = 69/239 (28%), Positives = 94/239 (39%), Gaps = 37/239 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 YL L G M +WG RPT PTRS +LGLL A LGI+RD+ L AL+ + Sbjct: 4 PEYLTFSLYGMMASWGDIAVGEYRPTADHPTRSAVLGLLAAALGIRRDEEERLAALTRAY 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRG-------------LKSHETIQTWR 108 + A+R D LRDYHT +G + TI + R Sbjct: 64 KVAIRVD--------APGMLLRDYHTTQVPSAAKKGRQYLTRKDELAAPREVLNTILSTR 115 Query: 109 EYLCDASFTVALWLTPHATM-VISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 +Y CDA + V +W A + L + + P +T YLGR+SCPL P+ A D Sbjct: 116 DYRCDAVYRVYIWCRDTAPPYSLKTLAEHLQHPVFTLYLGRKSCPLALPVNPEVKTAPDL 175 Query: 168 QKALLNYEPVGGDIY------SEESVTGHH---------LKFTARDEPMITLPRQFASR 211 AL + + +V + DE + QF R Sbjct: 176 LTALSEEREIELRFLGRVPGNRDSNVRAYWDCTEGIQAEGTSVRNDESLNRRRWQFGRR 234 >UniRef50_Q0AA33 CRISPR-associated protein Cas5 family n=2 Tax=Gammaproteobacteria RepID=Q0AA33_ALHEH Length = 242 Score = 166 bits (421), Expect = 4e-40, Method: Composition-based stats. Identities = 53/232 (22%), Positives = 78/232 (33%), Gaps = 24/232 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LILRL P+ ++G + PT RFP RS L G+L LG DT +L +L + Sbjct: 1 MPCLILRLDAPLMSFGGVLVDQHNPTDRFPGRSMLTGMLANALGWHHQDTEALNSLQARI 60 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTV---------------LGAREDYRGLKSHETIQT 106 +A R D V LRDY TV G Q Sbjct: 61 SYAARWD--------VPPEPLRDYQTVDLGQTHLANPGWTTRGAPEHREGGTAKRGIHQR 112 Query: 107 WREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASD 166 R Y + TVA+ + P V + L A+ P ++GR++C P+ L ++ D Sbjct: 113 DRHYWANGVMTVAVTVPPGEPNV-ATLAAALRHPARPLFIGRKACLPAAPVLLRVRESDD 171 Query: 167 PQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKG 218 L + T + + + Sbjct: 172 AYHVLASEPRDPRAAADTRFFEACWPPGTTAPATESRQRQNRTDDRDWRTQA 223 >UniRef50_B8FDI0 CRISPR-associated protein Cas5 family n=3 Tax=Bacteria RepID=B8FDI0_DESAA Length = 240 Score = 166 bits (420), Expect = 5e-40, Method: Composition-based stats. 
Identities = 68/243 (27%), Positives = 99/243 (40%), Gaps = 33/243 (13%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 YL++ L P+Q+WG + G R T FPTRSG+LGLL LG + L L+ Q Sbjct: 4 RYLLMWLEAPLQSWGADSKFGRRDTLPFPTRSGVLGLLLCALGASGEQKELLARLAPYGQ 63 Query: 63 FAVRC------DELILDDRRVSVTGLRDYHTVLGAREDYRGLKS--------------HE 102 + C ++ LRD+H V A D + Sbjct: 64 TVISCAGGRPGRSGGSPEKIPRQPLLRDFHMVGSAYNDKDPWERLHIPKTNEGKPAVGGG 123 Query: 103 TIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTC 162 T+R YL DA F V L L P + + +A+ P + YLGR++C T ++ G Sbjct: 124 AKLTYRYYLQDARFAVILELPPD---LAEDFAQALQNPVWDIYLGRKNCAPTEFVYQGVF 180 Query: 163 ----QASDPQKALLNYEPVGGDIYSEESVTGHHL--KFTARDEPMITLP-RQFASREWYV 215 A D AL+ + + D V G H T D P+ P +++ R V Sbjct: 181 DSQKDAMDRAAALMEEKELMEDF---RVVDGEHPGEPITLNDVPLQFGPMKKYRDRRVTV 237 Query: 216 IKG 218 I+ Sbjct: 238 IRN 240 >UniRef50_Q2FNU0 CRISPR-associated protein, CT1976 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNU0_METHJ Length = 225 Score = 158 bits (401), Expect = 9e-38, Method: Composition-based stats. 
Identities = 51/214 (23%), Positives = 80/214 (37%), Gaps = 38/214 (17%) Query: 35 GLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLG---- 90 +LG++ A LGI+RDD + L F+V + ++D+HT+ Sbjct: 15 AVLGMVAAALGIRRDDEEAQNRLQAGYGFSVMVLQ--------PGIMIQDFHTIQSVHSS 66 Query: 91 ---------AREDYRGLKSHETIQTWREYLCDASFTVALWLTPHA--TMVISELEKAVLK 139 R D L ETI + REYLCD +W+ + E+ + Sbjct: 67 SLKKMNHVMTRRDEMNLGDSETILSRREYLCDHVSVACVWIRDAESAQFSLEEIAASFRN 126 Query: 140 PRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNY-----------EPVGGDIYSEE--- 185 P + YLGR+SCP P+ QA + AL+ + P +Y E+ Sbjct: 127 PVFCLYLGRKSCPPALPVHARVIQADSLKSALVQHIEGFDLLNGFRVPDRVSLYFEDGID 186 Query: 186 -SVTGHHLKFTARDEPMITLPRQFASREWYVIKG 218 + RD + QF+ R Y + Sbjct: 187 IGFDDPVMVMKRRDNILSRSRWQFSDRNEYYARI 220 >UniRef50_UPI0001B51C2B CRISPR-associated Cas5 family protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B51C2B Length = 278 Score = 158 bits (399), Expect = 2e-37, Method: Composition-based stats. Identities = 68/276 (24%), Positives = 100/276 (36%), Gaps = 72/276 (26%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M S L+LRLAGP+Q+WG R T PT+SG+ GL+ A LG+ R D L AL++ Sbjct: 1 MTSVLLLRLAGPLQSWGALARFDRRDTLNRPTKSGVTGLVAAALGLDRADD--LGALTD- 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLG-------------AREDYRGLKSHET---- 103 ++FAVR D T +RD+H V R + + ET Sbjct: 58 LRFAVRADR--------PGTAVRDFHIVGSGTYPLRPRDLITDHRRAEKAAAALETSTGP 109 Query: 104 --------------------------------------IQTWREYLCDASFTVALWLTPH 125 + T R YL DA+F A+ Sbjct: 110 VFGHLAARSVTKWYGAPKEIAPDPKTGVLLAGNTTRDAMMTTRWYLADAAFVAAVE--HP 167 Query: 126 ATMVISELEKAVLKPRYTPYLGRRSCPL----THPLFLGTCQASDPQKALLNYEPVGGDI 181 ++ + AV P+ +LGR+SCP + + GT + ALL Sbjct: 168 DQNLLHRISHAVEHPKRLLWLGRKSCPPSGTISGGVHPGTAETILTTTALLPNATSPQPW 227 Query: 182 YSEESVTGHHLKFTARDEPMITLPRQFASREWYVIK 217 E+ G D+P+ P + + Sbjct: 228 AWIEAAPGTPGAAQRTDQPVTYHPEHRTHTARWETR 263 >UniRef50_Q0BSC7 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC7_GRABC Length = 
225 Score = 157 bits (396), Expect = 4e-37, Method: Composition-based stats. Identities = 60/234 (25%), Positives = 93/234 (39%), Gaps = 41/234 (17%) Query: 1 MRS--YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQAL 57 M +L+ LA + + G+ R + +PTRS ++GL+GA LGI+RD D S+L L Sbjct: 1 MMQQPFLVFGLAASLGSMGELAGHERRGSLIWPTRSAIIGLMGAALGIERDGDFSALDVL 60 Query: 58 SESVQFAVRCDELILDDRRVSVTGLRDYHTVLGARED------------YRGLKSHETIQ 105 S V + LRDYHT+ T Sbjct: 61 SIDVAI------------FDAGAPLRDYHTIETIPSAAAKNPNSRPEALRDARGRTNTAI 108 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 T R+Y + +A+ + + A+L+P +T Y+GR+SCPL P +A Sbjct: 109 THRDYRTSVFYGIAVRGAG-----LERIVAALLEPHFTLYVGRKSCPLAAPTGAKIVEAV 163 Query: 166 DPQKALLNYEPVGGDIYSEESVT------GHHLKFTARDEPMITLPRQFASREW 213 + AL E + ++ +ESV D P+ FA+R Sbjct: 164 SAEAAL---EHLKAPLWRKESVKAHLLVTDDPEGEVVTDVPLDRSSWHFATRRV 214 >UniRef50_C5V9N1 CRISPR-associated protein Cas5 n=1 Tax=Corynebacterium matruchotii ATCC 14266 RepID=C5V9N1_9CORY Length = 223 Score = 155 bits (392), Expect = 9e-37, Method: Composition-based stats. 
Identities = 57/234 (24%), Positives = 87/234 (37%), Gaps = 41/234 (17%) Query: 1 MRSYLILRLAGPMQAWGQPTF-EGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M L +RLAGP+Q+W P T PTRSGL+GLL G R + Sbjct: 1 MTEALYIRLAGPLQSWAGPAITGNFVRTEPRPTRSGLVGLLAGACGYGRGEYPEWLT--- 57 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHT----------------VLGAREDYRGLKSHE- 102 + F +R D T + D+HT +G R + L S Sbjct: 58 QLHFQIREDNR--------GTLVDDFHTINPRDTEEEFRSRLLLAMGQRPTKKLLNSTPD 109 Query: 103 ----TIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 T T R Y+ D F V + + L + + +P + YLGR++ + P + Sbjct: 110 GQGLTAITERTYIADGEFIVQIKAGSREHQEL--LAEKLQQPHFVTYLGRKAFAPSFPFY 167 Query: 159 LGTCQASDPQKALLNYEPVGGDIYSE--ESVTGHHLKFTARDEPMITLPRQFAS 210 LG + P L VGG+ + +T P++ Q+ + Sbjct: 168 LG----AGPDDTLARIPTVGGEEPKKILRFYALDDYGYTTTTVPVVKDRNQWLT 217 >UniRef50_B4TTX2 CRISPR-associated protein Cas5 n=15 Tax=Enterobacteriaceae RepID=B4TTX2_SALSV Length = 241 Score = 153 bits (388), Expect = 3e-36, Method: Composition-based stats. Identities = 66/235 (28%), Positives = 95/235 (40%), Gaps = 33/235 (14%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M+ YL+ +L P+ +WG+ R + PTRS LLGLL A LGI+RD+ + L + Sbjct: 1 MKEYLVFQLYAPLASWGEEASGEIRHSATVPTRSALLGLLAAALGIRRDEEARLNNFNRH 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL------------KSHETIQTWR 108 AV LRDYHTV RE+ + T+ + R Sbjct: 61 YHLAVHAL-------ASQDRWLRDYHTVSAPRENKKYRYYTRRDELTLAPDEVGTLISQR 113 Query: 109 EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQA---S 165 EY CD + VA+ TP A +SEL +A+L P + YLGR+SCPL PL Sbjct: 114 EYRCDGYWHVAISATPDAPHSLSELREALLTPHFPLYLGRKSCPLALPLAARLMTGTLKE 173 Query: 166 DPQKALLNYEPVGGDIYSEESV----TGHHLKFTA-------RDEPMITLPRQFA 209 A+ ++ + ++P+ QF Sbjct: 174 VFTHAVEEISAAELSGFTLREGICYWDDPDEESLVWQQKQHSNNQPVSRQRWQFG 228 >UniRef50_A9HLC6 CRISPR-associated protein Cas5 family n=11 Tax=Acetobacteraceae RepID=A9HLC6_GLUDA Length = 260 Score = 152 bits (384), Expect = 8e-36, Method: Composition-based stats. 
Identities = 64/259 (24%), Positives = 90/259 (34%), Gaps = 52/259 (20%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M +L + PM ++G R P RS +LGL+ ACLG+ RDD + AL+ Sbjct: 1 MGQFLTFAMVAPMASFGAIAVGERRDGWDRPARSAVLGLMAACLGLTRDDEDAQAALAAD 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLG--AREDYRGLKSHE----------TIQTWR 108 A+ C L DYHT AR ++R E TI + R Sbjct: 61 YGLAILC--------HAPGKLLTDYHTAQAAPARRNWRPATRAEELAASPGDLATILSRR 112 Query: 109 EYLCDASFTVALWLTPH-ATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 +Y A+W + A + L+ A+ +P +TP LGRRSCP PL Sbjct: 113 DYRMGTWHLGAVWTSGKTARWSLEALQAAMREPVFTPSLGRRSCPAGLPLAPSVTDGVSA 172 Query: 168 QKALLNYEPVGGDIYSEESVTG-------------------------------HHLKFTA 196 LL+ G + + Sbjct: 173 AAVLLDRHRNGPEAGLRIRHDSFRRQFAGASRSGGLLLVLDAVDMAEHGGGHTPLRREIR 232 Query: 197 RDEPMITLPRQFASREWYV 215 RD+P+ QF RE V Sbjct: 233 RDQPLSRRRWQFGLREEAV 251 >UniRef50_B6B783 CRISPR-associated protein Cas5, Ecoli subtype n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B783_9RHOB Length = 232 Score = 150 bits (378), Expect = 4e-35, Method: Composition-based stats. 
Identities = 63/237 (26%), Positives = 87/237 (36%), Gaps = 39/237 (16%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD-DTSSLQALSE 59 M YLI +L + A G+ R + P RS ++G LGA +G++RD D S L AL Sbjct: 1 MPEYLIFQLVAAIGAMGEFGGHDRRGSLTLPGRSAVIGTLGAAMGLRRDADFSGLDALGV 60 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGARED------------YRGLKSHETIQTW 107 +V RDYHTV + T T Sbjct: 61 AVA------------SFGKTAPFRDYHTVQTVPSAAVKRPQSRPQALRDAGRKVNTTLTS 108 Query: 108 REYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDP 167 R+Y D F VA+W ++EL A+ P + +LGR+SCPL+ P A+ P Sbjct: 109 RDYRADCVFGVAIWGEG-----LAELASALSAPVFQTFLGRKSCPLSAPFDPQIVAAATP 163 Query: 168 QKAL--LNYEPVGGDIYSEESVTGHHLKF-------TARDEPMITLPRQFASREWYV 215 AL L P G + V T D + F + V Sbjct: 164 SAALSQLRLPPWIGAREMDMIVADEGTDLGAPSILETRHDRALDRTLWHFGKGRYAV 220 >UniRef50_B5GY62 Crispr-associated protein (Fragment) n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GY62_STRCL Length = 260 Score = 145 bits (367), Expect = 7e-34, Method: Composition-based stats. Identities = 60/186 (32%), Positives = 83/186 (44%), Gaps = 28/186 (15%) Query: 4 YLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQF 63 L+LRLAGP+Q+WG + +R TG PT+SG++GLL A R + ++ L +++ Sbjct: 85 VLLLRLAGPLQSWGSASAFNSRQTGAEPTKSGVIGLLAAA--DGRARGACIEDL-RALRL 141 Query: 64 AVRCDELILDDRRVSVTGLRDYHTVLGARED-------------YRGLKSHETIQTWREY 110 VR D S T LRDYHT R + T T R Y Sbjct: 142 GVRVDR--------SGTLLRDYHTASDHRGRPLAQAGVGAKGTQRPTSPAKYTQVTTRYY 193 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKA 170 L DA F AL ++ L++AV P + LGRRSC + PL LG + Sbjct: 194 LQDAVFLAALA---GPRALLDRLDRAVRAPAFPLALGRRSCVPSLPLALGVHPG-SLGEV 249 Query: 171 LLNYEP 176 L + Sbjct: 250 LSTHPW 255 >UniRef50_C2KP44 Putative uncharacterized protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP44_9ACTO Length = 245 Score = 145 bits (367), Expect = 8e-34, Method: Composition-based stats. 
Identities = 53/183 (28%), Positives = 75/183 (40%), Gaps = 22/183 (12%) Query: 2 RSYLILRLAGPMQAW-GQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 + + +RLAGP+Q+W G T +PTR L GL+ ACLG R + Sbjct: 1 MTSVYIRLAGPLQSWAGAKVSGNISHTQDYPTRGSLEGLVAACLGCPRGKYPLW---FQD 57 Query: 61 VQFAVRCDELILDDRRVSVTGLRD--YHTVL-------GAREDYRGLK-----SHETIQT 106 +QFAVR D G+RD G R RGL +T Sbjct: 58 LQFAVRVDSPGRICDDYQTIGVRDEDMQVATRLLTLLTGKRATNRGLAFIPDAQGKTTIV 117 Query: 107 WREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASD 166 R L DA F V + H + +L++A+ P + YLGR++ P +LG + S Sbjct: 118 RRTLLADAEFIVQIQCEGH----LEQLDQAISDPTFVSYLGRKAFAPGFPFYLGIGEDSA 173 Query: 167 PQK 169 Sbjct: 174 IDT 176 >UniRef50_B8IMR2 CRISPR-associated protein Cas5 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IMR2_METNO Length = 273 Score = 142 bits (359), Expect = 6e-33, Method: Composition-based stats. Identities = 51/187 (27%), Positives = 70/187 (37%), Gaps = 23/187 (12%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + L+ L P G R + P RS +LGL+ LGI R D + AL Sbjct: 1 MPAGLVFTLYAPFAGMGDVAVGEERGSFDRPARSAVLGLVAGALGIDRADEAGHAALDRG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGA----------REDYRGLKSHETIQTWREY 110 + A+R R + DYHTV R + + T+ + R Y Sbjct: 61 YRLALRL--------RTPGCLVEDYHTVQAPPVDRKARWATRREALAVAGLNTLVSRRAY 112 Query: 111 LCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL----GTCQASD 166 D V L + L A+ +P + PYLGR+SCPL PL G + Sbjct: 113 RADPIVDVVL-IHVDEGPTPEALATALRRPTFAPYLGRKSCPLGLPLRPLWAEGVTRVGS 171 Query: 167 PQKALLN 173 AL Sbjct: 172 LLAALDE 178 >UniRef50_Q2RXJ5 CRISPR-associated protein, Cas5e family n=5 Tax=Proteobacteria RepID=Q2RXJ5_RHORT Length = 249 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. 
Identities = 51/210 (24%), Positives = 81/210 (38%), Gaps = 8/210 (3%) Query: 1 MRSY--LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 M + LI+ L P+ A+G + T FP S L GL LG +R + Q L Sbjct: 1 MPEHRWLIVHLEAPLLAFGGVAIDNVGVTRDFPAASMLTGLFANALGWRRTEWERHQRLQ 60 Query: 59 ESVQFAVRCDELIL-----DDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCD 113 + + FA R + D + ++ T G E G + R+Y D Sbjct: 61 DRLIFAARRERENPTGVLTDTQNAKLSKTERGWTTWGEPEGRDGASYGAPHRRRRDYHGD 120 Query: 114 ASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL-GTCQASDPQKALL 172 AS VAL L + +L A+ +P ++GR+SC + PL A+ +AL Sbjct: 121 ASVVVALRLDAAEEPALDDLAAALDRPARPLFIGRKSCVPSRPLRGKEFVVAATAYQALQ 180 Query: 173 NYEPVGGDIYSEESVTGHHLKFTARDEPMI 202 G D + + + + P+ Sbjct: 181 ALRSDGNDRQRDGAAAERRAVWPVGEGPVD 210 >UniRef50_Q6NEQ9 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ9_CORDI Length = 242 Score = 138 bits (349), Expect = 1e-31, Method: Composition-based stats. Identities = 49/184 (26%), Positives = 75/184 (40%), Gaps = 22/184 (11%) Query: 1 MRSYLILRLAGPMQAW-GQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M +RL+GP+Q+W G T PT S L GLL LG +R + + Sbjct: 1 MIESAYIRLSGPLQSWAGSVVTGNIVRTEPRPTFSSLRGLLAGALGARRGEWPNWLD--- 57 Query: 60 SVQFAVRCDE-LILDDRRVSVTGLRDYHTVLGAREDYRGLKSH-------------ETIQ 105 V+F VR D I+ + ++ L + T +G K++ T Sbjct: 58 DVEFWVREDRKPIVVNEFQTINPLPEVETFRKRLLIAQGRKANSAKALTFTPDAQGGTSI 117 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQAS 165 R YL D + V + + H + E+E A P + YLGR++ P +LG A Sbjct: 118 VNRTYLADGEYLVRVTSSTH----MDEIENAFSSPAFVTYLGRKAFYAEFPFYLGRGSAD 173 Query: 166 DPQK 169 +K Sbjct: 174 AFEK 177 >UniRef50_B5F422 CRISPR-associated protein Cas5 n=59 Tax=Enterobacteriaceae RepID=B5F422_SALA4 Length = 248 Score = 136 bits (342), Expect = 6e-31, Method: Composition-based stats. 
Identities = 69/255 (27%), Positives = 101/255 (39%), Gaps = 39/255 (15%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M YL+ +L GPM +WG R + P+RS LLGLL A LGI+RD+ L A + Sbjct: 1 MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNAFNRH 60 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL-----------KSHETIQTWRE 109 QF L + RDYHTV +E + + + + R+ Sbjct: 61 YQF--------LLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRD 112 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 Y DA + +A+ TP A +++L+ A+ P + YLGR+S PL PL + S Sbjct: 113 YYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGSAADV 172 Query: 170 ALLNYEPVGGDIYSEESV----------TGHHLKFT------ARDEPMITLPRQFASREW 213 Y + + G H T RD P+ F R Sbjct: 173 LREAYRWYQDQFNALKLPLPRLQNECWWEGEHDGLTASKILRRRDMPLSRQQWLFGERSV 232 Query: 214 ----YVIKGGMDVSQ 224 ++ K +SQ Sbjct: 233 NQGPWLRKEDACISQ 247 >UniRef50_B6XT64 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=B6XT64_9BIFI Length = 212 Score = 136 bits (342), Expect = 7e-31, Method: Composition-based stats. Identities = 49/196 (25%), Positives = 76/196 (38%), Gaps = 28/196 (14%) Query: 39 LLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL 98 +L + G R+D ++ L + F VR ++ +RD T Sbjct: 1 MLASAQGRTRED--PIEDLL-GISFGVRVEQR--------GRVIRDLQTEKSLTRKRNSR 49 Query: 99 K-SHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPL 157 K E T+R YL DA F VAL V+ L++A+ P++ YLGRRSCP +PL Sbjct: 50 KFDKEMPLTYRYYLADACFLVAL---GADRSVLEMLDEAIHSPKWPLYLGRRSCPPNYPL 106 Query: 158 FLGTCQA-SDPQKALLNYEPVGGDIYSEE----------SVTGHHLKFTARDEPMITLP- 205 LG D ++AL + + Y T D P+ Sbjct: 107 SLGIHDEYEDIRQALNSETWHASEWYRRRYRYPDLEIVCDAEKGENITTQSDLPLSFSRE 166 Query: 206 -RQFASREWYVIKGGM 220 R++A+R + + Sbjct: 167 GRRYANRAVHRYRIPN 182 >UniRef50_B6IWM3 CRISPR-associated protein, CT1976 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM3_RHOCS Length = 280 Score = 135 bits (340), Expect = 1e-30, Method: Composition-based stats. 
Identities = 57/208 (27%), Positives = 84/208 (40%), Gaps = 31/208 (14%) Query: 3 SYLILRLAGPMQAWGQPTFEG----TRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 ++L LA P AWG + + T P+RS L GLLGA LG++R + L LS Sbjct: 6 AHLCFTLAAPYGAWGAASQSSATTAWKATELDPSRSALTGLLGAALGLER---AHLGRLS 62 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGA---------------REDYRGLKSHET 103 E+++FAVR D + DYHT+ A R G K Sbjct: 63 EALRFAVRTGIRPTRDPQP------DYHTISRAHRPEGREHWSRFEELRPALAGGKQEGA 116 Query: 104 IQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQ 163 + + REY +TVA+ A + + L +A+ P + Y GR++C L P Sbjct: 117 LLSRREYWSLGLWTVAVATLNPAGVPLDRLAQALRTPHWPLYAGRKACTLGLPPDPEVRT 176 Query: 164 ASDPQKALLNYEPVGGDIYSEESVTGHH 191 P LL+Y + + Sbjct: 177 GPGPLSVLLDYGW---PWQRKPGLDRPL 201 >UniRef50_C2GEY8 CRISPR-associated protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEY8_9CORY Length = 242 Score = 134 bits (337), Expect = 2e-30, Method: Composition-based stats. Identities = 48/189 (25%), Positives = 75/189 (39%), Gaps = 37/189 (19%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTR-PTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 M S +RL+GP+Q+W + G T PT +GL GLL LG +RD+ Sbjct: 1 MPSSTFIRLSGPIQSWAGQSVSGNFIRTNPIPTLTGLRGLLAGALGARRDEIPEW---IS 57 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKS------------------- 100 V+F+VR D+ + + + D+ T+ E++ + Sbjct: 58 KVRFSVREDQ--------TGSFVDDFQTIGSREEEWDFRRRIAILQGMKARSIKQLSFKP 109 Query: 101 --HETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 R YL +A F V + H E++ A P + YLGR++ P P + Sbjct: 110 AVGANAVVRRTYLSEAEFIVRVTDERHT----EEIDHAFSSPVFATYLGRKAFPAAFPFY 165 Query: 159 LGTCQASDP 167 LGT Sbjct: 166 LGTGNEDVL 174 >UniRef50_C0W6U0 CRISPR-associated Cas5 family protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6U0_9ACTO Length = 201 Score = 133 bits (334), Expect = 5e-30, Method: Composition-based stats. 
Identities = 50/193 (25%), Positives = 75/193 (38%), Gaps = 39/193 (20%) Query: 39 LLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL 98 +L A +G +R D ++ L S++F VR D+ T LRD+HT Sbjct: 1 MLAAAVGRRRTD--PIEDLL-SLRFGVRKDQ--------PGTVLRDFHTARTLDGKQ--- 46 Query: 99 KSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLF 158 + R YL DA + A+ ++ L+ AV P + YLGRRSCP + PL Sbjct: 47 ---SMPLSERYYLADAVYLAAIE---GEKTLLEGLDVAVRHPVFPLYLGRRSCPPSQPLS 100 Query: 159 LGTCQASDPQKALLNYEPVGGDIYSEESV----------------TGHHLKFTARDEPMI 202 LG A +AL + D + +T D P+ Sbjct: 101 LGIRHA-SLLQALTDEPWQAADWFRLRQDNSFRAEIVIDAASLAPDERGSGYTTLDSPVS 159 Query: 203 TLPRQ--FASREW 213 PR+ + +RE Sbjct: 160 FDPRRRDYQAREV 172 >UniRef50_B8IJS9 CRISPR-associated protein Cas5 family n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IJS9_METNO Length = 253 Score = 129 bits (325), Expect = 5e-29, Method: Composition-based stats. Identities = 51/244 (20%), Positives = 81/244 (33%), Gaps = 34/244 (13%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 MR +L+L L P+QAWG + P FP + + GL+ LG R D L+AL E Sbjct: 1 MREHLLLLLEAPLQAWGGVLVDAYGPVDEFPAATLVGGLVANALGYDRADWQRLEALQER 60 Query: 61 VQFAVRCDELI---LDDRRVSVTGLRDYHTVLGAREDYRGLKS--HETIQTWREYLCDAS 115 + D++ + T G E G + +R+Y D Sbjct: 61 LVVGAAVLRRGSTITDNQNAKLEKGDVGWTTRGRPEGRGGGAEAYKSPHRRFRDYHADTL 120 Query: 116 FTVALWLTPHAT-MVISELEKAVLKPRYTPYLGRRSCPLTH-PLFLGTCQASDPQKAL-- 171 VAL L P + + + P +LGR+ C + ++ +A KAL Sbjct: 121 ALVALRLDPEDEKPDLDAIAHTLEWPERPLFLGRKPCLPSRSIVWPERMRAETLLKALNL 180 Query: 172 -------------------------LNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPR 206 + G+ S V + D P+ R Sbjct: 181 GAAMLVAAESANPKLRLELDAGPWRARWPEREGNAPSSRLVEVCDDRDFRNDVPVGLRRR 240 Query: 207 QFAS 210 + + Sbjct: 241 RVGT 244 >UniRef50_Q3AA65 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AA65_CARHZ Length = 248 Score = 118 bits (296), Expect = 1e-25, Method: Composition-based stats. 
Identities = 45/235 (19%), Positives = 78/235 (33%), Gaps = 33/235 (14%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGR------FPTRSGLLGLLGACLGIQRDDTSSL 54 MR ++ + GP T FP R+ L G++ A LG+ +D SL Sbjct: 1 MRKVIVFEIRGP------AAHFRKFYTNSSSLSYAFPARTTLAGIIAAVLGLPKDSYYSL 54 Query: 55 ---QALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAR--EDYRGLKSHETIQTWRE 109 +++ +++ + V L++ + G L +R Sbjct: 55 LTGNKAHYALRLMTPVRKIMQTVKFVRTKTLKEVNGSGGPTMIPTEIILPVRGRELIYRV 114 Query: 110 YLCDASFTVALWLTPHATMVISELEKAV-LKPRYTPYLGRRSCPLTHPL---FLGTCQAS 165 Y EL K + L P++ YLGR F GT + Sbjct: 115 Y-----------FYHDDPSFQEELAKQLALGPKFPVYLGRSEFLAKIDFLGVFPGTALET 163 Query: 166 DPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLP-RQFASREWYVIKGG 219 + ++N E + + ++ G L++ P P R AS V + G Sbjct: 164 NFVDTVVNLELLKDEELLFSTLHGEDLRYLKEKMPFSFNPDRSIASTASVVYEIG 218 >UniRef50_D1N0K0 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N0K0_9BACT Length = 239 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 69/242 (28%), Gaps = 36/242 (14%) Query: 2 RSYLILRLAGPMQAWGQP--TFEGTRPTGRFPTRSGLLGLLGACLGI--QRDDTSSLQAL 57 S LI LAG M W + G P L G++GA LG R + Sbjct: 1 MSILIFELAGEMAMWRNVYESMGSYSCLGPAPGN--LAGVIGAALGFASPRSQAAEKPDA 58 Query: 58 SESVQFAVRCDELILDDRRVSVTGLRDYHTV---LGAREDYRGLKSHETIQTWR------ 108 + + + ++ D+H +G + + R Sbjct: 59 KQLKNWDKAGLPWPVSPELLAWEETNDFHVACRWIGKFPKRVPWNINGCKEINRSDNLRL 118 Query: 109 --EYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSC------------PLT 154 + + D ++ VA+ L + + A+ KP + LG C Sbjct: 119 QQQVILDPAYEVAVALPAYEA---ERVAAALRKPAFPLCLGASFCRAIVRNVRIEDAVPE 175 Query: 155 HPLFLGTCQASDPQKALLNYEPV--GGDIYSEESVTGHHLKFTARDEPMITLPRQFASRE 212 P + +A + G + G+ + T +I + R Sbjct: 176 SPFWAFRTDGGALGEATPFSRHIVNPGSCFERIRSDGYWIYPTPDQPGVIAA--EPLVRG 233 Query: 213 WY 214 W Sbjct: 234 WV 235 >UniRef50_A0LM54 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LM54_SYNFM Length = 210 Score = 103 bits (257), Expect = 5e-21, Method: 
Composition-based stats. Identities = 38/196 (19%), Positives = 72/196 (36%), Gaps = 39/196 (19%) Query: 46 IQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQ 105 ++R D + +++ VR D +D+ T G S + Sbjct: 26 LRRGDLA-------ALRMGVRVDR--------EGLLRKDFQTAQNVIVA--GGGSVGDLV 68 Query: 106 TWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFL--GTCQ 163 + R +L DA+F V L M++ L ++ PR+ +LGR+S + P ++ G + Sbjct: 69 SNRYFLSDAAFLVGLE---GDFMLLHRLHASLAHPRWPVFLGRKSYVPSIPPYIKNGLLE 125 Query: 164 ASDPQKALLNYEPV---GGDIYSEESVTGHH--------------LKFTARDEPMITLPR 206 ++ AL ++ P+ + V RD+PM Sbjct: 126 GAELMSALASFTPLISVEELAARRKRVESGRRVERTRFVLESASPTHEIRRDQPMSFALG 185 Query: 207 QFASREWYVIKGGMDV 222 Q + +V+ +DV Sbjct: 186 QRVFHDRFVVTEYLDV 201 >UniRef50_UPI0000F51765 hypothetical protein Faci_00030 n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0000F51765 Length = 237 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 52/149 (34%), Gaps = 21/149 (14%) Query: 1 MRSYLILRL--AGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 M+ ++R+ G + ++ P T P ++ ++G++ A +G RDD +++L Sbjct: 1 MKEIKLIRINAYGIINSFRIPLHMTIHDTLDLPVKTHIIGMIAAAMGYLRDDKEKIESLY 60 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 ++ + + + + + K++ TI Y+ Sbjct: 61 KNTSIGIYGTSYSKFYDLIRIYKYKGKEVEVSLVNRQINYKNNYTI-----YI------- 108 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLG 147 + E+ + P + LG Sbjct: 109 -------ENNNLEEIYNFLKNPVFALSLG 130 >UniRef50_B0K553 CRISPR-associated protein Cas5, Hmari subtype n=4 Tax=Thermoanaerobacter RepID=B0K553_THEPX Length = 236 Score = 95.0 bits (235), Expect = 2e-18, Method: Composition-based stats. 
Identities = 36/225 (16%), Positives = 67/225 (29%), Gaps = 40/225 (17%) Query: 16 WGQPTFEGTRPTGR------FPTRSGLLGLLGACLGIQRDD-TSSLQALSESVQFAVR-- 66 WG+ T P R+ + G++ A LG +RD L A E++ AVR Sbjct: 7 WGKFAHFRKFYTNSSSLTYSVPPRTTVEGMIAALLGYERDTYYEKLNA--ENLYVAVRKM 64 Query: 67 --CDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 +++ + T L + H + L ++ +R Y+ Sbjct: 65 SKTKKIMQSVNYIKATTLGELHFPKQHTQIPFELLISDSKIRYRFYI-----------IH 113 Query: 125 HATMVISELEKAV--LKPRYTPYLGRRSCPLTHPL--FLGTCQASDPQKALLN-----YE 175 + E+++ + P + Y G + P ++ + A Y Sbjct: 114 KDENIFREIKERLFKKAPVFPLYFG------SAPFSCYIDYVEEVTWDWATSEDFQDIYS 167 Query: 176 PVGGDIYSEESVTGHHLKFTARDEPMITL-PRQFASREWYVIKGG 219 + D E + P R YV + Sbjct: 168 VMPSDKIKEIDIKNMKGYLLKERMPRDFGIDRTIKEVTTYVYEDV 212 >UniRef50_B5IGN1 CRISPR-associated protein Cas5 n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IGN1_9EURY Length = 236 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 50/150 (33%), Gaps = 21/150 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGI-QRDDTSSLQALSES 60 L +L ++ + T P + ++GLLG+ LG+ R+ + L++ Sbjct: 1 MEVLTAKLRAISVSFRRILDFNYHRTYPLPPPTTIVGLLGSALGLSDRELWNEYNGLND- 59 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWR--EYLCDASFTV 118 + FAV +D ++ + + R + + Sbjct: 60 ISFAVLSLR--------KPGFAKDMWSIQKIKNGR---------ISERSPYFRELLFYPE 102 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGR 148 + + + + + A+ P Y LGR Sbjct: 103 YVLIFNGDSKSLECVRDALNNPEYALSLGR 132 >UniRef50_Q1AZD3 CRISPR-associated protein, Cas5h family n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AZD3_RUBXD Length = 258 Score = 91.2 bits (225), Expect = 3e-17, Method: Composition-based stats. 
Identities = 45/241 (18%), Positives = 75/241 (31%), Gaps = 37/241 (15%) Query: 5 LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFA 64 LI L G + + + + FP R+ L GL+ +G +RD + +L E Q A Sbjct: 7 LIFDLCGAYGMFRKFYTNSSSLSYPFPPRTTLAGLIAGMMGCERDSYAEDLSL-ERCQIA 65 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETI-------------QTWREYL 111 V + + + + H + G + +R Y Sbjct: 66 VSVITPVRRVMQQVNYVMTEGHVWTKNTGGFDGSSGPIQVPVEWVFPEVGHRELRYRVY- 124 Query: 112 CDASFTVALWLTPHATMVISELEKAVLK--PRYTPYLGRRSCP-------LTHPLFLGTC 162 T + L + + P Y PYLG CP LG Sbjct: 125 ----------ATHEDRGWLKRLAEILEGGVPIYPPYLGMSECPGRVEHVATLEGWGLGHR 174 Query: 163 QASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDV 222 + P + +L E V G EE V + + R A+ + + G + Sbjct: 175 EDELPVRTVLPSEAVSGPPRLEEGVQIVKERI---PLALDERRRLIAAADVLYNRAGPHI 231 Query: 223 S 223 + Sbjct: 232 T 232 >UniRef50_B1I5P1 CRISPR-associated protein Cas5 n=2 Tax=Clostridia RepID=B1I5P1_DESAP Length = 238 Score = 77.7 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 49/215 (22%), Positives = 75/215 (34%), Gaps = 38/215 (17%) Query: 13 MQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELIL 72 M ++ +P + T P + L GL GA G+ + Q S D L+ Sbjct: 1 MASFRRPLDHNYQRTLPLPPPTTLFGLAGAARGL--AEEELWQEASPL------RDLLVA 52 Query: 73 DDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISE 132 RD TV+ + + +S +RE L +A F + L +++E Sbjct: 53 TLALQKPGLARDMWTVMKIKNNKLAERSPY----FREILFNARFMI---LYGGPEELLAE 105 Query: 133 LEKAVLKPRYTPYLGR-------------RSCPLTHPLFLGTCQASDPQKALLNYEPVGG 179 L++A L P Y LGR +C PLF GT D Q + P G Sbjct: 106 LQQAFLDPTYPLSLGREDELIVVEELGRGETC-PGAPLFSGTVIPGDLQGLRFKWVPRPG 164 Query: 180 DIYSEESVTGHHL---------KFTARDEPMITLP 205 + +V L ++ P LP Sbjct: 165 IAFEPPAVETMPLAFEVDKRGIRYPLNPRPFTFLP 199 >UniRef50_D1CHV1 CRISPR-associated protein Cas5 n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CHV1_THET1 Length = 264 Score = 62.7 bits (151), Expect = 9e-09, Method: Composition-based stats. 
Identities = 34/234 (14%), Positives = 63/234 (26%), Gaps = 45/234 (19%) Query: 1 MR-SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQR-----DDTSSL 54 M +I LAG + + + + FP R+ L GL+ +G +R L Sbjct: 1 MPNKMIIFDLAGAYAMFRKFYTNSSSLSYPFPPRTVLAGLIAGIMGYERQGHRNTYAEHL 60 Query: 55 QALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSH------------- 101 + +VR + + + + + + G + Sbjct: 61 APGVADIALSVRV--PVRRVMQTVNYVMTEGNVWSRNAGGFDGSRERTLTPVEWVFPASG 118 Query: 102 ETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLK-PRYTPYLGRRSCPL------- 153 +R YL T + + Y PYLG CP Sbjct: 119 RRQLRYRVYL-----------THRDEGWLERFAGYLRSGAVYPPYLGMTECPAVIEPVAE 167 Query: 154 THPLFLGTCQASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQ 207 LG + P ++N D + + + P+ + Sbjct: 168 VEDWELGVREEVLPISTVIN-----ADKIVDLPLPDADAQIVKERMPIALDNDR 216 >UniRef50_A5D0Y3 Putative uncharacterized protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D0Y3_PELTS Length = 242 Score = 59.2 bits (142), Expect = 9e-08, Method: Composition-based stats. Identities = 46/224 (20%), Positives = 72/224 (32%), Gaps = 21/224 (9%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 L+ + G + + QP T T FP R + GLL + LG+ DD + L E Sbjct: 4 QVLVFSIKGSLAHFRQPDTTATHATYPFPPRPTIHGLLASVLGLDFDDEAGAAFLHEEHF 63 Query: 63 FAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWL 122 + + + + + G T E + + V Sbjct: 64 VGLSLLKPVRTVCA---------QMSMHGKGFTGGGGDSFNRLTTIELVVSPHYLVYYTG 114 Query: 123 TPHATMVISELEKAVL--KPRYTPYLGRRSCPLTHPLFLGTCQ----ASDPQKALLNYEP 176 + + EL + + + Y YLG C LT P+F G A ++ L Sbjct: 115 SR-----LGELAERIRTGQSVYHTYLGSAYC-LTFPVFHGLYPLLEVAPGEEEPLPCSSV 168 Query: 177 VGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGM 220 V + E V AR P + +F R VI Sbjct: 169 VPQGVIQEILVEPGGNYAVARALPYRHVGGRFFERTLNVIYEVN 212 >UniRef50_A3DHS3 CRISPR-associated protein Cas5 n=3 Tax=Clostridium thermocellum RepID=A3DHS3_CLOTH Length = 241 Score = 57.7 bits (138), Expect = 3e-07, Method: Composition-based stats. 
Identities = 33/146 (22%), Positives = 52/146 (35%), Gaps = 17/146 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LI+ L ++ P + T P S + G+ GA LG+ +D AL+ Sbjct: 1 MYGLIVTLYAKTASFRDPGAQLYHETMSLPPPSTITGIAGAALGLSFED-----ALAFMK 55 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + AV + R +D + G ++ I R +L D V ++ Sbjct: 56 ENAVMVGCNGSSEGRG-----KDLW---NYTKIKSGEITNAIII--RNFLADLK--VEIF 103 Query: 122 LTPHATMVISELEKAVLKPRYTPYLG 147 VI+ L A P Y LG Sbjct: 104 FACEKREVITRLADAFENPVYAITLG 129 >UniRef50_D1B1G1 CRISPR-associated protein Cas5 n=1 Tax=Sulfurospirillum deleyianum DSM 6946 RepID=D1B1G1_SULD5 Length = 237 Score = 57.7 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 18/147 (12%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGA-CLGIQRDDTSSLQALSES 60 + + G + ++ P F T P ++ ++GLL L Q++ L E Sbjct: 1 MEAIRFEVEGLLNSFRVPFFRTYHKTFLAPPKTTIIGLLCNIALKSQKEFFEILN--QEL 58 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVAL 120 + +V DE+ +D + + G R+ L + +T+ L Sbjct: 59 IDVSVIIDEING--------KTKDLWSYKTLEKGNMGKS-----VIRRDKLFLSKYTIYL 105 Query: 121 WLTPHATMVISELEKAVLKPRYTPYLG 147 + + E+ A+ P+ TP LG Sbjct: 106 SIKNK--TLFDEIYSALKNPKNTPALG 130 >UniRef50_A7ZDW7 Crispr-associated protein Cas5 n=3 Tax=Campylobacter RepID=A7ZDW7_CAMC1 Length = 236 Score = 57.3 bits (137), Expect = 4e-07, Method: Composition-based stats. 
Identities = 38/206 (18%), Positives = 66/206 (32%), Gaps = 23/206 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGT------RPTGRFPTRSGLLGLLGACLGIQRDDTSSLQ 55 S + RL +G T P ++ ++G LGA +G +D L Sbjct: 1 MSIVAFRL------FGDYAHFSHPATIYSSLTYPVPPKTTIMGFLGAVIG--EEDYFKLS 52 Query: 56 ALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 + SV+ + + + + H G + E Q +RE +C S Sbjct: 53 NIQYSVKIDRQILKKSFVFNGIKFALSSNMHIEEGYQNAK------EKKQFYRELICSPS 106 Query: 116 FTVALWLTPHATMVISELEKAVL--KPRYTPYLGRRSCPLTHPLF-LGTCQASDPQKALL 172 + V L L ++ + K +TPYLG C + C+ ++ + Sbjct: 107 YVVFLNLENLEQSYQDKIISNLKEHKTAFTPYLGINFCIADFSWIDIKICEKISQDESFI 166 Query: 173 NYEPVGGDIYSEESVTGHHLKFTARD 198 N + D E L Sbjct: 167 NTFTLMDDFVFEGINENAKLTTARMP 192 >UniRef50_A7HLK6 CRISPR-associated protein Cas5, Hmari subtype n=2 Tax=Thermotogaceae RepID=A7HLK6_FERNB Length = 236 Score = 56.9 bits (136), Expect = 5e-07, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 64/244 (26%), Gaps = 62/244 (25%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGR------FPTRSGLLGLLGACLGIQRDDTSSL- 54 +L+ + G+ T P R+ L G++ A LG +RD + Sbjct: 1 MEFLVFDIK------GKFAHFRKFYTNSSSLSYSIPPRTTLEGIIAAILGFERDSYYEIL 54 Query: 55 --QALSESVQFAVRCDELILDDRRVSVTGLR-----DYHTVLGAREDYRGLKSHETIQTW 107 Q L+ ++ A ++I + D HT + + + + Sbjct: 55 NAQKLNIGLKKATPTRKIIQTLNYIKAKTPSNVYDPDEHT-----QIPFEIITSNDKVIY 109 Query: 108 REYLCDASFTVALWLTPHATMVISELEKAVLKPRYT--PYLGRRSCPLTHPLFLGTCQAS 165 R Y + ++ +LE + ++ PY G P + Sbjct: 110 RVY-----------VNHVDQTIMEDLEYRLKNNKFCYIPYFG------VAPFNISI---- 148 Query: 166 DPQKALLNYEPVGGDIYSEESV----------TGHHLKFTARDEPMITLPRQFASREWYV 215 D + D SV + P R Sbjct: 149 DLKGKFQAEPKFSEDFVKVSSVIRRNLISSLKVEEEVILLKEKMPRDFSR----ERTVIE 204 Query: 216 IKGG 219 ++ Sbjct: 205 MEDY 208 >UniRef50_C7P9L3 CRISPR-associated protein Cas5 n=2 Tax=Methanocaldococcus RepID=C7P9L3_METFA Length = 258 Score = 56.5 bits (135), Expect = 6e-07, Method: Composition-based stats. 
Identities = 23/170 (13%), Positives = 50/170 (29%), Gaps = 27/170 (15%) Query: 2 RSYLILRLAGP-MQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 L R G ++ +P T + P + + GL+ LG+ RD Sbjct: 1 MWGLKFRCEGIYFVSFRKPVTTSLSLTYKLPPFTAIRGLIANALGMPRDSFEIQNWFKIG 60 Query: 61 VQFAVRCDELILDDRRVSVTGLR--------DYHTVLGAREDYRGLKSHETIQT------ 106 ++ + + + + + + + + ++ E + Sbjct: 61 MRVEGKIEIGREMAKFLKMISRKSCYRCENCRFEKIADSKPKKCPNCGKENLVKVEKMIY 120 Query: 107 ---------WREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLG 147 RE+L + + L I ++ A+ P YLG Sbjct: 121 ERAFPSSPMHREFLIMPKYWIYLV---GEEKKIKKIYYALKSPERPLYLG 167 >UniRef50_D0MJ68 CRISPR-associated protein Cas5 n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MJ68_RHOM4 Length = 236 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 36/144 (25%), Positives = 54/144 (37%), Gaps = 17/144 (11%) Query: 8 RLAGPMQAWGQPTFE--GTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAV 65 LAGP+ ++ P G +PT P S + GL+ A LG D E++QFA Sbjct: 12 ELAGPVASFRYP-HFLIGRQPTYPMPPPSTIYGLISAALGRFPD--------PEALQFAY 62 Query: 66 RCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPH 125 R + V +T + L++ I RE+L T+ + Sbjct: 63 RFECARHRVDDVETIWFVQPNTATRGEAARKNLEATSNILP-REWLVHPRLTLYVTGDE- 120 Query: 126 ATMVISELEKAVLKPRYTPYLGRR 149 + L +A P Y LGR Sbjct: 121 ----LEALYRAFRSPCYILTLGRS 140 >UniRef50_B7R550 CRISPR-associated protein Cas5 n=1 Tax=Thermococcus sp. AM4 RepID=B7R550_9EURY Length = 275 Score = 54.6 bits (130), Expect = 3e-06, Method: Composition-based stats. 
Identities = 27/159 (16%), Positives = 50/159 (31%), Gaps = 17/159 (10%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACL----GIQRDDTSSLQALS 58 L++ L P + P T P +S ++G+L L G +R Sbjct: 4 KTLLIELFQPFAQYRNPFTFYYAQTYPLPPKSTIIGMLQNALNDWYGNERGIDEWWN--- 60 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGL------KSHETIQTWRE-YL 111 ++ +V + + + L + + R L Sbjct: 61 --LRVSVHGGFESVFWNYQQLIKATKTGISIVRFRGKPTLWNQKLPLYGFPVTSQRSPVL 118 Query: 112 CDASFTVALW-LTPHATMVISELEKAVLKPRYTPYLGRR 149 F L+ L +SE+++A+ +PR LGR Sbjct: 119 QQELFNGWLYILLKGEEEFLSEIKEALERPRKVISLGRS 157 >UniRef50_C9RCY2 CRISPR-associated protein Cas5, Hmari subtype n=2 Tax=Thermoanaerobacteraceae RepID=C9RCY2_AMMDK Length = 266 Score = 54.2 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 36/160 (22%), Positives = 56/160 (35%), Gaps = 25/160 (15%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 M + + L GP + + + + FP R+ L+G + A LG +RD LSE+ Sbjct: 1 MVNVAVFDLVGPFAHFRKYYTNSSSLSYAFPPRTALMGTVAAVLGWERDSYYEKLGLSEA 60 Query: 61 VQFA----VRCDELILDDRRVSVT-----GLRDYHTVLGAREDYRGLKSHETIQT--WRE 109 +FA V LI + LR V G + L T + +R Sbjct: 61 -RFAVVIKVPVRRLIQTVNYIRTKEEDLNRLRKLEAVKGTQVPLELLLPGGTASSLCFRV 119 Query: 110 YLCDASFTVALWLTPHATMVISELEKAVL--KPRYTPYLG 147 Y V EL + + + + YLG Sbjct: 120 YFA-----------HRDDQVTRELAERLAAGRSYFPLYLG 148 >UniRef50_B5YBH8 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Dictyoglomus thermophilum H-6-12 RepID=B5YBH8_DICT6 Length = 250 Score = 53.1 bits (126), Expect = 8e-06, Method: Composition-based stats. 
Identities = 24/157 (15%), Positives = 54/157 (34%), Gaps = 24/157 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES- 60 ++ L G M + + + + FP R+ ++GL+ LG +RD + + ++ Sbjct: 1 MKVIVFDLLGKMAHFRKFYTNSSSLSYHFPPRTTIVGLIAGLLGYERDTYYEIFSTDKAK 60 Query: 61 VQFAVR--------CDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLC 112 + V+ + + + R HT E E +R Sbjct: 61 ITIGVKSPLRKILQVVNYVWAENVKQLNQSRGQHT-QIPLEIIFPQDMKE-DICYR---- 114 Query: 113 DASFTVALWLTPHATMVISELEKAVLK--PRYTPYLG 147 ++ ++ +L+ ++ + PYLG Sbjct: 115 -------IFFHHKDEKIMEDLKNKLVNFDFCFPPYLG 144 >UniRef50_C6QNG6 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Geobacillus sp. Y4.1MC1 RepID=C6QNG6_9BACI Length = 244 Score = 52.7 bits (125), Expect = 8e-06, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 50/157 (31%), Gaps = 25/157 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 LI L G M + + + + FP R+ ++G++ LG++RD + + + Sbjct: 1 MKMLIFDLIGKMGHFRKIDTNSSSLSYAFPPRTTIVGMIAGILGMERDSYYEVFS-PDQC 59 Query: 62 QFAVRCDELILDDRRVSVTGLR---------DYHTVLGAREDYRGLKSHETIQTWREYLC 112 Q A+ I + HT E E +R Y Sbjct: 60 QIAISVRTPIRKVMQTVNYMFVKSKAHLNNSGGHT-QIPLEFVLPG-GEEANLRYRIY-- 115 Query: 113 DASFTVALWLTPHATMVISELEKAVLKP--RYTPYLG 147 + V +++ + Y PYLG Sbjct: 116 ---------FSHSDRSVYESVKERIQSGRYVYPPYLG 143 >UniRef50_O27159 Putative uncharacterized protein n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O27159_METTH Length = 269 Score = 52.7 bits (125), Expect = 9e-06, Method: Composition-based stats. 
Identities = 24/153 (15%), Positives = 47/153 (30%), Gaps = 11/153 (7%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L + + P + P T +S + G+L G R + +S S+ Sbjct: 1 METLAVEIFQPFAQFRNPFTFDYAQTYPLSPKSTVTGMLQNATG--RYYDQDISEISVSI 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYH-----TVLGAREDYRGLKSHETIQTWREYLCDASF 116 + + YH Y K + ++++ L + + Sbjct: 59 H-GLFESTFWNYQSFIVGDIALKYHHNKLKLWNKGYPLYSVNKKSQRSPSYQQELFNGHY 117 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPYLGRR 149 + L +I E+ ++ KP LGR Sbjct: 118 YIFLR---GDDDIIEEVCDSLRKPTKPLSLGRS 147 >UniRef50_D1A6P6 Metal dependent phosphohydrolase n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A6P6_THECD Length = 1027 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 41/147 (27%), Gaps = 26/147 (17%) Query: 8 RLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRC 67 +L P+ ++ P F G P S L G+L A G +E V V Sbjct: 14 QLYAPVASFRDPMFPGVTRCLPVPPPSTLRGMLAAATGRP----------AEPVVLGV-- 61 Query: 68 DELILDDRRVSVTGLRDYH-TVLGAREDYRGLK----SHETIQTWREYLCDASFTVALWL 122 YH G + R +L T+ + + Sbjct: 62 ----CAYAEGRGVDTETYHPIAADGSNPAIGGRVRPGKGGMTIRERPFLTGVHITLWVPM 117 Query: 123 TPHATMVISELEKAVLKPRYTPYLGRR 149 + A+ +P + LGR Sbjct: 118 PDG-----ERIATALRRPTWGLRLGRS 139 >UniRef50_A4J1X9 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J1X9_DESRM Length = 242 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. 
Identities = 31/151 (20%), Positives = 61/151 (40%), Gaps = 9/151 (5%) Query: 1 MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDD-TSSLQALSE 59 M + RL+G + + + + P R+ LLG+LGA LG+++D L+ L Sbjct: 1 MMKLIAFRLSGRFGHFLRAEAGTSALSYPVPPRTVLLGILGAVLGLEKDLPQELLEPLHI 60 Query: 60 SVQFAVRCDELILDD-RRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 ++ V R+ L V ++ + K +E+L + ++T+ Sbjct: 61 ALAGPVPQSHWHKAKLRKDPPEALP--QVVKNNQKQEKTTKPEMATLITQEWLFNPAYTI 118 Query: 119 ALWLTPHATMVISELEKAVLKP--RYTPYLG 147 + L +LE+ + + + P LG Sbjct: 119 WVALP---EPYHQQLEQRLKERCWHFQPCLG 146 >UniRef50_O57910 Putative uncharacterized protein PH0171 n=1 Tax=Pyrococcus horikoshii RepID=O57910_PYRHO Length = 227 Score = 51.1 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 68/218 (31%), Gaps = 25/218 (11%) Query: 1 MRSYLILRLAG-PMQA-WGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALS 58 M L L + P+QA + P T FP ++ +G+L C+G+ + L Sbjct: 1 MSELLGLIVDARPLQAHFRIPHTSLLLDTYPFPPKTTAVGMLAGCMGLG---EEGFKKLL 57 Query: 59 ESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTV 118 E +++ V + + + + + R L F + Sbjct: 58 EKIKYGVIIE--------DPGEKIEEVSVIYK-------NPYSPSYPITRVSLYKPRFRM 102 Query: 119 ALWLTPHATMVISELEKAVLKPRYTPYLGRRS--CPLTHPLFLGTCQASDPQKALLNYEP 176 VI E + +L P++ PY+G ++ + ++ +L Sbjct: 103 FFA---GEERVIEEAYEGLLDPKFVPYMGDSESLFYPGKKRYVEVVDVQEGKEDILRSVI 159 Query: 177 VGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWY 214 + L + P+ + + R Y Sbjct: 160 PEVKFKEFVPLRRKVLVPKVYEAPVKFTYKGKSRRAVY 197 >UniRef50_Q2NH81 Putative uncharacterized protein n=1 Tax=Methanosphaera stadtmanae DSM 3091 RepID=Q2NH81_METST Length = 235 Score = 50.4 bits (119), Expect = 4e-05, Method: Composition-based stats. 
Identities = 30/150 (20%), Positives = 50/150 (33%), Gaps = 16/150 (10%) Query: 16 WGQPTFEGTRPT------GRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDE 69 WG + T FP+R+ + G + LG R+ L E+ + ++ Sbjct: 9 WGDYAYFRRGYTTTSTLTYPFPSRTTIAGFIAGILGYPRNSYYDL-FQKENSKIGLKIIN 67 Query: 70 LILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMV 129 I R +T T E+L + + + L L + Sbjct: 68 PIKKTRINLNYI----NTKNSMLLSEIKGNGKRTQVPA-EFLKNVKYRIYLSL--DDEEI 120 Query: 130 ISELEKAVL--KPRYTPYLGRRSCPLTHPL 157 +++L + K YTPYLG C L Sbjct: 121 MNKLYNTLKEHKSVYTPYLGITECLANFSL 150 >UniRef50_B1LAM2 CRISPR-associated protein Cas5, Hmari subtype n=2 Tax=Thermotoga RepID=B1LAM2_THESQ Length = 220 Score = 50.4 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 60/170 (35%), Gaps = 23/170 (13%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L+ ++ + +P + T FP R+ LLGL+G LG L + +V Sbjct: 1 MKVLVFDVSTSYALFRRPYTTTSSYTLPFPPRTALLGLVGCVLGY--STPEKLDSAKVAV 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 Q + + LR T E + K+ + + + L + ++ V Sbjct: 59 QI------------KNPLKFLR---TGTNFVETKKDKKASKRTRISLQLLKNPAYRVFFS 103 Query: 122 LTPHATMVISELEKAVLK--PRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 L+ + +TPYLG S ++G +A+ + Sbjct: 104 WEDKD---FERLKNLLEHNETIFTPYLGVASFIAKLD-YVGEYEATRVED 149 >UniRef50_D2QT47 CRISPR-associated protein Cas5 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QT47_9SPHI Length = 237 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. 
Identities = 26/143 (18%), Positives = 42/143 (29%), Gaps = 21/143 (14%) Query: 6 ILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGI-QRDDTSSLQALSESVQFA 64 + L ++ P F+ + P + L+GL GA LG+ R L Sbjct: 5 TIELKSVTASFRNPEFQNFHKSFPLPPPTALIGLAGAALGLSPRMAQDFLD--------- 55 Query: 65 VRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTP 124 ++ RD Y L + RE + + L Sbjct: 56 --TNQFQAGVSGKGEGMTRDLW-------KYDRLTGTGSSIILREIYVNPHY--RLVFGS 104 Query: 125 HATMVISELEKAVLKPRYTPYLG 147 + +L+ A P Y LG Sbjct: 105 DNHEAVEQLKAAFDSPVYALTLG 127 >UniRef50_A3XI99 Putative uncharacterized protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XI99_9FLAO Length = 243 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 48/148 (32%), Gaps = 15/148 (10%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQ--RDDTSSLQALSE 59 I+ + ++ P F+ + P + ++G+ GA LG R ++ E Sbjct: 1 MKNFIIEIQCQTASFRNPDFQNFHKSLELPPPTTVIGIAGAALGYSPLRAQEFFDESKFE 60 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 + + + +RD + RE+L +F +A Sbjct: 61 IGIYGTYLGKCKDTWKYNKG--IRDM---------RLYDPGLDGSIIQREFLIFPTFIIA 109 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLG 147 + + +L KA P Y +G Sbjct: 110 --FSSENIKAVEKLYKAFTSPVYALTMG 135 >UniRef50_C7NNV3 CRISPR-associated protein Cas5, Hmari subtype n=3 Tax=Halobacteriaceae RepID=C7NNV3_HALUD Length = 264 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. 
Identities = 38/213 (17%), Positives = 63/213 (29%), Gaps = 21/213 (9%) Query: 5 LILRLAGPMQAWGQPTFEG---TRPTGRFPTRSGLLGLLGACLGIQRDDTSSL--QALSE 59 L + GP WG + T R R+ + GL+ A LGI RD L +S Sbjct: 21 LSFEIRGP---WGHFRRVEGNVVKQTYRIVPRTTVAGLIAAVLGIDRDGYYDLFGPEVSA 77 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWRE-Y--LCDASF 116 V + T D T L R + T + Y L D ++ Sbjct: 78 IAIQPVEELRTVNMPMNTLSTAAGDL-TSLNPRGKISIKLPNPTKLRQQHNYEVLVDPAY 136 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEP 176 + + L +L + + G+ + L + + + P Sbjct: 137 RIDVALADDERY--EQLRETLAA-------GKSHYVPSLGLSEYLAEIDYLGEFDVKPGP 187 Query: 177 VGGDIYSEESVTGHHLKFTARDEPMITLPRQFA 209 G I + +V E + + A Sbjct: 188 ASGTIAVDSAVPDAMDDVVLDPETRCQIEQSPA 220 >UniRef50_C8PE23 CRISPR-associated protein Cas5 n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PE23_9PROT Length = 237 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 56/152 (36%), Gaps = 10/152 (6%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L+G + P + T P ++ ++GLLGA +G + L + SV Sbjct: 1 MDIIAFELSGDYAHFSHPATIYSSLTYPVPPKTAIMGLLGAIIG--EMNYFKLNDIGYSV 58 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 + + + + + H G S E Q ++E + D + + + Sbjct: 59 ILSSQFRKKTMIFNGIKFALSSSMHIEQG------YQDSSEKKQFYKELIKDPRYVIFVD 112 Query: 122 LTPHATMVISELEKAVL--KPRYTPYLGRRSC 151 L+ + ++ + +TPYLG C Sbjct: 113 LSALNASYKENIIDSLKAHRCVFTPYLGINFC 144 >UniRef50_B9MPU1 CRISPR-associated protein Cas5, Hmari subtype n=2 Tax=Clostridia RepID=B9MPU1_ANATD Length = 250 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. 
Identities = 29/183 (15%), Positives = 57/183 (31%), Gaps = 36/183 (19%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGR------FPTRSGLLGLLGACLGIQRDDTSSLQ 55 +L+ L G+ T P R+ + G++ A LG +RD + Sbjct: 1 MKFLVFDLK------GKFAHFRKFYTNSSSLSYLVPPRTVIEGMVAAILGFERDSYYDMF 54 Query: 56 ALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLC--D 113 + +E++ AV ++ + +Y E + E L + Sbjct: 55 S-AENLLVAV-----QKLEKTYKIVQTVNYIKAQTISELKNPNTHTQVPL---EILAGYN 105 Query: 114 ASFTVALWLTPHATMVISELEKAVLK--PRYTPYLGRRSCPLTHPL-----FLGTCQASD 166 +++ P + S L + + Y G + P FLG +A Sbjct: 106 GFVGFRVFVMPKDEKIYSFLRARLESGKSEFPIYFG------SAPFAAKIEFLGEFEACR 159 Query: 167 PQK 169 + Sbjct: 160 WED 162 >UniRef50_B1B9K0 Crispr-associated protein Cas5, tneap subtype n=1 Tax=Clostridium botulinum C str. Eklund RepID=B1B9K0_CLOBO Length = 244 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 54/150 (36%), Gaps = 11/150 (7%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQ-ALSES 60 ++L++ + + +PT + T P S ++G++ A G D+ ++ ++ + Sbjct: 1 MKAILLKVTQNLVNYKKPTSFQLKETYPLPPYSTVIGMVHAACGF--DEYKDMEVSIQGN 58 Query: 61 VQFAVRCDELILDDRRVSVTGLRDY-HTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 V + + D H + ED + E L D + Sbjct: 59 YHSKV---NDLYTRYEFAGASYEDKRHNIKLKGEDKYYGAMRG--VSTCELLVDVELLIH 113 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRR 149 + P +I + + + P+ +GRR Sbjct: 114 I--KPKDENLIQIIYQNLKYPKEYLSIGRR 141 >UniRef50_A3DKB9 CRISPR-associated protein Cas5, Hmari subtype n=2 Tax=Clostridium thermocellum RepID=A3DKB9_CLOTH Length = 237 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. 
Identities = 27/149 (18%), Positives = 53/149 (35%), Gaps = 26/149 (17%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSL-QALSESV 61 YL+ ++ + +P + T PTR+ + G++ A LG ++D + Sbjct: 4 KYLVFDISASYGHFKKPYTTTSPLTYSIPTRTAVSGIIAAVLGFGKEDYQEHFTKPQAKI 63 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWR-----EYLCDASF 116 +R V +R E+ K I R E+L DA + Sbjct: 64 AIGIR----------NPVKKVR-------ISENLINTKKSMNIIHERTQIKIEFLKDACY 106 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPY 145 ++ T + L++++ + T Y Sbjct: 107 --RIYFTHTDKQIYERLKESLKE-HRTVY 132 >UniRef50_C8VZL1 CRISPR-associated protein Cas5 n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VZL1_DESAS Length = 253 Score = 46.1 bits (108), Expect = 8e-04, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 1/64 (1%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + L G M + + + + P R+ L+G++ LG +RD + +L+ S Sbjct: 1 MKIISFHLRGKMAHFRRYYANSSALSYSIPPRTTLIGIVAGLLGWERDKYYEIFSLN-SC 59 Query: 62 QFAV 65 + AV Sbjct: 60 KVAV 63 >UniRef50_B1I4M5 CRISPR-associated protein Cas5 family n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I4M5_DESAP Length = 255 Score = 46.1 bits (108), Expect = 8e-04, Method: Composition-based stats. 
Identities = 23/153 (15%), Positives = 51/153 (33%), Gaps = 24/153 (15%) Query: 2 RSYLILRLAGPMQAWGQPTFE--GTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSE 59 + + P+ ++ P G + + P S + G + + +G + + + Sbjct: 1 MRVAKVHIEAPIASFRYP-HFLIGRQTSFDMPPPSTIYGHIASAVG----EWFNPATVKF 55 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREY---LCDASF 116 + QF+ + + L H + + ++ + + T L D F Sbjct: 56 AYQFS----------FQAKGSDLEHQHVITKGGQTFKWGERKYPVSTQAVVQPHLRDFLF 105 Query: 117 TVALWLTPHATMVISELEKAVLKPRYTPYLGRR 149 L L + +L +A P + LGR Sbjct: 106 GCRLTLYLYPP----DLAEAFRNPVFCVILGRS 134 >UniRef50_Q3M7D6 Fruiting body developmental protein S-like protein n=3 Tax=Cyanobacteria RepID=Q3M7D6_ANAVT Length = 210 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 46/142 (32%), Gaps = 17/142 (11%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 + + L++ P+ + Q T P S + G+L + +G + + + Sbjct: 1 MTTIALKVEVPIACFRQSRAREYGETYPVPPPSTVYGMLLSLVG----EVDRYKHCGVKL 56 Query: 62 QFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 A L T +R +H + S ++E L + F V + Sbjct: 57 AIA-------LLSEPEKSTVIRTFH----RFKTKNIHDSKNNKPDYQELLTNIEFIVWVD 105 Query: 122 LTPH--ATMVISELEKAVLKPR 141 ++ L +A+ P Sbjct: 106 AGVDKAKPNLVERLAEALTNPA 127 >UniRef50_C1DUR1 Crispr-associated protein Cas5, hmari subtype n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DUR1_SULAA Length = 229 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. 
Identities = 28/154 (18%), Positives = 52/154 (33%), Gaps = 27/154 (17%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPT------GRFPTRSGLLGLLGACLGIQRDDTSSLQ 55 L+ + WG T P + + G++GA LGI + D + Sbjct: 1 MEVLVFDV------WGDFGHFRKFYTTTSPLTFSIPPPTAVFGIIGAILGIDKKDYLKII 54 Query: 56 ALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDAS 115 +E+ + A+ + + R T+ + E+L DA Sbjct: 55 N-AETTKVAIEIVKPVKKIRF----------TINYIDTKVSFSRIKNRTPIRTEFLKDAH 103 Query: 116 FTVALWLTPHATMVISELEKAV--LKPRYTPYLG 147 + + + L + + EL+ + K YT LG Sbjct: 104 YRLYINL--NEENLFRELKDRIKERKTYYTVSLG 135 >UniRef50_A7BQN4 Protein containing DUF522 n=1 Tax=Beggiatoa sp. PS RepID=A7BQN4_9GAMM Length = 267 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 30/132 (22%), Positives = 56/132 (42%), Gaps = 16/132 (12%) Query: 27 TGRFPTRSGLLGLLGACLGIQRDDTSSLQALS-ESVQFAVRCDELILDDRRVSVTGLRDY 85 T FP + ++GLLGA LG +++ + L ++++ +R + R ++ L Sbjct: 47 TYPFPPPTAIMGLLGAILGYSKEEY--HERLGWQTLRVGIRLLKPTQIFR-AAINLL--- 100 Query: 86 HTVLGAREDYRGLKSHETIQTWR---EYLCDASFTVALWLTPHATMVISELEKAVL--KP 140 T G +R + R E++ ++ + + P V S+L + + Sbjct: 101 QTKDGTDTFFRPKADKNSH--TRVPYEFVKSPAYRIYVMQLPD--NVASDLVAHLKTHRT 156 Query: 141 RYTPYLGRRSCP 152 YTP LG SC Sbjct: 157 VYTPVLGLASCL 168 >UniRef50_B8CYA3 CRISPR-associated protein Cas5 n=1 Tax=Halothermothrix orenii H 168 RepID=B8CYA3_HALOH Length = 238 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 26/158 (16%), Positives = 55/158 (34%), Gaps = 19/158 (12%) Query: 3 SYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQ 62 L+ ++ G + + + + T + P R+ L+G++ + L RD L + ++ + Sbjct: 5 QVLVFKIKGKIAHFKKYYSNKSSLTYKIPPRTVLMGIVASILEKPRDSYYELLSPQQA-K 63 Query: 63 FAVRCDELILDDRRVSVTGLRD-YHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALW 121 F V+ + D HT L ++ +++ + Sbjct: 64 FGVKIESESYTHFECMNYLKEDGGHT---QVRLQLLLPANNM-LSYKVF----------- 108 Query: 122 LTPHATMVISELEKAVLKPR--YTPYLGRRSCPLTHPL 157 T ++ EL + Y YLG+R T Sbjct: 109 FTHQDESLLKELATKIKNKIYGYGIYLGQRQFRATAEF 146 >UniRef50_Q8PZD2 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PZD2_METMA Length = 232 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 43/115 (37%), Gaps = 20/115 (17%) Query: 35 GLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGARED 94 + GL+GA LGI + + + + ++ ++ + +D + Sbjct: 34 AVKGLIGAVLGIDKTE---IYRNTLDLKIGIQVLSPVR----------KDMQVLKLVSMK 80 Query: 95 YRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLK--PRYTPYLG 147 + + E+L D + + + P + ELE + P +TPYLG Sbjct: 81 AEKDLFNFPV--NAEFLRDPEYRIFVSWLPDK---LDELEDRLQNQMPIFTPYLG 130 >UniRef50_A1ZVP9 Crispr-associated protein Cas5, tneap subtype n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZVP9_9SPHI Length = 218 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. 
Identities = 23/124 (18%), Positives = 35/124 (28%), Gaps = 26/124 (20%) Query: 26 PTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDY 85 PT P S +LGL+ A G + L F + D Sbjct: 26 PTLEVPPVSTVLGLINAAAGH----YVAHSTLKLGYYF-------------EYQSKAVDL 68 Query: 86 HTV-LGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTP 144 T+ ++ RE+L +A + + E+ P Y Sbjct: 69 ETIYQIGAHKGMPSNHAKSNIIRREFLFEAFLRLYVTS--------QEVADYFRSPVYPI 120 Query: 145 YLGR 148 LGR Sbjct: 121 VLGR 124 >UniRef50_A8F3P0 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Thermotoga lettingae TMO RepID=A8F3P0_THELT Length = 230 Score = 43.8 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 53/150 (35%), Gaps = 21/150 (14%) Query: 2 RSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRD--DTSSLQALSE 59 ++ + G + + + + FP R+ + GLLGA +GIQ + S + + Sbjct: 1 MKVIVFDVKGKYALFRRNYTTSSSTSYNFPPRTSICGLLGAIMGIQNEATQFSKHLRIFD 60 Query: 60 SVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVA 119 + A+R I T+ + + K+ T E + + + + Sbjct: 61 NAHIALRVLVPIRKT------------TMGVNYAETKSGKNQRTQII-LELIKEPVYRIY 107 Query: 120 LWLTPHATMVISELEKAVLKP--RYTPYLG 147 + +L + +TPYLG Sbjct: 108 VS----EFSQFDQLRNHLENNTCVFTPYLG 133 >UniRef50_A4XGC7 CRISPR-associated protein Cas5 family n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XGC7_CALS8 Length = 275 Score = 43.0 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 22/53 (41%), Gaps = 6/53 (11%) Query: 1 MRSY------LILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQ 47 M+ L +++ P + P R T P S +LGL+ LGI+ Sbjct: 1 MKETKNSIYALKIKIYQPQAHFRIPFSYQRRHTYPIPPYSTVLGLIANILGIK 53 >UniRef50_D1QSA9 CRISPR-associated protein Cas5, Tneap subtype n=1 Tax=Prevotella oris F0302 RepID=D1QSA9_9BACT Length = 224 Score = 41.1 bits (95), Expect = 0.026, Method: Composition-based stats. 
Identities = 29/155 (18%), Positives = 51/155 (32%), Gaps = 28/155 (18%) Query: 2 RSYLILRLAGPMQAWGQP-TFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSES 60 ++++ ++ P G +PT P S +LGL+ AC G + + Sbjct: 1 MKVFRIKISAWTASFRYPNIISGYQPTLEVPPLSTILGLMNACAGR----YLNHADMEIG 56 Query: 61 VQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSH-ETIQTWREYLCDASFTVA 119 F R D T+ + D K ++ R++L D + + Sbjct: 57 YYFNYRSIS-------------NDLETIYQMKYDKGTAKKQVKSNVINRQFLFDNTLYIY 103 Query: 120 LWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLT 154 + + K P + LGR SC L Sbjct: 104 IID--------KDFVKYFQTPAFQILLGR-SCDLA 129 >UniRef50_B9LWL0 CRISPR-associated protein Cas5, Hmari subtype n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LWL0_HALLT Length = 265 Score = 40.7 bits (94), Expect = 0.041, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 57/220 (25%), Gaps = 51/220 (23%) Query: 16 WGQPTFEGT---RPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELIL 72 WG G + T R P R+ + G+L A +G RD D Sbjct: 23 WGHFKRVGRSVTKQTYRIPPRTTVAGMLAAIVGSDRDSYY---------------DVFGA 67 Query: 73 DDRRVSVTGLRDYHTVLGAREDYRGLKS---HETIQTWREY------------------L 111 D +++T L D TV +T + R Y L Sbjct: 68 DTSAIAITPLFDIRTVNVPTTGLGTDPKQAVTKTAGSRRSYALTYQDTEGDRQIHAYELL 127 Query: 112 CDASF--TVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQK 169 D + VA+ + + L+ + Y P LG+ Sbjct: 128 TDPVYRIDVAVEDEAYYEKLERRLDN--EESYYPPSLGKSEYL--------CTVEDVETD 177 Query: 170 ALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFA 209 + V G + + + R A Sbjct: 178 VSPTELAEAEQYDVDSVVPGSLSEVIPQQGVTYDIERSPA 217 >UniRef50_A1HND7 CRISPR-associated protein Cas5 family n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HND7_9FIRM Length = 257 Score = 40.0 bits (92), Expect = 0.063, Method: Composition-based stats. 
Identities = 40/228 (17%), Positives = 72/228 (31%), Gaps = 23/228 (10%) Query: 3 SYLILRLAGPMQAWGQPTFEGTR-PTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESV 61 L + ++ ++ P R PT P + + G L G + + L S Sbjct: 6 DVLKIEMSAMTASFRYPHVMIGRLPTFEMPPPATIYGHLCGVWG----EWFNPGDLEFSY 61 Query: 62 QF---AVRCD-ELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFT 117 F + D EL S + T+ G ++ +G + + R++L T Sbjct: 62 VFTHGGIGEDVELGHMIEFGSGRSEK---TLGGLPKNMKGSLNPQ----RRQFLFRPRMT 114 Query: 118 VALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLT--HPLFLGTCQASDPQKALLNYE 175 + L ++ LE+A P + LGR T + QAS A Sbjct: 115 LYLK---GDDKILQRLEQAFKSPAFAYILGRSQDLATVHSITWARLKQASGAFFANTLLP 171 Query: 176 PVGGDIYS--EESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMD 221 S + + EP+ + R + G++ Sbjct: 172 WTLRQWVSVGRPVYMPKWINYHKLREPVFERYLEIDDRPLRIFGEGVE 219 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.142 0.427 Lambda K H 0.267 0.0438 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,436,281,571 Number of Sequences: 3077464 Number of extensions: 64330909 Number of successful extensions: 160290 Number of sequences better than 1.0e-01: 122 Number of HSP's better than 0.1 without gapping: 227 Number of HSP's successfully gapped in prelim test: 59 Number of HSP's that attempted gapping in prelim test: 159388 Number of HSP's gapped (non-prelim): 306 length of query: 224 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 100 effective length of database: 658,790,820 effective search space: 65879082000 effective search space used: 65879082000 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 91 (39.6 bits)
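The length and search-space figures in the statistics block above are consistent with one another, and the arithmetic can be checked with a short sketch. The function names below are hypothetical and are not part of BLAST. The formulas are the standard simplified ones (subtract the effective HSP length from the query and from every database sequence, then E = search space x 2^(-bit score)), and they ignore any clamping or composition-based adjustment the program may apply internally.

```python
# Values copied from the statistics block of this report.
QUERY_LEN = 224              # "length of query"
DB_LETTERS = 1_040_396_356   # "length of database"
DB_SEQS = 3_077_464          # "Number of Sequences"
HSP_LEN = 124                # "effective HSP length"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Simplified edge correction: the effective HSP length is subtracted
    once from the query and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Approximate E-value for a bit score: E = search_space * 2**(-bits)."""
    return search_space * 2.0 ** (-bit_score)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)
    # 54.2 bits is the score reported for UniRef50_C9RCY2 (Expect = 3e-06).
    print("E-value at 54.2 bits: %.1e" % expect_from_bits(54.2, space))
```

Under these assumptions the three computed lengths reproduce the reported 100, 658,790,820 and 65879082000, and the estimate for a 54.2-bit hit falls close to the reported Expect value of 3e-06.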