BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (94 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_P45956 Uncharacterized protein ygbF n=103 Tax=cellular ...    188   5e-47
UniRef50_A4XYT6 CRISPR-associated protein, Cas2 family n=14 Tax=...    148   5e-35
UniRef50_B8IZA9 CRISPR-associated protein Cas2 n=1 Tax=Desulfovi...    104   7e-22
UniRef50_A8LZ00 CRISPR-associated protein Cas2 n=6 Tax=Actinomyc...     82   4e-15
UniRef50_Q2JWC8 CRISPR-associated protein Cas2 n=2 Tax=Chroococc...     81   1e-14
UniRef50_Q0W581 Predicted CRISPR-associated protein n=2 Tax=cell...     77   1e-13
UniRef50_C7MTL7 CRISPR-associated protein Cas2 n=1 Tax=Saccharom...     75   6e-13
UniRef50_C2BET5 3'-5' exonuclease and CRISPR-associated Cas2 fam...     74   1e-12
UniRef50_C6SPJ4 DNA polymerase III alpha subunit n=1 Tax=Strepto...     74   1e-12
UniRef50_D1CGD7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ...     72   4e-12
UniRef50_C7QEM1 CRISPR-associated protein Cas2 n=7 Tax=Actinomyc...     72   4e-12
UniRef50_Q1J363 CRISPR-associated protein Cas2 n=1 Tax=Deinococc...     71   9e-12
UniRef50_D1CAI7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ...     71   1e-11
UniRef50_B1VIX7 CRISPR-associated protein n=2 Tax=Actinomycetale...     70   2e-11
UniRef50_Q03C57 3'-5' exonuclease and CRISPR-associated protein ...     70   3e-11
UniRef50_D2RB05 CRISPR-associated endoribonuclease Cas2 n=2 Tax=...     69   3e-11
UniRef50_B6XT67 Putative uncharacterized protein n=1 Tax=Bifidob...     69   4e-11
UniRef50_D0WFC5 DNA polymerase III, alpha chain n=1 Tax=Slackia ...     69   4e-11
UniRef50_C3PF98 CRISPR-associated protein n=3 Tax=Corynebacteriu...     69   4e-11
UniRef50_C8XAY1 CRISPR-associated protein Cas2 n=1 Tax=Nakamurel...     69   6e-11
UniRef50_C2D7T6 3'-5' exonuclease and CRISPR-associated Cas2 fam...     67   1e-10
UniRef50_C9M2Y5 Putative uncharacterized protein (Fragment) n=1 ...     67   1e-10
UniRef50_C7MTM4 CRISPR-associated protein, Cas2 family n=4 Tax=A...     66   4e-10
UniRef50_C2GEZ2 CRISPR-associated Cas2 family protein n=1 Tax=Co...     65   6e-10
UniRef50_A8M407 CRISPR-associated protein Cas2 n=4 Tax=Actinobac...     62   5e-09
UniRef50_C9M9S0 CRISPR-associated protein Cas2 n=1 Tax=Jonquetel...     61   1e-08
UniRef50_A4WZ21 Putative uncharacterized protein n=1 Tax=Rhodoba...     53   2e-06
UniRef50_B3ENI1 CRISPR-associated protein Cas2 n=11 Tax=Bacteria...     53   2e-06
UniRef50_D1Y483 CRISPR-associated protein Cas2 n=1 Tax=Pyramidob...     52   7e-06
UniRef50_C5V9N8 CRISPR-associated protein Cas2 n=2 Tax=Corynebac...     50   2e-05
UniRef50_C1DSI2 Putative uncharacterized protein n=1 Tax=Azotoba...     50   3e-05
UniRef50_A7BA60 Putative uncharacterized protein n=1 Tax=Actinom...     49   4e-05
UniRef50_B8JDP4 CRISPR-associated protein Cas2 n=6 Tax=Proteobac...     49   5e-05
UniRef50_D0MET9 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ...     49   6e-05
UniRef50_C2KP51 CRISPR-associated Cas2 family protein n=1 Tax=Mo...     48   9e-05
UniRef50_Q04QB3 Putative uncharacterized protein n=2 Tax=Leptosp...     47   2e-04
UniRef50_Q0BRF5 Putative uncharacterized protein n=1 Tax=Granuli...     46   3e-04
UniRef50_C4ZJY4 CRISPR-associated protein Cas2 n=1 Tax=Thauera s...     46   5e-04
UniRef50_Q6NEQ2 Putative uncharacterized protein n=1 Tax=Coryneb...     43   0.003

>UniRef50_P45956 Uncharacterized protein ygbF n=103 Tax=cellular organisms RepID=YGBF_ECOLI
          Length = 94

 Score = 188 bits (477), Expect = 5e-47, Method: Compositional matrix adjust.
 Identities = 94/94 (100%), Positives = 94/94 (100%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60
          MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW
Sbjct: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60

Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV 94
          ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV
Sbjct: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV 94

>UniRef50_A4XYT6 CRISPR-associated protein, Cas2 family n=14 Tax=cellular organisms RepID=A4XYT6_PSEMY
          Length = 99

 Score = 148 bits (373), Expect = 5e-35, Method: Compositional matrix adjust.
 Identities = 69/93 (74%), Positives = 80/93 (86%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60
          MS LVVVTENVPPRLRGR+AIWLLEVRAGVY+GDVS + REMIWEQ++   E+GNVVMAW
Sbjct: 1  MSFLVVVTENVPPRLRGRMAIWLLEVRAGVYIGDVSKRTREMIWEQLSQGHEDGNVVMAW 60

Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93
          A+N E+G+EFQT G NRR PV+ DGL LV+F P
Sbjct: 61 ASNHESGYEFQTLGPNRRLPVEFDGLHLVAFHP 93

>UniRef50_B8IZA9 CRISPR-associated protein Cas2 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA9_DESDA
          Length = 102

 Score = 104 bits (260), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 65/91 (71%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 MLV+VTE VP RLRG L+ WLLEVRAGVYVG+ S ++R+ +WE + +GN V+AW + Sbjct: 1 MLVIVTEAVPQRLRGYLSRWLLEVRAGVYVGNYSVRVRQKLWEVVCQQVGDGNAVLAWTS 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E+GF+F T G N R +D DGL LV++ P Sbjct: 61 CHESGFQFLTVGANCREQIDWDGLPLVAYTP 91 >UniRef50_A8LZ00 CRISPR-associated protein Cas2 n=6 Tax=Actinomycetales RepID=A8LZ00_SALAI Length = 113 Score = 82.4 bits (202), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 39/92 (42%), Positives = 58/92 (63%) Query: 2 SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 S++V+ T VP LRG L+ W++EV G++VG +SAK+R+ +W + + +G V+ Sbjct: 3 SLVVLATTAVPDHLRGALSRWMIEVTPGMFVGTLSAKVRDELWNAASSVVGDGAAVLIHP 62 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 +TE GF +T G RR PVD DGL LV+ P Sbjct: 63 DDTEQGFSLRTAGARRRRPVDFDGLTLVAMSP 94 >UniRef50_Q2JWC8 CRISPR-associated protein Cas2 n=2 Tax=Chroococcales RepID=Q2JWC8_SYNJA Length = 90 Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 38/87 (43%), Positives = 59/87 (67%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V + ENVP LRG L+ WL E++AGV+VG VSA +RE +W ++ +G+ +M ++T Sbjct: 1 MVVFILENVPASLRGDLSRWLFEIKAGVFVGRVSALVREELWARVTSKIGDGSALMVYST 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 N+E GF ++ G R VD++G+ LV Sbjct: 61 NSEQGFSARSIGDPSRQLVDIEGVLLV 87 >UniRef50_Q0W581 Predicted CRISPR-associated protein n=2 Tax=cellular organisms RepID=Q0W581_UNCMA Length = 96 Score = 77.4 bits (189), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 34/88 (38%), Positives = 55/88 (62%) Query: 2 SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 S V + E+V P LRG L W++E +AGV++G +S +R +W +I ++G MA++ Sbjct: 3 SFSVFIVESVSPSLRGELTRWMIEPKAGVFIGKLSGMVRNKLWGKIIKNIKKGGCTMAYS 62 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLV 89 N E G++ +++G RT VD +GL LV Sbjct: 63 YNNEQGYKIESYGDTTRTIVDFEGLSLV 90 >UniRef50_C7MTL7 CRISPR-associated protein Cas2 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTL7_SACVD Length = 109 Score = 75.1 bits (183), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 35/90 (38%), Positives = 56/90 (62%) Query: 2 SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 +++V+ T VP +RG L+ WL E G+YVG +SA++R+ +WEQ++ EG V Sbjct: 3 NLVVISTTAVPDYVRGSLSRWLTEPAPGLYVGSISARVRDSLWEQVSAAVGEGAAVCVHP 62 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 T+ E + +T G RR +D DGL+L++F Sbjct: 63 TDNEQRYVIKTAGERRRRVMDFDGLQLIAF 92 >UniRef50_C2BET5 3'-5' exonuclease and CRISPR-associated Cas2 family protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BET5_9FIRM Length = 295 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 33/89 (37%), Positives = 52/89 (58%), Gaps = 1/89 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ +N PP LRG L W+ E+ GVYVG+ + KIR+ +WE++ G M + Sbjct: 1 MPLTVITLKNSPPSLRGDLTKWMQEIATGVYVGNFNTKIRQELWERVVESVGSGEATMTY 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLV 89 A E G++F+T N + +D +G+ LV Sbjct: 61 AYRNEIGYKFETHNSN-KIMIDFEGIPLV 88 >UniRef50_C6SPJ4 DNA polymerase III alpha subunit n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ4_STRMN Length = 300 Score = 73.9 bits (180), Expect = 1e-12, Method: Composition-based stats. 
Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ +N PP LRG L W+ E+ GVYVG+ + K+RE +W ++ G +++ Sbjct: 1 MPLTVITVKNAPPSLRGDLTKWMQEIATGVYVGNFNTKVREQLWSRVKDSVSNGEATLSF 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 A E G+ F T R+ VD +G+ LV Sbjct: 61 AYRNEIGYCFDTMNAQRKV-VDFEGIPLVQL 90 >UniRef50_D1CGD7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D1CGD7_THET1 Length = 96 Score = 72.4 bits (176), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 41/95 (43%), Positives = 53/95 (55%), Gaps = 7/95 (7%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIA-------GLAEEGN 55 M V+V E VP +RG L WLLE R GV+VG SA +R+ +WE + G E G Sbjct: 1 MTVIVVEKVPASVRGELTRWLLEPRTGVFVGRPSALVRDKLWELVCQRIVERTGPEEMGG 60 Query: 56 VVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVS 90 VM + ++ E GFE + FG R VD +GL LV Sbjct: 61 AVMIYTSDNEQGFEMRIFGDTSRDLVDFEGLWLVK 95 >UniRef50_C7QEM1 CRISPR-associated protein Cas2 n=7 Tax=Actinomycetales RepID=C7QEM1_CATAD Length = 116 Score = 72.4 bits (176), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 35/88 (39%), Positives = 56/88 (63%), Gaps = 1/88 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V+V P LRG L WLLE+ AGV++G SA++R+++W+++ A +G ++A+ T Sbjct: 1 MTVIVLTLCPVGLRGLLTRWLLEISAGVFIGSPSARVRDLLWDEVTSHAGKGRALLAYTT 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVS 90 + E GF F+T + PVD +GL L+ Sbjct: 61 DNEQGFAFRTHD-HAWHPVDHEGLTLIH 87 >UniRef50_Q1J363 CRISPR-associated protein Cas2 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J363_DEIGD Length = 107 Score = 71.2 bits (173), Expect = 9e-12, Method: Compositional matrix adjust. 
Identities = 35/82 (42%), Positives = 48/82 (58%) Query: 9 ENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGF 68 E VP LRG L+ WL+EV+ GVYVG+ SA +R+++WE+ G + N E GF Sbjct: 4 EAVPESLRGELSRWLIEVQPGVYVGNASALVRDLLWEKAVSHTRRGRCTQVYRANNEQGF 63 Query: 69 EFQTFGLNRRTPVDLDGLRLVS 90 +T G R V LDG +LV+ Sbjct: 64 IIRTHGDPTRRVVSLDGYQLVA 85 >UniRef50_D1CAI7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D1CAI7_SPHTD Length = 92 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 36/87 (41%), Positives = 56/87 (64%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V++ E VP LRG L W+LE +AGV+VG +SA +R+ +WE+ + G ++ +++ Sbjct: 1 MVVMILERVPRSLRGELTRWMLEPKAGVFVGTMSALVRDKLWEKACASMKGGAGMLIYSS 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 NTE GF + +G R VD DGL L+ Sbjct: 61 NTEQGFVVRFWGNLGREVVDFDGLTLI 87 >UniRef50_B1VIX7 CRISPR-associated protein n=2 Tax=Actinomycetales RepID=B1VIX7_CORU7 Length = 113 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 32/86 (37%), Positives = 54/86 (62%), Gaps = 1/86 (1%) Query: 4 LVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATN 63 +V++ P LRG L WL+E+ G +VG SA+IRE++W++ L ++G ++ +++N Sbjct: 1 MVLIVTACPAGLRGDLTKWLMELAPGTFVGRPSARIRELLWDRTVELCKDGRALLVYSSN 60 Query: 64 TETGFEFQTFGLNRRTPVDLDGLRLV 89 E G EF+T + P D DGL+L+ Sbjct: 61 NEQGMEFRTHRHD-WEPTDFDGLKLM 85 >UniRef50_Q03C57 3'-5' exonuclease and CRISPR-associated protein cas2 n=3 Tax=Lactobacillus RepID=Q03C57_LACC3 Length = 301 Score = 69.7 bits (169), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 36/87 (41%), Positives = 50/87 (57%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ VPP LRG L W EV+ GVYVG SA+IR+ +WE+I G + + Sbjct: 1 MIVITLSKVPPSLRGVLTKWCQEVQTGVYVGRFSARIRDSLWERIQRDIGSGEATIVFNA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E G++F+T +R VD DGL L+ Sbjct: 61 KNELGYQFRTTRTDREV-VDYDGLPLL 86 >UniRef50_D2RB05 CRISPR-associated endoribonuclease Cas2 n=2 Tax=Gardnerella vaginalis RepID=D2RB05_GARVA Length = 359 Score = 69.3 bits (168), Expect = 3e-11, Method: Composition-based stats. Identities = 33/89 (37%), Positives = 54/89 (60%), Gaps = 1/89 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ N P LRG L W+ E+ +GVYVG+ ++++RE +W++I G V M++ Sbjct: 1 MPLTVITMTNCPLSLRGDLTKWMQEIASGVYVGNFNSRVREELWKRIEDSVGNGAVTMSF 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLV 89 ++ E G++F+T +R V DGL LV Sbjct: 61 SSRNEIGYDFKTIHSHREV-VYSDGLPLV 88 >UniRef50_B6XT67 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT67_9BIFI Length = 119 Score = 69.3 bits (168), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 34/88 (38%), Positives = 51/88 (57%), Gaps = 1/88 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+V P LRG L WLLE+ GV+VG + A++RE +WE+I L++ G +M ++ Sbjct: 1 MVVIVLTACPVGLRGDLTRWLLEISPGVFVGHLDARVREKLWERIVELSKNGRAIMVYSA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVS 90 E F+ G +P D +GL LV Sbjct: 61 RNEQHLAFKVHGAE-WSPTDCEGLELVK 87 >UniRef50_D0WFC5 DNA polymerase III, alpha chain n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC5_9ACTN Length = 291 Score = 69.3 bits (168), Expect = 4e-11, Method: Composition-based stats. 
Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ E P LRG L WL E+ GVYVG VSA++R+ +WE++ + G M ++ Sbjct: 1 MVVITLEKCPLALRGDLTKWLQEISMGVYVGQVSARVRDRLWERVCKECKSGRATMVYSV 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E +F+ P+D DGL+L+ Sbjct: 61 RNEQRHDFRIHNTTWE-PIDFDGLKLM 86 >UniRef50_C3PF98 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=C3PF98_CORA7 Length = 118 Score = 69.3 bits (168), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 33/87 (37%), Positives = 54/87 (62%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+V P LRG L+ WL+E+ GV+VG SA+IR+++WE+ L ++G ++ ++ Sbjct: 1 MIVLVVTACPAGLRGDLSKWLIELTPGVFVGRPSARIRDLLWERTVELCKDGRALLVYSA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E G EF+T + P D DG+ L+ Sbjct: 61 ANEQGLEFKTH-RHHWQPTDFDGVTLM 86 >UniRef50_C8XAY1 CRISPR-associated protein Cas2 n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XAY1_NAKMY Length = 124 Score = 68.6 bits (166), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 32/87 (36%), Positives = 56/87 (64%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V++ P LRG L WL+E+ GV+VG VS ++R+++W+++ LA++G VM + Sbjct: 1 MVVLMLTACPAGLRGHLTRWLMEIGPGVFVGRVSHRVRDLLWDRVLELAKDGRAVMVYPA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E G E++ + + P+D+DGL L+ Sbjct: 61 RNEQGLEYRVHRSSWK-PIDVDGLTLM 86 >UniRef50_C2D7T6 3'-5' exonuclease and CRISPR-associated Cas2 family protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D7T6_9ACTN Length = 337 Score = 67.4 bits (163), Expect = 1e-10, Method: Composition-based stats. 
Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 2/93 (2%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + ++ P LRG L W+ E+ GVYVG+++++IRE +W +++ G+ +++ Sbjct: 19 MPLTIITLSKCPRSLRGDLTKWMQEIDTGVYVGNLNSRIREKLWSRVSNCVGSGSATLSF 78 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E G++F T RR V LDG+ L+ F P Sbjct: 79 VAQNEIGYDFCTINSARRV-VYLDGIPLI-FTP 109 >UniRef50_C9M2Y5 Putative uncharacterized protein (Fragment) n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9M2Y5_LACHE Length = 249 Score = 67.4 bits (163), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 32/84 (38%), Positives = 50/84 (59%), Gaps = 1/84 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ VP LRG L W EV+ GVYVG SA+IR+++W++I G + + T Sbjct: 1 MIVITLTKVPQSLRGDLTKWCQEVQTGVYVGSFSARIRDLLWKRILLNIGRGEATLIYTT 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGL 86 N E G++F+T +++ V DG+ Sbjct: 61 NNELGYDFKTTRKDKQV-VQFDGI 83 >UniRef50_C7MTM4 CRISPR-associated protein, Cas2 family n=4 Tax=Actinomycetales RepID=C7MTM4_SACVD Length = 93 Score = 65.9 bits (159), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 32/84 (38%), Positives = 49/84 (58%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V+V P LRG L W++EV AGV+VG+ S ++R+ +WE +A +G ++ Sbjct: 1 MTVIVLIAAPEGLRGHLTRWMVEVHAGVFVGNPSRRVRDRLWELLATRIADGQAILVEPA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGL 86 + E G+ +T G +R P D DGL Sbjct: 61 DNEQGWAVRTAGTDRWRPTDFDGL 84 >UniRef50_C2GEZ2 CRISPR-associated Cas2 family protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEZ2_9CORY Length = 104 Score = 65.5 bits (158), Expect = 6e-10, Method: Compositional matrix adjust. 
Identities = 37/90 (41%), Positives = 48/90 (53%), Gaps = 1/90 (1%) Query: 4 LVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVM-AWAT 62 LV+ VP L G L +L EV GVYVG+VS ++R +W + A + G + M Sbjct: 3 LVITCSAVPDHLHGYLTRFLSEVDTGVYVGNVSRRVRNNLWTRCATAIKSGRLTMINRDP 62 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 E GF T G RRT +D+DGL L S L Sbjct: 63 EREQGFAVNTLGSQRRTIIDMDGLLLASTL 92 >UniRef50_A8M407 CRISPR-associated protein Cas2 n=4 Tax=Actinobacteria (class) RepID=A8M407_SALAI Length = 136 Score = 62.4 bits (150), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V++ P LRG L WLLE+ AGVYVG V+++IR +W ++ +A G ++ + Sbjct: 1 MTVIILTACPEGLRGHLTQWLLEISAGVYVGHVNSRIRHRLWAKVVDMAGPGRALLVYQQ 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E F T + PVD DG+ L+ Sbjct: 61 PGEQRLSF-TVHDHHWEPVDHDGITLM 86 >UniRef50_C9M9S0 CRISPR-associated protein Cas2 n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9S0_9BACT Length = 105 Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 39/92 (42%), Positives = 54/92 (58%), Gaps = 6/92 (6%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIW----EQIAGLAEEGNVV 57 MLV++T VP R+RG LA LLEV GVYV ++A +RE IW E ++E +V+ Sbjct: 1 MLVLITNQVPMRVRGFLAACLLEVAPGVYVHPRINAGVRERIWKIMTEWSVEFSQEASVL 60 Query: 58 MAW-ATNTETGFEFQTFGLNRRTPVDLDGLRL 88 W A + G +T G+ +RT +D DGL L Sbjct: 61 ALWPAPKSSGGINIRTIGIPQRTLIDYDGLVL 92 >UniRef50_A4WZ21 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WZ21_RHOS5 Length = 76 Score = 53.1 bits (126), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 23/47 (48%), Positives = 29/47 (61%) Query: 46 QIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 Q+ EEG+ M W T+ F+F T G NRR PVD+DGL+ VSF Sbjct: 25 QVVNHTEEGDAAMVWKAPTDQRFDFATTGRNRRMPVDVDGLKFVSFF 71 >UniRef50_B3ENI1 CRISPR-associated protein Cas2 n=11 Tax=Bacteria RepID=B3ENI1_CHLPB Length = 105 Score = 53.1 bits (126), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 24/88 (27%), Positives = 49/88 (55%), Gaps = 1/88 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQI-AGLAEEGNVVMAWA 61 M++VV ++PP +RGR+ +W +E RA V+V + + + + + + E +++ + Sbjct: 1 MVIVVANDIPPAVRGRMKLWFVEPRANVFVSGIKDSVAKKVIDYLHKHCPSESGLMVFKS 60 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLV 89 N G+E G R+ ++L G++LV Sbjct: 61 CNEAPGYEIFGHGDTRKQLIELSGMQLV 88 >UniRef50_D1Y483 CRISPR-associated protein Cas2 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y483_9BACT Length = 115 Score = 52.0 bits (123), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 7/95 (7%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVG-DVSAKIREMIWEQIAGLAEE-----G 54 M M VV+T NVP + RG LA +LE+ GVY +SA IR+ IW+ I E Sbjct: 1 MPMTVVITNNVPMKYRGFLASCMLELAPGVYSHPKMSAGIRQRIWQVIEKWYNEQQDLNS 60 Query: 55 NVVMAWATNTETGFE-FQTFGLNRRTPVDLDGLRL 88 ++++ W+ ++ G + + GL R +D DG+ L Sbjct: 61 SIMLIWSDSSRPGGQGIECLGLPARIVLDCDGVLL 95 >UniRef50_C5V9N8 CRISPR-associated protein Cas2 n=2 Tax=Corynebacterium matruchotii RepID=C5V9N8_9CORY Length = 109 Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 1/90 (1%) Query: 5 VVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNT 64 V+ +P +RG L + EV G+YVG VS + + +W +I G G+ + + + Sbjct: 4 VISCTAIPDHVRGFLTRFFSEVSTGLYVGIVSPVVLDNLWARIDGTITMGSFTLVHSCHE 63 Query: 65 -ETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E GF + G R D+DGL L S +P Sbjct: 64 REQGFNIRMTGPQSRPLFDMDGLLLTSRVP 93 >UniRef50_C1DSI2 Putative uncharacterized protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DSI2_AZOVD Length = 63 Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 21/36 (58%), Positives = 26/36 (72%) Query: 58 MAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 MAW + E+G EFQT G NRR PV+ DGL L++F P Sbjct: 1 MAWTSRHESGHEFQTQGANRRLPVEFDGLHLMAFHP 36 >UniRef50_A7BA60 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA60_9ACTO Length = 113 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%) Query: 24 LEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDL 83 +E+ GV+VG +SA++RE +W + + G VM + E G EF T+G + PVD Sbjct: 1 MEISPGVFVGTLSARVRERLWVIVTENMKTGRAVMVYRARNEQGLEFLTWG-DPWKPVDF 59 Query: 84 DGLRLV 89 DGL L+ Sbjct: 60 DGLTLM 65 >UniRef50_B8JDP4 CRISPR-associated protein Cas2 n=6 Tax=Proteobacteria RepID=B8JDP4_ANAD2 Length = 109 Score = 48.9 bits (115), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 30/73 (41%), Positives = 38/73 (52%), Gaps = 4/73 (5%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVG-DVSAKIREMIW---EQIAGLAEEGNV 56 M M V+VT +VP R RG LA LE+ GVY D++A +RE W E A +G V Sbjct: 1 MPMTVIVTRDVPDRFRGFLASVALEIAPGVYTAPDMTASVRERAWTVLEDWHQHARQGAV 60 Query: 57 VMAWATNTETGFE 69 VM W G + Sbjct: 61 VMTWPDGAAPGGQ 73 >UniRef50_D0MET9 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D0MET9_RHOM4 Length = 142 Score = 48.5 bits (114), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 31/94 (32%), Positives = 50/94 (53%), Gaps = 6/94 (6%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAE----EGN 55 M+M + VT N P R RG LA +LE+ GVYV + +RE +W+ + AE +G Sbjct: 1 MAMTIAVTRNTPGRFRGFLASCMLEIAPGVYVAPRMPRDVRERVWQVLLSWAELIPPDGG 60 Query: 56 VVMAWAT-NTETGFEFQTFGLNRRTPVDLDGLRL 88 VV+ W +G + + G ++ V+ +G+ L Sbjct: 61 VVLLWRNRKAPSGLDVRLLGWPKKELVEYEGVWL 94 >UniRef50_C2KP51 CRISPR-associated Cas2 family protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP51_9ACTO Length = 97 Score = 48.1 bits (113), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 2/86 (2%) Query: 3 MLVVVTEN-VPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 M V++T VP L G L+ +L EV GVYVG ++ ++ + +WE+ + +EG++ + + Sbjct: 1 MFVILTATAVPEHLHGYLSRFLTEVNMGVYVGKITPRVADALWERCRKVGKEGSLTLVQS 60 Query: 62 -TNTETGFEFQTFGLNRRTPVDLDGL 86 E GF + + + DGL Sbjct: 61 DVRFEQGFSVRAYSPRQHRVRCFDGL 86 >UniRef50_Q04QB3 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB3_LEPBJ Length = 116 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 21/65 (32%), Positives = 44/65 (67%), Gaps = 1/65 (1%) Query: 26 VRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDG 85 ++ GV+V ++A++R+ IW++I+ A + + +M +++N+E G+ ++ G R +D DG Sbjct: 1 MKPGVFVASINARVRDRIWKKISE-AWKSDAIMLFSSNSEQGYGIRSHGDPSREIMDFDG 59 Query: 86 LRLVS 90 L L+S Sbjct: 60 LLLMS 64 >UniRef50_Q0BRF5 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF5_GRABC Length = 98 Score = 46.2 bits (108), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 32/94 (34%), Positives = 49/94 (52%), Gaps = 5/94 (5%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVG-DVSAKIREMIWEQIAGLAEE---GNV 56 M +V+T +V R RG L +LE+ AGVY+ +S+ +RE W ++ E G + Sbjct: 2 MPATLVITRDVEARYRGYLTSIMLELSAGVYLSPQLSSAVRERTWAVLSEWHSELRRGAI 61 Query: 57 VMAWA-TNTETGFEFQTFGLNRRTPVDLDGLRLV 89 V+AW + G +T G + VD DG+ LV Sbjct: 62 VLAWPDAKSPGGMAIRTLGDAPKEIVDADGVLLV 95 >UniRef50_C4ZJY4 CRISPR-associated protein Cas2 n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY4_THASP Length = 103 Score = 45.8 bits (107), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 5/91 (5%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIA---GLAEEGNV 56 M ++V+VT +V R RG L +LEV VYV +S +R+ W +A G++ Sbjct: 1 MPLVVIVTRDVADRFRGFLKSVMLEVAPAVYVSPRMSKGVRDRTWNVLAEWHDFEPRGSI 60 Query: 57 VMAWATNTET-GFEFQTFGLNRRTPVDLDGL 86 VM W N ET G G R V++DG+ Sbjct: 61 VMVWRDNNETGGVGLAHLGEPPRELVEMDGM 91 >UniRef50_Q6NEQ2 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ2_CORDI Length = 104 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 1/90 (1%) Query: 5 VVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNT 64 V+ + P L G + +L E +YVG+VS + +W ++ ++ + M + N+ Sbjct: 4 VLYLQAAPDHLLGYVTRFLTEADTSIYVGNVSKNVASNLWIRVTEAIKDAHATMIVSDNS 63 Query: 65 -ETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E GF T G + +D DGL +++ P Sbjct: 64 REQGFSIMTTGDSTLQVLDADGLSVLASRP 93 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_D0WFC5 DNA polymerase III, alpha chain n=1 Tax=Slackia ... 137 8e-32 UniRef50_C2BET5 3'-5' exonuclease and CRISPR-associated Cas2 fam... 133 1e-30 UniRef50_D2RB05 CRISPR-associated endoribonuclease Cas2 n=2 Tax=... 133 2e-30 UniRef50_B8IZA9 CRISPR-associated protein Cas2 n=1 Tax=Desulfovi... 
131 5e-30 UniRef50_P45956 Uncharacterized protein ygbF n=103 Tax=cellular ... 131 8e-30 UniRef50_A4XYT6 CRISPR-associated protein, Cas2 family n=14 Tax=... 130 1e-29 UniRef50_C2D7T6 3'-5' exonuclease and CRISPR-associated Cas2 fam... 130 1e-29 UniRef50_C6SPJ4 DNA polymerase III alpha subunit n=1 Tax=Strepto... 129 2e-29 UniRef50_Q03C57 3'-5' exonuclease and CRISPR-associated protein ... 127 9e-29 UniRef50_B6XT67 Putative uncharacterized protein n=1 Tax=Bifidob... 127 1e-28 UniRef50_C7MTL7 CRISPR-associated protein Cas2 n=1 Tax=Saccharom... 126 3e-28 UniRef50_C7QEM1 CRISPR-associated protein Cas2 n=7 Tax=Actinomyc... 124 1e-27 UniRef50_A8LZ00 CRISPR-associated protein Cas2 n=6 Tax=Actinomyc... 123 2e-27 UniRef50_C3PF98 CRISPR-associated protein n=3 Tax=Corynebacteriu... 123 2e-27 UniRef50_C9M2Y5 Putative uncharacterized protein (Fragment) n=1 ... 122 3e-27 UniRef50_Q2JWC8 CRISPR-associated protein Cas2 n=2 Tax=Chroococc... 121 7e-27 UniRef50_Q1J363 CRISPR-associated protein Cas2 n=1 Tax=Deinococc... 121 8e-27 UniRef50_A8M407 CRISPR-associated protein Cas2 n=4 Tax=Actinobac... 121 9e-27 UniRef50_B1VIX7 CRISPR-associated protein n=2 Tax=Actinomycetale... 119 3e-26 UniRef50_Q0W581 Predicted CRISPR-associated protein n=2 Tax=cell... 119 3e-26 UniRef50_C8XAY1 CRISPR-associated protein Cas2 n=1 Tax=Nakamurel... 117 9e-26 UniRef50_C7MTM4 CRISPR-associated protein, Cas2 family n=4 Tax=A... 117 1e-25 UniRef50_D1CAI7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ... 114 8e-25 UniRef50_D1CGD7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ... 113 1e-24 UniRef50_C2GEZ2 CRISPR-associated Cas2 family protein n=1 Tax=Co... 113 1e-24 UniRef50_C5V9N8 CRISPR-associated protein Cas2 n=2 Tax=Corynebac... 105 5e-22 UniRef50_D1Y483 CRISPR-associated protein Cas2 n=1 Tax=Pyramidob... 103 2e-21 UniRef50_C9M9S0 CRISPR-associated protein Cas2 n=1 Tax=Jonquetel... 100 1e-20 UniRef50_C2KP51 CRISPR-associated Cas2 family protein n=1 Tax=Mo... 
99 4e-20 UniRef50_B3ENI1 CRISPR-associated protein Cas2 n=11 Tax=Bacteria... 93 2e-18 UniRef50_A7BA60 Putative uncharacterized protein n=1 Tax=Actinom... 92 4e-18 UniRef50_Q0BRF5 Putative uncharacterized protein n=1 Tax=Granuli... 92 5e-18 UniRef50_D0MET9 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ... 92 5e-18 UniRef50_C4ZJY4 CRISPR-associated protein Cas2 n=1 Tax=Thauera s... 91 9e-18 UniRef50_Q04QB3 Putative uncharacterized protein n=2 Tax=Leptosp... 91 1e-17 UniRef50_B8JDP4 CRISPR-associated protein Cas2 n=6 Tax=Proteobac... 86 2e-16 UniRef50_A4WZ21 Putative uncharacterized protein n=1 Tax=Rhodoba... 57 1e-07 UniRef50_C1DSI2 Putative uncharacterized protein n=1 Tax=Azotoba... 54 2e-06 Sequences not found previously or not previously below threshold: UniRef50_Q6NEQ2 Putative uncharacterized protein n=1 Tax=Coryneb... 82 6e-15 UniRef50_B6IWM0 CRISPR-associated protein Cas2, putative n=1 Tax... 64 2e-09 UniRef50_UPI00016983CD CRISPR-associated protein, Cas2 n=1 Tax=E... 64 2e-09 UniRef50_D0Y915 Putative uncharacterized protein n=1 Tax=Dehaloc... 57 1e-07 UniRef50_C0W6T7 Putative uncharacterized protein n=1 Tax=Actinom... 51 1e-05 >UniRef50_D0WFC5 DNA polymerase III, alpha chain n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC5_9ACTN Length = 291 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ E P LRG L WL E+ GVYVG VSA++R+ +WE++ + G M ++ Sbjct: 1 MVVITLEKCPLALRGDLTKWLQEISMGVYVGQVSARVRDRLWERVCKECKSGRATMVYSV 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E +F+ P+D DGL+L+ Sbjct: 61 RNEQRHDFRIHNTT-WEPIDFDGLKLM 86 >UniRef50_C2BET5 3'-5' exonuclease and CRISPR-associated Cas2 family protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BET5_9FIRM Length = 295 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. 
Identities = 33/89 (37%), Positives = 52/89 (58%), Gaps = 1/89 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ +N PP LRG L W+ E+ GVYVG+ + KIR+ +WE++ G M + Sbjct: 1 MPLTVITLKNSPPSLRGDLTKWMQEIATGVYVGNFNTKIRQELWERVVESVGSGEATMTY 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLV 89 A E G++F+T N + +D +G+ LV Sbjct: 61 AYRNEIGYKFETHNSN-KIMIDFEGIPLV 88 >UniRef50_D2RB05 CRISPR-associated endoribonuclease Cas2 n=2 Tax=Gardnerella vaginalis RepID=D2RB05_GARVA Length = 359 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 33/89 (37%), Positives = 54/89 (60%), Gaps = 1/89 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ N P LRG L W+ E+ +GVYVG+ ++++RE +W++I G V M++ Sbjct: 1 MPLTVITMTNCPLSLRGDLTKWMQEIASGVYVGNFNSRVREELWKRIEDSVGNGAVTMSF 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLV 89 ++ E G++F+T + R V DGL LV Sbjct: 61 SSRNEIGYDFKTIHSH-REVVYSDGLPLV 88 >UniRef50_B8IZA9 CRISPR-associated protein Cas2 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA9_DESDA Length = 102 Score = 131 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 49/91 (53%), Positives = 65/91 (71%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 MLV+VTE VP RLRG L+ WLLEVRAGVYVG+ S ++R+ +WE + +GN V+AW + Sbjct: 1 MLVIVTEAVPQRLRGYLSRWLLEVRAGVYVGNYSVRVRQKLWEVVCQQVGDGNAVLAWTS 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E+GF+F T G N R +D DGL LV++ P Sbjct: 61 CHESGFQFLTVGANCREQIDWDGLPLVAYTP 91 >UniRef50_P45956 Uncharacterized protein ygbF n=103 Tax=cellular organisms RepID=YGBF_ECOLI Length = 94 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. 
Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW Sbjct: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV 94 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV Sbjct: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV 94 >UniRef50_A4XYT6 CRISPR-associated protein, Cas2 family n=14 Tax=cellular organisms RepID=A4XYT6_PSEMY Length = 99 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 69/93 (74%), Positives = 80/93 (86%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 MS LVVVTENVPPRLRGR+AIWLLEVRAGVY+GDVS + REMIWEQ++ E+GNVVMAW Sbjct: 1 MSFLVVVTENVPPRLRGRMAIWLLEVRAGVYIGDVSKRTREMIWEQLSQGHEDGNVVMAW 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 A+N E+G+EFQT G NRR PV+ DGL LV+F P Sbjct: 61 ASNHESGYEFQTLGPNRRLPVEFDGLHLVAFHP 93 >UniRef50_C2D7T6 3'-5' exonuclease and CRISPR-associated Cas2 family protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D7T6_9ACTN Length = 337 Score = 130 bits (327), Expect = 1e-29, Method: Composition-based stats. Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 2/93 (2%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + ++ P LRG L W+ E+ GVYVG+++++IRE +W +++ G+ +++ Sbjct: 19 MPLTIITLSKCPRSLRGDLTKWMQEIDTGVYVGNLNSRIREKLWSRVSNCVGSGSATLSF 78 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E G++F T RR V LDG+ L+ F P Sbjct: 79 VAQNEIGYDFCTINSARR-VVYLDGIPLI-FTP 109 >UniRef50_C6SPJ4 DNA polymerase III alpha subunit n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ4_STRMN Length = 300 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. 
Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ +N PP LRG L W+ E+ GVYVG+ + K+RE +W ++ G +++ Sbjct: 1 MPLTVITVKNAPPSLRGDLTKWMQEIATGVYVGNFNTKVREQLWSRVKDSVSNGEATLSF 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 A E G+ F T R+ VD +G+ LV Sbjct: 61 AYRNEIGYCFDTMNAQRK-VVDFEGIPLVQL 90 >UniRef50_Q03C57 3'-5' exonuclease and CRISPR-associated protein cas2 n=3 Tax=Lactobacillus RepID=Q03C57_LACC3 Length = 301 Score = 127 bits (320), Expect = 9e-29, Method: Composition-based stats. Identities = 36/87 (41%), Positives = 50/87 (57%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ VPP LRG L W EV+ GVYVG SA+IR+ +WE+I G + + Sbjct: 1 MIVITLSKVPPSLRGVLTKWCQEVQTGVYVGRFSARIRDSLWERIQRDIGSGEATIVFNA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E G++F+T + R VD DGL L+ Sbjct: 61 KNELGYQFRTTRTD-REVVDYDGLPLL 86 >UniRef50_B6XT67 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT67_9BIFI Length = 119 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+V P LRG L WLLE+ GV+VG + A++RE +WE+I L++ G +M ++ Sbjct: 1 MVVIVLTACPVGLRGDLTRWLLEISPGVFVGHLDARVREKLWERIVELSKNGRAIMVYSA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E F+ G +P D +GL LV Sbjct: 61 RNEQHLAFKVHGAE-WSPTDCEGLELVKR 88 >UniRef50_C7MTL7 CRISPR-associated protein Cas2 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTL7_SACVD Length = 109 Score = 126 bits (316), Expect = 3e-28, Method: Composition-based stats. 
Identities = 36/92 (39%), Positives = 56/92 (60%), Gaps = 1/92 (1%) Query: 1 MS-MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMA 59 M ++V+ T VP +RG L+ WL E G+YVG +SA++R+ +WEQ++ EG V Sbjct: 1 MPNLVVISTTAVPDYVRGSLSRWLTEPAPGLYVGSISARVRDSLWEQVSAAVGEGAAVCV 60 Query: 60 WATNTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 T+ E + +T G RR +D DGL+L++F Sbjct: 61 HPTDNEQRYVIKTAGERRRRVMDFDGLQLIAF 92 >UniRef50_C7QEM1 CRISPR-associated protein Cas2 n=7 Tax=Actinomycetales RepID=C7QEM1_CATAD Length = 116 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 35/89 (39%), Positives = 56/89 (62%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V+V P LRG L WLLE+ AGV++G SA++R+++W+++ A +G ++A+ T Sbjct: 1 MTVIVLTLCPVGLRGLLTRWLLEISAGVFIGSPSARVRDLLWDEVTSHAGKGRALLAYTT 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 + E GF F+T + PVD +GL L+ Sbjct: 61 DNEQGFAFRTH-DHAWHPVDHEGLTLIHR 88 >UniRef50_A8LZ00 CRISPR-associated protein Cas2 n=6 Tax=Actinomycetales RepID=A8LZ00_SALAI Length = 113 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 39/92 (42%), Positives = 58/92 (63%) Query: 2 SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 S++V+ T VP LRG L+ W++EV G++VG +SAK+R+ +W + + +G V+ Sbjct: 3 SLVVLATTAVPDHLRGALSRWMIEVTPGMFVGTLSAKVRDELWNAASSVVGDGAAVLIHP 62 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 +TE GF +T G RR PVD DGL LV+ P Sbjct: 63 DDTEQGFSLRTAGARRRRPVDFDGLTLVAMSP 94 >UniRef50_C3PF98 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=C3PF98_CORA7 Length = 118 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. 
Identities = 33/87 (37%), Positives = 54/87 (62%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+V P LRG L+ WL+E+ GV+VG SA+IR+++WE+ L ++G ++ ++ Sbjct: 1 MIVLVVTACPAGLRGDLSKWLIELTPGVFVGRPSARIRDLLWERTVELCKDGRALLVYSA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E G EF+T + P D DG+ L+ Sbjct: 61 ANEQGLEFKTHR-HHWQPTDFDGVTLM 86 >UniRef50_C9M2Y5 Putative uncharacterized protein (Fragment) n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9M2Y5_LACHE Length = 249 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ VP LRG L W EV+ GVYVG SA+IR+++W++I G + + T Sbjct: 1 MIVITLTKVPQSLRGDLTKWCQEVQTGVYVGSFSARIRDLLWKRILLNIGRGEATLIYTT 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 N E G++F+T + + V DG+ ++ Sbjct: 61 NNELGYDFKTTRKD-KQVVQFDGIPVM 86 >UniRef50_Q2JWC8 CRISPR-associated protein Cas2 n=2 Tax=Chroococcales RepID=Q2JWC8_SYNJA Length = 90 Score = 121 bits (304), Expect = 7e-27, Method: Composition-based stats. Identities = 38/90 (42%), Positives = 59/90 (65%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V + ENVP LRG L+ WL E++AGV+VG VSA +RE +W ++ +G+ +M ++T Sbjct: 1 MVVFILENVPASLRGDLSRWLFEIKAGVFVGRVSALVREELWARVTSKIGDGSALMVYST 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 N+E GF ++ G R VD++G+ LV Sbjct: 61 NSEQGFSARSIGDPSRQLVDIEGVLLVKTY 90 >UniRef50_Q1J363 CRISPR-associated protein Cas2 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J363_DEIGD Length = 107 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. 
Identities = 35/85 (41%), Positives = 49/85 (57%) Query: 6 VVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTE 65 + E VP LRG L+ WL+EV+ GVYVG+ SA +R+++WE+ G + N E Sbjct: 1 MTLEAVPESLRGELSRWLIEVQPGVYVGNASALVRDLLWEKAVSHTRRGRCTQVYRANNE 60 Query: 66 TGFEFQTFGLNRRTPVDLDGLRLVS 90 GF +T G R V LDG +LV+ Sbjct: 61 QGFIIRTHGDPTRRVVSLDGYQLVA 85 >UniRef50_A8M407 CRISPR-associated protein Cas2 n=4 Tax=Actinobacteria (class) RepID=A8M407_SALAI Length = 136 Score = 121 bits (303), Expect = 9e-27, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V++ P LRG L WLLE+ AGVYVG V+++IR +W ++ +A G ++ + Sbjct: 1 MTVIILTACPEGLRGHLTQWLLEISAGVYVGHVNSRIRHRLWAKVVDMAGPGRALLVYQQ 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E F T + PVD DG+ L+ Sbjct: 61 PGEQRLSF-TVHDHHWEPVDHDGITLMRR 88 >UniRef50_B1VIX7 CRISPR-associated protein n=2 Tax=Actinomycetales RepID=B1VIX7_CORU7 Length = 113 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 32/86 (37%), Positives = 54/86 (62%), Gaps = 1/86 (1%) Query: 4 LVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATN 63 +V++ P LRG L WL+E+ G +VG SA+IRE++W++ L ++G ++ +++N Sbjct: 1 MVLIVTACPAGLRGDLTKWLMELAPGTFVGRPSARIRELLWDRTVELCKDGRALLVYSSN 60 Query: 64 TETGFEFQTFGLNRRTPVDLDGLRLV 89 E G EF+T + P D DGL+L+ Sbjct: 61 NEQGMEFRTHR-HDWEPTDFDGLKLM 85 >UniRef50_Q0W581 Predicted CRISPR-associated protein n=2 Tax=cellular organisms RepID=Q0W581_UNCMA Length = 96 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. 
Identities = 34/88 (38%), Positives = 55/88 (62%) Query: 2 SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 S V + E+V P LRG L W++E +AGV++G +S +R +W +I ++G MA++ Sbjct: 3 SFSVFIVESVSPSLRGELTRWMIEPKAGVFIGKLSGMVRNKLWGKIIKNIKKGGCTMAYS 62 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLV 89 N E G++ +++G RT VD +GL LV Sbjct: 63 YNNEQGYKIESYGDTTRTIVDFEGLSLV 90 >UniRef50_C8XAY1 CRISPR-associated protein Cas2 n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XAY1_NAKMY Length = 124 Score = 117 bits (295), Expect = 9e-26, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 54/87 (62%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V++ P LRG L WL+E+ GV+VG VS ++R+++W+++ LA++G VM + Sbjct: 1 MVVLMLTACPAGLRGHLTRWLMEIGPGVFVGRVSHRVRDLLWDRVLELAKDGRAVMVYPA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 E G E++ P+D+DGL L+ Sbjct: 61 RNEQGLEYRVHRS-SWKPIDVDGLTLM 86 >UniRef50_C7MTM4 CRISPR-associated protein, Cas2 family n=4 Tax=Actinomycetales RepID=C7MTM4_SACVD Length = 93 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 33/89 (37%), Positives = 51/89 (57%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V+V P LRG L W++EV AGV+VG+ S ++R+ +WE +A +G ++ Sbjct: 1 MTVIVLIAAPEGLRGHLTRWMVEVHAGVFVGNPSRRVRDRLWELLATRIADGQAILVEPA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 + E G+ +T G +R P D DGL L + Sbjct: 61 DNEQGWAVRTAGTDRWRPTDFDGLILSAR 89 >UniRef50_D1CAI7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D1CAI7_SPHTD Length = 92 Score = 114 bits (287), Expect = 8e-25, Method: Composition-based stats. 
Identities = 36/87 (41%), Positives = 56/87 (64%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V++ E VP LRG L W+LE +AGV+VG +SA +R+ +WE+ + G ++ +++ Sbjct: 1 MVVMILERVPRSLRGELTRWMLEPKAGVFVGTMSALVRDKLWEKACASMKGGAGMLIYSS 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 NTE GF + +G R VD DGL L+ Sbjct: 61 NTEQGFVVRFWGNLGREVVDFDGLTLI 87 >UniRef50_D1CGD7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D1CGD7_THET1 Length = 96 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 7/95 (7%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLA-------EEGN 55 M V+V E VP +RG L WLLE R GV+VG SA +R+ +WE + E G Sbjct: 1 MTVIVVEKVPASVRGELTRWLLEPRTGVFVGRPSALVRDKLWELVCQRIVERTGPEEMGG 60 Query: 56 VVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVS 90 VM + ++ E GFE + FG R VD +GL LV Sbjct: 61 AVMIYTSDNEQGFEMRIFGDTSRDLVDFEGLWLVK 95 >UniRef50_C2GEZ2 CRISPR-associated Cas2 family protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEZ2_9CORY Length = 104 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 1/91 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMA-WA 61 LV+ VP L G L +L EV GVYVG+VS ++R +W + A + G + M Sbjct: 2 FLVITCSAVPDHLHGYLTRFLSEVDTGVYVGNVSRRVRNNLWTRCATAIKSGRLTMINRD 61 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 E GF T G RRT +D+DGL L S L Sbjct: 62 PEREQGFAVNTLGSQRRTIIDMDGLLLASTL 92 >UniRef50_C5V9N8 CRISPR-associated protein Cas2 n=2 Tax=Corynebacterium matruchotii RepID=C5V9N8_9CORY Length = 109 Score = 105 bits (263), Expect = 5e-22, Method: Composition-based stats. 
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 1/92 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 V+ +P +RG L + EV G+YVG VS + + +W +I G G+ + + Sbjct: 2 FAVISCTAIPDHVRGFLTRFFSEVSTGLYVGIVSPVVLDNLWARIDGTITMGSFTLVHSC 61 Query: 63 NT-ETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 + E GF + G R D+DGL L S +P Sbjct: 62 HEREQGFNIRMTGPQSRPLFDMDGLLLTSRVP 93 >UniRef50_D1Y483 CRISPR-associated protein Cas2 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y483_9BACT Length = 115 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 7/95 (7%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAEE-----G 54 M M VV+T NVP + RG LA +LE+ GVY +SA IR+ IW+ I E Sbjct: 1 MPMTVVITNNVPMKYRGFLASCMLELAPGVYSHPKMSAGIRQRIWQVIEKWYNEQQDLNS 60 Query: 55 NVVMAWATNTETGFE-FQTFGLNRRTPVDLDGLRL 88 ++++ W+ ++ G + + GL R +D DG+ L Sbjct: 61 SIMLIWSDSSRPGGQGIECLGLPARIVLDCDGVLL 95 >UniRef50_C9M9S0 CRISPR-associated protein Cas2 n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9S0_9BACT Length = 105 Score = 100 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 6/96 (6%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIW----EQIAGLAEEGNVV 57 MLV++T VP R+RG LA LLEV GVYV ++A +RE IW E ++E +V+ Sbjct: 1 MLVLITNQVPMRVRGFLAACLLEVAPGVYVHPRINAGVRERIWKIMTEWSVEFSQEASVL 60 Query: 58 MAWA-TNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 W + G +T G+ +RT +D DGL L Sbjct: 61 ALWPAPKSSGGINIRTIGIPQRTLIDYDGLVLSKLT 96 >UniRef50_C2KP51 CRISPR-associated Cas2 family protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP51_9ACTO Length = 97 Score = 99.1 bits (246), Expect = 4e-20, Method: Composition-based stats. 
Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 +++ VP L G L+ +L EV GVYVG ++ ++ + +WE+ + +EG++ + + Sbjct: 2 FVILTATAVPEHLHGYLSRFLTEVNMGVYVGKITPRVADALWERCRKVGKEGSLTLVQSD 61 Query: 63 -NTETGFEFQTFGLNRRTPVDLDGLRL 88 E GF + + + DGL L Sbjct: 62 VRFEQGFSVRAYSPRQHRVRCFDGLWL 88 >UniRef50_B3ENI1 CRISPR-associated protein Cas2 n=11 Tax=Bacteria RepID=B3ENI1_CHLPB Length = 105 Score = 93.3 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 49/88 (55%), Gaps = 1/88 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M++VV ++PP +RGR+ +W +E RA V+V + + + + + + + +M + + Sbjct: 1 MVIVVANDIPPAVRGRMKLWFVEPRANVFVSGIKDSVAKKVIDYLHKHCPSESGLMVFKS 60 Query: 63 NTE-TGFEFQTFGLNRRTPVDLDGLRLV 89 E G+E G R+ ++L G++LV Sbjct: 61 CNEAPGYEIFGHGDTRKQLIELSGMQLV 88 >UniRef50_A7BA60 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA60_9ACTO Length = 113 Score = 92.1 bits (228), Expect = 4e-18, Method: Composition-based stats. Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%) Query: 24 LEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDL 83 +E+ GV+VG +SA++RE +W + + G VM + E G EF T+G PVD Sbjct: 1 MEISPGVFVGTLSARVRERLWVIVTENMKTGRAVMVYRARNEQGLEFLTWGDP-WKPVDF 59 Query: 84 DGLRLV 89 DGL L+ Sbjct: 60 DGLTLM 65 >UniRef50_Q0BRF5 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF5_GRABC Length = 98 Score = 92.1 bits (228), Expect = 5e-18, Method: Composition-based stats. 
Identities = 32/94 (34%), Positives = 49/94 (52%), Gaps = 5/94 (5%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAEE---GNV 56 M +V+T +V R RG L +LE+ AGVY+ +S+ +RE W ++ E G + Sbjct: 2 MPATLVITRDVEARYRGYLTSIMLELSAGVYLSPQLSSAVRERTWAVLSEWHSELRRGAI 61 Query: 57 VMAWA-TNTETGFEFQTFGLNRRTPVDLDGLRLV 89 V+AW + G +T G + VD DG+ LV Sbjct: 62 VLAWPDAKSPGGMAIRTLGDAPKEIVDADGVLLV 95 >UniRef50_D0MET9 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D0MET9_RHOM4 Length = 142 Score = 92.1 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 6/94 (6%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAE----EGN 55 M+M + VT N P R RG LA +LE+ GVYV + +RE +W+ + AE +G Sbjct: 1 MAMTIAVTRNTPGRFRGFLASCMLEIAPGVYVAPRMPRDVRERVWQVLLSWAELIPPDGG 60 Query: 56 VVMAWATNTET-GFEFQTFGLNRRTPVDLDGLRL 88 VV+ W G + + G ++ V+ +G+ L Sbjct: 61 VVLLWRNRKAPSGLDVRLLGWPKKELVEYEGVWL 94 >UniRef50_C4ZJY4 CRISPR-associated protein Cas2 n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY4_THASP Length = 103 Score = 91.4 bits (226), Expect = 9e-18, Method: Composition-based stats. Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 5/92 (5%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLA---EEGNV 56 M ++V+VT +V R RG L +LEV VYV +S +R+ W +A G++ Sbjct: 1 MPLVVIVTRDVADRFRGFLKSVMLEVAPAVYVSPRMSKGVRDRTWNVLAEWHDFEPRGSI 60 Query: 57 VMAWATNTE-TGFEFQTFGLNRRTPVDLDGLR 87 VM W N E G G R V++DG+ Sbjct: 61 VMVWRDNNETGGVGLAHLGEPPRELVEMDGMW 92 >UniRef50_Q04QB3 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB3_LEPBJ Length = 116 Score = 90.6 bits (224), Expect = 1e-17, Method: Composition-based stats. 
Identities = 21/65 (32%), Positives = 44/65 (67%), Gaps = 1/65 (1%) Query: 26 VRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDG 85 ++ GV+V ++A++R+ IW++I+ A + + +M +++N+E G+ ++ G R +D DG Sbjct: 1 MKPGVFVASINARVRDRIWKKISE-AWKSDAIMLFSSNSEQGYGIRSHGDPSREIMDFDG 59 Query: 86 LRLVS 90 L L+S Sbjct: 60 LLLMS 64 >UniRef50_B8JDP4 CRISPR-associated protein Cas2 n=6 Tax=Proteobacteria RepID=B8JDP4_ANAD2 Length = 109 Score = 86.4 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 35/93 (37%), Positives = 43/93 (46%), Gaps = 5/93 (5%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIW---EQIAGLAEEGNV 56 M M V+VT +VP R RG LA LE+ GVY ++A +RE W E A +G V Sbjct: 1 MPMTVIVTRDVPDRFRGFLASVALEIAPGVYTAPDMTASVRERAWTVLEDWHQHARQGAV 60 Query: 57 VMAWATNTETGFE-FQTFGLNRRTPVDLDGLRL 88 VM W G + G R DGL L Sbjct: 61 VMTWPDGAAPGGQRVLVLGDAPRELWVADGLVL 93 >UniRef50_Q6NEQ2 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ2_CORDI Length = 104 Score = 81.7 bits (201), Expect = 6e-15, Method: Composition-based stats. Identities = 24/92 (26%), Positives = 44/92 (47%), Gaps = 1/92 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 V+ + P L G + +L E +YVG+VS + +W ++ ++ + M + Sbjct: 2 FAVLYLQAAPDHLLGYVTRFLTEADTSIYVGNVSKNVASNLWIRVTEAIKDAHATMIVSD 61 Query: 63 NT-ETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 N+ E GF T G + +D DGL +++ P Sbjct: 62 NSREQGFSIMTTGDSTLQVLDADGLSVLASRP 93 >UniRef50_B6IWM0 CRISPR-associated protein Cas2, putative n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM0_RHOCS Length = 98 Score = 64.0 bits (155), Expect = 2e-09, Method: Composition-based stats. 
Identities = 25/87 (28%), Positives = 37/87 (42%), Gaps = 5/87 (5%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVG-DVSAKIREMIWE---QIAGLAEEGNVVM 58 M+V+ + R G L +L V GVYV D+ RE IW+ + G V+M Sbjct: 1 MIVICLSDTADRFHGFLRSVMLNVHPGVYVSMDLDKGSRERIWDILTRWWEAEPRGMVLM 60 Query: 59 AWAT-NTETGFEFQTFGLNRRTPVDLD 84 + ++ G +RT VD D Sbjct: 61 IHRDTRKSMDLDLRSLGAPKRTIVDYD 87 >UniRef50_UPI00016983CD CRISPR-associated protein, Cas2 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI00016983CD Length = 102 Score = 63.6 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 36/84 (42%), Gaps = 6/84 (7%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAE-----EG 54 M ++V+VT +V R RG LA +LEV VY+ ++ +R W+ ++ + Sbjct: 1 MPLVVIVTRDVKDRFRGFLASVMLEVAPTVYISPRMNQGVRSRTWKVLSDWHNTEPRAQR 60 Query: 55 NVVMAWATNTETGFEFQTFGLNRR 78 + N G T G R Sbjct: 61 SAWSGSDANETGGVGIATLGSPPR 84 >UniRef50_D0Y915 Putative uncharacterized protein n=1 Tax=Dehalococcoides sp. GT RepID=D0Y915_9CHLR Length = 71 Score = 57.5 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 1/52 (1%) Query: 40 REMIWEQIAGLAEEGNVVM-AWATNTETGFEFQTFGLNRRTPVDLDGLRLVS 90 R+ +WE+ +E ++ W GF + +G RT VD +GL LV Sbjct: 2 RDELWERAINKTKESGAILQIWTDQNSQGFSSRQYGERERTFVDFEGLYLVK 53 >UniRef50_A4WZ21 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WZ21_RHOS5 Length = 76 Score = 57.5 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 23/48 (47%), Positives = 29/48 (60%) Query: 45 EQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 Q+ EEG+ M W T+ F+F T G NRR PVD+DGL+ VSF Sbjct: 24 AQVVNHTEEGDAAMVWKAPTDQRFDFATTGRNRRMPVDVDGLKFVSFF 71 >UniRef50_C1DSI2 Putative uncharacterized protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DSI2_AZOVD Length = 63 Score = 54.0 bits (129), Expect = 2e-06, Method: Composition-based stats. 
Identities = 21/36 (58%), Positives = 26/36 (72%) Query: 58 MAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 MAW + E+G EFQT G NRR PV+ DGL L++F P Sbjct: 1 MAWTSRHESGHEFQTQGANRRLPVEFDGLHLMAFHP 36 >UniRef50_C0W6T7 Putative uncharacterized protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6T7_9ACTO Length = 94 Score = 50.9 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 13/50 (26%), Positives = 27/50 (54%), Gaps = 1/50 (2%) Query: 40 REMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLV 89 RE +WE + +G ++ W+ +E F + G + R P+D++G ++ Sbjct: 2 REHLWEMVQTYIGDGRALLIWSVRSEQRFAVASLG-HEREPIDIEGCTVM 50 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_D0WFC5 DNA polymerase III, alpha chain n=1 Tax=Slackia ... 138 4e-32 UniRef50_D2RB05 CRISPR-associated endoribonuclease Cas2 n=2 Tax=... 136 2e-31 UniRef50_C2BET5 3'-5' exonuclease and CRISPR-associated Cas2 fam... 134 1e-30 UniRef50_B8IZA9 CRISPR-associated protein Cas2 n=1 Tax=Desulfovi... 131 6e-30 UniRef50_C6SPJ4 DNA polymerase III alpha subunit n=1 Tax=Strepto... 131 9e-30 UniRef50_C2D7T6 3'-5' exonuclease and CRISPR-associated Cas2 fam... 131 9e-30 UniRef50_A4XYT6 CRISPR-associated protein, Cas2 family n=14 Tax=... 129 3e-29 UniRef50_B6XT67 Putative uncharacterized protein n=1 Tax=Bifidob... 129 3e-29 UniRef50_Q03C57 3'-5' exonuclease and CRISPR-associated protein ... 128 4e-29 UniRef50_P45956 Uncharacterized protein ygbF n=103 Tax=cellular ... 127 1e-28 UniRef50_C7MTL7 CRISPR-associated protein Cas2 n=1 Tax=Saccharom... 126 3e-28 UniRef50_Q2JWC8 CRISPR-associated protein Cas2 n=2 Tax=Chroococc... 125 4e-28 UniRef50_C9M2Y5 Putative uncharacterized protein (Fragment) n=1 ... 125 5e-28 UniRef50_A8LZ00 CRISPR-associated protein Cas2 n=6 Tax=Actinomyc... 125 6e-28 UniRef50_C7QEM1 CRISPR-associated protein Cas2 n=7 Tax=Actinomyc... 
124 9e-28 UniRef50_C3PF98 CRISPR-associated protein n=3 Tax=Corynebacteriu... 124 1e-27 UniRef50_A8M407 CRISPR-associated protein Cas2 n=4 Tax=Actinobac... 122 3e-27 UniRef50_Q1J363 CRISPR-associated protein Cas2 n=1 Tax=Deinococc... 121 5e-27 UniRef50_Q0W581 Predicted CRISPR-associated protein n=2 Tax=cell... 120 9e-27 UniRef50_B1VIX7 CRISPR-associated protein n=2 Tax=Actinomycetale... 120 1e-26 UniRef50_C8XAY1 CRISPR-associated protein Cas2 n=1 Tax=Nakamurel... 119 3e-26 UniRef50_C7MTM4 CRISPR-associated protein, Cas2 family n=4 Tax=A... 119 4e-26 UniRef50_C2GEZ2 CRISPR-associated Cas2 family protein n=1 Tax=Co... 117 9e-26 UniRef50_D1CAI7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ... 117 1e-25 UniRef50_D1CGD7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ... 115 4e-25 UniRef50_C5V9N8 CRISPR-associated protein Cas2 n=2 Tax=Corynebac... 108 7e-23 UniRef50_C2KP51 CRISPR-associated Cas2 family protein n=1 Tax=Mo... 105 6e-22 UniRef50_Q6NEQ2 Putative uncharacterized protein n=1 Tax=Coryneb... 104 8e-22 UniRef50_D1Y483 CRISPR-associated protein Cas2 n=1 Tax=Pyramidob... 103 1e-21 UniRef50_C9M9S0 CRISPR-associated protein Cas2 n=1 Tax=Jonquetel... 101 9e-21 UniRef50_C4ZJY4 CRISPR-associated protein Cas2 n=1 Tax=Thauera s... 100 1e-20 UniRef50_D0MET9 CRISPR-associated protein Cas2 n=2 Tax=Bacteria ... 95 7e-19 UniRef50_A7BA60 Putative uncharacterized protein n=1 Tax=Actinom... 95 7e-19 UniRef50_Q0BRF5 Putative uncharacterized protein n=1 Tax=Granuli... 94 9e-19 UniRef50_B8JDP4 CRISPR-associated protein Cas2 n=6 Tax=Proteobac... 94 1e-18 UniRef50_Q04QB3 Putative uncharacterized protein n=2 Tax=Leptosp... 91 9e-18 UniRef50_B3ENI1 CRISPR-associated protein Cas2 n=11 Tax=Bacteria... 90 2e-17 UniRef50_B6IWM0 CRISPR-associated protein Cas2, putative n=1 Tax... 83 3e-15 UniRef50_UPI00016983CD CRISPR-associated protein, Cas2 n=1 Tax=E... 79 3e-14 UniRef50_D0Y915 Putative uncharacterized protein n=1 Tax=Dehaloc... 
65 8e-10 UniRef50_C0W6T7 Putative uncharacterized protein n=1 Tax=Actinom... 59 7e-08 UniRef50_A4WZ21 Putative uncharacterized protein n=1 Tax=Rhodoba... 56 5e-07 UniRef50_C1DSI2 Putative uncharacterized protein n=1 Tax=Azotoba... 53 2e-06 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_D0WFC5 DNA polymerase III, alpha chain n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WFC5_9ACTN Length = 291 Score = 138 bits (349), Expect = 4e-32, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ E P LRG L WL E+ GVYVG VSA++R+ +WE++ + G M ++ Sbjct: 1 MVVITLEKCPLALRGDLTKWLQEISMGVYVGQVSARVRDRLWERVCKECKSGRATMVYSV 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E +F+ P+D DGL+L+ Sbjct: 61 RNEQRHDFRIHNT-TWEPIDFDGLKLMMR 88 >UniRef50_D2RB05 CRISPR-associated endoribonuclease Cas2 n=2 Tax=Gardnerella vaginalis RepID=D2RB05_GARVA Length = 359 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 33/90 (36%), Positives = 54/90 (60%), Gaps = 1/90 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ N P LRG L W+ E+ +GVYVG+ ++++RE +W++I G V M++ Sbjct: 1 MPLTVITMTNCPLSLRGDLTKWMQEIASGVYVGNFNSRVREELWKRIEDSVGNGAVTMSF 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVS 90 ++ E G++F+T + R V DGL LV Sbjct: 61 SSRNEIGYDFKTIHSH-REVVYSDGLPLVR 89 >UniRef50_C2BET5 3'-5' exonuclease and CRISPR-associated Cas2 family protein n=1 Tax=Anaerococcus lactolyticus ATCC 51172 RepID=C2BET5_9FIRM Length = 295 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. 
Identities = 33/89 (37%), Positives = 52/89 (58%), Gaps = 1/89 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ +N PP LRG L W+ E+ GVYVG+ + KIR+ +WE++ G M + Sbjct: 1 MPLTVITLKNSPPSLRGDLTKWMQEIATGVYVGNFNTKIRQELWERVVESVGSGEATMTY 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLV 89 A E G++F+T N + +D +G+ LV Sbjct: 61 AYRNEIGYKFETHNSN-KIMIDFEGIPLV 88 >UniRef50_B8IZA9 CRISPR-associated protein Cas2 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8IZA9_DESDA Length = 102 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 49/91 (53%), Positives = 65/91 (71%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 MLV+VTE VP RLRG L+ WLLEVRAGVYVG+ S ++R+ +WE + +GN V+AW + Sbjct: 1 MLVIVTEAVPQRLRGYLSRWLLEVRAGVYVGNYSVRVRQKLWEVVCQQVGDGNAVLAWTS 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E+GF+F T G N R +D DGL LV++ P Sbjct: 61 CHESGFQFLTVGANCREQIDWDGLPLVAYTP 91 >UniRef50_C6SPJ4 DNA polymerase III alpha subunit n=1 Tax=Streptococcus mutans NN2025 RepID=C6SPJ4_STRMN Length = 300 Score = 131 bits (329), Expect = 9e-30, Method: Composition-based stats. Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + V+ +N PP LRG L W+ E+ GVYVG+ + K+RE +W ++ G +++ Sbjct: 1 MPLTVITVKNAPPSLRGDLTKWMQEIATGVYVGNFNTKVREQLWSRVKDSVSNGEATLSF 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 A E G+ F T +R VD +G+ LV Sbjct: 61 AYRNEIGYCFDTMNA-QRKVVDFEGIPLVQL 90 >UniRef50_C2D7T6 3'-5' exonuclease and CRISPR-associated Cas2 family protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D7T6_9ACTN Length = 337 Score = 131 bits (329), Expect = 9e-30, Method: Composition-based stats. 
Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 2/93 (2%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 M + ++ P LRG L W+ E+ GVYVG+++++IRE +W +++ G+ +++ Sbjct: 19 MPLTIITLSKCPRSLRGDLTKWMQEIDTGVYVGNLNSRIREKLWSRVSNCVGSGSATLSF 78 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 E G++F T RR V LDG+ L+ F P Sbjct: 79 VAQNEIGYDFCTINSARR-VVYLDGIPLI-FTP 109 >UniRef50_A4XYT6 CRISPR-associated protein, Cas2 family n=14 Tax=cellular organisms RepID=A4XYT6_PSEMY Length = 99 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 69/93 (74%), Positives = 80/93 (86%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 MS LVVVTENVPPRLRGR+AIWLLEVRAGVY+GDVS + REMIWEQ++ E+GNVVMAW Sbjct: 1 MSFLVVVTENVPPRLRGRMAIWLLEVRAGVYIGDVSKRTREMIWEQLSQGHEDGNVVMAW 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 A+N E+G+EFQT G NRR PV+ DGL LV+F P Sbjct: 61 ASNHESGYEFQTLGPNRRLPVEFDGLHLVAFHP 93 >UniRef50_B6XT67 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XT67_9BIFI Length = 119 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+V P LRG L WLLE+ GV+VG + A++RE +WE+I L++ G +M ++ Sbjct: 1 MVVIVLTACPVGLRGDLTRWLLEISPGVFVGHLDARVREKLWERIVELSKNGRAIMVYSA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E F+ G +P D +GL LV Sbjct: 61 RNEQHLAFKVHGA-EWSPTDCEGLELVKR 88 >UniRef50_Q03C57 3'-5' exonuclease and CRISPR-associated protein cas2 n=3 Tax=Lactobacillus RepID=Q03C57_LACC3 Length = 301 Score = 128 bits (323), Expect = 4e-29, Method: Composition-based stats. 
Identities = 36/89 (40%), Positives = 50/89 (56%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ VPP LRG L W EV+ GVYVG SA+IR+ +WE+I G + + Sbjct: 1 MIVITLSKVPPSLRGVLTKWCQEVQTGVYVGRFSARIRDSLWERIQRDIGSGEATIVFNA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E G++F+T + R VD DGL L+ Sbjct: 61 KNELGYQFRTTRTD-REVVDYDGLPLLLR 88 >UniRef50_P45956 Uncharacterized protein ygbF n=103 Tax=cellular organisms RepID=YGBF_ECOLI Length = 94 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW Sbjct: 1 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAW 60 Query: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV 94 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV Sbjct: 61 ATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV 94 >UniRef50_C7MTL7 CRISPR-associated protein Cas2 n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MTL7_SACVD Length = 109 Score = 126 bits (316), Expect = 3e-28, Method: Composition-based stats. Identities = 36/93 (38%), Positives = 56/93 (60%), Gaps = 1/93 (1%) Query: 1 MS-MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMA 59 M ++V+ T VP +RG L+ WL E G+YVG +SA++R+ +WEQ++ EG V Sbjct: 1 MPNLVVISTTAVPDYVRGSLSRWLTEPAPGLYVGSISARVRDSLWEQVSAAVGEGAAVCV 60 Query: 60 WATNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 T+ E + +T G RR +D DGL+L++F Sbjct: 61 HPTDNEQRYVIKTAGERRRRVMDFDGLQLIAFR 93 >UniRef50_Q2JWC8 CRISPR-associated protein Cas2 n=2 Tax=Chroococcales RepID=Q2JWC8_SYNJA Length = 90 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. 
Identities = 38/90 (42%), Positives = 59/90 (65%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V + ENVP LRG L+ WL E++AGV+VG VSA +RE +W ++ +G+ +M ++T Sbjct: 1 MVVFILENVPASLRGDLSRWLFEIKAGVFVGRVSALVREELWARVTSKIGDGSALMVYST 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92 N+E GF ++ G R VD++G+ LV Sbjct: 61 NSEQGFSARSIGDPSRQLVDIEGVLLVKTY 90 >UniRef50_C9M2Y5 Putative uncharacterized protein (Fragment) n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9M2Y5_LACHE Length = 249 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 1/87 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+ VP LRG L W EV+ GVYVG SA+IR+++W++I G + + T Sbjct: 1 MIVITLTKVPQSLRGDLTKWCQEVQTGVYVGSFSARIRDLLWKRILLNIGRGEATLIYTT 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLV 89 N E G++F+T + + V DG+ ++ Sbjct: 61 NNELGYDFKTTRKD-KQVVQFDGIPVM 86 >UniRef50_A8LZ00 CRISPR-associated protein Cas2 n=6 Tax=Actinomycetales RepID=A8LZ00_SALAI Length = 113 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. Identities = 39/92 (42%), Positives = 58/92 (63%) Query: 2 SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61 S++V+ T VP LRG L+ W++EV G++VG +SAK+R+ +W + + +G V+ Sbjct: 3 SLVVLATTAVPDHLRGALSRWMIEVTPGMFVGTLSAKVRDELWNAASSVVGDGAAVLIHP 62 Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93 +TE GF +T G RR PVD DGL LV+ P Sbjct: 63 DDTEQGFSLRTAGARRRRPVDFDGLTLVAMSP 94 >UniRef50_C7QEM1 CRISPR-associated protein Cas2 n=7 Tax=Actinomycetales RepID=C7QEM1_CATAD Length = 116 Score = 124 bits (312), Expect = 9e-28, Method: Composition-based stats. 
Identities = 35/89 (39%), Positives = 56/89 (62%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V+V P LRG L WLLE+ AGV++G SA++R+++W+++ A +G ++A+ T Sbjct: 1 MTVIVLTLCPVGLRGLLTRWLLEISAGVFIGSPSARVRDLLWDEVTSHAGKGRALLAYTT 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 + E GF F+T + PVD +GL L+ Sbjct: 61 DNEQGFAFRTH-DHAWHPVDHEGLTLIHR 88 >UniRef50_C3PF98 CRISPR-associated protein n=3 Tax=Corynebacterium RepID=C3PF98_CORA7 Length = 118 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 33/89 (37%), Positives = 54/89 (60%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M+V+V P LRG L+ WL+E+ GV+VG SA+IR+++WE+ L ++G ++ ++ Sbjct: 1 MIVLVVTACPAGLRGDLSKWLIELTPGVFVGRPSARIRDLLWERTVELCKDGRALLVYSA 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E G EF+T + P D DG+ L+ Sbjct: 61 ANEQGLEFKTHR-HHWQPTDFDGVTLMVR 88 >UniRef50_A8M407 CRISPR-associated protein Cas2 n=4 Tax=Actinobacteria (class) RepID=A8M407_SALAI Length = 136 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 1/89 (1%) Query: 3 MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62 M V++ P LRG L WLLE+ AGVYVG V+++IR +W ++ +A G ++ + Sbjct: 1 MTVIILTACPEGLRGHLTQWLLEISAGVYVGHVNSRIRHRLWAKVVDMAGPGRALLVYQQ 60 Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91 E F T + PVD DG+ L+ Sbjct: 61 PGEQRLSF-TVHDHHWEPVDHDGITLMRR 88 >UniRef50_Q1J363 CRISPR-associated protein Cas2 n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J363_DEIGD Length = 107 Score = 121 bits (305), Expect = 5e-27, Method: Composition-based stats. 
 Identities = 35/85 (41%), Positives = 49/85 (57%)

Query: 6  VVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTE 65
          + E VP LRG L+ WL+EV+ GVYVG+ SA +R+++WE+ G + N E
Sbjct: 1  MTLEAVPESLRGELSRWLIEVQPGVYVGNASALVRDLLWEKAVSHTRRGRCTQVYRANNE 60

Query: 66 TGFEFQTFGLNRRTPVDLDGLRLVS 90
          GF +T G R V LDG +LV+
Sbjct: 61 QGFIIRTHGDPTRRVVSLDGYQLVA 85

>UniRef50_Q0W581 Predicted CRISPR-associated protein n=2 Tax=cellular organisms RepID=Q0W581_UNCMA
          Length = 96

 Score = 120 bits (303), Expect = 9e-27, Method: Composition-based stats.
 Identities = 34/88 (38%), Positives = 55/88 (62%)

Query: 2  SMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWA 61
          S V + E+V P LRG L W++E +AGV++G +S +R +W +I ++G MA++
Sbjct: 3  SFSVFIVESVSPSLRGELTRWMIEPKAGVFIGKLSGMVRNKLWGKIIKNIKKGGCTMAYS 62

Query: 62 TNTETGFEFQTFGLNRRTPVDLDGLRLV 89
          N E G++ +++G RT VD +GL LV
Sbjct: 63 YNNEQGYKIESYGDTTRTIVDFEGLSLV 90

>UniRef50_B1VIX7 CRISPR-associated protein n=2 Tax=Actinomycetales RepID=B1VIX7_CORU7
          Length = 113

 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats.
 Identities = 32/88 (36%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4  LVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATN 63
          +V++ P LRG L WL+E+ G +VG SA+IRE++W++ L ++G ++ +++N
Sbjct: 1  MVLIVTACPAGLRGDLTKWLMELAPGTFVGRPSARIRELLWDRTVELCKDGRALLVYSSN 60

Query: 64 TETGFEFQTFGLNRRTPVDLDGLRLVSF 91
          E G EF+T + P D DGL+L+
Sbjct: 61 NEQGMEFRTHR-HDWEPTDFDGLKLMMR 87

>UniRef50_C8XAY1 CRISPR-associated protein Cas2 n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XAY1_NAKMY
          Length = 124

 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          M+V++ P LRG L WL+E+ GV+VG VS ++R+++W+++ LA++G VM +
Sbjct: 1  MVVLMLTACPAGLRGHLTRWLMEIGPGVFVGRVSHRVRDLLWDRVLELAKDGRAVMVYPA 60

Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91
          E G E++ P+D+DGL L+
Sbjct: 61 RNEQGLEYRVHRS-SWKPIDVDGLTLMLR 88

>UniRef50_C7MTM4 CRISPR-associated protein, Cas2 family n=4 Tax=Actinomycetales RepID=C7MTM4_SACVD
          Length = 93

 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats.
 Identities = 33/89 (37%), Positives = 51/89 (57%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          M V+V P LRG L W++EV AGV+VG+ S ++R+ +WE +A +G ++
Sbjct: 1  MTVIVLIAAPEGLRGHLTRWMVEVHAGVFVGNPSRRVRDRLWELLATRIADGQAILVEPA 60

Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVSF 91
          + E G+ +T G +R P D DGL L +
Sbjct: 61 DNEQGWAVRTAGTDRWRPTDFDGLILSAR 89

>UniRef50_C2GEZ2 CRISPR-associated Cas2 family protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GEZ2_9CORY
          Length = 104

 Score = 117 bits (295), Expect = 9e-26, Method: Composition-based stats.
 Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 1/91 (1%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          LV+ VP L G L +L EV GVYVG+VS ++R +W + A + G + M
Sbjct: 2  FLVITCSAVPDHLHGYLTRFLSEVDTGVYVGNVSRRVRNNLWTRCATAIKSGRLTMINRD 61

Query: 63 NT-ETGFEFQTFGLNRRTPVDLDGLRLVSFL 92
          E GF T G RRT +D+DGL L S L
Sbjct: 62 PEREQGFAVNTLGSQRRTIIDMDGLLLASTL 92

>UniRef50_D1CAI7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D1CAI7_SPHTD
          Length = 92

 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats.
 Identities = 36/88 (40%), Positives = 56/88 (63%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          M+V++ E VP LRG L W+LE +AGV+VG +SA +R+ +WE+ + G ++ +++
Sbjct: 1  MVVMILERVPRSLRGELTRWMLEPKAGVFVGTMSALVRDKLWEKACASMKGGAGMLIYSS 60

Query: 63 NTETGFEFQTFGLNRRTPVDLDGLRLVS 90
          NTE GF + +G R VD DGL L+
Sbjct: 61 NTEQGFVVRFWGNLGREVVDFDGLTLIR 88

>UniRef50_D1CGD7 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D1CGD7_THET1
          Length = 96

 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats.
 Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 7/95 (7%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLA-------EEGN 55
          M V+V E VP +RG L WLLE R GV+VG SA +R+ +WE + E G
Sbjct: 1  MTVIVVEKVPASVRGELTRWLLEPRTGVFVGRPSALVRDKLWELVCQRIVERTGPEEMGG 60

Query: 56 VVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVS 90
          VM + ++ E GFE + FG R VD +GL LV
Sbjct: 61 AVMIYTSDNEQGFEMRIFGDTSRDLVDFEGLWLVK 95

>UniRef50_C5V9N8 CRISPR-associated protein Cas2 n=2 Tax=Corynebacterium matruchotii RepID=C5V9N8_9CORY
          Length = 109

 Score = 108 bits (270), Expect = 7e-23, Method: Composition-based stats.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 1/92 (1%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          V+ +P +RG L + EV G+YVG VS + + +W +I G G+ + +
Sbjct: 2  FAVISCTAIPDHVRGFLTRFFSEVSTGLYVGIVSPVVLDNLWARIDGTITMGSFTLVHSC 61

Query: 63 NT-ETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93
          + E GF + G R D+DGL L S +P
Sbjct: 62 HEREQGFNIRMTGPQSRPLFDMDGLLLTSRVP 93

>UniRef50_C2KP51 CRISPR-associated Cas2 family protein n=1 Tax=Mobiluncus mulieris ATCC 35243 RepID=C2KP51_9ACTO
          Length = 97

 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats.
 Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          +++ VP L G L+ +L EV GVYVG ++ ++ + +WE+ + +EG++ + +
Sbjct: 2  FVILTATAVPEHLHGYLSRFLTEVNMGVYVGKITPRVADALWERCRKVGKEGSLTLVQSD 61

Query: 63 -NTETGFEFQTFGLNRRTPVDLDGLRL 88
          E GF + + + DGL L
Sbjct: 62 VRFEQGFSVRAYSPRQHRVRCFDGLWL 88

>UniRef50_Q6NEQ2 Putative uncharacterized protein n=1 Tax=Corynebacterium diphtheriae RepID=Q6NEQ2_CORDI
          Length = 104

 Score = 104 bits (261), Expect = 8e-22, Method: Composition-based stats.
 Identities = 24/92 (26%), Positives = 44/92 (47%), Gaps = 1/92 (1%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          V+ + P L G + +L E +YVG+VS + +W ++ ++ + M +
Sbjct: 2  FAVLYLQAAPDHLLGYVTRFLTEADTSIYVGNVSKNVASNLWIRVTEAIKDAHATMIVSD 61

Query: 63 NT-ETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93
          N+ E GF T G + +D DGL +++ P
Sbjct: 62 NSREQGFSIMTTGDSTLQVLDADGLSVLASRP 93

>UniRef50_D1Y483 CRISPR-associated protein Cas2 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y483_9BACT
          Length = 115

 Score = 103 bits (258), Expect = 1e-21, Method: Composition-based stats.
 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 7/99 (7%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAEE-----G 54
          M M VV+T NVP + RG LA +LE+ GVY +SA IR+ IW+ I E
Sbjct: 1  MPMTVVITNNVPMKYRGFLASCMLELAPGVYSHPKMSAGIRQRIWQVIEKWYNEQQDLNS 60

Query: 55 NVVMAWATNTETGFE-FQTFGLNRRTPVDLDGLRLVSFL 92
          ++++ W+ ++ G + + GL R +D DG+ L
Sbjct: 61 SIMLIWSDSSRPGGQGIECLGLPARIVLDCDGVLLSRLS 99

>UniRef50_C9M9S0 CRISPR-associated protein Cas2 n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9S0_9BACT
          Length = 105

 Score = 101 bits (252), Expect = 9e-21, Method: Composition-based stats.
 Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 6/96 (6%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIW----EQIAGLAEEGNVV 57
          MLV++T VP R+RG LA LLEV GVYV ++A +RE IW E ++E +V+
Sbjct: 1  MLVLITNQVPMRVRGFLAACLLEVAPGVYVHPRINAGVRERIWKIMTEWSVEFSQEASVL 60

Query: 58 MAWAT-NTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92
          W + G +T G+ +RT +D DGL L
Sbjct: 61 ALWPAPKSSGGINIRTIGIPQRTLIDYDGLVLSKLT 96

>UniRef50_C4ZJY4 CRISPR-associated protein Cas2 n=1 Tax=Thauera sp. MZ1T RepID=C4ZJY4_THASP
          Length = 103

 Score = 100 bits (251), Expect = 1e-20, Method: Composition-based stats.
 Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 5/97 (5%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLA---EEGNV 56
          M ++V+VT +V R RG L +LEV VYV +S +R+ W +A G++
Sbjct: 1  MPLVVIVTRDVADRFRGFLKSVMLEVAPAVYVSPRMSKGVRDRTWNVLAEWHDFEPRGSI 60

Query: 57 VMAWATNTE-TGFEFQTFGLNRRTPVDLDGLRLVSFL 92
          VM W N E G G R V++DG+ +
Sbjct: 61 VMVWRDNNETGGVGLAHLGEPPRELVEMDGMWVARLR 97

>UniRef50_D0MET9 CRISPR-associated protein Cas2 n=2 Tax=Bacteria RepID=D0MET9_RHOM4
          Length = 142

 Score = 94.8 bits (235), Expect = 7e-19, Method: Composition-based stats.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLA----EEGN 55
          M+M + VT N P R RG LA +LE+ GVYV + +RE +W+ + A +G
Sbjct: 1  MAMTIAVTRNTPGRFRGFLASCMLEIAPGVYVAPRMPRDVRERVWQVLLSWAELIPPDGG 60

Query: 56 VVMAWATNTET-GFEFQTFGLNRRTPVDLDGLRL 88
          VV+ W G + + G ++ V+ +G+ L
Sbjct: 61 VVLLWRNRKAPSGLDVRLLGWPKKELVEYEGVWL 94

>UniRef50_A7BA60 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BA60_9ACTO
          Length = 113

 Score = 94.8 bits (235), Expect = 7e-19, Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 24 LEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDL 83
          +E+ GV+VG +SA++RE +W + + G VM + E G EF T+G PVD
Sbjct: 1  MEISPGVFVGTLSARVRERLWVIVTENMKTGRAVMVYRARNEQGLEFLTWGDP-WKPVDF 59

Query: 84 DGLRLVSF 91
          DGL L+
Sbjct: 60 DGLTLMMR 67

>UniRef50_Q0BRF5 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BRF5_GRABC
          Length = 98

 Score = 94.4 bits (234), Expect = 9e-19, Method: Composition-based stats.
 Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAEE---GNV 56
          M +V+T +V R RG L +LE+ AGVY+ +S+ +RE W ++ E G +
Sbjct: 2  MPATLVITRDVEARYRGYLTSIMLELSAGVYLSPQLSSAVRERTWAVLSEWHSELRRGAI 61

Query: 57 VMAWAT-NTETGFEFQTFGLNRRTPVDLDGLRLVS 90
          V+AW + G +T G + VD DG+ LV
Sbjct: 62 VLAWPDAKSPGGMAIRTLGDAPKEIVDADGVLLVR 96

>UniRef50_B8JDP4 CRISPR-associated protein Cas2 n=6 Tax=Proteobacteria RepID=B8JDP4_ANAD2
          Length = 109

 Score = 94.0 bits (233), Expect = 1e-18, Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 43/96 (44%), Gaps = 5/96 (5%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIW---EQIAGLAEEGNV 56
          M M V+VT +VP R RG LA LE+ GVY ++A +RE W E A +G V
Sbjct: 1  MPMTVIVTRDVPDRFRGFLASVALEIAPGVYTAPDMTASVRERAWTVLEDWHQHARQGAV 60

Query: 57 VMAWATNTETGFE-FQTFGLNRRTPVDLDGLRLVSF 91
          VM W G + G R DGL L
Sbjct: 61 VMTWPDGAAPGGQRVLVLGDAPRELWVADGLVLARR 96

>UniRef50_Q04QB3 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB3_LEPBJ
          Length = 116

 Score = 91.3 bits (226), Expect = 9e-18, Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 44/65 (67%), Gaps = 1/65 (1%)

Query: 26 VRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDG 85
          ++ GV+V ++A++R+ IW++I+ A + + +M +++N+E G+ ++ G R +D DG
Sbjct: 1  MKPGVFVASINARVRDRIWKKISE-AWKSDAIMLFSSNSEQGYGIRSHGDPSREIMDFDG 59

Query: 86 LRLVS 90
          L L+S
Sbjct: 60 LLLMS 64

>UniRef50_B3ENI1 CRISPR-associated protein Cas2 n=11 Tax=Bacteria RepID=B3ENI1_CHLPB
          Length = 105

 Score = 89.8 bits (222), Expect = 2e-17, Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 49/88 (55%), Gaps = 1/88 (1%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWAT 62
          M++VV ++PP +RGR+ +W +E RA V+V + + + + + + + +M + +
Sbjct: 1  MVIVVANDIPPAVRGRMKLWFVEPRANVFVSGIKDSVAKKVIDYLHKHCPSESGLMVFKS 60

Query: 63 NTE-TGFEFQTFGLNRRTPVDLDGLRLV 89
          E G+E G R+ ++L G++LV
Sbjct: 61 CNEAPGYEIFGHGDTRKQLIELSGMQLV 88

>UniRef50_B6IWM0 CRISPR-associated protein Cas2, putative n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM0_RHOCS
          Length = 98

 Score = 82.9 bits (204), Expect = 3e-15, Method: Composition-based stats.
 Identities = 25/87 (28%), Positives = 37/87 (42%), Gaps = 5/87 (5%)

Query: 3  MLVVVTENVPPRLRGRLAIWLLEVRAGVYVG-DVSAKIREMIWE---QIAGLAEEGNVVM 58
          M+V+ + R G L +L V GVYV D+ RE IW+ + G V+M
Sbjct: 1  MIVICLSDTADRFHGFLRSVMLNVHPGVYVSMDLDKGSRERIWDILTRWWEAEPRGMVLM 60

Query: 59 AWAT-NTETGFEFQTFGLNRRTPVDLD 84
          + ++ G +RT VD D
Sbjct: 61 IHRDTRKSMDLDLRSLGAPKRTIVDYD 87

>UniRef50_UPI00016983CD CRISPR-associated protein, Cas2 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI00016983CD
          Length = 102

 Score = 79.4 bits (195), Expect = 3e-14, Method: Composition-based stats.
 Identities = 22/84 (26%), Positives = 36/84 (42%), Gaps = 6/84 (7%)

Query: 1  MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGD-VSAKIREMIWEQIAGLAE-----EG 54
          M ++V+VT +V R RG LA +LEV VY+ ++ +R W+ ++ +
Sbjct: 1  MPLVVIVTRDVKDRFRGFLASVMLEVAPTVYISPRMNQGVRSRTWKVLSDWHNTEPRAQR 60

Query: 55 NVVMAWATNTETGFEFQTFGLNRR 78
          + N G T G R
Sbjct: 61 SAWSGSDANETGGVGIATLGSPPR 84

>UniRef50_D0Y915 Putative uncharacterized protein n=1 Tax=Dehalococcoides sp. GT RepID=D0Y915_9CHLR
          Length = 71

 Score = 64.8 bits (157), Expect = 8e-10, Method: Composition-based stats.
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 40 REMIWEQIAGLAEEGNVVM-AWATNTETGFEFQTFGLNRRTPVDLDGLRLVS 90
          R+ +WE+ +E ++ W GF + +G RT VD +GL LV
Sbjct: 2  RDELWERAINKTKESGAILQIWTDQNSQGFSSRQYGERERTFVDFEGLYLVK 53

>UniRef50_C0W6T7 Putative uncharacterized protein n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W6T7_9ACTO
          Length = 94

 Score = 58.6 bits (141), Expect = 7e-08, Method: Composition-based stats.
 Identities = 13/53 (24%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 40 REMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92
          RE +WE + +G ++ W+ +E F + G + R P+D++G ++
Sbjct: 2  REHLWEMVQTYIGDGRALLIWSVRSEQRFAVASLG-HEREPIDIEGCTVMRSS 53

>UniRef50_A4WZ21 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WZ21_RHOS5
          Length = 76

 Score = 55.5 bits (133), Expect = 5e-07, Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 29/48 (60%)

Query: 45 EQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFL 92
          Q+ EEG+ M W T+ F+F T G NRR PVD+DGL+ VSF
Sbjct: 24 AQVVNHTEEGDAAMVWKAPTDQRFDFATTGRNRRMPVDVDGLKFVSFF 71

>UniRef50_C1DSI2 Putative uncharacterized protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DSI2_AZOVD
          Length = 63

 Score = 53.2 bits (127), Expect = 2e-06, Method: Composition-based stats.
 Identities = 21/36 (58%), Positives = 26/36 (72%)

Query: 58 MAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLP 93
          MAW +  E+G EFQT G NRR PV+ DGL L++F P
Sbjct: 1  MAWTSRHESGHEFQTQGANRRLPVEFDGLHLMAFHP 36

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.314    0.168    0.532

Lambda     K      H
   0.267   0.0512    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 739,436,754
Number of Sequences: 3077464
Number of extensions: 32081046
Number of successful extensions: 66697
Number of sequences better than 1.0e-01: 43
Number of HSP's better than 0.1 without gapping: 116
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 66495
Number of HSP's gapped (non-prelim): 125
length of query: 94
length of database: 1,040,396,356
effective HSP length: 63
effective length of query: 31
effective length of database: 846,516,124
effective search space: 26241999844
effective search space used: 26241999844
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 88 (38.2 bits)
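
The footer figures above can be cross-checked with a short script. This is a minimal sketch, not part of the BLAST output: it assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, expect value E = search space * 2^-S'), and the function names are hypothetical. Pairing S1 with the first Lambda/K row and S2 with the second row is an inference from the printed numbers, not something the report states. Because Lambda and K are printed rounded, and composition-based statistics rescale scores per subject, the results only approximate the reported values.

```python
import math

# Values copied from the report footer.
LAMBDA_1, K_1 = 0.314, 0.168    # first Lambda/K row (assumed: ungapped)
LAMBDA_2, K_2 = 0.267, 0.0512   # second Lambda/K row (assumed: gapped)
QUERY_LEN = 94
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
EFFECTIVE_HSP_LEN = 63


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Subtract the effective HSP length from the query and from every
    database sequence, then multiply the two effective lengths."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFFECTIVE_HSP_LEN
)
s1_bits = bit_score(41, LAMBDA_1, K_1)   # footer prints S1: 41 (21.2 bits)
s2_bits = bit_score(88, LAMBDA_2, K_2)   # footer prints S2: 88 (38.2 bits)
last_hit_e = expect_value(53.2, space)   # last hit prints Expect = 2e-06

print(eff_query, eff_db, space)
print(f"S1 = {s1_bits:.2f} bits, S2 = {s2_bits:.2f} bits")
print(f"E(53.2 bits) = {last_hit_e:.1e}")
```

The effective lengths and search space are integer arithmetic and reproduce the footer exactly; the bit scores and the expect value agree with the printed figures only to within rounding.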