BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (160 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76632 Uncharacterized protein ygcK n=11 Tax=Enterobact... 331 6e-90 UniRef50_B9PF41 Predicted protein n=3 Tax=cellular organisms Rep... 118 6e-26 UniRef50_C5SD50 CRISPR-associated protein, Cse2 family n=1 Tax=A... 74 2e-12 UniRef50_A1SV71 CRISPR-associated protein, Cse2 family n=2 Tax=G... 57 2e-07 UniRef50_UPI000169A1F0 hypothetical protein Epers_00055 n=1 Tax=... 44 0.002 >UniRef50_P76632 Uncharacterized protein ygcK n=11 Tax=Enterobacteriaceae RepID=YGCK_ECOLI Length = 160 Score = 331 bits (848), Expect = 6e-90, Method: Compositional matrix adjust. Identities = 160/160 (100%), Positives = 160/160 (100%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR Sbjct: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL Sbjct: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 Query: 121 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA 160 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA Sbjct: 121 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA 160 >UniRef50_B9PF41 Predicted protein n=3 Tax=cellular organisms RepID=B9PF41_POPTR Length = 155 Score = 118 bits (296), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 69/164 (42%), Positives = 97/164 (59%), Gaps = 18/164 (10%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 M E D M LY+AW+ L NGS A++RR ++PD+L ++PAFYRL G E ++A R Sbjct: 1 MTKENDFMELYQAWKSLPNGSKAELRRCTKPDDLLEVPAFYRLFGGRG-EKEWQKKAYQR 59 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGR-----INERRIFQLIRADRTADMV 115 ++FCL I H ++K ISLG ALA + ++E R+ Q+IR DM+ Sbjct: 60 LIFCLPC----IEHTEQK------ISLGAALAGGRKGERPAVSESRMIQVIRNQTPNDMI 109 Query: 116 QLRRLLTHAEPVLDWPLMARMLTWWGKRERQQ--LLEDFVLTTN 157 QLRR+L EP + WPLMA+ L +W ER + LLEDF + + Sbjct: 110 QLRRILKQVEPKVHWPLMAKQLWYWDYNERSKRDLLEDFFINQS 153 >UniRef50_C5SD50 CRISPR-associated protein, Cse2 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD50_CHRVI Length = 152 Score = 73.9 bits (180), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 50/163 (30%), Positives = 81/163 (49%), Gaps = 22/163 (13%) Query: 1 MADEIDAMALYRA-WQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALL 59 M+ E A Y A +++L NG+ A +RR +EPD+LRD+P Y L F P +Q+ LL Sbjct: 1 MSTEAPDFAEYHARFERLPNGAKAGLRRAAEPDDLRDLPGLYHL---FPGSRPSNQETLL 57 Query: 60 RMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRR 119 E + ANS I+E RI Q+ RA+ D++ LRR Sbjct: 58 AFFLPWC------------PELNSNTGFMTLCANS--ISEERIMQIARANPPDDLIALRR 103 Query: 120 LLTHAEPVLDWPLMARMLTWWGKRE----RQQLLEDFVLTTNK 158 L+ P + W +A +L +WG ++ +++L+E + + +K Sbjct: 104 LVMQLHPAVGWLDLAPLLWYWGSKKTGSSKRRLVEGYYIALHK 146 >UniRef50_A1SV71 CRISPR-associated protein, Cse2 family n=2 Tax=Gammaproteobacteria RepID=A1SV71_PSYIN Length = 143 Score = 56.6 bits (135), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 41/149 (27%), Positives = 78/149 (52%), Gaps = 17/149 (11%) Query: 10 LYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGK 69 +Y+A+Q L NG A ++R + +L D PA++R+++ ++ Q +L ++ L Sbjct: 5 IYQAYQLLSNGDKADLKRCNLK-KLADSPAYFRVLKFSRAKDTPQTQRILYLLVGL---- 59 Query: 70 NVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRA-DRTADMVQLRRLLTHAEPVL 128 K S+ G+++ AL N+G + E +I Q+ R+ D D L+R L E + Sbjct: 60 -------KMSDDQPGVNVANALLNAG-VKEAQIIQITRSGDNGIDY--LKRQLVRCENI- 108 Query: 129 DWPLMARMLTWWGKRERQQLLEDFVLTTN 157 + ++ +WG R+ LL++F+L+ N Sbjct: 109 KLESIGKLAQFWGDNARRNLLKNFILSAN 137 >UniRef50_UPI000169A1F0 hypothetical protein Epers_00055 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI000169A1F0 Length = 136 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. Identities = 39/116 (33%), Positives = 62/116 (53%), Gaps = 18/116 (15%) Query: 14 WQQLDN--GSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNV 71 WQ L+N G+ A++RR + PD++ PAF RL Q E P+ Q+ L +V L+ Sbjct: 19 WQGLENNKGTRAELRRCTSPDKVMFQPAFQRLCQRLKPE-PQEQRQLASVVGLLAH---- 73 Query: 72 IRHQDKKSEQTTGISLGRALA-NSGRINERRIFQLIRADRT---ADMVQLRRLLTH 123 +R+ TTG L +A N ++E R +L++ DRT M+++ R+L H Sbjct: 74 VRY-------TTGQKLAYQMAGNPPVVSELRFRRLLQRDRTDLYGAMIRILRMLDH 122 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76632 Uncharacterized protein ygcK n=11 Tax=Enterobact... 240 1e-62 UniRef50_B9PF41 Predicted protein n=3 Tax=cellular organisms Rep... 197 1e-49 UniRef50_C5SD50 CRISPR-associated protein, Cse2 family n=1 Tax=A... 189 2e-47 UniRef50_A1SV71 CRISPR-associated protein, Cse2 family n=2 Tax=G... 154 1e-36 Sequences not found previously or not previously below threshold: UniRef50_Q2FNL6 Putative uncharacterized protein n=1 Tax=Methano... 57 2e-07 UniRef50_UPI000169A1F0 hypothetical protein Epers_00055 n=1 Tax=... 54 1e-06 UniRef50_D0MET4 CRISPR-associated protein, Cse2 family n=1 Tax=R... 45 7e-04 UniRef50_Q0AA31 CRISPR-associated protein, Cse2 family n=1 Tax=A... 44 0.001 UniRef50_Q314I2 Putative uncharacterized protein n=1 Tax=Desulfo... 43 0.005 UniRef50_B6IWM5 CRISPR-associated protein, CT1973 family n=1 Tax... 38 0.088 >UniRef50_P76632 Uncharacterized protein ygcK n=11 Tax=Enterobacteriaceae RepID=YGCK_ECOLI Length = 160 Score = 240 bits (612), Expect = 1e-62, Method: Composition-based stats. Identities = 160/160 (100%), Positives = 160/160 (100%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR Sbjct: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL Sbjct: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 Query: 121 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA 160 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA Sbjct: 121 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA 160 >UniRef50_B9PF41 Predicted protein n=3 Tax=cellular organisms RepID=B9PF41_POPTR Length = 155 Score = 197 bits (500), Expect = 1e-49, Method: Composition-based stats. Identities = 69/164 (42%), Positives = 97/164 (59%), Gaps = 18/164 (10%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 M E D M LY+AW+ L NGS A++RR ++PD+L ++PAFYRL G E ++A R Sbjct: 1 MTKENDFMELYQAWKSLPNGSKAELRRCTKPDDLLEVPAFYRLFGGRG-EKEWQKKAYQR 59 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGR-----INERRIFQLIRADRTADMV 115 ++FCL I H ++K ISLG ALA + ++E R+ Q+IR DM+ Sbjct: 60 LIFCLPC----IEHTEQK------ISLGAALAGGRKGERPAVSESRMIQVIRNQTPNDMI 109 Query: 116 QLRRLLTHAEPVLDWPLMARMLTWWGKRER--QQLLEDFVLTTN 157 QLRR+L EP + WPLMA+ L +W ER + LLEDF + + Sbjct: 110 QLRRILKQVEPKVHWPLMAKQLWYWDYNERSKRDLLEDFFINQS 153 >UniRef50_C5SD50 CRISPR-associated protein, Cse2 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD50_CHRVI Length = 152 Score = 189 bits (480), Expect = 2e-47, Method: Composition-based stats. Identities = 50/163 (30%), Positives = 81/163 (49%), Gaps = 22/163 (13%) Query: 1 MADEIDAMALYRA-WQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALL 59 M+ E A Y A +++L NG+ A +RR +EPD+LRD+P Y L F P +Q+ LL Sbjct: 1 MSTEAPDFAEYHARFERLPNGAKAGLRRAAEPDDLRDLPGLYHL---FPGSRPSNQETLL 57 Query: 60 RMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRR 119 E + ANS I+E RI Q+ RA+ D++ LRR Sbjct: 58 AFFLPWC------------PELNSNTGFMTLCANS--ISEERIMQIARANPPDDLIALRR 103 Query: 120 LLTHAEPVLDWPLMARMLTWWGKRE----RQQLLEDFVLTTNK 158 L+ P + W +A +L +WG ++ +++L+E + + +K Sbjct: 104 LVMQLHPAVGWLDLAPLLWYWGSKKTGSSKRRLVEGYYIALHK 146 >UniRef50_A1SV71 CRISPR-associated protein, Cse2 family n=2 Tax=Gammaproteobacteria RepID=A1SV71_PSYIN Length = 143 Score = 154 bits (388), Expect = 1e-36, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 78/149 (52%), Gaps = 17/149 (11%) Query: 10 LYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGK 69 +Y+A+Q L NG A ++R + +L D PA++R+++ ++ Q +L ++ L Sbjct: 5 IYQAYQLLSNGDKADLKRCNLK-KLADSPAYFRVLKFSRAKDTPQTQRILYLLVGL---- 59 Query: 70 NVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRA-DRTADMVQLRRLLTHAEPVL 128 K S+ G+++ AL N+G + E +I Q+ R+ D D L+R L E + Sbjct: 60 -------KMSDDQPGVNVANALLNAG-VKEAQIIQITRSGDNGIDY--LKRQLVRCENI- 108 Query: 129 DWPLMARMLTWWGKRERQQLLEDFVLTTN 157 + ++ +WG R+ LL++F+L+ N Sbjct: 109 KLESIGKLAQFWGDNARRNLLKNFILSAN 137 >UniRef50_Q2FNL6 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNL6_METHJ Length = 174 Score = 56.8 bits (135), Expect = 2e-07, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 65/157 (41%), Gaps = 19/157 (12%) Query: 6 DAMALYRAWQQ---LDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMV 62 + +W + G A++RR + E+ PAF+RL Q ++AL + Sbjct: 12 PVREKFHSWWKGLIESRGDSAELRRCHDLTEVFFCPAFHRLYQSLLPHGMVRREALALIA 71 Query: 63 FCLSAGKNVIRHQDKKSEQTTGISLGRALANSG-----RINERRIFQLIRADR-TADMVQ 116 L+ K E +G++ + + S ++E R +LIR + + Sbjct: 72 VSLAHVK----------EDISGVTFAQQMGESRIGQTPSVSEARFRKLIRCESYPDLFLP 121 Query: 117 LRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFV 153 + R++ ++ + R L +W ++ R+++ ++ Sbjct: 122 VTRIIRMLNGTVNIDDVVRKLYFWNEKSRKEMTFEYF 158 >UniRef50_UPI000169A1F0 hypothetical protein Epers_00055 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI000169A1F0 Length = 136 Score = 54.5 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 15/131 (11%) Query: 10 LYRAWQQLDN--GSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSA 67 L + WQ L+N G+ A++RR + PD++ PAF RL Q + +Q L V L A Sbjct: 15 LIQWWQGLENNKGTRAELRRCTSPDKVMFQPAFQRLCQ--RLKPEPQEQRQLASVVGLLA 72 Query: 68 GKNVIRHQDKKSEQTTGISLGRALA-NSGRINERRIFQLIRADRTADMVQLRRLLTHAEP 126 TTG L +A N ++E R +L++ DRT + R+L + Sbjct: 73 HVRY----------TTGQKLAYQMAGNPPVVSELRFRRLLQRDRTDLYGAMIRILRMLDH 122 Query: 127 VLDWPLMARML 137 + P + R + Sbjct: 123 RANLPDLIREI 133 >UniRef50_D0MET4 CRISPR-associated protein, Cse2 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET4_RHOM4 Length = 179 Score = 45.2 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 8/128 (6%) Query: 13 AWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVI 72 A + L +G A++RR++ PA ++++ E + C + + Sbjct: 26 ASEALSSGERAELRRIAFEAPF--TPALWKVLFYLRDEGAPVRIGDEADERCWATLLMGM 83 Query: 73 RHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEPV---LD 129 H K + + LGRALA +G +E R QL+RA V LRR+ + + Sbjct: 84 AHCIKLHDYQ--VPLGRALAEAGW-SELRFTQLLRARGPQLAVFLRRMAQYLSAKNQLAN 140 Query: 130 WPLMARML 137 W +A +L Sbjct: 141 WADVADLL 148 >UniRef50_Q0AA31 CRISPR-associated protein, Cse2 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA31_ALHEH Length = 172 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 58/163 (35%), Gaps = 17/163 (10%) Query: 2 ADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRM 61 ++ +A Q NG A+++R+ P Y + ++ L Sbjct: 12 SEMAPHLAAALHHQAFPNGDRARLKRMGVTGP---TPLAYHRFLLRHIPHRWQREGLE-- 66 Query: 62 VFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRAD----RTADMVQL 117 + Q ISLGRALA+SG +E R+ L+ A+ T + Sbjct: 67 -MGWRTLVAALARQHHNPH-APDISLGRALADSGY-SEARLESLLAAEGRVLATLTLRAA 123 Query: 118 RRLLTHAEPVLDWPLMARMLTWWGKRER----QQLLEDFVLTT 156 RL +W AR+L + R +++ D+ T Sbjct: 124 TRLAAQ-RARCNWKDTARLLFAFDDEARERINRKIARDYYRTA 165 >UniRef50_Q314I2 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I2_DESDG Length = 168 Score = 42.5 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 51/161 (31%), Gaps = 10/161 (6%) Query: 1 MADEIDAMALYRAWQQLDN--GSCAQIRRVSEPDELRDIPAFYR----LVQPFGWENPRH 54 + +E L W+ L G A++RR P ++ AF R +Q Sbjct: 8 LKEEAFWTCLREWWEGLAQNRGPRAELRRARTPFDVLTSKAFQRNLVPRLQGKNISLTGA 67 Query: 55 QQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADM 114 +Q L + + A + + + + R+ R+ + DR Sbjct: 68 EQERLALPVGVLAHVRQLEAKRFMPVMLADMQKANSDVTDRRVK--RLLAVT--DRDELF 123 Query: 115 VQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLT 155 L RL+ + + WW R+ ++ L Sbjct: 124 TALIRLVRFMDNTAHLRNLVESGFWWTDATRKNWALNYYLN 164 >UniRef50_B6IWM5 CRISPR-associated protein, CT1973 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM5_RHOCS Length = 206 Score = 38.3 bits (87), Expect = 0.088, Method: Composition-based stats. Identities = 29/138 (21%), Positives = 50/138 (36%), Gaps = 16/138 (11%) Query: 15 QQLDNGSCAQIRRVSEPDELRDIPAFY----RLVQPFGWENPRHQQALLRMVFCLSAGKN 70 + L G A +RR E +PA Y L+ + + + R+ L+ Sbjct: 39 KPLARGDRAALRRAQTVTEACMVPASYALPAALMADARMDRHPEAETVARIAMALA---- 94 Query: 71 VIRHQDKKSEQTTGISLGRALA------NSGRINERRIFQLIRADRTADMVQLRRLLTHA 124 + TG +LGRA A R++ R+ + AD + ++L R Sbjct: 95 TVDEDTGAMAVRTGAALGRAFAVRRQDTGKPRVSGDRLRLICTADDPDEFLRLLRGAIRL 154 Query: 125 EPVLDWP--LMARMLTWW 140 P +AR++ W Sbjct: 155 LEKEKAPVADIARVVEAW 172 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76632 Uncharacterized protein ygcK n=11 Tax=Enterobact... 190 9e-48 UniRef50_B9PF41 Predicted protein n=3 Tax=cellular organisms Rep... 161 7e-39 UniRef50_C5SD50 CRISPR-associated protein, Cse2 family n=1 Tax=A... 148 7e-35 UniRef50_Q2FNL6 Putative uncharacterized protein n=1 Tax=Methano... 145 6e-34 UniRef50_A1SV71 CRISPR-associated protein, Cse2 family n=2 Tax=G... 127 1e-28 UniRef50_Q0AA31 CRISPR-associated protein, Cse2 family n=1 Tax=A... 118 7e-26 UniRef50_D0MET4 CRISPR-associated protein, Cse2 family n=1 Tax=R... 113 2e-24 UniRef50_UPI000169A1F0 hypothetical protein Epers_00055 n=1 Tax=... 107 1e-22 Sequences not found previously or not previously below threshold: UniRef50_B8GIV5 CRISPR-associated protein, Cse2 family n=1 Tax=M... 77 2e-13 UniRef50_Q12YB0 CRISPR-associated protein, Cse2 family n=1 Tax=M... 72 6e-12 UniRef50_Q60AD0 CRISPR-associated protein, CT1973 family n=1 Tax... 63 4e-09 UniRef50_Q04QB9 Putative uncharacterized protein n=2 Tax=Leptosp... 62 5e-09 UniRef50_Q0BSC5 Putative uncharacterized protein n=1 Tax=Granuli... 60 2e-08 UniRef50_Q314I2 Putative uncharacterized protein n=1 Tax=Desulfo... 60 3e-08 UniRef50_A8ZZ15 CRISPR-associated protein, Cse2 family n=2 Tax=D... 56 3e-07 UniRef50_A6W170 CRISPR-associated protein, Cse2 family n=1 Tax=M... 52 7e-06 UniRef50_B3E5V1 CRISPR-associated protein, Cse2 family n=1 Tax=G... 51 1e-05 UniRef50_B8JDN9 CRISPR-associated protein, Cse2 family n=2 Tax=A... 50 2e-05 UniRef50_C9M9R5 CRISPR-associated protein, Cse2 family n=1 Tax=J... 49 5e-05 UniRef50_D1Y488 CRISPR-associated protein, Cse2 family n=1 Tax=P... 48 7e-05 UniRef50_D2L2X6 CRISPR-associated protein, Cse2 family n=1 Tax=D... 48 1e-04 UniRef50_Q7N8H7 Similar to unknown protein n=2 Tax=Photorhabdus ... 48 1e-04 UniRef50_B4RSK1 CRISPR-associated protein, Cse2 family n=1 Tax=A... 48 1e-04 UniRef50_D1A6Q3 CRISPR-associated protein, Cse1 family n=1 Tax=T... 47 2e-04 UniRef50_B6B781 CRISPR-associated protein, Cse2 family n=1 Tax=R... 47 2e-04 UniRef50_A1ARH8 CRISPR-associated protein, Cse2 family n=3 Tax=B... 46 3e-04 UniRef50_B7KJ24 CRISPR-associated protein, Cse2 family n=1 Tax=C... 46 5e-04 UniRef50_B6IWM5 CRISPR-associated protein, CT1973 family n=1 Tax... 44 0.001 UniRef50_Q1R116 Putative uncharacterized protein n=1 Tax=Chromoh... 42 0.005 UniRef50_Q1ZM79 Putative uncharacterized protein n=2 Tax=Photoba... 42 0.005 UniRef50_Q2RY17 CRISPR-associated protein, Cse2 family n=1 Tax=R... 42 0.006 UniRef50_A9HLD1 CRISPR-associated protein, Cse2 family n=9 Tax=A... 40 0.021 UniRef50_C9XXR2 Putative uncharacterized protein n=2 Tax=Cronoba... 39 0.049 UniRef50_Q1J369 CRISPR-associated protein n=1 Tax=Deinococcus ge... 39 0.065 UniRef50_B4T470 CRISPR-associated protein, Cse2 family n=10 Tax=... 38 0.073 UniRef50_B5FTT8 Crispr-associated protein, Cse2 family n=53 Tax=... 38 0.077 >UniRef50_P76632 Uncharacterized protein ygcK n=11 Tax=Enterobacteriaceae RepID=YGCK_ECOLI Length = 160 Score = 190 bits (483), Expect = 9e-48, Method: Composition-based stats. Identities = 160/160 (100%), Positives = 160/160 (100%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR Sbjct: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL Sbjct: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 Query: 121 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA 160 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA Sbjct: 121 LTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA 160 >UniRef50_B9PF41 Predicted protein n=3 Tax=cellular organisms RepID=B9PF41_POPTR Length = 155 Score = 161 bits (407), Expect = 7e-39, Method: Composition-based stats. Identities = 69/164 (42%), Positives = 97/164 (59%), Gaps = 18/164 (10%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 M E D M LY+AW+ L NGS A++RR ++PD+L ++PAFYRL G E ++A R Sbjct: 1 MTKENDFMELYQAWKSLPNGSKAELRRCTKPDDLLEVPAFYRLFGGRG-EKEWQKKAYQR 59 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGR-----INERRIFQLIRADRTADMV 115 ++FCL I H ++K ISLG ALA + ++E R+ Q+IR DM+ Sbjct: 60 LIFCLPC----IEHTEQK------ISLGAALAGGRKGERPAVSESRMIQVIRNQTPNDMI 109 Query: 116 QLRRLLTHAEPVLDWPLMARMLTWWGKRER--QQLLEDFVLTTN 157 QLRR+L EP + WPLMA+ L +W ER + LLEDF + + Sbjct: 110 QLRRILKQVEPKVHWPLMAKQLWYWDYNERSKRDLLEDFFINQS 153 >UniRef50_C5SD50 CRISPR-associated protein, Cse2 family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SD50_CHRVI Length = 152 Score = 148 bits (372), Expect = 7e-35, Method: Composition-based stats. Identities = 49/163 (30%), Positives = 79/163 (48%), Gaps = 22/163 (13%) Query: 1 MADEIDAMALYRA-WQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALL 59 M+ E A Y A +++L NG+ A +RR +EPD+LRD+P Y L P +Q+ LL Sbjct: 1 MSTEAPDFAEYHARFERLPNGAKAGLRRAAEPDDLRDLPGLYHLF---PGSRPSNQETLL 57 Query: 60 RMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRR 119 E + ANS I+E RI Q+ RA+ D++ LRR Sbjct: 58 AFFLPWC------------PELNSNTGFMTLCANS--ISEERIMQIARANPPDDLIALRR 103 Query: 120 LLTHAEPVLDWPLMARMLTWWGKR----ERQQLLEDFVLTTNK 158 L+ P + W +A +L +WG + +++L+E + + +K Sbjct: 104 LVMQLHPAVGWLDLAPLLWYWGSKKTGSSKRRLVEGYYIALHK 146 >UniRef50_Q2FNL6 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNL6_METHJ Length = 174 Score = 145 bits (364), Expect = 6e-34, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 66/160 (41%), Gaps = 19/160 (11%) Query: 3 DEIDAMALYRAWQQ---LDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALL 59 + + +W + G A++RR + E+ PAF+RL Q ++AL Sbjct: 9 ESDPVREKFHSWWKGLIESRGDSAELRRCHDLTEVFFCPAFHRLYQSLLPHGMVRREALA 68 Query: 60 RMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSG-----RINERRIFQLIRADR-TAD 113 + L+ K E +G++ + + S ++E R +LIR + Sbjct: 69 LIAVSLAHVK----------EDISGVTFAQQMGESRIGQTPSVSEARFRKLIRCESYPDL 118 Query: 114 MVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFV 153 + + R++ ++ + R L +W ++ R+++ ++ Sbjct: 119 FLPVTRIIRMLNGTVNIDDVVRKLYFWNEKSRKEMTFEYF 158 >UniRef50_A1SV71 CRISPR-associated protein, Cse2 family n=2 Tax=Gammaproteobacteria RepID=A1SV71_PSYIN Length = 143 Score = 127 bits (318), Expect = 1e-28, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 78/149 (52%), Gaps = 17/149 (11%) Query: 10 LYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGK 69 +Y+A+Q L NG A ++R + +L D PA++R+++ ++ Q +L ++ L Sbjct: 5 IYQAYQLLSNGDKADLKRCNLK-KLADSPAYFRVLKFSRAKDTPQTQRILYLLVGL---- 59 Query: 70 NVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRA-DRTADMVQLRRLLTHAEPVL 128 K S+ G+++ AL N+G + E +I Q+ R+ D D L+R L E + Sbjct: 60 -------KMSDDQPGVNVANALLNAG-VKEAQIIQITRSGDNGIDY--LKRQLVRCEN-I 108 Query: 129 DWPLMARMLTWWGKRERQQLLEDFVLTTN 157 + ++ +WG R+ LL++F+L+ N Sbjct: 109 KLESIGKLAQFWGDNARRNLLKNFILSAN 137 >UniRef50_Q0AA31 CRISPR-associated protein, Cse2 family n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0AA31_ALHEH Length = 172 Score = 118 bits (295), Expect = 7e-26, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 58/163 (35%), Gaps = 17/163 (10%) Query: 2 ADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRM 61 ++ +A Q NG A+++R+ P Y + ++ L Sbjct: 12 SEMAPHLAAALHHQAFPNGDRARLKRMGVTGP---TPLAYHRFLLRHIPHRWQREGLE-- 66 Query: 62 VFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADR----TADMVQL 117 + Q ISLGRALA+SG +E R+ L+ A+ T + Sbjct: 67 -MGWRTLVAALARQHHNPH-APDISLGRALADSGY-SEARLESLLAAEGRVLATLTLRAA 123 Query: 118 RRLLTHAEPVLDWPLMARMLTWWGKRER----QQLLEDFVLTT 156 RL +W AR+L + R +++ D+ T Sbjct: 124 TRLAAQ-RARCNWKDTARLLFAFDDEARERINRKIARDYYRTA 165 >UniRef50_D0MET4 CRISPR-associated protein, Cse2 family n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MET4_RHOM4 Length = 179 Score = 113 bits (282), Expect = 2e-24, Method: Composition-based stats. Identities = 35/152 (23%), Positives = 61/152 (40%), Gaps = 12/152 (7%) Query: 9 ALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAG 68 A A + L +G A++RR++ PA ++++ E + C + Sbjct: 22 AGLLASEALSSGERAELRRIAFEAP--FTPALWKVLFYLRDEGAPVRIGDEADERCWATL 79 Query: 69 KNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEPV- 127 + H K + + LGRALA +G +E R QL+RA V LRR+ + Sbjct: 80 LMGMAHCIKLHDYQ--VPLGRALAEAGW-SELRFTQLLRARGPQLAVFLRRMAQYLSAKN 136 Query: 128 --LDWPLMARMLT----WWGKRERQQLLEDFV 153 +W +A +L R ++ D+ Sbjct: 137 QLANWADVADLLFEQEGAKADTVRLRIARDYY 168 >UniRef50_UPI000169A1F0 hypothetical protein Epers_00055 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI000169A1F0 Length = 136 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 15/131 (11%) Query: 10 LYRAWQQLDN--GSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSA 67 L + WQ L+N G+ A++RR + PD++ PAF RL Q + +Q L V L A Sbjct: 15 LIQWWQGLENNKGTRAELRRCTSPDKVMFQPAFQRLCQ--RLKPEPQEQRQLASVVGLLA 72 Query: 68 GKNVIRHQDKKSEQTTGISLGRALA-NSGRINERRIFQLIRADRTADMVQLRRLLTHAEP 126 TTG L +A N ++E R +L++ DRT + R+L + Sbjct: 73 HVR----------YTTGQKLAYQMAGNPPVVSELRFRRLLQRDRTDLYGAMIRILRMLDH 122 Query: 127 VLDWPLMARML 137 + P + R + Sbjct: 123 RANLPDLIREI 133 >UniRef50_B8GIV5 CRISPR-associated protein, Cse2 family n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GIV5_METPE Length = 178 Score = 76.8 bits (187), Expect = 2e-13, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 56/154 (36%), Gaps = 16/154 (10%) Query: 9 ALYRAWQQLD--NGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLS 66 AL W+ LD G A +RR E+ +P ++RL L+ Sbjct: 25 ALSTWWEGLDEARGDRAMLRRCHSTTEVAFMPTYHRLRLSLERIGHVDPDR-------LA 77 Query: 67 AGKNVIRHQDKKSEQTTGISLGRALANSGR------INERRIFQLIRADRTAD-MVQLRR 119 V+ H + + + I+ + LA + ++ R +L++ + D + R Sbjct: 78 LVAGVLSHLKENTRTKSTITFAQQLATPKKDGDHAAMSGLRFRRLLQVEHPDDLYQAMIR 137 Query: 120 LLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFV 153 + D +A + WW + ++ ++ Sbjct: 138 AVRLLGGAADIDTLANGVYWWNEMTKKNWAFEYY 171 >UniRef50_Q12YB0 CRISPR-associated protein, Cse2 family n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YB0_METBU Length = 143 Score = 71.8 bits (174), Expect = 6e-12, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 55/148 (37%), Gaps = 16/148 (10%) Query: 18 DNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDK 77 + G A +RR E+ P ++RL ++ ++ + L+ KN Sbjct: 4 ERGDRADLRRCHNVVEVVFNPTYHRLWSELNKIGFGNKDSVALIAGVLAHVKN------- 56 Query: 78 KSEQTTGISLGRALA--NSGR---INERRIFQLIR-ADRTADMVQLRRLLTHAEPVLDWP 131 G S +A N G ++ R +L++ ++ + R++ + ++ Sbjct: 57 ---NQGGESFAAQMASLNGGSNSQVSGLRFKRLLKIEEKAELFSSIVRVVKLMDGNVNIC 113 Query: 132 LMARMLTWWGKRERQQLLEDFVLTTNKN 159 +A L WW ++Q + +N Sbjct: 114 NLANSLYWWNDITKKQWAFSYYEKAPRN 141 >UniRef50_Q60AD0 CRISPR-associated protein, CT1973 family n=1 Tax=Methylococcus capsulatus RepID=Q60AD0_METCA Length = 170 Score = 62.6 bits (150), Expect = 4e-09, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 54/160 (33%), Gaps = 14/160 (8%) Query: 1 MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLR 60 +A I +A A + G A +RR++ AFYR + Sbjct: 17 LASLIGHLAATIAAEHFPTGDRAALRRLNPDAPPNL--AFYR-FAFRHLPQNWENRRTA- 72 Query: 61 MVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRL 120 +A I K S+G LA +G +E+R+ +L+ A+ L R Sbjct: 73 ----WTALVAGIALMCPKPH-RPDRSVGLTLAETGY-SEKRLERLLAAEGDTLHTLLLRA 126 Query: 121 LTHAEPV---LDWPLMARMLTWWG-KRERQQLLEDFVLTT 156 +W A +L ++ R ++ D+ Sbjct: 127 ARFLAAKNESCNWTDFAHLLLDRNPEKARLKIARDYYRNL 166 >UniRef50_Q04QB9 Putative uncharacterized protein n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QB9_LEPBJ Length = 178 Score = 62.2 bits (149), Expect = 5e-09, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 54/157 (34%), Gaps = 25/157 (15%) Query: 10 LYRAWQQLDN--GSCAQIRRVSEPDELRDIPAFYRLVQPFGWENP---RHQQALLRMVFC 64 + W L + G A +RR S + IP +RL+ E + + + Sbjct: 14 IINWWIDLKSRTGDRAALRRCSNGLDTLLIPYTHRLISQLFQEGFRFFPDKIGPIAGILS 73 Query: 65 LSAGKNVIRHQDKKSEQTTGISLGRALA----NSGRINERRIFQLIR----ADRTADMVQ 116 E +S R++A + INE R ++++ Sbjct: 74 ------------HIEEDNPSVSFARSMARKEGENPVINEIRFRKILQYSDILSEELFYQN 121 Query: 117 LRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFV 153 + R++ + + + ++ + W +R ++ D+ Sbjct: 122 MVRIVKNLKKKANISDLSLSIYSWNQRTKKDWAYDYY 158 >UniRef50_Q0BSC5 Putative uncharacterized protein n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Q0BSC5_GRABC Length = 165 Score = 60.3 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 55/162 (33%), Gaps = 19/162 (11%) Query: 1 MADEIDAMALYRAWQQL--DNGS-------CAQIRRVSEPDELRDIPAFYRLVQPFGWEN 51 M+ E + W + G+ A++RR ++P + PA ++L Q Sbjct: 1 MSREEKGTIAFEWWAEYLKPRGTNTAARALSARLRR-ADPIKALCEPAVHQLAQALCVSG 59 Query: 52 PRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRT 111 + L + CL A R ++ R +LIRA+ Sbjct: 60 GERETEKLVRLACLLAEVREDDAAPLAH---------RLGGKEPVLSRGRFEKLIRAEGE 110 Query: 112 ADMVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFV 153 +RR + A+ + +AR L +W + R + Sbjct: 111 NLTDLMRRAIVMADRRCNVGALARDLWYWNDKTRTNWCFGYF 152 >UniRef50_Q314I2 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q314I2_DESDG Length = 168 Score = 59.9 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 53/158 (33%), Gaps = 4/158 (2%) Query: 1 MADEIDAMALYRAWQQLD--NGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQAL 58 + +E L W+ L G A++RR P ++ AF R + P Sbjct: 8 LKEEAFWTCLREWWEGLAQNRGPRAELRRARTPFDVLTSKAFQRNLVPRLQGKNISLTGA 67 Query: 59 LRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIR-ADRTADMVQL 117 + L G Q + + ANS + +RR+ +L+ DR L Sbjct: 68 EQERLALPVGVLAHVRQLEAKRFMPVMLADMQKANS-DVTDRRVKRLLAVTDRDELFTAL 126 Query: 118 RRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLT 155 RL+ + + WW R+ ++ L Sbjct: 127 IRLVRFMDNTAHLRNLVESGFWWTDATRKNWALNYYLN 164 >UniRef50_A8ZZ15 CRISPR-associated protein, Cse2 family n=2 Tax=Deltaproteobacteria RepID=A8ZZ15_DESOH Length = 192 Score = 56.4 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 61/163 (37%), Gaps = 18/163 (11%) Query: 2 ADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRM 61 A+ + + W + G A++RR P+++ AF+ ++ N +Q + Sbjct: 9 AESCQLLQRWHHWLDNNRGDRARLRRAERPEDVLLTEAFFNFLK--QMPNSWQEQKPIFS 66 Query: 62 VFCLSAGKNVIR--HQDKKSEQTTG--------ISLGRALANS-----GRINERRIFQLI 106 ++ + ++ HQ S LA S G ++E R QL Sbjct: 67 SAAVAGLLSHVKADHQVVSKGYQPKDEKKSKKMASFSEQLATSIKGDRGAMSELRFQQLQ 126 Query: 107 RADRTAD-MVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQL 148 ++ T + ++ R + + ++ +A + W K ++ Sbjct: 127 KSRTTDEFYRRMVRAIRLLDGRVNILSLANDIIQWHKEHDREF 169 >UniRef50_A6W170 CRISPR-associated protein, Cse2 family n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W170_MARMS Length = 206 Score = 51.8 bits (122), Expect = 7e-06, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 62/174 (35%), Gaps = 25/174 (14%) Query: 12 RAWQQLDNGSCAQIRRVSEPDELRDIPAF---YRLVQPFGWENPRH---QQALLRMVFCL 65 + Q A+++R D+ F + + G E+ ++ + + Sbjct: 26 KHIQPAPTSHKAELKRCDSADDAMLSEGFRALWMALLNGGLEDILQSMSKERQTQKLEAW 85 Query: 66 SAGKNVIRHQDKKSEQTTGISLGRALANSGR------INERRIFQLIRADRTADMVQ-LR 118 + V+ H + + + I G+ L G ++E R +L D ++ LR Sbjct: 86 ATVAAVLVHIKQDNGEKLAIQAGKKLNKQGEPTDKSIVSELRFAKLQNTPTPDDFLKRLR 145 Query: 119 RLLTHAEPVLDWPLM-ARMLTWWGKRERQ-----------QLLEDFVLTTNKNA 160 R++ + + + A +L W+ + + Q D+ + N+ A Sbjct: 146 RIIQQLDGKVSPTKVAADILQWFEEHYDRQPRKADKRITVQWAMDYYRSANQKA 199 >UniRef50_B3E5V1 CRISPR-associated protein, Cse2 family n=1 Tax=Geobacter lovleyi SZ RepID=B3E5V1_GEOLS Length = 188 Score = 51.4 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 59/146 (40%), Gaps = 6/146 (4%) Query: 12 RAWQQLDN---GSCAQIRRVSEPDELRDIPAFYRLVQPFG--WENPRHQQALLRMVFCLS 66 AW++L + G A++RR PD++ + AF+ +Q W H A + L+ Sbjct: 20 LAWRKLIDEKPGERARLRRAESPDDVLLLDAFFNFLQEMPEEWSESAHLPASALIATVLA 79 Query: 67 AGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADM-VQLRRLLTHAE 125 +Q + ++ + + R++E R QL ++ + +L R + + Sbjct: 80 HANLHEMNQYDSTSFAAQLATAKEGGDKPRMSEIRFQQLQKSHDPTEFCRRLVRAVKMLD 139 Query: 126 PVLDWPLMARMLTWWGKRERQQLLED 151 + +A + W R+ + + Sbjct: 140 RNANLFSLANDILHWMHEYRKGVDRN 165 >UniRef50_B8JDN9 CRISPR-associated protein, Cse2 family n=2 Tax=Anaeromyxobacter RepID=B8JDN9_ANAD2 Length = 178 Score = 50.3 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 54/154 (35%), Gaps = 14/154 (9%) Query: 9 ALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQ--QALLRMVFCLS 66 AL RA G A +RR++ D PAF+RL +A + Sbjct: 23 ALARAIASGSPGDVAALRRLTPDDPA--SPAFWRLAAAHLDGALPAGGGEAREEAERSWA 80 Query: 67 AGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEP 126 A + + G ALA +G +E R +L+RA ++R Sbjct: 81 AVMSGMALTAGLH--VPRRRAGAALAQAGY-SELRFERLLRASGPQLFREVRAAAAFLAS 137 Query: 127 VL---DWPLMARMLTWWG----KRERQQLLEDFV 153 DW +A ++ G +R R+ L F Sbjct: 138 KAVEFDWTDLAALVLGDGGPSAERTRRALARSFY 171 >UniRef50_C9M9R5 CRISPR-associated protein, Cse2 family n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9R5_9BACT Length = 192 Score = 49.1 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 59/153 (38%), Gaps = 23/153 (15%) Query: 18 DNGSCAQIRRVSEPDELRDIP-AFYRLV--QPFGWENPRHQQALLRMVFCLSAGKNVIRH 74 G A +RR+ R +P AF++L+ Q E +A ++ ++ +V Sbjct: 24 SPGERAALRRMDSQRTGRVLPGAFWKLLVSQEIFPEGEGETRAWEAILQGMALSADVSSG 83 Query: 75 QDKKSEQTTGISLGRALANS-----GRINERRIFQLIRADRTADMVQLRRLLTHAEPV-- 127 + S RAL + G +E R+ ++++A LR ++ Sbjct: 84 EAP--------SFARALGTADAALVGTTSEDRLIRMLQAQGERFYDLLRGMVRFCASRGI 135 Query: 128 -LDWPLMARMLTWWGKRERQQ----LLEDFVLT 155 W +AR+ +R++ L +D+ L Sbjct: 136 SFSWGDLARLCLAQNSEQRRKTCEHLAQDYYLA 168 >UniRef50_D1Y488 CRISPR-associated protein, Cse2 family n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y488_9BACT Length = 211 Score = 48.3 bits (113), Expect = 7e-05, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 52/153 (33%), Gaps = 11/153 (7%) Query: 17 LDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQD 76 L G A++RR++ + P F+RL+ + A + Sbjct: 42 LTRGERAELRRMNPTKDDTRPPVFWRLLCEYDILGDAGNSLDEGAEKAWGAVFQGMAMTA 101 Query: 77 KKSEQTTGISLGRALA---NSGRINERRIFQLIRADRTADMVQLRRLLTHAEPV---LDW 130 + G AL ++G RR L+RA+ LR LL A W Sbjct: 102 QNCRGARD-DFGAALGSMEDAGDALTRRFDLLMRAEGDRFFDLLRYLLKLASSKGCTFSW 160 Query: 131 PLMARMLTWWGKRERQQ----LLEDFVLTTNKN 159 +A + + ER + L + F L K+ Sbjct: 161 TALACLCVASEENERSKVRSALTKSFYLALWKS 193 >UniRef50_D2L2X6 CRISPR-associated protein, Cse2 family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L2X6_9DELT Length = 173 Score = 48.0 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 43/132 (32%), Gaps = 10/132 (7%) Query: 31 PDELRDIPAFYR----LVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGIS 86 PDE+ P + R L++ G E A + A + E Sbjct: 44 PDEVFVSPDYQRSLLPLLKTAGIELTPQDAAKFARAVGVLAHVRTL-----LPEGHFARQ 98 Query: 87 LGRALANSGRINERRIFQLIRADRTAD-MVQLRRLLTHAEPVLDWPLMARMLTWWGKRER 145 L A + + R +L+ D + LRRL+ + + + + W + R Sbjct: 99 LAPADPGQESVRDPRFKKLLATTDPDDLFLMLRRLVAYLGGTAELRSLVTGASDWTDKTR 158 Query: 146 QQLLEDFVLTTN 157 + + + + Sbjct: 159 RAWAIQYYVNRS 170 >UniRef50_Q7N8H7 Similar to unknown protein n=2 Tax=Photorhabdus RepID=Q7N8H7_PHOLL Length = 193 Score = 47.6 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 45/148 (30%), Gaps = 19/148 (12%) Query: 10 LYRAWQQL---------------DNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRH 54 L WQ + + AQ++R + F L E Sbjct: 9 LLHWWQSMFMSPKQLKEKGIIPAPSTYRAQLKRCDSVEMAMLTEGFRALWLSLPDEISLS 68 Query: 55 QQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALA-NSGRINERRIFQLIRADRTA- 112 + + + H S+ ++ G+ N ++E R QL A Sbjct: 69 DNPIKLEY--WATMAVALVHVKNNSDIKLAVAAGKKGGGNKPVVSELRFSQLQNAKTPNE 126 Query: 113 DMVQLRRLLTHAEPVLDWPLMARMLTWW 140 + +L R+L + + +AR + W Sbjct: 127 LLRRLCRVLQQIKGNISVLALARDIEEW 154 >UniRef50_B4RSK1 CRISPR-associated protein, Cse2 family n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RSK1_ALTMD Length = 189 Score = 47.6 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 51/147 (34%), Gaps = 7/147 (4%) Query: 15 QQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWE-NPRHQQALLRMVFCLSAGKNVIR 73 Q A+++R D + F L + + + + C +A + Sbjct: 33 QAAPTAWKAELKRAESIDAVLLSQGFRALWLSLSSDITEGSDKQVSENMLCWAAVAGALV 92 Query: 74 HQDKKSEQTTGISLGRAL-ANSGRINERRIFQLIRADRTADMV-QLRRLLTHAEPVLDWP 131 ++ GR + ++E R QL +A + + ++RR+L + Sbjct: 93 SVSDNHTESFAKLAGRKGDGDKPVVSELRFAQLQQAQAPEEFLRRIRRILKQLKGKASVT 152 Query: 132 LMARMLTWWGKRERQQLLEDFVLTTNK 158 +A+ W Q+L ++ +K Sbjct: 153 QLAKDTCCW----YQELTSNYPREADK 175 >UniRef50_D1A6Q3 CRISPR-associated protein, Cse1 family n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A6Q3_THECD Length = 722 Score = 47.2 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 54/153 (35%), Gaps = 14/153 (9%) Query: 20 GSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKS 79 G A + R L + ++++ E Q + + A + S Sbjct: 555 GKRAAL-RSGLGRPLDECHRMHKVIAARVPEERETVQQAYYAIAAMIASLPPQAREAPPS 613 Query: 80 EQTTGISLGRALANS-------GRINERRIFQLIRADRTADMV---QLRRLLTHAEPVLD 129 + TG S G+ LA E R+ QL R R+L +D Sbjct: 614 DALTGRSFGQCLAEGVGRGLLRESAAEARLDQLTRQSVDDLHRRLPAAVRILADRSSAVD 673 Query: 130 WP-LMARMLTWWGKRER--QQLLEDFVLTTNKN 159 W L+ ++ W R+R ++ L+DF T K+ Sbjct: 674 WAQLLLDLVWWEDDRDRIARRWLQDFYRTRFKD 706 >UniRef50_B6B781 CRISPR-associated protein, Cse2 family n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B781_9RHOB Length = 167 Score = 46.8 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 48/154 (31%), Gaps = 22/154 (14%) Query: 9 ALYRAWQQLDNGSC---------AQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALL 59 L W G A++RR + + PA + L + + L Sbjct: 12 QLILGWWSAALGDRKTSAQKALSARLRR-GDDVSVLCEPAVHDLARSLNLRDGPRVARLA 70 Query: 60 RMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRR 119 R V+ H + + LGR + ++ R +LI++ +RR Sbjct: 71 R----------VLAHVRAYTSDSLPRRLGR--GDPPALSPMRFERLIQSGGADLEAAIRR 118 Query: 120 LLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFV 153 L + + + L W R + D+ Sbjct: 119 ALPMVQHSANPAHLGEALLNWSDATRMRWCFDYY 152 >UniRef50_A1ARH8 CRISPR-associated protein, Cse2 family n=3 Tax=Bacteria RepID=A1ARH8_PELPD Length = 172 Score = 46.4 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 47/141 (33%), Gaps = 7/141 (4%) Query: 22 CAQIRRVSEPDELRDIPAFYRLVQPFGWE-NPRHQQALLRMVFCLSAGKNVIRHQDKKSE 80 A +RR +PA+ + E N ++ + +A + + E Sbjct: 19 RAVLRRSLAFTPGFHVPAYPYIEPFIKNESNSWRREMFYLVAGLWAAHWR--EDRKGQPE 76 Query: 81 QTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHA-EPVLDWPLMARMLTW 139 A+ E R L+ AD +LR+++ E +D+ + L + Sbjct: 77 SLGKACARYQAASGSTSTENRFISLLDADTDQLPHRLRQMIALLKEQSIDFEELLTGLLY 136 Query: 140 WGKRERQQ---LLEDFVLTTN 157 W +++ D+ N Sbjct: 137 WNDEQKRTQNGWGRDYYRNLN 157 >UniRef50_B7KJ24 CRISPR-associated protein, Cse2 family n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KJ24_CYAP7 Length = 171 Score = 45.6 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 46/156 (29%), Gaps = 19/156 (12%) Query: 14 WQQLD--NGSCAQIRRVSEPDELRDIPAFYRLVQPFGWE-NPRHQQALLRMVFCLSAGKN 70 WQ+++ G+ A + R ++ D A + + ++ + A Sbjct: 17 WQRIEGNRGAIATVSRCTDSDPYYQRQAATYIYPYLPDDIRQWQKKINRYVFV---AALM 73 Query: 71 VIRHQDKKSEQTTGISLGRALANSGRIN-------ERRIFQLIRADRTADMVQL---RRL 120 HQ + G S G + E R L+ A+ L L Sbjct: 74 AKNHQQNPKDAQIGSSFGHTCLRLKKHANVNANGIENRFQALLNANGEDVYRYLGMFAPL 133 Query: 121 LTHAEPVLDWPLMARMLTWWGKRE---RQQLLEDFV 153 L +W + L W E R + DF Sbjct: 134 LRQHNIPCEWAKLLEDLNCWDHEEEKVRLRWARDFY 169 >UniRef50_B6IWM5 CRISPR-associated protein, CT1973 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IWM5_RHOCS Length = 206 Score = 44.1 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 45/134 (33%), Gaps = 8/134 (5%) Query: 15 QQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPF----GWENPRHQQALLRMVFCLSAGKN 70 + L G A +RR E +PA Y L + + + R+ L+ Sbjct: 39 KPLARGDRAALRRAQTVTEACMVPASYALPAALMADARMDRHPEAETVARIAMALATVDE 98 Query: 71 VIRHQDKKSEQTTGISLG--RALANSGRINERRIFQLIRADRTADMVQLRRLLTHA--EP 126 ++ G + R R++ R+ + AD + ++L R + Sbjct: 99 DTGAMAVRTGAALGRAFAVRRQDTGKPRVSGDRLRLICTADDPDEFLRLLRGAIRLLEKE 158 Query: 127 VLDWPLMARMLTWW 140 +AR++ W Sbjct: 159 KAPVADIARVVEAW 172 >UniRef50_Q1R116 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R116_CHRSD Length = 230 Score = 42.2 bits (97), Expect = 0.005, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 58/171 (33%), Gaps = 28/171 (16%) Query: 3 DEIDAMALYRAWQQL----------------DNGSCAQIRRVSEPDELRDIPAFYRLVQP 46 D +A L WQ+L G A +RR + + AF L Q Sbjct: 29 DNNEAFELRLWWQRLVMDEQELKRYTKRRPYPRGVRAALRRCDAIESVLLTEAFRHLWQA 88 Query: 47 FG---WENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRAL-------ANSGR 96 W+ L A + + +T G LG+ L ++ R Sbjct: 89 LETRVWKEMPGDGPSQWRDRRLEAWAVIAAIVSELRAETFGAPLGKRLGENRPNTGDTPR 148 Query: 97 INERRIFQLIRADRT-ADMVQLRRLLTHAEPV-LDWPLMARMLTWWGKRER 145 +++ R QL+ + + RR L A+ + +A M+ W + +R Sbjct: 149 MSDLRFQQLLDCHTPKELIRRFRRALKLADGTGVSVVRLADMVALWHREQR 199 >UniRef50_Q1ZM79 Putative uncharacterized protein n=2 Tax=Photobacterium RepID=Q1ZM79_PHOAS Length = 183 Score = 42.2 bits (97), Expect = 0.005, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 48/126 (38%), Gaps = 7/126 (5%) Query: 18 DNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDK 77 +G A++ R D + AF+ L P + + C + + H Sbjct: 33 PSGHKAKLMRAESVDAVLMQDAFHTLWLSL----PEQDSHSVNDMECWAVIAASLIHVSS 88 Query: 78 KSEQTTGISLGRALANS--GRINERRIFQLIRADRTAD-MVQLRRLLTHAEPVLDWPLMA 134 I+ G+ NS ++E R QL A+ + + LRRLL + +D +A Sbjct: 89 GYRDGLAIAAGKKKENSHIPLVSEMRFSQLQAANTPDELLRTLRRLLKLMKGKVDPLTLA 148 Query: 135 RMLTWW 140 R + W Sbjct: 149 RDIEQW 154 >UniRef50_Q2RY17 CRISPR-associated protein, Cse2 family n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RY17_RHORT Length = 200 Score = 42.2 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 52/173 (30%), Gaps = 19/173 (10%) Query: 1 MADEIDAMALYRAWQQLD--------NGSCAQIRRVSEPDELR----DIPAFYRLVQPFG 48 M + W + A + R+ D++ D+ + Sbjct: 1 MNRASPLNETFSEWWEKSIDKDDGQAKADRAALCRLGVADQVFPPAIDVAGALTIGAFRT 60 Query: 49 WENPRHQQALLRMVFC------LSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRI 102 +Q+ +L+ L+ V+ + T +L ++ + E R Sbjct: 61 LYRQINQREILKDFRTGDWEDRLAVAAMVLAQVRTNTPSHTTAALLGGDDDTPLMAESRF 120 Query: 103 FQLIRADRTADM-VQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVL 154 +L+RA D+ Q RR + + + L W R++ + Sbjct: 121 RRLLRAKDPVDLWAQARRAVALLKREAPVGDLGASLFTWDAHTRRRWAGAYWR 173 >UniRef50_A9HLD1 CRISPR-associated protein, Cse2 family n=9 Tax=Acetobacteraceae RepID=A9HLD1_GLUDA Length = 196 Score = 40.3 bits (92), Expect = 0.021, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 51/152 (33%), Gaps = 20/152 (13%) Query: 3 DEIDAMALYRAWQQL----------DNGSCAQIRRVSEPDELRDIPAFY---RLVQPFGW 49 D R W + D G+ A++RR + + PA R Sbjct: 5 DRNAIAEKARNWWRELQPDPGGRPGDRGTLARLRRCASIVDALFEPAVQTLARRCGARR- 63 Query: 50 ENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRAD 109 E + AL+ V N + ++ T + AL R +L+ A Sbjct: 64 EGELARVALVAAVLAHVRSDNPAQFVARQIGPTDMAKVATALCKP-----VRFRRLLDAS 118 Query: 110 RTAD-MVQLRRLLTHAEPVLDWPLMARMLTWW 140 + + RRL+T A ++ +AR + W Sbjct: 119 EFDECLTAFRRLVTLAGRTVNVADLARSVLAW 150 >UniRef50_C9XXR2 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=C9XXR2_CROTZ Length = 201 Score = 39.1 bits (89), Expect = 0.049, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 31/85 (36%), Gaps = 3/85 (3%) Query: 63 FCLSAGKNVIRHQDKKSEQTTGIS--LGRALANSGRINERRIFQLIRADRTA-DMVQLRR 119 FC + K + + LG N ++E R +L +A QLRR Sbjct: 66 FCALGIVAALAAHVKTVDTRASFAEQLGHKEGNHAVMSELRFRRLSQARTQEELFRQLRR 125 Query: 120 LLTHAEPVLDWPLMARMLTWWGKRE 144 + V++ P +A + W E Sbjct: 126 AVQLLGGVVNLPDLAEGVFRWCAEE 150 >UniRef50_Q1J369 CRISPR-associated protein n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J369_DEIGD Length = 204 Score = 38.7 bits (88), Expect = 0.065, Method: Composition-based stats. Identities = 27/122 (22%), Positives = 42/122 (34%), Gaps = 4/122 (3%) Query: 42 RLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERR 101 RLV PR +Q + +K T + + E+R Sbjct: 50 RLVAGLYALKPRARQ-DEGDAAEVETPPAETADSEKAPSIGTLMGQLYLAQGARPSTEKR 108 Query: 102 IFQLIRADR---TADMVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNK 158 L+ ADR + Q LL + DW + R L WG R R+ +DF ++ Sbjct: 109 FLALLDADRDGLPYHLRQAVTLLATEDFTPDWVRLTRDLLRWGDRVRRGWAQDFYRELSR 168 Query: 159 NA 160 + Sbjct: 169 ES 170 >UniRef50_B4T470 CRISPR-associated protein, Cse2 family n=10 Tax=Enterobacteriaceae RepID=B4T470_SALNS Length = 200 Score = 38.3 bits (87), Expect = 0.073, Method: Composition-based stats. Identities = 17/133 (12%), Positives = 42/133 (31%), Gaps = 14/133 (10%) Query: 22 CAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQ 81 A+++R++ P + L Q + + +S ++ H+ Sbjct: 46 RAELKRMAPPYGVMICEGHDALRQALLKHMRLQPLDEMALALFVSVAVHIKSHK------ 99 Query: 82 TTGISLGRALAN-----SGRINERRIFQLIRADRTADMVQLRRLLTHAEPV--LDWPLMA 134 IS L + ++ R +L +A QL ++ +A Sbjct: 100 -ANISFAAQLGEKLKGSTSCVSGLRFERLQKASDPETFCQLLIQAVKIRGTEGVNVLSLA 158 Query: 135 RMLTWWGKRERQQ 147 + W + +++ Sbjct: 159 DGIFLWMEEWQRR 171 >UniRef50_B5FTT8 Crispr-associated protein, Cse2 family n=53 Tax=Enterobacteriaceae RepID=B5FTT8_SALDC Length = 186 Score = 38.3 bits (87), Expect = 0.077, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 47/143 (32%), Gaps = 13/143 (9%) Query: 6 DAMALYRAWQ---QLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGW----ENPRHQQAL 58 D A R W Q G A +RR ++ + L+ + P + AL Sbjct: 7 DDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTAL 66 Query: 59 LRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTA-DMVQL 117 + K + K G G ++ +++ R L+ + QL Sbjct: 67 AITAALAAHIKFIDEQ--KSFAAQLGQKKG---GDTPVMSKLRFSHLLAVKTPDELLRQL 121 Query: 118 RRLLTHAEPVLDWPLMARMLTWW 140 RR + + ++ +A + W Sbjct: 122 RRAVKLLDGSVNLFSLADDIFCW 144 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.116 0.275 Lambda K H 0.267 0.0356 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 669,411,512 Number of Sequences: 3077464 Number of extensions: 19335445 Number of successful extensions: 73761 Number of sequences better than 1.0e-01: 45 Number of HSP's better than 0.1 without gapping: 18 Number of HSP's successfully gapped in prelim test: 44 Number of HSP's that attempted gapping in prelim test: 73684 Number of HSP's gapped (non-prelim): 62 length of query: 160 length of database: 1,040,396,356 effective HSP length: 118 effective length of query: 42 effective length of database: 677,255,604 effective search space: 28444735368 effective search space used: 28444735368 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.8 bits) S2: 87 (38.3 bits)