BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements",
Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (373 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done


Results from round 1


                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P27431 Uncharacterized protein ycfD n=205 Tax=Gammaprot...   763   0.0
UniRef50_A0KI50 Cupin superfamily protein n=6 Tax=Gammaproteobac...   391   e-107
UniRef50_C4LEX7 Cupin 4 family protein n=1 Tax=Tolumonas auensis...   367   e-100
UniRef50_C9QJT9 Putative uncharacterized protein n=2 Tax=Vibrion...   361   2e-98
UniRef50_Q5E4F9 Conserved protein n=16 Tax=Gammaproteobacteria R...   360   4e-98
UniRef50_A1STI6 Cupin 4 family protein n=1 Tax=Psychromonas ingr...   349   7e-95
UniRef50_B8K5G8 Cupin superfamily protein n=1 Tax=Vibrio parahae...   342   1e-92
UniRef50_A6F8R4 Putative uncharacterized protein n=1 Tax=Moritel...   320   5e-86
UniRef50_C4K8V5 Putative uncharacterized protein n=1 Tax=Candida...   311   3e-83
UniRef50_A3QD76 Cupin 4 family protein n=19 Tax=Shewanella RepID...   303   5e-81
UniRef50_A1RJT3 Cupin 4 family protein n=14 Tax=Alteromonadales ...   296   1e-78
UniRef50_C3M8B3 Putative uncharacterized protein n=3 Tax=Candida...   288   2e-76
UniRef50_Q15T89 Cupin 4 n=1 Tax=Pseudoalteromonas atlantica T6c ...   280   5e-74
UniRef50_Q5QZ10 Cupin superfamily protein n=2 Tax=Idiomarina Rep...   279   1e-73
UniRef50_Q1NG82 Putative uncharacterized protein n=1 Tax=Sphingo...   263   5e-69
UniRef50_A0YBW0 Transcription factor jumonji, jmjC n=1 Tax=marin...   261   3e-68
UniRef50_Q1QUR4 Cupin 4 n=1 Tax=Chromohalobacter salexigens DSM ...   260   5e-68
UniRef50_UPI0000E0F5AA putative enzyme with RmlC-like domain n=1...   260   6e-68
UniRef50_B4RRX0 Putative enzyme with RmlC-like domain n=2 Tax=Al...   252   1e-65
UniRef50_Q48H58 YcfD protein n=22 Tax=Gammaproteobacteria RepID=...   251   4e-65
UniRef50_Q2S4H4 Cupin superfamily protein n=3 Tax=Bacteria RepID...   247   4e-64
UniRef50_Q2BJ43 Putative uncharacterized protein n=1 Tax=Neptuni...   236   1e-60
UniRef50_A6W0E5 Cupin 4 family protein n=2 Tax=Marinomonas RepID...   230   6e-59
UniRef50_Q1N4P0 Transcription factor jumonji, jmjC n=1 Tax=Berma...   229   1e-58
UniRef50_B7RUZ0 Cupin superfamily protein n=1 Tax=marine gamma p...   222   1e-56
UniRef50_B3PKY0 Putative uncharacterized protein n=2 Tax=Pseudom...   219   1e-55
UniRef50_D2UDU1 Putative uncharacterized protein n=1 Tax=Xanthom...   218   2e-55
UniRef50_A6F0B9 Transcription factor jumonji, jmjC n=1 Tax=Marin...   216   1e-54
UniRef50_C7RB22 Cupin 4 family protein n=1 Tax=Kangiella koreens...   214   3e-54
UniRef50_C6WYD1 Cupin 4 family protein n=1 Tax=Methylotenera mob...   214   3e-54
UniRef50_C5BU83 Cupin 4 family protein n=1 Tax=Teredinibacter tu...   212   2e-53
UniRef50_Q2Y9X5 Cupin region n=9 Tax=root RepID=Q2Y9X5_NITMU          209   2e-52
UniRef50_D0L0L5 Cupin 4 family protein n=1 Tax=Halothiobacillus ...   208   3e-52
UniRef50_A0Z1Z1 Putative uncharacterized protein n=1 Tax=marine ...   207   4e-52
UniRef50_A6SXH9 Uncharacterized conserved protein n=2 Tax=Oxalob...   204   3e-51
UniRef50_B2SQ70 Transcription factor jumonji, JmjC n=19 Tax=Xant...   203   7e-51
UniRef50_C1DCJ3 Cupin region n=1 Tax=Laribacter hongkongensis HL...   202   2e-50
UniRef50_Q2SJM1 Uncharacterized conserved protein n=3 Tax=Gammap...   202   2e-50
UniRef50_B8KRM1 Cupin 4 family protein n=1 Tax=gamma proteobacte...   201   3e-50
UniRef50_Q21K45 Cupin 4 n=1 Tax=Saccharophagus degradans 2-40 Re...   198   2e-49
UniRef50_D1UI98 Cupin 4 family protein n=6 Tax=Burkholderia RepI...   198   2e-49
UniRef50_C5A9S6 Cupin superfamily protein family protein n=49 Ta...   197   4e-49
UniRef50_A4BDP0 Putative uncharacterized protein n=1 Tax=Reineke...   197   6e-49
UniRef50_B8KGD9 Cupin 4 family protein n=2 Tax=unclassified Gamm...   197   7e-49
UniRef50_D1RFR4 Cupin superfamily protein n=1 Tax=Legionella lon...   196   1e-48
UniRef50_B8GSM7 Cupin 4 family protein n=1 Tax=Thioalkalivibrio ...   192   2e-47
UniRef50_Q5WVF0 Putative uncharacterized protein n=4 Tax=Legione...   191   3e-47
UniRef50_Q3JQS3 Cupin superfamily protein family n=25 Tax=Burkho...   189   1e-46
UniRef50_C7I1M3 Cupin 4 family protein n=1 Tax=Thiomonas interme...   188   4e-46
UniRef50_A1K4G1 Putative uncharacterized protein n=1 Tax=Azoarcu...   186   2e-45
UniRef50_Q0VQ28 Putative uncharacterized protein n=1 Tax=Alcaniv...   185   2e-45
UniRef50_C0N3X6 Cupin superfamily protein n=1 Tax=Methylophaga t...   184   4e-45
UniRef50_Q31GJ6 Cupin superfamily protein n=2 Tax=Gammaproteobac...   182   1e-44
UniRef50_B9ZR02 Cupin 4 family protein n=1 Tax=Thioalkalivibrio ...   178   2e-43
UniRef50_A1VLH8 Cupin 4 family protein n=6 Tax=Burkholderiales R...   178   3e-43
UniRef50_D1KE35 Putative uncharacterized protein n=1 Tax=uncultu...   177   8e-43
UniRef50_B4X170 Cupin superfamily protein n=1 Tax=Alcanivorax sp...   175   2e-42
UniRef50_A6GQ27 Putative uncharacterized protein n=1 Tax=Limnoba...   174   3e-42
UniRef50_Q7NS46 Putative uncharacterized protein n=1 Tax=Chromob...   174   7e-42
UniRef50_C0VP99 Cupin 4 n=2 Tax=Acinetobacter RepID=C0VP99_9GAMM      172   1e-41
UniRef50_B7H3P1 Cupin superfamily protein n=16 Tax=Acinetobacter...   165   2e-39
UniRef50_P44683 Uncharacterized protein HI0396 n=36 Tax=Gammapro...   163   1e-38
UniRef50_A4SX54 Cupin 4 family protein n=2 Tax=Polynucleobacter ...   161   3e-38
UniRef50_B1Y837 Cupin 4 family protein n=3 Tax=cellular organism...   161   5e-38
UniRef50_A2W941 Transcription factor jumonji n=1 Tax=Burkholderi...   152   2e-35
UniRef50_C1E292 Predicted protein n=2 Tax=Micromonas RepID=C1E29...   145   2e-33
UniRef50_UPI0000E87D6F hypothetical protein MB2181_02235 n=1 Tax...   144   5e-33
UniRef50_A4S2B8 Predicted protein n=2 Tax=Ostreococcus RepID=A4S...   138   3e-31
UniRef50_B7FZB3 Predicted protein n=1 Tax=Phaeodactylum tricornu...   119   2e-25
UniRef50_D1TSY0 Conserved domain protein n=19 Tax=Yersinia pesti...   113   1e-23
UniRef50_B6BWI1 Putative cytoplasmic protein n=1 Tax=beta proteo...   107   6e-22
UniRef50_B8C536 Putative uncharacterized protein (Fragment) n=1 ...    99   3e-19
UniRef50_A9TET4 Predicted protein n=1 Tax=Physcomitrella patens ...    71   5e-11
UniRef50_Q091R3 Mina protein n=1 Tax=Stigmatella aurantiaca DW4/...    65   3e-09
UniRef50_Q849M1 Putative uncharacterized protein pSV2.19c n=3 Ta...    61   7e-08
UniRef50_A4U3D3 MYC induced nuclear antigen n=1 Tax=Magnetospiri...    59   2e-07
UniRef50_A3Q8B6 Cupin 4 family protein n=4 Tax=Mycobacterium Rep...    57   1e-06
UniRef50_B7PMB0 MYC-induced nuclear antigen, putative (Fragment)...    56   3e-06
UniRef50_B7G6P1 Predicted protein (Fragment) n=1 Tax=Phaeodactyl...    55   4e-06
UniRef50_B4B491 Cupin 4 family protein n=1 Tax=Cyanothece sp. PC...    55   6e-06
UniRef50_B0CEG8 Cupin 4 family protein, putative n=1 Tax=Acaryoc...    54   1e-05
UniRef50_D2A374 Putative uncharacterized protein GLEAN_07936 n=1...    54   1e-05
UniRef50_Q7N884 Similar to unknown protein n=1 Tax=Photorhabdus ...    53   1e-05
UniRef50_A3M7T2 Putative uncharacterized protein n=2 Tax=Acineto...    53   2e-05
UniRef50_C1EHB5 Predicted protein (Fragment) n=2 Tax=Micromonas ...    53   2e-05
UniRef50_UPI0000E45D23 PREDICTED: hypothetical protein n=2 Tax=S...    52   4e-05
UniRef50_Q9H6W3 Lysine-specific demethylase NO66 n=17 Tax=Eumeta...    51   6e-05
UniRef50_A5PK74 Lysine-specific demethylase NO66 n=1 Tax=Bos tau...    51   6e-05
UniRef50_D0NRY0 Nucleolar protein, putative n=2 Tax=Phytophthora...    51   7e-05
UniRef50_D2SA69 Cupin 4 family protein n=2 Tax=Actinomycetales R...    51   9e-05
UniRef50_B1FB07 Cupin 4 family protein n=1 Tax=Burkholderia ambi...    50   1e-04
UniRef50_B4Q068 Lysine-specific demethylase NO66 n=5 Tax=Sophoph...    50   1e-04
UniRef50_Q016L9 [S] KOG3706 Uncharacterized conserved protein n=...    50   1e-04
UniRef50_D0L9V4 Cupin 4 family protein n=1 Tax=Gordonia bronchia...    50   1e-04
UniRef50_Q10ZZ1 Cupin 4 n=1 Tax=Trichodesmium erythraeum IMS101 ...    50   2e-04
UniRef50_A9C261 Cupin 4 family protein n=1 Tax=Delftia acidovora...    50   2e-04
UniRef50_A8QFQ3 Lysine-specific demethylase NO66 n=2 Tax=Brugia ...    50   2e-04
UniRef50_UPI000192663F PREDICTED: similar to Myc-induced nuclear...    50   2e-04
UniRef50_B4M7P8 Lysine-specific demethylase NO66 n=3 Tax=Drosoph...    50   2e-04
UniRef50_B4GUZ2 Lysine-specific demethylase NO66 n=2 Tax=Drosoph...    50   2e-04
UniRef50_A6W7N8 Cupin 4 family protein n=1 Tax=Kineococcus radio...    50   2e-04
UniRef50_B5W5P2 Cupin 4 family protein n=2 Tax=Arthrospira RepID...    49   2e-04
UniRef50_Q5ZMM1 Lysine-specific demethylase NO66 n=3 Tax=Eumetaz...    49   3e-04
UniRef50_Q28VG0 Cupin 4 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28VG0...    49   3e-04
UniRef50_C8XBP6 Cupin 4 family protein n=1 Tax=Nakamurella multi...    49   4e-04
UniRef50_A4RZ92 Predicted protein n=1 Tax=Ostreococcus lucimarin...    49   4e-04
UniRef50_B8BSJ2 Predicted protein n=1 Tax=Thalassiosira pseudona...    48   5e-04
UniRef50_Q1DFZ7 Cupin family protein n=1 Tax=Myxococcus xanthus ...    48   6e-04
UniRef50_B5DUH6 Lysine-specific demethylase NO66 n=2 Tax=Drosoph...    48   6e-04
UniRef50_B4V6J8 Putative uncharacterized protein n=1 Tax=Strepto...    48   7e-04
UniRef50_B4JMQ2 Lysine-specific demethylase NO66 n=1 Tax=Drosoph...    47   9e-04
UniRef50_A0YJB4 Putative uncharacterized protein n=1 Tax=Lyngbya...    47   0.001
UniRef50_Q1D4G2 Cupin family protein n=2 Tax=Myxococcus xanthus ...    47   0.001
UniRef50_A1R1T1 Putative cupin superfamily protein n=2 Tax=Micro...    47   0.001
UniRef50_B4R4H1 Lysine-specific demethylase NO66 n=2 Tax=melanog...    47   0.001
UniRef50_A9UZN8 Predicted protein n=1 Tax=Monosiga brevicollis R...    47   0.002
UniRef50_UPI000186D1B6 conserved hypothetical protein n=1 Tax=Pe...    47   0.002
UniRef50_A9V5A3 Predicted protein n=1 Tax=Monosiga brevicollis R...    46   0.002
UniRef50_UPI00017929D5 PREDICTED: similar to Nucleolar protein 6...    46   0.002
UniRef50_A9V7G6 Predicted protein n=1 Tax=Monosiga brevicollis R...    46   0.002
UniRef50_B0WMG3 Lysine-specific demethylase NO66 n=2 Tax=Culicin...    46   0.002
UniRef50_B0BQ44 Putative uncharacterized protein n=5 Tax=Pasteur...    46   0.002
UniRef50_B7FXD3 Predicted protein n=1 Tax=Phaeodactylum tricornu...    46   0.002
UniRef50_A2SGT4 Putative uncharacterized protein n=1 Tax=Methyli...    46   0.003
UniRef50_A4X6V2 Cupin 4 family protein n=4 Tax=Micromonosporacea...    46   0.003
UniRef50_C6SNC5 Putative uncharacterized protein n=2 Tax=Neisser...    45   0.003
UniRef50_Q7K4H4 Lysine-specific demethylase NO66 n=2 Tax=melanog...    45   0.003
UniRef50_A0QI05 Cupin superfamily protein n=4 Tax=Mycobacterium ...    45   0.003
UniRef50_A1KTI5 Putative uncharacterized protein n=2 Tax=Neisser...    45   0.004
UniRef50_C5LMW3 Putative uncharacterized protein n=1 Tax=Perkins...    45   0.005
UniRef50_P46327 Uncharacterized protein yxbC n=1 Tax=Bacillus su...    45   0.005
UniRef50_A5GJ70 Putative uncharacterized protein SynWH7803_0559 ...    45   0.005
UniRef50_B6KFH2 Putative uncharacterized protein n=4 Tax=Toxopla...    45   0.005
UniRef50_Q54K96 Lysine-specific demethylase NO66 n=1 Tax=Dictyos...    44   0.006
UniRef50_D2VJG1 Predicted protein n=1 Tax=Naegleria gruberi RepI...    44   0.006
UniRef50_Q4D641 Putative uncharacterized protein n=1 Tax=Trypano...    44   0.007
UniRef50_O01658 Lysine-specific demethylase NO66 n=3 Tax=Caenorh...    44   0.007
UniRef50_UPI000180B5EA PREDICTED: similar to Nucleolar protein 6...    44   0.009
UniRef50_B4L6Q5 Lysine-specific demethylase NO66 n=1 Tax=Drosoph...    44   0.009
UniRef50_C6W918 Cupin 4 family protein n=2 Tax=Actinomycetales R...    44   0.011
UniRef50_UPI000192614C PREDICTED: similar to chromosome 14 open ...    44   0.013
UniRef50_A3UGV1 Putative uncharacterized protein n=1 Tax=Oceanic...    43   0.023
UniRef50_A1SPZ0 Cupin 4 family protein n=1 Tax=Nocardioides sp. ...    42   0.025
UniRef50_Q15JF4 VldL n=1 Tax=Streptomyces hygroscopicus subsp. l...    42   0.025
UniRef50_A9VEP7 Predicted protein (Fragment) n=1 Tax=Monosiga br...    42   0.031
UniRef50_Q31RB4 Putative uncharacterized protein n=2 Tax=Synecho...    42   0.034
UniRef50_D2PSR6 Cupin family protein n=1 Tax=Kribbella flavida D...    42   0.035
UniRef50_C7NJK3 Cupin superfamily protein n=1 Tax=Kytococcus sed...    42   0.037
UniRef50_Q2T4J7 Unnamed protein product n=2 Tax=Burkholderia tha...    42   0.038
UniRef50_D1VL61 Cupin 4 family protein n=1 Tax=Frankia sp. EuI1c...    42   0.039
UniRef50_Q2RW70 Cupin region n=1 Tax=Rhodospirillum rubrum ATCC ...    42   0.041
UniRef50_D0MXW2 Nucleolar protein, putative n=1 Tax=Phytophthora...    42   0.042
UniRef50_C0INQ3 Putative uncharacterized protein n=2 Tax=environ...    42   0.046
UniRef50_UPI0000E4684D PREDICTED: hypothetical protein n=1 Tax=S...    41   0.058
UniRef50_Q091R4 Chromosome 14 open reading frame 169, putative n...    41   0.084

>UniRef50_P27431 Uncharacterized protein ycfD n=205 Tax=Gammaproteobacteria RepID=YCFD_ECOLI
          Length = 373

 Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 373/373 (100%), Positives = 373/373 (100%)

Query: 1   MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60
           MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK
Sbjct: 1   MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60

Query: 61  WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120
           WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG
Sbjct: 61  WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120

Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180
           GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180

Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240
           IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA
Sbjct: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240

Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300
           DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG
Sbjct: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300

Query: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360
           EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML
Sbjct: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360

Query: 361 AALVNSGYWFFEG 373
           AALVNSGYWFFEG
Sbjct: 361 AALVNSGYWFFEG 373

>UniRef50_A0KI50 Cupin superfamily protein n=6 Tax=Gammaproteobacteria RepID=A0KI50_AERHH
          Length = 376

 Score = 391 bits (1005), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/374 (50%), Positives = 252/374 (67%), Gaps = 5/374 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL L+ FLE +WQKRP+++K GF +F DPISPDELAGLAME ++SRLV+ + KW+ Sbjct: 2 YQLNLDIAHFLEHYWQKRPLLIKGGFTDFQDPISPDELAGLAMEEVIESRLVTRFNNKWE 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 +HGPFESYDHLGE NW++LVQA NHW L PF+ +P WR DD+M+SFS P GGV Sbjct: 62 AAHGPFESYDHLGEENWTVLVQACNHWAPEVNELALPFQFIPGWRFDDVMVSFSTPHGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVFI QG G+R WRVG+ + + H LL +PFEAIID +EPGDILYIP Sbjct: 122 GPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAALLHCEPFEAIIDVIMEPGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPHEGYA+E ++N+SVGFRAP+ + LIS FAD+++ E+ Y D D+ PRA ++ Sbjct: 182 PGFPHEGYAIEPSLNFSVGFRAPDAKALISSFADHLIDNEVRTERYGDADLKPRARHGEI 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 P E+ +LRE+M + ++ FK+WFG IS+++H+LD+ P EP Y +E+ D L QGE Sbjct: 242 QPHELHRLRELMQQALDDETLFKEWFGTMISEAKHDLDVNPVEPDYSAEEVADLLTQGEP 301 Query: 303 LVRLGGLRVLRIGDD---VYANGE--KIDSPHRPALDALASNIALTAENFGDALEDPSFL 357 +++ GLR + + Y +GE + S A+ L +T + + + FL Sbjct: 302 AIKVPGLRTVWFSGESQQCYIDGEAWTLQSEDAAAISLLCDKDMVTQADMVELADQAGFL 361 Query: 358 AMLAALVNSGYWFF 371 +L LVN GYWFF Sbjct: 362 QLLTRLVNRGYWFF 375 >UniRef50_C4LEX7 Cupin 4 family protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LEX7_TOLAT Length = 381 Score = 367 bits (942), Expect = e-100, Method: Compositional matrix adjust. 
Identities = 181/373 (48%), Positives = 244/373 (65%), Gaps = 5/373 (1%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 QL L+ F+ WQK+P VL+ + F DPI+PDELAGLA E +V+SRLV+ DGKW Sbjct: 3 QLNLDLAAFMREFWQKKPTVLRGAYAPFTDPITPDELAGLATEEQVESRLVTFADGKWTA 62 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 HGPF+ Y LGE++W+LLVQA +HW +P A L+ PFR LP+WRIDD+MIS+SVPGGGVG Sbjct: 63 EHGPFDDYSQLGESHWALLVQATDHWIKPVADLITPFRGLPNWRIDDVMISYSVPGGGVG 122 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH+DQYDVFIIQG+G RRWRVG +Q P LL V+ FE IID EL+ GDILYIPP Sbjct: 123 PHIDQYDVFIIQGSGSRRWRVGADTPAEQFVATPGLLHVEQFEPIIDVELQSGDILYIPP 182 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 GFPH+GYA+ AM+YS+G+RAPN ++L S FAD++LQ G Y+DP P V Sbjct: 183 GFPHDGYAITEAMSYSIGYRAPNQQDLFSSFADFLLQENAGQVRYTDPKRELTKTPGLVT 242 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 ++++ LR++M L++ + F +W G +SQ++HEL+I E PDE+ AL+ + L Sbjct: 243 NKDVNDLRDLMRTLLHDEQLFSKWLGTNLSQAKHELNILSQEWDLIPDELLPALEAEDEL 302 Query: 304 VRLGGLRVLRIG---DDVYANGEKIDSPH--RPALDALASNIALTAENFGDALEDPSFLA 358 RLGGLR L D + NGE++ P R + ++ LT + L++P + Sbjct: 303 YRLGGLRCLYFAALPDCCFVNGEQLQIPEGGRALAHLMCNSTVLTHKELQPYLDNPILVD 362 Query: 359 MLAALVNSGYWFF 371 + N GYW+ Sbjct: 363 WICYWFNQGYWYL 375 >UniRef50_C9QJT9 Putative uncharacterized protein n=2 Tax=Vibrionaceae RepID=C9QJT9_VIBOR Length = 377 Score = 361 bits (927), Expect = 2e-98, Method: Compositional matrix adjust. 
Identities = 180/377 (47%), Positives = 247/377 (65%), Gaps = 8/377 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQLT + FL HW K+P V+K GF +FIDPIS DELAGLAME E+DSR +S++D +W Sbjct: 2 YQLTFDLKAFLAEHWHKKPTVIKAGFADFIDPISADELAGLAMEEEIDSRFISNKDNQWS 61 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 +HGP ++ L E++W L+VQA NHWH +A L++ F++LP W DDLM+ FS P G Sbjct: 62 ATHGPLPESHFESLDESHWQLIVQACNHWHLGSAELVQAFKQLPQWLFDDLMVCFSAPEG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGE--KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDI 178 GVGPH+DQYDVFIIQG+G+RRWRVG+ K Q K+ L Q++ FE+IIDE LEPGDI Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGDIDKGQYKESIQAGALRQIEGFESIIDEVLEPGDI 181 Query: 179 LYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 LYIPPGFPHEG LE +M+YS+GFR+P +EL+S FADYVL ++G + +P+ + + Sbjct: 182 LYIPPGFPHEGNTLEPSMSYSIGFRSPKEQELLSNFADYVLAHDIGDVHLHNPEQSAQDN 241 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 ++L Q++ KL +M+ +N + + + G +SQSRH+LDI PE Y E+ + L+ Sbjct: 242 NGELLSQDLAKLTDMLKAALNGEKDIQTFMGAMLSQSRHQLDIVEPEEAYSDTEVSEYLQ 301 Query: 299 QGEVLVRLGGLRVLR---IGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPS 355 G VL ++ GLR L +Y NGE D P+ AL L+ ++ D S Sbjct: 302 SGGVLRKVSGLRALYHQGYFHSIYINGESFDVPNSNMTRALCDYDELSIDSSTGPDLDES 361 Query: 356 FLAMLAALVNSGYWFFE 372 +L LVN GYW+F+ Sbjct: 362 -TQLLTKLVNKGYWYFD 377 >UniRef50_Q5E4F9 Conserved protein n=16 Tax=Gammaproteobacteria RepID=Q5E4F9_VIBF1 Length = 394 Score = 360 bits (924), Expect = 4e-98, Method: Compositional matrix adjust. 
Identities = 171/381 (44%), Positives = 254/381 (66%), Gaps = 11/381 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL+ + +FL +WQK+PV++K GF NF DP++P+ELAGL +E++VDSR +S+ + +W+ Sbjct: 14 YQLSFSLQEFLSEYWQKKPVIIKDGFENFQDPVTPEELAGLTLENDVDSRFISNANNEWK 73 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 HGP E Y+ LGETNWS++VQA NHWH+ A L +PF+++P+W DD+MIS+SVP G Sbjct: 74 AEHGPLSEELYETLGETNWSIIVQAANHWHKGAAELFKPFKQMPNWLFDDIMISYSVPHG 133 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GVGPH+DQYDVFIIQG G+R WRVG+ + ++ H L Q+ FE IID+ LEPGDILY Sbjct: 134 GVGPHIDQYDVFIIQGQGKRHWRVGDIGEYQEEHRHSALKQITGFEPIIDQILEPGDILY 193 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IPPGFPH+GYALE +M+YS GFR+P +ELIS FAD++++ E G +Y +P++ ++H + Sbjct: 194 IPPGFPHDGYALEPSMSYSAGFRSPKEQELISNFADFIIENEKGDVHYHNPELSTQSHGS 253 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ + + L+ MML ++ + KQ+ GE++S SRH L+I P + +E+ + L G Sbjct: 254 EITTRSFEDLKAMMLSAMSDEQTLKQFMGEYLSNSRHHLNIIPDSEKWTTEELLNYLHSG 313 Query: 301 EVLVRLGGLRVL-------RIGDDVYANGEKIDSPHRPALDALASNIA--LTAENFGDAL 351 + L+++ G+R ++ +GE P + D + A +T N L Sbjct: 314 QALIKVAGVRSFYHEVESCEENMTLFIDGESYVFPLKMKNDVITLCEANEVTLNNIEQLL 373 Query: 352 EDPSFLAMLAALVNSGYWFFE 372 DP +A L LVN GY++ E Sbjct: 374 LDPHSVANLLQLVNIGYFYAE 394 >UniRef50_A1STI6 Cupin 4 family protein n=1 Tax=Psychromonas ingrahamii 37 RepID=A1STI6_PSYIN Length = 375 Score = 349 bits (896), Expect = 7e-95, Method: Compositional matrix adjust. 
Identities = 172/372 (46%), Positives = 241/372 (64%), Gaps = 4/372 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 ++L L+ DFL+ +WQK+P V+K+GF +F DPI PDE+AGLAME E++SRL+ +DG+WQ Sbjct: 2 FELNLDINDFLDTYWQKKPTVIKQGFVDFEDPIMPDEMAGLAMEEELESRLIYQEDGEWQ 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF S++ L +LLVQAV+HWH L+RPFR LP+WRIDDLMIS+S P GGV Sbjct: 62 ALSGPFTSFERLENDGATLLVQAVDHWHPDAQELIRPFRFLPNWRIDDLMISYSTPKGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVFIIQG G+R WRVG+K + + H L + F+AIID ELEPGDILYIP Sbjct: 122 GPHIDNYDVFIIQGLGKRHWRVGDKGALPEFAAHDALKHCESFDAIIDVELEPGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 G+PHEGY++E ++NYS+GFRAP+ +L+S F DY + Y+D ++ R P + Sbjct: 182 AGYPHEGYSIETSLNYSIGFRAPDQNDLLSSFTDYCIDTNPAPERYADKEMLLREKPGQI 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 E+++L +ML PE WFG IS+++H+LDIA PE P+ I + L++G Sbjct: 242 ETPELNELHRIMLANCATPEMLMPWFGRMISEAKHDLDIAEPEQPHTAQSILEQLEEGAQ 301 Query: 303 LVRLGGLRVL---RIGDDVYANGEKIDSPHRPALD-ALASNIALTAENFGDALEDPSFLA 358 VRLGGL + + + ++ NGE+ + L L + E + +E+ + L Sbjct: 302 FVRLGGLHAVYFEQAPELLFINGEQFNCEGFTELGHHLCDQDEVGGELYDLLIENKNALI 361 Query: 359 MLAALVNSGYWF 370 + LVN GYW+ Sbjct: 362 LFTDLVNQGYWY 373 >UniRef50_B8K5G8 Cupin superfamily protein n=1 Tax=Vibrio parahaemolyticus 16 RepID=B8K5G8_VIBPA Length = 375 Score = 342 bits (878), Expect = 1e-92, Method: Compositional matrix adjust. 
Identities = 171/376 (45%), Positives = 246/376 (65%), Gaps = 8/376 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL+ + FL ++W K+P V+K G NFIDPISP+ELAGLAME EVDSR V++++G WQ Sbjct: 2 YQLSFDLDSFLAKYWHKQPTVIKHGITNFIDPISPEELAGLAMEEEVDSRFVTNKNGHWQ 61 Query: 63 VSHGPF-ES-YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 HGP ES + L E++W L+VQA NHWH A L+ PF+ LP W DDLM+ +S P G Sbjct: 62 AQHGPLPESLFSQLEESHWQLIVQACNHWHLGAAELVAPFKALPQWLFDDLMVCYSAPQG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDIL 179 GVGPH+DQYDVFIIQG+G+RRWRVG + + Q L Q++ F+AIIDE LEPGDIL Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGAADEGQYQESIQGALRQIESFDAIIDEVLEPGDIL 181 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIPPGFPHEG +E +M+YS+GFR+P +EL+S FADYVL +E G + +P + + + Sbjct: 182 YIPPGFPHEGNTIEPSMSYSMGFRSPKEQELLSHFADYVLAKEKGDVHLHNPQMQTQRNH 241 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 ++L ++ L +M+ + + + + +SQSRH+LDI PE +++Y L+ Sbjct: 242 GEILRSDLTLLTQMLQSALESKQDIENFLALNLSQSRHQLDIVEPEEVISQEQVYAHLEA 301 Query: 300 GEVLVRLGGLRVLRIGDD---VYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSF 356 +V++ GLR L ++ V+ NGE+ L ++T +++ ++E PS+ Sbjct: 302 LGHVVKVSGLRALYHANNANHVFINGEEFSVAEPAFAPILCDQASITLDSY--SIESPSW 359 Query: 357 LAMLAALVNSGYWFFE 372 +A+L LVN GYW+ + Sbjct: 360 IALLTRLVNLGYWYLD 375 >UniRef50_A6F8R4 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6F8R4_9GAMM Length = 379 Score = 320 bits (820), Expect = 5e-86, Method: Compositional matrix adjust. 
Identities = 164/378 (43%), Positives = 234/378 (61%), Gaps = 8/378 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 Y+L L+ DF++ +WQK+P+++K GF +FIDPISPDE+AGLAME +V SR+VS +DGKW+ Sbjct: 2 YKLNLDIADFMQNYWQKKPLLIKAGFKDFIDPISPDEIAGLAMEEDVTSRMVSLEDGKWE 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF +D L + ++LVQA+NHWH+P+A L F +P WR DDLM+S+S GGV Sbjct: 62 AKCGPFTEFDRLEKPGAAILVQAINHWHDPSAELANVFNFIPSWRFDDLMVSYSSDTGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH+D+Y VFIIQG G+R WRVG + + ++ + L + F+A+ID LEPGDILYI Sbjct: 122 GPHVDRYCVFIIQGQGKRHWRVGSQDMNPQEFAANGALKHCEAFDAVIDTVLEPGDILYI 181 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP PHEGYA+ A+NYSVGFRA + +EL++ F DY+LQ++ YSDP + PRA Sbjct: 182 PPYAPHEGYAVGEAINYSVGFRAQDQKELLNDFGDYLLQQDKEFVRYSDPKLQPRAEHGS 241 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE--IYDALKQ 299 + E+ L ++M L+ + G S+S HELD+ PE Y D + D + Sbjct: 242 IESGEVQGLTDIMTSLMADKSVMHDFLGRHYSESAHELDLLVPEGGYIADYAIVVDEIGM 301 Query: 300 GEVLVRLGGLRVL---RIGDDVYANGEK--IDSPHRPALDALASNIALTAENFGDALEDP 354 L ++ GL+ L + + +GE+ D+ ++ L + TA+ +ED Sbjct: 302 ESYLRKVNGLKTLYFPEMPTSCFIDGERYDFDASIAASVQTLCNTTEQTAKELEVLMEDK 361 Query: 355 SFLAMLAALVNSGYWFFE 372 F +L VN GYW FE Sbjct: 362 VFGELLIEWVNLGYWHFE 379 >UniRef50_C4K8V5 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K8V5_HAMD5 Length = 379 Score = 311 bits (796), Expect = 3e-83, Method: Compositional matrix adjust. 
Identities = 154/371 (41%), Positives = 220/371 (59%), Gaps = 7/371 (1%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 L +NW DFL+ +WQK P++LK+ NFI+P+SPDELA L +E ++S+L+ +GK QV Sbjct: 3 LMINWQDFLQHYWQKHPMLLKQAVVNFINPVSPDELAKLVIEKALESQLIKKVNGKCQVV 62 Query: 65 HGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 H F Y LG NWSL VQA+NHWH P M FR PDW +DL +SFSVPGGG+G Sbjct: 63 HNVFNGYKSLGRHNWSLKVQAINHWHRPAEEFMYLFRTFPDWYREDLTVSFSVPGGGLGL 122 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 + DVFIIQG GR RWR+ L H + F +II+EEL GD LYIP G Sbjct: 123 YAKTSDVFIIQGIGRSRWRIWNPLSSVVHYDQKNF-----FPSIINEELVSGDALYIPKG 177 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS-DPDVPPRAHPADVL 243 FPHE + E A++Y + N+ +I + + + + G YS PD+ R P ++L Sbjct: 178 FPHEAISSETALSYCINLWTDNSLRMIRNWTESLSDKNHRGIEYSPSPDLLMRDDPTEIL 237 Query: 244 PQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 PQ++ ++ +M + L+ Q + + WFG+ +SQS ++L +AP YQP ++ L+Q Sbjct: 238 PQDITAIQNIMNQFLLQQRDDLETWFGQQMSQSSYDLPMAPAAQVYQPSQVQSILQQDIS 297 Query: 303 LVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAA 362 L RL GLR+L IGD + NGE + S + A + +A N + ++D F+ L Sbjct: 298 LCRLMGLRMLHIGDRYFLNGESLASNYADAWNIMAHNTTINGYMLRKFIDDNDFMTQLTL 357 Query: 363 LVNSGYWFFEG 373 L+N GYW+F+G Sbjct: 358 LINKGYWYFQG 368 >UniRef50_A3QD76 Cupin 4 family protein n=19 Tax=Shewanella RepID=A3QD76_SHELP Length = 386 Score = 303 bits (777), Expect = 5e-81, Method: Compositional matrix adjust. 
Identities = 147/325 (45%), Positives = 209/325 (64%), Gaps = 6/325 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 Y + +FL HWQK+P+V+K F +F DPI+PDELAGLA E E+ SR+V + W+ Sbjct: 6 YTPNFDTQEFLAHHWQKQPLVIKGAFAHFQDPIAPDELAGLACEEEIASRIVLTKKDNWE 65 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 + GP E Y G+ NW LLVQAVNHW+ L+ FR +PDWR DDLM+S++ PGGGV Sbjct: 66 IFQGPIEDYSPFGDANWQLLVQAVNHWYPDVEPLVNAFRFIPDWRFDDLMVSYATPGGGV 125 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVF++QG GRRRW+VG K Q +D FE I+D LE GD+LYIP Sbjct: 126 GPHIDNYDVFLLQGEGRRRWKVGAKGQYSPRGGDTHTALIDDFEPILDVVLEAGDMLYIP 185 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPH G LE A++YS+GFRAP+ +EL S AD+++ G ++ P A P + Sbjct: 186 PGFPHRGETLETALSYSIGFRAPSQQELFSSIADHLIDTNGGNKRFTSNQEP--ASPGLL 243 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 ++ + ++ E+++QP+H++ G+ +SQ+R ELD+A Y +E+ +AL+ G Sbjct: 244 SVEQQAGMLALVSEILSQPDHYQTVLGQTLSQNRFELDLAEQGESYSQEELMEALEDGAC 303 Query: 303 LVRLGGLRVLRIGDD----VYANGE 323 L R+GGL+V+R+ D ++ NGE Sbjct: 304 LQRIGGLKVIRLEGDKHLRLFINGE 328 >UniRef50_A1RJT3 Cupin 4 family protein n=14 Tax=Alteromonadales RepID=A1RJT3_SHESW Length = 386 Score = 296 bits (757), Expect = 1e-78, Method: Compositional matrix adjust. 
Identities = 156/370 (42%), Positives = 230/370 (62%), Gaps = 11/370 (2%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFES 70 +FL ++WQK+P+V+++GF F D +SP+ELAGLAM+ V+SR V Q G+W GPF+S Sbjct: 12 EFLAQYWQKKPLVIRQGFKQFQDLVSPEELAGLAMDELVESRRVYQQAGQWHAEFGPFDS 71 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 Y+ LGE +W+L+VQA+N+W AL++ F +P WR DD+M+S++ PGGGVGPH+D YD Sbjct: 72 YEKLGERDWTLIVQALNNWVPDAEALIQCFDFIPRWRFDDVMVSYATPGGGVGPHIDLYD 131 Query: 131 VFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGY 190 VFI QG+GRRRWRVG++ ++ HP LL + FE IID EL PGDILYIPPGFPH+G Sbjct: 132 VFICQGSGRRRWRVGDRGPHREFAAHPALLHTEAFEPIIDTELLPGDILYIPPGFPHDGI 191 Query: 191 ALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKL 250 LE ++++SVG+R + +++ S AD++ + +LG +DP+ V ++ +L Sbjct: 192 TLEESLSFSVGYRTASAKDMFSALADHLSEHDLGAQQIADPERQVSHRSGCVDNNDLARL 251 Query: 251 REMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP-PYQPDEIYDALKQGEVLVRLGGL 309 R + ++N + ++ G +++QS+ LD+ P EP DE+ L + + L+RLGGL Sbjct: 252 RSQLTSMLND-KLVSEFSGRYLTQSKCALDL-PDEPLDITQDEVLAWLDE-QPLIRLGGL 308 Query: 310 RVLRIGDDV-----YANGEKIDSPHRPA--LDALASNIALTAENFGDALEDPSFLAMLAA 362 R L V + NGE+ P A + L L L++ LA L Sbjct: 309 RCLYFDVSVEQGTIFINGERYQLPVELAGIIPLLCDMSQLDKTALLPWLDNADGLAQLTE 368 Query: 363 LVNSGYWFFE 372 VN GYW+FE Sbjct: 369 WVNLGYWYFE 378 >UniRef50_C3M8B3 Putative uncharacterized protein n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8B3_HAMD5 Length = 367 Score = 288 bits (737), Expect = 2e-76, Method: Compositional matrix adjust. 
Identities = 146/370 (39%), Positives = 215/370 (58%), Gaps = 6/370 (1%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 L +NW DFL HWQKRPV+LK+ ++F++PISP+EL L ++ ++ +L+ GK Q+ Sbjct: 2 HLIINWEDFLHHHWQKRPVLLKQSISDFVNPISPEELETLVIKKALECQLIQRSHGKCQL 61 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 + F Y LG+ NWSL V+A++H H + FR PDW ++L FSVPGGG+G Sbjct: 62 GYQAFNGYGSLGQRNWSLRVEALHHCHRAAEEFLSLFRIFPDWYTEELTTFFSVPGGGIG 121 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 P DV +IQG G RWRVG+ + P D Q D EA++DEEL GD+LYIP Sbjct: 122 PQTRPSDVLVIQGMGSSRWRVGD----RGASPAFDYGQNDFSEAMVDEELSAGDMLYIPK 177 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVL-QRELGGNYYSDPDVPPRAHPADV 242 FPHE + E AM+Y + F N+ +I + + + + G Y PD+ R P ++ Sbjct: 178 VFPHEATSTEAAMSYCLNFWTDNSLRMIRNWTESLSDENHRGIEYAPSPDLLLRDDPTEI 237 Query: 243 LPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 LPQ++ L+EMM + L+ + EH + WF + +SQ+ +EL AP Y ++ L++G Sbjct: 238 LPQDITALQEMMSQFLLKKREHLENWFAQEMSQTSYELPKAPAAKVYSVSQVQTLLQKGS 297 Query: 302 VLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 L RL GLR+L IG+ + NGE +DS H A + LA + + + + FLA L Sbjct: 298 RLNRLMGLRMLHIGNRYFVNGESLDSDHADAWNVLARHRTIEGPMLIKFINEADFLAELT 357 Query: 362 ALVNSGYWFF 371 ++N GYW+F Sbjct: 358 LIINKGYWYF 367 >UniRef50_Q15T89 Cupin 4 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15T89_PSEA6 Length = 389 Score = 280 bits (716), Expect = 5e-74, Method: Compositional matrix adjust. 
Identities = 152/375 (40%), Positives = 219/375 (58%), Gaps = 19/375 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 FL+ HWQKRPVV K F+ F+DP+ +ELAGLA + +DSR+VS ++ W V HGP + Sbjct: 12 FLDSHWQKRPVVFKGAFSQFVDPLDENELAGLAQDPRIDSRIVSSENANWHVQHGPISDF 71 Query: 72 DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 +H + +WSLLVQ+V+ + AL+R F +P WR+DDLM+SFS G GVGPHLDQYDV Sbjct: 72 EHACQGSWSLLVQSVDQHVDEADALIRMFNFIPYWRLDDLMVSFSNTGAGVGPHLDQYDV 131 Query: 132 FIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYA 191 FIIQG G RRW+ G++ + + PHPDL Q+ F IIDE L GD+LYIP G PH G A Sbjct: 132 FIIQGKGSRRWQAGKRGEYSTYHPHPDLSQIQGFTPIIDEVLHSGDMLYIPAGCPHNGVA 191 Query: 192 LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLR 251 LE+ MNYSVGFRAP ++L+S ADY + + Y D + PR P+++ +E+ R Sbjct: 192 LEDCMNYSVGFRAPTQQDLLSSLADYSIDLGIFKKRYQDKGLTPRFDPSELAQEEIHSFR 251 Query: 252 EMMLELINQPEHFKQWFGEFISQSRHELDIAPPE---PPYQPDEIYDALKQGEVLVRLGG 308 M+ + I+ P+ F +W S + +L+ E P Y EI +Q V R G Sbjct: 252 NMLHDAIDSPD-FTRWLTSHFSDT--QLNQGYDEQHNPDYSLQEILVLFQQQTVFERQPG 308 Query: 309 LRVLRIGD-------DVYANGEKIDSP--HRPALDALASNIALTAE---NFGDALEDPSF 356 +R + + + + G+ +P H A+ A + + + + G A+ F Sbjct: 309 IRPIYLAQSDENTSLEFFIEGQAFFAPPEHAQAVRAFLQSASWQFDLHSDKGTAVTINHF 368 Query: 357 -LAMLAALVNSGYWF 370 + +++ LVN+G W Sbjct: 369 WVQLISELVNAGAWL 383 >UniRef50_Q5QZ10 Cupin superfamily protein n=2 Tax=Idiomarina RepID=Q5QZ10_IDILO Length = 380 Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust. 
Identities = 144/326 (44%), Positives = 200/326 (61%), Gaps = 9/326 (2%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ---DGK 60 +L + DFL +WQK+P ++++GF +F DP+SP+ LAGLAME DSR++ + + Sbjct: 2 KLVFDKDDFLTNYWQKKPCLIRQGFADFSDPVSPEILAGLAMEEGADSRVIESKADTESG 61 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 W V+HGPFE Y+ GET+W+LLVQ+VN W L+ PFR LPDWRIDD+M+SFS G Sbjct: 62 WLVTHGPFEDYEKFGETDWTLLVQSVNEWLPDVGELITPFRFLPDWRIDDVMVSFSCENG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD-PFEAIIDEELEPGDIL 179 GVGPHLDQYDVFIIQG G R WRVGEK M+++ P DL Q+ F A+I+E L GD+L Sbjct: 122 GVGPHLDQYDVFIIQGAGSRHWRVGEKQAMQEYQPAEDLCQIKGEFNAVINEHLTAGDVL 181 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIP G PH+G +LE ++NYSVGFRAP+ EL+ D +Q++ Y DP + Sbjct: 182 YIPAGCPHDGISLEPSLNYSVGFRAPSKAELLLQLGDIAMQQKSLQERYQDPALSSEDVS 241 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 + ++ L++ + + + E + ISQS+ L PE P ++I L Q Sbjct: 242 WVIEKTQLSALKQFLKDALESDET-DALLAKIISQSKRPL--PEPELPTIAEQIPTLLAQ 298 Query: 300 GEVLV-RLGGLRVLRIGD-DVYANGE 323 + + G R L++ D Y NGE Sbjct: 299 QNAFIEKTSGARFLKLSDTQFYGNGE 324 >UniRef50_Q1NG82 Putative uncharacterized protein n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NG82_9SPHN Length = 380 Score = 263 bits (673), Expect = 5e-69, Method: Compositional matrix adjust. 
Identities = 152/373 (40%), Positives = 219/373 (58%), Gaps = 19/373 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF--E 69 FL HWQK+P++++ + + +P+ PDELAGLA E V+SR+V DG W + HGPF + Sbjct: 10 FLRDHWQKQPLLIRNPWGAWANPLEPDELAGLACEEGVESRIVVQTDGDWALEHGPFADD 69 Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 + LG + W+LLVQAV+H AAL+ PFR +PDWRIDD+M+S++ GGGVGPH DQY Sbjct: 70 RFATLGGSPWTLLVQAVDHHAPDVAALIAPFRFIPDWRIDDVMVSYASDGGGVGPHFDQY 129 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHE 188 DVF++QG GRRRWRVG++ PH DL + F A + LEPGDILY+PPGF HE Sbjct: 130 DVFLVQGLGRRRWRVGQRCDRDTALRPHRDLRLLPDFAATDEWVLEPGDILYVPPGFAHE 189 Query: 189 GYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 G A+ ++ M YS+GFRAP+ +++ +AD++ + + Y+DPD+ P A+P ++ P + Sbjct: 190 GVAVGDDCMTYSIGFRAPSRPDMLVEWADHLAAQMPDDDLYADPDIQPAANPGEIEPDAI 249 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG 307 +L EM + + F WFG+ ++ ++ PE P +E+ + G L R Sbjct: 250 ARLHEMTIAAMADRSAFAAWFGQHVTTPKYPDADWRPEEPVTAEELLALIDAGAQLWRNP 309 Query: 308 GLR--VLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGD--ALE-DPSFLA---M 359 R LR D V +D P A ++A+ A+ AL DPS +A + Sbjct: 310 ASRFAFLREEDGVTLF---VDGSAYPC----AGDLAILAQQLCAYPALALDPSMVAGVGL 362 Query: 360 LAALVNSGYWFFE 372 L LVN G E Sbjct: 363 LVTLVNQGSLMIE 375 >UniRef50_A0YBW0 Transcription factor jumonji, jmjC n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YBW0_9GAMM Length = 391 Score = 261 bits (667), Expect = 3e-68, Method: Compositional matrix adjust. 
Identities = 137/382 (35%), Positives = 218/382 (57%), Gaps = 15/382 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ DFL WQ +P +++ F NFI+P+SP++LAGLA E+E++SRL++ +GKWQ SH Sbjct: 5 NLDIADFLANTWQTKPRLIRNAFPNFINPMSPEDLAGLACEAEIESRLITEANGKWQTSH 64 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GP ++++L ++NW+LLVQAV+HW A L+ FR +P WRIDD+M+S++ GG VG Sbjct: 65 GPIAETTFNNLSDSNWTLLVQAVDHWVPEVADLLDNFRFIPSWRIDDVMVSYATRGGSVG 124 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D YDVF++QG G+RRW+VG +P+L + F A + LE GD+LYIP Sbjct: 125 PHYDNYDVFLVQGAGQRRWQVGGPCSAANSLQNNPELRLLADFVAEEEWVLEAGDMLYIP 184 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PG H G A++N M YS+GFRAP+ E++S F D L Y+DP + + H + Sbjct: 185 PGISHWGTAMDNDCMTYSIGFRAPSHSEMLSDFCDDTLAGLTEELRYADPGLQEQGHSGE 244 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAP-PEPPYQPDEIYDALKQG 300 ++P + + ++ +N + +WFG +++Q ++ + A + +Q ++ LK Sbjct: 245 IMPAAISNAQRILQNYVNDEQRLTEWFGRYVTQQKYPAETADNTDEKFQQGDLVQLLKDD 304 Query: 301 EVLVRLGGLRVLRIGDD-------VYANGEKIDSPHRPAL---DALASNIALTAENFGDA 350 V++R +R+ I + + NG +S + LA N + + Sbjct: 305 GVILRDPTVRIAFIDAESPSNSLLFFVNGVCFESVGDSCIALSKLLADNTRICSGQIMPW 364 Query: 351 LEDPSFLAMLAALVNSGYWFFE 372 L D + +L LVN G +F+ Sbjct: 365 LGDTESVQLLLRLVNQGVLYFD 386 >UniRef50_Q1QUR4 Cupin 4 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QUR4_CHRSD Length = 397 Score = 260 bits (665), Expect = 5e-68, Method: Compositional matrix adjust. 
Identities = 146/368 (39%), Positives = 220/368 (59%), Gaps = 14/368 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--DGKWQVSHGPFE 69 FL +WQK+P++++ F +F P++P+ELAGLA E +++RLV Q D WQVSHGPF+ Sbjct: 18 FLRDYWQKKPLLIRGAFPDFASPLAPEELAGLACEDGIEARLVEAQGPDKPWQVSHGPFD 77 Query: 70 --SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 ++ L + W+LLVQAV+H+ AAL+ F LP WR+DD+M+S++ P G VGPH+D Sbjct: 78 DATFARLPDREWTLLVQAVDHYVPEVAALLDAFDFLPRWRLDDVMVSYAPPEGSVGPHVD 137 Query: 128 QYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEE--LEPGDILYIPPG 184 YDVF++QG+G+RRW++ GE+ DL ++ FE DE+ LEPGD+LY+PP Sbjct: 138 NYDVFLLQGSGQRRWQLGGEQPDDAPIVSGIDLRMLERFEVTADEDWVLEPGDMLYLPPR 197 Query: 185 FPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 H G + + M YS+GFRAP+ E+I+ FADY+ + + Y+DPD+ P AH + Sbjct: 198 IAHHGVSQSADCMTYSIGFRAPSADEVITSFADYLGEMQPDSRRYTDPDLAPCAHAGQLD 257 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 Q + +LR +ML +I+ P QWFG ++Q ++ +AP + P +AL QG L Sbjct: 258 DQAIARLRRLMLSVIDDPAQMAQWFGRVMTQPKYVDQLAPLDTPMDSAATAEALAQGRYL 317 Query: 304 VRLGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAM 359 R G R +D ++ +G+ P P LA L A + L+D + L++ Sbjct: 318 ERALGSRFAFHDEDGETTLFVDGDGHACPP-PLARLLADTTPLHAATLAEHLDDAA-LSL 375 Query: 360 LAALVNSG 367 L L+N G Sbjct: 376 LTELLNRG 383 >UniRef50_UPI0000E0F5AA putative enzyme with RmlC-like domain n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F5AA Length = 381 Score = 260 bits (664), Expect = 6e-68, Method: Compositional matrix adjust. 
Identities = 142/370 (38%), Positives = 208/370 (56%), Gaps = 20/370 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 FL +WQ++P VL NF DP+ +LAGLA E ++DSR++S DG W+V+ GPF + Sbjct: 12 FLAENWQRKPCVLHNALPNFEDPLDEHDLAGLAQEQDIDSRVISQMDGDWKVTEGPFTEF 71 Query: 72 DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 + + + W+LLVQ V+ E + LM F +P WR+DDL++S+S PG GVG H+DQYDV Sbjct: 72 EDVCKGAWTLLVQGVDTHIESASLLMNAFNFIPHWRMDDLLVSYSQPGAGVGAHIDQYDV 131 Query: 132 FIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGY 190 FI+QG G RRW+VG+K ++ ++ PHP L Q+D FE IID EL PGDILYIPPGFPH+G Sbjct: 132 FIVQGKGTRRWQVGDKSMKYAKYYPHPKLQQIDEFEPIIDVELLPGDILYIPPGFPHKGQ 191 Query: 191 ALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKL 250 ++ MNYSVGFRAP+ EL AD +L + + D + P+ + P+++ L Sbjct: 192 SITECMNYSVGFRAPDQTELFQAIADDLLDSDKLTRRFIDRNRTYIDRPSAISPKDIMLL 251 Query: 251 REMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLR 310 ++++ + + + Q + +S+ LD P E P I + QG L G+R Sbjct: 252 KQLLQQYVTNTQ-IDQVLTQHLSKQNEYLDAFPLETPLPSSYILALINQGVTLQLACGVR 310 Query: 311 VLRIGDDV------YANGEKIDSPHRP------ALDALASNIALTAENFGDALEDPSFLA 358 + + V + NG K + LD + I AE D +E Sbjct: 311 PVYLDYQVDDEFIFFINGHKFSTSATARLETSRLLDNHQTFIKFNAELTHDWIE------ 364 Query: 359 MLAALVNSGY 368 ++ L+N GY Sbjct: 365 LIRELINLGY 374 >UniRef50_B4RRX0 Putative enzyme with RmlC-like domain n=2 Tax=Alteromonas macleodii RepID=B4RRX0_ALTMD Length = 388 Score = 252 bits (644), Expect = 1e-65, Method: Compositional matrix adjust. 
Identities = 139/371 (37%), Positives = 213/371 (57%), Gaps = 14/371 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 FL+ +WQ++PVV+K+ F +F DPI ++LAGLA ESEVD+R++S+ G W V GP + Sbjct: 23 FLKHYWQQKPVVIKQFFTDFDDPIDENDLAGLAQESEVDARVISNVQGNWHVEQGPITDF 82 Query: 72 DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 DH + W+LLVQ V+ + A ++ PF +P WR+DDLM+SF+ G GVG H+DQYDV Sbjct: 83 DHACQGKWTLLVQGVDKYVPDVAPILSPFSFVPHWRLDDLMVSFATNGAGVGAHIDQYDV 142 Query: 132 FIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYA 191 F++QG G+RRWRVG+ K+ PHP L Q++ F +ID +EPGD++Y+PPG+PH+G Sbjct: 143 FLVQGKGKRRWRVGQPGDYKEVFPHPKLRQIERFTPVIDVVVEPGDVIYVPPGWPHDGET 202 Query: 192 LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLR 251 +E+++ YSVG+RAP+ +L A +L + ++D + +PA V ++ L+ Sbjct: 203 VEDSLTYSVGYRAPDNLQLAESLA-MMLDKGAHNYRFTDIGRTHQNNPALVSTSDIAALK 261 Query: 252 EMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRV 311 + +++ IN E F E + S + P + +++ + L G G+R Sbjct: 262 QQLIDAING-EDFTLALLE--AMSEQGIPEYPLDNEVNLEQVSNDLAAGMSFAPAPGVRA 318 Query: 312 LRIGDD------VYANGEKIDSPHRPA--LDALASNIALTAENFGDALEDPSFLAMLAAL 363 L +Y NG + + LAS L A DA +FL L L Sbjct: 319 LLCDGKRGLPRALYVNGSQFTFAKNDQEWFEVLASGSILNATCCQDA-PSFTFLETLTTL 377 Query: 364 VNSGYW-FFEG 373 +N+GYW +FEG Sbjct: 378 INNGYWEWFEG 388 >UniRef50_Q48H58 YcfD protein n=22 Tax=Gammaproteobacteria RepID=Q48H58_PSE14 Length = 388 Score = 251 bits (640), Expect = 4e-65, Method: Compositional matrix adjust. 
Identities = 140/370 (37%), Positives = 217/370 (58%), Gaps = 12/370 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVSHGPF-- 68 FL +WQK+P+++++ +F PI DELAGLA+E EV+SRLV H + W++ GPF Sbjct: 18 FLRDYWQKKPLLIRQALPDFQSPIDADELAGLALEEEVESRLVLEHGERPWELRRGPFAE 77 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 + + L E +W+LLVQAV+ + + L+ FR LP WRIDD+MIS++ PGG VGPH D Sbjct: 78 DEFSKLPERDWTLLVQAVDQFVPEVSELLENFRFLPSWRIDDVMISYAAPGGSVGPHFDN 137 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQ-HCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 YDVF++QG G+R W++G+ + H DL + FE + LEPGD+LY+PP H Sbjct: 138 YDVFLLQGHGKRHWQIGQMCDAESPMLQHADLRILAEFEKTEEWTLEPGDMLYLPPRLAH 197 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 G A+++ + YSVGFRAP+ E+++ F D++ Q Y+D D P + P + + Sbjct: 198 CGVAVDDCLTYSVGFRAPSAAEVLTLFTDFLSQFIPDEERYTDADAQPVSDPHQIQHDAL 257 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG 307 D+L+ ++ E ++ WFG+F+++ R+ + PE + D++ D+L+QG VL+R Sbjct: 258 DRLKALLTEHMSDERLLLTWFGQFMTEPRYPELVTGPE--LEEDDLLDSLEQGAVLIRNP 315 Query: 308 GLRVL--RIGDD--VYANGEKIDSPH--RPALDALASNIALTAENFGDALEDPSFLAMLA 361 R+ + DD ++A+G+ P R L + + AL +EN G L D +L Sbjct: 316 SARLAWSEVDDDLLLFASGQSRLLPGSLRELLKLICAADALHSENLGQWLADDDGRNLLC 375 Query: 362 ALVNSGYWFF 371 LV G F Sbjct: 376 ELVKQGSLGF 385 >UniRef50_Q2S4H4 Cupin superfamily protein n=3 Tax=Bacteria RepID=Q2S4H4_SALRD Length = 394 Score = 247 bits (631), Expect = 4e-64, Method: Compositional matrix adjust. 
Identities = 143/370 (38%), Positives = 216/370 (58%), Gaps = 13/370 (3%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK--WQVSHGPF 68 DFL+ +WQ+RP+V++ +F P+SP+ELAGLA E V+SRL+ + G+ W++ HGPF Sbjct: 18 DFLDTYWQERPLVVRDALPDFRSPLSPEELAGLACEDGVESRLILEEGGEHPWELRHGPF 77 Query: 69 --ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 E + HL ET+W+LLVQ V+ AL+ FR LPDWR+DD+M+S++ G VGPH+ Sbjct: 78 ASEEFLHLPETHWTLLVQEVDRLIPEVGALLDRFRFLPDWRLDDVMVSYAPTHGTVGPHI 137 Query: 127 DQYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 D YDVF++QG G RRW++G E + ++ P D+ + FEA + L PGD+LY+PP Sbjct: 138 DNYDVFLLQGAGHRRWQIGTEPVDDEEIVPDLDVRILADFEAEEEFVLGPGDLLYLPPRV 197 Query: 186 PHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 H G A ++ M YSVGFRAP ++L+ F + YSDPD+ P HP ++ Sbjct: 198 AHYGVATDDQCMTYSVGFRAPRHQDLVGNFLQQAMDTVGPDARYSDPDLSPVDHPGEIHD 257 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 +R ++ +L+ + QWFG+++++ + + PPE P DE+ D L+ G L Sbjct: 258 DARQTVRRLLRDLVRDDDAIDQWFGQYLTRPGRDREAVPPETPVTDDELTDMLRAGHGLR 317 Query: 305 RLGGLRVLRIGDD-----VYANGEKID-SPHRP-ALDALASNIALTAENFGDALEDPSFL 357 R+ I D ++ANG ID SP R A + + ++ LED +F+ Sbjct: 318 PGPVSRLAFIEHDDGSVTLFANGSPIDLSPDRAYAARLVTGRQQIPSDALTPHLEDDAFV 377 Query: 358 AMLAALVNSG 367 +L AL+N G Sbjct: 378 DLLVALINDG 387 >UniRef50_Q2BJ43 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ43_9GAMM Length = 382 Score = 236 bits (602), Expect = 1e-60, Method: Compositional matrix adjust. 
Identities = 136/378 (35%), Positives = 216/378 (57%), Gaps = 30/378 (7%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQDGKWQVSHGPF- 68 FL+ +WQK+P++++ F +F P++ DELAG+A+E EV+SRL+ S W++ HGP Sbjct: 13 FLKEYWQKKPLLIRNAFPDFEPPVTADELAGMALEEEVESRLIIQSADGADWELKHGPLN 72 Query: 69 -ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 E++ L +++W+LLVQAV+HW A L+ FR P+WR+DDLMIS++ GGGVGPH D Sbjct: 73 EETFAELPDSHWTLLVQAVDHWVPEAAELVEQFRFAPNWRLDDLMISYASDGGGVGPHYD 132 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD---LLQVDPFEAIIDEELEPGDILYIPPG 184 YDVF+IQ TG RRW VG + P D ++ + +EA +L+PGD+LY+PP Sbjct: 133 NYDVFLIQATGTRRWEVGGIF--DEDSPRRDDVPVMILPEWEAEQSWDLQPGDMLYLPPR 190 Query: 185 FPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 H GYAL ++ M SVGFRAP+ +E+ +GF +Y+ + YSDPD+ +A+P ++ Sbjct: 191 VGHNGYALGDDCMTLSVGFRAPSHQEIFAGFTNYLDNITCAEDRYSDPDLKTQANPGEID 250 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 + + +++ ++ E I P WFG+F+++S++ A + +I + ++ G L Sbjct: 251 QEAIGRVQAILREYIADPALLSHWFGQFMTESKYPDLGADQSQEMEEGDIKNLIEAGVPL 310 Query: 304 VRLGGLR---------VLRI-GDDVYANGEKIDSPHRPALDALASNIALTAENFGDALED 353 R G R VL + G + +I+ R D + I + EN Sbjct: 311 CRTEGSRFAYHQGQPFVLFVDGKGCACSPGQIELAKRLCADLYHTEIETSEEN------- 363 Query: 354 PSFLAMLAALVNSGYWFF 371 L ++ AL+ G +F Sbjct: 364 ---LQLIKALLLQGSLYF 378 >UniRef50_A6W0E5 Cupin 4 family protein n=2 Tax=Marinomonas RepID=A6W0E5_MARMS Length = 400 Score = 230 bits (587), Expect = 6e-59, Method: Compositional matrix adjust. 
Identities = 119/322 (36%), Positives = 193/322 (59%), Gaps = 8/322 (2%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK-WQVSHGPF-- 68 F++ +WQK+P++++ G NF P+ DELAG+AME E++SR+V + W++ GPF Sbjct: 19 FIDEYWQKKPLLIRGGLVNFTLPLEADELAGMAMEEEIESRIVIENGLRPWEMRQGPFTE 78 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 +++ L E W+LLVQAV+HW L F LP WR+DD+M+S++ GG VGPH DQ Sbjct: 79 DTFATLPEKEWTLLVQAVDHWVPEVQTLKEKFEFLPSWRLDDVMVSYATEGGSVGPHYDQ 138 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVD--PFEAIIDEELEPGDILYIPPGF 185 YDVF++Q +G+RRW+V + + P+ L +D P +D EL+ GDILY+PP F Sbjct: 139 YDVFLVQVSGKRRWQVLSPDEYQDSAIPNIKLHILDNFPVNPEMDWELDAGDILYLPPNF 198 Query: 186 PHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 H G +L++ M YS+GFRAP+ +++++G D + + E + ++ P+ R H A + Sbjct: 199 AHNGRSLDDECMTYSIGFRAPSMQDILTGVRDKLCETENVKDRFAAPETANRQHSAHISK 258 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 ++ L+ + LINQP+ +W GE +S+S++ +AP + +E + + QG+ + Sbjct: 259 DDIQYLQTQLARLINQPDLLAEWLGETMSESKYPEYLAPLNHE-EVNEAFSSATQGQTFI 317 Query: 305 RLGGLRVLRIGDDVYANGEKID 326 R G R+ N KI+ Sbjct: 318 RPGDARICYYIQQSTENNGKIN 339 >UniRef50_Q1N4P0 Transcription factor jumonji, jmjC n=1 Tax=Bermanella marisrubri RepID=Q1N4P0_9GAMM Length = 386 Score = 229 bits (585), Expect = 1e-58, Method: Compositional matrix adjust. 
Identities = 114/298 (38%), Positives = 182/298 (61%), Gaps = 4/298 (1%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVSHGPF-- 68 FL+ +WQK+PV++++ NF PI PD+LAGL++E +V+SR++ + D WQ+ HGPF Sbjct: 12 FLKDYWQKKPVLIRQALPNFTPPIEPDDLAGLSLEEDVESRIILENGDTPWQLIHGPFSE 71 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 E++ +L E W+LLVQ V+ W + L+ F+ +P WR+DD+M+SF+ GG VGPH DQ Sbjct: 72 ETFGNLPEEKWTLLVQGVDQWVPEMSELLSYFQFIPKWRLDDIMVSFAPKGGSVGPHFDQ 131 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQ-HCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 YDVF++Q GRR W++G K L ++ E + LEPGD+LYIPP + H Sbjct: 132 YDVFLLQAQGRRHWQIGPKYDASSPRIKDTPLHLLENMEVTEEWTLEPGDMLYIPPQYAH 191 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 G A+++ M +SVGFRAP+ E++SG + + + + Y D D+ A PA + Sbjct: 192 NGVAVDDCMTFSVGFRAPSEAEILSGITQHAMDQLTEADRYHDEDLKASAQPALIDQAAF 251 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 D+L++++ + N E ++WF E ++QS++ P E P +E+ L+ V+ + Sbjct: 252 DRLQQIIAKHANNTELMQEWFAECMTQSKYPELAEPLEDPLDWEEVAPLLQNDTVISQ 309 >UniRef50_B7RUZ0 Cupin superfamily protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RUZ0_9GAMM Length = 377 Score = 222 bits (566), Expect = 1e-56, Method: Compositional matrix adjust. 
Identities = 109/278 (39%), Positives = 165/278 (59%), Gaps = 2/278 (0%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 +++L L+ FL +HWQK P++++ NF PIS DELAGLA E EV++R+V HQ+ W Sbjct: 4 DWELNLDKEQFLAQHWQKAPLLIRGAIKNFKPPISSDELAGLAYEEEVEARIVEHQEDNW 63 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 Q+ HGPF + D+ + W+LLVQAV+ + A L + +P WR+DD+M S++ GG Sbjct: 64 QLFHGPFSATDYQRKHPWTLLVQAVDQYIPEVAQLRKLVDFIPQWRVDDVMASYASDGGS 123 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ-HCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF++QG G R W+ G+ H L + F + LEPGDILY Sbjct: 124 VGPHFDNYDVFLLQGEGHRLWKTGQFCDSSSPLVDHDSLRLLSQFNTEAEYLLEPGDILY 183 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 +PPG H G A +S+GFRAP E++S F D ++++ +YSD + P Sbjct: 184 VPPGIAHWGTAQGECTTFSIGFRAPRITEMVSRFTDALIEQLDPDLFYSDARIEVATRPG 243 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 ++ P+++D++ + ++Q E WFGE ++ R+E Sbjct: 244 EIRPRDLDRVSAQIQAALDQSEG-NHWFGELATEPRYE 280 >UniRef50_B3PKY0 Putative uncharacterized protein n=2 Tax=Pseudomonadaceae RepID=B3PKY0_CELJU Length = 396 Score = 219 bits (558), Expect = 1e-55, Method: Compositional matrix adjust. 
Identities = 127/365 (34%), Positives = 212/365 (58%), Gaps = 13/365 (3%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK-WQVSHGPFE 69 +FL +WQK+P++++ F F PI+PDELAGLA+E EV+SR+V W++ +GPF+ Sbjct: 27 EFLRDYWQKKPLLIRNAFPGFESPIAPDELAGLALEEEVESRIVLENGATPWELRNGPFD 86 Query: 70 --SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 ++ L E W+LLVQAV+ W L+ FR +P+WR+DDLMIS++ GGVGPH D Sbjct: 87 EDTFAKLPEKRWTLLVQAVDQWVPEVNQLLDYFRFIPNWRLDDLMISYAPDQGGVGPHFD 146 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV-DPFEAIIDEELEPGDILYIPPGFP 186 YDVF++QG G+R W++G+ L++ F + LEPGD+LYIPPG Sbjct: 147 YYDVFLLQGLGKRHWKIGQVCDNNSPRVEGTRLKILSEFHTTDEWVLEPGDMLYIPPGIA 206 Query: 187 HEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 H G A+ ++ M YS+GFRAP+ +++S V YSDPD+ +++P ++ P+ Sbjct: 207 HWGNAVGDDCMTYSIGFRAPSHADILSEIGQEVALNIADDLRYSDPDLKRQSNPGEIGPE 266 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP-PYQPDEIYDALKQGEVLV 304 + +L+ ++ + + PE WFG+++++ ++ L+ EP D+ AL G++L Sbjct: 267 AIAQLQHIIQQHLT-PETIAHWFGKYMTERKY-LEQTDEEPLEIDADDWQAALADGQLLW 324 Query: 305 RLGGLRVL----RIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360 R R+ + G ++A+GE I+ R + + + ++ +++P +A L Sbjct: 325 RHPAARLAFHSDKNGTFLFADGEAINC-SRELAELVCAETEISWVQIKPFVQEPFDVAAL 383 Query: 361 AALVN 365 + L+N Sbjct: 384 SQLIN 388 >UniRef50_D2UDU1 Putative uncharacterized protein n=1 Tax=Xanthomonas albilineans RepID=D2UDU1_XANAL Length = 415 Score = 218 bits (555), Expect = 2e-55, Method: Compositional matrix adjust. 
Identities = 139/386 (36%), Positives = 207/386 (53%), Gaps = 24/386 (6%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG-- 59 +Y L ++ FL +WQKRP++++ F +F+ PI PD+LAGLA E SRLV H Sbjct: 23 QYPLGMSAASFLRDYWQKRPLLIRNAFPDFVSPIEPDDLAGLACEEAALSRLVIHDRATD 82 Query: 60 KWQVSHGPFESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 +W + +GPF+ ++ G + +W+LLVQ V+ W AL+ FR LP WR+DD+M+SF+ Sbjct: 83 RWSLRNGPFQEHEFPGMPDHDWTLLVQDVDKWDPDIRALLGQFRFLPRWRVDDVMVSFAA 142 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ------VDPFEAIIDE 171 GG VG H+D YDVF++Q GRRRW++ M + P + + + F D Sbjct: 143 RGGSVGAHVDHYDVFLLQAHGRRRWQIDASASMGRPPPPTEFREDVELKLLRQFAPTHDW 202 Query: 172 ELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDP 231 LEPGD+LY+PP PH G A + + +SVG RAP++ ELI+ + D ++ Y D Sbjct: 203 VLEPGDMLYLPPMVPHHGVAEDACLTFSVGMRAPSSAELIADYLDTLIDGADEALRYHDE 262 Query: 232 DVPPRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPE--PPY 288 D+ P ++ M ++ E + L +N P+ WFG FI+ R +I PP PP Sbjct: 263 DLLAPTDPHEIDAAAMGRVVEALNALRMNDPDRLGAWFGRFITTYRAGGEILPPSNLPPV 322 Query: 289 QPDEIYDALKQGEVL-----VRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALT 343 + E AL QG VL RL R R G ++ NG + P + A LA+ L Sbjct: 323 E--ETAAALAQGLVLQRHPWARLAWRRASR-GAMLFCNGMEFALPIQDA-KRLAAAEHLD 378 Query: 344 AENFGDALEDPSFLAMLAALVNSGYW 369 A ++ A + L L+ SG++ Sbjct: 379 ATDY--AALSATGRQTLLQLIQSGFY 402 >UniRef50_A6F0B9 Transcription factor jumonji, jmjC n=1 Tax=Marinobacter algicola DG893 RepID=A6F0B9_9ALTE Length = 383 Score = 216 bits (550), Expect = 1e-54, Method: Compositional matrix adjust. 
Identities = 128/382 (33%), Positives = 201/382 (52%), Gaps = 29/382 (7%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-GK-WQVSHGPF 68 +FL +WQK+P+V+++ F F P+S DELAGLA E V+SR+V D GK WQ+ +GPF Sbjct: 11 EFLRDYWQKKPLVIRQAFAGFECPVSADELAGLACEDAVESRIVIENDKGKPWQLHNGPF 70 Query: 69 E--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 E + L +++W+LLVQ ++HW A L+ FR +P+WR+DD+M S++ GG VGPH Sbjct: 71 EPERFSKLPDSHWTLLVQGLDHWVPDFADLLDEFRFVPNWRLDDIMASYAPKGGSVGPHY 130 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-------LLQVDPFEAIIDEELEPGDIL 179 DQYDVF++Q G RRW G HC H L + +E L PGD+L Sbjct: 131 DQYDVFLLQAEGHRRWTFG------GHCDHTSPRVDGTPLRILSSWEGEETVTLAPGDML 184 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 Y+PPG H G A ++ + S+GFRAP ++++GF D++ R + DPD+ + +P Sbjct: 185 YLPPGVGHHGVAEDDCITLSIGFRAPTVDDVLTGFTDFLCSRSDASGHLDDPDLKVQDNP 244 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 + P + +L ++ + ++ WFG++ + + + P E P P+ + + + Sbjct: 245 GAIGPDVIHRLDRLIRDQLSDQRQLALWFGQYSTAPKSLEIVVPAEEPVTPELLGELIAA 304 Query: 300 GEVLVRLGGLRV----LRIGDDVYANGEKI-----DSPHRPALDALASNIALTAENFGDA 350 G L G R ++ +GE+ P P L A A +F Sbjct: 305 GNPLRWNEGSRFAYHDFEDETALFVDGEQFLLRGDAGPLAPLLCAGARPDMGALASFAG- 363 Query: 351 LEDPSFLAMLAALVNSGYWFFE 372 D + +L+ LVN G +F+ Sbjct: 364 --DDAIQGLLSTLVNQGSLYFD 383 >UniRef50_C7RB22 Cupin 4 family protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RB22_KANKD Length = 390 Score = 214 bits (546), Expect = 3e-54, Method: Compositional matrix adjust. 
Identities = 113/297 (38%), Positives = 183/297 (61%), Gaps = 10/297 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDP-----ISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 FL+ +WQKRP++++ F++ IS +ELAG ++E +++SRL+ WQ+ HG Sbjct: 13 FLKEYWQKRPLLIRGAFSSAQVSGEDALISAEELAGYSLEDDIESRLIERDGDDWQLEHG 72 Query: 67 PF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 P + LG+ NW+LLVQ+++++H P L++ +P WR+DD+M+S++ GGGVGP Sbjct: 73 PIAESKFAELGDQNWTLLVQSLDYFHPPLCELIKACNFIPRWRLDDVMVSYATNGGGVGP 132 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 HLD+YDVF+IQG G+RRWRVG K Q CPHP + Q++PF+A +D + PGD+LYIPP Sbjct: 133 HLDKYDVFLIQGEGQRRWRVGHKNQGTTAICPHPQIAQIEPFDADMDVIVNPGDMLYIPP 192 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 PH G ++ N++ YSVGFRAPN ++ + Q EL + + + ++ + L Sbjct: 193 NTPHWGESVGNSICYSVGFRAPNIGGIVQKLMQ-LPQTELDQLWSDEARLSLKSSRGE-L 250 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++M + E L+ + E + FG+ +++ ++ + P+ D I AL+QG Sbjct: 251 TRDMSRWAEQQLKQLWTSEDYLMAFGKEVTELKYPDMLEVPDDDELIDWIELALEQG 307 >UniRef50_C6WYD1 Cupin 4 family protein n=1 Tax=Methylotenera mobilis JLW8 RepID=C6WYD1_METML Length = 395 Score = 214 bits (546), Expect = 3e-54, Method: Compositional matrix adjust. 
Identities = 138/389 (35%), Positives = 210/389 (53%), Gaps = 40/389 (10%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFES 70 +FL+ +W K+P+++K F +SPDELAGLA E EV SR+V GKW SHGPFE Sbjct: 18 EFLQHYWHKKPLLIKNAIPGFTGLLSPDELAGLACEEEVQSRIVEEIKGKWYASHGPFEE 77 Query: 71 YDHLG-------ETNWSLLVQAVNHWHEPTAA-LMRPFRELPDWRIDDLMISFSVPGGGV 122 D + W+LLVQ+VNH H P AA L+ F +P R+DDLM+S++ GGGV Sbjct: 78 SDFANLPEKPDPKHRWTLLVQSVNH-HLPEAAELLSQFNFIPHARLDDLMVSYAPDGGGV 136 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GPH D YDVF++QG G+R WR+ E+ L + + P L + F+ + +E GD+LY Sbjct: 137 GPHFDSYDVFLLQGQGKRLWRISEQTDLSLVEGAP---LRILKNFDTAQEWLVEAGDLLY 193 Query: 181 IPPGFPHEGYAL----ENAMNYSVGFRAPNTRELISGFADYVLQRELGGN------YYSD 230 +PP H G A+ + M YS+GFRAP EL++ F + +Q +L + Y D Sbjct: 194 LPPHLAHWGIAVTDGDTDCMTYSIGFRAPKVHELVTEFLGF-MQDKLNQDANALPGIYQD 252 Query: 231 PDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP 290 D+ P+ HPA + + K+ E++ + +H + G ++S+ + P+ ++P Sbjct: 253 ADLTPQEHPAQIGSSMVSKVAEILKTIQWSEQHVADFLGSYLSEPK-------PDIFFEP 305 Query: 291 DEIYDALKQGEVLVRLG-----GLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALT 343 ++ K E L++ G ++L Y NGE I + + A L ALA LT Sbjct: 306 NKKMSLRKFNENLLQHGISLDLKSQMLFTQQYFYLNGEAISAAGQAASLLTALADYRMLT 365 Query: 344 AENFGDALE-DPSFLAMLAALVNSGYWFF 371 +++ A E D +F+ L +GY +F Sbjct: 366 SDDIAQAGEVDSAFIEQLHGWYLAGYLYF 394 >UniRef50_C5BU83 Cupin 4 family protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BU83_TERTT Length = 385 Score = 212 bits (540), Expect = 2e-53, Method: Compositional matrix adjust. 
Identities = 105/273 (38%), Positives = 167/273 (61%), Gaps = 4/273 (1%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-GKWQVS 64 T+ + FL +WQK+P+++++ F +F P+S DELAGLA+E +V SRLV +D WQV Sbjct: 16 TITFEQFLNEYWQKKPLLIRQAFPDFEAPVSADELAGLALEDDVVSRLVVQRDESDWQVE 75 Query: 65 HGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGP E + L E++W+LLVQ + AL+ FR +P+WR+DD+MIS++ GGV Sbjct: 76 HGPLLEERFAQLPESHWTLLVQHADALDPAINALLDAFRFIPNWRLDDIMISYAADKGGV 135 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ-HCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH D YDVF++Q G+RRWR+G++ + P D+ + F+ + D +EPGD+LYI Sbjct: 136 GPHFDYYDVFLLQAQGKRRWRIGQRCSHESPLLPAADMKILQDFDTVEDWIVEPGDLLYI 195 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP H G A M YS+GFRAP+ E++ F++ + Y DP + P+ P + Sbjct: 196 PPNIAHWGEADGECMTYSIGFRAPSHAEVLLDFSEEMASFTNPDMRYMDPGLRPQQLPGE 255 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQ 274 + Q +++++ ++ + WFGE++++ Sbjct: 256 ISQQSIEQVQAIIHQYSTDKAALAGWFGEYMTR 288 >UniRef50_Q2Y9X5 Cupin region n=9 Tax=root RepID=Q2Y9X5_NITMU Length = 415 Score = 209 bits (531), Expect = 2e-52, Method: Compositional matrix adjust. 
Identities = 114/338 (33%), Positives = 194/338 (57%), Gaps = 8/338 (2%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF-- 68 DFL+ HWQK+P+++++ +F + +EL LA + + SRLV+ ++G+W+V HGPF Sbjct: 39 DFLQDHWQKKPLLIRKALPDFSGLLDANELIDLACQEDAQSRLVTRRNGRWEVRHGPFAP 98 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 ++ L + W+LLVQ VNH+ L+ F +P R+DDLM+S++ GGVGPH D Sbjct: 99 RAFARLPQKGWTLLVQDVNHFLPAARELLLKFNFIPHSRLDDLMVSYAPEDGGVGPHFDS 158 Query: 129 YDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 YDVF++QGTGRRRWR+ G+K + +LQ F + LEPGD+LY+PPG+ H Sbjct: 159 YDVFLLQGTGRRRWRISGQKDRTLVAAAPLKILQ--DFRPEQEWVLEPGDMLYLPPGYAH 216 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 +G A+E M YS+GFRAP +EL F ++ Y DPD+ + HP + + Sbjct: 217 DGVAVEPCMTYSIGFRAPTYQELAMQFLVHLQDSCEIAGIYEDPDLRIQTHPGQISSAML 276 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG 307 D++ + ++ +++ G ++++ + + PP+ P + +++G++ + L Sbjct: 277 DQVNAALDKIEWDNVEVERFIGMYLTEPKPHVFFMPPQEPISERKFVHQIRKGKLQLDLK 336 Query: 308 GLRVLRIGDDVYANGE--KIDSPHRPALDALASNIALT 343 R+L + ++ NG+ ++ + L LA +AL+ Sbjct: 337 S-RMLFRENRIFLNGDVYEVGKTAQRILGELADRLALS 373 >UniRef50_D0L0L5 Cupin 4 family protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0L5_HALNC Length = 397 Score = 208 bits (529), Expect = 3e-52, Method: Compositional matrix adjust. 
Identities = 122/327 (37%), Positives = 182/327 (55%), Gaps = 13/327 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK--WQV 63 TL+ DFL +WQK+PV++++G F P+SP+ELAGLA E +V +RL+ G W + Sbjct: 9 TLSVADFLRDYWQKKPVLIRQGVPGFESPLSPEELAGLACEEDVPARLILESAGARPWTL 68 Query: 64 SHGPFESYD--HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 HGPF D L E +SLL+ L++ FR +PDWRIDDLMIS++ PGG Sbjct: 69 RHGPFTEADFTSLPEDGYSLLITDCEKLIPDLMNLVQHFRFVPDWRIDDLMISYAPPGGS 128 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 VG H+D+YDVF++QG GRR+W + + P D+ + FE + LEPGD+LY+ Sbjct: 129 VGAHIDEYDVFLLQGMGRRKWMIEYPPKHSDFVPDLDIRLLQEFEPTEEWVLEPGDMLYL 188 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PPG PH G A+++ M YS+GFRAP E+ +G D ++ Y DPD+ A+P Sbjct: 189 PPGVPHHGVAVDHCMTYSIGFRAPLLHEMAAGVTDRLITDMDQAARYGDPDLQAPANPGA 248 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE--LDIA---PPEPPYQPDEIYDA 296 + KLR ++ +++Q + FI+++ E LD A P P + Sbjct: 249 LDASSRVKLRAILQSVLDQDDAV---LDRFIAETLTERPLDHAGFYPQNDPLDAKALRGE 305 Query: 297 LK-QGEVLVRLGGLRVLRIGDDVYANG 322 + G+ L+R R+L + D+ + G Sbjct: 306 IAHSGDTLMRTPAARLLLVEDEPDSAG 332 >UniRef50_A0Z1Z1 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z1Z1_9GAMM Length = 364 Score = 207 bits (528), Expect = 4e-52, Method: Compositional matrix adjust. 
Identities = 134/364 (36%), Positives = 185/364 (50%), Gaps = 28/364 (7%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 FL+ +WQ+RP+++K N+ P+SP+EL GLA E + DSRL+S W + GP S Sbjct: 15 FLDCYWQRRPLLIKAALPNWQSPLSPEELGGLAFEEDADSRLISKSKNGWMLKQGPLVSA 74 Query: 72 DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 D +W+LLV V+HW AAL + R LP WR DD+M+S++V GGVGPH D+YDV Sbjct: 75 DFQRSDDWTLLVNGVDHWVPEVAALRQCLRFLPQWRFDDVMVSYAVADGGVGPHFDRYDV 134 Query: 132 FIIQGTGRRRWRVGEKL-QMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGY 190 F++QGTGRR+WR+G + H L + FE + LE GD+LY+PPG H G Sbjct: 135 FLVQGTGRRKWRLGGWCDENTPRIKHEGLNLLQNFETSEEYLLEAGDVLYVPPGLAHWGV 194 Query: 191 ALENAMNYSVGFRAPNTRELISGFADYVLQR---ELGGNYYSDPDVPPRAHPADVLPQEM 247 A M YS+GFRAP L++ +AD L+ EL + PPR P ++ Sbjct: 195 ADTPCMTYSLGFRAPTVAALLARWADKTLESVDPELLLEDRASVTNPPR--PGEITLAHW 252 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG 307 + RE + + + W GE +++ + APP K L Sbjct: 253 NNAREAIRNSMEALDD-GSWLGEVVTEHG---ECAPPPS-----------KHATALRLHP 297 Query: 308 GLRV----LRIGDDVYANGEKIDSP--HRPALDALASNIALTAENFGDALED-PSFLAML 360 G RV L VYANGE + P P L+ L S ++ A D +FLAM Sbjct: 298 GARVSWQALSNECSVYANGEALRIPLSSVPILERLCSGDTVSPYELTSAHPDFLNFLAMS 357 Query: 361 AALV 364 LV Sbjct: 358 GVLV 361 >UniRef50_A6SXH9 Uncharacterized conserved protein n=2 Tax=Oxalobacteraceae RepID=A6SXH9_JANMA Length = 373 Score = 204 bits (520), Expect = 3e-51, Method: Compositional matrix adjust. 
Identities = 110/322 (34%), Positives = 175/322 (54%), Gaps = 6/322 (1%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 L +FL +W K+P+++++ +F +S DEL GL +V+SRL++H +W + G Sbjct: 10 LTAAEFLRDYWHKKPLLIRQAIPDFKPLLSRDELFGLVKSEDVESRLITHVKREWNMDSG 69 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 PFE L + +W+LLVQ VN E +LMR F +PD R+DDLMIS++ GGVG H Sbjct: 70 PFEQLPPLKQKDWTLLVQGVNLHDEAVDSLMREFSFIPDARLDDLMISYATETGGVGAHF 129 Query: 127 DQYDVFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 D YDVF++Q G RRWR+G + L + P L P E I L PGD+LY+PP Sbjct: 130 DSYDVFLLQAHGHRRWRIGAQTDLTLVDGMPLKILKNFKPEEEFI---LAPGDMLYLPPQ 186 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 + HEG A++ M YS+GFRAP+ +EL F + ++ Y+DPD+ P H A++ Sbjct: 187 YAHEGVAMDECMTYSIGFRAPSYQELGEAFLESMIDSIDLPGRYADPDLKPAKHSAEISA 246 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 + ++ + ++ E + GE++S+ + ++ PE K+ + + Sbjct: 247 AMLSRIAAELNKVRFTQEDIALFVGEYLSEPKAQIYFDAPEENLTRARFLQNAKKSGIKL 306 Query: 305 RLGGLRVLRIGDDVYANGEKID 326 L L + R + ++ NG + Sbjct: 307 SLKSLMLHR-NNYIFINGTSFE 327 >UniRef50_B2SQ70 Transcription factor jumonji, JmjC n=19 Tax=Xanthomonadaceae RepID=B2SQ70_XANOP Length = 498 Score = 203 bits (517), Expect = 7e-51, Method: Compositional matrix adjust. 
Identities = 112/305 (36%), Positives = 169/305 (55%), Gaps = 11/305 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG--KWQVSHGPFE 69 FL +W K P++++ F +F P+ P++LAGLA E V +RL+SH W V GPF+ Sbjct: 28 FLRNYWHKHPLLIRNAFADFASPLQPEDLAGLACEDGVLARLISHDRATDSWDVRSGPFQ 87 Query: 70 SYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 D G + +W+LLVQ V+ W AL+ FR LP WRIDD+MISF+ GG VG H+D Sbjct: 88 ETDFPGLPDHDWTLLVQDVDKWDADVRALLEQFRFLPRWRIDDIMISFAATGGSVGAHVD 147 Query: 128 QYDVFIIQGTGRRRWRVGEKL-QMKQHCP-----HPDLLQVDPFEAIIDEELEPGDILYI 181 YDVF++QG G RRW++ + Q + P +L + F+ L PGD+LY+ Sbjct: 148 HYDVFLLQGQGHRRWQIDARTAQGSKATPLAFREDVELKLLRTFKPTHHWVLGPGDMLYL 207 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP PH G A + + +S+G RAP++ ELI + D ++ Y D D+ A P + Sbjct: 208 PPLIPHHGVAEDACLTFSIGTRAPSSAELIGDYLDTLIADADEAVRYHDEDLKVPADPYE 267 Query: 242 VLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 + M+++ + L +N P+ WFG F++ R D+ P P + + AL++G Sbjct: 268 IDVTAMNRVVAALNALRMNDPDRLGDWFGRFMTTYRACGDVVPAPAPIPREAVEQALEEG 327 Query: 301 EVLVR 305 +L R Sbjct: 328 VLLHR 332 >UniRef50_C1DCJ3 Cupin region n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCJ3_LARHH Length = 380 Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust. 
Identities = 122/366 (33%), Positives = 199/366 (54%), Gaps = 14/366 (3%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGKWQVSHGPFE 69 +FL +W K+P++++ + P + LA LA +V+SRL+ ++ G+W V HGPF+ Sbjct: 16 EFLRDYWHKQPLLIRGALRDVGTPADFEVLAELARRDDVESRLIENRAGGRWHVEHGPFQ 75 Query: 70 --SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 L ET+W+LLVQ+VNH + ++ F LP R+DDLMIS++ PGG VGPH D Sbjct: 76 PARLARLPETDWTLLVQSVNHHLPHVSDILWRFNFLPYARLDDLMISYAPPGGTVGPHFD 135 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 YDVF++Q G++RW+VG + P + + F+A+ ELE GD+LY+PP F H Sbjct: 136 SYDVFLLQVGGKKRWQVGSPDNDRLEDGAP-IKVLSSFDALQSWELEQGDMLYLPPKFSH 194 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 G ALE M YS+GFRAP T+EL F Y+ Y+DPD+ P HPA++ + Sbjct: 195 YGVALEPGMTYSIGFRAPTTQELAEQFLTYLQDTLCLDGRYADPDLEPPRHPAEISESMV 254 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG 307 +++++M+ + + ++ G ++++ ++ + PPE P DE + + +++ L Sbjct: 255 EQVQDMLKAIRWDRDGVGEFLGCYLTEPKNHVFFDPPEDPLDEDEFAKVILRDGLVLDLK 314 Query: 308 GLRVLRIGDDVYANGE---KIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALV 364 + R + NGE +D P LA+ L + D + + + L Sbjct: 315 SQMLFR-NSLCFVNGEIHAGMDG-DLPVWRELANQRRLAGQAISDGMTETLYAGYL---- 368 Query: 365 NSGYWF 370 SG+W+ Sbjct: 369 -SGWWW 373 >UniRef50_Q2SJM1 Uncharacterized conserved protein n=3 Tax=Gammaproteobacteria RepID=Q2SJM1_HAHCH Length = 405 Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust. 
Identities = 130/388 (33%), Positives = 199/388 (51%), Gaps = 32/388 (8%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGK-WQVSHGPF 68 DFL +WQK+P++++ + P+ ++LAGLA E EV+SRLV + +G+ WQ+ HGPF Sbjct: 12 DFLAHYWQKKPLIIRGLLPGYECPLDENDLAGLATEEEVESRLVYEELNGQPWQLEHGPF 71 Query: 69 --ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 E +++ W+LLVQ ++ W A L+ FR LP+WR+DD+M SF+ PGG VGPH Sbjct: 72 SIEKLENMPHQGWTLLVQGLDTWVPEIADLLDRFRFLPNWRVDDIMASFAPPGGSVGPHF 131 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD---LLQVDPFEAIIDEELEPGDILYIPP 183 D YDVF+IQ TG RRWR+G P D L + FE + LEPGD LY+PP Sbjct: 132 DHYDVFLIQATGARRWRIGPP--CDDQSPRVDGTPLRILQNFEQTEEWVLEPGDALYLPP 189 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 G+ H G A + + SVGFR+P EL+S AD + + D P ++P + Sbjct: 190 GYAHYGVAETSCITLSVGFRSPTYAELMSALADDWFENPALSTHLHDATEAPLSNPGLIS 249 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQ--PDEIYDALKQGE 301 +R + L++ ++ FG FIS + + + P + + P++ ++L+ E Sbjct: 250 DDVFADIRSRLQALLDDEAGLRRSFGRFISAPKFDAAVPPLDAAMRLSPEDAGESLQDQE 309 Query: 302 VLVR--------------LGGLRVLRIGDDVYANGEKIDSPHR--PALDALASNIALTAE 345 + R G RV+ +GE D+ R P ++ L + + E Sbjct: 310 IQWRWNEGSRYTYSLYEEAGARRVM-----FAVDGEAYDADERFAPLVEILCRSNNVDRE 364 Query: 346 NFGDALEDPSFLAMLAALVNSGYWFFEG 373 D L +L++L+N G EG Sbjct: 365 RLLPWSADKDALKLLSSLLNRGSLVLEG 392 >UniRef50_B8KRM1 Cupin 4 family protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KRM1_9GAMM Length = 365 Score = 201 bits (511), Expect = 3e-50, Method: Compositional matrix adjust. 
Identities = 123/342 (35%), Positives = 180/342 (52%), Gaps = 17/342 (4%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 ++L+ FL+ +WQK P+V+++ +F PI D LAGLA+E +V SR+VS G W+V Sbjct: 3 ISLDTERFLKHYWQKHPLVIRQAVPDFTPPIDADHLAGLALEPDVQSRIVSCDRGHWEVQ 62 Query: 65 HGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 HGPF D + WSLLVQ V+ AAL R LP WR DD+MIS++ GG VGP Sbjct: 63 HGPFSEADFDRDDQWSLLVQGVDRLLPEVAALQRAVDFLPSWRFDDVMISYASEGGSVGP 122 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-LLQVDPFEAIIDEELEPGDILYIPP 183 H D+YDVF++QG G R WR+G++ + D LL +D FE L+ GD LYIPP Sbjct: 123 HFDRYDVFLLQGEGEREWRIGQRCDHTTATHNYDELLLLDDFEHRETHLLQTGDALYIPP 182 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD--VPPRAHPAD 241 G H G A +S+GFRAP+ L + D L++ + D + V R P + Sbjct: 183 GIAHWGIARGPCTTFSLGFRAPSIAALTARLTDSALEQLMPDLLLEDRNSLVSERGRPGE 242 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAP--PEPPYQPDEIYDALKQ 299 + Q+ D +R +L ++ + W GE ++++ + +P PP+ ++ + Sbjct: 243 ITTQQRDNIRSAVLSALSALDD-GVWLGELLTETEPFIGESPEGAVPPHIAMDLGSRINW 301 Query: 300 GEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIA 341 E G V+ANGE+ + R ALD L A Sbjct: 302 MET----------PEGIAVFANGERFPA-SRQALDVLTPLCA 332 >UniRef50_Q21K45 Cupin 4 n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21K45_SACD2 Length = 383 Score = 198 bits (504), Expect = 2e-49, Method: Compositional matrix adjust. 
Identities = 102/275 (37%), Positives = 162/275 (58%), Gaps = 15/275 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ------DGKWQVSH 65 FL +WQK+P+++++ NF P+S DELAGL +E +V SRL++ + +W V+H Sbjct: 19 FLRDYWQKKPLLIRQALPNFESPLSADELAGLCLEDDVISRLITETPQSSPFNSEWNVTH 78 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GP + ++ L E WSLLVQ V+ L+ FR +P+WR+DD+MIS++ GGVG Sbjct: 79 GPLPEDIFETLPENYWSLLVQHVDQLSPEVNQLLNLFRFIPNWRLDDVMISYAPDKGGVG 138 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQ----HCPHPDLLQVDPFEAIIDEELEPGDIL 179 PH D YDVF++QG G+RRWR+G++ K + P L + D E D L PGDIL Sbjct: 139 PHFDYYDVFLLQGHGQRRWRLGQQCTSKSPMLANAPMKVLTEFDVQE---DWVLNPGDIL 195 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 Y+PPG H G A+ ++ YSVGFRAP+ ++++ F+ V + N Y D + + Sbjct: 196 YVPPGLAHWGTAVGESITYSVGFRAPSHQDIVLDFSQEVASKIEEDNRYQDQFLTANKNA 255 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQ 274 ++ +++L+ ++ + + QW G+ ++Q Sbjct: 256 GEITGDAIEQLKHILQTYMQDEQALAQWLGKSMTQ 290 >UniRef50_D1UI98 Cupin 4 family protein n=6 Tax=Burkholderia RepID=D1UI98_9BURK Length = 424 Score = 198 bits (504), Expect = 2e-49, Method: Compositional matrix adjust. 
Identities = 124/365 (33%), Positives = 193/365 (52%), Gaps = 11/365 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF--E 69 F+ R+WQK+P+++++ + P+S DEL LA + +V++RL++H +WQ+ HGPF + Sbjct: 56 FMRRYWQKKPLLIRQAIPDVEAPLSRDELFELADQDDVEARLITHFRNRWQLEHGPFAPD 115 Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 L + W+LLVQ V+ + AL+ FR +PD R+DDLMIS++ GGGVGPH D Y Sbjct: 116 ELPSLKQRAWTLLVQGVDLHDDRARALLERFRFVPDARLDDLMISYATDGGGVGPHFDSY 175 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 DVF++Q G+RRWR+ + + P L + F A + LEPGD+LY+PP H+G Sbjct: 176 DVFLLQVKGKRRWRISAQKDLTLQAGLP-LKVLQNFAAEQEWVLEPGDMLYLPPHIAHDG 234 Query: 190 YALENAMNYSVGFRAPNTRELISGFADYVLQR----ELGGNYYSDPDVPPRAHPADVLPQ 245 A M S+GFRAP+ EL + F ++ +R G Y DP P PA++ P Sbjct: 235 VAEGECMTCSIGFRAPSAGELTAQFLYHLAERGEASGQAGALYRDPQQPAVERPAELPPA 294 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE-IYDALKQGEVLV 304 ++++ ++ + + + G ++S+ + + PP+ P I A K G L Sbjct: 295 LVERVGAILAGITWNEQDIASFLGTYLSEPKPSVVFDPPQRPLNEARFISQASKSGVRLD 354 Query: 305 RLGGLRVLRIGDDVYANGEKID-SPHRPALDALASNIALTAENFGDALEDPSFLAMLAAL 363 R L R + NGEK + L LA + L+A+ F D S A L Sbjct: 355 RKTNLLYNR--RFFFLNGEKTSLEGSKKWLFDLADHRCLSAKRFVTLSHDSSVTARLHEW 412 Query: 364 VNSGY 368 +G+ Sbjct: 413 YRAGW 417 >UniRef50_C5A9S6 Cupin superfamily protein family protein n=49 Tax=Burkholderiales RepID=C5A9S6_BURGB Length = 422 Score = 197 bits (502), Expect = 4e-49, Method: Compositional matrix adjust. 
Identities = 123/369 (33%), Positives = 195/369 (52%), Gaps = 17/369 (4%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFE-- 69 F+ RHWQK+P+++++ + P+S D L LA + + +SRL++H +WQ++ GPFE Sbjct: 52 FMRRHWQKKPLLIRQAIPGIVPPLSRDALFELAGDYDTESRLITHFRNRWQLAQGPFELD 111 Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 S + + W+LLVQ V+ + AL+ FR +PD R+DDLMIS++ GGGVGPH D Y Sbjct: 112 SLPSVSKREWTLLVQGVDLHDDAARALLERFRFIPDARLDDLMISYATDGGGVGPHFDSY 171 Query: 130 DVFIIQGTGRRRWRVG--EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 DVF++Q GRRRWR+G + L +++ P L + +P + + LEPGD+LY+PP H Sbjct: 172 DVFLLQVHGRRRWRIGAQQDLTLREDLPLKVLARFEPTDEWV---LEPGDMLYLPPHIAH 228 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQR------ELGGNYYSDPDVPPRAHPAD 241 +G A M S+GFRAP+ EL F Y+ +R G Y DP PP PA Sbjct: 229 DGIAEGECMTCSIGFRAPSAGELTGQFLYYLAERGALRQGARAGELYRDPAQPPVDDPAR 288 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDALKQG 300 + ++++ ++ + + + G ++S+ + + PE P + + A ++G Sbjct: 289 LPAALVERVETILKGIRWTTRDVENFLGSYLSEPKSNVVFDAPERPLGEAAFVAQASRRG 348 Query: 301 EVLVRLGGLRVLRIGDDVYANGEKID-SPHRPALDALASNIALTAENFGDALEDPSFLAM 359 L R L L + NGE+ + + L LA L A+ F P A+ Sbjct: 349 IRLDRKAAL--LYNARSYFINGEENPLAGNAKWLPELADRRHLGAKRFVTYSRHPLMTAL 406 Query: 360 LAALVNSGY 368 L +G+ Sbjct: 407 LHEWYCAGW 415 >UniRef50_A4BDP0 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BDP0_9GAMM Length = 381 Score = 197 bits (500), Expect = 6e-49, Method: Compositional matrix adjust. 
Identities = 108/284 (38%), Positives = 169/284 (59%), Gaps = 19/284 (6%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVS--HQDGKWQVS 64 N +FL +WQ+ P+ LKR + D I+ DELAGLA E+EV+SRL+S ++ +W + Sbjct: 17 FNSQEFLNTYWQQAPL-LKRNALSLHDIITADELAGLATEAEVESRLISGSNETEQWTLQ 75 Query: 65 HGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGPF + + L E +W+LLVQAV+HW ++ F LP WRIDD+MISF+ GGGV Sbjct: 76 HGPFSDDVFQTLPERDWTLLVQAVDHWVPEVRQVLAQFSFLPRWRIDDIMISFATDGGGV 135 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH-PDLLQVDPFEAIIDEE------LEP 175 GPH DQYDVF++Q G+R W++G Q C DL++ P + + E L+P Sbjct: 136 GPHFDQYDVFLVQLAGQREWKIG------QMCDEDSDLVENIPVKVLSAFEEQDAWVLDP 189 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+LY+PPG H G +L ++M SVGFRAP+ E I+ ++ Y D + Sbjct: 190 GDVLYLPPGVAHWGTSLGDSMTLSVGFRAPSDSETIAELGHFMSSMVSDFQRYGDAGISQ 249 Query: 236 RAH-PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 R P + +++D+++ ++ L + +WFG+++++ +++ Sbjct: 250 RNQTPHAIEEEDIDRVQAIIKRLADDRSLVSEWFGQYVTEPKYD 293 >UniRef50_B8KGD9 Cupin 4 family protein n=2 Tax=unclassified Gammaproteobacteria RepID=B8KGD9_9GAMM Length = 370 Score = 197 bits (500), Expect = 7e-49, Method: Compositional matrix adjust. 
Identities = 115/334 (34%), Positives = 176/334 (52%), Gaps = 16/334 (4%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 +Y+L L F+ R+WQK+ + + GF +F P DELAGLAME E+D+R+V Sbjct: 2 TDYRLDLEVKSFVARYWQKQHLFIPGGFKHFSVPADADELAGLAMEDELDARIVFRDGQH 61 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVN-HWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 W GPF + +W+LLVQ V+ HW E A L+ LP WR+DD+M+S++ G Sbjct: 62 WHQERGPFSQESYRRSGSWTLLVQGVDQHWDE-AAELLNAVSFLPSWRLDDIMMSYATDG 120 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDI 178 G GPH D YDVFIIQG G+RRW+VG + +L + FE+ + + GD+ Sbjct: 121 GSAGPHYDNYDVFIIQGDGQRRWQVGGLCDASSALMDNTELRLLADFESQREYLMNTGDV 180 Query: 179 LYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 LYIPPG H G ++ + ++S+GFRAP +L++ +AD +L + DP P Sbjct: 181 LYIPPGIAHYGVSVGESTSFSIGFRAPRQSDLLARWADNLLNTLEDDALFCDPGREPATR 240 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 ++ ++ + R +L + + +WFGE I+ S + +P D + + Sbjct: 241 VGEITTADLHRARAQLLRVFEDKD--PRWFGEAITNSGTTV-----QPS--SDTALNLDE 291 Query: 299 QGEVLVRLGGLRVLRIGDD----VYANGEKIDSP 328 QG + R G R+ D V+A+G D+P Sbjct: 292 QGAWVTRAPGSRLAWHATDEELLVFAHGSTHDTP 325 >UniRef50_D1RFR4 Cupin superfamily protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RFR4_LEGLO Length = 396 Score = 196 bits (497), Expect = 1e-48, Method: Compositional matrix adjust. 
Identities = 115/320 (35%), Positives = 179/320 (55%), Gaps = 16/320 (5%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGK-- 60 Q++LN FL +WQK+P+V+++ F P+SPDELAGLA+E +V+SRLV D K Sbjct: 6 QISLN--TFLGDYWQKKPLVIRKALPEFTHPLSPDELAGLALEEDVESRLVFETPDEKPY 63 Query: 61 WQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 W + GPF + L T+W+LLVQ V+ AL+ F +P WRIDD+MIS++V Sbjct: 64 WHLKRGPFSVNDFSTLPSTHWTLLVQGVDRLIPEVYALLDYFNFIPQWRIDDIMISYAVL 123 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G VGPH D YDVF+ Q G+R W + K + + +L + F+ LE GD Sbjct: 124 HGSVGPHYDNYDVFLYQAKGKREWSLTTKGCNNQNYMKGLELRIMSQFDVEERFILEEGD 183 Query: 178 ILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 +LY+PP H G +L + M YS G+R+ +EL+ F+DY+ ++ L N Y DPD Sbjct: 184 MLYLPPHVGHHGISLSDECMTYSFGYRSYQGQELLESFSDYLSEKGLFKNLYQDPDWSNL 243 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 + +++ P ++++ ++IN + + WFG F ++ + ++ P P + DE+ D Sbjct: 244 QNTSEIPPSAWLNAQKLLQQVINDEKTMQTWFGCFATRLDQQAELQLP-VPLEEDELIDI 302 Query: 297 ------LKQGEVLVRLGGLR 310 +K+G L+R R Sbjct: 303 SDFIKEIKEGLNLIRDASCR 322 >UniRef50_B8GSM7 Cupin 4 family protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GSM7_THISH Length = 397 Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust. 
Identities = 117/328 (35%), Positives = 185/328 (56%), Gaps = 19/328 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH--QDGKWQVSHGPFE 69 FL +WQ++P+++++ F P+SP+ELAGLA E V SRLV + G W + GPF+ Sbjct: 20 FLRDYWQQKPLLVRQAIPGFESPLSPEELAGLACEEGVISRLVRERGETGSWALRTGPFD 79 Query: 70 S--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 + L E++W+LLV + A + PFR +PDWR+DDLM+S++ P G VGPH+D Sbjct: 80 EDDFTTLPESHWTLLVSDMEKHLPELRAYLEPFRFIPDWRMDDLMVSYAAPEGSVGPHVD 139 Query: 128 QYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 +YDVF++Q GRRRW++ + + P +L + F+ + LEPGD+LY+PP P Sbjct: 140 EYDVFLLQAQGRRRWQIARQAVSGDDFLPGVELRILRDFQPDQEWILEPGDMLYLPPRIP 199 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP----ADV 242 H G A+ M +SVGFRAP R+L++ + D + + Y+DP + P+ +P A Sbjct: 200 HHGVAVGPCMTWSVGFRAPAWRDLMAAWVDQRYEALAPQDRYADPGLEPQDNPGELSAAA 259 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHEL--DIAPPEPPYQPDEIYDALKQG 300 L + + LR M ++ E +W G +++ + EL + PE + DE L+ G Sbjct: 260 LARLIAGLRRAM--AVDDAE-LARWLGTVLTEPKAELLEHMQLPETLTR-DEALGLLQDG 315 Query: 301 EVLVRLGGLRVLRIGD----DVYANGEK 324 L R G R+ + D ++ NG++ Sbjct: 316 VSLERHGAARLAWMSDHGGLRLFVNGQE 343 >UniRef50_Q5WVF0 Putative uncharacterized protein n=4 Tax=Legionella pneumophila RepID=Q5WVF0_LEGPL Length = 395 Score = 191 bits (486), Expect = 3e-47, Method: Compositional matrix adjust. 
Identities = 112/335 (33%), Positives = 176/335 (52%), Gaps = 20/335 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH---QDGKWQVSHGPF 68 FL+ +WQK+P+++++ +F +P++PDELAGLA+E E++SRLV Q +W + GPF Sbjct: 12 FLKDYWQKKPLIIRQALPDFTNPLTPDELAGLALEEEIESRLVYETPDQSPQWNLKRGPF 71 Query: 69 ESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 + D +G +T+W+LLVQ V+ L+ F +P WR+DD+MIS++ G VGPH Sbjct: 72 KESDLIGLPKTHWTLLVQGVDRIVPDVYELLDHFNFIPQWRVDDVMISYATLHGSVGPHY 131 Query: 127 DQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 D YDVF+ Q G+R W + +K +L ++ FE LE GD+LY+PP Sbjct: 132 DNYDVFLYQAKGQRLWSLTSKKCHTNNFIKGLELRIMNEFEVEEQFILEEGDMLYLPPHI 191 Query: 186 PHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 H G A E M YS G+R+ +EL DY+ + L + Y DPD + +++ P Sbjct: 192 GHYGIAQSEECMTYSFGYRSYQGQELWDSLGDYLSEHGLFKSLYQDPDWSTLKNTSEITP 251 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY------DALK 298 + R+++ +++ + WFG F + + P PP + DE+ L Sbjct: 252 KAWSNARQLLRQVLENDQLMHSWFGCFATSLDQSAEQYLP-PPLEEDELLGLDEFIKELS 310 Query: 299 QGEVLVRLGGLRVLRIGDD------VYANGEKIDS 327 + +VR R I D Y NG++ DS Sbjct: 311 NYQEIVRDASCRFAYIMSDQESQCHFYVNGKEWDS 345 >UniRef50_Q3JQS3 Cupin superfamily protein family n=25 Tax=Burkholderiales RepID=Q3JQS3_BURP1 Length = 422 Score = 189 bits (481), Expect = 1e-46, Method: Compositional matrix adjust. 
Identities = 131/375 (34%), Positives = 195/375 (52%), Gaps = 29/375 (7%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 F+ R+WQK+P+++++ P+S D L LA + +V+SRLV+H +WQ+ HGPFE Sbjct: 52 FMRRYWQKKPLLIRQAITGIAPPLSRDALFELAADYDVESRLVTHFRNRWQLEHGPFEP- 110 Query: 72 DHLGETN---WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 +HL W+LLVQ ++ + AL+ FR +PD R+DDLMIS++ GGGVGPH D Sbjct: 111 EHLPSVKRREWTLLVQGLDLHDDRARALLERFRFVPDARLDDLMISYATDGGGVGPHFDS 170 Query: 129 YDVFIIQGTGRRRWRVG--EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 YDVF++Q G+RRWR+G + L +++ P L +P + + LEPGD+LY+PP Sbjct: 171 YDVFLLQVHGKRRWRIGAQQDLSLQEGLPLKILANFEPTDEWV---LEPGDMLYLPPHIA 227 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGG--------NYYSDPDVPPRAH 238 H+G AL M S+GFRAP+ EL + F ++ +R GG Y DP P Sbjct: 228 HDGIALGECMTCSIGFRAPSAGELRAQFLYHLAER--GGLRTGARDDARYRDPAQPAVDS 285 Query: 239 PADVLPQEMDKLREMMLELINQPEH-FKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDA 296 PA +LP M K L I EH + G ++S+ + + PP + + A Sbjct: 286 PA-MLPAAMVKRVAATLAGIQWDEHDVGDFLGCYLSEPKSNVVFEPPTRRLGEAAFVTQA 344 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA---LDALASNIALTAENFGDALED 353 ++G L R L L + NG+ P A L LA + A+ F D Sbjct: 345 SRRGVRLDRKAAL--LYNARSYFINGDA--HPLATAAKWLPELADTRRMEAKRFVTLSRD 400 Query: 354 PSFLAMLAALVNSGY 368 P+ +L +G+ Sbjct: 401 PAMTGLLHEWYCAGW 415 >UniRef50_C7I1M3 Cupin 4 family protein n=1 Tax=Thiomonas intermedia K12 RepID=C7I1M3_THIIN Length = 378 Score = 188 bits (477), Expect = 4e-46, Method: Compositional matrix adjust. 
Identities = 128/349 (36%), Positives = 187/349 (53%), Gaps = 27/349 (7%) Query: 6 TLNWPDFLERH-----WQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 TL+W F E WQ++P++L++ F F +S +L LA + +V+SRL+ + Sbjct: 5 TLHWGAFTEARFLREIWQRKPLLLRQAFPGFKPLLSRAQLFALAGQDDVESRLLQRAGRR 64 Query: 61 WQVSHGPFESYDH--LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 WQ+ HGPF + + NW+LLVQ VN + L+R FR +PD R+DDLMIS++ Sbjct: 65 WQLDHGPFSRKQLPPVEQRNWTLLVQGVNLHVDAAGDLLRQFRFIPDARLDDLMISWASE 124 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVG--EKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 GGGVGPH D YDVF++Q GRRRWR+G E ++ P L + P E +I LE G Sbjct: 125 GGGVGPHQDAYDVFLLQAAGRRRWRIGPVEDATLQPGKPVKLLAKFTPEEDLI---LESG 181 Query: 177 DILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 D+LY+PPG+ H+G A + M YSVGFRAP EL+ + + + GG Y DP + Sbjct: 182 DMLYLPPGWGHDGIAASGDCMTYSVGFRAPPQGELLKEVLWQLAEAQQGGAIYRDPPLRS 241 Query: 236 RAHPADVLPQEMDKL-REMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 A PA +LP M + RE L F+ G +++ + ++ E P + Sbjct: 242 GASPA-LLPAAMVRFAREAFSRLKPDAAMFENVLGLYLTTPKPQVWFESVETPTA--TLR 298 Query: 295 DALKQ-GEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIAL 342 A +Q G L R ++L ++ NGE +D+ ALAS+ L Sbjct: 299 RACRQTGCRLDRRS--KMLYTTQALFLNGEAVDA-------ALASSALL 338 >UniRef50_A1K4G1 Putative uncharacterized protein n=1 Tax=Azoarcus sp. BH72 RepID=A1K4G1_AZOSB Length = 371 Score = 186 bits (471), Expect = 2e-45, Method: Compositional matrix adjust. 
Identities = 111/348 (31%), Positives = 189/348 (54%), Gaps = 10/348 (2%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 FL+ +WQK+P+++++ F + +++ LA + +V+SR + +G W+++ GP Sbjct: 14 FLQEYWQKKPLLVRQAVPGFTGVLGREDIFDLACDPDVESRHIRLHEGNWELNRGPQTRA 73 Query: 72 DHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 G+ + W++LVQ +N W E L+ F +P R+DDLM+S++V GGGVGPH D YD Sbjct: 74 RLRGKRSPWTVLVQGINLWSEAADELLHRFNFIPQARLDDLMVSYAVDGGGVGPHFDNYD 133 Query: 131 VFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHE 188 VF++QG G+RRW++ ++ + + P L P I LEPGD+LY+PP + H Sbjct: 134 VFLLQGQGQRRWQIADQDDRSLVEGAPLRILRNFVPAHDWI---LEPGDMLYLPPHWAHN 190 Query: 189 GYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMD 248 G A+ YS+GFR+P +EL + F ++ +R YSDPD+ + + A + +D Sbjct: 191 GIAIGECTTYSIGFRSPTAQELGAEFLGWLQERVCLDGLYSDPDLTEQDNSALIGDAMID 250 Query: 249 KLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGG 308 +++ ++ + + G ++++ + + PPE P AL G + + Sbjct: 251 QVQRVIEGIRWSRADVAAFLGHYLTEPKPTVFFEPPEEPIPLKAFRRALGAGGLRLDART 310 Query: 309 LRVLRIGDDVYANGEKIDS--PHRPALDALASNIALT-AENFGDALED 353 L +LR + + NGE +DS + ALD LA LT + AL+D Sbjct: 311 L-LLRSQGNFFLNGEAVDSVPAWQQALDTLAHARRLTGCADLPAALQD 357 >UniRef50_Q0VQ28 Putative uncharacterized protein n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VQ28_ALCBS Length = 377 Score = 185 bits (470), Expect = 2e-45, Method: Compositional matrix adjust. 
Identities = 126/376 (33%), Positives = 204/376 (54%), Gaps = 20/376 (5%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGKW 61 + L L FL HWQKRP+ + G + +D + LAGLA+E V++R+++ +G W Sbjct: 10 FTLPLTPAAFLREHWQKRPLFMP-GAASGLDQPDANTLAGLALEESVEARVITGAGNGPW 68 Query: 62 QVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 V P + ++ LGE NW+LLVQ+V+H+ T+ L+ F LP+WR++D+MIS++ G Sbjct: 69 SVLQSPLDDNVFEALGEKNWTLLVQSVDHFLTETSLLLDDFAFLPNWRVEDIMISYAAKG 128 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL-EPGDI 178 G VGPH D+YDVF+IQ +G RRW++G+ D L++ + +E + +PGD+ Sbjct: 129 GSVGPHFDRYDVFLIQASGSRRWQIGDVCDESSPRQATDELKLLAQMPVREEFIAQPGDV 188 Query: 179 LYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 LY+PPG H G A + + + +SVGFRAP+ + L++ A L E ++DPD Sbjct: 189 LYLPPGVAHHGVAEDSDCITWSVGFRAPDYQMLMAEIAGECLA-ESDSKLFTDPDRGITT 247 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE-LDIAPPEPPYQPDEIYDA 296 P+ + + +L L+L++ PE ++ ++S R E L+ A DE + Sbjct: 248 DPSILADTDRQQLVRGALDLLH-PEAIERAIYRWLSTPRLEGLEFA-------VDEHHIR 299 Query: 297 LKQGEV-LVRLGGLRVLRIGDDVYANGE--KIDSPHRPALDALASNIALTAENFGDALED 353 + +V LVR G +R+L G + NGE + +P + LAS DA+ Sbjct: 300 ERDSDVSLVRHGSVRLLMQGKLAWLNGEAHTLTEQQQPLVQLLASKRRYQKREL-DAVMT 358 Query: 354 PSFLAMLAALVNSGYW 369 P+ +L + GY+ Sbjct: 359 PTARELLHEWIEQGYF 374 >UniRef50_C0N3X6 Cupin superfamily protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N3X6_9GAMM Length = 389 Score = 184 bits (467), Expect = 4e-45, Method: Compositional matrix adjust. 
Identities = 117/385 (30%), Positives = 199/385 (51%), Gaps = 15/385 (3%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--GK 60 + L FL + WQK+P+++++ + +S +ELAGLA E++++SRL+ Q G Sbjct: 5 FNTELTQQQFLTQFWQKKPLLIRQAWPQMDALLSAEELAGLACEADIESRLIQEQGELGP 64 Query: 61 WQVSHGPFESYD--HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 WQV+ GPF D L ++W+LLVQ V+ +M F +PDWR DDLM+SF+ Sbjct: 65 WQVNDGPFTEADFAKLPASHWTLLVQDVDKHVPELTEVMAKFDFIPDWRRDDLMVSFAPE 124 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 GG VGPH D YDVF++Q G RRW + + + + +L + F+A +L+PGD Sbjct: 125 GGSVGPHTDGYDVFLLQAQGTRRWAISQTPVVEAEFIDGLELKILKQFDADDVWDLQPGD 184 Query: 178 ILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 +LY+PP F H G AL + M +S+GFRAP EL+ F + + E+G Y DP++ Sbjct: 185 MLYLPPHFAHHGVALNDCMTFSIGFRAPTQLELLDAFMHSLSEHEVGQQRYRDPELKVCD 244 Query: 238 HPADVLPQEMDKLREMMLELINQPEH-FKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 + + + ++ +++ I + G +++++ L++ E D + A Sbjct: 245 DDKYIDRSALRRFKQSLIKCIEDSDDVLLDAVGRLLTETKPSLELLADELIADSDNVSLA 304 Query: 297 --LKQGEVLVRLGGLRVLRIGDD----VYANGE--KIDSPHRPALDALASNIALTAENFG 348 QGE L R +R+ ++ ++A GE + D R + L + A ++ Sbjct: 305 EYFSQGEQLHRNPYIRIAWAENEESVQLFAAGETYQADKAVRSIMPILTGTEPIQALHWT 364 Query: 349 DALEDPSFLAMLAALVNSGYWFFEG 373 ++ + +L LV G W+++ Sbjct: 365 Q-IQSAAATNLLEELVAIGCWYWQS 388 >UniRef50_Q31GJ6 Cupin superfamily protein n=2 Tax=Gammaproteobacteria RepID=Q31GJ6_THICR Length = 401 Score = 182 bits (463), Expect = 1e-44, Method: Compositional matrix adjust. 
Identities = 96/270 (35%), Positives = 153/270 (56%), Gaps = 10/270 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVSHGPFES 70 FL +WQK+P++++ +F P+S +ELAGL++E EV+SR+V H +++ GPF+ Sbjct: 24 FLSEYWQKKPLLIRNALPDFSPPVSAEELAGLSLEEEVESRIVIQHSAEDYELKKGPFKE 83 Query: 71 --YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 Y+ L E NW+LLVQ ++ L+ F +P WRIDD+M+S++ GG VGPH D Sbjct: 84 SLYETLPEKNWTLLVQGMDRLLPEVTELLNEFDFIPSWRIDDIMVSYATEGGNVGPHFDH 143 Query: 129 YDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEE--LEPGDILYIPPGF 185 YDVF++Q G RRW++ + + DL + F +++EE +PGDILY+PP + Sbjct: 144 YDVFLLQAQGERRWQLSAQDCDETNYIEGVDLRIMKRF--VVEEEYVCQPGDILYVPPKW 201 Query: 186 PHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 H G L ++ M +S+G+R EL F DY+ + + + Y DP+ A P + Sbjct: 202 GHHGVGLTDDCMTFSIGYRTYRGLELWDSFGDYLAETQQFQSLYQDPNWKGTA-PGQISE 260 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQ 274 + + ++ + E K WFG F +Q Sbjct: 261 GSWQQAQSLLKAALENEEALKNWFGRFATQ 290 >UniRef50_B9ZR02 Cupin 4 family protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZR02_9GAMM Length = 385 Score = 178 bits (452), Expect = 2e-43, Method: Compositional matrix adjust. 
Identities = 104/293 (35%), Positives = 166/293 (56%), Gaps = 14/293 (4%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQDGKWQVSHGPF 68 +FL +WQ++P++++ + F +PI PD+LAGLA + + +RLV G W V +GPF Sbjct: 20 EFLRDYWQQKPLLVRGAVSGFANPIEPDDLAGLACDPDASARLVLGDTDHGDWAVEYGPF 79 Query: 69 ES--YDHLGETNWSLLVQAVNH-WHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 E + L + W+LL+ V W E L R F +P WR DDLMIS++ P G VGPH Sbjct: 80 EEDRFASLPDRAWTLLISDVERFWPEGHDFLAR-FDFVPRWRRDDLMISYASPDGSVGPH 138 Query: 126 LDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 +D YDVF+ Q GRRRW++ L P L + +P E+ +LEPGD+LY+P Sbjct: 139 VDAYDVFLFQAAGRRRWQIQSPPGPLDCHDDLPLAILREFEPTESW---DLEPGDLLYLP 195 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 P PH G +L++ M +S+GFRAP +L++GF + R YSDP P A+ ++ Sbjct: 196 PNLPHYGLSLDDQCMTWSIGFRAPTYLDLLTGFLEERANRVGEAPRYSDPQRPVSAYVSE 255 Query: 242 VLPQEMDKLREMMLELINQPE-HFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 + + +LR+++ E++ + + G F+++ +++ +PP + E Sbjct: 256 LPSHDRTRLRDILREMLAADDTELDAFLGRFLTRPAGNVELHTGDPPAEAREC 308 >UniRef50_A1VLH8 Cupin 4 family protein n=6 Tax=Burkholderiales RepID=A1VLH8_POLNA Length = 413 Score = 178 bits (451), Expect = 3e-43, Method: Compositional matrix adjust. 
Identities = 124/351 (35%), Positives = 179/351 (50%), Gaps = 52/351 (14%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 F+ RHWQK+P+++++ F +S L LA +V+SRL+ Q W + GPF S Sbjct: 18 FMRRHWQKKPLLVRQAIAGFEPFLSRAALFKLAAREQVESRLIVQQAKGWGMKKGPFASK 77 Query: 72 DH--LGETNWSLLVQAVNHWHEPTA-ALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 L + W+LLVQ V+ HEP AL++ FR +PD R+DDLMISF+ PGGGVGPH D Sbjct: 78 SLPPLSQEGWTLLVQGVD-LHEPAGHALLQQFRFVPDARLDDLMISFATPGGGVGPHFDS 136 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQHCPHPD--LLQVDPFEAIIDEE--LEPGDILYIPPG 184 YDVF+ Q +GRRRW++G + K PD L + FE +DEE LE GD+LY+PP Sbjct: 137 YDVFLFQASGRRRWKIGLQ---KDFTLQPDVPLKILQNFE--VDEEFVLEAGDMLYLPPR 191 Query: 185 FPHEGYAL---------ENAMNYSVGFRAPNTRELISGFADYVLQR--ELGGN------- 226 + H+G A + M YS+GFR+P EL A +L R E+G + Sbjct: 192 YAHDGIAEASVGTNGKPADCMTYSIGFRSPARTEL----ASELLHRLAEMGEDAAEEACA 247 Query: 227 ------------YYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQ 274 Y DP P PA + D + +LE + P GE++++ Sbjct: 248 AEAGRKPARAQPMYRDPTQPATETPAAMPAGLADFAGQAVLEALKDPLALACALGEYMTE 307 Query: 275 SRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGG-LRVLRIGDDVYANGEK 324 + + PE + DA K G++ + L R++ D ++ NGE Sbjct: 308 PKPGVWFDEPEQAWD----GDAAKAGQMAIALDARTRMMYDSDHIFINGES 354 >UniRef50_D1KE35 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KE35_9GAMM Length = 362 Score = 177 bits (448), Expect = 8e-43, Method: Compositional matrix adjust. 
Identities = 97/241 (40%), Positives = 141/241 (58%), Gaps = 17/241 (7%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQDGKWQVSHGPF 68 +FLE +WQK+P+++K+ NFI PIS DELAGL++E E +SRLV S +W +++GPF Sbjct: 11 EFLEDYWQKKPLLIKQALPNFISPISSDELAGLSLEEEFESRLVQGSTAQQQWSLTNGPF 70 Query: 69 E--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 ++ L E +W+LLVQ V+ + + L++ F +P WR DD+MIS++ GG VGPH Sbjct: 71 TKTTFTQLPEQDWTLLVQGVDRFIDEVHDLIKQFDFIPRWRFDDVMISYATKGGSVGPHF 130 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIID------EELEPGDILY 180 D YDVF++QG+GRRRW + Q C + L+ P + E+EPGD+LY Sbjct: 131 DYYDVFLLQGSGRRRWELS-----TQFCTLDNYLKDVPLRIMHTFTPEQFFEVEPGDVLY 185 Query: 181 IPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 IPP H G +L++ S G+RA + +EL D + YY DP + P Sbjct: 186 IPPKVAHHGVSLDDECTTLSFGYRAYSAQELFESL-DMQNPDQEQNIYYQDPIWINTSSP 244 Query: 240 A 240 A Sbjct: 245 A 245 >UniRef50_B4X170 Cupin superfamily protein n=1 Tax=Alcanivorax sp. DG881 RepID=B4X170_9GAMM Length = 382 Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust. 
Identities = 121/368 (32%), Positives = 195/368 (52%), Gaps = 22/368 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-GKWQVSHGPF-- 68 FL HWQK+ + + G +D + LAGLA+E V++R+++ D G W V P Sbjct: 24 FLREHWQKKALFMP-GAARGLDQPDANTLAGLALEESVEARIITGADNGPWSVLQSPLSD 82 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 + ++ LGE NW+LLVQ+V+H+ T+ L+ F LP+WR++D+MIS++ GG VGPH D+ Sbjct: 83 DVFETLGEENWTLLVQSVDHFLTETSLLLDDFAFLPNWRVEDIMISYAAKGGSVGPHFDR 142 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL-EPGDILYIPPGFPH 187 YDVF+IQ G RRW++G+ D L++ + +E + PGD+LY+PPG H Sbjct: 143 YDVFLIQAAGHRRWQIGDVCDESTPRQPTDELKLLADMPVREEFVAAPGDVLYLPPGVAH 202 Query: 188 EGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQE 246 G A + + + +SVGFRAP+ + L++ A L E ++DPD + P + + Sbjct: 203 HGVAEDSDCITWSVGFRAPDYQMLMAEIAGECLA-ESDSQLFTDPDRDVTSDPTVLADAD 261 Query: 247 MDKLREMMLELINQPEHFKQWFGEFISQSR---HELDIAPPEPPYQPDEIYDALKQGEVL 303 +L L+L+ QP+ ++ ++S R E I + D++ L Sbjct: 262 RQQLIRGALDLL-QPDAIERAVYRWLSTPRLDGLEFAIDDHHIRERDDKV--------AL 312 Query: 304 VRLGGLRVLRIGDDVYANGEK--IDSPHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 VR G +R+L G + NG+ + RP + LAS E+ DA+ P+ +L Sbjct: 313 VRHGSVRLLMQGKLAWLNGDSHTLTEQQRPLVQLLASKRRYQ-ESELDAVMTPAARELLH 371 Query: 362 ALVNSGYW 369 + GY+ Sbjct: 372 EWIEQGYF 379 >UniRef50_A6GQ27 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GQ27_9BURK Length = 383 Score = 174 bits (442), Expect = 3e-42, Method: Compositional matrix adjust. 
Identities = 111/375 (29%), Positives = 190/375 (50%), Gaps = 18/375 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ F+ HW +P + ++ F NF D +A +A + +++SRL+ H W + H Sbjct: 10 NLSVEKFMTEHWHIKPYLFRQAFPNFEPLCDFDTIAEMASDEDIESRLIQHSKTGWTLEH 69 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAA-LMRPFRELPDWRIDDLMISFSVPGGGVGP 124 GPF+ + + W++L+Q ++H H P A L++ FR +PD R+DD+M+S + GGGVGP Sbjct: 70 GPFDELPSMKKKAWTVLIQGIDH-HLPEAYDLLQLFRFIPDARLDDVMLSLASDGGGVGP 128 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKL--QMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 H D YDVF++Q G+RRW++G L ++++ P L +P E + LEPGD+LY+P Sbjct: 129 HYDSYDVFLLQMHGKRRWKIGPLLDKELEEGLPLKILKNFEPTEEFV---LEPGDMLYLP 185 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGF----ADYVLQRELGG-NYYSDPDVPPRA 237 P + H+G A + S+GFRAP E++SG AD + Q +SDP + Sbjct: 186 PNYGHDGIAEGSCSTLSIGFRAPTQAEVLSGILRDMADQIDQDPTKTQTLFSDPARGLQK 245 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDAL 297 +PA++ ++ ++ + Q ++ G +++ + + + EI L Sbjct: 246 NPAEIPDDLLNFGINLIQQFSAQSPQIQRSMGILLTEPKSHVYFVNNTEDQEIHEIISVL 305 Query: 298 KQGEVLVRLGGLRVLRIGDDV-YANGEKIDSPHR---PALDALASNIALTAENFGDALED 353 GE + L + D V Y NG+ ++ L LA+ + + +AL + Sbjct: 306 --GERGIALSMKTKMLFKDAVFYINGDAVNPTSALTVKQLQMLANQREMEPIDAAEALNN 363 Query: 354 PSFLAMLAALVNSGY 368 P F L +G+ Sbjct: 364 PEFQYFLVGFAKAGW 378 >UniRef50_Q7NS46 Putative uncharacterized protein n=1 Tax=Chromobacterium violaceum RepID=Q7NS46_CHRVO Length = 377 Score = 174 bits (440), Expect = 7e-42, Method: Compositional matrix adjust. 
Identities = 108/330 (32%), Positives = 174/330 (52%), Gaps = 28/330 (8%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFES- 70 FL +W K+P++++ + + L+ LA + +SRL+ ++ KW + GPF + Sbjct: 15 FLAEYWHKKPLLIRGALTDVGPHVDFSVLSELAQRDDAESRLIEYKKDKWHLERGPFRAS 74 Query: 71 -YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 + L ET+W+LLVQ VNH ++ F +P R+DDLMIS++ PGG VGPH D Y Sbjct: 75 RFRRLAETDWTLLVQGVNHHLPHIDDILWRFNFIPYARLDDLMISYAPPGGTVGPHFDAY 134 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEE------LEPGDILYIPP 183 DVF++Q G++RW++ QH D ++ P + D LE GD+LY+PP Sbjct: 135 DVFLLQVGGKKRWQIS-----SQH--DDDFIEDAPIRVLKDFRMEQEFVLEHGDMLYLPP 187 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 H G ALE M YS+GFRAP +EL + F Y+ R Y+DPD+ +A PA + Sbjct: 188 HCAHYGVALEPGMTYSIGFRAPPAQELAAQFLVYLQDRVCIDGVYADPDLKLQADPAKIG 247 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 + +D++ ++ ++ + + G ++++ + + PE DE+ + V Sbjct: 248 GEMIDQVAGLLSKIRWDKDTVCDFLGHYLTEPKAHVFYDSPE-----DELDEEAFAEAVA 302 Query: 304 VRLGGL------RVLRIGDDVYANGEKIDS 327 R GL ++L VY NGEK+D+ Sbjct: 303 ER--GLELDRKSQILYCDACVYCNGEKVDA 330 >UniRef50_C0VP99 Cupin 4 n=2 Tax=Acinetobacter RepID=C0VP99_9GAMM Length = 387 Score = 172 bits (437), Expect = 1e-41, Method: Compositional matrix adjust. 
Identities = 104/323 (32%), Positives = 165/323 (51%), Gaps = 10/323 (3%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD---GKWQVSHGPF 68 FL +WQK+P++++ I + P ++ LA+E +V +RL+ ++ +W V P Sbjct: 17 FLAEYWQKKPLLVRNAMPEIIGLLEPADVQELALEEDVTARLIRQKNKNPNEWHVKSSPL 76 Query: 69 ESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 D N W+LLVQAV+H+ A L + F +P WR DD+M+S++ GG VG H D Sbjct: 77 TKGDFQKLPNLWTLLVQAVDHYSFDIAELWKKFPFIPQWRRDDIMVSYAPKGGSVGKHFD 136 Query: 128 QYDVFIIQGTGRRRWRVGEKL-QMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 YDVF++QG G RRW++G+K + P L + DE L PGD+LY+PPG Sbjct: 137 FYDVFLVQGYGHRRWQLGQKCDETTALIPDQPLKLLTDMHVEFDEVLAPGDLLYVPPGLA 196 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQE 246 H G A ++ + +S GFR PN E+I +D + E+ D A + E Sbjct: 197 HYGVAEDDCLTFSFGFRMPNLSEMIDQVSDKFAENEILKKPLIDIVRQHTAPIGKINSTE 256 Query: 247 MDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRL 306 + L+ +L+ + Q F+ ++S+S + I PE D + + + G L+ Sbjct: 257 LAYLKAQLLDYLTQAPEFEAAIMSYMSESNYPNSIPEPEEITTED-LLEVIGTGYQLILE 315 Query: 307 GGLRVL--RIGD--DVYANGEKI 325 R+L +GD D +AN E + Sbjct: 316 PASRLLYRELGDSLDFWANSENV 338 >UniRef50_B7H3P1 Cupin superfamily protein n=16 Tax=Acinetobacter RepID=B7H3P1_ACIB3 Length = 387 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. 
Identities = 106/370 (28%), Positives = 183/370 (49%), Gaps = 21/370 (5%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD---GKWQVSHGPF 68 FL +WQK+P++++ + + P+++ LA+E V +RL+ +D +W V P Sbjct: 16 FLTEYWQKKPLLVRNAMPEIVGMLEPNDVKELALEDHVTARLIRQKDKNPNEWHVKSSPL 75 Query: 69 ESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 D W+LLVQAV+H+ A L + F +P WR DD+M+S++ GG VG H D Sbjct: 76 TKGDFQKLPKLWTLLVQAVDHYSFDIAELWKKFPFIPQWRRDDIMVSYAPKGGSVGKHFD 135 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 YDVF++QG G RRW++G+ + P+ L + + DE L PGD+LY+PPG Sbjct: 136 FYDVFLVQGYGHRRWQLGQMCDASTEFVPNQPLKLLPEIDVHFDEVLAPGDLLYVPPGLS 195 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQE 246 H G A ++ + +S GFR PN +I +D EL N D ++ +E Sbjct: 196 HYGVAEDDCLTFSFGFRMPNISGMIDRISDQFATDELLQNPVVDITRKNPPQIGEINTEE 255 Query: 247 MDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP-PYQPDEIYDALKQGEVLVR 305 + LR+++L + ++S+ ++ +I PEP + +++ L +G L+ Sbjct: 256 LAYLRDLVLAQLKNSTVLDAALMSYMSEPKYPDNI--PEPDEIEVEDLNAILSEGYELLL 313 Query: 306 LGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAEN----FGDALEDPSFL 357 R+L + + NGE++ P +++ A+ + A+ F L + L Sbjct: 314 EPASRLLYTEQNGILKFWGNGEEL-----PIVESFATQLKSIADGKSIPFNSELNNTDIL 368 Query: 358 AMLAALVNSG 367 + L+N+ Sbjct: 369 ENIVQLLNNS 378 >UniRef50_P44683 Uncharacterized protein HI0396 n=36 Tax=Gammaproteobacteria RepID=Y396_HAEIN Length = 404 Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust. 
Identities = 105/332 (31%), Positives = 170/332 (51%), Gaps = 21/332 (6%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVSHGPFES 70 FL +WQK+P+V++ G + P ++ LA +V +RLV + D W+V P Sbjct: 20 FLRDYWQKKPLVIRNGLPEIVGQFEPQDIIELAQNEDVTARLVKTFSDDDWKVFFSPLSE 79 Query: 71 YD--HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 D L E WS+LVQ + W L F +P W+ DD+M+S++ GG VG H D+ Sbjct: 80 KDFQKLPE-KWSVLVQNLEQWSPELGQLWNKFGFIPQWQRDDIMVSYAPKGGSVGKHYDE 138 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV--DPFEAIIDEELEPGDILYIPPGFP 186 YDVF++QG G RRW+VG+ +++ D E +IDE + PGDILYIP Sbjct: 139 YDVFLVQGYGHRRWQVGKWCDASTEFKPNQSIRIFDDMGELVIDEVMNPGDILYIPARMA 198 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD----- 241 H G A ++ + +S G R PN LI G + ++ N S+ D+P R ++ Sbjct: 199 HYGVAEDDCLTFSFGLRYPNLSNLIDGISKGFCHQDPDLN-LSEFDLPLRLSQSEQRTGK 257 Query: 242 VLPQEMDKLREMMLELINQPEH----FKQWFGEFISQSRHELDIAPPEPPYQPDEIYDAL 297 + + + +++++L+ + E FKQ +S R+EL ++ + PDE+ L Sbjct: 258 LADENIQAMKQLLLDKLAHSEAFDTLFKQAVASAVSSRRYELLVS--DEMCDPDEVRSIL 315 Query: 298 KQ-GEVLVRLGGLRVLRIGD--DVYANGEKID 326 ++ G L + ++L + +YANGE +D Sbjct: 316 EEDGAFLSQDNNCKLLYTENPLRIYANGEWLD 347 >UniRef50_A4SX54 Cupin 4 family protein n=2 Tax=Polynucleobacter necessarius RepID=A4SX54_POLSQ Length = 410 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. 
Identities = 115/380 (30%), Positives = 195/380 (51%), Gaps = 25/380 (6%) Query: 12 FLERHWQKRPVVLKRGFNNFI----------DPISPDELAGLAMESEVDSRLVSHQDGKW 61 F++++W K+P++++ F PIS ELA L+ + V+SRL+ + W Sbjct: 36 FMKQYWHKKPLLIRGAIPAFSLTNQNGEALESPISFPELAELSTQDTVESRLI--RSKPW 93 Query: 62 QVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 HGPF +S + + NW+LL+Q + H A ++ FR +PD R+DDLMIS + G Sbjct: 94 SFDHGPFAKKSIPAINKPNWTLLLQGMEAHHPAAAKILSWFRFIPDARLDDLMISVAGIG 153 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 GGVGPH D YDVF++Q +GRR W + E+ + + P L + F + D LEPGD+L Sbjct: 154 GGVGPHFDSYDVFLMQMSGRRHWHISEQKDLSLN-PKLPLKILQHFRSEQDWILEPGDML 212 Query: 180 YIPPGFPHEGYALE-NAMNYSVGFRAPNTRELIS----GFADYVLQRELGGNYYSDPDVP 234 Y+PP H+G AL+ +S+GFR+P+ +EL+ A+ + ++DP Sbjct: 213 YLPPHVAHDGIALDAGCQTWSIGFRSPSFKELLQEGLWRLAESLENLPELEQKFADPKQE 272 Query: 235 PRAHPADVLPQEM-DKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 A A+ LP E+ +L+ + +L ++Q + F ++S+ + + P P +P Sbjct: 273 ATAS-AEQLPDELIAQLKGQLHKLKLDQIDSFLPGITAYLSEPKQQAIFDGPNSPLKPKA 331 Query: 293 IYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALE 352 L + E L+ R+L +G V+ NGE + P + +++ +L+ Sbjct: 332 FLARLSR-ENLLPHPQTRILSLGKQVFCNGESMTQDQGPRIGDAWRSLSAQKRLRTKSLQ 390 Query: 353 DPSFLAMLAALVNSGYWFFE 372 + ++ A + SG+ FE Sbjct: 391 NIDKSSLYEAYL-SGWLIFE 409 >UniRef50_B1Y837 Cupin 4 family protein n=3 Tax=cellular organisms RepID=B1Y837_LEPCP Length = 418 Score = 161 bits (407), Expect = 5e-38, Method: Compositional matrix adjust. 
Identities = 123/380 (32%), Positives = 187/380 (49%), Gaps = 43/380 (11%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD----GKWQVSHGP 67 F++RHWQ++P+++++ P+S ++ + + V+SR +S Q WQ GP Sbjct: 53 FMQRHWQRKPLLVRQAVPGIEPPVSRAQMFAMLEDDAVESRFLSRQGEGDRQTWQFKRGP 112 Query: 68 F--ESYDHLGETNWSLLVQAVNHWHEPTAA-LMRPFRELPDWRIDDLMISFSVPGGGVGP 124 S + + W++LVQ +N H P AA L+ FR +P R+DDLMIS++ GGGVGP Sbjct: 113 MPRRSLPAIKQPGWTVLVQGLN-LHVPAAADLLNRFRFVPQARLDDLMISWASEGGGVGP 171 Query: 125 HLDQYDVFIIQGTGRRRWRVGE--KLQMKQHCPHPDLLQVDPFEAIIDEE---LEPGDIL 179 H D YDVF+IQ GRRRWR+G ++++ P V E EE LEPGD+L Sbjct: 172 HFDSYDVFLIQVAGRRRWRIGRLPDARLREGLP------VKIIENFRHEEEWVLEPGDML 225 Query: 180 YIPPGFPHEGYALEN-AMNYSVGFRAPNTRELIS----GFADYVLQRELGGN---YYSDP 231 Y+PPG+ H+G A++ M SVGFR+P EL+ AD + G Y DP Sbjct: 226 YLPPGWAHDGDAVDGECMTCSVGFRSPQRSELVRETLLRLADGIDDPADAGARPPVYRDP 285 Query: 232 DVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPD 291 A P + + + + ++ + +P + GE++S+ + ++ Sbjct: 286 KQSATAAPGRIPAELLAFAEQGLMRALAEPGALARALGEYLSEPKAQVSF---------- 335 Query: 292 EIYDALKQGEVLVRLGGLRVLRIGDD-VYANGEKIDSPHRPA--LDALASNIALTAENFG 348 E+ + L G V VRL L D VY NG+ + R A L LA L A Sbjct: 336 ELGEPLPDG-VGVRLDDRSCLLYDDGHVYCNGDSWRAAGRDAAMLHLLADARQLDATTLR 394 Query: 349 DALEDPSFLAMLAALVNSGY 368 A P+ A+L + G+ Sbjct: 395 RA--SPALRALLEQWADDGW 412 >UniRef50_A2W941 Transcription factor jumonji n=1 Tax=Burkholderia dolosa AUO158 RepID=A2W941_9BURK Length = 360 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 76/182 (41%), Positives = 110/182 (60%), Gaps = 4/182 (2%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFE-- 69 F+ R+WQK+P+++++ P++ D L LA + + +SRL++H KWQ++HGPFE Sbjct: 54 FMRRYWQKKPLLIRQAIPGVASPVTRDALFELAADYDAESRLITHFRNKWQLTHGPFEPG 113 Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 S + W+LLVQ ++ + AL+ FR +PD R+DDLMIS++ GGGVGPH D Y Sbjct: 114 SLPAVTRRAWTLLVQGLDLHVDAARALLDRFRFIPDARLDDLMISYATDGGGVGPHFDSY 173 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 DVF++Q GRRRWR+G Q C L + FE + G ILY+PP H+G Sbjct: 174 DVFLLQVEGRRRWRIGA--QTDCRCSRRALKILRHFEPATNGCWNRGAILYLPPHSAHDG 231 Query: 190 YA 191 A Sbjct: 232 VA 233 >UniRef50_C1E292 Predicted protein n=2 Tax=Micromonas RepID=C1E292_9CHLO Length = 466 Score = 145 bits (367), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 89/284 (31%), Positives = 139/284 (48%), Gaps = 18/284 (6%) Query: 9 WPDFLERHWQKRPVVLKRGF-NNFIDPISPDELAGLAMESEVDSRLVSHQD---GKWQVS 64 W F E++WQK PVV++ G P+ DELAGLA E+E R++ D W + Sbjct: 68 WTTFFEKYWQKEPVVIRGGLPTELCTPVDNDELAGLACETEFRPRIIRKGDEGPSSWSLQ 127 Query: 65 HGPFESYDHL----GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 GPF S D L + +W LL+ + ++ F P WR+ D+ S S GG Sbjct: 128 MGPF-SEDELKSLPSDGSWCLLLNDLEKHVSEFMDVLNLFDRFPRWRVADVQASISSEGG 186 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQM-----KQHCPHPDLLQVDPFEAIIDEELEP 175 VG H DQ+DVF+IQGTG +RW + + + + P ++ + F+ L+ Sbjct: 187 SVGAHSDQFDVFLIQGTGHKRWSISDCAEYVPDNDEAFFPDAEVRVLKNFQPQSCSLLKQ 246 Query: 176 GDILYIPPGFPHEGYA---LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 GDILY+PP H G A +SVGF AP EL+ +A + G + DP Sbjct: 247 GDILYLPPKVAHHGVAEGCKTICTTFSVGFLAPAHDELVLSYAQASVDTHDGSQRWRDPW 306 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPE-HFKQWFGEFISQS 275 + P+ H ++ + + + E++ + + + + +WFG +QS Sbjct: 307 LKPQEHVGEISSEAVAQAAEIIRQSMPKNDAEIARWFGCHATQS 350 >UniRef50_UPI0000E87D6F hypothetical protein MB2181_02235 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87D6F Length = 
377 Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 95/323 (29%), Positives = 164/323 (50%), Gaps = 24/323 (7%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESY 71 FLE++W K+ + L+ + +S D + GLA ++S++++ +G Q ++GPF Sbjct: 18 FLEKYWGKQALFLQDAIDISGAGLSKDVVFGLAKNENIESKIIAFIEGSQQTTYGPFNKV 77 Query: 72 DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 H G+++ SLL+ N HE + L + +P DD+M+SFS GGGVGPH D YDV Sbjct: 78 KH-GKSS-SLLIHQFNLIHEFSYNLFQSINFVPYCLHDDVMMSFSSEGGGVGPHSDSYDV 135 Query: 132 FIIQGTGRRRWRVG--EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 F++QG G + W +G +K K L+ P E + +PGDILY+PP PH G Sbjct: 136 FLVQGQGEKVWNIGATDKKAFKTTSTDHSNLKFTPTEQFL---AKPGDILYVPPFTPHHG 192 Query: 190 YAL-ENAMNYSVGFRAPNTRELISGFADYVLQR-ELGGNYYSDPDVPPRAHPADVLPQEM 247 +L ++ + YS+GFR+P+ E+ + + +Y++ R E + ++ D+ ++P + Sbjct: 193 ISLSDDCITYSIGFRSPSNNEIRNQYLEYLMDRKEKSNDLFNGLDLSENTKA--LIPNAL 250 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY--QPDEIYDALKQ--GEVL 303 + P + G F+S+ P E + + + +A K+ E + Sbjct: 251 ASFIKKNTAFPKDPTIIDDFIGCFLSE--------PHEGAFFTKKNITKNAFKKIDTEKI 302 Query: 304 VRLG-GLRVLRIGDDVYANGEKI 325 +RL R + ++ Y N E I Sbjct: 303 LRLNIQTRAVIHNENFYINAENI 325 >UniRef50_A4S2B8 Predicted protein n=2 Tax=Ostreococcus RepID=A4S2B8_OSTLU Length = 392 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 107/392 (27%), Positives = 182/392 (46%), Gaps = 39/392 (9%) Query: 13 LERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ---DGKWQVSHGPFE 69 + +WQK+P+++++ NF P+ +E+AGLA E + +R+ + + W+ GPFE Sbjct: 1 MREYWQKKPLLMRQAIPNFRPPLDGNEIAGLACEEDASARIFVREGDDEQSWRKKIGPFE 60 Query: 70 SYDHLG---ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 D + WSL+V ++ +P ++ F P WRI D+ S S GGGVGPH Sbjct: 61 ESDLTSLPEDKPWSLIVNDLDVQAQPFGDMLELFNCFPRWRISDIQASVSPDGGGVGPHS 120 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL-----LQVDPFEAIIDEE---LEPGDI 178 D +DVF++Q G + W V + +++ P D ++ ++ ++++ L PGD+ Sbjct: 121 DHFDVFLLQAEGEKVWAVADN---EEYWPDNDAAFVPECEIRVLKSFVEDDSFTLVPGDM 177 Query: 179 LYIPPGFPHEGYALEN----AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 LY+PP H G A + ++ S+GF AP T EL+ + ++ L G+ +SDP + Sbjct: 178 LYLPPKIAHNGVATNSKPGVSVTLSIGFLAPTTDELVLSYTQRASEK-LKGSRWSDPWLK 236 Query: 235 PRAHPADVLPQEMDKLREMMLELI-NQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 P + + + E++ +WFG + E D A E +E+ Sbjct: 237 PVEDVGAISAESITYASEIIKRTYPKNDAEVARWFGCHTTARTGEDDDA-DENEVSIEEL 295 Query: 294 YDALKQGEVLVR--LGGLRVLRIGDD------VYANGEKIDSPHRPALDALASNIALTAE 345 A + ++ R L V ++ DD +ANGE D PA A+ IA E Sbjct: 296 LAAWEHQGLVAREDLRFAFVEKVADDSLKNALFFANGECWDVVS-PAAVKTATVIANRGE 354 Query: 346 NFGDALE------DPSFLAMLAALVNSGYWFF 371 + + + D L + L GY +F Sbjct: 355 LYEEDTQTEECDFDDEALKLALTLFERGYLYF 386 >UniRef50_B7FZB3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FZB3_PHATR Length = 492 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 92/332 (27%), Positives = 158/332 (47%), Gaps = 38/332 (11%) Query: 10 PDFLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAME------SEVDSRLVSHQDGK-- 60 PD L +W + P++++ F+ + + P + L + S +R+++H G+ Sbjct: 67 PDLLTNYWGRSPLLIRSAFHAEALTEVWPSQADLLELALDDDEISSDSARIITHTSGRLD 126 Query: 61 -WQVSHGPFES-----YDHLGETNWSLLVQAVNHWHEPTAALMR-PFRELPDWRIDDLMI 113 + GPF + +H G+ W+L+V V+ + A M F LP WR DD I Sbjct: 127 SFASQLGPFSTSTIQGLEH-GDKMWTLIVNDVDRYVSTLADWMDDEFGFLPRWRRDDAQI 185 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH--PDLL---------QV 162 S + GGG+GPH+D YDVF+ Q +G+R W VG + +++ PDL Sbjct: 186 SMARTGGGIGPHVDSYDVFLTQTSGQRTWLVGNTMTVQEEMNTLIPDLSVRILRDVSNHN 245 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQR 221 + A EL+PGD+LY+PP + H G AL ++ + SVG R+P++ EL++ A+ +L Sbjct: 246 ESSHAYTRLELQPGDVLYLPPRYVHWGTALTDDCVTLSVGARSPSSAELVARIAETMLGS 305 Query: 222 EL--GGNYYSDPDVPPRAHPA---DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSR 276 Y+DPD+ + A + D ++ M+L+ +++ + E +++ Sbjct: 306 VSVHAVQRYTDPDLLQEVNGAPLHSMTNHAKDSMKTMVLDAVHEITDDPMRWDELVAK-- 363 Query: 277 HELDIAPPEPPYQPDEIYDALKQGEVLVRLGG 308 L P Y+ +K E L GG Sbjct: 364 --LATEPKRMSENALVPYNEIKDSEYLAIWGG 393 >UniRef50_D1TSY0 Conserved domain protein n=19 Tax=Yersinia pestis RepID=D1TSY0_YERPE Length = 102 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 51/100 (51%), Positives = 69/100 (69%) Query: 272 ISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRP 331 ++ RHELDIAP +PPY DEI DAL +G VL RLGGLRVLR+GD+V+ N E+++ + Sbjct: 1 MTTPRHELDIAPAQPPYDQDEIVDALMEGAVLTRLGGLRVLRVGDNVFINSERLEMANAE 60 Query: 332 ALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYWFF 371 A DAL + + G+AL+D +F+ L L+N GYWFF Sbjct: 61 AADALCRYTIIGKKELGEALQDSAFVTELTELINQGYWFF 100 >UniRef50_B6BWI1 Putative cytoplasmic protein n=1 Tax=beta proteobacterium KB13 RepID=B6BWI1_9PROT Length = 346 Score = 107 bits (267), Expect = 6e-22, Method: Compositional matrix adjust. 
Identities = 92/354 (25%), Positives = 154/354 (43%), Gaps = 38/354 (10%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAM-ESEVDSRLVSHQDGKWQ 62 +L LN F++ +W K+ L G NF D +L L + S+ R + QDG+ Sbjct: 2 ELVLNKKCFVKSYWGKKHFFLPGGIKNFNDNFV--DLDDLNLPSSKALERKIFIQDGRKY 59 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 ++ + ++ T S L NH H+ + + F +P + IDD+MIS S G V Sbjct: 60 INFTNVKKKLNVN-TPKSKLFYKTNHIHQLSFEVKNLFDFIPQYLIDDVMISLSNTKGSV 118 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 G H D Y VF+IQG G + W++ E + + ++ GDILY+P Sbjct: 119 GKHKDNYSVFLIQGKGIKNWKIYEN------------------KKVFSYTVKEGDILYVP 160 Query: 183 PGFPHEGYALENAMN-YSVGFRAPNTRELISGFADYV--LQRELGGNYYSDPDVPPRAHP 239 PG H G + N YSVGFR+P++ L F DY+ L + ++ + + Sbjct: 161 PGIDHYGISQSEICNTYSVGFRSPDSLNLKEIFNDYIFNLLDQTSTIFFQNKLFSKQKAS 220 Query: 240 ADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 +P D +++ + L +P ++ G +++ EL + + D LK Sbjct: 221 ---IP---DDIKDFFIRHLDCKPIILDEFIGIYLTSVDLEL---FKKKEITLKKFKDQLK 271 Query: 299 QGEVLVRLGGLRVLRIGDDVYANGEKID--SPHRPALDALASNIALTAENFGDA 350 + + + R L G + Y NG KID + R + + A+N + Sbjct: 272 RMPLFLN-QMTRALYFGKNFYINGFKIDIETNSRKEFRKFFNESTIIAKNLNNK 324 >UniRef50_B8C536 Putative uncharacterized protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C536_THAPS Length = 204 Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust. 
Identities = 63/187 (33%), Positives = 102/187 (54%), Gaps = 22/187 (11%) Query: 52 RLVSHQ---DGKWQVSHGPFESYDHLG-----------ETNWSLLVQAVNHWHEPTAALM 97 R++SH D ++++ GP + G E +L+V ++ ++ P A + Sbjct: 1 RVISHSPGDDSSYELTWGPLSDAEFHGWMAKVTSPNNNEQRETLVVNDIDRFYPPLADWI 60 Query: 98 R-PFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-----KLQMK 151 + LP WR+DD IS + GG+GPH+D YDVF+IQ +G R W+VG K +M Sbjct: 61 HDTYHFLPRWRMDDGQISLAEQSGGIGPHVDNYDVFLIQMSGTRAWQVGRKELSTKEEMD 120 Query: 152 QHCPHPDLLQVDPFEAIIDE-ELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRE 209 + D+ ++ + + ++E L+PGD+LY+PP H G AL + M SVG RAP+ + Sbjct: 121 RMIEGLDVRVLENWASEMEEWVLQPGDMLYLPPRVAHCGTALSDGCMTLSVGCRAPSVSD 180 Query: 210 LISGFAD 216 L+S A+ Sbjct: 181 LMSRLAE 187 >UniRef50_A9TET4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TET4_PHYPA Length = 530 Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 60/197 (30%), Positives = 87/197 (44%), Gaps = 32/197 (16%) Query: 79 WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG--GVGPHLDQYDVFIIQG 136 WS+ + W +P ++ F W ++ P G G PH D + F+IQ Sbjct: 195 WSVRILHPQRWCDPVFLILSAFERF--WGSVAGCNAYLTPAGSQGFSPHYDDIEAFVIQT 252 Query: 137 TGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE 193 GR+RW+V + + P P+ Q + E I+D +LEPGDILY+P G H+ A E Sbjct: 253 EGRKRWKVYKPRTPGEALPRFSSPNFEQGEIGEPILDVDLEPGDILYMPRGTIHQAKASE 312 Query: 194 NA----MNYSVG----------FRAPNTRELISGFADYVLQRE---------LGGNYYSD 230 +A + SVG F P EL S D++L RE +G + D Sbjct: 313 DAHSLHITVSVGQRNCWGDFLEFAMPRALELAS--EDHILLRESLPRGYADYMGVAHSDD 370 Query: 231 PDVPPRAHPADVLPQEM 247 D P RA D + + M Sbjct: 371 HDNPQRAAFIDKIMECM 387 >UniRef50_Q091R3 Mina protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q091R3_STIAU Length = 383 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 91/386 (23%), Positives = 159/386 (41%), Gaps = 55/386 (14%) Query: 12 FLERHWQKRPVVLK------------RGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 F E W+++P+VL+ R + P + G+ + E H+D Sbjct: 15 FFEEAWERKPLVLQGPPDRWSGLFSSRDLGRLLTYQPPRSIEGMMLVKEG-----RHRDE 69 Query: 60 KWQVSHGP--FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 W G E +++++ V + EP E + + + + Sbjct: 70 NWLSPDGSPRLEQVQAAWREGYTIVINKVGQFWEPVGRFCAAVEEELHHPVG-VNLYMTP 128 Query: 118 PGG-GVGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAI----IDE 171 PG G H D D F++Q G + W+V G ++ + P PD E++ +++ Sbjct: 129 PGAQGFKAHFDIMDAFVLQVEGSKVWQVRGPQVTL----PLPDEHTATSSESLPPVLLEQ 184 Query: 172 ELEPGDILYIPPGFPHEG-YALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSD 230 EL+ GD+LYIP GF HE A ++++ ++G +A +L + E Sbjct: 185 ELKRGDVLYIPRGFVHEARTAQTHSVHLTLGLQAVTWSDLFVAAIAAARRDE-----RFR 239 Query: 231 PDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP 290 +PPR + ++ RE++ EL P H + G ++Q L + P PP + Sbjct: 240 KGLPPRFLEGSAMMEQ--TFRELLAEL---PRHLE--LGHALTQLAERLVVQKPPPPTE- 291 Query: 291 DEIYDA--LKQGEVLVRLGGLRVLRIGD-----DVYANGEKIDSPHR--PALDALASNIA 341 D + A LK VL R G+ VLR+ + + +G K+ P + PAL +A Sbjct: 292 DLLEGAVELKGSTVLTRRPGM-VLRVMEGPGYAGLQYSGGKLMGPAKIGPALRHIAKGSV 350 Query: 342 LTAENFGDALEDPSFLAMLAALVNSG 367 + ++ L + L + LV SG Sbjct: 351 IPVQSL-PGLSEKEQLVLAGRLVRSG 375 >UniRef50_Q849M1 Putative uncharacterized protein pSV2.19c n=3 Tax=Streptomyces RepID=Q849M1_STRVN Length = 390 Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust. 
Identities = 76/267 (28%), Positives = 119/267 (44%), Gaps = 30/267 (11%) Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPF-RELPDWRIDDLMISFSVPG 119 W H P E + L E SL + +V+ H P A L REL +L S+S Sbjct: 81 WHRLH-PAELHTRLTE-GASLALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATE 138 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV-DPFEA----IIDEELE 174 G G H D +D I+Q G +RWR+ + P P + DP EA + D L Sbjct: 139 G-FGVHWDDHDTVIVQLDGAKRWRIYGTTR-----PFPLYRDIADPGEAPTEPVADLVLW 192 Query: 175 PGDILYIPPGFPHEGYALE--NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 PGD+LY+P G H A + +++ + G + +L++ ++ +L E ++ D Sbjct: 193 PGDVLYVPRGVWHAVSADQGVRSLHVTCGLQTHTATDLMAWVSEQLLTHE---DWRR--D 247 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 +P A P DV +D +R+ + EL++ P ++ Q+ + P PY Sbjct: 248 LPLLAAP-DVQADAVDGMRKRLAELLDDPTLLARYRTAMDGQAVGRM---VPSLPYIDGI 303 Query: 293 IYDALKQGEVLVRLGGLR-VLRIGDDV 318 D G + VRL R VL +G+D Sbjct: 304 PVD----GALRVRLTTARAVLDVGEDT 326 >UniRef50_A4U3D3 MYC induced nuclear antigen n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3D3_9PROT Length = 390 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 45/190 (23%), Positives = 85/190 (44%), Gaps = 15/190 (7%) Query: 11 DFLERHWQKRPVVLKRG----FNNFIDPISPDELAGLAMESEVDSRLVSHQD----GKWQ 62 +FL +W+K+P+++KR + + + + D++ + D R+ D ++ Sbjct: 17 EFLAEYWEKKPLLVKRAAPGFYRDLLSVQAIDQVLAMPGLHRRDIRVARGTDPLAVEEYA 76 Query: 63 VSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 G S L +++++ +N P A + R F ++ + Sbjct: 77 DKDGFINAASLSRLFTDGFTIILNTLNLKLRPLAEICRAFEQVLSIPCQTNIYYTPRLAQ 136 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G PH D +DVF+ Q GR+ W V + +L ++ L + P + ++ +LEPGD Sbjct: 137 GFKPHYDSHDVFVFQVAGRKHWLVNDTPVELPLRGQGFEAGLYE--PGDVTMEFDLEPGD 194 Query: 178 ILYIPPGFPH 187 +LYIP G H Sbjct: 195 LLYIPRGVMH 204 >UniRef50_A3Q8B6 Cupin 4 family protein n=4 Tax=Mycobacterium RepID=A3Q8B6_MYCSJ Length = 404 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 24/125 (19%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP-----------DLLQVDPFEAII 169 G PH D +DVF++Q G +RW V E + PHP + + E +I Sbjct: 140 GFDPHYDVHDVFVLQTAGEKRWVVHEPVH-----PHPLPSQPWTQHRDAIAERAAGEPVI 194 Query: 170 DEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPN----TRELISGFADYVLQRE-- 222 D L PGD LY+P G+ H +AL+ +++ ++G A R ++ AD R Sbjct: 195 DTVLAPGDALYLPRGWVHSAHALDTTSIHLTIGVSAVTGVDVARAVVDALADSAAFRAPL 254 Query: 223 -LGGN 226 +GG+ Sbjct: 255 PMGGD 259 >UniRef50_B7PMB0 MYC-induced nuclear antigen, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PMB0_IXOSC Length = 472 Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 68/298 (22%), Positives = 116/298 (38%), Gaps = 53/298 (17%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVL--KRGFNNFIDPI-SPDELAGLAMESEV----DSR 52 ME+ L+ L++ +F E++W++ P V + G F + S D +A E+++ D Sbjct: 16 MEFLLSPLSYKEFSEKYWEREPFVAHDRAGMRAFWPQLFSKDAFFSIAKETKLYFGKDVS 75 Query: 53 LVSHQDGK-------WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 ++DGK + Y E +L V W + Sbjct: 76 ACKYEDGKRSDYAEGYSAKSAKLNKY--FEERKATLQVHQPQRWKDSL------------ 121 Query: 106 WRIDDLMISF----------SVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH 153 W + +LM F P G G+ PH D DVFI+Q G + W++ + + Sbjct: 122 WEVLELMERFFGCLVGCNAYITPAGSQGLAPHHD--DVFIVQLEGEKCWKLHKPVTELAR 179 Query: 154 CPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV-----GFRAPNTR 208 D + E + L PGD LY+P G H Y E+A ++S ++ Sbjct: 180 IYSKDFTSEEIGEPTHEFTLRPGDFLYMPRGTIHHAYVPESADSHSTHITISTYQKQTVG 239 Query: 209 ELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQ 266 + + A ++ + +P R P+ VL +E ++ L + EH KQ Sbjct: 240 DCLMDIAPDLISSAMDSCIELRKGLPNRFLPSCVLSKET-----VVTALSSVLEHVKQ 292 >UniRef50_B7G6P1 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6P1_PHATR Length = 351 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 5/85 (5%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---DLLQVD--PFEAIIDEELEP 175 G PH D + F +Q G++RW+V LQ + P D ++ D E +D L+P Sbjct: 52 GFAPHYDDIEAFCLQLEGKKRWKVYAPLQKSERLPRTSSEDYVEADLRDVEPALDVVLKP 111 Query: 176 GDILYIPPGFPHEGYALENAMNYSV 200 GD+LY+P G+ H+ ++ YS+ Sbjct: 112 GDVLYMPRGWIHQACTIDGTDGYSL 136 >UniRef50_B4B491 Cupin 4 family protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B491_9CHRO Length = 390 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. 
Identities = 46/220 (20%), Positives = 92/220 (41%), Gaps = 8/220 (3%) Query: 12 FLERHWQKRPVVLKRG----FNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGP 67 LE++W+K P+++ R ++ I + D + L D L+ Sbjct: 20 LLEKYWEKSPLLVARNHPDYYSELISLKNIDSILRLYGPKSSDVDLIKENSFFSAGGEVD 79 Query: 68 FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 F +SL+++ ++ +P + L + + + + S G H D Sbjct: 80 FNQIYQAYSLGYSLVMRKIHERWQPLSVLHKNLEAFLNHPVGINLYMTSKNSQGFKAHFD 139 Query: 128 QYDVFIIQGTGRRRWRVGEK---LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 +DVFI+Q G ++W++ + L + + D + L GD+LYIP G Sbjct: 140 THDVFILQVEGSKQWKIYDSPITLPVISDLKYTDKFINQLKSPTAEYCLNKGDLLYIPRG 199 Query: 185 FPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQREL 223 + HE Y + +++ +VG + +LI+ + Q+E+ Sbjct: 200 YIHEVYTDNSFSVHLTVGIHSLKWFDLINSAVTKLAQKEV 239 >UniRef50_B0CEG8 Cupin 4 family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CEG8_ACAM1 Length = 416 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 52/220 (23%), Positives = 94/220 (42%), Gaps = 19/220 (8%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEV------DSRLVSHQDG----K 60 DF + +W+ + + L R +F + E L ++++ + RLV + Sbjct: 29 DFFQTYWETKTLYLPRNDASFYGSVLQPEDIDLLLQNKALLADYNNFRLVDQGNKLSLED 88 Query: 61 WQVSHGPFESYDHLGETNWSLLVQ----AVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 W H + Y + +SLL Q +N H+ L L L + Sbjct: 89 WCDRHSKSQQYFINNDKLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRTNIY 148 Query: 117 VP---GGGVGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEE 172 + G+ PH D++DVFI+Q TG + W++ +++ H + + E + Sbjct: 149 ITPPQAQGLAPHYDEHDVFILQITGEKEWKLYHSPVELPSHIRDQSIGRHKLAEPELTVM 208 Query: 173 LEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELI 211 L+PGD+LYIP G H+ + E +++ S+G EL+ Sbjct: 209 LQPGDLLYIPRGVVHQAASQETTSVHASLGLYPTFAYELL 248 >UniRef50_D2A374 Putative uncharacterized protein GLEAN_07936 n=1 Tax=Tribolium castaneum RepID=D2A374_TRICA Length = 568 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 52/203 (25%), Positives = 93/203 (45%), Gaps = 27/203 (13%) Query: 12 FLERHWQKRPVVLKRG----FNNFIDPISPDEL---AGLAMESEVDSRLVSHQDGKWQVS 64 F + +W+++P+ +KRG + + +D S D++ L VD +V++++G+ QV Sbjct: 147 FFKTYWEQKPLYIKRGNRSYYTHILDSSSLDKILRNNSLFFTRNVD--VVTYENGEKQVF 204 Query: 65 H-----GPFESYDHLGETNWSLLV---QAVNH-WHEPTAALMRPFRELPDWRIDDLMISF 115 + P +D+ G S+ V Q NH H A L F + + Sbjct: 205 NQEGRATPSALWDYYG-NGCSIRVLNPQTYNHKVHLLLATLQEYFGTMVGANV-----YL 258 Query: 116 SVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGE--KLQMKQHCPHPDLLQVDPFEAIIDEE 172 + PG G PH D + F++Q GR+ W++ + + P+ + D E ++ Sbjct: 259 TPPGSQGFAPHYDDIEAFVVQLEGRKHWKLYQPKSEDVLARFSSPNFKREDLGEPFMELT 318 Query: 173 LEPGDILYIPPGFPHEGYALENA 195 L G++LY P G HEG E++ Sbjct: 319 LNAGELLYFPRGTIHEGRTDEDS 341 >UniRef50_Q7N884 Similar to unknown protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N884_PHOLL Length = 388 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 71/305 (23%), Positives = 122/305 (40%), Gaps = 56/305 (18%) Query: 7 LNWP----DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 +N+P DFLE ++K+P V K+ +++ D I E+ + S + S +G Sbjct: 4 INFPIDKKDFLENFFEKKPCVFKKIYDD--DFIKHSEIENIFNRSNLPSF-----EGIKL 56 Query: 63 VSHGPF------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + +G ESY+ LG + + + + A L+ + + +ID L + S Sbjct: 57 MYNGIIDKTEYIESYNDLGTRRYRYIYSKLYDYLNSGATLVAN-GIINETKIDQLAKACS 115 Query: 117 V---------------PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ 161 PH D D+F IQ +G++RW + K P P L Sbjct: 116 SFTDSHPFSSLYLSYGEKSSFKPHWDSRDIFAIQLSGKKRWII-----YKPSFPDPVYLH 170 Query: 162 VD---------PFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELI 211 P E D LE GD+LY+P G+ H L E ++ SVG P E I Sbjct: 171 QSKDMENTYPCPSEPYDDFVLETGDVLYLPRGWWHNPLPLGEETIHLSVGIFPPYAHEYI 230 Query: 212 SGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEF 271 + + + ++G +P A E+D L + +++ I + + ++ F Sbjct: 231 NWLSYKITDIDIGRK-----SLPRSWKQA---KDEIDILAKYVIDNITSEDSYNEFLKSF 282 Query: 272 ISQSR 276 + R Sbjct: 
283 SDEKR 287 >UniRef50_A3M7T2 Putative uncharacterized protein n=2 Tax=Acinetobacter baumannii ATCC 17978 RepID=A3M7T2_ACIBT Length = 382 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 55/211 (26%), Positives = 89/211 (42%), Gaps = 24/211 (11%) Query: 19 KRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHLGETN 78 K+P + K ++ IS +++ L ++ R +G ESY+ LG Sbjct: 19 KKPYLFKSAIDS--SGISWNDVNELYSRGDISHRDFKLMNGYEVPKKEYIESYECLGVIE 76 Query: 79 WSLLVQAVNHWHEPTAALMR------PFRELPDWRIDDLMISFSVPGG--------GVGP 124 + + + + A L+R PF + +I + ++ GG Sbjct: 77 YRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEARTLVGGYAAFSSKSSYKS 136 Query: 125 HLDQYDVFIIQGTGRRRWRVGEK-----LQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 H D DV+ +Q GR+RW + + L M+Q PD+ + P E +D LE GDIL Sbjct: 137 HWDTRDVYAVQLLGRKRWILRKPNFEFPLYMQQTKNFPDIKE--PEEIYMDVILEAGDIL 194 Query: 180 YIPPGFPHEGYAL-ENAMNYSVGFRAPNTRE 209 YIP G+ H+ L E + +V AP E Sbjct: 195 YIPRGWWHDPLPLDEETFHLAVATFAPTGFE 225 >UniRef50_C1EHB5 Predicted protein (Fragment) n=2 Tax=Micromonas RepID=C1EHB5_9CHLO Length = 387 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 55/214 (25%), Positives = 85/214 (39%), Gaps = 38/214 (17%) Query: 12 FLERHWQKRPVVLKRG-----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-- 63 F+ W++RP + R F+ + DE L M + + + S++DG + Sbjct: 17 FMRDIWERRPAYVSRNAHKGYFDGLLSKADIDEWLRAGKMRYQRNVDVTSYKDGVRRTHN 76 Query: 64 -----SHGPFESYDHLG----ETNW--------SLLVQAVNHWHEP---TAALMRPFREL 103 S G + G +T W SL V W +P T A + F Sbjct: 77 LNDDGSGGVDATTGEPGFADADTVWRRFEQEGCSLRVLHPQRWRDPLWKTLAALERF--- 133 Query: 104 PDWRIDDLMISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPD 158 W + P G PH D D FI+Q G++ WRV + +M P+ Sbjct: 134 --WNCSTGCNCYLTPADSQGFSPHYDDIDAFILQLEGKKLWRVYPPRSEAEMLPRYSSPN 191 Query: 159 LLQVDPFEAIIDEELEPGDILYIPPGFPHEGYAL 192 Q D E +++ LEPGD+LY+P G H+ + Sbjct: 192 FGQDDVGEPVLEVILEPGDLLYMPRGTVHQANCV 225 >UniRef50_UPI0000E45D23 PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D23 Length = 555 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 53/230 (23%), Positives = 98/230 (42%), Gaps = 23/230 (10%) Query: 11 DFLERHWQKRPVVLKR---GFNNFIDPISPDELAGLAMESEV----DSRLVSHQDGKWQV 63 D+ + ++++P+ LKR G+ F D S EL+ + E++V + + ++ DGK + Sbjct: 109 DYFKNIFERKPLFLKRHKPGY--FTDIFSSKELSNILKENDVQFTRNIDVTTYTDGKRE- 165 Query: 64 SHGPFES------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 +H P +D+ S+ V + L+ +E + I + Sbjct: 166 THNPTGRAQPQVVWDYYN-NGCSVRVLNPQTYSTRVWQLLAALQEFFGCFVG-ANIYLTP 223 Query: 118 PGG-GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEEL 173 PG G PH D + F++Q G++ W++ + + P + D + I+D L Sbjct: 224 PGTQGFAPHYDDIEAFVLQLEGKKHWKLYNQRSPAEVLPRFSSSNFTDADIGQPILDTTL 283 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQREL 223 EPGD+LY P G H+ + + A + ++VL R L Sbjct: 284 EPGDLLYFPRGVIHQASTPSETHSLHITISACQ-KNTWGDLMEHVLTRAL 332 >UniRef50_Q9H6W3 Lysine-specific demethylase NO66 n=17 Tax=Eumetazoa RepID=NO66_HUMAN Length = 641 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 9/75 (12%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEAIIDEELE 174 G PH D + F++Q GR+ WRV E+L + P+ Q D E ++ LE Sbjct: 336 GFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEELALTSS---PNFSQDDLGEPVLQTVLE 392 Query: 175 PGDILYIPPGFPHEG 189 PGD+LY P GF H+ Sbjct: 393 PGDLLYFPRGFIHQA 407 >UniRef50_A5PK74 Lysine-specific demethylase NO66 n=1 Tax=Bos taurus RepID=NO66_BOVIN Length = 667 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 9/75 (12%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEAIIDEELE 174 G PH D + F++Q GR+ WRV E+L + P+ Q D E ++ LE Sbjct: 364 GFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEELALTSS---PNFSQDDLGEPVLQTVLE 420 Query: 175 PGDILYIPPGFPHEG 189 PGD+LY P GF H+ Sbjct: 421 PGDLLYFPRGFIHQA 435 >UniRef50_D0NRY0 Nucleolar protein, putative n=2 Tax=Phytophthora infestans T30-4 RepID=D0NRY0_PHYIN Length = 676 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 56/260 (21%), Positives = 114/260 (43%), Gaps = 44/260 (16%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGP-FE 69 +F E +W++RP+ +KR F ++ D G + E+D L +H + +G + Sbjct: 73 EFYENYWEQRPLAIKRNFPSYYD--------GWFSKQEIDRILKTHT-----LEYGTDVD 119 Query: 70 SYDHLGETNWSL-------LVQAVNHWHEPTAALMRPFRELPD--WRIDDLM-------- 112 ++ +T +L Q H+ + + + ++ D W++ + Sbjct: 120 LTKYVDDTRHTLNPPGSATAKQVWKHYDDGCSVRLLCPQKFSDDVWKLLATLEDEWGCMA 179 Query: 113 --ISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQ---MKQHCPHPDLLQVDPF 165 ++ P G PH D + F++Q G + W+V + L + P + D Sbjct: 180 GANTYLTPKNTQGFAPHFDDIEAFLLQTEGCKHWKVYKPLNESDVLARYPSGNFKAEDLG 239 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYAL--ENAMNYSVGFRAPNTRELISGFADYVLQREL 223 + ++ +LE GD+LY P GF H+ A +++++ +V NT + F + +L + L Sbjct: 240 KPTLEVDLEQGDLLYFPRGFIHQARAHKEKHSLHLTVSTGQQNT---MGNFLEVLLPQAL 296 Query: 224 GGNYYSDPDVPPRAHPADVL 243 G ++ ++ R+ P D L Sbjct: 297 AGAINTNVEL-RRSLPRDYL 315 >UniRef50_D2SA69 Cupin 4 family protein n=2 Tax=Actinomycetales RepID=D2SA69_9ACTO Length = 436 Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 20/85 (23%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPF-------------EA 167 G PH D +DVF++Q G +RWR+ E + D L+ P+ E Sbjct: 155 GFSPHYDVHDVFVLQVAGEKRWRIHEPVLT-------DPLRTQPWNERGAAVAAAAEREP 207 Query: 168 IIDEELEPGDILYIPPGFPHEGYAL 192 +ID L PGD LY+P G+ H AL Sbjct: 208 LIDAVLRPGDALYLPRGYLHSATAL 232 >UniRef50_B1FB07 Cupin 4 family protein n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FB07_9BURK Length = 380 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 48/177 (27%), Positives = 74/177 (41%), Gaps = 15/177 (8%) Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQM--KQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 G H D + VF +Q GR+RW V E +H P E +D +E GDILY Sbjct: 135 GKHWDTHSVFAVQMMGRKRWLVYEPTHALPLKHQRSTGKQSECPAEPYMDVTIETGDILY 194 Query: 181 IPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +P G+ H L E + +VG + I AD E+ G++ + P Sbjct: 195 LPRGWWHTAIPLNEETFHLAVGVHESTISDYIKYLAD-----EIIGDFDAFRQTIPLGER 249 Query: 240 ADV----LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 D+ + E+ ++ E LIN + ++ F E SR L + +Q D+ Sbjct: 250 RDIDLRLVANELARIVEDRNVLINYNDRRRRNFRE---ASRPNLQLHAFRSKFQLDK 303 >UniRef50_B4Q068 Lysine-specific demethylase NO66 n=5 Tax=Sophophora RepID=NO66_DROYA Length = 683 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 7/78 (8%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV-----GEKLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 G PH D + F+IQ GR+RWR+ G + + D Q+ E I+DE LE Sbjct: 377 GFAPHYDDIEAFVIQVEGRKRWRLYEPPSGSDQLCRNSSSNFDQEQLG--EPILDEVLEA 434 Query: 176 GDILYIPPGFPHEGYALE 193 GD+LY P G H+ E Sbjct: 435 GDLLYFPRGTVHQAITEE 452 >UniRef50_Q016L9 [S] KOG3706 Uncharacterized conserved protein n=1 Tax=Ostreococcus tauri RepID=Q016L9_OSTTA Length = 455 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 6/109 (5%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQM----KQHCPHPDLLQVDPFEAIIDEELEPG 176 G PH D D +++Q G +RWRV Q + + ++ E + D LE G Sbjct: 151 GFAPHYDDIDAYVLQIEGEKRWRVYAPFQSDELPRTSSKNYTQEEIAGLEVLFDGVLEAG 210 Query: 177 DILYIPPGFPHEGYALENA--MNYSVGFRAPNTRELISGFADYVLQREL 223 D LYIP GF H+ A ++ ++ NT A + R L Sbjct: 211 DFLYIPRGFVHQAECSSRAHSVHATISTNQANTHADAFEIATQTIARSL 259 >UniRef50_D0L9V4 Cupin 4 family protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L9V4_GORB4 Length = 414 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 8/102 (7%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---DLLQVDPFEAI----IDEEL 173 G PH D +DVF++Q G +RWRV + P Q++ + I+ L Sbjct: 140 GFDPHYDVHDVFVLQVAGTKRWRVHRPVHTHPLATQPWTDHRAQIERRASDDAPEIEAVL 199 Query: 174 EPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGF 214 PGD LY+P G+ H AL + +++ ++G A R++++ Sbjct: 200 SPGDALYLPRGWIHSADALGDTSIHLTIGVGAVTVRDVVAAI 241 >UniRef50_Q10ZZ1 Cupin 4 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZZ1_TRIEI Length = 385 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 60/238 (25%), Positives = 96/238 (40%), Gaps = 41/238 (17%) Query: 7 LNWPDFLERHWQKRPVVL-KRGFNNFIDPISPDELAGLAMESEV---DSRLVSHQDGKWQ 62 L +FLE +W K+ + + +G +F D S ++L L ++ D RL DGK Sbjct: 11 LKQEEFLENNWTKKAIAISNKGEKDFTDLFSWEKLNYLLNFHQIKYPDVRLAF--DGK-- 66 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM--RPFRELPDWRIDDLMISFSV--- 117 E ++ T W E A L+ + R +P+ I +S+ + Sbjct: 67 ----VLEEKENRNFTQWC----------EKGATLILDQIHRRIPEVAIFTSKLSYELGYP 112 Query: 118 ----------PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP--F 165 G PH D +DVFI+Q G ++W V K P+ P Sbjct: 113 TQVNAYCSWSSKKGFSPHYDTHDVFILQVEGNKQWYVYND-TFKYPLPNQKSSSFTPPEK 171 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRE 222 EA + L PGD+LYIP G H E +++ ++G + +L+ + RE Sbjct: 172 EAYLSCILHPGDVLYIPRGHWHYAVTKEEPSIHLTLGIHSSTGVDLLEWLIGQLQYRE 229 >UniRef50_A9C261 Cupin 4 family protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C261_DELAS Length = 298 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 38/152 (25%), Positives = 61/152 (40%), Gaps = 12/152 (7%) Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD---PFEAIIDEELEPG 176 G G H D +DV IQ G++ WRV K P D P + I D LE G Sbjct: 135 GSFGSHWDTHDVMAIQLIGKKHWRVYAP-TYKSPLPGQTSKSFDSTCPTDPIFDGVLEAG 193 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 D+LY+P G+ HE + ++ ++G P+ ++ F L N ++ Sbjct: 194 DLLYVPRGWWHEVLPIGETLHVAIGIYPPHVLNYVAWF--------LEKNIKHHEELRKT 245 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWF 268 + + ++ E +N PE K + Sbjct: 246 LRSCTTTKEVVSNACHVLTEGLNDPEVLKAFM 277 >UniRef50_A8QFQ3 Lysine-specific demethylase NO66 n=2 Tax=Brugia malayi RepID=NO66_BRUMA Length = 710 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 5/77 (6%) Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAII--DEELE 174 G PH D D F++Q GR+ W++ +M P + D ++ D+ LE Sbjct: 400 AGFAPHWDDIDAFLLQLEGRKHWKIYAPDSDDEMLPRLPSGNFTDNDVINRMLVFDDWLE 459 Query: 175 PGDILYIPPGFPHEGYA 191 GD+LYIP G+ H+G+A Sbjct: 460 QGDMLYIPRGYIHQGFA 476 >UniRef50_UPI000192663F PREDICTED: similar to Myc-induced nuclear antigen n=1 Tax=Hydra magnipapillata RepID=UPI000192663F Length = 437 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 46/196 (23%), Positives = 81/196 (41%), Gaps = 20/196 (10%) Query: 12 FLERHWQKRPVVLKR---GFNNFIDPISP--DELAGLAMESEVDSRLVSHQDGKWQVSHG 66 F E W+K+P+ +KR G+ + +S + LA +E E D + + D + ++ + Sbjct: 55 FFEEFWEKKPLYIKRENSGYYGDLFSLSSMKEILAAHELEFETDVNVCRYVDNEKELLNE 114 Query: 67 ----PFESYDHLGETNWSLLVQAVNHWHEPTA------ALMRPFRELPDWRIDDLMISFS 116 + +D L A H+P LM + + Sbjct: 115 DGCLTVDKFDKLMNDK-----HATFQLHQPQRYGTVLWQLMEKMETYFGCLVGSNVYITP 169 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 G+ PH D +VFI+Q G + W++ + + DL Q + I++ LEPG Sbjct: 170 KESQGLAPHCDDVEVFILQLEGTKHWKLYKPMVELSRDYTQDLSQDSIGKPIMELTLEPG 229 Query: 177 DILYIPPGFPHEGYAL 192 D+LY P G H+ ++ Sbjct: 230 DLLYFPRGTIHQARSV 245 >UniRef50_B4M7P8 Lysine-specific demethylase NO66 n=3 Tax=Drosophila RepID=NO66_DROVI Length = 907 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 30/79 (37%), Positives = 39/79 (49%), Gaps = 17/79 (21%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP-DLL---------QVDPFEAIID 170 G PH D + F++Q GR+RWR+ L HP D+L Q + E + D Sbjct: 602 GFAPHYDDIEAFVLQVEGRKRWRLYSPL-------HPSDVLARNSSGNYSQAELGEPLFD 654 Query: 171 EELEPGDILYIPPGFPHEG 189 LEPGDILY P G H+ Sbjct: 655 AVLEPGDILYFPRGTVHQA 673 >UniRef50_B4GUZ2 Lysine-specific demethylase NO66 n=2 Tax=Drosophila persimilis RepID=NO66_DROPE Length = 687 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 24/84 (28%), Positives = 44/84 (52%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 G PH D + F++Q G++RWR+ + +L Q + + I+D L+PGD+LY Sbjct: 383 GFAPHYDDIEAFVLQVEGKKRWRIYAPTKELPRESSGNLSQTELGDPIMDIVLKPGDLLY 442 Query: 181 IPPGFPHEGYALENAMNYSVGFRA 204 P G+ H+ +++ + + A Sbjct: 443 FPRGWIHQAITEKDSHSLHITLSA 466 >UniRef50_A6W7N8 Cupin 4 family protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6W7N8_KINRD Length = 434 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 39/125 (31%), Positives = 60/125 (48%), Gaps = 16/125 (12%) Query: 113 ISFSVPGG-GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEA- 167 + + PG G PH D +DV ++Q GR+ W + +L +K L DP Sbjct: 158 VYVTPPGAQGFKPHHDTHDVVVLQVDGRKHWTIHPPAVELPLKSQ--PSTQLGPDPVGGR 215 Query: 168 --IIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELIS------GFADYV 218 ID LEPGD LY+P G+ H E+ +++ +VG A ++++ G AD Sbjct: 216 PPAIDTVLEPGDALYLPRGWLHSARTTEDRSIHLTVGLLATTWADVLTDAVASAGVADVA 275 Query: 219 LQREL 223 L+R L Sbjct: 276 LRRAL 280 >UniRef50_B5W5P2 Cupin 4 family protein n=2 Tax=Arthrospira RepID=B5W5P2_SPIMA Length = 387 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 64/279 (22%), Positives = 113/279 (40%), Gaps = 47/279 (16%) Query: 116 SVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKL--------QMKQHCPHPDLLQVDPFE 166 S PG G H D ++VFI+Q +GR+ WRV + Q P PD P+ Sbjct: 121 SFPGHQGFACHYDSHEVFILQISGRKHWRVFSDTFIYPLSENRSSQFSP-PD---TQPY- 175 Query: 167 AIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVL-QRELG 224 ID + PGD+LYIP G H A+ E +++ ++G + F+D++ Q + Sbjct: 176 --IDAIINPGDLLYIPRGHWHYAIAIDEPSLHLTLGIDCQTGID----FSDWLTSQLQQH 229 Query: 225 GNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPP 284 + + + ++H + Q + L + LE++ + ++ E + Q + +L + P Sbjct: 230 PQWRKNLPLLNKSHRENC-RQHLQNLVQNWLEILESEDLINRYLDEQLLQGQPDLQLGFP 288 Query: 285 EP----------------PYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP 328 P QP I G+ ++ GG ++ G D + + S Sbjct: 289 SQIGYDIFPQGQETKFYRPQQPVYITQLTPTGKFEIKTGGKKISLTGLDQHILEKIFTST 348 Query: 329 HRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 LD + D D + +L+ LV +G Sbjct: 349 EFSGLD--------IQQWLQDFDWDTEIVPLLSRLVKAG 379 >UniRef50_Q5ZMM1 Lysine-specific demethylase NO66 n=3 Tax=Eumetazoa RepID=NO66_CHICK Length = 601 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 3/83 (3%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGD 177 G PH D + F++Q G++ WRV + P +L Q + E +++ LE GD Sbjct: 296 GFAPHYDDIEAFVLQLEGKKHWRVYGPRTSSEALPQFSSANLTQAELGEPLLEVVLEAGD 355 Query: 178 ILYIPPGFPHEGYALENAMNYSV 200 +LY P GF H+ L +A + + Sbjct: 356 LLYFPRGFIHQADCLPDAHSLHI 378 >UniRef50_Q28VG0 Cupin 4 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28VG0_JANSC Length = 392 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 57/229 (24%), Positives = 97/229 (42%), Gaps = 36/229 (15%) Query: 1 MEYQLTLNW------PD-FLERHWQKRPVVLKRGF-NNFIDPISPDELAGL--AMESEVD 50 M + +W PD F +++K+P+++KRG F D +S E+ + M V Sbjct: 1 MTNTFSFDWAIAPETPDTFFAEYFEKKPMLIKRGQPGYFSDLLSYGEIDRVVSTMGLHVP 60 Query: 51 SRLVSHQDGKWQVSHGPFES-------YDHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 V+ DG + +E+ + L ++++ ++ A R Sbjct: 61 EINVTRADGNITPADFAYETGQIDPVRVNQLHADGATVILSGLHERLPALARYCRAMEAA 120 Query: 104 PDWRIDDLMISFSVPGG-GVGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQ 161 R+ I + PG G PH D +DV ++Q G + WR+ G +++ L Sbjct: 121 MSARVQ-TNIYMTPPGNQGFNPHYDGHDVLVLQVAGTKEWRIYGTPVELP--------LA 171 Query: 162 VDPFEAIID--EE-----LEPGDILYIPPGFPHEGYAL-ENAMNYSVGF 202 FE +D EE LEPGD +YIP G H+ A E +++ + G Sbjct: 172 DQAFERGMDVGEEAQRFVLEPGDAVYIPRGMAHDAVATDETSLHITTGL 220 >UniRef50_C8XBP6 Cupin 4 family protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XBP6_NAKMY Length = 452 Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 68/293 (23%), Positives = 118/293 (40%), Gaps = 47/293 (16%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRG--FNNFIDPISPDELAGLAMESEVDSRLVSHQD 58 ++ + ++ DF +R+W + P++ ++F D S D + L E + + + Sbjct: 23 VQRCIAIDADDFAQRYWAQAPLLTTAAELNDDFSDLFSADSVDELVSERGLRTPFLRMAK 82 Query: 59 GKWQVSHGPFESYDHLGET----------------NWSLLVQAVNHWHEPTAALMRPFRE 102 +S F G T +L++QA++ P L+R E Sbjct: 83 NGSVLSSASFTRGGGAGATITDQVADDKVLAQLAGGATLVLQALHRTWPP---LVRFGSE 139 Query: 103 LPDWRIDDLMISFSVP---GGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD- 158 L + I+ + G H D +DVF++Q G + WR+ E + + PH Sbjct: 140 LAAELGHPVQINAYITPPQNQGFASHYDTHDVFVLQIAGTKHWRIHEPV-LPDPLPHQTW 198 Query: 159 ---LLQVDPFEA---IIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELI 211 QV A ID L PGD LY+P G+ H A E +++ ++G + Sbjct: 199 DGRRAQVQDRAAQAPAIDALLRPGDALYLPRGYLHSAVAQGELSIHLTIG---------V 249 Query: 212 SGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDK----LREMMLELINQ 260 Y L REL DP++ R+ P V ++D LR+ L+++ Sbjct: 250 HPLTGYDLARELIAAAEDDPEL-RRSLPMGVDVTDVDAMATHLRQAAQRLVDR 301 >UniRef50_A4RZ92 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RZ92_OSTLU Length = 515 Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 34/126 (26%), Positives = 50/126 (39%), Gaps = 30/126 (23%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH--CPHPDLLQ--VDPFEAIIDEELEPG 176 G PH D D F++Q G +RWRV E + + H + Q + + D+ LE G Sbjct: 210 GFAPHYDDIDAFVLQIEGAKRWRVYEPFEDETHPRTSSRNFTQEEIATQRVVFDDVLEAG 269 Query: 177 DILYIPPGFPHEG--------------------------YALENAMNYSVGFRAPNTREL 210 D LY+P G+ H+ AL NA+ ++ RA R Sbjct: 270 DFLYLPRGWIHQAECSSSTHSVHATLSTNQSNAPADALEIALNNALASTIDGRAELRRSF 329 Query: 211 ISGFAD 216 +S D Sbjct: 330 VSTLND 335 >UniRef50_B8BSJ2 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BSJ2_THAPS Length = 830 Score = 48.1 bits (113), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 7/76 (9%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL-------LQVDPFEAIIDEEL 173 G PH D DVFI+Q G +RWRV + ++ P ++ E ++D L Sbjct: 457 GFAPHYDDVDVFILQLEGYKRWRVYAPMNKQETLPRVSSRDYTEKEVEESMGEEVLDVVL 516 Query: 174 EPGDILYIPPGFPHEG 189 PGD+LY+P G+ H+ Sbjct: 517 VPGDVLYLPRGWIHQA 532 >UniRef50_Q1DFZ7 Cupin family protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DFZ7_MYXXD Length = 295 Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 40/143 (27%), Positives = 62/143 (43%), Gaps = 11/143 (7%) Query: 78 NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG--GVGPHLDQYDVFIIQ 135 ++L ++ + H A L R F RI+ + + P G G G H D +VFI+Q Sbjct: 81 GYTLALRQPDLHHPDLAQLARAFSAELHGRIN--LHIYCTPAGHHGFGWHCDPEEVFILQ 138 Query: 136 GTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEA-----IIDEELEPGDILYIPPGFPHEGY 190 GR+ + + E P P+ + A + L GD +YIP G H Sbjct: 139 TAGRKDYLLREN--TLHPVPLPESVPSGSLAAQEKTPVETHSLSAGDFIYIPGGHWHMAQ 196 Query: 191 ALENAMNYSVGFRAPNTRELISG 213 A E A++ S+G P +L+ G Sbjct: 197 ATEEALSISIGLMPPTLLDLLDG 219 >UniRef50_B5DUH6 Lysine-specific demethylase NO66 n=2 Tax=Drosophila pseudoobscura pseudoobscura RepID=NO66_DROPS Length = 946 Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 24/84 (28%), Positives = 43/84 (51%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 G PH D + F++Q G++RWR+ + +L Q + + I+D L PGD+LY Sbjct: 642 GFAPHYDDIEAFVLQVEGKKRWRIYAPTKELPRESSGNLSQTELGDPIMDIVLMPGDLLY 701 Query: 181 IPPGFPHEGYALENAMNYSVGFRA 204 P G+ H+ +++ + + A Sbjct: 702 FPRGWIHQAITEKDSHSLHITLSA 725 >UniRef50_B4V6J8 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V6J8_9ACTO Length = 394 Score = 47.8 bits (112), Expect = 7e-04, Method: Compositional matrix adjust. 
Identities = 61/241 (25%), Positives = 100/241 (41%), Gaps = 33/241 (13%) Query: 41 AGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPT----AAL 96 AG A+ + S L +++ G P + + L E SL++ A+ H P A L Sbjct: 63 AGGAVPATAYSILRTNRRGVSWYQPQPADFHARLAEGA-SLVIDAIEQIHPPVREAAAGL 121 Query: 97 MRPFRE------LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRV-GEKLQ 149 R FR W ++ G G H D +DV ++Q G +RW+V G Q Sbjct: 122 ERFFRTPVQVNAYASWTAEE----------GFGTHWDDHDVVVLQLEGSKRWKVYGPTRQ 171 Query: 150 MK--QHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFR-APN 206 + P++ DP I+ L PGD+LY+P G+ H A + + + F A Sbjct: 172 APAWRDVETPEVPTGDPIADIV---LTPGDVLYLPRGWWHAVSADQGTASLHLTFGLATQ 228 Query: 207 TRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQ 266 T G+ L+ +L + DV PR + + +R+ +L ++ P + Sbjct: 229 TGAEFLGW----LRDDLRASLTVRADV-PRFGTTEERADYLAAVRKDVLAALDAPAVLDR 283 Query: 267 W 267 W Sbjct: 284 W 284 >UniRef50_B4JMQ2 Lysine-specific demethylase NO66 n=1 Tax=Drosophila grimshawi RepID=NO66_DROGR Length = 723 Score = 47.4 bits (111), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 17/79 (21%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP----------DLLQVDPFEAIID 170 G PH D + F++Q GR+RWR+ + P P +L Q + I D Sbjct: 415 GFAPHYDDIEAFVLQVEGRKRWRLYD-------APSPNDVLARTSSGNLKQQQLSKPIFD 467 Query: 171 EELEPGDILYIPPGFPHEG 189 E LE GD+LY P G H+ Sbjct: 468 EVLEAGDLLYFPRGCVHQA 486 >UniRef50_A0YJB4 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YJB4_9CYAN Length = 386 Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 30/91 (32%), Positives = 51/91 (56%), Gaps = 7/91 (7%) Query: 116 SVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEAIIDE 171 S PG G H D ++VFI+Q +G + WRV + + +H +LL + I++ Sbjct: 120 SFPGSQGFACHYDSHEVFILQISGDKHWRVFSPTFEFPLSKH--RSNLLDPPTTDPYINQ 177 Query: 172 ELEPGDILYIPPGFPHEGYALEN-AMNYSVG 201 L+PGD+LYIP G H A++ +++ ++G Sbjct: 178 VLKPGDLLYIPRGHWHYAVAVDQPSLHLTLG 208 >UniRef50_Q1D4G2 Cupin family protein n=2 Tax=Myxococcus xanthus DK 1622 RepID=Q1D4G2_MYXXD Length = 442 Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust. Identities = 58/247 (23%), Positives = 91/247 (36%), Gaps = 52/247 (21%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDEL----AGLAMESEVDS----- 51 +E +W F+ R+W +RPV+ K P + D++ AG S Sbjct: 4 LEIATRFDWDTFVRRYWNQRPVLFK---GTQASPFTVDDVFEASAGATQRYLSRSYEPAS 60 Query: 52 ---------RLVSHQDGKW--QVSHGPFESYD-----HLGETNWSLLVQAVNHWHEPTAA 95 RL + +W + S G + YD LGE ++L++ ++ + Sbjct: 61 RPDVTFTVDRLRQLRSREWLPRKSDGSLDGYDARIASQLGERRYALIIATMHASGFQLWS 120 Query: 96 LMRPFRELPDWRIDDLMISFSVPGGG--------------VGPHLDQYDVFIIQGTGRRR 141 R F L +P G VG HLD++ F+ GR+R Sbjct: 121 RQRAF-------FSGLWQRVGMPVTGGITSLFHGTYEHSPVGVHLDRFTTFMFALRGRKR 173 Query: 142 WRVGEKLQMKQHCPHPDLLQVDPFEAI-IDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 R K + +L P+ A E+EPGDILY P + H G + + S+ Sbjct: 174 MRFWHKRPWSEDVS--TILDYQPYLASSFVAEVEPGDILYWPSTYYHVGESAGAGVASSL 231 Query: 201 GFRAPNT 207 P T Sbjct: 232 NVGIPIT 238 >UniRef50_A1R1T1 Putative cupin superfamily protein n=2 Tax=Micrococcineae RepID=A1R1T1_ARTAT Length = 388 Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 45/194 (23%), Positives = 74/194 (38%), Gaps = 22/194 (11%) Query: 20 RPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHLGET-- 77 R +L RG +F D S D + L + + + G + F S +G T Sbjct: 26 RTALLTRGVGDFSDLFSADAVDELISRRGLRTPFLRVAKGGSTLPESSFTSPAGVGATIS 85 Query: 78 --------------NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 +L++QA++ EP ++ + G Sbjct: 86 DQLDDTQLWRKFADGATLVLQALHRTWEPVSSFSTQLSTELGHPVQANAYITPPQNRGFD 145 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------DLLQVDPFEAIIDEELEPGD 177 H D +DVF++Q G +RW + E + + P + + +A ID LEPGD Sbjct: 146 DHYDVHDVFVLQIEGTKRWIIHEPVHVDPLRSQPWTDRRSAVAEAAQGKAYIDTVLEPGD 205 Query: 178 ILYIPPGFPHEGYA 191 +LY+P G+ H A Sbjct: 206 VLYLPRGWLHAAEA 219 >UniRef50_B4R4H1 Lysine-specific demethylase NO66 n=2 Tax=melanogaster subgroup RepID=NO66_DROSI Length = 847 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 3/76 (3%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEA---IIDEELEPGD 177 G PH D + F+IQ GR+RW + E + H D + IIDE L GD Sbjct: 541 GFAPHYDDIEAFVIQVEGRKRWLLYEPPKEADHLARISSGNYDQEQLGKPIIDEVLSAGD 600 Query: 178 ILYIPPGFPHEGYALE 193 +LY P G H+ E Sbjct: 601 VLYFPRGTVHQAITEE 616 >UniRef50_A9UZN8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UZN8_MONBE Length = 432 Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust. Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 9/80 (11%) Query: 118 PGG-GVGPHLDQYDVFIIQGTGRRRWRV-----GEKLQMKQHCPHPDLLQVDPFEAIIDE 171 PG G PH D + I+Q G +RWR+ GE+L + Q + E I+D Sbjct: 147 PGAQGFAPHYDDIEALILQLEGSKRWRLYNNPTGERLP---RTSSRNFDQSELSEPILDV 203 Query: 172 ELEPGDILYIPPGFPHEGYA 191 L+PGD LY P G H+ + Sbjct: 204 VLQPGDFLYFPRGMAHQAVS 223 >UniRef50_UPI000186D1B6 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D1B6 Length = 467 Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 45/199 (22%), Positives = 84/199 (42%), Gaps = 22/199 (11%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF-E 69 +F E W+K+P+ + R N + + + A+ SE D + D + F E Sbjct: 35 EFFENFWEKKPLYISRNNNEYYNELCSMNAFEKAL-SEKDMYFTKNIDVTSYIDGQRFTE 93 Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD-WRIDDLMISFS----------VP 118 + D G+ S + N + L+ P +P+ W ++ + F P Sbjct: 94 NLD--GKATVSNIWDFFNE--GKSIRLLNPQTFIPNVWLLNTNLQEFFNCFVGANMYLTP 149 Query: 119 GG--GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G G PH D + F++Q G++ W+V + ++ + + + + I+ L Sbjct: 150 AGTQGFAPHYDDIEAFVLQLEGQKHWKVYNPRDSSEVLARESSKNFKEDEIGKPILKVTL 209 Query: 174 EPGDILYIPPGFPHEGYAL 192 +PGD+LY P G+ H+ L Sbjct: 210 KPGDMLYFPRGYIHQAKCL 228 >UniRef50_A9V5A3 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V5A3_MONBE Length = 595 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 25/84 (29%), Positives = 42/84 (50%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 G+ PH D +V+I+Q G + WR+ E ++ DL + + + I + L PGD LY Sbjct: 221 GLAPHHDDVEVYILQLEGEKAWRLYEPIEPLAMSYSADLDREELAQPIAELVLRPGDFLY 280 Query: 181 IPPGFPHEGYALENAMNYSVGFRA 204 +P G HE + N + + + Sbjct: 281 LPRGTIHEASCVGNQHSTHITISS 304 >UniRef50_UPI00017929D5 PREDICTED: similar to Nucleolar protein 66 (hsNO66) n=1 Tax=Acyrthosiphon pisum RepID=UPI00017929D5 Length = 473 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 3/75 (4%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---DLLQVDPFEAIIDEELEPGD 177 G PH D + F++Q G + WRV + + P + Q + E I+D L PGD Sbjct: 201 GFAPHYDDIEAFVVQVDGEKHWRVYKPRSEFETLPRTSSRNFHQDEIGEPILDVILRPGD 260 Query: 178 ILYIPPGFPHEGYAL 192 LY+P G+ H+ L Sbjct: 261 FLYMPRGYIHQADTL 275 >UniRef50_A9V7G6 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V7G6_MONBE Length = 806 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 21/109 (19%) Query: 115 FSVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 S PG + PH D YDV ++Q GR+RWR+ + P DL + + Sbjct: 182 LSAPGARALQPHTDPYDVIVVQLAGRKRWRLCTGC---LNWPESDLTRFH----CQSLWM 234 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRE 222 EPGD+LY+P H AP+ E S Y L RE Sbjct: 235 EPGDVLYLPKAVIHVA-------------DAPHADETTSIHLTYSLDRE 270 >UniRef50_B0WMG3 Lysine-specific demethylase NO66 n=2 Tax=Culicini RepID=NO66_CULQU Length = 648 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 3/75 (4%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G PH D + F++Q GR+ W++ ++ P+ Q + I++ LEPGD Sbjct: 350 GFAPHYDDIEAFVLQVEGRKHWKLYSPRTASEVLARVSSPNFTQEEIGVPILEVTLEPGD 409 Query: 178 ILYIPPGFPHEGYAL 192 +LY P G H+ + Sbjct: 410 LLYFPRGIIHQASTV 424 >UniRef50_B0BQ44 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=B0BQ44_ACTPJ Length = 396 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 41/166 (24%), Positives = 70/166 (42%), Gaps = 28/166 (16%) Query: 125 HLDQYDVFIIQGTGRRRWRVGE-----------KLQMKQHCPHPDLLQVDPFEAIIDEEL 173 H D D+F IQ GR+RW + M ++ P+ D + ID L Sbjct: 150 HWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEYFPNKD-------DVYIDILL 202 Query: 174 EPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 E GDILY+P G+ H+ + E ++ +VG P T + +S + +++ E+ S + Sbjct: 203 EAGDILYLPRGWWHDPIPVGEETVHLAVGVFPPYTNDYLSWVTENIVKNEIARKSLSSSE 262 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 +L E + + IN ++F + F + R E Sbjct: 263 --KNDEIISLLSAE-------VADFINNKDNFNIFLESFYDKKRIE 299 >UniRef50_B7FXD3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FXD3_PHATR Length = 481 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 9/92 (9%) Query: 118 PGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL--------QVDPFEAI 168 PG P H D DVF+IQ G + W++ + + H + QV + Sbjct: 175 PGSQTVPAHADDRDVFVIQLVGCKAWKIYRNIPVPYPYSHEQVGKGELEVPGQVLDGPVL 234 Query: 169 IDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 D L PGD+LY+P G+ HE +A++ ++ V Sbjct: 235 TDRVLAPGDVLYMPRGYVHEAHAVDGGPSFHV 266 >UniRef50_A2SGT4 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SGT4_METPP Length = 353 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 15/105 (14%) Query: 118 PGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQ-----------HCPHPDLLQVDPF 165 P G V P H D + Q GR+RWR L+ + HPDL + F Sbjct: 238 PAGTVTPLHHDTLMLLHTQVVGRKRWRFISPLETPRLYNHDGVFSAIDLDHPDLDRYPAF 297 Query: 166 E--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG-FRAPNT 207 +++ LEPGD +++P G+ H+ +LE ++++S F PNT Sbjct: 298 RDVKVLEVVLEPGDTVFLPLGWWHQVASLEVSLSFSFSNFVFPNT 342 >UniRef50_A4X6V2 Cupin 4 family protein n=4 Tax=Micromonosporaceae RepID=A4X6V2_SALTO Length = 496 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 25/115 (21%) Query: 114 SFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDLLQVDPFEAIID 170 ++ P G G H D +DVF++Q G + WR+ H P PD L+ P+ D Sbjct: 132 AYLTPAGSQGFATHYDTHDVFVLQVDGGKHWRI--------HPPVLPDPLERQPWGGRAD 183 Query: 171 EE-------------LEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELI 211 E L PGD LY+P G+ H A E ++++ +VG RA L+ Sbjct: 184 EVVATATGAPALDVLLAPGDALYLPRGWLHSAAAQERSSLHLTVGVRALTRYTLV 238 >UniRef50_C6SNC5 Putative uncharacterized protein n=2 Tax=Neisseria meningitidis RepID=C6SNC5_NEIME Length = 387 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 46/208 (22%), Positives = 80/208 (38%), Gaps = 27/208 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ + +F E + K+P + K + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMEYKEFNENYLYKKPFIFKNALD--VSSISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM------RPFRELPDWRIDDLMIS 114 ES++ +G AV + + A ++ PF + +I + Sbjct: 59 IIPKEAYVESFNDVGRIRHRFNKTAVYQYLQDGATMVYNRIDNEPFVDSIAKQIAQFAQA 118 Query: 115 FSVPGGGVG--------PHLDQYDVFIIQGTGRRRWRVGE-------KLQMKQHCPHPDL 159 +V G + H D DVF +Q G++ W + +Q + PH Sbjct: 119 QTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGKKHWTISAPNFDMPLYMQQAKDMPHITP 178 Query: 160 LQVDPFEAIIDEELEPGDILYIPPGFPH 187 + E I LE GDILYIP G+ H Sbjct: 179 SKTVDMEVI----LEAGDILYIPRGWWH 202 >UniRef50_Q7K4H4 Lysine-specific demethylase NO66 n=2 Tax=melanogaster subgroup RepID=NO66_DROME Length = 653 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 3/76 (3%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G PH D + F+IQ GR+RW + E K + Q + IIDE L GD Sbjct: 347 GFAPHYDDIEAFVIQVEGRKRWLLYEPPKKADQLARISSGNYDQEQLGKPIIDEVLSAGD 406 Query: 178 ILYIPPGFPHEGYALE 193 +LY P G H+ E Sbjct: 407 VLYFPRGAVHQAITEE 422 >UniRef50_A0QI05 Cupin superfamily protein n=4 Tax=Mycobacterium avium complex (MAC) RepID=A0QI05_MYCA1 Length = 361 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 2/94 (2%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 G PH D +DV ++Q G + WRV ++ + Q + D + D L PGD+L Sbjct: 139 GFVPHYDPHDVLVLQIEGCKTWRVSDEPPVPPQQIQSRKGVGADGPASRTDVCLRPGDVL 198 Query: 180 YIPPGFPHEGYA-LENAMNYSVGFRAPNTRELIS 212 Y+P G H E +++ +VG AP L++ Sbjct: 199 YLPRGQVHSARTHSEPSVHLTVGLHAPTVLTLVT 232 >UniRef50_A1KTI5 Putative uncharacterized protein n=2 Tax=Neisseria meningitidis RepID=A1KTI5_NEIMF Length = 382 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 45/208 (21%), Positives = 81/208 (38%), Gaps = 27/208 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ + +F E + K+P + K + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMEYKEFNENYLYKKPFIFKNALD--VSSISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM------RPFRELPDWRIDDLMIS 114 ES++ +G+ + AV + + A ++ PF + +I + Sbjct: 59 IIPKEAYVESFNDVGKIRYRFNKTAVYQYLQDGATMVYNRIDNEPFVDSIAKQIAQFAQA 118 Query: 115 FSVPGGGVG--------PHLDQYDVFIIQGTGRRRWRVGEK-------LQMKQHCPHPDL 159 +V G + H D DVF +Q G + W + +Q + PH Sbjct: 119 QTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGTKHWTLSAANFDMPLYMQQAKDIPHI-- 176 Query: 160 LQVDPFEAIIDEELEPGDILYIPPGFPH 187 P ++ LE GDILYIP G+ H Sbjct: 177 --TPPTTVDMEVILEAGDILYIPRGWWH 202 >UniRef50_C5LMW3 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LMW3_9ALVE Length = 521 Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 50/208 (24%), Positives = 86/208 (41%), Gaps = 43/208 (20%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ------------D 58 +F E +W+K+P+ ++R P + D +G+ ++ + L H+ + Sbjct: 62 EFFEEYWEKKPLHVRR-------PTARDYYSGVWTKAMAEKTLTKHECRFGESVNFARVE 114 Query: 59 GKWQVSH----GPFESYDHLG---ETNWSLLVQAVNHWHEPTAALMRPFREL--PDWRID 109 +V H G + +++ E S + +P ALM W + Sbjct: 115 AGVKVMHNGEEGEKATVEYMQGQFEDGVSCQFMQPQRFSKPCHALMERLENYFGTLWGAN 174 Query: 110 DLMISFSVPGGGVG--PHLDQYDVFIIQGTGRRRWRVGEK------LQMKQHCPHPDLLQ 161 S+ P VG PH D +VF++Q G +RWR+ + L M+ + + Sbjct: 175 ----SYLTPANSVGFAPHYDDVEVFMLQTEGSKRWRLYDSPDDDGPLPMEYSRDYTEEEL 230 Query: 162 VDPFEAIIDEELEPGDILYIPPGFPHEG 189 P+ DE +E GD+LYIP G H G Sbjct: 231 SLPY---FDEVVEQGDLLYIPRGTVHFG 255 >UniRef50_P46327 Uncharacterized protein yxbC n=1 Tax=Bacillus subtilis RepID=YXBC_BACSU Length = 330 Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust. Identities = 35/142 (24%), Positives = 54/142 (38%), Gaps = 21/142 (14%) Query: 81 LLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRR 140 L + V W E A +R LP ++ + GGG H D Y I Q G + Sbjct: 105 LFIPQVRRWIEKLKAELR----LPAGTSSKAIVYAAKNGGGFKAHFDAYTNLIFQIQGEK 160 Query: 141 RWRVGEKLQMKQHCPHPDLLQV--------------DPFEAIIDEE---LEPGDILYIPP 183 W++ + + H DL + P E + D E L PG +LY+P Sbjct: 161 TWKLAKNENVSNPMQHYDLSEAPYYPDDLQSYWKGDPPKEDLPDAEIVNLTPGTMLYLPR 220 Query: 184 GFPHEGYALENAMNYSVGFRAP 205 G H + + + ++ F P Sbjct: 221 GLWHSTKSDQATLALNITFGQP 242 >UniRef50_A5GJ70 Putative uncharacterized protein SynWH7803_0559 n=1 Tax=Synechococcus sp. WH 7803 RepID=A5GJ70_SYNPW Length = 370 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 33/135 (24%), Positives = 60/135 (44%), Gaps = 5/135 (3%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKL-QMKQHCPHPDLLQVDPFEAIIDEE--LEPGD 177 + PH D +D+F +Q G+++W V +L + +L D ++ E ++ GD Sbjct: 108 ALSPHFDSHDIFALQVVGQKQWFVDSELSSLTTKSTFQPILSADQASSVDFREVVMDEGD 167 Query: 178 ILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQR-ELGGNYYSDPDVPP 235 ++Y+P G H + +M+ +VG E I+ + E G S P Sbjct: 168 VMYLPRGCVHHARTISCQSMHLTVGLYPLEWSEFIASAVEIAASAPEARGLRTSVPLGLK 227 Query: 236 RAHPADVLPQEMDKL 250 R HP+ + +D+L Sbjct: 228 RQHPSFYRQELLDRL 242 >UniRef50_B6KFH2 Putative uncharacterized protein n=4 Tax=Toxoplasma gondii RepID=B6KFH2_TOXGO Length = 508 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 14/82 (17%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV---------GEKLQMKQHCPHPDLLQVDPFEAIIDE 171 V H D DVF++Q G + W++ E++ K+ PD DP + +++ Sbjct: 236 AVKTHTDDQDVFLLQVWGSKAWKIWTPPQILPLTEEMLGKRE-AFPD----DPGKPLLEF 290 Query: 172 ELEPGDILYIPPGFPHEGYALE 193 L+ GDILYIP GFPH E Sbjct: 291 VLKEGDILYIPRGFPHAAVTTE 312 >UniRef50_Q54K96 Lysine-specific demethylase NO66 n=1 Tax=Dictyostelium discoideum RepID=NO66_DICDI Length = 514 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 26/82 (31%), Positives = 38/82 (46%), Gaps = 5/82 (6%) Query: 115 FSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---DLLQVDPFEAII 169 + P G G PH D DVFI+Q G++ WR+ + + P + Q + E Sbjct: 214 YLTPAGAQGFAPHYDDVDVFILQLEGKKEWRLYKPRDANEVLPKKSSENFTQEEIGEPYF 273 Query: 170 DEELEPGDILYIPPGFPHEGYA 191 LE GD+LY P G H+ + Sbjct: 274 TVTLEAGDLLYFPRGVIHQAVS 295 >UniRef50_D2VJG1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VJG1_NAEGR Length = 542 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. 
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 14/86 (16%) Query: 114 SFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC--------PHPDLLQVD 163 ++ P G G PH D + F+IQ G + W++ L+ +Q+ ++ + Sbjct: 230 AYLTPAGSQGFAPHYDDIEAFLIQLEGEKHWKIYRPLENQQYLDRFSSKNFTQEEVAGFE 289 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEG 189 FE + L+PGD+LY+P G H+ Sbjct: 290 CFEIL----LKPGDMLYVPKGVIHQA 311 >UniRef50_Q4D641 Putative uncharacterized protein n=1 Tax=Trypanosoma cruzi RepID=Q4D641_TRYCR Length = 476 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 19/119 (15%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELE------ 174 G PH D DVF++Q G + WR+ + + D+L E EEL Sbjct: 168 GFAPHYDDVDVFLLQLEGEKEWRLYDPPE------RVDVLSRHSSEDYNPEELPKPTQIF 221 Query: 175 ---PGDILYIPPGFPHEGYALENAMNYSVGFRAP--NT-RELISGFADYVLQRELGGNY 227 PGD+LY+P G H+G +A + V F A NT +L+ +V+++ L NY Sbjct: 222 RLFPGDVLYMPRGTVHQGRKYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEK-LAANY 279 >UniRef50_O01658 Lysine-specific demethylase NO66 n=3 Tax=Caenorhabditis RepID=NO66_CAEEL Length = 748 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 13/86 (15%) Query: 114 SFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPF------ 165 ++ P G G PH D+ D F++Q GR+ WRV ++ P L D F Sbjct: 429 TYLTPAGSSGFAPHWDEIDAFLLQVEGRKYWRVWAPESAEEELP---LESSDNFTEDDMK 485 Query: 166 --EAIIDEELEPGDILYIPPGFPHEG 189 E + + +E GD++YIP G+ H+ Sbjct: 486 GREPVFEGWIEKGDMIYIPRGYIHQA 511 >UniRef50_UPI000180B5EA PREDICTED: similar to Nucleolar protein 66 (hsNO66), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B5EA Length = 594 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. 
Identities = 41/196 (20%), Positives = 85/196 (43%), Gaps = 18/196 (9%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPI-SPDELAGLAMES----EVDSRLVSHQDGKWQVSHG 66 F + W+ RP+++ R + D + S E+ + E V+ + ++Q+G+ + + Sbjct: 161 FFKDIWESRPLLVLRHCPRYADGLFSTKEMNRILNECNVRYSVNLDVTTYQNGRRETHNI 220 Query: 67 PFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG- 120 +Y D+ + S+ ++ + +P L +E + ++ P G Sbjct: 221 DGRAYAPVVWDYF-KNGCSIRLKNPQAFSKPVWRLCATLQEFFKCMVG--ANTYLTPPGT 277 Query: 121 -GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++ W + K+ P + + + I + LE G Sbjct: 278 QGFAPHYDDIEAFVLQLEGKKEWTLYSPRSGKETLPRYSSGNFTADEIGDEIFTQTLEAG 337 Query: 177 DILYIPPGFPHEGYAL 192 ++LY P G+ H+ AL Sbjct: 338 NLLYFPRGYIHQAKAL 353 >UniRef50_B4L6Q5 Lysine-specific demethylase NO66 n=1 Tax=Drosophila mojavensis RepID=NO66_DROMO Length = 888 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 3/72 (4%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G PH D + F+IQ GR+RWR+ + + + Q + + + D LE GD Sbjct: 606 GFAPHYDDIEAFVIQVEGRKRWRLYAPPHQSDVLARTSSGNYKQEELGQPLFDAVLEAGD 665 Query: 178 ILYIPPGFPHEG 189 ILY P G H+ Sbjct: 666 ILYFPRGTVHQA 677 >UniRef50_C6W918 Cupin 4 family protein n=2 Tax=Actinomycetales RepID=C6W918_ACTMD Length = 395 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 26/96 (27%), Positives = 50/96 (52%), Gaps = 7/96 (7%) Query: 125 HLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 H D +D ++Q +GR+RWR+ M + P + +P + + LE G++LY+ Sbjct: 148 HWDDHDAIVVQVSGRKRWRIHGFTRVAPMVRDVELPPRPEGEPLDEFV---LEAGEVLYL 204 Query: 182 PPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFAD 216 P G H+ A+ E +++ ++G +L++ AD Sbjct: 205 PRGCWHDVSAVGEESLHLTIGVNRATGVDLVAWLAD 240 >UniRef50_UPI000192614C PREDICTED: similar to chromosome 14 open reading frame 169 n=1 Tax=Hydra magnipapillata RepID=UPI000192614C Length = 388 Score = 43.5 bits (101), Expect = 0.013, Method: Compositional matrix adjust. Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%) Query: 118 PGG-GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 PG G PH D + F+IQ G++ W++ ++ ++ + + E I+++ L Sbjct: 82 PGSQGFAPHYDDIEAFVIQLEGKKHWKLYPPRNTNEVLARYSSENMQEENLGEPILNKVL 141 Query: 174 EPGDILYIPPGFPHEGYALENA 195 E GD LY P G H+ LE++ Sbjct: 142 EAGDTLYFPRGVIHQASTLEDS 163 >UniRef50_A3UGV1 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UGV1_9RHOB Length = 387 Score = 42.7 bits (99), Expect = 0.023, Method: Compositional matrix adjust. Identities = 30/108 (27%), Positives = 52/108 (48%), Gaps = 19/108 (17%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWR-----VGEKLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 G H D +DV ++Q G +RWR VG + ++ P Q +P ++ L P Sbjct: 136 GFQTHYDNHDVLVLQVEGSKRWRLYDAPVGVPYRGERFTPG-RFAQTEPRAELV---LNP 191 Query: 176 GDILYIPPGFPHEGY---ALENAMNYSVGFRAPNTRELISGFADYVLQ 220 GD+LY+P G H+ + E +++ + G L +AD++L+ Sbjct: 192 GDVLYVPRGLMHDAVNEGSDEASLHITTGL-------LAKTWADFLLE 232 >UniRef50_A1SPZ0 Cupin 4 family protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SPZ0_NOCSJ Length = 403 Score = 42.4 bits (98), Expect = 0.025, Method: Compositional matrix adjust. 
Identities = 43/146 (29%), Positives = 62/146 (42%), Gaps = 27/146 (18%) Query: 115 FSVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 + PG G H D +DVF+ Q G +RW V H P D E ++ L Sbjct: 165 LTPPGAQGFAVHSDSHDVFVFQTAGSKRWEV--------HGP-------DGPEEVL---L 206 Query: 174 EPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTREL----ISGFADYVLQRELGGNYY 228 EPG +Y+P G PH A + +++ ++G R L ++G V L Y Sbjct: 207 EPGVSMYLPTGTPHAARAQDTVSLHVTLGINQLTWRGLVERTVAGALGEVADEHLPAGYL 266 Query: 229 SDPDVPPRAHP-ADVLPQEMDKLREM 253 DP A P AD L + D +R + Sbjct: 267 DDPAA--LAGPLADRLERLADAVRRL 290 >UniRef50_Q15JF4 VldL n=1 Tax=Streptomyces hygroscopicus subsp. limoneus RepID=Q15JF4_STRHY Length = 282 Score = 42.4 bits (98), Expect = 0.025, Method: Compositional matrix adjust. Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 15/91 (16%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ-------VDPFEAIIDEEL 173 G+G H D D F++Q G + W L QH D+++ V E +D+ + Sbjct: 108 GIGAHFDHSDNFVLQQNGTKEW----TLASSQHLDRQDVVRRMMNHPGVGAHELPVDDSV 163 Query: 174 E----PGDILYIPPGFPHEGYALENAMNYSV 200 PGD+LYIP + H G + ++++ S+ Sbjct: 164 RFTVGPGDLLYIPLLWLHSGVSRGDSLSVSL 194 >UniRef50_A9VEP7 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9VEP7_MONBE Length = 254 Score = 42.4 bits (98), Expect = 0.031, Method: Compositional matrix adjust. Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 18/98 (18%) Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 + PH D Y + I+Q G + W V QVD E+ + L PGD+L++ Sbjct: 67 INPHTDNYHILILQLQGEKHWLV---------------CQVDNAESCEEFTLYPGDVLFL 111 Query: 182 PPGFPHEGYALE-NAMNYSVGFRAPNTRELI--SGFAD 216 P H + +++ ++GF+ + +L+ +GF + Sbjct: 112 PRRAGHVAWTTNVTSVHATIGFQGVDCGDLVEAAGFTE 149 >UniRef50_Q31RB4 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31RB4_SYNE7 Length = 428 Score = 42.0 bits (97), Expect = 0.034, Method: Compositional matrix adjust. 
Identities = 46/214 (21%), Positives = 84/214 (39%), Gaps = 18/214 (8%) Query: 80 SLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGR 139 +L++ V+H L R+ +R + S G H D +DV I+Q G Sbjct: 85 TLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHYDTHDVLILQIEGE 144 Query: 140 RRWRVGEKLQMKQHCPHPDLLQVDPFEA-IIDEELEPGDILYIPPGFPHEGYALENA-MN 197 + W + + P ++ P E + + L PGD+LYIP G H A E A ++ Sbjct: 145 KEWLIYPETLPYPTADQPSYDRLPPEEPPYLQQVLSPGDLLYIPRGHWHYAIAQETASLH 204 Query: 198 YSVGFRAPNTRELISGFADYVLQR-------ELGGNYYSDPDVPPRAHPADVLPQEMDKL 250 ++G + ++ + + L G+ DP + R H ++ L Sbjct: 205 LTIGIHTATGLDWVNWLQQQLRDQPHWRQGLPLAGSCNFDP-LKLRGH--------LESL 255 Query: 251 REMMLELINQPEHFKQWFGEFISQSRHELDIAPP 284 R+ ++ + +P+ + Q + L I P Sbjct: 256 RDQLITYLQEPQAIDDYLQYLSWQDQPHLPIQLP 289 >UniRef50_D2PSR6 Cupin family protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PSR6_9ACTO Length = 408 Score = 42.0 bits (97), Expect = 0.035, Method: Compositional matrix adjust. Identities = 56/235 (23%), Positives = 91/235 (38%), Gaps = 34/235 (14%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESE---------------VDS 51 L+W +F E +W + PV++ RG P DE+ A+ + V+ Sbjct: 10 LDWAEFAELYWDRHPVLI-RGVRPV--PFRADEVFSAALRARCAEGGGRIAPNASVTVEQ 66 Query: 52 RLVSHQDGKWQV-SHGPFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRPFRE--- 102 + + +DG S G F+ Y D L ++L++ A + + P R F Sbjct: 67 TVQADRDGLLPAESDGCFDGYERRVGDRLDGRKYALIISAFHAFDFPLWDRERRFFAGLW 126 Query: 103 ----LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHP 157 LP + + VG H D++ F+ R+R R+ E+ +Q Sbjct: 127 DEVGLPLTSAITTLFHGNYDHSPVGVHKDRFATFMFGLRERKRMRLWTERPWTEQVGSVV 186 Query: 158 DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELIS 212 D + P + E+EPGD+LY P + H G SV P T +S Sbjct: 187 DYERFLPSSFAV--EVEPGDLLYWPASYFHVGENCGRTPATSVNIGVPRTEHRVS 239 >UniRef50_C7NJK3 Cupin superfamily protein n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NJK3_KYTSD Length = 414 Score = 42.0 bits (97), Expect = 0.037, Method: Compositional matrix adjust. 
Identities = 44/175 (25%), Positives = 73/175 (41%), Gaps = 27/175 (15%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI------------ 168 G H D +DVF++Q G + W + E + HP L+ P++ + Sbjct: 157 GFSAHYDVHDVFVLQVHGTKHWTLHEPV-----VAHP--LRDQPWDTVREAVAHRAAQDA 209 Query: 169 --IDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGG 225 ID L PGD+LY+P G H A E + + ++G + D V R L Sbjct: 210 PLIDAVLAPGDVLYLPRGTIHAAAAQGEISAHLTIGVHTWTPDHVTGAVLDAVRSR-LRD 268 Query: 226 NYYSDPDVPPRAHPAD--VLPQEMDKLREMMLELINQ--PEHFKQWFGEFISQSR 276 ++P A P D V+ +++LR + E I+ E ++F + +R Sbjct: 269 QPTVRANLPLGARPDDAAVVGPTLEQLRGALHEAIDSLDAEELARYFRPQVRVTR 323 >UniRef50_Q2T4J7 Unnamed protein product n=2 Tax=Burkholderia thailandensis RepID=Q2T4J7_BURTA Length = 397 Score = 42.0 bits (97), Expect = 0.038, Method: Compositional matrix adjust. Identities = 48/202 (23%), Positives = 76/202 (37%), Gaps = 29/202 (14%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVD----------------SRLVS 55 F+ER+W ++P++++R + + E + S D S + Sbjct: 18 FMERYWGRKPLIVRRQAPHLYACLPDSEEFAFLLHSLTDPERGWFSIVNGVARPPSDSLL 77 Query: 56 HQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 Q+G +S E Y + N SLL+ V H TA L R L Sbjct: 78 TQEGLLNLS----EVYAAYRDGN-SLLMNQVQRRHRETAMLCRRIESALSAHGIALARHI 132 Query: 116 SVPG-------GGVGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEA 167 G G H D +DV I+Q GR+ WR+ G + P + + Sbjct: 133 GANGYLSPPSSQGFNIHYDPHDVLILQIEGRKHWRLYGRHVAWPTQPPATPIPPEEAGSP 192 Query: 168 IIDEELEPGDILYIPPGFPHEG 189 + L PG+++YIP G H+ Sbjct: 193 RREFVLSPGELVYIPRGVLHDA 214 >UniRef50_D1VL61 Cupin 4 family protein n=1 Tax=Frankia sp. EuI1c RepID=D1VL61_9ACTO Length = 313 Score = 42.0 bits (97), Expect = 0.039, Method: Compositional matrix adjust. 
Identities = 52/215 (24%), Positives = 88/215 (40%), Gaps = 25/215 (11%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 H D +DV I+Q G + W V + +V EA+ L G++L+IP G Sbjct: 55 HWDDHDVLIVQLAGEKNWDVRGSTRSAPMFRDAVPNEVASSEAVWQGVLRAGEVLHIPRG 114 Query: 185 FPHEGYALEN----AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 + H+ +E+ +++ + GF + ++ AD ++EL + +D P A Sbjct: 115 YWHQATRVEHDDPVSLHLTFGFTRRTGVDWLTWIADQAREQEL---FRTDLTRSPAEREA 171 Query: 241 DVLPQEMDKLREMMLELINQ--PEHFKQWFGEFISQSRHELDIAPPEP-PYQPDEIYDAL 297 E +L++ +EL+ P F +RH AP P ++P+ + Sbjct: 172 -----ERARLQDAAIELVRSLPPAAFLTARERTRPPARH----APTLPSAHEPEVVVCVT 222 Query: 298 KQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA 332 + + R G V VYA G KI R A Sbjct: 223 EFAPHVERSEGQLV------VYAGGRKITVRDRAA 251 >UniRef50_Q2RW70 Cupin region n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW70_RHORT Length = 301 Score = 42.0 bits (97), Expect = 0.041, Method: Compositional matrix adjust. Identities = 36/147 (24%), Positives = 69/147 (46%), Gaps = 24/147 (16%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV-GEKLQ---MKQHCPHPDLLQVDPFEAIIDE-ELEP 175 G+ PH D D+ I+Q GR+ W++ G ++ K+ PD + DE ++ Sbjct: 137 GLPPHYDDRDLIIVQVAGRKHWKILGTPVEGPWRKRTMSVPD--------TVTDEFVMQG 188 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+L++P G H+ LE +++ P +L+ ++Q +DP++ Sbjct: 189 GDMLFVPAGLYHQCVPLEPSLHLGALITRPCGADLLK-----MVQPRW---ETTDPELAA 240 Query: 236 RAHPADV---LPQEMDKLREMMLELIN 259 R + D L Q+ +L+E ++ L+ Sbjct: 241 RLYVGDGETDLQQQDARLKEALIRLVQ 267 >UniRef50_D0MXW2 Nucleolar protein, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0MXW2_PHYIN Length = 506 Score = 41.6 bits (96), Expect = 0.042, Method: Compositional matrix adjust. 
Identities = 23/80 (28%), Positives = 38/80 (47%) Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 + PH D +VF++Q GR++W++ L DL + E ++ +E GD+LY Sbjct: 196 ALAPHHDDVEVFVVQTQGRKKWKLYHPLVELAGEHSSDLAEDQIGEPWMELTVEEGDLLY 255 Query: 181 IPPGFPHEGYALENAMNYSV 200 P G H+ E + V Sbjct: 256 FPRGVIHQACTDEKEFSTHV 275 >UniRef50_C0INQ3 Putative uncharacterized protein n=2 Tax=environmental samples RepID=C0INQ3_9BACT Length = 207 Score = 41.6 bits (96), Expect = 0.046, Method: Compositional matrix adjust. Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 1/63 (1%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 H D++++Q GE + +K+ D + +++ L+PGD+LY+PPG Sbjct: 128 HATSLDIWLVQAGSATAITGGEMVDLKKRADSDDAAG-SSIKGGVEQALKPGDVLYVPPG 186 Query: 185 FPH 187 PH Sbjct: 187 VPH 189 >UniRef50_UPI0000E4684D PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4684D Length = 511 Score = 41.2 bits (95), Expect = 0.058, Method: Compositional matrix adjust. Identities = 39/190 (20%), Positives = 78/190 (41%), Gaps = 12/190 (6%) Query: 12 FLERHWQKRPVVL---KRGFNNFIDPISPDELAGLAMESEV----DSRLVSHQDGKWQVS 64 F+E +W+K+P+V+ ++ + F + L GL E ++ D + ++ + Sbjct: 76 FMEEYWEKQPIVISNREKHRDYFQSLFTRTILEGLVAEKKISFIQDCNVCRYKGEVRESL 135 Query: 65 HG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 +G P + + L + ++ + E L+ + + Sbjct: 136 NGNGIVKPTKLKELLDQDKATIQFHQPQRFQESVWNLLEKLESYFGCLVGSNIYMTPKLS 195 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 G+ PH D +VF++Q G + WR+ + + DL Q + E D L+ GD++ Sbjct: 196 QGLAPHYDDVEVFVLQLEGEKHWRLYKPPTLLPRDYSRDLDQSELGEPTHDIVLKAGDLM 255 Query: 180 YIPPGFPHEG 189 Y P G H+ Sbjct: 256 YFPRGTVHQA 265 >UniRef50_Q091R4 Chromosome 14 open reading frame 169, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q091R4_STIAU Length = 355 Score = 40.8 bits (94), Expect = 0.084, Method: Compositional matrix adjust. 
 Identities = 29/78 (37%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV---GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177
           GV PH D  + FI+Q  G + W+V   G++L   +    P + +    E +++ EL PGD
Sbjct: 90  GVQPHFDTQENFILQVDGVKHWKVYGAGQELPRVEGSYTP-VARERLPELLLETELHPGD 148

Query: 178 ILYIPPGFPHEGYALENA 195
           +LY+P GF HE  A ++A
Sbjct: 149 MLYVPRGFVHEAEARDSA 166


Searching..................................................done

Results from round 2


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

UniRef50_P27431 Uncharacterized protein ycfD n=205 Tax=Gammaprot...  549  e-155
UniRef50_A0KI50 Cupin superfamily protein n=6 Tax=Gammaproteobac...  465  e-130
UniRef50_A1STI6 Cupin 4 family protein n=1 Tax=Psychromonas ingr...  446  e-124
UniRef50_C4LEX7 Cupin 4 family protein n=1 Tax=Tolumonas auensis...  444  e-123
UniRef50_A0YBW0 Transcription factor jumonji, jmjC n=1 Tax=marin...  437  e-121
UniRef50_Q5E4F9 Conserved protein n=16 Tax=Gammaproteobacteria R...  436  e-121
UniRef50_A6F8R4 Putative uncharacterized protein n=1 Tax=Moritel...  435  e-120
UniRef50_C9QJT9 Putative uncharacterized protein n=2 Tax=Vibrion...  433  e-120
UniRef50_A1RJT3 Cupin 4 family protein n=14 Tax=Alteromonadales ...  422  e-116
UniRef50_B8K5G8 Cupin superfamily protein n=1 Tax=Vibrio parahae...  418  e-115
UniRef50_Q48H58 YcfD protein n=22 Tax=Gammaproteobacteria RepID=...  413  e-114
UniRef50_B3PKY0 Putative uncharacterized protein n=2 Tax=Pseudom...  413  e-114
UniRef50_Q1QUR4 Cupin 4 n=1 Tax=Chromohalobacter salexigens DSM ...  411  e-113
UniRef50_A6F0B9 Transcription factor jumonji, jmjC n=1 Tax=Marin...  411  e-113
UniRef50_Q1NG82 Putative uncharacterized protein n=1 Tax=Sphingo...  410  e-113
UniRef50_Q2S4H4 Cupin superfamily protein n=3 Tax=Bacteria RepID...  401  e-110
UniRef50_Q2Y9X5 Cupin region n=9 Tax=root RepID=Q2Y9X5_NITMU  400  e-110
UniRef50_Q3JQS3 Cupin superfamily protein family n=25 Tax=Burkho...  400  e-110
UniRef50_C4K8V5 Putative uncharacterized protein n=1 Tax=Candida...  398  e-109
UniRef50_D1UI98 Cupin 4 family protein n=6 Tax=Burkholderia RepI...  397  e-109
UniRef50_Q2BJ43 Putative uncharacterized protein n=1 Tax=Neptuni...  396  e-109
UniRef50_C5A9S6 Cupin superfamily protein family protein n=49 Ta...  396  e-109
UniRef50_C6WYD1 Cupin 4 family protein n=1 Tax=Methylotenera mob...  395  e-108
UniRef50_A3QD76 Cupin 4 family protein n=19 Tax=Shewanella RepID...  395  e-108
UniRef50_B4RRX0 Putative enzyme with RmlC-like domain n=2 Tax=Al...  391  e-107
UniRef50_A6SXH9 Uncharacterized conserved protein n=2 Tax=Oxalob...  385  e-105
UniRef50_D2UDU1 Putative uncharacterized protein n=1 Tax=Xanthom...  384  e-105
UniRef50_Q2SJM1 Uncharacterized conserved protein n=3 Tax=Gammap...  379  e-104
UniRef50_Q1N4P0 Transcription factor jumonji, jmjC n=1 Tax=Berma...  378  e-103
UniRef50_C1DCJ3 Cupin region n=1 Tax=Laribacter hongkongensis HL...  378  e-103
UniRef50_Q15T89 Cupin 4 n=1 Tax=Pseudoalteromonas atlantica T6c ...  376  e-103
UniRef50_C0N3X6 Cupin superfamily protein n=1 Tax=Methylophaga t...  374  e-102
UniRef50_UPI0000E0F5AA putative enzyme with RmlC-like domain n=1...  372  e-101
UniRef50_C3M8B3 Putative uncharacterized protein n=3 Tax=Candida...  371  e-101
UniRef50_B8GSM7 Cupin 4 family protein n=1 Tax=Thioalkalivibrio ...  371  e-101
UniRef50_A1K4G1 Putative uncharacterized protein n=1 Tax=Azoarcu...  371  e-101
UniRef50_D1RFR4 Cupin superfamily protein n=1 Tax=Legionella lon...  369  e-101
UniRef50_Q5QZ10 Cupin superfamily protein n=2 Tax=Idiomarina Rep...  369  e-101
UniRef50_D0L0L5 Cupin 4 family protein n=1 Tax=Halothiobacillus ...  369  e-100
UniRef50_C5BU83 Cupin 4 family protein n=1 Tax=Teredinibacter tu...  367  e-100
UniRef50_A6W0E5 Cupin 4 family protein n=2 Tax=Marinomonas RepID...  363  6e-99
UniRef50_A6GQ27 Putative uncharacterized protein n=1 Tax=Limnoba...  361  3e-98
UniRef50_B2SQ70 Transcription factor jumonji, JmjC n=19 Tax=Xant...  361  3e-98
UniRef50_B7RUZ0 Cupin superfamily protein n=1 Tax=marine gamma p...  361  3e-98
UniRef50_Q5WVF0 Putative uncharacterized protein n=4 Tax=Legione...  360  4e-98
UniRef50_Q21K45 Cupin 4 n=1 Tax=Saccharophagus degradans 2-40 Re...  357  3e-97
UniRef50_Q31GJ6 Cupin superfamily protein n=2 Tax=Gammaproteobac...  357  3e-97
UniRef50_Q7NS46 Putative uncharacterized protein n=1 Tax=Chromob...  354  2e-96
UniRef50_A4BDP0 Putative uncharacterized protein n=1 Tax=Reineke...  349  1e-94
UniRef50_A1VLH8 Cupin 4 family protein n=6 Tax=Burkholderiales R...  345  1e-93
UniRef50_B7H3P1 Cupin superfamily protein n=16 Tax=Acinetobacter...  345  2e-93
UniRef50_C7I1M3 Cupin 4 family protein n=1 Tax=Thiomonas interme...  341  3e-92
UniRef50_B8KRM1 Cupin 4 family protein n=1 Tax=gamma proteobacte...  339  7e-92
UniRef50_A0Z1Z1 Putative uncharacterized protein n=1 Tax=marine ...  339  7e-92
UniRef50_B8KGD9 Cupin 4 family protein n=2 Tax=unclassified Gamm...  338  2e-91
UniRef50_Q0VQ28 Putative uncharacterized protein n=1 Tax=Alcaniv...  335  2e-90
UniRef50_C0VP99 Cupin 4 n=2 Tax=Acinetobacter RepID=C0VP99_9GAMM  332  1e-89
UniRef50_B1Y837 Cupin 4 family protein n=3 Tax=cellular organism...  331  2e-89
UniRef50_A4SX54 Cupin 4 family protein n=2 Tax=Polynucleobacter ...  328  3e-88
UniRef50_C7RB22 Cupin 4 family protein n=1 Tax=Kangiella koreens...  327  5e-88
UniRef50_B4X170 Cupin superfamily protein n=1 Tax=Alcanivorax sp...  325  2e-87
UniRef50_D1KE35 Putative uncharacterized protein n=1 Tax=uncultu...  317  4e-85
UniRef50_B9ZR02 Cupin 4 family protein n=1 Tax=Thioalkalivibrio ...  314  4e-84
UniRef50_C1E292 Predicted protein n=2 Tax=Micromonas RepID=C1E29...  302  1e-80
UniRef50_A4S2B8 Predicted protein n=2 Tax=Ostreococcus RepID=A4S...  301  2e-80
UniRef50_P44683 Uncharacterized protein HI0396 n=36 Tax=Gammapro...  297  6e-79
UniRef50_UPI0000E87D6F hypothetical protein MB2181_02235 n=1 Tax...  260  7e-68
UniRef50_A2W941 Transcription factor jumonji n=1 Tax=Burkholderi...  260  7e-68
UniRef50_B6BWI1 Putative cytoplasmic protein n=1 Tax=beta proteo...  222  1e-56
UniRef50_B7FZB3 Predicted protein n=1 Tax=Phaeodactylum tricornu...  215  2e-54
UniRef50_B5W5P2 Cupin 4 family protein n=2 Tax=Arthrospira RepID...  199  2e-49
UniRef50_C8XBP6 Cupin 4 family protein n=1 Tax=Nakamurella multi...  183  1e-44
UniRef50_Q091R3 Mina protein n=1 Tax=Stigmatella aurantiaca DW4/...  183  1e-44
UniRef50_A4U3D3 MYC induced nuclear antigen n=1 Tax=Magnetospiri...  181  3e-44
UniRef50_A3Q8B6 Cupin 4 family protein n=4 Tax=Mycobacterium Rep...  179  1e-43
UniRef50_UPI000192663F PREDICTED: similar to Myc-induced nuclear...  178  3e-43
UniRef50_Q7N884 Similar to unknown protein n=1 Tax=Photorhabdus ...  177  5e-43
UniRef50_D0L9V4 Cupin 4 family protein n=1 Tax=Gordonia bronchia...  177  8e-43
UniRef50_Q28VG0 Cupin 4 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28VG0...  175  3e-42
UniRef50_B4B491 Cupin 4 family protein n=1 Tax=Cyanothece sp. PC...  174  5e-42
UniRef50_B4GUZ2 Lysine-specific demethylase NO66 n=2 Tax=Drosoph...  173  1e-41
UniRef50_UPI000186D1B6 conserved hypothetical protein n=1 Tax=Pe...  172  2e-41
UniRef50_B7PMB0 MYC-induced nuclear antigen, putative (Fragment)...  172  2e-41
UniRef50_B5DUH6 Lysine-specific demethylase NO66 n=2 Tax=Drosoph...  172  2e-41
UniRef50_Q10ZZ1 Cupin 4 n=1 Tax=Trichodesmium erythraeum IMS101 ...  172  2e-41
UniRef50_B8C536 Putative uncharacterized protein (Fragment) n=1 ...  172  3e-41
UniRef50_D2A374 Putative uncharacterized protein GLEAN_07936 n=1...  170  6e-41
UniRef50_UPI0000E45D23 PREDICTED: hypothetical protein n=2 Tax=S...  168  2e-40
UniRef50_D0NRY0 Nucleolar protein, putative n=2 Tax=Phytophthora...  167  6e-40
UniRef50_B0CEG8 Cupin 4 family protein, putative n=1 Tax=Acaryoc...  166  1e-39
UniRef50_A9TET4 Predicted protein n=1 Tax=Physcomitrella patens ...  164  4e-39
UniRef50_A0YJB4 Putative uncharacterized protein n=1 Tax=Lyngbya...  164  4e-39
UniRef50_Q849M1 Putative uncharacterized protein pSV2.19c n=3 Ta...  164  4e-39
UniRef50_A1R1T1 Putative cupin superfamily protein n=2 Tax=Micro...  160  6e-38
UniRef50_B4Q068 Lysine-specific demethylase NO66 n=5 Tax=Sophoph...  160  1e-37
UniRef50_B4M7P8 Lysine-specific demethylase NO66 n=3 Tax=Drosoph...  160  1e-37
UniRef50_B4V6J8 Putative uncharacterized protein n=1 Tax=Strepto...  157  5e-37
UniRef50_B4JMQ2 Lysine-specific demethylase NO66 n=1 Tax=Drosoph...  157  6e-37
UniRef50_A3M7T2 Putative uncharacterized protein n=2 Tax=Acineto...  156  1e-36
UniRef50_B4R4H1 Lysine-specific demethylase NO66 n=2 Tax=melanog...  154  6e-36
UniRef50_A6W7N8 Cupin 4 family protein n=1 Tax=Kineococcus radio...  153  1e-35
UniRef50_A9UZN8 Predicted protein n=1 Tax=Monosiga brevicollis R...  153  1e-35
UniRef50_B1FB07 Cupin 4 family protein n=1 Tax=Burkholderia ambi...  152  2e-35
UniRef50_C1EHB5 Predicted protein (Fragment) n=2 Tax=Micromonas ...  150  9e-35
UniRef50_Q5ZMM1 Lysine-specific demethylase NO66 n=3 Tax=Eumetaz...  149  1e-34
UniRef50_D2SA69 Cupin 4 family protein n=2 Tax=Actinomycetales R...  149  2e-34
UniRef50_UPI00017929D5 PREDICTED: similar to Nucleolar protein 6...  145  2e-33
UniRef50_A9C261 Cupin 4 family protein n=1 Tax=Delftia acidovora...  144  6e-33
UniRef50_A8QFQ3 Lysine-specific demethylase NO66 n=2 Tax=Brugia ...  136  8e-31
UniRef50_Q1D4G2 Cupin family protein n=2 Tax=Myxococcus xanthus ...  136  1e-30
UniRef50_Q1DFZ7 Cupin family protein n=1 Tax=Myxococcus xanthus ...  136  2e-30
UniRef50_Q9H6W3 Lysine-specific demethylase NO66 n=17 Tax=Eumeta...  135  3e-30
UniRef50_A5PK74 Lysine-specific demethylase NO66 n=1 Tax=Bos tau...  131  3e-29
UniRef50_B8BSJ2 Predicted protein n=1 Tax=Thalassiosira pseudona...  122  2e-26
UniRef50_B7G6P1 Predicted protein (Fragment) n=1 Tax=Phaeodactyl...  120  1e-25
UniRef50_A9V5A3 Predicted protein n=1 Tax=Monosiga brevicollis R...  119  1e-25
UniRef50_Q016L9 [S] KOG3706 Uncharacterized conserved protein n=...  118  4e-25
UniRef50_A4RZ92 Predicted protein n=1 Tax=Ostreococcus lucimarin...  112  3e-23
UniRef50_D1TSY0 Conserved domain protein n=19 Tax=Yersinia pesti...  111  5e-23

Sequences not found previously or not previously below threshold:

UniRef50_B4L6Q5 Lysine-specific demethylase NO66 n=1 Tax=Drosoph...  165  3e-39
UniRef50_UPI0000E4684D PREDICTED: hypothetical protein n=1 Tax=S...  163  8e-39
UniRef50_Q31RB4 Putative uncharacterized protein n=2 Tax=Synecho...  162  2e-38
UniRef50_B0WMG3 Lysine-specific demethylase NO66 n=2 Tax=Culicin...  153  8e-36
UniRef50_Q7K4H4 Lysine-specific demethylase NO66 n=2 Tax=melanog...  148  3e-34
UniRef50_A3UGV1 Putative uncharacterized protein n=1 Tax=Oceanic...  147  7e-34
UniRef50_Q54K96 Lysine-specific demethylase NO66 n=1 Tax=Dictyos...  146  2e-33
UniRef50_A7SRW5 Predicted protein n=1 Tax=Nematostella vectensis...  145  2e-33
UniRef50_UPI000180B5EA PREDICTED: similar to Nucleolar protein 6...  144  4e-33
UniRef50_B0BQ44 Putative uncharacterized protein n=5 Tax=Pasteur...  144  4e-33
UniRef50_Q6DDJ7 Mina-prov protein n=2 Tax=Xenopus RepID=Q6DDJ7_X...  142  2e-32
UniRef50_B3S582 Putative uncharacterized protein n=1 Tax=Trichop...  140  7e-32
UniRef50_C7NJK3 Cupin superfamily protein n=1 Tax=Kytococcus sed...  140  8e-32
UniRef50_A4X6V2 Cupin 4 family protein n=4 Tax=Micromonosporacea...  138  3e-31
UniRef50_C5LMW3 Putative uncharacterized protein n=1 Tax=Perkins...  137  6e-31
UniRef50_C3XRY1 Lysine-specific demethylase NO66 n=1 Tax=Branchi...  137  6e-31
UniRef50_C9N2N9 Cupin 4 family protein n=4 Tax=Streptomyces RepI...  136  1e-30
UniRef50_C6SNC5 Putative uncharacterized protein n=2 Tax=Neisser...  135  2e-30
UniRef50_Q2T4J7 Unnamed protein product n=2 Tax=Burkholderia tha...  135  3e-30
UniRef50_Q8IUF8 MYC-induced nuclear antigen n=25 Tax=Amniota Rep...  134  5e-30
UniRef50_D0MXW2 Nucleolar protein, putative n=1 Tax=Phytophthora...  134  7e-30
UniRef50_C6W918 Cupin 4 family protein n=2 Tax=Actinomycetales R...  133  8e-30
UniRef50_A1KTI5 Putative uncharacterized protein n=2 Tax=Neisser...  130  7e-29
UniRef50_Q4D641 Putative uncharacterized protein n=1 Tax=Trypano...  130  9e-29
UniRef50_A0QI05 Cupin superfamily protein n=4 Tax=Mycobacterium ...  129  2e-28
UniRef50_D2VJG1 Predicted protein n=1 Tax=Naegleria gruberi RepI...  129  2e-28
UniRef50_C3ZLE4 Putative uncharacterized protein n=1 Tax=Branchi...  127  7e-28
UniRef50_O01658 Lysine-specific demethylase NO66 n=3 Tax=Caenorh...  126  1e-27
UniRef50_UPI0000523E0E PREDICTED: similar to MYC induced nuclear...  125  2e-27
UniRef50_C6SMA2 Myc induced nuclear antigen n=24 Tax=Neisseria R...  125  2e-27
UniRef50_Q4Q6P0 Putative uncharacterized protein n=3 Tax=Leishma...  125  3e-27
UniRef50_UPI000192614C PREDICTED: similar to chromosome 14 open ...  124  4e-27
UniRef50_Q0ALX3 Cupin 4 family protein n=1 Tax=Maricaulis maris ...  123  1e-26
UniRef50_B9BV10 Cupin superfamily protein n=5 Tax=Proteobacteria...  122  2e-26
UniRef50_Q7T3G6 MYC induced nuclear antigen-like n=6 Tax=Euteleo...  121  3e-26
UniRef50_Q47NS9 Putative uncharacterized protein n=1 Tax=Thermob...  113  1e-23
UniRef50_Q2JG11 Cupin 4 n=3 Tax=Actinomycetales RepID=Q2JG11_FRASC  108  3e-22
UniRef50_A1SPZ0 Cupin 4 family protein n=1 Tax=Nocardioides sp. ...  107  5e-22
UniRef50_P46327 Uncharacterized protein yxbC n=1 Tax=Bacillus su...  105  3e-21
UniRef50_Q091R4 Chromosome 14 open reading frame 169, putative n...  101  4e-20
UniRef50_UPI0001B4BFC9 putative cupin superfamily protein n=1 Ta...  101  4e-20
UniRef50_A5GJ70 Putative uncharacterized protein SynWH7803_0559 ...  101  5e-20
UniRef50_B0KHI4 Cupin 4 family protein n=1 Tax=Pseudomonas putid...  100  7e-20
UniRef50_D2PSR6 Cupin family protein n=1 Tax=Kribbella flavida D...  99  1e-19
UniRef50_D1VL61 Cupin 4 family protein n=1 Tax=Frankia sp. EuI1c...  100  2e-19
UniRef50_C9NEK8 Cupin 4 family protein n=1 Tax=Streptomyces flav...  100  2e-19
UniRef50_B7FXD3 Predicted protein n=1 Tax=Phaeodactylum tricornu...  100  2e-19
UniRef50_B6KFH2 Putative uncharacterized protein n=4 Tax=Toxopla...  100  2e-19
UniRef50_A8TXW2 Putative uncharacterized protein n=1 Tax=alpha p...  99  3e-19
UniRef50_A9UW44 Predicted protein n=1 Tax=Monosiga brevicollis R...  98  6e-19
UniRef50_B1FKZ3 Cupin 4 family protein n=1 Tax=Burkholderia ambi...  97  9e-19
UniRef50_C1YJ55 Cupin superfamily protein n=1 Tax=Nocardiopsis d...  96  2e-18
UniRef50_C9Z2L7 Putative uncharacterized protein n=2 Tax=Strepto...  95  3e-18
UniRef50_D1WSH6 Cupin family protein n=2 Tax=Streptomyces RepID=...  95  3e-18
UniRef50_C4DQG4 Putative uncharacterized protein n=1 Tax=Stackeb...  91  4e-17
UniRef50_C6XMP3 Cupin 4 family protein n=1 Tax=Hirschia baltica ...  89  3e-16
UniRef50_D1H9M4 Whole genome shotgun sequence of line PN40024, s...  86  1e-15
UniRef50_B8BVR1 Predicted protein n=1 Tax=Thalassiosira pseudona...  86  2e-15
UniRef50_Q15JF4 VldL n=1 Tax=Streptomyces hygroscopicus subsp. l...  86  2e-15
UniRef50_Q6MH74 Putative uncharacterized protein yxbC n=1 Tax=Bd...  85  5e-15
UniRef50_C7Q411 Cupin 4 family protein n=1 Tax=Catenulispora aci...  82  3e-14
UniRef50_B9IN14 Predicted protein n=2 Tax=rosids RepID=B9IN14_POPTR  81  4e-14
UniRef50_Q8S3P4 OSJNBa0011F23.16 protein n=4 Tax=Oryza sativa Re...  81  9e-14
UniRef50_C7J1Y3 Os04g0659150 protein n=4 Tax=Poaceae RepID=C7J1Y...  79  3e-13
UniRef50_A9TBQ2 Predicted protein n=1 Tax=Physcomitrella patens ...  77  8e-13
UniRef50_UPI0001BCFC5A Cupin 4 family protein n=2 Tax=Mannheimia...  77  9e-13
UniRef50_C4DE44 Putative uncharacterized protein n=1 Tax=Stackeb...  76  2e-12
UniRef50_Q2RW70 Cupin region n=1 Tax=Rhodospirillum rubrum ATCC ...  76  3e-12
UniRef50_Q6MPD0 Putative RNA methylase n=1 Tax=Bdellovibrio bact...  75  3e-12
UniRef50_A9V3F7 Predicted protein n=1 Tax=Monosiga brevicollis R...  75  5e-12
UniRef50_B5Y4L3 Predicted protein n=1 Tax=Phaeodactylum tricornu...  73  1e-11
UniRef50_UPI000180C0C1 PREDICTED: similar to reserved n=1 Tax=Ci...  73  1e-11
UniRef50_A9V2P6 Predicted protein n=3 Tax=Monosiga brevicollis R...  73  1e-11
UniRef50_A9VDC2 Predicted protein n=1 Tax=Monosiga brevicollis R...  73  2e-11
UniRef50_A9V7G6 Predicted protein n=1 Tax=Monosiga brevicollis R...  72  3e-11
UniRef50_A9V7T0 Predicted protein n=1 Tax=Monosiga brevicollis R...  72  4e-11
UniRef50_C5AHL8 JmjC domain protein n=2 Tax=Burkholderia RepID=C...  71  4e-11
UniRef50_A9VDD4 Predicted protein n=1 Tax=Monosiga brevicollis R...  71  5e-11
UniRef50_A9V0X5 Predicted protein n=2 Tax=Monosiga brevicollis R...  71  6e-11
UniRef50_A9V428 Predicted protein n=2 Tax=Monosiga brevicollis R...  70  1e-10
UniRef50_A9VEP7 Predicted protein (Fragment) n=1 Tax=Monosiga br...  70  1e-10
UniRef50_A9V7C3 Predicted protein n=3 Tax=Monosiga brevicollis R...  70  1e-10
UniRef50_UPI0000D57503 PREDICTED: similar to JmjC domain-contain...  70  2e-10
UniRef50_C5KSU0 Putative uncharacterized protein n=2 Tax=Perkins...  69  2e-10
UniRef50_A9SQV0 Predicted protein n=1 Tax=Physcomitrella patens ...  68  5e-10
UniRef50_UPI0001927155 PREDICTED: similar to jumonji domain cont...  68  6e-10
UniRef50_A9V427 Predicted protein n=1 Tax=Monosiga brevicollis R...  68  6e-10
UniRef50_A9GPS9 Transcription factor jumonji (JmjC) domain-conta...  68  6e-10
UniRef50_A9V0E4 Predicted protein n=1 Tax=Monosiga brevicollis R...  67  8e-10
UniRef50_Q8RWR1 AT3g20810/MOE17_10 n=9 Tax=Viridiplantae RepID=Q...  66  2e-09
UniRef50_D0LTG4 Transcription factor jumonji n=1 Tax=Haliangium ...  66  2e-09
UniRef50_B8CD01 Predicted protein n=1 Tax=Thalassiosira pseudona...  66  2e-09
UniRef50_B8BQ14 Predicted protein n=1 Tax=Thalassiosira pseudona...  64  7e-09
UniRef50_A9V590 Predicted protein n=1 Tax=Monosiga brevicollis R...  64  7e-09
UniRef50_Q0QZH9 Gp49 n=1 Tax=Synechococcus phage syn9 RepID=Q0QZ...  64  8e-09
UniRef50_Q1D441 JmjC domain protein n=1 Tax=Myxococcus xanthus D...  64  8e-09
UniRef50_C3Z534 Putative uncharacterized protein n=1 Tax=Branchi...  64  9e-09
UniRef50_B3SDY7 Putative uncharacterized protein n=1 Tax=Trichop...  64  9e-09
UniRef50_UPI0000DB7045 PREDICTED: similar to Hspb associated pro...  64  9e-09
UniRef50_Q2SID9 Uncharacterized conserved protein n=1 Tax=Hahell...  63  1e-08
UniRef50_UPI00015B5EA6 PREDICTED: similar to Jumonji domain cont...  63  1e-08
UniRef50_Q96EW2 HSPB1-associated protein 1 n=23 Tax=Amniota RepI...  63  2e-08
UniRef50_Q6AXL5 HSPB1-associated protein 1 homolog n=3 Tax=Clupe...  63  2e-08
UniRef50_UPI00015B5A68 PREDICTED: hypothetical protein n=1 Tax=N...  63  2e-08
UniRef50_A7RV46 Predicted protein n=2 Tax=Eumetazoa RepID=A7RV46...  62  3e-08
UniRef50_Q7UIC0 Probable protein associating with small stress p...  62  3e-08
UniRef50_A6FWV6 JmjC domain protein n=1 Tax=Plesiocystis pacific...  62  3e-08
UniRef50_Q4DVQ0 Putative uncharacterized protein n=2 Tax=Trypano...  62  4e-08
UniRef50_A4X242 Transcription factor jumonji, jmjC domain protei...  61  5e-08
UniRef50_Q8N371 JmjC domain-containing protein 5 n=17 Tax=Chorda...  61  5e-08
UniRef50_B5Y3G4 Predicted protein n=1 Tax=Phaeodactylum tricornu...  61  5e-08
UniRef50_Q095N0 Putative uncharacterized protein n=1 Tax=Stigmat...  61  6e-08
UniRef50_A2SGT4 Putative uncharacterized protein n=1 Tax=Methyli...  61  6e-08
UniRef50_A4RRR9 Predicted protein n=1 Tax=Ostreococcus lucimarin...  61  6e-08
UniRef50_A4S2N6 Predicted protein n=3 Tax=Ostreococcus RepID=A4S...  61  6e-08
UniRef50_D2VHH6 Predicted protein n=1 Tax=Naegleria gruberi RepI...  61  7e-08
UniRef50_UPI0001927319 PREDICTED: similar to predicted protein n...  61  8e-08
UniRef50_A9UPY7 Predicted protein n=1 Tax=Monosiga brevicollis R...  60  1e-07
UniRef50_D0MSD2 Putative uncharacterized protein n=1 Tax=Phytoph...  60  2e-07
UniRef50_UPI0001B56C67 cupin 4 family protein n=1 Tax=Streptomyc...  60  2e-07
UniRef50_A4RRC2 Predicted protein n=2 Tax=Ostreococcus RepID=A4R...  59  2e-07
UniRef50_C5BK86 JmjC domain protein n=1 Tax=Teredinibacter turne...  59  3e-07
UniRef50_B5S2S3 Putative uncharacterized protein n=1 Tax=Ralston...  58  4e-07
UniRef50_Q0J0P8 Os09g0489200 protein n=10 Tax=Poaceae RepID=Q0J0...  58  5e-07
UniRef50_Q5YSD2 Putative uncharacterized protein n=1 Tax=Nocardi...  58  5e-07
UniRef50_B3RNN1 Putative uncharacterized protein n=1 Tax=Trichop...  58  7e-07
UniRef50_B5W056 Transcription factor jumonji n=2 Tax=Arthrospira...  58  7e-07
UniRef50_UPI0000D567FA PREDICTED: similar to reserved n=1 Tax=Tr...  58  8e-07
UniRef50_C1FDI9 JmjC transcription factor domain-containing prot...  57  1e-06
UniRef50_UPI000186E7C4 hspbap1/pass1, putative n=1 Tax=Pediculus...  57  1e-06
UniRef50_D2V3K7 Predicted protein n=1 Tax=Naegleria gruberi RepI...  57  1e-06

>UniRef50_P27431 Uncharacterized protein ycfD n=205 Tax=Gammaproteobacteria RepID=YCFD_ECOLI
          Length = 373

 Score = 549 bits (1414), Expect = e-155, Method: Composition-based stats.
 Identities = 373/373 (100%), Positives = 373/373 (100%)

Query: 1   MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60
           MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK
Sbjct: 1   MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60

Query: 61  WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120
           WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG
Sbjct: 61  WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120

Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180
           GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180

Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240
           IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA
Sbjct: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240

Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300
           DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG
Sbjct: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300

Query: 301
EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML Sbjct: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360 Query: 361 AALVNSGYWFFEG 373 AALVNSGYWFFEG Sbjct: 361 AALVNSGYWFFEG 373 >UniRef50_A0KI50 Cupin superfamily protein n=6 Tax=Gammaproteobacteria RepID=A0KI50_AERHH Length = 376 Score = 465 bits (1198), Expect = e-130, Method: Composition-based stats. Identities = 187/374 (50%), Positives = 250/374 (66%), Gaps = 5/374 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL L+ FLE +WQKRP+++K GF +F DPISPDELAGLAME ++SRLV+ + KW+ Sbjct: 2 YQLNLDIAHFLEHYWQKRPLLIKGGFTDFQDPISPDELAGLAMEEVIESRLVTRFNNKWE 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 +HGPFESYDHLGE NW++LVQA NHW L PF+ +P WR DD+M+SFS P GGV Sbjct: 62 AAHGPFESYDHLGEENWTVLVQACNHWAPEVNELALPFQFIPGWRFDDVMVSFSTPHGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVFI QG G+R WRVG+ + + H LL +PFEAIID +EPGDILYIP Sbjct: 122 GPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAALLHCEPFEAIIDVIMEPGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPHEGYA+E ++N+SVGFRAP+ + LIS FAD+++ E+ Y D D+ PRA ++ Sbjct: 182 PGFPHEGYAIEPSLNFSVGFRAPDAKALISSFADHLIDNEVRTERYGDADLKPRARHGEI 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 P E+ +LRE+M + ++ FK+WFG IS+++H+LD+ P EP Y +E+ D L QGE Sbjct: 242 QPHELHRLRELMQQALDDETLFKEWFGTMISEAKHDLDVNPVEPDYSAEEVADLLTQGEP 301 Query: 303 LVRLGGLRVLRI---GDDVYANGEKID--SPHRPALDALASNIALTAENFGDALEDPSFL 357 +++ GLR + Y +GE S A+ L +T + + + FL Sbjct: 302 AIKVPGLRTVWFSGESQQCYIDGEAWTLQSEDAAAISLLCDKDMVTQADMVELADQAGFL 361 Query: 358 AMLAALVNSGYWFF 371 +L LVN GYWFF Sbjct: 362 QLLTRLVNRGYWFF 375 >UniRef50_A1STI6 Cupin 4 family protein n=1 Tax=Psychromonas ingrahamii 37 RepID=A1STI6_PSYIN Length = 375 Score = 446 bits (1148), Expect = e-124, Method: Composition-based stats. 
Identities = 172/372 (46%), Positives = 239/372 (64%), Gaps = 4/372 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 ++L L+ DFL+ +WQK+P V+K+GF +F DPI PDE+AGLAME E++SRL+ +DG+WQ Sbjct: 2 FELNLDINDFLDTYWQKKPTVIKQGFVDFEDPIMPDEMAGLAMEEELESRLIYQEDGEWQ 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF S++ L +LLVQAV+HWH L+RPFR LP+WRIDDLMIS+S P GGV Sbjct: 62 ALSGPFTSFERLENDGATLLVQAVDHWHPDAQELIRPFRFLPNWRIDDLMISYSTPKGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVFIIQG G+R WRVG+K + + H L + F+AIID ELEPGDILYIP Sbjct: 122 GPHIDNYDVFIIQGLGKRHWRVGDKGALPEFAAHDALKHCESFDAIIDVELEPGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 G+PHEGY++E ++NYS+GFRAP+ +L+S F DY + Y+D ++ R P + Sbjct: 182 AGYPHEGYSIETSLNYSIGFRAPDQNDLLSSFTDYCIDTNPAPERYADKEMLLREKPGQI 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 E+++L +ML PE WFG IS+++H+LDIA PE P+ I + L++G Sbjct: 242 ETPELNELHRIMLANCATPEMLMPWFGRMISEAKHDLDIAEPEQPHTAQSILEQLEEGAQ 301 Query: 303 LVRLGGLRVLRIGDD---VYANGEKIDSPHRPAL-DALASNIALTAENFGDALEDPSFLA 358 VRLGGL + ++ NGE+ + L L + E + +E+ + L Sbjct: 302 FVRLGGLHAVYFEQAPELLFINGEQFNCEGFTELGHHLCDQDEVGGELYDLLIENKNALI 361 Query: 359 MLAALVNSGYWF 370 + LVN GYW+ Sbjct: 362 LFTDLVNQGYWY 373 >UniRef50_C4LEX7 Cupin 4 family protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LEX7_TOLAT Length = 381 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. 
Identities = 181/374 (48%), Positives = 244/374 (65%), Gaps = 5/374 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 QL L+ F+ WQK+P VL+ + F DPI+PDELAGLA E +V+SRLV+ DGKW Sbjct: 2 NQLNLDLAAFMREFWQKKPTVLRGAYAPFTDPITPDELAGLATEEQVESRLVTFADGKWT 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGPF+ Y LGE++W+LLVQA +HW +P A L+ PFR LP+WRIDD+MIS+SVPGGGV Sbjct: 62 AEHGPFDDYSQLGESHWALLVQATDHWIKPVADLITPFRGLPNWRIDDVMISYSVPGGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+DQYDVFIIQG+G RRWRVG +Q P LL V+ FE IID EL+ GDILYIP Sbjct: 122 GPHIDQYDVFIIQGSGSRRWRVGADTPAEQFVATPGLLHVEQFEPIIDVELQSGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPH+GYA+ AM+YS+G+RAPN ++L S FAD++LQ G Y+DP P V Sbjct: 182 PGFPHDGYAITEAMSYSIGYRAPNQQDLFSSFADFLLQENAGQVRYTDPKRELTKTPGLV 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 ++++ LR++M L++ + F +W G +SQ++HEL+I E PDE+ AL+ + Sbjct: 242 TNKDVNDLRDLMRTLLHDEQLFSKWLGTNLSQAKHELNILSQEWDLIPDELLPALEAEDE 301 Query: 303 LVRLGGLRVLRIG---DDVYANGEKIDSPH--RPALDALASNIALTAENFGDALEDPSFL 357 L RLGGLR L D + NGE++ P R + ++ LT + L++P + Sbjct: 302 LYRLGGLRCLYFAALPDCCFVNGEQLQIPEGGRALAHLMCNSTVLTHKELQPYLDNPILV 361 Query: 358 AMLAALVNSGYWFF 371 + N GYW+ Sbjct: 362 DWICYWFNQGYWYL 375 >UniRef50_A0YBW0 Transcription factor jumonji, jmjC n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YBW0_9GAMM Length = 391 Score = 437 bits (1125), Expect = e-121, Method: Composition-based stats. 
Identities = 136/382 (35%), Positives = 218/382 (57%), Gaps = 15/382 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ DFL WQ +P +++ F NFI+P+SP++LAGLA E+E++SRL++ +GKWQ SH Sbjct: 5 NLDIADFLANTWQTKPRLIRNAFPNFINPMSPEDLAGLACEAEIESRLITEANGKWQTSH 64 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GP ++++L ++NW+LLVQAV+HW A L+ FR +P WRIDD+M+S++ GG VG Sbjct: 65 GPIAETTFNNLSDSNWTLLVQAVDHWVPEVADLLDNFRFIPSWRIDDVMVSYATRGGSVG 124 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D YDVF++QG G+RRW+VG +P+L + F A + LE GD+LYIP Sbjct: 125 PHYDNYDVFLVQGAGQRRWQVGGPCSAANSLQNNPELRLLADFVAEEEWVLEAGDMLYIP 184 Query: 183 PGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PG H G A++ + M YS+GFRAP+ E++S F D L Y+DP + + H + Sbjct: 185 PGISHWGTAMDNDCMTYSIGFRAPSHSEMLSDFCDDTLAGLTEELRYADPGLQEQGHSGE 244 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPP-EPPYQPDEIYDALKQG 300 ++P + + ++ +N + +WFG +++Q ++ + A + +Q ++ LK Sbjct: 245 IMPAAISNAQRILQNYVNDEQRLTEWFGRYVTQQKYPAETADNTDEKFQQGDLVQLLKDD 304 Query: 301 EVLVRLGGLRVLRIGDD-------VYANGEKIDSPHRPAL---DALASNIALTAENFGDA 350 V++R +R+ I + + NG +S + LA N + + Sbjct: 305 GVILRDPTVRIAFIDAESPSNSLLFFVNGVCFESVGDSCIALSKLLADNTRICSGQIMPW 364 Query: 351 LEDPSFLAMLAALVNSGYWFFE 372 L D + +L LVN G +F+ Sbjct: 365 LGDTESVQLLLRLVNQGVLYFD 386 >UniRef50_Q5E4F9 Conserved protein n=16 Tax=Gammaproteobacteria RepID=Q5E4F9_VIBF1 Length = 394 Score = 436 bits (1122), Expect = e-121, Method: Composition-based stats. 
Identities = 170/381 (44%), Positives = 253/381 (66%), Gaps = 11/381 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL+ + +FL +WQK+PV++K GF NF DP++P+ELAGL +E++VDSR +S+ + +W+ Sbjct: 14 YQLSFSLQEFLSEYWQKKPVIIKDGFENFQDPVTPEELAGLTLENDVDSRFISNANNEWK 73 Query: 63 VSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 HGP Y+ LGETNWS++VQA NHWH+ A L +PF+++P+W DD+MIS+SVP G Sbjct: 74 AEHGPLSEELYETLGETNWSIIVQAANHWHKGAAELFKPFKQMPNWLFDDIMISYSVPHG 133 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GVGPH+DQYDVFIIQG G+R WRVG+ + ++ H L Q+ FE IID+ LEPGDILY Sbjct: 134 GVGPHIDQYDVFIIQGQGKRHWRVGDIGEYQEEHRHSALKQITGFEPIIDQILEPGDILY 193 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IPPGFPH+GYALE +M+YS GFR+P +ELIS FAD++++ E G +Y +P++ ++H + Sbjct: 194 IPPGFPHDGYALEPSMSYSAGFRSPKEQELISNFADFIIENEKGDVHYHNPELSTQSHGS 253 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ + + L+ MML ++ + KQ+ GE++S SRH L+I P + +E+ + L G Sbjct: 254 EITTRSFEDLKAMMLSAMSDEQTLKQFMGEYLSNSRHHLNIIPDSEKWTTEELLNYLHSG 313 Query: 301 EVLVRLGGLRVLRIGDD-------VYANGEKIDSPHRPALDA--LASNIALTAENFGDAL 351 + L+++ G+R + ++ +GE P + D L +T N L Sbjct: 314 QALIKVAGVRSFYHEVESCEENMTLFIDGESYVFPLKMKNDVITLCEANEVTLNNIEQLL 373 Query: 352 EDPSFLAMLAALVNSGYWFFE 372 DP +A L LVN GY++ E Sbjct: 374 LDPHSVANLLQLVNIGYFYAE 394 >UniRef50_A6F8R4 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6F8R4_9GAMM Length = 379 Score = 435 bits (1120), Expect = e-120, Method: Composition-based stats. 
Identities = 164/378 (43%), Positives = 234/378 (61%), Gaps = 8/378 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 Y+L L+ DF++ +WQK+P+++K GF +FIDPISPDE+AGLAME +V SR+VS +DGKW+ Sbjct: 2 YKLNLDIADFMQNYWQKKPLLIKAGFKDFIDPISPDEIAGLAMEEDVTSRMVSLEDGKWE 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF +D L + ++LVQA+NHWH+P+A L F +P WR DDLM+S+S GGV Sbjct: 62 AKCGPFTEFDRLEKPGAAILVQAINHWHDPSAELANVFNFIPSWRFDDLMVSYSSDTGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH+D+Y VFIIQG G+R WRVG + + ++ + L + F+A+ID LEPGDILYI Sbjct: 122 GPHVDRYCVFIIQGQGKRHWRVGSQDMNPQEFAANGALKHCEAFDAVIDTVLEPGDILYI 181 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP PHEGYA+ A+NYSVGFRA + +EL++ F DY+LQ++ YSDP + PRA Sbjct: 182 PPYAPHEGYAVGEAINYSVGFRAQDQKELLNDFGDYLLQQDKEFVRYSDPKLQPRAEHGS 241 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE--IYDALKQ 299 + E+ L ++M L+ + G S+S HELD+ PE Y D + D + Sbjct: 242 IESGEVQGLTDIMTSLMADKSVMHDFLGRHYSESAHELDLLVPEGGYIADYAIVVDEIGM 301 Query: 300 GEVLVRLGGLRVLRIGD---DVYANGEK--IDSPHRPALDALASNIALTAENFGDALEDP 354 L ++ GL+ L + + +GE+ D+ ++ L + TA+ +ED Sbjct: 302 ESYLRKVNGLKTLYFPEMPTSCFIDGERYDFDASIAASVQTLCNTTEQTAKELEVLMEDK 361 Query: 355 SFLAMLAALVNSGYWFFE 372 F +L VN GYW FE Sbjct: 362 VFGELLIEWVNLGYWHFE 379 >UniRef50_C9QJT9 Putative uncharacterized protein n=2 Tax=Vibrionaceae RepID=C9QJT9_VIBOR Length = 377 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. 
Identities = 179/377 (47%), Positives = 245/377 (64%), Gaps = 8/377 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQLT + FL HW K+P V+K GF +FIDPIS DELAGLAME E+DSR +S++D +W Sbjct: 2 YQLTFDLKAFLAEHWHKKPTVIKAGFADFIDPISADELAGLAMEEEIDSRFISNKDNQWS 61 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 +HGP ++ L E++W L+VQA NHWH +A L++ F++LP W DDLM+ FS P G Sbjct: 62 ATHGPLPESHFESLDESHWQLIVQACNHWHLGSAELVQAFKQLPQWLFDDLMVCFSAPEG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGE--KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDI 178 GVGPH+DQYDVFIIQG+G+RRWRVG+ K Q K+ L Q++ FE+IIDE LEPGDI Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGDIDKGQYKESIQAGALRQIEGFESIIDEVLEPGDI 181 Query: 179 LYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 LYIPPGFPHEG LE +M+YS+GFR+P +EL+S FADYVL ++G + +P+ + + Sbjct: 182 LYIPPGFPHEGNTLEPSMSYSIGFRSPKEQELLSNFADYVLAHDIGDVHLHNPEQSAQDN 241 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 ++L Q++ KL +M+ +N + + + G +SQSRH+LDI PE Y E+ + L+ Sbjct: 242 NGELLSQDLAKLTDMLKAALNGEKDIQTFMGAMLSQSRHQLDIVEPEEAYSDTEVSEYLQ 301 Query: 299 QGEVLVRLGGLRVLRIGDDV---YANGEKIDSPHRPALDALASNIALTAENFGDALEDPS 355 G VL ++ GLR L Y NGE D P+ AL L+ ++ D Sbjct: 302 SGGVLRKVSGLRALYHQGYFHSIYINGESFDVPNSNMTRALCDYDELSIDSSTGPDLD-E 360 Query: 356 FLAMLAALVNSGYWFFE 372 +L LVN GYW+F+ Sbjct: 361 STQLLTKLVNKGYWYFD 377 >UniRef50_A1RJT3 Cupin 4 family protein n=14 Tax=Alteromonadales RepID=A1RJT3_SHESW Length = 386 Score = 422 bits (1085), Expect = e-116, Method: Composition-based stats. 
Identities = 155/380 (40%), Positives = 230/380 (60%), Gaps = 10/380 (2%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 M+ ++ L +FL ++WQK+P+V+++GF F D +SP+ELAGLAM+ V+SR V Q G Sbjct: 1 MQLEINGLTPAEFLAQYWQKKPLVIRQGFKQFQDLVSPEELAGLAMDELVESRRVYQQAG 60 Query: 60 KWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 +W GPF+SY+ LGE +W+L+VQA+N+W AL++ F +P WR DD+M+S++ PG Sbjct: 61 QWHAEFGPFDSYEKLGERDWTLIVQALNNWVPDAEALIQCFDFIPRWRFDDVMVSYATPG 120 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 GGVGPH+D YDVFI QG+GRRRWRVG++ ++ HP LL + FE IID EL PGDIL Sbjct: 121 GGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFAAHPALLHTEAFEPIIDTELLPGDIL 180 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIPPGFPH+G LE ++++SVG+R + +++ S AD++ + +LG +DP+ Sbjct: 181 YIPPGFPHDGITLEESLSFSVGYRTASAKDMFSALADHLSEHDLGAQQIADPERQVSHRS 240 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 V ++ +LR + ++N ++ G +++QS+ LD+ DE+ L Sbjct: 241 GCVDNNDLARLRSQLTSMLNDK-LVSEFSGRYLTQSKCALDLPDEPLDITQDEVLAWL-D 298 Query: 300 GEVLVRLGGLRVLRIGDDV-----YANGEKIDSPHRPA--LDALASNIALTAENFGDALE 352 + L+RLGGLR L V + NGE+ P A + L L L+ Sbjct: 299 EQPLIRLGGLRCLYFDVSVEQGTIFINGERYQLPVELAGIIPLLCDMSQLDKTALLPWLD 358 Query: 353 DPSFLAMLAALVNSGYWFFE 372 + LA L VN GYW+FE Sbjct: 359 NADGLAQLTEWVNLGYWYFE 378 >UniRef50_B8K5G8 Cupin superfamily protein n=1 Tax=Vibrio parahaemolyticus 16 RepID=B8K5G8_VIBPA Length = 375 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. 
Identities = 168/376 (44%), Positives = 242/376 (64%), Gaps = 8/376 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL+ + FL ++W K+P V+K G NFIDPISP+ELAGLAME EVDSR V++++G WQ Sbjct: 2 YQLSFDLDSFLAKYWHKQPTVIKHGITNFIDPISPEELAGLAMEEEVDSRFVTNKNGHWQ 61 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 HGP + L E++W L+VQA NHWH A L+ PF+ LP W DDLM+ +S P G Sbjct: 62 AQHGPLPESLFSQLEESHWQLIVQACNHWHLGAAELVAPFKALPQWLFDDLMVCYSAPQG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDLLQVDPFEAIIDEELEPGDIL 179 GVGPH+DQYDVFIIQG+G+RRWRVG + + L Q++ F+AIIDE LEPGDIL Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGAADEGQYQESIQGALRQIESFDAIIDEVLEPGDIL 181 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIPPGFPHEG +E +M+YS+GFR+P +EL+S FADYVL +E G + +P + + + Sbjct: 182 YIPPGFPHEGNTIEPSMSYSMGFRSPKEQELLSHFADYVLAKEKGDVHLHNPQMQTQRNH 241 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 ++L ++ L +M+ + + + + +SQSRH+LDI PE +++Y L+ Sbjct: 242 GEILRSDLTLLTQMLQSALESKQDIENFLALNLSQSRHQLDIVEPEEVISQEQVYAHLEA 301 Query: 300 GEVLVRLGGLRVLRIGDD---VYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSF 356 +V++ GLR L ++ V+ NGE+ L ++T +++ +E PS+ Sbjct: 302 LGHVVKVSGLRALYHANNANHVFINGEEFSVAEPAFAPILCDQASITLDSYS--IESPSW 359 Query: 357 LAMLAALVNSGYWFFE 372 +A+L LVN GYW+ + Sbjct: 360 IALLTRLVNLGYWYLD 375 >UniRef50_Q48H58 YcfD protein n=22 Tax=Gammaproteobacteria RepID=Q48H58_PSE14 Length = 388 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. 
Identities = 138/375 (36%), Positives = 216/375 (57%), Gaps = 12/375 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVSH 65 ++ FL +WQK+P+++++ +F PI DELAGLA+E EV+SRLV H + W++ Sbjct: 13 ISARVFLRDYWQKKPLLIRQALPDFQSPIDADELAGLALEEEVESRLVLEHGERPWELRR 72 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + + L E +W+LLVQAV+ + + L+ FR LP WRIDD+MIS++ PGG VG Sbjct: 73 GPFAEDEFSKLPERDWTLLVQAVDQFVPEVSELLENFRFLPSWRIDDVMISYAAPGGSVG 132 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D YDVF++QG G+R W++G+ H DL + FE + LEPGD+LY+P Sbjct: 133 PHFDNYDVFLLQGHGKRHWQIGQMCDAESPMLQHADLRILAEFEKTEEWTLEPGDMLYLP 192 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 P H G A+++ + YSVGFRAP+ E+++ F D++ Q Y+D D P + P + Sbjct: 193 PRLAHCGVAVDDCLTYSVGFRAPSAAEVLTLFTDFLSQFIPDEERYTDADAQPVSDPHQI 252 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 +D+L+ ++ E ++ WFG+F+++ R+ + PE + D++ D+L+QG V Sbjct: 253 QHDALDRLKALLTEHMSDERLLLTWFGQFMTEPRYPELVTGPE--LEEDDLLDSLEQGAV 310 Query: 303 LVRLGGLRVLRIGDD----VYANGEK--IDSPHRPALDALASNIALTAENFGDALEDPSF 356 L+R R+ D ++A+G+ + R L + + AL +EN G L D Sbjct: 311 LIRNPSARLAWSEVDDDLLLFASGQSRLLPGSLRELLKLICAADALHSENLGQWLADDDG 370 Query: 357 LAMLAALVNSGYWFF 371 +L LV G F Sbjct: 371 RNLLCELVKQGSLGF 385 >UniRef50_B3PKY0 Putative uncharacterized protein n=2 Tax=Pseudomonadaceae RepID=B3PKY0_CELJU Length = 396 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. 
Identities = 124/367 (33%), Positives = 205/367 (55%), Gaps = 11/367 (2%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG-KWQVSHGP 67 +FL +WQK+P++++ F F PI+PDELAGLA+E EV+SR+V W++ +GP Sbjct: 25 IEEFLRDYWQKKPLLIRNAFPGFESPIAPDELAGLALEEEVESRIVLENGATPWELRNGP 84 Query: 68 FES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 F+ + L E W+LLVQAV+ W L+ FR +P+WR+DDLMIS++ GGVGPH Sbjct: 85 FDEDTFAKLPEKRWTLLVQAVDQWVPEVNQLLDYFRFIPNWRLDDLMISYAPDQGGVGPH 144 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 D YDVF++QG G+R W++G+ L + F + LEPGD+LYIPPG Sbjct: 145 FDYYDVFLLQGLGKRHWKIGQVCDNNSPRVEGTRLKILSEFHTTDEWVLEPGDMLYIPPG 204 Query: 185 FPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 H G A+ ++ M YS+GFRAP+ +++S V YSDPD+ +++P ++ Sbjct: 205 IAHWGNAVGDDCMTYSIGFRAPSHADILSEIGQEVALNIADDLRYSDPDLKRQSNPGEIG 264 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 P+ + +L+ ++ + + PE WFG+++++ ++ D+ AL G++L Sbjct: 265 PEAIAQLQHIIQQHLT-PETIAHWFGKYMTERKYLEQTDEEPLEIDADDWQAALADGQLL 323 Query: 304 VRLGGLRVLRI----GDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAM 359 R R+ G ++A+GE I+ R + + + ++ +++P +A Sbjct: 324 WRHPAARLAFHSDKNGTFLFADGEAINC-SRELAELVCAETEISWVQIKPFVQEPFDVAA 382 Query: 360 LAALVNS 366 L+ L+N Sbjct: 383 LSQLINQ 389 >UniRef50_Q1QUR4 Cupin 4 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QUR4_CHRSD Length = 397 Score = 411 bits (1058), Expect = e-113, Method: Composition-based stats. 
Identities = 147/378 (38%), Positives = 222/378 (58%), Gaps = 14/378 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--DGKWQVS 64 L FL +WQK+P++++ F +F P++P+ELAGLA E +++RLV Q D WQVS Sbjct: 13 LTAETFLRDYWQKKPLLIRGAFPDFASPLAPEELAGLACEDGIEARLVEAQGPDKPWQVS 72 Query: 65 HGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGPF+ ++ L + W+LLVQAV+H+ AAL+ F LP WR+DD+M+S++ P G V Sbjct: 73 HGPFDDATFARLPDREWTLLVQAVDHYVPEVAALLDAFDFLPRWRLDDVMVSYAPPEGSV 132 Query: 123 GPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFE--AIIDEELEPGDIL 179 GPH+D YDVF++QG+G+RRW++ GE+ DL ++ FE A D LEPGD+L Sbjct: 133 GPHVDNYDVFLLQGSGQRRWQLGGEQPDDAPIVSGIDLRMLERFEVTADEDWVLEPGDML 192 Query: 180 YIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 Y+PP H G + + M YS+GFRAP+ E+I+ FADY+ + + Y+DPD+ P AH Sbjct: 193 YLPPRIAHHGVSQSADCMTYSIGFRAPSADEVITSFADYLGEMQPDSRRYTDPDLAPCAH 252 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 + Q + +LR +ML +I+ P QWFG ++Q ++ +AP + P +AL Sbjct: 253 AGQLDDQAIARLRRLMLSVIDDPAQMAQWFGRVMTQPKYVDQLAPLDTPMDSAATAEALA 312 Query: 299 QGEVLVRLGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAENFGDALEDP 354 QG L R G R +D ++ +G+ P P LA L A + L+D Sbjct: 313 QGRYLERALGSRFAFHDEDGETTLFVDGDGHACPP-PLARLLADTTPLHAATLAEHLDD- 370 Query: 355 SFLAMLAALVNSGYWFFE 372 + L++L L+N G ++ Sbjct: 371 AALSLLTELLNRGSLQWQ 388 >UniRef50_A6F0B9 Transcription factor jumonji, jmjC n=1 Tax=Marinobacter algicola DG893 RepID=A6F0B9_9ALTE Length = 383 Score = 411 bits (1058), Expect = e-113, Method: Composition-based stats. 
Identities = 120/383 (31%), Positives = 195/383 (50%), Gaps = 11/383 (2%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG- 59 M+ + +FL +WQK+P+V+++ F F P+S DELAGLA E V+SR+V D Sbjct: 1 MQLPGGMPAQEFLRDYWQKKPLVIRQAFAGFECPVSADELAGLACEDAVESRIVIENDKG 60 Query: 60 -KWQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 WQ+ +GPFE + L +++W+LLVQ ++HW A L+ FR +P+WR+DD+M S++ Sbjct: 61 KPWQLHNGPFEPERFSKLPDSHWTLLVQGLDHWVPDFADLLDEFRFVPNWRLDDIMASYA 120 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEP 175 GG VGPH DQYDVF++Q G RRW G L + +E L P Sbjct: 121 PKGGSVGPHYDQYDVFLLQAEGHRRWTFGGHCDHTSPRVDGTPLRILSSWEGEETVTLAP 180 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+LY+PPG H G A ++ + S+GFRAP ++++GF D++ R + DPD+ Sbjct: 181 GDMLYLPPGVGHHGVAEDDCITLSIGFRAPTVDDVLTGFTDFLCSRSDASGHLDDPDLKV 240 Query: 236 RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYD 295 + +P + P + +L ++ + ++ WFG++ + + + P E P P+ + + Sbjct: 241 QDNPGAIGPDVIHRLDRLIRDQLSDQRQLALWFGQYSTAPKSLEIVVPAEEPVTPELLGE 300 Query: 296 ALKQGEVLVRLGGLRVLRI----GDDVYANGEKI--DSPHRPALDALASNIALTAENFGD 349 + G L G R ++ +GE+ P L + Sbjct: 301 LIAAGNPLRWNEGSRFAYHDFEDETALFVDGEQFLLRGDAGPLAPLLCAGARPDMGALAS 360 Query: 350 ALEDPSFLAMLAALVNSGYWFFE 372 D + +L+ LVN G +F+ Sbjct: 361 FAGDDAIQGLLSTLVNQGSLYFD 383 >UniRef50_Q1NG82 Putative uncharacterized protein n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NG82_9SPHN Length = 380 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. 
Identities = 142/375 (37%), Positives = 214/375 (57%), Gaps = 11/375 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + + FL HWQK+P++++ + + +P+ PDELAGLA E V+SR+V DG W + H Sbjct: 4 SFDVQAFLRDHWQKQPLLIRNPWGAWANPLEPDELAGLACEEGVESRIVVQTDGDWALEH 63 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + + LG + W+LLVQAV+H AAL+ PFR +PDWRIDD+M+S++ GGGVG Sbjct: 64 GPFADDRFATLGGSPWTLLVQAVDHHAPDVAALIAPFRFIPDWRIDDVMVSYASDGGGVG 123 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH DQYDVF++QG GRRRWRVG++ PH DL + F A + LEPGDILY+P Sbjct: 124 PHFDQYDVFLVQGLGRRRWRVGQRCDRDTALRPHRDLRLLPDFAATDEWVLEPGDILYVP 183 Query: 183 PGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PGF HEG A+ ++ M YS+GFRAP+ +++ +AD++ + + Y+DPD+ P A+P + Sbjct: 184 PGFAHEGVAVGDDCMTYSIGFRAPSRPDMLVEWADHLAAQMPDDDLYADPDIQPAANPGE 243 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + P + +L EM + + F WFG+ ++ ++ PE P +E+ + G Sbjct: 244 IEPDAIARLHEMTIAAMADRSAFAAWFGQHVTTPKYPDADWRPEEPVTAEELLALIDAGA 303 Query: 302 VLVRLGGLRVLRI----GDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFL 357 L R R + G ++ +G A+ A A ++ + + Sbjct: 304 QLWRNPASRFAFLREEDGVTLFVDGSAYPCAGDLAI-LAQQLCAYPALALDPSM--VAGV 360 Query: 358 AMLAALVNSGYWFFE 372 +L LVN G E Sbjct: 361 GLLVTLVNQGSLMIE 375 >UniRef50_Q2S4H4 Cupin superfamily protein n=3 Tax=Bacteria RepID=Q2S4H4_SALRD Length = 394 Score = 401 bits (1032), Expect = e-110, Method: Composition-based stats. 
Identities = 143/379 (37%), Positives = 219/379 (57%), Gaps = 13/379 (3%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK--WQVSH 65 + DFL+ +WQ+RP+V++ +F P+SP+ELAGLA E V+SRL+ + G+ W++ H Sbjct: 15 SPADFLDTYWQERPLVVRDALPDFRSPLSPEELAGLACEDGVESRLILEEGGEHPWELRH 74 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF E + HL ET+W+LLVQ V+ AL+ FR LPDWR+DD+M+S++ G VG Sbjct: 75 GPFASEEFLHLPETHWTLLVQEVDRLIPEVGALLDRFRFLPDWRLDDVMVSYAPTHGTVG 134 Query: 124 PHLDQYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH+D YDVF++QG G RRW++G E + ++ P D+ + FEA + L PGD+LY+P Sbjct: 135 PHIDNYDVFLLQGAGHRRWQIGTEPVDDEEIVPDLDVRILADFEAEEEFVLGPGDLLYLP 194 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 P H G A ++ M YSVGFRAP ++L+ F + YSDPD+ P HP + Sbjct: 195 PRVAHYGVATDDQCMTYSVGFRAPRHQDLVGNFLQQAMDTVGPDARYSDPDLSPVDHPGE 254 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + +R ++ +L+ + QWFG+++++ + + PPE P DE+ D L+ G Sbjct: 255 IHDDARQTVRRLLRDLVRDDDAIDQWFGQYLTRPGRDREAVPPETPVTDDELTDMLRAGH 314 Query: 302 VLVRLGGLRVLRIGDD-----VYANGEKID-SPHRP-ALDALASNIALTAENFGDALEDP 354 L R+ I D ++ANG ID SP R A + + ++ LED Sbjct: 315 GLRPGPVSRLAFIEHDDGSVTLFANGSPIDLSPDRAYAARLVTGRQQIPSDALTPHLEDD 374 Query: 355 SFLAMLAALVNSGYWFFEG 373 +F+ +L AL+N G ++ Sbjct: 375 AFVDLLVALINDGLLEWDA 393 >UniRef50_Q2Y9X5 Cupin region n=9 Tax=root RepID=Q2Y9X5_NITMU Length = 415 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. 
Identities = 117/366 (31%), Positives = 199/366 (54%), Gaps = 10/366 (2%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 L+ DFL+ HWQK+P+++++ +F + +EL LA + + SRLV+ ++G+W+V HG Sbjct: 35 LSPSDFLQDHWQKKPLLIRKALPDFSGLLDANELIDLACQEDAQSRLVTRRNGRWEVRHG 94 Query: 67 PF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 PF ++ L + W+LLVQ VNH+ L+ F +P R+DDLM+S++ GGVGP Sbjct: 95 PFAPRAFARLPQKGWTLLVQDVNHFLPAARELLLKFNFIPHSRLDDLMVSYAPEDGGVGP 154 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 H D YDVF++QGTGRRRWR+ + + L + F + LEPGD+LY+PPG Sbjct: 155 HFDSYDVFLLQGTGRRRWRISGQKD-RTLVAAAPLKILQDFRPEQEWVLEPGDMLYLPPG 213 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 + H+G A+E M YS+GFRAP +EL F ++ Y DPD+ + HP + Sbjct: 214 YAHDGVAVEPCMTYSIGFRAPTYQELAMQFLVHLQDSCEIAGIYEDPDLRIQTHPGQISS 273 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 +D++ + ++ +++ G ++++ + + PP+ P + +++G++ + Sbjct: 274 AMLDQVNAALDKIEWDNVEVERFIGMYLTEPKPHVFFMPPQEPISERKFVHQIRKGKLQL 333 Query: 305 RLGGLRVLRIGDDVYANGE--KIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAA 362 L R+L + ++ NG+ ++ + L LA +AL+ D A+L Sbjct: 334 DLKS-RMLFRENRIFLNGDVYEVGKTAQRILGELADRLALSPVR----DIDAETQALLYQ 388 Query: 363 LVNSGY 368 GY Sbjct: 389 WYLDGY 394 >UniRef50_Q3JQS3 Cupin superfamily protein family n=25 Tax=Burkholderiales RepID=Q3JQS3_BURP1 Length = 422 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. 
Identities = 117/373 (31%), Positives = 186/373 (49%), Gaps = 13/373 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ F+ R+WQK+P+++++ P+S D L LA + +V+SRLV+H +WQ+ H Sbjct: 46 NLSPAQFMRRYWQKKPLLIRQAITGIAPPLSRDALFELAADYDVESRLVTHFRNRWQLEH 105 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPFE + W+LLVQ ++ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 106 GPFEPEHLPSVKRREWTLLVQGLDLHDDRARALLERFRFVPDARLDDLMISYATDGGGVG 165 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q G+RRWR+G + + P L + FE + LEPGD+LY+PP Sbjct: 166 PHFDSYDVFLLQVHGKRRWRIGAQQDLSLQEGLP-LKILANFEPTDEWVLEPGDMLYLPP 224 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQR------ELGGNYYSDPDVPPRA 237 H+G AL M S+GFRAP+ EL + F ++ +R Y DP P Sbjct: 225 HIAHDGIALGECMTCSIGFRAPSAGELRAQFLYHLAERGGLRTGARDDARYRDPAQPAVD 284 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDA 296 PA + + ++ + + + G ++S+ + + PP + + A Sbjct: 285 SPAMLPAAMVKRVAATLAGIQWDEHDVGDFLGCYLSEPKSNVVFEPPTRRLGEAAFVTQA 344 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEKID-SPHRPALDALASNIALTAENFGDALEDPS 355 ++G L R +L + NG+ + L LA + A+ F DP+ Sbjct: 345 SRRGVRLDRKAA--LLYNARSYFINGDAHPLATAAKWLPELADTRRMEAKRFVTLSRDPA 402 Query: 356 FLAMLAALVNSGY 368 +L +G+ Sbjct: 403 MTGLLHEWYCAGW 415 >UniRef50_C4K8V5 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K8V5_HAMD5 Length = 379 Score = 398 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 154/371 (41%), Positives = 220/371 (59%), Gaps = 7/371 (1%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 L +NW DFL+ +WQK P++LK+ NFI+P+SPDELA L +E ++S+L+ +GK QV Sbjct: 3 LMINWQDFLQHYWQKHPMLLKQAVVNFINPVSPDELAKLVIEKALESQLIKKVNGKCQVV 62 Query: 65 HGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 H F Y LG NWSL VQA+NHWH P M FR PDW +DL +SFSVPGGG+G Sbjct: 63 HNVFNGYKSLGRHNWSLKVQAINHWHRPAEEFMYLFRTFPDWYREDLTVSFSVPGGGLGL 122 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 + DVFIIQG GR RWR+ L H + F +II+EEL GD LYIP G Sbjct: 123 YAKTSDVFIIQGIGRSRWRIWNPLSSVVHYDQKNF-----FPSIINEELVSGDALYIPKG 177 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS-DPDVPPRAHPADVL 243 FPHE + E A++Y + N+ +I + + + + G YS PD+ R P ++L Sbjct: 178 FPHEAISSETALSYCINLWTDNSLRMIRNWTESLSDKNHRGIEYSPSPDLLMRDDPTEIL 237 Query: 244 PQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 PQ++ ++ +M + L+ Q + + WFG+ +SQS ++L +AP YQP ++ L+Q Sbjct: 238 PQDITAIQNIMNQFLLQQRDDLETWFGQQMSQSSYDLPMAPAAQVYQPSQVQSILQQDIS 297 Query: 303 LVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAA 362 L RL GLR+L IGD + NGE + S + A + +A N + ++D F+ L Sbjct: 298 LCRLMGLRMLHIGDRYFLNGESLASNYADAWNIMAHNTTINGYMLRKFIDDNDFMTQLTL 357 Query: 363 LVNSGYWFFEG 373 L+N GYW+F+G Sbjct: 358 LINKGYWYFQG 368 >UniRef50_D1UI98 Cupin 4 family protein n=6 Tax=Burkholderia RepID=D1UI98_9BURK Length = 424 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. 
Identities = 124/371 (33%), Positives = 195/371 (52%), Gaps = 11/371 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L F+ R+WQK+P+++++ + P+S DEL LA + +V++RL++H +WQ+ H Sbjct: 50 NLTPSQFMRRYWQKKPLLIRQAIPDVEAPLSRDELFELADQDDVEARLITHFRNRWQLEH 109 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + L + W+LLVQ V+ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 110 GPFAPDELPSLKQRAWTLLVQGVDLHDDRARALLERFRFVPDARLDDLMISYATDGGGVG 169 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q G+RRWR+ + + P L + F A + LEPGD+LY+PP Sbjct: 170 PHFDSYDVFLLQVKGKRRWRISAQKDLTLQAGLP-LKVLQNFAAEQEWVLEPGDMLYLPP 228 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQR----ELGGNYYSDPDVPPRAHP 239 H+G A M S+GFRAP+ EL + F ++ +R G Y DP P P Sbjct: 229 HIAHDGVAEGECMTCSIGFRAPSAGELTAQFLYHLAERGEASGQAGALYRDPQQPAVERP 288 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDALK 298 A++ P ++++ ++ + + + G ++S+ + + PP+ P + I A K Sbjct: 289 AELPPALVERVGAILAGITWNEQDIASFLGTYLSEPKPSVVFDPPQRPLNEARFISQASK 348 Query: 299 QGEVLVRLGGLRVLRIGDDVYANGEKID-SPHRPALDALASNIALTAENFGDALEDPSFL 357 G L R +L + NGEK + L LA + L+A+ F D S Sbjct: 349 SGVRLDRKTN--LLYNRRFFFLNGEKTSLEGSKKWLFDLADHRCLSAKRFVTLSHDSSVT 406 Query: 358 AMLAALVNSGY 368 A L +G+ Sbjct: 407 ARLHEWYRAGW 417 >UniRef50_Q2BJ43 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ43_9GAMM Length = 382 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. 
Identities = 130/381 (34%), Positives = 213/381 (55%), Gaps = 26/381 (6%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--DGKWQVS 64 ++ FL+ +WQK+P++++ F +F P++ DELAG+A+E EV+SRL+ W++ Sbjct: 8 ISVETFLKEYWQKKPLLIRNAFPDFEPPVTADELAGMALEEEVESRLIIQSADGADWELK 67 Query: 65 HGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGP E++ L +++W+LLVQAV+HW A L+ FR P+WR+DDLMIS++ GGGV Sbjct: 68 HGPLNEETFAELPDSHWTLLVQAVDHWVPEAAELVEQFRFAPNWRLDDLMISYASDGGGV 127 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH D YDVF+IQ TG RRW VG ++ + +EA +L+PGD+LY+ Sbjct: 128 GPHYDNYDVFLIQATGTRRWEVGGIFDEDSPRRDDVPVMILPEWEAEQSWDLQPGDMLYL 187 Query: 182 PPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 PP H GYAL ++ M SVGFRAP+ +E+ +GF +Y+ + YSDPD+ +A+P Sbjct: 188 PPRVGHNGYALGDDCMTLSVGFRAPSHQEIFAGFTNYLDNITCAEDRYSDPDLKTQANPG 247 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ + + +++ ++ E I P WFG+F+++S++ A + +I + ++ G Sbjct: 248 EIDQEAIGRVQAILREYIADPALLSHWFGQFMTESKYPDLGADQSQEMEEGDIKNLIEAG 307 Query: 301 EVLVRLGGLRVLRIGDD---VYANGE-------KIDSPHRPALDALASNIALTAENFGDA 350 L R G R ++ +G+ +I+ R D + I + EN Sbjct: 308 VPLCRTEGSRFAYHQGQPFVLFVDGKGCACSPGQIELAKRLCADLYHTEIETSEENL--- 364 Query: 351 LEDPSFLAMLAALVNSGYWFF 371 ++ AL+ G +F Sbjct: 365 -------QLIKALLLQGSLYF 378 >UniRef50_C5A9S6 Cupin superfamily protein family protein n=49 Tax=Burkholderiales RepID=C5A9S6_BURGB Length = 422 Score = 396 bits (1017), Expect = e-109, Method: Composition-based stats. 
Identities = 123/373 (32%), Positives = 190/373 (50%), Gaps = 13/373 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L F+ RHWQK+P+++++ + P+S D L LA + + +SRL++H +WQ++ Sbjct: 46 NLTPSQFMRRHWQKKPLLIRQAIPGIVPPLSRDALFELAGDYDTESRLITHFRNRWQLAQ 105 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPFE S + + W+LLVQ V+ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 106 GPFELDSLPSVSKREWTLLVQGVDLHDDAARALLERFRFIPDARLDDLMISYATDGGGVG 165 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q GRRRWR+G + + P L + FE + LEPGD+LY+PP Sbjct: 166 PHFDSYDVFLLQVHGRRRWRIGAQQDLTLREDLP-LKVLARFEPTDEWVLEPGDMLYLPP 224 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQR------ELGGNYYSDPDVPPRA 237 H+G A M S+GFRAP+ EL F Y+ +R G Y DP PP Sbjct: 225 HIAHDGIAEGECMTCSIGFRAPSAGELTGQFLYYLAERGALRQGARAGELYRDPAQPPVD 284 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDA 296 PA + ++++ ++ + + + G ++S+ + + PE P + + A Sbjct: 285 DPARLPAALVERVETILKGIRWTTRDVENFLGSYLSEPKSNVVFDAPERPLGEAAFVAQA 344 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH-RPALDALASNIALTAENFGDALEDPS 355 ++G L R +L + NGE+ L LA L A+ F P Sbjct: 345 SRRGIRLDRKAA--LLYNARSYFINGEENPLAGNAKWLPELADRRHLGAKRFVTYSRHPL 402 Query: 356 FLAMLAALVNSGY 368 A+L +G+ Sbjct: 403 MTALLHEWYCAGW 415 >UniRef50_C6WYD1 Cupin 4 family protein n=1 Tax=Methylotenera mobilis JLW8 RepID=C6WYD1_METML Length = 395 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. 
Identities = 129/384 (33%), Positives = 199/384 (51%), Gaps = 22/384 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 ++ +FL+ +W K+P+++K F +SPDELAGLA E EV SR+V GKW SHG Sbjct: 14 ISASEFLQHYWHKKPLLIKNAIPGFTGLLSPDELAGLACEEEVQSRIVEEIKGKWYASHG 73 Query: 67 PFESYDHL-------GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 PFE D + W+LLVQ+VNH A L+ F +P R+DDLM+S++ G Sbjct: 74 PFEESDFANLPEKPDPKHRWTLLVQSVNHHLPEAAELLSQFNFIPHARLDDLMVSYAPDG 133 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 GGVGPH D YDVF++QG G+R WR+ E+ + L + F+ + +E GD+L Sbjct: 134 GGVGPHFDSYDVFLLQGQGKRLWRISEQTDLS-LVEGAPLRILKNFDTAQEWLVEAGDLL 192 Query: 180 YIPPGFPHEGYAL----ENAMNYSVGFRAPNTRELISGFADYVLQRELGGN-----YYSD 230 Y+PP H G A+ + M YS+GFRAP EL++ F ++ + Y D Sbjct: 193 YLPPHLAHWGIAVTDGDTDCMTYSIGFRAPKVHELVTEFLGFMQDKLNQDANALPGIYQD 252 Query: 231 PDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP 290 D+ P+ HPA + + K+ E++ + +H + G ++S+ + ++ P + Sbjct: 253 ADLTPQEHPAQIGSSMVSKVAEILKTIQWSEQHVADFLGSYLSEPKPDIFFEPNKKMSLR 312 Query: 291 DEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAENFG 348 + L+ G L ++L Y NGE I + + A L ALA LT+++ Sbjct: 313 KFNENLLQHGISLDLK--SQMLFTQQYFYLNGEAISAAGQAASLLTALADYRMLTSDDIA 370 Query: 349 DALE-DPSFLAMLAALVNSGYWFF 371 A E D +F+ L +GY +F Sbjct: 371 QAGEVDSAFIEQLHGWYLAGYLYF 394 >UniRef50_A3QD76 Cupin 4 family protein n=19 Tax=Shewanella RepID=A3QD76_SHELP Length = 386 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 157/382 (41%), Positives = 226/382 (59%), Gaps = 15/382 (3%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 Y + +FL HWQK+P+V+K F +F DPI+PDELAGLA E E+ SR+V + W+ Sbjct: 6 YTPNFDTQEFLAHHWQKQPLVIKGAFAHFQDPIAPDELAGLACEEEIASRIVLTKKDNWE 65 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 + GP E Y G+ NW LLVQAVNHW+ L+ FR +PDWR DDLM+S++ PGGGV Sbjct: 66 IFQGPIEDYSPFGDANWQLLVQAVNHWYPDVEPLVNAFRFIPDWRFDDLMVSYATPGGGV 125 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVF++QG GRRRW+VG K Q +D FE I+D LE GD+LYIP Sbjct: 126 GPHIDNYDVFLLQGEGRRRWKVGAKGQYSPRGGDTHTALIDDFEPILDVVLEAGDMLYIP 185 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPH G LE A++YS+GFRAP+ +EL S AD+++ G ++ P A P + Sbjct: 186 PGFPHRGETLETALSYSIGFRAPSQQELFSSIADHLIDTNGGNKRFTSNQEP--ASPGLL 243 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 ++ + ++ E+++QP+H++ G+ +SQ+R ELD+A Y +E+ +AL+ G Sbjct: 244 SVEQQAGMLALVSEILSQPDHYQTVLGQTLSQNRFELDLAEQGESYSQEELMEALEDGAC 303 Query: 303 LVRLGGLRVLRIGDD----VYANGEKIDSPHRPA---------LDALASNIALTAENFGD 349 L R+GGL+V+R+ D ++ NGE D L LA+ + + D Sbjct: 304 LQRIGGLKVIRLEGDKHLRLFINGEIYDFDAVDDDDADELDDKLMLLANAFSFEGKQALD 363 Query: 350 ALEDPSFLAMLAALVNSGYWFF 371 + + L+NSGY + Sbjct: 364 LCQREAIGQYFIWLLNSGYAYL 385 >UniRef50_B4RRX0 Putative enzyme with RmlC-like domain n=2 Tax=Alteromonas macleodii RepID=B4RRX0_ALTMD Length = 388 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. 
Identities = 138/376 (36%), Positives = 216/376 (57%), Gaps = 14/376 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 + FL+ +WQ++PVV+K+ F +F DPI ++LAGLA ESEVD+R++S+ G W V G Sbjct: 18 FDADTFLKHYWQQKPVVIKQFFTDFDDPIDENDLAGLAQESEVDARVISNVQGNWHVEQG 77 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 P +DH + W+LLVQ V+ + A ++ PF +P WR+DDLM+SF+ G GVG H+ Sbjct: 78 PITDFDHACQGKWTLLVQGVDKYVPDVAPILSPFSFVPHWRLDDLMVSFATNGAGVGAHI 137 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 DQYDVF++QG G+RRWRVG+ K+ PHP L Q++ F +ID +EPGD++Y+PPG+P Sbjct: 138 DQYDVFLVQGKGKRRWRVGQPGDYKEVFPHPKLRQIERFTPVIDVVVEPGDVIYVPPGWP 197 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQE 246 H+G +E+++ YSVG+RAP+ +L A +L + ++D + +PA V + Sbjct: 198 HDGETVEDSLTYSVGYRAPDNLQLAESLA-MMLDKGAHNYRFTDIGRTHQNNPALVSTSD 256 Query: 247 MDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRL 306 + L++ +++ IN + F E +S+ + P + +++ + L G Sbjct: 257 IAALKQQLIDAINGED-FTLALLEAMSE--QGIPEYPLDNEVNLEQVSNDLAAGMSFAPA 313 Query: 307 GGLRVLRIG------DDVYANGEKIDSP--HRPALDALASNIALTAENFGDALEDPSFLA 358 G+R L +Y NG + + + LAS L A DA +FL Sbjct: 314 PGVRALLCDGKRGLPRALYVNGSQFTFAKNDQEWFEVLASGSILNATCCQDA-PSFTFLE 372 Query: 359 MLAALVNSGYW-FFEG 373 L L+N+GYW +FEG Sbjct: 373 TLTTLINNGYWEWFEG 388 >UniRef50_A6SXH9 Uncharacterized conserved protein n=2 Tax=Oxalobacteraceae RepID=A6SXH9_JANMA Length = 373 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. 
Identities = 113/365 (30%), Positives = 185/365 (50%), Gaps = 6/365 (1%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 L +FL +W K+P+++++ +F +S DEL GL +V+SRL++H +W + G Sbjct: 10 LTAAEFLRDYWHKKPLLIRQAIPDFKPLLSRDELFGLVKSEDVESRLITHVKREWNMDSG 69 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 PFE L + +W+LLVQ VN E +LMR F +PD R+DDLMIS++ GGVG H Sbjct: 70 PFEQLPPLKQKDWTLLVQGVNLHDEAVDSLMREFSFIPDARLDDLMISYATETGGVGAHF 129 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 D YDVF++Q G RRWR+G + + L + F+ + L PGD+LY+PP + Sbjct: 130 DSYDVFLLQAHGHRRWRIGAQTDL-TLVDGMPLKILKNFKPEEEFILAPGDMLYLPPQYA 188 Query: 187 HEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQE 246 HEG A++ M YS+GFRAP+ +EL F + ++ Y+DPD+ P H A++ Sbjct: 189 HEGVAMDECMTYSIGFRAPSYQELGEAFLESMIDSIDLPGRYADPDLKPAKHSAEISAAM 248 Query: 247 MDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRL 306 + ++ + ++ E + GE++S+ + ++ PE K+ + + L Sbjct: 249 LSRIAAELNKVRFTQEDIALFVGEYLSEPKAQIYFDAPEENLTRARFLQNAKKSGIKLSL 308 Query: 307 GGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAENFGDALEDPSFLAMLAALV 364 L +L + ++ NG + L LA+ L+ A D + Sbjct: 309 KSL-MLHRNNYIFINGTSFEVGDEDLAILTELANTRQLSGTIIASASAD--VIDAFHTWH 365 Query: 365 NSGYW 369 G+ Sbjct: 366 KDGWL 370 >UniRef50_D2UDU1 Putative uncharacterized protein n=1 Tax=Xanthomonas albilineans RepID=D2UDU1_XANAL Length = 415 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. 
Identities = 134/383 (34%), Positives = 203/383 (53%), Gaps = 18/383 (4%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--G 59 +Y L ++ FL +WQKRP++++ F +F+ PI PD+LAGLA E SRLV H Sbjct: 23 QYPLGMSAASFLRDYWQKRPLLIRNAFPDFVSPIEPDDLAGLACEEAALSRLVIHDRATD 82 Query: 60 KWQVSHGPFESYDH--LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 +W + +GPF+ ++ + + +W+LLVQ V+ W AL+ FR LP WR+DD+M+SF+ Sbjct: 83 RWSLRNGPFQEHEFPGMPDHDWTLLVQDVDKWDPDIRALLGQFRFLPRWRVDDVMVSFAA 142 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEAIIDE 171 GG VG H+D YDVF++Q GRRRW++ G + +L + F D Sbjct: 143 RGGSVGAHVDHYDVFLLQAHGRRRWQIDASASMGRPPPPTEFREDVELKLLRQFAPTHDW 202 Query: 172 ELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDP 231 LEPGD+LY+PP PH G A + + +SVG RAP++ ELI+ + D ++ Y D Sbjct: 203 VLEPGDMLYLPPMVPHHGVAEDACLTFSVGMRAPSSAELIADYLDTLIDGADEALRYHDE 262 Query: 232 DVPPRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPYQP 290 D+ P ++ M ++ E + L +N P+ WFG FI+ R +I PP Sbjct: 263 DLLAPTDPHEIDAAAMGRVVEALNALRMNDPDRLGAWFGRFITTYRAGGEILPPSNLPPV 322 Query: 291 DEIYDALKQGEVLVRLGGLRVLR----IGDDVYANGEKIDSPHRPALDALASNIALTAEN 346 +E AL QG VL R R+ G ++ NG + P + A LA+ L A + Sbjct: 323 EETAAALAQGLVLQRHPWARLAWRRASRGAMLFCNGMEFALPIQDA-KRLAAAEHLDATD 381 Query: 347 FGDALEDPSFLAMLAALVNSGYW 369 + A + L L+ SG++ Sbjct: 382 Y--AALSATGRQTLLQLIQSGFY 402 >UniRef50_Q2SJM1 Uncharacterized conserved protein n=3 Tax=Gammaproteobacteria RepID=Q2SJM1_HAHCH Length = 405 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. 
Identities = 127/385 (32%), Positives = 194/385 (50%), Gaps = 18/385 (4%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--DGKWQVS 64 ++ DFL +WQK+P++++ + P+ ++LAGLA E EV+SRLV + WQ+ Sbjct: 8 ISVADFLAHYWQKKPLIIRGLLPGYECPLDENDLAGLATEEEVESRLVYEELNGQPWQLE 67 Query: 65 HGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGPF E +++ W+LLVQ ++ W A L+ FR LP+WR+DD+M SF+ PGG V Sbjct: 68 HGPFSIEKLENMPHQGWTLLVQGLDTWVPEIADLLDRFRFLPNWRVDDIMASFAPPGGSV 127 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH D YDVF+IQ TG RRWR+G L + FE + LEPGD LY+ Sbjct: 128 GPHFDHYDVFLIQATGARRWRIGPPCDDQSPRVDGTPLRILQNFEQTEEWVLEPGDALYL 187 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PPG+ H G A + + SVGFR+P EL+S AD + + D P ++P Sbjct: 188 PPGYAHYGVAETSCITLSVGFRSPTYAELMSALADDWFENPALSTHLHDATEAPLSNPGL 247 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPE--PPYQPDEIYDALKQ 299 + +R + L++ ++ FG FIS + + + P + P++ ++L+ Sbjct: 248 ISDDVFADIRSRLQALLDDEAGLRRSFGRFISAPKFDAAVPPLDAAMRLSPEDAGESLQD 307 Query: 300 GEVLVRLG-GLRV---LRIGDD-----VYANGEKIDSPHR--PALDALASNIALTAENFG 348 E+ R G R L +GE D+ R P ++ L + + E Sbjct: 308 QEIQWRWNEGSRYTYSLYEEAGARRVMFAVDGEAYDADERFAPLVEILCRSNNVDRERLL 367 Query: 349 DALEDPSFLAMLAALVNSGYWFFEG 373 D L +L++L+N G EG Sbjct: 368 PWSADKDALKLLSSLLNRGSLVLEG 392 >UniRef50_Q1N4P0 Transcription factor jumonji, jmjC n=1 Tax=Bermanella marisrubri RepID=Q1N4P0_9GAMM Length = 386 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. 
Identities = 122/364 (33%), Positives = 198/364 (54%), Gaps = 12/364 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVSH 65 + FL+ +WQK+PV++++ NF PI PD+LAGL++E +V+SR++ + D WQ+ H Sbjct: 7 FSVETFLKDYWQKKPVLIRQALPNFTPPIEPDDLAGLSLEEDVESRIILENGDTPWQLIH 66 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF E++ +L E W+LLVQ V+ W + L+ F+ +P WR+DD+M+SF+ GG VG Sbjct: 67 GPFSEETFGNLPEEKWTLLVQGVDQWVPEMSELLSYFQFIPKWRLDDIMVSFAPKGGSVG 126 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH DQYDVF++Q GRR W++G K L ++ E + LEPGD+LYIP Sbjct: 127 PHFDQYDVFLLQAQGRRHWQIGPKYDASSPRIKDTPLHLLENMEVTEEWTLEPGDMLYIP 186 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 P + H G A+++ M +SVGFRAP+ E++SG + + + + Y D D+ A PA + Sbjct: 187 PQYAHNGVAVDDCMTFSVGFRAPSEAEILSGITQHAMDQLTEADRYHDEDLKASAQPALI 246 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 D+L++++ + N E ++WF E ++QS++ P E P +E+ L+ V Sbjct: 247 DQAAFDRLQQIIAKHANNTELMQEWFAECMTQSKYPELAEPLEDPLDWEEVAPLLQNDTV 306 Query: 303 LVRLGGLRVLRIGDD----VYANGEKI----DSPHRPALDALASNIALTAENFGDALEDP 354 + + R Y NG+++ D+ + L T + L+ Sbjct: 307 ISQNETSRWAYYESKGHWIFYGNGQQLLESKDNELTDSAKKLWDQRQTTLNDIKAILDHS 366 Query: 355 SFLA 358 Sbjct: 367 EGQQ 370 >UniRef50_C1DCJ3 Cupin region n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCJ3_LARHH Length = 380 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. 
Identities = 121/369 (32%), Positives = 198/369 (53%), Gaps = 12/369 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGKWQVSH 65 L +FL +W K+P++++ + P + LA LA +V+SRL+ ++ G+W V H Sbjct: 12 LTAREFLRDYWHKQPLLIRGALRDVGTPADFEVLAELARRDDVESRLIENRAGGRWHVEH 71 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF+ L ET+W+LLVQ+VNH + ++ F LP R+DDLMIS++ PGG VG Sbjct: 72 GPFQPARLARLPETDWTLLVQSVNHHLPHVSDILWRFNFLPYARLDDLMISYAPPGGTVG 131 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q G++RW+VG + + + F+A+ ELE GD+LY+PP Sbjct: 132 PHFDSYDVFLLQVGGKKRWQVGSP-DNDRLEDGAPIKVLSSFDALQSWELEQGDMLYLPP 190 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 F H G ALE M YS+GFRAP T+EL F Y+ Y+DPD+ P HPA++ Sbjct: 191 KFSHYGVALEPGMTYSIGFRAPTTQELAEQFLTYLQDTLCLDGRYADPDLEPPRHPAEIS 250 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 ++++++M+ + + ++ G ++++ ++ + PPE P DE + + ++ Sbjct: 251 ESMVEQVQDMLKAIRWDRDGVGEFLGCYLTEPKNHVFFDPPEDPLDEDEFAKVILRDGLV 310 Query: 304 VRLGGLRVLRIGDDVYANGEKIDS--PHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 + L ++L + NGE P LA+ L + D + + + L Sbjct: 311 LDLKS-QMLFRNSLCFVNGEIHAGMDGDLPVWRELANQRRLAGQAISDGMTETLYAGYL- 368 Query: 362 ALVNSGYWF 370 SG+W+ Sbjct: 369 ----SGWWW 373 >UniRef50_Q15T89 Cupin 4 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15T89_PSEA6 Length = 389 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. 
Identities = 150/378 (39%), Positives = 217/378 (57%), Gaps = 15/378 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + FL+ HWQKRPVV K F+ F+DP+ +ELAGLA + +DSR+VS ++ W V H Sbjct: 6 NFDPTLFLDSHWQKRPVVFKGAFSQFVDPLDENELAGLAQDPRIDSRIVSSENANWHVQH 65 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 GP ++H + +WSLLVQ+V+ + AL+R F +P WR+DDLM+SFS G GVGPH Sbjct: 66 GPISDFEHACQGSWSLLVQSVDQHVDEADALIRMFNFIPYWRLDDLMVSFSNTGAGVGPH 125 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 LDQYDVFIIQG G RRW+ G++ + + PHPDL Q+ F IIDE L GD+LYIP G Sbjct: 126 LDQYDVFIIQGKGSRRWQAGKRGEYSTYHPHPDLSQIQGFTPIIDEVLHSGDMLYIPAGC 185 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 PH G ALE+ MNYSVGFRAP ++L+S ADY + + Y D + PR P+++ + Sbjct: 186 PHNGVALEDCMNYSVGFRAPTQQDLLSSLADYSIDLGIFKKRYQDKGLTPRFDPSELAQE 245 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPP-YQPDEIYDALKQGEVLV 304 E+ R M+ + I+ P+ F +W S ++ P Y EI +Q V Sbjct: 246 EIHSFRNMLHDAIDSPD-FTRWLTSHFSDTQLNQGYDEQHNPDYSLQEILVLFQQQTVFE 304 Query: 305 RLGGLRVLRIGD-------DVYANGEKIDSP--HRPALDALASNIALTAE---NFGDALE 352 R G+R + + + + G+ +P H A+ A + + + + G A+ Sbjct: 305 RQPGIRPIYLAQSDENTSLEFFIEGQAFFAPPEHAQAVRAFLQSASWQFDLHSDKGTAVT 364 Query: 353 DPSF-LAMLAALVNSGYW 369 F + +++ LVN+G W Sbjct: 365 INHFWVQLISELVNAGAW 382 >UniRef50_C0N3X6 Cupin superfamily protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N3X6_9GAMM Length = 389 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 197/385 (51%), Gaps = 15/385 (3%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--GK 60 + L FL + WQK+P+++++ + +S +ELAGLA E++++SRL+ Q G Sbjct: 5 FNTELTQQQFLTQFWQKKPLLIRQAWPQMDALLSAEELAGLACEADIESRLIQEQGELGP 64 Query: 61 WQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 WQV+ GPF + L ++W+LLVQ V+ +M F +PDWR DDLM+SF+ Sbjct: 65 WQVNDGPFTEADFAKLPASHWTLLVQDVDKHVPELTEVMAKFDFIPDWRRDDLMVSFAPE 124 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 GG VGPH D YDVF++Q G RRW + + + + +L + F+A +L+PGD Sbjct: 125 GGSVGPHTDGYDVFLLQAQGTRRWAISQTPVVEAEFIDGLELKILKQFDADDVWDLQPGD 184 Query: 178 ILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 +LY+PP F H G AL + M +S+GFRAP EL+ F + + E+G Y DP++ Sbjct: 185 MLYLPPHFAHHGVALNDCMTFSIGFRAPTQLELLDAFMHSLSEHEVGQQRYRDPELKVCD 244 Query: 238 HPADVLPQEMDKLREMMLELINQPEH-FKQWFGEFISQSRHELDIAPPEPPYQPDEI--Y 294 + + + ++ +++ I + G +++++ L++ E D + Sbjct: 245 DDKYIDRSALRRFKQSLIKCIEDSDDVLLDAVGRLLTETKPSLELLADELIADSDNVSLA 304 Query: 295 DALKQGEVLVRLGGLRVLRIGD----DVYANGEKI--DSPHRPALDALASNIALTAENFG 348 + QGE L R +R+ + ++A GE D R + L + A ++ Sbjct: 305 EYFSQGEQLHRNPYIRIAWAENEESVQLFAAGETYQADKAVRSIMPILTGTEPIQALHWT 364 Query: 349 DALEDPSFLAMLAALVNSGYWFFEG 373 ++ + +L LV G W+++ Sbjct: 365 Q-IQSAAATNLLEELVAIGCWYWQS 388 >UniRef50_UPI0000E0F5AA putative enzyme with RmlC-like domain n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F5AA Length = 381 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. 
Identities = 143/381 (37%), Positives = 211/381 (55%), Gaps = 21/381 (5%) Query: 3 YQLT-LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 Y L+ + FL +WQ++P VL NF DP+ +LAGLA E ++DSR++S DG W Sbjct: 2 YTLSAFSIKHFLAENWQRKPCVLHNALPNFEDPLDEHDLAGLAQEQDIDSRVISQMDGDW 61 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 +V+ GPF ++ + + W+LLVQ V+ E + LM F +P WR+DDL++S+S PG G Sbjct: 62 KVTEGPFTEFEDVCKGAWTLLVQGVDTHIESASLLMNAFNFIPHWRMDDLLVSYSQPGAG 121 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VG H+DQYDVFI+QG G RRW+VG+K ++ ++ PHP L Q+D FE IID EL PGDILY Sbjct: 122 VGAHIDQYDVFIVQGKGTRRWQVGDKSMKYAKYYPHPKLQQIDEFEPIIDVELLPGDILY 181 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IPPGFPH+G ++ MNYSVGFRAP+ EL AD +L + + D + P+ Sbjct: 182 IPPGFPHKGQSITECMNYSVGFRAPDQTELFQAIADDLLDSDKLTRRFIDRNRTYIDRPS 241 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 + P+++ L++++ + + + Q + +S+ LD P E P I + QG Sbjct: 242 AISPKDIMLLKQLLQQYVTNTQ-IDQVLTQHLSKQNEYLDAFPLETPLPSSYILALINQG 300 Query: 301 EVLVRLGGLRVLRIGDD------VYANGEKIDSPHRP------ALDALASNIALTAENFG 348 L G+R + + + NG K + LD + I AE Sbjct: 301 VTLQLACGVRPVYLDYQVDDEFIFFINGHKFSTSATARLETSRLLDNHQTFIKFNAELTH 360 Query: 349 DALEDPSFLAMLAALVNSGYW 369 D +E ++ L+N GY Sbjct: 361 DWIE------LIRELINLGYL 375 >UniRef50_C3M8B3 Putative uncharacterized protein n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8B3_HAMD5 Length = 367 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. 
Identities = 146/370 (39%), Positives = 215/370 (58%), Gaps = 6/370 (1%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 L +NW DFL HWQKRPV+LK+ ++F++PISP+EL L ++ ++ +L+ GK Q+ Sbjct: 2 HLIINWEDFLHHHWQKRPVLLKQSISDFVNPISPEELETLVIKKALECQLIQRSHGKCQL 61 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 + F Y LG+ NWSL V+A++H H + FR PDW ++L FSVPGGG+G Sbjct: 62 GYQAFNGYGSLGQRNWSLRVEALHHCHRAAEEFLSLFRIFPDWYTEELTTFFSVPGGGIG 121 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 P DV +IQG G RWRVG+ + P D Q D EA++DEEL GD+LYIP Sbjct: 122 PQTRPSDVLVIQGMGSSRWRVGD----RGASPAFDYGQNDFSEAMVDEELSAGDMLYIPK 177 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS-DPDVPPRAHPADV 242 FPHE + E AM+Y + F N+ +I + + + G Y+ PD+ R P ++ Sbjct: 178 VFPHEATSTEAAMSYCLNFWTDNSLRMIRNWTESLSDENHRGIEYAPSPDLLLRDDPTEI 237 Query: 243 LPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 LPQ++ L+EMM + L+ + EH + WF + +SQ+ +EL AP Y ++ L++G Sbjct: 238 LPQDITALQEMMSQFLLKKREHLENWFAQEMSQTSYELPKAPAAKVYSVSQVQTLLQKGS 297 Query: 302 VLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 L RL GLR+L IG+ + NGE +DS H A + LA + + + + FLA L Sbjct: 298 RLNRLMGLRMLHIGNRYFVNGESLDSDHADAWNVLARHRTIEGPMLIKFINEADFLAELT 357 Query: 362 ALVNSGYWFF 371 ++N GYW+F Sbjct: 358 LIINKGYWYF 367 >UniRef50_B8GSM7 Cupin 4 family protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GSM7_THISH Length = 397 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. 
Identities = 121/375 (32%), Positives = 196/375 (52%), Gaps = 14/375 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--GKWQVS 64 L FL +WQ++P+++++ F P+SP+ELAGLA E V SRLV + G W + Sbjct: 15 LTARAFLRDYWQQKPLLVRQAIPGFESPLSPEELAGLACEEGVISRLVRERGETGSWALR 74 Query: 65 HGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF+ + L E++W+LLV + A + PFR +PDWR+DDLM+S++ P G V Sbjct: 75 TGPFDEDDFTTLPESHWTLLVSDMEKHLPELRAYLEPFRFIPDWRMDDLMVSYAAPEGSV 134 Query: 123 GPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH+D+YDVF++Q GRRRW++ + + P +L + F+ + LEPGD+LY+ Sbjct: 135 GPHVDEYDVFLLQAQGRRRWQIARQAVSGDDFLPGVELRILRDFQPDQEWILEPGDMLYL 194 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP PH G A+ M +SVGFRAP R+L++ + D + + Y+DP + P+ +P + Sbjct: 195 PPRIPHHGVAVGPCMTWSVGFRAPAWRDLMAAWVDQRYEALAPQDRYADPGLEPQDNPGE 254 Query: 242 VLPQEMDKLREMMLELIN-QPEHFKQWFGEFISQSRHE-LDIAPPEPPYQPDEIYDALKQ 299 + + +L + + +W G +++ + E L+ DE L+ Sbjct: 255 LSAAALARLIAGLRRAMAVDDAELARWLGTVLTEPKAELLEHMQLPETLTRDEALGLLQD 314 Query: 300 GEVLVRLGGLRVLRIGD----DVYANGEKIDSPHR--PALDALASNIALTAENF-GDALE 352 G L R G R+ + D ++ NG++ P P + L + A + G A Sbjct: 315 GVSLERHGAARLAWMSDHGGLRLFVNGQEHLLPEAAGPLVRHLCAETAYDGKALWGLASG 374 Query: 353 DPSFLAMLAALVNSG 367 S +L +L +G Sbjct: 375 IDSAEDLLMSLCIAG 389 >UniRef50_A1K4G1 Putative uncharacterized protein n=1 Tax=Azoarcus sp. BH72 RepID=A1K4G1_AZOSB Length = 371 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. 
Identities = 111/366 (30%), Positives = 191/366 (52%), Gaps = 11/366 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 + FL+ +WQK+P+++++ F + +++ LA + +V+SR + +G W+++ G Sbjct: 9 MTPRQFLQEYWQKKPLLVRQAVPGFTGVLGREDIFDLACDPDVESRHIRLHEGNWELNRG 68 Query: 67 PFESYDHLGET-NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 P G+ W++LVQ +N W E L+ F +P R+DDLM+S++V GGGVGPH Sbjct: 69 PQTRARLRGKRSPWTVLVQGINLWSEAADELLHRFNFIPQARLDDLMVSYAVDGGGVGPH 128 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 D YDVF++QG G+RRW++ ++ + L + F D LEPGD+LY+PP + Sbjct: 129 FDNYDVFLLQGQGQRRWQIADQDD-RSLVEGAPLRILRNFVPAHDWILEPGDMLYLPPHW 187 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 H G A+ YS+GFR+P +EL + F ++ +R YSDPD+ + + A + Sbjct: 188 AHNGIAIGECTTYSIGFRSPTAQELGAEFLGWLQERVCLDGLYSDPDLTEQDNSALIGDA 247 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 +D+++ ++ + + G ++++ + + PPE P AL G + + Sbjct: 248 MIDQVQRVIEGIRWSRADVAAFLGHYLTEPKPTVFFEPPEEPIPLKAFRRALGAGGLRLD 307 Query: 306 LGGLRVLRIGDDVYANGEKIDS--PHRPALDALASNIALTA-ENFGDALEDPSFLAMLAA 362 L +LR + + NGE +DS + ALD LA LT + AL+D + Sbjct: 308 ARTL-LLRSQGNFFLNGEAVDSVPAWQQALDTLAHARRLTGCADLPAALQD-----LFYE 361 Query: 363 LVNSGY 368 G+ Sbjct: 362 WYCDGF 367 >UniRef50_D1RFR4 Cupin superfamily protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RFR4_LEGLO Length = 396 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. 
Identities = 121/387 (31%), Positives = 198/387 (51%), Gaps = 22/387 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK---WQV 63 ++ FL +WQK+P+V+++ F P+SPDELAGLA+E +V+SRLV + W + Sbjct: 7 ISLNTFLGDYWQKKPLVIRKALPEFTHPLSPDELAGLALEEDVESRLVFETPDEKPYWHL 66 Query: 64 SHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF + L T+W+LLVQ V+ AL+ F +P WRIDD+MIS++V G Sbjct: 67 KRGPFSVNDFSTLPSTHWTLLVQGVDRLIPEVYALLDYFNFIPQWRIDDIMISYAVLHGS 126 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF+ Q G+R W + K + + +L + F+ LE GD+LY Sbjct: 127 VGPHYDNYDVFLYQAKGKREWSLTTKGCNNQNYMKGLELRIMSQFDVEERFILEEGDMLY 186 Query: 181 IPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +PP H G +L + M YS G+R+ +EL+ F+DY+ ++ L N Y DPD + Sbjct: 187 LPPHVGHHGISLSDECMTYSFGYRSYQGQELLESFSDYLSEKGLFKNLYQDPDWSNLQNT 246 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYD---- 295 +++ P ++++ ++IN + + WFG F ++ + ++ P P + DE+ D Sbjct: 247 SEIPPSAWLNAQKLLQQVINDEKTMQTWFGCFATRLDQQAELQLP-VPLEEDELIDISDF 305 Query: 296 --ALKQGEVLVRLGGLRVLRIGDD------VYANGEKIDS--PHRPALDALASNIALTAE 345 +K+G L+R R + + NG D+ ++ L +A+N L+ + Sbjct: 306 IKEIKEGLNLIRDASCRFAYQNQNEQSEYQFFINGSAWDAKGVNKDLLHYIANNRYLSYK 365 Query: 346 NFGDALEDPSFLAMLAALVNSGYWFFE 372 L ++ L + FE Sbjct: 366 VLTTYLNTKKNQLLIYNLWKLQWLQFE 392 >UniRef50_Q5QZ10 Cupin superfamily protein n=2 Tax=Idiomarina RepID=Q5QZ10_IDILO Length = 380 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. 
Identities = 153/382 (40%), Positives = 216/382 (56%), Gaps = 18/382 (4%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH---QDGK 60 +L + DFL +WQK+P ++++GF +F DP+SP+ LAGLAME DSR++ + Sbjct: 2 KLVFDKDDFLTNYWQKKPCLIRQGFADFSDPVSPEILAGLAMEEGADSRVIESKADTESG 61 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 W V+HGPFE Y+ GET+W+LLVQ+VN W L+ PFR LPDWRIDD+M+SFS G Sbjct: 62 WLVTHGPFEDYEKFGETDWTLLVQSVNEWLPDVGELITPFRFLPDWRIDDVMVSFSCENG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD-PFEAIIDEELEPGDIL 179 GVGPHLDQYDVFIIQG G R WRVGEK M+++ P DL Q+ F A+I+E L GD+L Sbjct: 122 GVGPHLDQYDVFIIQGAGSRHWRVGEKQAMQEYQPAEDLCQIKGEFNAVINEHLTAGDVL 181 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIP G PH+G +LE ++NYSVGFRAP+ EL+ D +Q++ Y DP + Sbjct: 182 YIPAGCPHDGISLEPSLNYSVGFRAPSKAELLLQLGDIAMQQKSLQERYQDPALSSEDVS 241 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 + ++ L++ + + + E + ISQS+ L PE P ++I L Q Sbjct: 242 WVIEKTQLSALKQFLKDALESDET-DALLAKIISQSKRPLP--EPELPTIAEQIPTLLAQ 298 Query: 300 GEVLV-RLGGLRVLRIGD-DVYANGEKI-----DSPHRPALDALASNIALTAENFGDALE 352 + + G R L++ D Y NGE P L L + A E + Sbjct: 299 QNAFIEKTSGARFLKLSDTQFYGNGEAFHVIQEALPTAEWLAQLQGSEAT--EELAKLAK 356 Query: 353 DPSFLAMLAALVNSG--YWFFE 372 + ++A +N G Y + + Sbjct: 357 SVAGCELIAETINQGIIYLYID 378 >UniRef50_D0L0L5 Cupin 4 family protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0L5_HALNC Length = 397 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. 
Identities = 125/378 (33%), Positives = 193/378 (51%), Gaps = 16/378 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG--KWQV 63 TL+ DFL +WQK+PV++++G F P+SP+ELAGLA E +V +RL+ G W + Sbjct: 9 TLSVADFLRDYWQKKPVLIRQGVPGFESPLSPEELAGLACEEDVPARLILESAGARPWTL 68 Query: 64 SHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 HGPF + L E +SLL+ L++ FR +PDWRIDDLMIS++ PGG Sbjct: 69 RHGPFTEADFTSLPEDGYSLLITDCEKLIPDLMNLVQHFRFVPDWRIDDLMISYAPPGGS 128 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 VG H+D+YDVF++QG GRR+W + + P D+ + FE + LEPGD+LY+ Sbjct: 129 VGAHIDEYDVFLLQGMGRRKWMIEYPPKHSDFVPDLDIRLLQEFEPTEEWVLEPGDMLYL 188 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PPG PH G A+++ M YS+GFRAP E+ +G D ++ Y DPD+ A+P Sbjct: 189 PPGVPHHGVAVDHCMTYSIGFRAPLLHEMAAGVTDRLITDMDQAARYGDPDLQAPANPGA 248 Query: 242 VLPQEMDKLREMMLELINQPEH-FKQWFGEFISQ-SRHELDIAPPEPPYQPDEIYDALKQ 299 + KLR ++ +++Q + ++ E +++ P P + + Sbjct: 249 LDASSRVKLRAILQSVLDQDDAVLDRFIAETLTERPLDHAGFYPQNDPLDAKALRGEIAH 308 Query: 300 -GEVLVRLGGLRVLRIGDDVYANGEKIDSPHR---------PALDALASNIALTAENFGD 349 G+ L+R R+L + D+ + G + + P L S + A Sbjct: 309 SGDTLMRTPAARLLLVEDEPDSAGGALAVDGQSTLLNAEMLPLARLLVSQVFYDAAELLA 368 Query: 350 ALEDPSFLAMLAALVNSG 367 A E + +L L G Sbjct: 369 ATESEAAAELLQKLYADG 386 >UniRef50_C5BU83 Cupin 4 family protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BU83_TERTT Length = 385 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. 
Identities = 117/372 (31%), Positives = 194/372 (52%), Gaps = 17/372 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG-KWQVS 64 T+ + FL +WQK+P+++++ F +F P+S DELAGLA+E +V SRLV +D WQV Sbjct: 16 TITFEQFLNEYWQKKPLLIRQAFPDFEAPVSADELAGLALEDDVVSRLVVQRDESDWQVE 75 Query: 65 HGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGP E + L E++W+LLVQ + AL+ FR +P+WR+DD+MIS++ GGV Sbjct: 76 HGPLLEERFAQLPESHWTLLVQHADALDPAINALLDAFRFIPNWRLDDIMISYAADKGGV 135 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH D YDVF++Q G+RRWR+G++ + P D+ + F+ + D +EPGD+LYI Sbjct: 136 GPHFDYYDVFLLQAQGKRRWRIGQRCSHESPLLPAADMKILQDFDTVEDWIVEPGDLLYI 195 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP H G A M YS+GFRAP+ E++ F++ + Y DP + P+ P + Sbjct: 196 PPNIAHWGEADGECMTYSIGFRAPSHAEVLLDFSEEMASFTNPDMRYMDPGLRPQQLPGE 255 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + Q +++++ ++ + WFGE++++ D +E L + Sbjct: 256 ISQQSIEQVQAIIHQYSTDKAALAGWFGEYMTRPNPTADAHFQ---TFDEEFDRNLMEAG 312 Query: 302 VLVRLGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFL 357 R + V+ NG K R L++ + ++ D Sbjct: 313 QARLSRFARCAFFEEQAGCLVFINGAKWHC-SRKLAVMLSNYEPIHWDSL-----DTLDR 366 Query: 358 AMLAALVNSGYW 369 ++ + ++G+ Sbjct: 367 TVVVQIADAGFL 378 >UniRef50_A6W0E5 Cupin 4 family protein n=2 Tax=Marinomonas RepID=A6W0E5_MARMS Length = 400 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. 
Identities = 123/378 (32%), Positives = 208/378 (55%), Gaps = 20/378 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-GKWQVSH 65 + F++ +WQK+P++++ G NF P+ DELAG+AME E++SR+V W++ Sbjct: 14 MTAQTFIDEYWQKKPLLIRGGLVNFTLPLEADELAGMAMEEEIESRIVIENGLRPWEMRQ 73 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF +++ L E W+LLVQAV+HW L F LP WR+DD+M+S++ GG VG Sbjct: 74 GPFTEDTFATLPEKEWTLLVQAVDHWVPEVQTLKEKFEFLPSWRLDDVMVSYATEGGSVG 133 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPF--EAIIDEELEPGDILY 180 PH DQYDVF++Q +G+RRW+V + + P+ L +D F +D EL+ GDILY Sbjct: 134 PHYDQYDVFLVQVSGKRRWQVLSPDEYQDSAIPNIKLHILDNFPVNPEMDWELDAGDILY 193 Query: 181 IPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +PP F H G +L++ M YS+GFRAP+ +++++G D + + E + ++ P+ R H Sbjct: 194 LPPNFAHNGRSLDDECMTYSIGFRAPSMQDILTGVRDKLCETENVKDRFAAPETANRQHS 253 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 A + ++ L+ + LINQP+ +W GE +S+S++ +AP + +E + + Q Sbjct: 254 AHISKDDIQYLQTQLARLINQPDLLAEWLGETMSESKYPEYLAPLNHE-EVNEAFSSATQ 312 Query: 300 GEVLVRLGGLRVLRIGD---------DVYANGEKI--DSPHRPALDALASNIALTAENFG 348 G+ +R G R+ +V+ NGE + D ++A+ + Sbjct: 313 GQTFIRPGDARICYYIQQSTENNGKINVFCNGEHLLVDEELTSFVEAVCHQVEFDFSGL- 371 Query: 349 DALEDPSFLAMLAALVNS 366 D ++ ++ + Sbjct: 372 DLKQNTDLEPLVRFFIRQ 389 >UniRef50_A6GQ27 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GQ27_9BURK Length = 383 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. 
Identities = 104/372 (27%), Positives = 182/372 (48%), Gaps = 12/372 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ F+ HW +P + ++ F NF D +A +A + +++SRL+ H W + H Sbjct: 10 NLSVEKFMTEHWHIKPYLFRQAFPNFEPLCDFDTIAEMASDEDIESRLIQHSKTGWTLEH 69 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 GPF+ + + W++L+Q ++H L++ FR +PD R+DD+M+S + GGGVGPH Sbjct: 70 GPFDELPSMKKKAWTVLIQGIDHHLPEAYDLLQLFRFIPDARLDDVMLSLASDGGGVGPH 129 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 D YDVF++Q G+RRW++G L K+ L + FE + LEPGD+LY+PP + Sbjct: 130 YDSYDVFLLQMHGKRRWKIG-PLLDKELEEGLPLKILKNFEPTEEFVLEPGDMLYLPPNY 188 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGG-----NYYSDPDVPPRAHPA 240 H+G A + S+GFRAP E++SG + + +SDP + +PA Sbjct: 189 GHDGIAEGSCSTLSIGFRAPTQAEVLSGILRDMADQIDQDPTKTQTLFSDPARGLQKNPA 248 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ ++ ++ + Q ++ G +++ + + + EI L G Sbjct: 249 EIPDDLLNFGINLIQQFSAQSPQIQRSMGILLTEPKSHVYFVNNTEDQEIHEIISVL--G 306 Query: 301 EVLVRLGG-LRVLRIGDDVYANGEKIDSPHR---PALDALASNIALTAENFGDALEDPSF 356 E + L ++L Y NG+ ++ L LA+ + + +AL +P F Sbjct: 307 ERGIALSMKTKMLFKDAVFYINGDAVNPTSALTVKQLQMLANQREMEPIDAAEALNNPEF 366 Query: 357 LAMLAALVNSGY 368 L +G+ Sbjct: 367 QYFLVGFAKAGW 378 >UniRef50_B2SQ70 Transcription factor jumonji, JmjC n=19 Tax=Xanthomonadaceae RepID=B2SQ70_XANOP Length = 498 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. 
Identities = 122/379 (32%), Positives = 190/379 (50%), Gaps = 18/379 (4%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--GKW 61 L + FL +W K P++++ F +F P+ P++LAGLA E V +RL+SH W Sbjct: 20 PLGMPVERFLRNYWHKHPLLIRNAFADFASPLQPEDLAGLACEDGVLARLISHDRATDSW 79 Query: 62 QVSHGPFESYDH--LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 V GPF+ D L + +W+LLVQ V+ W AL+ FR LP WRIDD+MISF+ G Sbjct: 80 DVRSGPFQETDFPGLPDHDWTLLVQDVDKWDADVRALLEQFRFLPRWRIDDIMISFAATG 139 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G VG H+D YDVF++QG G RRW++ G K +L + F+ L Sbjct: 140 GSVGAHVDHYDVFLLQGQGHRRWQIDARTAQGSKATPLAFREDVELKLLRTFKPTHHWVL 199 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 PGD+LY+PP PH G A + + +S+G RAP++ ELI + D ++ Y D D+ Sbjct: 200 GPGDMLYLPPLIPHHGVAEDACLTFSIGTRAPSSAELIGDYLDTLIADADEAVRYHDEDL 259 Query: 234 PPRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 A P ++ M+++ + L +N P+ WFG F++ R D+ P P + Sbjct: 260 KVPADPYEIDVTAMNRVVAALNALRMNDPDRLGDWFGRFMTTYRACGDVVPAPAPIPREA 319 Query: 293 IYDALKQGEVLVRLGGLRVLR----IGDDVYANGEKIDSPHRPALDALASNIALTAENFG 348 + AL++G +L R R+ G ++ +G + + A LA+ + + Sbjct: 320 VEQALEEGVLLHRHPWSRLAWRRAKRGATLFCSGLEFALSAKDASR-LAAAEKIDGTLYA 378 Query: 349 DALEDPSFLAMLAALVNSG 367 P ++ L+ G Sbjct: 379 QL--SPRGRDVVLELLAQG 395 >UniRef50_B7RUZ0 Cupin superfamily protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RUZ0_9GAMM Length = 377 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. 
Identities = 130/378 (34%), Positives = 194/378 (51%), Gaps = 13/378 (3%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 +++L L+ FL +HWQK P++++ NF PIS DELAGLA E EV++R+V HQ+ W Sbjct: 4 DWELNLDKEQFLAQHWQKAPLLIRGAIKNFKPPISSDELAGLAYEEEVEARIVEHQEDNW 63 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 Q+ HGPF + D+ + W+LLVQAV+ + A L + +P WR+DD+M S++ GG Sbjct: 64 QLFHGPFSATDYQRKHPWTLLVQAVDQYIPEVAQLRKLVDFIPQWRVDDVMASYASDGGS 123 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF++QG G R W+ G+ H L + F + LEPGDILY Sbjct: 124 VGPHFDNYDVFLLQGEGHRLWKTGQFCDSSSPLVDHDSLRLLSQFNTEAEYLLEPGDILY 183 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 +PPG H G A +S+GFRAP E++S F D ++++ +YSD + P Sbjct: 184 VPPGIAHWGTAQGECTTFSIGFRAPRITEMVSRFTDALIEQLDPDLFYSDARIEVATRPG 243 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ P+++D++ + ++Q E WFGE ++ R+E P E E L G Sbjct: 244 EIRPRDLDRVSAQIQAALDQSEG-NHWFGELATEPRYE--QFPDEGELS--EARSQLSDG 298 Query: 301 EV-LVRLGGLRVLRIGDD----VYANGE--KIDSPHRPALDALASNIALTAENFGDALED 353 + ++ + V+ANG+ P AL+ L A D Sbjct: 299 ANGIELNSAAKLAWQHEAGRVVVFANGDSRSFSESIMPLQIALSDAWKLDKAELAAASAD 358 Query: 354 PSFLAMLAALVNSGYWFF 371 P L L+ SG F Sbjct: 359 PESSGWLDYLLESGCVFI 376 >UniRef50_Q5WVF0 Putative uncharacterized protein n=4 Tax=Legionella pneumophila RepID=Q5WVF0_LEGPL Length = 395 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. 
Identities = 118/383 (30%), Positives = 186/383 (48%), Gaps = 20/383 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH---QDGKWQV 63 + FL+ +WQK+P+++++ +F +P++PDELAGLA+E E++SRLV Q +W + Sbjct: 7 MTVQTFLKDYWQKKPLIIRQALPDFTNPLTPDELAGLALEEEIESRLVYETPDQSPQWNL 66 Query: 64 SHGPFESYD--HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF+ D L +T+W+LLVQ V+ L+ F +P WR+DD+MIS++ G Sbjct: 67 KRGPFKESDLIGLPKTHWTLLVQGVDRIVPDVYELLDHFNFIPQWRVDDVMISYATLHGS 126 Query: 122 VGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF+ Q G+R W + +K +L ++ FE LE GD+LY Sbjct: 127 VGPHYDNYDVFLYQAKGQRLWSLTSKKCHTNNFIKGLELRIMNEFEVEEQFILEEGDMLY 186 Query: 181 IPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +PP H G A E M YS G+R+ +EL DY+ + L + Y DPD + Sbjct: 187 LPPHIGHYGIAQSEECMTYSFGYRSYQGQELWDSLGDYLSEHGLFKSLYQDPDWSTLKNT 246 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP-----DEIY 294 +++ P+ R+++ +++ + WFG F + + P P + DE Sbjct: 247 SEITPKAWSNARQLLRQVLENDQLMHSWFGCFATSLDQSAEQYLPPPLEEDELLGLDEFI 306 Query: 295 DALKQGEVLVRLGGLRVLRIGDD------VYANGEKIDS--PHRPALDALASNIALTAEN 346 L + +VR R I D Y NG++ DS L +A+N L + Sbjct: 307 KELSNYQEIVRDASCRFAYIMSDQESQCHFYVNGKEWDSRGVSTNLLSFVANNRFLPLKE 366 Query: 347 FGDALEDPSFLAMLAALVNSGYW 369 L + L L + Sbjct: 367 LKPYLNHKTNQLFLYELWKLQWL 389 >UniRef50_Q21K45 Cupin 4 n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21K45_SACD2 Length = 383 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. 
Identities = 111/366 (30%), Positives = 189/366 (51%), Gaps = 15/366 (4%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ------DGKWQ 62 FL +WQK+P+++++ NF P+S DELAGL +E +V SRL++ + +W Sbjct: 16 IETFLRDYWQKKPLLIRQALPNFESPLSADELAGLCLEDDVISRLITETPQSSPFNSEWN 75 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 V+HGP + ++ L E WSLLVQ V+ L+ FR +P+WR+DD+MIS++ G Sbjct: 76 VTHGPLPEDIFETLPENYWSLLVQHVDQLSPEVNQLLNLFRFIPNWRLDDVMISYAPDKG 135 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH-CPHPDLLQVDPFEAIIDEELEPGDIL 179 GVGPH D YDVF++QG G+RRWR+G++ K + + + F+ D L PGDIL Sbjct: 136 GVGPHFDYYDVFLLQGHGQRRWRLGQQCTSKSPMLANAPMKVLTEFDVQEDWVLNPGDIL 195 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 Y+PPG H G A+ ++ YSVGFRAP+ ++++ F+ V + N Y D + + Sbjct: 196 YVPPGLAHWGTAVGESITYSVGFRAPSHQDIVLDFSQEVASKIEEDNRYQDQFLTANKNA 255 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 ++ +++L+ ++ + + QW G+ ++Q + + E +P+E+ + Sbjct: 256 GEITGDAIEQLKHILQTYMQDEQALAQWLGKSMTQLNPGM-VDEAENTIEPEEMANTPFT 314 Query: 300 GEVLVRLGGLRV-LRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGD---ALEDPS 355 R R ++ NGE +AL++ + + + + + D Sbjct: 315 LSPFARATFYRADSNSEAYIFINGEVYSG-SLELANALSNYLPIDWLSCSETDKLILDTL 373 Query: 356 FLAMLA 361 L Sbjct: 374 AQQYLL 379 >UniRef50_Q31GJ6 Cupin superfamily protein n=2 Tax=Gammaproteobacteria RepID=Q31GJ6_THICR Length = 401 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. 
Identities = 108/377 (28%), Positives = 179/377 (47%), Gaps = 18/377 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRL-VSHQDGKWQVS 64 +++ FL +WQK+P++++ +F P+S +ELAGL++E EV+SR+ + H +++ Sbjct: 18 SIDKETFLSEYWQKKPLLIRNALPDFSPPVSAEELAGLSLEEEVESRIVIQHSAEDYELK 77 Query: 65 HGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF+ Y+ L E NW+LLVQ ++ L+ F +P WRIDD+M+S++ GG V Sbjct: 78 KGPFKESLYETLPEKNWTLLVQGMDRLLPEVTELLNEFDFIPSWRIDDIMVSYATEGGNV 137 Query: 123 GPHLDQYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH D YDVF++Q G RRW++ + + DL + F + +PGDILY+ Sbjct: 138 GPHFDHYDVFLLQAQGERRWQLSAQDCDETNYIEGVDLRIMKRFVVEEEYVCQPGDILYV 197 Query: 182 PPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 PP + H G L ++ M +S+G+R EL F DY+ + + + Y DP+ A P Sbjct: 198 PPKWGHHGVGLTDDCMTFSIGYRTYRGLELWDSFGDYLAETQQFQSLYQDPNWKGTA-PG 256 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP-----PYQPDEIYD 295 + + + ++ + E K WFG F +Q P+P + + Sbjct: 257 QISEGSWQQAQSLLKAALENEEALKNWFGRFATQLDQGASQLLPDPLSDTESCPLETFIE 316 Query: 296 ALKQGEVLVRLGGLRVLRIG-----DDVYANGEKID--SPHRPALDALASNIALTAENFG 348 AL+ E ++R R +Y N + + L + + Sbjct: 317 ALQSAEGVLRDSVCRFAYAEFTQNQVKLYINSAEWQDFKAESDFIRCLCNQRFIDQATLT 376 Query: 349 DALEDPSFLAMLAALVN 365 L A+L L N Sbjct: 377 SYLNHAGNQALLYDLWN 393 >UniRef50_Q7NS46 Putative uncharacterized protein n=1 Tax=Chromobacterium violaceum RepID=Q7NS46_CHRVO Length = 377 Score = 354 bits (910), Expect = 2e-96, Method: Composition-based stats. 
Identities = 109/369 (29%), Positives = 180/369 (48%), Gaps = 13/369 (3%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF- 68 FL +W K+P++++ + + L+ LA + +SRL+ ++ KW + GPF Sbjct: 13 EQFLAEYWHKKPLLIRGALTDVGPHVDFSVLSELAQRDDAESRLIEYKKDKWHLERGPFR 72 Query: 69 -ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 + L ET+W+LLVQ VNH ++ F +P R+DDLMIS++ PGG VGPH D Sbjct: 73 ASRFRRLAETDWTLLVQGVNHHLPHIDDILWRFNFIPYARLDDLMISYAPPGGTVGPHFD 132 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 YDVF++Q G++RW++ + + + F + LE GD+LY+PP H Sbjct: 133 AYDVFLLQVGGKKRWQISSQ-HDDDFIEDAPIRVLKDFRMEQEFVLEHGDMLYLPPHCAH 191 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 G ALE M YS+GFRAP +EL + F Y+ R Y+DPD+ +A PA + + + Sbjct: 192 YGVALEPGMTYSIGFRAPPAQELAAQFLVYLQDRVCIDGVYADPDLKLQADPAKIGGEMI 251 Query: 248 DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQ-PDEIYDALKQGEVLVRL 306 D++ ++ ++ + + G ++++ + + PE ++G L R Sbjct: 252 DQVAGLLSKIRWDKDTVCDFLGHYLTEPKAHVFYDSPEDELDEEAFAEAVAERGLELDRK 311 Query: 307 GGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAENFGDALEDPSFLAMLAALV 364 ++L VY NGEK+D+ + L A ++ + D + L Sbjct: 312 --SQILYCDACVYCNGEKVDAADGDFADWQHFGNRRRLPAGSYSADMIDALYDGYL---- 365 Query: 365 NSGYWFFEG 373 SGYW Sbjct: 366 -SGYWHLSS 373 >UniRef50_A4BDP0 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BDP0_9GAMM Length = 381 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. 
Identities = 115/373 (30%), Positives = 190/373 (50%), Gaps = 16/373 (4%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG--KWQVS 64 N +FL +WQ+ P++ + + D I+ DELAGLA E+EV+SRL+S + +W + Sbjct: 17 FNSQEFLNTYWQQAPLLKRNAL-SLHDIITADELAGLATEAEVESRLISGSNETEQWTLQ 75 Query: 65 HGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGPF + L E +W+LLVQAV+HW ++ F LP WRIDD+MISF+ GGGV Sbjct: 76 HGPFSDDVFQTLPERDWTLLVQAVDHWVPEVRQVLAQFSFLPRWRIDDIMISFATDGGGV 135 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH DQYDVF++Q G+R W++G+ + + + FE L+PGD+LY+ Sbjct: 136 GPHFDQYDVFLVQLAGQREWKIGQMCDEDSDLVENIPVKVLSAFEEQDAWVLDPGDVLYL 195 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH-PA 240 PPG H G +L ++M SVGFRAP+ E I+ ++ Y D + R P Sbjct: 196 PPGVAHWGTSLGDSMTLSVGFRAPSDSETIAELGHFMSSMVSDFQRYGDAGISQRNQTPH 255 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 + +++D+++ ++ L + +WFG+++++ +++ D+ + + + Sbjct: 256 AIEEEDIDRVQAIIKRLADDRSLVSEWFGQYVTEPKYD-DMNVDTQDWSDESFMKHWQH- 313 Query: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRP-ALDALASNIALTAENFGDALEDPSFLAM 359 L R G R+ ++ +G+ P L + L + + Sbjct: 314 HPLYRNPGSRLAYREQTLFVDGQSYGVNATPEELHLICDCDVL------PYNHNVHIQRI 367 Query: 360 LAALVNSGYWFFE 372 L+N+G FE Sbjct: 368 ALQLLNAGALIFE 380 >UniRef50_A1VLH8 Cupin 4 family protein n=6 Tax=Burkholderiales RepID=A1VLH8_POLNA Length = 413 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. 
Identities = 121/393 (30%), Positives = 183/393 (46%), Gaps = 38/393 (9%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 L F+ RHWQK+P+++++ F +S L LA +V+SRL+ Q W + G Sbjct: 13 LTPAQFMRRHWQKKPLLVRQAIAGFEPFLSRAALFKLAAREQVESRLIVQQAKGWGMKKG 72 Query: 67 PF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 PF +S L + W+LLVQ V+ AL++ FR +PD R+DDLMISF+ PGGGVGP Sbjct: 73 PFASKSLPPLSQEGWTLLVQGVDLHEPAGHALLQQFRFVPDARLDDLMISFATPGGGVGP 132 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 H D YDVF+ Q +GRRRW++G + P L + FE + LE GD+LY+PP Sbjct: 133 HFDSYDVFLFQASGRRRWKIGLQKDF-TLQPDVPLKILQNFEVDEEFVLEAGDMLYLPPR 191 Query: 185 FPHEGYAL---------ENAMNYSVGFRAPNTRELISGFADYVLQR-------------- 221 + H+G A + M YS+GFR+P EL S + + Sbjct: 192 YAHDGIAEASVGTNGKPADCMTYSIGFRSPARTELASELLHRLAEMGEDAAEEACAAEAG 251 Query: 222 ---ELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 Y DP P PA + D + +LE + P GE++++ + Sbjct: 252 RKPARAQPMYRDPTQPATETPAAMPAGLADFAGQAVLEALKDPLALACALGEYMTEPKPG 311 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGG-LRVLRIGDDVYANGEKIDS--PHRPALDA 335 + PE + DA K G++ + L R++ D ++ NGE + + Sbjct: 312 VWFDEPEQAWD----GDAAKAGQMAIALDARTRMMYDSDHIFINGESYRAKGADASLMHR 367 Query: 336 LASNIALTAENFGDALEDPSFLAMLAALVNSGY 368 LA+ L A A S + +L +G+ Sbjct: 368 LANQRCLLASELRKAG--ASAIELLGDWHEAGW 398 >UniRef50_B7H3P1 Cupin superfamily protein n=16 Tax=Acinetobacter RepID=B7H3P1_ACIB3 Length = 387 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. 
Identities = 103/376 (27%), Positives = 180/376 (47%), Gaps = 15/376 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD---GKWQV 63 + FL +WQK+P++++ + + P+++ LA+E V +RL+ +D +W V Sbjct: 11 ITAEQFLTEYWQKKPLLVRNAMPEIVGMLEPNDVKELALEDHVTARLIRQKDKNPNEWHV 70 Query: 64 SHGPFESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 P D W+LLVQAV+H+ A L + F +P WR DD+M+S++ GG V Sbjct: 71 KSSPLTKGDFQKLPKLWTLLVQAVDHYSFDIAELWKKFPFIPQWRRDDIMVSYAPKGGSV 130 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 G H D YDVF++QG G RRW++G+ + P+ L + + DE L PGD+LY+ Sbjct: 131 GKHFDFYDVFLVQGYGHRRWQLGQMCDASTEFVPNQPLKLLPEIDVHFDEVLAPGDLLYV 190 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PPG H G A ++ + +S GFR PN +I +D EL N D + Sbjct: 191 PPGLSHYGVAEDDCLTFSFGFRMPNISGMIDRISDQFATDELLQNPVVDITRKNPPQIGE 250 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + +E+ LR+++L + ++S+ ++ +I P+ + +++ L +G Sbjct: 251 INTEELAYLRDLVLAQLKNSTVLDAALMSYMSEPKYPDNIPEPDE-IEVEDLNAILSEGY 309 Query: 302 VLVRLGGLRVLRIGDD----VYANGEKIDSPH--RPALDALASNIALTAENFGDALEDPS 355 L+ R+L + + NGE++ L ++A ++ F L + Sbjct: 310 ELLLEPASRLLYTEQNGILKFWGNGEELPIVESFATQLKSIADGKSIP---FNSELNNTD 366 Query: 356 FLAMLAALVNSGYWFF 371 L + L+N+ Sbjct: 367 ILENIVQLLNNSILML 382 >UniRef50_C7I1M3 Cupin 4 family protein n=1 Tax=Thiomonas intermedia K12 RepID=C7I1M3_THIIN Length = 378 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. 
Identities = 120/369 (32%), Positives = 177/369 (47%), Gaps = 11/369 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 FL WQ++P++L++ F F +S +L LA + +V+SRL+ +WQ+ H Sbjct: 10 AFTEARFLREIWQRKPLLLRQAFPGFKPLLSRAQLFALAGQDDVESRLLQRAGRRWQLDH 69 Query: 66 GPFESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + NW+LLVQ VN + L+R FR +PD R+DDLMIS++ GGGVG Sbjct: 70 GPFSRKQLPPVEQRNWTLLVQGVNLHVDAAGDLLRQFRFIPDARLDDLMISWASEGGGVG 129 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q GRRRWR+G ++ P + + F D LE GD+LY+PP Sbjct: 130 PHQDAYDVFLLQAAGRRRWRIG-PVEDATLQPGKPVKLLAKFTPEEDLILESGDMLYLPP 188 Query: 184 GFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 G+ H+G A + M YSVGFRAP EL+ + + + GG Y DP + A PA + Sbjct: 189 GWGHDGIAASGDCMTYSVGFRAPPQGELLKEVLWQLAEAQQGGAIYRDPPLRSGASPALL 248 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 + RE L F+ G +++ + ++ E P + G Sbjct: 249 PAAMVRFAREAFSRLKPDAAMFENVLGLYLTTPKPQVWFESVETP-TATLRRACRQTGCR 307 Query: 303 LVRLGGLRVLRIGDDVYANGEKIDS--PHRPALDALASNIALTAENFGDALEDPSFLAML 360 L R ++L ++ NGE +D+ L LA L+A A Sbjct: 308 LDRR--SKMLYTTQALFLNGEAVDAALASSALLRQLADQQNLSAAQVQTASAAELAAL-- 363 Query: 361 AALVNSGYW 369 A G+ Sbjct: 364 ADWCAIGWL 372 >UniRef50_B8KRM1 Cupin 4 family protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KRM1_9GAMM Length = 365 Score = 339 bits (871), Expect = 7e-92, Method: Composition-based stats. 
Identities = 129/371 (34%), Positives = 189/371 (50%), Gaps = 19/371 (5%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 ++L+ FL+ +WQK P+V+++ +F PI D LAGLA+E +V SR+VS G W+V Sbjct: 2 TISLDTERFLKHYWQKHPLVIRQAVPDFTPPIDADHLAGLALEPDVQSRIVSCDRGHWEV 61 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 HGPF D + WSLLVQ V+ AAL R LP WR DD+MIS++ GG VG Sbjct: 62 QHGPFSEADFDRDDQWSLLVQGVDRLLPEVAALQRAVDFLPSWRFDDVMISYASEGGSVG 121 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-LLQVDPFEAIIDEELEPGDILYIP 182 PH D+YDVF++QG G R WR+G++ + D LL +D FE L+ GD LYIP Sbjct: 122 PHFDRYDVFLLQGEGEREWRIGQRCDHTTATHNYDELLLLDDFEHRETHLLQTGDALYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD--VPPRAHPA 240 PG H G A +S+GFRAP+ L + D L++ + D + V R P Sbjct: 182 PGIAHWGIARGPCTTFSLGFRAPSIAALTARLTDSALEQLMPDLLLEDRNSLVSERGRPG 241 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAP--PEPPYQPDEIYDALK 298 ++ Q+ D +R +L ++ + W GE ++++ + +P PP+ ++ + Sbjct: 242 EITTQQRDNIRSAVLSALSALDD-GVWLGELLTETEPFIGESPEGAVPPHIAMDLGSRIN 300 Query: 299 QGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLA 358 E G V+ANGE+ + R ALD L A + A D A Sbjct: 301 WMET----------PEGIAVFANGERFPA-SRQALDVLTPLCAGGSIMTSTAPIDT--RA 347 Query: 359 MLAALVNSGYW 369 +L L +G Sbjct: 348 LLEWLWAAGVL 358 >UniRef50_A0Z1Z1 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z1Z1_9GAMM Length = 364 Score = 339 bits (871), Expect = 7e-92, Method: Composition-based stats. 
Identities = 130/370 (35%), Positives = 180/370 (48%), Gaps = 24/370 (6%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 + FL+ +WQ+RP+++K N+ P+SP+EL GLA E + DSRL+S W + Sbjct: 7 TFQFDEKVFLDCYWQRRPLLIKAALPNWQSPLSPEELGGLAFEEDADSRLISKSKNGWML 66 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GP S D +W+LLV V+HW AAL + R LP WR DD+M+S++V GGVG Sbjct: 67 KQGPLVSADFQRSDDWTLLVNGVDHWVPEVAALRQCLRFLPQWRFDDVMVSYAVADGGVG 126 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D+YDVF++QGTGRR+WR+G H L + FE + LE GD+LY+P Sbjct: 127 PHFDRYDVFLVQGTGRRKWRLGGWCDENTPRIKHEGLNLLQNFETSEEYLLEAGDVLYVP 186 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSD-PDVPPRAHPAD 241 PG H G A M YS+GFRAP L++ +AD L+ D V P + Sbjct: 187 PGLAHWGVADTPCMTYSLGFRAPTVAALLARWADKTLESVDPELLLEDRASVTNPPRPGE 246 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + + RE + + + W GE +++ + APP K Sbjct: 247 ITLAHWNNAREAIRNSMEALDD-GSWLGEVVTEHG---ECAPPPS-----------KHAT 291 Query: 302 VLVRLGGLRVLR----IGDDVYANGEKIDSP--HRPALDALASNIALTAENFGDALEDP- 354 L G RV VYANGE + P P L+ L S ++ A D Sbjct: 292 ALRLHPGARVSWQALSNECSVYANGEALRIPLSSVPILERLCSGDTVSPYELTSAHPDFL 351 Query: 355 SFLAMLAALV 364 +FLAM LV Sbjct: 352 NFLAMSGVLV 361 >UniRef50_B8KGD9 Cupin 4 family protein n=2 Tax=unclassified Gammaproteobacteria RepID=B8KGD9_9GAMM Length = 370 Score = 338 bits (868), Expect = 2e-91, Method: Composition-based stats. 
Identities = 118/378 (31%), Positives = 191/378 (50%), Gaps = 18/378 (4%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 +Y+L L F+ R+WQK+ + + GF +F P DELAGLAME E+D+R+V Sbjct: 2 TDYRLDLEVKSFVARYWQKQHLFIPGGFKHFSVPADADELAGLAMEDELDARIVFRDGQH 61 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 W GPF + +W+LLVQ V+ + A L+ LP WR+DD+M+S++ GG Sbjct: 62 WHQERGPFSQESYRRSGSWTLLVQGVDQHWDEAAELLNAVSFLPSWRLDDIMMSYATDGG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDIL 179 GPH D YDVFIIQG G+RRW+VG + +L + FE+ + + GD+L Sbjct: 122 SAGPHYDNYDVFIIQGDGQRRWQVGGLCDASSALMDNTELRLLADFESQREYLMNTGDVL 181 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIPPG H G ++ + ++S+GFRAP +L++ +AD +L + DP P Sbjct: 182 YIPPGIAHYGVSVGESTSFSIGFRAPRQSDLLARWADNLLNTLEDDALFCDPGREPATRV 241 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 ++ ++ + R +L + + +WFGE I+ S + D + +Q Sbjct: 242 GEITTADLHRARAQLLRVFEDKD--PRWFGEAITNSGTTVQP-------SSDTALNLDEQ 292 Query: 300 GEVLVRLGGLRVLRIGDD----VYANGEKIDSP--HRPALDALASNIALTAENFGDALED 353 G + R G R+ D V+A+G D+P + ++AL ++ + +A + Sbjct: 293 GAWVTRAPGSRLAWHATDEELLVFAHGSTHDTPLALQSVMEALCAHEDVAVSAALEAHD- 351 Query: 354 PSFLAMLAALVNSGYWFF 371 + +L+ L + G F Sbjct: 352 -AAQGLLSWLHDEGAILF 368 >UniRef50_Q0VQ28 Putative uncharacterized protein n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VQ28_ALCBS Length = 377 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. 
Identities = 122/375 (32%), Positives = 195/375 (52%), Gaps = 18/375 (4%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGKW 61 + L L FL HWQKRP+ + + D + LAGLA+E V++R+++ +G W Sbjct: 10 FTLPLTPAAFLREHWQKRPLFMPGAASGL-DQPDANTLAGLALEESVEARVITGAGNGPW 68 Query: 62 QVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 V P + ++ LGE NW+LLVQ+V+H+ T+ L+ F LP+WR++D+MIS++ G Sbjct: 69 SVLQSPLDDNVFEALGEKNWTLLVQSVDHFLTETSLLLDDFAFLPNWRVEDIMISYAAKG 128 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-LLQVDPFEAIIDEELEPGDI 178 G VGPH D+YDVF+IQ +G RRW++G+ D L + + +PGD+ Sbjct: 129 GSVGPHFDRYDVFLIQASGSRRWQIGDVCDESSPRQATDELKLLAQMPVREEFIAQPGDV 188 Query: 179 LYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 LY+PPG H G A + + + +SVGFRAP+ + L++ A L E ++DPD Sbjct: 189 LYLPPGVAHHGVAEDSDCITWSVGFRAPDYQMLMAEIAGECLA-ESDSKLFTDPDRGITT 247 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE-LDIAPPEPPYQPDEIYDA 296 P+ + + +L L+L++ PE ++ ++S R E L+ A E + + Sbjct: 248 DPSILADTDRQQLVRGALDLLH-PEAIERAIYRWLSTPRLEGLEFAVDEHHIRERD---- 302 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEK--IDSPHRPALDALASNIALTAENFGDALEDP 354 LVR G +R+L G + NGE + +P + LAS DA+ P Sbjct: 303 --SDVSLVRHGSVRLLMQGKLAWLNGEAHTLTEQQQPLVQLLASKRRYQKREL-DAVMTP 359 Query: 355 SFLAMLAALVNSGYW 369 + +L + GY+ Sbjct: 360 TARELLHEWIEQGYF 374 >UniRef50_C0VP99 Cupin 4 n=2 Tax=Acinetobacter RepID=C0VP99_9GAMM Length = 387 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. 
Identities = 110/371 (29%), Positives = 177/371 (47%), Gaps = 15/371 (4%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD---GKWQV 63 + FL +WQK+P++++ I + P ++ LA+E +V +RL+ ++ +W V Sbjct: 12 ITAEQFLAEYWQKKPLLVRNAMPEIIGLLEPADVQELALEEDVTARLIRQKNKNPNEWHV 71 Query: 64 SHGPFESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 P D N W+LLVQAV+H+ A L + F +P WR DD+M+S++ GG V Sbjct: 72 KSSPLTKGDFQKLPNLWTLLVQAVDHYSFDIAELWKKFPFIPQWRRDDIMVSYAPKGGSV 131 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-PHPDLLQVDPFEAIIDEELEPGDILYI 181 G H D YDVF++QG G RRW++G+K P L + DE L PGD+LY+ Sbjct: 132 GKHFDFYDVFLVQGYGHRRWQLGQKCDETTALIPDQPLKLLTDMHVEFDEVLAPGDLLYV 191 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PPG H G A ++ + +S GFR PN E+I +D + E+ D A Sbjct: 192 PPGLAHYGVAEDDCLTFSFGFRMPNLSEMIDQVSDKFAENEILKKPLIDIVRQHTAPIGK 251 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + E+ L+ +L+ + Q F+ ++S+S + I PE +++ + + G Sbjct: 252 INSTELAYLKAQLLDYLTQAPEFEAAIMSYMSESNYPNSIPEPEE-ITTEDLLEVIGTGY 310 Query: 302 VLVRLGGLRVLRIG----DDVYANGE--KIDSPHRPALDALASNIALTAENFGDALEDPS 355 L+ R+L D +AN E + L +A +L F + Sbjct: 311 QLILEPASRLLYRELGDSLDFWANSENVCVSKNFENELKKIADGESL---EFNEQFNQHE 367 Query: 356 FLAMLAALVNS 366 L +A L+NS Sbjct: 368 VLEDIAQLLNS 378 >UniRef50_B1Y837 Cupin 4 family protein n=3 Tax=cellular organisms RepID=B1Y837_LEPCP Length = 418 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. 
Identities = 113/380 (29%), Positives = 181/380 (47%), Gaps = 29/380 (7%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ----DGKWQ 62 L+ F++RHWQ++P+++++ P+S ++ + + V+SR +S Q WQ Sbjct: 48 LSPSVFMQRHWQRKPLLVRQAVPGIEPPVSRAQMFAMLEDDAVESRFLSRQGEGDRQTWQ 107 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 GP S + + W++LVQ +N A L+ FR +P R+DDLMIS++ GG Sbjct: 108 FKRGPMPRRSLPAIKQPGWTVLVQGLNLHVPAAADLLNRFRFVPQARLDDLMISWASEGG 167 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GVGPH D YDVF+IQ GRRRWR+G + P + ++ F + LEPGD+LY Sbjct: 168 GVGPHFDSYDVFLIQVAGRRRWRIGRLPDARLREGLP-VKIIENFRHEEEWVLEPGDMLY 226 Query: 181 IPPGFPHEGYALE-NAMNYSVGFRAPNTRELIS----GFADYVLQRELGGNY---YSDPD 232 +PPG+ H+G A++ M SVGFR+P EL+ AD + G Y DP Sbjct: 227 LPPGWAHDGDAVDGECMTCSVGFRSPQRSELVRETLLRLADGIDDPADAGARPPVYRDPK 286 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 A P + + + + ++ + +P + GE++S+ + ++ E Sbjct: 287 QSATAAPGRIPAELLAFAEQGLMRALAEPGALARALGEYLSEPKAQVSF----------E 336 Query: 293 IYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAENFGDA 350 + + L G + +L VY NG+ + R A L LA L A A Sbjct: 337 LGEPLPDGVGVRLDDRSCLLYDDGHVYCNGDSWRAAGRDAAMLHLLADARQLDATTLRRA 396 Query: 351 LEDPSFLAMLAALVNSGYWF 370 P+ A+L + G+ Sbjct: 397 --SPALRALLEQWADDGWLH 414 >UniRef50_A4SX54 Cupin 4 family protein n=2 Tax=Polynucleobacter necessarius RepID=A4SX54_POLSQ Length = 410 Score = 328 bits (840), Expect = 3e-88, Method: Composition-based stats. 
Identities = 111/384 (28%), Positives = 193/384 (50%), Gaps = 23/384 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNF----------IDPISPDELAGLAMESEVDSRLVSH 56 ++ F++++W K+P++++ F PIS ELA L+ + V+SRL+ Sbjct: 31 ISPEQFMKQYWHKKPLLIRGAIPAFSLTNQNGEALESPISFPELAELSTQDTVESRLIR- 89 Query: 57 QDGKWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 W HGPF +S + + NW+LL+Q + H A ++ FR +PD R+DDLMIS Sbjct: 90 -SKPWSFDHGPFAKKSIPAINKPNWTLLLQGMEAHHPAAAKILSWFRFIPDARLDDLMIS 148 Query: 115 FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELE 174 + GGGVGPH D YDVF++Q +GRR W + E+ + + P L + F + D LE Sbjct: 149 VAGIGGGVGPHFDSYDVFLMQMSGRRHWHISEQKDLSLN-PKLPLKILQHFRSEQDWILE 207 Query: 175 PGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELIS----GFADYVLQRELGGNYYS 229 PGD+LY+PP H+G AL+ +S+GFR+P+ +EL+ A+ + ++ Sbjct: 208 PGDMLYLPPHVAHDGIALDAGCQTWSIGFRSPSFKELLQEGLWRLAESLENLPELEQKFA 267 Query: 230 DPDVPPRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPY 288 DP A + + + +L+ + +L ++Q + F ++S+ + + P P Sbjct: 268 DPKQEATASAEQLPDELIAQLKGQLHKLKLDQIDSFLPGITAYLSEPKQQAIFDGPNSPL 327 Query: 289 QPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFG 348 +P L + E L+ R+L +G V+ NGE + P + +++ Sbjct: 328 KPKAFLARLSR-ENLLPHPQTRILSLGKQVFCNGESMTQDQGPRIGDAWRSLSAQKRLRT 386 Query: 349 DALEDPSFLAMLAALVNSGYWFFE 372 +L++ ++ A + SG+ FE Sbjct: 387 KSLQNIDKSSLYEAYL-SGWLIFE 409 >UniRef50_C7RB22 Cupin 4 family protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RB22_KANKD Length = 390 Score = 327 bits (838), Expect = 5e-88, Method: Composition-based stats. 
Identities = 121/369 (32%), Positives = 206/369 (55%), Gaps = 18/369 (4%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFI-----DPISPDELAGLAMESEVDSRLVSHQDGKW 61 ++ FL+ +WQKRP++++ F++ IS +ELAG ++E +++SRL+ W Sbjct: 8 ISPEQFLKEYWQKRPLLIRGAFSSAQVSGEDALISAEELAGYSLEDDIESRLIERDGDDW 67 Query: 62 QVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 Q+ HGP + LG+ NW+LLVQ+++++H P L++ +P WR+DD+M+S++ G Sbjct: 68 QLEHGPIAESKFAELGDQNWTLLVQSLDYFHPPLCELIKACNFIPRWRLDDVMVSYATNG 127 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDI 178 GGVGPHLD+YDVF+IQG G+RRWRVG K Q CPHP + Q++PF+A +D + PGD+ Sbjct: 128 GGVGPHLDKYDVFLIQGEGQRRWRVGHKNQGTTAICPHPQIAQIEPFDADMDVIVNPGDM 187 Query: 179 LYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 LYIPP PH G ++ N++ YSVGFRAPN ++ + Q EL + + + ++ Sbjct: 188 LYIPPNTPHWGESVGNSICYSVGFRAPNIGGIVQKLM-QLPQTELDQLWSDEARLSLKSS 246 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 ++ ++M + E L+ + E + FG+ +++ ++ + P+ D I AL+ Sbjct: 247 RGELT-RDMSRWAEQQLKQLWTSEDYLMAFGKEVTELKYPDMLEVPDDDELIDWIELALE 305 Query: 299 QGEVLVRLGGLRVLRIGDD------VYANGE--KIDSPHRPALDALASNIALTAENFGDA 350 QG L + + G ++ NGE + P ++ L +A+ Sbjct: 306 QGVKAEPLARMTYFKHGGKESNELWLFINGEWQAMHISLEPLIEKLNLTYECSAKELATL 365 Query: 351 LEDPSFLAM 359 + + L + Sbjct: 366 AHEVAHLFL 374 >UniRef50_B4X170 Cupin superfamily protein n=1 Tax=Alcanivorax sp. DG881 RepID=B4X170_9GAMM Length = 382 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. 
Identities = 117/374 (31%), Positives = 186/374 (49%), Gaps = 16/374 (4%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-GKW 61 + L + FL HWQK+ + + D + LAGLA+E V++R+++ D G W Sbjct: 15 FILPMKPAAFLREHWQKKALFMPGAARGL-DQPDANTLAGLALEESVEARIITGADNGPW 73 Query: 62 QVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 V P ++ LGE NW+LLVQ+V+H+ T+ L+ F LP+WR++D+MIS++ G Sbjct: 74 SVLQSPLSDDVFETLGEENWTLLVQSVDHFLTETSLLLDDFAFLPNWRVEDIMISYAAKG 133 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDI 178 G VGPH D+YDVF+IQ G RRW++G+ P +L + + PGD+ Sbjct: 134 GSVGPHFDRYDVFLIQAAGHRRWQIGDVCDESTPRQPTDELKLLADMPVREEFVAAPGDV 193 Query: 179 LYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 LY+PPG H G A + + + +SVGFRAP+ + L++ A L E ++DPD + Sbjct: 194 LYLPPGVAHHGVAEDSDCITWSVGFRAPDYQMLMAEIAGECLA-ESDSQLFTDPDRDVTS 252 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDAL 297 P + + +L L+L+ QP+ ++ ++S R + D Sbjct: 253 DPTVLADADRQQLIRGALDLL-QPDAIERAVYRWLSTPRLDGL-----EFAIDDHHIRER 306 Query: 298 KQGEVLVRLGGLRVLRIGDDVYANGEK--IDSPHRPALDALASNIALTAENFGDALEDPS 355 LVR G +R+L G + NG+ + RP + LAS DA+ P+ Sbjct: 307 DDKVALVRHGSVRLLMQGKLAWLNGDSHTLTEQQRPLVQLLASKRRYQESEL-DAVMTPA 365 Query: 356 FLAMLAALVNSGYW 369 +L + GY+ Sbjct: 366 ARELLHEWIEQGYF 379 >UniRef50_D1KE35 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KE35_9GAMM Length = 362 Score = 317 bits (813), Expect = 4e-85, Method: Composition-based stats. 
Identities = 105/367 (28%), Positives = 175/367 (47%), Gaps = 26/367 (7%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVS--HQDGKWQV 63 ++ +FLE +WQK+P+++K+ NFI PIS DELAGL++E E +SRLV +W + Sbjct: 6 AISVEEFLEDYWQKKPLLIKQALPNFISPISSDELAGLSLEEEFESRLVQGSTAQQQWSL 65 Query: 64 SHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 ++GPF + L E +W+LLVQ V+ + + L++ F +P WR DD+MIS++ GG Sbjct: 66 TNGPFTKTTFTQLPEQDWTLLVQGVDRFIDEVHDLIKQFDFIPRWRFDDVMISYATKGGS 125 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF++QG+GRRRW + + + + L + F E+EPGD+LY Sbjct: 126 VGPHFDYYDVFLLQGSGRRRWELSTQFCTLDNYLKDVPLRIMHTFTPEQFFEVEPGDVLY 185 Query: 181 IPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 IPP H G +L++ S G+RA + +EL D + YY DP + P Sbjct: 186 IPPKVAHHGVSLDDECTTLSFGYRAYSAQELFESL-DMQNPDQEQNIYYQDPIWINTSSP 244 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 A + +++ +++ + F + + L E ++ Sbjct: 245 ALIPDLAIEQANQILKI---SSDEFAAFVTKLDILDEQLLQHFEAET----------FRE 291 Query: 300 GEVLVRLGGLRVLRIGDD----VYANGEKIDSPHRP--ALDALASNIALTAENFGDALED 353 ++ D V+ NGE D+ A+ + + A++ + Sbjct: 292 KMQYKLHPSCKIAYFLVDTTPKVFINGEYFDTQEFDPQAVMQFCNKRTINAKDHQQLTIN 351 Query: 354 PSFLAML 360 ++ Sbjct: 352 LFKQNLI 358 >UniRef50_B9ZR02 Cupin 4 family protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZR02_9GAMM Length = 385 Score = 314 bits (804), Expect = 4e-84, Method: Composition-based stats. 
Identities = 99/294 (33%), Positives = 161/294 (54%), Gaps = 6/294 (2%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQDGKWQVS 64 L+ +FL +WQ++P++++ + F +PI PD+LAGLA + + +RLV G W V Sbjct: 16 LSPAEFLRDYWQQKPLLVRGAVSGFANPIEPDDLAGLACDPDASARLVLGDTDHGDWAVE 75 Query: 65 HGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 +GPFE + L + W+LL+ V + + F +P WR DDLMIS++ P G V Sbjct: 76 YGPFEEDRFASLPDRAWTLLISDVERFWPEGHDFLARFDFVPRWRRDDLMISYASPDGSV 135 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVF+ Q GRRRW++ L + FE +LEPGD+LY+P Sbjct: 136 GPHVDAYDVFLFQAAGRRRWQIQSPPGPLDCHDDLPLAILREFEPTESWDLEPGDLLYLP 195 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 P PH G +L++ M +S+GFRAP +L++GF + R YSDP P A+ ++ Sbjct: 196 PNLPHYGLSLDDQCMTWSIGFRAPTYLDLLTGFLEERANRVGEAPRYSDPQRPVSAYVSE 255 Query: 242 VLPQEMDKLREMMLELI-NQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 + + +LR+++ E++ + G F+++ +++ +PP + E Sbjct: 256 LPSHDRTRLRDILREMLAADDTELDAFLGRFLTRPAGNVELHTGDPPAEARECR 309 >UniRef50_C1E292 Predicted protein n=2 Tax=Micromonas RepID=C1E292_9CHLO Length = 466 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. 
Identities = 104/398 (26%), Positives = 167/398 (41%), Gaps = 35/398 (8%) Query: 9 WPDFLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAMESEVDSRLVSHQDG---KWQVS 64 W F E++WQK PVV++ G P+ DELAGLA E+E R++ D W + Sbjct: 68 WTTFFEKYWQKEPVVIRGGLPTELCTPVDNDELAGLACETEFRPRIIRKGDEGPSSWSLQ 127 Query: 65 HGPF--ESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF + L W LL+ + ++ F P WR+ D+ S S GG Sbjct: 128 MGPFSEDELKSLPSDGSWCLLLNDLEKHVSEFMDVLNLFDRFPRWRVADVQASISSEGGS 187 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGE-----KLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 VG H DQ+DVF+IQGTG +RW + + + P ++ + F+ L+ G Sbjct: 188 VGAHSDQFDVFLIQGTGHKRWSISDCAEYVPDNDEAFFPDAEVRVLKNFQPQSCSLLKQG 247 Query: 177 DILYIPPGFPHEGYALE---NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 DILY+PP H G A +SVGF AP EL+ +A + G + DP + Sbjct: 248 DILYLPPKVAHHGVAEGCKTICTTFSVGFLAPAHDELVLSYAQASVDTHDGSQRWRDPWL 307 Query: 234 PPRAHPADVLPQEMDKLREMMLELINQPEH-FKQWFGEFISQSRHELDIAP-PEPPYQPD 291 P+ H ++ + + + E++ + + + + +WFG +QS P D Sbjct: 308 KPQEHVGEISSEAVAQAAEIIRQSMPKNDAEIARWFGCHATQSFGIDPSETIPAKDLSAD 367 Query: 292 EIYDALKQGEVLVRLGGLRVLRI---------GDDVYANGEKIDSPHRP---ALDALASN 339 E+ + L R + + G +A G P +A+ Sbjct: 368 ELVVQFAEEGSLQRRADAKFAFVQEVKDGSLEGGLFFAAGNMWPLQSAPGMELARHIANY 427 Query: 340 IALTAENF------GDALEDPSFLAMLAALVNSGYWFF 371 + A+++ + D +L L +SG +F Sbjct: 428 DEIIADDWIESGDAAEYDMDSEAKTLLHDLFSSGLIYF 465 >UniRef50_A4S2B8 Predicted protein n=2 Tax=Ostreococcus RepID=A4S2B8_OSTLU Length = 392 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. 
Identities = 106/390 (27%), Positives = 171/390 (43%), Gaps = 35/390 (8%) Query: 13 LERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ---DGKWQVSHGPFE 69 + +WQK+P+++++ NF P+ +E+AGLA E + +R+ + + W+ GPFE Sbjct: 1 MREYWQKKPLLMRQAIPNFRPPLDGNEIAGLACEEDASARIFVREGDDEQSWRKKIGPFE 60 Query: 70 SYD--HLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 D L E WSL+V ++ +P ++ F P WRI D+ S S GGGVGPH Sbjct: 61 ESDLTSLPEDKPWSLIVNDLDVQAQPFGDMLELFNCFPRWRISDIQASVSPDGGGVGPHS 120 Query: 127 DQYDVFIIQGTGRRRWRVGE-----KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 D +DVF++Q G + W V + P ++ + F L PGD+LY+ Sbjct: 121 DHFDVFLLQAEGEKVWAVADNEEYWPDNDAAFVPECEIRVLKSFVEDDSFTLVPGDMLYL 180 Query: 182 PPGFPHEGYALEN----AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 PP H G A + ++ S+GF AP T EL+ + ++ L G+ +SDP + P Sbjct: 181 PPKIAHNGVATNSKPGVSVTLSIGFLAPTTDELVLSYTQRASEK-LKGSRWSDPWLKPVE 239 Query: 238 HPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 + + + E++ +WFG + E D E +E+ A Sbjct: 240 DVGAISAESITYASEIIKRTYPKNDAEVARWFGCHTTARTGEDD-DADENEVSIEELLAA 298 Query: 297 LKQGEVLVRLGGLRVLRIGD---------DVYANGEKIDSPHRPALDALASNIALTAENF 347 + ++ R LR + +ANGE D PA A+ IA E + Sbjct: 299 WEHQGLVARED-LRFAFVEKVADDSLKNALFFANGECWDVVS-PAAVKTATVIANRGELY 356 Query: 348 GDALE------DPSFLAMLAALVNSGYWFF 371 + + D L + L GY +F Sbjct: 357 EEDTQTEECDFDDEALKLALTLFERGYLYF 386 >UniRef50_P44683 Uncharacterized protein HI0396 n=36 Tax=Gammaproteobacteria RepID=Y396_HAEIN Length = 404 Score = 297 bits (760), Expect = 6e-79, Method: Composition-based stats. 
Identities = 112/400 (28%), Positives = 189/400 (47%), Gaps = 30/400 (7%) Query: 1 MEYQLT--LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQ 57 +++ L + FL +WQK+P+V++ G + P ++ LA +V +RLV + Sbjct: 7 VDFCLPEHITPEIFLRDYWQKKPLVIRNGLPEIVGQFEPQDIIELAQNEDVTARLVKTFS 66 Query: 58 DGKWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 D W+V P + L E WS+LVQ + W L F +P W+ DD+M+S+ Sbjct: 67 DDDWKVFFSPLSEKDFQKLPEK-WSVLVQNLEQWSPELGQLWNKFGFIPQWQRDDIMVSY 125 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDP-FEAIIDEEL 173 + GG VG H D+YDVF++QG G RRW+VG+ + P+ + D E +IDE + Sbjct: 126 APKGGSVGKHYDEYDVFLVQGYGHRRWQVGKWCDASTEFKPNQSIRIFDDMGELVIDEVM 185 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 PGDILYIP H G A ++ + +S G R PN LI G + ++ N S+ D+ Sbjct: 186 NPGDILYIPARMAHYGVAEDDCLTFSFGLRYPNLSNLIDGISKGFCHQDPDLNL-SEFDL 244 Query: 234 PPRAHPAD-----VLPQEMDKLREMMLELINQPEHFKQWFGEFISQ--SRHELDIAPPEP 286 P R ++ + + + +++++L+ + E F F + ++ S ++ + Sbjct: 245 PLRLSQSEQRTGKLADENIQAMKQLLLDKLAHSEAFDTLFKQAVASAVSSRRYELLVSDE 304 Query: 287 PYQPDEIYDAL-KQGEVLVRLGGLRVLRIGD--DVYANGEKIDS---PHRPALDALASNI 340 PDE+ L + G L + ++L + +YANGE +D L L+ Sbjct: 305 MCDPDEVRSILEEDGAFLSQDNNCKLLYTENPLRIYANGEWLDELNIIESEVLKRLSDGE 364 Query: 341 ALTAE---NFGDALEDPS-----FLAMLAALVNSGYWFFE 372 +L + + EDP L + V+ G+ E Sbjct: 365 SLDWAFLSDLANKTEDPETSMDLLLDSICNWVDDGWALIE 404 >UniRef50_UPI0000E87D6F hypothetical protein MB2181_02235 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87D6F Length = 377 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. 
Identities = 92/369 (24%), Positives = 168/369 (45%), Gaps = 15/369 (4%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 ++ FLE++W K+ + L+ + +S D + GLA ++S++++ +G Q ++G Sbjct: 13 ISPSAFLEKYWGKQALFLQDAIDISGAGLSKDVVFGLAKNENIESKIIAFIEGSQQTTYG 72 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 PF H + SLL+ N HE + L + +P DD+M+SFS GGGVGPH Sbjct: 73 PFNKVKHGKSS--SLLIHQFNLIHEFSYNLFQSINFVPYCLHDDVMMSFSSEGGGVGPHS 130 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 D YDVF++QG G + W +G + D + F +PGDILY+PP P Sbjct: 131 DSYDVFLVQGQGEKVWNIGATDKKAFKTTSTDHSNLK-FTPTEQFLAKPGDILYVPPFTP 189 Query: 187 HEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQR-ELGGNYYSDPDVPPRAHPADVLP 244 H G +L ++ + YS+GFR+P+ E+ + + +Y++ R E + ++ D+ + ++P Sbjct: 190 HHGISLSDDCITYSIGFRSPSNNEIRNQYLEYLMDRKEKSNDLFNGLDLS--ENTKALIP 247 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 + + P + G F+S+ + + ++L Sbjct: 248 NALASFIKKNTAFPKDPTIIDDFIGCFLSEPHEGAFFTKKN---ITKNAFKKIDTEKILR 304 Query: 305 RLGGLRVLRIGDDVYANGEKIDSP--HRPALDALASNIALTAENFGDALEDPSFLAMLAA 362 R + ++ Y N E I R + L + + + + S + ++ Sbjct: 305 LNIQTRAVIHNENFYINAENIFVANKDRMFFEELFNQKQI---LITPSKANDSLVEVMIY 361 Query: 363 LVNSGYWFF 371 L++ GY F Sbjct: 362 LLSEGYITF 370 >UniRef50_A2W941 Transcription factor jumonji n=1 Tax=Burkholderia dolosa AUO158 RepID=A2W941_9BURK Length = 360 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. 
Identities = 77/193 (39%), Positives = 113/193 (58%), Gaps = 5/193 (2%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 L F+ R+WQK+P+++++ P++ D L LA + + +SRL++H KWQ++HG Sbjct: 49 LTPAQFMRRYWQKKPLLIRQAIPGVASPVTRDALFELAADYDAESRLITHFRNKWQLTHG 108 Query: 67 PFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 PFE S + W+LLVQ ++ + AL+ FR +PD R+DDLMIS++ GGGVGP Sbjct: 109 PFEPGSLPAVTRRAWTLLVQGLDLHVDAARALLDRFRFIPDARLDDLMISYATDGGGVGP 168 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 H D YDVF++Q GRRRWR+G + C L + FE + G ILY+PP Sbjct: 169 HFDSYDVFLLQVEGRRRWRIGAQTDC--RCSRRALKILRHFEPATNGCWNRGAILYLPPH 226 Query: 185 FPHEGYALENAMN 197 H+G A M+ Sbjct: 227 SAHDGVADGE-MH 238 >UniRef50_B6BWI1 Putative cytoplasmic protein n=1 Tax=beta proteobacterium KB13 RepID=B6BWI1_9PROT Length = 346 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. Identities = 93/374 (24%), Positives = 155/374 (41%), Gaps = 41/374 (10%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPI-SPDELAGLAMESEVDSRLVSHQDGKWQ 62 +L LN F++ +W K+ L G NF D D+L L ++ R + QDG+ Sbjct: 2 ELVLNKKCFVKSYWGKKHFFLPGGIKNFNDNFVDLDDLN-LPSSKALE-RKIFIQDGRKY 59 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 ++ + ++ S L NH H+ + + F +P + IDD+MIS S G V Sbjct: 60 INFTNVKKKLNVNTPK-SKLFYKTNHIHQLSFEVKNLFDFIPQYLIDDVMISLSNTKGSV 118 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 G H D Y VF+IQG G + W++ E + + ++ GDILY+P Sbjct: 119 GKHKDNYSVFLIQGKGIKNWKIYEN------------------KKVFSYTVKEGDILYVP 160 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYV--LQRELGGNYYSDPDVPPRAHP 239 PG H G + YSVGFR+P++ L F DY+ L + ++ + + Sbjct: 161 PGIDHYGISQSEICNTYSVGFRSPDSLNLKEIFNDYIFNLLDQTSTIFFQNKLFSKQKAS 220 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 +P ++ + L +P ++ G +++ EL + + D LK+ Sbjct: 221 ---IPDDIKDF--FIRHLDCKPIILDEFIGIYLTSVDLEL---FKKKEITLKKFKDQLKR 272 Query: 300 
GEVLVRLGGLRVLRIGDDVYANGEKID--SPHRPALDALASNIALTAENFGDALEDPSFL 357 + + R L G + Y NG KID + R + + A+N + Sbjct: 273 MPLFL-NQMTRALYFGKNFYINGFKIDIETNSRKEFRKFFNESTIIAKNL-----NNKST 326 Query: 358 AMLAALVNSGYWFF 371 +L L Y F Sbjct: 327 LLLYKLFKKEYIVF 340 >UniRef50_B7FZB3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FZB3_PHATR Length = 492 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 89/332 (26%), Positives = 155/332 (46%), Gaps = 36/332 (10%) Query: 10 PDFLERHWQKRPVVLKRGF-NNFIDPISPDELAGLAME------SEVDSRLVSHQDGK-- 60 PD L +W + P++++ F + + P + L + S +R+++H G+ Sbjct: 67 PDLLTNYWGRSPLLIRSAFHAEALTEVWPSQADLLELALDDDEISSDSARIITHTSGRLD 126 Query: 61 -WQVSHGPFESYD----HLGETNWSLLVQAVNHWHEPTAALMR-PFRELPDWRIDDLMIS 114 + GPF + G+ W+L+V V+ + A M F LP WR DD IS Sbjct: 127 SFASQLGPFSTSTIQGLEHGDKMWTLIVNDVDRYVSTLADWMDDEFGFLPRWRRDDAQIS 186 Query: 115 FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH----CPHPDLLQV-------D 163 + GGG+GPH+D YDVF+ Q +G+R W VG + +++ P + + + Sbjct: 187 MARTGGGIGPHVDSYDVFLTQTSGQRTWLVGNTMTVQEEMNTLIPDLSVRILRDVSNHNE 246 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQ-- 220 A EL+PGD+LY+PP + H G AL ++ + SVG R+P++ EL++ A+ +L Sbjct: 247 SSHAYTRLELQPGDVLYLPPRYVHWGTALTDDCVTLSVGARSPSSAELVARIAETMLGSV 306 Query: 221 RELGGNYYSDPDVPPRAHPA---DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH 277 Y+DPD+ + A + D ++ M+L+ +++ + E +++ Sbjct: 307 SVHAVQRYTDPDLLQEVNGAPLHSMTNHAKDSMKTMVLDAVHEITDDPMRWDELVAK--- 363 Query: 278 ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGL 309 L P Y+ +K E L GG Sbjct: 364 -LATEPKRMSENALVPYNEIKDSEYLAIWGGT 394 >UniRef50_B5W5P2 Cupin 4 family protein n=2 Tax=Arthrospira RepID=B5W5P2_SPIMA Length = 387 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. 
Identities = 73/382 (19%), Positives = 143/382 (37%), Gaps = 34/382 (8%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQD-GKWQVS 64 L +F +++W ++ V++ + F D S +L L + + G+ Sbjct: 11 LPISEFFDKYWTEKSVLIPGANHQKFADLFSWQKLNNLLNYYPLKHPEIRLAKTGETLPE 70 Query: 65 HGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG-GGVG 123 E + +L++ ++ E A ++ R R + S PG G Sbjct: 71 ITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHR-SQVNSYCSFPGHQGFA 129 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP-FEAIIDEELEPGDILYIP 182 H D ++VFI+Q +GR+ WRV + + P + ID + PGD+LYIP Sbjct: 130 CHYDSHEVFILQISGRKHWRVFSDTFIYPLSENRSSQFSPPDTQPYIDAIINPGDLLYIP 189 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 G H A++ +++ ++G + + Q + + + ++H + Sbjct: 190 RGHWHYAIAIDEPSLHLTLGIDCQTGIDFSDWLTSQLQQH---PQWRKNLPLLNKSHREN 246 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP--------------- 286 Q + L + LE++ + ++ E + Q + +L + P Sbjct: 247 C-RQHLQNLVQNWLEILESEDLINRYLDEQLLQGQPDLQLGFPSQIGYDIFPQGQETKFY 305 Query: 287 -PYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAE 345 P QP I G+ ++ GG ++ G D + + S LD + Sbjct: 306 RPQQPVYITQLTPTGKFEIKTGGKKISLTGLDQHILEKIFTSTEFSGLDI--------QQ 357 Query: 346 NFGDALEDPSFLAMLAALVNSG 367 D D + +L+ LV +G Sbjct: 358 WLQDFDWDTEIVPLLSRLVKAG 379 >UniRef50_C8XBP6 Cupin 4 family protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XBP6_NAKMY Length = 452 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. 
Identities = 80/407 (19%), Positives = 137/407 (33%), Gaps = 52/407 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRG--FNNFIDPISPDELAGLAMESEVDSRLVSHQD 58 ++ + ++ DF +R+W + P++ ++F D S D + L E + + + Sbjct: 23 VQRCIAIDADDFAQRYWAQAPLLTTAAELNDDFSDLFSADSVDELVSERGLRTPFLRMAK 82 Query: 59 GKWQVSHGPFESYDHLGET----------------NWSLLVQAVNHWHEPTAALMRPFRE 102 +S F G T +L++QA++ P Sbjct: 83 NGSVLSSASFTRGGGAGATITDQVADDKVLAQLAGGATLVLQALHRTWPPLVRFGSELAA 142 Query: 103 LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD---- 158 + G H D +DVF++Q G + WR+ E + + PH Sbjct: 143 ELGHPVQINAYITPPQNQGFASHYDTHDVFVLQIAGTKHWRIHEPV-LPDPLPHQTWDGR 201 Query: 159 ---LLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGF 214 + ID L PGD LY+P G+ H A +++ ++G Sbjct: 202 RAQVQDRAAQAPAIDALLRPGDALYLPRGYLHSAVAQGELSIHLTIGVHP---------L 252 Query: 215 ADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDK----LREMMLELINQPEHFKQWFGE 270 Y L REL DP++ R+ P V ++D LR+ L+++ Sbjct: 253 TGYDLARELIAAAEDDPEL-RRSLPMGVDVTDVDAMATHLRQAAQRLVDRLGQAGPELYR 311 Query: 271 FISQSRHELDI----APPEPPYQPDEIYDALKQGEVLVRLGGLRV-LRIGDDVYANG--- 322 ++ + P P L LV GLR LR + + Sbjct: 312 AAARRVGPQQVGQTRPAPIAPLAQLRAAATLDPQTPLVLRPGLRPRLRQQGEKWVLSLID 371 Query: 323 --EKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 AL + S A TA+ L+D L + L+ G Sbjct: 372 STVSWPEQVHAALLIVLSGKAFTADEL-PNLDDAEQLVVARRLLREG 417 >UniRef50_Q091R3 Mina protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q091R3_STIAU Length = 383 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. 
Identities = 84/376 (22%), Positives = 143/376 (38%), Gaps = 31/376 (8%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSR----LV---SHQDGKWQVS 64 F E W+++P+VL+ + + S +L L S LV H+D W Sbjct: 15 FFEEAWERKPLVLQGPPDRWSGLFSSRDLGRLLTYQPPRSIEGMMLVKEGRHRDENWLSP 74 Query: 65 HGP--FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 G E +++++ V + EP E + + G Sbjct: 75 DGSPRLEQVQAAWREGYTIVINKVGQFWEPVGRFCAAVEEELHHPVGVNLYMTPPGAQGF 134 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI-IDEELEPGDILYI 181 H D D F++Q G + W+V + + +++EL+ GD+LYI Sbjct: 135 KAHFDIMDAFVLQVEGSKVWQVRGPQVTLPLPDEHTATSSESLPPVLLEQELKRGDVLYI 194 Query: 182 PPGFPHEG-YALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 P GF HE A ++++ ++G +A +L + + +PPR Sbjct: 195 PRGFVHEARTAQTHSVHLTLGLQAVTWSDLFVA----AIAAARRDERFR-KGLPPRFLEG 249 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP-DEIYDALKQ 299 + + RE++ EL P H + G ++Q L + P PP + E LK Sbjct: 250 SAMME--QTFRELLAEL---PRHLE--LGHALTQLAERLVVQKPPPPTEDLLEGAVELKG 302 Query: 300 GEVLVRLGG--LRVLRIGDD--VYANGEKIDSPHR--PALDALASNIALTAENFGDALED 353 VL R G LRV+ + +G K+ P + PAL +A + ++ L + Sbjct: 303 STVLTRRPGMVLRVMEGPGYAGLQYSGGKLMGPAKIGPALRHIAKGSVIPVQSL-PGLSE 361 Query: 354 PSFLAMLAALVNSGYW 369 L + LV SG Sbjct: 362 KEQLVLAGRLVRSGVL 377 >UniRef50_A4U3D3 MYC induced nuclear antigen n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3D3_9PROT Length = 390 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. 
Identities = 69/379 (18%), Positives = 136/379 (35%), Gaps = 24/379 (6%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFI-DPIS---PDELAGLAMESEVDSRLVSHQD---- 58 + +FL +W+K+P+++KR F D +S D++ + D R+ D Sbjct: 13 ITPHEFLAEYWEKKPLLVKRAAPGFYRDLLSVQAIDQVLAMPGLHRRDIRVARGTDPLAV 72 Query: 59 GKWQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 ++ G S L +++++ +N P A + R F ++ + Sbjct: 73 EEYADKDGFINAASLSRLFTDGFTIILNTLNLKLRPLAEICRAFEQVLSIPCQTNIYYTP 132 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 G PH D +DVF+ Q GR+ W V + +++ + +P + ++ +LEP Sbjct: 133 RLAQGFKPHYDSHDVFVFQVAGRKHWLVNDTPVELPLRGQGFEAGLYEPGDVTMEFDLEP 192 Query: 176 GDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 GD+LYIP G H + +++ ++G + E++ ++ Sbjct: 193 GDLLYIPRGVMHGARTSDEVSLHITLGALTTSWAEVLLEAVAAAALTDVELRRNLPAGYA 252 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 + A Q L + E I+ + G ++ R L E + Sbjct: 253 LPGYDAQAARQTFASLLNRVAENIDVESILDVYRGALQARRRPYL-----ENSVTRLDGL 307 Query: 295 DALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSP--HRPALDALASNIALTAENFG 348 + + + L + + G I P PAL Sbjct: 308 SQISAADSAEPVPNLLYSLTESEGRASLACFGRTITVPDFAAPALAHAVKGTRFRVGEL- 366 Query: 349 DALEDPSFLAMLAALVNSG 367 L++ +A++ LV G Sbjct: 367 PGLDEDGSVALVRRLVLEG 385 >UniRef50_A3Q8B6 Cupin 4 family protein n=4 Tax=Mycobacterium RepID=A3Q8B6_MYCSJ Length = 404 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 71/402 (17%), Positives = 142/402 (35%), Gaps = 47/402 (11%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGF--NNFIDPISPDELAGLAMESEVDSRLVSHQD 58 + + + F +W +RP++ + G +F D +SP + L E V + + Sbjct: 2 LSRCIATDPHTFATEYWGRRPLLSRSGALPRDFADLLSPGMVDELIAERGVRAPFIRLAK 61 Query: 59 GKWQVSH----GPFESYDHLGET------------NWSLLVQAVNHWHEPTAALMRPFRE 102 ++ GP + + ++++Q ++ P L+R + Sbjct: 62 EGDVLAKDCYLGPAGFGAEMPDQVDSAKVLTQFSAGATIVMQGLHRLWPPVIDLVRHLVD 121 Query: 103 LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP----- 157 + G PH D +DVF++Q G +RW V E + P Sbjct: 122 DLGHPVQANAYITPPSNRGFDPHYDVHDVFVLQTAGEKRWVVHEPVHPHPLPSQPWTQHR 181 Query: 158 -DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFA 215 + + E +ID L PGD LY+P G+ H +AL+ +++ ++G A ++ Sbjct: 182 DAIAERAAGEPVIDTVLAPGDALYLPRGWVHSAHALDTTSIHLTIGVSAVTGVDVARAVV 241 Query: 216 DYVLQRELGGNYYSDPDVPPRAHPA---DVLPQEMDKLREMMLELINQPEHFKQWFGEFI 272 D + +P PA +++ + +M+ + + + + Sbjct: 242 DALADSAAF-----RAPLPMGGDPADRDEIIAAVTKVMAQMVETMRDDATALSGAAADRL 296 Query: 273 SQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG--------GLRV-LRIGDDVYANGE 323 +++ P + V R G G R+ LR+ D V Sbjct: 297 TRTHASRTRPVAVRPLATLDAAAHADTTTVQWRHGLVATADTAGDRIELRLTDRVL---- 352 Query: 324 KIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVN 365 + PA+ AL +A A + L+ ++ L+ Sbjct: 353 SFPASCAPAVLALQRGLAADAGSL-PGLDRADGTVLIRRLLR 393 >UniRef50_UPI000192663F PREDICTED: similar to Myc-induced nuclear antigen n=1 Tax=Hydra magnipapillata RepID=UPI000192663F Length = 437 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. 
Identities = 46/241 (19%), Positives = 95/241 (39%), Gaps = 13/241 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFI-DPIS----PDELAGLAMESEVDSRLVSHQDGKW 61 ++ F E W+K+P+ +KR + + D S + LA +E E D + + D + Sbjct: 50 ISVKTFFEEFWEKKPLYIKRENSGYYGDLFSLSSMKEILAAHELEFETDVNVCRYVDNEK 109 Query: 62 QVSHG----PFESYDHL-GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 ++ + + +D L + + + + + LM + + Sbjct: 110 ELLNEDGCLTVDKFDKLMNDKHATFQLHQPQRYGTVLWQLMEKMETYFGCLVGSNVYITP 169 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 G+ PH D +VFI+Q G + W++ + + DL Q + I++ LEPG Sbjct: 170 KESQGLAPHCDDVEVFILQLEGTKHWKLYKPMVELSRDYTQDLSQDSIGKPIMELTLEPG 229 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDV 233 D+LY P G H+ ++ + + + + +S ++ L + + Sbjct: 230 DLLYFPRGTIHQARSVGESYSTHITLSTYQNNTLGDFMSIAVSQAIESALENDVSFRRGL 289 Query: 234 P 234 P Sbjct: 290 P 290 >UniRef50_Q7N884 Similar to unknown protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N884_PHOLL Length = 388 Score = 177 bits (450), Expect = 5e-43, Method: Composition-based stats. 
Identities = 73/391 (18%), Positives = 140/391 (35%), Gaps = 41/391 (10%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRL-VSHQDGKW 61 ++ DFLE ++K+P V K+ +++ D I E+ + S + S + Sbjct: 4 INFPIDKKDFLENFFEKKPCVFKKIYDD--DFIKHSEIENIFNRSNLPSFEGIKLMYNGI 61 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPF---------------RELPDW 106 ESY+ LG + + + + A L+ Sbjct: 62 IDKTEYIESYNDLGTRRYRYIYSKLYDYLNSGATLVANGIINETKIDQLAKACSSFTDSH 121 Query: 107 RIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD--- 163 L +S+ PH D D+F IQ +G++RW + + ++ Sbjct: 122 PFSSLYLSYG-EKSSFKPHWDSRDIFAIQLSGKKRWIIYKP-SFPDPVYLHQSKDMENTY 179 Query: 164 --PFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQ 220 P E D LE GD+LY+P G+ H L E ++ SVG P E I+ + + Sbjct: 180 PCPSEPYDDFVLETGDVLYLPRGWWHNPLPLGEETIHLSVGIFPPYAHEYINWLSYKITD 239 Query: 221 RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 ++G +P A E+D L + +++ I + + ++ F + R Sbjct: 240 IDIG-----RKSLPRSWKQA---KDEIDILAKYVIDNITSEDSYNEFLKSFSDEKRVP-- 289 Query: 281 IAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNI 340 + + + ++ + L + + NG K++ + L+ Sbjct: 290 -SKLNLRLFCGKKHHSISKTSRLRINSNNNLSIDEGYIICNGAKVNLDQFS-IHLLSKIS 347 Query: 341 ALTAENFGDALE--DPSFLAMLAALV-NSGY 368 + +F + L D S + L+ GY Sbjct: 348 EIPYISFENLLSFFDHSKQKNIEDLIYKLGY 378 >UniRef50_D0L9V4 Cupin 4 family protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L9V4_GORB4 Length = 414 Score = 177 bits (448), Expect = 8e-43, Method: Composition-based stats. 
Identities = 71/404 (17%), Positives = 143/404 (35%), Gaps = 45/404 (11%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGF--NNFIDPISPDELAGLAMESEVDSRLVSHQD 58 + + F +++W +RP++ + +F D +S + L E V + + Sbjct: 2 LSRCTATDLTTFADQYWGRRPMLSRAASLPADFGDLLSVRAVDELIAERGVRAPFIRMAK 61 Query: 59 GKWQVSH----GPFESYDHLGET------------NWSLLVQAVNHWHEPTAALMRPFRE 102 ++ GP + + ++++Q ++ P +R + Sbjct: 62 EGVVLARDCYLGPAGFGAEMPDQVDPAGVLREFAAGATIVLQGLHRLWPPVIDFVRAMVD 121 Query: 103 LPDWRIDDLMISFSVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---- 157 + + PG G PH D +DVF++Q G +RWRV + P Sbjct: 122 DLGHPVQAN-AYVTPPGNRGFDPHYDVHDVFVLQVAGTKRWRVHRPVHTHPLATQPWTDH 180 Query: 158 ---DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISG 213 + I+ L PGD LY+P G+ H AL + +++ ++G A R++++ Sbjct: 181 RAQIERRASDDAPEIEAVLSPGDALYLPRGWIHSADALGDTSIHLTIGVGAVTVRDVVAA 240 Query: 214 FADYVLQ-RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQ--PEHFKQWFGE 270 + E + D+ R + + M + E + + + Sbjct: 241 IVAELDDCAEFRQSLPLGIDLTGRDQTVPIATKAMAAVVERLRDHAADVGEGAAARLARR 300 Query: 271 FISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGL----RVLRIGDDVYANGE-KI 325 +++R P P + I + +R G + RV + + GE I Sbjct: 301 HTARTRP----VPVRPLATLEAIGVVNAATRLRLRHGLVPTLRRVADRAELLT--GERSI 354 Query: 326 DSPHR--PALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 +P AL + S ++A L+ +L L+ G Sbjct: 355 STPGYCLEALRTIRSGEIVSAAEL-PGLDAADGTVLLRRLLAEG 397 >UniRef50_Q28VG0 Cupin 4 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28VG0_JANSC Length = 392 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. 
Identities = 67/363 (18%), Positives = 127/363 (34%), Gaps = 39/363 (10%) Query: 1 MEYQLTLNW-------PDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGL--AMESEVD 50 M + +W F +++K+P+++KRG F D +S E+ + M V Sbjct: 1 MTNTFSFDWAIAPETPDTFFAEYFEKKPMLIKRGQPGYFSDLLSYGEIDRVVSTMGLHVP 60 Query: 51 SRLVSHQDGKWQVSHGPFES-------YDHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 V+ DG + +E+ + L ++++ ++ A R Sbjct: 61 EINVTRADGNITPADFAYETGQIDPVRVNQLHADGATVILSGLHERLPALARYCRAMEAA 120 Query: 104 PDWRIDDLMISFSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV 162 R+ I + PG G PH D +DV ++Q G + WR+ + Sbjct: 121 MSARVQTN-IYMTPPGNQGFNPHYDGHDVLVLQVAGTKEWRIYGTPVELPLADQAFERGM 179 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQR 221 D E LEPGD +YIP G H+ A + +++ + G + +AD + + Sbjct: 180 DVGEEAQRFVLEPGDAVYIPRGMAHDAVATDETSLHITTGL-------MFRTWADALAEA 232 Query: 222 ELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI 281 + + +P + P ++L + + F + +L Sbjct: 233 VIAKAH-REPSLRRALPPG---------FANNGVDLDDYKDTFAELIELVGDAHVGKLLS 282 Query: 282 APPEPPYQPDEIYDALKQGEVLVRLGGLRV-LRIGDDVYANGEKIDSPHRPALDALASNI 340 E Q L +L GL + R+G + D P+ + +A Sbjct: 283 GFREEFLTARVPRVE-GQMAQLAKLDGLTMDSRMGAHPHIVFGIHDVPNEDQVCLVAQGA 341 Query: 341 ALT 343 + Sbjct: 342 EII 344 >UniRef50_B4B491 Cupin 4 family protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B491_9CHRO Length = 390 Score = 174 bits (441), Expect = 5e-42, Method: Composition-based stats. 
Identities = 50/278 (17%), Positives = 108/278 (38%), Gaps = 12/278 (4%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPI----SPDELAGLAMESEVDSRLVSHQDGKWQ 62 + LE++W+K P+++ R ++ + + D + L D L+ Sbjct: 15 IEPTTLLEKYWEKSPLLVARNHPDYYSELISLKNIDSILRLYGPKSSDVDLIKENSFFSA 74 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 F +SL+++ ++ +P + L + + + + S G Sbjct: 75 GGEVDFNQIYQAYSLGYSLVMRKIHERWQPLSVLHKNLEAFLNHPVGINLYMTSKNSQGF 134 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEK---LQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 H D +DVFI+Q G ++W++ + L + + D + L GD+L Sbjct: 135 KAHFDTHDVFILQVEGSKQWKIYDSPITLPVISDLKYTDKFINQLKSPTAEYCLNKGDLL 194 Query: 180 YIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 YIP G+ HE Y + +++ +VG + +LI+ + Q+E+ + Sbjct: 195 YIPRGYIHEVYTDNSFSVHLTVGIHSLKWFDLINSAVTKLAQKEVRFRESLPVGFLRQEE 254 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQ----WFGEFI 272 + L + +L +++ E E + + G+ Sbjct: 255 AEESLKNQFQELLKLLAEQSEVEEAVEDIAQGFLGKMS 292 >UniRef50_B4GUZ2 Lysine-specific demethylase NO66 n=2 Tax=Drosophila persimilis RepID=NO66_DROPE Length = 687 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. 
Identities = 53/331 (16%), Positives = 118/331 (35%), Gaps = 29/331 (8%) Query: 7 LNWPDFLERHWQKRPVVLKRGF-NNFIDPISPDELAGLAMESEVD----SRLVSHQDGKW 61 + FL HW+K P +K F + IS + + +++ V+ + S++DG Sbjct: 260 MTMATFLRDHWEKSPFRVKTTTSGGFSNLISFKMIDQMLIQNHVEYTTNIDVTSYEDGVR 319 Query: 62 QVSHG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + P + H + S+ + + + L +E + + Sbjct: 320 KTLNPDGRALPPSVWAHY-QRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTP 378 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++RWR+ + +L Q + + I+D L+PG Sbjct: 379 PESQGFAPHYDDIEAFVLQVEGKKRWRIYAPTKELPRESSGNLSQTELGDPIMDIVLKPG 438 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYV--------------LQRE 222 D+LY P G+ H+ +++ + + A + + + + L++ Sbjct: 439 DLLYFPRGWIHQAITEKDSHSLHITLSAYQQQSY-ANLMEKLMPLVVKESVEQTLKLRKG 497 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIA 282 L + + + V + + ++ + L+ + + + +HE A Sbjct: 498 LPLDIFQNLGVANAEWKGAHRQKLIQHIQNLAQRLVPTEGQIDRALDQLAIKFQHE---A 554 Query: 283 PPEPPYQPDEIYDALKQGEVLVRLGGLRVLR 313 P + R G + Sbjct: 555 LPPTIAPQELKRTVFGAQATADRNGHCSLDY 585 >UniRef50_UPI000186D1B6 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D1B6 Length = 467 Score = 172 bits (437), Expect = 2e-41, Method: Composition-based stats. 
Identities = 43/247 (17%), Positives = 89/247 (36%), Gaps = 23/247 (9%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 + +F E W+K+P+ + R N + + + A+ + D + D + Sbjct: 31 IKVKEFFENFWEKKPLYISRNNNEYYNELCSMNAFEKALSEK-DMYFTKNIDVTSYIDGQ 89 Query: 67 PFES-----------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 F +D E S+ + + L +E + + M + Sbjct: 90 RFTENLDGKATVSNIWDFFNEGK-SIRLLNPQTFIPNVWLLNTNLQEFFNCFVGANM--Y 146 Query: 116 SVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIID 170 P G G PH D + F++Q G++ W+V + + + + + I+ Sbjct: 147 LTPAGTQGFAPHYDDIEAFVLQLEGQKHWKVYNPRDSSEVLARESSKNFKEDEIGKPILK 206 Query: 171 EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNY 227 L+PGD+LY P G+ H+ L + + + + +L LQ+ + + Sbjct: 207 VTLKPGDMLYFPRGYIHQAKCLPDTHSLHLTVSCYQKNSWADLFEKIFPVALQKAIFEDV 266 Query: 228 YSDPDVP 234 +P Sbjct: 267 EFRKGLP 273 >UniRef50_B7PMB0 MYC-induced nuclear antigen, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PMB0_IXOSC Length = 472 Score = 172 bits (437), Expect = 2e-41, Method: Composition-based stats. 
Identities = 59/288 (20%), Positives = 113/288 (39%), Gaps = 30/288 (10%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVL--KRGFNNFID-PISPDELAGLAMESEV----DSR 52 ME+ L+ L++ +F E++W++ P V + G F S D +A E+++ D Sbjct: 16 MEFLLSPLSYKEFSEKYWEREPFVAHDRAGMRAFWPQLFSKDAFFSIAKETKLYFGKDVS 75 Query: 53 LVSHQDGK-------WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 ++DGK + Y E +L V W + ++ Sbjct: 76 ACKYEDGKRSDYAEGYSAKSAKLNKY--FEERKATLQVHQPQRWKDSLWEVLELMERFFG 133 Query: 106 WRIDDLMISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD 163 + ++ P G G+ PH D DVFI+Q G + W++ + + D + Sbjct: 134 CLVG--CNAYITPAGSQGLAPHHD--DVFIVQLEGEKCWKLHKPVTELARIYSKDFTSEE 189 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGY----ALENAMNYSV-GFRAPNTRELISGFADYV 218 E + L PGD LY+P G H Y A ++ + ++ ++ + + A + Sbjct: 190 IGEPTHEFTLRPGDFLYMPRGTIHHAYVPESADSHSTHITISTYQKQTVGDCLMDIAPDL 249 Query: 219 LQRELGGNYYSDPDVPPRAHPADVLPQE--MDKLREMMLELINQPEHF 264 + + +P R P+ VL +E + L ++ + P+ Sbjct: 250 ISSAMDSCIELRKGLPNRFLPSCVLSKETVVTALSSVLEHVKQMPDEM 297 >UniRef50_B5DUH6 Lysine-specific demethylase NO66 n=2 Tax=Drosophila pseudoobscura pseudoobscura RepID=NO66_DROPS Length = 946 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. 
Identities = 49/296 (16%), Positives = 111/296 (37%), Gaps = 26/296 (8%) Query: 7 LNWPDFLERHWQKRPV-VLKRGFNNFIDPISPDELAGLAMESEVD----SRLVSHQDGKW 61 + FL HW+K P V+ F + IS + + +++ V+ + S++DG Sbjct: 519 MTMATFLRDHWEKSPFRVITTTSGGFSNLISFKMIDKMLIQNHVEYTTNIDVTSYEDGVR 578 Query: 62 QVSHG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + P + H + S+ + + + L +E + + Sbjct: 579 KTLNPDGRALPPSVWAHY-QRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTP 637 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++RWR+ + +L Q + + I+D L PG Sbjct: 638 PESQGFAPHYDDIEAFVLQVEGKKRWRIYAPTKELPRESSGNLSQTELGDPIMDIVLMPG 697 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYV--------------LQRE 222 D+LY P G+ H+ +++ + + A + + + + L++ Sbjct: 698 DLLYFPRGWIHQAITEKDSHSLHITLSAYQQQSY-ANLMEKLMPLVVKESVEQTLKLRKG 756 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 L + + + V + + ++ + L+ + + + +HE Sbjct: 757 LPLDIFQNLGVANAEWNGVHRQKLIQHIQNLAQRLMPTEGQIDRALDQLAIKFQHE 812 >UniRef50_Q10ZZ1 Cupin 4 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZZ1_TRIEI Length = 385 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. 
Identities = 71/376 (18%), Positives = 137/376 (36%), Gaps = 28/376 (7%) Query: 7 LNWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGLAMESEV---DSRLVSHQDGKWQ 62 L +FLE +W K+ + + G +F D S ++L L ++ D RL + Sbjct: 11 LKQEEFLENNWTKKAIAISNKGEKDFTDLFSWEKLNYLLNFHQIKYPDVRLAFDGKVLEE 70 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 + F + E +L++ ++ A + G Sbjct: 71 KENRNFTQW---CEKGATLILDQIHRRIPEVAIFTSKLSYELGYPTQVNAYCSWSSKKGF 127 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP--FEAIIDEELEPGDILY 180 PH D +DVFI+Q G ++W V K P+ P EA + L PGD+LY Sbjct: 128 SPHYDTHDVFILQVEGNKQWYVYND-TFKYPLPNQKSSSFTPPEKEAYLSCILHPGDVLY 186 Query: 181 IPPGFPHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 IP G H E +++ ++G + +L+ + RE + + + Sbjct: 187 IPRGHWHYAVTKEEPSIHLTLGIHSSTGVDLLEWLIGQLQYRE---EWRTSLALRIDDTS 243 Query: 240 ADVLPQEMDKLREMMLELINQ---PEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 +V ++ L + + E IN E + + +++ + ++ + D Sbjct: 244 FNVS---VENLIKDLKEYINNHNISEEYNNYLDG-LAKPFEQYNLPYQAGFHIFHRDIDT 299 Query: 297 LKQGEVLVRLGGLRVLRIGDD-VYANGEKIDSPHRP--ALDALASNIALTAEN----FGD 349 + RL ++ + +G+++ P + L S T ++ D Sbjct: 300 KFKVSQFQRLKISKMADDDGYKILVSGKEVSIRGVPEYLVKNLFSRETFTGKDIINLLPD 359 Query: 350 ALEDPSFLAMLAALVN 365 + + ML+ LVN Sbjct: 360 YDWEIDIMPMLSKLVN 375 >UniRef50_B8C536 Putative uncharacterized protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C536_THAPS Length = 204 Score = 172 bits (435), Expect = 3e-41, Method: Composition-based stats. 
Identities = 64/204 (31%), Positives = 104/204 (50%), Gaps = 24/204 (11%) Query: 52 RLVSHQ---DGKWQVSHGPFESYDHLG-----------ETNWSLLVQAVNHWHEPTAALM 97 R++SH D ++++ GP + G E +L+V ++ ++ P A + Sbjct: 1 RVISHSPGDDSSYELTWGPLSDAEFHGWMAKVTSPNNNEQRETLVVNDIDRFYPPLADWI 60 Query: 98 -RPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-----KLQMK 151 + LP WR+DD IS + GG+GPH+D YDVF+IQ +G R W+VG K +M Sbjct: 61 HDTYHFLPRWRMDDGQISLAEQSGGIGPHVDNYDVFLIQMSGTRAWQVGRKELSTKEEMD 120 Query: 152 QHCPHPDLLQVDPFEAII-DEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRE 209 + D+ ++ + + + + L+PGD+LY+PP H G AL + M SVG RAP+ + Sbjct: 121 RMIEGLDVRVLENWASEMEEWVLQPGDMLYLPPRVAHCGTALSDGCMTLSVGCRAPSVSD 180 Query: 210 LISGFADYVLQRELG--GNYYSDP 231 L+S A+ Y+D Sbjct: 181 LMSRLAENFSGSIEDYATRRYTDA 204 >UniRef50_D2A374 Putative uncharacterized protein GLEAN_07936 n=1 Tax=Tribolium castaneum RepID=D2A374_TRICA Length = 568 Score = 170 bits (432), Expect = 6e-41, Method: Composition-based stats. Identities = 56/251 (22%), Positives = 109/251 (43%), Gaps = 30/251 (11%) Query: 7 LNWPDFLERHWQKRPVVLKRG----FNNFIDPISPDEL---AGLAMESEVDSRLVSHQDG 59 L+ F + +W+++P+ +KRG + + +D S D++ L VD +V++++G Sbjct: 142 LSPASFFKTYWEQKPLYIKRGNRSYYTHILDSSSLDKILRNNSLFFTRNVD--VVTYENG 199 Query: 60 KWQVSHG-----PFESYDHLGETNWSLLV---QAVNH-WHEPTAALMRPFRELPDWRIDD 110 + QV + P +D+ G S+ V Q NH H A L F + Sbjct: 200 EKQVFNQEGRATPSALWDYYG-NGCSIRVLNPQTYNHKVHLLLATLQEYFGTMVGA---- 254 Query: 111 LMISFSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKL--QMKQHCPHPDLLQVDPFEA 167 + + PG G PH D + F++Q GR+ W++ + + P+ + D E Sbjct: 255 -NVYLTPPGSQGFAPHYDDIEAFVVQLEGRKHWKLYQPKSEDVLARFSSPNFKREDLGEP 313 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELG 224 ++ L G++LY P G HEG E++ + + ++ + +L+ L++ Sbjct: 314 FMELTLNAGELLYFPRGTIHEGRTDEDSHSLHITVSVYQQTSYVDLLEHILPKALKKAAD 373 Query: 225 GNYYSDPDVPP 235 + +P Sbjct: 374 SDVEFRKGLPL 384 >UniRef50_UPI0000E45D23 PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D23 Length = 555 Score = 168 bits (427), 
Expect = 2e-40, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 96/240 (40%), Gaps = 17/240 (7%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEV----DSRLVSHQDGKW 61 D+ + ++++P+ LKR F D S EL+ + E++V + + ++ DGK Sbjct: 105 FKVEDYFKNIFERKPLFLKRHKPGYFTDIFSSKELSNILKENDVQFTRNIDVTTYTDGKR 164 Query: 62 QVSH-----GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + P +D+ S+ V + L+ +E + I + Sbjct: 165 ETHNPTGRAQPQVVWDYYN-NGCSVRVLNPQTYSTRVWQLLAALQEFFGCFVGAN-IYLT 222 Query: 117 VPG-GGVGPHLDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEAIIDEE 172 PG G PH D + F++Q G++ W++ ++ + D + I+D Sbjct: 223 PPGTQGFAPHYDDIEAFVLQLEGKKHWKLYNQRSPAEVLPRFSSSNFTDADIGQPILDTT 282 Query: 173 LEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 LEPGD+LY P G H+ + + A + ++VL R L + D Sbjct: 283 LEPGDLLYFPRGVIHQASTPSETHSLHITISAC-QKNTWGDLMEHVLTRALHVVREENID 341 >UniRef50_D0NRY0 Nucleolar protein, putative n=2 Tax=Phytophthora infestans T30-4 RepID=D0NRY0_PHYIN Length = 676 Score = 167 bits (423), Expect = 6e-40, Method: Composition-based stats. 
Identities = 63/347 (18%), Positives = 123/347 (35%), Gaps = 45/347 (12%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDP-ISPDELAGL----AMESEVDSRLVSHQDGKWQ 62 +F E +W++RP+ +KR F ++ D S E+ + +E D L + D Sbjct: 70 TPEEFYENYWEQRPLAIKRNFPSYYDGWFSKQEIDRILKTHTLEYGTDVDLTKYVDDTRH 129 Query: 63 VSHGP----FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 + P + + S+ + + + L+ + +W ++ P Sbjct: 130 TLNPPGSATAKQVWKHYDDGCSVRLLCPQKFSDDVWKLLATLED--EWGCMAGANTYLTP 187 Query: 119 G--GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIIDEEL 173 G PH D + F++Q G + W+V + L P + D + ++ +L Sbjct: 188 KNTQGFAPHFDDIEAFLLQTEGCKHWKVYKPLNESDVLARYPSGNFKAEDLGKPTLEVDL 247 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 E GD+LY P GF H+ A + + + + + + F + +L + L G ++ ++ Sbjct: 248 EQGDLLYFPRGFIHQARAHKEKHSLHLTV-STGQQNTMGNFLEVLLPQALAGAINTNVEL 306 Query: 234 PPRAHPADVL------------PQEMDKLREMMLELINQPEHFKQWFGEFISQSRH---- 277 R+ P D L E + + + G + S Sbjct: 307 -RRSLPRDYLEYMGVMHSDRKGDPERQAFANKLKGALK--TVLGEAMGMLDAASDQMAKN 363 Query: 278 -ELDIAPPEPPYQPDEIYDA--------LKQGEVLVRLGGLRVLRIG 315 LD PP + + + LVR G R++ Sbjct: 364 FLLDRLPPALEDEEENCTSDNSPLQKITVNTQLKLVRHGVARLVIED 410 >UniRef50_B0CEG8 Cupin 4 family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CEG8_ACAM1 Length = 416 Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats. 
Identities = 67/391 (17%), Positives = 138/391 (35%), Gaps = 32/391 (8%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEV------DSRLVSHQD--- 58 DF + +W+ + + L R +F + E L ++++ + RLV + Sbjct: 26 TIEDFFQTYWETKTLYLPRNDASFYGSVLQPEDIDLLLQNKALLADYNNFRLVDQGNKLS 85 Query: 59 -GKWQVSHGPFESYDHLGETNWSLLVQAV-------NHWHEPTAALMRPFRELPDWRIDD 110 W H + Y + +SLL Q + + +++ Sbjct: 86 LEDWCDRHSKSQQYFINNDKLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRT 145 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAII 169 + G+ PH D++DVFI+Q TG + W++ +++ H + + E + Sbjct: 146 NIYITPPQAQGLAPHYDEHDVFILQITGEKEWKLYHSPVELPSHIRDQSIGRHKLAEPEL 205 Query: 170 DEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYY 228 L+PGD+LYIP G H+ + E +++ S+G EL+ + + Sbjct: 206 TVMLQPGDLLYIPRGVVHQAASQETTSVHASLGLYPTFAYELLEELVT--IAQADPAFRK 263 Query: 229 SDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY 288 + P + + L + ++ + E ++ F+ + E + Y Sbjct: 264 AIPHGFSSSEQQQAFYELFQTLSQYLISNVKTEELVERKHKVFLCDRKSEDQGRFQDLIY 323 Query: 289 QPDEIYDALKQGEVLVRLGGLRVLRIGD------DVYANGEKIDSPHRPALDALASNIAL 342 P L V+ R + D + Y +L L + L Sbjct: 324 LPQ-----LNLNSVVARRPNILFSVDRDSTQIVLNFYQKSLTFPIFLATSLTDLIDHPYL 378 Query: 343 TAENFGDALEDPSFLAMLAALVNSGYWFFEG 373 ++ G + D L++ L+ G+ + Sbjct: 379 AVKDIGGLINDAGRLSLAQNLIQEGFLMIKA 409 >UniRef50_B4L6Q5 Lysine-specific demethylase NO66 n=1 Tax=Drosophila mojavensis RepID=NO66_DROMO Length = 888 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. 
Identities = 66/374 (17%), Positives = 133/374 (35%), Gaps = 40/374 (10%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVD----SRLV 54 + + L + F E++W++ +KR N F IS + + +E++++ + Sbjct: 476 LNWLLNPITSETFFEQYWERNACQVKRKQPNYFTQLISFQMIDEMLIENQLEFTTNIDVT 535 Query: 55 SHQDGKWQVSHGPFES------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRI 108 +++ G Q + P + + G+ S+ + + + L +E + Sbjct: 536 TYKKGVRQTLN-PVGRAMSPAIWGYYGD-GCSIRILNPSTYLPKLRQLCSTMQEFFHCLV 593 Query: 109 DDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPF 165 + G PH D + F+IQ GR+RWR+ + Q + Sbjct: 594 GANVYLTPPNSQGFAPHYDDIEAFVIQVEGRKRWRLYAPPHQSDVLARTSSGNYKQEELG 653 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRE 222 + + D LE GDILY P G H+ + + ++ L+ VL+R Sbjct: 654 QPLFDAVLEAGDILYFPRGTVHQAVTEPKQHSLHITLSVYQQQAYANLLEVLMPSVLERA 713 Query: 223 ----------LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELI-NQPEHFKQWFGEF 271 L + + + ++ Q M+ ++++ + + Sbjct: 714 IKHHLSLRRGLPLHIWQHVGLAKGGQQSEQRDQLMNSTKQLVQRYLVPTEAQIDAAVDQL 773 Query: 272 ISQSRHEL--DIAPPEPPYQPDEIYDALKQG-EVLVRLGGLRVLRIGDDV--YANGE--- 323 + +HE PE + ++ +Q R LRV D+ Y E Sbjct: 774 AKRFQHEALPPYIKPEESMRTTKVRLLRRQHPAPGGRRQQLRVYYYVDNALEYCKNEPNY 833 Query: 324 -KIDSPHRPALDAL 336 +I PA++AL Sbjct: 834 MEIQPTEAPAVEAL 847 >UniRef50_A9TET4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TET4_PHYPA Length = 530 Score = 164 bits (416), Expect = 4e-39, Method: Composition-based stats. 
Identities = 83/426 (19%), Positives = 155/426 (36%), Gaps = 64/426 (15%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRG-----FNNFIDPISPDEL-AGLAMESEVDSRL 53 +E+ ++ + F W+K+P +++R + D + ++L ++ ++ + Sbjct: 105 LEWAISPIKLDRFQGEFWEKKPFLVRRPKNRNYYAGIFDKATIEKLLEEHELKYGLNIDV 164 Query: 54 VSHQ-DGKWQVSHG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 + DG P + + + WS+ + W +P ++ F W Sbjct: 165 TKYDIDGGRSTFSSEGSATPSKVWSKYAD-GWSVRILHPQRWCDPVFLILSAFERF--WG 221 Query: 108 IDDLMISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQV 162 ++ P G G PH D + F+IQ GR+RW+V + + P P+ Q Sbjct: 222 SVAGCNAYLTPAGSQGFSPHYDDIEAFVIQTEGRKRWKVYKPRTPGEALPRFSSPNFEQG 281 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG--------------FRAPNTR 208 + E I+D +LEPGDILY+P G H+ A E+A + + F P Sbjct: 282 EIGEPILDVDLEPGDILYMPRGTIHQAKASEDAHSLHITVSVGQRNCWGDFLEFAMPRAL 341 Query: 209 ELISGFADYVLQRE---------LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN 259 EL S D++L RE +G + D D P RA D + + M + + + Sbjct: 342 ELASE--DHILLRESLPRGYADYMGVAHSDDHDNPQRAAFIDKIMECMAIVSQSIPWDSA 399 Query: 260 QPEHFKQWFGEFISQSRHELDIAPPEPPYQ---------PDEIYDALKQGEVLVRLG--G 308 + ++ + + PD L+ V+V Sbjct: 400 ADQLAVKFLQSRLPLPAPANAVHGKGQKITGKSRVRLVAPDVARLVLEGDSVVVYHMLKN 459 Query: 309 LRVLRIGDDVYANGEK-------IDSPHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 R L D N + P L+ L S + ++ + + +++ Sbjct: 460 SRDLHNEGDTEENEDSDADKRLVFTWEVAPVLEELLSAFPDPVDVSSLSVPEDESIPLIS 519 Query: 362 ALVNSG 367 L + G Sbjct: 520 ELCDIG 525 >UniRef50_A0YJB4 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YJB4_9CYAN Length = 386 Score = 164 bits (416), Expect = 4e-39, Method: Composition-based stats. 
Identities = 57/368 (15%), Positives = 129/368 (35%), Gaps = 18/368 (4%) Query: 12 FLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFES 70 FL+++W K+ +++ + F S + L + + + +++ Sbjct: 16 FLQKNWLKQALIISGYSPHKFSHLFSWQDFNTLLNFHHLTYPEIRLAKSGQTLPENAYDN 75 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 + ++++ ++ A R R G H D ++ Sbjct: 76 LIKSCQDGATVIIDSLQTRLPVIAEFTANLRNELGHRTQINAYCSFPGSQGFACHYDSHE 135 Query: 131 VFIIQGTGRRRWRVGEKLQMKQHCPHP-DLLQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 VFI+Q +G + WRV H +LL + I++ L+PGD+LYIP G H Sbjct: 136 VFILQISGDKHWRVFSPTFEFPLSKHRSNLLDPPTTDPYINQVLKPGDLLYIPRGHWHYA 195 Query: 190 YALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMD 248 A++ +++ ++G + + + + + L + + H Q + Sbjct: 196 VAVDQPSLHLTLGVDCQTGIDFVEWLTEELQENPL---WRQSLPLLNSTHRQACS-QHLR 251 Query: 249 KLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGG 308 L + + + E ++ E Q + P ++ K+ + Sbjct: 252 DLIQKWIGQLETDEFIDRYLDEQFIQGQPSTQFGFPAQVGFS--LFPEDKKTQFYRPPKP 309 Query: 309 LRVLRIGDD----VYANGE-KIDSPHRPALDALASNIALTAENFGDAL----EDPSFLAM 359 +++ + +D AN + R ++ L + + + L + AM Sbjct: 310 VKITNLPEDHIEICTANKRITLKGISREVIERLFQQTRFSGIDLCNWLPEFDWEADICAM 369 Query: 360 LAALVNSG 367 + L+ SG Sbjct: 370 MTQLILSG 377 >UniRef50_Q849M1 Putative uncharacterized protein pSV2.19c n=3 Tax=Streptomyces RepID=Q849M1_STRVN Length = 390 Score = 164 bits (416), Expect = 4e-39, Method: Composition-based stats. 
Identities = 80/362 (22%), Positives = 139/362 (38%), Gaps = 45/362 (12%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVD------SR----------- 52 DFL + + + + ++ D+L + ++ SR Sbjct: 12 EDFLAQALHREHRHIPGAL-DVAGLMTFDDLNQILATHRLEPPRMRLSRDGETLLVGGYT 70 Query: 53 --LVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + + + W H P E + L E SL + +V+ H P A L R+ Sbjct: 71 TPVATRRHTVWHRLH-PAELHTRLTE-GASLALDSVDELHPPIARLCEAIERELHTRVQA 128 Query: 111 LMI-SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAII 169 + S+S G G H D +D I+Q G +RWR+ + P E + Sbjct: 129 NLYASWSATE-GFGVHWDDHDTVIVQLDGAKRWRIYGTTRPFPLYRDIADPGEAPTEPVA 187 Query: 170 DEELEPGDILYIPPGFPHEGYALEN--AMNYSVGFRAPNTRELISGFADYVLQRELGGNY 227 D L PGD+LY+P G H A + +++ + G + +L++ ++ +L E ++ Sbjct: 188 DLVLWPGDVLYVPRGVWHAVSADQGVRSLHVTCGLQTHTATDLMAWVSEQLLTHE---DW 244 Query: 228 YSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPP 287 D +P A P DV +D +R+ + EL++ P ++ Q+ + P P Sbjct: 245 RRD--LPLLAAP-DVQADAVDGMRKRLAELLDDPTLLARYRTAMDGQAVGRM---VPSLP 298 Query: 288 YQPDEIYDALKQGEVLVRLGGLR-VLRIGDD---VYANGEKIDSP--HRPALDALASNIA 341 Y D G + VRL R VL +G+D + A G + L L Sbjct: 299 YIDGIPVD----GALRVRLTTARAVLDVGEDTVTLSAAGSTFEFAPEAEAVLRPLVDGRT 354 Query: 342 LT 343 + Sbjct: 355 VD 356 >UniRef50_UPI0000E4684D PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4684D Length = 511 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. 
Identities = 61/330 (18%), Positives = 122/330 (36%), Gaps = 18/330 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN---FIDPISPDELAGLAMESEV----DSRLVSHQDG 59 + F+E +W+K+P+V+ + F + L GL E ++ D + ++ Sbjct: 71 MKIETFMEEYWEKQPIVISNREKHRDYFQSLFTRTILEGLVAEKKISFIQDCNVCRYKGE 130 Query: 60 KWQVSHG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 + +G P + + L + ++ + E L+ + + Sbjct: 131 VRESLNGNGIVKPTKLKELLDQDKATIQFHQPQRFQESVWNLLEKLESYFGCLVGSNIYM 190 Query: 115 FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELE 174 G+ PH D +VF++Q G + WR+ + + DL Q + E D L+ Sbjct: 191 TPKLSQGLAPHYDDVEVFVLQLEGEKHWRLYKPPTLLPRDYSRDLDQSELGEPTHDIVLK 250 Query: 175 PGDILYIPPGFPHEGYALENA---MNYSV-GFRAPNTRELISGFADYVLQRELGGNYYSD 230 GD++Y P G H+ + ++ ++ + +L+S ++Q + N Sbjct: 251 AGDLMYFPRGTVHQADTPSTCSHSTHLTISTYQRSSWGDLLSIALPSMIQTAISENVSYR 310 Query: 231 PDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP 290 +P R + M +LI+Q + G+ + IA PP+ Sbjct: 311 QGLPLRFVSDPNHQASSSETGRKMADLISQLSSHIESHGQEAANEMLCDFIANRLPPFCD 370 Query: 291 DEIYDALKQGEVLVRLGGLRVLRIGDDVYA 320 E D +G + +R LR D + Sbjct: 371 GE-TDLAPRGPMPSSDQQVR-LRFPDHTFI 398 >UniRef50_Q31RB4 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31RB4_SYNE7 Length = 428 Score = 162 bits (411), Expect = 2e-38, Method: Composition-based stats. 
Identities = 76/384 (19%), Positives = 137/384 (35%), Gaps = 36/384 (9%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAM-ESEVDSRLVSHQDGKWQVSHGPFES 70 FL +W ++ V + F S + L L ++ +S L +DG+ + Sbjct: 16 FLSHYWAQQSVYIAGDSLRFQSLFSWNHLNDLLNYQTFRESELRFSRDGESLPAGDNPTL 75 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 + + +L++ V+H L R+ +R + S G H D +D Sbjct: 76 WRSRLQEGATLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHYDTHD 135 Query: 131 VFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE-AIIDEELEPGDILYIPPGFPHEG 189 V I+Q G + W + + P ++ P E + + L PGD+LYIP G H Sbjct: 136 VLILQIEGEKEWLIYPETLPYPTADQPSYDRLPPEEPPYLQQVLSPGDLLYIPRGHWHYA 195 Query: 190 YALEN-AMNYSVGFRAPNTRELISGFADYVLQR-------ELGGNYYSDPDVPPRAHPAD 241 A E +++ ++G + ++ + + L G+ DP + R H Sbjct: 196 IAQETASLHLTIGIHTATGLDWVNWLQQQLRDQPHWRQGLPLAGSCNFDP-LKLRGH--- 251 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI--------APPEPPYQPDEI 293 ++ LR+ ++ + +P+ + Q + L I P I Sbjct: 252 -----LESLRDQLITYLQEPQAIDDYLQYLSWQDQPHLPIQLPLQLHGDPLAQGLLGKFI 306 Query: 294 YDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALED 353 + L + ++VL + G I R L S T + GD D Sbjct: 307 WSPLHSLQWQAEDDQIKVLIGSKQMVFKGLPISLAMR-----LFSCRQFTLMDLGDWAPD 361 Query: 354 PSF----LAMLAALVNSGYWFFEG 373 F +L L+ +G F E Sbjct: 362 LDFESAIAPLLQKLILAGILFVEA 385 >UniRef50_A1R1T1 Putative cupin superfamily protein n=2 Tax=Micrococcineae RepID=A1R1T1_ARTAT Length = 388 Score = 160 bits (406), Expect = 6e-38, Method: Composition-based stats. 
Identities = 74/403 (18%), Positives = 137/403 (33%), Gaps = 63/403 (15%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 + + + F W + ++ + G +F D S D + L + + + G Sbjct: 9 TRLIDIGYEKFASDVWGRTALLTR-GVGDFSDLFSADAVDELISRRGLRTPFLRVAKGGS 67 Query: 62 QVSHGPFESYDHLGET----------------NWSLLVQAVNHWHEPTAALMRPFRELPD 105 + F S +G T +L++QA++ EP ++ Sbjct: 68 TLPESSFTSPAGVGATISDQLDDTQLWRKFADGATLVLQALHRTWEPVSSFSTQLSTELG 127 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------DL 159 + G H D +DVF++Q G +RW + E + + P + Sbjct: 128 HPVQANAYITPPQNRGFDDHYDVHDVFVLQIEGTKRWIIHEPVHVDPLRSQPWTDRRSAV 187 Query: 160 LQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYV 218 + +A ID LEPGD+LY+P G+ H A +++ ++G + L A Sbjct: 188 AEAAQGKAYIDTVLEPGDVLYLPRGWLHAAEAQGKVSIHLTLGVHSWTRHALAEHLAQAA 247 Query: 219 LQRELGGNYYSDPDVPPRAHPADVL--PQEMDKLREMMLELINQPEHFKQWFGEFISQSR 276 L DP+V R+ P V +E+ +RE + + + + + Q R Sbjct: 248 LAALCD-----DPEV-RRSLPLGVDGPDEEIAAVRERLAAAVLEADTTSLFHRTRRGQGR 301 Query: 277 H----------ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKID 326 LD PE + E +A +G L RV + Sbjct: 302 PAPLGPVAQLAALDGLGPESLVRLREALEARLEGSRL----TTRVGWLD---------FP 348 Query: 327 SPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYW 369 + P++ L L + ++ L+ +G Sbjct: 349 EANLPSVRRLLDG--------EPHLASDLGVELVERLLRAGVL 383 >UniRef50_B4Q068 Lysine-specific demethylase NO66 n=5 Tax=Sophophora RepID=NO66_DROYA Length = 683 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. 
Identities = 64/360 (17%), Positives = 132/360 (36%), Gaps = 44/360 (12%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLAMESEVD----SRLV 54 +++ L + F + W+ V++R ++ IS + + + +D + Sbjct: 247 LQWLLNPIKVNHFFDDFWEHTAFVVQRKNPHYYSKLISFKMIDEMLVRHRLDFTINVDVT 306 Query: 55 SHQDGKWQVSHGPFESYD----HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 ++++GK + + + L S+ + + + + +E + Sbjct: 307 TYKNGKRETLNPEGRALPPVVWGLYSEGCSIRILNPSTYLVGLRQVCSIMQEFFHCLVGA 366 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC---PHPDLLQVDPFEA 167 + G PH D + F+IQ GR+RWR+ E + Q E Sbjct: 367 NVYLTPPNSQGFAPHYDDIEAFVIQVEGRKRWRLYEPPSGSDQLCRNSSSNFDQEQLGEP 426 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTR---ELISGFADYVLQRELG 224 I+DE LE GD+LY P G H+ E + + + L+ VL++ + Sbjct: 427 ILDEVLEAGDLLYFPRGTVHQAITEEEQHSLHITLSVYQQQAYVNLLEKLMPIVLKKAIK 486 Query: 225 GNYYSDPDVP----------PRAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFIS 273 + +P RA+ +D Q ++ +++++ + L+ + + + Sbjct: 487 QSVALRRGLPLHTFHVLGEAQRANRSDSRNQLVENVQKLVTKHLMPSAQDIDEAVDQLAK 546 Query: 274 QSRHEL--DIAPPEPPY--------QPDEIYDAL-------KQGEVLVRLGGLRVLRIGD 316 + +HE I PE DE +A+ K L+R LR++ D Sbjct: 547 KFQHEALPPIILPEEQVRTVFGSRSTADEQGNAICDYEFDTKTSVRLLRANILRLVTEED 606 >UniRef50_B4M7P8 Lysine-specific demethylase NO66 n=3 Tax=Drosophila RepID=NO66_DROVI Length = 907 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. 
Identities = 52/309 (16%), Positives = 110/309 (35%), Gaps = 30/309 (9%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVD----SRLVSHQDGKW 61 + DF ++W++ +KR N F IS + + + + ++ + ++++G Sbjct: 479 MTSDDFFSQYWERNACQVKRKQPNYFSQLISFKLIDEMLIRNHLEFTTNIDVTTYKNGMR 538 Query: 62 QVSHGPFESYD-----HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + +H P S+ + + + L +E + + Sbjct: 539 E-THNPDGRAMPPTVWGFYSDGCSIRILNPSTYLIKVRQLCAMMQEFFHCLVGANVYLTP 597 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIIDEEL 173 G PH D + F++Q GR+RWR+ L + Q + E + D L Sbjct: 598 PNSQGFAPHYDDIEAFVLQVEGRKRWRLYSPLHPSDVLARNSSGNYSQAELGEPLFDAVL 657 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRE-------- 222 EPGDILY P G H+ + + + ++ L+ VLQR Sbjct: 658 EPGDILYFPRGTVHQAVCDQQQHSLHITLSVYQQQAYANLLEELMPAVLQRAIKHHLSLR 717 Query: 223 --LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHEL 279 L + + + +++ + + + ++ + L+ + + +HE Sbjct: 718 RGLPLHIWQHLGLAKGDQKSELRDELLGNTKRLVQQYLMPSDAQIDAAVDQLAKRFQHEA 777 Query: 280 --DIAPPEP 286 + PE Sbjct: 778 LPPVVLPEE 786 >UniRef50_B4V6J8 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V6J8_9ACTO Length = 394 Score = 157 bits (398), Expect = 5e-37, Method: Composition-based stats. 
Identities = 69/332 (20%), Positives = 114/332 (34%), Gaps = 24/332 (7%) Query: 41 AGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPF 100 AG A+ + S L +++ G P + + L E SL++ A+ H P Sbjct: 63 AGGAVPATAYSILRTNRRGVSWYQPQPADFHARLAE-GASLVIDAIEQIHPPVREAAAGL 121 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL 160 + + G G H D +DV ++Q G +RW+V + + Sbjct: 122 ERFFRTPVQVNAYASWTAEEGFGTHWDDHDVVVLQLEGSKRWKVYGPTRQAPAWRDVETP 181 Query: 161 QVDPFEAIIDEELEPGDILYIPPGFPHEGYAL--ENAMNYSVGFRAPNTRELISGFADYV 218 +V + I D L PGD+LY+P G+ H A +++ + G E + D + Sbjct: 182 EVPTGDPIADIVLTPGDVLYLPRGWWHAVSADQGTASLHLTFGLATQTGAEFLGWLRDDL 241 Query: 219 LQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 + DV PR + + +R+ +L ++ P +W R Sbjct: 242 -----RASLTVRADV-PRFGTTEERADYLAAVRKDVLAALDAPAVLDRW-------ERTL 288 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYA------NGEKIDSPHRPA 332 P P + + + V+ R DD N P P Sbjct: 289 DATHPGRPRLSLPHLTGVPAEPGITVQATVPRARIDQDDQAVTFAGAGNEWTFALPVAPL 348 Query: 333 LDALASNIALTAENFGDALEDPSFLAMLAALV 364 L LA T + A E L +A LV Sbjct: 349 LRLLAGGPPATLADL--AAESDLTLVQVAELV 378 >UniRef50_B4JMQ2 Lysine-specific demethylase NO66 n=1 Tax=Drosophila grimshawi RepID=NO66_DROGR Length = 723 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. 
Identities = 50/246 (20%), Positives = 100/246 (40%), Gaps = 19/246 (7%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVD----SRLVSHQDGKW 61 L+ DF R+W+ + +KR + + D +S + + + +E+ ++ + S++DG Sbjct: 292 LSLDDFFSRYWESKACQVKRKRKDLYSDLVSFEMIDEMLIENHLEFTTNIDVTSYKDGVR 351 Query: 62 QVSHGPF------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 Q +H P + H + S+ + + + + + +E + + Sbjct: 352 Q-THNPDGRAMPPTVWGHYSD-GCSVRILNPSTYLKGLRGVCAALQEHFHCLVGANVYLT 409 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEAIIDEE 172 G PH D + F++Q GR+RWR+ + + +L Q + I DE Sbjct: 410 PPNSQGFAPHYDDIEAFVLQVEGRKRWRLYDAPSPNDVLARTSSGNLKQQQLSKPIFDEV 469 Query: 173 LEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYS 229 LE GD+LY P G H+ + + + ++ + L+ VLQ + N Sbjct: 470 LEAGDLLYFPRGCVHQAVTEQQHHSLHITLSVYQQQSYANLMEALMPAVLQNAIKHNLDM 529 Query: 230 DPDVPP 235 +P Sbjct: 530 RRGLPL 535 >UniRef50_A3M7T2 Putative uncharacterized protein n=2 Tax=Acinetobacter baumannii ATCC 17978 RepID=A3M7T2_ACIBT Length = 382 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. 
Identities = 70/349 (20%), Positives = 127/349 (36%), Gaps = 36/349 (10%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M +L+ F K+P + K ++ IS +++ L ++ R +G Sbjct: 1 MLLNFSLDKDIFKNDFLYKKPYLFKSAIDS--SGISWNDVNELYSRGDISHRDFKLMNGY 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMR------PFRELPDWRIDDLMIS 114 ESY+ LG + + + + A L+R PF + +I + Sbjct: 59 EVPKKEYIESYECLGVIEYRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEA 118 Query: 115 FSVPGG--------GVGPHLDQYDVFIIQGTGRRRWRVGEK-----LQMKQHCPHPDLLQ 161 ++ GG H D DV+ +Q GR+RW + + L M+Q PD+ Sbjct: 119 RTLVGGYAAFSSKSSYKSHWDTRDVYAVQLLGRKRWILRKPNFEFPLYMQQTKNFPDIK- 177 Query: 162 VDPFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQ 220 +P E +D LE GDILYIP G+ H+ L E + +V AP E + Q Sbjct: 178 -EPEEIYMDVILEAGDILYIPRGWWHDPLPLDEETFHLAVATFAPTGFEYMRWL-----Q 231 Query: 221 RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 + G + + + +D + E+I +++ + +++ Sbjct: 232 NIMPGILDCRKNFTNFEN----DVEMIDSFSHQVAEIIKDKNYYQSFMVHHLAEQSVPSM 287 Query: 281 IAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH 329 + + + L + L + V NG KI+ Sbjct: 288 L---SLDILGNGKINHLNHNQKLYLNASILYYFDEGFVIINGNKINIDG 333 >UniRef50_B4R4H1 Lysine-specific demethylase NO66 n=2 Tax=melanogaster subgroup RepID=NO66_DROSI Length = 847 Score = 154 bits (389), Expect = 6e-36, Method: Composition-based stats. 
Identities = 53/310 (17%), Positives = 114/310 (36%), Gaps = 31/310 (10%) Query: 12 FLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAMESEVDS----RLVSHQDGKWQVSHG 66 F + W++ +++R F IS L + + +D + ++++GK + + Sbjct: 423 FFKYFWEQTACLVQRTNPKYFQSLISFKMLDEILIRHHLDFTVNLDVTTYKNGKRETLNP 482 Query: 67 -----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 P + E S+ + + + + +E +++ M G Sbjct: 483 EGRALPPAVWGFYSE-GCSIRLLNPSAYLTRLREVCTVLQEFFHCKVEANMYLTPPNSQG 541 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEK---LQMKQHCPHPDLLQVDPFEAIIDEELEPGDI 178 PH D + F+IQ GR+RW + E + Q + IIDE L GD+ Sbjct: 542 FAPHYDDIEAFVIQVEGRKRWLLYEPPKEADHLARISSGNYDQEQLGKPIIDEVLSAGDV 601 Query: 179 LYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 LY P G H+ E + + ++ L+ VL++ + + +P Sbjct: 602 LYFPRGTVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPL 661 Query: 236 ----------RAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPP 284 +A+ ++ +++++ + LI + + + + +HE A P Sbjct: 662 HTFQVLGNAYKANDCGSRQLLVENVQKLVTKYLIPSEDDIDEAVDQMAKKFQHE---ALP 718 Query: 285 EPPYQPDEIY 294 +E+ Sbjct: 719 PIVLPSEEVR 728 >UniRef50_B0WMG3 Lysine-specific demethylase NO66 n=2 Tax=Culicini RepID=NO66_CULQU Length = 648 Score = 153 bits (388), Expect = 8e-36, Method: Composition-based stats. 
Identities = 50/327 (15%), Positives = 114/327 (34%), Gaps = 25/327 (7%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEVD----SRLVSHQDGKWQ 62 +F+ + W+K+P +++R + + +S ++ + + ++ + S+++G + Sbjct: 228 TVDEFMAQFWEKKPFLVQRNDPTYYANLLSRGKIDEMLRNNNIEYTKNLDVTSYREGVRE 287 Query: 63 VSHGPFESYD-----HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 +H P E S+ + + + +E Sbjct: 288 -THNPDGRALPPDVWAFYEEGCSIRMLNPQTYLPGVYEMNVKLQEFFHCMTGSNFYLTPP 346 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIIDEELE 174 G PH D + F++Q GR+ W++ + P+ Q + I++ LE Sbjct: 347 NSQGFAPHYDDIEAFVLQVEGRKHWKLYSPRTASEVLARVSSPNFTQEEIGVPILEVTLE 406 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSDP 231 PGD+LY P G H+ + + V ++ + +L+ + + L + + Sbjct: 407 PGDLLYFPRGIIHQASTVPGHHSLHVTMSVYQKNSWADLLELYLPHALSQAAENHLELRR 466 Query: 232 DVPP--RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQ 289 +P H V R+ +++ I + S+ ++ + +Q Sbjct: 467 GIPQDLHQHFGIVHSDNETPTRKDLIKKIKSL------VDKIFSEEAIDVAVDQLAKRFQ 520 Query: 290 PDEIYDALKQGEVLVRLGGLRVLRIGD 316 D + L E G D Sbjct: 521 HDALPPLLTDQERAQTAYGANYAFNPD 547 >UniRef50_A6W7N8 Cupin 4 family protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6W7N8_KINRD Length = 434 Score = 153 bits (387), Expect = 1e-35, Method: Composition-based stats. 
Identities = 67/375 (17%), Positives = 129/375 (34%), Gaps = 46/375 (12%) Query: 15 RHWQKRPVVLKRGF-------NNFIDPISPDELAGLAMESEVDSRLVSHQD--------- 58 HW RP++++ + D +SP ++ L + + S Sbjct: 37 EHWNTRPLLVRAADRAAEGGRASVHDLLSPADVDELLGPRALRTPFFSLVQDGTPLPRSS 96 Query: 59 -------GKWQVSHGP-FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 G Q++ P + ++++QA++ + Sbjct: 97 YTRRAVAGNQQLADLPDTDRVAAAHAGGATIVLQALHRTWPALQTFCSQLAADLGHQCQ- 155 Query: 111 LMISFSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP----F 165 + + + PG G PH D +DV ++Q GR+ W + P Sbjct: 156 VNVYVTPPGAQGFKPHHDTHDVVVLQVDGRKHWTIHPPAVELPLKSQPSTQLGPDPVGGR 215 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELG 224 ID LEPGD LY+P G+ H E+ +++ +VG A ++++ + Sbjct: 216 PPAIDTVLEPGDALYLPRGWLHSARTTEDRSIHLTVGLLATTWADVLTD----AVASAGV 271 Query: 225 GNYYSDPDVPPRAHPAD---VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI 281 + +P P V +E+ R ++ + +++ R Sbjct: 272 ADVALRRALPLPGAPGAADGVPDEEVAGFRAAAQRWLDALDDDAV--RRLVARRRSGAVP 329 Query: 282 APPEPPYQPDEIYDALKQGEVLVRLGGLRVLRI----GDDVYANGEKIDSPH--RPALDA 335 A P DE L +G L G+R + G D+ + ++ P RPAL+ Sbjct: 330 AEPVGVLAQDEAARTLAEGTALRPRRGVRSSLVPAGEGVDLVLDDRRVTFPGWLRPALEH 389 Query: 336 LASNIALTAENFGDA 350 + + +A + A Sbjct: 390 VLAAPRTSAADLAAA 404 >UniRef50_A9UZN8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UZN8_MONBE Length = 432 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. 
Identities = 59/354 (16%), Positives = 120/354 (33%), Gaps = 42/354 (11%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGL----AMESEVDSRLV 54 +E+ L ++ F +W+ +P++++R F S +L + ++ V+ + Sbjct: 21 LEWLLDPIDLKTFFSEYWETKPLLIRRKNRQRFKGLFSSQQLDDVIRSNYIKYGVNIDMA 80 Query: 55 SHQDGKWQVSH--GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + DG + G + L E S+ + + +P L+ +E + Sbjct: 81 RYSDGVRTTENPEGRVHANTMWALYEDGCSIRMLNPQTYAKPVWQLISTLQEYFQCMVGC 140 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDPFEAI 168 G PH D + I+Q G +RWR+ + + Q + E I Sbjct: 141 NTYLTPPGAQGFAPHYDDIEALILQLEGSKRWRLYNNPTGERLPRTSSRNFDQSELSEPI 200 Query: 169 IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGG 225 +D L+PGD LY P G H+ + + + + + E L + Sbjct: 201 LDVVLQPGDFLYFPRGMAHQAVSTPDEHSLHITLSTYQLFDWAEYFKKLVPAALDYAIAE 260 Query: 226 NYYSDPDVPPRA--HPADVLPQE---------MDKLREMMLELINQP-----------EH 263 + +P +A H + + MDK + + +LI+ + Sbjct: 261 DAEFREGLPLQALNHVGLLHSETEGDNQRKRFMDKAKHLFQKLIDVAPYDSAADAVAVDF 320 Query: 264 FKQWFGEFISQSRHEL-----DIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVL 312 +++ L +A P + A L+R R++ Sbjct: 321 LHASMPSYLTSEELALTSRQKQLAARSNPVELSSPALAPSDWIRLIRPSMCRLV 374 >UniRef50_B1FB07 Cupin 4 family protein n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FB07_9BURK Length = 380 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. 
Identities = 78/394 (19%), Positives = 144/394 (36%), Gaps = 53/394 (13%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 +L + F+E ++ KRP++ + N + +S +E++ E ++ G + Sbjct: 2 IELNMKASHFVENYFDKRPILFRGALRN--NFLSWEEVSEAIYIGESMTQGPRLNKGGFL 59 Query: 63 VSHGPFESYDHLGE---------------TNWSLLVQAVNHWHEPTAALMRPF-RELPDW 106 + LG+ +L+ + + + R + + Sbjct: 60 DESKYIVNCGELGQVRRRLEKGILYDELRNGTTLVFNRMELTLYKVRLICKSISRFVGEH 119 Query: 107 RIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ--HCPHPDLLQVDP 164 + + I+F G H D + VF +Q GR+RW V E H P Sbjct: 120 TVANGYIAFGEEE-SFGKHWDTHSVFAVQMMGRKRWLVYEPTHALPLKHQRSTGKQSECP 178 Query: 165 FEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVL---- 219 E +D +E GDILY+P G+ H L E + +VG + I AD ++ Sbjct: 179 AEPYMDVTIETGDILYLPRGWWHTAIPLNEETFHLAVGVHESTISDYIKYLADEIIGDFD 238 Query: 220 --QRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH 277 ++ + D D+ A+ E+ ++ E LIN + ++ F E SR Sbjct: 239 AFRQTIPLGERRDIDLRLVAN-------ELARIVEDRNVLINYNDRRRRNFRE---ASRP 288 Query: 278 ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP---HRPALD 334 L + +Q D K+ + L LR L + NG+ + R Sbjct: 289 NLQLHAFRSKFQLD------KEAKFLANTP-LRSLAFDEG--INGQSVPIGRNLERLHNF 339 Query: 335 ALASNIALTAENFGD---ALEDPSFLAMLAALVN 365 AS A++ + D + F +++ L+ Sbjct: 340 IFASTGAVSYKELRDCACEITSEEFDSLILKLLE 373 >UniRef50_C1EHB5 Predicted protein (Fragment) n=2 Tax=Micromonas RepID=C1EHB5_9CHLO Length = 387 Score = 150 bits (378), Expect = 9e-35, Method: Composition-based stats. 
Identities = 50/260 (19%), Positives = 87/260 (33%), Gaps = 33/260 (12%) Query: 10 PDFLERHWQKRPVVLKRGF--NNFIDPISPDELAGLA----MESEVDSRLVSHQDGKWQV 63 F+ W++RP + R F +S ++ M + + + S++DG + Sbjct: 15 ETFMRDIWERRPAYVSRNAHKGYFDGLLSKADIDEWLRAGKMRYQRNVDVTSYKDGVRRT 74 Query: 64 SHGPFES-------------------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELP 104 + + + + SL V W +P + Sbjct: 75 HNLNDDGSGGVDATTGEPGFADADTVWRRFEQEGCSLRVLHPQRWRDPLWKTLAALERF- 133 Query: 105 DWRIDDLMISFSVPG--GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDL 159 W + P G PH D D FI+Q G++ WRV + P P+ Sbjct: 134 -WNCSTGCNCYLTPADSQGFSPHYDDIDAFILQLEGKKLWRVYPPRSEAEMLPRYSSPNF 192 Query: 160 LQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVL 219 Q D E +++ LEPGD+LY+P G H+ + + V + N + + Sbjct: 193 GQDDVGEPVLEVILEPGDLLYMPRGTVHQANCVPGDHSLHVTL-STNQFNTWADLLEVAF 251 Query: 220 QRELGGNYYSDPDVPPRAHP 239 L P + P Sbjct: 252 PAALRQAVAEVPALRRCPPP 271 >UniRef50_Q5ZMM1 Lysine-specific demethylase NO66 n=3 Tax=Eumetazoa RepID=NO66_CHICK Length = 601 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. 
Identities = 51/251 (20%), Positives = 98/251 (39%), Gaps = 35/251 (13%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-------SHQDGKWQ 62 +F +HW++ P++++RG + AGL ++ D+ L +H D Sbjct: 176 EEFARQHWERAPLLVQRGDPGYY--------AGLFSTADFDAILRSGDVHFGTHLDVTSY 227 Query: 63 VSHGPFESYDHLG-----------ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 G E+++ +G + SL + + + + +E + Sbjct: 228 AE-GVRETHNPVGRALPAVVWDFYQNGCSLRLLSPQAFSTTVWHFLSILQEHFG-SMAGA 285 Query: 112 MISFSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEA 167 + PG G PH D + F++Q G++ WRV + P +L Q + E Sbjct: 286 NTYLTPPGTQGFAPHYDDIEAFVLQLEGKKHWRVYGPRTSSEALPQFSSANLTQAELGEP 345 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELG 224 +++ LE GD+LY P GF H+ L +A + + + + + + LQ L Sbjct: 346 LLEVVLEAGDLLYFPRGFIHQADCLPDAHSLHITVSSYQRNSWGDFLEKLLPAALQMALE 405 Query: 225 GNYYSDPDVPP 235 + +P Sbjct: 406 EDLEYRQGLPM 416 >UniRef50_D2SA69 Cupin 4 family protein n=2 Tax=Actinomycetales RepID=D2SA69_9ACTO Length = 436 Score = 149 bits (377), Expect = 2e-34, Method: Composition-based stats. 
Identities = 59/306 (19%), Positives = 97/306 (31%), Gaps = 31/306 (10%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNN--FIDPISPDELAGLAMESEVDSRLVSHQDGK 60 L+ F E +W +RP++ + F D + + L + + + Sbjct: 19 RCTALDPRVFAEEYWARRPLLTRAEETGGSFADLLDLAAVDELLSRRGLRTPFLRIAKDG 78 Query: 61 WQVSHGPFESYD----------------HLGETNWSLLVQAVNHWHEPTAALMRPFRELP 104 V F + L ++++Q ++ P Sbjct: 79 AVVDPKRFTTSGGAGAEVADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIEFADQLAADL 138 Query: 105 DWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------D 158 G PH D +DVF++Q G +RWR+ E + P Sbjct: 139 GHPTQVNAYVTPPSSRGFSPHYDVHDVFVLQVAGEKRWRIHEPVLTDPLRTQPWNERGAA 198 Query: 159 LLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFAD- 216 + E +ID L PGD LY+P G+ H AL + + +VG + D Sbjct: 199 VAAAAEREPLIDAVLRPGDALYLPRGYLHSATALGAISAHLTVGIHSVTRWAAAESALDL 258 Query: 217 -YVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN--QPEHFKQWF-GEFI 272 VL E S P A PA V ++ + + ++ P Sbjct: 259 VRVLATEDPQLRRSLPLGVDLADPAAV-ADDVATVVTALKGWLDRVDPAEVADRLRARTW 317 Query: 273 SQSRHE 278 SQ R E Sbjct: 318 SQVRPE 323 >UniRef50_Q7K4H4 Lysine-specific demethylase NO66 n=2 Tax=melanogaster subgroup RepID=NO66_DROME Length = 653 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. 
Identities = 57/343 (16%), Positives = 122/343 (35%), Gaps = 43/343 (12%) Query: 12 FLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAMESEVD----SRLVSHQDGKWQVSHG 66 F + W+ +++R F IS L + + +D + ++++GK + + Sbjct: 229 FFKDFWEHTACLVQRSNPKYFQSMISFKMLDEILIRHHLDFTVNVDVTTYKNGKRETLNP 288 Query: 67 -----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 P + + S+ + + + + +E ++ + G Sbjct: 289 EGRALPPAVWGFYSD-GCSIRLLNPSTYLIRLRQVCTVLQEFFHCKVGANLYLTPPNSQG 347 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEK---LQMKQHCPHPDLLQVDPFEAIIDEELEPGDI 178 PH D + F+IQ GR+RW + E + Q + IIDE L GD+ Sbjct: 348 FAPHYDDIEAFVIQVEGRKRWLLYEPPKKADQLARISSGNYDQEQLGKPIIDEVLSAGDV 407 Query: 179 LYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 LY P G H+ E + + ++ L+ VL++ + + +P Sbjct: 408 LYFPRGAVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPL 467 Query: 236 ----------RAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPP 284 + + Q ++ +++++ L+ + + + + +HE A P Sbjct: 468 HTFQVLGNAYKGNDCGSRKQLVENVQKLVTNYLMPSEDDIDEAVDQMAKKFQHE---ALP 524 Query: 285 EPPYQPDEIY-------DALKQGEV-----LVRLGGLRVLRIG 315 +E+ DA +QG + +R+LR Sbjct: 525 PIVLPSEEVRTVHGARSDADEQGNCVCDYKFNKKTSVRLLRAN 567 >UniRef50_A3UGV1 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UGV1_9RHOB Length = 387 Score = 147 bits (371), Expect = 7e-34, Method: Composition-based stats. 
Identities = 73/384 (19%), Positives = 130/384 (33%), Gaps = 46/384 (11%) Query: 12 FLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQ-------DGKWQV 63 F E +++ + N F IS D + + E + +S D W Sbjct: 17 FFETVFEQTHLHAPGTDRNRFASLISLDAIDRILAEDLLREGDLSMARAEPRLPDRAWLR 76 Query: 64 SHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 G + L + +L++ + H P A L R + + G Sbjct: 77 EDGLVDRGEVARLYQQGATLILPQLQARHRPLADLCRQLEAEFSCPVQTNIYLTPPNAQG 136 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 H D +DV ++Q G +RWR+ + + + + E + L PGD+LY Sbjct: 137 FQTHYDNHDVLVLQVEGSKRWRLYDAPVGVPYRGERFTPGRFAQTEPRAELVLNPGDVLY 196 Query: 181 IPPGFPHEGY---ALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS---DPDVP 234 +P G H+ + E +++ + G L +AD++L+ + +P Sbjct: 197 VPRGLMHDAVNEGSDEASLHITTGL-------LAKTWADFLLEAVSEAALRTPQLRRALP 249 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 P V P K LE + Q G F + +P + Sbjct: 250 PGYARGAVSPGVFAKTFAEALEAVGQNADIGAVLGLFTDAALTS----------RPADTR 299 Query: 295 DALKQGEVLVRLGGLRVLRIGDDVYANGE-----------KIDSPHRPALDALASNIALT 343 AL G + R I D+ +G+ D+ L+ L A++ Sbjct: 300 GALTLGPITADTRLKRRALIALDLVDDGDHVALVAPGGALSFDAAAEAGLERLLKGDAIS 359 Query: 344 AENFGDALEDPSFLAMLAALVNSG 367 +F AL+D ++ L+ G Sbjct: 360 LADF-SALDDAKARDVMERLIAYG 382 >UniRef50_Q54K96 Lysine-specific demethylase NO66 n=1 Tax=Dictyostelium discoideum RepID=NO66_DICDI Length = 514 Score = 146 bits (368), Expect = 2e-33, Method: Composition-based stats. 
Identities = 68/417 (16%), Positives = 142/417 (34%), Gaps = 57/417 (13%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLA----MESEVDSRLVSHQDGKWQV 63 DF ++++ ++ + +KR +N + + D L + M+ + + ++ D + Sbjct: 101 IEDFYDQYFGQKHLYVKRNGDNIYKNFFTKDSLDKMLRNNLMKFTENVDVTNYVDFQRIT 160 Query: 64 SHGPFESYDHL----GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 + +Y L + S+ + ++ L + + + P Sbjct: 161 LNPEGRAYPSLVWKHYKEGCSVRLLNPQTFNSNVWKLCSTLQTHFQCGVGAN--IYLTPA 218 Query: 120 G--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---DLLQVDPFEAIIDEELE 174 G G PH D DVFI+Q G++ WR+ + + P + Q + E LE Sbjct: 219 GAQGFAPHYDDVDVFILQLEGKKEWRLYKPRDANEVLPKKSSENFTQEEIGEPYFTVTLE 278 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDP 231 GD+LY P G H+ + + + + +LI + L+ Sbjct: 279 AGDLLYFPRGVIHQAVSPSDVHSLHITVSTYLNNTWGDLIGKVLNRALEIANEECLEFRE 338 Query: 232 DVPP--RAHPADVLPQEM-DKLREMMLE---LINQP--EHFKQWFGEFISQSRHELDIAP 283 +P + + ++ D+ R+ + + + + G ++ LD P Sbjct: 339 GLPRDYTQYLGVIHSDKVGDERRKELTDKVGTLWDKLGQLLPIDIGADQMAVKYLLDSLP 398 Query: 284 P-----------EPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDD-VYAN---------- 321 P E +I + L+R +R++ ++ N Sbjct: 399 PVLTQLEKKHSIEDETTSMKIKPETRF--RLIRADSVRLVVEDIAILFHNADNTRIYHQV 456 Query: 322 GE-----KIDSPHRPALDALASNIALTAENFGDALEDPSF-LAMLAALVNSGYWFFE 372 GE + AL+ + + +ED L +++AL G FE Sbjct: 457 GEEPGVVEFTLECVDALEHIIDSYPSYIYTKDLPIEDDDQKLDVVSALYEKGLIMFE 513 >UniRef50_UPI00017929D5 PREDICTED: similar to Nucleolar protein 66 (hsNO66) n=1 Tax=Acyrthosiphon pisum RepID=UPI00017929D5 Length = 473 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. 
Identities = 50/238 (21%), Positives = 90/238 (37%), Gaps = 18/238 (7%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEV----DSRLVSHQDGKWQ 62 + DF+ HW+K + + R +N F S EL + E+ + + + S+ D + Sbjct: 79 SINDFMRDHWEKTILHVPRNSSNYFSQLFSLTELDTILRENNLQYGTNVDITSYTDNVRE 138 Query: 63 VSHGPFES------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 +H P +D+ S+ + + M +E + + Sbjct: 139 -THNPVGRAHPHVVWDYYN-NGCSVRLLNPQLFAPEIYKFMANLQEYFGSLVGCNVYLTP 196 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---DLLQVDPFEAIIDEEL 173 G PH D + F++Q G + WRV + + P + Q + E I+D L Sbjct: 197 PFSQGFAPHYDDIEAFVVQVDGEKHWRVYKPRSEFETLPRTSSRNFHQDEIGEPILDVIL 256 Query: 174 EPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSD 230 PGD LY+P G+ H+ L + + F + + F V+ L +D Sbjct: 257 RPGDFLYMPRGYIHQADTLFTETHSLHLTFSSYQQNSMYD-FLQVVVNNSLNNAVKND 313 >UniRef50_A7SRW5 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SRW5_NEMVE Length = 269 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 79/206 (38%), Gaps = 15/206 (7%) Query: 10 PDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLAMESEV----DSRLVSHQDGKWQVS 64 DF E +W+K+P+V+ R + F S L L + E+ D + + DG+ + Sbjct: 58 KDFFENYWEKKPLVINREDSEFYGALFSKAFLEVLLKKKEINYVEDINVCRYIDGEKEFL 117 Query: 65 HG-------PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 + + + + N ++ + + L + + Sbjct: 118 NEDEGTKATASKIMKKVKDDNATIQFHQPQRFQDTLWQLNGNLERFFGCLVGANVYITPP 177 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G+ PH D +VFI+Q G + W++ L DL + + + L+ GD Sbjct: 178 NAQGLAPHHDDVEVFILQLEGEKNWKLYSPLVELALDYSADLEEDSIGKPTHEFTLKTGD 237 Query: 178 ILYIPPGFPHEGYAL---ENAMNYSV 200 +LY P G H+ L ++ + ++ Sbjct: 238 LLYFPRGTIHQAETLKCGNHSTHITL 263 >UniRef50_UPI000180B5EA PREDICTED: similar to Nucleolar protein 66 (hsNO66), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B5EA Length = 594 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. 
Identities = 44/242 (18%), Positives = 94/242 (38%), Gaps = 21/242 (8%) Query: 12 FLERHWQKRPVVLKRGFNNFID-PISPDELAGLAME----SEVDSRLVSHQDGKWQVSHG 66 F + W+ RP+++ R + D S E+ + E V+ + ++Q+G+ + + Sbjct: 161 FFKDIWESRPLLVLRHCPRYADGLFSTKEMNRILNECNVRYSVNLDVTTYQNGRRETHN- 219 Query: 67 PFESYDHLG------ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG- 119 + + + S+ ++ + +P L +E + + PG Sbjct: 220 -IDGRAYAPVVWDYFKNGCSIRLKNPQAFSKPVWRLCATLQEFFKCMVGANT-YLTPPGT 277 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++ W + K+ P + + + I + LE G Sbjct: 278 QGFAPHYDDIEAFVLQLEGKKEWTLYSPRSGKETLPRYSSGNFTADEIGDEIFTQTLEAG 337 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDV 233 ++LY P G+ H+ AL + + V + +L+ LQ + + + Sbjct: 338 NLLYFPRGYIHQAKALPDTHSLHVTISMYQRNSWGDLLEKLLPTTLQNAIIDDVEFRKGL 397 Query: 234 PP 235 P Sbjct: 398 PL 399 >UniRef50_B0BQ44 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=B0BQ44_ACTPJ Length = 396 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. 
Identities = 59/301 (19%), Positives = 115/301 (38%), Gaps = 45/301 (14%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEV--DSRLVSHQDGKW 61 +L++ F ++K+P+V+K N D +S +E+ + ++ + + GK Sbjct: 14 NFSLSYEKFKYEFFEKKPLVIKGAIRN-KDLLSWNEINEIFPRCKLIGEEEIKVMYKGKK 72 Query: 62 QVSHGPFESYDHLG---------------ETNWSLLVQ------AVNHWHEPTAALMRPF 100 ESY+ LG +L+ A++ + + A + Sbjct: 73 VPKEYYVESYNDLGTLRYKFKEEELYCLMRDGATLIANGIVNEPAIDIFSQEIAKFTKCH 132 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL 160 L ++F+ H D D+F IQ GR+RW + H Sbjct: 133 IF------SSLYVAFNTQ-RSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKD 185 Query: 161 QVDPF----EAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFA 215 + F + ID LE GDILY+P G+ H+ + E ++ +VG P T + +S Sbjct: 186 MPEYFPNKDDVYIDILLEAGDILYLPRGWWHDPIPVGEETVHLAVGVFPPYTNDYLSWVT 245 Query: 216 DYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQS 275 + +++ E+ ++ + + L + + IN ++F + F + Sbjct: 246 ENIVKNEIA---------RKSLSSSEKNDEIISLLSAEVADFINNKDNFNIFLESFYDKK 296 Query: 276 R 276 R Sbjct: 297 R 297 >UniRef50_A9C261 Cupin 4 family protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C261_DELAS Length = 298 Score = 144 bits (363), Expect = 6e-33, Method: Composition-based stats. 
Identities = 41/201 (20%), Positives = 73/201 (36%), Gaps = 12/201 (5%) Query: 72 DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 L +T + ++ +++ E L + G G H D +DV Sbjct: 87 QSLLKTGATAILNRIDNRQELVRRLCEEVASFTNAETTANAYLAFSGEGSFGSHWDTHDV 146 Query: 132 FIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD---PFEAIIDEELEPGDILYIPPGFPHE 188 IQ G++ WRV K P D P + I D LE GD+LY+P G+ HE Sbjct: 147 MAIQLIGKKHWRVYAP-TYKSPLPGQTSKSFDSTCPTDPIFDGVLEAGDLLYVPRGWWHE 205 Query: 189 GYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMD 248 + ++ ++G P+ ++ F + N ++ + + Sbjct: 206 VLPIGETLHVAIGIYPPHVLNYVAWFLE--------KNIKHHEELRKTLRSCTTTKEVVS 257 Query: 249 KLREMMLELINQPEHFKQWFG 269 ++ E +N PE K + Sbjct: 258 NACHVLTEGLNDPEVLKAFMD 278 >UniRef50_Q6DDJ7 Mina-prov protein n=2 Tax=Xenopus RepID=Q6DDJ7_XENLA Length = 461 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 53/332 (15%), Positives = 105/332 (31%), Gaps = 34/332 (10%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPI-------SPDELAGLAMESEVDSRLVSHQDGKW- 61 F +W+ + ++L+ F D +AG + E D + +DGK Sbjct: 49 DAFFRDYWETKVLLLQGRDPAFTDYFQTLFRLSDLKHIAGGGIYYERDVNVFKCRDGKKI 108 Query: 62 -QVSHGPFE---SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 HG G ++ +++ +M + + Sbjct: 109 ALPRHGKATYLHLLKDFGSGKATIQFHQPQRFNDALWHIMEKLECFFGALVGSNVYITPQ 168 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G+ H D +VFI+Q G +RWR+ + + + + D L+PGD Sbjct: 169 DSQGLPAHYDDVEVFILQLEGEKRWRLYNPVVPLAR-DYSVVPEDQIGSPTHDFVLKPGD 227 Query: 178 ILYIPPGFPHEGYAL-ENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDV 233 +LY P G H+ AL ++ + V + + + +L N + Sbjct: 228 LLYFPRGVIHQAQALPGSSHSTHVTISTYQNNSWSDYLQDLLPGILFDAAKANIDLRRGI 287 Query: 234 PPR-----AHPADVLPQEMDKLREMMLELINQPEHFKQW-FGEFISQSRHELDIAPPEPP 287 P + P + Q++ L + + + H + + SR + EP Sbjct: 288 PRQQILSLDTPGVI--QQISSLLNTVAKGLESHRHIRSFEILRDFMASRLPPFLDNKEPG 345 Query: 288 YQPDEIYDALKQGEVLVRLGGLRVLRIGDDVY 319 G +++L + Sbjct: 346 QTS---------GMPPKLNSTVQLLYRDYSFF 368 >UniRef50_B3S582 Putative 
uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3S582_TRIAD Length = 431 Score = 140 bits (354), Expect = 7e-32, Method: Composition-based stats. Identities = 41/275 (14%), Positives = 99/275 (36%), Gaps = 21/275 (7%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGL----AMESEVDSRLVSHQDG 59 L + F W+++P++ +R +++ + S +L + +E V+ + ++++G Sbjct: 3 LPIPLDTFFNLSWERKPILAQRRSSSYNNGLFSSHDLDRIVRENYIEYSVNLDVTTYENG 62 Query: 60 KWQVSHGPFESYDHLG----ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 + + + + S+ + + E +E + M Sbjct: 63 VRETHNAEGRVLASVMWDYYQNGCSIRMLNPQTYSESLWKFCSLLQEYFGSFVGCNM-YL 121 Query: 116 SVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---HPDLLQVDPFEAIIDE 171 + PG G PH D + F++Q G+++WR + P + Q + + + Sbjct: 122 TPPGTQGFAPHFDDIEAFVLQLEGKKKWRFYNPRDDSEILPEYSSGNFNQNEIGKPSFEF 181 Query: 172 ELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDP 231 LE GD Y P G H+ +L + + + F + +L + + ++ Sbjct: 182 VLEQGDFAYFPRGTIHQAQSLPDCHSLHITVSTCQLHSFGKYF-EKLLPMAIRSAFKNNL 240 Query: 232 DVPPRAHP------ADVLPQEMDKLREMMLELINQ 260 + P + + R+ + + Q Sbjct: 241 GLRKSLPPDFFANIGGIHADSKNARRKQLTTEVKQ 275 >UniRef50_C7NJK3 Cupin superfamily protein n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NJK3_KYTSD Length = 414 Score = 140 bits (353), Expect = 8e-32, Method: Composition-based stats. 
Identities = 54/289 (18%), Positives = 96/289 (33%), Gaps = 30/289 (10%) Query: 10 PDFLERHWQKRPVVLK---RGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 DF ER W P + R + F D S D + L + + + + Sbjct: 27 ADFAERSWGTTPRHVPATDRAGDTFTDLFSLDAVDDLLTHRGLRTPFIRMAQDGTTLPEN 86 Query: 67 PF----------------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 F + L ++++Q ++ P A R + + Sbjct: 87 RFTRGGGTGAGASDQVDEDRVRSLFAGGATIVLQGLHRTWPPIAEFARELGDELGHPVQV 146 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP-------DLLQVD 163 G H D +DVF++Q G + W + E + P + Sbjct: 147 NAYITPPQNQGFSAHYDVHDVFVLQVHGTKHWTLHEPVVAHPLRDQPWDTVREAVAHRAA 206 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRE 222 +ID L PGD+LY+P G H A + + ++G + D V R Sbjct: 207 QDAPLIDAVLAPGDVLYLPRGTIHAAAAQGEISAHLTIGVHTWTPDHVTGAVLDAVRSRL 266 Query: 223 LGG-NYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN--QPEHFKQWF 268 ++ + R A V+ +++LR + E I+ E ++F Sbjct: 267 RDQPTVRANLPLGARPDDAAVVGPTLEQLRGALHEAIDSLDAEELARYF 315 >UniRef50_A4X6V2 Cupin 4 family protein n=4 Tax=Micromonosporaceae RepID=A4X6V2_SALTO Length = 496 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. 
Identities = 69/405 (17%), Positives = 132/405 (32%), Gaps = 52/405 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKR----GFNNFIDPISPDELAGLAMESEVDSRLVSH 56 M +++ F HW + P++ + + F D +SP + L + + + Sbjct: 1 MARCVSVEPATFAAAHWGQTPLLSRAHELPNPSGFRDLLSPADADDLLSRRGLRTPFLRV 60 Query: 57 Q---------------DGKWQVSHGPFESY-DHLGETNWSLLVQAVNHWHEPTAALMRPF 100 +++ + L +L++Q ++ R Sbjct: 61 AQDGVLVPAARYTGGGGAGAEITDQVLDEKILDLYAGGATLVLQGLHRTWPALIDFARDL 120 Query: 101 RELPDWRIDDLMISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP- 157 + + ++ P G G H D +DVF++Q G + WR+ + P Sbjct: 121 GLAVGQPLQ--VNAYLTPAGSQGFATHYDTHDVFVLQVDGGKHWRIHPPVLPDPLERQPW 178 Query: 158 -----DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELI 211 +++ +D L PGD LY+P G+ H A E ++++ +VG RA L+ Sbjct: 179 GGRADEVVATATGAPALDVLLAPGDALYLPRGWLHSAAAQERSSLHLTVGVRALTRYTLV 238 Query: 212 SGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEF 271 L E + P + P V P+ + + E++ + + Sbjct: 239 EELL--ALAAEDQRLRATLPFGIDVSAPEAVEPE-LTETVEILRDWL--RRVDPTALAAR 293 Query: 272 ISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH-- 329 + Q P P AL + GLR GE++ Sbjct: 294 LRQRAWPAARPAPLHPLAQAAALGALGPDSRVTPRPGLRWQLTPA-----GERVTLRVFD 348 Query: 330 ---------RPALDALASNIALTAENFGDALEDPSFLAMLAALVN 365 PAL AL S + +D + ++ L+ Sbjct: 349 RTITLPQMCAPALRALLSGEVSRVGDLPGLADDTDRVTLVRRLLR 393 >UniRef50_C5LMW3 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LMW3_9ALVE Length = 521 Score = 137 bits (346), Expect = 6e-31, Method: Composition-based stats. 
Identities = 69/399 (17%), Positives = 128/399 (32%), Gaps = 50/399 (12%) Query: 8 NWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGLAMESEVDSRL------------V 54 +F E +W+K+P+ ++R ++ + +A + + R V Sbjct: 59 TVEEFFEEYWEKKPLHVRRPTARDYYSGVWTKAMAEKTLTKH-ECRFGESVNFARVEAGV 117 Query: 55 SHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 + E E S + +P ALM S Sbjct: 118 KVMHNGEEGEKATVEYMQGQFEDGVSCQFMQPQRFSKPCHALMERLENYFGTLWGAN--S 175 Query: 115 FSVPGGGVG--PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---HPDLLQVDPFEAII 169 + P VG PH D +VF++Q G +RWR+ + P D + + Sbjct: 176 YLTPANSVGFAPHYDDVEVFMLQTEGSKRWRLYDSPDDDGPLPMEYSRDYTEEELSLPYF 235 Query: 170 DEELEPGDILYIPPGFPHEG--YALENAMNYSVGFRAPN-TRELISGFADYVLQRELGGN 226 DE +E GD+LYIP G H G + + +V N EL+ ++ L Sbjct: 236 DEVVEQGDLLYIPRGTVHFGCVSPEGYSHHLTVSTYYHNSWGELLQNL---LIPGALAKA 292 Query: 227 YYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP 286 D + +P + ++ + + G+ + S ++++ Sbjct: 293 MKEDVGFR------EGVPVNWTRYMGRLMAPVTAETAALEAGGDDDNASDDDVEVGKDGS 346 Query: 287 PYQPDE-----IYDALKQGEVLVRLGGLRVLRIGDDVYANG---------EKIDSPHRPA 332 + +E + QGE L R + G ++ D+ + A Sbjct: 347 STEEEEEVDAKVGSVPPQGETQADPEALMKARKAFKAHVKGLIAKLSEYVDEDDAADQTA 406 Query: 333 LDALASNI--ALTAENFGDALEDPSFLA-MLAALVNSGY 368 +D +A A A P+ +L N + Sbjct: 407 VDFVALRTPPAPRAGETKTHGPSPAAQGNLLVRWRNPAW 445 >UniRef50_C3XRY1 Lysine-specific demethylase NO66 n=1 Tax=Branchiostoma floridae RepID=NO66_BRAFL Length = 607 Score = 137 bits (346), Expect = 6e-31, Method: Composition-based stats. 
Identities = 63/355 (17%), Positives = 123/355 (34%), Gaps = 66/355 (18%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDP-ISPDELAGLAMESEVDS----RLVSHQDGKWQ 62 F W+K+P+++KR ++ D S ++L + E+++ + +++ G+ + Sbjct: 202 KKEKFFSELWEKKPLLVKRHLESYNDGWFSTEDLTKILHENDIQFGRNLDVTTYEGGQRE 261 Query: 63 VSH-----GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 + P +D+ + S+ + + + L +E + I + Sbjct: 262 THNPPGRANPAVVWDYY-QNGCSVRLLNPQTYSQGVWRLCSTLQEYFSSMVGAN-IYLTP 319 Query: 118 PG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 PG G PH D + F++Q G + Q + EAI+D LEPG Sbjct: 320 PGTQGFAPHYDDIEAFVLQLEG-------------------NFSQEEIGEAILDVTLEPG 360 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDV 233 D+LY P G H+ AL + + + +L+ L + + Sbjct: 361 DLLYFPRGTIHQASALPDTHSLHITVSTCQRNTWGDLMEKLVPAALTMAFSEDVEFRQAL 420 Query: 234 PP------------RAHPADVLPQEMDKLREMMLELINQ---PEHFKQWFGEFISQSRHE 278 P P ++ L+ ++ L+N Q EF+ Sbjct: 421 PRDYLDYMGLANADLDDPR--RKAFLETLQSLLSRLVNYVPVDAGVDQKAVEFMRDCLPP 478 Query: 279 LDIAPPEPPYQPDEIYDALKQGEV-------------LVRLGGLRVLRIGDDVYA 320 + E L++G V L+R G R++ G+ V+ Sbjct: 479 V-FTKNERACSIYGCRTRLEKGRVVGSVDLKTSTPVKLIRKGAARLVMEGEQVFL 532 >UniRef50_A8QFQ3 Lysine-specific demethylase NO66 n=2 Tax=Brugia malayi RepID=NO66_BRUMA Length = 710 Score = 136 bits (344), Expect = 8e-31, Method: Composition-based stats. 
Identities = 44/246 (17%), Positives = 91/246 (36%), Gaps = 17/246 (6%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFI-DPIS----PDELAGLAMESEVDSRLVSHQDGKWQ 62 + F + +QK+ ++ N+ + S D L +E + + +++ + Sbjct: 279 DLTQFFKMVFQKKVFLVCHNNPNYYGNLFSTAKFIDILQTDYVEYGTNVNVAIYKNQQRS 338 Query: 63 VSHGPFESYDHLGET----NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 +G + Y + S+ + + + +E+ + + Sbjct: 339 TLNGSGKVYPQAIQKSIKAGCSIQLTNPQSFCDNVWYYCDLLQEVFGCFVGANIYITPAN 398 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI-----IDEEL 173 G PH D D F++Q GR+ W++ + P + I D+ L Sbjct: 399 TAGFAPHWDDIDAFLLQLEGRKHWKIYAPDSDDEMLPRLPSGNFTDNDVINRMLVFDDWL 458 Query: 174 EPGDILYIPPGFPHEGYALEN--AMNYSVGF-RAPNTRELISGFADYVLQRELGGNYYSD 230 E GD+LYIP G+ H+G+A ++ +++ +V R +L+ L N Sbjct: 459 EQGDMLYIPRGYIHQGFADKDVHSLHLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIR 518 Query: 231 PDVPPR 236 +P R Sbjct: 519 KSLPAR 524 >UniRef50_C9N2N9 Cupin 4 family protein n=4 Tax=Streptomyces RepID=C9N2N9_9ACTO Length = 402 Score = 136 bits (344), Expect = 1e-30, Method: Composition-based stats. 
Identities = 65/335 (19%), Positives = 113/335 (33%), Gaps = 43/335 (12%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 L+ DF W + + + + D S + L + + + G + Sbjct: 25 TGLSREDFARDVWARTAALTRGASDF-SDVFSSSAVDELISRRGLRTPFLRVAKGGTTLP 83 Query: 65 HGPFESYDHLGET----------------NWSLLVQAVNHWHEPTAALMRPFRELPDWRI 108 F + +G T +L++QA++ EP A L+ + Sbjct: 84 ESSFTAPAGVGATIGDQLDDTALWRAFADGATLVLQALHRTWEPVAGLVSELSTELGHPV 143 Query: 109 DDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL--QMKQHCPHPDLLQ----- 161 G H D +DVF++Q G +RW V E + + P D Q Sbjct: 144 QANAYVTPPQNRGFDAHYDVHDVFVLQIEGTKRWIVHEPVLPDPLRDQPWTDHRQAVADA 203 Query: 162 VDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQ 220 A +D L PGD+LY+P G+ H A +++ ++G L Sbjct: 204 AARSTAHLDTVLGPGDVLYLPRGWLHSARAQGEVSIHLTLGVHTWTRYALAEQLT----- 258 Query: 221 RELGGNYYSDPDVPPRAHP--ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH- 277 R DP + R+ P A E + +RE +L + + + + ++ R Sbjct: 259 RAALAALRDDPPM-RRSLPLGAGGQDDERELVRERLLAAVAEADPGPSFERARRAEGRPA 317 Query: 278 ---------ELDIAPPEPPYQPDEIYDALKQGEVL 303 L+ P P + E +A G L Sbjct: 318 PLGPLAQLSALNGLGPTTPVRLREALEARLAGTRL 352 >UniRef50_Q1D4G2 Cupin family protein n=2 Tax=Myxococcus xanthus DK 1622 RepID=Q1D4G2_MYXXD Length = 442 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. 
Identities = 77/392 (19%), Positives = 128/392 (32%), Gaps = 80/392 (20%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDEL----AGLAMESEVDS----- 51 +E +W F+ R+W +RPV+ K P + D++ AG S Sbjct: 4 LEIATRFDWDTFVRRYWNQRPVLFKG---TQASPFTVDDVFEASAGATQRYLSRSYEPAS 60 Query: 52 ---------RLVSHQDGKW--QVSHGPFESYD-----HLGETNWSLLVQAVNHWHEPTAA 95 RL + +W + S G + YD LGE ++L++ ++ + Sbjct: 61 RPDVTFTVDRLRQLRSREWLPRKSDGSLDGYDARIASQLGERRYALIIATMHASGFQLWS 120 Query: 96 LMRPFRELPDWRIDDLMISFSVPGGG--------------VGPHLDQYDVFIIQGTGRRR 141 R F L +P G VG HLD++ F+ GR+R Sbjct: 121 RQRAF-------FSGLWQRVGMPVTGGITSLFHGTYEHSPVGVHLDRFTTFMFALRGRKR 173 Query: 142 WRVGEKLQMKQHCPHPDLLQVDPF-EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 R K + +L P+ + E+EPGDILY P + H G + + S+ Sbjct: 174 MRFWHKRPWSEDVS--TILDYQPYLASSFVAEVEPGDILYWPSTYYHVGESAGAGVASSL 231 Query: 201 GFRAP--------NTRELISGFAD--YVLQRELGGNYYSDPDVPPRAHP--------ADV 242 P + +L+ G D + +E + P A A Sbjct: 232 NVGIPITEHHVIYSVDDLLRGMLDETSLADQEWKQTRLARVSASPLARGALSKNGVLATE 291 Query: 243 LPQEMDKLREMMLELINQPEHFKQ----WFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 LP+ + + ++ + E + W S + E P + Sbjct: 292 LPRALTEAVRAFRDVSHPKEARRHIQSTWLKRLTSGGFEPVPPPTREKPLRDSHHVRVDP 351 Query: 299 QGEVL-VRLGGLRVLRIGDDVYANGEKIDSPH 329 VL R R + ANG + Sbjct: 352 SFPVLFERDSATRWI-----CSANGHALRGAG 378 >UniRef50_Q1DFZ7 Cupin family protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DFZ7_MYXXD Length = 295 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. 
Identities = 53/270 (19%), Positives = 96/270 (35%), Gaps = 12/270 (4%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFE 69 FL+ H+ +RP + + + L E+ D L + Sbjct: 13 ERFLQEHYLRRPFTGASAAERLQRLGTWETIDFLVEETACDVLLARQGVPYPGDRPTTAK 72 Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG--GVGPHLD 127 + L ++L ++ + H A L R F RI + + P G G G H D Sbjct: 73 AARELFAQGYTLALRQPDLHHPDLAQLARAFSAELHGRI--NLHIYCTPAGHHGFGWHCD 130 Query: 128 QYDVFIIQGTGRRRWRVGE----KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 +VFI+Q GR+ + + E + + + P L + L GD +YIP Sbjct: 131 PEEVFILQTAGRKDYLLRENTLHPVPLPESVPSGSLA-AQEKTPVETHSLSAGDFIYIPG 189 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV-PPRAHPA-- 240 G H A E A++ S+G P +L+ G + + P+ Sbjct: 190 GHWHMAQATEEALSISIGLMPPTLLDLLDGVRAALASSPVWRRRMPSLGRASSLDDPSKL 249 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGE 270 +L + +L + + P + ++ + Sbjct: 250 ALLRTLLSELGGEVQRQLADPGYPLRFLAQ 279 >UniRef50_C6SNC5 Putative uncharacterized protein n=2 Tax=Neisseria meningitidis RepID=C6SNC5_NEIME Length = 387 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. 
Identities = 53/294 (18%), Positives = 110/294 (37%), Gaps = 29/294 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ + +F E + K+P + K + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMEYKEFNENYLYKKPFIFKNALD--VSSISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM------RPFRELPDWRIDDLMIS 114 ES++ +G AV + + A ++ PF + +I + Sbjct: 59 IIPKEAYVESFNDVGRIRHRFNKTAVYQYLQDGATMVYNRIDNEPFVDSIAKQIAQFAQA 118 Query: 115 FSVPGG--GVGP------HLDQYDVFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDP 164 +V G G H D DVF +Q G++ W + D+ + P Sbjct: 119 QTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGKKHWTISAPNFDMPLYMQQAKDMPHITP 178 Query: 165 FEAI-IDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRE 222 + + ++ LE GDILYIP G+ H + + ++G PN + + + Sbjct: 179 SKTVDMEVILEAGDILYIPRGWWHNPMPMNCETFHLAIGTFPPNGYNYMEWLMKKIPDIQ 238 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSR 276 + D + + +D + + E++ E+++ + +F+ R Sbjct: 239 SIRQNFID---------WEHDQKNIDNAAQAVTEMMKNQENYQAFIQDFLGNQR 283 >UniRef50_Q2T4J7 Unnamed protein product n=2 Tax=Burkholderia thailandensis RepID=Q2T4J7_BURTA Length = 397 Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats. 
Identities = 66/396 (16%), Positives = 131/396 (33%), Gaps = 49/396 (12%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + F+ER+W ++P++++R + + +E A L L + G + + + Sbjct: 13 ITVDAFMERYWGRKPLIVRRQAPHLYACLPDSEEFAFLL------HSLTDPERGWFSIVN 66 Query: 66 GP----------------FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDW--- 106 G SLL+ V H TA L R Sbjct: 67 GVARPPSDSLLTQEGLLNLSEVYAAYRDGNSLLMNQVQRRHRETAMLCRRIESALSAHGI 126 Query: 107 ---RIDDLMISFSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQ 161 R S P G H D +DV I+Q GR+ WR+ + + P + Sbjct: 127 ALARHIGANGYLSPPSSQGFNIHYDPHDVLILQIEGRKHWRLYGRHVAWPTQPPATPIPP 186 Query: 162 VDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQ 220 + + L PG+++YIP G H+ ++ +++ ++ +L+ Sbjct: 187 EEAGSPRREFVLSPGELVYIPRGVLHDANTTDSRSLHLTLSIETLTWTDLLIE------- 239 Query: 221 RELGGNYYSDPDVPPRAHPAD-VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHEL 279 + N ++P + + +L + +N P ++ + L Sbjct: 240 -AMSDNPAFRRNLPVCPPFGKRIGDEARAELTR-LTASLNNPRALRRALAAMSGRLLGNL 297 Query: 280 DIAPPEPPYQPDEIYDALKQGEVLVRLGGL--RVLRIGDD--VYANGEKIDSPH--RPAL 333 D P + + ++ L G V GD+ ++ G + + A Sbjct: 298 D-PLPNGGFAEVDGLHLIEPKTWLSLAPGTFGHVEVNGDEAILHLPGSALRAAREMAKAF 356 Query: 334 DALASNIALTAENFGDALEDPSFLAMLAALVNSGYW 369 L + A + + + L + LV G+ Sbjct: 357 YYLLRARRVRACDLPVSASEADKLTFVRKLVQMGFL 392 >UniRef50_Q9H6W3 Lysine-specific demethylase NO66 n=17 Tax=Eumetazoa RepID=NO66_HUMAN Length = 641 Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats. 
Identities = 43/241 (17%), Positives = 82/241 (34%), Gaps = 15/241 (6%) Query: 10 PDFLERHWQKRPVVLK-RGFNNFIDPISPDELAGLAMESEVDS----RLVSHQDGKWQVS 64 F R W++ V+++ + + S +L + EV + +G+ + Sbjct: 216 DHFYRRLWEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETL 275 Query: 65 HGPFESYD----HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 + P + L + SL + + + +E + Sbjct: 276 NPPGRALPAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQ 335 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC---PHPDLLQVDPFEAIIDEELEPGD 177 G PH D + F++Q GR+ WRV + P+ Q D E ++ LEPGD Sbjct: 336 GFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGD 395 Query: 178 ILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDVP 234 +LY P GF H+ + + + + + +Q + N +P Sbjct: 396 LLYFPRGFIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLP 455 Query: 235 P 235 Sbjct: 456 R 456 >UniRef50_Q8IUF8 MYC-induced nuclear antigen n=25 Tax=Amniota RepID=MINA_HUMAN Length = 465 Score = 134 bits (338), Expect = 5e-30, Method: Composition-based stats. 
Identities = 53/332 (15%), Positives = 109/332 (32%), Gaps = 28/332 (8%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN----FIDPISPDELAGLAME---SEVDSRLVSHQDG 59 + F + W+++P++++R + +L L D + +G Sbjct: 49 IKTETFFKEFWEQKPLLIQRDDPALATYYGSLFKLTDLKSLCSRGMYYGRDVNVCRCVNG 108 Query: 60 KWQV--SHGP---FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 K +V G + + ++ + + + + + Sbjct: 109 KKKVLNKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNV-- 166 Query: 115 FSVPGGGVG--PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEE 172 + P G G PH D +VFI+Q G + WR+ + + + + Sbjct: 167 YITPAGSQGLPPHYDDVEVFILQLEGEKHWRLYHPTVPLAREYSVE-AEERIGRPVHEFM 225 Query: 173 LEPGDILYIPPGFPHEG-YALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS-- 229 L+PGD+LY P G H+ A + V + + D++L G + + Sbjct: 226 LKPGDLLYFPRGTIHQADTPAGLAHSTHVTISTYQN----NSWGDFLLDTISGLVFDTAK 281 Query: 230 -DPDVPPRAHPADVL--PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP 286 D ++ P +L + + + + E +S + I P Sbjct: 282 EDVEL-RTGIPRQLLLQVESTTVATRRLSGFLRTLADRLEGTKELLSSDMKKDFIMHRLP 340 Query: 287 PYQPDEIYDALKQGEVLVRLGGLRVLRIGDDV 318 PY + + G L RL + L+ D + Sbjct: 341 PYSAGDGAELSTPGGKLPRLDSVVRLQFKDHI 372 >UniRef50_D0MXW2 Nucleolar protein, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0MXW2_PHYIN Length = 506 Score = 134 bits (337), Expect = 7e-30, Method: Composition-based stats. 
Identities = 46/241 (19%), Positives = 87/241 (36%), Gaps = 19/241 (7%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNF---IDPISPDELAGLAMESEVDS----------RL 53 ++ FL +++K+P+ +++ + S +L ME + S R Sbjct: 66 MSLDTFLSEYFEKKPLHVRKADKGALFDSNLFSRKKLLK-VMEKQHRSLSFGKDLTVCRY 124 Query: 54 V----SHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRID 109 V + DG+ H L + +S + + L F ++ Sbjct: 125 VDSERENFDGEDTNGHATSRQVASLLDRGYSCQFYQPQRYEDGLYELNAAFEDVFGGLAG 184 Query: 110 DLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAII 169 + PH D +VF++Q GR++W++ L DL + E + Sbjct: 185 SSAYLTPANSQALAPHHDDVEVFVVQTQGRKKWKLYHPLVELAGEHSSDLAEDQIGEPWM 244 Query: 170 DEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS 229 + +E GD+LY P G H+ E + V + + F + L R + + S Sbjct: 245 ELTVEEGDLLYFPRGVIHQACTDEKEFSTHVTI-SVYQHNTWANFLEVALPRVIRHAFDS 303 Query: 230 D 230 D Sbjct: 304 D 304 >UniRef50_C6W918 Cupin 4 family protein n=2 Tax=Actinomycetales RepID=C6W918_ACTMD Length = 395 Score = 133 bits (336), Expect = 8e-30, Method: Composition-based stats. 
Identities = 74/383 (19%), Positives = 138/383 (36%), Gaps = 37/383 (9%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDS-RLVSHQDGKWQVSHGPFE- 69 F E + + F D + E+ + + ++ RL +DG+ +H E Sbjct: 18 FFEAVQGRTHLRFPGERGRFADLLPWSEVNRVLRQHRLEFPRLRLARDGEVVPAHVYSEL 77 Query: 70 ---------------SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 + +L++ +V L R+ + Sbjct: 78 VDTRRAGQVPRVLPGKFAEQMRGGATLVLDSVQELVGAVGDLAVGLEHELRERVQVNAYA 137 Query: 115 -FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 + V G H D +D ++Q +GR+RWR+ ++ +L E + + L Sbjct: 138 GWGVTH-GFDVHWDDHDAIVVQVSGRKRWRIHGFTRVAPMVRDVELPPRPEGEPLDEFVL 196 Query: 174 EPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 E G++LY+P G H+ A+ E +++ ++G +L++ AD + E + D Sbjct: 197 EAGEVLYLPRGCWHDVSAVGEESLHLTIGVNRATGVDLVAWLADQLRGDE---AFRGD-- 251 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 PR A + +LR +LE ++ ++ + +Q+ + P Sbjct: 252 -LPRFGTAAEQAEHAAQLRAGLLERLDD-GVVARFLADRDAQAPAVEHVGLPWTATSAMI 309 Query: 293 IYDALKQGEVLVRLGGLRVLRIGDDVYAN--GEK--IDSPHRPALDALASNIALTAENFG 348 D EVL+ + R GD V G++ P L AL A T + Sbjct: 310 PED--DGAEVLLLAPRAVLSREGDAVVLAAVGKRLVFAGAAEPVLAALLGGRARTVTSLA 367 Query: 349 ----DALEDPSFLAMLAALVNSG 367 AL+ + A+L L G Sbjct: 368 EAGGPALDRVTVRALLGELAAQG 390 >UniRef50_A5PK74 Lysine-specific demethylase NO66 n=1 Tax=Bos taurus RepID=NO66_BOVIN Length = 667 Score = 131 bits (331), Expect = 3e-29, Method: Composition-based stats. 
Identities = 45/241 (18%), Positives = 88/241 (36%), Gaps = 15/241 (6%) Query: 10 PDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEVDS----RLVSHQDGKWQVS 64 F R W++ V+++R +++ S L + EV + +G+ + Sbjct: 244 DHFYRRLWEREAVLVRRQDHSYYQGLFSTAVLDSILRNEEVQFGQHLDAARYINGRRETL 303 Query: 65 HGPFESYD----HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 + P + L SL + + + +E + Sbjct: 304 NPPGRALPAAAWSLYRAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQ 363 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC---PHPDLLQVDPFEAIIDEELEPGD 177 G PH D + F++Q GR+ WRV + P+ Q D E ++ LEPGD Sbjct: 364 GFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGD 423 Query: 178 ILYIPPGFPHEGYALE--NAMNYSV-GFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 +LY P GF H+ + ++++ ++ F+ + + +Q + N +P Sbjct: 424 LLYFPRGFIHQAECQDGVHSLHLTLSTFQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLP 483 Query: 235 P 235 Sbjct: 484 R 484 >UniRef50_A1KTI5 Putative uncharacterized protein n=2 Tax=Neisseria meningitidis RepID=A1KTI5_NEIMF Length = 382 Score = 130 bits (328), Expect = 7e-29, Method: Composition-based stats. 
Identities = 56/301 (18%), Positives = 111/301 (36%), Gaps = 33/301 (10%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ + +F E + K+P + K + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMEYKEFNENYLYKKPFIFKNALD--VSSISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM------RPFRELPDWRIDDLMIS 114 ES++ +G+ + AV + + A ++ PF + +I + Sbjct: 59 IIPKEAYVESFNDVGKIRYRFNKTAVYQYLQDGATMVYNRIDNEPFVDSIAKQIAQFAQA 118 Query: 115 FSVPGG--GVGP------HLDQYDVFIIQGTGRRRWRVGE-----KLQMKQHCPHPDLLQ 161 +V G G H D DVF +Q G + W + L M+Q P Sbjct: 119 QTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGTKHWTLSAANFDMPLYMQQAKDIP--HI 176 Query: 162 VDPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQ 220 P ++ LE GDILYIP G+ H + + ++G PN + + Sbjct: 177 TPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCETFHLAIGTFPPNGYNYMEWLMKKIPD 236 Query: 221 RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 + + + +D + + E++N P++++ + +F+ R Sbjct: 237 IQSIRQNF--IGWQH-------DQKNLDDAAQAITEMMNNPKNYQTFMQDFLGSQRTNTA 287 Query: 281 I 281 Sbjct: 288 F 288 >UniRef50_Q4D641 Putative uncharacterized protein n=1 Tax=Trypanosoma cruzi RepID=Q4D641_TRYCR Length = 476 Score = 130 bits (327), Expect = 9e-29, Method: Composition-based stats. 
Identities = 53/263 (20%), Positives = 97/263 (36%), Gaps = 33/263 (12%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRG----FNNFIDPI-----SPDELAGLAMESEV--- 49 ++ L +F +++K+P+ G F D + S + + LA E + Sbjct: 30 QWLLGKTQKEFFRHYFEKKPLHFSHGAATHFTEVQDGLPAVKWSTELMLQLAAEKSLSYT 89 Query: 50 -DSRLVSHQDGKWQVSHGPFESYDHLGET--------NWSLLVQAVNHWHEPTAALMRPF 100 D +V Q PF S + E WS+ + + +A++ Sbjct: 90 TDINIVRFDAV--QKKRVPFRSEGIVTEKEMKHSMRKGWSVRFLRPHEYIVENSAVLAML 147 Query: 101 RELPDWRIDDLMISFSVPG--GGVGPHLDQYDVFIIQGTGRRRWRVGEK---LQMKQHCP 155 E + S+ P G PH D DVF++Q G + WR+ + + + Sbjct: 148 EEAFACSCG--LNSYWTPANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPERVDVLSRHS 205 Query: 156 HPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA---PNTRELIS 212 D + + L PGD+LY+P G H+G +A + V F A +L+ Sbjct: 206 SEDYNPEELPKPTQIFRLFPGDVLYMPRGTVHQGRKYNHAHSLHVTFSANQMNTWADLMK 265 Query: 213 GFADYVLQRELGGNYYSDPDVPP 235 +V+++ + +P Sbjct: 266 HAVTHVVEKLAANYIHWRRSLPR 288 >UniRef50_A0QI05 Cupin superfamily protein n=4 Tax=Mycobacterium avium complex (MAC) RepID=A0QI05_MYCA1 Length = 361 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. 
Identities = 52/279 (18%), Positives = 95/279 (34%), Gaps = 19/279 (6%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISP-----DELAGLAMESEVDSRLVSHQDGK- 60 L FL W R + R + D + P D L RLV + + Sbjct: 14 LGVDAFLNEIWATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRD 73 Query: 61 ---WQVSHGPFESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 ++ G + ++L++ + + A+L ++ Sbjct: 74 PAGYRRGDGTLNAGGARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVT 133 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE--KLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G PH D +DV ++Q G + WRV + + +Q + P + D L Sbjct: 134 PPHSTGFVPHYDPHDVLVLQIEGCKTWRVSDEPPVPPQQIQSRKGVGADGP-ASRTDVCL 192 Query: 174 EPGDILYIPPGFPHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 PGD+LY+P G H E +++ +VG AP L++ + R+ D Sbjct: 193 RPGDVLYLPRGQVHSARTHSEPSVHLTVGLHAPTVLTLVTSALHALSLRDP---RVHDR- 248 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEF 271 +PPR + + + + ++ G Sbjct: 249 LPPRHLDDAQVRAGLGEAVRDAVRALDDDAVIADGLGAM 287 >UniRef50_D2VJG1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VJG1_NAEGR Length = 542 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. 
Identities = 42/232 (18%), Positives = 88/232 (37%), Gaps = 26/232 (11%) Query: 7 LNWPDFLERHWQKRPVVLKRG--FNNFI-DPISPDE----LAGLAMESEVDSRLVSHQDG 59 ++ + +KR +V++R + ++ S DE L + D L +++G Sbjct: 111 IDMDKLYQEFVEKRVLVIRRNEVYPDYYKGLYSLDEIKKTLVDHELRYSYDLDLALYRNG 170 Query: 60 KWQVSH-------GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM 112 + + P +D S+ + + +L+ F E + Sbjct: 171 RRFTLNPNKDDVADPTLVWDLYENEKCSIRMLRPQEHSDVLLSLLCHFEEYFG--QGAGL 228 Query: 113 ISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI-- 168 ++ P G G PH D + F+IQ G + W++ L+ +Q+ E Sbjct: 229 NAYLTPAGSQGFAPHYDDIEAFLIQLEGEKHWKIYRPLENQQYLDRFSSKNFTQEEVAGF 288 Query: 169 --IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYV 218 + L+PGD+LY+P G H+ ++ + + + + + DY+ Sbjct: 289 ECFEILLKPGDMLYVPKGVIHQAVTSQDQHSLHITVSTSH----LMSWTDYL 336 >UniRef50_C3ZLE4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZLE4_BRAFL Length = 572 Score = 127 bits (319), Expect = 7e-28, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 69/181 (38%), Gaps = 15/181 (8%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN----FIDPISPDELAGLAMESEVD----SRLVSHQDG 59 + F +W+K+P++ KR + S D L L + +++ + + G Sbjct: 190 TYEQFFAEYWEKKPLIAKRNDAAVSEAYKALFSRDVLKKLLKKHDIEYIRDVNVCRYVSG 249 Query: 60 KWQVSHGP----FESYDHL-GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 K + +G + D L ++ +L + + L L + + Sbjct: 250 KRESLNGTERATCKQIDKLFDQSKATLQFHQPQRFQDKLWQLCSLLECLFGCLVGAN-VY 308 Query: 115 FSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 + PG G+ PH D +VFI+Q GR+ WR+ DL Q + + D L Sbjct: 309 MTPPGSQGLAPHYDDVEVFILQLEGRKHWRLYTPPVDLPRDYSRDLEQDNIGQPTHDFVL 368 Query: 174 E 174 E Sbjct: 369 E 369 >UniRef50_O01658 Lysine-specific demethylase NO66 n=3 Tax=Caenorhabditis RepID=NO66_CAEEL Length = 748 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. 
Identities = 64/360 (17%), Positives = 128/360 (35%), Gaps = 49/360 (13%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLA----MESEVDSRLVSHQDGKWQ 62 + F ++ +Q +V++R F + S L L +E + + +++G Sbjct: 316 DVQTFFDKFYQSNVLVVRRKQPTYFGNLFSTARLGELLEKNHLEYGRNINIAQYKNGVRT 375 Query: 63 VSHGPFESYDHLGETN----WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 +G +Y + + + S+ + + + L +E + + P Sbjct: 376 TLNGQGRAYPQIVKQHLHNMCSVQLVNPQTYDDRIWYLCEVIQEQFGCFVGANT--YLTP 433 Query: 119 GG--GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---HPDLLQVDPF--EAIIDE 171 G G PH D+ D F++Q GR+ WRV ++ P + + D E + + Sbjct: 434 AGSSGFAPHWDEIDAFLLQVEGRKYWRVWAPESAEEELPLESSDNFTEDDMKGREPVFEG 493 Query: 172 ELEPGDILYIPPGFPHEGYALENAMNYSVGF---RAPNTRELISGF----------ADYV 218 +E GD++YIP G+ H+ + V R + L+ + Sbjct: 494 WIEKGDMIYIPRGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKVVPEAIGVLTDTRHK 553 Query: 219 LQRELGGNYYS-----DPDVPPRAHPADVLPQEMDKLREMMLELINQ-----------PE 262 L+R L + D D H + +D+ M+ L+ E Sbjct: 554 LRRGLPTGLFDMGGVIDLDYSQEDHFVEKFKMVVDRHMSMLRNLVADQLLESSVDSLAKE 613 Query: 263 HFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV-LVRLGGLRVLRI-GDDVYA 320 KQ +++ +L + D++ D + +V L+R R+L D + Sbjct: 614 FMKQALPPRLTEQEKKLSVLGSSTNLLGDDLVDFTARTKVRLIRRHTQRLLMESEDACFI 673 >UniRef50_UPI0000523E0E PREDICTED: similar to MYC induced nuclear antigen n=1 Tax=Ciona intestinalis RepID=UPI0000523E0E Length = 490 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. 
Identities = 43/288 (14%), Positives = 96/288 (33%), Gaps = 36/288 (12%) Query: 8 NWPDFLERHWQKRPVVL---KRGFNN--------------FIDPISPDELAGLAM----E 46 + F E +W++R + + K G + F + + L + + + Sbjct: 40 SIERFYEYYWEQRHLYIPCLKSGSGDNCQDKRLPGTRSSYFNKLFNHEILKEVVLSKKLK 99 Query: 47 SEVDSRLVSHQDGKWQVS----HGPFES---YDHLGETNWSLLVQAVNHWHEPTAALMRP 99 + D D K HGP + + + +L +H+ + Sbjct: 100 YDKDICACRFDDEKKCRVNAEVHGPVTAEKVHSLFHDDKMTLQFHQPQRFHDELWKIQEK 159 Query: 100 FRELPDWRIDDLMISFSVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD 158 ++ + + G G+ PH D +VFI+Q G + W++ + D Sbjct: 160 LESFFGSQVGSN-VYMTPDGSQGLAPHHDNVEVFILQLEGEKEWKLYSPVVNLPRNSSSD 218 Query: 159 LLQVDP-FEAIIDE-ELEPGDILYIPPGFPHEGYA---LENAMNYSV-GFRAPNTRELIS 212 ++D ++PGD+LY P G H+ + ++ + ++ + + I Sbjct: 219 FDDSTVKGLTLLDTIIMKPGDVLYFPRGTVHQAKSIKGTGHSTHLTISTYETQCWGDYIL 278 Query: 213 GFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQ 260 F Y+ +P + + L++ ++ L Sbjct: 279 DFIPYLTDAAADKVVSLRRGLPRKYYNQTSTDGFKQNLKKALISLAES 326 >UniRef50_C6SMA2 Myc induced nuclear antigen n=24 Tax=Neisseria RepID=C6SMA2_NEIME Length = 382 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. 
Identities = 54/301 (17%), Positives = 106/301 (35%), Gaps = 29/301 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ F E + K+P + K+ + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMERKYFHENYLYKKPFIFKKALD--VSCISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM------RPFRELPDWRIDDLMIS 114 ES++ +G AV + + A ++ PF + ++ + Sbjct: 59 IIPKEAYVESFNDVGRIRHRFNKTAVYQYLQDGATMVYNRIDNEPFVDTIAKQVAQFAQA 118 Query: 115 FSVPGG--GVGP------HLDQYDVFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDP 164 +V G G H D DVF +Q G++ W + D+ + P Sbjct: 119 QTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGKKHWTISAPNFDMPLYMQQAKDMPHITP 178 Query: 165 FEAI-IDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRE 222 + + ++ LE GDILYIP G+ H + + ++G PN + + + Sbjct: 179 SKTVDMEVILEAGDILYIPRGWWHNPMPMNCETFHLAIGTFPPNGYNYMEWLMKKIPDIQ 238 Query: 223 LGGNYYS--DPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 + + D A + EMM N + + G + + +D Sbjct: 239 SIRQNFIGWEHDQKNLDDSA-------QAVTEMMNNPKNYQAFMQDFLGNQRTNTAFNMD 291 Query: 281 I 281 + Sbjct: 292 L 292 >UniRef50_Q4Q6P0 Putative uncharacterized protein n=3 Tax=Leishmania RepID=Q4Q6P0_LEIMA Length = 624 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. 
Identities = 40/261 (15%), Positives = 86/261 (32%), Gaps = 30/261 (11%) Query: 3 YQLTLNWPDFLERHWQKRPV--------VLKRGFNNFIDPISP------DELAGLAMESE 48 + L + +F ++++++ + G + P++ + + Sbjct: 169 WLLNTSRGEFFRKYFERKHLVASHGSGEYFASGLPGVVPPVNWSTERMLEHVKTHPSRYG 228 Query: 49 VDSRLVSH----QDGKWQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRE 102 D +V + + G + + + WS+ N + E +A + + Sbjct: 229 ADLDIVKFDPKLKRRVSYRTKGLVDAAELEACMKDGWSVRFLRPNEFIESNSAFIGCIEK 288 Query: 103 LPDWRIDDLMISFSVPG--GGVGPHLDQYDVFIIQGTGRRRWRVGEK---LQMKQHCPHP 157 + S+ P G PH D DVF +Q G + W + + + + Sbjct: 289 EFNCYCGAN--SYWTPANSQGFAPHYDDVDVFFLQLEGEKLWCLYDPPEDVDVLARHSSE 346 Query: 158 DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA---PNTRELISGF 214 D L+ GD+LY+P G H+G + + F A + + +S Sbjct: 347 DYAPERFPTPKHTITLKAGDVLYMPRGTVHQGKTTLKTHSLHITFSANQMNSWADFMSRA 406 Query: 215 ADYVLQRELGGNYYSDPDVPP 235 A Y ++ +P Sbjct: 407 AQYTVETLAANKLEWRRALPR 427 >UniRef50_UPI000192614C PREDICTED: similar to chromosome 14 open reading frame 169 n=1 Tax=Hydra magnipapillata RepID=UPI000192614C Length = 388 Score = 124 bits (312), Expect = 4e-27, Method: Composition-based stats. 
Identities = 39/231 (16%), Positives = 87/231 (37%), Gaps = 15/231 (6%) Query: 76 ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG-GGVGPHLDQYDVFII 134 E S+ + + + L +E + + + PG G PH D + F+I Sbjct: 41 EDGCSIRLLNPQIFAKSVHQLTSRLQEYFGCLVGSN-VYLTPPGSQGFAPHYDDIEAFVI 99 Query: 135 QGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYA 191 Q G++ W++ + ++ + + E I+++ LE GD LY P G H+ Sbjct: 100 QLEGKKHWKLYPPRNTNEVLARYSSENMQEENLGEPILNKVLEAGDTLYFPRGVIHQAST 159 Query: 192 LENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP------ADV 242 LE++ + + ++ + + + LQ+ + N +P ++ Sbjct: 160 LEDSHSLHITISLYQKSSWGDYLEKLIPLALQKAISENVMFREGLPIDFSSFVGVSNSEK 219 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 E D + + +L+ + + + E + + P Y D++ Sbjct: 220 KCPERDTFVKTVKKLMEKLIDYVE-IDEAGDELVLDHMEEFQPPYYSSDDV 269 >UniRef50_Q0ALX3 Cupin 4 family protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0ALX3_MARMM Length = 394 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 54/379 (14%), Positives = 117/379 (30%), Gaps = 33/379 (8%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNN----------FIDPISPDELAGLAMESEVDSRLVSH 56 ++ F + +K+P+++ R D +S +L A++ V Sbjct: 18 IDRETFFRDYHEKKPLIVHREDPGRYAGLLSIARIDDIVSSIDLREGALDMARSEPPVQR 77 Query: 57 QD----GKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM 112 +D + G Y ++++ ++ R L + + Sbjct: 78 EDYMFDTGYVDRGGVANQY----RQGATIILPQLHMMDAVLGEFCRAVESLLSCHVQTNI 133 Query: 113 ISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDE 171 G H D +DVF++Q G + WR E ++ E + + Sbjct: 134 YLTPPDNQGFNTHYDDHDVFVMQIEGEKLWRFYETPVENPYRGEGFRPDAHKAGEPVAEF 193 Query: 172 ELEPGDILYIPPGFPHEGYALEN--AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS 229 L+ G+ +Y+P G H+ + +++ ++G +L+ V R + Sbjct: 194 VLKAGECIYVPRGLMHDAQTHGDTASLHITLGLIVKTWADLMLEAVSEVALRTPAMRHSL 253 Query: 230 DPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQ 289 P + + EM+ ++ + FI P Sbjct: 254 PPGFARPDFDRTDAEVQFRDMAEMLAREMSVDGAMDFFVDSFIRSR------VPNTRGAI 307 Query: 290 
PDEIYDALKQGEVLVR--LGGLRVLRIGDDVYAN--GE-KIDSPHRPALDALASNIALTA 344 + + +R + ++V GE + + L L +TA Sbjct: 308 SNYLAPNSASQTFKLRPFVPWRFAGDETENVIITAGGEVRFPAEAEQGLHTLLDGGTVTA 367 Query: 345 ENFGDALEDPSFLAMLAAL 363 +F +D + + L L Sbjct: 368 ASFV-GQDDSAAIETLGKL 385 >UniRef50_B8BSJ2 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BSJ2_THAPS Length = 830 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 86/250 (34%), Gaps = 26/250 (10%) Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 P + + ++ ++ +L + + ++ +++ + + G PH Sbjct: 403 PTDVWTNVDASHCTLRLLRPHEHNDNIHSMLSLLESEFGCMVGSNAYLTPLHSQGFAPHY 462 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP-------FEAIIDEELEPGDIL 179 D DVFI+Q G +RWRV + ++ P E ++D L PGD+L Sbjct: 463 DDVDVFILQLEGYKRWRVYAPMNKQETLPRVSSRDYTEKEVEESMGEEVLDVVLVPGDVL 522 Query: 180 YIPPGFPHEGYAL------------ENAMNYSVGFRA-PNTRELISGFADYVLQRELGGN 226 Y+P G+ H+ + +A + + A N + F + ++ L Sbjct: 523 YLPRGWIHQAETVARPSHVSKLPGITDAHSLHLTVSAMQNW--CWADFLEILMPEALESA 580 Query: 227 YYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP 286 S+ + R LP+ M +L ++ + + + + Sbjct: 581 SASETSISLRDG----LPRNFLAYMGTMHQLDDEGGELPEGLKQVAEAYAKRVAKGADKD 636 Query: 287 PYQPDEIYDA 296 DE Sbjct: 637 EEIDDEALAE 646 >UniRef50_B9BV10 Cupin superfamily protein n=5 Tax=Proteobacteria RepID=B9BV10_9BURK Length = 384 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. 
Identities = 67/358 (18%), Positives = 121/358 (33%), Gaps = 47/358 (13%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M +++ DF + +KRP+++K + + S ++ + S V S Sbjct: 1 MSISFSVSPKDFALDYQEKRPLLMKGAVS--LRNFSWRDVNEIFERSNVASDDFKLTFDG 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM---------------RPFRELPD 105 + ES+ +G L+ V + A L+ + Sbjct: 59 IRPKSEYIESWWDIGTLRHRLIKPVVYDYLRKGATLIANKIATEPKVNQLSRQLIEFTGR 118 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD-- 163 + ++F H D DVF IQ GR+RW + E ++ + Sbjct: 119 QVVSSAYLAFG-ERDSFRCHWDTRDVFAIQLIGRKRWVLYEP-SLEAPLYMQQSKDYEGL 176 Query: 164 ---PFEAIIDEELEPGDILYIPPGFPHEGYALENA-MNYSVGFRAPNTRELISGFADYVL 219 P +D LE GD+LY+P G+ H + A + + G + +S + + Sbjct: 177 YPCPDTPYMDVMLEAGDLLYLPRGWWHNPLPVGEATFHLAFGTFPAYVIDYLSWAINRMP 236 Query: 220 QRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHEL 279 SD + + + + + I E+++++ + R E Sbjct: 237 HLLDARRSLSD---------WENDKNVLASIGQQFEDFICTRENYRRFLDDRTGAIRIET 287 Query: 280 DIA-----PPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA 332 D+A P PDE L R G R V ANG K++ + A Sbjct: 288 DLALETLGNPAVSAIPDESRIRLSA----YRPPG----RDDRYVIANGTKVNLDDQGA 337 >UniRef50_Q7T3G6 MYC induced nuclear antigen-like n=6 Tax=Euteleostomi RepID=Q7T3G6_DANRE Length = 528 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. 
Identities = 38/212 (17%), Positives = 78/212 (36%), Gaps = 22/212 (10%) Query: 7 LNWPDFLERHWQKRPVVLKR---GFNNFIDPISPDELAGL------AMESEVDSRLVSHQ 57 L+ +F +R W+++P+VL R + + P L+GL ++ D Sbjct: 67 LDLQEFFQRFWERQPLVLHRSDAALAGYYGSLFP--LSGLRRLCARGLQYGTDINTCRCV 124 Query: 58 DGKWQVSH--GPFE----SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 G+ ++ + G + D L E ++ + + + + Sbjct: 125 RGQKRLLNRAGAVDFCLLERDFL-EKKATIQFHQPQRFQDELWRIQERLECFFGCLVGSN 183 Query: 112 MISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDE 171 + G+ PH D +V I+Q G++ WR+ E + + D Sbjct: 184 VYITPAGAQGLPPHYDDVEVLILQLEGQKHWRLYEPTVPLAREYSLE-PEGRIGAPTHDF 242 Query: 172 ELEPGDILYIPPGFPHEGYA---LENAMNYSV 200 L+ GD+LY P G H+ ++ + ++ Sbjct: 243 ILQAGDLLYFPRGTIHQADTPAGAGHSTHLTL 274 >UniRef50_B7G6P1 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6P1_PHATR Length = 351 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 57/337 (16%), Positives = 110/337 (32%), Gaps = 45/337 (13%) Query: 76 ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG-GGVGPHLDQYDVFII 134 + ++ + + + T L+ I G G PH D + F + Sbjct: 6 DQGCTIRLLCPHKHSDSTHGLLSLLESEWTCMIGANAYLTPPSGSQGFAPHYDDIEAFCL 65 Query: 135 QGTGRRRWRVGEKLQMKQHCPHPDL-----LQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 Q G++RW+V LQ + P + E +D L+PGD+LY+P G+ H+ Sbjct: 66 QLEGKKRWKVYAPLQKSERLPRTSSEDYVEADLRDVEPALDVVLKPGDVLYMPRGWIHQA 125 Query: 190 YALEN----AMNYSVGFRAP-NTRELISGFADYVLQRELGGNYYS-DPDVPP-------- 235 ++ +++ +V +L+ LQ G+ +P Sbjct: 126 CTIDGTDGYSLHLTVSAMQQWAWADLMELLLPEALQSAASGDSTMLRQGLPRGFLNYMGA 185 Query: 236 ---RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGE----FISQSRHELDIAPPEPPY 288 + A++L Q+ ++ R ++ + + F+S + Sbjct: 186 MYDQKDTAEILEQKAEQDRTAAMDETGAIDMLDAACDQIGKRFLSDRVPPVLTHLERSMT 245 Query: 289 QPDEIYDALKQ---------GEVLVRLGGLRVLRI---GDDVY----ANGEKIDSPHRPA 332 + L Q LV G VL VY + + + PA Sbjct: 246 VHESDAKVLPQTLCRMARPGSGRLVLEAGKAVLYHCADNSRVYHELPLSPMEFEMDDAPA 305 Query: 333 LDALASNIALTAENFGDALEDP--SFLAMLAALVNSG 367 ++ L + D + D + + AL + G 
Sbjct: 306 MEQLLTTTEHDWVRVADLIHDSIEDKVGVAQALYDEG 342 >UniRef50_A9V5A3 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V5A3_MONBE Length = 595 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 42/285 (14%), Positives = 87/285 (30%), Gaps = 51/285 (17%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNN----FIDPISPDELAGLAMES----EVDSRL 53 + + F ++H+++ + + R + + S D+L L D L Sbjct: 54 DILAPMTTQQFFDKHFERSFLYIPREDRDPGIVYQGLFSLDQLYTLLQRESMFYGTDLNL 113 Query: 54 VSHQDGKWQVSHGPFESYDHLG-------------------------------------- 75 + + V +G L Sbjct: 114 CRYDGERKLVLNGGRNDTTDLPTINGNHSNSQRAEEQDSNDSDDSDELAEEALAADVRRR 173 Query: 76 --ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFI 133 + ++ + L+ F + + + G+ PH D +V+I Sbjct: 174 VEDLKATVQFHQPQRFVRALHDLLYSFEQELTTLVGANVYITPANSQGLAPHHDDVEVYI 233 Query: 134 IQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE 193 +Q G + WR+ E ++ DL + + + I + L PGD LY+P G HE + Sbjct: 234 LQLEGEKAWRLYEPIEPLAMSYSADLDREELAQPIAELVLRPGDFLYLPRGTIHEASCVG 293 Query: 194 NAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDVPP 235 N + + + N L++ + + +P Sbjct: 294 NQHSTHITISSHQNWNYGHLMAQTLPECITNAMSNVLELRRGLPH 338 >UniRef50_Q016L9 [S] KOG3706 Uncharacterized conserved protein n=1 Tax=Ostreococcus tauri RepID=Q016L9_OSTTA Length = 455 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. 
Identities = 47/247 (19%), Positives = 85/247 (34%), Gaps = 29/247 (11%) Query: 37 PDELAGLAMES---EVDSRLVSHQDGKWQVSHGPFESYD-----------------HLGE 76 D A LA +D+ + S+ DG + + +++D E Sbjct: 47 ADVEATLASRDARYGIDADVTSYVDGVRRTHNSNDDTHDVCDEASNEIVDAKAVMRAYRE 106 Query: 77 TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQG 136 S+ + H+ T ++ + + G PH D D +++Q Sbjct: 107 RGRSIRLLHPQTRHDATWKMLATLESHFECACGCNVYVTPANAQGFAPHYDDIDAYVLQI 166 Query: 137 TGRRRWRVGEKLQMKQ--HCPHPDLLQVDP--FEAIIDEELEPGDILYIPPGFPHEGYAL 192 G +RWRV Q + + Q + E + D LE GD LYIP GF H+ Sbjct: 167 EGEKRWRVYAPFQSDELPRTSSKNYTQEEIAGLEVLFDGVLEAGDFLYIPRGFVHQAECS 226 Query: 193 E--NAMNYSVGFRAPNTRELISGFADYVLQRELGGN---YYSDPDVPPRAHPADVLPQEM 247 ++++ ++ NT A + R L + P + + Q+M Sbjct: 227 SRAHSVHATISTNQANTHADAFEIATQTIARSLIDESKWLRRNIYRPHQGPRRCLDMQQM 286 Query: 248 DKLREMM 254 + + Sbjct: 287 GQAVLAL 293 >UniRef50_Q47NS9 Putative uncharacterized protein n=1 Tax=Thermobifida fusca YX RepID=Q47NS9_THEFY Length = 395 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. 
Identities = 62/346 (17%), Positives = 120/346 (34%), Gaps = 44/346 (12%) Query: 32 IDPISPDELAGLAMESEVDS-RLVSHQDGKWQVSHGPFESYDHLGE-------------- 76 ++ D L+ L +++ RL H+ G + P ++Y +GE Sbjct: 38 APLLTFDALSELLSTHQLEPPRLRLHRAG----APVPLDNYTEVGEASGVQRRLVRPEAL 93 Query: 77 -----TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDV 131 SL++ ++ H P A L R+ + G H D +D Sbjct: 94 YAQLRQGASLVLDGIDRIHPPIRAAADDLMRLVHERVQVNLYLIWGDSHGFNTHWDDHDT 153 Query: 132 FIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAII-DEELEPGDILYIPPGFPHEGY 190 FI+Q G + W+V + P E + + + G++L++P G+ H Sbjct: 154 FIVQVAGTKHWQVHGQGTRPYPMKEDIDHSHQPPEGTVWEGTVRAGEVLHVPRGWWHTVT 213 Query: 191 ALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDK 249 + +M+ + GF + D + + E ++ D+P A P + + + Sbjct: 214 GTGDVSMHLTFGFTRATGVDWARWLVDRLYEVE-----FARRDLPRFATPEERRKHQHEL 268 Query: 250 LREMMLELINQPEHFKQWFG----EFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 LR +M + + ++ F + L A QPD + + +R Sbjct: 269 LRHLMD--LAEQHGLDEFLTDRDSRFPRRQSFSLPWAVDGATPQPDTVVEFTPILPSALR 326 Query: 306 LGGLRVLRIGDDVYANGEK--IDSPHRPALDALASNIALTAENFGD 349 G +V + G + +P L+ L LT + Sbjct: 327 DEGQKVA-----LTVAGRRYTFAKAAQPLLEVLVDARVLTVAELAE 367 >UniRef50_A4RZ92 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RZ92_OSTLU Length = 515 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. 
Identities = 39/211 (18%), Positives = 69/211 (32%), Gaps = 8/211 (3%) Query: 73 HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVF 132 GE S+ + + T ++ + + G PH D D F Sbjct: 162 RFGEERRSVRLLHPQTRCDATWKILATLERYFECACGCNVYVTPASSQGFAPHYDDIDAF 221 Query: 133 IIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE-----AIIDEELEPGDILYIPPGFPH 187 ++Q G +RWRV E + + H P E + D+ LE GD LY+P G+ H Sbjct: 222 VLQIEGAKRWRVYEPFEDETH-PRTSSRNFTQEEIATQRVVFDDVLEAGDFLYLPRGWIH 280 Query: 188 EGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 + + + + N + + L L ++ + Sbjct: 281 QAECSSSTHSVHATL-STNQSNAPADALEIALNNALASTIDGRAELRRSFVSTLNDERRR 339 Query: 248 DKLREMM-LELINQPEHFKQWFGEFISQSRH 277 D E + EL + FG + + + Sbjct: 340 DAALEGLGEELRAFASELQGDFGAALVEHAY 370 >UniRef50_D1TSY0 Conserved domain protein n=19 Tax=Yersinia pestis RepID=D1TSY0_YERPE Length = 102 Score = 111 bits (278), Expect = 5e-23, Method: Composition-based stats. Identities = 51/100 (51%), Positives = 69/100 (69%) Query: 272 ISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRP 331 ++ RHELDIAP +PPY DEI DAL +G VL RLGGLRVLR+GD+V+ N E+++ + Sbjct: 1 MTTPRHELDIAPAQPPYDQDEIVDALMEGAVLTRLGGLRVLRVGDNVFINSERLEMANAE 60 Query: 332 ALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYWFF 371 A DAL + + G+AL+D +F+ L L+N GYWFF Sbjct: 61 AADALCRYTIIGKKELGEALQDSAFVTELTELINQGYWFF 100 >UniRef50_Q2JG11 Cupin 4 n=3 Tax=Actinomycetales RepID=Q2JG11_FRASC Length = 406 Score = 108 bits (271), Expect = 3e-22, Method: Composition-based stats. 
Identities = 56/308 (18%), Positives = 109/308 (35%), Gaps = 13/308 (4%) Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 L ++ +L++ AVNH+ R F+ + + + G H D + Sbjct: 94 RLAGLLQSGCTLVLDAVNHFDPTLEVACRAFQWWLRAPVQANVYLTTGDAAGFSLHWDDH 153 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 DV ++Q G + W V + P + + + GD+LYIP G H Sbjct: 154 DVIVLQLAGDKEWEVRGPSRRAPMYRDAAPNTEPPKDIVWSGTVNTGDVLYIPRGHWHRA 213 Query: 190 --YALEN--AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 + + +++ + GF + ++ AD + E+ + P+ H D + Sbjct: 214 SRTSRGDGFSLHATFGFTRRTGVDWLAWLADQSRREEVFREDLNQRGEDPKEHQND-GEK 272 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRH---ELDIAPPEPPYQPDEIYDALKQGE- 301 + ++ + P H+ + S R+ PP + ++ Sbjct: 273 IIVAASRLLTS--HPPAHYLESVAHATSAGRYVSTAGIFGPPSAVVCVTDFPPQIETQGD 330 Query: 302 -VLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360 V V R++ + A G + S + LD ++S + G+ L A L Sbjct: 331 TVAVATAEKRIVFTRKALPALG-LLLSGNPVCLDYVSSAAGIDGARLGEILVREGICAEL 389 Query: 361 AALVNSGY 368 + SGY Sbjct: 390 TPELFSGY 397 >UniRef50_A1SPZ0 Cupin 4 family protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SPZ0_NOCSJ Length = 403 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. 
Identities = 76/403 (18%), Positives = 137/403 (33%), Gaps = 71/403 (17%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPIS---PDELAGLAMESEVDSRL-------- 53 L+ + FL + W R + + G + P S PD L GL ++ D L Sbjct: 27 LSGDAQTFLAKVWASRVHLHRSGPADPDSPGSADGPDSLVGLFALADADHLLTSSAVRTP 86 Query: 54 -VSHQDGKWQVSHGPFESYDHLG-----------------ETNWSLLVQAVNHWHEPTAA 95 + + + L + +++ Q ++ + P Sbjct: 87 SIRLAKDGAVLPESAYTRRASLAGKPLTGLVDARKALALFDDGATVVFQGLHRYWPPLTR 146 Query: 96 LMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP 155 L+ G H D +DVF+ Q G +RW V Sbjct: 147 LIARLELELGHPCQANAYLTPPGAQGFAVHSDSHDVFVFQTAGSKRWEVHGP-------- 198 Query: 156 HPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTREL---- 210 + + LEPG +Y+P G PH A + +++ ++G R L Sbjct: 199 ----------DGPEEVLLEPGVSMYLPTGTPHAARAQDTVSLHVTLGINQLTWRGLVERT 248 Query: 211 ISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGE 270 ++G V L Y DP A P L +++L + + ++ + Sbjct: 249 VAGALGEVADEHLPAGYLDDP--AALAGP---LADRLERLADAVRR-LDATAAVEAEVRR 302 Query: 271 FISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGG--LRVLRIGDDV-YANGEK--- 324 F++ LD + + + +L R G +L G+ V G++ Sbjct: 303 FLTSRPPRLDGGLHD-----VLAHGTITDTTLLRRRPGHPCVLLDRGERVEVLLGDRSLT 357 Query: 325 IDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 + + RPAL+A+ + LT + L++ S L + LV G Sbjct: 358 VPAWIRPALEAVRARGELTPADLP--LDEQSRLVLCRRLVREG 398 >UniRef50_P46327 Uncharacterized protein yxbC n=1 Tax=Bacillus subtilis RepID=YXBC_BACSU Length = 330 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. 
Identities = 48/282 (17%), Positives = 99/282 (35%), Gaps = 31/282 (10%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEV-DSRLVSHQDGKWQVSHGPFE 69 +FLE +W +P+V + F +++ L + ++ ++ D + S G + Sbjct: 20 EFLEEYWPVKPLVARGEVERFTSIPGFEKVRTLENVLAIYNNPVMVVGDAVIEESEGITD 79 Query: 70 SYDHLG-------ETNWSLLVQAVNHWHEPTAALMRPFRE---LPDWRIDDLMISFSVPG 119 + E +L + + + + LP ++ + G Sbjct: 80 RFLVSPAEALEWYEKGAALEFDFTDLFIPQVRRWIEKLKAELRLPAGTSSKAIVYAAKNG 139 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ--------------VDPF 165 GG H D Y I Q G + W++ + + H DL + P Sbjct: 140 GGFKAHFDAYTNLIFQIQGEKTWKLAKNENVSNPMQHYDLSEAPYYPDDLQSYWKGDPPK 199 Query: 166 EAIIDEE---LEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRE 222 E + D E L PG +LY+P G H + + + ++ F P +L+ + ++ Sbjct: 200 EDLPDAEIVNLTPGTMLYLPRGLWHSTKSDQATLALNITFGQPAWLDLMLA---ALRKKL 256 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHF 264 + N + + V ++ + L ++ L E Sbjct: 257 ISDNRFRELAVNHQSLHESSKSELNGYLESLIQTLSENAETL 298 >UniRef50_Q091R4 Chromosome 14 open reading frame 169, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q091R4_STIAU Length = 355 Score = 101 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 79/196 (40%), Gaps = 9/196 (4%) Query: 80 SLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP-GGGVGPHLDQYDVFIIQGTG 138 ++++ + EP R + + + + P GV PH D + FI+Q G Sbjct: 49 TVILSGLEETWEPLVVFCRKLEGQLSHPVA-VAVYLTPPNHHGVQPHFDTQENFILQVDG 107 Query: 139 RRRWRVGEKLQMKQHC--PHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE-NA 195 + W+V Q + + + E +++ EL PGD+LY+P GF HE A + + Sbjct: 108 VKHWKVYGAGQELPRVEGSYTPVARERLPELLLETELHPGDMLYVPRGFVHEAEARDSAS 167 Query: 196 MNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMML 255 ++ +V R+ + + R P + +H L + RE+M Sbjct: 168 LHITVDVHVRTWRDFLEDALAAMADRNPRFRKSLPPGLLNGSHAKAQLEE---GFRELM- 223 Query: 256 ELINQPEHFKQWFGEF 271 E++++ G+ Sbjct: 224 EMVHREVRLSDALGKH 239 >UniRef50_UPI0001B4BFC9 putative cupin superfamily protein n=1 Tax=Streptomyces sp. 
C RepID=UPI0001B4BFC9 Length = 394 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 70/374 (18%), Positives = 118/374 (31%), Gaps = 51/374 (13%) Query: 24 LKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS-----HGPF--------ES 70 L+R +F D E+ L + + QV HG Sbjct: 36 LRRAAGDFSDLFGLAEVDVLLTDRALRRPAFRVIRDGAQVPDASCLHGGLLYPDVADPGK 95 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG--GVGPHLDQ 128 L +L+ Q + P A R + + ++ P G G G H D Sbjct: 96 ISGLLAEGATLVFQGLQELTGPLAEFGRRLGHDLGRPV--NVNAYVTPAGSQGFGDHYDT 153 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHE 188 D FI+Q G +RW + + + + + LEPGD L++P G+ H Sbjct: 154 QDSFIVQIHGSKRWTLKDPALAQPLSHETGRPLPEDDGSGRTLTLEPGDCLWLPRGWVHS 213 Query: 189 GYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM 247 + + +++ ++ + E +A + G P ++M Sbjct: 214 ARSTDTASVHLTI-----SLYEWTGHWAWTRIAARAPGLPGRFPLSTDFFRDRAAAEKDM 268 Query: 248 DKLREMMLELINQPE-----HFKQWFG--EFISQSRHEL-DIAPPEPPYQPDE------I 293 LR + E + + + G EF S RH ++ PE + + Sbjct: 269 AALRAELTEWLATADDSALVDLVRAAGAPEFPSPVRHPAREVLSPEADEDAEYTVNAHAV 328 Query: 294 YDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALED 353 +A +G+ LV G R L + LA + L D Sbjct: 329 LNAETRGDRLVLTLGARGLTLP--------------AAMTPLLADLLTLDRFRPCDLPPT 374 Query: 354 PSFLAMLAALVNSG 367 P A+L L G Sbjct: 375 PGTTALLTRLSAEG 388 >UniRef50_A5GJ70 Putative uncharacterized protein SynWH7803_0559 n=1 Tax=Synechococcus sp. WH 7803 RepID=A5GJ70_SYNPW Length = 370 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. 
Identities = 57/306 (18%), Positives = 109/306 (35%), Gaps = 23/306 (7%) Query: 77 TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG--PHLDQYDVFII 134 S++V V+ + L E + ++ ++ P V PH D +D+F + Sbjct: 64 EGASIVVNEVHRFSSQLMDLASSLSEELGVQC--VVNAYLTPPQSVALSPHFDSHDIFAL 121 Query: 135 QGTGRRRWRVGEKLQMKQHCP--HPDLLQVDPFEAII-DEELEPGDILYIPPGFPHEGYA 191 Q G+++W V +L P L + ++ GD++Y+P G H Sbjct: 122 QVVGQKQWFVDSELSSLTTKSTFQPILSADQASSVDFREVVMDEGDVMYLPRGCVHHART 181 Query: 192 LE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV-PPRAHPADVLPQEMDK 249 + +M+ +VG E I+ + + + R HP+ + +D+ Sbjct: 182 ISCQSMHLTVGLYPLEWSEFIASAVEIAASAPEARGLRTSVPLGLKRQHPSFYRQELLDR 241 Query: 250 LREMMLE------LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 L + + L ++ + F G+ S + P D+ ++ + Sbjct: 242 LSGLFTDDVIEKALRSREKEFSA--GQPSSFVGGLDADSWPAGSIASDDAFERCAEAVYF 299 Query: 304 VRL-GGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALE-DPSFLAMLA 361 GLRV+ G K +P R L L+ T E + +A + Sbjct: 300 YPAVSGLRVVSSGYGFTL---KHPNPDR-ILKVLSHKKFFTLGELTGGDECSLNDIACIQ 355 Query: 362 ALVNSG 367 AL+ G Sbjct: 356 ALLRRG 361 >UniRef50_B0KHI4 Cupin 4 family protein n=1 Tax=Pseudomonas putida GB-1 RepID=B0KHI4_PSEPG Length = 303 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. 
Identities = 43/239 (17%), Positives = 78/239 (32%), Gaps = 24/239 (10%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M++ L F+ +QK+P V + + + E+ L + +D + + Sbjct: 4 MDFALP-EMDFFIRSAFQKKPFVFESVTGHQ--LVGWGEINNLLEKDILDYPRIRLANDG 60 Query: 61 WQVSHGP-----------------FESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFR 101 G Y+ L +T +L++ + E Sbjct: 61 IPSERGFKGFVTYTLTVTGETSPHINRYNLLKRLQTGSTLIIDRCQAFFERAQQAASYLS 120 Query: 102 ELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ 161 R + G H D +DV +Q G +RW V + Q Sbjct: 121 THLRCRSGANLYCAWSSTPSFGAHFDNHDVIAVQIEGVKRWEVYAPTRPYPLLNDKSFDQ 180 Query: 162 VDP-FEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYV 218 P E ++ L PG +Y+P G+ H + E +M+ S P +LI + + Sbjct: 181 TPPAGEPMLCHTLTPGQAIYVPAGYWHNVFTETERSMHISFPVVRPRKIDLIRMVLERL 239 >UniRef50_D2PSR6 Cupin family protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PSR6_9ACTO Length = 408 Score = 99 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 74/394 (18%), Positives = 133/394 (33%), Gaps = 55/394 (13%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMES-------------EVDSRL 53 L+W +F E +W + PV+++ P DE+ A+ + V Sbjct: 10 LDWAEFAELYWDRHPVLIRGVRP---VPFRADEVFSAALRARCAEGGGRIAPNASVTVEQ 66 Query: 54 VSHQDGKWQV---SHGPFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 D + S G F+ Y D L ++L++ A + + P R F Sbjct: 67 TVQADRDGLLPAESDGCFDGYERRVGDRLDGRKYALIISAFHAFDFPLWDRERRF-FAGL 125 Query: 106 WRIDDLMISFSV----------PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP 155 W D++ + + VG H D++ F+ R+R R+ + + Sbjct: 126 W--DEVGLPLTSAITTLFHGNYDHSPVGVHKDRFATFMFGLRERKRMRLWTERPWTEQV- 182 Query: 156 HPDLLQVDPFEAI-IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGF 214 ++ + F E+EPGD+LY P + H G SV P T +S Sbjct: 183 -GSVVDYERFLPSSFAVEVEPGDLLYWPASYFHVGENCGRTPATSVNIGVPRTEHRVSYE 241 Query: 215 ADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQ 274 + +L D + + E++ + P Q Sbjct: 242 LEDLLADSDPARLLDDGGRLAVLADG-IDAPMRQEAGEVLPSTV--PPALAQALTAHAKS 298 Query: 275 
SRHELDIAPPE-------PPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDS 327 +++ P P E + L G+++ G + + + ANG I + Sbjct: 299 LTDRVEMVSLRRWTAGGLEPVPPPEPFRPLADGQIVELAQGADLAQYRGALAANGHLITA 358 Query: 328 PHRP-ALDALASNIALTAENFGDALEDPSFLAML 360 P AL L+S ++ DA P+ +L Sbjct: 359 DLPPEALQLLSSGRSVR----VDAANRPALEQLL 388 >UniRef50_D1VL61 Cupin 4 family protein n=1 Tax=Frankia sp. EuI1c RepID=D1VL61_9ACTO Length = 313 Score = 99.6 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 45/247 (18%), Positives = 92/247 (37%), Gaps = 15/247 (6%) Query: 76 ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQ 135 + +L++ + + R R + + G H D +DV I+Q Sbjct: 6 DAGVTLVLDGLETFDPIVEVATRALRWWSGELVQTNAYLTTRSADGFPLHWDDHDVLIVQ 65 Query: 136 GTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN- 194 G + W V + +V EA+ L G++L+IP G+ H+ +E+ Sbjct: 66 LAGEKNWDVRGSTRSAPMFRDAVPNEVASSEAVWQGVLRAGEVLHIPRGYWHQATRVEHD 125 Query: 195 ---AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLR 251 +++ + GF + ++ AD ++EL + +D P A+ +L+ Sbjct: 126 DPVSLHLTFGFTRRTGVDWLTWIADQAREQEL---FRTDLTRSPAEREAE-----RARLQ 177 Query: 252 EMMLELINQPEHFKQWFGEFISQSRHELDIAP-PEPPYQPDEIYDALKQGEVLVRLGGLR 310 + +EL+ F ++R AP ++P+ + + + R G Sbjct: 178 DAAIELV--RSLPPAAFLTARERTRPPARHAPTLPSAHEPEVVVCVTEFAPHVERSEGQL 235 Query: 311 VLRIGDD 317 V+ G Sbjct: 236 VVYAGGR 242 >UniRef50_C9NEK8 Cupin 4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NEK8_9ACTO Length = 403 Score = 99.6 bits (247), Expect = 2e-19, Method: Composition-based stats. 
Identities = 58/359 (16%), Positives = 120/359 (33%), Gaps = 44/359 (12%) Query: 35 ISPDELAGLAMESEVD-SRLVSHQDGKWQVSHGP-----------FESYDHLGETNWSLL 82 S +L + V+ + L G H L + SL+ Sbjct: 43 FSWRDLNEILSRGRVEPAELKLCTGGSSLPEHAYTVTRAGHRVVDLTRTFSLMRSGASLV 102 Query: 83 VQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRW 142 + +++ H A + + H D+ D F++Q G + W Sbjct: 103 IDSLDRIHPAVRAATDDVMRMVGETASCNLFVTFDDAQAFASHFDEVDTFVLQVLGTKSW 162 Query: 143 RVGEKLQMKQHCPHPDLLQVDPF----EAIIDEELEPGDILYIPPGFPHEGYALEN-AMN 197 +V + P P+ DP + + LEPGD++++P G+ H +++ Sbjct: 163 QVHGP---SEEHPLPEYGDSDPARCPEAVLFERTLEPGDVIHVPRGWWHTVRGGGESSLH 219 Query: 198 YSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLEL 257 + F + + A L DV A ++ + ++ Sbjct: 220 LTFAFTRRTGYDWLRWVAYRALDST---------DVRESLARAGTPEEQRAQAERLVTAF 270 Query: 258 INQPEH--FKQWFGEFISQSRHE----LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRV 311 +N+ + + +F +S L + + L+ + G R+ Sbjct: 271 VNEAKALTLRDFFDAERRRSGGRDTACLPWDVLKARPSAGTFVELATVQAPLMEIRGERL 330 Query: 312 LRIGDDVYANGEKIDSP--HRPALDALASNIAL-TAENFGDALEDPSFL-AMLAALVNS 366 + + A G++ P HR A + L + TAE + P+ + A+++AL+ + Sbjct: 331 V-----LTAAGQEFVLPAVHREACETLVRARRVGTAELAERSGTSPAAVSALVSALLRA 384 >UniRef50_B7FXD3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FXD3_PHATR Length = 481 Score = 99.6 bits (247), Expect = 2e-19, Method: Composition-based stats. 
Identities = 53/299 (17%), Positives = 96/299 (32%), Gaps = 51/299 (17%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--DGKWQVSH 65 + F + WQ+ F + P + ++ A E + H+ W V Sbjct: 30 SVETFFQTFWQRACGYFPNTFLD--SPPKAESMSRCAWNKERVEQNAYHELVRNGWSVLV 87 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALM---------RPFRELPD----------- 105 E+ + E + L Q++ L F D Sbjct: 88 QLLETSRNRPEHDADLSHQSIPLLFRDQTTLTLEEQVLYDDSLFAAFLDGCSVVTNHADR 147 Query: 106 ------WRIDDLMISF---------SVPG-GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ 149 +DL SF + PG V H D DVF+IQ G + W++ + Sbjct: 148 RSPWIAALCEDLQASFPHVYANTYLTPPGSQTVPAHADDRDVFVIQLVGCKAWKIYRNIP 207 Query: 150 MKQHCPHPDLLQVD--------PFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 + H + + + + D L PGD+LY+P G+ HE +A++ ++ V Sbjct: 208 VPYPYSHEQVGKGELEVPGQVLDGPVLTDRVLAPGDVLYMPRGYVHEAHAVDGGPSFHVT 267 Query: 202 FRAPNTRELISGFADYVLQRELGGNYYSDPDVPP---RAHPADVLPQEMDKLREMMLEL 257 ++G + L VP R + + L++ + + Sbjct: 268 VALATQDWTLAGLVTAATEASLTQQRSYRQAVPRCFGRRSFESIAVDDKQSLQKQLDDA 326 >UniRef50_B6KFH2 Putative uncharacterized protein n=4 Tax=Toxoplasma gondii RepID=B6KFH2_TOXGO Length = 508 Score = 99.6 bits (247), Expect = 2e-19, Method: Composition-based stats. 
Identities = 35/218 (16%), Positives = 73/218 (33%), Gaps = 22/218 (10%) Query: 58 DGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRE-LPDWRIDDLMISFS 116 + + SL++ + E ++ + + +S+ Sbjct: 173 ERHRTTTSASLSRATCRYLEGCSLVINQADRTLEILQSICQHLSKKYFSHVF---AVSYL 229 Query: 117 VPGGG--VGPHLDQYDVFIIQGTGRRRWRVGEKLQ----MKQHCPHPDLLQVDPFEAIID 170 P V H D DVF++Q G + W++ Q ++ + DP + +++ Sbjct: 230 TPPRTHAVKTHTDDQDVFLLQVWGSKAWKIWTPPQILPLTEEMLGKREAFPDDPGKPLLE 289 Query: 171 EELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTR----ELISGFADYVLQREL-- 223 L+ GDILYIP GFPH E +++ ++ P + ++ Sbjct: 290 FVLKEGDILYIPRGFPHAAVTTEEPSLHITLTV--PTAEFAYVTCLQRLVKSLVLTHTLP 347 Query: 224 -GGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQ 260 + + P +++ LR + Q Sbjct: 348 SDTERRCRSALLLKDVPGA--AEDLHALRAAVDACAEQ 383 >UniRef50_A8TXW2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TXW2_9PROT Length = 398 Score = 98.8 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 47/283 (16%), Positives = 87/283 (30%), Gaps = 33/283 (11%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAME------SEVDSRL---VSH 56 ++ F+ + +R + S D+L L ++ L V Sbjct: 11 AIDRQTFVRDYLDQRVYHEAGSVQDVGRLFSWDKLNDLLQRPKLWDGKSIEMALAGRVLD 70 Query: 57 QDGKWQVSHG----PFESYDH-----LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 + G P D L + + ++ ++ AA+ R F L Sbjct: 71 PREYCRPGLGRSGEPILRPDRQKVMALLQKGATFVLDYLDGIDPDIAAVTRCFERLFGTN 130 Query: 108 IDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE----KLQMKQHCPHPDLLQVD 163 M G H D DVF IQ TG + W + + + D Sbjct: 131 TSCNMYCSWQQVPGYASHFDTMDVFAIQITGEKTWNIYDGRFREATFTAGIRPSDFTVEQ 190 Query: 164 P----FEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYV 218 + + PGDILY+P G H+ A + +++ S G ++ A Sbjct: 191 HNRMRGKVAQRITMRPGDILYLPRGVYHDALATDSASLHLSFGVSPQVGFTVVGMLASEA 250 Query: 219 LQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQP 261 + E P + L + + + + +++ P Sbjct: 251 PKHEFLRKR------LPHFEDREELAGYLAAVGDHLKTMLSDP 287 >UniRef50_A9UW44 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UW44_MONBE Length = 710 Score = 97.7 bits 
(242), Expect = 6e-19, Method: Composition-based stats. Identities = 55/299 (18%), Positives = 110/299 (36%), Gaps = 45/299 (15%) Query: 5 LTLNWPDFLERHWQKRPVVLK--RGFNNFIDP---ISPDELAGLAMESEVDSRLVSHQ-- 57 L +F ER++++ PV ++ + +F++ +S + + D R V Sbjct: 339 LRFIRNEFRERYFEQFPVYIQAQGAYLDFLNYSAALSGQTFNYAGDKEKSDPRNVKFIKR 398 Query: 58 --DGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 D K + + ++L +VN+W A+L E + + Sbjct: 399 TFDQKQESGRKTEKDLARALREGFTLQFYSVNYWDPNIASLALELSE-HGILLPVNANLY 457 Query: 116 SVPGG---GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL---------QVD 163 PGG + PH D ++Q G +RWR+ + ++ + + Sbjct: 458 ITPGGTSVSLVPHTDYQCSLMVQLAGVKRWRLWKMPEIMLPVSANMIRGRDTDDLVASEE 517 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGYALE---NAMNYSVGFRA---------------- 204 E +D L+PGDILY+P G H E +M+ +VG A Sbjct: 518 LGEPYMDVLLQPGDILYVPRGVLHATSTPEGDHPSMHLTVGMEAMWDLGIGQVWHHFLGA 577 Query: 205 ---PNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQ 260 + + ++ G + ++ Y +P + P++ + REM ++++ Sbjct: 578 GAVAHHQHIVEGLYTALRRKTHEDARYR-ATMPASFYNRSGDPKQSAEWREMGRQMLHD 635 >UniRef50_B1FKZ3 Cupin 4 family protein n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FKZ3_9BURK Length = 304 Score = 96.9 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 61/162 (37%), Gaps = 3/162 (1%) Query: 76 ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQ 135 + ++++ + +L + + + + G H D +D+ +Q Sbjct: 95 DDGCTIIIDGCQDYFPSVLSLTAEIEHILKCQSWANLYISTQSATSFGCHFDDHDIISVQ 154 Query: 136 GTGRRRWRVGEKLQMKQHCPHPDLLQVDP-FEAIIDEELEPGDILYIPPGFPHEGYALEN 194 +G++RW + + + + P + E L G LY+P G+ H + Sbjct: 155 LSGKKRWHIYKPTYISPNRGDKSFYLDPPTGSPDLLENLPTGSSLYLPSGYWHNVETVSP 214 Query: 195 -AMNYSVGFRAPNTRELISGFADYV-LQRELGGNYYSDPDVP 234 +M+ + G P +++ A+ + L G DPD P Sbjct: 215 HSMHITFGLDFPRKLDIVHAIANQLGLNDIFRGAVNFDPDSP 256 >UniRef50_C1YJ55 Cupin superfamily protein n=1 Tax=Nocardiopsis dassonvillei subsp. 
dassonvillei DSM 43111 RepID=C1YJ55_NOCDA Length = 400 Score = 96.1 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 106/290 (36%), Gaps = 30/290 (10%) Query: 76 ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG--GVGPHLDQYDVFI 133 SL++ +V+ H P A L R + + + GG G H D +D I Sbjct: 100 REGASLVLDSVDRMHPPVGAAADDLMRLVRERAQANL--YLIWGGSRGFDTHWDDHDTVI 157 Query: 134 IQGTGRRRWRVGEKLQMKQHCPHP-DLLQVDPFEA------IIDEELEPGDILYIPPGFP 186 +Q G + W+V + D P +A + + L PG ++++P G+ Sbjct: 158 VQVEGTKHWQVHGPGSRPYPMKNDVDHAHTPPRDADGELHLVWEGVLRPGQVIHVPRGWW 217 Query: 187 HEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 H +M+ + GF E +AD +++R + D+P A P DV + Sbjct: 218 HTVTGTGGVSMHLTFGFTRATGVE----WADALVRRLFEEEVFRR-DLPRFADP-DVRRK 271 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 L M EL + + + E ++ + P P + A + ++ Sbjct: 272 HQRALAARMAELAEELD-LDGFLAERDARFPRRTSFSLPWPVEEGTPPAHARVEFVPILP 330 Query: 306 LG----GLRVLRIGDDVYANGEKIDSPH--RPALDALASNIALTAENFGD 349 G RV V G + P P L+ALA + LT + Sbjct: 331 PPLSHDGERVA-----VTVGGRRYRFPAVVGPVLEALAEHRELTVAELAE 375 >UniRef50_C9Z2L7 Putative uncharacterized protein n=2 Tax=Streptomyces scabiei 87.22 RepID=C9Z2L7_STRSW Length = 407 Score = 95.4 bits (236), Expect = 3e-18, Method: Composition-based stats. 
Identities = 45/262 (17%), Positives = 83/262 (31%), Gaps = 18/262 (6%) Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 S L + ++++ N + R + R+ + + G H D + Sbjct: 101 SLGRLLQDGATVIMDQANVFDPTMEVACRALQWWSRERVQVNVYLTTNDAAGFPLHWDDH 160 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHE- 188 DV I+Q G ++W V + D E I + GD+++IP G H+ Sbjct: 161 DVVIVQLAGEKKWEVRTASRNVPMYRDSDPNNTASDEIIWSGVMRAGDVMHIPRGHWHQA 220 Query: 189 ---GYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 G N+++ + G ++ D+ + E+ + D D A + Sbjct: 221 TRTGSGSGNSLHVTFGITKRTGASWLAWLGDWCREHEI---FRQDLDRWHGAGSEALTTA 277 Query: 246 EMDKLRE----MMLELINQPEHFKQ---WFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 + E L Q + + P + E D + Sbjct: 278 AARLVAERSPVDFLAAYEQETTLSRHVPFLDVLGPLDAVVCTTHFPPRIQEGGEAVDVVA 337 Query: 299 QGEVLVR----LGGLRVLRIGD 316 G+ L L LR+L G Sbjct: 338 SGKKLTLAVKALPALRLLLSGR 359 >UniRef50_D1WSH6 Cupin family protein n=2 Tax=Streptomyces RepID=D1WSH6_9ACTO Length = 403 Score = 95.4 bits (236), Expect = 3e-18, Method: Composition-based stats. 
Identities = 75/402 (18%), Positives = 126/402 (31%), Gaps = 52/402 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDS--------- 51 + + + +W F +R W + PV+ K P E+ A+ Sbjct: 3 LTVEKSFDWDTFADRFWDRAPVLYKGLD---TAPFDEQEVFRAAVSGSRPPHPLAVPGNL 59 Query: 52 -RLVSHQDGKW------QVSHGPFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRP 99 LV + + G + Y D L ++L+V + + P + Sbjct: 60 QFLVRRRQQTRPHDYLPEAGDGSLDGYERRMADRLEGRRYALVVHRFHSFSHPLWDRAQR 119 Query: 100 FRE-------LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ 152 F P M S VG H D++ F+ G +R R Sbjct: 120 FYAGLWERVGQPTHTAGSTMFHGSYEHSPVGVHQDRFATFMFCVRGTKRMRFWADRPWSD 179 Query: 153 HCPHPDLLQVDPF-EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPN----- 206 P +L P+ + E+EPGD+LY P + H G + + SV P Sbjct: 180 --PVHTVLDYQPYLASSFVAEVEPGDLLYWPARYYHVGESASDTPATSVNVGIPRREHRP 237 Query: 207 ---TRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEH 263 ++L G R PD P A LP+ + + E + + Sbjct: 238 YYEIKDLFRG------TRPQSSAPLFTPDAGPDGRLAGELPRALADAVDAFAEHLA-EDR 290 Query: 264 FKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA--LKQGEVLVRLGGLRVLRIGDDVYAN 321 F + R P EPP P + D +++ L+ G R + + Sbjct: 291 FTDRATALALRVRTAGGFWPTEPPAAPRPLDDDTPVRRCAPLLPAPGEGPPRWAANGHVT 350 Query: 322 GEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAAL 363 I + L L ++ A+ +A +L L Sbjct: 351 SGAIGADALAVLRRLDADEAVRVGELPEARR-ADVRRLLQEL 391 >UniRef50_C4DQG4 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DQG4_9ACTO Length = 423 Score = 91.5 bits (226), Expect = 4e-17, Method: Composition-based stats. 
Identities = 85/410 (20%), Positives = 130/410 (31%), Gaps = 58/410 (14%) Query: 6 TLNWPDFLERHWQKRPVVLKR-------------GFNNFIDPISPD---ELAGLAMESEV 49 TL+W F E++W + PV+ +R P + D ++ L E Sbjct: 11 TLDWDVFAEKYWDRAPVLYRRVPRAPFLAEEALSAAITASAPGAADVIPDIVRLTCEGR- 69 Query: 50 DSRLVSHQDGKWQVSHGPFESYDH-----LGETNWSLLVQAVNHWHEPTAALMRPFRELP 104 RL++ + S F+SY L +L++ + + R F P Sbjct: 70 --RLLTARGRVPCASDTDFDSYAARVTTALDGERHALVIAGFHPHNPDMWDRQRAF-FHP 126 Query: 105 DW-RIDDLMISFSV-------PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH 156 W R+ M S VG H D++ F+ +GR+R R+ Sbjct: 127 LWERVGLPMTSAITTLFHGNYEHSPVGVHKDRFGTFMYVLSGRKRMRMWPHRPWSHDAS- 185 Query: 157 PDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFAD 216 L + I E PG ++Y P H G + + SV P + Sbjct: 186 TILDYARYLDTSIAAEAGPGQLMYWPASHFHVGETVGSEPATSVNVGVPREGRRVEFEMT 245 Query: 217 YVLQRELGGNYYSDPD--VPPRAHPADVLP-----QEMDKLREMMLELINQPEHFKQ--W 267 +L +DPD + R P DV P L + + ++Q + W Sbjct: 246 DLLTDLPASAL-TDPDAYLETRMPPIDVDPFVDPADAATGLPLALRQGMDQAVSLLERSW 304 Query: 268 FGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLR---IGDDVYANGEK 324 GE A P L L R G VL +G Sbjct: 305 EGERRLAVTLNRYTAGGFRPVPDPVPRPELTDATRLRRAPGATVLWARADDATALCSGNG 364 Query: 325 IDSPHRP-------ALDAL-ASNIALTAENFGDALEDPSF---LAMLAAL 363 + RP LD L A+ T DA+ +P +LA L Sbjct: 365 HTASARPSPKAITALLDRLDANATPATVAELLDAVGEPEREHCRELLAEL 414 >UniRef50_C6XMP3 Cupin 4 family protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XMP3_HIRBI Length = 435 Score = 88.8 bits (219), Expect = 3e-16, Method: Composition-based stats. 
Identities = 67/400 (16%), Positives = 136/400 (34%), Gaps = 52/400 (13%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISPDELA-GLAMESEV-DSRL---VSHQDGKWQVS 64 +F ++ K+ + N F S ++L+ LA E+ D R S G+ S Sbjct: 39 QEFANSYFAKKSFNVGGTSNKFEHIFSWEKLSHALARGEEIQDPRFNLMASFAGGEKDGS 98 Query: 65 HGP-----FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR-IDDLMISFSVP 118 P + L ++ + ++ A + R ++ + S Sbjct: 99 RKPMFQVYIKQVGELLNAGATICITNIHMADPALARWAQAIRSQLNFTGTVGVNCYISPD 158 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEA----------- 167 G G+ H D+ +Q G++RW + + + + + E Sbjct: 159 GAGLPMHYDKRIATTLQIAGKKRW-IYSTTPAQAWPDNNAVFKDGRVEPANIDTGTPPDG 217 Query: 168 --IIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISG-FADYVLQRELG 224 + EL PGD+L +P G H + ++ ++ F N + ++ F +++ E Sbjct: 218 LEFEEVELNPGDLLCLPAGAWHAAKGIGFSLALNLYFAPRNFSDQLAPLFLEHLSHDE-- 275 Query: 225 GNYYSDPDVPPRAHPADVLP-------QEMDKLREMMLELINQPEHFKQWFGEFISQSRH 277 N+ P V + +D+ + I+Q + + E ++Q+ + Sbjct: 276 -NWRGGPPVTLDNITGETPENIKSYLHDRLDEFHKKAQSFIDQTDTINTAWLESLTQNPY 334 Query: 278 ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYA---NG-EKIDSPHRPAL 333 P+ P +P + + V LR ++ D + NG P L Sbjct: 335 TGWQPDPDMPLRPLSPENRFR-----VVGKSLRFIQTADQLAVPCDNGILNFPKNATPIL 389 Query: 334 DALASN-------IALTAENFGDALEDPSFLAMLAALVNS 366 +AS+ ++ +A P A L L + Sbjct: 390 KKMASHSGSFSVPDVISWNTAPNAPTIPEIGAHLQTLYKN 429 >UniRef50_D1H9M4 Whole genome shotgun sequence of line PN40024, scaffold_52.assembly12x (Fragment) n=3 Tax=Vitis vinifera RepID=D1H9M4_VITVI Length = 770 Score = 86.5 bits (213), Expect = 1e-15, Method: Composition-based stats. 
Identities = 54/324 (16%), Positives = 109/324 (33%), Gaps = 74/324 (22%) Query: 8 NWPDFLERHWQKRPVVLK---RGFNNFID------------------------------P 34 ++ +F+ HW+ P++++ +G N D P Sbjct: 301 SFENFILNHWEVSPLLVRSLSKGLNEQDDVFSSFIQYLNLKKTVSSFVLPLLQGLVSCLP 360 Query: 35 ISPDELAGL----AMESEVDSRLVSHQDGKWQVSHGPFESY-------------DHLGET 77 I DEL L + +E+ ++ QD + + G + + Sbjct: 361 IDSDELNILNFLKTVRNELGCLIIYGQDIRVLRTMGHLKEEAPHFLYIDDILKCEDAYNK 420 Query: 78 NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP-GGGVGPHLDQYDVFIIQG 136 +++ ++ + E AA+ L + + + P G+ H D + VF+ Q Sbjct: 421 GYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCVFVCQL 480 Query: 137 TGRRRWRV-GEKLQMKQHC--PHPDLLQVDPFEAII---DEELEPGDILYIPPGFPHEGY 190 G ++W + + + P L ++ L GDILYIP GFPHE Sbjct: 481 FGTKQWTIVSQPIVSLPRLYEPLDSLHSSKIGNSMAGRTQFLLREGDILYIPRGFPHEAC 540 Query: 191 ALENA------------MNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 + + + ++ P E + A + + +Y+ D Sbjct: 541 TVAESGGPDETTGFSLHLTLAIEVEPPFEWEGFAHVALHCWNQSSKSIHYTSVDPL---- 596 Query: 239 PADVLPQEMDKLREMMLELINQPE 262 +++L L + + LI + Sbjct: 597 -SEILSVMSVNLLHIAIRLIGDSD 619 >UniRef50_B8BVR1 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8BVR1_THAPS Length = 488 Score = 85.7 bits (211), Expect = 2e-15, Method: Composition-based stats. 
Identities = 47/298 (15%), Positives = 91/298 (30%), Gaps = 53/298 (17%) Query: 9 WPDFLERHWQKRPVVLKR-----------------GFNNFIDPI------SPDELAGLAM 45 F + WQ +P++ + GFN D + S + + Sbjct: 40 AKSFFKHIWQHQPMIFRSTHKTCQADGILRQTMTMGFNGVADMLHNCRKSSSPQSDDTST 99 Query: 46 ESEVDSRLVSHQDGKWQVSHGPFESYDHLGE----TNWSLLVQAVNHWHEPTAALMRPF- 100 S + + Q+G P+ Y S++V + A L Sbjct: 100 NSAATAPPLFFQNG--SPITDPYSMYSSNPHAAYLDGCSIVVNHADLQSASIAKLCNDLQ 157 Query: 101 RELPDWRIDDLMISFSVPGGGVG--PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD 158 P + ++ P G H D DVF+IQ G ++W V +K+ ++ + Sbjct: 158 SSFPHVYAN----AYLTPPNGFAVNAHADDRDVFVIQVLGTKKWNVYKKVPVEYPFENEQ 213 Query: 159 LLQVDPFEAIIDEE-----------LEPGDILYIPPGFPHEGYAL------ENAMNYSVG 201 + + E L PGD++Y+P GF HE ++ ++ + Sbjct: 214 VGKSGREVPPSVFEGGLCFGNNVLDLGPGDVMYMPRGFVHEATTEILDVEDGHSPSFHIT 273 Query: 202 FRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN 259 +S ++ L +P L+ + + + Sbjct: 274 IAIATHDWCLSVVLADCFRKTLSEVVDYRKALPIGPSKEYEPEDSSSFLKRQLNQAMK 331 >UniRef50_Q15JF4 VldL n=1 Tax=Streptomyces hygroscopicus subsp. limoneus RepID=Q15JF4_STRHY Length = 282 Score = 85.7 bits (211), Expect = 2e-15, Method: Composition-based stats. 
Identities = 42/205 (20%), Positives = 79/205 (38%), Gaps = 27/205 (13%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 Q + ++ F +W+KRP+ + G + + D+ A+ DG Sbjct: 9 QPSFDFDLFFSAYWRKRPLYVPGGAKELLGRVWTDDDFDAALAGA-------RADGT--- 58 Query: 64 SHGPFESYDHLGETNWSLLVQAVN-HWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 E + GE + V + H+ L F P D + G+ Sbjct: 59 -----EVKERPGEVTFIEQVSRFDTDLHDRADRLAGVFGA-PQAWFDAIRTCARS---GI 109 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQM------KQHCPHPDLLQVD-PFEAIIDEELEP 175 G H D D F++Q G + W + + ++ HP + + P + + + P Sbjct: 110 GAHFDHSDNFVLQQNGTKEWTLASSQHLDRQDVVRRMMNHPGVGAHELPVDDSVRFTVGP 169 Query: 176 GDILYIPPGFPHEGYALENAMNYSV 200 GD+LYIP + H G + ++++ S+ Sbjct: 170 GDLLYIPLLWLHSGVSRGDSLSVSL 194 >UniRef50_Q6MH74 Putative uncharacterized protein yxbC n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MH74_BDEBA Length = 308 Score = 85.0 bits (209), Expect = 5e-15, Method: Composition-based stats. Identities = 49/280 (17%), Positives = 97/280 (34%), Gaps = 34/280 (12%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGL-----AMESEVDSRLVSHQDGKWQ 62 P+F HW P+ + D + +++ L A + +V + L D Sbjct: 14 TLPEFFNSHWPVEPLFIPATPGKLQDIFALEQMQDLKNLISARQRKVRACLPDFDDEYSS 73 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRE----LPDWRIDDLM----IS 114 + P ++ N +L+ ++ A ++ R + +DL I+ Sbjct: 74 IHLEPGDALKAY-RNNMTLVFDSMQSQDSTIADMLGNVRADLGLVTGGAENDLCKARSIA 132 Query: 115 FSVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGEK----------LQMKQHCPHPDLLQ-- 161 ++ P G G H D FIIQ G + WR+ + P Q Sbjct: 133 YATPAGCGTRLHFDANANFIIQIKGTKTWRLAPNESVEFPTERFTTGSEEMPAALEKQCH 192 Query: 162 ----VDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADY 217 E + ++PG +L++P G+ HE E +++ + F P ++ + Sbjct: 193 AHLIDALDEDSMKVVMKPGCVLFVPRGYWHETTTEEESLSLNFTFSQPTWADVFTKSLQE 252 Query: 218 VLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLEL 257 VL R +D + + + + ++ L Sbjct: 253 VLLRSPEWRELAD-GLEGTDQ--ERKEAAIARFEFLLKSL 289 >UniRef50_C7Q411 Cupin 4 family protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q411_CATAD Length = 395 Score = 
82.3 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 50/158 (31%), Gaps = 5/158 (3%) Query: 74 LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFI 133 L E +L++ +N + R + G H D +DV + Sbjct: 97 LNEGG-TLILDTINQFDPTLEVACRALGWWTGELVSVNAYLAVGDTAGFSTHWDDHDVLV 155 Query: 134 IQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEG--YA 191 +Q G++ W V + + P E + + GD+++IP GF H Sbjct: 156 VQVAGQKSWEVRPASRPVPMYRDAEQNLEAPEELLWSGTMNTGDVMHIPRGFWHAATRVG 215 Query: 192 LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS 229 ++ + F + + ++ + Sbjct: 216 SGEGISLHLTFGITRRTGV--TWVQHLADAARDVELFR 251 >UniRef50_B9IN14 Predicted protein n=2 Tax=rosids RepID=B9IN14_POPTR Length = 764 Score = 81.5 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 56/293 (19%), Positives = 95/293 (32%), Gaps = 79/293 (26%) Query: 7 LNWPDFLERHWQKRPVVLKR-------------------------------GFNNFID-- 33 L + +F+ HW+ P +++R +FI Sbjct: 280 LGFENFMLHHWESSPSLVRRLSGSLTEENDILSSFAESLNCKEPCPTFVASILQSFISCV 339 Query: 34 PISPDELAGLAMESEVDS----RLVSHQDGKWQVSHGP---------------------F 68 PI+ DEL ++ EV S ++ QD + + P F Sbjct: 340 PIASDELNIISFLEEVRSELGCPIIYDQDIRVLRTEQPSKKEVHFFQKKVDPCCFKKLAF 399 Query: 69 ESYDHLG-----ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG-GGV 122 + D + + +++ ++ V AA+ L I + P G+ Sbjct: 400 NNVDIMKCEEAFKEGYTIALRGVEFRFASIAAVADALASLFGQPSVGANIYLTPPNSQGL 459 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGE------KLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 H D + VF+ Q G ++W + L + + L G Sbjct: 460 ARHCDDHCVFVCQLFGTKQWTIYPRPNLQLPRLYDPFDREHCLGEQNSLAECRKFLLREG 519 Query: 177 DILYIPPGFPHEGYALEN--------AMNYSVGFRAPNTRELISGFADYVLQR 221 DILYIP GFPHE ++ +++ + G E GFA L R Sbjct: 520 DILYIPRGFPHEACTHDDGSSDLARFSLHVTFGVEVEPPFE-WEGFAHVALHR 571 >UniRef50_Q8S3P4 OSJNBa0011F23.16 protein n=4 Tax=Oryza sativa RepID=Q8S3P4_ORYSJ Length = 774 Score = 80.7 bits (198), Expect = 9e-14, Method: Composition-based stats. 
Identities = 57/340 (16%), Positives = 109/340 (32%), Gaps = 95/340 (27%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN------FIDP-------------------------IS 36 ++ +FL +W+K ++ R N F I+ Sbjct: 287 DYENFLLNYWEKSTYLVTRKQKNLHVDSVFTSLLNEFDLKTPDTIIQSLVNGIVSCPAIA 346 Query: 37 PDELA------------GLAMESEVDSRLVSHQDG-------------------KWQVSH 65 DEL G ++ D R+V D +Q + Sbjct: 347 SDELDISSFLREVQGSLGATVKYRQDIRVVRINDQCDQTSIGYAMEEHFFDDGMTFQDAD 406 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP-GGGVGP 124 E + +S+ ++ + E AA+ +L I FS P G+ Sbjct: 407 AFVEKCKDAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLAR 466 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIID---------EELEP 175 H D + V + Q G ++W + ++ +PFE + D E L Sbjct: 467 HYDDHCVLVWQLLGCKKWMIWPDTKLLLP------RLYEPFEPLDDLVDDCGGRMEILLE 520 Query: 176 GDILYIPPGFPHEG------------YALENAMNYSVGFRAPNTRELISGFADYVLQREL 223 GDI+Y+P GF HE ++ +++ ++ E GF ++ Sbjct: 521 GDIMYVPRGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAIEVEPPFE-WEGFT-HIALHCW 578 Query: 224 GGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEH 263 ++S P V + + L + + L+++ + Sbjct: 579 TEKHWSSPFVKSQE---EARTSLFALLLHVAIRLLSKNDA 615 >UniRef50_C7J1Y3 Os04g0659150 protein n=4 Tax=Poaceae RepID=C7J1Y3_ORYSJ Length = 430 Score = 78.8 bits (193), Expect = 3e-13, Method: Composition-based stats. 
Identities = 49/257 (19%), Positives = 83/257 (32%), Gaps = 78/257 (30%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN------FIDP-------------------------IS 36 ++ +FL +W+K ++ R N F I+ Sbjct: 151 DYENFLLNYWEKSTYLVTRKQKNLHVDSVFTSLLNEFDLKTPDTIIQSLVNGIVSCPAIA 210 Query: 37 PDELA------------GLAMESEVDSRLVSHQDG-------------------KWQVSH 65 DEL G ++ D R+V D +Q + Sbjct: 211 SDELDISSFLREVQGSLGATVKYRQDIRVVRINDQCDQTSIGYAMEEHFFDDGMTFQDAD 270 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP-GGGVGP 124 E + +S+ ++ + E AA+ +L I FS P G+ Sbjct: 271 AFVEKCKDAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLAR 330 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIID---------EELEP 175 H D + V + Q G ++W + ++ +PFE + D E L Sbjct: 331 HYDDHCVLVWQLLGCKKWMIWPDTKLLLP------RLYEPFEPLDDLVDDCGGRMEILLE 384 Query: 176 GDILYIPPGFPHEGYAL 192 GDI+Y+P GF HE + Sbjct: 385 GDIMYVPRGFVHEAHTD 401 >UniRef50_A9TBQ2 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TBQ2_PHYPA Length = 992 Score = 77.2 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 45/118 (38%), Gaps = 2/118 (1%) Query: 77 TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG-GGVGPHLDQYDVFIIQ 135 + ++++++ + AL + + PG G+ H D + VF+ Q Sbjct: 500 SGYTVVLRGLQFRFPEICALSNGLAAELGQVTVGANLYLTPPGSQGLRVHFDDHCVFVCQ 559 Query: 136 GTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIID-EELEPGDILYIPPGFPHEGYAL 192 GR+ W V L+ L + + +L+ D LYIP GF HE Sbjct: 560 LRGRKGWDVYPPLEQLPRLYSFKTLSTEVTKDYATHFDLQEWDTLYIPRGFLHEARTE 617 >UniRef50_UPI0001BCFC5A Cupin 4 family protein n=2 Tax=Mannheimia haemolytica serotype A2 RepID=UPI0001BCFC5A Length = 257 Score = 77.2 bits (189), Expect = 9e-13, Method: Composition-based stats. 
Identities = 27/135 (20%), Positives = 56/135 (41%), Gaps = 17/135 (12%) Query: 145 GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFR 203 M ++ P+ D + +D LE GD+LY+P G+ H+ + E ++ +VG Sbjct: 43 HRSKDMPEYAPNLD-------DVYMDIVLEAGDVLYLPRGWWHDPIPVGEETVHLAVGIF 95 Query: 204 APNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEH 263 T ++ + ++++E+ + + + +L E E I E+ Sbjct: 96 PAYTHNYLTWVSQNMVEKEIA---------RASLSHYESDKELIAQLAEQTAEYIKDKEN 146 Query: 264 FKQWFGEFISQSRHE 278 + ++ F Q R E Sbjct: 147 YHKFIENFYDQKRVE 161 >UniRef50_C4DE44 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DE44_9ACTO Length = 420 Score = 76.5 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 44/231 (19%), Positives = 74/231 (32%), Gaps = 35/231 (15%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLA-------MESEVDSRLVSH 56 + +W F +W +RPV+ + P E+ V S + Sbjct: 10 ETHFDWEHFAGNYWNQRPVLYR-AVAE--PPFEVGEVFAATANACAPVSNGAVPSGVQLT 66 Query: 57 QDGKWQVSHGP---------FESY-----DHLGETNWSLLVQAVN-----HWHEP--TAA 95 + + Q G F+ Y L ++L + A + WH Sbjct: 67 VEREQQRVPGDLLPAASDRDFDGYQDRVTAALDGRRYALTINAFHSHEWRLWHRERRFYR 126 Query: 96 LMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP 155 + +P M + VG H D++ F+ G +R R + + Sbjct: 127 RLWDHVGIPSTSAITTMFHGTYEHSPVGVHKDRFATFMFGLKGDKRMRFWSRKPWTEDVS 186 Query: 156 HP-DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP 205 D + P + E+EPGD+LY P + H G + SV P Sbjct: 187 SVVDYAEYLP--SSFVVEVEPGDLLYWPSTYFHVGESGGAPAT-SVNVGVP 234 >UniRef50_Q2RW70 Cupin region n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW70_RHORT Length = 301 Score = 75.7 bits (185), Expect = 3e-12, Method: Composition-based stats. 
Identities = 47/274 (17%), Positives = 87/274 (31%), Gaps = 48/274 (17%) Query: 24 LKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHLGE------- 76 + F D +S L L V F D G Sbjct: 29 IPAAFPESADLMSWGWLEDFLNREYTRPELFRFFMNGRPVEPSRFGLIDGKGRLDRKALR 88 Query: 77 ----TNWSLLVQAVN----HWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG----- 123 + + ++ ++ E L + + ID + G G Sbjct: 89 PLLTQGITTIFNGLDSSSGYFWEEAVKLEQALGAVVT--IDAI--------GSFGTVCGL 138 Query: 124 -PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D D+ I+Q GR+ W++ L P P + ++ GD+L++P Sbjct: 139 PPHYDDRDLIIVQVAGRKHWKI---LGTPVEGPWRKRTMSVPDTVTDEFVMQGGDMLFVP 195 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD- 241 G H+ LE +++ P +L+ +DP++ R + D Sbjct: 196 AGLYHQCVPLEPSLHLGALITRPCGADLLK--------MVQPRWETTDPELAARLYVGDG 247 Query: 242 --VLPQEMDKLREMMLELINQPEHFK---QWFGE 270 L Q+ +L+E ++ L+ + W + Sbjct: 248 ETDLQQQDARLKEALIRLVQDMDVAALTRAWLAQ 281 >UniRef50_Q6MPD0 Putative RNA methylase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MPD0_BDEBA Length = 419 Score = 75.3 bits (184), Expect = 3e-12, Method: Composition-based stats. 
Identities = 38/251 (15%), Positives = 78/251 (31%), Gaps = 34/251 (13%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSR-------LVSHQDGKW 61 W +F + HW+K+P+V + + ++ E+ L + R L DG Sbjct: 20 WQNFAKNHWEKKPLVARNVKSGLLEMTDA-EIFELLVAYSDRCREMNDPEGLKFFIDGAK 78 Query: 62 QVSHGPFESYDHLGET--------------NWSLLVQAV-------NHWHEPTAALMRPF 100 E + ++ L+ + H + + Sbjct: 79 ADPEEVLELLPEKSDKSLLGYHKRMNAQFPDYCLVCDELLQVNLKKQHLLQDFTDDLFRH 138 Query: 101 RELPDWRIDDLMISFSV-PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPD 158 P+ R ++ + G H+D VF G +R+R+ +H Sbjct: 139 VGFPN-RFSEIGLYLGNYRKTPFGVHVDSCGVFSFPVAGVKRFRLWPAAYGDEHPELDRT 197 Query: 159 LLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRA-PNTRELISGFAD 216 + E+ PGD+ Y P H + + + +S+G ++ Sbjct: 198 FNYEKHKKHSQLVEVGPGDMTYWPSSEWHIAESDGSFSATWSLGVWVDQTHGDMFGSALK 257 Query: 217 YVLQRELGGNY 227 ++ +LG Sbjct: 258 DLVDTKLGSAR 268 >UniRef50_A9V3F7 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V3F7_MONBE Length = 4167 Score = 74.6 bits (182), Expect = 5e-12, Method: Composition-based stats. 
Identities = 45/264 (17%), Positives = 85/264 (32%), Gaps = 61/264 (23%) Query: 7 LNWPDFLERHWQKR-----------PVVLKRGFNN------FIDPI---SPDELAGLAME 46 L+ +F ++ + P+++K G + + PI S D L L E Sbjct: 51 LSVEEFTRDYYHNQHVVSAATNSSVPLLVKPGLTHAALAEALLQPILNRSEDALLRLFNE 110 Query: 47 -----SEVDSRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFR 101 +S +VS +D ++ HL +SL V E L Sbjct: 111 GVFVLRNANSTIVSSED----LTTASLRQ--HLLYDGYSL-VYFYERHREE--DLYERKE 161 Query: 102 ELPD-----WRIDDLMISF--SVPGGGVGPHLDQYDVFIIQGTGRRRWRV---------- 144 + D ++ + + PH D+ D+F Q G + W + Sbjct: 162 TIADVIASLALLEPTFHYYLSGPNAKALPPHTDRNDIFSFQLAGTKHWTICTNPPPDTSW 221 Query: 145 ----GEKLQMKQHCPHPDLLQVDPFEAI----IDEELEPGDILYIPPGFPHEGYALEN-A 195 + + + P E + PGD +Y+P G H + + Sbjct: 222 TPAEWAQQEENEASRTEGCRNPSPIEVTKMQCEHHVVHPGDWIYLPKGTVHFAQTNDEMS 281 Query: 196 MNYSVGFRAP-NTRELISGFADYV 218 ++ +VGFR + ++ + Sbjct: 282 LHATVGFRPSFSWLNRLTALCEQA 305 >UniRef50_B5Y4L3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y4L3_PHATR Length = 549 Score = 73.4 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 58/171 (33%), Gaps = 32/171 (18%) Query: 124 PHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEA---------- 167 H D + F +Q +G +RWR+ H P ++ A Sbjct: 191 WHTDFQENFTVQLSGVKRWRLQKGSVTHPLRGCTPHYRSPSSVEPQLKAARMIDKNFQFA 250 Query: 168 -----------IIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFAD 216 + + L PGD+LY P G H+ E ++ ++ A N L Sbjct: 251 HPKKDVNAVGEVEEVVLRPGDVLYFPAGMWHQVVTEEPGVSLNISLMATNYAALTCQALQ 310 Query: 217 YVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQW 267 +VL ++ ++ ++ L+ ++ L P+ +++ Sbjct: 311 HVLLQK--SSWRECVTRSVSTSNGSQSISVIEHLKSLLQAL---PDIVREF 356 >UniRef50_UPI000180C0C1 PREDICTED: similar to reserved n=1 Tax=Ciona intestinalis RepID=UPI000180C0C1 Length = 468 Score = 73.4 bits (179), Expect = 1e-11, Method: Composition-based stats. 
Identities = 28/115 (24%), Positives = 46/115 (40%), Gaps = 22/115 (19%) Query: 125 HLDQYDV-FIIQGTGRRRWRVGEKLQMKQHCP-----------------HPDLLQVDPFE 166 H D Y ++Q GR+RW + + P HPDL + + F Sbjct: 151 HYDTYGYNLVLQVQGRKRWMLFPPSDSQHLHPTRIPYEESSVFSKVDLQHPDLEEHESFT 210 Query: 167 AIIDEE--LEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVL 219 + LEPGD+LY+P + H LE ++ SV P+ + ++ + Sbjct: 211 SCHPHVITLEPGDMLYVPQQWWHYVENLETSI--SVNAWFPSDEDDLTRVKEAAS 263 >UniRef50_A9V2P6 Predicted protein n=3 Tax=Monosiga brevicollis RepID=A9V2P6_MONBE Length = 1934 Score = 73.0 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 51/171 (29%), Gaps = 29/171 (16%) Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV-DPFEAI-------------- 168 PH D YDV ++ G++ W V L Q+ FE Sbjct: 171 PHTDPYDVLVLHLHGQKHWTVCHPLDSSPEWADASQAQLAQLFEIQRKSVDGCTNFDIAE 230 Query: 169 ------IDEELEPGDILYIPPGFPHEGYALENAMNYSVGF----RAPNTRELISGFADYV 218 L PGD+LY+P H N + + + +L + Sbjct: 231 TDKMRCEHFTLSPGDVLYLPKSTIHFATTSPNTTTAHITLSLERQGQSWIDLACTVIEST 290 Query: 219 ---LQRELGGNYYSDPDVPPRAHPADVLPQEMDKL-REMMLELINQPEHFK 265 L+ G + + P P + + + ++ + L+ Q Sbjct: 291 LASLETSPSGLRWLELATFPTDEPCQMAQKRLLGFEKDSLRTLVAQDPQLS 341 >UniRef50_A9VDC2 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VDC2_MONBE Length = 2266 Score = 73.0 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 36/107 (33%), Gaps = 21/107 (19%) Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ---------MKQHCPHPDLLQVDPFEA 167 + PH D YDVF++Q G++ W + K + Sbjct: 197 PNAQALAPHTDPYDVFVVQVYGQKEWTLCTPQPPGGQNLSDAHKAQWQEIAKHNIQGCTN 256 Query: 168 IIDE----------ELEPGDILYIPPGFPHEGY--ALENAMNYSVGF 202 + L PGD+LYIP G H ++ + + +V Sbjct: 257 YQEWQLAKMDCQHITLLPGDLLYIPKGVIHYATTGSVTGSTHLTVSI 303 >UniRef50_A9V7G6 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V7G6_MONBE Length = 806 Score = 72.2 bits (176), Expect = 3e-11, Method: Composition-based stats. 
Identities = 24/80 (30%), Positives = 37/80 (46%), Gaps = 11/80 (13%) Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDV ++Q GR+RWR+ + P DL + +EPGD+LY+P Sbjct: 192 PHTDPYDVIVVQLAGRKRWRL---CTGCLNWPESDLTRF----HCQSLWMEPGDVLYLPK 244 Query: 184 GFPHEGYA----LENAMNYS 199 H A +++ + Sbjct: 245 AVIHVADAPHADETTSIHLT 264 >UniRef50_A9V7T0 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V7T0_MONBE Length = 1016 Score = 71.9 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 50/280 (17%), Positives = 94/280 (33%), Gaps = 45/280 (16%) Query: 18 QKRPVVLKR-GFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFES---YDH 73 +++P +++ F + + L+ L + EV H + Y Sbjct: 752 ERQPFLVRDCAFGVATASWTAEHLSSLVGDREVSV----HVGEDCNMDFTTRNFRPMYLR 807 Query: 74 LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP----HLDQY 129 N+ V + AA +P W + + + S ++ G H D Sbjct: 808 SMGKNFRKDVSNIEQTFPEVAAEFALPSCVPSWIMGEKLFSTALRVSSPGVQLWTHYDVM 867 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQ-----------HCPHPDLLQVDPFEAII----DEELE 174 D + GR+R + Q PDL F + + LE Sbjct: 868 DNVLCNVRGRKRVVLFPPEQAGNLYLEGSSSRVVDIERPDLEAFPRFATAMAHALELILE 927 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 PGD+L+IP + H ALE ++ +V F + + +A + Y + DV Sbjct: 928 PGDMLHIPALWCHNVRALEPCISVNV-FW--KHLDDAALYA--------SKDLYGNKDVK 976 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQ 274 P A + ++L+ + P ++G + + Sbjct: 977 PAAEALRLAHDAAERLQTL-------PADHAAFYGAYATA 1009 >UniRef50_C5AHL8 JmjC domain protein n=2 Tax=Burkholderia RepID=C5AHL8_BURGB Length = 296 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. 
Identities = 35/179 (19%), Positives = 58/179 (32%), Gaps = 29/179 (16%) Query: 91 EPTAALMRPFRELPDWRIDDLMISF-SVPGGGVGP-------HLDQYDVFIIQGTGRRRW 142 E L + P W D L PG +GP H D+++ +Q GR+RW Sbjct: 120 EDITLLHERYGF-PRWLPDGLRRRLILRPGFWLGPEGISSPLHFDRHENLNVQVYGRKRW 178 Query: 143 RVGEKLQMKQHCPHPDLLQVDPFEAI------------------IDEELEPGDILYIPPG 184 + Q Q F + D LE G++LY+PPG Sbjct: 179 VLFGPGQSHQVYYRQRRDLPVIFSPVDMTRPDLDAFPRLGDAQRHDFVLEAGEVLYLPPG 238 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 + H +L +++N + + +P + + D P + Sbjct: 239 WWHFVTSLSDSINVNYWWWSPRALRTWARV--ELASLAQALARRFDRGTDATGKPTSMP 295 >UniRef50_A9VDD4 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VDD4_MONBE Length = 2295 Score = 71.5 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 52/277 (18%), Positives = 84/277 (30%), Gaps = 56/277 (20%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF 68 W DF + V + + + +P + L + LV G V Sbjct: 100 WADFAPSYQTSHRVYPAKADSRALGSATPLPVFELLQRPGTLTALVQDDAGA-LVYRSDT 158 Query: 69 ESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRE-------LPDWRIDDLMISFS---- 116 + + N W L Q + ++ ++ P R L W DD + + Sbjct: 159 QVAPIPPDLNTWDKLFQHIQTHQPTSSLVIFPERFSHTFALDLDPWLYDDYYQALTDAFN 218 Query: 117 -----------VPGGGVGPHLDQYDVFIIQGTGRRRW-----RVGEKLQMKQHCPHPDLL 160 G + PH D DVF+ Q +G + W R+ + + P P Sbjct: 219 VPVTQHVYITGPAGRALNPHTDGGDVFVRQISGSKHWQLCVPRLQDPSVCQPSAPSPTAH 278 Query: 161 QV-------------DPFE----------AIIDE---ELEPGDILYIPPGFPHEGYALEN 194 D FE A +D L PGD LY+P G H + Sbjct: 279 PCTDGARARYAEWKRDQFEGCTPYTMGQLADMDCSNITLHPGDTLYLPRGIVHHAWTDSG 338 Query: 195 AMNYSVGFRAPNTRE-LISGFADYVLQRELGGNYYSD 230 + V ++ + L + R G + D Sbjct: 339 ITSTHVTYQLQSKDATLFDRLRNQCYARASTGTRWCD 375 >UniRef50_A9V0X5 Predicted protein n=2 Tax=Monosiga brevicollis RepID=A9V0X5_MONBE Length = 2283 Score = 71.1 bits (173), Expect = 6e-11, Method: Composition-based stats. 
Identities = 31/150 (20%), Positives = 46/150 (30%), Gaps = 26/150 (17%) Query: 115 FSVPGGG--VGPHLDQYDVFIIQGTGRRRWRVGEKLQM------------KQHCPHPDLL 160 + P G + PH D YDV +IQ G ++W + Q + Sbjct: 163 YVTPTGAQALKPHTDPYDVLVIQTYGEKQWTICTPQPAGAQNRTDAEKAQLQEIVRHSIQ 222 Query: 161 QVDPFEAI-------IDEELEPGDILYIPPGFPHEGYALE----NAMNYSVGFRAPNTRE 209 +EA L+ GD+LY+P G H E + S+ + Sbjct: 223 GCTQYEAWQLAKMECQAITLKAGDVLYLPKGIIHYATTTESMGSTHITLSLERLTHSWLA 282 Query: 210 LISGFADYVLQRELGGNYYSDPDVPPRAHP 239 L L R Y D + P Sbjct: 283 LFGRACGLGLDRATCQQ-YEDVLLTASLTP 311 >UniRef50_A9V428 Predicted protein n=2 Tax=Monosiga brevicollis RepID=A9V428_MONBE Length = 2336 Score = 70.3 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 59/175 (33%), Gaps = 28/175 (16%) Query: 53 LVSHQDGKWQVSHGPFESYDHLGETNWSLLVQA--VNHWHEPTAALMRPFRELPDWRIDD 110 + + Q G H P + ++ + +V + +P+A L ID Sbjct: 94 IRTTQHGGIDTLHSPLTLAEFKNMSDVTSVVYKREFDRKAQPSA-LESLIESELG--IDA 150 Query: 111 LMISFSVPG--GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE-- 166 + ++ P + PH D YDV ++Q ++ W + + + E Sbjct: 151 TVHAYFTPANAQTLEPHTDPYDVVVVQVANQKHWTLCLPQTDNATVSLSEADRAQLQEIK 210 Query: 167 ----------------AII--DEELEPGDILYIPPGFPHEGYALE-NAMNYSVGF 202 +I + L GD +Y+P G H + + + ++G Sbjct: 211 RSHLDGCTTYTMSMLQPMICRNVTLHQGDSMYLPKGVIHYAVTTDTPSAHLTIGL 265 >UniRef50_A9VEP7 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9VEP7_MONBE Length = 254 Score = 70.3 bits (171), Expect = 1e-10, Method: Composition-based stats. 
Identities = 25/97 (25%), Positives = 45/97 (46%), Gaps = 18/97 (18%) Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D Y + I+Q G + W V QVD E+ + L PGD+L++P Sbjct: 69 PHTDNYHILILQLQGEKHWLV---------------CQVDNAESCEEFTLYPGDVLFLPR 113 Query: 184 GFPHEGYALE-NAMNYSVGFRAPNTRELI--SGFADY 217 H + +++ ++GF+ + +L+ +GF + Sbjct: 114 RAGHVAWTTNVTSVHATIGFQGVDCGDLVEAAGFTEQ 150 >UniRef50_A9V7C3 Predicted protein n=3 Tax=Monosiga brevicollis RepID=A9V7C3_MONBE Length = 3197 Score = 69.9 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 39/125 (31%), Gaps = 25/125 (20%) Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV-DPFE--------------AI 168 PH D YDV ++ G++ W V L Q+ FE A Sbjct: 171 PHTDPYDVLVLHLHGQKHWTVCHPLDSSTEWADASQAQLAQLFEMQRKSVDGCTNFDIAE 230 Query: 169 ID------EELEPGDILYIPPGFPHEGYALENAMNYSVGF----RAPNTRELISGFADYV 218 D L PGD+LY+P H N + + + +L+ + Sbjct: 231 TDKMRCEHFTLSPGDVLYLPKSTIHFATTSPNTTTAHITLSLERQGQSWIDLVRRACAVL 290 Query: 219 LQREL 223 + Sbjct: 291 DNQAC 295 >UniRef50_UPI0000D57503 PREDICTED: similar to JmjC domain-containing protein 5 (Jumonji domain-containing protein 5) n=1 Tax=Tribolium castaneum RepID=UPI0000D57503 Length = 394 Score = 69.5 bits (169), Expect = 2e-10, Method: Composition-based stats. 
Identities = 35/139 (25%), Positives = 53/139 (38%), Gaps = 35/139 (25%) Query: 98 RPFRELPDWRIDDLMISFSVPGGG---------VGP-------HLDQYDVFIIQGTGRRR 141 F ++P+ R D + + G GP H D + F++Q G ++ Sbjct: 256 NLFDQIPELRNDIYIPEYCCLGQDDNEPEINAWFGPAKTISPLHHDPKNNFLVQVFGTKQ 315 Query: 142 WRVGEKLQMKQHCPH-----PDLLQVDPFEAIID------------EELEPGDILYIPPG 184 + PH + QVDPF +D LE G++LYIPP Sbjct: 316 LILYSPDDTFCLYPHESTLLSNTAQVDPFNPDLDKYPNFRNAKAVKCILEAGEMLYIPPK 375 Query: 185 FPHEGYALENAMNYSVGFR 203 + H ALE + +SV F Sbjct: 376 WWHHVTALEKS--FSVSFW 392 >UniRef50_C5KSU0 Putative uncharacterized protein n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5KSU0_9ALVE Length = 256 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 68/193 (35%), Gaps = 29/193 (15%) Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPH----PDLLQVDPFEAIIDEELEPGDILYIPPGF 185 D I+Q GR+ W V + P D+ + E + LE GD+L IP G Sbjct: 30 DALILQCKGRKHWDVYDS-SSSHPYPGEEVGKDMPIDNLGEPLYSVTLEEGDLLLIPRGV 88 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 H A + + + + P+ S + ++ + + P+ Sbjct: 89 IHRARASQEQGSLHITVKIPS-----SDLSYGMM---------------IQRGVGSLSPE 128 Query: 246 EM-DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 + +LRE + ++ ++ E + ++ R I ++ L++ Sbjct: 129 YLESRLRERITKINSEQEAACKA---CMALPRPPGVICDNSLVRIAYDVKCELEEPTKER 185 Query: 305 RLGGLRVLRIGDD 317 R R R+ D+ Sbjct: 186 RCPTARFTRVTDE 198 >UniRef50_A9SQV0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9SQV0_PHYPA Length = 420 Score = 68.4 bits (166), Expect = 5e-10, Method: Composition-based stats. 
Identities = 49/234 (20%), Positives = 89/234 (38%), Gaps = 48/234 (20%) Query: 10 PDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGLAMESE-----VDSRLV--SHQDGKW 61 DFL ++ P+VL +++ + +++ L + V++R V + W Sbjct: 192 EDFLRDYFLPGIPLVLTDSIDHWPAMRNWNDITYLQKVAGHRTVPVEARQVGEHYLAADW 251 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 + + E + + Q+ N + L F ++P+ + D + + GGG Sbjct: 252 KQELMTISEFL---ERSLTHSAQSTNRLYLAQHPL---FEQVPELQADISIPDYCSIGGG 305 Query: 122 --------VGP-------HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--------- 157 +GP H D + + Q GR+ R+ + P+P Sbjct: 306 DLQSINAWLGPAGTITPLHHDPHHNLLAQVVGRKYVRLYSPESSQNIYPYPEPMLCNSSQ 365 Query: 158 ------DLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 DL++ FE D LE G +LYIPP + H +L + +SV F Sbjct: 366 VDVTNVDLVKFPNFEHLKFTDCILEEGQMLYIPPKWWHYVESLTPS--FSVSFW 417 >UniRef50_UPI0001927155 PREDICTED: similar to jumonji domain containing 5 n=1 Tax=Hydra magnipapillata RepID=UPI0001927155 Length = 406 Score = 68.0 bits (165), Expect = 6e-10, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 70/182 (38%), Gaps = 49/182 (26%) Query: 67 PFESYDHLGETNWSLLVQAVNHWHEP----------TAALMRPFRELPDWRIDDLMI--- 113 P E D NW+ + +V + + A + F ++P+ R DD+ I Sbjct: 228 PIEVGDKYTSENWTQKLISVGEFIDKYICTNNKIGYLAQ-HQLFEQIPELR-DDICIPDY 285 Query: 114 --------------SFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH-- 156 ++ P G V P H D Y +Q G + R+ ++ + PH Sbjct: 286 CCISEQENNRVMTHAWFGPKGTVSPLHHDPYHNLFVQVLGEKYIRLYDRKDSENLYPHES 345 Query: 157 -----PDLLQVDPFEA----------IIDEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 + ++ +A ++ L+ G++LYIPP + H +LE + +SV Sbjct: 346 QMLNNTSQVDLENVDAEKFPLFLQTNYVECVLKQGEMLYIPPKWWHYVRSLETS--FSVS 403 Query: 202 FR 203 F Sbjct: 404 FW 405 >UniRef50_A9V427 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V427_MONBE Length = 2348 Score = 68.0 bits (165), Expect = 6e-10, Method: Composition-based stats. 
Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 23/118 (19%) Query: 108 IDDLMISFSVPG--GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPF 165 ID + ++ P + PH D YDV ++Q ++ W + + + Sbjct: 162 IDATVHAYFTPANAQTLEPHTDPYDVVVVQVANQKHWTLCLPQTDNATVSLSEADRAQLQ 221 Query: 166 E------------------AII--DEELEPGDILYIPPGFPHEGYALE-NAMNYSVGF 202 E +I + L GD +Y+P G H + + + ++G Sbjct: 222 EIKRSHLDGCTTYTMSMLQPMICRNVTLHQGDSMYLPKGVIHYAVTTDTPSAHLTIGL 279 >UniRef50_A9GPS9 Transcription factor jumonji (JmjC) domain-containing protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GPS9_SORC5 Length = 336 Score = 67.6 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 43/92 (46%), Gaps = 8/92 (8%) Query: 118 PGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL-----QVDPFEAII-- 169 P G V P H D ++ Q GR+R+R+ + + + DP ++ Sbjct: 228 PAGTVTPLHYDTSNILFGQVYGRKRYRMIAPFETSLFDGARAMYAGRDPEKDPMAPVLVK 287 Query: 170 DEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 D LEPGD L+IP G+ H AL+ +++ + Sbjct: 288 DVVLEPGDALFIPVGWWHHVRALDASISLGIN 319 >UniRef50_A9V0E4 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V0E4_MONBE Length = 2290 Score = 67.2 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 31/151 (20%), Positives = 52/151 (34%), Gaps = 25/151 (16%) Query: 98 RPFRELPDWRIDD--LMISFSVPGGG--VGPHLDQYDVFIIQGTGRRRWRVGEK--LQMK 151 PF+ D ++ GG + PH D DVF+ Q G + W + Sbjct: 167 DPFQNALRAAFDTDVTQHLYATMPGGRALDPHTDGGDVFVHQLAGHKHWEICVPTTNTTC 226 Query: 152 QHCPHPDLLQVDPFEAI------------------IDEELEPGDILYIPPGFPHEGYALE 193 Q+C H + FE ++ L GD+LY+P H + Sbjct: 227 QNCTHGAQALLAEFERSSFQGCTSYSYEQLQNMSCLNLTLHAGDLLYLPRALVHHAWTDN 286 Query: 194 NAMNYSVGFRAPNTRE-LISGFADYVLQREL 223 + +Y + ++ + R L LQ+ Sbjct: 287 STASYHMTYQLKSERGTLRDRLTTQCLQQHH 317 >UniRef50_Q8RWR1 AT3g20810/MOE17_10 n=9 Tax=Viridiplantae RepID=Q8RWR1_ARATH Length = 429 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. 
Identities = 53/247 (21%), Positives = 84/247 (34%), Gaps = 47/247 (19%) Query: 1 MEYQLTLNWPDFLERHW-QKRPVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS- 55 +E + L+ FL ++ PVV+ ++ + L L A V + Sbjct: 188 VEKRSGLSLEGFLRDYYLPGTPVVITNSMAHWPARTKWNHLDYLNAVAGNRTVPVEVGKN 247 Query: 56 HQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI-S 114 + W+ F + TN S ++ P + R DD+ I Sbjct: 248 YLCSDWKQELVTFSKFLERMRTNKSSPMEPTYLAQHPLFDQINELR-------DDICIPD 300 Query: 115 FSVPGGG--------VGP-------HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---- 155 + GGG GP H D + + Q G++ R+ + P Sbjct: 301 YCFVGGGELQSLNAWFGPAGTVTPLHHDPHHNILAQVVGKKYIRLYPSFLQDELYPYSET 360 Query: 156 ------HPDLLQVDPFE-------AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 DL +D E +D LE G++LYIPP + H +L M+ SV F Sbjct: 361 MLCNSSQVDLDNIDETEFPKAMELEFMDCILEEGEMLYIPPKWWHYVRSL--TMSLSVSF 418 Query: 203 RAPNTRE 209 N E Sbjct: 419 WWSNEAE 425 >UniRef50_D0LTG4 Transcription factor jumonji n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTG4_HALO1 Length = 402 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 27/103 (26%), Positives = 38/103 (36%), Gaps = 21/103 (20%) Query: 125 HLDQYDVFIIQGTGRRRWRV-------------------GEKLQMKQHCPHPDLLQVDPF 165 H D D F+ Q G ++ R+ + Q+ P + Sbjct: 293 HRDLIDNFLAQVWGFKQMRLISPAHTAKLYAIAENLNPYYQPSQLDADRPDLAQFPMCAD 352 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTR 208 D L PGDILY+P G+ H +LE + SV F A N Sbjct: 353 VPYTDCVLSPGDILYLPAGWWHRVRSLEPS--LSVNFFALNQA 393 >UniRef50_B8CD01 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8CD01_THAPS Length = 415 Score = 65.7 bits (159), Expect = 2e-09, Method: Composition-based stats. 
Identities = 27/97 (27%), Positives = 38/97 (39%), Gaps = 23/97 (23%) Query: 120 GGVGPHLDQYDVF---------------IIQGTGRRRWRVGEK----LQMKQHCPHPDLL 160 G H D D F +IQ +GR+RWRV ++ L K P Sbjct: 47 QGFEAHWDWMDAFDAPFFTDQWKSLVVIVIQLSGRKRWRVAKQPTIFLSNKDQKRRPTNE 106 Query: 161 QVDPFEA----IIDEELEPGDILYIPPGFPHEGYALE 193 + F ++ + PGD LYIP GF H ++ Sbjct: 107 EAQYFATDEGHYVEFTMCPGDSLYIPRGFMHNASTVD 143 >UniRef50_B8BQ14 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BQ14_THAPS Length = 798 Score = 64.2 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 38/91 (41%), Gaps = 19/91 (20%) Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD---------------LLQVDPF--- 165 PH D+ DV +IQ G +RWR+ + P D LL+ Sbjct: 386 PHTDRQDVVVIQMEGAKRWRIFTPPTDGEVKPTADPFARGKGEDSLPLHTLLEGKEGRLG 445 Query: 166 -EAIIDEELEPGDILYIPPGFPHEGYALENA 195 E ++D GD+L+IP GFPH +E Sbjct: 446 TELLMDVVSREGDVLFIPAGFPHTTDTVEET 476 >UniRef50_A9V590 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V590_MONBE Length = 548 Score = 64.2 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 22/98 (22%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL----------------------QV 162 H D + F IQ G +R+ + + +Q P L Sbjct: 302 HFDLFHNFFIQVHGHKRFLLYPPARWQQLYMWPILHPAGRSMQVDLNGDYEDQQRRFPNF 361 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 ++ + PGD+LYIPP + H +LE +++ SV Sbjct: 362 RRRAEALEVVVGPGDVLYIPPLWIHHVTSLEASISVSV 399 >UniRef50_Q0QZH9 Gp49 n=1 Tax=Synechococcus phage syn9 RepID=Q0QZH9_BPSYS Length = 236 Score = 64.2 bits (155), Expect = 8e-09, Method: Composition-based stats. 
Identities = 42/216 (19%), Positives = 77/216 (35%), Gaps = 32/216 (14%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPI-SPDELAGLAMESEV-DSRLVSHQDGKWQVS 64 LN +FL ++ + + + D + ++ A + + L+ H + K + Sbjct: 9 LNEANFL---YEDKAYFFPQLVPDPGDFFLTWKDVEICANNPTLFEFELIDHDNNKVE-- 63 Query: 65 HGPFESYDHLGETNWSL-----LVQAVNHWHEPT-----------AALMRPFRELPDWRI 108 Y + + +V+ +NH H LM+ F + + Sbjct: 64 ---INRYTRAWIHDKQVQDHRQIVEHINHGHTFIIMNYAFYSRWTQELMKTFESIFA--V 118 Query: 109 DDLMISFSVPGG--GVGPHLDQYDVFIIQGTGRRRWRVGEKL--QMKQHCPHPDLLQVDP 164 D + + G H D FIIQ G W++ + M Q D ++ D Sbjct: 119 DCAIHVYGGKEGAKSFNIHDDYPSNFIIQVEGETEWKIYKNRISSMLQTGTAQDTIREDN 178 Query: 165 FEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 E + L+PGD LYIP H + ++ S+ Sbjct: 179 LEVDLHVTLKPGDALYIPSRAYHCAFPTGKRLSMSI 214 >UniRef50_Q1D441 JmjC domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D441_MYXXD Length = 295 Score = 64.2 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 50/244 (20%), Positives = 84/244 (34%), Gaps = 54/244 (22%) Query: 10 PDFLERHW--QKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGP 67 P F H+ ++RPVVL +++ A + D R+V + S+ P Sbjct: 17 PAFFREHYLEKRRPVVLTGVVSHWPAVTRWS--ADSFKQRFGDHRVVVERSRASVPSNDP 74 Query: 68 FESYDHLGETNWSL---LVQAVNHWHEPTA-------------ALMRPFRE--------- 102 E + L + + ++ H P A L+ F Sbjct: 75 LEFLRNRYYEEARLGDTIARMMSGEHPPGAYYVTYANIFDAAPELLGDFESPPQTWGIPP 134 Query: 103 -LPDWRIDDLMIS---FSVPGGGV-GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ----- 152 P D L + + P G V H D+ + F Q +GR++W + + Sbjct: 135 HYPRALQDRLTLRPGFWLGPAGTVSAVHFDRQENFNAQISGRKKWTLYSPQDSRHLYYPA 194 Query: 153 -----------HCPHPDLLQVDPFEAII--DEELEPGDILYIPPGFPHEGYALENAMNYS 199 PD + F + LEPG++L+IP G+ H LE ++ S Sbjct: 195 LDMPTVIFSPVDIEAPDARRFPRFAEAQPYETILEPGELLFIPAGWWHHVRTLE--LSIS 252 Query: 200 VGFR 203 + F Sbjct: 253 LNFW 256 >UniRef50_C3Z534 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Z534_BRAFL Length = 346 Score = 64.2 bits (155), Expect = 9e-09, Method: Composition-based stats. 
Identities = 29/144 (20%), Positives = 54/144 (37%), Gaps = 29/144 (20%) Query: 125 HLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCP-------HPDLLQVDPFEAIIDE----- 171 H D Y ++Q GR++W + + P QV+ ++E Sbjct: 171 HYDTYGCNLVLQVYGRKKWVLFPPEDSPKLYPTRLPYEESSVFSQVNVAHPDVEEHPKVM 230 Query: 172 -------ELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQ---R 221 LEPGD+L++P + H +L ++ + + E+ S D + + R Sbjct: 231 SSHPHVVILEPGDVLFVPKHWWHYVESLSTSVAVN------SWIEMASDAEDRLQEAVVR 284 Query: 222 ELGGNYYSDPDVPPRAHPADVLPQ 245 + + SD D +P +V Sbjct: 285 LVLLSIKSDSDQSQWLNPTEVTES 308 >UniRef50_B3SDY7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3SDY7_TRIAD Length = 329 Score = 63.8 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 63/168 (37%), Gaps = 26/168 (15%) Query: 115 FSVPGGGVGPHLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCP-----------------H 156 G H D Y + Q GR++W + + + P Sbjct: 119 MGSKGASTPCHYDSYGCNLVAQLYGRKKWLLVAPDESQYMYPIRVPYEESSIFSAVNMKS 178 Query: 157 PDLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGF 214 P+L+ F I + LEPGD+L++P + H+ LE A+ SV + + Sbjct: 179 PNLVSYPKFANVTIYEVILEPGDVLFVPKYWWHDVECLETAI--SVNTWIALPSDGMDRL 236 Query: 215 ADYVLQRELGGNYYSDPDVPPRA-HPADVLPQE---MDKLREMMLELI 258 + V + + ++ + + +P++V + L+ + +L+ Sbjct: 237 CEAVTKTVVFALKSAEGNKITQWLNPSEVPTSYQTNIGYLKNSLEQLM 284 >UniRef50_UPI0000DB7045 PREDICTED: similar to Hspb associated protein 1 n=1 Tax=Apis mellifera RepID=UPI0000DB7045 Length = 310 Score = 63.8 bits (154), Expect = 9e-09, Method: Composition-based stats. 
Identities = 29/132 (21%), Positives = 46/132 (34%), Gaps = 24/132 (18%) Query: 109 DDLMISFSVPGGGVGPHLDQY-DVFIIQGTGRRRWR---------------------VGE 146 DD I G H D Y + Q GR++W + Sbjct: 106 DDSTIWIGSKGAHTNCHQDSYGCNLVAQIHGRKQWLLFPPNSTNFLRPTRIPYEESTIYS 165 Query: 147 KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPN 206 K ++ + + LEPGDIL++PPG+ H +L+ + SV P Sbjct: 166 KYNFFCPTKEDEINILKIKDTAKLVTLEPGDILFVPPGWWHYVESLD--FSISVNMWLPI 223 Query: 207 TRELISGFADYV 218 + IS + + Sbjct: 224 LTDNISRVKEAI 235 >UniRef50_Q2SID9 Uncharacterized conserved protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SID9_HAHCH Length = 304 Score = 63.4 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 15/98 (15%) Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-------------HPDLLQVDPFE--A 167 G H D D +Q G ++ + + + P PDL+ F Sbjct: 144 GLHYDNMDNLFVQVYGEKKAILLAPREARNLYPFGDCISKSRVDPERPDLMHYPRFAKAQ 203 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP 205 + L+PGDIL+ P G+ H + +++ S + AP Sbjct: 204 TLTARLQPGDILFFPRGWWHHFSSAGPSISLSCWYGAP 241 >UniRef50_UPI00015B5EA6 PREDICTED: similar to Jumonji domain containing 5 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5EA6 Length = 402 Score = 63.4 bits (153), Expect = 1e-08, Method: Composition-based stats. 
Identities = 45/223 (20%), Positives = 74/223 (33%), Gaps = 51/223 (22%) Query: 15 RHWQ-----KRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFE 69 HWQ K L+R N PI E+ SR + + W S F Sbjct: 195 EHWQALHLWKDAEYLRRIVGNRTVPI------------EIGSR---YTEDDWTQSLVTFS 239 Query: 70 SY--DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL---------MISFSVP 118 + H+ N + A + + L F D + ++ P Sbjct: 240 DFLRSHISSKNEKVGYLAQHQLFDQIPELKNDFSVPEYCSFSDTEEDNEELPDINAWFGP 299 Query: 119 GGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-----HPDLLQVDPFEAIID-- 170 G V P H D + + Q G +R + + P + ++DP+ + Sbjct: 300 SGTVSPLHHDPKNNLLCQVFGYKRIILYSPDDNENVYPYETRLLSNTARIDPYNPDFEKY 359 Query: 171 ----------EELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 L+PGD+L+IPP + H L + +S+ F Sbjct: 360 PNLQKAKAFMCYLKPGDMLFIPPKWWHHVVGLTPS--FSISFW 400 >UniRef50_Q96EW2 HSPB1-associated protein 1 n=23 Tax=Amniota RepID=HBAP1_HUMAN Length = 488 Score = 63.0 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 66/211 (31%), Gaps = 37/211 (17%) Query: 120 GGVGPH----LDQYD---VFIIQGTGRRRWRVGEKLQMKQHCP----------------- 155 G +G H LD Y VF Q GR+RW + P Sbjct: 166 GSLGAHTPCHLDSYGCNLVF--QVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVV 223 Query: 156 HPDLLQVDPFEAIID--EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISG 213 +PDL + F L PG +L++P + H +++ + S+ + ++ Sbjct: 224 NPDLKRFPQFRKAQRHAVTLSPGQVLFVPRHWWHYVESIDP-VTVSINSWIELEEDHLAR 282 Query: 214 FADYVLQRELGGNYYSDPDVPPRA--HPADVLPQEMDKLREMMLELINQPEHFKQWFGEF 271 + + + + ++ RA +P +V + +F Sbjct: 283 VEEAITRMLVCALKTAENPQNTRAWLNPTEVEETSHAVNCCYLNAA------VSAFFDRC 336 Query: 272 ISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 + E+ + + E + EV Sbjct: 337 RTSEVVEIQALRTDGEHMKKEELNVCNHMEV 367 >UniRef50_Q6AXL5 HSPB1-associated protein 1 homolog n=3 Tax=Clupeocephala RepID=HBAP1_DANRE Length = 449 Score = 62.6 bits (151), Expect = 2e-08, Method: Composition-based stats. 
Identities = 30/167 (17%), Positives = 55/167 (32%), Gaps = 31/167 (18%) Query: 125 HLDQYD---VFIIQGTGRRRWRVGEKLQMKQHCP-----------------HPDLLQVDP 164 HLD Y VF Q GR+RW + P PDL + Sbjct: 153 HLDSYGCNLVF--QIQGRKRWHLFPPDDTACLYPTRVPYEESSVFSHVNVIRPDLKKFPA 210 Query: 165 F--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQ-- 220 + + L+PG +L++P + H +++ + SV + + A+ + + Sbjct: 211 YGRARLYTVTLQPGQVLFVPRHWWHYVESVDP-VTVSVNSWIEMDMDDEARVAEALTKTI 269 Query: 221 ----RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEH 263 + SD + P + M L + +N+ Sbjct: 270 VCAVKSSPSLDNSDQWLNPTEDGVSSHDENMQYLNLAVKVCMNKKRD 316 >UniRef50_UPI00015B5A68 PREDICTED: hypothetical protein n=1 Tax=Nasonia vitripennis RepID=UPI00015B5A68 Length = 409 Score = 62.6 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 56/160 (35%), Gaps = 27/160 (16%) Query: 86 VNHWHEPTAALMRPFR---ELPDWRIDDLMISFSVPGGGVGPHLDQY-DVFIIQGTGRRR 141 +N W + +++ F D + D I G H D Y + Q GR+ Sbjct: 107 MNEWFKDIPEIVKSFDWHQFGIDLDVSDSTIWIGSKGAHTNCHQDTYGCNLVAQIQGRKL 166 Query: 142 WRVGEK----------LQMKQHCPHPDLLQVDPFEAIID-----------EELEPGDILY 180 W + + ++ + P + I+ LEP D+L+ Sbjct: 167 WLLFSPECGDLMQPTRIPYEESTVYSKYNFFAPSKQEIEAIKNMPGSVKMVTLEPKDLLF 226 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQ 220 IP G+ H +L+ ++ SV P + S + ++ Sbjct: 227 IPKGWWHYVESLD--ISLSVNVWLPLKEDCESRLKETLVH 264 >UniRef50_A7RV46 Predicted protein n=2 Tax=Eumetazoa RepID=A7RV46_NEMVE Length = 400 Score = 62.2 bits (150), Expect = 3e-08, Method: Composition-based stats. 
Identities = 40/166 (24%), Positives = 63/166 (37%), Gaps = 44/166 (26%) Query: 67 PFESYDHLGETNWSLLVQAVNHWHEP---------TAALMR--PFRELPDWRID------ 109 P E + W+ + ++ + + A L + F ++P+ R D Sbjct: 221 PIELGLRYTDEEWTQKLMTISEFVDKYVSCSNSSQVAYLAQHQLFDQIPELRRDIIIPDY 280 Query: 110 --------DLMI-SFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH--- 156 D+MI ++ P G V P H D Y+ + Q G + R+ K Q + PH Sbjct: 281 CCLGDDDRDVMINAWFGPKGTVSPLHHDPYNNLLAQVVGEKYLRLYSKDQTDKLYPHETT 340 Query: 157 ------------PDLLQVDPF--EAIIDEELEPGDILYIPPGFPHE 188 PDL Q F + + L PG +L+IPPG H Sbjct: 341 LLHNTSQIDVEAPDLAQFPAFYKASYQECILRPGQMLFIPPGHWHY 386 >UniRef50_Q7UIC0 Probable protein associating with small stress protein PASS1 n=1 Tax=Rhodopirellula baltica RepID=Q7UIC0_RHOBA Length = 316 Score = 62.2 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 33/77 (42%), Gaps = 6/77 (7%) Query: 124 PHLDQYD--VFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP----FEAIIDEELEPGD 177 H D + VF +Q G++RW + + P + + + + +L GD Sbjct: 122 WHYDGHSLHVFNLQLKGKKRWTIVAPETPLPNMPFSKTCLFEDNSLKGKRVYEFDLCEGD 181 Query: 178 ILYIPPGFPHEGYALEN 194 ++++P + H +++ Sbjct: 182 MVFLPRYWFHHVHSVGE 198 >UniRef50_A6FWV6 JmjC domain protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6FWV6_9DELT Length = 326 Score = 62.2 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 15/104 (14%) Query: 122 VGP-------HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAII----- 169 GP H D ++ Q G++R+R+ + + E I Sbjct: 215 FGPAGTRTPLHHDSCNILFCQLRGQKRFRMLAPWSARLADAAVGYYAGETLEQAIAAGER 274 Query: 170 --DEELEPGDILYIPPGFPHEGYALENAMNYS-VGFRAPNTREL 210 + LEPG+ LY+P + HE AL+ +++ S + FR P T EL Sbjct: 275 GYEVVLEPGEALYLPGWWWHEVLALDLSVSLSFLNFREPTTIEL 318 >UniRef50_Q4DVQ0 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4DVQ0_TRYCR Length = 1155 Score = 61.8 bits (149), Expect = 4e-08, Method: Composition-based stats. 
Identities = 29/152 (19%), Positives = 55/152 (36%), Gaps = 32/152 (21%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQ-----------HCPHPDLLQVDPFEAI----I 169 H D D + Q G++R + + + PDL + F Sbjct: 977 HYDTLDNVLCQVVGKKRVVLFPPSEYNNLYMSGSSSAVLNIDAPDLGRFPRFADACRHAT 1036 Query: 170 DEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS 229 + LEPGD+L++P + H +E + + SV F + + Y Sbjct: 1037 EVVLEPGDMLFLPSLWFHHITTMEGSYSISVNV-------FFERFPH---EDYDKKDLYG 1086 Query: 230 DPDVPPRAHPADVLPQEMDKLREMMLELINQP 261 + D+P A ++ E + +L+++P Sbjct: 1087 NKDLPAAAR-------LRKRIVEQVQQLVSEP 1111 >UniRef50_A4X242 Transcription factor jumonji, jmjC domain protein n=6 Tax=Bacteria RepID=A4X242_SALTO Length = 281 Score = 61.5 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 23/99 (23%), Positives = 41/99 (41%), Gaps = 21/99 (21%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQH----------------CPHPDLLQVDPFEAI 168 H D+++ F I GR+R+ + + + DL + Sbjct: 146 HFDEFENFNIALEGRKRFIIAPPGSRDYYPRSMLRGFGDKSQVFDLDNVDLGRYPRVAPK 205 Query: 169 I----DEELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 + D LEPG +LY+P G+ H+ +L+ +N +V F Sbjct: 206 LAQRRDFVLEPGHMLYLPLGWWHQAESLDP-ININVNFW 243 >UniRef50_Q8N371 JmjC domain-containing protein 5 n=17 Tax=Chordata RepID=JMJD5_HUMAN Length = 416 Score = 61.5 bits (148), Expect = 5e-08, Method: Composition-based stats. 
Identities = 40/181 (22%), Positives = 64/181 (35%), Gaps = 46/181 (25%) Query: 67 PFESYDHLGETNWSLLVQAVNHWH--------EPTAALMR--PFRELPDWRIDDLMISFS 116 P E + WS + VN + L + F ++P+ + D + + Sbjct: 236 PVEVGSRYTDEEWSQTLMTVNEFISKYIVNEPRDVGYLAQHQLFDQIPELKQDISIPDYC 295 Query: 117 VPGGG----------VGP-------HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH--- 156 G G GP H D F++Q GR+ R+ + PH Sbjct: 296 SLGDGEEEEITINAWFGPQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTH 355 Query: 157 ------------PDLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 PDL + F + L PG+IL+IP + H AL+ +++SV F Sbjct: 356 LLHNTSQVDVENPDLEKFPKFAKAPFLSCILSPGEILFIPVKYWHYVRALD--LSFSVSF 413 Query: 203 R 203 Sbjct: 414 W 414 >UniRef50_B5Y3G4 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y3G4_PHATR Length = 626 Score = 61.5 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 71/220 (32%), Gaps = 45/220 (20%) Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQM--------------KQHCPHPDLLQVDPFEAII 169 PH D+ DV ++Q +GR+ W+V P L + ++ Sbjct: 246 PHTDKQDVAVVQTSGRKHWKVYSPPNPAMKPTVDIFARGKGDDSLPLYILESDLGCQLLL 305 Query: 170 DEELEPGDILYIPPGFPHEGYALEN---------AMNYSVGFRAPNTRELISGFADYVLQ 220 + L PGD++++P FPH + +++ ++G DY+ Sbjct: 306 ETTLNPGDVMFVPAAFPHTTSTVTEDDSTHADKTSIHLTLGIDHHIWE------LDYLCC 359 Query: 221 RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 R L ++ + + E+ LIN F E +D Sbjct: 360 RRL-ALRRANVKDTALGQTGEEDSPYIGAANEVTAPLINDL------FAELPLGLLGGVD 412 Query: 281 IAPPEPPYQPDEIYDALKQ---------GEVLVRLGGLRV 311 A P + E+ ++ G + R R+ Sbjct: 413 YAAPVIEHVAAELERISREVDETTASAVGASVWREAVERL 452 >UniRef50_Q095N0 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q095N0_STIAU Length = 413 Score = 61.1 bits (147), Expect = 6e-08, Method: Composition-based stats. 
Identities = 32/110 (29%), Positives = 46/110 (41%), Gaps = 22/110 (20%) Query: 118 PGGGVG-PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH-----------------PDL 159 P G V H D D F+ Q GR+ R+ Q + P PD Sbjct: 287 PSGTVSHVHRDLIDNFLAQVWGRKHLRLFSPDQSRFLYPRRVDGNPFYEASDVDVSAPDF 346 Query: 160 LQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNT 207 + ID EL PG+++++P G+ H AL+ M++SV F A N Sbjct: 347 EKFPELRHARHIDCELRPGEMIFLPAGWWHYVRALD--MSFSVNFFAVNQ 394 >UniRef50_A2SGT4 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SGT4_METPP Length = 353 Score = 61.1 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 29/100 (29%), Positives = 49/100 (49%), Gaps = 20/100 (20%) Query: 118 PGGGVGP-HLDQYDVFII---QGTGRRRWRVGEKLQMKQ-----------HCPHPDLLQV 162 P G V P H +D ++ Q GR+RWR L+ + HPDL + Sbjct: 238 PAGTVTPLH---HDTLMLLHTQVVGRKRWRFISPLETPRLYNHDGVFSAIDLDHPDLDRY 294 Query: 163 DPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 F + ++ LEPGD +++P G+ H+ +LE ++++S Sbjct: 295 PAFRDVKVLEVVLEPGDTVFLPLGWWHQVASLEVSLSFSF 334 >UniRef50_A4RRR9 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RRR9_OSTLU Length = 235 Score = 61.1 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 29/105 (27%), Positives = 43/105 (40%), Gaps = 25/105 (23%) Query: 122 VGP-------HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP--------------HPDLL 160 GP H D + + Q G +R R+ + + P HP+L Sbjct: 131 FGPAHTESPAHTDPHHNLLCQVIGVKRVRLFAPSETPKMYPRDAPMSNTSRVDVMHPNLD 190 Query: 161 QVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 + F + ID L PGD LYIPPG+ H A +++SV + Sbjct: 191 EFPLFVDVEFIDATLYPGDALYIPPGWWHRVKA--ATVSFSVSYW 233 >UniRef50_A4S2N6 Predicted protein n=3 Tax=Ostreococcus RepID=A4S2N6_OSTLU Length = 373 Score = 61.1 bits (147), Expect = 6e-08, Method: Composition-based stats. 
Identities = 24/95 (25%), Positives = 38/95 (40%), Gaps = 16/95 (16%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKL-----------QMKQHCPHPDLLQVDPF----EAII 169 H D D +IQ G +R + + + D F +A + Sbjct: 169 HYDAMDNMLIQLHGEKRVLLFPPSVSGDLYLEGSSSVVRDVDDHDRESFPRFARARKAAL 228 Query: 170 DEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFR 203 + L+PGD+LYIP + H AL ++ +V FR Sbjct: 229 EVILQPGDVLYIPALWAHHVTALHGPSIALNVFFR 263 >UniRef50_D2VHH6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHH6_NAEGR Length = 311 Score = 61.1 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 85/242 (35%), Gaps = 48/242 (19%) Query: 6 TLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV------SHQD 58 ++ DF ++++ P +LK N+ ++ L + R V + Sbjct: 72 AISLMDFKKKYFNTHTPCLLKNASKNWEAYRKWSDVNYLL--EKAAYRAVPVEIGQYYTS 129 Query: 59 GKWQVSHGPFESY--DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL----- 111 W PF Y +++ E N + A + E +L + +E + +L Sbjct: 130 EDWSQKIMPFHQYVKEYVMEGNTQIGYLAQHPLFEQIHSLRKDIQEPIYCMLGELGEMSG 189 Query: 112 MISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP-----------DL 159 + ++ P G + P H D D ++Q G + R+ + D Sbjct: 190 VNAWYGPKGTISPLHTDPCDNILVQLVGHKFVRIYHPDETPHLYKRQSGILQANTSEIDN 249 Query: 160 LQVDPFEAII------------------DEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 L + FE D L GD+L+IP + H +L ++++S+ Sbjct: 250 LHLLQFEEEERKILNEKFPLISKATHYWDCTLCEGDMLFIPKLYWHYVQSL--SISFSIS 307 Query: 202 FR 203 + Sbjct: 308 YW 309 >UniRef50_UPI0001927319 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001927319 Length = 344 Score = 60.7 bits (146), Expect = 8e-08, Method: Composition-based stats. 
Identities = 22/110 (20%), Positives = 42/110 (38%), Gaps = 19/110 (17%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP-----------------DLLQVDPFE- 166 H D D F+I +G + R+ ++ P+P D + F Sbjct: 144 HFDPDDNFLIVFSGEKHVRLYRANDLENLYPNPFGSNGRTIQSQVNCDNPDFNKFPNFRN 203 Query: 167 -AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFA 215 + L+PG++LY P + H+ + + ++ ++ F T IS Sbjct: 204 VQFFECILKPGEMLYFPAFWWHQVTSTDTTISMNIFFGNDGTNTYISKIM 253 >UniRef50_A9UPY7 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UPY7_MONBE Length = 476 Score = 60.3 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 48/145 (33%), Gaps = 22/145 (15%) Query: 92 PTAALMRPFRELPDWRIDDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQM 150 P + L + + + S G P H D YDV ++Q +G ++W + + + Sbjct: 147 PLSDLELALQTVFHTTHSTVHAYISSGGAQALPYHTDSYDVIVLQVSGVKKWTICQPPSL 206 Query: 151 KQHCPHPD------LLQVDPFEAI---------------IDEELEPGDILYIPPGFPHEG 189 + P+P L ++ + + GD+LY+P H Sbjct: 207 ETAWPNPSPADLAQLYELRHANPDGCSNFNHRTIEQLNCQNLTMLAGDMLYLPKSMIHVA 266 Query: 190 YALENAMNYSVGFRAPNTRELISGF 214 + ++ + + + Sbjct: 267 HTKPGTVSAHLTYSLDREGGMWRDV 291 >UniRef50_D0MSD2 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MSD2_PHYIN Length = 249 Score = 59.5 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 39/103 (37%), Gaps = 19/103 (18%) Query: 118 PGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPF----------- 165 PGG V P H D D + Q G + R+ + + P LL Sbjct: 147 PGGTVSPLHFDPKDNVLCQVVGSKYLRLYAPEESDKLYPIEGLLSNTSLVQVEDPDDERF 206 Query: 166 -----EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 ++ L G++LYIPP + H +L + +SV F Sbjct: 207 PKFRNARYVECVLHEGEMLYIPPKYWHYVKSLFTS--FSVSFW 247 >UniRef50_UPI0001B56C67 cupin 4 family protein n=1 Tax=Streptomyces sp. C RepID=UPI0001B56C67 Length = 298 Score = 59.5 bits (143), Expect = 2e-07, Method: Composition-based stats. 
Identities = 31/207 (14%), Positives = 75/207 (36%), Gaps = 21/207 (10%) Query: 70 SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG--GVGPHLD 127 L E+ ++ + + A + ++ + + + +F P G G+ H D Sbjct: 79 KLRKLYESGHTVRLGNLQRVMPLMADISHGIQQETGY--SNYIHAFVTPSGEQGLRHHWD 136 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD---------LLQVDPFEAIIDEELEPGDI 178 Q I+Q G +RW++ + + + + ++ LE G Sbjct: 137 QQMAVIVQLEGVKRWQLWKPPVEAPMREFNESWRVWKQEYIPTWEAAGPDLEIHLEAGQS 196 Query: 179 LYIPPGFPHEGYALEN---AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 + +P G+ H L+ +++ + R A+ ++Q + + +P Sbjct: 197 MLLPRGWVHNPAVLDTNERSVHLTFAIRERTPF----WIAERLVQDAIKDPEFRRVILPE 252 Query: 236 RAHPADVLPQEMDKLREMMLELINQPE 262 + + L E+ +RE ++ Q + Sbjct: 253 QLKD-EGLAHEVSAVREGIIRYFQQLD 278 >UniRef50_A4RRC2 Predicted protein n=2 Tax=Ostreococcus RepID=A4RRC2_OSTLU Length = 617 Score = 59.1 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 38/94 (40%), Gaps = 9/94 (9%) Query: 119 GGGVGP-HLDQYDVFIIQ-GTGRRRWRVGEKLQMKQHCPHPDL-------LQVDPFEAII 169 G V P H D ++Q G G +R + + + +P+ + P + Sbjct: 220 AGSVSPMHFDASTSTLVQVGEGTKRMLLYQPFALSAIDLYPNWHPLRRRGRRFAPKALVR 279 Query: 170 DEELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 + + PGD L PP + H + + ++ SV R Sbjct: 280 EAIIAPGDALIFPPRWAHYTESCGDRVSASVTQR 313 >UniRef50_C5BK86 JmjC domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BK86_TERTT Length = 290 Score = 59.1 bits (142), Expect = 3e-07, Method: Composition-based stats. 
Identities = 31/144 (21%), Positives = 49/144 (34%), Gaps = 29/144 (20%) Query: 86 VNHWHEPTAALMRPF--RELPDWRID---DLMISFSVPGGGVGP-HLDQ---YDVFIIQG 136 + H +R P W D F P G + P H D +++F Q Sbjct: 100 IFHLLPDLIDGVRSMPSSIFPLWYRDSWWQFAQFFMSPSGSITPLHFDTLMTHNLF-FQV 158 Query: 137 TG----------------RRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 G RR WR + P + + + L PGD+LY Sbjct: 159 AGVKEFYLLPFSERSNCYRRGWR---WFDVDPLNPSEEKHPKYRPDRCLKVTLRPGDLLY 215 Query: 181 IPPGFPHEGYALENAMNYSVGFRA 204 +PPG H + +++++V F Sbjct: 216 MPPGMLHHVITKQASISFNVDFHT 239 >UniRef50_B5S2S3 Putative uncharacterized protein n=1 Tax=Ralstonia solanacearum MolK2 RepID=B5S2S3_RALSO Length = 329 Score = 58.4 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 78/238 (32%), Gaps = 57/238 (23%) Query: 6 TLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 +L+ F E ++ + PV+++ + + ++ +V++QD Sbjct: 91 SLSSEAFHENYYSRNLPVLIEDAAHAWPALTKW---TNAYLKENYGHCIVTYQDRGKPSD 147 Query: 65 H--------------GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 H E ++ GE+N L+ L+ P +DD Sbjct: 148 HRHSFIDHSTQIAFSKYIERVENSGESNACYLIAH--------DRLLDRPEFAP--LLDD 197 Query: 111 LM--ISFSVPGGGVGP--------------HLDQYDVFIIQGTGRRR---------WRVG 145 + + P G VG H D +VF++Q GR+R +V Sbjct: 198 IAFDERYLDPIGPVGKVFFWLGPKGAKTPLHRDLGNVFLVQVRGRKRVNFIPALEMHKVY 257 Query: 146 EKLQMKQHCPHPDLLQVD----PFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYS 199 D + + G++L+IP G+ H A++ ++ + Sbjct: 258 NSFGYHSDLDLDDYDPKKFPRMAKAHVSTTIVSSGEMLFIPVGWWHHVVAIDECISIT 315 >UniRef50_Q0J0P8 Os09g0489200 protein n=10 Tax=Poaceae RepID=Q0J0P8_ORYSJ Length = 413 Score = 58.0 bits (139), Expect = 5e-07, Method: Composition-based stats. 
Identities = 41/231 (17%), Positives = 79/231 (34%), Gaps = 56/231 (24%) Query: 1 MEYQLTLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS- 55 +E + ++ +F+ ++ + PV++ +++ ++ L A + V + Sbjct: 171 VERRSCISLEEFICDYFLRESPVIISGSIDHWPARTKWKDIQYLKKIAGDRTVPVEVGKN 230 Query: 56 HQDGKWQVSHGPFESY-DHLGETNW----SLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + +W+ F + + + + L Q H + D Sbjct: 231 YVCSEWKQELITFSQFLERMWSAGCPSNLTYLAQ-----HPLFEQIKELHE--------D 277 Query: 111 LMI-SFSVPGGG--------VGPH-------LDQYDVFIIQGTGRRRWRVGEKLQMKQHC 154 +M+ + GGG GPH D + + Q GR+ R+ + Sbjct: 278 IMVPDYCYAGGGELQSLNAWFGPHGTVTPLHHDPHHNILAQVLGRKYIRLYPASISEDLY 337 Query: 155 PHP---------------DLLQVDPFE--AIIDEELEPGDILYIPPGFPHE 188 PH DL + E +D LE GD+LYIPP + H Sbjct: 338 PHTETMLSNTSQVDLDNVDLKEFPRVENLDFLDCILEEGDLLYIPPKWWHY 388 >UniRef50_Q5YSD2 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5YSD2_NOCFA Length = 305 Score = 58.0 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 38/93 (40%), Gaps = 19/93 (20%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQM-------KQHCPHPDLLQVDPFEAII-------- 169 H D + +IQ GR+R R+ + + P D F+ Sbjct: 174 HNDPWHGLLIQLHGRKRVRLFPPNEYHNVYGIVPRRVNDPYTRLPDQFDPDTADYPRLRR 233 Query: 170 ----DEELEPGDILYIPPGFPHEGYALENAMNY 198 D L+ GD+LYIP + H+ +L+ +++Y Sbjct: 234 ATSYDVVLDAGDVLYIPMFWWHQVESLDASISY 266 >UniRef50_B3RNN1 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RNN1_TRIAD Length = 405 Score = 57.6 bits (138), Expect = 7e-07, Method: Composition-based stats. 
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%) Query: 118 PGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL---------QVDPFE- 166 P G + P H D Y Q GR+ R+ + + + P+P L + FE Sbjct: 303 PQGTISPLHHDPYHNLFAQVMGRKYIRLYPEHESENVYPYPTKLLSNTSQVDVEFPNFEN 362 Query: 167 -------AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFR 203 ++ +EPG +LYIPP H +L+ +++SV F Sbjct: 363 YPNFANAEYLECIIEPGQLLYIPPRCWHYVRSLD--ISFSVSFW 404 >UniRef50_B5W056 Transcription factor jumonji n=2 Tax=Arthrospira RepID=B5W056_SPIMA Length = 375 Score = 57.6 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 34/139 (24%), Positives = 60/139 (43%), Gaps = 21/139 (15%) Query: 91 EPTAALMRPFRELPDWRIDDLMIS-----FSVPGGGVGP-HLDQYDVFIIQGTGRRRWRV 144 L+ + +D S + P G V P H D ++ + Q +GR+ R+ Sbjct: 230 PEFKGLLNDLE-IFTEYLDPTQTSGCIFFWYGPAGTVTPLHHDPVNLLLAQVSGRKLIRM 288 Query: 145 GEKLQ-----------MKQHCPHPDLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYA 191 Q + +PD + F+ + I+ LEPG++++IP G+ H + Sbjct: 289 IPPYQVPFLYNHIGVFSEVDLENPDYRKYPLFQKVRPIEFILEPGEVIFIPVGWWHHVRS 348 Query: 192 LENAMNYSV-GFRAPNTRE 209 LE +++ S+ F PNT E Sbjct: 349 LEPSISVSMTNFVFPNTYE 367 >UniRef50_UPI0000D567FA PREDICTED: similar to reserved n=1 Tax=Tribolium castaneum RepID=UPI0000D567FA Length = 372 Score = 57.6 bits (138), Expect = 8e-07, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 46/112 (41%), Gaps = 18/112 (16%) Query: 125 HLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCPHP-----DLLQVDPFEAIID-------- 170 H+D Y ++Q GR++W + + + P +++ F +I Sbjct: 134 HIDTYGCNIVVQIHGRKQWILFPPDENLKPTRIPYEESSIYSKLNFFSPMITDFDGVGNC 193 Query: 171 --EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQ 220 LEPGD L++P + H L+ A+ S+ P + F + ++Q Sbjct: 194 RRVVLEPGDALFVPHKWWHYVENLDTAI--SINVWLPLPEDHEERFREALVQ 243 >UniRef50_C1FDI9 JmjC transcription factor domain-containing protein n=2 Tax=Micromonas sp. RCC299 RepID=C1FDI9_9CHLO Length = 636 Score = 57.2 bits (137), Expect = 1e-06, Method: Composition-based stats. 
Identities = 39/235 (16%), Positives = 77/235 (32%), Gaps = 31/235 (13%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISP-DELAGLAMESEVDSRLVSHQDG--KWQVSHG 66 FL W ++ + + N++ P + A +A + + R+ +D + Sbjct: 302 EHFLHCTWTQKLMSMAEFMENYVRPEKAVPQAAEMANQFHLKQRIRKRRDAMHAFCGRTE 361 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM-------ISFSVPG 119 P E +D + S A + E L+ + + P Sbjct: 362 PQEIFDETVFSRCSKGYMAQHDIFEHIPRLLHDLDFPFFCSQGSCTRGHFPKKMIWIGPA 421 Query: 120 GGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD------------------LL 160 G + P H D + Q G + R+ + L Sbjct: 422 GTISPLHTDPHANLFSQIAGYKYVRLYAPRCETNLYRNTTAKYCNSSQIELRGSLMGMLS 481 Query: 161 QVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISG 213 + F +D L PGD+L+IPP H +L ++++ ++ R +E++ Sbjct: 482 EFPDFLNAPYVDCVLGPGDLLHIPPLHWHYVQSLTSSVSVTMWCRPKYAQEILDE 536 >UniRef50_UPI000186E7C4 hspbap1/pass1, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E7C4 Length = 379 Score = 57.2 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 41/112 (36%), Gaps = 21/112 (18%) Query: 125 HLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCP-----------------HPDLLQVDPFE 166 H+D Y F+ Q GR+ W + ++ + P P + + Sbjct: 133 HMDTYGCNFVAQILGRKLWILIPPDEIDEMEPVRVPYEESSIYSAQNFYTPSEKLLSIKK 192 Query: 167 AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYV 218 A L PG++L+IP + H LE + SV P + S + + Sbjct: 193 AYQ-VILSPGEVLFIPRHWWHYVENLE--VAISVNVWVPMNIDCESRLEESL 241 >UniRef50_D2V3K7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V3K7_NAEGR Length = 766 Score = 56.8 bits (136), Expect = 1e-06, Method: Composition-based stats. 
Identities = 21/96 (21%), Positives = 38/96 (39%), Gaps = 22/96 (22%) Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMK------QHCPHPDL-----LQVDPFEAIIDE-- 171 H D D +Q G++ + E + + P D L +P E ++ Sbjct: 212 HYDNSDNLFVQIFGKKHMILWEPKEKSLLYLNEEDHPTSDRQTRIDLTKEPSEIALNFPN 271 Query: 172 ---------ELEPGDILYIPPGFPHEGYALENAMNY 198 L PGD+++IP G+PH + N+++ Sbjct: 272 FFKSNPVRVTLNPGDVMFIPKGWPHMVLSDGNSISL 307 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27431 Uncharacterized protein ycfD n=205 Tax=Gammaprot... 433 e-120 UniRef50_A0KI50 Cupin superfamily protein n=6 Tax=Gammaproteobac... 385 e-105 UniRef50_A1STI6 Cupin 4 family protein n=1 Tax=Psychromonas ingr... 366 e-100 UniRef50_A0YBW0 Transcription factor jumonji, jmjC n=1 Tax=marin... 366 e-100 UniRef50_C4LEX7 Cupin 4 family protein n=1 Tax=Tolumonas auensis... 362 1e-98 UniRef50_A6F8R4 Putative uncharacterized protein n=1 Tax=Moritel... 361 2e-98 UniRef50_Q5E4F9 Conserved protein n=16 Tax=Gammaproteobacteria R... 360 4e-98 UniRef50_C9QJT9 Putative uncharacterized protein n=2 Tax=Vibrion... 360 7e-98 UniRef50_A1RJT3 Cupin 4 family protein n=14 Tax=Alteromonadales ... 357 4e-97 UniRef50_A6F0B9 Transcription factor jumonji, jmjC n=1 Tax=Marin... 352 1e-95 UniRef50_Q1NG82 Putative uncharacterized protein n=1 Tax=Sphingo... 347 4e-94 UniRef50_Q1QUR4 Cupin 4 n=1 Tax=Chromohalobacter salexigens DSM ... 346 5e-94 UniRef50_Q48H58 YcfD protein n=22 Tax=Gammaproteobacteria RepID=... 346 9e-94 UniRef50_B3PKY0 Putative uncharacterized protein n=2 Tax=Pseudom... 344 3e-93 UniRef50_B8K5G8 Cupin superfamily protein n=1 Tax=Vibrio parahae... 343 9e-93 UniRef50_Q2Y9X5 Cupin region n=9 Tax=root RepID=Q2Y9X5_NITMU 339 1e-91 UniRef50_Q2S4H4 Cupin superfamily protein n=3 Tax=Bacteria RepID... 337 5e-91 UniRef50_A3QD76 Cupin 4 family protein n=19 Tax=Shewanella RepID... 
335 1e-90
UniRef50_Q2BJ43 Putative uncharacterized protein n=1 Tax=Neptuni... 334 3e-90
UniRef50_B4RRX0 Putative enzyme with RmlC-like domain n=2 Tax=Al... 334 4e-90
UniRef50_Q3JQS3 Cupin superfamily protein family n=25 Tax=Burkho... 331 2e-89
UniRef50_Q1N4P0 Transcription factor jumonji, jmjC n=1 Tax=Berma... 330 4e-89
UniRef50_C6WYD1 Cupin 4 family protein n=1 Tax=Methylotenera mob... 330 6e-89
UniRef50_D1UI98 Cupin 4 family protein n=6 Tax=Burkholderia RepI... 329 8e-89
UniRef50_Q2SJM1 Uncharacterized conserved protein n=3 Tax=Gammap... 328 2e-88
UniRef50_A6SXH9 Uncharacterized conserved protein n=2 Tax=Oxalob... 326 6e-88
UniRef50_Q31GJ6 Cupin superfamily protein n=2 Tax=Gammaproteobac... 326 9e-88
UniRef50_C5A9S6 Cupin superfamily protein family protein n=49 Ta... 326 1e-87
UniRef50_C4K8V5 Putative uncharacterized protein n=1 Tax=Candida... 324 2e-87
UniRef50_D2UDU1 Putative uncharacterized protein n=1 Tax=Xanthom... 323 9e-87
UniRef50_B8GSM7 Cupin 4 family protein n=1 Tax=Thioalkalivibrio ... 322 1e-86
UniRef50_D1RFR4 Cupin superfamily protein n=1 Tax=Legionella lon... 322 1e-86
UniRef50_D0L0L5 Cupin 4 family protein n=1 Tax=Halothiobacillus ... 319 9e-86
UniRef50_C5BU83 Cupin 4 family protein n=1 Tax=Teredinibacter tu... 318 2e-85
UniRef50_C1DCJ3 Cupin region n=1 Tax=Laribacter hongkongensis HL... 316 6e-85
UniRef50_Q21K45 Cupin 4 n=1 Tax=Saccharophagus degradans 2-40 Re... 315 2e-84
UniRef50_A1K4G1 Putative uncharacterized protein n=1 Tax=Azoarcu... 313 7e-84
UniRef50_C0N3X6 Cupin superfamily protein n=1 Tax=Methylophaga t... 311 4e-83
UniRef50_B7RUZ0 Cupin superfamily protein n=1 Tax=marine gamma p... 309 7e-83
UniRef50_Q5WVF0 Putative uncharacterized protein n=4 Tax=Legione... 309 1e-82
UniRef50_A6W0E5 Cupin 4 family protein n=2 Tax=Marinomonas RepID... 309 1e-82
UniRef50_Q5QZ10 Cupin superfamily protein n=2 Tax=Idiomarina Rep... 309 1e-82
UniRef50_A6GQ27 Putative uncharacterized protein n=1 Tax=Limnoba... 308 3e-82
UniRef50_UPI0000E0F5AA putative enzyme with RmlC-like domain n=1... 307 3e-82
UniRef50_C3M8B3 Putative uncharacterized protein n=3 Tax=Candida... 307 4e-82
UniRef50_Q7NS46 Putative uncharacterized protein n=1 Tax=Chromob... 306 1e-81
UniRef50_Q15T89 Cupin 4 n=1 Tax=Pseudoalteromonas atlantica T6c ... 306 1e-81
UniRef50_B2SQ70 Transcription factor jumonji, JmjC n=19 Tax=Xant... 301 3e-80
UniRef50_A1VLH8 Cupin 4 family protein n=6 Tax=Burkholderiales R... 299 8e-80
UniRef50_B7H3P1 Cupin superfamily protein n=16 Tax=Acinetobacter... 298 3e-79
UniRef50_A4BDP0 Putative uncharacterized protein n=1 Tax=Reineke... 297 5e-79
UniRef50_B8KGD9 Cupin 4 family protein n=2 Tax=unclassified Gamm... 296 7e-79
UniRef50_B1Y837 Cupin 4 family protein n=3 Tax=cellular organism... 291 2e-77
UniRef50_C7I1M3 Cupin 4 family protein n=1 Tax=Thiomonas interme... 291 4e-77
UniRef50_C0VP99 Cupin 4 n=2 Tax=Acinetobacter RepID=C0VP99_9GAMM 287 4e-76
UniRef50_A0Z1Z1 Putative uncharacterized protein n=1 Tax=marine ... 287 4e-76
UniRef50_C7RB22 Cupin 4 family protein n=1 Tax=Kangiella koreens... 284 3e-75
UniRef50_D1KE35 Putative uncharacterized protein n=1 Tax=uncultu... 284 3e-75
UniRef50_B8KRM1 Cupin 4 family protein n=1 Tax=gamma proteobacte... 280 7e-74
UniRef50_A4SX54 Cupin 4 family protein n=2 Tax=Polynucleobacter ... 279 2e-73
UniRef50_Q0VQ28 Putative uncharacterized protein n=1 Tax=Alcaniv... 277 5e-73
UniRef50_B4X170 Cupin superfamily protein n=1 Tax=Alcanivorax sp... 271 4e-71
UniRef50_B9ZR02 Cupin 4 family protein n=1 Tax=Thioalkalivibrio ... 271 4e-71
UniRef50_P44683 Uncharacterized protein HI0396 n=36 Tax=Gammapro... 267 3e-70
UniRef50_C1E292 Predicted protein n=2 Tax=Micromonas RepID=C1E29... 264 5e-69
UniRef50_A4S2B8 Predicted protein n=2 Tax=Ostreococcus RepID=A4S... 251 4e-65
UniRef50_B5DUH6 Lysine-specific demethylase NO66 n=2 Tax=Drosoph... 248 2e-64
UniRef50_B4GUZ2 Lysine-specific demethylase NO66 n=2 Tax=Drosoph... 247 3e-64
UniRef50_B4L6Q5 Lysine-specific demethylase NO66 n=1 Tax=Drosoph... 244 6e-63
UniRef50_UPI0000E87D6F hypothetical protein MB2181_02235 n=1 Tax... 236 8e-61
UniRef50_B4M7P8 Lysine-specific demethylase NO66 n=3 Tax=Drosoph... 232 1e-59
UniRef50_B4Q068 Lysine-specific demethylase NO66 n=5 Tax=Sophoph... 232 1e-59
UniRef50_A0YJB4 Putative uncharacterized protein n=1 Tax=Lyngbya... 228 3e-58
UniRef50_B5W5P2 Cupin 4 family protein n=2 Tax=Arthrospira RepID... 228 3e-58
UniRef50_UPI0000E4684D PREDICTED: hypothetical protein n=1 Tax=S... 226 9e-58
UniRef50_Q10ZZ1 Cupin 4 n=1 Tax=Trichodesmium erythraeum IMS101 ... 225 2e-57
UniRef50_Q31RB4 Putative uncharacterized protein n=2 Tax=Synecho... 222 2e-56
UniRef50_B4R4H1 Lysine-specific demethylase NO66 n=2 Tax=melanog... 221 3e-56
UniRef50_A2W941 Transcription factor jumonji n=1 Tax=Burkholderi... 220 5e-56
UniRef50_B0WMG3 Lysine-specific demethylase NO66 n=2 Tax=Culicin... 220 6e-56
UniRef50_B4JMQ2 Lysine-specific demethylase NO66 n=1 Tax=Drosoph... 220 8e-56
UniRef50_A4U3D3 MYC induced nuclear antigen n=1 Tax=Magnetospiri... 218 2e-55
UniRef50_A3Q8B6 Cupin 4 family protein n=4 Tax=Mycobacterium Rep... 218 3e-55
UniRef50_D2A374 Putative uncharacterized protein GLEAN_07936 n=1... 218 3e-55
UniRef50_Q54K96 Lysine-specific demethylase NO66 n=1 Tax=Dictyos... 217 7e-55
UniRef50_C8XBP6 Cupin 4 family protein n=1 Tax=Nakamurella multi... 216 9e-55
UniRef50_D0L9V4 Cupin 4 family protein n=1 Tax=Gordonia bronchia... 216 1e-54
UniRef50_Q7K4H4 Lysine-specific demethylase NO66 n=2 Tax=melanog... 216 1e-54
UniRef50_UPI000192663F PREDICTED: similar to Myc-induced nuclear... 215 2e-54
UniRef50_D0NRY0 Nucleolar protein, putative n=2 Tax=Phytophthora... 215 3e-54
UniRef50_UPI000186D1B6 conserved hypothetical protein n=1 Tax=Pe... 214 3e-54
UniRef50_UPI0000E45D23 PREDICTED: hypothetical protein n=2 Tax=S... 214 6e-54
UniRef50_UPI000180B5EA PREDICTED: similar to Nucleolar protein 6...
212 2e-53 UniRef50_A9TET4 Predicted protein n=1 Tax=Physcomitrella patens ... 212 2e-53 UniRef50_A9UZN8 Predicted protein n=1 Tax=Monosiga brevicollis R... 211 3e-53 UniRef50_B0CEG8 Cupin 4 family protein, putative n=1 Tax=Acaryoc... 210 9e-53 UniRef50_Q5ZMM1 Lysine-specific demethylase NO66 n=3 Tax=Eumetaz... 207 4e-52 UniRef50_Q7N884 Similar to unknown protein n=1 Tax=Photorhabdus ... 207 8e-52 UniRef50_Q091R3 Mina protein n=1 Tax=Stigmatella aurantiaca DW4/... 204 3e-51 UniRef50_B4B491 Cupin 4 family protein n=1 Tax=Cyanothece sp. PC... 203 9e-51 UniRef50_C3XRY1 Lysine-specific demethylase NO66 n=1 Tax=Branchi... 203 1e-50 UniRef50_B4V6J8 Putative uncharacterized protein n=1 Tax=Strepto... 202 2e-50 UniRef50_B3S582 Putative uncharacterized protein n=1 Tax=Trichop... 202 2e-50 UniRef50_A6W7N8 Cupin 4 family protein n=1 Tax=Kineococcus radio... 200 8e-50 UniRef50_Q6DDJ7 Mina-prov protein n=2 Tax=Xenopus RepID=Q6DDJ7_X... 200 1e-49 UniRef50_Q8IUF8 MYC-induced nuclear antigen n=25 Tax=Amniota Rep... 199 1e-49 UniRef50_UPI00017929D5 PREDICTED: similar to Nucleolar protein 6... 199 2e-49 UniRef50_A1R1T1 Putative cupin superfamily protein n=2 Tax=Micro... 199 2e-49 UniRef50_A3UGV1 Putative uncharacterized protein n=1 Tax=Oceanic... 198 2e-49 UniRef50_B7PMB0 MYC-induced nuclear antigen, putative (Fragment)... 198 4e-49 UniRef50_Q28VG0 Cupin 4 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28VG0... 196 1e-48 UniRef50_C6W918 Cupin 4 family protein n=2 Tax=Actinomycetales R... 195 2e-48 UniRef50_A4X6V2 Cupin 4 family protein n=4 Tax=Micromonosporacea... 194 5e-48 UniRef50_Q2T4J7 Unnamed protein product n=2 Tax=Burkholderia tha... 194 5e-48 UniRef50_O01658 Lysine-specific demethylase NO66 n=3 Tax=Caenorh... 194 6e-48 UniRef50_Q9H6W3 Lysine-specific demethylase NO66 n=17 Tax=Eumeta... 193 7e-48 UniRef50_Q0ALX3 Cupin 4 family protein n=1 Tax=Maricaulis maris ... 192 1e-47 UniRef50_A5PK74 Lysine-specific demethylase NO66 n=1 Tax=Bos tau... 
190 8e-47 UniRef50_D2SA69 Cupin 4 family protein n=2 Tax=Actinomycetales R... 189 1e-46 UniRef50_A7SRW5 Predicted protein n=1 Tax=Nematostella vectensis... 189 1e-46 UniRef50_D0MXW2 Nucleolar protein, putative n=1 Tax=Phytophthora... 189 1e-46 UniRef50_C6SNC5 Putative uncharacterized protein n=2 Tax=Neisser... 189 2e-46 UniRef50_C5LMW3 Putative uncharacterized protein n=1 Tax=Perkins... 187 4e-46 UniRef50_UPI000192614C PREDICTED: similar to chromosome 14 open ... 187 7e-46 UniRef50_Q849M1 Putative uncharacterized protein pSV2.19c n=3 Ta... 186 1e-45 UniRef50_A8QFQ3 Lysine-specific demethylase NO66 n=2 Tax=Brugia ... 185 2e-45 UniRef50_A3M7T2 Putative uncharacterized protein n=2 Tax=Acineto... 185 3e-45 UniRef50_C1EHB5 Predicted protein (Fragment) n=2 Tax=Micromonas ... 184 4e-45 UniRef50_A1KTI5 Putative uncharacterized protein n=2 Tax=Neisser... 183 6e-45 UniRef50_B9BV10 Cupin superfamily protein n=5 Tax=Proteobacteria... 183 7e-45 UniRef50_C7NJK3 Cupin superfamily protein n=1 Tax=Kytococcus sed... 182 2e-44 UniRef50_C6SMA2 Myc induced nuclear antigen n=24 Tax=Neisseria R... 180 6e-44 UniRef50_Q4D641 Putative uncharacterized protein n=1 Tax=Trypano... 180 8e-44 UniRef50_B1FB07 Cupin 4 family protein n=1 Tax=Burkholderia ambi... 179 1e-43 UniRef50_Q4Q6P0 Putative uncharacterized protein n=3 Tax=Leishma... 179 1e-43 UniRef50_B0BQ44 Putative uncharacterized protein n=5 Tax=Pasteur... 178 2e-43 UniRef50_A9C261 Cupin 4 family protein n=1 Tax=Delftia acidovora... 178 2e-43 UniRef50_UPI0000523E0E PREDICTED: similar to MYC induced nuclear... 176 1e-42 UniRef50_B6BWI1 Putative cytoplasmic protein n=1 Tax=beta proteo... 175 2e-42 UniRef50_B7G6P1 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 175 3e-42 UniRef50_C9N2N9 Cupin 4 family protein n=4 Tax=Streptomyces RepI... 173 9e-42 UniRef50_Q47NS9 Putative uncharacterized protein n=1 Tax=Thermob... 172 2e-41 UniRef50_D2VJG1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 
170 6e-41 UniRef50_C3ZLE4 Putative uncharacterized protein n=1 Tax=Branchi... 169 2e-40 UniRef50_B7FZB3 Predicted protein n=1 Tax=Phaeodactylum tricornu... 168 2e-40 UniRef50_B8BSJ2 Predicted protein n=1 Tax=Thalassiosira pseudona... 167 4e-40 UniRef50_C9NEK8 Cupin 4 family protein n=1 Tax=Streptomyces flav... 166 2e-39 UniRef50_Q2JG11 Cupin 4 n=3 Tax=Actinomycetales RepID=Q2JG11_FRASC 165 3e-39 UniRef50_A9V5A3 Predicted protein n=1 Tax=Monosiga brevicollis R... 165 3e-39 UniRef50_Q7T3G6 MYC induced nuclear antigen-like n=6 Tax=Euteleo... 163 9e-39 UniRef50_A0QI05 Cupin superfamily protein n=4 Tax=Mycobacterium ... 158 3e-37 UniRef50_UPI0001B4BFC9 putative cupin superfamily protein n=1 Ta... 158 3e-37 UniRef50_A1SPZ0 Cupin 4 family protein n=1 Tax=Nocardioides sp. ... 158 4e-37 UniRef50_C1YJ55 Cupin superfamily protein n=1 Tax=Nocardiopsis d... 157 6e-37 UniRef50_Q016L9 [S] KOG3706 Uncharacterized conserved protein n=... 155 3e-36 UniRef50_B0KHI4 Cupin 4 family protein n=1 Tax=Pseudomonas putid... 155 3e-36 UniRef50_A4RZ92 Predicted protein n=1 Tax=Ostreococcus lucimarin... 154 4e-36 UniRef50_C6XMP3 Cupin 4 family protein n=1 Tax=Hirschia baltica ... 153 8e-36 UniRef50_A5GJ70 Putative uncharacterized protein SynWH7803_0559 ... 152 2e-35 UniRef50_C9Z2L7 Putative uncharacterized protein n=2 Tax=Strepto... 150 8e-35 UniRef50_Q1DFZ7 Cupin family protein n=1 Tax=Myxococcus xanthus ... 148 2e-34 UniRef50_UPI0000D57503 PREDICTED: similar to JmjC domain-contain... 146 2e-33 UniRef50_Q091R4 Chromosome 14 open reading frame 169, putative n... 145 2e-33 UniRef50_D1VL61 Cupin 4 family protein n=1 Tax=Frankia sp. EuI1c... 145 2e-33 UniRef50_Q8RWR1 AT3g20810/MOE17_10 n=9 Tax=Viridiplantae RepID=Q... 145 3e-33 UniRef50_A9SQV0 Predicted protein n=1 Tax=Physcomitrella patens ... 142 2e-32 UniRef50_B7PVI8 Acetyltransferase, putative (Fragment) n=1 Tax=I... 142 2e-32 UniRef50_P46327 Uncharacterized protein yxbC n=1 Tax=Bacillus su... 
142 2e-32 UniRef50_Q6MH74 Putative uncharacterized protein yxbC n=1 Tax=Bd... 141 4e-32 UniRef50_UPI00015B5EA6 PREDICTED: similar to Jumonji domain cont... 141 5e-32 UniRef50_D1H9M4 Whole genome shotgun sequence of line PN40024, s... 140 6e-32 UniRef50_B8C536 Putative uncharacterized protein (Fragment) n=1 ... 140 6e-32 UniRef50_A9UW44 Predicted protein n=1 Tax=Monosiga brevicollis R... 140 1e-31 UniRef50_B1FKZ3 Cupin 4 family protein n=1 Tax=Burkholderia ambi... 140 1e-31 UniRef50_A8PJJ7 Acetyltransferase, GNAT family protein n=1 Tax=B... 139 2e-31 UniRef50_Q0J0P8 Os09g0489200 protein n=10 Tax=Poaceae RepID=Q0J0... 139 2e-31 UniRef50_B2GUS6 LOC100158649 protein n=5 Tax=Xenopus (Silurana) ... 138 2e-31 UniRef50_D2VHH6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 138 2e-31 UniRef50_A8TXW2 Putative uncharacterized protein n=1 Tax=alpha p... 138 2e-31 UniRef50_UPI0000ECAC04 JmjC domain-containing protein 5 (Jumonji... 138 3e-31 UniRef50_D1WSH6 Cupin family protein n=2 Tax=Streptomyces RepID=... 138 3e-31 UniRef50_B7FXD3 Predicted protein n=1 Tax=Phaeodactylum tricornu... 137 5e-31 UniRef50_Q8N371 JmjC domain-containing protein 5 n=17 Tax=Chorda... 137 6e-31 UniRef50_D2PSR6 Cupin family protein n=1 Tax=Kribbella flavida D... 135 2e-30 UniRef50_A7RV46 Predicted protein n=2 Tax=Eumetazoa RepID=A7RV46... 133 8e-30 UniRef50_Q1D4G2 Cupin family protein n=2 Tax=Myxococcus xanthus ... 133 8e-30 UniRef50_UPI0001791EB3 PREDICTED: similar to JmjC domain-contain... 132 2e-29 UniRef50_A0YLC3 JmjC domain protein n=1 Tax=Lyngbya sp. PCC 8106... 132 2e-29 UniRef50_A9V7T0 Predicted protein n=1 Tax=Monosiga brevicollis R... 132 2e-29 UniRef50_C4DQG4 Putative uncharacterized protein n=1 Tax=Stackeb... 132 2e-29 UniRef50_B3RNN1 Putative uncharacterized protein n=1 Tax=Trichop... 132 2e-29 UniRef50_C1C1Z9 JmjC domain-containing protein 5 n=1 Tax=Caligus... 131 3e-29 UniRef50_B8BVR1 Predicted protein n=1 Tax=Thalassiosira pseudona... 
131 5e-29 UniRef50_C7Q411 Cupin 4 family protein n=1 Tax=Catenulispora aci... 130 9e-29 UniRef50_C1E0D1 Predicted protein n=2 Tax=Micromonas RepID=C1E0D... 129 1e-28 UniRef50_UPI0001927155 PREDICTED: similar to jumonji domain cont... 129 2e-28 UniRef50_B6KFH2 Putative uncharacterized protein n=4 Tax=Toxopla... 128 3e-28 UniRef50_B2SWM5 Transcription factor jumonji jmjC domain protein... 127 6e-28 UniRef50_Q1D441 JmjC domain protein n=1 Tax=Myxococcus xanthus D... 127 6e-28 UniRef50_B5W056 Transcription factor jumonji n=2 Tax=Arthrospira... 126 1e-27 UniRef50_C1FDI9 JmjC transcription factor domain-containing prot... 126 1e-27 UniRef50_UPI000186F041 protein PTDSR-A, putative n=1 Tax=Pedicul... 126 2e-27 UniRef50_A4RRR9 Predicted protein n=1 Tax=Ostreococcus lucimarin... 125 2e-27 UniRef50_D0MSD2 Putative uncharacterized protein n=1 Tax=Phytoph... 125 2e-27 UniRef50_D0S717 JmjC domain-containing protein n=1 Tax=Acinetoba... 125 3e-27 UniRef50_B0X4Y9 Putative uncharacterized protein n=2 Tax=Culicin... 125 4e-27 UniRef50_C6XNR6 Transcription factor jumonji jmjC domain protein... 124 6e-27 UniRef50_B9IN14 Predicted protein n=2 Tax=rosids RepID=B9IN14_POPTR 123 1e-26 UniRef50_Q2RW70 Cupin region n=1 Tax=Rhodospirillum rubrum ATCC ... 122 2e-26 UniRef50_Q8S3P4 OSJNBa0011F23.16 protein n=4 Tax=Oryza sativa Re... 121 3e-26 UniRef50_Q96EW2 HSPB1-associated protein 1 n=23 Tax=Amniota RepI... 121 4e-26 UniRef50_UPI00005241B3 PREDICTED: similar to JmjC domain-contain... 118 3e-25 UniRef50_Q86NX2 GM21055p n=10 Tax=Drosophila RepID=Q86NX2_DROME 118 4e-25 UniRef50_Q17765 Protein C06H2.3, partially confirmed by transcri... 117 5e-25 UniRef50_UPI000180C0C1 PREDICTED: similar to reserved n=1 Tax=Ci... 116 9e-25 UniRef50_Q6AXL5 HSPB1-associated protein 1 homolog n=3 Tax=Clupe... 116 9e-25 UniRef50_UPI0001925EF6 PREDICTED: similar to predicted protein n... 
116 1e-24 UniRef50_B4LGI6 GJ13228 n=3 Tax=Drosophila RepID=B4LGI6_DROVI 116 1e-24 UniRef50_UPI0001927319 PREDICTED: similar to predicted protein n... 116 2e-24 UniRef50_A2RUC4 JmjC domain-containing protein C2orf60 n=26 Tax=... 114 4e-24 UniRef50_B3SDY7 Putative uncharacterized protein n=1 Tax=Trichop... 114 6e-24 UniRef50_A9V2P6 Predicted protein n=3 Tax=Monosiga brevicollis R... 114 6e-24 UniRef50_UPI00015B5A68 PREDICTED: hypothetical protein n=1 Tax=N... 113 8e-24 UniRef50_A2SGT4 Putative uncharacterized protein n=1 Tax=Methyli... 113 1e-23 UniRef50_C7J1Y3 Os04g0659150 protein n=4 Tax=Poaceae RepID=C7J1Y... 113 1e-23 UniRef50_UPI0000DB7045 PREDICTED: similar to Hspb associated pro... 112 2e-23 UniRef50_A9UR02 Predicted protein (Fragment) n=1 Tax=Monosiga br... 112 2e-23 UniRef50_A9TBQ2 Predicted protein n=1 Tax=Physcomitrella patens ... 111 3e-23 UniRef50_C5AHL8 JmjC domain protein n=2 Tax=Burkholderia RepID=C... 110 7e-23 UniRef50_A9V7C3 Predicted protein n=3 Tax=Monosiga brevicollis R... 109 2e-22 UniRef50_B5S2S3 Putative uncharacterized protein n=1 Tax=Ralston... 109 2e-22 UniRef50_UPI00006A359E PREDICTED: similar to predicted protein n... 109 2e-22 UniRef50_Q4DVQ0 Putative uncharacterized protein n=2 Tax=Trypano... 108 4e-22 UniRef50_Q55DF5 JmjC domain-containing protein D n=1 Tax=Dictyos... 107 6e-22 UniRef50_UPI0000D567FA PREDICTED: similar to reserved n=1 Tax=Tr... 107 6e-22 UniRef50_A9V0X5 Predicted protein n=2 Tax=Monosiga brevicollis R... 106 9e-22 UniRef50_A9VDC2 Predicted protein n=1 Tax=Monosiga brevicollis R... 106 1e-21 UniRef50_UPI000186E75E Hypoxia-inducible factor 1 alpha inhibito... 106 2e-21 UniRef50_Q6MPD0 Putative RNA methylase n=1 Tax=Bdellovibrio bact... 105 2e-21 UniRef50_UPI0000588708 PREDICTED: hypothetical protein, partial ... 105 2e-21 UniRef50_Q38DD6 Putative uncharacterized protein n=2 Tax=Trypano... 104 5e-21 UniRef50_A9V428 Predicted protein n=2 Tax=Monosiga brevicollis R... 
104 5e-21 UniRef50_B1TGS3 Transcription factor jumonji jmjC domain protein... 104 5e-21 UniRef50_A9VDD4 Predicted protein n=1 Tax=Monosiga brevicollis R... 104 6e-21 UniRef50_A4QNS2 LOC733353 protein n=3 Tax=Xenopus RepID=A4QNS2_X... 104 6e-21 UniRef50_Q2SID9 Uncharacterized conserved protein n=1 Tax=Hahell... 104 7e-21 UniRef50_C3Z534 Putative uncharacterized protein n=1 Tax=Branchi... 104 7e-21 UniRef50_A7SQQ9 Predicted protein (Fragment) n=1 Tax=Nematostell... 103 8e-21 UniRef50_Q1DDR6 JmjC domain protein n=1 Tax=Myxococcus xanthus D... 103 1e-20 UniRef50_A4RKC1 Putative uncharacterized protein n=1 Tax=Magnapo... 103 1e-20 Sequences not found previously or not previously below threshold: >UniRef50_P27431 Uncharacterized protein ycfD n=205 Tax=Gammaproteobacteria RepID=YCFD_ECOLI Length = 373 Score = 433 bits (1114), Expect = e-120, Method: Composition-based stats. Identities = 373/373 (100%), Positives = 373/373 (100%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK Sbjct: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG Sbjct: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA Sbjct: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG Sbjct: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 Query: 
301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML Sbjct: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAML 360 Query: 361 AALVNSGYWFFEG 373 AALVNSGYWFFEG Sbjct: 361 AALVNSGYWFFEG 373 >UniRef50_A0KI50 Cupin superfamily protein n=6 Tax=Gammaproteobacteria RepID=A0KI50_AERHH Length = 376 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 186/374 (49%), Positives = 249/374 (66%), Gaps = 5/374 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL L+ FLE +WQKRP+++K GF +F DPISPDELAGLAME ++SRLV+ + KW+ Sbjct: 2 YQLNLDIAHFLEHYWQKRPLLIKGGFTDFQDPISPDELAGLAMEEVIESRLVTRFNNKWE 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 +HGPFESYDHLGE NW++LVQA NHW L PF+ +P WR DD+M+SFS P GGV Sbjct: 62 AAHGPFESYDHLGEENWTVLVQACNHWAPEVNELALPFQFIPGWRFDDVMVSFSTPHGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVFI QG G+R WRVG+ + + H LL +PFEAIID +EPGDILYIP Sbjct: 122 GPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAALLHCEPFEAIIDVIMEPGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPHEGYA+E ++N+SVGFRAP+ + LIS FAD+++ E+ Y D D+ PRA ++ Sbjct: 182 PGFPHEGYAIEPSLNFSVGFRAPDAKALISSFADHLIDNEVRTERYGDADLKPRARHGEI 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 P E+ +LRE+M + ++ FK+WFG IS+++H+LD+ P EP Y +E+ D L QGE Sbjct: 242 QPHELHRLRELMQQALDDETLFKEWFGTMISEAKHDLDVNPVEPDYSAEEVADLLTQGEP 301 Query: 303 LVRLGGLRVLRIGD---DVYANGEKIDS--PHRPALDALASNIALTAENFGDALEDPSFL 357 +++ GLR + Y +GE A+ L +T + + + FL Sbjct: 302 AIKVPGLRTVWFSGESQQCYIDGEAWTLQSEDAAAISLLCDKDMVTQADMVELADQAGFL 361 Query: 358 AMLAALVNSGYWFF 371 +L LVN GYWFF Sbjct: 362 QLLTRLVNRGYWFF 375 >UniRef50_A1STI6 Cupin 4 family protein n=1 Tax=Psychromonas ingrahamii 37 RepID=A1STI6_PSYIN Length = 375 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. 
Identities = 172/372 (46%), Positives = 239/372 (64%), Gaps = 4/372 (1%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 ++L L+ DFL+ +WQK+P V+K+GF +F DPI PDE+AGLAME E++SRL+ +DG+WQ Sbjct: 2 FELNLDINDFLDTYWQKKPTVIKQGFVDFEDPIMPDEMAGLAMEEELESRLIYQEDGEWQ 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF S++ L +LLVQAV+HWH L+RPFR LP+WRIDDLMIS+S P GGV Sbjct: 62 ALSGPFTSFERLENDGATLLVQAVDHWHPDAQELIRPFRFLPNWRIDDLMISYSTPKGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVFIIQG G+R WRVG+K + + H L + F+AIID ELEPGDILYIP Sbjct: 122 GPHIDNYDVFIIQGLGKRHWRVGDKGALPEFAAHDALKHCESFDAIIDVELEPGDILYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 G+PHEGY++E ++NYS+GFRAP+ +L+S F DY + Y+D ++ R P + Sbjct: 182 AGYPHEGYSIETSLNYSIGFRAPDQNDLLSSFTDYCIDTNPAPERYADKEMLLREKPGQI 241 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 E+++L +ML PE WFG IS+++H+LDIA PE P+ I + L++G Sbjct: 242 ETPELNELHRIMLANCATPEMLMPWFGRMISEAKHDLDIAEPEQPHTAQSILEQLEEGAQ 301 Query: 303 LVRLGGLRVLRIGDD---VYANGEKIDSPHRPAL-DALASNIALTAENFGDALEDPSFLA 358 VRLGGL + ++ NGE+ + L L + E + +E+ + L Sbjct: 302 FVRLGGLHAVYFEQAPELLFINGEQFNCEGFTELGHHLCDQDEVGGELYDLLIENKNALI 361 Query: 359 MLAALVNSGYWF 370 + LVN GYW+ Sbjct: 362 LFTDLVNQGYWY 373 >UniRef50_A0YBW0 Transcription factor jumonji, jmjC n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YBW0_9GAMM Length = 391 Score = 366 bits (939), Expect = e-100, Method: Composition-based stats. 
Identities = 136/382 (35%), Positives = 218/382 (57%), Gaps = 15/382 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ DFL WQ +P +++ F NFI+P+SP++LAGLA E+E++SRL++ +GKWQ SH Sbjct: 5 NLDIADFLANTWQTKPRLIRNAFPNFINPMSPEDLAGLACEAEIESRLITEANGKWQTSH 64 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GP ++++L ++NW+LLVQAV+HW A L+ FR +P WRIDD+M+S++ GG VG Sbjct: 65 GPIAETTFNNLSDSNWTLLVQAVDHWVPEVADLLDNFRFIPSWRIDDVMVSYATRGGSVG 124 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQ-HCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D YDVF++QG G+RRW+VG +P+L + F A + LE GD+LYIP Sbjct: 125 PHYDNYDVFLVQGAGQRRWQVGGPCSAANSLQNNPELRLLADFVAEEEWVLEAGDMLYIP 184 Query: 183 PGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PG H G A++ + M YS+GFRAP+ E++S F D L Y+DP + + H + Sbjct: 185 PGISHWGTAMDNDCMTYSIGFRAPSHSEMLSDFCDDTLAGLTEELRYADPGLQEQGHSGE 244 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPP-EPPYQPDEIYDALKQG 300 ++P + + ++ +N + +WFG +++Q ++ + A + +Q ++ LK Sbjct: 245 IMPAAISNAQRILQNYVNDEQRLTEWFGRYVTQQKYPAETADNTDEKFQQGDLVQLLKDD 304 Query: 301 EVLVRLGGLRVLRIGDD-------VYANGEKIDSPHRPAL---DALASNIALTAENFGDA 350 V++R +R+ I + + NG +S + LA N + + Sbjct: 305 GVILRDPTVRIAFIDAESPSNSLLFFVNGVCFESVGDSCIALSKLLADNTRICSGQIMPW 364 Query: 351 LEDPSFLAMLAALVNSGYWFFE 372 L D + +L LVN G +F+ Sbjct: 365 LGDTESVQLLLRLVNQGVLYFD 386 >UniRef50_C4LEX7 Cupin 4 family protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LEX7_TOLAT Length = 381 Score = 362 bits (929), Expect = 1e-98, Method: Composition-based stats. 
Identities = 181/373 (48%), Positives = 244/373 (65%), Gaps = 5/373 (1%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 QL L+ F+ WQK+P VL+ + F DPI+PDELAGLA E +V+SRLV+ DGKW Sbjct: 3 QLNLDLAAFMREFWQKKPTVLRGAYAPFTDPITPDELAGLATEEQVESRLVTFADGKWTA 62 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 HGPF+ Y LGE++W+LLVQA +HW +P A L+ PFR LP+WRIDD+MIS+SVPGGGVG Sbjct: 63 EHGPFDDYSQLGESHWALLVQATDHWIKPVADLITPFRGLPNWRIDDVMISYSVPGGGVG 122 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH+DQYDVFIIQG+G RRWRVG +Q P LL V+ FE IID EL+ GDILYIPP Sbjct: 123 PHIDQYDVFIIQGSGSRRWRVGADTPAEQFVATPGLLHVEQFEPIIDVELQSGDILYIPP 182 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 GFPH+GYA+ AM+YS+G+RAPN ++L S FAD++LQ G Y+DP P V Sbjct: 183 GFPHDGYAITEAMSYSIGYRAPNQQDLFSSFADFLLQENAGQVRYTDPKRELTKTPGLVT 242 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 ++++ LR++M L++ + F +W G +SQ++HEL+I E PDE+ AL+ + L Sbjct: 243 NKDVNDLRDLMRTLLHDEQLFSKWLGTNLSQAKHELNILSQEWDLIPDELLPALEAEDEL 302 Query: 304 VRLGGLRVLRI---GDDVYANGEKIDSPH--RPALDALASNIALTAENFGDALEDPSFLA 358 RLGGLR L D + NGE++ P R + ++ LT + L++P + Sbjct: 303 YRLGGLRCLYFAALPDCCFVNGEQLQIPEGGRALAHLMCNSTVLTHKELQPYLDNPILVD 362 Query: 359 MLAALVNSGYWFF 371 + N GYW+ Sbjct: 363 WICYWFNQGYWYL 375 >UniRef50_A6F8R4 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6F8R4_9GAMM Length = 379 Score = 361 bits (928), Expect = 2e-98, Method: Composition-based stats. 
Identities = 164/378 (43%), Positives = 233/378 (61%), Gaps = 8/378 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 Y+L L+ DF++ +WQK+P+++K GF +FIDPISPDE+AGLAME +V SR+VS +DGKW+ Sbjct: 2 YKLNLDIADFMQNYWQKKPLLIKAGFKDFIDPISPDEIAGLAMEEDVTSRMVSLEDGKWE 61 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF +D L + ++LVQA+NHWH+P+A L F +P WR DDLM+S+S GGV Sbjct: 62 AKCGPFTEFDRLEKPGAAILVQAINHWHDPSAELANVFNFIPSWRFDDLMVSYSSDTGGV 121 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKL-QMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH+D+Y VFIIQG G+R WRVG + ++ + L + F+A+ID LEPGDILYI Sbjct: 122 GPHVDRYCVFIIQGQGKRHWRVGSQDMNPQEFAANGALKHCEAFDAVIDTVLEPGDILYI 181 Query: 182 PPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PP PHEGYA+ A+NYSVGFRA + +EL++ F DY+LQ++ YSDP + PRA Sbjct: 182 PPYAPHEGYAVGEAINYSVGFRAQDQKELLNDFGDYLLQQDKEFVRYSDPKLQPRAEHGS 241 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE--IYDALKQ 299 + E+ L ++M L+ + G S+S HELD+ PE Y D + D + Sbjct: 242 IESGEVQGLTDIMTSLMADKSVMHDFLGRHYSESAHELDLLVPEGGYIADYAIVVDEIGM 301 Query: 300 GEVLVRLGGLRVLRIGD---DVYANGEK--IDSPHRPALDALASNIALTAENFGDALEDP 354 L ++ GL+ L + + +GE+ D+ ++ L + TA+ +ED Sbjct: 302 ESYLRKVNGLKTLYFPEMPTSCFIDGERYDFDASIAASVQTLCNTTEQTAKELEVLMEDK 361 Query: 355 SFLAMLAALVNSGYWFFE 372 F +L VN GYW FE Sbjct: 362 VFGELLIEWVNLGYWHFE 379 >UniRef50_Q5E4F9 Conserved protein n=16 Tax=Gammaproteobacteria RepID=Q5E4F9_VIBF1 Length = 394 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. 
Identities = 168/381 (44%), Positives = 252/381 (66%), Gaps = 11/381 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL+ + +FL +WQK+PV++K GF NF DP++P+ELAGL +E++VDSR +S+ + +W+ Sbjct: 14 YQLSFSLQEFLSEYWQKKPVIIKDGFENFQDPVTPEELAGLTLENDVDSRFISNANNEWK 73 Query: 63 VSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 HGP Y+ LGETNWS++VQA NHWH+ A L +PF+++P+W DD+MIS+SVP G Sbjct: 74 AEHGPLSEELYETLGETNWSIIVQAANHWHKGAAELFKPFKQMPNWLFDDIMISYSVPHG 133 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GVGPH+DQYDVFIIQG G+R WRVG+ + ++ H L Q+ FE IID+ LEPGDILY Sbjct: 134 GVGPHIDQYDVFIIQGQGKRHWRVGDIGEYQEEHRHSALKQITGFEPIIDQILEPGDILY 193 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IPPGFPH+GYALE +M+YS GFR+P +ELIS FAD++++ E G +Y +P++ ++H + Sbjct: 194 IPPGFPHDGYALEPSMSYSAGFRSPKEQELISNFADFIIENEKGDVHYHNPELSTQSHGS 253 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ + + L+ MML ++ + KQ+ GE++S SRH L+I P + +E+ + L G Sbjct: 254 EITTRSFEDLKAMMLSAMSDEQTLKQFMGEYLSNSRHHLNIIPDSEKWTTEELLNYLHSG 313 Query: 301 EVLVRLGGLRVLRIGDD-------VYANGEK--IDSPHRPALDALASNIALTAENFGDAL 351 + L+++ G+R + ++ +GE + + L +T N L Sbjct: 314 QALIKVAGVRSFYHEVESCEENMTLFIDGESYVFPLKMKNDVITLCEANEVTLNNIEQLL 373 Query: 352 EDPSFLAMLAALVNSGYWFFE 372 DP +A L LVN GY++ E Sbjct: 374 LDPHSVANLLQLVNIGYFYAE 394 >UniRef50_C9QJT9 Putative uncharacterized protein n=2 Tax=Vibrionaceae RepID=C9QJT9_VIBOR Length = 377 Score = 360 bits (923), Expect = 7e-98, Method: Composition-based stats. 
Identities = 178/377 (47%), Positives = 244/377 (64%), Gaps = 8/377 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQLT + FL HW K+P V+K GF +FIDPIS DELAGLAME E+DSR +S++D +W Sbjct: 2 YQLTFDLKAFLAEHWHKKPTVIKAGFADFIDPISADELAGLAMEEEIDSRFISNKDNQWS 61 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 +HGP ++ L E++W L+VQA NHWH +A L++ F++LP W DDLM+ FS P G Sbjct: 62 ATHGPLPESHFESLDESHWQLIVQACNHWHLGSAELVQAFKQLPQWLFDDLMVCFSAPEG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGE--KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDI 178 GVGPH+DQYDVFIIQG+G+RRWRVG+ K Q K+ L Q++ FE+IIDE LEPGDI Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGDIDKGQYKESIQAGALRQIEGFESIIDEVLEPGDI 181 Query: 179 LYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 LYIPPGFPHEG LE +M+YS+GFR+P +EL+S FADYVL ++G + +P+ + + Sbjct: 182 LYIPPGFPHEGNTLEPSMSYSIGFRSPKEQELLSNFADYVLAHDIGDVHLHNPEQSAQDN 241 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 ++L Q++ KL +M+ +N + + + G +SQSRH+LDI PE Y E+ + L+ Sbjct: 242 NGELLSQDLAKLTDMLKAALNGEKDIQTFMGAMLSQSRHQLDIVEPEEAYSDTEVSEYLQ 301 Query: 299 QGEVLVRLGGLRVLRIGDDV---YANGEKIDSPHRPALDALASNIALTAENFGDALEDPS 355 G VL ++ GLR L Y NGE D P+ AL L+ + + Sbjct: 302 SGGVLRKVSGLRALYHQGYFHSIYINGESFDVPNSNMTRALCDYDELS-IDSSTGPDLDE 360 Query: 356 FLAMLAALVNSGYWFFE 372 +L LVN GYW+F+ Sbjct: 361 STQLLTKLVNKGYWYFD 377 >UniRef50_A1RJT3 Cupin 4 family protein n=14 Tax=Alteromonadales RepID=A1RJT3_SHESW Length = 386 Score = 357 bits (917), Expect = 4e-97, Method: Composition-based stats. 
Identities = 154/380 (40%), Positives = 231/380 (60%), Gaps = 10/380 (2%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 M+ ++ L +FL ++WQK+P+V+++GF F D +SP+ELAGLAM+ V+SR V Q G Sbjct: 1 MQLEINGLTPAEFLAQYWQKKPLVIRQGFKQFQDLVSPEELAGLAMDELVESRRVYQQAG 60 Query: 60 KWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 +W GPF+SY+ LGE +W+L+VQA+N+W AL++ F +P WR DD+M+S++ PG Sbjct: 61 QWHAEFGPFDSYEKLGERDWTLIVQALNNWVPDAEALIQCFDFIPRWRFDDVMVSYATPG 120 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 GGVGPH+D YDVFI QG+GRRRWRVG++ ++ HP LL + FE IID EL PGDIL Sbjct: 121 GGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFAAHPALLHTEAFEPIIDTELLPGDIL 180 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIPPGFPH+G LE ++++SVG+R + +++ S AD++ + +LG +DP+ Sbjct: 181 YIPPGFPHDGITLEESLSFSVGYRTASAKDMFSALADHLSEHDLGAQQIADPERQVSHRS 240 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 V ++ +LR + ++N ++ G +++QS+ LD+ DE+ L + Sbjct: 241 GCVDNNDLARLRSQLTSMLNDK-LVSEFSGRYLTQSKCALDLPDEPLDITQDEVLAWLDE 299 Query: 300 GEVLVRLGGLRVLRIG-----DDVYANGEKIDSPHRPA--LDALASNIALTAENFGDALE 352 + L+RLGGLR L ++ NGE+ P A + L L L+ Sbjct: 300 -QPLIRLGGLRCLYFDVSVEQGTIFINGERYQLPVELAGIIPLLCDMSQLDKTALLPWLD 358 Query: 353 DPSFLAMLAALVNSGYWFFE 372 + LA L VN GYW+FE Sbjct: 359 NADGLAQLTEWVNLGYWYFE 378 >UniRef50_A6F0B9 Transcription factor jumonji, jmjC n=1 Tax=Marinobacter algicola DG893 RepID=A6F0B9_9ALTE Length = 383 Score = 352 bits (904), Expect = 1e-95, Method: Composition-based stats. 
Identities = 120/383 (31%), Positives = 195/383 (50%), Gaps = 11/383 (2%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M+ + +FL +WQK+P+V+++ F F P+S DELAGLA E V+SR+V D Sbjct: 1 MQLPGGMPAQEFLRDYWQKKPLVIRQAFAGFECPVSADELAGLACEDAVESRIVIENDKG 60 Query: 61 --WQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 WQ+ +GPFE + L +++W+LLVQ ++HW A L+ FR +P+WR+DD+M S++ Sbjct: 61 KPWQLHNGPFEPERFSKLPDSHWTLLVQGLDHWVPDFADLLDEFRFVPNWRLDDIMASYA 120 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEP 175 GG VGPH DQYDVF++Q G RRW G L + +E L P Sbjct: 121 PKGGSVGPHYDQYDVFLLQAEGHRRWTFGGHCDHTSPRVDGTPLRILSSWEGEETVTLAP 180 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+LY+PPG H G A ++ + S+GFRAP ++++GF D++ R + DPD+ Sbjct: 181 GDMLYLPPGVGHHGVAEDDCITLSIGFRAPTVDDVLTGFTDFLCSRSDASGHLDDPDLKV 240 Query: 236 RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYD 295 + +P + P + +L ++ + ++ WFG++ + + + P E P P+ + + Sbjct: 241 QDNPGAIGPDVIHRLDRLIRDQLSDQRQLALWFGQYSTAPKSLEIVVPAEEPVTPELLGE 300 Query: 296 ALKQGEVLVRLGGLRVLRI----GDDVYANGEKI--DSPHRPALDALASNIALTAENFGD 349 + G L G R ++ +GE+ P L + Sbjct: 301 LIAAGNPLRWNEGSRFAYHDFEDETALFVDGEQFLLRGDAGPLAPLLCAGARPDMGALAS 360 Query: 350 ALEDPSFLAMLAALVNSGYWFFE 372 D + +L+ LVN G +F+ Sbjct: 361 FAGDDAIQGLLSTLVNQGSLYFD 383 >UniRef50_Q1NG82 Putative uncharacterized protein n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NG82_9SPHN Length = 380 Score = 347 bits (890), Expect = 4e-94, Method: Composition-based stats. 
Identities = 141/375 (37%), Positives = 211/375 (56%), Gaps = 11/375 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + + FL HWQK+P++++ + + +P+ PDELAGLA E V+SR+V DG W + H Sbjct: 4 SFDVQAFLRDHWQKQPLLIRNPWGAWANPLEPDELAGLACEEGVESRIVVQTDGDWALEH 63 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + + LG + W+LLVQAV+H AAL+ PFR +PDWRIDD+M+S++ GGGVG Sbjct: 64 GPFADDRFATLGGSPWTLLVQAVDHHAPDVAALIAPFRFIPDWRIDDVMVSYASDGGGVG 123 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH DQYDVF++QG GRRRWRVG++ PH DL + F A + LEPGDILY+P Sbjct: 124 PHFDQYDVFLVQGLGRRRWRVGQRCDRDTALRPHRDLRLLPDFAATDEWVLEPGDILYVP 183 Query: 183 PGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 PGF HEG A+ ++ M YS+GFRAP+ +++ +AD++ + + Y+DPD+ P A+P + Sbjct: 184 PGFAHEGVAVGDDCMTYSIGFRAPSRPDMLVEWADHLAAQMPDDDLYADPDIQPAANPGE 243 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + P + +L EM + + F WFG+ ++ ++ PE P +E+ + G Sbjct: 244 IEPDAIARLHEMTIAAMADRSAFAAWFGQHVTTPKYPDADWRPEEPVTAEELLALIDAGA 303 Query: 302 VLVRLGGLRVLR----IGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFL 357 L R R G ++ +G A A ++ + + Sbjct: 304 QLWRNPASRFAFLREEDGVTLFVDGSAYPCAG-DLAILAQQLCAYPALALDPSM--VAGV 360 Query: 358 AMLAALVNSGYWFFE 372 +L LVN G E Sbjct: 361 GLLVTLVNQGSLMIE 375 >UniRef50_Q1QUR4 Cupin 4 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QUR4_CHRSD Length = 397 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. 
Identities = 147/384 (38%), Positives = 222/384 (57%), Gaps = 14/384 (3%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--D 58 + L FL +WQK+P++++ F +F P++P+ELAGLA E +++RLV Q D Sbjct: 7 LSILGGLTAETFLRDYWQKKPLLIRGAFPDFASPLAPEELAGLACEDGIEARLVEAQGPD 66 Query: 59 GKWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 WQVSHGPF+ + L + W+LLVQAV+H+ AAL+ F LP WR+DD+M+S++ Sbjct: 67 KPWQVSHGPFDDATFARLPDREWTLLVQAVDHYVPEVAALLDAFDFLPRWRLDDVMVSYA 126 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFE--AIIDEEL 173 P G VGPH+D YDVF++QG+G+RRW++ GE+ DL ++ FE A D L Sbjct: 127 PPEGSVGPHVDNYDVFLLQGSGQRRWQLGGEQPDDAPIVSGIDLRMLERFEVTADEDWVL 186 Query: 174 EPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 EPGD+LY+PP H G + + M YS+GFRAP+ E+I+ FADY+ + + Y+DPD Sbjct: 187 EPGDMLYLPPRIAHHGVSQSADCMTYSIGFRAPSADEVITSFADYLGEMQPDSRRYTDPD 246 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 + P AH + Q + +LR +ML +I+ P QWFG ++Q ++ +AP + P Sbjct: 247 LAPCAHAGQLDDQAIARLRRLMLSVIDDPAQMAQWFGRVMTQPKYVDQLAPLDTPMDSAA 306 Query: 293 IYDALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAENFG 348 +AL QG L R G R +D ++ +G+ P P LA L A Sbjct: 307 TAEALAQGRYLERALGSRFAFHDEDGETTLFVDGDGHACP-PPLARLLADTTPLHAATLA 365 Query: 349 DALEDPSFLAMLAALVNSGYWFFE 372 + L+D + L++L L+N G ++ Sbjct: 366 EHLDDAA-LSLLTELLNRGSLQWQ 388 >UniRef50_Q48H58 YcfD protein n=22 Tax=Gammaproteobacteria RepID=Q48H58_PSE14 Length = 388 Score = 346 bits (887), Expect = 9e-94, Method: Composition-based stats. 
Identities = 138/381 (36%), Positives = 218/381 (57%), Gaps = 12/381 (3%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDG 59 ++ ++ FL +WQK+P+++++ +F PI DELAGLA+E EV+SRLV H + Sbjct: 7 LQLLGGISARVFLRDYWQKKPLLIRQALPDFQSPIDADELAGLALEEEVESRLVLEHGER 66 Query: 60 KWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 W++ GPF + + L E +W+LLVQAV+ + + L+ FR LP WRIDD+MIS++ Sbjct: 67 PWELRRGPFAEDEFSKLPERDWTLLVQAVDQFVPEVSELLENFRFLPSWRIDDVMISYAA 126 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPG 176 PGG VGPH D YDVF++QG G+R W++G+ H DL + FE + LEPG Sbjct: 127 PGGSVGPHFDNYDVFLLQGHGKRHWQIGQMCDAESPMLQHADLRILAEFEKTEEWTLEPG 186 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 D+LY+PP H G A+++ + YSVGFRAP+ E+++ F D++ Q Y+D D P Sbjct: 187 DMLYLPPRLAHCGVAVDDCLTYSVGFRAPSAAEVLTLFTDFLSQFIPDEERYTDADAQPV 246 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 + P + +D+L+ ++ E ++ WFG+F+++ R+ + PE + D++ D+ Sbjct: 247 SDPHQIQHDALDRLKALLTEHMSDERLLLTWFGQFMTEPRYPELVTGPE--LEEDDLLDS 304 Query: 297 LKQGEVLVRLGGLRVLRIGDD----VYANGEK--IDSPHRPALDALASNIALTAENFGDA 350 L+QG VL+R R+ D ++A+G+ + R L + + AL +EN G Sbjct: 305 LEQGAVLIRNPSARLAWSEVDDDLLLFASGQSRLLPGSLRELLKLICAADALHSENLGQW 364 Query: 351 LEDPSFLAMLAALVNSGYWFF 371 L D +L LV G F Sbjct: 365 LADDDGRNLLCELVKQGSLGF 385 >UniRef50_B3PKY0 Putative uncharacterized protein n=2 Tax=Pseudomonadaceae RepID=B3PKY0_CELJU Length = 396 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. 
Identities = 124/381 (32%), Positives = 209/381 (54%), Gaps = 11/381 (2%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 + + + +FL +WQK+P++++ F F PI+PDELAGLA+E EV+SR+V Sbjct: 17 LTHLGDMPIEEFLRDYWQKKPLLIRNAFPGFESPIAPDELAGLALEEEVESRIVLENGAT 76 Query: 61 -WQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 W++ +GPF+ + L E W+LLVQAV+ W L+ FR +P+WR+DDLMIS++ Sbjct: 77 PWELRNGPFDEDTFAKLPEKRWTLLVQAVDQWVPEVNQLLDYFRFIPNWRLDDLMISYAP 136 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPG 176 GGVGPH D YDVF++QG G+R W++G+ L + F + LEPG Sbjct: 137 DQGGVGPHFDYYDVFLLQGLGKRHWKIGQVCDNNSPRVEGTRLKILSEFHTTDEWVLEPG 196 Query: 177 DILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 D+LYIPPG H G A+ ++ M YS+GFRAP+ +++S V YSDPD+ Sbjct: 197 DMLYIPPGIAHWGNAVGDDCMTYSIGFRAPSHADILSEIGQEVALNIADDLRYSDPDLKR 256 Query: 236 RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYD 295 +++P ++ P+ + +L+ ++ + + PE WFG+++++ ++ D+ Sbjct: 257 QSNPGEIGPEAIAQLQHIIQQHL-TPETIAHWFGKYMTERKYLEQTDEEPLEIDADDWQA 315 Query: 296 ALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAENFGDAL 351 AL G++L R R+ D ++A+GE I+ R + + + ++ + Sbjct: 316 ALADGQLLWRHPAARLAFHSDKNGTFLFADGEAINC-SRELAELVCAETEISWVQIKPFV 374 Query: 352 EDPSFLAMLAALVNSGYWFFE 372 ++P +A L+ L+N + Sbjct: 375 QEPFDVAALSQLINQETLLID 395 >UniRef50_B8K5G8 Cupin superfamily protein n=1 Tax=Vibrio parahaemolyticus 16 RepID=B8K5G8_VIBPA Length = 375 Score = 343 bits (879), Expect = 9e-93, Method: Composition-based stats. 
Identities = 168/376 (44%), Positives = 242/376 (64%), Gaps = 8/376 (2%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 YQL+ + FL ++W K+P V+K G NFIDPISP+ELAGLAME EVDSR V++++G WQ Sbjct: 2 YQLSFDLDSFLAKYWHKQPTVIKHGITNFIDPISPEELAGLAMEEEVDSRFVTNKNGHWQ 61 Query: 63 VSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 HGP + L E++W L+VQA NHWH A L+ PF+ LP W DDLM+ +S P G Sbjct: 62 AQHGPLPESLFSQLEESHWQLIVQACNHWHLGAAELVAPFKALPQWLFDDLMVCYSAPQG 121 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDLLQVDPFEAIIDEELEPGDIL 179 GVGPH+DQYDVFIIQG+G+RRWRVG + + L Q++ F+AIIDE LEPGDIL Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGAADEGQYQESIQGALRQIESFDAIIDEVLEPGDIL 181 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 YIPPGFPHEG +E +M+YS+GFR+P +EL+S FADYVL +E G + +P + + + Sbjct: 182 YIPPGFPHEGNTIEPSMSYSMGFRSPKEQELLSHFADYVLAKEKGDVHLHNPQMQTQRNH 241 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 ++L ++ L +M+ + + + + +SQSRH+LDI PE +++Y L+ Sbjct: 242 GEILRSDLTLLTQMLQSALESKQDIENFLALNLSQSRHQLDIVEPEEVISQEQVYAHLEA 301 Query: 300 GEVLVRLGGLRVLRIGDD---VYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSF 356 +V++ GLR L ++ V+ NGE+ L ++T +++ +E PS+ Sbjct: 302 LGHVVKVSGLRALYHANNANHVFINGEEFSVAEPAFAPILCDQASITLDSYS--IESPSW 359 Query: 357 LAMLAALVNSGYWFFE 372 +A+L LVN GYW+ + Sbjct: 360 IALLTRLVNLGYWYLD 375 >UniRef50_Q2Y9X5 Cupin region n=9 Tax=root RepID=Q2Y9X5_NITMU Length = 415 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. 
Identities = 117/367 (31%), Positives = 197/367 (53%), Gaps = 10/367 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ DFL+ HWQK+P+++++ +F + +EL LA + + SRLV+ ++G+W+V H Sbjct: 34 GLSPSDFLQDHWQKKPLLIRKALPDFSGLLDANELIDLACQEDAQSRLVTRRNGRWEVRH 93 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF ++ L + W+LLVQ VNH+ L+ F +P R+DDLM+S++ GGVG Sbjct: 94 GPFAPRAFARLPQKGWTLLVQDVNHFLPAARELLLKFNFIPHSRLDDLMVSYAPEDGGVG 153 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++QGTGRRRWR+ + + L + F + LEPGD+LY+PP Sbjct: 154 PHFDSYDVFLLQGTGRRRWRISGQKD-RTLVAAAPLKILQDFRPEQEWVLEPGDMLYLPP 212 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 G+ H+G A+E M YS+GFRAP +EL F ++ Y DPD+ + HP + Sbjct: 213 GYAHDGVAVEPCMTYSIGFRAPTYQELAMQFLVHLQDSCEIAGIYEDPDLRIQTHPGQIS 272 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 +D++ + ++ +++ G ++++ + + PP+ P + +++G+ L Sbjct: 273 SAMLDQVNAALDKIEWDNVEVERFIGMYLTEPKPHVFFMPPQEPISERKFVHQIRKGK-L 331 Query: 304 VRLGGLRVLRIGDDVYANGE--KIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 R+L + ++ NG+ ++ + L LA +AL+ D A+L Sbjct: 332 QLDLKSRMLFRENRIFLNGDVYEVGKTAQRILGELADRLALSPVR----DIDAETQALLY 387 Query: 362 ALVNSGY 368 GY Sbjct: 388 QWYLDGY 394 >UniRef50_Q2S4H4 Cupin superfamily protein n=3 Tax=Bacteria RepID=Q2S4H4_SALRD Length = 394 Score = 337 bits (864), Expect = 5e-91, Method: Composition-based stats. 
Identities = 142/379 (37%), Positives = 218/379 (57%), Gaps = 13/379 (3%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK--WQVSH 65 + DFL+ +WQ+RP+V++ +F P+SP+ELAGLA E V+SRL+ + G+ W++ H Sbjct: 15 SPADFLDTYWQERPLVVRDALPDFRSPLSPEELAGLACEDGVESRLILEEGGEHPWELRH 74 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF E + HL ET+W+LLVQ V+ AL+ FR LPDWR+DD+M+S++ G VG Sbjct: 75 GPFASEEFLHLPETHWTLLVQEVDRLIPEVGALLDRFRFLPDWRLDDVMVSYAPTHGTVG 134 Query: 124 PHLDQYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH+D YDVF++QG G RRW++G E + ++ P D+ + FEA + L PGD+LY+P Sbjct: 135 PHIDNYDVFLLQGAGHRRWQIGTEPVDDEEIVPDLDVRILADFEAEEEFVLGPGDLLYLP 194 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 P H G A ++ M YSVGFRAP ++L+ F + YSDPD+ P HP + Sbjct: 195 PRVAHYGVATDDQCMTYSVGFRAPRHQDLVGNFLQQAMDTVGPDARYSDPDLSPVDHPGE 254 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + +R ++ +L+ + QWFG+++++ + + PPE P DE+ D L+ G Sbjct: 255 IHDDARQTVRRLLRDLVRDDDAIDQWFGQYLTRPGRDREAVPPETPVTDDELTDMLRAGH 314 Query: 302 VLVRLGGLRVLRIGDD-----VYANGEKIDS-PHRPALDALASNI-ALTAENFGDALEDP 354 L R+ I D ++ANG ID P R L + + ++ LED Sbjct: 315 GLRPGPVSRLAFIEHDDGSVTLFANGSPIDLSPDRAYAARLVTGRQQIPSDALTPHLEDD 374 Query: 355 SFLAMLAALVNSGYWFFEG 373 +F+ +L AL+N G ++ Sbjct: 375 AFVDLLVALINDGLLEWDA 393 >UniRef50_A3QD76 Cupin 4 family protein n=19 Tax=Shewanella RepID=A3QD76_SHELP Length = 386 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. 
Identities = 156/382 (40%), Positives = 225/382 (58%), Gaps = 15/382 (3%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 Y + +FL HWQK+P+V+K F +F DPI+PDELAGLA E E+ SR+V + W+ Sbjct: 6 YTPNFDTQEFLAHHWQKQPLVIKGAFAHFQDPIAPDELAGLACEEEIASRIVLTKKDNWE 65 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 + GP E Y G+ NW LLVQAVNHW+ L+ FR +PDWR DDLM+S++ PGGGV Sbjct: 66 IFQGPIEDYSPFGDANWQLLVQAVNHWYPDVEPLVNAFRFIPDWRFDDLMVSYATPGGGV 125 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH+D YDVF++QG GRRRW+VG K Q +D FE I+D LE GD+LYIP Sbjct: 126 GPHIDNYDVFLLQGEGRRRWKVGAKGQYSPRGGDTHTALIDDFEPILDVVLEAGDMLYIP 185 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 PGFPH G LE A++YS+GFRAP+ +EL S AD+++ G ++ P P + Sbjct: 186 PGFPHRGETLETALSYSIGFRAPSQQELFSSIADHLIDTNGGNKRFTSNQEPAS--PGLL 243 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 ++ + ++ E+++QP+H++ G+ +SQ+R ELD+A Y +E+ +AL+ G Sbjct: 244 SVEQQAGMLALVSEILSQPDHYQTVLGQTLSQNRFELDLAEQGESYSQEELMEALEDGAC 303 Query: 303 LVRLGGLRVLRIGDD----VYANGEKIDSPHRPA---------LDALASNIALTAENFGD 349 L R+GGL+V+R+ D ++ NGE D L LA+ + + D Sbjct: 304 LQRIGGLKVIRLEGDKHLRLFINGEIYDFDAVDDDDADELDDKLMLLANAFSFEGKQALD 363 Query: 350 ALEDPSFLAMLAALVNSGYWFF 371 + + L+NSGY + Sbjct: 364 LCQREAIGQYFIWLLNSGYAYL 385 >UniRef50_Q2BJ43 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ43_9GAMM Length = 382 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. 
Identities = 127/377 (33%), Positives = 206/377 (54%), Gaps = 16/377 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQDGKWQV 63 ++ FL+ +WQK+P++++ F +F P++ DELAG+A+E EV+SRL+ S W++ Sbjct: 7 DISVETFLKEYWQKKPLLIRNAFPDFEPPVTADELAGMALEEEVESRLIIQSADGADWEL 66 Query: 64 SHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 HGP + L +++W+LLVQAV+HW A L+ FR P+WR+DDLMIS++ GGG Sbjct: 67 KHGPLNEETFAELPDSHWTLLVQAVDHWVPEAAELVEQFRFAPNWRLDDLMISYASDGGG 126 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF+IQ TG RRW VG + ++ + +EA +L+PGD+LY Sbjct: 127 VGPHYDNYDVFLIQATGTRRWEVGGIFDEDSPRRDDVPVMILPEWEAEQSWDLQPGDMLY 186 Query: 181 IPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +PP H GYAL ++ M SVGFRAP+ +E+ +GF +Y+ + YSDPD+ +A+P Sbjct: 187 LPPRVGHNGYALGDDCMTLSVGFRAPSHQEIFAGFTNYLDNITCAEDRYSDPDLKTQANP 246 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 ++ + + +++ ++ E I P WFG+F+++S++ A + +I + ++ Sbjct: 247 GEIDQEAIGRVQAILREYIADPALLSHWFGQFMTESKYPDLGADQSQEMEEGDIKNLIEA 306 Query: 300 GEVLVRLGGLRVLRIGDDVYA-----NGEKIDSPHRPALDALASNIALTAENFGDALEDP 354 G L R G R + G L +++ T Sbjct: 307 GVPLCRTEGSRFAYHQGQPFVLFVDGKGCACSPGQIELAKRLCADLYHTEIE-----TSE 361 Query: 355 SFLAMLAALVNSGYWFF 371 L ++ AL+ G +F Sbjct: 362 ENLQLIKALLLQGSLYF 378 >UniRef50_B4RRX0 Putative enzyme with RmlC-like domain n=2 Tax=Alteromonas macleodii RepID=B4RRX0_ALTMD Length = 388 Score = 334 bits (856), Expect = 4e-90, Method: Composition-based stats. 
Identities = 137/377 (36%), Positives = 212/377 (56%), Gaps = 14/377 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + FL+ +WQ++PVV+K+ F +F DPI ++LAGLA ESEVD+R++S+ G W V Sbjct: 17 GFDADTFLKHYWQQKPVVIKQFFTDFDDPIDENDLAGLAQESEVDARVISNVQGNWHVEQ 76 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 GP +DH + W+LLVQ V+ + A ++ PF +P WR+DDLM+SF+ G GVG H Sbjct: 77 GPITDFDHACQGKWTLLVQGVDKYVPDVAPILSPFSFVPHWRLDDLMVSFATNGAGVGAH 136 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 +DQYDVF++QG G+RRWRVG+ K+ PHP L Q++ F +ID +EPGD++Y+PPG+ Sbjct: 137 IDQYDVFLVQGKGKRRWRVGQPGDYKEVFPHPKLRQIERFTPVIDVVVEPGDVIYVPPGW 196 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 PH+G +E+++ YSVG+RAP+ +L A +L + ++D + +PA V Sbjct: 197 PHDGETVEDSLTYSVGYRAPDNLQLAESLA-MMLDKGAHNYRFTDIGRTHQNNPALVSTS 255 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 ++ L++ +++ IN + Q E P + +++ + L G Sbjct: 256 DIAALKQQLIDAINGEDFTLALLEAMSEQGIPE---YPLDNEVNLEQVSNDLAAGMSFAP 312 Query: 306 LGGLRVLRIGD------DVYANGEKIDSP--HRPALDALASNIALTAENFGDALEDPSFL 357 G+R L +Y NG + + + LAS L A DA +FL Sbjct: 313 APGVRALLCDGKRGLPRALYVNGSQFTFAKNDQEWFEVLASGSILNATCCQDA-PSFTFL 371 Query: 358 AMLAALVNSGYW-FFEG 373 L L+N+GYW +FEG Sbjct: 372 ETLTTLINNGYWEWFEG 388 >UniRef50_Q3JQS3 Cupin superfamily protein family n=25 Tax=Burkholderiales RepID=Q3JQS3_BURP1 Length = 422 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. 
Identities = 116/374 (31%), Positives = 183/374 (48%), Gaps = 13/374 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ F+ R+WQK+P+++++ P+S D L LA + +V+SRLV+H +WQ+ H Sbjct: 46 NLSPAQFMRRYWQKKPLLIRQAITGIAPPLSRDALFELAADYDVESRLVTHFRNRWQLEH 105 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPFE + W+LLVQ ++ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 106 GPFEPEHLPSVKRREWTLLVQGLDLHDDRARALLERFRFVPDARLDDLMISYATDGGGVG 165 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q G+RRWR+G + L + FE + LEPGD+LY+PP Sbjct: 166 PHFDSYDVFLLQVHGKRRWRIGAQQD-LSLQEGLPLKILANFEPTDEWVLEPGDMLYLPP 224 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQR------ELGGNYYSDPDVPPRA 237 H+G AL M S+GFRAP+ EL + F ++ +R Y DP P Sbjct: 225 HIAHDGIALGECMTCSIGFRAPSAGELRAQFLYHLAERGGLRTGARDDARYRDPAQPAVD 284 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDA 296 PA + + ++ + + + G ++S+ + + PP + + A Sbjct: 285 SPAMLPAAMVKRVAATLAGIQWDEHDVGDFLGCYLSEPKSNVVFEPPTRRLGEAAFVTQA 344 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP-HRPALDALASNIALTAENFGDALEDPS 355 ++G L R +L + NG+ L LA + A+ F DP+ Sbjct: 345 SRRGVRLDRKAA--LLYNARSYFINGDAHPLATAAKWLPELADTRRMEAKRFVTLSRDPA 402 Query: 356 FLAMLAALVNSGYW 369 +L +G+ Sbjct: 403 MTGLLHEWYCAGWI 416 >UniRef50_Q1N4P0 Transcription factor jumonji, jmjC n=1 Tax=Bermanella marisrubri RepID=Q1N4P0_9GAMM Length = 386 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. 
Identities = 123/370 (33%), Positives = 199/370 (53%), Gaps = 12/370 (3%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDG 59 M + FL+ +WQK+PV++++ NF PI PD+LAGL++E +V+SR++ + D Sbjct: 1 MHVLGEFSVETFLKDYWQKKPVLIRQALPNFTPPIEPDDLAGLSLEEDVESRIILENGDT 60 Query: 60 KWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 WQ+ HGPF E++ +L E W+LLVQ V+ W + L+ F+ +P WR+DD+M+SF+ Sbjct: 61 PWQLIHGPFSEETFGNLPEEKWTLLVQGVDQWVPEMSELLSYFQFIPKWRLDDIMVSFAP 120 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPG 176 GG VGPH DQYDVF++Q GRR W++G K L ++ E + LEPG Sbjct: 121 KGGSVGPHFDQYDVFLLQAQGRRHWQIGPKYDASSPRIKDTPLHLLENMEVTEEWTLEPG 180 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 D+LYIPP + H G A+++ M +SVGFRAP+ E++SG + + + + Y D D+ Sbjct: 181 DMLYIPPQYAHNGVAVDDCMTFSVGFRAPSEAEILSGITQHAMDQLTEADRYHDEDLKAS 240 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 A PA + D+L++++ + N E ++WF E ++QS++ P E P +E+ Sbjct: 241 AQPALIDQAAFDRLQQIIAKHANNTELMQEWFAECMTQSKYPELAEPLEDPLDWEEVAPL 300 Query: 297 LKQGEVLVRLGGLRVLRIGDD----VYANGEKI----DSPHRPALDALASNIALTAENFG 348 L+ V+ + R Y NG+++ D+ + L T + Sbjct: 301 LQNDTVISQNETSRWAYYESKGHWIFYGNGQQLLESKDNELTDSAKKLWDQRQTTLNDIK 360 Query: 349 DALEDPSFLA 358 L+ Sbjct: 361 AILDHSEGQQ 370 >UniRef50_C6WYD1 Cupin 4 family protein n=1 Tax=Methylotenera mobilis JLW8 RepID=C6WYD1_METML Length = 395 Score = 330 bits (846), Expect = 6e-89, Method: Composition-based stats. 
Identities = 130/390 (33%), Positives = 202/390 (51%), Gaps = 22/390 (5%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 ++ ++ +FL+ +W K+P+++K F +SPDELAGLA E EV SR+V GK Sbjct: 8 LQLLGGISASEFLQHYWHKKPLLIKNAIPGFTGLLSPDELAGLACEEEVQSRIVEEIKGK 67 Query: 61 WQVSHGPFES--YDHLGET-----NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 W SHGPFE + +L E W+LLVQ+VNH A L+ F +P R+DDLM+ Sbjct: 68 WYASHGPFEESDFANLPEKPDPKHRWTLLVQSVNHHLPEAAELLSQFNFIPHARLDDLMV 127 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 S++ GGGVGPH D YDVF++QG G+R WR+ E+ L + F+ + + Sbjct: 128 SYAPDGGGVGPHFDSYDVFLLQGQGKRLWRISEQTD-LSLVEGAPLRILKNFDTAQEWLV 186 Query: 174 EPGDILYIPPGFPHEGYAL----ENAMNYSVGFRAPNTRELISGFADYVLQRELGGN--- 226 E GD+LY+PP H G A+ + M YS+GFRAP EL++ F ++ + Sbjct: 187 EAGDLLYLPPHLAHWGIAVTDGDTDCMTYSIGFRAPKVHELVTEFLGFMQDKLNQDANAL 246 Query: 227 --YYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPP 284 Y D D+ P+ HPA + + K+ E++ + +H + G ++S+ + ++ P Sbjct: 247 PGIYQDADLTPQEHPAQIGSSMVSKVAEILKTIQWSEQHVADFLGSYLSEPKPDIFFEPN 306 Query: 285 EPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIAL 342 + + L+ G L ++L Y NGE I + + A L ALA L Sbjct: 307 KKMSLRKFNENLLQHGISLDLK--SQMLFTQQYFYLNGEAISAAGQAASLLTALADYRML 364 Query: 343 TAENFGDALE-DPSFLAMLAALVNSGYWFF 371 T+++ A E D +F+ L +GY +F Sbjct: 365 TSDDIAQAGEVDSAFIEQLHGWYLAGYLYF 394 >UniRef50_D1UI98 Cupin 4 family protein n=6 Tax=Burkholderia RepID=D1UI98_9BURK Length = 424 Score = 329 bits (844), Expect = 8e-89, Method: Composition-based stats. 
Identities = 122/372 (32%), Positives = 192/372 (51%), Gaps = 11/372 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L F+ R+WQK+P+++++ + P+S DEL LA + +V++RL++H +WQ+ H Sbjct: 50 NLTPSQFMRRYWQKKPLLIRQAIPDVEAPLSRDELFELADQDDVEARLITHFRNRWQLEH 109 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + L + W+LLVQ V+ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 110 GPFAPDELPSLKQRAWTLLVQGVDLHDDRARALLERFRFVPDARLDDLMISYATDGGGVG 169 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q G+RRWR+ + L + F A + LEPGD+LY+PP Sbjct: 170 PHFDSYDVFLLQVKGKRRWRISAQKD-LTLQAGLPLKVLQNFAAEQEWVLEPGDMLYLPP 228 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGG----NYYSDPDVPPRAHP 239 H+G A M S+GFRAP+ EL + F ++ +R Y DP P P Sbjct: 229 HIAHDGVAEGECMTCSIGFRAPSAGELTAQFLYHLAERGEASGQAGALYRDPQQPAVERP 288 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDALK 298 A++ P ++++ ++ + + + G ++S+ + + PP+ P + I A K Sbjct: 289 AELPPALVERVGAILAGITWNEQDIASFLGTYLSEPKPSVVFDPPQRPLNEARFISQASK 348 Query: 299 QGEVLVRLGGLRVLRIGDDVYANGEKIDSP-HRPALDALASNIALTAENFGDALEDPSFL 357 G L R +L + NGEK + L LA + L+A+ F D S Sbjct: 349 SGVRLDRKTN--LLYNRRFFFLNGEKTSLEGSKKWLFDLADHRCLSAKRFVTLSHDSSVT 406 Query: 358 AMLAALVNSGYW 369 A L +G+ Sbjct: 407 ARLHEWYRAGWI 418 >UniRef50_Q2SJM1 Uncharacterized conserved protein n=3 Tax=Gammaproteobacteria RepID=Q2SJM1_HAHCH Length = 405 Score = 328 bits (842), Expect = 2e-88, Method: Composition-based stats. 
Identities = 126/391 (32%), Positives = 196/391 (50%), Gaps = 18/391 (4%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--D 58 + + ++ DFL +WQK+P++++ + P+ ++LAGLA E EV+SRLV + Sbjct: 2 LTHLGDISVADFLAHYWQKKPLIIRGLLPGYECPLDENDLAGLATEEEVESRLVYEELNG 61 Query: 59 GKWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 WQ+ HGPF E +++ W+LLVQ ++ W A L+ FR LP+WR+DD+M SF+ Sbjct: 62 QPWQLEHGPFSIEKLENMPHQGWTLLVQGLDTWVPEIADLLDRFRFLPNWRVDDIMASFA 121 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEP 175 PGG VGPH D YDVF+IQ TG RRWR+G + L + FE + LEP Sbjct: 122 PPGGSVGPHFDHYDVFLIQATGARRWRIGPPCDDQSPRVDGTPLRILQNFEQTEEWVLEP 181 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD LY+PPG+ H G A + + SVGFR+P EL+S AD + + D P Sbjct: 182 GDALYLPPGYAHYGVAETSCITLSVGFRSPTYAELMSALADDWFENPALSTHLHDATEAP 241 Query: 236 RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPE--PPYQPDEI 293 ++P + +R + L++ ++ FG FIS + + + P + P++ Sbjct: 242 LSNPGLISDDVFADIRSRLQALLDDEAGLRRSFGRFISAPKFDAAVPPLDAAMRLSPEDA 301 Query: 294 YDALKQGEVLVR-LGGLRVLR--------IGDDVYANGEKIDSPHR--PALDALASNIAL 342 ++L+ E+ R G R +GE D+ R P ++ L + + Sbjct: 302 GESLQDQEIQWRWNEGSRYTYSLYEEAGARRVMFAVDGEAYDADERFAPLVEILCRSNNV 361 Query: 343 TAENFGDALEDPSFLAMLAALVNSGYWFFEG 373 E D L +L++L+N G EG Sbjct: 362 DRERLLPWSADKDALKLLSSLLNRGSLVLEG 392 >UniRef50_A6SXH9 Uncharacterized conserved protein n=2 Tax=Oxalobacteraceae RepID=A6SXH9_JANMA Length = 373 Score = 326 bits (837), Expect = 6e-88, Method: Composition-based stats. 
Identities = 112/371 (30%), Positives = 184/371 (49%), Gaps = 6/371 (1%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 + L +FL +W K+P+++++ +F +S DEL GL +V+SRL++H + Sbjct: 4 LTLLGGLTAAEFLRDYWHKKPLLIRQAIPDFKPLLSRDELFGLVKSEDVESRLITHVKRE 63 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 W + GPFE L + +W+LLVQ VN E +LMR F +PD R+DDLMIS++ G Sbjct: 64 WNMDSGPFEQLPPLKQKDWTLLVQGVNLHDEAVDSLMREFSFIPDARLDDLMISYATETG 123 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GVG H D YDVF++Q G RRWR+G + L + F+ + L PGD+LY Sbjct: 124 GVGAHFDSYDVFLLQAHGHRRWRIGAQTD-LTLVDGMPLKILKNFKPEEEFILAPGDMLY 182 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 +PP + HEG A++ M YS+GFRAP+ +EL F + ++ Y+DPD+ P H A Sbjct: 183 LPPQYAHEGVAMDECMTYSIGFRAPSYQELGEAFLESMIDSIDLPGRYADPDLKPAKHSA 242 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ + ++ + ++ E + GE++S+ + ++ PE K+ Sbjct: 243 EISAAMLSRIAAELNKVRFTQEDIALFVGEYLSEPKAQIYFDAPEENLTRARFLQNAKKS 302 Query: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAENFGDALEDPSFLA 358 + + L L +L + ++ NG + L LA+ L+ A + Sbjct: 303 GIKLSLKSL-MLHRNNYIFINGTSFEVGDEDLAILTELANTRQLSGTIIASA--SADVID 359 Query: 359 MLAALVNSGYW 369 G+ Sbjct: 360 AFHTWHKDGWL 370 >UniRef50_Q31GJ6 Cupin superfamily protein n=2 Tax=Gammaproteobacteria RepID=Q31GJ6_THICR Length = 401 Score = 326 bits (836), Expect = 9e-88, Method: Composition-based stats. 
Identities = 109/381 (28%), Positives = 180/381 (47%), Gaps = 18/381 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV-SHQDGKWQVS 64 +++ FL +WQK+P++++ +F P+S +ELAGL++E EV+SR+V H +++ Sbjct: 18 SIDKETFLSEYWQKKPLLIRNALPDFSPPVSAEELAGLSLEEEVESRIVIQHSAEDYELK 77 Query: 65 HGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 GPF+ Y+ L E NW+LLVQ ++ L+ F +P WRIDD+M+S++ GG V Sbjct: 78 KGPFKESLYETLPEKNWTLLVQGMDRLLPEVTELLNEFDFIPSWRIDDIMVSYATEGGNV 137 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 GPH D YDVF++Q G RRW++ + + DL + F + +PGDILY+ Sbjct: 138 GPHFDHYDVFLLQAQGERRWQLSAQDCDETNYIEGVDLRIMKRFVVEEEYVCQPGDILYV 197 Query: 182 PPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 PP + H G L ++ M +S+G+R EL F DY+ + + + Y DP+ A P Sbjct: 198 PPKWGHHGVGLTDDCMTFSIGYRTYRGLELWDSFGDYLAETQQFQSLYQDPNWKGTA-PG 256 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP-----PYQPDEIYD 295 + + + ++ + E K WFG F +Q P+P + + Sbjct: 257 QISEGSWQQAQSLLKAALENEEALKNWFGRFATQLDQGASQLLPDPLSDTESCPLETFIE 316 Query: 296 ALKQGEVLVRLGGLRVLRIG-----DDVYANGEKID--SPHRPALDALASNIALTAENFG 348 AL+ E ++R R +Y N + + L + + Sbjct: 317 ALQSAEGVLRDSVCRFAYAEFTQNQVKLYINSAEWQDFKAESDFIRCLCNQRFIDQATLT 376 Query: 349 DALEDPSFLAMLAALVNSGYW 369 L A+L L N + Sbjct: 377 SYLNHAGNQALLYDLWNLQFI 397 >UniRef50_C5A9S6 Cupin superfamily protein family protein n=49 Tax=Burkholderiales RepID=C5A9S6_BURGB Length = 422 Score = 326 bits (835), Expect = 1e-87, Method: Composition-based stats. 
Identities = 122/374 (32%), Positives = 188/374 (50%), Gaps = 13/374 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L F+ RHWQK+P+++++ + P+S D L LA + + +SRL++H +WQ++ Sbjct: 46 NLTPSQFMRRHWQKKPLLIRQAIPGIVPPLSRDALFELAGDYDTESRLITHFRNRWQLAQ 105 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPFE S + + W+LLVQ V+ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 106 GPFELDSLPSVSKREWTLLVQGVDLHDDAARALLERFRFIPDARLDDLMISYATDGGGVG 165 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q GRRRWR+G + L + FE + LEPGD+LY+PP Sbjct: 166 PHFDSYDVFLLQVHGRRRWRIGAQQD-LTLREDLPLKVLARFEPTDEWVLEPGDMLYLPP 224 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQR------ELGGNYYSDPDVPPRA 237 H+G A M S+GFRAP+ EL F Y+ +R G Y DP PP Sbjct: 225 HIAHDGIAEGECMTCSIGFRAPSAGELTGQFLYYLAERGALRQGARAGELYRDPAQPPVD 284 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY-QPDEIYDA 296 PA + ++++ ++ + + + G ++S+ + + PE P + + A Sbjct: 285 DPARLPAALVERVETILKGIRWTTRDVENFLGSYLSEPKSNVVFDAPERPLGEAAFVAQA 344 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH-RPALDALASNIALTAENFGDALEDPS 355 ++G L R +L + NGE+ L LA L A+ F P Sbjct: 345 SRRGIRLDRKAA--LLYNARSYFINGEENPLAGNAKWLPELADRRHLGAKRFVTYSRHPL 402 Query: 356 FLAMLAALVNSGYW 369 A+L +G+ Sbjct: 403 MTALLHEWYCAGWI 416 >UniRef50_C4K8V5 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K8V5_HAMD5 Length = 379 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. 
Identities = 154/371 (41%), Positives = 220/371 (59%), Gaps = 7/371 (1%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 L +NW DFL+ +WQK P++LK+ NFI+P+SPDELA L +E ++S+L+ +GK QV Sbjct: 3 LMINWQDFLQHYWQKHPMLLKQAVVNFINPVSPDELAKLVIEKALESQLIKKVNGKCQVV 62 Query: 65 HGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 H F Y LG NWSL VQA+NHWH P M FR PDW +DL +SFSVPGGG+G Sbjct: 63 HNVFNGYKSLGRHNWSLKVQAINHWHRPAEEFMYLFRTFPDWYREDLTVSFSVPGGGLGL 122 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 + DVFIIQG GR RWR+ L H + F +II+EEL GD LYIP G Sbjct: 123 YAKTSDVFIIQGIGRSRWRIWNPLSSVVHYDQKNF-----FPSIINEELVSGDALYIPKG 177 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS-DPDVPPRAHPADVL 243 FPHE + E A++Y + N+ +I + + + + G YS PD+ R P ++L Sbjct: 178 FPHEAISSETALSYCINLWTDNSLRMIRNWTESLSDKNHRGIEYSPSPDLLMRDDPTEIL 237 Query: 244 PQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 PQ++ ++ +M + L+ Q + + WFG+ +SQS ++L +AP YQP ++ L+Q Sbjct: 238 PQDITAIQNIMNQFLLQQRDDLETWFGQQMSQSSYDLPMAPAAQVYQPSQVQSILQQDIS 297 Query: 303 LVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAA 362 L RL GLR+L IGD + NGE + S + A + +A N + ++D F+ L Sbjct: 298 LCRLMGLRMLHIGDRYFLNGESLASNYADAWNIMAHNTTINGYMLRKFIDDNDFMTQLTL 357 Query: 363 LVNSGYWFFEG 373 L+N GYW+F+G Sbjct: 358 LINKGYWYFQG 368 >UniRef50_D2UDU1 Putative uncharacterized protein n=1 Tax=Xanthomonas albilineans RepID=D2UDU1_XANAL Length = 415 Score = 323 bits (827), Expect = 9e-87, Method: Composition-based stats. 
Identities = 133/382 (34%), Positives = 200/382 (52%), Gaps = 18/382 (4%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--G 59 +Y L ++ FL +WQKRP++++ F +F+ PI PD+LAGLA E SRLV H Sbjct: 23 QYPLGMSAASFLRDYWQKRPLLIRNAFPDFVSPIEPDDLAGLACEEAALSRLVIHDRATD 82 Query: 60 KWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 +W + +GPF+ + + + +W+LLVQ V+ W AL+ FR LP WR+DD+M+SF+ Sbjct: 83 RWSLRNGPFQEHEFPGMPDHDWTLLVQDVDKWDPDIRALLGQFRFLPRWRVDDVMVSFAA 142 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEAIIDE 171 GG VG H+D YDVF++Q GRRRW++ G + +L + F D Sbjct: 143 RGGSVGAHVDHYDVFLLQAHGRRRWQIDASASMGRPPPPTEFREDVELKLLRQFAPTHDW 202 Query: 172 ELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDP 231 LEPGD+LY+PP PH G A + + +SVG RAP++ ELI+ + D ++ Y D Sbjct: 203 VLEPGDMLYLPPMVPHHGVAEDACLTFSVGMRAPSSAELIADYLDTLIDGADEALRYHDE 262 Query: 232 DVPPRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPYQP 290 D+ P ++ M ++ E + L +N P+ WFG FI+ R +I PP Sbjct: 263 DLLAPTDPHEIDAAAMGRVVEALNALRMNDPDRLGAWFGRFITTYRAGGEILPPSNLPPV 322 Query: 291 DEIYDALKQGEVLVRLGGLRVLR----IGDDVYANGEKIDSPHRPALDALASNIALTAEN 346 +E AL QG VL R R+ G ++ NG + P + A LA+ L A + Sbjct: 323 EETAAALAQGLVLQRHPWARLAWRRASRGAMLFCNGMEFALPIQDA-KRLAAAEHLDATD 381 Query: 347 FGDALEDPSFLAMLAALVNSGY 368 + + L L+ SG+ Sbjct: 382 YAAL--SATGRQTLLQLIQSGF 401 >UniRef50_B8GSM7 Cupin 4 family protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GSM7_THISH Length = 397 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. 
Identities = 120/383 (31%), Positives = 195/383 (50%), Gaps = 14/383 (3%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-- 58 + L FL +WQ++P+++++ F P+SP+ELAGLA E V SRLV + Sbjct: 9 LTLLGGLTARAFLRDYWQQKPLLVRQAIPGFESPLSPEELAGLACEEGVISRLVRERGET 68 Query: 59 GKWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 G W + GPF+ + L E++W+LLV + A + PFR +PDWR+DDLM+S++ Sbjct: 69 GSWALRTGPFDEDDFTTLPESHWTLLVSDMEKHLPELRAYLEPFRFIPDWRMDDLMVSYA 128 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEP 175 P G VGPH+D+YDVF++Q GRRRW++ + P +L + F+ + LEP Sbjct: 129 APEGSVGPHVDEYDVFLLQAQGRRRWQIARQAVSGDDFLPGVELRILRDFQPDQEWILEP 188 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+LY+PP PH G A+ M +SVGFRAP R+L++ + D + + Y+DP + P Sbjct: 189 GDMLYLPPRIPHHGVAVGPCMTWSVGFRAPAWRDLMAAWVDQRYEALAPQDRYADPGLEP 248 Query: 236 RAHPADVLPQEMDKLREMMLELIN-QPEHFKQWFGEFISQSRHE-LDIAPPEPPYQPDEI 293 + +P ++ + +L + + +W G +++ + E L+ DE Sbjct: 249 QDNPGELSAAALARLIAGLRRAMAVDDAELARWLGTVLTEPKAELLEHMQLPETLTRDEA 308 Query: 294 YDALKQGEVLVRLGGLRVLRIGDD----VYANGEKI--DSPHRPALDALASNIALTAENF 347 L+ G L R G R+ + D ++ NG++ P + L + A + Sbjct: 309 LGLLQDGVSLERHGAARLAWMSDHGGLRLFVNGQEHLLPEAAGPLVRHLCAETAYDGKAL 368 Query: 348 -GDALEDPSFLAMLAALVNSGYW 369 G A S +L +L +G Sbjct: 369 WGLASGIDSAEDLLMSLCIAGIL 391 >UniRef50_D1RFR4 Cupin superfamily protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RFR4_LEGLO Length = 396 Score = 322 bits (825), Expect = 1e-86, Method: Composition-based stats. 
Identities = 118/386 (30%), Positives = 195/386 (50%), Gaps = 20/386 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH---QDGKWQV 63 ++ FL +WQK+P+V+++ F P+SPDELAGLA+E +V+SRLV + W + Sbjct: 7 ISLNTFLGDYWQKKPLVIRKALPEFTHPLSPDELAGLALEEDVESRLVFETPDEKPYWHL 66 Query: 64 SHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF + L T+W+LLVQ V+ AL+ F +P WRIDD+MIS++V G Sbjct: 67 KRGPFSVNDFSTLPSTHWTLLVQGVDRLIPEVYALLDYFNFIPQWRIDDIMISYAVLHGS 126 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF+ Q G+R W + K + + +L + F+ LE GD+LY Sbjct: 127 VGPHYDNYDVFLYQAKGKREWSLTTKGCNNQNYMKGLELRIMSQFDVEERFILEEGDMLY 186 Query: 181 IPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +PP H G +L + M YS G+R+ +EL+ F+DY+ ++ L N Y DPD + Sbjct: 187 LPPHVGHHGISLSDECMTYSFGYRSYQGQELLESFSDYLSEKGLFKNLYQDPDWSNLQNT 246 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPD-----EIY 294 +++ P ++++ ++IN + + WFG F ++ + ++ P P + + + Sbjct: 247 SEIPPSAWLNAQKLLQQVINDEKTMQTWFGCFATRLDQQAELQLPVPLEEDELIDISDFI 306 Query: 295 DALKQGEVLVRLGGLRVLRI------GDDVYANGEKIDS--PHRPALDALASNIALTAEN 346 +K+G L+R R + NG D+ ++ L +A+N L+ + Sbjct: 307 KEIKEGLNLIRDASCRFAYQNQNEQSEYQFFINGSAWDAKGVNKDLLHYIANNRYLSYKV 366 Query: 347 FGDALEDPSFLAMLAALVNSGYWFFE 372 L ++ L + FE Sbjct: 367 LTTYLNTKKNQLLIYNLWKLQWLQFE 392 >UniRef50_D0L0L5 Cupin 4 family protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0L5_HALNC Length = 397 Score = 319 bits (818), Expect = 9e-86, Method: Composition-based stats. 
Identities = 125/387 (32%), Positives = 194/387 (50%), Gaps = 16/387 (4%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG-- 59 + TL+ DFL +WQK+PV++++G F P+SP+ELAGLA E +V +RL+ G Sbjct: 5 QVLGTLSVADFLRDYWQKKPVLIRQGVPGFESPLSPEELAGLACEEDVPARLILESAGAR 64 Query: 60 KWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 W + HGPF + L E +SLL+ L++ FR +PDWRIDDLMIS++ Sbjct: 65 PWTLRHGPFTEADFTSLPEDGYSLLITDCEKLIPDLMNLVQHFRFVPDWRIDDLMISYAP 124 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 PGG VG H+D+YDVF++QG GRR+W + + P D+ + FE + LEPGD Sbjct: 125 PGGSVGAHIDEYDVFLLQGMGRRKWMIEYPPKHSDFVPDLDIRLLQEFEPTEEWVLEPGD 184 Query: 178 ILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 +LY+PPG PH G A+++ M YS+GFRAP E+ +G D ++ Y DPD+ A Sbjct: 185 MLYLPPGVPHHGVAVDHCMTYSIGFRAPLLHEMAAGVTDRLITDMDQAARYGDPDLQAPA 244 Query: 238 HPADVLPQEMDKLREMMLELINQPEH-FKQWFGE-FISQSRHELDIAPPEPPYQPDEIYD 295 +P + KLR ++ +++Q + ++ E + P P + Sbjct: 245 NPGALDASSRVKLRAILQSVLDQDDAVLDRFIAETLTERPLDHAGFYPQNDPLDAKALRG 304 Query: 296 ALKQ-GEVLVRLGGLRVLRIGDDVYANGEKIDSPHR---------PALDALASNIALTAE 345 + G+ L+R R+L + D+ + G + + P L S + A Sbjct: 305 EIAHSGDTLMRTPAARLLLVEDEPDSAGGALAVDGQSTLLNAEMLPLARLLVSQVFYDAA 364 Query: 346 NFGDALEDPSFLAMLAALVNSGYWFFE 372 A E + +L L G ++ Sbjct: 365 ELLAATESEAAAELLQKLYADGVVQWQ 391 >UniRef50_C5BU83 Cupin 4 family protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BU83_TERTT Length = 385 Score = 318 bits (816), Expect = 2e-85, Method: Composition-based stats. 
Identities = 117/378 (30%), Positives = 195/378 (51%), Gaps = 17/378 (4%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG- 59 ++ T+ + FL +WQK+P+++++ F +F P+S DELAGLA+E +V SRLV +D Sbjct: 11 LQQLGTITFEQFLNEYWQKKPLLIRQAFPDFEAPVSADELAGLALEDDVVSRLVVQRDES 70 Query: 60 KWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 WQV HGP E + L E++W+LLVQ + AL+ FR +P+WR+DD+MIS++ Sbjct: 71 DWQVEHGPLLEERFAQLPESHWTLLVQHADALDPAINALLDAFRFIPNWRLDDIMISYAA 130 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPG 176 GGVGPH D YDVF++Q G+RRWR+G++ P D+ + F+ + D +EPG Sbjct: 131 DKGGVGPHFDYYDVFLLQAQGKRRWRIGQRCSHESPLLPAADMKILQDFDTVEDWIVEPG 190 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 D+LYIPP H G A M YS+GFRAP+ E++ F++ + Y DP + P+ Sbjct: 191 DLLYIPPNIAHWGEADGECMTYSIGFRAPSHAEVLLDFSEEMASFTNPDMRYMDPGLRPQ 250 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 P ++ Q +++++ ++ + WFGE++++ D +E Sbjct: 251 QLPGEISQQSIEQVQAIIHQYSTDKAALAGWFGEYMTRPNPTADAHFQ---TFDEEFDRN 307 Query: 297 LKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSPHRPALDALASNIALTAENFGDALE 352 L + R + V+ NG K R L++ + ++ Sbjct: 308 LMEAGQARLSRFARCAFFEEQAGCLVFINGAKWHC-SRKLAVMLSNYEPIHWDSL----- 361 Query: 353 DPSFLAMLAALVNSGYWF 370 D ++ + ++G+ Sbjct: 362 DTLDRTVVVQIADAGFLI 379 >UniRef50_C1DCJ3 Cupin region n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCJ3_LARHH Length = 380 Score = 316 bits (811), Expect = 6e-85, Method: Composition-based stats. 
Identities = 121/370 (32%), Positives = 198/370 (53%), Gaps = 12/370 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGKWQVS 64 L +FL +W K+P++++ + P + LA LA +V+SRL+ ++ G+W V Sbjct: 11 GLTAREFLRDYWHKQPLLIRGALRDVGTPADFEVLAELARRDDVESRLIENRAGGRWHVE 70 Query: 65 HGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 HGPF+ L ET+W+LLVQ+VNH + ++ F LP R+DDLMIS++ PGG V Sbjct: 71 HGPFQPARLARLPETDWTLLVQSVNHHLPHVSDILWRFNFLPYARLDDLMISYAPPGGTV 130 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 GPH D YDVF++Q G++RW+VG + + + F+A+ ELE GD+LY+P Sbjct: 131 GPHFDSYDVFLLQVGGKKRWQVGSP-DNDRLEDGAPIKVLSSFDALQSWELEQGDMLYLP 189 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 P F H G ALE M YS+GFRAP T+EL F Y+ Y+DPD+ P HPA++ Sbjct: 190 PKFSHYGVALEPGMTYSIGFRAPTTQELAEQFLTYLQDTLCLDGRYADPDLEPPRHPAEI 249 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 ++++++M+ + + ++ G ++++ ++ + PPE P DE + + + Sbjct: 250 SESMVEQVQDMLKAIRWDRDGVGEFLGCYLTEPKNHVFFDPPEDPLDEDEFAKVILRDGL 309 Query: 303 LVRLGGLRVLRIGDDVYANGEKIDS--PHRPALDALASNIALTAENFGDALEDPSFLAML 360 ++ L ++L + NGE P LA+ L + D + + + L Sbjct: 310 VLDLK-SQMLFRNSLCFVNGEIHAGMDGDLPVWRELANQRRLAGQAISDGMTETLYAGYL 368 Query: 361 AALVNSGYWF 370 SG+W+ Sbjct: 369 -----SGWWW 373 >UniRef50_Q21K45 Cupin 4 n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21K45_SACD2 Length = 383 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. 
Identities = 110/368 (29%), Positives = 188/368 (51%), Gaps = 15/368 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ------DG 59 + FL +WQK+P+++++ NF P+S DELAGL +E +V SRL++ + Sbjct: 13 DMPIETFLRDYWQKKPLLIRQALPNFESPLSADELAGLCLEDDVISRLITETPQSSPFNS 72 Query: 60 KWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 +W V+HGP + ++ L E WSLLVQ V+ L+ FR +P+WR+DD+MIS++ Sbjct: 73 EWNVTHGPLPEDIFETLPENYWSLLVQHVDQLSPEVNQLLNLFRFIPNWRLDDVMISYAP 132 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPG 176 GGVGPH D YDVF++QG G+RRWR+G++ + + + F+ D L PG Sbjct: 133 DKGGVGPHFDYYDVFLLQGHGQRRWRLGQQCTSKSPMLANAPMKVLTEFDVQEDWVLNPG 192 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 DILY+PPG H G A+ ++ YSVGFRAP+ ++++ F+ V + N Y D + Sbjct: 193 DILYVPPGLAHWGTAVGESITYSVGFRAPSHQDIVLDFSQEVASKIEEDNRYQDQFLTAN 252 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 + ++ +++L+ ++ + + QW G+ ++Q + E +P+E+ + Sbjct: 253 KNAGEITGDAIEQLKHILQTYMQDEQALAQWLGKSMTQLNPG-MVDEAENTIEPEEMANT 311 Query: 297 LKQGEVLVRLGGLRV-LRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGD---ALE 352 R R ++ NGE +AL++ + + + + + Sbjct: 312 PFTLSPFARATFYRADSNSEAYIFINGEVYSGSLE-LANALSNYLPIDWLSCSETDKLIL 370 Query: 353 DPSFLAML 360 D L Sbjct: 371 DTLAQQYL 378 >UniRef50_A1K4G1 Putative uncharacterized protein n=1 Tax=Azoarcus sp. BH72 RepID=A1K4G1_AZOSB Length = 371 Score = 313 bits (802), Expect = 7e-84, Method: Composition-based stats. 
Identities = 111/369 (30%), Positives = 189/369 (51%), Gaps = 11/369 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + FL+ +WQK+P+++++ F + +++ LA + +V+SR + +G W+++ Sbjct: 8 GMTPRQFLQEYWQKKPLLVRQAVPGFTGVLGREDIFDLACDPDVESRHIRLHEGNWELNR 67 Query: 66 GPFESYDHLG-ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 GP G + W++LVQ +N W E L+ F +P R+DDLM+S++V GGGVGP Sbjct: 68 GPQTRARLRGKRSPWTVLVQGINLWSEAADELLHRFNFIPQARLDDLMVSYAVDGGGVGP 127 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 H D YDVF++QG G+RRW++ ++ + L + F D LEPGD+LY+PP Sbjct: 128 HFDNYDVFLLQGQGQRRWQIADQDD-RSLVEGAPLRILRNFVPAHDWILEPGDMLYLPPH 186 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 + H G A+ YS+GFR+P +EL + F ++ +R YSDPD+ + + A + Sbjct: 187 WAHNGIAIGECTTYSIGFRSPTAQELGAEFLGWLQERVCLDGLYSDPDLTEQDNSALIGD 246 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 +D+++ ++ + + G ++++ + + PPE P AL G L Sbjct: 247 AMIDQVQRVIEGIRWSRADVAAFLGHYLTEPKPTVFFEPPEEPIPLKAFRRALGAGG-LR 305 Query: 305 RLGGLRVLRIGDDVYANGEKIDS--PHRPALDALASNIALTA-ENFGDALEDPSFLAMLA 361 +LR + + NGE +DS + ALD LA LT + AL+D + Sbjct: 306 LDARTLLLRSQGNFFLNGEAVDSVPAWQQALDTLAHARRLTGCADLPAALQD-----LFY 360 Query: 362 ALVNSGYWF 370 G+ Sbjct: 361 EWYCDGFAH 369 >UniRef50_C0N3X6 Cupin superfamily protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N3X6_9GAMM Length = 389 Score = 311 bits (796), Expect = 4e-83, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 197/385 (51%), Gaps = 15/385 (3%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--GK 60 + L FL + WQK+P+++++ + +S +ELAGLA E++++SRL+ Q G Sbjct: 5 FNTELTQQQFLTQFWQKKPLLIRQAWPQMDALLSAEELAGLACEADIESRLIQEQGELGP 64 Query: 61 WQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 WQV+ GPF + L ++W+LLVQ V+ +M F +PDWR DDLM+SF+ Sbjct: 65 WQVNDGPFTEADFAKLPASHWTLLVQDVDKHVPELTEVMAKFDFIPDWRRDDLMVSFAPE 124 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 GG VGPH D YDVF++Q G RRW + + + + +L + F+A +L+PGD Sbjct: 125 GGSVGPHTDGYDVFLLQAQGTRRWAISQTPVVEAEFIDGLELKILKQFDADDVWDLQPGD 184 Query: 178 ILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 +LY+PP F H G AL + M +S+GFRAP EL+ F + + E+G Y DP++ Sbjct: 185 MLYLPPHFAHHGVALNDCMTFSIGFRAPTQLELLDAFMHSLSEHEVGQQRYRDPELKVCD 244 Query: 238 HPADVLPQEMDKLREMMLELINQPEH-FKQWFGEFISQSRHELDIAPPEPPYQPDEI--Y 294 + + + ++ +++ I + G +++++ L++ E D + Sbjct: 245 DDKYIDRSALRRFKQSLIKCIEDSDDVLLDAVGRLLTETKPSLELLADELIADSDNVSLA 304 Query: 295 DALKQGEVLVRLGGLRVLRIGD----DVYANGEKI--DSPHRPALDALASNIALTAENFG 348 + QGE L R +R+ + ++A GE D R + L + A ++ Sbjct: 305 EYFSQGEQLHRNPYIRIAWAENEESVQLFAAGETYQADKAVRSIMPILTGTEPIQALHWT 364 Query: 349 DALEDPSFLAMLAALVNSGYWFFEG 373 ++ + +L LV G W+++ Sbjct: 365 Q-IQSAAATNLLEELVAIGCWYWQS 388 >UniRef50_B7RUZ0 Cupin superfamily protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RUZ0_9GAMM Length = 377 Score = 309 bits (793), Expect = 7e-83, Method: Composition-based stats. 
Identities = 129/378 (34%), Positives = 193/378 (51%), Gaps = 13/378 (3%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 +++L L+ FL +HWQK P++++ NF PIS DELAGLA E EV++R+V HQ+ W Sbjct: 4 DWELNLDKEQFLAQHWQKAPLLIRGAIKNFKPPISSDELAGLAYEEEVEARIVEHQEDNW 63 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 Q+ HGPF + D+ + W+LLVQAV+ + A L + +P WR+DD+M S++ GG Sbjct: 64 QLFHGPFSATDYQRKHPWTLLVQAVDQYIPEVAQLRKLVDFIPQWRVDDVMASYASDGGS 123 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF++QG G R W+ G+ H L + F + LEPGDILY Sbjct: 124 VGPHFDNYDVFLLQGEGHRLWKTGQFCDSSSPLVDHDSLRLLSQFNTEAEYLLEPGDILY 183 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 +PPG H G A +S+GFRAP E++S F D ++++ +YSD + P Sbjct: 184 VPPGIAHWGTAQGECTTFSIGFRAPRITEMVSRFTDALIEQLDPDLFYSDARIEVATRPG 243 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ P+++D++ + ++Q E WFGE ++ R+E E E L G Sbjct: 244 EIRPRDLDRVSAQIQAALDQSEG-NHWFGELATEPRYEQFPDEGELS----EARSQLSDG 298 Query: 301 EV-LVRLGGLRVLRIGDD----VYANGE--KIDSPHRPALDALASNIALTAENFGDALED 353 + ++ + V+ANG+ P AL+ L A D Sbjct: 299 ANGIELNSAAKLAWQHEAGRVVVFANGDSRSFSESIMPLQIALSDAWKLDKAELAAASAD 358 Query: 354 PSFLAMLAALVNSGYWFF 371 P L L+ SG F Sbjct: 359 PESSGWLDYLLESGCVFI 376 >UniRef50_Q5WVF0 Putative uncharacterized protein n=4 Tax=Legionella pneumophila RepID=Q5WVF0_LEGPL Length = 395 Score = 309 bits (792), Expect = 1e-82, Method: Composition-based stats. 
Identities = 117/385 (30%), Positives = 185/385 (48%), Gaps = 20/385 (5%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH---QDGKWQV 63 + FL+ +WQK+P+++++ +F +P++PDELAGLA+E E++SRLV Q +W + Sbjct: 7 MTVQTFLKDYWQKKPLIIRQALPDFTNPLTPDELAGLALEEEIESRLVYETPDQSPQWNL 66 Query: 64 SHGPFESYD--HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF+ D L +T+W+LLVQ V+ L+ F +P WR+DD+MIS++ G Sbjct: 67 KRGPFKESDLIGLPKTHWTLLVQGVDRIVPDVYELLDHFNFIPQWRVDDVMISYATLHGS 126 Query: 122 VGPHLDQYDVFIIQGTGRRRWRV-GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH D YDVF+ Q G+R W + +K +L ++ FE LE GD+LY Sbjct: 127 VGPHYDNYDVFLYQAKGQRLWSLTSKKCHTNNFIKGLELRIMNEFEVEEQFILEEGDMLY 186 Query: 181 IPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHP 239 +PP H G A E M YS G+R+ +EL DY+ + L + Y DPD + Sbjct: 187 LPPHIGHYGIAQSEECMTYSFGYRSYQGQELWDSLGDYLSEHGLFKSLYQDPDWSTLKNT 246 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP-----DEIY 294 +++ P+ R+++ +++ + WFG F + + P P + DE Sbjct: 247 SEITPKAWSNARQLLRQVLENDQLMHSWFGCFATSLDQSAEQYLPPPLEEDELLGLDEFI 306 Query: 295 DALKQGEVLVRLGGLRVLRI------GDDVYANGEKIDS--PHRPALDALASNIALTAEN 346 L + +VR R I Y NG++ DS L +A+N L + Sbjct: 307 KELSNYQEIVRDASCRFAYIMSDQESQCHFYVNGKEWDSRGVSTNLLSFVANNRFLPLKE 366 Query: 347 FGDALEDPSFLAMLAALVNSGYWFF 371 L + L L + Sbjct: 367 LKPYLNHKTNQLFLYELWKLQWLQI 391 >UniRef50_A6W0E5 Cupin 4 family protein n=2 Tax=Marinomonas RepID=A6W0E5_MARMS Length = 400 Score = 309 bits (792), Expect = 1e-82, Method: Composition-based stats. 
Identities = 123/384 (32%), Positives = 209/384 (54%), Gaps = 20/384 (5%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-G 59 + + F++ +WQK+P++++ G NF P+ DELAG+AME E++SR+V Sbjct: 8 LAILGGMTAQTFIDEYWQKKPLLIRGGLVNFTLPLEADELAGMAMEEEIESRIVIENGLR 67 Query: 60 KWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 W++ GPF +++ L E W+LLVQAV+HW L F LP WR+DD+M+S++ Sbjct: 68 PWEMRQGPFTEDTFATLPEKEWTLLVQAVDHWVPEVQTLKEKFEFLPSWRLDDVMVSYAT 127 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ-HCPHPDLLQVDPFE--AIIDEELE 174 GG VGPH DQYDVF++Q +G+RRW+V + + P+ L +D F +D EL+ Sbjct: 128 EGGSVGPHYDQYDVFLVQVSGKRRWQVLSPDEYQDSAIPNIKLHILDNFPVNPEMDWELD 187 Query: 175 PGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 GDILY+PP F H G +L++ M YS+GFRAP+ +++++G D + + E + ++ P+ Sbjct: 188 AGDILYLPPNFAHNGRSLDDECMTYSIGFRAPSMQDILTGVRDKLCETENVKDRFAAPET 247 Query: 234 PPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 R H A + ++ L+ + LINQP+ +W GE +S+S++ +AP + +E Sbjct: 248 ANRQHSAHISKDDIQYLQTQLARLINQPDLLAEWLGETMSESKYPEYLAPLNHE-EVNEA 306 Query: 294 YDALKQGEVLVRLGGLRVLRIGD---------DVYANGEK--IDSPHRPALDALASNIAL 342 + + QG+ +R G R+ +V+ NGE +D ++A+ + Sbjct: 307 FSSATQGQTFIRPGDARICYYIQQSTENNGKINVFCNGEHLLVDEELTSFVEAVCHQVEF 366 Query: 343 TAENFGDALEDPSFLAMLAALVNS 366 D ++ ++ + Sbjct: 367 DFSGL-DLKQNTDLEPLVRFFIRQ 389 >UniRef50_Q5QZ10 Cupin superfamily protein n=2 Tax=Idiomarina RepID=Q5QZ10_IDILO Length = 380 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. 
Identities = 152/381 (39%), Positives = 212/381 (55%), Gaps = 18/381 (4%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVS---HQDGKW 61 L + DFL +WQK+P ++++GF +F DP+SP+ LAGLAME DSR++ + W Sbjct: 3 LVFDKDDFLTNYWQKKPCLIRQGFADFSDPVSPEILAGLAMEEGADSRVIESKADTESGW 62 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 V+HGPFE Y+ GET+W+LLVQ+VN W L+ PFR LPDWRIDD+M+SFS GG Sbjct: 63 LVTHGPFEDYEKFGETDWTLLVQSVNEWLPDVGELITPFRFLPDWRIDDVMVSFSCENGG 122 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD-PFEAIIDEELEPGDILY 180 VGPHLDQYDVFIIQG G R WRVGEK M+++ P DL Q+ F A+I+E L GD+LY Sbjct: 123 VGPHLDQYDVFIIQGAGSRHWRVGEKQAMQEYQPAEDLCQIKGEFNAVINEHLTAGDVLY 182 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IP G PH+G +LE ++NYSVGFRAP+ EL+ D +Q++ Y DP + Sbjct: 183 IPAGCPHDGISLEPSLNYSVGFRAPSKAELLLQLGDIAMQQKSLQERYQDPALSSEDVSW 242 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 + ++ L++ + + + E + ISQS+ L PE P ++I L Q Sbjct: 243 VIEKTQLSALKQFLKDALESDET-DALLAKIISQSKRPLP--EPELPTIAEQIPTLLAQQ 299 Query: 301 EVLV-RLGGLRVL-RIGDDVYANGEKI-----DSPHRPALDALASNIALTAENFGDALED 353 + + G R L Y NGE P L L + A E + Sbjct: 300 NAFIEKTSGARFLKLSDTQFYGNGEAFHVIQEALPTAEWLAQLQGSEAT--EELAKLAKS 357 Query: 354 PSFLAMLAALVNSG--YWFFE 372 + ++A +N G Y + + Sbjct: 358 VAGCELIAETINQGIIYLYID 378 >UniRef50_A6GQ27 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GQ27_9BURK Length = 383 Score = 308 bits (789), Expect = 3e-82, Method: Composition-based stats. 
Identities = 101/371 (27%), Positives = 180/371 (48%), Gaps = 10/371 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L+ F+ HW +P + ++ F NF D +A +A + +++SRL+ H W + H Sbjct: 10 NLSVEKFMTEHWHIKPYLFRQAFPNFEPLCDFDTIAEMASDEDIESRLIQHSKTGWTLEH 69 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 GPF+ + + W++L+Q ++H L++ FR +PD R+DD+M+S + GGGVGPH Sbjct: 70 GPFDELPSMKKKAWTVLIQGIDHHLPEAYDLLQLFRFIPDARLDDVMLSLASDGGGVGPH 129 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 D YDVF++Q G+RRW++G L K+ L + FE + LEPGD+LY+PP + Sbjct: 130 YDSYDVFLLQMHGKRRWKIG-PLLDKELEEGLPLKILKNFEPTEEFVLEPGDMLYLPPNY 188 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGG-----NYYSDPDVPPRAHPA 240 H+G A + S+GFRAP E++SG + + +SDP + +PA Sbjct: 189 GHDGIAEGSCSTLSIGFRAPTQAEVLSGILRDMADQIDQDPTKTQTLFSDPARGLQKNPA 248 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ ++ ++ + Q ++ G +++ + + + EI L + Sbjct: 249 EIPDDLLNFGINLIQQFSAQSPQIQRSMGILLTEPKSHVYFVNNTEDQEIHEIISVLGE- 307 Query: 301 EVLVRLGGLRVLRIGDDVYANGEKIDSPHR---PALDALASNIALTAENFGDALEDPSFL 357 + ++L Y NG+ ++ L LA+ + + +AL +P F Sbjct: 308 RGIALSMKTKMLFKDAVFYINGDAVNPTSALTVKQLQMLANQREMEPIDAAEALNNPEFQ 367 Query: 358 AMLAALVNSGY 368 L +G+ Sbjct: 368 YFLVGFAKAGW 378 >UniRef50_UPI0000E0F5AA putative enzyme with RmlC-like domain n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F5AA Length = 381 Score = 307 bits (787), Expect = 3e-82, Method: Composition-based stats. 
Identities = 137/376 (36%), Positives = 206/376 (54%), Gaps = 14/376 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + FL +WQ++P VL NF DP+ +LAGLA E ++DSR++S DG W+V+ Sbjct: 6 AFSIKHFLAENWQRKPCVLHNALPNFEDPLDEHDLAGLAQEQDIDSRVISQMDGDWKVTE 65 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 GPF ++ + + W+LLVQ V+ E + LM F +P WR+DDL++S+S PG GVG H Sbjct: 66 GPFTEFEDVCKGAWTLLVQGVDTHIESASLLMNAFNFIPHWRMDDLLVSYSQPGAGVGAH 125 Query: 126 LDQYDVFIIQGTGRRRWRVGEKL-QMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPG 184 +DQYDVFI+QG G RRW+VG+K + ++ PHP L Q+D FE IID EL PGDILYIPPG Sbjct: 126 IDQYDVFIVQGKGTRRWQVGDKSMKYAKYYPHPKLQQIDEFEPIIDVELLPGDILYIPPG 185 Query: 185 FPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 FPH+G ++ MNYSVGFRAP+ EL AD +L + + D + P+ + P Sbjct: 186 FPHKGQSITECMNYSVGFRAPDQTELFQAIADDLLDSDKLTRRFIDRNRTYIDRPSAISP 245 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 +++ L++++ + + + Q + +S+ LD P E P I + QG L Sbjct: 246 KDIMLLKQLLQQYVTNTQ-IDQVLTQHLSKQNEYLDAFPLETPLPSSYILALINQGVTLQ 304 Query: 305 RLGGLRVLRIGDD------VYANGEKIDSPHRPAL---DALASNIALTAENFGDALEDPS 355 G+R + + + NG K + L L ++ +A Sbjct: 305 LACGVRPVYLDYQVDDEFIFFINGHKFSTSATARLETSRLLDNHQTFIKF---NAELTHD 361 Query: 356 FLAMLAALVNSGYWFF 371 ++ ++ L+N GY Sbjct: 362 WIELIRELINLGYLEI 377 >UniRef50_C3M8B3 Putative uncharacterized protein n=3 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C3M8B3_HAMD5 Length = 367 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. 
Identities = 146/370 (39%), Positives = 215/370 (58%), Gaps = 6/370 (1%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 L +NW DFL HWQKRPV+LK+ ++F++PISP+EL L ++ ++ +L+ GK Q+ Sbjct: 2 HLIINWEDFLHHHWQKRPVLLKQSISDFVNPISPEELETLVIKKALECQLIQRSHGKCQL 61 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 + F Y LG+ NWSL V+A++H H + FR PDW ++L FSVPGGG+G Sbjct: 62 GYQAFNGYGSLGQRNWSLRVEALHHCHRAAEEFLSLFRIFPDWYTEELTTFFSVPGGGIG 121 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 P DV +IQG G RWRVG++ P D Q D EA++DEEL GD+LYIP Sbjct: 122 PQTRPSDVLVIQGMGSSRWRVGDRGAS----PAFDYGQNDFSEAMVDEELSAGDMLYIPK 177 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYS-DPDVPPRAHPADV 242 FPHE + E AM+Y + F N+ +I + + + G Y+ PD+ R P ++ Sbjct: 178 VFPHEATSTEAAMSYCLNFWTDNSLRMIRNWTESLSDENHRGIEYAPSPDLLLRDDPTEI 237 Query: 243 LPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 LPQ++ L+EMM + L+ + EH + WF + +SQ+ +EL AP Y ++ L++G Sbjct: 238 LPQDITALQEMMSQFLLKKREHLENWFAQEMSQTSYELPKAPAAKVYSVSQVQTLLQKGS 297 Query: 302 VLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLA 361 L RL GLR+L IG+ + NGE +DS H A + LA + + + + FLA L Sbjct: 298 RLNRLMGLRMLHIGNRYFVNGESLDSDHADAWNVLARHRTIEGPMLIKFINEADFLAELT 357 Query: 362 ALVNSGYWFF 371 ++N GYW+F Sbjct: 358 LIINKGYWYF 367 >UniRef50_Q7NS46 Putative uncharacterized protein n=1 Tax=Chromobacterium violaceum RepID=Q7NS46_CHRVO Length = 377 Score = 306 bits (783), Expect = 1e-81, Method: Composition-based stats. 
Identities = 108/372 (29%), Positives = 182/372 (48%), Gaps = 11/372 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + FL +W K+P++++ + + L+ LA + +SRL+ ++ KW + Sbjct: 9 GMPPEQFLAEYWHKKPLLIRGALTDVGPHVDFSVLSELAQRDDAESRLIEYKKDKWHLER 68 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + L ET+W+LLVQ VNH ++ F +P R+DDLMIS++ PGG VG Sbjct: 69 GPFRASRFRRLAETDWTLLVQGVNHHLPHIDDILWRFNFIPYARLDDLMISYAPPGGTVG 128 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q G++RW++ + + + F + LE GD+LY+PP Sbjct: 129 PHFDAYDVFLLQVGGKKRWQISSQHD-DDFIEDAPIRVLKDFRMEQEFVLEHGDMLYLPP 187 Query: 184 GFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVL 243 H G ALE M YS+GFRAP +EL + F Y+ R Y+DPD+ +A PA + Sbjct: 188 HCAHYGVALEPGMTYSIGFRAPPAQELAAQFLVYLQDRVCIDGVYADPDLKLQADPAKIG 247 Query: 244 PQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL 303 + +D++ ++ ++ + + G ++++ + + PE + +A+ + L Sbjct: 248 GEMIDQVAGLLSKIRWDKDTVCDFLGHYLTEPKAHVFYDSPEDELDEEAFAEAVAE-RGL 306 Query: 304 VRLGGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAENFGDALEDPSFLAMLA 361 ++L VY NGEK+D+ + L A ++ + D + L Sbjct: 307 ELDRKSQILYCDACVYCNGEKVDAADGDFADWQHFGNRRRLPAGSYSADMIDALYDGYL- 365 Query: 362 ALVNSGYWFFEG 373 SGYW Sbjct: 366 ----SGYWHLSS 373 >UniRef50_Q15T89 Cupin 4 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15T89_PSEA6 Length = 389 Score = 306 bits (783), Expect = 1e-81, Method: Composition-based stats. 
Identities = 148/378 (39%), Positives = 215/378 (56%), Gaps = 15/378 (3%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 + FL+ HWQKRPVV K F+ F+DP+ +ELAGLA + +DSR+VS ++ W V H Sbjct: 6 NFDPTLFLDSHWQKRPVVFKGAFSQFVDPLDENELAGLAQDPRIDSRIVSSENANWHVQH 65 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125 GP ++H + +WSLLVQ+V+ + AL+R F +P WR+DDLM+SFS G GVGPH Sbjct: 66 GPISDFEHACQGSWSLLVQSVDQHVDEADALIRMFNFIPYWRLDDLMVSFSNTGAGVGPH 125 Query: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 LDQYDVFIIQG G RRW+ G++ + + PHPDL Q+ F IIDE L GD+LYIP G Sbjct: 126 LDQYDVFIIQGKGSRRWQAGKRGEYSTYHPHPDLSQIQGFTPIIDEVLHSGDMLYIPAGC 185 Query: 186 PHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 PH G ALE+ MNYSVGFRAP ++L+S ADY + + Y D + PR P+++ + Sbjct: 186 PHNGVALEDCMNYSVGFRAPTQQDLLSSLADYSIDLGIFKKRYQDKGLTPRFDPSELAQE 245 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPP-YQPDEIYDALKQGEVLV 304 E+ R M+ + I+ P+ F +W S ++ P Y EI +Q V Sbjct: 246 EIHSFRNMLHDAIDSPD-FTRWLTSHFSDTQLNQGYDEQHNPDYSLQEILVLFQQQTVFE 304 Query: 305 RLGGLRVLRIGD-------DVYANGEKIDSP--HRPALDALASNIALTAENFGDALE--- 352 R G+R + + + + G+ +P H A+ A + + + D Sbjct: 305 RQPGIRPIYLAQSDENTSLEFFIEGQAFFAPPEHAQAVRAFLQSASWQFDLHSDKGTAVT 364 Query: 353 -DPSFLAMLAALVNSGYW 369 + ++ +++ LVN+G W Sbjct: 365 INHFWVQLISELVNAGAW 382 >UniRef50_B2SQ70 Transcription factor jumonji, JmjC n=19 Tax=Xanthomonadaceae RepID=B2SQ70_XANOP Length = 498 Score = 301 bits (771), Expect = 3e-80, Method: Composition-based stats. 
Identities = 121/378 (32%), Positives = 190/378 (50%), Gaps = 18/378 (4%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD--GKWQ 62 L + FL +W K P++++ F +F P+ P++LAGLA E V +RL+SH W Sbjct: 21 LGMPVERFLRNYWHKHPLLIRNAFADFASPLQPEDLAGLACEDGVLARLISHDRATDSWD 80 Query: 63 VSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 V GPF+ + L + +W+LLVQ V+ W AL+ FR LP WRIDD+MISF+ GG Sbjct: 81 VRSGPFQETDFPGLPDHDWTLLVQDVDKWDADVRALLEQFRFLPRWRIDDIMISFAATGG 140 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRV------GEKLQMKQHCPHPDLLQVDPFEAIIDEELE 174 VG H+D YDVF++QG G RRW++ G K +L + F+ L Sbjct: 141 SVGAHVDHYDVFLLQGQGHRRWQIDARTAQGSKATPLAFREDVELKLLRTFKPTHHWVLG 200 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 PGD+LY+PP PH G A + + +S+G RAP++ ELI + D ++ Y D D+ Sbjct: 201 PGDMLYLPPLIPHHGVAEDACLTFSIGTRAPSSAELIGDYLDTLIADADEAVRYHDEDLK 260 Query: 235 PRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 A P ++ M+++ + L +N P+ WFG F++ R D+ P P + + Sbjct: 261 VPADPYEIDVTAMNRVVAALNALRMNDPDRLGDWFGRFMTTYRACGDVVPAPAPIPREAV 320 Query: 294 YDALKQGEVLVRLGGLRVLR----IGDDVYANGEKIDSPHRPALDALASNIALTAENFGD 349 AL++G +L R R+ G ++ +G + + A LA+ + + Sbjct: 321 EQALEEGVLLHRHPWSRLAWRRAKRGATLFCSGLEFALSAKDASR-LAAAEKIDGTLYAQ 379 Query: 350 ALEDPSFLAMLAALVNSG 367 P ++ L+ G Sbjct: 380 L--SPRGRDVVLELLAQG 395 >UniRef50_A1VLH8 Cupin 4 family protein n=6 Tax=Burkholderiales RepID=A1VLH8_POLNA Length = 413 Score = 299 bits (767), Expect = 8e-80, Method: Composition-based stats. 
Identities = 117/393 (29%), Positives = 177/393 (45%), Gaps = 36/393 (9%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L F+ RHWQK+P+++++ F +S L LA +V+SRL+ Q W + Sbjct: 12 GLTPAQFMRRHWQKKPLLVRQAIAGFEPFLSRAALFKLAAREQVESRLIVQQAKGWGMKK 71 Query: 66 GPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF +S L + W+LLVQ V+ AL++ FR +PD R+DDLMISF+ PGGGVG Sbjct: 72 GPFASKSLPPLSQEGWTLLVQGVDLHEPAGHALLQQFRFVPDARLDDLMISFATPGGGVG 131 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF+ Q +GRRRW++G + P L + FE + LE GD+LY+PP Sbjct: 132 PHFDSYDVFLFQASGRRRWKIGLQKDF-TLQPDVPLKILQNFEVDEEFVLEAGDMLYLPP 190 Query: 184 GFPHEGYAL---------ENAMNYSVGFRAPNTRELISGFADYVLQR------------- 221 + H+G A + M YS+GFR+P EL S + + Sbjct: 191 RYAHDGIAEASVGTNGKPADCMTYSIGFRSPARTELASELLHRLAEMGEDAAEEACAAEA 250 Query: 222 ----ELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH 277 Y DP P PA + D + +LE + P GE++++ + Sbjct: 251 GRKPARAQPMYRDPTQPATETPAAMPAGLADFAGQAVLEALKDPLALACALGEYMTEPKP 310 Query: 278 ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDS--PHRPALDA 335 + PE + D + R++ D ++ NGE + + Sbjct: 311 GVWFDEPEQAWDGDAAK---AGQMAIALDARTRMMYDSDHIFINGESYRAKGADASLMHR 367 Query: 336 LASNIALTAENFGDALEDPSFLAMLAALVNSGY 368 LA+ L A A S + +L +G+ Sbjct: 368 LANQRCLLASELRKAG--ASAIELLGDWHEAGW 398 >UniRef50_B7H3P1 Cupin superfamily protein n=16 Tax=Acinetobacter RepID=B7H3P1_ACIB3 Length = 387 Score = 298 bits (763), Expect = 3e-79, Method: Composition-based stats. 
Identities = 103/383 (26%), Positives = 183/383 (47%), Gaps = 17/383 (4%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-- 58 + + FL +WQK+P++++ + + P+++ LA+E V +RL+ +D Sbjct: 5 LTVLGGITAEQFLTEYWQKKPLLVRNAMPEIVGMLEPNDVKELALEDHVTARLIRQKDKN 64 Query: 59 -GKWQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 +W V P + L + W+LLVQAV+H+ A L + F +P WR DD+M+S+ Sbjct: 65 PNEWHVKSSPLTKGDFQKLPKL-WTLLVQAVDHYSFDIAELWKKFPFIPQWRRDDIMVSY 123 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELE 174 + GG VG H D YDVF++QG G RRW++G+ + P+ L + + DE L Sbjct: 124 APKGGSVGKHFDFYDVFLVQGYGHRRWQLGQMCDASTEFVPNQPLKLLPEIDVHFDEVLA 183 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 PGD+LY+PPG H G A ++ + +S GFR PN +I +D EL N D Sbjct: 184 PGDLLYVPPGLSHYGVAEDDCLTFSFGFRMPNISGMIDRISDQFATDELLQNPVVDITRK 243 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 ++ +E+ LR+++L + ++S+ ++ +I P+ + +++ Sbjct: 244 NPPQIGEINTEELAYLRDLVLAQLKNSTVLDAALMSYMSEPKYPDNIPEPDE-IEVEDLN 302 Query: 295 DALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSPH--RPALDALASNIALTAENFG 348 L +G L+ R+L + + NGE++ L ++A ++ F Sbjct: 303 AILSEGYELLLEPASRLLYTEQNGILKFWGNGEELPIVESFATQLKSIADGKSIP---FN 359 Query: 349 DALEDPSFLAMLAALVNSGYWFF 371 L + L + L+N+ Sbjct: 360 SELNNTDILENIVQLLNNSILML 382 >UniRef50_A4BDP0 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BDP0_9GAMM Length = 381 Score = 297 bits (760), Expect = 5e-79, Method: Composition-based stats. 
Identities = 115/374 (30%), Positives = 190/374 (50%), Gaps = 16/374 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG--KWQV 63 N +FL +WQ+ P++ + + D I+ DELAGLA E+EV+SRL+S + +W + Sbjct: 16 PFNSQEFLNTYWQQAPLLKRNALS-LHDIITADELAGLATEAEVESRLISGSNETEQWTL 74 Query: 64 SHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 HGPF + + L E +W+LLVQAV+HW ++ F LP WRIDD+MISF+ GGG Sbjct: 75 QHGPFSDDVFQTLPERDWTLLVQAVDHWVPEVRQVLAQFSFLPRWRIDDIMISFATDGGG 134 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDILY 180 VGPH DQYDVF++Q G+R W++G+ + + + FE L+PGD+LY Sbjct: 135 VGPHFDQYDVFLVQLAGQREWKIGQMCDEDSDLVENIPVKVLSAFEEQDAWVLDPGDVLY 194 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH-P 239 +PPG H G +L ++M SVGFRAP+ E I+ ++ Y D + R P Sbjct: 195 LPPGVAHWGTSLGDSMTLSVGFRAPSDSETIAELGHFMSSMVSDFQRYGDAGISQRNQTP 254 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 + +++D+++ ++ L + +WFG+++++ ++ D+ + + + Sbjct: 255 HAIEEEDIDRVQAIIKRLADDRSLVSEWFGQYVTEPKY-DDMNVDTQDWSDESFMKHWQH 313 Query: 300 GEVLVRLGGLRVLRIGDDVYANGEKIDSPHRP-ALDALASNIALTAENFGDALEDPSFLA 358 L R G R+ ++ +G+ P L + L + Sbjct: 314 -HPLYRNPGSRLAYREQTLFVDGQSYGVNATPEELHLICDCDVL------PYNHNVHIQR 366 Query: 359 MLAALVNSGYWFFE 372 + L+N+G FE Sbjct: 367 IALQLLNAGALIFE 380 >UniRef50_B8KGD9 Cupin 4 family protein n=2 Tax=unclassified Gammaproteobacteria RepID=B8KGD9_9GAMM Length = 370 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. 
Identities = 118/377 (31%), Positives = 189/377 (50%), Gaps = 18/377 (4%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 +Y+L L F+ R+WQK+ + + GF +F P DELAGLAME E+D+R+V W Sbjct: 3 DYRLDLEVKSFVARYWQKQHLFIPGGFKHFSVPADADELAGLAMEDELDARIVFRDGQHW 62 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF + +W+LLVQ V+ + A L+ LP WR+DD+M+S++ GG Sbjct: 63 HQERGPFSQESYRRSGSWTLLVQGVDQHWDEAAELLNAVSFLPSWRLDDIMMSYATDGGS 122 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEPGDILY 180 GPH D YDVFIIQG G+RRW+VG + +L + FE+ + + GD+LY Sbjct: 123 AGPHYDNYDVFIIQGDGQRRWQVGGLCDASSALMDNTELRLLADFESQREYLMNTGDVLY 182 Query: 181 IPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA 240 IPPG H G ++ + ++S+GFRAP +L++ +AD +L + DP P Sbjct: 183 IPPGIAHYGVSVGESTSFSIGFRAPRQSDLLARWADNLLNTLEDDALFCDPGREPATRVG 242 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG 300 ++ ++ + R +L + + +WFGE I+ S + D + +QG Sbjct: 243 EITTADLHRARAQLLRVFEDKD--PRWFGEAITNSGTTVQP-------SSDTALNLDEQG 293 Query: 301 EVLVRLGGLRVLRIGDD----VYANGEKIDSPHR--PALDALASNIALTAENFGDALEDP 354 + R G R+ D V+A+G D+P ++AL ++ + +A Sbjct: 294 AWVTRAPGSRLAWHATDEELLVFAHGSTHDTPLALQSVMEALCAHEDVAVSAALEA--HD 351 Query: 355 SFLAMLAALVNSGYWFF 371 + +L+ L + G F Sbjct: 352 AAQGLLSWLHDEGAILF 368 >UniRef50_B1Y837 Cupin 4 family protein n=3 Tax=cellular organisms RepID=B1Y837_LEPCP Length = 418 Score = 291 bits (746), Expect = 2e-77, Method: Composition-based stats. 
Identities = 113/385 (29%), Positives = 181/385 (47%), Gaps = 29/385 (7%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK- 60 E L+ F++RHWQ++P+++++ P+S ++ + + V+SR +S Q Sbjct: 43 ELLGGLSPSVFMQRHWQRKPLLVRQAVPGIEPPVSRAQMFAMLEDDAVESRFLSRQGEGD 102 Query: 61 ---WQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 WQ GP S + + W++LVQ +N A L+ FR +P R+DDLMIS+ Sbjct: 103 RQTWQFKRGPMPRRSLPAIKQPGWTVLVQGLNLHVPAAADLLNRFRFVPQARLDDLMISW 162 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 + GGGVGPH D YDVF+IQ GRRRWR+G + + ++ F + LEP Sbjct: 163 ASEGGGVGPHFDSYDVFLIQVAGRRRWRIGRLPDAR-LREGLPVKIIENFRHEEEWVLEP 221 Query: 176 GDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELIS----GFADYVLQRELGGNY--- 227 GD+LY+PPG+ H+G A++ M SVGFR+P EL+ AD + G Sbjct: 222 GDMLYLPPGWAHDGDAVDGECMTCSVGFRSPQRSELVRETLLRLADGIDDPADAGARPPV 281 Query: 228 YSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPP 287 Y DP A P + + + + ++ + +P + GE++S+ + ++ Sbjct: 282 YRDPKQSATAAPGRIPAELLAFAEQGLMRALAEPGALARALGEYLSEPKAQVSF------ 335 Query: 288 YQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA--LDALASNIALTAE 345 E+ + L G + +L VY NG+ + R A L LA L A Sbjct: 336 ----ELGEPLPDGVGVRLDDRSCLLYDDGHVYCNGDSWRAAGRDAAMLHLLADARQLDAT 391 Query: 346 NFGDALEDPSFLAMLAALVNSGYWF 370 A P+ A+L + G+ Sbjct: 392 TLRRA--SPALRALLEQWADDGWLH 414 >UniRef50_C7I1M3 Cupin 4 family protein n=1 Tax=Thiomonas intermedia K12 RepID=C7I1M3_THIIN Length = 378 Score = 291 bits (744), Expect = 4e-77, Method: Composition-based stats. 
Identities = 120/369 (32%), Positives = 178/369 (48%), Gaps = 11/369 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 FL WQ++P++L++ F F +S +L LA + +V+SRL+ +WQ+ H Sbjct: 10 AFTEARFLREIWQRKPLLLRQAFPGFKPLLSRAQLFALAGQDDVESRLLQRAGRRWQLDH 69 Query: 66 GPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPF + + NW+LLVQ VN + L+R FR +PD R+DDLMIS++ GGGVG Sbjct: 70 GPFSRKQLPPVEQRNWTLLVQGVNLHVDAAGDLLRQFRFIPDARLDDLMISWASEGGGVG 129 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q GRRRWR+G ++ P + + F D LE GD+LY+PP Sbjct: 130 PHQDAYDVFLLQAAGRRRWRIG-PVEDATLQPGKPVKLLAKFTPEEDLILESGDMLYLPP 188 Query: 184 GFPHEGY-ALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 G+ H+G A + M YSVGFRAP EL+ + + + GG Y DP + A PA + Sbjct: 189 GWGHDGIAASGDCMTYSVGFRAPPQGELLKEVLWQLAEAQQGGAIYRDPPLRSGASPALL 248 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 + RE L F+ G +++ + ++ E P + G Sbjct: 249 PAAMVRFAREAFSRLKPDAAMFENVLGLYLTTPKPQVWFESVETP-TATLRRACRQTGCR 307 Query: 303 LVRLGGLRVLRIGDDVYANGEKIDSPHR--PALDALASNIALTAENFGDALEDPSFLAML 360 L R ++L ++ NGE +D+ L LA L+A A Sbjct: 308 LDRR--SKMLYTTQALFLNGEAVDAALASSALLRQLADQQNLSAAQVQTASAAELAAL-- 363 Query: 361 AALVNSGYW 369 A G+ Sbjct: 364 ADWCAIGWL 372 >UniRef50_C0VP99 Cupin 4 n=2 Tax=Acinetobacter RepID=C0VP99_9GAMM Length = 387 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. 
Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 15/382 (3%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-- 58 + + FL +WQK+P++++ I + P ++ LA+E +V +RL+ ++ Sbjct: 6 LTVLGGITAEQFLAEYWQKKPLLVRNAMPEIIGLLEPADVQELALEEDVTARLIRQKNKN 65 Query: 59 -GKWQVSHGPFESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 +W V P D N W+LLVQAV+H+ A L + F +P WR DD+M+S++ Sbjct: 66 PNEWHVKSSPLTKGDFQKLPNLWTLLVQAVDHYSFDIAELWKKFPFIPQWRRDDIMVSYA 125 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELEP 175 GG VG H D YDVF++QG G RRW++G+K P L + DE L P Sbjct: 126 PKGGSVGKHFDFYDVFLVQGYGHRRWQLGQKCDETTALIPDQPLKLLTDMHVEFDEVLAP 185 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+LY+PPG H G A ++ + +S GFR PN E+I +D + E+ D Sbjct: 186 GDLLYVPPGLAHYGVAEDDCLTFSFGFRMPNLSEMIDQVSDKFAENEILKKPLIDIVRQH 245 Query: 236 RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYD 295 A + E+ L+ +L+ + Q F+ ++S+S + I PE +++ + Sbjct: 246 TAPIGKINSTELAYLKAQLLDYLTQAPEFEAAIMSYMSESNYPNSIPEPEE-ITTEDLLE 304 Query: 296 ALKQGEVLVRLGGLRVLRIG----DDVYANGE--KIDSPHRPALDALASNIALTAENFGD 349 + G L+ R+L D +AN E + L +A +L F + Sbjct: 305 VIGTGYQLILEPASRLLYRELGDSLDFWANSENVCVSKNFENELKKIADGESL---EFNE 361 Query: 350 ALEDPSFLAMLAALVNSGYWFF 371 L +A L+NS Sbjct: 362 QFNQHEVLEDIAQLLNSAIVML 383 >UniRef50_A0Z1Z1 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z1Z1_9GAMM Length = 364 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. 
Identities = 124/358 (34%), Positives = 173/358 (48%), Gaps = 23/358 (6%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 + FL+ +WQ+RP+++K N+ P+SP+EL GLA E + DSRL+S W + Sbjct: 7 TFQFDEKVFLDCYWQRRPLLIKAALPNWQSPLSPEELGGLAFEEDADSRLISKSKNGWML 66 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GP S D +W+LLV V+HW AAL + R LP WR DD+M+S++V GGVG Sbjct: 67 KQGPLVSADFQRSDDWTLLVNGVDHWVPEVAALRQCLRFLPQWRFDDVMVSYAVADGGVG 126 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQM-KQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 PH D+YDVF++QGTGRR+WR+G H L + FE + LE GD+LY+P Sbjct: 127 PHFDRYDVFLVQGTGRRKWRLGGWCDENTPRIKHEGLNLLQNFETSEEYLLEAGDVLYVP 186 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD-VPPRAHPAD 241 PG H G A M YS+GFRAP L++ +AD L+ D V P + Sbjct: 187 PGLAHWGVADTPCMTYSLGFRAPTVAALLARWADKTLESVDPELLLEDRASVTNPPRPGE 246 Query: 242 VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGE 301 + + RE + + + W GE +++ + APP K Sbjct: 247 ITLAHWNNAREAIRNSMEALDD-GSWLGEVVTE---HGECAPPPS-----------KHAT 291 Query: 302 VLVRLGGLRVLR----IGDDVYANGE--KIDSPHRPALDALASNIALTAENFGDALED 353 L G RV VYANGE +I P L+ L S ++ A D Sbjct: 292 ALRLHPGARVSWQALSNECSVYANGEALRIPLSSVPILERLCSGDTVSPYELTSAHPD 349 >UniRef50_C7RB22 Cupin 4 family protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RB22_KANKD Length = 390 Score = 284 bits (728), Expect = 3e-75, Method: Composition-based stats. 
Identities = 122/373 (32%), Positives = 201/373 (53%), Gaps = 18/373 (4%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFI-----DPISPDELAGLAMESEVDSRLVSHQ 57 ++ FL+ +WQKRP++++ F++ IS +ELAG ++E +++SRL+ Sbjct: 4 ILGDISPEQFLKEYWQKRPLLIRGAFSSAQVSGEDALISAEELAGYSLEDDIESRLIERD 63 Query: 58 DGKWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 WQ+ HGP + LG+ NW+LLVQ+++++H P L++ +P WR+DD+M+S+ Sbjct: 64 GDDWQLEHGPIAESKFAELGDQNWTLLVQSLDYFHPPLCELIKACNFIPRWRLDDVMVSY 123 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-MKQHCPHPDLLQVDPFEAIIDEELE 174 + GGGVGPHLD+YDVF+IQG G+RRWRVG K Q CPHP + Q++PF+A +D + Sbjct: 124 ATNGGGVGPHLDKYDVFLIQGEGQRRWRVGHKNQGTTAICPHPQIAQIEPFDADMDVIVN 183 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 PGD+LYIPP PH G ++ N++ YSVGFRAPN ++ L + +SD Sbjct: 184 PGDMLYIPPNTPHWGESVGNSICYSVGFRAPNIGGIVQKLMQ--LPQTELDQLWSDEARL 241 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 L ++M + E L+ + E + FG+ +++ ++ + P+ D I Sbjct: 242 SLKSSRGELTRDMSRWAEQQLKQLWTSEDYLMAFGKEVTELKYPDMLEVPDDDELIDWIE 301 Query: 295 DALKQGEVLVRLGGLRVLRIGDD------VYANGE--KIDSPHRPALDALASNIALTAEN 346 AL+QG L + + G ++ NGE + P ++ L +A+ Sbjct: 302 LALEQGVKAEPLARMTYFKHGGKESNELWLFINGEWQAMHISLEPLIEKLNLTYECSAKE 361 Query: 347 FGDALEDPSFLAM 359 + + L + Sbjct: 362 LATLAHEVAHLFL 374 >UniRef50_D1KE35 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KE35_9GAMM Length = 362 Score = 284 bits (728), Expect = 3e-75, Method: Composition-based stats. 
Identities = 107/373 (28%), Positives = 177/373 (47%), Gaps = 26/373 (6%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQD 58 M ++ +FLE +WQK+P+++K+ NFI PIS DELAGL++E E +SRLV S Sbjct: 1 MIRFGAISVEEFLEDYWQKKPLLIKQALPNFISPISSDELAGLSLEEEFESRLVQGSTAQ 60 Query: 59 GKWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 +W +++GPF + L E +W+LLVQ V+ + + L++ F +P WR DD+MIS++ Sbjct: 61 QQWSLTNGPFTKTTFTQLPEQDWTLLVQGVDRFIDEVHDLIKQFDFIPRWRFDDVMISYA 120 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFEAIIDEELEP 175 GG VGPH D YDVF++QG+GRRRW + + + + L + F E+EP Sbjct: 121 TKGGSVGPHFDYYDVFLLQGSGRRRWELSTQFCTLDNYLKDVPLRIMHTFTPEQFFEVEP 180 Query: 176 GDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 GD+LYIPP H G +L++ S G+RA + +EL D + YY DP Sbjct: 181 GDVLYIPPKVAHHGVSLDDECTTLSFGYRAYSAQELFESL-DMQNPDQEQNIYYQDPIWI 239 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 + PA + +++ +++ + F + + L E Sbjct: 240 NTSSPALIPDLAIEQANQILKI---SSDEFAAFVTKLDILDEQLLQHFEAET-------- 288 Query: 295 DALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSPHRP--ALDALASNIALTAENFG 348 ++ ++ D V+ NGE D+ A+ + + A++ Sbjct: 289 --FREKMQYKLHPSCKIAYFLVDTTPKVFINGEYFDTQEFDPQAVMQFCNKRTINAKDHQ 346 Query: 349 DALEDPSFLAMLA 361 + ++ Sbjct: 347 QLTINLFKQNLIY 359 >UniRef50_B8KRM1 Cupin 4 family protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KRM1_9GAMM Length = 365 Score = 280 bits (716), Expect = 7e-74, Method: Composition-based stats. 
Identities = 129/371 (34%), Positives = 188/371 (50%), Gaps = 19/371 (5%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQV 63 ++L+ FL+ +WQK P+V+++ +F PI D LAGLA+E +V SR+VS G W+V Sbjct: 2 TISLDTERFLKHYWQKHPLVIRQAVPDFTPPIDADHLAGLALEPDVQSRIVSCDRGHWEV 61 Query: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 HGPF D + WSLLVQ V+ AAL R LP WR DD+MIS++ GG VG Sbjct: 62 QHGPFSEADFDRDDQWSLLVQGVDRLLPEVAALQRAVDFLPSWRFDDVMISYASEGGSVG 121 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-LLQVDPFEAIIDEELEPGDILYIP 182 PH D+YDVF++QG G R WR+G++ + D LL +D FE L+ GD LYIP Sbjct: 122 PHFDRYDVFLLQGEGEREWRIGQRCDHTTATHNYDELLLLDDFEHRETHLLQTGDALYIP 181 Query: 183 PGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD--VPPRAHPA 240 PG H G A +S+GFRAP+ L + D L++ + D + V R P Sbjct: 182 PGIAHWGIARGPCTTFSLGFRAPSIAALTARLTDSALEQLMPDLLLEDRNSLVSERGRPG 241 Query: 241 DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAP--PEPPYQPDEIYDALK 298 ++ Q+ D +R +L ++ + W GE ++++ + +P PP+ ++ + Sbjct: 242 EITTQQRDNIRSAVLSALSALDD-GVWLGELLTETEPFIGESPEGAVPPHIAMDLGSRIN 300 Query: 299 QGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLA 358 E G V+ANGE+ R ALD L A + A D A Sbjct: 301 WMETP----------EGIAVFANGERFP-ASRQALDVLTPLCAGGSIMTSTAPIDT--RA 347 Query: 359 MLAALVNSGYW 369 +L L +G Sbjct: 348 LLEWLWAAGVL 358 >UniRef50_A4SX54 Cupin 4 family protein n=2 Tax=Polynucleobacter necessarius RepID=A4SX54_POLSQ Length = 410 Score = 279 bits (713), Expect = 2e-73, Method: Composition-based stats. 
Identities = 113/389 (29%), Positives = 191/389 (49%), Gaps = 31/389 (7%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFI----------DPISPDELAGLAMESEVDSRLVS 55 ++ F++++W K+P++++ F PIS ELA L+ + V+SRL+ Sbjct: 30 GISPEQFMKQYWHKKPLLIRGAIPAFSLTNQNGEALESPISFPELAELSTQDTVESRLIR 89 Query: 56 HQDGKWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 + W HGPF +S + + NW+LL+Q + H A ++ FR +PD R+DDLMI Sbjct: 90 SK--PWSFDHGPFAKKSIPAINKPNWTLLLQGMEAHHPAAAKILSWFRFIPDARLDDLMI 147 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 S + GGGVGPH D YDVF++Q +GRR W + E+ P L + F + D L Sbjct: 148 SVAGIGGGVGPHFDSYDVFLMQMSGRRHWHISEQKD-LSLNPKLPLKILQHFRSEQDWIL 206 Query: 174 EPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELIS----GFADYVLQRELGGNYY 228 EPGD+LY+PP H+G AL+ +S+GFR+P+ +EL+ A+ + + Sbjct: 207 EPGDMLYLPPHVAHDGIALDAGCQTWSIGFRSPSFKELLQEGLWRLAESLENLPELEQKF 266 Query: 229 SDPDVPPRAHPADVLPQEMDKLREMMLEL-INQPEHFKQWFGEFISQSRHELDIAPPEPP 287 +DP A + + + +L+ + +L ++Q + F ++S+ + + P P Sbjct: 267 ADPKQEATASAEQLPDELIAQLKGQLHKLKLDQIDSFLPGITAYLSEPKQQAIFDGPNSP 326 Query: 288 YQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRP----ALDALASNIALT 343 +P L + E L+ R+L +G V+ NGE + P A +L++ L Sbjct: 327 LKPKAFLARLSR-ENLLPHPQTRILSLGKQVFCNGESMTQDQGPRIGDAWRSLSAQKRLR 385 Query: 344 AENFGDALEDPSFLAMLAALVNSGYWFFE 372 ++ + + L SG+ FE Sbjct: 386 TKSLQNIDKSS-----LYEAYLSGWLIFE 409 >UniRef50_Q0VQ28 Putative uncharacterized protein n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VQ28_ALCBS Length = 377 Score = 277 bits (708), Expect = 5e-73, Method: Composition-based stats. 
Identities = 119/375 (31%), Positives = 189/375 (50%), Gaps = 16/375 (4%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ-DGK 60 + L L FL HWQKRP+ + + P + LAGLA+E V++R+++ +G Sbjct: 9 SFTLPLTPAAFLREHWQKRPLFMPGAASGLDQP-DANTLAGLALEESVEARVITGAGNGP 67 Query: 61 WQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 W V P + ++ LGE NW+LLVQ+V+H+ T+ L+ F LP+WR++D+MIS++ Sbjct: 68 WSVLQSPLDDNVFEALGEKNWTLLVQSVDHFLTETSLLLDDFAFLPNWRVEDIMISYAAK 127 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-LLQVDPFEAIIDEELEPGD 177 GG VGPH D+YDVF+IQ +G RRW++G+ D L + + +PGD Sbjct: 128 GGSVGPHFDRYDVFLIQASGSRRWQIGDVCDESSPRQATDELKLLAQMPVREEFIAQPGD 187 Query: 178 ILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 +LY+PPG H G A + + + +SVGFRAP+ + L++ A L E ++DPD Sbjct: 188 VLYLPPGVAHHGVAEDSDCITWSVGFRAPDYQMLMAEIAGECLA-ESDSKLFTDPDRGIT 246 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 P+ + + +L L+L++ PE ++ ++S R E + Sbjct: 247 TDPSILADTDRQQLVRGALDLLH-PEAIERAIYRWLSTPRLEGL-----EFAVDEHHIRE 300 Query: 297 LKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH--RPALDALASNIALTAENFGDALEDP 354 LVR G +R+L G + NGE +P + LAS DA+ P Sbjct: 301 RDSDVSLVRHGSVRLLMQGKLAWLNGEAHTLTEQQQPLVQLLASKRRYQKREL-DAVMTP 359 Query: 355 SFLAMLAALVNSGYW 369 + +L + GY+ Sbjct: 360 TARELLHEWIEQGYF 374 >UniRef50_B4X170 Cupin superfamily protein n=1 Tax=Alcanivorax sp. DG881 RepID=B4X170_9GAMM Length = 382 Score = 271 bits (692), Expect = 4e-71, Method: Composition-based stats. 
Identities = 117/374 (31%), Positives = 186/374 (49%), Gaps = 16/374 (4%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQD-GKW 61 + L + FL HWQK+ + + P + LAGLA+E V++R+++ D G W Sbjct: 15 FILPMKPAAFLREHWQKKALFMPGAARGLDQP-DANTLAGLALEESVEARIITGADNGPW 73 Query: 62 QVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 V P + ++ LGE NW+LLVQ+V+H+ T+ L+ F LP+WR++D+MIS++ G Sbjct: 74 SVLQSPLSDDVFETLGEENWTLLVQSVDHFLTETSLLLDDFAFLPNWRVEDIMISYAAKG 133 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDI 178 G VGPH D+YDVF+IQ G RRW++G+ P +L + + PGD+ Sbjct: 134 GSVGPHFDRYDVFLIQAAGHRRWQIGDVCDESTPRQPTDELKLLADMPVREEFVAAPGDV 193 Query: 179 LYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 LY+PPG H G A + + + +SVGFRAP+ + L++ A L E ++DPD + Sbjct: 194 LYLPPGVAHHGVAEDSDCITWSVGFRAPDYQMLMAEIAGECLA-ESDSQLFTDPDRDVTS 252 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDAL 297 P + + +L L+L+ QP+ ++ ++S R + D Sbjct: 253 DPTVLADADRQQLIRGALDLL-QPDAIERAVYRWLSTPRLDGL-----EFAIDDHHIRER 306 Query: 298 KQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH--RPALDALASNIALTAENFGDALEDPS 355 LVR G +R+L G + NG+ RP + LAS DA+ P+ Sbjct: 307 DDKVALVRHGSVRLLMQGKLAWLNGDSHTLTEQQRPLVQLLASKRRYQESEL-DAVMTPA 365 Query: 356 FLAMLAALVNSGYW 369 +L + GY+ Sbjct: 366 ARELLHEWIEQGYF 379 >UniRef50_B9ZR02 Cupin 4 family protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZR02_9GAMM Length = 385 Score = 271 bits (692), Expect = 4e-71, Method: Composition-based stats. 
Identities = 112/381 (29%), Positives = 187/381 (49%), Gaps = 24/381 (6%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLV--SHQD 58 +E L+ +FL +WQ++P++++ + F +PI PD+LAGLA + + +RLV Sbjct: 10 LELLGGLSPAEFLRDYWQQKPLLVRGAVSGFANPIEPDDLAGLACDPDASARLVLGDTDH 69 Query: 59 GKWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 G W V +GPFE + L + W+LL+ V + + F +P WR DDLMIS++ Sbjct: 70 GDWAVEYGPFEEDRFASLPDRAWTLLISDVERFWPEGHDFLARFDFVPRWRRDDLMISYA 129 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 P G VGPH+D YDVF+ Q GRRRW++ L + FE +LEPG Sbjct: 130 SPDGSVGPHVDAYDVFLFQAAGRRRWQIQSPPGPLDCHDDLPLAILREFEPTESWDLEPG 189 Query: 177 DILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 D+LY+PP PH G +L++ M +S+GFRAP +L++GF + R YSDP P Sbjct: 190 DLLYLPPNLPHYGLSLDDQCMTWSIGFRAPTYLDLLTGFLEERANRVGEAPRYSDPQRPV 249 Query: 236 RAHPADVLPQEMDKLREMMLELI-NQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 A+ +++ + +LR+++ E++ + G F+++ +++ +PP + E Sbjct: 250 SAYVSELPSHDRTRLRDILREMLAADDTELDAFLGRFLTRPAGNVELHTGDPPAEARECR 309 Query: 295 DALKQGEVLVRLGGLRVLRI----GDDVYANGEKIDSPHRP--ALDALASNIALTAENFG 348 G+R + G + A G + L+ L + + E + Sbjct: 310 V----------HPGIRRYWLQTPAGPILCAAGHSYPASSLAPGDLEQLCATEIVKPEQWE 359 Query: 349 DALEDPSFLAMLAALVNSGYW 369 + P+ ML + G+ Sbjct: 360 --HQWPAVHDMLRDGLEEGWL 378 >UniRef50_P44683 Uncharacterized protein HI0396 n=36 Tax=Gammaproteobacteria RepID=Y396_HAEIN Length = 404 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 111/400 (27%), Positives = 190/400 (47%), Gaps = 30/400 (7%) Query: 1 MEYQLT--LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVS-HQ 57 +++ L + FL +WQK+P+V++ G + P ++ LA +V +RLV Sbjct: 7 VDFCLPEHITPEIFLRDYWQKKPLVIRNGLPEIVGQFEPQDIIELAQNEDVTARLVKTFS 66 Query: 58 DGKWQVSHGPFES--YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 D W+V P + L E WS+LVQ + W L F +P W+ DD+M+S+ Sbjct: 67 DDDWKVFFSPLSEKDFQKLPEK-WSVLVQNLEQWSPELGQLWNKFGFIPQWQRDDIMVSY 125 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDP-FEAIIDEEL 173 + GG VG H D+YDVF++QG G RRW+VG+ + P+ + D E +IDE + Sbjct: 126 APKGGSVGKHYDEYDVFLVQGYGHRRWQVGKWCDASTEFKPNQSIRIFDDMGELVIDEVM 185 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 PGDILYIP H G A ++ + +S G R PN LI G + ++ N S+ D+ Sbjct: 186 NPGDILYIPARMAHYGVAEDDCLTFSFGLRYPNLSNLIDGISKGFCHQDPDLNL-SEFDL 244 Query: 234 PPRAHPAD-----VLPQEMDKLREMMLELINQPEHFKQWFGEFISQ--SRHELDIAPPEP 286 P R ++ + + + +++++L+ + E F F + ++ S ++ + Sbjct: 245 PLRLSQSEQRTGKLADENIQAMKQLLLDKLAHSEAFDTLFKQAVASAVSSRRYELLVSDE 304 Query: 287 PYQPDEIYDALKQ-GEVLVRLGGLRVLRIGD--DVYANGE---KIDSPHRPALDALASNI 340 PDE+ L++ G L + ++L + +YANGE +++ L L+ Sbjct: 305 MCDPDEVRSILEEDGAFLSQDNNCKLLYTENPLRIYANGEWLDELNIIESEVLKRLSDGE 364 Query: 341 ALTAE---NFGDALEDPSF-----LAMLAALVNSGYWFFE 372 +L + + EDP L + V+ G+ E Sbjct: 365 SLDWAFLSDLANKTEDPETSMDLLLDSICNWVDDGWALIE 404 >UniRef50_C1E292 Predicted protein n=2 Tax=Micromonas RepID=C1E292_9CHLO Length = 466 Score = 264 bits (674), Expect = 5e-69, Method: Composition-based stats. 
Identities = 104/398 (26%), Positives = 163/398 (40%), Gaps = 35/398 (8%) Query: 9 WPDFLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAMESEVDSRLVSHQDGK---WQVS 64 W F E++WQK PVV++ G P+ DELAGLA E+E R++ D W + Sbjct: 68 WTTFFEKYWQKEPVVIRGGLPTELCTPVDNDELAGLACETEFRPRIIRKGDEGPSSWSLQ 127 Query: 65 HGPF--ESYDHLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 GPF + L W LL+ + ++ F P WR+ D+ S S GG Sbjct: 128 MGPFSEDELKSLPSDGSWCLLLNDLEKHVSEFMDVLNLFDRFPRWRVADVQASISSEGGS 187 Query: 122 VGPHLDQYDVFIIQGTGRRRWRV-----GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 VG H DQ+DVF+IQGTG +RW + + P ++ + F+ L+ G Sbjct: 188 VGAHSDQFDVFLIQGTGHKRWSISDCAEYVPDNDEAFFPDAEVRVLKNFQPQSCSLLKQG 247 Query: 177 DILYIPPGFPHEGYALEN---AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 DILY+PP H G A +SVGF AP EL+ +A + G + DP + Sbjct: 248 DILYLPPKVAHHGVAEGCKTICTTFSVGFLAPAHDELVLSYAQASVDTHDGSQRWRDPWL 307 Query: 234 PPRAHPADVLPQEMDKLREMMLELI-NQPEHFKQWFGEFISQSRHELDIAP-PEPPYQPD 291 P+ H ++ + + + E++ + + +WFG +QS P D Sbjct: 308 KPQEHVGEISSEAVAQAAEIIRQSMPKNDAEIARWFGCHATQSFGIDPSETIPAKDLSAD 367 Query: 292 EIYDALKQGEVLVRLGGLRVLR---------IGDDVYANGEKIDSPHRP---ALDALASN 339 E+ + L R + G +A G P +A+ Sbjct: 368 ELVVQFAEEGSLQRRADAKFAFVQEVKDGSLEGGLFFAAGNMWPLQSAPGMELARHIANY 427 Query: 340 IALTAENF------GDALEDPSFLAMLAALVNSGYWFF 371 + A+++ + D +L L +SG +F Sbjct: 428 DEIIADDWIESGDAAEYDMDSEAKTLLHDLFSSGLIYF 465 >UniRef50_A4S2B8 Predicted protein n=2 Tax=Ostreococcus RepID=A4S2B8_OSTLU Length = 392 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. 
Identities = 104/389 (26%), Positives = 167/389 (42%), Gaps = 33/389 (8%) Query: 13 LERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK---WQVSHGPFE 69 + +WQK+P+++++ NF P+ +E+AGLA E + +R+ + W+ GPFE Sbjct: 1 MREYWQKKPLLMRQAIPNFRPPLDGNEIAGLACEEDASARIFVREGDDEQSWRKKIGPFE 60 Query: 70 SYD--HLGETN-WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 D L E WSL+V ++ +P ++ F P WRI D+ S S GGGVGPH Sbjct: 61 ESDLTSLPEDKPWSLIVNDLDVQAQPFGDMLELFNCFPRWRISDIQASVSPDGGGVGPHS 120 Query: 127 DQYDVFIIQGTGRRRWRV-----GEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYI 181 D +DVF++Q G + W V P ++ + F L PGD+LY+ Sbjct: 121 DHFDVFLLQAEGEKVWAVADNEEYWPDNDAAFVPECEIRVLKSFVEDDSFTLVPGDMLYL 180 Query: 182 PPGFPHEGYALEN----AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 PP H G A + ++ S+GF AP T EL+ + ++ L G+ +SDP + P Sbjct: 181 PPKIAHNGVATNSKPGVSVTLSIGFLAPTTDELVLSYTQRASEK-LKGSRWSDPWLKPVE 239 Query: 238 HPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 + + + E++ +WFG + E D E +E+ A Sbjct: 240 DVGAISAESITYASEIIKRTYPKNDAEVARWFGCHTTARTGEDD-DADENEVSIEELLAA 298 Query: 297 LKQGEVLVRLGGLRVLRIG---------DDVYANGEKIDSPHRPALD---ALASNIALTA 344 + ++ R LR + +ANGE D A+ +A+ L Sbjct: 299 WEHQGLVARED-LRFAFVEKVADDSLKNALFFANGECWDVVSPAAVKTATVIANRGELYE 357 Query: 345 ENFGDA--LEDPSFLAMLAALVNSGYWFF 371 E+ D L + L GY +F Sbjct: 358 EDTQTEECDFDDEALKLALTLFERGYLYF 386 >UniRef50_B5DUH6 Lysine-specific demethylase NO66 n=2 Tax=Drosophila pseudoobscura pseudoobscura RepID=NO66_DROPS Length = 946 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. 
Identities = 65/423 (15%), Positives = 137/423 (32%), Gaps = 56/423 (13%) Query: 6 TLNWPDFLERHWQKRPV-VLKRGFNNFIDPISPDELAGLAMESEVD----SRLVSHQDGK 60 + FL HW+K P V+ F + IS + + +++ V+ + S++DG Sbjct: 518 PMTMATFLRDHWEKSPFRVITTTSGGFSNLISFKMIDKMLIQNHVEYTTNIDVTSYEDGV 577 Query: 61 WQVSHGPFE----SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + S + S+ + + + L +E + + Sbjct: 578 RKTLNPDGRALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTP 637 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++RWR+ + +L Q + + I+D L PG Sbjct: 638 PESQGFAPHYDDIEAFVLQVEGKKRWRIYAPTKELPRESSGNLSQTELGDPIMDIVLMPG 697 Query: 177 DILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 D+LY P G+ H+ +++ + + ++ + L+ V++ + + Sbjct: 698 DLLYFPRGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGL 757 Query: 234 P----------PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHEL---D 280 P + + ++ + L+ + + + +HE Sbjct: 758 PLDIFQNLGVANAEWNGVHRQKLIQHIQNLAQRLMPTEGQIDRALDQLAIKFQHEALPPT 817 Query: 281 IAPPE--------------PPYQPDEIYDALKQGEVLVRLGGLRVLRIGD-----DVYAN 321 IAP E + + A L+R +R+ N Sbjct: 818 IAPQELKRTVYGAQATADKTGHCSLDYELAEGTAVRLLRANIVRLTVDEGVLRCYYYTDN 877 Query: 322 GEKI----------DSPHRPALDALASNIA--LTAENFGDALEDPSFLAMLAALVNSGYW 369 G + + H ++ L ++ D L + AL G Sbjct: 878 GLEYCKYEPNFFELEPFHGTVIETLIHAYPDYTKIKDLPPMGNDEDRLEFVEALWERGIL 937 Query: 370 FFE 372 E Sbjct: 938 MVE 940 >UniRef50_B4GUZ2 Lysine-specific demethylase NO66 n=2 Tax=Drosophila persimilis RepID=NO66_DROPE Length = 687 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. 
Identities = 61/423 (14%), Positives = 134/423 (31%), Gaps = 56/423 (13%) Query: 6 TLNWPDFLERHWQKRPVVLKRG-FNNFIDPISPDELAGLAMESEVD----SRLVSHQDGK 60 + FL HW+K P +K F + IS + + +++ V+ + S++DG Sbjct: 259 PMTMATFLRDHWEKSPFRVKTTTSGGFSNLISFKMIDQMLIQNHVEYTTNIDVTSYEDGV 318 Query: 61 WQVSHGPFE----SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + S + S+ + + + L +E + + Sbjct: 319 RKTLNPDGRALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTP 378 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++RWR+ + +L Q + + I+D L+PG Sbjct: 379 PESQGFAPHYDDIEAFVLQVEGKKRWRIYAPTKELPRESSGNLSQTELGDPIMDIVLKPG 438 Query: 177 DILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 D+LY P G+ H+ +++ + + ++ + L+ V++ + + Sbjct: 439 DLLYFPRGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGL 498 Query: 234 P----------PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE----- 278 P + + ++ + L+ + + + +HE Sbjct: 499 PLDIFQNLGVANAEWKGAHRQKLIQHIQNLAQRLVPTEGQIDRALDQLAIKFQHEALPPT 558 Query: 279 ------------LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGD-----DVYAN 321 + + A L+R +R+ N Sbjct: 559 IAPQELKRTVFGAQATADRNGHCSLDYELAEGTAVRLLRANIVRLTVDEGVLRCYYYTDN 618 Query: 322 GEKI----------DSPHRPALDALASNIA--LTAENFGDALEDPSFLAMLAALVNSGYW 369 G + + H ++ L ++ D L + AL G Sbjct: 619 GLEYCKYEPNFFELEPFHGTVIETLIHAYPEYTKIKDLPPMGNDEDRLEFVEALWERGIL 678 Query: 370 FFE 372 E Sbjct: 679 MVE 681 >UniRef50_B4L6Q5 Lysine-specific demethylase NO66 n=1 Tax=Drosophila mojavensis RepID=NO66_DROMO Length = 888 Score = 244 bits (622), Expect = 6e-63, Method: Composition-based stats. 
Identities = 68/409 (16%), Positives = 137/409 (33%), Gaps = 37/409 (9%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVD----SRLV 54 + + L + F E++W++ +KR N F IS + + +E++++ + Sbjct: 476 LNWLLNPITSETFFEQYWERNACQVKRKQPNYFTQLISFQMIDEMLIENQLEFTTNIDVT 535 Query: 55 SHQDGKWQVSHGPFES----YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 +++ G Q + + S+ + + + L +E + Sbjct: 536 TYKKGVRQTLNPVGRAMSPAIWGYYGDGCSIRILNPSTYLPKLRQLCSTMQEFFHCLVGA 595 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEA 167 + G PH D + F+IQ GR+RWR+ + + Q + + Sbjct: 596 NVYLTPPNSQGFAPHYDDIEAFVIQVEGRKRWRLYAPPHQSDVLARTSSGNYKQEELGQP 655 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELG 224 + D LE GDILY P G H+ + + ++ L+ VL+R + Sbjct: 656 LFDAVLEAGDILYFPRGTVHQAVTEPKQHSLHITLSVYQQQAYANLLEVLMPSVLERAIK 715 Query: 225 GNYYSDPDVPPR----------AHPADVLPQEMDKLREMMLELI-NQPEHFKQWFGEFIS 273 + +P ++ Q M+ ++++ + + Sbjct: 716 HHLSLRRGLPLHIWQHVGLAKGGQQSEQRDQLMNSTKQLVQRYLVPTEAQIDAAVDQLAK 775 Query: 274 QSRHEL--DIAPPEPPYQPDEIYDA-LKQGEVLVRLGGLRVLRIGDDV--YANGE----K 324 + +HE PE + ++ + R LRV D+ Y E + Sbjct: 776 RFQHEALPPYIKPEESMRTTKVRLLRRQHPAPGGRRQQLRVYYYVDNALEYCKNEPNYME 835 Query: 325 IDSPHRPALDALASNIALTAENFGDALEDPSFL-AMLAALVNSGYWFFE 372 I PA++AL + + L + AL G E Sbjct: 836 IQPTEAPAVEALMTTYPAYLKVGKLPLRSADRRIEVATALWERGLLMTE 884 >UniRef50_UPI0000E87D6F hypothetical protein MB2181_02235 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87D6F Length = 377 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. 
Identities = 91/368 (24%), Positives = 166/368 (45%), Gaps = 13/368 (3%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 ++ FLE++W K+ + L+ + +S D + GLA ++S++++ +G Q ++G Sbjct: 13 ISPSAFLEKYWGKQALFLQDAIDISGAGLSKDVVFGLAKNENIESKIIAFIEGSQQTTYG 72 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 PF H + SLL+ N HE + L + +P DD+M+SFS GGGVGPH Sbjct: 73 PFNKVKHGKSS--SLLIHQFNLIHEFSYNLFQSINFVPYCLHDDVMMSFSSEGGGVGPHS 130 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFP 186 D YDVF++QG G + W +G + D + F +PGDILY+PP P Sbjct: 131 DSYDVFLVQGQGEKVWNIGATDKKAFKTTSTDHSNLK-FTPTEQFLAKPGDILYVPPFTP 189 Query: 187 HEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 H G +L ++ + YS+GFR+P+ E+ + + +Y++ R+ N + + + ++P Sbjct: 190 HHGISLSDDCITYSIGFRSPSNNEIRNQYLEYLMDRKEKSNDLFN-GLDLSENTKALIPN 248 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 + + P + G F+S+ + + ++L Sbjct: 249 ALASFIKKNTAFPKDPTIIDDFIGCFLSEPHEGAFFTKKN---ITKNAFKKIDTEKILRL 305 Query: 306 LGGLRVLRIGDDVYANGEKIDSP--HRPALDALASNIALTAENFGDALEDPSFLAMLAAL 363 R + ++ Y N E I R + L + + + + S + ++ L Sbjct: 306 NIQTRAVIHNENFYINAENIFVANKDRMFFEELFNQKQI---LITPSKANDSLVEVMIYL 362 Query: 364 VNSGYWFF 371 ++ GY F Sbjct: 363 LSEGYITF 370 >UniRef50_B4M7P8 Lysine-specific demethylase NO66 n=3 Tax=Drosophila RepID=NO66_DROVI Length = 907 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. 
Identities = 64/432 (14%), Positives = 139/432 (32%), Gaps = 60/432 (13%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVD----SRLV 54 + + + + DF ++W++ +KR N F IS + + + + ++ + Sbjct: 472 LHWLINPMTSDDFFSQYWERNACQVKRKQPNYFSQLISFKLIDEMLIRNHLEFTTNIDVT 531 Query: 55 SHQDGKWQVSHGPFE----SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 ++++G + + + S+ + + + L +E + Sbjct: 532 TYKNGMRETHNPDGRAMPPTVWGFYSDGCSIRILNPSTYLIKVRQLCAMMQEFFHCLVGA 591 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE---KLQMKQHCPHPDLLQVDPFEA 167 + G PH D + F++Q GR+RWR+ + + Q + E Sbjct: 592 NVYLTPPNSQGFAPHYDDIEAFVLQVEGRKRWRLYSPLHPSDVLARNSSGNYSQAELGEP 651 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELG 224 + D LEPGDILY P G H+ + + + ++ L+ VLQR + Sbjct: 652 LFDAVLEPGDILYFPRGTVHQAVCDQQQHSLHITLSVYQQQAYANLLEELMPAVLQRAIK 711 Query: 225 GNYYSDPDVPPR----------AHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFIS 273 + +P +++ + + + ++ + L+ + Sbjct: 712 HHLSLRRGLPLHIWQHLGLAKGDQKSELRDELLGNTKRLVQQYLMPSDAQIDAAVDQLAK 771 Query: 274 QSRHE-LDIAPPEPPYQPDEIYDALK----------------QGEVLVRLGGLRVLRIGD 316 + +HE L ++ + L+R LR++ G Sbjct: 772 RFQHEALPPVVLPEEHERTVFGSRSQANSHGQCLCDYELTERTSVRLLRANILRLVAEGS 831 Query: 317 DV-----------YA----NGEKIDSPHRPALDALASNIALTAENFGDALEDPSFL-AML 360 + + N +I A++AL S + L + Sbjct: 832 SLRVYYYVDNALEFCKYEANFMEIQPTEAAAVEALMSAYPKYLKVAKLPLRSAERRIEVA 891 Query: 361 AALVNSGYWFFE 372 AL G E Sbjct: 892 TALWERGLLMTE 903 >UniRef50_B4Q068 Lysine-specific demethylase NO66 n=5 Tax=Sophophora RepID=NO66_DROYA Length = 683 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. 
Identities = 76/433 (17%), Positives = 154/433 (35%), Gaps = 61/433 (14%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLAMESEVD----SRLV 54 +++ L + F + W+ V++R ++ IS + + + +D + Sbjct: 247 LQWLLNPIKVNHFFDDFWEHTAFVVQRKNPHYYSKLISFKMIDEMLVRHRLDFTINVDVT 306 Query: 55 SHQDGKWQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 ++++GK + + + L S+ + + + + +E + Sbjct: 307 TYKNGKRETLNPEGRALPPVVWGLYSEGCSIRILNPSTYLVGLRQVCSIMQEFFHCLVGA 366 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC---PHPDLLQVDPFEA 167 + G PH D + F+IQ GR+RWR+ E + Q E Sbjct: 367 NVYLTPPNSQGFAPHYDDIEAFVIQVEGRKRWRLYEPPSGSDQLCRNSSSNFDQEQLGEP 426 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELG 224 I+DE LE GD+LY P G H+ E + + ++ L+ VL++ + Sbjct: 427 ILDEVLEAGDLLYFPRGTVHQAITEEEQHSLHITLSVYQQQAYVNLLEKLMPIVLKKAIK 486 Query: 225 GNYYSDPDVP----------PRAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFIS 273 + +P RA+ +D Q ++ +++++ + L+ + + + Sbjct: 487 QSVALRRGLPLHTFHVLGEAQRANRSDSRNQLVENVQKLVTKHLMPSAQDIDEAVDQLAK 546 Query: 274 QSRHEL--DIAPPEPPY--------QPDEIYDAL-------KQGEVLVRLGGLRVLRIGD 316 + +HE I PE DE +A+ K L+R LR++ D Sbjct: 547 KFQHEALPPIILPEEQVRTVFGSRSTADEQGNAICDYEFDTKTSVRLLRANILRLVTEED 606 Query: 317 ---DVYA---NG----------EKIDSPHRPALDALASNIALTAENFGDALEDPS-FLAM 359 +Y NG +I A++ L S L+ + + + Sbjct: 607 GSVRIYHHVDNGFEYCKYEPIFMEILPEEAAAVELLISAYPYYLTVGQMPLDSAARKVEV 666 Query: 360 LAALVNSGYWFFE 372 + AL G E Sbjct: 667 VTALWERGLLMTE 679 >UniRef50_A0YJB4 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YJB4_9CYAN Length = 386 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. 
Identities = 58/375 (15%), Positives = 130/375 (34%), Gaps = 18/375 (4%) Query: 10 PDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF 68 FL+++W K+ +++ + F S + L + + + + Sbjct: 14 ELFLQKNWLKQALIISGYSPHKFSHLFSWQDFNTLLNFHHLTYPEIRLAKSGQTLPENAY 73 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 ++ + ++++ ++ A R R G H D Sbjct: 74 DNLIKSCQDGATVIIDSLQTRLPVIAEFTANLRNELGHRTQINAYCSFPGSQGFACHYDS 133 Query: 129 YDVFIIQGTGRRRWRVGEKLQMKQHCPH-PDLLQVDPFEAIIDEELEPGDILYIPPGFPH 187 ++VFI+Q +G + WRV H +LL + I++ L+PGD+LYIP G H Sbjct: 134 HEVFILQISGDKHWRVFSPTFEFPLSKHRSNLLDPPTTDPYINQVLKPGDLLYIPRGHWH 193 Query: 188 EGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQE 246 A++ +++ ++G + + + + + L + + H Q Sbjct: 194 YAVAVDQPSLHLTLGVDCQTGIDFVEWLTEELQENPL---WRQSLPLLNSTHRQAC-SQH 249 Query: 247 MDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRL 306 + L + + + E ++ E Q + P ++ K+ + Sbjct: 250 LRDLIQKWIGQLETDEFIDRYLDEQFIQGQPSTQFGFPAQVGFS--LFPEDKKTQFYRPP 307 Query: 307 GGLRVLRIGDD----VYANGE-KIDSPHRPALDALASNIALTAENFGDAL----EDPSFL 357 +++ + +D AN + R ++ L + + + L + Sbjct: 308 KPVKITNLPEDHIEICTANKRITLKGISREVIERLFQQTRFSGIDLCNWLPEFDWEADIC 367 Query: 358 AMLAALVNSGYWFFE 372 AM+ L+ SG E Sbjct: 368 AMMTQLILSGIILVE 382 >UniRef50_B5W5P2 Cupin 4 family protein n=2 Tax=Arthrospira RepID=B5W5P2_SPIMA Length = 387 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. 
Identities = 72/388 (18%), Positives = 138/388 (35%), Gaps = 32/388 (8%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQD-G 59 E L +F +++W ++ V++ + F D S +L L + + G Sbjct: 6 ELLHPLPISEFFDKYWTEKSVLIPGANHQKFADLFSWQKLNNLLNYYPLKHPEIRLAKTG 65 Query: 60 KWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 + E + +L++ ++ E A ++ R R Sbjct: 66 ETLPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPGH 125 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE-AIIDEELEPGDI 178 G H D ++VFI+Q +GR+ WRV + + P ID + PGD+ Sbjct: 126 QGFACHYDSHEVFILQISGRKHWRVFSDTFIYPLSENRSSQFSPPDTQPYIDAIINPGDL 185 Query: 179 LYIPPGFPHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 LYIP G H A E +++ ++G + + Q + + + ++ Sbjct: 186 LYIPRGHWHYAIAIDEPSLHLTLGIDCQTGIDFSDWLTSQLQQHP---QWRKNLPLLNKS 242 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPE------------ 285 H + Q + L + LE++ + ++ E + Q + +L + P Sbjct: 243 H-RENCRQHLQNLVQNWLEILESEDLINRYLDEQLLQGQPDLQLGFPSQIGYDIFPQGQE 301 Query: 286 ----PPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIA 341 P QP I G+ ++ GG ++ G D + + S LD Sbjct: 302 TKFYRPQQPVYITQLTPTGKFEIKTGGKKISLTGLDQHILEKIFTSTEFSGLDI------ 355 Query: 342 LTAENFGDALEDPSFLAMLAALVNSGYW 369 + D D + +L+ LV +G Sbjct: 356 --QQWLQDFDWDTEIVPLLSRLVKAGIL 381 >UniRef50_UPI0000E4684D PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4684D Length = 511 Score = 226 bits (577), Expect = 9e-58, Method: Composition-based stats. 
Identities = 61/331 (18%), Positives = 124/331 (37%), Gaps = 18/331 (5%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNN---FIDPISPDELAGLAMESEV----DSRLVSHQD 58 + F+E +W+K+P+V+ + F + L GL E ++ D + ++ Sbjct: 70 PMKIETFMEEYWEKQPIVISNREKHRDYFQSLFTRTILEGLVAEKKISFIQDCNVCRYKG 129 Query: 59 GKWQVSHG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 + +G P + + L + ++ + E L+ + + Sbjct: 130 EVRESLNGNGIVKPTKLKELLDQDKATIQFHQPQRFQESVWNLLEKLESYFGCLVGSNIY 189 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G+ PH D +VF++Q G + WR+ + + DL Q + E D L Sbjct: 190 MTPKLSQGLAPHYDDVEVFVLQLEGEKHWRLYKPPTLLPRDYSRDLDQSELGEPTHDIVL 249 Query: 174 EPGDILYIPPGFPHEG---YALENAMNYSV-GFRAPNTRELISGFADYVLQRELGGNYYS 229 + GD++Y P G H+ ++ + ++ ++ + +L+S ++Q + N Sbjct: 250 KAGDLMYFPRGTVHQADTPSTCSHSTHLTISTYQRSSWGDLLSIALPSMIQTAISENVSY 309 Query: 230 DPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQ 289 +P R + M +LI+Q + G+ + IA PP+ Sbjct: 310 RQGLPLRFVSDPNHQASSSETGRKMADLISQLSSHIESHGQEAANEMLCDFIANRLPPFC 369 Query: 290 PDEIYDALKQGEVLVRLGGLRVLRIGDDVYA 320 E D +G + +R LR D + Sbjct: 370 DGE-TDLAPRGPMPSSDQQVR-LRFPDHTFI 398 >UniRef50_Q10ZZ1 Cupin 4 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZZ1_TRIEI Length = 385 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. 
Identities = 70/382 (18%), Positives = 135/382 (35%), Gaps = 22/382 (5%) Query: 6 TLNWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 L +FLE +W K+ + + G +F D S ++L L ++ V + Sbjct: 10 PLKQEEFLENNWTKKAIAISNKGEKDFTDLFSWEKLNYLLNFHQIKYPDVRLAFDGKVLE 69 Query: 65 HGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGP 124 ++ E +L++ ++ A + G P Sbjct: 70 EKENRNFTQWCEKGATLILDQIHRRIPEVAIFTSKLSYELGYPTQVNAYCSWSSKKGFSP 129 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP--FEAIIDEELEPGDILYIP 182 H D +DVFI+Q G ++W V K P+ P EA + L PGD+LYIP Sbjct: 130 HYDTHDVFILQVEGNKQWYVYN-DTFKYPLPNQKSSSFTPPEKEAYLSCILHPGDVLYIP 188 Query: 183 PGFPHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 G H E +++ ++G + +L+ + RE + + + + Sbjct: 189 RGHWHYAVTKEEPSIHLTLGIHSSTGVDLLEWLIGQLQYRE---EWRTSLALRIDDTSFN 245 Query: 242 VLPQEMDKLREMMLELINQ---PEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALK 298 V ++ L + + E IN E + + ++ + ++ + D Sbjct: 246 VS---VENLIKDLKEYINNHNISEEYNNYLDGL-AKPFEQYNLPYQAGFHIFHRDIDTKF 301 Query: 299 QGEVLVRLGGLRVLRIGDD-VYANGEKIDSPHRP--ALDALASNIALTAEN----FGDAL 351 + RL ++ + +G+++ P + L S T ++ D Sbjct: 302 KVSQFQRLKISKMADDDGYKILVSGKEVSIRGVPEYLVKNLFSRETFTGKDIINLLPDYD 361 Query: 352 EDPSFLAMLAALVNSGYWFFEG 373 + + ML+ LVN F E Sbjct: 362 WEIDIMPMLSKLVNERVIFVES 383 >UniRef50_Q31RB4 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31RB4_SYNE7 Length = 428 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. 
Identities = 72/380 (18%), Positives = 130/380 (34%), Gaps = 28/380 (7%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESE-VDSRLVSHQDGKWQVSHGPFES 70 FL +W ++ V + F S + L L +S L +DG+ + Sbjct: 16 FLSHYWAQQSVYIAGDSLRFQSLFSWNHLNDLLNYQTFRESELRFSRDGESLPAGDNPTL 75 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 + + +L++ V+H L R+ +R + S G H D +D Sbjct: 76 WRSRLQEGATLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHYDTHD 135 Query: 131 VFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE-AIIDEELEPGDILYIPPGFPHEG 189 V I+Q G + W + + P ++ P E + + L PGD+LYIP G H Sbjct: 136 VLILQIEGEKEWLIYPETLPYPTADQPSYDRLPPEEPPYLQQVLSPGDLLYIPRGHWHYA 195 Query: 190 YALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH---PADVLPQ 245 A E +++ ++G + ++ + + +P L Sbjct: 196 IAQETASLHLTIGIHTATGLDWVNWLQQQLRDQPH-----WRQGLPLAGSCNFDPLKLRG 250 Query: 246 EMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI--------APPEPPYQPDEIYDAL 297 ++ LR+ ++ + +P+ + Q + L I P I+ L Sbjct: 251 HLESLRDQLITYLQEPQAIDDYLQYLSWQDQPHLPIQLPLQLHGDPLAQGLLGKFIWSPL 310 Query: 298 KQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSF- 356 + ++VL + G I R L S T + GD D F Sbjct: 311 HSLQWQAEDDQIKVLIGSKQMVFKGLPISLAMR-----LFSCRQFTLMDLGDWAPDLDFE 365 Query: 357 ---LAMLAALVNSGYWFFEG 373 +L L+ +G F E Sbjct: 366 SAIAPLLQKLILAGILFVEA 385 >UniRef50_B4R4H1 Lysine-specific demethylase NO66 n=2 Tax=melanogaster subgroup RepID=NO66_DROSI Length = 847 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. 
Identities = 68/433 (15%), Positives = 140/433 (32%), Gaps = 66/433 (15%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAMESEVDS----RLVSHQ 57 + F + W++ +++R F IS L + + +D + +++ Sbjct: 414 IIFPIKPNFFFKYFWEQTACLVQRTNPKYFQSLISFKMLDEILIRHHLDFTVNLDVTTYK 473 Query: 58 DGKWQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 +GK + + + S+ + + + + +E +++ M Sbjct: 474 NGKRETLNPEGRALPPAVWGFYSEGCSIRLLNPSAYLTRLREVCTVLQEFFHCKVEANMY 533 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEAIID 170 G PH D + F+IQ GR+RW + E + Q + IID Sbjct: 534 LTPPNSQGFAPHYDDIEAFVIQVEGRKRWLLYEPPKEADHLARISSGNYDQEQLGKPIID 593 Query: 171 EELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNY 227 E L GD+LY P G H+ E + + ++ L+ VL++ + + Sbjct: 594 EVLSAGDVLYFPRGTVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSV 653 Query: 228 YSDPDVPP----------RAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSR 276 +P +A+ ++ +++++ + LI + + + + + Sbjct: 654 ALRRGLPLHTFQVLGNAYKANDCGSRQLLVENVQKLVTKYLIPSEDDIDEAVDQMAKKFQ 713 Query: 277 HELDIAPPEPPYQPDEIYDA--------------------LKQGEVLVRLGGLRVLRIGD 316 HE A P +E+ K L+R LR++ D Sbjct: 714 HE---ALPPIVLPSEEVRTVHGARSGADEQGNCVCDYKFNEKTSVRLLRANILRLVTEPD 770 Query: 317 ---DVYA---NG----------EKIDSPHRPALDALASNIALTAE-NFGDALEDPSFLAM 359 +Y NG +I A++ L S + + + Sbjct: 771 GSVRIYHHADNGLDYCKYEPYFMEILPEEAKAVELLISAYPYYLTIDQLPLKSSARKVEV 830 Query: 360 LAALVNSGYWFFE 372 AL G E Sbjct: 831 ATALWEHGLLMTE 843 >UniRef50_A2W941 Transcription factor jumonji n=1 Tax=Burkholderia dolosa AUO158 RepID=A2W941_9BURK Length = 360 Score = 220 bits (562), Expect = 5e-56, Method: Composition-based stats. 
Identities = 80/248 (32%), Positives = 122/248 (49%), Gaps = 7/248 (2%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSH 65 L F+ R+WQK+P+++++ P++ D L LA + + +SRL++H KWQ++H Sbjct: 48 GLTPAQFMRRYWQKKPLLIRQAIPGVASPVTRDALFELAADYDAESRLITHFRNKWQLTH 107 Query: 66 GPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123 GPFE S + W+LLVQ ++ + AL+ FR +PD R+DDLMIS++ GGGVG Sbjct: 108 GPFEPGSLPAVTRRAWTLLVQGLDLHVDAARALLDRFRFIPDARLDDLMISYATDGGGVG 167 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPP 183 PH D YDVF++Q GRRRWR+G + C L + FE + G ILY+PP Sbjct: 168 PHFDSYDVFLLQVEGRRRWRIGAQTDC--RCSRRALKILRHFEPATNGCWNRGAILYLPP 225 Query: 184 GFPHEGYALENAMN--YSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPAD 241 H+G A M+ F +Y+ +R + + Sbjct: 226 HSAHDGVADGE-MHDLLDRLFVRHRPVNSAPNSCNYLAERGGLRDEAWRRALSRPEAAGS 284 Query: 242 VLPQEMDK 249 + Sbjct: 285 RHARATAA 292 >UniRef50_B0WMG3 Lysine-specific demethylase NO66 n=2 Tax=Culicini RepID=NO66_CULQU Length = 648 Score = 220 bits (561), Expect = 6e-56, Method: Composition-based stats. 
Identities = 48/328 (14%), Positives = 111/328 (33%), Gaps = 23/328 (7%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEVDS----RLVSHQDGK 60 + +F+ + W+K+P +++R + + +S ++ + + ++ + S+++G Sbjct: 226 STTVDEFMAQFWEKKPFLVQRNDPTYYANLLSRGKIDEMLRNNNIEYTKNLDVTSYREGV 285 Query: 61 WQVSHGPFE----SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + E S+ + + + +E Sbjct: 286 RETHNPDGRALPPDVWAFYEEGCSIRMLNPQTYLPGVYEMNVKLQEFFHCMTGSNFYLTP 345 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIIDEEL 173 G PH D + F++Q GR+ W++ + P+ Q + I++ L Sbjct: 346 PNSQGFAPHYDDIEAFVLQVEGRKHWKLYSPRTASEVLARVSSPNFTQEEIGVPILEVTL 405 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSD 230 EPGD+LY P G H+ + + V + +L+ + + L + + Sbjct: 406 EPGDLLYFPRGIIHQASTVPGHHSLHVTMSVYQKNSWADLLELYLPHALSQAAENHLELR 465 Query: 231 PDVPP--RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY 288 +P H V R+ +++ I + S+ ++ + + Sbjct: 466 RGIPQDLHQHFGIVHSDNETPTRKDLIKKIKSL------VDKIFSEEAIDVAVDQLAKRF 519 Query: 289 QPDEIYDALKQGEVLVRLGGLRVLRIGD 316 Q D + L E G D Sbjct: 520 QHDALPPLLTDQERAQTAYGANYAFNPD 547 >UniRef50_B4JMQ2 Lysine-specific demethylase NO66 n=1 Tax=Drosophila grimshawi RepID=NO66_DROGR Length = 723 Score = 220 bits (560), Expect = 8e-56, Method: Composition-based stats. 
Identities = 55/336 (16%), Positives = 114/336 (33%), Gaps = 31/336 (9%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVD----SRLVSHQDGK 60 L+ DF R+W+ + +KR + + D +S + + + +E+ ++ + S++DG Sbjct: 291 PLSLDDFFSRYWESKACQVKRKRKDLYSDLVSFEMIDEMLIENHLEFTTNIDVTSYKDGV 350 Query: 61 WQVSH-----GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 Q + P + H S+ + + + + + +E + + Sbjct: 351 RQTHNPDGRAMPPTVWGH-YSDGCSVRILNPSTYLKGLRGVCAALQEHFHCLVGANVYLT 409 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVG---EKLQMKQHCPHPDLLQVDPFEAIIDEE 172 G PH D + F++Q GR+RWR+ + +L Q + I DE Sbjct: 410 PPNSQGFAPHYDDIEAFVLQVEGRKRWRLYDAPSPNDVLARTSSGNLKQQQLSKPIFDEV 469 Query: 173 LEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYS 229 LE GD+LY P G H+ + + + ++ + L+ VLQ + N Sbjct: 470 LEAGDLLYFPRGCVHQAVTEQQHHSLHITLSVYQQQSYANLMEALMPAVLQNAIKHNLDM 529 Query: 230 DPDVP----------PRAHPADVLPQEMDKLREMMLELIN-QPEHFKQWFGEFISQSRHE 278 +P + + + + + + + +HE Sbjct: 530 RRGLPLGTWHHLGMVHGDKKTKERSDLITHTQSLFSKYLAPTASQIDAAVDQLAIRFQHE 589 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRI 314 A P ++ + G R Sbjct: 590 ---ALPPRIASSEKKRTVFGSRNKKDKHGNCRCDYD 622 >UniRef50_A4U3D3 MYC induced nuclear antigen n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3D3_9PROT Length = 390 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. 
Identities = 69/380 (18%), Positives = 136/380 (35%), Gaps = 24/380 (6%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFI-DPIS---PDELAGLAMESEVDSRLVSHQD--- 58 + +FL +W+K+P+++KR F D +S D++ + D R+ D Sbjct: 12 PITPHEFLAEYWEKKPLLVKRAAPGFYRDLLSVQAIDQVLAMPGLHRRDIRVARGTDPLA 71 Query: 59 -GKWQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 ++ G S L +++++ +N P A + R F ++ + Sbjct: 72 VEEYADKDGFINAASLSRLFTDGFTIILNTLNLKLRPLAEICRAFEQVLSIPCQTNIYYT 131 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELE 174 G PH D +DVF+ Q GR+ W V + +++ + +P + ++ +LE Sbjct: 132 PRLAQGFKPHYDSHDVFVFQVAGRKHWLVNDTPVELPLRGQGFEAGLYEPGDVTMEFDLE 191 Query: 175 PGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 PGD+LYIP G H + +++ ++G + E++ ++ Sbjct: 192 PGDLLYIPRGVMHGARTSDEVSLHITLGALTTSWAEVLLEAVAAAALTDVELRRNLPAGY 251 Query: 234 PPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 + A Q L + E I+ + G ++ R L E + Sbjct: 252 ALPGYDAQAARQTFASLLNRVAENIDVESILDVYRGALQARRRPYL-----ENSVTRLDG 306 Query: 294 YDALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSPH--RPALDALASNIALTAENF 347 + + + L + + G I P PAL Sbjct: 307 LSQISAADSAEPVPNLLYSLTESEGRASLACFGRTITVPDFAAPALAHAVKGTRFRVGEL 366 Query: 348 GDALEDPSFLAMLAALVNSG 367 L++ +A++ LV G Sbjct: 367 -PGLDEDGSVALVRRLVLEG 385 >UniRef50_A3Q8B6 Cupin 4 family protein n=4 Tax=Mycobacterium RepID=A3Q8B6_MYCSJ Length = 404 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. 
Identities = 71/403 (17%), Positives = 140/403 (34%), Gaps = 47/403 (11%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGF--NNFIDPISPDELAGLAMESEVDSRLVSHQD 58 + + + F +W +RP++ + G +F D +SP + L E V + + Sbjct: 2 LSRCIATDPHTFATEYWGRRPLLSRSGALPRDFADLLSPGMVDELIAERGVRAPFIRLAK 61 Query: 59 GK----WQVSHGPFESYDHLGET------------NWSLLVQAVNHWHEPTAALMRPFRE 102 GP + + ++++Q ++ P L+R + Sbjct: 62 EGDVLAKDCYLGPAGFGAEMPDQVDSAKVLTQFSAGATIVMQGLHRLWPPVIDLVRHLVD 121 Query: 103 LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP----- 157 + G PH D +DVF++Q G +RW V E + P Sbjct: 122 DLGHPVQANAYITPPSNRGFDPHYDVHDVFVLQTAGEKRWVVHEPVHPHPLPSQPWTQHR 181 Query: 158 -DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFA 215 + + E +ID L PGD LY+P G+ H +AL+ +++ ++G A ++ Sbjct: 182 DAIAERAAGEPVIDTVLAPGDALYLPRGWVHSAHALDTTSIHLTIGVSAVTGVDVARAVV 241 Query: 216 DYVLQRELGGNYYSDPDVPPRAHPA---DVLPQEMDKLREMMLELINQPEHFKQWFGEFI 272 D + +P PA +++ + +M+ + + + + Sbjct: 242 DALADSAA-----FRAPLPMGGDPADRDEIIAAVTKVMAQMVETMRDDATALSGAAADRL 296 Query: 273 SQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLG--------GLRV-LRIGDDVYANGE 323 +++ P + V R G G R+ LR+ D V Sbjct: 297 TRTHASRTRPVAVRPLATLDAAAHADTTTVQWRHGLVATADTAGDRIELRLTDRVL---- 352 Query: 324 KIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNS 366 + PA+ AL +A A + L+ ++ L+ Sbjct: 353 SFPASCAPAVLALQRGLAADAGSL-PGLDRADGTVLIRRLLRE 394 >UniRef50_D2A374 Putative uncharacterized protein GLEAN_07936 n=1 Tax=Tribolium castaneum RepID=D2A374_TRICA Length = 568 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. 
Identities = 58/425 (13%), Positives = 138/425 (32%), Gaps = 54/425 (12%) Query: 2 EYQL-TLNWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEV----DSRLVS 55 E+ + L+ F + +W+++P+ +KRG ++ + L + + + + +V+ Sbjct: 136 EWLIQPLSPASFFKTYWEQKPLYIKRGNRSYYTHILDSSSLDKILRNNSLFFTRNVDVVT 195 Query: 56 HQDGKWQVSHG----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 +++G+ QV + + S+ V ++ L+ +E + Sbjct: 196 YENGEKQVFNQEGRATPSALWDYYGNGCSIRVLNPQTYNHKVHLLLATLQEYFGTMVGAN 255 Query: 112 MISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEK--LQMKQHCPHPDLLQVDPFEAII 169 + G PH D + F++Q GR+ W++ + + P+ + D E + Sbjct: 256 VYLTPPGSQGFAPHYDDIEAFVVQLEGRKHWKLYQPKSEDVLARFSSPNFKREDLGEPFM 315 Query: 170 DEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGN 226 + L G++LY P G HEG E++ + + + +L+ L++ + Sbjct: 316 ELTLNAGELLYFPRGTIHEGRTDEDSHSLHITVSVYQQTSYVDLLEHILPKALKKAADSD 375 Query: 227 YYSDPDVP---PRAHPADVLPQEMDKLREMMLELINQPE---HFKQWFGEFISQSRHELD 280 +P + + + + +L+N + + H+ Sbjct: 376 VEFRKGLPLNYLKDFGLAAGSKNRKFVTSKIKDLMNSLINYVDIDSAADQLGKKFMHDSM 435 Query: 281 IAPP---EPPYQPDEIYDALKQGEVLVR-----------------------LGGLRVLRI 314 E LK G V R ++ Sbjct: 436 PPVLTKSEAERSSKADGPVLKDGVVKNRVEIDLDTRVRLLRYYCIRLVCEENSSPKLYYN 495 Query: 315 GDDVYA-NGE-----KIDSPHRPALDALAS-NIALTAENFGDALEDPSFLAMLAALVNSG 367 + +GE +++ P + L + + + + ++ L G Sbjct: 496 TQNATVYHGEEEQWLELEPSMVPLIQMLQNTYPRYIEVEKLPLDDMATKMQLVLDLWEHG 555 Query: 368 YWFFE 372 E Sbjct: 556 LLVTE 560 >UniRef50_Q54K96 Lysine-specific demethylase NO66 n=1 Tax=Dictyostelium discoideum RepID=NO66_DICDI Length = 514 Score = 217 bits (552), Expect = 7e-55, Method: Composition-based stats. 
Identities = 64/416 (15%), Positives = 143/416 (34%), Gaps = 55/416 (13%) Query: 9 WPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLA----MESEVDSRLVSHQDGKWQV 63 DF ++++ ++ + +KR +N + + + D L + M+ + + ++ D + Sbjct: 101 IEDFYDQYFGQKHLYVKRNGDNIYKNFFTKDSLDKMLRNNLMKFTENVDVTNYVDFQRIT 160 Query: 64 SHGPFESYDHL----GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 + +Y L + S+ + ++ L + + + Sbjct: 161 LNPEGRAYPSLVWKHYKEGCSVRLLNPQTFNSNVWKLCSTLQTHFQCGVGANIYLTPAGA 220 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---HPDLLQVDPFEAIIDEELEPG 176 G PH D DVFI+Q G++ WR+ + + P + Q + E LE G Sbjct: 221 QGFAPHYDDVDVFILQLEGKKEWRLYKPRDANEVLPKKSSENFTQEEIGEPYFTVTLEAG 280 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDV 233 D+LY P G H+ + + + + +LI + L+ + Sbjct: 281 DLLYFPRGVIHQAVSPSDVHSLHITVSTYLNNTWGDLIGKVLNRALEIANEECLEFREGL 340 Query: 234 PP--RAHPADVLPQEM-DKLREMMLEL---INQP--EHFKQWFGEFISQSRHELDIAPPE 285 P + + ++ D+ R+ + + + + G ++ LD PP Sbjct: 341 PRDYTQYLGVIHSDKVGDERRKELTDKVGTLWDKLGQLLPIDIGADQMAVKYLLDSLPP- 399 Query: 286 PPYQPDEIYDALKQGE-----------VLVRLGGLRVLRIGDD-VYAN----------GE 323 E +++ L+R +R++ ++ N GE Sbjct: 400 -VLTQLEKKHSIEDETTSMKIKPETRFRLIRADSVRLVVEDIAILFHNADNTRIYHQVGE 458 Query: 324 -----KIDSPHRPALDALASNIA--LTAENFGDALEDPSFLAMLAALVNSGYWFFE 372 + AL+ + + + ++ +D L +++AL G FE Sbjct: 459 EPGVVEFTLECVDALEHIIDSYPSYIYTKDL-PIEDDDQKLDVVSALYEKGLIMFE 513 >UniRef50_C8XBP6 Cupin 4 family protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XBP6_NAKMY Length = 452 Score = 216 bits (551), Expect = 9e-55, Method: Composition-based stats. 
Identities = 71/402 (17%), Positives = 126/402 (31%), Gaps = 42/402 (10%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRG--FNNFIDPISPDELAGLAMESEVDSRLVSHQD 58 ++ + ++ DF +R+W + P++ ++F D S D + L E + + + Sbjct: 23 VQRCIAIDADDFAQRYWAQAPLLTTAAELNDDFSDLFSADSVDELVSERGLRTPFLRMAK 82 Query: 59 GKWQVSHGPFESYDHLGET----------------NWSLLVQAVNHWHEPTAALMRPFRE 102 +S F G T +L++QA++ P Sbjct: 83 NGSVLSSASFTRGGGAGATITDQVADDKVLAQLAGGATLVLQALHRTWPPLVRFGSELAA 142 Query: 103 LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC------PH 156 + G H D +DVF++Q G + WR+ E + Sbjct: 143 ELGHPVQINAYITPPQNQGFASHYDTHDVFVLQIAGTKHWRIHEPVLPDPLPHQTWDGRR 202 Query: 157 PDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFA 215 + ID L PGD LY+P G+ H A +++ ++G +L Sbjct: 203 AQVQDRAAQAPAIDALLRPGDALYLPRGYLHSAVAQGELSIHLTIGVHPLTGYDLARELI 262 Query: 216 DYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQS 275 + +P DV LR+ L+++ ++ Sbjct: 263 -----AAAEDDPELRRSLPMGVDVTDVDA-MATHLRQAAQRLVDRLGQAGPELYRAAARR 316 Query: 276 RHELDI----APPEPPYQPDEIYDALKQGEVLVRLGGLRV-LRIGDDVYANGE-----KI 325 + P P L LV GLR LR + + Sbjct: 317 VGPQQVGQTRPAPIAPLAQLRAAATLDPQTPLVLRPGLRPRLRQQGEKWVLSLIDSTVSW 376 Query: 326 DSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 AL + S A TA+ + L+D L + L+ G Sbjct: 377 PEQVHAALLIVLSGKAFTADELPN-LDDAEQLVVARRLLREG 417 >UniRef50_D0L9V4 Cupin 4 family protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L9V4_GORB4 Length = 414 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 65/400 (16%), Positives = 132/400 (33%), Gaps = 37/400 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGF--NNFIDPISPDELAGLAMESEVDSRLVSHQD 58 + + F +++W +RP++ + +F D +S + L E V + + Sbjct: 2 LSRCTATDLTTFADQYWGRRPMLSRAASLPADFGDLLSVRAVDELIAERGVRAPFIRMAK 61 Query: 59 GK----WQVSHGPFESYDHLGET------------NWSLLVQAVNHWHEPTAALMRPFRE 102 GP + + ++++Q ++ P +R + Sbjct: 62 EGVVLARDCYLGPAGFGAEMPDQVDPAGVLREFAAGATIVLQGLHRLWPPVIDFVRAMVD 121 Query: 103 LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP----- 157 + G PH D +DVF++Q G +RWRV + P Sbjct: 122 DLGHPVQANAYVTPPGNRGFDPHYDVHDVFVLQVAGTKRWRVHRPVHTHPLATQPWTDHR 181 Query: 158 --DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGF 214 + I+ L PGD LY+P G+ H AL + +++ ++G A R++++ Sbjct: 182 AQIERRASDDAPEIEAVLSPGDALYLPRGWIHSADALGDTSIHLTIGVGAVTVRDVVAAI 241 Query: 215 ADYVLQ-RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFIS 273 + E + D+ R + + M + E + + + ++ Sbjct: 242 VAELDDCAEFRQSLPLGIDLTGRDQTVPIATKAMAAVVERLRDH---AADVGEGAAARLA 298 Query: 274 QSRHELDIAPPEPPYQPDEIYDALKQGEVLVRL----GGLRVLRIGDDVYANGEKIDSPH 329 + P P E + L LR + ++ I +P Sbjct: 299 RRHTARTRPVPVRPLATLEAIGVVNAATRLRLRHGLVPTLRRVADRAELLTGERSISTPG 358 Query: 330 --RPALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 AL + S ++A L+ +L L+ G Sbjct: 359 YCLEALRTIRSGEIVSAAEL-PGLDAADGTVLLRRLLAEG 397 >UniRef50_Q7K4H4 Lysine-specific demethylase NO66 n=2 Tax=melanogaster subgroup RepID=NO66_DROME Length = 653 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 69/433 (15%), Positives = 141/433 (32%), Gaps = 66/433 (15%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFN-NFIDPISPDELAGLAMESE----VDSRLVSHQ 57 + F + W+ +++R F IS L + + V+ + +++ Sbjct: 220 ILFPVQTKVFFKDFWEHTACLVQRSNPKYFQSMISFKMLDEILIRHHLDFTVNVDVTTYK 279 Query: 58 DGKWQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 +GK + + + S+ + + + + +E ++ + Sbjct: 280 NGKRETLNPEGRALPPAVWGFYSDGCSIRLLNPSTYLIRLRQVCTVLQEFFHCKVGANLY 339 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEAIID 170 G PH D + F+IQ GR+RW + E + Q + IID Sbjct: 340 LTPPNSQGFAPHYDDIEAFVIQVEGRKRWLLYEPPKKADQLARISSGNYDQEQLGKPIID 399 Query: 171 EELEPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNY 227 E L GD+LY P G H+ E + + ++ L+ VL++ + + Sbjct: 400 EVLSAGDVLYFPRGAVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSV 459 Query: 228 YSDPDVPP----------RAHPADVLPQEMDKLREMMLE-LINQPEHFKQWFGEFISQSR 276 +P + + Q ++ +++++ L+ + + + + + Sbjct: 460 ALRRGLPLHTFQVLGNAYKGNDCGSRKQLVENVQKLVTNYLMPSEDDIDEAVDQMAKKFQ 519 Query: 277 HELDIAPPEPPYQPDEIY-------DALKQG-------------EVLVRLGGLRVLRIGD 316 HE A P +E+ DA +QG L+R LR++ D Sbjct: 520 HE---ALPPIVLPSEEVRTVHGARSDADEQGNCVCDYKFNKKTSVRLLRANILRLVTESD 576 Query: 317 ---DVYA---NG----------EKIDSPHRPALDALASNIALTAE-NFGDALEDPSFLAM 359 +Y NG +I A++ L S + + + Sbjct: 577 GSVRIYHHVDNGLDYCKYEPYFMEILPEEAKAVELLISAYPFYLTIDQLPLESSARKIEV 636 Query: 360 LAALVNSGYWFFE 372 AL G E Sbjct: 637 ATALWEHGLLMTE 649 >UniRef50_UPI000192663F PREDICTED: similar to Myc-induced nuclear antigen n=1 Tax=Hydra magnipapillata RepID=UPI000192663F Length = 437 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. 
Identities = 44/242 (18%), Positives = 94/242 (38%), Gaps = 13/242 (5%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLA----MESEVDSRLVSHQDGK 60 ++ F E W+K+P+ +KR + + D S + + +E E D + + D + Sbjct: 49 PISVKTFFEEFWEKKPLYIKRENSGYYGDLFSLSSMKEILAAHELEFETDVNVCRYVDNE 108 Query: 61 WQVSHG----PFESYDHL-GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 ++ + + +D L + + + + + LM + + Sbjct: 109 KELLNEDGCLTVDKFDKLMNDKHATFQLHQPQRYGTVLWQLMEKMETYFGCLVGSNVYIT 168 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 G+ PH D +VFI+Q G + W++ + + DL Q + I++ LEP Sbjct: 169 PKESQGLAPHCDDVEVFILQLEGTKHWKLYKPMVELSRDYTQDLSQDSIGKPIMELTLEP 228 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPD 232 GD+LY P G H+ ++ + + + + +S ++ L + Sbjct: 229 GDLLYFPRGTIHQARSVGESYSTHITLSTYQNNTLGDFMSIAVSQAIESALENDVSFRRG 288 Query: 233 VP 234 +P Sbjct: 289 LP 290 >UniRef50_D0NRY0 Nucleolar protein, putative n=2 Tax=Phytophthora infestans T30-4 RepID=D0NRY0_PHYIN Length = 676 Score = 215 bits (547), Expect = 3e-54, Method: Composition-based stats. 
Identities = 63/420 (15%), Positives = 134/420 (31%), Gaps = 58/420 (13%) Query: 1 MEYQL-TLNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLA----MESEVDSRLV 54 + + L + +F E +W++RP+ +KR F ++ D S E+ + +E D L Sbjct: 62 LTWLLYPVTPEEFYENYWEQRPLAIKRNFPSYYDGWFSKQEIDRILKTHTLEYGTDVDLT 121 Query: 55 SHQDGKWQVSHGP----FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + D + P + + S+ + + + L+ + Sbjct: 122 KYVDDTRHTLNPPGSATAKQVWKHYDDGCSVRLLCPQKFSDDVWKLLATLEDEWGCMAGA 181 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEK---LQMKQHCPHPDLLQVDPFEA 167 G PH D + F++Q G + W+V + + P + D + Sbjct: 182 NTYLTPKNTQGFAPHFDDIEAFLLQTEGCKHWKVYKPLNESDVLARYPSGNFKAEDLGKP 241 Query: 168 IIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNY 227 ++ +LE GD+LY P GF H+ A + + + + + F + +L + L G Sbjct: 242 TLEVDLEQGDLLYFPRGFIHQARAHKEKHSLHLTVST-GQQNTMGNFLEVLLPQALAGAI 300 Query: 228 YSDPDVPPRA-----------HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSR 276 ++ ++ H E + + + G + S Sbjct: 301 NTNVELRRSLPRDYLEYMGVMHSDRKGDPERQAFANKLKGALKT--VLGEAMGMLDAASD 358 Query: 277 H-----ELDIAPPEPPYQPDEIYD--------ALKQGEVLVRLGGLRVLRIGD------- 316 LD PP + + + LVR G R++ Sbjct: 359 QMAKNFLLDRLPPALEDEEENCTSDNSPLQKITVNTQLKLVRHGVARLVIEDGKAVLYHC 418 Query: 317 --------DVYANGEKIDSPHRPALDALASNIALTAENFGDALEDP--SFLAMLAALVNS 366 +V + + + +++ + ++ GD + + AL Sbjct: 419 RENSRMHHEVPISPLEFELDDAESIEFILNSYP-DYFRVGDMPHEDPQDQTELAKALYKE 477 >UniRef50_UPI000186D1B6 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D1B6 Length = 467 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 54/359 (15%), Positives = 118/359 (32%), Gaps = 41/359 (11%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLAMESEV----DSRLVSHQ 57 + +F E W+K+P+ + R N + + S + E ++ + + S+ Sbjct: 27 ILSPIKVKEFFENFWEKKPLYISRNNNEYYNELCSMNAFEKALSEKDMYFTKNIDVTSYI 86 Query: 58 DGKWQVSH----GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 DG+ + + S+ + + L +E + + M Sbjct: 87 DGQRFTENLDGKATVSNIWDFFNEGKSIRLLNPQTFIPNVWLLNTNLQEFFNCFVGANMY 146 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIID 170 G PH D + F++Q G++ W+V + + + + + I+ Sbjct: 147 LTPAGTQGFAPHYDDIEAFVLQLEGQKHWKVYNPRDSSEVLARESSKNFKEDEIGKPILK 206 Query: 171 EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNY 227 L+PGD+LY P G+ H+ L + + + + +L LQ+ + + Sbjct: 207 VTLKPGDMLYFPRGYIHQAKCLPDTHSLHLTVSCYQKNSWADLFEKIFPVALQKAIFEDV 266 Query: 228 YSDPDVPP--RAHPADVLPQEMDKLREMMLELINQPEH-------FKQWFGEFISQSRHE 278 +P H + +E ++ R+ ++ N+ + + H+ Sbjct: 267 EFRKGLPIGYLNHMGMMFSEENNEFRQKFIQKFNELGRKVLEMCPIDAGVDQLSTNFIHD 326 Query: 279 -----------LDIAPPEPPYQPDEIYDALKQ------GEVLVRLGGLRVLRIGDDVYA 320 L LK G L+R R++ D++Y Sbjct: 327 ALPPCLTEDEKLYGGDRNVKLPDKNFLHILKNTSNTNLGVKLIRATSARLVCEEDEIYL 385 >UniRef50_UPI0000E45D23 PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D23 Length = 555 Score = 214 bits (544), Expect = 6e-54, Method: Composition-based stats. 
Identities = 66/431 (15%), Positives = 131/431 (30%), Gaps = 70/431 (16%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEV----DSRLVSHQDGK 60 D+ + ++++P+ LKR F D S EL+ + E++V + + ++ DGK Sbjct: 104 PFKVEDYFKNIFERKPLFLKRHKPGYFTDIFSSKELSNILKENDVQFTRNIDVTTYTDGK 163 Query: 61 WQVSHGPFES----YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + + S+ V + L+ +E + + Sbjct: 164 RETHNPTGRAQPQVVWDYYNNGCSVRVLNPQTYSTRVWQLLAALQEFFGCFVGANIYLTP 223 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVG---EKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G PH D + F++Q G++ W++ ++ + D + I+D L Sbjct: 224 PGTQGFAPHYDDIEAFVLQLEGKKHWKLYNQRSPAEVLPRFSSSNFTDADIGQPILDTTL 283 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSD 230 EPGD+LY P G H+ + + A +L+ L N Sbjct: 284 EPGDLLYFPRGVIHQASTPSETHSLHITISACQKNTWGDLMEHVLTRALHVVREENIDFR 343 Query: 231 PDVP----------PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 +P + M K+ ++ +L+ + + H Sbjct: 344 KSLPTDYLNYMGVAHEDLDDERRQPFMKKIGDLFSKLLKSAP-IDAAADQMSLEFFHTSL 402 Query: 281 IAPPEPPYQP--------------------------------DEIYDALKQGEVLVRLG- 307 E + + + ++ +VLV Sbjct: 403 PPVLEEAEESCSVFGESASCRNGKVSGIVQLGEDDQIRLLRRNILRLVPEEDKVLVYHSL 462 Query: 308 -GLRVLRIGDDVYANGEKIDSPHRPALDALASNIA--LTAENFGDAL-----EDPSFLAM 359 RV + + Y +I PA++ L + + N D L E + Sbjct: 463 ENSRVFKEKELQYF---EIPPELAPAVEHLLHSYPDYVPVSNLADQLQPVNDEIADPEDL 519 Query: 360 LAALVNSGYWF 370 L G Sbjct: 520 AQTLYEKGLLM 530 >UniRef50_UPI000180B5EA PREDICTED: similar to Nucleolar protein 66 (hsNO66), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B5EA Length = 594 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. 
Identities = 46/295 (15%), Positives = 98/295 (33%), Gaps = 24/295 (8%) Query: 12 FLERHWQKRPVVLKRGFNNFID-PISPDELAGLAME----SEVDSRLVSHQDGKWQVSHG 66 F + W+ RP+++ R + D S E+ + E V+ + ++Q+G+ + + Sbjct: 161 FFKDIWESRPLLVLRHCPRYADGLFSTKEMNRILNECNVRYSVNLDVTTYQNGRRETHNI 220 Query: 67 PFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 +Y + S+ ++ + +P L +E + G Sbjct: 221 DGRAYAPVVWDYFKNGCSIRLKNPQAFSKPVWRLCATLQEFFKCMVGANTYLTPPGTQGF 280 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 PH D + F++Q G++ W + + + + + I + LE G++L Sbjct: 281 APHYDDIEAFVLQLEGKKEWTLYSPRSGKETLPRYSSGNFTADEIGDEIFTQTLEAGNLL 340 Query: 180 YIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDVPP- 235 Y P G+ H+ AL + + V + +L+ LQ + + +P Sbjct: 341 YFPRGYIHQAKALPDTHSLHVTISMYQRNSWGDLLEKLLPTTLQNAIIDDVEFRKGLPLD 400 Query: 236 -----RAHPADVLPQEMDKLREMMLEL---INQPEHFKQWFGEFISQSRHELDIA 282 D P + +L + E H+ Sbjct: 401 YLSLFGEQNCDKHPDRRRAFMGKISDLFRRLADKVDIDAAADEHGINFLHDAMPP 455 >UniRef50_A9TET4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TET4_PHYPA Length = 530 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. 
Identities = 69/423 (16%), Positives = 134/423 (31%), Gaps = 54/423 (12%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVLKR--GFNNFIDPISPDELAGLAMESEV----DSRL 53 +E+ ++ + F W+K+P +++R N + + L E E+ + + Sbjct: 105 LEWAISPIKLDRFQGEFWEKKPFLVRRPKNRNYYAGIFDKATIEKLLEEHELKYGLNIDV 164 Query: 54 VSHQ-DGKWQV----SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRI 108 + DG WS+ + W +P ++ F Sbjct: 165 TKYDIDGGRSTFSSEGSATPSKVWSKYADGWSVRILHPQRWCDPVFLILSAFERFWGSVA 224 Query: 109 DDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPF 165 G PH D + F+IQ GR+RW+V + + P+ Q + Sbjct: 225 GCNAYLTPAGSQGFSPHYDDIEAFVIQTEGRKRWKVYKPRTPGEALPRFSSPNFEQGEIG 284 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF---RAPNTRELISGFADYVLQRE 222 E I+D +LEPGDILY+P G H+ A E+A + + + + + L+ Sbjct: 285 EPILDVDLEPGDILYMPRGTIHQAKASEDAHSLHITVSVGQRNCWGDFLEFAMPRALELA 344 Query: 223 LGGNYYSDPDVPPR-------AHPADVLPQEMDKLREMMLELI-----------NQPEHF 264 + +P AH D + + ++E + + Sbjct: 345 SEDHILLRESLPRGYADYMGVAHSDDHDNPQRAAFIDKIMECMAIVSQSIPWDSAADQLA 404 Query: 265 KQWFGEFISQSRHELDIAPPEPPYQ---------PDEIYDALKQGEVLVRL--GGLRVLR 313 ++ + + PD L+ V+V R L Sbjct: 405 VKFLQSRLPLPAPANAVHGKGQKITGKSRVRLVAPDVARLVLEGDSVVVYHMLKNSRDLH 464 Query: 314 IGDDVYANGEK-------IDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNS 366 D N + P L+ L S + ++ + + +++ L + Sbjct: 465 NEGDTEENEDSDADKRLVFTWEVAPVLEELLSAFPDPVDVSSLSVPEDESIPLISELCDI 524 Query: 367 GYW 369 G Sbjct: 525 GIL 527 >UniRef50_A9UZN8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UZN8_MONBE Length = 432 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. 
Identities = 67/411 (16%), Positives = 133/411 (32%), Gaps = 59/411 (14%) Query: 1 MEYQL-TLNWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGL----AMESEVDSRLV 54 +E+ L ++ F +W+ +P++++R F S +L + ++ V+ + Sbjct: 21 LEWLLDPIDLKTFFSEYWETKPLLIRRKNRQRFKGLFSSQQLDDVIRSNYIKYGVNIDMA 80 Query: 55 SHQDGKWQVSHG----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + DG + + L E S+ + + +P L+ +E + Sbjct: 81 RYSDGVRTTENPEGRVHANTMWALYEDGCSIRMLNPQTYAKPVWQLISTLQEYFQCMVGC 140 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVG--EKLQMKQHCPHPDLLQVDPFEAI 168 G PH D + I+Q G +RWR+ + + Q + E I Sbjct: 141 NTYLTPPGAQGFAPHYDDIEALILQLEGSKRWRLYNNPTGERLPRTSSRNFDQSELSEPI 200 Query: 169 IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPN---TRELISGFADYVLQRELGG 225 +D L+PGD LY P G H+ + + + + E L + Sbjct: 201 LDVVLQPGDFLYFPRGMAHQAVSTPDEHSLHITLSTYQLFDWAEYFKKLVPAALDYAIAE 260 Query: 226 NYYSDPDVPPRA--HPADVLPQE---------MDKLREMMLELINQP-----------EH 263 + +P +A H + + MDK + + +LI+ + Sbjct: 261 DAEFREGLPLQALNHVGLLHSETEGDNQRKRFMDKAKHLFQKLIDVAPYDSAADAVAVDF 320 Query: 264 FKQWFGEFISQSRHEL-----DIAPPEPPYQPDEIYDALKQGEVLVRLGGLR-------- 310 +++ L +A P + A L+R R Sbjct: 321 LHASMPSYLTSEELALTSRQKQLAARSNPVELSSPALAPSDWIRLIRPSMCRLVADSPEI 380 Query: 311 -VLRIGDD---VYANGE----KIDSPHRPALD-ALASNIALTAENFGDALE 352 +L D V+ E + PAL+ L + + + + E Sbjct: 381 VLLYHNQDNAMVWHEEEPQSLEFPVDFAPALEQLLLAQDFVRVDRLPELDE 431 >UniRef50_B0CEG8 Cupin 4 family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CEG8_ACAM1 Length = 416 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. 
Identities = 66/394 (16%), Positives = 136/394 (34%), Gaps = 38/394 (9%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEV-----DSRLVSHQD--- 58 DF + +W+ + + L R +F + P+++ L + + RLV + Sbjct: 26 TIEDFFQTYWETKTLYLPRNDASFYGSVLQPEDIDLLLQNKALLADYNNFRLVDQGNKLS 85 Query: 59 -GKWQVSHGPFESYDHLGETNWSLLVQAV-------NHWHEPTAALMRPFRELPDWRIDD 110 W H + Y + +SLL Q + + +++ Sbjct: 86 LEDWCDRHSKSQQYFINNDKLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRT 145 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVG-EKLQMKQHCPHPDLLQVDPFEAII 169 + G+ PH D++DVFI+Q TG + W++ +++ H + + E + Sbjct: 146 NIYITPPQAQGLAPHYDEHDVFILQITGEKEWKLYHSPVELPSHIRDQSIGRHKLAEPEL 205 Query: 170 DEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYY 228 L+PGD+LYIP G H+ + E +++ S+G EL+ Sbjct: 206 TVMLQPGDLLYIPRGVVHQAASQETTSVHASLGLYPTFAYELLEELVTIAQADPA----- 260 Query: 229 SDPDVPPRAHPADVLP---QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPE 285 +P ++ + L + ++ + E ++ F+ + E + Sbjct: 261 FRKAIPHGFSSSEQQQAFYELFQTLSQYLISNVKTEELVERKHKVFLCDRKSEDQGRFQD 320 Query: 286 PPYQPDEIYDALKQGEVLVRLGGLRVLRIGD------DVYANGEKIDSPHRPALDALASN 339 Y L V+ R + D + Y +L L + Sbjct: 321 LIY-----LPQLNLNSVVARRPNILFSVDRDSTQIVLNFYQKSLTFPIFLATSLTDLIDH 375 Query: 340 IALTAENFGDALEDPSFLAMLAALVNSGYWFFEG 373 L ++ G + D L++ L+ G+ + Sbjct: 376 PYLAVKDIGGLINDAGRLSLAQNLIQEGFLMIKA 409 >UniRef50_Q5ZMM1 Lysine-specific demethylase NO66 n=3 Tax=Eumetazoa RepID=NO66_CHICK Length = 601 Score = 207 bits (528), Expect = 4e-52, Method: Composition-based stats. 
Identities = 53/350 (15%), Positives = 115/350 (32%), Gaps = 35/350 (10%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLAMESEVDS----RLVSHQDGKWQV 63 +F +HW++ P++++RG + S + + +V + S+ +G + Sbjct: 175 PEEFARQHWERAPLLVQRGDPGYYAGLFSTADFDAILRSGDVHFGTHLDVTSYAEGVRET 234 Query: 64 SHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 + + + SL + + + + +E Sbjct: 235 HNPVGRALPAVVWDFYQNGCSLRLLSPQAFSTTVWHFLSILQEHFGSMAGANTYLTPPGT 294 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---HPDLLQVDPFEAIIDEELEPG 176 G PH D + F++Q G++ WRV + P +L Q + E +++ LE G Sbjct: 295 QGFAPHYDDIEAFVLQLEGKKHWRVYGPRTSSEALPQFSSANLTQAELGEPLLEVVLEAG 354 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDV 233 D+LY P GF H+ L +A + + + + + + LQ L + + Sbjct: 355 DLLYFPRGFIHQADCLPDAHSLHITVSSYQRNSWGDFLEKLLPAALQMALEEDLEYRQGL 414 Query: 234 P----------PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAP 283 P ++K++ ++ +L++ + H+ Sbjct: 415 PMDCLGYMGVANSDAVDARRTAFVEKVQHLIKKLVDYAP-IDAAMDQRAKSFLHDCLPPV 473 Query: 284 P---EPPYQPDEIYDALKQGEV------LVRLGGLRVLRIGDDVYANGEK 324 E + G + + +R+L G N E Sbjct: 474 LTQSEKQLSVYGFPARWQDGGPRNVDIQITKDTEVRLLHHGVVRLCNEEA 523 >UniRef50_Q7N884 Similar to unknown protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N884_PHOLL Length = 388 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. 
Identities = 73/392 (18%), Positives = 136/392 (34%), Gaps = 43/392 (10%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDS-RLVSHQDGKW 61 ++ DFLE ++K+P V K+ +++ D I E+ + S + S + Sbjct: 4 INFPIDKKDFLENFFEKKPCVFKKIYDD--DFIKHSEIENIFNRSNLPSFEGIKLMYNGI 61 Query: 62 QVSHGPFESYDHLGET---------------NWSLLVQAV--NHWHEPTAALMRPFRELP 104 ESY+ LG +L+ + + A F + Sbjct: 62 IDKTEYIESYNDLGTRRYRYIYSKLYDYLNSGATLVANGIINETKIDQLAKACSSFTD-- 119 Query: 105 DWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD- 163 L +S+ PH D D+F IQ +G++RW + + H + Sbjct: 120 SHPFSSLYLSYG-EKSSFKPHWDSRDIFAIQLSGKKRWIIYKPSFPDPVYLHQSKDMENT 178 Query: 164 ---PFEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVL 219 P E D LE GD+LY+P G+ H L E ++ SVG P E I+ + + Sbjct: 179 YPCPSEPYDDFVLETGDVLYLPRGWWHNPLPLGEETIHLSVGIFPPYAHEYINWLSYKIT 238 Query: 220 QRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHEL 279 D E+D L + +++ I + + ++ F + R Sbjct: 239 DI--------DIGRKSLPRSWKQAKDEIDILAKYVIDNITSEDSYNEFLKSFSDEKRVP- 289 Query: 280 DIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASN 339 + + + ++ + L + + NG K++ + L+ Sbjct: 290 --SKLNLRLFCGKKHHSISKTSRLRINSNNNLSIDEGYIICNGAKVNLDQFS-IHLLSKI 346 Query: 340 IALTAENFGDALE--DPSFLAMLAALV-NSGY 368 + +F + L D S + L+ GY Sbjct: 347 SEIPYISFENLLSFFDHSKQKNIEDLIYKLGY 378 >UniRef50_Q091R3 Mina protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q091R3_STIAU Length = 383 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. 
Identities = 80/382 (20%), Positives = 142/382 (37%), Gaps = 31/382 (8%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAM---ESEVD----SRLVSHQD 58 + F E W+++P+VL+ + + S +L L ++ + H+D Sbjct: 9 PIAPSVFFEEAWERKPLVLQGPPDRWSGLFSSRDLGRLLTYQPPRSIEGMMLVKEGRHRD 68 Query: 59 GKWQVSHGP--FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 W G E +++++ V + EP E + + Sbjct: 69 ENWLSPDGSPRLEQVQAAWREGYTIVINKVGQFWEPVGRFCAAVEEELHHPVGVNLYMTP 128 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDLLQVDPFEAIIDEELEP 175 G H D D F++Q G + W+V H ++++EL+ Sbjct: 129 PGAQGFKAHFDIMDAFVLQVEGSKVWQVRGPQVTLPLPDEHTATSSESLPPVLLEQELKR 188 Query: 176 GDILYIPPGFPHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVP 234 GD+LYIP GF HE ++++ ++G +A +L + +P Sbjct: 189 GDVLYIPRGFVHEARTAQTHSVHLTLGLQAVTWSDLF-----VAAIAAARRDERFRKGLP 243 Query: 235 PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQP-DEI 293 PR + + RE++ EL P H + G ++Q L + P PP + E Sbjct: 244 PRFLEGSAMME--QTFRELLAEL---PRHLE--LGHALTQLAERLVVQKPPPPTEDLLEG 296 Query: 294 YDALKQGEVLVRLGGLRVLRIGDDVYA----NGEKIDSPHR--PALDALASNIALTAENF 347 LK VL R G+ + + YA +G K+ P + PAL +A + ++ Sbjct: 297 AVELKGSTVLTRRPGMVLRVMEGPGYAGLQYSGGKLMGPAKIGPALRHIAKGSVIPVQSL 356 Query: 348 GDALEDPSFLAMLAALVNSGYW 369 L + L + LV SG Sbjct: 357 -PGLSEKEQLVLAGRLVRSGVL 377 >UniRef50_B4B491 Cupin 4 family protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B491_9CHRO Length = 390 Score = 203 bits (516), Expect = 9e-51, Method: Composition-based stats. 
Identities = 61/376 (16%), Positives = 128/376 (34%), Gaps = 17/376 (4%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFID-PISP---DELAGLAMESEVDSRLVSHQDGKW 61 + LE++W+K P+++ R ++ IS D + L D L+ Sbjct: 14 PIEPTTLLEKYWEKSPLLVARNHPDYYSELISLKNIDSILRLYGPKSSDVDLIKENSFFS 73 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGG 121 F +SL+++ ++ +P + L + + + + S G Sbjct: 74 AGGEVDFNQIYQAYSLGYSLVMRKIHERWQPLSVLHKNLEAFLNHPVGINLYMTSKNSQG 133 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ---HCPHPDLLQVDPFEAIIDEELEPGDI 178 H D +DVFI+Q G ++W++ + + D + L GD+ Sbjct: 134 FKAHFDTHDVFILQVEGSKQWKIYDSPITLPVISDLKYTDKFINQLKSPTAEYCLNKGDL 193 Query: 179 LYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA 237 LYIP G+ HE Y + +++ +VG + +LI+ + Q+E+ + Sbjct: 194 LYIPRGYIHEVYTDNSFSVHLTVGIHSLKWFDLINSAVTKLAQKEVRFRESLPVGFLRQE 253 Query: 238 HPADVLPQEMDKLREMMLELINQPEHFKQ----WFGEFISQSRHELDIAPPEPPYQPDEI 293 + L + +L +++ E E + + G+ + D I Sbjct: 254 EAEESLKNQFQELLKLLAEQSEVEEAVEDIAQGFLGKMSPLVDEQFYQLENLEYLNLDTI 313 Query: 294 YDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALED 353 + ++ G ++ G A+ + + L D Sbjct: 314 VKKREGLYRIIEKIGKIGIQFGSKTVME----SLRSERAIRFILEAEEFPIKAL-PGLAD 368 Query: 354 PSFLAMLAALVNSGYW 369 L ++ L+ G Sbjct: 369 NGKLTLVRRLIQEGIL 384 >UniRef50_C3XRY1 Lysine-specific demethylase NO66 n=1 Tax=Branchiostoma floridae RepID=NO66_BRAFL Length = 607 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. 
Identities = 62/401 (15%), Positives = 125/401 (31%), Gaps = 71/401 (17%) Query: 2 EYQL-TLNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGLAMESEVDS----RLVS 55 E+ + + F W+K+P+++KR ++ D S ++L + E+++ + + Sbjct: 195 EWLIHPVKKEKFFSELWEKKPLLVKRHLESYNDGWFSTEDLTKILHENDIQFGRNLDVTT 254 Query: 56 HQDGKWQVSHGPFES----YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 ++ G+ + + P + + S+ + + + L +E + Sbjct: 255 YEGGQRETHNPPGRANPAVVWDYYQNGCSVRLLNPQTYSQGVWRLCSTLQEYFSSMVGAN 314 Query: 112 MISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDE 171 + G PH D + F++Q G + Q + EAI+D Sbjct: 315 IYLTPPGTQGFAPHYDDIEAFVLQLEG-------------------NFSQEEIGEAILDV 355 Query: 172 ELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYY 228 LEPGD+LY P G H+ AL + + + +L+ L + Sbjct: 356 TLEPGDLLYFPRGTIHQASALPDTHSLHITVSTCQRNTWGDLMEKLVPAALTMAFSEDVE 415 Query: 229 SDPDVPP------RAHPADVLPQEMDKLREMMLELINQ-------PEHFKQWFGEFISQS 275 +P AD+ E + L+++ Q EF+ Sbjct: 416 FRQALPRDYLDYMGLANADLDDPRRKAFLETLQSLLSRLVNYVPVDAGVDQKAVEFMRDC 475 Query: 276 RHELDIAPPEPPYQPDEIYDALKQG-------------EVLVRLGGLRVLRIGDDVYANG 322 + E L++G L+R G R++ G+ V+ Sbjct: 476 LPPV-FTKNERACSIYGCRTRLEKGRVVGSVDLKTSTPVKLIRKGAARLVMEGEQVF--- 531 Query: 323 EKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAAL 363 L + P + L Sbjct: 532 ---------LYHVLENARVYHGAELQPIEVPPEAAPAIEYL 563 >UniRef50_B4V6J8 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V6J8_9ACTO Length = 394 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. 
Identities = 71/391 (18%), Positives = 125/391 (31%), Gaps = 42/391 (10%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSR--------- 52 + L FL + + V++ N +S +L + ++ Sbjct: 6 SWAERLGGDTFLAQTCFRAHKVIRSNGNAHPSLLSWGDLNEIVAAHRLEPPRMRLSRAGG 65 Query: 53 ---------LVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 L +++ G P + + L E SL++ A+ H P Sbjct: 66 AVPATAYSILRTNRRGVSWYQPQPADFHARLAE-GASLVIDAIEQIHPPVREAAAGLERF 124 Query: 104 PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD 163 + + G G H D +DV ++Q G +RW+V + + +V Sbjct: 125 FRTPVQVNAYASWTAEEGFGTHWDDHDVVVLQLEGSKRWKVYGPTRQAPAWRDVETPEVP 184 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGYAL--ENAMNYSVGFRAPNTRELISGFADYVLQR 221 + I D L PGD+LY+P G+ H A +++ + G E + D + Sbjct: 185 TGDPIADIVLTPGDVLYLPRGWWHAVSADQGTASLHLTFGLATQTGAEFLGWLRDDLRAS 244 Query: 222 ELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI 281 +D PR + + +R+ +L ++ P +W R Sbjct: 245 LT---VRADV---PRFGTTEERADYLAAVRKDVLAALDAPAVLDRW-------ERTLDAT 291 Query: 282 APPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYA------NGEKIDSPHRPALDA 335 P P + + + V+ R DD N P P L Sbjct: 292 HPGRPRLSLPHLTGVPAEPGITVQATVPRARIDQDDQAVTFAGAGNEWTFALPVAPLLRL 351 Query: 336 LASNIALTAENFGDALEDPSFLAMLAALVNS 366 LA T + A E L +A LV+ Sbjct: 352 LAGGPPATLADL--AAESDLTLVQVAELVSE 380 >UniRef50_B3S582 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3S582_TRIAD Length = 431 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. 
Identities = 42/308 (13%), Positives = 100/308 (32%), Gaps = 20/308 (6%) Query: 5 LTLNWPDFLERHWQKRPVVLKRGFNNFID-PISPDELAGL----AMESEVDSRLVSHQDG 59 L + F W+++P++ +R +++ + S +L + +E V+ + ++++G Sbjct: 3 LPIPLDTFFNLSWERKPILAQRRSSSYNNGLFSSHDLDRIVRENYIEYSVNLDVTTYENG 62 Query: 60 KWQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 + + + S+ + + E +E + M Sbjct: 63 VRETHNAEGRVLASVMWDYYQNGCSIRMLNPQTYSESLWKFCSLLQEYFGSFVGCNMYLT 122 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK---QHCPHPDLLQVDPFEAIIDEE 172 G PH D + F++Q G+++WR + Q + + + Sbjct: 123 PPGTQGFAPHFDDIEAFVLQLEGKKKWRFYNPRDDSEILPEYSSGNFNQNEIGKPSFEFV 182 Query: 173 LEPGDILYIPPGFPHEGYALENAMNYSVGFRA---PNTRELISGFADYVLQRELGGNYYS 229 LE GD Y P G H+ +L + + + + + ++ N Sbjct: 183 LEQGDFAYFPRGTIHQAQSLPDCHSLHITVSTCQLHSFGKYFEKLLPMAIRSAFKNNLGL 242 Query: 230 DPDVPPR--AHPADVLPQEMDKLREMMLELINQ--PEHFKQW-FGEFISQSRHELDIAPP 284 +PP A+ + + R+ + + Q + E + Sbjct: 243 RKSLPPDFFANIGGIHADSKNARRKQLTTEVKQFLKDIIDDAPIDEAADLFAAGVIHDYL 302 Query: 285 EPPYQPDE 292 P + +E Sbjct: 303 PPCHTQEE 310 >UniRef50_A6W7N8 Cupin 4 family protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6W7N8_KINRD Length = 434 Score = 200 bits (509), Expect = 8e-50, Method: Composition-based stats. 
Identities = 63/374 (16%), Positives = 119/374 (31%), Gaps = 44/374 (11%) Query: 15 RHWQKRPVVLK-------RGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGP 67 HW RP++++ G + D +SP ++ L + + S + Sbjct: 37 EHWNTRPLLVRAADRAAEGGRASVHDLLSPADVDELLGPRALRTPFFSLVQDGTPLPRSS 96 Query: 68 F-----------------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + + ++++QA++ + Sbjct: 97 YTRRAVAGNQQLADLPDTDRVAAAHAGGATIVLQALHRTWPALQTFCSQLAADLGHQCQV 156 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPF----E 166 + G PH D +DV ++Q GR+ W + P Sbjct: 157 NVYVTPPGAQGFKPHHDTHDVVVLQVDGRKHWTIHPPAVELPLKSQPSTQLGPDPVGGRP 216 Query: 167 AIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGG 225 ID LEPGD LY+P G+ H E+ +++ +VG A + + Sbjct: 217 PAIDTVLEPGDALYLPRGWLHSARTTEDRSIHLTVGLLATTWAD----VLTDAVASAGVA 272 Query: 226 NYYSDPDVPPRAHPAD---VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIA 282 + +P P V +E+ R ++ + +++ R A Sbjct: 273 DVALRRALPLPGAPGAADGVPDEEVAGFRAAAQRWLDALD--DDAVRRLVARRRSGAVPA 330 Query: 283 PPEPPYQPDEIYDALKQGEVLVRLGGLRVLR----IGDDVYANGEKIDSPH--RPALDAL 336 P DE L +G L G+R G D+ + ++ P RPAL+ + Sbjct: 331 EPVGVLAQDEAARTLAEGTALRPRRGVRSSLVPAGEGVDLVLDDRRVTFPGWLRPALEHV 390 Query: 337 ASNIALTAENFGDA 350 + +A + A Sbjct: 391 LAAPRTSAADLAAA 404 >UniRef50_Q6DDJ7 Mina-prov protein n=2 Tax=Xenopus RepID=Q6DDJ7_XENLA Length = 461 Score = 200 bits (508), Expect = 1e-49, Method: Composition-based stats. 
Identities = 58/415 (13%), Positives = 121/415 (29%), Gaps = 53/415 (12%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPI-------SPDELAGLAMESEVDSRLVSHQD 58 + F +W+ + ++L+ F D +AG + E D + +D Sbjct: 45 PVTSDAFFRDYWETKVLLLQGRDPAFTDYFQTLFRLSDLKHIAGGGIYYERDVNVFKCRD 104 Query: 59 GK-----WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 GK G ++ +++ +M + + Sbjct: 105 GKKIALPRHGKATYLHLLKDFGSGKATIQFHQPQRFNDALWHIMEKLECFFGALVGSNVY 164 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G+ H D +VFI+Q G +RWR+ + + + D L Sbjct: 165 ITPQDSQGLPAHYDDVEVFILQLEGEKRWRLYNPVVPLARDYSV-VPEDQIGSPTHDFVL 223 Query: 174 EPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYS 229 +PGD+LY P G H+ AL ++ + V + + + +L N Sbjct: 224 KPGDLLYFPRGVIHQAQALPGSSHSTHVTISTYQNNSWSDYLQDLLPGILFDAAKANIDL 283 Query: 230 DPDVPPRAHPADVLPQEMDKLREMM---LELINQPEHFKQW-FGEFISQSRHELDIAPPE 285 +P + + P + ++ ++ + + H + + SR + E Sbjct: 284 RRGIPRQQILSLDTPGVIQQISSLLNTVAKGLESHRHIRSFEILRDFMASRLPPFLDNKE 343 Query: 286 PPYQPDEIYDALKQGEVLVRL---------------------------GGLR----VLRI 314 P ++L R R + Sbjct: 344 PGQTSGMPPKLNSTVQLLYRDYSFFTVDHVAEESNAAAEQVVYVYHSLKNTRETHMMAMQ 403 Query: 315 GDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYW 369 +D G + P+ AL + + +++ D + +L G Sbjct: 404 EEDRPVTGLRFPLPYANALKRIWESESVSVGKL-PLDRDEDKENLALSLWTEGLL 457 >UniRef50_Q8IUF8 MYC-induced nuclear antigen n=25 Tax=Amniota RepID=MINA_HUMAN Length = 465 Score = 199 bits (507), Expect = 1e-49, Method: Composition-based stats. 
Identities = 51/415 (12%), Positives = 126/415 (30%), Gaps = 61/415 (14%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFN----NFIDPISPDELAGLAME---SEVDSRLVSHQD 58 + F + W+++P++++R + +L L D + + Sbjct: 48 PIKTETFFKEFWEQKPLLIQRDDPALATYYGSLFKLTDLKSLCSRGMYYGRDVNVCRCVN 107 Query: 59 GKWQVSHG-----PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 GK +V + + + ++ + + + + + Sbjct: 108 GKKKVLNKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVY 167 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G+ PH D +VFI+Q G + WR+ + + + + L Sbjct: 168 ITPAGSQGLPPHYDDVEVFILQLEGEKHWRLYHPTVPLAREYSVE-AEERIGRPVHEFML 226 Query: 174 EPGDILYIPPGFPHEGYA---LENAMNYSV-GFRAPNTRELISGFADYVLQRELGGNYYS 229 +PGD+LY P G H+ L ++ + ++ ++ + + + ++ + Sbjct: 227 KPGDLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVEL 286 Query: 230 DPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQ 289 +P + + + + + + E +S + I PPY Sbjct: 287 RTGIPRQLL---LQVESTTVATRRLSGFLRTLADRLEGTKELLSSDMKKDFIMHRLPPYS 343 Query: 290 PDEIYDALKQGEVLVRLGGLRVLRIGDDVYAN---------------------------- 321 + + G L RL + L+ D + Sbjct: 344 AGDGAELSTPGGKLPRLDSVVRLQFKDHIVLTVLPDQDQSDEAQEKMVYIYHSLKNSRET 403 Query: 322 ------------GEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALV 364 G + H AL + ++ A++ ++ D +++ +L Sbjct: 404 HMMGNEEETEFHGLRFPLSHLDALKQIWNSPAISVKDL-KLTTDEEKESLVLSLW 457 >UniRef50_UPI00017929D5 PREDICTED: similar to Nucleolar protein 66 (hsNO66) n=1 Tax=Acyrthosiphon pisum RepID=UPI00017929D5 Length = 473 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. 
Identities = 61/400 (15%), Positives = 127/400 (31%), Gaps = 37/400 (9%) Query: 1 MEYQL-TLNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAME----SEVDSRLV 54 +E+ + + DF+ HW+K + + R +N F S EL + E + + Sbjct: 71 LEWLIHPKSINDFMRDHWEKTILHVPRNSSNYFSQLFSLTELDTILRENNLQYGTNVDIT 130 Query: 55 SHQDGKWQVSHGPFESYDH----LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 S+ D + + ++ H S+ + + M +E + Sbjct: 131 SYTDNVRETHNPVGRAHPHVVWDYYNNGCSVRLLNPQLFAPEIYKFMANLQEYFGSLVGC 190 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEA 167 + G PH D + F++Q G + WRV + + + Q + E Sbjct: 191 NVYLTPPFSQGFAPHYDDIEAFVVQVDGEKHWRVYKPRSEFETLPRTSSRNFHQDEIGEP 250 Query: 168 IIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAP---NTRELISGFADYVLQREL 223 I+D L PGD LY+P G+ H+ L + + F + + + + + L + Sbjct: 251 ILDVILRPGDFLYMPRGYIHQADTLFTETHSLHLTFSSYQQNSMYDFLQVVVNNSLNNAV 310 Query: 224 GGNYYSDPDVP-----------PRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFI 272 + +P P + + + +M+ + E E Sbjct: 311 KNDISYRSGLPVGYQHFGGLCELEKSPPLEMLRTVANGAKMLPDGKTTGEEVNLEIAEVK 370 Query: 273 SQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPA 332 R+ E +E + + L + L D + I Sbjct: 371 LIRRNC--FRLVEESMLDEETDRMTNELRLYYNLDNSKQLHGNDCQFV---VIPDEMGLF 425 Query: 333 LDALASNIA--LTAENFGDALEDPSFLAMLAALVNSGYWF 370 + L + + + D ++ + ++ L G Sbjct: 426 VKKLFNMYPSFIKVSDLCD--DNEQVMQLVNDLWERGLLI 463 >UniRef50_A1R1T1 Putative cupin superfamily protein n=2 Tax=Micrococcineae RepID=A1R1T1_ARTAT Length = 388 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. 
Identities = 66/401 (16%), Positives = 129/401 (32%), Gaps = 59/401 (14%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 + + + F W + ++ RG +F D S D + L + + + G Sbjct: 9 TRLIDIGYEKFASDVWGRTALL-TRGVGDFSDLFSADAVDELISRRGLRTPFLRVAKGGS 67 Query: 62 QVSHGPFE----------------SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 + F +L++QA++ EP ++ Sbjct: 68 TLPESSFTSPAGVGATISDQLDDTQLWRKFADGATLVLQALHRTWEPVSSFSTQLSTELG 127 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------DL 159 + G H D +DVF++Q G +RW + E + + P + Sbjct: 128 HPVQANAYITPPQNRGFDDHYDVHDVFVLQIEGTKRWIIHEPVHVDPLRSQPWTDRRSAV 187 Query: 160 LQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYV 218 + +A ID LEPGD+LY+P G+ H A +++ ++G + R ++ Sbjct: 188 AEAAQGKAYIDTVLEPGDVLYLPRGWLHAAEAQGKVSIHLTLGVHSWT-RHALAEHLAQA 246 Query: 219 LQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH- 277 L + +P +E+ +RE + + + + + Q R Sbjct: 247 ALAALCDDPEVRRSLPLGVDG---PDEEIAAVRERLAAAVLEADTTSLFHRTRRGQGRPA 303 Query: 278 ---------ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP 328 LD PE + E +A +G L RV + Sbjct: 304 PLGPVAQLAALDGLGPESLVRLREALEARLEGSRL----TTRVGWLD---------FPEA 350 Query: 329 HRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYW 369 + P++ L L + ++ L+ +G Sbjct: 351 NLPSVRRLLDGE--------PHLASDLGVELVERLLRAGVL 383 >UniRef50_A3UGV1 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UGV1_9RHOB Length = 387 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. 
Identities = 69/377 (18%), Positives = 121/377 (32%), Gaps = 30/377 (7%) Query: 11 DFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRLVSHQ-------DGKWQ 62 F E +++ + N F IS D + + E + +S D W Sbjct: 16 AFFETVFEQTHLHAPGTDRNRFASLISLDAIDRILAEDLLREGDLSMARAEPRLPDRAWL 75 Query: 63 VSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG 120 G + L + +L++ + H P A L R + + Sbjct: 76 REDGLVDRGEVARLYQQGATLILPQLQARHRPLADLCRQLEAEFSCPVQTNIYLTPPNAQ 135 Query: 121 GVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDIL 179 G H D +DV ++Q G +RWR+ + + + + E + L PGD+L Sbjct: 136 GFQTHYDNHDVLVLQVEGSKRWRLYDAPVGVPYRGERFTPGRFAQTEPRAELVLNPGDVL 195 Query: 180 YIPPGFPHEGY---ALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 Y+P G H+ + E +++ + G A + + R +PP Sbjct: 196 YVPRGLMHDAVNEGSDEASLHITTGLLAKTWADFLLEAVSEAALRTPQ----LRRALPPG 251 Query: 237 AHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA 296 V P K LE + Q G F + + Sbjct: 252 YARGAVSPGVFAKTFAEALEAVGQNADIGAVLGLFTDA-----ALTSRPADTRGALTLGP 306 Query: 297 LKQGEVLVRLG--GLRVLRIGDDVYA----NGEKIDSPHRPALDALASNIALTAENFGDA 350 + L R L ++ GD V D+ L+ L A++ +F A Sbjct: 307 ITADTRLKRRALIALDLVDDGDHVALVAPGGALSFDAAAEAGLERLLKGDAISLADFS-A 365 Query: 351 LEDPSFLAMLAALVNSG 367 L+D ++ L+ G Sbjct: 366 LDDAKARDVMERLIAYG 382 >UniRef50_B7PMB0 MYC-induced nuclear antigen, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PMB0_IXOSC Length = 472 Score = 198 bits (503), Expect = 4e-49, Method: Composition-based stats. 
Identities = 60/335 (17%), Positives = 119/335 (35%), Gaps = 25/335 (7%) Query: 1 MEYQLT-LNWPDFLERHWQKRPVVL--KRGFNNFID-PISPDELAGLAMESEV----DSR 52 ME+ L+ L++ +F E++W++ P V + G F S D +A E+++ D Sbjct: 16 MEFLLSPLSYKEFSEKYWEREPFVAHDRAGMRAFWPQLFSKDAFFSIAKETKLYFGKDVS 75 Query: 53 LVSHQDGKWQV-SHGPFESYDHLG----ETNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 ++DGK + G L E +L V W + ++ Sbjct: 76 ACKYEDGKRSDYAEGYSAKSAKLNKYFEERKATLQVHQPQRWKDSLWEVLELMERFFGCL 135 Query: 108 IDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEA 167 + G+ PH D DVFI+Q G + W++ + + D + E Sbjct: 136 VGCNAYITPAGSQGLAPHHD--DVFIVQLEGEKCWKLHKPVTELARIYSKDFTSEEIGEP 193 Query: 168 IIDEELEPGDILYIPPGFPHEG----YALENAMNYSV-GFRAPNTRELISGFADYVLQRE 222 + L PGD LY+P G H A ++ + ++ ++ + + A ++ Sbjct: 194 THEFTLRPGDFLYMPRGTIHHAYVPESADSHSTHITISTYQKQTVGDCLMDIAPDLISSA 253 Query: 223 LGGNYYSDPDVPPRAHPADVLPQ-----EMDKLREMMLELINQPEHFKQWFGEFISQSRH 277 + +P R P+ VL + + + E + ++ ++ ++ +F+ Sbjct: 254 MDSCIELRKGLPNRFLPSCVLSKETVVTALSSVLEHVKQMPDEMPSPEEMVRDFMFSRLP 313 Query: 278 ELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVL 312 +P L L L Sbjct: 314 PFGFDRRNTSMRPYGDVPHLNDRVKLAYPDHCTYL 348 >UniRef50_Q28VG0 Cupin 4 n=1 Tax=Jannaschia sp. CCS1 RepID=Q28VG0_JANSC Length = 392 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. 
Identities = 66/405 (16%), Positives = 127/405 (31%), Gaps = 56/405 (13%) Query: 1 MEYQLTLNW-------PDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSR 52 M + +W F +++K+P+++KRG F D +S E+ + + Sbjct: 1 MTNTFSFDWAIAPETPDTFFAEYFEKKPMLIKRGQPGYFSDLLSYGEIDRVVSTMGLHVP 60 Query: 53 --LVSHQDGKWQVSHGPFE-------SYDHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 V+ DG + +E + L ++++ ++ A R Sbjct: 61 EINVTRADGNITPADFAYETGQIDPVRVNQLHADGATVILSGLHERLPALARYCRAMEAA 120 Query: 104 PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD 163 R+ + G PH D +DV ++Q G + WR+ +D Sbjct: 121 MSARVQTNIYMTPPGNQGFNPHYDGHDVLVLQVAGTKEWRIYGTPVELPLADQAFERGMD 180 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRE 222 E LEPGD +YIP G H+ A + +++ + G + A+ V+ + Sbjct: 181 VGEEAQRFVLEPGDAVYIPRGMAHDAVATDETSLHITTGLMFRTWAD---ALAEAVIAKA 237 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIA 282 P + P ++L + + F + +L Sbjct: 238 HRE-----PSLRRALPPG---------FANNGVDLDDYKDTFAELIELVGDAHVGKLLSG 283 Query: 283 PPEPPYQPDEIYDALKQGEVLVRLGGLRV-LRIGDDVYANGEKIDSPHRPALDALASNIA 341 E Q L +L GL + R+G + D P+ + +A Sbjct: 284 FREEFLTARVPRVE-GQMAQLAKLDGLTMDSRMGAHPHIVFGIHDVPNEDQVCLVAQGAE 342 Query: 342 L-------------------TAENFGDALEDPSFLAMLAALVNSG 367 + + L+D + + V G Sbjct: 343 IILPAHARDAMEFCITTADFRLGDMEGDLDDAGKMVLAKRFVREG 387 >UniRef50_C6W918 Cupin 4 family protein n=2 Tax=Actinomycetales RepID=C6W918_ACTMD Length = 395 Score = 195 bits (497), Expect = 2e-48, Method: Composition-based stats. 
Identities = 71/384 (18%), Positives = 133/384 (34%), Gaps = 35/384 (9%) Query: 12 FLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVD-SRLVSHQDGKWQVSHGPFE- 69 F E + + F D + E+ + + ++ RL +DG+ +H E Sbjct: 18 FFEAVQGRTHLRFPGERGRFADLLPWSEVNRVLRQHRLEFPRLRLARDGEVVPAHVYSEL 77 Query: 70 ---------------SYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 + +L++ +V L R+ + Sbjct: 78 VDTRRAGQVPRVLPGKFAEQMRGGATLVLDSVQELVGAVGDLAVGLEHELRERVQVNAYA 137 Query: 115 FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELE 174 G H D +D ++Q +GR+RWR+ ++ +L E + + LE Sbjct: 138 GWGVTHGFDVHWDDHDAIVVQVSGRKRWRIHGFTRVAPMVRDVELPPRPEGEPLDEFVLE 197 Query: 175 PGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV 233 G++LY+P G H+ A+ E +++ ++G +L++ AD + E Sbjct: 198 AGEVLYLPRGCWHDVSAVGEESLHLTIGVNRATGVDLVAWLADQLRGDEAF------RGD 251 Query: 234 PPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEI 293 PR A + +LR +LE ++ ++ + +Q+ + P Sbjct: 252 LPRFGTAAEQAEHAAQLRAGLLERLDD-GVVARFLADRDAQAPAVEHVGLPWTATSAM-- 308 Query: 294 YDALKQGEVLVRLGGLRVLRIGDDVYAN--GEK--IDSPHRPALDALASNIALTAENFG- 348 EVL+ + R GD V G++ P L AL A T + Sbjct: 309 IPEDDGAEVLLLAPRAVLSREGDAVVLAAVGKRLVFAGAAEPVLAALLGGRARTVTSLAE 368 Query: 349 ---DALEDPSFLAMLAALVNSGYW 369 AL+ + A+L L G Sbjct: 369 AGGPALDRVTVRALLGELAAQGLL 392 >UniRef50_A4X6V2 Cupin 4 family protein n=4 Tax=Micromonosporaceae RepID=A4X6V2_SALTO Length = 496 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. 
Identities = 66/399 (16%), Positives = 128/399 (32%), Gaps = 38/399 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKR----GFNNFIDPISPDELAGLAMESEVDSRLVSH 56 M +++ F HW + P++ + + F D +SP + L + + + Sbjct: 1 MARCVSVEPATFAAAHWGQTPLLSRAHELPNPSGFRDLLSPADADDLLSRRGLRTPFLRV 60 Query: 57 QDGKW---------------QVSHGPFES-YDHLGETNWSLLVQAVNHWHEPTAALMRPF 100 +++ + L +L++Q ++ R Sbjct: 61 AQDGVLVPAARYTGGGGAGAEITDQVLDEKILDLYAGGATLVLQGLHRTWPALIDFARDL 120 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--- 157 + G H D +DVF++Q G + WR+ + P Sbjct: 121 GLAVGQPLQVNAYLTPAGSQGFATHYDTHDVFVLQVDGGKHWRIHPPVLPDPLERQPWGG 180 Query: 158 ---DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISG 213 +++ +D L PGD LY+P G+ H A E ++++ +VG RA L+ Sbjct: 181 RADEVVATATGAPALDVLLAPGDALYLPRGWLHSAAAQERSSLHLTVGVRALTRYTLVEE 240 Query: 214 FADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFIS 273 L E + P + P V P+ + + E++ + + + + + Sbjct: 241 LL--ALAAEDQRLRATLPFGIDVSAPEAVEPE-LTETVEILRDWLRRVD--PTALAARLR 295 Query: 274 QSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGD------DVYANGEKIDS 327 Q P P AL + GLR V+ + Sbjct: 296 QRAWPAARPAPLHPLAQAAALGALGPDSRVTPRPGLRWQLTPAGERVTLRVFDRTITLPQ 355 Query: 328 PHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNS 366 PAL AL S + +D + ++ L+ Sbjct: 356 MCAPALRALLSGEVSRVGDLPGLADDTDRVTLVRRLLRE 394 >UniRef50_Q2T4J7 Unnamed protein product n=2 Tax=Burkholderia thailandensis RepID=Q2T4J7_BURTA Length = 397 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. 
Identities = 64/391 (16%), Positives = 126/391 (32%), Gaps = 37/391 (9%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLAMESEVDSRL-VSHQDGKWQV 63 + F+ER+W ++P++++R + + +E A L R S +G + Sbjct: 12 PITVDAFMERYWGRKPLIVRRQAPHLYACLPDSEEFAFLLHSLTDPERGWFSIVNGVARP 71 Query: 64 SHGPF---------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR------- 107 SLL+ V H TA L R Sbjct: 72 PSDSLLTQEGLLNLSEVYAAYRDGNSLLMNQVQRRHRETAMLCRRIESALSAHGIALARH 131 Query: 108 IDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEK-LQMKQHCPHPDLLQVDPFE 166 I G H D +DV I+Q GR+ WR+ + + P + + Sbjct: 132 IGANGYLSPPSSQGFNIHYDPHDVLILQIEGRKHWRLYGRHVAWPTQPPATPIPPEEAGS 191 Query: 167 AIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGG 225 + L PG+++YIP G H+ ++ +++ ++ +L+ Sbjct: 192 PRREFVLSPGELVYIPRGVLHDANTTDSRSLHLTLSIETLTWTDLLIEAM--------SD 243 Query: 226 NYYSDPDVPPRAHPAD-VLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPP 284 N ++P + + +L + +N P ++ + LD P Sbjct: 244 NPAFRRNLPVCPPFGKRIGDEARAELTR-LTASLNNPRALRRALAAMSGRLLGNLD-PLP 301 Query: 285 EPPYQPDEIYDALKQGEVLVRLGGL--RVLRIGDD--VYANGEKIDSP--HRPALDALAS 338 + + ++ L G V GD+ ++ G + + A L Sbjct: 302 NGGFAEVDGLHLIEPKTWLSLAPGTFGHVEVNGDEAILHLPGSALRAAREMAKAFYYLLR 361 Query: 339 NIALTAENFGDALEDPSFLAMLAALVNSGYW 369 + A + + + L + LV G+ Sbjct: 362 ARRVRACDLPVSASEADKLTFVRKLVQMGFL 392 >UniRef50_O01658 Lysine-specific demethylase NO66 n=3 Tax=Caenorhabditis RepID=NO66_CAEEL Length = 748 Score = 194 bits (492), Expect = 6e-48, Method: Composition-based stats. 
Identities = 56/359 (15%), Positives = 117/359 (32%), Gaps = 45/359 (12%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN-FIDPISPDELAGLA----MESEVDSRLVSHQDGKWQ 62 + F ++ +Q +V++R F + S L L +E + + +++G Sbjct: 316 DVQTFFDKFYQSNVLVVRRKQPTYFGNLFSTARLGELLEKNHLEYGRNINIAQYKNGVRT 375 Query: 63 VSHGPFESYDHLGETN----WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 +G +Y + + + S+ + + + L +E + Sbjct: 376 TLNGQGRAYPQIVKQHLHNMCSVQLVNPQTYDDRIWYLCEVIQEQFGCFVGANTYLTPAG 435 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPF-----EAIIDEEL 173 G PH D+ D F++Q GR+ WRV ++ P E + + + Sbjct: 436 SSGFAPHWDEIDAFLLQVEGRKYWRVWAPESAEEELPLESSDNFTEDDMKGREPVFEGWI 495 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGF---RAPNTRELISGFADYVLQRELGGNYYSD 230 E GD++YIP G+ H+ + V R + L+ + + Sbjct: 496 EKGDMIYIPRGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKVVPEAIGVLTDTRHKLR 555 Query: 231 PDVP---------------PRAHPADVLPQEMDKLREMMLELINQ-----------PEHF 264 +P H + +D+ M+ L+ E Sbjct: 556 RGLPTGLFDMGGVIDLDYSQEDHFVEKFKMVVDRHMSMLRNLVADQLLESSVDSLAKEFM 615 Query: 265 KQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQG-EVLVRLGGLRVLRI-GDDVYAN 321 KQ +++ +L + D++ D + L+R R+L D + + Sbjct: 616 KQALPPRLTEQEKKLSVLGSSTNLLGDDLVDFTARTKVRLIRRHTQRLLMESEDACFIS 674 >UniRef50_Q9H6W3 Lysine-specific demethylase NO66 n=17 Tax=Eumetazoa RepID=NO66_HUMAN Length = 641 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. 
Identities = 59/422 (13%), Positives = 126/422 (29%), Gaps = 59/422 (13%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEVDS----RLVSHQDGK 60 + F R W++ V+++R + + S +L + EV + +G+ Sbjct: 212 PMPPDHFYRRLWEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGR 271 Query: 61 WQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFS 116 + + P + L + SL + + + +E + Sbjct: 272 RETLNPPGRALPAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTP 331 Query: 117 VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEAIIDEEL 173 G PH D + F++Q GR+ WRV + P+ Q D E ++ L Sbjct: 332 PNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVL 391 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSD 230 EPGD+LY P GF H+ + + + + + +Q + N Sbjct: 392 EPGDLLYFPRGFIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFR 451 Query: 231 PDVPP------RAHPADVLPQEMDKLREMMLELIN---------------QPEHFKQWFG 269 +P A +D E + L+ + Sbjct: 452 RGLPRDFMDYMGAQHSDSKDPRRTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLP 511 Query: 270 EFISQSRHELDIAPPEPPYQPDEIYDA-----LKQGEVLVRLGGLRVLRIGDDVYA---- 320 ++ L + ++ E + + +++ G R++ G ++ Sbjct: 512 PVLTDRERALSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIARLVGEGGHLFLYYTV 571 Query: 321 -NGEKIDSPH----------RPALDALASN--IALTAENFGDALEDPSFLAMLAALVNSG 367 N A++ L + + + + L++ L + G Sbjct: 572 ENSRVYHLEEPKCLEIYPQQADAMELLLGSYPEFVRVGDLPCDSVE-DQLSLATTLYDKG 630 Query: 368 YW 369 Sbjct: 631 LL 632 >UniRef50_Q0ALX3 Cupin 4 family protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0ALX3_MARMM Length = 394 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. 
Identities = 53/383 (13%), Positives = 117/383 (30%), Gaps = 25/383 (6%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNN----------FIDPISPDELAGLAMESEVDSRLVS 55 ++ F + +K+P+++ R D +S +L A++ V Sbjct: 17 PIDRETFFRDYHEKKPLIVHREDPGRYAGLLSIARIDDIVSSIDLREGALDMARSEPPVQ 76 Query: 56 HQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 +D + + + ++++ ++ R L + + Sbjct: 77 REDYMFDTGYVDRGGVANQYRQGATIILPQLHMMDAVLGEFCRAVESLLSCHVQTNIYLT 136 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEELE 174 G H D +DVF++Q G + WR E ++ E + + L+ Sbjct: 137 PPDNQGFNTHYDDHDVFVMQIEGEKLWRFYETPVENPYRGEGFRPDAHKAGEPVAEFVLK 196 Query: 175 PGDILYIPPGFPHEGYALEN--AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 G+ +Y+P G H+ + +++ ++G +L+ V R + P Sbjct: 197 AGECIYVPRGLMHDAQTHGDTASLHITLGLIVKTWADLMLEAVSEVALRTPAMRHSLPPG 256 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 + + EM+ ++ + FI P + Sbjct: 257 FARPDFDRTDAEVQFRDMAEMLAREMSVDGAMDFFVDSFIRSR------VPNTRGAISNY 310 Query: 293 IYDALKQGEVLVRL--GGLRVLRIGDDVYAN--GE-KIDSPHRPALDALASNIALTAENF 347 + +R ++V GE + + L L +TA +F Sbjct: 311 LAPNSASQTFKLRPFVPWRFAGDETENVIITAGGEVRFPAEAEQGLHTLLDGGTVTAASF 370 Query: 348 GDALEDPSFLAMLAALVNSGYWF 370 +D + + L L G Sbjct: 371 V-GQDDSAAIETLGKLHAFGLII 392 >UniRef50_A5PK74 Lysine-specific demethylase NO66 n=1 Tax=Bos taurus RepID=NO66_BOVIN Length = 667 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. 
Identities = 55/313 (17%), Positives = 103/313 (32%), Gaps = 23/313 (7%) Query: 2 EYQLT-LNWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEVDS----RLVS 55 E+ ++ + F R W++ V+++R +++ S L + EV Sbjct: 235 EWLISPMPPDHFYRRLWEREAVLVRRQDHSYYQGLFSTAVLDSILRNEEVQFGQHLDAAR 294 Query: 56 HQDGKWQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 + +G+ + + P + L SL + + + +E Sbjct: 295 YINGRRETLNPPGRALPAAAWSLYRAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSN 354 Query: 112 MISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLLQVDPFEAI 168 + G PH D + F++Q GR+ WRV + P+ Q D E + Sbjct: 355 VYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPV 414 Query: 169 IDEELEPGDILYIPPGFPHEGYALENAMNYSV---GFRAPNTRELISGFADYVLQRELGG 225 + LEPGD+LY P GF H+ + + + F+ + + +Q + Sbjct: 415 LQTVLEPGDLLYFPRGFIHQAECQDGVHSLHLTLSTFQRNTWGDFLEAVLPLAVQAAMEE 474 Query: 226 NYYSDPDVPP------RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHEL 279 N +P A +D E + L+ + HF Q + Sbjct: 475 NVEFRRGLPRDFMDYMGAQHSDSKDPRRTAFMEKVRVLVARLGHFAP-VDAVADQRAKDF 533 Query: 280 DIAPPEPPYQPDE 292 P E Sbjct: 534 IHDSLPPVLTDRE 546 >UniRef50_D2SA69 Cupin 4 family protein n=2 Tax=Actinomycetales RepID=D2SA69_9ACTO Length = 436 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. 
Identities = 70/394 (17%), Positives = 119/394 (30%), Gaps = 34/394 (8%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNN--FIDPISPDELAGLAMESEVDSRLVSHQDGK 60 L+ F E +W +RP++ + F D + + L + + + Sbjct: 19 RCTALDPRVFAEEYWARRPLLTRAEETGGSFADLLDLAAVDELLSRRGLRTPFLRIAKDG 78 Query: 61 WQVSHGPF----------------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELP 104 V F ++ L ++++Q ++ P Sbjct: 79 AVVDPKRFTTSGGAGAEVADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIEFADQLAADL 138 Query: 105 DWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------D 158 G PH D +DVF++Q G +RWR+ E + P Sbjct: 139 GHPTQVNAYVTPPSSRGFSPHYDVHDVFVLQVAGEKRWRIHEPVLTDPLRTQPWNERGAA 198 Query: 159 LLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFAD- 216 + E +ID L PGD LY+P G+ H AL + + +VG + D Sbjct: 199 VAAAAEREPLIDAVLRPGDALYLPRGYLHSATALGAISAHLTVGIHSVTRWAAAESALDL 258 Query: 217 -YVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN--QPEHFKQWFGEFIS 273 VL E S P A PA V ++ + + ++ P Sbjct: 259 VRVLATEDPQLRRSLPLGVDLADPAAV-ADDVATVVTALKGWLDRVDPAEVADRLRARTW 317 Query: 274 QSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGE---KIDSPHR 330 +AP + LR G G ++ + R Sbjct: 318 SQVRPEPVAPLAQATAAAALSPDTVLRLRRRLRCQLREAADGRVTLVAGRRSLELPAETR 377 Query: 331 PALDALASNIALTAENFGDALEDPSFLAMLAALV 364 PA+ L + L + L+ L + LV Sbjct: 378 PAVAGLLAAGELKVADL-PGLDPADQLTLGRRLV 410 >UniRef50_A7SRW5 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SRW5_NEMVE Length = 269 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. 
Identities = 40/211 (18%), Positives = 79/211 (37%), Gaps = 15/211 (7%) Query: 10 PDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAMESEV----DSRLVSHQDGKWQVS 64 DF E +W+K+P+V+ R + F S L L + E+ D + + DG+ + Sbjct: 58 KDFFENYWEKKPLVINREDSEFYGALFSKAFLEVLLKKKEINYVEDINVCRYIDGEKEFL 117 Query: 65 HG-------PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 + + + + N ++ + + L + + Sbjct: 118 NEDEGTKATASKIMKKVKDDNATIQFHQPQRFQDTLWQLNGNLERFFGCLVGANVYITPP 177 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGD 177 G+ PH D +VFI+Q G + W++ L DL + + + L+ GD Sbjct: 178 NAQGLAPHHDDVEVFILQLEGEKNWKLYSPLVELALDYSADLEEDSIGKPTHEFTLKTGD 237 Query: 178 ILYIPPGFPHEGYAL---ENAMNYSVGFRAP 205 +LY P G H+ L ++ + ++ Sbjct: 238 LLYFPRGTIHQAETLKCGNHSTHITLSTYQQ 268 >UniRef50_D0MXW2 Nucleolar protein, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0MXW2_PHYIN Length = 506 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 54/346 (15%), Positives = 107/346 (30%), Gaps = 35/346 (10%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNF---IDPISPDEL--------------AGLAMESE 48 ++ FL +++K+P+ +++ + S +L L + Sbjct: 65 GMSLDTFLSEYFEKKPLHVRKADKGALFDSNLFSRKKLLKVMEKQHRSLSFGKDLTVCRY 124 Query: 49 VDSRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRI 108 VDS + DG+ H L + +S + + L F ++ Sbjct: 125 VDSE-RENFDGEDTNGHATSRQVASLLDRGYSCQFYQPQRYEDGLYELNAAFEDVFGGLA 183 Query: 109 DDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI 168 + PH D +VF++Q GR++W++ L DL + E Sbjct: 184 GSSAYLTPANSQALAPHHDDVEVFVVQTQGRKKWKLYHPLVELAGEHSSDLAEDQIGEPW 243 Query: 169 IDEELEPGDILYIPPGFPHEGYALEN--AMNYSVGFRAP-NTRELISGFADYVLQRELGG 225 ++ +E GD+LY P G H+ E + + ++ + V++ Sbjct: 244 MELTVEEGDLLYFPRGVIHQACTDEKEFSTHVTISVYQHNTWANFLEVALPRVIRHAFDS 303 Query: 226 NYYSDPDVPP--RAHPADVLPQEMDKLREMMLELIN---------QPEHFKQWFGEFISQ 274 + +P P + K +E M + ++ E Sbjct: 304 DVSFREGLPVGYLDSVGTQFPADSAKAKEFMATCKKLVGKLAAHVSEKDLQEAADEAALD 363 Query: 275 SRHELDIAPPEPPYQPDEIYDALKQGEV---LVRLGGLRVLRIGDD 317 P E DE + G V +R+ D+ Sbjct: 
364 VLANRLPPPSEGKDDDDESGPSPLDGNVSICFKNRSHIRLSIGEDE 409 >UniRef50_C6SNC5 Putative uncharacterized protein n=2 Tax=Neisseria meningitidis RepID=C6SNC5_NEIME Length = 387 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 53/386 (13%), Positives = 125/386 (32%), Gaps = 37/386 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ + +F E + K+P + K + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMEYKEFNENYLYKKPFIFKNALD--VSSISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLG---------------ETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 ES++ +G + +++ +++ ++ + + Sbjct: 59 IIPKEAYVESFNDVGRIRHRFNKTAVYQYLQDGATMVYNRIDN-EPFVDSIAKQIAQFAQ 117 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL---LQV 162 + H D DVF +Q G++ W + Sbjct: 118 AQTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGKKHWTISAPNFDMPLYMQQAKDMPHIT 177 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQR 221 ++ LE GDILYIP G+ H + + ++G PN + + Sbjct: 178 PSKTVDMEVILEAGDILYIPRGWWHNPMPMNCETFHLAIGTFPPNGYNYMEWLMKKIPDI 237 Query: 222 ELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI 281 + + D + + +D + + E++ E+++ + +F+ R Sbjct: 238 QSIRQNFID---------WEHDQKNIDNAAQAVTEMMKNQENYQAFIQDFLGNQRVNTAF 288 Query: 282 APPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANG--EKIDSPHRPALDALASN 339 + D L + + + ANG +D+ + L +A Sbjct: 289 ---NMQIFGNLDNDRLPENSTIKLNSLDNRTIKQGYIIANGIKTNLDNDSQTILQWIADK 345 Query: 340 IALTAENFGDALEDPSF-LAMLAALV 364 ++ + ++ + L + LV Sbjct: 346 HSVKLTQLYEFCQNQNINLEKVEKLV 371 >UniRef50_C5LMW3 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LMW3_9ALVE Length = 521 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. 
Identities = 62/399 (15%), Positives = 122/399 (30%), Gaps = 46/399 (11%) Query: 6 TLNWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGLAMESEVDSRL----------- 53 + +F E +W+K+P+ ++R ++ + +A + + R Sbjct: 57 PVTVEEFFEEYWEKKPLHVRRPTARDYYSGVWTKAMAEKTLTKH-ECRFGESVNFARVEA 115 Query: 54 -VSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM 112 V + E E S + +P ALM Sbjct: 116 GVKVMHNGEEGEKATVEYMQGQFEDGVSCQFMQPQRFSKPCHALMERLENYFGTLWGANS 175 Query: 113 ISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK---QHCPHPDLLQVDPFEAII 169 G PH D +VF++Q G +RWR+ + D + + Sbjct: 176 YLTPANSVGFAPHYDDVEVFMLQTEGSKRWRLYDSPDDDGPLPMEYSRDYTEEELSLPYF 235 Query: 170 DEELEPGDILYIPPGFPHE--GYALENAMNYSVGFRAP-NTRELISGFADYVLQRELGGN 226 DE +E GD+LYIP G H + + +V + EL+ ++ L Sbjct: 236 DEVVEQGDLLYIPRGTVHFGCVSPEGYSHHLTVSTYYHNSWGELLQNL---LIPGALAKA 292 Query: 227 YYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEP 286 D + +P + ++ + + G+ + S ++++ Sbjct: 293 MKEDVGFR------EGVPVNWTRYMGRLMAPVTAETAALEAGGDDDNASDDDVEVGKDGS 346 Query: 287 PYQPDE-----IYDALKQGEVLVRLGGLRVLRIGDDVYANG---------EKIDSPHRPA 332 + +E + QGE L R + G ++ D+ + A Sbjct: 347 STEEEEEVDAKVGSVPPQGETQADPEALMKARKAFKAHVKGLIAKLSEYVDEDDAADQTA 406 Query: 333 LDALASNIA--LTAENFGDALEDPSFLA-MLAALVNSGY 368 +D +A A P+ +L N + Sbjct: 407 VDFVALRTPPAPRAGETKTHGPSPAAQGNLLVRWRNPAW 445 >UniRef50_UPI000192614C PREDICTED: similar to chromosome 14 open reading frame 169 n=1 Tax=Hydra magnipapillata RepID=UPI000192614C Length = 388 Score = 187 bits (474), Expect = 7e-46, Method: Composition-based stats. 
Identities = 57/381 (14%), Positives = 123/381 (32%), Gaps = 53/381 (13%) Query: 40 LAGLAMESEVDSRLVSHQDGKWQVSHGPFESY----DHLGETNWSLLVQAVNHWHEPTAA 95 +A ++ + + +++G+ + + ++ E S+ + + + Sbjct: 1 MARKNVQYTKNLDIAVYRNGQRETLNPEGRAFPSVVWKFYEDGCSIRLLNPQIFAKSVHQ 60 Query: 96 LMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQ 152 L +E + + G PH D + F+IQ G++ W++ ++ Sbjct: 61 LTSRLQEYFGCLVGSNVYLTPPGSQGFAPHYDDIEAFVIQLEGKKHWKLYPPRNTNEVLA 120 Query: 153 HCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRE 209 ++ + + E I+++ LE GD LY P G H+ LE++ + + + + Sbjct: 121 RYSSENMQEENLGEPILNKVLEAGDTLYFPRGVIHQASTLEDSHSLHITISLYQKSSWGD 180 Query: 210 LISGFADYVLQRELGGNYYSDPDVPPRAHP------ADVLPQEMDKLREMMLELINQPEH 263 + LQ+ + N +P ++ E D + + +L+ + Sbjct: 181 YLEKLIPLALQKAISENVMFREGLPIDFSSFVGVSNSEKKCPERDTFVKTVKKLMEKLID 240 Query: 264 FKQWFGEFISQSRHELDIAPPEPPYQPDEIYD-------------------ALKQGEVLV 304 + + E + + P Y D++ L LV Sbjct: 241 YVE-IDEAGDELVLDHMEEFQPPYYSSDDVECSVFSKDGFWDGEVRRHHKIELSTELRLV 299 Query: 305 RLGGLRV--LRIGDDVYANGE-------------KIDSPHRPALDA-LASNIALTAENFG 348 R G +RV VY + E + L+ L S A Sbjct: 300 RPGIVRVIPADTELQVYYSVENSRQYKEVPLRVLRFSLEDADLLEYFLFSYPAYVVVENA 359 Query: 349 DALEDPSFLAMLAALVNSGYW 369 +D + + L N G Sbjct: 360 PG-DDIHKVDVANRLYNFGIL 379 >UniRef50_Q849M1 Putative uncharacterized protein pSV2.19c n=3 Tax=Streptomyces RepID=Q849M1_STRVN Length = 390 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. 
Identities = 76/385 (19%), Positives = 135/385 (35%), Gaps = 45/385 (11%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSR----------------- 52 DFL + + + + ++ D+L + ++ Sbjct: 12 EDFLAQALHREHRHIPGAL-DVAGLMTFDDLNQILATHRLEPPRMRLSRDGETLLVGGYT 70 Query: 53 --LVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 + + + W H P E + L E SL + +V+ H P A L R+ Sbjct: 71 TPVATRRHTVWHRLH-PAELHTRLTE-GASLALDSVDELHPPIARLCEAIERELHTRVQA 128 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIID 170 + + G G H D +D I+Q G +RWR+ + P E + D Sbjct: 129 NLYASWSATEGFGVHWDDHDTVIVQLDGAKRWRIYGTTRPFPLYRDIADPGEAPTEPVAD 188 Query: 171 EELEPGDILYIPPGFPHEGYALEN--AMNYSVGFRAPNTRELISGFADYVLQRELGGNYY 228 L PGD+LY+P G H A + +++ + G + +L++ ++ +L E Sbjct: 189 LVLWPGDVLYVPRGVWHAVSADQGVRSLHVTCGLQTHTATDLMAWVSEQLLTHED----- 243 Query: 229 SDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPY 288 D+P A P DV +D +R+ + EL++ P ++ Q+ + P PY Sbjct: 244 WRRDLPLLAAP-DVQADAVDGMRKRLAELLDDPTLLARYRTAMDGQAVGRM---VPSLPY 299 Query: 289 QPDEIYDALKQGEVLVRLGGLRVLRIGDD----VYANGEKIDSP--HRPALDALASNIAL 342 D G + VRL R + + + A G + L L + Sbjct: 300 IDGIPVD----GALRVRLTTARAVLDVGEDTVTLSAAGSTFEFAPEAEAVLRPLVDGRTV 355 Query: 343 T--AENFGDALEDPSFLAMLAALVN 365 A L ++ LV Sbjct: 356 DLAALAATAGLTLEDVAGLVQELVA 380 >UniRef50_A8QFQ3 Lysine-specific demethylase NO66 n=2 Tax=Brugia malayi RepID=NO66_BRUMA Length = 710 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 58/429 (13%), Positives = 123/429 (28%), Gaps = 67/429 (15%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFI-DPISPDELAGLAM----ESEVDSRLVSHQDGKWQ 62 + F + +QK+ ++ N+ + S + + E + + +++ + Sbjct: 279 DLTQFFKMVFQKKVFLVCHNNPNYYGNLFSTAKFIDILQTDYVEYGTNVNVAIYKNQQRS 338 Query: 63 VSHGPFESYDHLGET----NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVP 118 +G + Y + S+ + + + +E+ + + Sbjct: 339 TLNGSGKVYPQAIQKSIKAGCSIQLTNPQSFCDNVWYYCDLLQEVFGCFVGANIYITPAN 398 Query: 119 GGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEA-----IIDEEL 173 G PH D D F++Q GR+ W++ + P + + D+ L Sbjct: 399 TAGFAPHWDDIDAFLLQLEGRKHWKIYAPDSDDEMLPRLPSGNFTDNDVINRMLVFDDWL 458 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVG---FRAPNTRELISGFADYVLQRELGGNYYSD 230 E GD+LYIP G+ H+G+A ++ + + R +L+ L N Sbjct: 459 EQGDMLYIPRGYIHQGFADKDVHSLHLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIR 518 Query: 231 PDVPPR----------AHPADV---------LPQEMDKLREMMLELINQPEHFKQWFGEF 271 +P R +P L + + EL Sbjct: 519 KSLPARYLDMTGVLECDYPLLKTGTMKLHRFLDSIFSNFCKYIKELSEPAVDM---MARE 575 Query: 272 ISQSRHELDIAPPEPPYQP-----------DEIYDALKQGEVLVRLGGLRVLRI-GDDVY 319 ++ + E E L+R G R++ + + Sbjct: 576 FMRTALPPVLTKEEKDMTALCVAGSSLYGDKEHIFTKNTSIKLLRRHGQRLIYESEERCF 635 Query: 320 A-----NGEKIDSPHRPALDA---LASNIALTAENFGDAL--------EDPSFLAMLAAL 363 N + D LA A + + + + L Sbjct: 636 IVHRMANSRVYEGRPEVLFDLDVELAEGFANLVNAYPRWCLVSDLKCNDAADNIRLAELL 695 Query: 364 VNSGYWFFE 372 ++G E Sbjct: 696 YSNGLLMAE 704 >UniRef50_A3M7T2 Putative uncharacterized protein n=2 Tax=Acinetobacter baumannii ATCC 17978 RepID=A3M7T2_ACIBT Length = 382 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. 
Identities = 66/373 (17%), Positives = 122/373 (32%), Gaps = 34/373 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M +L+ F K+P + K + IS +++ L ++ R +G Sbjct: 1 MLLNFSLDKDIFKNDFLYKKPYLFKSAID--SSGISWNDVNELYSRGDISHRDFKLMNGY 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMR------PFRELPDWRIDDLM-- 112 ESY+ LG + + + + A L+R PF + +I Sbjct: 59 EVPKKEYIESYECLGVIEYRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEA 118 Query: 113 ------ISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE 166 + H D DV+ +Q GR+RW + + D E Sbjct: 119 RTLVGGYAAFSSKSSYKSHWDTRDVYAVQLLGRKRWILRKPNFEFPLYMQQTKNFPDIKE 178 Query: 167 A---IIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRE 222 +D LE GDILYIP G+ H+ L+ + +V AP E + Q Sbjct: 179 PEEIYMDVILEAGDILYIPRGWWHDPLPLDEETFHLAVATFAPTGFEYMRWL-----QNI 233 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIA 282 + G + + + +D + E+I +++ + +++ + Sbjct: 234 MPGILDCRKNFTNFEN----DVEMIDSFSHQVAEIIKDKNYYQSFMVHHLAEQSVPSML- 288 Query: 283 PPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP--HRPALDALASNI 340 + + L + L + V NG KI+ + L N Sbjct: 289 --SLDILGNGKINHLNHNQKLYLNASILYYFDEGFVIINGNKINIDGISLSLIQYLFENP 346 Query: 341 ALTAENFGDALED 353 T ++ D + Sbjct: 347 YSTVKDVLDQFNE 359 >UniRef50_C1EHB5 Predicted protein (Fragment) n=2 Tax=Micromonas RepID=C1EHB5_9CHLO Length = 387 Score = 184 bits (468), Expect = 4e-45, Method: Composition-based stats. 
Identities = 58/361 (16%), Positives = 110/361 (30%), Gaps = 49/361 (13%) Query: 9 WPDFLERHWQKRPVVLKRGF--NNFIDPISPDELAGLA----MESEVDSRLVSHQDGKWQ 62 F+ W++RP + R F +S ++ M + + + S++DG + Sbjct: 14 LETFMRDIWERRPAYVSRNAHKGYFDGLLSKADIDEWLRAGKMRYQRNVDVTSYKDGVRR 73 Query: 63 VSHGPFES-------------------YDHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 + + + + SL V W +P + Sbjct: 74 THNLNDDGSGGVDATTGEPGFADADTVWRRFEQEGCSLRVLHPQRWRDPLWKTLAALERF 133 Query: 104 PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPDLL 160 + G PH D D FI+Q G++ WRV +M P+ Sbjct: 134 WNCSTGCNCYLTPADSQGFSPHYDDIDAFILQLEGKKLWRVYPPRSEAEMLPRYSSPNFG 193 Query: 161 QVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA---PNTRELISGFADY 217 Q D E +++ LEPGD+LY+P G H+ + + V +L+ Sbjct: 194 QDDVGEPVLEVILEPGDLLYMPRGTVHQANCVPGDHSLHVTLSTNQFNTWADLLEVAFPA 253 Query: 218 VLQRELGGNYYSDPDVPPR------------AHPADVLPQEMDKLREMMLELINQP--EH 263 L++ + PP + L E+ ++ + + Sbjct: 254 ALRQAVAEVPALRRCPPPDYLAHLGLVDGEFDAVNPRRDDLIGALVELAQCVMRRLPFDA 313 Query: 264 FKQWFGEFISQSRHELDIAPPEPPYQPDEIYDAL----KQGEVLVRLGGLRVLRIGDDVY 319 G + + R + P A + + + G R++ D V Sbjct: 314 AADHIGARLMRQRLPPPPSHVSAPVSATGANAAATVTDETRVRITQEEGARLVVEDDAVV 373 Query: 320 A 320 Sbjct: 374 V 374 >UniRef50_A1KTI5 Putative uncharacterized protein n=2 Tax=Neisseria meningitidis RepID=A1KTI5_NEIMF Length = 382 Score = 183 bits (466), Expect = 6e-45, Method: Composition-based stats. 
Identities = 58/384 (15%), Positives = 121/384 (31%), Gaps = 35/384 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ + +F E + K+P + K + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMEYKEFNENYLYKKPFIFKNALD--VSSISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM-RPFRELPDWRIDDLMISFSVPG 119 ES++ +G+ + AV + + A ++ P I+ Sbjct: 59 IIPKEAYVESFNDVGKIRYRFNKTAVYQYLQDGATMVYNRIDNEPFVDSIAKQIAQFAQA 118 Query: 120 GGVGP-------------HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL---LQVD 163 V H D DVF +Q G + W + Sbjct: 119 QTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGTKHWTLSAANFDMPLYMQQAKDIPHITP 178 Query: 164 PFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRE 222 P ++ LE GDILYIP G+ H + + ++G PN + + + Sbjct: 179 PTTVDMEVILEAGDILYIPRGWWHNPMPMNCETFHLAIGTFPPNGYNYMEWLMKKIPDIQ 238 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIA 282 + + +D + + E++N P++++ + +F+ R Sbjct: 239 SIRQNFI---------GWQHDQKNLDDAAQAITEMMNNPKNYQTFMQDFLGSQRTNTAF- 288 Query: 283 PPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP--HRPALDALASNI 340 + L + L + ANG K++ L L Sbjct: 289 --NMDLFGNAHNQTLPEHCFLRLNSNDYSTISQGYLIANGIKLNIDEISMEFLMILIEKH 346 Query: 341 ALTAENFGDALEDPSFLAMLAALV 364 ++ + + ++ L+ Sbjct: 347 IISLTEILSLF-NANKQEIVKRLI 369 >UniRef50_B9BV10 Cupin superfamily protein n=5 Tax=Proteobacteria RepID=B9BV10_9BURK Length = 384 Score = 183 bits (466), Expect = 7e-45, Method: Composition-based stats. 
Identities = 66/401 (16%), Positives = 121/401 (30%), Gaps = 50/401 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M +++ DF + +KRP+++K + + S ++ + S V S Sbjct: 1 MSISFSVSPKDFALDYQEKRPLLMKGAVS--LRNFSWRDVNEIFERSNVASDDFKLTFDG 58 Query: 61 WQVSHGPFESYDHLG---------------ETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 + ES+ +G +L+ + L R E Sbjct: 59 IRPKSEYIESWWDIGTLRHRLIKPVVYDYLRKGATLIANKI-ATEPKVNQLSRQLIEFTG 117 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV--- 162 ++ H D DVF IQ GR+RW + E Sbjct: 118 RQVVSSAYLAFGERDSFRCHWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLY 177 Query: 163 -DPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQ 220 P +D LE GD+LY+P G+ H + + + G + +S + + Sbjct: 178 PCPDTPYMDVMLEAGDLLYLPRGWWHNPLPVGEATFHLAFGTFPAYVIDYLSWAINRMPH 237 Query: 221 RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELD 280 SD + + + + + I E+++++ + R E D Sbjct: 238 LLDARRSLSD---------WENDKNVLASIGQQFEDFICTRENYRRFLDDRTGAIRIETD 288 Query: 281 IA-----PPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPH--RPAL 333 +A P PDE L R G R V ANG K++ + Sbjct: 289 LALETLGNPAVSAIPDESRIRLSA----YRPPG----RDDRYVIANGTKVNLDDQGATLI 340 Query: 334 DALASNIALTAENFGDALEDPSF---LAMLAALVNSGYWFF 371 + ++ + ++ L F Sbjct: 341 RLIVERPGISLGTLISGFANTDADRIRDLVTDLCRQDILEF 381 >UniRef50_C7NJK3 Cupin superfamily protein n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NJK3_KYTSD Length = 414 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. 
Identities = 73/406 (17%), Positives = 126/406 (31%), Gaps = 58/406 (14%) Query: 3 YQLTLNWPDFLERHWQKRPVVLK---RGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 L DF ER W P + R + F D S D + L + + + Sbjct: 20 RLLAAAPADFAERSWGTTPRHVPATDRAGDTFTDLFSLDAVDDLLTHRGLRTPFIRMAQD 79 Query: 60 KWQVSHGPF----------------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 + F + L ++++Q ++ P A R + Sbjct: 80 GTTLPENRFTRGGGTGAGASDQVDEDRVRSLFAGGATIVLQGLHRTWPPIAEFARELGDE 139 Query: 104 PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------ 157 + G H D +DVF++Q G + W + E + P Sbjct: 140 LGHPVQVNAYITPPQNQGFSAHYDVHDVFVLQVHGTKHWTLHEPVVAHPLRDQPWDTVRE 199 Query: 158 -DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFA 215 + +ID L PGD+LY+P G H A + + ++G + Sbjct: 200 AVAHRAAQDAPLIDAVLAPGDVLYLPRGTIHAAAAQGEISAHLTIGVHTWTPDHVTGAVL 259 Query: 216 DYVLQRELGG-NYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN--QPEHFKQWFGEFI 272 D V R ++ + R A V+ +++LR + E I+ E ++F + Sbjct: 260 DAVRSRLRDQPTVRANLPLGARPDDAAVVGPTLEQLRGALHEAIDSLDAEELARYFRPQV 319 Query: 273 SQSRHELDIAPPEPPYQPDEIYD----ALKQGEVLVRLGG------LRVLRIGDDVYANG 322 +R P + + AL+ G L R+ Sbjct: 320 RVTRRPDPAPPLAQLAAAEGLGTASTVALRGGLELGDDPARPDRLHTRLGWFD------- 372 Query: 323 EKIDSPHRPALDALASN-IALTAENFGDALEDPSFLAMLAALVNSG 367 +D R + AL L A P+ L ++ LV G Sbjct: 373 --LDEDARAVVTALQDGPRRLDAL--------PAPLTVVQDLVRRG 408 >UniRef50_C6SMA2 Myc induced nuclear antigen n=24 Tax=Neisseria RepID=C6SMA2_NEIME Length = 382 Score = 180 bits (458), Expect = 6e-44, Method: Composition-based stats. 
Identities = 49/368 (13%), Positives = 112/368 (30%), Gaps = 36/368 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M ++ F E + K+P + K+ + + IS E+ L ++ + G+ Sbjct: 1 MHINFSMERKYFHENYLYKKPFIFKKALD--VSCISWKEINELYQRADPTDWQFKFRKGE 58 Query: 61 WQVSHGPFESYDHLG---------------ETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 ES++ +G + +++ +++ + + + Sbjct: 59 IIPKEAYVESFNDVGRIRHRFNKTAVYQYLQDGATMVYNRIDN-EPFVDTIAKQVAQFAQ 117 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL---LQV 162 + H D DVF +Q G++ W + Sbjct: 118 AQTVVSGYLAFGSSSSYRNHWDTRDVFAVQLIGKKHWTISAPNFDMPLYMQQAKDMPHIT 177 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQR 221 ++ LE GDILYIP G+ H + + ++G PN + + Sbjct: 178 PSKTVDMEVILEAGDILYIPRGWWHNPMPMNCETFHLAIGTFPPNGYNYMEWLMKKIPDI 237 Query: 222 ELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDI 281 + + + + +D + + E++N P++++ + +F+ R Sbjct: 238 QSIRQNFI---------GWEHDQKNLDDSAQAVTEMMNNPKNYQAFMQDFLGNQRTNTAF 288 Query: 282 APPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHR--PALDALASN 339 + L + L + NG K++ L L Sbjct: 289 ---NMDLFGNAHNQTLPEHCFLRLNSNDCSTLPQGFLIVNGIKLNVDESSMKFLTILVDK 345 Query: 340 IALTAENF 347 ++ Sbjct: 346 YMISLAEI 353 >UniRef50_Q4D641 Putative uncharacterized protein n=1 Tax=Trypanosoma cruzi RepID=Q4D641_TRYCR Length = 476 Score = 180 bits (457), Expect = 8e-44, Method: Composition-based stats. 
Identities = 58/339 (17%), Positives = 108/339 (31%), Gaps = 40/339 (11%) Query: 2 EYQLTLNWPDFLERHWQKRPVVL-KRGFNNFIDP--------ISPDELAGLAMESEV--- 49 ++ L +F +++K+P+ +F + S + + LA E + Sbjct: 30 QWLLGKTQKEFFRHYFEKKPLHFSHGAATHFTEVQDGLPAVKWSTELMLQLAAEKSLSYT 89 Query: 50 -DSRLVSHQDGKWQVSHGPFE--------SYDHLGETNWSLLVQAVNHWHEPTAALMRPF 100 D +V Q PF H WS+ + + +A++ Sbjct: 90 TDINIVRF--DAVQKKRVPFRSEGIVTEKEMKHSMRKGWSVRFLRPHEYIVENSAVLAML 147 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHP 157 E G PH D DVF++Q G + WR+ + + Sbjct: 148 EEAFACSCGLNSYWTPANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPERVDVLSRHSSE 207 Query: 158 DLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA---PNTRELISGF 214 D + + L PGD+LY+P G H+G +A + V F A +L+ Sbjct: 208 DYNPEELPKPTQIFRLFPGDVLYMPRGTVHQGRKYNHAHSLHVTFSANQMNTWADLMKHA 267 Query: 215 ADYVLQRELGGNYYSDPDVPPRA--------HPADVLPQEMDKLREMMLELINQPEHFKQ 266 +V+++ + +P HP + L E + + + Sbjct: 268 VTHVVEKLAANYIHWRRSLPRSLLKRLGAIYHPTFRKESGLALLTEKRKK---RRRLLQL 324 Query: 267 WFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 F + ++ H L L + + L R Sbjct: 325 SFRQMAAEVFHHLSSEKFIDECCDVYARSTLAKQQPLPR 363 >UniRef50_B1FB07 Cupin 4 family protein n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FB07_9BURK Length = 380 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 71/393 (18%), Positives = 132/393 (33%), Gaps = 51/393 (12%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 +L + F+E ++ KRP++ + N + +S +E++ E ++ G + Sbjct: 2 IELNMKASHFVENYFDKRPILFRGALRN--NFLSWEEVSEAIYIGESMTQGPRLNKGGFL 59 Query: 63 VSHGPFESYDHLGE---------------TNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 + LG+ +L+ + + + Sbjct: 60 DESKYIVNCGELGQVRRRLEKGILYDELRNGTTLVFNRMELTLYKVRLICKSISRFVGEH 119 Query: 108 IDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH--CPHPDLLQVDPF 165 G H D + VF +Q GR+RW V E P Sbjct: 120 TVANGYIAFGEEESFGKHWDTHSVFAVQMMGRKRWLVYEPTHALPLKHQRSTGKQSECPA 179 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVL----- 219 E +D +E GDILY+P G+ H L E + +VG + I AD ++ Sbjct: 180 EPYMDVTIETGDILYLPRGWWHTAIPLNEETFHLAVGVHESTISDYIKYLADEIIGDFDA 239 Query: 220 -QRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 ++ + D D+ A+ E+ ++ E LIN + ++ F E SR Sbjct: 240 FRQTIPLGERRDIDLRLVAN-------ELARIVEDRNVLINYNDRRRRNFRE---ASRPN 289 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP---HRPALDA 335 L + +Q D K+ + L + NG+ + R Sbjct: 290 LQLHAFRSKFQLD------KEAKFLANTPLRSLAFDEG---INGQSVPIGRNLERLHNFI 340 Query: 336 LASNIALTAENFGD---ALEDPSFLAMLAALVN 365 AS A++ + D + F +++ L+ Sbjct: 341 FASTGAVSYKELRDCACEITSEEFDSLILKLLE 373 >UniRef50_Q4Q6P0 Putative uncharacterized protein n=3 Tax=Leishmania RepID=Q4Q6P0_LEIMA Length = 624 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 61/448 (13%), Positives = 128/448 (28%), Gaps = 81/448 (18%) Query: 2 EYQLTLNWPDFLERHWQKRPVV--------LKRGFNNFIDPISP------DELAGLAMES 47 + L + +F ++++++ +V G + P++ + + Sbjct: 168 SWLLNTSRGEFFRKYFERKHLVASHGSGEYFASGLPGVVPPVNWSTERMLEHVKTHPSRY 227 Query: 48 EVDSRLVSHQ----DGKWQVSHGPFE--SYDHLGETNWSLLVQAVNHWHEPTAALMRPFR 101 D +V + G + + + WS+ N + E +A + Sbjct: 228 GADLDIVKFDPKLKRRVSYRTKGLVDAAELEACMKDGWSVRFLRPNEFIESNSAFIGCIE 287 Query: 102 ELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL---QMKQHCPHPD 158 + + G PH D DVF +Q G + W + + + D Sbjct: 288 KEFNCYCGANSYWTPANSQGFAPHYDDVDVFFLQLEGEKLWCLYDPPEDVDVLARHSSED 347 Query: 159 LLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA---PNTRELISGFA 215 L+ GD+LY+P G H+G + + F A + + +S A Sbjct: 348 YAPERFPTPKHTITLKAGDVLYMPRGTVHQGKTTLKTHSLHITFSANQMNSWADFMSRAA 407 Query: 216 DYVLQRELGGNYYSDPDVPP--------------------------RAHPADVLPQEMDK 249 Y ++ +P + L ++ + Sbjct: 408 QYTVETLAANKLEWRRALPRDMPQVMGEVNNPVFRDTHGLPALSANQQDHRANLQTKVRE 467 Query: 250 LREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDA-LKQGEVLVR--- 305 + + L+ + + + +L PP Y L + Sbjct: 468 MVAELTLLLTDEVNMDVCADVYGKDTIQKLQ--PPSRTYSGAAPASRKLSHASRVRLISR 525 Query: 306 ---------LGGLRVLRIGDDVYA--NGE----KIDSPHRPALDALASNI----ALTAEN 346 G RV IG + GE + ++ PA+ L S+ ++A Sbjct: 526 NCMRLLLNVPGEARVYHIGQNSTVCLAGELGELRFEADFAPAIATLLSSYPKTMPVSALP 585 Query: 347 FGDALEDPSFLA----MLAALVNSGYWF 370 F ++ L ++G Sbjct: 586 FPGFDNSDDVAENQLLLVETLRDAGLLH 613 >UniRef50_B0BQ44 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=B0BQ44_ACTPJ Length = 396 Score = 178 bits (453), Expect = 2e-43, Method: Composition-based stats. 
Identities = 59/303 (19%), Positives = 115/303 (37%), Gaps = 45/303 (14%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEV--DSRLVSHQDGKW 61 +L++ F ++K+P+V+K N D +S +E+ + ++ + + GK Sbjct: 14 NFSLSYEKFKYEFFEKKPLVIKGAIRN-KDLLSWNEINEIFPRCKLIGEEEIKVMYKGKK 72 Query: 62 QVSHGPFESYDHLG---------------ETNWSLLVQAV------NHWHEPTAALMRPF 100 ESY+ LG +L+ + + + + A + Sbjct: 73 VPKEYYVESYNDLGTLRYKFKEEELYCLMRDGATLIANGIVNEPAIDIFSQEIAKFTKCH 132 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL 160 L ++F+ H D D+F IQ GR+RW + H Sbjct: 133 IF------SSLYVAFNTQR-SFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKD 185 Query: 161 QVDPF----EAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFA 215 + F + ID LE GDILY+P G+ H+ + E ++ +VG P T + +S Sbjct: 186 MPEYFPNKDDVYIDILLEAGDILYLPRGWWHDPIPVGEETVHLAVGVFPPYTNDYLSWVT 245 Query: 216 DYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQS 275 + +++ E+ ++ + + L + + IN ++F + F + Sbjct: 246 ENIVKNEIA---------RKSLSSSEKNDEIISLLSAEVADFINNKDNFNIFLESFYDKK 296 Query: 276 RHE 278 R E Sbjct: 297 RIE 299 >UniRef50_A9C261 Cupin 4 family protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C261_DELAS Length = 298 Score = 178 bits (453), Expect = 2e-43, Method: Composition-based stats. 
Identities = 45/285 (15%), Positives = 88/285 (30%), Gaps = 27/285 (9%) Query: 3 YQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ 62 L F E + +++ S ++ S++ + Sbjct: 5 INFDLTKEAFNEGIADRNIHLVRGAVE--PMLFSWSDMDSALYYSDITPPFMHLHKNGII 62 Query: 63 VSHGPFESYD---------------HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 S E + L +T + ++ +++ E L + Sbjct: 63 PSEMYIEEFKAGSRTMQRLDTHAVQSLLKTGATAILNRIDNRQELVRRLCEEVASFTNAE 122 Query: 108 IDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH--CPHPDLLQVDPF 165 G G H D +DV IQ G++ WRV P Sbjct: 123 TTANAYLAFSGEGSFGSHWDTHDVMAIQLIGKKHWRVYAPTYKSPLPGQTSKSFDSTCPT 182 Query: 166 EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGG 225 + I D LE GD+LY+P G+ HE + ++ ++G P+ ++ F + Sbjct: 183 DPIFDGVLEAGDLLYVPRGWWHEVLPIGETLHVAIGIYPPHVLNYVAWFLE--------K 234 Query: 226 NYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGE 270 N ++ + + ++ E +N PE K + + Sbjct: 235 NIKHHEELRKTLRSCTTTKEVVSNACHVLTEGLNDPEVLKAFMDD 279 >UniRef50_UPI0000523E0E PREDICTED: similar to MYC induced nuclear antigen n=1 Tax=Ciona intestinalis RepID=UPI0000523E0E Length = 490 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 41/290 (14%), Positives = 92/290 (31%), Gaps = 34/290 (11%) Query: 8 NWPDFLERHWQKRPVVL---KRGFNN--------------FIDPISPDELAGLAM----E 46 + F E +W++R + + K G + F + + L + + + Sbjct: 40 SIERFYEYYWEQRHLYIPCLKSGSGDNCQDKRLPGTRSSYFNKLFNHEILKEVVLSKKLK 99 Query: 47 SEVDSRLVSHQDGKWQVS----HGPF---ESYDHLGETNWSLLVQAVNHWHEPTAALMRP 99 + D D K HGP + + + +L +H+ + Sbjct: 100 YDKDICACRFDDEKKCRVNAEVHGPVTAEKVHSLFHDDKMTLQFHQPQRFHDELWKIQEK 159 Query: 100 FRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL 159 ++ + G+ PH D +VFI+Q G + W++ + D Sbjct: 160 LESFFGSQVGSNVYMTPDGSQGLAPHHDNVEVFILQLEGEKEWKLYSPVVNLPRNSSSDF 219 Query: 160 LQVD--PFEAIIDEELEPGDILYIPPGFPHEGYA---LENAMNYSV-GFRAPNTRELISG 213 + ++PGD+LY P G H+ + ++ + ++ + + I Sbjct: 220 DDSTVKGLTLLDTIIMKPGDVLYFPRGTVHQAKSIKGTGHSTHLTISTYETQCWGDYILD 279 Query: 214 FADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEH 263 F Y+ +P + + L++ ++ L Sbjct: 280 FIPYLTDAAADKVVSLRRGLPRKYYNQTSTDGFKQNLKKALISLAESLTE 329 >UniRef50_B6BWI1 Putative cytoplasmic protein n=1 Tax=beta proteobacterium KB13 RepID=B6BWI1_9PROT Length = 346 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. 
Identities = 95/374 (25%), Positives = 152/374 (40%), Gaps = 41/374 (10%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPI-SPDELAGLAMESEVDSRLVSHQDGKWQ 62 +L LN F++ +W K+ L G NF D D+L L ++ R + QDG+ Sbjct: 2 ELVLNKKCFVKSYWGKKHFFLPGGIKNFNDNFVDLDDLN-LPSSKALE-RKIFIQDGRKY 59 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 ++ + ++ S L NH H+ + + F +P + IDD+MIS S G V Sbjct: 60 INFTNVKKKLNVNTPK-SKLFYKTNHIHQLSFEVKNLFDFIPQYLIDDVMISLSNTKGSV 118 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 G H D Y VF+IQG G + W++ E + + ++ GDILY+P Sbjct: 119 GKHKDNYSVFLIQGKGIKNWKIYEN------------------KKVFSYTVKEGDILYVP 160 Query: 183 PGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYV--LQRELGGNYYSDPDVPPRAHP 239 PG H G + YSVGFR+P++ L F DY+ L + ++ + + Sbjct: 161 PGIDHYGISQSEICNTYSVGFRSPDSLNLKEIFNDYIFNLLDQTSTIFFQNKLFSKQK-- 218 Query: 240 ADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQ 299 A + D + L +P ++ G +++ EL + + D LK+ Sbjct: 219 ASIPDDIKDFF---IRHLDCKPIILDEFIGIYLTSVDLEL---FKKKEITLKKFKDQLKR 272 Query: 300 GEVLVRLGGLRVLRIGDDVYANGEKIDSP--HRPALDALASNIALTAENFGDALEDPSFL 357 L R L G + Y NG KID R + + A+N + Sbjct: 273 M-PLFLNQMTRALYFGKNFYINGFKIDIETNSRKEFRKFFNESTIIAKNL-----NNKST 326 Query: 358 AMLAALVNSGYWFF 371 +L L Y F Sbjct: 327 LLLYKLFKKEYIVF 340 >UniRef50_B7G6P1 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6P1_PHATR Length = 351 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. 
Identities = 56/347 (16%), Positives = 112/347 (32%), Gaps = 45/347 (12%) Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG-GGVGPHLDQY 129 + ++ + + + T L+ I G G PH D Sbjct: 1 LWSRFDQGCTIRLLCPHKHSDSTHGLLSLLESEWTCMIGANAYLTPPSGSQGFAPHYDDI 60 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCPHPDL-----LQVDPFEAIIDEELEPGDILYIPPG 184 + F +Q G++RW+V LQ + P + E +D L+PGD+LY+P G Sbjct: 61 EAFCLQLEGKKRWKVYAPLQKSERLPRTSSEDYVEADLRDVEPALDVVLKPGDVLYMPRG 120 Query: 185 FPHEGYALEN----AMNYSVG-FRAPNTRELISGFADYVLQRELGGNYY-SDPDVPP--- 235 + H+ ++ +++ +V + +L+ LQ G+ +P Sbjct: 121 WIHQACTIDGTDGYSLHLTVSAMQQWAWADLMELLLPEALQSAASGDSTMLRQGLPRGFL 180 Query: 236 --------RAHPADVLPQEMDKLREMMLELINQPEHFKQWFGE----FISQSRHELDIAP 283 + A++L Q+ ++ R ++ + + F+S + Sbjct: 181 NYMGAMYDQKDTAEILEQKAEQDRTAAMDETGAIDMLDAACDQIGKRFLSDRVPPVLTHL 240 Query: 284 PEPPYQPDEIYDALKQG---------EVLVRLGGLRVLRIGDD-------VYANGEKIDS 327 + L Q LV G VL D + + + + Sbjct: 241 ERSMTVHESDAKVLPQTLCRMARPGSGRLVLEAGKAVLYHCADNSRVYHELPLSPMEFEM 300 Query: 328 PHRPALDALASNIALTAENFGDALEDP--SFLAMLAALVNSGYWFFE 372 PA++ L + D + D + + AL + G + Sbjct: 301 DDAPAMEQLLTTTEHDWVRVADLIHDSIEDKVGVAQALYDEGILCIQ 347 >UniRef50_C9N2N9 Cupin 4 family protein n=4 Tax=Streptomyces RepID=C9N2N9_9ACTO Length = 402 Score = 173 bits (439), Expect = 9e-42, Method: Composition-based stats. 
Identities = 62/370 (16%), Positives = 110/370 (29%), Gaps = 34/370 (9%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 L+ DF W + + + + D S + L + + + G Sbjct: 22 SRLTGLSREDFARDVWARTAALTRGASDF-SDVFSSSAVDELISRRGLRTPFLRVAKGGT 80 Query: 62 QVSHGPFES----------------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 + F + +L++QA++ EP A L+ Sbjct: 81 TLPESSFTAPAGVGATIGDQLDDTALWRAFADGATLVLQALHRTWEPVAGLVSELSTELG 140 Query: 106 WRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--DLLQ-- 161 + G H D +DVF++Q G +RW V E + P D Q Sbjct: 141 HPVQANAYVTPPQNRGFDAHYDVHDVFVLQIEGTKRWIVHEPVLPDPLRDQPWTDHRQAV 200 Query: 162 ---VDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYV 218 A +D L PGD+LY+P G+ H A ++ + A+ + Sbjct: 201 ADAAARSTAHLDTVLGPGDVLYLPRGWLHSARAQGE-VSIHLTLGVHTWTRYA--LAEQL 257 Query: 219 LQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 + L P A E + +RE +L + + + + ++ R Sbjct: 258 TRAALAALRDDPPMRRSLPLGAGGQDDERELVRERLLAAVAEADPGPSFERARRAEGRP- 316 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALD--AL 336 P P + L + L G ++ +D P L Sbjct: 317 ----APLGPLAQLSALNGLGPTTPVRLREALEARLAGTRLFTRVGYLDLPASDLAPVTRL 372 Query: 337 ASNIALTAEN 346 TA + Sbjct: 373 LDGGIRTAGD 382 >UniRef50_Q47NS9 Putative uncharacterized protein n=1 Tax=Thermobifida fusca YX RepID=Q47NS9_THEFY Length = 395 Score = 172 bits (437), Expect = 2e-41, Method: Composition-based stats. 
Identities = 61/347 (17%), Positives = 116/347 (33%), Gaps = 36/347 (10%) Query: 27 GFNNFIDPISPDELAGLAMESEVDS-RLVSHQDGKWQVSHGPFESYDHLG---------- 75 G ++ D L+ L +++ RL H+ G E + G Sbjct: 33 GAETVAPLLTFDALSELLSTHQLEPPRLRLHRAGAPVPLDNYTEVGEASGVQRRLVRPEA 92 Query: 76 -----ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 SL++ ++ H P A L R+ + G H D +D Sbjct: 93 LYAQLRQGASLVLDGIDRIHPPIRAAADDLMRLVHERVQVNLYLIWGDSHGFNTHWDDHD 152 Query: 131 VFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEG 189 FI+Q G + W+V + D P + + + G++L++P G+ H Sbjct: 153 TFIVQVAGTKHWQVHGQGTRPYPMKEDIDHSHQPPEGTVWEGTVRAGEVLHVPRGWWHTV 212 Query: 190 YALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMD 248 + +M+ + GF + D + + E ++ D+P A P + + + Sbjct: 213 TGTGDVSMHLTFGFTRATGVDWARWLVDRLYEVE-----FARRDLPRFATPEERRKHQHE 267 Query: 249 KLREMMLELINQPEHFKQWFG----EFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 LR +M + + ++ F + L A QPD + + + Sbjct: 268 LLRHLMD--LAEQHGLDEFLTDRDSRFPRRQSFSLPWAVDGATPQPDTVVEFTPILPSAL 325 Query: 305 RLGGLRVLRIGDDVYANGEK--IDSPHRPALDALASNIALTAENFGD 349 R G +V + G + +P L+ L LT + Sbjct: 326 RDEGQKVA-----LTVAGRRYTFAKAAQPLLEVLVDARVLTVAELAE 367 >UniRef50_D2VJG1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VJG1_NAEGR Length = 542 Score = 170 bits (432), Expect = 6e-41, Method: Composition-based stats. 
Identities = 41/250 (16%), Positives = 88/250 (35%), Gaps = 21/250 (8%) Query: 6 TLNWPDFLERHWQKRPVVLKRG--FNNFI-DPISPDELAGLAMESEV----DSRLVSHQD 58 ++ + +KR +V++R + ++ S DE+ ++ E+ D L +++ Sbjct: 110 PIDMDKLYQEFVEKRVLVIRRNEVYPDYYKGLYSLDEIKKTLVDHELRYSYDLDLALYRN 169 Query: 59 GKWQVSHG-------PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 G+ + P +D S+ + + +L+ F E Sbjct: 170 GRRFTLNPNKDDVADPTLVWDLYENEKCSIRMLRPQEHSDVLLSLLCHFEEYFGQGAGLN 229 Query: 112 MISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI--- 168 G PH D + F+IQ G + W++ L+ +Q+ E Sbjct: 230 AYLTPAGSQGFAPHYDDIEAFLIQLEGEKHWKIYRPLENQQYLDRFSSKNFTQEEVAGFE 289 Query: 169 -IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP---NTRELISGFADYVLQRELG 224 + L+PGD+LY+P G H+ ++ + + + + + LQ Sbjct: 290 CFEILLKPGDMLYVPKGVIHQAVTSQDQHSLHITVSTSHLMSWTDYLEKALPLALQMATE 349 Query: 225 GNYYSDPDVP 234 + +P Sbjct: 350 NHVDLRTALP 359 >UniRef50_C3ZLE4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZLE4_BRAFL Length = 572 Score = 169 bits (428), Expect = 2e-40, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 67/183 (36%), Gaps = 13/183 (7%) Query: 6 TLNWPDFLERHWQKRPVVLKRGF----NNFIDPISPDELAGLAMESEVD----SRLVSHQ 57 + + F +W+K+P++ KR + S D L L + +++ + + Sbjct: 188 PVTYEQFFAEYWEKKPLIAKRNDAAVSEAYKALFSRDVLKKLLKKHDIEYIRDVNVCRYV 247 Query: 58 DGKWQVSHGP----FESYDHL-GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM 112 GK + +G + D L ++ +L + + L L + + Sbjct: 248 SGKRESLNGTERATCKQIDKLFDQSKATLQFHQPQRFQDKLWQLCSLLECLFGCLVGANV 307 Query: 113 ISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEE 172 G+ PH D +VFI+Q GR+ WR+ DL Q + + D Sbjct: 308 YMTPPGSQGLAPHYDDVEVFILQLEGRKHWRLYTPPVDLPRDYSRDLEQDNIGQPTHDFV 367 Query: 173 LEP 175 LE Sbjct: 368 LEE 370 >UniRef50_B7FZB3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FZB3_PHATR Length = 492 Score = 168 bits (427), Expect = 2e-40, Method: Composition-based stats. 
Identities = 102/425 (24%), Positives = 175/425 (41%), Gaps = 71/425 (16%) Query: 11 DFLERHWQKRPVVLKRGFNN------FIDPISPDELAGLAMESEVDS-RLVSHQDGK--- 60 D L +W + P++++ F+ + ELA E DS R+++H G+ Sbjct: 68 DLLTNYWGRSPLLIRSAFHAEALTEVWPSQADLLELALDDDEISSDSARIITHTSGRLDS 127 Query: 61 WQVSHGPFE----SYDHLGETNWSLLVQAVNHWHEPTAALMRP-FRELPDWRIDDLMISF 115 + GPF G+ W+L+V V+ + A M F LP WR DD IS Sbjct: 128 FASQLGPFSTSTIQGLEHGDKMWTLIVNDVDRYVSTLADWMDDEFGFLPRWRRDDAQISM 187 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ----HCPHPDLLQV-------DP 164 + GGG+GPH+D YDVF+ Q +G+R W VG + +++ P + + + Sbjct: 188 ARTGGGIGPHVDSYDVFLTQTSGQRTWLVGNTMTVQEEMNTLIPDLSVRILRDVSNHNES 247 Query: 165 FEAIIDEELEPGDILYIPPGFPHEGYAL-ENAMNYSVGFRAPNTRELISGFADYVLQ--R 221 A EL+PGD+LY+PP + H G AL ++ + SVG R+P++ EL++ A+ +L Sbjct: 248 SHAYTRLELQPGDVLYLPPRYVHWGTALTDDCVTLSVGARSPSSAELVARIAETMLGSVS 307 Query: 222 ELGGNYYSDPDVPPRAHPADVLP---QEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 Y+DPD+ + A + D ++ M+L+ +++ + E +++ Sbjct: 308 VHAVQRYTDPDLLQEVNGAPLHSMTNHAKDSMKTMVLDAVHEITDDPMRWDELVAK---- 363 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGG--------------------------LRVL 312 L P Y+ +K E L GG RV Sbjct: 364 LATEPKRMSENALVPYNEIKDSEYLAIWGGTPRDALARIREGRGALYRIEGVSFATSRVE 423 Query: 313 RIG---DDVYANGEKI----DSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVN 365 G + ++A+G D L + +T + +L L++ Sbjct: 424 YDGVITERLFAHGSMWEICDDELATAVLCRIEKGKPITISHIEGL--SAPLAELLTNLIS 481 Query: 366 SGYWF 370 G + Sbjct: 482 EGILY 486 >UniRef50_B8BSJ2 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BSJ2_THAPS Length = 830 Score = 167 bits (424), Expect = 4e-40, Method: Composition-based stats. 
Identities = 63/533 (11%), Positives = 144/533 (27%), Gaps = 171/533 (32%) Query: 8 NWPDFLERHWQKRPVV-----------------------LKRGFNN------FIDPISPD 38 + +F +R+W KRP++ L++ + F+ S D Sbjct: 283 SVSEFYKRYWGKRPLLAALEESEDGANENDYEDGGEDVKLQQQLEHATRLNGFLHRKSID 342 Query: 39 ELAGLA-MESEVDSRLVSHQDG----------------KWQVSHG--------------- 66 ++ + +D + + D K + G Sbjct: 343 DMIRKNKLRYGLDLNVTRYTDSLGNGTRHRITLDPPPKKRKSKTGSEGGAVDDVEYIVAN 402 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 P + + ++ ++ +L + + ++ +++ + + G PH Sbjct: 403 PTDVWTNVDASHCTLRLLRPHEHNDNIHSMLSLLESEFGCMVGSNAYLTPLHSQGFAPHY 462 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP-------FEAIIDEELEPGDIL 179 D DVFI+Q G +RWRV + ++ P E ++D L PGD+L Sbjct: 463 DDVDVFILQLEGYKRWRVYAPMNKQETLPRVSSRDYTEKEVEESMGEEVLDVVLVPGDVL 522 Query: 180 YIPPGFPHEGYAL--------------ENAMNYSVG-FRAPNTRELISGFADYVLQRELG 224 Y+P G+ H+ + ++++ +V + + + L+ Sbjct: 523 YLPRGWIHQAETVARPSHVSKLPGITDAHSLHLTVSAMQNWCWADFLEILMPEALESASA 582 Query: 225 GN--YYSDPDVPP-------------------------------------RAHPADVLPQ 245 +P ++ + Sbjct: 583 SETSISLRDGLPRNFLAYMGTMHQLDDEGGELPEGLKQVAEAYAKRVAKGADKDEEIDDE 642 Query: 246 EMD---------------------KLREMMLELINQPEHFKQWFGEFISQSRHELDIAPP 284 + ++ + + ++ + G+ R + P Sbjct: 643 ALAEVMHRQRTAALKKQFKEEAKKRIMRVAKQAMSMLDDACDQIGKRFLSDRLPPALLPH 702 Query: 285 EPPYQPDEIYDALKQGE-----------VLVRLGGLRVLRIGD-----DVYANGE----- 323 E + A + LVR R++ N Sbjct: 703 EATLTKESQSAAAHHHDASKKIWPNSLCRLVRPNIARLVIEDSKAVLYHCLDNSRVYQGT 762 Query: 324 -----KIDSPHRPALDALASNIALTAENFGDALEDP--SFLAMLAALVNSGYW 369 + + PAL+ L + + D + + + AL + G Sbjct: 763 PLSPMEFEIDDAPALEQLLTTVEPHWIMVKDLIHGDVEDKMEIAQALYDEGIL 815 >UniRef50_C9NEK8 Cupin 4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NEK8_9ACTO Length = 403 Score = 166 bits (420), Expect = 2e-39, Method: Composition-based stats. 
Identities = 50/356 (14%), Positives = 113/356 (31%), Gaps = 38/356 (10%) Query: 35 ISPDELAGLAMESEVDS-RLVSHQDGKWQVSHGP-----------FESYDHLGETNWSLL 82 S +L + V+ L G H L + SL+ Sbjct: 43 FSWRDLNEILSRGRVEPAELKLCTGGSSLPEHAYTVTRAGHRVVDLTRTFSLMRSGASLV 102 Query: 83 VQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRW 142 + +++ H A + + H D+ D F++Q G + W Sbjct: 103 IDSLDRIHPAVRAATDDVMRMVGETASCNLFVTFDDAQAFASHFDEVDTFVLQVLGTKSW 162 Query: 143 RVGEKLQMKQHCPHPDLLQVD-PFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSV 200 +V + + D P + + LEPGD++++P G+ H +++ + Sbjct: 163 QVHGPSEEHPLPEYGDSDPARCPEAVLFERTLEPGDVIHVPRGWWHTVRGGGESSLHLTF 222 Query: 201 GFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQ 260 F + + A L + P ++ + ++ +N+ Sbjct: 223 AFTRRTGYDWLRWVAYRALDSTDVRESLARAGTP---------EEQRAQAERLVTAFVNE 273 Query: 261 PEH--FKQWFGEFISQSRHELDIAPPEPPY----QPDEIYDALKQGEVLVRLGGLRVLRI 314 + + +F +S P + L+ + G R++ Sbjct: 274 AKALTLRDFFDAERRRSGGRDTACLPWDVLKARPSAGTFVELATVQAPLMEIRGERLV-- 331 Query: 315 GDDVYANGEKI--DSPHRPALDALASNIALTAENFGD-ALEDPSFL-AMLAALVNS 366 + A G++ + HR A + L + + + P+ + A+++AL+ + Sbjct: 332 ---LTAAGQEFVLPAVHREACETLVRARRVGTAELAERSGTSPAAVSALVSALLRA 384 >UniRef50_Q2JG11 Cupin 4 n=3 Tax=Actinomycetales RepID=Q2JG11_FRASC Length = 406 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. 
Identities = 60/375 (16%), Positives = 119/375 (31%), Gaps = 33/375 (8%) Query: 23 VLKRGFNNF---IDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF----------- 68 L+ + ++P L M + S + + ++ Sbjct: 27 FLRGTLPDSGICPRVLTPTRFLDLIMRRSLVSPQMRCFQNESELHPNSLLQMNTTRRGQV 86 Query: 69 ------ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 L ++ +L++ AVNH+ R F+ + + + G Sbjct: 87 TPMVDMRRLAGLLQSGCTLVLDAVNHFDPTLEVACRAFQWWLRAPVQANVYLTTGDAAGF 146 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 H D +DV ++Q G + W V + P + + + GD+LYIP Sbjct: 147 SLHWDDHDVIVLQLAGDKEWEVRGPSRRAPMYRDAAPNTEPPKDIVWSGTVNTGDVLYIP 206 Query: 183 PGFPHEG----YALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 G H +++ + GF + ++ AD + E+ + P+ H Sbjct: 207 RGHWHRASRTSRGDGFSLHATFGFTRRTGVDWLAWLADQSRREEVFREDLNQRGEDPKEH 266 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH---ELDIAPPEPPYQPDEIYD 295 D + + ++ + P H+ + S R+ PP + Sbjct: 267 QND-GEKIIVAASRLLTS--HPPAHYLESVAHATSAGRYVSTAGIFGPPSAVVCVTDFPP 323 Query: 296 ALKQG--EVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALED 353 ++ V V R++ + A G + LD ++S + G+ L Sbjct: 324 QIETQGDTVAVATAEKRIVFTRKALPALGLLLSGNPV-CLDYVSSAAGIDGARLGEILVR 382 Query: 354 PSFLAMLAALVNSGY 368 A L + SGY Sbjct: 383 EGICAELTPELFSGY 397 >UniRef50_A9V5A3 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V5A3_MONBE Length = 595 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. 
Identities = 46/371 (12%), Positives = 103/371 (27%), Gaps = 67/371 (18%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNN----FIDPISPDELAGLAME----SEVDSRL 53 + + F ++H+++ + + R + + S D+L L D L Sbjct: 54 DILAPMTTQQFFDKHFERSFLYIPREDRDPGIVYQGLFSLDQLYTLLQRESMFYGTDLNL 113 Query: 54 VSHQDGKWQVSHGPFESYDHLGETN----------------------------------- 78 + + V +G L N Sbjct: 114 CRYDGERKLVLNGGRNDTTDLPTINGNHSNSQRAEEQDSNDSDDSDELAEEALAADVRRR 173 Query: 79 -----WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFI 133 ++ + L+ F + + + G+ PH D +V+I Sbjct: 174 VEDLKATVQFHQPQRFVRALHDLLYSFEQELTTLVGANVYITPANSQGLAPHHDDVEVYI 233 Query: 134 IQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALE 193 +Q G + WR+ E ++ DL + + + I + L PGD LY+P G HE + Sbjct: 234 LQLEGEKAWRLYEPIEPLAMSYSADLDREELAQPIAELVLRPGDFLYLPRGTIHEASCVG 293 Query: 194 NAMNYSVGFRAP---NTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV-------- 242 N + + + N L++ + + +P Sbjct: 294 NQHSTHITISSHQNWNYGHLMAQTLPECITNAMSNVLELRRGLPHGFLRQHGVLHAGFPP 353 Query: 243 -LPQEMDKLREMML-ELINQP------EHFKQWFGEFISQSRHELDIAPPEPPYQPDEIY 294 +RE++ + + + + ++ + + P + Sbjct: 354 PPSAVFHHMRELLQSDQVWSEIERHFHQAVDSFALDYTTARLPPPALRRNPLNPAPTRVT 413 Query: 295 DALKQGEVLVR 305 + Sbjct: 414 PVFHATTKVRL 424 >UniRef50_Q7T3G6 MYC induced nuclear antigen-like n=6 Tax=Euteleostomi RepID=Q7T3G6_DANRE Length = 528 Score = 163 bits (413), Expect = 9e-39, Method: Composition-based stats. 
Identities = 63/456 (13%), Positives = 122/456 (26%), Gaps = 95/456 (20%) Query: 6 TLNWPDFLERHWQKRPVVLKRGF----NNFIDPISPDELAGLAME---SEVDSRLVSHQD 58 L+ +F +R W+++P+VL R + L L D Sbjct: 66 PLDLQEFFQRFWERQPLVLHRSDAALAGYYGSLFPLSGLRRLCARGLQYGTDINTCRCVR 125 Query: 59 GKWQVSH--GPFESY---DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMI 113 G+ ++ + G + E ++ + + + + + Sbjct: 126 GQKRLLNRAGAVDFCLLERDFLEKKATIQFHQPQRFQDELWRIQERLECFFGCLVGSNVY 185 Query: 114 SFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G+ PH D +V I+Q G++ WR+ E + + D L Sbjct: 186 ITPAGAQGLPPHYDDVEVLILQLEGQKHWRLYEPTVPLAREYSLE-PEGRIGAPTHDFIL 244 Query: 174 EPGDILYIPPGFPHEGYA---LENAMNYSVGFRA-------------------------- 204 + GD+LY P G H+ ++ + ++ Sbjct: 245 QAGDLLYFPRGTIHQADTPAGAGHSTHLTLSTYQNMCVCAVHNVTHTHTHTLQNVLVLHS 304 Query: 205 --------PNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA-DVLPQEMDKLREMML 255 + +L+ + + + +P + P +L + Sbjct: 305 FIVCVRACRSWGDLLLDLMPGCVFDRMKTDCELRTGLPRGLLTTPSISPAVSHQLSVFLR 364 Query: 256 ELINQPEHFKQWFGEFISQSRHELDIAPP----EPPYQPDEIYDALKQGEVLV------- 304 L + + Q + PP QP AL+ L Sbjct: 365 RLADVVDQQGQSLRSSSMRRDFISHRLPPFLQDPQLLQPVGGAPALQDTVSLRFKDHLLL 424 Query: 305 ---------------RLGGLRVLRIGDDVY----------------ANGEKIDSPHRPAL 333 + L LR D + G + H AL Sbjct: 425 TVEPSPDHTDEATELLVYVLHSLRNRRDTHMMMGASDEDEDDEESQVGGLRFPLSHLEAL 484 Query: 334 DALASNIALTAENFGDALEDPSFLAMLAALVNSGYW 369 L + + E+ L+ L +L AL + G Sbjct: 485 QQLLVSDRVPVEDLQ--LQQEDKLNLLLALWSEGLL 518 >UniRef50_A0QI05 Cupin superfamily protein n=4 Tax=Mycobacterium avium complex (MAC) RepID=A0QI05_MYCA1 Length = 361 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. 
Identities = 54/279 (19%), Positives = 94/279 (33%), Gaps = 17/279 (6%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFID--PISPDELAGLAMESEVD---SRLVSHQDG- 59 L FL W R + R + D P + GL + D RLV + Sbjct: 13 PLGVDAFLNEIWATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDR 72 Query: 60 ---KWQVSHGPFESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 ++ G + ++L++ + + A+L ++ Sbjct: 73 DPAGYRRGDGTLNAGGARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYV 132 Query: 115 FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE-KLQMKQHCPHPDLLQVDPFEAIIDEEL 173 G PH D +DV ++Q G + WRV + Q + D + D L Sbjct: 133 TPPHSTGFVPHYDPHDVLVLQIEGCKTWRVSDEPPVPPQQIQSRKGVGADGPASRTDVCL 192 Query: 174 EPGDILYIPPGFPHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD 232 PGD+LY+P G H E +++ +VG AP L++ + R+ D Sbjct: 193 RPGDVLYLPRGQVHSARTHSEPSVHLTVGLHAPTVLTLVTSALHALSLRDP---RVHDRL 249 Query: 233 VPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEF 271 P A V + +R+ + ++ G Sbjct: 250 PPRHLDDAQVRAGLGEAVRDAV-RALDDDAVIADGLGAM 287 >UniRef50_UPI0001B4BFC9 putative cupin superfamily protein n=1 Tax=Streptomyces sp. C RepID=UPI0001B4BFC9 Length = 394 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. 
Identities = 63/374 (16%), Positives = 110/374 (29%), Gaps = 47/374 (12%) Query: 24 LKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGP-------------FES 70 L+R +F D E+ L + + QV Sbjct: 36 LRRAAGDFSDLFGLAEVDVLLTDRALRRPAFRVIRDGAQVPDASCLHGGLLYPDVADPGK 95 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 L +L+ Q + P A R ++ G G H D D Sbjct: 96 ISGLLAEGATLVFQGLQELTGPLAEFGRRLGHDLGRPVNVNAYVTPAGSQGFGDHYDTQD 155 Query: 131 VFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGY 190 FI+Q G +RW + + + + + LEPGD L++P G+ H Sbjct: 156 SFIVQIHGSKRWTLKDPALAQPLSHETGRPLPEDDGSGRTLTLEPGDCLWLPRGWVHSAR 215 Query: 191 ALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDK 249 + + +++ ++ + A +D ++M Sbjct: 216 STDTASVHLTISLYEWTGHWAWTRIAARAPGLPGRFPLSTD-----FFRDRAAAEKDMAA 270 Query: 250 LREMMLELINQPE-----HFKQWFG--EFISQSRHEL-DIAPPEPPYQPDE------IYD 295 LR + E + + + G EF S RH ++ PE + + + Sbjct: 271 LRAELTEWLATADDSALVDLVRAAGAPEFPSPVRHPAREVLSPEADEDAEYTVNAHAVLN 330 Query: 296 ALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPS 355 A +G+ LV G R G + + P L L + + P Sbjct: 331 AETRGDRLVLTLGAR-----------GLTLPAAMTPLLADLLTLDRFRPCDLPP---TPG 376 Query: 356 FLAMLAALVNSGYW 369 A+L L G Sbjct: 377 TTALLTRLSAEGVI 390 >UniRef50_A1SPZ0 Cupin 4 family protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SPZ0_NOCSJ Length = 403 Score = 158 bits (399), Expect = 4e-37, Method: Composition-based stats. 
Identities = 66/401 (16%), Positives = 120/401 (29%), Gaps = 61/401 (15%) Query: 4 QLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPD------------ELAGLAMESEVDS 51 L+ + FL + W R + + G + P S D + L S V + Sbjct: 26 LLSGDAQTFLAKVWASRVHLHRSGPADPDSPGSADGPDSLVGLFALADADHLLTSSAVRT 85 Query: 52 RLVSHQDGKWQVSHGPFESYDHL-----------------GETNWSLLVQAVNHWHEPTA 94 + + + L + +++ Q ++ + P Sbjct: 86 PSIRLAKDGAVLPESAYTRRASLAGKPLTGLVDARKALALFDDGATVVFQGLHRYWPPLT 145 Query: 95 ALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC 154 L+ G H D +DVF+ Q G +RW V + Sbjct: 146 RLIARLELELGHPCQANAYLTPPGAQGFAVHSDSHDVFVFQTAGSKRWEVHGPDGPE--- 202 Query: 155 PHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISG 213 + LEPG +Y+P G PH A + +++ ++G R L+ Sbjct: 203 ---------------EVLLEPGVSMYLPTGTPHAARAQDTVSLHVTLGINQLTWRGLVER 247 Query: 214 ----FADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFG 269 V L Y DP A P + + + +++ Sbjct: 248 TVAGALGEVADEHLPAGYLDDPA--ALAGPLADRLERLADAVRRLDATAAVEAEVRRFLT 305 Query: 270 EFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRV-LRIGDDVYANGEKIDSP 328 + L + ++ G RV + +GD + + Sbjct: 306 SRPPRLDGGLHDVLAHGTITDTTLLRRRPGHPCVLLDRGERVEVLLGDR----SLTVPAW 361 Query: 329 HRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYW 369 RPAL+A+ + LT + L++ S L + LV G Sbjct: 362 IRPALEAVRARGELTPADLP--LDEQSRLVLCRRLVREGLL 400 >UniRef50_C1YJ55 Cupin superfamily protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YJ55_NOCDA Length = 400 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. 
Identities = 62/351 (17%), Positives = 114/351 (32%), Gaps = 38/351 (10%) Query: 27 GFNNFIDPISPDELAGLAMESEVDS-RLVSHQDGKWQVSHGPFES--------------- 70 G + ++ D+L L + RL H+DG + E+ Sbjct: 35 GADAVRSLLTFDDLNDLLATCSPEPPRLRLHRDGSPVPADRYTEAATSSRSARRVVRPEA 94 Query: 71 YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYD 130 SL++ +V+ H P A L R + G H D +D Sbjct: 95 LYRELREGASLVLDSVDRMHPPVGAAADDLMRLVRERAQANLYLIWGGSRGFDTHWDDHD 154 Query: 131 VFIIQGTGRRRWRVGEKLQMK-QHCPHPDLLQVDPFEA------IIDEELEPGDILYIPP 183 I+Q G + W+V D P +A + + L PG ++++P Sbjct: 155 TVIVQVEGTKHWQVHGPGSRPYPMKNDVDHAHTPPRDADGELHLVWEGVLRPGQVIHVPR 214 Query: 184 GFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADV 242 G+ H +M+ + GF E +AD +++R + D+P A P Sbjct: 215 GWWHTVTGTGGVSMHLTFGFTRATGVE----WADALVRRLFEEEVF-RRDLPRFADPDVR 269 Query: 243 LPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEV 302 + M + + + E ++ + P P + A + Sbjct: 270 RKHQRALAARMAE--LAEELDLDGFLAERDARFPRRTSFSLPWPVEEGTPPAHARVEFVP 327 Query: 303 LVRLGGLRVLRIGDDVYAN--GE--KIDSPHRPALDALASNIALTAENFGD 349 ++ + G+ V G + + P L+ALA + LT + Sbjct: 328 ILPPP---LSHDGERVAVTVGGRRYRFPAVVGPVLEALAEHRELTVAELAE 375 >UniRef50_Q016L9 [S] KOG3706 Uncharacterized conserved protein n=1 Tax=Ostreococcus tauri RepID=Q016L9_OSTTA Length = 455 Score = 155 bits (392), Expect = 3e-36, Method: Composition-based stats. 
Identities = 53/314 (16%), Positives = 99/314 (31%), Gaps = 34/314 (10%) Query: 40 LAGLAMESEVDSRLVSHQDGKWQVSHGPFESYD-----------------HLGETNWSLL 82 LA +D+ + S+ DG + + +++D E S+ Sbjct: 53 LASRDARYGIDADVTSYVDGVRRTHNSNDDTHDVCDEASNEIVDAKAVMRAYRERGRSIR 112 Query: 83 VQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRW 142 + H+ T ++ + + G PH D D +++Q G +RW Sbjct: 113 LLHPQTRHDATWKMLATLESHFECACGCNVYVTPANAQGFAPHYDDIDAYVLQIEGEKRW 172 Query: 143 RVGEK--LQMKQHCPHPDLLQVDP--FEAIIDEELEPGDILYIPPGFPHEGYALENAMNY 198 RV + Q + E + D LE GD LYIP GF H+ A + Sbjct: 173 RVYAPFQSDELPRTSSKNYTQEEIAGLEVLFDGVLEAGDFLYIPRGFVHQAECSSRAHSV 232 Query: 199 SVGF---RAPNTRELISGFADYVLQRELGGNYYSDPD--VPPRAHPADVLPQEMDKLREM 253 +A + + + + + + + P + + Q+M + Sbjct: 233 HATISTNQANTHADAFEIATQTIARSLIDESKWLRRNIYRPHQGPRRCLDMQQMGQAVLA 292 Query: 254 MLELINQPEHFKQW--FGEFISQSRHELDIAPPEP---PYQPDEIYDALKQGE---VLVR 305 + P F + R + E D ++ + GE V ++ Sbjct: 293 LGSRDLDPYLFHAYESLRARFQAQRLPVPETHLERSRTSCTGDRAAESFRGGELTKVALQ 352 Query: 306 LGGLRVLRIGDDVY 319 G VL G + Sbjct: 353 RRGCLVLVDGAAYH 366 >UniRef50_B0KHI4 Cupin 4 family protein n=1 Tax=Pseudomonas putida GB-1 RepID=B0KHI4_PSEPG Length = 303 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. 
Identities = 48/316 (15%), Positives = 93/316 (29%), Gaps = 39/316 (12%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGK 60 M++ L F+ +QK+P V + + + E+ L + +D + + Sbjct: 4 MDFALP-EMDFFIRSAFQKKPFVFESVTGH--QLVGWGEINNLLEKDILDYPRIRLANDG 60 Query: 61 WQVSHGP-----------------FESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFR 101 G Y+ L +T +L++ + E Sbjct: 61 IPSERGFKGFVTYTLTVTGETSPHINRYNLLKRLQTGSTLIIDRCQAFFERAQQAASYLS 120 Query: 102 ELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ 161 R + G H D +DV +Q G +RW V + Q Sbjct: 121 THLRCRSGANLYCAWSSTPSFGAHFDNHDVIAVQIEGVKRWEVYAPTRPYPLLNDKSFDQ 180 Query: 162 VDP-FEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADYVL 219 P E ++ L PG +Y+P G+ H + +M+ S P +LI + + Sbjct: 181 TPPAGEPMLCHTLTPGQAIYVPAGYWHNVFTETERSMHISFPVVRPRKIDLIRMVLERLE 240 Query: 220 QRELGGNYYSDPDVPPRAHPADVLPQEM--DKLREMMLELINQPEHFKQWFGEFISQSRH 277 P + +L ++ + + +W + Sbjct: 241 ASAELRE------------PIAHGSDAIGNARLSGVLYACLANID-IDEWEAAVVEDCMQ 287 Query: 278 ELDIAPPEPPYQPDEI 293 DI P + + Sbjct: 288 GRDIKFNLPDIRTRDA 303 >UniRef50_A4RZ92 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RZ92_OSTLU Length = 515 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. 
Identities = 50/343 (14%), Positives = 100/343 (29%), Gaps = 40/343 (11%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAM----ESEVDSRLVSHQDGKW 61 L+ F ++ V + G ++ ++ + + VD + S+ DG Sbjct: 70 PLDAATFATCVDRRAVCVKRGGRAHYGSWMTMEAVTRATRDGEARYGVDLDVTSYVDGVR 129 Query: 62 QVSHGPFESYD---------------------HLGETNWSLLVQAVNHWHEPTAALMRPF 100 + + ++ GE S+ + + T ++ Sbjct: 130 RTHNRNSDAGREADDEDDDVGELVDADAVLTRRFGEERRSVRLLHPQTRCDATWKILATL 189 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL 160 + + G PH D D F++Q G +RWRV E + + P Sbjct: 190 ERYFECACGCNVYVTPASSQGFAPHYDDIDAFVLQIEGAKRWRVYEPFE-DETHPRTSSR 248 Query: 161 QVDPFE-----AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFA 215 E + D+ LE GD LY+P G+ H+ + + N + Sbjct: 249 NFTQEEIATQRVVFDDVLEAGDFLYLPRGWIHQAECSSSTHSVHATLST-NQSNAPADAL 307 Query: 216 DYVLQRELGGNYYSDPDVPPRA----HPADVLPQEMDKLREMMLELINQPE-HFKQWFGE 270 + L L ++ + ++ L E + ++ + F E Sbjct: 308 EIALNNALASTIDGRAELRRSFVSTLNDERRRDAALEGLGEELRAFASELQGDFGAALVE 367 Query: 271 F---ISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLR 310 ++R + PP + G R Sbjct: 368 HAYASLEARFMMQRCPPPDSHLLRSKARCTGDVAAAELRSGAR 410 >UniRef50_C6XMP3 Cupin 4 family protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XMP3_HIRBI Length = 435 Score = 153 bits (388), Expect = 8e-36, Method: Composition-based stats. 
Identities = 67/407 (16%), Positives = 137/407 (33%), Gaps = 52/407 (12%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELA-GLAMESEV-DSRL---VSH 56 + + +F ++ K+ + N F S ++L+ LA E+ D R S Sbjct: 31 QLLSPMTGQEFANSYFAKKSFNVGGTSNKFEHIFSWEKLSHALARGEEIQDPRFNLMASF 90 Query: 57 QDGKWQVSHGP-----FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR-IDD 110 G+ S P + L ++ + ++ A + R ++ Sbjct: 91 AGGEKDGSRKPMFQVYIKQVGELLNAGATICITNIHMADPALARWAQAIRSQLNFTGTVG 150 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAI-- 168 + S G G+ H D+ +Q G++RW + + + + + E Sbjct: 151 VNCYISPDGAGLPMHYDKRIATTLQIAGKKRW-IYSTTPAQAWPDNNAVFKDGRVEPANI 209 Query: 169 -----------IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISG-FAD 216 + EL PGD+L +P G H + ++ ++ F N + ++ F + Sbjct: 210 DTGTPPDGLEFEEVELNPGDLLCLPAGAWHAAKGIGFSLALNLYFAPRNFSDQLAPLFLE 269 Query: 217 YVLQRELGGNYYSDPDVPPRAHPADVLP-------QEMDKLREMMLELINQPEHFKQWFG 269 ++ E N+ P V + +D+ + I+Q + + Sbjct: 270 HLSHDE---NWRGGPPVTLDNITGETPENIKSYLHDRLDEFHKKAQSFIDQTDTINTAWL 326 Query: 270 EFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYA---NG-EKI 325 E ++Q+ + P+ P +P + + V LR ++ D + NG Sbjct: 327 ESLTQNPYTGWQPDPDMPLRPLSPENRFR-----VVGKSLRFIQTADQLAVPCDNGILNF 381 Query: 326 DSPHRPALDALASN-------IALTAENFGDALEDPSFLAMLAALVN 365 P L +AS+ ++ +A P A L L Sbjct: 382 PKNATPILKKMASHSGSFSVPDVISWNTAPNAPTIPEIGAHLQTLYK 428 >UniRef50_A5GJ70 Putative uncharacterized protein SynWH7803_0559 n=1 Tax=Synechococcus sp. WH 7803 RepID=A5GJ70_SYNPW Length = 370 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. 
Identities = 63/366 (17%), Positives = 119/366 (32%), Gaps = 28/366 (7%) Query: 23 VLKRGFNNFIDPISPDELAGLAMESEVDS--RLVSHQDGKWQVSHGPFESYDHLGE---- 76 VLK S DE + + RLV++Q + + + Sbjct: 7 VLKGAMG-----FSLDEFEAILGMPGLRPYLRLVANQVEDYDPKIVANDGHLCKPYVLSR 61 Query: 77 --TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFII 134 S++V V+ + L E + + PH D +D+F + Sbjct: 62 FHEGASIVVNEVHRFSSQLMDLASSLSEELGVQCVVNAYLTPPQSVALSPHFDSHDIFAL 121 Query: 135 QGTGRRRWRVGEKLQMKQHCP--HPDLLQVDPFEAII-DEELEPGDILYIPPGFPHEGYA 191 Q G+++W V +L P L + ++ GD++Y+P G H Sbjct: 122 QVVGQKQWFVDSELSSLTTKSTFQPILSADQASSVDFREVVMDEGDVMYLPRGCVHHART 181 Query: 192 LE-NAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDV-PPRAHPADVLPQEMDK 249 + +M+ +VG E I+ + + + R HP+ + +D+ Sbjct: 182 ISCQSMHLTVGLYPLEWSEFIASAVEIAASAPEARGLRTSVPLGLKRQHPSFYRQELLDR 241 Query: 250 LREMMLELINQPEHFKQ----WFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 L + + + + + G+ S + P D+ ++ + Sbjct: 242 LSGLFTDDVIEKALRSREKEFSAGQPSSFVGGLDADSWPAGSIASDDAFERCAEAVYFYP 301 Query: 306 L-GGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALE-DPSFLAMLAAL 363 GLRV+ G K +P R L L+ T E + +A + AL Sbjct: 302 AVSGLRVVSSGYGFTL---KHPNPDR-ILKVLSHKKFFTLGELTGGDECSLNDIACIQAL 357 Query: 364 VNSGYW 369 + G Sbjct: 358 LRRGIL 363 >UniRef50_C9Z2L7 Putative uncharacterized protein n=2 Tax=Streptomyces scabiei 87.22 RepID=C9Z2L7_STRSW Length = 407 Score = 150 bits (379), Expect = 8e-35, Method: Composition-based stats. 
Identities = 57/381 (14%), Positives = 114/381 (29%), Gaps = 51/381 (13%) Query: 23 VLKRGFNN---FIDPISPDELAGLAMESEVD-------------SRLVSHQD----GKWQ 62 ++ ++ ++P+ L +AM ++ + + D Sbjct: 34 FVRGSMDDPTLVSRIMTPNRLLDIAMRRSLNRPQFRCFQKGEEVHPAIYYTDSVSPRGQS 93 Query: 63 VSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 + S L + ++++ N + R + R+ + + G Sbjct: 94 IPMVNMRSLGRLLQDGATVIMDQANVFDPTMEVACRALQWWSRERVQVNVYLTTNDAAGF 153 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 H D +DV I+Q G ++W V + D E I + GD+++IP Sbjct: 154 PLHWDDHDVVIVQLAGEKKWEVRTASRNVPMYRDSDPNNTASDEIIWSGVMRAGDVMHIP 213 Query: 183 PGFPHEGY----ALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 G H+ N+++ + G ++ D+ + E+ D D A Sbjct: 214 RGHWHQATRTGSGSGNSLHVTFGITKRTGASWLAWLGDWCREHEIF---RQDLDRWHGAG 270 Query: 239 PADVLPQEMDKLREM----MLELINQPEHFKQ---WFGEFISQSRHELDIAPPEPPYQPD 291 + + E L Q + + P + Sbjct: 271 SEALTTAAARLVAERSPVDFLAAYEQETTLSRHVPFLDVLGPLDAVVCTTHFPPRIQEGG 330 Query: 292 EIYDALKQGEVLVRL----GGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENF 347 E D + G+ L LR+L G V AL+ A+ + Sbjct: 331 EAVDVVASGKKLTLAVKALPALRLLLSGRPV-------------ALERAAAVVGAEVMEV 377 Query: 348 GDALEDPSFLAMLAALVNSGY 368 + L +L ++SGY Sbjct: 378 AEILVKEELCTVLTPELSSGY 398 >UniRef50_Q1DFZ7 Cupin family protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DFZ7_MYXXD Length = 295 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. 
Identities = 52/278 (18%), Positives = 94/278 (33%), Gaps = 12/278 (4%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKW 61 E FL+ H+ +RP + + + L E+ D L Sbjct: 5 ELLNGFPRERFLQEHYLRRPFTGASAAERLQRLGTWETIDFLVEETACDVLLARQGVPYP 64 Query: 62 QVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGG- 120 ++ L ++L ++ + H A L R F RI + + P G Sbjct: 65 GDRPTTAKAARELFAQGYTLALRQPDLHHPDLAQLARAFSAELHGRI--NLHIYCTPAGH 122 Query: 121 -GVGPHLDQYDVFIIQGTGRRRWRVG----EKLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 G G H D +VFI+Q GR+ + + + + + P L + L Sbjct: 123 HGFGWHCDPEEVFILQTAGRKDYLLRENTLHPVPLPESVPSGSLA-AQEKTPVETHSLSA 181 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD +YIP G H A E A++ S+G P +L+ G + + Sbjct: 182 GDFIYIPGGHWHMAQATEEALSISIGLMPPTLLDLLDGVRAALASSPVWRRRMPSLGRAS 241 Query: 236 RAHPA---DVLPQEMDKLREMMLELINQPEHFKQWFGE 270 +L + +L + + P + ++ + Sbjct: 242 SLDDPSKLALLRTLLSELGGEVQRQLADPGYPLRFLAQ 279 >UniRef50_UPI0000D57503 PREDICTED: similar to JmjC domain-containing protein 5 (Jumonji domain-containing protein 5) n=1 Tax=Tribolium castaneum RepID=UPI0000D57503 Length = 394 Score = 146 bits (368), Expect = 2e-33, Method: Composition-based stats. 
Identities = 42/225 (18%), Positives = 84/225 (37%), Gaps = 31/225 (13%) Query: 8 NWPDFLERHW-QKRPVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS-HQDGKWQ 62 + F +++ ++PV L+ ++ ++ L A + V + S + D W Sbjct: 170 SLETFNNKYFVSQKPVKLQDCVTHWPALSKWPDITYLLKTAGDRTVPVEIGSHYADENWG 229 Query: 63 VSHGPFESY-DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRI---DDL---MISF 115 + + + + L A ++ + L +P++ DD + ++ Sbjct: 230 QKLMTLKEFITNYFYKSEDLGYLAQHNLFDQIPELRNDI-YIPEYCCLGQDDNEPEINAW 288 Query: 116 SVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------PDL 159 P + P H D + F++Q G ++ + PH PDL Sbjct: 289 FGPAKTISPLHHDPKNNFLVQVFGTKQLILYSPDDTFCLYPHESTLLSNTAQVDPFNPDL 348 Query: 160 LQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + F + LE G++LYIPP + H ALE + + S + Sbjct: 349 DKYPNFRNAKAVKCILEAGEMLYIPPKWWHHVTALEKSFSVSFWW 393 >UniRef50_Q091R4 Chromosome 14 open reading frame 169, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q091R4_STIAU Length = 355 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 49/297 (16%), Positives = 92/297 (30%), Gaps = 14/297 (4%) Query: 68 FESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLD 127 ++++ + EP R + + GV PH D Sbjct: 37 INLVYENYLKGSTVILSGLEETWEPLVVFCRKLEGQLSHPVAVAVYLTPPNHHGVQPHFD 96 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHC--PHPDLLQVDPFEAIIDEELEPGDILYIPPGF 185 + FI+Q G + W+V Q + + + E +++ EL PGD+LY+P GF Sbjct: 97 TQENFILQVDGVKHWKVYGAGQELPRVEGSYTPVARERLPELLLETELHPGDMLYVPRGF 156 Query: 186 PHEGYA-LENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLP 244 HE A +++ +V R+ + + R P + +H L Sbjct: 157 VHEAEARDSASLHITVDVHVRTWRDFLEDALAAMADRNPRFRKSLPPGLLNGSHAKAQLE 216 Query: 245 QEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLV 304 + +L EM+ G+ + + P + + + Sbjct: 217 EGFRELMEMVHR----EVRLSDALGKH--AEKLIVARPPLPDGHFALLHAEIGLDTPLRK 270 Query: 305 RLGGLRVLRIGDDVY---ANGEKIDSPH--RPALDALASNIALTAENFGDALEDPSF 356 R L + V +G +I P AL + + L + Sbjct: 271 RTAMLTRRFQEEAVAGIQFSGNQILGPVKIAEALRHIDETEIVVPSQLPGGLSNNEK 327 >UniRef50_D1VL61 Cupin 4 family 
protein n=1 Tax=Frankia sp. EuI1c RepID=D1VL61_9ACTO Length = 313 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 48/299 (16%), Positives = 99/299 (33%), Gaps = 18/299 (6%) Query: 73 HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVF 132 L + +L++ + + R R + + G H D +DV Sbjct: 3 ALLDAGVTLVLDGLETFDPIVEVATRALRWWSGELVQTNAYLTTRSADGFPLHWDDHDVL 62 Query: 133 IIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYAL 192 I+Q G + W V + +V EA+ L G++L+IP G+ H+ + Sbjct: 63 IVQLAGEKNWDVRGSTRSAPMFRDAVPNEVASSEAVWQGVLRAGEVLHIPRGYWHQATRV 122 Query: 193 EN----AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMD 248 E+ +++ + GF + ++ AD ++EL +D P A+ Sbjct: 123 EHDDPVSLHLTFGFTRRTGVDWLTWIADQAREQELF---RTDLTRSPAEREAE-----RA 174 Query: 249 KLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGG 308 +L++ +EL+ + ++P+ + + + R G Sbjct: 175 RLQDAAIELVRSLPP-AAFLTARERTRPPARHAPTLPSAHEPEVVVCVTEFAPHVERSEG 233 Query: 309 LRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSG 367 V+ G + A++ L S + +A + V G Sbjct: 234 QLVVYAGGRKI----TVRDRAAAAIELLLSGCPVDLAA-AEARVGFDVRPLARVFVREG 287 >UniRef50_Q8RWR1 AT3g20810/MOE17_10 n=9 Tax=Viridiplantae RepID=Q8RWR1_ARATH Length = 429 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. 
Identities = 49/240 (20%), Positives = 82/240 (34%), Gaps = 33/240 (13%) Query: 1 MEYQLTLNWPDFLERHWQK-RPVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS- 55 +E + L+ FL ++ PVV+ ++ + L L A V + Sbjct: 188 VEKRSGLSLEGFLRDYYLPGTPVVITNSMAHWPARTKWNHLDYLNAVAGNRTVPVEVGKN 247 Query: 56 HQDGKWQVSHGPFESYDHLGETNWSL----LVQAVNHWHEPTAALMRPFRELPDWRIDD- 110 + W+ F + TN S A + + L + Sbjct: 248 YLCSDWKQELVTFSKFLERMRTNKSSPMEPTYLAQHPLFDQINELRDDICIPDYCFVGGG 307 Query: 111 ---LMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--------- 157 + ++ P G V P H D + + Q G++ R+ + P+ Sbjct: 308 ELQSLNAWFGPAGTVTPLHHDPHHNILAQVVGKKYIRLYPSFLQDELYPYSETMLCNSSQ 367 Query: 158 -DLLQVDPFE-------AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRE 209 DL +D E +D LE G++LYIPP + H +L M+ SV F N E Sbjct: 368 VDLDNIDETEFPKAMELEFMDCILEEGEMLYIPPKWWHYVRSL--TMSLSVSFWWSNEAE 425 >UniRef50_A9SQV0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9SQV0_PHYPA Length = 420 Score = 142 bits (359), Expect = 2e-32, Method: Composition-based stats. Identities = 43/230 (18%), Positives = 78/230 (33%), Gaps = 34/230 (14%) Query: 9 WPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS----HQDGK 60 DFL ++ P+VL +++ + +++ L A V + Sbjct: 191 LEDFLRDYFLPGIPLVLTDSIDHWPAMRNWNDITYLQKVAGHRTVPVEARQVGEHYLAAD 250 Query: 61 WQVSHGPFESYDHLGETNWSL----LVQAVNHWHEPTAALMRPFRELPDWRIDD----LM 112 W+ + T+ + L A + E L I + Sbjct: 251 WKQELMTISEFLERSLTHSAQSTNRLYLAQHPLFEQVPELQADISIPDYCSIGGGDLQSI 310 Query: 113 ISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP-------------- 157 ++ P G + P H D + + Q GR+ R+ + P+P Sbjct: 311 NAWLGPAGTITPLHHDPHHNLLAQVVGRKYVRLYSPESSQNIYPYPEPMLCNSSQVDVTN 370 Query: 158 -DLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA 204 DL++ FE + D LE G +LYIPP + H +L + + S + Sbjct: 371 VDLVKFPNFEHLKFTDCILEEGQMLYIPPKWWHYVESLTPSFSVSFWWAT 420 >UniRef50_B7PVI8 Acetyltransferase, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PVI8_IXOSC Length = 406 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. 
Identities = 40/229 (17%), Positives = 75/229 (32%), Gaps = 34/229 (14%) Query: 8 NWPDFLERHWQKR-PVVLKRGFNNFID----PISPDELAGLAMESEVDSRL-VSHQDGKW 61 + F + + K PV++ +G + + P S L V L + D W Sbjct: 177 SLEHFAKEYLNKEEPVIITKGMDYWPALSTRPWSIRYLLEKVGGRTVPVELGSKYTDEAW 236 Query: 62 QVSHGPFESYDHLG-----ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL----- 111 ++ + + A + + L + + Sbjct: 237 SQKLMTVSAFVDTYILKEQSRDTQIGYLAQHQIFDQIPELRDDICIPTYCCLGEKDEEPD 296 Query: 112 MISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH-------------- 156 M + P G V P H D + + Q G + R+ +K + PH Sbjct: 297 MNLWFGPEGTVSPLHHDPKNNLLAQVFGHKYVRLYKKQETPFLYPHEDRLLENTSQVNVE 356 Query: 157 -PDLLQVDPFEAII--DEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 PD + F + L+PG++L+IPP H +L +++ S + Sbjct: 357 NPDFEKFPSFANARYSECILKPGEMLFIPPKCWHFVRSLSPSLSISFWW 405 >UniRef50_P46327 Uncharacterized protein yxbC n=1 Tax=Bacillus subtilis RepID=YXBC_BACSU Length = 330 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 42/287 (14%), Positives = 95/287 (33%), Gaps = 31/287 (10%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEV-DSRLVSHQDGKWQVS 64 + +FLE +W +P+V + F +++ L + ++ ++ D + S Sbjct: 15 PVTMSEFLEEYWPVKPLVARGEVERFTSIPGFEKVRTLENVLAIYNNPVMVVGDAVIEES 74 Query: 65 HGPFESYDHLG-------ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL---MIS 114 G + + E +L + + + + ++ Sbjct: 75 EGITDRFLVSPAEALEWYEKGAALEFDFTDLFIPQVRRWIEKLKAELRLPAGTSSKAIVY 134 Query: 115 FSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE-------- 166 + GGG H D Y I Q G + W++ + + H DL + + Sbjct: 135 AAKNGGGFKAHFDAYTNLIFQIQGEKTWKLAKNENVSNPMQHYDLSEAPYYPDDLQSYWK 194 Query: 167 --------AIIDEE-LEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADY 217 + L PG +LY+P G H + + + ++ F P +L+ Sbjct: 195 GDPPKEDLPDAEIVNLTPGTMLYLPRGLWHSTKSDQATLALNITFGQPAWLDLM---LAA 251 Query: 218 VLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHF 264 + ++ + N + + V ++ + L ++ L E Sbjct: 252 LRKKLISDNRFRELAVNHQSLHESSKSELNGYLESLIQTLSENAETL 298 >UniRef50_Q6MH74 Putative uncharacterized protein yxbC n=1 Tax=Bdellovibrio 
bacteriovorus RepID=Q6MH74_BDEBA Length = 308 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 96/297 (32%), Gaps = 34/297 (11%) Query: 2 EYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGL-----AMESEVDSRLVSH 56 E + P+F HW P+ + D + +++ L A + +V + L Sbjct: 8 ELLAPVTLPEFFNSHWPVEPLFIPATPGKLQDIFALEQMQDLKNLISARQRKVRACLPDF 67 Query: 57 QDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRID------- 109 D + P ++ N +L+ ++ A ++ R Sbjct: 68 DDEYSSIHLEPGDALKA-YRNNMTLVFDSMQSQDSTIADMLGNVRADLGLVTGGAENDLC 126 Query: 110 -DLMISFSVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVD---- 163 I+++ P G G H D FIIQ G + WR+ ++ + Sbjct: 127 KARSIAYATPAGCGTRLHFDANANFIIQIKGTKTWRLAPNESVEFPTERFTTGSEEMPAA 186 Query: 164 ------------PFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELI 211 E + ++PG +L++P G+ HE E +++ + F P ++ Sbjct: 187 LEKQCHAHLIDALDEDSMKVVMKPGCVLFVPRGYWHETTTEEESLSLNFTFSQPTWADVF 246 Query: 212 SGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWF 268 + VL R +D + + + + ++ L + Sbjct: 247 TKSLQEVLLRSPEWRELAD-GLEGTDQ--ERKEAAIARFEFLLKSLATELPEISGRL 300 >UniRef50_UPI00015B5EA6 PREDICTED: similar to Jumonji domain containing 5 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5EA6 Length = 402 Score = 141 bits (355), Expect = 5e-32, Method: Composition-based stats. 
Identities = 37/229 (16%), Positives = 73/229 (31%), Gaps = 34/229 (14%) Query: 8 NWPDFLERHWQKR-PVVLKRGFNNFIDPISP---DELAGLAMESEVDSRL-VSHQDGKWQ 62 + F + ++ + P +L+ ++ + L + V + + + W Sbjct: 173 SLETFYCKIFKPKIPALLEGCLEHWQALHLWKDAEYLRRIVGNRTVPIEIGSRYTEDDWT 232 Query: 63 VSHGPFESY--DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL--------- 111 S F + H+ N + A + + L F D Sbjct: 233 QSLVTFSDFLRSHISSKNEKVGYLAQHQLFDQIPELKNDFSVPEYCSFSDTEEDNEELPD 292 Query: 112 MISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP--------------- 155 + ++ P G V P H D + + Q G +R + + P Sbjct: 293 INAWFGPSGTVSPLHHDPKNNLLCQVFGYKRIILYSPDDNENVYPYETRLLSNTARIDPY 352 Query: 156 HPDLLQVDP--FEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 +PD + L+PGD+L+IPP + H L + + S + Sbjct: 353 NPDFEKYPNLQKAKAFMCYLKPGDMLFIPPKWWHHVVGLTPSFSISFWW 401 >UniRef50_D1H9M4 Whole genome shotgun sequence of line PN40024, scaffold_52.assembly12x (Fragment) n=3 Tax=Vitis vinifera RepID=D1H9M4_VITVI Length = 770 Score = 140 bits (354), Expect = 6e-32, Method: Composition-based stats. 
Identities = 49/324 (15%), Positives = 100/324 (30%), Gaps = 74/324 (22%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN-------FID--------------------------P 34 ++ +F+ HW+ P++++ F P Sbjct: 301 SFENFILNHWEVSPLLVRSLSKGLNEQDDVFSSFIQYLNLKKTVSSFVLPLLQGLVSCLP 360 Query: 35 ISPDELAGL----AMESEVDSRLVSHQDGKWQVSHGPFES-------------YDHLGET 77 I DEL L + +E+ ++ QD + + G + + Sbjct: 361 IDSDELNILNFLKTVRNELGCLIIYGQDIRVLRTMGHLKEEAPHFLYIDDILKCEDAYNK 420 Query: 78 NWSLLVQAVNHWHEPTAALMRPFRELPDWR-IDDLMISFSVPGGGVGPHLDQYDVFIIQG 136 +++ ++ + E AA+ L + + G+ H D + VF+ Q Sbjct: 421 GYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCVFVCQL 480 Query: 137 TGRRRWRVGE------KLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGY 190 G ++W + + + L GDILYIP GFPHE Sbjct: 481 FGTKQWTIVSQPIVSLPRLYEPLDSLHSSKIGNSMAGRTQFLLREGDILYIPRGFPHEAC 540 Query: 191 ALENA------------MNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 + + + ++ P E + A + + +Y+ D Sbjct: 541 TVAESGGPDETTGFSLHLTLAIEVEPPFEWEGFAHVALHCWNQSSKSIHYTSVDPL---- 596 Query: 239 PADVLPQEMDKLREMMLELINQPE 262 +++L L + + LI + Sbjct: 597 -SEILSVMSVNLLHIAIRLIGDSD 619 >UniRef50_B8C536 Putative uncharacterized protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C536_THAPS Length = 204 Score = 140 bits (354), Expect = 6e-32, Method: Composition-based stats. 
Identities = 64/203 (31%), Positives = 104/203 (51%), Gaps = 24/203 (11%) Query: 52 RLVSH---QDGKWQVSHGPFESYDHLG-----------ETNWSLLVQAVNHWHEPTAALM 97 R++SH D ++++ GP + G E +L+V ++ ++ P A + Sbjct: 1 RVISHSPGDDSSYELTWGPLSDAEFHGWMAKVTSPNNNEQRETLVVNDIDRFYPPLADWI 60 Query: 98 RP-FRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVG-----EKLQMK 151 + LP WR+DD IS + GG+GPH+D YDVF+IQ +G R W+VG K +M Sbjct: 61 HDTYHFLPRWRMDDGQISLAEQSGGIGPHVDNYDVFLIQMSGTRAWQVGRKELSTKEEMD 120 Query: 152 QHCPHPDLLQVDPFEAII-DEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRE 209 + D+ ++ + + + + L+PGD+LY+PP H G AL + M SVG RAP+ + Sbjct: 121 RMIEGLDVRVLENWASEMEEWVLQPGDMLYLPPRVAHCGTALSDGCMTLSVGCRAPSVSD 180 Query: 210 LISGFADYVLQRELG--GNYYSD 230 L+S A+ Y+D Sbjct: 181 LMSRLAENFSGSIEDYATRRYTD 203 >UniRef50_A9UW44 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UW44_MONBE Length = 710 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 56/300 (18%), Positives = 110/300 (36%), Gaps = 45/300 (15%) Query: 4 QLTLNWPDFLERHWQKRPVVLK--RGFNNFIDP---ISPDELAGLAMESEVDSRLVSHQ- 57 L +F ER++++ PV ++ + +F++ +S + + D R V Sbjct: 338 LLRFIRNEFRERYFEQFPVYIQAQGAYLDFLNYSAALSGQTFNYAGDKEKSDPRNVKFIK 397 Query: 58 ---DGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS 114 D K + + ++L +VN+W A+L E + Sbjct: 398 RTFDQKQESGRKTEKDLARALREGFTLQFYSVNYWDPNIASLALELSEH-GILLPVNANL 456 Query: 115 FSVPGG---GVGPHLDQYDVFIIQGTGRRRWRVGEKLQ-----MKQHCPHPD----LLQV 162 + PGG + PH D ++Q G +RWR+ + + D + Sbjct: 457 YITPGGTSVSLVPHTDYQCSLMVQLAGVKRWRLWKMPEIMLPVSANMIRGRDTDDLVASE 516 Query: 163 DPFEAIIDEELEPGDILYIPPGFPHEGYALE---NAMNYSVGFRA--------------- 204 + E +D L+PGDILY+P G H E +M+ +VG A Sbjct: 517 ELGEPYMDVLLQPGDILYVPRGVLHATSTPEGDHPSMHLTVGMEAMWDLGIGQVWHHFLG 576 Query: 205 ----PNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQ 260 + + ++ G + ++ Y +P + P++ + REM ++++ Sbjct: 577 AGAVAHHQHIVEGLYTALRRKTHEDARY-RATMPASFYNRSGDPKQSAEWREMGRQMLHD 635 >UniRef50_B1FKZ3 Cupin 4 family protein n=1 Tax=Burkholderia ambifaria IOP40-10 
RepID=B1FKZ3_9BURK Length = 304 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 38/288 (13%), Positives = 88/288 (30%), Gaps = 28/288 (9%) Query: 1 MEYQL--TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSR------ 52 M L + F++ RP +L D + + L +D Sbjct: 1 MNRTLFPNFDLNFFIKHVLHIRPHLLGAVMQ--KDLPDWNAVNALLESGLLDYPRIRISS 58 Query: 53 -----------LVSHQDGKWQVSHGPF--ESYDHLGETNWSLLVQAVNHWHEPTAALMRP 99 + + + + ++++ + +L Sbjct: 59 ADMEYARGYCGFIRYNSNPRGGRFATIMPDFLYRALDDGCTIIIDGCQDYFPSVLSLTAE 118 Query: 100 FRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDL 159 + + + + G H D +D+ +Q +G++RW + + + + Sbjct: 119 IEHILKCQSWANLYISTQSATSFGCHFDDHDIISVQLSGKKRWHIYKPTYISPNRGDKSF 178 Query: 160 LQVDP-FEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRAPNTRELISGFADY 217 P + E L G LY+P G+ H + +M+ + G P +++ A+ Sbjct: 179 YLDPPTGSPDLLENLPTGSSLYLPSGYWHNVETVSPHSMHITFGLDFPRKLDIVHAIANQ 238 Query: 218 V-LQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHF 264 + L G DPD P + + + ++ + E + Sbjct: 239 LGLNDIFRGAVNFDPDSPDFYIFRENIINAIREIN--LEECMKNAIEI 284 >UniRef50_A8PJJ7 Acetyltransferase, GNAT family protein n=1 Tax=Brugia malayi RepID=A8PJJ7_BRUMA Length = 578 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. 
Identities = 46/229 (20%), Positives = 77/229 (33%), Gaps = 34/229 (14%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPD--ELAGLAMESEVDSRL-VSHQDGKWQVS 64 ++ + L+ K+PVV++ N + + L V + S+ D WQ Sbjct: 350 SFEEMLKIIRNKKPVVIRGLVNQWPAFRKWNFSYFNELIGHRTVPIEIGNSYADSDWQQV 409 Query: 65 HGPFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRID----DLM--I 113 F ++ + L Q + + L+ D + Sbjct: 410 LMTFRTFIQKFIECENSDGPGYLAQ--HRLFDQIPELLDDIIIPDYCSFGEDGLDNVDIN 467 Query: 114 SFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------P 157 + P G V P H D Q GR+ R+ + + P P Sbjct: 468 IWIGPSGTVSPLHFDPKSNMFCQVVGRKFLRIIPATETENVYPRQDGILTNTSQIDVRCP 527 Query: 158 DLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA 204 DL + F + D L GD L+IP GF H +AL+ +++ S F Sbjct: 528 DLTEFPRFREAHVFDCTLYAGDCLFIPAGFWHYVFALDPSISVSCWFTT 576 >UniRef50_Q0J0P8 Os09g0489200 protein n=10 Tax=Poaceae RepID=Q0J0P8_ORYSJ Length = 413 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 78/218 (35%), Gaps = 30/218 (13%) Query: 1 MEYQLTLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS- 55 +E + ++ +F+ ++ + PV++ +++ ++ L A + V + Sbjct: 171 VERRSCISLEEFICDYFLRESPVIISGSIDHWPARTKWKDIQYLKKIAGDRTVPVEVGKN 230 Query: 56 HQDGKWQVSHGPFESY-DHLGETNW--SLLVQAVNHWHEPTAALMRPFRELPDWRIDD-- 110 + +W+ F + + + +L A + E L Sbjct: 231 YVCSEWKQELITFSQFLERMWSAGCPSNLTYLAQHPLFEQIKELHEDIMVPDYCYAGGGE 290 Query: 111 --LMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---------- 157 + ++ P G V P H D + + Q GR+ R+ + PH Sbjct: 291 LQSLNAWFGPHGTVTPLHHDPHHNILAQVLGRKYIRLYPASISEDLYPHTETMLSNTSQV 350 Query: 158 -----DLLQVDPFEAI--IDEELEPGDILYIPPGFPHE 188 DL + E + +D LE GD+LYIPP + H Sbjct: 351 DLDNVDLKEFPRVENLDFLDCILEEGDLLYIPPKWWHY 388 >UniRef50_B2GUS6 LOC100158649 protein n=5 Tax=Xenopus (Silurana) tropicalis RepID=B2GUS6_XENTR Length = 443 Score = 138 bits (349), Expect = 2e-31, Method: Composition-based stats. 
Identities = 43/226 (19%), Positives = 84/226 (37%), Gaps = 32/226 (14%) Query: 8 NWPDFLERHW-QKRPVVLKRGFNNFIDPISP--DELAGLAMESEVDSRL-VSHQDGKWQV 63 + F + + ++PVVL+ +++ + + +A V L + D +W Sbjct: 218 SLEHFRDHYLVPQKPVVLEGVIDHWPCLKKWSVEYIQRVAGCRTVPVELGSRYTDAEWSQ 277 Query: 64 SHGPFESY--DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRI------DDLMI-S 114 + ++ + + A + E L +PD+ D++ I + Sbjct: 278 RLMTVNEFITKYILDKQNGIGYLAQHQLFEQIPELKEDI-CIPDYCCLGEASEDEITINA 336 Query: 115 FSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP---------------HPD 158 + P G V P H D F+ Q GR+ RV + ++ P PD Sbjct: 337 WFGPAGTVSPLHQDPQQNFLAQIVGRKYIRVYSVAETEKLYPFDSSILHNTSQVDVESPD 396 Query: 159 LLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + F + + L PG +L+IP + H AL+ + + S + Sbjct: 397 QNKFPRFSQASYQECILSPGQVLFIPVKWWHYIRALDLSFSVSFWW 442 >UniRef50_D2VHH6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHH6_NAEGR Length = 311 Score = 138 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 43/239 (17%), Positives = 82/239 (34%), Gaps = 42/239 (17%) Query: 6 TLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGLAME---SEVDSRLVSH-QDGK 60 ++ DF ++++ P +LK N+ ++ L + V + + Sbjct: 72 AISLMDFKKKYFNTHTPCLLKNASKNWEAYRKWSDVNYLLEKAAYRAVPVEIGQYYTSED 131 Query: 61 WQVSHGPFESY--DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL-----MI 113 W PF Y +++ E N + A + E +L + +E + +L + Sbjct: 132 WSQKIMPFHQYVKEYVMEGNTQIGYLAQHPLFEQIHSLRKDIQEPIYCMLGELGEMSGVN 191 Query: 114 SFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD-------------- 158 ++ P G + P H D D ++Q G + R+ + Sbjct: 192 AWYGPKGTISPLHTDPCDNILVQLVGHKFVRIYHPDETPHLYKRQSGILQANTSEIDNLH 251 Query: 159 LLQVDPFE---------------AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 LLQ + E D L GD+L+IP + H +L + + S F Sbjct: 252 LLQFEEEERKILNEKFPLISKATHYWDCTLCEGDMLFIPKLYWHYVQSLSISFSISYWF 310 >UniRef50_A8TXW2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TXW2_9PROT Length = 398 Score = 138 bits (349), Expect = 2e-31, Method: Composition-based stats. 
Identities = 64/392 (16%), Positives = 119/392 (30%), Gaps = 43/392 (10%) Query: 6 TLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAME------SEVDSRL---VSH 56 ++ F+ + +R + S D+L L ++ L V Sbjct: 11 AIDRQTFVRDYLDQRVYHEAGSVQDVGRLFSWDKLNDLLQRPKLWDGKSIEMALAGRVLD 70 Query: 57 QDGKWQVSHG----PFESYDH-----LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 + G P D L + + ++ ++ AA+ R F L Sbjct: 71 PREYCRPGLGRSGEPILRPDRQKVMALLQKGATFVLDYLDGIDPDIAAVTRCFERLFGTN 130 Query: 108 IDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGE----KLQMKQHCPHPDLLQVD 163 M G H D DVF IQ TG + W + + + D Sbjct: 131 TSCNMYCSWQQVPGYASHFDTMDVFAIQITGEKTWNIYDGRFREATFTAGIRPSDFTVEQ 190 Query: 164 ----PFEAIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYV 218 + + PGDILY+P G H+ A + +++ S G ++ A Sbjct: 191 HNRMRGKVAQRITMRPGDILYLPRGVYHDALATDSASLHLSFGVSPQVGFTVVGMLASEA 250 Query: 219 LQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 + E P + L + + + + +++ P G ++ + Sbjct: 251 PKHEFLRKR------LPHFEDREELAGYLAAVGDHLKTMLSDPGFVDYLSGYLRERTFEK 304 Query: 279 LDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSP--HRPALDAL 336 + Y + + EVL G L + G ++ R +D + Sbjct: 305 VTDYRLPDRAADQYFYVSRHRPEVLEADGQLTLRTST------GSEVPLQRGDREMVDWV 358 Query: 337 ASNIALTAENFGDALED--PSFLAMLAALVNS 366 + A + P+ +L ALV S Sbjct: 359 LAREVFWLSELNTAHGNRGPAGERVLNALVES 390 >UniRef50_UPI0000ECAC04 JmjC domain-containing protein 5 (Jumonji domain-containing protein 5). n=2 Tax=Gallus gallus RepID=UPI0000ECAC04 Length = 389 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. 
Identities = 45/227 (19%), Positives = 84/227 (37%), Gaps = 33/227 (14%) Query: 8 NWPDFLERHW-QKRPVVLKRGFNNFIDPISP--DELAGLAMESEVDSRL-VSHQDGKWQV 63 + F +R+ ++PVVL+ +++ D + +A V L + D +W Sbjct: 163 SLEHFRDRYLIPQKPVVLEGIIDHWPCMKKWSVDYVRQVAGCRTVPVELGSRYTDEEWSQ 222 Query: 64 SHGPFESYDH---LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRI-------DDLMI 113 + + + E+ S+ A + + L +PD+ D + Sbjct: 223 KLMTVNDFINQYIVNESQNSVGYLAQHQLFDQIPELKEDIS-IPDYCCLGEGEEDDITIN 281 Query: 114 SFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------P 157 ++ P G + P H D F+ Q GR+ R+ + PH P Sbjct: 282 AWFGPAGTISPLHQDPQQNFLAQVFGRKYIRLCSPQDSENLYPHESQLLHNTSQVDVEDP 341 Query: 158 DLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 DL + F A L PG +L+IP + H +L+ + + S + Sbjct: 342 DLTKFPNFRKVAFQSCILMPGQVLFIPVKYWHYIRSLDISFSVSFWW 388 >UniRef50_D1WSH6 Cupin family protein n=2 Tax=Streptomyces RepID=D1WSH6_9ACTO Length = 403 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 71/395 (17%), Positives = 120/395 (30%), Gaps = 38/395 (9%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDS--------- 51 + + + +W F +R W + PV+ K P E+ A+ Sbjct: 3 LTVEKSFDWDTFADRFWDRAPVLYKGLD---TAPFDEQEVFRAAVSGSRPPHPLAVPGNL 59 Query: 52 -RLVSHQDGKW------QVSHGPFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRP 99 LV + + G + Y D L ++L+V + + P + Sbjct: 60 QFLVRRRQQTRPHDYLPEAGDGSLDGYERRMADRLEGRRYALVVHRFHSFSHPLWDRAQR 119 Query: 100 FRE-------LPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ 152 F P M S VG H D++ F+ G +R R Sbjct: 120 FYAGLWERVGQPTHTAGSTMFHGSYEHSPVGVHQDRFATFMFCVRGTKRMRFWADRPWSD 179 Query: 153 HCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELIS 212 H L + E+EPGD+LY P + H G + + SV P Sbjct: 180 PV-HTVLDYQPYLASSFVAEVEPGDLLYWPARYYHVGESASDTPATSVNVGIPRREHRPY 238 Query: 213 GFADYVLQ--RELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGE 270 + + R PD P A LP+ + + E + + + Sbjct: 239 YEIKDLFRGTRPQSSAPLFTPDAGPDGRLAGELPRALADAVDAFAEHLAEDRFTDRA-TA 297 Query: 271 FISQSRHELDIAPPEPPYQPDEIYDAL--KQGEVLVRLGGLRVLRIGDDVYANGEKIDSP 
328 + R P EPP P + D ++ L+ G R + + I + Sbjct: 298 LALRVRTAGGFWPTEPPAAPRPLDDDTPVRRCAPLLPAPGEGPPRWAANGHVTSGAIGAD 357 Query: 329 HRPALDALASNIALTAENFGDALEDPSFLAMLAAL 363 L L ++ A+ +A +L L Sbjct: 358 ALAVLRRLDADEAVRVGELPEA-RRADVRRLLQEL 391 >UniRef50_B7FXD3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FXD3_PHATR Length = 481 Score = 137 bits (346), Expect = 5e-31, Method: Composition-based stats. Identities = 46/300 (15%), Positives = 89/300 (29%), Gaps = 53/300 (17%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQ--DGKWQVSH 65 + F + WQ+ F + P + ++ A E + H+ W V Sbjct: 30 SVETFFQTFWQRACGYFPNTFLD--SPPKAESMSRCAWNKERVEQNAYHELVRNGWSVLV 87 Query: 66 GPFESYDHLGE-------------------------------------TNWSLLVQAVNH 88 E+ + E S++ + Sbjct: 88 QLLETSRNRPEHDADLSHQSIPLLFRDQTTLTLEEQVLYDDSLFAAFLDGCSVVTNHADR 147 Query: 89 WHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL 148 AAL + + V H D DVF+IQ G + W++ + Sbjct: 148 RSPWIAALCEDLQASFPH-VYANTYLTPPGSQTVPAHADDRDVFVIQLVGCKAWKIYRNI 206 Query: 149 QMKQHCPHPDLLQVD--------PFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 + H + + + + D L PGD+LY+P G+ HE +A++ ++ V Sbjct: 207 PVPYPYSHEQVGKGELEVPGQVLDGPVLTDRVLAPGDVLYMPRGYVHEAHAVDGGPSFHV 266 Query: 201 GFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP---RAHPADVLPQEMDKLREMMLEL 257 ++G + L VP R + + L++ + + Sbjct: 267 TVALATQDWTLAGLVTAATEASLTQQRSYRQAVPRCFGRRSFESIAVDDKQSLQKQLDDA 326 >UniRef50_Q8N371 JmjC domain-containing protein 5 n=17 Tax=Chordata RepID=JMJD5_HUMAN Length = 416 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. 
Identities = 42/225 (18%), Positives = 79/225 (35%), Gaps = 30/225 (13%) Query: 8 NWPDFLERHW-QKRPVVLKRGFNNFIDP--ISPDELAGLAMESEVDSRL-VSHQDGKWQV 63 + F E+ RPV+LK +++ S + + +A V + + D +W Sbjct: 191 SLQHFREQFLVPGRPVILKGVADHWPCMQKWSLEYIQEIAGCRTVPVEVGSRYTDEEWSQ 250 Query: 64 SHGPFESYDHLG--ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD------LMISF 115 + + + A + + L + + D + ++ Sbjct: 251 TLMTVNEFISKYIVNEPRDVGYLAQHQLFDQIPELKQDISIPDYCSLGDGEEEEITINAW 310 Query: 116 SVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------PDL 159 P G + P H D F++Q GR+ R+ + PH PDL Sbjct: 311 FGPQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNTSQVDVENPDL 370 Query: 160 LQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + F + L PG+IL+IP + H AL+ + + S + Sbjct: 371 EKFPKFAKAPFLSCILSPGEILFIPVKYWHYVRALDLSFSVSFWW 415 >UniRef50_D2PSR6 Cupin family protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PSR6_9ACTO Length = 408 Score = 135 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 73/394 (18%), Positives = 132/394 (33%), Gaps = 55/394 (13%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESE-------------VDSRL 53 L+W +F E +W + PV+++ P DE+ A+ + V Sbjct: 10 LDWAEFAELYWDRHPVLIRGVRP---VPFRADEVFSAALRARCAEGGGRIAPNASVTVEQ 66 Query: 54 VSHQDGKWQV---SHGPFESY-----DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPD 105 D + S G F+ Y D L ++L++ A + + P R F Sbjct: 67 TVQADRDGLLPAESDGCFDGYERRVGDRLDGRKYALIISAFHAFDFPLWDRERRF---FA 123 Query: 106 WRIDDLMISFSV----------PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP 155 D++ + + VG H D++ F+ R+R R+ + + Sbjct: 124 GLWDEVGLPLTSAITTLFHGNYDHSPVGVHKDRFATFMFGLRERKRMRLWTERPWTEQV- 182 Query: 156 HPDLLQVDPFEAI-IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGF 214 ++ + F E+EPGD+LY P + H G SV P T +S Sbjct: 183 -GSVVDYERFLPSSFAVEVEPGDLLYWPASYFHVGENCGRTPATSVNIGVPRTEHRVSYE 241 Query: 215 ADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQ 274 + +L D + + E++ + P Q Sbjct: 242 LEDLLADSDPARLLDDGGRLAVLADG-IDAPMRQEAGEVLPSTV--PPALAQALTAHAKS 298 Query: 275 
SRHELDIAPPE-------PPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDS 327 +++ P P E + L G+++ G + + + ANG I + Sbjct: 299 LTDRVEMVSLRRWTAGGLEPVPPPEPFRPLADGQIVELAQGADLAQYRGALAANGHLITA 358 Query: 328 PHRP-ALDALASNIALTAENFGDALEDPSFLAML 360 P AL L+S ++ DA P+ +L Sbjct: 359 DLPPEALQLLSSGRSVRV----DAANRPALEQLL 388 >UniRef50_A7RV46 Predicted protein n=2 Tax=Eumetazoa RepID=A7RV46_NEMVE Length = 400 Score = 133 bits (336), Expect = 8e-30, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 77/215 (35%), Gaps = 30/215 (13%) Query: 6 TLNWPDFLERHWQK-RPVVLKRGFNNFIDPISP--DELAGLAMESEVDSRL-VSHQDGKW 61 +++ DFL H +K +PV+L + + L +A V L + + D +W Sbjct: 174 SMSLQDFLMSHMKKDKPVILDGMMEAWPAMRKWGLEYLKDIAGYRTVPIELGLRYTDEEW 233 Query: 62 QVSHGPFESYDHLG---ETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD-----LMI 113 + + + A + + L R + D ++ Sbjct: 234 TQKLMTISEFVDKYVSCSNSSQVAYLAQHQLFDQIPELRRDIIIPDYCCLGDDDRDVMIN 293 Query: 114 SFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------P 157 ++ P G V P H D Y+ + Q G + R+ K Q + PH P Sbjct: 294 AWFGPKGTVSPLHHDPYNNLLAQVVGEKYLRLYSKDQTDKLYPHETTLLHNTSQIDVEAP 353 Query: 158 DLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGY 190 DL Q F + + L PG +L+IPPG H Sbjct: 354 DLAQFPAFYKASYQECILRPGQMLFIPPGHWHYVR 388 >UniRef50_Q1D4G2 Cupin family protein n=2 Tax=Myxococcus xanthus DK 1622 RepID=Q1D4G2_MYXXD Length = 442 Score = 133 bits (336), Expect = 8e-30, Method: Composition-based stats. 
Identities = 76/407 (18%), Positives = 129/407 (31%), Gaps = 69/407 (16%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGL---------------AM 45 +E +W F+ R+W +RPV+ K P + D++ A Sbjct: 4 LEIATRFDWDTFVRRYWNQRPVLFKGTQ---ASPFTVDDVFEASAGATQRYLSRSYEPAS 60 Query: 46 ESEVDS---RLVSHQDGKWQVS--HGPFESYD-----HLGETNWSLLVQAVNHWHEPTAA 95 +V RL + +W G + YD LGE ++L++ ++ + Sbjct: 61 RPDVTFTVDRLRQLRSREWLPRKSDGSLDGYDARIASQLGERRYALIIATMHASGFQLWS 120 Query: 96 LMRPFRELPDWRID-------DLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL 148 R F R+ + + VG HLD++ F+ GR+R R K Sbjct: 121 RQRAFFSGLWQRVGMPVTGGITSLFHGTYEHSPVGVHLDRFTTFMFALRGRKRMRFWHKR 180 Query: 149 QMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAP--- 205 + L + E+EPGDILY P + H G + + S+ P Sbjct: 181 PWSEDVS-TILDYQPYLASSFVAEVEPGDILYWPSTYYHVGESAGAGVASSLNVGIPITE 239 Query: 206 -----NTRELISGFADY--VLQRELGGNYYSDPDVPPRAHP--------ADVLPQEMDKL 250 + +L+ G D + +E + P A A LP+ + + Sbjct: 240 HHVIYSVDDLLRGMLDETSLADQEWKQTRLARVSASPLARGALSKNGVLATELPRALTEA 299 Query: 251 REMMLELINQPEHFK----QWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVL-VR 305 ++ + E + W S + E P + VL R Sbjct: 300 VRAFRDVSHPKEARRHIQSTWLKRLTSGGFEPVPPPTREKPLRDSHHVRVDPSFPVLFER 359 Query: 306 LGGLRVLRIGDDVYANGEKIDSPH-----RPALDALASNIALTAENF 347 R + ANG + L S A++ E Sbjct: 360 DSATRWI-----CSANGHALRGAGGGRAIEIFFRKLNSGAAVSVEEL 401 >UniRef50_UPI0001791EB3 PREDICTED: similar to JmjC domain-containing protein 5 (Jumonji domain-containing protein 5) n=1 Tax=Acyrthosiphon pisum RepID=UPI0001791EB3 Length = 379 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. 
Identities = 37/214 (17%), Positives = 69/214 (32%), Gaps = 31/214 (14%) Query: 8 NWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAG---LAMESEVDSRL-VSHQDGKWQ 62 + FL + + PV + ++ +L LA V + S+ D W Sbjct: 153 SLETFLRDFLKPKIPVKITGNMEHWPALNKWKDLNYFVKLAGARLVPVEIGSSYADADWS 212 Query: 63 VSHGPFESYD--HLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL------MIS 114 E + H+ + A + L + + D+ + + Sbjct: 213 QKLITLEEFINIHVVQEGEKPAYLAQHQLFNQIPELKDDIKIPDYCYLTDMDGVEPDINA 272 Query: 115 FSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------PD 158 + P G V P H D + F+ Q G + + + + P+ PD Sbjct: 273 WLGPKGTVSPTHYDPKNNFLAQVVGSKNIILYDPKWSEYLYPYDDKFLKNTAQVDPVKPD 332 Query: 159 LLQVDPFEAI--IDEELEPGDILYIPPGFPHEGY 190 L + F + L G++L+IP G+ H Sbjct: 333 LCKFPNFSQVKAAHCTLNEGEMLFIPSGWWHRVE 366 >UniRef50_A0YLC3 JmjC domain protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YLC3_9CYAN Length = 374 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 48/243 (19%), Positives = 97/243 (39%), Gaps = 33/243 (13%) Query: 1 MEYQLTLNWPDFLERHW-QKRPVVLKRGFNNFI--DPISPDELAGLAMESEVDSRLVSHQ 57 +E +++L+ +FL+ + Q +PVVL NN+ + +P L + V+ + + Sbjct: 124 VERRVSLSRSEFLDGFYSQNKPVVLTGIMNNWKALNLWNPKYLKQHYGTATVEVQGNRNS 183 Query: 58 DGKWQVSHGPFESYDHLGE-----------TNWSLLVQAVNHWHEPTAALMRPFRELPDW 106 D +++++ L + + ++ N E LM P++ Sbjct: 184 DPEYELNVEKHRQKVLLKDYIDWIVEKGESNDCYMVANNQNLDREDLKGLMNDLEVFPEY 243 Query: 107 R----IDDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH----- 156 + + G + P H D ++ + Q GR+R + Q H Sbjct: 244 LNPKDTSRRVFFWFGSAGTITPLHHDPVNLMLAQVLGRKRILLIPPRQTPFLYNHLGVFS 303 Query: 157 ------PDLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSV-GFRAPNT 207 PD + ++ I I+ L+PG++++IP G+ H AL+ +++ S F PN Sbjct: 304 QVDPENPDFKKYPLYQNIKPIELILKPGEVIFIPVGWWHHVRALDVSISVSFTNFVFPNY 363 Query: 208 REL 210 Sbjct: 364 YHW 366 >UniRef50_A9V7T0 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V7T0_MONBE Length = 1016 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. 
Identities = 56/296 (18%), Positives = 94/296 (31%), Gaps = 45/296 (15%) Query: 6 TLNWPDFLERHWQKRPVVLKR-GFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVS 64 TL F R +++P +++ F + + L+ L + EV H + Sbjct: 740 TLTEGAFAVRLTERQPFLVRDCAFGVATASWTAEHLSSLVGDREVSV----HVGEDCNMD 795 Query: 65 HGPFES---YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF----SV 117 Y N+ V + AA +P W + + + S S Sbjct: 796 FTTRNFRPMYLRSMGKNFRKDVSNIEQTFPEVAAEFALPSCVPSWIMGEKLFSTALRVSS 855 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-----------PHPDLLQVDPFE 166 PG + H D D + GR+R + Q PDL F Sbjct: 856 PGVQLWTHYDVMDNVLCNVRGRKRVVLFPPEQAGNLYLEGSSSRVVDIERPDLEAFPRFA 915 Query: 167 AII----DEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRE 222 + + LEPGD+L+IP + H ALE ++ +V D L Sbjct: 916 TAMAHALELILEPGDMLHIPALWCHNVRALEPCISVNV---------FWKHLDDAAL--Y 964 Query: 223 LGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 + Y + DV P A + + + L P ++G + + Sbjct: 965 ASKDLYGNKDVKPAA-------EALRLAHDAAERLQTLPADHAAFYGAYATALLQP 1013 >UniRef50_C4DQG4 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DQG4_9ACTO Length = 423 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. 
Identities = 85/412 (20%), Positives = 130/412 (31%), Gaps = 58/412 (14%) Query: 6 TLNWPDFLERHWQKRPVVLKR-------------GFNNFIDPISPD---ELAGLAMESEV 49 TL+W F E++W + PV+ +R P + D ++ L E Sbjct: 11 TLDWDVFAEKYWDRAPVLYRRVPRAPFLAEEALSAAITASAPGAADVIPDIVRLTCEGR- 69 Query: 50 DSRLVSHQDGKWQVSHGPFESYDH-----LGETNWSLLVQAVNHWHEPTAALMRPFRELP 104 RL++ + S F+SY L +L++ + + R F P Sbjct: 70 --RLLTARGRVPCASDTDFDSYAARVTTALDGERHALVIAGFHPHNPDMWDRQRAF-FHP 126 Query: 105 DW-RIDDLMISFSV-------PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH 156 W R+ M S VG H D++ F+ +GR+R R+ Sbjct: 127 LWERVGLPMTSAITTLFHGNYEHSPVGVHKDRFGTFMYVLSGRKRMRMWPHRPWSHDAS- 185 Query: 157 PDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFAD 216 L + I E PG ++Y P H G + + SV P + Sbjct: 186 TILDYARYLDTSIAAEAGPGQLMYWPASHFHVGETVGSEPATSVNVGVPREGRRVEFEMT 245 Query: 217 YVLQRELGGNYYSDPD--VPPRAHPADVLP-----QEMDKLREMMLELINQPEHFKQ--W 267 +L +DPD + R P DV P L + + ++Q + W Sbjct: 246 DLLTDLPASAL-TDPDAYLETRMPPIDVDPFVDPADAATGLPLALRQGMDQAVSLLERSW 304 Query: 268 FGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLR---IGDDVYANGEK 324 GE A P L L R G VL +G Sbjct: 305 EGERRLAVTLNRYTAGGFRPVPDPVPRPELTDATRLRRAPGATVLWARADDATALCSGNG 364 Query: 325 IDSPHRP-------ALDAL-ASNIALTAENFGDALEDPSF---LAMLAALVN 365 + RP LD L A+ T DA+ +P +LA L Sbjct: 365 HTASARPSPKAITALLDRLDANATPATVAELLDAVGEPEREHCRELLAELCA 416 >UniRef50_B3RNN1 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RNN1_TRIAD Length = 405 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. 
Identities = 44/233 (18%), Positives = 79/233 (33%), Gaps = 39/233 (16%) Query: 8 NWPDFLERHW-QKRPVVLKRGFNNFIDP----ISPDELAGLAMESEVDSRL-VSHQDGKW 61 + F + ++ K PV+L +++ S L +A V + + D W Sbjct: 174 SLDYFRKHYFCTKEPVILTDVIDHWPALGARRWSIQRLKDIAGHRTVPIEIGTRYTDDSW 233 Query: 62 QVSHGPFESYDHL---GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRID--------- 109 P + E+N A + E L +PD+ Sbjct: 234 TQKLMPLSKFIDEFITMESNQESGYLAQHQLFEQIPELRTDI-CVPDYCCIIDDNNDDVD 292 Query: 110 --DLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--------- 157 L ++ P G + P H D Y Q GR+ R+ + + + P+P Sbjct: 293 ATVLTNAWFGPQGTISPLHHDPYHNLFAQVMGRKYIRLYPEHESENVYPYPTKLLSNTSQ 352 Query: 158 ------DLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + F ++ +EPG +LYIPP H +L+ + + S + Sbjct: 353 VDVEFPNFENYPNFANAEYLECIIEPGQLLYIPPRCWHYVRSLDISFSVSFWW 405 >UniRef50_C1C1Z9 JmjC domain-containing protein 5 n=1 Tax=Caligus clemensi RepID=C1C1Z9_9MAXI Length = 394 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 74/213 (34%), Gaps = 29/213 (13%) Query: 6 TLNWPDFLERHWQ-KRPVVLKRGFNNFIDPISPD--ELAGLAMESEVDSRL-VSHQDGKW 61 TL++ F+E++ + + PV++K NN+ + +A V + + DG W Sbjct: 169 TLDFETFVEKYKETQTPVIIKGLANNWPARAKWSIPYIRSIAGYRTVPIEIGSRYTDGNW 228 Query: 62 QVSHGPFESY--DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD----LMISF 115 ++ + + + A ++ + + L + + + Sbjct: 229 TQRLMTINAFIDQFIDSPSQTTGYLAQHNLMDQVSDLKEDIETPDYCFSGEEDSEDVNFW 288 Query: 116 SVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP----------------HPD 158 P G V P H D + Q G + R+ + Q K +PD Sbjct: 289 FGPCGTVSPLHTDPKHNILTQVVGYKYVRLYDPDQTKYLYSYSEEDLMSNTSQIDIENPD 348 Query: 159 LLQVDPFEAII--DEELEPGDILYIPPGFPHEG 189 + F + LEPGD LYIPP H Sbjct: 349 FNEFPEFRHALGLQGILEPGDALYIPPKMWHYV 381 >UniRef50_B8BVR1 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8BVR1_THAPS Length = 488 Score = 131 bits (329), Expect = 5e-29, Method: Composition-based stats. 
Identities = 46/295 (15%), Positives = 89/295 (30%), Gaps = 47/295 (15%) Query: 9 WPDFLERHWQKRPVVLKR-----------------GFNNFIDPI------SPDELAGLAM 45 F + WQ +P++ + GFN D + S + + Sbjct: 40 AKSFFKHIWQHQPMIFRSTHKTCQADGILRQTMTMGFNGVADMLHNCRKSSSPQSDDTST 99 Query: 46 ESEVDSRLVSHQDGKWQVSHGPFESYDHLGE----TNWSLLVQAVNHWHEPTAALMRPFR 101 S + + Q+G P+ Y S++V + A L + Sbjct: 100 NSAATAPPLFFQNG--SPITDPYSMYSSNPHAAYLDGCSIVVNHADLQSASIAKLCNDLQ 157 Query: 102 ELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQ 161 + G V H D DVF+IQ G ++W V +K+ ++ + + + Sbjct: 158 SSFPH-VYANAYLTPPNGFAVNAHADDRDVFVIQVLGTKKWNVYKKVPVEYPFENEQVGK 216 Query: 162 VDPFEAIIDEE-----------LEPGDILYIPPGFPHEGYA------LENAMNYSVGFRA 204 E L PGD++Y+P GF HE ++ ++ + Sbjct: 217 SGREVPPSVFEGGLCFGNNVLDLGPGDVMYMPRGFVHEATTEILDVEDGHSPSFHITIAI 276 Query: 205 PNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN 259 +S ++ L +P L+ + + + Sbjct: 277 ATHDWCLSVVLADCFRKTLSEVVDYRKALPIGPSKEYEPEDSSSFLKRQLNQAMK 331 >UniRef50_C7Q411 Cupin 4 family protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q411_CATAD Length = 395 Score = 130 bits (327), Expect = 9e-29, Method: Composition-based stats. 
Identities = 55/374 (14%), Positives = 116/374 (31%), Gaps = 54/374 (14%) Query: 23 VLKRGFNN---FIDPISPDELAGLAMESEV-DSRLVSHQDGKWQVSHGPFESY------- 71 V + ++ ++P+ L L M + + +L +QDG ++ Sbjct: 25 VARGHIDDQDLLTRLLTPNHLLELVMRRHLANPQLRMYQDGAVLHPAAFLTNFVSRRHQA 84 Query: 72 ---------DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGV 122 + +L++ +N + R + G Sbjct: 85 SRRADMAAVGRILNEGGTLILDTINQFDPTLEVACRALGWWTGELVSVNAYLAVGDTAGF 144 Query: 123 GPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIP 182 H D +DV ++Q G++ W V + + P E + + GD+++IP Sbjct: 145 STHWDDHDVLVVQVAGQKSWEVRPASRPVPMYRDAEQNLEAPEELLWSGTMNTGDVMHIP 204 Query: 183 PGFPHEGYALEN----AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAH 238 GF H + + +++ + G + AD EL +D + P A Sbjct: 205 RGFWHAATRVGSGEGISLHLTFGITRRTGVTWVQHLADAARDVELF---RTDLENPAGAD 261 Query: 239 PADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSR---HELDIAPPEPPYQPDEIYD 295 + + +D ++ P+ + Q E +R H P E Sbjct: 262 AKLLTTKLLDLAVDV------NPQRYLQQMREATPAARHIPHVPAFGPLGGAVTVTEYAP 315 Query: 296 AL--KQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENFGDALED 353 + ++ + VR G ++ + L S + ++ D Sbjct: 316 TIIAREANLEVRAAGKKL------------TLSPRAEGWARTLLSGNPIRFDDTT----D 359 Query: 354 PSFLAMLAALVNSG 367 P +A+ + G Sbjct: 360 PGAIALAERFIQEG 373 >UniRef50_C1E0D1 Predicted protein n=2 Tax=Micromonas RepID=C1E0D1_9CHLO Length = 297 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. 
Identities = 45/245 (18%), Positives = 79/245 (32%), Gaps = 43/245 (17%) Query: 1 MEYQLTLNWPDFLERHWQ-KRPVVLKRGFNNFIDPISPDELAGLAMESE---VDSRLVSH 56 +E + +F ++ +PV L ++ +L LA E V + ++ Sbjct: 53 LERAEGITAKEFKRNYFNADKPVCLGNLGGSWPALAKWRDLRWLAREHGHRNVPLEVGAY 112 Query: 57 QDG-KWQVSHGPFESY---------------DHLGETNWSLLVQAVNHWHEPTAALMRPF 100 D W+ S+ + G + A + E L+ F Sbjct: 113 DDAANWKEEVMLLSSFIDEYLMPGLKKELAGEDQGREGRRIAYLAQHQLFEQLPGLLGDF 172 Query: 101 RELPDWRIDDLM---ISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH 156 P + + ++ G V P H D YD + Q G + R+ + H Sbjct: 173 DPPPVCDVAGGVQRVNAWIGTAGTVTPCHFDSYDNLLGQVAGYKFVRLYSEDDSPFLYRH 232 Query: 157 -----------------PDLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMN 197 PDL + F +D L PG+ +YIP H AL +++ Sbjct: 233 QGARDAQGNISRVDVERPDLERFPLFAKATHMDVVLGPGEFIYIPARCWHYVRALTTSVS 292 Query: 198 YSVGF 202 + F Sbjct: 293 LNFLF 297 >UniRef50_UPI0001927155 PREDICTED: similar to jumonji domain containing 5 n=1 Tax=Hydra magnipapillata RepID=UPI0001927155 Length = 406 Score = 129 bits (325), Expect = 2e-28, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 69/212 (32%), Gaps = 30/212 (14%) Query: 21 PVVLKRGFNNFIDP----ISPDELAGLAMESEVDSRLV-SHQDGKWQVSHGPFESYDHLG 75 P+++ G ++ + +A V + + W + Sbjct: 195 PIIISDGVQHWPAFSNRKWDISYIKKVAGSRTVPIEVGDKYTSENWTQKLISVGEFIDKY 254 Query: 76 E-TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD------LMISFSVPGGGVGP-HLD 127 TN + A + E L I + + ++ P G V P H D Sbjct: 255 ICTNNKIGYLAQHQLFEQIPELRDDICIPDYCCISEQENNRVMTHAWFGPKGTVSPLHHD 314 Query: 128 QYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------PDLLQVDPF--EAIID 170 Y +Q G + R+ ++ + PH D + F ++ Sbjct: 315 PYHNLFVQVLGEKYIRLYDRKDSENLYPHESQMLNNTSQVDLENVDAEKFPLFLQTNYVE 374 Query: 171 EELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 L+ G++LYIPP + H +LE + + S + Sbjct: 375 CVLKQGEMLYIPPKWWHYVRSLETSFSVSFWW 406 >UniRef50_B6KFH2 Putative uncharacterized protein n=4 Tax=Toxoplasma gondii RepID=B6KFH2_TOXGO Length = 508 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. 
Identities = 44/327 (13%), Positives = 88/327 (26%), Gaps = 28/327 (8%) Query: 58 DGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSV 117 + + SL++ + E ++ + + + + Sbjct: 173 ERHRTTTSASLSRATCRYLEGCSLVINQADRTLEILQSICQHLSKKYFSHVFAVSYLTPP 232 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH----PDLLQVDPFEAIIDEEL 173 V H D DVF++Q G + W++ Q+ + DP + +++ L Sbjct: 233 RTHAVKTHTDDQDVFLLQVWGSKAWKIWTPPQILPLTEEMLGKREAFPDDPGKPLLEFVL 292 Query: 174 EPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTR----ELISGFADYVLQRE---LGGN 226 + GDILYIP GFPH E + + P + ++ Sbjct: 293 KEGDILYIPRGFPHAAVTTEE-PSLHITLTVPTAEFAYVTCLQRLVKSLVLTHTLPSDTE 351 Query: 227 YYSDPDVPPRAHPADVLPQEMDKLREMMLELINQP------EHFKQWFG-------EFIS 273 + + P +++ LR + Q + Sbjct: 352 RRCRSALLLKDVPGA--AEDLHALRAAVDACAEQVASRLNYDALCNSLSSQLETVNAMQR 409 Query: 274 QSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYANGEKIDSPHRPAL 333 + L P P+ + G + KI + Sbjct: 410 RQFLRLQAVPVSTPFTETSLVALAAGLTCKCEEGSAEAVFFKGTQSLT-MKICPSASKLI 468 Query: 334 DALASNIALTAENFGDALEDPSFLAML 360 + LA+ T + F +L Sbjct: 469 NVLANRQPATVCDLPCKDPFERFCVLL 495 >UniRef50_B2SWM5 Transcription factor jumonji jmjC domain protein n=3 Tax=Burkholderia RepID=B2SWM5_BURPP Length = 338 Score = 127 bits (320), Expect = 6e-28, Method: Composition-based stats. 
Identities = 44/232 (18%), Positives = 82/232 (35%), Gaps = 31/232 (13%) Query: 1 MEYQLTLNWPDFLERH-WQKRPVVLKRGFNNFI--DPISPDELAGLAMESEVDSRLVSHQ 57 +E + L+ F E++ +Q RPV++ F+ + S D L E EV+ + Sbjct: 89 IERRERLSRYAFFEQYYFQNRPVIITGAFDFWPARSLWSWDYLRERCGEREVEVQFGRES 148 Query: 58 DGKWQVSHGPFESYDHL-----------GETNWSLLVQAVNHWHEPTAALMR---PFREL 103 D ++++ ++ + +H AAL P E Sbjct: 149 DANYEINQPKLRRTMRFADYVDLVEQSGPTNDFYMTANNTSHNRAALAALWSDVPPIDEY 208 Query: 104 PDWRIDDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP----- 157 D D + P G P H D + F+ Q GR++ ++ H Sbjct: 209 LDASSPDTGFFWMGPAGTKTPFHHDLTNNFMAQVIGRKQIKLVPLSDTPFMANHLHCYSQ 268 Query: 158 ------DLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 D +I+ L PG++L++P G+ H L+ ++ + Sbjct: 269 VDGAAIDYDSFPSMRQAQLIECTLAPGELLFLPIGWWHYVEGLDASVTMTFT 320 >UniRef50_Q1D441 JmjC domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D441_MYXXD Length = 295 Score = 127 bits (319), Expect = 6e-28, Method: Composition-based stats. Identities = 50/272 (18%), Positives = 93/272 (34%), Gaps = 51/272 (18%) Query: 1 MEYQLTLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 +E + F E + +KR PVVL +++ A + D R+V + Sbjct: 9 IERISSPTPAFFREHYLEKRRPVVLTGVVSHWPAVTRWS--ADSFKQRFGDHRVVVERSR 66 Query: 60 KWQVSHGPFESYDHLGETNWSL---LVQAVNHWHEPTA-------------ALMRPFRE- 102 S+ P E + L + + ++ H P A L+ F Sbjct: 67 ASVPSNDPLEFLRNRYYEEARLGDTIARMMSGEHPPGAYYVTYANIFDAAPELLGDFESP 126 Query: 103 ---------LPDWRIDDLMI---SFSVPGGGV-GPHLDQYDVFIIQGTGRRRWRVGEKLQ 149 P D L + + P G V H D+ + F Q +GR++W + Sbjct: 127 PQTWGIPPHYPRALQDRLTLRPGFWLGPAGTVSAVHFDRQENFNAQISGRKKWTLYSPQD 186 Query: 150 MKQHC----------------PHPDLLQVDPFEAII--DEELEPGDILYIPPGFPHEGYA 191 + PD + F + LEPG++L+IP G+ H Sbjct: 187 SRHLYYPALDMPTVIFSPVDIEAPDARRFPRFAEAQPYETILEPGELLFIPAGWWHHVRT 246 Query: 192 LENAMNYSVGFRAPNTRELISGFADYVLQREL 223 LE +++ + + + + + +++L Sbjct: 247 LELSISLNFWWWTLASVGTTARVNYHFARKQL 278 >UniRef50_B5W056 Transcription factor jumonji n=2 Tax=Arthrospira RepID=B5W056_SPIMA Length = 375 Score = 126 bits 
(318), Expect = 1e-27, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 97/243 (39%), Gaps = 33/243 (13%) Query: 1 MEYQLTLNWPDFLERHWQK-RPVVLKRGFNNFIDP--ISPDELAGLAMESEVDSRLVSHQ 57 ++ + ++ +FLE ++ + P++L N+ +P+ L ++ V+ + Sbjct: 126 IDRKPWVSRSEFLESYYSRNTPLILTDILTNWRALELWTPEYLKQNYGQAMVEIQAGREA 185 Query: 58 DGKWQ---VSHGPFESYDHLGE--------TNWSLLVQAVNHWHEPTAALMRPFRELPDW 106 D ++ H + + ++ ++ N L+ ++ Sbjct: 186 DPDYEINLQRHQKTVRFADYIDWVVSGKQTNDYYMVANNRNLDRPEFKGLLNDLEIFTEY 245 Query: 107 R----IDDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH----- 156 + + P G V P H D ++ + Q +GR+ R+ Q+ H Sbjct: 246 LDPTQTSGCIFFWYGPAGTVTPLHHDPVNLLLAQVSGRKLIRMIPPYQVPFLYNHIGVFS 305 Query: 157 ------PDLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSV-GFRAPNT 207 PD + F+ + I+ LEPG++++IP G+ H +LE +++ S+ F PNT Sbjct: 306 EVDLENPDYRKYPLFQKVRPIEFILEPGEVIFIPVGWWHHVRSLEPSISVSMTNFVFPNT 365 Query: 208 REL 210 E Sbjct: 366 YEW 368 >UniRef50_C1FDI9 JmjC transcription factor domain-containing protein n=2 Tax=Micromonas sp. RCC299 RepID=C1FDI9_9CHLO Length = 636 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. 
Identities = 39/235 (16%), Positives = 77/235 (32%), Gaps = 31/235 (13%) Query: 10 PDFLERHWQKRPVVLKRGFNNFIDPISP-DELAGLAMESEVDSRLVSHQDG--KWQVSHG 66 FL W ++ + + N++ P + A +A + + R+ +D + Sbjct: 302 EHFLHCTWTQKLMSMAEFMENYVRPEKAVPQAAEMANQFHLKQRIRKRRDAMHAFCGRTE 361 Query: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM-------ISFSVPG 119 P E +D + S A + E L+ + + P Sbjct: 362 PQEIFDETVFSRCSKGYMAQHDIFEHIPRLLHDLDFPFFCSQGSCTRGHFPKKMIWIGPA 421 Query: 120 GGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPD------------------LL 160 G + P H D + Q G + R+ + L Sbjct: 422 GTISPLHTDPHANLFSQIAGYKYVRLYAPRCETNLYRNTTAKYCNSSQIELRGSLMGMLS 481 Query: 161 QVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISG 213 + F +D L PGD+L+IPP H +L ++++ ++ R +E++ Sbjct: 482 EFPDFLNAPYVDCVLGPGDLLHIPPLHWHYVQSLTSSVSVTMWCRPKYAQEILDE 536 >UniRef50_UPI000186F041 protein PTDSR-A, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186F041 Length = 423 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 80/233 (34%), Gaps = 39/233 (16%) Query: 8 NWPDFLERHWQK-RPVVLKRGFNNFIDPISPDE---LAGLAMESEVDSRLVS-HQDGKWQ 62 + F + K PV L N++ + + G A V + + + Sbjct: 191 SLEYFYNNYMIKNTPVKLTGCMNHWPALKLWKDFGYIVGKAGCRTVPVEIGKHYAHDTYS 250 Query: 63 VSHGPFESYDHLGETNWS---LLVQAVNHWHEPTAALMRPFRELPDWR-------IDDL- 111 + N S + A + + L + +PD+ +D+ Sbjct: 251 QKLMKISEFVEEYINNPSKSAIGYLAQHQLFDQVPELKKDI-IIPDYCALTLKPDVDENS 309 Query: 112 ---MISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP---------- 157 + ++ P + P H D + + Q G ++ + + + PHP Sbjct: 310 ETEINAWFGPNATISPLHNDPKNNLLCQVVGTKKLILFSQSDTQFLYPHPSSILFNTSRV 369 Query: 158 -----DLLQVDPFEAI---IDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 D F+ + + L+PG+++YIPP + H +LEN+ + S + Sbjct: 370 DVENPDFNSFPEFKKVKTKMTCLLKPGEMIYIPPKYWHHVRSLENSFSVSFWW 422 >UniRef50_A4RRR9 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RRR9_OSTLU Length = 235 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. 
Identities = 41/219 (18%), Positives = 69/219 (31%), Gaps = 37/219 (16%) Query: 21 PVVLKRGFNNFIDPISPDE---LAGLAMESEVDSRL-VSHQDGKWQVSHGPFESYDHLGE 76 P+VL ++ + L + + V L ++ D W + Sbjct: 16 PIVLDALVKHWPAVTKWRDGAYLDEIVGDRTVPVELGKTYVDDAWSQKLMTMREFMDAYV 75 Query: 77 TN------------WSLLVQAVNHWHEPTAALMRPFRELPDWRID----DLMISFSVPGG 120 + A + E L R E + + ++ P Sbjct: 76 DGDDDESTRRASGGADVGYLAQHELFEQCPELKRDIEEPLYCALGTGTVCAVNAWFGPAH 135 Query: 121 GVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP--------------HPDLLQVDPF 165 P H D + + Q G +R R+ + + P HP+L + F Sbjct: 136 TESPAHTDPHHNLLCQVIGVKRVRLFAPSETPKMYPRDAPMSNTSRVDVMHPNLDEFPLF 195 Query: 166 EAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + ID L PGD LYIPPG+ H A + + S + Sbjct: 196 VDVEFIDATLYPGDALYIPPGWWHRVKAATVSFSVSYWW 234 >UniRef50_D0MSD2 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MSD2_PHYIN Length = 249 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 73/246 (29%), Gaps = 51/246 (20%) Query: 8 NWPDFLERH-WQKRPVVLKRGFNNFIDP----------ISPDELAGLAMESEVDSRL-VS 55 +F Q +PVV+ + S + L +A V + S Sbjct: 3 TLEEFRRTVMLQNKPVVITGAMEFWPALGRAAGPERAWKSVEYLRRIAGLRTVPVEIGSS 62 Query: 56 HQDGKWQVSHGPFESY------------DHLGETNWSLLVQAVNHWHEPTAALMRPFREL 103 + W + + + L A + + AL R Sbjct: 63 YLGDDWGQELMTLNEFLDRHIIPPLAEENDHPVSPRKLGYLAQHRLFDQIPALGRDIMTP 122 Query: 104 PDWRI----------DDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQ 152 + D + + PGG V P H D D + Q G + R+ + + Sbjct: 123 DYCTVQRDEDAGDEEDITINCWFGPGGTVSPLHFDPKDNVLCQVVGSKYLRLYAPEESDK 182 Query: 153 HCP--------------HPDLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAM 196 P PD + F ++ L G++LYIPP + H +L + Sbjct: 183 LYPIEGLLSNTSLVQVEDPDDERFPKFRNARYVECVLHEGEMLYIPPKYWHYVKSLFTSF 242 Query: 197 NYSVGF 202 + S + Sbjct: 243 SVSFWW 248 >UniRef50_D0S717 JmjC domain-containing protein n=1 Tax=Acinetobacter calcoaceticus RUH2202 RepID=D0S717_ACICA Length = 377 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. 
Identities = 43/237 (18%), Positives = 87/237 (36%), Gaps = 39/237 (16%) Query: 9 WPDFLERHW-QKRPVVLKRGFNNFIDPISP--DELAGLAMESEVDSRLVSHQDGKWQVSH 65 + DF++ ++ Q RPV+LK+G ++ + A + V+ ++ ++D +++ Sbjct: 135 FSDFIKDYYSQHRPVILKKGVEHWPALYKWTPEYFATRFGQHLVEVQMNRNKDKQFERHS 194 Query: 66 GPFESYDHLGE------------------TNWSLLVQAVNHWHEPTAALMRPFRELPDWR 107 + + E N + Q + + L Sbjct: 195 PLLKQTMKMSEFVSKVMSVEASNDFYMTANNATNSHQMLQELFLDIGDFADGYSNL--AL 252 Query: 108 IDDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--------- 157 D+ + P G P H D + ++Q GR++ + LQ+ Sbjct: 253 KDERSFLWFGPKGTFTPLHHDLTNNMLVQIYGRKKVTLIPALQVPHLYNDHWVFSELSDA 312 Query: 158 ---DLLQVDPFEAII--DEELEPGDILYIPPGFPHEGYALENAMNYSV-GFRAPNTR 208 D + ++I + L G+ L+IP G+ H +L+ +M+ S F APN Sbjct: 313 NKIDFKKYPLAKSITPVECILNAGEALFIPIGWWHSVESLDVSMSISFTNFNAPNHF 369 >UniRef50_B0X4Y9 Putative uncharacterized protein n=2 Tax=Culicini RepID=B0X4Y9_CULQU Length = 417 Score = 125 bits (313), Expect = 4e-27, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 76/233 (32%), Gaps = 39/233 (16%) Query: 8 NWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGL---AMESEVDSRL-VSHQDGKWQ 62 F H+ +R P +L+ ++ + L A E V + + W Sbjct: 185 TLEYFGTHHYDRREPALLEGIIEDWPALERWHDPNYLIAAAGERTVPVEVGSQYSSDDWS 244 Query: 63 VSHGPFESY--DHLGETNWSLLVQA--------VNHWHEPTAALMRPFRELPDW--RIDD 110 F+ + HL E + + + + + L R +PD+ R D Sbjct: 245 QRLVKFKDFIAQHLTEESATRNIDNEQDRAYLAQHELFDQIPTLREDIR-VPDYIGRTDT 303 Query: 111 L--MISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL------- 160 + ++ P G V P H D + Q G + + PH + Sbjct: 304 NPRIKAWLGPKGTVSPLHTDPGHNLLCQVFGSKIIILAPPDSTPNLYPHEHFILNNTSQI 363 Query: 161 ------QVDPFEAIIDE-----ELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + F D EL G +LYIPPG+ H +L + + S F Sbjct: 364 VDAKAIDYERFPRARDVRFRRLELRRGQVLYIPPGWWHYVESLSPSFSVSFWF 416 >UniRef50_C6XNR6 Transcription factor jumonji jmjC domain protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XNR6_HIRBI Length = 347 Score = 124 bits (311), Expect = 6e-27, Method: Composition-based stats. 
Identities = 37/225 (16%), Positives = 77/225 (34%), Gaps = 30/225 (13%) Query: 7 LNWPDFLERHWQK-RPVVLKRGFNNFIDP--ISPDELAGLAMESEVDSRLVSHQDGKWQV 63 L F ++ P+++K +++ S D +++++ + + ++++ Sbjct: 105 LTPQAFFANYYATNTPLLIKNMVSHWPAMQRWSLDYFEEKLGDAKIEVQFDRDTNARYEI 164 Query: 64 SH------GPFESYDHLGETN-----WSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLM 112 F Y L + L N + A L +L D+ D Sbjct: 165 DSVSHKKVMHFREYIALLRKGEETNNYYLTANNGNTNAKALAPLWDDIIQLDDYLQPDKT 224 Query: 113 --ISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGE-------------KLQMKQHCPH 156 + P G + P H D + F++Q +GR++ + Sbjct: 225 PGYLWIGPKGTLTPFHHDLTNNFLLQISGRKQVVLAPGFEVDRMRNSQHCFSDWSVDIEG 284 Query: 157 PDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 + ++D LEPGD+L++P G+ H L+ S Sbjct: 285 AANAEAGRRPGMVDCILEPGDVLFLPVGWWHYVKGLDMTFGMSFT 329 >UniRef50_B9IN14 Predicted protein n=2 Tax=rosids RepID=B9IN14_POPTR Length = 764 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 51/294 (17%), Positives = 91/294 (30%), Gaps = 79/294 (26%) Query: 6 TLNWPDFLERHWQKRPVVLKRGF-----------------------NNFID--------- 33 L + +F+ HW+ P +++R F+ Sbjct: 279 DLGFENFMLHHWESSPSLVRRLSGSLTEENDILSSFAESLNCKEPCPTFVASILQSFISC 338 Query: 34 -PISPDELAGLAMESEVDS----RLVSHQDGKWQVSHGPFES------------------ 70 PI+ DEL ++ EV S ++ QD + + P + Sbjct: 339 VPIASDELNIISFLEEVRSELGCPIIYDQDIRVLRTEQPSKKEVHFFQKKVDPCCFKKLA 398 Query: 71 --------YDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR-IDDLMISFSVPGGG 121 + + +++ ++ V AA+ L + + G Sbjct: 399 FNNVDIMKCEEAFKEGYTIALRGVEFRFASIAAVADALASLFGQPSVGANIYLTPPNSQG 458 Query: 122 VGPHLDQYDVFIIQGTGRRRWRVGE------KLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 + H D + VF+ Q G ++W + L + + L Sbjct: 459 LARHCDDHCVFVCQLFGTKQWTIYPRPNLQLPRLYDPFDREHCLGEQNSLAECRKFLLRE 518 Query: 176 GDILYIPPGFPHEGYALEN--------AMNYSVGFRAPNTRELISGFADYVLQR 221 GDILYIP GFPHE ++ +++ + G E GFA L R Sbjct: 519 GDILYIPRGFPHEACTHDDGSSDLARFSLHVTFGVEVEPPFE-WEGFAHVALHR 571 >UniRef50_Q2RW70 Cupin region n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW70_RHORT Length = 301 Score = 122 bits 
(307), Expect = 2e-26, Method: Composition-based stats. Identities = 47/290 (16%), Positives = 94/290 (32%), Gaps = 28/290 (9%) Query: 7 LNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHG 66 L+ ++ E+ +++ + F D +S L L V Sbjct: 12 LSVSEWNEQFSKRQLRHIPAAFPESADLMSWGWLEDFLNREYTRPELFRFFMNGRPVEPS 71 Query: 67 PFESYDHLGE-----------TNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISF 115 F D G + + ++ + + I Sbjct: 72 RFGLIDGKGRLDRKALRPLLTQGITTIFNGLDSSSGYFWEEAVKLEQALGAVVTIDAIGS 131 Query: 116 SVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEELEP 175 G+ PH D D+ I+Q GR+ W++ L P P + ++ Sbjct: 132 FGTVCGLPPHYDDRDLIIVQVAGRKHWKI---LGTPVEGPWRKRTMSVPDTVTDEFVMQG 188 Query: 176 GDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPP 235 GD+L++P G H+ LE +++ P + +L+ +DP++ Sbjct: 189 GDMLFVPAGLYHQCVPLEPSLHLGALITRPCGAD--------LLKMVQPRWETTDPELAA 240 Query: 236 RAHPAD---VLPQEMDKLREMMLELINQPEHFK---QWFGEFISQSRHEL 279 R + D L Q+ +L+E ++ L+ + W + R +L Sbjct: 241 RLYVGDGETDLQQQDARLKEALIRLVQDMDVAALTRAWLAQKQRPVRADL 290 >UniRef50_Q8S3P4 OSJNBa0011F23.16 protein n=4 Tax=Oryza sativa RepID=Q8S3P4_ORYSJ Length = 774 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. 
Identities = 53/340 (15%), Positives = 107/340 (31%), Gaps = 95/340 (27%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN------FIDPIS------------------------- 36 ++ +FL +W+K ++ R N F ++ Sbjct: 287 DYENFLLNYWEKSTYLVTRKQKNLHVDSVFTSLLNEFDLKTPDTIIQSLVNGIVSCPAIA 346 Query: 37 PDELA------------GLAMESEVDSRLVSHQDG-------------------KWQVSH 65 DEL G ++ D R+V D +Q + Sbjct: 347 SDELDISSFLREVQGSLGATVKYRQDIRVVRINDQCDQTSIGYAMEEHFFDDGMTFQDAD 406 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR-IDDLMISFSVPGGGVGP 124 E + +S+ ++ + E AA+ +L + + G+ Sbjct: 407 AFVEKCKDAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLAR 466 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEE---------LEP 175 H D + V + Q G ++W + K P +PFE + D L Sbjct: 467 HYDDHCVLVWQLLGCKKWMIWP--DTKLLLP----RLYEPFEPLDDLVDDCGGRMEILLE 520 Query: 176 GDILYIPPGFPHEGY------------ALENAMNYSVGFRAPNTRELISGFADYVLQREL 223 GDI+Y+P GF HE + ++ +++ ++ E GF ++ Sbjct: 521 GDIMYVPRGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAIEVEPPFE-WEGFT-HIALHCW 578 Query: 224 GGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEH 263 ++S P V + + L + + L+++ + Sbjct: 579 TEKHWSSPFVKSQE---EARTSLFALLLHVAIRLLSKNDA 615 >UniRef50_Q96EW2 HSPB1-associated protein 1 n=23 Tax=Amniota RepID=HBAP1_HUMAN Length = 488 Score = 121 bits (304), Expect = 4e-26, Method: Composition-based stats. 
Identities = 40/264 (15%), Positives = 80/264 (30%), Gaps = 34/264 (12%) Query: 65 HGPFESYDHLG---ETNWSLLVQAVNHWHEPTAAL-MRPFRELPDWRIDDLMISFSVPGG 120 GPF YDH ++ V + + F P + + G Sbjct: 112 SGPFRDYDHSKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGF-PGRNGQESTLWIGSLGA 170 Query: 121 GVGPHLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCP-----------------HPDLLQV 162 HLD Y + Q GR+RW + P +PDL + Sbjct: 171 HTPCHLDSYGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRF 230 Query: 163 DPFEAIID--EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQ 220 F L PG +L++P + H +++ + S+ + ++ + + + Sbjct: 231 PQFRKAQRHAVTLSPGQVLFVPRHWWHYVESIDP-VTVSINSWIELEEDHLARVEEAITR 289 Query: 221 RELGGNYYSDPDVPPRA--HPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHE 278 + ++ RA +P +V + ++ +F + E Sbjct: 290 MLVCALKTAENPQNTRAWLNPTEVEETSHAVNCCYLNAAVS------AFFDRCRTSEVVE 343 Query: 279 LDIAPPEPPYQPDEIYDALKQGEV 302 + + + E + EV Sbjct: 344 IQALRTDGEHMKKEELNVCNHMEV 367 >UniRef50_UPI00005241B3 PREDICTED: similar to JmjC domain-containing protein 5 (Jumonji domain-containing protein 5) n=1 Tax=Ciona intestinalis RepID=UPI00005241B3 Length = 346 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. 
Identities = 43/301 (14%), Positives = 95/301 (31%), Gaps = 55/301 (18%) Query: 3 YQLTLNWPDFLERHWQK-RPVVLKRGFNNFIDP-ISPDELAGLAMESEVDSRLVSHQDG- 59 TL+ +F E + +K +PVV+ G + S D L + V R ++ D Sbjct: 14 RLNTLDKLEFEESYLRKGKPVVITAGLDGLACSKWSIDYLLERVGLNNVTVRGRTNSDEY 73 Query: 60 ----KWQVSHGPFESYDHLGE----TNWS--LLVQAVNHWHEPTAALMR------PFREL 103 ++ + F Y + + VQ ++ + Sbjct: 74 KVGKQYIIRETTFREYISDMRAKSVRGLTSYMAVQNISKTFPQLQDDCKIPDIGKLHNGP 133 Query: 104 PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP------ 157 W + H D ++ G +R ++ + +++ P+P Sbjct: 134 FLWVAHKGHYEY--------CHYDPDASLLMMIEGSKRVKLFSCIDLEKMYPNPLGSRGK 185 Query: 158 -----------DLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA 204 DL + F + L+ GD+L IP + H+ +L ++++ + F Sbjct: 186 TIQSQVDCDNVDLEKFPKFSEVTCYSCTLQAGDLLLIPAFWWHQVTSLSDSVSMNAFFGE 245 Query: 205 PNTRELISGFADY---------VLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMML 255 I+ + +L ++ + + P V + + E+ Sbjct: 246 SGEDRYITRIMEEPVWPSFKYWLLNIVEQNRPFASFERTLQRLPLCVENLLLKQFHEVAP 305 Query: 256 E 256 + Sbjct: 306 Q 306 >UniRef50_Q86NX2 GM21055p n=10 Tax=Drosophila RepID=Q86NX2_DROME Length = 401 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. 
Identities = 36/232 (15%), Positives = 71/232 (30%), Gaps = 38/232 (16%) Query: 8 NWPDFLERHWQ-KRPVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS-HQDGKWQ 62 + +F + ++ +P +L ++ +L L A V + S + +W Sbjct: 170 SLEEFQTKCFEAGQPTLLLNTIQHWPALHKWLDLNYLLQVAGNRTVPIEIGSNYASDEWS 229 Query: 63 VSHGPFESY------DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL----- 111 + + ++ A + AL +PD+ Sbjct: 230 QQLVKIRDFLSRQFGKEPSKAGQNIEYLAQHELFAQIPALKEDIS-IPDYCTISNEDTPG 288 Query: 112 ---MISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH----------- 156 + ++ P G V P H D + Q G +R + PH Sbjct: 289 AVDIKAWLGPAGTVSPMHYDPKHNLLCQVFGSKRIILAAPADTDNLYPHDSEFLANTARI 348 Query: 157 ------PDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 P+ + L+PGD LY+PP + H + + + S + Sbjct: 349 DAAQLDPETYPLVAKVKFYQLLLQPGDCLYMPPKWWHYVRSEAPSFSVSFWW 400 >UniRef50_Q17765 Protein C06H2.3, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=Q17765_CAEEL Length = 578 Score = 117 bits (294), Expect = 5e-25, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 70/212 (33%), Gaps = 36/212 (16%) Query: 21 PVVLKR------GFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHL 74 P++++R + P EL E+ + D W F+ + Sbjct: 372 PLIVRRHSSNMPAIEKWSFPFLLQELHSRTFPVEIG---TKYSDENWSQKLMTFKEFIRN 428 Query: 75 GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL-------MISFSVPGGGVGP-HL 126 E L Q + + L R +PD + M + P V P H Sbjct: 429 SENERLYLAQ--HRLFDQVPHLKRDV-IIPDVCFGESSNPENVDMNMWIGPQDTVSPLHT 485 Query: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCP--------------HPDLLQVDPFEAI--ID 170 D +Q G + +R+ + P +PDL FE + +D Sbjct: 486 DPRKNMFVQVHGTKLFRMVAPESSESVYPFDGILSNTSQVDVENPDLKIFPNFEQVEVLD 545 Query: 171 EELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + PGD ++IP + H + +++ S F Sbjct: 546 AVINPGDAIFIPEKWWHFVRSTSPSISISFWF 577 >UniRef50_UPI000180C0C1 PREDICTED: similar to reserved n=1 Tax=Ciona intestinalis RepID=UPI000180C0C1 Length = 468 Score = 116 bits (292), Expect = 9e-25, Method: Composition-based stats. 
Identities = 48/261 (18%), Positives = 92/261 (35%), Gaps = 55/261 (21%) Query: 13 LERHW-QKR---PVVLKRGFNNFI---DPISPDELAGLAMESEVDSRLVSHQ------DG 59 L+++ +++ PVV + F + + L G+ + ++ +R+ Q +G Sbjct: 6 LKKYLSEQKLELPVVFRNVNQVFEWKVSSLDLNRLQGMIGDKKIKARVGQKQSENLQWEG 65 Query: 60 KWQVSHGPFESYDHLGETNW---SLLVQAVNHWH-------EPTAALMRPFR--ELPDWR 107 + + + + L +N S + N W ++ F + W+ Sbjct: 66 ECRYINITISDFIQLINSNNDQNSFNIDLNNEWVYADYIYVRDLFNEVQDFEVLDFIQWK 125 Query: 108 I--------DDLMISFSVPGGGVGPHLDQYD-VFIIQGTGRRRWRVGEKLQMKQHCP--- 155 D I G H D Y ++Q GR+RW + + P Sbjct: 126 DLGFSGTEGTDSAIWIGTQGAHTVCHYDTYGYNLVLQVQGRKRWMLFPPSDSQHLHPTRI 185 Query: 156 --------------HPDLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYS 199 HPDL + + F LEPGD+LY+P + H LE ++ S Sbjct: 186 PYEESSVFSKVDLQHPDLEEHESFTSCHPHVITLEPGDMLYVPQQWWHYVENLETSI--S 243 Query: 200 VGFRAPNTRELISGFADYVLQ 220 V P+ + ++ + + Sbjct: 244 VNAWFPSDEDDLTRVKEAASK 264 >UniRef50_Q6AXL5 HSPB1-associated protein 1 homolog n=3 Tax=Clupeocephala RepID=HBAP1_DANRE Length = 449 Score = 116 bits (292), Expect = 9e-25, Method: Composition-based stats. 
Identities = 42/315 (13%), Positives = 93/315 (29%), Gaps = 60/315 (19%) Query: 6 TLNWPDFLERHWQ--KRPVVLKRGFNNFIDP-ISPDELAGLAMESEVDSRLVSHQDG--- 59 + R Q ++P V ++ + + L+ + + R+ + Sbjct: 5 PFTPEE-ARRIVQVLQKPAVFLNMTTDWPALHWTVEHLSACLTKR-IRFRVGKRSEDMAP 62 Query: 60 ----KWQVSHGPFESYDHLGE-TNWSLLVQAVNHWHEPTAA------LMRPFRELPDWRI 108 + + + L+ +++ + A + + F++ P Sbjct: 63 LFETECSYVEATIKEFLSWTANDGEPLVGPFLDYHCKEFWAYADYKYIAQLFQDKPAMFQ 122 Query: 109 DDLMISFSVPG--------------GGVGPHLDQY-DVFIIQGTGRRRWRVGEKLQMKQH 153 D + F PG HLD Y + Q GR+RW + Sbjct: 123 DVVWSDFGFPGRDGRDSTLWIGTQCANTPCHLDSYGCNLVFQIQGRKRWHLFPPDDTACL 182 Query: 154 CP-----------------HPDLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALEN 194 P PDL + + + L+PG +L++P + H +++ Sbjct: 183 YPTRVPYEESSVFSHVNVIRPDLKKFPAYGRARLYTVTLQPGQVLFVPRHWWHYVESVDP 242 Query: 195 AMNYSVGFRAPNTRELISGFADYVLQ------RELGGNYYSDPDVPPRAHPADVLPQEMD 248 + SV + + A+ + + + SD + P + M Sbjct: 243 -VTVSVNSWIEMDMDDEARVAEALTKTIVCAVKSSPSLDNSDQWLNPTEDGVSSHDENMQ 301 Query: 249 KLREMMLELINQPEH 263 L + +N+ Sbjct: 302 YLNLAVKVCMNKKRD 316 >UniRef50_UPI0001925EF6 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001925EF6 Length = 519 Score = 116 bits (292), Expect = 1e-24, Method: Composition-based stats. 
Identities = 50/348 (14%), Positives = 110/348 (31%), Gaps = 62/348 (17%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESE-VDSRLVSHQDGKWQVSHGP-F 68 FL + QK PVV+ N++ I+ L +A E +++ L ++ + +S Sbjct: 18 AFL--YSQKEPVVITNMINDW-SLINW-TLDQIATEFGSIETSLKIYKRKSYNLSSITNT 73 Query: 69 ESYDHLGETNWSL----LVQAVNHWHEPTAALMRPFRELPD---WRIDDLMI-------- 113 S+ ++ L L++ W ++ P W D Sbjct: 74 NSHPNVPTETDCLYIKCLLKEFIFWITEKKDMIGKLAAFPYSEYWGYADYNYMFEFFKDF 133 Query: 114 -----------------------SFSVPGGGVGP-HLDQYD-VFIIQGTGRRRWRVGEKL 148 + G P H D Y + Q G + W + Sbjct: 134 SYVLNEVNWGKFGYPNRNGFHSTIWFGSKGSFTPCHQDAYGTNLVAQILGIKEWILFSPH 193 Query: 149 QMK----QHCPHPDLLQVDPFEAI------IDEELEPGDILYIPPGFPHEGYALENAMNY 198 + P+ + + I L PG++LY+P + H N++ Sbjct: 194 ASELMQATRIPYEESTIFSKVDVKSYLQYGIKVRLLPGEVLYVPKHWWHYVENETNSI-- 251 Query: 199 SVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRA-HPADVLPQEMDKLREMMLEL 257 S+ + + +++ + D P R +P + +P+ +++ Sbjct: 252 SINTWLEMETDSYDRICEALVKILVFSMKKYDGTSPDRWLNPNEEVPKNFSDCLQLLKSA 311 Query: 258 INQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR 305 +N+ F +++ + ++ ++I G+ + R Sbjct: 312 LNENYEVNACLSTFHTRAVVDDQHGSL---HKLEDIVPDTSTGQDINR 356 >UniRef50_B4LGI6 GJ13228 n=3 Tax=Drosophila RepID=B4LGI6_DROVI Length = 409 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. 
Identities = 33/221 (14%), Positives = 63/221 (28%), Gaps = 34/221 (15%) Query: 16 HWQKRPVVLKRGFNNFIDPISPDELAGL---AMESEVDSRLVS-HQDGKWQVSHGPFESY 71 + +P +L N++ +L L A V + S + +W + Sbjct: 188 YQALQPTLLLNTINHWPALSKWRDLNYLLKVAGNRTVPIEIGSNYASDEWSQQLVKLRVF 247 Query: 72 DHL------GETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMIS------FSVPG 119 H G + + A + AL + + + P Sbjct: 248 LHRQFGPSNGRADHEIEYLAQHELFAQIPALKADICVPDYCTVSSNNAAGVDIKAWLGPS 307 Query: 120 GGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------PDLLQVD 163 + P H D + Q G + + H PD + Sbjct: 308 HTISPMHYDPKHNLLCQVFGCKSIILASPEDTANLYAHESEFLNNTSQIDAAKPDFERFP 367 Query: 164 PFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + + L+PGD LY+PP + H + + + S + Sbjct: 368 LLRRVRFYELLLQPGDCLYLPPKWWHYVRSETPSFSVSFWW 408 >UniRef50_UPI0001927319 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001927319 Length = 344 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. Identities = 44/291 (15%), Positives = 92/291 (31%), Gaps = 50/291 (17%) Query: 7 LNWPDFLERHWQKR-PVVLKRGFNNFIDP-ISPDELAGLAMESEVDSRLVSHQDG----- 59 L+ +F + PV+++ + + + A ++V R + Q+ Sbjct: 14 LSVEEFENYYLNSETPVIIENYIKEWPAMKWTLSSIKAKAGHNKVFVRRNTSQEDYKVGK 73 Query: 60 KWQVSHGPFESYDHLGETNW------SLLVQAVNHWHEPTAALM-------RPFRELPDW 106 K+ + F Y E N L VQ + A + + W Sbjct: 74 KYNIESMTFNEYVENIEANNKKAQSSYLAVQNIKIALPELANDICIPSYVKKLHGGPFLW 133 Query: 107 RIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHP--------- 157 F H D D F+I +G + R+ ++ P+P Sbjct: 134 LARKGHYEFC--------HFDPDDNFLIVFSGEKHVRLYRANDLENLYPNPFGSNGRTIQ 185 Query: 158 --------DLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNT 207 D + F + + L+PG++LY P + H+ + + ++ ++ F T Sbjct: 186 SQVNCDNPDFNKFPNFRNVQFFECILKPGEMLYFPAFWWHQVTSTDTTISMNIFFGNDGT 245 Query: 208 RELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELI 258 IS + Y ++ + + ++ L E + + Sbjct: 246 NTYISKIM---SGNQWLSFKYWILNIIEQNRHLESFRSVLEYLPESLTSFL 293 >UniRef50_A2RUC4 JmjC domain-containing protein C2orf60 n=26 Tax=Metazoa 
RepID=CB060_HUMAN Length = 315 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 47/313 (15%), Positives = 101/313 (32%), Gaps = 60/313 (19%) Query: 1 MEYQLTLNWPDFLER-HWQKRPVVLKRGFNNF-IDPISPDELAGLAMESEVDSRLV---- 54 + ++ F++ + Q++P+VL+ + D L+ + + EV + Sbjct: 8 VPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKKEVKIHVAAVAQ 67 Query: 55 -SHQDGKWQVSHGPFE------------SYDHLGETNWSLL---------VQAVNHWHEP 92 + PF+ + + + L V + Sbjct: 68 MDFISKNFVYRTLPFDQLVQRAAEEKHKEFFVSEDEKYYLRSLGEDPRKDVADIRKQFPL 127 Query: 93 TAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQ 152 ++ + + + S PG + H D D +IQ TG++R + + Sbjct: 128 LKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDVMDNLLIQVTGKKRVVLFSPRDAQY 187 Query: 153 HC-----------PHPDLLQVDPFEAI--IDEELEPGDILYIPPGFPHEGYALENAMNYS 199 +PDL + F + LE GD+L+IP + H + E + + Sbjct: 188 LYLKGTKSEVLNIDNPDLAKYPLFSKARRYECSLEAGDVLFIPALWFHNVISEEFGVGVN 247 Query: 200 VGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELIN 259 + F E + Y + D + A +L + + L E+ Sbjct: 248 I-FWKHLPSECYD-----------KTDTYGNKDPTAASRAAQILDRALKTLAEL------ 289 Query: 260 QPEHFKQWFGEFI 272 PE ++ ++ + Sbjct: 290 -PEEYRDFYARRM 301 >UniRef50_B3SDY7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3SDY7_TRIAD Length = 329 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. 
Identities = 32/206 (15%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 78 NWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD-LMISFSVPGGGVGPHLDQY-DVFIIQ 135 ++ + + + + + R D G H D Y + Q Sbjct: 82 DYQYIALTFQAY-PEIPQSIDWSKFGLNGRKGDQSTFWMGSKGASTPCHYDSYGCNLVAQ 140 Query: 136 GTGRRRWRVGEKLQMKQHCP-----------------HPDLLQVDPFE--AIIDEELEPG 176 GR++W + + + P P+L+ F I + LEPG Sbjct: 141 LYGRKKWLLVAPDESQYMYPIRVPYEESSIFSAVNMKSPNLVSYPKFANVTIYEVILEPG 200 Query: 177 DILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPR 236 D+L++P + H+ LE A++ + P+ + + + V + + ++ + + Sbjct: 201 DVLFVPKYWWHDVECLETAISVNTWIALPS--DGMDRLCEAVTKTVVFALKSAEGNKITQ 258 Query: 237 A-HPADVL---PQEMDKLREMMLELI 258 +P++V + L+ + +L+ Sbjct: 259 WLNPSEVPTSYQTNIGYLKNSLEQLM 284 >UniRef50_A9V2P6 Predicted protein n=3 Tax=Monosiga brevicollis RepID=A9V2P6_MONBE Length = 1934 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 47/319 (14%), Positives = 98/319 (30%), Gaps = 56/319 (17%) Query: 1 MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDEL-AGLAMESEVDSRLVSHQD- 58 M L N F+ + +R + + ++ D L A E++++ L + + Sbjct: 25 MAKLLGSNSTHFVREIYGQRHAIFPAPLAPTDNWLARDPLLNSFAAEADLNRSLATLVNL 84 Query: 59 ----------GKWQVSHGPFES----------YDHLGETNWSLLVQAVNHWHEPTAA--L 96 ++ P ++ YD+L + +S++++ + L Sbjct: 85 YGRSQDAIAFRDRHAANVPLDTLLERVSLEHLYDYLVQHGYSVVIR--EEYFPTYTQTSL 142 Query: 97 MRPFRELPDWRIDDLMISFS-VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP 155 + + S + PH D YDV ++ G++ W V L Sbjct: 143 EELIQTAFGTSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSPEWA 202 Query: 156 HPDLLQV-DPFE--------------------AIIDEELEPGDILYIPPGFPHEGYALEN 194 Q+ FE L PGD+LY+P H N Sbjct: 203 DASQAQLAQLFEIQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTSPN 262 Query: 195 A----MNYSVGFRAPNTRELISGFADYV---LQRELGGNYYSDPDVPPRAHPADVLPQEM 247 + S+ + + +L + L+ G + + P P + + + Sbjct: 263 TTTAHITLSLERQGQSWIDLACTVIESTLASLETSPSGLRWLELATFPTDEPCQMAQKRL 322 Query: 248 DKL-REMMLELINQPEHFK 265 ++ + L+ Q Sbjct: 323 LGFEKDSLRTLVAQDPQLS 341 >UniRef50_UPI00015B5A68 PREDICTED: 
hypothetical protein n=1 Tax=Nasonia vitripennis RepID=UPI00015B5A68 Length = 409 Score = 113 bits (284), Expect = 8e-24, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 86/247 (34%), Gaps = 47/247 (19%) Query: 19 KRPVVLKR------GFNNFIDP-ISPDELAGLAMESEVDSRL---VSHQDGKWQVS---- 64 + PV+ K G NN+ S ++ A + ++ R+ + + +W+V Sbjct: 20 QEPVLFKNILCTTEGSNNWKLIHWSLEDFANKSGNIKLPFRVGKNIRTDEPQWEVETPIE 79 Query: 65 HGPFESY------DHLGETNWSLLVQAVNHWHEPTAALMRPF---RELPDWRIDDLMISF 115 + + + + E + + +N W + +++ F + D + D I Sbjct: 80 YKTMKEFLNNVTENSNPEKWFYFDYKRMNEWFKDIPEIVKSFDWHQFGIDLDVSDSTIWI 139 Query: 116 SVPGGGVGPHLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCP----------HPDLLQVDP 164 G H D Y + Q GR+ W + P + P Sbjct: 140 GSKGAHTNCHQDTYGCNLVAQIQGRKLWLLFSPECGDLMQPTRIPYEESTVYSKYNFFAP 199 Query: 165 FEAIID-----------EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISG 213 + I+ LEP D+L+IP G+ H +L+ ++ SV P + S Sbjct: 200 SKQEIEAIKNMPGSVKMVTLEPKDLLFIPKGWWHYVESLD--ISLSVNVWLPLKEDCESR 257 Query: 214 FADYVLQ 220 + ++ Sbjct: 258 LKETLVH 264 >UniRef50_A2SGT4 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SGT4_METPP Length = 353 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. 
Identities = 49/233 (21%), Positives = 90/233 (38%), Gaps = 32/233 (13%) Query: 1 MEYQLTLNWPDFLERHW-QKRPVVLKRGFNNFIDP--ISPDELAGLAMESEVDSRLVSHQ 57 +E + ++ +F ER+ RP+VL ++ SP +L +V+ + Sbjct: 103 VEKRSHVSPAEFFERYVVGSRPLVLTDVAGDWPALHRWSPADLRERFGHLDVEIQAERAV 162 Query: 58 DGKWQVSHGPFESYDHLGE-----------TNWSLLVQAVNHWHEPTAALMRPFRELP-- 104 + K++ LG+ ++ L A L+ LP Sbjct: 163 NPKYEQDKLKHRHNVRLGDFVDRVLAGGATNDYYLTANNEILRRPEFAPLLADIGTLPLF 222 Query: 105 --DWRIDDLMISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHC------- 154 ++ + P G V P H D + Q GR+RWR L+ + Sbjct: 223 CDPAQLAQRSSFWFGPAGTVTPLHHDTLMLLHTQVVGRKRWRFISPLETPRLYNHDGVFS 282 Query: 155 ----PHPDLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 HPDL + F +++ LEPGD +++P G+ H+ +LE ++++S Sbjct: 283 AIDLDHPDLDRYPAFRDVKVLEVVLEPGDTVFLPLGWWHQVASLEVSLSFSFS 335 >UniRef50_C7J1Y3 Os04g0659150 protein n=4 Tax=Poaceae RepID=C7J1Y3_ORYSJ Length = 430 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 88/286 (30%), Gaps = 90/286 (31%) Query: 8 NWPDFLERHWQKRPVVLKRGFNN------FIDPIS------------------------- 36 ++ +FL +W+K ++ R N F ++ Sbjct: 151 DYENFLLNYWEKSTYLVTRKQKNLHVDSVFTSLLNEFDLKTPDTIIQSLVNGIVSCPAIA 210 Query: 37 PDELA------------GLAMESEVDSRLVSHQDG-------------------KWQVSH 65 DEL G ++ D R+V D +Q + Sbjct: 211 SDELDISSFLREVQGSLGATVKYRQDIRVVRINDQCDQTSIGYAMEEHFFDDGMTFQDAD 270 Query: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWR-IDDLMISFSVPGGGVGP 124 E + +S+ ++ + E AA+ +L + + G+ Sbjct: 271 AFVEKCKDAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLAR 330 Query: 125 HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFEAIIDEE---------LEP 175 H D + V + Q G ++W + K P +PFE + D L Sbjct: 331 HYDDHCVLVWQLLGCKKWMIWP--DTKLLLP----RLYEPFEPLDDLVDDCGGRMEILLE 384 Query: 176 GDILYIPPGFPHEGY------------ALENAMNYSVGFRAPNTRE 209 GDI+Y+P GF HE + ++ +++ ++ E Sbjct: 385 GDIMYVPRGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAIEVEPPFE 430 >UniRef50_UPI0000DB7045 PREDICTED: similar to Hspb associated protein 1 n=1 Tax=Apis mellifera 
RepID=UPI0000DB7045 Length = 310 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 80/236 (33%), Gaps = 52/236 (22%) Query: 19 KRPVVLKRGFNNFID-----PISPD--ELAGLAMESEVDSRL---VSHQDGKWQVSHGPF 68 K PV+ +R N D + ELA + ++ R+ + +W+V+ Sbjct: 20 KEPVIFQRLLQNAKDDYCWKLFEWNLSELAEKFGDIKLPFRVGYNARSMNPQWEVNCPTV 79 Query: 69 ESYDHLGETNWSLL--VQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126 +LL +Q +N + D DD I G H Sbjct: 80 LM---------TLLEFIQNMNFH-------ENHKKFDIDKTGDDSTIWIGSKGAHTNCHQ 123 Query: 127 DQY-DVFIIQGTGRRRWRVGEKLQMKQHCP-------HPDLLQVDPFEAIID-------- 170 D Y + Q GR++W + P + + F + Sbjct: 124 DSYGCNLVAQIHGRKQWLLFPPNSTNFLRPTRIPYEESTIYSKYNFFCPTKEDEINILKI 183 Query: 171 ------EELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQ 220 LEPGDIL++PPG+ H +L+ + SV P + IS + +++ Sbjct: 184 KDTAKLVTLEPGDILFVPPGWWHYVESLD--FSISVNMWLPILTDNISRVKEAIVK 237 >UniRef50_A9UR02 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UR02_MONBE Length = 217 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 60/218 (27%), Gaps = 36/218 (16%) Query: 20 RPVVLKRGFNNFIDPISPD---ELAGLAMESEVDSRLVS-HQDGKWQVSHGPFESY---- 71 RP + N+ LA + + ++ W + Sbjct: 1 RPAIFAGAVGNWPAVRRWQSRSYFDRLAGQRTIPVEWGRDYRGDGWSQRLMTLTDFLTAV 60 Query: 72 --------DHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL--MISFSVPGGG 121 + + A + + L + ++ P G Sbjct: 61 FDTPIAPSPKRPKHEA-VGYLAQHPLFDQVPELRDDIVVPDYCYCAQSLRINAWFGPQGT 119 Query: 122 VGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP----------------HPDLLQVDP 164 V P H D D + Q G + R+ E Q P P + Sbjct: 120 VSPCHQDPDDNLLAQVVGYKYVRLFEPRAATQLYPCEGLLSNTSQANVVAPDPAAFPLVQ 179 Query: 165 FEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGF 202 + L PGD+L+IP G+ H +L + + S+ F Sbjct: 180 DVPCWEAILGPGDLLFIPQGWWHYVQSLSTSFSVSMWF 217 >UniRef50_A9TBQ2 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9TBQ2_PHYPA Length = 992 Score = 111 bits (279), Expect = 3e-23, Method: Composition-based stats. 
Identities = 40/244 (16%), Positives = 67/244 (27%), Gaps = 46/244 (18%) Query: 8 NWPDFLERHWQKRPVVLK-----RGFNNFI----------------------DPISPDEL 40 + F ++HW+ P + + F P DEL Sbjct: 387 SMDHFFQKHWELSPAQMTIAPEINALSRFFEKIAGYSPVGLLERLVDTVTACPPAVADEL 446 Query: 41 A------------GLAMESEVDSRLVSHQDG-----KWQVSHGPFESYDHLGETNWSLLV 83 G D RL+ + G S + + +++++ Sbjct: 447 DLTILLKDMEHDLGCLPVYNQDIRLLKCKGGVEVSYPISTSSVSSKDCIQAYISGYTVVL 506 Query: 84 QAVNHWHEPTAALMRPFRELPDW-RIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRW 142 + + AL + + G+ H D + VF+ Q GR+ W Sbjct: 507 RGLQFRFPEICALSNGLAAELGQVTVGANLYLTPPGSQGLRVHFDDHCVFVCQLRGRKGW 566 Query: 143 RVGEKLQMKQH-CPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVG 201 V L+ L + +L+ D LYIP GF HE V Sbjct: 567 DVYPPLEQLPRLYSFKTLSTEVTKDYATHFDLQEWDTLYIPRGFLHEARTECPEQTIEVQ 626 Query: 202 FRAP 205 Sbjct: 627 IDRH 630 >UniRef50_C5AHL8 JmjC domain protein n=2 Tax=Burkholderia RepID=C5AHL8_BURGB Length = 296 Score = 110 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 74/269 (27%), Gaps = 47/269 (17%) Query: 20 RPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSH---------------QDGKWQVS 64 RP VL+ +++ +A D + + + + Sbjct: 29 RPAVLQGFIDDWPALARWTPEFFVAQHGGHDITVETSSLCPTPTRPDLYLASRRYEKAPL 88 Query: 65 HGPFESYDHLGETNWSLLVQA-VNHWHEPTAALMRPFRE---LPDWRIDDLM-------- 112 G + + A + + E P W D L Sbjct: 89 GKTIREMQSQGAARTAYITYAEIYEAIPSLREDITLLHERYGFPRWLPDGLRRRLILRPG 148 Query: 113 ISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---------------- 156 G H D+++ +Q GR+RW + Q Q Sbjct: 149 FWLGPEGISSPLHFDRHENLNVQVYGRKRWVLFGPGQSHQVYYRQRRDLPVIFSPVDMTR 208 Query: 157 PDLLQVDP--FEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGF 214 PDL D LE G++LY+PPG+ H +L +++N + + +P + Sbjct: 209 PDLDAFPRLGDAQRHDFVLEAGEVLYLPPGWWHFVTSLSDSINVNYWWWSPRALRTWARV 268 Query: 215 ADYVLQRELGGNYYSDPDVPPRAHPADVL 243 + D P + Sbjct: 269 --ELASLAQALARRFDRGTDATGKPTSMP 295 >UniRef50_A9V7C3 Predicted protein n=3 Tax=Monosiga brevicollis RepID=A9V7C3_MONBE Length = 3197 Score = 109 bits (273), Expect = 
2e-22, Method: Composition-based stats. Identities = 44/322 (13%), Positives = 93/322 (28%), Gaps = 71/322 (22%) Query: 11 DFLERHWQKRPVVLKRGFNNFIDPISPD--------------ELAGLAMESEVDSRLVSH 56 F+ + +R + + ++ D LA L ++ Sbjct: 35 HFVREIYGQRHAIFPAPLAPAANWLARDPLLDSFKAGADLNRTLATLVNLYGRSQDAIAF 94 Query: 57 QDGKWQVSHGPFES----------YDHLGETNWSLLVQAVNHWHEPTAA--LMRPFRELP 104 +D + P ++ YD+L + ++S++++ + L + Sbjct: 95 RD--RHAADVPLDTLLERVSLEYLYDYLVQHSYSVVIR--EEYFPGYTQTSLEELIQTAF 150 Query: 105 DWRIDDLMISFS-VPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQV- 162 + S + PH D YDV ++ G++ W V L Q+ Sbjct: 151 GTSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSTEWADASQAQLA 210 Query: 163 DPFE--------------------AIIDEELEPGDILYIPPGFPHEGYALENA----MNY 198 FE L PGD+LY+P H N + Sbjct: 211 QLFEMQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTSPNTTTAHITL 270 Query: 199 SVGFRAPNTRELISGFADYV--------------LQRELGGNYYSDPDVPPRAHPADVLP 244 S+ + + +L+ + L+ G + + P P + Sbjct: 271 SLERQGQSWIDLVRRACAVLDNQACTVIESTLASLETSPSGLRWLELATFPTDEPCQMAQ 330 Query: 245 QEMDKL-REMMLELINQPEHFK 265 + + ++ + L+ Q Sbjct: 331 KRLLGFEKDSLRTLVAQDPQLD 352 >UniRef50_B5S2S3 Putative uncharacterized protein n=1 Tax=Ralstonia solanacearum MolK2 RepID=B5S2S3_RALSO Length = 329 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. 
Identities = 35/234 (14%), Positives = 75/234 (32%), Gaps = 39/234 (16%) Query: 1 MEYQLTLNWPDFLERHWQKR-PVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 + +L+ F E ++ + PV+++ + + ++ +V++QD Sbjct: 86 VPRARSLSSEAFHENYYSRNLPVLIEDAAHAWPALTKW---TNAYLKENYGHCIVTYQDR 142 Query: 60 KWQVSH--------------GPFESYDHLGETNWSLLVQAVNHWH--EPTAALMRPFREL 103 H E ++ GE+N L+ + A L+ Sbjct: 143 GKPSDHRHSFIDHSTQIAFSKYIERVENSGESNACYLIAH-DRLLDRPEFAPLLDDIAFD 201 Query: 104 -----PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRW---------RVGEK-- 147 P + + G H D +VF++Q GR+R +V Sbjct: 202 ERYLDPIGPVGKVFFWLGPKGAKTPLHRDLGNVFLVQVRGRKRVNFIPALEMHKVYNSFG 261 Query: 148 LQMKQHCPHPDLLQVDPFEAII--DEELEPGDILYIPPGFPHEGYALENAMNYS 199 D + + G++L+IP G+ H A++ ++ + Sbjct: 262 YHSDLDLDDYDPKKFPRMAKAHVSTTIVSSGEMLFIPVGWWHHVVAIDECISIT 315 >UniRef50_UPI00006A359E PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI00006A359E Length = 398 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 84/247 (34%), Gaps = 29/247 (11%) Query: 7 LNWPDFLERHWQK-RPVVLKRGFNNFIDPISP--DELAGLAMESEVDSRLVSHQDGKWQV 63 + +F + K +P+++K+ + L + V+ ++ Sbjct: 50 ITSLEFYNNYVAKNKPLLIKQVLQRSTPVLKWTDQYLKEKFGKLRVNVDNNKQENRLIPS 109 Query: 64 SHGPFESYDHLGETNWS-LLVQAVN---HWHEPTAALMRPFRELPDWRIDDLMISFSVPG 119 S F+ + + + + ++ +N P ++ + R D ++ FS Sbjct: 110 SEMEFQQFLSIYLESKTHYMISTMNMEMQKEFPLPTVINCDGFV--SRFQDFVMWFSGGN 167 Query: 120 GGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMK--------------QHCPHPDLLQVDPF 165 H D + Q +G + W + + + + D+L+ Sbjct: 168 TRSKLHYDNVENMYCQISGTKHWFIVDPADAEGHIVIDHPEGAFSGVNVTSVDMLKYPGM 227 Query: 166 EAIIDEE--LEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQREL 223 + + L PGD +Y+P + H+ + + N ++ ++ ++ QR+ Sbjct: 228 QGLQWWSANLTPGDCIYMPLNWWHQVTS-PPSRNMAINIW---WAPVLEEVVNFPCQRQH 283 Query: 224 GGNYYSD 230 G SD Sbjct: 284 VGATLSD 290 >UniRef50_Q4DVQ0 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4DVQ0_TRYCR Length = 1155 Score = 108 bits (269), Expect = 4e-22, Method: 
Composition-based stats. Identities = 58/353 (16%), Positives = 106/353 (30%), Gaps = 70/353 (19%) Query: 6 TLNWPDFLERHWQ-KRPVVLK-----RGFNNFIDP--ISPDELAGLAMES-EVDSRLVSH 56 + FL Q RPVV + R + + +P + E + ++ L+ Sbjct: 826 PFSKEAFLALVRQPNRPVVFRKVNMGRCVDLWANPTYLKEAEKNTIVSVHVARETYLLDF 885 Query: 57 QDGKWQVSHGPFESYDH-----------LGETNWSLLVQAVNHW------HEPTAALMRP 99 + H F S G W L A N + L Sbjct: 886 VKKNFTFRHVSFGSLVDHCVKAEENPHAAGSEAWYLRAVAPNMKTERANVWKDFPKLGGD 945 Query: 100 FRELP------DWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQH 153 F P + R+ + + P + H D D + Q G++R + + Sbjct: 946 FVLPPAVAEHVESRMHQACLRLNAPPLQLWTHYDTLDNVLCQVVGKKRVVLFPPSEYNNL 1005 Query: 154 C-----------PHPDLLQVDPFE----AIIDEELEPGDILYIPPGFPHEGYALENAMNY 198 PDL + F + LEPGD+L++P + H +E + + Sbjct: 1006 YMSGSSSAVLNIDAPDLGRFPRFADACRHATEVVLEPGDMLFLPSLWFHHITTMEGSYSI 1065 Query: 199 SVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMMLELI 258 SV F + + Y + D+P A ++ E + +L+ Sbjct: 1066 SVNV-------FFERFPH---EDYDKKDLYGNKDLPAAAR-------LRKRIVEQVQQLV 1108 Query: 259 NQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRV 311 ++P + + D ++ + V R G R+ Sbjct: 1109 SEP-----FIEKTSDGKPLPPDFVEFALRQAIQDLEEISDNMAVAHR-GSKRL 1155 >UniRef50_Q55DF5 JmjC domain-containing protein D n=1 Tax=Dictyostelium discoideum RepID=JMJCD_DICDI Length = 448 Score = 107 bits (268), Expect = 6e-22, Method: Composition-based stats. 
Identities = 42/244 (17%), Positives = 81/244 (33%), Gaps = 58/244 (23%) Query: 8 NWPDFLERHWQK-RPVVLKRGFNNFI--DPISPDELAGL---AMESEVDSRL-VSHQDGK 60 + +F + K P V++ + + + +L L A V + ++ K Sbjct: 208 SLNEFKNEYMIKGNPCVIENLMKEWPCFNERNWSDLNYLKNVAGSRLVPIEIGPNYLHEK 267 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMR--PFRELPDWRIDDLM------ 112 + F + ++ + + ++ L + F ++P R D L+ Sbjct: 268 MKQKLINFNKFIDEY-----IISKNSDDDNDDIGYLAQTKLFEQIPQLRNDILIPEYCKI 322 Query: 113 -------------------ISFSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQ 152 ++ P G V P H D F+ Q GR+ ++ + Sbjct: 323 KIGCGDDDNDNNKEDNVEINAWLGPKGTVTPLHYDPKHNFLCQIVGRKYIKLFSPKESNN 382 Query: 153 HCPH----------------PDLLQVDPFE--AIIDEELEPGDILYIPPGFPHEGYALEN 194 PH PD + F+ I+ L G+ILYIPP + H +L Sbjct: 383 LYPHLNSKLFFNTSMVDVENPDHSKFPLFKNCDYIELILNAGEILYIPPTYWHFVKSLSQ 442 Query: 195 AMNY 198 + + Sbjct: 443 SFSI 446 >UniRef50_UPI0000D567FA PREDICTED: similar to reserved n=1 Tax=Tribolium castaneum RepID=UPI0000D567FA Length = 372 Score = 107 bits (268), Expect = 6e-22, Method: Composition-based stats. 
Identities = 48/334 (14%), Positives = 109/334 (32%), Gaps = 60/334 (17%) Query: 13 LERH--WQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQ----VSHG 66 + W ++K +++ + + +EL + Q+ +W+ + Sbjct: 18 FRNYVNWG----LVKWKLDDWSNLLKNEELVFRCGKKA------FTQEPQWERATRIKKT 67 Query: 67 PFESYDHLGETN----WSLLVQAVNHWHEPTAALMRPFRE----LPDWRIDDLMISFSVP 118 F+ + E++ + + W + T L + P+ +D I Sbjct: 68 TFKEFISFTESDNNSWMYFDYKYLKDWFKNTKELKKEVNWSLFGFPELSSEDSTIWIGSA 127 Query: 119 GGGVGPHLDQY-DVFIIQGTGRRRWRVGEKLQMK-----QHCPHPDLLQVDPFEAII--- 169 G H+D Y ++Q GR++W + + + +++ F +I Sbjct: 128 GAHTPCHIDTYGCNIVVQIHGRKQWILFPPDENLKPTRIPYEESSIYSKLNFFSPMITDF 187 Query: 170 -------DEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRE 222 LEPGD L++P + H L+ A++ +V P + F + ++Q Sbjct: 188 DGVGNCRRVVLEPGDALFVPHKWWHYVENLDTAISINV--WLPLPEDHEERFREALVQFF 245 Query: 223 LGG-----NYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRH 277 + + + + ++ +D +++ + K Q R Sbjct: 246 ITQVTQNLDLETKKAILNPNSDQEIGDVTLDASLDIVNKC-------KDILTRTSIQKRP 298 Query: 278 ------ELDIAPPEPPYQPDEIYDALKQGEVLVR 305 L P P E + L+ + R Sbjct: 299 IFDENKCLPFVEKIPVLSPQEFANFLQDQKSRFR 332 >UniRef50_A9V0X5 Predicted protein n=2 Tax=Monosiga brevicollis RepID=A9V0X5_MONBE Length = 2283 Score = 106 bits (266), Expect = 9e-22, Method: Composition-based stats. 
Identities = 44/283 (15%), Positives = 77/283 (27%), Gaps = 43/283 (15%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPI-----------SPDELAGLAMESEV--DSRLVS 55 DF ++ R N D I S + ++ + +V Sbjct: 42 VKDFFLNTFETRVHHWTNHRANAADFIAQLNTIEPLVESLTSMDDFIERFDLWGEDIMVR 101 Query: 56 HQDGKWQVSHGPFESYDH-----LGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDD 110 G+ E+ D L + S +++ + L R L + Sbjct: 102 TNGGEISDILTTAETVDEPFLRGLLDDGNSFVIKT-ELHQSHGSDLERQLTTLFATTVAV 160 Query: 111 LMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKL-------------QMKQHCPHP 157 + PH D YDV +IQ G ++W + Q+++ H Sbjct: 161 HGYVTPTGAQALKPHTDPYDVLVIQTYGEKQWTICTPQPAGAQNRTDAEKAQLQEIVRHS 220 Query: 158 DLLQVDPFE------AIIDEELEPGDILYIPPGFPHEGYALE----NAMNYSVGFRAPNT 207 L+ GD+LY+P G H E + S+ + Sbjct: 221 IQGCTQYEAWQLAKMECQAITLKAGDVLYLPKGIIHYATTTESMGSTHITLSLERLTHSW 280 Query: 208 RELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKL 250 L L R Y D + P + + + Sbjct: 281 LALFGRACGLGLDRATCQQ-YEDVLLTASLTPNGLAWLNLAAM 322 >UniRef50_A9VDC2 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VDC2_MONBE Length = 2266 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 22/164 (13%) Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQ 128 ++ +L S +V+ P + L R + L + + PH D Sbjct: 150 DTIQNLLADGNSFVVKT-EATDRPYSILQRQLQALFTTMTNVHAYISPPNAQALAPHTDP 208 Query: 129 YDVFIIQGTGRRRWRVGEKLQ---------MKQHCPHPDLLQVDPFEAIIDE-------- 171 YDVF++Q G++ W + K + + Sbjct: 209 YDVFVVQVYGQKEWTLCTPQPPGGQNLSDAHKAQWQEIAKHNIQGCTNYQEWQLAKMDCQ 268 Query: 172 --ELEPGDILYIPPGFPHEGY--ALENAMNYSVGFRAPNTRELI 211 L PGD+LYIP G H ++ + + +V + ++ Sbjct: 269 HITLLPGDLLYIPKGVIHYATTGSVTGSTHLTVSIERLSHSWMM 312 >UniRef50_UPI000186E75E Hypoxia-inducible factor 1 alpha inhibitor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E75E Length = 306 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. 
Identities = 45/283 (15%), Positives = 96/283 (33%), Gaps = 78/283 (27%) Query: 7 LNWPDFLERHWQKRPV----VLKRGFNNFID--PISPDELAGLAMESEVDSRLVSHQDGK 60 DF+++++Q + + ++KR +N++ D + P+E+ L Sbjct: 62 FKKLDFIKKNFQYKTLAFDELIKRIYNSYNDSYFLDPEEVYYL----------------- 104 Query: 61 WQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL---MISFSV 117 + + P + V ++ L + F P + DD + S Sbjct: 105 RSLGNDP--------------RGKDVANFLSQCKELAKDFNVPPFFNKDDFFSSVFRASS 150 Query: 118 PGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-----------PHPDLLQVDPFE 166 G + H D D +IQ G++R + ++ +PD+ + F Sbjct: 151 QGLQLWTHYDVMDNILIQVQGKKRALLWSPDEVSNMYMIGDKSQVLDVDNPDVEKFPKFS 210 Query: 167 AI--IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELG 224 + + + GDIL+IP + H ALE ++ ++ + + Sbjct: 211 SSCRYECYMNAGDILFIPALWFHNITALEPSLGVNI---------FWKNLPHELYDKHDS 261 Query: 225 GNYYSDPDVPPRAHPADVLPQEMDKLREMMLELINQPEHFKQW 267 Y + D+ P + + L P+ +K + Sbjct: 262 ---YGNKDLLP-------------GFKSSLKSLKTLPDDYKNF 288 >UniRef50_Q6MPD0 Putative RNA methylase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MPD0_BDEBA Length = 419 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. 
Identities = 44/311 (14%), Positives = 92/311 (29%), Gaps = 39/311 (12%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSR-------LVSHQDGKW 61 W +F + HW+K+P+V + + ++ E+ L + R L DG Sbjct: 20 WQNFAKNHWEKKPLVARNVKSGLLEMTDA-EIFELLVAYSDRCREMNDPEGLKFFIDGAK 78 Query: 62 QVSHGPFESYDHLGET--------------NWSLLVQAV-------NHWHEPTAALMRPF 100 E + ++ L+ + H + + Sbjct: 79 ADPEEVLELLPEKSDKSLLGYHKRMNAQFPDYCLVCDELLQVNLKKQHLLQDFTDDLFRH 138 Query: 101 RELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCP-HPDL 159 P+ + + + G H+D VF G +R+R+ +H Sbjct: 139 VGFPNRFSEIGLYLGNYRKTPFGVHVDSCGVFSFPVAGVKRFRLWPAAYGDEHPELDRTF 198 Query: 160 LQVDPFEAIIDEELEPGDILYIPPGFPHEGYALEN-AMNYSVGFRA-PNTRELISGFADY 217 + E+ PGD+ Y P H + + + +S+G ++ Sbjct: 199 NYEKHKKHSQLVEVGPGDMTYWPSSEWHIAESDGSFSATWSLGVWVDQTHGDMFGSALKD 258 Query: 218 VLQRELGGNYY--SDPDVPPRAHPADVLP-----QEMDKLREMMLELINQPEHFKQWFGE 270 ++ +LG + P +V Q+ K + + + K W Sbjct: 259 LVDTKLGSARLKVTTPFKALHDKSGEVGELPKIYQDSLKALKSLSAAELEEAFLKSWMKH 318 Query: 271 FISQSRHELDI 281 Q + + Sbjct: 319 ISLQGFKTVPV 329 >UniRef50_UPI0000588708 PREDICTED: hypothetical protein, partial n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000588708 Length = 279 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. 
Identities = 40/246 (16%), Positives = 75/246 (30%), Gaps = 43/246 (17%) Query: 21 PVVLKRGFNNFIDP-ISPDELAGLAMESEVDSRLVSHQDGKWQVS---HGPFESYDHLGE 76 P V+K ++ ++L+ L + ++ RL E H Sbjct: 13 PTVIKDITKSWPCFRWDVEDLSELLGDDKIRFRLGRKNVEGSDPKVHGLMHSEESAHPTS 72 Query: 77 TNWS--LLVQAVNHWH--EPTAALMRPFRELPDWRIDDLM-------------ISFSVPG 119 S L+ + + + + F+ P D + G Sbjct: 73 EGSSNPLMFYDRSRYWCYADYKHMKQLFKNCPSVLEDVRWRDLGFDRDGGQSTMWIGSEG 132 Query: 120 GGVGPHLDQYD-VFIIQGTGRRRWRVGEKLQMKQHCP-----------------HPDLLQ 161 H D Y + Q GR++W + Q + P PDL Sbjct: 133 ANTPCHQDTYGFNLVAQIRGRKKWHLFPPSQTELMYPTRIPYEESSVFSQVNVRSPDLQH 192 Query: 162 VDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVL 219 F L PGDIL++P + H +L+ ++ S+ + +S + + Sbjct: 193 HPKFGRATPYVAVLHPGDILFVPKSWWHFVESLDTSI--SINCWMDLESDHVSRVDEAIA 250 Query: 220 QRELGG 225 + + G Sbjct: 251 RTLVCG 256 >UniRef50_Q38DD6 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=Q38DD6_9TRYP Length = 1145 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 67/218 (30%), Gaps = 41/218 (18%) Query: 90 HEPTAALMRPFRELPDWRID-------DLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRW 142 L F LP +D + S P + H D D + Q GR+R Sbjct: 931 WRDFQGLKDDFV-LPSATVDHIKPRMHQACLRISAPPLQLWTHYDTLDNVLCQIVGRKRV 989 Query: 143 RVGEKLQMKQHC-----------PHPDLLQVDPF----EAIIDEELEPGDILYIPPGFPH 187 + + PD ++ F ++ E+ PGD+L+IP + H Sbjct: 990 VLFPPSEYNNLYISGSSSAVTNIDKPDYMRFPRFIDASRHALEVEIGPGDMLFIPALWFH 1049 Query: 188 EGYALEN--AMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPD------------- 232 LE +M+ +V F + D +++ Sbjct: 1050 HITTLEGSYSMSVNVFFERLPHADYDKK--DLYGNKDIPAASRLRAGIVKKVKEMISETA 1107 Query: 233 VPPRAHPADVLPQEMD-KLREMMLELINQPEHFKQWFG 269 V + + P ++ LR+ + +L+ + Sbjct: 1108 VERTSDGKALTPDLVEFALRQALQDLMEVADDMCTALR 1145 >UniRef50_A9V428 Predicted protein n=2 Tax=Monosiga brevicollis RepID=A9V428_MONBE Length = 2336 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. 
Identities = 33/243 (13%), Positives = 75/243 (30%), Gaps = 36/243 (14%) Query: 8 NWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGP 67 + FL + + P+ + + + + + + + + Q G H P Sbjct: 56 DADAFLTQLRETEPL-----AQHMASLLDFENIY--VQAAGFPNAIRTTQHGGIDTLHSP 108 Query: 68 FESYDHLGETNWSLLVQA--VNHWHEPTAALMRPFRELPDWRIDDLMISFSVPG--GGVG 123 + ++ + +V + +P+A L ID + ++ P + Sbjct: 109 LTLAEFKNMSDVTSVVYKREFDRKAQPSA-LESLIESELG--IDATVHAYFTPANAQTLE 165 Query: 124 PHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDPFE----------------- 166 PH D YDV ++Q ++ W + + + E Sbjct: 166 PHTDPYDVVVVQVANQKHWTLCLPQTDNATVSLSEADRAQLQEIKRSHLDGCTTYTMSML 225 Query: 167 ---AIIDEELEPGDILYIPPGFPHEGYALE-NAMNYSVGFRAPNTRELISGFADYVLQRE 222 + L GD +Y+P G H + + + ++G + R + + Sbjct: 226 QPMICRNVTLHQGDSMYLPKGVIHYAVTTDTPSAHLTIGL-SRTGRTWLDVLTAQCQRTT 284 Query: 223 LGG 225 L Sbjct: 285 LPS 287 >UniRef50_B1TGS3 Transcription factor jumonji jmjC domain protein n=1 Tax=Burkholderia ambifaria MEX-5 RepID=B1TGS3_9BURK Length = 266 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. 
Identities = 41/238 (17%), Positives = 69/238 (28%), Gaps = 37/238 (15%) Query: 2 EYQLTLNWPDFLERHWQKR-PVVLKRGFNNFIDP--ISPDELAGLAMESEVDSRLV---- 54 E T + +F + + PVVL+ S L A + V Sbjct: 18 ERVTTPSAKEFYRHYVRPGLPVVLRGAALGLGALQYWSSAYLKAAAGKRSVPIEFSPDKE 77 Query: 55 -----SHQDGKWQVSHGPFESYDHLGE--TNWSLLVQAVN--HWHEPTAALMRPFRELPD 105 + G F Y G+ + + + V+ + + P Sbjct: 78 FALPERIGKDRIHSKFGRFVDYLLDGDASSRTTYYLAQVDTLRYLPELVGDIVRPSFAPL 137 Query: 106 WRIDDLMISFSVPGG-GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLLQVDP 164 I + GG H D YD +GR+ + P+ D + Sbjct: 138 AEIMRPPYLWMGIGGNASTLHYDSYDNLYAMVSGRKHITLFPPSDRAHLYPYVDQRKHRH 197 Query: 165 FEAI------------------IDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRA 204 F + + L GDILYIP G+ H + + +N +V + Sbjct: 198 FSQVNLRCPDLSQFPDLLNARPFECVLSRGDILYIPEGWWHYLRS--HGLNVAVNWWW 253 >UniRef50_A9VDD4 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VDD4_MONBE Length = 2295 Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats. Identities = 53/344 (15%), Positives = 94/344 (27%), Gaps = 60/344 (17%) Query: 9 WPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPF 68 W DF + V + + + +P + L + LV G Sbjct: 100 WADFAPSYQTSHRVYPAKADSRALGSATPLPVFELLQRPGTLTALVQDDAGALVYRSDTQ 159 Query: 69 ESYDHLGETNWSLLVQAVNHWHEPTAALM--RPFRELPD-----WRIDDL---------- 111 + W L Q + ++ ++ F W DD Sbjct: 160 VAPIPPDLNTWDKLFQHIQTHQPTSSLVIFPERFSHTFALDLDPWLYDDYYQALTDAFNV 219 Query: 112 -----MISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVG-----EKLQMKQHCPHPDLLQ 161 + G + PH D DVF+ Q +G + W++ + + P P Sbjct: 220 PVTQHVYITGPAGRALNPHTDGGDVFVRQISGSKHWQLCVPRLQDPSVCQPSAPSPTAHP 279 Query: 162 VDPFE-----------------------AIIDE---ELEPGDILYIPPGFPHEGYALENA 195 A +D L PGD LY+P G H + Sbjct: 280 CTDGARARYAEWKRDQFEGCTPYTMGQLADMDCSNITLHPGDTLYLPRGIVHHAWTDSGI 339 Query: 196 MNYSVGFRAPNT-RELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEMDKLREMM 254 + V ++ + L + R G + D A+ ++ +L + Sbjct: 340 TSTHVTYQLQSKDATLFDRLRNQCYARASTGTRWCDSLDNLVANNFPHATSDLLRLASTL 399 Query: 255 LE-----LINQPE-HFKQWFGEFISQSRHELDIAPPEPPYQPDE 292 E ++ Q ++K R 
P P D+ Sbjct: 400 HEDKIAAMLWQLRFNYKDLMTTLARPRRDITSKDHPPPGVCCDD 443 >UniRef50_A4QNS2 LOC733353 protein n=3 Tax=Xenopus RepID=A4QNS2_XENLA Length = 446 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 54/367 (14%), Positives = 104/367 (28%), Gaps = 64/367 (17%) Query: 52 RLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDL 111 + G Q G F YDH W+ L F++ + D + Sbjct: 88 QFQKWVGGTSQDDWGSFSDYDH--SEYWAYA---------DYKYLAVVFKDQAEMLQDVV 136 Query: 112 MISFSVPG----------GGVG----PHLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCPH 156 F PG G G H+D Y ++Q GR+ W + P Sbjct: 137 WADFGFPGRDGKESSLWVGSFGANTPCHVDSYGCNLVLQVEGRKTWHLFPPEDTPYMYPT 196 Query: 157 -----------------PDLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMN 197 PD + F + L PG +L++P + H ++++ + Sbjct: 197 RIPYEESSIFSKVNIVKPDQSRFPLFSRASPHVVTLHPGQVLFVPQHWWHYVQSVDD-IT 255 Query: 198 YSVGFRAPNTRELISGFADYVLQRELGGNYYSDP-----DVPPRAHPADVLPQEMDKLRE 252 S+ + + + + + DP + P M L Sbjct: 256 VSINSWIELDSDHEFRVQEAITRTLVCLFKSVDPRSTLDWLNPTEDEVPSHQTNMQYLSR 315 Query: 253 MMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVR----LGG 308 + + +Q S + + D + G L G Sbjct: 316 SVAAYAEHHKQEQQSSDSCPGNSGPKKRKTGKDAKAAEDVSLPSAPFGPFLTPVLPIPKG 375 Query: 309 LRVLRIGDDVYANGEKIDSPHRPALDALA---------SNIALTAENFGDALEDPSFLAM 359 + R+ + + + + +P A S ++++ D L DP + + Sbjct: 376 TNLGRVENSLADKQQNASTEIQPHPMVKAYDSSPGQSNSGRIISSDEVMDCLVDPRVIQL 435 Query: 360 LAALVNS 366 + L+ Sbjct: 436 VTELLLQ 442 >UniRef50_Q2SID9 Uncharacterized conserved protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SID9_HAHCH Length = 304 Score = 104 bits (259), Expect = 7e-21, Method: Composition-based stats. 
Identities = 37/211 (17%), Positives = 65/211 (30%), Gaps = 27/211 (12%) Query: 20 RPVVLKRGFNNFIDP--ISPDELAGLAMESEVDSRL-------VSHQDGKWQVSHGPFES 70 +PV+L G + SPD V L K Sbjct: 33 KPVILTGGALAWPALQKWSPDYFRRRFASQRVRPSLQLPAYGAPYFSTEKHHRREMYLSE 92 Query: 71 YDHLGETNWSLLVQAVNHWHE-PTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY 129 + + E+ + V + R + +P + + + G H D Sbjct: 93 FVDILESGDACYVDQTDVHSFIGLEEDYRYQQFIPPGL--NFISLWIGSKTRSGLHYDNM 150 Query: 130 DVFIIQGTGRRRWRVGEKLQMKQHCP-------------HPDLLQVDPFEAIIDEE--LE 174 D +Q G ++ + + + P PDL+ F L+ Sbjct: 151 DNLFVQVYGEKKAILLAPREARNLYPFGDCISKSRVDPERPDLMHYPRFAKAQTLTARLQ 210 Query: 175 PGDILYIPPGFPHEGYALENAMNYSVGFRAP 205 PGDIL+ P G+ H + +++ S + AP Sbjct: 211 PGDILFFPRGWWHHFSSAGPSISLSCWYGAP 241 >UniRef50_C3Z534 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Z534_BRAFL Length = 346 Score = 104 bits (259), Expect = 7e-21, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 55/169 (32%), Gaps = 24/169 (14%) Query: 97 MRPFRELPDWRIDDLMISFSVPGGGVGPHLDQY-DVFIIQGTGRRRWRVGEKLQMKQHCP 155 F P + I G H D Y ++Q GR++W + + P Sbjct: 144 WEDFGF-PGRTGTESTIWVGSAGAHTPCHYDTYGCNLVLQVYGRKKWVLFPPEDSPKLYP 202 Query: 156 -----------------HPDLLQVD--PFEAIIDEELEPGDILYIPPGFPHEGYALENAM 196 HPD+ + LEPGD+L++P + H +L + Sbjct: 203 TRLPYEESSVFSQVNVAHPDVEEHPKVMSSHPHVVILEPGDVLFVPKHWWHYVESL--ST 260 Query: 197 NYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQ 245 + +V + + V++ L SD D +P +V Sbjct: 261 SVAVNSWIEMASDAEDRLQEAVVRLVLLS-IKSDSDQSQWLNPTEVTES 308 >UniRef50_A7SQQ9 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7SQQ9_NEMVE Length = 280 Score = 103 bits (258), Expect = 8e-21, Method: Composition-based stats. 
Identities = 39/274 (14%), Positives = 74/274 (27%), Gaps = 51/274 (18%) Query: 20 RPVVLKRGFNNFI-DPISPDELAGLAMESEVDSRLVSHQ---------DGKWQVSHGPFE 69 +P+V + +P LA + E R+ + + G F+ Sbjct: 9 KPLVFHGFIKEWSCSMWTPLFLASELGQLETRFRMCQRRSIPKQKPLMETDCCYVQGTFK 68 Query: 70 SY--------------DHLGETNWSLLVQA--VNHWHEPTAALMRPFRE----LPDWRID 109 + + + + L R P Sbjct: 69 DFCSWLESDNSESGKLIQYPRSEYCCYADYKYMAELFHDFPDLCRASDWSKFGFPGRTGQ 128 Query: 110 DLMISFSVPGGGVGPHLDQYD-VFIIQGTGRRRWRVGEKLQMKQHCP------------- 155 I G H+D Y + + Q GR++W + + + P Sbjct: 129 QSTIWVGSEGAFTPCHMDTYGSILVAQIFGRKKWTLFDPMDTDNLYPTRIPYEESSVFSK 188 Query: 156 ----HPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELI 211 PD F E + GD+L +P + H L+ S+ + + Sbjct: 189 VNITSPDYQAFPLFRKATPYEAKAGDVLLVPKHWWHFVECLDT--TISINTWVEMDDDKL 246 Query: 212 SGFADYVLQRELGGNYYSDP-DVPPRAHPADVLP 244 + + + + SD DV +P +V Sbjct: 247 DRVKEAIARVIIFSLKDSDKEDVAGWLNPTEVRK 280 >UniRef50_Q1DDR6 JmjC domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DDR6_MYXXD Length = 335 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. 
Identities = 44/232 (18%), Positives = 85/232 (36%), Gaps = 32/232 (13%) Query: 1 MEYQLTLNWPDFL-ERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDG 59 +E + ++ +F ++ RPVVL+ ++ +V+ ++S +D Sbjct: 88 LEKRRGVSPEEFFTRYYFGHRPVVLQGFMEDWPAMRRWSLADFRERFGDVEVEIMSGRDA 147 Query: 60 KWQVSHGP------------FESYDHLGETNWSLLVQAVNHWHEP-TAALMRPFREL--- 103 + P + + GE+N +V +W A L R Sbjct: 148 NPDHASQPDKHRQVVKLRDYVQRVETGGESNDFYMVPRNENWKRDGLARLREDIRAPAGI 207 Query: 104 --PDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH----- 156 P+ R D + PG H D +V + Q GR+ R+ Q P Sbjct: 208 IDPELRPDMTTLLLGPPGTVTPLHHDNMNVLLGQVMGRKHVRLVPSFQRHLVYPRHGTFS 267 Query: 157 ------PDLLQVDPF--EAIIDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 PD + + +++ +EPG++L++P G+ H AL+ + + Sbjct: 268 SVDAASPDAARFPLYGEATVLEGVVEPGELLFLPVGWWHWVRALDVSATVTF 319 >UniRef50_A4RKC1 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4RKC1_MAGGR Length = 532 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 43/302 (14%), Positives = 82/302 (27%), Gaps = 101/302 (33%) Query: 1 MEYQLTLNWPDFLERHWQK--------RPVVLKRGFNNFI-----DPISPDELAGLAMES 47 + L+ F + + + P+++ +++ P L + Sbjct: 231 VRRLEQLSLESF-QSYMDRPSDSDLGPEPLIITGVTDDWPARTTNPWCKPAYLLSRTLNG 289 Query: 48 EVDSRL---VSHQDGKWQVSHGPFESYDHLGETN-------------------------- 78 + + S+ D W PF ++ Sbjct: 290 QRLVPVETGRSYVDEGWGQKIIPFAAFLEGYIDRPAVSSSADHGSRNSERTEGGGTAGSL 349 Query: 79 ---WSLLVQAVNHWHEPTAALMRPFRELPDWRI---------------------DDLMIS 114 S+ A + +L R D ++ + Sbjct: 350 HKQASIAYLAQHQLFAQLPSLRDDIRIPDYCYTAPPPPPASMFPMSEQRPPELEDPILNA 409 Query: 115 FSVPGGGVGP-HLDQYDVFIIQGTGRRRWRVGEKLQMKQHC-------------PHPDLL 160 + P G + P H D Y + Q GR+ R+ LQ + H D+ Sbjct: 410 WFGPPGTITPLHTDPYHNMLSQVVGRKYVRLYSHLQTPRMAARGVEDGVEMSNTSHFDVG 469 Query: 161 QVDPFE--------------------AIIDEELEPGDILYIPPGFPHEGYALENAMNYSV 200 ++ ++ +D LEPGD LYIP G+ H L + + S Sbjct: 470 VMEGWDEANGQDEKESQKNAVDFGSIPFLDCILEPGDTLYIPVGWWHYVRGLSVSFSVSF 529 Query: 201 GF 202 + Sbjct: 530 WW 531 Database: uniref50.fasta 
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.310    0.145    0.404

Lambda     K      H
   0.267   0.0448    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,252,728,646
Number of Sequences: 3077464
Number of extensions: 98177688
Number of successful extensions: 270729
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 971
Number of HSP's successfully gapped in prelim test: 453
Number of HSP's that attempted gapping in prelim test: 267371
Number of HSP's gapped (non-prelim): 1713
length of query: 373
length of database: 1,040,396,356
effective HSP length: 130
effective length of query: 243
effective length of database: 640,326,036
effective search space: 155599226748
effective search space used: 155599226748
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 94 (40.7 bits)
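
The length and cutoff figures in the summary above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the BLAST output, and it rests on three assumptions: that effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence; that raw scores convert to bits as (Lambda * S - ln K) / ln 2 and drop-offs as Lambda * X / ln 2; and that the first Lambda/K/H row is the ungapped set and the second the gapped set, which the report does not label. All variable and function names are illustrative.

```python
import math

# Values copied from the report summary.
query_len = 373
db_letters = 1_040_396_356
db_seqs = 3_077_464
hsp_len = 130

# Assumption: first Lambda/K row is ungapped, second row is gapped.
UNGAPPED = (0.310, 0.145)
GAPPED = (0.267, 0.0448)

# Effective lengths: subtract the effective HSP length from the query
# and once for every sequence in the database.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X drop-off to bits: lambda * X / ln 2."""
    return lam * raw / math.log(2)


print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", search_space)
print("S1 bits:", round(bit_score(41, *UNGAPPED), 1))
print("S2 bits:", round(bit_score(94, *GAPPED), 1))
print("X1 bits:", round(dropoff_bits(16, UNGAPPED[0]), 1))
print("X2 bits:", round(dropoff_bits(38, GAPPED[0]), 1))
print("X3 bits:", round(dropoff_bits(64, GAPPED[0]), 1))
```

Under these assumptions the computed values agree with the figures printed in the report. The bit scores of individual hits are not expected to reproduce exactly this way, because composition-based statistics rescale the scoring per alignment.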