BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (325 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B7LDM7 Protein csiD n=104 Tax=Bacteria RepID=CSID_ECO55 640 0.0 UniRef50_Q47V67 Putative uncharacterized protein n=2 Tax=Alterom... 301 2e-80 UniRef50_D0RN41 Protein CsiD n=1 Tax=alpha proteobacterium HIMB1... 283 7e-75 UniRef50_Q4FKY1 Gab protein n=3 Tax=Candidatus Pelagibacter RepI... 275 2e-72 UniRef50_A9VGL1 Putative uncharacterized protein n=17 Tax=Bacter... 259 6e-68 UniRef50_Q2KU12 Carbon starvation-inducible protein n=1 Tax=Bord... 254 2e-66 >UniRef50_B7LDM7 Protein csiD n=104 Tax=Bacteria RepID=CSID_ECO55 Length = 325 Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust. Identities = 317/325 (97%), Positives = 321/325 (98%) Query: 1 MNALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 MNALTAV NNAVDSGQDYSGFTL PSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK Sbjct: 1 MNALTAVHNNAVDSGQDYSGFTLIPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 Query: 61 SFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAH 120 SFLRFRV KILDDLCANQLQPLLLKTLLNRAEGALLINAVG+DDV QADEMVKLATAVAH Sbjct: 61 SFLRFRVGKILDDLCANQLQPLLLKTLLNRAEGALLINAVGIDDVAQADEMVKLATAVAH 120 Query: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE Sbjct: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 Query: 181 QNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 QNMQGGNSLLLHLDDWEHLD+YFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV Sbjct: 181 QNMQGGNSLLLHLDDWEHLDHYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 Query: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH Sbjct: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 Query: 301 PDLRRELMRQRGYFAYASNHYQTHQ 325 PDLRRELMRQRGYFAYA++HYQTHQ Sbjct: 301 PDLRRELMRQRGYFAYATHHYQTHQ 325 >UniRef50_Q47V67 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q47V67_COLP3 Length = 320 Score = 301 bits (771), Expect = 2e-80, Method: Compositional matrix adjust. Identities = 151/302 (50%), Positives = 204/302 (67%), Gaps = 4/302 (1%) Query: 20 GFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQL 79 GF++ P A +PRL + + +T K F+EQ VQA+EYK FLRF VA IL+ L + L Sbjct: 16 GFSVAPFAANPRLQVIKLSSETVKGFIEQALPLGVQAIEYKPFLRFHVAGILNHLTNDTL 75 Query: 80 QPLLLKTLLNRAEGALLINAVGVD-DVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARF 138 LLL + NR GA ++ V + Q + + L+TAV+HLIG N DAM G++YARF Sbjct: 76 GALLLGIIKNRDTGAFMLQCEPVAAEFDQLEFNILLSTAVSHLIGVPNLDAMYGKFYARF 135 Query: 139 VVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEH 198 VKN DNSDSYLRQ HR MELHNDGTYVEE TD+V+M KI E+NM GG+SLLLH+D+W+ Sbjct: 136 SVKNEDNSDSYLRQAHRRMELHNDGTYVEERTDWVIMQKIAEENMAGGDSLLLHVDEWQD 195 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPV-FDVDQQGRPVMRYIDQFVQPKDFEEGV 257 L+ ++ HPLA+ +++ +P SKN++ + HPV F+ D GRP M +IDQF +P + +G Sbjct: 196 LEKFYNHPLAKEDIQWTSPASKNITYKMQHPVFFEEDDNGRPKMLFIDQFAEPLNMRQGQ 255 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYA 317 +L E+ ++E + +V VPVG L++NN WLHGRD+F L REL+RQRG AY Sbjct: 256 YLYEMGTSLEAEQNTFNVRVPVGSMLVVNNHAWLHGRDKFVADKGLYRELLRQRG--AYC 313 Query: 318 SN 319 N Sbjct: 314 EN 315 >UniRef50_D0RN41 Protein CsiD n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RN41_9RICK Length = 313 Score = 283 bits (723), Expect = 7e-75, Method: Compositional matrix adjust. Identities = 133/297 (44%), Positives = 194/297 (65%), Gaps = 1/297 (0%) Query: 19 SGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQ 78 +G + + S RL+++ + EQ ++ + +EYK FLRF + I + + + Sbjct: 15 TGLIVNNHSSSQRLVDIKIENDYLDKVKEQFDQFDLLDIEYKPFLRFHITDIFNKIFNEK 74 Query: 79 LQPLLLKTLLNRAEGALLINAVGVDDVK-QADEMVKLATAVAHLIGRSNFDAMSGQYYAR 137 +Q L LLNR +GA +I +D K D +VKL+TA+ HL+G NFDAM G+YYAR Sbjct: 75 IQSLTKTILLNRNQGAFVIGPEAMDQSKYDTDFLVKLSTALTHLVGIPNFDAMYGKYYAR 134 Query: 138 FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE 197 F VKN DNSDSYLR+ + ++LH DGTYV+E TD++LMMK+ E+N +GG S LLHLDDWE Sbjct: 135 FEVKNTDNSDSYLRKAAKKLDLHTDGTYVKEKTDWLLMMKMKEENSEGGESTLLHLDDWE 194 Query: 198 HLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGV 257 + +F P+ + + +P SKNV + HP+F D+QG+P++ YIDQF +P+ ++G+ Sbjct: 195 DCEKFFTDPIGKENFVWGSPKSKNVDYKIEHPIFSTDRQGKPIISYIDQFPEPQSLKQGL 254 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 +L+ LS+++E SK ++S +P G + NN F LHGR F PHP L REL+RQRG F Sbjct: 255 YLNSLSESLEGSKKLISFKLPPGYAIFSNNYFMLHGRKAFKPHPKLYRELLRQRGIF 311 >UniRef50_Q4FKY1 Gab protein n=3 Tax=Candidatus Pelagibacter RepID=Q4FKY1_PELUB Length = 303 Score = 275 bits (703), Expect = 2e-72, Method: Compositional matrix adjust. Identities = 135/299 (45%), Positives = 197/299 (65%) Query: 16 QDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLC 75 ++ SG T+T S R++++ ++ + + ++ + ALEYK F RF +AK LDDL Sbjct: 2 ENISGITITEHQNSKRIIDIRIEDEILDKLIFPFNKFDITALEYKPFTRFTIAKSLDDLT 61 Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYY 135 +N+L LL + +R G +I V +VKL+TA+A+LIG N+DAM+G+YY Sbjct: 62 SNKLSKLLNSIVRDRETGCFIIGPKKVSAKINDIFLVKLSTAIAYLIGNPNYDAMAGKYY 121 Query: 136 ARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD 195 ARF VK+ D SDSYLR+ + M+LH DGTYV+EITD++LM KI+EQN+QGG + +LHLDD Sbjct: 122 ARFFVKHEDKSDSYLRKAYTNMDLHTDGTYVKEITDWLLMTKIEEQNVQGGETAMLHLDD 181 Query: 196 WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEE 255 WEH ++ F P+ ++ + +P SKN+ V HPVF D G+P + YIDQF +PK+ ++ Sbjct: 182 WEHCEDLFNDPIGKQNFVWGSPKSKNIEYKVEHPVFTTDDNGKPNISYIDQFPEPKNMDQ 241 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 G++L +LSDA+E SK + + G ++ NN FWLHGR F + DL REL+R RG F Sbjct: 242 GIFLQKLSDALEESKNKVITKLVPGSTIVANNYFWLHGRKPFKENKDLSRELLRIRGSF 300 >UniRef50_A9VGL1 Putative uncharacterized protein n=17 Tax=Bacteria RepID=A9VGL1_BACWK Length = 312 Score = 259 bits (663), Expect = 6e-68, Method: Compositional matrix adjust. Identities = 131/313 (41%), Positives = 185/313 (59%), Gaps = 3/313 (0%) Query: 3 ALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSF 62 ++ QN + + Y G+ + P + RL + +Q KQF E+V E Q+L+Y + Sbjct: 2 SIITEQNTKMKFAKKYEGYEIVPHPEHKRLYHIVSNQQLLKQFFEEVKEHSEQSLQYIPY 61 Query: 63 LRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLI 122 RF +A + + + + +R G I G + + VK ATA+ HLI Sbjct: 62 SRFNLADGMRKIFGQSFMDNIRGIVHDRETGGFTIGVQG--ETSDPADYVKFATALTHLI 119 Query: 123 GRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQN 182 G NFDAM+G YYARF VK+ D+SDSYLRQ +R+ LH DGT+V+E TD++LMMKI+EQN Sbjct: 120 GEPNFDAMTGTYYARFNVKDTDSSDSYLRQAYRLFTLHTDGTFVDEPTDWLLMMKIEEQN 179 Query: 183 MQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMR 242 GG S LLHLDDWE L + H LA + + APPSKN + V+ F D P + Sbjct: 180 AVGGESRLLHLDDWEDLHKFRNHSLASVKVTYKAPPSKNAQEIVYRETF-FDVNNAPCIC 238 Query: 243 YIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 +IDQF P + E+ +L +LS ++E S ++ +P+G +L+NNLFW+HGR F + D Sbjct: 239 FIDQFAYPDNIEQANYLKDLSYSVENSPATHALKLPIGDLVLLNNLFWMHGRAAFEKNKD 298 Query: 303 LRRELMRQRGYFA 315 L RELMRQRG F+ Sbjct: 299 LYRELMRQRGCFS 311 >UniRef50_Q2KU12 Carbon starvation-inducible protein n=1 Tax=Bordetella avium 197N RepID=Q2KU12_BORA1 Length = 305 Score = 254 bits (650), Expect = 2e-66, Method: Compositional matrix adjust. Identities = 132/307 (42%), Positives = 194/307 (63%), Gaps = 4/307 (1%) Query: 9 NNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVA 68 N+ +D + +SG T++ R+ ++T + ++FL+Q VQ LEY F+RF++A Sbjct: 2 NDRLDLQRLFSG-TVSDHQTHTRVRQVTLESEGLERFLDQARAIDVQNLEYVPFMRFKLA 60 Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 +L C L+ L + +R G I G+ D+ V+ TAV +L+G +N D Sbjct: 61 DMLLQACGEGLRATLNALVEDRRHGGFTIGLQGLS--ADPDDFVRFGTAVGYLLGPANHD 118 Query: 129 AMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 +MSG+YYARF+VK+ DNSDSYLRQ +R+ +H DGTYV E TD++LMMK DE+N GG S Sbjct: 119 SMSGKYYARFLVKHTDNSDSYLRQAYRLFTMHTDGTYVTEATDWLLMMKFDERNAVGGES 178 Query: 189 LLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFV 248 LHLDDW LD + + PLA +P+ + +P SKNV++ V P+F + G V +IDQFV Sbjct: 179 RFLHLDDWADLDRFTQDPLATQPLLYKSPASKNVAEQVERPLFFQSRYGLSVC-FIDQFV 237 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 QP EE ++L +LS ++E+S G+ + +P G+ +++NN F+LHGR F + L RELM Sbjct: 238 QPATLEEALYLHDLSASMESSAGVQEITLPPGELVVLNNYFYLHGRAPFEKNEALHRELM 297 Query: 309 RQRGYFA 315 R RG FA Sbjct: 298 RIRGLFA 304 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7LDM7 Protein csiD n=104 Tax=Bacteria RepID=CSID_ECO55 520 e-146 UniRef50_A9VGL1 Putative uncharacterized protein n=17 Tax=Bacter... 462 e-129 UniRef50_D0RN41 Protein CsiD n=1 Tax=alpha proteobacterium HIMB1... 460 e-128 UniRef50_Q47V67 Putative uncharacterized protein n=2 Tax=Alterom... 455 e-126 UniRef50_Q4FKY1 Gab protein n=3 Tax=Candidatus Pelagibacter RepI... 451 e-125 UniRef50_Q2KU12 Carbon starvation-inducible protein n=1 Tax=Bord... 449 e-125 Sequences not found previously or not previously below threshold: UniRef50_Q05XB9 Putative uncharacterized protein n=1 Tax=Synecho... 60 1e-07 UniRef50_A9UUQ7 Predicted protein n=1 Tax=Monosiga brevicollis R... 57 1e-06 UniRef50_C5PDZ6 Putative uncharacterized protein n=2 Tax=Coccidi... 56 2e-06 UniRef50_A9UW47 Predicted protein n=1 Tax=Monosiga brevicollis R... 52 3e-05 UniRef50_C4JUF4 Predicted protein n=6 Tax=Eurotiomycetidae RepID... 52 4e-05 UniRef50_B6HPB8 Pc22g01880 protein n=1 Tax=Penicillium chrysogen... 49 2e-04 UniRef50_A1TTH8 Taurine catabolism dioxygenase TauD/TfdA n=9 Tax... 48 4e-04 UniRef50_Q112B1 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax... 48 5e-04 UniRef50_UPI0001AF241C oxygenase (secreted protein) n=1 Tax=Stre... 48 5e-04 UniRef50_B0JNS2 Putative uncharacterized protein n=1 Tax=Microcy... 47 8e-04 UniRef50_A7SHP2 Predicted protein (Fragment) n=4 Tax=Nematostell... 47 0.001 UniRef50_Q2UFS3 Predicted protein n=3 Tax=Aspergillus RepID=Q2UF... 46 0.001 UniRef50_A8TRC8 Putative uncharacterized protein n=1 Tax=alpha p... 46 0.001 UniRef50_A8TRK7 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 46 0.002 UniRef50_D2SNE3 Putative uncharacterized protein n=1 Tax=Strepto... 45 0.002 UniRef50_C7ZJ20 Putative uncharacterized protein n=1 Tax=Nectria... 45 0.004 UniRef50_A0YX02 Gamma-butyrobetaine hydroxylase, putative n=2 Ta... 45 0.005 UniRef50_Q2CHG0 Putative uncharacterized protein n=1 Tax=Oceanic... 44 0.006 UniRef50_C7N0A3 Taurine catabolism dioxygenase TauD, TfdA family... 44 0.007 UniRef50_C3K215 Putative uncharacterized protein n=1 Tax=Pseudom... 44 0.008 UniRef50_UPI0000521F63 PREDICTED: similar to CG14630 CG14630-PA ... 43 0.010 UniRef50_B6EK40 Putative uncharacterized protein n=1 Tax=Aliivib... 43 0.010 UniRef50_D0N1N0 Trimethyllysine dioxygenase, putative n=1 Tax=Ph... 43 0.012 UniRef50_UPI000023E985 hypothetical protein FG04441.1 n=1 Tax=Gi... 43 0.013 UniRef50_A0KBP7 Putative uncharacterized protein n=5 Tax=Burkhol... 43 0.015 UniRef50_Q1AWV7 Putative uncharacterized protein n=1 Tax=Rubroba... 43 0.015 UniRef50_Q9NF72 EG:BACR7A4.9 protein n=10 Tax=Drosophila RepID=Q... 43 0.017 UniRef50_B3T5P9 Putative gamma-butyrobetaine hydroxylase n=1 Tax... 42 0.022 UniRef50_A8TTL2 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 42 0.035 UniRef50_Q6CQT2 KLLA0D14553p n=1 Tax=Kluyveromyces lactis RepID=... 42 0.039 >UniRef50_B7LDM7 Protein csiD n=104 Tax=Bacteria RepID=CSID_ECO55 Length = 325 Score = 520 bits (1339), Expect = e-146, Method: Composition-based stats. Identities = 317/325 (97%), Positives = 321/325 (98%) Query: 1 MNALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 MNALTAV NNAVDSGQDYSGFTL PSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK Sbjct: 1 MNALTAVHNNAVDSGQDYSGFTLIPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 Query: 61 SFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAH 120 SFLRFRV KILDDLCANQLQPLLLKTLLNRAEGALLINAVG+DDV QADEMVKLATAVAH Sbjct: 61 SFLRFRVGKILDDLCANQLQPLLLKTLLNRAEGALLINAVGIDDVAQADEMVKLATAVAH 120 Query: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE Sbjct: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 Query: 181 QNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 QNMQGGNSLLLHLDDWEHLD+YFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV Sbjct: 181 QNMQGGNSLLLHLDDWEHLDHYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 Query: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH Sbjct: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 Query: 301 PDLRRELMRQRGYFAYASNHYQTHQ 325 PDLRRELMRQRGYFAYA++HYQTHQ Sbjct: 301 PDLRRELMRQRGYFAYATHHYQTHQ 325 >UniRef50_A9VGL1 Putative uncharacterized protein n=17 Tax=Bacteria RepID=A9VGL1_BACWK Length = 312 Score = 462 bits (1190), Expect = e-129, Method: Composition-based stats. Identities = 131/313 (41%), Positives = 185/313 (59%), Gaps = 3/313 (0%) Query: 3 ALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSF 62 ++ QN + + Y G+ + P + RL + +Q KQF E+V E Q+L+Y + Sbjct: 2 SIITEQNTKMKFAKKYEGYEIVPHPEHKRLYHIVSNQQLLKQFFEEVKEHSEQSLQYIPY 61 Query: 63 LRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLI 122 RF +A + + + + +R G I G + + VK ATA+ HLI Sbjct: 62 SRFNLADGMRKIFGQSFMDNIRGIVHDRETGGFTIGVQG--ETSDPADYVKFATALTHLI 119 Query: 123 GRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQN 182 G NFDAM+G YYARF VK+ D+SDSYLRQ +R+ LH DGT+V+E TD++LMMKI+EQN Sbjct: 120 GEPNFDAMTGTYYARFNVKDTDSSDSYLRQAYRLFTLHTDGTFVDEPTDWLLMMKIEEQN 179 Query: 183 MQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMR 242 GG S LLHLDDWE L + H LA + + APPSKN + V+ F D P + Sbjct: 180 AVGGESRLLHLDDWEDLHKFRNHSLASVKVTYKAPPSKNAQEIVYRETF-FDVNNAPCIC 238 Query: 243 YIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 +IDQF P + E+ +L +LS ++E S ++ +P+G +L+NNLFW+HGR F + D Sbjct: 239 FIDQFAYPDNIEQANYLKDLSYSVENSPATHALKLPIGDLVLLNNLFWMHGRAAFEKNKD 298 Query: 303 LRRELMRQRGYFA 315 L RELMRQRG F+ Sbjct: 299 LYRELMRQRGCFS 311 >UniRef50_D0RN41 Protein CsiD n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RN41_9RICK Length = 313 Score = 460 bits (1184), Expect = e-128, Method: Composition-based stats. Identities = 133/300 (44%), Positives = 195/300 (65%), Gaps = 1/300 (0%) Query: 16 QDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLC 75 + +G + + S RL+++ + EQ ++ + +EYK FLRF + I + + Sbjct: 12 EKVTGLIVNNHSSSQRLVDIKIENDYLDKVKEQFDQFDLLDIEYKPFLRFHITDIFNKIF 71 Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVK-QADEMVKLATAVAHLIGRSNFDAMSGQY 134 ++Q L LLNR +GA +I +D K D +VKL+TA+ HL+G NFDAM G+Y Sbjct: 72 NEKIQSLTKTILLNRNQGAFVIGPEAMDQSKYDTDFLVKLSTALTHLVGIPNFDAMYGKY 131 Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD 194 YARF VKN DNSDSYLR+ + ++LH DGTYV+E TD++LMMK+ E+N +GG S LLHLD Sbjct: 132 YARFEVKNTDNSDSYLRKAAKKLDLHTDGTYVKEKTDWLLMMKMKEENSEGGESTLLHLD 191 Query: 195 DWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFE 254 DWE + +F P+ + + +P SKNV + HP+F D+QG+P++ YIDQF +P+ + Sbjct: 192 DWEDCEKFFTDPIGKENFVWGSPKSKNVDYKIEHPIFSTDRQGKPIISYIDQFPEPQSLK 251 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 +G++L+ LS+++E SK ++S +P G + NN F LHGR F PHP L REL+RQRG F Sbjct: 252 QGLYLNSLSESLEGSKKLISFKLPPGYAIFSNNYFMLHGRKAFKPHPKLYRELLRQRGIF 311 >UniRef50_Q47V67 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q47V67_COLP3 Length = 320 Score = 455 bits (1170), Expect = e-126, Method: Composition-based stats. Identities = 151/302 (50%), Positives = 203/302 (67%), Gaps = 4/302 (1%) Query: 20 GFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQL 79 GF++ P A +PRL + + +T K F+EQ VQA+EYK FLRF VA IL+ L + L Sbjct: 16 GFSVAPFAANPRLQVIKLSSETVKGFIEQALPLGVQAIEYKPFLRFHVAGILNHLTNDTL 75 Query: 80 QPLLLKTLLNRAEGALLINAVGVD-DVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARF 138 LLL + NR GA ++ V + Q + + L+TAV+HLIG N DAM G++YARF Sbjct: 76 GALLLGIIKNRDTGAFMLQCEPVAAEFDQLEFNILLSTAVSHLIGVPNLDAMYGKFYARF 135 Query: 139 VVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEH 198 VKN DNSDSYLRQ HR MELHNDGTYVEE TD+V+M KI E+NM GG+SLLLH+D+W+ Sbjct: 136 SVKNEDNSDSYLRQAHRRMELHNDGTYVEERTDWVIMQKIAEENMAGGDSLLLHVDEWQD 195 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD-VDQQGRPVMRYIDQFVQPKDFEEGV 257 L+ ++ HPLA+ +++ +P SKN++ + HPVF D GRP M +IDQF +P + +G Sbjct: 196 LEKFYNHPLAKEDIQWTSPASKNITYKMQHPVFFEEDDNGRPKMLFIDQFAEPLNMRQGQ 255 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYA 317 +L E+ ++E + +V VPVG L++NN WLHGRD+F L REL+RQRG AY Sbjct: 256 YLYEMGTSLEAEQNTFNVRVPVGSMLVVNNHAWLHGRDKFVADKGLYRELLRQRG--AYC 313 Query: 318 SN 319 N Sbjct: 314 EN 315 >UniRef50_Q4FKY1 Gab protein n=3 Tax=Candidatus Pelagibacter RepID=Q4FKY1_PELUB Length = 303 Score = 451 bits (1160), Expect = e-125, Method: Composition-based stats. Identities = 135/299 (45%), Positives = 197/299 (65%) Query: 16 QDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLC 75 ++ SG T+T S R++++ ++ + + ++ + ALEYK F RF +AK LDDL Sbjct: 2 ENISGITITEHQNSKRIIDIRIEDEILDKLIFPFNKFDITALEYKPFTRFTIAKSLDDLT 61 Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYY 135 +N+L LL + +R G +I V +VKL+TA+A+LIG N+DAM+G+YY Sbjct: 62 SNKLSKLLNSIVRDRETGCFIIGPKKVSAKINDIFLVKLSTAIAYLIGNPNYDAMAGKYY 121 Query: 136 ARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD 195 ARF VK+ D SDSYLR+ + M+LH DGTYV+EITD++LM KI+EQN+QGG + +LHLDD Sbjct: 122 ARFFVKHEDKSDSYLRKAYTNMDLHTDGTYVKEITDWLLMTKIEEQNVQGGETAMLHLDD 181 Query: 196 WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEE 255 WEH ++ F P+ ++ + +P SKN+ V HPVF D G+P + YIDQF +PK+ ++ Sbjct: 182 WEHCEDLFNDPIGKQNFVWGSPKSKNIEYKVEHPVFTTDDNGKPNISYIDQFPEPKNMDQ 241 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 G++L +LSDA+E SK + + G ++ NN FWLHGR F + DL REL+R RG F Sbjct: 242 GIFLQKLSDALEESKNKVITKLVPGSTIVANNYFWLHGRKPFKENKDLSRELLRIRGSF 300 >UniRef50_Q2KU12 Carbon starvation-inducible protein n=1 Tax=Bordetella avium 197N RepID=Q2KU12_BORA1 Length = 305 Score = 449 bits (1156), Expect = e-125, Method: Composition-based stats. Identities = 132/307 (42%), Positives = 194/307 (63%), Gaps = 4/307 (1%) Query: 9 NNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVA 68 N+ +D + +SG T++ R+ ++T + ++FL+Q VQ LEY F+RF++A Sbjct: 2 NDRLDLQRLFSG-TVSDHQTHTRVRQVTLESEGLERFLDQARAIDVQNLEYVPFMRFKLA 60 Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 +L C L+ L + +R G I G+ D+ V+ TAV +L+G +N D Sbjct: 61 DMLLQACGEGLRATLNALVEDRRHGGFTIGLQGLS--ADPDDFVRFGTAVGYLLGPANHD 118 Query: 129 AMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 +MSG+YYARF+VK+ DNSDSYLRQ +R+ +H DGTYV E TD++LMMK DE+N GG S Sbjct: 119 SMSGKYYARFLVKHTDNSDSYLRQAYRLFTMHTDGTYVTEATDWLLMMKFDERNAVGGES 178 Query: 189 LLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFV 248 LHLDDW LD + + PLA +P+ + +P SKNV++ V P+F + G V +IDQFV Sbjct: 179 RFLHLDDWADLDRFTQDPLATQPLLYKSPASKNVAEQVERPLFFQSRYGLSVC-FIDQFV 237 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 QP EE ++L +LS ++E+S G+ + +P G+ +++NN F+LHGR F + L RELM Sbjct: 238 QPATLEEALYLHDLSASMESSAGVQEITLPPGELVVLNNYFYLHGRAPFEKNEALHRELM 297 Query: 309 RQRGYFA 315 R RG FA Sbjct: 298 RIRGLFA 304 >UniRef50_Q05XB9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05XB9_9SYNE Length = 290 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 53/241 (21%), Positives = 87/241 (36%), Gaps = 34/241 (14%) Query: 92 EGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR 151 G ++ G+ ++ +L A+ +G D G+ Y D+ SYL Sbjct: 54 HGYGVVLIRGLSPQPESTFR-RLYLALGRCLGTP--DGTYGELY-----DVTDSGQSYLT 105 Query: 152 QPHRV------MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL---------LHLDDW 196 + V +H D + +E +V + + +Q GG S L L D Sbjct: 106 KAIPVSQTRAATSMHTDSSRLETHPRWVGLACV-QQAPVGGGSRLASALAVHNHLKASDP 164 Query: 197 EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYI--------DQFV 248 L+ + AR + A + PVF D G P +RY+ + Sbjct: 165 RSLERL-QRSFARDVVTPGAVDPLALIAHNRFPVFSTDTDG-PTLRYMRYWIEKGHQRLG 222 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 QP D + L A+ + S + G LLI+N +H R+ + P R L+ Sbjct: 223 QPLDANDLRAFDALDAALNHPRFCHSFQLQEGDILLIDNHKLVHDREAYEDDPHRPRRLI 282 Query: 309 R 309 R Sbjct: 283 R 283 >UniRef50_A9UUQ7 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UUQ7_MONBE Length = 453 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 54/249 (21%), Positives = 97/249 (38%), Gaps = 29/249 (11%) Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYY 135 + L+ L + +G ++I + D++ D + A + F + +Y Sbjct: 185 SQLLEDPLPALTAVQEDGMVIITDMPTCDLRDPDTSTQQPLIEATALAMRQFGQLQRTFY 244 Query: 136 AR--FVVKNVDNSDSYLRQPHRVM--ELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 + + D +D + + LH D TY+ E + + Q+ GG SL Sbjct: 245 SDGLWDTAPKDAADVN-DTAYTNLGLPLHTDATYMREPPG-LQLFCCTAQSSDGGASLFG 302 Query: 192 HLD---------DWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMR 242 H++ + + L F P A + + NV P+F + VMR Sbjct: 303 HVNLVLKALYEQEPDLLTYCFNTPWAFQAL------ETNVHYHAMGPIFGAEAPQDVVMR 356 Query: 243 Y--IDQFVQPK---DFEEGVW--LSELSDAIETSKGI-LSVPVPVGKFLLINNLFWLHGR 294 + D+ P+ D E + L L+ + + K + S + VG+ +INN LHGR Sbjct: 357 FNPTDRAAMPQLTFDELEAYYHCLERLTALLASPKCLYHSERLAVGEMAVINNHKVLHGR 416 Query: 295 DRFTPHPDL 303 + F H +L Sbjct: 417 EAFVGHRNL 425 >UniRef50_C5PDZ6 Putative uncharacterized protein n=2 Tax=Coccidioides RepID=C5PDZ6_COCP7 Length = 311 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 64/170 (37%), Gaps = 13/170 (7%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHL-DNYFRHPLARR 210 + R H D +Y + + + GG +L D L R L+R Sbjct: 131 ETARDFPWHTDCSYEHLPPRFFALQVLQPDRCGGGTLSILDADKIAGLLSPATRRSLSRP 190 Query: 211 PMRFAAPPS--KNVSKDVFHPVFDVDQ-QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIE 267 R P K+ + + P+ D G +R+ ++ ++P + L E ++++ Sbjct: 191 EYRITVPAEFIKSDERHITAPLLSKDSGSGAAELRFREEILEPLTNGAKLALQEFGESLQ 250 Query: 268 TSKGILSVP------VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + + +P G +LINN WLH R D RR L R R Sbjct: 251 SPNAKAATLHLTPELLPRGSIILINNRRWLHARSEV---KDPRRHLRRVR 297 >UniRef50_A9UW47 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UW47_MONBE Length = 389 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 64/160 (40%), Gaps = 20/160 (12%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE-HLDNYFRHPLA-- 208 Q H +H D V E +Y+L+ + + +GG S+LL E +L P A Sbjct: 189 QSHEGGSMHTDNVNVPETWEYMLLTCL-QPAAEGGESILLSSSTVEAYLAK--HDPEALE 245 Query: 209 --RRPMRFAAPPSKNVSKDVFH-PVFDVDQQGRPVMRYIDQFVQ--------PKDFEEGV 257 + + + S+ + P+ G P R++ ++++ P + Sbjct: 246 TLKEDFLWEL---RGFSERFYRAPILFHGTDGYPCFRWLREYLESAHARAGEPLTDRQIS 302 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 L+ L++A L + G+ L N++ LHGR F Sbjct: 303 ALNALTNATLEESLQLRYNMAKGEILFANDMSLLHGRTTF 342 >UniRef50_C4JUF4 Predicted protein n=6 Tax=Eurotiomycetidae RepID=C4JUF4_UNCRE Length = 301 Score = 51.6 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 62/170 (36%), Gaps = 13/170 (7%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHL-DNYFRHPLARR 210 + + H D +Y + + + GG +L +D L R L+R Sbjct: 121 ETAQDFPWHTDCSYEHLPPRFFALQVLQPDRCGGGTLSILSIDMLVRLLSPSTRTSLSRP 180 Query: 211 PMRFAAPPS--KNVSKDVFHPVFDVDQ-QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIE 267 R PP K+ + + + D G P R+ ++ + P + L EL + Sbjct: 181 EYRITVPPEFIKSDERHITAGLLGEDPGNGAPEFRFREEILCPLTAGAKMALQELGAVLS 240 Query: 268 TSKGILSVP------VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + + + +P G +LINN WLH R D R L R R Sbjct: 241 SPQAKAATLHLTPELLPRGSVILINNRRWLHARSEV---RDPHRHLRRVR 287 >UniRef50_B6HPB8 Pc22g01880 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HPB8_PENCW Length = 298 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 63/162 (38%), Gaps = 12/162 (7%) Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHL-DNYFRHPLARRPMRF 214 E H D +Y E+ + + + GG +L++D L + + L+ + Sbjct: 124 RFEWHTDCSYEEQPPRFFALQVLQPDRYGGGTLSVLNVDRLLTLLSPFAQRWLSSYNYKI 183 Query: 215 AAPPSKNVSKDVFHPV-----FDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETS 269 PP + + V ++ Q +R+ + P + L EL + + + Sbjct: 184 NVPPEFTKTARTQYIVGNLLAVNLSQSSGSRLRFREDITVPLTLDASKALDELKEILYSG 243 Query: 270 KGILSVPVPV-----GKFLLINNLFWLHGRDRFT-PHPDLRR 305 S+ +P G ++++N WLH R+ P+ LRR Sbjct: 244 AQEESLHLPPQSLPQGSIIMMDNRRWLHSRNEVKDPNRHLRR 285 >UniRef50_A1TTH8 Taurine catabolism dioxygenase TauD/TfdA n=9 Tax=Proteobacteria RepID=A1TTH8_ACIAC Length = 289 Score = 48.1 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 41/182 (22%), Positives = 68/182 (37%), Gaps = 25/182 (13%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP-------LARRP 211 LH DG+Y+ T ++ + GG S+L D RH LA + Sbjct: 97 LHTDGSYLPIGTIKTSILLCRQHAASGGESILF--DSLSAFQALSRHDPGLAQSLLAPKV 154 Query: 212 MRFAAPPSKNVSKDVFH--PVFDVDQQGRPVMRYI---------DQFVQPKDFEEGVWLS 260 R + + + + H PVF + G+ + + P+ + +L Sbjct: 155 FRRRSTDPR-LDQQYEHIGPVFHTGENGQMASGFTLDVTADWDYSRRADPRVIDAVAYLK 213 Query: 261 ELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASNH 320 L++ S LS + G+ L++ N HGR+ + P R L+ RG F A Sbjct: 214 HLAEP--GSGYTLSFTLQRGQALVMRNDQLSHGRNAYIDDPAHPRVLL--RGLFLSAPRA 269 Query: 321 YQ 322 Q Sbjct: 270 MQ 271 >UniRef50_Q112B1 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q112B1_TRIEI Length = 371 Score = 47.7 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 72/183 (39%), Gaps = 17/183 (9%) Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD 194 Y F N L + M H D TY + + E + GG SL+ +D Sbjct: 168 YGLFAPSKTTNEGKDLAETGNAMSFHTDYTYWHTPP-LLTSLYCVENSASGGESLI--VD 224 Query: 195 DWEHLDNYFR-HP-----LARRPMRFAAPPSK-NVSKDVFHPVFDVDQQGR-PVMRYIDQ 246 + +D++ + HP L + P++F +K P+ ++D+ G+ + + + Sbjct: 225 GFRVVDDFRQQHPDYFQILTQTPIQFKQVYTKWQYFYSRTQPILELDEYGKVTRINFANS 284 Query: 247 FVQ----PKDFEEGVWLSELS--DAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 P D E + + ++ ++ + G LL+N+ +HGR FT + Sbjct: 285 HSYTWKLPFDQMEEFYAAYITFFQYVKNPVYEYCFSLEPGDLLLMNDSRIMHGRKAFTGN 344 Query: 301 PDL 303 L Sbjct: 345 RHL 347 >UniRef50_UPI0001AF241C oxygenase (secreted protein) n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF241C Length = 333 Score = 47.7 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 76/220 (34%), Gaps = 16/220 (7%) Query: 98 NAVGVDDVKQADEMVKLATAVAHLIG---RSNFDAMSGQYYARFVVKNVDNSDSYLRQPH 154 + + + + LIG N D + + R ++NS S Sbjct: 90 GTQALVGHASNGTLALVGDVLGSLIGYADEKNGDLLHDVHPVRGEEHRMENSGSV----- 144 Query: 155 RVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRF 214 + H + + D++ ++ + + + + HL L R R Sbjct: 145 -AFDFHTENVHHPLRPDFLGLLSLRQGHEATATRVASIRHAVAHLSEDENAVLRRLRFRS 203 Query: 215 AAPPSKNVSKDVFHPVFDVDQ--QGRPVMRYI--DQF-VQPKDFEEGVWLSELSDAIETS 269 P S + P + G P ++ D F +P D E L L++A+E Sbjct: 204 LFPTSFTRGRTGERPATAEHRVLFGNPGQEFLRFDSFNTKPADPEAERALGALAEALEAV 263 Query: 270 KGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 + V + G LL+NN HGR FTP D R +R Sbjct: 264 --CVEVVLEPGDLLLVNNHIAAHGRSAFTPRYDGRDRWLR 301 >UniRef50_B0JNS2 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JNS2_MICAN Length = 305 Score = 47.3 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 40/168 (23%), Positives = 68/168 (40%), Gaps = 23/168 (13%) Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD-----WEH---LDNYFRHPLARRP 211 H D T D+V ++ + + +GG+SL+++ + W+H L Y PLAR Sbjct: 138 HTDSTAKNYFPDFVGLLCLAAAD-EGGDSLVVNAANLYQYFWQHHTDLVPYLYEPLARDV 196 Query: 212 MRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYID----------QFVQPKDFEEGVWLSE 261 + S+ + P+F D QG V RY+ + QP+ + L Sbjct: 197 ITPGEINSQEAIQKNNFPLFSADSQGL-VFRYMRYWIEVAYSKLEIAQPEAITKT--LDI 253 Query: 262 LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 + D + + + G INN F H R F + R+++R Sbjct: 254 IDDFFSKPENTVRFKMKKGDVFYINNRFLCHNRTAF-KNSGKPRQMVR 300 >UniRef50_A7SHP2 Predicted protein (Fragment) n=4 Tax=Nematostella vectensis RepID=A7SHP2_NEMVE Length = 385 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 62/175 (35%), Gaps = 22/175 (12%) Query: 153 PHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP---- 206 + EL H D Y E +L+ ID+ GG + +D + L + Sbjct: 191 AYTGYELQPHTDLPYYEFKPSVILLHCIDQVRSSGGENTF--VDGYSILKAFRNDNPDGF 248 Query: 207 --LARRPMRFA----APPSKNVSKDVFHPVFDVDQQGR-PVMRYIDQFVQ-----PKDFE 254 LA P+ P + P+ ++D +GR + + D + P + Sbjct: 249 DLLASTPVLHRVKGVEPTYGEFEQLFARPIIELDVKGRIRRINFNDPLREEFLDTPAEQI 308 Query: 255 EGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 V+ +L+ K I+ + G I+N LHGR F D R L Sbjct: 309 PKVYRAYHKLTQMFYEPKFIVRNKMAPGDICAIDNDRLLHGRSAFEVKSDDLRLL 363 >UniRef50_Q2UFS3 Predicted protein n=3 Tax=Aspergillus RepID=Q2UFS3_ASPOR Length = 187 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 64/172 (37%), Gaps = 25/172 (14%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFA 215 H D +Y + + + GG ++ +D + L + L + Sbjct: 4 FPWHTDCSYEHAPPRFFALQVLQHDRYGGGTLSVMKIDRLSQFLSPTTKAALLEPEFQIT 63 Query: 216 APPSKNVSKDVFHP--------VFDVDQQGRPVM-RYIDQFVQPKDFEEGVWLSELSDAI 266 PP + + HP +F +D + +M RY D+ V P L EL A+ Sbjct: 64 IPP-----EFIKHPDQRHIVGSLFAIDTEDHCLMMRYRDEIVTPLSARAAAALKELKGAL 118 Query: 267 ET----SKGILSVP---VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + S+ L + +P +L++N WLH R+ D R L R R Sbjct: 119 QDMEALSQSTLHLTAADLPERSIILLDNYRWLHARNGI---KDPARHLRRVR 167 >UniRef50_A8TRC8 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TRC8_9PROT Length = 363 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 57/278 (20%), Positives = 98/278 (35%), Gaps = 46/278 (16%) Query: 68 AKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHL------ 121 A + A +L LL + R G L++ + V + D + A AH+ Sbjct: 66 ADFPLPILAPKLSALLDQLESGR--GVALLSGLPVQNYDDEDLRMLWAGIGAHVGPLLPQ 123 Query: 122 -----IGRSNFDAMSGQYYARFVVKNVDNSDSYLR-QPHRVMELHNDGTYVEEITDYVLM 175 + R D SG++ A K ++ S + Q + + H D D + + Sbjct: 124 SIDGKLMRDVRDEASGKFQAHAPSKAENSLTSTQKAQSNGPLRFHTD------RADVLGL 177 Query: 176 MKIDEQNMQGGNSLLLH--------LDDWEHLDNYFRHPLARRPMRFAAPPSKNVS--KD 225 + + Q GG S + L L + P + +++V Sbjct: 178 LCVR-QAGAGGESKVASSLAVRNEILKRRPDLHDLLCQPY------WRTRETQDVGGGHK 230 Query: 226 VF-HPVFDVDQQGRPVMRYIDQFVQPKDFEEGV--WLSELSDAIE-----TSKGILSVPV 277 VF PVF + G +Y FV+ +GV E+ A++ + + Sbjct: 231 VFAMPVFSF-RDGHFSSQYSRTFVEEAQRIDGVPKMTPEMDAALDLLAEVAEEKCHVFRL 289 Query: 278 PVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFA 315 G+ + NN HGR FT R + + R +FA Sbjct: 290 QPGEMVFYNNHLVYHGRTPFTDDAASRADRLLYRLWFA 327 >UniRef50_A8TRK7 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TRK7_9PROT Length = 390 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 64/160 (40%), Gaps = 15/160 (9%) Query: 150 LRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRH-PLA 208 L Q +R +E H D Y Y+L+ + + GG+S L +D + + R P A Sbjct: 193 LSQTNRELEPHADNPYRLPAPGYILLHCLR-NDADGGDSTL--VDGFHVAEILRRDDPDA 249 Query: 209 RRPMRFAAPPSKNVSKD--VFH--PVFDVDQQGR-PVMRYIDQFVQPKDFEEGVWL---- 259 + A + V D + H P+ ++ G +R+ ++ +P G Sbjct: 250 FDVLTTTATRFRYVDPDTVLEHYGPLIELAPDGSVRRLRFNNRTEEPPALPAGRLAAYYA 309 Query: 260 --SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 + + S L + G+ L+INN LHGR + Sbjct: 310 ARQRYATLLHASSNTLVFKLEPGQLLMINNYRLLHGRRGY 349 >UniRef50_D2SNE3 Putative uncharacterized protein n=1 Tax=Streptomyces fradiae RepID=D2SNE3_STRFR Length = 419 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 70/176 (39%), Gaps = 15/176 (8%) Query: 144 DNSDSYLRQPHRVMELHNDG-TYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNY 202 D D+Y Q + + H++G + Y+ + + + +GG + +L D + LD + Sbjct: 244 DMDDTYHAQNRKSLLPHSEGYEFRGVPPRYLGLWCVTPASGEGGETTML--DGNQILDEF 301 Query: 203 F---RHPLARRPMRFAAP---PSKNVSKDVFHPVFDVDQQGRPVMRY-IDQFVQPKDFEE 255 R L + + + V V HPV + ++ G V R+ + + P+ E Sbjct: 302 TEEERQRLFDTTYEWKSTDGLSRRGVDFRVEHPVLE-NRNGGRVFRFSYNNMIVPEGDEL 360 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 L E I ++V ++ +N LH R+ F D R L R + Sbjct: 361 ATRLRERGKEIYDE-NHIAVSYEQRDLIVWDNWRMLHSRNAFE---DPSRHLKRVQ 412 >UniRef50_C7ZJ20 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7ZJ20_NECH7 Length = 323 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 64/167 (38%), Gaps = 15/167 (8%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFA 215 H D +Y + + + + GG ++++D E L R L + R Sbjct: 149 FPWHTDCSYEDPPPRFFALQVLQHDRCGGGTLSVMNVDKLSELLSPEIRSALLAQEYRIT 208 Query: 216 APPS--KNVSKD-VFHPVFDVDQQGRPVM-RYIDQFVQPKDFEEGVWLSELSDAI----- 266 PP K+ + + VF + M R+ + + P L EL +A+ Sbjct: 209 IPPEFIKDPEQKHIIGSVFVTSPNDQSTMIRFREDILTPLTDRASRALVELKEALLKEEV 268 Query: 267 --ETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 ++ + S +P G +L++N WLH R+ D R L R R Sbjct: 269 QAHSTVHLKSADLPKGSIILMDNRRWLHARNDI---KDPERHLRRVR 312 >UniRef50_A0YX02 Gamma-butyrobetaine hydroxylase, putative n=2 Tax=Lyngbya sp. PCC 8106 RepID=A0YX02_9CYAN Length = 378 Score = 44.6 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 76/198 (38%), Gaps = 22/198 (11%) Query: 122 IGRSNFDAMSGQYYARFVVKNVDNS-DSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 IG +++ G+Y VK + N+ D L + H D T++ V ++ E Sbjct: 157 IGPAHYLGKYGRYSP---VKAIPNAQDLSLSAEGNELSPHTDITFMSTPP-LVQLLYCVE 212 Query: 181 QNMQGGNSLLLHLDDWEHLDNYFRHP------LARRPMRFAAPPSKNVSKDVFH--PVFD 232 GG S+L +D ++ ++ +H L + P++F + V P+ + Sbjct: 213 NLATGGESVL--VDGFKVARDFQQHHPQYFEILTKVPVKFEQ-FYQEWEYYVSRTTPIIE 269 Query: 233 VDQQGRPVMRYIDQ--FVQPKDFEEGVWLSELSDA----IETSKGILSVPVPVGKFLLIN 286 ++Q G Y F F++ E ++ + G LL+ Sbjct: 270 LEQDGLVSGIYFSHKNFSSQLPFDQVEEFYEAYKTFFLYLKNPAYQYWFRLEPGDCLLVE 329 Query: 287 NLFWLHGRDRFTPHPDLR 304 N LHGR F P+ +R Sbjct: 330 NFRVLHGRKAFNPNSGMR 347 >UniRef50_Q2CHG0 Putative uncharacterized protein n=1 Tax=Oceanicola granulosus HTCC2516 RepID=Q2CHG0_9RHOB Length = 269 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 40/165 (24%), Positives = 56/165 (33%), Gaps = 26/165 (15%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFA 215 + LH D +Y+ + VL I GG S + DD L LAR F Sbjct: 114 LALHTDSSYLAKPHPLVLFQFIRSA-ADGGASTMACADDIVAALPPPLVETLARPQFPFG 172 Query: 216 APPSKNVSKDVFHPVFDVDQQGRPVMRY----IDQFVQ-----PKDFEEGVWLSELSDAI 266 P P+ ++ P MRY ID + P+ L L + + Sbjct: 173 KGPM---------PIL-FGRRSAPQMRYYRSQIDTAAEEAGGLPQPLVTA--LDRLDEIL 220 Query: 267 ETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 E G+ + + N LHGR F D R + R R Sbjct: 221 EQVP-TYEFKAQPGEIVFMQNTRVLHGRRGFGGDSD--RLMYRIR 262 >UniRef50_C7N0A3 Taurine catabolism dioxygenase TauD, TfdA family n=2 Tax=Actinomycetales RepID=C7N0A3_SACVD Length = 327 Score = 44.2 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 64/177 (36%), Gaps = 34/177 (19%) Query: 155 RVMELHNDGTYVEEITDYVLM--MKIDEQN----MQGGNSLLLHLDDWEHLDNYFRHPL- 207 ++E H + Y + YV++ + D +N + + L D + P+ Sbjct: 143 TLLEFHTEMAYHQHQPQYVMLACSRSDHENKAATLVASIRRAIQLIDEKTKSRLMDRPIP 202 Query: 208 --------ARRPMRFAAPPSKN--VSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGV 257 P PP++ +S D P+ D+ + + P + E+ Sbjct: 203 CNVDVSFRGDDPELKKGPPARVCVLSGDPEDPMLGYDR----------ELLAPDNAEDEQ 252 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD-----LRRELMR 309 LS LS A++ V + G L+++N H R F P D L R +R Sbjct: 253 ALSVLSKALDEV--TKPVKLSPGDLLIVDNYRTTHARTPFKPRWDGRDRWLHRMYIR 307 >UniRef50_C3K215 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3K215_PSEFS Length = 315 Score = 43.9 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 56/151 (37%), Gaps = 15/151 (9%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARR----PMRF 214 +HNDG DY+++ ++ + GG S+L +D P F Sbjct: 137 VHNDGVSDPLPIDYLIL-ACGQKALLGGESIL--IDASAVYAELMTFPEILEELKCDFFF 193 Query: 215 AAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQ--------PKDFEEGVWLSELSDAI 266 + K P+ +G P++RY +++ P + L+ L + Sbjct: 194 ENRGMSDEEKLFKAPILSFSNEGIPLIRYFRVYIESAHLKAGVPLTLAQSQALNFLDTVL 253 Query: 267 ETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 + S V + G+ L+ + +LH R F Sbjct: 254 DQSSVQHRVLLEPGQILISADNKFLHTRTHF 284 >UniRef50_UPI0000521F63 PREDICTED: similar to CG14630 CG14630-PA n=1 Tax=Ciona intestinalis RepID=UPI0000521F63 Length = 404 Score = 43.5 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 63/166 (37%), Gaps = 19/166 (11%) Query: 157 MELHNDGTYVEEITDYVLMMKID-EQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFA 215 + LH D + E L+ I + +QGG SLL +D + L+ + + + Sbjct: 203 LPLHTDQCHYEAGPGVQLLHAIQFDDCVQGGESLL--VDMFYVLETFRKEFPEDFNILSK 260 Query: 216 AP-PSKNVSKDVFHPVFDVDQQGRPVMRYIDQ---FVQPKDFEEGVWLS----------- 260 P P + +P + ++ V Y DQ F K EE + + Sbjct: 261 VPVPFGTIDYQRKNPCYLYTRKPVIVTDYDDQIVGFNFNKGIEEPLRIHAKYVEKFYQAY 320 Query: 261 -ELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRR 305 +L I ++ + + G+ L NN LH R+ + + RR Sbjct: 321 NKLDRMINRNEFVFKHRLRTGELLFFNNRRMLHSREAYMSNGGRRR 366 >UniRef50_B6EK40 Putative uncharacterized protein n=1 Tax=Aliivibrio salmonicida LFI1238 RepID=B6EK40_ALISL Length = 340 Score = 43.5 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 25/110 (22%), Positives = 42/110 (38%), Gaps = 1/110 (0%) Query: 207 LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAI 266 L++ + P S S P+ D G RY + P E L + Sbjct: 214 LSKPHFEISRPDSFKQSVKTILPLISFDNDGVAYCRYDKENTTPLTTEAAAALVMWEAQL 273 Query: 267 ETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP-HPDLRRELMRQRGYFA 315 + ++ ++ G FL+I N +H R+ F+P R L+R G + Sbjct: 274 KNTELNNAITYQPGDFLIIKNQRLMHSREGFSPRDDGTDRWLIRLFGMSS 323 >UniRef50_D0N1N0 Trimethyllysine dioxygenase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N1N0_PHYIN Length = 435 Score = 43.1 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 39/171 (22%), Positives = 72/171 (42%), Gaps = 18/171 (10%) Query: 143 VDNSDSYLRQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLD 200 ++++ Y +EL H DGTY+ + + + Q +GG S ++D + ++ Sbjct: 242 TNDAEDYNDTASTNLELLHHTDGTYIRDPPG-LQIFNCAAQAGEGGESR--YVDAFHVVE 298 Query: 201 NYFR-HPLARRPMRFAAPPSKNVSKDVF----HPVFDVDQQGRPV-MRYIDQFVQPKD-- 252 + +P A R + + P V D P+ VD G V R+ D P Sbjct: 299 TLRKENPEAFRVLSTTSLPYFTVDNDAHLATMEPLIRVDYAGNVVQFRHNDYDRAPLTHL 358 Query: 253 -FEE-GVWLS---ELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT 298 FEE G + +L + + + + + VG ++++N +HGR F Sbjct: 359 SFEEVGEFYQAHRKLLEVLRRPEMEFCMKLQVGDMVVVDNQRVMHGRHAFQ 409 >UniRef50_UPI000023E985 hypothetical protein FG04441.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023E985 Length = 291 Score = 43.1 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 61/160 (38%), Gaps = 11/160 (6%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFA 215 H D +Y + + Y + + GG +++++ E L + L R Sbjct: 130 FPWHTDCSYEDPLPRYFALQVLQHDRYGGGTLSVMNVEKLNELLSPESKAALMSSEFRIE 189 Query: 216 APPS--KNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAI------- 266 PP KN K ++R+ + V P + L EL DA+ Sbjct: 190 IPPEFIKNADKKHITGSILKSNGESTMIRFREDIVTPLTDRARLALQELRDALVQHEVQA 249 Query: 267 ETSKGILSVPVPVGKFLLINNLFWLHGRDRFT-PHPDLRR 305 T+ + S +P G +L++N WLH R+ P LRR Sbjct: 250 HTTVHLKSSDLPKGSIILMDNRRWLHARNDIKDPERHLRR 289 >UniRef50_A0KBP7 Putative uncharacterized protein n=5 Tax=Burkholderia cepacia complex RepID=A0KBP7_BURCH Length = 373 Score = 43.1 bits (100), Expect = 0.015, Method: Composition-based stats. Identities = 35/114 (30%), Positives = 47/114 (41%), Gaps = 5/114 (4%) Query: 190 LLHLDDWEHLDNYFRHPLARRPMRF-AAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFV 248 LL DD L Y H + R P R+ A P+ + D+ + R + + V Sbjct: 195 LLSPDDIAQL--YGEHYIIRVPYRWRGAAPTPRDNTDLSAVLSGPLDAPRVTVAFYPDMV 252 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 + L+ L A+ V V GK +LINN F LH RDRF P D Sbjct: 253 LAVNTRAQEALANLYRAVREVS--FGVQVSPGKLVLINNHFTLHSRDRFDPQYD 304 >UniRef50_Q1AWV7 Putative uncharacterized protein n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWV7_RUBXD Length = 316 Score = 43.1 bits (100), Expect = 0.015, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 60/164 (36%), Gaps = 18/164 (10%) Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPS 219 H + DY+ +M + +N + +DD E LD PL F PS Sbjct: 147 HTEDARYSYRGDYIGLMCL--RNPDAVPTTYASIDDIE-LDPERAAPLFEPRFVFRPDPS 203 Query: 220 KNVSKDVFHPVFDVDQQGRPVMRYIDQFV--QPKDFEEGVWLSELSDAIETSKGILSVPV 277 P +R+ D + +P+D E + L+ ++ + + V + Sbjct: 204 HPTDTGCERASILFGDPSSPYLRF-DPYSMDRPEDEEARAAMDYLAGELD--RRLTGVAL 260 Query: 278 PVGKFLLINNLFWLHGRDRFTPHPD----------LRRELMRQR 311 G+ L I+N +HGR F D + R+L R R Sbjct: 261 RPGECLFIDNYKVVHGRSAFKARFDGTDRWLKRVNITRDLRRSR 304 >UniRef50_Q9NF72 EG:BACR7A4.9 protein n=10 Tax=Drosophila RepID=Q9NF72_DROME Length = 504 Score = 42.7 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 71/193 (36%), Gaps = 30/193 (15%) Query: 138 FVVKNVDNSDSYLRQPH--RVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL----- 190 F VK+ N+ +Y + + LH D Y E ++ + + +GG + L Sbjct: 297 FEVKSKPNARNY---AYLMTPLPLHTDMPYYEYKAGINILHTLVQSESKGGANTLTDGFN 353 Query: 191 -------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRY 243 LH +D+E L + P+ + SK PV +D GR Sbjct: 354 VASQLQKLHPEDFEVLKSV---PVNWFDIGHDGDDSKPFHSLWRAPVICLDVDGR--FAR 408 Query: 244 IDQFVQPKD------FEEGV-WLSELSDAIETSKG-ILSVPVPVGKFLLINNLFWLHGRD 295 I+Q +D + V W +E ++ + G + NNL LHGR Sbjct: 409 INQNTTKRDSRFSVSLAQAVSWYKAYDKFLEIAQSEAVEFKTQAGDVFVFNNLRMLHGRT 468 Query: 296 RFTPHPDLRRELM 308 + P +R L+ Sbjct: 469 AYEDAPGNKRHLV 481 >UniRef50_B3T5P9 Putative gamma-butyrobetaine hydroxylase n=1 Tax=uncultured marine microorganism HF4000_ANIW141K23 RepID=B3T5P9_9ZZZZ Length = 309 Score = 42.3 bits (98), Expect = 0.022, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 54/167 (32%), Gaps = 29/167 (17%) Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH--------LDDWEHL-----DNYFRHP 206 H D + ++ D + M+ I+ Q +GG S + L + + + + Sbjct: 141 HTDSPHWTKVPDLIGMLCIN-QAKKGGISKFVSAYTIHNQLLKEQNDILKTLYEKFHFDK 199 Query: 207 LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFV-------QPKDFEEGVWL 259 + N + VF P+F ID V P + L Sbjct: 200 RGEFKI--------NEPQTVFEPIFVFKNDKLYCRFLIDYIVAGHQIQNYPLSKLQETAL 251 Query: 260 SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRE 306 L + E +LS + +N LHGR F + D R+ Sbjct: 252 QSLEEISENENNVLSYDLKANDMTFFDNHRILHGRTEFEDYEDENRK 298 >UniRef50_A8TTL2 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TTL2_9PROT Length = 385 Score = 41.5 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 52/143 (36%), Gaps = 24/143 (16%) Query: 180 EQNMQGGNSLLL---HLDD---------WEHLDNY---FRHPLARRPMRFAAPPSKNVSK 224 E GG SLL+ H + WE L FR A +R+ P + Sbjct: 225 EFGATGGESLLVDGFHAAEQLRAVDPQAWEVLTRVGLPFRFHDADCDVRWKGTPIALDAD 284 Query: 225 DVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKF 282 +H V + + +D P + V+ L + ++ +L + G Sbjct: 285 GRYHEV----RYNPGLGAALD---VPATQVKQVYRALHAFAARLKDPANVLQFKLQAGDM 337 Query: 283 LLINNLFWLHGRDRFTPHPDLRR 305 ++ NN LHGR F P+ R+ Sbjct: 338 MVFNNRRVLHGRAAFDPNTGPRK 360 >UniRef50_Q6CQT2 KLLA0D14553p n=1 Tax=Kluyveromyces lactis RepID=Q6CQT2_KLULA Length = 420 Score = 41.5 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 68/193 (35%), Gaps = 41/193 (21%) Query: 133 QYYAR-FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 +Y F VKN + + + + + LH D Y+E I + L+ I N +G Sbjct: 213 TFYGELFDVKNQASQANNIAYTAKPLPLHMDLLYLENIPGWQLLHCIK--NSEGLE---- 266 Query: 192 HLDDWEHLDNYFRHPLA-------RRPMRFAAPPSKNVSKD--------------VFHPV 230 E+ NYF L + P A + ++ V H Sbjct: 267 -----ENGQNYFVDSLGALNYIKNKDPSVLKALETIPITYHYRRDDKRYYQQRPLVEHKK 321 Query: 231 FDVDQQGRP------VMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLL 284 ++ P ++ I F++G+++ E + I K + +P ++ Sbjct: 322 YETVVNYSPPFQGPFNLKDITDIPLLNQFKKGLYMFE--EYINDPKNQFQIKLPENSCVI 379 Query: 285 INNLFWLHGRDRF 297 +N LH R +F Sbjct: 380 FHNRRILHARRQF 392 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7LDM7 Protein csiD n=104 Tax=Bacteria RepID=CSID_ECO55 432 e-120 UniRef50_D0RN41 Protein CsiD n=1 Tax=alpha proteobacterium HIMB1... 396 e-109 UniRef50_A9VGL1 Putative uncharacterized protein n=17 Tax=Bacter... 392 e-108 UniRef50_Q47V67 Putative uncharacterized protein n=2 Tax=Alterom... 386 e-106 UniRef50_Q4FKY1 Gab protein n=3 Tax=Candidatus Pelagibacter RepI... 381 e-104 UniRef50_Q2KU12 Carbon starvation-inducible protein n=1 Tax=Bord... 369 e-101 UniRef50_A8TRC8 Putative uncharacterized protein n=1 Tax=alpha p... 184 3e-45 UniRef50_Q05XB9 Putative uncharacterized protein n=1 Tax=Synecho... 177 4e-43 UniRef50_A9UUQ7 Predicted protein n=1 Tax=Monosiga brevicollis R... 174 3e-42 UniRef50_UPI0001AF241C oxygenase (secreted protein) n=1 Tax=Stre... 171 3e-41 UniRef50_C4JUF4 Predicted protein n=6 Tax=Eurotiomycetidae RepID... 160 7e-38 UniRef50_C5PDZ6 Putative uncharacterized protein n=2 Tax=Coccidi... 157 6e-37 UniRef50_B6HPB8 Pc22g01880 protein n=1 Tax=Penicillium chrysogen... 145 2e-33 UniRef50_Q112B1 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax... 141 4e-32 UniRef50_Q2UFS3 Predicted protein n=3 Tax=Aspergillus RepID=Q2UF... 137 4e-31 UniRef50_A7SHP2 Predicted protein (Fragment) n=4 Tax=Nematostell... 131 5e-29 UniRef50_A1TTH8 Taurine catabolism dioxygenase TauD/TfdA n=9 Tax... 130 7e-29 UniRef50_B0JNS2 Putative uncharacterized protein n=1 Tax=Microcy... 126 9e-28 UniRef50_A9UW47 Predicted protein n=1 Tax=Monosiga brevicollis R... 125 3e-27 Sequences not found previously or not previously below threshold: UniRef50_C7ZJ20 Putative uncharacterized protein n=1 Tax=Nectria... 131 4e-29 UniRef50_UPI000023E985 hypothetical protein FG04441.1 n=1 Tax=Gi... 115 2e-24 UniRef50_Q2MF05 Putative oxygenase, TobO n=1 Tax=Streptomyces sp... 111 3e-23 UniRef50_A7HQL7 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax... 95 4e-18 UniRef50_Q9Z4Z5 L-asparagine oxygenase n=5 Tax=Actinomycetales R... 91 5e-17 UniRef50_A3JU53 Putative uncharacterized protein n=1 Tax=Rhodoba... 90 8e-17 UniRef50_B5JAG6 Taurine catabolism dioxygenase TauD, TfdA family... 90 8e-17 UniRef50_Q98KK0 Probable gamma-butyrobetaine dioxygenase n=11 Ta... 88 3e-16 UniRef50_UPI0001B558E5 oxygenase n=2 Tax=Streptomyces RepID=UPI0... 88 3e-16 UniRef50_D0N1N0 Trimethyllysine dioxygenase, putative n=1 Tax=Ph... 88 4e-16 UniRef50_A0YX02 Gamma-butyrobetaine hydroxylase, putative n=2 Ta... 88 6e-16 UniRef50_A4D938 CrpF n=1 Tax=Nostoc sp. ATCC 53789 RepID=A4D938_... 87 7e-16 UniRef50_Q1QTU1 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 87 8e-16 UniRef50_A8PQ25 Taurine catabolism dioxygenase TauD, TfdA family... 87 9e-16 UniRef50_D0LMF7 Gamma-butyrobetaine dioxygenase n=1 Tax=Haliangi... 87 1e-15 UniRef50_A6C4G7 Putative uncharacterized protein n=1 Tax=Plancto... 86 1e-15 UniRef50_B3T5P9 Putative gamma-butyrobetaine hydroxylase n=1 Tax... 86 1e-15 UniRef50_A8TRK7 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 86 2e-15 UniRef50_Q5KP77 Mitochondrion protein, putative n=3 Tax=Filobasi... 85 3e-15 UniRef50_B5K1H8 Taurine catabolism dioxygenase TauD, TfdA family... 85 3e-15 UniRef50_B3RYG1 Putative uncharacterized protein n=1 Tax=Trichop... 84 6e-15 UniRef50_Q2CHG0 Putative uncharacterized protein n=1 Tax=Oceanic... 84 9e-15 UniRef50_D1UM90 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax... 84 9e-15 UniRef50_B6H119 Pc12g14480 protein n=1 Tax=Penicillium chrysogen... 84 9e-15 UniRef50_O75936 Gamma-butyrobetaine dioxygenase n=28 Tax=Euteleo... 83 1e-14 UniRef50_D1VL46 Gamma-butyrobetaine dioxygenase n=1 Tax=Frankia ... 83 1e-14 UniRef50_B3SBK5 Putative uncharacterized protein n=1 Tax=Trichop... 83 2e-14 UniRef50_UPI0000521F63 PREDICTED: similar to CG14630 CG14630-PA ... 83 2e-14 UniRef50_Q13PB1 Putative uncharacterized protein n=1 Tax=Burkhol... 83 2e-14 UniRef50_A4KCF3 TMC biosynthetic enzyme L5 n=2 Tax=Streptomyces ... 82 2e-14 UniRef50_B0C2I2 Putative uncharacterized protein n=1 Tax=Acaryoc... 82 2e-14 UniRef50_Q9NF72 EG:BACR7A4.9 protein n=10 Tax=Drosophila RepID=Q... 82 2e-14 UniRef50_C3YQN6 Putative uncharacterized protein n=2 Tax=Branchi... 82 3e-14 UniRef50_B7S2Z5 Taurine catabolism dioxygenase TauD, TfdA family... 82 3e-14 UniRef50_Q5A0G4 Potential gamma-butyrobetaine hydroxylase n=2 Ta... 81 4e-14 UniRef50_Q45R77 Possible hydrolase n=1 Tax=Streptomyces fradiae ... 81 4e-14 UniRef50_C3K215 Putative uncharacterized protein n=1 Tax=Pseudom... 81 5e-14 UniRef50_C0SMX2 Predicted hydroxylase n=1 Tax=Streptomyces spiro... 81 5e-14 UniRef50_B5GF41 Putative uncharacterized protein n=2 Tax=Strepto... 81 7e-14 UniRef50_Q7N4X1 Similar to hypothetical gamma-butyrobetaine n=1 ... 80 9e-14 UniRef50_Q1AWV7 Putative uncharacterized protein n=1 Tax=Rubroba... 80 9e-14 UniRef50_A4S5K2 Predicted protein n=2 Tax=Ostreococcus RepID=A4S... 80 1e-13 UniRef50_UPI0000E48C37 PREDICTED: similar to gamma butyrobetaine... 80 1e-13 UniRef50_A6SL62 Putative uncharacterized protein n=2 Tax=Sclerot... 79 1e-13 UniRef50_P80193 Gamma-butyrobetaine dioxygenase n=18 Tax=Proteob... 79 2e-13 UniRef50_B6GWY4 Pc12g00050 protein n=1 Tax=Penicillium chrysogen... 79 2e-13 UniRef50_C1YTV8 Taurine catabolism dioxygenase TauD, TfdA family... 79 2e-13 UniRef50_A8TJ02 Putative uncharacterized protein n=1 Tax=alpha p... 79 2e-13 UniRef50_A8TTL2 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 79 2e-13 UniRef50_C7G1S1 Putative dioxygenase n=1 Tax=Streptomyces griseu... 79 2e-13 UniRef50_Q7N1E8 Similarities with putative oxygenase and clavami... 79 2e-13 UniRef50_B2AD39 Predicted CDS Pa_3_10500 n=1 Tax=Podospora anser... 79 3e-13 UniRef50_UPI0000586B6F PREDICTED: hypothetical protein n=2 Tax=S... 79 3e-13 UniRef50_Q1GKN1 Gamma-butyrobetaine2-oxoglutarate dioxygenase n=... 79 3e-13 UniRef50_A2R6Y2 Contig An16c0060, complete genome n=4 Tax=Tricho... 79 3e-13 UniRef50_C5E3H3 KLTH0H13574p n=3 Tax=Saccharomycetaceae RepID=C5... 78 4e-13 UniRef50_A4SJT0 Pyoverdine biosynthesis protein n=9 Tax=Gammapro... 78 5e-13 UniRef50_Q05582 Clavaminate synthase 2 n=7 Tax=Streptomyces RepI... 77 6e-13 UniRef50_Q7QKQ0 AGAP012477-PA (Fragment) n=4 Tax=Diptera RepID=Q... 77 6e-13 UniRef50_Q2GN61 Putative uncharacterized protein n=1 Tax=Chaetom... 77 7e-13 UniRef50_B6EK40 Putative uncharacterized protein n=1 Tax=Aliivib... 77 8e-13 UniRef50_Q4WLY7 Putative uncharacterized protein n=5 Tax=Trichoc... 77 8e-13 UniRef50_UPI0001B5617D Taurine catabolism dioxygenase TauD/TfdA ... 77 8e-13 UniRef50_B6QK04 Gamma-butyrobetaine hydroxylase subfamily, putat... 77 8e-13 UniRef50_A7S7D2 Predicted protein n=2 Tax=Nematostella vectensis... 77 1e-12 UniRef50_C7N0A3 Taurine catabolism dioxygenase TauD, TfdA family... 77 1e-12 UniRef50_C4NCK2 Dioxygenase n=1 Tax=Streptomyces sp. MK730-62F2 ... 77 1e-12 UniRef50_Q21526 Protein M05D6.7, partially confirmed by transcri... 76 1e-12 UniRef50_Q2UCW9 Predicted gamma-butyrobetaine n=3 Tax=Aspergillu... 76 2e-12 UniRef50_UPI0000587DDD PREDICTED: hypothetical protein n=1 Tax=S... 76 2e-12 UniRef50_B3SAJ9 Putative uncharacterized protein n=2 Tax=Trichop... 76 2e-12 UniRef50_A4TYV3 PA0187 n=5 Tax=Proteobacteria RepID=A4TYV3_9PROT 76 2e-12 UniRef50_Q643C1 Predicted non-heme iron hydroxylase MppO n=1 Tax... 76 2e-12 UniRef50_B3SDF1 Putative uncharacterized protein n=3 Tax=Trichop... 76 2e-12 UniRef50_Q097J3 Gamma-butyrobetaine dioxygenase n=1 Tax=Stigmate... 76 2e-12 UniRef50_Q2JKI0 Clavaminate synthase n=1 Tax=Synechococcus sp. J... 76 2e-12 UniRef50_A3YI34 Gamma-butyrobetaine hydroxylase n=1 Tax=Marinomo... 75 3e-12 UniRef50_B2VF28 Clavaminate synthase 1 n=1 Tax=Erwinia tasmanien... 75 3e-12 UniRef50_C6XP42 Taurine catabolism dioxygenase TauD/TfdA n=2 Tax... 75 3e-12 UniRef50_B9WJZ0 Uncharacterized oxidoreductase (Gamma-butyrobeta... 75 3e-12 UniRef50_C1E2T4 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 75 4e-12 UniRef50_D2SNE3 Putative uncharacterized protein n=1 Tax=Strepto... 75 4e-12 UniRef50_B7V2A4 Putative uncharacterized protein n=5 Tax=Pseudom... 75 4e-12 UniRef50_Q1R056 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 75 4e-12 UniRef50_B6BSV9 Gamma-butyrobetaine,2-oxoglutarate dioxygenase, ... 75 4e-12 UniRef50_A4VRE5 Predicted non-heme iron hydroxylase MppO n=1 Tax... 74 5e-12 UniRef50_B5GLY6 Putative uncharacterized protein n=1 Tax=Strepto... 74 5e-12 UniRef50_UPI000186F191 trimethyllysine dioxygenase, putative n=1... 74 6e-12 UniRef50_UPI000192475E PREDICTED: similar to Trimethyllysine dio... 74 7e-12 UniRef50_B6Q7A5 TfdA family oxidoreductase, putative n=7 Tax=Leo... 74 7e-12 UniRef50_B7S301 Taurine catabolism dioxygenase TauD, TfdA family... 74 7e-12 UniRef50_C7YUK5 Putative uncharacterized protein n=1 Tax=Nectria... 74 7e-12 UniRef50_Q7WGN5 Putative uncharacterized protein n=3 Tax=Bordete... 74 8e-12 UniRef50_C3Y5M9 Putative uncharacterized protein n=1 Tax=Branchi... 74 8e-12 UniRef50_B6BQI4 Gamma-butyrobetaine hydroxylase n=1 Tax=Candidat... 74 9e-12 UniRef50_Q1YSL8 Gamma-butyrobetaine hydroxylase n=1 Tax=gamma pr... 73 1e-11 UniRef50_A5DCB6 Trimethyllysine dioxygenase n=9 Tax=Saccharomyce... 73 1e-11 UniRef50_A8HS86 Predicted protein n=2 Tax=Chlamydomonas reinhard... 73 1e-11 UniRef50_Q3SG12 Putative uncharacterized protein n=3 Tax=Proteob... 73 1e-11 UniRef50_A3NJS9 Taurine catabolism dioxygenase, TauD/TfdA family... 73 2e-11 UniRef50_Q1QSP3 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax... 73 2e-11 UniRef50_Q5KF50 Mitochondrion protein, putative n=2 Tax=Filobasi... 72 2e-11 UniRef50_B3RWG0 Putative uncharacterized protein n=7 Tax=Trichop... 72 2e-11 UniRef50_A8TU20 Gamma-butyrobetaine hydroxylase, putative n=1 Ta... 72 2e-11 UniRef50_A7SLB9 Predicted protein (Fragment) n=1 Tax=Nematostell... 72 3e-11 UniRef50_A8U3F8 Putative uncharacterized protein n=1 Tax=alpha p... 72 3e-11 UniRef50_C7YXK7 Putative uncharacterized protein (Fragment) n=1 ... 72 4e-11 UniRef50_C3K0C1 Putative gamma-butyrobetaine dioxygenase n=1 Tax... 71 4e-11 UniRef50_B9K395 Putative uncharacterized protein n=1 Tax=Agrobac... 71 4e-11 UniRef50_Q4V6I6 IP11337p (Fragment) n=10 Tax=Drosophila RepID=Q4... 71 4e-11 UniRef50_A8TSD0 Putative uncharacterized protein n=1 Tax=alpha p... 71 6e-11 UniRef50_A4R0Y1 Putative uncharacterized protein n=1 Tax=Magnapo... 71 7e-11 UniRef50_Q2UHV9 Predicted protein n=2 Tax=Aspergillus RepID=Q2UH... 71 8e-11 UniRef50_UPI000023D763 hypothetical protein FG05953.1 n=1 Tax=Gi... 70 1e-10 UniRef50_UPI000180D107 PREDICTED: similar to gamma-butyrobetaine... 70 1e-10 UniRef50_Q4V6C2 IP11527p (Fragment) n=6 Tax=melanogaster subgrou... 70 1e-10 UniRef50_A2R5A1 Catalytic activity: H. sapiens BBH converts gamm... 70 1e-10 UniRef50_Q1GF28 Gamma-butyrobetaine2-oxoglutarate dioxygenase n=... 70 1e-10 UniRef50_B0XU78 Gamma-butyrobetaine hydroxylase subfamily, putat... 70 1e-10 UniRef50_Q4V6P2 IP11427p n=19 Tax=Drosophila RepID=Q4V6P2_DROME 69 2e-10 UniRef50_C4JHH9 Predicted protein n=3 Tax=Onygenales RepID=C4JHH... 69 2e-10 UniRef50_D2VKX3 Predicted protein n=1 Tax=Naegleria gruberi RepI... 69 2e-10 UniRef50_B0XCW0 Gamma-butyrobetaine dioxygenase n=1 Tax=Culex qu... 69 2e-10 UniRef50_C8VBQ5 Gamma-butyrobetaine hydroxylase subfamily, putat... 69 3e-10 UniRef50_A3YAS9 Gamma-butyrobetaine hydroxylase n=1 Tax=Marinomo... 69 3e-10 UniRef50_B6H3W1 Pc13g10890 protein n=19 Tax=Dikarya RepID=B6H3W1... 68 4e-10 UniRef50_A8H4N7 Trimethyllysine dioxygenase n=2 Tax=Shewanella R... 68 4e-10 UniRef50_B5HKD0 Oxygenase n=1 Tax=Streptomyces pristinaespiralis... 68 5e-10 UniRef50_Q10ZR8 Putative uncharacterized protein n=2 Tax=Trichod... 67 5e-10 UniRef50_C6WRI3 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax... 67 6e-10 UniRef50_C3ZMX7 Putative uncharacterized protein (Fragment) n=1 ... 67 6e-10 UniRef50_A1SFH6 Oxygenase (Secreted protein) n=1 Tax=Nocardioide... 67 6e-10 UniRef50_A9LFI0 Clavaminic acid synthetase-like protein 1 (Fragm... 67 7e-10 UniRef50_A0P0W1 Putative uncharacterized protein n=1 Tax=Labrenz... 67 7e-10 UniRef50_C4XYW1 Putative uncharacterized protein n=2 Tax=Sacchar... 67 7e-10 UniRef50_Q7K4A8 CG10814 n=19 Tax=Drosophila RepID=Q7K4A8_DROME 67 7e-10 UniRef50_C0P0S4 Gamma-butyrobetaine dioxygenase n=7 Tax=Onygenal... 67 7e-10 UniRef50_B2WEF8 Trimethyllysine dioxygenase n=2 Tax=Pleosporinea... 67 7e-10 UniRef50_B8M6P7 Trimethyllysine dioxygenase TmlH, putative n=1 T... 67 8e-10 UniRef50_C5P9Y0 Trimethyllysine dioxygenase, putative n=2 Tax=Co... 67 8e-10 UniRef50_D1KB60 Putative uncharacterized protein n=1 Tax=uncultu... 67 8e-10 UniRef50_A4FLP1 Oxygenase n=1 Tax=Saccharopolyspora erythraea NR... 67 8e-10 UniRef50_C3Z1Z7 Putative uncharacterized protein n=2 Tax=Branchi... 67 9e-10 UniRef50_UPI0001699593 Probable taurine catabolism dioxygenase n... 67 1e-09 UniRef50_Q108L2 Oxygenase n=1 Tax=uncultured organism RepID=Q108... 67 1e-09 UniRef50_A3Y505 Gamma-butyrobetaine hydroxylase, putative n=3 Ta... 67 1e-09 UniRef50_A0YHS0 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 67 1e-09 UniRef50_UPI0001AEE667 hypothetical protein SalbJ_25454 n=1 Tax=... 66 1e-09 UniRef50_Q2JC07 Putative uncharacterized protein n=1 Tax=Frankia... 66 1e-09 UniRef50_Q6CQT2 KLLA0D14553p n=1 Tax=Kluyveromyces lactis RepID=... 66 1e-09 UniRef50_B2HPX4 Gamma-butyrobetaine hydroxylase, TauD_1 n=1 Tax=... 66 2e-09 UniRef50_A6F7M8 Gamma-butyrobetaine hydroxylase n=1 Tax=Moritell... 66 2e-09 UniRef50_A4REX2 Putative uncharacterized protein n=1 Tax=Magnapo... 66 2e-09 UniRef50_B0D9T9 Predicted protein n=3 Tax=Agaricales RepID=B0D9T... 66 2e-09 UniRef50_B6BRE2 Trimethyllysine dioxygenase n=1 Tax=Candidatus P... 66 2e-09 UniRef50_UPI0001B55A0D hypothetical protein StAA4_00370 n=1 Tax=... 65 3e-09 UniRef50_A3P1R2 Pyoverdine biosynthesis protein PvcB n=35 Tax=Pr... 65 3e-09 UniRef50_A5E9P2 Putative uncharacterized protein n=1 Tax=Bradyrh... 65 3e-09 UniRef50_A9FN86 Putative uncharacterized protein n=1 Tax=Sorangi... 65 3e-09 UniRef50_Q6CCC7 YALI0C10604p n=1 Tax=Yarrowia lipolytica RepID=Q... 65 3e-09 UniRef50_D0LLC5 Putative uncharacterized protein n=1 Tax=Haliang... 64 5e-09 UniRef50_A4X6E3 Putative uncharacterized protein n=1 Tax=Salinis... 64 5e-09 UniRef50_B1MME3 Putative uncharacterized protein n=1 Tax=Mycobac... 64 5e-09 UniRef50_A8TVU7 Dihydroxyacid dehydratase n=1 Tax=alpha proteoba... 64 6e-09 UniRef50_C0ZLG3 Putative uncharacterized protein n=2 Tax=Rhodoco... 64 6e-09 UniRef50_Q0CAI8 Predicted protein n=3 Tax=Aspergillus RepID=Q0CA... 64 6e-09 UniRef50_C0NV43 Trimethyllysine dioxygenase n=21 Tax=Leotiomycet... 64 6e-09 UniRef50_B1J339 Putative uncharacterized protein n=1 Tax=Pseudom... 64 6e-09 UniRef50_UPI000180CBE5 PREDICTED: similar to gamma-butyrobetaine... 64 6e-09 UniRef50_D1ZLJ6 Whole genome shotgun sequence assembly, scaffold... 64 7e-09 UniRef50_Q4JN27 Putative uncharacterized protein n=1 Tax=uncultu... 64 9e-09 UniRef50_Q0RGQ6 Putative clavaminate synthase-like (Oxidase) n=2... 64 9e-09 UniRef50_C7QKA8 Oxygenase (Secreted protein) n=1 Tax=Catenulispo... 64 9e-09 UniRef50_B2AFB2 Predicted CDS Pa_0_1600 n=1 Tax=Podospora anseri... 64 1e-08 UniRef50_A9VA54 Predicted protein n=1 Tax=Monosiga brevicollis R... 63 1e-08 UniRef50_B2WCC9 Gamma-butyrobetaine dioxygenase n=2 Tax=Pleospor... 63 1e-08 UniRef50_UPI000023E495 hypothetical protein FG06105.1 n=1 Tax=Gi... 63 1e-08 UniRef50_C5AK38 Putative uncharacterized protein n=8 Tax=Proteob... 63 1e-08 UniRef50_A0Z404 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n... 63 1e-08 UniRef50_A6F048 Putative uncharacterized protein n=1 Tax=Marinob... 63 1e-08 UniRef50_A6WF32 Clavaminate synthase n=2 Tax=Actinobacteria (cla... 63 2e-08 UniRef50_Q2J9C7 Pyoverdine biosynthesis protein n=1 Tax=Frankia ... 62 2e-08 UniRef50_A9FPD4 Putative uncharacterized protein n=1 Tax=Sorangi... 62 2e-08 UniRef50_A9M275 Tou3 n=10 Tax=Neisseria RepID=A9M275_NEIM0 62 2e-08 UniRef50_D1KCD0 Putative uncharacterized protein n=1 Tax=uncultu... 62 2e-08 UniRef50_UPI000186E3FD gamma-butyrobetaine dioxygenase, putative... 62 2e-08 UniRef50_A5F6V5 PvcB protein n=35 Tax=Proteobacteria RepID=A5F6V... 62 3e-08 UniRef50_C4WXU3 ACYPI008997 protein n=2 Tax=Acyrthosiphon pisum ... 62 3e-08 UniRef50_A6RF74 Predicted protein n=1 Tax=Ajellomyces capsulatus... 62 3e-08 UniRef50_D2ATF7 Putative uncharacterized protein n=1 Tax=Strepto... 62 3e-08 UniRef50_A1UJG6 Taurine catabolism dioxygenase TauD/TfdA n=17 Ta... 62 3e-08 UniRef50_A6DBS5 Putative clavaminate synthase-like (Oxidase) n=1... 62 3e-08 UniRef50_C5S2J6 Putative uncharacterized protein n=1 Tax=Actinob... 62 3e-08 UniRef50_D2QGX9 Putative uncharacterized protein n=1 Tax=Spiroso... 62 4e-08 UniRef50_C5FYM2 Mitochondrial protein n=1 Tax=Microsporum canis ... 62 4e-08 UniRef50_Q7SAI5 Predicted protein n=4 Tax=Sordariales RepID=Q7SA... 62 4e-08 UniRef50_Q0A839 Putative taurine catabolism dioxygenase n=1 Tax=... 61 4e-08 UniRef50_B8NNC7 Trimethyllysine dioxygenase TmlH, putative n=7 T... 61 4e-08 UniRef50_A0L5C8 Putative uncharacterized protein n=1 Tax=Magneto... 61 5e-08 UniRef50_Q6C1G9 YALI0F16357p n=1 Tax=Yarrowia lipolytica RepID=Q... 61 5e-08 UniRef50_Q0ZQ39 FrbJ n=2 Tax=Streptomyces RepID=Q0ZQ39_9ACTO 61 5e-08 UniRef50_A8YB34 Similar to tr|Q2UDI5|Q2UDI5_ASPOR Predicted prot... 61 5e-08 UniRef50_C7Q942 Putative uncharacterized protein n=1 Tax=Catenul... 61 5e-08 UniRef50_Q9FB35 Clavaminic acid synthase-like protein n=1 Tax=St... 61 6e-08 UniRef50_Q4PCW2 Putative uncharacterized protein n=1 Tax=Ustilag... 61 7e-08 UniRef50_A6G9Y5 Pyoverdine biosynthesis protein n=1 Tax=Plesiocy... 60 9e-08 UniRef50_Q93JQ4 Clavaminate synthase n=1 Tax=Rhodococcus fascian... 60 1e-07 UniRef50_B2AD86 Predicted CDS Pa_4_400 n=4 Tax=Leotiomyceta RepI... 60 1e-07 UniRef50_C7YUC3 Putative uncharacterized protein n=1 Tax=Nectria... 60 1e-07 UniRef50_B8GUE4 Taurine catabolism dioxygenase n=1 Tax=Thioalkal... 60 1e-07 UniRef50_B2AMD0 Predicted CDS Pa_5_7570 n=1 Tax=Podospora anseri... 60 1e-07 UniRef50_C5A800 Putative uncharacterized protein n=1 Tax=Burkhol... 60 1e-07 UniRef50_C1E7Q6 Predicted protein n=2 Tax=Micromonas RepID=C1E7Q... 60 1e-07 UniRef50_Q19000 Probable gamma-butyrobetaine dioxygenase n=4 Tax... 60 1e-07 UniRef50_Q16V01 Epsilon-trimethyllysine 2-oxoglutarate dioxygena... 59 2e-07 UniRef50_D1SG31 Putative uncharacterized protein n=1 Tax=Micromo... 59 2e-07 UniRef50_C7QJ42 Clavaminate synthase n=1 Tax=Catenulispora acidi... 59 2e-07 UniRef50_Q9NVH6 Trimethyllysine dioxygenase, mitochondrial n=39 ... 59 2e-07 UniRef50_C8V493 Trimethyllysine dioxygenase TmlH, putative (AFU_... 59 2e-07 UniRef50_B7GDP6 Predicted protein n=1 Tax=Phaeodactylum tricornu... 59 2e-07 UniRef50_UPI0001B55A0A putative taurine catabolism dioxygenase n... 59 3e-07 UniRef50_A3NB26 Dioxygenase, TauD/TfdA family n=29 Tax=pseudomal... 59 3e-07 UniRef50_A0P0W2 Putative uncharacterized protein n=1 Tax=Labrenz... 58 4e-07 UniRef50_UPI0001B59B43 hypothetical protein MaviaA2_08448 n=2 Ta... 57 7e-07 UniRef50_B0T4H7 Taurine catabolism dioxygenase TauD/TfdA n=3 Tax... 57 7e-07 UniRef50_B0KR96 Putative uncharacterized protein n=1 Tax=Pseudom... 57 7e-07 UniRef50_C9S8L0 Trimethyllysine dioxygenase n=1 Tax=Verticillium... 57 7e-07 UniRef50_C5E1T6 ZYRO0G01342p n=1 Tax=Zygosaccharomyces rouxii Re... 57 8e-07 UniRef50_B9JZY5 Gamma butyrobetaine hydroxylase protein n=3 Tax=... 57 8e-07 UniRef50_A0YLG8 Gamma-butyrobetaine hydroxylase n=1 Tax=Lyngbya ... 57 9e-07 UniRef50_A3P5A5 Gamma-butyrobetaine dioxygenase n=19 Tax=pseudom... 57 1e-06 >UniRef50_B7LDM7 Protein csiD n=104 Tax=Bacteria RepID=CSID_ECO55 Length = 325 Score = 432 bits (1111), Expect = e-120, Method: Composition-based stats. Identities = 317/325 (97%), Positives = 321/325 (98%) Query: 1 MNALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 MNALTAV NNAVDSGQDYSGFTL PSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK Sbjct: 1 MNALTAVHNNAVDSGQDYSGFTLIPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 Query: 61 SFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAH 120 SFLRFRV KILDDLCANQLQPLLLKTLLNRAEGALLINAVG+DDV QADEMVKLATAVAH Sbjct: 61 SFLRFRVGKILDDLCANQLQPLLLKTLLNRAEGALLINAVGIDDVAQADEMVKLATAVAH 120 Query: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE Sbjct: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 Query: 181 QNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 QNMQGGNSLLLHLDDWEHLD+YFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV Sbjct: 181 QNMQGGNSLLLHLDDWEHLDHYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 Query: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH Sbjct: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 Query: 301 PDLRRELMRQRGYFAYASNHYQTHQ 325 PDLRRELMRQRGYFAYA++HYQTHQ Sbjct: 301 PDLRRELMRQRGYFAYATHHYQTHQ 325 >UniRef50_D0RN41 Protein CsiD n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RN41_9RICK Length = 313 Score = 396 bits (1018), Expect = e-109, Method: Composition-based stats. Identities = 133/300 (44%), Positives = 195/300 (65%), Gaps = 1/300 (0%) Query: 16 QDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLC 75 + +G + + S RL+++ + EQ ++ + +EYK FLRF + I + + Sbjct: 12 EKVTGLIVNNHSSSQRLVDIKIENDYLDKVKEQFDQFDLLDIEYKPFLRFHITDIFNKIF 71 Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVK-QADEMVKLATAVAHLIGRSNFDAMSGQY 134 ++Q L LLNR +GA +I +D K D +VKL+TA+ HL+G NFDAM G+Y Sbjct: 72 NEKIQSLTKTILLNRNQGAFVIGPEAMDQSKYDTDFLVKLSTALTHLVGIPNFDAMYGKY 131 Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD 194 YARF VKN DNSDSYLR+ + ++LH DGTYV+E TD++LMMK+ E+N +GG S LLHLD Sbjct: 132 YARFEVKNTDNSDSYLRKAAKKLDLHTDGTYVKEKTDWLLMMKMKEENSEGGESTLLHLD 191 Query: 195 DWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFE 254 DWE + +F P+ + + +P SKNV + HP+F D+QG+P++ YIDQF +P+ + Sbjct: 192 DWEDCEKFFTDPIGKENFVWGSPKSKNVDYKIEHPIFSTDRQGKPIISYIDQFPEPQSLK 251 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 +G++L+ LS+++E SK ++S +P G + NN F LHGR F PHP L REL+RQRG F Sbjct: 252 QGLYLNSLSESLEGSKKLISFKLPPGYAIFSNNYFMLHGRKAFKPHPKLYRELLRQRGIF 311 >UniRef50_A9VGL1 Putative uncharacterized protein n=17 Tax=Bacteria RepID=A9VGL1_BACWK Length = 312 Score = 392 bits (1007), Expect = e-108, Method: Composition-based stats. Identities = 131/313 (41%), Positives = 185/313 (59%), Gaps = 3/313 (0%) Query: 3 ALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSF 62 ++ QN + + Y G+ + P + RL + +Q KQF E+V E Q+L+Y + Sbjct: 2 SIITEQNTKMKFAKKYEGYEIVPHPEHKRLYHIVSNQQLLKQFFEEVKEHSEQSLQYIPY 61 Query: 63 LRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLI 122 RF +A + + + + +R G I G + + VK ATA+ HLI Sbjct: 62 SRFNLADGMRKIFGQSFMDNIRGIVHDRETGGFTIGVQG--ETSDPADYVKFATALTHLI 119 Query: 123 GRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQN 182 G NFDAM+G YYARF VK+ D+SDSYLRQ +R+ LH DGT+V+E TD++LMMKI+EQN Sbjct: 120 GEPNFDAMTGTYYARFNVKDTDSSDSYLRQAYRLFTLHTDGTFVDEPTDWLLMMKIEEQN 179 Query: 183 MQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMR 242 GG S LLHLDDWE L + H LA + + APPSKN + V+ F D P + Sbjct: 180 AVGGESRLLHLDDWEDLHKFRNHSLASVKVTYKAPPSKNAQEIVYRETFF-DVNNAPCIC 238 Query: 243 YIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 +IDQF P + E+ +L +LS ++E S ++ +P+G +L+NNLFW+HGR F + D Sbjct: 239 FIDQFAYPDNIEQANYLKDLSYSVENSPATHALKLPIGDLVLLNNLFWMHGRAAFEKNKD 298 Query: 303 LRRELMRQRGYFA 315 L RELMRQRG F+ Sbjct: 299 LYRELMRQRGCFS 311 >UniRef50_Q47V67 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q47V67_COLP3 Length = 320 Score = 386 bits (992), Expect = e-106, Method: Composition-based stats. Identities = 151/303 (49%), Positives = 203/303 (66%), Gaps = 4/303 (1%) Query: 20 GFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQL 79 GF++ P A +PRL + + +T K F+EQ VQA+EYK FLRF VA IL+ L + L Sbjct: 16 GFSVAPFAANPRLQVIKLSSETVKGFIEQALPLGVQAIEYKPFLRFHVAGILNHLTNDTL 75 Query: 80 QPLLLKTLLNRAEGALLINAVGVD-DVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARF 138 LLL + NR GA ++ V + Q + + L+TAV+HLIG N DAM G++YARF Sbjct: 76 GALLLGIIKNRDTGAFMLQCEPVAAEFDQLEFNILLSTAVSHLIGVPNLDAMYGKFYARF 135 Query: 139 VVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEH 198 VKN DNSDSYLRQ HR MELHNDGTYVEE TD+V+M KI E+NM GG+SLLLH+D+W+ Sbjct: 136 SVKNEDNSDSYLRQAHRRMELHNDGTYVEERTDWVIMQKIAEENMAGGDSLLLHVDEWQD 195 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD-VDQQGRPVMRYIDQFVQPKDFEEGV 257 L+ ++ HPLA+ +++ +P SKN++ + HPVF D GRP M +IDQF +P + +G Sbjct: 196 LEKFYNHPLAKEDIQWTSPASKNITYKMQHPVFFEEDDNGRPKMLFIDQFAEPLNMRQGQ 255 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYA 317 +L E+ ++E + +V VPVG L++NN WLHGRD+F L REL+RQRG AY Sbjct: 256 YLYEMGTSLEAEQNTFNVRVPVGSMLVVNNHAWLHGRDKFVADKGLYRELLRQRG--AYC 313 Query: 318 SNH 320 N Sbjct: 314 ENA 316 >UniRef50_Q4FKY1 Gab protein n=3 Tax=Candidatus Pelagibacter RepID=Q4FKY1_PELUB Length = 303 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 135/299 (45%), Positives = 197/299 (65%) Query: 16 QDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLC 75 ++ SG T+T S R++++ ++ + + ++ + ALEYK F RF +AK LDDL Sbjct: 2 ENISGITITEHQNSKRIIDIRIEDEILDKLIFPFNKFDITALEYKPFTRFTIAKSLDDLT 61 Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYY 135 +N+L LL + +R G +I V +VKL+TA+A+LIG N+DAM+G+YY Sbjct: 62 SNKLSKLLNSIVRDRETGCFIIGPKKVSAKINDIFLVKLSTAIAYLIGNPNYDAMAGKYY 121 Query: 136 ARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD 195 ARF VK+ D SDSYLR+ + M+LH DGTYV+EITD++LM KI+EQN+QGG + +LHLDD Sbjct: 122 ARFFVKHEDKSDSYLRKAYTNMDLHTDGTYVKEITDWLLMTKIEEQNVQGGETAMLHLDD 181 Query: 196 WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEE 255 WEH ++ F P+ ++ + +P SKN+ V HPVF D G+P + YIDQF +PK+ ++ Sbjct: 182 WEHCEDLFNDPIGKQNFVWGSPKSKNIEYKVEHPVFTTDDNGKPNISYIDQFPEPKNMDQ 241 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 G++L +LSDA+E SK + + G ++ NN FWLHGR F + DL REL+R RG F Sbjct: 242 GIFLQKLSDALEESKNKVITKLVPGSTIVANNYFWLHGRKPFKENKDLSRELLRIRGSF 300 >UniRef50_Q2KU12 Carbon starvation-inducible protein n=1 Tax=Bordetella avium 197N RepID=Q2KU12_BORA1 Length = 305 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. Identities = 132/307 (42%), Positives = 194/307 (63%), Gaps = 4/307 (1%) Query: 9 NNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVA 68 N+ +D + +SG T++ R+ ++T + ++FL+Q VQ LEY F+RF++A Sbjct: 2 NDRLDLQRLFSG-TVSDHQTHTRVRQVTLESEGLERFLDQARAIDVQNLEYVPFMRFKLA 60 Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 +L C L+ L + +R G I G+ D+ V+ TAV +L+G +N D Sbjct: 61 DMLLQACGEGLRATLNALVEDRRHGGFTIGLQGLS--ADPDDFVRFGTAVGYLLGPANHD 118 Query: 129 AMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 +MSG+YYARF+VK+ DNSDSYLRQ +R+ +H DGTYV E TD++LMMK DE+N GG S Sbjct: 119 SMSGKYYARFLVKHTDNSDSYLRQAYRLFTMHTDGTYVTEATDWLLMMKFDERNAVGGES 178 Query: 189 LLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFV 248 LHLDDW LD + + PLA +P+ + +P SKNV++ V P+F + G V +IDQFV Sbjct: 179 RFLHLDDWADLDRFTQDPLATQPLLYKSPASKNVAEQVERPLFFQSRYGLSVC-FIDQFV 237 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 QP EE ++L +LS ++E+S G+ + +P G+ +++NN F+LHGR F + L RELM Sbjct: 238 QPATLEEALYLHDLSASMESSAGVQEITLPPGELVVLNNYFYLHGRAPFEKNEALHRELM 297 Query: 309 RQRGYFA 315 R RG FA Sbjct: 298 RIRGLFA 304 >UniRef50_A8TRC8 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TRC8_9PROT Length = 363 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 55/278 (19%), Positives = 97/278 (34%), Gaps = 46/278 (16%) Query: 68 AKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHL------ 121 A + A +L LL + R G L++ + V + D + A AH+ Sbjct: 66 ADFPLPILAPKLSALLDQLESGR--GVALLSGLPVQNYDDEDLRMLWAGIGAHVGPLLPQ 123 Query: 122 -----IGRSNFDAMSGQYYARFVVKNVDNSDSYLR-QPHRVMELHNDGTYVEEITDYVLM 175 + R D SG++ A K ++ S + Q + + H D D + + Sbjct: 124 SIDGKLMRDVRDEASGKFQAHAPSKAENSLTSTQKAQSNGPLRFHTD------RADVLGL 177 Query: 176 MKIDEQNMQGGNSLLLH--------LDDWEHLDNYFRHPLARRPMRFAAPPSKNV---SK 224 + + + GG S + L L + P + +++V K Sbjct: 178 LCVRQ-AGAGGESKVASSLAVRNEILKRRPDLHDLLCQPY------WRTRETQDVGGGHK 230 Query: 225 DVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGV--WLSELSDAIE-----TSKGILSVPV 277 PVF + G +Y FV+ +GV E+ A++ + + Sbjct: 231 VFAMPVFSF-RDGHFSSQYSRTFVEEAQRIDGVPKMTPEMDAALDLLAEVAEEKCHVFRL 289 Query: 278 PVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFA 315 G+ + NN HGR FT R + + R +FA Sbjct: 290 QPGEMVFYNNHLVYHGRTPFTDDAASRADRLLYRLWFA 327 >UniRef50_Q05XB9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05XB9_9SYNE Length = 290 Score = 177 bits (449), Expect = 4e-43, Method: Composition-based stats. Identities = 50/240 (20%), Positives = 86/240 (35%), Gaps = 32/240 (13%) Query: 92 EGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR 151 G ++ G+ ++ +L A+ +G D G+ Y D+ SYL Sbjct: 54 HGYGVVLIRGLSPQPESTFR-RLYLALGRCLGTP--DGTYGELY-----DVTDSGQSYLT 105 Query: 152 QPHR------VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL--------LHLDDWE 197 + +H D + +E +V + + + + GG+ L L D Sbjct: 106 KAIPVSQTRAATSMHTDSSRLETHPRWVGLACVQQAPVGGGSRLASALAVHNHLKASDPR 165 Query: 198 HLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYID--------QFVQ 249 L+ + AR + A + PVF D G P +RY+ + Q Sbjct: 166 SLERL-QRSFARDVVTPGAVDPLALIAHNRFPVFSTDTDG-PTLRYMRYWIEKGHQRLGQ 223 Query: 250 PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 P D + L A+ + S + G LLI+N +H R+ + P R L+R Sbjct: 224 PLDANDLRAFDALDAALNHPRFCHSFQLQEGDILLIDNHKLVHDREAYEDDPHRPRRLIR 283 >UniRef50_A9UUQ7 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UUQ7_MONBE Length = 453 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 54/249 (21%), Positives = 97/249 (38%), Gaps = 29/249 (11%) Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYY 135 + L+ L + +G ++I + D++ D + A + F + +Y Sbjct: 185 SQLLEDPLPALTAVQEDGMVIITDMPTCDLRDPDTSTQQPLIEATALAMRQFGQLQRTFY 244 Query: 136 AR--FVVKNVDNSDSYLRQPHR--VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 + + D +D + + LH D TY+ E + + Q+ GG SL Sbjct: 245 SDGLWDTAPKDAADVN-DTAYTNLGLPLHTDATYMREPPG-LQLFCCTAQSSDGGASLFG 302 Query: 192 HLD---------DWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMR 242 H++ + + L F P A + + NV P+F + VMR Sbjct: 303 HVNLVLKALYEQEPDLLTYCFNTPWAFQAL------ETNVHYHAMGPIFGAEAPQDVVMR 356 Query: 243 Y--IDQFVQPK---DFEEGVW--LSELSDAIETSKGI-LSVPVPVGKFLLINNLFWLHGR 294 + D+ P+ D E + L L+ + + K + S + VG+ +INN LHGR Sbjct: 357 FNPTDRAAMPQLTFDELEAYYHCLERLTALLASPKCLYHSERLAVGEMAVINNHKVLHGR 416 Query: 295 DRFTPHPDL 303 + F H +L Sbjct: 417 EAFVGHRNL 425 >UniRef50_UPI0001AF241C oxygenase (secreted protein) n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF241C Length = 333 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 52/266 (19%), Positives = 90/266 (33%), Gaps = 31/266 (11%) Query: 67 VAKILDDLCANQLQPLLLKTLLNRAEGALLI---------------NAVGVDDVKQADEM 111 +++ L D+ A L+ + G LL+ + + Sbjct: 44 LSEFLPDVPAQLCDALVDFREGTASSGYLLVKGVTTGDLPDTPLAYGTQALVGHASNGTL 103 Query: 112 VKLATAVAHLIG---RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEE 168 + + LIG N D + + R ++NS S + H + + Sbjct: 104 ALVGDVLGSLIGYADEKNGDLLHDVHPVRGEEHRMENSGSV------AFDFHTENVHHPL 157 Query: 169 ITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH 228 D++ ++ + + + + HL L R R P S + Sbjct: 158 RPDFLGLLSLRQGHEATATRVASIRHAVAHLSEDENAVLRRLRFRSLFPTSFTRGRTGER 217 Query: 229 PVFDVDQ--QGRPVMRYI--DQF-VQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFL 283 P + G P ++ D F +P D E L L++A+E + V + G L Sbjct: 218 PATAEHRVLFGNPGQEFLRFDSFNTKPADPEAERALGALAEALEAV--CVEVVLEPGDLL 275 Query: 284 LINNLFWLHGRDRFTPHPDLRRELMR 309 L+NN HGR FTP D R +R Sbjct: 276 LVNNHIAAHGRSAFTPRYDGRDRWLR 301 >UniRef50_C4JUF4 Predicted protein n=6 Tax=Eurotiomycetidae RepID=C4JUF4_UNCRE Length = 301 Score = 160 bits (404), Expect = 7e-38, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 62/170 (36%), Gaps = 13/170 (7%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARR 210 + + H D +Y + + + GG +L +D L R L+R Sbjct: 121 ETAQDFPWHTDCSYEHLPPRFFALQVLQPDRCGGGTLSILSIDMLVRLLSPSTRTSLSRP 180 Query: 211 PMRFAAPPS--KNVSKDVFHPVFDVDQ-QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIE 267 R PP K+ + + + D G P R+ ++ + P + L EL + Sbjct: 181 EYRITVPPEFIKSDERHITAGLLGEDPGNGAPEFRFREEILCPLTAGAKMALQELGAVLS 240 Query: 268 TSKGILSV------PVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + + + +P G +LINN WLH R D R L R R Sbjct: 241 SPQAKAATLHLTPELLPRGSVILINNRRWLHARSEV---RDPHRHLRRVR 287 >UniRef50_C5PDZ6 Putative uncharacterized protein n=2 Tax=Coccidioides RepID=C5PDZ6_COCP7 Length = 311 Score = 157 bits (396), Expect = 6e-37, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 64/170 (37%), Gaps = 13/170 (7%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE-HLDNYFRHPLARR 210 + R H D +Y + + + GG +L D L R L+R Sbjct: 131 ETARDFPWHTDCSYEHLPPRFFALQVLQPDRCGGGTLSILDADKIAGLLSPATRRSLSRP 190 Query: 211 PMRFAAPPS--KNVSKDVFHPVFDVDQ-QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIE 267 R P K+ + + P+ D G +R+ ++ ++P + L E ++++ Sbjct: 191 EYRITVPAEFIKSDERHITAPLLSKDSGSGAAELRFREEILEPLTNGAKLALQEFGESLQ 250 Query: 268 TSKGILSV------PVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + + +P G +LINN WLH R D RR L R R Sbjct: 251 SPNAKAATLHLTPELLPRGSIILINNRRWLHARSEV---KDPRRHLRRVR 297 >UniRef50_B6HPB8 Pc22g01880 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HPB8_PENCW Length = 298 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 65/177 (36%), Gaps = 14/177 (7%) Query: 146 SDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEH-LDNYFR 204 S + E H D +Y E+ + + + GG +L++D L + + Sbjct: 114 SHQTRSETMFRFEWHTDCSYEEQPPRFFALQVLQPDRYGGGTLSVLNVDRLLTLLSPFAQ 173 Query: 205 HPLARRPMRFAAPPSKNVSKDVFHPV-----FDVDQQGRPVMRYIDQFVQPKDFEEGVWL 259 L+ + PP + + V ++ Q +R+ + P + L Sbjct: 174 RWLSSYNYKINVPPEFTKTARTQYIVGNLLAVNLSQSSGSRLRFREDITVPLTLDASKAL 233 Query: 260 SELSDAIETSKGILSVPVPV-----GKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 EL + + + S+ +P G ++++N WLH R+ D R L R R Sbjct: 234 DELKEILYSGAQEESLHLPPQSLPQGSIIMMDNRRWLHSRNEV---KDPNRHLRRVR 287 >UniRef50_Q112B1 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q112B1_TRIEI Length = 371 Score = 141 bits (354), Expect = 4e-32, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 68/181 (37%), Gaps = 13/181 (7%) Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH-- 192 Y F N L + M H D TY + + E + GG SL++ Sbjct: 168 YGLFAPSKTTNEGKDLAETGNAMSFHTDYTYWHTPP-LLTSLYCVENSASGGESLIVDGF 226 Query: 193 --LDDWEHLDNYFRHPLARRPMRFAAPPSK-NVSKDVFHPVFDVDQQGR-PVMRYIDQFV 248 +DD+ + L + P++F +K P+ ++D+ G+ + + + Sbjct: 227 RVVDDFRQQHPDYFQILTQTPIQFKQVYTKWQYFYSRTQPILELDEYGKVTRINFANSHS 286 Query: 249 Q----PKDFEEGVWLSELS--DAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 P D E + + ++ ++ + G LL+N+ +HGR FT + Sbjct: 287 YTWKLPFDQMEEFYAAYITFFQYVKNPVYEYCFSLEPGDLLLMNDSRIMHGRKAFTGNRH 346 Query: 303 L 303 L Sbjct: 347 L 347 >UniRef50_Q2UFS3 Predicted protein n=3 Tax=Aspergillus RepID=Q2UFS3_ASPOR Length = 187 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 63/169 (37%), Gaps = 15/169 (8%) Query: 155 RVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMR 213 + H D +Y + + + GG ++ +D + L + L + Sbjct: 2 QEFPWHTDCSYEHAPPRFFALQVLQHDRYGGGTLSVMKIDRLSQFLSPTTKAALLEPEFQ 61 Query: 214 FAAPPSK---NVSKDVFHPVFDVDQQGRPVM-RYIDQFVQPKDFEEGVWLSELSDAIET- 268 PP + + +F +D + +M RY D+ V P L EL A++ Sbjct: 62 ITIPPEFIKHPDQRHIVGSLFAIDTEDHCLMMRYRDEIVTPLSARAAAALKELKGALQDM 121 Query: 269 ---SKGILSVP---VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 S+ L + +P +L++N WLH R+ D R L R R Sbjct: 122 EALSQSTLHLTAADLPERSIILLDNYRWLHARNGI---KDPARHLRRVR 167 >UniRef50_C7ZJ20 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7ZJ20_NECH7 Length = 323 Score = 131 bits (329), Expect = 4e-29, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 60/172 (34%), Gaps = 15/172 (8%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARR 210 + H D +Y + + + + GG ++++D E L R L + Sbjct: 144 ETMNEFPWHTDCSYEDPPPRFFALQVLQHDRCGGGTLSVMNVDKLSELLSPEIRSALLAQ 203 Query: 211 PMRFAAPPSKNVS---KDVFHPVFDVDQQGRPVM-RYIDQFVQPKDFEEGVWLSELSDAI 266 R PP K + VF + M R+ + + P L EL +A+ Sbjct: 204 EYRITIPPEFIKDPEQKHIIGSVFVTSPNDQSTMIRFREDILTPLTDRASRALVELKEAL 263 Query: 267 ETSKGILSVP-------VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + +P G +L++N WLH R+ D R L R R Sbjct: 264 LKEEVQAHSTVHLKSADLPKGSIILMDNRRWLHARN---DIKDPERHLRRVR 312 >UniRef50_A7SHP2 Predicted protein (Fragment) n=4 Tax=Nematostella vectensis RepID=A7SHP2_NEMVE Length = 385 Score = 131 bits (328), Expect = 5e-29, Method: Composition-based stats. Identities = 46/246 (18%), Positives = 84/246 (34%), Gaps = 33/246 (13%) Query: 80 QPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFV 139 Q LL G ++++ ++ ++ ++ +AV L +Y V Sbjct: 133 QSLLDWMEALHKYGLVIMSGAP----RELGQVHRIGSAVGFL---------RKTFYGSSV 179 Query: 140 VKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHL 199 + L ++ H D Y E +L+ ID+ GG + +D + L Sbjct: 180 ALRSEPQARSLAYTGYELQPHTDLPYYEFKPSVILLHCIDQVRSSGGENTF--VDGYSIL 237 Query: 200 DNYFRHP------LARRPMRFA----APPSKNVSKDVFHPVFDVDQQGR-PVMRYIDQFV 248 + LA P+ P + P+ ++D +GR + + D Sbjct: 238 KAFRNDNPDGFDLLASTPVLHRVKGVEPTYGEFEQLFARPIIELDVKGRIRRINFNDPLR 297 Query: 249 Q-----PKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + P + V+ +L+ K I+ + G I+N LHGR F Sbjct: 298 EEFLDTPAEQIPKVYRAYHKLTQMFYEPKFIVRNKMAPGDICAIDNDRLLHGRSAFEVKS 357 Query: 302 DLRREL 307 D R L Sbjct: 358 DDLRLL 363 >UniRef50_A1TTH8 Taurine catabolism dioxygenase TauD/TfdA n=9 Tax=Proteobacteria RepID=A1TTH8_ACIAC Length = 289 Score = 130 bits (326), Expect = 7e-29, Method: Composition-based stats. Identities = 41/182 (22%), Positives = 68/182 (37%), Gaps = 25/182 (13%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP-------LARRP 211 LH DG+Y+ T ++ + GG S+L D RH LA + Sbjct: 97 LHTDGSYLPIGTIKTSILLCRQHAASGGESILF--DSLSAFQALSRHDPGLAQSLLAPKV 154 Query: 212 MRFAAPPSKNVSKDVFH--PVFDVDQQGRPVMRYI---------DQFVQPKDFEEGVWLS 260 R + + + + H PVF + G+ + + P+ + +L Sbjct: 155 FRRRSTDPR-LDQQYEHIGPVFHTGENGQMASGFTLDVTADWDYSRRADPRVIDAVAYLK 213 Query: 261 ELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASNH 320 L++ S LS + G+ L++ N HGR+ + P R L+ RG F A Sbjct: 214 HLAEP--GSGYTLSFTLQRGQALVMRNDQLSHGRNAYIDDPAHPRVLL--RGLFLSAPRA 269 Query: 321 YQ 322 Q Sbjct: 270 MQ 271 >UniRef50_B0JNS2 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JNS2_MICAN Length = 305 Score = 126 bits (317), Expect = 9e-28, Method: Composition-based stats. Identities = 53/263 (20%), Positives = 100/263 (38%), Gaps = 34/263 (12%) Query: 71 LDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAM 130 +++ + L L R + L+ G+ + E+ ++ + + F + Sbjct: 48 PNEIGQETVNENLDFFLPFREK---LLQKPGIIIFRLEPELTEIEMRLIYAFISRIFGLL 104 Query: 131 SGQYYARFVVKNVDNSDSYLRQPHRV------MELHNDGTYVEEITDYVLMMKIDEQNMQ 184 + +Y F V +D Y ++ V H D T D+V ++ + + + Sbjct: 105 NHRYGYFFDV--IDRGMDYTKKAIPVSMTNAETGYHTDSTAKNYFPDFVGLLCLAAAD-E 161 Query: 185 GGNSLLLHLDD-----WEH---LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQ 236 GG+SL+++ + W+H L Y PLAR + S+ + P+F D Q Sbjct: 162 GGDSLVVNAANLYQYFWQHHTDLVPYLYEPLARDVITPGEINSQEAIQKNNFPLFSADSQ 221 Query: 237 GRPVMRYID----------QFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLIN 286 G V RY+ + QP+ + L + D + + + G IN Sbjct: 222 GL-VFRYMRYWIEVAYSKLEIAQPEAITKT--LDIIDDFFSKPENTVRFKMKKGDVFYIN 278 Query: 287 NLFWLHGRDRFTPHPDLRRELMR 309 N F H R F + R+++R Sbjct: 279 NRFLCHNRTAFK-NSGKPRQMVR 300 >UniRef50_A9UW47 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UW47_MONBE Length = 389 Score = 125 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 47/257 (18%), Positives = 93/257 (36%), Gaps = 34/257 (13%) Query: 92 EGALLINAVGVDDVKQADEMVKLA-TAVAHLIGRSNFDAMSGQ----YYARFVVKNVDNS 146 +G L + + + ++ A A++ +G +G + R + + + Sbjct: 125 DGPGLALLQPMPSLAKNIHAMRFAALAMSSCLGTPLVQNAAGDKSILVFDRDAGRKMVDG 184 Query: 147 DSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE-HLDNYFRH 205 Y Q H +H D V E +Y+L+ + +GG S+LL E +L Sbjct: 185 QRY-HQSHEGGSMHTDNVNVPETWEYMLLTCLQP-AAEGGESILLSSSTVEAYLAK--HD 240 Query: 206 PLA----RRPMRFAAPPSKNVSKDVFH-PVFDVDQQGRPVMRYIDQFVQ--------PKD 252 P A + + + S+ + P+ G P R++ ++++ P Sbjct: 241 PEALETLKEDFLWEL---RGFSERFYRAPILFHGTDGYPCFRWLREYLESAHARAGEPLT 297 Query: 253 FEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRE------ 306 + L+ L++A L + G+ L N++ LHGR F E Sbjct: 298 DRQISALNALTNATLEESLQLRYNMAKGEILFANDMSLLHGRTTFYDRQQASTEYDFGTQ 357 Query: 307 --LMRQRGYFAYASNHY 321 + QR + ++ Y Sbjct: 358 ANRLLQRNWVKTKTSQY 374 >UniRef50_UPI000023E985 hypothetical protein FG04441.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023E985 Length = 291 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 60/169 (35%), Gaps = 13/169 (7%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARR 210 + H D +Y + + Y + + GG +++++ E L + L Sbjct: 125 ETMSEFPWHTDCSYEDPLPRYFALQVLQHDRYGGGTLSVMNVEKLNELLSPESKAALMSS 184 Query: 211 PMRFAAPPSKNVSKDVFHPVFD-VDQQGRPVM-RYIDQFVQPKDFEEGVWLSELSDAIET 268 R PP + D H + G M R+ + V P + L EL DA+ Sbjct: 185 EFRIEIPPEFIKNADKKHITGSILKSNGESTMIRFREDIVTPLTDRARLALQELRDALVQ 244 Query: 269 SKGILSVP-------VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQ 310 + +P G +L++N WLH R+ D R L R Sbjct: 245 HEVQAHTTVHLKSSDLPKGSIILMDNRRWLHARN---DIKDPERHLRRI 290 >UniRef50_Q2MF05 Putative oxygenase, TobO n=1 Tax=Streptomyces sp. DSM 40477 RepID=Q2MF05_STRSD Length = 327 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 76/234 (32%), Gaps = 19/234 (8%) Query: 99 AVGVDDVKQADEMVKLATAVAHLIG---RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHR 155 +D + + +A + ++G + + ++NS S Sbjct: 84 PTTLDAHPTSATLQLVAETLGTMVGYADEKDGRLVHEVQPVPGDETRIENSGSV------ 137 Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL-DDWEHLDNYFRHPLARRPMRF 214 + H + + DY+ ++ + + ++ + + + LD+ L Sbjct: 138 AFDFHTENVHHPLRPDYLGLLCLRQDHLGVAATRVASVRHALALLDDDTVATLRGLWFLS 197 Query: 215 AAPPSKNVSKDVF------HPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIET 268 P S + HPV D+ RP +R+ L L+ A+E Sbjct: 198 NYPTSFTRAAGGELPPVGPHPVIFGDED-RPFLRFNSHNTFSTTPAGRAALVRLTQALEE 256 Query: 269 SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASNHYQ 322 V + G ++++N HGR FTP D + +R+ Q Sbjct: 257 V--CHDVVLEPGDCVIVDNNVAAHGRSGFTPRYDGQDRWLRRFYSVRAIPRTVQ 308 >UniRef50_A7HQL7 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQL7_PARL1 Length = 339 Score = 94.8 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 46/270 (17%), Positives = 80/270 (29%), Gaps = 34/270 (12%) Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 L A +L L + R G I + V+ D + HL G+ Sbjct: 56 DFPLPLLAGKLARLEDDLINGR--GFCRIAGLPVERYSDDDASLIYWGIGMHL-GKPWPQ 112 Query: 129 AMSGQYYARFVVKNVDNSDSYLRQ---PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQG 185 G + +D R H+DG+ D V +M + + G Sbjct: 113 NKHGHLLGDVTDQGKSGADPTSRGNEIGGVAFPYHSDGS------DLVGLMCLRK-AKSG 165 Query: 186 GNSLLLH--------LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQG 237 G S + + + L P +P + + F PVF G Sbjct: 166 GISTVANAVAIHNELVRTRPDLAALLYEP---QPYDYRGEQPEGGQPFYFVPVFTEH-GG 221 Query: 238 RPVMRYIDQFVQP---------KDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNL 288 R +RYI +++ + + + + + G +NN Sbjct: 222 RLFVRYIRPYIESSQRHADAPRLSAKAREAFDLVDAMCADPAFNVYMDLEPGDMQFVNNY 281 Query: 289 FWLHGRDRFTPHPDLRRELMRQRGYFAYAS 318 LH R + P+ R+ +R + A Sbjct: 282 HVLHARTAYEDWPERNRKRHLKRLWLETAQ 311 >UniRef50_Q9Z4Z5 L-asparagine oxygenase n=5 Tax=Actinomycetales RepID=ASNO_STRCO Length = 333 Score = 91.0 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 66/194 (34%), Gaps = 12/194 (6%) Query: 119 AHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKI 178 + N + + + N+ S L + HN+ + E D+V+++ + Sbjct: 120 GAFLPEKNGALVQDVVPVPGMEEFQGNAGSTL------LTFHNENAFHEHRPDFVMLLCL 173 Query: 179 DEQNMQGGNSLLLHLDD-WEHLDNYFRHPLARRPMRFAAPPSKNVS--KDVFHPVFDVDQ 235 + L + L R A PPS +S ++ PV D+ Sbjct: 174 RADPTGRAGLRTACVRRVLPLLSDSTVDALWAPEFRTAPPPSFQLSGPEEAPAPVLLGDR 233 Query: 236 QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRD 295 P +R +P L EL + + + G+ +++N +HGR Sbjct: 234 S-DPDLRVDLAATEPVTERAAEALRELQAHFDATAVTH--RLLPGELAIVDNRVTVHGRT 290 Query: 296 RFTPHPDLRRELMR 309 FTP D ++ Sbjct: 291 EFTPRYDGTDRWLQ 304 >UniRef50_A3JU53 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JU53_9RHOB Length = 328 Score = 90.2 bits (222), Expect = 8e-17, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 82/264 (31%), Gaps = 32/264 (12%) Query: 62 FLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHL 121 F +F +L + R G LL+ + V+ + + H Sbjct: 52 FPKFTKDDFPIPSLMPRLAEFAKELETGR--GFLLLRGLPVERYTEDQIRILYYAIGLH- 108 Query: 122 IGRSNFDAMSGQYYAR-FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE 180 +G G V + ++ +S + + + + H D +D V ++ + + Sbjct: 109 MGEPVGQNAKGDLLGNVMNVADPNDKNSRVFETNLYLPYHTD------PSDVVGLLCLRK 162 Query: 181 QNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVF--HPVFDVDQQGR 238 +GG S L+ + + L P + + P Sbjct: 163 -AKRGGLSSLVSVASI------YNEILGHHPELLGLFYKQYYYAHLGSGKPALSSLFNYH 215 Query: 239 PV---MRYIDQFVQ--------PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINN 287 RY+ Q+++ P E L + + + + + G NN Sbjct: 216 DCKLSCRYLRQYIELGHEIMEHPLSAVEIEALDRFDEIMHREALRIDMMLEPGDLQFANN 275 Query: 288 LFWLHGRDRFTPH--PDLRRELMR 309 LH R F H D RR+++R Sbjct: 276 YAVLHSRTDFEDHAEEDKRRKMLR 299 >UniRef50_B5JAG6 Taurine catabolism dioxygenase TauD, TfdA family n=4 Tax=Proteobacteria RepID=B5JAG6_9RHOB Length = 362 Score = 90.2 bits (222), Expect = 8e-17, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 81/266 (30%), Gaps = 40/266 (15%) Query: 70 ILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA 129 A L+ L L R G ++ + V Q AHL GR+ Sbjct: 69 FPLPNFAAHLKTLSETLLHGR--GFEVLRGLPVSSYTQETAATIFCGIGAHL-GRARSQN 125 Query: 130 MSGQYYARFVVKNVDNSDSYLR--QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGN 187 +G VD +D R Q H D D V ++ + E GG Sbjct: 126 AAGHILGHVRNIGVDANDPTTRIYQTADRQTFHTDS------ADVVGLLCLRE-AQVGGM 178 Query: 188 SLLLHLDDW--------EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRP 239 SLL+ L P+A R P + + PV + G Sbjct: 179 SLLVSAQTIANRMAAQRPDLLALLFDPIATDR-RGEIPDGADPFMRI--PVLNWH-DGNL 234 Query: 240 VMRYIDQFVQPKDFEEGV---------WLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 + Y Q+++ G L LS+ + G L + N Sbjct: 235 TVFYQRQYIESAQRFAGAPRLTDQHIEALDLFDSLANDPDLHLSMQLQPGDMLFVYNHSQ 294 Query: 291 LHGRDRFT-------PHPDLRRELMR 309 LH R FT P P+ RR L+R Sbjct: 295 LHDRTGFTDWPKSNWPDPNKRRHLLR 320 >UniRef50_Q98KK0 Probable gamma-butyrobetaine dioxygenase n=11 Tax=Alphaproteobacteria RepID=BODG_RHILO Length = 383 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 81/226 (35%), Gaps = 26/226 (11%) Query: 86 TLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDN 145 R G +++ + ++ + K++ ++ Y R+ + Sbjct: 140 LSAVRTYGFAVMDGLP----AESGALCKVSDLFGYI---------RETNYGRWFEVRAEV 186 Query: 146 SDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----LDDWEHLDN 201 + + L + ++ H D Y + + + ++ E ++GG S ++ + + Sbjct: 187 NPNNLAYTNLGLQAHTDNPYRDPVPT-LQILACVENTVEGGESSVIDGFAVAAALQAENP 245 Query: 202 YFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR-PVMRYIDQFVQP------KDFE 254 L+ P RF S V P+ ++ G +R+ ++ + P D + Sbjct: 246 EGFRLLSSCPARFEYAGSSGVRLQAKRPMIELGPDGELICIRFNNRSLAPVVDVPFADMD 305 Query: 255 EGVW-LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 ++ IE ++ + G+ +++N +H R F+ Sbjct: 306 AYYAAYRRFAELIEDPDFEVTFKLQPGQAFIVDNTRVMHARKAFSG 351 >UniRef50_UPI0001B558E5 oxygenase n=2 Tax=Streptomyces RepID=UPI0001B558E5 Length = 351 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 64/198 (32%), Gaps = 17/198 (8%) Query: 118 VAHLIGRSN---FDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVL 174 + +I N + + N+ S L +ELH + + + DYV Sbjct: 140 LGEVIAFRNEKGGALVQNVVPVPGKEDSQSNAGSVL------LELHTENAFHDNRPDYVG 193 Query: 175 MMKIDEQNMQGGNSLLLHLDD-WEHLDNYFRHPLARRPMRFAAPPSKNV--SKDVFHPVF 231 ++ + + L RH LA PPS HPV Sbjct: 194 LLCLRGDPTGDAKLCTSSIRRALPLLSAATRHVLAEPRFLTEPPPSFGRLGGVRSAHPVL 253 Query: 232 -DVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 + ++ + P D ++EL +A + + V G +++N Sbjct: 254 LGASEDPNVLVDFAATH--PLDEGAKAAMAELREAFVATMTPHRLRV--GDLAIVDNRVA 309 Query: 291 LHGRDRFTPHPDLRRELM 308 +HGR FTP D + Sbjct: 310 VHGRTSFTPRYDGADRWL 327 >UniRef50_D0N1N0 Trimethyllysine dioxygenase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N1N0_PHYIN Length = 435 Score = 87.9 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 86/240 (35%), Gaps = 33/240 (13%) Query: 75 CANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQY 134 C + ++PL+ N G + ++ K+ + + G Sbjct: 189 CVDNIEPLMRDLYTN---GLVRVSGTPTSMEATEKFSKKIGFVLRTIYGT---------- 235 Query: 135 YARFVVKNVDNSDSYLRQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 + ++++ Y +EL H DGTY+ + + + Q +GG S + Sbjct: 236 --MWTTNPTNDAEDYNDTASTNLELLHHTDGTYIRDPPG-LQIFNCAAQAGEGGESR--Y 290 Query: 193 LDDWEHLDNYFRH-PLARRPMRFAAPPSKNVSKD----VFHPVFDVDQQGRPV-MRYIDQ 246 +D + ++ + P A R + + P V D P+ VD G V R+ D Sbjct: 291 VDAFHVVETLRKENPEAFRVLSTTSLPYFTVDNDAHLATMEPLIRVDYAGNVVQFRHNDY 350 Query: 247 FVQPKD----FEEGVWLSE---LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 P E G + L + + + + + VG ++++N +HGR F Sbjct: 351 DRAPLTHLSFEEVGEFYQAHRKLLEVLRRPEMEFCMKLQVGDMVVVDNQRVMHGRHAFQG 410 >UniRef50_A0YX02 Gamma-butyrobetaine hydroxylase, putative n=2 Tax=Lyngbya sp. PCC 8106 RepID=A0YX02_9CYAN Length = 378 Score = 87.5 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 78/226 (34%), Gaps = 29/226 (12%) Query: 95 LLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNS-DSYLRQP 153 L + V ++ D ++ IG +++ G+Y VK + N+ D L Sbjct: 135 LTLGFTIVKNLPDEDLDNFISD-----IGPAHYLGKYGRY---SPVKAIPNAQDLSLSAE 186 Query: 154 HRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMR 213 + H D T++ V ++ E GG S+L +D ++ ++ +H + Sbjct: 187 GNELSPHTDITFMSTPP-LVQLLYCVENLATGGESVL--VDGFKVARDFQQHHPQYFEIL 243 Query: 214 FAAPPSKNVSKD-------VFHPVFDVDQQGRPV------MRYIDQFVQPKDFEEGVW-- 258 P P+ +++Q G + Q P D E + Sbjct: 244 TKVPVKFEQFYQEWEYYVSRTTPIIELEQDGLVSGIYFSHKNFSSQL--PFDQVEEFYEA 301 Query: 259 LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 ++ + G LL+ N LHGR F P+ +R Sbjct: 302 YKTFFLYLKNPAYQYWFRLEPGDCLLVENFRVLHGRKAFNPNSGMR 347 >UniRef50_A4D938 CrpF n=1 Tax=Nostoc sp. ATCC 53789 RepID=A4D938_9NOSO Length = 294 Score = 87.1 bits (214), Expect = 7e-16, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 71/226 (31%), Gaps = 15/226 (6%) Query: 91 AEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYL 150 G +++ K ++KL+ +I + D+ + VD+ Y+ Sbjct: 33 EFGFVILEHEPSATPKNN--LLKLSDYFGTIIQHEHSDS-----QGIVPISPVDSYPEYV 85 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARR 210 + LH DG + + M G L+ +EHL Sbjct: 86 NTTTTDLSLHTDGAFTITPPKVMAMQCQIAAANGGFTKLIDGKLVYEHLKR-TNPVGLLT 144 Query: 211 PMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEG--VWLSELSDAIET 268 A K +K P+F+ G ++R+ E + + Sbjct: 145 LFNPDAITVKRDNKKATKPIFEEHHAGL-IVRFRADNAAHVSVESKSFAAFKSFENFVNN 203 Query: 269 SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 + + + ++++N LHGR F+ R L R +F Sbjct: 204 PDNQVIFKLAQNQIIIVDNTRVLHGRTAFSKQE--YRLL--NRLWF 245 >UniRef50_Q1QTU1 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QTU1_CHRSD Length = 408 Score = 87.1 bits (214), Expect = 8e-16, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 69/187 (36%), Gaps = 19/187 (10%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 + ARF V++ N + +ELH D D + ++ E +GG SL Sbjct: 189 FGARFDVQSKPNPN-NAAYTAIGLELHTDLPNWRHPPD-IQLLYCLENEAEGGESLF--A 244 Query: 194 DDWEHLDNYFRHP------LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV-MRY--- 243 D + + L P+ F + V PV +VD GR +R+ Sbjct: 245 DGFAVAEALRHEAPELFLRLRDTPIDFRFQDEDSDIA-VRAPVIEVDDTGRIREVRFNNW 303 Query: 244 -IDQFVQPKDFEEGVWLSELS--DAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 D P + + + + L + + + + G+ + +N LHGR F P+ Sbjct: 304 IRDTLRLPPEEADAWYEAYLVFWQRLREPRFRVDFALEPGQMVAFDNRRVLHGRGAFDPN 363 Query: 301 PDLRREL 307 RR L Sbjct: 364 TG-RRHL 369 >UniRef50_A8PQ25 Taurine catabolism dioxygenase TauD, TfdA family n=1 Tax=Rickettsiella grylli RepID=A8PQ25_9COXI Length = 262 Score = 86.7 bits (213), Expect = 9e-16, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 64/179 (35%), Gaps = 11/179 (6%) Query: 138 FVVKNVDNSDSYLRQPHRVMELH--NDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD- 194 + +K + L + H+ E H D +Y E + DY + + GGN+L++ Sbjct: 86 WNIKISRENKKQLPRSHKDYEFHFHTDCSYEENVPDYFALYVLHADQKMGGNNLIVDSKI 145 Query: 195 DWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQF--VQPKD 252 L L P+ P K +R+ + + Sbjct: 146 LLNSLSKETLKVLQNFPVTIKVPHEFFKGKSCIQACI---IDANCNIRFRREIIDLDSLT 202 Query: 253 FEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 F++ + EL + I + + + + L++NN +LH R D RR L R R Sbjct: 203 FKQLKAIEELENLIYFPQYSRKLTLKNDQILILNNKRFLHARTHV---KDSRRHLQRIR 258 >UniRef50_D0LMF7 Gamma-butyrobetaine dioxygenase n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMF7_HALO1 Length = 424 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 77/220 (35%), Gaps = 21/220 (9%) Query: 100 VGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMEL 159 GV ++ + AVA IG + G Y+ V K+ N+++Y + Sbjct: 179 EGVALLRDCPRRDREVMAVAQRIG-PIRETNFGAYF-DVVSKHQPNNNAY---TSLALPP 233 Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPS 219 H D + + +D Q +GG+SL +D ++ A + P Sbjct: 234 HTDLPNWADPPGLQFLHCLDNQ-AEGGDSLF--VDGLRVVEELRAADPAALALLCRLPLG 290 Query: 220 KNVSK-----DVFHPVFDVDQQGR-PVMRY----IDQFVQPKDFEEGVWLSE--LSDAIE 267 P +D+ G V+RY +D+ E ++ + L + I Sbjct: 291 FRFQDVDADIRYRAPAIALDEHGALTVLRYNQGVLDEMGAAFADMEALYRAHRALGERIR 350 Query: 268 TSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 + G ++ +N LHGR F P RR L Sbjct: 351 QPALCHGFRLGPGDLVVFDNHRVLHGRAAFDPSTG-RRHL 389 >UniRef50_A6C4G7 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4G7_9PLAN Length = 346 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 46/300 (15%), Positives = 92/300 (30%), Gaps = 42/300 (14%) Query: 34 ELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEG 93 ++TF + V E S + ++ + ++ + + G Sbjct: 32 QVTFDQNQLDDLQ---LALDVVLKENLSREQITPSRFPLPQVSPVIRQIQQQLETG--SG 86 Query: 94 ALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQ--YYAR---FVVKNVDNSDS 148 A + + V+ + + HL G + +G+ ++ R F V + Sbjct: 87 ACQLKRLPVEHYSAPELEILFWLISVHL-GSPVSQSANGEKIFHVRDEGFQVGQKEARGP 145 Query: 149 YLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD-------DWEHLDN 201 R+ + H D D + + + + G N L+ + + L Sbjct: 146 NTRK---RLSFHTD------RCDVIGFLCLQQARSGGNNQLVSSVSLFNEMRRRFPELTK 196 Query: 202 YFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPK---------D 252 P + N PVF V Q G ++ ++ Sbjct: 197 ILMQPFY---YLRHNVDTGNQKPFCQQPVFSV-QDGHFAGSFLRVLIERAYASPDLPDMT 252 Query: 253 FEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT--PHPDLRRELMRQ 310 ++ L +L E+ + ++ G L +NN H RD F L+R L+R Sbjct: 253 SQQREALDQLEAVAESPELSVTFRQEPGDLLFLNNWVTFHRRDDFEDADESHLKRHLLRV 312 >UniRef50_B3T5P9 Putative gamma-butyrobetaine hydroxylase n=1 Tax=uncultured marine microorganism HF4000_ANIW141K23 RepID=B3T5P9_9ZZZZ Length = 309 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 46/301 (15%), Positives = 95/301 (31%), Gaps = 30/301 (9%) Query: 27 AQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKT 86 P + T +T + E L+ +++ + + +L Sbjct: 21 ENKPERFLVKLTSKTIDEIKRNRKELGNLNESSFPELK-------NEINELKTKKILQGV 73 Query: 87 LLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVD-N 145 G L+I+ D + + + K+ + +++G ++ + K Sbjct: 74 ------GLLIIDGKSFLDFSKNE-ITKIYEIICNMLGTLYIQNINSEKIVEIKDKGKSMT 126 Query: 146 SDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEH-LDNYFR 204 Q H D + ++ D + M+ I++ +GG S + + L Sbjct: 127 LGGRYHQTKEGGSYHTDSPHWTKVPDLIGMLCINQ-AKKGGISKFVSAYTIHNQLLKEQN 185 Query: 205 HPLAR--RPMRFAAPPSK--NVSKDVFHPVFDVDQQGRPVMRYIDQFV--------QPKD 252 L F N + VF P+F + + R++ ++ P Sbjct: 186 DILKTLYEKFHFDKRGEFKINEPQTVFEPIFVF-KNDKLYCRFLIDYIVAGHQIQNYPLS 244 Query: 253 FEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRG 312 + L L + E +LS + +N LHGR F + D R+ R Sbjct: 245 KLQETALQSLEEISENENNVLSYDLKANDMTFFDNHRILHGRTEFEDYEDENRKRYFLRT 304 Query: 313 Y 313 + Sbjct: 305 W 305 >UniRef50_A8TRK7 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TRK7_9PROT Length = 390 Score = 86.0 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 65/182 (35%), Gaps = 15/182 (8%) Query: 148 SYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL---LHLDD-WEHLDNYF 203 L Q +R +E H D Y Y+L+ + + GG+S L H+ + D Sbjct: 191 VDLSQTNRELEPHADNPYRLPAPGYILLHCLR-NDADGGDSTLVDGFHVAEILRRDDPDA 249 Query: 204 RHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR-PVMRYIDQFVQPKDFEEGV----- 257 L RF V + + P+ ++ G +R+ ++ +P G Sbjct: 250 FDVLTTTATRFRYVDPDTVLEH-YGPLIELAPDGSVRRLRFNNRTEEPPALPAGRLAAYY 308 Query: 258 -WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR--RELMRQRGYF 314 + + S L + G+ L+INN LHGR + R R+ R Sbjct: 309 AARQRYATLLHASSNTLVFKLEPGQLLMINNYRLLHGRRGYALEAGGRHMRQAYLDRDCV 368 Query: 315 AY 316 A Sbjct: 369 AS 370 >UniRef50_Q5KP77 Mitochondrion protein, putative n=3 Tax=Filobasidiella RepID=Q5KP77_CRYNE Length = 575 Score = 85.2 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 36/252 (14%), Positives = 81/252 (32%), Gaps = 36/252 (14%) Query: 75 CANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQY 134 ++ + LL + G ++I V D + M++ V +IG+ + + Sbjct: 279 YSDLSESLLKVLEQLQVYGIVVIEGVPTDPTDDKECMLR---KVTDMIGK-----IRNTF 330 Query: 135 YAR-FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 Y + VK+V S + + + LH D Y + + + ++GG+S + Sbjct: 331 YGETWDVKSVKQS-KNIAYTNLNLGLHMDLLYFSSPPRFQALHCLR-NKVEGGSSYF--V 386 Query: 194 DDWEHLDNYFRHPLAR-RPMRFAAPPSKNVSK-DVFHPVFDVD-----QQGRPVMRYIDQ 246 D + + + R + + + HP+ D + + Sbjct: 387 DSFRTVSDLPRDQFEFLQKINITYQYDNDNHYFRYRHPIISSDFVRGRNNRHAAVNWSPP 446 Query: 247 FVQPKDFEE----------------GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 F + + +++ + + + + G +L +N Sbjct: 447 FRAAAEALDFPQHDFVAAAKHEQKVLQAIADFEERLSDPRYRYEFTMQEGDLVLFDNRRV 506 Query: 291 LHGRDRFTPHPD 302 LH R F D Sbjct: 507 LHARTAFRDKKD 518 >UniRef50_B5K1H8 Taurine catabolism dioxygenase TauD, TfdA family n=2 Tax=Rhodobacterales RepID=B5K1H8_9RHOB Length = 371 Score = 85.2 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 52/262 (19%), Positives = 79/262 (30%), Gaps = 37/262 (14%) Query: 70 ILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA 129 + L K L G ++ + + +A AH +G + Sbjct: 83 FPLAAFGEHILQLKQKLLSG--IGLEVLRGLPISGYSKAFAATIFCGIGAH-MGSARSQN 139 Query: 130 MSGQYYARFVVKNVDNSDSYLR--QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGN 187 G D SD R Q H D D V ++ I E +GG Sbjct: 140 AEGHILGHVRDIGADESDPNSRIYQTCERQTFHTDS------ADVVGLLCIREAR-EGGK 192 Query: 188 SLLLHLDD---------WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR 238 SLL+ + + L+ F P+A R P+ + PV + G Sbjct: 193 SLLVSAETIYNRMKQEHPDLLEKLF-DPIATD--RRGEIPNGAKPY-MEIPVLSWYE-GY 247 Query: 239 PVMRYIDQFVQPKDFEEGV---------WLSELSDAIETSKGILSVPVPVGKFLLINNLF 289 + Y Q+++ EG L + + + G + N Sbjct: 248 LTVFYQRQYIESAQRFEGAMRLSPEHVEALDMFDALANNPELCFGMQLQPGDMQFVYNHS 307 Query: 290 WLHGRDRFTPHPDL--RRELMR 309 LH R F PD RR LMR Sbjct: 308 QLHDRTGFLDWPDPSQRRHLMR 329 >UniRef50_B3RYG1 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RYG1_TRIAD Length = 438 Score = 84.0 bits (206), Expect = 6e-15, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 77/204 (37%), Gaps = 17/204 (8%) Query: 118 VAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMK 177 V LI R ++ + Y F VK+ ++ S L + +H D TY I + Sbjct: 208 VTKLIARVGYNRTTN-YGPTFQVKSKTDA-SNLAYTTGGLGMHTDLTYFNYIPGIQFLHC 265 Query: 178 IDEQNMQGGNSLLLH----LDDWEHLDNYFRHPLARRPMRFAAP--PSKNVSKDVFHPVF 231 I + +GG + L+ ++ + L++ P+++ + + H + Sbjct: 266 IK-RAGEGGENQLVDGFKVAEELRDNEPEAFKMLSKYPIQYFDIGKDFVDFHQLAQHTII 324 Query: 232 DVDQQGR-PVMRYIDQFVQP--KDFEEGVW--LSELSDAI---ETSKGILSVPVPVGKFL 283 + G + Y D P ++ V + + K +++V + G Sbjct: 325 QLHSNGSIARICYSDHARSPNLAVPQDKVMPFYDAMGTFLKYVYDPKYMVNVTLDSGDIA 384 Query: 284 LINNLFWLHGRDRFTPHPDLRREL 307 + +N +HGR F+ P+ R L Sbjct: 385 VFDNYRVMHGRSPFSLAPNSLRHL 408 >UniRef50_Q2CHG0 Putative uncharacterized protein n=1 Tax=Oceanicola granulosus HTCC2516 RepID=Q2CHG0_9RHOB Length = 269 Score = 83.7 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 62/187 (33%), Gaps = 24/187 (12%) Query: 135 YARFVVKNVDNSDSYLR--QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 R +N+ + + ++ + LH D +Y+ + VL I GG S + Sbjct: 90 QVRIDRSRAENAGAVTAYSRTNQPLALHTDSSYLAKPHPLVLFQFIRS-AADGGASTMAC 148 Query: 193 LDD-WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPK 251 DD L LAR F P P+ ++ P MRY + Sbjct: 149 ADDIVAALPPPLVETLARPQFPFGKGP---------MPILF-GRRSAPQMRYYRSQIDTA 198 Query: 252 DFEEG-------VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 E G L L + +E G+ + + N LHGR F D Sbjct: 199 AEEAGGLPQPLVTALDRLDEILEQVP-TYEFKAQPGEIVFMQNTRVLHGRRGFGGDSD-- 255 Query: 305 RELMRQR 311 R + R R Sbjct: 256 RLMYRIR 262 >UniRef50_D1UM90 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Burkholderia sp. CCGE1001 RepID=D1UM90_9BURK Length = 339 Score = 83.7 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 48/289 (16%), Positives = 89/289 (30%), Gaps = 43/289 (14%) Query: 47 EQVAEWP--VQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDD 104 +++ E +Q + + S + D A L+ + G +++ + VD Sbjct: 32 DELEELDAALQHVRHLSITDITKSDFPLDRLAASLKEAAQEIHHG--HGLVVLRGLQVDR 89 Query: 105 VKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQP-----HRVMEL 159 + D HL G + A + D +Y ++ + + Sbjct: 90 YSKDDASRIYWGVGLHL-GTPV----TQNSRAHLLGHVKDEGVTYSQKTRGYNTNAKLNF 144 Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH---------LDDWEHLDNYFRHPLARR 210 H D D V ++ + GG S D + L F Sbjct: 145 HTDNC------DIVGLLCLRTPK-SGGLSRFTSSTTIFNRILADRPDLLAPLFDGFY--Y 195 Query: 211 PMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPK--------DFEEGVWLSEL 262 ++ P D PVF D +G RY+ ++P + L+ Sbjct: 196 DLKGEGRPGAGELSDHKIPVFS-DYEGYLSCRYVRNAIEPAFAKSGEAKTDLQEEALNLF 254 Query: 263 SDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP--HPDLRRELMR 309 ++ + + G L+NN +H R F D +R L+R Sbjct: 255 DKLADSPELCFEMQFEPGDMQLLNNHVIVHSRTHFEDFEEEDKKRHLLR 303 >UniRef50_B6H119 Pc12g14480 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6H119_PENCW Length = 378 Score = 83.7 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 54/313 (17%), Positives = 103/313 (32%), Gaps = 33/313 (10%) Query: 21 FTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEY-----KSFLRFRVAKILDDLC 75 ++ Q+ R ++ + + E AL Y K L A Sbjct: 1 MSIGTRPQAWRTQDVERDSSWIIRLTPEQIEGFQHALVYAKKHPKPLLDMTQADYPLPEA 60 Query: 76 ANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHL-IGRSNFDAMSGQY 134 ++ + T R G L+ VD+ +AD V ++ +GR+ A Sbjct: 61 TKKVLQDAITTTQGR-WGMCLVKGFPVDEWSEADMRVAYWGMGLYIGVGRTQNRA----- 114 Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH-- 192 + + D SY + R + + ++ D V ++ +GG S ++ Sbjct: 115 -SEVINDVRDAGGSYKVKGGRGYNTNAGLDFHQDSADVVSLLC-RRTAKEGGTSKVMSSI 172 Query: 193 --LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH--PVFDVDQQGRPVMRYID--- 245 D L H L + +++ + ++ P+ G R Sbjct: 173 ALRDRVAELRPDLLHVLESNNWFHSFQNAQDSIQPPYYRCPLMGE-SGGYFCARTNRKNT 231 Query: 246 ---QFVQP----KDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT 298 Q P ++ L L + + + ++ + G L+N+ LH R F Sbjct: 232 IAAQRDFPDVPRLTAQQVEALDLLDIIMPSEEYCYTMELERGDMQLLNSFVTLHSRTPFE 291 Query: 299 PH--PDLRRELMR 309 + PD +R LMR Sbjct: 292 DYELPDEKRHLMR 304 >UniRef50_O75936 Gamma-butyrobetaine dioxygenase n=28 Tax=Euteleostomi RepID=BODG_HUMAN Length = 387 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 130 MSGQYYAR-FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 + +Y + V++ ++++ + + H D + L+ I + + GG+S Sbjct: 172 LYLTFYGHTWQVQDKIDANNVAYTTGK-LSFHTDYPALHHPPGVQLLHCIKQ-TVTGGDS 229 Query: 189 LLLHLDDWEHLDNYF-RHPLARRPMRFAAPPSKNVSKDV-------FHPVFDVDQQGRP- 239 + +D + +P A + + ++ D H + ++D +G+ Sbjct: 230 EI--VDGFNVCQKLKKNNPQAFQILSSTFVDFTDIGVDYCDFSVQSKHKIIELDDKGQVV 287 Query: 240 VMRYIDQ-----FVQPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLH 292 + + + F P + + + L E D + + + + + G + +N LH Sbjct: 288 RINFNNATRDTIFDVPVERVQPFYAALKEFVDLMNSKESKFTFKMNPGDVITFDNWRLLH 347 Query: 293 GRDRFTPHPDLRREL 307 GR + ++ R L Sbjct: 348 GRRSYEAGTEISRHL 362 >UniRef50_D1VL46 Gamma-butyrobetaine dioxygenase n=1 Tax=Frankia sp. EuI1c RepID=D1VL46_9ACTO Length = 409 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 76/231 (32%), Gaps = 28/231 (12%) Query: 81 PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVV 140 + GA +I V + ++ + ++ H + +N+ + R Sbjct: 128 AMATALGAVEKLGAAVITEVP----ARPGMVLTVGRSLGH-VRVTNYGELFD---VRVEP 179 Query: 141 KNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL---LLHLDDWE 197 +L + H D Y + + L+ + G +L D Sbjct: 180 DPE-----HLAYTGLALAPHTDNPYRDPVPTVQLLHCLRAAGAGGDTTLVDGFAAADRLR 234 Query: 198 HLDNYFRHPLAR--RPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVM-RYIDQFVQPKDFE 254 D L + P + P S PV VD +G R+ D+ +QP D Sbjct: 235 ETDRAAFDTLTQVWLPFSYDGPTS---VLTCRAPVISVDDEGAVTQVRWNDRGLQPPDVA 291 Query: 255 EGV------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 L+ + ++ + +S+ + G L+ +N LHGR + P Sbjct: 292 PDRIGAVYRALAAFGEVLDEADLAVSLRLVPGDCLIFDNTRVLHGRSAYGP 342 >UniRef50_B3SBK5 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3SBK5_TRIAD Length = 441 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 85/242 (35%), Gaps = 28/242 (11%) Query: 81 PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVV 140 L L G ++ + + D ++ +A +A++ + G+Y F V Sbjct: 190 ALYKWISLIHRYGVAIVKGTPI----EKDFILDMAERIAYV-----KETSYGKY---FDV 237 Query: 141 KNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ---GGNSLL---LHLD 194 +S+L R ++ H D Y E ++ + + GG + LH+ Sbjct: 238 VVEPKPNSHLAFSARGLDHHTDMNYRENSPGLQMLHCLKNNHDVSNPGGRTFFVDGLHVV 297 Query: 195 DW-EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQG-----RPVMRYIDQFV 248 W + + L ++F + + + P+ +D+ G R + Sbjct: 298 SWLKKHHPSAFYTLCSIEVKFELTTE-SFTYNQTKPIICLDKDGNFSEIHVNNRTMGPIQ 356 Query: 249 QPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRE 306 PK+ + + L I + S+ + G + NN LHGR F + R Sbjct: 357 APKETINPFYSAYNILMRKIRDPELQYSLGLQPGDLVAFNNRRVLHGRSAFNSDR-VSRH 415 Query: 307 LM 308 L+ Sbjct: 416 LV 417 >UniRef50_UPI0000521F63 PREDICTED: similar to CG14630 CG14630-PA n=1 Tax=Ciona intestinalis RepID=UPI0000521F63 Length = 404 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 81/243 (33%), Gaps = 32/243 (13%) Query: 80 QPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFV 139 + +L G ++I V + + +K+ ++ G+ + Sbjct: 139 EGMLNWMKQVIDIGFVVIQNVPL----EEGACIKVGEKIS-----PVLQNTYGKLFDVLD 189 Query: 140 VKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNM-QGGNSLLLHLDDWEH 198 ++ +N + + LH D + E L+ I + QGG SLL +D + Sbjct: 190 NQSTEN----IANSQIWLPLHTDQCHYEAGPGVQLLHAIQFDDCVQGGESLL--VDMFYV 243 Query: 199 LDNYFRH------PLARRPMRFAAPPS--KNVSKDVFH-PVFDVD-QQGRPVMRYIDQFV 248 L+ + + L++ P+ F KN PV D + Sbjct: 244 LETFRKEFPEDFNILSKVPVPFGTIDYQRKNPCYLYTRKPVIVTDYDDQIVGFNFNKGIE 303 Query: 249 QPKDFEEGV------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 +P ++L I ++ + + G+ L NN LH R+ + + Sbjct: 304 EPLRIHAKYVEKFYQAYNKLDRMINRNEFVFKHRLRTGELLFFNNRRMLHSREAYMSNGG 363 Query: 303 LRR 305 RR Sbjct: 364 RRR 366 >UniRef50_Q13PB1 Putative uncharacterized protein n=1 Tax=Burkholderia xenovorans LB400 RepID=Q13PB1_BURXL Length = 337 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 82/247 (33%), Gaps = 25/247 (10%) Query: 93 GALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQ--YYARFVVKNVDNSDSYL 150 G ++I+ + +DD + D + ++ ++ R + G+ Y R K N Sbjct: 95 GFVIIDKLPLDDYSKDDAK-NIYWLLSQMVARPVAQSWDGKMVYDVRDQGKPPGN-GVRP 152 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH--------LDDWEHLDNY 202 + H D +Y +YV ++ + ++GG S ++ ++ L Sbjct: 153 DVTNAEQNFHTDNSYNLYPPEYVALLCLQP-ALEGGISSVVSFYTVYNQMIERHPELLAR 211 Query: 203 FRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYI--------DQFVQPKDFE 254 P R AP K + HP+F + G R D E Sbjct: 212 LYQPYIFDRQREHAPA---DPKLISHPLFH-HEDGHLRCRLSHVHVVNGYRMAGSTLDAE 267 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 L L + + G+ +++N H R F +P+ R+ R + Sbjct: 268 GLEALETLEAVMRERQWCREFFFEPGQIQIVDNQRCGHRRTGFVDYPEAERKRHLVRLWL 327 Query: 315 AYASNHY 321 A + Sbjct: 328 RDAGRRF 334 >UniRef50_A4KCF3 TMC biosynthetic enzyme L5 n=2 Tax=Streptomyces RepID=A4KCF3_9ACTO Length = 340 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 45/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 50 AEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQAD 109 V K A +L+ ++ G L+ V V D + + Sbjct: 21 DALDVLLKSGKPNFAASPADFPLPTLGPRLRGIVDSIE--NEPGFALVRGVPVGDKSEDE 78 Query: 110 EMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSY-LRQPHRVMELHNDGTYVEE 168 +L + IG + + +++ + + + ++ + H D T Sbjct: 79 VR-RLYWGLGMYIGVPMIQNNNDS--SMVDIRDERRAGKLRVHKSNQHIGFHIDST---- 131 Query: 169 ITDYVLMMKIDEQNMQGGNSLLLHLDDWEH-----LDNYFRHPLARRPMRFAAPPSKNVS 223 D V ++ QGG SL++ + P A P Sbjct: 132 --DVVTLLC-RRAASQGGTSLVVSAEAVRREMSWECPELLSALYEPLPFADVASPDDERP 188 Query: 224 KDVFHPVFDVDQQGRPVMRY-----IDQFVQP----KDFEEGVWLSELSDAIETSKGILS 274 PVF + G R+ + P + ++++ + + Sbjct: 189 DVFLSPVFGRHE-GLTTTRFYIRRVLRSQDNPDAPRLTERQLEAINKVEEIAARPGLVTP 247 Query: 275 VPVPVGKFLLINNLFWLHGRDRF-TPHPDLRRELMRQRGYFA 315 + G +INN LHGR F + P R L+R +F+ Sbjct: 248 MQFEPGDLQMINNHLVLHGRTAFASEEPGEGRHLLRM--WFS 287 >UniRef50_B0C2I2 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C2I2_ACAM1 Length = 345 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 88/258 (34%), Gaps = 36/258 (13%) Query: 79 LQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARF 138 L +++ L + G +L++ V + ++ + L + +++G R Sbjct: 72 LLANVMRYHLTQTTGVVLLSGFDVAEYGESASRLLLLQ-LGYVLGS--------VLDKRG 122 Query: 139 VVKNVDNSDSYLRQPHRVME-------LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 ++ +V + RQ + + H D T + + V ++ + G N L Sbjct: 123 LLYDVCDRGQDHRQANVLFSSTCTSPGYHTDSTDADLMPGIVSLLCLRTAREGGVNRLAN 182 Query: 192 HLDDWEHLDNY-------FRHPLARRPMRFAAPPSKN--VSKDVFHPVFDVDQ--QGRPV 240 L ++ L P R + +++ PVF+ + G Sbjct: 183 TLTAYQRLLRSQPEVLRRLCEPFIRDKIIVGEVGTRSHLDRLRNAFPVFEWGRWYPGLTC 242 Query: 241 --MRY-----IDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHG 293 MRY D+ P E+ L + + + + G + +NN H Sbjct: 243 RYMRYWIEAGHDKAGLPLSEEDLFVLDQFEATLNQEDITYQLQLQPGDMIFLNNHLIAHD 302 Query: 294 RDRFTP--HPDLRRELMR 309 R + P+ +R L+R Sbjct: 303 RTEYLDWEEPEKKRHLVR 320 >UniRef50_Q9NF72 EG:BACR7A4.9 protein n=10 Tax=Drosophila RepID=Q9NF72_DROME Length = 504 Score = 82.1 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 59/301 (19%), Positives = 98/301 (32%), Gaps = 47/301 (15%) Query: 32 LLELTFTEQTTKQFLEQVAEWPVQ---ALEYKSFLR-FRVAKILDDLCANQLQPLLLKTL 87 L E F E K +LE+V + P Q E++ R F+ +L Q L + Sbjct: 204 LRERDFGETGRKHYLEEVYKPPAQLWGKTEFEDVKREFQYEDVL-----EQDAALRVWLE 258 Query: 88 LNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSD 147 +G ++ + KLA + ++ + D F VK+ N+ Sbjct: 259 ALAVQGFAILKGAP----NDINVAKKLAERIGYIKRTTYGDV--------FEVKSKPNAR 306 Query: 148 SYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----LDDWEHLDNYF 203 +Y + LH D Y E ++ + + +GG + L + L Sbjct: 307 NY-AYLMTPLPLHTDMPYYEYKAGINILHTLVQSESKGGANTLTDGFNVASQLQKLHPED 365 Query: 204 RHPLARRPMRFAAP-----PSKNVSKDVFHPVFDVDQQGRPV----------MRYIDQFV 248 L P+ + SK PV +D GR R+ Sbjct: 366 FEVLKSVPVNWFDIGHDGDDSKPFHSLWRAPVICLDVDGRFARINQNTTKRDSRFSVSLA 425 Query: 249 QPKDFEEGVWLSELSDAIETSKGI-LSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 Q W +E ++ + G + NNL LHGR + P +R L Sbjct: 426 Q-----AVSWYKAYDKFLEIAQSEAVEFKTQAGDVFVFNNLRMLHGRTAYEDAPGNKRHL 480 Query: 308 M 308 + Sbjct: 481 V 481 >UniRef50_C3YQN6 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3YQN6_BRAFL Length = 2149 Score = 81.7 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 59/171 (34%), Gaps = 21/171 (12%) Query: 147 DSYLRQ----PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNY 202 S RQ + LH D T+ E ++ ++E + GG S+L +D + + Sbjct: 1957 TSDQRQGAGYTTEALPLHTDNTHFNEPAGLLVSHMLEEGD-SGGTSVL--VDGFHAAERL 2013 Query: 203 FRHPLARRPMRFAAPPSKNVSK----DVFHPVFDVD---QQGRPVMRY-------IDQFV 248 + + + P + P+ +++ ++ ++RY +D Sbjct: 2014 RQDDPEGFDVLSSVPVPHFLEHSLDITGTGPIVELEPGPRKELRMIRYSDYVRGSLDTIP 2073 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 L + + + G+ LL+ N LHGR FT Sbjct: 2074 MEAVGRWYSALRRFTTYLRDPANAYWFQLKPGQILLMYNWRLLHGRSAFTG 2124 >UniRef50_B7S2Z5 Taurine catabolism dioxygenase TauD, TfdA family n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S2Z5_9GAMM Length = 370 Score = 81.7 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 76/248 (30%), Gaps = 31/248 (12%) Query: 74 LCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQ 133 LC L+ + G ++ G++D QA E + LIG Sbjct: 122 LCRESGSTLVDALQEAKRHGLVIF--EGLEDDDQAGEN------LGELIGFKRRTNFGTT 173 Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 + + L + LH D E++ Y + + ++ GG S+ Sbjct: 174 FEVVNKPSPNN-----LAYTALPLPLHTDLPNQEQVPGYQFLHCLR-NSVTGGASVF--A 225 Query: 194 DDWEHLDNYF-RHP-----LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR-PVMRYIDQ 246 D + + + P L+ + + N + ++D G + + Sbjct: 226 DGFRICTDLRAQAPDDFDLLSGLQIPWRFHDE-NDDVRFRRSIIELDSMGDLSGLAFNAH 284 Query: 247 FVQPKDFEEGV------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 D L I + + + G+ ++ +N LHGR+ F P Sbjct: 285 IADVADLPSSQLLDFYQAYQGLMQRIREPQYRICHTLGPGEMVMFDNRRVLHGREGFDPG 344 Query: 301 PDLRRELM 308 R L Sbjct: 345 SG-ERHLR 351 >UniRef50_Q5A0G4 Potential gamma-butyrobetaine hydroxylase n=2 Tax=Candida albicans RepID=Q5A0G4_CANAL Length = 407 Score = 81.3 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 61/326 (18%), Positives = 108/326 (33%), Gaps = 29/326 (8%) Query: 1 MNALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAE-WPVQALEY 59 +NA V+++++ +G LT S LE T + +F ++ + W Q LE Sbjct: 71 INAPPVVEDSSLKIQWSNNG-KLTNSVYPVSFLENYSTNKRLGKFFDKDRKLWDKQELE- 128 Query: 60 KSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQ--ADEMVKLATA 117 +F + D L + G +N + + D + Sbjct: 129 NNFASLNM-DYDDILTND--NSFFQTLYNLNRYGLTFVNNIPTPQISDMTEDNATQWPV- 184 Query: 118 VAHLIGRSNFDAMSGQYYA-RFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMM 176 + I F + +Y F VKN + + + + LH D Y E L+ Sbjct: 185 --YKIAEK-FGYIKKTFYGTLFDVKNKKEKATNIAYTNTFLPLHMDLLYYESPPGLQLLH 241 Query: 177 KIDEQNMQGGNSLL----LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD 232 I + + GG ++ L + D L + P+ F + N P+ Sbjct: 242 AI-QNSTLGGENIFCDSYLAAEHVRKTDPRAYTALTQTPITFHY-DNNNEYYYYKRPLIV 299 Query: 233 VDQ---QGRP---VMRYIDQFVQPKDFEE----GVWLSELSDAIETSKGILSVPVPVGKF 282 D G P + Y F P + + + I + +P G Sbjct: 300 EDPEVGDGFPKIASINYAPPFQGPFEVDPHPDFIRGMQLFETFINDPANHFEIKMPEGTC 359 Query: 283 LLINNLFWLHGRDRFTPHPDLRRELM 308 ++ N LH R+ F+ + R LM Sbjct: 360 VIFENRRALHSRNAFSDSNNGDRWLM 385 >UniRef50_Q45R77 Possible hydrolase n=1 Tax=Streptomyces fradiae RepID=Q45R77_STRFR Length = 319 Score = 81.3 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 63/202 (31%), Gaps = 11/202 (5%) Query: 110 EMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEI 169 ++L +A+ + N+ S +E+H + + Sbjct: 104 LSMELGDVIAYR-NEKQGALVQNVVPVPGREGQQSNAGSV------PLEMHTENAFHPHR 156 Query: 170 TDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH 228 DYV + + + + + + +HLD R L + PPS Sbjct: 157 PDYVGLFCVRSDHDRAAGLRVASVRAVMDHLDAGTREMLRQPLFTTEPPPSFGRPDSGTK 216 Query: 229 P-VFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINN 287 P P +R D + L++A+ T + + + ++N Sbjct: 217 PHAVLTGDAEDPDIRVDFHATHTSDPWGRQAMEALAEAVRTVS--EELVLEPADLVYVDN 274 Query: 288 LFWLHGRDRFTPHPDLRRELMR 309 LHGR F P D + ++ Sbjct: 275 RVALHGRTAFVPRYDGQDRWLQ 296 >UniRef50_C3K215 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3K215_PSEFS Length = 315 Score = 81.0 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 83/254 (32%), Gaps = 30/254 (11%) Query: 71 LDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAM 130 ++ L L L + + ++ G + K + T L+G Sbjct: 48 INPLQLPVLSEELDRIRTMIDHESHVLILEGFEVDKLESFKSLIWT-FGSLLGVPMVQNH 106 Query: 131 SG----QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGG 186 G + Y R + + Q + +HNDG DY+++ ++ + GG Sbjct: 107 LGHKVIEVYDRGAKSIEE--GARYHQTRQGAYVHNDGVSDPLPIDYLILAC-GQKALLGG 163 Query: 187 NSLLLHLDD--------WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR 238 S+L+ E L+ F + K P+ +G Sbjct: 164 ESILIDASAVYAELMTFPEILEELKCD------FFFENRGMSDEEKLFKAPILSFSNEGI 217 Query: 239 PVMRYIDQFVQ--------PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 P++RY +++ P + L+ L ++ S V + G+ L+ + + Sbjct: 218 PLIRYFRVYIESAHLKAGVPLTLAQSQALNFLDTVLDQSSVQHRVLLEPGQILISADNKF 277 Query: 291 LHGRDRFTPHPDLR 304 LH R F R Sbjct: 278 LHTRTHFIDTNTPR 291 >UniRef50_C0SMX2 Predicted hydroxylase n=1 Tax=Streptomyces spiroverticillatus RepID=C0SMX2_9ACTO Length = 353 Score = 81.0 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 46/261 (17%), Positives = 84/261 (32%), Gaps = 35/261 (13%) Query: 68 AKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNF 127 A +L+ +L G L+ + V + + +L + +G Sbjct: 52 ADFPLPRVGPKLRCILRSLED--EPGFALVRGIPVAGKSEHEVR-RLYWGLGMYLGVPLI 108 Query: 128 DAMSGQYYARFVVKNVDNSDSY-LRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGG 186 S + +++ + + ++ + H D T D V ++ GG Sbjct: 109 QNNSDSHMV--DIRDEGRAGRLRVHSSNQHIGFHIDST------DIVGLLC-RRAAAHGG 159 Query: 187 NSLLLHLDD--------WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR 238 SL++ + + L PL P A P ++ PVF + GR Sbjct: 160 TSLVVSAEAVRREISWVYPDLLPALYEPL---PFADVASPDEHHPDFFLCPVFGRHE-GR 215 Query: 239 PVMRY--------IDQFVQP-KDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLF 289 R+ D P + + + + + ++ G LINN Sbjct: 216 TTTRFYLRRILRSQDNSAAPRLTDRQRRAIEAVEEIAARPDLVTAMEFQPGDLQLINNHL 275 Query: 290 WLHGRDRF-TPHPDLRRELMR 309 LHGR F + D R L+R Sbjct: 276 VLHGRTTFDSEEADTGRHLLR 296 >UniRef50_B5GF41 Putative uncharacterized protein n=2 Tax=Streptomyces RepID=B5GF41_9ACTO Length = 334 Score = 80.6 bits (197), Expect = 7e-14, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 64/189 (33%), Gaps = 14/189 (7%) Query: 124 RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNM 183 + N+ S L +E+HN+ + DYV ++ + E Sbjct: 132 EKTGALVQNVVPVPGQETLQSNAGSVL------LEMHNENAFHPNRPDYVGLLCVREDPT 185 Query: 184 QGGNSLLLHLDD-WEHLDNYFRHPLARRPMRFAAPPSKNVSKDV--FHPVF-DVDQQGRP 239 + L R L+ APPS + V H V + Sbjct: 186 GQARLCTASVRRALPLLSAQARQVLSGERFLTEAPPSFEALESVVPAHAVLQGAGEDPDI 245 Query: 240 VMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 ++ + P D E V ++EL DA+ L++ G +++N +HGR FTP Sbjct: 246 LVDF--SATHPLDDEARVAMAELRDALVQVSSALALR--AGDLAIVDNRLAVHGRTPFTP 301 Query: 300 HPDLRRELM 308 D + Sbjct: 302 RYDGTDRWL 310 >UniRef50_Q7N4X1 Similar to hypothetical gamma-butyrobetaine n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N4X1_PHOLL Length = 335 Score = 80.2 bits (196), Expect = 9e-14, Method: Composition-based stats. Identities = 48/302 (15%), Positives = 94/302 (31%), Gaps = 43/302 (14%) Query: 28 QSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTL 87 +S + + L +++ + F + A+E + F + + + + + Sbjct: 21 ESKKDVLLPVSDEQIEAF-----RHHLSAMEDRPSEAFNASDFSFEEITILQERIHQRLT 75 Query: 88 LNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARF-VVKNVDNS 146 R G ++++ + + + +GR G KN N+ Sbjct: 76 EGR--GVVVVSGIPREMFTDSILSHLFWGI-GTGLGRPVVQNSQGHRIGHVRNEKNNPNN 132 Query: 147 DSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL--------DDWEH 198 Y+ +R + H+D + V +M + E G ++ L + E Sbjct: 133 RGYM--SNRELGFHSDAF------EIVGLMCLREAASGGLTQIVSGLAIYNQMLREKPEL 184 Query: 199 LDNYF--RHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMR---YIDQFVQ---- 249 LD F H P + P+F M Y+ + Sbjct: 185 LDALFEGYHYATAERSSSKLPYT-----SYKIPIFSKMSGRVSSMCLGAYMRAAAKLQGL 239 Query: 250 --PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 P + G L + + L + G+ L +NN LH R F +R L Sbjct: 240 ALPDALDAG--LHAFYEICSRPEFRLEFMLEPGEILFLNNYTTLHSRTEFQDDALNQRHL 297 Query: 308 MR 309 +R Sbjct: 298 LR 299 >UniRef50_Q1AWV7 Putative uncharacterized protein n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWV7_RUBXD Length = 316 Score = 80.2 bits (196), Expect = 9e-14, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 54/153 (35%), Gaps = 8/153 (5%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPP 218 H + DY+ +M + + + +DD E LD PL F P Sbjct: 146 WHTEDARYSYRGDYIGLMCLRNPDAV--PTTYASIDDIE-LDPERAAPLFEPRFVFRPDP 202 Query: 219 SKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQ--PKDFEEGVWLSELSDAIETSKGILSVP 276 S P +R+ D + P+D E + L+ ++ + + V Sbjct: 203 SHPTDTGCERASILFGDPSSPYLRF-DPYSMDRPEDEEARAAMDYLAGELD--RRLTGVA 259 Query: 277 VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 + G+ L I+N +HGR F D ++ Sbjct: 260 LRPGECLFIDNYKVVHGRSAFKARFDGTDRWLK 292 >UniRef50_A4S5K2 Predicted protein n=2 Tax=Ostreococcus RepID=A4S5K2_OSTLU Length = 342 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 71/257 (27%), Gaps = 28/257 (10%) Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 L+ + + + R G L+ V + A+ +G Sbjct: 42 NFSLPTLGPTLEDMGRELVHGR--GFALMRNFPVHRYSSWERCAAFY-AMGRYMGTCVPQ 98 Query: 129 AMSGQYYARFVVKNVDNSD--SYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGG 186 G D +D + L H D D V ++ + + G Sbjct: 99 NKLGHVVGHVKDLGGDPNDPLTRLYTTSAAQPYHTDS------ADIVGLLCLSQATEGGH 152 Query: 187 NSLLLHLDDWEHLDNYFRHPLARRPMRF------AAPPSKNVSKDVFHPVFDVDQQGRPV 240 + + + W L F A F P K + + P+F + GR Sbjct: 153 SQVTSSVAIWNALVERFPESAATLQKEFVVSRKGEVPVGKEATYKI--PIFHAHE-GRCA 209 Query: 241 MRYIDQFV--------QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLH 292 Y F+ + L L + + L + + G ++N LH Sbjct: 210 AIYDRSFINSAVALTGVELSKTQTAALDHLDALACSDELRLDMTLEPGDIQWLHNHTTLH 269 Query: 293 GRDRFTPHPDLRRELMR 309 R F R L+R Sbjct: 270 ARSAFKNEGAEPRHLVR 286 >UniRef50_UPI0000E48C37 PREDICTED: similar to gamma butyrobetaine hydroxylase, partial n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E48C37 Length = 318 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 49/289 (16%), Positives = 94/289 (32%), Gaps = 34/289 (11%) Query: 23 LTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPL 82 ++P R L L ++T + Q+ P + + LRF ++++D + L+ Sbjct: 8 VSPFPS--RWLYLNRFDKTNFDPVSQLVPKPWGSEQVNELLRFDYKEVMED--SRVLRDW 63 Query: 83 LLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKN 142 L ++ G L+ K+ + + V HL Y Sbjct: 64 LRSLVV---SGIALLTGAP----KETGVIESIGKRVGHL---------RTTMYGHTFEVL 107 Query: 143 VDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL----DDWEH 198 S S L + LH D E ++ I + GG S +D + Sbjct: 108 AIASSSNLAYTTLKLGLHVDLPLYEVPPSVQMLHCIKQCKTVGGESQFCDALKVTNDLKE 167 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVF--HPVFDVDQQGRP-VMRYIDQFVQPKDFEE 255 D F + L R + + P+ ++D +G+ + + + P Sbjct: 168 SDPEFYNTLTRVKVDIRLRGKDYIPYHFQYARPIIELDDEGKFKAITHNNGVRAPYMNLP 227 Query: 256 GV-------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 L+ L + + ++ + G + NN +HGR F Sbjct: 228 VADVKTWYKSLACLDGKLNAKENMIQFKLKEGDVVTFNNNRVMHGRGSF 276 >UniRef50_A6SL62 Putative uncharacterized protein n=2 Tax=Sclerotiniaceae RepID=A6SL62_BOTFB Length = 467 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 84/266 (31%), Gaps = 44/266 (16%) Query: 57 LEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLAT 116 L+Y SF D N + L ++ R G L++ V + D ++ Sbjct: 154 LQYVSF----------DDYINSEEGLFRALIMLRDYGLLILRDVPESETSVVDIAKRIGN 203 Query: 117 AVAHLIGRSNFDAMSGQYYA-RFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLM 175 + +Y + VK+V + + + LH D Y+ + + Sbjct: 204 -------------LRDTFYGVTWDVKSVPQP-KNVAYTSQYLGLHMDLLYMANPPGFQFL 249 Query: 176 MKIDEQNMQGGNSLLLHL-DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVD 234 + GG+S+ LD L + + + + HPV + Sbjct: 250 HCLR-NTCSGGSSIFSDAFHAARQLDRDNYIQLCTKKVGYHYRNAGE-HYHFKHPVISIH 307 Query: 235 Q--------QGRPVMRYID-------QFVQPKDFEE-GVWLSELSDAIETSKGILSVPVP 278 ++YI+ F +P L + + +E + + + Sbjct: 308 SKKGGDASSPSDNNIQYINYSPPFQATFDKPFGSLPIARALRQFASRVEAPENMYEYRLQ 367 Query: 279 VGKFLLINNLFWLHGRDRFTPHPDLR 304 G+ ++ NN LHGR F R Sbjct: 368 EGECVIFNNRRVLHGRKEFDTSAGER 393 >UniRef50_P80193 Gamma-butyrobetaine dioxygenase n=18 Tax=Proteobacteria RepID=BODG_PSESK Length = 383 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 73/235 (31%), Gaps = 28/235 (11%) Query: 82 LLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVK 141 LL L R G ++ V + ++ LA ++ I SNF + F V+ Sbjct: 144 LLEWLLAVRDVGLTQLHGVPT----EPGALIPLAKRIS-FIRESNFGVL-------FDVR 191 Query: 142 NVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDN 201 + ++DS + LH D E + + + GGNS +D + + Sbjct: 192 SKADADSNAYTAF-NLPLHTDLPTRELQPGLQFLHCLV-NDATGGNSTF--VDGFAIAEA 247 Query: 202 YFRHPLARRPMRFAAPPS-----KNVSKDVFHPVFDVDQQGRPV-MRYIDQFVQPKDFEE 255 A + P ++ PV +D G +R + P + Sbjct: 248 LRIEAPAAYRLLCETPVEFRNKDRHSDYRCTAPVIALDSSGEVREIRLANFLRAPFQMDA 307 Query: 256 GV------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 + + + G+ +N LH RD F P R Sbjct: 308 QRMPDYYLAYRRFIQMTREPRFCFTRRLEAGQLWCFDNRRVLHARDAFDPASGDR 362 >UniRef50_B6GWY4 Pc12g00050 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6GWY4_PENCW Length = 482 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 60/190 (31%), Gaps = 28/190 (14%) Query: 130 MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL 189 + +Y + + ++ + H D Y+ E Y L+ + E + GG SL Sbjct: 249 LRNTFYGSTFDVRTVPEATNVAYTNQFLGFHMDLMYMNEPPGYQLLHCL-ENSCSGGESL 307 Query: 190 LLHL--------DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR--P 239 D + R R K+ PVF+ D Sbjct: 308 FADTFSAAKVMKDRYPEDYRVLRDQRLGYEYRH-----KDHIYYNERPVFEHDSDTDELR 362 Query: 240 VMRYIDQFVQPKDFEEGV------------WLSELSDAIETSKGILSVPVPVGKFLLINN 287 + Y F P G L++L+ I+ K I + + G ++ +N Sbjct: 363 HVNYSPPFQSPLPPRHGNGHDAEPVNKLRDALAKLTSIIDNQKHIFELRLNPGDCVIFDN 422 Query: 288 LFWLHGRDRF 297 +H R +F Sbjct: 423 RRIVHARRQF 432 >UniRef50_C1YTV8 Taurine catabolism dioxygenase TauD, TfdA family n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YTV8_NOCDA Length = 286 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 57/160 (35%), Gaps = 11/160 (6%) Query: 161 NDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFR-HPLARRPMRFAAPPS 219 + + + D ++++ + GG++ L R LA + + Sbjct: 119 TELGFHCDTCDVLVLLCLQPAVDGGGDTKLASARTVRETVAGERPDVLATLERDWTFDRT 178 Query: 220 KNVSKDVF-HPVFDVDQQGRPVMRYIDQFVQ--------PKDFEEGVWLSELSDAIETSK 270 + V P+ V + G Y + V+ P E+ L L + + + Sbjct: 179 GRAGQQVVVSPILFVQEDGSVGCYYQPRTVRTSPERGGPPLSEEQWEALGFLDEVLYRPE 238 Query: 271 GILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR-RELMR 309 + + G+ LLI N +HGR + P R R ++R Sbjct: 239 IAFRLRLEAGELLLIRNNRVMHGRSPYVDVPGPRARRVLR 278 >UniRef50_A8TJ02 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TJ02_9PROT Length = 360 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 82/237 (34%), Gaps = 21/237 (8%) Query: 93 GALLINAVGVDDVKQADEMVK-LATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR 151 G L+ + +++ + + A + + R A R + ++ Sbjct: 77 GVTLVKGLPREELSAEEFRLLNWAIGLNLGVARPQGKASQYMSEVRATGTDYRSAGGRGY 136 Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRH--PLAR 209 + ++ H D D V + ++ G + + + W+ L +AR Sbjct: 137 SSNAGLDFHVDSC------DLVTLACYNKAKAGGQSMVSSSVTAWQILVAERPDLAEVAR 190 Query: 210 RPMRFAAPPSKNVSKDVFH--PVFDVDQQGRPVMRYIDQFVQ---------PKDFEEGVW 258 + F+ + + F+ P+FD + GR ++ V+ P + Sbjct: 191 QDFHFSRNQEEAADETPFYGQPLFDF-EGGRLFCKWNRNRVRTAQDLEGVPPMSQAQRDC 249 Query: 259 LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFA 315 L ++ + + ++ + G ++NN LH R F + R+ + R + A Sbjct: 250 ADLLDAILQRPEVMFTMWLEPGDLQIMNNHVMLHSRTAFEDFAEPERKRLLYRLWLA 306 >UniRef50_A8TTL2 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TTL2_9PROT Length = 385 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 49/236 (20%), Positives = 82/236 (34%), Gaps = 26/236 (11%) Query: 81 PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVV 140 LL L R G +++ V + D + + A GR NF+ +S Sbjct: 140 SLLDYHLTVRNTGFVIMRNVPIRDWVCEEVARRTAFTRETNFGR-NFEVISR-------- 190 Query: 141 KNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----LDDW 196 N+ +Y H + H D E + + E GG SLL+ + Sbjct: 191 -PNPNNQAY---THDALLAHTDLANREMPPGVQFLHCL-EFGATGGESLLVDGFHAAEQL 245 Query: 197 EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRP-VMRYIDQFV----QPK 251 +D L R + F + + P+ +D GR +RY P Sbjct: 246 RAVDPQAWEVLTRVGLPFRFHDADCDVRWKGTPI-ALDADGRYHEVRYNPGLGAALDVPA 304 Query: 252 DFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRR 305 + V+ L + ++ +L + G ++ NN LHGR F P+ R+ Sbjct: 305 TQVKQVYRALHAFAARLKDPANVLQFKLQAGDMMVFNNRRVLHGRAAFDPNTGPRK 360 >UniRef50_C7G1S1 Putative dioxygenase n=1 Tax=Streptomyces griseus RepID=C7G1S1_STRGR Length = 293 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 49/271 (18%), Positives = 79/271 (29%), Gaps = 25/271 (9%) Query: 68 AKILDDLCANQLQ----PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIG 123 A D CA + + + E + + A A A +G Sbjct: 17 ADFSDPRCAQPVDFAEHGAPNQVVNILEERGFAVVTMAEPGPPDAK---LTALAQTLRLG 73 Query: 124 RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVME------LHNDGTYVEEITDYVLMMK 177 A+ + + + S + H H DG ++ Sbjct: 74 DPYIPALYRYAETQDYSASFSDIRSDTKDQHPGFSTTAGQAWHVDGLLDAIGDIRTTVLY 133 Query: 178 IDEQNMQGGNSLLLHL----DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDV 233 +GG +LL + + D+ L + PVF V Sbjct: 134 CVRAAHRGGETLLFNSLAAFAELRERDSAAAEALLSPRALNRRSTLPAIDMSNTGPVFSV 193 Query: 234 DQQGRPVMRYIDQFVQPKDFEEG------VWLSELSDAIETSKGILSVPVPVGKFLLINN 287 D+ G RY D +F G L+ A + + L+V + G+ L+ N Sbjct: 194 DEAGTLATRYTDNDTCTWNFSAGPPGGLRRALAFFRKASDNPRYRLAVRLAAGEALIFRN 253 Query: 288 LFWLHGRDRFTPHPDLRRELMRQRGYFAYAS 318 HGR + P RR L R +A A Sbjct: 254 DRLSHGRRPYEDSPGARRHL--IRALYAEAP 282 >UniRef50_Q7N1E8 Similarities with putative oxygenase and clavaminate synthase 1 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N1E8_PHOLL Length = 326 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 103/307 (33%), Gaps = 30/307 (9%) Query: 21 FTLTPSAQSPR-LLELTFTEQTT--KQFLEQVAEWPVQALE-------YKSF---LRFRV 67 + ++ +S + EL TEQT +QFLE ++ + E Y+SF +R + Sbjct: 7 YMVSDIQESNDFVFELDSTEQTYLKQQFLESTLKYDIDNFEEFYWKCFYQSFNLPVRLQK 66 Query: 68 AKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHL------ 121 N+ L+ + ++ + + A ++T ++ + Sbjct: 67 KLFDFRALENENYLLIKGLPIPDDLCLTPLSYEQTNSSEVAGVRTLISTILSRIGYIYSF 126 Query: 122 IGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQ 181 + + NF+ + + D L +E H + + + D+V + + Sbjct: 127 VNKKNFNFIDDVFPM------EKYKDMQLGTNKEFLEWHVEDGFHDAKADWVALYCLRGD 180 Query: 182 NMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVM 241 + L + HLD L + P+ + V + Q P M Sbjct: 181 KNA--ETYLFQTKNI-HLDAETIAELQKSNFEIDVDPTFVSNVSGRRCVAVLSQHKEPEM 237 Query: 242 RYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + +++ + L +L+ I + + G LL +N H R + P Sbjct: 238 IFDPAYMRCLTPKAEAALEKLNQCINE--NREAFILSPGDMLLFDNRKVAHARSEYEPRY 295 Query: 302 DLRRELM 308 D + Sbjct: 296 DGYDRWL 302 >UniRef50_B2AD39 Predicted CDS Pa_3_10500 n=1 Tax=Podospora anserina RepID=B2AD39_PODAN Length = 483 Score = 78.6 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 49/302 (16%), Positives = 94/302 (31%), Gaps = 44/302 (14%) Query: 33 LELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAE 92 +T ++ K+ ++ + L S +L+ + + Sbjct: 149 YTITLDDREIKEVRSALSHFNELDL---SGSEVSTTTFPLPTLGPKLR----QAAEDVHN 201 Query: 93 GALLINAVGVDD-----VKQADEMVKLATAVAHLIGRSNFDAMSGQYYAR---FVVKNVD 144 G + G+ D D+++ +++ G +G + + Sbjct: 202 GKGFVVVRGIRDMQPGEFSPEDKIIIFLGISSYIAGARGRQDENGNMLSHIRNAKLSKTP 261 Query: 145 NSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL----DDWEHLD 200 S R R H D D + + M+GG +L+ + + Sbjct: 262 QSQRPTRYSSRASTFHTD-----TFCDILALQS-RSNAMEGGATLVSSTWTVYNKLQKEH 315 Query: 201 NYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPK----DFEEG 256 LAR + + P+ G+ +M + +P D + Sbjct: 316 PDLCELLARP--IWPFDSRGSFFPCSTRPLLYHH-DGKVMMNFAR---EPLLGLEDVKRK 369 Query: 257 VWLSELSD----AIE-----TSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 L LS A+E ++G +S+ G L INN LH R+ FT + R L Sbjct: 370 AGLPVLSQGQKNALEIVERLATEGQISINTEPGDLLFINNHGVLHSREEFTDAVENPRYL 429 Query: 308 MR 309 +R Sbjct: 430 VR 431 >UniRef50_UPI0000586B6F PREDICTED: hypothetical protein n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000586B6F Length = 481 Score = 78.6 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 65/193 (33%), Gaps = 19/193 (9%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 Y + F V+N+ S S L + LH D Y + + + + ++GG S + Sbjct: 264 YGSDFRVENIFESSS-LGFTTAALGLHLDLPYYDYRPGVQFLNCLRQCEVKGGESQFVDA 322 Query: 194 DDW-EHLDNYFRHPL-----ARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRP-VMRYIDQ 246 E L + R + + ++D+QG + Y DQ Sbjct: 323 KRVAETLKKEEPEWYEYMTNVKLDFRLLGIDYIDSHLQHARNLIELDEQGEFKTLAYNDQ 382 Query: 247 FVQPKDFEEG-------VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT- 298 P L + ++ + + + + G+ + +N +HGR +T Sbjct: 383 TRSPYMNVPVEEVNKIYQALKKFNEFLYRKENFIDYKLQPGEIIAFDNNRVMHGRSAYTV 442 Query: 299 ---PHPDLRRELM 308 D R L+ Sbjct: 443 KYVDGEDHSRLLI 455 >UniRef50_Q1GKN1 Gamma-butyrobetaine2-oxoglutarate dioxygenase n=7 Tax=Proteobacteria RepID=Q1GKN1_SILST Length = 402 Score = 78.6 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 62/186 (33%), Gaps = 17/186 (9%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 + F VK+ N + L + LH D T E + + + + GG+ L Sbjct: 201 FGVTFEVKSKPNPN-NLAYTPIALPLHTDLTNQELPPGFQFLHCLANEARGGGS---LFC 256 Query: 194 DDWEHLDNYFRHPLARRPMRFAAPPSKNVSK-----DVFHPVFDVDQQGRPV-MRYIDQF 247 D + ++ R + V +D+ GR + + + Sbjct: 257 DGYAIAEDLRRDDPESFELLSTVSVPFRFHDQDTDIRNRKKVITLDEDGRVIEICFNAHL 316 Query: 248 VQPKDFEEG------VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 D E + ++ ++++ + G+ ++ +N LHGR+ F P Sbjct: 317 ADIFDLEPALMQRYYRAYRKFMILTRSTNYLVTLKLKGGEMVVFDNRRVLHGREAFDPQT 376 Query: 302 DLRREL 307 R L Sbjct: 377 G-YRHL 381 >UniRef50_A2R6Y2 Contig An16c0060, complete genome n=4 Tax=Trichocomaceae RepID=A2R6Y2_ASPNC Length = 476 Score = 78.6 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 38/153 (24%), Positives = 60/153 (39%), Gaps = 9/153 (5%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRP-MRFAAP 217 H D + +E +LM + Q+ GG SLL +D L+ +H + Sbjct: 88 FHTDRSGWDEPPR-ILMSTLRSQSESGGESLL--VDGQSVLNALKKHDEDLYNLFTSSKH 144 Query: 218 PSKNVSKDVFHPVFDVDQQ-GRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVP 276 S F P VD+ G R+ D G +L D I + V Sbjct: 145 TSFRADDGTFVPRAMVDKDTGIFRFRFDDGIQMSASMVVGFA--KLQDIIY--QHAYFVT 200 Query: 277 VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 + G+ +++N +LHGR FT +L R L++ Sbjct: 201 LRPGQGYVLDNHRYLHGRASFTGSRELLRVLVK 233 >UniRef50_C5E3H3 KLTH0H13574p n=3 Tax=Saccharomycetaceae RepID=C5E3H3_LACTC Length = 410 Score = 78.3 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 60/192 (31%), Gaps = 21/192 (10%) Query: 133 QYYARFVVKNV-DNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 +Y V D + + +H DG Y E L ++ +GG + ++ Sbjct: 191 THYDLGVWDFTSDLAKKDTAYTSLAINMHTDGNYWNEPPGLQLFHLLEHSRGKGGETRIV 250 Query: 192 HLDD-WEHLD---------NYFRHPLARRPMRFAAPPSKNVSK-DVFHPVFDVDQQ-GRP 239 + E L L +P+ F + PV V ++ Sbjct: 251 DVSKVLEVLVSLAKEDESWRCTLDVLTTQPLSFHQAGEEGSFYIQDQFPVLTVSKELELL 310 Query: 240 VMRYIDQFVQPKDFEEGV--------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWL 291 R+ + PK L + + K + + G+ L+ +N L Sbjct: 311 QCRWNNSDRSPKIPLPCKFPMGQVYEALFRFNSLVNDPKYYVQFQLKPGQILVFDNWRVL 370 Query: 292 HGRDRFTPHPDL 303 H R+ FT + L Sbjct: 371 HARNAFTGYRRL 382 >UniRef50_A4SJT0 Pyoverdine biosynthesis protein n=9 Tax=Gammaproteobacteria RepID=A4SJT0_AERS4 Length = 291 Score = 77.9 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 84/243 (34%), Gaps = 33/243 (13%) Query: 87 LLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVK-NVDN 145 L R G L++ G + + A ++ + A VK + + Sbjct: 44 ELARRHGVLILRGFG-SGFVDPERLTHYAERWGEIMMWP--------FGAVLDVKEHENA 94 Query: 146 SDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRH 205 +D + + LH DG Y + ++ L + G + ++ + L + Sbjct: 95 TDHIFDSSY--VPLHWDGMYKPTVPEFQLFHCVHAPAADEGGRTIF-INTRQLLVDLDGE 151 Query: 206 PLAR-RPMRFAAPPSKNVSKDVF--------HPVFDV-----DQQGRPVMRYIDQFVQPK 251 LAR +R + V HPV ++ R R+++Q Sbjct: 152 RLARWERVRITYRIKQVVHYGGEVSSPLLVPHPVSGETVMRYNEPPREGGRFLNQHALQI 211 Query: 252 D----FEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 + E+G +L +L + + + + G ++ +N LHGR+ FT R + Sbjct: 212 EGIPPEEQGAFLQDLHERLYDPRYFYAHQWQPGDVVIADNFGLLHGREGFTARSA--RHI 269 Query: 308 MRQ 310 R Sbjct: 270 QRV 272 >UniRef50_Q05582 Clavaminate synthase 2 n=7 Tax=Streptomyces RepID=CAS2_STRCL Length = 325 Score = 77.5 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 84/266 (31%), Gaps = 31/266 (11%) Query: 68 AKILDDLCANQLQPLLLKTLL-NRAEGALLINAVGVDD--VKQADEMVKLATAVAHLIGR 124 AK L L L +G LL+ + VDD + + L+ Sbjct: 37 AKTLAARLPEGLAAALDTFNAVGSEDGYLLLRGLPVDDSELPETPTSTPAPLDRKRLVME 96 Query: 125 SNFDAMSGQYYARFVVKNVDNSDSYLR-------------QPHRVMELHNDGTYVEEITD 171 + + + + + Y ++E H + Y + Sbjct: 97 AMRALAGRRLGLHTGYQELRSGTVYHDVYPSPGAHYLSSETSETLLEFHTEMAYHILQPN 156 Query: 172 YVLMMKIDEQNMQGGNSLLLHL-DDWEHLDNYFRHPLARRP--------MRFAAPPSKNV 222 YV++ + +L+ + LD R L R R + Sbjct: 157 YVMLACSRADHENRAETLVGSVRKALPLLDEKTRARLFDRKVPCCVDVAFRGGVDDPGAI 216 Query: 223 SKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKF 282 + P++ P + Y + + P+D + ++ LS A++ + V + G Sbjct: 217 A--NVKPLY--GDANDPFLGYDRELLAPEDPADKEAVAHLSQALDDV--TVGVKLVPGDV 270 Query: 283 LLINNLFWLHGRDRFTPHPDLRRELM 308 L+I+N H R F+P D + + Sbjct: 271 LIIDNFRTTHARTPFSPRWDGKDRWL 296 >UniRef50_Q7QKQ0 AGAP012477-PA (Fragment) n=4 Tax=Diptera RepID=Q7QKQ0_ANOGA Length = 416 Score = 77.5 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 59/192 (30%), Gaps = 18/192 (9%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQ-NMQGGNSLLL 191 +Y + S + +++H D Y + L+ + + N GG +L+ Sbjct: 200 THYGEEFIVKAKEGTSNVAYLSTPLQMHTDLPYYDYKPGCNLLHCLVQSSNPTGGENLIA 259 Query: 192 H----LDDWEHLDNYFRHPLARRPMRF---AAPPSKNVSKDVFHPVFDVDQQG------R 238 + H L+ + + A + PV + + G Sbjct: 260 DGFYVAEQLRHHHPDDFRLLSETLVDWSDLGADEAGTFHSIYRAPVICIGRDGRLERINH 319 Query: 239 PVMRYIDQFVQPKDFEEG--VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDR 296 V + F P D E + + +S G L +N+ +HGR Sbjct: 320 SVPQRDSHFSVPLDRVEPWYRAMQRFVTILHREA--VSFKTAPGDILTFSNVRMVHGRTG 377 Query: 297 FTPHPDLRRELM 308 +T R ++ Sbjct: 378 YTDTAGNTRHIV 389 >UniRef50_Q2GN61 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GN61_CHAGB Length = 1130 Score = 77.1 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 42/281 (14%), Positives = 81/281 (28%), Gaps = 34/281 (12%) Query: 46 LEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDV 105 L+ E + E +L+ + R G ++ + D+ Sbjct: 816 LQHFNELGLYGNEVSP------TTFPLPTLGPKLRQISADVHRGR--GFAVVRGLKPDEF 867 Query: 106 KQADEMVKLATAVAHL---IGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHND 162 D ++ ++ GR + + + + D R R H D Sbjct: 868 SPEDNVLVFLGISCYVGVKRGRQDEEGNMLMHIRDAKLSKTPQQDRPTRYSSRASTFHTD 927 Query: 163 GTYVEEITDYVLMMKIDEQNMQGGNSLLLHL----DDWEHLDNYFRHPLARRPMRFAAPP 218 D + + I GG ++L + + R LA+ ++ Sbjct: 928 -----TFCDILALQ-IRNNASSGGKNMLASSWTIYNTLMRTHPHLRELLAQP--IWSFDS 979 Query: 219 SKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKD-FEEGVWLSELSDAIE---------T 268 + P+ GR ++ + + + D L L DA Sbjct: 980 RGKLLPSSTRPLLYHHA-GRVLLNFAREPLLGLDGVRRAAGLPALDDAQRRALDVVEEIA 1038 Query: 269 SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 + + + G L +NN LH R+ F R L+R Sbjct: 1039 KRSQIVLDARPGDVLFVNNHAVLHSREAFEDDAASPRYLVR 1079 >UniRef50_B6EK40 Putative uncharacterized protein n=1 Tax=Aliivibrio salmonicida LFI1238 RepID=B6EK40_ALISL Length = 340 Score = 77.1 bits (188), Expect = 8e-13, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 59/188 (31%), Gaps = 13/188 (6%) Query: 132 GQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEI----------TDYVLMMKIDEQ 181 G + V +D H D + +++ +M + Sbjct: 130 GLLFRHVVPSLKGRNDKSSHGSKHTFGHHVDNPDLPLTNEKITDKSGCPEFLSLMSLRSD 189 Query: 182 NMQGGNSLLLHLDD-WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 S + +DD L N L++ + P S S P+ D G Sbjct: 190 LKV--KSNFILVDDVLNQLSNGVIEQLSKPHFEISRPDSFKQSVKTILPLISFDNDGVAY 247 Query: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 RY + P E L ++ ++ ++ G FL+I N +H R+ F+P Sbjct: 248 CRYDKENTTPLTTEAAAALVMWEAQLKNTELNNAITYQPGDFLIIKNQRLMHSREGFSPR 307 Query: 301 PDLRRELM 308 D + Sbjct: 308 DDGTDRWL 315 >UniRef50_Q4WLY7 Putative uncharacterized protein n=5 Tax=Trichocomaceae RepID=Q4WLY7_ASPFU Length = 397 Score = 77.1 bits (188), Expect = 8e-13, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 69/238 (28%), Gaps = 23/238 (9%) Query: 90 RAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSG-----QYYARFVVKNVD 144 R G + + L A + G Y V+ Sbjct: 158 REYGIIAVEL----GFSDPKSQFMLEVVEAMGCSPDTHSSTQGALWDVTYRPEGVISKKT 213 Query: 145 NSDSY-LRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNY 202 + + H DG + + Y + + + GG +L +DD + L Sbjct: 214 GGNVVSISHSLGEFAWHTDGCFEVKPQRYFGLHILHPDKLGGGIFRVLAVDDLIKLLSPA 273 Query: 203 FRHPLARRPMRFAAPPSKNVS-KDVFHPVFDVDQQ-GRPVMRYIDQFV------QPKDFE 254 L PP H + ++ GR MR+ + P Sbjct: 274 SIETLLNYEFELQVPPEFYKGAATTRHKLLSIEPNTGRYHMRFRRDILADPPSDDPAANA 333 Query: 255 EGVWLSELSDAIETSKGILSVPV-PVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 L+ + D +T S V LL++N +LH R + D RR L R R Sbjct: 334 AVAELNAILDKQDTVGKSFSEDVFKENVILLMDNARFLHCRTQI---KDPRRFLRRIR 388 >UniRef50_UPI0001B5617D Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B5617D Length = 256 Score = 77.1 bits (188), Expect = 8e-13, Method: Composition-based stats. Identities = 36/160 (22%), Positives = 57/160 (35%), Gaps = 6/160 (3%) Query: 153 PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPM 212 ++ + H D + V+M + G L+ + L P A + Sbjct: 86 TNQALAPHTDCSDKARPPQLVVMACACPASSGGACVLVDGQAVYSDLA--TTDPEALAGL 143 Query: 213 RFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYI-DQFVQPKDFEEGVWLSELSDAIETSKG 271 +S VF+ G +R D+ Q E WL L AIE + Sbjct: 144 SAPRGAYFGLSAGYVGNVFETGPDGTVGLRLRLDKHAQ-FSPETKRWLPALRAAIE--RH 200 Query: 272 ILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 ++ + G +NN WLHGRD F+ + R L+ R Sbjct: 201 TITFSLEAGAGYAVNNRRWLHGRDEFSGLRLMYRALVEPR 240 >UniRef50_B6QK04 Gamma-butyrobetaine hydroxylase subfamily, putative n=2 Tax=Trichocomaceae RepID=B6QK04_PENMQ Length = 479 Score = 77.1 bits (188), Expect = 8e-13, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 60/196 (30%), Gaps = 23/196 (11%) Query: 130 MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL 189 + +Y D + + + H D YV Y L+ + + +GG SL Sbjct: 244 LRNTFYGSTWDVRNDPKAKNVAYTNLNLGFHMDLLYVHNPPGYQLLHCLR-NSCEGGESL 302 Query: 190 LLHL-DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV-------M 241 + LD L+ + + K PV ++ G + Sbjct: 303 FVDAFGAARDLDRKSWETLSAHNVPYHY-NHKENYYRTTRPVLEIPDNGNSANDRTLTHV 361 Query: 242 RYIDQFVQPKDFEE----------GVWLSELSDA---IETSKGILSVPVPVGKFLLINNL 288 Y F P + +LS + + IE + I + + G+ ++ N Sbjct: 362 NYSPPFQGPYHIKSSESPEWPVKMKDYLSAIREFEKRIEDPQRIFELKLEPGQCVIFENR 421 Query: 289 FWLHGRDRFTPHPDLR 304 LH R F R Sbjct: 422 RVLHARRAFDTSSGER 437 >UniRef50_A7S7D2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S7D2_NEMVE Length = 432 Score = 76.7 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 42/263 (15%), Positives = 81/263 (30%), Gaps = 37/263 (14%) Query: 63 LRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLI 122 +RF ++ + L + G LI + + +LAT V ++ Sbjct: 162 VRFNYEDVM-----TKKSALFEWLHTLHSVGIALIEEAP--SGMKPIAVERLATRVGYI- 213 Query: 123 GRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQN 182 +Y N + L + LH D E ++ + + Sbjct: 214 --------KDTHYGHTFDVNAKFDANNLAYTTADLPLHCDIPQSEYYPGVQMLHCLQQAP 265 Query: 183 MQGGNSLLLHLDDWEHLDNY-FRHP-----LARRPMRF---AAPPSKNVSKDVFHPVFDV 233 +GG S+ +D + +HP LA P+ + + ++ Sbjct: 266 TEGGESIF--VDGFFIAQEIKEQHPRLFNLLATTPIPYVDIGKDEFGDFHLKNKRESIEL 323 Query: 234 DQ---------QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLL 284 D+ ++D V+ +L L + +L + G+ + Sbjct: 324 DELGHIVRFTYNNHVRDYFMDSPVEKVQLLYQAYL-ILGQMMRDPVNMLEYKLSPGEVVS 382 Query: 285 INNLFWLHGRDRFTPHPDLRREL 307 NN LHGR +T + R L Sbjct: 383 FNNSRVLHGRRGYTITGEGNRHL 405 >UniRef50_C7N0A3 Taurine catabolism dioxygenase TauD, TfdA family n=2 Tax=Actinomycetales RepID=C7N0A3_SACVD Length = 327 Score = 76.7 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 47/260 (18%), Positives = 89/260 (34%), Gaps = 31/260 (11%) Query: 74 LCANQLQPLLLKTLLNRAEGALLINAVGVDD--------VKQADEMVKLAT-----AVAH 120 L A+ ++ L +G LL+ + + + LA+ V Sbjct: 48 LPADLVRRLTDFRDNGNEDGYLLLRGLPREADLPETPNSTPAPVDRPLLASEAWLALVGR 107 Query: 121 LIGRSN--FDAMSGQYYARFVVKNVDNSDSYL--RQPHRVMELHNDGTYVEEITDYVLMM 176 ++G + G Y YL ++E H + Y + YV++ Sbjct: 108 VLGLPTGYHELRFGTVYHDIYPSP---GAHYLSSETSETLLEFHTEMAYHQHQPQYVMLA 164 Query: 177 KIDEQNMQGGNSLLLHLDD-WEHLDNYFRHPLARRPMRFAAPPSK--NVSKDVFHP---- 229 + +L+ + + +D + L RP+ S + + P Sbjct: 165 CSRSDHENKAATLVASIRRAIQLIDEKTKSRLMDRPIPCNVDVSFRGDDPELKKGPPARV 224 Query: 230 -VFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNL 288 V D + P++ Y + + P + E+ LS LS A++ V + G L+++N Sbjct: 225 CVLSGDPE-DPMLGYDRELLAPDNAEDEQALSVLSKALDEV--TKPVKLSPGDLLIVDNY 281 Query: 289 FWLHGRDRFTPHPDLRRELM 308 H R F P D R + Sbjct: 282 RTTHARTPFKPRWDGRDRWL 301 >UniRef50_C4NCK2 Dioxygenase n=1 Tax=Streptomyces sp. MK730-62F2 RepID=C4NCK2_9ACTO Length = 274 Score = 76.7 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 65/201 (32%), Gaps = 13/201 (6%) Query: 122 IGRSNFDAMSGQYYARFVVKNVDNSDSY--LRQPHRVMELHNDGTYVEEITDYVLMMKID 179 +G + V + DS + H DG + T ++ Sbjct: 63 LGEPYVPLLYRGRDTPIVTEVTRKGDSDHPVFHTGEAQGWHTDGLLEDIGTIKTTLLYCV 122 Query: 180 EQNMQGGNSLLLHL----DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQ 235 +GG + LL+ ++ D L R + V+++ PVF Sbjct: 123 SPAYRGGRTFLLNAGRVFEELRTEDPEAADVLMRDTILGRRSTIPGVNREAVGPVFAELG 182 Query: 236 QGRPVMRYIDQFVQ---PKDFEEGVWLSELSDAI----ETSKGILSVPVPVGKFLLINNL 288 G RY + V+ P D + L + + + + G+ L+ N Sbjct: 183 DGHYATRYGEGRVERWYPGDAAQRRALDRALRHFRMRRDDPDVRIDLLLQAGQCLIFRND 242 Query: 289 FWLHGRDRFTPHPDLRRELMR 309 HGR+ FT P R LMR Sbjct: 243 VLAHGRENFTDDPQSPRLLMR 263 >UniRef50_Q21526 Protein M05D6.7, partially confirmed by transcript evidence n=3 Tax=Caenorhabditis RepID=Q21526_CAEEL Length = 409 Score = 76.3 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 38/251 (15%) Query: 94 ALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQP 153 +I GV+ +A E + + D GQ++ F ++ +Y Sbjct: 159 YGVIIVDGVEGTSEATEKLCQSLV-------PVHDTFFGQFWV-FSNSATNDEPAYEDTA 210 Query: 154 HRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLL--------LHLDDWEHLDNYF 203 + E+ H DGTY ++ + + G L+ L + E + Sbjct: 211 YGSDEIGPHTDGTYFDQTPGIQVFHCLTPAKTGGDTVLVDSFYCAEKLRNESPEDFEILC 270 Query: 204 RHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR-PVMRY----IDQFVQPKDFEEGVW 258 ++ + + P S S + PV + + G +R+ F E Sbjct: 271 NTKISHHYLEGSPPGSSIHSVSLEKPVIERNSFGNITQIRFNPYDRAPFSCLNSSEASAA 330 Query: 259 --------LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQ 310 + S + + + G + I+N LH R F R++ Sbjct: 331 ETIKFYEAYEKFSKICHNPDNSIEISLRPGSVIFIDNFRILHSRTSFQG----YRQMC-- 384 Query: 311 RGYFAYASNHY 321 G + N Sbjct: 385 -GCYLSRDNFM 394 >UniRef50_Q2UCW9 Predicted gamma-butyrobetaine n=3 Tax=Aspergillus RepID=Q2UCW9_ASPOR Length = 475 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 57/192 (29%), Gaps = 31/192 (16%) Query: 130 MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL 189 + +Y + + + H D Y+ E Y L+ + + + GG SL Sbjct: 239 LRNTFYGSTWDVRTVPEAKNVAYTSQFLGFHMDLMYMNEPPGYQLLHCL-QNSCDGGESL 297 Query: 190 L---------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 L +DD E L+ +N PVF Sbjct: 298 FADSFAVARQLSIDDPEAFKALCNLRLSYEY------NHENDIYTNDWPVFQTYVDEYTQ 351 Query: 241 M------RYIDQFVQP---------KDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLI 285 Y F P + E L + + +E K I + + G+ ++ Sbjct: 352 QQRLMHANYSPPFQAPMHGQRRPFNRTMSEMRALDKFAKMLEDEKYIYELKLNPGECVIF 411 Query: 286 NNLFWLHGRDRF 297 N LH R +F Sbjct: 412 ENRRVLHARRQF 423 >UniRef50_UPI0000587DDD PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000587DDD Length = 395 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 60/179 (33%), Gaps = 23/179 (12%) Query: 150 LRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFR----- 204 L + + LH D ++ I + + +GG++ +D + + Sbjct: 192 LAYTAKQLGLHVDLPGFYNTPGVQMLHCIKQVDSEGGDNEF--VDGLRVAEQLEQEYPKI 249 Query: 205 -HPLARRPMRFAAPPSKNVSKDV--FHPVFDVDQQGRP-VMRYIDQFVQP------KDFE 254 L R + F ++ V PV + DQ G + Y D P ++ Sbjct: 250 LQTLTRMKVDFRTLGAEYVPYHTMTQRPVIEYDQDGVFQGINYNDGVRAPYWSLPVEEIT 309 Query: 255 EG-VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF-----TPHPDLRREL 307 EG L A+ + + + G ++ +N LHGR F +R L Sbjct: 310 EGYRALKTFHRAMYDERNCIYYKMEKGDMVIFDNRRVLHGRLGFQIRVQEGEESKKRHL 368 >UniRef50_B3SAJ9 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3SAJ9_TRIAD Length = 455 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 53/168 (31%), Gaps = 18/168 (10%) Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQ-GGNSLLLH---------LDDWEHLDNYFRH 205 + H D Y E ++ + GG SLLL L E Sbjct: 254 PLGYHMDLMYYESPPGIQVLHCVRFDECVTGGESLLLDSFSVAEELRLQSPEDFRILSTI 313 Query: 206 PLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGV------WL 259 P + + F ++ H V + DQ+ + + F P +E Sbjct: 314 PATFQKIHFEREFPVSMRYQRPHIVLNHDQE-VVAVNWSPSFEGPLCVDESYVDSYYKAY 372 Query: 260 SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 + S I+ S + + G L+ NN LH R F + R L Sbjct: 373 RKFSSIIDKSNHEVQYRLRPGDMLIFNNRRILHARKGFALN-GGSRHL 419 >UniRef50_A4TYV3 PA0187 n=5 Tax=Proteobacteria RepID=A4TYV3_9PROT Length = 315 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 40/172 (23%), Positives = 60/172 (34%), Gaps = 13/172 (7%) Query: 149 YLRQPHRVMELHNDGTYVEEITDY--VLMMKIDEQNMQGGNSLLLHLD----DWEHLDNY 202 Y+ R + H+DG Y + +++ + GG + L+ + D Sbjct: 140 YIPYTERPIAWHSDGYYNPQHARVRGLILHCVRP-AAAGGENRLMDPELLYIALRERDPA 198 Query: 203 FRHPLARRPMR--FAAPPSKNVSKDVFHPVFDVDQQGRPVMRYI-DQFVQPKDFEEGVWL 259 L R + PVF V + GR MRY E Sbjct: 199 LIAALMRPDAMTIPGNEDEGMTRPAMTGPVFFV-EDGRLCMRYTARTRSIEWHPEAAAAA 257 Query: 260 SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + D +E+ + + + G LL NN LH R+RFT P R L R R Sbjct: 258 QVIRDILESGAELFTGLLQPGWGLLCNN--VLHTRERFTDDPAQPRLLYRAR 307 >UniRef50_Q643C1 Predicted non-heme iron hydroxylase MppO n=1 Tax=Streptomyces hygroscopicus RepID=Q643C1_STRHY Length = 341 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 48/306 (15%), Positives = 96/306 (31%), Gaps = 64/306 (20%) Query: 53 PVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDD-------- 104 V++ E+++ R ++ + + L + G L++ + VDD Sbjct: 27 SVESTEFQAESRLYADELPRRV-----RRALHEYRSTEKSGILVVTGLPVDDSALGATPA 81 Query: 105 ----VKQADEMVKLATA---VAHLIGRS-----NFDA--MSGQYYARFVVKNVDNSDSYL 150 ++ A +A+L+G D M Y + S Sbjct: 82 DRRHKPVPSTSLRQDIAFYLIANLLGDPIGWATQQDGFIMHDVYPVQGFEHEQIGWGS-- 139 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARR 210 + H + + TDY+ +M + N G + + D E +D+ R L++ Sbjct: 140 ---EETLTWHTEDAFHPLRTDYLGLMCLR--NPDGVETTACDIADVE-IDDETRETLSQE 193 Query: 211 PMRF----------------AAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQ-----FVQ 249 R +A S + ++ + + D+ + Sbjct: 194 RFRILPDDAHRIHGKAPGDESARESALRERSRQRVASALESPDPVAVLFGDRDDPYLRID 253 Query: 250 P------KDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDL 303 P + E L + AI+ + V + G + I+N +HGR F D Sbjct: 254 PHYMQGVQGETEQRALETIGAAIDD--AMSGVVLSPGDIVFIDNYRVVHGRKPFRARFDG 311 Query: 304 RRELMR 309 +R Sbjct: 312 TDRWLR 317 >UniRef50_B3SDF1 Putative uncharacterized protein n=3 Tax=Trichoplax adhaerens RepID=B3SDF1_TRIAD Length = 438 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 64/195 (32%), Gaps = 19/195 (9%) Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD 194 Y R S L + + LH D Y + + + + + GG + + Sbjct: 222 YGRTFTVKSKLDPSNLAFTDKGLALHTDLPYFDLVPGIQFLHCVQQAAGGGGENQFVDGF 281 Query: 195 DWEHL----DNYFRHPLARRPMRFAAP--PSKNVSKDVFHPVFDVDQQGRPVMRYID--- 245 + + + Y+ L + F ++ HP+ ++ + V+R++ Sbjct: 282 NVAEVLRKENPYYFDLLTQHAFVFFDIGNDYVEYNQLNHHPLITLNSRNE-VIRFVHNNH 340 Query: 246 ------QFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 + K + + + +++ + G+ ++ +N LHGR F Sbjct: 341 ARSSILDVPEDKVTDIYRAFRAIVHIMYDQSNLVTNKMESGEMVVFDNWRVLHGRSPFQL 400 Query: 300 HPDLRRELMRQRGYF 314 R L G F Sbjct: 401 TAGGNRHL---EGCF 412 >UniRef50_Q097J3 Gamma-butyrobetaine dioxygenase n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q097J3_STIAU Length = 357 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 61/181 (33%), Gaps = 12/181 (6%) Query: 133 QYYARFVVKNVDNSDS----YLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 ++ R DN+ + L ++LH D +++ Y L+ G N Sbjct: 166 THFGRIEDLRTDNTTNRNTDQLGYTDSAVQLHTDQPFLDRPPRYQLLHSQRPAETGGANF 225 Query: 189 LLLHLDDWEHLDNYFRHP--LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRY--- 243 ++ L +L R L R K+ + + P+ D D G +RY Sbjct: 226 VVDGLAAARYLSGLDRPAFELLRTVPVTFHRKQKSFERVLVSPILDFDAPGGFRIRYSYF 285 Query: 244 -IDQFVQPKDFEEG--VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 + +P E + + + + + G FL+ +N LH R FT Sbjct: 286 TLAPHQRPFAEMEAWYRAYNRFAKLVRDERHQYRFLLQTGDFLIYDNWRMLHARTSFTGA 345 Query: 301 P 301 Sbjct: 346 R 346 >UniRef50_Q2JKI0 Clavaminate synthase n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JKI0_SYNJB Length = 351 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 65/206 (31%), Gaps = 14/206 (6%) Query: 111 MVKLATAVAHL---IGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVE 167 + + + + HL I N D + + S ++ H + + Sbjct: 107 LAMIGSRLGHLVSYIQEKNGDLFQNLVPTPDSEEVQSSEGS-----RTRLQFHRETVFHP 161 Query: 168 EITDYVLMMKIDEQNMQGGNSLLLHL-DDWEHLDNYFRHPLARRPMRFAAPPSK-NVSKD 225 +++L+ + + + + + L R L R S N Sbjct: 162 HSPEFLLLFCLRPDHDRVAETTYASIRHVLPLLGERDRELLFEPLYRTGIDYSFGNRQAL 221 Query: 226 VFHPVFDV--DQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFL 283 + P+ V ++ P Y + + E + L L I + + G L Sbjct: 222 INGPILPVLYGRREDPFWNYDEDLMVGLTPEASLALEALRKGIHAVYR--GIKLETGDLL 279 Query: 284 LINNLFWLHGRDRFTPHPDLRRELMR 309 I+N +HGR F+P D ++ Sbjct: 280 CIDNRRTVHGRTAFSPRYDGFDRWIQ 305 >UniRef50_A3YI34 Gamma-butyrobetaine hydroxylase n=1 Tax=Marinomonas sp. MED121 RepID=A3YI34_9GAMM Length = 372 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 62/180 (34%), Gaps = 18/180 (10%) Query: 138 FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE 197 F V NSD + + H D Y I L+ I + QGG S L +D Sbjct: 171 FSVYTRPNSDDLAYRSVALGP-HTDNPYRNPIPGIQLLHCI-QNETQGGLSTL--VDSLS 226 Query: 198 HLDNYFRHP------LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR-------PVMRYI 244 + + L+R P+R+ K++ + +D G+ P + ++ Sbjct: 227 VVSQLKQEDPEGFDLLSRVPVRYRHLD-KSICLSERRTMIQLDINGQVEGVAYSPRLDFL 285 Query: 245 DQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 Q L + K + G+ + +N LHGR F P+ LR Sbjct: 286 PLLKQDDLIVFHRARKRLGQLLSDPKFEWRFKLAPGQLQMFHNSRVLHGRTEFDPNEGLR 345 >UniRef50_B2VF28 Clavaminate synthase 1 n=1 Tax=Erwinia tasmaniensis RepID=B2VF28_ERWT9 Length = 327 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 53/154 (34%), Gaps = 6/154 (3%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFAAP 217 H + + E DY++++ I + + + +L + L+ + Sbjct: 148 FHTEIAFHHEKPDYIILLCIRPDHDKKAKTFTSSTRRIKSYLSDGDISVLSEERFKTGVD 207 Query: 218 PSKNV---SKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILS 274 S K V + Y ++ D E L +L+ A +K +S Sbjct: 208 YSFGSKHGEKGNGRIVSVFKINNHDHICYDLDLMEGIDNEANEVLKKLAKAANHAKCYVS 267 Query: 275 VPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 + G +I+N +HGR FTP D + Sbjct: 268 --LESGDLFIIDNNRAIHGRTSFTPRYDGFDRWL 299 >UniRef50_C6XP42 Taurine catabolism dioxygenase TauD/TfdA n=2 Tax=Alphaproteobacteria RepID=C6XP42_HIRBI Length = 359 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 47/301 (15%), Positives = 98/301 (32%), Gaps = 43/301 (14%) Query: 40 QTTKQFLEQVAEWPVQALEYKSFLRFRVAK-ILDDLCANQLQPLLLKTLLNRAEGALLIN 98 Q + ++++ + + LRF A+ CA + + G + Sbjct: 30 QLSDAQIDELEALGSKFVNDDPDLRFVTAEEYPLKQCA----DAINAWGHDVDYGRGFVL 85 Query: 99 AVGVDDVKQADEMVKLATAVAHLIGRSNFDAMS--------GQYYARFVVKNVDNSDSYL 150 G+ +D L+ A+ +++G D + YA K +D+ + Sbjct: 86 VRGLRTHLYSD---ALSAAIYYILGLHMGDPIRQNELGDVIDNIYATSD-KTMDDPTALS 141 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH--------LDDWEHLDNY 202 + + H+D + D V +M + GG S L+ L L Sbjct: 142 SKVKDELTYHSDSS------DIVALMCLRP-AKDGGKSCLVSGAEIYNEILKRRPDLAPL 194 Query: 203 FRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQ------QGRPVMRYIDQFVQ-P-KDFE 254 P + K + PV +++ G + ++ + P E Sbjct: 195 LLEPFHWD---WRRQDPKAPANSYSSPVISLEEGVFSMYAGSLYVLTAQEYPEVPRLTPE 251 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYF 314 + L+ + + + G ++N LH R RF +P+ +R+ R + Sbjct: 252 QIEVLNLFDEITYEPGMAIEMDFRPGDIQWLSNYAALHSRTRFEDYPEPQRKRHLLRLWL 311 Query: 315 A 315 + Sbjct: 312 S 312 >UniRef50_B9WJZ0 Uncharacterized oxidoreductase (Gamma-butyrobetaine hydroxylase, putative) n=8 Tax=Saccharomycetales RepID=B9WJZ0_CANDC Length = 412 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 67/204 (32%), Gaps = 22/204 (10%) Query: 125 SNFDAMSGQYYA-RFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNM 183 F + +Y F VKN + + + + LH D Y E L+ I + + Sbjct: 188 DKFGYIKKTFYGTLFDVKNKKEKATNIAYTNTFLPLHMDLLYYESPPGLQLLHAI-QNST 246 Query: 184 QGGNSLL----LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRP 239 GG ++ L + D L + P+ F + N P+ D++ P Sbjct: 247 LGGENIFCDSFLAAEHIRKSDPKAYTALTQIPITFHY-DNNNEYYYYKRPLIIEDKEAPP 305 Query: 240 VM------RYIDQFVQPKDFEEGVW---------LSELSDAIETSKGILSVPVPVGKFLL 284 Y F P + + + + I + +P G ++ Sbjct: 306 AFVPISAINYAPPFQGPFEIDPAQYSMFDDFVRGMRLFESFINDPANHFEIKMPEGTCVI 365 Query: 285 INNLFWLHGRDRFTPHPDLRRELM 308 N LH R+ F+ + R LM Sbjct: 366 FENRRALHSRNAFSDSNNGDRWLM 389 >UniRef50_C1E2T4 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E2T4_9CHLO Length = 453 Score = 74.8 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 79/254 (31%), Gaps = 36/254 (14%) Query: 65 FRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHL--- 121 F + L A L GA++I V++V + D HL Sbjct: 109 FALGPECLPLVAEMAAELEDG------TGAVMIRNFPVENVPEEDVGALYVGFCTHLGVP 162 Query: 122 -----IGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMM 176 G + G Y R + N+ +Q + LH D D + ++ Sbjct: 163 RWQSSAGLRSRSRGYGVYLGRVRAEMKGNTPEAGKQSNNYFRLHTD------RCDVISLL 216 Query: 177 KIDEQNMQGGNSLLLHLDDW--EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVD 234 I +GG S + E L+ Y L + + V P+ D+ Sbjct: 217 GIRA-AAKGGASRVASAVTIYNEMLEKYPT--LVPKLFNPVERIWEGKDGKVALPLMDIT 273 Query: 235 QQGRPVMRYIDQFVQ---------PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLI 285 ++G+ + +++ P D + + + + K + + G + Sbjct: 274 KEGKFTSQISPSYIECAQLLPGSCPLDADTVEAIDLVEEI--GLKHCVEFVMQPGCVYWL 331 Query: 286 NNLFWLHGRDRFTP 299 NN HGR + Sbjct: 332 NNHQVYHGRTAWAD 345 >UniRef50_D2SNE3 Putative uncharacterized protein n=1 Tax=Streptomyces fradiae RepID=D2SNE3_STRFR Length = 419 Score = 74.8 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 39/176 (22%), Positives = 69/176 (39%), Gaps = 15/176 (8%) Query: 144 DNSDSYLRQPHRVMELHNDG-TYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNY 202 D D+Y Q + + H++G + Y+ + + + +GG + +L D + LD + Sbjct: 244 DMDDTYHAQNRKSLLPHSEGYEFRGVPPRYLGLWCVTPASGEGGETTML--DGNQILDEF 301 Query: 203 F---RHPLARRPMRFAAPP---SKNVSKDVFHPVFDVDQQGRPVMRYI-DQFVQPKDFEE 255 R L + + + V V HPV ++ G V R+ + + P+ E Sbjct: 302 TEEERQRLFDTTYEWKSTDGLSRRGVDFRVEHPVL-ENRNGGRVFRFSYNNMIVPEGDEL 360 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 L E I I +V ++ +N LH R+ F D R L R + Sbjct: 361 ATRLRERGKEIYDENHI-AVSYEQRDLIVWDNWRMLHSRNAFE---DPSRHLKRVQ 412 >UniRef50_B7V2A4 Putative uncharacterized protein n=5 Tax=Pseudomonas aeruginosa RepID=B7V2A4_PSEA8 Length = 271 Score = 74.8 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 60/175 (34%), Gaps = 29/175 (16%) Query: 149 YLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLA 208 YL +LH DG +++ + + +GG +LL LA Sbjct: 91 YLGSTPLEHKLHTDGAFLDTPEQLCSLQCVRNAR-EGGETLLASAG------------LA 137 Query: 209 RRPMRFAAPPSK------------NVSKDVFHPVFDVDQQG-RPVMRYIDQFVQPKD-FE 254 +R P + PVF ++ + R D + + Sbjct: 138 FERLRRRMPTKHLGLLRGDALTIVRKHQSSTQPVFRLNGEALGIKFRQNDGAAEVVEHPV 197 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 +EL A+E L + + G+ L+++N LHGR F + RE+ R Sbjct: 198 AVEAFAELVAALEDPACQLRIKLEPGEILVLDNTAVLHGRTAFCANE--LREMRR 250 >UniRef50_Q1R056 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=2 Tax=Chromohalobacter salexigens RepID=Q1R056_CHRSD Length = 407 Score = 74.8 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 60/172 (34%), Gaps = 14/172 (8%) Query: 149 YLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL----LHLDDWEHLDNYFR 204 L R +E H D Y + I Y+ + + GG+S L + + Sbjct: 203 DLTMTQRGLEPHTDNPYRDPIPGYIWLHCLS-NAADGGDSTLTDGFMAAQRLKAEAPEDF 261 Query: 205 HPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR-PVMRYIDQFVQPKDFEEGV------ 257 L R RF + + P+ ++D +GR +RY ++ + + + Sbjct: 262 ACLTRLSPRFRYTDATTDLES-EGPLIELDSRGRLARVRYSNRTERIAAHDAALLERYYA 320 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 I + + + G L+++N LHGR + R L + Sbjct: 321 ARQRFYRLITDEALTVHLKLGPGDMLIMDNYRLLHGRTAYQL-EGGVRHLRQ 371 >UniRef50_B6BSV9 Gamma-butyrobetaine,2-oxoglutarate dioxygenase, putative n=2 Tax=Bacteria RepID=B6BSV9_9RICK Length = 367 Score = 74.8 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 52/303 (17%), Positives = 111/303 (36%), Gaps = 33/303 (10%) Query: 21 FTLTPSAQSPRLLELTFTEQTTKQFL--EQVAEWPVQALEYKSFLRFRVAKILDDLCANQ 78 ++ + + LE+ F + + + + E+ + K + + L D+ + Sbjct: 54 ISINKANINENFLEIDFNDGVSSKIEINKIAQEFSNEDTVIKPITKIKWDSTLKDIKNFR 113 Query: 79 LQPLL-------LKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMS 131 Q + G ++I + + + +VK A ++ + R+NF Sbjct: 114 YQDNFFESKEMHDLLVSFYKYGFVVIKNIPT----EDNFIVKFANSIGS-VRRTNF---- 164 Query: 132 GQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 G+Y F VK+ N + L + H D Y + L+ I + GG S L+ Sbjct: 165 GEY---FDVKSKPNPN-DLAYTSLALAPHTDNPYRNPVPCIQLLHCIIS-KVSGGLSTLV 219 Query: 192 H----LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQF 247 +D + + F L+ ++F + V +D + D + +R+ + Sbjct: 220 DGYTVTEDLKTENPDFYKILSEVKVKFKFIDKEVVLEDWSELIKLNDDKSLKQIRFSPRL 279 Query: 248 -VQPKDFEEGVWL-----SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 P +E + L +LS+ + K + + ++++N LHGR + Sbjct: 280 DFVPILEKEKLDLYYRARKKLSEMYNSDKYRIEFKLEEKDLMMMDNYRLLHGRTAYKTSE 339 Query: 302 DLR 304 R Sbjct: 340 GDR 342 >UniRef50_A4VRE5 Predicted non-heme iron hydroxylase MppO n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VRE5_PSEU5 Length = 293 Score = 74.4 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 98/290 (33%), Gaps = 31/290 (10%) Query: 54 VQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQ------ 107 +Q Y+ L VA+ + Q+ ++ L +G LL+ + + D+ Sbjct: 1 MQNDHYERLLANAVAQGVLR--HEQIGQIINFRLFGNKQGFLLLENLPIGDIPPTPVSRE 58 Query: 108 -----ADEMVKLATAVAHLIGRSNF------DAMSGQYYARFVVKNVDNSDSYLRQPHRV 156 D +L L+G + ++ + + SDS+ + Sbjct: 59 DIRKPDDSSERLLLQATALLGEPIGYTQESDGCIVNNFFPQQALSRAATSDSFDTE---- 114 Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD-WEHLDNYFRHPLARRPMRFA 215 ++LH + + + D+++++ + + + + ++ + L + P F Sbjct: 115 LDLHTENAFHAVLPDHLVLLCLRQDPAAEAVTYIASIERILQRLTFEEQAFFLTEPYNFL 174 Query: 216 A---PPSKN-VSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKG 271 + P SKN H P R+ F+Q L +L Sbjct: 175 SDYGPTSKNQRIDINRHQTVLYGDPDAPFFRFDPHFMQAFSNRAQQLLDKLRSIAWDV-- 232 Query: 272 ILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASNHY 321 + V + G L+I+N H R F+ D + QR + N Y Sbjct: 233 VEPVRLNRGDLLIIDNRRTAHARSPFSARFDGSDRWI-QRAFAISNPNFY 281 >UniRef50_B5GLY6 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GLY6_STRCL Length = 342 Score = 74.4 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 86/276 (31%), Gaps = 40/276 (14%) Query: 65 FRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVK------------------ 106 +R A + L +L L GA +I+ VDD Sbjct: 47 YRYAPLATRLLPERLLAFLHHFRQEEPAGACVISGWQVDDHAIGPTPDTWRTGTTRSRAL 106 Query: 107 -QADEMVKLATAVAHLIG---RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHND 162 +V L++ + + + + + S +++LH++ Sbjct: 107 ADEVYLVLLSSVLGEVFAWSTVQDGHLVQDLFPVPGEEHEKSAGSS-----ASLLDLHSE 161 Query: 163 GTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNV 222 + DY+ ++ + + + +D L R LA P S+++ Sbjct: 162 DAFSPLRCDYLGLLCLRNDTAV--PTSYVPVDGLS-LTPEQRTVLAEPRFV-LLPDSEHL 217 Query: 223 SKDVFH------PVFDVDQQGRPVMRYI--DQFVQPKDFEEGVWLSELSDAIETSKGILS 274 + P G P Y+ D+F D ++ L+ + Sbjct: 218 KRAAERGEQPPAPPRTALLFGDPASPYLVVDEFFIRTDPDDTQAQDALAALLRLLNDSQR 277 Query: 275 -VPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 V + G+ L I+N +HGR F D R ++ Sbjct: 278 DVTLAPGELLFIDNYRAVHGRKPFHASYDGRDRWLK 313 >UniRef50_UPI000186F191 trimethyllysine dioxygenase, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186F191 Length = 371 Score = 74.0 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 47/266 (17%), Positives = 81/266 (30%), Gaps = 36/266 (13%) Query: 46 LEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDV 105 + + ++ V ++YK L + ++ + G I V + Sbjct: 109 KDDIIKYDVCHVDYKKILE----------SDEGIYSVMKSLV---DYGVGFIENVPPNVK 155 Query: 106 KQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTY 165 + ++KL+ + F +M +F N +Y +Q H D TY Sbjct: 156 STEEVILKLSHVQNTV-----FGSMW-----QFSDDMDHNDTAYTKQYLGP---HTDNTY 202 Query: 166 VEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKD 225 E ++ I+ G N L+ E L F R + Sbjct: 203 FTEAAGLQVLHCINHTGTGGENFLIDGFKAIEILKENFPDTFTRLTLIEVEAEYLEPGYH 262 Query: 226 VF--------HPVFDVDQQGRPVMRYIDQFVQPKDFEEGVW--LSELSDAIETSKGILSV 275 HPV +Q R + P+D + L L I+ Sbjct: 263 YTFSGPLIKLHPVTKQPEQIRYNIHDRAPVTIPQDQILQHYNDLRILGSIIKNPDLEWKF 322 Query: 276 PVPVGKFLLINNLFWLHGRDRFTPHP 301 + G L+ +N LHGR +T H Sbjct: 323 KLNPGTVLIFDNFRILHGRTSYTGHR 348 >UniRef50_UPI000192475E PREDICTED: similar to Trimethyllysine dioxygenase, mitochondrial n=4 Tax=Hydra magnipapillata RepID=UPI000192475E Length = 332 Score = 74.0 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 84/251 (33%), Gaps = 35/251 (13%) Query: 71 LDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAM 130 +++C + + LL G ++ + + T + LIG Sbjct: 72 FENVCKEK-EDLLKLLENVAIYGFAIV-------KNTPTTLESVRTI-SILIGYP-RKTF 121 Query: 131 SGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL 190 G + D + L + + H DGTY + + M E GG ++L Sbjct: 122 YGDVWLTTNRSEEDMDHADLAYSNVALPCHTDGTYFLDSPG-LQMFHCHEHTGTGGETVL 180 Query: 191 ---------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVM 241 L L + + ++ + + A K+ P+ ++ + + Sbjct: 181 VDGIGAAEKLKLTNPDAVEYLAKTWI------PATYYEKDRCSKALGPIIKINPYSKEIY 234 Query: 242 RYI----DQFVQ---PKDFEEGVWLSELS--DAIETSKGILSVPVPVGKFLLINNLFWLH 292 + D+ V D E + ++ D I + + + + G L++NN LH Sbjct: 235 QVRINDGDRDVLNCLSFDEIEKFYNHYIAFLDIIYSKEQEFRLKLTPGNLLIVNNWRVLH 294 Query: 293 GRDRFTPHPDL 303 GR FT L Sbjct: 295 GRTSFTGKRSL 305 >UniRef50_B6Q7A5 TfdA family oxidoreductase, putative n=7 Tax=Leotiomyceta RepID=B6Q7A5_PENMQ Length = 744 Score = 74.0 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 45/283 (15%), Positives = 86/283 (30%), Gaps = 49/283 (17%) Query: 60 KSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVA 119 K F + L+ + R G ++ + V+ +E + + T V+ Sbjct: 87 KPFGYINRTTFPLPSLGSLLREGSRELHFGR--GFFILRGIPVEKYT-TEENIIIYTGVS 143 Query: 120 HLIG-----RSNFDAMSGQYYARFVVKNVDNSDSYLRQ-------PHRVMELHNDGTYVE 167 +G + + + G+ A + D + +Y R+ + H D Sbjct: 144 SYVGDLRGRQEKRNPIDGK--AVVLSHIKDLTGTYSRENIGGPASTNDKQVFHTDSG--- 198 Query: 168 EITDYVLMMKIDEQNMQGGNSLLLHL---------DDWEHLDNYFRHPLARRPMRFAAPP 218 D V + + ++ +GG S L + + + L AA P Sbjct: 199 ---DIVSLFCL-QRAAEGGESQLASIWQVYNILAESRPDLIHTLTNDWL-FDGFNNAAQP 253 Query: 219 SKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQ------------PKDFEEGVWLSELSDAI 266 + P R +++Y ++ P + L L Sbjct: 254 YTARPLLYYQPA-STLASARLIVQYARRYFTGFLAQPRSSTIPPITEAQAEALDAL--HF 310 Query: 267 ETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 K + + G +NNL H R F+ P R L+R Sbjct: 311 LGEKHSIQLDFQKGDIQYVNNLAIFHARKGFSNSPSKERHLLR 353 >UniRef50_B7S301 Taurine catabolism dioxygenase TauD, TfdA family n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S301_9GAMM Length = 408 Score = 74.0 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 57/186 (30%), Gaps = 19/186 (10%) Query: 138 FVVKNVDNSDSYLRQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD 195 V V++ S + + L H+D E I + + E GG SLL +D Sbjct: 195 GDVFEVESLPSANSNAYTALPLKVHSDLATREYIPGMQFLFCL-ENEATGGESLL--VDG 251 Query: 196 WEHLDNYFRHPLARRPMRFAAPPS-----KNVSKDVFHPVFDVDQQGR-PVMRYIDQFVQ 249 + + P + PV + + G +R+ Sbjct: 252 FAAAQQLNSESTEFFEVLATLPVPFGTKDREFDHRYCAPVLEHNAVGELSSVRHTYWLRS 311 Query: 250 PKDFEEG------VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDL 303 P + + + L + G+ + +N LHGRD F P Sbjct: 312 PMSGDFSTLNTFYAAYRRFQEICDDPDNQLRFRLQPGQLMAFDNRRVLHGRDAFDPESG- 370 Query: 304 RRELMR 309 R L+R Sbjct: 371 -RRLLR 375 >UniRef50_C7YUK5 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YUK5_NECH7 Length = 518 Score = 74.0 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 60/189 (31%), Gaps = 17/189 (8%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL-- 190 +Y R + + LH D Y+E L+ + E + GG SL Sbjct: 258 TFYGRTFDVRAKPDAENVAYTSGYLGLHQDLLYLESPPAIQLLHCL-ENSCNGGESLFSD 316 Query: 191 -LHLDDWEHLDNYFR-HPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVM-----RY 243 LH L N LAR + + +FDV G + Sbjct: 317 GLHAGKLLWLQNSSAVENLARVRIPYHY-EKHGYFYRQKRSLFDVGVDGNMAAVYWSPPF 375 Query: 244 IDQFVQPKDFEEGVWLSE---LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 D+F Q + WL I + + + G+ +L +NL +HGR F Sbjct: 376 QDRF-QLASVDAREWLEPARLFDGLINDPDAMFEMKMAPGECVLFDNLRVMHGRKAF--D 432 Query: 301 PDLRRELMR 309 +R Sbjct: 433 VGGGSRWLR 441 >UniRef50_Q7WGN5 Putative uncharacterized protein n=3 Tax=Bordetella RepID=Q7WGN5_BORBR Length = 357 Score = 73.6 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 55/319 (17%), Positives = 90/319 (28%), Gaps = 34/319 (10%) Query: 1 MNALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK 60 MN ++DS ++ + + P T +Q V K Sbjct: 1 MNEACTPTAASLDSYRNAQAWYGPQLDKHPEYWVHHLTGPELEQLDRAVRHADAGG---K 57 Query: 61 SFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAH 120 +LQ + + L R G LI V V+ + A+ Sbjct: 58 DITELSQDDFELGELGQRLQQVKHEVLHGR--GLYLIRGVPVEQYTMRQSAIAFW-ALGT 114 Query: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLR--QPHRVMELHNDGTYVEEITDYVLMMKI 178 +G G +D +D+ +R Q + H D + D V ++ + Sbjct: 115 NLGLPVSQNGKGHVLGHVANLGLDYADAAVRGYQTSNRLPYHTDSS------DIVGLLCV 168 Query: 179 DEQNMQGGNSLLLHLDDWEHLDNYFRHPL-------ARRPMRFAAPPSKNVSKDVFHPVF 231 G +S++ W L RHP + R+ P PVF Sbjct: 169 RPAKAGGLSSVVSSTTVWNELTA--RHPEHARTLLDSFHRTRWGEIPEGQKPYSSS-PVF 225 Query: 232 DVDQQGRPVMRYIDQFVQPKDFEEGV---------WLSELSDAIETSKGILSVPVPVGKF 282 QGR Y+ ++ V L L L + G Sbjct: 226 A-PYQGRMYANYVRSAIRKAQALPSVPRLSAQQNEALDCLDALTCDPALYLDMDFKPGDV 284 Query: 283 LLINNLFWLHGRDRFTPHP 301 L++N H R + P Sbjct: 285 QLLSNFTIFHSRTAYEDWP 303 >UniRef50_C3Y5M9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y5M9_BRAFL Length = 426 Score = 73.6 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 56/188 (29%), Gaps = 22/188 (11%) Query: 133 QYYARFVVKNVDN-SDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL- 190 +Y + S L + + H D +L+ + E +GG + L Sbjct: 217 THYGLHESSVYSKPNTSNLAFTGQRLHCHTDMPQYSSPAGILLLHCVQE-AEEGGENELI 275 Query: 191 --------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRP-VM 241 L D E P++ + P+ ++ +G + Sbjct: 276 DGYHAAYQLKEKDPEAFHTLTTTPVS---FAMHGKDYIRYDLEKTRPIISLNPEGEVVRI 332 Query: 242 RYIDQFV-----QPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGR 294 Y +Q P + + + + D + + + G+ ++ +N +HGR Sbjct: 333 CYANQLRSALMDVPVERVQPFYRAMRAWDDVLYHPDNCILTKLNAGEMVVFDNWRLVHGR 392 Query: 295 DRFTPHPD 302 F Sbjct: 393 RGFQGGRH 400 >UniRef50_B6BQI4 Gamma-butyrobetaine hydroxylase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQI4_9RICK Length = 375 Score = 73.6 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 61/180 (33%), Gaps = 21/180 (11%) Query: 142 NVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDN 201 N+ +Y R H D + E Y + + + +GG+S +D + + Sbjct: 183 PKPNNSAYTAHALRN---HMDLPWFELPPGYQFLHCLI-NSAKGGDSSA--VDGFAVAEY 236 Query: 202 YFR------HPLARRPMRFAAPPSKNVSKDVFH-PVFDVDQQG-RPVMRY----IDQFVQ 249 L P++F VS FH P + + G +R+ +D Sbjct: 237 LKNNEKEIFETLVNVPLKFKDTDYTQVSHRAFHAPAISLTKDGDYHDIRFSVATMDVLDC 296 Query: 250 PKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 D E V+ + + + + + G NN LHGR F P+ R L Sbjct: 297 HPDLMEKVYKAHHRFGNLLHDDRFQIKFRLGPGDIYSFNNRRVLHGRTEFDPNSG-HRHL 355 >UniRef50_Q1YSL8 Gamma-butyrobetaine hydroxylase n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YSL8_9GAMM Length = 366 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 37/169 (21%), Positives = 61/169 (36%), Gaps = 19/169 (11%) Query: 153 PHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP---- 206 H +E+ HND + + +GG S++ +D + L++ Sbjct: 180 AHTSLEVPPHNDFASYSWPPSVQALHML-ANECEGGESMI--VDGYSVLNDLQNDNPNLF 236 Query: 207 --LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEG------VW 258 L P+ F +N + V Q R+ +Q +Q D E + Sbjct: 237 KILCSFPVPFREFDEENETYTKEPIVRLNSQNKITGFRFSNQLMQMIDPIEDTLDLFYMA 296 Query: 259 LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 EL + I + K + G LL++ LHGR F P D +R L Sbjct: 297 YHELCNRINSKKYKSKFRLESGHILLVHGHRVLHGRCEFQP--DGKRHL 343 >UniRef50_A5DCB6 Trimethyllysine dioxygenase n=9 Tax=Saccharomycetales RepID=TMLH_PICGU Length = 399 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 57/309 (18%), Positives = 103/309 (33%), Gaps = 36/309 (11%) Query: 10 NAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYK-SFLRFRVA 68 N + +Y L + +PR + +T ++ L + W V+ +E + + F Sbjct: 80 NHEEHQSEYECRWLVIHSYNPRQIPVTEKVSGEREILAR-EYWTVKDMEGRLPSVDF--- 135 Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 K + +P+ L G I+ V VD + KL R Sbjct: 136 KTVMASTDENEEPIKDWCLKIWKHGFCFIDNVPVDPQETEKLCEKLMYI------RP--- 186 Query: 129 AMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 +Y F D S + + + H DGTY + L + + G S Sbjct: 187 ----THYGGFWDFTSDLSKNDTAYTNIDISSHTDGTYWSDTPGLQLFHLLMHEGTGGTTS 242 Query: 189 LLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSK-------DVFHPVFDVDQQGRPVM 241 L+ E L HP + + P+ + + D+ P+F +D G + Sbjct: 243 LVDAFHCAEILKK--EHPESFELLTRIPVPAHSAGEEKVCIQPDIPQPIFKLDTNGELIQ 300 Query: 242 -RY-------IDQFVQPKDFEE-GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLH 292 R+ +D + P + + + + I L + G+ L+ +N H Sbjct: 301 VRWNQSDRSTMDSWENPLEVVKFYRAIKQWHKIISDPANELFYQLRPGQCLIFDNWRCFH 360 Query: 293 GRDRFTPHP 301 R FT Sbjct: 361 SRTEFTGKR 369 >UniRef50_A8HS86 Predicted protein n=2 Tax=Chlamydomonas reinhardtii RepID=A8HS86_CHLRE Length = 362 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 50/268 (18%), Positives = 87/268 (32%), Gaps = 30/268 (11%) Query: 60 KSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVA 119 K +A + + L + + R G L+ V V + ++ Sbjct: 61 KPLQDVSLADVHLPTLSLPLIDVGQQAQHGR--GWSLLRGVPVQRYSRQQQLTAWWILGL 118 Query: 120 HLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR--QPHRVMELHNDGTYVEEITDYVLMMK 177 H GR+ G D +D R + HNDG D V ++ Sbjct: 119 HW-GRAVPQNAKGHLIGHIKDLGRDPADPNTRLYATNAAQPWHNDG-----PADLVGLLC 172 Query: 178 IDEQNMQG--GNSLLLHLDD-----WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPV 230 + + G G S + + + HL + P K PV Sbjct: 173 LSDGAEGGESGWSSSISVHNEILRTAPHLARVLADSWFFDR-KGEVPAGKKPF--FEIPV 229 Query: 231 FDVDQQGRPVMRYID-------QFVQ-PKDFEEGVWLSELSDAIETSKGI-LSVPVPVGK 281 F+ +G + Y D + + P+ + EL +++ S+ + L + G Sbjct: 230 FNYH-KGYLSVNYSDNYYHLSQRHAEVPRLGPDHHAAMELFNSLACSQQLSLRHILQPGD 288 Query: 282 FLLINNLFWLHGRDRFTPHPDLRRELMR 309 L++N LH R F P+ R L+R Sbjct: 289 VQLLSNHTCLHYRGAFRDSPEHTRHLLR 316 >UniRef50_Q3SG12 Putative uncharacterized protein n=3 Tax=Proteobacteria RepID=Q3SG12_THIDA Length = 301 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 42/167 (25%), Positives = 59/167 (35%), Gaps = 14/167 (8%) Query: 147 DSYLRQPHRVMELHNDGTYVEEITDYVLMMK-IDEQNMQGGNSLLLHLD----DWEHLDN 201 Y+ H+ + H DG Y + M + GG + LL + D Sbjct: 112 GDYIPYTHKPINWHTDGYYNALDRRILGMTLHCAQDAEAGGENALLDHEIAYIQLRDTDP 171 Query: 202 YFRHPLARRP-MRFAAPPSKNVSKDVFH--PVFDVD-QQGRPVMRY---IDQFVQPKDFE 254 + L + M A +N PVF VD QG MRY V D Sbjct: 172 DYVAALMQPDAMTIPARMDENDIARPAQSGPVFAVDPDQGFLYMRYTARTRSIVWKDDAL 231 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + L++ + S ILS + G L+ NN LH R F+ P Sbjct: 232 TQCAVKALTEILAGSPYILSARLRPGMGLVCNN--VLHTRSAFSDSP 276 >UniRef50_A3NJS9 Taurine catabolism dioxygenase, TauD/TfdA family n=21 Tax=pseudomallei group RepID=A3NJS9_BURP6 Length = 278 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 84/265 (31%), Gaps = 21/265 (7%) Query: 52 WPVQALEYKSFL-RFRVAKILDDLCANQLQ--PLLLKTLLNRAEGALLINAVGVDDVKQA 108 QAL + F ++++ + + + + +I + + Sbjct: 8 HSAQALSRQPFDGTLPLSEVTLPIRTPRTMTDDEVSGMIETFNRYGFVILD---CESSER 64 Query: 109 DEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEE 168 D+++ L T + + DA G + DS H H DG + + Sbjct: 65 DDLLALKTWLGNAAPHKRADA-DGVVPINAFEPVAGHIDS----SHEAHLPHTDGAFSDT 119 Query: 169 ITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRH--PLARRPMRFAAPPSKNVSKDV 226 + + + G ++L + H+ + PL R + + Sbjct: 120 PERIITLQCVRPSRNGGLSTLSSAKAAYRHVVACYGDITPLTRADALTIERTT----QKS 175 Query: 227 FHPVFDVDQQGRPV-MRYID-QFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLL 284 VF D G + R D L+ + + +L P+ G+ L+ Sbjct: 176 TAAVFKEDDDGWSIKFRMNDGAATATPAPAAADMYGSLACFLTDPENMLLFPLEPGQILI 235 Query: 285 INNLFWLHGRDRFTPHPDLRRELMR 309 +N HGR + PH RR + R Sbjct: 236 GDNTAVTHGRTSYPPHQ--RRNMRR 258 >UniRef50_Q1QSP3 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QSP3_CHRSD Length = 431 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 71/204 (34%), Gaps = 20/204 (9%) Query: 116 TAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLM 175 A+A IG + + F V+ + DS + H D E L+ Sbjct: 204 DAIARRIGPPR----TTNFGTLFDVRAKPDPDSNAYTSI-ALPPHVDLPTREYQPGLQLL 258 Query: 176 MKIDEQNMQGGNSLLLHLDDWEHLDNYF-RHPL---ARRPMRFAAPPSKNVSKDVFH-PV 230 + E + GG+++++ D + + RHP +R+ + + V+ P+ Sbjct: 259 HCL-ENDTVGGDAVMM--DGFAVAEALRERHPEHFATLTRVRWCYANTARTTDHVWFDPM 315 Query: 231 FDVDQQGRP-VMRYIDQFVQPK-----DFEEGVW-LSELSDAIETSKGILSVPVPVGKFL 283 +D G +R D P D E L L + + L G + Sbjct: 316 IKLDANGHFDEVRIADFLRGPLMAPFEDVEPAYAALMALQRLLREPEFALRFSYAPGDMV 375 Query: 284 LINNLFWLHGRDRFTPHPDLRREL 307 + +N LH RD F RR L Sbjct: 376 IFDNRRLLHARDAFDVGQGGRRWL 399 >UniRef50_Q5KF50 Mitochondrion protein, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KF50_CRYNE Length = 447 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 73/249 (29%), Gaps = 29/249 (11%) Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFD 128 I+ Q +L G + V +D + + Sbjct: 175 DIMSQQVHQHEQAVLQVLNKVHQFGFCFVTGVPIDAKETETL-------------IKSIG 221 Query: 129 AMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 + +Y F D S L + + H D TY + + + + G Sbjct: 222 PIRQTHYGGFWSFTADLSHGDLAYSAQSLPAHTDTTYFTDPAGLQIFHLLSHPSPGQGGK 281 Query: 189 LLLH-----LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKD--VFHPVFDVDQQGR-PV 240 LL +D L+R P+ A +K + PV D+ GR Sbjct: 282 TLLADGFHAASQLSAVDPASYSVLSRLPIPAHASGTKGTLLRPLISFPVLRHDECGRLAQ 341 Query: 241 MRYIDQ----FVQPKDFEEGV----WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLH 292 +R+ ++ E +++ + V + G L+I+N +H Sbjct: 342 VRWNNEDRGIIGHGWSATEVRQWYQAAQRFESLVKSEQNEYWVQLNPGTMLIIDNWRVMH 401 Query: 293 GRDRFTPHP 301 GR FT Sbjct: 402 GRSEFTGSR 410 >UniRef50_B3RWG0 Putative uncharacterized protein n=7 Tax=Trichoplax adhaerens RepID=B3RWG0_TRIAD Length = 430 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 64/176 (36%), Gaps = 19/176 (10%) Query: 148 SYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNY-FRHP 206 S + + + +H D ++ + ++ I + QGG S +D + ++P Sbjct: 228 SNVAFTNAGLAMHTDLAFITYVPGVQMLHCISK-AGQGGESRF--VDGFRAATQLKEKYP 284 Query: 207 -----LARRPMRFAAP--PSKNVSKDVFHPVFDVDQQGRPV-MRYIDQFVQPK------- 251 L + P+R+ ++ HP+ + G + Y D P Sbjct: 285 EAFNLLVKYPIRYYDVGKDYITFNQLTQHPIIRLHDNGELKQIVYADHPRSPLMGVPQNK 344 Query: 252 DFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 + +S+ + + ++ + G ++++N +HGR + P R L Sbjct: 345 VIDLYDAMSKFLECVYDPSNMIYTLLESGSMMVLDNFRVMHGRASYEVKPGSCRHL 400 >UniRef50_A8TU20 Gamma-butyrobetaine hydroxylase, putative n=1 Tax=alpha proteobacterium BAL199 RepID=A8TU20_9PROT Length = 365 Score = 72.1 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 59/185 (31%), Gaps = 16/185 (8%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH- 192 + A F V+ + L + H D Y I + + + +GG S L+ Sbjct: 164 FGALFEVRTTEAPT-DLAYTALALYAHTDNPYRRPIPGIQFLSCLV-NDAEGGESTLVDG 221 Query: 193 ---LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQG-------RPVMR 242 D L +R+ +V +DV + +VD+ G P + Sbjct: 222 LAVAQSLRSDDPDAFAALTTTGIRYRYDGDGSVLEDVGR-MIEVDEAGEIQRIRFNPRVE 280 Query: 243 YIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 Y+ + + + + + G L+ +N HGR F+ Sbjct: 281 YVLPAPPAALAAFYRARAVFGARLGDPAFEIRLKLEPGDTLMFDNHRLAHGRTSFS--AA 338 Query: 303 LRREL 307 RR L Sbjct: 339 GRRHL 343 >UniRef50_A7SLB9 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7SLB9_NEMVE Length = 337 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 74/198 (37%), Gaps = 18/198 (9%) Query: 118 VAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMK 177 +A +G + G+ +A F + +D++D+ + HND TY + M+ Sbjct: 136 LAKSLGCFVRETHFGRLWA-FSNEVMDHADT--AYTSGFLHAHNDNTYYTSPAG-LQMLH 191 Query: 178 IDEQNMQGGNSLLLH----LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDV 233 + +GG SLL+ ++ + L + + + S F P ++ Sbjct: 192 CVHHDGKGGESLLVDGFNAANELKKEHPGAYTFLTTKVLPYRYIDS-ERHLKAFGPTIEL 250 Query: 234 DQQGRP--VMRY-------IDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLL 284 D + +RY ID + ++ + + + G+ ++ Sbjct: 251 DPFSKDFHQIRYNHYDRAVIDCLESDDVPSYYKAIQAYAEILRRPESEYWFKLVPGQLMV 310 Query: 285 INNLFWLHGRDRFTPHPD 302 + N +HGR+RFT D Sbjct: 311 MGNWRVMHGRNRFTGRRD 328 >UniRef50_A8U3F8 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3F8_9PROT Length = 366 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 46/261 (17%), Positives = 82/261 (31%), Gaps = 34/261 (13%) Query: 70 ILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA 129 +L L + G I + V+ + ++++ + +G Sbjct: 70 FPLGRLTTRLGALRAQIRSG--LGLGYIKGLPVERY-DRETLIRVYWGLCRHLGDPVTQN 126 Query: 130 MSGQYYARFVV--KNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGN 187 +G + V + + L Q + + H D D V ++ I + GG Sbjct: 127 RNGHLLGHVIDVGDAVADHNKRLTQTNAELCFHADSC------DVVALLCIRHARI-GGE 179 Query: 188 SLLLHL--------DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQ---Q 236 SL++ L P+ R P + P+F+ Sbjct: 180 SLIVSAVAVHDEMMRRRPDLLPELYKPIYMD--RRGEVPPGKLPWFGV-PLFNWHAGMLN 236 Query: 237 GR-PVMRYIDQFVQ-PKDFE----EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 G PV++YI+ + P + L E + L +P G + N Sbjct: 237 GYSPVLQYIESLKRFPDAPRMSNAQREALDLYFAICEEDRFCLRLPFEPGDIQFLQNHVV 296 Query: 291 LHGRDRFT--PHPDLRRELMR 309 H R + P P+ RR LMR Sbjct: 297 FHSRTSYLDWPEPERRRHLMR 317 >UniRef50_C7YXK7 Putative uncharacterized protein (Fragment) n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YXK7_NECH7 Length = 387 Score = 71.7 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 45/289 (15%), Positives = 90/289 (31%), Gaps = 56/289 (19%) Query: 47 EQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVK 106 E + +Y+++++ DD L L+ + R +G + V + Sbjct: 104 EPRKSLDLPDYDYETYMK-------DDAT---LYKLINQL---RIDGLAFVTNVPGVEES 150 Query: 107 QADEMVKLATAVAHLIGRSNFDAMSG-QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTY 165 A ++ D G + R V K ++ + + H D Y Sbjct: 151 LATIATRIG---------PIKDTFYGYTWDVRTVPKAINAA-----YTSHGLGFHTDLLY 196 Query: 166 VEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFR------HPLARRPMRFAAPPS 219 ++ L+ + + GG S+ D ++ + F LA P+ + Sbjct: 197 FQQPPHIQLLHCV-QSASSGGASVF--ADAYKAAVDLFETDMEAFDTLATVPVNYHYNHP 253 Query: 220 KNVSKDVFHPVFDV-----DQQGRPVMRYIDQFVQPKDFEEGVWLSELSD---------- 264 PV D+ Q + + F+ P E +S L+D Sbjct: 254 NANVYRTTKPVIDLRPMRIGDQIYTRINWGPPFLAPFSNHEAQGVSALNDKVERWHDAAV 313 Query: 265 ----AIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 ++ + + + G +L +N LH R F + +R Sbjct: 314 KFNALLQRPEYLFERKMNPGDCVLFDNTRTLHSRRAFDMADVGKPRWLR 362 >UniRef50_C3K0C1 Putative gamma-butyrobetaine dioxygenase n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3K0C1_PSEFS Length = 375 Score = 71.3 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 86/263 (32%), Gaps = 36/263 (13%) Query: 55 QALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKL 114 L Y F R + + L G ++I V + D Sbjct: 115 TQLIYIDFNRLTDFDYRLEAFSKFLT-----------YGVVVIQNVPTEAESVLD----- 158 Query: 115 ATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVL 174 V + G+ + G+Y+ + +++D R H ++ H D Y + + L Sbjct: 159 ---VGRIFGQ-VRETNFGKYFEVY--SRPNSNDLAYRSIH--LDPHTDNPYRDPMPGIQL 210 Query: 175 MMKIDEQNMQGGNSLLLHLDDWEHL---DNYFRHPLARRPMRFAAPPSKNVSKDVFHPVF 231 + + + G ++L+ L E L D LA P+R+ + +V P+ Sbjct: 211 LHCLINETSGGLSTLVDSLAVAEQLKLEDPEGFELLANVPVRYRHVDN-DVELIERRPII 269 Query: 232 DVDQQGR-------PVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLL 284 D GR P + Y+ + + + + G+ + Sbjct: 270 MTDALGRMTGVSYSPRLDYLPLLNEHDMAVFHRARRRMGELFVDPAFERRFKLEKGELQM 329 Query: 285 INNLFWLHGRDRFTPHPDLRREL 307 NN LHGR F + RR L Sbjct: 330 FNNTRVLHGRTSFDTNEG-RRHL 351 >UniRef50_B9K395 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9K395_AGRVS Length = 313 Score = 71.3 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 47/269 (17%), Positives = 84/269 (31%), Gaps = 42/269 (15%) Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEM-VKLATAVAHLIGRSNF 127 + +L L + G LI + ++D +A + + + Sbjct: 10 EFPLPNFGLRLHSLTKELENG--LGFRLIKGLPINDKDEAGARLISWGLGLYIGVALPQN 67 Query: 128 DAMSGQYYARFVVKNVDNSDSYLRQ--PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQG 185 + + R + S + LR ++ H D D V + G Sbjct: 68 SDGALIHDVR---DRGETSAATLRGNGTSEEIQFHID------PCDVVALFC-RRSAAVG 117 Query: 186 GNSLL---------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQ 236 G S L L ++D + L+ + + +APP + PVF + Sbjct: 118 GQSRLCSSIEIHNRLAMEDPDLLEVLY--SMLPFASLGSAPP---DAHVYNTPVFGW-KN 171 Query: 237 GRPVMRYIDQF--------VQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNL 288 G + E+ + +S+ L + + G L+NN Sbjct: 172 GLFTSHFYRARIIHAGSLPSVNLSPEQRRAVDRVSEIASNPDMYLEMNLEPGDLQLVNNH 231 Query: 289 FWLHGRDRFT--PHPDLRRELMRQRGYFA 315 H R + P PDLRR L R +F+ Sbjct: 232 ILYHARSSYEDYPDPDLRRHL--FRLWFS 258 >UniRef50_Q4V6I6 IP11337p (Fragment) n=10 Tax=Drosophila RepID=Q4V6I6_DROME Length = 421 Score = 71.3 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 63/167 (37%), Gaps = 17/167 (10%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL---LHLDDW---EHLDNYFRHPLARR 210 + LH D Y E ++ + + + GG+++L H+ D +H +++ R L+R Sbjct: 232 LPLHTDLPYYEYKPSVNILHCVVQTDSPGGSNMLVDGFHVADLLRRDHPEDFER--LSRI 289 Query: 211 PMRF---AAPPSKNVSKDVFHPVFDVDQQGR-PVMRYI-----DQFVQPKDFEEGVWLSE 261 + + + + PV +D++GR + + F P + + S Sbjct: 290 VVDWNDIGSEDGREFHNIWRAPVICLDEEGRYTRINHSVPQRDSHFNVPLEEVLPWYESY 349 Query: 262 LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 + G L NN+ LHGR + + R ++ Sbjct: 350 ALFVRLAIADSHAFKTRPGDVLTFNNIRLLHGRTGYDDSEESPRYIV 396 >UniRef50_A8TSD0 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TSD0_9PROT Length = 349 Score = 70.9 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 45/259 (17%), Positives = 75/259 (28%), Gaps = 35/259 (13%) Query: 79 LQPLLLKTLLNRAEGALLINAVGVDDVK-QADEMVKLATAVAHLIGRSNFDAMSGQYYAR 137 L + + G + G+D + D + L +A +G G Sbjct: 73 LADRIAGWVHALEHGRGCVLVKGLDPGRYDDDALAILYWGLAVHLGNPIPQNAKGDLIGH 132 Query: 138 FVVKNVDNSDSYLRQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD 195 D + +R EL H D + D V ++ GG SL+ Sbjct: 133 VRDTGRDYTSKNVRGYTTRAELKAHCDAS------DIVGLLC-RHPAKSGGESLIASSTA 185 Query: 196 W------------EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRY 243 L F + L + P + PVF + GR RY Sbjct: 186 IYNHLAAERPELIPALLEGFHYDLRGEGVT-DDPDETTFHRV---PVFSWFE-GRLSCRY 240 Query: 244 --------IDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRD 295 + + P + E+ ++ + G L++N LH R Sbjct: 241 NGKSIEEGMAKRKMPLTGLALEAVREVGRLAVSAPFRYDMTFEKGDLQLLSNHAILHSRA 300 Query: 296 RFTPHPDLRRELMRQRGYF 314 F P+ R+ R +F Sbjct: 301 AFEDWPERERQRDLWRIWF 319 >UniRef50_A4R0Y1 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4R0Y1_MAGGR Length = 573 Score = 70.6 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 35/232 (15%), Positives = 73/232 (31%), Gaps = 36/232 (15%) Query: 97 INAVGVDDVKQADEMVK-LATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHR 155 + V + DV +++ V+ +A AV H +Y + + + Sbjct: 328 LGLVFIKDVPESETAVEEMACAVGH---------AQTTFYGKTWDVVSKPQAENVAYTNV 378 Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFA 215 + LH D Y+++ L+ + + +GG SL D + + + Sbjct: 379 FLCLHQDLLYMQDPPRLQLLHCL-ANSCEGGESLFS--DGIRAAEQVRSKNPKQFELLKN 435 Query: 216 APPSKNVS-----KDVFHPVFDVDQQGRPVMR-------YIDQFVQPKDFEEG------- 256 P + + PV + + G + + D F P+ Sbjct: 436 KPVYYHYDKNGHWYEYNRPVVTLSKDGSGAIDSIGWSPPFQDNFPAPQGLSASINSQDAL 495 Query: 257 ----VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 D+ + + + G+ + +N+ LHGR +F R Sbjct: 496 EEWRAAARSFRDSSTAPESMFEYKMKPGECAIFDNMRILHGRRQFQLTSGKR 547 >UniRef50_Q2UHV9 Predicted protein n=2 Tax=Aspergillus RepID=Q2UHV9_ASPOR Length = 389 Score = 70.6 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 47/318 (14%), Positives = 90/318 (28%), Gaps = 46/318 (14%) Query: 33 LELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAE 92 L + K+ + + + + A + L+ + Sbjct: 60 YILKLDDSQLKEIDDALQHFKALG---QPLELLSPATFPLPSLHSVLRGVSDNIHKG--T 114 Query: 93 GALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA-MSGQYYARFVVKNVDN----SD 147 G L+ + VD + M+ +H+ VV ++ + SD Sbjct: 115 GFSLVRGIPVDRYSAEENMIIYVGISSHIGRMRGRQGYQYNVSPVDVVVTHITDMRPPSD 174 Query: 148 SYLR-----QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----LDDWEH 198 L + M H D D V + + E +GG S L ++ Sbjct: 175 PTLSVRVAGYTNEDMPFHTDDG------DIVSLFALGEP-AEGGESQLASGWRVYNELAR 227 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD----VDQQGRPVMRYIDQ-------- 246 LA + P SK ++ P+ R ++ + + Sbjct: 228 TRPDIIQVLASD---WPIPRSKKEDPFLYRPLIHYQNCSGTPERLLINFSRRWLAGYGDL 284 Query: 247 -FVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRR 305 + + L L + +S+ + G NN LH R + P +R Sbjct: 285 KRTRLLSVRQAEALDAL--HFLAERFHISMKLQKGDMQFFNNWSILHARRGYKDGPQRKR 342 Query: 306 ELMRQRGYFAYASNHYQT 323 L+ R + N + T Sbjct: 343 HLL--RLWLRDPDNAWPT 358 >UniRef50_UPI000023D763 hypothetical protein FG05953.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023D763 Length = 494 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 56/188 (29%), Gaps = 15/188 (7%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 +Y R + + LH D Y+E L+ + E + +GG SL Sbjct: 233 TFYGRTFDVRAKPDAENVAYTSGYLGLHQDLLYLESPPAIQLLHCM-ENSCEGGESLFSD 291 Query: 193 LDDWEHLDNYFRHPLARRPMRFAAPPSKN---VSKDVFHPVFDVDQQGRP-----VMRYI 244 L P R + P P+ ++ + Sbjct: 292 GLFAGKLLFLQSSPTIRNLWKVMVPYHYEKHGYFYHQRRPILELGPNENLAGVNWSPPFQ 351 Query: 245 DQFVQPKDFEEGVWL---SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 DQF + WL I + + + G+ +L +N +HGR++F Sbjct: 352 DQFSS-AAVDAREWLEPAKLFDRMINNPDVMYEMKMEPGECVLFDNTRIMHGRNKF--DV 408 Query: 302 DLRRELMR 309 +R Sbjct: 409 GGGSRWLR 416 >UniRef50_UPI000180D107 PREDICTED: similar to gamma-butyrobetaine dioxygenase n=1 Tax=Ciona intestinalis RepID=UPI000180D107 Length = 397 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 72/234 (30%), Gaps = 34/234 (14%) Query: 91 AEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYL 150 G LI V +++K+A+ + + G + K + + + L Sbjct: 145 DSGICLIQNVPTT----PGQVLKVASMFG-----PIYSTLYGNIFDVVDRKAENAAYTNL 195 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ-GGNSLLLHLDDWEHLDNYFRHPLAR 209 ++ H D E + L+ + + GG S +L D + + + R Sbjct: 196 -----LLPFHQDQPQYESMPGVQLLHALRFDSCVTGGESQIL--DLFRAAETFRRENPKY 248 Query: 210 RPMRFAAPPSKN-VSKDVFHPVFDVDQQGRPVMRYIDQFV----QPKDFEEGV------- 257 P + K HPVF Q+ V+ Y + V P Sbjct: 249 FQTLCEVPCKFETMDKKRSHPVFQEHQKPHFVVDYFGKLVAVNWHPGIAVATRVRFDDVP 308 Query: 258 ----WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 +E +G+ + G ++ NN H R F + R L Sbjct: 309 RYYEAYRAFLKLLERKEGMFEFRLKTGDLIIFNNRRVAHARSAFKSN-GGVRHL 361 >UniRef50_Q4V6C2 IP11527p (Fragment) n=6 Tax=melanogaster subgroup RepID=Q4V6C2_DROME Length = 366 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 85/243 (34%), Gaps = 31/243 (12%) Query: 79 LQPLLLKTLLNRAEGALLI------NAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSG 132 L+ L + + + E L+ V +DDV M +LA + ++ F M Sbjct: 112 LRFPLPQLVSSDNEVRSLVESLVRYGIVFIDDVAPTANMTELALRRVFPLMKTFFGEMW- 170 Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 N D++D+ + + H D TY + + I E + GG + Sbjct: 171 -----TFSDNPDHADTAYTKLYLGS--HTDNTYFCDAAGLQALHCI-EHSGSGGENFF-- 220 Query: 193 LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH-----PVFDVD----QQGRPVMRY 243 +D + R A + + + H P+ VD + + + Sbjct: 221 VDGLHVVHELKRRYPAAYDVLCSVQVPGEYIEKGEHHYHTAPIIQVDPLTQEFVQLRLNV 280 Query: 244 IDQFVQ---PKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT 298 D+ V P+ + L +L + + ++ + G +L +N LHGR+ +T Sbjct: 281 YDRAVFNTIPQAEMAEFYDSLRQLLLIVRDKQQQWALKLCPGSIVLFDNWRVLHGREAYT 340 Query: 299 PHP 301 Sbjct: 341 GSR 343 >UniRef50_A2R5A1 Catalytic activity: H. sapiens BBH converts gamma-butyrobetaine n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2R5A1_ASPNC Length = 543 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 65/204 (31%), Gaps = 38/204 (18%) Query: 132 GQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL- 190 +Y R + + + + H D Y+ + Y L+ + + + +GG SL Sbjct: 295 DTFYGRTWDVRSIPQATNVAYTDQFLGFHMDLMYMNDPPGYQLLHCL-QNSCEGGESLFV 353 Query: 191 --------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV-- 240 + DD ++ H + P + P PVF+ + V Sbjct: 354 DTFRVAYDMKQDDHKNYSRLLHHHI---PYHYNHPDHF---YTNSWPVFETETFDNSVTE 407 Query: 241 -----------MRYIDQFVQPKD---------FEEGVWLSELSDAIETSKGILSVPVPVG 280 + Y F P+ E+ L++ + +E + + + + G Sbjct: 408 GTNFSKSRLVHVNYSPPFQAPRKVQSPVPRKFREKNEALAKFASLLEDERYMFELKLNPG 467 Query: 281 KFLLINNLFWLHGRDRFTPHPDLR 304 + ++ N H R F R Sbjct: 468 ECVVFENRRVAHARRGFKTSTGER 491 >UniRef50_Q1GF28 Gamma-butyrobetaine2-oxoglutarate dioxygenase n=6 Tax=Rhodobacteraceae RepID=Q1GF28_SILST Length = 382 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 67/246 (27%), Gaps = 32/246 (13%) Query: 92 EGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR 151 G ++ + D + R F N Sbjct: 147 HGFTIVTDMPDSDAALTQTAELMGFV------RPTFFGTYFDVKTHINPTNT-------A 193 Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRP 211 +ELH D E + + GG S L+ D +++ + Sbjct: 194 YTAGALELHTDTPAEEFAPGIQFLHC-RINTVDGGES--LYADGVAVANDFRKRDPEGFR 250 Query: 212 MRFAAPPSK-----NVSKDVFHPVFDVDQQGRPVMRYIDQF-VQPKDFEEGV------WL 259 + P V ++DQ G I Q D ++ + Sbjct: 251 LLSEVPIPFYCEHDTYDARSRQYVIELDQHGEVEGLTISQHMADIFDLDQKLLDDYYPAF 310 Query: 260 SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASN 319 ++ K ++ + G+ ++ +N +HGR +T R RG + S Sbjct: 311 CRFGRMLQEEKYMMRFLMKGGECMVFDNHRIVHGRAAYTASSGDR----YLRGCYVDRSE 366 Query: 320 HYQTHQ 325 T++ Sbjct: 367 MRSTYR 372 >UniRef50_B0XU78 Gamma-butyrobetaine hydroxylase subfamily, putative n=5 Tax=Fungi/Metazoa group RepID=B0XU78_ASPFC Length = 483 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 56/164 (34%), Gaps = 21/164 (12%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFR----HPLARRPMRF 214 H D Y+ E + L+ + E + +GG SL + L + L + + + Sbjct: 268 FHMDLMYMNEPPGFQLLHCL-ENSCEGGESLFVDGFRVAELIRWKYPEQFEDLTKLRLNY 326 Query: 215 AAPPSKNVSKDVFHPVFDVDQQGRPVMR-----YIDQFVQPKDFEE---------GVWLS 260 K + PV + + G P R Y F P ++ L Sbjct: 327 EY-NHKEHIYNNSWPVVET-EDGDPKKRILHVNYSPPFQAPLLSDDNHQMPWIEYSRALR 384 Query: 261 ELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 + IE + + + G+ ++ N LH R++F R Sbjct: 385 AFAREIERPYNVFQLKLNPGECVIFENRRILHARNQFNTEQGKR 428 >UniRef50_Q4V6P2 IP11427p n=19 Tax=Drosophila RepID=Q4V6P2_DROME Length = 307 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 85/243 (34%), Gaps = 31/243 (12%) Query: 79 LQPLLLKTLLNRAEGALLI------NAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSG 132 L+ L + + + E L+ V +DDV M +LA + ++ F M Sbjct: 53 LRFPLPQLVSSDNEVRSLVESLVRYGIVFIDDVAPTANMTELALRRVFPLMKTFFGEMW- 111 Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 N D++D+ + + H D TY + + I E + GG + Sbjct: 112 -----TFSDNPDHADTAYTKLYLGS--HTDNTYFCDAAGLQALHCI-EHSGSGGENFF-- 161 Query: 193 LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH-----PVFDVD----QQGRPVMRY 243 +D + R A + + + H P+ VD + + + Sbjct: 162 VDGLHVVHELKRRYPAAYDVLCSVQVPGEYIEKGEHHYHTAPIIQVDPLTQEFVQLRLNV 221 Query: 244 IDQFVQ---PKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT 298 D+ V P+ + L +L + + ++ + G +L +N LHGR+ +T Sbjct: 222 YDRAVFNTIPQAEMAEFYDSLRQLLLIVRDKQQQWALKLCPGSIVLFDNWRVLHGREAYT 281 Query: 299 PHP 301 Sbjct: 282 GSR 284 >UniRef50_C4JHH9 Predicted protein n=3 Tax=Onygenales RepID=C4JHH9_UNCRE Length = 510 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 60/188 (31%), Gaps = 12/188 (6%) Query: 130 MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL 189 + +Y + ++ + H D Y+ + Y L+ + ++ GG S+ Sbjct: 265 LRNSFYGSTWDVRSQPDAKNVAYTNKFLGFHMDLLYMADPPGYQLLHCMS-NSLPGGESM 323 Query: 190 LLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVS-----KDVFHPVFDVDQQGRPVMRYI 244 D + +R + P + P F+ D+ G + Sbjct: 324 FS--DTVRAAEQLYRTHRHDYNRLWTTPVRFGYFNDGQRYEYTRPTFEGDRMGDIKLHNG 381 Query: 245 DQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT---PHP 301 + P + ++++E + + + + G+ + N +H R+ F Sbjct: 382 VEVG-PTFRSWVKAMRVFANSLEKPENVFKLKLNPGECAIFANRRVVHAREAFDLSGSDN 440 Query: 302 DLRRELMR 309 R +R Sbjct: 441 QERSRWLR 448 >UniRef50_D2VKX3 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VKX3_NAEGR Length = 404 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 68/184 (36%), Gaps = 16/184 (8%) Query: 133 QYYARFVVKNVDNSDSY----LRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 ++ R VDN+ + L ++LH D ++E + L+ I G N Sbjct: 211 THFGRLEDLRVDNTTNQNNDQLGYTDAPVDLHTDQPFIENPPELQLLHCIIPATEGGDNY 270 Query: 189 LLLHLDD---WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV--MRY 243 ++ + + +D L+ P++F + ++ P+ ++ Q G+ + +R+ Sbjct: 271 VVNSVQAAWYLKQIDPLAFKILSEFPVKFH-RKQQKFESILYKPIIELSQDGKEIKQLRF 329 Query: 244 I----DQFVQPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 V D+ + ++ ++ + K + G F+ NN H R F Sbjct: 330 SYFTYAPHVFDFDYLPEFYKAYNKFTNIVRDRKNQFYFRLEKGDFIFYNNQKMFHARTSF 389 Query: 298 TPHP 301 Sbjct: 390 KGER 393 >UniRef50_B0XCW0 Gamma-butyrobetaine dioxygenase n=1 Tax=Culex quinquefasciatus RepID=B0XCW0_CULQU Length = 436 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 64/197 (32%), Gaps = 23/197 (11%) Query: 132 GQYYAR-----FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGG 186 + Y+R FVV N +N + + +++H D Y + L+ + + GG Sbjct: 221 DEIYSRLRVEEFVVANKEN-TTNVAYLSTPLQMHTDLPYYDYKPGCNLLHCLVQSTSTGG 279 Query: 187 NSLLLH----LDDWEHLDNYFRHPLARRPMRFAAP---PSKNVSKDVFHPVFDVDQQG-- 237 +L+ D L+ + + PV +D++G Sbjct: 280 QNLIADAFWVADHMRREHPEDFRLLSETLVNWTDVGVDEGGEFHSIYRAPVICLDREGKL 339 Query: 238 ----RPVMRYIDQFVQPKDFEEG--VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWL 291 V + F P D E ++ + + G L +N+ + Sbjct: 340 ERINHSVPQRDSFFNVPLDKVEPWYRAMARFVQLLHQEA--VEFKTMPGDILTFSNIRMV 397 Query: 292 HGRDRFTPHPDLRRELM 308 HGR +T R ++ Sbjct: 398 HGRTGYTDTEGNMRHIV 414 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 60/186 (32%), Gaps = 18/186 (9%) Query: 138 FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----L 193 FVV N +N + + +++H D Y + L+ + + GG +L+ Sbjct: 32 FVVANKEN-TTNVAYLSTPLQMHTDLPYYDYKPGCNLLHCLVQSTSTGGQNLIADAFWVA 90 Query: 194 DDWEHLDNYFRHPLARRPMRFAAP---PSKNVSKDVFHPVFDVDQQG------RPVMRYI 244 D L+ + + PV +D++G V + Sbjct: 91 DHMRREHPEDFRLLSETLVNWTDVGVDEGGEFHSIYRAPVICLDREGKLERINHSVPQRD 150 Query: 245 DQFVQPKDFEEG--VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 F P D E ++ + + G L +N+ +HGR +T Sbjct: 151 SFFNVPLDKVEPWYRAMARFVQLLHQEA--VEFKTMPGDILTFSNIRMVHGRTGYTDTEG 208 Query: 303 LRRELM 308 R ++ Sbjct: 209 NMRHIV 214 >UniRef50_C8VBQ5 Gamma-butyrobetaine hydroxylase subfamily, putative (AFU_orthologue; AFUA_2G14970) n=2 Tax=Emericella nidulans RepID=C8VBQ5_EMENI Length = 555 Score = 68.6 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 76/223 (34%), Gaps = 30/223 (13%) Query: 97 INAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRV 156 + +G+ +K + ++ +A IG + +Y + + Sbjct: 259 LATMGLIFLKDIPDSREMVEKIATRIGP-----LRNTFYGSTWDVRKVPEAKNVAYTSQY 313 Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP-----LARRP 211 + H D Y+++ + L+ + + GG SL + Y P LA+ Sbjct: 314 LGFHMDLMYMKDPPAFQLLHCLR-NSCDGGESLFADTFNVAGYL-YRNRPEIFQILAKTK 371 Query: 212 MRFAAPPSKNVSKDVFHPVFDVD--QQGRPVMR--YIDQFVQP-------------KDFE 254 +R+ K+ S PV + +G + R Y F P K Sbjct: 372 LRYEY-QHKDQSYSNAWPVLERGPLDKGHFLARVAYSPPFQAPILNDSNADPEYIAKLQT 430 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 + L + ++E + + + G+ ++ N +H R +F Sbjct: 431 QLGALKYFASSLEREDNMFELKLQPGECVIFENRRIVHARRQF 473 >UniRef50_A3YAS9 Gamma-butyrobetaine hydroxylase n=1 Tax=Marinomonas sp. MED121 RepID=A3YAS9_9GAMM Length = 394 Score = 68.6 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 77/240 (32%), Gaps = 31/240 (12%) Query: 81 PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVV 140 LL R G L+ V Q + +VK+A ++ I +NF + Sbjct: 151 ELLSWLKDLRDYGLALVTQVDT----QTNTLVKVANRIS-FIRETNFGTIFNV-----QA 200 Query: 141 KNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLD 200 K NS +Y + + LH D E + + + GG S+ +D ++ + Sbjct: 201 KADANSTAY---TNLRLPLHTDLPTRELQPGLQFLHCLI-NDATGGESIF--VDGFKIAE 254 Query: 201 NYFRH------PLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV----MRYID-QFVQ 249 + H L+ PM F K D G+ V ++ Sbjct: 255 HMREHYPEDFASLSAIPMSFYNKD-KETDYRFRGTAIVTDSNGKIVEVRLANFLRGPIDV 313 Query: 250 PKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 P ++ +K + G ++ +N LH R+ F RR L Sbjct: 314 PSHQTMALYKAYRRFISLTRETKFQHFQRLNQGDLIVFDNRRVLHARNAF-DLKAGRRHL 372 >UniRef50_B6H3W1 Pc13g10890 protein n=19 Tax=Dikarya RepID=B6H3W1_PENCW Length = 400 Score = 68.2 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 41/310 (13%), Positives = 82/310 (26%), Gaps = 50/310 (16%) Query: 50 AEWPVQALEYKSFLRFRVA-------KILDDLCANQLQPLLLKTLLNRAEGALLINAVGV 102 + KSF ++ L+ L + R G +++ + + Sbjct: 73 DQLDELDAALKSFKALNLSLGHINQSTFPLPTLRPVLRDLSKEIHTGR--GFVVLRGLRI 130 Query: 103 DDVKQADEMVKLATAVAHLIGRSNFDAMSG---------QYYARFVVKNVDNSDSYLRQP 153 DD + D ++ +H+ + + N Sbjct: 131 DDYSREDNIIIYTGVSSHIGNIRGRQQEARLADGSSPVISHIKDLTRDTEKNLIGAPSNT 190 Query: 154 HRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH--------LDDWEHLDNYFRH 205 H D D + ++ ++ + +GG S L + L + Sbjct: 191 ADKQVFHTDAG------DIISLLCLN-RAAEGGESYLSSSWHVYNILAKERPDLIHTLSQ 243 Query: 206 PLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQ------------PKDF 253 A P P R +++Y ++ P Sbjct: 244 DWPVDGFNNPARPYSLRPLLYHQPATAT-TPERVLIQYARRYFTGFLAQPRSKDIPPITE 302 Query: 254 EEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGY 313 + L L + ++ G +NNL H R+ F P R L+ R + Sbjct: 303 AQAEALDAL--HFLAEEHSAALDFQKGDVQYVNNLSIFHARNGFRDEPGKERHLL--RLW 358 Query: 314 FAYASNHYQT 323 N ++T Sbjct: 359 LRDPENAWET 368 >UniRef50_A8H4N7 Trimethyllysine dioxygenase n=2 Tax=Shewanella RepID=A8H4N7_SHEPA Length = 371 Score = 68.2 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 42/305 (13%), Positives = 85/305 (27%), Gaps = 35/305 (11%) Query: 17 DYSGFTLTPSAQSPRL--LELTFTEQTTKQFLEQVAEWPVQALEYKSF---LRFRVAKI- 70 F + + R+ L+ + FL +A PV Y+ + L+ +V Sbjct: 59 QVEQFQIINNGAQLRIHWLDGDLVSEFDASFLFNMACTPVNDPSYQLWANELQNQVPDFD 118 Query: 71 LDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAM 130 + + AN L G + + + + + ++ + G Sbjct: 119 FEQVTAND-AAFLPVLESMDRYGLVTFSGMPSNMEATKKLLNQVGYIRDTVFG------- 170 Query: 131 SGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL 190 + + N + + S + LH D TY + L+ + +G + Sbjct: 171 -----SLWDFSN-NGAHSDSAYTSVGIGLHTDSTYTLDPPGLQLLHCLAFD-GEGAFNQF 223 Query: 191 LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSK-----DVFHPVFDVDQQGRP---VMR 242 D ++ A + + H V D G+ Sbjct: 224 --ADGFKVAQTIKNEDPAAYETLSKIKVPAHYIEPGIQLRGQHEVVREDINGQFEQICFN 281 Query: 243 YIDQFVQPKDFEEGVWLSE----LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT 298 D+ E I K ++ + G+ + +N LH R F+ Sbjct: 282 NFDRSPFMLSASEQKAFYHAYGLFQRLINDPKYQVNFQLQPGRAVWFDNWRVLHARSAFS 341 Query: 299 PHPDL 303 L Sbjct: 342 GFRHL 346 >UniRef50_B5HKD0 Oxygenase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HKD0_STRPR Length = 326 Score = 67.9 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 59/204 (28%), Gaps = 24/204 (11%) Query: 121 LIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEI------TDYVL 174 ++ + + + N S + H+D TY D+++ Sbjct: 108 VLTEKDGRLIHDIVPVAGGERTQTNQSS-----AVFLNFHSDITYDPTGRYDVANPDFLV 162 Query: 175 MMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLA---RRPMRFAAPPSKNVSKDVFHPVF 231 + + G ++ + D + L+ R AP S S V Sbjct: 163 LNCLRGDPS--GAAVTYYADARDICGRLPEEELSLLRSPLFRLNAPGSYTRSAAGGAEVL 220 Query: 232 DV------DQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLI 285 + P + V+P E L V + G+ LL+ Sbjct: 221 SEPVPVISGPEAFPEIAVSANGVRPLSEEAAAAFERLQRTCREVA--HPVRLEPGQALLV 278 Query: 286 NNLFWLHGRDRFTPHPDLRRELMR 309 NN +H R FT D R ++ Sbjct: 279 NNRKGVHARSPFTARHDGRDRWLQ 302 >UniRef50_Q10ZR8 Putative uncharacterized protein n=2 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZR8_TRIEI Length = 375 Score = 67.5 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 94/274 (34%), Gaps = 33/274 (12%) Query: 62 FLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQ-ADEMVKLATAVAH 120 F + +++K D +N L G+ ++ G+ K ++ L A++ Sbjct: 41 FKKSQISKNKDINLSNSLLKTFKDISEELEFGSGIVLLKGIPVHKYLETDLSDLYLALSR 100 Query: 121 LIGRSNFDAMSG---------QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITD 171 IG ++ S Q+ + ++S +Q + + H D D Sbjct: 101 KIGVPIRESNSDFDSPIRERNQFITEIKAEAKNSSQEN-KQSNDAFKFHTD------RCD 153 Query: 172 YVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKD-VFHPV 230 ++ + + + G N L + + + Y+ +A+ + P + + +P+ Sbjct: 154 LNSLLCVRQARIGGENRLASAITIYNEMLKYYPD-IAQELFK-EIPFFFEGENNWITYPL 211 Query: 231 FDVDQQGRPVMRYIDQFV--------QP-KDFEEGVWLSELSDAIETSKGILSVPVPVGK 281 + + + G+ +Y +V P ++ L L + + + G Sbjct: 212 WCIYE-GKFTTQYSSGYVALSQLIPDCPRLTQKQKQGLDLLEEIGLKVGITM--KLEPGD 268 Query: 282 FLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFA 315 + + NN H R + L+ R + + Sbjct: 269 WFIANNHIIYHARSTWEIESGDYDRLLL-RVWLS 301 >UniRef50_C6WRI3 Taurine catabolism dioxygenase TauD/TfdA n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WRI3_ACTMD Length = 250 Score = 67.5 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 53/155 (34%), Gaps = 12/155 (7%) Query: 155 RVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----LDDWEHLDNYFRHPLARR 210 R +E H D + V +LM+ GG L+ D L Sbjct: 80 RALEPHTDRSGVSNPP-VLLMLACARPGTTGGECTLIDGQAVYADLAETAPGALDALCSP 138 Query: 211 PMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSK 270 S ++ +F + G +R + L L +AIE + Sbjct: 139 RSVLFGGASGHIGA-----IFSPARGGLVSVRLRTDDLVQFSPAVERRLPALREAIE--R 191 Query: 271 GILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRR 305 L +P+ G+ +NN WLHGR FT H + R Sbjct: 192 HTLELPLEEGQGYFLNNARWLHGRRAFTGHRVVHR 226 >UniRef50_C3ZMX7 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZMX7_BRAFL Length = 341 Score = 67.5 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 54/165 (32%), Gaps = 11/165 (6%) Query: 153 PHRVMELHNDGTYVEEITDYVLMMKID-EQNMQGGNSLLLHL---------DDWEHLDNY 202 + ++ H D Y E L+ + + +QGG S+LL + E + Sbjct: 142 SNVGLDFHMDLMYYESPPGLQLLHCVRFDPEVQGGESVLLDVFPVVEHLRVHHPEDFNTL 201 Query: 203 FRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSEL 262 R P+ + F + H V + D++ Y + + + L + Sbjct: 202 VRVPVTFQKFHFDREYPVCMRYQRPHIVLNPDKETDVE-PYYQAYRRLVQAMQDSPLKRI 260 Query: 263 SDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 + G L+ NN LH R FT P R L Sbjct: 261 FIQSCPPHFQKERRLVAGDCLIFNNRRMLHSRRAFTLAPGTVRHL 305 >UniRef50_A1SFH6 Oxygenase (Secreted protein) n=1 Tax=Nocardioides sp. JS614 RepID=A1SFH6_NOCSJ Length = 354 Score = 67.5 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 62/206 (30%), Gaps = 11/206 (5%) Query: 113 KLATAVAHLIGRSNF--DAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEIT 170 L A A +G D SG + + + + + H++ + Sbjct: 105 LLMVAFASELGCPISYADQRSGAVFHDIYPTRANAAAVSSQSHLVGLGFHSEMFFHPTPP 164 Query: 171 DYVLMMKIDEQNMQGGNSLLLHLDDWE-HLDNYFRHPLARRPMRFAAPPSKNVSKDVFHP 229 D++++ + + + + E L + L+ + P Sbjct: 165 DFLVLHCLRPDPGGAALTGVSDMASVELSLSRADMNVLSAPSFALDLARLHGSYTYLGRP 224 Query: 230 VFDVDQQ------GRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFL 283 + + D + GR +R+ P L +S + + + G L Sbjct: 225 IAEADPRPCVPVVGRDRIRFEPALTTPTSGTAQRAL--VSAERVAEREVAFGALGDGAML 282 Query: 284 LINNLFWLHGRDRFTPHPDLRRELMR 309 L++N +H R F D +R Sbjct: 283 LVDNRRAVHSRTSFPARYDGTDRWLR 308 >UniRef50_A9LFI0 Clavaminic acid synthetase-like protein 1 (Fragment) n=2 Tax=Karenia brevis RepID=A9LFI0_KARBR Length = 404 Score = 67.5 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 68/245 (27%), Gaps = 38/245 (15%) Query: 75 CANQLQPLLLKTL--LNRAEGALLINAVGVDDVK-QADEMVKLATAV----AHLI----- 122 + L L +G ++I + V D + D+M + V H+I Sbjct: 73 LGPTMMEKLDTMRDHLENQKGLVMIRNMPVSDSRFSEDDMAIMYLGVSAHIGHIILQSSS 132 Query: 123 GRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQN 182 G + G R + + +Q + H D D + +M I Sbjct: 133 GLRSVSRGYGLPLGRVQAEMTGETPKGGKQTNNHFRYHTD------RCDVISLMCIRP-A 185 Query: 183 MQGGNSLLLHL----DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGR 238 GG S + + D LA+ R + PV + G+ Sbjct: 186 PSGGASRVCSAPAIYNALLEQDPELADALAQPIDRIWEGENG----YFRLPVMGLTPDGK 241 Query: 239 PVMRYIDQFVQ---------PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLF 289 + +V+ + L L D + G +NN Sbjct: 242 FTSQISPSYVENAQFLDNTIKATPTQIRALDALEDIGMDVGA--EFMMKPGMLYFLNNHQ 299 Query: 290 WLHGR 294 HGR Sbjct: 300 VYHGR 304 >UniRef50_A0P0W1 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0W1_9RHOB Length = 280 Score = 67.5 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 80/248 (32%), Gaps = 14/248 (5%) Query: 74 LCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQ 133 + A+Q L+ + +G ++ V Q + +LA A+ + +F A Sbjct: 9 VVADQATSLIDIIVDQLNQGIGVLIIRNV--HLQVHHLERLAHALGTPVELKSFHANGSN 66 Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 D S L + R H D T Y + +D + GG + L Sbjct: 67 VVLDLKSHGEDGQLSDLYRFGR--SWHTDYTTTSITGGYTALYCLDAPEVGGGT-KFISL 123 Query: 194 DDWE-------HLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQ 246 D E ++ + + + P + K HP+F +G + Sbjct: 124 DTRELPESARSLIEKFCGRNPCEVDVEHSNPKADKTMKSAMHPLFRKGMKGEGCYISLGS 183 Query: 247 FVQPKDFEEGV-WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRR 305 F + E ++ + + +E + ++ NN + H + RR Sbjct: 184 FAYSRALIENQGFIRNMYNVLEQEGKSHVAECSRNQLIVWNNAYVAHKAMPYNSEQ-YRR 242 Query: 306 ELMRQRGY 313 ++R + Sbjct: 243 HMLRIATW 250 >UniRef50_C4XYW1 Putative uncharacterized protein n=2 Tax=Saccharomycetales RepID=C4XYW1_CLAL4 Length = 413 Score = 67.1 bits (162), Expect = 7e-10, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 62/188 (32%), Gaps = 20/188 (10%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 +Y F D + + H DGTY + L + G SL Sbjct: 201 THYGGFWDFTSDLGKKDTAYTNFDISAHTDGTYWSDTPGLQLFHLLYHDGTGGTTSL--- 257 Query: 193 LDDWEHLDNYFRHPLARRPMRFAAPPSKN--------VSKDVFHPVFDVDQQGRPVM-RY 243 +D ++ + + + P + + D+ PVF +D +G + R+ Sbjct: 258 VDAFQCAEQLKQTDPESYELLTRIPVPAHSAGEEKVCIQPDIPQPVFKLDLEGNLIQVRW 317 Query: 244 -------IDQFVQPKDFEEGV-WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRD 295 +D + P D + + I + + + + G+ L+ +N H R Sbjct: 318 NQTDRSCMDCWADPTDVPKFYKAIRSWYKIISSPENEIFYQLKPGQCLIFDNWRCFHSRT 377 Query: 296 RFTPHPDL 303 FT L Sbjct: 378 EFTGKRRL 385 >UniRef50_Q7K4A8 CG10814 n=19 Tax=Drosophila RepID=Q7K4A8_DROME Length = 402 Score = 67.1 bits (162), Expect = 7e-10, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 59/192 (30%), Gaps = 13/192 (6%) Query: 130 MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL 189 M Y S + LH D Y E + ++ +++ GG +L Sbjct: 189 MRRTTYGEEFSVRAQPGGSNYAYLAAPLPLHTDMPYFEYLPGVTMLHTLEQSASPGGVNL 248 Query: 190 LLHLDDWEHLDNYFR----HPLARRPMRFAAPPS---KNVSKDVFHPVFDVDQQGRPV-- 240 L + L + P+ +A S PV ++D +GR V Sbjct: 249 LADAFYVAEVMRERYPEQFRVLCQTPVDWADIGSDGDLQFHNIWRAPVINLDAEGRCVRI 308 Query: 241 ---MRYIDQ-FVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDR 296 + D F P + + + + + G L +NL +HGR Sbjct: 309 NHSIPQRDSHFSVPVEQVRPWYEAMATFVGLAHEHSCRFKTTPGDVLTFDNLRLVHGRTG 368 Query: 297 FTPHPDLRRELM 308 + R ++ Sbjct: 369 YDDTDRNVRHIL 380 >UniRef50_C0P0S4 Gamma-butyrobetaine dioxygenase n=7 Tax=Onygenales RepID=C0P0S4_AJECG Length = 546 Score = 67.1 bits (162), Expect = 7e-10, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 57/180 (31%), Gaps = 33/180 (18%) Query: 148 SYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPL 207 + ++ ++ H D Y++E Y L+ + + GG SL D ++ R+ Sbjct: 282 KNVAYTNKHLDFHMDLLYMKEPPGYQLLHCLR-NSFSGGESLFS--DTFQAAVRLLRNDP 338 Query: 208 ARRPMRFAAPPSKNV-----SKDVFHPVFDVD---------QQGRPV-----MRYIDQFV 248 + P HP +++ + PV + Y F Sbjct: 339 ILFDILCKTPTRFEYKNNNQHYQYSHPTIEIEGGEEFLKNPPKKNPVPYVNYVNYSPPFQ 398 Query: 249 QPKDFEE-----------GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 P + + + + + I V + G+ ++ N +H R+ F Sbjct: 399 APSYLTKHLVDGRDIKLYVRAMKAFAAELGKPENIFQVKLEPGQCVIFQNRRVVHARNAF 458 >UniRef50_B2WEF8 Trimethyllysine dioxygenase n=2 Tax=Pleosporineae RepID=B2WEF8_PYRTR Length = 430 Score = 67.1 bits (162), Expect = 7e-10, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 61/185 (32%), Gaps = 18/185 (9%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 +Y F D + + +E H D TY + + + +GG SLL Sbjct: 221 THYGGFYDFTSDLASKDTAYTNIALEAHTDTTYFSDPAGLQAFHLLSHTDGEGGASLL-- 278 Query: 193 LDDWEHLDNYF-RHPLARR---PMRFAAPPSKNVSKDVF----HPVFDVDQQ--GRPVMR 242 +D ++ P A + ++ A S N + PV + D +R Sbjct: 279 VDGFKVAAELLETDPEAYKILSTVKVHAHASGNEGISIQPYRSFPVLEHDPTIGDLVRVR 338 Query: 243 YIDQ----FVQPKDFEEG--VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDR 296 + P + E + ++ + + G+ L+ +N LHGR Sbjct: 339 WNSSDRAWIDLPIEQVETWYRAARKFDALLKKKENEYWEQLRPGRVLIFDNWRVLHGRSS 398 Query: 297 FTPHP 301 FT Sbjct: 399 FTGKR 403 >UniRef50_B8M6P7 Trimethyllysine dioxygenase TmlH, putative n=1 Tax=Talaromyces stipitatus ATCC 10500 RepID=B8M6P7_TALSN Length = 536 Score = 67.1 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 78/241 (32%), Gaps = 35/241 (14%) Query: 80 QPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFV 139 Q L L G L+ V V + ++A + G ++ F Sbjct: 273 QGLTLWLQKVVDWGYCLVKGVPVTPEATKQLLERIAFI---------RETHYGGFW-DFT 322 Query: 140 VKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW--- 196 +Y + H D TY + L ++ +GG +LL +D + Sbjct: 323 SDLTFKDTAYTTEALGA---HTDNTYFSDPARLQLFHLLEHTEGEGGETLL--VDGFYAA 377 Query: 197 -EHLDNYFRHPLARRPMRFAAPPSKN----VSKDVFHPVFDVDQQGRPVMRYI----DQF 247 L + A S N + V+ PVF+ D +MR D+ Sbjct: 378 QRMLIEAPHNVEAFTDYAHPWHSSGNEHISIQPYVYFPVFERDPTTARLMRIRWNNYDRA 437 Query: 248 VQPKDFEEGVWLSELSDA-------IETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 + D+ + +S S A + + + + G +L +N LHGR FT Sbjct: 438 AK-IDWTPYMAMSWYSAARHWNAILLRKDQTQKWLQLEPGTAILFDNWRMLHGRSEFTGK 496 Query: 301 P 301 Sbjct: 497 R 497 >UniRef50_C5P9Y0 Trimethyllysine dioxygenase, putative n=2 Tax=Coccidioides RepID=C5P9Y0_COCP7 Length = 450 Score = 67.1 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 66/188 (35%), Gaps = 23/188 (12%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 +Y F D + + + + +H D Y + + + + + GG S L Sbjct: 237 THYGGFWDFTSDLAMKDMAYTTQGLGVHTDNAYFTDPSGLQMFHLLSHTDGDGGESTL-- 294 Query: 193 LDDWEHL------DNYFRHPLARRPMRFAAPPSKNVSKDV----FHPVFDVDQQGRPV-- 240 +D +E + L+ A S N + H F Q + Sbjct: 295 VDGFEAARTLWSENPDAYAVLSNPIFSHHA--SGNEHVHIMPAKTHETFSHRQPTGELYQ 352 Query: 241 MRYIDQ-----FVQPKDFEEGVWLS--ELSDAIETSKGILSVPVPVGKFLLINNLFWLHG 293 +R+ D+ F +D +++ E S ++ K +L + G L+ +N LHG Sbjct: 353 IRWNDEDRGANFTGSQDSLLAWYVAAREWSQMLKRPKLLLKFKLEPGMPLIFDNWRMLHG 412 Query: 294 RDRFTPHP 301 R FT Sbjct: 413 RTAFTGAR 420 >UniRef50_D1KB60 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KB60_9GAMM Length = 308 Score = 67.1 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 60/177 (33%), Gaps = 15/177 (8%) Query: 148 SYLRQPHRVMELHNDGTY-VEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP 206 Y+ ++++ H DG Y + + L + + ++ +GG + L+ + L Sbjct: 122 KYIPYTNKLIHWHTDGYYNDPDKQIHALNLHVVQKAEKGGENQLMDHEIAYILLREKNPD 181 Query: 207 LARRPMR-------FAAPPSKNVSKDVFHPVFDVDQQGRPVMRYI---DQFVQPKDFEEG 256 R M+ S D PVF + G MRY V +D Sbjct: 182 FVRALMQNNVMMIPAGTTSSGGFRSDRPGPVFSISTDGNLHMRYTARKRNIVWSQDPLVT 241 Query: 257 VWLSELSDAIETSKGILSVP--VPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 + L + + G L+ NN LH R FT + +R R R Sbjct: 242 EAIDYLQQILNDENSDYVFKGLLEPGMGLISNN--VLHDRSAFTDSIEHKRHYYRAR 296 >UniRef50_A4FLP1 Oxygenase n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FLP1_SACEN Length = 200 Score = 67.1 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 59/165 (35%), Gaps = 11/165 (6%) Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE--HLDNYFRHP-LARRPM 212 +E H + +Y + + +L+ + + G + + D + P L Sbjct: 10 RLEWHTEDSYHPDRPELLLLACVRNPDDIGTDIASVRRADLSEADIALLSTTPVLIEPDD 69 Query: 213 RFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQ-PKDFEEG-VWLSELSDAIETSK 270 +A S+ + + G MR+ + + P+D E LS++++ + Sbjct: 70 SYAGSWSEGQGDEGKMTTLWQTEDGL-CMRFDPPYTRLPEDAPELCAAWRRLSESLDQAG 128 Query: 271 GILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR---QRG 312 ++ G ++++N H R F D ++ RG Sbjct: 129 MTVAG--QPGDIVVVDNDVAAHARRPFQARYDGTDRWLKRILVRG 171 >UniRef50_C3Z1Z7 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3Z1Z7_BRAFL Length = 386 Score = 67.1 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 46/166 (27%), Gaps = 15/166 (9%) Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP------LAR 209 + LH D Y + +M I + GG S L+ + + + Sbjct: 197 PLGLHTDLAYYNYLPGVQMMHCIKQTESDGGASELVDAFNAAYQLKEENPDAFKLLTTVK 256 Query: 210 RPMRFAAPPSKNVSKDVFHPVFDVDQQGR-----PVMRYIDQFVQ-PKDFEEGVW--LSE 261 H + V QG D + P D + + + Sbjct: 257 VNFHRIGKAEPKHHMRERHHIISVSDQGEVQKVVCGKHSRDSVLDVPVDQVKPFYRAMRA 316 Query: 262 LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 + + + + G L +NL LH R F+ R L Sbjct: 317 FDTILSNPRNCIRYKMKEGDMLAFDNLRILHDRTAFSM-SGGERHL 361 >UniRef50_UPI0001699593 Probable taurine catabolism dioxygenase n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001699593 Length = 214 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 41/178 (23%), Positives = 64/178 (35%), Gaps = 21/178 (11%) Query: 149 YLRQPHRVMELHNDGTYVEEITDYVLMMK-IDEQNMQGGNSLLLHLDDWEHLDNYFRHPL 207 Y+ +R + H DG Y + ++ E +GG + LL + +L P Sbjct: 35 YIPYSNRPIAWHTDGYYNQSEEQIHGLLLHCVEPAAKGGENALLD-HEIVYLQIRDYQPA 93 Query: 208 ARRPMRF----AAPPSKNVSKDVF----HPVFDVDQQGRPVMRYIDQ------FVQPKDF 253 + + P ++ + + PVF + G MRY D+ P Sbjct: 94 YIQALMHPQAMTIPANQVDGEILRPARSGPVFSIAPDGHLHMRYTDRSRSIECRADPLLA 153 Query: 254 EEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQR 311 E +L L +A S + G+ LL NN LH R F +R L R R Sbjct: 154 EAVNYLKALLNA--PSPWHFRGKLDAGQGLLSNN--VLHTRSAFNKDK-SQRLLYRAR 206 >UniRef50_Q108L2 Oxygenase n=1 Tax=uncultured organism RepID=Q108L2_9ZZZZ Length = 271 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 77/261 (29%), Gaps = 36/261 (13%) Query: 77 NQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLI----GRSNFDAMSG 132 QL P + R G + + AD + A A + +I DA+ Sbjct: 17 QQLLPTFGMLVHAREAGT------PLSSLP-ADTLRAWAEAESLVILRGFAPPEGDALPS 69 Query: 133 QYYARFVVKNVD----------NSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKI-DEQ 181 + D + +R + H DG +V I ++ + Sbjct: 70 YCRGLGDLLEWDFGAINNLQAQSEAKNYLFTNRAVPFHWDGAFVGRIPHWIFFHCASAPE 129 Query: 182 NMQGGNSLLLHLDDW-EHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQ-QGRP 239 GG +L H E + R +R++ + P+ G+ Sbjct: 130 ENTGGETLFCHTPLLLEAVSAAGRAQWENISIRYSTEKLAHYGGSFTSPLLAAHPIHGQT 189 Query: 240 VMRYIDQF--VQPKDFE--------EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLF 289 ++RY + + P E +L + + + G ++ +N Sbjct: 190 ILRYAEPVNDLNPVHLEIQGLPEESHTAFLEGMHTRLYDPAVCYAHAWQTGDIVIADNFT 249 Query: 290 WLHGRDRFTPHPDLRRELMRQ 310 LHGR F R L R Sbjct: 250 LLHGRRAFL--RPESRHLRRV 268 >UniRef50_A3Y505 Gamma-butyrobetaine hydroxylase, putative n=3 Tax=Bacteria RepID=A3Y505_9GAMM Length = 397 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 69/184 (37%), Gaps = 13/184 (7%) Query: 125 SNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ 184 F + Y + + L + + LH D Y + + L+ + E ++ Sbjct: 170 DTFGYVRDTNYGKLFEVKTQVEPNNLAFTNLGLGLHADNPYRDPVPTVQLLHCL-ENTVE 228 Query: 185 GGNSLLLH-LDDWEHLDNYFRHP---LARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV 240 GG S+L L + L++ + F K+ P+ +V+ +G+ V Sbjct: 229 GGESILGDGFKAARILREESQADFDLLSQTWINFRFQD-KDTDLQSRVPLIEVNDKGQVV 287 Query: 241 -MRYIDQFVQPKDFEEGV------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHG 293 +R+ ++ + P + ++ ++ + + ++ + G+ ++ +N H Sbjct: 288 KVRFNNRSIAPINIDKHKMKAFYKAYQHYAEILNRTSIMVDFKLTQGQLVMFDNTRVFHA 347 Query: 294 RDRF 297 R F Sbjct: 348 RKAF 351 >UniRef50_A0YHS0 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YHS0_9GAMM Length = 394 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 55/186 (29%), Gaps = 20/186 (10%) Query: 138 FVVKNVDNSDSYLRQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQG------GNSL 189 + VK + + L H D E Y + ++ G G ++ Sbjct: 192 WSVKAEILGNEENSTANTPFRLGPHTDLPTREIPPGYQFLHCLENTVTGGFATMADGEAI 251 Query: 190 LLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRP----VMRYID 245 HL + E LA F S++ P+ D P + Sbjct: 252 ARHLKEEE---PKIHQALASLNWIF-FNRSRDHDHRWSGPMLDYGVSQAPLSIRAFYPVR 307 Query: 246 QFVQPKDFEEGVWLSELS---DAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 F D + G + + +S P G + +N LHGRD F P Sbjct: 308 AFPDMADEDVGRAYRAVRRFHQLAADPQFQISYPYQSGDLIGFDNRRLLHGRDSFDPG-A 366 Query: 303 LRRELM 308 RR L Sbjct: 367 GRRHLR 372 >UniRef50_UPI0001AEE667 hypothetical protein SalbJ_25454 n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AEE667 Length = 326 Score = 66.3 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 65/197 (32%), Gaps = 21/197 (10%) Query: 126 NFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQG 185 N + ++ ++ ++ S + + LH + + DYV +M + + G Sbjct: 115 NGRLVHDVCPSKGQENSLTSASSQQQ-----LTLHTEDVFHSCRGDYVALMCLRNPDAVG 169 Query: 186 GN-------------SLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD 232 +LH + + + + P+ A PS + Sbjct: 170 TTVALVESVEFEEPLREVLHQNRFRFFPDDSHQVV---PLHTAEAPSALEERPHEVASVL 226 Query: 233 VDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLH 292 + P +R F P + + A + V + G+ + ++N +H Sbjct: 227 FGPEDAPYLRIDADFTSPVPGDREAERVMVQAAELLADAAERVVLAPGEAVFVDNYKVIH 286 Query: 293 GRDRFTPHPDLRRELMR 309 GRD FTP D ++ Sbjct: 287 GRDTFTPRYDGTDRWLK 303 >UniRef50_Q2JC07 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JC07_FRASC Length = 363 Score = 66.3 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 73/236 (30%), Gaps = 30/236 (12%) Query: 92 EGALLINAVGVDDVKQADEMVKLATAVAHL--IGRSNFDAMSGQYYARFVVKNVDNSDSY 149 G ++ + V+D+ + +V + A L I + D GQ R D Sbjct: 91 HGFAVLRGLPVEDLDDRECLVLIRGIAARLGRIATQSRD---GQLVRRVRASGRQLGDGR 147 Query: 150 LR--QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDW--------EHL 199 +R Q + H DG D L++ +GG S + +L Sbjct: 148 VRGHQTAERLWFHTDG------ADATLLLC-RRPADRGGMSRVASAAAVHNAMLASAPNL 200 Query: 200 DNYFRHPL----ARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEE 255 P A + P + + +F V ++ P Sbjct: 201 VAELYQPFHFHMAGGNIPGGPPTFLSPIFCAYRGLFSVRFVRHTLLETQTVTGVPLSPTA 260 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR--RELMR 309 L + + + + G +++N LH R +T PD R R L+R Sbjct: 261 LAAFDLLDEVADQLACDMELR--PGDLQIVHNHCVLHSRTAYTDAPDPRQARHLLR 314 >UniRef50_Q6CQT2 KLLA0D14553p n=1 Tax=Kluyveromyces lactis RepID=Q6CQT2_KLULA Length = 420 Score = 66.3 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 62/187 (33%), Gaps = 21/187 (11%) Query: 133 QYYA-RFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKID--EQNMQGGNSL 189 +Y F VKN + + + + + LH D Y+E I + L+ I E + G + Sbjct: 213 TFYGELFDVKNQASQANNIAYTAKPLPLHMDLLYLENIPGWQLLHCIKNSEGLEENGQNY 272 Query: 190 LLHLDDWEHLDNYFRHPLARRPMRFAAPPSK-----NVSKDVFHPVFDVDQQGRPVMRYI 244 +D L+ + P + + P+ + + V+ Y Sbjct: 273 F--VDSLGALNYIKNKDPSVLKALETIPITYHYRRDDKRYYQQRPLVEHKKY-ETVVNYS 329 Query: 245 DQFVQPKDFEE----------GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGR 294 F P + ++ L + I K + +P ++ +N LH R Sbjct: 330 PPFQGPFNLKDITDIPLLNQFKKGLYMFEEYINDPKNQFQIKLPENSCVIFHNRRILHAR 389 Query: 295 DRFTPHP 301 +F Sbjct: 390 RQFDGER 396 >UniRef50_B2HPX4 Gamma-butyrobetaine hydroxylase, TauD_1 n=1 Tax=Mycobacterium marinum M RepID=B2HPX4_MYCMM Length = 377 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 57/199 (28%), Gaps = 20/199 (10%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 Y + V++ + + H D E L+ I G Sbjct: 161 TIYGDTWLVRVEDRPVNIAYTACALPFHQDLCAYETPPGLQLLHCIAFDAAVTGGQTWF- 219 Query: 193 LDDWEHLDNYFRHPLARRPM-----RFAAPPSKNVSKD--VFHP-VFDVDQQGRPVMRYI 244 D + A +A + N + V P + D+ + + Sbjct: 220 CDGLAAAERVRAQ--APDDFETLCQVWATFATVNRQQHMVVRKPHLVVDDRANLVGINWA 277 Query: 245 DQF-----VQPKDFEE-GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFT 298 F P D E + A+E L++ + G+ ++ N LH R +T Sbjct: 278 PPFEGAFAGDPADEERYRRAYQTFTSAVEDGPR-LALRLQPGEIVVFQNRRTLHARQAYT 336 Query: 299 PHPDLRRELMRQRGYFAYA 317 R ++ G + A Sbjct: 337 QPTASARRVL--EGCYLSA 353 >UniRef50_A6F7M8 Gamma-butyrobetaine hydroxylase n=1 Tax=Moritella sp. PE36 RepID=A6F7M8_9GAMM Length = 373 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 58/180 (32%), Gaps = 11/180 (6%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 Y + F V + +N L + + LH D Y + L+ + G +L Sbjct: 159 YGSHFEVISEENP-VNLAYTPKPLSLHTDNAYRHPVPTLQLLHCLISAEQGGITALTDGF 217 Query: 194 DDWEHLD---NYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQ- 249 + L L P+ + + + + + ++ +R ++ +Q Sbjct: 218 YAAQLLQQRFPQQYQLLTSTPVMYRFKNADTHLEHTGYIIELNNRGELERIRLNNRAIQA 277 Query: 250 ---PKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRD-RFTPHPDL 303 P + S + + + + G+ ++ NN LHGR+ L Sbjct: 278 IKLPFAEMAAFYDAYQNFSRILHSDECKFLCTLQPGELMIFNNERILHGREVAAEGARHL 337 >UniRef50_A4REX2 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4REX2_MAGGR Length = 374 Score = 65.9 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 45/259 (17%), Positives = 84/259 (32%), Gaps = 31/259 (11%) Query: 70 ILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA 129 L + + R G L+ + D + +++ + Sbjct: 81 FPLKALGAILDGVSAEIYEGR--GFGLVRGLDAQKYTTEDLTLIYLGVQSYVADQRGRQD 138 Query: 130 MSGQYYARFVVKNVDN-SDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNS 188 G + + + ++ R ++ + H D E D + + Q QGG+S Sbjct: 139 AKGNMLVHVIQDQTNQMAANHHRHSNKAITFHTD-----EDGDVIGWLT-RGQAAQGGSS 192 Query: 189 LLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRY----- 243 ++ +L L A+P + ++ K P + GR ++ + Sbjct: 193 VIASAYAIYNLLASEHPELINA---LASPFTFSIPKIERRPAIFYHE-GRLIVNFGRTPL 248 Query: 244 IDQFVQPKDF-------EEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDR 296 I P+D E+ L + + + L + G INN+ LH RD Sbjct: 249 IGSEAHPRDQKLPLPTDEQLRALDLVEEI--AHRVELRIKTQPGDIHFINNMAVLHRRDA 306 Query: 297 FTPH----PDLRRELMRQR 311 FT RR L+R R Sbjct: 307 FTNDGTQAQFGRRHLVRVR 325 >UniRef50_B0D9T9 Predicted protein n=3 Tax=Agaricales RepID=B0D9T9_LACBS Length = 395 Score = 65.6 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 59/190 (31%), Gaps = 21/190 (11%) Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH-- 192 Y RF D + + H D TY + L + GG +LL+ Sbjct: 162 YGRFWEITSDLAKGDTAYTTLALGAHTDNTYFTDPCGLQLFHLLSHTGGTGGATLLVDGF 221 Query: 193 --LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVF----HPVFDVDQQGR--PVMRYI 244 + + L L+R P+ A + +PV D +R+ Sbjct: 222 YVANIMKELHPEAYDLLSRIPVPAHAAGESSALYRPSPPSGYPVLGHDAFTGELTQVRWN 281 Query: 245 DQ----FVQPKDFEEGVWLSEL---SDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 + W + + + + V + G ++++N LHGR F Sbjct: 282 NDDRSVMSHLTSDLVEKWYDAIRTWNKCLTSPDSQYWVQLQPGTAVVVDNHRVLHGRSAF 341 Query: 298 TPHPDLRREL 307 D RR + Sbjct: 342 ----DGRRRM 347 >UniRef50_B6BRE2 Trimethyllysine dioxygenase n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BRE2_9RICK Length = 364 Score = 65.6 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 55/153 (35%), Gaps = 14/153 (9%) Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPS 219 H D TY ++ L++ E + GG S++ +D ++ + + P Sbjct: 193 HTDSTYSKDAPGLQLLLCC-ELDATGGESIM--VDGFKIAETLKEQKEVFEILSNVEVPG 249 Query: 220 KNVSKDV----FHPVFDVDQQGR-PVMRYID----QFVQPKDFEEGVW--LSELSDAIET 268 K V V P+F + + + + + +F D + +++ + Sbjct: 250 KYVGDGVILEARRPIFRHNSKKELTQVSFNNYDRAEFRMENDLMLKFYEAITQFDNLANN 309 Query: 269 SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + + G+ L+ NN LHGR F Sbjct: 310 IEYQWRHILKPGELLIFNNWRVLHGRGSFQGKR 342 >UniRef50_UPI0001B55A0D hypothetical protein StAA4_00370 n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B55A0D Length = 257 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 72/231 (31%), Gaps = 19/231 (8%) Query: 96 LINAVGVDDVKQADEMVKLATAVAHLI-----GRSNFDAMSGQYYARFVVKNVDNSDSYL 150 ++ DV +LA+A+ H G+ R+ + S Sbjct: 28 VVVLRRAADVDSDAFYWRLASALGHFHFHFRDEDPGGLDKPGRLDIRYDPDL--AAGSRY 85 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARR 210 R + M LH DG Y + D + G + E+L L R Sbjct: 86 RYGNGRMPLHVDGVYSDVDFDVFFIRCRAAARFGGATFAVDGTTVVEYLSA-SDPALLRA 144 Query: 211 PMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEE-----GVWLSELSDA 265 + SK + V V D + G PV + V + E + Sbjct: 145 LLNVEVRFSKG-ERVVTRRVIDY-EGGDPVFNWSSTRVAGDNPPEVVRMCARFADFCEQR 202 Query: 266 IETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM----RQRG 312 + + + + G + ++N LHGR F + + ++ R RG Sbjct: 203 LVDGGLVEQIRLAPGDAVFVHNRKVLHGRYAFWGERCMLKGVLDLRPRIRG 253 >UniRef50_A3P1R2 Pyoverdine biosynthesis protein PvcB n=35 Tax=Proteobacteria RepID=A3P1R2_BURP0 Length = 304 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 86/246 (34%), Gaps = 34/246 (13%) Query: 83 LLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKN 142 L + ++ L+ G D A M + A ++ + + A V + Sbjct: 46 LRALVRSQQ----LVVLRGFDSFADAPGMTRYCAAFGEIMMWPFGAVLELREQAN-PVDH 100 Query: 143 VDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDE-QNMQGGNSLLLHL-DDWEHLD 200 V S SY+ LH DG Y+E + ++ + + + GG + + Sbjct: 101 VFAS-SYV-------PLHWDGMYLETVPEFQVFQCVQAIGDAHGGRTTFSSTTEALRVAT 152 Query: 201 NYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVD-QQGRPVMRYIDQ-------FVQP-- 250 R R R+ S V P+ + ++ P++R+ + F+ P Sbjct: 153 PEARALWQRAHGRYRRTVEL-YSNTVEAPIVERHPRREFPILRFCEPPIADDPTFINPSS 211 Query: 251 ------KDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 D E L L+ A+ + + G +L +N LHGR+RFT Sbjct: 212 YAFGGIADSERDALLGSLTRALYDPRAHYAHRWRTGDVVLTDNFTLLHGRERFTSRSG-- 269 Query: 305 RELMRQ 310 R L R Sbjct: 270 RHLRRV 275 >UniRef50_A5E9P2 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5E9P2_BRASB Length = 209 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 68/217 (31%), Gaps = 28/217 (12%) Query: 110 EMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEI 169 + +L T V GR +FD Y Q + H + Sbjct: 2 TLRRLGTIVPQYDGRVSFDVTYRPGYDDAPYS----------QSMNALGAHTEAPGFMPP 51 Query: 170 TDYVLMMKIDEQNMQGGNSLLLHLDDW--EHLDNYFRHPLARRPMRF-AAPPSKNVSKDV 226 Y+ + + G +LL + E L R + F ++ + ++ Sbjct: 52 PKYLALYCHRQARCGNGQTLLADGIRFYDEALSPDLRRWSQNNEVEFVSSATPGDETRSS 111 Query: 227 FHPVFDVDQQGRPVMRYIDQF-----VQPKDFEEGVWLSELSDAIET---------SKGI 272 G PV+R+ V P E + SD + S+ + Sbjct: 112 LRAPIRATVAGEPVLRFSYNLFRYGNVNPDAIEVRRVGDDPSDPLGQIAEEGEAFFSRNL 171 Query: 273 LSVPVPVGKFLLINNLFWLHGRDRFTPH-PDLRRELM 308 + V +P G L+ NN +HGR R+ L R + Sbjct: 172 IPVLIPDGCMLVWNNHRLMHGRGRYADQARHLTRYWL 208 >UniRef50_A9FN86 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FN86_SORC5 Length = 199 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 60/174 (34%), Gaps = 25/174 (14%) Query: 153 PHRVMELHNDGT--YVEEITDYVLMMKIDEQNMQGGNSLLL--HLDDWEHLDNYFRHPLA 208 + LH++G+ EE Y+++M D + +L E L L Sbjct: 29 ATNPLTLHSEGSGRRAEEQPRYIVLMCRDPGDEGTAAQTVLVPMAAVAEGLSPDALATLE 88 Query: 209 RRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGV----------W 258 + R +P P GR V + D QP ++ Sbjct: 89 QTRYR-RSPD---------GPWIARRVDGRWVFSFRDFLSQPLEWTHAGHAPSADAINGA 138 Query: 259 LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDR-FTPHPDLRRELMRQR 311 L +L ++ + V GK ++I+N F+ HGR + RR L R R Sbjct: 139 LRDLLASMYAPGAAIRVHWTRGKLVIIDNTFFFHGRTAGRSAGSSRRRHLQRLR 192 >UniRef50_Q6CCC7 YALI0C10604p n=1 Tax=Yarrowia lipolytica RepID=Q6CCC7_YARLI Length = 382 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 63/171 (36%), Gaps = 23/171 (13%) Query: 151 RQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFR-HPL 207 + L H DGTY + L + + +GG ++L +D + + + +P Sbjct: 185 DTAYTNFHLASHTDGTYWTDTPGLQLFHCL-HHDGKGGENML--VDGFRAAQEFKKLNPE 241 Query: 208 ARRPMRFAAPPSKNVSKD-------VFHPVFDVD--QQGRPVMRY-------IDQFVQPK 251 + P+ + +D V PVF D +R+ +D + P+ Sbjct: 242 GYELLSRVRIPAHSAGEDSVCITPEVPQPVFTHDPITGELQQVRWNNDDRSVMDTWDSPE 301 Query: 252 DFEEGV-WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 D + + + + + K + G+ L+ +N LHGR F + Sbjct: 302 DVPKFYKAIRQWNGILTDPKFEYVCKLVAGECLIFDNWRVLHGRKGFVGNR 352 >UniRef50_D0LLC5 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LLC5_HALO1 Length = 246 Score = 64.4 bits (155), Expect = 5e-09, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 53/170 (31%), Gaps = 23/170 (13%) Query: 160 HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLAR-RPMRFAAPP 218 H +G Y+ + + G++ L D + L + A + + Sbjct: 78 HTEGPIYPAPPKYLALYCHRQARCGSGHTAL--ADGLDFLHSLTAEERAFCQNYTVSFST 135 Query: 219 SKNVS----KDVFHPVFDVDQQGRPVMRYIDQFVQ----------PKDFEEGVWLSELSD 264 + + DV P+ G+ ++R+ P + E SD Sbjct: 136 YNDATGYEATDVKFPLETRRDNGQFILRFSHNLFYFGDLHAANESPSEQAASREQKEFSD 195 Query: 265 AIET-----SKGILSVPVPVGKFLLINNLFWLHGRDRFTP-HPDLRRELM 308 + + L + +P G L+ +N LH R + L R + Sbjct: 196 IVARCTDFFEQQKLRILIPDGALLIWDNHRMLHARSEYADTDRHLTRYWL 245 >UniRef50_A4X6E3 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X6E3_SALTO Length = 251 Score = 64.4 bits (155), Expect = 5e-09, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 53/154 (34%), Gaps = 11/154 (7%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAA 216 + +H + + ++ +L+ GG L+ H L + Sbjct: 83 LTVHTESSAEDQPPRLMLLFCAR-GAASGGECRLVDGARLHHEMALRDPELLNALCAPRS 141 Query: 217 PPSKNVSKDVFHPVFDVDQQGRPVMRYI-DQFVQ--PKDFEEGVWLSELSDAIETSKGIL 273 + + VF GR V+R+ D + P+ L + +ET Sbjct: 142 VLFGGAAGHL-GSVFTKH-DGRTVVRFRLDDLARFSPQITRRLASLRAILHDLETP---- 195 Query: 274 SVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 VP+ G L+ N WLHGR FT + R L Sbjct: 196 -VPLRPGHGYLLCNTRWLHGRAAFTGERLMYRLL 228 >UniRef50_B1MME3 Putative uncharacterized protein n=1 Tax=Mycobacterium abscessus ATCC 19977 RepID=B1MME3_MYCA9 Length = 315 Score = 64.4 bits (155), Expect = 5e-09, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 75/229 (32%), Gaps = 28/229 (12%) Query: 92 EGALLINAVGVDDVKQADEMVK-LATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYL 150 +G + G + D ++ + L+GR + + Sbjct: 64 QGPGFVRLRGFPIHELTDTQIEHAYLGLGQLLGRPVGQDRHANLLTHIRDEQIGAEPGVR 123 Query: 151 R-QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLAR 209 + + + + H+DG+ D V ++ + GG S ++ + + R P Sbjct: 124 KYRTNLRQDFHSDGS------DLVGLLCLRP-AKTGGESKIVSAHAV-YNEMLDRAPHLV 175 Query: 210 RPMRFAAPPSKNVSKDVFHPVFDV-----DQQGRPVMRYI-----DQFVQPKD------F 253 M P +N + V P F + G P + +I D P Sbjct: 176 EVMYRPMPWDRNNEQPVGEPPFFELPPIIEIDGIPRVFFIAWYIRDSQRHPGAPRLTGGQ 235 Query: 254 EEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 + + L+E+ + + G L+NN LH R+ +T H D Sbjct: 236 RQALALAEIIA--NDPAFHIQMQFAPGDVQLLNNTTVLHSREEYTDHDD 282 >UniRef50_A8TVU7 Dihydroxyacid dehydratase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TVU7_9PROT Length = 401 Score = 64.4 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 53/181 (29%), Gaps = 19/181 (10%) Query: 138 FVVKNVDNSDSYLRQPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD 195 F V++ + DS + +EL H D E ++ + E GG ++++ Sbjct: 194 FDVRSKVDPDSN---AYTALELTPHVDLPTREYQPGLQILHCL-ENTAPGGEAVMVDGLR 249 Query: 196 WEHL----DNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVD--QQGRPVMRYIDQFVQ 249 + L F SK PV +D +R Sbjct: 250 IARFLVEHEPETYRILTTERFTFQNR-SKTSDYRWLSPVIVLDPTTNAPVEVRIAGFLRG 308 Query: 250 PKDFEEGV------WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDL 303 P D L + + + G +L +N +HGR F P Sbjct: 309 PTDLPADRTGPAYRALRRIFALSRDPRFQIRYGYRPGDLVLFDNRRLMHGRAAFDPSAGS 368 Query: 304 R 304 R Sbjct: 369 R 369 >UniRef50_C0ZLG3 Putative uncharacterized protein n=2 Tax=Rhodococcus erythropolis RepID=C0ZLG3_RHOE4 Length = 282 Score = 64.4 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 69/207 (33%), Gaps = 18/207 (8%) Query: 125 SNFDAMSGQYYARFVVKNVDNSDSYLRQ-----PHRVMELHNDG-TYVEEITDYVLMMKI 178 F+A + A V + D R+ M HNDG + + DY+ + Sbjct: 60 RQFEASARSQDAEAAVVDTQPVDERGRKRSFGISGERMTAHNDGFAFGDYAPDYLFLWCK 119 Query: 179 DEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAP---PSKNVSKDVFHPVFDVDQ 235 GG+S L+ L + ++ N + F P+ Sbjct: 120 RPAYPSGGDSFLIDAVKLTRLLAFDPATAELAEFCWSTDIDHSEPNFPQSTFAPIARRVP 179 Query: 236 QGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILS-----VPVPVGKFLLINNLFW 290 GR +RY ++ P E L + + G+ + I+N Sbjct: 180 SGRNQVRY-HPYLAPIAGESEEAQWPLVKQWSEAVIHVRDSGPMFRAEAGEMICIDNYRV 238 Query: 291 LHGRDRFTPHPDLRRELMRQRGYFAYA 317 LHGRD +T D REL G+ + A Sbjct: 239 LHGRDGYT---DPGRELYSIWGWSSDA 262 >UniRef50_Q0CAI8 Predicted protein n=3 Tax=Aspergillus RepID=Q0CAI8_ASPTN Length = 401 Score = 64.0 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 81/265 (30%), Gaps = 31/265 (11%) Query: 60 KSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVA 119 K L+ + +R G L+ + VD + D V + ++ Sbjct: 89 KPLGSLDPTTFPLPSLHPILRDISNNI--HRGTGFSLVRGIPVDQYSRED-NVIIYVGLS 145 Query: 120 HLIG--RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDG--TYVEEITDYVLM 175 IG R D + A ++ ++ + S + + DG + + D V + Sbjct: 146 SHIGCIRGRQDHQNNSVPADVMIAHIMDFSSSADSRSVTLPAYTDGEVIFHTDTGDIVSL 205 Query: 176 MKIDEQNMQGGNSLLLH----LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVF 231 ++E GG SLL ++ LA + P SK + P+ Sbjct: 206 FVLEEPAH-GGESLLASGWRVYNELAKTRPDLVEALASD---WPIPSSKKEGLIIRRPLI 261 Query: 232 -----DVDQQGRPVMRYIDQ---------FVQPKDFEEGVWLSELSDAIETSKGILSVPV 277 ++ + + ++ L L + +++ + Sbjct: 262 FPLNPSATAPDGILIHFSRRSFSGFGAWSHSNALSVKQAEALDAL--HFLAERFHVAMEL 319 Query: 278 PVGKFLLINNLFWLHGRDRFTPHPD 302 G +NNL LH R + P Sbjct: 320 RKGDMQFLNNLSILHARRSYPDIPG 344 >UniRef50_C0NV43 Trimethyllysine dioxygenase n=21 Tax=Leotiomyceta RepID=C0NV43_AJECG Length = 929 Score = 64.0 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 61/186 (32%), Gaps = 19/186 (10%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 +Y F D S + + H D TY + + + N GG SLL Sbjct: 716 THYGGFWDFTSDLSLKDMAYTTEGLGGHTDTTYFTDPAGLQMFHMLSHTNGSGGESLL-- 773 Query: 193 LDDWEHLDNYFR-HPLARRPMR-FAAPPSKNVSKDV------FHPVFDVD--QQGRPVMR 242 +D +E + P A ++ F + ++ V PVF+ +R Sbjct: 774 VDGFEAAKTLYNEDPEAYEVLKEFGVDGHASGNEHVCIQPACPFPVFNHHPITGELYQVR 833 Query: 243 YIDQFVQPKDFEEG-------VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRD 295 + ++ +P + ++ I + G ++ +N LHGR Sbjct: 834 WNNEDRRPMVNGSPEEISKWYAAARKWNEIITRKDMEHWFQLDPGSPMIFDNWRMLHGRA 893 Query: 296 RFTPHP 301 F+ Sbjct: 894 PFSGKR 899 >UniRef50_B1J339 Putative uncharacterized protein n=1 Tax=Pseudomonas putida W619 RepID=B1J339_PSEPW Length = 349 Score = 64.0 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 65/210 (30%), Gaps = 19/210 (9%) Query: 132 GQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVL-MMKIDEQNMQGG---- 186 G+ + + +S H D + ++ + + E G Sbjct: 137 GRLFRHVIPARTSSSQKSSHGSRLRFSYHVDNPDLPLSSEPLGTVSCCPEYLSFFGMRCD 196 Query: 187 ---NSLLLHLDDW-EHLDNYFRHPLARRPMRFAAPPSKNVSKD--VFHPVFDVDQQGRPV 240 ++ L++LD L L P S ++ PV G + Sbjct: 197 PRISTTLVNLDAVIASLPLATVRELCEAQFEIRRPDSFTGNRRCTAELPVLVRGHSGEWL 256 Query: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 R+ + + L+ L +ET + + G F++ N LH RD F P Sbjct: 257 CRFDSENTHGLNLRAEQALATLRGLLETRRFDEPHLLLPGDFMIFKNQRVLHARDGFEPQ 316 Query: 301 PDLRRELMRQ--------RGYFAYASNHYQ 322 D + + R FA A + Y+ Sbjct: 317 NDGADRWLLRVFAVNDLNRVRFARADHLYE 346 >UniRef50_UPI000180CBE5 PREDICTED: similar to gamma-butyrobetaine dioxygenase n=1 Tax=Ciona intestinalis RepID=UPI000180CBE5 Length = 402 Score = 64.0 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 62/185 (33%), Gaps = 15/185 (8%) Query: 135 YARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNM-QGGNSLLLHL 193 Y + V + +++ + + H D T E+ + + GG + ++ L Sbjct: 180 YGKLSVVKISSTEGNAAYSNIKLPFHQDQTMYEKPPGIEFLHCMKFDECITGGETKIVDL 239 Query: 194 DDWEHL----DNYFRHPLARRPMRFAAPPSKNVSK---DVFHPVFDVDQQGRPV-MRYID 245 + ++ L P+ ++ K +K V +D G+ V + + Sbjct: 240 YEVINILKRESPADFQTLVEVPVIYSTIDYKRTNKVYLHHQKHVVVLDYFGKVVAINWHP 299 Query: 246 QFVQPKDFEE---GVWLSE---LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTP 299 + E + L I + + + G ++ NN LH R++F Sbjct: 300 ALLSTLQVREKDVERYYEAHVKLLQIIRRDELHFTHRMQPGDLVVFNNRRVLHSRNKFEA 359 Query: 300 HPDLR 304 + R Sbjct: 360 NGGDR 364 >UniRef50_D1ZLJ6 Whole genome shotgun sequence assembly, scaffold_56 n=2 Tax=Sordariaceae RepID=D1ZLJ6_SORMA Length = 402 Score = 64.0 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 75/259 (28%), Gaps = 29/259 (11%) Query: 68 AKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNF 127 + L + + G +I +G+ D Q + + L A +++ Sbjct: 77 TSFPLPILRPVLDEARRQVHEGQ--GFAIIRGIGLSDSAQDNTNMFLGLA-SYIGDVRGI 133 Query: 128 DAMSGQYYARFVVKNVDNSDSYLRQ---PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ 184 G LR + HND D + + I + Sbjct: 134 QDKQGSMLTHVTASKTWTVPPELRHGIHTSTGLAWHND-----MGADTIALH-IRSLAEE 187 Query: 185 GGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYI 244 GGN+ + W + +A S N K + P+ V + Sbjct: 188 GGNTFV--ASSWTIYKELVTSFPQVLKLLWAIRSSGNPPKHIVAPLVQVHNNRVYLSVDP 245 Query: 245 DQF-VQPKDFEEGV----------WLSELSDAIETSKGILSVPV--PVGKFLLINNLFWL 291 + + P + G+ L L + + + + G L INN + Sbjct: 246 GRLGLHPVTAKAGLGSSVPSLTTSHLQAL-ETLSELATKHRLMLDTKPGDMLFINNWALI 304 Query: 292 HGRDRFTPHPDLR-RELMR 309 H RD + D R L+R Sbjct: 305 HARDSYKDPKDGPGRHLVR 323 >UniRef50_Q4JN27 Putative uncharacterized protein n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN27_9BACT Length = 279 Score = 63.6 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 71/179 (39%), Gaps = 11/179 (6%) Query: 142 NVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDN 201 + SY+ ++ + H DG Y +E + + ++ +GG + LL + Sbjct: 102 RETSKVSYIPYTNKALNWHTDG-YYDERSIFSWLLHCVSSASEGGENYLLDHELVLREYI 160 Query: 202 YFRHP--LARRPMRFAAPPSKNVSK-DVFHPVFDVD-QQGRPVMRY-IDQFVQPKDFEEG 256 + P SK+ S+ ++ +F + MR+ + + + Sbjct: 161 LRNDDINILMDDGALIIPESKDTSRPEISTYIFSFSNVYKKLHMRFSMRKDNIASNTRAS 220 Query: 257 VWLSELSDAIET--SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGY 313 ++ L I + ++ L+ + + +L NN LHGR+ F + R+L+R R Y Sbjct: 221 DAITRLKKIIVSHCAQYSLTYKLSKNQGILTNN--ILHGRNSFKDDK-VNRKLLRIRSY 276 >UniRef50_Q0RGQ6 Putative clavaminate synthase-like (Oxidase) n=2 Tax=Actinomycetales RepID=Q0RGQ6_FRAAA Length = 317 Score = 63.6 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 52/166 (31%), Gaps = 15/166 (9%) Query: 157 MELHNDGTYVEEITD----------YVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP 206 H+D + Y+ I Q L D LD R Sbjct: 149 FFWHSDNPQQQFGPIGSDPRLYTPPYLTFFAIRNQEQVPTEVAALD-DVVAGLDEKTRLG 207 Query: 207 LARRPMRFAAPPSKNVSKD---VFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELS 263 L P S + + PV +V GR +RY + L+ S Sbjct: 208 LMAAEFEVGVPYSNDRDATGPLMNTPVLEVGPDGRYRVRYDRGTTVGRTDAARETLTRWS 267 Query: 264 DAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 A+ ++ + G F++ +N LH R FTP PD +R Sbjct: 268 AALGAMPS-VAFVLGTGDFMIFDNHRVLHRRKSFTPAPDATARWLR 312 >UniRef50_C7QKA8 Oxygenase (Secreted protein) n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QKA8_CATAD Length = 341 Score = 63.6 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 56/201 (27%), Gaps = 20/201 (9%) Query: 122 IGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHND------GTYVEEITDYVLM 175 + + K N S + + HND G Y D++++ Sbjct: 124 LTEKQGQLVHDVVPVPGGAKTQTNQGSTV-----FLNFHNDIVHDSIGRYDVSNPDFLVL 178 Query: 176 MKIDEQNMQGGNSLLLHL-DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD-- 232 + + + D LD+ L R AP S V Sbjct: 179 SCLRADHEGIAATYYADARDIVAALDDQALEILRSPLFRLNAPGSYVRDVAGGGEVLSDP 238 Query: 233 ----VDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNL 288 Q P + V+ L L A V + G+ LLINN Sbjct: 239 VPILSGPQEFPEIAVSANGVRAMTSGAQTVLDRLQAACREVS--HQVFLRPGQALLINNR 296 Query: 289 FWLHGRDRFTPHPDLRRELMR 309 LH R +FT D ++ Sbjct: 297 KGLHARSQFTARYDGEDRWLQ 317 >UniRef50_B2AFB2 Predicted CDS Pa_0_1600 n=1 Tax=Podospora anserina RepID=B2AFB2_PODAN Length = 555 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 68/211 (32%), Gaps = 19/211 (9%) Query: 110 EMVKLATAVAHLIGRSNF-DAMSGQYYA-RFVVKNVDNSDSYLRQPHRVMELHNDGTYVE 167 + + +I +N + +Y + VK+ ++ + + + LH D Y + Sbjct: 295 FLKNVPQEEHSVINIANRIGPLQYTFYGWTWDVKSKPRAE-NVAYTNVFLGLHQDLMYHD 353 Query: 168 EITDYVLMMKIDEQNMQGGNSLLLHL--DDWEHL--DNYFRHPLARRPMRFAAPPSKNVS 223 I L+ + + +GG SL + L D L + + F Sbjct: 354 PIPRLQLLHCL-ANSCEGGESLFSDGVHAALQLLNTDPEAYDILTKTDVHFGY-DKGGHH 411 Query: 224 KDVFHPVFDVDQQ-GRPVM-RYIDQFVQPKDFEEGV--------WLSELSDAIETSKGIL 273 + D G P + + F ++ + +E + + Sbjct: 412 YYATRKTIEADPNTGAPFITHWAPPFQTSFPAKDKNNRLRRWRDAAEKFQRLLEKEENMY 471 Query: 274 SVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 V + G+ ++ +N LHGR F R Sbjct: 472 EVKMKPGECVIFDNSRVLHGRREFETSTGSR 502 >UniRef50_A9VA54 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VA54_MONBE Length = 473 Score = 63.2 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 65/196 (33%), Gaps = 19/196 (9%) Query: 130 MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKID-EQNMQGGNS 188 M Y A F V +N + ++LH D Y E + D + ++GG S Sbjct: 241 MPTIYGATFDVVATENP-INIAYSSVALDLHQDLVYYESTPGLQFLHCADFHETIEGGES 299 Query: 189 LLLH----LDDWEHLDNYFRHPLARRPMRFAAP-PSKNVSKD--VFHPVFDVDQQG---- 237 LL+ D H L + PM F + + P+ D + Sbjct: 300 LLMDGLAVAARLRDQDPDAYHLLQQVPMTFEKVHYDRALPAHLVAQRPIITADPESGEPL 359 Query: 238 ----RPVMRYIDQFVQP-KDFEEGVWLSELSDAI-ETSKGILSVPVPVGKFLLINNLFWL 291 P F++P + L I E++ ++ + + L NN L Sbjct: 360 ELVWAPPFEGALPFLEPGLCRRYFDAYAALGMVIRESTDLLIEHRLLPNETLCFNNRRML 419 Query: 292 HGRDRFTPHPDLRREL 307 HGR F H R L Sbjct: 420 HGRRSFVLHEGHIRFL 435 >UniRef50_B2WCC9 Gamma-butyrobetaine dioxygenase n=2 Tax=Pleosporineae RepID=B2WCC9_PYRTR Length = 497 Score = 63.2 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 61/247 (24%), Gaps = 40/247 (16%) Query: 82 LLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVK 141 L G L IN V + +V LA+ + L +Y R Sbjct: 229 LFDALTHLNRYGLLFINNVP----DSEESVVSLASRIGTL---------KDTFYGRTWDV 275 Query: 142 NVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDN 201 + + LH D Y L+ + GG S D + Sbjct: 276 RSKPKAENIAYTPDYLGLHMDLLYTANPPHLQLLHSLRA-RTPGGESFFS--DAFNAAHQ 332 Query: 202 YFRHPLARRPMRFAAPPSK-----NVSKDVFHPVFDVDQQGR--------PVMRYIDQFV 248 R P + N + PV + R + + F Sbjct: 333 LRRQSEGYFRTLCTFPVTYHYHHPNQHYHMTRPVIETFPSLRYEATDCAIRRINWSPPFQ 392 Query: 249 QPKDFEEG-----------VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 P + G + + + + G+ ++ +N LH R F Sbjct: 393 APFEARIGSEHSKSLRTFIAASHAYEKLLSKEENLFEYRLNEGECVIFDNRRVLHARRAF 452 Query: 298 TPHPDLR 304 R Sbjct: 453 DATKGER 459 >UniRef50_UPI000023E495 hypothetical protein FG06105.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023E495 Length = 369 Score = 63.2 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 66/210 (31%), Gaps = 35/210 (16%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQG------- 185 +Y F D + + + + H D TY E ++ + G Sbjct: 124 THYGGFYDFVPDLALADTAYTNIALAPHTDTTYFSEPAGLQAFHCLEHEAPPGHNPDEPL 183 Query: 186 -GNSLLLHLDDWEHL----DNYFRHPLARRPMRFAAPPSKNVS--KDVFHPVFDVDQQGR 238 G SLL+ L L + + A +K ++ D +PV +VD + R Sbjct: 184 GGESLLVDGLQAARLLKRETPNLFDTLRDIRVPWHASGNKGIAIAPDRTYPVIEVDNETR 243 Query: 239 P--VMRYIDQ-------FVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLF 289 +R+ + F P + +++ I + + G ++ NN Sbjct: 244 RINRIRWNNDDRGVVHLFDSPPWYVAARQWNDI---INRERSQYRFKLTPGTIVIFNNWR 300 Query: 290 WLHGRDRFTPHPDLRRELMRQRGYFAYASN 319 +HGR F R AY Sbjct: 301 VMHGRTAFKGTR---------RICGAYIPR 321 >UniRef50_C5AK38 Putative uncharacterized protein n=8 Tax=Proteobacteria RepID=C5AK38_BURGB Length = 339 Score = 62.9 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 61/189 (32%), Gaps = 27/189 (14%) Query: 139 VVKNVDNSDSYLRQPHRVME-----LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 VK+ + DS L M H+D D ++ E GG + Sbjct: 112 EVKDAGHRDSTLAPARGHMTNQELAFHSD------RADVTVLACW-EPATSGGEFKVCSS 164 Query: 194 DDWEHL----DNYFRHPLARRPMRFAAPPSKNVSKDVF--HPVFDVDQQGRPVMRYIDQF 247 L D +R L R P+ + P+ ++ V+RYI +F Sbjct: 165 ARLIELIERHDPAWRAWLTR-PIPHDLRDEGGSPDQGYCLLPILTETRE-TFVLRYIRKF 222 Query: 248 VQ-------PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPH 300 + E L + +E + P G +L NN LH R+ F Sbjct: 223 NESVKRHGIELPDEVRAMLDGIDALLEDDSISVEFPFAKGTLVLTNNHTTLHSRNAFVDV 282 Query: 301 PDLRRELMR 309 +R L+R Sbjct: 283 APQQRCLLR 291 >UniRef50_A0Z404 Gamma-butyrobetaine,2-oxoglutarate dioxygenase n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z404_9GAMM Length = 416 Score = 62.9 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 81/291 (27%), Gaps = 37/291 (12%) Query: 32 LLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRA 91 L + + + ++ + W Q L A D + L+ Sbjct: 118 LRHIADGQHLPESWIPKPEAWDAQILPQPPRRSTSGALESDRELCEVMNDLI-------R 170 Query: 92 EGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR 151 G ++ + + + KLA + D+ G + ++ Sbjct: 171 FGVCVLESAP----SEEGFLNKLAARIG-----PVRDSNFGALWDVVADISLAGDAKTNT 221 Query: 152 QPHRVMEL--HNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH----LDDWEHLDNYFRH 205 + + L H D E Y + + + GG S L + + + Sbjct: 222 TANTGLRLGPHTDLPTREIPPGYQFLHCLINE-ADGGESTLTDGAALVQELKMHHPADYE 280 Query: 206 PLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQF---------VQPKDFEEG 256 L+ R F + P+ D G + + F + E Sbjct: 281 LLSTRRWVFFNRGP-GIDHRWSAPII--DTSGAHALPTLRAFYPVRAFPDMPECDVAEAY 337 Query: 257 VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRREL 307 L + + L+ + G + +N +HGR F+ +R L Sbjct: 338 EALRRFHKLADDPRFELTFRLGAGDIMCFDNRRVMHGRKGFSGS--GKRHL 386 >UniRef50_A6F048 Putative uncharacterized protein n=1 Tax=Marinobacter algicola DG893 RepID=A6F048_9ALTE Length = 352 Score = 62.9 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 72/261 (27%), Gaps = 24/261 (9%) Query: 68 AKILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNF 127 A+ +L + + + R G L+ + V + V AH IG++ Sbjct: 72 AEFPLPTLGPRLAEVRNEVMEGR--GFALVRGLPVAGRSRFQNAVAFWGIGAH-IGQARS 128 Query: 128 DAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGN 187 G D + M+ H D +L + GG Sbjct: 129 QNKHGHLLGHVKNLGGDYAKVRGYMTQAAMDFHCDSAD-------ILSLCCLSPAKSGGE 181 Query: 188 ----SLLLHLDDWEHLDNYFRHPLARRPM--RFAAPPSKNVSKDVFHPVFDVDQQGRPVM 241 S + ++ L R R P P+F V G Sbjct: 182 HRICSSVTLYNEMLKARPDLVEELTARFYLARKGDIPPGETEPWERLPIFSV-TDGYFAG 240 Query: 242 RYIDQFVQPKDFEEGV--WLSELSDAIE-----TSKGILSVPVPVGKFLLINNLFWLHGR 294 R I + GV + +AI K + + G + N LH R Sbjct: 241 RGISAHMAKAQKIPGVPKYTELQEEAIRMYKETAPKIAIDIDFEQGDISYVCNHTMLHSR 300 Query: 295 DRFTPHPDLRRELMRQRGYFA 315 F + + R+ R + + Sbjct: 301 TDFEDYDEPERKRHLLRLWLS 321 >UniRef50_A6WF32 Clavaminate synthase n=2 Tax=Actinobacteria (class) RepID=A6WF32_KINRD Length = 331 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 69/220 (31%), Gaps = 21/220 (9%) Query: 109 DEMVKLATAVAHLIGRSNFDAM--SGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYV 166 + + V+H +G A G V + + +E H + + Sbjct: 101 TLLARQQAIVSHALGHMVGYAAEGHGHLLQDMVPNARLAATQQSQGSRVELEAHTEQCFS 160 Query: 167 EEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNV---S 223 + DYV++ + + LD H+D L R S + Sbjct: 161 DLRPDYVVLGCLR-GDADAATYAFRALDLLAHVDPTDVMELFRPLWTTLVDESFADFLDT 219 Query: 224 KDVFHP--VFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGK 281 ++V P + D P M + L + + + +V + G Sbjct: 220 REVRGPFSILSGDVD-DPTMLVDQDLMHGITKHAQALLERVLEIYVAHR--HAVVLQPGD 276 Query: 282 FLLINNLFWLHGRDRFTPHPDLR----------RELMRQR 311 LL++NL +HGR F P D R+L R R Sbjct: 277 VLLLDNLRAMHGRSPFAPRFDGTDRFISRGFVVRDLRRSR 316 >UniRef50_Q2J9C7 Pyoverdine biosynthesis protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J9C7_FRASC Length = 270 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 53/174 (30%), Gaps = 18/174 (10%) Query: 153 PHRVMELHNDGTYVEEITDYVLMMKIDEQNM-QGGNSLLLHLDDWEHLDNY-FRHPLARR 210 + H DG + +++ + + GG + R A Sbjct: 96 TTGPVPFHWDGAFARLTPRFIVFRCLRAPDPDSGGETTFCDTARIISAAPAHLRDTWAGV 155 Query: 211 PMRFAAPPSKNVSKDVFHP-VFDVDQQGRPVMRYID-----QFVQPK--------DFEEG 256 + + K+ + V G P +R+ + +++ P + Sbjct: 156 RIDYQTEKIKHYGGHIVQDLVVPHAVTGVPTLRFAEALDPAEYLNPLFLDIHGVSRNDRE 215 Query: 257 VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQ 310 +L +LS + + G ++ +N LHGR ++ R L R Sbjct: 216 DFLDDLSARLYDPSVSYAHAWRSGDVVIADNHALLHGRRSYSL--ASSRRLQRI 267 >UniRef50_A9FPD4 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FPD4_SORC5 Length = 340 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 51/155 (32%), Gaps = 14/155 (9%) Query: 158 ELHNDGT-YVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP--LARRPMRF 214 LH D + +M + GG LL D W L R L R Sbjct: 81 PLHTDSQCFAGAPPAVQIMACVRPAERGGGCLLL---DGWPLLSAIERADPELFRALFTV 137 Query: 215 AAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGIL- 273 D F P + + G + + V P D L+ +E S+ + Sbjct: 138 YRRIPFVFG-DFFGPTVSL-RGGALALTHA-PVVPPGDGIAAR----LARFVEASRREVI 190 Query: 274 SVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELM 308 + + L+++N LHGR F + R L+ Sbjct: 191 ELSLGPADILVVDNRRMLHGRRAFEGAREFVRLLV 225 >UniRef50_A9M275 Tou3 n=10 Tax=Neisseria RepID=A9M275_NEIM0 Length = 259 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 85/242 (35%), Gaps = 26/242 (10%) Query: 89 NRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNF--DAMSGQYYA---RFVVKNV 143 ++ G ++ + ++ + + KL ++ L +N D ++ R + Sbjct: 22 VQSIGYCIVRGLNLNHLDDSRRNKKLFDFLSQLGMLTNHKDDGFKSIFWDIKYRGDDYVI 81 Query: 144 DNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYF 203 +N D + LH+D ++ E Y++M + N G + L D L Sbjct: 82 NN-DITFSEDVGECPLHSDSSFSENPESYLVMYVVKSANDGGNSLFLSSSDIVNQLSKTE 140 Query: 204 R-----HPLARRPMRFAAPPSKNVSKDVFHP-VFDVDQQGRPVMRYIDQFVQ-------- 249 L F P S + + V + V+ Q ++R+ + Sbjct: 141 TGKKHLKTLTGNLYPFKTPTSFDKKQGVRWGNILSVNTQ---MIRFRRDCIYKGIEENRN 197 Query: 250 PKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 E + L L + I+ + I ++I+N+ LH R +T D R +R Sbjct: 198 KVSKEMVLALDYLVNVIKNASDIQEFSAQDDDLIIIDNVNGLHARTDYT---DKNRHYIR 254 Query: 310 QR 311 R Sbjct: 255 AR 256 >UniRef50_D1KCD0 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KCD0_9GAMM Length = 287 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 73/206 (35%), Gaps = 20/206 (9%) Query: 123 GRSNFDA---MSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTY--VEEITDYVLMMK 177 G +FD + + A N ++ ++ + H DG Y +E+ + Sbjct: 79 GLKDFDRHLYVQDRGLAHITQSTNKNQSEFIPYTNKAIGWHTDGYYNAIEQRIRAFSLFC 138 Query: 178 IDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAA--PPSKNVSKDVFH-----PV 230 + +GG + + L +A+ + A P V+ V PV Sbjct: 139 VRP-ASEGGTNEWIDPQMVYLLLREDNPDVAKALIHPNAMTIPEHKVNGKVRRTTSTGPV 197 Query: 231 FDVDQ-QGRPVMRYIDQ---FVQPKDFEEGVWLSELSDAI-ETSKGILSVPVPVGKFLLI 285 F +D+ G MRY + E + L + + + + + G+ +L Sbjct: 198 FFIDEASGELYMRYTQRKKNIEFLDSIEVNQAIEHLDELLSKHTDYHFKHLMHSGQGMLC 257 Query: 286 NNLFWLHGRDRFTPHPDLRRELMRQR 311 NN LH R F P+ R L+R R Sbjct: 258 NN--VLHKRSDFNDDPNNPRLLLRGR 281 >UniRef50_UPI000186E3FD gamma-butyrobetaine dioxygenase, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E3FD Length = 371 Score = 62.1 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 42/268 (15%), Positives = 88/268 (32%), Gaps = 37/268 (13%) Query: 26 SAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLK 85 + + E+ +Q ++ +++ + FR I+ + LLK Sbjct: 106 HSFHKKHQIERLNEEYVQQISWDSNDFN----KFQPEISFRFDDII------KYDDTLLK 155 Query: 86 -TLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVD 144 G I+ V + + ++ KLA V+ + YY D Sbjct: 156 WLETLAKFGVGKISGVPL----KNGQLQKLAERVSFI---------RKTYYGEEFFIKGD 202 Query: 145 NSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH-LDDWEHLDNYF 203 N + + +++H D Y VL+ I + + +GG ++L+ L+ + L Sbjct: 203 NDANNVAYKKGPLQMHTDLPYYHYAPGVVLLHCILQ-STEGGENILVDGLNLVKQLPKSS 261 Query: 204 RHPLARRPMRFAAPPSKNVSK---DVFHPVFDVDQQGRP-VMRYIDQFVQP-KDFEEGVW 258 L + + ++ +N + PV VD G + + + V Sbjct: 262 YEVLTKTIVDWSDVGCENKYEFCTKNRAPVICVDDLGNVNRINWSQPQRDSVFNVSPEVA 321 Query: 259 L------SELSDAIETSKGILSVPVPVG 280 L + + I +K + G Sbjct: 322 LEWYKGYKKFMEMINDTKNAVIFKNEEG 349 >UniRef50_A5F6V5 PvcB protein n=35 Tax=Proteobacteria RepID=A5F6V5_VIBC3 Length = 287 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 58/189 (30%), Gaps = 21/189 (11%) Query: 140 VKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ-GGNSLLLHLDDWEH 198 V+ D D + M +H DG Y ++ +Y + + GG + H Sbjct: 83 VQKEDPGDHIFDSSY--MPMHWDGMYRPQVPEYQIFQCVKAPLPGHGGRTTFSHTMLALQ 140 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQ-GRPVMRYIDQFVQP------- 250 L ++ + P+ V+RY + + Sbjct: 141 HAPQPDLELWQQVTGHYQRKMEFYHSKTVSPIVMQHPYRDYQVIRYNEPHFEENGDLLNP 200 Query: 251 -------KDFEEGVWLSE-LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 E+ + + L A+ + + G ++ +N LHGR+ F + Sbjct: 201 PDVSLSGITPEQAIEFHKSLRRALYDPRNFYAHEWQTGDIVITDNFSLLHGREAF--NSH 258 Query: 303 LRRELMRQR 311 R + R + Sbjct: 259 TPRHIRRVQ 267 >UniRef50_C4WXU3 ACYPI008997 protein n=2 Tax=Acyrthosiphon pisum RepID=C4WXU3_ACYPI Length = 356 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 60/163 (36%), Gaps = 16/163 (9%) Query: 153 PHRVMEL--HNDGTYVEEITDYVLMMKIDEQ-NMQGGNSLLLHLDDWEHLDNYFRH-PLA 208 + L HND TY E T + ++ N +GG S + +D ++ + + PL Sbjct: 173 AYLNGPLIVHNDSTYFNESTGLQVFHMLERDINCKGGLSTI--VDGFKVAEILKKENPLH 230 Query: 209 RRPMRFAAPPSK----NVSKDVFHPVFDVDQQGRPV--MRYI--DQFVQPKDFEEGVW-- 258 + + S+ PV +D + V +R+ D+ P + Sbjct: 231 FKNLTEIEIESEYIEPGFHYKCTGPVIKIDPTSKEVYQIRFNIYDRSALPPSRVNEFYES 290 Query: 259 LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + + +E + + G ++ NN LHGR FT Sbjct: 291 YAHFIEILEREEMYWKHCLQPGTVMIYNNWRVLHGRTSFTGKR 333 >UniRef50_A6RF74 Predicted protein n=1 Tax=Ajellomyces capsulatus NAm1 RepID=A6RF74_AJECN Length = 485 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 57/180 (31%), Gaps = 33/180 (18%) Query: 148 SYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPL 207 + ++ ++ H D Y+++ Y L+ + + GG SL D ++ R+ Sbjct: 221 KNVAYTNKHLDFHMDLLYMKDPPGYQLLHCLR-NSFSGGESLFS--DTFQAAVRLLRNDP 277 Query: 208 ARRPMRFAAPPSKNV-----SKDVFHPVFDVD---------QQGRPV-----MRYIDQFV 248 + P HP +++ + PV + Y F Sbjct: 278 ILFDILCKTPTRFEYKNNNQHYQYSHPTIEIEGGEEFLKNPPKKNPVPYVNYVNYSPPFQ 337 Query: 249 QPKDFEE-----------GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 P + + + + + I V + G+ ++ N +H R+ F Sbjct: 338 APSYLTKHLVDGRDIKLYVRAMKAFAAELGKQENIFQVKLEPGQCVIFQNRRVVHARNAF 397 >UniRef50_D2ATF7 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ATF7_STRRD Length = 323 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 70/209 (33%), Gaps = 21/209 (10%) Query: 111 MVKLATAVAHLIG---RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVE 167 M+ LA+A+ G + + + +R + S + HN+ + Sbjct: 94 MLLLASAMGRPFGWEGQQDGRLVHDIVPSRGHEHEQTGASSTVTLAA-----HNEDAFHH 148 Query: 168 EITDYVLMMKID-EQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDV 226 + +L+ + + + + H LD+ L+ + + + ++D Sbjct: 149 RRANLMLLGCLRNPDRVATSAASVRHA----GLDDADVEVLSAPVLPILPDDAYDDARDS 204 Query: 227 -----FHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIET-SKGILSVPVPVG 280 P + G +R+ D P + + + + + V + G Sbjct: 205 SLEPPRVPTLWHGEDGL-CLRF-DPAYTPLEQAGADYRAAYGRLCRELDRVKVGVRLEPG 262 Query: 281 KFLLINNLFWLHGRDRFTPHPDLRRELMR 309 L+I+N +HGR+ F D ++ Sbjct: 263 DVLVIDNDAVVHGREPFRARYDGTDRWLK 291 >UniRef50_A1UJG6 Taurine catabolism dioxygenase TauD/TfdA n=17 Tax=Actinomycetales RepID=A1UJG6_MYCSK Length = 281 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 45/245 (18%), Positives = 85/245 (34%), Gaps = 30/245 (12%) Query: 86 TLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDN 145 G L+ + +D Q + +L G ++G Y K+ ++ Sbjct: 36 LEALEDNGVLVFPGLHLDPQAQVEFCRRLGEVDHSSDG---HHPVAGIYPVTLD-KSKNS 91 Query: 146 SDSYLRQPHRVMELHNDG--TYVEEITDYVLMMKIDEQNMQGGNSLLLHLDD-WEHLDNY 202 S +YLR + H DG +E ++ GG + ++HLD+ Sbjct: 92 SAAYLRAT---FDWHIDGCTPTGDEYPQMATVLSARRVAESGGETEFASSYGAYDHLDDD 148 Query: 203 FRHPLARRPMRFAAPPSKNVSK---------------DVFHPVFDVDQQGRP--VMRYID 245 + LA + + S+ HP+ + GR V+ Sbjct: 149 EKQRLASLRVVHSLEASQRRVTPDPSPELLARWRSRPTHEHPLVWTHRSGRKSLVLGASA 208 Query: 246 QFVQPKDFEEGVWL-SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR 304 ++ D +EG L ++L D + + S VG ++ +N LH + + Sbjct: 209 DYIVGMDLDEGRALLADLLDRATQPELVYSHTWSVGDTVIWDNRGVLHRAAPYPENS--P 266 Query: 305 RELMR 309 RE++R Sbjct: 267 REMLR 271 >UniRef50_A6DBS5 Putative clavaminate synthase-like (Oxidase) n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DBS5_9PROT Length = 186 Score = 61.7 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 64/187 (34%), Gaps = 19/187 (10%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGT-YVEEI---------TDYVLMMKIDEQNM 183 Y VK + S+ + H D Y+ E + + ++ + M Sbjct: 2 YRNLTPVKKQNMVGSHGTK---NFYFHVDNPIYLLEPEYKECSLNSPEILAILGLRSNPM 58 Query: 184 QGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH-PVF-DVDQQGRPVM 241 SLL + ++LD L + P S + ++ + P+ + Sbjct: 59 AI-TSLLAIEEVIKNLDKTTIDNLKKPIYTVRTPDSFDKKHEIKNIPILYSYGTEYFTRF 117 Query: 242 RYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 Y + + E LS DAI SK I + + G ++ N LH R+ FT + Sbjct: 118 DYHNTYSC--KEEGIKALSNFKDAISMSKKI-EICIDKGTMIIFKNQKILHARNEFTTNY 174 Query: 302 DLRRELM 308 D + Sbjct: 175 DGYDRWL 181 >UniRef50_C5S2J6 Putative uncharacterized protein n=1 Tax=Actinobacillus minor NM305 RepID=C5S2J6_9PAST Length = 176 Score = 61.7 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 55/156 (35%), Gaps = 7/156 (4%) Query: 168 EITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKN-VSKDV 226 I D + ++ + +Q S++L L + L P S + S V Sbjct: 13 YIPDSLTLLCLRQQKGV-ATSIVL----LADLTEEDKALLEEPAFSIKRPASFSGASISV 67 Query: 227 FHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLIN 286 P+ G + R+ V+ + L + + +S+ + G+ + + Sbjct: 68 NVPLIMKCSDGSYISRFDYHNVETSSPKHQPVLEKFRQSAIDQNKWISLYLEPGQVVTFD 127 Query: 287 NLFWLHGRDRFTPHPDLRRELM-RQRGYFAYASNHY 321 N LH R+ F + D + R G + ++ Y Sbjct: 128 NQKTLHTRNGFKANFDGNDRWLIRLFGTYEKTTSEY 163 >UniRef50_D2QGX9 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QGX9_9SPHI Length = 262 Score = 61.7 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 81/227 (35%), Gaps = 13/227 (5%) Query: 104 DVKQADEMVKLATAVAHLIGRSNFDAMSGQYYA-RFVVKNVDNS--DSYLRQPHRVMELH 160 D+ D KL+ + I ++ D +G+ R++ D D Y R + +H Sbjct: 37 DLPVHDFYSKLSETIGR-IHAADEDLATGKMTGNRWIDITYDPQIPDRY-RSSNTRQPMH 94 Query: 161 NDGTYVEEI-TDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYF-RHPLARRPMRFAAPP 218 D +YVE + V + GG + L D L M Sbjct: 95 TDDSYVELGGEEAVNFFYCASRAKIGGATTFFDLPDLVECMKLDGEEALLEELMATDVVH 154 Query: 219 SKNVSKDVFHPVFDVDQQGRPV----MRYIDQFVQPKDFE-EGVWLSELSDAIETSKGIL 273 +K ++ V + D D +G + P+ + + L I + IL Sbjct: 155 AKGGARKV-RKIIDKDGEGYLANWNYFCLSREENTPEVLDLCERFHQFLESRIMNAGVIL 213 Query: 274 SVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASNH 320 V + G+ + ++ LHGR+ F +R L++ + A++ Sbjct: 214 PVQLQKGEAVFFHDDRVLHGRNAFFAEYPGQRSLIKGKIIITPAADA 260 >UniRef50_C5FYM2 Mitochondrial protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FYM2_NANOT Length = 507 Score = 61.7 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 57/200 (28%), Gaps = 20/200 (10%) Query: 133 QYYAR-FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLL 191 +Y + V+N+ N + + + H D Y+ + + + + GG SL Sbjct: 295 TFYGSTWDVRNIPNP-KNVAYTNVDLGFHMDLLYLVQPPGLQFLHCMKNELP-GGESLFA 352 Query: 192 -HLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDV-----------FHPVFDVDQQGRP 239 + L R + N + P D P Sbjct: 353 DSFHAAKILRKTSREQFNLLTKAWMTWGYNNDDQIYSATRRVINVVSQFPGTIKDINYSP 412 Query: 240 VMR---YIDQFVQPKDF--EEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGR 294 + + + +P L+ +E + I + + G+ ++ N +H R Sbjct: 413 PFQMPFWNSRPGEPGSLCTRVAAALNSFKSILEDKRNIFELKMKPGECVIFQNRRVVHAR 472 Query: 295 DRFTPHPDLRRELMRQRGYF 314 F RG + Sbjct: 473 RAFGDTDSQPGGDRWLRGCY 492 >UniRef50_Q7SAI5 Predicted protein n=4 Tax=Sordariales RepID=Q7SAI5_NEUCR Length = 429 Score = 61.7 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 43/272 (15%), Positives = 79/272 (29%), Gaps = 45/272 (16%) Query: 69 KILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLI---GRS 125 +L+ + R G +++ + D D ++ +++ G Sbjct: 113 NFPLLNLGPKLEEVSDIIHNGR--GFVVLRGLNPDKYSSTDNILLYLGVTSYIAETRGMQ 170 Query: 126 NFDA-MSGQYYARFVVKNVDNSDSYLRQPH--RVMELHNDGTYVEEITDYVLMMKIDEQN 182 +FD M A +V S P+ R H D D + + + Sbjct: 171 DFDGRMILHIQAVRKESDVAQHGSMPNSPYVNRAQPFHTDLC------DVLSLYALGVAK 224 Query: 183 MQGGNSLLLHL----DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVF--DVDQQ 236 GG S L ++ L H LAR F ++ P+ Sbjct: 225 Y-GGESFLASSATIYNEIARLRPDVIHVLARDDWPF---DEFYKNQYHMRPLLYNFSSVD 280 Query: 237 GR------PVMRYIDQ------------FVQPKDFEEGVWLSELSDAIETSKGILSVPVP 278 GR P ++ + V + L + + L++ + Sbjct: 281 GRKEHEHGPGFQFSRRPLTGAHFSPHHPLVPAMSEVQAEALDMV--YFLAKEHALAIQLQ 338 Query: 279 VGKFLLINNLFWLHGRDRFTPH-PDLRRELMR 309 G + NN LH R F +R ++R Sbjct: 339 KGDMQIFNNFAMLHARSSFVDEGEHHKRHMLR 370 >UniRef50_Q0A839 Putative taurine catabolism dioxygenase n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0A839_ALHEH Length = 289 Score = 61.3 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 47/187 (25%), Positives = 65/187 (34%), Gaps = 21/187 (11%) Query: 149 YLRQPHRVMELHNDGTYVE--EITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHP 206 Y+ R + H DG Y +++ + G N LL H W+ L Sbjct: 112 YIPYTDRPINWHTDGYYNPPERRVRGLILHCVRPAREGGVNRLLDHRLVWQALAVSRPDA 171 Query: 207 L--ARRPMRFAAPPSKNVSKDVFH--PVFDVDQQGRPVMRYIDQFVQ----PKDF-EEGV 257 L + P PP + PVF ++ G MRY + P E V Sbjct: 172 LKALQHPRAMTIPPDPRAPEQGDRSGPVFLLEPTGL-HMRYTARTRSIRWAPDGPAEASV 230 Query: 258 WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR-RELMRQRGYFAY 316 L E+ D + S +L + G+ L+ NN LH R F D R L R R Y Sbjct: 231 VLREMLDHL--SPWVLEHRLAPGEGLICNN--VLHTRTGFVDPTDGPGRCLFRAR----Y 282 Query: 317 ASNHYQT 323 T Sbjct: 283 HERVMAT 289 >UniRef50_B8NNC7 Trimethyllysine dioxygenase TmlH, putative n=7 Tax=Eurotiomycetidae RepID=B8NNC7_ASPFN Length = 468 Score = 61.3 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 61/233 (26%), Gaps = 44/233 (18%) Query: 92 EGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLR 151 G + V VD + ++A +Y F D + Sbjct: 213 HGFCFVKGVPVDPESTQTLLERIAFI-------------RHTHYGGFWDFTADLTFKDTA 259 Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRP 211 + H D TY + L + + GG SLL +D + + Sbjct: 260 YTTEFLGAHTDNTYFTDPARLQLFHLLSHTDGDGGASLL--VDGFSAAEVLREEN--PEN 315 Query: 212 MRFAAPPSKNVSKDV----------FHPVFDVDQQGRPVMRYIDQF-------------V 248 + A + P+F + P Y+ Q Sbjct: 316 YQLLAATPQPFHSSGNEDTCIQPAEQMPIFRI----HPQFNYLYQIRWNNYDRAAKKDWS 371 Query: 249 QPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + +D + K + + G L+ +N LHGR FT Sbjct: 372 LEQQNRWYNAARHFNDIVTREKMQIWTQLEPGTALIFDNWRMLHGRSEFTGKR 424 >UniRef50_A0L5C8 Putative uncharacterized protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L5C8_MAGSM Length = 301 Score = 61.3 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 43/184 (23%), Positives = 67/184 (36%), Gaps = 23/184 (12%) Query: 149 YLRQPHRVMELHNDGTY-VEEITDYVLMMKIDEQNMQGGNSLLLHLD----DWEHLDNYF 203 Y+ + ++ H DG Y E T + + Q QGG + LL + + Sbjct: 116 YIPYKAQAIQWHTDGYYNEPERTIRGMALHCARQAEQGGENDLLDHEIMYIRLRDQNPEH 175 Query: 204 RHPLARRPMRFAAPPSKNVSKDVFH-----PVFDVDQQGRPVMRYIDQFVQPKDFEEGVW 258 L + P N + +V PVF VD G MRY + V E+ Sbjct: 176 IRALMAEDVL-TIPARTNRAGEVVRAACVGPVFSVDANGYLHMRYTARTVSIAWREDAAT 234 Query: 259 LSE-------LSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLR--RELMR 309 L+ L A + + + G+ L+ NN LH R F P+ + R + R Sbjct: 235 LAAVTALEAILKHATDEP-YAHHLRMQPGQGLICNN--VLHTRGHFDAAPEGQANRLMYR 291 Query: 310 QRGY 313 R + Sbjct: 292 IRCW 295 >UniRef50_Q6C1G9 YALI0F16357p n=1 Tax=Yarrowia lipolytica RepID=Q6C1G9_YARLI Length = 453 Score = 61.3 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 53/194 (27%), Gaps = 24/194 (12%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSL--- 189 +Y R + + + + LH D Y E L+ I Q GG S+ Sbjct: 239 TFYGRSWDTRSVPNPKNVAYTSQYLPLHMDLLYYESPPGIQLLHVIKNQ-AVGGESIFTD 297 Query: 190 -LLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQ-------GRPVM 241 + L P+ F P+ QQ + Sbjct: 298 SFASAKYVWEKNPAAYRALCEIPLTFHYINDGQ-HYHNTVPMIVEHQQTDKSKWTNPKAI 356 Query: 242 RYIDQFVQPKDFEEGV-----------WLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 Y F P D E V L + + +++ L + +L N Sbjct: 357 NYAPPFQGPFDAVELVEGGEKCELFREGLRLFEEHLTSAENELRTKMEENSCVLFLNRRV 416 Query: 291 LHGRDRFTPHPDLR 304 LH R F +R Sbjct: 417 LHSRTEFDAQSGVR 430 >UniRef50_Q0ZQ39 FrbJ n=2 Tax=Streptomyces RepID=Q0ZQ39_9ACTO Length = 339 Score = 61.3 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 37/295 (12%), Positives = 83/295 (28%), Gaps = 35/295 (11%) Query: 36 TFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLNRAEGAL 95 E + LE Q L R + + + + + G Sbjct: 29 VLDEGMRAEILEAAERINEQGLTVWDLDR---KAVPLERAGKLVAQCVEQLEHG--FGLA 83 Query: 96 LINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYA---RFVVKNVDNSDSYLRQ 152 ++ V + + A+ V + HL G + G + +++ Q Sbjct: 84 MLRGVPTEGLTVAESQVVMGVVGLHL-GTAVAQNGHGDRVVSIRDYGKGRLNSKTIRGYQ 142 Query: 153 PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLA---- 208 + + H+D D ++ + + G + + + L L Sbjct: 143 TNESLPWHSDA------PDIAALLCLTQAKHGGEFHVASAMHIYNTLLQEAPELLGLYYA 196 Query: 209 --RRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFE--------EGVW 258 R PP + + +F ++ F + E + Sbjct: 197 GVFFDYRGEEPPGEPPAYRNA--IFGYHNGQLSCRYFLRNFADSGTAKLGFEQPEVEKLA 254 Query: 259 LSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDL----RRELMR 309 L + + +S+ + G L+++ +H R ++ D R L+R Sbjct: 255 LDTFEEIASRPENHVSMRLEPGDMQLVDDNVTVHRRGAYSDEEDGSTDSSRHLLR 309 >UniRef50_A8YB34 Similar to tr|Q2UDI5|Q2UDI5_ASPOR Predicted protein n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YB34_MICAE Length = 237 Score = 60.9 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 67/185 (36%), Gaps = 16/185 (8%) Query: 134 YYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHL 193 Y + + SD+ + P + H + +Y Y+ + GG + L+ Sbjct: 54 YPIKANPDAMGKSDA--QSPGIGLLPHTEWSYKAIPPKYLCLRCKTPDRWGGGATTLVKF 111 Query: 194 DD-WEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVD----QQGRPVMRYID--- 245 DD H +H +A +P F SK+ + F P++ D + V+ Y + Sbjct: 112 DDLLRHFTLEEQHFMAAQPQYFM---SKDGKESCFAPIWQRDAEIIRFSYNVLVYREFSP 168 Query: 246 QFVQPKDFEEGVWLSELSDAIET--SKGILSVPVPVGKFLLINNLFWLHGRDRFT-PHPD 302 +P L L + + + + L+++N LH R + P+ Sbjct: 169 ALSKPIASGLDSRLMALCGKFLALFEACHVPLYLKAEQILIMDNYTCLHSRQSYIDPNRA 228 Query: 303 LRREL 307 L R + Sbjct: 229 LERVM 233 >UniRef50_C7Q942 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q942_CATAD Length = 343 Score = 60.9 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 63/228 (27%), Gaps = 31/228 (13%) Query: 108 ADEMVKLATAVAHLIG---RSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGT 164 + A+ + G + + M + S+S + H + + Sbjct: 97 EIFFLLCGAALGDVFGWATQQDGRIMHDVLPIKGHEHYELGSNSLQH-----LSWHTEDS 151 Query: 165 YVEEITDYVLMMKIDEQN------MQGGNSLLLHLDDWEHLDNYF-----RHPLARRPMR 213 + DYV +M + G+ +LD + F L + Sbjct: 152 FHPCRGDYVALMCLKNPYEAETMVCDAGDLDWPNLDVDALFEPVFTQMPDNSHLPQNTAE 211 Query: 214 FAAPPSKNVSKD---------VFHPVFDVDQQGRPVMRYI--DQFVQPKDFEEGVWLSEL 262 P+K+ + +PV G Y+ D + D L Sbjct: 212 STGDPTKDRLRARSFELIKSWNENPVRRAVLYGDRQNPYMALDPYHMKMDDWSERSLEAF 271 Query: 263 SDAIET-SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 E + V + G I+N +HGR F D ++ Sbjct: 272 QALCEEIEAKMQDVVLHPGDIAFIDNFRAVHGRRSFRARYDGSDRWLK 319 >UniRef50_Q9FB35 Clavaminic acid synthase-like protein n=1 Tax=Streptomyces verticillus RepID=Q9FB35_9ACTO Length = 337 Score = 60.9 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 49/175 (28%), Gaps = 32/175 (18%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYF---RHPLARRPMRFA 215 H + + DYV ++ + W L R L Sbjct: 148 WHTEDAFHPLRCDYVGLLCLRNHQRAATTV------GWPDLSRLTTEDRAVLLEPRYLIR 201 Query: 216 --------APPSKNVSKDVFHPVFDVDQ--------QGRPVMRYID-----QFVQPKDFE 254 + S + F + ++D G P Y+ P D Sbjct: 202 PDTSHTPAQNATGTRSAERFAAIAEMDDAPERVAVLFGDPEDPYLRIDPAYMSPAPGDAA 261 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 ++ IE + V + G LL++N +HGR F D R ++ Sbjct: 262 ARRAYDTVTALIEDE--LRHVVLDAGSLLLVDNYQAVHGRKPFAAAYDGRDRWLK 314 >UniRef50_Q4PCW2 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PCW2_USTMA Length = 777 Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 31/140 (22%), Positives = 48/140 (34%), Gaps = 22/140 (15%) Query: 183 MQGGNSLL----LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVF------- 231 GG SLL L + + L+R +R S + P+F Sbjct: 606 PSGGESLLVDGFLAAAVLKDVHPDAYETLSRVRIR---THSAGDENTMIRPLFEGGYPIL 662 Query: 232 -DVDQQGRPVM-RYIDQ----FVQPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFL 283 D G V+ RY + D E + L + + + +G V + G L Sbjct: 663 QHDDATGELVLVRYNNDDRSVLRIDADDVERFYDALRKWNQILTNPEGEYWVQLKPGSAL 722 Query: 284 LINNLFWLHGRDRFTPHPDL 303 + +N LHGR F + L Sbjct: 723 IFDNHRVLHGRSAFVGNRRL 742 >UniRef50_A6G9Y5 Pyoverdine biosynthesis protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G9Y5_9DELT Length = 666 Score = 60.2 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 52/178 (29%), Gaps = 31/178 (17%) Query: 153 PHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLAR-RP 211 + +H DG ++ Y++ + GG + ++ E + LA Sbjct: 59 TEGPVPVHWDGAFLHTPPHYIVFQCDEAGPGCGGETTF--VNTVELMKGISEEELAAWDE 116 Query: 212 MRFAAPPSKNVSK--------DVFHPVFDVDQQGRPVMRYIDQ-----------FVQPKD 252 K V HPV +RY + P D Sbjct: 117 YTVTYLTKKVVHYGGDFMASIIGEHPVTKE-----RTLRYAEPVELLNPVKVFIHGMPVD 171 Query: 253 FEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQ 310 G ++ + + + + + G +L +N LHGR + R + R Sbjct: 172 EHSG-FIETMKERLYDPSVLYAHRWVEGDIVLADNHALLHGRYALN---NSNRRIRRV 225 >UniRef50_Q93JQ4 Clavaminate synthase n=1 Tax=Rhodococcus fascians RepID=Q93JQ4_RHOFA Length = 247 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 44/247 (17%), Positives = 70/247 (28%), Gaps = 47/247 (19%) Query: 91 AEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYL 150 +G L+ V + L T H G + + A + V S Sbjct: 15 RDGFALVGDVDFGRSDFEVFVQGLGTLATHRFGTGSAGLL--DLDASPLPDRVVTGRS-- 70 Query: 151 RQPHRVMELHNDGTYVEEITDYVLMMK--IDEQNMQGGNSLLLHLD-------------- 194 + LH DGT V Y+ + +D+ G L L D Sbjct: 71 -----ALPLHTDGTLVGTCPKYIALYCNAVDQPTGSGRTELCLQSDLLDATLPPDLSTVT 125 Query: 195 --DWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPK- 251 DWE+ H R+ + P S + + + ++ P Sbjct: 126 DVDWEYYVTDQSH-FPDVAHRWLSIPPAVRSATGTTRL-------NVALPFTEERTTPAG 177 Query: 252 ---------DFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 + + + L + S S G L+++N LHGR P Sbjct: 178 WSVRLAGSTEEDSRAVFARLDSYLRGSTSFYSHDWAPGDLLVLDNEQVLHGRTTIEP--G 235 Query: 303 LRRELMR 309 R L R Sbjct: 236 SVRHLFR 242 >UniRef50_B2AD86 Predicted CDS Pa_4_400 n=4 Tax=Leotiomyceta RepID=B2AD86_PODAN Length = 440 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 37/260 (14%), Positives = 69/260 (26%), Gaps = 31/260 (11%) Query: 70 ILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA 129 L + L + G ++ + DD AD +++ R Sbjct: 132 FPLPDLGPILSGICNDIYLGK--GFHIVRGLDPDDYPLADLTAIYLGLSSYVASRRGRQD 189 Query: 130 MSGQYYARFVVKNVDNSDSYLRQPH--RVMELHNDGTYVEEITDYVLMMKIDEQNMQGGN 187 G + + +++S L H D TD + + E GG Sbjct: 190 QRGSMLIHVMQRGDQSAESTLHDSIYSSDKPFHTDTV-----TDTLCLFT-QELASSGGR 243 Query: 188 SLLLHL----DDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRY 243 S ++ H L F + + R + + Sbjct: 244 STFASAWTVYNELAATRPDLIHTLGSPDWPF---DTYGRDPPFYRRALMYFHDHRLITSF 300 Query: 244 IDQFVQ---PKDFEEGVWLSELSDAIETSKGILSV---------PVPVGKFLLINNLFWL 291 + + P + L++A + + + G +NNL L Sbjct: 301 SRRLLVGHAPFTPRSK-AIPGLTEAQAEALDAVHFIAKKHEIKPRMERGDIRFVNNLGLL 359 Query: 292 HGRDRFTPHPD-LRRELMRQ 310 H R+ F R L+R Sbjct: 360 HRREAFENVQGSKPRHLVRI 379 >UniRef50_C7YUC3 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YUC3_NECH7 Length = 385 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 62/206 (30%), Gaps = 28/206 (13%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQN---------M 183 +Y F D + + + + H D TY E + Sbjct: 146 THYGGFYDFIPDLALADTAYTNLALPAHTDTTYFSEPAGLQAFHLLSHHAPPNAKPSDDA 205 Query: 184 QGGNSLLLH----LDDWEHLDNYFRHPLARRPMRFAAPPSKNVS--KDVFHPVFDVDQQG 237 GG SLL+ + L + + A +K ++ D +PV + D Sbjct: 206 LGGQSLLVDGFQVASTLKKESPEAYETLRTIQVPWHASGNKGIAIVPDRTYPVIEEDNGE 265 Query: 238 RPVMRYIDQ---FVQPKDFEEGV-WLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHG 293 +R+ + + + E+ + ++ I + G+ ++ +N +HG Sbjct: 266 LTRVRWNNDDRGILDVFNAEKWYSAARKWNEIIRRRSSEYWFQLTPGRVVIFDNWRVMHG 325 Query: 294 RDRFTPHPDLRRELMRQRGYFAYASN 319 R F R AY Sbjct: 326 RSAFEGIR---------RICGAYTPR 342 >UniRef50_B8GUE4 Taurine catabolism dioxygenase n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GUE4_THISH Length = 314 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 40/182 (21%), Positives = 63/182 (34%), Gaps = 19/182 (10%) Query: 149 YLRQPHRVMELHNDGTYV-EEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPL 207 Y+ +R + H DG Y E +++ +GG + LL + Sbjct: 131 YIPYTNRPLSWHCDGYYNPPERRIRAMLLHCVTDAAEGGENALLDHELLYIRLRDENPGW 190 Query: 208 ARRPMRFAAPP-SKNVS------KDVFHPVFDVDQQ-GRPVMRYIDQ------FVQPKDF 253 M A +NV + PVF VD + GR +RY + P Sbjct: 191 IEALMHPQAMTIPENVENGVVLREAQSGPVFSVDPETGRLHLRYTARERNVHWRDDPLTR 250 Query: 254 EEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGY 313 E + EL + GI+ + G+ +L NN LH R F P + + R Sbjct: 251 EAAARIRELLAS--DMPGIIRHRLSPGEGILCNN--VLHNRTGFRDDPQAGQTRLIYRAR 306 Query: 314 FA 315 + Sbjct: 307 YL 308 >UniRef50_B2AMD0 Predicted CDS Pa_5_7570 n=1 Tax=Podospora anserina RepID=B2AMD0_PODAN Length = 193 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 48/166 (28%), Gaps = 25/166 (15%) Query: 176 MKIDEQNMQGGNSLLLHLDDW-EHLD---NYFRHPLARRPMRFAAPPSKNVSKDVFHPVF 231 M + E + +GG + + HL LA+ F PP V PV Sbjct: 1 MFVIEASSEGGQGIFCPVASICNHLAATCPSLLQELAKADWPFDRPPD-GVGSFYRRPVM 59 Query: 232 DV-DQQGRPVMRYIDQ--FVQP----------KDFEEGVWLSELSDAIETSKGILSVPVP 278 + G P M + P + L + A + + Sbjct: 60 YLNSTTGAPEMLFSRGALIRSPQGFRPSDVPLLTVRQNAALDAIHFAATSKALKVVYR-- 117 Query: 279 VGKFLLINNLFWLHGRDRFTPH-----PDLRRELMRQRGYFAYASN 319 G L NN LHGR+ FT L R +R + Sbjct: 118 PGDVLFFNNRRVLHGREAFTDDSVNGTRHLLRLWLRDEELAGTPPH 163 >UniRef50_C5A800 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A800_BURGB Length = 254 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Query: 152 QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH-LDDWEHLDNYFRHPLARR 210 Q + H + Y+ + + G++ L L L + ++ Sbjct: 75 QSRNGIGPHTEVPVGAPPPRYLALHCHRQARCGQGHTRLADGLAFCRSLPPELQRFVSEV 134 Query: 211 PMRF--AAPPSKNVSKDVFHPVFDVDQQGRPVMRYI-DQFVQ-------------PKDFE 254 P+ F P + + P+ D + RPV R+ +QF+ P Sbjct: 135 PVTFAATLVPGTRSRQTLTVPIMSRDGE-RPVFRFSYNQFLYGDVNPSEDALERAPSGEG 193 Query: 255 EGVWLSELSDAIET--SKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQ 310 L+ L+ + + V +P G L+ +N +H R FT D R L R Sbjct: 194 ADTPLARLAKLAQAYFEADAVPVLIPDGHLLIWDNHRLIHARSTFT---DPERHLTRY 248 >UniRef50_C1E7Q6 Predicted protein n=2 Tax=Micromonas RepID=C1E7Q6_9CHLO Length = 433 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 43/293 (14%), Positives = 71/293 (24%), Gaps = 63/293 (21%) Query: 70 ILDDLCANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDA 129 +L + + R G L V V AH +G + Sbjct: 105 FPLPTLKTRLDATRRELVHGR--GLCLFRGVPVHRYTPWQRCAVFYAMGAH-MGWTCPQN 161 Query: 130 MSGQYYARFVVKNVDNSDSYLR--QPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGN 187 G D +D R H D D V +M + E + GG Sbjct: 162 ARGHVLGHVKDLGADPNDPTTRIYTTCAAQPFHTDS------ADIVGLMCL-ENSTTGGE 214 Query: 188 SLLLH-LDDWEHLDN----YFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVD-------- 234 S+++ + W L R L P+ PVF Sbjct: 215 SMVVSSVAVWNELARTAPQLARTLLEPFPVDRKGEVPPGKRPTYDMPVFHRHGARASDVD 274 Query: 235 -------------------QQGRPVM---------------RYIDQFVQP-KDFEEGVWL 259 ++G + R+ + P + L Sbjct: 275 GSAVTVCVGAGEAGDSEGGREGGCELLSGIYDRNFIDAAQARFTEDDGVPRLTPTQIAAL 334 Query: 260 SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDL---RRELMR 309 L + L + + G ++N H R + +R L+R Sbjct: 335 DALDATCDDPSVRLDMRLEPGDVQWLHNHTTFHARREYGDGEGKGPSKRHLLR 387 >UniRef50_Q19000 Probable gamma-butyrobetaine dioxygenase n=4 Tax=Rhabditida RepID=BODG_CAEEL Length = 421 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 64/197 (32%), Gaps = 32/197 (16%) Query: 138 FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWE 197 F V ++ + + + H D + + M+ + + +GG+SL +D + Sbjct: 200 FEVSLKADASNMAYASNGGLPFHTDFPSLSHPPQ-LQMLHMLQSAEEGGHSLF--VDGFH 256 Query: 198 HLDNYFRHPLARRPMRFAAPPSKNVSKDVF-------------------HPVFDVDQQGR 238 + + S ++ + H V ++ G+ Sbjct: 257 VAEQL--RVEKPEIFKILTTQSMEYIEEGYDVHEINGKTIRFDYDMCARHKVIRLNDDGK 314 Query: 239 P-VMRYIDQFV------QPKDFEEG-VWLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 +++ + +P ++ + ++ + +L + G +L N Sbjct: 315 VNKIQFGNAMRSWFYDCEPSKVQDVYRAMKTFTEYCYQPRNMLKFRLEDGDTVLWANQRL 374 Query: 291 LHGRDRFTPHPDLRREL 307 LH RD F P+ R L Sbjct: 375 LHTRDGFRNAPEKARTL 391 >UniRef50_Q16V01 Epsilon-trimethyllysine 2-oxoglutarate dioxygenase n=3 Tax=Culicidae RepID=Q16V01_AEDAE Length = 713 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 61/194 (31%), Gaps = 27/194 (13%) Query: 125 SNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ 184 + G+ + ++D+SD+ + + + H D TY + + ++ I + Sbjct: 507 PIHKTLFGEMWT--FSDSMDHSDTAYTKNY--LGPHTDNTYFSDASGLQVLHCIQFKGSG 562 Query: 185 GGNSLL--------LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFD---- 232 G L+ L L E + +P+ + + P+ Sbjct: 563 GQTILIDGFKAAEQLRLKKPEVFERLCNYPVTGEYL------EEGKHHTYCAPIIKRNII 616 Query: 233 VDQQGRPVMRYIDQF---VQPKDFEEGVW--LSELSDAIETSKGILSVPVPVGKFLLINN 287 + + D+ P++ + EL I + + G ++ +N Sbjct: 617 TGEVEQLRFNIYDRAILKTIPQEQVPQFYADFKELGAEINEESMAWTFQLTPGTVMIFDN 676 Query: 288 LFWLHGRDRFTPHP 301 LHGR + Sbjct: 677 WRVLHGRMAYNGKR 690 >UniRef50_D1SG31 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SG31_9ACTO Length = 340 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 58/174 (33%), Gaps = 27/174 (15%) Query: 157 MELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAA 216 +E H + + DY+++ + + + L + D E L + R L++ Sbjct: 149 LEFHTEDGFHPARCDYLMLFGVRNNDEV--PTYLASIGDVE-LSDAVRSVLSQPRFHIFP 205 Query: 217 PPSKNVSKDVFHP-------------------VFDVDQQGRPVMRYIDQFVQ--PKDFEE 255 HP V D+ P +R F++ D E Sbjct: 206 DDEHVRQLQRRHPDHPALARALELQRAPEAVAVLFGDRL-NPYLRIDRPFMRCAEGDVEA 264 Query: 256 GVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 L L ++ + SV V G L+++N +HGR F D ++ Sbjct: 265 ERALDALMAELK--RHQQSVVVERGTLLVVDNYRAVHGRKAFRSRYDGSDRWLK 316 >UniRef50_C7QJ42 Clavaminate synthase n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QJ42_CATAD Length = 358 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 48/186 (25%), Gaps = 12/186 (6%) Query: 131 SGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLL 190 G + V + +ELH + + D+V + + + + L Sbjct: 149 HGHTFQDMVPSAMSAHSQTSLGSAVELELHTEQAFSPLRPDFVSLACLR-GDPRALTYLF 207 Query: 191 LHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFH--------PVFDVDQQGRPVMR 242 L L S F P+ P + Sbjct: 208 SARQLVATLTTQEIAMLREPMWTTTVDESFLAEGRTFLLGFERGPIPILS-GADDDPFIV 266 Query: 243 YIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPD 302 + ++ + A + + G+ LLI+N +HGR F P D Sbjct: 267 FDQDLMRGISAPAQELQQTVIRAYYAERVSHC--LAPGEMLLIDNRRAVHGRSIFAPRFD 324 Query: 303 LRRELM 308 + Sbjct: 325 GADRFL 330 >UniRef50_Q9NVH6 Trimethyllysine dioxygenase, mitochondrial n=39 Tax=Chordata RepID=TMLH_HUMAN Length = 421 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 64/189 (33%), Gaps = 21/189 (11%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 Y R D S ++ H D TY +E + + + G L Sbjct: 215 TIYGRMWYFTSDFSRGDTAYTKLALDRHTDTTYFQEPCGIQVFHCLKHEGTGGRTLL--- 271 Query: 193 LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDV---------FHPVFDVDQQGRPV--M 241 +D + + + + P +DV PV ++ + + + Sbjct: 272 VDGFYAAEQVLQKAPEEFELLSKVPLKHEYIEDVGECHNHMIGIGPVLNIYPWNKELYLI 331 Query: 242 RYI--DQFV---QPKDFEEGVWLSE--LSDAIETSKGILSVPVPVGKFLLINNLFWLHGR 294 RY D+ V P D + + L+ + + V + G+ L I+N LHGR Sbjct: 332 RYNNYDRAVINTVPYDVVHRWYTAHRTLTIELRRPENEFWVKLKPGRVLFIDNWRVLHGR 391 Query: 295 DRFTPHPDL 303 + FT + L Sbjct: 392 ECFTGYRQL 400 >UniRef50_C8V493 Trimethyllysine dioxygenase TmlH, putative (AFU_orthologue; AFUA_1G06180) n=1 Tax=Aspergillus nidulans FGSC A4 RepID=C8V493_EMENI Length = 519 Score = 59.0 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 49/187 (26%), Gaps = 20/187 (10%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLH 192 +Y F D S + + H D TY + L + + GG +LL Sbjct: 291 THYGGFWDFTADMSFKDTAYTNEALGAHTDNTYFTDPARLQLFHMLSHTDGDGGATLL-- 348 Query: 193 LDDWEHLDNYFRHPLARRPMRFAAPPSKNV---SKDVFHPV-----------FDVDQQGR 238 +D + + + ++ PV F Sbjct: 349 VDGFRAARRLYAESKQNLNHLRNIRQPFHASGNEDSIYQPVEQQVVLRAHAQFKHRLYQV 408 Query: 239 PVMRYIDQFVQPKDFEEG----VWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGR 294 Y EE +D I + + G L+ +N LHGR Sbjct: 409 RWNNYDRAVKWNWSLEEQEAWYKAAKHFNDIIHREDMEIWTQLQPGTALIFDNWRMLHGR 468 Query: 295 DRFTPHP 301 FT Sbjct: 469 SAFTGKR 475 >UniRef50_B7GDP6 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7GDP6_PHATR Length = 549 Score = 59.0 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 71/253 (28%), Gaps = 47/253 (18%) Query: 81 PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVV 140 +L + + +I A D + Q + V V H + S GQ Y Sbjct: 293 DILDAVVHD----GAVIVAQTPDTLDQHETTV---GYVGHSL--SGGGLSHGQLYGDIFH 343 Query: 141 KNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ-GGNSLLLH----LDD 195 ++ L + H D Y + L+ + GG S+L+ + Sbjct: 344 VETAHNAHNLAYTSVALPPHQDLAYYDSKPGLQLLHCVANTADVLGGESVLVDAVAAAQE 403 Query: 196 WEHLDNYFRHPLARRPMRF-AAPPSKNVSKDVFHPVFDVDQQGRPV-MRYIDQFVQPKDF 253 +L L + P F ++ H + D G V + + F P Sbjct: 404 LRNLAPDHFEALTKCPATFLKQREGADMVYRRTH--VEQDSTGSVVAVHWSPPFQGPLCI 461 Query: 254 EE-------------GVWLS--------------ELSDAI--ETSKGILSVPVPVGKFLL 284 L EL ++ + + G L+ Sbjct: 462 RPDLVDNYFVAYAVLERMLDNSLPRDRFILPIAPELEQSLIDYAHEYTWQHRLEEGHLLI 521 Query: 285 INNLFWLHGRDRF 297 NN LHGR F Sbjct: 522 FNNQRMLHGRRGF 534 >UniRef50_UPI0001B55A0A putative taurine catabolism dioxygenase n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B55A0A Length = 283 Score = 58.6 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 49/164 (29%), Gaps = 13/164 (7%) Query: 159 LHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLD-DWEHLDNYFRHPLA--RRPMRFA 215 H+D + E + ++ GG + + D + LD LA R A Sbjct: 102 WHSDQQH-RERPATLAVLYCVVPAASGGATSFVSADVESAGLDEATVADLAGRRAVYEPA 160 Query: 216 APPSKNVSKDVFHPVFDVDQQGRPVMRYIDQ----FVQPKDFEEGVWLSELSDAIETSKG 271 V HP + G Y+ F E + + Sbjct: 161 FNHDNAPRVRVSHPALLTSRTGDRHYAYVSDNTLGFTGLAADESAALKQRVLSRLLEPSR 220 Query: 272 ILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFA 315 I + G F L +N LH R+RF R ++ FA Sbjct: 221 IYAHRWQAGDFALYDNTQLLHRRERFQG-----RRWLKAAKVFA 259 >UniRef50_A3NB26 Dioxygenase, TauD/TfdA family n=29 Tax=pseudomallei group RepID=A3NB26_BURP6 Length = 292 Score = 58.6 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 74/238 (31%), Gaps = 35/238 (14%) Query: 107 QADEMVKLATAVAHLIGR--SNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGT 164 Q + V+ A L G +NF + + + + V D + S R H D + Sbjct: 49 QPELSVETQIAFGRLFGELETNFPSFTAKPEGQPEVTVFDGAVSTGRASI----WHTDLS 104 Query: 165 YVEEITDYVLMMKIDEQNMQGGNSLLLHLDD-WEHLDNYFR---------HPLARRPMRF 214 + + ++ + E GG+++ L+ + L + H + Sbjct: 105 IAKT-PSAMGILCVKETPDSGGDTMWADLEAAYAALSPGMQAFLEGQRAVHDMMTPQYAQ 163 Query: 215 AAPPSKNVSKD---------VFHPVFDVDQQGRPVMRYIDQFVQPK-----DFEEGVWLS 260 + + HPV V + +++ F+ E L+ Sbjct: 164 RPGAFQTRGRSDMDLSEVFGAEHPVVRVHPETGRKCLFVNPFLTSHLVGFHSAESATILN 223 Query: 261 ELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR--QRGYFAY 316 L +E + ++ G L +N +H + RR R RG Y Sbjct: 224 YLYALMERPQYVVRWHWSRGDVALWDNRCTMH--TAVDDYGAGRRFARRVCVRGDVPY 279 >UniRef50_A0P0W2 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0W2_9RHOB Length = 272 Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 53/175 (30%), Gaps = 6/175 (3%) Query: 139 VVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEH 198 +K + L + H D Y E + D V M+ + GG S+++ + Sbjct: 88 EIKPRNGEHRELIYTEKGQPAHCDSAYHETMPDIV-MLGCSKAASSGGLSIIIDIRSLLS 146 Query: 199 LDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQ-GRPVMRYIDQFVQPK---DFE 254 + R + + V P ++ R + F E Sbjct: 147 DHHMEYLKERLRAHQQIDVIYSKRNIRVEKPFVKMNPNTERAEFAFT-PFALSAMFKSKE 205 Query: 255 EGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMR 309 + + + + G F++++N LHGR F L R + Sbjct: 206 DANLYDCIIQNSNSPTYSRYFLLEDGDFVILDNTSMLHGRTAFAGERSLSRFWFK 260 >UniRef50_UPI0001B59B43 hypothetical protein MaviaA2_08448 n=2 Tax=Mycobacterium avium complex (MAC) RepID=UPI0001B59B43 Length = 371 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 39/264 (14%), Positives = 79/264 (29%), Gaps = 34/264 (12%) Query: 71 LDDLCANQLQPLLLKTLLNRAEGALL--INAVGVDDVKQAD-EMVKLA--TAVAHLIGRS 125 +D L L + + G L + V+ D E + A T + +L+ + Sbjct: 56 PEDARHPDLDDDLARLYHDLMFGKGLACVRGFPVEQHSIEDLERIYWAFCTHLGYLVSNN 115 Query: 126 NFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQG 185 +F + + + + + +HND D + ++ + QG Sbjct: 116 SFG--HRMVRVQEEILPGGVQPARGTKSRAELAMHNDA------ADILSLLCVYP-AAQG 166 Query: 186 GNSLLLH--------LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQG 237 G S L + L + R P + PVF G Sbjct: 167 GESQFASGPAAHNRILAERPDLLDVLYEGFPHHR-RSEQPDDQPDVTPYNVPVFSQ-ING 224 Query: 238 RPVMRYIDQFVQPK--------DFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLF 289 R + + + P ++ + L + + + + + G+ + NN Sbjct: 225 RICINFTYSSILPAMKTLGREFTPKQEEAIELLRNILVDQQ--VEFRLESGEAAVANNFA 282 Query: 290 WLHGRDRFTPHPDLRRELMRQRGY 313 H R F D ++ R + Sbjct: 283 MCHSRSDFVSSDDPKKARCFLRAW 306 >UniRef50_B0T4H7 Taurine catabolism dioxygenase TauD/TfdA n=3 Tax=Alphaproteobacteria RepID=B0T4H7_CAUSK Length = 287 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 37/283 (13%), Positives = 79/283 (27%), Gaps = 34/283 (12%) Query: 53 PVQALEYKSFLRFRVAKILDDLCANQLQPLLLKTLLN-RAEGALLINAVGVDDVKQADEM 111 + E K ++ I ++ + K + GA+++ + + Sbjct: 8 DLTISELKPGFGAQIHDIDLPSSSD---AEIDKVVAAFHRHGAVVLRGQDMTPDDLMRFI 64 Query: 112 VKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITD 171 + A H R Y F++ N L + + H D +Y E Sbjct: 65 GRFGDAEDHTQTRFTLPG----YPKIFILSNRVVDGKPLGAHNDGVGWHTDYSYKPEPVM 120 Query: 172 YVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFA---------------- 215 ++ ++ + L W L + L + + Sbjct: 121 LTMLYAVEVPDEGSDTLLADGCAAWNALSPEKQAELLPLSLHHSYKHFMATRQFGQQQTL 180 Query: 216 APPSKNVSKDVFHPVFDVDQQGRPVMRY------IDQFVQPKDFEEGVWLSELSDAIETS 269 +P + + DV HP+ + + +P E L EL + + Sbjct: 181 SPELEAANPDVEHPLIRTHPADGRKALWPSTGTVTEVIGKP-GPEGLALLDELVEFMTGD 239 Query: 270 KGILSVPVPVGKFLLINNLFWLHGRDRFTPH---PDLRRELMR 309 + G L+ +N LH + + R ++ Sbjct: 240 DFVYRHKWAKGDLLMWDNRCTLHTGTLYDDTKYFRTMHRLWVK 282 >UniRef50_B0KR96 Putative uncharacterized protein n=1 Tax=Pseudomonas putida GB-1 RepID=B0KR96_PSEPG Length = 324 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 58/168 (34%), Gaps = 10/168 (5%) Query: 156 VMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFA 215 H + + +Y+ ++ + N + ++L +D + L + L R + Sbjct: 147 DFGFHVEDAFHPARPEYLGLVCMR--NDERAATVLSSIDGIK-LSEEEMNVLFESRFRIS 203 Query: 216 APPSKN----VSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKG 271 P N V ++ +F + V + E L +L + S+ Sbjct: 204 HNPIHNTSDVVEENAQTILFGHRDAPYVKINAATLDVGEYEGIERQALEKLLNHF--SEN 261 Query: 272 ILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASN 319 +++ + + I+N +H RD F + + R FA + Sbjct: 262 RVTLILEQADCVFIDNYRCVHARDSFNANYGGGARWL-SRVVFASSLR 308 >UniRef50_C9S8L0 Trimethyllysine dioxygenase n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9S8L0_VERA1 Length = 441 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 60/191 (31%), Gaps = 22/191 (11%) Query: 133 QYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQG------- 185 +Y F D + + + + H D TY + + + G Sbjct: 219 THYGGFYDFIPDMAHADTAYTNLALPAHTDTTYFSDPAGLQAFHMLSHEAPPGAPAGELG 278 Query: 186 GNSLLLH----LDDWEHLDNYFRHPLARRPMRFAAPPSK--NVSKDVFHPVFDVDQQGRP 239 G SLL+ D L++ + + A ++ ++ + +PV ++D+ Sbjct: 279 GKSLLVDGFYAASILLEEDPQAYEILSKVKLPWHASGNEGITIAPNKRYPVLELDETTGK 338 Query: 240 VMRYI----DQFVQPKDFEEG-----VWLSELSDAIETSKGILSVPVPVGKFLLINNLFW 290 + R D+ V P + + V + G L+ +N Sbjct: 339 LARVRWNNDDRGVVPFGEGYSPAEWYAAARKWDAILRRPAVEYWVQLNPGHLLIFDNWRV 398 Query: 291 LHGRDRFTPHP 301 +HGR F Sbjct: 399 MHGRSAFEGRR 409 >UniRef50_C5E1T6 ZYRO0G01342p n=1 Tax=Zygosaccharomyces rouxii RepID=C5E1T6_ZYGRO Length = 460 Score = 57.1 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 70/211 (33%), Gaps = 44/211 (20%) Query: 133 QYYAR-FVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKID-EQNMQGGNSLL 190 +Y F VK +N + + + LH D Y+E + +Y L+ I+ + GG ++ Sbjct: 227 TFYGETFDVKGANNETPNIAYSNLALPLHMDLLYMENVPEYQLLHTINNPKEGSGGVNVF 286 Query: 191 LH----LDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVF-HPVFDVDQQGR------- 238 + ++ LD L P+ + S+ + P+ + + + Sbjct: 287 VDGFRAANEVRELDTISYQALQNVPINYH--YSRGDKRYYQSKPMIEHHESHKGNTLDDY 344 Query: 239 -----PVMRYIDQFVQPKDFE----------------EGVWLSELSDAIE-------TSK 270 + Y F P F E ++ ++ +E Sbjct: 345 FDGLIKAVNYSPPFQAPFTFGIHAKSPESDACKSKLAERHMFNDFTNGLELFEDCITDKS 404 Query: 271 GILSVPVPVGKFLLINNLFWLHGRDRFTPHP 301 + +P ++++N LH R ++ Sbjct: 405 NQFEIKLPPNSCVILDNRRVLHARSAYSSDS 435 >UniRef50_B9JZY5 Gamma butyrobetaine hydroxylase protein n=3 Tax=Proteobacteria RepID=B9JZY5_AGRVS Length = 398 Score = 57.1 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 65/237 (27%), Gaps = 29/237 (12%) Query: 77 NQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYA 136 L+ L L G + + V ++A + D + Sbjct: 127 ETLRAALDALL---RFGVVTFKGDPIRKVSFETFTDRVAGFL---------DRTYFGEFF 174 Query: 137 RFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKID-----EQNMQGGNSLLL 191 VK +DS R + LH D Y D+ + +D + G + Sbjct: 175 DLNVKPEATTDSVAFST-RELPLHTDIPYYSPPPDFQFLYGLDVSPECARRGIGSTRFVD 233 Query: 192 HLD---DWEHLDNYFRHPLARRPMRFAAPPSK-NVSKDVFHPVFDVDQQG-------RPV 240 L + L R + A + + + P+ + + G P Sbjct: 234 GLAAAIGLRSEEPEAFAALTRIAVINRAEYPQASKIYENIAPIIKLAKDGSIERLLNNPS 293 Query: 241 MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRF 297 + D + + ++ S G ++ +N LHGR F Sbjct: 294 KMFFDNVEFDDMLPLYRAYKIFKERLVSASPAYSHNWSDGDLVIWDNRRILHGRGEF 350 >UniRef50_A0YLG8 Gamma-butyrobetaine hydroxylase n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YLG8_9CYAN Length = 358 Score = 57.1 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 61/194 (31%), Gaps = 18/194 (9%) Query: 125 SNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQ 184 F+A G +S + + H+D +Y V + E Sbjct: 152 PIFNADYGTIMPLETRDKTT--ESLPSRDGCPLPPHHDLSYWGGH-RLVEFLYCVENQNS 208 Query: 185 GGNSLLLHLDDWEHLDNYFRH------PLARRPMR-FAAPPSKNVSKDVFHPVFDVDQQG 237 GG S L +D ++ ++ + L P++ + + + + D+ G Sbjct: 209 GGESTL--VDGFQVAQDFSQDYPQYYQTLLETPVQFWLVDKTHQYRFCNIATILECDRYG 266 Query: 238 R-PVMRYIDQFVQPKDFEEGV-----WLSELSDAIETSKGILSVPVPVGKFLLINNLFWL 291 +R+ + +P E + ++ + + LL N L Sbjct: 267 NLTTVRFSKRNCRPHLPFEQLEDFYQAYHTFFHYLKKNDYKHQFQLRSHNCLLFQNFRIL 326 Query: 292 HGRDRFTPHPDLRR 305 HGR F P R+ Sbjct: 327 HGRTAFDPALGKRK 340 >UniRef50_A3P5A5 Gamma-butyrobetaine dioxygenase n=19 Tax=pseudomallei group RepID=A3P5A5_BURP0 Length = 382 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 61/214 (28%), Gaps = 34/214 (15%) Query: 134 YYARF---VVKNVDNSDSYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQ----NMQGG 186 Y+ F VK D +DS + LH D Y DY + +D Q G Sbjct: 166 YFGDFFDLEVKADDATDSVSFST-SALPLHTDIPYCSPPPDYQFLYGLDVDPRCAREQVG 224 Query: 187 NSLLLHLDDWEHLDNYFRHP------LARRPMRFAAPPSKNVSKDVFH-PVFDVDQQG-- 237 + +D W L LAR + + A + P+ + G Sbjct: 225 CTRF--VDGWAVLRELRDASPEMFERLARTRVVYRADYPGARKRYEHRTPIVRLRADGTV 282 Query: 238 -----RPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLH 292 P + D + + + G ++ +N H Sbjct: 283 ERLINNPTKMFFDGIGFDELMPFFRAYHAFKARLVATMRSYLHAWTQGDMVVWDNRRIFH 342 Query: 293 GRDRFTPHPDLRREL---------MRQRGYFAYA 317 GR F P + R L +R R F A Sbjct: 343 GRGDF-GAPGIVRTLRGGYFREGELRARDAFLAA 375 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.127 0.311 Lambda K H 0.267 0.0388 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,644,254,948 Number of Sequences: 3077464 Number of extensions: 61845708 Number of successful extensions: 153424 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 179 Number of HSP's successfully gapped in prelim test: 262 Number of HSP's that attempted gapping in prelim test: 152890 Number of HSP's gapped (non-prelim): 463 length of query: 325 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 196 effective length of database: 643,403,500 effective search space: 126107086000 effective search space used: 126107086000 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 93 (40.5 bits)