BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (225 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B5YS96 PKHD-type hydroxylase ybiX n=227 Tax=Bacteria Re... 460 e-128 UniRef50_Q1LIS6 PKHD-type hydroxylase Rmet_3078 n=4 Tax=Burkhold... 260 3e-68 UniRef50_A3P5N3 PKHD-type hydroxylase BURPS1106A_A1609 n=30 Tax=... 247 2e-64 UniRef50_A8ESR5 PKHD-type hydroxylase Abu_0724 n=7 Tax=Proteobac... 246 4e-64 UniRef50_C5T9F3 2OG-Fe(II) oxygenase n=1 Tax=Acidovorax delafiel... 225 9e-58 UniRef50_Q3SUS4 PKHD-type hydroxylase Nwi_0701 n=10 Tax=Proteoba... 223 5e-57 UniRef50_Q15TJ8 PKHD-type hydroxylase Patl_2273 n=6 Tax=Proteoba... 222 6e-57 UniRef50_B9TN05 PKHD-type hydroxylase ybiX, putative (Fragment) ... 205 9e-52 UniRef50_Q087U3 PKHD-type hydroxylase Sfri_0612 n=2 Tax=Alteromo... 204 2e-51 UniRef50_UPI00006A2339 UPI00006A2339 related cluster n=1 Tax=Xen... 188 2e-46 UniRef50_A5WFM3 PKHD-type hydroxylase PsycPRwf_1523 n=1 Tax=Psyc... 187 2e-46 UniRef50_B8GUF9 PKHD-type hydroxylase Tgr7_2199 n=1 Tax=Thioalka... 184 1e-45 UniRef50_Q0AP20 PKHD-type hydroxylase Mmar10_1675 n=1 Tax=Marica... 169 5e-41 UniRef50_Q2JHA7 PKHD-type hydroxylase CYB_2270 n=16 Tax=Cyanobac... 167 3e-40 UniRef50_Q0C1R0 PKHD-type hydroxylase HNE_1625 n=4 Tax=Alphaprot... 167 4e-40 UniRef50_A8TQK0 Putative hydroxylase n=1 Tax=alpha proteobacteri... 160 4e-38 UniRef50_Q5QUG6 PKHD-type hydroxylase IL0759 n=2 Tax=Idiomarina ... 145 7e-34 UniRef50_Q47YL9 PKHD-type hydroxylase CPS_3426 n=1 Tax=Colwellia... 142 1e-32 UniRef50_Q3AJA6 PKHD-type hydroxylase Syncc9605_1577 n=13 Tax=Cy... 140 4e-32 UniRef50_A3PE10 PKHD-type hydroxylase P9301_13621 n=4 Tax=Prochl... 135 1e-30 UniRef50_C7R7F4 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis... 131 2e-29 UniRef50_A3YZT5 Putative uncharacterized protein n=2 Tax=Chrooco... 129 1e-28 UniRef50_Q1GXG2 PKHD-type hydroxylase Mfla_0096 n=1 Tax=Methylob... 129 1e-28 UniRef50_A7HP27 PKHD-type hydroxylase Plav_0037 n=1 Tax=Parvibac... 128 2e-28 UniRef50_Q0I9X3 PKHD-type hydroxylase sync_1544 n=2 Tax=Synechoc... 124 2e-27 UniRef50_Q1GRV0 PKHD-type hydroxylase Sala_1910 n=1 Tax=Sphingop... 121 2e-26 UniRef50_Q05TP1 Putative hydroxylase n=1 Tax=Synechococcus sp. R... 105 1e-21 UniRef50_A8TKW7 2OG-Fe(II) oxygenase n=1 Tax=alpha proteobacteri... 102 1e-20 UniRef50_Q5GQB0 Putative uncharacterized protein n=1 Tax=Synecho... 100 3e-20 UniRef50_Q111M8 2OG-Fe(II) oxygenase n=2 Tax=Trichodesmium eryth... 54 4e-06 UniRef50_C7BVH5 2OG-Fe(II) oxygenase family like protein n=1 Tax... 51 3e-05 UniRef50_C7BVA8 2OG-Fe(II) oxygenase superfamily like protein n=... 50 5e-05 UniRef50_B0C441 2OG-Fe(II) oxygenase, putative n=1 Tax=Acaryochl... 50 7e-05 UniRef50_A3PDM8 Putative uncharacterized protein n=2 Tax=root Re... 48 2e-04 UniRef50_A5VDF3 Alkyl hydroperoxide reductase/ Thiol specific an... 47 7e-04 UniRef50_Q98E25 Mlr4439 protein n=1 Tax=Mesorhizobium loti RepID... 45 0.002 UniRef50_A5VC31 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas wittic... 43 0.007 UniRef50_Q9NDP6 Leprecan n=1 Tax=Ciona intestinalis RepID=Q9NDP6... 43 0.007 UniRef50_Q2S7M4 Uncharacterized iron-regulated protein n=1 Tax=H... 42 0.019 UniRef50_B4RHL4 Putative uncharacterized protein n=2 Tax=Phenylo... 41 0.025 UniRef50_C6B8G5 Alkyl hydroperoxide reductase/ Thiol specific an... 40 0.057 UniRef50_Q0FVH2 Oxidoreductase domain protein n=9 Tax=Rhodobacte... 40 0.062 UniRef50_Q2T4K0 Oxidoreductase domain protein n=1 Tax=Burkholder... 40 0.066 >UniRef50_B5YS96 PKHD-type hydroxylase ybiX n=227 Tax=Bacteria RepID=YBIX_ECO5E Length = 228 Score = 460 bits (1183), Expect = e-128, Method: Compositional matrix adjust. Identities = 222/228 (97%), Positives = 224/228 (98%), Gaps = 3/228 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV Sbjct: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL Sbjct: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASF+WIQSMIR Sbjct: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFIWIQSMIR 180 Query: 181 DDKKRAMLFELDN---NIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 DDKKRAMLFELD NIQSLKSRYGE+EEILSLLNLYHNLLREWSEI Sbjct: 181 DDKKRAMLFELDKNIQNIQSLKSRYGENEEILSLLNLYHNLLREWSEI 228 >UniRef50_Q1LIS6 PKHD-type hydroxylase Rmet_3078 n=4 Tax=Burkholderiaceae RepID=Y3078_RALME Length = 228 Score = 260 bits (664), Expect = 3e-68, Method: Compositional matrix adjust. Identities = 127/227 (55%), Positives = 161/227 (70%), Gaps = 3/227 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQA--EWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M+ IP VL+ + +A REQL+ A WVDGRVT G GA VK NQQ+D RS A Q+ Sbjct: 1 MLVRIPQVLNAEQLAMLREQLDHAGDAWVDGRVTAGYSGAPVKFNQQIDERSEAAAQCQH 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-WMRTDLSAT 117 VL+A+ ++ LF +A LP + P+FNRY T+G HVDG VR HP NG +RTD+SAT Sbjct: 61 LVLSALERNPLFISAVLPNIVYPPMFNRYSEGMTFGLHVDGGVRLHPHNGRKLRTDVSAT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFLSDP SYDGGEL + DT+G H VKL AGD+V+YPS+SLH V P+TRGVRV F WIQS Sbjct: 121 LFLSDPASYDGGELQIEDTYGVHSVKLAAGDMVVYPSTSLHQVKPITRGVRVGCFFWIQS 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +IRDD +RA+LF++DN IQ+L + +L+ YHNLLR+WS+ Sbjct: 181 LIRDDGQRALLFDMDNAIQTLNQTNADERARRTLVGCYHNLLRQWSD 227 >UniRef50_A3P5N3 PKHD-type hydroxylase BURPS1106A_A1609 n=30 Tax=Proteobacteria RepID=Y5709_BURP0 Length = 227 Score = 247 bits (631), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 119/226 (52%), Positives = 159/226 (70%), Gaps = 2/226 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MM HIPGVL+ + VA+ R+ L+ A+W DG T+GAQ A K N+Q+ S A + + Sbjct: 1 MMLHIPGVLTKEQVAQCRDILDAADWTDGNATSGAQSALAKRNRQLPEGSPAARAAGDAI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSATLF 119 +A+ ++ALFF+AALP + PLFNRY + +G HVD A+R + +R+DLSATLF Sbjct: 61 QDALARNALFFSAALPLKVFPPLFNRYAGGDAFGTHVDNAIRLLRGTDFRVRSDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 L +P+ YDGGEL V DT+G HR KLPAGD+VLYP+SSLH VTPVTRG RVASF WIQSM+ Sbjct: 121 LEEPEHYDGGELCVEDTYGVHRAKLPAGDMVLYPASSLHHVTPVTRGARVASFFWIQSMV 180 Query: 180 RDDKKRAMLFELDNNIQSLKS-RYGESEEILSLLNLYHNLLREWSE 224 RDD R +L++LD IQ L + + G +++L +YHNLLR W++ Sbjct: 181 RDDADRTLLYQLDTQIQRLTAEKGGRDASVIALTGIYHNLLRRWAD 226 >UniRef50_A8ESR5 PKHD-type hydroxylase Abu_0724 n=7 Tax=Proteobacteria RepID=Y724_ARCB4 Length = 226 Score = 246 bits (628), Expect = 4e-64, Method: Compositional matrix adjust. Identities = 116/226 (51%), Positives = 156/226 (69%), Gaps = 1/226 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ HIP VLS + + R L +A W+DG++T G Q KNN Q+ L L++ + Sbjct: 1 MILHIPEVLSKEQLTECRNLLNKANWIDGKITAGNQAINAKNNFQLAESDPLTNYLRDII 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLF 119 A+N + LF +AALP+ + +P FN+Y+N YG HVD ++ + RTD+S +LF Sbjct: 61 KTALNSNPLFISAALPKHIISPFFNKYENGGNYGNHVDNSILFDMNEKKAFRTDISCSLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 +DP+ Y+GGE+V+ DTFG H VKLPAGDL+LYPS+SLH V PVT+GVR+ SFMWIQSMI Sbjct: 121 FTDPEEYEGGEMVIEDTFGTHEVKLPAGDLILYPSTSLHRVEPVTKGVRMVSFMWIQSMI 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 R KR++LFELDN IQSL+ YGE EE L+L YH L++EWSE+ Sbjct: 181 RSAWKRSILFELDNTIQSLRVNYGEIEETLNLSIHYHKLIQEWSEL 226 >UniRef50_C5T9F3 2OG-Fe(II) oxygenase n=1 Tax=Acidovorax delafieldii 2AN RepID=C5T9F3_ACIDE Length = 229 Score = 225 bits (573), Expect = 9e-58, Method: Compositional matrix adjust. Identities = 111/229 (48%), Positives = 157/229 (68%), Gaps = 4/229 (1%) Query: 1 MMYHIPGVLSPQDVARFREQL-EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 M HI VL P+++A FR+ L A WVDG + G Q KNN Q+ S L A LQ Sbjct: 1 MFLHIKDVLPPEELAFFRQALGADAPWVDGARSAGGQAIHQKNNLQLAQGSELSAQLQAR 60 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNET-YGFHVDGAV-RSHPQNGWMRTDLSAT 117 V A++++ALFF+AALPR + PLFN Y + YG HVD AV SH N W+R+DLS T Sbjct: 61 VKAALHRNALFFSAALPRRIYNPLFNNYGDGTNFYGNHVDSAVMHSHADNCWVRSDLSCT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFL+ P+ Y+GGELV + FG+ R+KLPAGD++LYPSS++H V+PVTRG R++ F W++S Sbjct: 121 LFLTPPEDYEGGELVATEAFGEKRIKLPAGDMILYPSSTVHQVSPVTRGHRISCFFWVES 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESE-EILSLLNLYHNLLREWSEI 225 M+R ++R +LF++D ++ L+ +GE E +++L YHNLLR W+++ Sbjct: 181 MVRGLEQRQLLFDMDMSLLKLRQAHGEKEPSVIALSGTYHNLLRMWADV 229 >UniRef50_Q3SUS4 PKHD-type hydroxylase Nwi_0701 n=10 Tax=Proteobacteria RepID=Y701_NITWN Length = 226 Score = 223 bits (567), Expect = 5e-57, Method: Compositional matrix adjust. Identities = 110/225 (48%), Positives = 146/225 (64%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I VL+P ++ RFRE L QA+W DGR T G + K N+Q+ +L L + Sbjct: 1 MIQVISDVLTPDELKRFRELLGQAQWQDGRATAGHVAVRAKANEQLSHEDSLGQQLSEFL 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSATLF 119 L + + + F AAALP + P FNRY +YG H+D A+ S P G +R DLSATLF Sbjct: 61 LERLGKISHFIAAALPLKVLPPRFNRYTGGGSYGDHIDNAIFSVPGAGVRIRGDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 LS+P YDGGEL++ F +H+ KLPAG ++LYP+S+ H VTPVTRG R+A+F W QS++ Sbjct: 121 LSEPGDYDGGELIIQGEFARHQFKLPAGQMILYPASTFHQVTPVTRGARLAAFFWTQSLV 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 R+ +RA+LFELDN IQ+L E + L LYHNLLREWSE Sbjct: 181 REHSRRALLFELDNTIQALAQDNPEQPAVARLTGLYHNLLREWSE 225 >UniRef50_Q15TJ8 PKHD-type hydroxylase Patl_2273 n=6 Tax=Proteobacteria RepID=Y2273_PSEA6 Length = 227 Score = 222 bits (566), Expect = 6e-57, Method: Compositional matrix adjust. Identities = 106/225 (47%), Positives = 157/225 (69%), Gaps = 2/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I +LS ++V +F + L++ +W+DG+ T G+Q ++VK NQQ+D S L L+N V Sbjct: 1 MLTVIEDLLSKKEVTQFTQALDKGQWLDGKHTAGSQASKVKYNQQLDDGSALAIELRNTV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM-RTDLSATLF 119 + ++ +ALF ++ALP + P FNRYQ E YG HVD +V P + M RTDLSATLF Sbjct: 61 IRKLSGNALFMSSALPNKIYPPKFNRYQGGEHYGLHVDASVMPIPNSHQMLRTDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 LS+P++YDGGEL + FG ++KL AG ++LYP++SLH V PVT+G R ASF WI+S++ Sbjct: 121 LSEPKTYDGGELSIETQFGLQQIKLNAGSVILYPANSLHQVNPVTKGRRTASFFWIESLV 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESE-EILSLLNLYHNLLREWS 223 R + +R+MLF+LD +IQ+L G ++ E+ L +YHNL+R W+ Sbjct: 181 RSNDQRSMLFDLDQSIQALTVELGSNDAEVKRLTGVYHNLMRSWA 225 >UniRef50_B9TN05 PKHD-type hydroxylase ybiX, putative (Fragment) n=1 Tax=Ricinus communis RepID=B9TN05_RICCO Length = 204 Score = 205 bits (522), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 99/172 (57%), Positives = 124/172 (72%), Gaps = 3/172 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MM HIP +L +V + R+ L A+W DGR T G+QGAQVK NQQ+ S L L+ V Sbjct: 33 MMLHIPEILRTDEVKQLRDHLNSAQWSDGRATAGSQGAQVKQNQQLPENSPLMPELRQIV 92 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNN--ETYGFHVDGAVRSHPQN-GWMRTDLSAT 117 A+ +HAL+F+AALP LS P FNRY E YGFHVDGAVRS P + GWMRTDLSAT Sbjct: 93 EQALKRHALYFSAALPLRLSPPQFNRYAAAQLEHYGFHVDGAVRSFPAHPGWMRTDLSAT 152 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRV 169 LFL + Y+GG+L V DT+G+H V+LPAGD++LYPS+S+H VTP+TRG R+ Sbjct: 153 LFLCESDEYEGGDLTVRDTYGEHEVRLPAGDMILYPSTSVHSVTPLTRGARI 204 >UniRef50_Q087U3 PKHD-type hydroxylase Sfri_0612 n=2 Tax=Alteromonadales RepID=Y612_SHEFN Length = 226 Score = 204 bits (518), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 102/226 (45%), Positives = 144/226 (63%), Gaps = 2/226 (0%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M I +LS QDV +R+QL + W DGR T A VKNN Q D + L N++L Sbjct: 1 MIVIEQILSKQDVGAYRQQLAECPWGDGRKTAMGMAASVKNNNQADAQHANVRQLANQLL 60 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-WMRTDLSATLFL 120 + + +AALP + P FNRY E YG+HVD A+ P +R+D+S T+FL Sbjct: 61 ARIGETPKIVSAALPHKIFPPCFNRYNETEEYGYHVDAAIMRIPNTSEVIRSDVSMTVFL 120 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S+P+ YDGGELV+ FGQ ++KLPAG V+YPSSSLH VT VTRG R+A+ W+QSM+ Sbjct: 121 SEPEEYDGGELVIATEFGQQQIKLPAGYAVVYPSSSLHKVTAVTRGQRIAAITWMQSMVA 180 Query: 181 DDKKRAMLFELDNNIQSL-KSRYGESEEILSLLNLYHNLLREWSEI 225 D R L++LD +IQ+L K+ + E+ +L N+YHNL+R+++++ Sbjct: 181 DVTLRQTLYQLDQSIQNLIKANNTDRAELDNLHNVYHNLIRQFTQL 226 >UniRef50_UPI00006A2339 UPI00006A2339 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2339 Length = 185 Score = 188 bits (477), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 89/152 (58%), Positives = 110/152 (72%), Gaps = 1/152 (0%) Query: 75 LPRTLSTPLFNRYQNNETYGFHVDGAV-RSHPQNGWMRTDLSATLFLSDPQSYDGGELVV 133 LP PLFNRY YG HVDG+V R +R+D+S TLFLS+P+ Y+GGEL+V Sbjct: 34 LPLRTLLPLFNRYAGGGQYGLHVDGSVMRQLGSEQPLRSDVSTTLFLSEPEEYEGGELIV 93 Query: 134 NDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDN 193 DT+G+H VKLPAGD+++YPS+SLH VTPVTRG RVASF W QSM+R D +R LFELD Sbjct: 94 VDTYGEHEVKLPAGDMIVYPSTSLHRVTPVTRGARVASFFWTQSMVRQDSQRLRLFELDQ 153 Query: 194 NIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 IQ L+ R G+ EE+ SL YHNLLR W+E+ Sbjct: 154 AIQKLRLRLGDDEEVTSLTGHYHNLLRMWAEV 185 >UniRef50_A5WFM3 PKHD-type hydroxylase PsycPRwf_1523 n=1 Tax=Psychrobacter sp. PRwf-1 RepID=Y1523_PSYWF Length = 259 Score = 187 bits (475), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 102/258 (39%), Positives = 148/258 (57%), Gaps = 34/258 (13%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M++ I +L +++ L + A+W DG++T G Q KNN Q+ + Y A+ N Sbjct: 1 MLHIIENLLDTAQLSQLTSILTHQHAQWQDGKLTAGISAQQQKNNWQLSRQDPSYQAMAN 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-WMRTDLSAT 117 L A+ QH +F +AALP+ + PLF+ YQ + YG HVD A+++HP + MRTDLS T Sbjct: 61 LCLEALQQHPVFMSAALPKVIMPPLFSAYQLGQGYGMHVDNALQTHPDSKQLMRTDLSLT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFL++P Y+GGELV++D +G+H +KL AGD VLYPS+SLH V VT G R+A W+QS Sbjct: 121 LFLNNPADYEGGELVISDEYGEHSIKLSAGDAVLYPSTSLHRVNTVTSGQRLAMVTWVQS 180 Query: 178 MIRDDKKRAMLFELD-------------------NNIQSLKSRYGESEE----------- 207 ++R D++R +L +LD QS +++ G+ E Sbjct: 181 LVRSDEQRQILHDLDVSHILLRQKLLATSDQAQSTQAQSTQAQCGQLSEQHSTDQQLTHQ 240 Query: 208 -ILSLLNLYHNLLREWSE 224 I L YHNLLR W+E Sbjct: 241 AIEKLNQSYHNLLRLWAE 258 >UniRef50_B8GUF9 PKHD-type hydroxylase Tgr7_2199 n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=Y2199_THISH Length = 224 Score = 184 bits (468), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 98/225 (43%), Positives = 136/225 (60%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ IP +L +A R L A + DGR + GA +VK+N++VD T AL V Sbjct: 1 MLLTIPELLDAAQLAEIRRLLADAPFTDGRYSAGADARRVKHNEEVDPSDTRVRALNQLV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L + +H F AAALPR LS F RY YG HVD V P+ G RTD+S T+F+ Sbjct: 61 LMPLYRHETFQAAALPRKLSGAFFARYLPGMQYGAHVDDPVMG-PEGGRYRTDVSVTVFI 119 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +PQSY+GGELVV FG+ +VKLPAG V+YPSSSLH V+PVT G R+ + W +SM+R Sbjct: 120 GEPQSYEGGELVVETDFGEQQVKLPAGHAVIYPSSSLHRVSPVTGGERLVAVAWAESMVR 179 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 D +R ML+EL ++L+ ++E ++ NL+R W+++ Sbjct: 180 DPARRQMLYELYQVHEALRRDNPDAEVTRRAGHVRANLMRMWADV 224 >UniRef50_Q0AP20 PKHD-type hydroxylase Mmar10_1675 n=1 Tax=Maricaulis maris MCS10 RepID=Y1675_MARMM Length = 219 Score = 169 bits (429), Expect = 5e-41, Method: Compositional matrix adjust. Identities = 92/224 (41%), Positives = 137/224 (61%), Gaps = 6/224 (2%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ H+ V + V R+ + Q +VDG T G VK N+Q++ + + +++EV Sbjct: 1 MLIHLQKVCPSEQVDHLRDLIGQGGFVDGGTTAGQVARAVKANEQLEAGARV-DTVRSEV 59 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A+ HA F + A P+TLS L +RY++ YG H+D A+ G R DLS TLFL Sbjct: 60 RKALMAHAGFVSFARPKTLSRILVSRYRDGMAYGPHIDDAL-----MGGRRADLSFTLFL 114 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 SDP SYDGGELV++ G+ +KL AGD V+Y +S++H V PVTRG RVA W++S++R Sbjct: 115 SDPDSYDGGELVMDGPDGETEIKLAAGDAVVYATSAIHQVAPVTRGERVAVVGWVRSLVR 174 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R +LF+LD +L +R G++ E+ +L NLLR+W+E Sbjct: 175 RPDQREILFDLDQVSAALFARDGKTRELDLVLKTKANLLRQWAE 218 >UniRef50_Q2JHA7 PKHD-type hydroxylase CYB_2270 n=16 Tax=Cyanobacteria RepID=Y2270_SYNJB Length = 224 Score = 167 bits (422), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 89/225 (39%), Positives = 137/225 (60%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I VLS ++ + + AE+VDG +T G VKNN+Q+ S ++ + Sbjct: 1 MILCIGDVLSLAELQQILSLIADAEFVDGALTAGWNARLVKNNRQMPKGSLQQRKIEEII 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L A+ ++ LF AA P+ + + L + Y+ +YG H D A+ ++ MRTD+S TLFL Sbjct: 61 LAALERNLLFQMAARPKLIHSILISCYEAGMSYGTHTDDALMLD-RHQLMRTDISFTLFL 119 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S P+ YDGGEL + + G+ KLPAG L+LYP+S+LH V PVTRG+R A+ W+QS+IR Sbjct: 120 SAPEDYDGGELKIESSEGEQAYKLPAGALILYPASTLHRVEPVTRGIRYAAVSWVQSLIR 179 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 D ++R +LF+L Q + G++ + +Y NLLR+W+E+ Sbjct: 180 DPQEREILFDLQTVRQQMFQESGKTRHFDLISKVYANLLRKWAEL 224 >UniRef50_Q0C1R0 PKHD-type hydroxylase HNE_1625 n=4 Tax=Alphaproteobacteria RepID=Y1625_HYPNA Length = 224 Score = 167 bits (422), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 93/225 (41%), Positives = 124/225 (55%), Gaps = 2/225 (0%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M I +L + L + W DGR T GA +VK NQQ D S + ++ +L Sbjct: 1 MIVIENILGQDVLTEVAAALRELRWEDGRNTAGATARRVKRNQQADLSSRTGSKVREVLL 60 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 AV +H + A A P + PL + + YG H+D V + +RTDLS TLFLS Sbjct: 61 EAVKRHPVVEAYARPLKFAPPLISCSGEGDAYGLHIDNPVMGK-GDARLRTDLSFTLFLS 119 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 P+SYDGGEL + F VKLPAG +V+YPS+ LH VTPVT G R WIQS I+D Sbjct: 120 PPESYDGGELEIETVFKTESVKLPAGSMVIYPSTELHRVTPVTSGERFVFVGWIQSAIKD 179 Query: 182 DKKRAMLFELDNNIQSLKSRYGE-SEEILSLLNLYHNLLREWSEI 225 +RA+LF++ N L R+ S E+L+L NL+R WS+I Sbjct: 180 AAQRAILFDVTNLKAGLARRFPPGSPELLTLAKTESNLIRMWSDI 224 >UniRef50_A8TQK0 Putative hydroxylase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TQK0_9PROT Length = 223 Score = 160 bits (404), Expect = 4e-38, Method: Compositional matrix adjust. Identities = 90/224 (40%), Positives = 128/224 (57%), Gaps = 2/224 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I G+LS + V ++L A++VDG ++ G G +K N QV +S Y L V Sbjct: 1 MVAVIEGLLSKEQVQTIAKRLFGAQFVDGTLSGGPLGEAIKKNTQVSPQSPEYRELSQLV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L + Q+ AALP+ + +P+F Y YG HVD A+ P G MRTDLS T+FL Sbjct: 61 LGIMRQNDQVAIAALPKRILSPIFASYVEGNRYGEHVDAALMG-PYPG-MRTDLSITIFL 118 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +DP +YDGGELV+ FG+ K AGD VLYP+ +H V P+TRG R+A WI+SM+R Sbjct: 119 NDPGAYDGGELVLKTAFGEQIYKRAAGDAVLYPTHYVHRVNPITRGRRLAIVTWIESMVR 178 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 D +R ++ +L + L + E I + NLLR W++ Sbjct: 179 DPARREVIEDLAEAMDKLVRDGADGEIIRRVEKARLNLLRMWAD 222 >UniRef50_Q5QUG6 PKHD-type hydroxylase IL0759 n=2 Tax=Idiomarina RepID=Y759_IDILO Length = 218 Score = 145 bits (367), Expect = 7e-34, Method: Compositional matrix adjust. Identities = 83/228 (36%), Positives = 130/228 (57%), Gaps = 13/228 (5%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I + V L+ ++ DG+ T G VKNNQQ+ + + A + Sbjct: 1 MILQISNAVDTDTVKSIVAGLDAGQFSDGKKTAGWAAKDVKNNQQLSGKKS--EAATQVL 58 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L+ + Q+AL + P+ ++ NRYQ E YG H+D ++ NG +RTD+S TL L Sbjct: 59 LDRLQQNALVQSVMRPKQVARVTINRYQQGEYYGTHMDDSL----MNG-VRTDISFTLGL 113 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S ++GGELV+ D G+ +L GD+++YPS LH V PVT+G R+A W+QS+++ Sbjct: 114 SPLSDFEGGELVIEDASGERSWRLGQGDILMYPSHYLHRVNPVTKGSRLAMIGWVQSLVK 173 Query: 181 DDKKRAMLFELDNNIQSLKSRY---GESEEILSLLNLYHNLLREWSEI 225 R +LF+++ QSLK+ + G+SE L ++HNLLREWS++ Sbjct: 174 QPNYRELLFDIE---QSLKAEFDANGKSENFDRLTKVFHNLLREWSDV 218 >UniRef50_Q47YL9 PKHD-type hydroxylase CPS_3426 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y3426_COLP3 Length = 223 Score = 142 bits (357), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 78/223 (34%), Positives = 122/223 (54%), Gaps = 2/223 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ +P VLSP VA + +E + G+ T G VKNN Q + L +Q + Sbjct: 1 MITKLPQVLSPIQVASIIQLIEHGSFNSGKDTAGWHAKAVKNNLQWQGETELNEQIQTGI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A+ QH F AA +++ + + YG H+D A+ + +RTD+S TLFL Sbjct: 61 QGALTQHPQFTGAAYAKSMMPFIISESTLGGGYGDHIDDALMVN--ETVLRTDISCTLFL 118 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 + PQ Y+GGELV+N + + KL AGD ++YPS++LH V PVT G R + WI+S I Sbjct: 119 TPPQDYEGGELVMNLSGMEMAFKLNAGDAIIYPSTTLHRVNPVTSGSRKVALTWIESHIP 178 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 +R +LF+LD + + +G+++ + + NLLR+W+ Sbjct: 179 QASQREILFDLDCARKDIMEHHGKTDAFDRITKTHANLLRQWA 221 >UniRef50_Q3AJA6 PKHD-type hydroxylase Syncc9605_1577 n=13 Tax=Cyanobacteria RepID=Y1577_SYNSC Length = 222 Score = 140 bits (352), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 81/200 (40%), Positives = 114/200 (57%), Gaps = 8/200 (4%) Query: 26 WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFN 85 W DGR+T G Q A VK N Q+D + L A+ N + A+ L + +L R + + L + Sbjct: 28 WRDGRLTAGDQAALVKKNYQLDPNAELSLAISNCISTALTSDPLVKSFSLVRKVHSLLVS 87 Query: 86 RYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ--HRVK 143 R E+YG+HVD +NG R DLS T FLSD SY+GG L++ T G+ + Sbjct: 88 RSSAGESYGWHVDNPFS---RNG--RRDLSFTCFLSDEDSYEGGSLMIQ-TGGEDTKEFR 141 Query: 144 LPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYG 203 LP G +VLYPSS+LHCVTPV G R WI+S ++ R+MLF +D + L +R+G Sbjct: 142 LPPGQVVLYPSSTLHCVTPVLSGDRYVCVGWIESYVKAADDRSMLFNIDAGARGLLARHG 201 Query: 204 ESEEILSLLNLYHNLLREWS 223 S+E+ + Y N +R S Sbjct: 202 RSDELDLIFQSYTNAVRRLS 221 >UniRef50_A3PE10 PKHD-type hydroxylase P9301_13621 n=4 Tax=Prochlorococcus marinus RepID=Y1362_PROM0 Length = 222 Score = 135 bits (339), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 74/227 (32%), Positives = 124/227 (54%), Gaps = 8/227 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLE---QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 M Y I +L+ +++ +++L+ Q +W DG+ T G+ + VKNN Q++ + + Sbjct: 1 MNYLIHQLLNAEEINLIKKELDKCSQQDWEDGKKTAGSHASMVKNNLQLNRNTEVSKKNA 60 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 V + L + +LP+ + +F + N YG H+D +P R+DLS T Sbjct: 61 QLVTKKILSSQLIKSFSLPKKIHGIMFTKSSKNMHYGRHID-----NPYMSSGRSDLSFT 115 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 + L++ YDGGEL++ + + KL G+++LYPSS LH V V G R+ WI+S Sbjct: 116 ISLTNKDFYDGGELIIETMNTEEKFKLNPGEIILYPSSYLHAVNEVNNGERLVCVGWIES 175 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 ++ +KR LF+LD +SL S++G S+E+ + Y NLLR+ E Sbjct: 176 YVKSTEKREYLFDLDAGARSLLSKHGRSDELDLIFKSYSNLLRDIGE 222 >UniRef50_C7R7F4 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R7F4_KANKD Length = 218 Score = 131 bits (330), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 78/227 (34%), Positives = 120/227 (52%), Gaps = 12/227 (5%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQ---VDTRSTLYAALQ 57 M+ + ++ P + E++ + ++ G+ T G +KNNQQ VD + A L Sbjct: 1 MILQLSDIIEPNTLNVICEEVAKLDFHSGQQTAGKAVRSLKNNQQILLVDDQPAPLAML- 59 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + +F AA LP+ + + NRYQ YG H+D A + +RTD+S T Sbjct: 60 ---FRHLQKSPIFQAACLPKQFARVMLNRYQQGMQYGNHIDDAYIAG-----VRTDVSFT 111 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LS Y+GGELV+ D+ G+ KL G++++YPSS LH V PVT G R+A W+QS Sbjct: 112 YCLSSTSDYNGGELVLCDSTGERSWKLDKGEVLIYPSSYLHRVNPVTEGTRIAMVGWLQS 171 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 + D +R +LF+L + G+SE+ L Y NLLR W++ Sbjct: 172 KVGDASQRELLFDLKQAVTHELETQGKSEQYDRLSKSYSNLLRMWAD 218 >UniRef50_A3YZT5 Putative uncharacterized protein n=2 Tax=Chroococcales RepID=A3YZT5_9SYNE Length = 221 Score = 129 bits (323), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 88/226 (38%), Positives = 123/226 (54%), Gaps = 7/226 (3%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M + + +L P V + L E A W G T G VK N Q++ S L+A L Sbjct: 1 MRFVLEPLLQPHQVEDWCLALSSEHASWRPGAETAGWHARSVKRNHQLERGSPLHAQLAE 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 ++ +A+ H L AAALP ++ LF+R E YG HVD A + R+DLS TL Sbjct: 61 QLQSALLAHPLLLAAALPVSIHGVLFSRSTRGEGYGSHVDNAYMAGG-----RSDLSFTL 115 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 FLSDP +Y GGELV+ + ++ PAG ++YPS+ LH V PV G R+ + WIQS Sbjct: 116 FLSDPDTYSGGELVLEGPADEEALRCPAGHALVYPSTQLHRVEPVRDGQRLVAVGWIQSR 175 Query: 179 IRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R +R +LFELD +++ R G+ E + Y NLLR+W E Sbjct: 176 VRRADQRELLFELDTARRAIFKRDGKDEVFDLISRSYTNLLRQWGE 221 >UniRef50_Q1GXG2 PKHD-type hydroxylase Mfla_0096 n=1 Tax=Methylobacillus flagellatus KT RepID=Y096_METFK Length = 176 Score = 129 bits (323), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 60/133 (45%), Positives = 89/133 (66%), Gaps = 1/133 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ IP V +P++ R++L+ EW+DG+VT G Q A+ KNN Q+ L L + + Sbjct: 1 MLITIPEVFTPEEAESIRQRLDATEWLDGKVTAGYQSAKAKNNLQLAENHPLAIELGDLI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS-HPQNGWMRTDLSATLF 119 ++ + QH LF +AALPR + PLFNRY++ +++GFH+D AVRS +RTDLS+TLF Sbjct: 61 VSRLTQHPLFMSAALPRKVFPPLFNRYESGQSFGFHIDNAVRSLSGSRERVRTDLSSTLF 120 Query: 120 LSDPQSYDGGELV 132 + P+ YDGGEL+ Sbjct: 121 FTPPEDYDGGELI 133 >UniRef50_A7HP27 PKHD-type hydroxylase Plav_0037 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=Y037_PARL1 Length = 219 Score = 128 bits (321), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 84/226 (37%), Positives = 119/226 (52%), Gaps = 10/226 (4%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M I G+L D+ R + + ++ + G T G VKNN+Q + L A L Sbjct: 1 MFIEIAGILGAADL-RLADTVFAQKDAFESGARTAGRIARAVKNNEQAKP-AGLAADLTM 58 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V + ++ +F AAA PR L +RY YG H D A R DLS TL Sbjct: 59 LVEKRLMKNDVFRAAARPRNFIRILLSRYTQGMAYGLHSDDAFMER-----QRVDLSFTL 113 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 FLS P+SY+GGEL+V + G+ VKL AG LVLYPS++LH V VT G R A+ WI+S+ Sbjct: 114 FLSPPESYEGGELIVEEPAGERLVKLEAGSLVLYPSATLHRVAEVTSGERRAAVGWIRSL 173 Query: 179 IRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R + R LF++ ++ ++ G+ LL + +LLR W E Sbjct: 174 VRSAEDRETLFDVALALRQAEA-AGDRALTDRLLKIQGSLLRRWGE 218 >UniRef50_Q0I9X3 PKHD-type hydroxylase sync_1544 n=2 Tax=Synechococcus RepID=Y1544_SYNS3 Length = 220 Score = 124 bits (312), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 72/214 (33%), Positives = 115/214 (53%), Gaps = 6/214 (2%) Query: 8 VLSPQDVARFREQL-EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQ 66 +L R E+L +AEW+DG +T GA K N Q++ S L + V A+ Sbjct: 8 ILDQATCERLLERLANEAEWLDGSLTAGAHAKGGKRNFQINYDSALRKEIHELVERAMWN 67 Query: 67 HALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSY 126 H + LPR L L ++ + Y HVD A S R+DLS TL L+D Y Sbjct: 68 HPVVKGFCLPRKLHRFLISKTEKEGGYDTHVDNAYMSSG-----RSDLSFTLSLTDDTMY 122 Query: 127 DGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRA 186 +GG+L ++ + +KL G++++YPS+SLH V VT G+R WI+S ++ + R Sbjct: 123 EGGDLEIDSISESYPIKLKQGEILIYPSTSLHRVCNVTSGIRTVCVGWIESYVQAENDRI 182 Query: 187 MLFELDNNIQSLKSRYGESEEILSLLNLYHNLLR 220 LF+L++ +++ +++G S+E+ + Y NLLR Sbjct: 183 CLFQLESGARAVLAKHGRSDELDLIFLAYTNLLR 216 >UniRef50_Q1GRV0 PKHD-type hydroxylase Sala_1910 n=1 Tax=Sphingopyxis alaskensis RepID=Y1910_SPHAL Length = 218 Score = 121 bits (303), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 78/221 (35%), Positives = 116/221 (52%), Gaps = 5/221 (2%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M+ + +L V R+ +VDG+++ ++VKNN Q+ + Y +L Sbjct: 1 MFKLVQLLGDNAVRALRDIAASGTFVDGKISN--PHSRVKNNLQLHDAAA-YERSSKILL 57 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 +A+ Q+A F + P ++ PL RY YG H D A P +G +RTD+S T+FLS Sbjct: 58 DAMIQNADFMEFSFPARIAPPLLTRYTPGMHYGLHPDAAYIPLP-DGQLRTDVSCTIFLS 116 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 DP YDGG L V R K G ++YPS +LH V PVTRG R+ + +IQS+I D Sbjct: 117 DPADYDGGALHVQLGNADLRFKEAPGVAIVYPSHTLHEVEPVTRGERLVAITFIQSLIPD 176 Query: 182 DKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREW 222 ++R ++ EL N I +L+ E L + + LLR W Sbjct: 177 VQQRNLMHEL-NEIAALEGGKMEPANYTRLQAVQYQLLRMW 216 >UniRef50_Q05TP1 Putative hydroxylase n=1 Tax=Synechococcus sp. RS9916 RepID=Q05TP1_9SYNE Length = 222 Score = 105 bits (262), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 64/200 (32%), Positives = 98/200 (49%), Gaps = 11/200 (5%) Query: 26 WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFN 85 WVDG+ TTG+ K N Q+ + L+ + + + F + +P+ + L + Sbjct: 29 WVDGKTTTGSHAKTKKINLQLKPDTQENKELERAIRERLRNNPSFKSFCIPKKMHHNLIS 88 Query: 86 RYQNNETYGFHVDGAVRSHPQNGWMRT---DLSATLFLSDPQSYDGGELVVNDTFGQHRV 142 R + YG HVD N +M+T D+S T+ LS + Y GGELV++ V Sbjct: 89 RTEAGGGYGTHVD--------NAFMKTGRADISYTICLSSEKDYKGGELVIHGATETTTV 140 Query: 143 KLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRY 202 K+ G +YPS+ LH V VT G+R+A W+QS I + R LF L+ L + Sbjct: 141 KMKQGHAFIYPSNQLHQVNTVTSGIRLACIGWVQSYIASQELRMNLFNLEAGANYLLATQ 200 Query: 203 GESEEILSLLNLYHNLLREW 222 G SE + + + NLLR + Sbjct: 201 GRSEALDRIFLAHANLLRSF 220 >UniRef50_A8TKW7 2OG-Fe(II) oxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TKW7_9PROT Length = 238 Score = 102 bits (254), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 78/215 (36%), Positives = 112/215 (52%), Gaps = 14/215 (6%) Query: 2 MYHIPGVLSPQDVARFRE--QLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 +Y I +L+P VA R Q E A WVDG+ T G G + K N ++ S L L ++ Sbjct: 4 VYPIRNLLAPGLVAELRAALQAEGAPWVDGQQTVGRDGTK-KRNHEIAADSPLRQELSDK 62 Query: 60 VLNAV-----NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAV--RSHPQNGWMRT 112 V + N+ F PR S LF+R Y H+D AV R P+ MR+ Sbjct: 63 VSAYLRGPLTNETLAFRHVCDPRRWSPFLFSRTGPGGGYRDHMDSAVMFRGSPEE--MRS 120 Query: 113 DLSATLFLSDPQSYDGGELVVN-DTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 DLS T+FL++P SY GGELVV+ D K+ AG VLY ++++H V VT G R+ + Sbjct: 121 DLSMTIFLTEPDSYQGGELVVDSDMPYAPTFKMAAGGAVLYATNAIHRVAEVTAGERLVA 180 Query: 172 FMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESE 206 +WI+S I D R + +L + S+ +R G + Sbjct: 181 VIWIESRIADVGTRQINADL-LQVMSVLTRDGACD 214 >UniRef50_Q5GQB0 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQB0_BPSYP Length = 231 Score = 100 bits (250), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 69/229 (30%), Positives = 110/229 (48%), Gaps = 16/229 (6%) Query: 8 VLSPQDVARFREQLEQAEWVDGRVTTGAQGA---QVKNNQQVDTRSTLYAALQNEVLNAV 64 +L+ +V R EQ ++A DG V + + Q+KN++++D ++ Y + + A+ Sbjct: 7 LLTQDEVRRINEQYDKAALKDGTVKINLENSVEKQLKNSKEIDGNTSHYRYCLDLIQKAM 66 Query: 65 NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQ 124 ++ALF + ++ P+ Y Y HVD QN +RTD S TLFLS+P Sbjct: 67 RRNALFKTTYILGEITPPIMVEYAEGCYYIPHVDSI---QIQN--LRTDHSMTLFLSEPD 121 Query: 125 SYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKK 184 Y+GGELV+ K AG +++YP+ LH V P+T G R S MW S+I D Sbjct: 122 EYEGGELVIGIGDVAKSFKEKAGTVIMYPTGMLHEVRPITSGKRRVSVMWATSIIDDTFM 181 Query: 185 RAMLFELDNNIQSLKSRYGESEE--------ILSLLNLYHNLLREWSEI 225 R L ++ + E E+ ++ L + N LR + I Sbjct: 182 RHELINFGMGLKKILDYLEEKEDDQLKIQELLIPLEQVRSNFLRGYGNI 230 >UniRef50_Q111M8 2OG-Fe(II) oxygenase n=2 Tax=Trichodesmium erythraeum IMS101 RepID=Q111M8_TRIEI Length = 210 Score = 53.9 bits (128), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 38/153 (24%), Positives = 66/153 (43%), Gaps = 10/153 (6%) Query: 27 VDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNR 86 VDG++ + V ++ L+ L N V A N+ + + ++ + Sbjct: 58 VDGKIKPEIRQVNVWGLSYSESTRWLWEKLINSVKYANNKWWNYDIYGIMDSMQLLCYEA 117 Query: 87 YQNNET----YGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRV 142 +N E+ Y H+D + +S ++ LSDPQ Y+G EL + + Sbjct: 118 SKNQESIQDHYNKHIDVG------EAYYYRKISISIQLSDPQDYEGSELKLYTRREAENL 171 Query: 143 KLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 G ++L+PS LH VTP+ +G R A W+ Sbjct: 172 PKARGTMILFPSFVLHEVTPIIKGKRWALVCWV 204 >UniRef50_C7BVH5 2OG-Fe(II) oxygenase family like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVH5_9CAUD Length = 215 Score = 50.8 bits (120), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 31/84 (36%), Positives = 45/84 (53%), Gaps = 6/84 (7%) Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRV--KLPAGDLVLYPSSSLHCVTPVTRGVR 168 RTD + + L+D Y+GGE + G R+ K+ G ++YP+ +H V PVT GVR Sbjct: 102 RTDYTCVVNLND--DYEGGEHYIQ--IGTERIEKKVEPGKALIYPTEFIHGVNPVTSGVR 157 Query: 169 VASFMWIQSMIRDDKKRAMLFELD 192 W++S I D R L EL+ Sbjct: 158 KCLTFWMESSIVDPTIRYYLAELN 181 >UniRef50_C7BVA8 2OG-Fe(II) oxygenase superfamily like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVA8_9CAUD Length = 206 Score = 50.4 bits (119), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 12/132 (9%) Query: 52 LYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ---NG 108 +Y L+ +L A F TL F +Y + Y +H D +P N Sbjct: 71 IYNRLKKYILAANKNAGWNFNVDHTETLQ---FTKYDVGQFYDWHPDQHHYLYPDDDTNE 127 Query: 109 WMR---TDLSATLFLSDPQSYDGGELVV--NDTFGQHRVKLPA-GDLVLYPSSSLHCVTP 162 MR LS TL L+DP ++GG+L N + KL + G L+++PS H VTP Sbjct: 128 NMRGKYRKLSTTLLLNDPSEFEGGDLEFHFNMKETEKATKLNSKGSLIVFPSFVYHRVTP 187 Query: 163 VTRGVRVASFMW 174 +T+G R + W Sbjct: 188 ITKGTRYSLVSW 199 >UniRef50_B0C441 2OG-Fe(II) oxygenase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C441_ACAM1 Length = 226 Score = 49.7 bits (117), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 37/99 (37%), Positives = 53/99 (53%), Gaps = 11/99 (11%) Query: 80 STPLFNRYQNNETYGFHVD-GAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG 138 S+ Y+ Y +H D G+ RS + LS ++ LSDP++Y GG L ++ T Sbjct: 129 SSIQLTEYEPGGHYTWHQDIGSRRSGLRK------LSVSVQLSDPETYVGGGLELHAT-- 180 Query: 139 QHRVKLP--AGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 Q V +P G LV++PS +LH VT +T G R A WI Sbjct: 181 QKPVMMPRSRGTLVIFPSYTLHRVTAMTEGTRRALVTWI 219 >UniRef50_A3PDM8 Putative uncharacterized protein n=2 Tax=root RepID=A3PDM8_PROM0 Length = 186 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 8/94 (8%) Query: 87 YQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHR----- 141 Y + Y +HVD + + G +R +S TLF+++P Y+GGE + + F + Sbjct: 88 YSDGGKYDWHVDQGAKMFLKGGSVRK-ISMTLFINNPDEYEGGEFDL-ELFPPEKEPRYE 145 Query: 142 -VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 KL G + + S H V PV+ GVR + W Sbjct: 146 TFKLKKGSAIFFQSDVWHRVRPVSSGVRKSLVAW 179 >UniRef50_A5VDF3 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=2 Tax=Sphingomonadales RepID=A5VDF3_SPHWW Length = 363 Score = 46.6 bits (109), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 25/77 (32%), Positives = 45/77 (58%), Gaps = 3/77 (3%) Query: 126 YDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA--SFMWIQSMIRDDK 183 Y+GG+L+ + FGQ R + P G V++ S LH TPVTRG R A F++ ++ R + Sbjct: 286 YEGGDLIFPE-FGQRRYRAPTGGAVVFSCSLLHEATPVTRGKRYAYLPFLYDEAAARQRE 344 Query: 184 KRAMLFELDNNIQSLKS 200 + A ++ ++ + ++ Sbjct: 345 ENARSGKVGADLATYRA 361 >UniRef50_Q98E25 Mlr4439 protein n=1 Tax=Mesorhizobium loti RepID=Q98E25_RHILO Length = 192 Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust. Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 13/108 (12%) Query: 78 TLSTPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLFLS------DPQSYDGGE 130 T P F Y+ + + H DG H ++ + + +SA +FL+ P+ Y GG Sbjct: 83 TCEKPQFLHYREGDFFVPHQDGNTPLIHDESRFRK--ISAVIFLNRQSDDPSPEDYSGGS 140 Query: 131 LVVNDTFGQH--RVKLPA--GDLVLYPSSSLHCVTPVTRGVRVASFMW 174 LV++ + RV +PA G LV + S + H VTPVTR R W Sbjct: 141 LVLHGPYSGPNLRVTMPALPGSLVAFRSETTHEVTPVTRNERFTIVSW 188 >UniRef50_A5VC31 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas wittichii RW1 RepID=A5VC31_SPHWW Length = 186 Score = 43.1 bits (100), Expect = 0.007, Method: Compositional matrix adjust. Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 6/66 (9%) Query: 114 LSATLFLSDPQSYDGGELVVNDTFG-QHRVKL--PAGDLVLYPSSSLHCVTPVTRGVRVA 170 LS + LSDP Y+GG + FG QH L P G L+++PS H V PVT G+R + Sbjct: 119 LSLVVQLSDPADYEGGAF---EFFGLQHPGALFAPRGSLLIFPSWMQHRVLPVTGGIRRS 175 Query: 171 SFMWIQ 176 W++ Sbjct: 176 LVSWVE 181 >UniRef50_Q9NDP6 Leprecan n=1 Tax=Ciona intestinalis RepID=Q9NDP6_CIOIN Length = 412 Score = 43.1 bits (100), Expect = 0.007, Method: Compositional matrix adjust. Identities = 26/80 (32%), Positives = 41/80 (51%), Gaps = 9/80 (11%) Query: 100 AVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLPAGDLVLYPS-- 154 ++ P W D SA L+L+D ++GGE ++ D + +V+ G LV + + Sbjct: 310 CLKERPAYTW--RDYSAILYLND--EFEGGEFIMTDATARRVKVQVRPKCGRLVSFSAGK 365 Query: 155 SSLHCVTPVTRGVRVASFMW 174 LH V PVT+G R A +W Sbjct: 366 ECLHGVKPVTKGRRCAMALW 385 >UniRef50_Q2S7M4 Uncharacterized iron-regulated protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S7M4_HAHCH Length = 182 Score = 41.6 bits (96), Expect = 0.019, Method: Compositional matrix adjust. Identities = 42/180 (23%), Positives = 79/180 (43%), Gaps = 33/180 (18%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQ----VKNNQQVDTRSTLYAALQ 57 ++ I G L+ + + E + + + T A+G+Q ++NN +V A + Sbjct: 10 VFAIQGFLTAHECDAYISDSEAMGYDEAEIQT-ARGSQMYKDIRNNDRVIFDD---AVMA 65 Query: 58 NEVLNAVNQHALFFAAALPRTL---------STPLFNRYQNNETYGFHVDGAVRSHPQNG 108 N + N + A LP+ L F RY+ + + +H DG+ + Sbjct: 66 NNIFNRIE-------AMLPQELDGWELVGLNERLRFYRYEPGQYFKWHRDGSYARSEKEA 118 Query: 109 WMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVR 168 + LS +FL+ + Y+GGE+ F ++K G +V++P + +H T V GV+ Sbjct: 119 SL---LSFLIFLN--EDYEGGEI----AFRWDKIKPERGSVVVFPHAMMHQGTTVESGVK 169 >UniRef50_B4RHL4 Putative uncharacterized protein n=2 Tax=Phenylobacterium zucineum HLK1 RepID=B4RHL4_PHEZH Length = 365 Score = 41.2 bits (95), Expect = 0.025, Method: Compositional matrix adjust. Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%) Query: 124 QSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 + Y+GG+L + FG + P G V++ S LH TPVTRG R AS ++ Sbjct: 279 EDYEGGDLRFPE-FGSRTYRAPTGGAVVFSCSLLHEATPVTRGRRYASLPFL 329 >UniRef50_C6B8G5 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=3 Tax=Alphaproteobacteria RepID=C6B8G5_RHILS Length = 380 Score = 40.0 bits (92), Expect = 0.057, Method: Compositional matrix adjust. Identities = 25/71 (35%), Positives = 37/71 (52%), Gaps = 3/71 (4%) Query: 100 AVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHC 159 A R + G + ++ L+D +DGGE+ + +G K PAG V++ S LH Sbjct: 276 AHRDNTTKGTAHRRFAVSVNLND--DFDGGEVSFPE-YGSRSFKAPAGGAVIFSCSLLHA 332 Query: 160 VTPVTRGVRVA 170 V+ VTRG R A Sbjct: 333 VSKVTRGRRYA 343 >UniRef50_Q0FVH2 Oxidoreductase domain protein n=9 Tax=Rhodobacterales RepID=Q0FVH2_9RHOB Length = 204 Score = 40.0 bits (92), Expect = 0.062, Method: Compositional matrix adjust. Identities = 23/64 (35%), Positives = 32/64 (50%) Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 R L+ + LS+P +Y GG L V + + G L+PS LH VTPV G R + Sbjct: 134 RRKLTMVVQLSEPGAYRGGALEVMPSAHTVEAERARGSATLFPSYLLHRVTPVEAGERRS 193 Query: 171 SFMW 174 +W Sbjct: 194 MTIW 197 >UniRef50_Q2T4K0 Oxidoreductase domain protein n=1 Tax=Burkholderia thailandensis E264 RepID=Q2T4K0_BURTA Length = 97 Score = 40.0 bits (92), Expect = 0.066, Method: Compositional matrix adjust. Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 5/65 (7%) Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA--GDLVLYPSSSLHCVTPVTRGVRVA 170 L+ + LS+P Y+GG+L + FG P G ++ PS H VTPV GVR Sbjct: 30 KLTVIVQLSEPHEYEGGDL---EVFGSSIAVAPRHRGSIICLPSFVEHRVTPVVAGVRRV 86 Query: 171 SFMWI 175 WI Sbjct: 87 LVAWI 91 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B5YS96 PKHD-type hydroxylase ybiX n=227 Tax=Bacteria Re... 338 8e-92 UniRef50_A8ESR5 PKHD-type hydroxylase Abu_0724 n=7 Tax=Proteobac... 322 6e-87 UniRef50_Q1LIS6 PKHD-type hydroxylase Rmet_3078 n=4 Tax=Burkhold... 309 6e-83 UniRef50_C5T9F3 2OG-Fe(II) oxygenase n=1 Tax=Acidovorax delafiel... 306 5e-82 UniRef50_Q15TJ8 PKHD-type hydroxylase Patl_2273 n=6 Tax=Proteoba... 303 2e-81 UniRef50_A3P5N3 PKHD-type hydroxylase BURPS1106A_A1609 n=30 Tax=... 301 1e-80 UniRef50_Q2JHA7 PKHD-type hydroxylase CYB_2270 n=16 Tax=Cyanobac... 297 2e-79 UniRef50_Q3SUS4 PKHD-type hydroxylase Nwi_0701 n=10 Tax=Proteoba... 294 1e-78 UniRef50_Q47YL9 PKHD-type hydroxylase CPS_3426 n=1 Tax=Colwellia... 290 3e-77 UniRef50_B8GUF9 PKHD-type hydroxylase Tgr7_2199 n=1 Tax=Thioalka... 286 4e-76 UniRef50_Q087U3 PKHD-type hydroxylase Sfri_0612 n=2 Tax=Alteromo... 280 3e-74 UniRef50_Q0C1R0 PKHD-type hydroxylase HNE_1625 n=4 Tax=Alphaprot... 278 1e-73 UniRef50_A3PE10 PKHD-type hydroxylase P9301_13621 n=4 Tax=Prochl... 275 9e-73 UniRef50_Q5QUG6 PKHD-type hydroxylase IL0759 n=2 Tax=Idiomarina ... 274 1e-72 UniRef50_Q0AP20 PKHD-type hydroxylase Mmar10_1675 n=1 Tax=Marica... 273 2e-72 UniRef50_A5WFM3 PKHD-type hydroxylase PsycPRwf_1523 n=1 Tax=Psyc... 273 3e-72 UniRef50_A8TQK0 Putative hydroxylase n=1 Tax=alpha proteobacteri... 270 3e-71 UniRef50_C7R7F4 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis... 263 3e-69 UniRef50_Q3AJA6 PKHD-type hydroxylase Syncc9605_1577 n=13 Tax=Cy... 255 7e-67 UniRef50_Q0I9X3 PKHD-type hydroxylase sync_1544 n=2 Tax=Synechoc... 255 8e-67 UniRef50_A3YZT5 Putative uncharacterized protein n=2 Tax=Chrooco... 254 1e-66 UniRef50_Q05TP1 Putative hydroxylase n=1 Tax=Synechococcus sp. R... 245 1e-63 UniRef50_Q1GRV0 PKHD-type hydroxylase Sala_1910 n=1 Tax=Sphingop... 237 2e-61 UniRef50_A7HP27 PKHD-type hydroxylase Plav_0037 n=1 Tax=Parvibac... 237 2e-61 UniRef50_B9TN05 PKHD-type hydroxylase ybiX, putative (Fragment) ... 233 3e-60 UniRef50_UPI00006A2339 UPI00006A2339 related cluster n=1 Tax=Xen... 221 1e-56 UniRef50_Q5GQB0 Putative uncharacterized protein n=1 Tax=Synecho... 217 2e-55 UniRef50_A8TKW7 2OG-Fe(II) oxygenase n=1 Tax=alpha proteobacteri... 208 1e-52 UniRef50_Q1GXG2 PKHD-type hydroxylase Mfla_0096 n=1 Tax=Methylob... 178 1e-43 UniRef50_Q111M8 2OG-Fe(II) oxygenase n=2 Tax=Trichodesmium eryth... 147 3e-34 UniRef50_C7BVH5 2OG-Fe(II) oxygenase family like protein n=1 Tax... 134 2e-30 UniRef50_C7BVA8 2OG-Fe(II) oxygenase superfamily like protein n=... 125 1e-27 UniRef50_A3PDM8 Putative uncharacterized protein n=2 Tax=root Re... 114 2e-24 UniRef50_Q98E25 Mlr4439 protein n=1 Tax=Mesorhizobium loti RepID... 108 1e-22 UniRef50_B0C441 2OG-Fe(II) oxygenase, putative n=1 Tax=Acaryochl... 108 1e-22 UniRef50_A5VDF3 Alkyl hydroperoxide reductase/ Thiol specific an... 101 2e-20 Sequences not found previously or not previously below threshold: UniRef50_Q0QZ85 2OG-Fe(II) oxygenase superfamily n=1 Tax=Synecho... 99 1e-19 UniRef50_A5VC31 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas wittic... 87 3e-16 UniRef50_A0NVC8 Oxidoreductase domain protein n=2 Tax=Labrenzia ... 83 5e-15 UniRef50_Q0FVH2 Oxidoreductase domain protein n=9 Tax=Rhodobacte... 83 9e-15 UniRef50_Q5GQX6 Putative uncharacterized protein n=1 Tax=Synecho... 80 6e-14 UniRef50_B4RHL4 Putative uncharacterized protein n=2 Tax=Phenylo... 80 6e-14 UniRef50_B0CEH1 Putative uncharacterized protein n=1 Tax=Acaryoc... 79 1e-13 UniRef50_A8TIV2 Putative uncharacterized protein n=1 Tax=alpha p... 77 4e-13 UniRef50_A8TUG3 Putative uncharacterized protein n=1 Tax=alpha p... 76 9e-13 UniRef50_A6D9X4 Putative uncharacterized protein n=1 Tax=Caminib... 75 1e-12 UniRef50_A8TW57 Putative uncharacterized protein n=1 Tax=alpha p... 75 2e-12 UniRef50_Q2T4K0 Oxidoreductase domain protein n=1 Tax=Burkholder... 73 8e-12 UniRef50_A8LEY1 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Frank... 72 1e-11 UniRef50_Q8DKV0 Tlr0755 protein n=1 Tax=Thermosynechococcus elon... 72 2e-11 UniRef50_Q2S7M4 Uncharacterized iron-regulated protein n=1 Tax=H... 72 2e-11 UniRef50_C6B8G5 Alkyl hydroperoxide reductase/ Thiol specific an... 70 6e-11 UniRef50_Q58MI5 Putative uncharacterized protein n=1 Tax=Prochlo... 70 6e-11 UniRef50_B8GU65 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp... 69 1e-10 UniRef50_A8TVG3 Putative uncharacterized protein n=1 Tax=alpha p... 69 1e-10 UniRef50_A3WCU8 Putative uncharacterized protein n=1 Tax=Erythro... 68 2e-10 UniRef50_C7JE98 Putative uncharacterized protein n=8 Tax=Acetoba... 68 2e-10 UniRef50_B5VUP4 Alkyl hydroperoxide reductase/ Thiol specific an... 68 3e-10 UniRef50_Q58MX3 Putative uncharacterized protein n=1 Tax=Prochlo... 67 3e-10 UniRef50_Q54PP0 Putative uncharacterized protein n=2 Tax=Dictyos... 64 4e-09 UniRef50_A6G260 Uncharacterized iron-regulated protein n=1 Tax=P... 63 7e-09 UniRef50_A5EM79 Putative uncharacterized protein n=1 Tax=Bradyrh... 63 9e-09 UniRef50_A5V2J9 Putative uncharacterized protein n=1 Tax=Sphingo... 63 9e-09 UniRef50_UPI00016C3A48 hypothetical protein GobsU_06128 n=1 Tax=... 62 1e-08 UniRef50_Q08MC8 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 62 2e-08 UniRef50_A4EST3 Uncharacterized iron-regulated protein n=1 Tax=R... 61 3e-08 UniRef50_B9F6P1 Putative uncharacterized protein n=3 Tax=Poaceae... 60 5e-08 UniRef50_UPI000187D48A hypothetical protein MPER_04725 n=2 Tax=M... 60 8e-08 UniRef50_D1P2W6 Oxidoreductase, 2OG-Fe(II) oxygenase family n=5 ... 59 1e-07 UniRef50_A5WD57 2OG-Fe(II) oxygenase n=4 Tax=Moraxellaceae RepID... 58 2e-07 UniRef50_C0YUD2 Possible iron-regulated protein n=1 Tax=Chryseob... 58 3e-07 UniRef50_C1EAF2 Predicted protein n=2 Tax=Micromonas RepID=C1EAF... 57 4e-07 UniRef50_A1ZDU9 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 57 5e-07 UniRef50_D2W6C1 Predicted protein n=2 Tax=Naegleria gruberi RepI... 56 8e-07 UniRef50_A8P9E2 Putative uncharacterized protein n=1 Tax=Coprino... 56 8e-07 UniRef50_C1FF01 Predicted protein (Fragment) n=3 Tax=Mamiellales... 56 9e-07 UniRef50_Q94H92 Os03g0761900 protein n=23 Tax=Embryophyta RepID=... 56 1e-06 UniRef50_C1DZC3 Prolyl 4-hydroxylase n=3 Tax=Viridiplantae RepID... 56 1e-06 UniRef50_D2VSR0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 55 2e-06 UniRef50_B8HX71 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyano... 55 2e-06 UniRef50_D2VZW5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 55 2e-06 UniRef50_Q4SNF8 Chromosome 8 SCAF14543, whole genome shotgun seq... 55 2e-06 UniRef50_D2V646 Putative uncharacterized protein n=1 Tax=Naegler... 55 2e-06 UniRef50_A9YW24 Putative uncharacterized protein n=2 Tax=unclass... 55 2e-06 UniRef50_D2VHD9 Predicted protein n=1 Tax=Naegleria gruberi RepI... 55 2e-06 UniRef50_B7G5U8 Predicted protein n=1 Tax=Phaeodactylum tricornu... 54 3e-06 UniRef50_A0KRU0 2OG-Fe(II) oxygenase n=7 Tax=Shewanella RepID=A0... 54 3e-06 UniRef50_Q9AMY0 Blr2042 protein n=1 Tax=Bradyrhizobium japonicum... 54 3e-06 UniRef50_D2VRB4 Predicted protein n=1 Tax=Naegleria gruberi RepI... 54 4e-06 UniRef50_B8C881 Predicted protein n=1 Tax=Thalassiosira pseudona... 54 4e-06 UniRef50_Q3IHW6 Putative prolyl 4-hydroxylase, alpha subunit dom... 54 4e-06 UniRef50_D0KWT8 2OG-Fe(II) oxygenase n=1 Tax=Halothiobacillus ne... 54 4e-06 UniRef50_B2HZ49 Predicted proline hydroxylase n=18 Tax=Acinetoba... 53 5e-06 UniRef50_B0CZ29 Predicted protein n=2 Tax=Agaricales RepID=B0CZ2... 53 5e-06 UniRef50_A9DQY6 Uncharacterized iron-regulated protein n=1 Tax=K... 53 5e-06 UniRef50_D0SK49 Predicted protein n=1 Tax=Acinetobacter junii SH... 53 6e-06 UniRef50_D2W6G4 Predicted protein (Fragment) n=1 Tax=Naegleria g... 53 6e-06 UniRef50_UPI00016C3513 hypothetical protein GobsU_05758 n=1 Tax=... 53 6e-06 UniRef50_Q1MT87 Novel protein similar to vertebrate leprecan-lik... 53 6e-06 UniRef50_Q2SAS7 FOG: WD40 repeat n=1 Tax=Hahella chejuensis KCTC... 53 6e-06 UniRef50_B0E4Q8 Predicted protein n=1 Tax=Laccaria bicolor S238N... 53 7e-06 UniRef50_Q9NDP6 Leprecan n=1 Tax=Ciona intestinalis RepID=Q9NDP6... 53 7e-06 UniRef50_A8P9G8 Putative uncharacterized protein n=2 Tax=Coprino... 53 8e-06 UniRef50_A8J470 Prolyl 4-hydroxylase alpha-1 subunit-like protei... 53 8e-06 UniRef50_A3WJ18 Prolyl 4-hydroxylase alpha subunit-like protein,... 53 9e-06 UniRef50_A8IL40 Predicted protein n=2 Tax=Chlamydomonas reinhard... 53 9e-06 UniRef50_Q5CZM3 Leprel2 protein (Fragment) n=4 Tax=Euteleostomi ... 53 1e-05 UniRef50_D0N498 Putative uncharacterized protein n=1 Tax=Phytoph... 53 1e-05 UniRef50_B7G721 Predicted protein n=1 Tax=Phaeodactylum tricornu... 52 1e-05 UniRef50_D0Z4R6 SM-20-related protein n=1 Tax=Photobacterium dam... 52 1e-05 UniRef50_C1MRD1 Predicted protein n=1 Tax=Micromonas pusilla CCM... 52 1e-05 UniRef50_A4RVI8 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 52 1e-05 UniRef50_UPI00006A17CC Probable G-protein coupled receptor 162 (... 52 2e-05 UniRef50_B7FSE0 Predicted protein n=1 Tax=Phaeodactylum tricornu... 52 2e-05 UniRef50_A8P9H2 Putative uncharacterized protein n=1 Tax=Coprino... 52 2e-05 UniRef50_Q6LGS5 Putative uncharacterized protein NCU03445 n=1 Ta... 52 2e-05 UniRef50_D2VV82 Prolyl 4-hydroxylase alpha subunit family protei... 52 2e-05 UniRef50_Q5N1K6 Putative uncharacterized protein n=2 Tax=Synecho... 52 2e-05 UniRef50_B8MEI3 Putative uncharacterized protein n=1 Tax=Talarom... 52 2e-05 UniRef50_A5BUA7 Putative uncharacterized protein n=1 Tax=Vitis v... 52 2e-05 UniRef50_B5Y446 Predicted protein n=1 Tax=Phaeodactylum tricornu... 52 2e-05 UniRef50_C7YTN2 Putative uncharacterized protein n=1 Tax=Nectria... 52 2e-05 UniRef50_D1I073 Whole genome shotgun sequence of line PN40024, s... 52 2e-05 UniRef50_Q11WG9 Probable proline hydroxylase n=3 Tax=Bacteroidet... 51 2e-05 UniRef50_B3PB13 SM-20 domain protein n=1 Tax=Cellvibrio japonicu... 51 3e-05 UniRef50_UPI000051007A 2OG-Fe(II) oxygenase n=1 Tax=Brevibacteri... 51 3e-05 UniRef50_B8BZB7 Predicted protein n=1 Tax=Thalassiosira pseudona... 51 4e-05 UniRef50_B9ZRT4 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp... 51 4e-05 UniRef50_Q1DA84 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 51 4e-05 UniRef50_B5YLK5 Predicted protein n=1 Tax=Thalassiosira pseudona... 51 4e-05 UniRef50_B7GCB6 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 51 4e-05 UniRef50_A4RT30 Protein Lysyl hydroxylase fusion protein, putati... 50 4e-05 UniRef50_B8C4F7 Predicted protein n=1 Tax=Thalassiosira pseudona... 50 5e-05 UniRef50_B0C2I7 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 50 6e-05 UniRef50_B0DEZ6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 50 7e-05 UniRef50_C1N7Y0 Predicted protein (Fragment) n=2 Tax=Micromonas ... 50 7e-05 UniRef50_Q2S9H1 Putative uncharacterized protein n=1 Tax=Hahella... 50 8e-05 UniRef50_A6VYC6 2OG-Fe(II) oxygenase n=2 Tax=Marinomonas RepID=A... 50 9e-05 UniRef50_C7YTL8 Putative uncharacterized protein n=1 Tax=Nectria... 50 9e-05 UniRef50_B9SNI2 Oxidoreductase, putative n=1 Tax=Ricinus communi... 49 9e-05 UniRef50_B7VAY0 Putative enzyme n=7 Tax=Pseudomonas aeruginosa R... 49 1e-04 UniRef50_A4RTV5 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 49 1e-04 UniRef50_C3ZC48 Putative uncharacterized protein n=1 Tax=Branchi... 49 1e-04 UniRef50_D1LX56 Leprecan-like protein n=1 Tax=Saccoglossus kowal... 49 1e-04 UniRef50_Q2JRT8 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 49 1e-04 UniRef50_B7G8Q9 Predicted protein n=1 Tax=Phaeodactylum tricornu... 49 1e-04 UniRef50_D2V7Q2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 49 1e-04 UniRef50_B5JX69 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacteri... 49 1e-04 UniRef50_A4S8T4 Predicted protein n=3 Tax=Mamiellales RepID=A4S8... 49 1e-04 UniRef50_Q6PK18 PKHD domain-containing transmembrane protein C17... 49 2e-04 UniRef50_Q486F0 Oxidoreductase, 2OG-Fe(II) oxygenase family n=8 ... 48 2e-04 UniRef50_A6C2C6 Uncharacterized iron-regulated protein n=1 Tax=P... 48 2e-04 UniRef50_D2VWJ6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 48 2e-04 UniRef50_UPI0000D55437 PREDICTED: similar to leprecan 1 n=1 Tax=... 48 2e-04 UniRef50_B2B745 Predicted CDS Pa_2_9860 n=1 Tax=Podospora anseri... 48 2e-04 UniRef50_Q5GQC0 Putative uncharacterized protein n=1 Tax=Synecho... 48 2e-04 UniRef50_D2VTG5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 48 2e-04 UniRef50_B8C289 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 48 2e-04 UniRef50_A0L8V5 2OG-Fe(II) oxygenase n=1 Tax=Magnetococcus sp. M... 48 2e-04 UniRef50_B6BWB5 Putative uncharacterized protein n=1 Tax=beta pr... 48 2e-04 UniRef50_C1EJE5 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 48 2e-04 UniRef50_Q5PP31 At1g68080 n=4 Tax=Magnoliophyta RepID=Q5PP31_ARATH 48 2e-04 UniRef50_A8IDI8 Prolyl 4-hydroxylase n=2 Tax=Chlamydomonas reinh... 48 2e-04 UniRef50_B7G1Z5 Predicted protein n=1 Tax=Phaeodactylum tricornu... 48 2e-04 UniRef50_Q338D2 Prolyl 4-hydroxylase, putative, expressed n=11 T... 48 3e-04 UniRef50_B8BWH5 Putative uncharacterized protein n=1 Tax=Thalass... 48 3e-04 UniRef50_A4Y066 2OG-Fe(II) oxygenase n=16 Tax=Proteobacteria Rep... 48 3e-04 UniRef50_Q15SJ2 2OG-Fe(II) oxygenase n=6 Tax=Proteobacteria RepI... 48 3e-04 UniRef50_A0YIP7 Putative uncharacterized protein n=1 Tax=Lyngbya... 48 3e-04 UniRef50_Q4QF16 Putative uncharacterized protein n=7 Tax=Trypano... 48 3e-04 UniRef50_C7BVA4 2OG-Fe(II) oxygenase family like protein n=1 Tax... 48 3e-04 UniRef50_Q8T294 Putative uncharacterized protein n=1 Tax=Dictyos... 48 3e-04 UniRef50_A3TG91 Putative uncharacterized protein n=1 Tax=Janibac... 47 3e-04 UniRef50_A8L657 2OG-Fe(II) oxygenase n=1 Tax=Frankia sp. EAN1pec... 47 4e-04 UniRef50_A8I8D2 Predicted protein n=3 Tax=Chlamydomonas reinhard... 47 4e-04 UniRef50_Q2TWV5 Predicted protein n=2 Tax=Aspergillus RepID=Q2TW... 47 4e-04 UniRef50_A4RSI6 Predicted protein (Fragment) n=5 Tax=Viridiplant... 47 4e-04 UniRef50_B5Y4Z8 Predicted protein n=3 Tax=Bacillariophyta RepID=... 47 4e-04 UniRef50_B8CBF7 Putative uncharacterized protein n=1 Tax=Thalass... 47 4e-04 UniRef50_Q01F56 SmkH (IC) n=2 Tax=Ostreococcus tauri RepID=Q01F5... 47 4e-04 UniRef50_D0N7E1 Putative uncharacterized protein n=1 Tax=Phytoph... 47 4e-04 UniRef50_Q8IVL5 Prolyl 3-hydroxylase 2 n=29 Tax=Euteleostomi Rep... 47 4e-04 UniRef50_UPI0001925DF6 PREDICTED: similar to predicted protein n... 47 5e-04 UniRef50_Q2N914 Putative uncharacterized protein n=1 Tax=Erythro... 47 5e-04 UniRef50_C1MIQ0 Predicted protein n=2 Tax=Micromonas RepID=C1MIQ... 47 5e-04 UniRef50_A4RVD9 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 47 5e-04 UniRef50_B9IJQ5 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 47 5e-04 UniRef50_C4Y2U7 Putative uncharacterized protein n=1 Tax=Clavisp... 47 5e-04 UniRef50_Q5UP57 Putative prolyl 4-hydroxylase n=1 Tax=Acanthamoe... 47 6e-04 UniRef50_Q0EXH1 2OG-Fe(II) oxygenase n=1 Tax=Mariprofundus ferro... 47 6e-04 UniRef50_A8IV51 Predicted protein n=1 Tax=Chlamydomonas reinhard... 47 6e-04 UniRef50_Q5GQB2 Putative uncharacterized protein n=1 Tax=Synecho... 47 6e-04 UniRef50_B4X4U7 Oxidoreductase, 2OG-Fe(II) oxygenase family n=2 ... 47 6e-04 UniRef50_B5YLH9 Predicted protein n=1 Tax=Thalassiosira pseudona... 47 6e-04 UniRef50_Q4JN23 Putative uncharacterized protein n=1 Tax=uncultu... 47 6e-04 UniRef50_Q54CK1 Putative uncharacterized protein n=1 Tax=Dictyos... 47 6e-04 UniRef50_Q21FK1 2OG-Fe(II) oxygenase n=1 Tax=Saccharophagus degr... 47 7e-04 UniRef50_B8CBV6 Predicted protein (Fragment) n=2 Tax=Thalassiosi... 47 7e-04 UniRef50_D2VKT5 Predicted protein n=2 Tax=Naegleria gruberi RepI... 47 7e-04 UniRef50_UPI0000E46D75 PREDICTED: hypothetical protein n=1 Tax=S... 47 7e-04 UniRef50_Q0APS3 2OG-Fe(II) oxygenase n=1 Tax=Maricaulis maris MC... 47 7e-04 UniRef50_Q8T5S8 Prolyl-4-hydroxylase-alpha PV n=14 Tax=Drosophil... 47 7e-04 UniRef50_A6FBT0 Putative prolyl 4-hydroxylase, alpha subunit dom... 47 8e-04 UniRef50_D2VER2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 46 8e-04 UniRef50_Q58LI8 Possible dioxygenase n=1 Tax=Prochlorococcus pha... 46 8e-04 UniRef50_Q5CU77 Prolyl 4-hydroxylase alpha subunit n=2 Tax=Crypt... 46 8e-04 UniRef50_D2VW34 Oxidoreductase n=3 Tax=Naegleria gruberi RepID=D... 46 8e-04 UniRef50_D2VKE9 Type IIB DNA topoisomerase n=1 Tax=Naegleria gru... 46 9e-04 UniRef50_Q6MK70 SM-20-related protein n=1 Tax=Bdellovibrio bacte... 46 9e-04 UniRef50_UPI0000EB0B63 Guanine nucleotide-binding protein G(I)/G... 46 0.001 UniRef50_A4S1S9 Predicted protein n=1 Tax=Ostreococcus lucimarin... 46 0.001 UniRef50_C1MLG0 Predicted protein n=1 Tax=Micromonas pusilla CCM... 46 0.001 UniRef50_D2VXK7 Predicted protein n=2 Tax=Naegleria gruberi RepI... 46 0.001 UniRef50_Q0AGD5 Prolyl 4-hydroxylase, alpha subunit n=4 Tax=Bact... 46 0.001 UniRef50_A8I7G7 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 46 0.001 UniRef50_A1ZDI6 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 46 0.001 UniRef50_UPI0001926E68 PREDICTED: similar to Novel 2OG-Fe(II) ox... 46 0.001 UniRef50_Q1NF66 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas sp. SK... 46 0.001 UniRef50_D2V3Y8 Predicted protein (Fragment) n=1 Tax=Naegleria g... 46 0.001 UniRef50_B6HE60 Pc20g15710 protein n=14 Tax=Leotiomyceta RepID=B... 46 0.001 UniRef50_D2VJ99 Predicted protein n=1 Tax=Naegleria gruberi RepI... 46 0.001 UniRef50_A8NWL8 Predicted protein n=1 Tax=Coprinopsis cinerea ok... 46 0.001 UniRef50_B6BGN8 2OG-Fe(II) oxygenase n=1 Tax=Campylobacterales b... 46 0.001 UniRef50_D2V6G1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 46 0.001 UniRef50_C5FBE9 Putative uncharacterized protein n=1 Tax=Microsp... 46 0.001 UniRef50_Q4KE77 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 45 0.001 UniRef50_B1HNF1 Putative uncharacterized protein n=3 Tax=Bacilla... 45 0.002 UniRef50_B8HQ53 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyano... 45 0.002 UniRef50_A0A9R2 Leprecan n=1 Tax=Molgula tectiformis RepID=A0A9R... 45 0.002 UniRef50_B2JN14 Procollagen-proline dioxygenase n=6 Tax=Burkhold... 45 0.002 UniRef50_Q1MZY2 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 45 0.002 UniRef50_Q4SQX1 Chromosome 11 SCAF14528, whole genome shotgun se... 45 0.002 UniRef50_Q08AW1 LOC100158404 protein n=6 Tax=Tetrapoda RepID=Q08... 45 0.002 UniRef50_B8C609 Predicted protein n=1 Tax=Thalassiosira pseudona... 45 0.002 UniRef50_A5WGK8 Procollagen-proline dioxygenase n=1 Tax=Psychrob... 45 0.002 UniRef50_A9G957 Putative uncharacterized protein n=1 Tax=Sorangi... 45 0.002 UniRef50_Q4RU77 Chromosome 1 SCAF14995, whole genome shotgun seq... 45 0.002 UniRef50_Q8IVL6 Prolyl 3-hydroxylase 3 n=34 Tax=Amniota RepID=P3... 45 0.002 UniRef50_A5FT70 Putative uncharacterized protein n=1 Tax=Acidiph... 45 0.002 UniRef50_B6QKY4 Putative uncharacterized protein n=1 Tax=Penicil... 45 0.002 UniRef50_Q9LSI6 Prolyl 4-hydroxylase alpha subunit-like protein ... 45 0.003 UniRef50_P74376 Sll0428 protein n=1 Tax=Synechocystis sp. PCC 68... 45 0.003 UniRef50_Q26DQ8 Oxidoreductase, 20G-Fe(II) oxygenase superfamily... 45 0.003 UniRef50_B8C4H5 Prolyl 4-hydrolase-like protein (Fragment) n=1 T... 44 0.003 UniRef50_C6J1E0 Prolyl 4-hydroxylase n=1 Tax=Paenibacillus sp. o... 44 0.003 UniRef50_Q4Q6N6 Putative uncharacterized protein n=2 Tax=Leishma... 44 0.003 UniRef50_Q47UG1 Oxidoreductase, 2OG-Fe(II) oxygenase family n=7 ... 44 0.003 UniRef50_Q82QN7 Putative oxygenase n=1 Tax=Streptomyces avermiti... 44 0.003 UniRef50_C1EER8 Predicted protein n=2 Tax=Micromonas RepID=C1EER... 44 0.003 UniRef50_Q84406 A85R protein n=4 Tax=Chlorovirus RepID=Q84406_PBCV1 44 0.003 >UniRef50_B5YS96 PKHD-type hydroxylase ybiX n=227 Tax=Bacteria RepID=YBIX_ECO5E Length = 228 Score = 338 bits (867), Expect = 8e-92, Method: Composition-based stats. Identities = 222/228 (97%), Positives = 224/228 (98%), Gaps = 3/228 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV Sbjct: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL Sbjct: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASF+WIQSMIR Sbjct: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFIWIQSMIR 180 Query: 181 DDKKRAMLFELD---NNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 DDKKRAMLFELD NIQSLKSRYGE+EEILSLLNLYHNLLREWSEI Sbjct: 181 DDKKRAMLFELDKNIQNIQSLKSRYGENEEILSLLNLYHNLLREWSEI 228 >UniRef50_A8ESR5 PKHD-type hydroxylase Abu_0724 n=7 Tax=Proteobacteria RepID=Y724_ARCB4 Length = 226 Score = 322 bits (825), Expect = 6e-87, Method: Composition-based stats. Identities = 116/226 (51%), Positives = 156/226 (69%), Gaps = 1/226 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ HIP VLS + + R L +A W+DG++T G Q KNN Q+ L L++ + Sbjct: 1 MILHIPEVLSKEQLTECRNLLNKANWIDGKITAGNQAINAKNNFQLAESDPLTNYLRDII 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLF 119 A+N + LF +AALP+ + +P FN+Y+N YG HVD ++ + RTD+S +LF Sbjct: 61 KTALNSNPLFISAALPKHIISPFFNKYENGGNYGNHVDNSILFDMNEKKAFRTDISCSLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 +DP+ Y+GGE+V+ DTFG H VKLPAGDL+LYPS+SLH V PVT+GVR+ SFMWIQSMI Sbjct: 121 FTDPEEYEGGEMVIEDTFGTHEVKLPAGDLILYPSTSLHRVEPVTKGVRMVSFMWIQSMI 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 R KR++LFELDN IQSL+ YGE EE L+L YH L++EWSE+ Sbjct: 181 RSAWKRSILFELDNTIQSLRVNYGEIEETLNLSIHYHKLIQEWSEL 226 >UniRef50_Q1LIS6 PKHD-type hydroxylase Rmet_3078 n=4 Tax=Burkholderiaceae RepID=Y3078_RALME Length = 228 Score = 309 bits (791), Expect = 6e-83, Method: Composition-based stats. Identities = 127/227 (55%), Positives = 161/227 (70%), Gaps = 3/227 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQA--EWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M+ IP VL+ + +A REQL+ A WVDGRVT G GA VK NQQ+D RS A Q+ Sbjct: 1 MLVRIPQVLNAEQLAMLREQLDHAGDAWVDGRVTAGYSGAPVKFNQQIDERSEAAAQCQH 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSAT 117 VL+A+ ++ LF +A LP + P+FNRY T+G HVDG VR HP NG +RTD+SAT Sbjct: 61 LVLSALERNPLFISAVLPNIVYPPMFNRYSEGMTFGLHVDGGVRLHPHNGRKLRTDVSAT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFLSDP SYDGGEL + DT+G H VKL AGD+V+YPS+SLH V P+TRGVRV F WIQS Sbjct: 121 LFLSDPASYDGGELQIEDTYGVHSVKLAAGDMVVYPSTSLHQVKPITRGVRVGCFFWIQS 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +IRDD +RA+LF++DN IQ+L + +L+ YHNLLR+WS+ Sbjct: 181 LIRDDGQRALLFDMDNAIQTLNQTNADERARRTLVGCYHNLLRQWSD 227 >UniRef50_C5T9F3 2OG-Fe(II) oxygenase n=1 Tax=Acidovorax delafieldii 2AN RepID=C5T9F3_ACIDE Length = 229 Score = 306 bits (783), Expect = 5e-82, Method: Composition-based stats. Identities = 111/229 (48%), Positives = 157/229 (68%), Gaps = 4/229 (1%) Query: 1 MMYHIPGVLSPQDVARFREQL-EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 M HI VL P+++A FR+ L A WVDG + G Q KNN Q+ S L A LQ Sbjct: 1 MFLHIKDVLPPEELAFFRQALGADAPWVDGARSAGGQAIHQKNNLQLAQGSELSAQLQAR 60 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNN-ETYGFHVDGAVR-SHPQNGWMRTDLSAT 117 V A++++ALFF+AALPR + PLFN Y + YG HVD AV SH N W+R+DLS T Sbjct: 61 VKAALHRNALFFSAALPRRIYNPLFNNYGDGTNFYGNHVDSAVMHSHADNCWVRSDLSCT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFL+ P+ Y+GGELV + FG+ R+KLPAGD++LYPSS++H V+PVTRG R++ F W++S Sbjct: 121 LFLTPPEDYEGGELVATEAFGEKRIKLPAGDMILYPSSTVHQVSPVTRGHRISCFFWVES 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESE-EILSLLNLYHNLLREWSEI 225 M+R ++R +LF++D ++ L+ +GE E +++L YHNLLR W+++ Sbjct: 181 MVRGLEQRQLLFDMDMSLLKLRQAHGEKEPSVIALSGTYHNLLRMWADV 229 >UniRef50_Q15TJ8 PKHD-type hydroxylase Patl_2273 n=6 Tax=Proteobacteria RepID=Y2273_PSEA6 Length = 227 Score = 303 bits (777), Expect = 2e-81, Method: Composition-based stats. Identities = 106/225 (47%), Positives = 157/225 (69%), Gaps = 2/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I +LS ++V +F + L++ +W+DG+ T G+Q ++VK NQQ+D S L L+N V Sbjct: 1 MLTVIEDLLSKKEVTQFTQALDKGQWLDGKHTAGSQASKVKYNQQLDDGSALAIELRNTV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM-RTDLSATLF 119 + ++ +ALF ++ALP + P FNRYQ E YG HVD +V P + M RTDLSATLF Sbjct: 61 IRKLSGNALFMSSALPNKIYPPKFNRYQGGEHYGLHVDASVMPIPNSHQMLRTDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 LS+P++YDGGEL + FG ++KL AG ++LYP++SLH V PVT+G R ASF WI+S++ Sbjct: 121 LSEPKTYDGGELSIETQFGLQQIKLNAGSVILYPANSLHQVNPVTKGRRTASFFWIESLV 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESE-EILSLLNLYHNLLREWS 223 R + +R+MLF+LD +IQ+L G ++ E+ L +YHNL+R W+ Sbjct: 181 RSNDQRSMLFDLDQSIQALTVELGSNDAEVKRLTGVYHNLMRSWA 225 >UniRef50_A3P5N3 PKHD-type hydroxylase BURPS1106A_A1609 n=30 Tax=Proteobacteria RepID=Y5709_BURP0 Length = 227 Score = 301 bits (771), Expect = 1e-80, Method: Composition-based stats. Identities = 119/226 (52%), Positives = 159/226 (70%), Gaps = 2/226 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MM HIPGVL+ + VA+ R+ L+ A+W DG T+GAQ A K N+Q+ S A + + Sbjct: 1 MMLHIPGVLTKEQVAQCRDILDAADWTDGNATSGAQSALAKRNRQLPEGSPAARAAGDAI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSATLF 119 +A+ ++ALFF+AALP + PLFNRY + +G HVD A+R + +R+DLSATLF Sbjct: 61 QDALARNALFFSAALPLKVFPPLFNRYAGGDAFGTHVDNAIRLLRGTDFRVRSDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 L +P+ YDGGEL V DT+G HR KLPAGD+VLYP+SSLH VTPVTRG RVASF WIQSM+ Sbjct: 121 LEEPEHYDGGELCVEDTYGVHRAKLPAGDMVLYPASSLHHVTPVTRGARVASFFWIQSMV 180 Query: 180 RDDKKRAMLFELDNNIQSLKS-RYGESEEILSLLNLYHNLLREWSE 224 RDD R +L++LD IQ L + + G +++L +YHNLLR W++ Sbjct: 181 RDDADRTLLYQLDTQIQRLTAEKGGRDASVIALTGIYHNLLRRWAD 226 >UniRef50_Q2JHA7 PKHD-type hydroxylase CYB_2270 n=16 Tax=Cyanobacteria RepID=Y2270_SYNJB Length = 224 Score = 297 bits (760), Expect = 2e-79, Method: Composition-based stats. Identities = 89/225 (39%), Positives = 137/225 (60%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I VLS ++ + + AE+VDG +T G VKNN+Q+ S ++ + Sbjct: 1 MILCIGDVLSLAELQQILSLIADAEFVDGALTAGWNARLVKNNRQMPKGSLQQRKIEEII 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L A+ ++ LF AA P+ + + L + Y+ +YG H D A+ ++ MRTD+S TLFL Sbjct: 61 LAALERNLLFQMAARPKLIHSILISCYEAGMSYGTHTDDALML-DRHQLMRTDISFTLFL 119 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S P+ YDGGEL + + G+ KLPAG L+LYP+S+LH V PVTRG+R A+ W+QS+IR Sbjct: 120 SAPEDYDGGELKIESSEGEQAYKLPAGALILYPASTLHRVEPVTRGIRYAAVSWVQSLIR 179 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 D ++R +LF+L Q + G++ + +Y NLLR+W+E+ Sbjct: 180 DPQEREILFDLQTVRQQMFQESGKTRHFDLISKVYANLLRKWAEL 224 >UniRef50_Q3SUS4 PKHD-type hydroxylase Nwi_0701 n=10 Tax=Proteobacteria RepID=Y701_NITWN Length = 226 Score = 294 bits (754), Expect = 1e-78, Method: Composition-based stats. Identities = 110/225 (48%), Positives = 146/225 (64%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I VL+P ++ RFRE L QA+W DGR T G + K N+Q+ +L L + Sbjct: 1 MIQVISDVLTPDELKRFRELLGQAQWQDGRATAGHVAVRAKANEQLSHEDSLGQQLSEFL 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-WMRTDLSATLF 119 L + + + F AAALP + P FNRY +YG H+D A+ S P G +R DLSATLF Sbjct: 61 LERLGKISHFIAAALPLKVLPPRFNRYTGGGSYGDHIDNAIFSVPGAGVRIRGDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 LS+P YDGGEL++ F +H+ KLPAG ++LYP+S+ H VTPVTRG R+A+F W QS++ Sbjct: 121 LSEPGDYDGGELIIQGEFARHQFKLPAGQMILYPASTFHQVTPVTRGARLAAFFWTQSLV 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 R+ +RA+LFELDN IQ+L E + L LYHNLLREWSE Sbjct: 181 REHSRRALLFELDNTIQALAQDNPEQPAVARLTGLYHNLLREWSE 225 >UniRef50_Q47YL9 PKHD-type hydroxylase CPS_3426 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y3426_COLP3 Length = 223 Score = 290 bits (742), Expect = 3e-77, Method: Composition-based stats. Identities = 78/223 (34%), Positives = 121/223 (54%), Gaps = 2/223 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ +P VLSP VA + +E + G+ T G VKNN Q + L +Q + Sbjct: 1 MITKLPQVLSPIQVASIIQLIEHGSFNSGKDTAGWHAKAVKNNLQWQGETELNEQIQTGI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A+ QH F AA +++ + + YG H+D A+ +RTD+S TLFL Sbjct: 61 QGALTQHPQFTGAAYAKSMMPFIISESTLGGGYGDHIDDALMV--NETVLRTDISCTLFL 118 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 + PQ Y+GGELV+N + + KL AGD ++YPS++LH V PVT G R + WI+S I Sbjct: 119 TPPQDYEGGELVMNLSGMEMAFKLNAGDAIIYPSTTLHRVNPVTSGSRKVALTWIESHIP 178 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 +R +LF+LD + + +G+++ + + NLLR+W+ Sbjct: 179 QASQREILFDLDCARKDIMEHHGKTDAFDRITKTHANLLRQWA 221 >UniRef50_B8GUF9 PKHD-type hydroxylase Tgr7_2199 n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=Y2199_THISH Length = 224 Score = 286 bits (732), Expect = 4e-76, Method: Composition-based stats. Identities = 98/225 (43%), Positives = 136/225 (60%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ IP +L +A R L A + DGR + GA +VK+N++VD T AL V Sbjct: 1 MLLTIPELLDAAQLAEIRRLLADAPFTDGRYSAGADARRVKHNEEVDPSDTRVRALNQLV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L + +H F AAALPR LS F RY YG HVD V P+ G RTD+S T+F+ Sbjct: 61 LMPLYRHETFQAAALPRKLSGAFFARYLPGMQYGAHVDDPVMG-PEGGRYRTDVSVTVFI 119 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +PQSY+GGELVV FG+ +VKLPAG V+YPSSSLH V+PVT G R+ + W +SM+R Sbjct: 120 GEPQSYEGGELVVETDFGEQQVKLPAGHAVIYPSSSLHRVSPVTGGERLVAVAWAESMVR 179 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 D +R ML+EL ++L+ ++E ++ NL+R W+++ Sbjct: 180 DPARRQMLYELYQVHEALRRDNPDAEVTRRAGHVRANLMRMWADV 224 >UniRef50_Q087U3 PKHD-type hydroxylase Sfri_0612 n=2 Tax=Alteromonadales RepID=Y612_SHEFN Length = 226 Score = 280 bits (716), Expect = 3e-74, Method: Composition-based stats. Identities = 102/226 (45%), Positives = 145/226 (64%), Gaps = 2/226 (0%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M I +LS QDV +R+QL + W DGR T A VKNN Q D + L N++L Sbjct: 1 MIVIEQILSKQDVGAYRQQLAECPWGDGRKTAMGMAASVKNNNQADAQHANVRQLANQLL 60 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSATLFL 120 + + +AALP + P FNRY E YG+HVD A+ P + +R+D+S T+FL Sbjct: 61 ARIGETPKIVSAALPHKIFPPCFNRYNETEEYGYHVDAAIMRIPNTSEVIRSDVSMTVFL 120 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S+P+ YDGGELV+ FGQ ++KLPAG V+YPSSSLH VT VTRG R+A+ W+QSM+ Sbjct: 121 SEPEEYDGGELVIATEFGQQQIKLPAGYAVVYPSSSLHKVTAVTRGQRIAAITWMQSMVA 180 Query: 181 DDKKRAMLFELDNNIQSL-KSRYGESEEILSLLNLYHNLLREWSEI 225 D R L++LD +IQ+L K+ + E+ +L N+YHNL+R+++++ Sbjct: 181 DVTLRQTLYQLDQSIQNLIKANNTDRAELDNLHNVYHNLIRQFTQL 226 >UniRef50_Q0C1R0 PKHD-type hydroxylase HNE_1625 n=4 Tax=Alphaproteobacteria RepID=Y1625_HYPNA Length = 224 Score = 278 bits (710), Expect = 1e-73, Method: Composition-based stats. Identities = 93/225 (41%), Positives = 123/225 (54%), Gaps = 2/225 (0%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M I +L + L + W DGR T GA +VK NQQ D S + ++ +L Sbjct: 1 MIVIENILGQDVLTEVAAALRELRWEDGRNTAGATARRVKRNQQADLSSRTGSKVREVLL 60 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 AV +H + A A P + PL + + YG H+D V +RTDLS TLFLS Sbjct: 61 EAVKRHPVVEAYARPLKFAPPLISCSGEGDAYGLHIDNPVMGKGD-ARLRTDLSFTLFLS 119 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 P+SYDGGEL + F VKLPAG +V+YPS+ LH VTPVT G R WIQS I+D Sbjct: 120 PPESYDGGELEIETVFKTESVKLPAGSMVIYPSTELHRVTPVTSGERFVFVGWIQSAIKD 179 Query: 182 DKKRAMLFELDNNIQSLKSRYGE-SEEILSLLNLYHNLLREWSEI 225 +RA+LF++ N L R+ S E+L+L NL+R WS+I Sbjct: 180 AAQRAILFDVTNLKAGLARRFPPGSPELLTLAKTESNLIRMWSDI 224 >UniRef50_A3PE10 PKHD-type hydroxylase P9301_13621 n=4 Tax=Prochlorococcus marinus RepID=Y1362_PROM0 Length = 222 Score = 275 bits (703), Expect = 9e-73, Method: Composition-based stats. Identities = 73/227 (32%), Positives = 123/227 (54%), Gaps = 8/227 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLEQA---EWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 M Y I +L+ +++ +++L++ +W DG+ T G+ + VKNN Q++ + + Sbjct: 1 MNYLIHQLLNAEEINLIKKELDKCSQQDWEDGKKTAGSHASMVKNNLQLNRNTEVSKKNA 60 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 V + L + +LP+ + +F + N YG H+D S R+DLS T Sbjct: 61 QLVTKKILSSQLIKSFSLPKKIHGIMFTKSSKNMHYGRHIDNPYMSSG-----RSDLSFT 115 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 + L++ YDGGEL++ + + KL G+++LYPSS LH V V G R+ WI+S Sbjct: 116 ISLTNKDFYDGGELIIETMNTEEKFKLNPGEIILYPSSYLHAVNEVNNGERLVCVGWIES 175 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 ++ +KR LF+LD +SL S++G S+E+ + Y NLLR+ E Sbjct: 176 YVKSTEKREYLFDLDAGARSLLSKHGRSDELDLIFKSYSNLLRDIGE 222 >UniRef50_Q5QUG6 PKHD-type hydroxylase IL0759 n=2 Tax=Idiomarina RepID=Y759_IDILO Length = 218 Score = 274 bits (701), Expect = 1e-72, Method: Composition-based stats. Identities = 77/225 (34%), Positives = 127/225 (56%), Gaps = 7/225 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I + V L+ ++ DG+ T G VKNNQQ+ + + A + Sbjct: 1 MILQISNAVDTDTVKSIVAGLDAGQFSDGKKTAGWAAKDVKNNQQLSGKKS--EAATQVL 58 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L+ + Q+AL + P+ ++ NRYQ E YG H+D ++ + +RTD+S TL L Sbjct: 59 LDRLQQNALVQSVMRPKQVARVTINRYQQGEYYGTHMDDSLMNG-----VRTDISFTLGL 113 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S ++GGELV+ D G+ +L GD+++YPS LH V PVT+G R+A W+QS+++ Sbjct: 114 SPLSDFEGGELVIEDASGERSWRLGQGDILMYPSHYLHRVNPVTKGSRLAMIGWVQSLVK 173 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 R +LF+++ ++++ G+SE L ++HNLLREWS++ Sbjct: 174 QPNYRELLFDIEQSLKAEFDANGKSENFDRLTKVFHNLLREWSDV 218 >UniRef50_Q0AP20 PKHD-type hydroxylase Mmar10_1675 n=1 Tax=Maricaulis maris MCS10 RepID=Y1675_MARMM Length = 219 Score = 273 bits (699), Expect = 2e-72, Method: Composition-based stats. Identities = 92/224 (41%), Positives = 137/224 (61%), Gaps = 6/224 (2%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ H+ V + V R+ + Q +VDG T G VK N+Q++ + + +++EV Sbjct: 1 MLIHLQKVCPSEQVDHLRDLIGQGGFVDGGTTAGQVARAVKANEQLEAGARV-DTVRSEV 59 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A+ HA F + A P+TLS L +RY++ YG H+D A+ G R DLS TLFL Sbjct: 60 RKALMAHAGFVSFARPKTLSRILVSRYRDGMAYGPHIDDALM-----GGRRADLSFTLFL 114 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 SDP SYDGGELV++ G+ +KL AGD V+Y +S++H V PVTRG RVA W++S++R Sbjct: 115 SDPDSYDGGELVMDGPDGETEIKLAAGDAVVYATSAIHQVAPVTRGERVAVVGWVRSLVR 174 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R +LF+LD +L +R G++ E+ +L NLLR+W+E Sbjct: 175 RPDQREILFDLDQVSAALFARDGKTRELDLVLKTKANLLRQWAE 218 >UniRef50_A5WFM3 PKHD-type hydroxylase PsycPRwf_1523 n=1 Tax=Psychrobacter sp. PRwf-1 RepID=Y1523_PSYWF Length = 259 Score = 273 bits (698), Expect = 3e-72, Method: Composition-based stats. Identities = 102/258 (39%), Positives = 148/258 (57%), Gaps = 34/258 (13%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M++ I +L +++ L + A+W DG++T G Q KNN Q+ + Y A+ N Sbjct: 1 MLHIIENLLDTAQLSQLTSILTHQHAQWQDGKLTAGISAQQQKNNWQLSRQDPSYQAMAN 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSAT 117 L A+ QH +F +AALP+ + PLF+ YQ + YG HVD A+++HP + MRTDLS T Sbjct: 61 LCLEALQQHPVFMSAALPKVIMPPLFSAYQLGQGYGMHVDNALQTHPDSKQLMRTDLSLT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFL++P Y+GGELV++D +G+H +KL AGD VLYPS+SLH V VT G R+A W+QS Sbjct: 121 LFLNNPADYEGGELVISDEYGEHSIKLSAGDAVLYPSTSLHRVNTVTSGQRLAMVTWVQS 180 Query: 178 MIRDDKKRAMLFELD-------------------NNIQSLKSRYGESEE----------- 207 ++R D++R +L +LD QS +++ G+ E Sbjct: 181 LVRSDEQRQILHDLDVSHILLRQKLLATSDQAQSTQAQSTQAQCGQLSEQHSTDQQLTHQ 240 Query: 208 -ILSLLNLYHNLLREWSE 224 I L YHNLLR W+E Sbjct: 241 AIEKLNQSYHNLLRLWAE 258 >UniRef50_A8TQK0 Putative hydroxylase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TQK0_9PROT Length = 223 Score = 270 bits (690), Expect = 3e-71, Method: Composition-based stats. Identities = 90/224 (40%), Positives = 128/224 (57%), Gaps = 2/224 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I G+LS + V ++L A++VDG ++ G G +K N QV +S Y L V Sbjct: 1 MVAVIEGLLSKEQVQTIAKRLFGAQFVDGTLSGGPLGEAIKKNTQVSPQSPEYRELSQLV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L + Q+ AALP+ + +P+F Y YG HVD A+ P G MRTDLS T+FL Sbjct: 61 LGIMRQNDQVAIAALPKRILSPIFASYVEGNRYGEHVDAALMG-PYPG-MRTDLSITIFL 118 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +DP +YDGGELV+ FG+ K AGD VLYP+ +H V P+TRG R+A WI+SM+R Sbjct: 119 NDPGAYDGGELVLKTAFGEQIYKRAAGDAVLYPTHYVHRVNPITRGRRLAIVTWIESMVR 178 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 D +R ++ +L + L + E I + NLLR W++ Sbjct: 179 DPARREVIEDLAEAMDKLVRDGADGEIIRRVEKARLNLLRMWAD 222 >UniRef50_C7R7F4 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R7F4_KANKD Length = 218 Score = 263 bits (673), Expect = 3e-69, Method: Composition-based stats. Identities = 77/227 (33%), Positives = 120/227 (52%), Gaps = 12/227 (5%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV---DTRSTLYAALQ 57 M+ + ++ P + E++ + ++ G+ T G +KNNQQ+ D + A L Sbjct: 1 MILQLSDIIEPNTLNVICEEVAKLDFHSGQQTAGKAVRSLKNNQQILLVDDQPAPLAML- 59 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + +F AA LP+ + + NRYQ YG H+D A + +RTD+S T Sbjct: 60 ---FRHLQKSPIFQAACLPKQFARVMLNRYQQGMQYGNHIDDAYIAG-----VRTDVSFT 111 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LS Y+GGELV+ D+ G+ KL G++++YPSS LH V PVT G R+A W+QS Sbjct: 112 YCLSSTSDYNGGELVLCDSTGERSWKLDKGEVLIYPSSYLHRVNPVTEGTRIAMVGWLQS 171 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 + D +R +LF+L + G+SE+ L Y NLLR W++ Sbjct: 172 KVGDASQRELLFDLKQAVTHELETQGKSEQYDRLSKSYSNLLRMWAD 218 >UniRef50_Q3AJA6 PKHD-type hydroxylase Syncc9605_1577 n=13 Tax=Cyanobacteria RepID=Y1577_SYNSC Length = 222 Score = 255 bits (652), Expect = 7e-67, Method: Composition-based stats. Identities = 82/226 (36%), Positives = 120/226 (53%), Gaps = 8/226 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLEQ--AEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M + +L +V +++L W DGR+T G Q A VK N Q+D + L A+ N Sbjct: 1 MEFLTHSLLPLHEVCALQQRLSAPNLPWRDGRLTAGDQAALVKKNYQLDPNAELSLAISN 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + A+ L + +L R + + L +R E+YG+HVD P + R DLS T Sbjct: 61 CISTALTSDPLVKSFSLVRKVHSLLVSRSSAGESYGWHVDN-----PFSRNGRRDLSFTC 115 Query: 119 FLSDPQSYDGGELVVNDTFG-QHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 FLSD SY+GG L++ +LP G +VLYPSS+LHCVTPV G R WI+S Sbjct: 116 FLSDEDSYEGGSLMIQTGGEDTKEFRLPPGQVVLYPSSTLHCVTPVLSGDRYVCVGWIES 175 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 ++ R+MLF +D + L +R+G S+E+ + Y N +R S Sbjct: 176 YVKAADDRSMLFNIDAGARGLLARHGRSDELDLIFQSYTNAVRRLS 221 >UniRef50_Q0I9X3 PKHD-type hydroxylase sync_1544 n=2 Tax=Synechococcus RepID=Y1544_SYNS3 Length = 220 Score = 255 bits (652), Expect = 8e-67, Method: Composition-based stats. Identities = 73/224 (32%), Positives = 117/224 (52%), Gaps = 6/224 (2%) Query: 1 MMYHIPGVLSPQDVARFREQL-EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 M + +L R E+L +AEW+DG +T GA K N Q++ S L + Sbjct: 1 MNHLRLQILDQATCERLLERLANEAEWLDGSLTAGAHAKGGKRNFQINYDSALRKEIHEL 60 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 V A+ H + LPR L L ++ + Y HVD A S R+DLS TL Sbjct: 61 VERAMWNHPVVKGFCLPRKLHRFLISKTEKEGGYDTHVDNAYMSSG-----RSDLSFTLS 115 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 L+D Y+GG+L ++ + +KL G++++YPS+SLH V VT G+R WI+S + Sbjct: 116 LTDDTMYEGGDLEIDSISESYPIKLKQGEILIYPSTSLHRVCNVTSGIRTVCVGWIESYV 175 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 + + R LF+L++ +++ +++G S+E+ + Y NLLR Sbjct: 176 QAENDRICLFQLESGARAVLAKHGRSDELDLIFLAYTNLLRRLG 219 >UniRef50_A3YZT5 Putative uncharacterized protein n=2 Tax=Chroococcales RepID=A3YZT5_9SYNE Length = 221 Score = 254 bits (649), Expect = 1e-66, Method: Composition-based stats. Identities = 88/226 (38%), Positives = 123/226 (54%), Gaps = 7/226 (3%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M + + +L P V + L E A W G T G VK N Q++ S L+A L Sbjct: 1 MRFVLEPLLQPHQVEDWCLALSSEHASWRPGAETAGWHARSVKRNHQLERGSPLHAQLAE 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 ++ +A+ H L AAALP ++ LF+R E YG HVD A + R+DLS TL Sbjct: 61 QLQSALLAHPLLLAAALPVSIHGVLFSRSTRGEGYGSHVDNAYMAGG-----RSDLSFTL 115 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 FLSDP +Y GGELV+ + ++ PAG ++YPS+ LH V PV G R+ + WIQS Sbjct: 116 FLSDPDTYSGGELVLEGPADEEALRCPAGHALVYPSTQLHRVEPVRDGQRLVAVGWIQSR 175 Query: 179 IRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R +R +LFELD +++ R G+ E + Y NLLR+W E Sbjct: 176 VRRADQRELLFELDTARRAIFKRDGKDEVFDLISRSYTNLLRQWGE 221 >UniRef50_Q05TP1 Putative hydroxylase n=1 Tax=Synechococcus sp. RS9916 RepID=Q05TP1_9SYNE Length = 222 Score = 245 bits (625), Expect = 1e-63, Method: Composition-based stats. Identities = 65/220 (29%), Positives = 102/220 (46%), Gaps = 8/220 (3%) Query: 7 GVLSPQDVARFREQLEQAE---WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNA 63 + + + F+ +L WVDG+ TTG+ K N Q+ + L+ + Sbjct: 7 KIFTESECEDFKAKLLTLTPKHWVDGKTTTGSHAKTKKINLQLKPDTQENKELERAIRER 66 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 + + F + +P+ + L +R + YG HVD A R D+S T+ LS Sbjct: 67 LRNNPSFKSFCIPKKMHHNLISRTEAGGGYGTHVDNAFMKTG-----RADISYTICLSSE 121 Query: 124 QSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDK 183 + Y GGELV++ VK+ G +YPS+ LH V VT G+R+A W+QS I + Sbjct: 122 KDYKGGELVIHGATETTTVKMKQGHAFIYPSNQLHQVNTVTSGIRLACIGWVQSYIASQE 181 Query: 184 KRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 R LF L+ L + G SE + + + NLLR + Sbjct: 182 LRMNLFNLEAGANYLLATQGRSEALDRIFLAHANLLRSFG 221 >UniRef50_Q1GRV0 PKHD-type hydroxylase Sala_1910 n=1 Tax=Sphingopyxis alaskensis RepID=Y1910_SPHAL Length = 218 Score = 237 bits (606), Expect = 2e-61, Method: Composition-based stats. Identities = 78/221 (35%), Positives = 115/221 (52%), Gaps = 5/221 (2%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M+ + +L V R+ +VDG+++ ++VKNN Q+ + Y +L Sbjct: 1 MFKLVQLLGDNAVRALRDIAASGTFVDGKIS--NPHSRVKNNLQLH-DAAAYERSSKILL 57 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 +A+ Q+A F + P ++ PL RY YG H D A P G +RTD+S T+FLS Sbjct: 58 DAMIQNADFMEFSFPARIAPPLLTRYTPGMHYGLHPDAAYIPLPD-GQLRTDVSCTIFLS 116 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 DP YDGG L V R K G ++YPS +LH V PVTRG R+ + +IQS+I D Sbjct: 117 DPADYDGGALHVQLGNADLRFKEAPGVAIVYPSHTLHEVEPVTRGERLVAITFIQSLIPD 176 Query: 182 DKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREW 222 ++R ++ EL N I +L+ E L + + LLR W Sbjct: 177 VQQRNLMHEL-NEIAALEGGKMEPANYTRLQAVQYQLLRMW 216 >UniRef50_A7HP27 PKHD-type hydroxylase Plav_0037 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=Y037_PARL1 Length = 219 Score = 237 bits (605), Expect = 2e-61, Method: Composition-based stats. Identities = 84/226 (37%), Positives = 119/226 (52%), Gaps = 10/226 (4%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M I G+L D+ R + + ++ + G T G VKNN+Q + L A L Sbjct: 1 MFIEIAGILGAADL-RLADTVFAQKDAFESGARTAGRIARAVKNNEQAKP-AGLAADLTM 58 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V + ++ +F AAA PR L +RY YG H D A R DLS TL Sbjct: 59 LVEKRLMKNDVFRAAARPRNFIRILLSRYTQGMAYGLHSDDAFM-----ERQRVDLSFTL 113 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 FLS P+SY+GGEL+V + G+ VKL AG LVLYPS++LH V VT G R A+ WI+S+ Sbjct: 114 FLSPPESYEGGELIVEEPAGERLVKLEAGSLVLYPSATLHRVAEVTSGERRAAVGWIRSL 173 Query: 179 IRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R + R LF++ ++ ++ G+ LL + +LLR W E Sbjct: 174 VRSAEDRETLFDVALALRQAEAA-GDRALTDRLLKIQGSLLRRWGE 218 >UniRef50_B9TN05 PKHD-type hydroxylase ybiX, putative (Fragment) n=1 Tax=Ricinus communis RepID=B9TN05_RICCO Length = 204 Score = 233 bits (595), Expect = 3e-60, Method: Composition-based stats. Identities = 99/172 (57%), Positives = 124/172 (72%), Gaps = 3/172 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MM HIP +L +V + R+ L A+W DGR T G+QGAQVK NQQ+ S L L+ V Sbjct: 33 MMLHIPEILRTDEVKQLRDHLNSAQWSDGRATAGSQGAQVKQNQQLPENSPLMPELRQIV 92 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNN--ETYGFHVDGAVRSHPQN-GWMRTDLSAT 117 A+ +HAL+F+AALP LS P FNRY E YGFHVDGAVRS P + GWMRTDLSAT Sbjct: 93 EQALKRHALYFSAALPLRLSPPQFNRYAAAQLEHYGFHVDGAVRSFPAHPGWMRTDLSAT 152 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRV 169 LFL + Y+GG+L V DT+G+H V+LPAGD++LYPS+S+H VTP+TRG R+ Sbjct: 153 LFLCESDEYEGGDLTVRDTYGEHEVRLPAGDMILYPSTSVHSVTPLTRGARI 204 >UniRef50_UPI00006A2339 UPI00006A2339 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2339 Length = 185 Score = 221 bits (564), Expect = 1e-56, Method: Composition-based stats. Identities = 88/152 (57%), Positives = 109/152 (71%), Gaps = 1/152 (0%) Query: 75 LPRTLSTPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLFLSDPQSYDGGELVV 133 LP PLFNRY YG HVDG+V +R+D+S TLFLS+P+ Y+GGEL+V Sbjct: 34 LPLRTLLPLFNRYAGGGQYGLHVDGSVMRQLGSEQPLRSDVSTTLFLSEPEEYEGGELIV 93 Query: 134 NDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDN 193 DT+G+H VKLPAGD+++YPS+SLH VTPVTRG RVASF W QSM+R D +R LFELD Sbjct: 94 VDTYGEHEVKLPAGDMIVYPSTSLHRVTPVTRGARVASFFWTQSMVRQDSQRLRLFELDQ 153 Query: 194 NIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 IQ L+ R G+ EE+ SL YHNLLR W+E+ Sbjct: 154 AIQKLRLRLGDDEEVTSLTGHYHNLLRMWAEV 185 >UniRef50_Q5GQB0 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQB0_BPSYP Length = 231 Score = 217 bits (553), Expect = 2e-55, Method: Composition-based stats. Identities = 70/236 (29%), Positives = 111/236 (47%), Gaps = 17/236 (7%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG---AQVKNNQQVDTRSTLYAALQ 57 M+Y +L+ +V R EQ ++A DG V + Q+KN++++D ++ Y Sbjct: 1 MIYKF-DLLTQDEVRRINEQYDKAALKDGTVKINLENSVEKQLKNSKEIDGNTSHYRYCL 59 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + A+ ++ALF + ++ P+ Y Y HVD Q +RTD S T Sbjct: 60 DLIQKAMRRNALFKTTYILGEITPPIMVEYAEGCYYIPHVDS-----IQIQNLRTDHSMT 114 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFLS+P Y+GGELV+ K AG +++YP+ LH V P+T G R S MW S Sbjct: 115 LFLSEPDEYEGGELVIGIGDVAKSFKEKAGTVIMYPTGMLHEVRPITSGKRRVSVMWATS 174 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEE--------ILSLLNLYHNLLREWSEI 225 +I D R L ++ + E E+ ++ L + N LR + I Sbjct: 175 IIDDTFMRHELINFGMGLKKILDYLEEKEDDQLKIQELLIPLEQVRSNFLRGYGNI 230 >UniRef50_A8TKW7 2OG-Fe(II) oxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TKW7_9PROT Length = 238 Score = 208 bits (530), Expect = 1e-52, Method: Composition-based stats. Identities = 78/234 (33%), Positives = 116/234 (49%), Gaps = 16/234 (6%) Query: 2 MYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 +Y I +L+P VA R L E A WVDG+ T G G + K N ++ S L L ++ Sbjct: 4 VYPIRNLLAPGLVAELRAALQAEGAPWVDGQQTVGRDGTK-KRNHEIAADSPLRQELSDK 62 Query: 60 VLNAV-----NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 V + N+ F PR S LF+R Y H+D AV MR+DL Sbjct: 63 VSAYLRGPLTNETLAFRHVCDPRRWSPFLFSRTGPGGGYRDHMDSAVMFRGSPEEMRSDL 122 Query: 115 SATLFLSDPQSYDGGELVVN-DTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFM 173 S T+FL++P SY GGELVV+ D K+ AG VLY ++++H V VT G R+ + + Sbjct: 123 SMTIFLTEPDSYQGGELVVDSDMPYAPTFKMAAGGAVLYATNAIHRVAEVTAGERLVAVI 182 Query: 174 WIQSMIRDDKKRAMLFELDNNIQSLKSRYGESE------EILSLLNLYHNLLRE 221 WI+S I D R + +L + S+ +R G + + L + N+++ Sbjct: 183 WIESRIADVGTRQINADL-LQVMSVLTRDGACDPEIRESVVTKLEKVRSNVVKR 235 >UniRef50_Q1GXG2 PKHD-type hydroxylase Mfla_0096 n=1 Tax=Methylobacillus flagellatus KT RepID=Y096_METFK Length = 176 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. Identities = 60/138 (43%), Positives = 91/138 (65%), Gaps = 1/138 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ IP V +P++ R++L+ EW+DG+VT G Q A+ KNN Q+ L L + + Sbjct: 1 MLITIPEVFTPEEAESIRQRLDATEWLDGKVTAGYQSAKAKNNLQLAENHPLAIELGDLI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS-HPQNGWMRTDLSATLF 119 ++ + QH LF +AALPR + PLFNRY++ +++GFH+D AVRS +RTDLS+TLF Sbjct: 61 VSRLTQHPLFMSAALPRKVFPPLFNRYESGQSFGFHIDNAVRSLSGSRERVRTDLSSTLF 120 Query: 120 LSDPQSYDGGELVVNDTF 137 + P+ YDGGEL+ + + Sbjct: 121 FTPPEDYDGGELIRHASN 138 >UniRef50_Q111M8 2OG-Fe(II) oxygenase n=2 Tax=Trichodesmium erythraeum IMS101 RepID=Q111M8_TRIEI Length = 210 Score = 147 bits (370), Expect = 3e-34, Method: Composition-based stats. Identities = 38/153 (24%), Positives = 65/153 (42%), Gaps = 10/153 (6%) Query: 27 VDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNR 86 VDG++ + V ++ L+ L N V A N+ + + ++ + Sbjct: 58 VDGKIKPEIRQVNVWGLSYSESTRWLWEKLINSVKYANNKWWNYDIYGIMDSMQLLCYEA 117 Query: 87 YQNNE----TYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRV 142 +N E Y H+D + +S ++ LSDPQ Y+G EL + + Sbjct: 118 SKNQESIQDHYNKHIDVG------EAYYYRKISISIQLSDPQDYEGSELKLYTRREAENL 171 Query: 143 KLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 G ++L+PS LH VTP+ +G R A W+ Sbjct: 172 PKARGTMILFPSFVLHEVTPIIKGKRWALVCWV 204 >UniRef50_C7BVH5 2OG-Fe(II) oxygenase family like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVH5_9CAUD Length = 215 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 88/221 (39%), Gaps = 12/221 (5%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+Y L + + + ++ DG + G + + K+N + + N Sbjct: 1 MIYEY-DFLDKNKLRQMLSLFDAGKFEDGAKS-GPKDKKYKHNSE--QSDIEIGKMVNTA 56 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 + + + + + S L +Y+ Y H D RTD + + L Sbjct: 57 VYKLIRESEISKIHILNKCSPSLMLKYEVGNHYADHSD-----FFDMWGTRTDYTCVVNL 111 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +D Y+GGE + + K+ G ++YP+ +H V PVT GVR W++S I Sbjct: 112 ND--DYEGGEHYIQIGTERIEKKVEPGKALIYPTEFIHGVNPVTSGVRKCLTFWMESSIV 169 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLRE 221 D R L EL+ + + E++++ + L++ Sbjct: 170 DPTIRYYLAELNKFYYKI-EGSMDREDLVNFDLIRMGLIKR 209 >UniRef50_C7BVA8 2OG-Fe(II) oxygenase superfamily like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVA8_9CAUD Length = 206 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 40/136 (29%), Positives = 59/136 (43%), Gaps = 12/136 (8%) Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ- 106 +Y L+ +L A F TL F +Y + Y +H D +P Sbjct: 67 DEPWIYNRLKKYILAANKNAGWNFNVDHTETL---QFTKYDVGQFYDWHPDQHHYLYPDD 123 Query: 107 --NGWMR---TDLSATLFLSDPQSYDGGELVVN--DTFGQHRVKLP-AGDLVLYPSSSLH 158 N MR LS TL L+DP ++GG+L + + KL G L+++PS H Sbjct: 124 DTNENMRGKYRKLSTTLLLNDPSEFEGGDLEFHFNMKETEKATKLNSKGSLIVFPSFVYH 183 Query: 159 CVTPVTRGVRVASFMW 174 VTP+T+G R + W Sbjct: 184 RVTPITKGTRYSLVSW 199 >UniRef50_A3PDM8 Putative uncharacterized protein n=2 Tax=root RepID=A3PDM8_PROM0 Length = 186 Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats. Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 6/103 (5%) Query: 77 RTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDT 136 + + + Y + Y +HVD + + G +R +S TLF+++P Y+GGE + Sbjct: 78 KGIEPIQYGIYSDGGKYDWHVDQGAKMFLKGGSVR-KISMTLFINNPDEYEGGEFDLELF 136 Query: 137 FGQHR-----VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 + KL G + + S H V PV+ GVR + W Sbjct: 137 PPEKEPRYETFKLKKGSAIFFQSDVWHRVRPVSSGVRKSLVAW 179 >UniRef50_Q98E25 Mlr4439 protein n=1 Tax=Mesorhizobium loti RepID=Q98E25_RHILO Length = 192 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 50/197 (25%), Positives = 73/197 (37%), Gaps = 32/197 (16%) Query: 1 MM--YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGA-----------QVKNNQQVD 47 M+ I VL + ++ A G G G + + + + Sbjct: 1 MIDYIQITEVLDEAACSALCAEIRAA----GGRAAGLMGRADQKPAWPEVRRTRRAEVSE 56 Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN 107 AL A+ +H F AL T P F Y+ + + H DG Sbjct: 57 ATEGSVNALLARQKTALERH---FGLAL-GTCEKPQFLHYREGDFFVPHQDGNTPLIHDE 112 Query: 108 GWMRTDLSATLFLS------DPQSYDGGELVVNDTFGQH--RVKLPA--GDLVLYPSSSL 157 R +SA +FL+ P+ Y GG LV++ + RV +PA G LV + S + Sbjct: 113 SRFR-KISAVIFLNRQSDDPSPEDYSGGSLVLHGPYSGPNLRVTMPALPGSLVAFRSETT 171 Query: 158 HCVTPVTRGVRVASFMW 174 H VTPVTR R W Sbjct: 172 HEVTPVTRNERFTIVSW 188 >UniRef50_B0C441 2OG-Fe(II) oxygenase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C441_ACAM1 Length = 226 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 39/131 (29%), Positives = 58/131 (44%), Gaps = 12/131 (9%) Query: 47 DTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 +Y + + V ++ F L S+ Y+ Y +H D R Sbjct: 99 PDSVWIYEKIMHHVAQVNAENWQFR---LDGFESSIQLTEYEPGGHYTWHQDIGSRRSGL 155 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA--GDLVLYPSSSLHCVTPVT 164 LS ++ LSDP++Y GG L ++ T Q V +P G LV++PS +LH VT +T Sbjct: 156 -----RKLSVSVQLSDPETYVGGGLELHAT--QKPVMMPRSRGTLVIFPSYTLHRVTAMT 208 Query: 165 RGVRVASFMWI 175 G R A WI Sbjct: 209 EGTRRALVTWI 219 >UniRef50_A5VDF3 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=2 Tax=Sphingomonadales RepID=A5VDF3_SPHWW Length = 363 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 48/218 (22%), Positives = 88/218 (40%), Gaps = 30/218 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQ-----AEW---VDGRVTTGAQGAQVKNNQQVDT----- 48 + P + P R + E+ + + VDGR T G + VK Sbjct: 157 VLVAPNIFDPAFCRRLIDLYERHGGSPSGFMREVDGR-TVGVMDSSVKRRSDYYLDDDDV 215 Query: 49 -RSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNN--ETYGFHVDGAVRSHP 105 R + A L ++ + + F A+ + + Y + + H D Sbjct: 216 LREQVRARLSRFLVPQIERVFQFRAS----RIERYMIACYDSGDSGFFQAHRDNTT---- 267 Query: 106 QNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTR 165 G + T+ L+ Y+GG+L+ + FGQ R + P G V++ S LH TPVTR Sbjct: 268 -GGTAHRRFACTINLNA-GDYEGGDLIFPE-FGQRRYRAPTGGAVVFSCSLLHEATPVTR 324 Query: 166 GVRVA--SFMWIQSMIRDDKKRAMLFELDNNIQSLKSR 201 G R A F++ ++ R ++ A ++ ++ + ++ Sbjct: 325 GKRYAYLPFLYDEAAARQREENARSGKVGADLATYRAE 362 >UniRef50_Q0QZ85 2OG-Fe(II) oxygenase superfamily n=1 Tax=Synechococcus phage syn9 RepID=Q0QZ85_BPSYS Length = 183 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 39/101 (38%), Gaps = 11/101 (10%) Query: 79 LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG 138 + F Y Y +HVD + +S +LFL+ + Y+GGE + Sbjct: 82 VEPVQFGSYPKGGFYNWHVDQHSMP----EKVVRKISMSLFLN--EDYEGGEFDLELYRP 135 Query: 139 Q-----HRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 KLP G + + H V PVT G+R + W Sbjct: 136 GTDQRYETFKLPTGSAIFFQGDQWHRVRPVTSGLRKSLVSW 176 >UniRef50_A5VC31 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas wittichii RW1 RepID=A5VC31_SPHWW Length = 186 Score = 87.4 bits (215), Expect = 3e-16, Method: Composition-based stats. Identities = 33/131 (25%), Positives = 52/131 (39%), Gaps = 9/131 (6%) Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHP 105 + A L + V+++ + A P F Y + Y +H D + S Sbjct: 58 AEQAIVARLMSFVVSSNRTNFGVDIVA-P---FDLQFTEYHGTSQGKYDWHQDVWLES-- 111 Query: 106 QNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTR 165 LS + LSDP Y+GG + P G L+++PS H V PVT Sbjct: 112 -TRPYDRKLSLVVQLSDPADYEGGAFEFFGLQHPGALFAPRGSLLIFPSWMQHRVLPVTG 170 Query: 166 GVRVASFMWIQ 176 G+R + W++ Sbjct: 171 GIRRSLVSWVE 181 >UniRef50_A0NVC8 Oxidoreductase domain protein n=2 Tax=Labrenzia RepID=A0NVC8_9RHOB Length = 182 Score = 83.5 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 37/173 (21%), Positives = 63/173 (36%), Gaps = 12/173 (6%) Query: 8 VLSPQDVARFREQLEQAEWVD-GRVTTGAQG---AQVKNNQQVDTRS--TLYAALQNEVL 61 + S +D +L QAE D G + G Q + + + D + + + V Sbjct: 9 LFSQKDCDEII-RLSQAEAQDTGGLVGGKQQGEIRRARISWLDDEGTAGWVMDRVMTAVA 67 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 A + F L +++ + Y +H D P + + + LS Sbjct: 68 KANREAFDFDITEFREKLQVAIYDESEEG-HYDWHSDVG--EGPIAQFRKA--TIVTQLS 122 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 +Y+GG L ++ G L+ S LH V PVT+G R + W Sbjct: 123 PSDAYEGGALEISLGHKVMAASRDQGCATLFASFMLHRVVPVTKGTRYSLTCW 175 >UniRef50_Q0FVH2 Oxidoreductase domain protein n=9 Tax=Rhodobacterales RepID=Q0FVH2_9RHOB Length = 204 Score = 82.7 bits (203), Expect = 9e-15, Method: Composition-based stats. Identities = 40/180 (22%), Positives = 62/180 (34%), Gaps = 14/180 (7%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN-----NQQVDTRSTLYAAL 56 ++ IP S D R + A D R+ Q ++ V + + Sbjct: 25 VHRIPAAFSEIDCDRIIDLSRTAHSADARLVGRNQDHNLRRADLVWLDDVAGAEWVMEKI 84 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHPQNGWMRTDL 114 V A + L + RY + + +H D R L Sbjct: 85 IELVRQANRA---VYGFDLDAFDESAQVARYGAERQGHFSWHSDVG----DGRLAARRKL 137 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 + + LS+P +Y GG L V + + G L+PS LH VTPV G R + +W Sbjct: 138 TMVVQLSEPGAYRGGALEVMPSAHTVEAERARGSATLFPSYLLHRVTPVEAGERRSMTIW 197 >UniRef50_Q5GQX6 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQX6_BPSYP Length = 200 Score = 80.0 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 50/143 (34%), Gaps = 18/143 (12%) Query: 46 VDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDG------ 99 +D + + + + A + F + F Y Y +H D Sbjct: 57 LDAQHWFVGMIWHHISRANMHNFQFDITSFDND--NVEFLSYDKGGHYAWHCDDFCGRYS 114 Query: 100 ------AVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH-RVKLPAGDLVLY 152 +++ N + R LS +L L+D Y+GGE + + G L+++ Sbjct: 115 PQVPADGLKTEVYNEYSR-KLSFSLLLND--DYEGGEFQIYFPPHHMITIPKEKGKLIIF 171 Query: 153 PSSSLHCVTPVTRGVRVASFMWI 175 S +H V V G R W+ Sbjct: 172 DSRCVHRVRKVKSGTRDVLVGWV 194 >UniRef50_B4RHL4 Putative uncharacterized protein n=2 Tax=Phenylobacterium zucineum HLK1 RepID=B4RHL4_PHEZH Length = 365 Score = 79.6 bits (195), Expect = 6e-14, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 78/220 (35%), Gaps = 20/220 (9%) Query: 2 MYHIPGVLSPQDVARFREQLE------QAEWVD-GRVTTGAQGAQVKNNQQVDTRSTLYA 54 + +P + P E E D G T G K L Sbjct: 154 VLIVPRIFEPTLCRAMIEHYERRGGSPSGVMRDVGGRTVGVLDDFKKRRDAPVDDERLLQ 213 Query: 55 ALQNEVLNAVN---QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 A++ + + + Q A FAA ++ + H D G Sbjct: 214 AMRTAIAHRLLPEVQRAFQFAATRVERYIVACYDA-AEGGYFRPHRDNTT-----AGTAH 267 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 + ++ L+ + Y+GG+L + FG + P G V++ S LH TPVTRG R AS Sbjct: 268 RKFAVSINLNA-EDYEGGDLRFPE-FGSRTYRAPTGGAVVFSCSLLHEATPVTRGRRYAS 325 Query: 172 --FMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEIL 209 F++ ++ R ++ L + D + + E + Sbjct: 326 LPFLYDEAGARVREQNRHLLQSDPPPPPVAAPVPAGEAVT 365 >UniRef50_B0CEH1 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CEH1_ACAM1 Length = 202 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 68/194 (35%), Gaps = 26/194 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG-----AQVKNNQQVDTRS--TLYA 54 ++ PG L PQ + + + R+T + + Q+ + + Sbjct: 12 VFVDPGFLDPQFCESYLTEAQTCPCEPARLTRYGEAVTDDSRRKTGQLQISPTTIKGIRE 71 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 L + + H AL P RYQ + +G H D S P + + + L Sbjct: 72 RLI-AIKPRLETHFEVQLHAL----EPPSCYRYQVGDFFGLHRDVIDPSLPGSKFEKNRL 126 Query: 115 -SATLFLS------DPQSYDGGELVVNDTFGQHR-----VKLPA--GDLVLYPSSSLHCV 160 S +FL+ PQ++ GG L + R L G L+ + S H V Sbjct: 127 VSLIIFLNGMSAEPSPQTFGGGALALYGLLNDARGQNYGFPLEPEQGQLIAFRSDLWHEV 186 Query: 161 TPVTRGVRVASFMW 174 PVT G R W Sbjct: 187 KPVTHGERFTIVSW 200 >UniRef50_A8TIV2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TIV2_9PROT Length = 338 Score = 77.0 bits (188), Expect = 4e-13, Method: Composition-based stats. Identities = 44/198 (22%), Positives = 69/198 (34%), Gaps = 44/198 (22%) Query: 1 MMYHIPGVLSPQDVARFREQLEQA--------EWVDGRVTTGAQG-AQVKNNQQVDTRST 51 MM IP VLSP+ + + V G + +V+ + V S Sbjct: 161 MM--IPDVLSPEWCRWLIHVHDSQGNEPSGFLQQVKGESVLLSDAEVKVRRDHVVPEGSP 218 Query: 52 LYAALQNEVLNAV------------NQHALFFAAALPRTLSTPLFNRYQ--NNETYGFHV 97 L A +++ + +H LF RY+ + H Sbjct: 219 LEAEIRHIFQRRLIPEIARATHSPIQRHELFKIV------------RYEAEEGGHFRPHR 266 Query: 98 DGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL 157 D + TL L+ +YDGG LV + +G + AG+ V++ S L Sbjct: 267 DNT-----STAGRTRRFAVTLNLN-TGAYDGGHLVFPE-YGDIGYRPAAGEAVVFSCSLL 319 Query: 158 HCVTPVTRGVRVASFMWI 175 H PVTRG R ++ Sbjct: 320 HEARPVTRGTRYVLLAFL 337 >UniRef50_A8TUG3 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TUG3_9PROT Length = 355 Score = 76.2 bits (186), Expect = 9e-13, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 68/199 (34%), Gaps = 42/199 (21%) Query: 2 MYHIPGVLSPQDVARFREQLE--QAEWVD-------GRVTT--------GAQGA------ 38 + +P VLSP+D R E+VD GR T G Sbjct: 170 VLVVPDVLSPEDCRRLISIYAMQGQEFVDPGHNQLKGRTTDCKMRIPEYGRNDRIDHWVC 229 Query: 39 QVKNNQQVDTR--STLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFH 96 NN +D R L + V +H + A + Y+ +G H Sbjct: 230 STANNNIIDARLVPRLMPEIHKAFQYKVTRHERYRIAC---------YEGYRGGSQHG-H 279 Query: 97 VDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS 156 D + + + T+ L+ Y+G EL + F + K P G +++ S Sbjct: 280 RDNTLPFVAHRRF-----AVTINLNA-DEYEGAELRFPE-FSEAAYKTPTGSAIVFSCSL 332 Query: 157 LHCVTPVTRGVRVASFMWI 175 LH V + G R A ++ Sbjct: 333 LHEVMAMRSGRRFALLAFL 351 >UniRef50_A6D9X4 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6D9X4_9PROT Length = 220 Score = 75.4 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 63/187 (33%), Gaps = 17/187 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRV-TTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 I LS + L++ E ++ + ++ + L + Sbjct: 33 FLIINNFLSKNECHEIINSLDKNEKYKAKIISNNNLNESIRKTILHNPTDKLRELFHKRI 92 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM-------RTD 113 N FF +L + + Y+ Y H D A N + Sbjct: 93 NKYKNDIEKFFGVSLLKG-TDIQILEYKEGGHYNCHADNASVIMKNNHIVGYKVVRSERK 151 Query: 114 LSATLFLSDPQSYDGGELVVNDT--FGQHRVKLPA--GDLVLYPSSSL--HCVTPVTRGV 167 L+ LFL+ + + GGE+ + RV L G ++++PS L H V + +G Sbjct: 152 LTTLLFLN--EDFLGGEIEFCHLRYYNNKRVILKPKIGMMIVFPSHGLFAHKVFEIKKGK 209 Query: 168 RVASFMW 174 R A W Sbjct: 210 RFAIVKW 216 >UniRef50_A8TW57 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TW57_9PROT Length = 383 Score = 74.6 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 72/203 (35%), Gaps = 22/203 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQ-----AEW---VDGRVTTGAQGAQVKNNQQVD-TRSTL 52 + IP V+SP + + E + + VDG +T G ++K + +L Sbjct: 170 VLMIPDVVSPAFCRQLIDYYEARGGGASGFMRDVDG-LTRGLLDPKMKRRKDCSIEDESL 228 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHPQNGWM 110 L+ + V + + Y + + H D Sbjct: 229 LKQLRRALETRVIPEIGKAFGYRVSRVERYIIGCYDAADQGFFKAHRDNT-----SKATA 283 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + +L L+ Y+GG L + +GQH K G V++ S H TPVTRG R Sbjct: 284 HRKFAMSLNLN-TDEYEGGALRFPE-YGQHTYKPGVGCAVVFSCSLFHEATPVTRGRRYV 341 Query: 171 SFMWI---QSMIRDDKKRAMLFE 190 ++ Q + + R L E Sbjct: 342 VLPFLYDEQGAAQRAETRRFLAE 364 >UniRef50_Q2T4K0 Oxidoreductase domain protein n=1 Tax=Burkholderia thailandensis E264 RepID=Q2T4K0_BURTA Length = 97 Score = 72.7 bits (177), Expect = 8e-12, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 6/95 (6%) Query: 82 PLFNRYQNN-ETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH 140 P + Y + +H D + ++ L+ + LS+P Y+GG+L V + Sbjct: 2 PHYVEYHAGFGHFHWHNDYSH----ESEEAPRKLTVIVQLSEPHEYEGGDLEVFGSSIAV 57 Query: 141 RVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 + G ++ PS H VTPV GVR WI Sbjct: 58 APR-HRGSIICLPSFVEHRVTPVVAGVRRVLVAWI 91 >UniRef50_A8LEY1 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Frankia sp. EAN1pec RepID=A8LEY1_FRASN Length = 165 Score = 72.3 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 48/145 (33%), Gaps = 12/145 (8%) Query: 33 TGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNET 92 G + Q + +D + V ++ + RY+ + Sbjct: 33 PGRRAEQGW-SVDLDELHEVAGRALRYVTQVNARNWRWR-----LREIRFAVLRYEIGQR 86 Query: 93 YGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY 152 HVD P M S ++ L+ Y GG+L + G V + Sbjct: 87 MARHVD------PSPPGMVRRASVSVQLTPGDDYAGGDLTLWPDGRPIVASRDVGTAVAF 140 Query: 153 PSSSLHCVTPVTRGVRVASFMWIQS 177 P+++ H V VT GVR A W S Sbjct: 141 PAATAHEVGEVTSGVRWALVGWSYS 165 >UniRef50_Q8DKV0 Tlr0755 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DKV0_THEEB Length = 197 Score = 71.9 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 72/188 (38%), Gaps = 21/188 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN--NQQVDTRSTLYAALQNE 59 + ++ + E+ + D ++ G V+ + D + A ++ Sbjct: 15 ILLFQRLIPVHHCQQVIATAEKVGFEDAQILMGTVDRSVRGGSLLRFDPQDPQQAMVRQM 74 Query: 60 VLNAVN--QHALFFAAAL--PRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR---- 111 +L A Q L+ + P + RY+ E Y HVD + Q + Sbjct: 75 LLQATQTIQIVLYQHYGIRFPE-IENFSVLRYRVGEGYRRHVDNLLLGSRQMELAQGIPT 133 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA--GDLVLYPSSSL--HCVTPVTRGV 167 D+S +L+ + + GGE F + VK+ GD+V++P+ H PV +G Sbjct: 134 RDVSLVGYLN--EDFQGGE----TYFDRQGVKITPRTGDIVVFPAYYTHPHAALPVVQGT 187 Query: 168 RVASFMWI 175 + A W+ Sbjct: 188 KYAFATWL 195 >UniRef50_Q2S7M4 Uncharacterized iron-regulated protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S7M4_HAHCH Length = 182 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 35/142 (24%), Positives = 62/142 (43%), Gaps = 24/142 (16%) Query: 45 QVDTRSTLYAALQN----EVLNAVNQHALF--FAAALPRTL---------STPLFNRYQN 89 Q S +Y ++N +AV + +F A LP+ L F RY+ Sbjct: 40 QTARGSQMYKDIRNNDRVIFDDAVMANNIFNRIEAMLPQELDGWELVGLNERLRFYRYEP 99 Query: 90 NETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDL 149 + + +H DG+ + + LS +FL+ + Y+GGE+ F ++K G + Sbjct: 100 GQYFKWHRDGSYARSEKEASL---LSFLIFLN--EDYEGGEI----AFRWDKIKPERGSV 150 Query: 150 VLYPSSSLHCVTPVTRGVRVAS 171 V++P + +H T V GV+ Sbjct: 151 VVFPHAMMHQGTTVESGVKYVL 172 >UniRef50_C6B8G5 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=3 Tax=Alphaproteobacteria RepID=C6B8G5_RHILS Length = 380 Score = 70.0 bits (170), Expect = 6e-11, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 65/180 (36%), Gaps = 20/180 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD-------GRVTT--GAQGAQVKNNQQVDTRSTL 52 + +P V P + E++ + G T G + + + + + + Sbjct: 173 IIVLPNVFEPDLCKKLIGLYERSGGEESGVMREVGGKTVQVNDHGYKRRKDYDIQEKDVI 232 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNE--TYGFHVDGAVRSHPQNGWM 110 V V + R + + Y + + H D + + Sbjct: 233 AETQGRFVRRIVPEIQKVHQFTATR-MERYIVACYAAEDEAHFRAHRDNTTKGTAHRRF- 290 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + ++ L+D +DGGE+ + +G K PAG V++ S LH V+ VTRG R A Sbjct: 291 ----AVSVNLND--DFDGGEVSFPE-YGSRSFKAPAGGAVIFSCSLLHAVSKVTRGRRYA 343 >UniRef50_Q58MI5 Putative uncharacterized protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MI5_BPPRM Length = 197 Score = 70.0 bits (170), Expect = 6e-11, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGR---VTTGAQGAQVKNNQQVDTRSTLYAAL- 56 +++ + G++ E++E+ +W + T+ + ++ K V + + Sbjct: 11 LIHVVNGIIPSNICEDVIEEIERRDWSPHQWYNPTSDSSLSKPKKELDVQHITPELQEIL 70 Query: 57 --------QNEVLNAVNQHAL--FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 + QH ++ FNRY ++ H+D Sbjct: 71 TPFVIESGREYNNKYAYQHPSCYINTRSIMDNFCQIRFNRYSGDQIMHQHIDHIYSLFDG 130 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL--HCVTPVT 164 LS L +D Y+G L + VKL GD++++PS+ L H VT Sbjct: 131 TNKGIPVLSFILNFND--DYEGANLFF---WEDTIVKLGKGDIIMFPSNFLFPHGVTEAK 185 Query: 165 RGVRVASFMW 174 +G+R + W Sbjct: 186 KGIRYSGVCW 195 >UniRef50_B8GU65 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GU65_THISH Length = 325 Score = 68.9 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 39/188 (20%), Positives = 57/188 (30%), Gaps = 23/188 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEW-------VDGRVTTGAQGAQVKNNQQVDTRSTLYA 54 + +P V+SP E A + VDG+ + L Sbjct: 146 VLMLPDVVSPDLCEALIRCHESAHFDSGMVRMVDGKPALVPDYGAKRRLDHRLVDEALTD 205 Query: 55 ALQNEVLNAVNQHALFFAA--ALPRTLSTPLFNRYQN--NETYGFHVDGAVRSHPQNGWM 110 L + V A Y++ + H D P Sbjct: 206 RLTEVLSRRVL--PGIATAFNYRVTRFEPFKVVCYESSTGGYFRRHRDN---VTPDARHR 260 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 R LS L Y GG LV + FG+ + P G +++ LH T VT G R Sbjct: 261 RFALSINLN----DGYQGGNLVFPE-FGRQGYRPPRGGAIVFSGGLLHEATDVTGGRRYV 315 Query: 171 S--FMWIQ 176 F+W + Sbjct: 316 LLSFLWGE 323 >UniRef50_A8TVG3 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TVG3_9PROT Length = 318 Score = 68.9 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 55/142 (38%), Gaps = 14/142 (9%) Query: 38 AQVKNNQQVDTRSTLYAALQNEVLNAV----NQHALFFAAALPRTLSTPLFNRYQNNETY 93 + + + ++ LYA ++ + + + + P + + + Sbjct: 183 RKRRRDLTLNRGRPLYADVRTAIADRLMPELWKAWWIDRL-RPEAFYVASYEA-GRGDFF 240 Query: 94 GFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP 153 H D ++ + ++ ++ L+D Y+GG LV + + R + PAG + + Sbjct: 241 AAHRDNSLPATAD-----RRIAVSIELND--DYEGGGLVFPE-YSDDRWRAPAGGGLAFS 292 Query: 154 SSSLHCVTPVTRGVRVASFMWI 175 S LH PVT G R ++ Sbjct: 293 CSLLHEAVPVTAGCRYVLLAFL 314 >UniRef50_A3WCU8 Putative uncharacterized protein n=1 Tax=Erythrobacter sp. NAP1 RepID=A3WCU8_9SPHN Length = 364 Score = 68.1 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 67/189 (35%), Gaps = 29/189 (15%) Query: 2 MYHIPGVLSPQDVARFREQLE-QAEWVDGRVTTGAQGAQVK-----NNQQ-----VDTRS 50 + +P VLSP++ + +E ++ + G K +N+Q + Sbjct: 184 ILIVPNVLSPEECGNLVKSVETDTPFMVRKPQPGEISGNYKVPVYDHNRQDRIDLIIKDP 243 Query: 51 TLYAALQNEVLNAVNQHALF---FAAALPRTLSTPLFNRY---QNNETYGFHVDGAVRSH 104 L + V + FA + R RY + G H D Sbjct: 244 NTLRFLDERIFGRVT--PMIKKAFAYDVTRR-EDLHIARYVGKREGIAMG-HRDN---VD 296 Query: 105 PQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVT 164 P R LS +L Y+GGE+ + F ++PAG +++ SS LH V T Sbjct: 297 PPGAHRRFALSMSLN----DEYEGGEITFEE-FSPKGYRVPAGTAMVFSSSLLHEVQETT 351 Query: 165 RGVRVASFM 173 GVR Sbjct: 352 SGVRYNLIS 360 >UniRef50_C7JE98 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JE98_ACEP3 Length = 336 Score = 67.7 bits (164), Expect = 2e-10, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 47/110 (42%), Gaps = 12/110 (10%) Query: 79 LSTPLFNRYQNN--ETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDT 136 + + + Y + H D V G + ++ L+D +++GGEL + Sbjct: 232 MDRMIISCYDAAHKGHFAPHRDNTV-----EGAKHRLFAISINLND--TFEGGELTFPE- 283 Query: 137 FGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS--FMWIQSMIRDDKK 184 F PAG +++ S+ LH V PVT+G R A F + + I + Sbjct: 284 FSNQGFCPPAGGALIFSSALLHAVRPVTKGKRYACLPFAFNEDSINSAHQ 333 >UniRef50_B5VUP4 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=3 Tax=Oscillatoriales RepID=B5VUP4_SPIMA Length = 377 Score = 67.7 bits (164), Expect = 3e-10, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 63/183 (34%), Gaps = 17/183 (9%) Query: 2 MYHIPGVLSPQDVARFREQL-----EQAEW--VDGRVTTGAQGAQVKNNQ-QVDTRSTLY 53 + IP VL + + +++ + +G T G K + + Sbjct: 173 VLLIPKVLDLRLCRELIKIWETQGNDESGFMKREGEKTVGYVDPSFKRRRDHFIQDGPVK 232 Query: 54 AALQNEVLNAVN-QHALFFAAALPRT-LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 + + + V + F L R ++ + H D G + Sbjct: 233 NYIDSIMQRRVFPEILQAFQFQLTRRECYKIGCYDSESGGFFRPHRDNTT-----GGTLH 287 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 + T+ L+ + Y+GG L + H K GD +++ S++H T VT G R A Sbjct: 288 RRFAMTINLN-TEEYEGGCLRFPE-HAPHLYKPATGDAIIFSCSTMHEATDVTSGRRFAL 345 Query: 172 FMW 174 + Sbjct: 346 LSF 348 >UniRef50_Q58MX3 Putative uncharacterized protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MX3_BPPRM Length = 197 Score = 67.3 bits (163), Expect = 3e-10, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 72/197 (36%), Gaps = 33/197 (16%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTL----YAALQ 57 + + +L+ D+ + + ++ ++ D V G G + + N + + + + + Sbjct: 4 LIQVIKILNGTDLKKVNQYVDTLDFEDNTV-FGKPGEECQTNTDIRSSTGVSLDDAHEIT 62 Query: 58 NEVLNAVNQ------------HALFFAAALPRTL------STPLFNRYQNNETYGFHVDG 99 N + ++N H F +P + Y+ + Y FH D Sbjct: 63 NVIHTSMNNGLDEYKRRVQKIHPNFSYYPVPGAVGTRSWREGIQILDYKKGQEYKFHHDA 122 Query: 100 AVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS--SL 157 A P+ G +S L+L + GG F VK G +++PS+ Sbjct: 123 AT--EPRLGEYHRKISVILYLKEAT--KGGG----TAFSHLSVKPKPGYALIFPSNWCYP 174 Query: 158 HCVTPVTRGVRVASFMW 174 H PV+ G + + W Sbjct: 175 HAGEPVSAGKKRVAVTW 191 >UniRef50_Q54PP0 Putative uncharacterized protein n=2 Tax=Dictyostelium discoideum RepID=Q54PP0_DICDI Length = 252 Score = 63.9 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 71/180 (39%), Gaps = 21/180 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQ----VKNNQQ-----VDTRSTLY 53 I V + ++ + + E+ + V G Q ++NN + V+ +Y Sbjct: 22 ITIDDVFTEEECKEWIDLTEKTGYEPALVNIGYGQQQLMTDIRNNDRCIIDSVEMADKIY 81 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 ++ + + NQ + F RY + + H DG + + Sbjct: 82 QRVKKFIPHTFNQKWEVVSLN-----ERLRFLRYYVGQEFKKHQDGNYKRNNGETSF--- 133 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKL--PAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++ L+L++ + +GG GQ+ +++ G ++L+ + H +PVT+GV+ Sbjct: 134 ITLQLYLNNVE--EGGSTKFFLKSGQNEIEIIPKPGKVLLFQHNIWHQGSPVTKGVKYVI 191 >UniRef50_A6G260 Uncharacterized iron-regulated protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G260_9DELT Length = 212 Score = 63.1 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 72/186 (38%), Gaps = 21/186 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA---QGAQVKNN-QQVDTRSTLYAALQN 58 + + G+ + + + E+ E + + + TG + A ++NN + + AAL Sbjct: 19 FTVDGLFTADECRAWIERGEALGFGEAPINTGRGEVRNANIRNNDRTLVDDPEAAAALFE 78 Query: 59 EVLNAVNQHALFFAAALPRT--LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + + ++ LP T F RY + + H DG ++ R+ LS Sbjct: 79 RLRPVLPPTTWMYSQDLPLTGLNERLRFYRYDPGQRFALHRDGHFTRPDRSE--RSRLSL 136 Query: 117 TLFLSDPQSYDGGELVVNDTFG-----------QHRVKLPAGDLVLYPSSSLHCVTPVTR 165 ++L+ + ++GGE + + G R G ++++P H VT Sbjct: 137 LVYLN--EDFEGGETLFFSSPGYGSHASGGWQETDRAVPKTGRVLVFPHPMFHEGAAVTA 194 Query: 166 GVRVAS 171 G + Sbjct: 195 GRKYVL 200 >UniRef50_A5EM79 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EM79_BRASB Length = 193 Score = 62.7 bits (151), Expect = 9e-09, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 62/184 (33%), Gaps = 25/184 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTT---GAQGAQVKNNQQV-----DTRSTLY 53 + I LS + + E + D ++T V+NN++V D LY Sbjct: 11 IETIANFLSAAECDDYVSWGEAIGFKDAPISTSMGMIMAKDVRNNERVMVDDRDRTQALY 70 Query: 54 AALQNEVLNAVNQHALFFAAALPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 L + F P L RY + + +H DG R+ Sbjct: 71 QRLAGHL------APSFQHRWQPVGLNERLRLYRYDVGQKFDWHRDGHFARDNGE---RS 121 Query: 113 DLSATLFLSDPQSYDGGELVVND-----TFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGV 167 + ++L+D ++GG D G RV G +L+ +H VTRG Sbjct: 122 QFTFLIYLND--DFEGGATSFCDDTGLMPDGPLRVTPEKGMALLFHHPIMHRGDRVTRGR 179 Query: 168 RVAS 171 + Sbjct: 180 KYVL 183 >UniRef50_A5V2J9 Putative uncharacterized protein n=1 Tax=Sphingomonas wittichii RW1 RepID=A5V2J9_SPHWW Length = 223 Score = 62.7 bits (151), Expect = 9e-09, Method: Composition-based stats. Identities = 44/191 (23%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT----RSTLYAAL 56 M++ G L+P AR L + K Q L Sbjct: 39 MIHRFDGALNPAICARLC-ALSHSR----TEKPEGAADPAKLPWQDSDTFAFDHWEEGEL 93 Query: 57 QNEVL-NAVNQHALFFAAALPRTLSTPLFN---RYQNNETYGFHVDGAVRSHPQNGWMRT 112 ++ + + L A R + P F R++ ++ H D + R Sbjct: 94 RHLIGGYRLMVGQLICLAV--REIVFPHFTDLVRWRPGKSMDEHKDDGYPGDDELMSCR- 150 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA-GDLVLYPSS--SLHCVTPVTRGVRV 169 SA + +D +Y GGE + + G + V P G LV YPS + H V PV G RV Sbjct: 151 HYSAVTYCND--NYSGGETFIRNEHGGYYVSAPRTGTLVFYPSDERATHGVKPVVGGDRV 208 Query: 170 ASFMWIQSMIR 180 W +R Sbjct: 209 TLSTWFTRDVR 219 >UniRef50_UPI00016C3A48 hypothetical protein GobsU_06128 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A48 Length = 202 Score = 61.9 bits (149), Expect = 1e-08, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 65/184 (35%), Gaps = 23/184 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDG--RVTTGAQGAQ-VKNNQQVDTRST-----LY 53 ++ I SP + + E A + D T G + ++NN +V ++ Sbjct: 15 LFVIHDFFSPDECDYYITMTESAGYGDAPITTTGGPVMRKDIRNNDRVMIDDAGIARSVW 74 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 L+ V + V L F RY + + +H DGA P R+ Sbjct: 75 ERLRPFVPDRVQFW---QPVGLNERW---RFYRYDPGQQFDWHFDGAYERSPAE---RSA 125 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQH------RVKLPAGDLVLYPSSSLHCVTPVTRGV 167 + ++L+ S E + G RV+ AG ++++P H PV G Sbjct: 126 FTLMIYLNGGVSGGATEFNLRSHGGTRGDDPIVRVQPEAGKVLVFPHRLYHRGAPVADGR 185 Query: 168 RVAS 171 + Sbjct: 186 KYVM 189 >UniRef50_Q08MC8 Oxidoreductase, 2OG-Fe(II) oxygenase family family (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08MC8_STIAU Length = 484 Score = 61.5 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 69/180 (38%), Gaps = 19/180 (10%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN-QQVDTRSTLYA----ALQ 57 + GV S + R E+ E A + + T G ++N +QV L L+ Sbjct: 1 LLLRGVFSRSECLRLIEEAEGAGF---QATGGDYPPSYRDNDRQVHDDGALAEAVFTRLR 57 Query: 58 NEVLNAVNQHALFFAAALPRTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + + A R L+ F RY+ + + H DGA P +R+ L+ Sbjct: 58 PFLPERLVDAEG--EAWRLRGLNPRFRFCRYRGGQRFCIHRDGAYAPSP---SVRSHLTC 112 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHR-----VKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L+L+D + + GG + V+ AG L+++ + H V+ G + Sbjct: 113 MLYLNDAEDFSGGATRYYAERSEGSELLGAVRPQAGTLIVFDHALWHDGEAVSAGTKYVL 172 >UniRef50_A4EST3 Uncharacterized iron-regulated protein n=1 Tax=Roseobacter sp. SK209-2-6 RepID=A4EST3_9RHOB Length = 196 Score = 60.8 bits (146), Expect = 3e-08, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 62/176 (35%), Gaps = 16/176 (9%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTG---AQGAQVKNNQQVDTRST-LYAALQNEV 60 IP LS A +Q E + +T+ ++++NN +V L A L + Sbjct: 14 IPNFLSTDLCAEQIKQAEALGFASAPITSETGTQVVSEIRNNTRVIRDLPTLSAQLWQDA 73 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 N V ++ F RYQ + + +H DG+ R+ TL + Sbjct: 74 RNLVPRN--FKGRDAAGLNDRFRLYRYQPGQFFDWHQDGSYRAADGQESQ-----FTLLI 126 Query: 121 SDPQSYDGGELVVNDTFGQH-----RVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 Q ++GG D F H + G +L+ H PV G + Sbjct: 127 YLNQGFEGGGTRFADVFSSHVFSDFTIAPEPGKALLFHHPISHRGDPVLSGTKYVL 182 >UniRef50_B9F6P1 Putative uncharacterized protein n=3 Tax=Poaceae RepID=B9F6P1_ORYSJ Length = 409 Score = 60.0 bits (144), Expect = 5e-08, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 72/187 (38%), Gaps = 36/187 (19%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN----------QQVDTRSTL 52 +H+ G LSP+ T G + + V + + + Sbjct: 20 FHLRGFLSPETCKELEFVHRSCG------TAGYRPSVVSTSLPHLAATGCGHLLLPFVPV 73 Query: 53 YAALQNEVLNAVNQH-ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 L++ V +A + H LF T L + + + G+H D Q + Sbjct: 74 RERLRDAVESAFSCHFDLFIEF-------TGLIS-WCKGASIGWHSDDNKPYLRQRAF-- 123 Query: 112 TDLSATLFLSDP-QSYDGGELVVNDTFGQHRVKLP-AGDLVLYPS--SSLHCVTPVTRGV 167 +A +L+D + Y GG L D G+ P AGD+V+Y + S+ HCV VT G Sbjct: 124 ---TAVCYLNDHGKDYKGGILQFQD--GEPSFITPVAGDVVIYTADNSNTHCVDEVTEGE 178 Query: 168 RVASFMW 174 R+ +W Sbjct: 179 RLTLTLW 185 >UniRef50_UPI000187D48A hypothetical protein MPER_04725 n=2 Tax=Moniliophthora perniciosa FA553 RepID=UPI000187D48A Length = 195 Score = 59.6 bits (143), Expect = 8e-08, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 21/164 (12%) Query: 10 SPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHAL 69 + + + + + A + G T + + ++ +T + +L+ + Q + Sbjct: 24 TTEQLTKLVSACDAAPFGRGAQTVMDESYRKALKLELAQFATPFDLAATGILHKIQQDLV 83 Query: 70 FFAAAL--PRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYD 127 AL P N Y + H D + +L L P ++ Sbjct: 84 DTDTALRRPIRAEPYKLNIYDKGAFFKAHQDTPRAENMFG---------SLVLVFPTPHE 134 Query: 128 GGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 GG L++ + G + + ++H V PVT G R+ Sbjct: 135 GGNLILTEE----------GQQWTFEADTMHEVQPVTSGARITL 168 >UniRef50_D1P2W6 Oxidoreductase, 2OG-Fe(II) oxygenase family n=5 Tax=Providencia RepID=D1P2W6_9ENTR Length = 194 Score = 58.8 bits (141), Expect = 1e-07, Method: Composition-based stats. Identities = 34/135 (25%), Positives = 53/135 (39%), Gaps = 14/135 (10%) Query: 44 QQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS 103 + + + + L ++ Q + L F RY N Y HVD Sbjct: 63 LESNMGAPIVRYLDKM--ESLRQELNYQLF-LGLRDFETHFCRYPNGGFYKKHVDN---- 115 Query: 104 HPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQHRVKLP--AGDLVLYPSSSL-HC 159 G R ++ L++++ Q DGGELV+ D + KL AG +V + S H Sbjct: 116 --FKGQGRRKITTVLYMNESWQLGDGGELVMYDLQDKALFKLEPLAGRMVFFLSEDFPHE 173 Query: 160 VTPVTRGVRVASFMW 174 V P T+ R + W Sbjct: 174 VLP-TQQKRESIAGW 187 >UniRef50_A5WD57 2OG-Fe(II) oxygenase n=4 Tax=Moraxellaceae RepID=A5WD57_PSYWF Length = 304 Score = 58.1 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 68/181 (37%), Gaps = 17/181 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + + P + + + E+ D ++T G + ++ D + Sbjct: 124 FIVLDDLYQPTALLALQAESGFVEYRDAKLTEGVRKTDIRG----DRIRWITKDFFAGFY 179 Query: 62 NAVNQHALFFAAALPR----TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + L F S + Y Y +H D V G +SA Sbjct: 180 YLNSINDLAFLFNRTLFAGIRHSEAHYACYPPGFGYKWHSDNPV------GRDERVISAV 233 Query: 118 LFLSDPQSY-DGGELVVNDTFGQ-HRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 +L+D + DGGEL + D+ GQ H++ A LV++ S+ LH V R R + W+ Sbjct: 234 FYLNDDWTLDDGGELSIIDSEGQTHKLMPKANRLVIFDSNLLHQVELAHR-QRYSIATWL 292 Query: 176 Q 176 + Sbjct: 293 R 293 >UniRef50_C0YUD2 Possible iron-regulated protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YUD2_9FLAO Length = 184 Score = 57.7 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 65/177 (36%), Gaps = 20/177 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTT-GAQG--AQVKNNQQVD-TRSTLYAALQ 57 ++ I L+ + + ++ + + ++ G Q ++NN ++ +T+ L Sbjct: 10 IFLIEDFLTESECDHYISLSQEKVFEEAKINVFGRQQMNKGIRNNDRLMIFDTTIAEELF 69 Query: 58 NEVLNAVNQHA---LFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 N+V+ + Q F+ + +Y + + H DG+ + T Sbjct: 70 NKVVEFLPQEQDEYQVFSFNEMLRIY-----KYAPGQQFKMHRDGSYIRNENEKSFYT-- 122 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+D ++GGE + F K G +++ H + G + Sbjct: 123 -FMIYLND--DFEGGETEFENLFTVAPKK---GTALIFYHPLRHEGKTLISGHKYVL 173 >UniRef50_C1EAF2 Predicted protein n=2 Tax=Micromonas RepID=C1EAF2_9CHLO Length = 898 Score = 57.3 bits (137), Expect = 4e-07, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 70/187 (37%), Gaps = 36/187 (19%) Query: 6 PGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL--NA 63 +++ + A + E+A G G + + V T A+ + + NA Sbjct: 732 SPLMTEAECAEWVRLAEKA----GEARGGWTTS---RHYAVPTTDIPVHAIPDLLPLWNA 784 Query: 64 VNQH--ALFFAAALPRTLSTP--------LFNRYQNNETYGF--HVDGAVRSHPQNGWMR 111 + + A +AA P + P RY+ + H D + Sbjct: 785 LMRDKLASLLSAACPEEMPKPSSVRVHDAFVVRYEAGAQHHLPMHADQSA---------- 834 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRV-- 169 +S TL L+D Y+GG G V+ G +V + H +PVTRGVR Sbjct: 835 --VSVTLALNDEGEYEGGGTTFAVPVG-KTVRPGRGHVVAFKGGLQHGGSPVTRGVRYIV 891 Query: 170 ASFMWIQ 176 A+F++ + Sbjct: 892 AAFLFAE 898 >UniRef50_A1ZDU9 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDU9_9SPHI Length = 181 Score = 56.9 bits (136), Expect = 5e-07, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 68/178 (38%), Gaps = 21/178 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA---QGAQVKNNQQV-----DTRSTLY 53 ++ I V +P++ + + E+ + VTT V+NN++V + ++L+ Sbjct: 10 VFTISNVFTPEECEHYIDFTEKVGYAPAPVTTPWGPEMMPDVRNNERVMFDDNNLAASLW 69 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 LQ + + A L F +Y + + H DG R + Q + Sbjct: 70 QKLQPLLPTRLQGK---KAVGLNERF---RFYKYHPGQEFKEHKDGHFRRNAQEVSV--- 120 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L+ ++L+ + + GG+ G +++ +H PV GV+ Sbjct: 121 LTLLIYLN--EDFTGGDTFFRTMDI--NFVPKQGAALIFEHRVVHAGLPVIEGVKYVL 174 >UniRef50_D2W6C1 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2W6C1_NAEGR Length = 251 Score = 56.2 bits (134), Expect = 8e-07, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 68/183 (37%), Gaps = 21/183 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + + VLS + E E+ + D + N ++ + V N Sbjct: 62 FVLDQVLSKDECKLMIELSEKMGYEDADKFC--YAYNDRFNDRLMSDDP---KFTEIVWN 116 Query: 63 AVNQHALFFAAALPRTLSTPLFN------RYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + QH + RTL N +Y+ + HVDG+ H ++ L+ Sbjct: 117 RIKQHLPQTLSKDGRTLHLASINPRWRLCKYKPGHYFNKHVDGSFEDHKNKT--KSYLTL 174 Query: 117 TLFLSD--PQSYDGGELVVNDTFGQ---HRVKLPAGDLVLY---PSSSLHCVTPVTRGVR 168 ++L+ ++GG + D+ + +V PAG+ +++ LH V +GV+ Sbjct: 175 IIYLNSQLDGEFEGGSTIFYDSRMELMTRKVTEPAGNALIFLQNDKHMLHGGEKVFKGVK 234 Query: 169 VAS 171 Sbjct: 235 YIM 237 >UniRef50_A8P9E2 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P9E2_COPC7 Length = 1733 Score = 56.2 bits (134), Expect = 8e-07, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 70/207 (33%), Gaps = 31/207 (14%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT------RSTLYAALQNEVLN 62 LS + R Q QA + G T V+N +++ L++ VL Sbjct: 89 LSEVEAKRLISQAAQAPFGKGDQTV--VDTSVRNTWEIEPELVSFENPEWTGWLESTVLK 146 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 V Y+ + H D + AT+ + Sbjct: 147 TVWNSLGVAPYTSKPRCELYKLLVYERGSHFKPHQDTQKAGG---------MFATVIVVL 197 Query: 123 PQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVASFMWIQS 177 P +++GG++ V+ + H + + + + + + +H V P+T G R+A S Sbjct: 198 PSAFEGGQIRVSHSGSSHTIDIASTSATETSVLAWYTDVIHEVLPITSGYRLAL-----S 252 Query: 178 MIRDDKKRAM----LFELDNNIQSLKS 200 R M L ++ + I L+ Sbjct: 253 YNLIHTSREMPRPSLPDMGDAISQLRR 279 >UniRef50_C1FF01 Predicted protein (Fragment) n=3 Tax=Mamiellales RepID=C1FF01_9CHLO Length = 252 Score = 56.2 bits (134), Expect = 9e-07, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 61/200 (30%), Gaps = 31/200 (15%) Query: 3 YHIPGVLSPQDVARFREQLEQAE---WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 + + ++S + + E A W T + + L+ + Sbjct: 4 FILDDIVSASECEALIKCAEGAGYSFWNAAVSTATFRNSDTVEIHSAAVADELWRRCAHL 63 Query: 60 VLNAV---NQHALFFAAALPRTLS------TPLFNRYQNNETYGFHVDGAVRSHPQNGWM 110 V+ V H L+ L T LFN+Y+ + H DGA + Sbjct: 64 VVPTVVIEQGHPLWEP-GLEGTWKACGVNDHLLFNKYEPGGHFSPHTDGASIVDMNRRSL 122 Query: 111 RTDLSA---------TLFLSDPQSYDGGELVVNDTFGQHRV---------KLPAGDLVLY 152 + L T S P+ G+ VV+ G +R + G +++ Sbjct: 123 YSMLVYLNRCPDGGGTALFSPPEGTSMGKFVVDPALGVYRWPEEWQTGVAPVEPGTALVF 182 Query: 153 PSSSLHCVTPVTRGVRVASF 172 + H PV G R Sbjct: 183 RQDTSHEGVPVGPGHRKIII 202 >UniRef50_Q94H92 Os03g0761900 protein n=23 Tax=Embryophyta RepID=Q94H92_ORYSJ Length = 310 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 51/129 (39%), Gaps = 25/129 (19%) Query: 71 FAAALPRTLSTPL-FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGG 129 A +PR P RY+ + Y H D + + S L+L+D + +GG Sbjct: 178 KATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRVASFLLYLTDVE--EGG 235 Query: 130 ELVVNDTFGQH-------------RVKLPAGDLVLYPS---------SSLHCVTPVTRGV 167 E + G++ +VK GD +L+ S +SLH PV +G Sbjct: 236 ETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHGSCPVIKGE 295 Query: 168 RVASFMWIQ 176 + + WI+ Sbjct: 296 KWVATKWIR 304 >UniRef50_C1DZC3 Prolyl 4-hydroxylase n=3 Tax=Viridiplantae RepID=C1DZC3_9CHLO Length = 454 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 74/214 (34%), Gaps = 52/214 (24%) Query: 3 YHIPGVLSPQDVARFREQLEQA---EWVDGRVTTGAQGAQVKNNQQV---DTRSTLYAAL 56 Y L+P + + ++ V G +G+ ++++ + + + A+ Sbjct: 180 YMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSKIRTSAGMFLGRGQDPTVRAI 239 Query: 57 QNEVLNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVD---GAVRSHPQNGWMRT 112 + + A + LP RY+N + Y H D V S P+ G R Sbjct: 240 EERIAAA---------SGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRRGGQRM 290 Query: 113 DLSATLFLSDPQSYDGGELVVN----------DTFGQH-----------RVKLPAGDLVL 151 AT+ + + +GGE + D G H VK GD VL Sbjct: 291 ---ATMLIYLEDTTEGGETIFPNGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRGDAVL 347 Query: 152 YPS---------SSLHCVTPVTRGVRVASFMWIQ 176 + S SLH PV G + + WI+ Sbjct: 348 FWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIR 381 >UniRef50_D2VSR0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VSR0_NAEGR Length = 208 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 69/178 (38%), Gaps = 16/178 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ-NEV 60 ++ I + SP++ + + E +V+ +NN +V +A L V Sbjct: 15 IWTIENLYSPEECQQLIKICESNGFVEAPFNAN-MAKDTRNNDRVILDLPQHAQLFWERV 73 Query: 61 LNAVNQHA------LFFAAAL-------PRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN 107 + QHA + + A P + F RY+ + + H DG + Sbjct: 74 SPYLPQHASQLGNQVLESNAKSGFQLLNPGFSNRLRFYRYKKGQYFAPHTDGCYFDNRDK 133 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTR 165 ++ L+ L+L+D + GGE +H V+ +G ++++ + H VT Sbjct: 134 YVDQSFLTILLYLNDVNN-AGGETNFIQNGIKHSVQPKSGSVLIFVHWNCHEGAEVTS 190 >UniRef50_B8HX71 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HX71_CYAP4 Length = 457 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 70/204 (34%), Gaps = 38/204 (18%) Query: 2 MYHIPGVLSPQDVARFRE--QLEQAEWVDGRVTTGAQ----GAQVKNNQQVDT----RST 51 + LSP+D+ + R+ +L+Q ++D + Q + ++ ++ Sbjct: 260 VVQFENFLSPEDLEKVRDFVRLQQEHFLDSALMGNRQNVQTQVRQSKLLYLEDFPEFKAW 319 Query: 52 LYAALQNEVLNAVNQ--HALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 L ++ A+ Q H F + + + YG H D + Sbjct: 320 FQRFLLAKLPAALQQLQHPEFMVSG-----MEMQLTLHGDGCYYGIHPDTTFTEVAKVA- 373 Query: 110 MRTDLSATLFLS-DPQSYDGGELVVNDT-------FGQHRVKLPA---GDLVLYPSSSLH 158 R +++ + +P + GGEL + T F K+ L+ + S H Sbjct: 374 -RREITFVYYFCLEPGGFSGGELRMYPTQICDRQGFTSADFKVIEPLHNSLIFFNSRCYH 432 Query: 159 CVTPVT-------RGVRVASFMWI 175 V PV +G R WI Sbjct: 433 EVMPVVCPGNRFDQG-RFTINGWI 455 >UniRef50_D2VZW5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VZW5_NAEGR Length = 292 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 44/257 (17%), Positives = 94/257 (36%), Gaps = 47/257 (18%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQ-------GAQVKNNQQ-VDTRSTLY 53 ++H +LS ++ + E+ + T G Q ++V+NNQ+ ++ + Sbjct: 13 IFHFRNLLSKEECEEIIQHGEKTSYKQVPTTGGRQLVWCGVEASEVRNNQRIIEECTEFT 72 Query: 54 AALQNEVLNAVNQH----ALFFAAALPRTLSTP-------------LFN--------RYQ 88 + V +H F +LP ++ LF+ +Y+ Sbjct: 73 RRYSATIFERVAKHLPKDLEFRFKSLPTEVNRADVCPTLEKAKEWKLFSVSDKFRMYKYE 132 Query: 89 NNETYGFHVDG------AVRSHPQNGWMRTDLSATLFLSDPQSYD-GGELVVNDTFGQHR 141 + + H DG ++ + T+ S FL + GGE DT+ + Sbjct: 133 KKQHFLKHFDGTNKRILSLTEGKKPVKQFTEQSFMTFLVYLNDVEKGGETQFFDTYSKEE 192 Query: 142 ---VKLPAGDLVLYPSSSLHCVTPVTRGVRVAS---FMWI-QSMIRDDKKRAMLFELDNN 194 +K G V++ LH V GV+ ++ Q I +D ++ + + + + Sbjct: 193 TFDIKPEMGSGVVFLHELLHQGNDVLGGVKYLLRTDICYMKQKEIVEDGQKNINYTMKVS 252 Query: 195 IQSLKSRYGESEEILSL 211 K++ ++ + L Sbjct: 253 ADQKKNKNSQNNILELL 269 >UniRef50_Q4SNF8 Chromosome 8 SCAF14543, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4SNF8_TETNG Length = 673 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 35/127 (27%), Positives = 58/127 (45%), Gaps = 18/127 (14%) Query: 96 HVDGAVRSHP-------QNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLP 145 HVD + Q ++ DLSA L+L++ ++DGG+L + RVK Sbjct: 524 HVDNCLLEPDTRQCWREQPAFIHRDLSAVLYLNN--NFDGGDLFFTSRDAKTVTARVKPG 581 Query: 146 AGDLVLYPSSSL--HCVTPVTRGVRVASFMWI--QSMIRD--DKKRAMLFELDNNIQSLK 199 G LV + S + H VT VT G R A +W + + RD ++ L++L + + Sbjct: 582 CGRLVGFSSGPVNPHGVTAVTGGRRCALALWFTKEKLYRDMEREEAEALWDLGKPAKKDE 641 Query: 200 SRYGESE 206 G++ Sbjct: 642 EEEGKTA 648 >UniRef50_D2V646 Putative uncharacterized protein n=1 Tax=Naegleria gruberi RepID=D2V646_NAEGR Length = 563 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 70/184 (38%), Gaps = 30/184 (16%) Query: 2 MYHIPGVLSPQDVA-RFREQLE----QAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAA 55 + IP +LS D + R+ + ++E + +V+N++QV+ L Sbjct: 42 IVSIPKLLSKADCEEQIRKSYDYTFEESEVGNSSRKKNVGNKKVRNSEQVELNDEELSKK 101 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 + + +A+ + + L Y+ + + H+D + ++ Sbjct: 102 VFVKCEDAI------------KKMVGCLQTLYKKGQFFNSHIDSGRNIEGHPSF----IT 145 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 ++L+D ++GGE V D +L G VL+ H +T+ + I Sbjct: 146 VLIYLND--DFEGGETVFEDEDVTIEPEL--GKCVLFLHQIKHTAEEITKNTKFV----I 197 Query: 176 QSMI 179 +S + Sbjct: 198 KSAV 201 >UniRef50_A9YW24 Putative uncharacterized protein n=2 Tax=unclassified Phycodnaviridae RepID=A9YW24_OSV5 Length = 206 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 43/135 (31%), Gaps = 26/135 (19%) Query: 46 VDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHP 105 +D + + + ++ + L RY+ Y H D + Sbjct: 68 LDASDPVVKRVMEKCVS-LTDRPLV-------NCEHIQVLRYKPGGHYSPHQD----TFS 115 Query: 106 QNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSS 156 + + L L+D Y+GGE + +++ GD + + S + Sbjct: 116 DTKGNKRMYTVILALND--DYEGGETEFPNLKKKYKW---GGDALFFHTLDNYELMTSKA 170 Query: 157 LHCVTPVTRGVRVAS 171 LH PV G + Sbjct: 171 LHGGRPVESGEKWIC 185 >UniRef50_D2VHD9 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHD9_NAEGR Length = 1139 Score = 54.6 bits (130), Expect = 2e-06, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 72/211 (34%), Gaps = 32/211 (15%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT----------RSTLYAAL 56 ++ + +++ + G T QV+N+ +++ + L+ L Sbjct: 104 PIIYKEQADEIIRIGKKSPFGKGEETI---HDQVRNSFELEPHQFRITNPVWQKELHLLL 160 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 +++ + + P L L Y+ + FH + + + Sbjct: 161 NSKIKSGLGIEKHKKVVQSPCKLHKLLL--YEKGGHFDFHKEKECQQ-----------TC 207 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDL------VLYPSSSLHCVTPVTRGVRVA 170 TL + P Y+GG + + + D + + S H V P+T G R Sbjct: 208 TLAIILPSLYEGGSFKIRHNSSEREIDYSDEDASTSAHFISFYSDCDHAVMPLTSGYRTC 267 Query: 171 SFMWIQSMIRDDKKRAMLFELDNNIQSLKSR 201 I D + + +++N+++ L + Sbjct: 268 LVYNIIVTTHDINQPLPIVDIENDLEMLSQQ 298 >UniRef50_B7G5U8 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G5U8_PHATR Length = 291 Score = 54.2 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 46/227 (20%), Positives = 72/227 (31%), Gaps = 60/227 (26%) Query: 2 MYHIPGVLSPQDVARFRE---QLEQAEWVDGRVTTGAQGAQ------VKNN-----QQVD 47 + VLS + R L A + T G A V+ N Q Sbjct: 65 LVCFKNVLSKVLLESLRSDATALRSAGF---GATAGIAQAADGISDEVRRNVHQVWLQSP 121 Query: 48 TRSTLYAALQNEVLNAVNQHALFFA------AALPR--TLST----PLFNRYQNNETYGF 95 + A + +A + F + AALP+ +L + Y+ Y Sbjct: 122 GQRPQEAFCGDI--DARKKLLSFVSELRLDLAALPKYESLVPGGIELSYLLYEPGSFYKR 179 Query: 96 HVDGAVRSHPQNGWM--RTDLSATLFLSDPQS-------YDGGELVVNDTF--------- 137 H+D + M + +S L+L D DGG L + Sbjct: 180 HIDSPKVRDREGNVMVDKRAVSMILYLGDAHENRDWDISIDGGALRIYGNENVKCRQAQE 239 Query: 138 -GQHRVKL--------PAGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 G+ R++L G +VL+ S + H V P T R+ W Sbjct: 240 KGEGRLQLDDYLDIVPERGTMVLFESEQVAHEVRP-TERDRICVAGW 285 >UniRef50_A0KRU0 2OG-Fe(II) oxygenase n=7 Tax=Shewanella RepID=A0KRU0_SHESA Length = 327 Score = 54.2 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 57/164 (34%), Gaps = 30/164 (18%) Query: 29 GRVTTGAQGAQVKNNQQVD---TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFN 85 G G+ A ++NN ++ A V + A A P + Sbjct: 162 GHQGKGSIEAGIRNNMHYPTPIPNGSVALACAERVTAKF--SGIKIAYAEPMVVL----- 214 Query: 86 RYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH 140 RY+ + Y +H D + + + + L+L+D + GGE + Q Sbjct: 215 RYEPGQFYQWHYDAIHAHTSEIKAELERFGQRCRTGILYLND--DFQGGETEFKAPYIQ- 271 Query: 141 RVKLPAGDLVLY----------PSSSLHCVTPVTRGVRVASFMW 174 VK A ++++ PSS LH VT G + W Sbjct: 272 -VKPQAAAILVFDNTDKSGKPIPSS-LHRGCEVTSGHKWVCTQW 313 >UniRef50_Q9AMY0 Blr2042 protein n=1 Tax=Bradyrhizobium japonicum RepID=Q9AMY0_BRAJA Length = 225 Score = 54.2 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 46/121 (38%), Gaps = 12/121 (9%) Query: 55 ALQNEVLNAVNQHAL--FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 + + + AL F A+ + +R G H+D A + Sbjct: 93 QIMELLESRDRSFALRQIFGASADYVVRRCQMHRMPPGSFIGIHLDAASDPDFE------ 146 Query: 113 DLSATLFLSDPQSYDGGELVVN-DTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 S + L+ + +DGGE VV + QH + P G +++ H V PV+ G R + Sbjct: 147 -YSVIVQLA--RDFDGGEFVVYPTGYEQHVFRPPFGAVLVTTCKVRHEVKPVSSGERRSL 203 Query: 172 F 172 Sbjct: 204 V 204 >UniRef50_D2VRB4 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VRB4_NAEGR Length = 265 Score = 53.8 bits (128), Expect = 4e-06, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 64/180 (35%), Gaps = 13/180 (7%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + + VL ++ E E+ + D + N++ + L + N V Sbjct: 75 VLILENVLLKEECKLLIELSEKLGYED-ADSYCYAYNDRFNDRLMVDDDALTQVIWNRVK 133 Query: 62 NAVNQHALFFAAALPRTLSTPL--FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + + Q + +Y+ +G H DG ++ ++ L+ ++ Sbjct: 134 DHLPQELNHHGMDMTLHSLNNRWRLCKYKPGHYFGTHTDGTY--SNRSNRTKSALTFMIY 191 Query: 120 LSD--PQSYDGGELVVNDTF---GQHRVKLPAGDLVLYPSS---SLHCVTPVTRGVRVAS 171 L+ + GG + + + RV +G +++P LHC VT GV+ Sbjct: 192 LNSQLEGDFKGGSTIFFEQYHRKETARVIERSGTCIVFPQEDMDMLHCGEKVTDGVKYIL 251 >UniRef50_B8C881 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8C881_THAPS Length = 288 Score = 53.8 bits (128), Expect = 4e-06, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 46/124 (37%), Gaps = 14/124 (11%) Query: 64 VNQHALFFAAALPRTLSTPL---FNRYQNNETYGFHV-DGAVRSHPQNGWMR-------- 111 + Q FF AL + L F Y + V DG V + G Sbjct: 148 LPQTLDFFRVALVERIYPLLRQQFGMYLPDGGKSLRVADGFVVKYDAEGGQAELKPHRDG 207 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 + LS + L+ +DGG G VK+ G++V + SS LH +T G R Sbjct: 208 SVLSFNIALNPADEFDGGGTWFQSLDGA--VKIDQGEVVSHSSSLLHGGHGITSGKRYIM 265 Query: 172 FMWI 175 ++ Sbjct: 266 VCFV 269 >UniRef50_Q3IHW6 Putative prolyl 4-hydroxylase, alpha subunit domain n=1 Tax=Pseudoalteromonas haloplanktis TAC125 RepID=Q3IHW6_PSEHT Length = 176 Score = 53.8 bits (128), Expect = 4e-06, Method: Composition-based stats. Identities = 35/157 (22%), Positives = 55/157 (35%), Gaps = 34/157 (21%) Query: 22 EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLST 81 ++ W DG +Q + N Q + +N+ FF Sbjct: 35 DKTYWFDG-------SSQAQKNYQTA---------MEGIRTTLNR--CFFMGLFDYECH- 75 Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG--- 138 + +Y + Y HVD G + L+L+ PQ GGELV+ Sbjct: 76 --YAKYTLGDFYKKHVDA------FKGRSNRVFTTVLYLNTPQ--QGGELVIYKPKSKDI 125 Query: 139 QHRVKLPAGDLVLYPSS-SLHCVTPVTRGVRVASFMW 174 + ++ AG LVL+ S +H V P R + W Sbjct: 126 EITIRPTAGTLVLFESERFVHEVLPAVD-ERYSIAGW 161 >UniRef50_D0KWT8 2OG-Fe(II) oxygenase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KWT8_HALNC Length = 226 Score = 53.8 bits (128), Expect = 4e-06, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 56/161 (34%), Gaps = 36/161 (22%) Query: 22 EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLST 81 ++ +W DGR T + +D + L L + LF Sbjct: 79 DEIKWFDGR-TAAQKA-------YLDQMAELQTYLNRSLFL-----GLFEYECH------ 119 Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYD-GGELVVNDTFGQH 140 F RYQ Y H+D G +S +L+ D GGELV+ + Sbjct: 120 --FARYQPGGFYKKHLDS------FRGRASRMVSVVCYLNPEWQADWGGELVIYGENAED 171 Query: 141 R------VKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 + G LV++ S S+ H V P T+ R + W Sbjct: 172 SGDIRAVITPEMGKLVVFMSESMPHEVLP-TQHPRTSIAGW 211 >UniRef50_B2HZ49 Predicted proline hydroxylase n=18 Tax=Acinetobacter RepID=B2HZ49_ACIBC Length = 201 Score = 53.5 bits (127), Expect = 5e-06, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 15/169 (8%) Query: 11 PQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALF 70 ++ + ++ +A +G V+T + N+ + L + + L Sbjct: 40 AKECSHHFDEFREAGIQNGVVSTIRSDHILWINESLPVAEQHVETLSSFCQH------LN 93 Query: 71 FAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGG 129 A L F Y E Y H D + + +S +L Q GG Sbjct: 94 QAFFLGIKEVEAHFACYNPGEFYALHRDNPQQKND------RIMSTVYYLHPEWQDDWGG 147 Query: 130 ELVVNDTFGQ-HRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 +L + D H + LV++ S+ LH V V++ R++ W++S Sbjct: 148 QLRLQDKNNIWHIITPEPNRLVIFQSNLLHEVL-VSKQQRLSITAWLRS 195 >UniRef50_B0CZ29 Predicted protein n=2 Tax=Agaricales RepID=B0CZ29_LACBS Length = 261 Score = 53.5 bits (127), Expect = 5e-06, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 61/194 (31%), Gaps = 29/194 (14%) Query: 5 IPGVLSPQDVARFREQLEQ-AEWVDGRVTTGAQGAQVKNNQQVDT---------RSTLYA 54 I V +P + A +W ++ V N + S +Y Sbjct: 59 IDNVFTPNECADLIALASSTGDWSPAGLSAEGPTQTVHTNFRNSDRVLVIDEEVSSRIYE 118 Query: 55 ALQNEVLNAVNQHA---LFFAAALPRTLSTPL-----------FNRYQNNETYGFHVDGA 100 L+ V + P P F RY + + + H DG Sbjct: 119 KLRPLVDEICEIAPGSRWSCITSRPGKEQGPTWKMVRINPRLSFLRYGSGQYFKPHCDG- 177 Query: 101 VRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA--GDLVLYPSSSL- 157 + +G ++ ++ L+L++P GG + + + G ++++ L Sbjct: 178 -LNDLLDGKQKSFVTLHLYLNEPDGLTGGATRFWTPDKKEHLDVEPKLGRVLVFQQRMLV 236 Query: 158 HCVTPVTRGVRVAS 171 H VT GV+ Sbjct: 237 HSGEEVTGGVKYTM 250 >UniRef50_A9DQY6 Uncharacterized iron-regulated protein n=1 Tax=Kordia algicida OT-1 RepID=A9DQY6_9FLAO Length = 183 Score = 53.5 bits (127), Expect = 5e-06, Method: Composition-based stats. Identities = 39/176 (22%), Positives = 69/176 (39%), Gaps = 18/176 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQV-----KNNQQVDTRSTLYAA- 55 +Y + LS Q+ + E+ + + +V G QV +NN +V + YAA Sbjct: 8 IYVVDNFLSHQECDELIAKSEKMGYEEAKV--NMHGKQVLMTTVRNNLRVTYKDEAYAAI 65 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 L N++ V + + A + F +Y+ + H DG+ R + + S Sbjct: 66 LWNKIKMHVPEQIGYSYAFGLNEMLR--FYKYEKGHRFKMHRDGSYRRNETEA---SQYS 120 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+D +DGGE V H K G +L+ H + G + Sbjct: 121 FLIYLND--DFDGGETVFRSGTTIHPKK---GSALLFLHGLRHEGAVLKSGTKYVL 171 >UniRef50_D0SK49 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SK49_ACIJU Length = 232 Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 73/188 (38%), Gaps = 28/188 (14%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRV---TTGAQGAQVKNNQQVDTRS-----TL 52 +++ + V S Q+ F E Q + + + V+NN++V TL Sbjct: 39 LIFTVDDVFSDQECLSFIELSNQYHYETADIFLNSARQVLTNVRNNKRVIYDDIQLAETL 98 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 ++ L++ + +N L F RY+N ET+ H DG H N W + Sbjct: 99 FSKLKHLLPKQLNGW------ILSGLNERFRFYRYENGETFKPHWDG---IHEVNDWHSS 149 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQ---------HRVKLPAGDLVLYPSSSLHCVTPV 163 L+ ++LS + + GGE + G V+ G ++++ LH PV Sbjct: 150 KLTLLIYLS--EDFTGGETIFYRDSGMLKPCKETQIASVQPKLGQILVFEHQQLHEGAPV 207 Query: 164 TRGVRVAS 171 G + Sbjct: 208 LSGQKYVL 215 >UniRef50_D2W6G4 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2W6G4_NAEGR Length = 489 Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 55/176 (31%), Gaps = 21/176 (11%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQNEVL 61 +++ + E+A + G T V+ Q+ + + + ++ Sbjct: 12 PIITKNQAFEIIQYFEKAPFGQGDKT--IYDETVRKTWQLSPSQFRITNQHWKQYIDNLV 69 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + L +++ S Y+ + FH D + ATL + Sbjct: 70 EKQVKPGLGIHSSVVVRNSLYKLLLYEEGGHFDFHRD---------TEKEDKMFATLVVQ 120 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPA-GDL----VLYPSSSLHCVTPVTRGVRVASF 172 P Y GGE++V + + G + H V +T G R+ Sbjct: 121 LPSLYSGGEIIVQHADSEETYEFAKEGSSKPFFFSFYCDCNHKVAKLTSGYRLCLV 176 >UniRef50_UPI00016C3513 hypothetical protein GobsU_05758 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3513 Length = 779 Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 54/176 (30%), Gaps = 25/176 (14%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-----TLYAALQNEVLNA 63 L+ +QA + G T V+ Q+D + + + Sbjct: 47 LTAHQAKELAAVCKQAPYGKGEET--LVDTSVRRVWQLDPDHFSLTNPEWDEFLRDAVAT 104 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 V + L L L Y+ + H DG + ATL + P Sbjct: 105 VQRDLGLEKQQLESHLYNLLL--YEPGGFFLPHRDGEKLDR---------MVATLVVVLP 153 Query: 124 QSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL-------HCVTPVTRGVRVASF 172 + GGEL+V + + A L L+ + H V P+ G R+ Sbjct: 154 SPFTGGELIVRHDGEERAIDFGAPGLNLFHTHFAAFYADCEHEVRPLRTGHRLCLV 209 >UniRef50_Q1MT87 Novel protein similar to vertebrate leprecan-like family (Fragment) n=8 Tax=Clupeocephala RepID=Q1MT87_DANRE Length = 676 Score = 53.1 bits (126), Expect = 6e-06, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 47/124 (37%), Gaps = 17/124 (13%) Query: 66 QHALFFAAALPRTLSTPLFNRYQNNET---YGFHVDGAVRSHPQNGWMR-------TDLS 115 + A + +Q + + HVD + R DLS Sbjct: 545 RTPSLLYFAYTHLVCRSAITGHQEGRSDLSHPVHVDNCILEPESRQCWREAPAFTHRDLS 604 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQH---RVKLPAGDLVLYPSSSL--HCVTPVTRGVRVA 170 A L+L+D ++GG+ D + VK G LV + S + H VT VT+G R A Sbjct: 605 AVLYLND--DFEGGDFFFTDRDAKTVTATVKPKCGRLVGFTSGPVNPHGVTAVTKGRRCA 662 Query: 171 SFMW 174 +W Sbjct: 663 LALW 666 >UniRef50_Q2SAS7 FOG: WD40 repeat n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SAS7_HAHCH Length = 505 Score = 53.1 bits (126), Expect = 6e-06, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 58/164 (35%), Gaps = 32/164 (19%) Query: 30 RVTTGAQGAQVK------NN-QQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTP 82 +T G A K NN +QV L L + Q +LP ++P Sbjct: 49 AITRGFSPADAKYPPSYRNNARQVVDDPMLARRLFEVCGQLLPQ-------SLPDAANSP 101 Query: 83 LFN-----------RYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGE- 130 ++ RY +++ H DG ++ + L+ L+L+D + GG+ Sbjct: 102 AWSLHSLNPRLRLCRYSAGQSFFPHQDGVYACPDRSE---SKLTFLLYLNDATEFSGGDT 158 Query: 131 LVVNDTFGQH---RVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L D R GDL+++ S H V G + Sbjct: 159 LFFKDASAAEISARFTPRRGDLIVFDHSLWHSGDTVLSGEKYIL 202 >UniRef50_B0E4Q8 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0E4Q8_LACBS Length = 479 Score = 53.1 bits (126), Expect = 7e-06, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 63/196 (32%), Gaps = 31/196 (15%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVK---NNQQVDTRSTLYAALQNE 59 + I V + + E+ + G+ +N + L + Sbjct: 269 FIINDVFESTECESLVKAAEKVGLLPDEPIAGSAAQLASVLAHNLIWLADTEFIGTLYDR 328 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG----------- 108 +++ + Q + A+ + RY+ Y H+DGA + N Sbjct: 329 IVDLLPQ--IVHGGAVKGINARFRLYRYRPGALYRPHIDGAWPASALNATTSPHSYVYDS 386 Query: 109 --WMRTDLSATLFLSDPQSYDGGELVV------NDTFGQHRVKLPAGDLVLYP-----SS 155 + + L+ ++L+D ++GG VK G + ++P S Sbjct: 387 DPTVYSRLTLLIYLND--DFEGGCTTFFLPSSTQGILEARPVKPRTGTVCVFPHGAAKGS 444 Query: 156 SLHCVTPVTRGVRVAS 171 LH + VT G + Sbjct: 445 LLHEGSGVTSGAKYVI 460 >UniRef50_Q9NDP6 Leprecan n=1 Tax=Ciona intestinalis RepID=Q9NDP6_CIOIN Length = 412 Score = 53.1 bits (126), Expect = 7e-06, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 42/90 (46%), Gaps = 13/90 (14%) Query: 96 HVDGAVR------SHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLPA 146 H D + + + D SA L+L+D ++GGE ++ D + +V+ Sbjct: 298 HSDNCLLKENGSCLKERPAYTWRDYSAILYLND--EFEGGEFIMTDATARRVKVQVRPKC 355 Query: 147 GDLVLYPS--SSLHCVTPVTRGVRVASFMW 174 G LV + + LH V PVT+G R A +W Sbjct: 356 GRLVSFSAGKECLHGVKPVTKGRRCAMALW 385 >UniRef50_A8P9G8 Putative uncharacterized protein n=2 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P9G8_COPC7 Length = 543 Score = 53.1 bits (126), Expect = 8e-06, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 9 LSPQDVARFREQLEQAEWVDG---------RVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 L V E ++ + G R T G++VK N ++ + Sbjct: 89 LDDYKVDEVLEHASRSPFGMGDQKVVDTKVRDTWEIDGSKVKLN------DPFAEWVEYK 142 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 VL V + + Y+ + H D + G AT+ Sbjct: 143 VLTDVWKGLGVATPSTKPRCEFYKLLIYKPGGHFHAHQD----TQKAEGMF-----ATVI 193 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLP----AGDLVL-YPSSSLHCVTPVTRGVRVAS 171 + P Y+GGE+ + + + L G ++ + + LH + PVT G RVA Sbjct: 194 VLLPSEYEGGEVKITHSGKTDIIDLNYISNRGMAIMAWYTDVLHEIKPVTSGYRVAL 250 >UniRef50_A8J470 Prolyl 4-hydroxylase alpha-1 subunit-like protein n=3 Tax=Viridiplantae RepID=A8J470_CHLRE Length = 343 Score = 52.7 bits (125), Expect = 8e-06, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 70/205 (34%), Gaps = 42/205 (20%) Query: 2 MYHIPGVLSPQDVARF----REQLEQAEWVDGRVTTGAQG--AQVKNNQQVDTRSTLYAA 55 ++ G+L+ ++ + R +LE++ D GA + L Sbjct: 76 VFLYKGILTHEECDQLMDNSRSRLERSGVSDATTGAGAVSDIRTSSGMFYERGETELVKR 135 Query: 56 LQNEVLNAVNQHALFFAAALP-RTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 ++N + A+ LP RY+ + Y H D + Sbjct: 136 IENRL--AMW-------TMLPVENGEGIQVLRYEKTQKYDPHHDYFSFDGADDNGGNRMA 186 Query: 115 SATLFLSDPQSYDGGELVVNDTFG---------------QHRVKLPAGDLVLYPS----- 154 + ++L+ P+ +GGE V G VK GD VL+ S Sbjct: 187 TVLMYLATPE--EGGETVFPKVVGWVVQLTTTASAPCRQGLAVKPAKGDAVLFWSIRPDG 244 Query: 155 ----SSLHCVTPVTRGVRVASFMWI 175 SLH PV +GV+ ++ WI Sbjct: 245 RFDPGSLHGSCPVIKGVKWSATKWI 269 >UniRef50_A3WJ18 Prolyl 4-hydroxylase alpha subunit-like protein, 2OG-Fe(II) oxygenase family protein n=2 Tax=Idiomarina RepID=A3WJ18_9GAMM Length = 210 Score = 52.7 bits (125), Expect = 9e-06, Method: Composition-based stats. Identities = 46/189 (24%), Positives = 70/189 (37%), Gaps = 33/189 (17%) Query: 4 HIPGVLS---PQDVARFREQLEQAEWVD---GRVTTGAQGAQVKNNQ--QVDTRSTLYA- 54 IP L + + + Q+ A W GR T V+N++ +D ST A Sbjct: 22 IIPSFLPHAVAEPLYQEASQIASAHWQTAAIGRAQTHTINTMVRNDRIIWLDDNSTYGAP 81 Query: 55 --ALQNEVLNAVNQH---ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 L ++ AVN+ LF Y Y H+D G Sbjct: 82 FLNLMEQLRLAVNRTLFMGLFDYECH--------LAHYPKGAFYKKHLDA------FKGK 127 Query: 110 MRTDLSATLFLSDP-QSYDGGELVVNDTFGQ--HRVKLPAGDLVLYPSSS-LHCVTPVTR 165 L+ L+L+ DGGELV+ G+ +V G LV++ S +H V P ++ Sbjct: 128 SNRKLTTVLYLNPKWSEADGGELVMYGKRGEVLEKVLPKRGTLVVFLSDQFVHEVLP-SQ 186 Query: 166 GVRVASFMW 174 R + W Sbjct: 187 KDRFSLTGW 195 >UniRef50_A8IL40 Predicted protein n=2 Tax=Chlamydomonas reinhardtii RepID=A8IL40_CHLRE Length = 503 Score = 52.7 bits (125), Expect = 9e-06, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 88 QNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQ-SYDGGELVVNDTFGQHRVKLPA 146 ++ G+H D + R +SA L+LSD + GG+ D RV A Sbjct: 134 RSGAALGWHHDAN-----REYLSRRHVSAVLYLSDQGVDFGGGDFRFQDGPEPLRVAPRA 188 Query: 147 GDLVLYPSSS--LHCVTPVTRGVRVASFMW 174 G LV Y + + +HCV V G RVA +W Sbjct: 189 GRLVAYTADARNMHCVERVAWGERVALTLW 218 >UniRef50_Q5CZM3 Leprel2 protein (Fragment) n=4 Tax=Euteleostomi RepID=Q5CZM3_DANRE Length = 244 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 47/124 (37%), Gaps = 17/124 (13%) Query: 66 QHALFFAAALPRTLSTPLFNRYQNNET---YGFHVDGAVRSHPQNGWMR-------TDLS 115 + A + +Q + + HVD + R DLS Sbjct: 58 RTPSLLYFAYTHLVCRSAITGHQEGRSDLSHPVHVDNCILEPESRQCWREAPAFTHRDLS 117 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQH---RVKLPAGDLVLYPSSSL--HCVTPVTRGVRVA 170 A L+L+D ++GG+ D + VK G LV + S + H VT VT+G R A Sbjct: 118 AVLYLND--DFEGGDFFFTDRDAKTVTATVKPKCGRLVGFTSGPVNPHGVTAVTKGRRCA 175 Query: 171 SFMW 174 +W Sbjct: 176 LALW 179 >UniRef50_D0N498 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N498_PHYIN Length = 780 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 77/217 (35%), Gaps = 30/217 (13%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG-AQVKNNQ--QVDTRSTLYAALQNEVL 61 IP L+P+ + + ++ + T + Q Q++ R+ L+ ++ Sbjct: 71 IPLPLAPEHAEKLIAKCAKSPFGHNLDTKMDDSVRKSWQLQPDQLELRNPLWQGGIEKLT 130 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + + P Y + H D + ++G + ATL + Sbjct: 131 ETIAARLGYKGV--PLNCVLYKLLVYGEGGHFLKHQD----TEKEDGMI-----ATLVVQ 179 Query: 122 DPQSYDGGELVVN---DTFGQHRVKLPAGDLVLYP------SSSLHCVTPVTRGVRVASF 172 P +++GG+LVV +H G P S + H + VT+G R+A Sbjct: 180 PPSTHEGGDLVVYRNGQVEHRHDFGKIDGTAAYLPHYAVHYSDAEHALEDVTKGYRLALV 239 Query: 173 MWI------QSMIRDDKKRAMLFELDNNIQSLKSRYG 203 I QS+ RD M EL N + + G Sbjct: 240 YSICLPSSLQSLKRDPNV-TMSAELANAFKEMGPEDG 275 >UniRef50_B7G721 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G721_PHATR Length = 455 Score = 52.3 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 58/139 (41%), Gaps = 25/139 (17%) Query: 50 STLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAV-------- 101 L +++ + + + A+ + S F RY + Y H+DG+ Sbjct: 311 DPLNERVKSLLPPIMKESAVVHSIN-----SRWRFFRYSQDSVYRPHIDGSWPESRINEK 365 Query: 102 --RSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ----HRVKLPAGDLVLYP-- 153 + ++G +++ L+ ++L+D ++GGE + Q V AG ++++P Sbjct: 366 GEYEYDESGSVKSYLTFLIYLND--DFEGGETLFYIPSSQGMSARGVVPKAGAVLVFPQG 423 Query: 154 --SSSLHCVTPVTRGVRVA 170 +S +H + V G + Sbjct: 424 NTASLIHEGSAVANGTKYV 442 >UniRef50_D0Z4R6 SM-20-related protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z4R6_LISDA Length = 237 Score = 52.3 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 62/187 (33%), Gaps = 38/187 (20%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS--------TLYAALQN 58 L+ + V R+ + + A++ N ++ S L Sbjct: 55 DFLNNEQVEHLRQCIPD----------NWKKARIGRNDEIMRESSIRSDKIQWLTPEQGW 104 Query: 59 EVLNAVNQHALF-----FAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 + + + + + L F +Y+ + Y H+D N R Sbjct: 105 PIQDYLERMEVIRREVNQNFFLGLFEYEAHFAKYEQGDFYQKHLD----CFKGNENRR-- 158 Query: 114 LSATLFLSD---PQSYDGGELVVNDTFGQHRVKLPA--GDLVLYPSSSL-HCVTPVTRGV 167 L+ ++++ P+ GGELVV D Q + G L ++ S H V P T Sbjct: 159 LTTVFYMNESWSPED--GGELVVYDLNDQKITTIAPKSGRLFIFLSEKFPHEVLP-TNAE 215 Query: 168 RVASFMW 174 R + W Sbjct: 216 RFSIAGW 222 >UniRef50_C1MRD1 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MRD1_9CHLO Length = 613 Score = 52.3 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 35/90 (38%), Positives = 43/90 (47%), Gaps = 18/90 (20%) Query: 96 HVDGAVRSHPQNGWMRTDLSATLFLSDPQS-YDGGELVVNDTFGQHRVKLPA-GDLVLYP 153 HVD A + D SA L+L+ P + +DGGE V D V LPA G L+L+ Sbjct: 452 HVDKANIASY-------DYSAVLYLNAPGACFDGGEFVFRDGGDVDEVVLPAVGRLLLFA 504 Query: 154 S--SSLHCVTPVTR-------GVRVASFMW 174 S +LH VT VTR R A MW Sbjct: 505 SGAENLHQVTAVTRRKDARAPAARFALAMW 534 >UniRef50_A4RVI8 Predicted protein n=2 Tax=Ostreococcus RepID=A4RVI8_OSTLU Length = 378 Score = 52.3 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 31/136 (22%), Positives = 51/136 (37%), Gaps = 13/136 (9%) Query: 78 TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTF 137 + P Y E + H DG + + + + ++L+D +GGE Sbjct: 219 SFELPQVAHYSGGEYFKAHEDGFPIAVAADKGYQRRATILVYLNDVD--EGGETRFEHLG 276 Query: 138 GQHRVKLPAGDLVLYPSS--------SLHCVTPVTRG-VRVASFMWIQSMIRDDKKRAML 188 + K LV +PSS +LH TP G + S +WI S L Sbjct: 277 IEVAPKKGK-ALVFFPSSAACMPDARTLHTATPAKEGHEKWVSQLWIASSTPPVPTPEEL 335 Query: 189 FELDNNIQSLKSRYGE 204 E DN ++ ++ Y + Sbjct: 336 -EADNKRKAAEAEYAK 350 >UniRef50_UPI00006A17CC Probable G-protein coupled receptor 162 (Gene-rich cluster gene A protein). n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A17CC Length = 544 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 31/127 (24%), Positives = 48/127 (37%), Gaps = 21/127 (16%) Query: 96 HVDGAVRSHPQNGWMR-------TDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLP 145 H D V + R D S L+L+ + + GG + G ++ Sbjct: 424 HADNCVLDTEEKECWREPPAYVHRDYSGLLYLN--EDFQGGNAFFTEMDGTTITAELRPS 481 Query: 146 AGDLVLYPS--SSLHCVTPVTRGVRVASFMW-IQSMIRDDKKRAMLFELDNNIQSLKSRY 202 G LVL+ S + H V PVT G R A +W QS +++R+ ++ R Sbjct: 482 CGRLVLFSSGGENAHGVRPVTEGKRCAVALWFTQSAEHAEQERS------QARALMRDRD 535 Query: 203 GESEEIL 209 G E Sbjct: 536 GPQESTE 542 >UniRef50_B7FSE0 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FSE0_PHATR Length = 360 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 44/138 (31%), Gaps = 24/138 (17%) Query: 92 TYGFHVD--GAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVND-----------TFG 138 Y D + + LS ++ ++DP ++GG + + G Sbjct: 146 HYQTKTDCNQGSMDRLEPHRDGSILSFSITINDPDDFEGGGTLFDGLRDVVSTSSVLKNG 205 Query: 139 QHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFM------WIQ-----SMIRDDKKRAM 187 AGD V + +LH +T G R W + S D R Sbjct: 206 GVVRPTRAGDAVFHSGKALHGANAITSGKRTVLVGFVDVAPWCERPGALSAACRDWGRMD 265 Query: 188 LFELDNNIQSLKSRYGES 205 + L QS K+R G Sbjct: 266 VASLRYERQSKKTRSGAK 283 >UniRef50_A8P9H2 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P9H2_COPC7 Length = 946 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 68/179 (37%), Gaps = 32/179 (17%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR----------STLYAALQN 58 L+ ++ +QA + G+ T ++N ++++ + L A + Sbjct: 59 LNEREAKAIIASSKQAPFGKGKKT--LVDKTIRNTWEIESDKVTFSNPKWTTWLEATVFK 116 Query: 59 EVLNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 V +++ PR L L Y+ + H D + AT Sbjct: 117 TVWDSLGVAPY---TTRPRCELYKLLV--YEKGSHFKAHQDTQKADG---------MFAT 162 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVAS 171 + + P +++GGE++++ + V + A + + + +H V P+T G R+A Sbjct: 163 VIVVLPSAFEGGEVILSHSGATETVDITANSAMETSILAWYTDVMHEVRPITSGYRLAL 221 >UniRef50_Q6LGS5 Putative uncharacterized protein NCU03445 n=1 Tax=Photobacterium profundum RepID=Q6LGS5_PHOPR Length = 255 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 65/197 (32%), Gaps = 30/197 (15%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + + VL P++V R + Q + + + + + +N + + Sbjct: 53 FQLRDVLLPEEVTRILDAANQLGFTEDAAVSLPRKVRHNSNLNLIVDPATLELIWQRCQA 112 Query: 63 A-VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH---------PQNGWMRT 112 V++++ F A F RY+ + + H DG+ G + Sbjct: 113 HFVDKYSHFAGKAPLGINGRFRFYRYEEGDYFKMHTDGSWPGSQVVNGELVDDAFGDRWS 172 Query: 113 DLSATLFLSDPQSYDGGELVV-------------NDTFGQHRVKLPAGDLVLYPSSS--- 156 + + LSD + GGE ++ V+ P+G ++ +P + Sbjct: 173 MYTFLILLSD--DFVGGETQFMVNRDDPTKPALYQESANIESVRTPSGSVLCFPHGTHPI 230 Query: 157 --LHCVTPVTRGVRVAS 171 LH + G + Sbjct: 231 HCLHGSAQILSGTKYII 247 >UniRef50_D2VV82 Prolyl 4-hydroxylase alpha subunit family protein n=4 Tax=Naegleria gruberi RepID=D2VV82_NAEGR Length = 659 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 34/146 (23%), Positives = 55/146 (37%), Gaps = 18/146 (12%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVV-------NDT 136 ++Y+ E + H DG R + T L ++L+ Q + GGE + Sbjct: 149 LSKYEPGEYFKIHTDGQFRRSEHERSIYTLL---IYLN--QDFKGGETRFYNDPTKTDSD 203 Query: 137 FGQ----HRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKK--RAMLFE 190 F + H +K G L L+ H PVT+G + I + D R FE Sbjct: 204 FEEYSLLHTLKPSLGQLALFNQDFYHEGCPVTKGTKYILRTEIMYLRVDSLSIPRDEKFE 263 Query: 191 LDNNIQSLKSRYGESEEILSLLNLYH 216 Q + + ESE + ++Y Sbjct: 264 QSEAYQKIGQLFKESERLEKNGDVYA 289 >UniRef50_Q5N1K6 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N1K6_SYNP6 Length = 300 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 67/199 (33%), Gaps = 35/199 (17%) Query: 5 IPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 I LSP+ + + L ++++ + G + + +Y+ + ++L Sbjct: 112 ILDFLSPEKLQQLWNYLLTARSQFNPAHNSAGLNNYRQ--SLFTAPPPEIYSEISEKILG 169 Query: 63 AVNQHALFFAAALPRTLSTP-----LFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 A+ A LP + + + Y H D + R Sbjct: 170 ALIP----IADELPNSSQEIGEIEMQITAHNDGHYYKIHNDNGSP----DTATRFLTYVY 221 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGD-----------LVLYPSSSLHCVTPVT-- 164 F P+ + GGEL + + + + AGD L+++PS +H V P+ Sbjct: 222 YFYRQPKPFTGGELRLYELAIKDGFYV-AGDRYQDIEPLHNSLIVFPSHYMHEVLPIRCP 280 Query: 165 ----RGVRVASFMWIQSMI 179 R WI+ I Sbjct: 281 SQRFEDSRFTVNGWIRVAI 299 >UniRef50_B8MEI3 Putative uncharacterized protein n=1 Tax=Talaromyces stipitatus ATCC 10500 RepID=B8MEI3_TALSN Length = 914 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 48/241 (19%), Positives = 85/241 (35%), Gaps = 40/241 (16%) Query: 3 YHIPGV------LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----ST 51 H+PG+ ++P V + + + G T V+ + Q+D + Sbjct: 82 LHVPGIGAIGLPVTPDQVKAMIQSSRMSPYGKGSET--LVNESVRKSWQLDANQFSLQNP 139 Query: 52 LYA-ALQNEVLNAVNQHALFFAAALPRTLSTPLFNR--YQNNETYGFHVDGAVRSHPQNG 108 L+ L N A+ L A P + L+ Y+ + H D Sbjct: 140 LWKAQLDNFKKEAITGLGL---TANPEEVKAELYKLLIYEEGAFFLPHQDSEKADG---- 192 Query: 109 WMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP-----SSSLHCVTPV 163 + ATL +S P + GG++V + + + + +H V PV Sbjct: 193 -----MFATLVVSLPSKHQGGDVVASHKDKKMIFSTAGNSEFGFSWAAWYADVMHEVKPV 247 Query: 164 TRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 G R+ +MI AM+ + ++ S+ S ++ N HNL R WS Sbjct: 248 VSGYRIVLVY---NMIHRPS--AMIVKARDSEMGYLSKLLASWA-RAVENSMHNL-RSWS 300 Query: 224 E 224 + Sbjct: 301 D 301 >UniRef50_A5BUA7 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BUA7_VITVI Length = 282 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 31/134 (23%), Positives = 50/134 (37%), Gaps = 30/134 (22%) Query: 75 LPRTLSTPLFN--RYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELV 132 L + FN RY+ + Y H D + + ++LSD + +GGE + Sbjct: 153 LTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVE--EGGETM 210 Query: 133 VNDTFG-------------QHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVA 170 G +VK GD +L+ S +SLH PV +G + Sbjct: 211 FPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWV 270 Query: 171 SFMWIQSMIRDDKK 184 + WI RD ++ Sbjct: 271 ATKWI----RDQEQ 280 >UniRef50_B5Y446 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y446_PHATR Length = 321 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 61/158 (38%), Gaps = 22/158 (13%) Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 + ++V +N H F L L+ L+ Y Y H D P + + S Sbjct: 154 ILDQVKADLNSH---FGKPLDSQLTELLYAFYPQGGFYRRHRDA----IPGSASTLREYS 206 Query: 116 ATLFLS-DPQSYDGGELVVNDTFGQHRVK----------LPA-GDLVLYPSSSL-HCVTP 162 L+L+ D DGG+L ++ G + LP G LVL+ S+++ H V Sbjct: 207 LLLYLNKDWNEQDGGQLRLHFDSGDDELPAGEEAQCRDVLPQSGTLVLFRSNAIPHEVLD 266 Query: 163 VTRGVRVASFMWIQSMIRDDKKRAMLF-ELDNNIQSLK 199 T+ RVA W + + +L+ +L Sbjct: 267 -TQKERVAIIGWYNRPVASTDIGELAGVDLNPTRVALM 303 >UniRef50_C7YTN2 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YTN2_NECH7 Length = 984 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 59/178 (33%), Gaps = 27/178 (15%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR---------STLYAA 55 I LS + + QA + G T V+N ++D + Sbjct: 75 IATPLSEFQACQMIAKARQAPYGKGSET--IVDTSVRNTWELDPSQFELRDPTWTAQVQI 132 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 L +V + + A L L Y+ + H DG+ P D+ Sbjct: 133 LCKQVAKTLGINGNIKA-----ELYKMLI--YEKGAMFKAHTDGSTEKIP-------DMF 178 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLP--AGDLVLYPSSSLHCVTPVTRGVRVAS 171 TL + P ++ GG++V+ H + A + S H V PVT G R Sbjct: 179 GTLVVCLPSTHQGGDVVLRHNGQAHVFRSSDHAQSCAFWYSDVSHEVLPVTSGYRWVL 236 >UniRef50_D1I073 Whole genome shotgun sequence of line PN40024, scaffold_11.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1I073_VITVI Length = 370 Score = 51.5 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 43/117 (36%), Gaps = 17/117 (14%) Query: 88 QNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD-PQSYDGGELVVNDTFGQHRVKLPA 146 + G+H D Q D +A +L+ + GG D L A Sbjct: 91 TRGASIGWHSDDNRPYLKQ-----RDFAAVCYLNSYGNDFKGGLFHFQDGDPTTIEPL-A 144 Query: 147 GDLVLYPSS--SLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSR 201 GD+V+Y + ++H V +T G R+ +W R + D + L S+ Sbjct: 145 GDVVMYTADCRNIHSVDEITDGERLTLTLW--------FSRDCSHDEDAKLVCLLSQ 193 >UniRef50_Q11WG9 Probable proline hydroxylase n=3 Tax=Bacteroidetes RepID=Q11WG9_CYTH3 Length = 196 Score = 51.1 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 26/100 (26%), Positives = 42/100 (42%), Gaps = 9/100 (9%) Query: 78 TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS-DPQSYDGGELVVNDT 136 + Y+ Y H+D QN R S +L+ D DGGEL ++ T Sbjct: 102 KAYEFHYTIYEQGAFYKRHIDQ-----FQNDSNRA-FSIVSYLNTDWIETDGGELCIHHT 155 Query: 137 FGQHRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFMWI 175 + R+ G V + S+ + H V P T+ R++ W+ Sbjct: 156 AAEQRISPTNGKTVFFKSNEIEHEVLP-TQANRLSITGWL 194 >UniRef50_B3PB13 SM-20 domain protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PB13_CELJU Length = 232 Score = 50.8 bits (120), Expect = 3e-05, Method: Composition-based stats. Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 9/96 (9%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQH-- 140 F Y+ + Y H D ++ + G LS +L+ S DGGELV+ D +H Sbjct: 135 FALYEPGDFYQKHRDAFRDTNARAG---RKLSTVYYLNPDWTSLDGGELVLYDEADEHLL 191 Query: 141 -RVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 RV G L+++ S H V P R R + W Sbjct: 192 ERVAPKQGRLLVFLSEDFPHEVLPARR-PRKSIAGW 226 >UniRef50_UPI000051007A 2OG-Fe(II) oxygenase n=1 Tax=Brevibacterium linens BL2 RepID=UPI000051007A Length = 785 Score = 50.8 bits (120), Expect = 3e-05, Method: Composition-based stats. Identities = 43/182 (23%), Positives = 63/182 (34%), Gaps = 32/182 (17%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT-----RSTLYAALQNEVLNA 63 + D+A E A + G T V++ +D AA + A Sbjct: 48 IGEDDIAALISIAEPAHFGSGEDTVFDP--TVRDTWVIDPAHVHLGDPTGAAGTDNWTGA 105 Query: 64 VNQHALFFAAAL--PR------TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 +N+ +AAL P L + L Y + + H D D S Sbjct: 106 LNESLGCLSAALGIPADATVRAELHSMLV--YGEGQFFLPHQDS-----------EKDDS 152 Query: 116 A--TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVL--YPSSSLHCVTPVTRGVRVAS 171 TL + P ++ GGELVV+D D+ L + + H V PVT G RV Sbjct: 153 MLATLVVGLPTTHTGGELVVDDHGADRIFTGDPNDITLAAFYADRRHEVKPVTSGHRVTL 212 Query: 172 FM 173 Sbjct: 213 TF 214 >UniRef50_B8BZB7 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8BZB7_THAPS Length = 313 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 68/213 (31%), Gaps = 46/213 (21%) Query: 3 YHIPGVLSPQDVARFREQL----EQAEWVD---GRVTTGAQGAQVK-------------- 41 IP L V R+ + + + G+ +T ++ Sbjct: 62 LVIPNFLPRDLVEELRQDIGKLRDNGAFRQAKIGQDSTNELNKSIRIAETCFLGRNRPEL 121 Query: 42 ----NNQQVDTRSTLYAALQNEVLNAVNQHALFFA-AALPRTLSTPLFNRYQNNETYGFH 96 N + RS + + V +++ + + A L ++L+ L+ Y Y H Sbjct: 122 INISGNDSIRDRSGGLYQILDVVCDSLVELCWKESEAKLDKSLTELLYAYYPTGGFYRRH 181 Query: 97 VDGAVRSHPQNGWMRTDLSATLFLSDPQ---SYDGGELVVNDTFGQHRVKLP-------- 145 D P + + S L+L+ GG+L ++ G V Sbjct: 182 RDA----IPGSASVLRKYSLLLYLNRDDWSPEKGGGQLRIHLDGGGDEVLQGVEPNFVDV 237 Query: 146 ---AGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 G LVL+ S + H V T R A W Sbjct: 238 DPLGGTLVLFKSELIPHEVLD-TNSERFAIVGW 269 >UniRef50_B9ZRT4 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZRT4_9GAMM Length = 251 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 52/189 (27%), Positives = 72/189 (38%), Gaps = 24/189 (12%) Query: 6 PGVLSPQDVARFRE---------QLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAAL 56 P L + V R+ QL QA G + + +D S A Sbjct: 63 PDFLPAESVDALRDEVYALRDAAQLAQARIGRGGERHHDRATRGDWIHWLDGASPAQQAF 122 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + Q L T S F Y Y HVD Q G R LS Sbjct: 123 MERLDAIRLQVGRTLIPGLFETESH--FALYPPGTHYARHVDA-----FQAGNCRR-LSL 174 Query: 117 TLFLS-DPQSYDGGELVVNDTFGQH--RVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASF 172 +L+ D Q DGG+L + D G+ R++ AG LV++ S S+ H V P TR R + Sbjct: 175 VFYLNRDWQEQDGGQLAIYDDAGRECQRIQPTAGTLVMFLSQSVPHAVLP-TRRWRASIA 233 Query: 173 MWIQSMIRD 181 W++ +RD Sbjct: 234 SWMR--VRD 240 >UniRef50_Q1DA84 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DA84_MYXXD Length = 206 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 65/179 (36%), Gaps = 20/179 (11%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVT--TGAQGA-QVKNNQQV-----DTRSTL 52 ++ + +LS ++ A E++E +T G ++NN +V TL Sbjct: 29 LVIVLRDLLSAEECAALIERIEAEGPTAAPITTSAGFVMRPDIRNNSRVMFDDVPLAQTL 88 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 + + V + + A RY E + H DGA R+ Sbjct: 89 FERVAPHVPHRLEHEWTLCGANERLRCY-----RYDVGEYFAPHFDGAFVRTRDE---RS 140 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L+ ++L++ GG + G H V G +L+ LH VT+G + A Sbjct: 141 LLTFMVYLNECP---GGGATNFLSLG-HSVTPRTGSALLFNHRLLHEGATVTQGRKYAL 195 >UniRef50_B5YLK5 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YLK5_THAPS Length = 207 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 68/200 (34%), Gaps = 41/200 (20%) Query: 2 MYHIPGVLSPQDVARFREQ----------LEQAEWVDG----RVTTGAQGAQVKNNQQVD 47 + I G LS ++ RF E +DG + ++G + Sbjct: 11 VVAIEGFLSDEECNRFIELGGDRYERSTEYASTMNLDGTFDSKESSGRTSTNTWCGEGCR 70 Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALP-RTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 + +V+ + +P RY+ + Y H D + SH Sbjct: 71 DDP-----IIKKVIERMES-----LTGIPYANFEDLQLVRYEIGQRYEEHHDYS-SSHEG 119 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS---------- 156 + L+ +L+D + +GG ++ K G +++PS++ Sbjct: 120 TQYGPRILTVFFYLNDVE--EGGGTQFDELDFVTEPK--RGMALIWPSTTNEAPDVMDDW 175 Query: 157 -LHCVTPVTRGVRVASFMWI 175 H PVT+G++ + WI Sbjct: 176 TWHEALPVTKGIKYGANTWI 195 >UniRef50_B7GCB6 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7GCB6_PHATR Length = 199 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 71/194 (36%), Gaps = 25/194 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEW----VDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 +Y I L+P ++ R ++ ++ VD + G + V + T + Sbjct: 8 IYIIEDFLTPTELDYLRSKICAGKFQRSYVDAIESGG--NSIVDKEHRTSTFLSFGKQQD 65 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFH-------VDGAVRSHPQNGWM 110 ++V + + A R + RY + +G H DG V P++ + Sbjct: 66 SKVASIEAKAATILGCWSSRIVEPLQLVRYLPGQFFGEHHDMGDLQQDGTVALPPKSLFS 125 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCVT 161 + L TLF + GG + + +V G V++ S ++H Sbjct: 126 KRRL-VTLFCYLNKVEKGGATGF--RYCELKVPPKPGRAVMFSNVLPDGMPDSRTVHSGE 182 Query: 162 PVTRGVRVASFMWI 175 PV GV+ +WI Sbjct: 183 PVLDGVKYGLNIWI 196 >UniRef50_A4RT30 Protein Lysyl hydroxylase fusion protein, putative n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RT30_OSTLU Length = 618 Score = 50.4 bits (119), Expect = 4e-05, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 33/103 (32%), Gaps = 21/103 (20%) Query: 78 TLSTPLFNRYQ---NNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVN 134 + +Y H D S TL L+DP Y GG Sbjct: 529 RVHDAFIVKYDASDGQCQLPVHTDQGH------------FSITLSLNDPIQYKGGG---- 572 Query: 135 DTFGQHRV--KLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 F +H + GD V + S H P+T GVR ++ Sbjct: 573 TIFPEHEFIVRPKCGDFVAFRSYLTHGGVPITSGVRYIVVAFL 615 >UniRef50_B8C4F7 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4F7_THAPS Length = 467 Score = 50.4 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 65/192 (33%), Gaps = 26/192 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWV----DGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 + LS +V E + G + ++ + + + + + Sbjct: 266 VVVFNNFLSDNEVDDLIRGGEMEGFERSTDQGAANALGEQEKIVSQTRTSSNAWCMHKCE 325 Query: 58 NE--VLNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 V +A + +PR + Y N+ Y H D + R H G L Sbjct: 326 RLGGVRSATTKIEDV--TGIPRVNYESFQLLNYGQNQFYRSHHDSSSRDHTPPGP--RIL 381 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS-----------LHCVTPV 163 + L+LSD + +GGE N VK G +++PS H V Sbjct: 382 TFFLYLSDVE--EGGETYFNKL--DLAVKPKKGRALVWPSVVDNDPEFWDARMYHEAKDV 437 Query: 164 TRGVRVASFMWI 175 +G ++A+ WI Sbjct: 438 IKGKKLAANHWI 449 >UniRef50_B0C2I7 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C2I7_ACAM1 Length = 184 Score = 50.0 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 67/178 (37%), Gaps = 18/178 (10%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAE------WVDGRVTTGAQGAQVKNNQQVDTRS-TLY 53 ++ IP +L+ ++ ++ Q DG V+NN++V +L Sbjct: 10 LIIEIPNILTFKECDELMGKINQLNPSLATVRNDGEAEIN---TNVRNNERVVFSDFSLA 66 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 L + V L RY+ + H DG++ +NG ++ Sbjct: 67 EKLFLKAQEYVP--PTMQGRILLSANERFRCYRYKVGMKFSPHYDGSL---ERNGNEKSY 121 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 S ++L+D ++GG+ T + G +L+ LH V+RGV+ + Sbjct: 122 YSFLVYLND--DFEGGQTNF-LTESICSITPRKGFGLLFQHLILHEGVEVSRGVKYVA 176 >UniRef50_B0DEZ6 Predicted protein n=2 Tax=Agaricomycetes RepID=B0DEZ6_LACBS Length = 1203 Score = 49.6 bits (117), Expect = 7e-05, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 56/175 (32%), Gaps = 14/175 (8%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT------RSTLYAALQNEVLN 62 LS +D QA + G T +V++ +++ +QN + Sbjct: 261 LSERDAKSIISCSAQAPFGHGERTV--VDREVRDTWEIEPSNLKFLNPAWEPYIQNLAMT 318 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD-LSATLFLS 121 +V Q + Y+ D P + D + ATL + Sbjct: 319 SVWQGLGVVPYSTLPKCELYKLLLYETGSQSRIMADDFFSFLPHQDTQKADGMFATLIIV 378 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGD-----LVLYPSSSLHCVTPVTRGVRVAS 171 P Y GGE+ V + L + + + H V PVT G R+A Sbjct: 379 LPSLYTGGEVHVTHASKTMVIDLSPNSLLSTCALAWYTDVKHEVKPVTSGYRLAL 433 >UniRef50_C1N7Y0 Predicted protein (Fragment) n=2 Tax=Micromonas RepID=C1N7Y0_9CHLO Length = 205 Score = 49.6 bits (117), Expect = 7e-05, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 63/195 (32%), Gaps = 34/195 (17%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYA-ALQNEVLNAVNQ- 66 L+P RE+L+ E + + N D L+ L +L+ + + Sbjct: 17 LTPTLCRLLREELDHFE-----RSGMPRARP---NTMNDDGVLLHELGLCENLLDPLLRE 68 Query: 67 --HALFFAAALPRTL--------STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + A PR + Y G D A ++ Sbjct: 69 YIAPMARALYHPRAVPGCDTLDHHRSFSVSYDATTAAGKDADLAYHFDDAE------VTI 122 Query: 117 TLFLSDPQSYDGGELVVNDTFGQ----HRVKLPA----GDLVLYPSSSLHCVTPVTRGVR 168 + +S +++DGGEL+ G+ H + P G V++ H T G R Sbjct: 123 NVCVSPREAFDGGELLFGGVEGEAGSSHARRCPRFHREGVAVMHAGKHCHEAMATTEGRR 182 Query: 169 VASFMWIQSMIRDDK 183 W +S+ R + Sbjct: 183 TNLIAWCRSLARRAE 197 >UniRef50_Q2S9H1 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S9H1_HAHCH Length = 279 Score = 49.6 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 67/189 (35%), Gaps = 32/189 (16%) Query: 3 YHIPGVLSPQDVARFREQLEQ--AEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 I L V EQ + + G+V+ + A + N + L + Sbjct: 62 IVISDFLPRVRVDELLTLAEQRISRFEPGKVSGAREIAPERRNSLRLRDPLIEKKLNDWF 121 Query: 61 LNAVNQH----------ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM 110 + ALF + + F Y N + H D + G M Sbjct: 122 SPHFRKRLTDYCSQLDVALFEISEIELK-----FCCYPNGAYFHIHRDDQAPNSEATG-M 175 Query: 111 RT----DLSATLFLSD-PQSYDGGELVV------NDTFGQHRVK-LPA--GDLVLYPSSS 156 RT +S + P+S+ GGEL + +D + +HR++ +P LVL+PS Sbjct: 176 RTPGVRRISFAYYFHRRPKSFTGGELQLYATDRKHDIYSRHRIESIPPHFNTLVLFPSGF 235 Query: 157 LHCVTPVTR 165 H V +T Sbjct: 236 YHEVLKITE 244 >UniRef50_A6VYC6 2OG-Fe(II) oxygenase n=2 Tax=Marinomonas RepID=A6VYC6_MARMS Length = 230 Score = 49.6 bits (117), Expect = 9e-05, Method: Composition-based stats. Identities = 28/97 (28%), Positives = 38/97 (39%), Gaps = 13/97 (13%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQH-- 140 F RY+ Y H+D G LS L+L++ Q DGGELV+ D Sbjct: 126 FARYEEGAFYEKHIDA------FKGESNRILSTVLYLNEDWQDGDGGELVIYDENDPSVE 179 Query: 141 --RVKLPAGDLVLYPSS-SLHCVTPVTRGVRVASFMW 174 R G L ++ S H V V + R + W Sbjct: 180 VGRFFPKKGRLAVFLSECFYHEVM-VAKRTRHSIAGW 215 >UniRef50_C7YTL8 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YTL8_NECH7 Length = 923 Score = 49.6 bits (117), Expect = 9e-05, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 38/106 (35%), Gaps = 19/106 (17%) Query: 77 RTLSTPLFNR-------YQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGG 129 R PL + Y+ TY H D G AT+ +S P ++GG Sbjct: 121 RRFLRPLVSVRVEKMVIYEKGATYKAHTD----IEENKGVF-----ATVMISLPSEHEGG 171 Query: 130 ELVV-NDTFGQHRVK--LPAGDLVLYPSSSLHCVTPVTRGVRVASF 172 ++V + R K L + + H +TP+T G R Sbjct: 172 DMVFEHGGQKPKRYKSCLETQSFAFWYTGVSHRLTPITSGYRWVLV 217 >UniRef50_B9SNI2 Oxidoreductase, putative n=1 Tax=Ricinus communis RepID=B9SNI2_RICCO Length = 397 Score = 49.2 bits (116), Expect = 9e-05, Method: Composition-based stats. Identities = 24/117 (20%), Positives = 44/117 (37%), Gaps = 17/117 (14%) Query: 88 QNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD-PQSYDGGELVVNDTFGQHRVKLPA 146 + G+H D Q D +A +L+ + + GG + + V + A Sbjct: 93 TRGASIGWHSDDNRPYLKQ-----RDFTAVCYLNSYAKDFKGGLFHFQEGEPRTVVPM-A 146 Query: 147 GDLVLYPSSS--LHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSR 201 G++ +Y + S +H V + G R+ +W R + D + SL S Sbjct: 147 GNVAIYTADSCNVHSVDEIIEGERLTLTLW--------FSRDSTHDEDAKLISLLSE 195 >UniRef50_B7VAY0 Putative enzyme n=7 Tax=Pseudomonas aeruginosa RepID=B7VAY0_PSEA8 Length = 230 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 41/109 (37%), Gaps = 12/109 (11%) Query: 79 LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG 138 + NR + G H+D D ++ L + GG+ VV D G Sbjct: 130 VRRMQVNRMKAGSFIGRHLDTDSNP---------DYQYSIVLQLGTYFSGGQFVVYDRDG 180 Query: 139 QHR--VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKR 185 R +K +++ S H V VT G RV+ ++ S D +R Sbjct: 181 NLRNDIKPEPRSVIISDCSYPHEVQRVTAGERVSLVFFV-SRHADRNRR 228 >UniRef50_A4RTV5 Predicted protein n=2 Tax=Ostreococcus RepID=A4RTV5_OSTLU Length = 225 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 64/202 (31%), Gaps = 38/202 (18%) Query: 1 MMYHIPGVLSPQDVARFREQ----LEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAAL 56 +++ + LS ++ + E ++++ DG+++ G L + Sbjct: 29 LLFVLEDFLSEEEGDQLIEIARPSMQRSRVTDGKLSEGRTSTST-FLTGARAHDDLVLEI 87 Query: 57 QNEVLNAVNQHALF----FAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 + + A+ + L + +Y E Y H D + G ++ Sbjct: 88 ERRIQAAI-RLPLIVERRKNVKVMYQHEPMQIVQYGPTERYTAHYDN------RAGSLKR 140 Query: 113 DLSATLFLSDPQSYDGGELVVN----------DTFGQHRVKLPAGDLVLY---------P 153 ++ +L +P+ +GG T G G +L+ Sbjct: 141 SMTFMCYLQEPE--EGGATFFPKCVPLCGCDSTTLGIRVFP-KRGRAILFWNVGENGQEA 197 Query: 154 SSSLHCVTPVTRGVRVASFMWI 175 SLH PV G + W+ Sbjct: 198 MRSLHEAQPVVSGKKAIFTQWL 219 >UniRef50_C3ZC48 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZC48_BRAFL Length = 235 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 61/181 (33%), Gaps = 18/181 (9%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTG----AQGAQVKNNQQ-VDTRSTLYAALQ 57 + + V S ++ Q E + + G +N+++ + + + Sbjct: 41 FIVDNVFSKKECEELIRQTEDQGYEVAMLNVGGGRQILATDYRNSERCIMDSTERAEQIW 100 Query: 58 NEVLNAV-NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + V + A L L F +Y + H+DG+ R R+ L+ Sbjct: 101 ERIKQYVPRRWARRKVLGLNERL---RFLKYGPGNYFHPHMDGSYRRENGE---RSYLTL 154 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKL------PAGDLVLYPSSSLHCVTPVTRGVRVA 170 L+L++ + + ++K G ++++ H VT GV+ A Sbjct: 155 MLYLNEGSTGGATNFISPMFATGDKIKEKVPVIPKPGRVLVFQHDIYHEGEEVTAGVKYA 214 Query: 171 S 171 Sbjct: 215 M 215 >UniRef50_D1LX56 Leprecan-like protein n=1 Tax=Saccoglossus kowalevskii RepID=D1LX56_SACKO Length = 680 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 30/120 (25%), Positives = 52/120 (43%), Gaps = 17/120 (14%) Query: 96 HVDGAVRSHPQN------GWMRTDLSATLFLSDPQSYDGGELVVN--DTFGQHRVKLPAG 147 H D N ++ D SA ++L+ + ++GGE + + + V+ G Sbjct: 560 HADNCWLDDKGNCIKKSPAYVWRDYSALMYLN--EDFEGGEFIFARFNKTVEASVQPKCG 617 Query: 148 DLVLYPS--SSLHCVTPVTRGVRVASFMWIQSMIRDDKK-----RAMLFELDNNIQSLKS 200 LV + + +LH V VT+G R A MW + D+K R +L L ++ L+ Sbjct: 618 RLVSFSAGEENLHGVKAVTKGRRCALAMWYTLDTKYDEKAHIEAREILENLQHDELYLRR 677 >UniRef50_Q2JRT8 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JRT8_SYNJA Length = 296 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 46/139 (33%), Gaps = 26/139 (18%) Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 L +VL ++ P + + Y H D + Sbjct: 166 QMLIPQVLKSLKMEP------FPIAYIECQMTAHNHGNYYKVHNDNGSPDAKERELTY-- 217 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA----------GDLVLYPSSSLHCVTPV 163 F +P+ + GGEL++ D+ ++ + + A ++ +PS +H V PV Sbjct: 218 --VYYFYREPKQFSGGELLIYDSEVRNNMYVKADTYKTYVPANNTIIFFPSYLMHEVLPV 275 Query: 164 T------RGVRVASFMWIQ 176 T R W++ Sbjct: 276 TCPSRQFADSRFTVNGWVR 294 >UniRef50_B7G8Q9 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G8Q9_PHATR Length = 474 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 54/144 (37%), Gaps = 25/144 (17%) Query: 47 DTRSTLYAALQN-EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHP 105 T + Y + EV + + F P + RY+ + Y H D + Sbjct: 326 STNAWCYNECDDHEVTQIIWERMTFLTQIPPENSESLQMLRYEPGQFYAVHHD-----YI 380 Query: 106 QNGWMR----TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS------- 154 +N W R L+ L+L+D + +GG + + V+ G +L+PS Sbjct: 381 ENDWNRAVGSRILTVFLYLNDVE--EGGATNFPEL--ELAVQPKRGRALLWPSVLDQYPH 436 Query: 155 ----SSLHCVTPVTRGVRVASFMW 174 + H VT+G++ + W Sbjct: 437 KKDDRTEHEAQVVTKGIKYGANAW 460 >UniRef50_D2V7Q2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V7Q2_NAEGR Length = 312 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 76/217 (35%), Gaps = 50/217 (23%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVT--TGAQGAQVKNN--QQVDTRSTLYAALQN 58 + + V++P + F E E+ + ++ G V NN +Q+ + L Sbjct: 101 FVLENVITPLECKLFVEISEKMGYKPSPLSVLAGKFDTSVINNSTKQIRDSERILTDLPE 160 Query: 59 EVLNAVNQHALFFAAALPRTL----------------STPLFNRYQNNETYGFHVDGAVR 102 +V+ +N+ LP + FN+Y + +G H+D R Sbjct: 161 KVIEVLNKR---IEHLLPEKVDIYGEEWTLRKNTPINERIRFNKYGVTQKFGPHMDAGYR 217 Query: 103 SHPQNGWMRTDLSATLFLSDPQSYDGGE---------LVVNDTFGQHRVKLPAGDL--VL 151 + T L+ +L+ + + GGE ++ + Q +P L V Sbjct: 218 KNDHEM---TQLTIIFYLN--EDFKGGETTFFPGGRRHLLEEATVQEVRIVPKIGLVSVF 272 Query: 152 YPSSSL---HCVTPVTRGVRVAS--------FMWIQS 177 + + L H +PV G + W++S Sbjct: 273 FQNGKLNHRHEGSPVIEGFKYIIRSDIAYIKSSWLES 309 >UniRef50_B5JX69 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JX69_9GAMM Length = 218 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 43/184 (23%), Positives = 66/184 (35%), Gaps = 21/184 (11%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLY-------AAL 56 I LS R R +++Q +W D +G ++ QQ + + A Sbjct: 28 VIDQFLSSDLCQRLRGEIQQLDWRDMPRAGVGRGEDYQH-QQSERSDHIRWLRGNSTAQC 86 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 Q + A+ + F YQ Y H+D G +S Sbjct: 87 QFLAQMEGLRAAINQQLFMGLFEYEAHFALYQPGAFYKKHLDA------FKGRRNRIVST 140 Query: 117 TLFLSDP-QSYDGGELVVNDTFG---QHRVKLP-AGDLVLYPSSSL-HCVTPVTRGVRVA 170 +L++ S D GELV+ +H+ LP AG LV++ S H V P R R + Sbjct: 141 VCYLNEHWHSDDAGELVIYRADQHALEHQRILPQAGRLVVFSSEDYPHEVLPSHR-ERYS 199 Query: 171 SFMW 174 W Sbjct: 200 IAGW 203 >UniRef50_A4S8T4 Predicted protein n=3 Tax=Mamiellales RepID=A4S8T4_OSTLU Length = 338 Score = 48.8 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 45/133 (33%), Gaps = 22/133 (16%) Query: 90 NETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQHRVKLPAGD 148 + +H D P N L+ L+L+ + DGGEL + G P + Sbjct: 145 GACFPWHYDNP--GAPSNRA----LTCILYLNPDWKPGDGGELRLQPFCGVAATIQPKHN 198 Query: 149 --LVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDD-------------KKRAMLFELDN 193 + + +LH VTP T R A +W+ + D R L ++ Sbjct: 199 RLAIFFSDRTLHRVTPSTAAKRYAVTVWLDADFVDPRTGASTNASELNLNAREALSDIAK 258 Query: 194 NIQSLKSRYGESE 206 + L + Sbjct: 259 TAKELARGNAQRA 271 >UniRef50_Q6PK18 PKHD domain-containing transmembrane protein C17orf101 n=12 Tax=Euteleostomi RepID=CQ101_HUMAN Length = 319 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 68/195 (34%), Gaps = 40/195 (20%) Query: 8 VLSPQDVARFREQLEQ-------------AEWVDGRVTTGAQGAQVKN-------NQQVD 47 V++ ++ R R E+ + G ++ G + N + Sbjct: 116 VITREEAERIRSVAEKGLSLGGSDGGASILDLHSGALSVGKHFVNLYRYFGDKIQNIFSE 175 Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLF--------NRYQNNETYGFHVDG 99 LY ++ +V + F +A L+ P F R ++E + HVD Sbjct: 176 EDFRLYREVRQKVQ--LTIAEAFGISASSLHLTKPTFFSRINSTEARTAHDEYWHAHVDK 233 Query: 100 AVRSHPQNGWMRTDLSATLFLSD-PQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS-- 156 D ++ L+LS+ + + GG + + V+ AG + + S S Sbjct: 234 VTYGSF-------DYTSLLYLSNYLEDFGGGRFMFMEEGANKTVEPRAGRVSFFTSGSEN 286 Query: 157 LHCVTPVTRGVRVAS 171 LH V V G R A Sbjct: 287 LHRVEKVHWGTRYAI 301 >UniRef50_Q486F0 Oxidoreductase, 2OG-Fe(II) oxygenase family n=8 Tax=Gammaproteobacteria RepID=Q486F0_COLP3 Length = 261 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 73/202 (36%), Gaps = 40/202 (19%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL- 61 + + VL+ + + EQ E++ + V++N D+ + + + ++ Sbjct: 59 FQLFNVLTKDECEKLISISEQLEFLP--DAAVSLPRSVRHN---DSLTWIVDEQTDGIIW 113 Query: 62 ---NAVN--QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH---------PQN 107 + + A+F + + F RY ++ + H DG+ Sbjct: 114 QRIAHLMDDRQAIFGGSKALGINARFRFYRYNPDDYFKPHSDGSWPGSRIINDELIANAY 173 Query: 108 GWMRTDLSATLFLSDPQSYDGGE--LVVNDTFGQHR-----------VKLPAGDLVLYPS 154 + ++ +FLS + + GGE +VN V+ PAG ++ +P Sbjct: 174 PDRYSQMTFLIFLS--EDFQGGETRFLVNADDPTKPATSNDNVKNVDVRTPAGGILCFPH 231 Query: 155 -----SSLHCVTPVTRGVRVAS 171 +H P+T GV+ Sbjct: 232 GMHPLHCIHSSVPITDGVKYII 253 >UniRef50_A6C2C6 Uncharacterized iron-regulated protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C2C6_9PLAN Length = 187 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 62/185 (33%), Gaps = 27/185 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEW---VDGRVTTGAQGAQVKNNQQVDTRST-----LY 53 + + LS + A E+LE + + G + V +Q++ Sbjct: 4 IIQVRNFLSATECAALIERLETQGFKEQLSGDRDRVVRARCVFTDQELADTYWQRLQQHV 63 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPL-FN------RYQNNETYGFHVDGAVRSHPQ 106 AL + + + P P N +Y E + H D A Sbjct: 64 PALTEVYTDGFTPYPHLNS---PLATFQPCGLNEVLRCYKYLPGEQFRRHEDFAY---EW 117 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRG 166 + RT + +L++ Y GGE TF ++V G V++P H V G Sbjct: 118 SETRRTFYTVLFYLNN--EYTGGE----TTFDHNQVVPETGLAVIFPHELYHSGNMVETG 171 Query: 167 VRVAS 171 ++ A Sbjct: 172 IKYAM 176 >UniRef50_D2VWJ6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VWJ6_NAEGR Length = 1113 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 51/178 (28%), Gaps = 25/178 (14%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR------STLYAALQNEV 60 ++ + +++ + G T V+N+ +++ + L + Sbjct: 98 PIVYKEQADEIIRIAKKSPYGKGEETIYDD--NVRNSYELEPNQFRITNTMWQKELNQLL 155 Query: 61 LNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + L L Y+ + +H D + TL Sbjct: 156 ETKIKSGLGIDKYKNVECKLYKLLL--YEKGGHFEYHKDS---------EKECKQTCTLV 204 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVASF 172 + P ++GGE + + + + + S H V P+T G R Sbjct: 205 IILPSIFEGGEFKIKHNDYEMEIATTNDYATDCHFISFYSDCDHAVMPLTSGYRTCLI 262 >UniRef50_UPI0000D55437 PREDICTED: similar to leprecan 1 n=1 Tax=Tribolium castaneum RepID=UPI0000D55437 Length = 694 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 44/129 (34%), Gaps = 29/129 (22%) Query: 66 QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM--------------R 111 + LFF L R D + H N + Sbjct: 477 KKPLFFTYTH-------LVCRSAQQHAPINRTDYSHMIHADNCNLISDTVCEKQPPAYTY 529 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKL----PAGDLVLYPS--SSLHCVTPVTR 165 D SA ++L+D ++GGE V +++ G +V + S +LH V V + Sbjct: 530 RDYSAIIYLND--DFEGGEFVFAGDTNGEKIQSVINPKCGRMVAFSSGPENLHGVRAVKK 587 Query: 166 GVRVASFMW 174 G R A +W Sbjct: 588 GSRCAIALW 596 >UniRef50_B2B745 Predicted CDS Pa_2_9860 n=1 Tax=Podospora anserina RepID=B2B745_PODAN Length = 323 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 47/104 (45%), Gaps = 13/104 (12%) Query: 80 STPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLFLS-------DPQSYD--GG 129 F RY+ + H D A S + ++T L+ ++L+ DP S + GG Sbjct: 205 ERLRFLRYEKGGFFQPHCDSAYYASMDKEQVVKTLLTVHIYLNDCKATAEDPDSTELVGG 264 Query: 130 ELVVNDTFGQHR--VKLPAGDLVLYP-SSSLHCVTPVTRGVRVA 170 + + + R V+ AG ++++ S+ LH V +GV+ + Sbjct: 265 ATTLFSSDEKRRYDVECKAGRVLVFQHSAVLHSGDEVKQGVKFS 308 >UniRef50_Q5GQC0 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQC0_BPSYP Length = 193 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 24/100 (24%), Positives = 45/100 (45%), Gaps = 18/100 (18%) Query: 79 LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG 138 LS ++ Y N+ Y +HVD + +S L+L+D +++GG N F Sbjct: 97 LSNFVYRYYNTNDHYKWHVDKTHHG------VELKVSFLLYLND--NFEGG----NTMFL 144 Query: 139 QHRVKL--PAGDLVLYPS--SSLHCVTPVTRGVRVASFMW 174 R+K G ++++P +H +P+ G + +W Sbjct: 145 SDRLKFTPKRGSVLMFPCGPYFIHKSSPIKSGEKH--IIW 182 >UniRef50_D2VTG5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VTG5_NAEGR Length = 943 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 76/204 (37%), Gaps = 24/204 (11%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-----TLYAALQNEVL 61 ++ + + + + GR+ V+ + Q+D + + +L +++ Sbjct: 103 PIIYEEQAKDLIKNCSMSPF--GRLDKTIYDESVRKSWQLDPQRFKITNPKWNSLIEDLV 160 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 N + L A S Y+ + FH D + +NG + ATL + Sbjct: 161 NNNVKKDLGIAQEKKIGFSLYKMLLYEEGGFFDFHRD----TEKENGMI-----ATLVVQ 211 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVASFMWIQ 176 P S+ GGE+VV ++ K V + H V V G R+A I Sbjct: 212 LPSSFTGGEIVVRHKEKENIYKTSEDATFNPYYVSFYCDVSHKVETVKSGYRLAL---IY 268 Query: 177 SMIRDDKKRAMLFELDNNIQSLKS 200 +++ + F+ D ++ +++ Sbjct: 269 NLVYSGADKIQAFDSDTQLKQIET 292 >UniRef50_B8C289 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C289_THAPS Length = 207 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 59/193 (30%), Gaps = 30/193 (15%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVD-------GRVTTGAQGAQVKNNQQVDTRSTLYAAL 56 I + + ++ + E + + G+ K + + L Sbjct: 4 VIHNLFTHEECTSLINRAEAKGFEEALVHGPFGQEVLRKDIRNCKRC--ILDDTELTNEW 61 Query: 57 QNEVLNAVNQHA---------LFFAAALPRTLSTPL-------FNRYQNNETYGFHVDGA 100 V+NA+ + + ++ + RY + +G H D Sbjct: 62 FTRVMNALEGSELKDKIADAHWVESNDIGKSTFRVVGLNERVRILRYDPGQYFGVHKDNR 121 Query: 101 VRSHPQNGWM---RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL 157 + G + L+ L+L+D GGE + + H V G ++++ Sbjct: 122 FIRGSEFGSREGEESHLTFLLYLNDK--MKGGETRIENGGRYHEVVPKVGSVLIFDHDIS 179 Query: 158 HCVTPVTRGVRVA 170 H V GV+ Sbjct: 180 HEAMRVVSGVKYC 192 >UniRef50_A0L8V5 2OG-Fe(II) oxygenase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L8V5_MAGSM Length = 211 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 56/153 (36%), Gaps = 32/153 (20%) Query: 26 WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFN 85 W+DG Q+ + L L+ + A+ + + F+ Sbjct: 78 WLDGTTAA-----------QLAYMAWL-ERLRLTLNAALQLGLFYVES---------QFS 116 Query: 86 RYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQHRVKL 144 YQ Y H D V G ++ +L+ Q++ GG L+++ + + + Sbjct: 117 CYQPGGYYRRHKDAFV------GEENRIVTVVTYLNPQWQAHHGGALLIHPSGHGLTLTV 170 Query: 145 PA--GDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 G +V + S + H V+P T+ R A W Sbjct: 171 QPKLGTMVCFLSEAWPHEVSP-TQCDRYAIASW 202 >UniRef50_B6BWB5 Putative uncharacterized protein n=1 Tax=beta proteobacterium KB13 RepID=B6BWB5_9PROT Length = 534 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 40/197 (20%), Positives = 67/197 (34%), Gaps = 34/197 (17%) Query: 1 MMYHIPGVLS--PQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQ-QVDTRS--TLYAA 55 M +P ++ + +EQAE D + + G Q N Q S TL + Sbjct: 335 MHAQVPELVDNNKALLKSLLNDIEQAEISDRKQSRLFNGTQSSGNLFQRSENSFRTLSES 394 Query: 56 LQNEVLNAVNQHAL---FFAAALPRTLS--TPLFNRYQNNETYGFHVDGAVRSHPQNGWM 110 L+ + + F P+ +S + + R + H+ ++GW Sbjct: 395 LKKLIQLYFQSNQEKSCTFIKMFPKEISFSSSWYVRMKKGGHLTSHI-------HEDGW- 446 Query: 111 RTDLSATLFLSDPQSYD------------GGELVVNDTFGQHRVKLPA-GDLVLYPSSSL 157 +S ++L PQ D G V +V LP GD++ +PSS Sbjct: 447 ---ISGAVYLKIPQHRDNADEGAIELSTHGDNYPVEHNDFPKKVILPKEGDVIFFPSSVF 503 Query: 158 HCVTPVTRGVRVASFMW 174 H P T + Sbjct: 504 HRTIPFTSDEERICIAF 520 >UniRef50_C1EJE5 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EJE5_9CHLO Length = 317 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 30/97 (30%), Positives = 40/97 (41%), Gaps = 15/97 (15%) Query: 91 ETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQ--SYDGGELVVNDTFGQ-HRVKLPAG 147 + HVD A D+SA L+LSD + GGEL D G V Sbjct: 223 GYWSPHVDKANVPEY-------DVSAVLYLSDGDGVDFAGGELHFMDAIGGWKTVTPRRN 275 Query: 148 DLVLYPS--SSLHCVTPVTRGVRVASFMWIQSMIRDD 182 LV++ S ++H V VT G R +W + RD Sbjct: 276 RLVVFSSGEENVHAVGVVTSGERTTLNLW---LTRDP 309 >UniRef50_Q5PP31 At1g68080 n=4 Tax=Magnoliophyta RepID=Q5PP31_ARATH Length = 389 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 40/90 (44%), Gaps = 11/90 (12%) Query: 89 NNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD-PQSYDGGELVVNDTFGQHRVKLP-A 146 + G+H D ++ + D +A +L+ + + GG G+ P A Sbjct: 92 KGASIGWHSDDN-----RSYLKQRDFAAVCYLNSYEKDFIGGLFRFQS--GEPVTVAPSA 144 Query: 147 GDLVLYPSS--SLHCVTPVTRGVRVASFMW 174 GD+++Y + ++H V VT G R+ +W Sbjct: 145 GDVIMYTADDRNIHSVDEVTDGERLTLALW 174 >UniRef50_A8IDI8 Prolyl 4-hydroxylase n=2 Tax=Chlamydomonas reinhardtii RepID=A8IDI8_CHLRE Length = 429 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 28/116 (24%), Positives = 42/116 (36%), Gaps = 27/116 (23%) Query: 87 YQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVV------------- 133 Y++ + Y H+D + + + + LSD + GGE V Sbjct: 308 YKHTQHYDSHMDSFDPKEYGQQYSQRIATVIVVLSD-EGLVGGETVFKREGKANIDKPIT 366 Query: 134 ----NDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVASFMWIQ 176 D G R K AGD VL+ S +LH PV G + + WI+ Sbjct: 367 NWTDCDADGGLRYKPRAGDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWIR 422 >UniRef50_B7G1Z5 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G1Z5_PHATR Length = 427 Score = 48.1 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 26/119 (21%), Positives = 44/119 (36%), Gaps = 16/119 (13%) Query: 75 LPRTLSTPLFNRYQNNETYGFHVD--GAVRSHPQNGWMRT-------DLSATLFLSDPQS 125 + R +S LF ++ + A ++P G R D TL + ++ Sbjct: 280 ILRPISRHLFESSESFGDLNWRQGYVAAYSANPTEGRPRAQLITHTDDSEVTLNIGLGEN 339 Query: 126 YDGGELVVNDTFGQHR-------VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 + GG + G ++ G +++ H VT VT G R A MW +S Sbjct: 340 FTGGAIEFRGLRGTPEAGKLIGTIQPRVGVALIHAGRHFHDVTTVTSGDRFALVMWARS 398 >UniRef50_Q338D2 Prolyl 4-hydroxylase, putative, expressed n=11 Tax=Embryophyta RepID=Q338D2_ORYSJ Length = 309 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 33/148 (22%), Positives = 50/148 (33%), Gaps = 28/148 (18%) Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 Q+EV+ + + + P + YQN E Y H D + Q Sbjct: 104 EKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQH-----------------RVKLPAGDLVLY---- 152 + ++LSD GGE + + VK GD +L+ Sbjct: 164 ATVLMYLSDVG--KGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLH 221 Query: 153 P-----SSSLHCVTPVTRGVRVASFMWI 175 P S SLH PV G + ++ WI Sbjct: 222 PDATTDSDSLHGSCPVIEGQKWSATKWI 249 >UniRef50_B8BWH5 Putative uncharacterized protein n=1 Tax=Thalassiosira pseudonana RepID=B8BWH5_THAPS Length = 373 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 16/110 (14%) Query: 80 STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ 139 + +Y+ E Y H D + + + R L+ L+L+D + +GGE Sbjct: 254 ESFQILQYKPGEYYKSHHDSSDANKDKVTGHRV-LTFFLYLNDVE--EGGETHFTKLNI- 309 Query: 140 HRVKLPAGDLVLYPS-----------SSLHCVTPVTRGVRVASFMWIQSM 178 VK G +++PS H V +G++ A+ WI Sbjct: 310 -SVKPKRGRALVWPSVLNEDPNSTDNRMYHEAKSVEKGIKYAANHWIHQY 358 >UniRef50_A4Y066 2OG-Fe(II) oxygenase n=16 Tax=Proteobacteria RepID=A4Y066_PSEMY Length = 245 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 47/183 (25%), Positives = 73/183 (39%), Gaps = 28/183 (15%) Query: 6 PGVLSPQDVARFREQLEQAEWVDGRVTTGA-----QGAQVKNNQQVDTRSTL----YAAL 56 P +L+ + A ++ E V G +G + + Q ++ + Y AL Sbjct: 68 PEILTRELAAECHKRARSGELSAAGVGRGGALQIQEGIRGDHIQWLEPGQSAACDRYLAL 127 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 +E+ A+N+ ALF L F Y Y HVD + +SA Sbjct: 128 LDELRQALNR-ALF----LGLEDFEGHFACYAPGAFYQRHVDRFRDDDRRT------VSA 176 Query: 117 TLFLSD---PQSYDGGELVVNDTFG-QHRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVAS 171 +L+D P+ GG L + G +H V AG L L+ S + H V P TR R++ Sbjct: 177 VFYLNDNWLPE--QGGALRLYLADGREHDVLPEAGTLALFMSGDMPHEVLPATR-ERLSL 233 Query: 172 FMW 174 W Sbjct: 234 TGW 236 >UniRef50_Q15SJ2 2OG-Fe(II) oxygenase n=6 Tax=Proteobacteria RepID=Q15SJ2_PSEA6 Length = 306 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 7/87 (8%) Query: 92 TYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQS-YDGGELVVNDTFGQHRVKL--PAGD 148 Y + D ++R H ++ + L+ P + G + D + L G Sbjct: 188 HYKPNTDTSIRPHTDASA----VTLNINLNLPDEVFTGSNVDFYDPTTGKMIGLAFKPGS 243 Query: 149 LVLYPSSSLHCVTPVTRGVRVASFMWI 175 +++ + +H P+T G R +W+ Sbjct: 244 AMIHRGNVVHAAQPITSGERTNFVLWL 270 >UniRef50_A0YIP7 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YIP7_9CYAN Length = 301 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 69/198 (34%), Gaps = 37/198 (18%) Query: 5 IPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 I LSP++ + + +++ TT A + + Y ++N++LN Sbjct: 111 IENFLSPEENQEILKIALSKSDQFIGSTTTTQAVNYRQSSILYATLFPEFYNLMRNKILN 170 Query: 63 AVNQHALFFAAALPRTLSTP--------LFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 A+ LP+ P + + Y H D + + RT Sbjct: 171 AL---PDI----LPQLNHQPFNVSQVEMQLTAHNDGCFYKIHNDSG----SEKTYTRTLT 219 Query: 115 SATLFLSDPQSYDGGELVVNDTF---------GQHRVKLPA-GDLVLYPSSSLHCVTPVT 164 F +P+ + GGEL + +T GQ++ P +VL+ S H V PV Sbjct: 220 YVYYFHQEPKQFSGGELRLYETELKNGSAISQGQYKTIEPRNNSIVLFDSRCKHEVMPVR 279 Query: 165 ------RGVRVASFMWIQ 176 R W++ Sbjct: 280 CPSQRFEDGRFTLNGWLR 297 >UniRef50_Q4QF16 Putative uncharacterized protein n=7 Tax=Trypanosomatidae RepID=Q4QF16_LEIMA Length = 399 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 65/217 (29%), Gaps = 53/217 (24%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWV----------DGRVTTGAQGAQVKNNQQVDTRST- 51 + L+ ++ + E+ + DG T + V+ ++ Sbjct: 127 IVLENFLTHEECDQLVAACEKVGYTFWLQKNHHDADGEATCDSGSKAVRVVDTIEANFPH 186 Query: 52 LYAALQNEVLNAVNQHALFFAAALPR-----------TLST------PLFNRYQNNETYG 94 L A L + V+ F+ +P T LF RY + Sbjct: 187 LSAKLYERIARVVSLKPKCFSEDMPNAEELFERELAGTWVPHALSENLLFGRYHPGGHFM 246 Query: 95 FHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGE-----------LVVNDTFGQHR-- 141 HVDGA T L ++L+D GGE + +++ Q+R Sbjct: 247 PHVDGATILDLNTRSFYTLL---IYLNDCLH--GGETFIFAGEQCNVMYLDEKENQYRGN 301 Query: 142 -------VKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 V G + + LH PV G + Sbjct: 302 ATQRVGAVYPKKGSAAFFYYNLLHEGAPVLEGHKYIC 338 >UniRef50_C7BVA4 2OG-Fe(II) oxygenase family like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVA4_9CAUD Length = 197 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 68/182 (37%), Gaps = 37/182 (20%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD---------GRVTTGAQGAQVKNNQQVDTRSTL 52 + HI S ++ ++LE +D G T G +K + + + Sbjct: 14 VIHIQNFYSSEEYKLIMKELEFLNGIDRFKNPEEPGGPGTAYVDGKPLKVGKGLHLNAVY 73 Query: 53 YAALQNEVLNAVNQ---------HALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS 103 Q+++L+ + H F T + + N + Y H D V + Sbjct: 74 DDPRQSDILHINRKLFDQNLMDYHPFFRYVWRSNKDET-KIHYFTNGDHYKSHTDDCVVT 132 Query: 104 HPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLP--AGDLVLYPSSSLHCVT 161 + T F +P+ + GG+L++ +++VKLP + ++PS H VT Sbjct: 133 -----------AITWFYKEPKIFTGGDLIL-----ENKVKLPCLNNSIAIFPSILYHEVT 176 Query: 162 PV 163 V Sbjct: 177 EV 178 >UniRef50_Q8T294 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q8T294_DICDI Length = 551 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 22/99 (22%) Query: 87 YQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVV--NDTFGQH--RV 142 Y + H+D + S G L +Y+GGE ++ NDTF Q Sbjct: 254 YNEGGHFQPHID-TIHSKNHIGTYIVPLGV-------DTYEGGEFIISENDTFDQTTINY 305 Query: 143 KLPAGD----------LVLYPSSSLHCVTPVTRGVRVAS 171 K+ A D + + + +H VTPVT+GVR+ Sbjct: 306 KIEANDNKLQDSIDFKWIAFYNDCIHTVTPVTKGVRIVL 344 >UniRef50_A3TG91 Putative uncharacterized protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TG91_9MICO Length = 711 Score = 47.3 bits (111), Expect = 3e-04, Method: Composition-based stats. Identities = 34/135 (25%), Positives = 51/135 (37%), Gaps = 18/135 (13%) Query: 44 QQVDTR-STLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFN-RYQNNETYGFHVDGAV 101 QVD L A L V + A A+PR + F+ R + FH D + Sbjct: 577 YQVDFGLDELEAVLGESVTRRLQGAA--PDHAVPRGVFVRHFSERTRP--FIPFHPDDSH 632 Query: 102 RSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVT 161 + + L DP GGELV+ G V+ G + +P + +H V Sbjct: 633 W------------TVNVPLEDPDQTSGGELVMLLDGGLRVVERRRGWAISHPGALIHGVR 680 Query: 162 PVTRGVRVASFMWIQ 176 VT G R + + + Sbjct: 681 RVTHGDRWSLIAFYE 695 >UniRef50_A8L657 2OG-Fe(II) oxygenase n=1 Tax=Frankia sp. EAN1pec RepID=A8L657_FRASN Length = 777 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 49/130 (37%), Gaps = 17/130 (13%) Query: 46 VDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHP 105 + + AL + A ++ L AA L L + L Y+ + + H D Sbjct: 85 LVHATWDDDALNVILTTAKDELGLPIAAELTADLHSLLV--YEPGQFFLAHQDS------ 136 Query: 106 QNGWMRTDLSA--TLFLSDPQSYDGGELVVNDTFGQHRVKLPAG--DLVLYPSSSLHCVT 161 D S TL ++ P +Y GGELVV Q + LV + + H V Sbjct: 137 -----EKDDSMIGTLVVTMPSTYTGGELVVRHNGEQRACRGSKTELSLVAFYADCRHEVA 191 Query: 162 PVTRGVRVAS 171 V G R+A Sbjct: 192 EVRSGYRIAL 201 >UniRef50_A8I8D2 Predicted protein n=3 Tax=Chlamydomonas reinhardtii RepID=A8I8D2_CHLRE Length = 336 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 32/123 (26%), Positives = 47/123 (38%), Gaps = 32/123 (26%) Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVN------D 135 RY + +TYG H D S R + ++LSD + +GGE D Sbjct: 105 IQILRYAHGQTYGAHYDSGASSDHVGPKWRL-ATFLMYLSDVE--EGGETAFPHNSVWAD 161 Query: 136 TFGQHRV--------------KLPAGDLVLYPSS---------SLHCVTPVTRGVRVASF 172 +V K AGD VL+ S S+H PV +GV+ A+ Sbjct: 162 PSIPEQVGDKFSDCAKGHVAAKPKAGDAVLFYSFYPNNTMDPASMHTGCPVIKGVKWAAP 221 Query: 173 MWI 175 +W+ Sbjct: 222 VWM 224 >UniRef50_Q2TWV5 Predicted protein n=2 Tax=Aspergillus RepID=Q2TWV5_ASPOR Length = 481 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 66/191 (34%), Gaps = 31/191 (16%) Query: 7 GVLSPQDVARFREQLEQAEWV-DGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVN 65 VLSP + E ++ D + + + +N +T + L + + V Sbjct: 283 NVLSPAECKAIIAAGESVNFLPDAPLREDGDMSILAHNFYWVVDTTFHDMLWARISSYV- 341 Query: 66 QHALFFAAALPRTLST-PLFNRYQNNETYGFHVDGA-------------VRSHPQNGWMR 111 L R ++ RY Y H+DGA + P++ Sbjct: 342 --PQSINGRLARGINRRFRVYRYVPGAEYRCHIDGAWPPSGILPDDTYVYDASPEDKRQS 399 Query: 112 TDLSATLFLSDPQSYDGGELVV------NDTFGQHRVKLPAGDLVLYP-----SSSLHCV 160 + + L+L+D ++GGE T + V+ G + ++P + LH Sbjct: 400 SMYTFLLYLND--EFEGGETTFFMPAAREGTLNAYPVRPVMGAVAIFPHGEANGALLHEG 457 Query: 161 TPVTRGVRVAS 171 T V +G + Sbjct: 458 TGVRKGAKYII 468 >UniRef50_A4RSI6 Predicted protein (Fragment) n=5 Tax=Viridiplantae RepID=A4RSI6_OSTLU Length = 255 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 43/118 (36%), Gaps = 28/118 (23%) Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDT----- 136 RY+N + Y H D H + L+LSD + +GGE V +T Sbjct: 81 LQVLRYENGQEYKAHFD--YFFHKGGKRNNRIATVLLYLSDVE--EGGETVFPNTDVPTD 136 Query: 137 ----------FGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVASFMWI 175 G VK GD +L+ S S H PV +GV+ + W+ Sbjct: 137 RDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDPGSSHAGCPVIKGVKWTATKWM 194 >UniRef50_B5Y4Z8 Predicted protein n=3 Tax=Bacillariophyta RepID=B5Y4Z8_PHATR Length = 508 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 65/215 (30%), Gaps = 44/215 (20%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN------------QQVDTR 49 ++ + LS +V + + + G N+ Q Sbjct: 290 VFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDWIPRHQDLIT 349 Query: 50 STLYAALQNEVL--NAVNQH------ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAV 101 T+Y + + A+ + F + + + YQ + Y H D + Sbjct: 350 DTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISIS-ERLQLVNYQVGQQYTPHHDFTM 408 Query: 102 RSHPQNGWMRTDLSATLFLSDPQSYDGGE------LVVNDTFGQHRVKLPAGDLVL---- 151 R ATL DGGE L ++ G +VK G +L Sbjct: 409 PGLVNMQPSRF---ATLLFYLNDDMDGGETAFPRWLHADEEGGSLKVKPEKGKAILFYNL 465 Query: 152 -----YPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 Y S H PV RG + W+ +++R Sbjct: 466 LPDGNYDERSEHAALPVRRGEK-----WLTNLVRA 495 >UniRef50_B8CBF7 Putative uncharacterized protein n=1 Tax=Thalassiosira pseudonana RepID=B8CBF7_THAPS Length = 248 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 26/105 (24%), Positives = 37/105 (35%), Gaps = 12/105 (11%) Query: 80 STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ 139 Y E Y H D ++ L+L+D + +GGE +G Sbjct: 141 EPLQMVHYDPGEEYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVE--EGGETSFP-RWGG 197 Query: 140 HRVKLPAGDLVLY-------PSSSL--HCVTPVTRGVRVASFMWI 175 VK G VL+ S L H PV +G + S +WI Sbjct: 198 LDVKPVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWI 242 >UniRef50_Q01F56 SmkH (IC) n=2 Tax=Ostreococcus tauri RepID=Q01F56_OSTTA Length = 637 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 33/110 (30%), Positives = 47/110 (42%), Gaps = 18/110 (16%) Query: 79 LSTPLFNRYQNNETYGF---HVDGAVRSHPQNGWMR-----TD---LSATLFLSDPQSYD 127 +S + +R++ ++G H D V + N R TD S TL L D Q Y Sbjct: 530 ISPFIRDRFRLPTSFGTLYVH-DAFVVKYNANEGQRELPVHTDQGQFSLTLALHDTQDYS 588 Query: 128 GGELVVNDTFGQHR--VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 GG F +H V+ GD V + SS H P+T GVR ++ Sbjct: 589 GGG----TIFPEHECIVRPRCGDFVAFRSSLTHGGVPITAGVRYIVVAFL 634 >UniRef50_D0N7E1 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7E1_PHYIN Length = 803 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 61/170 (35%), Gaps = 34/170 (20%) Query: 39 QVKNNQQVDTR-----STLYA----ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQN 89 V+ + Q++ + L+ L + + + P Y+ Sbjct: 112 NVRKSWQLEPSQVEFKNPLWESGLHQLTRTITERLGYSGV------PLQCVLYKLLVYEE 165 Query: 90 NETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG---QHRVKLPA 146 + H D + ++G + ATL + P ++GG+LV+ +H Sbjct: 166 GGHFFKHQD----TEKEDGMI-----ATLVVQLPSLHEGGDLVIYSNGEVKHRHDFGKAD 216 Query: 147 GDLVLYPSSSL------HCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFE 190 G + P ++ H + VT+G R+A I + +D + +F+ Sbjct: 217 GTVAFLPHYAVHYADAEHALETVTKGFRLALVYSI-CLPKDRCQLKRVFD 265 >UniRef50_Q8IVL5 Prolyl 3-hydroxylase 2 n=29 Tax=Euteleostomi RepID=P3H2_HUMAN Length = 708 Score = 46.9 bits (110), Expect = 4e-04, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 14/91 (15%) Query: 96 HVDGAVRSHPQNGWMR-------TDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLP 145 H D + N + D SA L+++D ++GGE + + + +K Sbjct: 580 HADNCLLDPEANECWKEPPAYTFRDYSALLYMND--DFEGGEFIFTEMDAKTVTASIKPK 637 Query: 146 AGDLVLYPS--SSLHCVTPVTRGVRVASFMW 174 G ++ + S + H V VT+G R A +W Sbjct: 638 CGRMISFSSGGENPHGVKAVTKGKRCAVALW 668 >UniRef50_UPI0001925DF6 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001925DF6 Length = 799 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 35/145 (24%), Positives = 59/145 (40%), Gaps = 22/145 (15%) Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHV---DGAVR 102 + LY + + + F T L RY ++ E + H D + Sbjct: 483 EEAELYINVSERL--RLLTQEYFDMKVRLNFAFTHLVCRYALEDGEEHISHPIHSDNCIL 540 Query: 103 SHPQNG--------WMRTDLSATLFLSDPQSYDGGELVVNDTFG--QHRVKLPAGDLVLY 152 + NG + D SA L+L+D ++GGE + ++ Q +V+ G +V + Sbjct: 541 NGDGNGTCPKRSPAFTWRDYSALLYLND--DFEGGEFIFANSTDKIQAQVRPKCGRVVAF 598 Query: 153 PS---SSLHCVTPVTRGVRVASFMW 174 S +LH V V +GVR A +W Sbjct: 599 RSKGLENLHGVLGVKKGVRCALPIW 623 >UniRef50_Q2N914 Putative uncharacterized protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2N914_ERYLH Length = 203 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 41/98 (41%), Gaps = 10/98 (10%) Query: 88 QNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQS-YDGGELVVNDTFG-QHRVKLP 145 ++ + + H+D + +S ++ P + ++GGELVV FG V++ Sbjct: 101 RDGDFFSEHIDTLTAENRDTLPGDRLISMVYYMHLPGARFEGGELVVRHLFGRTEPVRIA 160 Query: 146 AGD--LVLYPSSSLHCVTPVT------RGVRVASFMWI 175 LV +PS + H V PV R + W+ Sbjct: 161 PRHNRLVAFPSIAPHAVEPVRVPGNAWEDARFSVNCWL 198 >UniRef50_C1MIQ0 Predicted protein n=2 Tax=Micromonas RepID=C1MIQ0_9CHLO Length = 365 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 33/90 (36%), Gaps = 12/90 (13%) Query: 81 TPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH 140 F Y +H D A+ + PQ + T L + + D G Sbjct: 261 PVEFRVYPPGSAMDWHQDVALYTEPQYELVFT-------LDNTSD---SQTQWQDGEGAR 310 Query: 141 R--VKLPAGDLVLYPSSSLHCVTPVTRGVR 168 R P +V+ S +H VTP+T+G R Sbjct: 311 RGGWTEPNSVIVVRAESVIHRVTPITKGER 340 >UniRef50_A4RVD9 Predicted protein n=2 Tax=Ostreococcus RepID=A4RVD9_OSTLU Length = 330 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 41/131 (31%), Gaps = 40/131 (30%) Query: 83 LFNRYQNNETYGFHVD---GAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVV------ 133 RY + Y H D V P+ G R + ++L D GGE Sbjct: 129 QILRYDVGQKYDPHFDYFHDKVNPAPKRGGQRL-ATMLIYLVDTD--KGGETTFPNAKLP 185 Query: 134 --------NDTFGQH-----------RVKLPAGDLVLYPS---------SSLHCVTPVTR 165 + F H VK GD +L+ S SLH PV Sbjct: 186 QSFEADEPENPFASHIEHTDCAKKGIPVKSVRGDAILFFSMTQDGVLDRGSLHGACPVIE 245 Query: 166 GVRVASFMWIQ 176 G + + WI+ Sbjct: 246 GQKWTAVKWIR 256 >UniRef50_B9IJQ5 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=25 Tax=Viridiplantae RepID=B9IJQ5_POPTR Length = 308 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 44/121 (36%), Gaps = 27/121 (22%) Query: 80 STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG- 138 + Y++ + Y H D Q ++ ++LS+ GGE V ++ G Sbjct: 131 ESIQILHYEHGQKYEPHFDYFHDKANQELGGHRVVTVLMYLSNVG--KGGETVFPNSEGK 188 Query: 139 ---------------QHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVASFMW 174 + VK GD +L+ S +SLH PV G + ++ W Sbjct: 189 TIQPKDDSWSDCAKNGYAVKPQKGDALLFFSLHPDATTDTNSLHGSCPVIEGEKWSATKW 248 Query: 175 I 175 I Sbjct: 249 I 249 >UniRef50_C4Y2U7 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y2U7_CLAL4 Length = 275 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 50/145 (34%), Gaps = 25/145 (17%) Query: 48 TRSTLYAALQNEVLNAVNQHALFF---------AAALPRTLSTPL-FNRYQNNETYGFHV 97 +L+ L+ +L Q F A ++L+ L RY +G H Sbjct: 129 AADSLWQYLREILL----QKPQFEDEDLEEIRHIFADAKSLNPQLRVYRYTKGHHFGKHY 184 Query: 98 DGAV---RSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKL-----PAGDL 149 D +V +H +T + ++L+ ++GG + + R K+ G Sbjct: 185 DESVTCPMAHDPKAQGKTKWTLLIYLTGGADFEGGGTIFYPETSRERNKVINVHADKGMA 244 Query: 150 VLYPSS---SLHCVTPVTRGVRVAS 171 +L+ H V GV+ Sbjct: 245 LLHKHGDDCLKHEAELVKSGVKWVL 269 >UniRef50_Q5UP57 Putative prolyl 4-hydroxylase n=1 Tax=Acanthamoeba polyphaga mimivirus RepID=P4H_MIMIV Length = 242 Score = 46.9 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 65/189 (34%), Gaps = 31/189 (16%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQ--VDTRSTLYAALQNEV 60 + + +++P + + D +V +G ++N+QQ + + + + + Sbjct: 61 FVLNNLINPTKCQEIMQFANGKLF-DSQVLSGTD-KNIRNSQQMWISKNNPMVKPIFENI 118 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLS 115 + F A RY N+ Y H D S Q + L+ Sbjct: 119 CR--QFNVPFDNA------EDLQVVRYLPNQYYNEHHDSCCDSSKQCSEFIERGGQRILT 170 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDL-VLYP---------SSSLHCVTPVTR 165 ++L++ + G + + K GD V YP SLH PVT Sbjct: 171 VLIYLNN--EFSDGHTYFPNLN--QKFKPKTGDALVFYPLANNSNKCHPYSLHAGMPVTS 226 Query: 166 GVRVASFMW 174 G + + +W Sbjct: 227 GEKWIANLW 235 >UniRef50_Q0EXH1 2OG-Fe(II) oxygenase n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EXH1_9PROT Length = 214 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 15/99 (15%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQHRV 142 F Y +Y H D + G + ++ L+L+D Q+ DGG L ++ + Sbjct: 118 FALYPPGASYDIHYDRFI------GALERVVTCILYLNDNWQAEDGGALNIHQGHDASTL 171 Query: 143 KLP------AGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 LP G LV + S H V P TR R++ W Sbjct: 172 PLPMQILPQGGRLVTFISEQFPHEVLPATR-DRLSLTGW 209 >UniRef50_A8IV51 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IV51_CHLRE Length = 273 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 72/207 (34%), Gaps = 43/207 (20%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR--STLYAALQNE 59 +Y G L+P++ R + E+ G V TG+ G+ V + + D A+ Sbjct: 45 IYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDGMFFERGEDAIIEA 104 Query: 60 VLNAVNQHALFFAAALPRTLSTP------LFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 V + T++ RY+ ++ Y H D + Sbjct: 105 VEQRLADW----------TMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQH--------------RVKLPAGDLVLYPS----- 154 + L+L++ + +GGE V + VK GD +L+ S Sbjct: 155 ATVLLYLTETE--EGGETVFPKIPAPNGINVGFSECAKYNLAVKPHKGDALLFHSMKPTG 212 Query: 155 ----SSLHCVTPVTRGVRVASFMWIQS 177 S+H PV RG + + WI + Sbjct: 213 ELEERSMHGACPVIRGEKFSMTKWIHA 239 >UniRef50_Q5GQB2 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQB2_BPSYP Length = 238 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 87 YQNNETY-GFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLP 145 + Y +H + + +H Q +L+ ++L+D +GGE + R+ Sbjct: 148 TEPGGGYHAWHYENSASTHAQ-----RELTWMIYLNDVPPENGGETEF--LYQHKRISPT 200 Query: 146 AGDLVLYPS--SSLHCVTPVTRGVRVASFMW 174 G +V++P+ + +H V +G + W Sbjct: 201 KGTVVVFPAGMTHVHRGNTVLKGNKYIVTGW 231 >UniRef50_B4X4U7 Oxidoreductase, 2OG-Fe(II) oxygenase family n=2 Tax=Alcanivorax RepID=B4X4U7_9GAMM Length = 221 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 41/97 (42%), Gaps = 13/97 (13%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYD-GGELVV----NDTFG 138 F Y + Y HVD + NG + LS L+L+D + GG L + + T Sbjct: 117 FAIYGPGDFYQRHVDA---FNGNNGRL---LSVVLYLNDNWQSEWGGRLRIWPEADATRV 170 Query: 139 QHRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 V+ AG LV + S + H V TR R + W Sbjct: 171 ATEVEPRAGTLVAFLSEKIPHEVLAATR-ERYSIAGW 206 >UniRef50_B5YLH9 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YLH9_THAPS Length = 451 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 66/200 (33%), Gaps = 37/200 (18%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWV----------DGRVTTGAQGAQVKNNQQ-VDT-- 48 + + +SP++ R E ++ DG T + +N +D Sbjct: 249 LVTLEDFISPEEAERLIELGHVEQYKRSTDVGHLKADGSYTEDVHSTRTSSNSWCLDKCM 308 Query: 49 RSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG 108 + + + + + + + +L RY+ + YG H D + Sbjct: 309 KDPVAKDVVDRI-EHMTMIPQTNSESL-------QLLRYEEGQYYGVHHDLIEHQKDRPP 360 Query: 109 WMRTDLSATLFLSDPQS--YDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------S 155 +R L+ ++L+ + +GG V G ++ S Sbjct: 361 GVRI-LTFYMYLNGNEDSGLEGGGTKFPRIGAT--VTPKRGRAAMWSSVLDENPHKKDPR 417 Query: 156 SLHCVTPVTRGVRVASFMWI 175 + H PVT+GV+ + WI Sbjct: 418 TDHTALPVTKGVKYGANAWI 437 >UniRef50_Q4JN23 Putative uncharacterized protein n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN23_9BACT Length = 199 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 24/96 (25%), Positives = 37/96 (38%), Gaps = 14/96 (14%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVK 143 R E + +HVD + A +L+D + GGE F +VK Sbjct: 111 IQRTDVGEYFHWHVDSGSHQMSDRQLV-----AIWYLNDVEG-PGGE----TEFLHQKVK 160 Query: 144 LPA--GDLVLYPSSSLHCVTPVT--RGVRVASFMWI 175 + G LV++P H VT +G + + WI Sbjct: 161 VKPEEGKLVVFPPFWTHEHRGVTLKKGSKYIATTWI 196 >UniRef50_Q54CK1 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54CK1_DICDI Length = 218 Score = 46.5 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 67/182 (36%), Gaps = 28/182 (15%) Query: 3 YHIPGVLSPQDV----ARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 I +LS ++ L+ E+V G ++ N + D + +Y ++N Sbjct: 36 LLIKNLLSHEECSLIQNSIFNSLDNNEFV-------KHGLRIHKNSE-DFANIIYGRIEN 87 Query: 59 EVLNAVNQ-HALFFAAALP---RTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 + + + + L ++S F RY E + H DG V++ + + Sbjct: 88 CCVKKLKRKNPSNSNEQLEWNIDSVSPKFRFIRYNEGELFPNHTDGEVKNENKMSFF--- 144 Query: 114 LSATLFLSDPQ-SYDGGELVVNDTF------GQHRVKLPAGDLVLYPSSSLHCVTPVTRG 166 S +F +D + GGE RV+ G +++ + +H +T G Sbjct: 145 -SILIFTNDCGVDFKGGEFRFFKKDNNLQLEEITRVEPKRGMALIFDHTIIHDSNIITFG 203 Query: 167 VR 168 + Sbjct: 204 QK 205 >UniRef50_Q21FK1 2OG-Fe(II) oxygenase n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21FK1_SACD2 Length = 478 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 46/109 (42%), Gaps = 16/109 (14%) Query: 87 YQNNETYGFHVDG-AVRSHPQNGWM--RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVK 143 Y+ + + H D ++ P++G + + + ++L+D + +GGE G +K Sbjct: 183 YEVGQEFKAHTDYFEIKEMPEHGAVMGQRTYTVMIYLNDVE--EGGETDFPAADGA--IK 238 Query: 144 LPAGDLVLYPS---------SSLHCVTPVTRGVRVASFMWIQSMIRDDK 183 AG +++ S S+H PV +G + W +S R Sbjct: 239 PRAGLALIWNSLQSNGAPNPHSMHQAYPVLKGHKAVITKWFRSQSRLPN 287 >UniRef50_B8CBV6 Predicted protein (Fragment) n=2 Tax=Thalassiosira pseudonana RepID=B8CBV6_THAPS Length = 196 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 62/188 (32%), Gaps = 29/188 (15%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQ-----VDTRSTLYAALQNE 59 + VLS ++ A E+ E DG A+ K Q+ V L Sbjct: 5 LHNVLSLEECADIIEKSEA----DGYEQATIYDARTKRVQRNCTRCVTDDQVLAENWFER 60 Query: 60 VLNAVNQHA---LFFAAALPRTLS------------TPLFNRYQNNETYGFHVDGAVRSH 104 +L+A+N A T +YQ N+ + H D + Sbjct: 61 ILHALNGTPYEQKVKNAPWMGTRHDAKPLHATSLNERLRILKYQQNQFFSSHHDASFIRD 120 Query: 105 PQNGWM---RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVT 161 G ++ +S ++L+D + GG + V G ++L+ + LH Sbjct: 121 ADEGGRTGEKSYVSVQIYLNDK--FKGGTTRFHGGGRFLDVIPKTGSILLFDHNILHEGV 178 Query: 162 PVTRGVRV 169 V G + Sbjct: 179 AVKSGKKY 186 >UniRef50_D2VKT5 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VKT5_NAEGR Length = 254 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + I S + + +E T G + N+ +S + L + Sbjct: 79 FLIENCFSSDECQLMIKAME---------TVGLDFK-LSGNRHCFRKSIMDEKLSEILFE 128 Query: 63 A----VNQHALFFAAALPRTLSTPLFNRYQN------NETYGFHVDGAVRSHPQNGWMRT 112 + Q P P+ RY +G HVD A S+ + Sbjct: 129 RSRDFLPQSYKVSGIDRPLQGLNPMI-RYIKYLHKKFGHKFGPHVDAANHSNNCKSFF-- 185 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS---LHC 159 + ++L+D ++GGE V + Q R+K G + ++ LH Sbjct: 186 --TFMVYLND--DFEGGETVFLEKGFQARIKPKTGTVCVFEQDVRQLLHE 231 >UniRef50_UPI0000E46D75 PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46D75 Length = 303 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 38/163 (23%), Positives = 58/163 (35%), Gaps = 22/163 (13%) Query: 25 EWVDGRVTTG------AQGAQVKN--NQQVDTRSTLYAALQNEVLNAVNQHALFFAAALP 76 + G ++ G Q + KN N LY + +++ A+ A F +P Sbjct: 128 DLHSGALSKGDSFINIYQYIEQKNLGNVFSSEDIRLYRKVADKIKVAI--SARFQ---IP 182 Query: 77 RT----LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT-DLSATLFLSDPQ-SYDGGE 130 F+R + + H D H T D ++ L+L+D + GG Sbjct: 183 GQKIYLTHPTFFSRMNSKKAKTLH-DEYWHPHVDKKTYETFDYTSLLYLTDYDVDFKGGR 241 Query: 131 LVVNDTFGQHRVKLPAGDLVLYPSSS--LHCVTPVTRGVRVAS 171 V D V+ G L + S S H V VT G R A Sbjct: 242 FVFIDDKANSTVEPKLGRLSFFTSGSENTHFVEKVTSGTRYAI 284 >UniRef50_Q0APS3 2OG-Fe(II) oxygenase n=1 Tax=Maricaulis maris MCS10 RepID=Q0APS3_MARMM Length = 230 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 40/169 (23%), Positives = 59/169 (34%), Gaps = 34/169 (20%) Query: 10 SPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHAL 69 + +AR R ++ +W+DG T G L+ EV + L Sbjct: 71 DHELIARVRR--DKTKWLDGS-TPGQTAWL-----------AFAETLRREVNARLML-GL 115 Query: 70 FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDG 128 F A F Y+ Y H+D G LS L+L+ + DG Sbjct: 116 FAFEAH--------FAVYEAGAFYKRHLDS------FRGARNRVLSTVLYLNPHWREGDG 161 Query: 129 GELVVNDTFGQHRVKLPA--GDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 G L + +L G LVL+ S + H VT V+ R + W Sbjct: 162 GHLRIYGEDDDVITELRPEFGTLVLFLSEEIPHEVT-VSHRERFSIAGW 209 >UniRef50_Q8T5S8 Prolyl-4-hydroxylase-alpha PV n=14 Tax=Drosophila RepID=Q8T5S8_DROME Length = 525 Score = 46.5 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 62/214 (28%), Gaps = 54/214 (25%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M VLSP+++ + G T G + A V Q + V Sbjct: 331 MVLYHDVLSPKEIKELQ----------GMATPGLKRATV---YQASSGR------NEVVK 371 Query: 62 NAVNQHALFFAAALPRTL-----------------STPLFNRYQNNETYGFHVDGAVRSH 104 ++ A F P T+ Y Y H D +++ Sbjct: 372 TRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTN 431 Query: 105 PQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSS 155 M D AT+ GG V + V G +V++ + Sbjct: 432 SNMTAMSGDRIATVLFYLTDVEQGGATVF--PNIRKAVFPQRGSVVMWYNLKDNGQIDTQ 489 Query: 156 SLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLF 189 +LH PV G + WI+ +R +F Sbjct: 490 TLHAACPVIVGSKWVCNKWIR-------EREQIF 516 >UniRef50_A6FBT0 Putative prolyl 4-hydroxylase, alpha subunit domain n=3 Tax=Alteromonadales RepID=A6FBT0_9GAMM Length = 215 Score = 46.5 bits (109), Expect = 8e-04, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 40/103 (38%), Gaps = 16/103 (15%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH--- 140 + +Y + Y H+D G S +L+ P + GGEL++ + Sbjct: 115 YAKYAVGDFYKKHLDA------FKGKSNRVFSTVCYLNTPDA--GGELLIYAQDSDNVIA 166 Query: 141 RVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFMWIQ---SMI 179 RV AG LV++ S H V R + W + SM+ Sbjct: 167 RVAPKAGTLVVFESERFPHEVLAAES-ERYSIAGWFRMNNSMV 208 >UniRef50_D2VER2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VER2_NAEGR Length = 1108 Score = 46.1 bits (108), Expect = 8e-04, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 60/177 (33%), Gaps = 24/177 (13%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQN-EV 60 +++ Q + +A + G T +V+ Q++ + + + + V Sbjct: 214 PLVTKQQADDIIQLAAKAPYGRGEET--IVDEKVRKTWQLEPNQFAILNPNWEDMIDELV 271 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 N + + + L L Y+ + + FH D ++ ATL + Sbjct: 272 GNQIKKGLGVGDKEIGFNLYKLLL--YEEDGHFQFHRDS---------EKEENMFATLVV 320 Query: 121 SDPQSYDGGELVVNDTFGQ--HRVKLPAGDLVLYPSSSL---HCVTPVTRGVRVASF 172 P Y GGEL V + + + + S H V VT G R+A Sbjct: 321 HLPSIYTGGELTVKHNSKEVVYDYSSKSSYATSFVSFYCDCEHKVNRVTSGYRLALV 377 >UniRef50_Q58LI8 Possible dioxygenase n=1 Tax=Prochlorococcus phage P-SSM4 RepID=Q58LI8_BPPRS Length = 196 Score = 46.1 bits (108), Expect = 8e-04, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 38/94 (40%), Gaps = 12/94 (12%) Query: 86 RYQNNETY-GFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKL 144 + + + Y +H + G R + ++L+D ++GGE + R K Sbjct: 108 KTEPGQGYHAWHSENGSL-----GTNRRICATMMYLND--DFEGGETEF--LYQHKRFKP 158 Query: 145 PAGDLVLYPS--SSLHCVTPVTRGVRVASFMWIQ 176 G ++++P+ + H P G + S W + Sbjct: 159 KRGQVLIWPAGFTHTHRGLPPLDGAKYISTSWTE 192 >UniRef50_Q5CU77 Prolyl 4-hydroxylase alpha subunit n=2 Tax=Cryptosporidium RepID=Q5CU77_CRYPV Length = 717 Score = 46.1 bits (108), Expect = 8e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 74/197 (37%), Gaps = 53/197 (26%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQ------VK---------NNQQV 46 ++ IP L+ + ++ D +++ + ++ +K N+Q Sbjct: 459 IFIIPNALNQDECDEIISLVQD-RLEDSKISIAKKESKKSETDEIKDEKTDDQDYKNEQD 517 Query: 47 DTRS----------TLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFH 96 + L ++ N + L ++ L + L ++Y N+ H Sbjct: 518 EDEDFCRSSTACIQPEETPLIRQIENRLG--ILVDSSNL--YMEPILVHKYSVNDYIKEH 573 Query: 97 VDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-- 154 DG R+H + ++FLSD + +GGEL + ++K G V++P+ Sbjct: 574 HDGDNRTH----------TISVFLSDVE--NGGELDF--PYAGIKIKPKKGFAVVWPNID 619 Query: 155 -------SSLHCVTPVT 164 +++H V +T Sbjct: 620 SQGKLDYTTVHAVNKIT 636 >UniRef50_D2VW34 Oxidoreductase n=3 Tax=Naegleria gruberi RepID=D2VW34_NAEGR Length = 222 Score = 46.1 bits (108), Expect = 8e-04, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 74/181 (40%), Gaps = 26/181 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG----AQVKNNQQV-----DTRSTL 52 ++ I + S ++ ++ ++ E+ + + ++TG V+NN + L Sbjct: 21 IWLIKNLFSTEECSKLLKESEEIGYGEAPISTGPTSSTMMKDVRNNSRAMIDKKQYSDML 80 Query: 53 YAALQNEV---LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 Y L+ + ++++ L F +Y E + H DG + N Sbjct: 81 YKKLEKYLPQNVSSLKVGPQ-DGFKLCGLNERIRFYKYAAGEYFAPHYDGCFQRPTLNVE 139 Query: 110 M---------RTDLSATLFLSDPQSYDGGELVVNDTFGQ--HRVKLPAGDLVLYPSSSLH 158 + R+ ++ L+L+D +S GGE ++ + H VK AG ++++ S+ H Sbjct: 140 INGKKMKVVERSFITVLLYLNDVES--GGETNFLNSRCEITHSVKPQAGQVLMFVHSNYH 197 Query: 159 C 159 Sbjct: 198 E 198 >UniRef50_D2VKE9 Type IIB DNA topoisomerase n=1 Tax=Naegleria gruberi RepID=D2VKE9_NAEGR Length = 1251 Score = 46.1 bits (108), Expect = 9e-04, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 55/174 (31%), Gaps = 21/174 (12%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-----TLYAALQNEVLNA 63 LS + + +A + G T V+ Q+D + L + +++ Sbjct: 426 LSNGSASELIQFCSKAPYGRGEDT--IYDENVRKTWQLDPSRFEITNPTWDDLIDGLVDN 483 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 + L + L + + Y+ + FH D R + + L P Sbjct: 484 EIRDGLGISKHLRLSANLYKLLVYEEGGHFQFHKDS-------EKEERMFGTLVVQL--P 534 Query: 124 QSYDGGELVVNDTFGQHRVKLPA-----GDLVLYPSSSLHCVTPVTRGVRVASF 172 Y GGE++V + + + + + H + V G RV Sbjct: 535 SEYSGGEIIVRHGEEEEEYDFASVSRYTPHFISFYADCEHMIKNVNSGYRVCLI 588 >UniRef50_Q6MK70 SM-20-related protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MK70_BDEBA Length = 209 Score = 46.1 bits (108), Expect = 9e-04, Method: Composition-based stats. Identities = 34/159 (21%), Positives = 58/159 (36%), Gaps = 22/159 (13%) Query: 30 RVTTGAQGAQVKNNQQVDTRSTLYAA------LQNEVLNAVN--QHALFFAAALPRTLST 81 + + G + N ++ TL+ LQ + L + + L L Sbjct: 53 KASIGHSATKT-VNAEIRGDFTLWLEQDTGSDLQKQFLAQLEVLRQKLNENFYLGLQRFE 111 Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQH 140 F Y Y H+D H +G R ++ L+L+ Q DGGEL + ++ Sbjct: 112 SHFALYPPGGGYDKHIDN----HRGSGARR--ITFILYLNAHWQKGDGGELSLYSPEDEN 165 Query: 141 ----RVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 +V+ G VL+ S H V + R++ W Sbjct: 166 LLLAQVQPRLGTFVLFRSDLFPHQVEK-SHSPRLSLTGW 203 >UniRef50_UPI0000EB0B63 Guanine nucleotide-binding protein G(I)/G(S)/G(T) subunit beta 3 (Transducin beta chain 3). n=2 Tax=Laurasiatheria RepID=UPI0000EB0B63 Length = 704 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 71/210 (33%), Gaps = 42/210 (20%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQ-------------VKNNQQVDTR 49 + G+L+P + + + A G + G +G + V Q+ Sbjct: 438 VVLDGLLTPAECGVLLQLAKDAT-EAGARS-GYRGRRSPHSPHERFEGLTVLKAAQLAHA 495 Query: 50 STLYAALQNEVLNAVNQ----HALFFAAALPRTLS-TPLFNRYQNNETYGFHVDGAVRSH 104 + + +L + +F+ P LS T L R +D + H Sbjct: 496 GAVGSQGAKLLLEVSERVRTLTQAYFSPERPLHLSFTQLVCRSAIEGEQDQRMDLSHPVH 555 Query: 105 PQNGWM---------------RTDLSATLFLSDPQSYDGGELVVNDTFG---QHRVKLPA 146 N + D S L+L+D + GG+L + +V+ Sbjct: 556 ADNCVLDPDTGECWREPPAYTYRDYSGLLYLND--DFHGGDLFFTEPNALTVTAQVRPRC 613 Query: 147 GDLVLYPS--SSLHCVTPVTRGVRVASFMW 174 G LV + S + H V VTRG R A +W Sbjct: 614 GRLVAFSSGGENPHGVWAVTRGRRCALALW 643 >UniRef50_A4S1S9 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4S1S9_OSTLU Length = 210 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 62/176 (35%), Gaps = 15/176 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + + L+ +D AR + + ++ G + N + S +A + Sbjct: 38 IIVVDDALTARDCARIVDAIGD-DFAASSSRGPRHGEARRRNGRFAETSEAFA--RRLYE 94 Query: 62 NAVNQHALFFAAALPRTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A F L+ RY+ E +G HVD V + + R+ +A +L Sbjct: 95 RANVAATFGFEIDDAVGLNPNIRVYRYRAREHFGAHVDERVTALGR----RSKYTALFYL 150 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPA--GDLVLYPSSS---LHCVTPVTRGVRVAS 171 S + +GG + D G+ R ++ G + + + H V G + Sbjct: 151 S--EDVEGGSTIFYDEVGEERCRVRPKIGRALYFRHGADMPEHEGEEVREGTKYVL 204 >UniRef50_C1MLG0 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MLG0_9CHLO Length = 750 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 57/195 (29%), Gaps = 55/195 (28%) Query: 27 VDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFA-------------- 72 DG+++ G + L A++ +L AV L A Sbjct: 560 TDGKLSEGRTSSST-FLTGCKQEEPLVRAIEQRLLRAVQSATLIAAQPNVYDSNERHGQP 618 Query: 73 --------AALPRTLS---TPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + P L RY + Y H D + G +R + ++L+ Sbjct: 619 YRGSTSRFSQRPNLLQGAEPMQVVRYTEGQMYTAHYDN------KQGCLRRTATFMMYLT 672 Query: 122 DPQSYDGGELVVN------------DTFGQHRVKLPAGDLVLYPS--------SSLHCVT 161 D + GG D G G +++ S SLH Sbjct: 673 DV--HSGGATHFPRAVPVSMRDGCGDAAGIRIWP-KRGRALVFWSVSGGIEDVRSLHEAE 729 Query: 162 PVTRGVRVASFMWIQ 176 PV G + + W++ Sbjct: 730 PVIEGEKWIATKWLR 744 >UniRef50_D2VXK7 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VXK7_NAEGR Length = 506 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 32/234 (13%), Positives = 79/234 (33%), Gaps = 27/234 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG--AQVKNNQQVDTR--STLYAALQ 57 ++ + +L ++ ++ E + +T+ K D + L+ L+ Sbjct: 92 LFLVDHLLHQEECKEILKKEESLGFES--ITSEYPVEYRNSKRILYNDKELAAKLWKRLK 149 Query: 58 NEVLNAVNQHA---LFFAAALPRTLSTPL-FNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 +++ +P +++ + ++Y+ + H DG + R+ Sbjct: 150 KYMIDCNFMKPYGLDSEGYWIPISVNECMRLSKYEPGNYFKPHTDGQFVRNDDE---RSI 206 Query: 114 LSATLFLSDPQSYDGGELVVN---DTFGQHRVKLPA--------GDLVLYPSSSLHCVTP 162 + ++L+D + GGE D ++ +K G ++ H Sbjct: 207 YTLIIYLND--GFVGGETKFMRRVDPLAENEMKFKNLCEISPKMGSASVFNHDLYHQGCL 264 Query: 163 VTRGVRVASFMWIQ-SMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLY 215 VT GV+ I I ++ + D + ES+++ + Y Sbjct: 265 VTEGVKYILRTEIMFKRIDSAEQLVTKQDNDEIYNKVMDLLHESDQLERKGDTY 318 >UniRef50_Q0AGD5 Prolyl 4-hydroxylase, alpha subunit n=4 Tax=Bacteria RepID=Q0AGD5_NITEC Length = 237 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 22/110 (20%), Positives = 39/110 (35%), Gaps = 27/110 (24%) Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQH 140 PL +Y H D G + + FL++P + + GGE V+ + Sbjct: 126 PLILKYGKGGFNTLHQD-------LYGDIYFPIQTVFFLTEPDEDFTGGEFVLTQQTSRA 178 Query: 141 R-----VKLPAGDLVL--------------YPSSSLHCVTPVTRGVRVAS 171 + +K GD+++ Y ++ H V+ V G R Sbjct: 179 QSKAIVLKPRKGDMLMMTTHFRPVKGSRGYYRANMKHGVSEVHSGQRYTM 228 >UniRef50_A8I7G7 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8I7G7_CHLRE Length = 236 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 23/114 (20%), Positives = 38/114 (33%), Gaps = 30/114 (26%) Query: 86 RYQNNETYGFHVDGAVRSHPQNGWMR---------TDLSATLFLSDPQSYDGGELVVNDT 136 RY Y HVDGA + L+ ++L+D ++GG Sbjct: 111 RYDKGAVYRPHVDGAWPGSGLKDGRYEFDAYGDRWSRLTFLVYLND--DFEGGATTFYTP 168 Query: 137 F--------------GQHRVKLPAGDLVLYP-----SSSLHCVTPVTRGVRVAS 171 H V AG+++++P S +H VT+G + Sbjct: 169 AQPGTRGGTASGACLEAHSVGPVAGNILVFPHGDTMGSLVHEGAAVTQGSKYVI 222 >UniRef50_A1ZDI6 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDI6_9SPHI Length = 486 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 52/142 (36%), Gaps = 15/142 (10%) Query: 42 NNQQVDTRSTLYAALQNEVLNAVNQHALFFAAA--------LPRTLSTPLFNRYQNNETY 93 N++QV TL A L E+ V L RYQ + + Sbjct: 37 NDRQVVDDDTLAALLFEEIKQYVPSSIDIAGVGKDEAGNWQLKELNHRLRICRYQPEQYF 96 Query: 94 GFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVV----NDTFGQHRVKLPAGDL 149 H+DG H Q+ +++ L+ ++L+D + GG + + GDL Sbjct: 97 NKHLDG---VHYQSATVQSKLTFMVYLNDSHEFIGGRTLFFASKDSDEVIQEFLPETGDL 153 Query: 150 VLYPSSSLHCVTPVTRGVRVAS 171 +++ + H + G++ Sbjct: 154 IIFDHNIWHAGEVLHSGIKYIL 175 >UniRef50_UPI0001926E68 PREDICTED: similar to Novel 2OG-Fe(II) oxygenase superfamily protein n=1 Tax=Hydra magnipapillata RepID=UPI0001926E68 Length = 320 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 25/122 (20%), Positives = 49/122 (40%), Gaps = 15/122 (12%) Query: 98 DGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA-----GDLVLY 152 D + H N ++++ + L+D +++ GEL + +K G VL+ Sbjct: 202 DTGLSLHYDN----SEVTINICLND--NFEDGELFFSGLRNTQSLKYTRVEHKFGFGVLH 255 Query: 153 PSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLL 212 LH P++ G R MW++S R + + ++ SL G+ + S Sbjct: 256 QGHHLHGAMPISSGKRYNLIMWLRS----SDVRNLRCPMCDSTPSLIETEGDGDGFTSSN 311 Query: 213 NL 214 + Sbjct: 312 QI 313 >UniRef50_Q1NF66 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NF66_9SPHN Length = 229 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 61/189 (32%), Gaps = 37/189 (19%) Query: 6 PGVLSPQDVARFREQLEQ----AEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 LSP + A R ++ + G + + N + R L + + Sbjct: 51 QDFLSPDECAELRRLIDANAQPSTLFSGSANADYRTSHSGN---LSPRDPLVERITQRI- 106 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR-----TDLSA 116 A+ L RY + Y H D + MR +A Sbjct: 107 CALTGLPAINGETLQGQ-------RYTPGQEYKVHCDYFPATADYWQRMRGTGGQRTWTA 159 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPA--GDLVLY--------PS-SSLHCVTPVTR 165 ++LS ++ GGE F Q +P G ++++ P+ SLH PV R Sbjct: 160 MIYLSAVEA--GGE----THFPQCEFMVPPVEGMILIWNNMDRDGAPNRFSLHAALPVER 213 Query: 166 GVRVASFMW 174 G + W Sbjct: 214 GTKYVVTKW 222 >UniRef50_D2V3Y8 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V3Y8_NAEGR Length = 202 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 69/188 (36%), Gaps = 31/188 (16%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQV----KNNQQVDTRST-LYAALQNE 59 I + S ++ + E E + D VT GA ++ +NN++V L + + Sbjct: 1 IDNLFSEEECKSYIELAESQGFNDAPVTVGANTFKMMTDYRNNKRVILDDPNLAQQIYTK 60 Query: 60 VLN-------AVNQHALFFAAALPRTL---STPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 V + +N ++ + F RY ++E + H DG + Sbjct: 61 VKDFVPEFASELNVNSRKINTEFFQKCGVNERFRFYRYTSDEYFKAHFDGNFARNNVECT 120 Query: 110 MR----------TDLSATLFLSDPQSYDGGELVVNDTFGQHR----VKLPAGDLVLYPSS 155 + + ++ ++L+ + GGE + G+ R V G ++L+ Sbjct: 121 LENGKTYLCEESSFITMLIYLNTLE--KGGETNFVNPGGEERILHSVNPKTGRVLLFVHR 178 Query: 156 SLHCVTPV 163 LH PV Sbjct: 179 LLHEAEPV 186 >UniRef50_B6HE60 Pc20g15710 protein n=14 Tax=Leotiomyceta RepID=B6HE60_PENCW Length = 484 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 68/209 (32%), Gaps = 41/209 (19%) Query: 7 GVLSPQDVARFREQLEQAEW-------VDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 VLSP + E + DG ++ A N +T + L Sbjct: 286 NVLSPAECKAIIAAGESVNFLPDAPLREDGDISILAH------NFYWIIDTTFHDMLWAR 339 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGA-------------VRSHPQ 106 + V + RY Y H+DGA S P+ Sbjct: 340 ISPYVP--PSINGRKVRGINRRFRVYRYVPGAEYRCHIDGAWPPSGILPDDTYVYDSSPE 397 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVV------NDTFGQHRVKLPAGDLVLYP-----SS 155 + + + L+L+D ++GGE T + V+ G + ++P + Sbjct: 398 DKKQSSMYTFLLYLND--EFEGGETTFFMPAPREGTLNGYPVRPVMGAVAIFPHGESNGA 455 Query: 156 SLHCVTPVTRGVRVASFMWIQSMIRDDKK 184 LH T VT+G + ++ ++ ++ Sbjct: 456 LLHEGTGVTKGAKYIIRTDVEYDVKPSEE 484 >UniRef50_D2VJ99 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VJ99_NAEGR Length = 568 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 69/183 (37%), Gaps = 22/183 (12%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 Y I L+P++ + F E+ + + + + +NN+++ A + L Sbjct: 29 YIIKNFLTPKECSEFIEKAVKIGFD---LASHDYPPSYRNNERIIMDDEELAEKMTKKLK 85 Query: 63 AVNQHALFFAA------ALPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 + +++ S RYQ + + H DG H ++ ++++ L+ Sbjct: 86 PMLDSLNLVEFEKDGLECELKSVNSRFRLCRYQEGQEFRIHQDG---VHYKSKFVKSILT 142 Query: 116 ATLFLSDPQSYDGGELVV-------NDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVR 168 ++L+ + +D G + N + K GDL+++ H V G++ Sbjct: 143 FMIYLN--EEFDNGHTIFFKSGPSSNPPEEMGKYKPQCGDLIVFDHELWHSGEIVNNGIK 200 Query: 169 VAS 171 Sbjct: 201 YVM 203 >UniRef50_A8NWL8 Predicted protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8NWL8_COPC7 Length = 476 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 47/137 (34%), Gaps = 38/137 (27%) Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 + + V A+ Q + + N Y + HVD D+ Sbjct: 109 ILSTVREALLQ---YGNSRHTLEAHLDKMNVYGPGSFFKPHVDTP---------RDRDML 156 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLV----------------------LYP 153 ATL + P ++GG+L++ G+ + + + ++V + Sbjct: 157 ATLVIVLPTEHEGGDLLL----GEEKWQFNSAEMVSKSENSLATDGIEVTMHRLAFAAFY 212 Query: 154 SSSLHCVTPVTRGVRVA 170 S H VTPV G RV Sbjct: 213 SDIEHEVTPVKSGYRVT 229 >UniRef50_B6BGN8 2OG-Fe(II) oxygenase n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BGN8_9PROT Length = 198 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 11/92 (11%) Query: 87 YQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP-QSYDGGELVVNDTFGQ--HRVK 143 Y + Y HVD + ++ +L++ + DGGELV+ D +V Sbjct: 107 YNTGDFYETHVDA------FKNSVNRVVTTVYYLNEGWREGDGGELVIYDEHNNFLKKVP 160 Query: 144 LPAGDLVLYPSSSL-HCVTPVTRGVRVASFMW 174 A LV++ S H V P + R + W Sbjct: 161 PKANTLVVFLSEKFPHEVLPAIK-KRYSIAGW 191 >UniRef50_D2V6G1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V6G1_NAEGR Length = 212 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 68/186 (36%), Gaps = 26/186 (13%) Query: 2 MYHIPGVLSPQDVARF-REQLEQAEWVDGRVTTGAQGAQVKNNQQV-----DTRSTLYAA 55 ++ I G++S ++ + E+ E+ G AQ ++NN ++ + + L+ Sbjct: 7 IWLIDGLVSEEECQEIITNECEKKEFESGTY-NNAQDRSIRNNSRLIMDNQNYSNWLWTR 65 Query: 56 LQNEV-------LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS----- 103 +++ + N + A L F RY E + H DG + Sbjct: 66 VKDYIPKLASELSNKCVRRATTIGYELCELSDKIRFYRYYKGEFFAPHSDGGIVLESVET 125 Query: 104 HPQNGWMRTDLSA---TLFLSDPQSYDGGELVV---NDTFGQHRVKLPAGDLVLYPSSSL 157 + T S L+L++ + GGE N + V+ G ++L+ + Sbjct: 126 IDGEEYYVTKKSFLTLLLYLNEIPN-GGGETEFLNKNTKQVEWSVEPKPGRILLFVHENY 184 Query: 158 HCVTPV 163 H V Sbjct: 185 HQAKTV 190 >UniRef50_C5FBE9 Putative uncharacterized protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FBE9_NANOT Length = 910 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 56/173 (32%), Gaps = 21/173 (12%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV-----DTRSTLYAALQNEVLNA 63 LSP+D + ++ + G T V+ ++ D ++ + +V+ Sbjct: 72 LSPEDAKAVVDLCHRSPFGKGAET--LVDTSVRKCWELNVADFDLKAPGWGNYMKKVVAD 129 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 V++ A Y+ + H D + ATL + P Sbjct: 130 VSKGLGIAHQADSIRADPYKLLLYEEGAFFLSHQDSPKADG---------MFATLVVCLP 180 Query: 124 QSYDGGELVVNDTFGQHRVKLPAGDLVLYP-----SSSLHCVTPVTRGVRVAS 171 ++GGE+++ K + S H V P+T G R+ Sbjct: 181 TKHEGGEIILKHDDKSLVFKTSTTSRAGFSYAAWYSDVFHEVQPITAGYRLVL 233 >UniRef50_Q4KE77 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KE77_PSEF5 Length = 504 Score = 45.4 bits (106), Expect = 0.001, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 60/183 (32%), Gaps = 25/183 (13%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQ--GAQVKNNQQVDTRSTLYAALQNEVLN 62 + LS + E EQ + + G+ + N++ V + L + + Sbjct: 32 VHEFLSASECEALIEATEQCGF----ASAGSDYPSSYRDNDRIVADDPAMAGRLFERLKH 87 Query: 63 AVNQHALFFAAA-----LPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 ++ P + F RY+ + H DG ++ L+ Sbjct: 88 CASRMPKLGTVIDEDGWRPVGINERLRFCRYRPGTQFRAHQDGVH----HRQRQQSRLTF 143 Query: 117 TLFLSDPQSYDGGE-LVVNDTFGQH-------RVKLPAGDLVLYPSSSLHCVTPVTRGVR 168 ++L+D ++ GGE L R++ G L+++ + H V G + Sbjct: 144 MIYLND-DAFSGGETLFFEGRSAAMSNRDSTLRLRPRKGSLIVFDHTLWHAGALVDAGQK 202 Query: 169 VAS 171 Sbjct: 203 YVM 205 >UniRef50_B1HNF1 Putative uncharacterized protein n=3 Tax=Bacillaceae RepID=B1HNF1_LYSSC Length = 237 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 33/108 (30%), Gaps = 23/108 (21%) Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHR 141 PL +Y+ H D + + + L D Y G L+V Sbjct: 127 PLLLKYEAGGFNCLHQD-----LYGDLFFPFQVVFVLNQRDQDYYGGESLLVEQIPRAQS 181 Query: 142 ----VKLPAGDLVLYPS--------------SSLHCVTPVTRGVRVAS 171 + L G +++P+ + H V+ VT G R Sbjct: 182 RGHVITLEQGSALIFPTNHRPVLGKKGYYKNTIRHGVSTVTSGERYGL 229 >UniRef50_B8HQ53 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HQ53_CYAP4 Length = 309 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 63/204 (30%), Gaps = 47/204 (23%) Query: 3 YHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTL----YAAL 56 + L+P++++ E + ++ + +TG +D R +L + Sbjct: 121 VRLEQFLTPEELSYLIEYVVQQKDNFAPTHTSTGD----------LDYRKSLILYNFPEF 170 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTP--------LFNRYQNNETYGFHVDGAVRSHPQNG 108 V+N + L + P + + Y H D Sbjct: 171 SQLVVNRIR---TVMPEVLTKLKMPPFSVGEIESQLTAHGDGNYYKIHNDNGSPETATRE 227 Query: 109 WMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA----------GDLVLYPSSSLH 158 F P+ + GGEL + D+ ++ + A ++ +PS LH Sbjct: 228 LTY----VYYFYQQPKCFSGGELRLFDSKIENGFYVAADSSHIVEPDNNSIIFFPSRYLH 283 Query: 159 CVTPVTRGV------RVASFMWIQ 176 V PV R WI+ Sbjct: 284 EVLPVQCPSREFQYYRFTINGWIR 307 >UniRef50_A0A9R2 Leprecan n=1 Tax=Molgula tectiformis RepID=A0A9R2_9ASCI Length = 740 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 25/112 (22%), Positives = 50/112 (44%), Gaps = 12/112 (10%) Query: 97 VDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLPAGDLVLYP 153 D ++ D SA L+L++ ++GG L+ D+ + +V+ G + + Sbjct: 631 QDNGECLKKPPAYVWRDYSAILYLNNK--FEGGNLIFVDSTAKRISAQVEPKCGRMAAFC 688 Query: 154 S--SSLHCVTPVTRGVRVASFMWIQSMI-RDDKKRAM----LFELDNNIQSL 198 + H V PV++G R A +W + + + +R + L L+N+ L Sbjct: 689 AGKECFHGVKPVSKGQRCAMALWFTTKKEKQEVQRQIAEETLASLENSKDEL 740 >UniRef50_B2JN14 Procollagen-proline dioxygenase n=6 Tax=Burkholderia RepID=B2JN14_BURP8 Length = 305 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 50/196 (25%), Gaps = 34/196 (17%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT------RSTLYAA 55 + VLS + E+ V + V + + Sbjct: 118 VIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCEDAFIER 177 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVD------GAVRSHPQNGW 109 L + +A+ L Y Y H D H G Sbjct: 178 LDRRI-SALMNWPL-------EHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGG 229 Query: 110 MRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCV 160 R + ++LSD GGE V V G + + +LH Sbjct: 230 QRV-ATLIVYLSDVAG--GGETVF--PNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGG 284 Query: 161 TPVTRGVRVASFMWIQ 176 PVT G + W++ Sbjct: 285 APVTNGEKWIMTKWMR 300 >UniRef50_Q1MZY2 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=1 Tax=Bermanella marisrubri RepID=Q1MZY2_9GAMM Length = 219 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 40/183 (21%), Positives = 61/183 (33%), Gaps = 19/183 (10%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNA 63 I L + + + +V + + + NQ V + + + E Sbjct: 29 IIDNALPQALLQSLMNHIATLSSQEFKVAGTGRQDEHQVNQFVRRDEIHWLSEERECERE 88 Query: 64 VNQHALFFAAALPRTLSTPLFN------RYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 A + + L LF+ Y+ Y H+D G LS Sbjct: 89 WFHWAQGLQTEINKRLMLGLFSYEAHMAHYEPGAFYKKHLDA------FKGSRSRVLSTV 142 Query: 118 LFLSDP-QSYDGGELVVNDT----FGQHRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVAS 171 L+L+ QS GGELV+ D RV G LV++ S H V P R R + Sbjct: 143 LYLNPQWQSNYGGELVIYDEHNHDSELTRVSPMPGTLVVFLSEDFPHEVLP-AREHRHSI 201 Query: 172 FMW 174 W Sbjct: 202 AGW 204 >UniRef50_Q4SQX1 Chromosome 11 SCAF14528, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4SQX1_TETNG Length = 690 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 73/203 (35%), Gaps = 38/203 (18%) Query: 5 IPGVLSPQDVARFREQLEQAE--WVDGRVTTG--------AQGAQVKNNQQVDTRSTLYA 54 +P VL V+ +QL ++ +DG ++ + A +K + S Sbjct: 449 VPQVLDGVTVSLSEQQLNGSQRVLLDGVISADQCRQLQRLSNAAALKGDGYRGKPSPHSP 508 Query: 55 ALQNEVLNAVNQHALFFAAALPRT--LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR- 111 + F + P T + P + + ++ HVD + N ++ Sbjct: 509 --GETFQEIRLESP-FRISYTPTTSPAAPPEKQEDRTDLSHPVHVDNCLLVSETNECVKE 565 Query: 112 ------TDLSATLFLSDPQSYDGGELVVN------------DTFGQHRVKLPAGDLVLYP 153 D SA L+L+ + ++GG+ + F V+ G ++ + Sbjct: 566 PPAYTHRDYSAILYLN--EDFEGGDFIFTKLDAKTVTVYFGLLFPVAEVRPRCGRMIGFG 623 Query: 154 S--SSLHCVTPVTRGVRVASFMW 174 + + H V VT+G R A +W Sbjct: 624 AGKENPHGVRAVTKGQRCAVALW 646 >UniRef50_Q08AW1 LOC100158404 protein n=6 Tax=Tetrapoda RepID=Q08AW1_XENLA Length = 292 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 50/129 (38%), Gaps = 6/129 (4%) Query: 47 DTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 + LY ++ ++ + + + + + F+R ++E H D H Sbjct: 148 EEDFKLYREVRLKIQHEIART-FNISVSSLHLTKPTFFSRMNSSEAKTSH-DEYWHPHID 205 Query: 107 NGWMRT-DLSATLFLSD-PQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS--LHCVTP 162 + D ++ L+LSD Q + GG V D V+ G + + S S LH V Sbjct: 206 KVTYGSFDYTSLLYLSDYSQDFGGGRFVFIDESANRTVEPRTGRVSFFTSGSENLHRVEK 265 Query: 163 VTRGVRVAS 171 V G R A Sbjct: 266 VNWGTRYAI 274 >UniRef50_B8C609 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C609_THAPS Length = 438 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 47/121 (38%), Gaps = 21/121 (17%) Query: 114 LSATLFLSDPQSYDGG--------ELVVNDTF----GQHRVKLP-AGDLVLYPSSSLHCV 160 LS T+ LS P ++GG ++V+ +K P AG L+ LH Sbjct: 219 LSFTVLLSSPDDFEGGGTIFDALRDVVIEGDDSIIQSPGSIKPPKAGYATLHSGKLLHGG 278 Query: 161 TPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLR 220 VT+G RV ++ D +R + + + +G ++ L LL+ Sbjct: 279 HVVTQGQRVVLVGFV-----DVDERNVK---EGTLGDATKEWGRNDVRLFWNKRRLELLK 330 Query: 221 E 221 + Sbjct: 331 Q 331 >UniRef50_A5WGK8 Procollagen-proline dioxygenase n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGK8_PSYWF Length = 268 Score = 44.6 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 69/195 (35%), Gaps = 32/195 (16%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA-----QGAQVKNNQQVDTRSTLYAAL 56 + I LSP++ +Q ++ G+ V+++ + T + + Sbjct: 81 VTVINDFLSPEECDALISDADQ------KLKASRVVDPEDGSFVEHSARTSTSTGYHRGE 134 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVD------GAVRSHPQNGWM 110 + + + A + RY++ Y H D + R + G Sbjct: 135 IDIIKTIEARIADLINWPV-DHGEGLQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQ 193 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS---------SLHCVT 161 R + ++LS+ S GG + + ++ G + + ++ +LH Sbjct: 194 RVG-TFLMYLSEVDS--GGSTRFPNLNFE--IRPNKGSALYFANTNLKAEIEPLTLHAGM 248 Query: 162 PVTRGVRVASFMWIQ 176 PVT GV+ + W++ Sbjct: 249 PVTEGVKYLATKWLR 263 >UniRef50_A9G957 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G957_SORC5 Length = 263 Score = 44.6 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 46/113 (40%), Gaps = 12/113 (10%) Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 A+++ + +A + ++ + + Y N H D + P + + Sbjct: 95 AIRDTIESAWLSNPIYRNEKATLHDYVIVTHYYGNRGMLSRHRD---YNGPAPLPL---I 148 Query: 115 SATLFLSDP-QSYDGGELVVNDTFGQHR-----VKLPAGDLVLYPSSSLHCVT 161 + LS+P + Y+GG LV+ G R + L GD +++ + LH V Sbjct: 149 QFWVALSEPGKDYEGGNLVIYSKDGTRRRVEADLGLRRGDALIFDKTLLHEVE 201 >UniRef50_Q4RU77 Chromosome 1 SCAF14995, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4RU77_TETNG Length = 472 Score = 44.6 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 14/91 (15%) Query: 96 HVDGAVRSHPQN-------GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLP 145 H D + N + D SA L+L+ ++GGE + + + VK Sbjct: 347 HADNCLLDPEANECWKEPPAYTYRDYSALLYLN--GDFEGGEFIFTEMDAKTVTASVKPR 404 Query: 146 AGDLVLYPS--SSLHCVTPVTRGVRVASFMW 174 G +V + S + H V VT G R A +W Sbjct: 405 CGRMVGFSSGGENPHGVKAVTGGQRCAVALW 435 >UniRef50_Q8IVL6 Prolyl 3-hydroxylase 3 n=34 Tax=Amniota RepID=P3H3_HUMAN Length = 736 Score = 44.6 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 35/91 (38%), Gaps = 14/91 (15%) Query: 96 HVDGAVRSHPQNGWMR-------TDLSATLFLSDPQSYDGGELVVNDTFG---QHRVKLP 145 H D V R D S L+L+D + GG+L + RV+ Sbjct: 584 HADNCVLDPDTGECWREPPAYTYRDYSGLLYLND--DFQGGDLFFTEPNALTVTARVRPR 641 Query: 146 AGDLVLYPSSSL--HCVTPVTRGVRVASFMW 174 G LV + S H V VTRG R A +W Sbjct: 642 CGRLVAFSSGVENPHGVWAVTRGRRCALALW 672 >UniRef50_A5FT70 Putative uncharacterized protein n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5FT70_ACICJ Length = 762 Score = 44.6 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 57/174 (32%), Gaps = 24/174 (13%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-----TLYAALQNEVLNA 63 L P + + E A + G T V+ Q+ +A ++L Sbjct: 46 LLPIQARQLIDAAEPAPFGRGEQT--IIDPTVRRCGQIGPDRVRLGGRHWARTLEDILAR 103 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 V L + L Y + H D + M L+ L P Sbjct: 104 V-SDGLGVDEPIDAEFYKLLI--YDQGGFFVSHRD-----TEKVAGMFATLTVVL----P 151 Query: 124 QSYDGGELVVNDTFGQHRVKLPAGDL--VLYPSSS---LHCVTPVTRGVRVASF 172 + GG+L++ + + L GD V + + +H + PV+ G R+A Sbjct: 152 SHFSGGDLIIRHKGREACLALHTGDPGDVAFAAFYADCVHEILPVSEGCRLALI 205 >UniRef50_B6QKY4 Putative uncharacterized protein n=1 Tax=Penicillium marneffei ATCC 18224 RepID=B6QKY4_PENMQ Length = 459 Score = 44.6 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 34/93 (36%), Gaps = 18/93 (19%) Query: 87 YQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPA 146 Y+ + H D + ATL +S P ++GGE++V + +K Sbjct: 140 YEEGAFFLPHQDSEKADG---------MFATLVISLPSKHEGGEVIV--SHKGESLKFET 188 Query: 147 GDLVLYP-------SSSLHCVTPVTRGVRVASF 172 G Y + +H V PV G R+ Sbjct: 189 GSNSEYGFSWAAWYADVMHEVKPVRSGYRIVLV 221 >UniRef50_Q9LSI6 Prolyl 4-hydroxylase alpha subunit-like protein n=31 Tax=Magnoliophyta RepID=Q9LSI6_ARATH Length = 332 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 68/202 (33%), Gaps = 35/202 (17%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 ++ G LS ++ F + L + + V G V++ + + L + V Sbjct: 81 VFLYEGFLSDEECDHFIK-LAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVS 139 Query: 62 --NAVNQHALFFAAALPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 A F LP + Y+N + Y H D + + Sbjct: 140 NVEAKLAAWTF----LPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLM 195 Query: 119 FLSDPQSYDGGELVVNDTFG----------------QHRVKLPAGDLVLY----P----- 153 +LS+ + GGE V G + VK GD +L+ P Sbjct: 196 YLSNVE--KGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 253 Query: 154 SSSLHCVTPVTRGVRVASFMWI 175 S+SLH PV G + ++ WI Sbjct: 254 SNSLHGSCPVVEGEKWSATRWI 275 >UniRef50_P74376 Sll0428 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74376_SYNY3 Length = 350 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 39/111 (35%), Gaps = 22/111 (19%) Query: 83 LFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS-DPQSYDGGELVVNDTFGQHR 141 + + Y H D +L+ + + +P+++ GGEL + D+ ++ Sbjct: 243 QLTAHNHGNYYKVHNDNGSPDSAT-----RELTYVYYFNREPKAFSGGELAIYDSKIENN 297 Query: 142 VKLPA----------GDLVLYPSSSLHCVTPVT------RGVRVASFMWIQ 176 + A +V + S +H V PV R W++ Sbjct: 298 FYVAAESFKTVQPVNNSIVFFLSRYMHEVLPVNCPSQAFADSRFTINGWVR 348 >UniRef50_Q26DQ8 Oxidoreductase, 20G-Fe(II) oxygenase superfamily n=2 Tax=Flavobacteria RepID=Q26DQ8_9BACT Length = 320 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 55/152 (36%), Gaps = 20/152 (13%) Query: 43 NQQVDTRSTLYAALQNE---VLNAVNQH---ALFFAAA------LPRTLSTPLFNRYQ-N 89 N ++ R+ L + H F + R ++ L Y + Sbjct: 120 NSEIPRRAPYGIQLNRYGIMLDPRSEGHLAAPNFQSFYNTIMDRYMRPIARLLLGTYGFD 179 Query: 90 NETYGFHVDGAVRSHPQNGWMRTDLSA-TLFLS---DPQSYDGGELVVNDTFGQHRVK-- 143 N+T+GF + ++ TD SA TL ++ + + G ++ +D V Sbjct: 180 NQTFGFSI-QYNPDKDKDLHAHTDASAATLNININLPDEEFTGSQVDFHDKSTGKVVPTI 238 Query: 144 LPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 G +++ + H P+T G R +W+ Sbjct: 239 FEPGKAIIHRGNVPHATHPITSGQRSNLVVWL 270 >UniRef50_B8C4H5 Prolyl 4-hydrolase-like protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C4H5_THAPS Length = 180 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 41/105 (39%), Gaps = 15/105 (14%) Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHR 141 +Y+ E Y H D + H + L+ ++ ++ + +GG G Sbjct: 78 LQLLKYEVGEFYRPH-DDFIDDHVRQAKGPRLLTFFIYFNEVE--EGGGTRFP-KLGNLT 133 Query: 142 VKLPAGDLVLYPS-----------SSLHCVTPVTRGVRVASFMWI 175 ++ G ++++PS + H VT+G + A+ WI Sbjct: 134 IQPKLGRVLIWPSVLDGDAYKKDERTNHEAMEVTKGRKFAANAWI 178 >UniRef50_C6J1E0 Prolyl 4-hydroxylase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J1E0_9BACL Length = 215 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 38/106 (35%), Gaps = 14/106 (13%) Query: 80 STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ 139 Y + Y H D + P R + ++L+D ++ GGE V + Sbjct: 103 EGLQVLHYGPGQEYQAHYDFFGPNSPSASNNRIS-TLIIYLNDVEA--GGETVFPLLDLE 159 Query: 140 HRVKLPAGDLVLYPSS---------SLHCVTPVTRGVRVASFMWIQ 176 VK G + + +LH PV RG + + W++ Sbjct: 160 --VKPERGSALYFEYFYRQQELNNLTLHSSVPVVRGEKWVATQWMR 203 >UniRef50_Q4Q6N6 Putative uncharacterized protein n=2 Tax=Leishmania RepID=Q4Q6N6_LEIMA Length = 325 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 39/94 (41%), Gaps = 14/94 (14%) Query: 88 QNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYD-GGELVVNDTFGQHRVKLPA 146 + +H D + + + L+ ++L++ + D GGEL + F + +P Sbjct: 142 NEGGAFPWHYDNPGKPN------KRRLTMAVYLTEDWAPDIGGELQM-MPFLGPCITVPP 194 Query: 147 G--DLVLYPSS-SLHCVTPV---TRGVRVASFMW 174 +VL+ S LH V P+ T R +W Sbjct: 195 KFCTVVLFQSDMMLHRVRPILSHTHKTRYCFTIW 228 >UniRef50_Q47UG1 Oxidoreductase, 2OG-Fe(II) oxygenase family n=7 Tax=Proteobacteria RepID=Q47UG1_COLP3 Length = 227 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 44/129 (34%), Gaps = 24/129 (18%) Query: 63 AVNQHALFFAAALPRTLSTPLFN------RYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 A A A L R L LF+ Y + Y H D G LS Sbjct: 91 AWINWAESLQAYLNRRLFLGLFSFESHFAHYAKGDFYKKHKDA------FKGEGNRVLSV 144 Query: 117 TLFLSDP-QSYDGGELVVNDTFGQHRVKLPA---------GDLVLYPSSSL-HCVTPVTR 165 ++L+ S DGGELV+ D V + G +V++ S H V R Sbjct: 145 VVYLNPHWASSDGGELVIYDKNSPSSVVVDNSKITVIPSFGTIVVFLSEEFPHEVLAAKR 204 Query: 166 GVRVASFMW 174 R + W Sbjct: 205 -DRYSIAGW 212 >UniRef50_Q82QN7 Putative oxygenase n=1 Tax=Streptomyces avermitilis RepID=Q82QN7_STRAW Length = 223 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 29/124 (23%), Positives = 46/124 (37%), Gaps = 30/124 (24%) Query: 84 FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS-DPQSYDGGELVVNDTF----- 137 N + + + Y H D + P R L+ +L P+ +DGGEL V D Sbjct: 102 LNAHNDGDFYRPHQDTSAEFAP-----RRLLTFVYYLHRTPRPFDGGELRVFDAALPLHT 156 Query: 138 ------GQHRVK--LPAGDLVLY--PSSSLHCVTPVT------RGVRVASFMWIQSMIRD 181 + + P D +++ P+ + H V PV+ R A W+ S D Sbjct: 157 ETAGRWQERTWRDWEPEHDSIVFFLPT-AWHEVRPVSCPSKQHADSRFAINGWLCS--PD 213 Query: 182 DKKR 185 R Sbjct: 214 PANR 217 >UniRef50_C1EER8 Predicted protein n=2 Tax=Micromonas RepID=C1EER8_9CHLO Length = 439 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 70/195 (35%), Gaps = 32/195 (16%) Query: 6 PGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVN 65 P ++ A+ + V G T G +N +V + V A+ Sbjct: 236 PSLVVRHQTAKRGDTAGGDTAVHGEATAGRTS----HNCRVSSSHP-------IVRAAIQ 284 Query: 66 QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD-------LSATL 118 + A + P RY ++ Y H D R+HP++ +T+ ++ Sbjct: 285 RAA-YLCGLEPSHAEPAQVVRYLPSQEYKPHHDWFDRAHPESFRAKTEGRGGQRAVTCLA 343 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRV 169 +L +P+ GG K+ GD +L+ + +LH PV G + Sbjct: 344 YLVEPE--RGGRTYFPKLRAGFEPKV--GDALLWWNVDENGAEDFKTLHAGEPVEAGAKW 399 Query: 170 ASFMWIQSMIRDDKK 184 A +W++ R ++ Sbjct: 400 ALNLWLREKPRRGEE 414 >UniRef50_Q84406 A85R protein n=4 Tax=Chlorovirus RepID=Q84406_PBCV1 Length = 242 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 40/106 (37%), Gaps = 13/106 (12%) Query: 82 PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHR 141 RY+ + Y H DG R + ++L P+ GGE + + Sbjct: 138 VQVARYKPGQYYYHHYDGDDCDDACPKDQRL-ATLMVYLKAPEEGGGGETDFPTL--KTK 194 Query: 142 VKLPAGDLVLY----PSS------SLHCVTPVTRGVRVASFMWIQS 177 +K G + + P + +LH PV G ++ + WI++ Sbjct: 195 IKPKKGTSIFFWVADPVTRKLYKETLHAGLPVKSGEKIIANQWIRA 240 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B5YS96 PKHD-type hydroxylase ybiX n=227 Tax=Bacteria Re... 282 8e-75 UniRef50_A8ESR5 PKHD-type hydroxylase Abu_0724 n=7 Tax=Proteobac... 263 3e-69 UniRef50_Q1LIS6 PKHD-type hydroxylase Rmet_3078 n=4 Tax=Burkhold... 256 6e-67 UniRef50_Q2JHA7 PKHD-type hydroxylase CYB_2270 n=16 Tax=Cyanobac... 255 7e-67 UniRef50_C5T9F3 2OG-Fe(II) oxygenase n=1 Tax=Acidovorax delafiel... 254 2e-66 UniRef50_Q15TJ8 PKHD-type hydroxylase Patl_2273 n=6 Tax=Proteoba... 252 6e-66 UniRef50_Q3SUS4 PKHD-type hydroxylase Nwi_0701 n=10 Tax=Proteoba... 249 6e-65 UniRef50_A3P5N3 PKHD-type hydroxylase BURPS1106A_A1609 n=30 Tax=... 246 3e-64 UniRef50_B8GUF9 PKHD-type hydroxylase Tgr7_2199 n=1 Tax=Thioalka... 244 1e-63 UniRef50_A5WFM3 PKHD-type hydroxylase PsycPRwf_1523 n=1 Tax=Psyc... 243 4e-63 UniRef50_Q47YL9 PKHD-type hydroxylase CPS_3426 n=1 Tax=Colwellia... 240 2e-62 UniRef50_Q5QUG6 PKHD-type hydroxylase IL0759 n=2 Tax=Idiomarina ... 238 9e-62 UniRef50_Q0AP20 PKHD-type hydroxylase Mmar10_1675 n=1 Tax=Marica... 238 9e-62 UniRef50_A8TQK0 Putative hydroxylase n=1 Tax=alpha proteobacteri... 234 2e-60 UniRef50_Q087U3 PKHD-type hydroxylase Sfri_0612 n=2 Tax=Alteromo... 231 1e-59 UniRef50_Q0C1R0 PKHD-type hydroxylase HNE_1625 n=4 Tax=Alphaprot... 230 2e-59 UniRef50_C7R7F4 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis... 223 3e-57 UniRef50_A3PE10 PKHD-type hydroxylase P9301_13621 n=4 Tax=Prochl... 222 6e-57 UniRef50_Q3AJA6 PKHD-type hydroxylase Syncc9605_1577 n=13 Tax=Cy... 220 4e-56 UniRef50_A3YZT5 Putative uncharacterized protein n=2 Tax=Chrooco... 219 8e-56 UniRef50_Q0I9X3 PKHD-type hydroxylase sync_1544 n=2 Tax=Synechoc... 212 8e-54 UniRef50_A7HP27 PKHD-type hydroxylase Plav_0037 n=1 Tax=Parvibac... 205 8e-52 UniRef50_Q05TP1 Putative hydroxylase n=1 Tax=Synechococcus sp. R... 203 4e-51 UniRef50_Q1GRV0 PKHD-type hydroxylase Sala_1910 n=1 Tax=Sphingop... 201 1e-50 UniRef50_UPI00006A2339 UPI00006A2339 related cluster n=1 Tax=Xen... 192 1e-47 UniRef50_B9TN05 PKHD-type hydroxylase ybiX, putative (Fragment) ... 188 1e-46 UniRef50_Q5GQB0 Putative uncharacterized protein n=1 Tax=Synecho... 187 3e-46 UniRef50_A8TKW7 2OG-Fe(II) oxygenase n=1 Tax=alpha proteobacteri... 186 6e-46 UniRef50_C7BVH5 2OG-Fe(II) oxygenase family like protein n=1 Tax... 159 7e-38 UniRef50_Q1GXG2 PKHD-type hydroxylase Mfla_0096 n=1 Tax=Methylob... 153 3e-36 UniRef50_D2VSR0 Predicted protein n=1 Tax=Naegleria gruberi RepI... 152 1e-35 UniRef50_A6G260 Uncharacterized iron-regulated protein n=1 Tax=P... 147 3e-34 UniRef50_Q2S7M4 Uncharacterized iron-regulated protein n=1 Tax=H... 144 2e-33 UniRef50_UPI00016C3A48 hypothetical protein GobsU_06128 n=1 Tax=... 144 2e-33 UniRef50_Q54PP0 Putative uncharacterized protein n=2 Tax=Dictyos... 143 3e-33 UniRef50_C3ZC48 Putative uncharacterized protein n=1 Tax=Branchi... 143 4e-33 UniRef50_Q1DA84 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 143 6e-33 UniRef50_D2VZW5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 141 1e-32 UniRef50_C0YUD2 Possible iron-regulated protein n=1 Tax=Chryseob... 141 1e-32 UniRef50_A1ZDU9 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 141 1e-32 UniRef50_A4EST3 Uncharacterized iron-regulated protein n=1 Tax=R... 141 2e-32 UniRef50_A5EM79 Putative uncharacterized protein n=1 Tax=Bradyrh... 140 3e-32 UniRef50_A9DQY6 Uncharacterized iron-regulated protein n=1 Tax=K... 140 3e-32 UniRef50_D2VV82 Prolyl 4-hydroxylase alpha subunit family protei... 140 3e-32 UniRef50_D2VXK7 Predicted protein n=2 Tax=Naegleria gruberi RepI... 140 4e-32 UniRef50_Q111M8 2OG-Fe(II) oxygenase n=2 Tax=Trichodesmium eryth... 138 1e-31 UniRef50_Q08MC8 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 136 4e-31 UniRef50_C7BVA8 2OG-Fe(II) oxygenase superfamily like protein n=... 135 9e-31 UniRef50_D0SK49 Predicted protein n=1 Tax=Acinetobacter junii SH... 135 2e-30 UniRef50_D2VRB4 Predicted protein n=1 Tax=Naegleria gruberi RepI... 133 3e-30 UniRef50_Q4KE77 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 132 8e-30 UniRef50_B6HE60 Pc20g15710 protein n=14 Tax=Leotiomyceta RepID=B... 131 1e-29 UniRef50_A8TIV2 Putative uncharacterized protein n=1 Tax=alpha p... 131 1e-29 UniRef50_A8TW57 Putative uncharacterized protein n=1 Tax=alpha p... 131 2e-29 UniRef50_D2W6C1 Predicted protein n=2 Tax=Naegleria gruberi RepI... 130 3e-29 UniRef50_B8GU65 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp... 130 3e-29 UniRef50_D2V3Y8 Predicted protein (Fragment) n=1 Tax=Naegleria g... 128 1e-28 UniRef50_D2VJ99 Predicted protein n=1 Tax=Naegleria gruberi RepI... 128 1e-28 UniRef50_B4RHL4 Putative uncharacterized protein n=2 Tax=Phenylo... 128 2e-28 UniRef50_B8C289 Predicted protein (Fragment) n=1 Tax=Thalassiosi... 128 2e-28 UniRef50_Q2SAS7 FOG: WD40 repeat n=1 Tax=Hahella chejuensis KCTC... 127 2e-28 UniRef50_Q2TWV5 Predicted protein n=2 Tax=Aspergillus RepID=Q2TW... 127 2e-28 UniRef50_A8IV51 Predicted protein n=1 Tax=Chlamydomonas reinhard... 126 4e-28 UniRef50_Q4QF16 Putative uncharacterized protein n=7 Tax=Trypano... 125 9e-28 UniRef50_A6D9X4 Putative uncharacterized protein n=1 Tax=Caminib... 125 1e-27 UniRef50_B0CEH1 Putative uncharacterized protein n=1 Tax=Acaryoc... 125 1e-27 UniRef50_A5VDF3 Alkyl hydroperoxide reductase/ Thiol specific an... 124 2e-27 UniRef50_B0C2I7 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 124 2e-27 UniRef50_D2VW34 Oxidoreductase n=3 Tax=Naegleria gruberi RepID=D... 124 2e-27 UniRef50_B0E4Q8 Predicted protein n=1 Tax=Laccaria bicolor S238N... 124 3e-27 UniRef50_B8CBV6 Predicted protein (Fragment) n=2 Tax=Thalassiosi... 123 4e-27 UniRef50_A8P9E2 Putative uncharacterized protein n=1 Tax=Coprino... 123 5e-27 UniRef50_A8J470 Prolyl 4-hydroxylase alpha-1 subunit-like protei... 123 5e-27 UniRef50_Q8DKV0 Tlr0755 protein n=1 Tax=Thermosynechococcus elon... 122 9e-27 UniRef50_Q0FVH2 Oxidoreductase domain protein n=9 Tax=Rhodobacte... 122 9e-27 UniRef50_B7G8Q9 Predicted protein n=1 Tax=Phaeodactylum tricornu... 122 1e-26 UniRef50_B0CZ29 Predicted protein n=2 Tax=Agaricales RepID=B0CZ2... 122 1e-26 UniRef50_Q98E25 Mlr4439 protein n=1 Tax=Mesorhizobium loti RepID... 121 1e-26 UniRef50_A1ZDI6 Oxidoreductase, 2OG-Fe(II) oxygenase family fami... 121 2e-26 UniRef50_B8C4F7 Predicted protein n=1 Tax=Thalassiosira pseudona... 121 2e-26 UniRef50_A4RVI8 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 120 3e-26 UniRef50_D2V6G1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 120 3e-26 UniRef50_B5VUP4 Alkyl hydroperoxide reductase/ Thiol specific an... 120 3e-26 UniRef50_Q5UP57 Putative prolyl 4-hydroxylase n=1 Tax=Acanthamoe... 120 4e-26 UniRef50_C6B8G5 Alkyl hydroperoxide reductase/ Thiol specific an... 120 4e-26 UniRef50_A4RSI6 Predicted protein (Fragment) n=5 Tax=Viridiplant... 119 6e-26 UniRef50_B5YLH9 Predicted protein n=1 Tax=Thalassiosira pseudona... 119 6e-26 UniRef50_B7GCB6 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 119 7e-26 UniRef50_B0C441 2OG-Fe(II) oxygenase, putative n=1 Tax=Acaryochl... 118 1e-25 UniRef50_B8MEI3 Putative uncharacterized protein n=1 Tax=Talarom... 118 2e-25 UniRef50_C1DZC3 Prolyl 4-hydroxylase n=3 Tax=Viridiplantae RepID... 117 3e-25 UniRef50_Q338D2 Prolyl 4-hydroxylase, putative, expressed n=11 T... 116 4e-25 UniRef50_Q486F0 Oxidoreductase, 2OG-Fe(II) oxygenase family n=8 ... 116 4e-25 UniRef50_A9YW24 Putative uncharacterized protein n=2 Tax=unclass... 116 7e-25 UniRef50_A8P9H2 Putative uncharacterized protein n=1 Tax=Coprino... 116 8e-25 UniRef50_A5VC31 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas wittic... 115 9e-25 UniRef50_B5YLK5 Predicted protein n=1 Tax=Thalassiosira pseudona... 115 1e-24 UniRef50_B2JN14 Procollagen-proline dioxygenase n=6 Tax=Burkhold... 115 2e-24 UniRef50_D2VWJ6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 114 2e-24 UniRef50_C1FF01 Predicted protein (Fragment) n=3 Tax=Mamiellales... 114 2e-24 UniRef50_D2VHD9 Predicted protein n=1 Tax=Naegleria gruberi RepI... 114 2e-24 UniRef50_Q21FK1 2OG-Fe(II) oxygenase n=1 Tax=Saccharophagus degr... 113 4e-24 UniRef50_B9IJQ5 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 113 5e-24 UniRef50_A6C2C6 Uncharacterized iron-regulated protein n=1 Tax=P... 113 6e-24 UniRef50_B7G721 Predicted protein n=1 Tax=Phaeodactylum tricornu... 113 6e-24 UniRef50_C1MLG0 Predicted protein n=1 Tax=Micromonas pusilla CCM... 112 7e-24 UniRef50_A3PDM8 Putative uncharacterized protein n=2 Tax=root Re... 112 7e-24 UniRef50_C7YTN2 Putative uncharacterized protein n=1 Tax=Nectria... 112 8e-24 UniRef50_D2V7Q2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 111 1e-23 UniRef50_Q6LGS5 Putative uncharacterized protein NCU03445 n=1 Ta... 111 2e-23 UniRef50_A0NVC8 Oxidoreductase domain protein n=2 Tax=Labrenzia ... 111 2e-23 UniRef50_A4RTV5 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 111 2e-23 UniRef50_Q94H92 Os03g0761900 protein n=23 Tax=Embryophyta RepID=... 111 2e-23 UniRef50_A8TUG3 Putative uncharacterized protein n=1 Tax=alpha p... 110 4e-23 UniRef50_A8I7G7 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 110 5e-23 UniRef50_Q1NF66 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas sp. SK... 110 5e-23 UniRef50_Q5GQX6 Putative uncharacterized protein n=1 Tax=Synecho... 109 7e-23 UniRef50_B2B745 Predicted CDS Pa_2_9860 n=1 Tax=Podospora anseri... 108 1e-22 UniRef50_D2VTG5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 108 1e-22 UniRef50_B8CBF7 Putative uncharacterized protein n=1 Tax=Thalass... 108 2e-22 UniRef50_D2V646 Putative uncharacterized protein n=1 Tax=Naegler... 108 2e-22 UniRef50_C5FBE9 Putative uncharacterized protein n=1 Tax=Microsp... 108 2e-22 UniRef50_B0DEZ6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 107 2e-22 UniRef50_B5JX69 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacteri... 107 2e-22 UniRef50_B5Y4Z8 Predicted protein n=3 Tax=Bacillariophyta RepID=... 107 2e-22 UniRef50_A0YIP7 Putative uncharacterized protein n=1 Tax=Lyngbya... 107 2e-22 UniRef50_A4RVD9 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 107 3e-22 UniRef50_D2VER2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 107 3e-22 UniRef50_UPI00016C3513 hypothetical protein GobsU_05758 n=1 Tax=... 107 3e-22 UniRef50_D0N498 Putative uncharacterized protein n=1 Tax=Phytoph... 106 5e-22 UniRef50_A3WCU8 Putative uncharacterized protein n=1 Tax=Erythro... 106 5e-22 UniRef50_Q0QZ85 2OG-Fe(II) oxygenase superfamily n=1 Tax=Synecho... 106 5e-22 UniRef50_Q58MI5 Putative uncharacterized protein n=1 Tax=Prochlo... 106 6e-22 UniRef50_A5WD57 2OG-Fe(II) oxygenase n=4 Tax=Moraxellaceae RepID... 106 6e-22 UniRef50_D2W6G4 Predicted protein (Fragment) n=1 Tax=Naegleria g... 106 8e-22 UniRef50_C4Y2U7 Putative uncharacterized protein n=1 Tax=Clavisp... 105 2e-21 UniRef50_A8P9G8 Putative uncharacterized protein n=2 Tax=Coprino... 105 2e-21 UniRef50_A0KRU0 2OG-Fe(II) oxygenase n=7 Tax=Shewanella RepID=A0... 103 3e-21 UniRef50_A8I8D2 Predicted protein n=3 Tax=Chlamydomonas reinhard... 103 5e-21 UniRef50_B8BWH5 Putative uncharacterized protein n=1 Tax=Thalass... 103 6e-21 UniRef50_Q5N1K6 Putative uncharacterized protein n=2 Tax=Synecho... 102 8e-21 UniRef50_A6VYC6 2OG-Fe(II) oxygenase n=2 Tax=Marinomonas RepID=A... 102 8e-21 UniRef50_B2HZ49 Predicted proline hydroxylase n=18 Tax=Acinetoba... 102 1e-20 UniRef50_B9F6P1 Putative uncharacterized protein n=3 Tax=Poaceae... 101 1e-20 UniRef50_A5V2J9 Putative uncharacterized protein n=1 Tax=Sphingo... 101 1e-20 UniRef50_D2VKT5 Predicted protein n=2 Tax=Naegleria gruberi RepI... 101 1e-20 UniRef50_Q4SNF8 Chromosome 8 SCAF14543, whole genome shotgun seq... 101 2e-20 UniRef50_Q2JRT8 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 101 2e-20 UniRef50_D0Z4R6 SM-20-related protein n=1 Tax=Photobacterium dam... 101 2e-20 UniRef50_D0KWT8 2OG-Fe(II) oxygenase n=1 Tax=Halothiobacillus ne... 101 2e-20 UniRef50_A5BUA7 Putative uncharacterized protein n=1 Tax=Vitis v... 101 3e-20 UniRef50_Q2S9H1 Putative uncharacterized protein n=1 Tax=Hahella... 100 3e-20 UniRef50_A3WJ18 Prolyl 4-hydroxylase alpha subunit-like protein,... 100 4e-20 UniRef50_B9ZRT4 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp... 100 5e-20 UniRef50_A4S1S9 Predicted protein n=1 Tax=Ostreococcus lucimarin... 100 6e-20 UniRef50_D2VKE9 Type IIB DNA topoisomerase n=1 Tax=Naegleria gru... 100 7e-20 UniRef50_Q1MZY2 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 100 7e-20 UniRef50_A8TVG3 Putative uncharacterized protein n=1 Tax=alpha p... 100 7e-20 UniRef50_B8HQ53 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyano... 100 8e-20 UniRef50_Q58MX3 Putative uncharacterized protein n=1 Tax=Prochlo... 99 8e-20 UniRef50_C7JE98 Putative uncharacterized protein n=8 Tax=Acetoba... 99 9e-20 UniRef50_B8HX71 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyano... 99 9e-20 UniRef50_Q8T5S8 Prolyl-4-hydroxylase-alpha PV n=14 Tax=Drosophil... 98 1e-19 UniRef50_Q1MT87 Novel protein similar to vertebrate leprecan-lik... 98 2e-19 UniRef50_B3PB13 SM-20 domain protein n=1 Tax=Cellvibrio japonicu... 98 2e-19 UniRef50_A8IDI8 Prolyl 4-hydroxylase n=2 Tax=Chlamydomonas reinh... 98 2e-19 UniRef50_UPI0001925DF6 PREDICTED: similar to predicted protein n... 98 2e-19 UniRef50_C1EAF2 Predicted protein n=2 Tax=Micromonas RepID=C1EAF... 98 3e-19 UniRef50_D0N7E1 Putative uncharacterized protein n=1 Tax=Phytoph... 97 6e-19 UniRef50_D1LX56 Leprecan-like protein n=1 Tax=Saccoglossus kowal... 97 7e-19 Sequences not found previously or not previously below threshold: UniRef50_B7GAW7 Predicted protein n=1 Tax=Phaeodactylum tricornu... 120 3e-26 UniRef50_A1TLC5 2OG-Fe(II) oxygenase n=13 Tax=Proteobacteria Rep... 120 3e-26 UniRef50_B9GU89 Predicted protein n=11 Tax=Embryophyta RepID=B9G... 119 6e-26 UniRef50_B8LC79 Predicted protein n=2 Tax=Thalassiosira pseudona... 117 3e-25 UniRef50_B8C9F1 Predicted protein n=1 Tax=Thalassiosira pseudona... 116 4e-25 UniRef50_C9S6P4 Putative uncharacterized protein n=1 Tax=Vertici... 116 4e-25 UniRef50_B9ZQ51 Procollagen-proline dioxygenase n=1 Tax=Thioalka... 116 6e-25 UniRef50_Q24JN5 At2g17720 n=14 Tax=Spermatophyta RepID=Q24JN5_ARATH 116 6e-25 UniRef50_Q9LSI6 Prolyl 4-hydroxylase alpha subunit-like protein ... 115 9e-25 UniRef50_C1MXG7 Predicted protein n=1 Tax=Micromonas pusilla CCM... 115 1e-24 UniRef50_Q2SJN5 Putative uncharacterized protein n=1 Tax=Hahella... 115 1e-24 UniRef50_B8LC77 Predicted protein n=2 Tax=Thalassiosira pseudona... 113 3e-24 UniRef50_A5WGK8 Procollagen-proline dioxygenase n=1 Tax=Psychrob... 113 5e-24 UniRef50_Q84406 A85R protein n=4 Tax=Chlorovirus RepID=Q84406_PBCV1 112 9e-24 UniRef50_B7G8R4 Putative uncharacterized protein n=1 Tax=Phaeoda... 112 9e-24 UniRef50_D2V0A9 Predicted protein n=1 Tax=Naegleria gruberi RepI... 112 1e-23 UniRef50_C1ECX9 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 112 1e-23 UniRef50_A2WFK5 Putative uncharacterized protein n=1 Tax=Burkhol... 111 2e-23 UniRef50_Q0D7Z6 Os07g0194500 protein n=4 Tax=Oryza sativa RepID=... 111 2e-23 UniRef50_B7HCK8 Prolyl 4-hydroxylase, alpha subunit domain prote... 110 4e-23 UniRef50_Q3AVV8 Prolyl 4-hydroxylase, alpha subunit n=8 Tax=Bact... 110 4e-23 UniRef50_D0J124 2OG-Fe(II) oxygenase n=3 Tax=Comamonadaceae RepI... 110 4e-23 UniRef50_D2QI60 2OG-Fe(II) oxygenase n=1 Tax=Spirosoma linguale ... 109 7e-23 UniRef50_Q54TF2 Putative uncharacterized protein n=1 Tax=Dictyos... 108 1e-22 UniRef50_Q6C0K2 YALI0F23969p n=1 Tax=Yarrowia lipolytica RepID=Q... 108 1e-22 UniRef50_D2V353 2OG-Fe(II) oxygenase family protein n=1 Tax=Naeg... 108 1e-22 UniRef50_B7J8P3 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 108 2e-22 UniRef50_Q2G5T2 2OG-Fe(II) oxygenase n=4 Tax=Sphingomonadales Re... 108 2e-22 UniRef50_B8C4H5 Prolyl 4-hydrolase-like protein (Fragment) n=1 T... 107 3e-22 UniRef50_Q2BP80 Putative uncharacterized protein n=1 Tax=Neptuni... 107 4e-22 UniRef50_A4RWQ6 Predicted protein n=4 Tax=Chlorophyta RepID=A4RW... 106 5e-22 UniRef50_A7RAS2 Putative uncharacterized protein C118R n=2 Tax=C... 106 6e-22 UniRef50_D2VEV9 Predicted protein n=2 Tax=Naegleria gruberi RepI... 106 6e-22 UniRef50_C1FDT8 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 106 6e-22 UniRef50_Q0C131 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 106 7e-22 UniRef50_B4JY14 GH14106 n=1 Tax=Drosophila grimshawi RepID=B4JY1... 105 1e-21 UniRef50_A8J7D3 Predicted protein n=3 Tax=Chlamydomonas reinhard... 105 1e-21 UniRef50_C6J1E0 Prolyl 4-hydroxylase n=1 Tax=Paenibacillus sp. o... 105 2e-21 UniRef50_A6C3X4 Uncharacterized iron-regulated protein n=1 Tax=P... 105 2e-21 UniRef50_B2W510 Oxidoreductase domain containing protein n=2 Tax... 105 2e-21 UniRef50_P74376 Sll0428 protein n=1 Tax=Synechocystis sp. PCC 68... 104 2e-21 UniRef50_B5EQX8 2OG-Fe(II) oxygenase n=2 Tax=Acidithiobacillus f... 104 2e-21 UniRef50_UPI0001AEC671 2OG-Fe(II) oxygenase family oxidoreductas... 104 3e-21 UniRef50_D0N4G0 Putative uncharacterized protein n=1 Tax=Phytoph... 104 3e-21 UniRef50_Q54N51 Putative uncharacterized protein n=1 Tax=Dictyos... 103 4e-21 UniRef50_A5V9G5 2OG-Fe(II) oxygenase n=5 Tax=Sphingomonadales Re... 102 7e-21 UniRef50_A8IBT2 Predicted protein (Fragment) n=2 Tax=Chlamydomon... 102 9e-21 UniRef50_Q6BZD8 DEHA2A02134p n=10 Tax=Saccharomycetales RepID=Q6... 102 1e-20 UniRef50_B6KQ34 2OG-Fe(II) oxygenase family protein, putative n=... 101 2e-20 UniRef50_A0NE94 AGAP004611-PA (Fragment) n=1 Tax=Anopheles gambi... 101 2e-20 UniRef50_Q75DK4 ABR017Cp n=2 Tax=Saccharomycetaceae RepID=Q75DK4... 101 3e-20 UniRef50_B0DS97 Predicted protein n=2 Tax=Agaricales RepID=B0DS9... 101 3e-20 UniRef50_Q29CA7 GA15937 n=4 Tax=pseudoobscura subgroup RepID=Q29... 100 3e-20 UniRef50_B8KEU1 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 ... 100 4e-20 UniRef50_C5CTL3 Procollagen-proline dioxygenase n=1 Tax=Variovor... 100 4e-20 UniRef50_A8J238 Predicted protein n=1 Tax=Chlamydomonas reinhard... 100 4e-20 UniRef50_Q3BVS8 Putative uncharacterized protein n=3 Tax=Xanthom... 100 4e-20 UniRef50_Q3BXN0 Putative 2OG-Fe(II) oxygenase superfamily protei... 100 6e-20 UniRef50_C1EER8 Predicted protein n=2 Tax=Micromonas RepID=C1EER... 100 7e-20 UniRef50_D2W1Z9 Predicted protein n=1 Tax=Naegleria gruberi RepI... 100 7e-20 UniRef50_Q9FKX6 Prolyl 4-hydroxylase, alpha subunit-like protein... 99 9e-20 UniRef50_B6AGD9 Putative uncharacterized protein n=1 Tax=Cryptos... 99 1e-19 UniRef50_A0KKW9 SM-20 domain protein n=43 Tax=Gammaproteobacteri... 99 1e-19 UniRef50_B8C5J7 Predicted protein n=1 Tax=Thalassiosira pseudona... 98 2e-19 UniRef50_Q2UGW2 Predicted protein n=11 Tax=Trichocomaceae RepID=... 98 2e-19 UniRef50_B8N2A2 Putative uncharacterized protein n=1 Tax=Aspergi... 98 2e-19 UniRef50_B4RXU6 Oxidoreductase, 2OG-Fe(II) oxygenase family prot... 98 2e-19 UniRef50_B7G237 Proly 4-hydroxylase n=1 Tax=Phaeodactylum tricor... 98 3e-19 UniRef50_B7G6B8 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 98 3e-19 UniRef50_D1HTI8 Whole genome shotgun sequence of line PN40024, s... 97 3e-19 UniRef50_A4RXU6 Predicted protein n=3 Tax=Mamiellales RepID=A4RX... 97 3e-19 UniRef50_A4ACT0 Prolyl 4-hydroxylase, alpha subunit domain prote... 97 4e-19 UniRef50_B2SFP9 Oxidoreductase n=19 Tax=Francisella RepID=B2SFP9... 97 4e-19 UniRef50_A4EM02 Response regulator receiver domain protein (CheY... 97 4e-19 UniRef50_B8C7A3 Putative uncharacterized protein (Fragment) n=1 ... 97 4e-19 UniRef50_B8CFF8 Predicted protein n=2 Tax=Thalassiosira pseudona... 97 4e-19 UniRef50_C5LEZ3 Prolyl 4-hydroxylase alpha subunit, putative n=1... 97 5e-19 UniRef50_Q9LT92 Genomic DNA, chromosome 5, P1 clone:MJM18 n=11 T... 97 6e-19 UniRef50_B6KKV3 2OG-Fe(II) oxygenase family protein n=3 Tax=Toxo... 97 6e-19 UniRef50_Q2ULX3 Predicted protein n=2 Tax=Aspergillus RepID=Q2UL... 97 7e-19 >UniRef50_B5YS96 PKHD-type hydroxylase ybiX n=227 Tax=Bacteria RepID=YBIX_ECO5E Length = 228 Score = 282 bits (721), Expect = 8e-75, Method: Composition-based stats. Identities = 222/228 (97%), Positives = 224/228 (98%), Gaps = 3/228 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV Sbjct: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL Sbjct: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASF+WIQSMIR Sbjct: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFIWIQSMIR 180 Query: 181 DDKKRAMLFELD---NNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 DDKKRAMLFELD NIQSLKSRYGE+EEILSLLNLYHNLLREWSEI Sbjct: 181 DDKKRAMLFELDKNIQNIQSLKSRYGENEEILSLLNLYHNLLREWSEI 228 >UniRef50_A8ESR5 PKHD-type hydroxylase Abu_0724 n=7 Tax=Proteobacteria RepID=Y724_ARCB4 Length = 226 Score = 263 bits (672), Expect = 3e-69, Method: Composition-based stats. Identities = 116/226 (51%), Positives = 156/226 (69%), Gaps = 1/226 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ HIP VLS + + R L +A W+DG++T G Q KNN Q+ L L++ + Sbjct: 1 MILHIPEVLSKEQLTECRNLLNKANWIDGKITAGNQAINAKNNFQLAESDPLTNYLRDII 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLF 119 A+N + LF +AALP+ + +P FN+Y+N YG HVD ++ + RTD+S +LF Sbjct: 61 KTALNSNPLFISAALPKHIISPFFNKYENGGNYGNHVDNSILFDMNEKKAFRTDISCSLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 +DP+ Y+GGE+V+ DTFG H VKLPAGDL+LYPS+SLH V PVT+GVR+ SFMWIQSMI Sbjct: 121 FTDPEEYEGGEMVIEDTFGTHEVKLPAGDLILYPSTSLHRVEPVTKGVRMVSFMWIQSMI 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 R KR++LFELDN IQSL+ YGE EE L+L YH L++EWSE+ Sbjct: 181 RSAWKRSILFELDNTIQSLRVNYGEIEETLNLSIHYHKLIQEWSEL 226 >UniRef50_Q1LIS6 PKHD-type hydroxylase Rmet_3078 n=4 Tax=Burkholderiaceae RepID=Y3078_RALME Length = 228 Score = 256 bits (653), Expect = 6e-67, Method: Composition-based stats. Identities = 127/227 (55%), Positives = 161/227 (70%), Gaps = 3/227 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAE--WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M+ IP VL+ + +A REQL+ A WVDGRVT G GA VK NQQ+D RS A Q+ Sbjct: 1 MLVRIPQVLNAEQLAMLREQLDHAGDAWVDGRVTAGYSGAPVKFNQQIDERSEAAAQCQH 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSAT 117 VL+A+ ++ LF +A LP + P+FNRY T+G HVDG VR HP NG +RTD+SAT Sbjct: 61 LVLSALERNPLFISAVLPNIVYPPMFNRYSEGMTFGLHVDGGVRLHPHNGRKLRTDVSAT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFLSDP SYDGGEL + DT+G H VKL AGD+V+YPS+SLH V P+TRGVRV F WIQS Sbjct: 121 LFLSDPASYDGGELQIEDTYGVHSVKLAAGDMVVYPSTSLHQVKPITRGVRVGCFFWIQS 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +IRDD +RA+LF++DN IQ+L + +L+ YHNLLR+WS+ Sbjct: 181 LIRDDGQRALLFDMDNAIQTLNQTNADERARRTLVGCYHNLLRQWSD 227 >UniRef50_Q2JHA7 PKHD-type hydroxylase CYB_2270 n=16 Tax=Cyanobacteria RepID=Y2270_SYNJB Length = 224 Score = 255 bits (652), Expect = 7e-67, Method: Composition-based stats. Identities = 89/225 (39%), Positives = 135/225 (60%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I VLS ++ + + AE+VDG +T G VKNN+Q+ S ++ + Sbjct: 1 MILCIGDVLSLAELQQILSLIADAEFVDGALTAGWNARLVKNNRQMPKGSLQQRKIEEII 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L A+ ++ LF AA P+ + + L + Y+ +YG H D A+ MRTD+S TLFL Sbjct: 61 LAALERNLLFQMAARPKLIHSILISCYEAGMSYGTHTDDALMLDRH-QLMRTDISFTLFL 119 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S P+ YDGGEL + + G+ KLPAG L+LYP+S+LH V PVTRG+R A+ W+QS+IR Sbjct: 120 SAPEDYDGGELKIESSEGEQAYKLPAGALILYPASTLHRVEPVTRGIRYAAVSWVQSLIR 179 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 D ++R +LF+L Q + G++ + +Y NLLR+W+E+ Sbjct: 180 DPQEREILFDLQTVRQQMFQESGKTRHFDLISKVYANLLRKWAEL 224 >UniRef50_C5T9F3 2OG-Fe(II) oxygenase n=1 Tax=Acidovorax delafieldii 2AN RepID=C5T9F3_ACIDE Length = 229 Score = 254 bits (648), Expect = 2e-66, Method: Composition-based stats. Identities = 111/229 (48%), Positives = 157/229 (68%), Gaps = 4/229 (1%) Query: 1 MMYHIPGVLSPQDVARFREQL-EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 M HI VL P+++A FR+ L A WVDG + G Q KNN Q+ S L A LQ Sbjct: 1 MFLHIKDVLPPEELAFFRQALGADAPWVDGARSAGGQAIHQKNNLQLAQGSELSAQLQAR 60 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNN-ETYGFHVDGAVR-SHPQNGWMRTDLSAT 117 V A++++ALFF+AALPR + PLFN Y + YG HVD AV SH N W+R+DLS T Sbjct: 61 VKAALHRNALFFSAALPRRIYNPLFNNYGDGTNFYGNHVDSAVMHSHADNCWVRSDLSCT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFL+ P+ Y+GGELV + FG+ R+KLPAGD++LYPSS++H V+PVTRG R++ F W++S Sbjct: 121 LFLTPPEDYEGGELVATEAFGEKRIKLPAGDMILYPSSTVHQVSPVTRGHRISCFFWVES 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESE-EILSLLNLYHNLLREWSEI 225 M+R ++R +LF++D ++ L+ +GE E +++L YHNLLR W+++ Sbjct: 181 MVRGLEQRQLLFDMDMSLLKLRQAHGEKEPSVIALSGTYHNLLRMWADV 229 >UniRef50_Q15TJ8 PKHD-type hydroxylase Patl_2273 n=6 Tax=Proteobacteria RepID=Y2273_PSEA6 Length = 227 Score = 252 bits (644), Expect = 6e-66, Method: Composition-based stats. Identities = 106/225 (47%), Positives = 157/225 (69%), Gaps = 2/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I +LS ++V +F + L++ +W+DG+ T G+Q ++VK NQQ+D S L L+N V Sbjct: 1 MLTVIEDLLSKKEVTQFTQALDKGQWLDGKHTAGSQASKVKYNQQLDDGSALAIELRNTV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM-RTDLSATLF 119 + ++ +ALF ++ALP + P FNRYQ E YG HVD +V P + M RTDLSATLF Sbjct: 61 IRKLSGNALFMSSALPNKIYPPKFNRYQGGEHYGLHVDASVMPIPNSHQMLRTDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 LS+P++YDGGEL + FG ++KL AG ++LYP++SLH V PVT+G R ASF WI+S++ Sbjct: 121 LSEPKTYDGGELSIETQFGLQQIKLNAGSVILYPANSLHQVNPVTKGRRTASFFWIESLV 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESE-EILSLLNLYHNLLREWS 223 R + +R+MLF+LD +IQ+L G ++ E+ L +YHNL+R W+ Sbjct: 181 RSNDQRSMLFDLDQSIQALTVELGSNDAEVKRLTGVYHNLMRSWA 225 >UniRef50_Q3SUS4 PKHD-type hydroxylase Nwi_0701 n=10 Tax=Proteobacteria RepID=Y701_NITWN Length = 226 Score = 249 bits (635), Expect = 6e-65, Method: Composition-based stats. Identities = 110/225 (48%), Positives = 146/225 (64%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I VL+P ++ RFRE L QA+W DGR T G + K N+Q+ +L L + Sbjct: 1 MIQVISDVLTPDELKRFRELLGQAQWQDGRATAGHVAVRAKANEQLSHEDSLGQQLSEFL 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-WMRTDLSATLF 119 L + + + F AAALP + P FNRY +YG H+D A+ S P G +R DLSATLF Sbjct: 61 LERLGKISHFIAAALPLKVLPPRFNRYTGGGSYGDHIDNAIFSVPGAGVRIRGDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 LS+P YDGGEL++ F +H+ KLPAG ++LYP+S+ H VTPVTRG R+A+F W QS++ Sbjct: 121 LSEPGDYDGGELIIQGEFARHQFKLPAGQMILYPASTFHQVTPVTRGARLAAFFWTQSLV 180 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 R+ +RA+LFELDN IQ+L E + L LYHNLLREWSE Sbjct: 181 REHSRRALLFELDNTIQALAQDNPEQPAVARLTGLYHNLLREWSE 225 >UniRef50_A3P5N3 PKHD-type hydroxylase BURPS1106A_A1609 n=30 Tax=Proteobacteria RepID=Y5709_BURP0 Length = 227 Score = 246 bits (629), Expect = 3e-64, Method: Composition-based stats. Identities = 119/226 (52%), Positives = 158/226 (69%), Gaps = 2/226 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MM HIPGVL+ + VA+ R+ L+ A+W DG T+GAQ A K N+Q+ S A + + Sbjct: 1 MMLHIPGVLTKEQVAQCRDILDAADWTDGNATSGAQSALAKRNRQLPEGSPAARAAGDAI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW-MRTDLSATLF 119 +A+ ++ALFF+AALP + PLFNRY + +G HVD A+R + +R+DLSATLF Sbjct: 61 QDALARNALFFSAALPLKVFPPLFNRYAGGDAFGTHVDNAIRLLRGTDFRVRSDLSATLF 120 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 L +P+ YDGGEL V DT+G HR KLPAGD+VLYP+SSLH VTPVTRG RVASF WIQSM+ Sbjct: 121 LEEPEHYDGGELCVEDTYGVHRAKLPAGDMVLYPASSLHHVTPVTRGARVASFFWIQSMV 180 Query: 180 RDDKKRAMLFELDNNIQSL-KSRYGESEEILSLLNLYHNLLREWSE 224 RDD R +L++LD IQ L + G +++L +YHNLLR W++ Sbjct: 181 RDDADRTLLYQLDTQIQRLTAEKGGRDASVIALTGIYHNLLRRWAD 226 >UniRef50_B8GUF9 PKHD-type hydroxylase Tgr7_2199 n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=Y2199_THISH Length = 224 Score = 244 bits (624), Expect = 1e-63, Method: Composition-based stats. Identities = 98/225 (43%), Positives = 136/225 (60%), Gaps = 1/225 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ IP +L +A R L A + DGR + GA +VK+N++VD T AL V Sbjct: 1 MLLTIPELLDAAQLAEIRRLLADAPFTDGRYSAGADARRVKHNEEVDPSDTRVRALNQLV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L + +H F AAALPR LS F RY YG HVD V P+ G RTD+S T+F+ Sbjct: 61 LMPLYRHETFQAAALPRKLSGAFFARYLPGMQYGAHVDDPVMG-PEGGRYRTDVSVTVFI 119 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +PQSY+GGELVV FG+ +VKLPAG V+YPSSSLH V+PVT G R+ + W +SM+R Sbjct: 120 GEPQSYEGGELVVETDFGEQQVKLPAGHAVIYPSSSLHRVSPVTGGERLVAVAWAESMVR 179 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 D +R ML+EL ++L+ ++E ++ NL+R W+++ Sbjct: 180 DPARRQMLYELYQVHEALRRDNPDAEVTRRAGHVRANLMRMWADV 224 >UniRef50_A5WFM3 PKHD-type hydroxylase PsycPRwf_1523 n=1 Tax=Psychrobacter sp. PRwf-1 RepID=Y1523_PSYWF Length = 259 Score = 243 bits (620), Expect = 4e-63, Method: Composition-based stats. Identities = 99/258 (38%), Positives = 145/258 (56%), Gaps = 34/258 (13%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M++ I +L +++ L + A+W DG++T G Q KNN Q+ + Y A+ N Sbjct: 1 MLHIIENLLDTAQLSQLTSILTHQHAQWQDGKLTAGISAQQQKNNWQLSRQDPSYQAMAN 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSAT 117 L A+ QH +F +AALP+ + PLF+ YQ + YG HVD A+++HP MRTDLS T Sbjct: 61 LCLEALQQHPVFMSAALPKVIMPPLFSAYQLGQGYGMHVDNALQTHPDSKQLMRTDLSLT 120 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFL++P Y+GGELV++D +G+H +KL AGD VLYPS+SLH V VT G R+A W+QS Sbjct: 121 LFLNNPADYEGGELVISDEYGEHSIKLSAGDAVLYPSTSLHRVNTVTSGQRLAMVTWVQS 180 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGES-------------------------------E 206 ++R D++R +L +LD + L+ + + + Sbjct: 181 LVRSDEQRQILHDLDVSHILLRQKLLATSDQAQSTQAQSTQAQCGQLSEQHSTDQQLTHQ 240 Query: 207 EILSLLNLYHNLLREWSE 224 I L YHNLLR W+E Sbjct: 241 AIEKLNQSYHNLLRLWAE 258 >UniRef50_Q47YL9 PKHD-type hydroxylase CPS_3426 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y3426_COLP3 Length = 223 Score = 240 bits (613), Expect = 2e-62, Method: Composition-based stats. Identities = 78/223 (34%), Positives = 122/223 (54%), Gaps = 2/223 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ +P VLSP VA + +E + G+ T G VKNN Q + L +Q + Sbjct: 1 MITKLPQVLSPIQVASIIQLIEHGSFNSGKDTAGWHAKAVKNNLQWQGETELNEQIQTGI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A+ QH F AA +++ + + YG H+D A+ + +RTD+S TLFL Sbjct: 61 QGALTQHPQFTGAAYAKSMMPFIISESTLGGGYGDHIDDALMVN--ETVLRTDISCTLFL 118 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 + PQ Y+GGELV+N + + KL AGD ++YPS++LH V PVT G R + WI+S I Sbjct: 119 TPPQDYEGGELVMNLSGMEMAFKLNAGDAIIYPSTTLHRVNPVTSGSRKVALTWIESHIP 178 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 +R +LF+LD + + +G+++ + + NLLR+W+ Sbjct: 179 QASQREILFDLDCARKDIMEHHGKTDAFDRITKTHANLLRQWA 221 >UniRef50_Q5QUG6 PKHD-type hydroxylase IL0759 n=2 Tax=Idiomarina RepID=Y759_IDILO Length = 218 Score = 238 bits (608), Expect = 9e-62, Method: Composition-based stats. Identities = 77/225 (34%), Positives = 127/225 (56%), Gaps = 7/225 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I + V L+ ++ DG+ T G VKNNQQ+ + + A + Sbjct: 1 MILQISNAVDTDTVKSIVAGLDAGQFSDGKKTAGWAAKDVKNNQQLSGKKSEAA--TQVL 58 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L+ + Q+AL + P+ ++ NRYQ E YG H+D ++ + +RTD+S TL L Sbjct: 59 LDRLQQNALVQSVMRPKQVARVTINRYQQGEYYGTHMDDSLMN-----GVRTDISFTLGL 113 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S ++GGELV+ D G+ +L GD+++YPS LH V PVT+G R+A W+QS+++ Sbjct: 114 SPLSDFEGGELVIEDASGERSWRLGQGDILMYPSHYLHRVNPVTKGSRLAMIGWVQSLVK 173 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 R +LF+++ ++++ G+SE L ++HNLLREWS++ Sbjct: 174 QPNYRELLFDIEQSLKAEFDANGKSENFDRLTKVFHNLLREWSDV 218 >UniRef50_Q0AP20 PKHD-type hydroxylase Mmar10_1675 n=1 Tax=Maricaulis maris MCS10 RepID=Y1675_MARMM Length = 219 Score = 238 bits (608), Expect = 9e-62, Method: Composition-based stats. Identities = 92/224 (41%), Positives = 137/224 (61%), Gaps = 6/224 (2%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ H+ V + V R+ + Q +VDG T G VK N+Q++ + + +++EV Sbjct: 1 MLIHLQKVCPSEQVDHLRDLIGQGGFVDGGTTAGQVARAVKANEQLEAGARV-DTVRSEV 59 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 A+ HA F + A P+TLS L +RY++ YG H+D A+ G R DLS TLFL Sbjct: 60 RKALMAHAGFVSFARPKTLSRILVSRYRDGMAYGPHIDDALM-----GGRRADLSFTLFL 114 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 SDP SYDGGELV++ G+ +KL AGD V+Y +S++H V PVTRG RVA W++S++R Sbjct: 115 SDPDSYDGGELVMDGPDGETEIKLAAGDAVVYATSAIHQVAPVTRGERVAVVGWVRSLVR 174 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R +LF+LD +L +R G++ E+ +L NLLR+W+E Sbjct: 175 RPDQREILFDLDQVSAALFARDGKTRELDLVLKTKANLLRQWAE 218 >UniRef50_A8TQK0 Putative hydroxylase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TQK0_9PROT Length = 223 Score = 234 bits (596), Expect = 2e-60, Method: Composition-based stats. Identities = 88/224 (39%), Positives = 126/224 (56%), Gaps = 2/224 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ I G+LS + V ++L A++VDG ++ G G +K N QV +S Y L V Sbjct: 1 MVAVIEGLLSKEQVQTIAKRLFGAQFVDGTLSGGPLGEAIKKNTQVSPQSPEYRELSQLV 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L + Q+ AALP+ + +P+F Y YG HVD A+ MRTDLS T+FL Sbjct: 61 LGIMRQNDQVAIAALPKRILSPIFASYVEGNRYGEHVDAALMGPY--PGMRTDLSITIFL 118 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +DP +YDGGELV+ FG+ K AGD VLYP+ +H V P+TRG R+A WI+SM+R Sbjct: 119 NDPGAYDGGELVLKTAFGEQIYKRAAGDAVLYPTHYVHRVNPITRGRRLAIVTWIESMVR 178 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 D +R ++ +L + L + E I + NLLR W++ Sbjct: 179 DPARREVIEDLAEAMDKLVRDGADGEIIRRVEKARLNLLRMWAD 222 >UniRef50_Q087U3 PKHD-type hydroxylase Sfri_0612 n=2 Tax=Alteromonadales RepID=Y612_SHEFN Length = 226 Score = 231 bits (590), Expect = 1e-59, Method: Composition-based stats. Identities = 102/226 (45%), Positives = 145/226 (64%), Gaps = 2/226 (0%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M I +LS QDV +R+QL + W DGR T A VKNN Q D + L N++L Sbjct: 1 MIVIEQILSKQDVGAYRQQLAECPWGDGRKTAMGMAASVKNNNQADAQHANVRQLANQLL 60 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSATLFL 120 + + +AALP + P FNRY E YG+HVD A+ P + +R+D+S T+FL Sbjct: 61 ARIGETPKIVSAALPHKIFPPCFNRYNETEEYGYHVDAAIMRIPNTSEVIRSDVSMTVFL 120 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 S+P+ YDGGELV+ FGQ ++KLPAG V+YPSSSLH VT VTRG R+A+ W+QSM+ Sbjct: 121 SEPEEYDGGELVIATEFGQQQIKLPAGYAVVYPSSSLHKVTAVTRGQRIAAITWMQSMVA 180 Query: 181 DDKKRAMLFELDNNIQSL-KSRYGESEEILSLLNLYHNLLREWSEI 225 D R L++LD +IQ+L K+ + E+ +L N+YHNL+R+++++ Sbjct: 181 DVTLRQTLYQLDQSIQNLIKANNTDRAELDNLHNVYHNLIRQFTQL 226 >UniRef50_Q0C1R0 PKHD-type hydroxylase HNE_1625 n=4 Tax=Alphaproteobacteria RepID=Y1625_HYPNA Length = 224 Score = 230 bits (587), Expect = 2e-59, Method: Composition-based stats. Identities = 93/225 (41%), Positives = 124/225 (55%), Gaps = 2/225 (0%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M I +L + L + W DGR T GA +VK NQQ D S + ++ +L Sbjct: 1 MIVIENILGQDVLTEVAAALRELRWEDGRNTAGATARRVKRNQQADLSSRTGSKVREVLL 60 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 AV +H + A A P + PL + + YG H+D V + +RTDLS TLFLS Sbjct: 61 EAVKRHPVVEAYARPLKFAPPLISCSGEGDAYGLHIDNPVMG-KGDARLRTDLSFTLFLS 119 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 P+SYDGGEL + F VKLPAG +V+YPS+ LH VTPVT G R WIQS I+D Sbjct: 120 PPESYDGGELEIETVFKTESVKLPAGSMVIYPSTELHRVTPVTSGERFVFVGWIQSAIKD 179 Query: 182 DKKRAMLFELDNNIQSLKSRYGE-SEEILSLLNLYHNLLREWSEI 225 +RA+LF++ N L R+ S E+L+L NL+R WS+I Sbjct: 180 AAQRAILFDVTNLKAGLARRFPPGSPELLTLAKTESNLIRMWSDI 224 >UniRef50_C7R7F4 2OG-Fe(II) oxygenase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R7F4_KANKD Length = 218 Score = 223 bits (569), Expect = 3e-57, Method: Composition-based stats. Identities = 77/227 (33%), Positives = 119/227 (52%), Gaps = 12/227 (5%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV---DTRSTLYAALQ 57 M+ + ++ P + E++ + ++ G+ T G +KNNQQ+ D + A L Sbjct: 1 MILQLSDIIEPNTLNVICEEVAKLDFHSGQQTAGKAVRSLKNNQQILLVDDQPAPLAMLF 60 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + +F AA LP+ + + NRYQ YG H+D A +RTD+S T Sbjct: 61 ----RHLQKSPIFQAACLPKQFARVMLNRYQQGMQYGNHIDDAYI-----AGVRTDVSFT 111 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LS Y+GGELV+ D+ G+ KL G++++YPSS LH V PVT G R+A W+QS Sbjct: 112 YCLSSTSDYNGGELVLCDSTGERSWKLDKGEVLIYPSSYLHRVNPVTEGTRIAMVGWLQS 171 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 + D +R +LF+L + G+SE+ L Y NLLR W++ Sbjct: 172 KVGDASQRELLFDLKQAVTHELETQGKSEQYDRLSKSYSNLLRMWAD 218 >UniRef50_A3PE10 PKHD-type hydroxylase P9301_13621 n=4 Tax=Prochlorococcus marinus RepID=Y1362_PROM0 Length = 222 Score = 222 bits (566), Expect = 6e-57, Method: Composition-based stats. Identities = 73/227 (32%), Positives = 123/227 (54%), Gaps = 8/227 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLEQA---EWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 M Y I +L+ +++ +++L++ +W DG+ T G+ + VKNN Q++ + + Sbjct: 1 MNYLIHQLLNAEEINLIKKELDKCSQQDWEDGKKTAGSHASMVKNNLQLNRNTEVSKKNA 60 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 V + L + +LP+ + +F + N YG H+D S R+DLS T Sbjct: 61 QLVTKKILSSQLIKSFSLPKKIHGIMFTKSSKNMHYGRHIDNPYMSS-----GRSDLSFT 115 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 + L++ YDGGEL++ + + KL G+++LYPSS LH V V G R+ WI+S Sbjct: 116 ISLTNKDFYDGGELIIETMNTEEKFKLNPGEIILYPSSYLHAVNEVNNGERLVCVGWIES 175 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 ++ +KR LF+LD +SL S++G S+E+ + Y NLLR+ E Sbjct: 176 YVKSTEKREYLFDLDAGARSLLSKHGRSDELDLIFKSYSNLLRDIGE 222 >UniRef50_Q3AJA6 PKHD-type hydroxylase Syncc9605_1577 n=13 Tax=Cyanobacteria RepID=Y1577_SYNSC Length = 222 Score = 220 bits (560), Expect = 4e-56, Method: Composition-based stats. Identities = 81/226 (35%), Positives = 119/226 (52%), Gaps = 8/226 (3%) Query: 1 MMYHIPGVLSPQDVARFREQLEQ--AEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M + +L +V +++L W DGR+T G Q A VK N Q+D + L A+ N Sbjct: 1 MEFLTHSLLPLHEVCALQQRLSAPNLPWRDGRLTAGDQAALVKKNYQLDPNAELSLAISN 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + A+ L + +L R + + L +R E+YG+HVD + R DLS T Sbjct: 61 CISTALTSDPLVKSFSLVRKVHSLLVSRSSAGESYGWHVDNPFSRN-----GRRDLSFTC 115 Query: 119 FLSDPQSYDGGELVVNDTFG-QHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 FLSD SY+GG L++ +LP G +VLYPSS+LHCVTPV G R WI+S Sbjct: 116 FLSDEDSYEGGSLMIQTGGEDTKEFRLPPGQVVLYPSSTLHCVTPVLSGDRYVCVGWIES 175 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 ++ R+MLF +D + L +R+G S+E+ + Y N +R S Sbjct: 176 YVKAADDRSMLFNIDAGARGLLARHGRSDELDLIFQSYTNAVRRLS 221 >UniRef50_A3YZT5 Putative uncharacterized protein n=2 Tax=Chroococcales RepID=A3YZT5_9SYNE Length = 221 Score = 219 bits (557), Expect = 8e-56, Method: Composition-based stats. Identities = 88/226 (38%), Positives = 122/226 (53%), Gaps = 7/226 (3%) Query: 1 MMYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M + + +L P V + L E A W G T G VK N Q++ S L+A L Sbjct: 1 MRFVLEPLLQPHQVEDWCLALSSEHASWRPGAETAGWHARSVKRNHQLERGSPLHAQLAE 60 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 ++ +A+ H L AAALP ++ LF+R E YG HVD A R+DLS TL Sbjct: 61 QLQSALLAHPLLLAAALPVSIHGVLFSRSTRGEGYGSHVDNAYM-----AGGRSDLSFTL 115 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 FLSDP +Y GGELV+ + ++ PAG ++YPS+ LH V PV G R+ + WIQS Sbjct: 116 FLSDPDTYSGGELVLEGPADEEALRCPAGHALVYPSTQLHRVEPVRDGQRLVAVGWIQSR 175 Query: 179 IRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R +R +LFELD +++ R G+ E + Y NLLR+W E Sbjct: 176 VRRADQRELLFELDTARRAIFKRDGKDEVFDLISRSYTNLLRQWGE 221 >UniRef50_Q0I9X3 PKHD-type hydroxylase sync_1544 n=2 Tax=Synechococcus RepID=Y1544_SYNS3 Length = 220 Score = 212 bits (540), Expect = 8e-54, Method: Composition-based stats. Identities = 72/224 (32%), Positives = 115/224 (51%), Gaps = 6/224 (2%) Query: 1 MMYHIPGVLSPQDVARFRE-QLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 M + +L R E +AEW+DG +T GA K N Q++ S L + Sbjct: 1 MNHLRLQILDQATCERLLERLANEAEWLDGSLTAGAHAKGGKRNFQINYDSALRKEIHEL 60 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 V A+ H + LPR L L ++ + Y HVD A S R+DLS TL Sbjct: 61 VERAMWNHPVVKGFCLPRKLHRFLISKTEKEGGYDTHVDNAYMSS-----GRSDLSFTLS 115 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMI 179 L+D Y+GG+L ++ + +KL G++++YPS+SLH V VT G+R WI+S + Sbjct: 116 LTDDTMYEGGDLEIDSISESYPIKLKQGEILIYPSTSLHRVCNVTSGIRTVCVGWIESYV 175 Query: 180 RDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 + + R LF+L++ +++ +++G S+E+ + Y NLLR Sbjct: 176 QAENDRICLFQLESGARAVLAKHGRSDELDLIFLAYTNLLRRLG 219 >UniRef50_A7HP27 PKHD-type hydroxylase Plav_0037 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=Y037_PARL1 Length = 219 Score = 205 bits (522), Expect = 8e-52, Method: Composition-based stats. Identities = 84/226 (37%), Positives = 117/226 (51%), Gaps = 10/226 (4%) Query: 1 MMYHIPGVLSPQDVARFRE--QLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 M I G+L D+ R + ++ + G T G VKNN+Q L A L Sbjct: 1 MFIEIAGILGAADL-RLADTVFAQKDAFESGARTAGRIARAVKNNEQAKPAG-LAADLTM 58 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V + ++ +F AAA PR L +RY YG H D A R DLS TL Sbjct: 59 LVEKRLMKNDVFRAAARPRNFIRILLSRYTQGMAYGLHSDDAFM-----ERQRVDLSFTL 113 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 FLS P+SY+GGEL+V + G+ VKL AG LVLYPS++LH V VT G R A+ WI+S+ Sbjct: 114 FLSPPESYEGGELIVEEPAGERLVKLEAGSLVLYPSATLHRVAEVTSGERRAAVGWIRSL 173 Query: 179 IRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 +R + R LF++ ++ ++ G+ LL + +LLR W E Sbjct: 174 VRSAEDRETLFDVALALRQAEAA-GDRALTDRLLKIQGSLLRRWGE 218 >UniRef50_Q05TP1 Putative hydroxylase n=1 Tax=Synechococcus sp. RS9916 RepID=Q05TP1_9SYNE Length = 222 Score = 203 bits (517), Expect = 4e-51, Method: Composition-based stats. Identities = 65/219 (29%), Positives = 102/219 (46%), Gaps = 8/219 (3%) Query: 8 VLSPQDVARFREQLEQA---EWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAV 64 + + + F+ +L WVDG+ TTG+ K N Q+ + L+ + + Sbjct: 8 IFTESECEDFKAKLLTLTPKHWVDGKTTTGSHAKTKKINLQLKPDTQENKELERAIRERL 67 Query: 65 NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQ 124 + F + +P+ + L +R + YG HVD A R D+S T+ LS + Sbjct: 68 RNNPSFKSFCIPKKMHHNLISRTEAGGGYGTHVDNAFMK-----TGRADISYTICLSSEK 122 Query: 125 SYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKK 184 Y GGELV++ VK+ G +YPS+ LH V VT G+R+A W+QS I + Sbjct: 123 DYKGGELVIHGATETTTVKMKQGHAFIYPSNQLHQVNTVTSGIRLACIGWVQSYIASQEL 182 Query: 185 RAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWS 223 R LF L+ L + G SE + + + NLLR + Sbjct: 183 RMNLFNLEAGANYLLATQGRSEALDRIFLAHANLLRSFG 221 >UniRef50_Q1GRV0 PKHD-type hydroxylase Sala_1910 n=1 Tax=Sphingopyxis alaskensis RepID=Y1910_SPHAL Length = 218 Score = 201 bits (512), Expect = 1e-50, Method: Composition-based stats. Identities = 78/221 (35%), Positives = 115/221 (52%), Gaps = 5/221 (2%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 M+ + +L V R+ +VDG+++ ++VKNN Q+ + Y +L Sbjct: 1 MFKLVQLLGDNAVRALRDIAASGTFVDGKIS--NPHSRVKNNLQL-HDAAAYERSSKILL 57 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 +A+ Q+A F + P ++ PL RY YG H D A P G +RTD+S T+FLS Sbjct: 58 DAMIQNADFMEFSFPARIAPPLLTRYTPGMHYGLHPDAAYIPLPD-GQLRTDVSCTIFLS 116 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 DP YDGG L V R K G ++YPS +LH V PVTRG R+ + +IQS+I D Sbjct: 117 DPADYDGGALHVQLGNADLRFKEAPGVAIVYPSHTLHEVEPVTRGERLVAITFIQSLIPD 176 Query: 182 DKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREW 222 ++R ++ EL N I +L+ E L + + LLR W Sbjct: 177 VQQRNLMHEL-NEIAALEGGKMEPANYTRLQAVQYQLLRMW 216 >UniRef50_UPI00006A2339 UPI00006A2339 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2339 Length = 185 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 88/152 (57%), Positives = 109/152 (71%), Gaps = 1/152 (0%) Query: 75 LPRTLSTPLFNRYQNNETYGFHVDGAVRSH-PQNGWMRTDLSATLFLSDPQSYDGGELVV 133 LP PLFNRY YG HVDG+V +R+D+S TLFLS+P+ Y+GGEL+V Sbjct: 34 LPLRTLLPLFNRYAGGGQYGLHVDGSVMRQLGSEQPLRSDVSTTLFLSEPEEYEGGELIV 93 Query: 134 NDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDN 193 DT+G+H VKLPAGD+++YPS+SLH VTPVTRG RVASF W QSM+R D +R LFELD Sbjct: 94 VDTYGEHEVKLPAGDMIVYPSTSLHRVTPVTRGARVASFFWTQSMVRQDSQRLRLFELDQ 153 Query: 194 NIQSLKSRYGESEEILSLLNLYHNLLREWSEI 225 IQ L+ R G+ EE+ SL YHNLLR W+E+ Sbjct: 154 AIQKLRLRLGDDEEVTSLTGHYHNLLRMWAEV 185 >UniRef50_B9TN05 PKHD-type hydroxylase ybiX, putative (Fragment) n=1 Tax=Ricinus communis RepID=B9TN05_RICCO Length = 204 Score = 188 bits (478), Expect = 1e-46, Method: Composition-based stats. Identities = 98/171 (57%), Positives = 122/171 (71%), Gaps = 3/171 (1%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 MM HIP +L +V + R+ L A+W DGR T G+QGAQVK NQQ+ S L L+ V Sbjct: 33 MMLHIPEILRTDEVKQLRDHLNSAQWSDGRATAGSQGAQVKQNQQLPENSPLMPELRQIV 92 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNE--TYGFHVDGAVRSHP-QNGWMRTDLSAT 117 A+ +HAL+F+AALP LS P FNRY + YGFHVDGAVRS P GWMRTDLSAT Sbjct: 93 EQALKRHALYFSAALPLRLSPPQFNRYAAAQLEHYGFHVDGAVRSFPAHPGWMRTDLSAT 152 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVR 168 LFL + Y+GG+L V DT+G+H V+LPAGD++LYPS+S+H VTP+TRG R Sbjct: 153 LFLCESDEYEGGDLTVRDTYGEHEVRLPAGDMILYPSTSVHSVTPLTRGAR 203 >UniRef50_Q5GQB0 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQB0_BPSYP Length = 231 Score = 187 bits (474), Expect = 3e-46, Method: Composition-based stats. Identities = 70/236 (29%), Positives = 111/236 (47%), Gaps = 17/236 (7%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG---AQVKNNQQVDTRSTLYAALQ 57 M+Y +L+ +V R EQ ++A DG V + Q+KN++++D ++ Y Sbjct: 1 MIYKF-DLLTQDEVRRINEQYDKAALKDGTVKINLENSVEKQLKNSKEIDGNTSHYRYCL 59 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + A+ ++ALF + ++ P+ Y Y HVD Q +RTD S T Sbjct: 60 DLIQKAMRRNALFKTTYILGEITPPIMVEYAEGCYYIPHVDSI-----QIQNLRTDHSMT 114 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 LFLS+P Y+GGELV+ K AG +++YP+ LH V P+T G R S MW S Sbjct: 115 LFLSEPDEYEGGELVIGIGDVAKSFKEKAGTVIMYPTGMLHEVRPITSGKRRVSVMWATS 174 Query: 178 MIRDDKKRAMLFELDNNIQSLKSRYGESEE--------ILSLLNLYHNLLREWSEI 225 +I D R L ++ + E E+ ++ L + N LR + I Sbjct: 175 IIDDTFMRHELINFGMGLKKILDYLEEKEDDQLKIQELLIPLEQVRSNFLRGYGNI 230 >UniRef50_A8TKW7 2OG-Fe(II) oxygenase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TKW7_9PROT Length = 238 Score = 186 bits (472), Expect = 6e-46, Method: Composition-based stats. Identities = 78/234 (33%), Positives = 116/234 (49%), Gaps = 16/234 (6%) Query: 2 MYHIPGVLSPQDVARFRE--QLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 +Y I +L+P VA R Q E A WVDG+ T G G + K N ++ S L L ++ Sbjct: 4 VYPIRNLLAPGLVAELRAALQAEGAPWVDGQQTVGRDGTK-KRNHEIAADSPLRQELSDK 62 Query: 60 VLNAV-----NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 V + N+ F PR S LF+R Y H+D AV MR+DL Sbjct: 63 VSAYLRGPLTNETLAFRHVCDPRRWSPFLFSRTGPGGGYRDHMDSAVMFRGSPEEMRSDL 122 Query: 115 SATLFLSDPQSYDGGELVVN-DTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFM 173 S T+FL++P SY GGELVV+ D K+ AG VLY ++++H V VT G R+ + + Sbjct: 123 SMTIFLTEPDSYQGGELVVDSDMPYAPTFKMAAGGAVLYATNAIHRVAEVTAGERLVAVI 182 Query: 174 WIQSMIRDDKKRAMLFELDNNIQSLKSRYGESE------EILSLLNLYHNLLRE 221 WI+S I D R + +L + S+ +R G + + L + N+++ Sbjct: 183 WIESRIADVGTRQINADL-LQVMSVLTRDGACDPEIRESVVTKLEKVRSNVVKR 235 >UniRef50_C7BVH5 2OG-Fe(II) oxygenase family like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVH5_9CAUD Length = 215 Score = 159 bits (402), Expect = 7e-38, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 88/221 (39%), Gaps = 12/221 (5%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+Y L + + + ++ DG + G + + K+N + + N Sbjct: 1 MIYEY-DFLDKNKLRQMLSLFDAGKFEDGAKS-GPKDKKYKHNS--EQSDIEIGKMVNTA 56 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 + + + + + S L +Y+ Y H D RTD + + L Sbjct: 57 VYKLIRESEISKIHILNKCSPSLMLKYEVGNHYADHSDFF-----DMWGTRTDYTCVVNL 111 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIR 180 +D Y+GGE + + K+ G ++YP+ +H V PVT GVR W++S I Sbjct: 112 ND--DYEGGEHYIQIGTERIEKKVEPGKALIYPTEFIHGVNPVTSGVRKCLTFWMESSIV 169 Query: 181 DDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLRE 221 D R L EL+ + + E++++ + L++ Sbjct: 170 DPTIRYYLAELNKFYYKI-EGSMDREDLVNFDLIRMGLIKR 209 >UniRef50_Q1GXG2 PKHD-type hydroxylase Mfla_0096 n=1 Tax=Methylobacillus flagellatus KT RepID=Y096_METFK Length = 176 Score = 153 bits (388), Expect = 3e-36, Method: Composition-based stats. Identities = 60/133 (45%), Positives = 89/133 (66%), Gaps = 1/133 (0%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ IP V +P++ R++L+ EW+DG+VT G Q A+ KNN Q+ L L + + Sbjct: 1 MLITIPEVFTPEEAESIRQRLDATEWLDGKVTAGYQSAKAKNNLQLAENHPLAIELGDLI 60 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSATLF 119 ++ + QH LF +AALPR + PLFNRY++ +++GFH+D AVRS +RTDLS+TLF Sbjct: 61 VSRLTQHPLFMSAALPRKVFPPLFNRYESGQSFGFHIDNAVRSLSGSRERVRTDLSSTLF 120 Query: 120 LSDPQSYDGGELV 132 + P+ YDGGEL+ Sbjct: 121 FTPPEDYDGGELI 133 >UniRef50_D2VSR0 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VSR0_NAEGR Length = 208 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 68/186 (36%), Gaps = 18/186 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAAL-QNEV 60 ++ I + SP++ + + E +V+ +NN +V +A L V Sbjct: 15 IWTIENLYSPEECQQLIKICESNGFVEAPFNAN-MAKDTRNNDRVILDLPQHAQLFWERV 73 Query: 61 LNAVNQHALFFAAAL-------------PRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN 107 + QHA + P + F RY+ + + H DG + Sbjct: 74 SPYLPQHASQLGNQVLESNAKSGFQLLNPGFSNRLRFYRYKKGQYFAPHTDGCYFDNRDK 133 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGV 167 ++ L+ L+L+D + GGE +H V+ +G ++++ + H VT Sbjct: 134 YVDQSFLTILLYLNDVNN-AGGETNFIQNGIKHSVQPKSGSVLIFVHWNCHEGAEVTSSN 192 Query: 168 --RVAS 171 + Sbjct: 193 ALKYVM 198 >UniRef50_A6G260 Uncharacterized iron-regulated protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G260_9DELT Length = 212 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 72/186 (38%), Gaps = 21/186 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA---QGAQVKNNQQ-VDTRSTLYAALQN 58 + + G+ + + + E+ E + + + TG + A ++NN + + AAL Sbjct: 19 FTVDGLFTADECRAWIERGEALGFGEAPINTGRGEVRNANIRNNDRTLVDDPEAAAALFE 78 Query: 59 EVLNAVNQHALFFAAALPRT--LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + + ++ LP T F RY + + H DG ++ R+ LS Sbjct: 79 RLRPVLPPTTWMYSQDLPLTGLNERLRFYRYDPGQRFALHRDGHFTRPDRSE--RSRLSL 136 Query: 117 TLFLSDPQSYDGGELVVNDTFG-----------QHRVKLPAGDLVLYPSSSLHCVTPVTR 165 ++L+ + ++GGE + + G R G ++++P H VT Sbjct: 137 LVYLN--EDFEGGETLFFSSPGYGSHASGGWQETDRAVPKTGRVLVFPHPMFHEGAAVTA 194 Query: 166 GVRVAS 171 G + Sbjct: 195 GRKYVL 200 >UniRef50_Q2S7M4 Uncharacterized iron-regulated protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S7M4_HAHCH Length = 182 Score = 144 bits (363), Expect = 2e-33, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 70/174 (40%), Gaps = 15/174 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA---QGAQVKNNQQVDTRSTLYAA-LQ 57 ++ I G L+ + + E + + + T ++NN +V + A + Sbjct: 10 VFAIQGFLTAHECDAYISDSEAMGYDEAEIQTARGSQMYKDIRNNDRVIFDDAVMANNIF 69 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 N + + Q L F RY+ + + +H DG+ + + LS Sbjct: 70 NRIEAMLPQEL--DGWELVGLNERLRFYRYEPGQYFKWHRDGSYARSEKEA---SLLSFL 124 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 +FL+ + Y+GGE+ ++K G +V++P + +H T V GV+ Sbjct: 125 IFLN--EDYEGGEIAFR----WDKIKPERGSVVVFPHAMMHQGTTVESGVKYVL 172 >UniRef50_UPI00016C3A48 hypothetical protein GobsU_06128 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A48 Length = 202 Score = 144 bits (363), Expect = 2e-33, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 62/186 (33%), Gaps = 27/186 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA---QGAQVKNNQQVDTRST-----LY 53 ++ I SP + + E A + D +TT ++NN +V ++ Sbjct: 15 LFVIHDFFSPDECDYYITMTESAGYGDAPITTTGGPVMRKDIRNNDRVMIDDAGIARSVW 74 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 L+ V + V F RY + + +H DGA P R+ Sbjct: 75 ERLRPFVPDRVQFWQPV------GLNERWRFYRYDPGQQFDWHFDGAYERSPAE---RSA 125 Query: 114 LSATLFLSDPQSYDGGELVV--------NDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTR 165 + ++L+ GG RV+ AG ++++P H PV Sbjct: 126 FTLMIYLN--GGVSGGATEFNLRSHGGTRGDDPIVRVQPEAGKVLVFPHRLYHRGAPVAD 183 Query: 166 GVRVAS 171 G + Sbjct: 184 GRKYVM 189 >UniRef50_Q54PP0 Putative uncharacterized protein n=2 Tax=Dictyostelium discoideum RepID=Q54PP0_DICDI Length = 252 Score = 143 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 70/180 (38%), Gaps = 21/180 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG----AQVKNNQQ-----VDTRSTLY 53 I V + ++ + + E+ + V G ++NN + V+ +Y Sbjct: 22 ITIDDVFTEEECKEWIDLTEKTGYEPALVNIGYGQQQLMTDIRNNDRCIIDSVEMADKIY 81 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 ++ + + NQ + F RY + + H DG + + + Sbjct: 82 QRVKKFIPHTFNQKWEVVSL-----NERLRFLRYYVGQEFKKHQDGNYKRNNGET---SF 133 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVK--LPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++ L+L++ + +GG GQ+ ++ G ++L+ + H +PVT+GV+ Sbjct: 134 ITLQLYLNNVE--EGGSTKFFLKSGQNEIEIIPKPGKVLLFQHNIWHQGSPVTKGVKYVI 191 >UniRef50_C3ZC48 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZC48_BRAFL Length = 235 Score = 143 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 58/182 (31%), Gaps = 20/182 (10%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTG----AQGAQVKNNQQ-VDTRSTLYAALQ 57 + + V S ++ Q E + + G +N+++ + + + Sbjct: 41 FIVDNVFSKKECEELIRQTEDQGYEVAMLNVGGGRQILATDYRNSERCIMDSTERAEQIW 100 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + V + + F +Y + H+DG+ R R+ L+ Sbjct: 101 ERIKQYVPRRW--ARRKVLGLNERLRFLKYGPGNYFHPHMDGSYRRENGE---RSYLTLM 155 Query: 118 LFLSDPQSYDGGELVVNDT--------FGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRV 169 L+L++ GG + V G ++++ H VT GV+ Sbjct: 156 LYLNEGS--TGGATNFISPMFATGDKIKEKVPVIPKPGRVLVFQHDIYHEGEEVTAGVKY 213 Query: 170 AS 171 A Sbjct: 214 AM 215 >UniRef50_Q1DA84 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DA84_MYXXD Length = 206 Score = 143 bits (360), Expect = 6e-33, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 60/175 (34%), Gaps = 12/175 (6%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVT--TGAQGA-QVKNNQQVDTRS-TLYAAL 56 ++ + +LS ++ A E++E +T G ++NN +V L L Sbjct: 29 LVIVLRDLLSAEECAALIERIEAEGPTAAPITTSAGFVMRPDIRNNSRVMFDDVPLAQTL 88 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 V V L RY E + H DGA R+ L+ Sbjct: 89 FERVAPHVPHRLE-HEWTLCGANERLRCYRYDVGEYFAPHFDGAFVRTRDE---RSLLTF 144 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L++ GG H V G +L+ LH VT+G + A Sbjct: 145 MVYLNECP--GGGATNFLSLG--HSVTPRTGSALLFNHRLLHEGATVTQGRKYAL 195 >UniRef50_D2VZW5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VZW5_NAEGR Length = 292 Score = 141 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 92/261 (35%), Gaps = 51/261 (19%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQ-------GAQVKNNQQVDTR-STLY 53 ++H +LS ++ + E+ + T G Q ++V+NNQ++ + Sbjct: 13 IFHFRNLLSKEECEEIIQHGEKTSYKQVPTTGGRQLVWCGVEASEVRNNQRIIEECTEFT 72 Query: 54 AA----LQNEVLNAVNQHALFFAAALPRTLST---------------------PLFNRYQ 88 + V + + F +LP ++ +Y+ Sbjct: 73 RRYSATIFERVAKHLPKDLEFRFKSLPTEVNRADVCPTLEKAKEWKLFSVSDKFRMYKYE 132 Query: 89 NNETYGFHVDGAVRS---------HPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ 139 + + H DG + + ++ ++ ++L+D + GGE DT+ + Sbjct: 133 KKQHFLKHFDGTNKRILSLTEGKKPVKQFTEQSFMTFLVYLNDVE--KGGETQFFDTYSK 190 Query: 140 HR---VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI----QSMIRDDKKRAMLFELD 192 +K G V++ LH V GV+ I Q I +D ++ + + + Sbjct: 191 EETFDIKPEMGSGVVFLHELLHQGNDVLGGVKYLLRTDICYMKQKEIVEDGQKNINYTMK 250 Query: 193 NNIQSLKSRYGESEEILSLLN 213 + K++ ++ + L Sbjct: 251 VSADQKKNKNSQNNILELLPQ 271 >UniRef50_C0YUD2 Possible iron-regulated protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YUD2_9FLAO Length = 184 Score = 141 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 67/185 (36%), Gaps = 14/185 (7%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQ---GAQVKNNQQVD-TRSTLYAALQ 57 ++ I L+ + + ++ + + ++ + ++NN ++ +T+ L Sbjct: 10 IFLIEDFLTESECDHYISLSQEKVFEEAKINVFGRQQMNKGIRNNDRLMIFDTTIAEELF 69 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 N+V+ + Q + +Y + + H DG+ + ++ + Sbjct: 70 NKVVEFLPQEQ--DEYQVFSFNEMLRIYKYAPGQQFKMHRDGSYIRNENE---KSFYTFM 124 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 ++L+D ++GGE + F V G +++ H + G + + Sbjct: 125 IYLND--DFEGGETEFENLF---TVAPKKGTALIFYHPLRHEGKTLISGHKYVLRTDVMY 179 Query: 178 MIRDD 182 + + Sbjct: 180 LRKRP 184 >UniRef50_A1ZDU9 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDU9_9SPHI Length = 181 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 64/178 (35%), Gaps = 21/178 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA---QGAQVKNNQQVDTRS-----TLY 53 ++ I V +P++ + + E+ + VTT V+NN++V +L+ Sbjct: 10 VFTISNVFTPEECEHYIDFTEKVGYAPAPVTTPWGPEMMPDVRNNERVMFDDNNLAASLW 69 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 LQ + + F +Y + + H DG R + Q + Sbjct: 70 QKLQPLLPTRLQGKKAV------GLNERFRFYKYHPGQEFKEHKDGHFRRNAQEV---SV 120 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L+ ++L+ + + GG+ G +++ +H PV GV+ Sbjct: 121 LTLLIYLN--EDFTGGDTFFRTMDIN--FVPKQGAALIFEHRVVHAGLPVIEGVKYVL 174 >UniRef50_A4EST3 Uncharacterized iron-regulated protein n=1 Tax=Roseobacter sp. SK209-2-6 RepID=A4EST3_9RHOB Length = 196 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 66/180 (36%), Gaps = 16/180 (8%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVT---TGAQGAQVKNNQQVDTR-STLYAAL 56 ++ IP LS A +Q E + +T ++++NN +V TL A L Sbjct: 10 LVSEIPNFLSTDLCAEQIKQAEALGFASAPITSETGTQVVSEIRNNTRVIRDLPTLSAQL 69 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + N V ++ F RYQ + + +H DG+ R+ + + Sbjct: 70 WQDARNLVPRN--FKGRDAAGLNDRFRLYRYQPGQFFDWHQDGSYRAADG---QESQFTL 124 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVK-----LPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+ Q ++GG D F H G +L+ H PV G + Sbjct: 125 LIYLN--QGFEGGGTRFADVFSSHVFSDFTIAPEPGKALLFHHPISHRGDPVLSGTKYVL 182 >UniRef50_A5EM79 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EM79_BRASB Length = 193 Score = 140 bits (354), Expect = 3e-32, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 59/180 (32%), Gaps = 17/180 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVT---TGAQGAQVKNNQQVDTRST-LYAALQ 57 + I LS + + E + D ++ V+NN++V AL Sbjct: 11 IETIANFLSAAECDDYVSWGEAIGFKDAPISTSMGMIMAKDVRNNERVMVDDRDRTQALY 70 Query: 58 NEVLNAVNQHALFFAAALP-RTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + + F P RY + + +H DG R+ + Sbjct: 71 QRLAGHL--APSFQHRWQPVGLNERLRLYRYDVGQKFDWHRDGHFARDNGE---RSQFTF 125 Query: 117 TLFLSDPQSYDGGELVVND-----TFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+D ++GG D G RV G +L+ +H VTRG + Sbjct: 126 LIYLND--DFEGGATSFCDDTGLMPDGPLRVTPEKGMALLFHHPIMHRGDRVTRGRKYVL 183 >UniRef50_A9DQY6 Uncharacterized iron-regulated protein n=1 Tax=Kordia algicida OT-1 RepID=A9DQY6_9FLAO Length = 183 Score = 140 bits (354), Expect = 3e-32, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 66/174 (37%), Gaps = 14/174 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG---AQVKNNQQVDTRSTLYAA-LQ 57 +Y + LS Q+ + E+ + + +V + V+NN +V + YAA L Sbjct: 8 IYVVDNFLSHQECDELIAKSEKMGYEEAKVNMHGKQVLMTTVRNNLRVTYKDEAYAAILW 67 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 N++ V + + A F +Y+ + H DG+ R + + S Sbjct: 68 NKIKMHVPEQIGYSYAF--GLNEMLRFYKYEKGHRFKMHRDGSYRRNETEA---SQYSFL 122 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+D +DGGE V H G +L+ H + G + Sbjct: 123 IYLND--DFDGGETVFRSGTTIH---PKKGSALLFLHGLRHEGAVLKSGTKYVL 171 >UniRef50_D2VV82 Prolyl 4-hydroxylase alpha subunit family protein n=4 Tax=Naegleria gruberi RepID=D2VV82_NAEGR Length = 659 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 78/237 (32%), Gaps = 35/237 (14%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN-NQQVDTRSTLYAALQNEVLNA 63 IP + ++ ++ D + + +N ++++ T + + Sbjct: 63 IPNFFTSEECNDMIKRCN-----DFKNLEDEYPKEYRNASRELITDKEFAKLIFERLKET 117 Query: 64 VNQHA---LFFAAALPRT--------LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 ++ A L L ++Y+ E + H DG R R+ Sbjct: 118 IDLDAIGKLITPYGLDSRGEWKACGINEMMRLSKYEPGEYFKIHTDGQFRRSEHE---RS 174 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQ-----------HRVKLPAGDLVLYPSSSLHCVT 161 + ++L+ Q + GGE + + H +K G L L+ H Sbjct: 175 IYTLLIYLN--QDFKGGETRFYNDPTKTDSDFEEYSLLHTLKPSLGQLALFNQDFYHEGC 232 Query: 162 PVTRGVRVASFMWIQSMIRDDKK--RAMLFELDNNIQSLKSRYGESEEILSLLNLYH 216 PVT+G + I + D R FE Q + + ESE + ++Y Sbjct: 233 PVTKGTKYILRTEIMYLRVDSLSIPRDEKFEQSEAYQKIGQLFKESERLEKNGDVYA 289 >UniRef50_D2VXK7 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VXK7_NAEGR Length = 506 Score = 140 bits (352), Expect = 4e-32, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 77/235 (32%), Gaps = 29/235 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-TLYAALQNEV 60 ++ + +L ++ ++ E + +T+ +N++++ L A L + Sbjct: 92 LFLVDHLLHQEECKEILKKEESLGFES--ITSEYPVE-YRNSKRILYNDKELAAKLWKRL 148 Query: 61 LNAV-------NQHALFFAAALPRTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 + +P +++ ++Y+ + H DG + R+ Sbjct: 149 KKYMIDCNFMKPYGLDSEGYWIPISVNECMRLSKYEPGNYFKPHTDGQFVRNDDE---RS 205 Query: 113 DLSATLFLSDPQSYDGGELVV-----NDTFGQHRVK------LPAGDLVLYPSSSLHCVT 161 + ++L+D + GGE + + K G ++ H Sbjct: 206 IYTLIIYLNDG--FVGGETKFMRRVDPLAENEMKFKNLCEISPKMGSASVFNHDLYHQGC 263 Query: 162 PVTRGVRVASFMWIQ-SMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLY 215 VT GV+ I I ++ + D + ES+++ + Y Sbjct: 264 LVTEGVKYILRTEIMFKRIDSAEQLVTKQDNDEIYNKVMDLLHESDQLERKGDTY 318 >UniRef50_Q111M8 2OG-Fe(II) oxygenase n=2 Tax=Trichodesmium erythraeum IMS101 RepID=Q111M8_TRIEI Length = 210 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 40/180 (22%), Positives = 70/180 (38%), Gaps = 20/180 (11%) Query: 7 GVLSPQDVARFREQLE----QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 G + Q+ R + VDG++ + V ++ L+ L N V Sbjct: 34 GAFTAQECERVISLSKEMKLSKGIVDGKIKPEIRQVNVWGLSYSESTRWLWEKLINSVKY 93 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQN-------NETYGFHVDGAVRSHPQNGWMRTDLS 115 A N+ + + ++ Y+ + Y H+D + +S Sbjct: 94 ANNKWWNYDIYGIMDSM---QLLCYEASKNQESIQDHYNKHIDVG------EAYYYRKIS 144 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 ++ LSDPQ Y+G EL + + G ++L+PS LH VTP+ +G R A W+ Sbjct: 145 ISIQLSDPQDYEGSELKLYTRREAENLPKARGTMILFPSFVLHEVTPIIKGKRWALVCWV 204 >UniRef50_Q08MC8 Oxidoreductase, 2OG-Fe(II) oxygenase family family (Fragment) n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08MC8_STIAU Length = 484 Score = 136 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 69/178 (38%), Gaps = 15/178 (8%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN-QQVDTRSTLYAALQNEVL 61 + GV S + R E+ E A + + T G ++N +QV L A+ + Sbjct: 1 LLLRGVFSRSECLRLIEEAEGAGF---QATGGDYPPSYRDNDRQVHDDGALAEAVFTRLR 57 Query: 62 NAVNQH---ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + + A A L F RY+ + + H DGA P +R+ L+ L Sbjct: 58 PFLPERLVDAEGEAWRLRGLNPRFRFCRYRGGQRFCIHRDGAYAPSP---SVRSHLTCML 114 Query: 119 FLSDPQSYDGGELVVNDTFGQHR-----VKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 +L+D + + GG + V+ AG L+++ + H V+ G + Sbjct: 115 YLNDAEDFSGGATRYYAERSEGSELLGAVRPQAGTLIVFDHALWHDGEAVSAGTKYVL 172 >UniRef50_C7BVA8 2OG-Fe(II) oxygenase superfamily like protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVA8_9CAUD Length = 206 Score = 135 bits (341), Expect = 9e-31, Method: Composition-based stats. Identities = 45/193 (23%), Positives = 68/193 (35%), Gaps = 26/193 (13%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVT-------TGAQGAQVKNNQQVD-------TRS 50 V S + R + E E G + + KN + Sbjct: 10 FQNVFSSEMCDRIIKMGEAQEQDLGEINRLSKKSISEYTEEDKKNLLETRNSHIAWLDEP 69 Query: 51 TLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN--- 107 +Y L+ +L A F T F +Y + Y +H D +P + Sbjct: 70 WIYNRLKKYILAANKNAGWNFNVDHT---ETLQFTKYDVGQFYDWHPDQHHYLYPDDDTN 126 Query: 108 ---GWMRTDLSATLFLSDPQSYDGG--ELVVNDTFGQHRVKL-PAGDLVLYPSSSLHCVT 161 LS TL L+DP ++GG E N + KL G L+++PS H VT Sbjct: 127 ENMRGKYRKLSTTLLLNDPSEFEGGDLEFHFNMKETEKATKLNSKGSLIVFPSFVYHRVT 186 Query: 162 PVTRGVRVASFMW 174 P+T+G R + W Sbjct: 187 PITKGTRYSLVSW 199 >UniRef50_D0SK49 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SK49_ACIJU Length = 232 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 72/184 (39%), Gaps = 20/184 (10%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRV---TTGAQGAQVKNNQQVDTRS-TLYAAL 56 +++ + V S Q+ F E Q + + + V+NN++V L L Sbjct: 39 LIFTVDDVFSDQECLSFIELSNQYHYETADIFLNSARQVLTNVRNNKRVIYDDIQLAETL 98 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 +++ + + + + L F RY+N ET+ H DG + W + L+ Sbjct: 99 FSKLKHLLPKQLNGWI--LSGLNERFRFYRYENGETFKPHWDGIHEVND---WHSSKLTL 153 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHR---------VKLPAGDLVLYPSSSLHCVTPVTRGV 167 ++LS + + GGE + G + V+ G ++++ LH PV G Sbjct: 154 LIYLS--EDFTGGETIFYRDSGMLKPCKETQIASVQPKLGQILVFEHQQLHEGAPVLSGQ 211 Query: 168 RVAS 171 + Sbjct: 212 KYVL 215 >UniRef50_D2VRB4 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VRB4_NAEGR Length = 265 Score = 133 bits (336), Expect = 3e-30, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 63/181 (34%), Gaps = 15/181 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVD-TRSTLYAALQNEV 60 + + VL ++ E E+ + D + + N ++ L + N V Sbjct: 75 VLILENVLLKEECKLLIELSEKLGYEDAD--SYCYAYNDRFNDRLMVDDDALTQVIWNRV 132 Query: 61 LNAVNQHALFFAAALPRT--LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + + Q + + +Y+ +G H DG ++ ++ L+ + Sbjct: 133 KDHLPQELNHHGMDMTLHSLNNRWRLCKYKPGHYFGTHTDGTY--SNRSNRTKSALTFMI 190 Query: 119 FLS--DPQSYDGGELVV---NDTFGQHRVKLPAGDLVLYPS---SSLHCVTPVTRGVRVA 170 +L+ + GG + RV +G +++P LHC VT GV+ Sbjct: 191 YLNSQLEGDFKGGSTIFFEQYHRKETARVIERSGTCIVFPQEDMDMLHCGEKVTDGVKYI 250 Query: 171 S 171 Sbjct: 251 L 251 >UniRef50_Q4KE77 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KE77_PSEF5 Length = 504 Score = 132 bits (332), Expect = 8e-30, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 23/184 (12%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEVL 61 + LS + E EQ + + ++N ++ + L + Sbjct: 30 VLVHEFLSASECEALIEATEQCGFASAGS---DYPSSYRDNDRIVADDPAMAGRLFERLK 86 Query: 62 NAVNQHALFFAA-----ALPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 + ++ P + F RY+ + H DG ++ L+ Sbjct: 87 HCASRMPKLGTVIDEDGWRPVGINERLRFCRYRPGTQFRAHQDGVHH----RQRQQSRLT 142 Query: 116 ATLFLSDPQSYDGGELVVNDT--------FGQHRVKLPAGDLVLYPSSSLHCVTPVTRGV 167 ++L+D ++ GGE + + R++ G L+++ + H V G Sbjct: 143 FMIYLND-DAFSGGETLFFEGRSAAMSNRDSTLRLRPRKGSLIVFDHTLWHAGALVDAGQ 201 Query: 168 RVAS 171 + Sbjct: 202 KYVM 205 >UniRef50_B6HE60 Pc20g15710 protein n=14 Tax=Leotiomyceta RepID=B6HE60_PENCW Length = 484 Score = 131 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 67/203 (33%), Gaps = 29/203 (14%) Query: 7 GVLSPQDVARFREQLEQAEWV-DGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVN 65 VLSP + E ++ D + + + +N +T + L + V Sbjct: 286 NVLSPAECKAIIAAGESVNFLPDAPLREDGDISILAHNFYWIIDTTFHDMLWARISPYVP 345 Query: 66 QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS-------------HPQNGWMRT 112 + RY Y H+DGA P++ + Sbjct: 346 --PSINGRKVRGINRRFRVYRYVPGAEYRCHIDGAWPPSGILPDDTYVYDSSPEDKKQSS 403 Query: 113 DLSATLFLSDPQSYDGGELVV------NDTFGQHRVKLPAGDLVLYPSS-----SLHCVT 161 + L+L+D ++GGE T + V+ G + ++P LH T Sbjct: 404 MYTFLLYLND--EFEGGETTFFMPAPREGTLNGYPVRPVMGAVAIFPHGESNGALLHEGT 461 Query: 162 PVTRGVRVASFMWIQSMIRDDKK 184 VT+G + ++ ++ ++ Sbjct: 462 GVTKGAKYIIRTDVEYDVKPSEE 484 >UniRef50_A8TIV2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TIV2_9PROT Length = 338 Score = 131 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 64/186 (34%), Gaps = 18/186 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD--------GRVTTGAQGA-QVKNNQQVDTRSTL 52 + IP VLSP+ + G + +V+ + V S L Sbjct: 160 VMMIPDVLSPEWCRWLIHVHDSQGNEPSGFLQQVKGESVLLSDAEVKVRRDHVVPEGSPL 219 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQ--NNETYGFHVDGAVRSHPQNGWM 110 A +++ + + + RY+ + H D Sbjct: 220 EAEIRHIFQRRLIPEIARATHSPIQRHELFKIVRYEAEEGGHFRPHRDNTS-----TAGR 274 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + TL L+ +YDGG LV + +G + AG+ V++ S LH PVTRG R Sbjct: 275 TRRFAVTLNLN-TGAYDGGHLVFPE-YGDIGYRPAAGEAVVFSCSLLHEARPVTRGTRYV 332 Query: 171 SFMWIQ 176 ++ Sbjct: 333 LLAFLH 338 >UniRef50_A8TW57 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TW57_9PROT Length = 383 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 71/203 (34%), Gaps = 22/203 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQ-----AEW---VDGRVTTGAQGAQVKNNQQVD-TRSTL 52 + IP V+SP + + E + + VDG +T G ++K + +L Sbjct: 170 VLMIPDVVSPAFCRQLIDYYEARGGGASGFMRDVDG-LTRGLLDPKMKRRKDCSIEDESL 228 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQN--NETYGFHVDGAVRSHPQNGWM 110 L+ + V + + Y + H D Sbjct: 229 LKQLRRALETRVIPEIGKAFGYRVSRVERYIIGCYDAADQGFFKAHRDNTS-----KATA 283 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + +L L+ Y+GG L + +GQH K G V++ S H TPVTRG R Sbjct: 284 HRKFAMSLNLN-TDEYEGGALRFPE-YGQHTYKPGVGCAVVFSCSLFHEATPVTRGRRYV 341 Query: 171 SFMWI---QSMIRDDKKRAMLFE 190 ++ Q + + R L E Sbjct: 342 VLPFLYDEQGAAQRAETRRFLAE 364 >UniRef50_D2W6C1 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2W6C1_NAEGR Length = 251 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 69/191 (36%), Gaps = 15/191 (7%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEVL 61 + + VLS + E E+ + D + N ++ + + N + Sbjct: 62 FVLDQVLSKDECKLMIELSEKMGYEDADKFC--YAYNDRFNDRLMSDDPKFTEIVWNRIK 119 Query: 62 NAVNQHALFFAAALPRTLSTPL--FNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + Q L P +Y+ + HVDG+ H ++ L+ ++ Sbjct: 120 QHLPQTLSKDGRTLHLASINPRWRLCKYKPGHYFNKHVDGSFEDHKNK--TKSYLTLIIY 177 Query: 120 LS--DPQSYDGGELVVNDTFGQ---HRVKLPAGDLVLYPS---SSLHCVTPVTRGVRVAS 171 L+ ++GG + D+ + +V PAG+ +++ LH V +GV+ Sbjct: 178 LNSQLDGEFEGGSTIFYDSRMELMTRKVTEPAGNALIFLQNDKHMLHGGEKVFKGVKYIM 237 Query: 172 FMWIQSMIRDD 182 I R++ Sbjct: 238 RSDIMYSKREE 248 >UniRef50_B8GU65 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GU65_THISH Length = 325 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 55/183 (30%), Gaps = 17/183 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEW-------VDGRVTTGAQGAQVKNNQQVDTRSTLYA 54 + +P V+SP E A + VDG+ + L Sbjct: 146 VLMLPDVVSPDLCEALIRCHESAHFDSGMVRMVDGKPALVPDYGAKRRLDHRLVDEALTD 205 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQN--NETYGFHVDGAVRSHPQNGWMRT 112 L + V Y++ + H D Sbjct: 206 RLTEVLSRRVLPGIATAFNYRVTRFEPFKVVCYESSTGGYFRRHRDNVTPD-----ARHR 260 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASF 172 + ++ L+D Y GG LV + FG+ + P G +++ LH T VT G R Sbjct: 261 RFALSINLNDG--YQGGNLVFPE-FGRQGYRPPRGGAIVFSGGLLHEATDVTGGRRYVLL 317 Query: 173 MWI 175 ++ Sbjct: 318 SFL 320 >UniRef50_D2V3Y8 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2V3Y8_NAEGR Length = 202 Score = 128 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 65/200 (32%), Gaps = 37/200 (18%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTG----AQGAQVKNNQQVDTRST-LYAALQNE 59 I + S ++ + E E + D VT G +NN++V L + + Sbjct: 1 IDNLFSEEECKSYIELAESQGFNDAPVTVGANTFKMMTDYRNNKRVILDDPNLAQQIYTK 60 Query: 60 VLNAVN----------QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 V + V + F RY ++E + H DG + Sbjct: 61 VKDFVPEFASELNVNSRKINTEFFQKCGVNERFRFYRYTSDEYFKAHFDGNFARNNVECT 120 Query: 110 MR----------TDLSATLFLSDPQSYDGGELVVNDTFGQ----HRVKLPAGDLVLYPSS 155 + + ++ ++L+ + GGE + G+ H V G ++L+ Sbjct: 121 LENGKTYLCEESSFITMLIYLNTLE--KGGETNFVNPGGEERILHSVNPKTGRVLLFVHR 178 Query: 156 SLHCVTPVTRGV----RVAS 171 LH PV G + Sbjct: 179 LLHEAEPV--GEPWQFKYVL 196 >UniRef50_D2VJ99 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VJ99_NAEGR Length = 568 Score = 128 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 67/183 (36%), Gaps = 22/183 (12%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV-DTRSTLYAALQNEVL 61 Y I L+P++ + F E+ + + + + +NN+++ L + ++ Sbjct: 29 YIIKNFLTPKECSEFIEKAVKIGF---DLASHDYPPSYRNNERIIMDDEELAEKMTKKLK 85 Query: 62 NAVNQHALFFAAALPRTLS------TPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 ++ L RYQ + + H DG H ++ ++++ L+ Sbjct: 86 PMLDSLNLVEFEKDGLECELKSVNSRFRLCRYQEGQEFRIHQDG---VHYKSKFVKSILT 142 Query: 116 ATLFLSDPQSYDGGELVVN-------DTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVR 168 ++L+ + +D G + + K GDL+++ H V G++ Sbjct: 143 FMIYLN--EEFDNGHTIFFKSGPSSNPPEEMGKYKPQCGDLIVFDHELWHSGEIVNNGIK 200 Query: 169 VAS 171 Sbjct: 201 YVM 203 >UniRef50_B4RHL4 Putative uncharacterized protein n=2 Tax=Phenylobacterium zucineum HLK1 RepID=B4RHL4_PHEZH Length = 365 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 73/219 (33%), Gaps = 18/219 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD-------GRVTTGAQGAQVKNNQQVDTRSTLYA 54 + +P + P E E+ G T G K L Sbjct: 154 VLIVPRIFEPTLCRAMIEHYERRGGSPSGVMRDVGGRTVGVLDDFKKRRDAPVDDERLLQ 213 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQ--NNETYGFHVDGAVRSHPQNGWMRT 112 A++ + + + + + Y + H D G Sbjct: 214 AMRTAIAHRLLPEVQRAFQFAATRVERYIVACYDAAEGGYFRPHRDNTTA-----GTAHR 268 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASF 172 + ++ L+ + Y+GG+L + FG + P G V++ S LH TPVTRG R AS Sbjct: 269 KFAVSINLN-AEDYEGGDLRFPE-FGSRTYRAPTGGAVVFSCSLLHEATPVTRGRRYASL 326 Query: 173 MWI--QSMIRDDKKRAMLFELDNNIQSLKSRYGESEEIL 209 ++ ++ R ++ L + D + + E + Sbjct: 327 PFLYDEAGARVREQNRHLLQSDPPPPPVAAPVPAGEAVT 365 >UniRef50_B8C289 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C289_THAPS Length = 207 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 59/191 (30%), Gaps = 26/191 (13%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVD----GRVTTGAQGAQVKNNQQ-VDTRSTLYAALQN 58 I + + ++ + E + + G ++N ++ + + L Sbjct: 4 VIHNLFTHEECTSLINRAEAKGFEEALVHGPFGQEVLRKDIRNCKRCILDDTELTNEWFT 63 Query: 59 EVLNAVNQHALFF----------------AAALPRTLSTPLFNRYQNNETYGFHVDGAVR 102 V+NA+ L + RY + +G H D Sbjct: 64 RVMNALEGSELKDKIADAHWVESNDIGKSTFRVVGLNERVRILRYDPGQYFGVHKDNRFI 123 Query: 103 SHPQNGWMR---TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHC 159 + G + L+ L+L+D GGE + + H V G ++++ H Sbjct: 124 RGSEFGSREGEESHLTFLLYLNDK--MKGGETRIENGGRYHEVVPKVGSVLIFDHDISHE 181 Query: 160 VTPVTRGVRVA 170 V GV+ Sbjct: 182 AMRVVSGVKYC 192 >UniRef50_Q2SAS7 FOG: WD40 repeat n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SAS7_HAHCH Length = 505 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 63/194 (32%), Gaps = 19/194 (9%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN-QQVDTRSTLYAALQN--- 58 + + GV S + + +NN +QV L L Sbjct: 31 FVLRGVFSEIFCEQLLASAITRGFSPADA---KYPPSYRNNARQVVDDPMLARRLFEVCG 87 Query: 59 -EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + ++ A A +L RY +++ H DG ++ + L+ Sbjct: 88 QLLPQSLPDAANSPAWSLHSLNPRLRLCRYSAGQSFFPHQDGVYACPDRSE---SKLTFL 144 Query: 118 LFLSDPQSYDGGELVVNDT----FGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFM 173 L+L+D + GG+ + R GDL+++ S H V G + Sbjct: 145 LYLNDATEFSGGDTLFFKDASAAEISARFTPRRGDLIVFDHSLWHSGDTVLSGEKYIL-- 202 Query: 174 WIQSMIRDDKKRAM 187 +S + R++ Sbjct: 203 --RSDLIYRPSRSI 214 >UniRef50_Q2TWV5 Predicted protein n=2 Tax=Aspergillus RepID=Q2TWV5_ASPOR Length = 481 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 66/203 (32%), Gaps = 29/203 (14%) Query: 7 GVLSPQDVARFREQLEQAEWV-DGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVN 65 VLSP + E ++ D + + + +N +T + L + + V Sbjct: 283 NVLSPAECKAIIAAGESVNFLPDAPLREDGDMSILAHNFYWVVDTTFHDMLWARISSYVP 342 Query: 66 QHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS-------------HPQNGWMRT 112 Q RY Y H+DGA P++ + Sbjct: 343 QS--INGRLARGINRRFRVYRYVPGAEYRCHIDGAWPPSGILPDDTYVYDASPEDKRQSS 400 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQH------RVKLPAGDLVLYPSS-----SLHCVT 161 + L+L+D ++GGE + V+ G + ++P LH T Sbjct: 401 MYTFLLYLND--EFEGGETTFFMPAAREGTLNAYPVRPVMGAVAIFPHGEANGALLHEGT 458 Query: 162 PVTRGVRVASFMWIQSMIRDDKK 184 V +G + ++ ++ ++ Sbjct: 459 GVRKGAKYIIRTDVEYDVKPCEE 481 >UniRef50_A8IV51 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IV51_CHLRE Length = 273 Score = 126 bits (318), Expect = 4e-28, Method: Composition-based stats. Identities = 47/232 (20%), Positives = 83/232 (35%), Gaps = 33/232 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 +Y G L+P++ R + E+ G V TG+ G+ V + + D + ++ ++ Sbjct: 45 IYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDG--MFFERGEDAII 102 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 AV Q + + RY+ ++ Y H D + + L+L+ Sbjct: 103 EAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRWATVLLYLT 162 Query: 122 DPQSYDGGELVVNDTFGQH--------------RVKLPAGDLVLYPS---------SSLH 158 + + +GGE V + VK GD +L+ S S+H Sbjct: 163 ETE--EGGETVFPKIPAPNGINVGFSECAKYNLAVKPHKGDALLFHSMKPTGELEERSMH 220 Query: 159 CVTPVTRGVRVASFMWIQ------SMIRDDKKRAMLFELDNNIQSLKSRYGE 204 PV RG + + WI + DDK R L + + E Sbjct: 221 GACPVIRGEKFSMTKWIHAGHYVMNDAYDDKAREYKARLGSATDRTGTGSHE 272 >UniRef50_Q4QF16 Putative uncharacterized protein n=7 Tax=Trypanosomatidae RepID=Q4QF16_LEIMA Length = 399 Score = 125 bits (315), Expect = 9e-28, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 63/217 (29%), Gaps = 53/217 (24%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWV----------DGRVTTGAQGAQVKNNQQVDTRST- 51 + L+ ++ + E+ + DG T + V+ ++ Sbjct: 127 IVLENFLTHEECDQLVAACEKVGYTFWLQKNHHDADGEATCDSGSKAVRVVDTIEANFPH 186 Query: 52 LYAALQNEVLNAVNQHALFFAAALPRTLSTP-----------------LFNRYQNNETYG 94 L A L + V+ F+ +P LF RY + Sbjct: 187 LSAKLYERIARVVSLKPKCFSEDMPNAEELFERELAGTWVPHALSENLLFGRYHPGGHFM 246 Query: 95 FHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVN-----------DTFGQHR-- 141 HVDGA R+ + ++L+D GGE + + Q+R Sbjct: 247 PHVDGATILDLNT---RSFYTLLIYLNDCLH--GGETFIFAGEQCNVMYLDEKENQYRGN 301 Query: 142 -------VKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 V G + + LH PV G + Sbjct: 302 ATQRVGAVYPKKGSAAFFYYNLLHEGAPVLEGHKYIC 338 >UniRef50_A6D9X4 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6D9X4_9PROT Length = 220 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 62/187 (33%), Gaps = 17/187 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGR-VTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 I LS + L++ E + ++ ++ + L + Sbjct: 33 FLIINNFLSKNECHEIINSLDKNEKYKAKIISNNNLNESIRKTILHNPTDKLRELFHKRI 92 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-------WMRTD 113 N FF +L + + Y+ Y H D A N Sbjct: 93 NKYKNDIEKFFGVSLLKG-TDIQILEYKEGGHYNCHADNASVIMKNNHIVGYKVVRSERK 151 Query: 114 LSATLFLSDPQSYDGGELVVNDT--FGQHRV--KLPAGDLVLYPSSSL--HCVTPVTRGV 167 L+ LFL+ + + GGE+ + RV K G ++++PS L H V + +G Sbjct: 152 LTTLLFLN--EDFLGGEIEFCHLRYYNNKRVILKPKIGMMIVFPSHGLFAHKVFEIKKGK 209 Query: 168 RVASFMW 174 R A W Sbjct: 210 RFAIVKW 216 >UniRef50_B0CEH1 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CEH1_ACAM1 Length = 202 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 41/194 (21%), Positives = 69/194 (35%), Gaps = 24/194 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQ---GAQVKNNQQVDTRSTLYAALQN 58 ++ PG L PQ + + + R+T + + Q+ T ++ Sbjct: 12 VFVDPGFLDPQFCESYLTEAQTCPCEPARLTRYGEAVTDDSRRKTGQLQISPTTIKGIRE 71 Query: 59 E---VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR-TDL 114 + + H AL P RYQ + +G H D S P + + + + Sbjct: 72 RLIAIKPRLETHFEVQLHAL----EPPSCYRYQVGDFFGLHRDVIDPSLPGSKFEKNRLV 127 Query: 115 SATLFLS------DPQSYDGGELVVNDTFGQHR-------VKLPAGDLVLYPSSSLHCVT 161 S +FL+ PQ++ GG L + R ++ G L+ + S H V Sbjct: 128 SLIIFLNGMSAEPSPQTFGGGALALYGLLNDARGQNYGFPLEPEQGQLIAFRSDLWHEVK 187 Query: 162 PVTRGVRVASFMWI 175 PVT G R W Sbjct: 188 PVTHGERFTIVSWF 201 >UniRef50_A5VDF3 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=2 Tax=Sphingomonadales RepID=A5VDF3_SPHWW Length = 363 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 87/218 (39%), Gaps = 30/218 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQ-----AEW---VDGRVTTGAQGAQVKNNQQVDT----- 48 + P + P R + E+ + + VDGR T G + VK Sbjct: 157 VLVAPNIFDPAFCRRLIDLYERHGGSPSGFMREVDGR-TVGVMDSSVKRRSDYYLDDDDV 215 Query: 49 -RSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNN--ETYGFHVDGAVRSHP 105 R + A L ++ + + F A+ + + Y + + H D Sbjct: 216 LREQVRARLSRFLVPQIERVFQFRAS----RIERYMIACYDSGDSGFFQAHRDNTT---- 267 Query: 106 QNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTR 165 G + T+ L+ Y+GG+L+ + FGQ R + P G V++ S LH TPVTR Sbjct: 268 -GGTAHRRFACTINLN-AGDYEGGDLIFPE-FGQRRYRAPTGGAVVFSCSLLHEATPVTR 324 Query: 166 GVRVASFMWI--QSMIRDDKKRAMLFELDNNIQSLKSR 201 G R A ++ ++ R ++ A ++ ++ + ++ Sbjct: 325 GKRYAYLPFLYDEAAARQREENARSGKVGADLATYRAE 362 >UniRef50_B0C2I7 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C2I7_ACAM1 Length = 184 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 65/175 (37%), Gaps = 12/175 (6%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG---AQVKNNQQVDTRS-TLYAAL 56 ++ IP +L+ ++ ++ Q V + V+NN++V +L L Sbjct: 10 LIIEIPNILTFKECDELMGKINQLNPSLATVRNDGEAEINTNVRNNERVVFSDFSLAEKL 69 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + V L RY+ + H DG++ + ++ S Sbjct: 70 FLKAQEYVP--PTMQGRILLSANERFRCYRYKVGMKFSPHYDGSLERNGNE---KSYYSF 124 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+D ++GG+ T + G +L+ LH V+RGV+ + Sbjct: 125 LVYLND--DFEGGQTNF-LTESICSITPRKGFGLLFQHLILHEGVEVSRGVKYVA 176 >UniRef50_D2VW34 Oxidoreductase n=3 Tax=Naegleria gruberi RepID=D2VW34_NAEGR Length = 222 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 73/195 (37%), Gaps = 28/195 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG----AQVKNNQQVDTRSTLY-AAL 56 ++ I + S ++ ++ ++ E+ + + ++TG V+NN + Y L Sbjct: 21 IWLIKNLFSTEECSKLLKESEEIGYGEAPISTGPTSSTMMKDVRNNSRAMIDKKQYSDML 80 Query: 57 QNEVLNAVNQH------ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM 110 ++ + Q+ L F +Y E + H DG + N + Sbjct: 81 YKKLEKYLPQNVSSLKVGPQDGFKLCGLNERIRFYKYAAGEYFAPHYDGCFQRPTLNVEI 140 Query: 111 ---------RTDLSATLFLSDPQSYDGGELVVNDT--FGQHRVKLPAGDLVLYPSSSLHC 159 R+ ++ L+L+D + GGE ++ H VK AG ++++ S+ H Sbjct: 141 NGKKMKVVERSFITVLLYLNDVE--SGGETNFLNSRCEITHSVKPQAGQVLMFVHSNYHE 198 Query: 160 VTPVTRG---VRVAS 171 + V + Sbjct: 199 GS-VLSDPNEFKYVM 212 >UniRef50_B0E4Q8 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0E4Q8_LACBS Length = 479 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 69/215 (32%), Gaps = 31/215 (14%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVK---NNQQVDTRSTLYAALQNE 59 + I V + + E+ + G+ +N + L + Sbjct: 269 FIINDVFESTECESLVKAAEKVGLLPDEPIAGSAAQLASVLAHNLIWLADTEFIGTLYDR 328 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH-------------PQ 106 +++ + Q + A+ + RY+ Y H+DGA + Sbjct: 329 IVDLLPQ--IVHGGAVKGINARFRLYRYRPGALYRPHIDGAWPASALNATTSPHSYVYDS 386 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQ------HRVKLPAGDLVLYPS-----S 155 + + + L+ ++L+D ++GG VK G + ++P S Sbjct: 387 DPTVYSRLTLLIYLND--DFEGGCTTFFLPSSTQGILEARPVKPRTGTVCVFPHGAAKGS 444 Query: 156 SLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFE 190 LH + VT G + + + ++ L + Sbjct: 445 LLHEGSGVTSGAKYVIRTEVLYEVDKSERVDALKD 479 >UniRef50_B8CBV6 Predicted protein (Fragment) n=2 Tax=Thalassiosira pseudonana RepID=B8CBV6_THAPS Length = 196 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 60/186 (32%), Gaps = 21/186 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEVL 61 + VLS ++ A E+ E + + +N + T L +L Sbjct: 3 VLLHNVLSLEECADIIEKSEADGYEQATIYDARTKRVQRNCTRCVTDDQVLAENWFERIL 62 Query: 62 NAVNQHA-----------LFFAAALPRT----LSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 +A+N A P +YQ N+ + H D + Sbjct: 63 HALNGTPYEQKVKNAPWMGTRHDAKPLHATSLNERLRILKYQQNQFFSSHHDASFIRDAD 122 Query: 107 NGWM---RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPV 163 G ++ +S ++L+D + GG + V G ++L+ + LH V Sbjct: 123 EGGRTGEKSYVSVQIYLNDK--FKGGTTRFHGGGRFLDVIPKTGSILLFDHNILHEGVAV 180 Query: 164 TRGVRV 169 G + Sbjct: 181 KSGKKY 186 >UniRef50_A8P9E2 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P9E2_COPC7 Length = 1733 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 70/207 (33%), Gaps = 31/207 (14%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT------RSTLYAALQNEVLN 62 LS + R Q QA + G T V+N +++ L++ VL Sbjct: 89 LSEVEAKRLISQAAQAPFGKGDQTV--VDTSVRNTWEIEPELVSFENPEWTGWLESTVLK 146 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 V Y+ + H D + AT+ + Sbjct: 147 TVWNSLGVAPYTSKPRCELYKLLVYERGSHFKPHQDTQKAGG---------MFATVIVVL 197 Query: 123 PQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVASFMWIQS 177 P +++GG++ V+ + H + + + + + + +H V P+T G R+A S Sbjct: 198 PSAFEGGQIRVSHSGSSHTIDIASTSATETSVLAWYTDVIHEVLPITSGYRLAL-----S 252 Query: 178 MIRDDKKRAM----LFELDNNIQSLKS 200 R M L ++ + I L+ Sbjct: 253 YNLIHTSREMPRPSLPDMGDAISQLRR 279 Score = 70.4 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 40/131 (30%), Gaps = 21/131 (16%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT------RSTLYAALQNEVLN 62 L+ + R QA + GR T V+N +++ L+N V Sbjct: 973 LNENEAKRLISSAAQAPFGKGRETV--VDTTVRNTWEIEGANVDFLNPRWKGWLENLVFT 1030 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 V A + + H D A + AT+ + Sbjct: 1031 TVWARLGVAAFTTLPRCELYK----EAGSHFKPHQDTAKAEG---------MFATVVVVL 1077 Query: 123 PQSYDGGELVV 133 P ++GG++ + Sbjct: 1078 PSKFEGGQIPL 1088 >UniRef50_A8J470 Prolyl 4-hydroxylase alpha-1 subunit-like protein n=3 Tax=Viridiplantae RepID=A8J470_CHLRE Length = 343 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 71/203 (34%), Gaps = 36/203 (17%) Query: 2 MYHIPGVLSPQDVARF----REQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 ++ G+L+ ++ + R +LE++ D GA + ++ + Y + Sbjct: 76 VFLYKGILTHEECDQLMDNSRSRLERSGVSDATTGAGAV-SDIRTS-----SGMFYERGE 129 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 E++ + + RY+ + Y H D + + Sbjct: 130 TELVKRIENRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFDGADDNGGNRMATVL 189 Query: 118 LFLSDPQSYDGGELVVNDTFG---------------QHRVKLPAGDLVLYPS-------- 154 ++L+ P+ +GGE V G VK GD VL+ S Sbjct: 190 MYLATPE--EGGETVFPKVVGWVVQLTTTASAPCRQGLAVKPAKGDAVLFWSIRPDGRFD 247 Query: 155 -SSLHCVTPVTRGVRVASFMWIQ 176 SLH PV +GV+ ++ WI Sbjct: 248 PGSLHGSCPVIKGVKWSATKWIH 270 >UniRef50_Q8DKV0 Tlr0755 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DKV0_THEEB Length = 197 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 66/185 (35%), Gaps = 15/185 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN--NQQVDTRSTLYAALQNE 59 + ++ + E+ + D ++ G V+ + D + A ++ Sbjct: 15 ILLFQRLIPVHHCQQVIATAEKVGFEDAQILMGTVDRSVRGGSLLRFDPQDPQQAMVRQM 74 Query: 60 VLNAVNQHALFFAAALPRT---LSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR----T 112 +L A + + RY+ E Y HVD + Q + Sbjct: 75 LLQATQTIQIVLYQHYGIRFPEIENFSVLRYRVGEGYRRHVDNLLLGSRQMELAQGIPTR 134 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL--HCVTPVTRGVRVA 170 D+S +L+ + + GGE + ++ GD+V++P+ H PV +G + A Sbjct: 135 DVSLVGYLN--EDFQGGETYFD--RQGVKITPRTGDIVVFPAYYTHPHAALPVVQGTKYA 190 Query: 171 SFMWI 175 W+ Sbjct: 191 FATWL 195 >UniRef50_Q0FVH2 Oxidoreductase domain protein n=9 Tax=Rhodobacterales RepID=Q0FVH2_9RHOB Length = 204 Score = 122 bits (306), Expect = 9e-27, Method: Composition-based stats. Identities = 40/180 (22%), Positives = 62/180 (34%), Gaps = 14/180 (7%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN-----NQQVDTRSTLYAAL 56 ++ IP S D R + A D R+ Q ++ V + + Sbjct: 25 VHRIPAAFSEIDCDRIIDLSRTAHSADARLVGRNQDHNLRRADLVWLDDVAGAEWVMEKI 84 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHPQNGWMRTDL 114 V A + L + RY + + +H D R L Sbjct: 85 IELVRQANR---AVYGFDLDAFDESAQVARYGAERQGHFSWHSDVG----DGRLAARRKL 137 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 + + LS+P +Y GG L V + + G L+PS LH VTPV G R + +W Sbjct: 138 TMVVQLSEPGAYRGGALEVMPSAHTVEAERARGSATLFPSYLLHRVTPVEAGERRSMTIW 197 >UniRef50_B7G8Q9 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G8Q9_PHATR Length = 474 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 62/191 (32%), Gaps = 21/191 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD----GRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 + I L+ + + + G + ++ + T + Y Sbjct: 277 VVIIDDFLNETETSTLIALGADQGYERSTDVGEILEDGSYEDDESETRTSTNAWCYNECD 336 Query: 58 NE-VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + V + + F P + RY+ + Y H D + + L+ Sbjct: 337 DHEVTQIIWERMTFLTQIPPENSESLQMLRYEPGQFYAVHHDY-IENDWNRAVGSRILTV 395 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPVTR 165 L+L+D + +GG + + V+ G +L+PS + H VT+ Sbjct: 396 FLYLNDVE--EGGATNFPEL--ELAVQPKRGRALLWPSVLDQYPHKKDDRTEHEAQVVTK 451 Query: 166 GVRVASFMWIQ 176 G++ + W Sbjct: 452 GIKYGANAWFH 462 >UniRef50_B0CZ29 Predicted protein n=2 Tax=Agaricales RepID=B0CZ29_LACBS Length = 261 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 59/195 (30%), Gaps = 29/195 (14%) Query: 4 HIPGVLSPQDVARFREQLEQAE-WVDGRVTTGAQGAQVKNNQQVDT---------RSTLY 53 I V +P + A W ++ V N + S +Y Sbjct: 58 LIDNVFTPNECADLIALASSTGDWSPAGLSAEGPTQTVHTNFRNSDRVLVIDEEVSSRIY 117 Query: 54 AALQNEVLNAVNQHA---LFFAAALPRTLS-----------TPLFNRYQNNETYGFHVDG 99 L+ V + P F RY + + + H DG Sbjct: 118 EKLRPLVDEICEIAPGSRWSCITSRPGKEQGPTWKMVRINPRLSFLRYGSGQYFKPHCDG 177 Query: 100 AVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH--RVKLPAGDLVLYPSSSL 157 + +G ++ ++ L+L++P GG + V+ G ++++ L Sbjct: 178 --LNDLLDGKQKSFVTLHLYLNEPDGLTGGATRFWTPDKKEHLDVEPKLGRVLVFQQRML 235 Query: 158 -HCVTPVTRGVRVAS 171 H VT GV+ Sbjct: 236 VHSGEEVTGGVKYTM 250 >UniRef50_Q98E25 Mlr4439 protein n=1 Tax=Mesorhizobium loti RepID=Q98E25_RHILO Length = 192 Score = 121 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 70/192 (36%), Gaps = 18/192 (9%) Query: 1 MM--YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGA----QVKNNQQVDTRSTLYA 54 M+ I VL + ++ A + A +V+ ++ + Sbjct: 1 MIDYIQITEVLDEAACSALCAEIRAAGGRAAGLMGRADQKPAWPEVRRTRRAEVSEATEG 60 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 ++ + F AL T P F Y+ + + H DG + Sbjct: 61 SVNALLARQKTALERHFGLAL-GTCEKPQFLHYREGDFFVPHQDGNTPLIHDESRF-RKI 118 Query: 115 SATLFLS------DPQSYDGGELVVNDTFGQ--HRVKLP--AGDLVLYPSSSLHCVTPVT 164 SA +FL+ P+ Y GG LV++ + RV +P G LV + S + H VTPVT Sbjct: 119 SAVIFLNRQSDDPSPEDYSGGSLVLHGPYSGPNLRVTMPALPGSLVAFRSETTHEVTPVT 178 Query: 165 RGVRVASFMWIQ 176 R R W + Sbjct: 179 RNERFTIVSWYR 190 >UniRef50_A1ZDI6 Oxidoreductase, 2OG-Fe(II) oxygenase family family n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDI6_9SPHI Length = 486 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 63/181 (34%), Gaps = 20/181 (11%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN-QQVDTRSTLYAALQNEVLN 62 + +P+ + E+ + + +NN +QV TL A L E+ Sbjct: 2 VVKKAFAPELCKKIIEE-RKNNFAKA---ITHYPTSYRNNDRQVVDDDTLAALLFEEIKQ 57 Query: 63 AVNQHALFFAAA--------LPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 V L RYQ + + H+DG H Q+ +++ L Sbjct: 58 YVPSSIDIAGVGKDEAGNWQLKELNHRLRICRYQPEQYFNKHLDG---VHYQSATVQSKL 114 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVK----LPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + ++L+D + GG + + V GDL+++ + H + G++ Sbjct: 115 TFMVYLNDSHEFIGGRTLFFASKDSDEVIQEFLPETGDLIIFDHNIWHAGEVLHSGIKYI 174 Query: 171 S 171 Sbjct: 175 L 175 >UniRef50_B8C4F7 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4F7_THAPS Length = 467 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 62/192 (32%), Gaps = 24/192 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWV----DGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 + LS +V E + G + ++ + + + + + Sbjct: 266 VVVFNNFLSDNEVDDLIRGGEMEGFERSTDQGAANALGEQEKIVSQTRTSSNAWCMHKCE 325 Query: 58 NE--VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 V +A + + Y N+ Y H D + R H G L+ Sbjct: 326 RLGGVRSATTKIEDVTGIPRV-NYESFQLLNYGQNQFYRSHHDSSSRDHTPPGP--RILT 382 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPVT 164 L+LSD + +GGE N VK G +++PS H V Sbjct: 383 FFLYLSDVE--EGGETYFNKLD--LAVKPKKGRALVWPSVVDNDPEFWDARMYHEAKDVI 438 Query: 165 RGVRVASFMWIQ 176 +G ++A+ WI Sbjct: 439 KGKKLAANHWIH 450 >UniRef50_A4RVI8 Predicted protein n=2 Tax=Ostreococcus RepID=A4RVI8_OSTLU Length = 378 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 72/229 (31%), Gaps = 31/229 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS----------- 50 ++ + LS + E + + GA ++ ++ V S Sbjct: 127 VFTVDDFLSANECDMLTASAEASGGLKVSAIGGAANENIRTSRTVALNSHGLENHATKKA 186 Query: 51 --TLYAALQNEVLNAVNQHALFFAAALPR---TLSTPLFNRYQNNETYGFHVDGAVRSHP 105 + L V F A + P Y E + H DG + Sbjct: 187 ILSRAEYLLPAVEGLSKDADAFRAPEAGEGKWSFELPQVAHYSGGEYFKAHEDGFPIAVA 246 Query: 106 QNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSS 156 + + + ++L+D +GGE + V G +++ + + Sbjct: 247 ADKGYQRRATILVYLNDVD--EGGETRFEHLGIE--VAPKKGKALVFFPSSAACMPDART 302 Query: 157 LHCVTPVTRG-VRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGE 204 LH TP G + S +WI S L E DN ++ ++ Y + Sbjct: 303 LHTATPAKEGHEKWVSQLWIASSTPPVPTPEEL-EADNKRKAAEAEYAK 350 >UniRef50_B7GAW7 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7GAW7_PHATR Length = 584 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 65/193 (33%), Gaps = 23/193 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD----GRVTTGAQGAQVKNNQQVDTRSTLYAA-- 55 + + L+ ++ + + G+V V++ ++ + Sbjct: 384 VITLDNFLTLEECTELINIGHKHGYNRSKDVGKVKVDGTHEAVQSTRRTSENAWCSNQSG 443 Query: 56 LQNEVLNAVNQHALFFAAALPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 ++E L + + +P +Y+ + Y H D ++ + L Sbjct: 444 CRDEALPQLLHERMATVMRIPAQNSEDFQLLKYEKGQFYRTHHD-FIQHQTKRQCGPRIL 502 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPV 163 + L+LSD + GG D V+ AG +L+PS +H V Sbjct: 503 TFFLYLSDVTA--GGGTNFPDLDIT--VEPKAGRALLWPSVYDSDPMAKDGRMMHQALEV 558 Query: 164 TRGVRVASFMWIQ 176 GV+ A+ WI Sbjct: 559 EDGVKFAANGWIH 571 >UniRef50_D2V6G1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V6G1_NAEGR Length = 212 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 71/196 (36%), Gaps = 28/196 (14%) Query: 2 MYHIPGVLSPQDVARFR-EQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-----TLYAA 55 ++ I G++S ++ + E+ E+ G AQ ++NN ++ + L+ Sbjct: 7 IWLIDGLVSEEECQEIITNECEKKEFESGTYN-NAQDRSIRNNSRLIMDNQNYSNWLWTR 65 Query: 56 LQNEVL-------NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG 108 +++ + N + A L F RY E + H DG + Sbjct: 66 VKDYIPKLASELSNKCVRRATTIGYELCELSDKIRFYRYYKGEFFAPHSDGGIVLESVET 125 Query: 109 --------WMRTDLSATLFLSDPQSYDGGELVV---NDTFGQHRVKLPAGDLVLYPSSSL 157 ++ L+ L+L++ + GGE N + V+ G ++L+ + Sbjct: 126 IDGEEYYVTKKSFLTLLLYLNEIPN-GGGETEFLNKNTKQVEWSVEPKPGRILLFVHENY 184 Query: 158 HCVTPVTR--GVRVAS 171 H V + V+ Sbjct: 185 HQAKTVNQDGDVKFVM 200 >UniRef50_B5VUP4 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=3 Tax=Oscillatoriales RepID=B5VUP4_SPIMA Length = 377 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 67/196 (34%), Gaps = 20/196 (10%) Query: 2 MYHIPGVLSPQDVARFREQLE-----QAEW--VDGRVTTGAQGAQ-VKNNQQVDTRSTLY 53 + IP VL + + E ++ + +G T G + + Sbjct: 173 VLLIPKVLDLRLCRELIKIWETQGNDESGFMKREGEKTVGYVDPSFKRRRDHFIQDGPVK 232 Query: 54 AALQNEVLNAV-NQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHPQNGWM 110 + + + V + F L R Y ++ + H D G + Sbjct: 233 NYIDSIMQRRVFPEILQAFQFQLTRR-ECYKIGCYDSESGGFFRPHRDNTT-----GGTL 286 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + T+ L+ + Y+GG L + H K GD +++ S++H T VT G R A Sbjct: 287 HRRFAMTINLN-TEEYEGGCLRFPE-HAPHLYKPATGDAIIFSCSTMHEATDVTSGRRFA 344 Query: 171 SFMWIQSMIRDDKKRA 186 + D ++R Sbjct: 345 LLSFFYGD-EDAERRN 359 >UniRef50_A1TLC5 2OG-Fe(II) oxygenase n=13 Tax=Proteobacteria RepID=A1TLC5_ACIAC Length = 309 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 54/193 (27%), Gaps = 30/193 (15%) Query: 2 MYHIPGVLSPQDVARFREQLE---QAEWVDGRVTTG---AQGAQVKNNQQVDTRSTLYAA 55 + +LSP++ + T G + + A Sbjct: 124 VVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPVVAR 183 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWM 110 L+ + + L Y+ Y H D + P Sbjct: 184 LEARI-ARLVNWPL-------ENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGG 235 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY-------PSSSLHCVTPV 163 + + ++L+DP+ GG D V G+ V + + +LH PV Sbjct: 236 QRVATIVIYLNDPE--KGGGTTFPD--VHLEVAPRRGNAVFFSYERPHPSTRTLHGGAPV 291 Query: 164 TRGVRVASFMWIQ 176 G + + W++ Sbjct: 292 VAGDKWIATKWLR 304 >UniRef50_Q5UP57 Putative prolyl 4-hydroxylase n=1 Tax=Acanthamoeba polyphaga mimivirus RepID=P4H_MIMIV Length = 242 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 66/191 (34%), Gaps = 31/191 (16%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQ--VDTRSTLYAALQNEV 60 + + +++P + + D +V +G ++N+QQ + + + + + Sbjct: 61 FVLNNLINPTKCQEIMQFANGKLF-DSQVLSG-TDKNIRNSQQMWISKNNPMVKPIFENI 118 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLS 115 + F A RY N+ Y H D S Q + L+ Sbjct: 119 CR--QFNVPFDNA------EDLQVVRYLPNQYYNEHHDSCCDSSKQCSEFIERGGQRILT 170 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS----------SSLHCVTPVTR 165 ++L++ + G + + K GD +++ SLH PVT Sbjct: 171 VLIYLNN--EFSDGHTYFPNLN--QKFKPKTGDALVFYPLANNSNKCHPYSLHAGMPVTS 226 Query: 166 GVRVASFMWIQ 176 G + + +W + Sbjct: 227 GEKWIANLWFR 237 >UniRef50_C6B8G5 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen n=3 Tax=Alphaproteobacteria RepID=C6B8G5_RHILS Length = 380 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 64/180 (35%), Gaps = 20/180 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD-------GRVTT--GAQGAQVKNNQQVDTRSTL 52 + +P V P + E++ + G T G + + + + + + Sbjct: 173 IIVLPNVFEPDLCKKLIGLYERSGGEESGVMREVGGKTVQVNDHGYKRRKDYDIQEKDVI 232 Query: 53 YAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHPQNGWM 110 V V + + + Y ++ + H D G Sbjct: 233 AETQGRFVRRIVPEIQKVHQFTAT-RMERYIVACYAAEDEAHFRAHRDNTT-----KGTA 286 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVA 170 + ++ L+D +DGGE+ + +G K PAG V++ S LH V+ VTRG R A Sbjct: 287 HRRFAVSVNLND--DFDGGEVSFPE-YGSRSFKAPAGGAVIFSCSLLHAVSKVTRGRRYA 343 >UniRef50_A4RSI6 Predicted protein (Fragment) n=5 Tax=Viridiplantae RepID=A4RSI6_OSTLU Length = 255 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 66/202 (32%), Gaps = 38/202 (18%) Query: 3 YHIPGVLSPQDVARFREQ----LEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 + G L+ ++ L ++ VD + T G+ + ++ + + Sbjct: 4 FVYEGFLTDEECDHILALSKGHLHKSGVVDAK-TGGSTTSDIRTSTGTFISRAHDPTIT- 61 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + + + RY+N + Y H D H + L Sbjct: 62 AIEERIELWSQIPV----DHGEALQVLRYENGQEYKAHFDYFF--HKGGKRNNRIATVLL 115 Query: 119 FLSDPQSYDGGELVVNDTF---------------GQHRVKLPAGDLVLYPS--------- 154 +LSD + +GGE V +T G VK GD +L+ S Sbjct: 116 YLSDVE--EGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDP 173 Query: 155 SSLHCVTPVTRGVRVASFMWIQ 176 S H PV +GV+ + W+ Sbjct: 174 GSSHAGCPVIKGVKWTATKWMH 195 >UniRef50_B5YLH9 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YLH9_THAPS Length = 451 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 62/201 (30%), Gaps = 37/201 (18%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWV----------DGRVTTGAQGAQVKNNQQVDTR-- 49 + + +SP++ R E ++ DG T + +N + Sbjct: 249 LVTLEDFISPEEAERLIELGHVEQYKRSTDVGHLKADGSYTEDVHSTRTSSNSWCLDKCM 308 Query: 50 -STLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG 108 + + + + + + + RY+ + YG H D + Sbjct: 309 KDPVAKDVVDRI-EHMTMIPQTNS-------ESLQLLRYEEGQYYGVHHDLIEHQKDRPP 360 Query: 109 WMRTDLSATLFLSDPQS--YDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------S 155 + L+ ++L+ + +GG V G ++ S Sbjct: 361 GV-RILTFYMYLNGNEDSGLEGGGTKFPRIGAT--VTPKRGRAAMWSSVLDENPHKKDPR 417 Query: 156 SLHCVTPVTRGVRVASFMWIQ 176 + H PVT+GV+ + WI Sbjct: 418 TDHTALPVTKGVKYGANAWIH 438 >UniRef50_B9GU89 Predicted protein n=11 Tax=Embryophyta RepID=B9GU89_POPTR Length = 287 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 68/198 (34%), Gaps = 28/198 (14%) Query: 2 MYHIPGVLSPQDVARFREQLE---QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 + + LS ++ R + + V T ++V+ + + S Sbjct: 90 IIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRTSSGMFLSS---EEKTY 146 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 +V+ A+ + ++ RY+ N+ Y H D + + + + Sbjct: 147 QVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRVATMLM 206 Query: 119 FLSDPQSYDGGELVVNDTFGQH-----------RVKLPAGDLVLYPS---------SSLH 158 +LSD + +GGE VK G+ VL+ S SS+H Sbjct: 207 YLSD--NVEGGETYFPMAGSGKCSCGGKVVDGLSVKPIKGNAVLFWSMGLDGQSDPSSIH 264 Query: 159 CVTPVTRGVRVASFMWIQ 176 V GV+ ++ W++ Sbjct: 265 GGCEVLSGVKWSATKWMR 282 >UniRef50_B7GCB6 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7GCB6_PHATR Length = 199 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 66/195 (33%), Gaps = 27/195 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEW----VDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 +Y I L+P ++ R ++ ++ VD + G + V + T + Sbjct: 8 IYIIEDFLTPTELDYLRSKICAGKFQRSYVDAIESGG--NSIVDKEHRTSTFLSFGKQQD 65 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG--------W 109 ++V + + A R + RY + +G H D Sbjct: 66 SKVASIEAKAATILGCWSSRIVEPLQLVRYLPGQFFGEHHDMGDLQQDGTVALPPKSLFS 125 Query: 110 MRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCV 160 R ++ +L+ + GG + + +V G V++ S ++H Sbjct: 126 KRRLVTLFCYLNKVE--KGGATGFR--YCELKVPPKPGRAVMFSNVLPDGMPDSRTVHSG 181 Query: 161 TPVTRGVRVASFMWI 175 PV GV+ +WI Sbjct: 182 EPVLDGVKYGLNIWI 196 >UniRef50_B0C441 2OG-Fe(II) oxygenase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C441_ACAM1 Length = 226 Score = 118 bits (296), Expect = 1e-25, Method: Composition-based stats. Identities = 41/178 (23%), Positives = 69/178 (38%), Gaps = 15/178 (8%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN--QQVDTRS-----TLYAALQ 57 + S Q+ Q + + G V + +++ + +Y + Sbjct: 50 LEHCFSDQECDTIETYFSQVKAMTGEVGGKSVHKATRDSTLHWLRLNDHPDSVWIYEKIM 109 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + V ++ F L S+ Y+ Y +H D R LS + Sbjct: 110 HHVAQVNAENWQFR---LDGFESSIQLTEYEPGGHYTWHQDIGSRRSGL-----RKLSVS 161 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 + LSDP++Y GG L ++ T + G LV++PS +LH VT +T G R A WI Sbjct: 162 VQLSDPETYVGGGLELHATQKPVMMPRSRGTLVIFPSYTLHRVTAMTEGTRRALVTWI 219 >UniRef50_B8MEI3 Putative uncharacterized protein n=1 Tax=Talaromyces stipitatus ATCC 10500 RepID=B8MEI3_TALSN Length = 914 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 78/238 (32%), Gaps = 34/238 (14%) Query: 3 YHIPGV------LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----ST 51 H+PG+ ++P V + + + G T V+ + Q+D + Sbjct: 82 LHVPGIGAIGLPVTPDQVKAMIQSSRMSPYGKGSET--LVNESVRKSWQLDANQFSLQNP 139 Query: 52 LYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 L+ A + A Y+ + H D Sbjct: 140 LWKAQLDNFKKEAITGLGLTANPEEVKAELYKLLIYEEGAFFLPHQDSEKADG------- 192 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGD-----LVLYPSSSLHCVTPVTRG 166 + ATL +S P + GG++V + + + + +H V PV G Sbjct: 193 --MFATLVVSLPSKHQGGDVVASHKDKKMIFSTAGNSEFGFSWAAWYADVMHEVKPVVSG 250 Query: 167 VRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 R+ +MI AM+ + ++ S+ S ++ N HNL R WS+ Sbjct: 251 YRIVLVY---NMIHRPS--AMIVKARDSEMGYLSKLLASWA-RAVENSMHNL-RSWSD 301 >UniRef50_B8LC79 Predicted protein n=2 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8LC79_THAPS Length = 541 Score = 117 bits (294), Expect = 3e-25, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 59/195 (30%), Gaps = 24/195 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVD----GRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 + +S + + + G N+ + + L Sbjct: 342 IVVFENFVSDEQATALIAAGAKKGYERSADVGIENPDGSHEDDVNDDRTSHNAWCDDELC 401 Query: 58 N---EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 N + + + A ++ + +Y+ + Y H D + L Sbjct: 402 NNDPVIAPVIERIASVTKTSVNNS-EFLQLLQYEPGQYYKQHHDYIEYQEDMPCGV-RML 459 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPV 163 + L+L+D + +GG ++ G+ V++PS + H PV Sbjct: 460 TLFLYLNDVE--EGGGTHFPLLDIT--IQPKKGNAVIWPSVLDDKPETKDPRTDHEALPV 515 Query: 164 TRGVRVASFMWIQSM 178 G++ + W+ + Sbjct: 516 INGIKYGANAWLHTR 530 >UniRef50_C1DZC3 Prolyl 4-hydroxylase n=3 Tax=Viridiplantae RepID=C1DZC3_9CHLO Length = 454 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 71/218 (32%), Gaps = 50/218 (22%) Query: 3 YHIPGVLSPQDVARFREQLEQA---EWVDGRVTTGAQGAQVKNNQQV---DTRSTLYAAL 56 Y L+P + + ++ V G +G+ ++++ + + + A+ Sbjct: 180 YMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSKIRTSAGMFLGRGQDPTVRAI 239 Query: 57 QNEVLNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVDGAV--RSHPQNGWMRTD 113 + + A + LP RY+N + Y H D + + Sbjct: 240 EERIAAA---------SGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRRGGQRM 290 Query: 114 LSATLFLSDPQSYDGGELVVN----------DTFGQH-----------RVKLPAGDLVLY 152 + ++L D +GGE + D G H VK GD VL+ Sbjct: 291 ATMLIYLEDTT--EGGETIFPNGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRGDAVLF 348 Query: 153 PS---------SSLHCVTPVTRGVRVASFMWIQSMIRD 181 S SLH PV G + + WI+ D Sbjct: 349 WSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIRVAKFD 386 >UniRef50_B8C9F1 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C9F1_THAPS Length = 490 Score = 116 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 60/194 (30%), Gaps = 25/194 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTT-----GAQGAQVKNNQQVDTRST--LYA 54 + L+ ++ + + +A++ + G+ + V + + Sbjct: 290 IITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENAWCSFRD 348 Query: 55 ALQNEVLNAVNQHALFFAAALP-RTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 +N + + +P +Y+ + Y H D + Sbjct: 349 KCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRRCGP-RV 407 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTP 162 L+ L+LSD + +GG+ VK G VL+PS + H Sbjct: 408 LTFFLYLSDVE--EGGDTNFPKLSI--AVKPKKGSAVLWPSVLDSNPSMKDPRTDHEAQE 463 Query: 163 VTRGVRVASFMWIQ 176 V G + + W+ Sbjct: 464 VVNGTKFGANAWLH 477 >UniRef50_Q338D2 Prolyl 4-hydroxylase, putative, expressed n=11 Tax=Embryophyta RepID=Q338D2_ORYSJ Length = 309 Score = 116 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 65/210 (30%), Gaps = 30/210 (14%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + G L+ + + + V G V + + + Q+EV+ Sbjct: 55 FLHKGFLTDAECEHLISLA-KDKLEKSMVADNESGKSVMSEVRTSSG-MFLEKKQDEVVA 112 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 + + + P + YQN E Y H D + Q + ++LSD Sbjct: 113 RIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMYLSD 172 Query: 123 PQSYDGGELVVNDTF-----------------GQHRVKLPAGDLVLY---------PSSS 156 GGE + + + VK GD +L+ S S Sbjct: 173 VG--KGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTDSDS 230 Query: 157 LHCVTPVTRGVRVASFMWIQSMIRDDKKRA 186 LH PV G + ++ WI D + Sbjct: 231 LHGSCPVIEGQKWSATKWIHVRSFDISVKQ 260 >UniRef50_Q486F0 Oxidoreductase, 2OG-Fe(II) oxygenase family n=8 Tax=Gammaproteobacteria RepID=Q486F0_COLP3 Length = 261 Score = 116 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 40/202 (19%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV------DTRSTLYAAL 56 + + VL+ + + EQ E++ + V++N + T ++ + Sbjct: 59 FQLFNVLTKDECEKLISISEQLEFLP--DAAVSLPRSVRHNDSLTWIVDEQTDGIIWQRI 116 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH---------PQN 107 + + + A+F + + F RY ++ + H DG+ Sbjct: 117 AHLMDDR---QAIFGGSKALGINARFRFYRYNPDDYFKPHSDGSWPGSRIINDELIANAY 173 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVV-------------NDTFGQHRVKLPAGDLVLYPS 154 + ++ +FLS + + GGE ND V+ PAG ++ +P Sbjct: 174 PDRYSQMTFLIFLS--EDFQGGETRFLVNADDPTKPATSNDNVKNVDVRTPAGGILCFPH 231 Query: 155 -----SSLHCVTPVTRGVRVAS 171 +H P+T GV+ Sbjct: 232 GMHPLHCIHSSVPITDGVKYII 253 >UniRef50_C9S6P4 Putative uncharacterized protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9S6P4_VERA1 Length = 294 Score = 116 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 66/220 (30%), Gaps = 44/220 (20%) Query: 4 HIPGVLSPQDVARFREQLEQA------EWVDGRVTTGAQGA----QVKNNQQVD-TRSTL 52 + VL+PQ+ A E++ W V GA + +N+ ++ + Sbjct: 65 VLDNVLTPQECALLLALAERSATDQSRPWQPAMVNIGAGREVLEPEYRNSDRIVWDEDEV 124 Query: 53 YAALQNEVLNAVNQHAL---FFAAALPRTL----------------STPLFNRYQNNETY 93 L V + +HA FFA P F +Y + + Sbjct: 125 VRRLWARV--RLARHADGAPFFADEGPLAWASGDAVDGGWRFWGLNRRMRFLKYGPGQFF 182 Query: 94 GFHVDGAVRSHPQNGWMRTDLSATLFLSD---------PQSYDGGELVV--NDTFGQHRV 142 H DG ++RT + +L+D GG D + V Sbjct: 183 RPHCDGTYEEASGGRYLRTYYTVHFYLNDSVQAVGDNAGADLKGGATCFLSYDEKRRLDV 242 Query: 143 KLPAGDLVLYPS-SSLHCVTPVTRGVRVASFMWIQSMIRD 181 AG +++ H V G + + + D Sbjct: 243 DPKAGRALIFQHPRMYHAGDDVLAGTKYTMRTEMMYELVD 282 >UniRef50_B9ZQ51 Procollagen-proline dioxygenase n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZQ51_9GAMM Length = 575 Score = 116 bits (291), Expect = 6e-25, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 77/239 (32%), Gaps = 36/239 (15%) Query: 1 MMYHIPGVLSPQDVARFREQLE-----QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAA 55 ++ ++ L P + + +DG + +QG N L Sbjct: 58 LVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGS-SGVSQGRTGSNCWLRYQEEPLARR 116 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWM 110 + V V + Y + + Y H D P+ Sbjct: 117 IGERVAKRVGFPLEYA--------EPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCTRQGG 168 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PS-SSLHCV 160 + ++A L+L++ + +GG + + V G + ++ P SLH Sbjct: 169 QRMVTALLYLNEVE--EGGATAFPNAGVE--VAPRKGRIAIFNNVGADPGRPHPRSLHGG 224 Query: 161 TPVTRGVRVASFMWIQSMIRDDKKRAMLF-ELDNNIQSLKSRYGESEEILSLLNLYHNL 218 PV G + A+ +W ++ R +R F ++++ + G +++ L Sbjct: 225 MPVKSGEKWAASIWFRA--RPAHERQPWFDDVEDASAQVPEGEGGHWPVVASNRAQSIL 281 >UniRef50_Q24JN5 At2g17720 n=14 Tax=Spermatophyta RepID=Q24JN5_ARATH Length = 291 Score = 116 bits (291), Expect = 6e-25, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 63/204 (30%), Gaps = 38/204 (18%) Query: 3 YHIPGVLSPQDVARFREQLE----QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 L+ ++ + ++ VD T G++ ++V+ + T + Sbjct: 90 VVYHNFLTNEECEHLISLAKPSMVKSTVVD-EKTGGSKDSRVRTS-----SGTFLRRGHD 143 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 EV+ + + F YQ + Y H D + + + + Sbjct: 144 EVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLM 203 Query: 119 FLSDPQSYDGGELVVNDTFG-----------------QHRVKLPAGDLVLYPS------- 154 +LSD GGE V G V D +L+ + Sbjct: 204 YLSDVDD--GGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASL 261 Query: 155 --SSLHCVTPVTRGVRVASFMWIQ 176 SSLH PV +G + +S W Sbjct: 262 DPSSLHGGCPVVKGNKWSSTKWFH 285 >UniRef50_A9YW24 Putative uncharacterized protein n=2 Tax=unclassified Phycodnaviridae RepID=A9YW24_OSV5 Length = 206 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 58/181 (32%), Gaps = 28/181 (15%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQ--QVDTRSTLYAALQNEVL 61 I ++ ++ + ++ V ++++++ +D + + + + Sbjct: 24 VIKEFITEEERKHIIRKAQKKLEVSTVAENRVVDKKIRDSETAWLDASDPVVKRVMEKCV 83 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + + L RY+ Y H D + + L L+ Sbjct: 84 S-LTDRPLV-------NCEHIQVLRYKPGGHYSPHQDTFS----DTKGNKRMYTVILALN 131 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSSLHCVTPVTRGVRVASF 172 D Y+GGE + +++ GD + + S +LH PV G + Sbjct: 132 D--DYEGGETEFPNLKKKYKW---GGDALFFHTLDNYELMTSKALHGGRPVESGEKWICN 186 Query: 173 M 173 + Sbjct: 187 L 187 >UniRef50_A8P9H2 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P9H2_COPC7 Length = 946 Score = 116 bits (290), Expect = 8e-25, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 72/203 (35%), Gaps = 23/203 (11%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR------STLYAALQNEVLN 62 L+ ++ +QA + G+ T ++N ++++ L+ V Sbjct: 59 LNEREAKAIIASSKQAPFGKGKKT--LVDKTIRNTWEIESDKVTFSNPKWTTWLEATVFK 116 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 V Y+ + H D + AT+ + Sbjct: 117 TVWDSLGVAPYTTRPRCELYKLLVYEKGSHFKAHQDTQKADG---------MFATVIVVL 167 Query: 123 PQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVASFMWIQS 177 P +++GGE++++ + V + A + + + +H V P+T G R+A + Sbjct: 168 PSAFEGGEVILSHSGATETVDITANSAMETSILAWYTDVMHEVRPITSGYRLALSYNLIH 227 Query: 178 MIRDDKKRAMLFELDNNIQSLKS 200 + R +L ++ + + L++ Sbjct: 228 TSPNVP-RPVLPDMSDAAKRLQA 249 >UniRef50_A5VC31 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas wittichii RW1 RepID=A5VC31_SPHWW Length = 186 Score = 115 bits (289), Expect = 9e-25, Method: Composition-based stats. Identities = 40/177 (22%), Positives = 69/177 (38%), Gaps = 16/177 (9%) Query: 9 LSPQDVARFREQL-----EQAEWVDGRVTTGAQGAQVKNNQQVDTRST--LYAALQNEVL 61 LS ++ + E+ + A + T G + + +DT + + A L + V+ Sbjct: 12 LSSEECDQIIERGLRYAPQTATVGFAKDTRQDDGYRTSVVRWLDTGAEQAIVARLMSFVV 71 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRY--QNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 ++ + A F Y + Y +H D + S LS + Sbjct: 72 SSNRTNFGVDIVAP----FDLQFTEYHGTSQGKYDWHQDVWLES---TRPYDRKLSLVVQ 124 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQ 176 LSDP Y+GG + P G L+++PS H V PVT G+R + W++ Sbjct: 125 LSDPADYEGGAFEFFGLQHPGALFAPRGSLLIFPSWMQHRVLPVTGGIRRSLVSWVE 181 >UniRef50_Q9LSI6 Prolyl 4-hydroxylase alpha subunit-like protein n=31 Tax=Magnoliophyta RepID=Q9LSI6_ARATH Length = 332 Score = 115 bits (289), Expect = 9e-25, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 65/200 (32%), Gaps = 29/200 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 ++ G LS ++ F + + + V G V++ + + L + V Sbjct: 81 VFLYEGFLSDEECDHFIKLA-KGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVS 139 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 N V + + Y+N + Y H D + ++LS Sbjct: 140 N-VEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLS 198 Query: 122 DPQSYDGGELVVNDTFG----------------QHRVKLPAGDLVLY---------PSSS 156 + + GGE V G + VK GD +L+ S+S Sbjct: 199 NVE--KGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 256 Query: 157 LHCVTPVTRGVRVASFMWIQ 176 LH PV G + ++ WI Sbjct: 257 LHGSCPVVEGEKWSATRWIH 276 >UniRef50_C1MXG7 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MXG7_9CHLO Length = 369 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 68/203 (33%), Gaps = 36/203 (17%) Query: 3 YHIPGVLSPQDVARFREQL----EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 Y G L+ + F + ++ VD T + ++ + + ++ Sbjct: 86 YVYRGFLTDAECDHFIARASPKLAKSNVVD-TDTGEGVPSAIRTS-----DGMFFDRGED 139 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH--PQNGWMRTDLSA 116 +V++AV + + RY + Y H+D V + + Sbjct: 140 DVVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGGQRVATV 199 Query: 117 TLFLSDPQSYDGGELVVNDTFGQH---------------RVKLPAGDLVLYPS------S 155 ++L+D GGE V +T + VK GD +L+ S Sbjct: 200 LMYLNDVDD--GGETVFPETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSMDETFTR 257 Query: 156 SLHCVTPV-TRGVRVASFMWIQS 177 SLH PV GV+ + WI Sbjct: 258 SLHGGCPVGAGGVKWSMTKWIHK 280 >UniRef50_Q2SJN5 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SJN5_HAHCH Length = 185 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 62/174 (35%), Gaps = 12/174 (6%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRV---TTGAQGAQVKNNQQVDTRSTLYAA-LQ 57 + + +++ Q+ ++ + ++ + +NN++V YAA L Sbjct: 10 ISTVEEIITVQECHQWIQFIDNQNPQIAPLHTHKGEVYKPDYRNNERVMKSDPDYAACLY 69 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 ++ + + Q F ++ RY+ + H DG R+ + Sbjct: 70 QKLKSELPQT--VFGWSIAGLNELFRCYRYKPGMKFAPHSDGFYGRSKDE---RSFYTLL 124 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L+L++ ++ GGE + ++ G + + H V G++ Sbjct: 125 LYLNEVEA--GGETGFF-VSPEVKISPKPGLALAFQHEIFHEGCEVKAGIKYVL 175 >UniRef50_B5YLK5 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YLK5_THAPS Length = 207 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 39/200 (19%) Query: 2 MYHIPGVLSPQDVARFREQ----LEQA----------EWVDGRVTTGAQGAQVKNNQQVD 47 + I G LS ++ RF E E++ D + ++G + Sbjct: 11 VVAIEGFLSDEECNRFIELGGDRYERSTEYASTMNLDGTFDSKESSGRTSTNTWCGEGCR 70 Query: 48 TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN 107 + +V+ + A RY+ + Y H D + SH Sbjct: 71 DDP-----IIKKVIERMESLTGIPYA----NFEDLQLVRYEIGQRYEEHHDYSS-SHEGT 120 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS----------- 156 + L+ +L+D + +GG ++ G +++PS++ Sbjct: 121 QYGPRILTVFFYLNDVE--EGGGTQFDELDFVTE--PKRGMALIWPSTTNEAPDVMDDWT 176 Query: 157 LHCVTPVTRGVRVASFMWIQ 176 H PVT+G++ + WI Sbjct: 177 WHEALPVTKGIKYGANTWIH 196 >UniRef50_B2JN14 Procollagen-proline dioxygenase n=6 Tax=Burkholderia RepID=B2JN14_BURP8 Length = 305 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 52/190 (27%), Gaps = 22/190 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLY-AALQNEV 60 + VLS + E+ V + V Q+ T + ++ Sbjct: 118 VIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDV---IQLRTSEGFWFQRCEDAF 174 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLS 115 + +++ Y Y H D S + + + Sbjct: 175 IERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQRVAT 234 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP----SS-----SLHCVTPVTRG 166 ++LSD GGE V V G + + +LH PVT G Sbjct: 235 LIVYLSDVA--GGGETVFP--NAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNG 290 Query: 167 VRVASFMWIQ 176 + W++ Sbjct: 291 EKWIMTKWMR 300 >UniRef50_D2VWJ6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VWJ6_NAEGR Length = 1113 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 62/209 (29%), Gaps = 23/209 (11%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR------STLYAALQNEV 60 ++ + +++ + G T V+N+ +++ + L + Sbjct: 98 PIVYKEQADEIIRIAKKSPYGKGEET--IYDDNVRNSYELEPNQFRITNTMWQKELNQLL 155 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 + Y+ + +H D + TL + Sbjct: 156 ETKIKSGLGIDKYKNVE-CKLYKLLLYEKGGHFEYHKDS---------EKECKQTCTLVI 205 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDL-----VLYPSSSLHCVTPVTRGVRVASFMWI 175 P ++GGE + + + + + S H V P+T G R I Sbjct: 206 ILPSIFEGGEFKIKHNDYEMEIATTNDYATDCHFISFYSDCDHAVMPLTSGYRTCLIYNI 265 Query: 176 QSMIRDDKKRAMLFELDNNIQSLKSRYGE 204 + ++ + + + ++++ + E Sbjct: 266 IVTTHQENQQLSIADNHVHSENVREQLSE 294 >UniRef50_C1FF01 Predicted protein (Fragment) n=3 Tax=Mamiellales RepID=C1FF01_9CHLO Length = 252 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 61/202 (30%), Gaps = 35/202 (17%) Query: 3 YHIPGVLSPQDVARFREQLEQAE---WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 + + ++S + + E A W T + + L+ + Sbjct: 4 FILDDIVSASECEALIKCAEGAGYSFWNAAVSTATFRNSDTVEIHSAAVADELWRRCAHL 63 Query: 60 VLNAV---NQHALFFAA-----ALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 V+ V H L+ LFN+Y+ + H DGA R Sbjct: 64 VVPTVVIEQGHPLWEPGLEGTWKACGVNDHLLFNKYEPGGHFSPHTDGASIVDMNR---R 120 Query: 112 TDLSATLFLS------------DPQSYDGGELVVNDTFGQHRV---------KLPAGDLV 150 + S ++L+ P+ G+ VV+ G +R + G + Sbjct: 121 SLYSMLVYLNRCPDGGGTALFSPPEGTSMGKFVVDPALGVYRWPEEWQTGVAPVEPGTAL 180 Query: 151 LYPSSSLHCVTPVTRGVRVASF 172 ++ + H PV G R Sbjct: 181 VFRQDTSHEGVPVGPGHRKIII 202 >UniRef50_D2VHD9 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHD9_NAEGR Length = 1139 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 70/212 (33%), Gaps = 32/212 (15%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT-----RSTLYAA-----L 56 ++ + +++ + G T Q V+N+ +++ + ++ L Sbjct: 104 PIIYKEQADEIIRIGKKSPFGKGEETIHDQ---VRNSFELEPHQFRITNPVWQKELHLLL 160 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 +++ + + P L L Y+ + FH + Sbjct: 161 NSKIKSGLGIEKHKKVVQSPCKLHKLLL--YEKGGHFDFH-----------KEKECQQTC 207 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDL------VLYPSSSLHCVTPVTRGVRVA 170 TL + P Y+GG + + + D + + S H V P+T G R Sbjct: 208 TLAIILPSLYEGGSFKIRHNSSEREIDYSDEDASTSAHFISFYSDCDHAVMPLTSGYRTC 267 Query: 171 SFMWIQSMIRDDKKRAMLFELDNNIQSLKSRY 202 I D + + +++N+++ L + Sbjct: 268 LVYNIIVTTHDINQPLPIVDIENDLEMLSQQL 299 >UniRef50_B8LC77 Predicted protein n=2 Tax=Thalassiosira pseudonana RepID=B8LC77_THAPS Length = 601 Score = 113 bits (284), Expect = 3e-24, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 59/201 (29%), Gaps = 38/201 (18%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGR----------VTTGAQGAQVKNNQQVDT--- 48 + + G LS ++ R + Q + + G + +N Sbjct: 402 VVSLEGFLSDEEADRLVQLGNQQGYKRSTKVQTHKGGNSIDAGITEDRTSHNTWCQEPSC 461 Query: 49 -RSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN 107 L A + + A +Y + Y H D + Sbjct: 462 YDDPLVAPIIERIAMLTKSSA--------NHSEHLQLLQYTEGQFYKQHNDY-IPQQRDM 512 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SS 156 ++ L+L+D + +GG V+ G+ +L+ S + Sbjct: 513 ACGPRIMTLFLYLNDVE--EGGGTRFPLLD--LTVQPKRGNAILWASVRDDDPEEKDIRT 568 Query: 157 LHCVTPVTRGVRVASFMWIQS 177 H PV +G++ + WI S Sbjct: 569 DHEALPVAKGMKYGANAWIHS 589 >UniRef50_Q21FK1 2OG-Fe(II) oxygenase n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21FK1_SACD2 Length = 478 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 66/209 (31%), Gaps = 28/209 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT---RSTLYAALQN 58 MY + L+ ++ R + +++ +++ + ++ D + + Sbjct: 104 MYALGEFLTTEECERIIANI-RSKLRPSELSSQESDKTYRTSRTCDLGTIDDPFIHYVDS 162 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW---MRTDLS 115 + V + Y+ + + H D + + Sbjct: 163 RICKLVGIDPSYS--------EVIQGQLYEVGQEFKAHTDYFEIKEMPEHGAVMGQRTYT 214 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRG 166 ++L+D + +GGE G +K AG +++ S S+H PV +G Sbjct: 215 VMIYLNDVE--EGGETDFPAADG--AIKPRAGLALIWNSLQSNGAPNPHSMHQAYPVLKG 270 Query: 167 VRVASFMWIQSMIRDDKKRAMLFELDNNI 195 + W +S R M + N Sbjct: 271 HKAVITKWFRSQSRLPNAPPMYSKEANEY 299 >UniRef50_B9IJQ5 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=25 Tax=Viridiplantae RepID=B9IJQ5_POPTR Length = 308 Score = 113 bits (283), Expect = 5e-24, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 66/199 (33%), Gaps = 29/199 (14%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + G LS ++ + V G +++ + + Q+E+++ Sbjct: 56 FLYKGFLSDEECDHLMNLARD-KLEKSMVADNESGKSIESEVRTSSG-MFIGKSQDEIVD 113 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 + + + Y++ + Y H D Q ++ ++LS+ Sbjct: 114 DIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRVVTVLMYLSN 173 Query: 123 PQSYDGGELVVNDTFG----------------QHRVKLPAGDLVLYPS---------SSL 157 GGE V ++ G + VK GD +L+ S +SL Sbjct: 174 VG--KGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFFSLHPDATTDTNSL 231 Query: 158 HCVTPVTRGVRVASFMWIQ 176 H PV G + ++ WI Sbjct: 232 HGSCPVIEGEKWSATKWIH 250 >UniRef50_A5WGK8 Procollagen-proline dioxygenase n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGK8_PSYWF Length = 268 Score = 113 bits (282), Expect = 5e-24, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 65/189 (34%), Gaps = 20/189 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + I LSP++ +Q + RV G+ V+++ + T + + + + Sbjct: 81 VTVINDFLSPEECDALISDADQ-KLKASRVVDPEDGSFVEHSARTSTSTGYHRGEIDIIK 139 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLSA 116 + A RY++ Y H D + + + Sbjct: 140 TIEARIADLIN-WPVDHGEGLQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTF 198 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS---------SLHCVTPVTRGV 167 ++LS+ GG + + ++ G + + ++ +LH PVT GV Sbjct: 199 LMYLSEVD--SGGSTRFPNLNFE--IRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGV 254 Query: 168 RVASFMWIQ 176 + + W++ Sbjct: 255 KYLATKWLR 263 >UniRef50_A6C2C6 Uncharacterized iron-regulated protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C2C6_9PLAN Length = 187 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 56/182 (30%), Gaps = 21/182 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWV---DGRVTTGAQGAQVKNNQQVDTRST-----LY 53 + + LS + A E+LE + G + V +Q++ Sbjct: 4 IIQVRNFLSATECAALIERLETQGFKEQLSGDRDRVVRARCVFTDQELADTYWQRLQQHV 63 Query: 54 AALQNEVLNAVNQHALFFA----AALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 AL + + + +Y E + H D A + Sbjct: 64 PALTEVYTDGFTPYPHLNSPLATFQPCGLNEVLRCYKYLPGEQFRRHEDFAY---EWSET 120 Query: 110 MRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRV 169 RT + +L++ Y GGE + +V G V++P H V G++ Sbjct: 121 RRTFYTVLFYLNN--EYTGGETTFDHN----QVVPETGLAVIFPHELYHSGNMVETGIKY 174 Query: 170 AS 171 A Sbjct: 175 AM 176 >UniRef50_B7G721 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G721_PHATR Length = 455 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 73/191 (38%), Gaps = 28/191 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR---STLYAALQN 58 ++ + VL+ + + R + T + + + + + L +++ Sbjct: 260 VFLLDHVLTRSECNQLRSVATMLGYRPDHPVTVDKPTGIDSCEWLVDASIMDPLNERVKS 319 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH----------PQNG 108 + + + A+ + S F RY + Y H+DG+ ++G Sbjct: 320 LLPPIMKESAVVHSI-----NSRWRFFRYSQDSVYRPHIDGSWPESRINEKGEYEYDESG 374 Query: 109 WMRTDLSATLFLSDPQSYDGGELVVNDTFGQ----HRVKLPAGDLVLYP----SSSLHCV 160 +++ L+ ++L+D ++GGE + Q V AG ++++P +S +H Sbjct: 375 SVKSYLTFLIYLND--DFEGGETLFYIPSSQGMSARGVVPKAGAVLVFPQGNTASLIHEG 432 Query: 161 TPVTRGVRVAS 171 + V G + Sbjct: 433 SAVANGTKYVV 443 >UniRef50_C1MLG0 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MLG0_9CHLO Length = 750 Score = 112 bits (281), Expect = 7e-24, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 61/221 (27%), Gaps = 57/221 (25%) Query: 4 HIPGVLSPQDVARFREQLE----QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 LS + ++ DG+++ G + L A++ Sbjct: 533 VFDHFLSAVECDDLVAIAAPDLRRSRVTDGKLSEGRTSSST-FLTGCKQEEPLVRAIEQR 591 Query: 60 VLNAVNQHALFFAA-------------------------ALPRTLSTPLFNRYQNNETYG 94 +L AV L A L + RY + Y Sbjct: 592 LLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEGQMYT 651 Query: 95 FHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVN-----------DTFGQHRVK 143 H D + G +R + ++L+D + GG R+ Sbjct: 652 AHYDN------KQGCLRRTATFMMYLTDV--HSGGATHFPRAVPVSMRDGCGDAAGIRIW 703 Query: 144 LPAGDLVLYPS--------SSLHCVTPVTRGVRVASFMWIQ 176 G +++ S SLH PV G + + W++ Sbjct: 704 PKRGRALVFWSVSGGIEDVRSLHEAEPVIEGEKWIATKWLR 744 >UniRef50_A3PDM8 Putative uncharacterized protein n=2 Tax=root RepID=A3PDM8_PROM0 Length = 186 Score = 112 bits (281), Expect = 7e-24, Method: Composition-based stats. Identities = 27/103 (26%), Positives = 43/103 (41%), Gaps = 6/103 (5%) Query: 77 RTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDT 136 + + + Y + Y +HVD G +S TLF+++P Y+GGE + Sbjct: 78 KGIEPIQYGIYSDGGKYDWHVDQG-AKMFLKGGSVRKISMTLFINNPDEYEGGEFDLELF 136 Query: 137 FGQHR-----VKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 + KL G + + S H V PV+ GVR + W Sbjct: 137 PPEKEPRYETFKLKKGSAIFFQSDVWHRVRPVSSGVRKSLVAW 179 >UniRef50_C7YTN2 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YTN2_NECH7 Length = 984 Score = 112 bits (281), Expect = 8e-24, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 70/229 (30%), Gaps = 28/229 (12%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQNEVLNA 63 LS + + QA + G T V+N ++D + A + Sbjct: 79 LSEFQACQMIAKARQAPYGKGSET--IVDTSVRNTWELDPSQFELRDPTWTAQVQILCKQ 136 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 V + Y+ + H DG+ P D+ TL + P Sbjct: 137 VAKTLGING---NIKAELYKMLIYEKGAMFKAHTDGSTEKIP-------DMFGTLVVCLP 186 Query: 124 QSYDGGELVVNDTFGQHRVKL--PAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRD 181 ++ GG++V+ H + A + S H V PVT G R ++ D Sbjct: 187 STHQGGDVVLRHNGQAHVFRSSDHAQSCAFWYSDVSHEVLPVTSGYRWVLTY---NLALD 243 Query: 182 DKKRAMLFEL------DNNIQSLKSRYGESEEILSLLNLYHNLLREWSE 224 + L Q+L + YH L +++E Sbjct: 244 SAQPRPSASLLSQVNTQPLRQALNRWLAQDPTTRENEYFYHVLDHDYTE 292 >UniRef50_Q84406 A85R protein n=4 Tax=Chlorovirus RepID=Q84406_PBCV1 Length = 242 Score = 112 bits (280), Expect = 9e-24, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 60/189 (31%), Gaps = 19/189 (10%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN--QQVDTRSTLYA---ALQNE 59 I G LS + + + V + +K + + ++ + ++ Sbjct: 55 IDGFLSDIECDVLINAAIKKGLIKSEVGGATENDPIKLDPKSRNSEQTWFMPGEHEVIDK 114 Query: 60 VLNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + + + + RY+ + Y H DG + + + Sbjct: 115 IQKKTREFLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKD-QRLATLMV 173 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY----------PSSSLHCVTPVTRGVR 168 +L P+ GGE + ++K G + + +LH PV G + Sbjct: 174 YLKAPEEGGGGETDFPTL--KTKIKPKKGTSIFFWVADPVTRKLYKETLHAGLPVKSGEK 231 Query: 169 VASFMWIQS 177 + + WI++ Sbjct: 232 IIANQWIRA 240 >UniRef50_B7G8R4 Putative uncharacterized protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G8R4_PHATR Length = 427 Score = 112 bits (280), Expect = 9e-24, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 27/189 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR---STLYAALQN 58 + + LS ++ R E + + N A+ + Sbjct: 240 VIMLDNALSSEEADRLIELGGIEGYERSADVAETNSGRTSTNAWCQHDCYKDPTARAVMD 299 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V N + + L +Y+ ++ Y H D + + L+ Sbjct: 300 RVAN-ITSIPEVNSEYL-------QMLQYEKSQFYQTHSDYIPYQVNRPTGV-RILTFYF 350 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPVTRGV 167 +LSD + +GG V G VL+PS S H PV +GV Sbjct: 351 YLSDVE--EGGGTNFPKLG--LTVTPKKGRAVLWPSVLDDEPNQKDARSDHQALPVIKGV 406 Query: 168 RVASFMWIQ 176 + + WI Sbjct: 407 KYGANAWIH 415 >UniRef50_D2V0A9 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0A9_NAEGR Length = 576 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 29/289 (10%), Positives = 74/289 (25%), Gaps = 80/289 (27%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG---------AQVKNNQQVDTRS-T 51 ++ + +L P + + ++ + V + ++ + +N+ ++ Sbjct: 97 LFVLNELLHPDECCQILSIEKKQKKQLDEVKSLSKDFESIQAEYPPEYRNSSRIIYNDKE 156 Query: 52 LYAALQNEVLNAVNQHALFFAAALPRT--------LSTPLFNRYQNNETYGFHVDGAVRS 103 L L + + ++ L ++Y+ + HVDG Sbjct: 157 LAQRLWRRMKKHLVRYTFVRPYGLDNGGHWIPVGVNECFRMSKYEPGNYFKPHVDGQFVR 216 Query: 104 HPQNGWMRTDLSATLFLSDPQSYDGGELVV----------NDTFGQHRVK---------- 143 + R+ + ++L+ + + GGE +VK Sbjct: 217 NTDE---RSVYTLLIYLN--EDFTGGETRFLTVVNNVEEGQGLDSSKKVKKLSRSDKKMA 271 Query: 144 -------------------------------------LPAGDLVLYPSSSLHCVTPVTRG 166 G ++ H PVT G Sbjct: 272 KKKFAKEATKLNRFNDNPEKSLKENSTLQFKHLCSVSPKIGSAAIFNHDLYHEGCPVTNG 331 Query: 167 VRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNLY 215 ++ I D + + +S+++ +Y Sbjct: 332 IKYILRTEIMFKRVDSNSVVKNKNDTKLYNQVMAILKQSDDLEKKGKIY 380 >UniRef50_C1ECX9 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1ECX9_9CHLO Length = 380 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 60/199 (30%), Gaps = 27/199 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLY------AA 55 + + L+P + + + + G ++ ++ S A Sbjct: 140 ILTVDDFLTPDECDALIDAAASSGEMRVSAVGGVDNVNIRTSKTCTLDSPALTDHPTKKA 199 Query: 56 LQNEVLNAVNQHALFFAA---------ALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 + + + Q A A+ P + P YQ E + H D + Sbjct: 200 ILTKAEALLPQLAGLSASKSAFKPPTSQSPYSFELPQVAHYQGGEYFKTHEDAFPEAVAS 259 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSSL 157 + + ++L+D +GG + V G ++L+ + +L Sbjct: 260 RKGYQRRATVLVYLNDVS--EGGATRFDKLSPPLDVTPRKGRMLLFFPGTKASMPDARTL 317 Query: 158 HCVTPVTRG-VRVASFMWI 175 H G + S +W+ Sbjct: 318 HTALEAVPGHEKWISQLWV 336 >UniRef50_D2V7Q2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V7Q2_NAEGR Length = 312 Score = 111 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 74/217 (34%), Gaps = 50/217 (23%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVT--TGAQGAQVKNNQ--QVDTRSTLYAALQN 58 + + V++P + F E E+ + ++ G V NN Q+ + L Sbjct: 101 FVLENVITPLECKLFVEISEKMGYKPSPLSVLAGKFDTSVINNSTKQIRDSERILTDLPE 160 Query: 59 EVLNAVNQHALFFAAALPRTL----------------STPLFNRYQNNETYGFHVDGAVR 102 +V+ +N+ LP + FN+Y + +G H+D R Sbjct: 161 KVIEVLNKR---IEHLLPEKVDIYGEEWTLRKNTPINERIRFNKYGVTQKFGPHMDAGYR 217 Query: 103 SHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH----------RVKLPAGDL-VL 151 + T L+ +L+ + + GGE +H R+ G + V Sbjct: 218 KNDHEM---TQLTIIFYLN--EDFKGGETTFFPGGRRHLLEEATVQEVRIVPKIGLVSVF 272 Query: 152 YPSSSL---HCVTPVTRGVRVAS--------FMWIQS 177 + + L H +PV G + W++S Sbjct: 273 FQNGKLNHRHEGSPVIEGFKYIIRSDIAYIKSSWLES 309 >UniRef50_Q6LGS5 Putative uncharacterized protein NCU03445 n=1 Tax=Photobacterium profundum RepID=Q6LGS5_PHOPR Length = 255 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 64/197 (32%), Gaps = 30/197 (15%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + + VL P++V R + Q + + + + + +N + + Sbjct: 53 FQLRDVLLPEEVTRILDAANQLGFTEDAAVSLPRKVRHNSNLNLIVDPATLELIWQRCQA 112 Query: 63 A-VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRS---------HPQNGWMRT 112 V++++ F A F RY+ + + H DG+ G + Sbjct: 113 HFVDKYSHFAGKAPLGINGRFRFYRYEEGDYFKMHTDGSWPGSQVVNGELVDDAFGDRWS 172 Query: 113 DLSATLFLSDPQSYDGGELVV-------------NDTFGQHRVKLPAGDLVLYPS----- 154 + + LSD + GGE ++ V+ P+G ++ +P Sbjct: 173 MYTFLILLSD--DFVGGETQFMVNRDDPTKPALYQESANIESVRTPSGSVLCFPHGTHPI 230 Query: 155 SSLHCVTPVTRGVRVAS 171 LH + G + Sbjct: 231 HCLHGSAQILSGTKYII 247 >UniRef50_A0NVC8 Oxidoreductase domain protein n=2 Tax=Labrenzia RepID=A0NVC8_9RHOB Length = 182 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 57/177 (32%), Gaps = 10/177 (5%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN---NQQVDTRS--TLYAALQ 57 Y + S +D + G + G Q +++ + D + + + Sbjct: 4 YSYDALFSQKDCDEIIRLSQAEAQDTGGLVGGKQQGEIRRARISWLDDEGTAGWVMDRVM 63 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 V A + F L +++ + Y +H D + Sbjct: 64 TAVAKANREAFDFDITEFREKLQVAIYDESEEG-HYDWHSDVG----EGPIAQFRKATIV 118 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMW 174 LS +Y+GG L ++ G L+ S LH V PVT+G R + W Sbjct: 119 TQLSPSDAYEGGALEISLGHKVMAASRDQGCATLFASFMLHRVVPVTKGTRYSLTCW 175 >UniRef50_A4RTV5 Predicted protein n=2 Tax=Ostreococcus RepID=A4RTV5_OSTLU Length = 225 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 64/200 (32%), Gaps = 34/200 (17%) Query: 1 MMYHIPGVLSPQDVARFREQL----EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAAL 56 +++ + LS ++ + E +++ DG+++ G L + Sbjct: 29 LLFVLEDFLSEEEGDQLIEIARPSMQRSRVTDGKLSEGRTSTSTFLTGARAHDD-LVLEI 87 Query: 57 QNEVLNAVNQHALF---FAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 + + A+ + + +Y E Y H D + G ++ Sbjct: 88 ERRIQAAIRLPLIVERRKNVKVMYQHEPMQIVQYGPTERYTAHYDN------RAGSLKRS 141 Query: 114 LSATLFLSDPQSYDGGELVVN---------DTFGQHRVKLPAGDLVLYPS---------S 155 ++ +L +P+ +GG T RV G +L+ + Sbjct: 142 MTFMCYLQEPE--EGGATFFPKCVPLCGCDSTTLGIRVFPKRGRAILFWNVGENGQEAMR 199 Query: 156 SLHCVTPVTRGVRVASFMWI 175 SLH PV G + W+ Sbjct: 200 SLHEAQPVVSGKKAIFTQWL 219 >UniRef50_A2WFK5 Putative uncharacterized protein n=1 Tax=Burkholderia dolosa AUO158 RepID=A2WFK5_9BURK Length = 285 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 50/189 (26%), Gaps = 20/189 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + VL + ++ V +V + + ++ ++ Sbjct: 98 IVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIR-HRTSHGTWF-QNGEDALI 155 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLSA 116 + RY Y H D + + + Sbjct: 156 RRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTGGQRVATL 215 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSSLHCVTPVTRGV 167 ++L+D GGE V + V GD V + ++LH PV G Sbjct: 216 IVYLNDVP--SGGETVFPEAGI--SVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGE 271 Query: 168 RVASFMWIQ 176 + W++ Sbjct: 272 KWIMTKWVR 280 >UniRef50_Q0D7Z6 Os07g0194500 protein n=4 Tax=Oryza sativa RepID=Q0D7Z6_ORYSJ Length = 319 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 77/236 (32%), Gaps = 38/236 (16%) Query: 2 MYHIPGVLSPQDVARFREQLE---QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 ++ G LS + + + Q V + + ++V+ + Q+ Sbjct: 66 VFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTS-----SGMFLDKRQD 120 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V++ + + + RY++ + Y H D Q + + Sbjct: 121 PVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYATVLM 180 Query: 119 FLSDPQSYDGGELVVNDTFG----------------QHRVKLPAGDLVLYPS-------- 154 +LS + GGE V + G VK GD VL+ S Sbjct: 181 YLSTVE--KGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 238 Query: 155 -SSLHCVTPVTRGVRVASFMW--IQSMIRDDKKRAMLFELDNNIQSLK-SRYGESE 206 SLH PV G + ++ W I+S + DN+ + K + GE E Sbjct: 239 PLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSKVTEGCSDNSARCAKWAEAGECE 294 >UniRef50_Q94H92 Os03g0761900 protein n=23 Tax=Embryophyta RepID=Q94H92_ORYSJ Length = 310 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 68/204 (33%), Gaps = 43/204 (21%) Query: 5 IPGVLSPQDVARFREQLEQ------AEWVDGRV---TTGAQGAQVKNNQQVDTRSTLYAA 55 P + Q + +Q G T G + + + + A Sbjct: 112 FPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGIRTSSGTFLSSDEDPTGTLAE 171 Query: 56 LQNEVLNAVNQHALFFAAALPRTL-STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 ++ ++ A +PR RY+ + Y H D + + Sbjct: 172 VEKKIAKA---------TMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRVA 222 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQH-------------RVKLPAGDLVLYPS------- 154 S L+L+D + +GGE + G++ +VK GD +L+ S Sbjct: 223 SFLLYLTDVE--EGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTI 280 Query: 155 --SSLHCVTPVTRGVRVASFMWIQ 176 +SLH PV +G + + WI+ Sbjct: 281 DPTSLHGSCPVIKGEKWVATKWIR 304 >UniRef50_B7HCK8 Prolyl 4-hydroxylase, alpha subunit domain protein n=82 Tax=Bacillales RepID=B7HCK8_BACC4 Length = 216 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 ++ + VLS ++ E + + V + + L +++ Sbjct: 40 LIVVLANVLSDEECGELIEMSK----NKMERSKIGSSRDVNDIRTSSGAFLEDNELTSKI 95 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 ++ A+ Y+ ++ Y H D H ++ + ++L Sbjct: 96 EKRISSIMNVPAS----HGEGLHILNYEVDQQYKAHYDY-FAEHSRSAANNRISTLVMYL 150 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS---------SLHCVTPVTRGVRVAS 171 +D + +GGE V G V + +LH PVT+G + + Sbjct: 151 NDVE--EGGETYFPKLN--LSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKWIA 206 Query: 172 FMWIQ 176 W++ Sbjct: 207 TQWVR 211 >UniRef50_Q3AVV8 Prolyl 4-hydroxylase, alpha subunit n=8 Tax=Bacteria RepID=Q3AVV8_SYNS9 Length = 277 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 55/190 (28%), Gaps = 28/190 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEV 60 +Y +P +LS + E + A + VT G+ + + L L Sbjct: 94 LYELPSLLSSLECQELIEAINSA-LLPSTVTRGSNDYRTSRTCHLRHNHPVLSRRLDQRF 152 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVR-----SHPQNGWMRTDLS 115 + + F RY E + H D H + + Sbjct: 153 ADLLGVDPRFS--------EPIQGQRYDPGEYFKQHTDWFSPGTKEFDHNTRNGGQRTWT 204 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRG 166 ++L+ + GGE R G + + + ++LH PV G Sbjct: 205 VMVYLNAVE--SGGETWFQHLD--QRFTPRPGLGLAWNNLQEDGTPNRNTLHEAIPVAVG 260 Query: 167 VRVASFMWIQ 176 + W + Sbjct: 261 SKWVITKWFR 270 >UniRef50_A8TUG3 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TUG3_9PROT Length = 355 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 62/189 (32%), Gaps = 22/189 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAE--WVD-------GRVTTGA-QGAQVKNNQQVDT--- 48 + +P VLSP+D R +VD GR T + + N ++D Sbjct: 170 VLVVPDVLSPEDCRRLISIYAMQGQEFVDPGHNQLKGRTTDCKMRIPEYGRNDRIDHWVC 229 Query: 49 RSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQN--NETYGFHVDGAVRSHPQ 106 + + ++ + Y+ + H D + Sbjct: 230 STANNNIIDARLVPRLMPEIHKAFQYKVTRHERYRIACYEGYRGGSQHGHRDNTLPFVAH 289 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRG 166 + T+ L+ Y+G EL + F + K P G +++ S LH V + G Sbjct: 290 -----RRFAVTINLN-ADEYEGAELRFPE-FSEAAYKTPTGSAIVFSCSLLHEVMAMRSG 342 Query: 167 VRVASFMWI 175 R A ++ Sbjct: 343 RRFALLAFL 351 >UniRef50_D0J124 2OG-Fe(II) oxygenase n=3 Tax=Comamonadaceae RepID=D0J124_COMTE Length = 306 Score = 110 bits (274), Expect = 4e-23, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 57/187 (30%), Gaps = 18/187 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + +LS ++ + +T Q N + + +N+++ Sbjct: 121 VVVFGNLLSDEECDAIIAAAR--PRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 178 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLSA 116 + V Q Y+ Y H D + P + + Sbjct: 179 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 238 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP-------SSSLHCVTPVTRGVRV 169 ++L++P GG D ++ G+ V + + +LH PV G + Sbjct: 239 VMYLNEPA--RGGATTFPDVG--LQIVPRRGNAVFFSYNRPDPATKTLHGGAPVLEGEKW 294 Query: 170 ASFMWIQ 176 + W++ Sbjct: 295 IATKWLR 301 >UniRef50_A8I7G7 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8I7G7_CHLRE Length = 236 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 60/209 (28%), Gaps = 44/209 (21%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVD------------GRVTTGAQGAQVKNNQQVDTRS 50 + + G LS ++ A+ E+ + G + Sbjct: 18 FVLVGALSREECAQIMACAEEMGYTQLAAWVSSLLVRCAGNVAGPLHTHTEPPWFPRLSK 77 Query: 51 TLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSH------ 104 L + + + + Q L RY Y HVDGA Sbjct: 78 ACSGPLYDRIAHLLPQRL--CGGDLAGINCRWRLYRYDKGAVYRPHVDGAWPGSGLKDGR 135 Query: 105 ---PQNGWMRTDLSATLFLSDPQSYDGGELVVNDTF--------------GQHRVKLPAG 147 G + L+ ++L+D ++GG H V AG Sbjct: 136 YEFDAYGDRWSRLTFLVYLND--DFEGGATTFYTPAQPGTRGGTASGACLEAHSVGPVAG 193 Query: 148 DLVLYPS-----SSLHCVTPVTRGVRVAS 171 +++++P S +H VT+G + Sbjct: 194 NILVFPHGDTMGSLVHEGAAVTQGSKYVI 222 >UniRef50_Q1NF66 2OG-Fe(II) oxygenase n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NF66_9SPHN Length = 229 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 64/191 (33%), Gaps = 29/191 (15%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQ--VDTRSTLYAALQNE 59 ++ LSP + A R ++ A + +G+ A + + + R L + Sbjct: 47 IFGRQDFLSPDECAELRRLID-ANAQPSTLFSGSANADYRTSHSGNLSPRDPLVERITQR 105 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR-----TDL 114 + A+ L RY + Y H D + MR Sbjct: 106 IC-ALTGLPAINGETLQGQ-------RYTPGQEYKVHCDYFPATADYWQRMRGTGGQRTW 157 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTR 165 +A ++LS ++ GGE + V G ++++ + SLH PV R Sbjct: 158 TAMIYLSAVEA--GGETHFPQ--CEFMVPPVEGMILIWNNMDRDGAPNRFSLHAALPVER 213 Query: 166 GVRVASFMWIQ 176 G + W + Sbjct: 214 GTKYVVTKWFR 224 >UniRef50_D2QI60 2OG-Fe(II) oxygenase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QI60_9SPHI Length = 328 Score = 109 bits (273), Expect = 7e-23, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 64/188 (34%), Gaps = 35/188 (18%) Query: 6 PGVLSPQDVARFREQLE-QAEWVDGRV-----TTGAQGAQVKNNQQV-DTRSTLYAALQN 58 P S + A + E + + ++ T + + + D + ++ A+ Sbjct: 156 PHFFSADECAYIIQYAEEKTLFTRSQLEYDDNTVNESDTRTSYSAFLKDRQHPVFQAIYE 215 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V ++ + + RY + + H D +H + + Sbjct: 216 RVAASLKVDLNY--------IEPLQCVRYGEGQQFKPHFDSMSANH-------RLHTMLV 260 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS---------SLHCVTPVTRGVRV 169 +L+D + GGE + V G + + + S+H P+ +G++ Sbjct: 261 YLND--DFVGGETYFPELN--MNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKY 316 Query: 170 ASFMWIQS 177 A +WI++ Sbjct: 317 ACNIWIRN 324 >UniRef50_Q5GQX6 Putative uncharacterized protein n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQX6_BPSYP Length = 200 Score = 109 bits (273), Expect = 7e-23, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 55/168 (32%), Gaps = 21/168 (12%) Query: 25 EWVDGRVTTGAQGAQVKNNQQ-----VDTRSTLYAALQNEVLNAVNQHALFFAAALPRTL 79 + GRV +G +V + +D + + + + A + F + Sbjct: 31 QLNPGRVMSGGNFKEVVRIRDCKSVGLDAQHWFVGMIWHHISRANMHNFQFDITSFDNDN 90 Query: 80 STPLFNRYQNNETYGFHVDGA----VRSHPQNG-------WMRTDLSATLFLSDPQSYDG 128 F Y Y +H D P +G LS +L L+D Y+G Sbjct: 91 VE--FLSYDKGGHYAWHCDDFCGRYSPQVPADGLKTEVYNEYSRKLSFSLLLND--DYEG 146 Query: 129 GELVVNDTFGQHRVKLP-AGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 GE + G L+++ S +H V V G R W+ Sbjct: 147 GEFQIYFPPHHMITIPKEKGKLIIFDSRCVHRVRKVKSGTRDVLVGWV 194 >UniRef50_B2B745 Predicted CDS Pa_2_9860 n=1 Tax=Podospora anserina RepID=B2B745_PODAN Length = 323 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 78/234 (33%), Gaps = 61/234 (26%) Query: 3 YHIPGVLSPQDVARFREQLEQA---------------EWVDGRVTTGA----QGAQVKNN 43 + + VLSP + + E E++ W V G + + Sbjct: 84 FILENVLSPSECGQLIEYAEESVPLDSPTAASGANNGPWSPALVNMGPGFELYEPSYRRS 143 Query: 44 QQVDTRS-TLYAALQNEVL---------NAVNQHALFF--------------AAALPRTL 79 ++ + + + L + H + R + Sbjct: 144 DRIIWDTKEVADRIWERCLSVEGLRREIEVIQGHERIKKVTGRGEWHGDGGRGRWVMRRM 203 Query: 80 -STPLFNRYQNNETYGFHVDGAVR-SHPQNGWMRTDLSATLFLS-------DPQSYD--G 128 F RY+ + H D A S + ++T L+ ++L+ DP S + G Sbjct: 204 NERLRFLRYEKGGFFQPHCDSAYYASMDKEQVVKTLLTVHIYLNDCKATAEDPDSTELVG 263 Query: 129 GELVVNDTFGQHR--VKLPAGDLVLYPSS-SLHCVTPVTRGVRVASFMWIQSMI 179 G + + + R V+ AG ++++ S LH V +GV+ + ++S + Sbjct: 264 GATTLFSSDEKRRYDVECKAGRVLVFQHSAVLHSGDEVKQGVKFS----VRSDV 313 >UniRef50_Q54TF2 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54TF2_DICDI Length = 781 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 69/219 (31%), Gaps = 28/219 (12%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQ-VDTRSTLYAALQNEVL 61 + + VL+ ++ F E+ E+ +V +NN + + L L + Sbjct: 51 FKVTNVLTKEECLHFIEESERKGYVSIEK---EFPTGYRNNLRYLGKSDKLSDILWERLE 107 Query: 62 NAVNQHAL---------FFAAALPRTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 + L +P + F++Y + H D +P R Sbjct: 108 AIFRESDLEGVRPYGFDQKGVWIPIGIDNCFTFSKYLPGSRFKAHYDAVFADNPDR---R 164 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFG---------QHRVKLPAGDLVLYPSSSLHCVTP 162 + + ++L+D + GG + + V +G +++ +LH Sbjct: 165 SIYTIQIYLNDG--FKGGNTNFFTSENPLLLDKHVLEDTVVPESGSAIIFNHDTLHDGGE 222 Query: 163 VTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSR 201 V GV+ + + + D + + L Sbjct: 223 VLEGVKYIVRVDMMFLRIDKNLDEEISAQEKAEMELAKT 261 >UniRef50_Q6C0K2 YALI0F23969p n=1 Tax=Yarrowia lipolytica RepID=Q6C0K2_YARLI Length = 232 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 66/189 (34%), Gaps = 19/189 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDG--------RVTTGAQGAQVKNNQQVDTRSTLY 53 +Y I G L + + + + + D +T A N++ + + Sbjct: 45 IYVIHGFLPAKVCNDLVQTIVKTDADDTHTFKMETTPLTKRKDYAARVNDRGAVEDTGIC 104 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 L +++ + L+ RY + + H D AVR + T Sbjct: 105 NYLWHQLEPIIESDPELSEFKSAFGLNPNIRMYRYTPGQFFDQHYDEAVRCKVGSTVCTT 164 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQ-HRVKLPAGDLVLYPSS---SLHCVTPVTRGVR 168 + L+LS+ Q GG+ + + G+ + V+ G ++L+ LH VT G + Sbjct: 165 RWTLLLYLSECQ---GGQTMFYEDGGKSYEVQPTNGSVLLHKHGEDCLLHEGREVTAGEK 221 Query: 169 VAS---FMW 174 W Sbjct: 222 WILRSDIAW 230 >UniRef50_D2VTG5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VTG5_NAEGR Length = 943 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 75/205 (36%), Gaps = 26/205 (12%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQNEVL 61 ++ + + + + GR+ V+ + Q+D + + + +L +++ Sbjct: 103 PIIYEEQAKDLIKNCSMSPF--GRLDKTIYDESVRKSWQLDPQRFKITNPKWNSLIEDLV 160 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 N + L A S Y+ + FH D + + ATL + Sbjct: 161 NNNVKKDLGIAQEKKIGFSLYKMLLYEEGGFFDFHRDTEKENG---------MIATLVVQ 211 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDL------VLYPSSSLHCVTPVTRGVRVASFMWI 175 P S+ GGE+VV ++ K D V + H V V G R+A I Sbjct: 212 LPSSFTGGEIVVRHKEKENIYKTSE-DATFNPYYVSFYCDVSHKVETVKSGYRLAL---I 267 Query: 176 QSMIRDDKKRAMLFELDNNIQSLKS 200 +++ + F+ D ++ +++ Sbjct: 268 YNLVYSGADKIQAFDSDTQLKQIET 292 >UniRef50_D2V353 2OG-Fe(II) oxygenase family protein n=1 Tax=Naegleria gruberi RepID=D2V353_NAEGR Length = 725 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 62/184 (33%), Gaps = 24/184 (13%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAA-LQNEVL 61 Y I +LS ++ EQ E+ + D + ++N++V + A L V Sbjct: 540 YLIHNLLSEEECQHLIEQTEKLSYED----LYGYAKEYRSNKRVIVEDKISAQILFERVK 595 Query: 62 NAVNQH----ALFFAAALPRTLSTPLFNRYQNNETY-GFHVDGAVRSHPQNGWMRTDLSA 116 + Q L S F +Y + + H DG + G R+ + Sbjct: 596 SMAPQVYVDPESGDTWDLAFLNSRWRFCKYTPGQHFLAPHYDGCIELD---GDTRSFYTF 652 Query: 117 TLFLSDPQSYDGGELVV---------NDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGV 167 +L+ Y+ G + V+ G +++P + LH + + G Sbjct: 653 MFYLN--GDYEEGRTTFIKSHTFPPAPPFEIKGHVEPEPGLCIIFPHNILHYGSVLKSGT 710 Query: 168 RVAS 171 + Sbjct: 711 KYLM 714 >UniRef50_B8CBF7 Putative uncharacterized protein n=1 Tax=Thalassiosira pseudonana RepID=B8CBF7_THAPS Length = 248 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 60/200 (30%), Gaps = 29/200 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAA---LQN 58 ++ + +S + + + T + ++ + + +Y + + Sbjct: 46 IFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMNTWIYREETAIID 105 Query: 59 EVLNAVNQHALFFAAALPRT--------------LSTPLFNRYQNNETYGFHVDGAVRSH 104 + V A L R Y E Y H D Sbjct: 106 TIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPGEEYTAHHDFGYTHM 165 Query: 105 PQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP-------SSSL 157 ++ L+L+D + +GGE +G VK G VL+ S L Sbjct: 166 SAPHQPSRSINMLLYLNDVE--EGGETSFP-RWGGLDVKPVKGKAVLFYMLTADGNSDDL 222 Query: 158 --HCVTPVTRGVRVASFMWI 175 H PV +G + S +WI Sbjct: 223 SQHAALPVIKGEKWMSNLWI 242 >UniRef50_D2V646 Putative uncharacterized protein n=1 Tax=Naegleria gruberi RepID=D2V646_NAEGR Length = 563 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 64/176 (36%), Gaps = 26/176 (14%) Query: 2 MYHIPGVLSPQDVARFREQL-----EQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAA 55 + IP +LS D + E++E + +V+N++QV+ L Sbjct: 42 IVSIPKLLSKADCEEQIRKSYDYTFEESEVGNSSRKKNVGNKKVRNSEQVELNDEELSKK 101 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 + + +A+ + + L Y+ + + H+D + ++ Sbjct: 102 VFVKCEDAI------------KKMVGCLQTLYKKGQFFNSHIDSGRNIEGHP----SFIT 145 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 ++L+D ++GGE V D ++ G VL+ H +T+ + Sbjct: 146 VLIYLND--DFEGGETVFEDEDVT--IEPELGKCVLFLHQIKHTAEEITKNTKFVI 197 >UniRef50_B7J8P3 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Acidithiobacillus ferrooxidans ATCC 23270 RepID=B7J8P3_ACIF2 Length = 248 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 16/180 (8%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQ 66 G+L+P++ + VT G +V + ++V + + + Sbjct: 70 GLLTPENCQNLIAIGQSL-LRPATVTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEG 128 Query: 67 HALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSATLFLSDPQS 125 A + Y+ Y H D P + L+L+ + Sbjct: 129 IAQLTGIPIDCQ-EPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQATLILYLNAVE- 186 Query: 126 YDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVASFMWIQ 176 +GGE + +V G V + + SLH PV +G + + WI+ Sbjct: 187 -EGGETAFPELG--LQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEKWIATQWIR 243 >UniRef50_C5FBE9 Putative uncharacterized protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FBE9_NANOT Length = 910 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 59/180 (32%), Gaps = 21/180 (11%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV-----DTRSTLYAALQN 58 + LSP+D + ++ + G T V+ ++ D ++ + Sbjct: 67 VVDLPLSPEDAKAVVDLCHRSPFGKGAET--LVDTSVRKCWELNVADFDLKAPGWGNYMK 124 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 +V+ V++ A Y+ + H D + ATL Sbjct: 125 KVVADVSKGLGIAHQADSIRADPYKLLLYEEGAFFLSHQDSPKADG---------MFATL 175 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKL----PAG-DLVLYPSSSLHCVTPVTRGVRVASFM 173 + P ++GGE+++ K AG + S H V P+T G R+ Sbjct: 176 VVCLPTKHEGGEIILKHDDKSLVFKTSTTSRAGFSYAAWYSDVFHEVQPITAGYRLVLTY 235 >UniRef50_Q2G5T2 2OG-Fe(II) oxygenase n=4 Tax=Sphingomonadales RepID=Q2G5T2_NOVAD Length = 257 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 61/191 (31%), Gaps = 26/191 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 ++ + S ++ A+ ++ + + VD LQ + Sbjct: 76 IFGVADFFSAEECAKLMAIVDSVARPSPTYSGTDASGRTSYTGDVDPFDPFVLMLQRRID 135 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHP-----QNGWMRTDLSA 116 + + F T RY + + H D + S P Q + +A Sbjct: 136 DLMGIDPSFG--------ETIQGQRYAPGQEFRGHYDHFLPSQPFWDAEQKRGGQRSWTA 187 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCVTPVTRGV 167 +L+ + +GG+ + G L+++ ++++H PV RG Sbjct: 188 MAYLNAVE--EGGQTEF--ARVNLSIPPQPGALLIWNNMKPDGTPNANAMHAGMPVVRGT 243 Query: 168 RVASFMWIQSM 178 + W +S Sbjct: 244 KYVLTKWYRSR 254 >UniRef50_B0DEZ6 Predicted protein n=2 Tax=Agaricomycetes RepID=B0DEZ6_LACBS Length = 1203 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 56/175 (32%), Gaps = 14/175 (8%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT------RSTLYAALQNEVLN 62 LS +D QA + G T +V++ +++ +QN + Sbjct: 261 LSERDAKSIISCSAQAPFGHGERTV--VDREVRDTWEIEPSNLKFLNPAWEPYIQNLAMT 318 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD-LSATLFLS 121 +V Q + Y+ D P + D + ATL + Sbjct: 319 SVWQGLGVVPYSTLPKCELYKLLLYETGSQSRIMADDFFSFLPHQDTQKADGMFATLIIV 378 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGD-----LVLYPSSSLHCVTPVTRGVRVAS 171 P Y GGE+ V + L + + + H V PVT G R+A Sbjct: 379 LPSLYTGGEVHVTHASKTMVIDLSPNSLLSTCALAWYTDVKHEVKPVTSGYRLAL 433 >UniRef50_B5JX69 2OG-Fe(II) oxygenase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JX69_9GAMM Length = 218 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 64/185 (34%), Gaps = 19/185 (10%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN--NQQVDTRSTL----YAALQ 57 I LS R R +++Q +W D +G ++ +++ D L A Q Sbjct: 28 VIDQFLSSDLCQRLRGEIQQLDWRDMPRAGVGRGEDYQHQQSERSDHIRWLRGNSTAQCQ 87 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + A+ + F YQ Y H+D G +S Sbjct: 88 FLAQMEGLRAAINQQLFMGLFEYEAHFALYQPGAFYKKHLDAF------KGRRNRIVSTV 141 Query: 118 LFLSDPQS-YDGGELVVN----DTFGQHRVKLPAGDLVLYPS-SSLHCVTPVTRGVRVAS 171 +L++ D GELV+ R+ AG LV++ S H V P R R + Sbjct: 142 CYLNEHWHSDDAGELVIYRADQHALEHQRILPQAGRLVVFSSEDYPHEVLPSHR-ERYSI 200 Query: 172 FMWIQ 176 W + Sbjct: 201 AGWFR 205 >UniRef50_B5Y4Z8 Predicted protein n=3 Tax=Bacillariophyta RepID=B5Y4Z8_PHATR Length = 508 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 66/216 (30%), Gaps = 44/216 (20%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA-------QGAQVKNNQQVDT-----R 49 ++ + LS +V + + + G + N + Sbjct: 290 VFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDWIPRHQDLIT 349 Query: 50 STLYAALQNEVL--NAVNQH------ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAV 101 T+Y + + A+ + F + + + YQ + Y H D + Sbjct: 350 DTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISIS-ERLQLVNYQVGQQYTPHHDFTM 408 Query: 102 RSHPQNGWMRTDLSATLFLSDPQSYDGGELVVN------DTFGQHRVKLPAGDLVLYPS- 154 R + +L+D DGGE + G +VK G +L+ + Sbjct: 409 PGLVNMQPSRF-ATLLFYLND--DMDGGETAFPRWLHADEEGGSLKVKPEKGKAILFYNL 465 Query: 155 --------SSLHCVTPVTRGVRVASFMWIQSMIRDD 182 S H PV RG + W+ +++R Sbjct: 466 LPDGNYDERSEHAALPVRRGEK-----WLTNLVRAS 496 >UniRef50_A0YIP7 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YIP7_9CYAN Length = 301 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 61/192 (31%), Gaps = 25/192 (13%) Query: 5 IPGVLSPQDVARF--REQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 I LSP++ + +++ TT A + + Y ++N++LN Sbjct: 111 IENFLSPEENQEILKIALSKSDQFIGSTTTTQAVNYRQSSILYATLFPEFYNLMRNKILN 170 Query: 63 AVNQH-ALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 A+ + + + Y H D L+ + Sbjct: 171 ALPDILPQLNHQPFNVSQVEMQLTAHNDGCFYKIHNDSGS-----EKTYTRTLTYVYYFH 225 Query: 122 -DPQSYDGGELVVNDTFGQ----------HRVKLPAGDLVLYPSSSLHCVTPVT------ 164 +P+ + GGEL + +T + ++ +VL+ S H V PV Sbjct: 226 QEPKQFSGGELRLYETELKNGSAISQGQYKTIEPRNNSIVLFDSRCKHEVMPVRCPSQRF 285 Query: 165 RGVRVASFMWIQ 176 R W++ Sbjct: 286 EDGRFTLNGWLR 297 >UniRef50_A4RVD9 Predicted protein n=2 Tax=Ostreococcus RepID=A4RVD9_OSTLU Length = 330 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 63/224 (28%), Gaps = 45/224 (20%) Query: 3 YHIPGVLSPQDVARFREQLEQ--AEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 Y + LS ++ + ++ A + + ++ + Q+++ Sbjct: 52 YLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSDIRTS-----AGMFLRKGQDKI 106 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAV--RSHPQNGWMRTDLSATL 118 + A+ + + RY + Y H D + + + + Sbjct: 107 VKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPKRGGQRLATMLI 166 Query: 119 FLSDPQSYDGGELVVNDTFGQHR-------------------------VKLPAGDLVLYP 153 +L D GGE + VK GD +L+ Sbjct: 167 YLVDTD--KGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSVRGDAILFF 224 Query: 154 S---------SSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAML 188 S SLH PV G + + WI+ D + + Sbjct: 225 SMTQDGVLDRGSLHGACPVIEGQKWTAVKWIRVGKFDGNYQEEI 268 >UniRef50_D2VER2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VER2_NAEGR Length = 1108 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 57/180 (31%), Gaps = 24/180 (13%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR------STLYAALQNEV 60 +++ Q + +A + G T +V+ Q++ + V Sbjct: 214 PLVTKQQADDIIQLAAKAPYGRGEET--IVDEKVRKTWQLEPNQFAILNPNWEDMIDELV 271 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 N + + + L L Y+ + + FH D + + ATL + Sbjct: 272 GNQIKKGLGVGDKEIGFNLYKLLL--YEEDGHFQFHRDSEKEEN---------MFATLVV 320 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLP-----AGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 P Y GGEL V + A V + H V VT G R+A I Sbjct: 321 HLPSIYTGGELTVKHNSKEVVYDYSSKSSYATSFVSFYCDCEHKVNRVTSGYRLALVYNI 380 >UniRef50_UPI00016C3513 hypothetical protein GobsU_05758 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3513 Length = 779 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 54/186 (29%), Gaps = 25/186 (13%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRS-----TLYAALQNEVLNA 63 L+ +QA + G T V+ Q+D + + + Sbjct: 47 LTAHQAKELAAVCKQAPYGKGEET--LVDTSVRRVWQLDPDHFSLTNPEWDEFLRDAVAT 104 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 V + L L L Y+ + H DG + ATL + P Sbjct: 105 VQRDLGLEKQQLESHLYNLLL--YEPGGFFLPHRDGEKLD---------RMVATLVVVLP 153 Query: 124 QSYDGGELVVNDTFGQHRVKLPAGDL-------VLYPSSSLHCVTPVTRGVRVASFMWIQ 176 + GGEL+V + + A L + + H V P+ G R+ + Sbjct: 154 SPFTGGELIVRHDGEERAIDFGAPGLNLFHTHFAAFYADCEHEVRPLRTGHRLCLVYNLT 213 Query: 177 SMIRDD 182 Sbjct: 214 LARAQP 219 >UniRef50_B8C4H5 Prolyl 4-hydrolase-like protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C4H5_THAPS Length = 180 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 63/189 (33%), Gaps = 26/189 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT---RSTLYAALQN 58 + + +SP ++ E + + + ++ ++ A+L+ Sbjct: 3 VLTLENFISPTEIHTLLEWGSRFNYERSQAGDVISDSRTSSHTWCVDGCYDDPTVASLRQ 62 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 +++ +Y+ E Y H D + H + L+ + Sbjct: 63 RIVDVTGISE--------ENYECLQLLKYEVGEFYRPH-DDFIDDHVRQAKGPRLLTFFI 113 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPVTRGV 167 + ++ + +GG G ++ G ++++PS + H VT+G Sbjct: 114 YFNEVE--EGGGTRFPKL-GNLTIQPKLGRVLIWPSVLDGDAYKKDERTNHEAMEVTKGR 170 Query: 168 RVASFMWIQ 176 + A+ WI Sbjct: 171 KFAANAWIH 179 >UniRef50_Q2BP80 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BP80_9GAMM Length = 441 Score = 107 bits (267), Expect = 4e-22, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 64/193 (33%), Gaps = 29/193 (15%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR---STLYAALQN 58 ++++ LSP++ A+ E ++ + + ++ D ST A + Sbjct: 72 VFYVNNFLSPEECAQMIELIQHHQRPSTTTNETGHYKHYRTSKTCDLSLLESTFVAEIDQ 131 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-----NGWMRTD 113 + + + + + Y E + H D + + Sbjct: 132 RICKMLGIEPSY-SEGIQGQW-------YDIGEEFKPHTDYFEPKSDEFLEHAEARGQRT 183 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVT 164 + ++L++ Q +GG + R G V++ + ++LH PV Sbjct: 184 WTFMIYLNNTQ--EGGGTFFPELG--QRFLPSQGKAVIWNNLTTDGSPNPATLHHGEPVK 239 Query: 165 RGVRVASFMWIQS 177 RG + W +S Sbjct: 240 RGYKAIITKWFRS 252 >UniRef50_D0N498 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N498_PHYIN Length = 780 Score = 106 bits (266), Expect = 5e-22, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 73/219 (33%), Gaps = 34/219 (15%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQNE 59 IP L+P+ + + ++ + T V+ + Q+ + L+ + Sbjct: 71 IPLPLAPEHAEKLIAKCAKSPFGHNLDT--KMDDSVRKSWQLQPDQLELRNPLWQGGIEK 128 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + + + P Y + H D + ATL Sbjct: 129 LTETIAARLGYKGV--PLNCVLYKLLVYGEGGHFLKHQDTEKEDG---------MIATLV 177 Query: 120 LSDPQSYDGGELVVNDTF---GQHRVKLPAGDL------VLYPSSSLHCVTPVTRGVRVA 170 + P +++GG+LVV +H G ++ S + H + VT+G R+A Sbjct: 178 VQPPSTHEGGDLVVYRNGQVEHRHDFGKIDGTAAYLPHYAVHYSDAEHALEDVTKGYRLA 237 Query: 171 SFMWI------QSMIRDDKKRAMLFELDNNIQSLKSRYG 203 I QS+ RD M EL N + + G Sbjct: 238 LVYSICLPSSLQSLKRDPNV-TMSAELANAFKEMGPEDG 275 >UniRef50_A4RWQ6 Predicted protein n=4 Tax=Chlorophyta RepID=A4RWQ6_OSTLU Length = 328 Score = 106 bits (266), Expect = 5e-22, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 74/219 (33%), Gaps = 40/219 (18%) Query: 4 HIPGVLSPQDVARFREQL----EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 G L+ ++ + ++ VD G+ + ++ + +++ Sbjct: 67 VYRGFLTREECDHLKALATPSLGRSTVVDAS-NGGSVPSDIRTS-----SGMFLLRGEDD 120 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ--NGWMRTDLSAT 117 V+ ++ + + RY+ + Y H D Q + + Sbjct: 121 VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQDEFNQKREKGGQRVATVL 180 Query: 118 LFLSDPQSYDGGELVVNDTF----------------GQHRVKLPAGDLVLYPSS------ 155 ++L+D + +GGE + D G+ VK GD + + S Sbjct: 181 MYLTDVE--EGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGDALFFRSLHHNGTS 238 Query: 156 ---SLHCVTPVTRGVRVASFMWIQSM-IRDDKKRAMLFE 190 S H PV +GV+ ++ W+ I D ++ FE Sbjct: 239 DAMSSHAGCPVVKGVKFSATKWMHVAPIEDSATASVRFE 277 >UniRef50_A3WCU8 Putative uncharacterized protein n=1 Tax=Erythrobacter sp. NAP1 RepID=A3WCU8_9SPHN Length = 364 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 63/186 (33%), Gaps = 23/186 (12%) Query: 2 MYHIPGVLSPQDVARFREQLE-QAEW-----VDGRVTTGAQGAQVKNNQQ-----VDTRS 50 + +P VLSP++ + +E + G ++ + +N+Q + Sbjct: 184 ILIVPNVLSPEECGNLVKSVETDTPFMVRKPQPGEISGNYKVPVYDHNRQDRIDLIIKDP 243 Query: 51 TLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRY---QNNETYGFHVDGAVRSHPQN 107 L + V A RY + G H D P Sbjct: 244 NTLRFLDERIFGRVTPMIKKAFAYDVTRREDLHIARYVGKREGIAMG-HRDN---VDPPG 299 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGV 167 R LS +L Y+GGE+ + F ++PAG +++ SS LH V T GV Sbjct: 300 AHRRFALSMSLN----DEYEGGEITFEE-FSPKGYRVPAGTAMVFSSSLLHEVQETTSGV 354 Query: 168 RVASFM 173 R Sbjct: 355 RYNLIS 360 >UniRef50_Q0QZ85 2OG-Fe(II) oxygenase superfamily n=1 Tax=Synechococcus phage syn9 RepID=Q0QZ85_BPSYS Length = 183 Score = 106 bits (265), Expect = 5e-22, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 49/147 (33%), Gaps = 13/147 (8%) Query: 36 QGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPR--TLSTPLFNRYQNNETY 93 N + + + + +L QH + + F Y Y Sbjct: 37 VDKHQDKNPRSSEVAWINNQYLDNLLLKYVQHINIECKWNLKITGVEPVQFGSYPKGGFY 96 Query: 94 GFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVN-----DTFGQHRVKLPAGD 148 +HVD + +S +LFL+ + Y+GGE + KLP G Sbjct: 97 NWHVDQHSM----PEKVVRKISMSLFLN--EDYEGGEFDLELYRPGTDQRYETFKLPTGS 150 Query: 149 LVLYPSSSLHCVTPVTRGVRVASFMWI 175 + + H V PVT G+R + W Sbjct: 151 AIFFQGDQWHRVRPVTSGLRKSLVSWF 177 >UniRef50_Q58MI5 Putative uncharacterized protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MI5_BPPRM Length = 197 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGR---VTTGAQGAQVKNNQQVDTRSTLYAAL- 56 +++ + G++ E++E+ +W + T+ + ++ K V + + Sbjct: 11 LIHVVNGIIPSNICEDVIEEIERRDWSPHQWYNPTSDSSLSKPKKELDVQHITPELQEIL 70 Query: 57 --------QNEVLNAVNQHAL--FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 + QH ++ FNRY ++ H+D Sbjct: 71 TPFVIESGREYNNKYAYQHPSCYINTRSIMDNFCQIRFNRYSGDQIMHQHIDHIYSLFDG 130 Query: 107 NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL--HCVTPVT 164 LS L +D Y+G L + VKL GD++++PS+ L H VT Sbjct: 131 TNKGIPVLSFILNFND--DYEGANLFFWED---TIVKLGKGDIIMFPSNFLFPHGVTEAK 185 Query: 165 RGVRVASFMW 174 +G+R + W Sbjct: 186 KGIRYSGVCW 195 >UniRef50_A5WD57 2OG-Fe(II) oxygenase n=4 Tax=Moraxellaceae RepID=A5WD57_PSYWF Length = 304 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 68/183 (37%), Gaps = 17/183 (9%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + + P + + + E+ D ++T G + ++ D + Sbjct: 124 FIVLDDLYQPTALLALQAESGFVEYRDAKLTEGVRKTDIRG----DRIRWITKDFFAGFY 179 Query: 62 NAVNQHALFFAAALPR----TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + L F S + Y Y +H D V G +SA Sbjct: 180 YLNSINDLAFLFNRTLFAGIRHSEAHYACYPPGFGYKWHSDNPV------GRDERVISAV 233 Query: 118 LFLSDPQSYD-GGELVVNDTFGQ-HRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 +L+D + D GGEL + D+ GQ H++ A LV++ S+ LH V R R + W+ Sbjct: 234 FYLNDDWTLDDGGELSIIDSEGQTHKLMPKANRLVIFDSNLLHQVELAHR-QRYSIATWL 292 Query: 176 QSM 178 + Sbjct: 293 RCD 295 >UniRef50_A7RAS2 Putative uncharacterized protein C118R n=2 Tax=Chlorovirus RepID=A7RAS2_PBCVA Length = 230 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Query: 5 IPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN--QQVDTRSTLYA---ALQNE 59 I L+ + +V V + +K + + ++ + ++ Sbjct: 46 IDDFLTDVECDILINDASNKGFVKSEVGGATENDPIKLDPKSRNSEQTWFAPGEHDVIDK 105 Query: 60 VLNAVNQHALFFAAALPR-TLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 + N + + + RY++ + Y H DG + + + Sbjct: 106 IQNKTRELLDSKRHCIDKYKFEDVQVARYKSEQYYYHHYDGDDCDDACPKD-QRLATLMV 164 Query: 119 FLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY----------PSSSLHCVTPVTRGVR 168 +L +P +GGE + ++ G + + +LH PV G + Sbjct: 165 YLKEPN--EGGETDFPTL--KTKIIPKKGKAIFFWVADPVTRKLYKETLHAGLPVKSGEK 220 Query: 169 VASFMWIQS 177 + + WI++ Sbjct: 221 IIANQWIRA 229 >UniRef50_D2VEV9 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VEV9_NAEGR Length = 1977 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 80/238 (33%), Gaps = 34/238 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEV 60 ++ + VLSP++ + ++ + + +N+++V S + L N + Sbjct: 96 LFVLEQVLSPEECQDLIDVTDELGYHSIDR---EYSKEYRNSERVVVTSKKVAEILWNRI 152 Query: 61 LNAVNQH---------ALFFAAALPRTLST-PLFNRYQNNETYGFHVDGAVRSHPQNGWM 110 + + + +P LS FNRY+ + H+D + Sbjct: 153 VPMMKKEDITNVKPYGFDNNGKWVPIELSECLRFNRYKEGNFFKPHMDSHFVRNDNE--- 209 Query: 111 RTDLSATLFLSDPQSYDGGELV---VNDTFGQH----------RVKLPAGDLVLYPSSSL 157 R+ + ++L D Y G + D + V+ AG + ++ Sbjct: 210 RSIFTILIYL-DDSPY--GTTFWKKIPDETNEDLSKMKFKKLLTVQPKAGSIAIFNHDIY 266 Query: 158 HCVTPVTRGVRVASFMW-IQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLNL 214 H V G + I I + + ++ Q + ESE++ N+ Sbjct: 267 HSGDYVKEGFKYVVRTEMIFKRIDSESVYKQNYIENSQYQKVLQLLEESEQLEKEGNV 324 >UniRef50_C1FDT8 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FDT8_9CHLO Length = 433 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 63/210 (30%), Gaps = 33/210 (15%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR--STLYAALQNEVLNAV 64 G LS ++ E + G V G+ N + T++ N+V+ + Sbjct: 173 GFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTGSFVPTVFPLGMNDVVRRI 232 Query: 65 NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQ 124 + + RYQ + Y H D H + ++LSD + Sbjct: 233 ERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFF--HEGGMKNNRIATVLMYLSDVK 290 Query: 125 SYDGGELVVNDTFG---------------QHRVKLPAGDLVLY---------PSSSLHCV 160 GGE V V GD +L+ S H Sbjct: 291 D--GGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKVGGDLDGGSTHAG 348 Query: 161 TPVTRGVRVASFMWIQ---SMIRDDKKRAM 187 PV G + + W+ S D ++R + Sbjct: 349 CPVVLGEKWTATKWLHVSSSTEFDARQRVL 378 >UniRef50_Q0C131 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C131_HYPNA Length = 298 Score = 106 bits (264), Expect = 7e-22, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 59/192 (30%), Gaps = 30/192 (15%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST---LYAALQN 58 +Y P L+P+ ++ T +++ ++ D + L L Sbjct: 104 LYVWPNFLAPETCDALIALTDE-RLRASTTTDAFADPKIRTSRSSDIGTMGHNLVMQLDE 162 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-----WMRTD 113 + A+ H + A + RY N+ Y H D + Sbjct: 163 LIAEALGIHWSYSDATQTQ--------RYDVNQEYKAHYDYFTPGTRDYQVHCQFTGQRT 214 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVT 164 + ++L+D + +GG + + G V++ + ++H V Sbjct: 215 WTFMIYLNDVE--EGGGTRFRRL--EKTIMPEKGKAVIWNNLNPDGSVNPYTIHHGMKVR 270 Query: 165 RGVRVASFMWIQ 176 G + W + Sbjct: 271 SGAKYVITKWFR 282 >UniRef50_D2W6G4 Predicted protein (Fragment) n=1 Tax=Naegleria gruberi RepID=D2W6G4_NAEGR Length = 489 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 55/177 (31%), Gaps = 21/177 (11%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQNEVL 61 +++ + E+A + G T V+ Q+ + + + ++ Sbjct: 12 PIITKNQAFEIIQYFEKAPFGQGDKT--IYDETVRKTWQLSPSQFRITNQHWKQYIDNLV 69 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + L +++ S Y+ + FH D + ATL + Sbjct: 70 EKQVKPGLGIHSSVVVRNSLYKLLLYEEGGHFDFHRDTEKED---------KMFATLVVQ 120 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPA-GD----LVLYPSSSLHCVTPVTRGVRVASFM 173 P Y GGE++V + + G + H V +T G R+ Sbjct: 121 LPSLYSGGEIIVQHADSEETYEFAKEGSSKPFFFSFYCDCNHKVAKLTSGYRLCLVY 177 >UniRef50_B4JY14 GH14106 n=1 Tax=Drosophila grimshawi RepID=B4JY14_DROGR Length = 511 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 61/198 (30%), Gaps = 25/198 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + LSPQ++ + + R T G V + + L L N + Sbjct: 319 IVVFHDALSPQEIDYLQNLAR--PLLK-RTTVHVNGKYVSRRVRTSKGAWLERDLNN-LT 374 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSATLFL 120 + + + + Y Y H D + Q + + +L Sbjct: 375 RRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSETGDRIATVLFYL 434 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCVTPVTRGVRVAS 171 SD + GG V + + V G + + + +LH PV G + Sbjct: 435 SDVE--QGGATVFPNL--KLAVSPERGMALFWYNLLDNGTGDTRTLHGGCPVLVGSKWVM 490 Query: 172 FMWIQSMIRDDKKRAMLF 189 +WI +RA LF Sbjct: 491 TLWIH-------ERAQLF 501 >UniRef50_A8J7D3 Predicted protein n=3 Tax=Chlamydomonas reinhardtii RepID=A8J7D3_CHLRE Length = 297 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 63/198 (31%), Gaps = 28/198 (14%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + + LS ++ E+ + V V G V + + T + + + Sbjct: 52 FLLKNFLSDEECDYIVEKARP-KMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVISK 110 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ--NGWMRTDLSATLFL 120 + A L Y + + Y H D + ++ ++L Sbjct: 111 IEKRVAQVTMIPL-ENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQRVVTMLMYL 169 Query: 121 SDPQSYDGGELVVNDTFG-------------QHRVKLPAGDLVLYPS---------SSLH 158 + + +GGE V+ + VK GD +++ S +SLH Sbjct: 170 TTVE--EGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPASLH 227 Query: 159 CVTPVTRGVRVASFMWIQ 176 P +G + ++ WI Sbjct: 228 GSCPTLKGDKWSATKWIH 245 >UniRef50_C6J1E0 Prolyl 4-hydroxylase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J1E0_9BACL Length = 215 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 63/188 (33%), Gaps = 27/188 (14%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQV---DTRSTLYAALQ 57 ++ +L+ + + E A + ++++ ++ + + + ++ Sbjct: 31 LIMRFERLLTDDECRQLIEAA--APRLRESKLVNKVVSEIRTSRGMFFEEEENPFIHRIE 88 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + +A+ + Y + Y H D + P + + Sbjct: 89 KRI-SALMNVPI-------EHAEGLQVLHYGPGQEYQAHYDFFGPNSP-SASNNRISTLI 139 Query: 118 LFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS---------SLHCVTPVTRGVR 168 ++L+D ++ GGE V VK G + + +LH PV RG + Sbjct: 140 IYLNDVEA--GGETVFPLLD--LEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVRGEK 195 Query: 169 VASFMWIQ 176 + W++ Sbjct: 196 WVATQWMR 203 >UniRef50_A6C3X4 Uncharacterized iron-regulated protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3X4_9PLAN Length = 192 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 61/173 (35%), Gaps = 11/173 (6%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + I L+P++ + E + V Q + + L +++ Sbjct: 12 ILMIDQFLTPEECESYINYSEYLGYELADVDFYGVRKQSNQIRTNERADIESQELADKLW 71 Query: 62 NAVNQHALFFAA-ALPRTLSTP-LFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 N + + L + P LS F RYQ N+ + FH DG + + + ++ Sbjct: 72 NELRNYPLPSSELGNPAGLSPFIRFYRYQGNQRFNFHKDGVKKYSN----YESQFTVLIY 127 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGV-RVAS 171 L+ + GGE + +V+ +G +L+ H VT + Sbjct: 128 LNSIK--QGGETIFRKNAI--KVQPQSGRCLLFAHDLWHSGLAVTDEEIKYIM 176 >UniRef50_B2W510 Oxidoreductase domain containing protein n=2 Tax=Pleosporineae RepID=B2W510_PYRTR Length = 255 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 63/209 (30%), Gaps = 34/209 (16%) Query: 3 YHIPGVLSPQDVARFREQLEQ---AEWVDGRVTTGA----QGAQVKNNQQVD-TRSTLYA 54 + GVL+ ++ E E +W V G + ++ L Sbjct: 42 VVLDGVLTEKECKTLLEAAEATTDGKWERALVNIGGGMQAMYEDTRKCGRIIWDNKELMT 101 Query: 55 ALQNEVL------NAVNQHALFFAAA---------LPRTLSTPLFNRYQNNETYGFHVDG 99 L + + + A + R + +Y E + H DG Sbjct: 102 RLWARIETSVPEIHRLQNWAEVTGYGPVKRKETWKVTRLNERGRYLKYIGGEYFKPHCDG 161 Query: 100 AVRSHPQNGWMRTDLSATLFLS------DPQSYDGGELVVNDTFGQHR--VKLPAGDLVL 151 A + + R+ + L+L+ + +GG R V+ AG ++L Sbjct: 162 AYETPDRTE--RSYFTLHLYLNGAVEKSGVRQLEGGATTFFSGNLVQRIDVEPKAGRVLL 219 Query: 152 YPSSSL-HCVTPVTRGVRVASFMWIQSMI 179 + +L H V G + I + Sbjct: 220 FQHRNLIHSGDDVVSGTKYTLRTDIMYTV 248 >UniRef50_C4Y2U7 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y2U7_CLAL4 Length = 275 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 54/192 (28%), Gaps = 22/192 (11%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT-RSTLYAALQNEV 60 + I +P ++ ++ + N +V +L + Sbjct: 78 IITIDHFFTPDFCDELLSSFSDKLVLETTPLIKSKEYAARFNDRVSLTNFRAADSLWQYL 137 Query: 61 LNAVNQHALFFAAALPRTLS----------TPLFNRYQNNETYGFHVDGAV---RSHPQN 107 + Q F L RY +G H D +V +H Sbjct: 138 REILLQKPQFEDEDLEEIRHIFADAKSLNPQLRVYRYTKGHHFGKHYDESVTCPMAHDPK 197 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKL-----PAGDLVLYPSS---SLHC 159 +T + ++L+ ++GG + + R K+ G +L+ H Sbjct: 198 AQGKTKWTLLIYLTGGADFEGGGTIFYPETSRERNKVINVHADKGMALLHKHGDDCLKHE 257 Query: 160 VTPVTRGVRVAS 171 V GV+ Sbjct: 258 AELVKSGVKWVL 269 >UniRef50_A8P9G8 Putative uncharacterized protein n=2 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P9G8_COPC7 Length = 543 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 57/177 (32%), Gaps = 21/177 (11%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT-----RSTLYAALQNEVLNA 63 L V E ++ + G +V++ ++D ++ +VL Sbjct: 89 LDDYKVDEVLEHASRSPFGMGDQKV--VDTKVRDTWEIDGSKVKLNDPFAEWVEYKVLTD 146 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 V + + Y+ + H D + + L P Sbjct: 147 VWKGLGVATPSTKPRCEFYKLLIYKPGGHFHAHQD--------TQKAEGMFATVIVL-LP 197 Query: 124 QSYDGGELVVNDTFGQHRVKL----PAGDLVL-YPSSSLHCVTPVTRGVRVASFMWI 175 Y+GGE+ + + + L G ++ + + LH + PVT G RVA I Sbjct: 198 SEYEGGEVKITHSGKTDIIDLNYISNRGMAIMAWYTDVLHEIKPVTSGYRVALSFNI 254 >UniRef50_P74376 Sll0428 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74376_SYNY3 Length = 350 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 60/196 (30%), Gaps = 27/196 (13%) Query: 2 MYHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 + LS ++ + + + + ++ + + + + + + + Sbjct: 161 FIQVKDFLSATELEQLFKFVIQNENNFLPTSNSAN--DSDYRRSMFLPIFAPFSELIIER 218 Query: 60 VLNAVNQ-HALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V + Q + + + Y H D +L+ Sbjct: 219 VKAILPQLISDLQIQPFQIDYVEAQLTAHNHGNYYKVHNDNGSPDSAT-----RELTYVY 273 Query: 119 FLS-DPQSYDGGELVVNDTFGQ----------HRVKLPAGDLVLYPSSSLHCVTPVT--- 164 + + +P+++ GGEL + D+ + V+ +V + S +H V PV Sbjct: 274 YFNREPKAFSGGELAIYDSKIENNFYVAAESFKTVQPVNNSIVFFLSRYMHEVLPVNCPS 333 Query: 165 ---RGVRVASFMWIQS 177 R W++ Sbjct: 334 QAFADSRFTINGWVRK 349 >UniRef50_B5EQX8 2OG-Fe(II) oxygenase n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EQX8_ACIF5 Length = 213 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 59/189 (31%), Gaps = 23/189 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQV----KNNQQVDTRSTLYAALQ 57 + H G+LS + A ++ V GA A + + V Y + Sbjct: 14 LVHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCSTVVAPSVDAYPIIL 73 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ-NGWMRTDLSA 116 + + F+ Y Y H D PQ L+ Sbjct: 74 E-----IRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGSPQLRNGGNRLLTV 128 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGV 167 L+L+D + GG + AG +L+ + SLH PVT G Sbjct: 129 LLYLNDVEY--GGWTQFPHIMAN--IVPNAGSGILFRNTDAQNRQLRESLHAGLPVTHGE 184 Query: 168 RVASFMWIQ 176 + + +WI+ Sbjct: 185 KWIASIWIR 193 >UniRef50_UPI0001AEC671 2OG-Fe(II) oxygenase family oxidoreductase n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC671 Length = 266 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 57/189 (30%), Gaps = 24/189 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 ++ LS Q+ + + ++ A ++ + + L L +V Sbjct: 85 LFSYDDFLSSQECDDIVALT-KDKLAPSKLAGAASADDIRTSSTCELAF-LGNKLVKDVD 142 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-----WMRTDLSA 116 N + Y E Y H D PQ + + Sbjct: 143 NRIVSTLSLGV----GEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKAHCLSRGQRTWTC 198 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY----PS-----SSLHCVTPVTRGV 167 ++L+D DGG VK G + + PS +S+H PVTRG Sbjct: 199 MIYLND--ECDGGHTRFTKLDI--AVKPKKGMALFWNNLLPSGDPNLNSIHFAEPVTRGH 254 Query: 168 RVASFMWIQ 176 + W + Sbjct: 255 KTVITKWFR 263 >UniRef50_D0N4G0 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N4G0_PHYIN Length = 250 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 57/183 (31%), Gaps = 16/183 (8%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 ++Y +P LS + R R +Q + A N++ + L + Sbjct: 61 LIYAVPSFLSRVECQRVRSFADQEGFERVTQRATRDYAFRDNDRLLLRLPAFADLLWKRL 120 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 V A + F RY + +G HVD + + + + ++L Sbjct: 121 QPHVP--AEYEGLHAVGLNPAIRFYRYNTGQRFGCHVDQS--DVDRVTGYHSRFTVLVYL 176 Query: 121 SD--PQSYDGGELVVNDTFGQHR-------VKLPAGDLVLY---PSSSLHCVTPVTRGVR 168 +D GG + + V G +++ LH VTRG + Sbjct: 177 NDSTDSDLQGGNTIFYANEADAKEENLVLSVAPETGAALVHGHGDHCLLHEGALVTRGAK 236 Query: 169 VAS 171 Sbjct: 237 YLL 239 >UniRef50_A0KRU0 2OG-Fe(II) oxygenase n=7 Tax=Shewanella RepID=A0KRU0_SHESA Length = 327 Score = 103 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 28/165 (16%) Query: 29 GRVTTGAQGAQVKNNQQVD---TRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFN 85 G G+ A ++NN ++ A V + + +A + Sbjct: 162 GHQGKGSIEAGIRNNMHYPTPIPNGSVALACAERVTAKFSGIKIAYA-------EPMVVL 214 Query: 86 RYQNNETYGFHVDGAVRSHPQ-----NGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH 140 RY+ + Y +H D + + + + L+L+D + GGE + Q Sbjct: 215 RYEPGQFYQWHYDAIHAHTSEIKAELERFGQRCRTGILYLND--DFQGGETEFKAPYIQ- 271 Query: 141 RVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVASFMWIQ 176 VK A ++++ + SSLH VT G + W + Sbjct: 272 -VKPQAAAILVFDNTDKSGKPIPSSLHRGCEVTSGHKWVCTQWFR 315 >UniRef50_Q54N51 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54N51_DICDI Length = 322 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 65/229 (28%), Gaps = 66/229 (28%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEVL 61 + + LS ++ F E+ E+ W G + + N ++ S + V Sbjct: 37 FILHNALSEEECKYFIEEAEKIGWESLHWQRGEKN-DFRINDRIMVMSEDIARFAWERVE 95 Query: 62 NAVNQHALFF-----------------AAALPRTLST-PLFNRYQNNETYGFHVDGAVRS 103 N +N + L P L+ +Y + H DG+ Sbjct: 96 NFLNDNGLDITTTVKPGDGLYKIGSPSGVWKPIGLNPKFRMCKYYKGGLFKKHYDGSYV- 154 Query: 104 HPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRV--------------------- 142 Q+ R+ + +L+ Q Y GG D + Sbjct: 155 --QSSTKRSLYTFMFYLN--QDYTGGATNFLDDQSLKSISSVLHFKDTNTGTGNDLELND 210 Query: 143 --------------------KLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 L G+LV++P H +PV G++ Sbjct: 211 DDDSLKAVDLQSLKVIDVVGPLKTGNLVVFPQDLFHEGSPVLDGIKYIM 259 >UniRef50_A8I8D2 Predicted protein n=3 Tax=Chlamydomonas reinhardtii RepID=A8I8D2_CHLRE Length = 336 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 70/211 (33%), Gaps = 35/211 (16%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 Y+ L+ + A A + G +G V ++ + L + V+ Sbjct: 29 YYFHNFLTKAERAHLVRVA--APKLKRSTVVGGKGEGVVDDIRTSYG-MFIRRLSDPVVT 85 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSD 122 + + + RY + +TYG H D S G + ++LSD Sbjct: 86 RIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSG-ASSDHVGPKWRLATFLMYLSD 144 Query: 123 PQSYDGGELVVNDTF--------------------GQHRVKLPAGDLVLYPSS------- 155 + +GGE G K AGD VL+ S Sbjct: 145 VE--EGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLFYSFYPNNTMD 202 Query: 156 --SLHCVTPVTRGVRVASFMWIQSMIRDDKK 184 S+H PV +GV+ A+ +W+ + ++ Sbjct: 203 PASMHTGCPVIKGVKWAAPVWMHDIPFRPEE 233 >UniRef50_B8BWH5 Putative uncharacterized protein n=1 Tax=Thalassiosira pseudonana RepID=B8BWH5_THAPS Length = 373 Score = 103 bits (256), Expect = 6e-21, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 52/157 (33%), Gaps = 25/157 (15%) Query: 38 AQVKNNQQVDTRSTLYAALQNE-----VLNAVNQHALFFAAALPRTLSTPLFNRYQNNET 92 +V + + + + +N V + + + +Y+ E Sbjct: 211 EKVISTHRTSSNAWCRKECENLTGVKGVSKRIEEMTGIPQ----NNYESFQILQYKPGEY 266 Query: 93 YGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY 152 Y H D + ++ L+ L+L+D + +GGE VK G +++ Sbjct: 267 YKSHHDSS-DANKDKVTGHRVLTFFLYLNDVE--EGGETHFTKLNI--SVKPKRGRALVW 321 Query: 153 PS-----------SSLHCVTPVTRGVRVASFMWIQSM 178 PS H V +G++ A+ WI Sbjct: 322 PSVLNEDPNSTDNRMYHEAKSVEKGIKYAANHWIHQY 358 >UniRef50_A5V9G5 2OG-Fe(II) oxygenase n=5 Tax=Sphingomonadales RepID=A5V9G5_SPHWW Length = 234 Score = 102 bits (255), Expect = 7e-21, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 63/194 (32%), Gaps = 35/194 (18%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTT-----GAQGAQVKNNQQVDTRSTLYAAL 56 ++ I L + A +++ D R +T G + +D R L A+ Sbjct: 49 LFMIRDFLPAETCAELVALIDR----DNRPSTIADDQGIAYFRTSKTCDLDPRDALAQAI 104 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN-----GWMR 111 + +A+ PR +Y + + H D + + Sbjct: 105 DARIADALGID--------PRLGEPIQGQKYDVGDEFRDHTDTFEPTGFDYLAHCGETGQ 156 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTP 162 +A ++L++P + GG + G LV + + ++LHC Sbjct: 157 RSWTAMIYLNEPAA--GGATRFRRLD--KIIPPERGKLVAWANIDRSGRPNEATLHCGMK 212 Query: 163 VTRGVRVASFMWIQ 176 V +G + W + Sbjct: 213 VRKGAKYVITKWYR 226 >UniRef50_Q5N1K6 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N1K6_SYNP6 Length = 300 Score = 102 bits (255), Expect = 8e-21, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 64/197 (32%), Gaps = 27/197 (13%) Query: 3 YHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 + I LSP+ + + L ++++ + G + + +Y+ + ++ Sbjct: 110 WRILDFLSPEKLQQLWNYLLTARSQFNPAHNSAGLN--NYRQSLFTAPPPEIYSEISEKI 167 Query: 61 LNAV-NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 L A+ ++ + + Y H D L+ + Sbjct: 168 LGALIPIADELPNSSQEIGEIEMQITAHNDGHYYKIHNDNGSPDTAT-----RFLTYVYY 222 Query: 120 LS-DPQSYDGGELVVNDTFGQ----------HRVKLPAGDLVLYPSSSLHCVTPVT---- 164 P+ + GGEL + + + ++ L+++PS +H V P+ Sbjct: 223 FYRQPKPFTGGELRLYELAIKDGFYVAGDRYQDIEPLHNSLIVFPSHYMHEVLPIRCPSQ 282 Query: 165 --RGVRVASFMWIQSMI 179 R WI+ I Sbjct: 283 RFEDSRFTVNGWIRVAI 299 >UniRef50_A6VYC6 2OG-Fe(II) oxygenase n=2 Tax=Marinomonas RepID=A6VYC6_MARMS Length = 230 Score = 102 bits (255), Expect = 8e-21, Method: Composition-based stats. Identities = 39/185 (21%), Positives = 59/185 (31%), Gaps = 23/185 (12%) Query: 6 PGVLSPQDVARFREQLE--------QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 S ++ E QA + A+ Q +D + + Sbjct: 42 DNFFSTDFTQALMDEAESIQNAFMLQAGVGRKQDHQIILDARRDYIQWIDPDNPVRRDFL 101 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + + AL L F RY+ Y H+D G LS Sbjct: 102 KMMAD--LRVALNRRLFLGLFDYEAHFARYEEGAFYEKHIDAF------KGESNRILSTV 153 Query: 118 LFLS-DPQSYDGGELVVNDTFGQH----RVKLPAGDLVLYPSSS-LHCVTPVTRGVRVAS 171 L+L+ D Q DGGELV+ D R G L ++ S H V V + R + Sbjct: 154 LYLNEDWQDGDGGELVIYDENDPSVEVGRFFPKKGRLAVFLSECFYHEVM-VAKRTRHSI 212 Query: 172 FMWIQ 176 W + Sbjct: 213 AGWFR 217 >UniRef50_A8IBT2 Predicted protein (Fragment) n=2 Tax=Chlamydomonas reinhardtii RepID=A8IBT2_CHLRE Length = 252 Score = 102 bits (255), Expect = 9e-21, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 63/214 (29%), Gaps = 48/214 (22%) Query: 3 YHIPGVLSPQDVARFREQLE-----QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ 57 + I L+ Q+ + + D + + + A ++ Sbjct: 11 FVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSVLDDYRTSYGTFINRYATPVVARVE 70 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + V + + + + RY N + Y H D P + Sbjct: 71 DRV-AVLTRVPVHYQ-------EDMQVLRYGNGQYYHRHTDSLENDSP------RLATVL 116 Query: 118 LFLSDPQSYDGGELVVNDTFGQH-----------------RVKLPAGDLVLYPSS----- 155 L+LSDP+ GGE + K GD +L+ S Sbjct: 117 LYLSDPEL--GGETAFPLAWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFWSVKPDGK 174 Query: 156 -----SLHCVTPVTRGVRVASFMWIQSMIRDDKK 184 S H PV RGV+ + +W+ + ++ Sbjct: 175 TEDPLSEHEGCPVIRGVKWTATVWVHTKPFRPEE 208 >UniRef50_Q6BZD8 DEHA2A02134p n=10 Tax=Saccharomycetales RepID=Q6BZD8_DEBHA Length = 253 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 62/206 (30%), Gaps = 29/206 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQV--KNNQQ-----VDTRSTLYA 54 + + S + + E + + T + + N + D L++ Sbjct: 53 IIIVEKFFSNELCNELIKSFESSPDLKMETTPLIKSKDYAARFNDRGFSVDFDASRNLWS 112 Query: 55 ALQNEVLNAV-------NQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVR----S 103 LQ +L V N F A+ S RY+ +G H D +V Sbjct: 113 YLQKILLRDVEYEDDDDNDVKSIFNDAIALN-SQLRIYRYRKGHHFGQHYDESVICPLTE 171 Query: 104 HPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHR---VKLPAGDLVLYPSS---SL 157 N T + ++L+ + GG + + + V G +L+ Sbjct: 172 DKNNQKGITKWTLLIYLTGDDEFKGGGTIFYPDYSSAKHLNVHPSKGMALLHKHGDDCLR 231 Query: 158 HCVTPVTRGVRVASFMWIQSMIRDDK 183 H V GV+ +S + Sbjct: 232 HEAELVEDGVKWVL----RSDVVYPS 253 >UniRef50_B2HZ49 Predicted proline hydroxylase n=18 Tax=Acinetobacter RepID=B2HZ49_ACIBC Length = 201 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 63/169 (37%), Gaps = 15/169 (8%) Query: 11 PQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALF 70 ++ + ++ +A +G V+T + N+ + L + + L Sbjct: 40 AKECSHHFDEFREAGIQNGVVSTIRSDHILWINESLPVAEQHVETLSSFCQH------LN 93 Query: 71 FAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS-DPQSYDGG 129 A L F Y E Y H D + + +S +L + Q GG Sbjct: 94 QAFFLGIKEVEAHFACYNPGEFYALHRDNPQQKND------RIMSTVYYLHPEWQDDWGG 147 Query: 130 ELVVNDTFGQ-HRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQS 177 +L + D H + LV++ S+ LH V V++ R++ W++S Sbjct: 148 QLRLQDKNNIWHIITPEPNRLVIFQSNLLHEVL-VSKQQRLSITAWLRS 195 >UniRef50_B9F6P1 Putative uncharacterized protein n=3 Tax=Poaceae RepID=B9F6P1_ORYSJ Length = 409 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 71/199 (35%), Gaps = 20/199 (10%) Query: 3 YHIPGVLSPQDVARF---REQLEQAEWVDGRV-TTGAQGAQVKNNQQVDTRSTLYAALQN 58 +H+ G LSP+ A + V T+ A + + L++ Sbjct: 20 FHLRGFLSPETCKELEFVHRSCGTAGYRPSVVSTSLPHLAATGCGHLLLPFVPVRERLRD 79 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATL 118 V +A + H F + + + G+H D + + +A Sbjct: 80 AVESAFSCHFDLF-------IEFTGLISWCKGASIGWHSD-----DNKPYLRQRAFTAVC 127 Query: 119 FLSD-PQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSS--LHCVTPVTRGVRVASFMWI 175 +L+D + Y GG L D + AGD+V+Y + + HCV VT G R+ +W Sbjct: 128 YLNDHGKDYKGGILQFQDGEPSF-ITPVAGDVVIYTADNSNTHCVDEVTEGERLTLTLWF 186 Query: 176 QSMIRDDKKRAMLFELDNN 194 D+ +L L Sbjct: 187 TRDSAYDEDPKLLSFLSQT 205 >UniRef50_A5V2J9 Putative uncharacterized protein n=1 Tax=Sphingomonas wittichii RW1 RepID=A5V2J9_SPHWW Length = 223 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 58/189 (30%), Gaps = 17/189 (8%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDT----RSTLYAAL 56 M++ G L+P AR K Q L Sbjct: 39 MIHRFDGALNPAICARLCALSHSR-----TEKPEGAADPAKLPWQDSDTFAFDHWEEGEL 93 Query: 57 QNEVLNAVNQHALFFAAALPRTLSTPL--FNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 ++ + A+ + R++ ++ H D + Sbjct: 94 RHLIGGYRLMVGQLICLAVREIVFPHFTDLVRWRPGKSMDEHKDDGYPGDDELMSC-RHY 152 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVK-LPAGDLVLYPSS--SLHCVTPVTRGVRVAS 171 SA + +D +Y GGE + + G + V G LV YPS + H V PV G RV Sbjct: 153 SAVTYCND--NYSGGETFIRNEHGGYYVSAPRTGTLVFYPSDERATHGVKPVVGGDRVTL 210 Query: 172 FMWIQSMIR 180 W +R Sbjct: 211 STWFTRDVR 219 >UniRef50_D2VKT5 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2VKT5_NAEGR Length = 254 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + I S + + +E D +++ + L L + Sbjct: 79 FLIENCFSSDECQLMIKAMETVGL-DFKLSGNRHCFRKS-----IMDEKLSEILFERSRD 132 Query: 63 AVNQHALFFAAALPRTLSTPLFNRYQN------NETYGFHVDGAVRSHPQNGWMRTDLSA 116 + Q P P+ RY +G HVD A S+ ++ + Sbjct: 133 FLPQSYKVSGIDRPLQGLNPMI-RYIKYLHKKFGHKFGPHVDAANHSNN----CKSFFTF 187 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS---SLHCVTPVTRGV---RVA 170 ++L+D ++GGE V + Q R+K G + ++ LH V + Sbjct: 188 MVYLND--DFEGGETVFLEKGFQARIKPKTGTVCVFEQDVRQLLHEGLEVEGDENCKKYI 245 Query: 171 S 171 Sbjct: 246 L 246 >UniRef50_Q4SNF8 Chromosome 8 SCAF14543, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4SNF8_TETNG Length = 673 Score = 101 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 50/244 (20%), Positives = 86/244 (35%), Gaps = 44/244 (18%) Query: 3 YHIPGVLSPQDVARFREQLEQAEW-VDG---------------RVTTGAQGAQVKNNQQV 46 + GV++ ++ R A DG +T ++ Sbjct: 409 VVLDGVMTEKECERILHLANAAGSAGDGYKGRRSPHTPHETFEGLTVLRAAKLAQDGLLN 468 Query: 47 DTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNE--------TYGFHVD 98 T + L L V + + F + + T L R ++ HVD Sbjct: 469 QTDARLLYELGERVRTLLRSY--FRSPSELFVSFTHLVCRSAVAGDQEGRLDLSHPVHVD 526 Query: 99 GAVRSHPQNGWMR-------TDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLPAGD 148 + R DLSA L+L++ ++DGG+L + RVK G Sbjct: 527 NCLLEPDTRQCWREQPAFIHRDLSAVLYLNN--NFDGGDLFFTSRDAKTVTARVKPGCGR 584 Query: 149 LVLYPSS--SLHCVTPVTRGVRVASFMWI--QSMIRD--DKKRAMLFELDNNIQSLKSRY 202 LV + S + H VT VT G R A +W + + RD ++ L++L + + Sbjct: 585 LVGFSSGPVNPHGVTAVTGGRRCALALWFTKEKLYRDMEREEAEALWDLGKPAKKDEEEE 644 Query: 203 GESE 206 G++ Sbjct: 645 GKTA 648 >UniRef50_Q2JRT8 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=Synechococcus sp. JA-3-3Ab RepID=Q2JRT8_SYNJA Length = 296 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 58/192 (30%), Gaps = 27/192 (14%) Query: 5 IPGVLSPQDVARFRE--QLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + L + E + +V R + ++ + + + + + V Sbjct: 110 VDNFLPRDKNQQLLEYAIANEGRFVPSRNSA--DDSEYRRSWVLYDFPEFSGVILHYVQM 167 Query: 63 AVNQHAL-FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + Q P + + Y H D + +L+ + Sbjct: 168 LIPQVLKSLKMEPFPIAYIECQMTAHNHGNYYKVHNDNGSPDAKE-----RELTYVYYFY 222 Query: 122 -DPQSYDGGELVVNDTFGQ----------HRVKLPAGDLVLYPSSSLHCVTPVT------ 164 +P+ + GGEL++ D+ + ++ +PS +H V PVT Sbjct: 223 REPKQFSGGELLIYDSEVRNNMYVKADTYKTYVPANNTIIFFPSYLMHEVLPVTCPSRQF 282 Query: 165 RGVRVASFMWIQ 176 R W++ Sbjct: 283 ADSRFTVNGWVR 294 >UniRef50_D0Z4R6 SM-20-related protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z4R6_LISDA Length = 237 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 67/195 (34%), Gaps = 20/195 (10%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQ-QVDTRSTLYAALQNEVL 61 + L+ + V R+ + W R+ G ++ + + D L + Sbjct: 51 FLWDDFLNNEQVEHLRQCIPD-NWKKARI--GRNDEIMRESSIRSDKIQWLTPEQGWPIQ 107 Query: 62 NAVNQHALF-----FAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + + + + L F +Y+ + Y H+D G L+ Sbjct: 108 DYLERMEVIRREVNQNFFLGLFEYEAHFAKYEQGDFYQKHLDCF------KGNENRRLTT 161 Query: 117 TLFLSD-PQSYDGGELVVNDTFGQH--RVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASF 172 ++++ DGGELVV D Q + +G L ++ S H V P T R + Sbjct: 162 VFYMNESWSPEDGGELVVYDLNDQKITTIAPKSGRLFIFLSEKFPHEVLP-TNAERFSIA 220 Query: 173 MWIQSMIRDDKKRAM 187 W + D + + Sbjct: 221 GWFRINGVKDNQLDI 235 >UniRef50_B6KQ34 2OG-Fe(II) oxygenase family protein, putative n=3 Tax=Toxoplasma gondii RepID=B6KQ34_TOXGO Length = 401 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 67/202 (33%), Gaps = 32/202 (15%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGA------QGAQVKNNQQVDTRSTLYAA 55 ++ IP +L+ D R + E W + +TG K+ + L A Sbjct: 214 VFLIPELLTDSDCERLLQLCE-GRWERSKTSTGYATAEPRDYTSSKSPSRTSWSVPLAIA 272 Query: 56 LQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLS 115 V N + + FA L + RY+ + + H DG + Sbjct: 273 ETEIVEN-IERIVSAFAGMPVEHLEPLVVVRYEEGQYFKLHSDGGF----------RPKT 321 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY----------PSSSLHCVTPVTR 165 L+L+D ++ GGE + RV G VL+ +H P + Sbjct: 322 ILLYLNDVEA--GGETSFENLGF--RVAPMKGAGVLWNNSYPGTNEIDPRLIHAGLPPEK 377 Query: 166 GVRVASFMWIQSMIRDDKKRAM 187 GV+ + + R++ Sbjct: 378 GVKFVVNCFFNKDPIRNDLRSI 399 >UniRef50_D0KWT8 2OG-Fe(II) oxygenase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KWT8_HALNC Length = 226 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 43/191 (22%), Positives = 64/191 (33%), Gaps = 26/191 (13%) Query: 3 YHIPGVLSPQDVARFREQLEQ-------AEWVDGRVTTGAQGAQVKNNQ--QVDTRSTLY 53 Y P LS + EQ GR A ++ ++ D R+ Sbjct: 32 YQWPRALSTSLCQALLAEAEQHEADGHLEPAGVGRGVIHQVNATIRRDEIKWFDGRTAAQ 91 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 A + A Q L + L F RYQ Y H+D G Sbjct: 92 KAYLD--QMAELQTYLNRSLFLGLFEYECHFARYQPGGFYKKHLDSF------RGRASRM 143 Query: 114 LSATLFLSDPQSYD-GGELVVNDTFGQHR------VKLPAGDLVLYPSSSL-HCVTPVTR 165 +S +L+ D GGELV+ + + G LV++ S S+ H V P T+ Sbjct: 144 VSVVCYLNPEWQADWGGELVIYGENAEDSGDIRAVITPEMGKLVVFMSESMPHEVLP-TQ 202 Query: 166 GVRVASFMWIQ 176 R + W + Sbjct: 203 HPRTSIAGWFR 213 >UniRef50_A0NE94 AGAP004611-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=A0NE94_ANOGA Length = 515 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 52/190 (27%), Gaps = 22/190 (11%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 M+ V+S +++ + + + V + + + L + + V Sbjct: 318 MIVMYHDVISNKEIDAIISISK--PLMHRSMVGDDHEKAVSKT-RTSSNAWL-DDVMHPV 373 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN-----GWMRTDLS 115 + ++Q Y Y H D AV + G + Sbjct: 374 VRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYDYAVAEEGKEVYPSIGKGNRIAT 433 Query: 116 ATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCVTPVTRG 166 +LSD GG V V G + + +LH PV G Sbjct: 434 VMYYLSDVAI--GGATVFPQLG--LGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVG 489 Query: 167 VRVASFMWIQ 176 + WI Sbjct: 490 SKWVGNKWIH 499 >UniRef50_A5BUA7 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BUA7_VITVI Length = 282 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 64/198 (32%), Gaps = 32/198 (16%) Query: 15 ARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQ--NEVLNAVNQHALFFA 72 R TG + +V + ++ LN + Q F Sbjct: 87 RRIIRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEF 146 Query: 73 AALPRTLST------PLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSY 126 + R L+ RY+ + Y H D + + ++LSD + Sbjct: 147 VVMKRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVE-- 204 Query: 127 DGGELVVNDTFG-------------QHRVKLPAGDLVLYPS---------SSLHCVTPVT 164 +GGE + G +VK GD +L+ S +SLH PV Sbjct: 205 EGGETMFPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVI 264 Query: 165 RGVRVASFMWIQSMIRDD 182 +G + + WI+ +DD Sbjct: 265 KGEKWVATKWIRDQEQDD 282 >UniRef50_Q75DK4 ABR017Cp n=2 Tax=Saccharomycetaceae RepID=Q75DK4_ASHGO Length = 229 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 58/180 (32%), Gaps = 13/180 (7%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVD-TRSTLYAALQNEV 60 + + + + + ++ G++ + N + T S L ++ Sbjct: 47 VILVRKFFTAAVCEQLIAHFSRPNVLELFHQRGSRDYAERLNDRASVTDSETALKLWRKL 106 Query: 61 LNAVNQHALFFA----AALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSA 116 + Q + + RYQ +G H D +V G RT + Sbjct: 107 HAVLAQDPHVMQDLRFSEAKGLIGKLRLYRYQKGHHFGKHYDESVSV---PGAGRTQWTV 163 Query: 117 TLFLSDPQSYDGGELVV--NDTFGQHRVKLPAGDLVLYPSS---SLHCVTPVTRGVRVAS 171 ++LS S +GG+ V G V G +L+ LH V RGV+ Sbjct: 164 LIYLSGGDSLEGGDTVFYKYHERGHEAVHPMPGLALLHKHGDDCLLHEAQMVMRGVKWVL 223 >UniRef50_B0DS97 Predicted protein n=2 Tax=Agaricales RepID=B0DS97_LACBS Length = 242 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 58/196 (29%), Gaps = 32/196 (16%) Query: 4 HIPGVLSPQDVARFREQLEQAE-WVDGRVTTG------AQGAQVKNNQQ-VDTRSTLYAA 55 + V +P + E + W + G +N+++ + Sbjct: 38 VLDDVFTPDECDALIALAESDQTWKQAALHYGLKPQESYVNTDYRNSERILRFDHEAANR 97 Query: 56 LQNEVLNAVNQHALFFA-------AALPRTL----------STPLFNRYQNNETYGFHVD 98 L +L V + P + + RY + H D Sbjct: 98 LYERLLPYVQELLEIRPGGEWEGVVGAPGRVEGVWKLIGVNERLSYLRYGAGNFFRGHCD 157 Query: 99 GAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQH--RVKLPAGDLVLYPSS- 155 G ++ ++ ++ ++L + GG + + V+ G ++++ Sbjct: 158 GQLQLPDGR---KSRVTLQIYLG-KEGVQGGATRIYSGNEKQWVDVEPKLGRVLIFQQRG 213 Query: 156 SLHCVTPVTRGVRVAS 171 H VT+G + A Sbjct: 214 IWHTGEEVTKGFKYAL 229 >UniRef50_Q2S9H1 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S9H1_HAHCH Length = 279 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 63/208 (30%), Gaps = 26/208 (12%) Query: 3 YHIPGVLSPQDVARFREQLEQ--AEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 I L V EQ + + G+V+ + A + N + L + Sbjct: 62 IVISDFLPRVRVDELLTLAEQRISRFEPGKVSGAREIAPERRNSLRLRDPLIEKKLNDWF 121 Query: 61 LNAVNQHALFFAAALPRTLST-----PLFNRYQNNETYGFHVDGAVRSHPQNGWM---RT 112 + + + L L F Y N + H D + G Sbjct: 122 SPHFRKRLTDYCSQLDVALFEISEIELKFCCYPNGAYFHIHRDDQAPNSEATGMRTPGVR 181 Query: 113 DLSATLFLSD-PQSYDGGELVVNDTFGQH---------RVKLPAGDLVLYPSSSLHCVTP 162 +S + P+S+ GGEL + T +H + LVL+PS H V Sbjct: 182 RISFAYYFHRRPKSFTGGELQLYATDRKHDIYSRHRIESIPPHFNTLVLFPSGFYHEVLK 241 Query: 163 V--TRGV----RVASFMWIQSMIRDDKK 184 + T G R A I R + Sbjct: 242 ITETSGEIMNGRFAINGHICEAFRSQES 269 >UniRef50_Q29CA7 GA15937 n=4 Tax=pseudoobscura subgroup RepID=Q29CA7_DROPS Length = 510 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 61/197 (30%), Gaps = 26/197 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + VLS ++A E E+ R +T AQ + + + + L N + Sbjct: 321 VVVYHDVLSDSEIAEILEMAER---RMARTSTVAQPNRTSSPTRTAMGAWL-KRSSNALT 376 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 + + + Y Y H D + + +L+ Sbjct: 377 RRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKD--WFTQHPEVMGNRLATVLFYLT 434 Query: 122 DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGVRVASF 172 D + GG + +H+V G + + + S+ H P+ G + Sbjct: 435 DVE--QGGATMF--NKAEHKVLPRRGTALFWYNLHTDGEGDWSTTHAACPIIVGSKWVLT 490 Query: 173 MWIQSMIRDDKKRAMLF 189 WI+ +R +F Sbjct: 491 QWIR-------ERNQIF 500 >UniRef50_B8KEU1 Oxidoreductase, 2OG-Fe(II) oxygenase family n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KEU1_9GAMM Length = 375 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 60/189 (31%), Gaps = 24/189 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 ++ +P L + R + + +V Q + +++++ + + + + Sbjct: 187 LFTLPEFLKAEQCDRLIDIIRSNA-HPSQVDGYQQQSDMRSSRTCNLDVREHPYIAE-ID 244 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM-----RTDLSA 116 +A+++ ++ + Y+ + Y H D P+ + + Sbjct: 245 DAISRALGISLGW--SEINQGQW--YEPGQQYKPHPDYFPPGTPEYARFAATSGQRTWTF 300 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTRGV 167 ++L+ + GG V G V + + S H PV G Sbjct: 301 MIYLNKTE--RGGGTHFTKIN--RTVMPEQGRAVCWNNLLKNGEPNPDSEHAGLPVEAGS 356 Query: 168 RVASFMWIQ 176 + W + Sbjct: 357 KFILTKWFR 365 >UniRef50_C5CTL3 Procollagen-proline dioxygenase n=1 Tax=Variovorax paradoxus S110 RepID=C5CTL3_VARPS Length = 296 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 60/195 (30%), Gaps = 36/195 (18%) Query: 4 HIPGVLSPQDVARFREQLE-------QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAAL 56 + V S ++ + + GR GAQ + + ++ A L Sbjct: 103 VLSDVFSAEECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLREN-AFVARL 161 Query: 57 QNEVLNAVNQHALFFAAALP-RTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-----WM 110 + +N LP Y H D V S+ N Sbjct: 162 DERLSELMN---------LPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSG 212 Query: 111 RTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSSLHCVT 161 + + +L++ + +GGE V +T V G V + +SLH Sbjct: 213 QRVSTLVAYLNEVE--EGGETVFPETG--WSVSPQRGGAVYFEYCNSLGQVDHASLHAGA 268 Query: 162 PVTRGVRVASFMWIQ 176 PV G + + W++ Sbjct: 269 PVLSGEKWVATKWMR 283 >UniRef50_A3WJ18 Prolyl 4-hydroxylase alpha subunit-like protein, 2OG-Fe(II) oxygenase family protein n=2 Tax=Idiomarina RepID=A3WJ18_9GAMM Length = 210 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 44/188 (23%), Positives = 65/188 (34%), Gaps = 27/188 (14%) Query: 4 HIPGVLSPQDVARFREQLEQ---AEWVD---GRVTTGAQGAQVKNNQ--QVDTRSTLYA- 54 IP L ++ Q A W GR T V+N++ +D ST A Sbjct: 22 IIPSFLPHAVAEPLYQEASQIASAHWQTAAIGRAQTHTINTMVRNDRIIWLDDNSTYGAP 81 Query: 55 --ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRT 112 L ++ AVN+ Y Y H+D G Sbjct: 82 FLNLMEQLRLAVNRTLFMGLFD-----YECHLAHYPKGAFYKKHLDAF------KGKSNR 130 Query: 113 DLSATLFLSD-PQSYDGGELVVNDTFGQ--HRVKLPAGDLVLYPSSSL-HCVTPVTRGVR 168 L+ L+L+ DGGELV+ G+ +V G LV++ S H V P + R Sbjct: 131 KLTTVLYLNPKWSEADGGELVMYGKRGEVLEKVLPKRGTLVVFLSDQFVHEVLPSQK-DR 189 Query: 169 VASFMWIQ 176 + W + Sbjct: 190 FSLTGWFR 197 >UniRef50_A8J238 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J238_CHLRE Length = 1887 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 54/205 (26%), Gaps = 34/205 (16%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV- 60 + + G L P T ++V + S A L V Sbjct: 1692 VLVVDGFLPPGLCDALCAVAAPRLIRSRVSTGAETPSRVSQSTFFTGDS---ARLPEVVA 1748 Query: 61 ----LNAVNQHALFFAAALPRTL--STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 L A+ + A P + Y Y H D G + Sbjct: 1749 VEARLQALMERPEVTAGGRPTLVKSEALQVVSYDVGGFYSEHYDN-----KTGGVISRAA 1803 Query: 115 SATLFLSDPQSYDGGELVVND--------TFGQHRVKLPAGDLVLYPSSS---------L 157 + ++L D Q+ GG + RV G +++ S L Sbjct: 1804 TIIIYLQDTQA--GGSTHFPNQQLRLMRVARPGLRVYPAKGRALIFWSRLPDGSEDLASL 1861 Query: 158 HCVTPVTRGVRVASFMWIQSMIRDD 182 H PV G + W + + + Sbjct: 1862 HSAEPVRAGSKWICTRWFKELAAAE 1886 >UniRef50_Q3BVS8 Putative uncharacterized protein n=3 Tax=Xanthomonas RepID=Q3BVS8_XANC5 Length = 418 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 56/188 (29%), Gaps = 17/188 (9%) Query: 8 VLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNAVNQH 67 VLS + L + +V + + + +TL +++ A Sbjct: 236 VLSADECRLLM-LLARPHLRASKVIDPNDASTGRAPVRTSHGATLDPIIEDFAARAAQSR 294 Query: 68 ALFFAAALPRTLSTPLFNRYQNNETYGFHVDG---AVRSHPQNGWMRTDLSATLFLSDPQ 124 A Y E Y H D + + + ++L+D Sbjct: 295 LAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVCVYLNDVG 354 Query: 125 SYDGGELVVNDTFGQHRVKLPAGDLVLY---------PSSSLHCVTPVTRGVRVASFMWI 175 + GGE RV+ G LV + + SLH PVT G + +W Sbjct: 355 A--GGETEFP--VAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKWLGTLWF 410 Query: 176 QSMIRDDK 183 + D Sbjct: 411 RQQRYRDW 418 >UniRef50_B9ZRT4 2OG-Fe(II) oxygenase n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZRT4_9GAMM Length = 251 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 48/191 (25%), Positives = 69/191 (36%), Gaps = 24/191 (12%) Query: 4 HIPGVLSPQDVARFREQ---------LEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYA 54 P L + V R++ L QA G + + +D S Sbjct: 61 VWPDFLPAESVDALRDEVYALRDAAQLAQARIGRGGERHHDRATRGDWIHWLDGASPAQQ 120 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 A + Q L T S F Y Y HVD + L Sbjct: 121 AFMERLDAIRLQVGRTLIPGLFETES--HFALYPPGTHYARHVDAFQAGNC------RRL 172 Query: 115 SATLFLS-DPQSYDGGELVVNDTFGQH--RVKLPAGDLVLYPS-SSLHCVTPVTRGVRVA 170 S +L+ D Q DGG+L + D G+ R++ AG LV++ S S H V P TR R + Sbjct: 173 SLVFYLNRDWQEQDGGQLAIYDDAGRECQRIQPTAGTLVMFLSQSVPHAVLP-TRRWRAS 231 Query: 171 SFMWIQSMIRD 181 W++ +RD Sbjct: 232 IASWMR--VRD 240 >UniRef50_A4S1S9 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4S1S9_OSTLU Length = 210 Score = 99.6 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 57/176 (32%), Gaps = 15/176 (8%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEV 60 + + L+ +D AR + + ++ G + N + S L Sbjct: 38 IIVVDDALTARDCARIVDAIGD-DFAASSSRGPRHGEARRRNGRFAETSEAFARRLYER- 95 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 V F RY+ E +G HVD V + R+ +A +L Sbjct: 96 -ANVAATFGFEIDDAVGLNPNIRVYRYRAREHFGAHVDERVTAL----GRRSKYTALFYL 150 Query: 121 SDPQSYDGGELVVNDTFGQH--RVKLPAGDLVLYPSSS---LHCVTPVTRGVRVAS 171 S + +GG + D G+ RV+ G + + + H V G + Sbjct: 151 S--EDVEGGSTIFYDEVGEERCRVRPKIGRALYFRHGADMPEHEGEEVREGTKYVL 204 >UniRef50_Q3BXN0 Putative 2OG-Fe(II) oxygenase superfamily protein n=3 Tax=Xanthomonas RepID=Q3BXN0_XANC5 Length = 296 Score = 99.6 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 53/195 (27%), Gaps = 24/195 (12%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTL---YAALQN 58 + + G LS ++ R A G V + + L AL Sbjct: 108 VVVLGGFLSDEECDALIALARP-RLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALCQ 166 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN-----GWMRTD 113 + + + + RY Y H D + Sbjct: 167 RIEARIARLLDWPV----DHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRV 222 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP-------SSSLHCVTPVTRG 166 S ++L+ P+ GG D V G+ V + + SLH PV G Sbjct: 223 ASLVMYLNTPE--RGGATRFPD--AHLDVAAVKGNAVFFSYDRPHPMTRSLHAGAPVLAG 278 Query: 167 VRVASFMWIQSMIRD 181 + + W++ Sbjct: 279 DKWVATKWLRERAVR 293 >UniRef50_D2VKE9 Type IIB DNA topoisomerase n=1 Tax=Naegleria gruberi RepID=D2VKE9_NAEGR Length = 1251 Score = 99.6 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 74/220 (33%), Gaps = 24/220 (10%) Query: 9 LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLYAALQNEVLNA 63 LS + + +A + G T V+ Q+D + + L + +++ Sbjct: 426 LSNGSASELIQFCSKAPYGRGEDT--IYDENVRKTWQLDPSRFEITNPTWDDLIDGLVDN 483 Query: 64 VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDP 123 + L + L + + Y+ + FH D + TL + P Sbjct: 484 EIRDGLGISKHLRLSANLYKLLVYEEGGHFQFHKDSEKEE---------RMFGTLVVQLP 534 Query: 124 QSYDGGELVVNDTFGQHRVKLP-----AGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSM 178 Y GGE++V + + + + H + V G RV + Sbjct: 535 SEYSGGEIIVRHGEEEEEYDFASVSRYTPHFISFYADCEHMIKNVNSGYRVCLIYNLCFS 594 Query: 179 IRDDKKRAMLF--ELDNNIQSLKSR-YGESEEILSLLNLY 215 + K + EL +Q + S +++++ +L +Y Sbjct: 595 GNKESKPTLKGYEELAKRLQQIVSEWNADAKDVDTLYRMY 634 >UniRef50_Q1MZY2 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=1 Tax=Bermanella marisrubri RepID=Q1MZY2_9GAMM Length = 219 Score = 99.6 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 58/185 (31%), Gaps = 19/185 (10%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLNA 63 I L + + + +V + + + NQ V + + + E Sbjct: 29 IIDNALPQALLQSLMNHIATLSSQEFKVAGTGRQDEHQVNQFVRRDEIHWLSEERECERE 88 Query: 64 VNQHALFFAAALPRTL------STPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 A + + L Y+ Y H+D G LS Sbjct: 89 WFHWAQGLQTEINKRLMLGLFSYEAHMAHYEPGAFYKKHLDAF------KGSRSRVLSTV 142 Query: 118 LFLSD-PQSYDGGELVVNDTFGQ----HRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVAS 171 L+L+ QS GGELV+ D RV G LV++ S H V P R + Sbjct: 143 LYLNPQWQSNYGGELVIYDEHNHDSELTRVSPMPGTLVVFLSEDFPHEVLPARE-HRHSI 201 Query: 172 FMWIQ 176 W + Sbjct: 202 AGWFR 206 >UniRef50_A8TVG3 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8TVG3_9PROT Length = 318 Score = 99.6 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 54/141 (38%), Gaps = 10/141 (7%) Query: 37 GAQVKNNQQVDTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNN--ETYG 94 + + + ++ LYA ++ + + + Y+ + + Sbjct: 182 DRKRRRDLTLNRGRPLYADVRTAIADRLMPELWKAWWIDRLRPEAFYVASYEAGRGDFFA 241 Query: 95 FHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS 154 H D ++ + ++ ++ L+D Y+GG LV + + R + PAG + + Sbjct: 242 AHRDNSLPATAD-----RRIAVSIELND--DYEGGGLVFPE-YSDDRWRAPAGGGLAFSC 293 Query: 155 SSLHCVTPVTRGVRVASFMWI 175 S LH PVT G R ++ Sbjct: 294 SLLHEAVPVTAGCRYVLLAFL 314 >UniRef50_C1EER8 Predicted protein n=2 Tax=Micromonas RepID=C1EER8_9CHLO Length = 439 Score = 99.6 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 69/214 (32%), Gaps = 45/214 (21%) Query: 4 HIPGVLSPQDVARFREQLE--------------QAEWVDGRVTT---GAQGAQVKNNQQV 46 I +S ++ A + + G T A + +N +V Sbjct: 213 VIHNFISKEEAAAIVDVAAPELHPSLVVRHQTAKRGDTAGGDTAVHGEATAGRTSHNCRV 272 Query: 47 DTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQ 106 + + V A+ + A P RY ++ Y H D R+HP+ Sbjct: 273 SSSHPI-------VRAAIQRAAYLCGLE-PSHAEPAQVVRYLPSQEYKPHHDWFDRAHPE 324 Query: 107 N-------GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS----- 154 + + ++ +L +P+ GG + GD +L+ + Sbjct: 325 SFRAKTEGRGGQRAVTCLAYLVEPE--RGGRTYFPKLRAG--FEPKVGDALLWWNVDENG 380 Query: 155 ----SSLHCVTPVTRGVRVASFMWIQSMIRDDKK 184 +LH PV G + A +W++ R ++ Sbjct: 381 AEDFKTLHAGEPVEAGAKWALNLWLREKPRRGEE 414 >UniRef50_D2W1Z9 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W1Z9_NAEGR Length = 488 Score = 99.6 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 70/236 (29%), Gaps = 34/236 (14%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 ++I VL+ + + E+ E + + + N++ + L N + Sbjct: 229 FYIDNVLTEDECSAIIEKTETVGYR--TLEHEFLSYERDNDRSLLIFPYFADLLWNRIRP 286 Query: 63 AVNQHALFFAAALPRTLST------------PLFNRYQNNE-TYGFHVDGAVRSHPQNGW 109 + ++ P RY + H DG Sbjct: 287 IFEKDEKLWSQQRPFGWHNEGTWNPSSINNCMRCCRYNGPCVGFVPHRDGNFVQSVDE-- 344 Query: 110 MRTDLSATLFLSD-----PQSY-------DGGELVVNDTF----GQHRVKLPAGDLVLYP 153 R+ + L+L+D + GELV + + +K G ++++ Sbjct: 345 -RSKFTIILYLNDNFDDGTTDFLRAHQPASMGELVREELERGFSNEFSLKPKRGRVLIFD 403 Query: 154 SSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEIL 209 + LH P+ G + I D+ + M + D + Y + Sbjct: 404 HALLHQGRPIPSGHKYIIRTDIVLKRVDNPQPDMSYLQDPEYHKMLYLYKAAANYE 459 >UniRef50_B8HQ53 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HQ53_CYAP4 Length = 309 Score = 99.6 bits (247), Expect = 8e-20, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 61/196 (31%), Gaps = 27/196 (13%) Query: 3 YHIPGVLSPQDVARFREQL--EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEV 60 + L+P++++ E + ++ + +TG + + + + N + Sbjct: 121 VRLEQFLTPEELSYLIEYVVQQKDNFAPTHTSTGDLD--YRKSLILYNFPEFSQLVVNRI 178 Query: 61 LNAVNQHAL-FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + + + + Y H D +L+ + Sbjct: 179 RTVMPEVLTKLKMPPFSVGEIESQLTAHGDGNYYKIHNDNGSPETAT-----RELTYVYY 233 Query: 120 L-SDPQSYDGGELVVNDT----------FGQHRVKLPAGDLVLYPSSSLHCVTPVT---- 164 P+ + GGEL + D+ H V+ ++ +PS LH V PV Sbjct: 234 FYQQPKCFSGGELRLFDSKIENGFYVAADSSHIVEPDNNSIIFFPSRYLHEVLPVQCPSR 293 Query: 165 --RGVRVASFMWIQSM 178 + R WI+ Sbjct: 294 EFQYYRFTINGWIRRA 309 >UniRef50_Q58MX3 Putative uncharacterized protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MX3_BPPRM Length = 197 Score = 99.2 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 74/197 (37%), Gaps = 33/197 (16%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTL----YAALQ 57 + + +L+ D+ + + ++ ++ D V G G + + N + + + + + Sbjct: 4 LIQVIKILNGTDLKKVNQYVDTLDFEDNTV-FGKPGEECQTNTDIRSSTGVSLDDAHEIT 62 Query: 58 NEVLNAVNQ------------HALFFAAALPRTL------STPLFNRYQNNETYGFHVDG 99 N + ++N H F +P + Y+ + Y FH D Sbjct: 63 NVIHTSMNNGLDEYKRRVQKIHPNFSYYPVPGAVGTRSWREGIQILDYKKGQEYKFHHDA 122 Query: 100 AVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSS--SL 157 A + P+ G +S L+L ++ GG + VK G +++PS+ Sbjct: 123 A--TEPRLGEYHRKISVILYL--KEATKGGGTAFSHL----SVKPKPGYALIFPSNWCYP 174 Query: 158 HCVTPVTRGVRVASFMW 174 H PV+ G + + W Sbjct: 175 HAGEPVSAGKKRVAVTW 191 >UniRef50_Q9FKX6 Prolyl 4-hydroxylase, alpha subunit-like protein n=18 Tax=Spermatophyta RepID=Q9FKX6_ARATH Length = 267 Score = 99.2 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 61/186 (32%), Gaps = 38/186 (20%) Query: 4 HIPGVLSPQDVARFREQL----EQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE 59 L+ ++ E E++ VD T + ++V+ + T A +++ Sbjct: 89 VYHNFLTKEECKYLIELAKPHMEKSTVVD-EKTGKSTDSRVRTS-----SGTFLARGRDK 142 Query: 60 VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + + + F Y+ + Y H D + + + + ++ Sbjct: 143 TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMY 202 Query: 120 LSDPQSYDGGELVVNDTFGQH-----------------RVKLPAGDLVLYPS-------- 154 LSD + +GGE V G + VK GD +L+ S Sbjct: 203 LSDVE--EGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDATLD 260 Query: 155 -SSLHC 159 SSLH Sbjct: 261 PSSLHG 266 >UniRef50_C7JE98 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JE98_ACEP3 Length = 336 Score = 99.2 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 60/180 (33%), Gaps = 19/180 (10%) Query: 3 YHIPGVLSPQDVARFREQLE-QAEWVDGRVTTGAQGAQV-------KNNQQV-DTRSTLY 53 + +L + G +T A G V K L Sbjct: 147 LLLTDILEADFCHALIHYYHYNSPTPSGFLTKNADGLAVEKIDPMFKRRYDCKIKNQNLI 206 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQ--NNETYGFHVDGAVRSHPQNGWMR 111 LQ ++ V + + + Y + + H D V G Sbjct: 207 KGLQARIIRRVVPEIRKTFQCTVTGMDRMIISCYDAAHKGHFAPHRDNTV-----EGAKH 261 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVAS 171 + ++ L+D +++GGEL + F PAG +++ S+ LH V PVT+G R A Sbjct: 262 RLFAISINLND--TFEGGELTFPE-FSNQGFCPPAGGALIFSSALLHAVRPVTKGKRYAC 318 >UniRef50_B8HX71 Prolyl 4-hydroxylase alpha subunit n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HX71_CYAP4 Length = 457 Score = 99.2 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 67/199 (33%), Gaps = 28/199 (14%) Query: 2 MYHIPGVLSPQDVARFRE--QLEQAEWVDGRVTTGAQG--AQVK--NNQQVDTRSTLYAA 55 + LSP+D+ + R+ +L+Q ++D + Q QV+ ++ A Sbjct: 260 VVQFENFLSPEDLEKVRDFVRLQQEHFLDSALMGNRQNVQTQVRQSKLLYLEDFPEFKAW 319 Query: 56 LQNEVLNAVNQHAL-FFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 Q +L + + + + YG H D + R ++ Sbjct: 320 FQRFLLAKLPAALQQLQHPEFMVSGMEMQLTLHGDGCYYGIHPDTTFTEVAK--VARREI 377 Query: 115 SATLFLS-DPQSYDGGELVVNDTFGQHR----------VKLPAGDLVLYPSSSLHCVTPV 163 + + +P + GGEL + T R ++ L+ + S H V PV Sbjct: 378 TFVYYFCLEPGGFSGGELRMYPTQICDRQGFTSADFKVIEPLHNSLIFFNSRCYHEVMPV 437 Query: 164 T-------RGVRVASFMWI 175 +G R WI Sbjct: 438 VCPGNRFDQG-RFTINGWI 455 >UniRef50_B6AGD9 Putative uncharacterized protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6AGD9_9CRYT Length = 310 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 64/203 (31%), Gaps = 37/203 (18%) Query: 3 YHIPGVLSPQDVARFREQLEQAE----WVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQN 58 Y VL+ ++ + E+LE ++ W + + + D L+ ++ Sbjct: 51 YIWDNVLTEEECSVLIEKLESSKSYSFWNPASQSKEFRNVDTIESNMNDFAEFLWKRIEP 110 Query: 59 EVLNAV---NQHALFFAAALPRTLS------TPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 + + F LF RY +G H DG+V Sbjct: 111 IFRYKILNIKEDDNFSEYDTIGEWEASTIYPKMLFGRYSEGGHFGPHTDGSVSLDINT-- 168 Query: 110 MRTDLSATLFLSDPQSYDGGELVV--------------------NDTFGQHRVKLPAGDL 149 RT + ++L+ + GG V+ F RV+ G + Sbjct: 169 -RTFWTILIYLNTIPTEGGGATVLVDKRQKSLPFYKDKQDRDRSYPEFILSRVQPKCGRV 227 Query: 150 VLYPSSSLHCVTPVTRG-VRVAS 171 + + + +H P++ G + Sbjct: 228 LTFSFADMHEGEPISPGFYKYII 250 >UniRef50_A0KKW9 SM-20 domain protein n=43 Tax=Gammaproteobacteria RepID=A0KKW9_AERHH Length = 231 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 40/183 (21%), Positives = 64/183 (34%), Gaps = 18/183 (9%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVLN 62 + L+ +V + L A W + A + ++ D L +L V + Sbjct: 45 VIVDDFLTAAEVDALKACLPDA-WRPAGIGRDALHQDNRTIRR-DQIHWLEPSLGAPVAD 102 Query: 63 AVNQHALFFAAA-----LPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + + AA L F RY++ + Y H D G L++ Sbjct: 103 YLARMEALRLAANRMLMLGLFDYEAHFARYRSGDFYATHRDAF------AGRSNRRLTSV 156 Query: 118 LFL-SDPQSYDGGELVVNDTFGQ--HRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASFM 173 +L +D Q GG L + D Q V G LVL+ S H V P + R + Sbjct: 157 FYLNNDWQPQAGGVLRMYDDDEQLLMDVSPRGGRLVLFLSEEFPHEVLPANQ-ERYSIAG 215 Query: 174 WIQ 176 W + Sbjct: 216 WFR 218 >UniRef50_Q8T5S8 Prolyl-4-hydroxylase-alpha PV n=14 Tax=Drosophila RepID=Q8T5S8_DROME Length = 525 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 62/202 (30%), Gaps = 30/202 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNE-- 59 M VLSP+++ + G A N+ V TR++ A + Sbjct: 331 MVLYHDVLSPKEIKELQGMA-----TPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYN 385 Query: 60 -VLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR--TDLSA 116 + +N Y Y H D +++ M + Sbjct: 386 PLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATV 445 Query: 117 TLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYP---------SSSLHCVTPVTRGV 167 +L+D + GG V + V G +V++ + +LH PV G Sbjct: 446 LFYLTDVE--QGGATVFP--NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGS 501 Query: 168 RVASFMWIQSMIRDDKKRAMLF 189 + WI+ +R +F Sbjct: 502 KWVCNKWIR-------EREQIF 516 >UniRef50_B8C5J7 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C5J7_THAPS Length = 395 Score = 98.5 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 65/205 (31%), Gaps = 24/205 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWV--------DGRVTTGAQGAQVKNNQQVDTRSTLYA 54 + I L+ + + +V DG K+ V + Sbjct: 105 FTIDNFLNKDECKAILQLSTGFHYVTEAAHTDNDGVTHVVRLQEPNKHKLSVFEHAPTVD 164 Query: 55 ALQNEVLNAVNQHALFFAAAL----PRTLST-PLFNRYQN--NETYGFHVDGA--VRSHP 105 L ++ + H F P L+ RY N+ + H D V S Sbjct: 165 TLWTKLQPMILPHIGSFIDDTKCGQPLGLNPRLRVLRYDASDNDVFEPHFDATTRVASDD 224 Query: 106 QNGWMRTDLSATLFLSDP--QSYDGGELVV-----NDTFGQHRVKLPAGDLVLYPSSSLH 158 N + + L+ ++L+D + +DGGE V +G +V++ H Sbjct: 225 CNKTLTSLLTVLIYLNDGGGKEFDGGETFYLDSVNPKNDNATIVTPASGKVVVFEHDLFH 284 Query: 159 CVTPVTRGVRVASFMWIQSMIRDDK 183 P+T G + I + DD Sbjct: 285 SSVPLTFGAKYVLRTDILFQVNDDD 309 >UniRef50_Q2UGW2 Predicted protein n=11 Tax=Trichocomaceae RepID=Q2UGW2_ASPOR Length = 332 Score = 98.5 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 62/229 (27%), Gaps = 63/229 (27%) Query: 4 HIPGVLSPQDVARFREQLEQAE---------WVDGRVTTG----AQGAQVKNNQQVD-TR 49 I +L+ ++ E + W + G +N ++ Sbjct: 97 VIDNILTEEECNELIRLAEASTVTPQSPTPVWERAMINVGNGKQKLATDTRNCGRIIWDT 156 Query: 50 STLYAALQNEVL--------NAVNQHALF-------FAAALPRTLSTPLFNRYQNNETYG 94 L L N ++ + + L L R F RY+ E + Sbjct: 157 PELADKLLNRLMPFLREFEIDRLENRPLVTGLAGRNKTYRLTRLNERLRFLRYEGGEYFR 216 Query: 95 FHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGEL----------------------- 131 H D + + + ++ + L+L+ D EL Sbjct: 217 PHWDASYTTPDRKE--KSFFTVHLYLNGDGEQDLKELRREQARVERGEGDVNLGVGGKLL 274 Query: 132 --------VVNDTFGQHRVKLPAGDLVLYPS-SSLHCVTPVTRGVRVAS 171 + RV AG ++++ LH V RG ++ Sbjct: 275 GGATSFLPRFEEKERHLRVFPKAGSVLVFQHNDLLHAGDSVFRGTKLTM 323 >UniRef50_Q1MT87 Novel protein similar to vertebrate leprecan-like family (Fragment) n=8 Tax=Clupeocephala RepID=Q1MT87_DANRE Length = 676 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 43/214 (20%), Positives = 70/214 (32%), Gaps = 40/214 (18%) Query: 3 YHIPGVLSPQDVARFREQL----EQAEWVDGRVTTGAQGAQVKNNQQVDT---------- 48 + GVLS + R + + GR + + + + Sbjct: 463 VVLDGVLSQSECDRVMQLATVAASAGDGYRGRRSPHTPHEKFEGLTVLRALKLAQGGLVN 522 Query: 49 --RSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNET--------YGFHVD 98 + L + V ++ + F +L T L R + HVD Sbjct: 523 QSDARLLHGIGETVKELMDSY--FRTPSLLYFAYTHLVCRSAITGHQEGRSDLSHPVHVD 580 Query: 99 GAVRSHPQNGWMR-------TDLSATLFLSDPQSYDGGELVVNDTFGQH---RVKLPAGD 148 + R DLSA L+L+D ++GG+ D + VK G Sbjct: 581 NCILEPESRQCWREAPAFTHRDLSAVLYLND--DFEGGDFFFTDRDAKTVTATVKPKCGR 638 Query: 149 LVLYPSS--SLHCVTPVTRGVRVASFMWIQSMIR 180 LV + S + H VT VT+G R A +W + Sbjct: 639 LVGFTSGPVNPHGVTAVTKGRRCALALWFTTEKH 672 >UniRef50_B3PB13 SM-20 domain protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PB13_CELJU Length = 232 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 68/191 (35%), Gaps = 27/191 (14%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTG-------AQGAQVKNN--QQVDTRSTLY 53 + G L ++ +Q + + G Q ++ + ++ + Sbjct: 48 LVLDGALPAHLSQALLDEWDQ-HFAVHLQSAGVGRAGDYRQSPDIRRDKILWLEPETPAV 106 Query: 54 AAL---QNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM 110 +++ +NQ L F Y+ + Y H D ++ + G Sbjct: 107 TEFLSWMDKLRTGLNQRLF-----LGLFDYESHFALYEPGDFYQKHRDAFRDTNARAG-- 159 Query: 111 RTDLSATLFLS-DPQSYDGGELVVNDTFGQH---RVKLPAGDLVLYPSSSL-HCVTPVTR 165 LS +L+ D S DGGELV+ D +H RV G L+++ S H V P R Sbjct: 160 -RKLSTVYYLNPDWTSLDGGELVLYDEADEHLLERVAPKQGRLLVFLSEDFPHEVLPARR 218 Query: 166 GVRVASFMWIQ 176 R + W + Sbjct: 219 -PRKSIAGWFR 228 >UniRef50_A8IDI8 Prolyl 4-hydroxylase n=2 Tax=Chlamydomonas reinhardtii RepID=A8IDI8_CHLRE Length = 429 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 62/210 (29%), Gaps = 44/210 (20%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDG-------RVTTGAQGAQVKNNQQVDTRSTLYA 54 + P + + + G +V Q K S Sbjct: 224 IKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEAEQQVRTSKGTFLGGDSSPALT 283 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPL-FNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 L++++ +PR Y++ + Y H+D + + Sbjct: 284 WLESKIAAV---------TDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPKEYGQQYSQRI 334 Query: 114 LSATLFLSDPQSYDGGELVV-----------------NDTFGQHRVKLPAGDLVLYPS-- 154 + + LSD + GGE V D G R K AGD VL+ S Sbjct: 335 ATVIVVLSD-EGLVGGETVFKREGKANIDKPITNWTDCDADGGLRYKPRAGDAVLFWSAF 393 Query: 155 -------SSLHCVTPVTRGVRVASFMWIQS 177 +LH PV G + + WI++ Sbjct: 394 PDGRLDQHALHGSCPVVTGNKWVAVKWIRN 423 >UniRef50_B8N2A2 Putative uncharacterized protein n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8N2A2_ASPFN Length = 606 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 70/225 (31%), Gaps = 41/225 (18%) Query: 5 IPGV------LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLY 53 IP V +S + + + + G T V+ + Q+D + + Sbjct: 225 IPDVGNIGLPISTEHAKAIIQSCHPSPYGKGTET--LVDESVRKSWQLDASQFALQNPRW 282 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 ++ A Y+ + H D Sbjct: 283 QLQVELFVDKAVTGLGLTANGREVKAKLYKLLIYEEGAFFLPHRDSEKADG--------- 333 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGD-----LVLYPSSSLHCVTPVTRGVR 168 + TL + P ++GG+++V+ + Q + + + + H V PVT G R Sbjct: 334 MFGTLAVCLPSKHEGGDVIVSHSRDQLKFQTAPTSEFGISWAAWYADVTHEVKPVTSGYR 393 Query: 169 VASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLN 213 V I ++I + L+SR +E I LL+ Sbjct: 394 VVL---IYNLIHRP-----------STALLESRGSSTENITRLLD 424 >UniRef50_UPI0001925DF6 PREDICTED: similar to predicted protein n=1 Tax=Hydra magnipapillata RepID=UPI0001925DF6 Length = 799 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 81/244 (33%), Gaps = 64/244 (26%) Query: 3 YHIPGVLSPQDVARFREQL--------------EQAEWVDGRVTTGAQGAQVKNNQQVDT 48 V S ++ + + ++++ D +++ G +++ Q Sbjct: 397 VVFDHVTSEEECNQLMDLAKTLNKDVNLKIIEKRKSQYEDMKISVTGDG--YRHSHQDPA 454 Query: 49 RSTLYAALQNEV---------------------------LNAVNQHALFFAAALPRTLST 81 R + V + F T Sbjct: 455 RPFTEKEVFKGVTLSGAVDAVVAGKASVEEAELYINVSERLRLLTQEYFDMKVRLNFAFT 514 Query: 82 PLFNRY--QNNET---YGFHVDGAVRSHPQNG--------WMRTDLSATLFLSDPQSYDG 128 L RY ++ E + H D + + NG + D SA L+L+D ++G Sbjct: 515 HLVCRYALEDGEEHISHPIHSDNCILNGDGNGTCPKRSPAFTWRDYSALLYLND--DFEG 572 Query: 129 GELVVNDTFG--QHRVKLPAGDLVLYPS---SSLHCVTPVTRGVRVASFMWIQ-SMIRDD 182 GE + ++ Q +V+ G +V + S +LH V V +GVR A +W S + + Sbjct: 573 GEFIFANSTDKIQAQVRPKCGRVVAFRSKGLENLHGVLGVKKGVRCALPIWFTLSTDKSE 632 Query: 183 KKRA 186 RA Sbjct: 633 DGRA 636 >UniRef50_B4RXU6 Oxidoreductase, 2OG-Fe(II) oxygenase family protein n=3 Tax=Gammaproteobacteria RepID=B4RXU6_ALTMD Length = 223 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 60/184 (32%), Gaps = 24/184 (13%) Query: 5 IPGVLSPQ--DVARFREQLEQAEWVDGRVTTGAQGAQVKNN-----QQVDTRSTLYAALQ 57 +P ++ D R + E GR + V+ + L+ A Sbjct: 39 LPDFITQALLDCQRSISEAEYKTAGIGRAENYKKATNVRGDAICWITGSSQEGALWLAWC 98 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 + +N+ + F Y + Y HVD G LS Sbjct: 99 EAMQQYINRSLFMGLFSF-----ESHFACYGPGKFYKRHVDAF------KGQGNRVLSLV 147 Query: 118 LFLS-DPQSYDGGELVVN---DTFGQHRVKLPAGDLVLYPSSSL-HCVTPVTRGVRVASF 172 +L+ D +GGELV+ D +V LV++ S H V P TR R + Sbjct: 148 GYLNEDWLEENGGELVIYNSSDDVEGTKVLPKKNTLVVFLSEQFPHEVLPATR-TRHSIA 206 Query: 173 MWIQ 176 W + Sbjct: 207 GWFR 210 >UniRef50_C1EAF2 Predicted protein n=2 Tax=Micromonas RepID=C1EAF2_9CHLO Length = 898 Score = 97.7 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 62/176 (35%), Gaps = 20/176 (11%) Query: 7 GVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNN---QQVDTRSTLYAALQNEVLNA 63 +++ + A + E+A G TT A + + L+ AL + L + Sbjct: 733 PLMTEAECAEWVRLAEKAGEARGGWTTSRHYAVPTTDIPVHAIPDLLPLWNALMRDKLAS 792 Query: 64 VNQHALFFAAALPR--TLSTPLFNRYQNN--ETYGFHVDGAVRSHPQNGWMRTDLSATLF 119 + A P + RY+ H D ++ +S TL Sbjct: 793 LLSAACPEEMPKPSSVRVHDAFVVRYEAGAQHHLPMHAD------------QSAVSVTLA 840 Query: 120 LSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWI 175 L+D Y+GG G V+ G +V + H +PVTRGVR ++ Sbjct: 841 LNDEGEYEGGGTTFAVPVG-KTVRPGRGHVVAFKGGLQHGGSPVTRGVRYIVAAFL 895 >UniRef50_B7G237 Proly 4-hydroxylase n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G237_PHATR Length = 226 Score = 97.7 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 63/208 (30%), Gaps = 39/208 (18%) Query: 1 MMYHIPGVLSPQDVARFREQLE-QAEWVDGRVTTGAQGAQVK--NNQQVDTRSTLYAALQ 57 ++ + G LS + +E E E+ + + QG Q A+ Sbjct: 15 LVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQSAFIRAHDDAIL 74 Query: 58 NEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHP----------QN 107 ++ R RY E Y H D + +N Sbjct: 75 TDIDYRTASLVRIP----RRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDKRTLALIRN 130 Query: 108 GWMRTDLSATLFLSDPQSYDGGELVVNDTFGQHR-----------VKLPAGDLVLYPS-- 154 G + +LSD + GGE V G VK G ++++ S Sbjct: 131 GHRNRMATVFWYLSDVE--KGGETVFPRFNGAQETSMKDCKTGLKVKPEKGKVIIFYSMT 188 Query: 155 -------SSLHCVTPVTRGVRVASFMWI 175 SLH PV +G + A+ W+ Sbjct: 189 PDGALDEYSLHGACPVQKGTKWAANKWV 216 >UniRef50_B7G6B8 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6B8_PHATR Length = 193 Score = 97.7 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 58/189 (30%), Gaps = 22/189 (11%) Query: 3 YHIPGVLSPQDVARFREQLEQAEWVDGRVTTG--AQGAQVKNNQQVDTRSTLYAALQNEV 60 + + L+ + +++ + T G ++ + ++ V Sbjct: 11 FQVENFLTDVEADHIVGLVQKKNDMQRSSTNGHISETRTSSTTWLARHSDPVIDSIFRRV 70 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 + + A R Y + Y H D G ++ ++L Sbjct: 71 ADTLKMD---EAMLHRRINEDLQIVHYGVGQQYTAHHDFGYPKGD-PGSPSRSINFCMYL 126 Query: 121 SDPQSYDGGELVVN-----DTFGQHRVKLPAGDLVLY----PSSSL-----HCVTPVTRG 166 +D + GG+ +T G V G +++ P +L H PV G Sbjct: 127 NDVPA--GGQTSFPRWRNAETNGALNVVPKKGTAMIFYMVNPDGNLDDLTHHAALPVIEG 184 Query: 167 VRVASFMWI 175 + S +WI Sbjct: 185 EKFFSNLWI 193 >UniRef50_D1HTI8 Whole genome shotgun sequence of line PN40024, scaffold_22.assembly12x (Fragment) n=2 Tax=rosids RepID=D1HTI8_VITVI Length = 240 Score = 97.3 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 58/192 (30%), Gaps = 26/192 (13%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAA-LQNEV 60 ++ + + + F + E + +G ++N + + A + Sbjct: 44 LFTVQNFFTSAESKAFVKIAESMGFTHQGSLGPTKGEAYRDNDRTSVNDPVLAYTIWQSG 103 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 LN + F RY+ + +G H+D +V RT + ++L Sbjct: 104 LNKLFSDIKIRGKVAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGK---RTHYTLLIYL 160 Query: 121 SDPQSYD-----------------GGELVVNDTFGQ--HRVKLPAGDLVLYPSS---SLH 158 S GGE V + V G +L+ LH Sbjct: 161 SGGSKQKTKGVQSNVGDSSSEPLVGGETVFYGSRNGIVAEVAPTEGMALLHIHGDMCMLH 220 Query: 159 CVTPVTRGVRVA 170 VT+G++ Sbjct: 221 EARNVTKGIKYV 232 >UniRef50_A4RXU6 Predicted protein n=3 Tax=Mamiellales RepID=A4RXU6_OSTLU Length = 317 Score = 97.3 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 61/208 (29%), Gaps = 42/208 (20%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 ++ + LS ++ E E+ ++ + + + + + + L + Sbjct: 45 VFLLKNFLSDEECEHLIELGEKK--LERSTVVNSDESGAVSTARTSFGTFVTRRLTETLQ 102 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFLS 121 ++ A + RY++ + Y H DG + + + + +FL Sbjct: 103 RVEDRVAKYSGIPW-EHQEQLQLLRYRDGQEYVAHHDGIISENGG----KRIATVLMFLR 157 Query: 122 DPQSYDGGELVVN------------------------DTFGQHRVKLPAGDLVLYPSSSL 157 +P GGE + V G+ VL+ S + Sbjct: 158 EPT--SGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFSVIPKKGEAVLFFSFHI 215 Query: 158 ---------HCVTPVTRGVRVASFMWIQ 176 H P G + + WI Sbjct: 216 NGTNDPFANHASCPTLGGTKYTATKWIH 243 >UniRef50_A4ACT0 Prolyl 4-hydroxylase, alpha subunit domain protein n=1 Tax=Congregibacter litoralis KT71 RepID=A4ACT0_9GAMM Length = 290 Score = 97.3 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 65/192 (33%), Gaps = 30/192 (15%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVD---TRSTLYAALQN 58 ++ I LS Q E + +A +V Q + ++++ A + Sbjct: 101 LFAIENFLSTQQCNTLVEAI-RANAHPSKVDGYEQQSDFRSSRTCSLRVHEHPFVADINK 159 Query: 59 EVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWM-----RTD 113 + N + + + ++ + Y+ + Y H D V P+ + Sbjct: 160 SIANTLGLNHRW------GEVTQGQW--YEPGQQYKAHPDYFVPGTPEYDQFALADGQRT 211 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY----PS-----SSLHCVTPVT 164 + ++L+ P+ GG + V AG + + PS S++H PV Sbjct: 212 WTFMIYLNKPE--QGGGTHFSKID--RTVMPEAGMAICWNNLLPSGEPNPSTVHAGLPVE 267 Query: 165 RGVRVASFMWIQ 176 G + W + Sbjct: 268 AGSKFIITKWFR 279 >UniRef50_B2SFP9 Oxidoreductase n=19 Tax=Francisella RepID=B2SFP9_FRATM Length = 206 Score = 96.9 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 63/183 (34%), Gaps = 15/183 (8%) Query: 4 HIPGVLSPQDVARFREQLE---QAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYA-ALQNE 59 I L+ + + R+QLE QA + ++ + + D L Sbjct: 29 IIDNWLTTDETDKLRQQLEELYQANYFKKSAVGNRLNENLERSIRSDFIFWLDETKYAQV 88 Query: 60 VLNAVNQHALFFAAALPRTLS--TPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSAT 117 +N + + + Y Y H+D R +S Sbjct: 89 FFEKINSFIEYINKTCFAGIVTKEFHYAVYPQGSFYKKHIDTFQNDD------RRTISVV 142 Query: 118 LFLS-DPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPSSSL-HCVTPV-TRGVRVASFMW 174 +L+ D Q GG+L + + G +VL+ S S+ H V PV T R++ W Sbjct: 143 CYLNQDWQDCFGGQLKLYLKDQTLEIFPTDGKIVLFDSKSIEHEVLPVLTENKRLSITGW 202 Query: 175 IQS 177 +++ Sbjct: 203 LKA 205 >UniRef50_A4EM02 Response regulator receiver domain protein (CheY-like) n=1 Tax=Roseobacter sp. CCS2 RepID=A4EM02_9RHOB Length = 217 Score = 96.9 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 61/213 (28%), Gaps = 32/213 (15%) Query: 1 MMYHIPGVLSPQDVARFREQLEQAEWVDGRV--TTGAQGAQVKNNQQVDTRSTLYAALQN 58 ++ I V ++A V G + + + N L + Sbjct: 14 LVAVIDDVFDEDLAQHVISLGQEALVRATVVDSAGGGKLDESRTNDSGTIDQWSDPKLAS 73 Query: 59 EVLNA--VNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQN----GWMRT 112 V + + P RY+ + + H D + + Sbjct: 74 LVTTISDLVRLP-------PENSEPSQLLRYEGEQKFDPHTDAFDNTVGGRDFISRGGQR 126 Query: 113 DLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLY-----------PSSSLHCVT 161 + +L++ GGE + ++ G ++++ P S+ H Sbjct: 127 LFTTICYLNNVG--KGGETEFPAL--KIKIAPKLGRVLIFGNTRLGTAMEHPHST-HGGR 181 Query: 162 PVTRGVRVASFMWIQSMIRDDKKRAMLFELDNN 194 PV G + A +W + + +R E + Sbjct: 182 PVKDGEKYALSIWWRQLAY-HVQRDYPAEEGDT 213 >UniRef50_B8C7A3 Putative uncharacterized protein (Fragment) n=1 Tax=Thalassiosira pseudonana RepID=B8C7A3_THAPS Length = 232 Score = 96.9 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 62/224 (27%), Gaps = 50/224 (22%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKN----NQQVDTRSTLYAA-- 55 + + LSP +V + A+ + + ++ + + + ++ Sbjct: 9 VLEVKKFLSPVEVQHLIDLASGAKGDVAMQRSTVLASNIRGATKTDTRSSSGGWIHREQD 68 Query: 56 -----LQNEVLNAVNQHALFFAAALPRT------LSTPLFNRYQNNETYGFHVDGAVRSH 104 + + + + P + RY+ E Y H D S Sbjct: 69 VIVDTIFRRIADLLKIDKNLMRDQRPPHLIGAHVVEAMQLLRYEPGEEYNPHHDFTYPSI 128 Query: 105 PQNGWMRTDLSATLFL-------------------SDPQSYDGGELVVN-----DTFGQH 140 + ++ L+L +D GGE + Sbjct: 129 DNRYQPKRYVTILLYLTGEGDVIQDGIRLSPKNTNTDVDGLQGGETTFPRAITTEYHDGI 188 Query: 141 RVKLPAGDLVLYPSSSL---------HCVTPVTRGVRVASFMWI 175 +V +G V++ + H V +GV+ + +W Sbjct: 189 KVAPQSGKAVVFYNILPDGNMDDLSQHSGGKVEKGVKYLANVWF 232 >UniRef50_B8CFF8 Predicted protein n=2 Tax=Thalassiosira pseudonana RepID=B8CFF8_THAPS Length = 1164 Score = 96.9 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 70/188 (37%), Gaps = 20/188 (10%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRSTLYAALQNEVL 61 + + L+ ++ A E ++ ++ + ++ A + V+ Sbjct: 939 LVSLENFLTEEEAAYLIEVGKRQQYQRSDYKEIDPEHRTSSSAWCRRDCWKDDATVSSVV 998 Query: 62 NAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNG-WMRTDLSATLFL 120 + + + + + LS RY+ + + H D + + ++ ++L Sbjct: 999 DRIAK----VTKSETKQLSNLQILRYEEGQKFNQHSDFSRPIRRLKRVQGQRLMTFLIYL 1054 Query: 121 SDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS-----------SSLHCVTPVTRGVRV 169 SD + +GGE + +++ G +L+P+ + H PV +GV+ Sbjct: 1055 SDVE--EGGETSFP--YSGVKIQPRKGLAILWPNVMNDDPDAKEERADHLSLPVLKGVKH 1110 Query: 170 ASFMWIQS 177 A ++I + Sbjct: 1111 AVSIYIHA 1118 >UniRef50_C5LEZ3 Prolyl 4-hydroxylase alpha subunit, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LEZ3_9ALVE Length = 383 Score = 96.9 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 62/190 (32%), Gaps = 32/190 (16%) Query: 4 HIPGVLSPQDVARFREQLEQAEWVDGRVTT-------GAQGAQV--KNNQQVDTRSTLYA 54 +P L+P++ E +W V G V ++ + + L Sbjct: 179 LVPDFLTPEECEYMISLAE-GKWRPSTVGRSSSSISDGKSDKYVNKRSKGRTSSSFMLLH 237 Query: 55 ALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDL 114 + + V + A + RY++ E +G H DGA Sbjct: 238 SQDDVVAEIERRAASLVGFP-ADHVERLNMLRYESGEFFGQHHDGAF----------RPW 286 Query: 115 SATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCVTPVTR 165 + + L+D GGE + +++ AG +++P+ +H P T Sbjct: 287 TVFITLNDIPRGAGGETLFPALG--LKIRPKAGTALVWPNCLEDGQADDRVVHEALPPTG 344 Query: 166 GVRVASFMWI 175 + A ++ Sbjct: 345 VRKYAINCFV 354 >UniRef50_Q9LT92 Genomic DNA, chromosome 5, P1 clone:MJM18 n=11 Tax=Embryophyta RepID=Q9LT92_ARATH Length = 250 Score = 96.5 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 58/194 (29%), Gaps = 28/194 (14%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTRST-LYAALQNEV 60 ++ + L+ + F + E + A G ++N ++ L L Sbjct: 52 LFTVENCLTSDESKAFVKIAESLGFTHQGSRGPAYGEAYRDNHRISVNDPVLADTLWQSG 111 Query: 61 LNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTDLSATLFL 120 L+ + F RY + +G H+D + N RT + ++L Sbjct: 112 LSNLFTDIKIRRKVAVGLNPNIRFYRYSAGQHFGRHIDESADLEDGN---RTYYTLLIYL 168 Query: 121 SDP-------------------QSYDGGELVVNDTFGQ--HRVKLPAGDLVLYPSS---S 156 S + GGE V + V G + + Sbjct: 169 SGNSTKSKSKSSSSKTNDSSSAEPLVGGETVFYGSRNSIVAEVAPVEGMALFHIHGDKCM 228 Query: 157 LHCVTPVTRGVRVA 170 LH V++GV+ Sbjct: 229 LHEGRNVSKGVKYV 242 >UniRef50_D0N7E1 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7E1_PHYIN Length = 803 Score = 96.5 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 70/208 (33%), Gaps = 34/208 (16%) Query: 3 YHIPGV------LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----ST 51 I V D + E+ E++ + T V+ + Q++ + Sbjct: 72 ICINDVGTISVPFQEHDATKLIEKCEKSPFGHNFDT--KMDDNVRKSWQLEPSQVEFKNP 129 Query: 52 LYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMR 111 L+ + +++ + + + P Y+ + H D Sbjct: 130 LWESGLHQLTRTITERLGYSGV--PLQCVLYKLLVYEEGGHFFKHQDTEKEDG------- 180 Query: 112 TDLSATLFLSDPQSYDGGELVVNDTFG---QHRVKLPAGDLVLYPS------SSLHCVTP 162 + ATL + P ++GG+LV+ +H G + P + H + Sbjct: 181 --MIATLVVQLPSLHEGGDLVIYSNGEVKHRHDFGKADGTVAFLPHYAVHYADAEHALET 238 Query: 163 VTRGVRVASFMWIQSMIRDDKKRAMLFE 190 VT+G R+A I + +D + +F+ Sbjct: 239 VTKGFRLALVYSI-CLPKDRCQLKRVFD 265 >UniRef50_B6KKV3 2OG-Fe(II) oxygenase family protein n=3 Tax=Toxoplasma gondii RepID=B6KKV3_TOXGO Length = 967 Score = 96.5 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 58/195 (29%), Gaps = 43/195 (22%) Query: 2 MYHIPGVLSPQDVARFREQLEQAEWVDGRVTTGAQG------------AQVKNNQQVDTR 49 ++ +P +L+P + + Q W + + G ++ + ++ Sbjct: 485 VFIMPELLTPATCDKLVKMC-QGRWQPSKTSRGPLHALPNEYSSGESLSRTSMSVRLAPG 543 Query: 50 STLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGW 109 T ++ A + + L + +Y+ E + H DG Sbjct: 544 ETPEVENLENIVAAFAEMPI-------SHLEPLVVVKYEEGEFFKQHHDGHF-------- 588 Query: 110 MRTDLSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGDLVLYPS---------SSLHCV 160 + ++L+D GGE ++ G V++ + LH Sbjct: 589 --RRTTILVYLNDVAH--GGETEFPHLG--LKLSPTKGSAVMWRNVFESNQIDPRVLHAG 642 Query: 161 TPVTRGVRVASFMWI 175 P G + + Sbjct: 643 LPTLAGQKYVINCFF 657 >UniRef50_D1LX56 Leprecan-like protein n=1 Tax=Saccoglossus kowalevskii RepID=D1LX56_SACKO Length = 680 Score = 96.5 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 79/237 (33%), Gaps = 44/237 (18%) Query: 4 HIPGVL-SPQDVARFREQL-EQAEWVDG---------------RVTTGAQGAQVKNNQQV 46 + + S + E A DG +T + + Sbjct: 445 MVADLFASQMQCDQLIELALAGAVEGDGYSGKAKPHTTFETFSGLTVLNAAEKAREGLVK 504 Query: 47 DTRSTLYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNR-YQNNE-------TYGFHVD 98 + LY L + V + F T L R + + ++ H D Sbjct: 505 AEDALLYLDLSEKARRYVESY--FKLPTRLFFSYTHLVCREAEPDAPQDRDDLSHPIHAD 562 Query: 99 GAVRSHPQN------GWMRTDLSATLFLSDPQSYDGGELVV--NDTFGQHRVKLPAGDLV 150 N ++ D SA ++L+ + ++GGE + + + V+ G LV Sbjct: 563 NCWLDDKGNCIKKSPAYVWRDYSALMYLN--EDFEGGEFIFARFNKTVEASVQPKCGRLV 620 Query: 151 LYPS--SSLHCVTPVTRGVRVASFMWIQSMIRDDKK-----RAMLFELDNNIQSLKS 200 + + +LH V VT+G R A MW + D+K R +L L ++ L+ Sbjct: 621 SFSAGEENLHGVKAVTKGRRCALAMWYTLDTKYDEKAHIEAREILENLQHDELYLRR 677 >UniRef50_Q2ULX3 Predicted protein n=2 Tax=Aspergillus RepID=Q2ULX3_ASPOR Length = 427 Score = 96.5 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 70/225 (31%), Gaps = 41/225 (18%) Query: 5 IPGV------LSPQDVARFREQLEQAEWVDGRVTTGAQGAQVKNNQQVDTR-----STLY 53 IP V +S + + + + G T V+ + Q+D + + Sbjct: 46 IPDVGNIGLPISTEHAKAIIQSCHPSPYGKGTET--LVDESVRKSWQLDASQFALQNPRW 103 Query: 54 AALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQNNETYGFHVDGAVRSHPQNGWMRTD 113 ++ A Y+ + H D Sbjct: 104 QLQVELFVDKAVTGLGLTANGREVKAKLYKLLIYEEGAFFLPHRDSEKADG--------- 154 Query: 114 LSATLFLSDPQSYDGGELVVNDTFGQHRVKLPAGD-----LVLYPSSSLHCVTPVTRGVR 168 + TL + P ++GG+++V+ + Q + + + + H V PVT G R Sbjct: 155 MFGTLAVCFPSKHEGGDVIVSHSRDQLKFQTAPTSEFGISWAAWYADVTHEVKPVTSGYR 214 Query: 169 VASFMWIQSMIRDDKKRAMLFELDNNIQSLKSRYGESEEILSLLN 213 V I ++I + L+SR +E I LL+ Sbjct: 215 VVL---IYNLIHRP-----------STALLESRGSSTENITRLLD 245 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.146 0.432 Lambda K H 0.267 0.0456 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,387,284,043 Number of Sequences: 3077464 Number of extensions: 55153246 Number of successful extensions: 148332 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 282 Number of HSP's successfully gapped in prelim test: 1103 Number of HSP's that attempted gapping in prelim test: 145663 Number of HSP's gapped (non-prelim): 1540 length of query: 225 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 101 effective length of database: 658,790,820 effective search space: 66537872820 effective search space used: 66537872820 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 91 (39.5 bits)