BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (187 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A7ZR71 UPF0301 protein yqgE n=133 Tax=Gammaproteobacter... 384 e-106 UniRef50_A1JPT5 UPF0301 protein YE3428 n=22 Tax=Gammaproteobacte... 271 8e-72 UniRef50_Q87LK0 UPF0301 protein VP2612 n=11 Tax=Gammaproteobacte... 206 2e-52 UniRef50_B6EMV3 UPF0301 protein VSAL_I0547 n=39 Tax=Gammaproteob... 203 2e-51 UniRef50_Q7MHK0 UPF0301 protein VV2869 n=47 Tax=Gammaproteobacte... 199 3e-50 UniRef50_B8E9P8 UPF0301 protein Sbal223_1344 n=7 Tax=Shewanella ... 192 7e-48 UniRef50_Q21EI7 UPF0301 protein Sde_3637 n=7 Tax=Proteobacteria ... 186 5e-46 UniRef50_A1RHM8 UPF0301 protein Sputw3181_1330 n=13 Tax=Proteoba... 185 8e-46 UniRef50_Q605E8 UPF0301 protein MCA2336 2 n=3 Tax=Gammaproteobac... 182 3e-45 UniRef50_A1U764 Putative uncharacterized protein n=3 Tax=Marinob... 180 3e-44 UniRef50_D0L0G7 Putative uncharacterized protein n=4 Tax=Gammapr... 179 4e-44 UniRef50_Q478W0 UPF0301 protein Daro_3893 n=3 Tax=Betaproteobact... 179 4e-44 UniRef50_Q1YVF9 Putative uncharacterized protein n=1 Tax=gamma p... 176 4e-43 UniRef50_A1TKL7 UPF0301 protein Aave_0907 n=10 Tax=Comamonadacea... 174 1e-42 UniRef50_A1KUG1 UPF0301 protein NMC1274 n=27 Tax=Neisseriaceae R... 172 4e-42 UniRef50_Q486M0 UPF0301 protein CPS_1252 n=1 Tax=Colwellia psych... 172 4e-42 UniRef50_B9MF60 UPF0301 protein Dtpsy_2896 n=28 Tax=Proteobacter... 172 6e-42 UniRef50_Q2P5W3 UPF0301 protein XOO1309 n=20 Tax=Xanthomonadacea... 171 1e-41 UniRef50_Q1MZ13 Putative uncharacterized protein n=1 Tax=Bermane... 169 5e-41 UniRef50_A1SUE8 Putative uncharacterized protein n=2 Tax=Psychro... 168 1e-40 UniRef50_C6NYY5 Putative uncharacterized protein n=1 Tax=Acidith... 167 1e-40 UniRef50_B8KLD5 Putative uncharacterized protein n=3 Tax=Proteob... 167 1e-40 UniRef50_Q5WYW5 UPF0301 protein lpl0620 n=6 Tax=Legionella RepID... 166 3e-40 UniRef50_A8PPF4 Putative uncharacterized protein n=1 Tax=Rickett... 166 4e-40 UniRef50_Q3SFS4 UPF0301 protein Tbd_2579 n=2 Tax=Proteobacteria ... 163 3e-39 UniRef50_Q0I1B4 UPF0301 protein HS_0009 n=26 Tax=Pasteurellaceae... 160 3e-38 UniRef50_A4BI12 Putative uncharacterized protein n=1 Tax=Reineke... 159 4e-38 UniRef50_Q3IZ52 UPF0301 protein RHOS4_26140 n=6 Tax=Rhodobactera... 159 5e-38 UniRef50_B8FL92 UPF0301 protein Dalk_3037 n=1 Tax=Desulfatibacil... 158 8e-38 UniRef50_A9KDE7 UPF0301 protein CBUD_2193 n=7 Tax=Coxiella burne... 157 2e-37 UniRef50_Q1BYL1 UPF0301 protein Bcen_0382 n=60 Tax=Betaproteobac... 157 2e-37 UniRef50_A5WBR3 UPF0301 protein PsycPRwf_0144 n=21 Tax=Moraxella... 155 7e-37 UniRef50_B5JTR0 Putative uncharacterized protein n=1 Tax=gamma p... 154 1e-36 UniRef50_A3MYV4 UPF0301 protein APL_0232 n=7 Tax=Pasteurellaceae... 154 2e-36 UniRef50_A4A9E0 Protein containing DUF179 n=1 Tax=Congregibacter... 152 7e-36 UniRef50_Q0AMH8 Putative uncharacterized protein n=3 Tax=Hyphomo... 151 1e-35 UniRef50_C3K3J9 UPF0301 protein PFLU_5755 n=5 Tax=cellular organ... 148 8e-35 UniRef50_Q163D2 UPF0301 protein RD1_3419 n=14 Tax=Rhodobacterale... 147 1e-34 UniRef50_A8ZZX8 Putative uncharacterized protein n=1 Tax=Desulfo... 147 2e-34 UniRef50_B9NUS5 Putative uncharacterized protein n=2 Tax=Rhodoba... 146 3e-34 UniRef50_Q4ZZ67 UPF0301 protein Psyr_0485 n=31 Tax=Proteobacteri... 146 3e-34 UniRef50_A4SVF0 Putative uncharacterized protein n=1 Tax=Polynuc... 145 6e-34 UniRef50_A6VSP6 UPF0301 protein Mmwyl1_0539 n=2 Tax=Marinomonas ... 144 2e-33 UniRef50_Q31EK4 UPF0301 protein Tcr_1827 n=1 Tax=Thiomicrospira ... 144 2e-33 UniRef50_Q0EWH5 Putative uncharacterized protein n=1 Tax=Maripro... 139 4e-32 UniRef50_Q0C3C2 Putative uncharacterized protein n=1 Tax=Hyphomo... 139 5e-32 UniRef50_B4REX9 Transcriptional regulator n=4 Tax=Caulobacterace... 139 5e-32 UniRef50_Q3SNY6 UPF0301 protein Nwi_2752 n=121 Tax=Alphaproteoba... 139 6e-32 UniRef50_C5SQD7 Putative uncharacterized protein n=1 Tax=Asticca... 136 4e-31 UniRef50_C9CSD6 Putative uncharacterized protein n=1 Tax=Silicib... 133 3e-30 UniRef50_Q5FQY8 UPF0301 protein GOX1459 n=11 Tax=Acetobacteracea... 131 1e-29 UniRef50_Q6AL28 UPF0301 protein DP2218 n=1 Tax=Desulfotalea psyc... 130 2e-29 UniRef50_A7C130 Protein containing DUF179 n=1 Tax=Beggiatoa sp. ... 130 3e-29 UniRef50_Q2GAJ3 UPF0301 protein Saro_0683 n=4 Tax=Sphingomonadac... 129 5e-29 UniRef50_C0QFZ0 UPF0301 protein HRM2_24640 n=1 Tax=Desulfobacter... 129 5e-29 UniRef50_A9DAK3 Putative uncharacterized protein n=1 Tax=Hoeflea... 128 9e-29 UniRef50_Q1NQW6 Putative uncharacterized protein n=2 Tax=Deltapr... 128 1e-28 UniRef50_A0L5K4 UPF0301 protein Mmc1_0726 n=1 Tax=Magnetococcus ... 128 1e-28 UniRef50_A6WWH2 Putative uncharacterized protein n=2 Tax=Ochroba... 127 2e-28 UniRef50_Q60BQ2 UPF0301 protein MCA0413 1 n=1 Tax=Methylococcus ... 110 2e-23 UniRef50_Q5NQN1 UPF0301 protein ZMO0349 n=3 Tax=Zymomonas mobili... 110 3e-23 UniRef50_C0AXX7 Putative uncharacterized protein n=1 Tax=Proteus... 105 6e-22 UniRef50_C8CIK8 Putative uncharacterized protein n=1 Tax=uncultu... 104 1e-21 UniRef50_Q0BLI0 UPF0301 protein FTH_1193 n=18 Tax=Francisella Re... 101 1e-20 UniRef50_A9GTQ2 Putative uncharacterized protein n=1 Tax=Sorangi... 100 2e-20 UniRef50_Q2BRE1 Putative uncharacterized protein n=1 Tax=Neptuni... 100 4e-20 UniRef50_Q0FVR8 Putative uncharacterized protein (Fragment) n=1 ... 98 1e-19 UniRef50_A6G4Z9 Putative uncharacterized protein n=1 Tax=Plesioc... 96 6e-19 UniRef50_D0LIU3 Putative uncharacterized protein n=1 Tax=Haliang... 95 1e-18 UniRef50_D2QR79 Putative uncharacterized protein n=2 Tax=Flexiba... 94 3e-18 UniRef50_B0BVW3 UPF0301 protein RrIowa_0061 n=15 Tax=Rickettsia ... 93 4e-18 UniRef50_A7H7H6 UPF0301 protein Anae109_0457 n=4 Tax=Anaeromyxob... 92 6e-18 UniRef50_Q2S591 UPF0301 protein SRU_0495 n=2 Tax=Rhodothermaceae... 92 1e-17 UniRef50_C1D0N0 Putative uncharacterized protein n=3 Tax=Deinoco... 92 1e-17 UniRef50_Q3KMF1 UPF0301 protein CTA_0231 n=9 Tax=Chlamydia RepID... 91 1e-17 UniRef50_B3QT15 Putative uncharacterized protein n=1 Tax=Chloroh... 90 4e-17 UniRef50_A6LBX4 UPF0301 protein BDI_1431 n=6 Tax=Bacteroidales R... 89 8e-17 UniRef50_A3VRH6 Putative uncharacterized protein n=1 Tax=Parvula... 87 2e-16 UniRef50_Q254Z3 UPF0301 protein CF0373 n=7 Tax=Chlamydiales RepI... 85 1e-15 UniRef50_D2R140 Putative uncharacterized protein n=1 Tax=Pirellu... 82 1e-14 UniRef50_Q5LDK5 UPF0301 protein BF2109 n=20 Tax=Bacteroides RepI... 81 2e-14 UniRef50_Q11U74 UPF0301 protein CHU_1773 n=2 Tax=Flexibacteracea... 81 2e-14 UniRef50_A3ZQK2 Putative uncharacterized protein n=1 Tax=Blastop... 80 2e-14 UniRef50_Q1DAS2 UPF0301 protein MXAN_2022 n=2 Tax=Cystobacterine... 80 3e-14 UniRef50_A3HT39 Putative uncharacterized protein n=1 Tax=Algorip... 79 6e-14 UniRef50_Q3B561 UPF0301 protein Plut_0637 n=11 Tax=Chlorobiaceae... 78 2e-13 UniRef50_C3Q021 UPF0301 protein n=8 Tax=Bacteroides RepID=C3Q021... 77 2e-13 UniRef50_B9XJW1 Putative uncharacterized protein n=1 Tax=bacteri... 77 4e-13 UniRef50_A6C880 Putative uncharacterized protein n=1 Tax=Plancto... 77 4e-13 UniRef50_A5FNN9 Putative uncharacterized protein n=18 Tax=Bacter... 76 4e-13 UniRef50_Q1Q3L0 Putative uncharacterized protein n=1 Tax=Candida... 74 2e-12 UniRef50_C7PSM3 Putative uncharacterized protein n=1 Tax=Chitino... 72 9e-12 UniRef50_UPI0001745679 hypothetical protein VspiD_25265 n=1 Tax=... 72 1e-11 UniRef50_UPI0001C3133A protein of unknown function DUF179 n=1 Ta... 71 2e-11 UniRef50_B4CVG9 Putative uncharacterized protein n=1 Tax=Chthoni... 70 3e-11 UniRef50_A4C260 Putative transcriptional regulator n=1 Tax=Polar... 70 4e-11 UniRef50_B0SHS8 Transcriptional regulator n=6 Tax=Leptospira Rep... 70 4e-11 UniRef50_B1ZWF6 Putative uncharacterized protein n=2 Tax=Opituta... 69 7e-11 UniRef50_B1MML1 UPF0301 protein MAB_4928c n=20 Tax=Corynebacteri... 67 3e-10 UniRef50_Q47MA0 UPF0301 protein Tfu_2389 n=3 Tax=Actinomycetales... 67 4e-10 UniRef50_C2G2S7 Transcriptional regulator n=3 Tax=Sphingobacteri... 66 5e-10 UniRef50_C7MS43 Predicted transcriptional regulator n=3 Tax=Acti... 65 8e-10 UniRef50_C1ZJG4 Predicted transcriptional regulator, COG1678 n=1... 64 4e-09 UniRef50_C7PBJ3 Putative uncharacterized protein n=1 Tax=Chitino... 63 4e-09 UniRef50_A6E847 Putative uncharacterized protein n=1 Tax=Pedobac... 63 6e-09 UniRef50_C1B7P4 UPF0301 protein ROP_34500 n=13 Tax=Corynebacteri... 60 3e-08 UniRef50_B5JR88 Putative uncharacterized protein n=1 Tax=Verruco... 60 4e-08 UniRef50_C6X421 Putative transcriptional regulator n=1 Tax=Flavo... 60 4e-08 UniRef50_Q82D55 UPF0301 protein SAV_5129 n=12 Tax=Actinomycetale... 60 4e-08 UniRef50_C1RPZ7 Predicted transcriptional regulator, COG1678 n=1... 60 5e-08 UniRef50_A1SG68 Putative uncharacterized protein n=2 Tax=Actinom... 59 6e-08 UniRef50_C4DLM5 Predicted transcriptional regulator, COG1678 n=5... 59 8e-08 UniRef50_C8XE74 Putative uncharacterized protein n=2 Tax=Actinom... 59 8e-08 UniRef50_Q4CSL3 Putative uncharacterized protein n=2 Tax=Trypano... 58 1e-07 UniRef50_C0AXX9 Putative uncharacterized protein n=1 Tax=Proteus... 56 5e-07 UniRef50_B7G677 Predicted protein n=2 Tax=Bacillariophyta RepID=... 56 7e-07 UniRef50_D0A6S5 Putative uncharacterized protein n=2 Tax=Trypano... 54 2e-06 UniRef50_C1E1K3 Predicted protein n=2 Tax=cellular organisms Rep... 54 2e-06 UniRef50_A9RVW3 Predicted protein n=1 Tax=Physcomitrella patens ... 53 5e-06 UniRef50_C0YNI4 Transcriptional regulator n=1 Tax=Chryseobacteri... 53 5e-06 UniRef50_B2UMM6 Putative uncharacterized protein n=1 Tax=Akkerma... 52 1e-05 UniRef50_Q6A827 Conserved protein, DUF179 n=2 Tax=Propionibacter... 52 1e-05 UniRef50_C5BU30 Putative uncharacterized protein n=1 Tax=Teredin... 51 2e-05 UniRef50_Q4QG99 Putative uncharacterized protein n=3 Tax=Leishma... 50 4e-05 UniRef50_C7Q9Z6 Putative uncharacterized protein n=5 Tax=Actinom... 49 1e-04 UniRef50_A8IT21 Predicted protein n=1 Tax=Chlamydomonas reinhard... 47 2e-04 UniRef50_A8IRU3 Predicted protein n=1 Tax=Chlamydomonas reinhard... 47 2e-04 UniRef50_Q8FSW7 UPF0301 protein CE2927 n=11 Tax=Corynebacterium ... 47 2e-04 UniRef50_Q8NL65 UPF0301 protein Cgl3084/cg3414 n=5 Tax=Corynebac... 46 5e-04 UniRef50_Q9LQ30 F14M2.10 protein n=6 Tax=rosids RepID=Q9LQ30_ARATH 45 0.001 UniRef50_Q7G645 Os10g0330400 protein n=3 Tax=Oryza sativa RepID=... 43 0.005 UniRef50_B9SPX9 Electron transporter, putative n=2 Tax=fabids Re... 43 0.005 UniRef50_Q7URG7 Probable transcriptional regulator n=1 Tax=Rhodo... 42 0.011 UniRef50_Q9LS71 Emb|CAB72194.1 n=4 Tax=rosids RepID=Q9LS71_ARATH 42 0.014 UniRef50_C5YY61 Putative uncharacterized protein Sb09g020680 n=4... 41 0.016 UniRef50_O68558 Putative uncharacterized protein (Fragment) n=1 ... 41 0.016 UniRef50_B8C1S9 Predicted protein n=1 Tax=Thalassiosira pseudona... 40 0.039 >UniRef50_A7ZR71 UPF0301 protein yqgE n=133 Tax=Gammaproteobacteria RepID=YQGE_ECO24 Length = 187 Score = 384 bits (986), Expect = e-106, Method: Compositional matrix adjust. Identities = 187/187 (100%), Positives = 187/187 (100%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE Sbjct: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ Sbjct: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM Sbjct: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 Query: 181 PGVAGHA 187 PGVAGHA Sbjct: 181 PGVAGHA 187 >UniRef50_A1JPT5 UPF0301 protein YE3428 n=22 Tax=Gammaproteobacteria RepID=Y3428_YERE8 Length = 187 Score = 271 bits (693), Expect = 8e-72, Method: Compositional matrix adjust. Identities = 125/187 (66%), Positives = 154/187 (82%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQHHFLIAMP+LQDP F RSV+YICEHN GAMG+++NKP+E +E +L+KLKI+P Sbjct: 1 MNLQHHFLIAMPSLQDPHFMRSVIYICEHNKEGAMGLVINKPMEQFTVETVLKKLKISPT 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PRD SIRLDK V+ GGPLAEDRGFILH+P F SSI IS +T++TTS+DVLETLGT +Q Sbjct: 61 PRDPSIRLDKAVLAGGPLAEDRGFILHSPQEGFGSSIPISPDTMITTSKDVLETLGTPEQ 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++LVALGYA W++GQLEQE+LDNAWLT AD +ILF TPIA+RW+ AA +G++I + Sbjct: 121 PKNLLVALGYAGWQQGQLEQELLDNAWLTIEADTHILFNTPIAERWQAAANKLGINIFNI 180 Query: 181 PGVAGHA 187 AGHA Sbjct: 181 APQAGHA 187 >UniRef50_Q87LK0 UPF0301 protein VP2612 n=11 Tax=Gammaproteobacteria RepID=Y2612_VIBPA Length = 187 Score = 206 bits (525), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 92/188 (48%), Positives = 136/188 (72%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL +HFL+AMP ++DP F+ SV+Y+CEHN GAMG+++N P++ + + +L+++ + P Sbjct: 1 MNLTNHFLVAMPGMKDPYFQNSVIYVCEHNEEGAMGLMINAPVD-ITVGNMLKQVDVQPV 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 PR LD+PV GGP++EDRGFILH P + SSI+++D+ +TTSRD+L LGT+ Sbjct: 60 HPRLFEASLDRPVYNGGPISEDRGFILHKPKDYYESSIQMTDDLAVTTSRDILSVLGTEA 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +PSD LVALGY+ W GQLE E+++N+WLT A I+F TPI +RW++A + +G+D Sbjct: 120 EPSDYLVALGYSGWSAGQLENELVENSWLTIEATPEIIFDTPITERWKKAVEKLGIDPSQ 179 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 180 LSADAGHA 187 >UniRef50_B6EMV3 UPF0301 protein VSAL_I0547 n=39 Tax=Gammaproteobacteria RepID=Y547_ALISL Length = 187 Score = 203 bits (517), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 88/188 (46%), Positives = 140/188 (74%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI-TP 59 M+L++HFL+AMP++ DP+F RSV+YICEH+++G MG+ +N+P++ + ++G+L+++K+ P Sbjct: 1 MDLKNHFLVAMPSMNDPVFTRSVIYICEHDSDGTMGLRINQPVQ-ISLKGMLDQIKLDNP 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P L +PV+ GGP+++DRGF+LH P N++SSI +++ +TTS+D+L TLGT+ Sbjct: 60 SPIIFPQTLSQPVLNGGPVSDDRGFVLHYPKDNYSSSIEVTEELSVTTSKDILATLGTED 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 QP LVALGY+ W+ GQLEQE+ +N WL AD +++F TPI DRWR A +++G+ + Sbjct: 120 QPYKYLVALGYSGWDAGQLEQELSENTWLILEADSSVIFDTPIPDRWRRAIEILGISPVN 179 Query: 180 MPGVAGHA 187 + GHA Sbjct: 180 ISSEVGHA 187 >UniRef50_Q7MHK0 UPF0301 protein VV2869 n=47 Tax=Gammaproteobacteria RepID=Y2869_VIBVY Length = 187 Score = 199 bits (507), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 89/188 (47%), Positives = 135/188 (71%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL +HFL+AMP ++DP F+ SV+YICEHN GAMG+++N P++ + + +LE++ + P Sbjct: 1 MNLTNHFLVAMPGMKDPYFQHSVIYICEHNEEGAMGLMINAPID-ITVGKMLEQVDVQPV 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P+ + L KPV GGP+AEDRGFILH P + SS+++++ +TTS+D+L LGT+ Sbjct: 60 HPQLNTSSLTKPVYNGGPVAEDRGFILHRPKDFYESSLQMTEQISVTTSKDILTVLGTEA 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +PS +VALGY+ W GQLE E+ +N+WLT A+ +I+F TPIA RW++A +++G+ Sbjct: 120 EPSSYIVALGYSGWSAGQLEAELAENSWLTVEANPDIIFDTPIAMRWQKAVQMLGIHASQ 179 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 180 LSDQAGHA 187 >UniRef50_B8E9P8 UPF0301 protein Sbal223_1344 n=7 Tax=Shewanella RepID=Y1344_SHEB2 Length = 187 Score = 192 bits (487), Expect = 7e-48, Method: Compositional matrix adjust. Identities = 87/186 (46%), Positives = 126/186 (67%), Gaps = 1/186 (0%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ+HFLIAMP+L D F RSV+YICEH+ GAMG+++NKPL +++ +LE++ + E Sbjct: 3 SLQNHFLIAMPSLHDTFFERSVIYICEHDAKGAMGLVINKPL-GIEVNSLLEQMDLPAEQ 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + + VM+GGP+++DRGF+LHT +A+S + ++TTSRDVL +G+++ P Sbjct: 62 VSTDLAFNANVMMGGPVSQDRGFVLHTSQPYWANSTDLGCGLMLTTSRDVLTAIGSNRSP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGYA W K QLEQE+ DN+WLT PA +LF DRW +A++ +G D + Sbjct: 122 EKFLVALGYAGWSKDQLEQELADNSWLTIPATNALLFDIKHEDRWPQASRALGFDAWQVS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 AQAGHA 187 >UniRef50_Q21EI7 UPF0301 protein Sde_3637 n=7 Tax=Proteobacteria RepID=Y3637_SACD2 Length = 203 Score = 186 bits (471), Expect = 5e-46, Method: Compositional matrix adjust. Identities = 86/187 (45%), Positives = 125/187 (66%), Gaps = 5/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 ++L+ HFLIAMP LQDPIF RS+ YIC+H GAMGI+VN+P+ NL + I E+L++ Sbjct: 22 VSLRDHFLIAMPGLQDPIFSRSLTYICDHTAQGAMGIVVNQPM-NLTLGDIFEQLEL--- 77 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 +D++ + + V+ GGP+ +RGF+LH + S++ I+ + +T SRD++ + + Sbjct: 78 -QDKAQQAGRAVLAGGPVNTERGFVLHRDSGAWESTMHIAPDVNLTASRDIVHAIANNTG 136 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P L ALGYA W GQLE+EI N+WLT PAD +I+F P+ DRW AA+ +G+DI M Sbjct: 137 PKSSLFALGYAGWSAGQLEEEISANSWLTIPADSSIIFDIPVEDRWAAAARQLGIDIHLM 196 Query: 181 PGVAGHA 187 AGHA Sbjct: 197 SATAGHA 203 >UniRef50_A1RHM8 UPF0301 protein Sputw3181_1330 n=13 Tax=Proteobacteria RepID=Y1330_SHESW Length = 187 Score = 185 bits (469), Expect = 8e-46, Method: Compositional matrix adjust. Identities = 86/186 (46%), Positives = 124/186 (66%), Gaps = 1/186 (0%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ+HFLIAMP+L D F RSV+Y+CEH+ GAMGI++NKPL +++ +LE++ + E Sbjct: 3 SLQNHFLIAMPSLDDTFFERSVIYLCEHDDKGAMGIVINKPL-GIEVSSLLEQMDLPAEQ 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 I + V++GGP+++DRGF+LHT +A+S + ++TTSRDVL +G + P Sbjct: 62 VFADIAQNAQVLMGGPVSQDRGFVLHTSQPYWANSTDLGSGLMLTTSRDVLTAIGGKRSP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGYA W K QLEQE+ +N+WLT PA +LF DRW +A++ +G D + Sbjct: 122 DKFLVALGYAGWGKHQLEQELAENSWLTIPATNALLFDVKHEDRWPQASRSLGFDAWQVS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 AQAGHA 187 >UniRef50_Q605E8 UPF0301 protein MCA2336 2 n=3 Tax=Gammaproteobacteria RepID=Y2336_METCA Length = 188 Score = 182 bits (463), Expect = 3e-45, Method: Compositional matrix adjust. Identities = 87/185 (47%), Positives = 128/185 (69%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L +HFLIAMP L DP F ++V +C+HN +GA+GII+N+P E LK+ I+ +++I + Sbjct: 8 LANHFLIAMPGLTDPHFAKTVTLVCQHNADGALGIIINRPSE-LKLSDIMRQMEIDLKVA 66 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + D PV GGP+ +RGFILH P + +AS++ +S+ +TTSRD+LE +G + P Sbjct: 67 ELG---DLPVFFGGPVHPERGFILHEPATVWASTLVVSERLALTTSRDILEAVGRGEGPR 123 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +L+ALGYA W +GQLE+EI+DN+WL AP+D ++F+ P RW+ AA L+GVDI + Sbjct: 124 RMLLALGYAGWGQGQLEREIIDNSWLNAPSDNAVIFEHPPGRRWKAAADLVGVDISLLTS 183 Query: 183 VAGHA 187 AGH Sbjct: 184 QAGHG 188 >UniRef50_A1U764 Putative uncharacterized protein n=3 Tax=Marinobacter RepID=A1U764_MARAV Length = 188 Score = 180 bits (456), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 82/186 (44%), Positives = 126/186 (67%), Gaps = 7/186 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+HHFL+A P L DP F V+Y+CEH+ GA+G+++N+PL ++ + ILE+L + Sbjct: 10 SLRHHFLVASPWLADPRFHGGVIYLCEHSEEGALGLMINQPL-DIHLGEILEQLDM---- 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 LD PV GGP+ +RGF+LH+P + ++ R++D ++TTSRD+LE++G D+ P Sbjct: 65 --HGGELDLPVYTGGPVQPERGFVLHSPGRQWQNTARVTDEVLLTTSRDILESIGRDEGP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGY+ W +GQLE+E+ NAWLT PA +ILF+TP R++ +L+G+D+ + Sbjct: 123 ESFLVALGYSGWGEGQLEEELGSNAWLTCPASTDILFRTPADQRYQAVLRLMGIDLNQLS 182 Query: 182 GVAGHA 187 GHA Sbjct: 183 DSVGHA 188 >UniRef50_D0L0G7 Putative uncharacterized protein n=4 Tax=Gammaproteobacteria RepID=D0L0G7_HALNC Length = 216 Score = 179 bits (455), Expect = 4e-44, Method: Compositional matrix adjust. Identities = 84/187 (44%), Positives = 127/187 (67%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L++ LIAMP+L DP F +V Y+CEHN +GAMGI +N+PL ++ + I + +KI+ Sbjct: 34 IQLKNQILIAMPSLDDPNFNHTVTYVCEHNEDGAMGITINRPL-DVTLGDIFDHMKISCS 92 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + SIR +PV +GGP+A +RGF+LHTP + S++ I+D +TTS+D+L+ L Sbjct: 93 --NPSIR-GRPVFMGGPVALERGFVLHTPHGGWESTLEITDEIGLTTSKDILQALAEGAG 149 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P+ ++ALGY+ W +GQLEQE+ DN WLT A ++F P+ +RW AA+ +GVD+ + Sbjct: 150 PARAVIALGYSGWSEGQLEQELADNTWLTVAATTELIFDYPVEERWAAAARSLGVDMNLL 209 Query: 181 PGVAGHA 187 G AGHA Sbjct: 210 SGEAGHA 216 >UniRef50_Q478W0 UPF0301 protein Daro_3893 n=3 Tax=Betaproteobacteria RepID=Y3893_DECAR Length = 186 Score = 179 bits (455), Expect = 4e-44, Method: Compositional matrix adjust. Identities = 86/187 (45%), Positives = 126/187 (67%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL +FLIAMP L+DP F ++VYICEHN NGA+GIIVN+P++ + + +LEK+ I E Sbjct: 4 VNLTDNFLIAMPTLEDPYFSNALVYICEHNENGALGIIVNRPID-MNLASLLEKIDIKLE 62 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + D PV GGP+ DRGF+LH P + S++ I+ + +T+SRDVL ++G+ Sbjct: 63 AENLA---DMPVYFGGPVQLDRGFVLHRPIGQWQSTLAINSDVGLTSSRDVLSSVGSAGL 119 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P+++LV LGYA W+ GQLE+E+ N+WLT PA +ILF P +R A + +G+ + Sbjct: 120 PAEILVTLGYAGWDAGQLEEELAQNSWLTVPAKASILFDLPPEERLPAAMQKLGISFTQL 179 Query: 181 PGVAGHA 187 VAGHA Sbjct: 180 SDVAGHA 186 >UniRef50_Q1YVF9 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YVF9_9GAMM Length = 192 Score = 176 bits (445), Expect = 4e-43, Method: Compositional matrix adjust. Identities = 90/188 (47%), Positives = 127/188 (67%), Gaps = 8/188 (4%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ HFL+AMP L+DP F SVVYICEHN++GAMG+I+N+ + ++ ++ I ++LK+ E Sbjct: 11 SLKDHFLLAMPGLEDPTFSDSVVYICEHNSDGAMGLIINQQM-DIPVKAIFDQLKL--EY 67 Query: 62 RDESIRLDKPVML-GGPLAEDRGFILHT-PPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 +DE R P++ GGP+ DRGFILH + S++ ISD +T SRD+L + K Sbjct: 68 QDECGR---PLLFDGGPVQRDRGFILHANCEQQWESTLMISDQVCLTASRDILSDMALGK 124 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P D LV LGY+SWE GQLE+E+ +N+WLT PA+ I+FKT A R AA IG+D+ Sbjct: 125 GPKDSLVTLGYSSWEAGQLERELGENSWLTIPAEAEIIFKTDCAKRASAAALSIGLDLRM 184 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 185 LSHQAGHA 192 >UniRef50_A1TKL7 UPF0301 protein Aave_0907 n=10 Tax=Comamonadaceae RepID=Y907_ACIAC Length = 213 Score = 174 bits (442), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 90/210 (42%), Positives = 126/210 (60%), Gaps = 27/210 (12%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP L+D F RSVVY+CEH+ GA+G+I+NKP +L ++G+ +K+ ++ Sbjct: 8 MNLTHHFLIAMPGLEDESFARSVVYLCEHSERGALGLIINKP-SDLSLKGLFDKVDLSLR 66 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILH-----------------------TPPSNFASSI 97 D S+ +PV GGP+ +RGF+LH S +AS++ Sbjct: 67 REDLSL---EPVFRGGPVQTERGFVLHEAMGPSSGKQAAGEGGAQAEGEGAEESAYASTM 123 Query: 98 RISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL 157 I MTTS+DVLE L T P VLV LGY+SW +GQLE E+ +N+WLT ADL+++ Sbjct: 124 SIPGGLEMTTSKDVLEALSTGAGPRRVLVTLGYSSWGEGQLESELAENSWLTVGADLSVI 183 Query: 158 FKTPIADRWREAAKLIGVDILTMPGVAGHA 187 F TP+ R+ A L+G+ + AGHA Sbjct: 184 FDTPVGQRYDRALALLGLQSWMLSPEAGHA 213 >UniRef50_A1KUG1 UPF0301 protein NMC1274 n=27 Tax=Neisseriaceae RepID=Y1274_NEIMF Length = 182 Score = 172 bits (437), Expect = 4e-42, Method: Compositional matrix adjust. Identities = 83/188 (44%), Positives = 123/188 (65%), Gaps = 7/188 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL +HFL+AMP ++D F +SVVYIC+H+ +GA+GI +NKP I + + Sbjct: 1 MNLSNHFLVAMPDMEDAFFSQSVVYICKHDEDGALGIAINKP------SPITMDMIFSAT 54 Query: 61 PRDESIRLDK-PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 ++ +R+ VM+GGP+ +RG+++HTP N+ SSI +SDN +T+SRDV+E + + Sbjct: 55 GKNIPMRMQHDSVMMGGPVQVERGYVVHTPIGNWQSSIGVSDNIALTSSRDVIENISREG 114 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 L+++GY+SW KGQLE+E+ DNAWLT PAD +ILF P R+ A +G+D L Sbjct: 115 AVDKALISIGYSSWGKGQLERELADNAWLTVPADEHILFDIPYEHRYAAAFAKLGIDPLA 174 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 175 LFSGAGHA 182 >UniRef50_Q486M0 UPF0301 protein CPS_1252 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y1252_COLP3 Length = 210 Score = 172 bits (437), Expect = 4e-42, Method: Compositional matrix adjust. Identities = 86/211 (40%), Positives = 131/211 (62%), Gaps = 28/211 (13%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L++ LIAMP+L DP F ++V YICEHN +GAMG+I+N P+ N+ + +L++ I P+ Sbjct: 3 SLENQLLIAMPSLGDPYFNKTVTYICEHNEDGAMGLIINLPV-NITLADLLKQ--IEPDE 59 Query: 62 RDESIR-------------------------LDKPVMLGGPLAEDRGFILHTPPSNFASS 96 D++ L++ V+ GGP+A+ RGF+LH+ ++SS Sbjct: 60 GDKTGNVNSNSELTKSDDVNDITLVTDITNSLEQLVLAGGPIAQQRGFVLHSSQPGWSSS 119 Query: 97 IRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNI 156 + +S ++TTS+D+L LGT + P +V LGYA W GQLEQE+ N+WLT PAD+ I Sbjct: 120 LVLSKELMITTSKDILMALGTQQAPEQFIVTLGYAGWGPGQLEQELQANSWLTTPADIEI 179 Query: 157 LFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 LFKTPI RW++A + +G+D+ + GHA Sbjct: 180 LFKTPIEQRWKKATEKLGIDLAHLSTDIGHA 210 >UniRef50_B9MF60 UPF0301 protein Dtpsy_2896 n=28 Tax=Proteobacteria RepID=Y2896_DIAST Length = 199 Score = 172 bits (435), Expect = 6e-42, Method: Compositional matrix adjust. Identities = 86/196 (43%), Positives = 126/196 (64%), Gaps = 13/196 (6%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP ++D F RSVVY+CEH+ GA+G+I+NKP + +EG+ EK+ ++ Sbjct: 8 MNLTHHFLIAMPGVEDASFSRSVVYLCEHSERGALGLIINKPT-PISLEGLFEKVDLSLG 66 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP---------PSNFASSIRISDNTVMTTSRDV 111 D ++ +PV GGP+ +RGF+LH S +AS++ I MTTS+DV Sbjct: 67 REDLTL---QPVFQGGPVQTERGFVLHEAMRGPQESEDESPYASTMTIPGGLEMTTSKDV 123 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 LE L P VLV LGY++W +GQLE E+ +N+WLT AD++++F+TP+ +R+ A Sbjct: 124 LEALAHGAGPRRVLVTLGYSAWGEGQLESELAENSWLTVGADVSVIFETPVQERYDRALG 183 Query: 172 LIGVDILTMPGVAGHA 187 L+G+ + AGHA Sbjct: 184 LLGLQSWMLSPEAGHA 199 >UniRef50_Q2P5W3 UPF0301 protein XOO1309 n=20 Tax=Xanthomonadaceae RepID=Y1309_XANOM Length = 188 Score = 171 bits (433), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 85/185 (45%), Positives = 121/185 (65%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L + LIA+PAL DP F RSV IC+H+ NGAMG++VN+P E E +L ++ I + Sbjct: 8 LANQLLIALPALSDPTFSRSVALICQHDENGAMGVLVNRPSEYTLGE-VLSQMGIDTD-- 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 DE +R ++ V+ GGP+ +RGF++H + SS+ + +TTSRD+LE + P Sbjct: 65 DEPLR-EQIVLSGGPVHPERGFVIHDDAREWDSSLEVGQGVFLTTSRDILEAMAAGNGPR 123 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +VLVALG A W GQLE E+ +N+WLTAP+D N+LF T + DRW+ AA IGVD+ + Sbjct: 124 NVLVALGCAGWGAGQLEFELGENSWLTAPSDANVLFATALEDRWQTAAGRIGVDLFRLTD 183 Query: 183 VAGHA 187 +GHA Sbjct: 184 YSGHA 188 >UniRef50_Q1MZ13 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MZ13_9GAMM Length = 185 Score = 169 bits (427), Expect = 5e-41, Method: Compositional matrix adjust. Identities = 82/186 (44%), Positives = 121/186 (65%), Gaps = 6/186 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL+HH L+AMP+L DP F SV YIC+HN G+MG+++NKP+ +++ +L +L I Sbjct: 6 NLKHHLLLAMPSLSDPYFGHSVCYICDHNEQGSMGLVLNKPM-GIELTDVLSELDIE--- 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 D+ I P++ GGP++ ++GF+L+ + ++ I+ + +TTS+D+L L P Sbjct: 62 TDKPIHF--PILQGGPVSPEQGFVLYRGSESELQNMVINGDIRLTTSKDILSQLALGSGP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 DV + LGYA WE GQLEQE++ NAWLT PAD +LF TP+ +AA IGVD+ + Sbjct: 120 DDVRICLGYAGWEAGQLEQELIQNAWLTVPADEELLFHTPMDQMLEKAASRIGVDMSLIS 179 Query: 182 GVAGHA 187 G AGHA Sbjct: 180 GEAGHA 185 >UniRef50_A1SUE8 Putative uncharacterized protein n=2 Tax=Psychromonas RepID=A1SUE8_PSYIN Length = 197 Score = 168 bits (425), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 81/188 (43%), Positives = 123/188 (65%), Gaps = 9/188 (4%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGIL---EKLKITP 59 L+ HFLIAMP+L DP F+ SVVYICEH+ GAMG I+N P++ L ++ +L + + P Sbjct: 16 LKDHFLIAMPSLNDPYFKHSVVYICEHDEKGAMGFIINFPVK-LTLQELLNNVDSIDHYP 74 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 EP L PV LGGPL +RGF+LH+P ++ + S +++D +++ S +L TLGT+ Sbjct: 75 EPP-----LLNPVFLGGPLELERGFVLHSPVTDNSQSTKLNDQLMVSNSNAILSTLGTEN 129 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +P + +V LGYASW GQLE+E+ DN W++ + +I+F TP+ RW E+ + +G+ Sbjct: 130 EPEEYIVTLGYASWSSGQLEKEMNDNHWISMESQNDIIFSTPVEQRWIESLQRLGIHPEQ 189 Query: 180 MPGVAGHA 187 + GHA Sbjct: 190 LSTEIGHA 197 >UniRef50_C6NYY5 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NYY5_9GAMM Length = 185 Score = 167 bits (424), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 82/186 (44%), Positives = 122/186 (65%), Gaps = 5/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L++H LIAMP L D +F RSV+ ICEH+ GAMG+++N+ L ++ + LE + ITP P Sbjct: 5 SLKNHLLIAMPNLHDGMFDRSVIVICEHSPEGAMGLVINR-LLDISLAKALEAVNITP-P 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 D + KPV GGP+ GFILH ++ S+ + + +T+S D+L + + P Sbjct: 63 EDAA---QKPVFWGGPVQPQHGFILHEGAGDWQVSMAVGEGLFLTSSPDILMAIAEHRGP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGYA W +GQLEQE+ +N+WL P DL++LF+ P A+RW+ AA+ +GVD+ + Sbjct: 120 ERFLLALGYAGWGEGQLEQELSENSWLHGPIDLSVLFELPPAERWQAAARGLGVDMRLLS 179 Query: 182 GVAGHA 187 G AGHA Sbjct: 180 GAAGHA 185 >UniRef50_B8KLD5 Putative uncharacterized protein n=3 Tax=Proteobacteria RepID=B8KLD5_9GAMM Length = 208 Score = 167 bits (424), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 85/186 (45%), Positives = 118/186 (63%), Gaps = 6/186 (3%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFL+AMP L +F S+ Y+CEH GAMG+++N+PL+ L + I + L I Sbjct: 28 LRDHFLLAMPGLDAGLFSGSITYLCEHGEAGAMGLVINQPLD-LSLGEIFDHLDIAA--- 83 Query: 63 DESIRLDKPVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 D R D+PV+ GGP+ D GF+LH + + SS+R++D +TTSRDVL+ + + P Sbjct: 84 DAHFR-DQPVLAGGPVQIDHGFVLHPSGGKRWDSSLRVTDEVQLTTSRDVLKAIACGEGP 142 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D +V LGYA W GQLE+EI +N+WLT PAD I+F T I DR AA +G+D+ M Sbjct: 143 RDFVVTLGYAGWSAGQLEEEIANNSWLTLPADKRIIFHTAIEDRVAAAASALGIDMNLMS 202 Query: 182 GVAGHA 187 AGHA Sbjct: 203 AQAGHA 208 >UniRef50_Q5WYW5 UPF0301 protein lpl0620 n=6 Tax=Legionella RepID=Y620_LEGPL Length = 187 Score = 166 bits (421), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 77/189 (40%), Positives = 120/189 (63%), Gaps = 10/189 (5%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L + LIAMP+L+DP F RSVVY+CEHN G++G+I+N+PL+ + + E+L+I P Sbjct: 6 SLANQLLIAMPSLKDPNFERSVVYLCEHNEQGSVGLIINRPLQ-FPLSIVFEQLQIEP-- 62 Query: 62 RDESIRLDK---PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 IR++K P++ GGP+ +RGF++H + SS+ + D +TTS D++ + D Sbjct: 63 ----IRVEKNGLPLLFGGPVQPERGFVIHKQMGGWRSSLFLQDEVTVTTSNDIIRAIAYD 118 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 + P DVL+ LGYA+W + QLE+EI+ N WL P IL++ P +RW A +G+ + Sbjct: 119 EGPKDVLITLGYAAWTEQQLEREIMSNTWLVCPYKSEILYEVPFEERWEYAGLTLGIKMN 178 Query: 179 TMPGVAGHA 187 + AGHA Sbjct: 179 QLSSDAGHA 187 >UniRef50_A8PPF4 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PPF4_9COXI Length = 195 Score = 166 bits (420), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 74/188 (39%), Positives = 118/188 (62%), Gaps = 3/188 (1%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKI---EGILEKLKITP 59 ++FL+AMP L D F RSVVYICEH GA+GI++N+PL++L + E + E + Sbjct: 8 FTNYFLVAMPILTDAYFSRSVVYICEHTEKGAVGIVINQPLQSLHVNLAEIVQEITESNL 67 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + + + P++ GGP+ +RGF++H P + SS++++ +TTS+D+L + + Sbjct: 68 KSTKTTAGANFPILCGGPIHPERGFVIHAPSGAWQSSLKMNSEISVTTSKDILLAIAKQQ 127 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P + +LGYA+W GQ+EQEI++N WLT PA+ N+LF P RW +A +GVD+ Sbjct: 128 GPEKFIFSLGYANWIAGQMEQEIINNFWLTLPANPNLLFDVPFEQRWLKAMDYLGVDVTK 187 Query: 180 MPGVAGHA 187 + + GHA Sbjct: 188 LAYMGGHA 195 >UniRef50_Q3SFS4 UPF0301 protein Tbd_2579 n=2 Tax=Proteobacteria RepID=Y2579_THIDA Length = 185 Score = 163 bits (412), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 78/188 (41%), Positives = 119/188 (63%), Gaps = 7/188 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-P 59 +NL +HFLIAMP + DP F ++ YIC+H+ GA+G++VN+P++ L + + E++ ++ P Sbjct: 4 VNLTNHFLIAMPGMVDPNFNGTLTYICDHSDQGALGVVVNRPID-LDLSTLFEQIGLSLP 62 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 E I V GGP+ +RGF+LHTPP F+S++ ++D +TTS+DVLE + Sbjct: 63 EGLHGEI-----VYFGGPVQTERGFVLHTPPLTFSSTLTVNDAVSLTTSKDVLEAVSQGA 117 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P +V+LGYA W GQLE E+ NAWL+ AD ++F +R A KL+G+D + Sbjct: 118 GPEKFIVSLGYAGWSAGQLEDELKQNAWLSVAADPQVIFDLAPEERLPAAMKLLGIDFAS 177 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 178 LSDEAGHA 185 >UniRef50_Q0I1B4 UPF0301 protein HS_0009 n=26 Tax=Pasteurellaceae RepID=Y009_HAES1 Length = 187 Score = 160 bits (404), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 81/177 (45%), Positives = 111/177 (62%), Gaps = 4/177 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQ HFLIAMP L+D F+RSVVYICE+N G+MG+++ + + L I + K+ Sbjct: 1 MNLQDHFLIAMPHLEDENFQRSVVYICENNEQGSMGLVLTQATD-LSIAELCAKMNFMM- 58 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTTSRDVLETLGTDK 119 DE DK V+LGGP+ + GFILH + F S +++D +TTS D++ T GT + Sbjct: 59 -ADEREYSDKLVLLGGPVNLEHGFILHKKTAQEFQHSYKVTDQIYLTTSADIINTFGTAQ 117 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 P LV LG A WE QLE EI +N WL PAD +ILF I++RW A +L+G++ Sbjct: 118 SPEKYLVTLGCARWEPNQLENEIANNDWLVVPADEDILFDVDISERWFAANQLLGIE 174 >UniRef50_A4BI12 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BI12_9GAMM Length = 183 Score = 159 bits (403), Expect = 4e-38, Method: Compositional matrix adjust. Identities = 80/188 (42%), Positives = 119/188 (63%), Gaps = 6/188 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL HHFLIAMP + DP+F ++ Y+ +H+ GA+G+IVN+PL NL +E + E +++ Sbjct: 1 MNLNHHFLIAMPQMGDPVFSGTLTYLVQHDEQGALGLIVNRPL-NLNLEEVFESSELSGY 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 PR S KPV GGP+A+++GFILH P S ++ V+TTSRD+LE + D+ Sbjct: 60 SPRTGS----KPVYHGGPVAQEQGFILHPPTEQTWISSLSNEQLVLTTSRDMLEAIAQDE 115 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P L LGY+ W GQLE+E+ +NAWLT A+ I+F+ +++ A +G+D+ T Sbjct: 116 GPERFLFCLGYSGWSPGQLEEELKENAWLTVEANEAIIFQDDEVGKYQHALSDLGIDLAT 175 Query: 180 MPGVAGHA 187 + G G A Sbjct: 176 LSGHGGLA 183 >UniRef50_Q3IZ52 UPF0301 protein RHOS4_26140 n=6 Tax=Rhodobacteraceae RepID=Y2614_RHOS4 Length = 184 Score = 159 bits (402), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 81/188 (43%), Positives = 114/188 (60%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M+L LIAMP++ DP F RS+V IC H+ +GAMG++VNKP+E+L G+LE+L I Sbjct: 1 MDLSGSLLIAMPSMADPRFERSLVLICAHSPDGAMGLVVNKPVEDLSFAGMLEQLNIPRA 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDK 119 P IR V LGGP+ RGF+LH+P + +++ +S MT + D+LE L + Sbjct: 61 PNGRDIR----VHLGGPMERGRGFVLHSPDYMSVGATMLVSGKFGMTATVDILEALARGQ 116 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 PS L+ALGY+ W GQLE E+ N WLTA A ++F +W + +G+D LT Sbjct: 117 GPSSALMALGYSGWGPGQLEAEVQRNDWLTAEAPSELVFSDDDPGKWTGMLRHMGIDPLT 176 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 177 LSSTAGHA 184 >UniRef50_B8FL92 UPF0301 protein Dalk_3037 n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=Y3037_DESAA Length = 189 Score = 158 bits (400), Expect = 8e-38, Method: Compositional matrix adjust. Identities = 73/186 (39%), Positives = 118/186 (63%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L FLIAMPAL DP F SV YIC HN +GA G+++N+ +++ + + +++ + Sbjct: 8 SLAGQFLIAMPALNDPNFALSVTYICVHNQDGAFGLVINQSFDSVTGKTLFDQMDMPAVK 67 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E ++ V +GGP+ + F+LH P + +S++ISD M+ S D+L+ + + P Sbjct: 68 AAE----NQTVHIGGPVHQGYVFVLHGRPMEWKASLQISDTVAMSNSTDILQAIASGVGP 123 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++ LG A W GQLE E+ +N+WLT P + ++LF+TP+ +RW +AA+ IGVD+ + Sbjct: 124 DPCMIFLGCAGWAPGQLEAELAENSWLTCPGNDDLLFRTPLEERWEKAAQSIGVDLNLLS 183 Query: 182 GVAGHA 187 GVAGHA Sbjct: 184 GVAGHA 189 >UniRef50_A9KDE7 UPF0301 protein CBUD_2193 n=7 Tax=Coxiella burnetii RepID=Y2193_COXBN Length = 181 Score = 157 bits (397), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 79/186 (42%), Positives = 118/186 (63%), Gaps = 12/186 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI-TPEP 61 L +HFL+AMP L D F ++V+Y+ +H+ GA+GII+N+PL L + +LE L I +P Sbjct: 7 LSNHFLVAMPQLNDFTFTKAVIYVSQHDAKGALGIIINRPLA-LTLGKVLEHLNIEIAQP 65 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + + PV++GGP+ ++ GFI++ S + I +++ S+D+L+ + +K P Sbjct: 66 QIA----NHPVLMGGPIGQEHGFIVYEQESPQGAEI------LLSASKDMLDDIAKNKGP 115 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D L+ LGYA WE GQLE EI N WL P + ILF+TP+ RW++AA LIGVDI + Sbjct: 116 DDFLITLGYAGWEAGQLENEIARNDWLVVPFNRKILFETPLKSRWQKAAALIGVDINQLS 175 Query: 182 GVAGHA 187 G GHA Sbjct: 176 GQIGHA 181 >UniRef50_Q1BYL1 UPF0301 protein Bcen_0382 n=60 Tax=Betaproteobacteria RepID=Y382_BURCA Length = 192 Score = 157 bits (396), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 81/191 (42%), Positives = 117/191 (61%), Gaps = 10/191 (5%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEK--LKIT 58 +NL + FLIAMP + DP F +VVY+C+H+ GA+G+++N+P ++ +E + + LK+ Sbjct: 8 INLTNQFLIAMPNMADPTFSGTVVYLCDHSERGALGLVINRP-TDIDLESLFNRIDLKLD 66 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTP--PSNFASSIRISDNTVMTTSRDVLETLG 116 EP L PV GGP+ +RGF+LH P +++ SS+ + MTTS+DVLE + Sbjct: 67 IEPL-----LHIPVYFGGPVQTERGFVLHEPVEGASYNSSMSVDGGLEMTTSKDVLEAVA 121 Query: 117 TDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 T P L+ LG+A W GQLE+EI N WLT AD I+F TP +R+ A L+GV Sbjct: 122 TGTGPKRFLLTLGHAGWGAGQLEEEIARNGWLTVAADPRIVFDTPAEERFEAALGLLGVS 181 Query: 177 ILTMPGVAGHA 187 + G AGHA Sbjct: 182 SSMLSGEAGHA 192 >UniRef50_A5WBR3 UPF0301 protein PsycPRwf_0144 n=21 Tax=Moraxellaceae RepID=Y144_PSYWF Length = 188 Score = 155 bits (391), Expect = 7e-37, Method: Compositional matrix adjust. Identities = 78/187 (41%), Positives = 118/187 (63%), Gaps = 4/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL HHFLIA P++ D F +S+VYIC H+ +G +G++VN+P+ + ++ +L+ L I E Sbjct: 5 NLTHHFLIAAPSMPDERFAQSLVYICRHDRHGVLGLVVNRPIFDTQVGHLLDNLDI--EV 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 D S+ D P + GGP+ + GF+LHT +ASS IS+N +TTS+D+L+ + Sbjct: 63 TDTSVMYDTP-LDGGPVYPEVGFVLHTGQPTWASSFPISENVCITTSKDILQNIAAGSAG 121 Query: 122 -SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + LG+ASW +GQLE+EI WL +P DL++LF+ P +RWR AA+ IGV + + Sbjct: 122 IGHYHLCLGHASWHEGQLEKEISQGDWLVSPGDLSLLFEIPFEERWRHAAEKIGVHLDFL 181 Query: 181 PGVAGHA 187 G A Sbjct: 182 SDEVGRA 188 >UniRef50_B5JTR0 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JTR0_9GAMM Length = 187 Score = 154 bits (390), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 86/190 (45%), Positives = 117/190 (61%), Gaps = 7/190 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITP 59 +L +HFLIAMP L+DP F RSV IC H+ + GA+GI + + ++ +E +L++L I Sbjct: 2 QSLTNHFLIAMPDLEDPNFSRSVTLICHHSEDEGAIGITLTRATDH-SVEELLDQLDIQE 60 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA--SSIRISDNTVMTTSRDVLETLGT 117 + L P+ +GGP+ +DRGFILH + + ISD+ +T+S D+LE L Sbjct: 61 AKLAATHAL--PLYIGGPVEQDRGFILHPNRKEYQWEGTETISDHLAITSSLDILEDLAR 118 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 K P + L+ALGYA W GQLEQEI DNAWL PAD I+F P RW AA+ +GVDI Sbjct: 119 GKGPDNCLIALGYAGWSSGQLEQEITDNAWLHGPADPEIIFSLPAEQRWTAAAQSLGVDI 178 Query: 178 LTMPGVAGHA 187 + AGHA Sbjct: 179 RLIHS-AGHA 187 >UniRef50_A3MYV4 UPF0301 protein APL_0232 n=7 Tax=Pasteurellaceae RepID=Y232_ACTP2 Length = 186 Score = 154 bits (389), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 78/185 (42%), Positives = 117/185 (63%), Gaps = 4/185 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NLQ FLIA P + D F R+V+YICEHN+NGAMG+++N P + L + ++ ++ Sbjct: 4 NLQGKFLIATPEIDDDYFDRTVIYICEHNSNGAMGLVINTPTD-LSVLELITRMDFQM-A 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTTSRDVLETLGTDKQ 120 + D+ V+ GGP+++DRGFI+HT F S R++DN ++TTS DVL++LG + Sbjct: 62 NQRNYHKDQMVLSGGPVSQDRGFIIHTKTEQEFLHSYRVTDNILLTTSGDVLDSLGKPEA 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD-ILT 179 P +V LG A+W+ QLEQEI N WL + A+ LF+T +RW EA +++G+ +L Sbjct: 122 PEKFIVCLGCATWKPEQLEQEIARNYWLISEANDKTLFETGYLERWVEANEMLGISGVLA 181 Query: 180 MPGVA 184 G A Sbjct: 182 RAGRA 186 >UniRef50_A4A9E0 Protein containing DUF179 n=1 Tax=Congregibacter litoralis KT71 RepID=A4A9E0_9GAMM Length = 173 Score = 152 bits (383), Expect = 7e-36, Method: Compositional matrix adjust. Identities = 81/178 (45%), Positives = 109/178 (61%), Gaps = 6/178 (3%) Query: 11 MPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDK 70 MP L +F S+ YICEH GAMGI++N+PL +L + I + L+I PR + D+ Sbjct: 1 MPGLDSGLFSGSITYICEHGEAGAMGIVINQPL-DLSLGEIFDHLEIDCAPRFQ----DQ 55 Query: 71 PVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALG 129 V+ GGP+ D GF+LH + SS+R++ +TTSRDVL + + P D VALG Sbjct: 56 VVLAGGPVQIDHGFVLHPRGEQTWDSSLRVTPEVQLTTSRDVLSAIAAGEGPKDYAVALG 115 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 YA W GQLE+EI +N+WLT PAD I+F T I DR AA +G+D+ M AGHA Sbjct: 116 YAGWSAGQLEEEIANNSWLTLPADKRIIFHTAIEDRVAAAAAALGIDMNLMSAEAGHA 173 >UniRef50_Q0AMH8 Putative uncharacterized protein n=3 Tax=Hyphomonadaceae RepID=Q0AMH8_MARMM Length = 195 Score = 151 bits (381), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 75/189 (39%), Positives = 117/189 (61%), Gaps = 11/189 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI---TP 59 L LIA PA+ DP F R+V+ +C+H GAMGII+NKP L++ + E+L++ P Sbjct: 14 LGGKLLIATPAIGDPRFDRAVILVCDHTAEGAMGIIINKPAAGLRLPELFEQLEVDSSQP 73 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPP-SNFASSIRISDNTVMTTSRDVLETLGTD 118 P D PV++GGP+ +DRGF+LHT +N +++ I+D +T ++DVLE + +D Sbjct: 74 AP-------DGPVLVGGPVDKDRGFVLHTRDYANDEATLPINDRIGLTATKDVLEAMASD 126 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P L+ALGY+ W GQL+ E++ NAWL D ++F+T AD+W A + +G+ Sbjct: 127 SPPQRSLLALGYSGWAAGQLDDELVANAWLVCDMDEQLVFETDDADKWPRALECLGISPE 186 Query: 179 TMPGVAGHA 187 + ++GHA Sbjct: 187 HLSALSGHA 195 >UniRef50_C3K3J9 UPF0301 protein PFLU_5755 n=5 Tax=cellular organisms RepID=Y5755_PSEFS Length = 189 Score = 148 bits (374), Expect = 8e-35, Method: Compositional matrix adjust. Identities = 77/187 (41%), Positives = 112/187 (59%), Gaps = 8/187 (4%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLK--ITPE 60 L+H FLIAMP + DP F +++ YI EH GAMG+++N+P E L + ILE+L+ + P Sbjct: 9 LKHQFLIAMPHMADPNFAQTLTYIVEHTAKGAMGLVINRPQE-LNLADILEQLRPEVDPP 67 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 R + + P+ +GGP+ DRGF+LH F +++ + + ++TS+DVL + Sbjct: 68 ARCQGV----PIYIGGPVQTDRGFVLHPTGPKFQATVDL-EGVSLSTSQDVLFAIADGVG 122 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++ LGYA WE GQLE E+ NAWLT P D ILF TP R AA + V++ + Sbjct: 123 PEQSVITLGYAGWEAGQLEAELASNAWLTCPFDAEILFNTPSELRLEAAAAKLRVNLNLL 182 Query: 181 PGVAGHA 187 AGHA Sbjct: 183 TSQAGHA 189 >UniRef50_Q163D2 UPF0301 protein RD1_3419 n=14 Tax=Rhodobacterales RepID=Y3419_ROSDO Length = 184 Score = 147 bits (372), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 76/190 (40%), Positives = 113/190 (59%), Gaps = 9/190 (4%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL+ L+AMP++ DP F+ +V+ IC H+ GAMG+I+NKP ++I +L++L I Sbjct: 1 MNLEGKLLVAMPSMGDPRFQNAVILICAHSAKGAMGLIINKPTPEIRISDVLDQLDILSS 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIR--ISDNTV-MTTSRDVLETLGT 117 + + V GGP+ RGF+LH+ +++ASS+ I D MT + D+LE + Sbjct: 61 QKTREMV----VHFGGPVETGRGFVLHS--TDYASSLNTLIVDGAFGMTATLDILEEIAD 114 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 + P+ L+ LGYA W GQLE EI N WLT A +++F P A +W EA +GVD Sbjct: 115 GRGPAQALMMLGYAGWGGGQLENEIAQNGWLTTNATSDLVFDLPAARKWSEALHSLGVDP 174 Query: 178 LTMPGVAGHA 187 + + AGHA Sbjct: 175 INLSPAAGHA 184 >UniRef50_A8ZZX8 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZX8_DESOH Length = 184 Score = 147 bits (370), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 76/188 (40%), Positives = 111/188 (59%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M L+ FLIAMP L DP FR++VV ICEH+ +GA+G+IVN+ L + I E+LK+ Sbjct: 1 MELRGEFLIAMPMLTDPNFRQTVVCICEHSADGALGLIVNRIYPALTAKDIFEELKMKYV 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 P + PV GGP+ F+LH PP + I + +T ++D+L + + Sbjct: 61 PETGPL----PVYNGGPVHTGDLFVLHEPPFGWEGCRPIRPDLALTNTKDLLAAIAEGQG 116 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV-DILT 179 P L+ LGYA W QLE E+L+N+WLT P D ++F TP+A RW +A KL+ + D Sbjct: 117 PRRFLILLGYAGWGPDQLEAEVLENSWLTVPVDQRVIFDTPVARRWADAMKLMNIPDPAF 176 Query: 180 MPGVAGHA 187 + G++G A Sbjct: 177 LSGISGSA 184 >UniRef50_B9NUS5 Putative uncharacterized protein n=2 Tax=Rhodobacteraceae RepID=B9NUS5_9RHOB Length = 224 Score = 146 bits (369), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 81/190 (42%), Positives = 115/190 (60%), Gaps = 9/190 (4%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M+L LIAMP + DP F SVVY+C H +GAMG+IVNKP +L+I+ +LE+L I Sbjct: 41 MDLTGKLLIAMPGMGDPRFEHSVVYVCSHGDDGAMGLIVNKP-SDLRIKTLLEQLNI--- 96 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFAS---SIRISDNTVMTTSRDVLETLGT 117 P + ++ V GGP+ RGF+LH+ +++ + S++ISD MT + DVLE L + Sbjct: 97 PCRIPVVGERLVQFGGPVEMSRGFVLHS--ADYEANLHSMQISDEFSMTATLDVLEDLAS 154 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 K P + ++ALGY+ W QLE EI N WLT A ++F P ++W A +GVD Sbjct: 155 GKGPLNSMLALGYSGWGPDQLEDEIAMNGWLTTEASSKLIFDVPDDEKWGAALATLGVDP 214 Query: 178 LTMPGVAGHA 187 LT+ AG A Sbjct: 215 LTLSASAGRA 224 >UniRef50_Q4ZZ67 UPF0301 protein Psyr_0485 n=31 Tax=Proteobacteria RepID=Y485_PSEU2 Length = 190 Score = 146 bits (369), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 74/186 (39%), Positives = 112/186 (60%), Gaps = 5/186 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-PEP 61 L+HHFLIAMP + D F +++ YI EHN NGAMG+++N+P ++L + +LE+L+ P P Sbjct: 9 LKHHFLIAMPHMHDENFAQTLTYIVEHNANGAMGLVINRP-QSLTLADVLEQLRPELPAP 67 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 R D + GGP+ DRGF+LH F +++ + ++TS+DVL ++ P Sbjct: 68 RHCQ---DIVIHTGGPVQTDRGFVLHPSGQTFQATVNLPGGISLSTSQDVLFSIADGYGP 124 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++ LGYA W+ GQL+ E+ DNAWLT D ILF R AA+ +G+++ + Sbjct: 125 DQNVITLGYAGWDAGQLDAEMADNAWLTCSFDPAILFDVDSDQRLEAAARRLGINLNLIS 184 Query: 182 GVAGHA 187 AGHA Sbjct: 185 TQAGHA 190 >UniRef50_A4SVF0 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SVF0_POLSQ Length = 210 Score = 145 bits (366), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 80/195 (41%), Positives = 110/195 (56%), Gaps = 16/195 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLE---NLKIEGILEKLKIT 58 +L + FLIAMP + D F SV+Y+ EHN GAMG++VNKP E + I KL+I Sbjct: 23 HLANQFLIAMPGMVDANFAGSVIYLFEHNARGAMGLVVNKPTEVDLATLFDKIELKLEIA 82 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSN--FASSIRISDNTVMTTSRDVLETLG 116 P L++PV GGP+ +RGF+LH N ++SS+ I MTTS+DVLE + Sbjct: 83 P-------LLEQPVYFGGPVQIERGFVLHESNKNLSYSSSLIIPGGLTMTTSKDVLEAVA 135 Query: 117 TDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPAD----LNILFKTPIADRWREAAKL 172 P L+ LGYA W GQLE+EI N W+ P + I+F TP + R+ + Sbjct: 136 IGNGPRKFLMTLGYAGWSAGQLEEEITLNGWMNVPLSREQMMEIIFNTPPSQRYEKTMNH 195 Query: 173 IGVDILTMPGVAGHA 187 +G D+ + G AGHA Sbjct: 196 LGFDLSHLSGEAGHA 210 >UniRef50_A6VSP6 UPF0301 protein Mmwyl1_0539 n=2 Tax=Marinomonas RepID=Y539_MARMS Length = 188 Score = 144 bits (362), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 71/187 (37%), Positives = 111/187 (59%), Gaps = 6/187 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-PE 60 + ++HFLI+MP L DP F +V+Y+CEH GAMGII+N+P N+ + + L I Sbjct: 7 SFKNHFLISMPHLDDPHFEHTVIYLCEHTKAGAMGIIINRP-SNVDFTELADHLGIQIHS 65 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PR S +P+ GGP+ +RGFILHT +++++R++D ++ S + LE + Sbjct: 66 PRLSS----EPIYTGGPVEAERGFILHTTDKVWSNTLRVTDEVSLSASLEALEDIAQGNG 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P + LG A W+ GQLE EI +N WL ADL++LF TP ++ A +++G+D+ + Sbjct: 122 PDAFRITLGCAGWDAGQLEAEIANNDWLVCEADLDVLFHTPSDMQFTAATRVLGIDMTRL 181 Query: 181 PGVAGHA 187 GH Sbjct: 182 SPDIGHG 188 >UniRef50_Q31EK4 UPF0301 protein Tcr_1827 n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Y1827_THICR Length = 188 Score = 144 bits (362), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 71/185 (38%), Positives = 109/185 (58%), Gaps = 3/185 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+HHFLIAMP L + F ++V+YI E N +G MG+++N NL + +L+ ++T E Sbjct: 6 SLEHHFLIAMPNLTESWFDKTVIYIVEDNEHGTMGLVINLE-HNLTVPELLDHFELTVEA 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + D+PV++GGP+ + GFILH P + S+ + DN MT S D L+ + P Sbjct: 65 PEN--YADQPVLMGGPVDLEHGFILHEPQGTWQKSLPLRDNLAMTVSEDFLKAMADGTAP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++V LG++ WEKGQL EI N WLT P + +LF P +W+ A +G+ ++ Sbjct: 123 EKIVVCLGFSGWEKGQLNDEIQANNWLTIPYNEALLFDVPNDQKWQVALNTLGISPESLS 182 Query: 182 GVAGH 186 AGH Sbjct: 183 MDAGH 187 >UniRef50_Q0EWH5 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EWH5_9PROT Length = 191 Score = 139 bits (350), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 72/188 (38%), Positives = 109/188 (57%), Gaps = 3/188 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE- 60 L L+A P+LQDP FR +VV IC+H+ +G +G+I+N+P ++ + I + + I E Sbjct: 5 GLTGQILLATPSLQDPNFRDTVVLICQHDRDGCLGLIINRP-RDIILGEIFDDMGIRYET 63 Query: 61 -PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + R+ V GGP+ RGF+LH + S++++S +T SRD LE L + Sbjct: 64 GSAENHERIQPVVYEGGPMDGFRGFLLHDGWDVYDSTMQVSPELHLTASRDALEELARGQ 123 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P ++ LGYA W GQLEQE+ DN+WL APA I+F+ P RW AA+ +G++ Sbjct: 124 GPEHYMLLLGYAGWGAGQLEQELCDNSWLIAPASHQIIFQEPPEKRWDFAARCMGIERGQ 183 Query: 180 MPGVAGHA 187 + GHA Sbjct: 184 LSSQIGHA 191 >UniRef50_Q0C3C2 Putative uncharacterized protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C3C2_HYPNA Length = 188 Score = 139 bits (350), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 69/187 (36%), Positives = 112/187 (59%), Gaps = 5/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L LIAMP + DP F RSV+ +C H + AMGII+NKP++ + ++ I++++ I P Sbjct: 6 DLTGKLLIAMPGIGDPRFERSVILVCAHTPDFAMGIILNKPMDGIDLQEIIDQMDI---P 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTDKQ 120 +D + ++ GGP+A +RGF+LHT +++ + D MT +R++L ++ + Sbjct: 63 QDVDLE-GVAILEGGPVATERGFVLHTDDVICDGATMEVEDELCMTATREILASIASAAP 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++ALGYA W GQLEQE+ NAWL D +++F +WR A +GVD+ + Sbjct: 122 PRKFVMALGYAGWGAGQLEQELAQNAWLIGAPDSDLVFGDAYEHKWRHAMTRMGVDLSRL 181 Query: 181 PGVAGHA 187 AG+A Sbjct: 182 QSNAGNA 188 >UniRef50_B4REX9 Transcriptional regulator n=4 Tax=Caulobacteraceae RepID=B4REX9_PHEZH Length = 187 Score = 139 bits (350), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 72/187 (38%), Positives = 108/187 (57%), Gaps = 7/187 (3%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIAMP + DP F R+++ +C H+ + AMG+ +N P+E L + +LE+L+I Sbjct: 6 LSGQLLIAMPGISDPRFERTLILVCAHDAHHAMGLALNHPVEGLTVPDLLERLEIK---- 61 Query: 63 DESIRLDKP-VMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT-DKQ 120 +IRL V++GGP+ +RGF+LHT S+ + +T +R+VLE +G+ D + Sbjct: 62 -STIRLPPDLVLVGGPVERERGFVLHTDDYQGEFSLPVGGGVALTATREVLEAMGSSDGR 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P L+ALGYA W GQLE EI +N WLT AD ++F +W A +G+D + Sbjct: 121 PRRSLLALGYAGWGAGQLEHEIRENVWLTCEADEALIFDADYDTKWARAVAKLGIDPTFL 180 Query: 181 PGVAGHA 187 AG A Sbjct: 181 TAEAGRA 187 >UniRef50_Q3SNY6 UPF0301 protein Nwi_2752 n=121 Tax=Alphaproteobacteria RepID=Y2752_NITWN Length = 221 Score = 139 bits (349), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 76/193 (39%), Positives = 112/193 (58%), Gaps = 12/193 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIAMP ++D F RSV+Y+C H++ GAMGII+N+P ++ +L +L I R Sbjct: 33 LDGQLLIAMPVMEDERFARSVIYVCAHSSEGAMGIILNRPAGSVDFSDLLVQLDIIK--R 90 Query: 63 DESIRLDK-----PVMLGGPLAEDRGFILHTPPSNF---ASSIRISDNTVMTTSRDVLET 114 + I+L + VM GGP+ RGF+LH+ S+F +++ I + +T + D+LE Sbjct: 91 ADLIKLPETAETMKVMKGGPVETGRGFVLHS--SDFFIEDATLPIDEGICLTATLDILEA 148 Query: 115 LGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + P ++ALGYA W GQLE EI DN WL PAD +++F I D++ A IG Sbjct: 149 IAKGAGPKHAILALGYAGWAPGQLETEIQDNGWLHCPADQDLIFGRDIEDKYVRALHKIG 208 Query: 175 VDILTMPGVAGHA 187 +D + AGHA Sbjct: 209 IDPGMLSNEAGHA 221 >UniRef50_C5SQD7 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SQD7_9CAUL Length = 216 Score = 136 bits (342), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 71/184 (38%), Positives = 105/184 (57%), Gaps = 13/184 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ L+AMP+L DP F SV+Y+C+H+ AMGI++N+P+ L ++E+L I Sbjct: 24 SLQGRLLVAMPSLDDPNFDHSVIYMCQHDPESAMGIVLNQPIGGLTFPRMMEELGID--- 80 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHT----------PPSNFASSIRISDNTVMTTSRDV 111 ++ + P+ GGP+ +RGF+LH+ P ++ + D +T SRD+ Sbjct: 81 ITDNRHVATPIYNGGPVQNERGFVLHSLDYFIDEVTLPLDIDPEALELRDGIGLTVSRDI 140 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 L L PS VL+ALGYA W GQLE EI DNAWL AP ++LF + W + K Sbjct: 141 LVDLARGAGPSRVLIALGYAGWGPGQLEAEIRDNAWLVAPCQADLLFSHDASALWSKTLK 200 Query: 172 LIGV 175 L+G+ Sbjct: 201 LLGI 204 >UniRef50_C9CSD6 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CSD6_9RHOB Length = 219 Score = 133 bits (334), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 73/188 (38%), Positives = 102/188 (54%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M L LIAMP + DP F SVV++C H GAMG+I+NK + ++ ++++L+I E Sbjct: 36 MELTGKLLIAMPGIGDPRFDNSVVFLCSHGDEGAMGLIINKLAPGVALQTLMDQLEIDIE 95 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDK 119 P S PV GGP+ RGF+LH+ + +S+ + MT + DVLE + + Sbjct: 96 PAIAS----APVYFGGPVETQRGFVLHSDEYISTVNSLPVKPGFSMTATLDVLEDIAEGR 151 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P LV LGYA W GQLE EI N WLT A+ ++F +W A +GV L Sbjct: 152 GPERYLVMLGYAGWGPGQLEDEIAQNGWLTTDAEPEMIFTDTADTKWEAALASLGVTPLN 211 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 212 LSMDAGHA 219 >UniRef50_Q5FQY8 UPF0301 protein GOX1459 n=11 Tax=Acetobacteraceae RepID=Y1459_GLUOX Length = 187 Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 70/189 (37%), Positives = 106/189 (56%), Gaps = 6/189 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITP 59 + L L+A PAL + F R+V+Y+C H+ +GAMG+IVN+ L ++ + +L I P Sbjct: 3 LGLTGKLLVAAPALAETFFERTVIYLCAHSEQDGAMGLIVNRRLSQPGLDDLFAQLGIEP 62 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P + I V +GGP+ RGF+LH+ S+ + +T +T S D+L + Sbjct: 63 SPPERRIG----VCMGGPVEHARGFVLHSADWAGEGSLDVDGHTTLTASLDILREIAAGH 118 Query: 120 QPSDVLVALGYASWEKGQLEQEIL-DNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P ++ALG+A+W GQLE+EIL D++W APA I+F T A +WR+A I D L Sbjct: 119 GPRQAVMALGHAAWAPGQLEEEILRDSSWFIAPATDEIVFGTDHAKKWRQALVAIDFDPL 178 Query: 179 TMPGVAGHA 187 + G A Sbjct: 179 LLSSSVGEA 187 >UniRef50_Q6AL28 UPF0301 protein DP2218 n=1 Tax=Desulfotalea psychrophila RepID=Y2218_DESPS Length = 190 Score = 130 bits (327), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 64/187 (34%), Positives = 105/187 (56%), Gaps = 7/187 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L +FL++ + D F VVY+C HN+NGA+G+++NKP NL +L ++ + Sbjct: 10 SLAGYFLVSTLQMPDSRFAGQVVYVCSHNSNGALGLVINKPDCNLSFAQVLREMGM---- 65 Query: 62 RDESIRLDKP-VMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 E R + P V +GGP++ D F+L+ + I I+DN ++ +++LE + + Sbjct: 66 --EVSRAELPSVYIGGPVSLDAAFVLYRSHPYEGNHIDITDNISLSREKELLELVVGENS 123 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + L +GY WE GQLE E+ DN+WL P D ++F P ++W+ AA G+DI T Sbjct: 124 SRNYLFLVGYVGWESGQLELELRDNSWLVVPGDEQVIFDLPDGEKWKAAAAYYGIDITTF 183 Query: 181 PGVAGHA 187 G+A Sbjct: 184 NENLGYA 190 >UniRef50_A7C130 Protein containing DUF179 n=1 Tax=Beggiatoa sp. PS RepID=A7C130_9GAMM Length = 158 Score = 130 bits (326), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 62/165 (37%), Positives = 103/165 (62%), Gaps = 8/165 (4%) Query: 24 VYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE-SIRLDKPVMLGGPLAEDR 82 +++C AMGI++N+PL+ + + +LE + I E D+ + R+ P+ GGP+ +R Sbjct: 1 MFVCTMKR--AMGIVINRPLD-VDLGDVLEHMNI--EANDQRATRM--PIFDGGPVQRER 53 Query: 83 GFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEI 142 GF++H P + + + I++N + TSRD++ + + PS+ L+ALGYA W GQLEQE+ Sbjct: 54 GFVIHQPVGQWDAMLSINNNLGIATSRDIISAIANGQGPSNALIALGYAGWTAGQLEQEM 113 Query: 143 LDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 DNAWL+ PAD +++F+T RW AA +G+D+ + GH Sbjct: 114 ADNAWLSTPADYSVIFQTTPEQRWHAAAASMGIDLTLLSSQVGHG 158 >UniRef50_Q2GAJ3 UPF0301 protein Saro_0683 n=4 Tax=Sphingomonadaceae RepID=Y683_NOVAD Length = 186 Score = 129 bits (324), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 5/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+AMP + DP F +V+ +C H+ +GA+GI V E + + G+LE + I P Sbjct: 7 LGGRLLLAMPGMGDPRFDHAVIAMCVHDEHGALGIGVGHVREGITLHGLLEDVGIDP--- 63 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + D PV+ GGP+ RGF+LH+ S+ ++ ++ S D+L + + PS Sbjct: 64 --GLAPDMPVLNGGPVETARGFVLHSDDWGGEGSVTVNGLCCLSASLDILRAIAEGRGPS 121 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ALGYA W GQLE E+ + W A ILF+TP RW +A K G+D + G Sbjct: 122 RFVIALGYAGWGGGQLEGEMRRHGWYAAQGRPEILFETPTGRRWTQAWKREGIDPAHLVG 181 Query: 183 VAGHA 187 G A Sbjct: 182 QTGSA 186 >UniRef50_C0QFZ0 UPF0301 protein HRM2_24640 n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=Y2464_DESAH Length = 189 Score = 129 bits (324), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 67/173 (38%), Positives = 99/173 (57%), Gaps = 4/173 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFL+A+P L DP F ++V ICEHN GA+G I+N+ L + + E LKIT Sbjct: 9 LKGHFLMAIPGLPDPNFAQTVTCICEHNKTGALGFIINRIHPLLTGQELFEDLKITCNQA 68 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + I + LGGP+ F+LH PP ++ ++I+D ++ +RD+LE + + P Sbjct: 69 IDKIA----IHLGGPVQPSGVFVLHGPPFDWHGCLKINDWLGLSNTRDILEAVARQEGPE 124 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 + +V LG A W QL+ EI DNAWLT P ILFKT + +W +G+ Sbjct: 125 NFIVLLGCAGWGPLQLDNEINDNAWLTIPVSQEILFKTDVKLKWEMTMMQMGI 177 >UniRef50_A9DAK3 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DAK3_9RHIZ Length = 209 Score = 128 bits (322), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 66/195 (33%), Positives = 109/195 (55%), Gaps = 11/195 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L HFL+AMP++ D F R+V+++C H+ +GAMG I+N+P + L E ++E L + + R Sbjct: 16 LDGHFLLAMPSMSDERFERAVIFVCAHSEDGAMGFILNQP-QPLSFEELVENLDLDSQER 74 Query: 63 DESIRLDK----------PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVL 112 ++ + K P+ GGP+ RGF+LH+ S++ ++D+ +T + D+L Sbjct: 75 RDADKSRKIGMSECARNFPIQFGGPVDPGRGFVLHSDDYMTESTMPVNDDLCLTATIDIL 134 Query: 113 ETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + P ++ LGYA W GQLEQE+ NAWL+ PA +I+F + ++ Sbjct: 135 RAIKDGCGPVRGMMLLGYAGWGPGQLEQEMAANAWLSCPASDDIVFDRDHSAKYDRVLSH 194 Query: 173 IGVDILTMPGVAGHA 187 +GV + AGHA Sbjct: 195 MGVSPAMLSMEAGHA 209 >UniRef50_Q1NQW6 Putative uncharacterized protein n=2 Tax=Deltaproteobacteria RepID=Q1NQW6_9DELT Length = 201 Score = 128 bits (321), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 65/186 (34%), Positives = 102/186 (54%), Gaps = 3/186 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ +FLIA P + DP F+ +V+ +C HN GAMG+++N+P+ ++++E I I P Sbjct: 19 SLQGYFLIATPQMSDPRFQETVILLCAHNEEGAMGLVINQPIRDVELEDIFHNAGIPLPP 78 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + V LGGP+ FI+++ + + ++ + ++ +L L + P Sbjct: 79 GAGPL---GSVYLGGPVETGNVFIVYSAEYEVVNHLAVTPSISLSRDPQLLYDLAAGRGP 135 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LV+LGYA W GQLE E+ + WL PA I+F TP +WR AA++ GVDI Sbjct: 136 RHYLVSLGYAGWGAGQLEAELSVDGWLALPAKDEIIFNTPNQHKWRRAAQIHGVDIGLFG 195 Query: 182 GVAGHA 187 V G A Sbjct: 196 AVVGSA 201 >UniRef50_A0L5K4 UPF0301 protein Mmc1_0726 n=1 Tax=Magnetococcus sp. MC-1 RepID=Y726_MAGSM Length = 186 Score = 128 bits (321), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 65/177 (36%), Positives = 102/177 (57%), Gaps = 10/177 (5%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENL--KIEGILEKLKITP 59 L FLIA+P+L DP F R+V+Y+C HN +GA+G+++N+PL+ ++ G LE P Sbjct: 5 TLAGKFLIAVPSLADPFFERTVLYLCAHNEDGALGLVINQPLDTTMSQMAGYLELDWQRP 64 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 +D+ V +GGP++ ++GF+L + + + D+ M T+ D++ +G Sbjct: 65 G-------VDR-VYMGGPVSPEQGFVLFEQALDLPGIMMLPDDLYMGTNPDIIRLMGRAG 116 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 L ALGYA WE GQLE E+ +N+WL A +ILF A RW A + +G+D Sbjct: 117 AQERFLFALGYAGWEAGQLEHELQENSWLVCDAQRSILFDMGYAQRWEAAIRSMGID 173 >UniRef50_A6WWH2 Putative uncharacterized protein n=2 Tax=Ochrobactrum RepID=A6WWH2_OCHA4 Length = 214 Score = 127 bits (319), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 71/190 (37%), Positives = 110/190 (57%), Gaps = 8/190 (4%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FL+AMP + D F RSVVYIC H+ GAMG I+N+ L+ ++ +L ++ + E Sbjct: 28 LNGQFLLAMPGMSDERFARSVVYICAHSDEGAMGFIINQ-LQPVEFPDLLRQIGVIDE-- 84 Query: 63 DESIRL-DKP----VMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT 117 DE I L D+ V GGP+ RGF+LH+ S++ +S+ +T + D+L + Sbjct: 85 DELIILPDRAQHMMVRNGGPVDRTRGFVLHSDDYMVDSTMPVSEEVCLTATVDILRAIYG 144 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 + PS L+ALGY+ W GQ+E E+ +N WLT A L++LF + I ++ +G+D+ Sbjct: 145 GRGPSRALMALGYSGWAPGQIEVELAENGWLTCDAPLDMLFDSDIEGKYSRLMLHMGIDM 204 Query: 178 LTMPGVAGHA 187 + AGHA Sbjct: 205 SRLVSDAGHA 214 >UniRef50_Q60BQ2 UPF0301 protein MCA0413 1 n=1 Tax=Methylococcus capsulatus RepID=Y413_METCA Length = 182 Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 61/170 (35%), Positives = 90/170 (52%), Gaps = 5/170 (2%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDES 65 FL+A P + IF SV+Y+ HN +GAMG+IVN+ + +LE + + + E Sbjct: 16 QFLVAHPKMPANIFAHSVIYVVSHNADGAMGLIVNRLAGAGPLGKLLEAFGLASKAQRE- 74 Query: 66 IRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVL 125 I+L LGGP+ +GF+LH+ AS+ + ++T DVLE + + P V Sbjct: 75 IKL----YLGGPVGIGQGFVLHSDDYAGASTRALKKGLSLSTGLDVLEAIARGRGPRQVR 130 Query: 126 VALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 + GYA W GQL+ EI WL APAD +++F W EA K G+ Sbjct: 131 MLFGYAGWSPGQLDGEIARGDWLLAPADTSLIFSEEPDKVWEEALKHAGL 180 >UniRef50_Q5NQN1 UPF0301 protein ZMO0349 n=3 Tax=Zymomonas mobilis RepID=Y349_ZYMMO Length = 188 Score = 110 bits (274), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 54/176 (30%), Positives = 92/176 (52%), Gaps = 5/176 (2%) Query: 12 PALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKP 71 P ++D F+++V+ +C N GA+G+ + + + ++ + ++ +L I P + D+P Sbjct: 18 PNMRDVEFQKAVIALCAFNEKGALGLNIGRIIPDVTLHSLMHQLGIQP-----GLVPDRP 72 Query: 72 VMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYA 131 V GGP RG +LH+ + S+ + + +T + DVL L + P LVALGYA Sbjct: 73 VHDGGPCEPQRGMVLHSRDWHSPDSMMVGQDWALTCTLDVLHALSRGEGPQHWLVALGYA 132 Query: 132 SWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 W GQL+QE+ W + D +LF P +RW++ + GVD + G A Sbjct: 133 GWGAGQLDQEMKQADWFLSKVDDQLLFSCPAENRWQQGYQQAGVDFYRLATKIGQA 188 >UniRef50_C0AXX7 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXX7_9ENTR Length = 93 Score = 105 bits (263), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 48/93 (51%), Positives = 64/93 (68%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL +HFLIAMP+L DP+F RSVVY+CEHN NGAMG+I+NKP+E++ +EG+L++L+I Sbjct: 1 MNLLNHFLIAMPSLSDPLFERSVVYVCEHNENGAMGLIINKPIEDISVEGVLDQLEIFST 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF 93 RDE+I L K + P F L F Sbjct: 61 DRDEAISLQKTCDVRRPSCRRAWFYLTYSSVRF 93 >UniRef50_C8CIK8 Putative uncharacterized protein n=1 Tax=uncultured bacterium B7P37metaSE RepID=C8CIK8_9BACT Length = 200 Score = 104 bits (260), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 56/163 (34%), Positives = 84/163 (51%), Gaps = 7/163 (4%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIA P + DP F R+V+ + HN++GAM I++N+PL + IL+ P Sbjct: 30 LTGQLLIAAPGMTDPRFDRTVLVMVRHNSDGAMAIVINRPLGERSMARILQAFG-EKAPD 88 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 D + PV LGGP+ + +LH+ ++ I + +T S ++ + + P Sbjct: 89 DSAT---VPVYLGGPVQLEMSTVLHSAEYRRNGTLDIDGHVAVTASMEIYRDIAANTGPE 145 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR 165 LV GYA W GQLE E+ N W TAP D+ ++F ADR Sbjct: 146 KSLVVFGYAGWAPGQLEGEMAQNVWFTAPLDVKLVFD---ADR 185 >UniRef50_Q0BLI0 UPF0301 protein FTH_1193 n=18 Tax=Francisella RepID=Y1193_FRATO Length = 194 Score = 101 bits (251), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 59/182 (32%), Positives = 101/182 (55%), Gaps = 5/182 (2%) Query: 2 NLQHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 N + L+A P ++D I F +SVVY+C+++ +GAMG+I+NKPL + ++ + E+L I P Sbjct: 4 NHKSEILLATPLIKDDIVFTKSVVYLCQNDRHGAMGLIINKPLAD-TLKDVFEELHI-PH 61 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDK 119 L+ P+ +GGP++ + ILHT N+ S+I++ + +T S D+LE + + Sbjct: 62 TNTFKEILEYPLYMGGPISPHKIMILHTTNGRNYTSTIKLDEGLAITASIDILEDIANNI 121 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWL-TAPADLNILFKTPIADRWREAAKLIGVDIL 178 P L +GY+ W QL EI N W+ T + ILF +W+ + G + Sbjct: 122 LPEYFLPVVGYSCWTANQLTDEIKSNDWIVTNKLNKKILFNHENKVKWQNHLEHAGYTLQ 181 Query: 179 TM 180 ++ Sbjct: 182 SL 183 >UniRef50_A9GTQ2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GTQ2_SORC5 Length = 198 Score = 100 bits (250), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 70/196 (35%), Positives = 93/196 (47%), Gaps = 29/196 (14%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPL-----ENLKIEGILEKLKI 57 L FLIA P L DP F R+VV + H+ GA+G +VN+P E L G LK Sbjct: 8 LAPGFLIASPPLGDPNFDRTVVLLAVHSEGGALGFVVNRPAPMTLGELLSFAGYGNDLK- 66 Query: 58 TPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASS---IRISDNTVMTTSRDVLET 114 +P PV LGGP+ G+IL P+ A I + +T+SR +T Sbjct: 67 --DP--------APVYLGGPVQPSSGWILCLDPALGAEETGVIPVGSRVRVTSSRSAFDT 116 Query: 115 LGTDK-------QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWR 167 L D P V LGY+ W GQLE+EI AWL D ILF A RW Sbjct: 117 LAADAVRGTAAADPRRRTVLLGYSGWGPGQLEREIAAGAWLPVSLDERILFDVEAAQRWE 176 Query: 168 EAAKLIG---VDILTM 180 +A L+G +++++M Sbjct: 177 QAYALLGLRPIEVMSM 192 >UniRef50_Q2BRE1 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BRE1_9GAMM Length = 151 Score = 99.8 bits (247), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 52/155 (33%), Positives = 88/155 (56%), Gaps = 6/155 (3%) Query: 35 MGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILH--TPPSN 92 MG++VN+P+ + + + + L + P + + GGP+ +RG++LH + P Sbjct: 1 MGLVVNRPV-GITLSDLCDHLNL---PCISNENQQDEIFSGGPVKPERGYVLHRSSDPFE 56 Query: 93 FASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPA 152 + SS +++ ++TS D +E + + L+ALG A W GQLEQEI DN WL+ PA Sbjct: 57 WPSSHCVAEEIFLSTSVDAIEAAAEGRFKHEYLIALGCAGWSPGQLEQEISDNVWLSCPA 116 Query: 153 DLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 + +ILF P DR + AA ++G+++ + GHA Sbjct: 117 NSDILFGIPAGDRLQAAASILGINLDLLTAHPGHA 151 >UniRef50_Q0FVR8 Putative uncharacterized protein (Fragment) n=1 Tax=Roseovarius sp. HTCC2601 RepID=Q0FVR8_9RHOB Length = 158 Score = 97.8 bits (242), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 52/133 (39%), Positives = 77/133 (57%), Gaps = 10/133 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L LIAMP + DP F SV+Y+C H+ GAMG+IVNKP ++ + +LE+L ITP P Sbjct: 9 DLTGKILIAMPGMGDPRFEHSVIYLCAHSEEGAMGLIVNKPSADVSMAALLEQLSITPSP 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + V GGP+ RGF+LH+P + ++++++D MT + DVLET+ Sbjct: 69 GLGP----RQVHFGGPVEMGRGFVLHSPDYMSGLTTLQVNDGFSMTGTLDVLETIARGDG 124 Query: 121 PSDVLVALGYASW 133 P A G+ W Sbjct: 125 P-----ATGWRCW 132 >UniRef50_A6G4Z9 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4Z9_9DELT Length = 210 Score = 95.9 bits (237), Expect = 6e-19, Method: Compositional matrix adjust. Identities = 65/199 (32%), Positives = 96/199 (48%), Gaps = 23/199 (11%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI--TP 59 L H L A+P L DP F+RSVV + EH+ GA+G+++N+ + N + + E L + Sbjct: 15 GLACHLLCAVPQLLDPNFKRSVVLMLEHDERGALGLVINRTM-NTSLSEVAEALDLEWCG 73 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT-- 117 +P D V +GGP+ RG+ LH + + + D +TTS + + G+ Sbjct: 74 DP-------DAQVRIGGPVEPVRGWFLHDQGAWDPDASSLVDGLWVTTSLEGVGAAGSVR 126 Query: 118 -DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAP----------ADLNILFKTPIADRW 166 + S+ L LGYA W GQLE EI +W+ P D LF TP W Sbjct: 127 FGSEESNFLFLLGYAGWSGGQLEGEIAAGSWVLVPLVDDDDPRVGVDPTFLFDTPPEHMW 186 Query: 167 REAAKLIGVDILTMPGVAG 185 A + IGVD + G+ G Sbjct: 187 SLALQSIGVDPQRLVGLQG 205 >UniRef50_D0LIU3 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LIU3_HALO1 Length = 198 Score = 94.7 bits (234), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 57/185 (30%), Positives = 96/185 (51%), Gaps = 22/185 (11%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L+AMP L DP FRRSVV + EH+ G+ G++VN+P E L ++ + E L + + E++ Sbjct: 10 LLLAMPHLLDPNFRRSVVLMVEHDDEGSFGLVVNQPTE-LSMDELYESLDLAWKGSSEAM 68 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASS------IRISDNTVMTTSRDV--------- 111 V GGP+ +++H P + + S + + D + ++ Sbjct: 69 -----VWRGGPVMPTHLWLVHAPLAGSSDSGTESALLGLGDGGTVAVGPELRVSGAMPEL 123 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 +E G ++ P+ + V LGYA W GQL QE+ AWL A ++F+TP + W A + Sbjct: 124 IEMFG-NEPPAQLRVLLGYAGWGGGQLAQEMSQGAWLHVDATPELIFETPAEEMWERAVR 182 Query: 172 LIGVD 176 +G++ Sbjct: 183 TLGIN 187 >UniRef50_D2QR79 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=D2QR79_9SPHI Length = 186 Score = 93.6 bits (231), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 53/170 (31%), Positives = 88/170 (51%), Gaps = 16/170 (9%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LIA P + D F RSVV +CEHN G G+++N+ + +++ ++E I Sbjct: 14 LIAEPFMGDNNFERSVVLVCEHNAVGTFGLVLNQQTD-IQLGDVIE-----------DIH 61 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLE---TLGTDKQPSDV 124 D P+ +GGP+ ++ +H P +SI + D + D ++ LGT + D+ Sbjct: 62 TDLPLFVGGPVQQNTLHFIHRRPDLIDNSICVVDGLYWSGDFDQIKRGVNLGTLTE-RDI 120 Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 +GY+ W +GQL+ E+L AW+ + + LF+TP + WRE K G Sbjct: 121 RFFIGYSGWNEGQLDSELLQKAWIISRTKADFLFETPTTEFWREVLKRKG 170 >UniRef50_B0BVW3 UPF0301 protein RrIowa_0061 n=15 Tax=Rickettsia RepID=Y061_RICRO Length = 189 Score = 93.2 bits (230), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 49/187 (26%), Positives = 95/187 (50%), Gaps = 6/187 (3%) Query: 2 NLQHHFLIAMP-ALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 NL L+A P + I+ +S++Y+ H GA+G+I N+ + ++ ++ KI + Sbjct: 8 NLSGKTLVATPHVITKGIYHKSLIYMLSHTEEGAIGLIFNRLVNHIDLKSFF---KIKND 64 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + P+ LGGP+ ++GF LH+ N + ++ ++++ ++ E + K Sbjct: 65 EITTPVMV--PIYLGGPVEHEKGFFLHSSDYNKNLLLDFHNDLAVSSNLEISEDIAFGKG 122 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P + L +GY +W+ GQLE+E+ N WL + +F +W A K +G+D + Sbjct: 123 PKNSLFIVGYTAWKPGQLEEELETNLWLVMDCNKEFIFADNPESKWHNALKHLGIDEIHF 182 Query: 181 PGVAGHA 187 G+A Sbjct: 183 SSQIGNA 189 >UniRef50_A7H7H6 UPF0301 protein Anae109_0457 n=4 Tax=Anaeromyxobacter RepID=Y457_ANADF Length = 199 Score = 92.4 bits (228), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 58/184 (31%), Positives = 96/184 (52%), Gaps = 14/184 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIE---GILEKLKIT 58 L FL+A PAL DP F S+V + EH+ GA+G +VN+P E + E L+ Sbjct: 8 GLAPGFLVAAPALGDPNFAGSLVLMAEHHGEGALGFVVNRPGPVTVAEVLASVDEDLRRA 67 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTP---PSNFASSIRISDNTVMTTSRDVLETL 115 E R PV++GGP+ +R +IL P ++ ++ + + + SR++LE L Sbjct: 68 AEANG---RAGAPVLVGGPVQPERLWILFRPGGIGADAEGAVPVGNGLSLGGSRELLEAL 124 Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADL---NILFKTPIADRWREAAKL 172 + L+ LGYA W Q+E+E+ AW+ P +L +++F P+ RW A + Sbjct: 125 VRAPRGDPFLLLLGYAGWAPMQVEREVAAGAWV--PLELEGSDLVFDVPLEQRWETAVRR 182 Query: 173 IGVD 176 +G++ Sbjct: 183 LGLE 186 >UniRef50_Q2S591 UPF0301 protein SRU_0495 n=2 Tax=Rhodothermaceae RepID=Y495_SALRD Length = 188 Score = 91.7 bits (226), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 55/171 (32%), Positives = 85/171 (49%), Gaps = 15/171 (8%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPRDES 65 LI+ P +QDP FRRSVV +CEHN G G+I+N+ L ++ + +L DE Sbjct: 14 LLISAPMMQDPNFRRSVVLLCEHNDREGTFGLILNREL-DVSLGDVL----------DEY 62 Query: 66 IRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL--GTDKQPSD 123 + D P+ +GGP+ + LHT + + + + + ++ L G D P + Sbjct: 63 VTYDPPLYMGGPVQRETLHYLHTR-EDIPGGVALPGDMTWGGDFEAVQQLAKGGDAAPDN 121 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + LGYA W GQLE E+ + AW+ AP +F T WR + +G Sbjct: 122 LRFFLGYAGWGPGQLEGELGEEAWIPAPGAAEFVFDTDPDQLWRAILRRMG 172 >UniRef50_C1D0N0 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=C1D0N0_DEIDV Length = 185 Score = 91.7 bits (226), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 58/180 (32%), Positives = 90/180 (50%), Gaps = 15/180 (8%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 FL+A P LQ +F +V+ + EH+ GAMG+IVN P + E ++ Sbjct: 17 FLVASPHLQGEVFEGTVILLLEHDRKGAMGLIVNAPTP-----------QTVAELMADAA 65 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLV 126 ++ LGGP+ G+ L+ P I++ D+ +++S +VL + Q + ++ Sbjct: 66 GQNRRAWLGGPVDPTLGWCLYHHPVGLDGEIKLVDDLHLSSSLEVLRAVMASDQ--EYML 123 Query: 127 ALGYASWEKGQLEQEILDNAWLTAPADL-NILFKTPIADRWREAAKLIGVDILT-MPGVA 184 LGYA W GQLE+E AW+ +L++ P RW EA K +GV T MPG A Sbjct: 124 ILGYAGWTAGQLEEEARAGAWVWVEQSTPELLWEVPAPQRWAEALKRLGVTPGTLMPGGA 183 >UniRef50_Q3KMF1 UPF0301 protein CTA_0231 n=9 Tax=Chlamydia RepID=Y231_CHLTA Length = 189 Score = 91.3 bits (225), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 53/167 (31%), Positives = 79/167 (47%), Gaps = 8/167 (4%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 L+A P + IF RSVV +CEH+ NG+ G+I+NK LE E I P D Sbjct: 15 LVASPDVNGGIFSRSVVLVCEHSLNGSFGLILNKILEIDLPEEIF--------PLDHFDE 66 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVA 127 +GGPL ++ +LHT P + SSI I + + + +L+ Sbjct: 67 SKVRFCMGGPLQANQIMLLHTSPDSANSSIEICPSVFLGGDFSFAGEKEGRTRDDKMLLC 126 Query: 128 LGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 GY+ W+ GQLE+E L+ W AP+ I+F W + + +G Sbjct: 127 FGYSGWQGGQLEKEFLEGLWFLAPSSQEIIFTDAPERMWSDVLQHLG 173 >UniRef50_B3QT15 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QT15_CHLT3 Length = 187 Score = 90.1 bits (222), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 56/173 (32%), Positives = 85/173 (49%), Gaps = 15/173 (8%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 LIA L DP F+RSVV +CEHN G G+I+NKPL+ + I +E ++ Sbjct: 13 LLIAGAQLIDPNFKRSVVLLCEHNEEGTFGLILNKPLD-INISEAIEDIE---------- 61 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ--PSDV 124 D + GGP+ + +LH +I + D + + + ++ + P D Sbjct: 62 DWDIALHAGGPVQPNTVHVLHRLGDEIEDAIEVVDGVYWGGNYETIRSMINTRHASPDDF 121 Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR-WREAAKLIGVD 176 LGY+ W GQL+QEI ++W A A N++F P+ DR W A + G D Sbjct: 122 RFFLGYSGWGPGQLQQEIDQDSWYQAKATANVVF-NPVYDRMWARALRAKGGD 173 >UniRef50_A6LBX4 UPF0301 protein BDI_1431 n=6 Tax=Bacteroidales RepID=Y1431_PARD8 Length = 198 Score = 88.6 bits (218), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 54/180 (30%), Positives = 89/180 (49%), Gaps = 13/180 (7%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 +Q LIA P LQD F+RSVV + EH +G+MG ++NK + L + ++ PE Sbjct: 18 IQGSILIAEPFLQDAYFQRSVVLLIEHTEHGSMGFVLNKKTD-LIVNSFFKEFAEFPEI- 75 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDKQP 121 P+ LGGP++ +R F +H+ N +++I+D + L+ + P Sbjct: 76 --------PIYLGGPVSPNRLFFIHSLGDNIIPDALKINDYLYFDGDFNALKRYILNGHP 127 Query: 122 SD--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 D V LGY+ W +GQL EI N+W + + + W+++ +L+G D T Sbjct: 128 IDGKVKFFLGYSGWTEGQLNHEIKRNSWAVSHITTDNILSADGEGYWKDSVELLGNDYKT 187 >UniRef50_A3VRH6 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VRH6_9PROT Length = 207 Score = 87.0 bits (214), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 56/176 (31%), Positives = 83/176 (47%), Gaps = 12/176 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP-E 60 L +++MP L D F +SV+YIC H+ A G+I+NKP IEG++ + E Sbjct: 22 GLAGRLIVSMPQLNDGPFAQSVIYICTHDIEHAFGLILNKP-----IEGVVATEAVADME 76 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 +D +D P+ GGP RG ILH+ S I ++T+ + L LGT Sbjct: 77 EKD----IDLPLFFGGPCEPRRGIILHSDQFVLEDSETIGAGLAISTTNEALAALGTPLL 132 Query: 121 PS-DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 P+ + G+A W GQL+ E+ + WL + F P W A IG+ Sbjct: 133 PAQSARLFTGHAGWGPGQLDDELRRHTWLDLETSTDFAFSDP-ETMWDRAMAEIGI 187 >UniRef50_Q254Z3 UPF0301 protein CF0373 n=7 Tax=Chlamydiales RepID=Y373_CHLFF Length = 189 Score = 84.7 bits (208), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 8/167 (4%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 L+A P +F RSV+ +CEH+ NG+ G+I+NK L + I K+T + +IR Sbjct: 15 LLASPDTDQGVFARSVILLCEHSLNGSFGLILNKTLGLELADDIFSFDKVT----NNNIR 70 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVA 127 +GGPL ++ +LH+ ++ I + + L+ + + + Sbjct: 71 F----CMGGPLQANQMMLLHSCSEIPEQTLEICPSVYLGGDLSFLQEIAASDAGPMINLC 126 Query: 128 LGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 GY+ W+ GQLE+E LD W APA + +F + W + K +G Sbjct: 127 FGYSGWQAGQLEREFLDGNWFLAPASYDYVFMDNPENLWSKILKDLG 173 >UniRef50_D2R140 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R140_9PLAN Length = 183 Score = 82.0 bits (201), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 54/179 (30%), Positives = 88/179 (49%), Gaps = 14/179 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ HFL A P L DP F R+VV + +H+ GA+G+++ +P++ E + +++ Sbjct: 3 SLQGHFLAASPHLGDPNFFRTVVLMIKHDAQGALGLVLTRPMQETVAE-LWQRVTAETIA 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTT-SRDVLETLGTDKQ 120 S+ L PV GPL +H S A+ + D + S + + K+ Sbjct: 62 NTGSVHLGGPV--NGPLVA-----IHRMAS--AAEAEVFDGVYFSAHSEQISRIVHQTKK 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P L+ GY+ W GQLE E+ WL APA ++F + D W + IG+ +L+ Sbjct: 113 P--YLLFAGYSGWSGGQLEAELEQGGWLIAPATTELVFSS-TDDLWERVVQSIGLAVLS 168 >UniRef50_Q5LDK5 UPF0301 protein BF2109 n=20 Tax=Bacteroides RepID=Y2109_BACFN Length = 196 Score = 80.9 bits (198), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 53/169 (31%), Positives = 80/169 (47%), Gaps = 13/169 (7%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LI+ P L D F RSVV + +H G+MG+I+NKPL L + I+++ K Sbjct: 23 LISEPFLHDVTFGRSVVLLVDHTEEGSMGLIINKPLP-LMLNDIIKEFKYIE-------- 73 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP--SDVL 125 D P+ GGP+ D F LHT ++ I++ + D ++ P + Sbjct: 74 -DIPLHKGGPIGTDTLFYLHT-LHEIPGTLPINNGLYLNGDFDAIKKYILQGNPIKGKIR 131 Query: 126 VALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 LGY+ WE QL QEI +N W+ + + L I W+EA +G Sbjct: 132 FFLGYSGWECEQLIQEIKENTWIISKEENTYLMNEDIKGMWKEALGKLG 180 >UniRef50_Q11U74 UPF0301 protein CHU_1773 n=2 Tax=Flexibacteraceae RepID=Y1773_CYTH3 Length = 182 Score = 80.9 bits (198), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 54/173 (31%), Positives = 85/173 (49%), Gaps = 16/173 (9%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LI+ P L D F RSVV +CEHN +GA G ++NK L I +LE E + Sbjct: 8 LISEPYLGDSTFERSVVLLCEHNDSGAFGFMLNKS-TTLTINSVLE----------EQLT 56 Query: 68 LDKPVMLGGPLAEDR-GFILHTPPSNFASSIRISDNTVMTTSRDVLETL---GTDKQPSD 123 ++ + LGGP+A+D F+L + S+ I D+ + L+TL GT + + Sbjct: 57 FEQNLFLGGPVAQDSLFFLLRQDRAILKDSVHIKDDLYWGGDFEHLKTLIQEGT-LELDN 115 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 LGY+ W + QLE E+ ++W+ A + +F W+ + +G D Sbjct: 116 CRFFLGYSGWGEDQLEYELEKHSWIIADINSEDMFVKNPESMWQNVLRSMGGD 168 >UniRef50_A3ZQK2 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZQK2_9PLAN Length = 184 Score = 80.5 bits (197), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 53/178 (29%), Positives = 86/178 (48%), Gaps = 13/178 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ LIA P L DP F R+VV + +H+ GA+G+++ +P E L + E Sbjct: 3 SLQGQLLIASPHLPDPNFLRTVVLMVQHDEEGALGLVLTRPTE-------LTMAAMWREI 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL-GTDKQ 120 E I + V LGGP+ +G ++ I I ++ ++ +E L D + Sbjct: 56 AGEEIADENLVFLGGPV---QGPLMAIHSHAPCQEIEILPGVYFSSDKENIEKLVREDHE 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P + + GY+ W + QLE E+ WL PA+ +F T + W++ IG DI+ Sbjct: 113 PKRIFI--GYSGWGEQQLEAEMEAGGWLLLPAEAAHVFTTDVERLWKDVTGKIGADIM 168 >UniRef50_Q1DAS2 UPF0301 protein MXAN_2022 n=2 Tax=Cystobacterineae RepID=Y2022_MYXXD Length = 181 Score = 80.1 bits (196), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 57/177 (32%), Positives = 82/177 (46%), Gaps = 11/177 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNK--PLENLKIEGILEKLKITP 59 NL L+AMP L DP F RSVV + EH+ +G+MG+++N+ PL L +L Sbjct: 3 NLAPGLLLAMPQLGDPNFYRSVVLMLEHSESGSMGLVINRGAPL-------TLGELARGQ 55 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + R + V LGGP+ RGF+LH + ++ + D L L T+ Sbjct: 56 NLGIAAGRKEHSVYLGGPVEPQRGFVLHDDTEQREKH-SVLPGLFLSVTLDALGPLLTNP 114 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 P + LGYA W QLE EI +WL A + + W + +GVD Sbjct: 115 NPR-LRFCLGYAGWGPRQLESEIAAGSWLFTEATAEAVLGHEPSKLWDTTLRGMGVD 170 >UniRef50_A3HT39 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HT39_9SPHI Length = 189 Score = 79.3 bits (194), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 54/170 (31%), Positives = 84/170 (49%), Gaps = 14/170 (8%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LI+ P LQD F RSVV +CEHN G+ G+++NKP LK+ ++E L Sbjct: 15 LISEPFLQDENFVRSVVMLCEHNEEGSFGLVINKP-SILKLGELVESLDF---------- 63 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVL-ETLGTDK-QPSDVL 125 LD V +GGP+ ++ ++T SI+I + + L E L T P V Sbjct: 64 LDAEVFVGGPVEQNTLHYIYTGEKELERSIQIGTDLWWGGDYEQLVEKLKTGLINPDRVR 123 Query: 126 VALGYASWEKGQLEQEILDNAWLTAPADLN-ILFKTPIADRWREAAKLIG 174 +GY+ W QLE+E+ D W+ +++ F+ + WR+ K +G Sbjct: 124 FFIGYSGWGLDQLEEELEDKTWIVCRTEVDPKTFEYTPEELWRKLLKNMG 173 >UniRef50_Q3B561 UPF0301 protein Plut_0637 n=11 Tax=Chlorobiaceae RepID=Y637_PELLD Length = 189 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 51/169 (30%), Positives = 78/169 (46%), Gaps = 13/169 (7%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LIA L + F+R+V+ +CEHN G++G I+N+P+E E + DE Sbjct: 16 LIASANLLESNFKRTVLMMCEHNPQGSLGFILNRPMEFQVREAV--------AGFDE--- 64 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QPSDVL 125 +D+P+ +GGP+ + LH S +I R+ L L +PS++ Sbjct: 65 VDEPLHMGGPVQSNTVHFLHMRGDLIDGSEQILPGLYWGGDREELGYLLNTGVLKPSEIR 124 Query: 126 VALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 LGYA W GQLE E + +W TA A ++F W + G Sbjct: 125 FFLGYAGWSAGQLEAEFEEGSWYTADATPAMVFSGEYERMWSRTVRSKG 173 >UniRef50_C3Q021 UPF0301 protein n=8 Tax=Bacteroides RepID=C3Q021_9BACE Length = 196 Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 48/174 (27%), Positives = 83/174 (47%), Gaps = 13/174 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLE-NLKIEGILEKLKITPEPR 62 Q LI+ P + D F R+VV + EHN G+MGII+NK ++ + ++ +L+ Sbjct: 17 QGSILISSPFMNDYHFTRAVVLLIEHNDEGSMGIIMNKDFRYHILLNDLIPELEFAQRV- 75 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 PV GGP++ + F LHT + ++ + + + + ++ D +P Sbjct: 76 --------PVYKGGPMSRETIFFLHT-LKDLEGALPLGNGLYLNGDFNAVQQYILDGKPI 126 Query: 123 DVLVAL--GYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + ++ GYA W+ GQL +EI +N+WL A L D W + +G Sbjct: 127 EGVIRFFAGYAGWDHGQLAKEIKENSWLIGKAGKETLLNQHFRDLWHTSLNEMG 180 >UniRef50_B9XJW1 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XJW1_9BACT Length = 186 Score = 76.6 bits (187), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 52/173 (30%), Positives = 82/173 (47%), Gaps = 11/173 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ L+ L F+R+VV +C+H+ GA+G+++N+ N E +L L PE Sbjct: 8 LKGQLLLDSGQLSGSFFQRTVVLVCQHDAEGALGLVLNRDSGNKLGEMVLADL---PEQL 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 D + LGGP+ L++ + + N + S + L LG P Sbjct: 65 T-----DNALYLGGPVQLSALSYLYS--DTYLPEASVLPNVELGHSLETLVELGESFSPG 117 Query: 123 D-VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + + GYA W GQLE+E+ AWLT PA ++++F T D W+ K G Sbjct: 118 KRIKLFAGYAGWSPGQLEEEMKRKAWLTHPATVDLVFDTDPDDLWQYVLKQKG 170 >UniRef50_A6C880 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C880_9PLAN Length = 188 Score = 76.6 bits (187), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 56/169 (33%), Positives = 79/169 (46%), Gaps = 12/169 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ HFL+A L D F RSVV I EHN GA G+IVN+P + I L + P Sbjct: 4 SLKGHFLVASRKLNDLNFYRSVVLIVEHNEQGATGLIVNRP-SSFSITNALSRYFDMP-- 60 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL---GTD 118 +L+ V +GGP+ + F LH S+ I + M +S ++ E + ++ Sbjct: 61 -----KLEDMVFMGGPVEPNGMFALHNAGDLEKSTEAIVPDLFMGSSPEIFEQVIWRISE 115 Query: 119 KQPS-DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRW 166 P D + G A W QLE EI WL PA +F+ D W Sbjct: 116 GDPHLDFRIFFGCAGWAPLQLESEINRMDWLNTPATTEDIFEIDPYDIW 164 >UniRef50_A5FNN9 Putative uncharacterized protein n=18 Tax=Bacteroidetes RepID=A5FNN9_FLAJ1 Length = 209 Score = 76.3 bits (186), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 50/179 (27%), Positives = 85/179 (47%), Gaps = 20/179 (11%) Query: 6 HFLIAMPAL-QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 H LIA P++ D F RSV+ + +HN G++G I+NKP LK T Sbjct: 33 HLLIAEPSIIGDLSFNRSVILLADHNKEGSIGFIINKP------------LKYTINDLIP 80 Query: 65 SIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTV----MTTSRDVLETLGTDKQ 120 I + + GGP+ +D + +H P +S+ IS+ +++D++ +K Sbjct: 81 EIDANFKIYNGGPVEQDNLYFIHNIPDLIPNSVEISNGIYWGGDFESTKDLINDGSINK- 139 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADL-NILFKTPIADRWREAAKLIGVDIL 178 +++ LGY W++ QLE E+ N+W+ A + N + W+E +G D L Sbjct: 140 -NNIRFFLGYTGWDENQLENEMQGNSWIIADNNYKNKIIGKSTTHFWKEQIIELGGDYL 197 >UniRef50_Q1Q3L0 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3L0_9BACT Length = 188 Score = 73.9 bits (180), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 46/154 (29%), Positives = 70/154 (45%), Gaps = 11/154 (7%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LIA P DP F ++VV ICEH+ G +G+I+NK L E + + Sbjct: 11 LIANPQGTDPNFMQTVVLICEHSKRGTLGLILNKTLGKKGQEIFVSSANTKTK------- 63 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDKQPSDVL- 125 DK + GGP+ + F LH N + ++I + + +++ + K SD + Sbjct: 64 -DKEIFFGGPVDTNNMFYLHGNFKNETHNCVKICEGVYLGSNQGCFNAFMSRKNVSDNIF 122 Query: 126 -VALGYASWEKGQLEQEILDNAWLTAPADLNILF 158 + LG A W GQLE EI W A ++F Sbjct: 123 RLYLGCACWSGGQLESEIETKCWTVGTATEKMVF 156 >UniRef50_C7PSM3 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PSM3_CHIPD Length = 184 Score = 72.0 bits (175), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 48/176 (27%), Positives = 81/176 (46%), Gaps = 14/176 (7%) Query: 8 LIAMPALQDPIFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 LIA P L+D F R+VV +CEH + G+ G ++NK + E + PE +I Sbjct: 10 LIADPFLKDQNFARTVVLLCEHQESRGSFGFVLNKVFDQSLNE-------LVPEVLINNI 62 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP--SDV 124 R V GGP+ D +H P I D D + +L + + + Sbjct: 63 R----VYYGGPVQIDTIHFIHQQPELIRGGFEIRDGVYWGGEFDQVVSLINSGRLDLNKI 118 Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 +GY+ W GQLE E+ + +W+ + ++ ++F+ + W +A K +G + M Sbjct: 119 KFFIGYSGWSSGQLENELNEKSWILSESNAPLIFEAKEQNIWPQALKNLGANFAIM 174 >UniRef50_UPI0001745679 hypothetical protein VspiD_25265 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745679 Length = 204 Score = 71.6 bits (174), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 54/167 (32%), Positives = 86/167 (51%), Gaps = 17/167 (10%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPE 60 +L L+A PAL+DP F +V+ + HNT +GA G I+N+PL+ ++ +L+ Sbjct: 29 SLSGSLLVASPALRDPNFFHTVLLLASHNTEDGAFGYILNRPLDK-RVADLLDD------ 81 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIR-ISDNTVMTTSRDVLETLGTDK 119 +D + PV LGGP+ ++ L N+ S R + T ++T + + E DK Sbjct: 82 -KDLGRLGEVPVFLGGPVGTNK---LSFAAFNWNSKKRELRMQTHLSTEQAMKEL---DK 134 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRW 166 S V +GY+ W +GQLE E+ N+W+T I+ P D W Sbjct: 135 GRS-VRGFVGYSGWSEGQLENELEQNSWITCAPLSKIVTAQPSTDLW 180 >UniRef50_UPI0001C3133A protein of unknown function DUF179 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C3133A Length = 182 Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 52/184 (28%), Positives = 86/184 (46%), Gaps = 20/184 (10%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ L+A PALQDP F R+VV I EHN +GAMG+++N+P E + Sbjct: 4 SLKGKLLLASPALQDPNFARTVVLIAEHNEDGAMGLVLNRPATTTVAE--------SAPE 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNT-VMTTSRDVLETLGTDKQ 120 +E + ++P+ +GGP+ +L A+ + + D+ ++ D + +Q Sbjct: 56 LEELVEAEEPIYIGGPVQPSAVIVLAAFEEPAAAGLLVRDDVGFLSAEADFATSRDATRQ 115 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + V G+A W GQL++E+ W+ P LF + W D+LT Sbjct: 116 ---LRVFAGHAGWGPGQLDEELEREDWIVEPPLPQELFSEDAEELWG--------DVLTR 164 Query: 181 PGVA 184 G A Sbjct: 165 KGGA 168 >UniRef50_B4CVG9 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVG9_9BACT Length = 186 Score = 70.5 bits (171), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 51/175 (29%), Positives = 77/175 (44%), Gaps = 14/175 (8%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITP 59 ++L LIA P L DP FRRSV++I ++ G+ G+I+N+P E + K Sbjct: 9 ISLAGSLLIAHPGLLDPNFRRSVLFISSNDAQEGSFGLIINRPASRTVAELLPNK----- 63 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 D + PV LGGP+A D+ + + V+ + +++ Sbjct: 64 ---DLGMLSRVPVFLGGPVATDQLVFAAFQWHEETERMVCRPHLVIDEAAEIVH-----D 115 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + + V +GYA W KGQLE E+ WL PA + L WRE G Sbjct: 116 ETTIVRAFVGYAGWSKGQLEGELAQRTWLVRPAARDTLDLERCPTLWREITSTFG 170 >UniRef50_A4C260 Putative transcriptional regulator n=1 Tax=Polaribacter irgensii 23-P RepID=A4C260_9FLAO Length = 185 Score = 70.1 bits (170), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 43/164 (26%), Positives = 76/164 (46%), Gaps = 15/164 (9%) Query: 8 LIAMPA-LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L+A P+ L D F +++V + EH N ++G I+NKPL + +L +K + + Sbjct: 12 LVAEPSILNDTSFNKAIVLLTEHTANNSVGFILNKPLA-YNLNDLLPNIKCSFK------ 64 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QPSDV 124 + GGP+ +D + LH P + SI +S+ + L L + S++ Sbjct: 65 -----IYQGGPVEQDNLYFLHRVPQLLSKSIAVSNGVYWGGDFNQLTELLNNSVLDTSEI 119 Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWRE 168 LGY+ W+K QL E+ + +W D + + W+E Sbjct: 120 RFFLGYSGWDKEQLGAELKEKSWFVTENDFENILSNDEKNLWKE 163 >UniRef50_B0SHS8 Transcriptional regulator n=6 Tax=Leptospira RepID=B0SHS8_LEPBA Length = 188 Score = 70.1 bits (170), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 51/170 (30%), Positives = 86/170 (50%), Gaps = 13/170 (7%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LI+ ++ F +SVV + +H+ +GA G+++NKP + +E +++ L +++ Sbjct: 13 LISNSSVIQDFFHKSVVLMVDHDDDGAFGLVLNKPTDQ-TMESLIKNLP-------DTVH 64 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD-VLETLGTDKQPSDVLV 126 +KPV GGP+ ILH + + M S D +LE L +D+ VL Sbjct: 65 SNKPVYAGGPVDNLFVSILHNGKQTADPGVEVVPGIYMARSFDTMLEVLSSDQIQFRVL- 123 Query: 127 ALGYASWEKGQLEQEILDNAWLTAP-ADLNILFKTPIADR-WREAAKLIG 174 GYA W GQLE E +W+ + D +I+FK ++ W+EA + G Sbjct: 124 -QGYAGWSSGQLESEFDRLSWVVSDLVDDSIVFKEDESEVIWKEALRSKG 172 >UniRef50_B1ZWF6 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZWF6_OPITP Length = 184 Score = 68.9 bits (167), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 51/177 (28%), Positives = 77/177 (43%), Gaps = 23/177 (12%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKP----LENLKIEGILEKLKI 57 +L L+A PAL+DP FRR++V + HN GAMG+++N+P L L E L L Sbjct: 11 SLAGSLLLAHPALRDPNFRRAIVLMSVHNAEGAMGVVLNRPMGKRLGELNGEFALGSLAS 70 Query: 58 TPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT 117 P ++ ++ V++ ED GF LH + M + + Sbjct: 71 VPLFHGGPVQTEQLVLVAWQPQED-GFRLH---------FGVEPERAMQLAAE------- 113 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + + + LGY+ W GQLE E+ WL A +L A WR +G Sbjct: 114 --EGTQLRAFLGYSGWGGGQLEAELKQKTWLVADMPAGLLEGPQDAAMWRSVVSSLG 168 >UniRef50_B1MML1 UPF0301 protein MAB_4928c n=20 Tax=Corynebacterineae RepID=Y4928_MYCA9 Length = 208 Score = 67.0 bits (162), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 43/153 (28%), Positives = 69/153 (45%), Gaps = 19/153 (12%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 LIA L +P FRRSV++I EHN G +G+++N+P E + + K+ +P Sbjct: 30 LLIANTNLFEPTFRRSVIFIVEHNDGGTLGVVLNRPSETAVYNVLPQWAKLAGKP----- 84 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVL- 125 K + +GGP+ D L T + + I + + + D +P D+ Sbjct: 85 ---KTMFVGGPVKRDAALCLAT----LRAGVSIDGVKGLRHVAGRMAMVDLDAEPEDIAP 137 Query: 126 ------VALGYASWEKGQLEQEILDNAWLTAPA 152 V GY+ W GQLE E+ + W+ A Sbjct: 138 LVEGIRVFAGYSGWTIGQLEGEVERDDWIVLSA 170 >UniRef50_Q47MA0 UPF0301 protein Tfu_2389 n=3 Tax=Actinomycetales RepID=Y2389_THEFY Length = 198 Score = 66.6 bits (161), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 51/167 (30%), Positives = 76/167 (45%), Gaps = 17/167 (10%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITP 59 ++L L+A P L+DP F RSVV++ + + G +G+I+N+P E L + +L + Sbjct: 8 LSLTGALLVATPLLEDPNFYRSVVFVIDDTPDEGTLGVILNRPSE-LGVGEVLAEWG--- 63 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFAS-SIRISDNTVMTTSRDVLETLGTD 118 E + + GGP+ +D G L P + D T + L T+ D Sbjct: 64 ----EHVSQPAVMFAGGPVGQDAGLALAVPDDGQRPLGWKSLDAMDAKTWPNGLGTVDLD 119 Query: 119 KQPSDVLVAL-------GYASWEKGQLEQEILDNAWLTAPADLNILF 158 P V AL GYA W GQL EI AW PA ++ +F Sbjct: 120 TPPQLVADALRQMRVFAGYAGWSAGQLRAEIDQGAWYVLPATVDDVF 166 >UniRef50_C2G2S7 Transcriptional regulator n=3 Tax=Sphingobacteriaceae RepID=C2G2S7_9SPHI Length = 189 Score = 66.2 bits (160), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 45/171 (26%), Positives = 84/171 (49%), Gaps = 14/171 (8%) Query: 8 LIAMPALQDPIFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L++ P + D F+RSV+ + +HN T+G +G I+N+ + L ++ +D Sbjct: 13 LVSEPFMLDQNFKRSVILLADHNETDGTVGFILNQRTQ----------LMLSDVFQDVER 62 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ--PSDV 124 D P+ LGGP+ + F +H S I D+ ++L L +++ +V Sbjct: 63 EADFPIYLGGPVECEALFFIHKAYDLLLSGEHIIDDVYWGGDIELLLRLAKEEKITSDEV 122 Query: 125 LVALGYASWEKGQLEQEILDNAW-LTAPADLNILFKTPIADRWREAAKLIG 174 +GY+ W QL++EI +N+W + + ++ F T D W++A +G Sbjct: 123 KFFIGYSGWSPSQLDREIKENSWAVDNKFNKDLTFITDGEDLWKQALISMG 173 >UniRef50_C7MS43 Predicted transcriptional regulator n=3 Tax=Actinomycetales RepID=C7MS43_SACVD Length = 198 Score = 65.5 bits (158), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 46/171 (26%), Positives = 71/171 (41%), Gaps = 27/171 (15%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L+A P + DP FRR+VV++ +H G +G+++N+P E E + EPR Sbjct: 20 LLVAAPTMFDPNFRRTVVFVIDHRAEGTLGVVLNRPSEVAVREVLPRWGDHVAEPRS--- 76 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTS----RDVLETLGTDKQPS 122 V +GGP+ + L +++R + R + + D P Sbjct: 77 -----VFVGGPVEKKTALCL--------AALRTGETAATVPGVIGVRGPVALVDLDSDPE 123 Query: 123 -------DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRW 166 + V GYA W+ GQL EI WL PA + + P D W Sbjct: 124 MLASKVRGLRVFAGYAGWDGGQLASEIERGDWLIVPALPSDVMAGPTRDLW 174 >UniRef50_C1ZJG4 Predicted transcriptional regulator, COG1678 n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJG4_PLALI Length = 188 Score = 63.5 bits (153), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 12/161 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ L+A L+D F ++VV I E N NG+MG+++N+P L + E ++ Sbjct: 4 SLRGKLLVASKQLKDSNFYKTVVLIVEDNENGSMGLVLNRPSSILVNHALSEHFQLP--- 60 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 ES L V +GGP+ FILH + + S + E + P Sbjct: 61 --ESAEL---VHVGGPVEPAALFILHNLEELSHEGTGVIPGVWLGNSGEAFEDVLRSSDP 115 Query: 122 SD----VLVALGYASWEKGQLEQEILDNAWLTAPADLNILF 158 V G A W GQLE E+ W APA +I+F Sbjct: 116 HQPGVRFRVFCGCAGWSPGQLEGELAHGDWHVAPAIKSIVF 156 >UniRef50_C7PBJ3 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PBJ3_CHIPD Length = 146 Score = 63.2 bits (152), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 39/146 (26%), Positives = 71/146 (48%), Gaps = 19/146 (13%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 F+ + L+ +F +V+YI E+N NGAMG IVN + L +L+ R Sbjct: 6 FINSTSLLEKSVFESTVIYITEYNENGAMGFIVNN-----RFPRKLNELEEFSHGR---- 56 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVM-----TTSRDVLETLGTDKQP 121 D P+ GGP+ ++ F +H P + ++ DN + + + E T++ Sbjct: 57 --DFPLWEGGPVDKEHLFFIHQRPDLISGGEQVGDNIFLGGDFQAAVKHINEHTLTEQ-- 112 Query: 122 SDVLVALGYASWEKGQLEQEILDNAW 147 D+ + +GY W+ +L++EI + +W Sbjct: 113 -DIKIFIGYCGWDYKELDEEIDEGSW 137 >UniRef50_A6E847 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6E847_9SPHI Length = 168 Score = 62.8 bits (151), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 43/162 (26%), Positives = 77/162 (47%), Gaps = 14/162 (8%) Query: 16 DPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLG 75 DP F+RSVV + +H G +G I+N+ + IL L PE ++ PV +G Sbjct: 2 DPNFKRSVVLLTDHQEEGTVGFILNQ-----RSTLILSDL--VPEFAGVAL----PVYIG 50 Query: 76 GPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL--GTDKQPSDVLVALGYASW 133 GP+A D +H ++ + + L+ L +P+++ +GY+ W Sbjct: 51 GPVATDTLHFIHRCYDRLNDGQEVAKGIYWGGNFEALKVLLLTGSIEPAEIKFFIGYSGW 110 Query: 134 EKGQLEQEILDNAWLTAPA-DLNILFKTPIADRWREAAKLIG 174 +GQL+ E+ +N W+ + +++F + WREA +G Sbjct: 111 SEGQLKLELEENTWMVSDRFHADVVFSDNEEELWREAVINLG 152 >UniRef50_C1B7P4 UPF0301 protein ROP_34500 n=13 Tax=Corynebacterineae RepID=Y3450_RHOOB Length = 201 Score = 60.5 bits (145), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 42/171 (24%), Positives = 76/171 (44%), Gaps = 17/171 (9%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L++ L +P FRR+V+Y+ EHN G++G+++N+P E + + + +T P Sbjct: 23 LLVSSTDLVEPAFRRTVIYVIEHNEAGSLGVVINRPSETAVHDVLPQWAPLTARP----- 77 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTDKQ----- 120 + +GGP+ D L T + A +R R V+ L +D + Sbjct: 78 ---SALYVGGPVKRDAALCLATLRTGAQADGVR---GLRRVHGRVVMVDLDSDPEVVAPL 131 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 V + GY+ W GQL+ E+ + W+ A + + D W + + Sbjct: 132 VEGVRIFAGYSGWTYGQLDSELQRDDWIVISALASDVLAPARVDVWAQVLR 182 >UniRef50_B5JR88 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JR88_9BACT Length = 187 Score = 60.1 bits (144), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 48/169 (28%), Positives = 80/169 (47%), Gaps = 20/169 (11%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKL-KITPEP 61 L L+A P L+DP F SVV + H +G++G+++NK G E+L +++ E Sbjct: 12 LTGSLLLAHPHLKDPNFASSVVLLTRHEESGSLGVVLNK--------GTGERLGQLSSEF 63 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTP--PSNFASSIRISDNTVMTTSRDVLETLGTDK 119 D + + PV LGGP+ +++ + P + ++ S+ +ET Sbjct: 64 ADCGLG-EVPVYLGGPVNQNQIILAAWKLIPEKGQFQLYFGMEPLVAQSK--MET----- 115 Query: 120 QPSDVLVAL-GYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWR 167 P A GY+ W +GQL E+ DNAW+ + D + +D WR Sbjct: 116 DPDLEFRAFKGYSGWSEGQLVGELEDNAWVVSEVDAESISTKEGSDLWR 164 >UniRef50_C6X421 Putative transcriptional regulator n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X421_FLAB3 Length = 183 Score = 60.1 bits (144), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 46/151 (30%), Positives = 67/151 (44%), Gaps = 16/151 (10%) Query: 1 MNLQHH--FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT 58 MN + +I+ P + IF RSVV + +HN GA G+I+NK +N+ L I Sbjct: 2 MNYSYKGKIIISTPDISGDIFSRSVVLVIDHNAEGAFGLILNKKNQNMS----ARLLNIF 57 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD--VLETLG 116 R+D V GGP+ D+ F ++ S I+D +T + V + Sbjct: 58 ------GFRVD--VYEGGPVENDKIFFINKGEKVTESFSEINDGFYLTEDIENVVAAIIE 109 Query: 117 TDKQPSDVLVALGYASWEKGQLEQEILDNAW 147 D+ V GY+ W GQLE EI W Sbjct: 110 GRLSAEDIKVFSGYSGWAPGQLENEIRRKLW 140 >UniRef50_Q82D55 UPF0301 protein SAV_5129 n=12 Tax=Actinomycetales RepID=Y5129_STRAW Length = 193 Score = 59.7 bits (143), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 50/181 (27%), Positives = 82/181 (45%), Gaps = 26/181 (14%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLK-ITPE 60 +L L+A PAL DP F R+VV + +H+ G++G+++N+P + + ILE + E Sbjct: 9 SLTGRLLVATPALADPNFDRAVVLLLDHDEEGSLGVVLNRPTP-VDVSDILEGWADLAGE 67 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA------SSIRISDNTVMTTSRDVLET 114 P V GGP++ D + P + R+ + E Sbjct: 68 P--------GVVFQGGPVSLDSALGVAVIPGGASVDGAPLGWRRVHGAIGLVDLEAPPEL 119 Query: 115 LGTDKQPSDVLVALGYASWEKGQLEQEILDNAWL---TAPADLNILFKTPIADR-WREAA 170 L K + + GYA W GQLE E+++ AW + P D++ +P +R WRE Sbjct: 120 LA--KALGSLRIFAGYAGWGPGQLEDELVEGAWYVVESEPGDVS----SPSPERLWREVL 173 Query: 171 K 171 + Sbjct: 174 R 174 >UniRef50_C1RPZ7 Predicted transcriptional regulator, COG1678 n=12 Tax=Actinomycetales RepID=C1RPZ7_9CELL Length = 184 Score = 59.7 bits (143), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 45/172 (26%), Positives = 72/172 (41%), Gaps = 20/172 (11%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDES 65 L+A P L+D FRR+VV + +H GA+G+++++PL+ ++ + P+ E Sbjct: 8 RLLVATPGLRDRSFRRAVVLVLDHTAEGALGVVLDRPLD-------IDARTVLPQ-WQEH 59 Query: 66 IRLDKPVMLGGPLAEDRGFIL------HTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + + GGP+A D L PP A S R+ D L D Sbjct: 60 LSTPGRLFQGGPVARDTALALADLPGADAPPGVQALSPRLG-----VVDLDAPPALVVDA 114 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 + + V +GYA W GQL+ E+ W + F WR + Sbjct: 115 VRA-LRVFVGYAGWGPGQLDDEVDVGGWFVVDHEPGDAFSADPRGLWRRVLR 165 >UniRef50_A1SG68 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A1SG68_NOCSJ Length = 191 Score = 59.3 bits (142), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 49/174 (28%), Positives = 78/174 (44%), Gaps = 25/174 (14%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKL-KITPEPRDES 65 L+A PAL DP F +VV + + + GA+G+++N+P + ++ +L+ + EP E Sbjct: 15 LLVATPALLDPNFADTVVLLLDVDEQGALGVVLNRP-SAIPVDDVLDGWGDVAAEP--EV 71 Query: 66 IRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISD--------NTVMTTSRDVLETLGT 117 + PV L G LA P F R+ D +T + R LE L Sbjct: 72 LFQGGPVGLQGALAVALLARADDVPVGF----RVVDGRLGLVDLDTPLELVRGGLEGL-- 125 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 V GYA W QL EI + +W P + +F++ +D WR+ + Sbjct: 126 -------RVFAGYAGWGADQLRDEIEEGSWYVVPGEARDVFRSDASDLWRDVLR 172 >UniRef50_C4DLM5 Predicted transcriptional regulator, COG1678 n=5 Tax=Actinomycetales RepID=C4DLM5_9ACTO Length = 196 Score = 58.9 bits (141), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 46/155 (29%), Positives = 72/155 (46%), Gaps = 21/155 (13%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PALQDP F R+VV + H + GA+G+++N+ E E + + ++ EP Sbjct: 15 SLVGRLLVATPALQDPNFERTVVLLVSHESAGALGVVLNRATEVPVAEVLGDWSELAREP 74 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLET--LGTDK 119 + GGP+ + L S + + + L T L D Sbjct: 75 --------AVLFEGGPVQPEAAIALGWMRSG------VGEPSCFKPFAGRLGTLDLSVDP 120 Query: 120 QP-SDVL----VALGYASWEKGQLEQEILDNAWLT 149 +P +D L V GY+SW GQL+ E+ D AW+ Sbjct: 121 EPLADRLEGMRVFAGYSSWGAGQLDDELKDGAWMV 155 >UniRef50_C8XE74 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=C8XE74_NAKMY Length = 190 Score = 58.9 bits (141), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 49/167 (29%), Positives = 74/167 (44%), Gaps = 19/167 (11%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L+A P L+DP FRR+VVY+ H+ +G +G+I+N+P E ++ +L P + Sbjct: 12 LLVATPGLRDPHFRRTVVYLVAHSVDGTVGVILNRPSET-AVQNVL------PGWASHTA 64 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLV 126 R V GGP+ L ++ R V T VL L D P+ V Sbjct: 65 R-PHAVFAGGPVQTSAAMCLGV--CRIGTNPREVQGVVGVTGPVVLVDL--DGDPATVTQ 119 Query: 127 AL-------GYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRW 166 +L G A W+ QL EI++ +W P + + P D W Sbjct: 120 SLRGIRIYAGRAGWDAEQLVDEIIEGSWYVVPGLPDDVLAGPRTDLW 166 >UniRef50_Q4CSL3 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4CSL3_TRYCR Length = 523 Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 52/165 (31%), Positives = 77/165 (46%), Gaps = 33/165 (20%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDES 65 H L+A P L + FR SV+ + N A I+NKPLEN EG+L ++ T Sbjct: 280 HMLLAHPQLYE-FFRYSVMIVVRSTPNEAAAFILNKPLEN--DEGMLMQVNST------- 329 Query: 66 IRLDK------------PVMLGGPLAEDRG------FILHTPPSNFASSIRISDNTVMTT 107 IRL+ VM+GGP++ RG +LH P + +I +S + + Sbjct: 330 IRLNHVHPILGKHLGNHTVMIGGPVS--RGSFDSSILLLHRIP-DVEDAIPVSQSLWVDG 386 Query: 108 SRDVLETLGTD--KQPSDVLVALGYASWEKGQLEQEILDNAWLTA 150 + DVL+ D D++V G++ W GQL EI W+ A Sbjct: 387 NYDVLQKKLDDGTADAKDIVVICGFSGWGAGQLAGEISSGTWVVA 431 >UniRef50_C0AXX9 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXX9_9ENTR Length = 56 Score = 56.2 bits (134), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 24/49 (48%), Positives = 35/49 (71%) Query: 139 EQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 E+ N+WLT A I+F TP+A+RW +AA+LIG++I T+ +AGHA Sbjct: 8 ERNFRKNSWLTVEASPQIIFDTPVAERWHKAAELIGINIHTISPIAGHA 56 >UniRef50_B7G677 Predicted protein n=2 Tax=Bacillariophyta RepID=B7G677_PHATR Length = 393 Score = 55.8 bits (133), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 42/150 (28%), Positives = 72/150 (48%), Gaps = 16/150 (10%) Query: 18 IFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITPEPR---DESIRL---DK 70 +F ++VV I +H+ T G+ GI++N+P++ + LKI E D S++L Sbjct: 208 VFHQTVVLIIDHHETTGSTGIVINRPMDG-------DLLKIASEQESSLDLSLKLAFSQA 260 Query: 71 PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK-QPSDVLVALG 129 V GGP+ D +LH S ++ + S +++ + T + P+ L G Sbjct: 261 RVTYGGPVLTDEFSVLHGF-GEVEGSRKLCPGVYIGGSEELMNEVRTLRFDPAHALFVKG 319 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFK 159 +A W GQL +EI W TA A + + + Sbjct: 320 HAGWVPGQLTREISKGVWYTAAASSDFILR 349 >UniRef50_D0A6S5 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=D0A6S5_TRYBG Length = 475 Score = 54.3 bits (129), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 51/172 (29%), Positives = 79/172 (45%), Gaps = 25/172 (14%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-----PE 60 L+A P L D FR +V+ + N + +++NKPLEN K G L + +T Sbjct: 237 QLLLAHPQLYD-FFRYTVMIVVRVTPNESAALVLNKPLENDK--GALMPVSMTMRLSSAH 293 Query: 61 PRDESIRLDKPVMLGGPLAEDRG------FILHTPPSNFASSIRISDNTVMTTSRDVLET 114 P + VM+GGP++ RG +LH P + +I +S + + S D L+ Sbjct: 294 PLFAKHLCNHTVMIGGPVS--RGSFDSTMLLLHRIP-DVDDAIPLSHSLWIDGSYDTLQQ 350 Query: 115 LGTD--KQPSDVLVALGYASWEKGQLEQEILDNAWLTA------PADLNILF 158 D P D++V G++ W QLE E+ W+ A PA N +F Sbjct: 351 KIEDGTADPKDIVVICGFSGWGVQQLEGELQSGTWVAASGSTDDPALDNFVF 402 >UniRef50_C1E1K3 Predicted protein n=2 Tax=cellular organisms RepID=C1E1K3_9CHLO Length = 271 Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 37/151 (24%), Positives = 72/151 (47%), Gaps = 26/151 (17%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-PEPRDESI 66 L+ P ++ +R SVV + H+ G+ G+I+N+P N +++ ++ ++ + P R + Sbjct: 79 LLLAPETEEGWWRHSVVLVLNHDAEGSTGVILNRPT-NAQLKNVVPEIDYSAPHHR---V 134 Query: 67 RLDKPVMLGGPLAEDRG-----FILHTPPSNFASSI-----RISDNTVMTTSRDVLETLG 116 ++ V +GGP+ ++G + HT S + +SD + Sbjct: 135 LANRHVSMGGPMGTEKGARCLVALSHTRLDGATSEVFPGLWHVSD----------FSAVK 184 Query: 117 TDKQPSDVLVALGYASWEKGQLEQEILDNAW 147 + +PS ++V +GY W GQL E+ N W Sbjct: 185 PEHEPS-LMVFVGYCGWMSGQLNAEVAANGW 214 >UniRef50_A9RVW3 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RVW3_PHYPA Length = 324 Score = 53.1 bits (126), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 38/146 (26%), Positives = 64/146 (43%), Gaps = 11/146 (7%) Query: 19 FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLK-ITPEPRDESIRLDKPVMLGGP 77 F R V++I H+ G+ G+I+N+P + G L++ K + PE P+ GG Sbjct: 162 FHRVVIFIFAHDAGGSAGVILNRPTQYSL--GQLDEFKDLMPELSS------CPLYFGGD 213 Query: 78 LAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ-PSDVLVALGYASWEKG 136 + ++H P S I + M + + + + + + P+D L +A W G Sbjct: 214 VGPQCTQVIHGIP-GLEDSREIMNGVYMGGTASIQDNIRSGQSTPNDYRWFLRFAGWGPG 272 Query: 137 QLEQEILDNAWLTAPADLNILFKTPI 162 QLEQE+ W A + K I Sbjct: 273 QLEQEVAAGVWYLASCSKRFVLKQCI 298 >UniRef50_C0YNI4 Transcriptional regulator n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YNI4_9FLAO Length = 182 Score = 52.8 bits (125), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 40/142 (28%), Positives = 62/142 (43%), Gaps = 14/142 (9%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIR 67 LI+ P + IF RSVV + EHN +GA G+I+NK K + K K + + E Sbjct: 10 LISTPDISGDIFSRSVVLVIEHNESGAFGLILNK-----KNSQMSSKFKDFFDFKIE--- 61 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT--TSRDVLETLGTDKQPSDVL 125 V GGP+ D+ F + I+D +T R + L ++ + Sbjct: 62 ----VYDGGPVENDKVFFIVKGKRVTEIYTDITDEYYLTEDIERIINAVLSSELSIEHIK 117 Query: 126 VALGYASWEKGQLEQEILDNAW 147 + GY+ W QL+ E+ W Sbjct: 118 IFSGYSGWSPNQLDTEVQRKMW 139 >UniRef50_B2UMM6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UMM6_AKKM8 Length = 187 Score = 52.0 bits (123), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 16/171 (9%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL H L+A P L F SV+++ +G I+N P + + + +I Sbjct: 13 NLAGHLLVAAPYLDGAGFHHSVIFLSRAEKEFVIGHILNHP-SGMNVGDVARHTEIP--- 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 ES+ P+ GGP+ ++ FA+ IR D + + L + P Sbjct: 69 --ESL-YAVPIFKGGPVERNQLI--------FAAFIRTEDKLRVQFHLQEEQALEYLEDP 117 Query: 122 SDVLVA-LGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 +L A +G++ W QL +E+ D AW +P +I +T + W A + Sbjct: 118 RAILRAYVGHSGWTPPQLRRELNDRAWYVSPMVPDICLETDSSKVWAMAMR 168 >UniRef50_Q6A827 Conserved protein, DUF179 n=2 Tax=Propionibacterium acnes RepID=Q6A827_PROAC Length = 192 Score = 52.0 bits (123), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 43/168 (25%), Positives = 73/168 (43%), Gaps = 20/168 (11%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEK-LKITPEPRDES 65 L+A + + IF SVVY+ + +G +G+IVN+P + L + + P+D Sbjct: 15 LLVASRQIDEGIFYESVVYLIDVALDGVLGVIVNQPCTAGTLHRQLPGWVHLATPPQD-- 72 Query: 66 IRLDKPVMLGGPLAEDRGFIL------HTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + LGGP++ + L P + ++ + T +++E TD Sbjct: 73 ------LFLGGPMSPNGAICLARVQRSSEEPPGWRRVQGLTGLLHLDTPTELVEGAFTD- 125 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWR 167 V + GYA W GQLE E++ W+ A A +F + WR Sbjct: 126 ----VRIFAGYAEWVPGQLEAELIRGDWIRAVAHPEDIFSSEPRGLWR 169 >UniRef50_C5BU30 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BU30_TERTT Length = 192 Score = 50.8 bits (120), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 50/173 (28%), Positives = 77/173 (44%), Gaps = 24/173 (13%) Query: 8 LIAMPALQD-PIFRRS---VVYICEHNTNGAMGIIVN----KPLENLKIEGILEKLKITP 59 LIA PA D P F S ++Y+ H +GA+G+ +N KPL + E+ +I Sbjct: 10 LIANPATTDLPQFAASAEKLIYVVHHGDDGAVGVCLNEYFGKPLADFS-----EQYEILA 64 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 S+ V GGPLA + +IL + ++ I + + S + + Sbjct: 65 SVSPLSLA-SVTVHSGGPLATELPWIL-------SRAVDIYPHQINNKSLSLNFSQEAFA 116 Query: 120 QPS---DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREA 169 PS D LV LG SW GQLE+E+ W PA +L + +++ A Sbjct: 117 DPSIHMDALVGLGSFSWGPGQLEKEVSGFMWHCFPAQKPLLNRLHFEHKYQSA 169 >UniRef50_Q4QG99 Putative uncharacterized protein n=3 Tax=Leishmania RepID=Q4QG99_LEIMA Length = 670 Score = 49.7 bits (117), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 42/157 (26%), Positives = 73/157 (46%), Gaps = 15/157 (9%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLEN-----LKIEGILEKLKITPE 60 LI+ P + FRR+V+ + H T+ + +++NKPL N + IE + ++ P Sbjct: 361 QLLISHPTARG-FFRRTVLLMVRHVTHESAALVLNKPLRNEEGLEMSIEATVRLGRVHPI 419 Query: 61 PR----DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLG 116 R ++ + PVM G +D F+LH P ++ + N + DVL Sbjct: 420 FRRHLAQHTLMIGGPVMSGSSF-DDSIFLLHRVP-GVPHALPLGSNLWLDGDLDVLMAKL 477 Query: 117 TDKQPS---DVLVALGYASWEKGQLEQEILDNAWLTA 150 ++ S D++V G+A W QL+ E+ W+ A Sbjct: 478 DAEEASAEEDIVVLCGFAGWGFDQLKGELGHGYWVVA 514 >UniRef50_C7Q9Z6 Putative uncharacterized protein n=5 Tax=Actinomycetales RepID=C7Q9Z6_CATAD Length = 205 Score = 48.5 bits (114), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 53/184 (28%), Positives = 78/184 (42%), Gaps = 19/184 (10%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLK-ITPE 60 L L+A L DP F R+VV + +H+ +G +G+++N+P +L +E +LE + E Sbjct: 20 RLTGKLLVATTVLVDPNFDRTVVLVVDHDDDGTLGVVLNRP-GSLDVEDVLETWAPLAAE 78 Query: 61 PRDESIRLDKPVMLGGPLAEDR--GFILHTP----PSNFASSIRISDNTVMTTSRDV-LE 113 P V LGGP+A D G P P R + D E Sbjct: 79 P--------PTVFLGGPVALDSALGIACVRPEAAVPGEEPLGWRQFSGRLGLVDLDAPPE 130 Query: 114 TLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 L D + + + GYA W GQL E+ AW +L +F T + WR + Sbjct: 131 VLAPDL--TALRIFAGYAGWGPGQLAGELAQRAWYVVEPELADVFTTEPEELWRRVLRRQ 188 Query: 174 GVDI 177 G I Sbjct: 189 GGTI 192 >UniRef50_A8IT21 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IT21_CHLRE Length = 234 Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 41/168 (24%), Positives = 76/168 (45%), Gaps = 11/168 (6%) Query: 14 LQDPIFRRSVVYICEHNTNGAMGIIVNKP---LENLKIEGILEKLKITPEPRDESIRLDK 70 L+D + V+++ H +G++GII+N+P + K G+ +L P P + + D Sbjct: 56 LKDDRLFQLVIFLTTHGPDGSVGIILNRPTGMVLGRKPGGLPLELG-GPVP-IQRVFQDN 113 Query: 71 PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS-DVLVALG 129 V GG A+ I+H + +++ M E + + P+ D G Sbjct: 114 MVYCGGFTAQQVIHIMHG--HRLQNCVQVVPGVYMAGEVAATEAVSGGRLPAGDFKFFSG 171 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFKTPI---ADRWREAAKLIG 174 +W G+LE ++ AW TA +++ K+ + WRE +L+G Sbjct: 172 AITWAPGELEAQMDRGAWYTAACSRSLVLKSALQLPVPLWREVLQLMG 219 >UniRef50_A8IRU3 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IRU3_CHLRE Length = 315 Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 72/184 (39%), Gaps = 21/184 (11%) Query: 7 FLIAMPAL---QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P L F R+ + + EH NG+ G+I+N+P ++ P R Sbjct: 126 LLLAHPLLFQNSQTYFHRAAILLLEHGDNGSYGVILNRPSTYF--------IRDIPLKRP 177 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT---TSRDVLETLGTDKQ 120 ++ D + +GG + +LH P + A ++ + M RD ++ Q Sbjct: 178 QTQFNDCRLYVGGDVGGGEVQVLH-PHGDLAGAVEVVKGVYMGGLDAGRDAIDA--GKAQ 234 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR----WREAAKLIGVD 176 D YA W GQL E W TA A +L K + W E L+G D Sbjct: 235 AQDFRWFSAYAGWAPGQLAMECKRGVWFTAAASPKLLLKEVEHGQGPSFWHELMTLLGGD 294 Query: 177 ILTM 180 + Sbjct: 295 YAEL 298 >UniRef50_Q8FSW7 UPF0301 protein CE2927 n=11 Tax=Corynebacterium RepID=Y2927_COREF Length = 201 Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 41/151 (27%), Positives = 64/151 (42%), Gaps = 15/151 (9%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 L+A P L P F RSV+ + EH+ G+ NL L + PE + + Sbjct: 23 LLVAAPDLASPEFSRSVILVIEHSHATTFGV-------NLASRSDLAVANVLPEWTELTA 75 Query: 67 RLDKPVMLGGPLAEDR--GFILHTPPSNFASSIR---ISDNTVMTTSRDVLETLGTDKQP 121 + + + +GGPL++ G + P + SS + +++ V R + + D + Sbjct: 76 K-PQALYIGGPLSQQAVVGLGVTKPGVDIESSTKFNKLANRLVHVDLRVTPDEVRDDLEG 134 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPA 152 GYA W GQL EI W APA Sbjct: 135 MRFFA--GYAEWAPGQLNDEIEQGDWYVAPA 163 >UniRef50_Q8NL65 UPF0301 protein Cgl3084/cg3414 n=5 Tax=Corynebacterium RepID=Y3084_CORGL Length = 189 Score = 46.2 bits (108), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 38/152 (25%), Positives = 67/152 (44%), Gaps = 17/152 (11%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGIL-EKLKITPEPRDES 65 L+A P + F RS+V I EH+ G+ ++ ++ + +L E + +T +P Sbjct: 11 LLVAAPDMASEDFERSIVLIIEHSPATTFGVNISS-RSDVAVANVLPEWVDLTSKP---- 65 Query: 66 IRLDKPVMLGGPLAEDR--GFILHTPPSNFASSI---RISDNTVMTTSRDVLETLGTDKQ 120 + + +GGPL++ G + P + +S ++++ V R E + D + Sbjct: 66 ----QALYIGGPLSQQAVVGLGVTKPGVDIENSTSFNKLANRLVHVDLRSAPEDVADDLE 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPA 152 GYA W GQL +EI W PA Sbjct: 122 GMRFFA--GYAEWAPGQLNEEIEQGDWFVTPA 151 >UniRef50_Q9LQ30 F14M2.10 protein n=6 Tax=rosids RepID=Q9LQ30_ARATH Length = 341 Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust. Identities = 43/174 (24%), Positives = 73/174 (41%), Gaps = 34/174 (19%) Query: 19 FRRSVVYI----CEHNTNGAMGIIVNKPL-ENLKIEGILEKLKITPEPRDESIRLDKPVM 73 F R+VV + H G G+++N+PL +N+K +K T + + + Sbjct: 170 FARTVVLLLRAGTRHPQEGPFGVVINRPLHKNIK------HMKSTKTELATTFS-ECSLY 222 Query: 74 LGGPLAEDRGFILHT-------------PPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 GGPL E F+L T P NF + + + V+ + VL + Sbjct: 223 FGGPL-EASMFLLKTGDKTKIPGFEEVMPGLNFGTRNSLDEAAVLV-KKGVL-------K 273 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 P + +GYA W+ QL +EI + W A +++ + W E +L+G Sbjct: 274 PQEFRFFVGYAGWQLDQLREEIESDYWHVAACSSDLICGASSENLWEEILQLMG 327 >UniRef50_Q7G645 Os10g0330400 protein n=3 Tax=Oryza sativa RepID=Q7G645_ORYSJ Length = 296 Score = 43.1 bits (100), Expect = 0.005, Method: Compositional matrix adjust. Identities = 40/171 (23%), Positives = 63/171 (36%), Gaps = 20/171 (11%) Query: 18 IFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGP 77 IF R+VV + G +G+I+N+P + I E + E +P+ GGP Sbjct: 127 IFERTVVLLLSAGVLGPVGVILNRP----SLMSIKEAQAVFAETDIAGAFSGRPLFFGGP 182 Query: 78 LAEDRGFILHTPPSNFASSI----RISDNTVMTTSRDVLETLGTDKQ--------PSDVL 125 L E F L P + A + + D + E++G + D Sbjct: 183 LEEC--FFLLGPRAAAAGDVVGRTGLFDEVMPGVHYGTRESVGCAAELVKRGVVGVRDFR 240 Query: 126 VALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPI--ADRWREAAKLIG 174 G+ WE+ QL E+ W A +L + W E L+G Sbjct: 241 FFDGFCGWEREQLRDEVRAGLWRVAACSPAVLGLATVVKGGLWEEVQGLVG 291 >UniRef50_B9SPX9 Electron transporter, putative n=2 Tax=fabids RepID=B9SPX9_RICCO Length = 350 Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust. Identities = 45/178 (25%), Positives = 70/178 (39%), Gaps = 38/178 (21%) Query: 19 FRRSVVYI----CEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRL-DKPVM 73 F R+VV + H G G+++N+PL N KI+ + P ++ + D + Sbjct: 175 FERTVVLLLRSGTRHPQEGPFGVVINRPL-NKKIK------HMKPTNKELATTFADCSLH 227 Query: 74 LGGPLAEDRGFILHT-------------PPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 GGPL E F+L T P F + + D + VL + Sbjct: 228 FGGPL-EASMFLLQTGEKEKLPGFEEVIPGLCFGARNSL-DEAAALVKKGVL-------K 278 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR----WREAAKLIG 174 P D +GYA W+ QL +EI + W A N++ W E +L+G Sbjct: 279 PQDFRFFVGYAGWQLDQLREEIESDYWYVASCSSNLICGNSSDSSSESLWEEILQLMG 336 >UniRef50_Q7URG7 Probable transcriptional regulator n=1 Tax=Rhodopirellula baltica RepID=Q7URG7_RHOBA Length = 259 Score = 42.0 bits (97), Expect = 0.011, Method: Compositional matrix adjust. Identities = 55/235 (23%), Positives = 83/235 (35%), Gaps = 58/235 (24%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGA-------------------------- 34 N FLIA P L D F RSVV I H GA Sbjct: 5 QNCTGCFLIASPYLHDGNFFRSVVLIIRHTHEGAFGVVINRAGPQRFGDVIEMSDPSWQA 64 Query: 35 -----MGIIVNKPLENLKI------EGILEKLKITPEPRDESIRLDKPVM-------LGG 76 M ++ E+ + E L+I P+ ++ PV+ +G Sbjct: 65 SSGPDMSSLLASQAESASLGDASDPNKTNESLQIHPDQIYLGGPVNGPVLALHNIAGIGD 124 Query: 77 PLAEDRG-----------FILHTPPSN--FASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 P D G LH P+ + SI+ +D TS + L + + Sbjct: 125 PCGVDIGEGAENDPAGSKTQLHDHPAEPWGSMSIQWADVPAWVTSDEDHLRLLARRDDAK 184 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 + +GY+ W QLE E+ + WL PAD + +F P + W + + G I+ Sbjct: 185 LRYVVGYSGWGPMQLESELEEGGWLITPADTDSIFG-PCEEVWEKLVRRCGQAIM 238 >UniRef50_Q9LS71 Emb|CAB72194.1 n=4 Tax=rosids RepID=Q9LS71_ARATH Length = 317 Score = 41.6 bits (96), Expect = 0.014, Method: Compositional matrix adjust. Identities = 47/183 (25%), Positives = 70/183 (38%), Gaps = 46/183 (25%) Query: 18 IFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRL-----DKPV 72 IF ++V+ + +G +G+I+N+P L E + + + DK + Sbjct: 148 IFEKTVILLLSVGPSGPIGVILNRP-----------SLMSIKETKSTILDMAGTFSDKRL 196 Query: 73 MLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTS--RDVLETLGTDKQPSDVLVAL-- 128 GGPL E G L +P S DN V + R V++ L + S L A Sbjct: 197 FFGGPLEE--GLFLVSPRSG-------GDNEVGKSGVFRQVMKGLYYGTRESVGLAAEMV 247 Query: 129 --------------GYASWEKGQLEQEILDNAWLTAPADLNIL-FKTPIADR--WREAAK 171 GY WEK QL+ EIL W A ++ + + W E Sbjct: 248 KRNLVGRSELRFFDGYCGWEKEQLKAEILGGYWTVAACSSTVVELGSAVQSHGLWDEVLG 307 Query: 172 LIG 174 LIG Sbjct: 308 LIG 310 >UniRef50_C5YY61 Putative uncharacterized protein Sb09g020680 n=4 Tax=Andropogoneae RepID=C5YY61_SORBI Length = 355 Score = 41.2 bits (95), Expect = 0.016, Method: Compositional matrix adjust. Identities = 45/189 (23%), Positives = 77/189 (40%), Gaps = 38/189 (20%) Query: 8 LIAMPALQD-PIFRRSVVYICEHNT----NGAMGIIVNKPLENLKIEGILEKLK-ITPEP 61 L+A AL D IF R+V++I + +G G+I+N+PL K+K + P Sbjct: 168 LVATEALDDDSIFERTVIFILRLGSRGTFDGPFGVILNRPL--------YTKIKHVNPTF 219 Query: 62 RDESIRL-DKPVMLGGPL------------AEDRGFILHTPPSNFASSIRISDNTVMTTS 108 +D++ D P+ GGP+ + +GF P + + V+ S Sbjct: 220 QDQATPFGDSPLFFGGPVDMSMFLVRTDDSSRLKGFEEVVPGICYGFRTDLEKAAVLMKS 279 Query: 109 RDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR--- 165 + + D+ +G+A+W+ QL EI W A ++ D Sbjct: 280 GAI--------RTQDLRFYVGHAAWDYEQLLGEIRAGYWAVASCSTELISDALTGDPSCL 331 Query: 166 WREAAKLIG 174 W E +L+G Sbjct: 332 WTEILQLMG 340 >UniRef50_O68558 Putative uncharacterized protein (Fragment) n=1 Tax=Mycobacterium bovis RepID=O68558_MYCBO Length = 82 Score = 41.2 bits (95), Expect = 0.016, Method: Compositional matrix adjust. Identities = 16/50 (32%), Positives = 28/50 (56%) Query: 14 LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L +P FRRSV+YI EHN G +G+++ N ++ + + + +D Sbjct: 2 LLEPTFRRSVIYIVEHNDGGTLGVVLQSAQRNRGLQRVAAVGQTRGQAKD 51 >UniRef50_B8C1S9 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C1S9_THAPS Length = 632 Score = 40.0 bits (92), Expect = 0.039, Method: Compositional matrix adjust. Identities = 39/169 (23%), Positives = 68/169 (40%), Gaps = 21/169 (12%) Query: 13 ALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKP- 71 L F ++V+ + H++ GII+N+P NL L+ E + I+ D Sbjct: 115 GLSQQYFHKAVLLVTYHSSEFTKGIILNRPT-NLH----LDDEDFIDESGEPFIKSDNAL 169 Query: 72 -------VMLGGPL----AEDRGFI-LHTPPSNFASSI--RISDNTVMTTSRDVLETLGT 117 + GG + ++D + LH+ SN ++ I N +T + + Sbjct: 170 EDMNSWRIWFGGDVNGMYSDDPEIVCLHSIDSNLGKNLSEEIIKNIFLTNYEGARKLIDA 229 Query: 118 DKQPS-DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR 165 ++ S D V GY W GQL E+ +W AD ++ + R Sbjct: 230 NEATSQDFWVFAGYCGWSAGQLLDELKHESWYMVSADSQTVWSELVRQR 278 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A7ZR71 UPF0301 protein yqgE n=133 Tax=Gammaproteobacter... 294 1e-78 UniRef50_Q87LK0 UPF0301 protein VP2612 n=11 Tax=Gammaproteobacte... 277 1e-73 UniRef50_A1JPT5 UPF0301 protein YE3428 n=22 Tax=Gammaproteobacte... 274 1e-72 UniRef50_Q7MHK0 UPF0301 protein VV2869 n=47 Tax=Gammaproteobacte... 271 1e-71 UniRef50_D0L0G7 Putative uncharacterized protein n=4 Tax=Gammapr... 269 3e-71 UniRef50_B6EMV3 UPF0301 protein VSAL_I0547 n=39 Tax=Gammaproteob... 262 4e-69 UniRef50_B8E9P8 UPF0301 protein Sbal223_1344 n=7 Tax=Shewanella ... 259 3e-68 UniRef50_A1RHM8 UPF0301 protein Sputw3181_1330 n=13 Tax=Proteoba... 256 3e-67 UniRef50_Q21EI7 UPF0301 protein Sde_3637 n=7 Tax=Proteobacteria ... 253 2e-66 UniRef50_A1U764 Putative uncharacterized protein n=3 Tax=Marinob... 251 1e-65 UniRef50_Q1BYL1 UPF0301 protein Bcen_0382 n=60 Tax=Betaproteobac... 249 4e-65 UniRef50_Q486M0 UPF0301 protein CPS_1252 n=1 Tax=Colwellia psych... 248 7e-65 UniRef50_Q5WYW5 UPF0301 protein lpl0620 n=6 Tax=Legionella RepID... 247 1e-64 UniRef50_B8FL92 UPF0301 protein Dalk_3037 n=1 Tax=Desulfatibacil... 246 2e-64 UniRef50_C9CSD6 Putative uncharacterized protein n=1 Tax=Silicib... 246 3e-64 UniRef50_Q3SFS4 UPF0301 protein Tbd_2579 n=2 Tax=Proteobacteria ... 245 6e-64 UniRef50_Q478W0 UPF0301 protein Daro_3893 n=3 Tax=Betaproteobact... 245 6e-64 UniRef50_Q605E8 UPF0301 protein MCA2336 2 n=3 Tax=Gammaproteobac... 243 2e-63 UniRef50_Q3SNY6 UPF0301 protein Nwi_2752 n=121 Tax=Alphaproteoba... 243 2e-63 UniRef50_A8ZZX8 Putative uncharacterized protein n=1 Tax=Desulfo... 241 6e-63 UniRef50_Q3IZ52 UPF0301 protein RHOS4_26140 n=6 Tax=Rhodobactera... 241 1e-62 UniRef50_A9DAK3 Putative uncharacterized protein n=1 Tax=Hoeflea... 240 2e-62 UniRef50_Q163D2 UPF0301 protein RD1_3419 n=14 Tax=Rhodobacterale... 239 4e-62 UniRef50_B9NUS5 Putative uncharacterized protein n=2 Tax=Rhodoba... 239 4e-62 UniRef50_Q0EWH5 Putative uncharacterized protein n=1 Tax=Maripro... 238 5e-62 UniRef50_B9MF60 UPF0301 protein Dtpsy_2896 n=28 Tax=Proteobacter... 238 6e-62 UniRef50_Q1YVF9 Putative uncharacterized protein n=1 Tax=gamma p... 238 7e-62 UniRef50_A6VSP6 UPF0301 protein Mmwyl1_0539 n=2 Tax=Marinomonas ... 238 1e-61 UniRef50_C6NYY5 Putative uncharacterized protein n=1 Tax=Acidith... 237 1e-61 UniRef50_C5SQD7 Putative uncharacterized protein n=1 Tax=Asticca... 236 2e-61 UniRef50_A1SUE8 Putative uncharacterized protein n=2 Tax=Psychro... 236 2e-61 UniRef50_Q1MZ13 Putative uncharacterized protein n=1 Tax=Bermane... 236 3e-61 UniRef50_A4SVF0 Putative uncharacterized protein n=1 Tax=Polynuc... 236 4e-61 UniRef50_C3K3J9 UPF0301 protein PFLU_5755 n=5 Tax=cellular organ... 235 5e-61 UniRef50_Q0AMH8 Putative uncharacterized protein n=3 Tax=Hyphomo... 235 7e-61 UniRef50_Q4ZZ67 UPF0301 protein Psyr_0485 n=31 Tax=Proteobacteri... 235 7e-61 UniRef50_B5JTR0 Putative uncharacterized protein n=1 Tax=gamma p... 234 1e-60 UniRef50_Q0C3C2 Putative uncharacterized protein n=1 Tax=Hyphomo... 234 1e-60 UniRef50_B8KLD5 Putative uncharacterized protein n=3 Tax=Proteob... 234 1e-60 UniRef50_A1TKL7 UPF0301 protein Aave_0907 n=10 Tax=Comamonadacea... 233 2e-60 UniRef50_Q2P5W3 UPF0301 protein XOO1309 n=20 Tax=Xanthomonadacea... 233 2e-60 UniRef50_B4REX9 Transcriptional regulator n=4 Tax=Caulobacterace... 230 1e-59 UniRef50_A8PPF4 Putative uncharacterized protein n=1 Tax=Rickett... 230 1e-59 UniRef50_A1KUG1 UPF0301 protein NMC1274 n=27 Tax=Neisseriaceae R... 229 3e-59 UniRef50_A4BI12 Putative uncharacterized protein n=1 Tax=Reineke... 229 4e-59 UniRef50_A6WWH2 Putative uncharacterized protein n=2 Tax=Ochroba... 228 9e-59 UniRef50_A9KDE7 UPF0301 protein CBUD_2193 n=7 Tax=Coxiella burne... 221 1e-56 UniRef50_Q31EK4 UPF0301 protein Tcr_1827 n=1 Tax=Thiomicrospira ... 219 4e-56 UniRef50_Q0I1B4 UPF0301 protein HS_0009 n=26 Tax=Pasteurellaceae... 218 7e-56 UniRef50_Q1NQW6 Putative uncharacterized protein n=2 Tax=Deltapr... 216 4e-55 UniRef50_A5WBR3 UPF0301 protein PsycPRwf_0144 n=21 Tax=Moraxella... 216 4e-55 UniRef50_C0QFZ0 UPF0301 protein HRM2_24640 n=1 Tax=Desulfobacter... 214 1e-54 UniRef50_A4A9E0 Protein containing DUF179 n=1 Tax=Congregibacter... 213 2e-54 UniRef50_Q2GAJ3 UPF0301 protein Saro_0683 n=4 Tax=Sphingomonadac... 212 4e-54 UniRef50_A3MYV4 UPF0301 protein APL_0232 n=7 Tax=Pasteurellaceae... 211 8e-54 UniRef50_Q5FQY8 UPF0301 protein GOX1459 n=11 Tax=Acetobacteracea... 211 1e-53 UniRef50_A0L5K4 UPF0301 protein Mmc1_0726 n=1 Tax=Magnetococcus ... 209 4e-53 UniRef50_Q6AL28 UPF0301 protein DP2218 n=1 Tax=Desulfotalea psyc... 198 6e-50 UniRef50_C8CIK8 Putative uncharacterized protein n=1 Tax=uncultu... 198 9e-50 UniRef50_Q60BQ2 UPF0301 protein MCA0413 1 n=1 Tax=Methylococcus ... 196 3e-49 UniRef50_B3QT15 Putative uncharacterized protein n=1 Tax=Chloroh... 196 3e-49 UniRef50_B0BVW3 UPF0301 protein RrIowa_0061 n=15 Tax=Rickettsia ... 192 6e-48 UniRef50_A7C130 Protein containing DUF179 n=1 Tax=Beggiatoa sp. ... 191 1e-47 UniRef50_Q2S591 UPF0301 protein SRU_0495 n=2 Tax=Rhodothermaceae... 190 2e-47 UniRef50_Q1DAS2 UPF0301 protein MXAN_2022 n=2 Tax=Cystobacterine... 189 3e-47 UniRef50_Q3B561 UPF0301 protein Plut_0637 n=11 Tax=Chlorobiaceae... 188 1e-46 UniRef50_A6G4Z9 Putative uncharacterized protein n=1 Tax=Plesioc... 186 3e-46 UniRef50_Q5NQN1 UPF0301 protein ZMO0349 n=3 Tax=Zymomonas mobili... 186 4e-46 UniRef50_D2QR79 Putative uncharacterized protein n=2 Tax=Flexiba... 186 4e-46 UniRef50_A6LBX4 UPF0301 protein BDI_1431 n=6 Tax=Bacteroidales R... 183 2e-45 UniRef50_D0LIU3 Putative uncharacterized protein n=1 Tax=Haliang... 183 3e-45 UniRef50_Q11U74 UPF0301 protein CHU_1773 n=2 Tax=Flexibacteracea... 181 8e-45 UniRef50_A6C880 Putative uncharacterized protein n=1 Tax=Plancto... 181 1e-44 UniRef50_Q254Z3 UPF0301 protein CF0373 n=7 Tax=Chlamydiales RepI... 179 6e-44 UniRef50_Q3KMF1 UPF0301 protein CTA_0231 n=9 Tax=Chlamydia RepID... 178 8e-44 UniRef50_Q0BLI0 UPF0301 protein FTH_1193 n=18 Tax=Francisella Re... 178 1e-43 UniRef50_A3HT39 Putative uncharacterized protein n=1 Tax=Algorip... 177 1e-43 UniRef50_A3VRH6 Putative uncharacterized protein n=1 Tax=Parvula... 177 2e-43 UniRef50_C1ZJG4 Predicted transcriptional regulator, COG1678 n=1... 175 6e-43 UniRef50_Q5LDK5 UPF0301 protein BF2109 n=20 Tax=Bacteroides RepI... 174 1e-42 UniRef50_B9XJW1 Putative uncharacterized protein n=1 Tax=bacteri... 174 1e-42 UniRef50_A3ZQK2 Putative uncharacterized protein n=1 Tax=Blastop... 174 1e-42 UniRef50_A7H7H6 UPF0301 protein Anae109_0457 n=4 Tax=Anaeromyxob... 173 2e-42 UniRef50_A9GTQ2 Putative uncharacterized protein n=1 Tax=Sorangi... 172 4e-42 UniRef50_UPI0001C3133A protein of unknown function DUF179 n=1 Ta... 171 1e-41 UniRef50_C3Q021 UPF0301 protein n=8 Tax=Bacteroides RepID=C3Q021... 171 1e-41 UniRef50_C2G2S7 Transcriptional regulator n=3 Tax=Sphingobacteri... 170 2e-41 UniRef50_C7PSM3 Putative uncharacterized protein n=1 Tax=Chitino... 169 3e-41 UniRef50_A5FNN9 Putative uncharacterized protein n=18 Tax=Bacter... 168 9e-41 UniRef50_C1D0N0 Putative uncharacterized protein n=3 Tax=Deinoco... 165 6e-40 UniRef50_D2R140 Putative uncharacterized protein n=1 Tax=Pirellu... 164 2e-39 UniRef50_B0SHS8 Transcriptional regulator n=6 Tax=Leptospira Rep... 162 7e-39 UniRef50_Q2BRE1 Putative uncharacterized protein n=1 Tax=Neptuni... 161 1e-38 UniRef50_A6E847 Putative uncharacterized protein n=1 Tax=Pedobac... 160 3e-38 UniRef50_A4C260 Putative transcriptional regulator n=1 Tax=Polar... 158 9e-38 UniRef50_UPI0001745679 hypothetical protein VspiD_25265 n=1 Tax=... 157 1e-37 UniRef50_Q1Q3L0 Putative uncharacterized protein n=1 Tax=Candida... 156 5e-37 UniRef50_C6X421 Putative transcriptional regulator n=1 Tax=Flavo... 154 1e-36 UniRef50_Q47MA0 UPF0301 protein Tfu_2389 n=3 Tax=Actinomycetales... 154 1e-36 UniRef50_C7MS43 Predicted transcriptional regulator n=3 Tax=Acti... 154 2e-36 UniRef50_Q82D55 UPF0301 protein SAV_5129 n=12 Tax=Actinomycetale... 152 4e-36 UniRef50_Q0FVR8 Putative uncharacterized protein (Fragment) n=1 ... 151 1e-35 UniRef50_B4CVG9 Putative uncharacterized protein n=1 Tax=Chthoni... 149 4e-35 UniRef50_C1RPZ7 Predicted transcriptional regulator, COG1678 n=1... 149 6e-35 UniRef50_A9RVW3 Predicted protein n=1 Tax=Physcomitrella patens ... 147 1e-34 UniRef50_B1ZWF6 Putative uncharacterized protein n=2 Tax=Opituta... 146 4e-34 UniRef50_C0YNI4 Transcriptional regulator n=1 Tax=Chryseobacteri... 145 9e-34 UniRef50_B1MML1 UPF0301 protein MAB_4928c n=20 Tax=Corynebacteri... 144 1e-33 UniRef50_C4DLM5 Predicted transcriptional regulator, COG1678 n=5... 142 6e-33 UniRef50_B7G677 Predicted protein n=2 Tax=Bacillariophyta RepID=... 138 9e-32 UniRef50_C1B7P4 UPF0301 protein ROP_34500 n=13 Tax=Corynebacteri... 137 1e-31 UniRef50_A1SG68 Putative uncharacterized protein n=2 Tax=Actinom... 136 3e-31 UniRef50_B5JR88 Putative uncharacterized protein n=1 Tax=Verruco... 135 9e-31 UniRef50_C8XE74 Putative uncharacterized protein n=2 Tax=Actinom... 129 3e-29 UniRef50_C0AXX7 Putative uncharacterized protein n=1 Tax=Proteus... 129 5e-29 UniRef50_Q9LQ30 F14M2.10 protein n=6 Tax=rosids RepID=Q9LQ30_ARATH 129 5e-29 UniRef50_Q8NL65 UPF0301 protein Cgl3084/cg3414 n=5 Tax=Corynebac... 127 2e-28 UniRef50_Q8FSW7 UPF0301 protein CE2927 n=11 Tax=Corynebacterium ... 127 2e-28 UniRef50_C7Q9Z6 Putative uncharacterized protein n=5 Tax=Actinom... 127 2e-28 UniRef50_D0A6S5 Putative uncharacterized protein n=2 Tax=Trypano... 126 3e-28 UniRef50_A8IRU3 Predicted protein n=1 Tax=Chlamydomonas reinhard... 126 3e-28 UniRef50_Q4CSL3 Putative uncharacterized protein n=2 Tax=Trypano... 125 7e-28 UniRef50_B2UMM6 Putative uncharacterized protein n=1 Tax=Akkerma... 125 8e-28 UniRef50_A8IT21 Predicted protein n=1 Tax=Chlamydomonas reinhard... 124 2e-27 UniRef50_Q6A827 Conserved protein, DUF179 n=2 Tax=Propionibacter... 123 2e-27 UniRef50_Q4QG99 Putative uncharacterized protein n=3 Tax=Leishma... 120 2e-26 UniRef50_C7PBJ3 Putative uncharacterized protein n=1 Tax=Chitino... 112 6e-24 UniRef50_C1E1K3 Predicted protein n=2 Tax=cellular organisms Rep... 107 1e-22 UniRef50_C5BU30 Putative uncharacterized protein n=1 Tax=Teredin... 105 9e-22 UniRef50_C0AXX9 Putative uncharacterized protein n=1 Tax=Proteus... 72 7e-12 Sequences not found previously or not previously below threshold: UniRef50_A4S673 Predicted protein (Fragment) n=2 Tax=Ostreococcu... 129 7e-29 UniRef50_C1E5G6 Predicted protein n=2 Tax=Micromonas RepID=C1E5G... 128 8e-29 UniRef50_Q7URG7 Probable transcriptional regulator n=1 Tax=Rhodo... 125 9e-28 UniRef50_B9SPX9 Electron transporter, putative n=2 Tax=fabids Re... 122 6e-27 UniRef50_Q8S0Q9 Os01g0886000 protein n=6 Tax=Poaceae RepID=Q8S0Q... 107 2e-22 UniRef50_C5YY61 Putative uncharacterized protein Sb09g020680 n=4... 106 3e-22 UniRef50_D0NN30 Putative uncharacterized protein n=1 Tax=Phytoph... 104 1e-21 UniRef50_B8C551 Predicted protein n=1 Tax=Thalassiosira pseudona... 99 9e-20 UniRef50_Q9LS71 Emb|CAB72194.1 n=4 Tax=rosids RepID=Q9LS71_ARATH 97 3e-19 UniRef50_B8C1S9 Predicted protein n=1 Tax=Thalassiosira pseudona... 92 1e-17 UniRef50_Q54HU6 Putative uncharacterized protein n=1 Tax=Dictyos... 91 2e-17 UniRef50_Q7G645 Os10g0330400 protein n=3 Tax=Oryza sativa RepID=... 88 1e-16 UniRef50_C5L030 Membrane associated RING finger, putative n=1 Ta... 88 1e-16 UniRef50_C5WYB1 Putative uncharacterized protein Sb01g018920 n=1... 86 7e-16 UniRef50_Q7UKQ8 Putative uncharacterized protein n=1 Tax=Rhodopi... 80 4e-14 UniRef50_D1HM83 Whole genome shotgun sequence of line PN40024, s... 78 1e-13 UniRef50_A9S274 Predicted protein n=1 Tax=Physcomitrella patens ... 77 2e-13 UniRef50_A4RT64 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 76 5e-13 UniRef50_Q0IWV7 Os10g0485100 protein (Fragment) n=5 Tax=Poaceae ... 76 6e-13 UniRef50_Q9LT30 Genomic DNA, chromosome 3, P1 clone: MPN9 n=2 Ta... 74 3e-12 UniRef50_D1I242 Whole genome shotgun sequence of line PN40024, s... 73 5e-12 UniRef50_B6SSQ6 Uncharacterized ACR, COG1678 family protein n=3 ... 73 5e-12 UniRef50_B9HGN1 Predicted protein n=1 Tax=Populus trichocarpa Re... 70 3e-11 UniRef50_A9T9H0 Predicted protein n=1 Tax=Physcomitrella patens ... 70 4e-11 UniRef50_B6KQL2 Putative uncharacterized protein n=4 Tax=Toxopla... 61 1e-08 UniRef50_B8BQD7 Predicted protein n=1 Tax=Thalassiosira pseudona... 59 1e-07 UniRef50_D2VPR5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 57 2e-07 UniRef50_B7G772 Predicted protein n=1 Tax=Phaeodactylum tricornu... 51 2e-05 UniRef50_B8CF73 Predicted protein n=2 Tax=Thalassiosira pseudona... 51 2e-05 UniRef50_O68558 Putative uncharacterized protein (Fragment) n=1 ... 50 3e-05 UniRef50_B8CDL7 Predicted protein n=1 Tax=Thalassiosira pseudona... 50 5e-05 UniRef50_B7G8R3 Predicted protein n=1 Tax=Phaeodactylum tricornu... 49 1e-04 UniRef50_B8BZN2 Predicted protein n=1 Tax=Thalassiosira pseudona... 49 1e-04 UniRef50_A5AUI4 Putative uncharacterized protein n=1 Tax=Vitis v... 46 6e-04 UniRef50_C5KHP8 Putative uncharacterized protein n=1 Tax=Perkins... 45 0.001 UniRef50_C1FFU8 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 43 0.006 UniRef50_B7GE41 Predicted protein n=1 Tax=Phaeodactylum tricornu... 42 0.009 UniRef50_UPI0001BCD1A1 hypothetical protein AmarD1_07934 n=1 Tax... 41 0.015 UniRef50_A5K110 Putative uncharacterized protein n=1 Tax=Plasmod... 41 0.025 UniRef50_B3LAF2 Putative uncharacterized protein n=1 Tax=Plasmod... 40 0.026 UniRef50_C1E0N3 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 39 0.066 >UniRef50_A7ZR71 UPF0301 protein yqgE n=133 Tax=Gammaproteobacteria RepID=YQGE_ECO24 Length = 187 Score = 294 bits (753), Expect = 1e-78, Method: Composition-based stats. Identities = 187/187 (100%), Positives = 187/187 (100%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE Sbjct: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ Sbjct: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM Sbjct: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 Query: 181 PGVAGHA 187 PGVAGHA Sbjct: 181 PGVAGHA 187 >UniRef50_Q87LK0 UPF0301 protein VP2612 n=11 Tax=Gammaproteobacteria RepID=Y2612_VIBPA Length = 187 Score = 277 bits (709), Expect = 1e-73, Method: Composition-based stats. Identities = 92/188 (48%), Positives = 136/188 (72%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL +HFL+AMP ++DP F+ SV+Y+CEHN GAMG+++N P++ + + +L+++ + P Sbjct: 1 MNLTNHFLVAMPGMKDPYFQNSVIYVCEHNEEGAMGLMINAPVD-ITVGNMLKQVDVQPV 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 PR LD+PV GGP++EDRGFILH P + SSI+++D+ +TTSRD+L LGT+ Sbjct: 60 HPRLFEASLDRPVYNGGPISEDRGFILHKPKDYYESSIQMTDDLAVTTSRDILSVLGTEA 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +PSD LVALGY+ W GQLE E+++N+WLT A I+F TPI +RW++A + +G+D Sbjct: 120 EPSDYLVALGYSGWSAGQLENELVENSWLTIEATPEIIFDTPITERWKKAVEKLGIDPSQ 179 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 180 LSADAGHA 187 >UniRef50_A1JPT5 UPF0301 protein YE3428 n=22 Tax=Gammaproteobacteria RepID=Y3428_YERE8 Length = 187 Score = 274 bits (701), Expect = 1e-72, Method: Composition-based stats. Identities = 125/187 (66%), Positives = 154/187 (82%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQHHFLIAMP+LQDP F RSV+YICEHN GAMG+++NKP+E +E +L+KLKI+P Sbjct: 1 MNLQHHFLIAMPSLQDPHFMRSVIYICEHNKEGAMGLVINKPMEQFTVETVLKKLKISPT 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PRD SIRLDK V+ GGPLAEDRGFILH+P F SSI IS +T++TTS+DVLETLGT +Q Sbjct: 61 PRDPSIRLDKAVLAGGPLAEDRGFILHSPQEGFGSSIPISPDTMITTSKDVLETLGTPEQ 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++LVALGYA W++GQLEQE+LDNAWLT AD +ILF TPIA+RW+ AA +G++I + Sbjct: 121 PKNLLVALGYAGWQQGQLEQELLDNAWLTIEADTHILFNTPIAERWQAAANKLGINIFNI 180 Query: 181 PGVAGHA 187 AGHA Sbjct: 181 APQAGHA 187 >UniRef50_Q7MHK0 UPF0301 protein VV2869 n=47 Tax=Gammaproteobacteria RepID=Y2869_VIBVY Length = 187 Score = 271 bits (693), Expect = 1e-71, Method: Composition-based stats. Identities = 89/188 (47%), Positives = 135/188 (71%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL +HFL+AMP ++DP F+ SV+YICEHN GAMG+++N P++ + + +LE++ + P Sbjct: 1 MNLTNHFLVAMPGMKDPYFQHSVIYICEHNEEGAMGLMINAPID-ITVGKMLEQVDVQPV 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P+ + L KPV GGP+AEDRGFILH P + SS+++++ +TTS+D+L LGT+ Sbjct: 60 HPQLNTSSLTKPVYNGGPVAEDRGFILHRPKDFYESSLQMTEQISVTTSKDILTVLGTEA 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +PS +VALGY+ W GQLE E+ +N+WLT A+ +I+F TPIA RW++A +++G+ Sbjct: 120 EPSSYIVALGYSGWSAGQLEAELAENSWLTVEANPDIIFDTPIAMRWQKAVQMLGIHASQ 179 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 180 LSDQAGHA 187 >UniRef50_D0L0G7 Putative uncharacterized protein n=4 Tax=Gammaproteobacteria RepID=D0L0G7_HALNC Length = 216 Score = 269 bits (689), Expect = 3e-71, Method: Composition-based stats. Identities = 81/187 (43%), Positives = 123/187 (65%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L++ LIAMP+L DP F +V Y+CEHN +GAMGI +N+PL+ + + I + +KI+ Sbjct: 34 IQLKNQILIAMPSLDDPNFNHTVTYVCEHNEDGAMGITINRPLD-VTLGDIFDHMKISCS 92 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 +PV +GGP+A +RGF+LHTP + S++ I+D +TTS+D+L+ L Sbjct: 93 ---NPSIRGRPVFMGGPVALERGFVLHTPHGGWESTLEITDEIGLTTSKDILQALAEGAG 149 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P+ ++ALGY+ W +GQLEQE+ DN WLT A ++F P+ +RW AA+ +GVD+ + Sbjct: 150 PARAVIALGYSGWSEGQLEQELADNTWLTVAATTELIFDYPVEERWAAAARSLGVDMNLL 209 Query: 181 PGVAGHA 187 G AGHA Sbjct: 210 SGEAGHA 216 >UniRef50_B6EMV3 UPF0301 protein VSAL_I0547 n=39 Tax=Gammaproteobacteria RepID=Y547_ALISL Length = 187 Score = 262 bits (670), Expect = 4e-69, Method: Composition-based stats. Identities = 88/188 (46%), Positives = 140/188 (74%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-P 59 M+L++HFL+AMP++ DP+F RSV+YICEH+++G MG+ +N+P++ + ++G+L+++K+ P Sbjct: 1 MDLKNHFLVAMPSMNDPVFTRSVIYICEHDSDGTMGLRINQPVQ-ISLKGMLDQIKLDNP 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P L +PV+ GGP+++DRGF+LH P N++SSI +++ +TTS+D+L TLGT+ Sbjct: 60 SPIIFPQTLSQPVLNGGPVSDDRGFVLHYPKDNYSSSIEVTEELSVTTSKDILATLGTED 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 QP LVALGY+ W+ GQLEQE+ +N WL AD +++F TPI DRWR A +++G+ + Sbjct: 120 QPYKYLVALGYSGWDAGQLEQELSENTWLILEADSSVIFDTPIPDRWRRAIEILGISPVN 179 Query: 180 MPGVAGHA 187 + GHA Sbjct: 180 ISSEVGHA 187 >UniRef50_B8E9P8 UPF0301 protein Sbal223_1344 n=7 Tax=Shewanella RepID=Y1344_SHEB2 Length = 187 Score = 259 bits (663), Expect = 3e-68, Method: Composition-based stats. Identities = 87/186 (46%), Positives = 126/186 (67%), Gaps = 1/186 (0%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ+HFLIAMP+L D F RSV+YICEH+ GAMG+++NKPL +++ +LE++ + E Sbjct: 3 SLQNHFLIAMPSLHDTFFERSVIYICEHDAKGAMGLVINKPL-GIEVNSLLEQMDLPAEQ 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + + VM+GGP+++DRGF+LHT +A+S + ++TTSRDVL +G+++ P Sbjct: 62 VSTDLAFNANVMMGGPVSQDRGFVLHTSQPYWANSTDLGCGLMLTTSRDVLTAIGSNRSP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGYA W K QLEQE+ DN+WLT PA +LF DRW +A++ +G D + Sbjct: 122 EKFLVALGYAGWSKDQLEQELADNSWLTIPATNALLFDIKHEDRWPQASRALGFDAWQVS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 AQAGHA 187 >UniRef50_A1RHM8 UPF0301 protein Sputw3181_1330 n=13 Tax=Proteobacteria RepID=Y1330_SHESW Length = 187 Score = 256 bits (654), Expect = 3e-67, Method: Composition-based stats. Identities = 86/186 (46%), Positives = 124/186 (66%), Gaps = 1/186 (0%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ+HFLIAMP+L D F RSV+Y+CEH+ GAMGI++NKPL +++ +LE++ + E Sbjct: 3 SLQNHFLIAMPSLDDTFFERSVIYLCEHDDKGAMGIVINKPL-GIEVSSLLEQMDLPAEQ 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 I + V++GGP+++DRGF+LHT +A+S + ++TTSRDVL +G + P Sbjct: 62 VFADIAQNAQVLMGGPVSQDRGFVLHTSQPYWANSTDLGSGLMLTTSRDVLTAIGGKRSP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGYA W K QLEQE+ +N+WLT PA +LF DRW +A++ +G D + Sbjct: 122 DKFLVALGYAGWGKHQLEQELAENSWLTIPATNALLFDVKHEDRWPQASRSLGFDAWQVS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 AQAGHA 187 >UniRef50_Q21EI7 UPF0301 protein Sde_3637 n=7 Tax=Proteobacteria RepID=Y3637_SACD2 Length = 203 Score = 253 bits (647), Expect = 2e-66, Method: Composition-based stats. Identities = 86/187 (45%), Positives = 125/187 (66%), Gaps = 5/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 ++L+ HFLIAMP LQDPIF RS+ YIC+H GAMGI+VN+P+ NL + I E+L++ Sbjct: 22 VSLRDHFLIAMPGLQDPIFSRSLTYICDHTAQGAMGIVVNQPM-NLTLGDIFEQLEL--- 77 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 +D++ + + V+ GGP+ +RGF+LH + S++ I+ + +T SRD++ + + Sbjct: 78 -QDKAQQAGRAVLAGGPVNTERGFVLHRDSGAWESTMHIAPDVNLTASRDIVHAIANNTG 136 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P L ALGYA W GQLE+EI N+WLT PAD +I+F P+ DRW AA+ +G+DI M Sbjct: 137 PKSSLFALGYAGWSAGQLEEEISANSWLTIPADSSIIFDIPVEDRWAAAARQLGIDIHLM 196 Query: 181 PGVAGHA 187 AGHA Sbjct: 197 SATAGHA 203 >UniRef50_A1U764 Putative uncharacterized protein n=3 Tax=Marinobacter RepID=A1U764_MARAV Length = 188 Score = 251 bits (641), Expect = 1e-65, Method: Composition-based stats. Identities = 82/186 (44%), Positives = 126/186 (67%), Gaps = 7/186 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+HHFL+A P L DP F V+Y+CEH+ GA+G+++N+PL+ + + ILE+L + Sbjct: 10 SLRHHFLVASPWLADPRFHGGVIYLCEHSEEGALGLMINQPLD-IHLGEILEQLDM---- 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 LD PV GGP+ +RGF+LH+P + ++ R++D ++TTSRD+LE++G D+ P Sbjct: 65 --HGGELDLPVYTGGPVQPERGFVLHSPGRQWQNTARVTDEVLLTTSRDILESIGRDEGP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGY+ W +GQLE+E+ NAWLT PA +ILF+TP R++ +L+G+D+ + Sbjct: 123 ESFLVALGYSGWGEGQLEEELGSNAWLTCPASTDILFRTPADQRYQAVLRLMGIDLNQLS 182 Query: 182 GVAGHA 187 GHA Sbjct: 183 DSVGHA 188 >UniRef50_Q1BYL1 UPF0301 protein Bcen_0382 n=60 Tax=Betaproteobacteria RepID=Y382_BURCA Length = 192 Score = 249 bits (636), Expect = 4e-65, Method: Composition-based stats. Identities = 78/189 (41%), Positives = 115/189 (60%), Gaps = 6/189 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL + FLIAMP + DP F +VVY+C+H+ GA+G+++N+P + + +E + ++ + Sbjct: 8 INLTNQFLIAMPNMADPTFSGTVVYLCDHSERGALGLVINRPTD-IDLESLFNRIDLK-- 64 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP--PSNFASSIRISDNTVMTTSRDVLETLGTD 118 D L PV GGP+ +RGF+LH P +++ SS+ + MTTS+DVLE + T Sbjct: 65 -LDIEPLLHIPVYFGGPVQTERGFVLHEPVEGASYNSSMSVDGGLEMTTSKDVLEAVATG 123 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P L+ LG+A W GQLE+EI N WLT AD I+F TP +R+ A L+GV Sbjct: 124 TGPKRFLLTLGHAGWGAGQLEEEIARNGWLTVAADPRIVFDTPAEERFEAALGLLGVSSS 183 Query: 179 TMPGVAGHA 187 + G AGHA Sbjct: 184 MLSGEAGHA 192 >UniRef50_Q486M0 UPF0301 protein CPS_1252 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y1252_COLP3 Length = 210 Score = 248 bits (634), Expect = 7e-65, Method: Composition-based stats. Identities = 84/209 (40%), Positives = 127/209 (60%), Gaps = 24/209 (11%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGIL--------- 52 +L++ LIAMP+L DP F ++V YICEHN +GAMG+I+N P+ N+ + +L Sbjct: 3 SLENQLLIAMPSLGDPYFNKTVTYICEHNEDGAMGLIINLPV-NITLADLLKQIEPDEGD 61 Query: 53 --------------EKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIR 98 + + D + L++ V+ GGP+A+ RGF+LH+ ++SS+ Sbjct: 62 KTGNVNSNSELTKSDDVNDITLVTDITNSLEQLVLAGGPIAQQRGFVLHSSQPGWSSSLV 121 Query: 99 ISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILF 158 +S ++TTS+D+L LGT + P +V LGYA W GQLEQE+ N+WLT PAD+ ILF Sbjct: 122 LSKELMITTSKDILMALGTQQAPEQFIVTLGYAGWGPGQLEQELQANSWLTTPADIEILF 181 Query: 159 KTPIADRWREAAKLIGVDILTMPGVAGHA 187 KTPI RW++A + +G+D+ + GHA Sbjct: 182 KTPIEQRWKKATEKLGIDLAHLSTDIGHA 210 >UniRef50_Q5WYW5 UPF0301 protein lpl0620 n=6 Tax=Legionella RepID=Y620_LEGPL Length = 187 Score = 247 bits (632), Expect = 1e-64, Method: Composition-based stats. Identities = 74/186 (39%), Positives = 116/186 (62%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L + LIAMP+L+DP F RSVVY+CEHN G++G+I+N+PL+ + + E+L+I P Sbjct: 6 SLANQLLIAMPSLKDPNFERSVVYLCEHNEQGSVGLIINRPLQ-FPLSIVFEQLQIEP-- 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + P++ GGP+ +RGF++H + SS+ + D +TTS D++ + D+ P Sbjct: 63 -IRVEKNGLPLLFGGPVQPERGFVIHKQMGGWRSSLFLQDEVTVTTSNDIIRAIAYDEGP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 DVL+ LGYA+W + QLE+EI+ N WL P IL++ P +RW A +G+ + + Sbjct: 122 KDVLITLGYAAWTEQQLEREIMSNTWLVCPYKSEILYEVPFEERWEYAGLTLGIKMNQLS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 SDAGHA 187 >UniRef50_B8FL92 UPF0301 protein Dalk_3037 n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=Y3037_DESAA Length = 189 Score = 246 bits (630), Expect = 2e-64, Method: Composition-based stats. Identities = 73/186 (39%), Positives = 118/186 (63%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L FLIAMPAL DP F SV YIC HN +GA G+++N+ +++ + + +++ + Sbjct: 8 SLAGQFLIAMPALNDPNFALSVTYICVHNQDGAFGLVINQSFDSVTGKTLFDQMDMPAVK 67 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E ++ V +GGP+ + F+LH P + +S++ISD M+ S D+L+ + + P Sbjct: 68 AAE----NQTVHIGGPVHQGYVFVLHGRPMEWKASLQISDTVAMSNSTDILQAIASGVGP 123 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++ LG A W GQLE E+ +N+WLT P + ++LF+TP+ +RW +AA+ IGVD+ + Sbjct: 124 DPCMIFLGCAGWAPGQLEAELAENSWLTCPGNDDLLFRTPLEERWEKAAQSIGVDLNLLS 183 Query: 182 GVAGHA 187 GVAGHA Sbjct: 184 GVAGHA 189 >UniRef50_C9CSD6 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CSD6_9RHOB Length = 219 Score = 246 bits (629), Expect = 3e-64, Method: Composition-based stats. Identities = 73/188 (38%), Positives = 101/188 (53%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M L LIAMP + DP F SVV++C H GAMG+I+NK + ++ ++++L+I E Sbjct: 36 MELTGKLLIAMPGIGDPRFDNSVVFLCSHGDEGAMGLIINKLAPGVALQTLMDQLEIDIE 95 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDK 119 P S PV GGP+ RGF+LH+ +S+ + MT + DVLE + + Sbjct: 96 PAIASA----PVYFGGPVETQRGFVLHSDEYISTVNSLPVKPGFSMTATLDVLEDIAEGR 151 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P LV LGYA W GQLE EI N WLT A+ ++F +W A +GV L Sbjct: 152 GPERYLVMLGYAGWGPGQLEDEIAQNGWLTTDAEPEMIFTDTADTKWEAALASLGVTPLN 211 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 212 LSMDAGHA 219 >UniRef50_Q3SFS4 UPF0301 protein Tbd_2579 n=2 Tax=Proteobacteria RepID=Y2579_THIDA Length = 185 Score = 245 bits (626), Expect = 6e-64, Method: Composition-based stats. Identities = 75/187 (40%), Positives = 116/187 (62%), Gaps = 5/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL +HFLIAMP + DP F ++ YIC+H+ GA+G++VN+P++ L + + E++ ++ Sbjct: 4 VNLTNHFLIAMPGMVDPNFNGTLTYICDHSDQGALGVVVNRPID-LDLSTLFEQIGLSLP 62 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 V GGP+ +RGF+LHTPP F+S++ ++D +TTS+DVLE + Sbjct: 63 EGLHGEI----VYFGGPVQTERGFVLHTPPLTFSSTLTVNDAVSLTTSKDVLEAVSQGAG 118 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P +V+LGYA W GQLE E+ NAWL+ AD ++F +R A KL+G+D ++ Sbjct: 119 PEKFIVSLGYAGWSAGQLEDELKQNAWLSVAADPQVIFDLAPEERLPAAMKLLGIDFASL 178 Query: 181 PGVAGHA 187 AGHA Sbjct: 179 SDEAGHA 185 >UniRef50_Q478W0 UPF0301 protein Daro_3893 n=3 Tax=Betaproteobacteria RepID=Y3893_DECAR Length = 186 Score = 245 bits (626), Expect = 6e-64, Method: Composition-based stats. Identities = 86/187 (45%), Positives = 126/187 (67%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL +FLIAMP L+DP F ++VYICEHN NGA+GIIVN+P++ + + +LEK+ I E Sbjct: 4 VNLTDNFLIAMPTLEDPYFSNALVYICEHNENGALGIIVNRPID-MNLASLLEKIDIKLE 62 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + D PV GGP+ DRGF+LH P + S++ I+ + +T+SRDVL ++G+ Sbjct: 63 AENLA---DMPVYFGGPVQLDRGFVLHRPIGQWQSTLAINSDVGLTSSRDVLSSVGSAGL 119 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P+++LV LGYA W+ GQLE+E+ N+WLT PA +ILF P +R A + +G+ + Sbjct: 120 PAEILVTLGYAGWDAGQLEEELAQNSWLTVPAKASILFDLPPEERLPAAMQKLGISFTQL 179 Query: 181 PGVAGHA 187 VAGHA Sbjct: 180 SDVAGHA 186 >UniRef50_Q605E8 UPF0301 protein MCA2336 2 n=3 Tax=Gammaproteobacteria RepID=Y2336_METCA Length = 188 Score = 243 bits (622), Expect = 2e-63, Method: Composition-based stats. Identities = 87/185 (47%), Positives = 128/185 (69%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L +HFLIAMP L DP F ++V +C+HN +GA+GII+N+P E LK+ I+ +++I + Sbjct: 8 LANHFLIAMPGLTDPHFAKTVTLVCQHNADGALGIIINRPSE-LKLSDIMRQMEIDLKVA 66 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + D PV GGP+ +RGFILH P + +AS++ +S+ +TTSRD+LE +G + P Sbjct: 67 ELG---DLPVFFGGPVHPERGFILHEPATVWASTLVVSERLALTTSRDILEAVGRGEGPR 123 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +L+ALGYA W +GQLE+EI+DN+WL AP+D ++F+ P RW+ AA L+GVDI + Sbjct: 124 RMLLALGYAGWGQGQLEREIIDNSWLNAPSDNAVIFEHPPGRRWKAAADLVGVDISLLTS 183 Query: 183 VAGHA 187 AGH Sbjct: 184 QAGHG 188 >UniRef50_Q3SNY6 UPF0301 protein Nwi_2752 n=121 Tax=Alphaproteobacteria RepID=Y2752_NITWN Length = 221 Score = 243 bits (621), Expect = 2e-63, Method: Composition-based stats. Identities = 71/189 (37%), Positives = 103/189 (54%), Gaps = 4/189 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIAMP ++D F RSV+Y+C H++ GAMGII+N+P ++ +L +L I Sbjct: 33 LDGQLLIAMPVMEDERFARSVIYVCAHSSEGAMGIILNRPAGSVDFSDLLVQLDIIKRAD 92 Query: 63 DESI---RLDKPVMLGGPLAEDRGFILHTPPSNFAS-SIRISDNTVMTTSRDVLETLGTD 118 + VM GGP+ RGF+LH+ ++ I + +T + D+LE + Sbjct: 93 LIKLPETAETMKVMKGGPVETGRGFVLHSSDFFIEDATLPIDEGICLTATLDILEAIAKG 152 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P ++ALGYA W GQLE EI DN WL PAD +++F I D++ A IG+D Sbjct: 153 AGPKHAILALGYAGWAPGQLETEIQDNGWLHCPADQDLIFGRDIEDKYVRALHKIGIDPG 212 Query: 179 TMPGVAGHA 187 + AGHA Sbjct: 213 MLSNEAGHA 221 >UniRef50_A8ZZX8 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZX8_DESOH Length = 184 Score = 241 bits (617), Expect = 6e-63, Method: Composition-based stats. Identities = 76/188 (40%), Positives = 111/188 (59%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M L+ FLIAMP L DP FR++VV ICEH+ +GA+G+IVN+ L + I E+LK+ Sbjct: 1 MELRGEFLIAMPMLTDPNFRQTVVCICEHSADGALGLIVNRIYPALTAKDIFEELKMKYV 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 P + PV GGP+ F+LH PP + I + +T ++D+L + + Sbjct: 61 PETGPL----PVYNGGPVHTGDLFVLHEPPFGWEGCRPIRPDLALTNTKDLLAAIAEGQG 116 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV-DILT 179 P L+ LGYA W QLE E+L+N+WLT P D ++F TP+A RW +A KL+ + D Sbjct: 117 PRRFLILLGYAGWGPDQLEAEVLENSWLTVPVDQRVIFDTPVARRWADAMKLMNIPDPAF 176 Query: 180 MPGVAGHA 187 + G++G A Sbjct: 177 LSGISGSA 184 >UniRef50_Q3IZ52 UPF0301 protein RHOS4_26140 n=6 Tax=Rhodobacteraceae RepID=Y2614_RHOS4 Length = 184 Score = 241 bits (615), Expect = 1e-62, Method: Composition-based stats. Identities = 81/188 (43%), Positives = 114/188 (60%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M+L LIAMP++ DP F RS+V IC H+ +GAMG++VNKP+E+L G+LE+L I Sbjct: 1 MDLSGSLLIAMPSMADPRFERSLVLICAHSPDGAMGLVVNKPVEDLSFAGMLEQLNIPRA 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDK 119 P IR V LGGP+ RGF+LH+P + +++ +S MT + D+LE L + Sbjct: 61 PNGRDIR----VHLGGPMERGRGFVLHSPDYMSVGATMLVSGKFGMTATVDILEALARGQ 116 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 PS L+ALGY+ W GQLE E+ N WLTA A ++F +W + +G+D LT Sbjct: 117 GPSSALMALGYSGWGPGQLEAEVQRNDWLTAEAPSELVFSDDDPGKWTGMLRHMGIDPLT 176 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 177 LSSTAGHA 184 >UniRef50_A9DAK3 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DAK3_9RHIZ Length = 209 Score = 240 bits (614), Expect = 2e-62, Method: Composition-based stats. Identities = 65/195 (33%), Positives = 109/195 (55%), Gaps = 11/195 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L HFL+AMP++ D F R+V+++C H+ +GAMG I+N+P + L E ++E L + + R Sbjct: 16 LDGHFLLAMPSMSDERFERAVIFVCAHSEDGAMGFILNQP-QPLSFEELVENLDLDSQER 74 Query: 63 DESIRL----------DKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVL 112 ++ + + P+ GGP+ RGF+LH+ S++ ++D+ +T + D+L Sbjct: 75 RDADKSRKIGMSECARNFPIQFGGPVDPGRGFVLHSDDYMTESTMPVNDDLCLTATIDIL 134 Query: 113 ETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + P ++ LGYA W GQLEQE+ NAWL+ PA +I+F + ++ Sbjct: 135 RAIKDGCGPVRGMMLLGYAGWGPGQLEQEMAANAWLSCPASDDIVFDRDHSAKYDRVLSH 194 Query: 173 IGVDILTMPGVAGHA 187 +GV + AGHA Sbjct: 195 MGVSPAMLSMEAGHA 209 >UniRef50_Q163D2 UPF0301 protein RD1_3419 n=14 Tax=Rhodobacterales RepID=Y3419_ROSDO Length = 184 Score = 239 bits (610), Expect = 4e-62, Method: Composition-based stats. Identities = 73/188 (38%), Positives = 108/188 (57%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL+ L+AMP++ DP F+ +V+ IC H+ GAMG+I+NKP ++I +L++L I Sbjct: 1 MNLEGKLLVAMPSMGDPRFQNAVILICAHSAKGAMGLIINKPTPEIRISDVLDQLDILSS 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTV-MTTSRDVLETLGTDK 119 + + V GGP+ RGF+LH+ + + I D MT + D+LE + + Sbjct: 61 QKTR----EMVVHFGGPVETGRGFVLHSTDYASSLNTLIVDGAFGMTATLDILEEIADGR 116 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P+ L+ LGYA W GQLE EI N WLT A +++F P A +W EA +GVD + Sbjct: 117 GPAQALMMLGYAGWGGGQLENEIAQNGWLTTNATSDLVFDLPAARKWSEALHSLGVDPIN 176 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 177 LSPAAGHA 184 >UniRef50_B9NUS5 Putative uncharacterized protein n=2 Tax=Rhodobacteraceae RepID=B9NUS5_9RHOB Length = 224 Score = 239 bits (610), Expect = 4e-62, Method: Composition-based stats. Identities = 81/188 (43%), Positives = 111/188 (59%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M+L LIAMP + DP F SVVY+C H +GAMG+IVNKP +L+I+ +LE+L I Sbjct: 41 MDLTGKLLIAMPGMGDPRFEHSVVYVCSHGDDGAMGLIVNKP-SDLRIKTLLEQLNI--- 96 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDK 119 P + ++ V GGP+ RGF+LH+ S++ISD MT + DVLE L + K Sbjct: 97 PCRIPVVGERLVQFGGPVEMSRGFVLHSADYEANLHSMQISDEFSMTATLDVLEDLASGK 156 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P + ++ALGY+ W QLE EI N WLT A ++F P ++W A +GVD LT Sbjct: 157 GPLNSMLALGYSGWGPDQLEDEIAMNGWLTTEASSKLIFDVPDDEKWGAALATLGVDPLT 216 Query: 180 MPGVAGHA 187 + AG A Sbjct: 217 LSASAGRA 224 >UniRef50_Q0EWH5 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EWH5_9PROT Length = 191 Score = 238 bits (609), Expect = 5e-62, Method: Composition-based stats. Identities = 72/188 (38%), Positives = 109/188 (57%), Gaps = 3/188 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE- 60 L L+A P+LQDP FR +VV IC+H+ +G +G+I+N+P ++ + I + + I E Sbjct: 5 GLTGQILLATPSLQDPNFRDTVVLICQHDRDGCLGLIINRP-RDIILGEIFDDMGIRYET 63 Query: 61 -PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + R+ V GGP+ RGF+LH + S++++S +T SRD LE L + Sbjct: 64 GSAENHERIQPVVYEGGPMDGFRGFLLHDGWDVYDSTMQVSPELHLTASRDALEELARGQ 123 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P ++ LGYA W GQLEQE+ DN+WL APA I+F+ P RW AA+ +G++ Sbjct: 124 GPEHYMLLLGYAGWGAGQLEQELCDNSWLIAPASHQIIFQEPPEKRWDFAARCMGIERGQ 183 Query: 180 MPGVAGHA 187 + GHA Sbjct: 184 LSSQIGHA 191 >UniRef50_B9MF60 UPF0301 protein Dtpsy_2896 n=28 Tax=Proteobacteria RepID=Y2896_DIAST Length = 199 Score = 238 bits (609), Expect = 6e-62, Method: Composition-based stats. Identities = 86/196 (43%), Positives = 126/196 (64%), Gaps = 13/196 (6%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP ++D F RSVVY+CEH+ GA+G+I+NKP + +EG+ EK+ ++ Sbjct: 8 MNLTHHFLIAMPGVEDASFSRSVVYLCEHSERGALGLIINKPTP-ISLEGLFEKVDLSLG 66 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHT---------PPSNFASSIRISDNTVMTTSRDV 111 D ++ +PV GGP+ +RGF+LH S +AS++ I MTTS+DV Sbjct: 67 REDLTL---QPVFQGGPVQTERGFVLHEAMRGPQESEDESPYASTMTIPGGLEMTTSKDV 123 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 LE L P VLV LGY++W +GQLE E+ +N+WLT AD++++F+TP+ +R+ A Sbjct: 124 LEALAHGAGPRRVLVTLGYSAWGEGQLESELAENSWLTVGADVSVIFETPVQERYDRALG 183 Query: 172 LIGVDILTMPGVAGHA 187 L+G+ + AGHA Sbjct: 184 LLGLQSWMLSPEAGHA 199 >UniRef50_Q1YVF9 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YVF9_9GAMM Length = 192 Score = 238 bits (608), Expect = 7e-62, Method: Composition-based stats. Identities = 85/187 (45%), Positives = 121/187 (64%), Gaps = 6/187 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ HFL+AMP L+DP F SVVYICEHN++GAMG+I+N+ ++ + ++ I ++LK+ + Sbjct: 11 SLKDHFLLAMPGLEDPTFSDSVVYICEHNSDGAMGLIINQQMD-IPVKAIFDQLKLEYQD 69 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + GGP+ DRGFILH + S++ ISD +T SRD+L + K Sbjct: 70 ECGR----PLLFDGGPVQRDRGFILHANCEQQWESTLMISDQVCLTASRDILSDMALGKG 125 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P D LV LGY+SWE GQLE+E+ +N+WLT PA+ I+FKT A R AA IG+D+ + Sbjct: 126 PKDSLVTLGYSSWEAGQLERELGENSWLTIPAEAEIIFKTDCAKRASAAALSIGLDLRML 185 Query: 181 PGVAGHA 187 AGHA Sbjct: 186 SHQAGHA 192 >UniRef50_A6VSP6 UPF0301 protein Mmwyl1_0539 n=2 Tax=Marinomonas RepID=Y539_MARMS Length = 188 Score = 238 bits (607), Expect = 1e-61, Method: Composition-based stats. Identities = 68/186 (36%), Positives = 108/186 (58%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + ++HFLI+MP L DP F +V+Y+CEH GAMGII+N+P N+ + + L I Sbjct: 7 SFKNHFLISMPHLDDPHFEHTVIYLCEHTKAGAMGIIINRP-SNVDFTELADHLGIQIH- 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 +P+ GGP+ +RGFILHT +++++R++D ++ S + LE + P Sbjct: 65 --SPRLSSEPIYTGGPVEAERGFILHTTDKVWSNTLRVTDEVSLSASLEALEDIAQGNGP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + LG A W+ GQLE EI +N WL ADL++LF TP ++ A +++G+D+ + Sbjct: 123 DAFRITLGCAGWDAGQLEAEIANNDWLVCEADLDVLFHTPSDMQFTAATRVLGIDMTRLS 182 Query: 182 GVAGHA 187 GH Sbjct: 183 PDIGHG 188 >UniRef50_C6NYY5 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NYY5_9GAMM Length = 185 Score = 237 bits (606), Expect = 1e-61, Method: Composition-based stats. Identities = 80/186 (43%), Positives = 119/186 (63%), Gaps = 5/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L++H LIAMP L D +F RSV+ ICEH+ GAMG+++N+ L ++ + LE + ITP Sbjct: 5 SLKNHLLIAMPNLHDGMFDRSVIVICEHSPEGAMGLVINR-LLDISLAKALEAVNITPPE 63 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 KPV GGP+ GFILH ++ S+ + + +T+S D+L + + P Sbjct: 64 D----AAQKPVFWGGPVQPQHGFILHEGAGDWQVSMAVGEGLFLTSSPDILMAIAEHRGP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGYA W +GQLEQE+ +N+WL P DL++LF+ P A+RW+ AA+ +GVD+ + Sbjct: 120 ERFLLALGYAGWGEGQLEQELSENSWLHGPIDLSVLFELPPAERWQAAARGLGVDMRLLS 179 Query: 182 GVAGHA 187 G AGHA Sbjct: 180 GAAGHA 185 >UniRef50_C5SQD7 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SQD7_9CAUL Length = 216 Score = 236 bits (604), Expect = 2e-61, Method: Composition-based stats. Identities = 74/196 (37%), Positives = 109/196 (55%), Gaps = 13/196 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ L+AMP+L DP F SV+Y+C+H+ AMGI++N+P+ L ++E+L I Sbjct: 24 SLQGRLLVAMPSLDDPNFDHSVIYMCQHDPESAMGIVLNQPIGGLTFPRMMEELGID--- 80 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHT----------PPSNFASSIRISDNTVMTTSRDV 111 ++ + P+ GGP+ +RGF+LH+ P ++ + D +T SRD+ Sbjct: 81 ITDNRHVATPIYNGGPVQNERGFVLHSLDYFIDEVTLPLDIDPEALELRDGIGLTVSRDI 140 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 L L PS VL+ALGYA W GQLE EI DNAWL AP ++LF + W + K Sbjct: 141 LVDLARGAGPSRVLIALGYAGWGPGQLEAEIRDNAWLVAPCQADLLFSHDASALWSKTLK 200 Query: 172 LIGVDILTMPGVAGHA 187 L+G+ + AG A Sbjct: 201 LLGISPEHLSLNAGRA 216 >UniRef50_A1SUE8 Putative uncharacterized protein n=2 Tax=Psychromonas RepID=A1SUE8_PSYIN Length = 197 Score = 236 bits (604), Expect = 2e-61, Method: Composition-based stats. Identities = 77/185 (41%), Positives = 119/185 (64%), Gaps = 3/185 (1%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFLIAMP+L DP F+ SVVYICEH+ GAMG I+N P+ L ++ +L + Sbjct: 16 LKDHFLIAMPSLNDPYFKHSVVYICEHDEKGAMGFIINFPV-KLTLQELLNNVDSIDHYP 74 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + PV LGGPL +RGF+LH+P ++ + S +++D +++ S +L TLGT+ +P Sbjct: 75 EPPLL--NPVFLGGPLELERGFVLHSPVTDNSQSTKLNDQLMVSNSNAILSTLGTENEPE 132 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 + +V LGYASW GQLE+E+ DN W++ + +I+F TP+ RW E+ + +G+ + Sbjct: 133 EYIVTLGYASWSSGQLEKEMNDNHWISMESQNDIIFSTPVEQRWIESLQRLGIHPEQLST 192 Query: 183 VAGHA 187 GHA Sbjct: 193 EIGHA 197 >UniRef50_Q1MZ13 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MZ13_9GAMM Length = 185 Score = 236 bits (602), Expect = 3e-61, Method: Composition-based stats. Identities = 80/186 (43%), Positives = 119/186 (63%), Gaps = 6/186 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL+HH L+AMP+L DP F SV YIC+HN G+MG+++NKP+ +++ +L +L I + Sbjct: 6 NLKHHLLLAMPSLSDPYFGHSVCYICDHNEQGSMGLVLNKPM-GIELTDVLSELDIETDK 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 P++ GGP++ ++GF+L+ + ++ I+ + +TTS+D+L L P Sbjct: 65 PIH-----FPILQGGPVSPEQGFVLYRGSESELQNMVINGDIRLTTSKDILSQLALGSGP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 DV + LGYA WE GQLEQE++ NAWLT PAD +LF TP+ +AA IGVD+ + Sbjct: 120 DDVRICLGYAGWEAGQLEQELIQNAWLTVPADEELLFHTPMDQMLEKAASRIGVDMSLIS 179 Query: 182 GVAGHA 187 G AGHA Sbjct: 180 GEAGHA 185 >UniRef50_A4SVF0 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SVF0_POLSQ Length = 210 Score = 236 bits (602), Expect = 4e-61, Method: Composition-based stats. Identities = 77/191 (40%), Positives = 110/191 (57%), Gaps = 10/191 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L + FLIAMP + D F SV+Y+ EHN GAMG++VNKP E + + + +K+++ E Sbjct: 24 LANQFLIAMPGMVDANFAGSVIYLFEHNARGAMGLVVNKPTE-VDLATLFDKIELKLEIA 82 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSN--FASSIRISDNTVMTTSRDVLETLGTDKQ 120 L++PV GGP+ +RGF+LH N ++SS+ I MTTS+DVLE + Sbjct: 83 PL---LEQPVYFGGPVQIERGFVLHESNKNLSYSSSLIIPGGLTMTTSKDVLEAVAIGNG 139 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADL----NILFKTPIADRWREAAKLIGVD 176 P L+ LGYA W GQLE+EI N W+ P I+F TP + R+ + +G D Sbjct: 140 PRKFLMTLGYAGWSAGQLEEEITLNGWMNVPLSREQMMEIIFNTPPSQRYEKTMNHLGFD 199 Query: 177 ILTMPGVAGHA 187 + + G AGHA Sbjct: 200 LSHLSGEAGHA 210 >UniRef50_C3K3J9 UPF0301 protein PFLU_5755 n=5 Tax=cellular organisms RepID=Y5755_PSEFS Length = 189 Score = 235 bits (601), Expect = 5e-61, Method: Composition-based stats. Identities = 76/185 (41%), Positives = 110/185 (59%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+H FLIAMP + DP F +++ YI EH GAMG+++N+P + L + ILE+L+ PE Sbjct: 9 LKHQFLIAMPHMADPNFAQTLTYIVEHTAKGAMGLVINRP-QELNLADILEQLR--PEVD 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + P+ +GGP+ DRGF+LH F +++ + + ++TS+DVL + P Sbjct: 66 PPARCQGVPIYIGGPVQTDRGFVLHPTGPKFQATVDL-EGVSLSTSQDVLFAIADGVGPE 124 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ LGYA WE GQLE E+ NAWLT P D ILF TP R AA + V++ + Sbjct: 125 QSVITLGYAGWEAGQLEAELASNAWLTCPFDAEILFNTPSELRLEAAAAKLRVNLNLLTS 184 Query: 183 VAGHA 187 AGHA Sbjct: 185 QAGHA 189 >UniRef50_Q0AMH8 Putative uncharacterized protein n=3 Tax=Hyphomonadaceae RepID=Q0AMH8_MARMM Length = 195 Score = 235 bits (600), Expect = 7e-61, Method: Composition-based stats. Identities = 73/186 (39%), Positives = 114/186 (61%), Gaps = 5/186 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIA PA+ DP F R+V+ +C+H GAMGII+NKP L++ + E+L++ Sbjct: 14 LGGKLLIATPAIGDPRFDRAVILVCDHTAEGAMGIIINKPAAGLRLPELFEQLEVDSSQP 73 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDKQP 121 D PV++GGP+ +DRGF+LHT N +++ I+D +T ++DVLE + +D P Sbjct: 74 ----APDGPVLVGGPVDKDRGFVLHTRDYANDEATLPINDRIGLTATKDVLEAMASDSPP 129 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGY+ W GQL+ E++ NAWL D ++F+T AD+W A + +G+ + Sbjct: 130 QRSLLALGYSGWAAGQLDDELVANAWLVCDMDEQLVFETDDADKWPRALECLGISPEHLS 189 Query: 182 GVAGHA 187 ++GHA Sbjct: 190 ALSGHA 195 >UniRef50_Q4ZZ67 UPF0301 protein Psyr_0485 n=31 Tax=Proteobacteria RepID=Y485_PSEU2 Length = 190 Score = 235 bits (600), Expect = 7e-61, Method: Composition-based stats. Identities = 73/185 (39%), Positives = 111/185 (60%), Gaps = 3/185 (1%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+HHFLIAMP + D F +++ YI EHN NGAMG+++N+P ++L + +LE+L+ PE Sbjct: 9 LKHHFLIAMPHMHDENFAQTLTYIVEHNANGAMGLVINRP-QSLTLADVLEQLR--PELP 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 D + GGP+ DRGF+LH F +++ + ++TS+DVL ++ P Sbjct: 66 APRHCQDIVIHTGGPVQTDRGFVLHPSGQTFQATVNLPGGISLSTSQDVLFSIADGYGPD 125 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ LGYA W+ GQL+ E+ DNAWLT D ILF R AA+ +G+++ + Sbjct: 126 QNVITLGYAGWDAGQLDAEMADNAWLTCSFDPAILFDVDSDQRLEAAARRLGINLNLIST 185 Query: 183 VAGHA 187 AGHA Sbjct: 186 QAGHA 190 >UniRef50_B5JTR0 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JTR0_9GAMM Length = 187 Score = 234 bits (598), Expect = 1e-60, Method: Composition-based stats. Identities = 86/189 (45%), Positives = 117/189 (61%), Gaps = 7/189 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPE 60 +L +HFLIAMP L+DP F RSV IC H+ + GA+GI + + ++ +E +L++L I Sbjct: 3 SLTNHFLIAMPDLEDPNFSRSVTLICHHSEDEGAIGITLTRATDH-SVEELLDQLDIQEA 61 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILH--TPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 + L P+ +GGP+ +DRGFILH + + ISD+ +T+S D+LE L Sbjct: 62 KLAATHAL--PLYIGGPVEQDRGFILHPNRKEYQWEGTETISDHLAITSSLDILEDLARG 119 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 K P + L+ALGYA W GQLEQEI DNAWL PAD I+F P RW AA+ +GVDI Sbjct: 120 KGPDNCLIALGYAGWSSGQLEQEITDNAWLHGPADPEIIFSLPAEQRWTAAAQSLGVDIR 179 Query: 179 TMPGVAGHA 187 + AGHA Sbjct: 180 LIHS-AGHA 187 >UniRef50_Q0C3C2 Putative uncharacterized protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C3C2_HYPNA Length = 188 Score = 234 bits (598), Expect = 1e-60, Method: Composition-based stats. Identities = 68/187 (36%), Positives = 110/187 (58%), Gaps = 5/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L LIAMP + DP F RSV+ +C H + AMGII+NKP++ + ++ I++++ I + Sbjct: 6 DLTGKLLIAMPGIGDPRFERSVILVCAHTPDFAMGIILNKPMDGIDLQEIIDQMDIPQDV 65 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFAS-SIRISDNTVMTTSRDVLETLGTDKQ 120 E + ++ GGP+A +RGF+LHT ++ + D MT +R++L ++ + Sbjct: 66 DLEGV----AILEGGPVATERGFVLHTDDVICDGATMEVEDELCMTATREILASIASAAP 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++ALGYA W GQLEQE+ NAWL D +++F +WR A +GVD+ + Sbjct: 122 PRKFVMALGYAGWGAGQLEQELAQNAWLIGAPDSDLVFGDAYEHKWRHAMTRMGVDLSRL 181 Query: 181 PGVAGHA 187 AG+A Sbjct: 182 QSNAGNA 188 >UniRef50_B8KLD5 Putative uncharacterized protein n=3 Tax=Proteobacteria RepID=B8KLD5_9GAMM Length = 208 Score = 234 bits (598), Expect = 1e-60, Method: Composition-based stats. Identities = 83/186 (44%), Positives = 118/186 (63%), Gaps = 6/186 (3%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFL+AMP L +F S+ Y+CEH GAMG+++N+PL+ L + I + L I + Sbjct: 28 LRDHFLLAMPGLDAGLFSGSITYLCEHGEAGAMGLVINQPLD-LSLGEIFDHLDIAAD-- 84 Query: 63 DESIRLDKPVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + D+PV+ GGP+ D GF+LH + + SS+R++D +TTSRDVL+ + + P Sbjct: 85 --AHFRDQPVLAGGPVQIDHGFVLHPSGGKRWDSSLRVTDEVQLTTSRDVLKAIACGEGP 142 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D +V LGYA W GQLE+EI +N+WLT PAD I+F T I DR AA +G+D+ M Sbjct: 143 RDFVVTLGYAGWSAGQLEEEIANNSWLTLPADKRIIFHTAIEDRVAAAASALGIDMNLMS 202 Query: 182 GVAGHA 187 AGHA Sbjct: 203 AQAGHA 208 >UniRef50_A1TKL7 UPF0301 protein Aave_0907 n=10 Tax=Comamonadaceae RepID=Y907_ACIAC Length = 213 Score = 233 bits (596), Expect = 2e-60, Method: Composition-based stats. Identities = 90/210 (42%), Positives = 125/210 (59%), Gaps = 27/210 (12%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP L+D F RSVVY+CEH+ GA+G+I+NKP +L ++G+ +K+ ++ Sbjct: 8 MNLTHHFLIAMPGLEDESFARSVVYLCEHSERGALGLIINKP-SDLSLKGLFDKVDLSLR 66 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP-----------------------PSNFASSI 97 D S+ PV GGP+ +RGF+LH S +AS++ Sbjct: 67 REDLSLE---PVFRGGPVQTERGFVLHEAMGPSSGKQAAGEGGAQAEGEGAEESAYASTM 123 Query: 98 RISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL 157 I MTTS+DVLE L T P VLV LGY+SW +GQLE E+ +N+WLT ADL+++ Sbjct: 124 SIPGGLEMTTSKDVLEALSTGAGPRRVLVTLGYSSWGEGQLESELAENSWLTVGADLSVI 183 Query: 158 FKTPIADRWREAAKLIGVDILTMPGVAGHA 187 F TP+ R+ A L+G+ + AGHA Sbjct: 184 FDTPVGQRYDRALALLGLQSWMLSPEAGHA 213 >UniRef50_Q2P5W3 UPF0301 protein XOO1309 n=20 Tax=Xanthomonadaceae RepID=Y1309_XANOM Length = 188 Score = 233 bits (596), Expect = 2e-60, Method: Composition-based stats. Identities = 82/185 (44%), Positives = 118/185 (63%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L + LIA+PAL DP F RSV IC+H+ NGAMG++VN+P E + +L ++ I Sbjct: 8 LANQLLIALPALSDPTFSRSVALICQHDENGAMGVLVNRPSEY-TLGEVLSQMGID---T 63 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 D+ ++ V+ GGP+ +RGF++H + SS+ + +TTSRD+LE + P Sbjct: 64 DDEPLREQIVLSGGPVHPERGFVIHDDAREWDSSLEVGQGVFLTTSRDILEAMAAGNGPR 123 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +VLVALG A W GQLE E+ +N+WLTAP+D N+LF T + DRW+ AA IGVD+ + Sbjct: 124 NVLVALGCAGWGAGQLEFELGENSWLTAPSDANVLFATALEDRWQTAAGRIGVDLFRLTD 183 Query: 183 VAGHA 187 +GHA Sbjct: 184 YSGHA 188 >UniRef50_B4REX9 Transcriptional regulator n=4 Tax=Caulobacteraceae RepID=B4REX9_PHEZH Length = 187 Score = 230 bits (588), Expect = 1e-59, Method: Composition-based stats. Identities = 70/186 (37%), Positives = 105/186 (56%), Gaps = 5/186 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIAMP + DP F R+++ +C H+ + AMG+ +N P+E L + +LE+L+I R Sbjct: 6 LSGQLLIAMPGISDPRFERTLILVCAHDAHHAMGLALNHPVEGLTVPDLLERLEIKSTIR 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT-DKQP 121 V++GGP+ +RGF+LHT S+ + +T +R+VLE +G+ D +P Sbjct: 66 LPPDL----VLVGGPVERERGFVLHTDDYQGEFSLPVGGGVALTATREVLEAMGSSDGRP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGYA W GQLE EI +N WLT AD ++F +W A +G+D + Sbjct: 122 RRSLLALGYAGWGAGQLEHEIRENVWLTCEADEALIFDADYDTKWARAVAKLGIDPTFLT 181 Query: 182 GVAGHA 187 AG A Sbjct: 182 AEAGRA 187 >UniRef50_A8PPF4 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PPF4_9COXI Length = 195 Score = 230 bits (588), Expect = 1e-59, Method: Composition-based stats. Identities = 74/187 (39%), Positives = 118/187 (63%), Gaps = 3/187 (1%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKI---EGILEKLKITPE 60 ++FL+AMP L D F RSVVYICEH GA+GI++N+PL++L + E + E + + Sbjct: 9 TNYFLVAMPILTDAYFSRSVVYICEHTEKGAVGIVINQPLQSLHVNLAEIVQEITESNLK 68 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + P++ GGP+ +RGF++H P + SS++++ +TTS+D+L + + Sbjct: 69 STKTTAGANFPILCGGPIHPERGFVIHAPSGAWQSSLKMNSEISVTTSKDILLAIAKQQG 128 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P + +LGYA+W GQ+EQEI++N WLT PA+ N+LF P RW +A +GVD+ + Sbjct: 129 PEKFIFSLGYANWIAGQMEQEIINNFWLTLPANPNLLFDVPFEQRWLKAMDYLGVDVTKL 188 Query: 181 PGVAGHA 187 + GHA Sbjct: 189 AYMGGHA 195 >UniRef50_A1KUG1 UPF0301 protein NMC1274 n=27 Tax=Neisseriaceae RepID=Y1274_NEIMF Length = 182 Score = 229 bits (585), Expect = 3e-59, Method: Composition-based stats. Identities = 83/188 (44%), Positives = 123/188 (65%), Gaps = 7/188 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL +HFL+AMP ++D F +SVVYIC+H+ +GA+GI +NKP I + + Sbjct: 1 MNLSNHFLVAMPDMEDAFFSQSVVYICKHDEDGALGIAINKPSP------ITMDMIFSAT 54 Query: 61 PRDESIRLDK-PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 ++ +R+ VM+GGP+ +RG+++HTP N+ SSI +SDN +T+SRDV+E + + Sbjct: 55 GKNIPMRMQHDSVMMGGPVQVERGYVVHTPIGNWQSSIGVSDNIALTSSRDVIENISREG 114 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 L+++GY+SW KGQLE+E+ DNAWLT PAD +ILF P R+ A +G+D L Sbjct: 115 AVDKALISIGYSSWGKGQLERELADNAWLTVPADEHILFDIPYEHRYAAAFAKLGIDPLA 174 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 175 LFSGAGHA 182 >UniRef50_A4BI12 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BI12_9GAMM Length = 183 Score = 229 bits (584), Expect = 4e-59, Method: Composition-based stats. Identities = 77/187 (41%), Positives = 116/187 (62%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP + DP+F ++ Y+ +H+ GA+G+IVN+PL NL +E + E +++ Sbjct: 1 MNLNHHFLIAMPQMGDPVFSGTLTYLVQHDEQGALGLIVNRPL-NLNLEEVFESSELS-- 57 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 KPV GGP+A+++GFILH P S ++ V+TTSRD+LE + D+ Sbjct: 58 -GYSPRTGSKPVYHGGPVAQEQGFILHPPTEQTWISSLSNEQLVLTTSRDMLEAIAQDEG 116 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P L LGY+ W GQLE+E+ +NAWLT A+ I+F+ +++ A +G+D+ T+ Sbjct: 117 PERFLFCLGYSGWSPGQLEEELKENAWLTVEANEAIIFQDDEVGKYQHALSDLGIDLATL 176 Query: 181 PGVAGHA 187 G G A Sbjct: 177 SGHGGLA 183 >UniRef50_A6WWH2 Putative uncharacterized protein n=2 Tax=Ochrobactrum RepID=A6WWH2_OCHA4 Length = 214 Score = 228 bits (581), Expect = 9e-59, Method: Composition-based stats. Identities = 66/188 (35%), Positives = 105/188 (55%), Gaps = 4/188 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FL+AMP + D F RSVVYIC H+ GAMG I+N+ L+ ++ +L ++ + E Sbjct: 28 LNGQFLLAMPGMSDERFARSVVYICAHSDEGAMGFIINQ-LQPVEFPDLLRQIGVIDEDE 86 Query: 63 DESI---RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + V GGP+ RGF+LH+ S++ +S+ +T + D+L + + Sbjct: 87 LIILPDRAQHMMVRNGGPVDRTRGFVLHSDDYMVDSTMPVSEEVCLTATVDILRAIYGGR 146 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 PS L+ALGY+ W GQ+E E+ +N WLT A L++LF + I ++ +G+D+ Sbjct: 147 GPSRALMALGYSGWAPGQIEVELAENGWLTCDAPLDMLFDSDIEGKYSRLMLHMGIDMSR 206 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 207 LVSDAGHA 214 >UniRef50_A9KDE7 UPF0301 protein CBUD_2193 n=7 Tax=Coxiella burnetii RepID=Y2193_COXBN Length = 181 Score = 221 bits (563), Expect = 1e-56, Method: Composition-based stats. Identities = 76/185 (41%), Positives = 113/185 (61%), Gaps = 10/185 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L +HFL+AMP L D F ++V+Y+ +H+ GA+GII+N+PL L + +LE L I Sbjct: 7 LSNHFLVAMPQLNDFTFTKAVIYVSQHDAKGALGIIINRPL-ALTLGKVLEHLNIE---I 62 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + PV++GGP+ ++ GFI+ + +++ S+D+L+ + +K P Sbjct: 63 AQPQIANHPVLMGGPIGQEHGFIV------YEQESPQGAEILLSASKDMLDDIAKNKGPD 116 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 D L+ LGYA WE GQLE EI N WL P + ILF+TP+ RW++AA LIGVDI + G Sbjct: 117 DFLITLGYAGWEAGQLENEIARNDWLVVPFNRKILFETPLKSRWQKAAALIGVDINQLSG 176 Query: 183 VAGHA 187 GHA Sbjct: 177 QIGHA 181 >UniRef50_Q31EK4 UPF0301 protein Tcr_1827 n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Y1827_THICR Length = 188 Score = 219 bits (559), Expect = 4e-56, Method: Composition-based stats. Identities = 71/185 (38%), Positives = 109/185 (58%), Gaps = 3/185 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+HHFLIAMP L + F ++V+YI E N +G MG+++N NL + +L+ ++T E Sbjct: 6 SLEHHFLIAMPNLTESWFDKTVIYIVEDNEHGTMGLVIN-LEHNLTVPELLDHFELTVEA 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + D+PV++GGP+ + GFILH P + S+ + DN MT S D L+ + P Sbjct: 65 PEN--YADQPVLMGGPVDLEHGFILHEPQGTWQKSLPLRDNLAMTVSEDFLKAMADGTAP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++V LG++ WEKGQL EI N WLT P + +LF P +W+ A +G+ ++ Sbjct: 123 EKIVVCLGFSGWEKGQLNDEIQANNWLTIPYNEALLFDVPNDQKWQVALNTLGISPESLS 182 Query: 182 GVAGH 186 AGH Sbjct: 183 MDAGH 187 >UniRef50_Q0I1B4 UPF0301 protein HS_0009 n=26 Tax=Pasteurellaceae RepID=Y009_HAES1 Length = 187 Score = 218 bits (556), Expect = 7e-56, Method: Composition-based stats. Identities = 81/184 (44%), Positives = 111/184 (60%), Gaps = 4/184 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQ HFLIAMP L+D F+RSVVYICE+N G+MG+++ + + L I + K+ Sbjct: 1 MNLQDHFLIAMPHLEDENFQRSVVYICENNEQGSMGLVLTQATD-LSIAELCAKMNFMM- 58 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP-PSNFASSIRISDNTVMTTSRDVLETLGTDK 119 DE DK V+LGGP+ + GFILH F S +++D +TTS D++ T GT + Sbjct: 59 -ADEREYSDKLVLLGGPVNLEHGFILHKKTAQEFQHSYKVTDQIYLTTSADIINTFGTAQ 117 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P LV LG A WE QLE EI +N WL PAD +ILF I++RW A +L+G++ + Sbjct: 118 SPEKYLVTLGCARWEPNQLENEIANNDWLVVPADEDILFDVDISERWFAANQLLGIEHVN 177 Query: 180 MPGV 183 Sbjct: 178 FSYQ 181 >UniRef50_Q1NQW6 Putative uncharacterized protein n=2 Tax=Deltaproteobacteria RepID=Q1NQW6_9DELT Length = 201 Score = 216 bits (550), Expect = 4e-55, Method: Composition-based stats. Identities = 65/186 (34%), Positives = 102/186 (54%), Gaps = 3/186 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ +FLIA P + DP F+ +V+ +C HN GAMG+++N+P+ ++++E I I P Sbjct: 19 SLQGYFLIATPQMSDPRFQETVILLCAHNEEGAMGLVINQPIRDVELEDIFHNAGIPLPP 78 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + V LGGP+ FI+++ + + ++ + ++ +L L + P Sbjct: 79 GAGPLGS---VYLGGPVETGNVFIVYSAEYEVVNHLAVTPSISLSRDPQLLYDLAAGRGP 135 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LV+LGYA W GQLE E+ + WL PA I+F TP +WR AA++ GVDI Sbjct: 136 RHYLVSLGYAGWGAGQLEAELSVDGWLALPAKDEIIFNTPNQHKWRRAAQIHGVDIGLFG 195 Query: 182 GVAGHA 187 V G A Sbjct: 196 AVVGSA 201 >UniRef50_A5WBR3 UPF0301 protein PsycPRwf_0144 n=21 Tax=Moraxellaceae RepID=Y144_PSYWF Length = 188 Score = 216 bits (550), Expect = 4e-55, Method: Composition-based stats. Identities = 78/187 (41%), Positives = 118/187 (63%), Gaps = 4/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL HHFLIA P++ D F +S+VYIC H+ +G +G++VN+P+ + ++ +L+ L I E Sbjct: 5 NLTHHFLIAAPSMPDERFAQSLVYICRHDRHGVLGLVVNRPIFDTQVGHLLDNLDI--EV 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD-KQ 120 D S+ D P+ GGP+ + GF+LHT +ASS IS+N +TTS+D+L+ + Sbjct: 63 TDTSVMYDTPL-DGGPVYPEVGFVLHTGQPTWASSFPISENVCITTSKDILQNIAAGSAG 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + LG+ASW +GQLE+EI WL +P DL++LF+ P +RWR AA+ IGV + + Sbjct: 122 IGHYHLCLGHASWHEGQLEKEISQGDWLVSPGDLSLLFEIPFEERWRHAAEKIGVHLDFL 181 Query: 181 PGVAGHA 187 G A Sbjct: 182 SDEVGRA 188 >UniRef50_C0QFZ0 UPF0301 protein HRM2_24640 n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=Y2464_DESAH Length = 189 Score = 214 bits (546), Expect = 1e-54, Method: Composition-based stats. Identities = 69/185 (37%), Positives = 102/185 (55%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFL+A+P L DP F ++V ICEHN GA+G I+N+ L + + E LKIT Sbjct: 9 LKGHFLMAIPGLPDPNFAQTVTCICEHNKTGALGFIINRIHPLLTGQELFEDLKITCNQA 68 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + I + LGGP+ F+LH PP ++ ++I+D ++ +RD+LE + + P Sbjct: 69 IDKI----AIHLGGPVQPSGVFVLHGPPFDWHGCLKINDWLGLSNTRDILEAVARQEGPE 124 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 + +V LG A W QL+ EI DNAWLT P ILFKT + +W +G+ Sbjct: 125 NFIVLLGCAGWGPLQLDNEINDNAWLTIPVSQEILFKTDVKLKWEMTMMQMGIVSDNHSD 184 Query: 183 VAGHA 187 +G A Sbjct: 185 NSGKA 189 >UniRef50_A4A9E0 Protein containing DUF179 n=1 Tax=Congregibacter litoralis KT71 RepID=A4A9E0_9GAMM Length = 173 Score = 213 bits (543), Expect = 2e-54, Method: Composition-based stats. Identities = 81/178 (45%), Positives = 108/178 (60%), Gaps = 6/178 (3%) Query: 11 MPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDK 70 MP L +F S+ YICEH GAMGI++N+PL+ L + I + L+I PR D+ Sbjct: 1 MPGLDSGLFSGSITYICEHGEAGAMGIVINQPLD-LSLGEIFDHLEIDCAPR----FQDQ 55 Query: 71 PVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALG 129 V+ GGP+ D GF+LH + SS+R++ +TTSRDVL + + P D VALG Sbjct: 56 VVLAGGPVQIDHGFVLHPRGEQTWDSSLRVTPEVQLTTSRDVLSAIAAGEGPKDYAVALG 115 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 YA W GQLE+EI +N+WLT PAD I+F T I DR AA +G+D+ M AGHA Sbjct: 116 YAGWSAGQLEEEIANNSWLTLPADKRIIFHTAIEDRVAAAAAALGIDMNLMSAEAGHA 173 >UniRef50_Q2GAJ3 UPF0301 protein Saro_0683 n=4 Tax=Sphingomonadaceae RepID=Y683_NOVAD Length = 186 Score = 212 bits (541), Expect = 4e-54, Method: Composition-based stats. Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 5/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+AMP + DP F +V+ +C H+ +GA+GI V E + + G+LE + I P Sbjct: 7 LGGRLLLAMPGMGDPRFDHAVIAMCVHDEHGALGIGVGHVREGITLHGLLEDVGIDP--- 63 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + D PV+ GGP+ RGF+LH+ S+ ++ ++ S D+L + + PS Sbjct: 64 --GLAPDMPVLNGGPVETARGFVLHSDDWGGEGSVTVNGLCCLSASLDILRAIAEGRGPS 121 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ALGYA W GQLE E+ + W A ILF+TP RW +A K G+D + G Sbjct: 122 RFVIALGYAGWGGGQLEGEMRRHGWYAAQGRPEILFETPTGRRWTQAWKREGIDPAHLVG 181 Query: 183 VAGHA 187 G A Sbjct: 182 QTGSA 186 >UniRef50_A3MYV4 UPF0301 protein APL_0232 n=7 Tax=Pasteurellaceae RepID=Y232_ACTP2 Length = 186 Score = 211 bits (539), Expect = 8e-54, Method: Composition-based stats. Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 5/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NLQ FLIA P + D F R+V+YICEHN+NGAMG+++N P + L + ++ ++ Sbjct: 4 NLQGKFLIATPEIDDDYFDRTVIYICEHNSNGAMGLVINTPTD-LSVLELITRMDFQM-A 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTP-PSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + D+ V+ GGP+++DRGFI+HT F S R++DN ++TTS DVL++LG + Sbjct: 62 NQRNYHKDQMVLSGGPVSQDRGFIIHTKTEQEFLHSYRVTDNILLTTSGDVLDSLGKPEA 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P +V LG A+W+ QLEQEI N WL + A+ LF+T +RW EA +++G+ + Sbjct: 122 PEKFIVCLGCATWKPEQLEQEIARNYWLISEANDKTLFETGYLERWVEANEMLGISGVL- 180 Query: 181 PGVAGHA 187 AG A Sbjct: 181 -ARAGRA 186 >UniRef50_Q5FQY8 UPF0301 protein GOX1459 n=11 Tax=Acetobacteraceae RepID=Y1459_GLUOX Length = 187 Score = 211 bits (538), Expect = 1e-53, Method: Composition-based stats. Identities = 69/189 (36%), Positives = 105/189 (55%), Gaps = 6/189 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITP 59 + L L+A PAL + F R+V+Y+C H+ +GAMG+IVN+ L ++ + +L I P Sbjct: 3 LGLTGKLLVAAPALAETFFERTVIYLCAHSEQDGAMGLIVNRRLSQPGLDDLFAQLGIEP 62 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P + I V +GGP+ RGF+LH+ S+ + +T +T S D+L + Sbjct: 63 SPPERRIG----VCMGGPVEHARGFVLHSADWAGEGSLDVDGHTTLTASLDILREIAAGH 118 Query: 120 QPSDVLVALGYASWEKGQLEQEI-LDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P ++ALG+A+W GQLE+EI D++W APA I+F T A +WR+A I D L Sbjct: 119 GPRQAVMALGHAAWAPGQLEEEILRDSSWFIAPATDEIVFGTDHAKKWRQALVAIDFDPL 178 Query: 179 TMPGVAGHA 187 + G A Sbjct: 179 LLSSSVGEA 187 >UniRef50_A0L5K4 UPF0301 protein Mmc1_0726 n=1 Tax=Magnetococcus sp. MC-1 RepID=Y726_MAGSM Length = 186 Score = 209 bits (533), Expect = 4e-53, Method: Composition-based stats. Identities = 63/185 (34%), Positives = 102/185 (55%), Gaps = 7/185 (3%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FLIA+P+L DP F R+V+Y+C HN +GA+G+++N+PL+ + + L++ + Sbjct: 6 LAGKFLIAVPSLADPFFERTVLYLCAHNEDGALGLVINQPLD-TTMSQMAGYLELDWQRP 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 V +GGP++ ++GF+L + + + D+ M T+ D++ +G Sbjct: 65 GVD-----RVYMGGPVSPEQGFVLFEQALDLPGIMMLPDDLYMGTNPDIIRLMGRAGAQE 119 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 L ALGYA WE GQLE E+ +N+WL A +ILF A RW A + +G+D + Sbjct: 120 RFLFALGYAGWEAGQLEHELQENSWLVCDAQRSILFDMGYAQRWEAAIRSMGIDPALLV- 178 Query: 183 VAGHA 187 A H Sbjct: 179 DASHG 183 >UniRef50_Q6AL28 UPF0301 protein DP2218 n=1 Tax=Desulfotalea psychrophila RepID=Y2218_DESPS Length = 190 Score = 198 bits (505), Expect = 6e-50, Method: Composition-based stats. Identities = 61/186 (32%), Positives = 102/186 (54%), Gaps = 5/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L +FL++ + D F VVY+C HN+NGA+G+++NKP NL +L ++ + Sbjct: 10 SLAGYFLVSTLQMPDSRFAGQVVYVCSHNSNGALGLVINKPDCNLSFAQVLREMGMEVSR 69 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + V +GGP++ D F+L+ + I I+DN ++ +++LE + + Sbjct: 70 AELPS-----VYIGGPVSLDAAFVLYRSHPYEGNHIDITDNISLSREKELLELVVGENSS 124 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + L +GY WE GQLE E+ DN+WL P D ++F P ++W+ AA G+DI T Sbjct: 125 RNYLFLVGYVGWESGQLELELRDNSWLVVPGDEQVIFDLPDGEKWKAAAAYYGIDITTFN 184 Query: 182 GVAGHA 187 G+A Sbjct: 185 ENLGYA 190 >UniRef50_C8CIK8 Putative uncharacterized protein n=1 Tax=uncultured bacterium B7P37metaSE RepID=C8CIK8_9BACT Length = 200 Score = 198 bits (504), Expect = 9e-50, Method: Composition-based stats. Identities = 53/170 (31%), Positives = 83/170 (48%), Gaps = 4/170 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIA P + DP F R+V+ + HN++GAM I++N+PL + IL+ Sbjct: 30 LTGQLLIAAPGMTDPRFDRTVLVMVRHNSDGAMAIVINRPLGERSMARILQAFGEKAPDD 89 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 ++ PV LGGP+ + +LH+ ++ I + +T S ++ + + P Sbjct: 90 SATV----PVYLGGPVQLEMSTVLHSAEYRRNGTLDIDGHVAVTASMEIYRDIAANTGPE 145 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 LV GYA W GQLE E+ N W TAP D+ ++F W A + Sbjct: 146 KSLVVFGYAGWAPGQLEGEMAQNVWFTAPLDVKLVFDADRDKVWDLAMER 195 >UniRef50_Q60BQ2 UPF0301 protein MCA0413 1 n=1 Tax=Methylococcus capsulatus RepID=Y413_METCA Length = 182 Score = 196 bits (500), Expect = 3e-49, Method: Composition-based stats. Identities = 59/172 (34%), Positives = 88/172 (51%), Gaps = 5/172 (2%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 FL+A P + IF SV+Y+ HN +GAMG+IVN+ + +LE + + + Sbjct: 14 SGQFLVAHPKMPANIFAHSVIYVVSHNADGAMGLIVNRLAGAGPLGKLLEAFGLASKAQR 73 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 E + LGGP+ +GF+LH+ AS+ + ++T DVLE + + P Sbjct: 74 E-----IKLYLGGPVGIGQGFVLHSDDYAGASTRALKKGLSLSTGLDVLEAIARGRGPRQ 128 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 V + GYA W GQL+ EI WL APAD +++F W EA K G+ Sbjct: 129 VRMLFGYAGWSPGQLDGEIARGDWLLAPADTSLIFSEEPDKVWEEALKHAGL 180 >UniRef50_B3QT15 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QT15_CHLT3 Length = 187 Score = 196 bits (499), Expect = 3e-49, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 83/180 (46%), Gaps = 13/180 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LIA L DP F+RSVV +CEHN G G+I+NKPL+ + I +E ++ Sbjct: 10 RGILLIAGAQLIDPNFKRSVVLLCEHNEEGTFGLILNKPLD-INISEAIEDIEDW----- 63 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QP 121 D + GGP+ + +LH +I + D + + + ++ + P Sbjct: 64 -----DIALHAGGPVQPNTVHVLHRLGDEIEDAIEVVDGVYWGGNYETIRSMINTRHASP 118 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D LGY+ W GQL+QEI ++W A A N++F W A + G D + Sbjct: 119 DDFRFFLGYSGWGPGQLQQEIDQDSWYQAKATANVVFNPVYDRMWARALRAKGGDYAIIA 178 >UniRef50_B0BVW3 UPF0301 protein RrIowa_0061 n=15 Tax=Rickettsia RepID=Y061_RICRO Length = 189 Score = 192 bits (488), Expect = 6e-48, Method: Composition-based stats. Identities = 49/187 (26%), Positives = 95/187 (50%), Gaps = 6/187 (3%) Query: 2 NLQHHFLIAMPA-LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 NL L+A P + I+ +S++Y+ H GA+G+I N+ + ++ ++ KI + Sbjct: 8 NLSGKTLVATPHVITKGIYHKSLIYMLSHTEEGAIGLIFNRLVNHIDLKSFF---KIKND 64 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + P+ LGGP+ ++GF LH+ N + ++ ++++ ++ E + K Sbjct: 65 EITTPVMV--PIYLGGPVEHEKGFFLHSSDYNKNLLLDFHNDLAVSSNLEISEDIAFGKG 122 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P + L +GY +W+ GQLE+E+ N WL + +F +W A K +G+D + Sbjct: 123 PKNSLFIVGYTAWKPGQLEEELETNLWLVMDCNKEFIFADNPESKWHNALKHLGIDEIHF 182 Query: 181 PGVAGHA 187 G+A Sbjct: 183 SSQIGNA 189 >UniRef50_A7C130 Protein containing DUF179 n=1 Tax=Beggiatoa sp. PS RepID=A7C130_9GAMM Length = 158 Score = 191 bits (485), Expect = 1e-47, Method: Composition-based stats. Identities = 58/154 (37%), Positives = 95/154 (61%), Gaps = 4/154 (2%) Query: 34 AMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF 93 AMGI++N+PL+ + + +LE + I + + P+ GGP+ +RGF++H P + Sbjct: 9 AMGIVINRPLD-VDLGDVLEHMNIEANDQRATRM---PIFDGGPVQRERGFVIHQPVGQW 64 Query: 94 ASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPAD 153 + + I++N + TSRD++ + + PS+ L+ALGYA W GQLEQE+ DNAWL+ PAD Sbjct: 65 DAMLSINNNLGIATSRDIISAIANGQGPSNALIALGYAGWTAGQLEQEMADNAWLSTPAD 124 Query: 154 LNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 +++F+T RW AA +G+D+ + GH Sbjct: 125 YSVIFQTTPEQRWHAAAASMGIDLTLLSSQVGHG 158 >UniRef50_Q2S591 UPF0301 protein SRU_0495 n=2 Tax=Rhodothermaceae RepID=Y495_SALRD Length = 188 Score = 190 bits (483), Expect = 2e-47, Method: Composition-based stats. Identities = 51/180 (28%), Positives = 84/180 (46%), Gaps = 15/180 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LI+ P +QDP FRRSVV +CEHN G G+I+N+ L+ + + +L++ Sbjct: 12 GTLLISAPMMQDPNFRRSVVLLCEHNDREGTFGLILNRELD-VSLGDVLDEY-------- 62 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ--P 121 + D P+ +GGP+ + LHT + + + + ++ L P Sbjct: 63 --VTYDPPLYMGGPVQRETLHYLHTRED-IPGGVALPGDMTWGGDFEAVQQLAKGGDAAP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++ LGYA W GQLE E+ + AW+ AP +F T WR + +G + + Sbjct: 120 DNLRFFLGYAGWGPGQLEGELGEEAWIPAPGAAEFVFDTDPDQLWRAILRRMGGEYAVLA 179 >UniRef50_Q1DAS2 UPF0301 protein MXAN_2022 n=2 Tax=Cystobacterineae RepID=Y2022_MYXXD Length = 181 Score = 189 bits (482), Expect = 3e-47, Method: Composition-based stats. Identities = 56/184 (30%), Positives = 82/184 (44%), Gaps = 7/184 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL L+AMP L DP F RSVV + EH+ +G+MG+++N+ L +L Sbjct: 3 NLAPGLLLAMPQLGDPNFYRSVVLMLEHSESGSMGLVINR-----GAPLTLGELARGQNL 57 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + R + V LGGP+ RGF+LH + ++ + D L L T+ P Sbjct: 58 GIAAGRKEHSVYLGGPVEPQRGFVLHDDTEQREKH-SVLPGLFLSVTLDALGPLLTNPNP 116 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + LGYA W QLE EI +WL A + + W + +GVD + Sbjct: 117 -RLRFCLGYAGWGPRQLESEIAAGSWLFTEATAEAVLGHEPSKLWDTTLRGMGVDPAMLV 175 Query: 182 GVAG 185 G Sbjct: 176 MGRG 179 >UniRef50_Q3B561 UPF0301 protein Plut_0637 n=11 Tax=Chlorobiaceae RepID=Y637_PELLD Length = 189 Score = 188 bits (477), Expect = 1e-46, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 78/180 (43%), Gaps = 13/180 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LIA L + F+R+V+ +CEHN G++G I+N+P+E E + Sbjct: 12 AGKLLIASANLLESNFKRTVLMMCEHNPQGSLGFILNRPMEFQVREAV-----------A 60 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QP 121 +D+P+ +GGP+ + LH S +I R+ L L +P Sbjct: 61 GFDEVDEPLHMGGPVQSNTVHFLHMRGDLIDGSEQILPGLYWGGDREELGYLLNTGVLKP 120 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 S++ LGYA W GQLE E + +W TA A ++F W + G + + Sbjct: 121 SEIRFFLGYAGWSAGQLEAEFEEGSWYTADATPAMVFSGEYERMWSRTVRSKGGEYQLIA 180 >UniRef50_A6G4Z9 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4Z9_9DELT Length = 210 Score = 186 bits (474), Expect = 3e-46, Method: Composition-based stats. Identities = 64/197 (32%), Positives = 94/197 (47%), Gaps = 19/197 (9%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L H L A+P L DP F+RSVV + EH+ GA+G+++N+ + N + + E L + Sbjct: 15 GLACHLLCAVPQLLDPNFKRSVVLMLEHDERGALGLVINRTM-NTSLSEVAEALDLEW-- 71 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT---D 118 D V +GGP+ RG+ LH + + + D +TTS + + G+ Sbjct: 72 ---CGDPDAQVRIGGPVEPVRGWFLHDQGAWDPDASSLVDGLWVTTSLEGVGAAGSVRFG 128 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAP----------ADLNILFKTPIADRWRE 168 + S+ L LGYA W GQLE EI +W+ P D LF TP W Sbjct: 129 SEESNFLFLLGYAGWSGGQLEGEIAAGSWVLVPLVDDDDPRVGVDPTFLFDTPPEHMWSL 188 Query: 169 AAKLIGVDILTMPGVAG 185 A + IGVD + G+ G Sbjct: 189 ALQSIGVDPQRLVGLQG 205 >UniRef50_Q5NQN1 UPF0301 protein ZMO0349 n=3 Tax=Zymomonas mobilis RepID=Y349_ZYMMO Length = 188 Score = 186 bits (473), Expect = 4e-46, Method: Composition-based stats. Identities = 54/176 (30%), Positives = 92/176 (52%), Gaps = 5/176 (2%) Query: 12 PALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKP 71 P ++D F+++V+ +C N GA+G+ + + + ++ + ++ +L I P + D+P Sbjct: 18 PNMRDVEFQKAVIALCAFNEKGALGLNIGRIIPDVTLHSLMHQLGIQP-----GLVPDRP 72 Query: 72 VMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYA 131 V GGP RG +LH+ + S+ + + +T + DVL L + P LVALGYA Sbjct: 73 VHDGGPCEPQRGMVLHSRDWHSPDSMMVGQDWALTCTLDVLHALSRGEGPQHWLVALGYA 132 Query: 132 SWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 W GQL+QE+ W + D +LF P +RW++ + GVD + G A Sbjct: 133 GWGAGQLDQEMKQADWFLSKVDDQLLFSCPAENRWQQGYQQAGVDFYRLATKIGQA 188 >UniRef50_D2QR79 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=D2QR79_9SPHI Length = 186 Score = 186 bits (472), Expect = 4e-46, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 16/180 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 LIA P + D F RSVV +CEHN G G+++N+ + +++ ++E Sbjct: 11 GDLLIAEPFMGDNNFERSVVLVCEHNAVGTFGLVLNQQTD-IQLGDVIED---------- 59 Query: 65 SIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLE---TLGTDKQP 121 I D P+ +GGP+ ++ +H P +SI + D + D ++ LGT + Sbjct: 60 -IHTDLPLFVGGPVQQNTLHFIHRRPDLIDNSICVVDGLYWSGDFDQIKRGVNLGTLTE- 117 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D+ +GY+ W +GQL+ E+L AW+ + + LF+TP + WRE K G + ++ Sbjct: 118 RDIRFFIGYSGWNEGQLDSELLQKAWIISRTKADFLFETPTTEFWREVLKRKGGEYKSIA 177 >UniRef50_A6LBX4 UPF0301 protein BDI_1431 n=6 Tax=Bacteroidales RepID=Y1431_PARD8 Length = 198 Score = 183 bits (466), Expect = 2e-45, Method: Composition-based stats. Identities = 54/179 (30%), Positives = 88/179 (49%), Gaps = 13/179 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 Q LIA P LQD F+RSVV + EH +G+MG ++NK + L + ++ PE Sbjct: 19 QGSILIAEPFLQDAYFQRSVVLLIEHTEHGSMGFVLNKKTD-LIVNSFFKEFAEFPE--- 74 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTTSRDVLETLGTDKQPS 122 P+ LGGP++ +R F +H+ N +++I+D + L+ + P Sbjct: 75 ------IPIYLGGPVSPNRLFFIHSLGDNIIPDALKINDYLYFDGDFNALKRYILNGHPI 128 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 D V LGY+ W +GQL EI N+W + + + W+++ +L+G D T Sbjct: 129 DGKVKFFLGYSGWTEGQLNHEIKRNSWAVSHITTDNILSADGEGYWKDSVELLGNDYKT 187 >UniRef50_D0LIU3 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LIU3_HALO1 Length = 198 Score = 183 bits (465), Expect = 3e-45, Method: Composition-based stats. Identities = 59/197 (29%), Positives = 99/197 (50%), Gaps = 20/197 (10%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+AMP L DP FRRSVV + EH+ G+ G++VN+P E L ++ + E L + + Sbjct: 6 LAPGLLLAMPHLLDPNFRRSVVLMVEHDDEGSFGLVVNQPTE-LSMDELYESLDLAWKGS 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASS--------------IRISDNTVMTTS 108 E++ V GGP+ +++H P + + S + + ++ + Sbjct: 65 SEAM-----VWRGGPVMPTHLWLVHAPLAGSSDSGTESALLGLGDGGTVAVGPELRVSGA 119 Query: 109 RDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWRE 168 L + ++ P+ + V LGYA W GQL QE+ AWL A ++F+TP + W Sbjct: 120 MPELIEMFGNEPPAQLRVLLGYAGWGGGQLAQEMSQGAWLHVDATPELIFETPAEEMWER 179 Query: 169 AAKLIGVDILTMPGVAG 185 A + +G++ T+ AG Sbjct: 180 AVRTLGINPETIIHGAG 196 >UniRef50_Q11U74 UPF0301 protein CHU_1773 n=2 Tax=Flexibacteraceae RepID=Y1773_CYTH3 Length = 182 Score = 181 bits (461), Expect = 8e-45, Method: Composition-based stats. Identities = 52/181 (28%), Positives = 84/181 (46%), Gaps = 14/181 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LI+ P L D F RSVV +CEHN +GA G ++NK L I +LE Sbjct: 4 KGKILISEPYLGDSTFERSVVLLCEHNDSGAFGFMLNK-STTLTINSVLE---------- 52 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTDK--Q 120 E + ++ + LGGP+A+D F L S+ I D+ + L+TL + + Sbjct: 53 EQLTFEQNLFLGGPVAQDSLFFLLRQDRAILKDSVHIKDDLYWGGDFEHLKTLIQEGTLE 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + LGY+ W + QLE E+ ++W+ A + +F W+ + +G D + Sbjct: 113 LDNCRFFLGYSGWGEDQLEYELEKHSWIIADINSEDMFVKNPESMWQNVLRSMGGDYKVL 172 Query: 181 P 181 Sbjct: 173 S 173 >UniRef50_A6C880 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C880_9PLAN Length = 188 Score = 181 bits (460), Expect = 1e-44, Method: Composition-based stats. Identities = 55/189 (29%), Positives = 80/189 (42%), Gaps = 12/189 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ HFL+A L D F RSVV I EHN GA G+IVN+P + + Sbjct: 4 SLKGHFLVASRKLNDLNFYRSVVLIVEHNEQGATGLIVNRPSSFSITNALSRYFDMP--- 60 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL---GTD 118 +L+ V +GGP+ + F LH S+ I + M +S ++ E + ++ Sbjct: 61 -----KLEDMVFMGGPVEPNGMFALHNAGDLEKSTEAIVPDLFMGSSPEIFEQVIWRISE 115 Query: 119 KQPS-DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 P D + G A W QLE EI WL PA +F+ D W + Sbjct: 116 GDPHLDFRIFFGCAGWAPLQLESEINRMDWLNTPATTEDIFEIDPYDIWDTLLDRAMAER 175 Query: 178 LTMPGVAGH 186 +P H Sbjct: 176 RFLPQETAH 184 >UniRef50_Q254Z3 UPF0301 protein CF0373 n=7 Tax=Chlamydiales RepID=Y373_CHLFF Length = 189 Score = 179 bits (454), Expect = 6e-44, Method: Composition-based stats. Identities = 46/179 (25%), Positives = 81/179 (45%), Gaps = 8/179 (4%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + L+A P +F RSV+ +CEH+ NG+ G+I+NK L + I K+T Sbjct: 11 KGSLLLASPDTDQGVFARSVILLCEHSLNGSFGLILNKTLGLELADDIFSFDKVT----- 65 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 + +GGPL ++ +LH+ ++ I + + L+ + Sbjct: 66 ---NNNIRFCMGGPLQANQMMLLHSCSEIPEQTLEICPSVYLGGDLSFLQEIAASDAGPM 122 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 + + GY+ W+ GQLE+E LD W APA + +F + W + K +G ++ Sbjct: 123 INLCFGYSGWQAGQLEREFLDGNWFLAPASYDYVFMDNPENLWSKILKDLGGKYASLST 181 >UniRef50_Q3KMF1 UPF0301 protein CTA_0231 n=9 Tax=Chlamydia RepID=Y231_CHLTA Length = 189 Score = 178 bits (453), Expect = 8e-44, Method: Composition-based stats. Identities = 53/179 (29%), Positives = 82/179 (45%), Gaps = 8/179 (4%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + L+A P + IF RSVV +CEH+ NG+ G+I+NK LE E I P D Sbjct: 11 KGSLLVASPDVNGGIFSRSVVLVCEHSLNGSFGLILNKILEIDLPEEIF--------PLD 62 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 +GGPL ++ +LHT P + SSI I + + + Sbjct: 63 HFDESKVRFCMGGPLQANQIMLLHTSPDSANSSIEICPSVFLGGDFSFAGEKEGRTRDDK 122 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +L+ GY+ W+ GQLE+E L+ W AP+ I+F W + + +G ++ Sbjct: 123 MLLCFGYSGWQGGQLEKEFLEGLWFLAPSSQEIIFTDAPERMWSDVLQHLGGRFASLST 181 >UniRef50_Q0BLI0 UPF0301 protein FTH_1193 n=18 Tax=Francisella RepID=Y1193_FRATO Length = 194 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. Identities = 58/182 (31%), Positives = 100/182 (54%), Gaps = 5/182 (2%) Query: 2 NLQHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 N + L+A P ++D I F +SVVY+C+++ +GAMG+I+NKPL + ++ + E+L I P Sbjct: 4 NHKSEILLATPLIKDDIVFTKSVVYLCQNDRHGAMGLIINKPLAD-TLKDVFEELHI-PH 61 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHT-PPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 L+ P+ +GGP++ + ILHT N+ S+I++ + +T S D+LE + + Sbjct: 62 TNTFKEILEYPLYMGGPISPHKIMILHTTNGRNYTSTIKLDEGLAITASIDILEDIANNI 121 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTA-PADLNILFKTPIADRWREAAKLIGVDIL 178 P L +GY+ W QL EI N W+ + ILF +W+ + G + Sbjct: 122 LPEYFLPVVGYSCWTANQLTDEIKSNDWIVTNKLNKKILFNHENKVKWQNHLEHAGYTLQ 181 Query: 179 TM 180 ++ Sbjct: 182 SL 183 >UniRef50_A3HT39 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HT39_9SPHI Length = 189 Score = 177 bits (450), Expect = 1e-43, Method: Composition-based stats. Identities = 54/181 (29%), Positives = 86/181 (47%), Gaps = 14/181 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LI+ P LQD F RSVV +CEHN G+ G+++NKP LK+ ++E L Sbjct: 11 AGDLLISEPFLQDENFVRSVVMLCEHNEEGSFGLVINKP-SILKLGELVESLDF------ 63 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVL-ETLGTDK-QP 121 LD V +GGP+ ++ ++T SI+I + + L E L T P Sbjct: 64 ----LDAEVFVGGPVEQNTLHYIYTGEKELERSIQIGTDLWWGGDYEQLVEKLKTGLINP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLN-ILFKTPIADRWREAAKLIGVDILTM 180 V +GY+ W QLE+E+ D W+ +++ F+ + WR+ K +G + + Sbjct: 120 DRVRFFIGYSGWGLDQLEEELEDKTWIVCRTEVDPKTFEYTPEELWRKLLKNMGGEFKVI 179 Query: 181 P 181 Sbjct: 180 A 180 >UniRef50_A3VRH6 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VRH6_9PROT Length = 207 Score = 177 bits (449), Expect = 2e-43, Method: Composition-based stats. Identities = 52/182 (28%), Positives = 80/182 (43%), Gaps = 10/182 (5%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L +++MP L D F +SV+YIC H+ A G+I+NKP+E + + + Sbjct: 22 GLAGRLIVSMPQLNDGPFAQSVIYICTHDIEHAFGLILNKPIEGVVATEAVADM------ 75 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E +D P+ GGP RG ILH+ S I ++T+ + L LGT P Sbjct: 76 --EEKDIDLPLFFGGPCEPRRGIILHSDQFVLEDSETIGAGLAISTTNEALAALGTPLLP 133 Query: 122 SD-VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + + G+A W GQL+ E+ + WL + F W A IG+ + Sbjct: 134 AQSARLFTGHAGWGPGQLDDELRRHTWLDLETSTDFAFS-DPETMWDRAMAEIGIPFQNL 192 Query: 181 PG 182 Sbjct: 193 TA 194 >UniRef50_C1ZJG4 Predicted transcriptional regulator, COG1678 n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJG4_PLALI Length = 188 Score = 175 bits (445), Expect = 6e-43, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 72/174 (41%), Gaps = 12/174 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ L+A L+D F ++VV I E N NG+MG+++N+P L + E ++ Sbjct: 4 SLRGKLLVASKQLKDSNFYKTVVLIVEDNENGSMGLVLNRPSSILVNHALSEHFQLPESA 63 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 V +GGP+ FILH + + S + E + P Sbjct: 64 E--------LVHVGGPVEPAALFILHNLEELSHEGTGVIPGVWLGNSGEAFEDVLRSSDP 115 Query: 122 S----DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 V G A W GQLE E+ W APA +I+F + + + + Sbjct: 116 HQPGVRFRVFCGCAGWSPGQLEGELAHGDWHVAPAIKSIVFAEDPYEIYEQMLQ 169 >UniRef50_Q5LDK5 UPF0301 protein BF2109 n=20 Tax=Bacteroides RepID=Y2109_BACFN Length = 196 Score = 174 bits (443), Expect = 1e-42, Method: Composition-based stats. Identities = 54/180 (30%), Positives = 82/180 (45%), Gaps = 13/180 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LI+ P L D F RSVV + +H G+MG+I+NKPL L + I+++ K Sbjct: 19 RGKILISEPFLHDVTFGRSVVLLVDHTEEGSMGLIINKPLP-LMLNDIIKEFKYI----- 72 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP-- 121 D P+ GGP+ D F LHT ++ I++ + D ++ P Sbjct: 73 ----EDIPLHKGGPIGTDTLFYLHT-LHEIPGTLPINNGLYLNGDFDAIKKYILQGNPIK 127 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + LGY+ WE QL QEI +N W+ + + L I W+EA +G T Sbjct: 128 GKIRFFLGYSGWECEQLIQEIKENTWIISKEENTYLMNEDIKGMWKEALGKLGSKYETWS 187 >UniRef50_B9XJW1 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XJW1_9BACT Length = 186 Score = 174 bits (442), Expect = 1e-42, Method: Composition-based stats. Identities = 51/180 (28%), Positives = 82/180 (45%), Gaps = 11/180 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ L+ L F+R+VV +C+H+ GA+G+++N+ N E +L L Sbjct: 8 LKGQLLLDSGQLSGSFFQRTVVLVCQHDAEGALGLVLNRDSGNKLGEMVLADL------- 60 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP- 121 D + LGGP+ L++ +S + N + S + L LG P Sbjct: 61 -PEQLTDNALYLGGPVQLSALSYLYSDTYLPEAS--VLPNVELGHSLETLVELGESFSPG 117 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + + GYA W GQLE+E+ AWLT PA ++++F T D W+ K G + Sbjct: 118 KRIKLFAGYAGWSPGQLEEEMKRKAWLTHPATVDLVFDTDPDDLWQYVLKQKGGMYRVLA 177 >UniRef50_A3ZQK2 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZQK2_9PLAN Length = 184 Score = 174 bits (442), Expect = 1e-42, Method: Composition-based stats. Identities = 53/179 (29%), Positives = 85/179 (47%), Gaps = 13/179 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ LIA P L DP F R+VV + +H+ GA+G+++ +P E L + E Sbjct: 3 SLQGQLLIASPHLPDPNFLRTVVLMVQHDEEGALGLVLTRPTE-------LTMAAMWREI 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL-GTDKQ 120 E I + V LGGP+ G ++ I I ++ ++ +E L D + Sbjct: 56 AGEEIADENLVFLGGPVQ---GPLMAIHSHAPCQEIEILPGVYFSSDKENIEKLVREDHE 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P + +GY+ W + QLE E+ WL PA+ +F T + W++ IG DI+ Sbjct: 113 PK--RIFIGYSGWGEQQLEAEMEAGGWLLLPAEAAHVFTTDVERLWKDVTGKIGADIMR 169 >UniRef50_A7H7H6 UPF0301 protein Anae109_0457 n=4 Tax=Anaeromyxobacter RepID=Y457_ANADF Length = 199 Score = 173 bits (440), Expect = 2e-42, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 92/180 (51%), Gaps = 4/180 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L FL+A PAL DP F S+V + EH+ GA+G +VN+P E + + Sbjct: 8 GLAPGFLVAAPALGDPNFAGSLVLMAEHHGEGALGFVVNRPGPVTVAEVLASVDEDLRRA 67 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPS---NFASSIRISDNTVMTTSRDVLETLGTD 118 + + R PV++GGP+ +R +IL P + ++ + + + SR++LE L Sbjct: 68 AEANGRAGAPVLVGGPVQPERLWILFRPGGIGADAEGAVPVGNGLSLGGSRELLEALVRA 127 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPAD-LNILFKTPIADRWREAAKLIGVDI 177 + L+ LGYA W Q+E+E+ AW+ + +++F P+ RW A + +G++ Sbjct: 128 PRGDPFLLLLGYAGWAPMQVEREVAAGAWVPLELEGSDLVFDVPLEQRWETAVRRLGLEP 187 >UniRef50_A9GTQ2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GTQ2_SORC5 Length = 198 Score = 172 bits (437), Expect = 4e-42, Method: Composition-based stats. Identities = 64/185 (34%), Positives = 87/185 (47%), Gaps = 16/185 (8%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FLIA P L DP F R+VV + H+ GA+G +VN+P + + +L Sbjct: 8 LAPGFLIASPPLGDPNFDRTVVLLAVHSEGGALGFVVNRPAP-MTLGELLSFAGY----- 61 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASS---IRISDNTVMTTSRDVLETLGTDK 119 ++ PV LGGP+ G+IL P+ A I + +T+SR +TL D Sbjct: 62 GNDLKDPAPVYLGGPVQPSSGWILCLDPALGAEETGVIPVGSRVRVTSSRSAFDTLAADA 121 Query: 120 -------QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 P V LGY+ W GQLE+EI AWL D ILF A RW +A L Sbjct: 122 VRGTAAADPRRRTVLLGYSGWGPGQLEREIAAGAWLPVSLDERILFDVEAAQRWEQAYAL 181 Query: 173 IGVDI 177 +G+ Sbjct: 182 LGLRP 186 >UniRef50_UPI0001C3133A protein of unknown function DUF179 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C3133A Length = 182 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 47/181 (25%), Positives = 80/181 (44%), Gaps = 12/181 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ L+A PALQDP F R+VV I EHN +GAMG+++N+P E Sbjct: 4 SLKGKLLLASPALQDPNFARTVVLIAEHNEDGAMGLVLNRPATTTVAES--------APE 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNT-VMTTSRDVLETLGTDKQ 120 +E + ++P+ +GGP+ +L A+ + + D+ ++ D + Sbjct: 56 LEELVEAEEPIYIGGPVQPSAVIVLAAFEEPAAAGLLVRDDVGFLSAEADFAT---SRDA 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + V G+A W GQL++E+ W+ P LF + W + G + Sbjct: 113 TRQLRVFAGHAGWGPGQLDEELEREDWIVEPPLPQELFSEDAEELWGDVLTRKGGAFALV 172 Query: 181 P 181 Sbjct: 173 A 173 >UniRef50_C3Q021 UPF0301 protein n=8 Tax=Bacteroides RepID=C3Q021_9BACE Length = 196 Score = 171 bits (433), Expect = 1e-41, Method: Composition-based stats. Identities = 48/178 (26%), Positives = 82/178 (46%), Gaps = 13/178 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLE-NLKIEGILEKLKITPEPR 62 Q LI+ P + D F R+VV + EHN G+MGII+NK ++ + ++ +L+ Sbjct: 17 QGSILISSPFMNDYHFTRAVVLLIEHNDEGSMGIIMNKDFRYHILLNDLIPELEFA---- 72 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 PV GGP++ + F LHT + ++ + + + + ++ D +P Sbjct: 73 -----QRVPVYKGGPMSRETIFFLHT-LKDLEGALPLGNGLYLNGDFNAVQQYILDGKPI 126 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 + + GYA W+ GQL +EI +N+WL A L D W + +G Sbjct: 127 EGVIRFFAGYAGWDHGQLAKEIKENSWLIGKAGKETLLNQHFRDLWHTSLNEMGGKYA 184 >UniRef50_C2G2S7 Transcriptional regulator n=3 Tax=Sphingobacteriaceae RepID=C2G2S7_9SPHI Length = 189 Score = 170 bits (432), Expect = 2e-41, Method: Composition-based stats. Identities = 44/182 (24%), Positives = 84/182 (46%), Gaps = 14/182 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + L++ P + D F+RSV+ + +HN +G +G I+N+ +L ++ + Sbjct: 9 KGSLLVSEPFMLDQNFKRSVILLADHNETDGTVGFILNQRT----------QLMLSDVFQ 58 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ-- 120 D D P+ LGGP+ + F +H S I D+ ++L L +++ Sbjct: 59 DVEREADFPIYLGGPVECEALFFIHKAYDLLLSGEHIIDDVYWGGDIELLLRLAKEEKIT 118 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLT-APADLNILFKTPIADRWREAAKLIGVDILT 179 +V +GY+ W QL++EI +N+W + ++ F T D W++A +G Sbjct: 119 SDEVKFFIGYSGWSPSQLDREIKENSWAVDNKFNKDLTFITDGEDLWKQALISMGQKYAH 178 Query: 180 MP 181 + Sbjct: 179 VA 180 >UniRef50_C7PSM3 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PSM3_CHIPD Length = 184 Score = 169 bits (430), Expect = 3e-41, Method: Composition-based stats. Identities = 47/185 (25%), Positives = 80/185 (43%), Gaps = 15/185 (8%) Query: 1 MNLQ-HHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKIT 58 ++L LIA P L+D F R+VV +CEH G+ G ++NK + E + E L Sbjct: 2 VSLSPGILLIADPFLKDQNFARTVVLLCEHQESRGSFGFVLNKVFDQSLNELVPEVL--- 58 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 + V GGP+ D +H P I D D + +L Sbjct: 59 --------INNIRVYYGGPVQIDTIHFIHQQPELIRGGFEIRDGVYWGGEFDQVVSLINS 110 Query: 119 K--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 + + +GY+ W GQLE E+ + +W+ + ++ ++F+ + W +A K +G + Sbjct: 111 GRLDLNKIKFFIGYSGWSSGQLENELNEKSWILSESNAPLIFEAKEQNIWPQALKNLGAN 170 Query: 177 ILTMP 181 M Sbjct: 171 FAIMA 175 >UniRef50_A5FNN9 Putative uncharacterized protein n=18 Tax=Bacteroidetes RepID=A5FNN9_FLAJ1 Length = 209 Score = 168 bits (426), Expect = 9e-41, Method: Composition-based stats. Identities = 48/182 (26%), Positives = 85/182 (46%), Gaps = 16/182 (8%) Query: 4 QHHFLIAMPAL-QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + H LIA P++ D F RSV+ + +HN G++G I+NKPL+ I ++ ++ Sbjct: 31 KGHLLIAEPSIIGDLSFNRSVILLADHNKEGSIGFIINKPLKY-TINDLIPEID------ 83 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ-- 120 + + GGP+ +D + +H P +S+ IS+ + + L D Sbjct: 84 -----ANFKIYNGGPVEQDNLYFIHNIPDLIPNSVEISNGIYWGGDFESTKDLINDGSIN 138 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPAD-LNILFKTPIADRWREAAKLIGVDILT 179 +++ LGY W++ QLE E+ N+W+ A + N + W+E +G D L Sbjct: 139 KNNIRFFLGYTGWDENQLENEMQGNSWIIADNNYKNKIIGKSTTHFWKEQIIELGGDYLI 198 Query: 180 MP 181 Sbjct: 199 WS 200 >UniRef50_C1D0N0 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=C1D0N0_DEIDV Length = 185 Score = 165 bits (419), Expect = 6e-40, Method: Composition-based stats. Identities = 54/182 (29%), Positives = 88/182 (48%), Gaps = 14/182 (7%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 FL+A P LQ +F +V+ + EH+ GAMG+IVN P E + + + Sbjct: 17 FLVASPHLQGEVFEGTVILLLEHDRKGAMGLIVNAPTPQTVAELMAD-----------AA 65 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLV 126 ++ LGGP+ G+ L+ P I++ D+ +++S +VL + + ++ Sbjct: 66 GQNRRAWLGGPVDPTLGWCLYHHPVGLDGEIKLVDDLHLSSSLEVLRAVMASD--QEYML 123 Query: 127 ALGYASWEKGQLEQEILDNAWLTAP-ADLNILFKTPIADRWREAAKLIGVDILTMPGVAG 185 LGYA W GQLE+E AW+ + +L++ P RW EA K +GV T+ Sbjct: 124 ILGYAGWTAGQLEEEARAGAWVWVEQSTPELLWEVPAPQRWAEALKRLGVTPGTLMPGGA 183 Query: 186 HA 187 A Sbjct: 184 QA 185 >UniRef50_D2R140 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R140_9PLAN Length = 183 Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats. Identities = 52/175 (29%), Positives = 82/175 (46%), Gaps = 14/175 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ HFL A P L DP F R+VV + +H+ GA+G+++ +P++ E + Sbjct: 3 SLQGHFLAASPHLGDPNFFRTVVLMIKHDAQGALGLVLTRPMQETVAE-------LWQRV 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTT-SRDVLETLGTDKQ 120 E+I V LGGP+ +H S + + D + S + + K+ Sbjct: 56 TAETIANTGSVHLGGPV-NGPLVAIHRMASAAEA--EVFDGVYFSAHSEQISRIVHQTKK 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 P L+ GY+ W GQLE E+ WL APA ++F + D W + IG+ Sbjct: 113 P--YLLFAGYSGWSGGQLEAELEQGGWLIAPATTELVFSS-TDDLWERVVQSIGL 164 >UniRef50_B0SHS8 Transcriptional regulator n=6 Tax=Leptospira RepID=B0SHS8_LEPBA Length = 188 Score = 162 bits (410), Expect = 7e-39, Method: Composition-based stats. Identities = 50/177 (28%), Positives = 87/177 (49%), Gaps = 13/177 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + + LI+ ++ F +SVV + +H+ +GA G+++NKP + +E +++ L Sbjct: 7 STRGKLLISNSSVIQDFFHKSVVLMVDHDDDGAFGLVLNKPTDQ-TMESLIKNL------ 59 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD-VLETLGTDKQ 120 +++ +KPV GGP+ ILH + + M S D +LE L +D+ Sbjct: 60 -PDTVHSNKPVYAGGPVDNLFVSILHNGKQTADPGVEVVPGIYMARSFDTMLEVLSSDQ- 117 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAP-ADLNILFKTPIADR-WREAAKLIGV 175 V GYA W GQLE E +W+ + D +I+FK ++ W+EA + G Sbjct: 118 -IQFRVLQGYAGWSSGQLESEFDRLSWVVSDLVDDSIVFKEDESEVIWKEALRSKGG 173 >UniRef50_Q2BRE1 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BRE1_9GAMM Length = 151 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 52/155 (33%), Positives = 87/155 (56%), Gaps = 6/155 (3%) Query: 35 MGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILHT--PPSN 92 MG++VN+P+ + + + + L + P + + GGP+ +RG++LH P Sbjct: 1 MGLVVNRPV-GITLSDLCDHLNL---PCISNENQQDEIFSGGPVKPERGYVLHRSSDPFE 56 Query: 93 FASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPA 152 + SS +++ ++TS D +E + + L+ALG A W GQLEQEI DN WL+ PA Sbjct: 57 WPSSHCVAEEIFLSTSVDAIEAAAEGRFKHEYLIALGCAGWSPGQLEQEISDNVWLSCPA 116 Query: 153 DLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 + +ILF P DR + AA ++G+++ + GHA Sbjct: 117 NSDILFGIPAGDRLQAAASILGINLDLLTAHPGHA 151 >UniRef50_A6E847 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6E847_9SPHI Length = 168 Score = 160 bits (405), Expect = 3e-38, Method: Composition-based stats. Identities = 39/169 (23%), Positives = 75/169 (44%), Gaps = 14/169 (8%) Query: 16 DPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLG 75 DP F+RSVV + +H G +G I+N+ + + + E + PV +G Sbjct: 2 DPNFKRSVVLLTDHQEEGTVGFILNQRSTLILSDLVPEFAGVA-----------LPVYIG 50 Query: 76 GPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QPSDVLVALGYASW 133 GP+A D +H ++ + + L+ L +P+++ +GY+ W Sbjct: 51 GPVATDTLHFIHRCYDRLNDGQEVAKGIYWGGNFEALKVLLLTGSIEPAEIKFFIGYSGW 110 Query: 134 EKGQLEQEILDNAWLTAP-ADLNILFKTPIADRWREAAKLIGVDILTMP 181 +GQL+ E+ +N W+ + +++F + WREA +G + Sbjct: 111 SEGQLKLELEENTWMVSDRFHADVVFSDNEEELWREAVINLGPRYAHIS 159 >UniRef50_A4C260 Putative transcriptional regulator n=1 Tax=Polaribacter irgensii 23-P RepID=A4C260_9FLAO Length = 185 Score = 158 bits (400), Expect = 9e-38, Method: Composition-based stats. Identities = 44/175 (25%), Positives = 78/175 (44%), Gaps = 15/175 (8%) Query: 4 QHHFLIAMPA-LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + L+A P+ L D F +++V + EH N ++G I+NKPL + +L +K + + Sbjct: 8 KGRLLVAEPSILNDTSFNKAIVLLTEHTANNSVGFILNKPLAY-NLNDLLPNIKCSFK-- 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--Q 120 + GGP+ +D + LH P + SI +S+ + L L + Sbjct: 65 ---------IYQGGPVEQDNLYFLHRVPQLLSKSIAVSNGVYWGGDFNQLTELLNNSVLD 115 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 S++ LGY+ W+K QL E+ + +W D + + W+E G Sbjct: 116 TSEIRFFLGYSGWDKEQLGAELKEKSWFVTENDFENILSNDEKNLWKEKLLQKGG 170 >UniRef50_UPI0001745679 hypothetical protein VspiD_25265 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745679 Length = 204 Score = 157 bits (399), Expect = 1e-37, Method: Composition-based stats. Identities = 50/181 (27%), Positives = 82/181 (45%), Gaps = 15/181 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +L L+A PAL+DP F +V+ + HN +GA G I+N+PL+ ++ +L+ Sbjct: 29 SLSGSLLVASPALRDPNFFHTVLLLASHNTEDGAFGYILNRPLDK-RVADLLDD------ 81 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 +D + PV LGGP+ ++ L N+ S R M T + + + Sbjct: 82 -KDLGRLGEVPVFLGGPVGTNK---LSFAAFNWNSKKR---ELRMQTHLSTEQAMKELDK 134 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 V +GY+ W +GQLE E+ N+W+T I+ P D W +G + Sbjct: 135 GRSVRGFVGYSGWSEGQLENELEQNSWITCAPLSKIVTAQPSTDLWTTVLDDLGPYYKLL 194 Query: 181 P 181 Sbjct: 195 A 195 >UniRef50_Q1Q3L0 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3L0_9BACT Length = 188 Score = 156 bits (394), Expect = 5e-37, Method: Composition-based stats. Identities = 49/178 (27%), Positives = 76/178 (42%), Gaps = 12/178 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LIA P DP F ++VV ICEH+ G +G+I+NK L E + + Sbjct: 7 KGSILIANPQGTDPNFMQTVVLICEHSKRGTLGLILNKTLGKKGQEIFVSSANTKTK--- 63 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDKQPS 122 DK + GGP+ + F LH N + ++I + + +++ + K S Sbjct: 64 -----DKEIFFGGPVDTNNMFYLHGNFKNETHNCVKICEGVYLGSNQGCFNAFMSRKNVS 118 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI-GVDI 177 D + LG A W GQLE EI W A ++F + W + I +D Sbjct: 119 DNIFRLYLGCACWSGGQLESEIETKCWTVGTATEKMVFYPSPDNIWWNILRSISSIDP 176 >UniRef50_C6X421 Putative transcriptional regulator n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X421_FLAB3 Length = 183 Score = 154 bits (391), Expect = 1e-36, Method: Composition-based stats. Identities = 47/181 (25%), Positives = 79/181 (43%), Gaps = 16/181 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + + +I+ P + IF RSVV + +HN GA G+I+NK +N+ + Sbjct: 5 SYKGKIIISTPDISGDIFSRSVVLVIDHNAEGAFGLILNKKNQNMSARLL---------- 54 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD-VLETLGTDK- 119 R+D V GGP+ D+ F ++ S I+D +T + V+ + + Sbjct: 55 NIFGFRVD--VYEGGPVENDKIFFINKGEKVTESFSEINDGFYLTEDIENVVAAIIEGRL 112 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPA-DLNILFKTPIADRWREAAKLIGVDIL 178 D+ V GY+ W GQLE EI W +L+ T + W+ + +G + L Sbjct: 113 SAEDIKVFSGYSGWAPGQLENEIRRKLWTVVDVYNLDYTLPTDHS-LWKNIMQNLGGEFL 171 Query: 179 T 179 Sbjct: 172 L 172 >UniRef50_Q47MA0 UPF0301 protein Tfu_2389 n=3 Tax=Actinomycetales RepID=Y2389_THEFY Length = 198 Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats. Identities = 51/191 (26%), Positives = 81/191 (42%), Gaps = 17/191 (8%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITP 59 ++L L+A P L+DP F RSVV++ + + G +G+I+N+P E L + +L + Sbjct: 8 LSLTGALLVATPLLEDPNFYRSVVFVIDDTPDEGTLGVILNRPSE-LGVGEVLAEWG--- 63 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTD 118 E + + GGP+ +D G L P + D T + L T+ D Sbjct: 64 ----EHVSQPAVMFAGGPVGQDAGLALAVPDDGQRPLGWKSLDAMDAKTWPNGLGTVDLD 119 Query: 119 KQP-------SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 P + V GYA W GQL EI AW PA ++ +F W + Sbjct: 120 TPPQLVADALRQMRVFAGYAGWSAGQLRAEIDQGAWYVLPATVDDVFCADPRGLWSRVLR 179 Query: 172 LIGVDILTMPG 182 G ++ + Sbjct: 180 RQGGELAFVAT 190 >UniRef50_C7MS43 Predicted transcriptional regulator n=3 Tax=Actinomycetales RepID=C7MS43_SACVD Length = 198 Score = 154 bits (389), Expect = 2e-36, Method: Composition-based stats. Identities = 47/186 (25%), Positives = 75/186 (40%), Gaps = 15/186 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P + DP FRR+VV++ +H G +G+++N+P E E + EPR Sbjct: 18 GTLLVAAPTMFDPNFRRTVVFVIDHRAEGTLGVVLNRPSEVAVREVLPRWGDHVAEPRS- 76 Query: 65 SIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ---- 120 V +GGP+ + L + ++ + L L +D + Sbjct: 77 -------VFVGGPVEKKTALCLAALRTGETAAT--VPGVIGVRGPVALVDLDSDPEMLAS 127 Query: 121 -PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 + V GYA W+ GQL EI WL PA + + P D W + G+ Sbjct: 128 KVRGLRVFAGYAGWDGGQLASEIERGDWLIVPALPSDVMAGPTRDLWGHVLRRQGLPTAL 187 Query: 180 MPGVAG 185 + G Sbjct: 188 LATHPG 193 >UniRef50_Q82D55 UPF0301 protein SAV_5129 n=12 Tax=Actinomycetales RepID=Y5129_STRAW Length = 193 Score = 152 bits (386), Expect = 4e-36, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 72/177 (40%), Gaps = 16/177 (9%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PAL DP F R+VV + +H+ G++G+++N+P + + + EP Sbjct: 9 SLTGRLLVATPALADPNFDRAVVLLLDHDEEGSLGVVLNRPTPVDVSDILEGWADLAGEP 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA------SSIRISDNTVMTTSRDVLETL 115 V GGP++ D + P + R+ + E L Sbjct: 69 --------GVVFQGGPVSLDSALGVAVIPGGASVDGAPLGWRRVHGAIGLVDLEAPPELL 120 Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + + GYA W GQLE E+++ AW ++ + WRE + Sbjct: 121 AKALG--SLRIFAGYAGWGPGQLEDELVEGAWYVVESEPGDVSSPSPERLWREVLRR 175 >UniRef50_Q0FVR8 Putative uncharacterized protein (Fragment) n=1 Tax=Roseovarius sp. HTCC2601 RepID=Q0FVR8_9RHOB Length = 158 Score = 151 bits (382), Expect = 1e-35, Method: Composition-based stats. Identities = 52/134 (38%), Positives = 76/134 (56%), Gaps = 10/134 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L LIAMP + DP F SV+Y+C H+ GAMG+IVNKP ++ + +LE+L ITP P Sbjct: 9 DLTGKILIAMPGMGDPRFEHSVIYLCAHSEEGAMGLIVNKPSADVSMAALLEQLSITPSP 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDKQ 120 + V GGP+ RGF+LH+P ++++++D MT + DVLET+ Sbjct: 69 GLGP----RQVHFGGPVEMGRGFVLHSPDYMSGLTTLQVNDGFSMTGTLDVLETIARGDG 124 Query: 121 PSDVLVALGYASWE 134 P A G+ W Sbjct: 125 P-----ATGWRCWA 133 >UniRef50_B4CVG9 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVG9_9BACT Length = 186 Score = 149 bits (377), Expect = 4e-35, Method: Composition-based stats. Identities = 50/175 (28%), Positives = 77/175 (44%), Gaps = 14/175 (8%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITP 59 ++L LIA P L DP FRRSV++I ++ G+ G+I+N+P E + Sbjct: 9 ISLAGSLLIAHPGLLDPNFRRSVLFISSNDAQEGSFGLIINRPASRTVAELLPN------ 62 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 +D + PV LGGP+A D+ + + V+ + +++ Sbjct: 63 --KDLGMLSRVPVFLGGPVATDQLVFAAFQWHEETERMVCRPHLVIDEAAEIVHD----- 115 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + + V +GYA W KGQLE E+ WL PA + L WRE G Sbjct: 116 ETTIVRAFVGYAGWSKGQLEGELAQRTWLVRPAARDTLDLERCPTLWREITSTFG 170 >UniRef50_C1RPZ7 Predicted transcriptional regulator, COG1678 n=12 Tax=Actinomycetales RepID=C1RPZ7_9CELL Length = 184 Score = 149 bits (376), Expect = 6e-35, Method: Composition-based stats. Identities = 43/181 (23%), Positives = 71/181 (39%), Gaps = 12/181 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P L+D FRR+VV + +H GA+G+++++PL+ + +L + Sbjct: 6 TGRLLVATPGLRDRSFRRAVVLVLDHTAEGALGVVLDRPLD-IDARTVLPQW-------Q 57 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPP--SNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E + + GGP+A D L P +S + D L D Sbjct: 58 EHLSTPGRLFQGGPVARDTALALADLPGADAPPGVQALSPRLGVV-DLDAPPALVVDA-V 115 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + V +GYA W GQL+ E+ W + F WR + D+ + Sbjct: 116 RALRVFVGYAGWGPGQLDDEVDVGGWFVVDHEPGDAFSADPRGLWRRVLRRQPGDLALLS 175 Query: 182 G 182 Sbjct: 176 T 176 >UniRef50_A9RVW3 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RVW3_PHYPA Length = 324 Score = 147 bits (373), Expect = 1e-34, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 70/177 (39%), Gaps = 15/177 (8%) Query: 5 HHFLIAMPAL---QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 LIA P F R V++I H+ G+ G+I+N+P + + + E + PE Sbjct: 145 GCLLIAHPNAFTESQQYFHRVVIFIFAHDAGGSAGVILNRPTQY-SLGQLDEFKDLMPE- 202 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ- 120 P+ GG + ++H P S I + M + + + + + + Sbjct: 203 -----LSSCPLYFGGDVGPQCTQVIHGIP-GLEDSREIMNGVYMGGTASIQDNIRSGQST 256 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIG 174 P+D L +A W GQLEQE+ W A + K W E + +G Sbjct: 257 PNDYRWFLRFAGWGPGQLEQEVAAGVWYLASCSKRFVLKQCIQLPKPLWNEVMEHMG 313 >UniRef50_B1ZWF6 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZWF6_OPITP Length = 184 Score = 146 bits (369), Expect = 4e-34, Method: Composition-based stats. Identities = 45/182 (24%), Positives = 75/182 (41%), Gaps = 15/182 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PAL+DP FRR++V + HN GAMG+++N+P+ ++L Sbjct: 11 SLAGSLLLAHPALRDPNFRRAIVLMSVHNAEGAMGVVLNRPMG--------KRLGELNGE 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 P+ GGP+ ++ ++ P R + L ++ Sbjct: 63 FALGSLASVPLFHGGPVQTEQLVLVAWQPQ--EDGFR----LHFGVEPERAMQLAAEEG- 115 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + + LGY+ W GQLE E+ WL A +L A WR +G + + Sbjct: 116 TQLRAFLGYSGWGGGQLEAELKQKTWLVADMPAGLLEGPQDAAMWRSVVSSLGEEWRLLA 175 Query: 182 GV 183 Sbjct: 176 QE 177 >UniRef50_C0YNI4 Transcriptional regulator n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YNI4_9FLAO Length = 182 Score = 145 bits (366), Expect = 9e-34, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 70/180 (38%), Gaps = 14/180 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + + LI+ P + IF RSVV + EHN +GA G+I+NK + K K + Sbjct: 4 SYKGKILISTPDISGDIFSRSVVLVIEHNESGAFGLILNKKNSQMS-----SKFKDFFDF 58 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD--VLETLGTDK 119 + E V GGP+ D+ F + I+D +T + + L ++ Sbjct: 59 KIE-------VYDGGPVENDKVFFIVKGKRVTEIYTDITDEYYLTEDIERIINAVLSSEL 111 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 + + GY+ W QL+ E+ W W+ + +G + L Sbjct: 112 SIEHIKIFSGYSGWSPNQLDTEVQRKMWTVVDVYNLDYTLPNDQTLWKSIMQNLGGEFLL 171 >UniRef50_B1MML1 UPF0301 protein MAB_4928c n=20 Tax=Corynebacterineae RepID=Y4928_MYCA9 Length = 208 Score = 144 bits (364), Expect = 1e-33, Method: Composition-based stats. Identities = 50/190 (26%), Positives = 80/190 (42%), Gaps = 22/190 (11%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LIA L +P FRRSV++I EHN G +G+++N+P E + + K+ +P Sbjct: 27 AGTLLIANTNLFEPTFRRSVIFIVEHNDGGTLGVVLNRPSETAVYNVLPQWAKLAGKP-- 84 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 K + +GGP+ D L T + SI R + + D +P D Sbjct: 85 ------KTMFVGGPVKRDAALCLATLRAGV--SIDGVKGLRHVAGR--MAMVDLDAEPED 134 Query: 124 -------VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 + V GY+ W GQLE E+ + W+ A + + D W + + Sbjct: 135 IAPLVEGIRVFAGYSGWTIGQLEGEVERDDWIVLSALPSDVLTDASEDLWAKVLRR---Q 191 Query: 177 ILTMPGVAGH 186 L + +A H Sbjct: 192 PLPLSLLATH 201 >UniRef50_C4DLM5 Predicted transcriptional regulator, COG1678 n=5 Tax=Actinomycetales RepID=C4DLM5_9ACTO Length = 196 Score = 142 bits (359), Expect = 6e-33, Method: Composition-based stats. Identities = 47/177 (26%), Positives = 74/177 (41%), Gaps = 13/177 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PALQDP F R+VV + H + GA+G+++N+ E E + + ++ EP Sbjct: 15 SLVGRLLVATPALQDPNFERTVVLLVSHESAGALGVVLNRATEVPVAEVLGDWSELAREP 74 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILH--TPPSNFASSIRISDNTVMTTSRDV-LETLGTD 118 + GGP+ + L S + + T V E L Sbjct: 75 A--------VLFEGGPVQPEAAIALGWMRSGVGEPSCFKPFAGRLGTLDLSVDPEPLADR 126 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 + V GY+SW GQL+ E+ D AW+ + F + D W + G Sbjct: 127 LEGM--RVFAGYSSWGAGQLDDELKDGAWMVFDSLPGDPFGSRPEDLWAMVWRRQGG 181 >UniRef50_B7G677 Predicted protein n=2 Tax=Bacillariophyta RepID=B7G677_PHATR Length = 393 Score = 138 bits (349), Expect = 9e-32, Method: Composition-based stats. Identities = 43/187 (22%), Positives = 80/187 (42%), Gaps = 14/187 (7%) Query: 8 LIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 LIA L +F ++VV I +H+ G+ GI++N+P++ ++ E+ + + + + Sbjct: 199 LIANEKLG-GVFHQTVVLIIDHHETTGSTGIVINRPMDGDLLKIASEQ-ESSLDLSLKLA 256 Query: 67 RLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK-QPSDVL 125 V GGP+ D +LH S ++ + S +++ + T + P+ L Sbjct: 257 FSQARVTYGGPVLTDEFSVLHGFG-EVEGSRKLCPGVYIGGSEELMNEVRTLRFDPAHAL 315 Query: 126 VALGYASWEKGQLEQEILDNAWLTAPADLNILF---------KTPIADRWREAAKLIGVD 176 G+A W GQL +EI W TA A + + D W + +G + Sbjct: 316 FVKGHAGWVPGQLTREISKGVWYTAAASSDFILRYAGAPVTEDDNANDLWADILSCMGGN 375 Query: 177 ILTMPGV 183 + G Sbjct: 376 YAKIAGK 382 >UniRef50_C1B7P4 UPF0301 protein ROP_34500 n=13 Tax=Corynebacterineae RepID=Y3450_RHOOB Length = 201 Score = 137 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 41/176 (23%), Positives = 74/176 (42%), Gaps = 21/176 (11%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L++ L +P FRR+V+Y+ EHN G++G+++N+P E + + + +T P Sbjct: 21 GSLLVSSTDLVEPAFRRTVIYVIEHNEAGSLGVVINRPSETAVHDVLPQWAPLTARPS-- 78 Query: 65 SIRLDKPVMLGGPLAEDRGFILHT-----PPSNFASSIRISDNTVM---TTSRDVLETLG 116 + +GGP+ D L T R+ VM + +V+ L Sbjct: 79 ------ALYVGGPVKRDAALCLATLRTGAQADGVRGLRRVHGRVVMVDLDSDPEVVAPLV 132 Query: 117 TDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 V + GY+ W GQL+ E+ + W+ A + + D W + + Sbjct: 133 EG-----VRIFAGYSGWTYGQLDSELQRDDWIVISALASDVLAPARVDVWAQVLRR 183 >UniRef50_A1SG68 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A1SG68_NOCSJ Length = 191 Score = 136 bits (344), Expect = 3e-31, Method: Composition-based stats. Identities = 39/171 (22%), Positives = 70/171 (40%), Gaps = 11/171 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A PAL DP F +VV + + + GA+G+++N+P + ++ +L+ Sbjct: 12 AGMLLVATPALLDPNFADTVVLLLDVDEQGALGVVLNRP-SAIPVDDVLDGWGDVAAE-- 68 Query: 64 ESIRLDKPVMLGGPLAED--RGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + + GGP+ L + R+ D + D L Sbjct: 69 -----PEVLFQGGPVGLQGALAVALLARADDVPVGFRVVDGRLGLVDLDTPLELVRG-GL 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + V GYA W QL EI + +W P + +F++ +D WR+ + Sbjct: 123 EGLRVFAGYAGWGADQLRDEIEEGSWYVVPGEARDVFRSDASDLWRDVLRR 173 >UniRef50_B5JR88 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JR88_9BACT Length = 187 Score = 135 bits (340), Expect = 9e-31, Method: Composition-based stats. Identities = 41/179 (22%), Positives = 73/179 (40%), Gaps = 12/179 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+A P L+DP F SVV + H +G++G+++NK G E+L Sbjct: 12 LTGSLLLAHPHLKDPNFASSVVLLTRHEESGSLGVVLNK--------GTGERLGQLSSEF 63 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + PV LGGP+ +++ + + V ++ Sbjct: 64 ADCGLGEVPVYLGGPVNQNQIILAAWKLIPEKGQFQ----LYFGMEPLVAQSKMETDPDL 119 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + GY+ W +GQL E+ DNAW+ + D + +D WR + ++ + Sbjct: 120 EFRAFKGYSGWSEGQLVGELEDNAWVVSEVDAESISTKEGSDLWRHLIMEVNPELGLLS 178 >UniRef50_C8XE74 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=C8XE74_NAKMY Length = 190 Score = 129 bits (326), Expect = 3e-29, Method: Composition-based stats. Identities = 42/172 (24%), Positives = 67/172 (38%), Gaps = 11/172 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P L+DP FRR+VVY+ H+ +G +G+I+N+P E + T P Sbjct: 9 AGMLLVATPGLRDPHFRRTVVYLVAHSVDGTVGVILNRPSETAVQNVLPGWASHTARP-- 66 Query: 64 ESIRLDKPVMLGGPLAEDRGFIL--HTPPSNFASSIRISDNTVMTTSRDVLETLGT-DKQ 120 V GGP+ L +N + T D+ T + Sbjct: 67 ------HAVFAGGPVQTSAAMCLGVCRIGTNPREVQGVVGVTGPVVLVDLDGDPATVTQS 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + + G A W+ QL EI++ +W P + + P D W + Sbjct: 121 LRGIRIYAGRAGWDAEQLVDEIIEGSWYVVPGLPDDVLAGPRTDLWFSVLRR 172 >UniRef50_C0AXX7 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXX7_9ENTR Length = 93 Score = 129 bits (325), Expect = 5e-29, Method: Composition-based stats. Identities = 48/93 (51%), Positives = 64/93 (68%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL +HFLIAMP+L DP+F RSVVY+CEHN NGAMG+I+NKP+E++ +EG+L++L+I Sbjct: 1 MNLLNHFLIAMPSLSDPLFERSVVYVCEHNENGAMGLIINKPIEDISVEGVLDQLEIFST 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF 93 RDE+I L K + P F L F Sbjct: 61 DRDEAISLQKTCDVRRPSCRRAWFYLTYSSVRF 93 >UniRef50_Q9LQ30 F14M2.10 protein n=6 Tax=rosids RepID=Q9LQ30_ARATH Length = 341 Score = 129 bits (325), Expect = 5e-29, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 69/188 (36%), Gaps = 17/188 (9%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICE----HNTNGAMGIIVNKPLENLKIEGILEKLKIT 58 L+A L F R+VV + H G G+++N+PL ++ +K T Sbjct: 154 TGCVLVATEKLDGYRTFARTVVLLLRAGTRHPQEGPFGVVINRPL-----HKNIKHMKST 208 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTT--SRDVLETL 115 + + + GGPL F+L T + T S D L Sbjct: 209 KTELATT-FSECSLYFGGPLEA-SMFLLKTGDKTKIPGFEEVMPGLNFGTRNSLDEAAVL 266 Query: 116 GTDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 +P + +GYA W+ QL +EI + W A +++ + W E +L+ Sbjct: 267 VKKGVLKPQEFRFFVGYAGWQLDQLREEIESDYWHVAACSSDLICGASSENLWEEILQLM 326 Query: 174 GVDILTMP 181 G + Sbjct: 327 GGQYSELS 334 >UniRef50_A4S673 Predicted protein (Fragment) n=2 Tax=Ostreococcus RepID=A4S673_OSTLU Length = 288 Score = 129 bits (324), Expect = 7e-29, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 19/190 (10%) Query: 4 QHHFLIAMPA---LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L+A + F ++V+ + EH+ NG+MG+I+N+P + + Sbjct: 85 KGCLLVAADHEFRMSQQYFHQAVILVLEHHENGSMGVILNRPTQY--------DMGYVSG 136 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + + GG + + LH + S+ + + E + D Sbjct: 137 EANGPFAKN-ALYFGGDVGDGTVSFLHGR-EDVKGSVEVLPGVYLGGYDSACELVQQDGS 194 Query: 121 ---PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIG 174 + Y W GQLE E W A + K WRE ++L G Sbjct: 195 TCHADEFKFFARYCGWAPGQLESECERGVWFPVAAAKELSLKQVIQLPKPLWREISELCG 254 Query: 175 VDILTMPGVA 184 ++ M A Sbjct: 255 GELEEMARKA 264 >UniRef50_C1E5G6 Predicted protein n=2 Tax=Micromonas RepID=C1E5G6_9CHLO Length = 369 Score = 128 bits (323), Expect = 8e-29, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 65/188 (34%), Gaps = 17/188 (9%) Query: 4 QHHFLIAMPA---LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L+A L F ++V+ + EH+ G+MG+I+N+P + + Sbjct: 184 KGCLLLAAADEFTLGQQYFHQAVILLLEHHDKGSMGVILNRPTQY--------NMGYVSG 235 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK- 119 D + + GG + + LH S + + E + ++ Sbjct: 236 QSDGP-FAENALYFGGDVGDGTVSFLHGSDK-VQGSAEVLPGVYLGGYDSACELVKKEEV 293 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVD 176 ++ Y W GQL++E W + K WRE +L G + Sbjct: 294 DANEFKFFARYCGWAPGQLKRECERGVWYPVACSKQLALKQVIQLPKPLWREILELCGGE 353 Query: 177 ILTMPGVA 184 + + A Sbjct: 354 LKSAAARA 361 >UniRef50_Q8NL65 UPF0301 protein Cgl3084/cg3414 n=5 Tax=Corynebacterium RepID=Y3084_CORGL Length = 189 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 39/173 (22%), Positives = 70/173 (40%), Gaps = 15/173 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P + F RS+V I EH+ G+ ++ + + E + +T +P Sbjct: 9 GMLLVAAPDMASEDFERSIVLIIEHSPATTFGVNISSRSDVAVANVLPEWVDLTSKP--- 65 Query: 65 SIRLDKPVMLGGPLAEDR--GFILHTPPSNFASSI---RISDNTVMTTSRDVLETLGTDK 119 + + +GGPL++ G + P + +S ++++ V R E + D Sbjct: 66 -----QALYIGGPLSQQAVVGLGVTKPGVDIENSTSFNKLANRLVHVDLRSAPEDVADDL 120 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + GYA W GQL +EI W PA + + D W + + Sbjct: 121 EGM--RFFAGYAEWAPGQLNEEIEQGDWFVTPALPSDIIAPGRVDIWGDVMRR 171 >UniRef50_Q8FSW7 UPF0301 protein CE2927 n=11 Tax=Corynebacterium RepID=Y2927_COREF Length = 201 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 15/173 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P L P F RSV+ + EH+ G+ + + + E ++T +P Sbjct: 21 GSLLVAAPDLASPEFSRSVILVIEHSHATTFGVNLASRSDLAVANVLPEWTELTAKP--- 77 Query: 65 SIRLDKPVMLGGPLAEDR--GFILHTPPSNFASSIR---ISDNTVMTTSRDVLETLGTDK 119 + + +GGPL++ G + P + SS + +++ V R + + D Sbjct: 78 -----QALYIGGPLSQQAVVGLGVTKPGVDIESSTKFNKLANRLVHVDLRVTPDEVRDDL 132 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + GYA W GQL EI W APA + + D W + + Sbjct: 133 EGM--RFFAGYAEWAPGQLNDEIEQGDWYVAPALPSDVLAPGRVDVWGDVMRR 183 >UniRef50_C7Q9Z6 Putative uncharacterized protein n=5 Tax=Actinomycetales RepID=C7Q9Z6_CATAD Length = 205 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 47/186 (25%), Positives = 75/186 (40%), Gaps = 17/186 (9%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+A L DP F R+VV + +H+ +G +G+++N+P +L +E +LE Sbjct: 21 LTGKLLVATTVLVDPNFDRTVVLVVDHDDDGTLGVVLNRP-GSLDVEDVLETWAPLAAE- 78 Query: 63 DESIRLDKPVMLGGPLAEDRGFILH--TPPSNFASSIRIS-----DNTVMTTSRDVLETL 115 V LGGP+A D + P + + + E L Sbjct: 79 ------PPTVFLGGPVALDSALGIACVRPEAAVPGEEPLGWRQFSGRLGLVDLDAPPEVL 132 Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 D + + + GYA W GQL E+ AW +L +F T + WR + G Sbjct: 133 APDL--TALRIFAGYAGWGPGQLAGELAQRAWYVVEPELADVFTTEPEELWRRVLRRQGG 190 Query: 176 DILTMP 181 I + Sbjct: 191 TIAMVA 196 >UniRef50_D0A6S5 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=D0A6S5_TRYBG Length = 475 Score = 126 bits (318), Expect = 3e-28, Method: Composition-based stats. Identities = 42/160 (26%), Positives = 71/160 (44%), Gaps = 11/160 (6%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI---TPEPR 62 L+A P L D FR +V+ + N + +++NKPLEN K + + + + P Sbjct: 237 QLLLAHPQLYD-FFRYTVMIVVRVTPNESAALVLNKPLENDKGALMPVSMTMRLSSAHPL 295 Query: 63 DESIRLDKPVMLGGPLAED----RGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 + VM+GGP++ +LH P +I +S + + S D L+ D Sbjct: 296 FAKHLCNHTVMIGGPVSRGSFDSTMLLLHRIPD-VDDAIPLSHSLWIDGSYDTLQQKIED 354 Query: 119 --KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNI 156 P D++V G++ W QLE E+ W+ A + Sbjct: 355 GTADPKDIVVICGFSGWGVQQLEGELQSGTWVAASGSTDD 394 >UniRef50_A8IRU3 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IRU3_CHLRE Length = 315 Score = 126 bits (318), Expect = 3e-28, Method: Composition-based stats. Identities = 46/188 (24%), Positives = 73/188 (38%), Gaps = 21/188 (11%) Query: 4 QHHFLIAMPAL---QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L+A P L F R+ + + EH NG+ G+I+N+P ++ P Sbjct: 123 KGALLLAHPLLFQNSQTYFHRAAILLLEHGDNGSYGVILNRPSTYF--------IRDIPL 174 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTT---SRDVLETLGT 117 R ++ D + +GG + +LH P + A ++ + M RD ++ Sbjct: 175 KRPQTQFNDCRLYVGGDVGGGEVQVLH-PHGDLAGAVEVVKGVYMGGLDAGRDAIDA--G 231 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR----WREAAKLI 173 Q D YA W GQL E W TA A +L K + W E L+ Sbjct: 232 KAQAQDFRWFSAYAGWAPGQLAMECKRGVWFTAAASPKLLLKEVEHGQGPSFWHELMTLL 291 Query: 174 GVDILTMP 181 G D + Sbjct: 292 GGDYAELS 299 >UniRef50_Q4CSL3 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4CSL3_TRYCR Length = 523 Score = 125 bits (315), Expect = 7e-28, Method: Composition-based stats. Identities = 51/170 (30%), Positives = 77/170 (45%), Gaps = 21/170 (12%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP-----E 60 H L+A P L + FR SV+ + N A I+NKPLEN EG+L ++ T Sbjct: 280 HMLLAHPQLYE-FFRYSVMIVVRSTPNEAAAFILNKPLEND--EGMLMQVNSTIRLNHVH 336 Query: 61 PRDESIRLDKPVMLGGPLAEDR----GFILHTPPSNFASSIRISDNTVMTTSRDVLETLG 116 P + VM+GGP++ +LH P +I +S + + + DVL+ Sbjct: 337 PILGKHLGNHTVMIGGPVSRGSFDSSILLLHRIPD-VEDAIPVSQSLWVDGNYDVLQKKL 395 Query: 117 TD--KQPSDVLVALGYASWEKGQLEQEILDNAWLTA------PADLNILF 158 D D++V G++ W GQL EI W+ A PA + +F Sbjct: 396 DDGTADAKDIVVICGFSGWGAGQLAGEISSGTWVVARGSADDPAMDDFIF 445 >UniRef50_B2UMM6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UMM6_AKKM8 Length = 187 Score = 125 bits (315), Expect = 8e-28, Method: Composition-based stats. Identities = 40/173 (23%), Positives = 70/173 (40%), Gaps = 16/173 (9%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL H L+A P L F SV+++ +G I+N P + + + +I Sbjct: 13 NLAGHLLVAAPYLDGAGFHHSVIFLSRAEKEFVIGHILNHP-SGMNVGDVARHTEIP--- 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 P+ GGP+ ++ A+ IR D + + L + P Sbjct: 69 ---ESLYAVPIFKGGPVERNQLIF--------AAFIRTEDKLRVQFHLQEEQALEYLEDP 117 Query: 122 SDVLVA-LGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 +L A +G++ W QL +E+ D AW +P +I +T + W A + + Sbjct: 118 RAILRAYVGHSGWTPPQLRRELNDRAWYVSPMVPDICLETDSSKVWAMAMRRL 170 >UniRef50_Q7URG7 Probable transcriptional regulator n=1 Tax=Rhodopirellula baltica RepID=Q7URG7_RHOBA Length = 259 Score = 125 bits (314), Expect = 9e-28, Method: Composition-based stats. Identities = 52/232 (22%), Positives = 79/232 (34%), Gaps = 62/232 (26%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILE-------- 53 N FLIA P L D F RSVV I H GA G+++N+ + ++E Sbjct: 6 NCTGCFLIASPYLHDGNFFRSVVLIIRHTHEGAFGVVINR-AGPQRFGDVIEMSDPSWQA 64 Query: 54 -----------------KLKITPEPRDESIRLDKP---VMLGGPLAEDRGFI-------- 85 L +P + L + LGGP+ + Sbjct: 65 SSGPDMSSLLASQAESASLGDASDPNKTNESLQIHPDQIYLGGPVNGPVLALHNIAGIGD 124 Query: 86 --------------------LHTPPSNFASSIRI---SDNTVMTTSRDVLETLGTDKQPS 122 LH P+ S+ I +T+ D L L + Sbjct: 125 PCGVDIGEGAENDPAGSKTQLHDHPAEPWGSMSIQWADVPAWVTSDEDHLRLLARRDD-A 183 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + +GY+ W QLE E+ + WL PAD + +F P + W + + G Sbjct: 184 KLRYVVGYSGWGPMQLESELEEGGWLITPADTDSIFG-PCEEVWEKLVRRCG 234 >UniRef50_A8IT21 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IT21_CHLRE Length = 234 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 39/177 (22%), Positives = 74/177 (41%), Gaps = 11/177 (6%) Query: 14 LQDPIFRRSVVYICEHNTNGAMGIIVNKPLE---NLKIEGILEKLKITPEPRDESIRLDK 70 L+D + V+++ H +G++GII+N+P K G+ L++ + + D Sbjct: 56 LKDDRLFQLVIFLTTHGPDGSVGIILNRPTGMVLGRKPGGLP--LELGGPVPIQRVFQDN 113 Query: 71 PVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS-DVLVALG 129 V GG A+ I+H + +++ M E + + P+ D G Sbjct: 114 MVYCGGFTAQQVIHIMH--GHRLQNCVQVVPGVYMAGEVAATEAVSGGRLPAGDFKFFSG 171 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVDILTMPGV 183 +W G+LE ++ AW TA +++ K+ WRE +L+G + Sbjct: 172 AITWAPGELEAQMDRGAWYTAACSRSLVLKSALQLPVPLWREVLQLMGGQYSEVASE 228 >UniRef50_Q6A827 Conserved protein, DUF179 n=2 Tax=Propionibacterium acnes RepID=Q6A827_PROAC Length = 192 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 42/184 (22%), Positives = 71/184 (38%), Gaps = 18/184 (9%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A + + IF SVVY+ + +G +G+IVN+P + L P + Sbjct: 13 GDLLVASRQIDEGIFYESVVYLIDVALDGVLGVIVNQPCTAGTLHRQLPGWVHLATPPQD 72 Query: 65 SIRLDKPVMLGGPLAEDRGFILHTP------PSNFASSIRISDNTVMTTSRDVLETLGTD 118 + LGGP++ + L P + ++ + T +++E TD Sbjct: 73 -------LFLGGPMSPNGAICLARVQRSSEEPPGWRRVQGLTGLLHLDTPTELVEGAFTD 125 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 V + GYA W GQLE E++ W+ A A +F + WR Sbjct: 126 -----VRIFAGYAEWVPGQLEAELIRGDWIRAVAHPEDIFSSEPRGLWRAVLHRQNGPAA 180 Query: 179 TMPG 182 + Sbjct: 181 LLAT 184 >UniRef50_B9SPX9 Electron transporter, putative n=2 Tax=fabids RepID=B9SPX9_RICCO Length = 350 Score = 122 bits (307), Expect = 6e-27, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 66/192 (34%), Gaps = 21/192 (10%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICE----HNTNGAMGIIVNKPLENLKIEGILEKLKIT 58 L+A L F R+VV + H G G+++N+PL ++ +K Sbjct: 159 TGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPL-----NKKIKHMK-P 212 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTT--SRDVLETL 115 + D + GGPL F+L T + S D L Sbjct: 213 TNKELATTFADCSLHFGGPLEA-SMFLLQTGEKEKLPGFEEVIPGLCFGARNSLDEAAAL 271 Query: 116 GTDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKT----PIADRWREA 169 +P D +GYA W+ QL +EI + W A N++ W E Sbjct: 272 VKKGVLKPQDFRFFVGYAGWQLDQLREEIESDYWYVASCSSNLICGNSSDSSSESLWEEI 331 Query: 170 AKLIGVDILTMP 181 +L+G + Sbjct: 332 LQLMGGHYSELS 343 >UniRef50_Q4QG99 Putative uncharacterized protein n=3 Tax=Leishmania RepID=Q4QG99_LEIMA Length = 670 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 41/158 (25%), Positives = 72/158 (45%), Gaps = 17/158 (10%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLEN-----LKIEGILEKLKITPE 60 LI+ P + FRR+V+ + H T+ + +++NKPL N + IE + ++ P Sbjct: 361 QLLISHPTAR-GFFRRTVLLMVRHVTHESAALVLNKPLRNEEGLEMSIEATVRLGRVHPI 419 Query: 61 PRDESIRLDKPVMLGGPLA-----EDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL 115 R +M+GGP+ +D F+LH P ++ + N + DVL Sbjct: 420 FRRH--LAQHTLMIGGPVMSGSSFDDSIFLLHRVP-GVPHALPLGSNLWLDGDLDVLMAK 476 Query: 116 GTDKQP---SDVLVALGYASWEKGQLEQEILDNAWLTA 150 ++ D++V G+A W QL+ E+ W+ A Sbjct: 477 LDAEEASAEEDIVVLCGFAGWGFDQLKGELGHGYWVVA 514 >UniRef50_C7PBJ3 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PBJ3_CHIPD Length = 146 Score = 112 bits (281), Expect = 6e-24, Method: Composition-based stats. Identities = 37/146 (25%), Positives = 66/146 (45%), Gaps = 13/146 (8%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 F+ + L+ +F +V+YI E+N NGAMG IVN + L +L+ R Sbjct: 3 AGIFINSTSLLEKSVFESTVIYITEYNENGAMGFIVNN-----RFPRKLNELEEFSHGR- 56 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP-- 121 D P+ GGP+ ++ F +H P + ++ DN + + Sbjct: 57 -----DFPLWEGGPVDKEHLFFIHQRPDLISGGEQVGDNIFLGGDFQAAVKHINEHTLTE 111 Query: 122 SDVLVALGYASWEKGQLEQEILDNAW 147 D+ + +GY W+ +L++EI + +W Sbjct: 112 QDIKIFIGYCGWDYKELDEEIDEGSW 137 >UniRef50_C1E1K3 Predicted protein n=2 Tax=cellular organisms RepID=C1E1K3_9CHLO Length = 271 Score = 107 bits (269), Expect = 1e-22, Method: Composition-based stats. Identities = 34/151 (22%), Positives = 69/151 (45%), Gaps = 15/151 (9%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P ++ +R SVV + H+ G+ G+I+N+P N +++ ++ ++ + Sbjct: 76 TGCLLLA-PETEEGWWRHSVVLVLNHDAEGSTGVILNRPT-NAQLKNVVPEIDYSA--PH 131 Query: 64 ESIRLDKPVMLGGPLAEDRG-----FILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 + ++ V +GGP+ ++G + HT S + + + + Sbjct: 132 HRVLANRHVSMGGPMGTEKGARCLVALSHTRLDGATS--EVFPGLWHVSDFS---AVKPE 186 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLT 149 +PS ++V +GY W GQL E+ N W Sbjct: 187 HEPS-LMVFVGYCGWMSGQLNAEVAANGWTV 216 >UniRef50_Q8S0Q9 Os01g0886000 protein n=6 Tax=Poaceae RepID=Q8S0Q9_ORYSJ Length = 354 Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 8 LIAMPALQD-PIFRRSVVYICEHNT----NGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+A L F R+V+ + + +G G+I+N+PL K++ + + P Sbjct: 167 LVAAEELDGNGTFERTVILLLRLGSRDAYDGPFGVILNRPL-YTKMKHVNPSFRNQATP- 224 Query: 63 DESIRLDKPVMLGGPLAEDRGFIL-HTPPSNFASSIRISDNTVMT--TSRDVLETLGTDK 119 D + GGP+ F++ T +S T + L Sbjct: 225 ----FSDCSLFFGGPVDM-SIFLMRTTDDRPIKGFEEVSPGVCFGFRTDLEKASALLKSG 279 Query: 120 --QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL---FKTPIADRWREAAKLIG 174 +P D+ +GY++W+ QL EI W ++ T + W E KL+G Sbjct: 280 AVKPEDLNFYVGYSAWDYDQLLSEIDQGYWHVTSCSSGLISDSLATDPSCLWTEILKLMG 339 Query: 175 VDILTMP 181 + Sbjct: 340 GQYAELS 346 >UniRef50_C5YY61 Putative uncharacterized protein Sb09g020680 n=4 Tax=Andropogoneae RepID=C5YY61_SORBI Length = 355 Score = 106 bits (266), Expect = 3e-22, Method: Composition-based stats. Identities = 41/186 (22%), Positives = 68/186 (36%), Gaps = 18/186 (9%) Query: 8 LIAMPALQDP-IFRRSVVYICEHNTNGAM----GIIVNKPLENLKIEGILEKLKITPEPR 62 L+A AL D IF R+V++I + G G+I+N+PL KI+ + + P Sbjct: 168 LVATEALDDDSIFERTVIFILRLGSRGTFDGPFGVILNRPL-YTKIKHVNPTFQDQATP- 225 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT--TSRDVLETLGTDKQ 120 D P+ GGP+ + S + T + L Sbjct: 226 ----FGDSPLFFGGPVDMSMFLVRTDDSSRLKGFEEVVPGICYGFRTDLEKAAVLMKSGA 281 Query: 121 --PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL---FKTPIADRWREAAKLIGV 175 D+ +G+A+W+ QL EI W A ++ + W E +L+G Sbjct: 282 IRTQDLRFYVGHAAWDYEQLLGEIRAGYWAVASCSTELISDALTGDPSCLWTEILQLMGG 341 Query: 176 DILTMP 181 + Sbjct: 342 QYSELS 347 >UniRef50_C5BU30 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BU30_TERTT Length = 192 Score = 105 bits (262), Expect = 9e-22, Method: Composition-based stats. Identities = 50/179 (27%), Positives = 77/179 (43%), Gaps = 18/179 (10%) Query: 3 LQHHFLIAMPALQD-PIFRRS---VVYICEHNTNGAMGIIVN----KPLENLKIEGILEK 54 L + LIA PA D P F S ++Y+ H +GA+G+ +N KPL + E+ Sbjct: 5 LTDNVLIANPATTDLPQFAASAEKLIYVVHHGDDGAVGVCLNEYFGKPLADFS-----EQ 59 Query: 55 LKITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLET 114 +I S+ V GGPLA + +IL + I + + + S+ E Sbjct: 60 YEILASVSPLSLAS-VTVHSGGPLATELPWILSRAVDIYPHQIN-NKSLSLNFSQ---EA 114 Query: 115 LGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 D LV LG SW GQLE+E+ W PA +L + +++ A + Sbjct: 115 FADPSIHMDALVGLGSFSWGPGQLEKEVSGFMWHCFPAQKPLLNRLHFEHKYQSAVDTL 173 >UniRef50_D0NN30 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NN30_PHYIN Length = 304 Score = 104 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 70/187 (37%), Gaps = 39/187 (20%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 FL+A P LQ IF RSVV + EH G+ G IVNK Sbjct: 136 SGVFLLAHPLLQ-GIFSRSVVILTEHKPEGSKGFIVNK---------------------- 172 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRIS-------DNTVMTTSRDVLETLG 116 V GGP+ +LH + S + + D Sbjct: 173 ------VTVRKGGPVFTRNAEVLHGRADFGGQRVATSNFPTANDPSLFVGVDLDTAARAI 226 Query: 117 TDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 D+ + +DV+ G ++W GQL+ E+ +W+ A +++ AD W++ + +G Sbjct: 227 YDETAKQTDVVFMSGVSAWSPGQLDSELKQGSWVAVKAPVSLALNA-SADLWQDLMRTLG 285 Query: 175 VDILTMP 181 + M Sbjct: 286 GEYAEMS 292 >UniRef50_B8C551 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C551_THAPS Length = 645 Score = 98.6 bits (245), Expect = 9e-20, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 64/174 (36%), Gaps = 29/174 (16%) Query: 13 ALQDPIFRRSVVYICEHNTNG-AMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKP 71 L+ F ++V+ + EH+ N GII+N+P + + + + + +K Sbjct: 155 GLRQQYFHKAVILVLEHDENTFTKGIILNRPSDQMMDDDVNDGVK-------------WR 201 Query: 72 VMLGGPLA-EDRGF----ILHT--PPSNFASSIRISDNTVMTTSRDVLETLGTD-KQPSD 123 V GG + D LH+ + +S+ + T+ + + + D Sbjct: 202 VWFGGDVQGLDSLLPDIVCLHSLKSEAAKDASVTVVKGIQWTSFSNAKQLVKRGVASVED 261 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 + GYA W QL E+ +W D L K A + G+D Sbjct: 262 FWLFAGYAGWGPRQLSGELDRKSWYMCATDSQTLLK-------ELARQSYGIDP 308 >UniRef50_Q9LS71 Emb|CAB72194.1 n=4 Tax=rosids RepID=Q9LS71_ARATH Length = 317 Score = 96.7 bits (240), Expect = 3e-19, Method: Composition-based stats. Identities = 43/186 (23%), Positives = 64/186 (34%), Gaps = 23/186 (12%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 LIA L F ++V+ + +G +G+I+N+P E L + Sbjct: 133 TGCLLIATEKLDGVHIFEKTVILLLSVGPSGPIGVILNRPSLMSIKETKSTILDMAGTF- 191 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSI-------RISDNTVMTT----SRDV 111 DK + GGPL E G L +P S + + ++ T Sbjct: 192 -----SDKRLFFGGPLEE--GLFLVSPRSGGDNEVGKSGVFRQVMKGLYYGTRESVGLAA 244 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL---FKTPIADRWRE 168 S++ GY WEK QL+ EIL W A ++ W E Sbjct: 245 EMVKRNLVGRSELRFFDGYCGWEKEQLKAEILGGYWTVAACSSTVVELGSAVQSHGLWDE 304 Query: 169 AAKLIG 174 LIG Sbjct: 305 VLGLIG 310 >UniRef50_B8C1S9 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C1S9_THAPS Length = 632 Score = 91.7 bits (227), Expect = 1e-17, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 64/165 (38%), Gaps = 13/165 (7%) Query: 13 ALQDPIFRRSVVYICEHNTNGAMGIIVNKPLE-NLKIEGILEKLKITPEPRDESIRLDK- 70 L F ++V+ + H++ GII+N+P +L E +++ D ++ Sbjct: 115 GLSQQYFHKAVLLVTYHSSEFTKGIILNRPTNLHLDDEDFIDESGEPFIKSDNALEDMNS 174 Query: 71 -PVMLGGPL------AEDRGFILHTPPSNF--ASSIRISDNTVMTTSRDVLETLGTDKQP 121 + GG + + LH+ SN S I N +T + + ++ Sbjct: 175 WRIWFGGDVNGMYSDDPE-IVCLHSIDSNLGKNLSEEIIKNIFLTNYEGARKLIDANEAT 233 Query: 122 S-DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR 165 S D V GY W GQL E+ +W AD ++ + R Sbjct: 234 SQDFWVFAGYCGWSAGQLLDELKHESWYMVSADSQTVWSELVRQR 278 >UniRef50_Q54HU6 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54HU6_DICDI Length = 493 Score = 91.3 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 82/218 (37%), Gaps = 49/218 (22%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGA-MGIIVNKPLENLKIEGILEKLKITPEPRD 63 LI+ P+L + + VV I H+ G G +N P++ ++ ++ Sbjct: 278 GTILISHPSLGE-HLNKKVVLIT-HSIGGHHYGFFINTPIQTGAALSYID-FEMKTHYDR 334 Query: 64 ESIRLDK--------------PVMLGGPLAE-----------------DRGF-------I 85 ++ ++ P+ GPL RGF I Sbjct: 335 AAVEANRSKNFSRLFIYALKRPI---GPLKTLNWNLGGLQSITDHSVLSRGFDGLNCTQI 391 Query: 86 LHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD--VLVALGYASWEKGQLEQEIL 143 +H P SN + S +I D + + +K+ +L+ +G ++W GQLE+EI Sbjct: 392 VH-PYSNLSGSKKIRDGLYIGGKLKEVGDKIRNKEIDKNKLLMFVGCSTWNPGQLEKEIK 450 Query: 144 DNAWLTAPADLNILFKT-PIADRWREAAKLIGVDILTM 180 + AW A + K + W EA + +G D + Sbjct: 451 EGAWFRADCSNETILKQLKPKNFWAEALESMGGDYSDL 488 >UniRef50_Q7G645 Os10g0330400 protein n=3 Tax=Oryza sativa RepID=Q7G645_ORYSJ Length = 296 Score = 88.2 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 41/185 (22%), Positives = 64/185 (34%), Gaps = 19/185 (10%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + LIA L F R+VV + G +G+I+N+P + I E + E Sbjct: 112 KGCLLIATEKLDGSHIFERTVVLLLSAGVLGPVGVILNRP----SLMSIKEAQAVFAETD 167 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSI-------RISDNTVMTTSRDV---L 112 +P+ GGPL + F+L + + + T V Sbjct: 168 IAGAFSGRPLFFGGPLE-ECFFLLGPRAAAAGDVVGRTGLFDEVMPGVHYGTRESVGCAA 226 Query: 113 ETLGTDK-QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL-FKTPIA-DRWREA 169 E + D G+ WE+ QL E+ W A +L T + W E Sbjct: 227 ELVKRGVVGVRDFRFFDGFCGWEREQLRDEVRAGLWRVAACSPAVLGLATVVKGGLWEEV 286 Query: 170 AKLIG 174 L+G Sbjct: 287 QGLVG 291 >UniRef50_C5L030 Membrane associated RING finger, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5L030_9ALVE Length = 388 Score = 87.8 bits (217), Expect = 1e-16, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 68/197 (34%), Gaps = 36/197 (18%) Query: 4 QHHFLIAMPALQDP--IFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + L+A + P IF RSV + EH+ G++ +I+NKP+ I + + P Sbjct: 194 KGALLVANDNMIGPGSIFYRSVALVLEHDHMGSLALILNKPVARSPIADYRDAAEGPPVE 253 Query: 62 RDESIRLDKPVMLGGPL---AEDRGFILHTPPSNFASSIRIS----DNTVMTTSRDVL-- 112 GGP+ E R I+H + + +S D+ + + Sbjct: 254 LLTVR--------GGPVRINEERR--IMHQARGVVGARVLLSQDREDSVYLGGDLTAVLA 303 Query: 113 ----ETLGTDKQPSDVLVALGYASWEKGQLEQEILDNA--WLTAPADLNILFK------- 159 + G + + ++ G A W GQL E+ + W+ P IL Sbjct: 304 GIAQQDRGAGDERAHAIIFDGCARWAPGQLYGELRAGSWRWINPPWPEEILLSAFDQHGA 363 Query: 160 --TPIADRWREAAKLIG 174 + G Sbjct: 364 GVDNGEAMYHMIMNDYG 380 >UniRef50_C5WYB1 Putative uncharacterized protein Sb01g018920 n=1 Tax=Sorghum bicolor RepID=C5WYB1_SORBI Length = 1193 Score = 85.5 bits (211), Expect = 7e-16, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 56/153 (36%), Gaps = 15/153 (9%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAM-GIIVNKPLENLKIEGILEKLKITPEP 61 L A L + F + V I ++ G+I+NK L + + ++ Sbjct: 1038 TGSILTATEKLGAAVPFDNAKVLIVSSGSHEGFHGLIINKRLSWGVFKDLDSSMERIKHA 1097 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTV----MTTSRDVLETLGT 117 P+ GGP+ ++ + +++ + TSR V Sbjct: 1098 ---------PLFYGGPVVVQGYHLVSLSRVAWEGYMQVIPGVYYGNIVATSRVVTRIKLG 1148 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTA 150 ++ D+ +GY+ W QL E+ + AWL + Sbjct: 1149 EQSVEDLWFFVGYSGWGYSQLFDELSEGAWLVS 1181 >UniRef50_Q7UKQ8 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UKQ8_RHOBA Length = 313 Score = 79.8 bits (196), Expect = 4e-14, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 71/219 (32%), Gaps = 55/219 (25%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENL----------------- 46 + L++ + + ++V + +T A+G+++N+P++ + Sbjct: 78 AGNLLVSSTLVDGTVLNQAVCLMVHEDTEHAIGLMLNRPMQAMAGAITIQGTPQETPKIP 137 Query: 47 --KIEGILEKLKITPEPRDESIRLDKPV-------------------------------M 73 E + E + S ++ PV Sbjct: 138 RWNAEDLDETSEGDSTIDPSSDSIEHPVSGVPSVVISGDQKDQLAQQLLSGKLANGSSLH 197 Query: 74 LGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASW 133 GGPL+ G I+ + + + + R+ LE L + +G+ W Sbjct: 198 FGGPLS---GPIVAVHSNRELAEAETGEGIFVAAQRENLEALMKSSD-LPYRLIIGHLGW 253 Query: 134 EKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 QLE EI + W PA ++L + A W + Sbjct: 254 TAEQLENEIEEGIWHRIPATSDLL-NSDDAMMWPRMIRR 291 >UniRef50_D1HM83 Whole genome shotgun sequence of line PN40024, scaffold_108.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HM83_VITVI Length = 266 Score = 77.8 bits (191), Expect = 1e-13, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 50/178 (28%), Gaps = 43/178 (24%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + LIA L F R+V+ + G GII+N+P + Sbjct: 120 KGCLLIATEKLDGVHIFERTVILLLSTGPVGPTGIILNRPS-------------LMSIKE 166 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDV---LETLGTDK 119 S LD + + T V E + + Sbjct: 167 TRSTVLDTGLF-----------------------EEVMKGLYYGTKESVGCAAEMVKRNA 203 Query: 120 -QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL--FKTPIADRWREAAKLIG 174 D GY WEK QL EI W A +++ W E L+G Sbjct: 204 VAVEDFRFFDGYCGWEKEQLRDEIRAGYWTVAACSPSVIGLTSVGSVGLWEEIIGLMG 261 >UniRef50_A9S274 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9S274_PHYPA Length = 1306 Score = 77.4 bits (190), Expect = 2e-13, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 59/153 (38%), Gaps = 20/153 (13%) Query: 4 QHHFLIAMPALQD-PIFRRSVVYICEHNTNGAM-GIIVNKPLENLKIEGILEKLKITPEP 61 L+A P L +F V+ I + +G + G+++NKPL + Sbjct: 1145 AGTLLLASPLLDGTSVFSGCVILIVHAHEHGDVRGLMLNKPLS----------WDYVAKT 1194 Query: 62 RDESIRLDKPVMLGGPLAEDR--GFILHTPPSNFASSIRISDNTVMTTSR----DVLETL 115 + + P+ GGP+ E F+L P + S D+++ + Sbjct: 1195 IGQDSLHEAPLGFGGPVGEQSHPFFVLTKVP-GLDDFHEVMPGVFYGVSAKSVEDLIQLM 1253 Query: 116 GTDKQPS-DVLVALGYASWEKGQLEQEILDNAW 147 + K DV V LG +W QL++E+ W Sbjct: 1254 QSGKLIEADVWVFLGCTAWSWFQLQEELAQQIW 1286 >UniRef50_A4RT64 Predicted protein n=2 Tax=Ostreococcus RepID=A4RT64_OSTLU Length = 236 Score = 76.3 bits (187), Expect = 5e-13, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEK------LKI 57 + L+A+ + V+++ +H G+ GII+N+ + E + Sbjct: 36 KGALLVAVEEDASSFWSHVVIFMLDHTPYGSTGIILNRTQSWTLAKHCPEVKHDNLYWSL 95 Query: 58 TPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT 117 E + DR I T + + + D L L Sbjct: 96 LSEEVVGVGGPVGLDH-----SLDRSVIALTTKEQPGMTEEVIPGIYRVINLDQLAKLNA 150 Query: 118 DKQ------PSDVLVALGYASWEKGQLEQEILDNAWLTAPAD 153 P D+ + +GY+ W GQL+ EI W A A Sbjct: 151 KLSGPGTLRPEDLSLFVGYSGWSPGQLQSEIDAGFWTLASAS 192 >UniRef50_Q0IWV7 Os10g0485100 protein (Fragment) n=5 Tax=Poaceae RepID=Q0IWV7_ORYSJ Length = 855 Score = 75.9 bits (186), Expect = 6e-13, Method: Composition-based stats. Identities = 38/173 (21%), Positives = 66/173 (38%), Gaps = 24/173 (13%) Query: 3 LQHHFLIAMPALQDPI-FRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITPE 60 L L A L + F S V I + G G+I+NK L + L + E Sbjct: 699 LTGSVLTATSKLGSAVPFDNSQVLIVSADSREGFHGLIINKRLSW----DTFKNLDGSME 754 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTV---MTTSRDVLETLGT 117 P + P+ GGP+ +++ F +++ + + V + + Sbjct: 755 PIKHA-----PLFYGGPVVVQGYYLVSLSRVAFDGYLQVIPGVYYGNVAATAQVTRRIKS 809 Query: 118 DKQPSDVL-VALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR-WRE 168 +Q ++ L LG+++WE QL E+ + AW + PI W E Sbjct: 810 GEQSAENLWFFLGFSNWEYSQLFDELSEGAWQVSE--------EPIEHLVWPE 854 >UniRef50_Q9LT30 Genomic DNA, chromosome 3, P1 clone: MPN9 n=2 Tax=Arabidopsis thaliana RepID=Q9LT30_ARATH Length = 963 Score = 73.6 bits (180), Expect = 3e-12, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 4 QHHFLIAMPALQDP-IFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEP 61 L+A L F +S + I + G +G+I NK + + E ++ E Sbjct: 807 TGTVLVATEKLAASLTFAKSKILIIKAGPEIGFLGLIFNKRIRWKSFPDLGETAELLKE- 865 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILH----TPPSNFASSIRISDNTVM----TTSRDVLE 113 P+ GGP+ + +L S IS + +R + E Sbjct: 866 --------TPLSFGGPVVDPGIPLLALTRERDSSTNHDHPEISPGVYFLDHQSVARRIQE 917 Query: 114 TLGTDKQPSDVLVALGYASWEKGQLEQEILDNAW 147 + PS+ LGY+SW QL EI W Sbjct: 918 LKSRELNPSEYWFFLGYSSWSYEQLFDEIGLGVW 951 >UniRef50_D1I242 Whole genome shotgun sequence of line PN40024, scaffold_10.assembly12x (Fragment) n=3 Tax=rosids RepID=D1I242_VITVI Length = 1106 Score = 73.2 bits (179), Expect = 5e-12, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 56/150 (37%), Gaps = 16/150 (10%) Query: 5 HHFLIAMPALQDPI-FRRSVVYICEHNTNGAM-GIIVNKPLENLKIEGILEKLKITPEPR 62 L+A L D F +S + I + + G+I+NK + + + E + E Sbjct: 951 GSILVATDKLLDAHPFDKSTILIVKADQATGFHGLIINKHINWESLNELAEGVDHLKEA- 1009 Query: 63 DESIRLDKPVMLGGPL-AEDRGFILHTPPSNFASSIRISDNTVM---TTSRDVLETLGTD 118 P+ GGP+ + + T + + + +E L + Sbjct: 1010 --------PLSFGGPVVKRGKPLVALTRRVFKDQHPEVLPGVYFLDQSATVSEIEGLKSG 1061 Query: 119 KQ-PSDVLVALGYASWEKGQLEQEILDNAW 147 + S+ +G+++W QL EI + AW Sbjct: 1062 NESVSEYWFFVGFSNWGWDQLFDEIAEGAW 1091 >UniRef50_B6SSQ6 Uncharacterized ACR, COG1678 family protein n=3 Tax=Andropogoneae RepID=B6SSQ6_MAIZE Length = 292 Score = 72.8 bits (178), Expect = 5e-12, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 63/189 (33%), Gaps = 21/189 (11%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEP 61 + LIA L F R+V+ + ++ G +G+I+N+P + I+ + + Sbjct: 102 KGCLLIATEKLDGSHIFERTVILLLSSPSSLGPVGVILNRP-SLMSIKEASGSI-FADDA 159 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASS----------IRISDNTVMTT--SR 109 +P+ GGPL + F++ + + T + Sbjct: 160 DIARAFAGRPLFFGGPLE-ECFFVIGPRAAAGGGGDDAVARTGLFEEVMPGLHYGTRETV 218 Query: 110 DVLETLGTDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPI--ADR 165 L D G+ WE+ QL E+ W A +L + Sbjct: 219 GCAAELAKRGVVGVRDFRFFDGFCGWEREQLRDEVRAGLWHVAACSAAVLELATVVKGGL 278 Query: 166 WREAAKLIG 174 W E L+G Sbjct: 279 WEEVQGLVG 287 >UniRef50_C0AXX9 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXX9_9ENTR Length = 56 Score = 72.4 bits (177), Expect = 7e-12, Method: Composition-based stats. Identities = 24/49 (48%), Positives = 35/49 (71%) Query: 139 EQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 E+ N+WLT A I+F TP+A+RW +AA+LIG++I T+ +AGHA Sbjct: 8 ERNFRKNSWLTVEASPQIIFDTPVAERWHKAAELIGINIHTISPIAGHA 56 >UniRef50_B9HGN1 Predicted protein n=1 Tax=Populus trichocarpa RepID=B9HGN1_POPTR Length = 1080 Score = 70.1 bits (171), Expect = 3e-11, Method: Composition-based stats. Identities = 35/157 (22%), Positives = 60/157 (38%), Gaps = 15/157 (9%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAM-GIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A L F +S + I + + N G+I NK L ++ + E+ K+ E Sbjct: 928 GSILVATEKLNTQPFDKSRILIVKSDQNTGFQGLIYNKHLRWDTLQELEEESKLLKEA-- 985 Query: 64 ESIRLDKPVMLGGP-LAEDRGFILHTPPSNFASSIRISDNTVM---TTSRDVLETLGTDK 119 P+ GGP + + T + ++ T + + +E + + Sbjct: 986 -------PLSFGGPLVTRGMPLVALTRRAVGGQYPEVAPGTYFLGQSATLHEIEEISSGN 1038 Query: 120 Q-PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLN 155 Q SD LG++SW QL EI AW + Sbjct: 1039 QCVSDYWFFLGFSSWGWEQLFDEIAQGAWNLSEHKKE 1075 >UniRef50_A9T9H0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T9H0_PHYPA Length = 461 Score = 70.1 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 45/216 (20%), Positives = 58/216 (26%), Gaps = 51/216 (23%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P + P + +VV + EH G+ GII+NK E IL+ Sbjct: 249 GTLLVATPTMLSPYYFGTVVLLYEHERCRGSRGIILNKQAEK---AEILKWENQLFLGVH 305 Query: 64 ESIRLDKPVM-LGGPLAEDRGFIL-----------HTPPSNFAS-------SIRISDNTV 104 S L GG D FIL H I Sbjct: 306 NSAALRHITHGTGGSHKPDDWFILQRCSPSPVKCKHCAADTARPKTCSKDWGREILPGIF 365 Query: 105 MTTSRD-VLETL----------------GTDKQPSDVL-----------VALGYASWEKG 136 + VL L + V V G+A W G Sbjct: 366 LGKDVGPVLRHLNGCKRLSVMGQQCSVKAESIKEDRVRKNEECYYVDHQVIHGHAEWYVG 425 Query: 137 QLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 QL + W T IL TP + W Sbjct: 426 QLGSAVKRGLWKTRENASAILLSTPPHELWHTLLSE 461 >UniRef50_B6KQL2 Putative uncharacterized protein n=4 Tax=Toxoplasma gondii RepID=B6KQL2_TOXGO Length = 1530 Score = 61.3 bits (148), Expect = 1e-08, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 21/81 (25%) Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADL-----NILFKTPI---------------AD 164 V LG ASW GQLE+EI AW+ I+F Sbjct: 1421 RVFLGKASWSPGQLEREIEKGAWVVVGCTDSGVMQEIVFGREPSAPGGAPGGHTPPSEEH 1480 Query: 165 RWREAAKLIGVDILTMPGVAG 185 WR + + + AG Sbjct: 1481 LWRRVLSALAANPAS-GPQAG 1500 >UniRef50_B8BQD7 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BQD7_THAPS Length = 499 Score = 58.6 bits (141), Expect = 1e-07, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 35/109 (32%), Gaps = 14/109 (12%) Query: 86 LHTPPSNFAS-SIRISDNTVMTTSRD------VLETLGTDKQPSDVLVALGYASWEKGQL 138 LH P S ++ + D + E L D + W QL Sbjct: 377 LHMCPFVTDSQNLTMCDGLYWGGDPGQAQEAMIDERLDKPMSGFDFKFFVKDTRWLPSQL 436 Query: 139 EQEILDNAWLTAPADLNILFK-------TPIADRWREAAKLIGVDILTM 180 E+EI+D W +LF+ W E +L+G D + Sbjct: 437 EEEIMDGTWHVTTVSKEVLFRNRDRLGPKRAKPLWTEIMELLGDDYKHI 485 >UniRef50_D2VPR5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VPR5_NAEGR Length = 413 Score = 57.4 bits (138), Expect = 2e-07, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 65/187 (34%), Gaps = 50/187 (26%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLK-------IEGILEKL 55 L++H LIA P L + F R+V+ + ++ + G IV K + E +LE Sbjct: 236 LKNHLLIAHPMLANKYFERTVIRMDDNIQD--TGYIVGKLSNDENTNEDKKKFEDLLEDD 293 Query: 56 KITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL 115 + TPE ++ L V +R F + LE Sbjct: 294 EKTPESEAKAKTLMDFV-----SQINREF-------------------------ESLEKK 323 Query: 116 -GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILF----KTPIADRWREAA 170 + +P + A W QL EILD W+ D+ +TP W Sbjct: 324 HSKEAEPERL------ARWYTHQLANEILDGVWIVVAVDMEAFLPFAKETPYDKVWEYLV 377 Query: 171 KLIGVDI 177 +G + Sbjct: 378 SRLGGEY 384 >UniRef50_B7G772 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G772_PHATR Length = 390 Score = 51.3 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 55/162 (33%), Gaps = 22/162 (13%) Query: 37 IIVNKPLENL--KIEGILEKLKITPEPRDESIRLD----KPVMLGGPLAE-DRGF-ILHT 88 +++N+ L +E L T + + L+ +P+ GG + G +LH Sbjct: 218 VLLNRRTGYLLGDLEQADTNLGSTSSKKAPTPVLEKFCIQPLWFGG-VDNVSAGLDMLHQ 276 Query: 89 PPSNFASSIRISDNTVMTTSRDVLETLGTD-----KQPSDVLVALGYASWEK-GQLEQEI 142 P+ + + + + + D + W +L++EI Sbjct: 277 CPTVPDAEPLSDEGLYWGGDPALAQDAMDEVTDKVLTGFDFKFFVQSTVWGSSKELQKEI 336 Query: 143 LDNAWLTAPADLNILFKT-------PIADRWREAAKLIGVDI 177 + W TA +LFK+ W E +L+G Sbjct: 337 DNGTWFTARVSKEVLFKSRDRMGTRRAKPLWTEVMELLGGKY 378 >UniRef50_B8CF73 Predicted protein n=2 Tax=Thalassiosira pseudonana RepID=B8CF73_THAPS Length = 656 Score = 50.9 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 22/64 (34%), Gaps = 10/64 (15%) Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D +G + W GQLE+EI WL D ++ D+ T+ Sbjct: 519 DDFSFIIGASCWAPGQLEKEIERGCWLPFRGDPSMALTGECDH----------NDVATLS 568 Query: 182 GVAG 185 G Sbjct: 569 TNDG 572 Score = 47.0 bits (111), Expect = 3e-04, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 21/57 (36%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEH----------------NTN----GAMGIIVNK 41 FLIA P + F +SV+ + +H + G G+I+N+ Sbjct: 276 GSFLIAHPLMT-GYFAKSVILLLDHTEASSKSTSSTESQAESEEVGSGGTYGLIINR 331 >UniRef50_O68558 Putative uncharacterized protein (Fragment) n=1 Tax=Mycobacterium bovis RepID=O68558_MYCBO Length = 82 Score = 50.5 bits (120), Expect = 3e-05, Method: Composition-based stats. Identities = 14/27 (51%), Positives = 20/27 (74%) Query: 13 ALQDPIFRRSVVYICEHNTNGAMGIIV 39 L +P FRRSV+YI EHN G +G+++ Sbjct: 1 DLLEPTFRRSVIYIVEHNDGGTLGVVL 27 >UniRef50_B8CDL7 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8CDL7_THAPS Length = 531 Score = 49.7 bits (118), Expect = 5e-05, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 42/116 (36%), Gaps = 6/116 (5%) Query: 72 VMLGGPLAEDRGFILHTPPSNFASSIRISDN--TVMTTSRDVLETLGTDK-QPSDVLVAL 128 V +GGP D L ++ S+ IS ++ + + K +P D + Sbjct: 405 VYVGGPDKMDEPATLIHGIADLPGSVEISPGTGIYEGGLEAAMDGVLSGKYKPLDFRFFI 464 Query: 129 GYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVDILTMP 181 G+ S+E G+L+ + ++ K W E + G ++ + Sbjct: 465 GHTSYEGGELDYACEVGKYQPVACSRPLVLKQCMQLPKPLWHEVLEFCGGELKEIS 520 >UniRef50_B7G8R3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G8R3_PHATR Length = 491 Score = 48.6 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 29/125 (23%), Positives = 51/125 (40%), Gaps = 19/125 (15%) Query: 23 VVYICEHNT-NGAMG-IIVNKPLENLKIEGILEKLKITPEPRDESIRL------------ 68 V + E N NGA +++N+P+ LK+ L +L + R E + Sbjct: 264 VCLVMERNESNGAATTLVLNRPM-ALKLTDSLGQLVLNGAYRGEKTKPKKDVTRFMRAFG 322 Query: 69 -DKPVMLGGPLAEDRGFILHTPPSNFASSIRISD--NTVMTTSRDVLETLGTDK-QPSDV 124 + V +GGP +D+ +L ++ A + IS +E + + K QP D Sbjct: 323 GECAVYIGGPDDQDQPAVLVHGLADLAGANEISPGSGIYQGGIEAAVEGVISGKYQPLDF 382 Query: 125 LVALG 129 +G Sbjct: 383 RFFVG 387 >UniRef50_B8BZN2 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8BZN2_THAPS Length = 646 Score = 48.6 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 53/161 (32%), Gaps = 22/161 (13%) Query: 20 RRSVVYICEHNTNGAMGIIVNKPLE----------NLKIEGILEKLKITPEPRDESIRLD 69 R+S+V + + N GII+N+P + E +I S Sbjct: 148 RKSIVLVLDVQQNFIQGIILNRPTNIGVKQGMQFVQPGHGEVFEN-EIGSCDGSGSSPHR 206 Query: 70 KPVMLGGPL-----AEDRGFILHTP--PSNFASSIRISDNTVMTTSRDVLETL--GTDKQ 120 V GG + + LH+ S + ++T S D + L + Sbjct: 207 WKVWFGGEVAGPFSEYPQVMCLHSVNTDLGVELSDAVLPGILIT-SFDGAQRLVDAGEAN 265 Query: 121 PSDVLVALGYASWEKGQLEQEI-LDNAWLTAPADLNILFKT 160 PS + G WE E+ + WL +D + + Sbjct: 266 PSSFWLFCGICGWETSSFYSEMHDEGLWLVVSSDGGTILEE 306 Score = 44.3 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 53/130 (40%), Gaps = 22/130 (16%) Query: 7 FLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESI 66 FL+A D F +S++ I + + + G+I+N ++ +E I+ Sbjct: 437 FLLA-----DQGFHKSLILIVRDDVDCSEGVILN----HMTMESIM----------LGDG 477 Query: 67 RLDKPVMLGGPLAE-DRGFILHTPPSNFASSIRISD-NTVMTTSRDVLETLGTD-KQPSD 123 + PV GGP+ + L++ S +++ + T +++E++ D Sbjct: 478 KTCLPVRYGGPMQVSEPIMYLYSNESLDCVGVQMGNSEIYSCTEDEIIESIELGLASADD 537 Query: 124 VLVALGYASW 133 L G + W Sbjct: 538 FLAIQGISVW 547 >UniRef50_A5AUI4 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AUI4_VITVI Length = 218 Score = 46.2 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 11/53 (20%), Positives = 21/53 (39%), Gaps = 3/53 (5%) Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADL---NILFKTPIADRWREA 169 +P D + +GY W+ QL +E+ + A + + + W E Sbjct: 155 KPEDFIFFVGYVGWQLDQLREEMGSDYGYVAAYSPYVIDGVLTESSSGVWDEV 207 >UniRef50_C5KHP8 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KHP8_9ALVE Length = 473 Score = 45.1 bits (106), Expect = 0.001, Method: Composition-based stats. Identities = 25/112 (22%), Positives = 36/112 (32%), Gaps = 20/112 (17%) Query: 74 LGGPLAEDRGFILHTPPSNFASSIRISD-----NTV--MTTSRDVLETLGTDKQPSDVLV 126 GGP+A +LH + + I + D E L P Sbjct: 316 FGGPVA--SVEVLHESLVRGKNPLSIGPNETSIGLFHGWKQTEDT-ELLTEATTPR--RT 370 Query: 127 ALGYASWEKGQLEQEILDNAWLTA-----PADLNILFKTP---IADRWREAA 170 +G A+WE+GQLE+E+ W A I+ D W Sbjct: 371 FVGKAAWERGQLEREMNLGVWYPVRVTCPEALRKIMLGNHELSEDDLWAAMV 422 >UniRef50_C1FFU8 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FFU8_9CHLO Length = 437 Score = 42.8 bits (100), Expect = 0.006, Method: Composition-based stats. Identities = 30/159 (18%), Positives = 38/159 (23%), Gaps = 51/159 (32%) Query: 52 LEKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTV------- 104 LE L + + R +GGP+ R IL P+ + D Sbjct: 267 LESLNHDSDFLESLTRT----YVGGPVHPRRRIILFVSPTGANALSPTGDALCEVPLDRT 322 Query: 105 -MTTSRDVLETL----------------------------------GTDKQPSDVLVALG 129 SR V V G Sbjct: 323 CHPGSRARAFAYHPTATDSDEYVDEIAGEAAAAALAESEEEDAAFNADGFFVPRVHVFEG 382 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWRE 168 +A W + QL EI W PA L RW E Sbjct: 383 HAKWSRTQLMNEIARGDWGLCPATPEDL-----TSRWAE 416 >UniRef50_B7GE41 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7GE41_PHATR Length = 258 Score = 42.0 bits (98), Expect = 0.009, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 53/152 (34%), Gaps = 20/152 (13%) Query: 24 VYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAEDRG 83 VY+ G+I+++P + ++E ++ D + GG D+ Sbjct: 109 VYVIR-------GVILDQPTP-FTLGEMMEH----NPALQKTPLKDNLLFRGGDKGGDQV 156 Query: 84 FILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD-KQPSDVLVALGYASWEKGQLEQEI 142 +LH SSI +S + +Q SD + Y + + ++E + Sbjct: 157 VLLHNHEEIGQSSIGVS-GVFQGGFDQAMAACEKGHRQTSDFKLFFNYCEFTELEMEDLL 215 Query: 143 LD----NAWLTAPADLNILFKTPIA--DRWRE 168 +AW++ D + + D W Sbjct: 216 ASDEDGDAWISVEVDSDFVLNDGWERGDAWSR 247 >UniRef50_UPI0001BCD1A1 hypothetical protein AmarD1_07934 n=1 Tax=Aeromicrobium marinum DSM 15272 RepID=UPI0001BCD1A1 Length = 53 Score = 41.2 bits (96), Expect = 0.015, Method: Composition-based stats. Identities = 8/34 (23%), Positives = 15/34 (44%) Query: 139 EQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 E E+ +++W+ AD L WR+ + Sbjct: 2 EDEVAESSWMVVAADPEDLLSPHPDTLWRQVLRR 35 >UniRef50_A5K110 Putative uncharacterized protein n=1 Tax=Plasmodium vivax RepID=A5K110_PLAVI Length = 1004 Score = 40.9 bits (95), Expect = 0.025, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 38/111 (34%), Gaps = 19/111 (17%) Query: 90 PSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLV--ALGYASWEKGQLEQEILDNAW 147 A+ + ++ + +G + + LV +G A+W+ QL E+ ++ W Sbjct: 874 EVGAANEVGAANEVGAANEVGIANQVGAANEAENKLVKRFIGKATWDVNQLMDELKNDYW 933 Query: 148 LTAPADLN-----ILFKTPIAD------------RWREAAKLIGVDILTMP 181 + D I+F T + W + I D ++ Sbjct: 934 IALNCDSKELLSRIIFNTATSGSGAAGSVYRGEFLWEKIVASINSDYESIS 984 >UniRef50_B3LAF2 Putative uncharacterized protein n=1 Tax=Plasmodium knowlesi strain H RepID=B3LAF2_PLAKH Length = 1030 Score = 40.5 bits (94), Expect = 0.026, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 15/73 (20%) Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPAD-----LNILFKTPI----------ADRWRE 168 + +G A+W+ QL E+ +N W+ D +I+F T W + Sbjct: 938 IKRFIGKATWDINQLMDELNNNYWIALNCDNKELLSSIIFNTADSVTDGSAYKGEFLWEK 997 Query: 169 AAKLIGVDILTMP 181 I D + Sbjct: 998 IVASISNDYENIS 1010 >UniRef50_C1E0N3 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E0N3_9CHLO Length = 511 Score = 39.3 bits (91), Expect = 0.066, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 33/87 (37%), Gaps = 3/87 (3%) Query: 81 DRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQ 140 + G L + IR+ + + D LE + D + V G A W + QL Sbjct: 376 EVGLALRYDSESGRWMIRLRNGEGKSVKPDNLEAM--DGEGGRVFAFWGDARWSRAQLLG 433 Query: 141 EILDNAWLTAPADLNILFKTPIADRWR 167 EI W A + + T A+RW Sbjct: 434 EIARGHWGLCRAGVGDI-TTSTAERWE 459 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A7ZR71 UPF0301 protein yqgE n=133 Tax=Gammaproteobacter... 239 4e-62 UniRef50_Q87LK0 UPF0301 protein VP2612 n=11 Tax=Gammaproteobacte... 233 3e-60 UniRef50_D0L0G7 Putative uncharacterized protein n=4 Tax=Gammapr... 230 1e-59 UniRef50_Q7MHK0 UPF0301 protein VV2869 n=47 Tax=Gammaproteobacte... 229 5e-59 UniRef50_A1JPT5 UPF0301 protein YE3428 n=22 Tax=Gammaproteobacte... 227 2e-58 UniRef50_B6EMV3 UPF0301 protein VSAL_I0547 n=39 Tax=Gammaproteob... 222 7e-57 UniRef50_B8E9P8 UPF0301 protein Sbal223_1344 n=7 Tax=Shewanella ... 216 3e-55 UniRef50_C9CSD6 Putative uncharacterized protein n=1 Tax=Silicib... 215 8e-55 UniRef50_A1RHM8 UPF0301 protein Sputw3181_1330 n=13 Tax=Proteoba... 215 8e-55 UniRef50_Q21EI7 UPF0301 protein Sde_3637 n=7 Tax=Proteobacteria ... 214 2e-54 UniRef50_Q1BYL1 UPF0301 protein Bcen_0382 n=60 Tax=Betaproteobac... 213 2e-54 UniRef50_Q3SFS4 UPF0301 protein Tbd_2579 n=2 Tax=Proteobacteria ... 213 3e-54 UniRef50_A1U764 Putative uncharacterized protein n=3 Tax=Marinob... 212 4e-54 UniRef50_B8FL92 UPF0301 protein Dalk_3037 n=1 Tax=Desulfatibacil... 211 7e-54 UniRef50_A9DAK3 Putative uncharacterized protein n=1 Tax=Hoeflea... 211 8e-54 UniRef50_Q478W0 UPF0301 protein Daro_3893 n=3 Tax=Betaproteobact... 211 9e-54 UniRef50_Q0EWH5 Putative uncharacterized protein n=1 Tax=Maripro... 210 2e-53 UniRef50_Q3SNY6 UPF0301 protein Nwi_2752 n=121 Tax=Alphaproteoba... 210 3e-53 UniRef50_Q605E8 UPF0301 protein MCA2336 2 n=3 Tax=Gammaproteobac... 210 3e-53 UniRef50_Q486M0 UPF0301 protein CPS_1252 n=1 Tax=Colwellia psych... 209 3e-53 UniRef50_Q163D2 UPF0301 protein RD1_3419 n=14 Tax=Rhodobacterale... 209 6e-53 UniRef50_Q5WYW5 UPF0301 protein lpl0620 n=6 Tax=Legionella RepID... 208 7e-53 UniRef50_B9NUS5 Putative uncharacterized protein n=2 Tax=Rhodoba... 207 2e-52 UniRef50_C5SQD7 Putative uncharacterized protein n=1 Tax=Asticca... 207 2e-52 UniRef50_A8ZZX8 Putative uncharacterized protein n=1 Tax=Desulfo... 206 2e-52 UniRef50_Q3IZ52 UPF0301 protein RHOS4_26140 n=6 Tax=Rhodobactera... 206 3e-52 UniRef50_Q0AMH8 Putative uncharacterized protein n=3 Tax=Hyphomo... 205 5e-52 UniRef50_Q0C3C2 Putative uncharacterized protein n=1 Tax=Hyphomo... 205 7e-52 UniRef50_A6VSP6 UPF0301 protein Mmwyl1_0539 n=2 Tax=Marinomonas ... 204 1e-51 UniRef50_A1SUE8 Putative uncharacterized protein n=2 Tax=Psychro... 204 1e-51 UniRef50_Q1YVF9 Putative uncharacterized protein n=1 Tax=gamma p... 203 2e-51 UniRef50_C6NYY5 Putative uncharacterized protein n=1 Tax=Acidith... 202 6e-51 UniRef50_Q4ZZ67 UPF0301 protein Psyr_0485 n=31 Tax=Proteobacteri... 201 8e-51 UniRef50_Q1MZ13 Putative uncharacterized protein n=1 Tax=Bermane... 201 9e-51 UniRef50_Q2P5W3 UPF0301 protein XOO1309 n=20 Tax=Xanthomonadacea... 201 1e-50 UniRef50_A4SVF0 Putative uncharacterized protein n=1 Tax=Polynuc... 200 1e-50 UniRef50_B5JTR0 Putative uncharacterized protein n=1 Tax=gamma p... 200 2e-50 UniRef50_B9MF60 UPF0301 protein Dtpsy_2896 n=28 Tax=Proteobacter... 200 2e-50 UniRef50_B4REX9 Transcriptional regulator n=4 Tax=Caulobacterace... 199 4e-50 UniRef50_B8KLD5 Putative uncharacterized protein n=3 Tax=Proteob... 198 8e-50 UniRef50_A8PPF4 Putative uncharacterized protein n=1 Tax=Rickett... 197 2e-49 UniRef50_A4BI12 Putative uncharacterized protein n=1 Tax=Reineke... 197 2e-49 UniRef50_C3K3J9 UPF0301 protein PFLU_5755 n=5 Tax=cellular organ... 194 1e-48 UniRef50_A6WWH2 Putative uncharacterized protein n=2 Tax=Ochroba... 194 1e-48 UniRef50_A1KUG1 UPF0301 protein NMC1274 n=27 Tax=Neisseriaceae R... 194 1e-48 UniRef50_A1TKL7 UPF0301 protein Aave_0907 n=10 Tax=Comamonadacea... 193 3e-48 UniRef50_C0QFZ0 UPF0301 protein HRM2_24640 n=1 Tax=Desulfobacter... 188 5e-47 UniRef50_A9KDE7 UPF0301 protein CBUD_2193 n=7 Tax=Coxiella burne... 188 1e-46 UniRef50_Q31EK4 UPF0301 protein Tcr_1827 n=1 Tax=Thiomicrospira ... 187 2e-46 UniRef50_A5WBR3 UPF0301 protein PsycPRwf_0144 n=21 Tax=Moraxella... 186 3e-46 UniRef50_Q1NQW6 Putative uncharacterized protein n=2 Tax=Deltapr... 185 9e-46 UniRef50_Q2GAJ3 UPF0301 protein Saro_0683 n=4 Tax=Sphingomonadac... 184 1e-45 UniRef50_A0L5K4 UPF0301 protein Mmc1_0726 n=1 Tax=Magnetococcus ... 184 1e-45 UniRef50_Q0I1B4 UPF0301 protein HS_0009 n=26 Tax=Pasteurellaceae... 183 2e-45 UniRef50_A3MYV4 UPF0301 protein APL_0232 n=7 Tax=Pasteurellaceae... 183 2e-45 UniRef50_B3QT15 Putative uncharacterized protein n=1 Tax=Chloroh... 182 5e-45 UniRef50_A4A9E0 Protein containing DUF179 n=1 Tax=Congregibacter... 182 7e-45 UniRef50_Q5FQY8 UPF0301 protein GOX1459 n=11 Tax=Acetobacteracea... 180 2e-44 UniRef50_Q60BQ2 UPF0301 protein MCA0413 1 n=1 Tax=Methylococcus ... 176 3e-43 UniRef50_Q3B561 UPF0301 protein Plut_0637 n=11 Tax=Chlorobiaceae... 175 8e-43 UniRef50_B0BVW3 UPF0301 protein RrIowa_0061 n=15 Tax=Rickettsia ... 174 1e-42 UniRef50_Q2S591 UPF0301 protein SRU_0495 n=2 Tax=Rhodothermaceae... 173 2e-42 UniRef50_Q6AL28 UPF0301 protein DP2218 n=1 Tax=Desulfotalea psyc... 173 3e-42 UniRef50_C8CIK8 Putative uncharacterized protein n=1 Tax=uncultu... 172 6e-42 UniRef50_A6C880 Putative uncharacterized protein n=1 Tax=Plancto... 172 8e-42 UniRef50_Q254Z3 UPF0301 protein CF0373 n=7 Tax=Chlamydiales RepI... 171 1e-41 UniRef50_D2QR79 Putative uncharacterized protein n=2 Tax=Flexiba... 169 3e-41 UniRef50_Q11U74 UPF0301 protein CHU_1773 n=2 Tax=Flexibacteracea... 166 4e-40 UniRef50_Q3KMF1 UPF0301 protein CTA_0231 n=9 Tax=Chlamydia RepID... 165 7e-40 UniRef50_Q1DAS2 UPF0301 protein MXAN_2022 n=2 Tax=Cystobacterine... 165 7e-40 UniRef50_Q5NQN1 UPF0301 protein ZMO0349 n=3 Tax=Zymomonas mobili... 163 3e-39 UniRef50_A7C130 Protein containing DUF179 n=1 Tax=Beggiatoa sp. ... 162 4e-39 UniRef50_A6LBX4 UPF0301 protein BDI_1431 n=6 Tax=Bacteroidales R... 162 5e-39 UniRef50_A3VRH6 Putative uncharacterized protein n=1 Tax=Parvula... 162 8e-39 UniRef50_Q5LDK5 UPF0301 protein BF2109 n=20 Tax=Bacteroides RepI... 161 1e-38 UniRef50_A3HT39 Putative uncharacterized protein n=1 Tax=Algorip... 160 3e-38 UniRef50_A6G4Z9 Putative uncharacterized protein n=1 Tax=Plesioc... 160 3e-38 UniRef50_C2G2S7 Transcriptional regulator n=3 Tax=Sphingobacteri... 159 5e-38 UniRef50_C3Q021 UPF0301 protein n=8 Tax=Bacteroides RepID=C3Q021... 159 5e-38 UniRef50_A5FNN9 Putative uncharacterized protein n=18 Tax=Bacter... 158 7e-38 UniRef50_C7PSM3 Putative uncharacterized protein n=1 Tax=Chitino... 158 1e-37 UniRef50_C1ZJG4 Predicted transcriptional regulator, COG1678 n=1... 157 2e-37 UniRef50_Q0BLI0 UPF0301 protein FTH_1193 n=18 Tax=Francisella Re... 157 2e-37 UniRef50_B9XJW1 Putative uncharacterized protein n=1 Tax=bacteri... 157 3e-37 UniRef50_A9RVW3 Predicted protein n=1 Tax=Physcomitrella patens ... 156 5e-37 UniRef50_D0LIU3 Putative uncharacterized protein n=1 Tax=Haliang... 155 6e-37 UniRef50_UPI0001C3133A protein of unknown function DUF179 n=1 Ta... 153 2e-36 UniRef50_A7H7H6 UPF0301 protein Anae109_0457 n=4 Tax=Anaeromyxob... 152 5e-36 UniRef50_A3ZQK2 Putative uncharacterized protein n=1 Tax=Blastop... 151 9e-36 UniRef50_C6X421 Putative transcriptional regulator n=1 Tax=Flavo... 150 2e-35 UniRef50_A4C260 Putative transcriptional regulator n=1 Tax=Polar... 150 2e-35 UniRef50_C1E5G6 Predicted protein n=2 Tax=Micromonas RepID=C1E5G... 150 3e-35 UniRef50_A4S673 Predicted protein (Fragment) n=2 Tax=Ostreococcu... 150 3e-35 UniRef50_Q9LQ30 F14M2.10 protein n=6 Tax=rosids RepID=Q9LQ30_ARATH 148 6e-35 UniRef50_A9GTQ2 Putative uncharacterized protein n=1 Tax=Sorangi... 148 7e-35 UniRef50_B0SHS8 Transcriptional regulator n=6 Tax=Leptospira Rep... 148 8e-35 UniRef50_A6E847 Putative uncharacterized protein n=1 Tax=Pedobac... 148 1e-34 UniRef50_C1D0N0 Putative uncharacterized protein n=3 Tax=Deinoco... 147 1e-34 UniRef50_UPI0001745679 hypothetical protein VspiD_25265 n=1 Tax=... 147 1e-34 UniRef50_Q1Q3L0 Putative uncharacterized protein n=1 Tax=Candida... 145 6e-34 UniRef50_B7G677 Predicted protein n=2 Tax=Bacillariophyta RepID=... 142 4e-33 UniRef50_D2R140 Putative uncharacterized protein n=1 Tax=Pirellu... 142 5e-33 UniRef50_Q47MA0 UPF0301 protein Tfu_2389 n=3 Tax=Actinomycetales... 142 5e-33 UniRef50_B9SPX9 Electron transporter, putative n=2 Tax=fabids Re... 142 6e-33 UniRef50_C0YNI4 Transcriptional regulator n=1 Tax=Chryseobacteri... 142 7e-33 UniRef50_C1RPZ7 Predicted transcriptional regulator, COG1678 n=1... 141 8e-33 UniRef50_Q82D55 UPF0301 protein SAV_5129 n=12 Tax=Actinomycetale... 141 1e-32 UniRef50_C7MS43 Predicted transcriptional regulator n=3 Tax=Acti... 140 3e-32 UniRef50_C5YY61 Putative uncharacterized protein Sb09g020680 n=4... 138 7e-32 UniRef50_Q2BRE1 Putative uncharacterized protein n=1 Tax=Neptuni... 138 1e-31 UniRef50_B4CVG9 Putative uncharacterized protein n=1 Tax=Chthoni... 137 2e-31 UniRef50_Q8S0Q9 Os01g0886000 protein n=6 Tax=Poaceae RepID=Q8S0Q... 137 2e-31 UniRef50_B1ZWF6 Putative uncharacterized protein n=2 Tax=Opituta... 134 1e-30 UniRef50_B1MML1 UPF0301 protein MAB_4928c n=20 Tax=Corynebacteri... 134 1e-30 UniRef50_Q0FVR8 Putative uncharacterized protein (Fragment) n=1 ... 133 3e-30 UniRef50_B5JR88 Putative uncharacterized protein n=1 Tax=Verruco... 131 1e-29 UniRef50_C4DLM5 Predicted transcriptional regulator, COG1678 n=5... 131 1e-29 UniRef50_A8IT21 Predicted protein n=1 Tax=Chlamydomonas reinhard... 130 2e-29 UniRef50_C1B7P4 UPF0301 protein ROP_34500 n=13 Tax=Corynebacteri... 130 2e-29 UniRef50_A8IRU3 Predicted protein n=1 Tax=Chlamydomonas reinhard... 129 4e-29 UniRef50_A1SG68 Putative uncharacterized protein n=2 Tax=Actinom... 125 6e-28 UniRef50_C8XE74 Putative uncharacterized protein n=2 Tax=Actinom... 125 1e-27 UniRef50_B2UMM6 Putative uncharacterized protein n=1 Tax=Akkerma... 123 2e-27 UniRef50_Q8NL65 UPF0301 protein Cgl3084/cg3414 n=5 Tax=Corynebac... 122 5e-27 UniRef50_Q8FSW7 UPF0301 protein CE2927 n=11 Tax=Corynebacterium ... 121 9e-27 UniRef50_Q7URG7 Probable transcriptional regulator n=1 Tax=Rhodo... 119 5e-26 UniRef50_Q6A827 Conserved protein, DUF179 n=2 Tax=Propionibacter... 118 1e-25 UniRef50_C7Q9Z6 Putative uncharacterized protein n=5 Tax=Actinom... 117 2e-25 UniRef50_Q7G645 Os10g0330400 protein n=3 Tax=Oryza sativa RepID=... 116 4e-25 UniRef50_Q9LS71 Emb|CAB72194.1 n=4 Tax=rosids RepID=Q9LS71_ARATH 116 5e-25 UniRef50_C0AXX7 Putative uncharacterized protein n=1 Tax=Proteus... 114 2e-24 UniRef50_C7PBJ3 Putative uncharacterized protein n=1 Tax=Chitino... 113 3e-24 UniRef50_D0NN30 Putative uncharacterized protein n=1 Tax=Phytoph... 111 8e-24 UniRef50_D0A6S5 Putative uncharacterized protein n=2 Tax=Trypano... 111 1e-23 UniRef50_Q54HU6 Putative uncharacterized protein n=1 Tax=Dictyos... 111 2e-23 UniRef50_C5WYB1 Putative uncharacterized protein Sb01g018920 n=1... 110 2e-23 UniRef50_Q4CSL3 Putative uncharacterized protein n=2 Tax=Trypano... 110 2e-23 UniRef50_B8C551 Predicted protein n=1 Tax=Thalassiosira pseudona... 106 4e-22 UniRef50_B9HGN1 Predicted protein n=1 Tax=Populus trichocarpa Re... 105 7e-22 UniRef50_C1E1K3 Predicted protein n=2 Tax=cellular organisms Rep... 105 1e-21 UniRef50_Q0IWV7 Os10g0485100 protein (Fragment) n=5 Tax=Poaceae ... 105 1e-21 UniRef50_D1HM83 Whole genome shotgun sequence of line PN40024, s... 103 4e-21 UniRef50_Q4QG99 Putative uncharacterized protein n=3 Tax=Leishma... 103 4e-21 UniRef50_B8C1S9 Predicted protein n=1 Tax=Thalassiosira pseudona... 103 5e-21 UniRef50_D1I242 Whole genome shotgun sequence of line PN40024, s... 103 5e-21 UniRef50_Q9LT30 Genomic DNA, chromosome 3, P1 clone: MPN9 n=2 Ta... 101 1e-20 UniRef50_B6SSQ6 Uncharacterized ACR, COG1678 family protein n=3 ... 100 4e-20 UniRef50_A9S274 Predicted protein n=1 Tax=Physcomitrella patens ... 98 1e-19 UniRef50_C5BU30 Putative uncharacterized protein n=1 Tax=Teredin... 98 2e-19 UniRef50_A4RT64 Predicted protein n=2 Tax=Ostreococcus RepID=A4R... 96 7e-19 UniRef50_Q7UKQ8 Putative uncharacterized protein n=1 Tax=Rhodopi... 94 2e-18 UniRef50_C5L030 Membrane associated RING finger, putative n=1 Ta... 93 3e-18 UniRef50_B8CDL7 Predicted protein n=1 Tax=Thalassiosira pseudona... 93 5e-18 UniRef50_A9T9H0 Predicted protein n=1 Tax=Physcomitrella patens ... 92 1e-17 UniRef50_D2VPR5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 87 3e-16 UniRef50_B7G772 Predicted protein n=1 Tax=Phaeodactylum tricornu... 84 3e-15 UniRef50_B7G8R3 Predicted protein n=1 Tax=Phaeodactylum tricornu... 81 2e-14 UniRef50_B8BQD7 Predicted protein n=1 Tax=Thalassiosira pseudona... 81 2e-14 UniRef50_B8BZN2 Predicted protein n=1 Tax=Thalassiosira pseudona... 79 9e-14 UniRef50_C0AXX9 Putative uncharacterized protein n=1 Tax=Proteus... 66 7e-10 UniRef50_B6KQL2 Putative uncharacterized protein n=4 Tax=Toxopla... 58 1e-07 UniRef50_A5AUI4 Putative uncharacterized protein n=1 Tax=Vitis v... 57 4e-07 UniRef50_B8CF73 Predicted protein n=2 Tax=Thalassiosira pseudona... 56 6e-07 UniRef50_C5KHP8 Putative uncharacterized protein n=1 Tax=Perkins... 55 2e-06 Sequences not found previously or not previously below threshold: UniRef50_B7GE41 Predicted protein n=1 Tax=Phaeodactylum tricornu... 48 2e-04 UniRef50_Q4DTN4 Putative uncharacterized protein n=3 Tax=Trypano... 46 5e-04 UniRef50_UPI0001BCD1A1 hypothetical protein AmarD1_07934 n=1 Tax... 46 7e-04 UniRef50_B3LAF2 Putative uncharacterized protein n=1 Tax=Plasmod... 44 0.002 UniRef50_C1FFU8 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 44 0.003 UniRef50_Q7RTH1 Glutamic acid-rich protein n=3 Tax=Plasmodium (V... 43 0.004 UniRef50_A5K110 Putative uncharacterized protein n=1 Tax=Plasmod... 43 0.006 UniRef50_O68558 Putative uncharacterized protein (Fragment) n=1 ... 41 0.017 UniRef50_B8BQX9 Predicted protein n=1 Tax=Thalassiosira pseudona... 40 0.032 UniRef50_Q00WL0 Protein involved in mRNA turnover and stability ... 40 0.046 UniRef50_A4I406 Putative uncharacterized protein n=3 Tax=Leishma... 40 0.056 UniRef50_C1E0N3 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 39 0.070 UniRef50_C1MQ75 Predicted protein n=1 Tax=Micromonas pusilla CCM... 39 0.084 >UniRef50_A7ZR71 UPF0301 protein yqgE n=133 Tax=Gammaproteobacteria RepID=YQGE_ECO24 Length = 187 Score = 239 bits (610), Expect = 4e-62, Method: Composition-based stats. Identities = 187/187 (100%), Positives = 187/187 (100%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE Sbjct: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ Sbjct: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM Sbjct: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 Query: 181 PGVAGHA 187 PGVAGHA Sbjct: 181 PGVAGHA 187 >UniRef50_Q87LK0 UPF0301 protein VP2612 n=11 Tax=Gammaproteobacteria RepID=Y2612_VIBPA Length = 187 Score = 233 bits (594), Expect = 3e-60, Method: Composition-based stats. Identities = 92/188 (48%), Positives = 136/188 (72%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL +HFL+AMP ++DP F+ SV+Y+CEHN GAMG+++N P++ + + +L+++ + P Sbjct: 1 MNLTNHFLVAMPGMKDPYFQNSVIYVCEHNEEGAMGLMINAPVD-ITVGNMLKQVDVQPV 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 PR LD+PV GGP++EDRGFILH P + SSI+++D+ +TTSRD+L LGT+ Sbjct: 60 HPRLFEASLDRPVYNGGPISEDRGFILHKPKDYYESSIQMTDDLAVTTSRDILSVLGTEA 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +PSD LVALGY+ W GQLE E+++N+WLT A I+F TPI +RW++A + +G+D Sbjct: 120 EPSDYLVALGYSGWSAGQLENELVENSWLTIEATPEIIFDTPITERWKKAVEKLGIDPSQ 179 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 180 LSADAGHA 187 >UniRef50_D0L0G7 Putative uncharacterized protein n=4 Tax=Gammaproteobacteria RepID=D0L0G7_HALNC Length = 216 Score = 230 bits (588), Expect = 1e-59, Method: Composition-based stats. Identities = 81/187 (43%), Positives = 123/187 (65%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L++ LIAMP+L DP F +V Y+CEHN +GAMGI +N+PL+ + + I + +KI+ Sbjct: 34 IQLKNQILIAMPSLDDPNFNHTVTYVCEHNEDGAMGITINRPLD-VTLGDIFDHMKISCS 92 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 +PV +GGP+A +RGF+LHTP + S++ I+D +TTS+D+L+ L Sbjct: 93 ---NPSIRGRPVFMGGPVALERGFVLHTPHGGWESTLEITDEIGLTTSKDILQALAEGAG 149 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P+ ++ALGY+ W +GQLEQE+ DN WLT A ++F P+ +RW AA+ +GVD+ + Sbjct: 150 PARAVIALGYSGWSEGQLEQELADNTWLTVAATTELIFDYPVEERWAAAARSLGVDMNLL 209 Query: 181 PGVAGHA 187 G AGHA Sbjct: 210 SGEAGHA 216 >UniRef50_Q7MHK0 UPF0301 protein VV2869 n=47 Tax=Gammaproteobacteria RepID=Y2869_VIBVY Length = 187 Score = 229 bits (583), Expect = 5e-59, Method: Composition-based stats. Identities = 89/188 (47%), Positives = 135/188 (71%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP- 59 MNL +HFL+AMP ++DP F+ SV+YICEHN GAMG+++N P++ + + +LE++ + P Sbjct: 1 MNLTNHFLVAMPGMKDPYFQHSVIYICEHNEEGAMGLMINAPID-ITVGKMLEQVDVQPV 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P+ + L KPV GGP+AEDRGFILH P + SS+++++ +TTS+D+L LGT+ Sbjct: 60 HPQLNTSSLTKPVYNGGPVAEDRGFILHRPKDFYESSLQMTEQISVTTSKDILTVLGTEA 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 +PS +VALGY+ W GQLE E+ +N+WLT A+ +I+F TPIA RW++A +++G+ Sbjct: 120 EPSSYIVALGYSGWSAGQLEAELAENSWLTVEANPDIIFDTPIAMRWQKAVQMLGIHASQ 179 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 180 LSDQAGHA 187 >UniRef50_A1JPT5 UPF0301 protein YE3428 n=22 Tax=Gammaproteobacteria RepID=Y3428_YERE8 Length = 187 Score = 227 bits (579), Expect = 2e-58, Method: Composition-based stats. Identities = 125/187 (66%), Positives = 154/187 (82%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQHHFLIAMP+LQDP F RSV+YICEHN GAMG+++NKP+E +E +L+KLKI+P Sbjct: 1 MNLQHHFLIAMPSLQDPHFMRSVIYICEHNKEGAMGLVINKPMEQFTVETVLKKLKISPT 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 PRD SIRLDK V+ GGPLAEDRGFILH+P F SSI IS +T++TTS+DVLETLGT +Q Sbjct: 61 PRDPSIRLDKAVLAGGPLAEDRGFILHSPQEGFGSSIPISPDTMITTSKDVLETLGTPEQ 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++LVALGYA W++GQLEQE+LDNAWLT AD +ILF TPIA+RW+ AA +G++I + Sbjct: 121 PKNLLVALGYAGWQQGQLEQELLDNAWLTIEADTHILFNTPIAERWQAAANKLGINIFNI 180 Query: 181 PGVAGHA 187 AGHA Sbjct: 181 APQAGHA 187 >UniRef50_B6EMV3 UPF0301 protein VSAL_I0547 n=39 Tax=Gammaproteobacteria RepID=Y547_ALISL Length = 187 Score = 222 bits (565), Expect = 7e-57, Method: Composition-based stats. Identities = 88/188 (46%), Positives = 140/188 (74%), Gaps = 2/188 (1%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-P 59 M+L++HFL+AMP++ DP+F RSV+YICEH+++G MG+ +N+P++ + ++G+L+++K+ P Sbjct: 1 MDLKNHFLVAMPSMNDPVFTRSVIYICEHDSDGTMGLRINQPVQ-ISLKGMLDQIKLDNP 59 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P L +PV+ GGP+++DRGF+LH P N++SSI +++ +TTS+D+L TLGT+ Sbjct: 60 SPIIFPQTLSQPVLNGGPVSDDRGFVLHYPKDNYSSSIEVTEELSVTTSKDILATLGTED 119 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 QP LVALGY+ W+ GQLEQE+ +N WL AD +++F TPI DRWR A +++G+ + Sbjct: 120 QPYKYLVALGYSGWDAGQLEQELSENTWLILEADSSVIFDTPIPDRWRRAIEILGISPVN 179 Query: 180 MPGVAGHA 187 + GHA Sbjct: 180 ISSEVGHA 187 >UniRef50_B8E9P8 UPF0301 protein Sbal223_1344 n=7 Tax=Shewanella RepID=Y1344_SHEB2 Length = 187 Score = 216 bits (550), Expect = 3e-55, Method: Composition-based stats. Identities = 87/186 (46%), Positives = 126/186 (67%), Gaps = 1/186 (0%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ+HFLIAMP+L D F RSV+YICEH+ GAMG+++NKPL +++ +LE++ + E Sbjct: 3 SLQNHFLIAMPSLHDTFFERSVIYICEHDAKGAMGLVINKPL-GIEVNSLLEQMDLPAEQ 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + + VM+GGP+++DRGF+LHT +A+S + ++TTSRDVL +G+++ P Sbjct: 62 VSTDLAFNANVMMGGPVSQDRGFVLHTSQPYWANSTDLGCGLMLTTSRDVLTAIGSNRSP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGYA W K QLEQE+ DN+WLT PA +LF DRW +A++ +G D + Sbjct: 122 EKFLVALGYAGWSKDQLEQELADNSWLTIPATNALLFDIKHEDRWPQASRALGFDAWQVS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 AQAGHA 187 >UniRef50_C9CSD6 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CSD6_9RHOB Length = 219 Score = 215 bits (547), Expect = 8e-55, Method: Composition-based stats. Identities = 73/188 (38%), Positives = 101/188 (53%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M L LIAMP + DP F SVV++C H GAMG+I+NK + ++ ++++L+I E Sbjct: 36 MELTGKLLIAMPGIGDPRFDNSVVFLCSHGDEGAMGLIINKLAPGVALQTLMDQLEIDIE 95 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDK 119 P S PV GGP+ RGF+LH+ +S+ + MT + DVLE + + Sbjct: 96 PAIASA----PVYFGGPVETQRGFVLHSDEYISTVNSLPVKPGFSMTATLDVLEDIAEGR 151 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P LV LGYA W GQLE EI N WLT A+ ++F +W A +GV L Sbjct: 152 GPERYLVMLGYAGWGPGQLEDEIAQNGWLTTDAEPEMIFTDTADTKWEAALASLGVTPLN 211 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 212 LSMDAGHA 219 >UniRef50_A1RHM8 UPF0301 protein Sputw3181_1330 n=13 Tax=Proteobacteria RepID=Y1330_SHESW Length = 187 Score = 215 bits (547), Expect = 8e-55, Method: Composition-based stats. Identities = 86/186 (46%), Positives = 124/186 (66%), Gaps = 1/186 (0%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ+HFLIAMP+L D F RSV+Y+CEH+ GAMGI++NKPL +++ +LE++ + E Sbjct: 3 SLQNHFLIAMPSLDDTFFERSVIYLCEHDDKGAMGIVINKPL-GIEVSSLLEQMDLPAEQ 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 I + V++GGP+++DRGF+LHT +A+S + ++TTSRDVL +G + P Sbjct: 62 VFADIAQNAQVLMGGPVSQDRGFVLHTSQPYWANSTDLGSGLMLTTSRDVLTAIGGKRSP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGYA W K QLEQE+ +N+WLT PA +LF DRW +A++ +G D + Sbjct: 122 DKFLVALGYAGWGKHQLEQELAENSWLTIPATNALLFDVKHEDRWPQASRSLGFDAWQVS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 AQAGHA 187 >UniRef50_Q21EI7 UPF0301 protein Sde_3637 n=7 Tax=Proteobacteria RepID=Y3637_SACD2 Length = 203 Score = 214 bits (544), Expect = 2e-54, Method: Composition-based stats. Identities = 86/187 (45%), Positives = 124/187 (66%), Gaps = 5/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 ++L+ HFLIAMP LQDPIF RS+ YIC+H GAMGI+VN+P+ NL + I E+L++ Sbjct: 22 VSLRDHFLIAMPGLQDPIFSRSLTYICDHTAQGAMGIVVNQPM-NLTLGDIFEQLELQ-- 78 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 D++ + + V+ GGP+ +RGF+LH + S++ I+ + +T SRD++ + + Sbjct: 79 --DKAQQAGRAVLAGGPVNTERGFVLHRDSGAWESTMHIAPDVNLTASRDIVHAIANNTG 136 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P L ALGYA W GQLE+EI N+WLT PAD +I+F P+ DRW AA+ +G+DI M Sbjct: 137 PKSSLFALGYAGWSAGQLEEEISANSWLTIPADSSIIFDIPVEDRWAAAARQLGIDIHLM 196 Query: 181 PGVAGHA 187 AGHA Sbjct: 197 SATAGHA 203 >UniRef50_Q1BYL1 UPF0301 protein Bcen_0382 n=60 Tax=Betaproteobacteria RepID=Y382_BURCA Length = 192 Score = 213 bits (543), Expect = 2e-54, Method: Composition-based stats. Identities = 78/189 (41%), Positives = 115/189 (60%), Gaps = 6/189 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL + FLIAMP + DP F +VVY+C+H+ GA+G+++N+P + + +E + ++ + Sbjct: 8 INLTNQFLIAMPNMADPTFSGTVVYLCDHSERGALGLVINRPTD-IDLESLFNRIDL--- 63 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP--PSNFASSIRISDNTVMTTSRDVLETLGTD 118 D L PV GGP+ +RGF+LH P +++ SS+ + MTTS+DVLE + T Sbjct: 64 KLDIEPLLHIPVYFGGPVQTERGFVLHEPVEGASYNSSMSVDGGLEMTTSKDVLEAVATG 123 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P L+ LG+A W GQLE+EI N WLT AD I+F TP +R+ A L+GV Sbjct: 124 TGPKRFLLTLGHAGWGAGQLEEEIARNGWLTVAADPRIVFDTPAEERFEAALGLLGVSSS 183 Query: 179 TMPGVAGHA 187 + G AGHA Sbjct: 184 MLSGEAGHA 192 >UniRef50_Q3SFS4 UPF0301 protein Tbd_2579 n=2 Tax=Proteobacteria RepID=Y2579_THIDA Length = 185 Score = 213 bits (542), Expect = 3e-54, Method: Composition-based stats. Identities = 75/187 (40%), Positives = 116/187 (62%), Gaps = 5/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL +HFLIAMP + DP F ++ YIC+H+ GA+G++VN+P++ L + + E++ ++ Sbjct: 4 VNLTNHFLIAMPGMVDPNFNGTLTYICDHSDQGALGVVVNRPID-LDLSTLFEQIGLSLP 62 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 V GGP+ +RGF+LHTPP F+S++ ++D +TTS+DVLE + Sbjct: 63 EGLHGEI----VYFGGPVQTERGFVLHTPPLTFSSTLTVNDAVSLTTSKDVLEAVSQGAG 118 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P +V+LGYA W GQLE E+ NAWL+ AD ++F +R A KL+G+D ++ Sbjct: 119 PEKFIVSLGYAGWSAGQLEDELKQNAWLSVAADPQVIFDLAPEERLPAAMKLLGIDFASL 178 Query: 181 PGVAGHA 187 AGHA Sbjct: 179 SDEAGHA 185 >UniRef50_A1U764 Putative uncharacterized protein n=3 Tax=Marinobacter RepID=A1U764_MARAV Length = 188 Score = 212 bits (541), Expect = 4e-54, Method: Composition-based stats. Identities = 82/186 (44%), Positives = 126/186 (67%), Gaps = 7/186 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+HHFL+A P L DP F V+Y+CEH+ GA+G+++N+PL+ + + ILE+L + Sbjct: 10 SLRHHFLVASPWLADPRFHGGVIYLCEHSEEGALGLMINQPLD-IHLGEILEQLDM---- 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 LD PV GGP+ +RGF+LH+P + ++ R++D ++TTSRD+LE++G D+ P Sbjct: 65 --HGGELDLPVYTGGPVQPERGFVLHSPGRQWQNTARVTDEVLLTTSRDILESIGRDEGP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LVALGY+ W +GQLE+E+ NAWLT PA +ILF+TP R++ +L+G+D+ + Sbjct: 123 ESFLVALGYSGWGEGQLEEELGSNAWLTCPASTDILFRTPADQRYQAVLRLMGIDLNQLS 182 Query: 182 GVAGHA 187 GHA Sbjct: 183 DSVGHA 188 >UniRef50_B8FL92 UPF0301 protein Dalk_3037 n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=Y3037_DESAA Length = 189 Score = 211 bits (538), Expect = 7e-54, Method: Composition-based stats. Identities = 73/186 (39%), Positives = 118/186 (63%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L FLIAMPAL DP F SV YIC HN +GA G+++N+ +++ + + +++ + Sbjct: 8 SLAGQFLIAMPALNDPNFALSVTYICVHNQDGAFGLVINQSFDSVTGKTLFDQMDMPAVK 67 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E ++ V +GGP+ + F+LH P + +S++ISD M+ S D+L+ + + P Sbjct: 68 AAE----NQTVHIGGPVHQGYVFVLHGRPMEWKASLQISDTVAMSNSTDILQAIASGVGP 123 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++ LG A W GQLE E+ +N+WLT P + ++LF+TP+ +RW +AA+ IGVD+ + Sbjct: 124 DPCMIFLGCAGWAPGQLEAELAENSWLTCPGNDDLLFRTPLEERWEKAAQSIGVDLNLLS 183 Query: 182 GVAGHA 187 GVAGHA Sbjct: 184 GVAGHA 189 >UniRef50_A9DAK3 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DAK3_9RHIZ Length = 209 Score = 211 bits (538), Expect = 8e-54, Method: Composition-based stats. Identities = 65/195 (33%), Positives = 109/195 (55%), Gaps = 11/195 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L HFL+AMP++ D F R+V+++C H+ +GAMG I+N+P + L E ++E L + + R Sbjct: 16 LDGHFLLAMPSMSDERFERAVIFVCAHSEDGAMGFILNQP-QPLSFEELVENLDLDSQER 74 Query: 63 DESIRL----------DKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVL 112 ++ + + P+ GGP+ RGF+LH+ S++ ++D+ +T + D+L Sbjct: 75 RDADKSRKIGMSECARNFPIQFGGPVDPGRGFVLHSDDYMTESTMPVNDDLCLTATIDIL 134 Query: 113 ETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + P ++ LGYA W GQLEQE+ NAWL+ PA +I+F + ++ Sbjct: 135 RAIKDGCGPVRGMMLLGYAGWGPGQLEQEMAANAWLSCPASDDIVFDRDHSAKYDRVLSH 194 Query: 173 IGVDILTMPGVAGHA 187 +GV + AGHA Sbjct: 195 MGVSPAMLSMEAGHA 209 >UniRef50_Q478W0 UPF0301 protein Daro_3893 n=3 Tax=Betaproteobacteria RepID=Y3893_DECAR Length = 186 Score = 211 bits (538), Expect = 9e-54, Method: Composition-based stats. Identities = 86/187 (45%), Positives = 125/187 (66%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +NL +FLIAMP L+DP F ++VYICEHN NGA+GIIVN+P++ + + +LEK+ I E Sbjct: 4 VNLTDNFLIAMPTLEDPYFSNALVYICEHNENGALGIIVNRPID-MNLASLLEKIDIKLE 62 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + D PV GGP+ DRGF+LH P + S++ I+ + +T+SRDVL ++G+ Sbjct: 63 AEN---LADMPVYFGGPVQLDRGFVLHRPIGQWQSTLAINSDVGLTSSRDVLSSVGSAGL 119 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P+++LV LGYA W+ GQLE+E+ N+WLT PA +ILF P +R A + +G+ + Sbjct: 120 PAEILVTLGYAGWDAGQLEEELAQNSWLTVPAKASILFDLPPEERLPAAMQKLGISFTQL 179 Query: 181 PGVAGHA 187 VAGHA Sbjct: 180 SDVAGHA 186 >UniRef50_Q0EWH5 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EWH5_9PROT Length = 191 Score = 210 bits (535), Expect = 2e-53, Method: Composition-based stats. Identities = 72/188 (38%), Positives = 109/188 (57%), Gaps = 3/188 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE- 60 L L+A P+LQDP FR +VV IC+H+ +G +G+I+N+P ++ + I + + I E Sbjct: 5 GLTGQILLATPSLQDPNFRDTVVLICQHDRDGCLGLIINRP-RDIILGEIFDDMGIRYET 63 Query: 61 -PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 + R+ V GGP+ RGF+LH + S++++S +T SRD LE L + Sbjct: 64 GSAENHERIQPVVYEGGPMDGFRGFLLHDGWDVYDSTMQVSPELHLTASRDALEELARGQ 123 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P ++ LGYA W GQLEQE+ DN+WL APA I+F+ P RW AA+ +G++ Sbjct: 124 GPEHYMLLLGYAGWGAGQLEQELCDNSWLIAPASHQIIFQEPPEKRWDFAARCMGIERGQ 183 Query: 180 MPGVAGHA 187 + GHA Sbjct: 184 LSSQIGHA 191 >UniRef50_Q3SNY6 UPF0301 protein Nwi_2752 n=121 Tax=Alphaproteobacteria RepID=Y2752_NITWN Length = 221 Score = 210 bits (534), Expect = 3e-53, Method: Composition-based stats. Identities = 71/189 (37%), Positives = 102/189 (53%), Gaps = 4/189 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIAMP ++D F RSV+Y+C H++ GAMGII+N+P ++ +L +L I Sbjct: 33 LDGQLLIAMPVMEDERFARSVIYVCAHSSEGAMGIILNRPAGSVDFSDLLVQLDIIKRAD 92 Query: 63 D---ESIRLDKPVMLGGPLAEDRGFILHTPPSNFAS-SIRISDNTVMTTSRDVLETLGTD 118 VM GGP+ RGF+LH+ ++ I + +T + D+LE + Sbjct: 93 LIKLPETAETMKVMKGGPVETGRGFVLHSSDFFIEDATLPIDEGICLTATLDILEAIAKG 152 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P ++ALGYA W GQLE EI DN WL PAD +++F I D++ A IG+D Sbjct: 153 AGPKHAILALGYAGWAPGQLETEIQDNGWLHCPADQDLIFGRDIEDKYVRALHKIGIDPG 212 Query: 179 TMPGVAGHA 187 + AGHA Sbjct: 213 MLSNEAGHA 221 >UniRef50_Q605E8 UPF0301 protein MCA2336 2 n=3 Tax=Gammaproteobacteria RepID=Y2336_METCA Length = 188 Score = 210 bits (534), Expect = 3e-53, Method: Composition-based stats. Identities = 87/185 (47%), Positives = 128/185 (69%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L +HFLIAMP L DP F ++V +C+HN +GA+GII+N+P E LK+ I+ +++I + Sbjct: 8 LANHFLIAMPGLTDPHFAKTVTLVCQHNADGALGIIINRPSE-LKLSDIMRQMEIDLKV- 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + D PV GGP+ +RGFILH P + +AS++ +S+ +TTSRD+LE +G + P Sbjct: 66 --AELGDLPVFFGGPVHPERGFILHEPATVWASTLVVSERLALTTSRDILEAVGRGEGPR 123 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +L+ALGYA W +GQLE+EI+DN+WL AP+D ++F+ P RW+ AA L+GVDI + Sbjct: 124 RMLLALGYAGWGQGQLEREIIDNSWLNAPSDNAVIFEHPPGRRWKAAADLVGVDISLLTS 183 Query: 183 VAGHA 187 AGH Sbjct: 184 QAGHG 188 >UniRef50_Q486M0 UPF0301 protein CPS_1252 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y1252_COLP3 Length = 210 Score = 209 bits (533), Expect = 3e-53, Method: Composition-based stats. Identities = 84/209 (40%), Positives = 129/209 (61%), Gaps = 24/209 (11%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLK----- 56 +L++ LIAMP+L DP F ++V YICEHN +GAMG+I+N P+ N+ + +L++++ Sbjct: 3 SLENQLLIAMPSLGDPYFNKTVTYICEHNEDGAMGLIINLPV-NITLADLLKQIEPDEGD 61 Query: 57 ------------------ITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIR 98 D + L++ V+ GGP+A+ RGF+LH+ ++SS+ Sbjct: 62 KTGNVNSNSELTKSDDVNDITLVTDITNSLEQLVLAGGPIAQQRGFVLHSSQPGWSSSLV 121 Query: 99 ISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILF 158 +S ++TTS+D+L LGT + P +V LGYA W GQLEQE+ N+WLT PAD+ ILF Sbjct: 122 LSKELMITTSKDILMALGTQQAPEQFIVTLGYAGWGPGQLEQELQANSWLTTPADIEILF 181 Query: 159 KTPIADRWREAAKLIGVDILTMPGVAGHA 187 KTPI RW++A + +G+D+ + GHA Sbjct: 182 KTPIEQRWKKATEKLGIDLAHLSTDIGHA 210 >UniRef50_Q163D2 UPF0301 protein RD1_3419 n=14 Tax=Rhodobacterales RepID=Y3419_ROSDO Length = 184 Score = 209 bits (531), Expect = 6e-53, Method: Composition-based stats. Identities = 73/188 (38%), Positives = 108/188 (57%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL+ L+AMP++ DP F+ +V+ IC H+ GAMG+I+NKP ++I +L++L I Sbjct: 1 MNLEGKLLVAMPSMGDPRFQNAVILICAHSAKGAMGLIINKPTPEIRISDVLDQLDILSS 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTV-MTTSRDVLETLGTDK 119 + + V GGP+ RGF+LH+ + + I D MT + D+LE + + Sbjct: 61 QKTR----EMVVHFGGPVETGRGFVLHSTDYASSLNTLIVDGAFGMTATLDILEEIADGR 116 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P+ L+ LGYA W GQLE EI N WLT A +++F P A +W EA +GVD + Sbjct: 117 GPAQALMMLGYAGWGGGQLENEIAQNGWLTTNATSDLVFDLPAARKWSEALHSLGVDPIN 176 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 177 LSPAAGHA 184 >UniRef50_Q5WYW5 UPF0301 protein lpl0620 n=6 Tax=Legionella RepID=Y620_LEGPL Length = 187 Score = 208 bits (530), Expect = 7e-53, Method: Composition-based stats. Identities = 74/186 (39%), Positives = 118/186 (63%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L + LIAMP+L+DP F RSVVY+CEHN G++G+I+N+PL+ + + E+L+I P Sbjct: 6 SLANQLLIAMPSLKDPNFERSVVYLCEHNEQGSVGLIINRPLQ-FPLSIVFEQLQIEPIR 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 +++ P++ GGP+ +RGF++H + SS+ + D +TTS D++ + D+ P Sbjct: 65 VEKNGL---PLLFGGPVQPERGFVIHKQMGGWRSSLFLQDEVTVTTSNDIIRAIAYDEGP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 DVL+ LGYA+W + QLE+EI+ N WL P IL++ P +RW A +G+ + + Sbjct: 122 KDVLITLGYAAWTEQQLEREIMSNTWLVCPYKSEILYEVPFEERWEYAGLTLGIKMNQLS 181 Query: 182 GVAGHA 187 AGHA Sbjct: 182 SDAGHA 187 >UniRef50_B9NUS5 Putative uncharacterized protein n=2 Tax=Rhodobacteraceae RepID=B9NUS5_9RHOB Length = 224 Score = 207 bits (527), Expect = 2e-52, Method: Composition-based stats. Identities = 81/188 (43%), Positives = 111/188 (59%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M+L LIAMP + DP F SVVY+C H +GAMG+IVNKP + L+I+ +LE+L I Sbjct: 41 MDLTGKLLIAMPGMGDPRFEHSVVYVCSHGDDGAMGLIVNKPSD-LRIKTLLEQLNI--- 96 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDK 119 P + ++ V GGP+ RGF+LH+ S++ISD MT + DVLE L + K Sbjct: 97 PCRIPVVGERLVQFGGPVEMSRGFVLHSADYEANLHSMQISDEFSMTATLDVLEDLASGK 156 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P + ++ALGY+ W QLE EI N WLT A ++F P ++W A +GVD LT Sbjct: 157 GPLNSMLALGYSGWGPDQLEDEIAMNGWLTTEASSKLIFDVPDDEKWGAALATLGVDPLT 216 Query: 180 MPGVAGHA 187 + AG A Sbjct: 217 LSASAGRA 224 >UniRef50_C5SQD7 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SQD7_9CAUL Length = 216 Score = 207 bits (526), Expect = 2e-52, Method: Composition-based stats. Identities = 74/196 (37%), Positives = 106/196 (54%), Gaps = 13/196 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ L+AMP+L DP F SV+Y+C+H+ AMGI++N+P+ L ++E+L I Sbjct: 24 SLQGRLLVAMPSLDDPNFDHSVIYMCQHDPESAMGIVLNQPIGGLTFPRMMEELGIDITD 83 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHT----------PPSNFASSIRISDNTVMTTSRDV 111 P+ GGP+ +RGF+LH+ P ++ + D +T SRD+ Sbjct: 84 NRHVAT---PIYNGGPVQNERGFVLHSLDYFIDEVTLPLDIDPEALELRDGIGLTVSRDI 140 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 L L PS VL+ALGYA W GQLE EI DNAWL AP ++LF + W + K Sbjct: 141 LVDLARGAGPSRVLIALGYAGWGPGQLEAEIRDNAWLVAPCQADLLFSHDASALWSKTLK 200 Query: 172 LIGVDILTMPGVAGHA 187 L+G+ + AG A Sbjct: 201 LLGISPEHLSLNAGRA 216 >UniRef50_A8ZZX8 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZX8_DESOH Length = 184 Score = 206 bits (525), Expect = 2e-52, Method: Composition-based stats. Identities = 76/188 (40%), Positives = 111/188 (59%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M L+ FLIAMP L DP FR++VV ICEH+ +GA+G+IVN+ L + I E+LK+ Sbjct: 1 MELRGEFLIAMPMLTDPNFRQTVVCICEHSADGALGLIVNRIYPALTAKDIFEELKMKYV 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 P + PV GGP+ F+LH PP + I + +T ++D+L + + Sbjct: 61 PETGPL----PVYNGGPVHTGDLFVLHEPPFGWEGCRPIRPDLALTNTKDLLAAIAEGQG 116 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV-DILT 179 P L+ LGYA W QLE E+L+N+WLT P D ++F TP+A RW +A KL+ + D Sbjct: 117 PRRFLILLGYAGWGPDQLEAEVLENSWLTVPVDQRVIFDTPVARRWADAMKLMNIPDPAF 176 Query: 180 MPGVAGHA 187 + G++G A Sbjct: 177 LSGISGSA 184 >UniRef50_Q3IZ52 UPF0301 protein RHOS4_26140 n=6 Tax=Rhodobacteraceae RepID=Y2614_RHOS4 Length = 184 Score = 206 bits (525), Expect = 3e-52, Method: Composition-based stats. Identities = 80/188 (42%), Positives = 112/188 (59%), Gaps = 5/188 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 M+L LIAMP++ DP F RS+V IC H+ +GAMG++VNKP+E+L G+LE+L I Sbjct: 1 MDLSGSLLIAMPSMADPRFERSLVLICAHSPDGAMGLVVNKPVEDLSFAGMLEQLNIPRA 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTTSRDVLETLGTDK 119 P D V LGGP+ RGF+LH+P +++ +S MT + D+LE L + Sbjct: 61 PNGR----DIRVHLGGPMERGRGFVLHSPDYMSVGATMLVSGKFGMTATVDILEALARGQ 116 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 PS L+ALGY+ W GQLE E+ N WLTA A ++F +W + +G+D LT Sbjct: 117 GPSSALMALGYSGWGPGQLEAEVQRNDWLTAEAPSELVFSDDDPGKWTGMLRHMGIDPLT 176 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 177 LSSTAGHA 184 >UniRef50_Q0AMH8 Putative uncharacterized protein n=3 Tax=Hyphomonadaceae RepID=Q0AMH8_MARMM Length = 195 Score = 205 bits (523), Expect = 5e-52, Method: Composition-based stats. Identities = 73/186 (39%), Positives = 114/186 (61%), Gaps = 5/186 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIA PA+ DP F R+V+ +C+H GAMGII+NKP L++ + E+L++ Sbjct: 14 LGGKLLIATPAIGDPRFDRAVILVCDHTAEGAMGIIINKPAAGLRLPELFEQLEVDSSQP 73 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPS-NFASSIRISDNTVMTTSRDVLETLGTDKQP 121 D PV++GGP+ +DRGF+LHT N +++ I+D +T ++DVLE + +D P Sbjct: 74 A----PDGPVLVGGPVDKDRGFVLHTRDYANDEATLPINDRIGLTATKDVLEAMASDSPP 129 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGY+ W GQL+ E++ NAWL D ++F+T AD+W A + +G+ + Sbjct: 130 QRSLLALGYSGWAAGQLDDELVANAWLVCDMDEQLVFETDDADKWPRALECLGISPEHLS 189 Query: 182 GVAGHA 187 ++GHA Sbjct: 190 ALSGHA 195 >UniRef50_Q0C3C2 Putative uncharacterized protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C3C2_HYPNA Length = 188 Score = 205 bits (521), Expect = 7e-52, Method: Composition-based stats. Identities = 68/187 (36%), Positives = 110/187 (58%), Gaps = 5/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L LIAMP + DP F RSV+ +C H + AMGII+NKP++ + ++ I++++ I + Sbjct: 6 DLTGKLLIAMPGIGDPRFERSVILVCAHTPDFAMGIILNKPMDGIDLQEIIDQMDIPQDV 65 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFAS-SIRISDNTVMTTSRDVLETLGTDKQ 120 E + ++ GGP+A +RGF+LHT ++ + D MT +R++L ++ + Sbjct: 66 DLEGVA----ILEGGPVATERGFVLHTDDVICDGATMEVEDELCMTATREILASIASAAP 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P ++ALGYA W GQLEQE+ NAWL D +++F +WR A +GVD+ + Sbjct: 122 PRKFVMALGYAGWGAGQLEQELAQNAWLIGAPDSDLVFGDAYEHKWRHAMTRMGVDLSRL 181 Query: 181 PGVAGHA 187 AG+A Sbjct: 182 QSNAGNA 188 >UniRef50_A6VSP6 UPF0301 protein Mmwyl1_0539 n=2 Tax=Marinomonas RepID=Y539_MARMS Length = 188 Score = 204 bits (519), Expect = 1e-51, Method: Composition-based stats. Identities = 68/186 (36%), Positives = 108/186 (58%), Gaps = 4/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + ++HFLI+MP L DP F +V+Y+CEH GAMGII+N+P N+ + + L I Sbjct: 7 SFKNHFLISMPHLDDPHFEHTVIYLCEHTKAGAMGIIINRPS-NVDFTELADHLGIQIH- 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 +P+ GGP+ +RGFILHT +++++R++D ++ S + LE + P Sbjct: 65 --SPRLSSEPIYTGGPVEAERGFILHTTDKVWSNTLRVTDEVSLSASLEALEDIAQGNGP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + LG A W+ GQLE EI +N WL ADL++LF TP ++ A +++G+D+ + Sbjct: 123 DAFRITLGCAGWDAGQLEAEIANNDWLVCEADLDVLFHTPSDMQFTAATRVLGIDMTRLS 182 Query: 182 GVAGHA 187 GH Sbjct: 183 PDIGHG 188 >UniRef50_A1SUE8 Putative uncharacterized protein n=2 Tax=Psychromonas RepID=A1SUE8_PSYIN Length = 197 Score = 204 bits (519), Expect = 1e-51, Method: Composition-based stats. Identities = 77/185 (41%), Positives = 119/185 (64%), Gaps = 3/185 (1%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFLIAMP+L DP F+ SVVYICEH+ GAMG I+N P+ L ++ +L + Sbjct: 16 LKDHFLIAMPSLNDPYFKHSVVYICEHDEKGAMGFIINFPV-KLTLQELLNNVDSIDHYP 74 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + PV LGGPL +RGF+LH+P ++ + S +++D +++ S +L TLGT+ +P Sbjct: 75 EPPLL--NPVFLGGPLELERGFVLHSPVTDNSQSTKLNDQLMVSNSNAILSTLGTENEPE 132 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 + +V LGYASW GQLE+E+ DN W++ + +I+F TP+ RW E+ + +G+ + Sbjct: 133 EYIVTLGYASWSSGQLEKEMNDNHWISMESQNDIIFSTPVEQRWIESLQRLGIHPEQLST 192 Query: 183 VAGHA 187 GHA Sbjct: 193 EIGHA 197 >UniRef50_Q1YVF9 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YVF9_9GAMM Length = 192 Score = 203 bits (517), Expect = 2e-51, Method: Composition-based stats. Identities = 85/187 (45%), Positives = 121/187 (64%), Gaps = 6/187 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ HFL+AMP L+DP F SVVYICEHN++GAMG+I+N+ ++ + ++ I ++LK+ + Sbjct: 11 SLKDHFLLAMPGLEDPTFSDSVVYICEHNSDGAMGLIINQQMD-IPVKAIFDQLKLEYQD 69 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + GGP+ DRGFILH + S++ ISD +T SRD+L + K Sbjct: 70 ECGRPL----LFDGGPVQRDRGFILHANCEQQWESTLMISDQVCLTASRDILSDMALGKG 125 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P D LV LGY+SWE GQLE+E+ +N+WLT PA+ I+FKT A R AA IG+D+ + Sbjct: 126 PKDSLVTLGYSSWEAGQLERELGENSWLTIPAEAEIIFKTDCAKRASAAALSIGLDLRML 185 Query: 181 PGVAGHA 187 AGHA Sbjct: 186 SHQAGHA 192 >UniRef50_C6NYY5 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NYY5_9GAMM Length = 185 Score = 202 bits (513), Expect = 6e-51, Method: Composition-based stats. Identities = 80/186 (43%), Positives = 119/186 (63%), Gaps = 5/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L++H LIAMP L D +F RSV+ ICEH+ GAMG+++N+ L+ + + LE + ITP Sbjct: 5 SLKNHLLIAMPNLHDGMFDRSVIVICEHSPEGAMGLVINRLLD-ISLAKALEAVNITPPE 63 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 KPV GGP+ GFILH ++ S+ + + +T+S D+L + + P Sbjct: 64 DA----AQKPVFWGGPVQPQHGFILHEGAGDWQVSMAVGEGLFLTSSPDILMAIAEHRGP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGYA W +GQLEQE+ +N+WL P DL++LF+ P A+RW+ AA+ +GVD+ + Sbjct: 120 ERFLLALGYAGWGEGQLEQELSENSWLHGPIDLSVLFELPPAERWQAAARGLGVDMRLLS 179 Query: 182 GVAGHA 187 G AGHA Sbjct: 180 GAAGHA 185 >UniRef50_Q4ZZ67 UPF0301 protein Psyr_0485 n=31 Tax=Proteobacteria RepID=Y485_PSEU2 Length = 190 Score = 201 bits (512), Expect = 8e-51, Method: Composition-based stats. Identities = 73/185 (39%), Positives = 111/185 (60%), Gaps = 3/185 (1%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+HHFLIAMP + D F +++ YI EHN NGAMG+++N+P ++L + +LE+L+ PE Sbjct: 9 LKHHFLIAMPHMHDENFAQTLTYIVEHNANGAMGLVINRP-QSLTLADVLEQLR--PELP 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 D + GGP+ DRGF+LH F +++ + ++TS+DVL ++ P Sbjct: 66 APRHCQDIVIHTGGPVQTDRGFVLHPSGQTFQATVNLPGGISLSTSQDVLFSIADGYGPD 125 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ LGYA W+ GQL+ E+ DNAWLT D ILF R AA+ +G+++ + Sbjct: 126 QNVITLGYAGWDAGQLDAEMADNAWLTCSFDPAILFDVDSDQRLEAAARRLGINLNLIST 185 Query: 183 VAGHA 187 AGHA Sbjct: 186 QAGHA 190 >UniRef50_Q1MZ13 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MZ13_9GAMM Length = 185 Score = 201 bits (512), Expect = 9e-51, Method: Composition-based stats. Identities = 80/186 (43%), Positives = 119/186 (63%), Gaps = 6/186 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL+HH L+AMP+L DP F SV YIC+HN G+MG+++NKP+ +++ +L +L I + Sbjct: 6 NLKHHLLLAMPSLSDPYFGHSVCYICDHNEQGSMGLVLNKPM-GIELTDVLSELDIETDK 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 P++ GGP++ ++GF+L+ + ++ I+ + +TTS+D+L L P Sbjct: 65 PIH-----FPILQGGPVSPEQGFVLYRGSESELQNMVINGDIRLTTSKDILSQLALGSGP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 DV + LGYA WE GQLEQE++ NAWLT PAD +LF TP+ +AA IGVD+ + Sbjct: 120 DDVRICLGYAGWEAGQLEQELIQNAWLTVPADEELLFHTPMDQMLEKAASRIGVDMSLIS 179 Query: 182 GVAGHA 187 G AGHA Sbjct: 180 GEAGHA 185 >UniRef50_Q2P5W3 UPF0301 protein XOO1309 n=20 Tax=Xanthomonadaceae RepID=Y1309_XANOM Length = 188 Score = 201 bits (511), Expect = 1e-50, Method: Composition-based stats. Identities = 81/185 (43%), Positives = 117/185 (63%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L + LIA+PAL DP F RSV IC+H+ NGAMG++VN+P E + +L ++ I + Sbjct: 8 LANQLLIALPALSDPTFSRSVALICQHDENGAMGVLVNRPSEY-TLGEVLSQMGIDTDDE 66 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 ++ V+ GGP+ +RGF++H + SS+ + +TTSRD+LE + P Sbjct: 67 PLR---EQIVLSGGPVHPERGFVIHDDAREWDSSLEVGQGVFLTTSRDILEAMAAGNGPR 123 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +VLVALG A W GQLE E+ +N+WLTAP+D N+LF T + DRW+ AA IGVD+ + Sbjct: 124 NVLVALGCAGWGAGQLEFELGENSWLTAPSDANVLFATALEDRWQTAAGRIGVDLFRLTD 183 Query: 183 VAGHA 187 +GHA Sbjct: 184 YSGHA 188 >UniRef50_A4SVF0 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SVF0_POLSQ Length = 210 Score = 200 bits (510), Expect = 1e-50, Method: Composition-based stats. Identities = 77/191 (40%), Positives = 111/191 (58%), Gaps = 10/191 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L + FLIAMP + D F SV+Y+ EHN GAMG++VNKP E + + + +K+++ E Sbjct: 24 LANQFLIAMPGMVDANFAGSVIYLFEHNARGAMGLVVNKPTE-VDLATLFDKIELKLE-- 80 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSN--FASSIRISDNTVMTTSRDVLETLGTDKQ 120 + L++PV GGP+ +RGF+LH N ++SS+ I MTTS+DVLE + Sbjct: 81 -IAPLLEQPVYFGGPVQIERGFVLHESNKNLSYSSSLIIPGGLTMTTSKDVLEAVAIGNG 139 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADL----NILFKTPIADRWREAAKLIGVD 176 P L+ LGYA W GQLE+EI N W+ P I+F TP + R+ + +G D Sbjct: 140 PRKFLMTLGYAGWSAGQLEEEITLNGWMNVPLSREQMMEIIFNTPPSQRYEKTMNHLGFD 199 Query: 177 ILTMPGVAGHA 187 + + G AGHA Sbjct: 200 LSHLSGEAGHA 210 >UniRef50_B5JTR0 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JTR0_9GAMM Length = 187 Score = 200 bits (508), Expect = 2e-50, Method: Composition-based stats. Identities = 86/189 (45%), Positives = 116/189 (61%), Gaps = 7/189 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPE 60 +L +HFLIAMP L+DP F RSV IC H+ GA+GI + + ++ +E +L++L I Sbjct: 3 SLTNHFLIAMPDLEDPNFSRSVTLICHHSEDEGAIGITLTRATDH-SVEELLDQLDIQEA 61 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILH--TPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 + L P+ +GGP+ +DRGFILH + + ISD+ +T+S D+LE L Sbjct: 62 KLAATHAL--PLYIGGPVEQDRGFILHPNRKEYQWEGTETISDHLAITSSLDILEDLARG 119 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 K P + L+ALGYA W GQLEQEI DNAWL PAD I+F P RW AA+ +GVDI Sbjct: 120 KGPDNCLIALGYAGWSSGQLEQEITDNAWLHGPADPEIIFSLPAEQRWTAAAQSLGVDIR 179 Query: 179 TMPGVAGHA 187 + AGHA Sbjct: 180 LIHS-AGHA 187 >UniRef50_B9MF60 UPF0301 protein Dtpsy_2896 n=28 Tax=Proteobacteria RepID=Y2896_DIAST Length = 199 Score = 200 bits (508), Expect = 2e-50, Method: Composition-based stats. Identities = 86/196 (43%), Positives = 126/196 (64%), Gaps = 13/196 (6%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP ++D F RSVVY+CEH+ GA+G+I+NKP + +EG+ EK+ ++ Sbjct: 8 MNLTHHFLIAMPGVEDASFSRSVVYLCEHSERGALGLIINKPTP-ISLEGLFEKVDLSLG 66 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHT---------PPSNFASSIRISDNTVMTTSRDV 111 D ++ +PV GGP+ +RGF+LH S +AS++ I MTTS+DV Sbjct: 67 REDLTL---QPVFQGGPVQTERGFVLHEAMRGPQESEDESPYASTMTIPGGLEMTTSKDV 123 Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 LE L P VLV LGY++W +GQLE E+ +N+WLT AD++++F+TP+ +R+ A Sbjct: 124 LEALAHGAGPRRVLVTLGYSAWGEGQLESELAENSWLTVGADVSVIFETPVQERYDRALG 183 Query: 172 LIGVDILTMPGVAGHA 187 L+G+ + AGHA Sbjct: 184 LLGLQSWMLSPEAGHA 199 >UniRef50_B4REX9 Transcriptional regulator n=4 Tax=Caulobacteraceae RepID=B4REX9_PHEZH Length = 187 Score = 199 bits (506), Expect = 4e-50, Method: Composition-based stats. Identities = 69/186 (37%), Positives = 103/186 (55%), Gaps = 5/186 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIAMP + DP F R+++ +C H+ + AMG+ +N P+E L + +LE+L+I R Sbjct: 6 LSGQLLIAMPGISDPRFERTLILVCAHDAHHAMGLALNHPVEGLTVPDLLERLEIKSTIR 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ-P 121 V++GGP+ +RGF+LHT S+ + +T +R+VLE +G+ P Sbjct: 66 LPPDL----VLVGGPVERERGFVLHTDDYQGEFSLPVGGGVALTATREVLEAMGSSDGRP 121 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ALGYA W GQLE EI +N WLT AD ++F +W A +G+D + Sbjct: 122 RRSLLALGYAGWGAGQLEHEIRENVWLTCEADEALIFDADYDTKWARAVAKLGIDPTFLT 181 Query: 182 GVAGHA 187 AG A Sbjct: 182 AEAGRA 187 >UniRef50_B8KLD5 Putative uncharacterized protein n=3 Tax=Proteobacteria RepID=B8KLD5_9GAMM Length = 208 Score = 198 bits (504), Expect = 8e-50, Method: Composition-based stats. Identities = 83/186 (44%), Positives = 118/186 (63%), Gaps = 6/186 (3%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFL+AMP L +F S+ Y+CEH GAMG+++N+PL+ L + I + L I + Sbjct: 28 LRDHFLLAMPGLDAGLFSGSITYLCEHGEAGAMGLVINQPLD-LSLGEIFDHLDIAAD-- 84 Query: 63 DESIRLDKPVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + D+PV+ GGP+ D GF+LH + + SS+R++D +TTSRDVL+ + + P Sbjct: 85 --AHFRDQPVLAGGPVQIDHGFVLHPSGGKRWDSSLRVTDEVQLTTSRDVLKAIACGEGP 142 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D +V LGYA W GQLE+EI +N+WLT PAD I+F T I DR AA +G+D+ M Sbjct: 143 RDFVVTLGYAGWSAGQLEEEIANNSWLTLPADKRIIFHTAIEDRVAAAASALGIDMNLMS 202 Query: 182 GVAGHA 187 AGHA Sbjct: 203 AQAGHA 208 >UniRef50_A8PPF4 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PPF4_9COXI Length = 195 Score = 197 bits (501), Expect = 2e-49, Method: Composition-based stats. Identities = 74/187 (39%), Positives = 118/187 (63%), Gaps = 3/187 (1%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKI---EGILEKLKITPE 60 ++FL+AMP L D F RSVVYICEH GA+GI++N+PL++L + E + E + + Sbjct: 9 TNYFLVAMPILTDAYFSRSVVYICEHTEKGAVGIVINQPLQSLHVNLAEIVQEITESNLK 68 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + P++ GGP+ +RGF++H P + SS++++ +TTS+D+L + + Sbjct: 69 STKTTAGANFPILCGGPIHPERGFVIHAPSGAWQSSLKMNSEISVTTSKDILLAIAKQQG 128 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P + +LGYA+W GQ+EQEI++N WLT PA+ N+LF P RW +A +GVD+ + Sbjct: 129 PEKFIFSLGYANWIAGQMEQEIINNFWLTLPANPNLLFDVPFEQRWLKAMDYLGVDVTKL 188 Query: 181 PGVAGHA 187 + GHA Sbjct: 189 AYMGGHA 195 >UniRef50_A4BI12 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BI12_9GAMM Length = 183 Score = 197 bits (500), Expect = 2e-49, Method: Composition-based stats. Identities = 77/187 (41%), Positives = 116/187 (62%), Gaps = 4/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP + DP+F ++ Y+ +H+ GA+G+IVN+PL NL +E + E +++ Sbjct: 1 MNLNHHFLIAMPQMGDPVFSGTLTYLVQHDEQGALGLIVNRPL-NLNLEEVFESSELSG- 58 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 KPV GGP+A+++GFILH P S ++ V+TTSRD+LE + D+ Sbjct: 59 --YSPRTGSKPVYHGGPVAQEQGFILHPPTEQTWISSLSNEQLVLTTSRDMLEAIAQDEG 116 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P L LGY+ W GQLE+E+ +NAWLT A+ I+F+ +++ A +G+D+ T+ Sbjct: 117 PERFLFCLGYSGWSPGQLEEELKENAWLTVEANEAIIFQDDEVGKYQHALSDLGIDLATL 176 Query: 181 PGVAGHA 187 G G A Sbjct: 177 SGHGGLA 183 >UniRef50_C3K3J9 UPF0301 protein PFLU_5755 n=5 Tax=cellular organisms RepID=Y5755_PSEFS Length = 189 Score = 194 bits (494), Expect = 1e-48, Method: Composition-based stats. Identities = 76/185 (41%), Positives = 110/185 (59%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+H FLIAMP + DP F +++ YI EH GAMG+++N+P + L + ILE+L+ PE Sbjct: 9 LKHQFLIAMPHMADPNFAQTLTYIVEHTAKGAMGLVINRP-QELNLADILEQLR--PEVD 65 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + P+ +GGP+ DRGF+LH F +++ + + ++TS+DVL + P Sbjct: 66 PPARCQGVPIYIGGPVQTDRGFVLHPTGPKFQATVDL-EGVSLSTSQDVLFAIADGVGPE 124 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ LGYA WE GQLE E+ NAWLT P D ILF TP R AA + V++ + Sbjct: 125 QSVITLGYAGWEAGQLEAELASNAWLTCPFDAEILFNTPSELRLEAAAAKLRVNLNLLTS 184 Query: 183 VAGHA 187 AGHA Sbjct: 185 QAGHA 189 >UniRef50_A6WWH2 Putative uncharacterized protein n=2 Tax=Ochrobactrum RepID=A6WWH2_OCHA4 Length = 214 Score = 194 bits (493), Expect = 1e-48, Method: Composition-based stats. Identities = 66/188 (35%), Positives = 104/188 (55%), Gaps = 4/188 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FL+AMP + D F RSVVYIC H+ GAMG I+N+ L+ ++ +L ++ + E Sbjct: 28 LNGQFLLAMPGMSDERFARSVVYICAHSDEGAMGFIINQ-LQPVEFPDLLRQIGVIDEDE 86 Query: 63 D---ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 V GGP+ RGF+LH+ S++ +S+ +T + D+L + + Sbjct: 87 LIILPDRAQHMMVRNGGPVDRTRGFVLHSDDYMVDSTMPVSEEVCLTATVDILRAIYGGR 146 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 PS L+ALGY+ W GQ+E E+ +N WLT A L++LF + I ++ +G+D+ Sbjct: 147 GPSRALMALGYSGWAPGQIEVELAENGWLTCDAPLDMLFDSDIEGKYSRLMLHMGIDMSR 206 Query: 180 MPGVAGHA 187 + AGHA Sbjct: 207 LVSDAGHA 214 >UniRef50_A1KUG1 UPF0301 protein NMC1274 n=27 Tax=Neisseriaceae RepID=Y1274_NEIMF Length = 182 Score = 194 bits (493), Expect = 1e-48, Method: Composition-based stats. Identities = 82/187 (43%), Positives = 119/187 (63%), Gaps = 5/187 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL +HFL+AMP ++D F +SVVYIC+H+ +GA+GI +NKP + ++ I Sbjct: 1 MNLSNHFLVAMPDMEDAFFSQSVVYICKHDEDGALGIAINKPSP-ITMDMIFSATG---- 55 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 VM+GGP+ +RG+++HTP N+ SSI +SDN +T+SRDV+E + + Sbjct: 56 KNIPMRMQHDSVMMGGPVQVERGYVVHTPIGNWQSSIGVSDNIALTSSRDVIENISREGA 115 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 L+++GY+SW KGQLE+E+ DNAWLT PAD +ILF P R+ A +G+D L + Sbjct: 116 VDKALISIGYSSWGKGQLERELADNAWLTVPADEHILFDIPYEHRYAAAFAKLGIDPLAL 175 Query: 181 PGVAGHA 187 AGHA Sbjct: 176 FSGAGHA 182 >UniRef50_A1TKL7 UPF0301 protein Aave_0907 n=10 Tax=Comamonadaceae RepID=Y907_ACIAC Length = 213 Score = 193 bits (490), Expect = 3e-48, Method: Composition-based stats. Identities = 90/210 (42%), Positives = 125/210 (59%), Gaps = 27/210 (12%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL HHFLIAMP L+D F RSVVY+CEH+ GA+G+I+NKP + L ++G+ +K+ ++ Sbjct: 8 MNLTHHFLIAMPGLEDESFARSVVYLCEHSERGALGLIINKPSD-LSLKGLFDKVDLSLR 66 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP-----------------------PSNFASSI 97 D S+ PV GGP+ +RGF+LH S +AS++ Sbjct: 67 REDLSLE---PVFRGGPVQTERGFVLHEAMGPSSGKQAAGEGGAQAEGEGAEESAYASTM 123 Query: 98 RISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL 157 I MTTS+DVLE L T P VLV LGY+SW +GQLE E+ +N+WLT ADL+++ Sbjct: 124 SIPGGLEMTTSKDVLEALSTGAGPRRVLVTLGYSSWGEGQLESELAENSWLTVGADLSVI 183 Query: 158 FKTPIADRWREAAKLIGVDILTMPGVAGHA 187 F TP+ R+ A L+G+ + AGHA Sbjct: 184 FDTPVGQRYDRALALLGLQSWMLSPEAGHA 213 >UniRef50_C0QFZ0 UPF0301 protein HRM2_24640 n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=Y2464_DESAH Length = 189 Score = 188 bits (479), Expect = 5e-47, Method: Composition-based stats. Identities = 69/185 (37%), Positives = 102/185 (55%), Gaps = 4/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ HFL+A+P L DP F ++V ICEHN GA+G I+N+ L + + E LKIT Sbjct: 9 LKGHFLMAIPGLPDPNFAQTVTCICEHNKTGALGFIINRIHPLLTGQELFEDLKITCNQA 68 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + I + LGGP+ F+LH PP ++ ++I+D ++ +RD+LE + + P Sbjct: 69 IDKIA----IHLGGPVQPSGVFVLHGPPFDWHGCLKINDWLGLSNTRDILEAVARQEGPE 124 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 + +V LG A W QL+ EI DNAWLT P ILFKT + +W +G+ Sbjct: 125 NFIVLLGCAGWGPLQLDNEINDNAWLTIPVSQEILFKTDVKLKWEMTMMQMGIVSDNHSD 184 Query: 183 VAGHA 187 +G A Sbjct: 185 NSGKA 189 >UniRef50_A9KDE7 UPF0301 protein CBUD_2193 n=7 Tax=Coxiella burnetii RepID=Y2193_COXBN Length = 181 Score = 188 bits (477), Expect = 1e-46, Method: Composition-based stats. Identities = 77/185 (41%), Positives = 115/185 (62%), Gaps = 10/185 (5%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L +HFL+AMP L D F ++V+Y+ +H+ GA+GII+N+PL L + +LE L I Sbjct: 7 LSNHFLVAMPQLNDFTFTKAVIYVSQHDAKGALGIIINRPL-ALTLGKVLEHLNI---EI 62 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + PV++GGP+ ++ GFI++ S + +++ S+D+L+ + +K P Sbjct: 63 AQPQIANHPVLMGGPIGQEHGFIVYEQESPQGA------EILLSASKDMLDDIAKNKGPD 116 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 D L+ LGYA WE GQLE EI N WL P + ILF+TP+ RW++AA LIGVDI + G Sbjct: 117 DFLITLGYAGWEAGQLENEIARNDWLVVPFNRKILFETPLKSRWQKAAALIGVDINQLSG 176 Query: 183 VAGHA 187 GHA Sbjct: 177 QIGHA 181 >UniRef50_Q31EK4 UPF0301 protein Tcr_1827 n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Y1827_THICR Length = 188 Score = 187 bits (475), Expect = 2e-46, Method: Composition-based stats. Identities = 71/185 (38%), Positives = 109/185 (58%), Gaps = 3/185 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+HHFLIAMP L + F ++V+YI E N +G MG+++N NL + +L+ ++T E Sbjct: 6 SLEHHFLIAMPNLTESWFDKTVIYIVEDNEHGTMGLVIN-LEHNLTVPELLDHFELTVEA 64 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + D+PV++GGP+ + GFILH P + S+ + DN MT S D L+ + P Sbjct: 65 PEN--YADQPVLMGGPVDLEHGFILHEPQGTWQKSLPLRDNLAMTVSEDFLKAMADGTAP 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++V LG++ WEKGQL EI N WLT P + +LF P +W+ A +G+ ++ Sbjct: 123 EKIVVCLGFSGWEKGQLNDEIQANNWLTIPYNEALLFDVPNDQKWQVALNTLGISPESLS 182 Query: 182 GVAGH 186 AGH Sbjct: 183 MDAGH 187 >UniRef50_A5WBR3 UPF0301 protein PsycPRwf_0144 n=21 Tax=Moraxellaceae RepID=Y144_PSYWF Length = 188 Score = 186 bits (473), Expect = 3e-46, Method: Composition-based stats. Identities = 78/187 (41%), Positives = 118/187 (63%), Gaps = 4/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL HHFLIA P++ D F +S+VYIC H+ +G +G++VN+P+ + ++ +L+ L I E Sbjct: 5 NLTHHFLIAAPSMPDERFAQSLVYICRHDRHGVLGLVVNRPIFDTQVGHLLDNLDI--EV 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD-KQ 120 D S+ D P+ GGP+ + GF+LHT +ASS IS+N +TTS+D+L+ + Sbjct: 63 TDTSVMYDTPLD-GGPVYPEVGFVLHTGQPTWASSFPISENVCITTSKDILQNIAAGSAG 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + LG+ASW +GQLE+EI WL +P DL++LF+ P +RWR AA+ IGV + + Sbjct: 122 IGHYHLCLGHASWHEGQLEKEISQGDWLVSPGDLSLLFEIPFEERWRHAAEKIGVHLDFL 181 Query: 181 PGVAGHA 187 G A Sbjct: 182 SDEVGRA 188 >UniRef50_Q1NQW6 Putative uncharacterized protein n=2 Tax=Deltaproteobacteria RepID=Q1NQW6_9DELT Length = 201 Score = 185 bits (469), Expect = 9e-46, Method: Composition-based stats. Identities = 65/186 (34%), Positives = 102/186 (54%), Gaps = 3/186 (1%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ +FLIA P + DP F+ +V+ +C HN GAMG+++N+P+ ++++E I I P Sbjct: 19 SLQGYFLIATPQMSDPRFQETVILLCAHNEEGAMGLVINQPIRDVELEDIFHNAGIPLPP 78 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + V LGGP+ FI+++ + + ++ + ++ +L L + P Sbjct: 79 GAGPLGS---VYLGGPVETGNVFIVYSAEYEVVNHLAVTPSISLSRDPQLLYDLAAGRGP 135 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 LV+LGYA W GQLE E+ + WL PA I+F TP +WR AA++ GVDI Sbjct: 136 RHYLVSLGYAGWGAGQLEAELSVDGWLALPAKDEIIFNTPNQHKWRRAAQIHGVDIGLFG 195 Query: 182 GVAGHA 187 V G A Sbjct: 196 AVVGSA 201 >UniRef50_Q2GAJ3 UPF0301 protein Saro_0683 n=4 Tax=Sphingomonadaceae RepID=Y683_NOVAD Length = 186 Score = 184 bits (467), Expect = 1e-45, Method: Composition-based stats. Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 5/185 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+AMP + DP F +V+ +C H+ +GA+GI V E + + G+LE + I P Sbjct: 7 LGGRLLLAMPGMGDPRFDHAVIAMCVHDEHGALGIGVGHVREGITLHGLLEDVGIDP--- 63 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + D PV+ GGP+ RGF+LH+ S+ ++ ++ S D+L + + PS Sbjct: 64 --GLAPDMPVLNGGPVETARGFVLHSDDWGGEGSVTVNGLCCLSASLDILRAIAEGRGPS 121 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 ++ALGYA W GQLE E+ + W A ILF+TP RW +A K G+D + G Sbjct: 122 RFVIALGYAGWGGGQLEGEMRRHGWYAAQGRPEILFETPTGRRWTQAWKREGIDPAHLVG 181 Query: 183 VAGHA 187 G A Sbjct: 182 QTGSA 186 >UniRef50_A0L5K4 UPF0301 protein Mmc1_0726 n=1 Tax=Magnetococcus sp. MC-1 RepID=Y726_MAGSM Length = 186 Score = 184 bits (467), Expect = 1e-45, Method: Composition-based stats. Identities = 63/185 (34%), Positives = 102/185 (55%), Gaps = 7/185 (3%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FLIA+P+L DP F R+V+Y+C HN +GA+G+++N+PL+ + + L++ + Sbjct: 6 LAGKFLIAVPSLADPFFERTVLYLCAHNEDGALGLVINQPLD-TTMSQMAGYLELDWQRP 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 V +GGP++ ++GF+L + + + D+ M T+ D++ +G Sbjct: 65 GVD-----RVYMGGPVSPEQGFVLFEQALDLPGIMMLPDDLYMGTNPDIIRLMGRAGAQE 119 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 L ALGYA WE GQLE E+ +N+WL A +ILF A RW A + +G+D + Sbjct: 120 RFLFALGYAGWEAGQLEHELQENSWLVCDAQRSILFDMGYAQRWEAAIRSMGIDPALLV- 178 Query: 183 VAGHA 187 A H Sbjct: 179 DASHG 183 >UniRef50_Q0I1B4 UPF0301 protein HS_0009 n=26 Tax=Pasteurellaceae RepID=Y009_HAES1 Length = 187 Score = 183 bits (466), Expect = 2e-45, Method: Composition-based stats. Identities = 81/184 (44%), Positives = 111/184 (60%), Gaps = 4/184 (2%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNLQ HFLIAMP L+D F+RSVVYICE+N G+MG+++ + + L I + K+ Sbjct: 1 MNLQDHFLIAMPHLEDENFQRSVVYICENNEQGSMGLVLTQATD-LSIAELCAKMNFMM- 58 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP-PSNFASSIRISDNTVMTTSRDVLETLGTDK 119 DE DK V+LGGP+ + GFILH F S +++D +TTS D++ T GT + Sbjct: 59 -ADEREYSDKLVLLGGPVNLEHGFILHKKTAQEFQHSYKVTDQIYLTTSADIINTFGTAQ 117 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 P LV LG A WE QLE EI +N WL PAD +ILF I++RW A +L+G++ + Sbjct: 118 SPEKYLVTLGCARWEPNQLENEIANNDWLVVPADEDILFDVDISERWFAANQLLGIEHVN 177 Query: 180 MPGV 183 Sbjct: 178 FSYQ 181 >UniRef50_A3MYV4 UPF0301 protein APL_0232 n=7 Tax=Pasteurellaceae RepID=Y232_ACTP2 Length = 186 Score = 183 bits (466), Expect = 2e-45, Method: Composition-based stats. Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 5/187 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NLQ FLIA P + D F R+V+YICEHN+NGAMG+++N P + L + ++ ++ Sbjct: 4 NLQGKFLIATPEIDDDYFDRTVIYICEHNSNGAMGLVINTPTD-LSVLELITRMDFQM-A 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTP-PSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + D+ V+ GGP+++DRGFI+HT F S R++DN ++TTS DVL++LG + Sbjct: 62 NQRNYHKDQMVLSGGPVSQDRGFIIHTKTEQEFLHSYRVTDNILLTTSGDVLDSLGKPEA 121 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P +V LG A+W+ QLEQEI N WL + A+ LF+T +RW EA +++G+ + Sbjct: 122 PEKFIVCLGCATWKPEQLEQEIARNYWLISEANDKTLFETGYLERWVEANEMLGISGVL- 180 Query: 181 PGVAGHA 187 AG A Sbjct: 181 -ARAGRA 186 >UniRef50_B3QT15 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QT15_CHLT3 Length = 187 Score = 182 bits (462), Expect = 5e-45, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 82/180 (45%), Gaps = 13/180 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LIA L DP F+RSVV +CEHN G G+I+NKPL+ + I +E ++ Sbjct: 10 RGILLIAGAQLIDPNFKRSVVLLCEHNEEGTFGLILNKPLD-INISEAIEDIEDW----- 63 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD--KQP 121 D + GGP+ + +LH +I + D + + + ++ P Sbjct: 64 -----DIALHAGGPVQPNTVHVLHRLGDEIEDAIEVVDGVYWGGNYETIRSMINTRHASP 118 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D LGY+ W GQL+QEI ++W A A N++F W A + G D + Sbjct: 119 DDFRFFLGYSGWGPGQLQQEIDQDSWYQAKATANVVFNPVYDRMWARALRAKGGDYAIIA 178 >UniRef50_A4A9E0 Protein containing DUF179 n=1 Tax=Congregibacter litoralis KT71 RepID=A4A9E0_9GAMM Length = 173 Score = 182 bits (461), Expect = 7e-45, Method: Composition-based stats. Identities = 81/178 (45%), Positives = 108/178 (60%), Gaps = 6/178 (3%) Query: 11 MPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDK 70 MP L +F S+ YICEH GAMGI++N+PL+ L + I + L+I PR D+ Sbjct: 1 MPGLDSGLFSGSITYICEHGEAGAMGIVINQPLD-LSLGEIFDHLEIDCAPR----FQDQ 55 Query: 71 PVMLGGPLAEDRGFILH-TPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALG 129 V+ GGP+ D GF+LH + SS+R++ +TTSRDVL + + P D VALG Sbjct: 56 VVLAGGPVQIDHGFVLHPRGEQTWDSSLRVTPEVQLTTSRDVLSAIAAGEGPKDYAVALG 115 Query: 130 YASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 YA W GQLE+EI +N+WLT PAD I+F T I DR AA +G+D+ M AGHA Sbjct: 116 YAGWSAGQLEEEIANNSWLTLPADKRIIFHTAIEDRVAAAAAALGIDMNLMSAEAGHA 173 >UniRef50_Q5FQY8 UPF0301 protein GOX1459 n=11 Tax=Acetobacteraceae RepID=Y1459_GLUOX Length = 187 Score = 180 bits (458), Expect = 2e-44, Method: Composition-based stats. Identities = 69/189 (36%), Positives = 105/189 (55%), Gaps = 6/189 (3%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITP 59 + L L+A PAL + F R+V+Y+C H+ +GAMG+IVN+ L ++ + +L I P Sbjct: 3 LGLTGKLLVAAPALAETFFERTVIYLCAHSEQDGAMGLIVNRRLSQPGLDDLFAQLGIEP 62 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 P + I V +GGP+ RGF+LH+ S+ + +T +T S D+L + Sbjct: 63 SPPERRIG----VCMGGPVEHARGFVLHSADWAGEGSLDVDGHTTLTASLDILREIAAGH 118 Query: 120 QPSDVLVALGYASWEKGQLEQEILDN-AWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 P ++ALG+A+W GQLE+EIL + +W APA I+F T A +WR+A I D L Sbjct: 119 GPRQAVMALGHAAWAPGQLEEEILRDSSWFIAPATDEIVFGTDHAKKWRQALVAIDFDPL 178 Query: 179 TMPGVAGHA 187 + G A Sbjct: 179 LLSSSVGEA 187 >UniRef50_Q60BQ2 UPF0301 protein MCA0413 1 n=1 Tax=Methylococcus capsulatus RepID=Y413_METCA Length = 182 Score = 176 bits (447), Expect = 3e-43, Method: Composition-based stats. Identities = 58/171 (33%), Positives = 87/171 (50%), Gaps = 5/171 (2%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 FL+A P + IF SV+Y+ HN +GAMG+IVN+ + +LE + + + Sbjct: 14 SGQFLVAHPKMPANIFAHSVIYVVSHNADGAMGLIVNRLAGAGPLGKLLEAFGLASKAQR 73 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 + + LGGP+ +GF+LH+ AS+ + ++T DVLE + + P Sbjct: 74 -----EIKLYLGGPVGIGQGFVLHSDDYAGASTRALKKGLSLSTGLDVLEAIARGRGPRQ 128 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 V + GYA W GQL+ EI WL APAD +++F W EA K G Sbjct: 129 VRMLFGYAGWSPGQLDGEIARGDWLLAPADTSLIFSEEPDKVWEEALKHAG 179 >UniRef50_Q3B561 UPF0301 protein Plut_0637 n=11 Tax=Chlorobiaceae RepID=Y637_PELLD Length = 189 Score = 175 bits (443), Expect = 8e-43, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 79/180 (43%), Gaps = 13/180 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LIA L + F+R+V+ +CEHN G++G I+N+P+E E + ++ Sbjct: 12 AGKLLIASANLLESNFKRTVLMMCEHNPQGSLGFILNRPMEFQVREAVAGFDEV------ 65 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QP 121 D+P+ +GGP+ + LH S +I R+ L L +P Sbjct: 66 -----DEPLHMGGPVQSNTVHFLHMRGDLIDGSEQILPGLYWGGDREELGYLLNTGVLKP 120 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 S++ LGYA W GQLE E + +W TA A ++F W + G + + Sbjct: 121 SEIRFFLGYAGWSAGQLEAEFEEGSWYTADATPAMVFSGEYERMWSRTVRSKGGEYQLIA 180 >UniRef50_B0BVW3 UPF0301 protein RrIowa_0061 n=15 Tax=Rickettsia RepID=Y061_RICRO Length = 189 Score = 174 bits (442), Expect = 1e-42, Method: Composition-based stats. Identities = 47/187 (25%), Positives = 95/187 (50%), Gaps = 6/187 (3%) Query: 2 NLQHHFLIAMPA-LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 NL L+A P + I+ +S++Y+ H GA+G+I N+ + ++ ++ + + Sbjct: 8 NLSGKTLVATPHVITKGIYHKSLIYMLSHTEEGAIGLIFNRLVNHIDLKSFFK-----IK 62 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + + P+ LGGP+ ++GF LH+ N + ++ ++++ ++ E + K Sbjct: 63 NDEITTPVMVPIYLGGPVEHEKGFFLHSSDYNKNLLLDFHNDLAVSSNLEISEDIAFGKG 122 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 P + L +GY +W+ GQLE+E+ N WL + +F +W A K +G+D + Sbjct: 123 PKNSLFIVGYTAWKPGQLEEELETNLWLVMDCNKEFIFADNPESKWHNALKHLGIDEIHF 182 Query: 181 PGVAGHA 187 G+A Sbjct: 183 SSQIGNA 189 >UniRef50_Q2S591 UPF0301 protein SRU_0495 n=2 Tax=Rhodothermaceae RepID=Y495_SALRD Length = 188 Score = 173 bits (439), Expect = 2e-42, Method: Composition-based stats. Identities = 51/180 (28%), Positives = 84/180 (46%), Gaps = 15/180 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LI+ P +QDP FRRSVV +CEHN G G+I+N+ L+ + + +L++ Sbjct: 12 GTLLISAPMMQDPNFRRSVVLLCEHNDREGTFGLILNRELD-VSLGDVLDEY-------- 62 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ--P 121 + D P+ +GGP+ + LHT + + + + ++ L P Sbjct: 63 --VTYDPPLYMGGPVQRETLHYLHTRED-IPGGVALPGDMTWGGDFEAVQQLAKGGDAAP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 ++ LGYA W GQLE E+ + AW+ AP +F T WR + +G + + Sbjct: 120 DNLRFFLGYAGWGPGQLEGELGEEAWIPAPGAAEFVFDTDPDQLWRAILRRMGGEYAVLA 179 >UniRef50_Q6AL28 UPF0301 protein DP2218 n=1 Tax=Desulfotalea psychrophila RepID=Y2218_DESPS Length = 190 Score = 173 bits (438), Expect = 3e-42, Method: Composition-based stats. Identities = 61/186 (32%), Positives = 102/186 (54%), Gaps = 5/186 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L +FL++ + D F VVY+C HN+NGA+G+++NKP NL +L ++ + Sbjct: 10 SLAGYFLVSTLQMPDSRFAGQVVYVCSHNSNGALGLVINKPDCNLSFAQVLREMGMEVSR 69 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + V +GGP++ D F+L+ + I I+DN ++ +++LE + + Sbjct: 70 AELPS-----VYIGGPVSLDAAFVLYRSHPYEGNHIDITDNISLSREKELLELVVGENSS 124 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + L +GY WE GQLE E+ DN+WL P D ++F P ++W+ AA G+DI T Sbjct: 125 RNYLFLVGYVGWESGQLELELRDNSWLVVPGDEQVIFDLPDGEKWKAAAAYYGIDITTFN 184 Query: 182 GVAGHA 187 G+A Sbjct: 185 ENLGYA 190 >UniRef50_C8CIK8 Putative uncharacterized protein n=1 Tax=uncultured bacterium B7P37metaSE RepID=C8CIK8_9BACT Length = 200 Score = 172 bits (436), Expect = 6e-42, Method: Composition-based stats. Identities = 53/170 (31%), Positives = 83/170 (48%), Gaps = 4/170 (2%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L LIA P + DP F R+V+ + HN++GAM I++N+PL + IL+ Sbjct: 30 LTGQLLIAAPGMTDPRFDRTVLVMVRHNSDGAMAIVINRPLGERSMARILQAFGEKAPDD 89 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 ++ PV LGGP+ + +LH+ ++ I + +T S ++ + + P Sbjct: 90 SATV----PVYLGGPVQLEMSTVLHSAEYRRNGTLDIDGHVAVTASMEIYRDIAANTGPE 145 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 LV GYA W GQLE E+ N W TAP D+ ++F W A + Sbjct: 146 KSLVVFGYAGWAPGQLEGEMAQNVWFTAPLDVKLVFDADRDKVWDLAMER 195 >UniRef50_A6C880 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C880_9PLAN Length = 188 Score = 172 bits (435), Expect = 8e-42, Method: Composition-based stats. Identities = 54/189 (28%), Positives = 76/189 (40%), Gaps = 12/189 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ HFL+A L D F RSVV I EHN GA G+IVN+P + + Sbjct: 4 SLKGHFLVASRKLNDLNFYRSVVLIVEHNEQGATGLIVNRPSSFSITNALSRYFDMPK-- 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLET----LGT 117 L+ V +GGP+ + F LH S+ I + M +S ++ E + Sbjct: 62 ------LEDMVFMGGPVEPNGMFALHNAGDLEKSTEAIVPDLFMGSSPEIFEQVIWRISE 115 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 D + G A W QLE EI WL PA +F+ D W + Sbjct: 116 GDPHLDFRIFFGCAGWAPLQLESEINRMDWLNTPATTEDIFEIDPYDIWDTLLDRAMAER 175 Query: 178 LTMPGVAGH 186 +P H Sbjct: 176 RFLPQETAH 184 >UniRef50_Q254Z3 UPF0301 protein CF0373 n=7 Tax=Chlamydiales RepID=Y373_CHLFF Length = 189 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 46/179 (25%), Positives = 81/179 (45%), Gaps = 8/179 (4%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + L+A P +F RSV+ +CEH+ NG+ G+I+NK L + I K+T Sbjct: 11 KGSLLLASPDTDQGVFARSVILLCEHSLNGSFGLILNKTLGLELADDIFSFDKVT----- 65 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 + +GGPL ++ +LH+ ++ I + + L+ + Sbjct: 66 ---NNNIRFCMGGPLQANQMMLLHSCSEIPEQTLEICPSVYLGGDLSFLQEIAASDAGPM 122 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 + + GY+ W+ GQLE+E LD W APA + +F + W + K +G ++ Sbjct: 123 INLCFGYSGWQAGQLEREFLDGNWFLAPASYDYVFMDNPENLWSKILKDLGGKYASLST 181 >UniRef50_D2QR79 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=D2QR79_9SPHI Length = 186 Score = 169 bits (429), Expect = 3e-41, Method: Composition-based stats. Identities = 49/179 (27%), Positives = 87/179 (48%), Gaps = 14/179 (7%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 LIA P + D F RSVV +CEHN G G+++N+ + +++ ++E + Sbjct: 11 GDLLIAEPFMGDNNFERSVVLVCEHNAVGTFGLVLNQQTD-IQLGDVIEDI--------- 60 Query: 65 SIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT--DKQPS 122 D P+ +GGP+ ++ +H P +SI + D + D ++ Sbjct: 61 --HTDLPLFVGGPVQQNTLHFIHRRPDLIDNSICVVDGLYWSGDFDQIKRGVNLGTLTER 118 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 D+ +GY+ W +GQL+ E+L AW+ + + LF+TP + WRE K G + ++ Sbjct: 119 DIRFFIGYSGWNEGQLDSELLQKAWIISRTKADFLFETPTTEFWREVLKRKGGEYKSIA 177 >UniRef50_Q11U74 UPF0301 protein CHU_1773 n=2 Tax=Flexibacteraceae RepID=Y1773_CYTH3 Length = 182 Score = 166 bits (420), Expect = 4e-40, Method: Composition-based stats. Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 14/181 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LI+ P L D F RSVV +CEHN +GA G ++NK L I +LE+ Sbjct: 4 KGKILISEPYLGDSTFERSVVLLCEHNDSGAFGFMLNK-STTLTINSVLEE--------- 53 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTDK--Q 120 + ++ + LGGP+A+D F L S+ I D+ + L+TL + + Sbjct: 54 -QLTFEQNLFLGGPVAQDSLFFLLRQDRAILKDSVHIKDDLYWGGDFEHLKTLIQEGTLE 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + LGY+ W + QLE E+ ++W+ A + +F W+ + +G D + Sbjct: 113 LDNCRFFLGYSGWGEDQLEYELEKHSWIIADINSEDMFVKNPESMWQNVLRSMGGDYKVL 172 Query: 181 P 181 Sbjct: 173 S 173 >UniRef50_Q3KMF1 UPF0301 protein CTA_0231 n=9 Tax=Chlamydia RepID=Y231_CHLTA Length = 189 Score = 165 bits (418), Expect = 7e-40, Method: Composition-based stats. Identities = 53/179 (29%), Positives = 82/179 (45%), Gaps = 8/179 (4%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + L+A P + IF RSVV +CEH+ NG+ G+I+NK LE E I P D Sbjct: 11 KGSLLVASPDVNGGIFSRSVVLVCEHSLNGSFGLILNKILEIDLPEEIF--------PLD 62 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSD 123 +GGPL ++ +LHT P + SSI I + + + Sbjct: 63 HFDESKVRFCMGGPLQANQIMLLHTSPDSANSSIEICPSVFLGGDFSFAGEKEGRTRDDK 122 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +L+ GY+ W+ GQLE+E L+ W AP+ I+F W + + +G ++ Sbjct: 123 MLLCFGYSGWQGGQLEKEFLEGLWFLAPSSQEIIFTDAPERMWSDVLQHLGGRFASLST 181 >UniRef50_Q1DAS2 UPF0301 protein MXAN_2022 n=2 Tax=Cystobacterineae RepID=Y2022_MYXXD Length = 181 Score = 165 bits (418), Expect = 7e-40, Method: Composition-based stats. Identities = 56/184 (30%), Positives = 82/184 (44%), Gaps = 7/184 (3%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL L+AMP L DP F RSVV + EH+ +G+MG+++N+ L +L Sbjct: 3 NLAPGLLLAMPQLGDPNFYRSVVLMLEHSESGSMGLVINR-----GAPLTLGELARGQNL 57 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + R + V LGGP+ RGF+LH + ++ + D L L T+ P Sbjct: 58 GIAAGRKEHSVYLGGPVEPQRGFVLHDDTEQREKH-SVLPGLFLSVTLDALGPLLTNPNP 116 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + LGYA W QLE EI +WL A + + W + +GVD + Sbjct: 117 -RLRFCLGYAGWGPRQLESEIAAGSWLFTEATAEAVLGHEPSKLWDTTLRGMGVDPAMLV 175 Query: 182 GVAG 185 G Sbjct: 176 MGRG 179 >UniRef50_Q5NQN1 UPF0301 protein ZMO0349 n=3 Tax=Zymomonas mobilis RepID=Y349_ZYMMO Length = 188 Score = 163 bits (413), Expect = 3e-39, Method: Composition-based stats. Identities = 54/176 (30%), Positives = 92/176 (52%), Gaps = 5/176 (2%) Query: 12 PALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKP 71 P ++D F+++V+ +C N GA+G+ + + + ++ + ++ +L I P + D+P Sbjct: 18 PNMRDVEFQKAVIALCAFNEKGALGLNIGRIIPDVTLHSLMHQLGIQP-----GLVPDRP 72 Query: 72 VMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYA 131 V GGP RG +LH+ + S+ + + +T + DVL L + P LVALGYA Sbjct: 73 VHDGGPCEPQRGMVLHSRDWHSPDSMMVGQDWALTCTLDVLHALSRGEGPQHWLVALGYA 132 Query: 132 SWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 W GQL+QE+ W + D +LF P +RW++ + GVD + G A Sbjct: 133 GWGAGQLDQEMKQADWFLSKVDDQLLFSCPAENRWQQGYQQAGVDFYRLATKIGQA 188 >UniRef50_A7C130 Protein containing DUF179 n=1 Tax=Beggiatoa sp. PS RepID=A7C130_9GAMM Length = 158 Score = 162 bits (411), Expect = 4e-39, Method: Composition-based stats. Identities = 58/154 (37%), Positives = 95/154 (61%), Gaps = 4/154 (2%) Query: 34 AMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF 93 AMGI++N+PL+ + + +LE + I + + P+ GGP+ +RGF++H P + Sbjct: 9 AMGIVINRPLD-VDLGDVLEHMNIEANDQRAT---RMPIFDGGPVQRERGFVIHQPVGQW 64 Query: 94 ASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPAD 153 + + I++N + TSRD++ + + PS+ L+ALGYA W GQLEQE+ DNAWL+ PAD Sbjct: 65 DAMLSINNNLGIATSRDIISAIANGQGPSNALIALGYAGWTAGQLEQEMADNAWLSTPAD 124 Query: 154 LNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 +++F+T RW AA +G+D+ + GH Sbjct: 125 YSVIFQTTPEQRWHAAAASMGIDLTLLSSQVGHG 158 >UniRef50_A6LBX4 UPF0301 protein BDI_1431 n=6 Tax=Bacteroidales RepID=Y1431_PARD8 Length = 198 Score = 162 bits (411), Expect = 5e-39, Method: Composition-based stats. Identities = 52/181 (28%), Positives = 88/181 (48%), Gaps = 13/181 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 Q LIA P LQD F+RSVV + EH +G+MG ++NK + L + ++ Sbjct: 19 QGSILIAEPFLQDAYFQRSVVLLIEHTEHGSMGFVLNKKTD-LIVNSFFKEF-------- 69 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + P+ LGGP++ +R F +H+ N +++I+D + L+ + P Sbjct: 70 -AEFPEIPIYLGGPVSPNRLFFIHSLGDNIIPDALKINDYLYFDGDFNALKRYILNGHPI 128 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 D V LGY+ W +GQL EI N+W + + + W+++ +L+G D T Sbjct: 129 DGKVKFFLGYSGWTEGQLNHEIKRNSWAVSHITTDNILSADGEGYWKDSVELLGNDYKTW 188 Query: 181 P 181 Sbjct: 189 T 189 >UniRef50_A3VRH6 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VRH6_9PROT Length = 207 Score = 162 bits (409), Expect = 8e-39, Method: Composition-based stats. Identities = 51/182 (28%), Positives = 80/182 (43%), Gaps = 10/182 (5%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L +++MP L D F +SV+YIC H+ A G+I+NKP+E + + ++ Sbjct: 22 GLAGRLIVSMPQLNDGPFAQSVIYICTHDIEHAFGLILNKPIEGVVATEAVADME----- 76 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 +D P+ GGP RG ILH+ S I ++T+ + L LGT P Sbjct: 77 ---EKDIDLPLFFGGPCEPRRGIILHSDQFVLEDSETIGAGLAISTTNEALAALGTPLLP 133 Query: 122 SD-VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + + G+A W GQL+ E+ + WL + F W A IG+ + Sbjct: 134 AQSARLFTGHAGWGPGQLDDELRRHTWLDLETSTDFAFS-DPETMWDRAMAEIGIPFQNL 192 Query: 181 PG 182 Sbjct: 193 TA 194 >UniRef50_Q5LDK5 UPF0301 protein BF2109 n=20 Tax=Bacteroides RepID=Y2109_BACFN Length = 196 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 54/180 (30%), Positives = 82/180 (45%), Gaps = 13/180 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LI+ P L D F RSVV + +H G+MG+I+NKPL L + I+++ K Sbjct: 19 RGKILISEPFLHDVTFGRSVVLLVDHTEEGSMGLIINKPLP-LMLNDIIKEFKYI----- 72 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP-- 121 D P+ GGP+ D F LHT ++ I++ + D ++ P Sbjct: 73 ----EDIPLHKGGPIGTDTLFYLHT-LHEIPGTLPINNGLYLNGDFDAIKKYILQGNPIK 127 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + LGY+ WE QL QEI +N W+ + + L I W+EA +G T Sbjct: 128 GKIRFFLGYSGWECEQLIQEIKENTWIISKEENTYLMNEDIKGMWKEALGKLGSKYETWS 187 >UniRef50_A3HT39 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HT39_9SPHI Length = 189 Score = 160 bits (404), Expect = 3e-38, Method: Composition-based stats. Identities = 54/181 (29%), Positives = 86/181 (47%), Gaps = 14/181 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LI+ P LQD F RSVV +CEHN G+ G+++NKP LK+ ++E L Sbjct: 11 AGDLLISEPFLQDENFVRSVVMLCEHNEEGSFGLVINKPS-ILKLGELVESLD------- 62 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVL-ETLGTDK-QP 121 LD V +GGP+ ++ ++T SI+I + + L E L T P Sbjct: 63 ---FLDAEVFVGGPVEQNTLHYIYTGEKELERSIQIGTDLWWGGDYEQLVEKLKTGLINP 119 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLN-ILFKTPIADRWREAAKLIGVDILTM 180 V +GY+ W QLE+E+ D W+ +++ F+ + WR+ K +G + + Sbjct: 120 DRVRFFIGYSGWGLDQLEEELEDKTWIVCRTEVDPKTFEYTPEELWRKLLKNMGGEFKVI 179 Query: 181 P 181 Sbjct: 180 A 180 >UniRef50_A6G4Z9 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4Z9_9DELT Length = 210 Score = 160 bits (404), Expect = 3e-38, Method: Composition-based stats. Identities = 64/197 (32%), Positives = 94/197 (47%), Gaps = 19/197 (9%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L H L A+P L DP F+RSVV + EH+ GA+G+++N+ + N + + E L + Sbjct: 15 GLACHLLCAVPQLLDPNFKRSVVLMLEHDERGALGLVINRTM-NTSLSEVAEALDLEWCG 73 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT---D 118 D V +GGP+ RG+ LH + + + D +TTS + + G+ Sbjct: 74 D-----PDAQVRIGGPVEPVRGWFLHDQGAWDPDASSLVDGLWVTTSLEGVGAAGSVRFG 128 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAP----------ADLNILFKTPIADRWRE 168 + S+ L LGYA W GQLE EI +W+ P D LF TP W Sbjct: 129 SEESNFLFLLGYAGWSGGQLEGEIAAGSWVLVPLVDDDDPRVGVDPTFLFDTPPEHMWSL 188 Query: 169 AAKLIGVDILTMPGVAG 185 A + IGVD + G+ G Sbjct: 189 ALQSIGVDPQRLVGLQG 205 >UniRef50_C2G2S7 Transcriptional regulator n=3 Tax=Sphingobacteriaceae RepID=C2G2S7_9SPHI Length = 189 Score = 159 bits (402), Expect = 5e-38, Method: Composition-based stats. Identities = 43/182 (23%), Positives = 85/182 (46%), Gaps = 14/182 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + L++ P + D F+RSV+ + +HN +G +G I+N+ + L + + + ++ Sbjct: 9 KGSLLVSEPFMLDQNFKRSVILLADHNETDGTVGFILNQRTQ-LMLSDVFQDVE------ 61 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP- 121 D P+ LGGP+ + F +H S I D+ ++L L +++ Sbjct: 62 ---READFPIYLGGPVECEALFFIHKAYDLLLSGEHIIDDVYWGGDIELLLRLAKEEKIT 118 Query: 122 -SDVLVALGYASWEKGQLEQEILDNAWLT-APADLNILFKTPIADRWREAAKLIGVDILT 179 +V +GY+ W QL++EI +N+W + ++ F T D W++A +G Sbjct: 119 SDEVKFFIGYSGWSPSQLDREIKENSWAVDNKFNKDLTFITDGEDLWKQALISMGQKYAH 178 Query: 180 MP 181 + Sbjct: 179 VA 180 >UniRef50_C3Q021 UPF0301 protein n=8 Tax=Bacteroides RepID=C3Q021_9BACE Length = 196 Score = 159 bits (402), Expect = 5e-38, Method: Composition-based stats. Identities = 48/181 (26%), Positives = 81/181 (44%), Gaps = 13/181 (7%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLEN-LKIEGILEKLKITPEPR 62 Q LI+ P + D F R+VV + EHN G+MGII+NK + + ++ +L+ Sbjct: 17 QGSILISSPFMNDYHFTRAVVLLIEHNDEGSMGIIMNKDFRYHILLNDLIPELEF----- 71 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 PV GGP++ + F LHT + ++ + + + + ++ D +P Sbjct: 72 ----AQRVPVYKGGPMSRETIFFLHT-LKDLEGALPLGNGLYLNGDFNAVQQYILDGKPI 126 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + + GYA W+ GQL +EI +N+WL A L D W + +G Sbjct: 127 EGVIRFFAGYAGWDHGQLAKEIKENSWLIGKAGKETLLNQHFRDLWHTSLNEMGGKYAIW 186 Query: 181 P 181 Sbjct: 187 A 187 >UniRef50_A5FNN9 Putative uncharacterized protein n=18 Tax=Bacteroidetes RepID=A5FNN9_FLAJ1 Length = 209 Score = 158 bits (401), Expect = 7e-38, Method: Composition-based stats. Identities = 48/182 (26%), Positives = 84/182 (46%), Gaps = 16/182 (8%) Query: 4 QHHFLIAMPAL-QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + H LIA P++ D F RSV+ + +HN G++G I+NKPL+ I ++ ++ Sbjct: 31 KGHLLIAEPSIIGDLSFNRSVILLADHNKEGSIGFIINKPLKY-TINDLIPEID------ 83 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + GGP+ +D + +H P +S+ IS+ + + L D + Sbjct: 84 -----ANFKIYNGGPVEQDNLYFIHNIPDLIPNSVEISNGIYWGGDFESTKDLINDGSIN 138 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPAD-LNILFKTPIADRWREAAKLIGVDILT 179 + LGY W++ QLE E+ N+W+ A + N + W+E +G D L Sbjct: 139 KNNIRFFLGYTGWDENQLENEMQGNSWIIADNNYKNKIIGKSTTHFWKEQIIELGGDYLI 198 Query: 180 MP 181 Sbjct: 199 WS 200 >UniRef50_C7PSM3 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PSM3_CHIPD Length = 184 Score = 158 bits (399), Expect = 1e-37, Method: Composition-based stats. Identities = 47/185 (25%), Positives = 80/185 (43%), Gaps = 15/185 (8%) Query: 1 MNLQ-HHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKIT 58 ++L LIA P L+D F R+VV +CEH G+ G ++NK + E + E L Sbjct: 2 VSLSPGILLIADPFLKDQNFARTVVLLCEHQESRGSFGFVLNKVFDQSLNELVPEVL--- 58 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 + V GGP+ D +H P I D D + +L Sbjct: 59 --------INNIRVYYGGPVQIDTIHFIHQQPELIRGGFEIRDGVYWGGEFDQVVSLINS 110 Query: 119 K--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 + + +GY+ W GQLE E+ + +W+ + ++ ++F+ + W +A K +G + Sbjct: 111 GRLDLNKIKFFIGYSGWSSGQLENELNEKSWILSESNAPLIFEAKEQNIWPQALKNLGAN 170 Query: 177 ILTMP 181 M Sbjct: 171 FAIMA 175 >UniRef50_C1ZJG4 Predicted transcriptional regulator, COG1678 n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJG4_PLALI Length = 188 Score = 157 bits (398), Expect = 2e-37, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 72/174 (41%), Gaps = 12/174 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ L+A L+D F ++VV I E N NG+MG+++N+P L + E ++ Sbjct: 4 SLRGKLLVASKQLKDSNFYKTVVLIVEDNENGSMGLVLNRPSSILVNHALSEHFQLPESA 63 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 V +GGP+ FILH + + S + E + P Sbjct: 64 EL--------VHVGGPVEPAALFILHNLEELSHEGTGVIPGVWLGNSGEAFEDVLRSSDP 115 Query: 122 S----DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 V G A W GQLE E+ W APA +I+F + + + + Sbjct: 116 HQPGVRFRVFCGCAGWSPGQLEGELAHGDWHVAPAIKSIVFAEDPYEIYEQMLQ 169 >UniRef50_Q0BLI0 UPF0301 protein FTH_1193 n=18 Tax=Francisella RepID=Y1193_FRATO Length = 194 Score = 157 bits (397), Expect = 2e-37, Method: Composition-based stats. Identities = 58/182 (31%), Positives = 100/182 (54%), Gaps = 5/182 (2%) Query: 2 NLQHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 N + L+A P ++D I F +SVVY+C+++ +GAMG+I+NKPL + ++ + E+L I P Sbjct: 4 NHKSEILLATPLIKDDIVFTKSVVYLCQNDRHGAMGLIINKPLAD-TLKDVFEELHI-PH 61 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTP-PSNFASSIRISDNTVMTTSRDVLETLGTDK 119 L+ P+ +GGP++ + ILHT N+ S+I++ + +T S D+LE + + Sbjct: 62 TNTFKEILEYPLYMGGPISPHKIMILHTTNGRNYTSTIKLDEGLAITASIDILEDIANNI 121 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTA-PADLNILFKTPIADRWREAAKLIGVDIL 178 P L +GY+ W QL EI N W+ + ILF +W+ + G + Sbjct: 122 LPEYFLPVVGYSCWTANQLTDEIKSNDWIVTNKLNKKILFNHENKVKWQNHLEHAGYTLQ 181 Query: 179 TM 180 ++ Sbjct: 182 SL 183 >UniRef50_B9XJW1 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XJW1_9BACT Length = 186 Score = 157 bits (396), Expect = 3e-37, Method: Composition-based stats. Identities = 51/181 (28%), Positives = 82/181 (45%), Gaps = 11/181 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+ L+ L F+R+VV +C+H+ GA+G+++N+ N E +L L Sbjct: 8 LKGQLLLDSGQLSGSFFQRTVVLVCQHDAEGALGLVLNRDSGNKLGEMVLADL------- 60 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP- 121 D + LGGP+ L++ +S + N + S + L LG P Sbjct: 61 -PEQLTDNALYLGGPVQLSALSYLYSDTYLPEAS--VLPNVELGHSLETLVELGESFSPG 117 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + + GYA W GQLE+E+ AWLT PA ++++F T D W+ K G + Sbjct: 118 KRIKLFAGYAGWSPGQLEEEMKRKAWLTHPATVDLVFDTDPDDLWQYVLKQKGGMYRVLA 177 Query: 182 G 182 Sbjct: 178 Q 178 >UniRef50_A9RVW3 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RVW3_PHYPA Length = 324 Score = 156 bits (394), Expect = 5e-37, Method: Composition-based stats. Identities = 43/187 (22%), Positives = 72/187 (38%), Gaps = 15/187 (8%) Query: 5 HHFLIAMPAL---QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 LIA P F R V++I H+ G+ G+I+N+P + + + E + PE Sbjct: 145 GCLLIAHPNAFTESQQYFHRVVIFIFAHDAGGSAGVILNRPTQY-SLGQLDEFKDLMPE- 202 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ- 120 P+ GG + ++H P S I + M + + + + + + Sbjct: 203 -----LSSCPLYFGGDVGPQCTQVIHGIP-GLEDSREIMNGVYMGGTASIQDNIRSGQST 256 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVDI 177 P+D L +A W GQLEQE+ W A + K W E + +G Sbjct: 257 PNDYRWFLRFAGWGPGQLEQEVAAGVWYLASCSKRFVLKQCIQLPKPLWNEVMEHMGPPY 316 Query: 178 LTMPGVA 184 + A Sbjct: 317 SDIARKA 323 >UniRef50_D0LIU3 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LIU3_HALO1 Length = 198 Score = 155 bits (393), Expect = 6e-37, Method: Composition-based stats. Identities = 58/197 (29%), Positives = 97/197 (49%), Gaps = 20/197 (10%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+AMP L DP FRRSVV + EH+ G+ G++VN+P E L ++ + E L + + Sbjct: 6 LAPGLLLAMPHLLDPNFRRSVVLMVEHDDEGSFGLVVNQPTE-LSMDELYESLDLAWK-- 62 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASS--------------IRISDNTVMTTS 108 + V GGP+ +++H P + + S + + ++ + Sbjct: 63 ---GSSEAMVWRGGPVMPTHLWLVHAPLAGSSDSGTESALLGLGDGGTVAVGPELRVSGA 119 Query: 109 RDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWRE 168 L + ++ P+ + V LGYA W GQL QE+ AWL A ++F+TP + W Sbjct: 120 MPELIEMFGNEPPAQLRVLLGYAGWGGGQLAQEMSQGAWLHVDATPELIFETPAEEMWER 179 Query: 169 AAKLIGVDILTMPGVAG 185 A + +G++ T+ AG Sbjct: 180 AVRTLGINPETIIHGAG 196 >UniRef50_UPI0001C3133A protein of unknown function DUF179 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C3133A Length = 182 Score = 153 bits (387), Expect = 2e-36, Method: Composition-based stats. Identities = 47/181 (25%), Positives = 81/181 (44%), Gaps = 12/181 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L+ L+A PALQDP F R+VV I EHN +GAMG+++N+P E + Sbjct: 4 SLKGKLLLASPALQDPNFARTVVLIAEHNEDGAMGLVLNRPATTTVAE--------SAPE 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNT-VMTTSRDVLETLGTDKQ 120 +E + ++P+ +GGP+ +L A+ + + D+ ++ D + Sbjct: 56 LEELVEAEEPIYIGGPVQPSAVIVLAAFEEPAAAGLLVRDDVGFLSAEADFAT---SRDA 112 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + V G+A W GQL++E+ W+ P LF + W + G + Sbjct: 113 TRQLRVFAGHAGWGPGQLDEELEREDWIVEPPLPQELFSEDAEELWGDVLTRKGGAFALV 172 Query: 181 P 181 Sbjct: 173 A 173 >UniRef50_A7H7H6 UPF0301 protein Anae109_0457 n=4 Tax=Anaeromyxobacter RepID=Y457_ANADF Length = 199 Score = 152 bits (384), Expect = 5e-36, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 93/180 (51%), Gaps = 4/180 (2%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L FL+A PAL DP F S+V + EH+ GA+G +VN+P E + + Sbjct: 8 GLAPGFLVAAPALGDPNFAGSLVLMAEHHGEGALGFVVNRPGPVTVAEVLASVDEDLRRA 67 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTP---PSNFASSIRISDNTVMTTSRDVLETLGTD 118 + + R PV++GGP+ +R +IL P ++ ++ + + + SR++LE L Sbjct: 68 AEANGRAGAPVLVGGPVQPERLWILFRPGGIGADAEGAVPVGNGLSLGGSRELLEALVRA 127 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPAD-LNILFKTPIADRWREAAKLIGVDI 177 + L+ LGYA W Q+E+E+ AW+ + +++F P+ RW A + +G++ Sbjct: 128 PRGDPFLLLLGYAGWAPMQVEREVAAGAWVPLELEGSDLVFDVPLEQRWETAVRRLGLEP 187 >UniRef50_A3ZQK2 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZQK2_9PLAN Length = 184 Score = 151 bits (383), Expect = 9e-36, Method: Composition-based stats. Identities = 50/176 (28%), Positives = 81/176 (46%), Gaps = 11/176 (6%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ LIA P L DP F R+VV + +H+ GA+G+++ +P E + E Sbjct: 3 SLQGQLLIASPHLPDPNFLRTVVLMVQHDEEGALGLVLTRPTELTMAA-------MWREI 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E I + V LGGP+ G ++ I I ++ ++ +E L + Sbjct: 56 AGEEIADENLVFLGGPVQ---GPLMAIHSHAPCQEIEILPGVYFSSDKENIEKLVRE-DH 111 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDI 177 + +GY+ W + QLE E+ WL PA+ +F T + W++ IG DI Sbjct: 112 EPKRIFIGYSGWGEQQLEAEMEAGGWLLLPAEAAHVFTTDVERLWKDVTGKIGADI 167 >UniRef50_C6X421 Putative transcriptional regulator n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X421_FLAB3 Length = 183 Score = 150 bits (380), Expect = 2e-35, Method: Composition-based stats. Identities = 43/182 (23%), Positives = 71/182 (39%), Gaps = 14/182 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + + +I+ P + IF RSVV + +HN GA G+I+NK +N+ + Sbjct: 5 SYKGKIIISTPDISGDIFSRSVVLVIDHNAEGAFGLILNKKNQNMSARLL---------- 54 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD-VLETLGTDK- 119 V GGP+ D+ F ++ S I+D +T + V+ + + Sbjct: 55 --NIFGFRVDVYEGGPVENDKIFFINKGEKVTESFSEINDGFYLTEDIENVVAAIIEGRL 112 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 D+ V GY+ W GQLE EI W W+ + +G + L Sbjct: 113 SAEDIKVFSGYSGWAPGQLENEIRRKLWTVVDVYNLDYTLPTDHSLWKNIMQNLGGEFLL 172 Query: 180 MP 181 Sbjct: 173 WA 174 >UniRef50_A4C260 Putative transcriptional regulator n=1 Tax=Polaribacter irgensii 23-P RepID=A4C260_9FLAO Length = 185 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. Identities = 44/181 (24%), Positives = 78/181 (43%), Gaps = 15/181 (8%) Query: 4 QHHFLIAMPA-LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + L+A P+ L D F +++V + EH N ++G I+NKPL + +L +K + + Sbjct: 8 KGRLLVAEPSILNDTSFNKAIVLLTEHTANNSVGFILNKPLAY-NLNDLLPNIKCSFK-- 64 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--Q 120 + GGP+ +D + LH P + SI +S+ + L L + Sbjct: 65 ---------IYQGGPVEQDNLYFLHRVPQLLSKSIAVSNGVYWGGDFNQLTELLNNSVLD 115 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 S++ LGY+ W+K QL E+ + +W D + + W+E G Sbjct: 116 TSEIRFFLGYSGWDKEQLGAELKEKSWFVTENDFENILSNDEKNLWKEKLLQKGGAYKIW 175 Query: 181 P 181 Sbjct: 176 A 176 >UniRef50_C1E5G6 Predicted protein n=2 Tax=Micromonas RepID=C1E5G6_9CHLO Length = 369 Score = 150 bits (379), Expect = 3e-35, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 65/188 (34%), Gaps = 17/188 (9%) Query: 4 QHHFLIAMPA---LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L+A L F ++V+ + EH+ G+MG+I+N+P + + Sbjct: 184 KGCLLLAAADEFTLGQQYFHQAVILLLEHHDKGSMGVILNRPTQY--------NMGYVSG 235 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK- 119 D + + GG + + LH S + + E + ++ Sbjct: 236 QSDGP-FAENALYFGGDVGDGTVSFLHGSDK-VQGSAEVLPGVYLGGYDSACELVKKEEV 293 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVD 176 ++ Y W GQL++E W + K WRE +L G + Sbjct: 294 DANEFKFFARYCGWAPGQLKRECERGVWYPVACSKQLALKQVIQLPKPLWREILELCGGE 353 Query: 177 ILTMPGVA 184 + + A Sbjct: 354 LKSAAARA 361 >UniRef50_A4S673 Predicted protein (Fragment) n=2 Tax=Ostreococcus RepID=A4S673_OSTLU Length = 288 Score = 150 bits (378), Expect = 3e-35, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 66/190 (34%), Gaps = 19/190 (10%) Query: 4 QHHFLIAMPA---LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L+A + F ++V+ + EH+ NG+MG+I+N+P + + Sbjct: 85 KGCLLVAADHEFRMSQQYFHQAVILVLEHHENGSMGVILNRPTQY--------DMGYVSG 136 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + + + GG + + LH S+ + + E + D Sbjct: 137 EANGPFAKN-ALYFGGDVGDGTVSFLHGRED-VKGSVEVLPGVYLGGYDSACELVQQDGS 194 Query: 121 ---PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIG 174 + Y W GQLE E W A + K WRE ++L G Sbjct: 195 TCHADEFKFFARYCGWAPGQLESECERGVWFPVAAAKELSLKQVIQLPKPLWREISELCG 254 Query: 175 VDILTMPGVA 184 ++ M A Sbjct: 255 GELEEMARKA 264 >UniRef50_Q9LQ30 F14M2.10 protein n=6 Tax=rosids RepID=Q9LQ30_ARATH Length = 341 Score = 148 bits (375), Expect = 6e-35, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 67/187 (35%), Gaps = 15/187 (8%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICE----HNTNGAMGIIVNKPLENLKIEGILEKLKIT 58 L+A L F R+VV + H G G+++N+P + ++ +K T Sbjct: 154 TGCVLVATEKLDGYRTFARTVVLLLRAGTRHPQEGPFGVVINRP-----LHKNIKHMKST 208 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTT--SRDVLETLG 116 + + + GGPL + + + T S D L Sbjct: 209 KTE-LATTFSECSLYFGGPLEASMFLLKTGDKTKIPGFEEVMPGLNFGTRNSLDEAAVLV 267 Query: 117 TDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 +P + +GYA W+ QL +EI + W A +++ + W E +L+G Sbjct: 268 KKGVLKPQEFRFFVGYAGWQLDQLREEIESDYWHVAACSSDLICGASSENLWEEILQLMG 327 Query: 175 VDILTMP 181 + Sbjct: 328 GQYSELS 334 >UniRef50_A9GTQ2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GTQ2_SORC5 Length = 198 Score = 148 bits (375), Expect = 7e-35, Method: Composition-based stats. Identities = 64/190 (33%), Positives = 90/190 (47%), Gaps = 16/190 (8%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L FLIA P L DP F R+VV + H+ GA+G +VN+P + + +L + + Sbjct: 8 LAPGFLIASPPLGDPNFDRTVVLLAVHSEGGALGFVVNRPAP-MTLGELLSFAGYGNDLK 66 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPS---NFASSIRISDNTVMTTSRDVLETLGTDK 119 D + PV LGGP+ G+IL P+ I + +T+SR +TL D Sbjct: 67 DPA-----PVYLGGPVQPSSGWILCLDPALGAEETGVIPVGSRVRVTSSRSAFDTLAADA 121 Query: 120 -------QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 P V LGY+ W GQLE+EI AWL D ILF A RW +A L Sbjct: 122 VRGTAAADPRRRTVLLGYSGWGPGQLEREIAAGAWLPVSLDERILFDVEAAQRWEQAYAL 181 Query: 173 IGVDILTMPG 182 +G+ + + Sbjct: 182 LGLRPIEVMS 191 >UniRef50_B0SHS8 Transcriptional regulator n=6 Tax=Leptospira RepID=B0SHS8_LEPBA Length = 188 Score = 148 bits (374), Expect = 8e-35, Method: Composition-based stats. Identities = 50/180 (27%), Positives = 86/180 (47%), Gaps = 13/180 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + + LI+ ++ F +SVV + +H+ +GA G+++NKP + +E +++ L Sbjct: 7 STRGKLLISNSSVIQDFFHKSVVLMVDHDDDGAFGLVLNKPTDQ-TMESLIKNL------ 59 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDV-LETLGTDKQ 120 +++ +KPV GGP+ ILH + + M S D LE L +D+ Sbjct: 60 -PDTVHSNKPVYAGGPVDNLFVSILHNGKQTADPGVEVVPGIYMARSFDTMLEVLSSDQ- 117 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAP-ADLNILFKTPIADR-WREAAKLIGVDIL 178 V GYA W GQLE E +W+ + D +I+FK ++ W+EA + G Sbjct: 118 -IQFRVLQGYAGWSSGQLESEFDRLSWVVSDLVDDSIVFKEDESEVIWKEALRSKGGIYK 176 >UniRef50_A6E847 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6E847_9SPHI Length = 168 Score = 148 bits (373), Expect = 1e-34, Method: Composition-based stats. Identities = 39/169 (23%), Positives = 75/169 (44%), Gaps = 14/169 (8%) Query: 16 DPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLG 75 DP F+RSVV + +H G +G I+N+ + + + E + PV +G Sbjct: 2 DPNFKRSVVLLTDHQEEGTVGFILNQRSTLILSDLVPEFAGVAL-----------PVYIG 50 Query: 76 GPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QPSDVLVALGYASW 133 GP+A D +H ++ + + L+ L +P+++ +GY+ W Sbjct: 51 GPVATDTLHFIHRCYDRLNDGQEVAKGIYWGGNFEALKVLLLTGSIEPAEIKFFIGYSGW 110 Query: 134 EKGQLEQEILDNAWLTAP-ADLNILFKTPIADRWREAAKLIGVDILTMP 181 +GQL+ E+ +N W+ + +++F + WREA +G + Sbjct: 111 SEGQLKLELEENTWMVSDRFHADVVFSDNEEELWREAVINLGPRYAHIS 159 >UniRef50_C1D0N0 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=C1D0N0_DEIDV Length = 185 Score = 147 bits (372), Expect = 1e-34, Method: Composition-based stats. Identities = 54/183 (29%), Positives = 87/183 (47%), Gaps = 14/183 (7%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDES 65 FL+A P LQ +F +V+ + EH+ GAMG+IVN P E + + Sbjct: 16 TFLVASPHLQGEVFEGTVILLLEHDRKGAMGLIVNAPTPQTVAELMADAAG--------- 66 Query: 66 IRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVL 125 ++ LGGP+ G+ L+ P I++ D+ +++S +VL + + + Sbjct: 67 --QNRRAWLGGPVDPTLGWCLYHHPVGLDGEIKLVDDLHLSSSLEVLRAVMASD--QEYM 122 Query: 126 VALGYASWEKGQLEQEILDNAWLTAP-ADLNILFKTPIADRWREAAKLIGVDILTMPGVA 184 + LGYA W GQLE+E AW+ + +L++ P RW EA K +GV T+ Sbjct: 123 LILGYAGWTAGQLEEEARAGAWVWVEQSTPELLWEVPAPQRWAEALKRLGVTPGTLMPGG 182 Query: 185 GHA 187 A Sbjct: 183 AQA 185 >UniRef50_UPI0001745679 hypothetical protein VspiD_25265 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745679 Length = 204 Score = 147 bits (372), Expect = 1e-34, Method: Composition-based stats. Identities = 48/181 (26%), Positives = 80/181 (44%), Gaps = 15/181 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITPE 60 +L L+A PAL+DP F +V+ + HN +GA G I+N+PL+ ++ +L+ + Sbjct: 29 SLSGSLLVASPALRDPNFFHTVLLLASHNTEDGAFGYILNRPLDK-RVADLLDDKDL--- 84 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + PV LGGP+ ++ N+ S R M T + + + Sbjct: 85 ----GRLGEVPVFLGGPVGTNKLSFAA---FNWNSKKR---ELRMQTHLSTEQAMKELDK 134 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 V +GY+ W +GQLE E+ N+W+T I+ P D W +G + Sbjct: 135 GRSVRGFVGYSGWSEGQLENELEQNSWITCAPLSKIVTAQPSTDLWTTVLDDLGPYYKLL 194 Query: 181 P 181 Sbjct: 195 A 195 >UniRef50_Q1Q3L0 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3L0_9BACT Length = 188 Score = 145 bits (367), Expect = 6e-34, Method: Composition-based stats. Identities = 48/173 (27%), Positives = 73/173 (42%), Gaps = 11/173 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 + LIA P DP F ++VV ICEH+ G +G+I+NK L E Sbjct: 7 KGSILIANPQGTDPNFMQTVVLICEHSKRGTLGLILNKTLGKKGQE--------IFVSSA 58 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDKQPS 122 + DK + GGP+ + F LH N + ++I + + +++ + K S Sbjct: 59 NTKTKDKEIFFGGPVDTNNMFYLHGNFKNETHNCVKICEGVYLGSNQGCFNAFMSRKNVS 118 Query: 123 D--VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 D + LG A W GQLE EI W A ++F + W + I Sbjct: 119 DNIFRLYLGCACWSGGQLESEIETKCWTVGTATEKMVFYPSPDNIWWNILRSI 171 >UniRef50_B7G677 Predicted protein n=2 Tax=Bacillariophyta RepID=B7G677_PHATR Length = 393 Score = 142 bits (359), Expect = 4e-33, Method: Composition-based stats. Identities = 43/190 (22%), Positives = 80/190 (42%), Gaps = 14/190 (7%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LIA L +F ++VV I +H+ G+ GI++N+P++ ++ E+ + + + Sbjct: 196 GCVLIANEKLGG-VFHQTVVLIIDHHETTGSTGIVINRPMDGDLLKIASEQ-ESSLDLSL 253 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK-QPS 122 + V GGP+ D +LH S ++ + S +++ + T + P+ Sbjct: 254 KLAFSQARVTYGGPVLTDEFSVLH-GFGEVEGSRKLCPGVYIGGSEELMNEVRTLRFDPA 312 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILF---------KTPIADRWREAAKLI 173 L G+A W GQL +EI W TA A + + D W + + Sbjct: 313 HALFVKGHAGWVPGQLTREISKGVWYTAAASSDFILRYAGAPVTEDDNANDLWADILSCM 372 Query: 174 GVDILTMPGV 183 G + + G Sbjct: 373 GGNYAKIAGK 382 >UniRef50_D2R140 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R140_9PLAN Length = 183 Score = 142 bits (359), Expect = 5e-33, Method: Composition-based stats. Identities = 48/181 (26%), Positives = 81/181 (44%), Gaps = 14/181 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +LQ HFL A P L DP F R+VV + +H+ GA+G+++ +P++ ++ Sbjct: 3 SLQGHFLAASPHLGDPNFFRTVVLMIKHDAQGALGLVLTRPMQETVA-------ELWQRV 55 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 E+I V LGGP+ +H S + + D + + + + + Sbjct: 56 TAETIANTGSVHLGGPV-NGPLVAIHRMASAAEA--EVFDGVYFSAHSEQISRIVHQTK- 111 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 L+ GY+ W GQLE E+ WL APA ++F + D W + IG + + Sbjct: 112 KPYLLFAGYSGWSGGQLEAELEQGGWLIAPATTELVFSS-TDDLWERVVQSIG--LAVLS 168 Query: 182 G 182 Sbjct: 169 P 169 >UniRef50_Q47MA0 UPF0301 protein Tfu_2389 n=3 Tax=Actinomycetales RepID=Y2389_THEFY Length = 198 Score = 142 bits (359), Expect = 5e-33, Method: Composition-based stats. Identities = 51/191 (26%), Positives = 79/191 (41%), Gaps = 17/191 (8%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITP 59 ++L L+A P L+DP F RSVV++ + G +G+I+N+P E E + E + Sbjct: 8 LSLTGALLVATPLLEDPNFYRSVVFVIDDTPDEGTLGVILNRPSELGVGEVLAEWGEHVS 67 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF-ASSIRISDNTVMTTSRDVLETLGTD 118 +P + GGP+ +D G L P + D T + L T+ D Sbjct: 68 QPAV--------MFAGGPVGQDAGLALAVPDDGQRPLGWKSLDAMDAKTWPNGLGTVDLD 119 Query: 119 KQP-------SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 P + V GYA W GQL EI AW PA ++ +F W + Sbjct: 120 TPPQLVADALRQMRVFAGYAGWSAGQLRAEIDQGAWYVLPATVDDVFCADPRGLWSRVLR 179 Query: 172 LIGVDILTMPG 182 G ++ + Sbjct: 180 RQGGELAFVAT 190 >UniRef50_B9SPX9 Electron transporter, putative n=2 Tax=fabids RepID=B9SPX9_RICCO Length = 350 Score = 142 bits (358), Expect = 6e-33, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 67/192 (34%), Gaps = 21/192 (10%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICE----HNTNGAMGIIVNKPLENLKIEGILEKLKIT 58 L+A L F R+VV + H G G+++N+PL ++ +K Sbjct: 159 TGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLNKK-----IKHMK-P 212 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSN-FASSIRISDNTVMTT--SRDVLETL 115 + D + GGPL F+L T + S D L Sbjct: 213 TNKELATTFADCSLHFGGPLEA-SMFLLQTGEKEKLPGFEEVIPGLCFGARNSLDEAAAL 271 Query: 116 GTDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFK----TPIADRWREA 169 +P D +GYA W+ QL +EI + W A N++ + W E Sbjct: 272 VKKGVLKPQDFRFFVGYAGWQLDQLREEIESDYWYVASCSSNLICGNSSDSSSESLWEEI 331 Query: 170 AKLIGVDILTMP 181 +L+G + Sbjct: 332 LQLMGGHYSELS 343 >UniRef50_C0YNI4 Transcriptional regulator n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YNI4_9FLAO Length = 182 Score = 142 bits (358), Expect = 7e-33, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 68/182 (37%), Gaps = 14/182 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + + LI+ P + IF RSVV + EHN +GA G+I+NK + + + E Sbjct: 4 SYKGKILISTPDISGDIFSRSVVLVIEHNESGAFGLILNKKNSQMSSK-FKDFFDFKIE- 61 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRD--VLETLGTDK 119 V GGP+ D+ F + I+D +T + + L ++ Sbjct: 62 ----------VYDGGPVENDKVFFIVKGKRVTEIYTDITDEYYLTEDIERIINAVLSSEL 111 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 + + GY+ W QL+ E+ W W+ + +G + L Sbjct: 112 SIEHIKIFSGYSGWSPNQLDTEVQRKMWTVVDVYNLDYTLPNDQTLWKSIMQNLGGEFLL 171 Query: 180 MP 181 Sbjct: 172 WA 173 >UniRef50_C1RPZ7 Predicted transcriptional regulator, COG1678 n=12 Tax=Actinomycetales RepID=C1RPZ7_9CELL Length = 184 Score = 141 bits (357), Expect = 8e-33, Method: Composition-based stats. Identities = 41/181 (22%), Positives = 68/181 (37%), Gaps = 12/181 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P L+D FRR+VV + +H GA+G+++++PL+ + + + P Sbjct: 6 TGRLLVATPGLRDRSFRRAVVLVLDHTAEGALGVVLDRPLDIDARTVLPQWQEHLSTPG- 64 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHT--PPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + GGP+A D L +S + D L D Sbjct: 65 -------RLFQGGPVARDTALALADLPGADAPPGVQALSPRLGVV-DLDAPPALVVDA-V 115 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + V +GYA W GQL+ E+ W + F WR + D+ + Sbjct: 116 RALRVFVGYAGWGPGQLDDEVDVGGWFVVDHEPGDAFSADPRGLWRRVLRRQPGDLALLS 175 Query: 182 G 182 Sbjct: 176 T 176 >UniRef50_Q82D55 UPF0301 protein SAV_5129 n=12 Tax=Actinomycetales RepID=Y5129_STRAW Length = 193 Score = 141 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 42/187 (22%), Positives = 75/187 (40%), Gaps = 16/187 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PAL DP F R+VV + +H+ G++G+++N+P + + + EP Sbjct: 9 SLTGRLLVATPALADPNFDRAVVLLLDHDEEGSLGVVLNRPTPVDVSDILEGWADLAGEP 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA------SSIRISDNTVMTTSRDVLETL 115 V GGP++ D + P + R+ + E L Sbjct: 69 GV--------VFQGGPVSLDSALGVAVIPGGASVDGAPLGWRRVHGAIGLVDLEAPPELL 120 Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 + + GYA W GQLE E+++ AW ++ + WRE + Sbjct: 121 AKALG--SLRIFAGYAGWGPGQLEDELVEGAWYVVESEPGDVSSPSPERLWREVLRRQRN 178 Query: 176 DILTMPG 182 ++ + Sbjct: 179 ELAMVAT 185 >UniRef50_C7MS43 Predicted transcriptional regulator n=3 Tax=Actinomycetales RepID=C7MS43_SACVD Length = 198 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 47/186 (25%), Positives = 74/186 (39%), Gaps = 15/186 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P + DP FRR+VV++ +H G +G+++N+P E E + EPR Sbjct: 18 GTLLVAAPTMFDPNFRRTVVFVIDHRAEGTLGVVLNRPSEVAVREVLPRWGDHVAEPRS- 76 Query: 65 SIRLDKPVMLGGPLAEDRGFILH-----TPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 V +GGP+ + L + I + + E L + Sbjct: 77 -------VFVGGPVEKKTALCLAALRTGETAATVPGVIGVRGPVALVDLDSDPEMLAS-- 127 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 + + V GYA W+ GQL EI WL PA + + P D W + G+ Sbjct: 128 KVRGLRVFAGYAGWDGGQLASEIERGDWLIVPALPSDVMAGPTRDLWGHVLRRQGLPTAL 187 Query: 180 MPGVAG 185 + G Sbjct: 188 LATHPG 193 >UniRef50_C5YY61 Putative uncharacterized protein Sb09g020680 n=4 Tax=Andropogoneae RepID=C5YY61_SORBI Length = 355 Score = 138 bits (349), Expect = 7e-32, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 69/190 (36%), Gaps = 18/190 (9%) Query: 4 QHHFLIAMPALQD-PIFRRSVVYICEHNT----NGAMGIIVNKPLENLKIEGILEKLKIT 58 L+A AL D IF R+V++I + +G G+I+N+PL KI+ + + Sbjct: 164 AGCVLVATEALDDDSIFERTVIFILRLGSRGTFDGPFGVILNRPL-YTKIKHVNPTFQDQ 222 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT--TSRDVLETLG 116 P D P+ GGP+ + S + T + L Sbjct: 223 ATP-----FGDSPLFFGGPVDMSMFLVRTDDSSRLKGFEEVVPGICYGFRTDLEKAAVLM 277 Query: 117 TDKQPS--DVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL---FKTPIADRWREAAK 171 D+ +G+A+W+ QL EI W A ++ + W E + Sbjct: 278 KSGAIRTQDLRFYVGHAAWDYEQLLGEIRAGYWAVASCSTELISDALTGDPSCLWTEILQ 337 Query: 172 LIGVDILTMP 181 L+G + Sbjct: 338 LMGGQYSELS 347 >UniRef50_Q2BRE1 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BRE1_9GAMM Length = 151 Score = 138 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 52/155 (33%), Positives = 87/155 (56%), Gaps = 6/155 (3%) Query: 35 MGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAEDRGFILHT--PPSN 92 MG++VN+P+ + + + + L + P + + GGP+ +RG++LH P Sbjct: 1 MGLVVNRPV-GITLSDLCDHLNL---PCISNENQQDEIFSGGPVKPERGYVLHRSSDPFE 56 Query: 93 FASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPA 152 + SS +++ ++TS D +E + + L+ALG A W GQLEQEI DN WL+ PA Sbjct: 57 WPSSHCVAEEIFLSTSVDAIEAAAEGRFKHEYLIALGCAGWSPGQLEQEISDNVWLSCPA 116 Query: 153 DLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 + +ILF P DR + AA ++G+++ + GHA Sbjct: 117 NSDILFGIPAGDRLQAAASILGINLDLLTAHPGHA 151 >UniRef50_B4CVG9 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVG9_9BACT Length = 186 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 50/182 (27%), Positives = 78/182 (42%), Gaps = 14/182 (7%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHN-TNGAMGIIVNKPLENLKIEGILEKLKITP 59 ++L LIA P L DP FRRSV++I ++ G+ G+I+N+P E + Sbjct: 9 ISLAGSLLIAHPGLLDPNFRRSVLFISSNDAQEGSFGLIINRPASRTVAELLPN------ 62 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK 119 +D + PV LGGP+A D+ + + V+ + +++ Sbjct: 63 --KDLGMLSRVPVFLGGPVATDQLVFAAFQWHEETERMVCRPHLVIDEAAEIVHD----- 115 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 + + V +GYA W KGQLE E+ WL PA + L WRE G Sbjct: 116 ETTIVRAFVGYAGWSKGQLEGELAQRTWLVRPAARDTLDLERCPTLWREITSTFGPWFRL 175 Query: 180 MP 181 + Sbjct: 176 LA 177 >UniRef50_Q8S0Q9 Os01g0886000 protein n=6 Tax=Poaceae RepID=Q8S0Q9_ORYSJ Length = 354 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 67/192 (34%), Gaps = 18/192 (9%) Query: 4 QHHFLIAMPALQD-PIFRRSVVYICEHNT----NGAMGIIVNKPLENLKIEGILEKLKIT 58 L+A L F R+V+ + + +G G+I+N+PL K++ + + Sbjct: 163 SGCVLVAAEELDGNGTFERTVILLLRLGSRDAYDGPFGVILNRPL-YTKMKHVNPSFRNQ 221 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT--TSRDVLETLG 116 P D + GGP+ + T +S T + L Sbjct: 222 ATP-----FSDCSLFFGGPVDMSIFLMRTTDDRPIKGFEEVSPGVCFGFRTDLEKASALL 276 Query: 117 TDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL---FKTPIADRWREAAK 171 +P D+ +GY++W+ QL EI W ++ T + W E K Sbjct: 277 KSGAVKPEDLNFYVGYSAWDYDQLLSEIDQGYWHVTSCSSGLISDSLATDPSCLWTEILK 336 Query: 172 LIGVDILTMPGV 183 L+G + Sbjct: 337 LMGGQYAELSQK 348 >UniRef50_B1ZWF6 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZWF6_OPITP Length = 184 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 44/182 (24%), Positives = 74/182 (40%), Gaps = 15/182 (8%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PAL+DP FRR++V + HN GAMG+++N+P+ ++L Sbjct: 11 SLAGSLLLAHPALRDPNFRRAIVLMSVHNAEGAMGVVLNRPMG--------KRLGELNGE 62 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 P+ GGP+ ++ ++ P + L ++ Sbjct: 63 FALGSLASVPLFHGGPVQTEQLVLVAWQPQEDGF------RLHFGVEPERAMQLAAEEG- 115 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + + LGY+ W GQLE E+ WL A +L A WR +G + + Sbjct: 116 TQLRAFLGYSGWGGGQLEAELKQKTWLVADMPAGLLEGPQDAAMWRSVVSSLGEEWRLLA 175 Query: 182 GV 183 Sbjct: 176 QE 177 >UniRef50_B1MML1 UPF0301 protein MAB_4928c n=20 Tax=Corynebacterineae RepID=Y4928_MYCA9 Length = 208 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 45/185 (24%), Positives = 76/185 (41%), Gaps = 15/185 (8%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 LIA L +P FRRSV++I EHN G +G+++N+P E + + K+ +P Sbjct: 27 AGTLLIANTNLFEPTFRRSVIFIVEHNDGGTLGVVLNRPSETAVYNVLPQWAKLAGKP-- 84 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ--- 120 K + +GGP+ D L T + SI R + L + + Sbjct: 85 ------KTMFVGGPVKRDAALCLATLRAGV--SIDGVKGLRHVAGRMAMVDLDAEPEDIA 136 Query: 121 --PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 + V GY+ W GQLE E+ + W+ A + + D W + + + + Sbjct: 137 PLVEGIRVFAGYSGWTIGQLEGEVERDDWIVLSALPSDVLTDASEDLWAKVLRRQPLPLS 196 Query: 179 TMPGV 183 + Sbjct: 197 LLATH 201 >UniRef50_Q0FVR8 Putative uncharacterized protein (Fragment) n=1 Tax=Roseovarius sp. HTCC2601 RepID=Q0FVR8_9RHOB Length = 158 Score = 133 bits (335), Expect = 3e-30, Method: Composition-based stats. Identities = 52/134 (38%), Positives = 76/134 (56%), Gaps = 10/134 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L LIAMP + DP F SV+Y+C H+ GAMG+IVNKP ++ + +LE+L ITP P Sbjct: 9 DLTGKILIAMPGMGDPRFEHSVIYLCAHSEEGAMGLIVNKPSADVSMAALLEQLSITPSP 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFA-SSIRISDNTVMTTSRDVLETLGTDKQ 120 + V GGP+ RGF+LH+P ++++++D MT + DVLET+ Sbjct: 69 GLGPRQ----VHFGGPVEMGRGFVLHSPDYMSGLTTLQVNDGFSMTGTLDVLETIARGDG 124 Query: 121 PSDVLVALGYASWE 134 P A G+ W Sbjct: 125 P-----ATGWRCWA 133 >UniRef50_B5JR88 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JR88_9BACT Length = 187 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 40/179 (22%), Positives = 72/179 (40%), Gaps = 12/179 (6%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+A P L+DP F SVV + H +G++G+++NK E+L Sbjct: 12 LTGSLLLAHPHLKDPNFASSVVLLTRHEESGSLGVVLNKGTG--------ERLGQLSSEF 63 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + + PV LGGP+ +++ + + V ++ Sbjct: 64 ADCGLGEVPVYLGGPVNQNQIILAAWKLIPEKGQFQ----LYFGMEPLVAQSKMETDPDL 119 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + GY+ W +GQL E+ DNAW+ + D + +D WR + ++ + Sbjct: 120 EFRAFKGYSGWSEGQLVGELEDNAWVVSEVDAESISTKEGSDLWRHLIMEVNPELGLLS 178 >UniRef50_C4DLM5 Predicted transcriptional regulator, COG1678 n=5 Tax=Actinomycetales RepID=C4DLM5_9ACTO Length = 196 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 13/183 (7%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 +L L+A PALQDP F R+VV + H + GA+G+++N+ E E + + ++ EP Sbjct: 15 SLVGRLLVATPALQDPNFERTVVLLVSHESAGALGVVLNRATEVPVAEVLGDWSELAREP 74 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILH--TPPSNFASSIRISDNTVMTTSRDV-LETLGTD 118 + GGP+ + L S + + T V E L Sbjct: 75 AV--------LFEGGPVQPEAAIALGWMRSGVGEPSCFKPFAGRLGTLDLSVDPEPLADR 126 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 + + V GY+SW GQL+ E+ D AW+ + F + D W + G + Sbjct: 127 LEG--MRVFAGYSSWGAGQLDDELKDGAWMVFDSLPGDPFGSRPEDLWAMVWRRQGGLLA 184 Query: 179 TMP 181 + Sbjct: 185 AVA 187 >UniRef50_A8IT21 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IT21_CHLRE Length = 234 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 74/180 (41%), Gaps = 9/180 (5%) Query: 10 AMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKL--KITPEPRDESIR 67 A L+D + V+++ H +G++GII+N+P + + L ++ + + Sbjct: 52 APELLKDDRLFQLVIFLTTHGPDGSVGIILNRPT-GMVLGRKPGGLPLELGGPVPIQRVF 110 Query: 68 LDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS-DVLV 126 D V GG A+ I+H + +++ M E + + P+ D Sbjct: 111 QDNMVYCGGFTAQQVIHIMH--GHRLQNCVQVVPGVYMAGEVAATEAVSGGRLPAGDFKF 168 Query: 127 ALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVDILTMPGV 183 G +W G+LE ++ AW TA +++ K+ WRE +L+G + Sbjct: 169 FSGAITWAPGELEAQMDRGAWYTAACSRSLVLKSALQLPVPLWREVLQLMGGQYSEVASE 228 >UniRef50_C1B7P4 UPF0301 protein ROP_34500 n=13 Tax=Corynebacterineae RepID=Y3450_RHOOB Length = 201 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 77/187 (41%), Gaps = 21/187 (11%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L++ L +P FRR+V+Y+ EHN G++G+++N+P E + + + +T P Sbjct: 21 GSLLVSSTDLVEPAFRRTVIYVIEHNEAGSLGVVINRPSETAVHDVLPQWAPLTARPSA- 79 Query: 65 SIRLDKPVMLGGPLAEDRGFILHT-----PPSNFASSIRISDNTVMT---TSRDVLETLG 116 + +GGP+ D L T R+ VM + +V+ L Sbjct: 80 -------LYVGGPVKRDAALCLATLRTGAQADGVRGLRRVHGRVVMVDLDSDPEVVAPLV 132 Query: 117 TDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVD 176 V + GY+ W GQL+ E+ + W+ A + + D W + + + Sbjct: 133 EG-----VRIFAGYSGWTYGQLDSELQRDDWIVISALASDVLAPARVDVWAQVLRRQPLP 187 Query: 177 ILTMPGV 183 + + Sbjct: 188 LALLATH 194 >UniRef50_A8IRU3 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IRU3_CHLRE Length = 315 Score = 129 bits (325), Expect = 4e-29, Method: Composition-based stats. Identities = 43/186 (23%), Positives = 68/186 (36%), Gaps = 17/186 (9%) Query: 4 QHHFLIAMPAL---QDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 + L+A P L F R+ + + EH NG+ G+I+N+P ++ P Sbjct: 123 KGALLLAHPLLFQNSQTYFHRAAILLLEHGDNGSYGVILNRPSTYF--------IRDIPL 174 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD-K 119 R ++ D + +GG + +LH A ++ + M + + Sbjct: 175 KRPQTQFNDCRLYVGGDVGGGEVQVLHPHGD-LAGAVEVVKGVYMGGLDAGRDAIDAGKA 233 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIAD----RWREAAKLIGV 175 Q D YA W GQL E W TA A +L K W E L+G Sbjct: 234 QAQDFRWFSAYAGWAPGQLAMECKRGVWFTAAASPKLLLKEVEHGQGPSFWHELMTLLGG 293 Query: 176 DILTMP 181 D + Sbjct: 294 DYAELS 299 >UniRef50_A1SG68 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A1SG68_NOCSJ Length = 191 Score = 125 bits (315), Expect = 6e-28, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 72/182 (39%), Gaps = 11/182 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A PAL DP F +VV + + + GA+G+++N+P + ++ +L+ + Sbjct: 12 AGMLLVATPALLDPNFADTVVLLLDVDEQGALGVVLNRPS-AIPVDDVLDGWGDVAAEPE 70 Query: 64 ESIRLDKPVMLGGPLAED--RGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 + GGP+ L + R+ D + D L Sbjct: 71 -------VLFQGGPVGLQGALAVALLARADDVPVGFRVVDGRLGLVDLDTPLELVRG-GL 122 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMP 181 + V GYA W QL EI + +W P + +F++ +D WR+ + ++ Sbjct: 123 EGLRVFAGYAGWGADQLRDEIEEGSWYVVPGEARDVFRSDASDLWRDVLRRQPGELAWHS 182 Query: 182 GV 183 Sbjct: 183 TR 184 >UniRef50_C8XE74 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=C8XE74_NAKMY Length = 190 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 44/185 (23%), Positives = 69/185 (37%), Gaps = 11/185 (5%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P L+DP FRR+VVY+ H+ +G +G+I+N+P E + T P Sbjct: 9 AGMLLVATPGLRDPHFRRTVVYLVAHSVDGTVGVILNRPSETAVQNVLPGWASHTARP-- 66 Query: 64 ESIRLDKPVMLGGPLAEDRGFIL--HTPPSNFASSIRISDNTVMTTSRDVLETLGT-DKQ 120 V GGP+ L +N + T D+ T + Sbjct: 67 ------HAVFAGGPVQTSAAMCLGVCRIGTNPREVQGVVGVTGPVVLVDLDGDPATVTQS 120 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTM 180 + + G A W+ QL EI++ +W P + + P D W + M Sbjct: 121 LRGIRIYAGRAGWDAEQLVDEIIEGSWYVVPGLPDDVLAGPRTDLWFSVLRRQPYPQSLM 180 Query: 181 PGVAG 185 G Sbjct: 181 AYHPG 185 >UniRef50_B2UMM6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UMM6_AKKM8 Length = 187 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 38/173 (21%), Positives = 68/173 (39%), Gaps = 16/173 (9%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 NL H L+A P L F SV+++ +G I+N P + + + +I Sbjct: 13 NLAGHLLVAAPYLDGAGFHHSVIFLSRAEKEFVIGHILNHPS-GMNVGDVARHTEIP--- 68 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQP 121 P+ GGP+ ++ A+ IR D + + L + P Sbjct: 69 ---ESLYAVPIFKGGPVERNQLIF--------AAFIRTEDKLRVQFHLQEEQALEYLEDP 117 Query: 122 SDV-LVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 + +G++ W QL +E+ D AW +P +I +T + W A + + Sbjct: 118 RAILRAYVGHSGWTPPQLRRELNDRAWYVSPMVPDICLETDSSKVWAMAMRRL 170 >UniRef50_Q8NL65 UPF0301 protein Cgl3084/cg3414 n=5 Tax=Corynebacterium RepID=Y3084_CORGL Length = 189 Score = 122 bits (307), Expect = 5e-27, Method: Composition-based stats. Identities = 39/173 (22%), Positives = 70/173 (40%), Gaps = 15/173 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P + F RS+V I EH+ G+ ++ + + E + +T +P Sbjct: 9 GMLLVAAPDMASEDFERSIVLIIEHSPATTFGVNISSRSDVAVANVLPEWVDLTSKP--- 65 Query: 65 SIRLDKPVMLGGPLAEDRGFIL--HTPPSNFASSI---RISDNTVMTTSRDVLETLGTDK 119 + + +GGPL++ L P + +S ++++ V R E + D Sbjct: 66 -----QALYIGGPLSQQAVVGLGVTKPGVDIENSTSFNKLANRLVHVDLRSAPEDVADDL 120 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 + + GYA W GQL +EI W PA + + D W + + Sbjct: 121 EG--MRFFAGYAEWAPGQLNEEIEQGDWFVTPALPSDIIAPGRVDIWGDVMRR 171 >UniRef50_Q8FSW7 UPF0301 protein CE2927 n=11 Tax=Corynebacterium RepID=Y2927_COREF Length = 201 Score = 121 bits (305), Expect = 9e-27, Method: Composition-based stats. Identities = 41/184 (22%), Positives = 72/184 (39%), Gaps = 15/184 (8%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P L P F RSV+ + EH+ G+ + + + E ++T +P Sbjct: 21 GSLLVAAPDLASPEFSRSVILVIEHSHATTFGVNLASRSDLAVANVLPEWTELTAKP--- 77 Query: 65 SIRLDKPVMLGGPLAEDRGFIL--HTPPSNFASSIR---ISDNTVMTTSRDVLETLGTDK 119 + + +GGPL++ L P + SS + +++ V R + + D Sbjct: 78 -----QALYIGGPLSQQAVVGLGVTKPGVDIESSTKFNKLANRLVHVDLRVTPDEVRDDL 132 Query: 120 QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILT 179 + + GYA W GQL EI W APA + + D W + + + + Sbjct: 133 EG--MRFFAGYAEWAPGQLNDEIEQGDWYVAPALPSDVLAPGRVDVWGDVMRRQPMPLPL 190 Query: 180 MPGV 183 Sbjct: 191 YSTH 194 >UniRef50_Q7URG7 Probable transcriptional regulator n=1 Tax=Rhodopirellula baltica RepID=Q7URG7_RHOBA Length = 259 Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats. Identities = 49/232 (21%), Positives = 76/232 (32%), Gaps = 62/232 (26%) Query: 2 NLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 N FLIA P L D F RSVV I H GA G+++N+ + ++E + + Sbjct: 6 NCTGCFLIASPYLHDGNFFRSVVLIIRHTHEGAFGVVINR-AGPQRFGDVIEMSDPSWQA 64 Query: 62 RDESIRL----------------------------DKPVMLGGPLAEDRGFI-------- 85 + LGGP+ + Sbjct: 65 SSGPDMSSLLASQAESASLGDASDPNKTNESLQIHPDQIYLGGPVNGPVLALHNIAGIGD 124 Query: 86 --------------------LHTPPSNFASSIRI---SDNTVMTTSRDVLETLGTDKQPS 122 LH P+ S+ I +T+ D L L + Sbjct: 125 PCGVDIGEGAENDPAGSKTQLHDHPAEPWGSMSIQWADVPAWVTSDEDHLRLLARRDD-A 183 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 + +GY+ W QLE E+ + WL PAD + +F P + W + + G Sbjct: 184 KLRYVVGYSGWGPMQLESELEEGGWLITPADTDSIFG-PCEEVWEKLVRRCG 234 >UniRef50_Q6A827 Conserved protein, DUF179 n=2 Tax=Propionibacterium acnes RepID=Q6A827_PROAC Length = 192 Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 42/180 (23%), Positives = 65/180 (36%), Gaps = 10/180 (5%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A + + IF SVVY+ + +G +G+IVN+P + L P + Sbjct: 13 GDLLVASRQIDEGIFYESVVYLIDVALDGVLGVIVNQPCTAGTLHRQLPGWVHLATPPQD 72 Query: 65 SIRLDKPVMLGGPLAEDRGFILHT--PPSNFASSIRISDNTVMTTSRDVLETLGTDKQPS 122 + LGGP++ + L S R D L + Sbjct: 73 -------LFLGGPMSPNGAICLARVQRSSEEPPGWRRVQGLTGLLHLDTPTELVEGAF-T 124 Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 DV + GYA W GQLE E++ W+ A A +F + WR + Sbjct: 125 DVRIFAGYAEWVPGQLEAELIRGDWIRAVAHPEDIFSSEPRGLWRAVLHRQNGPAALLAT 184 >UniRef50_C7Q9Z6 Putative uncharacterized protein n=5 Tax=Actinomycetales RepID=C7Q9Z6_CATAD Length = 205 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 48/186 (25%), Positives = 75/186 (40%), Gaps = 17/186 (9%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L L+A L DP F R+VV + +H+ +G +G+++N+P +L +E +LE Sbjct: 21 LTGKLLVATTVLVDPNFDRTVVLVVDHDDDGTLGVVLNRP-GSLDVEDVLETWAPLAAEP 79 Query: 63 DESIRLDKPVMLGGPLAEDRGFILH--TPPSNFASSIRI-----SDNTVMTTSRDVLETL 115 V LGGP+A D + P + + S + E L Sbjct: 80 P-------TVFLGGPVALDSALGIACVRPEAAVPGEEPLGWRQFSGRLGLVDLDAPPEVL 132 Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGV 175 D + + GYA W GQL E+ AW +L +F T + WR + G Sbjct: 133 APDLTA--LRIFAGYAGWGPGQLAGELAQRAWYVVEPELADVFTTEPEELWRRVLRRQGG 190 Query: 176 DILTMP 181 I + Sbjct: 191 TIAMVA 196 >UniRef50_Q7G645 Os10g0330400 protein n=3 Tax=Oryza sativa RepID=Q7G645_ORYSJ Length = 296 Score = 116 bits (290), Expect = 4e-25, Method: Composition-based stats. Identities = 41/185 (22%), Positives = 64/185 (34%), Gaps = 19/185 (10%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + LIA L F R+VV + G +G+I+N+P + I E + E Sbjct: 112 KGCLLIATEKLDGSHIFERTVVLLLSAGVLGPVGVILNRP----SLMSIKEAQAVFAETD 167 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSI-------RISDNTVMTTSRDV---L 112 +P+ GGPL + F+L + + + T V Sbjct: 168 IAGAFSGRPLFFGGPLE-ECFFLLGPRAAAAGDVVGRTGLFDEVMPGVHYGTRESVGCAA 226 Query: 113 ETLGTDK-QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL-FKTPIA-DRWREA 169 E + D G+ WE+ QL E+ W A +L T + W E Sbjct: 227 ELVKRGVVGVRDFRFFDGFCGWEREQLRDEVRAGLWRVAACSPAVLGLATVVKGGLWEEV 286 Query: 170 AKLIG 174 L+G Sbjct: 287 QGLVG 291 >UniRef50_Q9LS71 Emb|CAB72194.1 n=4 Tax=rosids RepID=Q9LS71_ARATH Length = 317 Score = 116 bits (290), Expect = 5e-25, Method: Composition-based stats. Identities = 44/186 (23%), Positives = 68/186 (36%), Gaps = 23/186 (12%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 LIA L F ++V+ + +G +G+I+N+P E L + Sbjct: 133 TGCLLIATEKLDGVHIFEKTVILLLSVGPSGPIGVILNRPSLMSIKETKSTILDMAGT-- 190 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSI-------RISDNTVMTTSRDV---L 112 DK + GGPL + G L +P S + + ++ T V Sbjct: 191 ----FSDKRLFFGGPL--EEGLFLVSPRSGGDNEVGKSGVFRQVMKGLYYGTRESVGLAA 244 Query: 113 ETLGTDK-QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL---FKTPIADRWRE 168 E + + S++ GY WEK QL+ EIL W A ++ W E Sbjct: 245 EMVKRNLVGRSELRFFDGYCGWEKEQLKAEILGGYWTVAACSSTVVELGSAVQSHGLWDE 304 Query: 169 AAKLIG 174 LIG Sbjct: 305 VLGLIG 310 >UniRef50_C0AXX7 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXX7_9ENTR Length = 93 Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats. Identities = 48/93 (51%), Positives = 64/93 (68%) Query: 1 MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPE 60 MNL +HFLIAMP+L DP+F RSVVY+CEHN NGAMG+I+NKP+E++ +EG+L++L+I Sbjct: 1 MNLLNHFLIAMPSLSDPLFERSVVYVCEHNENGAMGLIINKPIEDISVEGVLDQLEIFST 60 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNF 93 RDE+I L K + P F L F Sbjct: 61 DRDEAISLQKTCDVRRPSCRRAWFYLTYSSVRF 93 >UniRef50_C7PBJ3 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PBJ3_CHIPD Length = 146 Score = 113 bits (283), Expect = 3e-24, Method: Composition-based stats. Identities = 37/146 (25%), Positives = 65/146 (44%), Gaps = 13/146 (8%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 F+ + L+ +F +V+YI E+N NGAMG IVN L +L+ R Sbjct: 3 AGIFINSTSLLEKSVFESTVIYITEYNENGAMGFIVNNR-----FPRKLNELEEFSHGR- 56 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDK--QP 121 D P+ GGP+ ++ F +H P + ++ DN + + Sbjct: 57 -----DFPLWEGGPVDKEHLFFIHQRPDLISGGEQVGDNIFLGGDFQAAVKHINEHTLTE 111 Query: 122 SDVLVALGYASWEKGQLEQEILDNAW 147 D+ + +GY W+ +L++EI + +W Sbjct: 112 QDIKIFIGYCGWDYKELDEEIDEGSW 137 >UniRef50_D0NN30 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NN30_PHYIN Length = 304 Score = 111 bits (279), Expect = 8e-24, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 68/187 (36%), Gaps = 39/187 (20%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 FL+A P LQ IF RSVV + EH G+ G IVNK Sbjct: 136 SGVFLLAHPLLQG-IFSRSVVILTEHKPEGSKGFIVNK---------------------- 172 Query: 64 ESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRIS-------DNTVMTTSRDVLETLG 116 V GGP+ +LH + S + + D Sbjct: 173 ------VTVRKGGPVFTRNAEVLHGRADFGGQRVATSNFPTANDPSLFVGVDLDTAARAI 226 Query: 117 TDKQPS--DVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIG 174 D+ DV+ G ++W GQL+ E+ +W+ A +++ AD W++ + +G Sbjct: 227 YDETAKQTDVVFMSGVSAWSPGQLDSELKQGSWVAVKAPVSLALNA-SADLWQDLMRTLG 285 Query: 175 VDILTMP 181 + M Sbjct: 286 GEYAEMS 292 >UniRef50_D0A6S5 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=D0A6S5_TRYBG Length = 475 Score = 111 bits (278), Expect = 1e-23, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 72/162 (44%), Gaps = 11/162 (6%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI---TPE 60 + L+A P L D FR +V+ + N + +++NKPLEN K + + + + Sbjct: 235 KAQLLLAHPQLYD-FFRYTVMIVVRVTPNESAALVLNKPLENDKGALMPVSMTMRLSSAH 293 Query: 61 PRDESIRLDKPVMLGGPLAED----RGFILHTPPSNFASSIRISDNTVMTTSRDVLETLG 116 P + VM+GGP++ +LH P +I +S + + S D L+ Sbjct: 294 PLFAKHLCNHTVMIGGPVSRGSFDSTMLLLHRIPD-VDDAIPLSHSLWIDGSYDTLQQKI 352 Query: 117 TD--KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNI 156 D P D++V G++ W QLE E+ W+ A + Sbjct: 353 EDGTADPKDIVVICGFSGWGVQQLEGELQSGTWVAASGSTDD 394 >UniRef50_Q54HU6 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54HU6_DICDI Length = 493 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 LI+ P+L + + VV I G +N P++ ++ ++ Sbjct: 278 GTILISHPSLGE-HLNKKVVLITHSIGGHHYGFFINTPIQTGAALSYID-FEMKTHYDRA 335 Query: 65 SIRL-----------------------------------DKPVMLGGPLAEDRGFILHTP 89 ++ D V+ G + I+H P Sbjct: 336 AVEANRSKNFSRLFIYALKRPIGPLKTLNWNLGGLQSITDHSVLSRGFDGLNCTQIVH-P 394 Query: 90 PSNFASSIRISDNTVMTTSRDVLETLGTDKQPS--DVLVALGYASWEKGQLEQEILDNAW 147 SN + S +I D + + +K+ +L+ +G ++W GQLE+EI + AW Sbjct: 395 YSNLSGSKKIRDGLYIGGKLKEVGDKIRNKEIDKNKLLMFVGCSTWNPGQLEKEIKEGAW 454 Query: 148 LTAPADLNILFKT-PIADRWREAAKLIGVDILTM 180 A + K + W EA + +G D + Sbjct: 455 FRADCSNETILKQLKPKNFWAEALESMGGDYSDL 488 >UniRef50_C5WYB1 Putative uncharacterized protein Sb01g018920 n=1 Tax=Sorghum bicolor RepID=C5WYB1_SORBI Length = 1193 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 55/153 (35%), Gaps = 15/153 (9%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEH-NTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 L A L + F + V I + G G+I+NK L + + ++ Sbjct: 1038 TGSILTATEKLGAAVPFDNAKVLIVSSGSHEGFHGLIINKRLSWGVFKDLDSSMERIKHA 1097 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT----TSRDVLETLGT 117 P+ GGP+ ++ + +++ TSR V Sbjct: 1098 ---------PLFYGGPVVVQGYHLVSLSRVAWEGYMQVIPGVYYGNIVATSRVVTRIKLG 1148 Query: 118 DKQPSDVLVALGYASWEKGQLEQEILDNAWLTA 150 ++ D+ +GY+ W QL E+ + AWL + Sbjct: 1149 EQSVEDLWFFVGYSGWGYSQLFDELSEGAWLVS 1181 >UniRef50_Q4CSL3 Putative uncharacterized protein n=2 Tax=Trypanosoma cruzi RepID=Q4CSL3_TRYCR Length = 523 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 15/162 (9%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT-----PE 60 H L+A P L + FR SV+ + N A I+NKPLEN EG+L ++ T Sbjct: 280 HMLLAHPQLYE-FFRYSVMIVVRSTPNEAAAFILNKPLEND--EGMLMQVNSTIRLNHVH 336 Query: 61 PRDESIRLDKPVMLGGPLAEDR----GFILHTPPSNFASSIRISDNTVMTTSRDVLETLG 116 P + VM+GGP++ +LH P +I +S + + + DVL+ Sbjct: 337 PILGKHLGNHTVMIGGPVSRGSFDSSILLLHRIPD-VEDAIPVSQSLWVDGNYDVLQKKL 395 Query: 117 TD--KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNI 156 D D++V G++ W GQL EI W+ A + Sbjct: 396 DDGTADAKDIVVICGFSGWGAGQLAGEISSGTWVVARGSADD 437 >UniRef50_B8C551 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C551_THAPS Length = 645 Score = 106 bits (265), Expect = 4e-22, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 63/175 (36%), Gaps = 29/175 (16%) Query: 13 ALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKP 71 L+ F ++V+ + EH+ N GII+N+P + + + + + +K Sbjct: 155 GLRQQYFHKAVILVLEHDENTFTKGIILNRPSDQMMDDDVNDGVK-------------WR 201 Query: 72 VMLGGPLA-----EDRGFILHT--PPSNFASSIRISDNTVMTTSRDVLETLGTD-KQPSD 123 V GG + LH+ + +S+ + T+ + + + D Sbjct: 202 VWFGGDVQGLDSLLPDIVCLHSLKSEAAKDASVTVVKGIQWTSFSNAKQLVKRGVASVED 261 Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDIL 178 + GYA W QL E+ +W D L K A + G+D Sbjct: 262 FWLFAGYAGWGPRQLSGELDRKSWYMCATDSQTLLK-------ELARQSYGIDPR 309 Score = 43.4 bits (101), Expect = 0.004, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 50/169 (29%), Gaps = 30/169 (17%) Query: 4 QHHFLIAMPA------LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKI 57 L A A LQ F +S+V + + +G+++N P Sbjct: 397 AGALLRASSAERSPFLLQKQEFHKSLVLVILEDDKATVGLMLNHPATK------------ 444 Query: 58 TPEPRDESIRLDKPVMLGGP--LAEDRGFI-LHTPPSNFASSIRIS------DNTVMTTS 108 E R + P+ GG + + LH + + ++ D + Sbjct: 445 GCEVRIGTETTTIPLRYGGDYAVKGASPLMWLHCSKKLRDAGVGVAFSENHKDGIYKCSQ 504 Query: 109 RDVLETLGTD-KQPSDVLVALGYASW--EKGQLEQEILDNAWLTAPADL 154 + L P D L G W G L E+ + P D Sbjct: 505 DQASQALTAGIAHPKDFLAVSGVCVWPKLGGSLLSEVKRGVFDIIPKDR 553 >UniRef50_B9HGN1 Predicted protein n=1 Tax=Populus trichocarpa RepID=B9HGN1_POPTR Length = 1080 Score = 105 bits (262), Expect = 7e-22, Method: Composition-based stats. Identities = 36/157 (22%), Positives = 61/157 (38%), Gaps = 15/157 (9%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A L F +S + I + + N G G+I NK L ++ + E+ K+ E Sbjct: 928 GSILVATEKLNTQPFDKSRILIVKSDQNTGFQGLIYNKHLRWDTLQELEEESKLLKEA-- 985 Query: 64 ESIRLDKPVMLGGP-LAEDRGFILHTPPSNFASSIRISDNTVM---TTSRDVLETLGTDK 119 P+ GGP + + T + ++ T + + +E + + Sbjct: 986 -------PLSFGGPLVTRGMPLVALTRRAVGGQYPEVAPGTYFLGQSATLHEIEEISSGN 1038 Query: 120 Q-PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLN 155 Q SD LG++SW QL EI AW + Sbjct: 1039 QCVSDYWFFLGFSSWGWEQLFDEIAQGAWNLSEHKKE 1075 >UniRef50_C1E1K3 Predicted protein n=2 Tax=cellular organisms RepID=C1E1K3_9CHLO Length = 271 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 68/184 (36%), Gaps = 21/184 (11%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P ++ +R SVV + H+ G+ G+I+N+P + E I Sbjct: 76 TGCLLLA-PETEEGWWRHSVVLVLNHDAEGSTGVILNRPTNAQLKNVVPE---IDYSAPH 131 Query: 64 ESIRLDKPVMLGGPLAEDRG---FILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQ 120 + ++ V +GGP+ ++G + + ++ + + + + + Sbjct: 132 HRVLANRHVSMGGPMGTEKGARCLVALSHTRLDGATSEVFPGLWHVSDFS---AVKPEHE 188 Query: 121 PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILF----------KTPIADRWREAA 170 PS ++V +GY W GQL E+ N W A A W Sbjct: 189 PS-LMVFVGYCGWMSGQLNAEVAANGWTVAAASAANTLALVNASARAGDVMGESMWATMR 247 Query: 171 KLIG 174 +G Sbjct: 248 GRLG 251 >UniRef50_Q0IWV7 Os10g0485100 protein (Fragment) n=5 Tax=Poaceae RepID=Q0IWV7_ORYSJ Length = 855 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 62/173 (35%), Gaps = 24/173 (13%) Query: 3 LQHHFLIAMPALQDPI-FRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPE 60 L L A L + F S V I ++ G G+I+NK L + + ++ Sbjct: 699 LTGSVLTATSKLGSAVPFDNSQVLIVSADSREGFHGLIINKRLSWDTFKNLDGSMEPIKH 758 Query: 61 PRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMT---TSRDVLETLGT 117 P+ GGP+ +++ F +++ + V + + Sbjct: 759 A---------PLFYGGPVVVQGYYLVSLSRVAFDGYLQVIPGVYYGNVAATAQVTRRIKS 809 Query: 118 DK-QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR-WRE 168 + ++ LG+++WE QL E+ + AW + PI W E Sbjct: 810 GEQSAENLWFFLGFSNWEYSQLFDELSEGAWQVSE--------EPIEHLVWPE 854 >UniRef50_D1HM83 Whole genome shotgun sequence of line PN40024, scaffold_108.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HM83_VITVI Length = 266 Score = 103 bits (256), Expect = 4e-21, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 50/178 (28%), Gaps = 43/178 (24%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPR 62 + LIA L F R+V+ + G GII+N+P + Sbjct: 120 KGCLLIATEKLDGVHIFERTVILLLSTGPVGPTGIILNRPS-------------LMSIKE 166 Query: 63 DESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDV---LETLGTDK 119 S LD + + T V E + + Sbjct: 167 TRSTVLDTGLF-----------------------EEVMKGLYYGTKESVGCAAEMVKRNA 203 Query: 120 -QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL--FKTPIADRWREAAKLIG 174 D GY WEK QL EI W A +++ W E L+G Sbjct: 204 VAVEDFRFFDGYCGWEKEQLRDEIRAGYWTVAACSPSVIGLTSVGSVGLWEEIIGLMG 261 >UniRef50_Q4QG99 Putative uncharacterized protein n=3 Tax=Leishmania RepID=Q4QG99_LEIMA Length = 670 Score = 103 bits (256), Expect = 4e-21, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 69/162 (42%), Gaps = 13/162 (8%) Query: 6 HFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITP---EPR 62 LI+ P + FRR+V+ + H T+ + +++NKPL N + + + + P Sbjct: 361 QLLISHPTARG-FFRRTVLLMVRHVTHESAALVLNKPLRNEEGLEMSIEATVRLGRVHPI 419 Query: 63 DESIRLDKPVMLGGPLA-----EDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT 117 +M+GGP+ +D F+LH P ++ + N + DVL Sbjct: 420 FRRHLAQHTLMIGGPVMSGSSFDDSIFLLHRVP-GVPHALPLGSNLWLDGDLDVLMAKLD 478 Query: 118 DKQP---SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNI 156 ++ D++V G+A W QL+ E+ W+ A Sbjct: 479 AEEASAEEDIVVLCGFAGWGFDQLKGELGHGYWVVASGPSAD 520 >UniRef50_B8C1S9 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C1S9_THAPS Length = 632 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 62/164 (37%), Gaps = 11/164 (6%) Query: 13 ALQDPIFRRSVVYICEHNTNGAMGIIVNKPLE-NLKIEGILEKLKITPEPRDESIRLD-- 69 L F ++V+ + H++ GII+N+P +L E +++ D ++ Sbjct: 115 GLSQQYFHKAVLLVTYHSSEFTKGIILNRPTNLHLDDEDFIDESGEPFIKSDNALEDMNS 174 Query: 70 KPVMLGGPL-----AEDRGFILHTPPSNF--ASSIRISDNTVMTTSRDVLETL-GTDKQP 121 + GG + + LH+ SN S I N +T + + + Sbjct: 175 WRIWFGGDVNGMYSDDPEIVCLHSIDSNLGKNLSEEIIKNIFLTNYEGARKLIDANEATS 234 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADR 165 D V GY W GQL E+ +W AD ++ + R Sbjct: 235 QDFWVFAGYCGWSAGQLLDELKHESWYMVSADSQTVWSELVRQR 278 >UniRef50_D1I242 Whole genome shotgun sequence of line PN40024, scaffold_10.assembly12x (Fragment) n=3 Tax=rosids RepID=D1I242_VITVI Length = 1106 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 57/154 (37%), Gaps = 16/154 (10%) Query: 5 HHFLIAMPALQDPI-FRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEPR 62 L+A L D F +S + I + + G G+I+NK + + + E + E Sbjct: 951 GSILVATDKLLDAHPFDKSTILIVKADQATGFHGLIINKHINWESLNELAEGVDHLKEA- 1009 Query: 63 DESIRLDKPVMLGGPL-AEDRGFILHTPPSNFASSIRISDNTVM---TTSRDVLETLGTD 118 P+ GGP+ + + T + + + +E L + Sbjct: 1010 --------PLSFGGPVVKRGKPLVALTRRVFKDQHPEVLPGVYFLDQSATVSEIEGLKSG 1061 Query: 119 -KQPSDVLVALGYASWEKGQLEQEILDNAWLTAP 151 + S+ +G+++W QL EI + AW Sbjct: 1062 NESVSEYWFFVGFSNWGWDQLFDEIAEGAWNITD 1095 >UniRef50_Q9LT30 Genomic DNA, chromosome 3, P1 clone: MPN9 n=2 Tax=Arabidopsis thaliana RepID=Q9LT30_ARATH Length = 963 Score = 101 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 4 QHHFLIAMPALQDP-IFRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEP 61 L+A L F +S + I + G +G+I NK + + E ++ E Sbjct: 807 TGTVLVATEKLAASLTFAKSKILIIKAGPEIGFLGLIFNKRIRWKSFPDLGETAELLKE- 865 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILH----TPPSNFASSIRISDNTVM----TTSRDVLE 113 P+ GGP+ + +L S IS + +R + E Sbjct: 866 --------TPLSFGGPVVDPGIPLLALTRERDSSTNHDHPEISPGVYFLDHQSVARRIQE 917 Query: 114 TLGTDKQPSDVLVALGYASWEKGQLEQEILDNAW 147 + PS+ LGY+SW QL EI W Sbjct: 918 LKSRELNPSEYWFFLGYSSWSYEQLFDEIGLGVW 951 >UniRef50_B6SSQ6 Uncharacterized ACR, COG1678 family protein n=3 Tax=Andropogoneae RepID=B6SSQ6_MAIZE Length = 292 Score = 99.6 bits (247), Expect = 4e-20, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 64/189 (33%), Gaps = 21/189 (11%) Query: 4 QHHFLIAMPALQDPI-FRRSVVYICEHNTN-GAMGIIVNKPLENLKIEGILEKLKITPEP 61 + LIA L F R+V+ + ++ G +G+I+N+P + I+ + + Sbjct: 102 KGCLLIATEKLDGSHIFERTVILLLSSPSSLGPVGVILNRPS-LMSIKEASGSI-FADDA 159 Query: 62 RDESIRLDKPVMLGGPLAEDRGFILHTPP----------SNFASSIRISDNTVMTT--SR 109 +P+ GGPL + F++ + + T + Sbjct: 160 DIARAFAGRPLFFGGPLE-ECFFVIGPRAAAGGGGDDAVARTGLFEEVMPGLHYGTRETV 218 Query: 110 DVLETLGTDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL-FKTPIA-DR 165 L D G+ WE+ QL E+ W A +L T + Sbjct: 219 GCAAELAKRGVVGVRDFRFFDGFCGWEREQLRDEVRAGLWHVAACSAAVLELATVVKGGL 278 Query: 166 WREAAKLIG 174 W E L+G Sbjct: 279 WEEVQGLVG 287 >UniRef50_A9S274 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9S274_PHYPA Length = 1306 Score = 98.1 bits (243), Expect = 1e-19, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 61/162 (37%), Gaps = 18/162 (11%) Query: 4 QHHFLIAMPALQD-PIFRRSVVYICEHNTNGAM-GIIVNKPLENLKIEGILEKLKITPEP 61 L+A P L +F V+ I + +G + G+++NKPL + Sbjct: 1145 AGTLLLASPLLDGTSVFSGCVILIVHAHEHGDVRGLMLNKPLS----------WDYVAKT 1194 Query: 62 RDESIRLDKPVMLGGPL-AEDRGFILHTPPSNFASSIRISDNTVMTTS----RDVLETLG 116 + + P+ GGP+ + F + T + S D+++ + Sbjct: 1195 IGQDSLHEAPLGFGGPVGEQSHPFFVLTKVPGLDDFHEVMPGVFYGVSAKSVEDLIQLMQ 1254 Query: 117 TDKQPS-DVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL 157 + K DV V LG +W QL++E+ W + ++ Sbjct: 1255 SGKLIEADVWVFLGCTAWSWFQLQEELAQQIWNVSGHYNGLV 1296 >UniRef50_C5BU30 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BU30_TERTT Length = 192 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 47/175 (26%), Positives = 75/175 (42%), Gaps = 10/175 (5%) Query: 3 LQHHFLIAMPALQD-PIFRRS---VVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKIT 58 L + LIA PA D P F S ++Y+ H +GA+G+ +N+ + E+ +I Sbjct: 5 LTDNVLIANPATTDLPQFAASAEKLIYVVHHGDDGAVGVCLNE-YFGKPLADFSEQYEIL 63 Query: 59 PEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD 118 S+ V GGPLA + +IL + I + + + S+ E Sbjct: 64 ASVSPLSLAS-VTVHSGGPLATELPWILSRAVDIYPHQIN-NKSLSLNFSQ---EAFADP 118 Query: 119 KQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 D LV LG SW GQLE+E+ W PA +L + +++ A + Sbjct: 119 SIHMDALVGLGSFSWGPGQLEKEVSGFMWHCFPAQKPLLNRLHFEHKYQSAVDTL 173 >UniRef50_A4RT64 Predicted protein n=2 Tax=Ostreococcus RepID=A4RT64_OSTLU Length = 236 Score = 95.8 bits (237), Expect = 7e-19, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 54/166 (32%), Gaps = 17/166 (10%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILE------KLKI 57 + L+A+ + V+++ +H G+ GII+N+ + E + Sbjct: 36 KGALLVAVEEDASSFWSHVVIFMLDHTPYGSTGIILNRTQSWTLAKHCPEVKHDNLYWSL 95 Query: 58 TPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGT 117 E + DR I T + + + D L L Sbjct: 96 LSEEVVGVGGPVGLDH-----SLDRSVIALTTKEQPGMTEEVIPGIYRVINLDQLAKLNA 150 Query: 118 DKQ------PSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL 157 P D+ + +GY+ W GQL+ EI W A A + Sbjct: 151 KLSGPGTLRPEDLSLFVGYSGWSPGQLQSEIDAGFWTLASASGTYV 196 >UniRef50_Q7UKQ8 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UKQ8_RHOBA Length = 313 Score = 94.2 bits (233), Expect = 2e-18, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 70/219 (31%), Gaps = 55/219 (25%) Query: 4 QHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENL----------------- 46 + L++ + + ++V + +T A+G+++N+P++ + Sbjct: 78 AGNLLVSSTLVDGTVLNQAVCLMVHEDTEHAIGLMLNRPMQAMAGAITIQGTPQETPKIP 137 Query: 47 --KIEGILEKLKITPEPRDESIRLDKPV-------------------------------M 73 E + E + S ++ PV Sbjct: 138 RWNAEDLDETSEGDSTIDPSSDSIEHPVSGVPSVVISGDQKDQLAQQLLSGKLANGSSLH 197 Query: 74 LGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASW 133 GGPL+ G I+ + + + + R+ LE L + +G+ W Sbjct: 198 FGGPLS---GPIVAVHSNRELAEAETGEGIFVAAQRENLEALMKSSD-LPYRLIIGHLGW 253 Query: 134 EKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKL 172 QLE EI + W PA + L + A W + Sbjct: 254 TAEQLENEIEEGIWHRIPATSD-LLNSDDAMMWPRMIRR 291 >UniRef50_C5L030 Membrane associated RING finger, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5L030_9ALVE Length = 388 Score = 93.4 bits (231), Expect = 3e-18, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 67/197 (34%), Gaps = 36/197 (18%) Query: 4 QHHFLIAMPALQDP--IFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEP 61 + L+A + P IF RSV + EH+ G++ +I+NKP+ I + + P Sbjct: 194 KGALLVANDNMIGPGSIFYRSVALVLEHDHMGSLALILNKPVARSPIADYRDAAEGPPVE 253 Query: 62 RDESIRLDKPVMLGGPL---AEDRGFILHTPPSNFASSIRIS----DNTVMTTSRDVL-- 112 GGP+ E R I+H + + +S D+ + + Sbjct: 254 LLTVR--------GGPVRINEERR--IMHQARGVVGARVLLSQDREDSVYLGGDLTAVLA 303 Query: 113 ----ETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTA--PADLNILFK------- 159 + G + + ++ G A W GQL E+ +W P IL Sbjct: 304 GIAQQDRGAGDERAHAIIFDGCARWAPGQLYGELRAGSWRWINPPWPEEILLSAFDQHGA 363 Query: 160 --TPIADRWREAAKLIG 174 + G Sbjct: 364 GVDNGEAMYHMIMNDYG 380 >UniRef50_B8CDL7 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8CDL7_THAPS Length = 531 Score = 93.1 bits (230), Expect = 5e-18, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 43/123 (34%), Gaps = 6/123 (4%) Query: 65 SIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDN--TVMTTSRDVLETLGTDK-QP 121 + V +GGP D L ++ S+ IS ++ + + K +P Sbjct: 398 AFENQCGVYVGGPDKMDEPATLIHGIADLPGSVEISPGTGIYEGGLEAAMDGVLSGKYKP 457 Query: 122 SDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVDIL 178 D +G+ S+E G+L+ + ++ K W E + G ++ Sbjct: 458 LDFRFFIGHTSYEGGELDYACEVGKYQPVACSRPLVLKQCMQLPKPLWHEVLEFCGGELK 517 Query: 179 TMP 181 + Sbjct: 518 EIS 520 >UniRef50_A9T9H0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T9H0_PHYPA Length = 461 Score = 91.5 bits (226), Expect = 1e-17, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 58/215 (26%), Gaps = 51/215 (23%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNT-NGAMGIIVNKPLENLKIEGILEKLKITPEPRD 63 L+A P + P + +VV + EH G+ GII+NK E IL+ Sbjct: 249 GTLLVATPTMLSPYYFGTVVLLYEHERCRGSRGIILNKQAEK---AEILKWENQLFLGVH 305 Query: 64 ESIRLDKPVM-LGGPLAEDRGFIL-----------HTPPSNFAS-------SIRISDNTV 104 S L GG D FIL H I Sbjct: 306 NSAALRHITHGTGGSHKPDDWFILQRCSPSPVKCKHCAADTARPKTCSKDWGREILPGIF 365 Query: 105 MTTSRD-VLETL----------------GTDKQPSDVL-----------VALGYASWEKG 136 + VL L + V V G+A W G Sbjct: 366 LGKDVGPVLRHLNGCKRLSVMGQQCSVKAESIKEDRVRKNEECYYVDHQVIHGHAEWYVG 425 Query: 137 QLEQEILDNAWLTAPADLNILFKTPIADRWREAAK 171 QL + W T IL TP + W Sbjct: 426 QLGSAVKRGLWKTRENASAILLSTPPHELWHTLLS 460 >UniRef50_D2VPR5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VPR5_NAEGR Length = 413 Score = 86.9 bits (214), Expect = 3e-16, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 65/188 (34%), Gaps = 50/188 (26%) Query: 3 LQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLK-------IEGILEKL 55 L++H LIA P L + F R+V+ + ++ + G IV K + E +LE Sbjct: 236 LKNHLLIAHPMLANKYFERTVIRMDDNIQDT--GYIVGKLSNDENTNEDKKKFEDLLEDD 293 Query: 56 KITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETL 115 + TPE ++ L V +R F + LE Sbjct: 294 EKTPESEAKAKTLMDFV-----SQINREF-------------------------ESLEKK 323 Query: 116 -GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILF----KTPIADRWREAA 170 + +P + A W QL EILD W+ D+ +TP W Sbjct: 324 HSKEAEPERL------ARWYTHQLANEILDGVWIVVAVDMEAFLPFAKETPYDKVWEYLV 377 Query: 171 KLIGVDIL 178 +G + Sbjct: 378 SRLGGEYE 385 >UniRef50_B7G772 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G772_PHATR Length = 390 Score = 83.8 bits (206), Expect = 3e-15, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 58/177 (32%), Gaps = 22/177 (12%) Query: 24 VYICEHNTNGAM--GIIVNKPLENL--KIEGILEKLKITPEPRDESIRLD----KPVMLG 75 V I G+ +++N+ L +E L T + + L+ +P+ G Sbjct: 203 VLIVVECGEGSRVRAVLLNRRTGYLLGDLEQADTNLGSTSSKKAPTPVLEKFCIQPLWFG 262 Query: 76 GPLAEDRGF-ILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD-----KQPSDVLVALG 129 G G +LH P+ + + + + + D + Sbjct: 263 GVDNVSAGLDMLHQCPTVPDAEPLSDEGLYWGGDPALAQDAMDEVTDKVLTGFDFKFFVQ 322 Query: 130 YASWEKG-QLEQEILDNAWLTAPADLNILFKT-------PIADRWREAAKLIGVDIL 178 W +L++EI + W TA +LFK+ W E +L+G Sbjct: 323 STVWGSSKELQKEIDNGTWFTARVSKEVLFKSRDRMGTRRAKPLWTEVMELLGGKYK 379 >UniRef50_B7G8R3 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G8R3_PHATR Length = 491 Score = 81.1 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 68/180 (37%), Gaps = 22/180 (12%) Query: 23 VVYICEHNT-NGAMG-IIVNKPLENLKIEGILEKLKITPEPRDESIRL------------ 68 V + E N NGA +++N+P+ LK+ L +L + R E + Sbjct: 264 VCLVMERNESNGAATTLVLNRPM-ALKLTDSLGQLVLNGAYRGEKTKPKKDVTRFMRAFG 322 Query: 69 -DKPVMLGGPLAEDRGFILHTPPSNFASSIRISD--NTVMTTSRDVLETLGTDK-QPSDV 124 + V +GGP +D+ +L ++ A + IS +E + + K QP D Sbjct: 323 GECAVYIGGPDDQDQPAVLVHGLADLAGANEISPGSGIYQGGIEAAVEGVISGKYQPLDF 382 Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTP---IADRWREAAKLIGVDILTMP 181 +G + + L+ ++ + ++ K W E +L ++ + Sbjct: 383 RFFVGRHVYVESTLDLSVVLGKYQPVACARSVALKQCLSLPKPLWHEVLELCAGELADIS 442 >UniRef50_B8BQD7 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BQD7_THAPS Length = 499 Score = 80.7 bits (198), Expect = 2e-14, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 52/183 (28%), Gaps = 36/183 (19%) Query: 33 GAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGG---------------- 76 G ++VN+ +L I + +P+ +GG Sbjct: 304 GCKALMVNQRSGSL-IGDVATNPNYNTNSSKWGPFFIQPLWIGGTQAVITSWDSTSYDIT 362 Query: 77 -----PLAEDRGFILHTPPSNFASS-IRISDNTVMTTSRD------VLETLGTDKQPSDV 124 + +LH P S + + D + E L D Sbjct: 363 EFETFEKSLQNPGMLHMCPFVTDSQNLTMCDGLYWGGDPGQAQEAMIDERLDKPMSGFDF 422 Query: 125 LVALGYASWEKGQLEQEILDNAWLTAPADLNILFK-------TPIADRWREAAKLIGVDI 177 + W QLE+EI+D W +LF+ W E +L+G D Sbjct: 423 KFFVKDTRWLPSQLEEEIMDGTWHVTTVSKEVLFRNRDRLGPKRAKPLWTEIMELLGDDY 482 Query: 178 LTM 180 + Sbjct: 483 KHI 485 >UniRef50_B8BZN2 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8BZN2_THAPS Length = 646 Score = 78.8 bits (193), Expect = 9e-14, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 57/168 (33%), Gaps = 22/168 (13%) Query: 13 ALQDPIFRRSVVYICEHNTNGAMGIIVNKPLE----------NLKIEGILEKLKITPEPR 62 AL + +R+S+V + + N GII+N+P + E +I Sbjct: 141 ALNNQCYRKSIVLVLDVQQNFIQGIILNRPTNIGVKQGMQFVQPGHGEVFEN-EIGSCDG 199 Query: 63 DESIRLDKPVMLGGPL-----AEDRGFILHT--PPSNFASSIRISDNTVMTTSRDVLETL 115 S V GG + + LH+ S + ++T S D + L Sbjct: 200 SGSSPHRWKVWFGGEVAGPFSEYPQVMCLHSVNTDLGVELSDAVLPGILIT-SFDGAQRL 258 Query: 116 --GTDKQPSDVLVALGYASWEKGQLEQEI-LDNAWLTAPADLNILFKT 160 + PS + G WE E+ + WL +D + + Sbjct: 259 VDAGEANPSSFWLFCGICGWETSSFYSEMHDEGLWLVVSSDGGTILEE 306 Score = 44.9 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 54/141 (38%), Gaps = 23/141 (16%) Query: 2 NLQHHFLIAMPA------LQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKL 55 +L + A A L D F +S++ I + + + G+I+N + +E I+ Sbjct: 421 SLVGSMIRASSAERSPFLLADQGFHKSLILIVRDDVDCSEGVILNH----MTMESIM--- 473 Query: 56 KITPEPRDESIRLDKPVMLGGPLAE-DRGFILHTPPSNFASSIRISD-NTVMTTSRDVLE 113 + PV GGP+ + L++ S +++ + T +++E Sbjct: 474 -------LGDGKTCLPVRYGGPMQVSEPIMYLYSNESLDCVGVQMGNSEIYSCTEDEIIE 526 Query: 114 TLGTD-KQPSDVLVALGYASW 133 ++ D L G + W Sbjct: 527 SIELGLASADDFLAIQGISVW 547 >UniRef50_C0AXX9 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AXX9_9ENTR Length = 56 Score = 65.7 bits (159), Expect = 7e-10, Method: Composition-based stats. Identities = 24/49 (48%), Positives = 35/49 (71%) Query: 139 EQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPGVAGHA 187 E+ N+WLT A I+F TP+A+RW +AA+LIG++I T+ +AGHA Sbjct: 8 ERNFRKNSWLTVEASPQIIFDTPVAERWHKAAELIGINIHTISPIAGHA 56 >UniRef50_B6KQL2 Putative uncharacterized protein n=4 Tax=Toxoplasma gondii RepID=B6KQL2_TOXGO Length = 1530 Score = 58.4 bits (140), Expect = 1e-07, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 26/90 (28%), Gaps = 21/90 (23%) Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADL-----NILFKTPI-------- 162 V LG ASW GQLE+EI AW+ I+F Sbjct: 1412 AEGSPVFLSRVFLGKASWSPGQLEREIEKGAWVVVGCTDSGVMQEIVFGREPSAPGGAPG 1471 Query: 163 -------ADRWREAAKLIGVDILTMPGVAG 185 WR + + + AG Sbjct: 1472 GHTPPSEEHLWRRVLSALAANPAS-GPQAG 1500 >UniRef50_A5AUI4 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AUI4_VITVI Length = 218 Score = 56.9 bits (136), Expect = 4e-07, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 24/70 (34%), Gaps = 5/70 (7%) Query: 105 MTTSRDVLETLGTDK--QPSDVLVALGYASWEKGQLEQEILDNAWLTAPADL---NILFK 159 S D L +P D + +GY W+ QL +E+ + A + + Sbjct: 138 WNESLDEAGKLVKQGVLKPEDFIFFVGYVGWQLDQLREEMGSDYGYVAAYSPYVIDGVLT 197 Query: 160 TPIADRWREA 169 + W E Sbjct: 198 ESSSGVWDEV 207 >UniRef50_B8CF73 Predicted protein n=2 Tax=Thalassiosira pseudonana RepID=B8CF73_THAPS Length = 656 Score = 56.1 bits (134), Expect = 6e-07, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 39/132 (29%), Gaps = 30/132 (22%) Query: 57 ITPEPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLET-- 114 I +E + GG + +S+ +S+ +D T Sbjct: 468 IPSLLDNEDGIDTEAFYFGGDV--------------IRASLDVSEGI---EDQDGNFTHP 510 Query: 115 -LGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWREAAKLI 173 L D +G + W GQLE+EI WL D ++ Sbjct: 511 PLLYLHFKDDFSFIIGASCWAPGQLEKEIERGCWLPFRGDPSMALTGECDH--------- 561 Query: 174 GVDILTMPGVAG 185 D+ T+ G Sbjct: 562 -NDVATLSTNDG 572 Score = 41.1 bits (95), Expect = 0.022, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 25/74 (33%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEH--------------------NTNGAMGIIVNK--- 41 FLIA P + F +SV+ + +H + G G+I+N+ Sbjct: 276 GSFLIAHPLMTG-YFAKSVILLLDHTEASSKSTSSTESQAESEEVGSGGTYGLIINRLAL 334 Query: 42 -PLENLKIEGILEK 54 P+ + K I+ + Sbjct: 335 QPVSSEKRLDIIRQ 348 >UniRef50_C5KHP8 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KHP8_9ALVE Length = 473 Score = 54.5 bits (130), Expect = 2e-06, Method: Composition-based stats. Identities = 24/115 (20%), Positives = 37/115 (32%), Gaps = 17/115 (14%) Query: 74 LGGPLAEDRGFILHTPPSNFASSIRISD-----NTVMTTSRDVLETLGTDKQPSDVLVAL 128 GGP+A +LH + + I + L T+ + Sbjct: 316 FGGPVA--SVEVLHESLVRGKNPLSIGPNETSIGLFHGWKQTEDTELLTEATTPR-RTFV 372 Query: 129 GYASWEKGQLEQEILDNAWLTA-----PADLNILFKTP---IADRWREAA-KLIG 174 G A+WE+GQLE+E+ W A I+ D W + G Sbjct: 373 GKAAWERGQLEREMNLGVWYPVRVTCPEALRKIMLGNHELSEDDLWAAMVTESCG 427 >UniRef50_B7GE41 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7GE41_PHATR Length = 258 Score = 47.6 bits (112), Expect = 2e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNG-----AMGIIVNKPLENLKIEGILEKLKITP 59 L+A +R++ +++ + G+I+++P + ++E Sbjct: 78 GVVLLAPTNEYHHYYRQAAIFVHAMGEDDDDVYVIRGVILDQPTP-FTLGEMMEH----N 132 Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD- 118 ++ D + GG D+ +LH SSI +S + Sbjct: 133 PALQKTPLKDNLLFRGGDKGGDQVVLLHNHEEIGQSSIGVS-GVFQGGFDQAMAACEKGH 191 Query: 119 KQPSDVLVALGYASWEKGQLEQEILD----NAWLTAPADLNILFKT 160 +Q SD + Y + + ++E + +AW++ D + + Sbjct: 192 RQTSDFKLFFNYCEFTELEMEDLLASDEDGDAWISVEVDSDFVLND 237 >UniRef50_Q4DTN4 Putative uncharacterized protein n=3 Tax=Trypanosoma RepID=Q4DTN4_TRYCR Length = 552 Score = 46.5 bits (109), Expect = 5e-04, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 48/203 (23%), Gaps = 61/203 (30%) Query: 5 HHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDE 64 L+A P L R V+ I E N + ++++ + P Sbjct: 182 GVALVAHP-LSSTHVDRRVLLITERNPHVTTAVVLD---------MLFTYPLSRGNPMFP 231 Query: 65 SIRLDKPVMLGGPLAED-------RGFILHTPPSNFASSIR------------------- 98 + V GG + ILHT Sbjct: 232 EVFWGHEVHDGGFSQIGFTMPPTAQVIILHTTEPPTEPESPQYLAWLKWKERKPKTQAGD 291 Query: 99 -----------------------ISDNTVMT--TSRDVLETLGTDKQPSDVLVALGYASW 133 + ++ S L L K S + V G W Sbjct: 292 TPSEHHSLLCKPLIRGGVLEDGTVEPTLYLSKVESLPYLAQLVPGKPRSSLRVYWGSMRW 351 Query: 134 EKGQLEQEILDNAWLTAPADLNI 156 QLE E+ + W+ + Sbjct: 352 PTQQLEAEVANGHWIPVKLSPSF 374 >UniRef50_UPI0001BCD1A1 hypothetical protein AmarD1_07934 n=1 Tax=Aeromicrobium marinum DSM 15272 RepID=UPI0001BCD1A1 Length = 53 Score = 45.7 bits (107), Expect = 7e-04, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 18/45 (40%) Query: 138 LEQEILDNAWLTAPADLNILFKTPIADRWREAAKLIGVDILTMPG 182 +E E+ +++W+ AD L WR+ + D+ Sbjct: 1 MEDEVAESSWMVVAADPEDLLSPHPDTLWRQVLRRQQDDLRFWST 45 >UniRef50_B3LAF2 Putative uncharacterized protein n=1 Tax=Plasmodium knowlesi strain H RepID=B3LAF2_PLAKH Length = 1030 Score = 44.1 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 31/85 (36%), Gaps = 15/85 (17%) Query: 112 LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPAD-----LNILFKTPI---- 162 +E +++ + +G A+W+ QL E+ +N W+ D +I+F T Sbjct: 926 MEGKTSEQSNKLIKRFIGKATWDINQLMDELNNNYWIALNCDNKELLSSIIFNTADSVTD 985 Query: 163 ------ADRWREAAKLIGVDILTMP 181 W + I D + Sbjct: 986 GSAYKGEFLWEKIVASISNDYENIS 1010 >UniRef50_C1FFU8 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FFU8_9CHLO Length = 437 Score = 43.8 bits (102), Expect = 0.003, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 35/138 (25%), Gaps = 47/138 (34%) Query: 73 MLGGPLAEDRGFILHTPPSNFASSIRISDNT-------------------VMTTSRDVLE 113 +GGP+ R IL P+ + D T+ D E Sbjct: 284 YVGGPVHPRRRIILFVSPTGANALSPTGDALCEVPLDRTCHPGSRARAFAYHPTATDSDE 343 Query: 114 TL-----------------------GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTA 150 + V V G+A W + QL EI W Sbjct: 344 YVDEIAGEAAAAALAESEEEDAAFNADGFFVPRVHVFEGHAKWSRTQLMNEIARGDWGLC 403 Query: 151 PADLNILFKTPIADRWRE 168 PA L RW E Sbjct: 404 PATPEDLTS-----RWAE 416 >UniRef50_Q7RTH1 Glutamic acid-rich protein n=3 Tax=Plasmodium (Vinckeia) RepID=Q7RTH1_PLAYO Length = 617 Score = 43.4 bits (101), Expect = 0.004, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 26/85 (30%), Gaps = 27/85 (31%) Query: 124 VLVALGYASWEKGQLEQEILDNAWLTAPAD-----LNILFKTPIAD-------------- 164 + +G A+W+ QL +E+ ++ W+ D I+F T Sbjct: 513 IKRFIGKATWDLNQLIEELNNDYWIPINCDNKELLSKIIFNTTSEGNNNNENEQMASSDM 572 Query: 165 --------RWREAAKLIGVDILTMP 181 W + + D + Sbjct: 573 FNTYQGENLWEKILSSLNSDYENIS 597 >UniRef50_A5K110 Putative uncharacterized protein n=1 Tax=Plasmodium vivax RepID=A5K110_PLAVI Length = 1004 Score = 43.0 bits (100), Expect = 0.006, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 29/83 (34%), Gaps = 17/83 (20%) Query: 116 GTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLN-----ILFKTPIAD------ 164 + + V +G A+W+ QL E+ ++ W+ D I+F T + Sbjct: 902 ANEAENKLVKRFIGKATWDVNQLMDELKNDYWIALNCDSKELLSRIIFNTATSGSGAAGS 961 Query: 165 ------RWREAAKLIGVDILTMP 181 W + I D ++ Sbjct: 962 VYRGEFLWEKIVASINSDYESIS 984 >UniRef50_O68558 Putative uncharacterized protein (Fragment) n=1 Tax=Mycobacterium bovis RepID=O68558_MYCBO Length = 82 Score = 41.4 bits (96), Expect = 0.017, Method: Composition-based stats. Identities = 15/39 (38%), Positives = 24/39 (61%) Query: 13 ALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGI 51 L +P FRRSV+YI EHN G +G+++ N ++ + Sbjct: 1 DLLEPTFRRSVIYIVEHNDGGTLGVVLQSAQRNRGLQRV 39 >UniRef50_B8BQX9 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BQX9_THAPS Length = 194 Score = 40.3 bits (93), Expect = 0.032, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 35/126 (27%), Gaps = 17/126 (13%) Query: 60 EPRDESIRLDKPVMLGGPLAEDRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTD- 118 P + GG D +LH+ S + + + +E Sbjct: 67 SPNVMGNLSQNMLYRGGNSGSDTAMMLHSASSLQSGEMIGNSGIYEGGIVSAMEAADAGI 126 Query: 119 KQPSDVLVALGYASWEKGQLEQ-EILD--------NAWLTAPADLNILFKTPIAD--RWR 167 P++ Y Q + EI D +AW++ + + + W Sbjct: 127 ISPNNCKFFFNY-----MQFRELEIDDMFATVEDGDAWVSLEVPSEYVLDSDLDRGAMWS 181 Query: 168 EAAKLI 173 + I Sbjct: 182 KLRNKI 187 >UniRef50_Q00WL0 Protein involved in mRNA turnover and stability (ISS) n=1 Tax=Ostreococcus tauri RepID=Q00WL0_OSTTA Length = 565 Score = 39.9 bits (92), Expect = 0.046, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 28/77 (36%), Gaps = 3/77 (3%) Query: 79 AEDRGFILHTPPSNFASS-IRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQ 137 + +R +L S I + + +RD E + +LV G A W + Q Sbjct: 431 SRERVGVLTEFDDELKSWKIATTVGL-LKRTRDAFE-IIQRVSAVKLLVFFGDARWSRSQ 488 Query: 138 LEQEILDNAWLTAPADL 154 L EI W +D Sbjct: 489 LLGEIARGHWGLTKSDP 505 >UniRef50_A4I406 Putative uncharacterized protein n=3 Tax=Leishmania RepID=A4I406_LEIIN Length = 606 Score = 39.5 bits (91), Expect = 0.056, Method: Composition-based stats. Identities = 11/58 (18%), Positives = 21/58 (36%), Gaps = 2/58 (3%) Query: 101 DNTVMTTSRDV--LETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNI 156 ++ + + L L + S + V G W QLE E+ + W+ + Sbjct: 349 PTLYLSKAEALPYLAELVEGQPRSSLRVYWGNMRWTTSQLEAEVANGHWMAVKTSPSF 406 >UniRef50_C1E0N3 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E0N3_9CHLO Length = 511 Score = 39.1 bits (90), Expect = 0.070, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 23/59 (38%), Gaps = 3/59 (5%) Query: 109 RDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKTPIADRWR 167 D LE + + V G A W + QL EI W A + + T A+RW Sbjct: 404 PDNLEAMDGEGG--RVFAFWGDARWSRAQLLGEIARGHWGLCRAGVGDI-TTSTAERWE 459 >UniRef50_C1MQ75 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MQ75_9CHLO Length = 483 Score = 39.1 bits (90), Expect = 0.084, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 15/35 (42%) Query: 123 DVLVALGYASWEKGQLEQEILDNAWLTAPADLNIL 157 V V G+A W + QL E+ W PA L Sbjct: 421 KVHVFNGHARWSRSQLMNEVARGDWGLCPATRRDL 455 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.315 0.148 0.456 Lambda K H 0.267 0.0453 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,371,547,118 Number of Sequences: 3077464 Number of extensions: 59049988 Number of successful extensions: 134480 Number of sequences better than 1.0e-01: 178 Number of HSP's better than 0.1 without gapping: 431 Number of HSP's successfully gapped in prelim test: 57 Number of HSP's that attempted gapping in prelim test: 133211 Number of HSP's gapped (non-prelim): 528 length of query: 187 length of database: 1,040,396,356 effective HSP length: 121 effective length of query: 66 effective length of database: 668,023,212 effective search space: 44089531992 effective search space used: 44089531992 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 89 (38.7 bits)