BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (268 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P39364 Putative sgc region protein sgcQ n=61 Tax=Bacter... 556 e-157 UniRef50_B1CBM1 Putative uncharacterized protein n=1 Tax=Anaerof... 266 4e-70 UniRef50_A5KMZ4 Putative uncharacterized protein n=2 Tax=Clostri... 248 1e-64 UniRef50_C5CIF3 Photosystem I assembly BtpA n=1 Tax=Kosmotoga ol... 236 4e-61 UniRef50_A7B0Z5 Putative uncharacterized protein n=1 Tax=Ruminoc... 229 1e-58 UniRef50_C0D1F9 Putative uncharacterized protein n=1 Tax=Clostri... 228 2e-58 UniRef50_Q2RUX8 Photosystem I assembly BtpA n=1 Tax=Rhodospirill... 223 8e-57 UniRef50_Q28QI3 Photosystem I assembly BtpA n=5 Tax=Alphaproteob... 220 3e-56 UniRef50_A4E7P3 Putative uncharacterized protein n=1 Tax=Collins... 175 1e-42 UniRef50_B9CKX7 Putative uncharacterized protein n=1 Tax=Atopobi... 137 3e-31 UniRef50_B9XI59 Photosystem I assembly BtpA n=1 Tax=bacterium El... 129 1e-28 UniRef50_A9W9X3 Photosystem I assembly BtpA n=4 Tax=Chloroflexac... 112 1e-23 UniRef50_Q8TVC9 Predicted TIM-barrel enzyme n=1 Tax=Methanopyrus... 106 6e-22 UniRef50_P72966 Photosystem I biogenesis protein btpA n=34 Tax=C... 105 2e-21 UniRef50_A3KNP0 Zgc:162297 protein n=7 Tax=Coelomata RepID=A3KNP... 103 5e-21 UniRef50_A8AAQ2 Photosystem I assembly BtpA n=1 Tax=Ignicoccus h... 103 6e-21 UniRef50_Q5JHL2 Uncharacterized protein TK2179 n=5 Tax=Euryarcha... 102 1e-20 UniRef50_C5ELG9 Putative uncharacterized protein n=1 Tax=Clostri... 99 1e-19 UniRef50_C3ZBU0 Putative uncharacterized protein n=2 Tax=Metazoa... 97 4e-19 UniRef50_A6NZB9 Putative uncharacterized protein n=1 Tax=Bactero... 97 5e-19 UniRef50_O29828 Uncharacterized protein AF_0419 n=1 Tax=Archaeog... 96 1e-18 UniRef50_A8S303 Putative uncharacterized protein n=1 Tax=Clostri... 96 2e-18 UniRef50_UPI0000D55C2D PREDICTED: similar to conserved hypotheti... 94 4e-18 UniRef50_D2RQS6 Photosystem I assembly BtpA n=4 Tax=Halobacteria... 93 9e-18 UniRef50_A4YFU5 Photosystem I assembly BtpA n=1 Tax=Metallosphae... 93 1e-17 UniRef50_UPI000155D1C1 PREDICTED: hypothetical protein n=1 Tax=O... 92 3e-17 UniRef50_D2QXT9 Photosystem I assembly BtpA n=2 Tax=Bacteria Rep... 90 8e-17 UniRef50_UPI000186CA08 conserved hypothetical protein n=1 Tax=Pe... 89 2e-16 UniRef50_B5XQK9 BtpA family protein n=18 Tax=Proteobacteria RepI... 88 3e-16 UniRef50_B8HZU1 Photosystem I assembly BtpA n=5 Tax=Clostridiale... 87 6e-16 UniRef50_C8S5Y9 Photosystem I assembly BtpA n=1 Tax=Ferroglobus ... 82 3e-14 UniRef50_Q8U2H5 Uncharacterized protein PF0860 n=8 Tax=Euryarcha... 81 4e-14 UniRef50_C5EEU2 Photosystem I assembly BtpA n=2 Tax=Clostridiale... 80 5e-14 UniRef50_D2RDD9 Photosystem I assembly BtpA n=1 Tax=Archaeoglobu... 80 9e-14 UniRef50_A3K4E4 Putative uncharacterized protein n=1 Tax=Sagittu... 79 2e-13 UniRef50_UPI0001C369E0 btpA family protein n=1 Tax=Clostridium h... 77 5e-13 UniRef50_B9XEW7 Photosystem I assembly BtpA n=1 Tax=bacterium El... 77 6e-13 UniRef50_Q29E81 GA21203 n=3 Tax=Coelomata RepID=Q29E81_DROPS 75 2e-12 UniRef50_Q1NZ26 Uncharacterized protein F13E9.13, mitochondrial ... 75 3e-12 UniRef50_B9LR39 Photosystem I assembly BtpA n=7 Tax=cellular org... 75 3e-12 UniRef50_A5GQP5 Photosystem I biogenesis protein BtpA n=4 Tax=Ba... 70 6e-11 UniRef50_Q9VS44 CG8607 n=22 Tax=Eukaryota RepID=Q9VS44_DROME 70 8e-11 UniRef50_A2BLV0 Conserved archaeal protein n=1 Tax=Hyperthermus ... 67 5e-10 UniRef50_UPI000069ECFE UPI000069ECFE related cluster n=1 Tax=Xen... 65 2e-09 UniRef50_C0ZRZ3 Putative uncharacterized protein n=2 Tax=Rhodoco... 64 6e-09 UniRef50_A9FN23 Putative uncharacterized protein n=1 Tax=Sorangi... 59 2e-07 UniRef50_Q9Y937 BtpA homolog n=1 Tax=Aeropyrum pernix RepID=Q9Y9... 59 3e-07 UniRef50_A3DLN8 Photosystem I assembly BtpA n=1 Tax=Staphylother... 55 2e-06 UniRef50_Q16GL4 Putative uncharacterized protein n=2 Tax=Aedes a... 55 3e-06 UniRef50_Q2CH81 Putative uncharacterized protein n=1 Tax=Oceanic... 54 4e-06 UniRef50_C2BTC6 Possible photosystem I biogenesis protein BtpA n... 54 6e-06 UniRef50_Q18HA0 Photosystem I biogenesis protein homolog n=1 Tax... 51 4e-05 UniRef50_C5EPH7 Putative uncharacterized protein n=1 Tax=Clostri... 51 4e-05 UniRef50_UPI000180CC4C PREDICTED: similar to F13E9.13 n=1 Tax=Ci... 50 7e-05 UniRef50_Q96YL8 Putative uncharacterized protein ST2156 n=1 Tax=... 50 9e-05 UniRef50_Q166V4 Photosystem I biogenesis protein, putative n=2 T... 48 3e-04 UniRef50_A3MXM0 Photosystem I assembly BtpA n=5 Tax=Thermoprotea... 44 0.004 >UniRef50_P39364 Putative sgc region protein sgcQ n=61 Tax=Bacteria RepID=SGCQ_ECOLI Length = 268 Score = 556 bits (1433), Expect = e-157, Method: Compositional matrix adjust. Identities = 268/268 (100%), Positives = 268/268 (100%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS Sbjct: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI Sbjct: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN Sbjct: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT Sbjct: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDGVFANFVDQARVSQFMEKVHHIRR Sbjct: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 >UniRef50_B1CBM1 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBM1_9FIRM Length = 269 Score = 266 bits (681), Expect = 4e-70, Method: Compositional matrix adjust. Identities = 128/265 (48%), Positives = 176/265 (66%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 W+KE+ GT+K ++AM HL ALPGDP +D G+ +VI++A ++ ALQ+GGVD ++ SNE Sbjct: 2 WMKEIFGTDKPIVAMLHLAALPGDPLYDENKGLCYVIERAKREIKALQDGGVDGILISNE 61 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFT 122 +S PY+ V T +MAR+IGQL IP GV ++ DP +FDLA + GAKF+R FT Sbjct: 62 YSFPYMGDVPIITAMSMARVIGQLKEYFTIPMGVQIISDPYKTFDLAASVGAKFVRGTFT 121 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 G++A D G+ + G+ +RH+ +GA +VK ++N+VPEAA YL +R IA STVF+ Sbjct: 122 GSFAGDHGIAVYDTGKIMRHKIAVGAKDVKCMYNLVPEAAKYLVDRSWEEIADSTVFHCK 181 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFK 242 PDAL V+G AG D+ ++ RVK+ VP+T V ANTGV EN+E QL+ DG + TTFK Sbjct: 182 PDALMVAGFLAGREADTQIMTRVKKVVPNTPVFANTGVRYENIEMQLAACDGAIVGTTFK 241 Query: 243 KDGVFANFVDQARVSQFMEKVHHIR 267 +DG F RV FM KV R Sbjct: 242 EDGDFYKEAKYDRVKAFMNKVREFR 266 >UniRef50_A5KMZ4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5KMZ4_9FIRM Length = 269 Score = 248 bits (634), Expect = 1e-64, Method: Compositional matrix adjust. Identities = 129/268 (48%), Positives = 168/268 (62%), Gaps = 4/268 (1%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 W +++ G EK +IA+ HL ALPGDP + M V + A DL+ALQ+GGVD ++F+NE Sbjct: 2 WTQDMFGVEKPIIALLHLDALPGDPGYCGD--MKTVTEHARKDLLALQDGGVDGILFANE 59 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFT 122 FSLPY +AMA IIG+L +I +PFGVNV+ +P+A+ DL ATGAKF R F+ Sbjct: 60 FSLPYQPVADIAVVSAMAYIIGKLKDEISVPFGVNVVKNPIATIDLGAATGAKFGRSCFS 119 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 GAY ++GV+ +N GE IRH+ +G ++K LF + PEA YL RD+ +AKS +F + Sbjct: 120 GAYMGEYGVYVSNSGEAIRHRKALGIEDMKLLFKVNPEADAYLVQRDVQVVAKSIMFGDF 179 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETV-PDTV-VLANTGVCLENVEEQLSIADGCVTATT 240 D LCVSG AG D +L RV E P V V NTG NV E+L DG T Sbjct: 180 ADGLCVSGAAAGAEPDDVILSRVHEVAKPRKVPVFCNTGCNHGNVREKLGNCDGVCMGTA 239 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDGVF VD+ RV +FME V IR+ Sbjct: 240 FKKDGVFNGRVDKERVREFMEIVADIRK 267 >UniRef50_C5CIF3 Photosystem I assembly BtpA n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIF3_KOSOT Length = 260 Score = 236 bits (603), Expect = 4e-61, Method: Compositional matrix adjust. Identities = 117/250 (46%), Positives = 157/250 (62%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ +KE+ G EK +I M H LPG P +D + G+ +++++ DL +LQNGG+DAVMF Sbjct: 1 MATVKEIFGKEKVIIGMVHFPPLPGSPLYDDKKGVEFIVERIKSDLKSLQNGGIDAVMFC 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE PY KV T A M+R IG++M +IR+PFGV+VLWDP A+ +A A GAKFIREI Sbjct: 61 NENDRPYKLKVDSATVATMSRAIGEVMDEIRVPFGVDVLWDPFAAIAIAKAVGAKFIREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 TG Y SD G+W T VGE R++ + A ++ FNI E A L R + IAKS F+ Sbjct: 121 ITGTYVSDMGLWKTEVGEFYRYRKLLDANDIAVFFNISAEFAYNLDRRPLEEIAKSVAFS 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + D + VSG G +K+VK+ V + V ANTGV ENV E L+IADG + T+ Sbjct: 181 SLADVILVSGPMTGESPSLDHIKKVKDKVGEKPVFANTGVTKENVREILNIADGAIIGTS 240 Query: 241 FKKDGVFANF 250 KKDG+ F Sbjct: 241 LKKDGITRRF 250 >UniRef50_A7B0Z5 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B0Z5_RUMGN Length = 271 Score = 229 bits (583), Expect = 1e-58, Method: Compositional matrix adjust. Identities = 116/268 (43%), Positives = 160/268 (59%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M W +++ G +K +IAM HL LPGDP + + M+ +I+ A DL ALQ+GGV+ ++FS Sbjct: 1 MLWTEKLFGVKKPIIAMLHLDPLPGDPLYKKENDMDVIIEHARADLHALQDGGVNGIIFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFS PY + T AAMA +IG L S+I++P+GV+ + D A +LA A A F+R Sbjct: 61 NEFSFPYQRTMDMVTPAAMAYVIGNLRSEIKVPYGVDAISDGRACLELAAAVKANFVRGT 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F G Y D G ++ + +R + + E+K L+ I PE+ L R + IAK+T+ Sbjct: 121 FCGVYVGDGGFYNNDFSALLRRKAALPLDELKMLYFINPESDQSLDTRPLADIAKTTIAK 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 PD LC+S AG D AL+ VKE PD VVL NTG + +E +L+ AD V TT Sbjct: 181 AAPDGLCISADAAGQDVDDALIASVKEANPDIVVLCNTGCRINTIERKLTTADAAVVGTT 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDG F N VD RV +FM+ VH R Sbjct: 241 FKKDGKFENRVDVNRVKEFMQVVHEFRE 268 >UniRef50_C0D1F9 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D1F9_9CLOT Length = 276 Score = 228 bits (580), Expect = 2e-58, Method: Compositional matrix adjust. Identities = 120/273 (43%), Positives = 159/273 (58%), Gaps = 6/273 (2%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M W +++ G +K +I M HL LPGDP F M V++ A DL ALQ GGVD +MFS Sbjct: 1 MLWTEKMFGVKKPIITMLHLDPLPGDPRFHYGDTMERVVEHARADLHALQEGGVDGIMFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFSLPY + T AAMAR+IG+L S+IR+P+GV+ + D AS +LA A AKFIR Sbjct: 61 NEFSLPYERHMSFVTPAAMARVIGELKSEIRVPYGVDCISDGQASIELAAAVDAKFIRGT 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F+G Y D G ++ + +R + + ++K L+ I PE+ + R + IAKST+F Sbjct: 121 FSGVYVGDGGFYNNDFSALLRRKAALHLDDLKMLYFINPESDRSMDTRPLVDIAKSTIFK 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 HPD LC+S AG D L+ VK PD VVL NTG + +E +L+ AD V T Sbjct: 181 AHPDGLCISANAAGQDVDDELIASVKSGAPDVVVLCNTGCRPDTIERKLTTADAAVVGTY 240 Query: 241 FKKDGVFAN------FVDQARVSQFMEKVHHIR 267 FK+ G N VD RV +FME VH R Sbjct: 241 FKEGGKLENDKLENVRVDVNRVKEFMEVVHRFR 273 >UniRef50_Q2RUX8 Photosystem I assembly BtpA n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RUX8_RHORT Length = 267 Score = 223 bits (567), Expect = 8e-57, Method: Compositional matrix adjust. Identities = 124/254 (48%), Positives = 158/254 (62%), Gaps = 1/254 (0%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 +KAVIAM H+ ALPG P +DA GM +ID D+ LQ GGV A+MF NE PY + Sbjct: 10 KKAVIAMAHIGALPGTPLYDADGGMMKLIDDVVGDIEKLQKGGVHAIMFGNENDRPYQFE 69 Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFG 130 + AAM II + + +PFGVN LWDP AS +A+ATGA F REIFTG +ASD G Sbjct: 70 APIASVAAMTAIISAVRPMLSVPFGVNYLWDPAASVAIAVATGASFAREIFTGVFASDMG 129 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 VW N E +R + + ++K LFNI E A L +R I A+S +F++ DA+ VSG Sbjct: 130 VWSPNAAEALRLRRNLHRPDLKLLFNINAEFASSLDSRSIGLRARSAIFSSLADAILVSG 189 Query: 191 LTAGTRTDSALLKRVKETVPDTVVL-ANTGVCLENVEEQLSIADGCVTATTFKKDGVFAN 249 G ++ L+ V+E + V L ANTGV LENV++ LSIADGCV T FK DG N Sbjct: 190 PLTGQPAQASDLREVREAIGTEVPLFANTGVRLENVDDVLSIADGCVIGTHFKVDGSTWN 249 Query: 250 FVDQARVSQFMEKV 263 VD RVS+FM+KV Sbjct: 250 RVDGGRVSRFMDKV 263 >UniRef50_Q28QI3 Photosystem I assembly BtpA n=5 Tax=Alphaproteobacteria RepID=Q28QI3_JANSC Length = 267 Score = 220 bits (561), Expect = 3e-56, Method: Compositional matrix adjust. Identities = 126/267 (47%), Positives = 161/267 (60%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M ++V GT K VIAM HL A+PG P DA G+ ++ A DL ALQ GVDAVMF Sbjct: 1 MQKFRDVFGTPKPVIAMVHLGAMPGTPLHDADAGLEGLVAAAAADLSALQAAGVDAVMFG 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE PY V +TA MA +IGQL I +PFGVNVLWDP ++ LA ATGA+F REI Sbjct: 61 NENDRPYEFAVDTASTATMAYVIGQLRGQITVPFGVNVLWDPDSTIALAAATGAQFCREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 FTG YASD GVW + G +R++ R+G ++ L+N+ E A L R + A+S VF+ Sbjct: 121 FTGTYASDMGVWAPDAGRALRYRKRLGRDDLAMLYNVSAEFADSLDKRPLPDRARSAVFS 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + PDA+ VSG G L+ VK +P+T VLANTGV + V E L IADGC+ ++ Sbjct: 181 SVPDAVLVSGQITGEAARMEDLEAVKAVLPETPVLANTGVKHDTVAEVLRIADGCIVGSS 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIR 267 K DG N VD R FM++ R Sbjct: 241 LKVDGHTWNAVDPDRAKDFMDRARASR 267 >UniRef50_A4E7P3 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E7P3_9ACTN Length = 274 Score = 175 bits (444), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 100/268 (37%), Positives = 142/268 (52%), Gaps = 2/268 (0%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS+L + TEK VI M HLR LPGDP + ++ V++ A DL ALQ GGVD ++ + Sbjct: 6 MSFLTSMFKTEKPVIGMLHLRPLPGDPLYYPGGSVSQVVEAAKRDLEALQQGGVDGILIT 65 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE S+PY V P T A++ +IG L D+ P+G ++D A+ +L A A+F R Sbjct: 66 NELSMPYEQHVSPSTLASVGYVIGTLSHDLSTPWGAEAIYDGDATIELCAAVDAQFTRCN 125 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F GA+A D G+ + + T+R + + ++K I E VYL +R IA S +FN Sbjct: 126 FCGAWAGDLGLINRDFAHTMRRKAALRLDDLKLFHFITSEGEVYLNDRTTADIADSLLFN 185 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLEN-VEEQLSIADGCVTAT 239 PDA+ + G AG L V+E V + V+ TG C EN V + + DG T Sbjct: 186 CLPDAMVIGGSAAGRGASGELADEVRERVGEVPVVCGTG-CRENTVADVFAHYDGAFVGT 244 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHHIR 267 K+DG VD RV++FM R Sbjct: 245 CLKRDGRLDAPVDVERVARFMAAARTAR 272 >UniRef50_B9CKX7 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CKX7_9ACTN Length = 270 Score = 137 bits (346), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 87/270 (32%), Positives = 138/270 (51%), Gaps = 4/270 (1%) Query: 1 MSWLKE----VIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDA 56 MS L E G+ +I H++ALPG P D+++ + I++ D LQ+ G DA Sbjct: 1 MSTLLEKHYATFGSSCPIIGCLHMQALPGTPFSDSKITLKNQIERLKRDAYTLQDAGFDA 60 Query: 57 VMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF 116 V+F+NE PY+T V +T A RI +++ ++ IP+G VL DP A+ A A AKF Sbjct: 61 VVFANEGDRPYITPVGFDTVANYVRIATEVIEELSIPYGCGVLIDPFATLAAAKALEAKF 120 Query: 117 IREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS 176 +R TG+Y FG N GE R+Q +I A +V+ P A L R + ++ Sbjct: 121 VRTYVTGSYEGLFGSQKFNPGEIFRYQKQIEATDVRVYTYFEPHAGTCLDVRSSEEMLEA 180 Query: 177 TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCV 236 + N G AG +++ + R+KE + ++ +G EN+ + L ADG + Sbjct: 181 GIANLPIAGALFGGAHAGLPPEASHIVRLKEEFTEVPLIIGSGGTAENISKLLPHADGVI 240 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHHI 266 T+ KKDG+ N VD R +F++ ++ Sbjct: 241 VGTSIKKDGILWNNVDPVRAKRFVKAAKNL 270 >UniRef50_B9XI59 Photosystem I assembly BtpA n=1 Tax=bacterium Ellin514 RepID=B9XI59_9BACT Length = 262 Score = 129 bits (324), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 87/258 (33%), Positives = 132/258 (51%), Gaps = 6/258 (2%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYL-T 69 K +I + HL LPG P + +G V KA D + + GG DAV N +P+ + Sbjct: 7 RKVLIGVVHLGPLPGAPRWQGDIGA--VARKAVADARSYEQGGADAVFIENFGDVPFTKS 64 Query: 70 KVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYAS 127 V PET AAMA + + + +++P G NVL D A+ L A G F+R + TGA + Sbjct: 65 AVGPETVAAMAALGCAVRAAVKLPIGFNVLRNDARAALGLCAACGGSFVRVNVHTGAMLT 124 Query: 128 DFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC 187 D G+ + N +T+R++ I G + ++ + AV LG+ I AK T+ DAL Sbjct: 125 DQGLIEGNAYDTMRYREAISPG-TQVFADVHVKHAVPLGSWTIEDSAKDTIERGLADALI 183 Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVF 247 VSG G + L+RV+ P+ +L +GV LEN + L +ADG + ++ K+ G Sbjct: 184 VSGTGTGVAVNLDDLRRVRAACPEAKILLGSGVTLENAGDFLQLADGFIVGSSLKRGGKL 243 Query: 248 ANFVDQARVSQFMEKVHH 265 AN VD RV+ + Sbjct: 244 ANPVDAKRVAALARAMRR 261 >UniRef50_A9W9X3 Photosystem I assembly BtpA n=4 Tax=Chloroflexaceae RepID=A9W9X3_CHLAA Length = 284 Score = 112 bits (280), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 82/261 (31%), Positives = 133/261 (50%), Gaps = 8/261 (3%) Query: 6 EVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSL 65 E+ T K +I M H LPG P + GM +I+ A D AL GG D ++ N + + Sbjct: 19 EMFRTAKPIIGMVHCWPLPGAPGYTG-YGMQTIIEHAIRDAEALAEGGCDGLIVENMWDI 77 Query: 66 PYLT--KVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDP-VASFDLAMATGAKFIRE-I 120 P+ V PE+ AA A + + + +P G+N++ + VA +A+A GA FIR + Sbjct: 78 PFRAGPHVPPESIAAQAVVAHAVRQAVPELPLGINLVHNGGVALLGIAIAAGASFIRVCM 137 Query: 121 FTGAYASDFGVWDTN-VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 FTGA D G WD + +R + + A +K ++ + +V D+ + + T F Sbjct: 138 FTGAGVWDAGSWDEGCAADLMRRRKELHAESIKIFADVDKKHSVRFPGIDLVTHIEWTRF 197 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 DA+ VSG G D A +++ +E DT +L +G +N+ + +ADG + + Sbjct: 198 FGA-DAIIVSGRMTGDAPDIAKVRQARELAGDTPILLGSGTTEQNIAAFMEVADGVIVGS 256 Query: 240 TFKKDGVFANFVDQARVSQFM 260 + K+DG AN VD RV +F+ Sbjct: 257 SIKQDGEIANPVDVNRVRRFV 277 >UniRef50_Q8TVC9 Predicted TIM-barrel enzyme n=1 Tax=Methanopyrus kandleri RepID=Q8TVC9_METKA Length = 271 Score = 106 bits (265), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 86/259 (33%), Positives = 127/259 (49%), Gaps = 13/259 (5%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 V+ + HL LPG P + + V+++A D L++GGVDAV+ N PY P Sbjct: 15 VVGVVHLPPLPGSPRAKS---IEEVVERARRDAARLEDGGVDAVLVENFGDTPYYPDDVP 71 Query: 74 E-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFG 130 + T A M R + +++ + +P GVNVL D VA+ D+ ATGA FIR + A A+D G Sbjct: 72 KITVACMTRAVAEVVDTVSVPVGVNVLRNDGVAAVDVCAATGASFIRVNAYVEAVATDQG 131 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V R R+G +V+ +I + L +R + +A+ V DA+ V+G Sbjct: 132 VLQPVAHMVWREIDRLGV-DVEVYADIRVKHGRPLDDRPVEEVARDAVERGLADAVIVTG 190 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTFKKDGVFAN 249 G+ +++V V VL +GV EN L A G + T FKK+G+ N Sbjct: 191 SATGSPPRPEEVRKVARVV--DRVLVGSGVTPENAHVFLRAGAAGFIVGTYFKKNGITEN 248 Query: 250 FVDQARVSQFMEKVHHIRR 268 VD RV + V IRR Sbjct: 249 PVDVDRVREL---VRFIRR 264 >UniRef50_P72966 Photosystem I biogenesis protein btpA n=34 Tax=Cyanobacteria RepID=BTPA_SYNY3 Length = 287 Score = 105 bits (261), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 73/255 (28%), Positives = 126/255 (49%), Gaps = 6/255 (2%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY-L 68 T VI + HL LP + L VI++A + AL GGVD ++ N F P+ Sbjct: 9 THNPVIGVVHLLPLPTSARWGGNL--TAVIERAEQEATALAAGGVDGIIVENFFDAPFPK 66 Query: 69 TKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYA 126 +V P +AM I+ +L + + P G+NVL D ++ +A GAKFIR + TG A Sbjct: 67 QRVDPAVVSAMTLIVDRLQNLVVAPVGINVLRNDAHSALAIASCVGAKFIRVNVLTGVMA 126 Query: 127 SDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL 186 +D G+ + N E +R++ + + +V L +++ + A LG ++ + T+ D + Sbjct: 127 TDQGLIEGNAHELLRYRRELSS-DVAILADVLVKHARPLGTPNLTTAVTDTIERGLADGI 185 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGV 246 +SG G+ + L+ T V +G +N+ + + A+G + A++ K+ G Sbjct: 186 ILSGWATGSPPNLEDLELATNAAKGTPVFIGSGADEDNIGQLIQAANGVIVASSLKRHGN 245 Query: 247 FANFVDQARVSQFME 261 +D RVS F+E Sbjct: 246 INEAIDPIRVSAFIE 260 >UniRef50_A3KNP0 Zgc:162297 protein n=7 Tax=Coelomata RepID=A3KNP0_DANRE Length = 268 Score = 103 bits (258), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 82/273 (30%), Positives = 133/273 (48%), Gaps = 10/273 (3%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M +L + +I M H+RALPG P + ++ + ++A + N G+D ++ Sbjct: 1 MKFLNLFGRLQSNIIGMIHVRALPGTPL--NRFTISDIKEEACREAEIYYNAGLDGLIIE 58 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVAS-FDLAMATGAKFIR 118 N +PY V PE A M + + P GV +L S +A+A+G FIR Sbjct: 59 NMHDIPYTLDVGPEVCACMTAVCTAVRGLYPSWPLGVQILSAANHSALAVALASGLDFIR 118 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 E F ++ +D G+ + GE +R++ IGA V+ +I + + + D+ SIA++ Sbjct: 119 AEGFVFSHVADEGLLNACAGELLRYRKCIGAEHVQIFTDIKKKHSAHALTADV-SIAETA 177 Query: 178 VFNNH--PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 D + V+G G + D L+ V ++V VL +GV +NVE L A Sbjct: 178 QAAEFFLSDGVVVTGSATGAKADPQELREVSQSV-RIPVLIGSGVTDDNVEHYLQ-ASAM 235 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + FKK G +AN VD RV +FM K+H +R Sbjct: 236 IIGSHFKKGGYWANGVDAERVKRFMGKMHKLRE 268 >UniRef50_A8AAQ2 Photosystem I assembly BtpA n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8AAQ2_IGNH4 Length = 268 Score = 103 bits (257), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 80/256 (31%), Positives = 130/256 (50%), Gaps = 9/256 (3%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 VI + HL LPG S+ + VI++A D AL+ GGVDA++ N P+ +V Sbjct: 3 VIGVVHLLPLPG--SYGWGGDFDAVIERAVKDAKALEKGGVDAIIIENFMDYPFPIRVDY 60 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWDPVA-SFDLAMATGAKFIREIFTGAYASDF--G 130 T AA R++ +++ + + GV++L + + +A+A+GAKF+R + SD G Sbjct: 61 VTVAAATRVVTEVVRSLELSAGVSLLRNSAPEAIAVALASGAKFVRS-NQWCWTSDAPEG 119 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 + E + R GA +V + ++ + A + RD+C A+ DAL VSG Sbjct: 120 LLTPVAREGLEVMRRWGA-KVGVVADVRVKHAAPISGRDLCDEARDLGGRCRADALAVSG 178 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANF 250 G+ D L+ VK P V++A +G+ ENV + + ADG + T FK+ GV N Sbjct: 179 AATGSEADPRQLEVVKTCTPKPVLVA-SGITPENV-VRFASADGVIVGTYFKEGGVTENP 236 Query: 251 VDQARVSQFMEKVHHI 266 VD RV + ++ + Sbjct: 237 VDVHRVRKLVDAAKRL 252 >UniRef50_Q5JHL2 Uncharacterized protein TK2179 n=5 Tax=Euryarchaeota RepID=Y2179_PYRKO Length = 261 Score = 102 bits (255), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 76/259 (29%), Positives = 124/259 (47%), Gaps = 8/259 (3%) Query: 12 KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKV 71 K +I M HL+ LPG ++ + VI+ A D + L+ G DAVM N +P+ Sbjct: 6 KPLIGMVHLKPLPGSYLYNGDF--DSVIEAALRDAVTLEEAGFDAVMVENFGDVPFPKYA 63 Query: 72 RPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDF 129 T A++A + + ++ +P GVNVL D +A++ +A A A FIR + +G +D Sbjct: 64 DKTTVASLAVVAKAIRDEVSLPLGVNVLRNDGIAAYSIAYAVKADFIRVNVLSGVAYTDQ 123 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 G+ + E + R+ + E+K ++ + AV+ G+ + + TV DA+ VS Sbjct: 124 GIIEGIAHELAMLRKRLPS-EIKVFADVHVKHAVHFGDFEDAFL--DTVERGLADAVVVS 180 Query: 190 GLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFAN 249 G G D L KE P V+ +G +N+ E ADG + T K+DG N Sbjct: 181 GKATGRPVDVDKLALAKEISP-VPVIVGSGTSYDNLPELWKYADGFIVGTWIKRDGRVEN 239 Query: 250 FVDQARVSQFMEKVHHIRR 268 V R + +E +R+ Sbjct: 240 EVSLERARKLVELAKELRQ 258 >UniRef50_C5ELG9 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ELG9_9FIRM Length = 275 Score = 99.0 bits (245), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 85/278 (30%), Positives = 131/278 (47%), Gaps = 17/278 (6%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFD-AQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M L+ + +K +I M HLR LPG P +D A + M +++ A D+ LQ+ GVD V Sbjct: 1 MRQLQSIFREKKPIIGMVHLRPLPGSPMYDPASMDMTKILEIAVDEAKKLQDAGVDGVQV 60 Query: 60 SNEFSLPYLTKVRPE-----TTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATG 113 N + +PY RPE T AA+A I ++ + IP G + + A A G Sbjct: 61 ENMWDIPY---NRPEDIGYETAAALAVGIYEVGKHVSIPVGAECHMNGAECAMASAAAAG 117 Query: 114 AKFIREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNI-VPEAAVYLGNRDIC 171 A++IR + A+ S G + G R + R+ AG + L ++ V + Y+ + Sbjct: 118 ARWIRVFEWCNAFISQSGFVNGAGGRVSRMRDRLKAGHILALCDVNVKHGSHYIIHDRSV 177 Query: 172 SIAKSTVFNNHPDALCVSGLTAG--TRTDSALLKRVKETVPDTVVLANTGVCLENVEEQL 229 + DA+ V+G G D L + +P VL +G+ EN+ E L Sbjct: 178 KEQAMDIEAQGGDAVIVTGFDTGMPPTVDKVLECKAAIGIP---VLLGSGLAEENITELL 234 Query: 230 SIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 S ADG + +TFK G + N VD R FM++V +R Sbjct: 235 SAADGAIVGSTFKAQGKWQNPVDYYRTKAFMDRVVKLR 272 >UniRef50_C3ZBU0 Putative uncharacterized protein n=2 Tax=Metazoa RepID=C3ZBU0_BRAFL Length = 279 Score = 97.4 bits (241), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 86/281 (30%), Positives = 135/281 (48%), Gaps = 18/281 (6%) Query: 1 MSWLKEVIGT-EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M ++V G + A + M H+ ALPG P +G +IDKA + + G+DAVM Sbjct: 1 MQRFQKVFGRLQAAAVGMVHVGALPGTPRSSETVG--QLIDKACKEAEIYKRAGLDAVMV 58 Query: 60 SNEFSLPYLT--KVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDP-VASFDLAMATGAK 115 N +PYL V E TAAM + ++ R+P GV VL + +A+ATG Sbjct: 59 ENMHDVPYLLGGDVGHEVTAAMTAVCREVRRVCPRLPCGVQVLSAANKQALAVALATGYV 118 Query: 116 FIREIFTGA------YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD 169 R S G ++ G+ +R++ +IGA + +I + + + D Sbjct: 119 PCRSGLRACGRVCVLPCSRRGAVNSCAGDLLRYRTQIGADSIMVFTDIKKKHSSHAITAD 178 Query: 170 ICSIAKSTVFNNH--PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEE 227 + SIA + D + V+G G DS LK V++ V D VL +GV EN+ Sbjct: 179 V-SIADTARAAEFFLSDGVIVTGTETGRPVDSKELKEVRQAV-DIPVLVGSGVSTENLPT 236 Query: 228 QLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 L A+G + + FKK G++ N VD RV+ FM+++ +R+ Sbjct: 237 YLR-ANGLIVGSYFKKHGLWQNEVDLDRVNMFMDRLSTLRQ 276 >UniRef50_A6NZB9 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NZB9_9BACE Length = 266 Score = 97.4 bits (241), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 81/256 (31%), Positives = 120/256 (46%), Gaps = 8/256 (3%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 +K VI M HL+ALPG P + M+ + A +DL AL+ GGVDA + N PY Sbjct: 14 QKPVIGMVHLQALPGAPGYGGS--MDEIYRAAVEDLHALEQGGVDAAIVENFGDTPYALN 71 Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV-ASFDLAMATGAKFIR-EIFTGAYASD 128 T AAM + QL ++ + G+NV ++ A + +A A G FIR E Sbjct: 72 HELITLAAMTALAVQLRAESSLRLGLNVQFNCTEAEWGIAYAAGYDFIRVEALVENRVGV 131 Query: 129 FGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDI-CSIAKSTVFNNHPDALC 187 GV +R + R A E L +I + + + + SI ++ AL Sbjct: 132 HGVAFAAAPSLLRLKSRYPA-ETMLLADINVKHTYPMVEQPLDASIHEAK--EAGAGALI 188 Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVF 247 V+G+ G + R KE +T VL +G+ EN IADG + ++FK++G Sbjct: 189 VTGVVTGQNPSLEDVCRCKELAGETPVLLGSGIHQENAAAFFQIADGAIVGSSFKENGDV 248 Query: 248 ANFVDQARVSQFMEKV 263 N VD RV +FME + Sbjct: 249 RNKVDTGRVRRFMEAL 264 >UniRef50_O29828 Uncharacterized protein AF_0419 n=1 Tax=Archaeoglobus fulgidus RepID=Y419_ARCFU Length = 246 Score = 95.9 bits (237), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 82/254 (32%), Positives = 126/254 (49%), Gaps = 18/254 (7%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 EK VI + HL LPG P ++ VIDKA D A++ GG DA++ N P+L + Sbjct: 2 EKTVIGVVHLLPLPGSPE---HTDLSAVIDKAVKDARAIEEGGADALILENYGDKPFLKE 58 Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR--EIFTGAYAS 127 V ET AAM I ++ D+ I G+NVL D VA+ +A A A F+R ++F + + Sbjct: 59 VGKETVAAMTVIACEVKRDVSIGLGINVLRNDAVAALAIAKAVNADFVRVNQLFFTSVSP 118 Query: 128 DFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIAKSTVFNNHPDAL 186 + G+ + GE +R++ + +I + AV+ + D C A+ ++ DA+ Sbjct: 119 E-GILEGKAGEVMRYKKLVDC-RAMIFADIAVKHAVHFASLEDYCLNAERSL----ADAV 172 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGV 246 ++G T G LK K+T+ VLA +GV EN L DG + T K+ G Sbjct: 173 ILTGKTTGGEVSLEELKYAKKTLK-MPVLAGSGVNAENAARILKWCDGVIVGTYIKRGG- 230 Query: 247 FANFVDQARVSQFM 260 VD RV + + Sbjct: 231 ---LVDAERVRRIV 241 >UniRef50_A8S303 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S303_9CLOT Length = 274 Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 73/274 (26%), Positives = 124/274 (45%), Gaps = 7/274 (2%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFD-AQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M LK+V +K +I M HLR LPG P +D +GM+ +I A ++ L+ GVD V Sbjct: 1 MGKLKDVFKVDKPIIGMVHLRPLPGSPKYDPVNMGMDKIISIALEEAAMLEQAGVDGVQV 60 Query: 60 SNEFSLPYLTK--VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFI 117 N + +PYL + ET AA+A I + + + IP G + Sbjct: 61 ENMWDIPYLRSEDIGYETAAALAVGIHAVRNKVSIPVGAECHMNGADCAMACAVAAGASW 120 Query: 118 REIFT--GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNI-VPEAAVYLGNRDICSIA 174 +F A+ S G + R + R+ A ++ L ++ V + Y+ + + Sbjct: 121 IRVFEWCNAFVSQSGFINAMGANVSRMRSRLKADQILALCDVNVKHGSHYIIHDRSVAEQ 180 Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADG 234 + + DA+ V+G GT + + K++ +L +G+ NV E L+ ADG Sbjct: 181 AMDIESQDGDAVIVTGFDTGTPPSVENISKCKKST-SLPILIGSGLNSSNVNELLTAADG 239 Query: 235 CVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + FK+ + N V R +FM+KV +R+ Sbjct: 240 AIIGSWFKEGNNWKNPVSYDRTKEFMDKVIALRQ 273 >UniRef50_UPI0000D55C2D PREDICTED: similar to conserved hypothetical protein n=1 Tax=Tribolium castaneum RepID=UPI0000D55C2D Length = 270 Score = 94.4 bits (233), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 78/263 (29%), Positives = 135/263 (51%), Gaps = 17/263 (6%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 T+ AV+ M H+ ALPG P + + + ++ KA + G+D+++ N +PY+ Sbjct: 11 TKCAVVGMVHVGALPGTPLCNKSV--DSLVFKACKEAEMYLKYGLDSILVENMHDVPYIQ 68 Query: 70 K--VRPETTAAMARIIGQL--MSDIRIPFGVNVL-WDPVASFDLAMATGAKFIR-EIFTG 123 PET A M R+ ++ ++ +P GV VL + + +A A FIR E F Sbjct: 69 SKYFTPETVATMTRVCTEIRKIAPGTVPCGVQVLACGNLEALAVAKACNFDFIRAEGFVF 128 Query: 124 AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI--AKSTVFNN 181 + +D G D N G +R++ +I A V L +I + + + D+ + A++ F Sbjct: 129 GHVADEGYTDANAGLILRYRRQIQAENVLILADIKKKHSSHAITSDVSLVETAQAAQFF- 187 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKE--TVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 D L ++G+ G+ + + L +VK+ ++P VL +GV +N+ + + ADG + + Sbjct: 188 QADGLILTGVATGSPANVSELSQVKKFCSLP---VLVGSGVTGDNLGDYMG-ADGVIVGS 243 Query: 240 TFKKDGVFANFVDQARVSQFMEK 262 FKK GV+ VD+ RV FMEK Sbjct: 244 YFKKGGVWYEDVDEERVRNFMEK 266 >UniRef50_D2RQS6 Photosystem I assembly BtpA n=4 Tax=Halobacteriaceae RepID=D2RQS6_9EURY Length = 278 Score = 93.2 bits (230), Expect = 9e-18, Method: Compositional matrix adjust. Identities = 79/270 (29%), Positives = 127/270 (47%), Gaps = 13/270 (4%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 ++ L+ ++ V+ M HL +PG P ++ + V D+A +D L+ GGVD ++ Sbjct: 4 ITPLRTRFDADRPVVGMVHLPPVPGAPGYEGD--RDAVRDRALEDARRLEAGGVDGIVLE 61 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSD-IRIPFGVNVLW-DPVASFDLAMATGAKFIR 118 N P+ P+ A + ++D + +P G+NVL D A+ +A A A+F+R Sbjct: 62 NFGDAPFYPDDVPKHVVAEMTAVATAVTDAVDVPLGINVLRNDADAALSIAAAVDAEFVR 121 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 + G A+D GV + ET+R + RI A +V L ++ + A +G+R I A Sbjct: 122 VNVHVGTAATDQGVLEGRAHETLRLRDRIDA-DVAILADVHVKHATPIGDRSIDRAALEA 180 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKE------TVPDTVVLANTGVCLENVEEQLSI 231 V D + VSG G T ++RV T T V +GV E V + L+ Sbjct: 181 VERGRADGVIVSGPGTGDETALEDVERVAAALDGAGTAGRTSVFVGSGVTSETVGDCLAA 240 Query: 232 -ADGCVTATTFKKDGVFANFVDQARVSQFM 260 ADG + T K+ G N V + RV + Sbjct: 241 GADGVIVGTALKEGGETTNPVSRERVKALV 270 >UniRef50_A4YFU5 Photosystem I assembly BtpA n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YFU5_METS5 Length = 258 Score = 92.8 bits (229), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 73/240 (30%), Positives = 117/240 (48%), Gaps = 8/240 (3%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT-KVR 72 + M HL LPG P L ++ A + LQ+ GVDAV+ N P+ + Sbjct: 6 IAGMIHLPPLPGSPRGGQPL--EEIVKYAVTEADKLQSAGVDAVIVENLGDYPFFKDNMP 63 Query: 73 PETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDFG 130 P T A+M+ I+ ++ + + GVNVL + + +F LA GA FIR I GAYA+D G Sbjct: 64 PITVASMSVIVREVRRKLGLQVGVNVLRNGCIDAFSLAHVNGADFIRCNILIGAYATDQG 123 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V + E +R + + + V+ L ++ + A L N +A+ DA+ VSG Sbjct: 124 VIEGRAAELLRLKRSLNS-RVRILADVHVKHAYPLYNLPTELVAQDLAERGGADAVIVSG 182 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT-FKKDGVFAN 249 + +K+VKE+V V+ +G+ L N +E +ADG + FK++G+ Sbjct: 183 PRSSLPPSIETVKKVKESV-QVPVIVGSGISLGNFKEFCGVADGLIVGEVDFKENGMIGG 241 >UniRef50_UPI000155D1C1 PREDICTED: hypothetical protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155D1C1 Length = 261 Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 69/230 (30%), Positives = 113/230 (49%), Gaps = 8/230 (3%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQL-MSDIRIPFGVNVLWD 101 W D ++ D ++ N LPY PE TA M + + M+ R+P GV VL Sbjct: 34 WRDGDGVRFPPQDGLIVENMHDLPYTASAGPEVTATMTAVCAAVRMTCPRLPLGVQVLCS 93 Query: 102 P-VASFDLAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 + +A+A G FIR E F ++ +D G + G+ +R++ RIGA V+ +I Sbjct: 94 ANQEAVAVALAAGCDFIRAEGFVFSHVADEGFVNACAGDLLRYRRRIGAEHVQIFADIKK 153 Query: 160 EAAVYLGNRDIC--SIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLAN 217 + + + D+ AK+ F D + ++G G D L V++ V + +L Sbjct: 154 KHSAHALTADVSVSETAKAAEFF-LADGVILTGPATGVEADPGELHEVEQAV-NIPLLIG 211 Query: 218 TGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 +GV LENV+ L+ A+ + + FK+ G +AN +D RV FM+ V +R Sbjct: 212 SGVTLENVKSYLN-ANALIIGSYFKEGGYWANQIDPTRVKTFMDHVRKLR 260 >UniRef50_D2QXT9 Photosystem I assembly BtpA n=2 Tax=Bacteria RepID=D2QXT9_9PLAN Length = 266 Score = 90.1 bits (222), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 73/251 (29%), Positives = 116/251 (46%), Gaps = 6/251 (2%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY-LTKVR 72 VIAM HL LPG P + L ++ + + + L G +M N +P T+V Sbjct: 13 VIAMLHLPPLPGSPR--SALSISAITEHVCREAEMLTALGAAGLMLENFGDMPLPATQVS 70 Query: 73 PETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFG 130 P T A M+RI + +P G+NVL D +A+ +A A GA FIR I GA +D G Sbjct: 71 PATVAQMSRIAAAVRMASSLPLGINVLRNDSLAAMAIASAVGASFIRVNILVGARLTDQG 130 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 + E +R + +GA E++ ++ + + L + ++T+ DAL V+G Sbjct: 131 IIAGRADELLRLRKSLGAEEIQIWADVNVKHSWPLAPVSLEEETENTIRRGLADALIVTG 190 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANF 250 G TD L+ V T VL +GV +++ A G + + K G + Sbjct: 191 RGTGYETDPHELQAVISAAAGTPVLVGSGVTADSL-ANFQGASGAIVGSWIKHQGDARSP 249 Query: 251 VDQARVSQFME 261 +D RV + M+ Sbjct: 250 IDPERVRRLMQ 260 >UniRef50_UPI000186CA08 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186CA08 Length = 278 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 68/268 (25%), Positives = 140/268 (52%), Gaps = 12/268 (4%) Query: 1 MSWLKEVIG-TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 MS L +++ T +I M H++ALPG P + +L +N +I++A +D+ ++ V++++ Sbjct: 1 MSKLPDLLKMTRPYIIGMVHVKALPGTP--NNKLNINSLIEEACNDVEIYKSCNVNSILV 58 Query: 60 SNEFSLPYL--TKVRPETTAAMARIIGQLMSDI--RIPFGVNVLWDP-VASFDLAMATGA 114 N +PY+ V PE A+M +I ++ + + + GV +L + +A A Sbjct: 59 ENMHDVPYVQSKSVGPEIIASMTKICSEIKNILPRHMTCGVQILAGANKEALAVAQAAEL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 ++IR E + ++ +D G+ ++ GE +R++ IGA + +I + + D+ + Sbjct: 119 QYIRAEGYVFSHIADEGLMNSCAGELLRYRKYIGAENISIWTDIKKKHCSHSITSDLTLV 178 Query: 174 AKSTVFNNH-PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G T G + +++ + +V+ +GV ENV + L+ A Sbjct: 179 ETALAAEFFLSDGIVLTGKTTGNAIRKSDFIKIQNSCSLPIVIG-SGVTAENVADFLN-A 236 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFM 260 + + + FKK+G+++N VD+ RV FM Sbjct: 237 NAIIVGSYFKKEGLWSNEVDKNRVENFM 264 >UniRef50_B5XQK9 BtpA family protein n=18 Tax=Proteobacteria RepID=B5XQK9_KLEP3 Length = 281 Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 68/265 (25%), Positives = 128/265 (48%), Gaps = 13/265 (4%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 ++ + KAVI + H PG P + + ++ ++++A D +GGV ++ N Sbjct: 12 IQAIFSRSKAVIGVIHCDPFPGSPKYRGK-SVSDIVERALRDAENYISGGVHGLIIENHG 70 Query: 64 SLPYLTK--VRPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-E 119 +P+ + ET+A MA I ++ +P G+NVL + + + +A+A GA F+R Sbjct: 71 DIPFSKPEDIGHETSALMAVITEKVRERFAVPLGINVLANAAIPAMAIALAGGADFVRVN 130 Query: 120 IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFN--IVPEAAVYLGNRDICSIAKST 177 + AY ++ G + + +R++ + A ++ + + + + +R I + + Sbjct: 131 QWANAYIANEGFIEGAAAKALRYRSMLRAEHIRVFADSHVKHGSHAIVADRSIQELTRDV 190 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKE--TVPDTVVLANTGVCLENVEEQLSIADGC 235 F DA+ +G G DSA + + E + +L +GV NV++ L G Sbjct: 191 DFFE-ADAVIATGQRTG---DSATMAEIDEIRAATELPLLVGSGVTPANVKQILGRTQGV 246 Query: 236 VTATTFKKDGVFANFVDQARVSQFM 260 + A+T K DGV+ N V+ ARV FM Sbjct: 247 IVASTMKVDGVWWNDVELARVKHFM 271 >UniRef50_B8HZU1 Photosystem I assembly BtpA n=5 Tax=Clostridiales RepID=B8HZU1_CLOCE Length = 262 Score = 87.0 bits (214), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 69/254 (27%), Positives = 121/254 (47%), Gaps = 7/254 (2%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 V+ M H ALPG P F M + D+A + + L+ G+DA++ N + + Sbjct: 12 VMGMVHCLALPGTPDFCGD--MKKITDQAVKEAITLEKSGMDAIIIENMGDNVFGVNMDI 69 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNV-LWDPVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 E + A+A I + ++ IP G++ + D + +A A GA F+R +F G+ Sbjct: 70 EQSCALAAISAIVAQNVNIPIGIDAAMNDYKTALSIAKAIGADFVRIPVFVDTVEFFGGI 129 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNI-VPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 E ++ + I A VK +I V + L + I AK+ DA+ V+G Sbjct: 130 IQPCAREAMKFRKNIEAENVKIFADIQVKHTHMVLPHVSIEDSAKAAEACG-ADAIIVTG 188 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANF 250 G T ++KRVK+ + V+A +GV N++EQL IADG + ++ K+ G N Sbjct: 189 THIGVETPIDIIKRVKKVI-SIPVIAGSGVKTNNIKEQLGIADGAIVGSSLKEGGNIKNP 247 Query: 251 VDQARVSQFMEKVH 264 + ++ ++ ++ Sbjct: 248 ISLELCTELIKALN 261 >UniRef50_C8S5Y9 Photosystem I assembly BtpA n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8S5Y9_FERPL Length = 249 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 72/235 (30%), Positives = 115/235 (48%), Gaps = 18/235 (7%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 VI HL+ LPG P+F L ID A + + ++N G DA++ N P+ K P Sbjct: 3 VIVSLHLKPLPGSPNF---LNFEDCIDHAVRNAILIENCGADAIIIENFNDKPFFMKAPP 59 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR---EIFTGAYASDF 129 ET A+M+ I+ +++ ++ IP GVNVL D VA+ +A A GAKF+R IF A F Sbjct: 60 ETIASMSVIVREVIREVSIPVGVNVLRNDGVAALAIAKAAGAKFVRVNQMIFPAAMPEGF 119 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIAKSTVFNNHPDALCV 188 + + R+ + K +I + +V L D + + DA+ V Sbjct: 120 A---KPIAAKMARYKRLLNCDAKIFADISVKHSVQLAKIEDFV----DNIDRAYCDAVIV 172 Query: 189 SGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 +G G +++ L+++KE V D V+ +G EN+ + ADG + T K+ Sbjct: 173 TGKKTGKPPEASTLRKIKELV-DVPVILGSGATPENLRKYE--ADGVIVGTYVKE 224 >UniRef50_Q8U2H5 Uncharacterized protein PF0860 n=8 Tax=Euryarchaeota RepID=Y860_PYRFU Length = 262 Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 70/257 (27%), Positives = 117/257 (45%), Gaps = 14/257 (5%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 +K++ ++K +I + HL+ LPG P + VI+ A D + G D ++ N Sbjct: 1 MKDLDFSKKPLIGVVHLKPLPGSPRYGGDF--EEVIEWAIRDAKTYEEAGFDGIIVENFG 58 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIF 121 P+ + E A + + ++ +P G+N L D + ++ +A A G FIR + Sbjct: 59 DSPFSKTLPREVIPAFTVVAKAVKKEVSLPLGINALRNDCIVAYSIAHAVGGSFIRVNVL 118 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 TG +D G+ + E + + RI G++ TL ++ + AV+ N + K TV Sbjct: 119 TGVAFTDQGIIEGCARE-LWNVKRIIGGDILTLADVHVKHAVHFTNFE--DAVKDTVERG 175 Query: 182 HPDALCVSGLTAG---TRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTA 238 D + V+G G + D L KRV ++P VL +GV N ADG + Sbjct: 176 LADGIIVTGRRTGESISLEDLILAKRV-SSIP---VLVGSGVNPRNFRTLFKYADGFIVG 231 Query: 239 TTFKKDGVFANFVDQAR 255 T K++G N V R Sbjct: 232 TWVKENGKINNPVSLER 248 >UniRef50_C5EEU2 Photosystem I assembly BtpA n=2 Tax=Clostridiales RepID=C5EEU2_9FIRM Length = 263 Score = 80.5 bits (197), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 65/256 (25%), Positives = 114/256 (44%), Gaps = 12/256 (4%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 TEK V++M LPG + + ++ ++D+A + + D ++ N +P Sbjct: 7 TEKVVLSMIQPEPLPGSYRH-SDMRIDAIVDRALRETEMVARNHFDGIIVQNMNDMPVKQ 65 Query: 70 KVRPETTAAMARIIGQLMSDIRIP---FGVNVLWDPVASFDLAMATGAKFIR--EIFTGA 124 + PE A M RI ++ R P G+ + WD VA +A A GA F+R +FTGA Sbjct: 66 QSSPEAIAYMTRIAYEIRK--RFPELVMGILMNWDGVAGLCVADAVGADFVRVEHLFTGA 123 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 + G+ + + + R G+ +V ++ + LG + + A V D Sbjct: 124 SVTSAGILEAQCVDIAGVRKRTGS-KVPVYADVYEVHGIPLGRKPVGDAAWECVHEAFAD 182 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 L +SG + ++K + V DT + G +N+ E + DG V+ T+ K+ Sbjct: 183 GLFMSGKS--VEESIRMIKEARPRVKDTPIFLGGGATGDNIHELMRYFDG-VSVATWIKN 239 Query: 245 GVFANFVDQARVSQFM 260 G N +D R +F+ Sbjct: 240 GDMKNPIDPERAKRFI 255 >UniRef50_D2RDD9 Photosystem I assembly BtpA n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RDD9_ARCPR Length = 249 Score = 79.7 bits (195), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 31/256 (12%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL LP P +++ + A D AL G DA++ N P+L +V Sbjct: 3 IIGVLHLDPLPSSPLYES---YEKTFENALKDAKALAEG-CDAIIIENYGDKPFLKEVDR 58 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR--EIFTGAYASDFG 130 T A M+ I ++ + +P G+NVL DP ++ +A A A F+R +++ + + + G Sbjct: 59 VTVACMSVIAWEVKRETGLPVGINVLRNDPFSALAIAKAVNADFVRVNQLYFASLSPE-G 117 Query: 131 VWDTNVGETIRHQHRIGAG-------EVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP 183 + GE +R++ I +VK + V YL N + C Sbjct: 118 FLEGKAGEILRYRRFIDCKAKIYADVKVKHAHHFV-SLEDYLENVERCL----------A 166 Query: 184 DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 DAL V+G G D LK V+ + + V +GV EN+ + + DG + T FKK Sbjct: 167 DALIVTGTATGREVDVEELKAVR-NLTNLPVFVGSGVKPENLHRYVGLCDGVIVGTYFKK 225 Query: 244 DGVFANFVDQARVSQF 259 DG VD RV + Sbjct: 226 DG----RVDVERVRRL 237 >UniRef50_A3K4E4 Putative uncharacterized protein n=1 Tax=Sagittula stellata E-37 RepID=A3K4E4_9RHOB Length = 282 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 69/275 (25%), Positives = 121/275 (44%), Gaps = 15/275 (5%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S L+ + +K +I + HL ALPG P +D + + A D L GGVD +M N Sbjct: 12 SALETLFEKKKPIIGVIHLAALPGAPFYDGAP-LREIYAAAVRDAKTLAAGGVDGIMIEN 70 Query: 62 EFSLPYLTKVRPE-----TTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAK 115 +P+ RPE T A + + + P G+ + + + +A A GA+ Sbjct: 71 AGDMPF---ARPEDIGFETVAFLTAACEAVRGAVDTPIGITCVANGAIPGLAVAKAVGAR 127 Query: 116 FIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPE--AAVYLGNRDICS 172 ++R + AY ++ G + +R++ +I A +V L ++ + A +R I Sbjct: 128 WVRVNQWANAYVANEGFLNGAASAAMRYRAQIAAKDVAVLADVHVKFGAHAITADRTITE 187 Query: 173 IAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 A + D L +G G+ T +++V+ V+ +G+ E V + +A Sbjct: 188 QATDAEWFGA-DVLIATGQRTGSPTQPEEVRQVRAGT-HLPVIVGSGLSPEQVPALMEVA 245 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 DG + K D + N VD ARV + M + +R Sbjct: 246 DGAIVGQWLKVDARWWNPVDPARVERLMTAMDQVR 280 >UniRef50_UPI0001C369E0 btpA family protein n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369E0 Length = 274 Score = 77.0 bits (188), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 68/261 (26%), Positives = 113/261 (43%), Gaps = 15/261 (5%) Query: 15 IAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPE 74 +AM PG + + +ID + +++ ++ G D + N P PE Sbjct: 11 LAMIQPEPFPGSFRHEGK-SFEEIIDISLNEIEMIEANGFDGYIIQNRNDAPVRQHALPE 69 Query: 75 TTAAMARIIGQLMSDIRIP---FGVNVLWDPVASFDLAMATGAKFIR--EIFTGAYASDF 129 TTA M + + R P G+ V WD VAS +A A G+ FIR +TG Sbjct: 70 TTAYMTALARECRR--RFPDMIQGILVDWDGVASLAVADAAGSDFIRVEHTYTGVEVGYA 127 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 G+ + + + + RIG+ ++ ++ L + I A TV N D L + Sbjct: 128 GMMEAQCVDICQFKKRIGS-DIPVYADVQEVHYEQLAGKSIVDNAWDTVMNAFADGLFLG 186 Query: 190 GLTAGTRTD--SALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVF 247 G + + + KR+ E +P + ++G +N+ + L DG V+ T+ K+G Sbjct: 187 GKSCEESIEIIKCVRKRLGERIP---IFLSSGATGDNISKILQYYDG-VSVGTWVKNGNM 242 Query: 248 ANFVDQARVSQFMEKVHHIRR 268 N +D R QFME V R+ Sbjct: 243 RNPIDPVRARQFMEGVKSARK 263 >UniRef50_B9XEW7 Photosystem I assembly BtpA n=1 Tax=bacterium Ellin514 RepID=B9XEW7_9BACT Length = 265 Score = 77.0 bits (188), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 11/260 (4%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 + K +I M H+ ALPG P+ LG + + A + ++ GVD + N +PYL Sbjct: 8 SAKPIIGMIHVGALPGTPANHLSLGK--ITEIAVQEAKIYRDAGVDGIAIENMHDVPYLR 65 Query: 70 K-VRPETTAAMARIIGQLMSDIRIPF-GVNVLWDPVASFDLAMATGA-KFIR-EIFTGAY 125 V PE ++M IIGQ + G+ +L A A ++R E F A+ Sbjct: 66 GGVGPEIVSSMT-IIGQAVKQAFCGVTGIQILAAANREAMAAAHAAALDWVRVEGFVFAH 124 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC--SIAKSTVFNNHP 183 +D G ++ E +R++ +IGA +V+ +I + + + DI A + F Sbjct: 125 VADEGFINSCAAELLRYRKQIGAEKVQVWADIKKKHSSHAITADISLGETAHAAEFMR-A 183 Query: 184 DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 DAL V+G G A + K V+L +G+ N+ + L +ADG + ++FKK Sbjct: 184 DALIVTGPVTGRPPVPADAEETKAHTHLPVIL-GSGMNEANIGQFLPVADGFIVGSSFKK 242 Query: 244 DGVFANFVDQARVSQFMEKV 263 G + N VD +V FM++V Sbjct: 243 AGDWNNPVDSRKVKAFMKRV 262 >UniRef50_Q29E81 GA21203 n=3 Tax=Coelomata RepID=Q29E81_DROPS Length = 275 Score = 75.1 bits (183), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 71/279 (25%), Positives = 126/279 (45%), Gaps = 17/279 (6%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNW--VIDKAWDDLMALQNGGVDAV 57 M ++ G +K VI M H+ ALPG P + +W I+KA + + +DAV Sbjct: 1 MRRFLKIFGQQKCKVIGMIHVDALPGTPRYAG----HWKETIEKAIYEANLYKRHQLDAV 56 Query: 58 MFSNEFSLPYLTK--VRPETTAAMARIIGQLMSDI---RIPFGVNVL-WDPVASFDLAMA 111 + N +PY+ + + E TA M R+ GQ + D+ IP GV VL + +A A Sbjct: 57 LIENMHDIPYVPERLLGAEITACMTRL-GQAVRDVIPKEIPCGVQVLACGNKQALAIAKA 115 Query: 112 TGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDI 170 + +FIR E F + +D G D G+ +R++ I A +V ++ + + + D+ Sbjct: 116 SQLQFIRSEGFVFGHVADEGYTDACAGDLLRYRKLIDAEDVLIFTDLKKKHSSHAITSDV 175 Query: 171 CSIAKSTVFNNH-PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQL 229 + + D + ++G G L+ + V +L +GV +N+ Sbjct: 176 SLLETAHAAEFFLTDGIVITGTATGHAASPQDLQELSGRV-KVPLLIGSGVTKDNIGLYY 234 Query: 230 SIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 A+ + + FK+ G + + + V FM KV +R+ Sbjct: 235 KDANAVIVGSHFKRHGSWLEEISEEAVENFMRKVCELRQ 273 >UniRef50_Q1NZ26 Uncharacterized protein F13E9.13, mitochondrial n=3 Tax=Caenorhabditis RepID=YSMU_CAEEL Length = 277 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 73/265 (27%), Positives = 123/265 (46%), Gaps = 15/265 (5%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK-VR 72 V M H+ ALPG PS L M+ ++ K + GVD V+ N +PY+ Sbjct: 18 VFGMIHVPALPGTPS--NTLPMSAILKKVRKEADVYFKNGVDGVIVENMHDVPYVKPPAS 75 Query: 73 PETTAAMARIIGQLMS--DIRIP---FGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAY 125 PE ++MA QL+ D P G+ +L + +A TG FIR E F ++ Sbjct: 76 PEIVSSMALASDQLVKSRDAHHPAALTGIQILAAANREALAVAYTTGMDFIRAEGFVYSH 135 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC--SIAKSTVFNNHP 183 +D G D G +R++ + A + +I + + + D+ +AK FN Sbjct: 136 VADEGWIDGCAGGLLRYRSSLKAENIAIFTDIKKKHSAHSVTSDVSIHEMAKDAKFNC-A 194 Query: 184 DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 D + V+G G+ + +V + V + VL +G+ +N E + A G + + FK Sbjct: 195 DGVIVTGSATGSAASLEEMIQVMK-VQEFPVLIGSGINGKNAREFVK-AHGFIVGSDFKI 252 Query: 244 DGVFANFVDQARVSQFMEKVHHIRR 268 G + N +D R+S+FM+ V+ ++R Sbjct: 253 GGEWKNDLDSGRISKFMKHVNTLKR 277 >UniRef50_B9LR39 Photosystem I assembly BtpA n=7 Tax=cellular organisms RepID=B9LR39_HALLT Length = 275 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 81/269 (30%), Positives = 119/269 (44%), Gaps = 11/269 (4%) Query: 9 GTEKAVIAMCHLRALPGDPSF--DAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLP 66 GT+ VI M HL LPG P D M +D+A D AL GGVD +M N P Sbjct: 8 GTDAPVIGMVHLPPLPGAPKAPADGVAAMRDALDRAAADARALDRGGVDGIMVENFGDAP 67 Query: 67 YLTKVRPE-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTG 123 + P+ A++ R + ++ +P G+NVL D A+ +A A A ++R + TG Sbjct: 68 FYPDDAPKHVVASVTRAATAITTETDLPLGINVLRNDAEAALSVAAAVDADYVRVNVHTG 127 Query: 124 AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNI-VPEAAVYLGNRDICSIAKSTVFNNH 182 A +D GV ET+R + R+G +V + V +A T Sbjct: 128 ARVTDQGVVQGKAHETLRLRDRLGV-DVGVFADTDVKHSAPLSAEGYTAESFADTAERGL 186 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVP----DTVVLANTGVCLENVEEQLSIADGCVTA 238 DA+ SG G D L+ V DT VL +GV + V + L++ADG + Sbjct: 187 ADAVIASGRGTGEAMDPEALESVVADRDAHGLDTPVLVGSGVREDTVGDVLAVADGAIVG 246 Query: 239 TTFKKDGVFANFVDQARVSQFMEKVHHIR 267 T K+ G VD RV+ + + +R Sbjct: 247 TALKEGGETTAPVDADRVAALVARADEVR 275 >UniRef50_A5GQP5 Photosystem I biogenesis protein BtpA n=4 Tax=Bacteria RepID=A5GQP5_SYNR3 Length = 275 Score = 70.5 bits (171), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 74/259 (28%), Positives = 117/259 (45%), Gaps = 6/259 (2%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 A+I + HL LPG P + Q V A D A GG D ++ N P+ Sbjct: 15 RPALIGVLHLPPLPGSPRW--QGDFEAVRRFALADAAAYLAGGADGLVVENFGDAPFFAS 72 Query: 71 VRP-ETTAAMARIIGQLMSDIR-IPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYA 126 P T AAMARI +++ +P G+NVL D A+ +A A+GA F+R + +GA Sbjct: 73 AVPSHTVAAMARIAAEVVEAAAGVPVGINVLRNDAHAAMGIAAASGASFVRVNVLSGAML 132 Query: 127 SDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL 186 +D G+ + E +R + ++ A EV +++ + A L + I + + D + Sbjct: 133 TDQGLIEGRAAELLRLRRQLEATEVGIFADVLVKHAYPLAPQPIGEAVEDCLGRAGADGV 192 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGV 246 VSG+ G D L + VL +G N + ADG + A++ K+D + Sbjct: 193 IVSGVATGAAPDPDDLAAARSAAGSAPVLIGSGCHAGNATSLGASADGVIVASSLKRDSL 252 Query: 247 FANFVDQARVSQFMEKVHH 265 AN VD RV + + Sbjct: 253 LANPVDPLRVQALRQTLQR 271 >UniRef50_Q9VS44 CG8607 n=22 Tax=Eukaryota RepID=Q9VS44_DROME Length = 275 Score = 70.1 bits (170), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 64/266 (24%), Positives = 124/266 (46%), Gaps = 18/266 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNW--VIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK- 70 +I M H+ ALPG P + NW I+ A + + +DAV+ N +PY+ + Sbjct: 15 IIGMIHVDALPGTPRYAG----NWKQTIENAIYEANLYKKHQLDAVLIENMHDIPYVPER 70 Query: 71 -VRPETTAAMARIIGQLMSDI---RIPFGVNVL-WDPVASFDLAMATGAKFIR-EIFTGA 124 + E A M R+ G+ + ++ IP GV VL + +A A+ +FIR E F Sbjct: 71 LLGAEIVACMTRL-GRAVREVIPQEIPCGVQVLACGNKQALAIAKASQLQFIRAEGFVFG 129 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI--AKSTVFNNH 182 + +D G D G+ +R++ I A +V ++ + + + D+ + A + F Sbjct: 130 HVADEGFTDACAGDLLRYRKLIDAEDVLIFTDLKKKHSSHAITADVSLLETAHAAEFFM- 188 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFK 242 D + ++G G L+++ V +++ +GV +N++ A + + FK Sbjct: 189 TDGIIITGTATGHAASPEDLQQLSGRVKVPLIIG-SGVTRDNIDSYYKDAHAVIIGSHFK 247 Query: 243 KDGVFANFVDQARVSQFMEKVHHIRR 268 ++G + + + V +FM+K+ +R Sbjct: 248 RNGNWLEEISEPAVDEFMQKICQLRH 273 >UniRef50_A2BLV0 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BLV0_HYPBU Length = 285 Score = 67.4 bits (163), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 65/263 (24%), Positives = 116/263 (44%), Gaps = 13/263 (4%) Query: 12 KAVIAMCHLRALPGDPSF-DAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 K +I M HL P PS+ ++ ++ ++D A + L + G +AV+ N PY Sbjct: 18 KPLIGMIHL---PPTPSYVKDRVDIDRLVDYALWEAGKLADAGFNAVIIENYGDHPYTVT 74 Query: 71 VRPETTAAMARIIGQLMSDI--RIPFGVNVLWDPVA-SFDLAMATGAKFIR-EIFTGAYA 126 + A+ARI ++ ++ G+N+L + + + A+ +GA FIR + Sbjct: 75 APSLSVLAIARIAAEVARTYSGKLRVGINILRNAAPQALEAALVSGASFIRVNSYCELRV 134 Query: 127 SDFGVWD--TNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 S G+ + E IR + R V ++ + + L + I PD Sbjct: 135 SMEGILTPAAYIIERIREELR---APVLVFADVDVKHSAPLATASLEQILHDCARRGRPD 191 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 A+ VSG G + +K VP ++ +G+ ++N+ +ADG + T+ K + Sbjct: 192 AIIVSGSATGEPPSPGYVASIKAMVPYKPIIIGSGISIDNIMAYWRVADGFIVGTSIKLN 251 Query: 245 GVFANFVDQARVSQFMEKVHHIR 267 G N VD+ R Q E V+ +R Sbjct: 252 GKTLNPVDERRARQLAELVNELR 274 >UniRef50_UPI000069ECFE UPI000069ECFE related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI000069ECFE Length = 162 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 50/161 (31%), Positives = 84/161 (52%), Gaps = 7/161 (4%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M +L ++ GT K VI M H++ALPG P ++L + +I++A + +N G+D +M Sbjct: 1 MKFL-QLFGTVKPIVIGMVHVKALPGTPG--SRLPVAQIIEEACHEAEIYKNAGIDGIMV 57 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVL-WDPVASFDLAMATGAKFI 117 N +PY PE TA MA I + +P GV +L + +A+A G FI Sbjct: 58 ENMHDIPYTFNTGPEITATMATICTAVKQACPHLPLGVQILSCANNQALAVALAAGLDFI 117 Query: 118 R-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNI 157 R E + ++ +D G + G+ +R++ IGA ++ +I Sbjct: 118 RAEGYVFSHVADEGFVNACAGDLLRYRKAIGAEHIQIFADI 158 >UniRef50_C0ZRZ3 Putative uncharacterized protein n=2 Tax=Rhodococcus erythropolis RepID=C0ZRZ3_RHOE4 Length = 280 Score = 63.5 bits (153), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 69/254 (27%), Positives = 115/254 (45%), Gaps = 17/254 (6%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S L E+ ++I HL ALPG P + Q ++ + A ++ A + G D V+ N Sbjct: 14 SALAEMFTGTPSLIGAIHLPALPGSPHYTGQP-VSEIARFAVEEAHAYVDNGFDGVIVEN 72 Query: 62 EFSLPYLTKVRP--ETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR 118 + +P+L ET A+M I ++ + GV++L + A A GA F+R Sbjct: 73 HWDIPFLKPGEHGYETAASMGVITAAVVGEFGKAVGVSILSNAGECGVAAAWAAGASFVR 132 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 + AY ++ G + +T R +HRIGA V+ A V++ + +A T Sbjct: 133 VNQWANAYIANEGFIEGQAAKTTRFRHRIGADPVRIF------ADVHVKHGAHAIVADRT 186 Query: 178 VFNNHPDALCVSG---LTAGTRT-DSALLKRVKETVPDTV--VLANTGVCLENVEEQLSI 231 V DA + G+RT D+A + V +TV V+ +G+ NV + Sbjct: 187 VAEQTEDAEFFDADVLIATGSRTGDAASVDEVSVIRDNTVLPVIIGSGITAANVAALMKE 246 Query: 232 ADGCVTATTFKKDG 245 DG + A++ K +G Sbjct: 247 CDGAIVASSVKDNG 260 >UniRef50_A9FN23 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FN23_SORC5 Length = 268 Score = 58.5 bits (140), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 67/268 (25%), Positives = 112/268 (41%), Gaps = 13/268 (4%) Query: 8 IGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN-EFSLP 66 +G KAV+ M HL LPG P F + +D A +AL GG D + E Sbjct: 7 LGRRKAVLGMIHLAPLPGTP-FHEKGSFERTLDVAVQSAIALSEGGADGCLVQTVERVYG 65 Query: 67 YLTKVRPETTAAMARI---IGQLMSDIRIPFGVNVLWDPV-ASFDLAMATGAKFIRE-IF 121 + P T AM I IG+ D GV ++ + + AS +A F+R Sbjct: 66 VKDESDPARTTAMGLIVDAIGRATGD-DFQIGVQLMRNAIRASLAVAKVARGSFVRAGAL 124 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIAKSTVFN 180 GA ++ G+ + N E + ++ +I A VK + ++ +LG + + +A+ Sbjct: 125 VGATLTEHGLVEANPLEVMEYRDKIDAWGVKIIADVASTQFTWLGGAKPVAEVARRA--- 181 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 H A VS A++ V+ PD +L N ++ ADG Sbjct: 182 KHVGADAVSLGDPDEAKTLAMIASVRAAAPDLPILLAGHTNHANAARLMAAADGAFVGAC 241 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 ++ G + +D+ RV+ ++E V + R Sbjct: 242 LEQGG-WGGRIDRDRVAAYVEIVRGLER 268 >UniRef50_Q9Y937 BtpA homolog n=1 Tax=Aeropyrum pernix RepID=Q9Y937_AERPE Length = 287 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 66/277 (23%), Positives = 115/277 (41%), Gaps = 22/277 (7%) Query: 12 KAVIAMCHLRALPGDPSFDAQ-----LGMNW----VIDKAWDDLMALQNGGVDAVMFSNE 62 K ++ + HL LPG + A+ LG W +I+ A + ++ G D V+ N Sbjct: 8 KPLLGVVHLPPLPGSTGYKARRYPPRLGKVWSLEEIIEYAVSEASVYEDAGFDGVILENY 67 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR-EI 120 PY P +AM RI+ ++ S + IP GVN+L + V + A + G FIR Sbjct: 68 GDTPYPKTPGPLQVSAMTRIVREVSSAVGIPVGVNMLRNGSVEALASAYSGGGSFIRVNS 127 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGE---VKTLFNIVPEAAVYLGNRDICSIAKST 177 S G+ + + + +G E ++ L ++ + + L I + Sbjct: 128 LCETRLSPEGILEPDAARLAKSLALLGILEERRIEILADVDVKHSQPLVETSIAQTVRDC 187 Query: 178 VFNNH-PDA-LCVSGLTAGTRTDS----ALLKRVKETVPDTVVLANTGVCLENVEEQLSI 231 + + P A + ++G G D+ A + E TVV +GV N+ + I Sbjct: 188 IERSGVPIAGVVLTGHATGGAPDADEVVAAARTASEYEVKTVV--GSGVSQLNLSKYWHI 245 Query: 232 ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 ADG + ++ K G N +D+ + +RR Sbjct: 246 ADGFIIGSSIKLGGKPWNPIDKEKARLIASLAERLRR 282 >UniRef50_A3DLN8 Photosystem I assembly BtpA n=1 Tax=Staphylothermus marinus F1 RepID=A3DLN8_STAMF Length = 260 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 55/258 (21%), Positives = 111/258 (43%), Gaps = 12/258 (4%) Query: 16 AMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT----KV 71 M HL LP P + + ++ +++ A ++ L G D V+ N P+ V Sbjct: 5 GMIHLPPLPNSPQYSGE-KIDVILEYAINEAEKLVEAGFDGVIIENYMDYPFPVYEKDPV 63 Query: 72 RPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDF 129 + +AR I + +I I G+N+L + + S D+A FIR ++ + Sbjct: 64 KLGFIEYIARRIREEFPNILI--GLNILRNSGLESIDIACRNNLDFIRVNVYMETVLAPE 121 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 G+ E ++++ + VK ++ + + L N + + ++T D + VS Sbjct: 122 GIIKPLAYEIMKYKMQ-KKCNVKIYADVNVKHSQPLMNYTM--VLRNTCSRGLVDGVIVS 178 Query: 190 GLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFAN 249 G G T + + K V+ +GV +N+ + +AD + T+ K +G+ N Sbjct: 179 GEHTGYATPVSRVYVAKRICNGKEVIVGSGVNYQNIGLYIGLADAVIVGTSIKNEGITTN 238 Query: 250 FVDQARVSQFMEKVHHIR 267 V+ + +E+V ++ Sbjct: 239 PVNLQKAMYLVERVKRVK 256 >UniRef50_Q16GL4 Putative uncharacterized protein n=2 Tax=Aedes aegypti RepID=Q16GL4_AEDAE Length = 191 Score = 54.7 bits (130), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 42/165 (25%), Positives = 81/165 (49%), Gaps = 9/165 (5%) Query: 108 LAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLG 166 +A A FIR E F ++ +D G D N G+ +R++ I A ++ +I + + + Sbjct: 28 VAKACNFDFIRAEGFVFSHVADEGFTDANAGQLLRYRRNIDAEHIQIFTDIKKKHSAHAI 87 Query: 167 NRDIC--SIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVV--LANTGVCL 222 DI AK+ F D + ++G + G + + V+ V +T + + +G+ Sbjct: 88 TNDISLKETAKAAEFF-RSDGIIITGASTGCEAN---VDDVESLVGETELPLIIGSGITA 143 Query: 223 ENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 EN+ + +IAD + + FK++G + + + +V FM KV+ R Sbjct: 144 ENLNKYWNIADAAIVGSHFKENGNWRGALSEVKVQAFMNKVNGFR 188 >UniRef50_Q2CH81 Putative uncharacterized protein n=1 Tax=Oceanicola granulosus HTCC2516 RepID=Q2CH81_9RHOB Length = 270 Score = 54.3 bits (129), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 65/252 (25%), Positives = 105/252 (41%), Gaps = 7/252 (2%) Query: 12 KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKV 71 + VI M L L G ++ + V++ A ++ L + G+D +M N +P Sbjct: 13 RPVIGMVQLPPLAGGANYGGAP-VGEVLEAALEEARVLADNGIDGLMVQNLGDIPVAHAA 71 Query: 72 RPETTAAMARIIGQLMSDIRIPFGVNVLWDPV-ASFDLAMATGAKFIR-EIFTGAYASDF 129 A M R ++ P G+N+L + V A F +A A GA F+R ++F GA + F Sbjct: 72 TAAQVAWMTRATVEIGRIAACPVGLNMLENDVDAMFAVASAAGADFVRIKVFVGAMVTPF 131 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 G+ R + G G++ L ++ L + DA+ V+ Sbjct: 132 GLEQGRAHAAARARRGCGGGDIAILADVHDRTGTPLATSGFEEDLDFALRLGGADAVVVT 191 Query: 190 GLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDG-VFA 248 G + D + R + P VL GV EN EE + A G + +++ K G Sbjct: 192 GKSHAATLD--MAARARAAHPAAHVLLGGGVTAENFEETMENASGAIVSSSMKDSGSAVG 249 Query: 249 NFVDQARVSQFM 260 FV + RV FM Sbjct: 250 RFVPE-RVEAFM 260 >UniRef50_C2BTC6 Possible photosystem I biogenesis protein BtpA n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BTC6_9ACTO Length = 252 Score = 53.9 bits (128), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 63/270 (23%), Positives = 104/270 (38%), Gaps = 51/270 (18%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFS------ 64 EK ++ M HL+ D + +A + GG D V+ N F Sbjct: 17 EKLLLGMIHLKGNEIDDIYS----------RAVRECDIYARGGFDGVIVENYFGTIDDVR 66 Query: 65 --LPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIF 121 LP L P+ + GV+V+WD SFDLA+ FI + Sbjct: 67 YCLPRLQDKFPQ-----------------LYVGVDVIWDNDKSFDLAVEHQLPFIELDSL 109 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV--- 178 G E + + RI + + I+ V L N+ + S V Sbjct: 110 AGQLPPQ---------EEPQFEERIRWCQENSPAVIL--GGVRLKNQPVLSGNPLEVDLM 158 Query: 179 -FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 D + V+G+ G T+ + + + +E + D +L GV +N EQL+IADG + Sbjct: 159 LAKKRGDGVIVTGVDTGVETELSKIIQFREIIGDFPLLVGAGVNEKNCTEQLTIADGAII 218 Query: 238 ATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 ++ K+ G ++ RV + + V +R Sbjct: 219 GSSLKQGGNAKGDLEMDRVERLVTAVRALR 248 >UniRef50_Q18HA0 Photosystem I biogenesis protein homolog n=1 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18HA0_HALWD Length = 223 Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 51/193 (26%), Positives = 89/193 (46%), Gaps = 16/193 (8%) Query: 49 LQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIR----IPFGVNVLW-DPV 103 L+ G +DA++ N + P+ P+ T AM I L+ DI+ +P V++L D Sbjct: 34 LEAGSIDAILVKNLGNTPFHADDVPKHTVAM---ISALIKDIQRVVDVPISVDILRNDAE 90 Query: 104 ASFDLAMATGAKFIRE-IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAA 162 A+ +A AT A FIR + G +D G+ ET+R + + +V+ L ++ + + Sbjct: 91 AALSIAAATTASFIRAGVHVGTLVTDQGIVTRRAAETLRLRDHLRT-DVEILADVSVKHS 149 Query: 163 VYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTV-----VLAN 217 R + + H D + SG+ G + D L V + V D++ V + Sbjct: 150 APAAERPLTETITDIISREHADGIIASGVGTGHKIDCGHLNTVVD-VRDSLETGIPVFVD 208 Query: 218 TGVCLENVEEQLS 230 +GV LE + + S Sbjct: 209 SGVTLETIADIYS 221 >UniRef50_C5EPH7 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EPH7_9FIRM Length = 273 Score = 50.8 bits (120), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 49/188 (26%), Positives = 81/188 (43%), Gaps = 7/188 (3%) Query: 77 AAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIRE-IFTGAYASDFGVWDTN 135 +A+ R + ++ D+ + + +P A+ +A A GA FIR+ +F GA GV Sbjct: 90 SALGRTVKRMFPDLSLGIILEAN-NPSAAMYIANACGADFIRQKVFIGAMVKAGGVMTGR 148 Query: 136 VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGT 195 GE + + V+ L +I V LG I A D L ++G Sbjct: 149 AGEVWEARKDMDR-PVRVLTDIYDRTGVPLGPLPI-ETAAGQALKYGSDGLILTGKNFEE 206 Query: 196 RTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQAR 255 D L RV++ P V G+ +NV E + DG + ++ +DG N + + Sbjct: 207 SLD--LADRVRKQYPQAPVYLGGGITEKNVGEAVKHCDGMIVSSCLLEDGK-DNVWSRQK 263 Query: 256 VSQFMEKV 263 + +FME V Sbjct: 264 IRRFMECV 271 >UniRef50_UPI000180CC4C PREDICTED: similar to F13E9.13 n=1 Tax=Ciona intestinalis RepID=UPI000180CC4C Length = 228 Score = 50.1 bits (118), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 99/201 (49%), Gaps = 8/201 (3%) Query: 73 PETTAAMARIIGQLM--SDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIFTGAYASDF 129 P++T ++A+I ++ ++I G+ L+ +S + T FIR E F ++ D Sbjct: 30 PKSTMSVAKICDVVLKEAEIYTRAGLFKLFISPSSNLVLYLTDLDFIRAEGFVFSHIGDE 89 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD--ICSIAKSTVFNNHPDALC 187 G D+ +R++ +I A V +I + + + D I +++ F D + Sbjct: 90 GFIDSCAASLLRYRKQIEADHVLVFTDIKKKHSSHSITSDTSISETSRAAEFF-LSDGVI 148 Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVF 247 V+G G+ TD +K V++ V VL +GV +NV++ + + + + FK GV+ Sbjct: 149 VTGNETGSSTDLNQIKDVQDEV-GIPVLVGSGVTADNVDKYIHTS-ALIVGSHFKVGGVW 206 Query: 248 ANFVDQARVSQFMEKVHHIRR 268 +N VD V +FM+KV + + Sbjct: 207 SNPVDANLVQKFMKKVREMNK 227 >UniRef50_Q96YL8 Putative uncharacterized protein ST2156 n=1 Tax=Sulfolobus tokodaii RepID=Q96YL8_SULTO Length = 250 Score = 50.1 bits (118), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 59/236 (25%), Positives = 102/236 (43%), Gaps = 16/236 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL LPG SF + ++D A ++ L+ GG DAV+ N P+ KVR Sbjct: 3 LIGVVHLPPLPG--SFFYKGEFEEIVDFAINESKKLEVGGFDAVILENFNDKPFRKKVRV 60 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWDPV-ASFDLAMATGAKFIR-EIFTGAYASDFGV 131 ET AM+ I ++ + G+N+L + + +A TG FIR +S G+ Sbjct: 61 ETAIAMSIIAREVKKSTSLLVGINLLRNSAYEAASIASLTG-DFIRVNALCETISSPEGI 119 Query: 132 WD---TNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV 188 + V E + + R ++ L +I + A L ++ S+ D + V Sbjct: 120 IEPASVEVQEVLYYTKR----KISILADINVKHASPLHQMNLESLLLDCKERGFADYIIV 175 Query: 189 SGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 +G G + ++K +K P V + +G+ N+ + C T+ KD Sbjct: 176 TGERTGKEPNPEVVKMIKNISPLPVCVG-SGMTPNNIRDY---KVDCFIIGTYLKD 227 >UniRef50_Q166V4 Photosystem I biogenesis protein, putative n=2 Tax=Roseobacter RepID=Q166V4_ROSDO Length = 262 Score = 48.1 bits (113), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 60/265 (22%), Positives = 112/265 (42%), Gaps = 17/265 (6%) Query: 6 EVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSL 65 ++ + K VIA HL D + + L + W D A + G+ + ++ Sbjct: 4 KLFDSNKPVIAALHL----PDFALNRHLSVAWYEDYAVANARVFAEAGIPWIKLQDQTK- 58 Query: 66 PYLTKVRPET---TAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIF 121 + P+T A++AR+I + +R+ V DP A+ +A A+GA FIR ++F Sbjct: 59 -TAGQAAPDTLTLMASLARLIRSEVPQLRLGIIVEA-HDPGAALCVAHASGADFIRLKVF 116 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 G + G D E + + + ++ L +I A+ L + A + + Sbjct: 117 VGGAMTAQGPRDGLSAEVVAMRSELRRADIAILADIHDRTAMPLSSES-QPFAANWAVKS 175 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 D L ++G A + + V+++ +L GV NV E ++ ADG + ++ Sbjct: 176 GADGLVITG--ASFADTLSRISAVRDSGARRPILIGGGVTESNVHEAMAAADGVIVSSAL 233 Query: 242 KKDGVFANFV---DQARVSQFMEKV 263 + A+ V D +FM+ V Sbjct: 234 MRRDAAADDVIQWDADLCKRFMDAV 258 >UniRef50_A3MXM0 Photosystem I assembly BtpA n=5 Tax=Thermoproteaceae RepID=A3MXM0_PYRCJ Length = 242 Score = 44.3 bits (103), Expect = 0.004, Method: Compositional matrix adjust. Identities = 56/214 (26%), Positives = 90/214 (42%), Gaps = 14/214 (6%) Query: 34 GMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIP 93 G ++ A L+ G DAV+ N + +P+ K E AMA ++ ++ +P Sbjct: 12 GSPQRLEHAVRSAKRLEEAGFDAVIVENYYDMPFKPKADFEAAVAMAVAAREVAREVSLP 71 Query: 94 FGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEV 151 G+N+L + V + +A GA FIR +T S+ G+ T I+ + V Sbjct: 72 VGINLLRNACVKASIIARHVGATFIRCNAYTDIVLSESGIL-TPQAPYIKGVKVLADVHV 130 Query: 152 KTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPD 211 K +I P R + ++ P A+ V+G G D L + D Sbjct: 131 KHGESIYP--------RTLAEAVEAASTRAAPAAIVVTGRKTGEAPDPVDLATARAYT-D 181 Query: 212 TVVLANTGVCLENVEEQLSIADGCVTATTFKKDG 245 VL +G+C + + L IADG + T KDG Sbjct: 182 LPVLVGSGICFQTL-PLLKIADGAIVGTCV-KDG 213 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39364 Putative sgc region protein sgcQ n=61 Tax=Bacter... 343 3e-93 UniRef50_C0D1F9 Putative uncharacterized protein n=1 Tax=Clostri... 312 9e-84 UniRef50_B1CBM1 Putative uncharacterized protein n=1 Tax=Anaerof... 309 7e-83 UniRef50_A7B0Z5 Putative uncharacterized protein n=1 Tax=Ruminoc... 308 1e-82 UniRef50_C5CIF3 Photosystem I assembly BtpA n=1 Tax=Kosmotoga ol... 305 8e-82 UniRef50_A3KNP0 Zgc:162297 protein n=7 Tax=Coelomata RepID=A3KNP... 297 2e-79 UniRef50_A4E7P3 Putative uncharacterized protein n=1 Tax=Collins... 294 2e-78 UniRef50_UPI0000D55C2D PREDICTED: similar to conserved hypotheti... 292 7e-78 UniRef50_A5KMZ4 Putative uncharacterized protein n=2 Tax=Clostri... 292 8e-78 UniRef50_Q5JHL2 Uncharacterized protein TK2179 n=5 Tax=Euryarcha... 291 1e-77 UniRef50_A8S303 Putative uncharacterized protein n=1 Tax=Clostri... 291 2e-77 UniRef50_Q28QI3 Photosystem I assembly BtpA n=5 Tax=Alphaproteob... 290 3e-77 UniRef50_Q2RUX8 Photosystem I assembly BtpA n=1 Tax=Rhodospirill... 288 1e-76 UniRef50_Q8U2H5 Uncharacterized protein PF0860 n=8 Tax=Euryarcha... 287 3e-76 UniRef50_Q29E81 GA21203 n=3 Tax=Coelomata RepID=Q29E81_DROPS 285 8e-76 UniRef50_B9XI59 Photosystem I assembly BtpA n=1 Tax=bacterium El... 283 4e-75 UniRef50_A9W9X3 Photosystem I assembly BtpA n=4 Tax=Chloroflexac... 282 1e-74 UniRef50_C3ZBU0 Putative uncharacterized protein n=2 Tax=Metazoa... 280 3e-74 UniRef50_C5ELG9 Putative uncharacterized protein n=1 Tax=Clostri... 280 3e-74 UniRef50_Q9VS44 CG8607 n=22 Tax=Eukaryota RepID=Q9VS44_DROME 279 5e-74 UniRef50_D2QXT9 Photosystem I assembly BtpA n=2 Tax=Bacteria Rep... 276 5e-73 UniRef50_D2RDD9 Photosystem I assembly BtpA n=1 Tax=Archaeoglobu... 275 1e-72 UniRef50_UPI000186CA08 conserved hypothetical protein n=1 Tax=Pe... 274 3e-72 UniRef50_P72966 Photosystem I biogenesis protein btpA n=34 Tax=C... 274 3e-72 UniRef50_A6NZB9 Putative uncharacterized protein n=1 Tax=Bactero... 270 4e-71 UniRef50_Q8TVC9 Predicted TIM-barrel enzyme n=1 Tax=Methanopyrus... 269 9e-71 UniRef50_B9CKX7 Putative uncharacterized protein n=1 Tax=Atopobi... 265 9e-70 UniRef50_A3K4E4 Putative uncharacterized protein n=1 Tax=Sagittu... 265 1e-69 UniRef50_B9XEW7 Photosystem I assembly BtpA n=1 Tax=bacterium El... 263 5e-69 UniRef50_A8AAQ2 Photosystem I assembly BtpA n=1 Tax=Ignicoccus h... 261 2e-68 UniRef50_B5XQK9 BtpA family protein n=18 Tax=Proteobacteria RepI... 261 2e-68 UniRef50_Q1NZ26 Uncharacterized protein F13E9.13, mitochondrial ... 259 8e-68 UniRef50_Q9Y937 BtpA homolog n=1 Tax=Aeropyrum pernix RepID=Q9Y9... 259 9e-68 UniRef50_A4YFU5 Photosystem I assembly BtpA n=1 Tax=Metallosphae... 257 4e-67 UniRef50_A2BLV0 Conserved archaeal protein n=1 Tax=Hyperthermus ... 249 7e-65 UniRef50_B8HZU1 Photosystem I assembly BtpA n=5 Tax=Clostridiale... 247 3e-64 UniRef50_O29828 Uncharacterized protein AF_0419 n=1 Tax=Archaeog... 247 3e-64 UniRef50_A3DLN8 Photosystem I assembly BtpA n=1 Tax=Staphylother... 247 3e-64 UniRef50_D2RQS6 Photosystem I assembly BtpA n=4 Tax=Halobacteria... 244 3e-63 UniRef50_UPI000155D1C1 PREDICTED: hypothetical protein n=1 Tax=O... 243 6e-63 UniRef50_C5EEU2 Photosystem I assembly BtpA n=2 Tax=Clostridiale... 241 2e-62 UniRef50_C0ZRZ3 Putative uncharacterized protein n=2 Tax=Rhodoco... 237 2e-61 UniRef50_UPI0001C369E0 btpA family protein n=1 Tax=Clostridium h... 237 2e-61 UniRef50_Q96YL8 Putative uncharacterized protein ST2156 n=1 Tax=... 235 1e-60 UniRef50_A5GQP5 Photosystem I biogenesis protein BtpA n=4 Tax=Ba... 222 1e-56 UniRef50_B9LR39 Photosystem I assembly BtpA n=7 Tax=cellular org... 222 1e-56 UniRef50_C8S5Y9 Photosystem I assembly BtpA n=1 Tax=Ferroglobus ... 221 2e-56 UniRef50_UPI000180CC4C PREDICTED: similar to F13E9.13 n=1 Tax=Ci... 200 5e-50 UniRef50_C2BTC6 Possible photosystem I biogenesis protein BtpA n... 196 6e-49 UniRef50_Q2CH81 Putative uncharacterized protein n=1 Tax=Oceanic... 193 6e-48 UniRef50_Q16GL4 Putative uncharacterized protein n=2 Tax=Aedes a... 187 3e-46 UniRef50_A9FN23 Putative uncharacterized protein n=1 Tax=Sorangi... 182 1e-44 UniRef50_UPI000069ECFE UPI000069ECFE related cluster n=1 Tax=Xen... 178 1e-43 UniRef50_Q166V4 Photosystem I biogenesis protein, putative n=2 T... 178 2e-43 UniRef50_C5EPH7 Putative uncharacterized protein n=1 Tax=Clostri... 171 3e-41 UniRef50_Q18HA0 Photosystem I biogenesis protein homolog n=1 Tax... 140 5e-32 Sequences not found previously or not previously below threshold: UniRef50_A3MXM0 Photosystem I assembly BtpA n=5 Tax=Thermoprotea... 179 1e-43 UniRef50_C9XNH3 Putative uncharacterized protein n=7 Tax=Firmicu... 145 2e-33 UniRef50_C2JNA9 Photosystem I biogenesis protein BtpA n=9 Tax=En... 132 1e-29 UniRef50_C0C181 Putative uncharacterized protein n=1 Tax=Clostri... 121 2e-26 UniRef50_C2D712 Putative uncharacterized protein n=1 Tax=Atopobi... 115 1e-24 UniRef50_C5KQ75 Putative uncharacterized protein n=1 Tax=Perkins... 66 9e-10 UniRef50_D2VTE2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 66 9e-10 UniRef50_Q0FCT7 Adenine phosphoribosyltransferase n=3 Tax=Rhodob... 63 1e-08 UniRef50_A6G6X7 Putative uncharacterized protein n=1 Tax=Plesioc... 59 1e-07 UniRef50_Q7M877 Tryptophan synthase alpha chain n=9 Tax=Epsilonp... 50 9e-05 UniRef50_Q4JTH5 L-lactate dehydrogenase n=6 Tax=Actinomycetales ... 49 2e-04 UniRef50_C0QR82 Thiamine-phosphate pyrophosphorylase n=1 Tax=Per... 48 3e-04 UniRef50_B9T9A8 Putative uncharacterized protein n=1 Tax=Ricinus... 48 5e-04 UniRef50_A3TLL5 Tryptophan synthase alpha chain n=1 Tax=Janibact... 48 5e-04 UniRef50_A5IKT4 Tryptophan synthase alpha chain n=5 Tax=Thermoto... 46 0.001 UniRef50_A8VXV8 Tryptophan synthase alpha chain n=2 Tax=Bacillus... 46 0.002 UniRef50_A6TM77 Tryptophan synthase alpha chain n=10 Tax=Clostri... 46 0.002 UniRef50_Q3ABS4 Tryptophan synthase alpha chain n=1 Tax=Carboxyd... 45 0.002 UniRef50_C7N999 Tryptophan synthase alpha chain n=2 Tax=Leptotri... 45 0.003 UniRef50_Q67PJ3 Tryptophan synthase alpha chain n=3 Tax=Clostrid... 45 0.003 UniRef50_B5YJ74 Thiamine-phosphate pyrophosphorylase n=1 Tax=The... 45 0.003 UniRef50_B9K6Z6 Tryptophan synthase alpha chain n=1 Tax=Thermoto... 44 0.004 UniRef50_Q5KXV2 Tryptophan synthase alpha chain n=6 Tax=Bacillac... 44 0.005 UniRef50_B8JAL9 Tryptophan synthase alpha chain n=4 Tax=Anaeromy... 44 0.005 UniRef50_Q2LUE0 Tryptophan synthase alpha chain n=1 Tax=Syntroph... 44 0.006 UniRef50_D1BQN8 Tryptophan synthase alpha chain n=17 Tax=cellula... 44 0.006 UniRef50_B0NZY9 Tryptophan synthase alpha chain n=2 Tax=Clostrid... 44 0.007 UniRef50_D1B8G5 CutC family protein n=1 Tax=Thermanaerovibrio ac... 43 0.008 UniRef50_A9A2A1 Tryptophan synthase alpha chain n=3 Tax=Thaumarc... 43 0.010 UniRef50_B5Y770 Enoyl-(Acyl-carrier-protein) reductase II n=1 Ta... 43 0.011 UniRef50_O27697 Tryptophan synthase alpha chain n=3 Tax=Methanob... 43 0.011 UniRef50_C0EA61 Tryptophan synthase alpha chain n=1 Tax=Clostrid... 43 0.013 UniRef50_C4V0L8 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 43 0.015 UniRef50_Q4FUL2 Tryptophan synthase alpha chain n=5 Tax=Gammapro... 43 0.015 UniRef50_A5IYV3 Triosephosphate isomerase n=1 Tax=Mycoplasma aga... 43 0.017 UniRef50_UPI00006A2EA9 UPI00006A2EA9 related cluster n=1 Tax=Xen... 42 0.017 UniRef50_D2B8E9 Tryptophan synthase alpha chain n=2 Tax=Actinomy... 42 0.018 UniRef50_C1N2K9 Predicted protein n=1 Tax=Micromonas pusilla CCM... 42 0.020 UniRef50_A0B8J3 Geranylgeranylglyceryl phosphate synthase n=13 T... 42 0.021 UniRef50_Q12TL0 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 42 0.021 UniRef50_Q39SS2 Tryptophan synthase alpha chain n=7 Tax=Desulfur... 42 0.021 UniRef50_Q2RIT8 N-(5'-phosphoribosyl)anthranilate isomerase n=7 ... 42 0.023 UniRef50_C6J450 Tryptophan synthase alpha chain n=2 Tax=Bacillal... 42 0.023 UniRef50_A6Q538 Triosephosphate isomerase n=24 Tax=Epsilonproteo... 42 0.026 UniRef50_Q1CZH2 Tryptophan synthase alpha chain n=2 Tax=Cystobac... 41 0.030 UniRef50_C0ZCE6 Tryptophan synthase alpha chain n=1 Tax=Brevibac... 41 0.031 UniRef50_B2WCC2 tRNA (Guanine-N7-)-methyltransferase subunit Trm... 41 0.035 UniRef50_C6CUD6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 41 0.053 UniRef50_Q8G691 Tryptophan synthase alpha chain n=30 Tax=Actinob... 41 0.053 UniRef50_UPI00016C49DE phosphoribosylanthranilate isomerase n=1 ... 41 0.055 UniRef50_C9KJ76 N-(5'-phosphoribosyl)anthranilate isomerase n=3 ... 41 0.056 UniRef50_B1YEC8 CutC family protein n=1 Tax=Exiguobacterium sibi... 41 0.059 UniRef50_Q9URN8 Mutant tryptophan synthase (Fragment) n=4 Tax=Ne... 41 0.062 UniRef50_A9KL40 Tryptophan synthase alpha chain n=13 Tax=Firmicu... 41 0.063 UniRef50_Q5HPG9 Tryptophan synthase alpha chain n=59 Tax=Staphyl... 41 0.064 UniRef50_Q0W630 Tryptophan synthase alpha chain n=1 Tax=uncultur... 40 0.070 UniRef50_D2RNR7 Tryptophan synthase, alpha subunit n=15 Tax=cell... 40 0.084 UniRef50_A8ZVH6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 40 0.088 UniRef50_Q8U092 N-(5'-phosphoribosyl)anthranilate isomerase n=5 ... 40 0.093 UniRef50_Q7UMC3 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 40 0.097 >UniRef50_P39364 Putative sgc region protein sgcQ n=61 Tax=Bacteria RepID=SGCQ_ECOLI Length = 268 Score = 343 bits (881), Expect = 3e-93, Method: Composition-based stats. Identities = 268/268 (100%), Positives = 268/268 (100%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS Sbjct: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI Sbjct: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN Sbjct: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT Sbjct: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDGVFANFVDQARVSQFMEKVHHIRR Sbjct: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 >UniRef50_C0D1F9 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D1F9_9CLOT Length = 276 Score = 312 bits (799), Expect = 9e-84, Method: Composition-based stats. Identities = 120/274 (43%), Positives = 159/274 (58%), Gaps = 6/274 (2%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M W +++ G +K +I M HL LPGDP F M V++ A DL ALQ GGVD +MFS Sbjct: 1 MLWTEKMFGVKKPIITMLHLDPLPGDPRFHYGDTMERVVEHARADLHALQEGGVDGIMFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFSLPY + T AAMAR+IG+L S+IR+P+GV+ + D AS +LA A AKFIR Sbjct: 61 NEFSLPYERHMSFVTPAAMARVIGELKSEIRVPYGVDCISDGQASIELAAAVDAKFIRGT 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F+G Y D G ++ + +R + + ++K L+ I PE+ + R + IAKST+F Sbjct: 121 FSGVYVGDGGFYNNDFSALLRRKAALHLDDLKMLYFINPESDRSMDTRPLVDIAKSTIFK 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 HPD LC+S AG D L+ VK PD VVL NTG + +E +L+ AD V T Sbjct: 181 AHPDGLCISANAAGQDVDDELIASVKSGAPDVVVLCNTGCRPDTIERKLTTADAAVVGTY 240 Query: 241 FKKDGVFAN------FVDQARVSQFMEKVHHIRR 268 FK+ G N VD RV +FME VH R Sbjct: 241 FKEGGKLENDKLENVRVDVNRVKEFMEVVHRFRE 274 >UniRef50_B1CBM1 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBM1_9FIRM Length = 269 Score = 309 bits (792), Expect = 7e-83, Method: Composition-based stats. Identities = 128/265 (48%), Positives = 176/265 (66%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 W+KE+ GT+K ++AM HL ALPGDP +D G+ +VI++A ++ ALQ+GGVD ++ SNE Sbjct: 2 WMKEIFGTDKPIVAMLHLAALPGDPLYDENKGLCYVIERAKREIKALQDGGVDGILISNE 61 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFT 122 +S PY+ V T +MAR+IGQL IP GV ++ DP +FDLA + GAKF+R FT Sbjct: 62 YSFPYMGDVPIITAMSMARVIGQLKEYFTIPMGVQIISDPYKTFDLAASVGAKFVRGTFT 121 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 G++A D G+ + G+ +RH+ +GA +VK ++N+VPEAA YL +R IA STVF+ Sbjct: 122 GSFAGDHGIAVYDTGKIMRHKIAVGAKDVKCMYNLVPEAAKYLVDRSWEEIADSTVFHCK 181 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFK 242 PDAL V+G AG D+ ++ RVK+ VP+T V ANTGV EN+E QL+ DG + TTFK Sbjct: 182 PDALMVAGFLAGREADTQIMTRVKKVVPNTPVFANTGVRYENIEMQLAACDGAIVGTTFK 241 Query: 243 KDGVFANFVDQARVSQFMEKVHHIR 267 +DG F RV FM KV R Sbjct: 242 EDGDFYKEAKYDRVKAFMNKVREFR 266 >UniRef50_A7B0Z5 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B0Z5_RUMGN Length = 271 Score = 308 bits (790), Expect = 1e-82, Method: Composition-based stats. Identities = 116/268 (43%), Positives = 160/268 (59%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M W +++ G +K +IAM HL LPGDP + + M+ +I+ A DL ALQ+GGV+ ++FS Sbjct: 1 MLWTEKLFGVKKPIIAMLHLDPLPGDPLYKKENDMDVIIEHARADLHALQDGGVNGIIFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFS PY + T AAMA +IG L S+I++P+GV+ + D A +LA A A F+R Sbjct: 61 NEFSFPYQRTMDMVTPAAMAYVIGNLRSEIKVPYGVDAISDGRACLELAAAVKANFVRGT 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F G Y D G ++ + +R + + E+K L+ I PE+ L R + IAK+T+ Sbjct: 121 FCGVYVGDGGFYNNDFSALLRRKAALPLDELKMLYFINPESDQSLDTRPLADIAKTTIAK 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 PD LC+S AG D AL+ VKE PD VVL NTG + +E +L+ AD V TT Sbjct: 181 AAPDGLCISADAAGQDVDDALIASVKEANPDIVVLCNTGCRINTIERKLTTADAAVVGTT 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDG F N VD RV +FM+ VH R Sbjct: 241 FKKDGKFENRVDVNRVKEFMQVVHEFRE 268 >UniRef50_C5CIF3 Photosystem I assembly BtpA n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIF3_KOSOT Length = 260 Score = 305 bits (783), Expect = 8e-82, Method: Composition-based stats. Identities = 117/252 (46%), Positives = 158/252 (62%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ +KE+ G EK +I M H LPG P +D + G+ +++++ DL +LQNGG+DAVMF Sbjct: 1 MATVKEIFGKEKVIIGMVHFPPLPGSPLYDDKKGVEFIVERIKSDLKSLQNGGIDAVMFC 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE PY KV T A M+R IG++M +IR+PFGV+VLWDP A+ +A A GAKFIREI Sbjct: 61 NENDRPYKLKVDSATVATMSRAIGEVMDEIRVPFGVDVLWDPFAAIAIAKAVGAKFIREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 TG Y SD G+W T VGE R++ + A ++ FNI E A L R + IAKS F+ Sbjct: 121 ITGTYVSDMGLWKTEVGEFYRYRKLLDANDIAVFFNISAEFAYNLDRRPLEEIAKSVAFS 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + D + VSG G +K+VK+ V + V ANTGV ENV E L+IADG + T+ Sbjct: 181 SLADVILVSGPMTGESPSLDHIKKVKDKVGEKPVFANTGVTKENVREILNIADGAIIGTS 240 Query: 241 FKKDGVFANFVD 252 KKDG+ F + Sbjct: 241 LKKDGITRRFWN 252 >UniRef50_A3KNP0 Zgc:162297 protein n=7 Tax=Coelomata RepID=A3KNP0_DANRE Length = 268 Score = 297 bits (762), Expect = 2e-79, Method: Composition-based stats. Identities = 81/273 (29%), Positives = 133/273 (48%), Gaps = 10/273 (3%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M +L + +I M H+RALPG P + ++ + ++A + N G+D ++ Sbjct: 1 MKFLNLFGRLQSNIIGMIHVRALPGTPL--NRFTISDIKEEACREAEIYYNAGLDGLIIE 58 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPV-ASFDLAMATGAKFIR 118 N +PY V PE A M + + P GV +L ++ +A+A+G FIR Sbjct: 59 NMHDIPYTLDVGPEVCACMTAVCTAVRGLYPSWPLGVQILSAANHSALAVALASGLDFIR 118 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD--ICSIAK 175 E F ++ +D G+ + GE +R++ IGA V+ +I + + + D I A+ Sbjct: 119 AEGFVFSHVADEGLLNACAGELLRYRKCIGAEHVQIFTDIKKKHSAHALTADVSIAETAQ 178 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 + F D + V+G G + D L+ V ++V VL +GV +NVE L A Sbjct: 179 AAEFF-LSDGVVVTGSATGAKADPQELREVSQSV-RIPVLIGSGVTDDNVEHYLQ-ASAM 235 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + FKK G +AN VD RV +FM K+H +R Sbjct: 236 IIGSHFKKGGYWANGVDAERVKRFMGKMHKLRE 268 >UniRef50_A4E7P3 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E7P3_9ACTN Length = 274 Score = 294 bits (753), Expect = 2e-78, Method: Composition-based stats. Identities = 97/267 (36%), Positives = 139/267 (52%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS+L + TEK VI M HLR LPGDP + ++ V++ A DL ALQ GGVD ++ + Sbjct: 6 MSFLTSMFKTEKPVIGMLHLRPLPGDPLYYPGGSVSQVVEAAKRDLEALQQGGVDGILIT 65 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE S+PY V P T A++ +IG L D+ P+G ++D A+ +L A A+F R Sbjct: 66 NELSMPYEQHVSPSTLASVGYVIGTLSHDLSTPWGAEAIYDGDATIELCAAVDAQFTRCN 125 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F GA+A D G+ + + T+R + + ++K I E VYL +R IA S +FN Sbjct: 126 FCGAWAGDLGLINRDFAHTMRRKAALRLDDLKLFHFITSEGEVYLNDRTTADIADSLLFN 185 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 PDA+ + G AG L V+E V + V+ TG V + + DG T Sbjct: 186 CLPDAMVIGGSAAGRGASGELADEVRERVGEVPVVCGTGCRENTVADVFAHYDGAFVGTC 245 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIR 267 K+DG VD RV++FM R Sbjct: 246 LKRDGRLDAPVDVERVARFMAAARTAR 272 >UniRef50_UPI0000D55C2D PREDICTED: similar to conserved hypothetical protein n=1 Tax=Tribolium castaneum RepID=UPI0000D55C2D Length = 270 Score = 292 bits (748), Expect = 7e-78, Method: Composition-based stats. Identities = 78/275 (28%), Positives = 134/275 (48%), Gaps = 14/275 (5%) Query: 1 MSWLKEVI-GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M +++ T+ AV+ M H+ ALPG P + ++ ++ KA + G+D+++ Sbjct: 1 MLKFRQLFHTTKCAVVGMVHVGALPGTPLCNK--SVDSLVFKACKEAEMYLKYGLDSILV 58 Query: 60 SNEFSLPYLTKV--RPETTAAMARIIGQLMSDI--RIPFGVNVL-WDPVASFDLAMATGA 114 N +PY+ PET A M R+ ++ +P GV VL + + +A A Sbjct: 59 ENMHDVPYIQSKYFTPETVATMTRVCTEIRKIAPGTVPCGVQVLACGNLEALAVAKACNF 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDI--C 171 FIR E F + +D G D N G +R++ +I A V L +I + + + D+ Sbjct: 119 DFIRAEGFVFGHVADEGYTDANAGLILRYRRQIQAENVLILADIKKKHSSHAITSDVSLV 178 Query: 172 SIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSI 231 A++ F D L ++G+ G+ + + L +VK+ VL +GV +N+ + + Sbjct: 179 ETAQAAQFFQ-ADGLILTGVATGSPANVSELSQVKKFC-SLPVLVGSGVTGDNLGDYMG- 235 Query: 232 ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 ADG + + FKK GV+ VD+ RV FMEK + Sbjct: 236 ADGVIVGSYFKKGGVWYEDVDEERVRNFMEKRKML 270 >UniRef50_A5KMZ4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5KMZ4_9FIRM Length = 269 Score = 292 bits (748), Expect = 8e-78, Method: Composition-based stats. Identities = 127/268 (47%), Positives = 166/268 (61%), Gaps = 4/268 (1%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 W +++ G EK +IA+ HL ALPGDP + M V + A DL+ALQ+GGVD ++F+NE Sbjct: 2 WTQDMFGVEKPIIALLHLDALPGDPGYCGD--MKTVTEHARKDLLALQDGGVDGILFANE 59 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFT 122 FSLPY +AMA IIG+L +I +PFGVNV+ +P+A+ DL ATGAKF R F+ Sbjct: 60 FSLPYQPVADIAVVSAMAYIIGKLKDEISVPFGVNVVKNPIATIDLGAATGAKFGRSCFS 119 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 GAY ++GV+ +N GE IRH+ +G ++K LF + PEA YL RD+ +AKS +F + Sbjct: 120 GAYMGEYGVYVSNSGEAIRHRKALGIEDMKLLFKVNPEADAYLVQRDVQVVAKSIMFGDF 179 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVP--DTVVLANTGVCLENVEEQLSIADGCVTATT 240 D LCVSG AG D +L RV E V NTG NV E+L DG T Sbjct: 180 ADGLCVSGAAAGAEPDDVILSRVHEVAKPRKVPVFCNTGCNHGNVREKLGNCDGVCMGTA 239 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDGVF VD+ RV +FME V IR+ Sbjct: 240 FKKDGVFNGRVDKERVREFMEIVADIRK 267 >UniRef50_Q5JHL2 Uncharacterized protein TK2179 n=5 Tax=Euryarchaeota RepID=Y2179_PYRKO Length = 261 Score = 291 bits (746), Expect = 1e-77, Method: Composition-based stats. Identities = 76/264 (28%), Positives = 122/264 (46%), Gaps = 8/264 (3%) Query: 7 VIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLP 66 + K +I M HL+ LPG ++ + VI+ A D + L+ G DAVM N +P Sbjct: 1 MDFERKPLIGMVHLKPLPGSYLYNGD--FDSVIEAALRDAVTLEEAGFDAVMVENFGDVP 58 Query: 67 YLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGA 124 + T A++A + + ++ +P GVNVL D +A++ +A A A FIR + +G Sbjct: 59 FPKYADKTTVASLAVVAKAIRDEVSLPLGVNVLRNDGIAAYSIAYAVKADFIRVNVLSGV 118 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 +D G+ + E + R+ + +K ++ + AV+ G D TV D Sbjct: 119 AYTDQGIIEGIAHELAMLRKRLPSE-IKVFADVHVKHAVHFG--DFEDAFLDTVERGLAD 175 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 A+ VSG G D L KE P V+ +G +N+ E ADG + T K+D Sbjct: 176 AVVVSGKATGRPVDVDKLALAKEISP-VPVIVGSGTSYDNLPELWKYADGFIVGTWIKRD 234 Query: 245 GVFANFVDQARVSQFMEKVHHIRR 268 G N V R + +E +R+ Sbjct: 235 GRVENEVSLERARKLVELAKELRQ 258 >UniRef50_A8S303 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S303_9CLOT Length = 274 Score = 291 bits (745), Expect = 2e-77, Method: Composition-based stats. Identities = 74/275 (26%), Positives = 127/275 (46%), Gaps = 9/275 (3%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDA-QLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M LK+V +K +I M HLR LPG P +D +GM+ +I A ++ L+ GVD V Sbjct: 1 MGKLKDVFKVDKPIIGMVHLRPLPGSPKYDPVNMGMDKIISIALEEAAMLEQAGVDGVQV 60 Query: 60 SNEFSLPYLT--KVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGA-KF 116 N + +PYL + ET AA+A I + + + IP G + + Sbjct: 61 ENMWDIPYLRSEDIGYETAAALAVGIHAVRNKVSIPVGAECHMNGADCAMACAVAAGASW 120 Query: 117 IREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY--LGNRDICSI 173 IR + A+ S G + R + R+ A ++ L ++ + + + +R + Sbjct: 121 IRVFEWCNAFVSQSGFINAMGANVSRMRSRLKADQILALCDVNVKHGSHYIIHDRSVAEQ 180 Query: 174 AKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD 233 A + + DA+ V+G GT + + K++ +L +G+ NV E L+ AD Sbjct: 181 AMD-IESQDGDAVIVTGFDTGTPPSVENISKCKKST-SLPILIGSGLNSSNVNELLTAAD 238 Query: 234 GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 G + + FK+ + N V R +FM+KV +R+ Sbjct: 239 GAIIGSWFKEGNNWKNPVSYDRTKEFMDKVIALRQ 273 >UniRef50_Q28QI3 Photosystem I assembly BtpA n=5 Tax=Alphaproteobacteria RepID=Q28QI3_JANSC Length = 267 Score = 290 bits (744), Expect = 3e-77, Method: Composition-based stats. Identities = 126/267 (47%), Positives = 161/267 (60%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M ++V GT K VIAM HL A+PG P DA G+ ++ A DL ALQ GVDAVMF Sbjct: 1 MQKFRDVFGTPKPVIAMVHLGAMPGTPLHDADAGLEGLVAAAAADLSALQAAGVDAVMFG 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE PY V +TA MA +IGQL I +PFGVNVLWDP ++ LA ATGA+F REI Sbjct: 61 NENDRPYEFAVDTASTATMAYVIGQLRGQITVPFGVNVLWDPDSTIALAAATGAQFCREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 FTG YASD GVW + G +R++ R+G ++ L+N+ E A L R + A+S VF+ Sbjct: 121 FTGTYASDMGVWAPDAGRALRYRKRLGRDDLAMLYNVSAEFADSLDKRPLPDRARSAVFS 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + PDA+ VSG G L+ VK +P+T VLANTGV + V E L IADGC+ ++ Sbjct: 181 SVPDAVLVSGQITGEAARMEDLEAVKAVLPETPVLANTGVKHDTVAEVLRIADGCIVGSS 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIR 267 K DG N VD R FM++ R Sbjct: 241 LKVDGHTWNAVDPDRAKDFMDRARASR 267 >UniRef50_Q2RUX8 Photosystem I assembly BtpA n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RUX8_RHORT Length = 267 Score = 288 bits (737), Expect = 1e-76, Method: Composition-based stats. Identities = 122/258 (47%), Positives = 159/258 (61%), Gaps = 1/258 (0%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 +KAVIAM H+ ALPG P +DA GM +ID D+ LQ GGV A+MF NE PY Sbjct: 9 RKKAVIAMAHIGALPGTPLYDADGGMMKLIDDVVGDIEKLQKGGVHAIMFGNENDRPYQF 68 Query: 70 KVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDF 129 + + AAM II + + +PFGVN LWDP AS +A+ATGA F REIFTG +ASD Sbjct: 69 EAPIASVAAMTAIISAVRPMLSVPFGVNYLWDPAASVAIAVATGASFAREIFTGVFASDM 128 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 GVW N E +R + + ++K LFNI E A L +R I A+S +F++ DA+ VS Sbjct: 129 GVWSPNAAEALRLRRNLHRPDLKLLFNINAEFASSLDSRSIGLRARSAIFSSLADAILVS 188 Query: 190 GLTAGTRTDSALLKRVKETVP-DTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFA 248 G G ++ L+ V+E + + + ANTGV LENV++ LSIADGCV T FK DG Sbjct: 189 GPLTGQPAQASDLREVREAIGTEVPLFANTGVRLENVDDVLSIADGCVIGTHFKVDGSTW 248 Query: 249 NFVDQARVSQFMEKVHHI 266 N VD RVS+FM+KV + Sbjct: 249 NRVDGGRVSRFMDKVATL 266 >UniRef50_Q8U2H5 Uncharacterized protein PF0860 n=8 Tax=Euryarchaeota RepID=Y860_PYRFU Length = 262 Score = 287 bits (734), Expect = 3e-76, Method: Composition-based stats. Identities = 67/265 (25%), Positives = 111/265 (41%), Gaps = 8/265 (3%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 +K++ ++K +I + HL+ LPG P + VI+ A D + G D ++ N Sbjct: 1 MKDLDFSKKPLIGVVHLKPLPGSPRYGGD--FEEVIEWAIRDAKTYEEAGFDGIIVENFG 58 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIF 121 P+ + E A + + ++ +P G+N L D + ++ +A A G FIR + Sbjct: 59 DSPFSKTLPREVIPAFTVVAKAVKKEVSLPLGINALRNDCIVAYSIAHAVGGSFIRVNVL 118 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 TG +D G+ + E RI G++ TL ++ + AV+ N K TV Sbjct: 119 TGVAFTDQGIIEGCARELWNV-KRIIGGDILTLADVHVKHAVHFTN--FEDAVKDTVERG 175 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 D + V+G G L K V VL +GV N ADG + T Sbjct: 176 LADGIIVTGRRTGESISLEDLILAK-RVSSIPVLVGSGVNPRNFRTLFKYADGFIVGTWV 234 Query: 242 KKDGVFANFVDQARVSQFMEKVHHI 266 K++G N V R + + + Sbjct: 235 KENGKINNPVSLERAKILVRMKNSL 259 >UniRef50_Q29E81 GA21203 n=3 Tax=Coelomata RepID=Q29E81_DROPS Length = 275 Score = 285 bits (731), Expect = 8e-76, Method: Composition-based stats. Identities = 68/276 (24%), Positives = 121/276 (43%), Gaps = 11/276 (3%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M ++ G +K VI M H+ ALPG P + I+KA + + +DAV+ Sbjct: 1 MRRFLKIFGQQKCKVIGMIHVDALPGTPRYAGHW--KETIEKAIYEANLYKRHQLDAVLI 58 Query: 60 SNEFSLPYLTK--VRPETTAAMARIIGQLMSDI--RIPFGVNVL-WDPVASFDLAMATGA 114 N +PY+ + + E TA M R+ + I IP GV VL + +A A+ Sbjct: 59 ENMHDIPYVPERLLGAEITACMTRLGQAVRDVIPKEIPCGVQVLACGNKQALAIAKASQL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 +FIR E F + +D G D G+ +R++ I A +V ++ + + + D+ + Sbjct: 119 QFIRSEGFVFGHVADEGYTDACAGDLLRYRKLIDAEDVLIFTDLKKKHSSHAITSDVSLL 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G G L+ + V +L +GV +N+ A Sbjct: 179 ETAHAAEFFLTDGIVITGTATGHAASPQDLQELSGRV-KVPLLIGSGVTKDNIGLYYKDA 237 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + + FK+ G + + + V FM KV +R+ Sbjct: 238 NAVIVGSHFKRHGSWLEEISEEAVENFMRKVCELRQ 273 >UniRef50_B9XI59 Photosystem I assembly BtpA n=1 Tax=bacterium Ellin514 RepID=B9XI59_9BACT Length = 262 Score = 283 bits (725), Expect = 4e-75, Method: Composition-based stats. Identities = 87/259 (33%), Positives = 131/259 (50%), Gaps = 6/259 (2%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 K +I + HL LPG P + +G V KA D + + GG DAV N +P+ Sbjct: 6 RRKVLIGVVHLGPLPGAPRWQGDIG--AVARKAVADARSYEQGGADAVFIENFGDVPFTK 63 Query: 70 K-VRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYA 126 V PET AAMA + + + +++P G NVL D A+ L A G F+R + TGA Sbjct: 64 SAVGPETVAAMAALGCAVRAAVKLPIGFNVLRNDARAALGLCAACGGSFVRVNVHTGAML 123 Query: 127 SDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL 186 +D G+ + N +T+R++ I G + ++ + AV LG+ I AK T+ DAL Sbjct: 124 TDQGLIEGNAYDTMRYREAISPG-TQVFADVHVKHAVPLGSWTIEDSAKDTIERGLADAL 182 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGV 246 VSG G + L+RV+ P+ +L +GV LEN + L +ADG + ++ K+ G Sbjct: 183 IVSGTGTGVAVNLDDLRRVRAACPEAKILLGSGVTLENAGDFLQLADGFIVGSSLKRGGK 242 Query: 247 FANFVDQARVSQFMEKVHH 265 AN VD RV+ + Sbjct: 243 LANPVDAKRVAALARAMRR 261 >UniRef50_A9W9X3 Photosystem I assembly BtpA n=4 Tax=Chloroflexaceae RepID=A9W9X3_CHLAA Length = 284 Score = 282 bits (721), Expect = 1e-74, Method: Composition-based stats. Identities = 81/266 (30%), Positives = 133/266 (50%), Gaps = 8/266 (3%) Query: 6 EVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSL 65 E+ T K +I M H LPG P + GM +I+ A D AL GG D ++ N + + Sbjct: 19 EMFRTAKPIIGMVHCWPLPGAPGYTG-YGMQTIIEHAIRDAEALAEGGCDGLIVENMWDI 77 Query: 66 PYL--TKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDP-VASFDLAMATGAKFIR-EI 120 P+ V PE+ AA A + + + +P G+N++ + VA +A+A GA FIR + Sbjct: 78 PFRAGPHVPPESIAAQAVVAHAVRQAVPELPLGINLVHNGGVALLGIAIAAGASFIRVCM 137 Query: 121 FTGAYASDFGVW-DTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 FTGA D G W + + +R + + A +K ++ + +V D+ + + T F Sbjct: 138 FTGAGVWDAGSWDEGCAADLMRRRKELHAESIKIFADVDKKHSVRFPGIDLVTHIEWTRF 197 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 DA+ VSG G D A +++ +E DT +L +G +N+ + +ADG + + Sbjct: 198 FG-ADAIIVSGRMTGDAPDIAKVRQARELAGDTPILLGSGTTEQNIAAFMEVADGVIVGS 256 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHH 265 + K+DG AN VD RV +F+ Sbjct: 257 SIKQDGEIANPVDVNRVRRFVAAARG 282 >UniRef50_C3ZBU0 Putative uncharacterized protein n=2 Tax=Metazoa RepID=C3ZBU0_BRAFL Length = 279 Score = 280 bits (718), Expect = 3e-74, Method: Composition-based stats. Identities = 86/281 (30%), Positives = 135/281 (48%), Gaps = 18/281 (6%) Query: 1 MSWLKEVIGT-EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M ++V G + A + M H+ ALPG P +G +IDKA + + G+DAVM Sbjct: 1 MQRFQKVFGRLQAAAVGMVHVGALPGTPRSSETVG--QLIDKACKEAEIYKRAGLDAVMV 58 Query: 60 SNEFSLPYL--TKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDP-VASFDLAMATGAK 115 N +PYL V E TAAM + ++ R+P GV VL + +A+ATG Sbjct: 59 ENMHDVPYLLGGDVGHEVTAAMTAVCREVRRVCPRLPCGVQVLSAANKQALAVALATGYV 118 Query: 116 FIREIFTGA------YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD 169 R S G ++ G+ +R++ +IGA + +I + + + D Sbjct: 119 PCRSGLRACGRVCVLPCSRRGAVNSCAGDLLRYRTQIGADSIMVFTDIKKKHSSHAITAD 178 Query: 170 --ICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEE 227 I A++ F D + V+G G DS LK V++ V D VL +GV EN+ Sbjct: 179 VSIADTARAAEFF-LSDGVIVTGTETGRPVDSKELKEVRQAV-DIPVLVGSGVSTENLPT 236 Query: 228 QLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 L A+G + + FKK G++ N VD RV+ FM+++ +R+ Sbjct: 237 YLR-ANGLIVGSYFKKHGLWQNEVDLDRVNMFMDRLSTLRQ 276 >UniRef50_C5ELG9 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ELG9_9FIRM Length = 275 Score = 280 bits (717), Expect = 3e-74, Method: Composition-based stats. Identities = 76/273 (27%), Positives = 124/273 (45%), Gaps = 7/273 (2%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFD-AQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M L+ + +K +I M HLR LPG P +D A + M +++ A D+ LQ+ GVD V Sbjct: 1 MRQLQSIFREKKPIIGMVHLRPLPGSPMYDPASMDMTKILEIAVDEAKKLQDAGVDGVQV 60 Query: 60 SNEFSLPY--LTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGA-KF 116 N + +PY + ET AA+A I ++ + IP G + + A ++ Sbjct: 61 ENMWDIPYNRPEDIGYETAAALAVGIYEVGKHVSIPVGAECHMNGAECAMASAAAAGARW 120 Query: 117 IREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY-LGNRDICSIA 174 IR + A+ S G + G R + R+ AG + L ++ + + + + Sbjct: 121 IRVFEWCNAFISQSGFVNGAGGRVSRMRDRLKAGHILALCDVNVKHGSHYIIHDRSVKEQ 180 Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADG 234 + DA+ V+G G + K + VL +G+ EN+ E LS ADG Sbjct: 181 AMDIEAQGGDAVIVTGFDTGMPPTVDKVLECKAAIG-IPVLLGSGLAEENITELLSAADG 239 Query: 235 CVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 + +TFK G + N VD R FM++V +R Sbjct: 240 AIVGSTFKAQGKWQNPVDYYRTKAFMDRVVKLR 272 >UniRef50_Q9VS44 CG8607 n=22 Tax=Eukaryota RepID=Q9VS44_DROME Length = 275 Score = 279 bits (715), Expect = 5e-74, Method: Composition-based stats. Identities = 62/276 (22%), Positives = 120/276 (43%), Gaps = 11/276 (3%) Query: 1 MSWLKEVIGTE-KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M +V + +I M H+ ALPG P + I+ A + + +DAV+ Sbjct: 1 MQRFLKVFKQQTCKIIGMIHVDALPGTPRYAGNW--KQTIENAIYEANLYKKHQLDAVLI 58 Query: 60 SNEFSLPYLTK--VRPETTAAMARIIGQLMSDI--RIPFGVNVL-WDPVASFDLAMATGA 114 N +PY+ + + E A M R+ + I IP GV VL + +A A+ Sbjct: 59 ENMHDIPYVPERLLGAEIVACMTRLGRAVREVIPQEIPCGVQVLACGNKQALAIAKASQL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 +FIR E F + +D G D G+ +R++ I A +V ++ + + + D+ + Sbjct: 119 QFIRAEGFVFGHVADEGFTDACAGDLLRYRKLIDAEDVLIFTDLKKKHSSHAITADVSLL 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G G L+++ V ++ +GV +N++ A Sbjct: 179 ETAHAAEFFMTDGIIITGTATGHAASPEDLQQLSGRV-KVPLIIGSGVTRDNIDSYYKDA 237 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + FK++G + + + V +FM+K+ +R Sbjct: 238 HAVIIGSHFKRNGNWLEEISEPAVDEFMQKICQLRH 273 >UniRef50_D2QXT9 Photosystem I assembly BtpA n=2 Tax=Bacteria RepID=D2QXT9_9PLAN Length = 266 Score = 276 bits (707), Expect = 5e-73, Method: Composition-based stats. Identities = 73/258 (28%), Positives = 118/258 (45%), Gaps = 6/258 (2%) Query: 13 AVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYL-TKV 71 VIAM HL LPG P + L ++ + + + L G +M N +P T+V Sbjct: 12 PVIAMLHLPPLPGSPR--SALSISAITEHVCREAEMLTALGAAGLMLENFGDMPLPATQV 69 Query: 72 RPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDF 129 P T A M+RI + +P G+NVL D +A+ +A A GA FIR I GA +D Sbjct: 70 SPATVAQMSRIAAAVRMASSLPLGINVLRNDSLAAMAIASAVGASFIRVNILVGARLTDQ 129 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 G+ E +R + +GA E++ ++ + + L + ++T+ DAL V+ Sbjct: 130 GIIAGRADELLRLRKSLGAEEIQIWADVNVKHSWPLAPVSLEEETENTIRRGLADALIVT 189 Query: 190 GLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFAN 249 G G TD L+ V T VL +GV +++ A G + + K G + Sbjct: 190 GRGTGYETDPHELQAVISAAAGTPVLVGSGVTADSLANF-QGASGAIVGSWIKHQGDARS 248 Query: 250 FVDQARVSQFMEKVHHIR 267 +D RV + M+ + + Sbjct: 249 PIDPERVRRLMQASRNSK 266 >UniRef50_D2RDD9 Photosystem I assembly BtpA n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RDD9_ARCPR Length = 249 Score = 275 bits (703), Expect = 1e-72, Method: Composition-based stats. Identities = 70/248 (28%), Positives = 110/248 (44%), Gaps = 15/248 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL LP P ++ + A D AL G DA++ N P+L +V Sbjct: 3 IIGVLHLDPLPSSPLYE---SYEKTFENALKDAKALAE-GCDAIIIENYGDKPFLKEVDR 58 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 T A M+ I ++ + +P G+NVL DP ++ +A A A F+R A S G Sbjct: 59 VTVACMSVIAWEVKRETGLPVGINVLRNDPFSALAIAKAVNADFVRVNQLYFASLSPEGF 118 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 + GE +R++ I + K ++ + A + + ++ V DAL V+G Sbjct: 119 LEGKAGEILRYRRFID-CKAKIYADVKVKHAHHFV--SLEDYLEN-VERCLADALIVTGT 174 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G D LK V+ + V +GV EN+ + + DG + T FKKDG V Sbjct: 175 ATGREVDVEELKAVRNLT-NLPVFVGSGVKPENLHRYVGLCDGVIVGTYFKKDG----RV 229 Query: 252 DQARVSQF 259 D RV + Sbjct: 230 DVERVRRL 237 >UniRef50_UPI000186CA08 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186CA08 Length = 278 Score = 274 bits (700), Expect = 3e-72, Method: Composition-based stats. Identities = 66/273 (24%), Positives = 140/273 (51%), Gaps = 12/273 (4%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 MS L +++ + +I M H++ALPG P + +L +N +I++A +D+ ++ V++++ Sbjct: 1 MSKLPDLLKMTRPYIIGMVHVKALPGTP--NNKLNINSLIEEACNDVEIYKSCNVNSILV 58 Query: 60 SNEFSLPYLTK--VRPETTAAMARIIGQLMSDI--RIPFGVNVLWDP-VASFDLAMATGA 114 N +PY+ V PE A+M +I ++ + + + GV +L + +A A Sbjct: 59 ENMHDVPYVQSKSVGPEIIASMTKICSEIKNILPRHMTCGVQILAGANKEALAVAQAAEL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 ++IR E + ++ +D G+ ++ GE +R++ IGA + +I + + D+ + Sbjct: 119 QYIRAEGYVFSHIADEGLMNSCAGELLRYRKYIGAENISIWTDIKKKHCSHSITSDLTLV 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G T G + +++ + ++ +GV ENV + L+ A Sbjct: 179 ETALAAEFFLSDGIVLTGKTTGNAIRKSDFIKIQNSC-SLPIVIGSGVTAENVADFLN-A 236 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + + + FKK+G+++N VD+ RV FM + Sbjct: 237 NAIIVGSYFKKEGLWSNEVDKNRVENFMNVLVE 269 >UniRef50_P72966 Photosystem I biogenesis protein btpA n=34 Tax=Cyanobacteria RepID=BTPA_SYNY3 Length = 287 Score = 274 bits (700), Expect = 3e-72, Method: Composition-based stats. Identities = 74/265 (27%), Positives = 129/265 (48%), Gaps = 6/265 (2%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 L + T VI + HL LP + L VI++A + AL GGVD ++ N F Sbjct: 3 LFQTFQTHNPVIGVVHLLPLPTSARWGGNLT--AVIERAEQEATALAAGGVDGIIVENFF 60 Query: 64 SLPYLTK-VRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EI 120 P+ + V P +AM I+ +L + + P G+NVL D ++ +A GAKFIR + Sbjct: 61 DAPFPKQRVDPAVVSAMTLIVDRLQNLVVAPVGINVLRNDAHSALAIASCVGAKFIRVNV 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 TG A+D G+ + N E +R++ + + +V L +++ + A LG ++ + T+ Sbjct: 121 LTGVMATDQGLIEGNAHELLRYRRELSS-DVAILADVLVKHARPLGTPNLTTAVTDTIER 179 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 D + +SG G+ + L+ T V +G +N+ + + A+G + A++ Sbjct: 180 GLADGIILSGWATGSPPNLEDLELATNAAKGTPVFIGSGADEDNIGQLIQAANGVIVASS 239 Query: 241 FKKDGVFANFVDQARVSQFMEKVHH 265 K+ G +D RVS F+E + Sbjct: 240 LKRHGNINEAIDPIRVSAFIEAMAE 264 >UniRef50_A6NZB9 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NZB9_9BACE Length = 266 Score = 270 bits (690), Expect = 4e-71, Method: Composition-based stats. Identities = 77/262 (29%), Positives = 117/262 (44%), Gaps = 6/262 (2%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 + +K VI M HL+ALPG P + M+ + A +DL AL+ GGVDA + N Sbjct: 7 FHRMFPGQKPVIGMVHLQALPGAPGYGG--SMDEIYRAAVEDLHALEQGGVDAAIVENFG 64 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS-FDLAMATGAKFIR-EIF 121 PY T AAM + QL ++ + G+NV ++ + + +A A G FIR E Sbjct: 65 DTPYALNHELITLAAMTALAVQLRAESSLRLGLNVQFNCTEAEWGIAYAAGYDFIRVEAL 124 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 GV +R + R A L +I + + + + + Sbjct: 125 VENRVGVHGVAFAAAPSLLRLKSRYPAE-TMLLADINVKHTYPMVEQPLDASIHEAKE-A 182 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 AL V+G+ G + R KE +T VL +G+ EN IADG + ++F Sbjct: 183 GAGALIVTGVVTGQNPSLEDVCRCKELAGETPVLLGSGIHQENAAAFFQIADGAIVGSSF 242 Query: 242 KKDGVFANFVDQARVSQFMEKV 263 K++G N VD RV +FME + Sbjct: 243 KENGDVRNKVDTGRVRRFMEAL 264 >UniRef50_Q8TVC9 Predicted TIM-barrel enzyme n=1 Tax=Methanopyrus kandleri RepID=Q8TVC9_METKA Length = 271 Score = 269 bits (687), Expect = 9e-71, Method: Composition-based stats. Identities = 82/256 (32%), Positives = 124/256 (48%), Gaps = 10/256 (3%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 V+ + HL LPG P + V+++A D L++GGVDAV+ N PY P Sbjct: 15 VVGVVHLPPLPGSPR---AKSIEEVVERARRDAARLEDGGVDAVLVENFGDTPYYPDDVP 71 Query: 74 E-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFG 130 + T A M R + +++ + +P GVNVL D VA+ D+ ATGA FIR + A A+D G Sbjct: 72 KITVACMTRAVAEVVDTVSVPVGVNVLRNDGVAAVDVCAATGASFIRVNAYVEAVATDQG 131 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V R R+G +V+ +I + L +R + +A+ V DA+ V+G Sbjct: 132 VLQPVAHMVWREIDRLGV-DVEVYADIRVKHGRPLDDRPVEEVARDAVERGLADAVIVTG 190 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTFKKDGVFAN 249 G+ +++V V VL +GV EN L A G + T FKK+G+ N Sbjct: 191 SATGSPPRPEEVRKVARVVDR--VLVGSGVTPENAHVFLRAGAAGFIVGTYFKKNGITEN 248 Query: 250 FVDQARVSQFMEKVHH 265 VD RV + + + Sbjct: 249 PVDVDRVRELVRFIRR 264 >UniRef50_B9CKX7 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CKX7_9ACTN Length = 270 Score = 265 bits (679), Expect = 9e-70, Method: Composition-based stats. Identities = 87/270 (32%), Positives = 138/270 (51%), Gaps = 4/270 (1%) Query: 1 MSWLKE----VIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDA 56 MS L E G+ +I H++ALPG P D+++ + I++ D LQ+ G DA Sbjct: 1 MSTLLEKHYATFGSSCPIIGCLHMQALPGTPFSDSKITLKNQIERLKRDAYTLQDAGFDA 60 Query: 57 VMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF 116 V+F+NE PY+T V +T A RI +++ ++ IP+G VL DP A+ A A AKF Sbjct: 61 VVFANEGDRPYITPVGFDTVANYVRIATEVIEELSIPYGCGVLIDPFATLAAAKALEAKF 120 Query: 117 IREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS 176 +R TG+Y FG N GE R+Q +I A +V+ P A L R + ++ Sbjct: 121 VRTYVTGSYEGLFGSQKFNPGEIFRYQKQIEATDVRVYTYFEPHAGTCLDVRSSEEMLEA 180 Query: 177 TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCV 236 + N G AG +++ + R+KE + ++ +G EN+ + L ADG + Sbjct: 181 GIANLPIAGALFGGAHAGLPPEASHIVRLKEEFTEVPLIIGSGGTAENISKLLPHADGVI 240 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHHI 266 T+ KKDG+ N VD R +F++ ++ Sbjct: 241 VGTSIKKDGILWNNVDPVRAKRFVKAAKNL 270 >UniRef50_A3K4E4 Putative uncharacterized protein n=1 Tax=Sagittula stellata E-37 RepID=A3K4E4_9RHOB Length = 282 Score = 265 bits (677), Expect = 1e-69, Method: Composition-based stats. Identities = 66/272 (24%), Positives = 119/272 (43%), Gaps = 9/272 (3%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S L+ + +K +I + HL ALPG P +D + + A D L GGVD +M N Sbjct: 12 SALETLFEKKKPIIGVIHLAALPGAPFYDGAP-LREIYAAAVRDAKTLAAGGVDGIMIEN 70 Query: 62 EFSLPYLT--KVRPETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR 118 +P+ + ET A + + + P G+ + + + +A A GA+++R Sbjct: 71 AGDMPFARPEDIGFETVAFLTAACEAVRGAVDTPIGITCVANGAIPGLAVAKAVGARWVR 130 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN--RDICSIAK 175 + AY ++ G + +R++ +I A +V L ++ + + R I A Sbjct: 131 VNQWANAYVANEGFLNGAASAAMRYRAQIAAKDVAVLADVHVKFGAHAITADRTITEQAT 190 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 + D L +G G+ T +++V+ V+ +G+ E V + +ADG Sbjct: 191 DAEWFG-ADVLIATGQRTGSPTQPEEVRQVRAGT-HLPVIVGSGLSPEQVPALMEVADGA 248 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 + K D + N VD ARV + M + +R Sbjct: 249 IVGQWLKVDARWWNPVDPARVERLMTAMDQVR 280 >UniRef50_B9XEW7 Photosystem I assembly BtpA n=1 Tax=bacterium Ellin514 RepID=B9XEW7_9BACT Length = 265 Score = 263 bits (672), Expect = 5e-69, Method: Composition-based stats. Identities = 67/261 (25%), Positives = 117/261 (44%), Gaps = 7/261 (2%) Query: 7 VIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLP 66 + + K +I M H+ ALPG P+ L + + + A + ++ GVD + N +P Sbjct: 5 LFASAKPIIGMIHVGALPGTPA--NHLSLGKITEIAVQEAKIYRDAGVDGIAIENMHDVP 62 Query: 67 YLTK-VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATG-AKFIR-EIFTG 123 YL V PE ++M I + G+ +L A ++R E F Sbjct: 63 YLRGGVGPEIVSSMTIIGQAVKQAFCGVTGIQILAAANREAMAAAHAAALDWVRVEGFVF 122 Query: 124 AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS-TVFNNH 182 A+ +D G ++ E +R++ +IGA +V+ +I + + + DI + Sbjct: 123 AHVADEGFINSCAAELLRYRKQIGAEKVQVWADIKKKHSSHAITADISLGETAHAAEFMR 182 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFK 242 DAL V+G G A + K V+ +G+ N+ + L +ADG + ++FK Sbjct: 183 ADALIVTGPVTGRPPVPADAEETKAHT-HLPVILGSGMNEANIGQFLPVADGFIVGSSFK 241 Query: 243 KDGVFANFVDQARVSQFMEKV 263 K G + N VD +V FM++V Sbjct: 242 KAGDWNNPVDSRKVKAFMKRV 262 >UniRef50_A8AAQ2 Photosystem I assembly BtpA n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8AAQ2_IGNH4 Length = 268 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. Identities = 77/255 (30%), Positives = 125/255 (49%), Gaps = 7/255 (2%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 VI + HL LPG + + VI++A D AL+ GGVDA++ N P+ +V Sbjct: 3 VIGVVHLLPLPGS--YGWGGDFDAVIERAVKDAKALEKGGVDAIIIENFMDYPFPIRVDY 60 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 T AA R++ +++ + + GV++L + + +A+A+GAKF+R + + G+ Sbjct: 61 VTVAAATRVVTEVVRSLELSAGVSLLRNSAPEAIAVALASGAKFVRSNQWCWTSDAPEGL 120 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 E + R GA +V + ++ + A + RD+C A+ DAL VSG Sbjct: 121 LTPVAREGLEVMRRWGA-KVGVVADVRVKHAAPISGRDLCDEARDLGGRCRADALAVSGA 179 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G+ D L+ VK P VL +G+ ENV + ADG + T FK+ GV N V Sbjct: 180 ATGSEADPRQLEVVKTCTPK-PVLVASGITPENVVRF-ASADGVIVGTYFKEGGVTENPV 237 Query: 252 DQARVSQFMEKVHHI 266 D RV + ++ + Sbjct: 238 DVHRVRKLVDAAKRL 252 >UniRef50_B5XQK9 BtpA family protein n=18 Tax=Proteobacteria RepID=B5XQK9_KLEP3 Length = 281 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 65/270 (24%), Positives = 125/270 (46%), Gaps = 9/270 (3%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 ++ + KAVI + H PG P + + ++ ++++A D +GGV ++ N Sbjct: 11 AIQAIFSRSKAVIGVIHCDPFPGSPKYRGK-SVSDIVERALRDAENYISGGVHGLIIENH 69 Query: 63 FSLPYLT--KVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR- 118 +P+ + ET+A MA I ++ +P G+NVL + + + +A+A GA F+R Sbjct: 70 GDIPFSKPEDIGHETSALMAVITEKVRERFAVPLGINVLANAAIPAMAIALAGGADFVRV 129 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL--GNRDICSIAKS 176 + AY ++ G + + +R++ + A ++ + + + +R I + + Sbjct: 130 NQWANAYIANEGFIEGAAAKALRYRSMLRAEHIRVFADSHVKHGSHAIVADRSIQELTRD 189 Query: 177 TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCV 236 F DA+ +G G A + ++ + +L +GV NV++ L G + Sbjct: 190 VDFFE-ADAVIATGQRTGDSATMAEIDEIRAAT-ELPLLVGSGVTPANVKQILGRTQGVI 247 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHHI 266 A+T K DGV+ N V+ ARV FM Sbjct: 248 VASTMKVDGVWWNDVELARVKHFMSVAQAA 277 >UniRef50_Q1NZ26 Uncharacterized protein F13E9.13, mitochondrial n=3 Tax=Caenorhabditis RepID=YSMU_CAEEL Length = 277 Score = 259 bits (662), Expect = 8e-68, Method: Composition-based stats. Identities = 72/269 (26%), Positives = 123/269 (45%), Gaps = 15/269 (5%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 + V M H+ ALPG P L M+ ++ K + GVD V+ N +PY+ Sbjct: 14 SRPLVFGMIHVPALPGTP--SNTLPMSAILKKVRKEADVYFKNGVDGVIVENMHDVPYVK 71 Query: 70 K-VRPETTAAMARIIGQLM--SDIRIPF---GVNVLWDP-VASFDLAMATGAKFIR-EIF 121 PE ++MA QL+ D P G+ +L + +A TG FIR E F Sbjct: 72 PPASPEIVSSMALASDQLVKSRDAHHPAALTGIQILAAANREALAVAYTTGMDFIRAEGF 131 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC--SIAKSTVF 179 ++ +D G D G +R++ + A + +I + + + D+ +AK F Sbjct: 132 VYSHVADEGWIDGCAGGLLRYRSSLKAENIAIFTDIKKKHSAHSVTSDVSIHEMAKDAKF 191 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 N D + V+G G+ + +V + V + VL +G+ +N E + A G + + Sbjct: 192 NC-ADGVIVTGSATGSAASLEEMIQVMK-VQEFPVLIGSGINGKNAREFVK-AHGFIVGS 248 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHHIRR 268 FK G + N +D R+S+FM+ V+ ++R Sbjct: 249 DFKIGGEWKNDLDSGRISKFMKHVNTLKR 277 >UniRef50_Q9Y937 BtpA homolog n=1 Tax=Aeropyrum pernix RepID=Q9Y937_AERPE Length = 287 Score = 259 bits (662), Expect = 9e-68, Method: Composition-based stats. Identities = 59/280 (21%), Positives = 109/280 (38%), Gaps = 18/280 (6%) Query: 7 VIGTEKAVIAMCHLRALPGD---------PSFDAQLGMNWVIDKAWDDLMALQNGGVDAV 57 V K ++ + HL LPG P + +I+ A + ++ G D V Sbjct: 3 VFQRCKPLLGVVHLPPLPGSTGYKARRYPPRLGKVWSLEEIIEYAVSEASVYEDAGFDGV 62 Query: 58 MFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKF 116 + N PY P +AM RI+ ++ S + IP GVN+L + V + A + G F Sbjct: 63 ILENYGDTPYPKTPGPLQVSAMTRIVREVSSAVGIPVGVNMLRNGSVEALASAYSGGGSF 122 Query: 117 IR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGE---VKTLFNIVPEAAVYLGNRDICS 172 IR S G+ + + + +G E ++ L ++ + + L I Sbjct: 123 IRVNSLCETRLSPEGILEPDAARLAKSLALLGILEERRIEILADVDVKHSQPLVETSIAQ 182 Query: 173 IAKSTVFNNHP--DALCVSGLTAGTRTDSALLKRVKETVPDTVV--LANTGVCLENVEEQ 228 + + + + ++G G D+ + T + V + +GV N+ + Sbjct: 183 TVRDCIERSGVPIAGVVLTGHATGGAPDADEVVAAARTASEYEVKTVVGSGVSQLNLSKY 242 Query: 229 LSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 IADG + ++ K G N +D+ + +RR Sbjct: 243 WHIADGFIIGSSIKLGGKPWNPIDKEKARLIASLAERLRR 282 >UniRef50_A4YFU5 Photosystem I assembly BtpA n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YFU5_METS5 Length = 258 Score = 257 bits (656), Expect = 4e-67, Method: Composition-based stats. Identities = 72/257 (28%), Positives = 121/257 (47%), Gaps = 8/257 (3%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK-VR 72 + M HL LPG P + ++ A + LQ+ GVDAV+ N P+ + Sbjct: 6 IAGMIHLPPLPGSPR--GGQPLEEIVKYAVTEADKLQSAGVDAVIVENLGDYPFFKDNMP 63 Query: 73 PETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDFG 130 P T A+M+ I+ ++ + + GVNVL + + +F LA GA FIR I GAYA+D G Sbjct: 64 PITVASMSVIVREVRRKLGLQVGVNVLRNGCIDAFSLAHVNGADFIRCNILIGAYATDQG 123 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V + E +R + + + V+ L ++ + A L N +A+ DA+ VSG Sbjct: 124 VIEGRAAELLRLKRSLNS-RVRILADVHVKHAYPLYNLPTELVAQDLAERGGADAVIVSG 182 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT-TFKKDGVFAN 249 + +K+VKE+V V+ +G+ L N +E +ADG + FK++G+ Sbjct: 183 PRSSLPPSIETVKKVKESV-QVPVIVGSGISLGNFKEFCGVADGLIVGEVDFKENGMIGG 241 Query: 250 FVDQARVSQFMEKVHHI 266 + ++ + Sbjct: 242 PSKVEAYKKLVKGCKGV 258 >UniRef50_A2BLV0 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BLV0_HYPBU Length = 285 Score = 249 bits (637), Expect = 7e-65, Method: Composition-based stats. Identities = 62/261 (23%), Positives = 112/261 (42%), Gaps = 9/261 (3%) Query: 12 KAVIAMCHLRALPGDPSF-DAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 K +I M HL PS+ ++ ++ ++D A + L + G +AV+ N PY Sbjct: 18 KPLIGMIHLPP---TPSYVKDRVDIDRLVDYALWEAGKLADAGFNAVIIENYGDHPYTVT 74 Query: 71 VRPETTAAMARIIGQLMSDIR--IPFGVNVLWDPVA-SFDLAMATGAKFIR-EIFTGAYA 126 + A+ARI ++ + G+N+L + + + A+ +GA FIR + Sbjct: 75 APSLSVLAIARIAAEVARTYSGKLRVGINILRNAAPQALEAALVSGASFIRVNSYCELRV 134 Query: 127 SDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL 186 S G+ R + + A V ++ + + L + I PDA+ Sbjct: 135 SMEGILTPAAYIIERIREELRA-PVLVFADVDVKHSAPLATASLEQILHDCARRGRPDAI 193 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGV 246 VSG G + +K VP ++ +G+ ++N+ +ADG + T+ K +G Sbjct: 194 IVSGSATGEPPSPGYVASIKAMVPYKPIIIGSGISIDNIMAYWRVADGFIVGTSIKLNGK 253 Query: 247 FANFVDQARVSQFMEKVHHIR 267 N VD+ R Q E V+ +R Sbjct: 254 TLNPVDERRARQLAELVNELR 274 >UniRef50_B8HZU1 Photosystem I assembly BtpA n=5 Tax=Clostridiales RepID=B8HZU1_CLOCE Length = 262 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 68/260 (26%), Positives = 122/260 (46%), Gaps = 7/260 (2%) Query: 8 IGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY 67 + V+ M H ALPG P F M + D+A + + L+ G+DA++ N + Sbjct: 6 FKDKPIVMGMVHCLALPGTPDFCGD--MKKITDQAVKEAITLEKSGMDAIIIENMGDNVF 63 Query: 68 LTKVRPETTAAMARIIGQLMSDIRIPFGVNV-LWDPVASFDLAMATGAKFIR-EIFTGAY 125 + E + A+A I + ++ IP G++ + D + +A A GA F+R +F Sbjct: 64 GVNMDIEQSCALAAISAIVAQNVNIPIGIDAAMNDYKTALSIAKAIGADFVRIPVFVDTV 123 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEA-AVYLGNRDICSIAKSTVFNNHPD 184 G+ E ++ + I A VK +I + + L + I AK+ D Sbjct: 124 EFFGGIIQPCAREAMKFRKNIEAENVKIFADIQVKHTHMVLPHVSIEDSAKAAEA-CGAD 182 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 A+ V+G G T ++KRVK+ + V+A +GV N++EQL IADG + ++ K+ Sbjct: 183 AIIVTGTHIGVETPIDIIKRVKKVI-SIPVIAGSGVKTNNIKEQLGIADGAIVGSSLKEG 241 Query: 245 GVFANFVDQARVSQFMEKVH 264 G N + ++ ++ ++ Sbjct: 242 GNIKNPISLELCTELIKALN 261 >UniRef50_O29828 Uncharacterized protein AF_0419 n=1 Tax=Archaeoglobus fulgidus RepID=Y419_ARCFU Length = 246 Score = 247 bits (631), Expect = 3e-64, Method: Composition-based stats. Identities = 82/258 (31%), Positives = 120/258 (46%), Gaps = 16/258 (6%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 EK VI + HL LPG P ++ VIDKA D A++ GG DA++ N P+L + Sbjct: 2 EKTVIGVVHLLPLPGSPEHT---DLSAVIDKAVKDARAIEEGGADALILENYGDKPFLKE 58 Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASD 128 V ET AAM I ++ D+ I G+NVL D VA+ +A A A F+R S Sbjct: 59 VGKETVAAMTVIACEVKRDVSIGLGINVLRNDAVAALAIAKAVNADFVRVNQLFFTSVSP 118 Query: 129 FGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIAKSTVFNNHPDALC 187 G+ + GE +R++ + +I + AV+ + D C A + DA+ Sbjct: 119 EGILEGKAGEVMRYKKLVD-CRAMIFADIAVKHAVHFASLEDYCLNA----ERSLADAVI 173 Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVF 247 ++G T G LK K+T+ VLA +GV EN L DG + T K+ G+ Sbjct: 174 LTGKTTGGEVSLEELKYAKKTL-KMPVLAGSGVNAENAARILKWCDGVIVGTYIKRGGL- 231 Query: 248 ANFVDQARVSQFMEKVHH 265 VD RV + + Sbjct: 232 ---VDAERVRRIVRAAKG 246 >UniRef50_A3DLN8 Photosystem I assembly BtpA n=1 Tax=Staphylothermus marinus F1 RepID=A3DLN8_STAMF Length = 260 Score = 247 bits (631), Expect = 3e-64, Method: Composition-based stats. Identities = 52/256 (20%), Positives = 109/256 (42%), Gaps = 8/256 (3%) Query: 16 AMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT-KVRPE 74 M HL LP P + + ++ +++ A ++ L G D V+ N P+ + P Sbjct: 5 GMIHLPPLPNSPQYSGE-KIDVILEYAINEAEKLVEAGFDGVIIENYMDYPFPVYEKDPV 63 Query: 75 TTAAMARIIGQLMSDI-RIPFGVNVLWD-PVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 + I ++ + I G+N+L + + S D+A FIR ++ + G+ Sbjct: 64 KLGFIEYIARRIREEFPNILIGLNILRNSGLESIDIACRNNLDFIRVNVYMETVLAPEGI 123 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 E ++++ + VK ++ + + L N + + ++T D + VSG Sbjct: 124 IKPLAYEIMKYKMQ-KKCNVKIYADVNVKHSQPLMNYTM--VLRNTCSRGLVDGVIVSGE 180 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G T + + K V+ +GV +N+ + +AD + T+ K +G+ N V Sbjct: 181 HTGYATPVSRVYVAKRICNGKEVIVGSGVNYQNIGLYIGLADAVIVGTSIKNEGITTNPV 240 Query: 252 DQARVSQFMEKVHHIR 267 + + +E+V ++ Sbjct: 241 NLQKAMYLVERVKRVK 256 >UniRef50_D2RQS6 Photosystem I assembly BtpA n=4 Tax=Halobacteriaceae RepID=D2RQS6_9EURY Length = 278 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 78/270 (28%), Positives = 125/270 (46%), Gaps = 13/270 (4%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 ++ L+ ++ V+ M HL +PG P ++ + V D+A +D L+ GGVD ++ Sbjct: 4 ITPLRTRFDADRPVVGMVHLPPVPGAPGYEGD--RDAVRDRALEDARRLEAGGVDGIVLE 61 Query: 61 NEFSLPYLTKVRPE-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR 118 N P+ P+ A M + + + +P G+NVL D A+ +A A A+F+R Sbjct: 62 NFGDAPFYPDDVPKHVVAEMTAVATAVTDAVDVPLGINVLRNDADAALSIAAAVDAEFVR 121 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 + G A+D GV + ET+R + RI A +V L ++ + A +G+R I A Sbjct: 122 VNVHVGTAATDQGVLEGRAHETLRLRDRIDA-DVAILADVHVKHATPIGDRSIDRAALEA 180 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKET------VPDTVVLANTGVCLENVEEQLSI 231 V D + VSG G T ++RV T V +GV E V + L+ Sbjct: 181 VERGRADGVIVSGPGTGDETALEDVERVAAALDGAGTAGRTSVFVGSGVTSETVGDCLAA 240 Query: 232 -ADGCVTATTFKKDGVFANFVDQARVSQFM 260 ADG + T K+ G N V + RV + Sbjct: 241 GADGVIVGTALKEGGETTNPVSRERVKALV 270 >UniRef50_UPI000155D1C1 PREDICTED: hypothetical protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155D1C1 Length = 261 Score = 243 bits (620), Expect = 6e-63, Method: Composition-based stats. Identities = 68/230 (29%), Positives = 111/230 (48%), Gaps = 8/230 (3%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWD 101 W D ++ D ++ N LPY PE TA M + + R+P GV VL Sbjct: 34 WRDGDGVRFPPQDGLIVENMHDLPYTASAGPEVTATMTAVCAAVRMTCPRLPLGVQVLCS 93 Query: 102 P-VASFDLAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 + +A+A G FIR E F ++ +D G + G+ +R++ RIGA V+ +I Sbjct: 94 ANQEAVAVALAAGCDFIRAEGFVFSHVADEGFVNACAGDLLRYRRRIGAEHVQIFADIKK 153 Query: 160 EAAVYLGNRD--ICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLAN 217 + + + D + AK+ F D + ++G G D L V++ V + +L Sbjct: 154 KHSAHALTADVSVSETAKAAEFF-LADGVILTGPATGVEADPGELHEVEQAV-NIPLLIG 211 Query: 218 TGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 +GV LENV+ L+ A+ + + FK+ G +AN +D RV FM+ V +R Sbjct: 212 SGVTLENVKSYLN-ANALIIGSYFKEGGYWANQIDPTRVKTFMDHVRKLR 260 >UniRef50_C5EEU2 Photosystem I assembly BtpA n=2 Tax=Clostridiales RepID=C5EEU2_9FIRM Length = 263 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 63/260 (24%), Positives = 112/260 (43%), Gaps = 8/260 (3%) Query: 9 GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYL 68 TEK V++M LPG + + ++ ++D+A + + D ++ N +P Sbjct: 6 RTEKVVLSMIQPEPLPGSYRH-SDMRIDAIVDRALRETEMVARNHFDGIIVQNMNDMPVK 64 Query: 69 TKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIR--EIFTGAY 125 + PE A M RI ++ + G+ + WD VA +A A GA F+R +FTGA Sbjct: 65 QQSSPEAIAYMTRIAYEIRKRFPELVMGILMNWDGVAGLCVADAVGADFVRVEHLFTGAS 124 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDA 185 + G+ + + + R G+ +V ++ + LG + + A V D Sbjct: 125 VTSAGILEAQCVDIAGVRKRTGS-KVPVYADVYEVHGIPLGRKPVGDAAWECVHEAFADG 183 Query: 186 LCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDG 245 L +SG + ++K + V DT + G +N+ E + DG AT K +G Sbjct: 184 LFMSGKS--VEESIRMIKEARPRVKDTPIFLGGGATGDNIHELMRYFDGVSVATWIK-NG 240 Query: 246 VFANFVDQARVSQFMEKVHH 265 N +D R +F+ + Sbjct: 241 DMKNPIDPERAKRFIAEAKR 260 >UniRef50_C0ZRZ3 Putative uncharacterized protein n=2 Tax=Rhodococcus erythropolis RepID=C0ZRZ3_RHOE4 Length = 280 Score = 237 bits (606), Expect = 2e-61, Method: Composition-based stats. Identities = 62/268 (23%), Positives = 115/268 (42%), Gaps = 9/268 (3%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S L E+ ++I HL ALPG P + Q ++ + A ++ A + G D V+ N Sbjct: 14 SALAEMFTGTPSLIGAIHLPALPGSPHYTGQP-VSEIARFAVEEAHAYVDNGFDGVIVEN 72 Query: 62 EFSLPYLTK--VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS-FDLAMATGAKFIR 118 + +P+L ET A+M I ++ + GV++L + A A GA F+R Sbjct: 73 HWDIPFLKPGEHGYETAASMGVITAAVVGEFGKAVGVSILSNAGECGVAAAWAAGASFVR 132 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL--GNRDICSIAK 175 + AY ++ G + +T R +HRIGA V+ ++ + + +R + + Sbjct: 133 VNQWANAYIANEGFIEGQAAKTTRFRHRIGADPVRIFADVHVKHGAHAIVADRTVAEQTE 192 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 F + D L +G G + +++ V+ +G+ NV + DG Sbjct: 193 DAEFFD-ADVLIATGSRTGDAASVDEVSVIRDNTV-LPVIIGSGITAANVAALMKECDGA 250 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKV 263 + A++ K +G + V +V + Sbjct: 251 IVASSVKDNGRWWGRVAGEKVRELSRAA 278 >UniRef50_UPI0001C369E0 btpA family protein n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369E0 Length = 274 Score = 237 bits (606), Expect = 2e-61, Method: Composition-based stats. Identities = 65/265 (24%), Positives = 112/265 (42%), Gaps = 10/265 (3%) Query: 9 GTEKAV-IAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY 67 G + V +AM PG + + +ID + +++ ++ G D + N P Sbjct: 4 GKQFPVALAMIQPEPFPGSFRHEGK-SFEEIIDISLNEIEMIEANGFDGYIIQNRNDAPV 62 Query: 68 LTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIR--EIFTGA 124 PETTA M + + + G+ V WD VAS +A A G+ FIR +TG Sbjct: 63 RQHALPETTAYMTALARECRRRFPDMIQGILVDWDGVASLAVADAAGSDFIRVEHTYTGV 122 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 G+ + + + + RIG+ ++ ++ L + I A TV N D Sbjct: 123 EVGYAGMMEAQCVDICQFKKRIGS-DIPVYADVQEVHYEQLAGKSIVDNAWDTVMNAFAD 181 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPD-TVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 L + G + ++K V++ + + + ++G +N+ + L DG T K Sbjct: 182 GLFLGGKSC--EESIEIIKCVRKRLGERIPIFLSSGATGDNISKILQYYDGVSVGTWVK- 238 Query: 244 DGVFANFVDQARVSQFMEKVHHIRR 268 +G N +D R QFME V R+ Sbjct: 239 NGNMRNPIDPVRARQFMEGVKSARK 263 >UniRef50_Q96YL8 Putative uncharacterized protein ST2156 n=1 Tax=Sulfolobus tokodaii RepID=Q96YL8_SULTO Length = 250 Score = 235 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 59/258 (22%), Positives = 104/258 (40%), Gaps = 18/258 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL LPG + + ++D A ++ L+ GG DAV+ N P+ KVR Sbjct: 3 LIGVVHLPPLPGSFFYKGE--FEEIVDFAINESKKLEVGGFDAVILENFNDKPFRKKVRV 60 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 ET AM+ I ++ + G+N+L + + +A TG FIR +S G+ Sbjct: 61 ETAIAMSIIAREVKKSTSLLVGINLLRNSAYEAASIASLTG-DFIRVNALCETISSPEGI 119 Query: 132 WDTNV---GETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV 188 + E + + R ++ L +I + A L ++ S+ D + V Sbjct: 120 IEPASVEVQEVLYYTKR----KISILADINVKHASPLHQMNLESLLLDCKERGFADYIIV 175 Query: 189 SGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFA 248 +G G + ++K +K P V +G+ N+ + D + T K Sbjct: 176 TGERTGKEPNPEVVKMIKNISP-LPVCVGSGMTPNNIRDY--KVDCFIIGTYLK---DTD 229 Query: 249 NFVDQARVSQFMEKVHHI 266 + RV + V I Sbjct: 230 GKIRVERVKEIANAVKSI 247 >UniRef50_A5GQP5 Photosystem I biogenesis protein BtpA n=4 Tax=Bacteria RepID=A5GQP5_SYNR3 Length = 275 Score = 222 bits (566), Expect = 1e-56, Method: Composition-based stats. Identities = 72/265 (27%), Positives = 118/265 (44%), Gaps = 7/265 (2%) Query: 6 EVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFS 64 + ++ +I + HL LPG P + V A D A GG D ++ N Sbjct: 9 SLFAHDRPALIGVLHLPPLPGSPRWQGD--FEAVRRFALADAAAYLAGGADGLVVENFGD 66 Query: 65 LPYLTKVRPE-TTAAMARIIGQLMSDI-RIPFGVNVLW-DPVASFDLAMATGAKFIR-EI 120 P+ P T AAMARI +++ +P G+NVL D A+ +A A+GA F+R + Sbjct: 67 APFFASAVPSHTVAAMARIAAEVVEAAAGVPVGINVLRNDAHAAMGIAAASGASFVRVNV 126 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 +GA +D G+ + E +R + ++ A EV +++ + A L + I + + Sbjct: 127 LSGAMLTDQGLIEGRAAELLRLRRQLEATEVGIFADVLVKHAYPLAPQPIGEAVEDCLGR 186 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 D + VSG+ G D L + VL +G N + ADG + A++ Sbjct: 187 AGADGVIVSGVATGAAPDPDDLAAARSAAGSAPVLIGSGCHAGNATSLGASADGVIVASS 246 Query: 241 FKKDGVFANFVDQARVSQFMEKVHH 265 K+D + AN VD RV + + Sbjct: 247 LKRDSLLANPVDPLRVQALRQTLQR 271 >UniRef50_B9LR39 Photosystem I assembly BtpA n=7 Tax=cellular organisms RepID=B9LR39_HALLT Length = 275 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 80/274 (29%), Positives = 120/274 (43%), Gaps = 11/274 (4%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSF--DAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 + GT+ VI M HL LPG P D M +D+A D AL GGVD +M N Sbjct: 3 FEATFGTDAPVIGMVHLPPLPGAPKAPADGVAAMRDALDRAAADARALDRGGVDGIMVEN 62 Query: 62 EFSLPYLTKVRPE-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR- 118 P+ P+ A++ R + ++ +P G+NVL D A+ +A A A ++R Sbjct: 63 FGDAPFYPDDAPKHVVASVTRAATAITTETDLPLGINVLRNDAEAALSVAAAVDADYVRV 122 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL-GNRDICSIAKST 177 + TGA +D GV ET+R + R+G +V + + + L T Sbjct: 123 NVHTGARVTDQGVVQGKAHETLRLRDRLGV-DVGVFADTDVKHSAPLSAEGYTAESFADT 181 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVP----DTVVLANTGVCLENVEEQLSIAD 233 DA+ SG G D L+ V DT VL +GV + V + L++AD Sbjct: 182 AERGLADAVIASGRGTGEAMDPEALESVVADRDAHGLDTPVLVGSGVREDTVGDVLAVAD 241 Query: 234 GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 G + T K+ G VD RV+ + + +R Sbjct: 242 GAIVGTALKEGGETTAPVDADRVAALVARADEVR 275 >UniRef50_C8S5Y9 Photosystem I assembly BtpA n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8S5Y9_FERPL Length = 249 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 70/255 (27%), Positives = 119/255 (46%), Gaps = 16/255 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 VI HL+ LPG P+F L ID A + + ++N G DA++ N P+ K P Sbjct: 3 VIVSLHLKPLPGSPNF---LNFEDCIDHAVRNAILIENCGADAIIIENFNDKPFFMKAPP 59 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 ET A+M+ I+ +++ ++ IP GVNVL D VA+ +A A GAKF+R A G Sbjct: 60 ETIASMSVIVREVIREVSIPVGVNVLRNDGVAALAIAKAAGAKFVRVNQMIFPAAMPEGF 119 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 + R+ R+ + K +I + +V L + + DA+ V+G Sbjct: 120 AKPIAAKMARY-KRLLNCDAKIFADISVKHSVQLAK---IEDFVDNIDRAYCDAVIVTGK 175 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G +++ L+++KE V D V+ +G EN+ + ADG + T K+ Sbjct: 176 KTGKPPEASTLRKIKELV-DVPVILGSGATPENLRKY--EADGVIVGTYVKEG----EEY 228 Query: 252 DQARVSQFMEKVHHI 266 ++ + + + + Sbjct: 229 SCEKLKRVVSEAKKL 243 >UniRef50_UPI000180CC4C PREDICTED: similar to F13E9.13 n=1 Tax=Ciona intestinalis RepID=UPI000180CC4C Length = 228 Score = 200 bits (509), Expect = 5e-50, Method: Composition-based stats. Identities = 62/273 (22%), Positives = 109/273 (39%), Gaps = 55/273 (20%) Query: 2 SWLKEVIG-TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 +V T +I M HL ALPG Sbjct: 4 RKFVDVFKKTNGVIIGMLHLPALPGT---------------------------------- 29 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIP--FGVNVLWDPVASFDLAMATGAKFIR 118 P++T ++A+I ++ + I G+ L+ +S + T FIR Sbjct: 30 ------------PKSTMSVAKICDVVLKEAEIYTRAGLFKLFISPSSNLVLYLTDLDFIR 77 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD--ICSIAK 175 E F ++ D G D+ +R++ +I A V +I + + + D I ++ Sbjct: 78 AEGFVFSHIGDEGFIDSCAASLLRYRKQIEADHVLVFTDIKKKHSSHSITSDTSISETSR 137 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 + F D + V+G G+ TD +K V++ V VL +GV +NV++ + Sbjct: 138 AAEFF-LSDGVIVTGNETGSSTDLNQIKDVQDEVG-IPVLVGSGVTADNVDKYIHT-SAL 194 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + FK GV++N VD V +FM+KV + + Sbjct: 195 IVGSHFKVGGVWSNPVDANLVQKFMKKVREMNK 227 >UniRef50_C2BTC6 Possible photosystem I biogenesis protein BtpA n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BTC6_9ACTO Length = 252 Score = 196 bits (499), Expect = 6e-49, Method: Composition-based stats. Identities = 64/285 (22%), Positives = 110/285 (38%), Gaps = 56/285 (19%) Query: 1 MSWLKEVIG-----TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVD 55 M + ++G EK ++ M HL+ ++ + +A + GG D Sbjct: 2 MQTKQNLLGNWPGNGEKLLLGMIHLKGN----------EIDDIYSRAVRECDIYARGGFD 51 Query: 56 AVMFSNEFS--------LPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFD 107 V+ N F LP L P+ + GV+V+WD SFD Sbjct: 52 GVIVENYFGTIDDVRYCLPRLQDKFPQ-----------------LYVGVDVIWDNDKSFD 94 Query: 108 LAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLG 166 LA+ FI + G E + + RI + + I+ V L Sbjct: 95 LAVEHQLPFIELDSLAGQLPPQ---------EEPQFEERIRWCQENSPAVIL--GGVRLK 143 Query: 167 NRDICSIAKSTV----FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCL 222 N+ + S V D + V+G+ G T+ + + + +E + D +L GV Sbjct: 144 NQPVLSGNPLEVDLMLAKKRGDGVIVTGVDTGVETELSKIIQFREIIGDFPLLVGAGVNE 203 Query: 223 ENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 +N EQL+IADG + ++ K+ G ++ RV + + V +R Sbjct: 204 KNCTEQLTIADGAIIGSSLKQGGNAKGDLEMDRVERLVTAVRALR 248 >UniRef50_Q2CH81 Putative uncharacterized protein n=1 Tax=Oceanicola granulosus HTCC2516 RepID=Q2CH81_9RHOB Length = 270 Score = 193 bits (490), Expect = 6e-48, Method: Composition-based stats. Identities = 63/263 (23%), Positives = 106/263 (40%), Gaps = 6/263 (2%) Query: 1 MSWLKEVIGTE-KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 MS L +++ + VI M L L G ++ + V++ A ++ L + G+D +M Sbjct: 1 MSRLLDMLARGGRPVIGMVQLPPLAGGANYGGAP-VGEVLEAALEEARVLADNGIDGLMV 59 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASF-DLAMATGAKFIR 118 N +P A M R ++ P G+N+L + V + +A A GA F+R Sbjct: 60 QNLGDIPVAHAATAAQVAWMTRATVEIGRIAACPVGLNMLENDVDAMFAVASAAGADFVR 119 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 ++F GA + FG+ R + G G++ L ++ L Sbjct: 120 IKVFVGAMVTPFGLEQGRAHAAARARRGCGGGDIAILADVHDRTGTPLATSGFEEDLDFA 179 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 + DA+ V+G + + R + P VL GV EN EE + A G + Sbjct: 180 LRLGGADAVVVTGKS--HAATLDMAARARAAHPAAHVLLGGGVTAENFEETMENASGAIV 237 Query: 238 ATTFKKDGVFANFVDQARVSQFM 260 +++ K G RV FM Sbjct: 238 SSSMKDSGSAVGRFVPERVEAFM 260 >UniRef50_Q16GL4 Putative uncharacterized protein n=2 Tax=Aedes aegypti RepID=Q16GL4_AEDAE Length = 191 Score = 187 bits (475), Expect = 3e-46, Method: Composition-based stats. Identities = 43/185 (23%), Positives = 87/185 (47%), Gaps = 6/185 (3%) Query: 87 MSDIRIPFGVNVL-WDPVASFDLAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQH 144 +I +P +VL + +A A FIR E F ++ +D G D N G+ +R++ Sbjct: 6 KDNISVPKQWHVLACGNEEALAVAKACNFDFIRAEGFVFSHVADEGFTDANAGQLLRYRR 65 Query: 145 RIGAGEVKTLFNIVPEAAVYLGNRDIC--SIAKSTVFNNHPDALCVSGLTAGTRTDSALL 202 I A ++ +I + + + DI AK+ F D + ++G + G + + Sbjct: 66 NIDAEHIQIFTDIKKKHSAHAITNDISLKETAKAAEFF-RSDGIIITGASTGCEANVDDV 124 Query: 203 KRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEK 262 + + + ++ +G+ EN+ + +IAD + + FK++G + + + +V FM K Sbjct: 125 ESLVGET-ELPLIIGSGITAENLNKYWNIADAAIVGSHFKENGNWRGALSEVKVQAFMNK 183 Query: 263 VHHIR 267 V+ R Sbjct: 184 VNGFR 188 >UniRef50_A9FN23 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FN23_SORC5 Length = 268 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. Identities = 65/274 (23%), Positives = 111/274 (40%), Gaps = 12/274 (4%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M + G KAV+ M HL LPG P F + +D A +AL GG D + Sbjct: 1 MRTFATL-GRRKAVLGMIHLAPLPGTP-FHEKGSFERTLDVAVQSAIALSEGGADGCLVQ 58 Query: 61 N-EFSLPYLTKVRPETTAAMARIIGQLMSDI--RIPFGVNVLWDPV-ASFDLAMATGAKF 116 E + P T AM I+ + GV ++ + + AS +A F Sbjct: 59 TVERVYGVKDESDPARTTAMGLIVDAIGRATGDDFQIGVQLMRNAIRASLAVAKVARGSF 118 Query: 117 IREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIA 174 +R GA ++ G+ + N E + ++ +I A VK + ++ +LG + + +A Sbjct: 119 VRAGALVGATLTEHGLVEANPLEVMEYRDKIDAWGVKIIADVASTQFTWLGGAKPVAEVA 178 Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADG 234 + H A VS A++ V+ PD +L N ++ ADG Sbjct: 179 RRA---KHVGADAVSLGDPDEAKTLAMIASVRAAAPDLPILLAGHTNHANAARLMAAADG 235 Query: 235 CVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 ++ G + +D+ RV+ ++E V + R Sbjct: 236 AFVGACLEQGG-WGGRIDRDRVAAYVEIVRGLER 268 >UniRef50_A3MXM0 Photosystem I assembly BtpA n=5 Tax=Thermoproteaceae RepID=A3MXM0_PYRCJ Length = 242 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 56/255 (21%), Positives = 99/255 (38%), Gaps = 27/255 (10%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL G P ++ A L+ G DAV+ N + +P+ K Sbjct: 2 LIGVVHLLPT-GSP---------QRLEHAVRSAKRLEEAGFDAVIVENYYDMPFKPKADF 51 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDFGV 131 E AMA ++ ++ +P G+N+L + V + +A GA FIR +T S+ G+ Sbjct: 52 EAAVAMAVAAREVAREVSLPVGINLLRNACVKASIIARHVGATFIRCNAYTDIVLSESGI 111 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 VK L ++ + + R + ++ P A+ V+G Sbjct: 112 LTPQAPYI---------KGVKVLADVHVKHGESIYPRTLAEAVEAASTRAAPAAIVVTGR 162 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G D L + D VL +G+C + + L IADG + T K + Sbjct: 163 KTGEAPDPVDLATAR-AYTDLPVLVGSGICFQTLP-LLKIADGAIVGTCVKDG----AEI 216 Query: 252 DQARVSQFMEKVHHI 266 D + + + + + Sbjct: 217 DPEKARRLVREAKAV 231 >UniRef50_UPI000069ECFE UPI000069ECFE related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI000069ECFE Length = 162 Score = 178 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 50/165 (30%), Positives = 85/165 (51%), Gaps = 7/165 (4%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M +L+ + GT K VI M H++ALPG P ++L + +I++A + +N G+D +M Sbjct: 1 MKFLQ-LFGTVKPIVIGMVHVKALPGTP--GSRLPVAQIIEEACHEAEIYKNAGIDGIMV 57 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVL-WDPVASFDLAMATGAKFI 117 N +PY PE TA MA I + +P GV +L + +A+A G FI Sbjct: 58 ENMHDIPYTFNTGPEITATMATICTAVKQACPHLPLGVQILSCANNQALAVALAAGLDFI 117 Query: 118 R-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEA 161 R E + ++ +D G + G+ +R++ IGA ++ +I + Sbjct: 118 RAEGYVFSHVADEGFVNACAGDLLRYRKAIGAEHIQIFADIKKKH 162 >UniRef50_Q166V4 Photosystem I biogenesis protein, putative n=2 Tax=Roseobacter RepID=Q166V4_ROSDO Length = 262 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 59/265 (22%), Positives = 112/265 (42%), Gaps = 17/265 (6%) Query: 6 EVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSL 65 ++ + K VIA HL D + + L + W D A + G+ + ++ Sbjct: 4 KLFDSNKPVIAALHLP----DFALNRHLSVAWYEDYAVANARVFAEAGIPWIKLQDQTKT 59 Query: 66 PYLTKVRPET---TAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIF 121 + P+T A++AR+I + +R+ V DP A+ +A A+GA FIR ++F Sbjct: 60 --AGQAAPDTLTLMASLARLIRSEVPQLRLGIIVEAH-DPGAALCVAHASGADFIRLKVF 116 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 G + G D E + + + ++ L +I A+ L + A + + Sbjct: 117 VGGAMTAQGPRDGLSAEVVAMRSELRRADIAILADIHDRTAMPLSSES-QPFAANWAVKS 175 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 D L ++G + + + V+++ +L GV NV E ++ ADG + ++ Sbjct: 176 GADGLVITGAS--FADTLSRISAVRDSGARRPILIGGGVTESNVHEAMAAADGVIVSSAL 233 Query: 242 KKDGVFANFV---DQARVSQFMEKV 263 + A+ V D +FM+ V Sbjct: 234 MRRDAAADDVIQWDADLCKRFMDAV 258 >UniRef50_C5EPH7 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EPH7_9FIRM Length = 273 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 59/267 (22%), Positives = 107/267 (40%), Gaps = 14/267 (5%) Query: 2 SWLKEVI--GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 SW++ G + + HL G D + +W++++ + G+ ++M Sbjct: 18 SWIRRRYRMGKSCRITGVVHLPPF-GADRLDLEGLESWLLEQ----IGIHAECGITSMMI 72 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIRE 119 ++ +A+ R + ++ D+ + + +P A+ +A A GA FIR+ Sbjct: 73 QDQTPGELAGLKNVAILSALGRTVKRMFPDLSLGIILEAN-NPSAAMYIANACGADFIRQ 131 Query: 120 -IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV 178 +F GA GV GE + + V+ L +I V LG I A Sbjct: 132 KVFIGAMVKAGGVMTGRAGEVWEARKDMDR-PVRVLTDIYDRTGVPLGPLPI-ETAAGQA 189 Query: 179 FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTA 238 D L ++G D A RV++ P V G+ +NV E + DG + + Sbjct: 190 LKYGSDGLILTGKNFEESLDLAD--RVRKQYPQAPVYLGGGITEKNVGEAVKHCDGMIVS 247 Query: 239 TTFKKDGVFANFVDQARVSQFMEKVHH 265 + +DG N + ++ +FME V Sbjct: 248 SCLLEDGK-DNVWSRQKIRRFMECVCG 273 >UniRef50_C9XNH3 Putative uncharacterized protein n=7 Tax=Firmicutes RepID=C9XNH3_CLODC Length = 247 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 54/267 (20%), Positives = 101/267 (37%), Gaps = 29/267 (10%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 + V ++K +IAM HL+ + ++A ++ + GVD +M N + Sbjct: 6 ILSVFKSKKPIIAMIHLK----------GDTPEDIFERAKKEITIFEENGVDGIMLENYY 55 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTG 123 Y + E + +++ IP+GVN L F+LA A +I Sbjct: 56 GNYYDLERILEYVS---------KANLSIPYGVNCLNVDTMGFELATKYNASYI------ 100 Query: 124 AYASDFGVWDTNVGETIR--HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 S G T+ + + + + + L D+ K + Sbjct: 101 QVDSVVGHVKPRDEATLEEFFKLQRSKCPAYLIGGVRFKYQPVLSENDVEEDLK--IGMT 158 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 DA+ V+ G T ++ ++ + D ++ GV LEN ++QL + D + + F Sbjct: 159 RCDAIAVTENATGQETSMEKIELFRKNLGDFPLVIAAGVTLENAKKQLELGDMAIIGSYF 218 Query: 242 KKDGVFANFVDQARVSQFMEKVHHIRR 268 K + V V FM+++ IR Sbjct: 219 KDNYKDFGNVSVEHVKTFMDEIKKIRE 245 >UniRef50_Q18HA0 Photosystem I biogenesis protein homolog n=1 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18HA0_HALWD Length = 223 Score = 140 bits (353), Expect = 5e-32, Method: Composition-based stats. Identities = 46/190 (24%), Positives = 84/190 (44%), Gaps = 8/190 (4%) Query: 49 LQNGGVDAVMFSNEFSLPYLTKVRPETTAAM-ARIIGQLMSDIRIPFGVNVLW-DPVASF 106 L+ G +DA++ N + P+ P+ T AM + +I + + +P V++L D A+ Sbjct: 34 LEAGSIDAILVKNLGNTPFHADDVPKHTVAMISALIKDIQRVVDVPISVDILRNDAEAAL 93 Query: 107 DLAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL 165 +A AT A FIR + G +D G+ ET+R + + +V+ L ++ + + Sbjct: 94 SIAAATTASFIRAGVHVGTLVTDQGIVTRRAAETLRLRDHLR-TDVEILADVSVKHSAPA 152 Query: 166 GNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETV----PDTVVLANTGVC 221 R + + H D + SG+ G + D L V + V ++GV Sbjct: 153 AERPLTETITDIISREHADGIIASGVGTGHKIDCGHLNTVVDVRDSLETGIPVFVDSGVT 212 Query: 222 LENVEEQLSI 231 LE + + S Sbjct: 213 LETIADIYSN 222 >UniRef50_C2JNA9 Photosystem I biogenesis protein BtpA n=9 Tax=Enterococcus faecalis RepID=C2JNA9_ENTFA Length = 247 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 51/266 (19%), Positives = 97/266 (36%), Gaps = 29/266 (10%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 E+ EK +I + HL+ + ++A ++ GVDA++ N + Sbjct: 7 FLELFAVEKPIIGVIHLK----------GKTDQEIQERAKKEIQIYSEHGVDAILMENYY 56 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF--IREIF 121 + + ++ D+ IP GVNVL F LA +F I + Sbjct: 57 GDYVQLEKALQYVTSL---------DLPIPIGVNVLNVDPLGFHLANKYHLQFLQIDSVV 107 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 D + R + K + + + L + + K + Sbjct: 108 GHVKPRDEASLQAFF-DLYRAK-----TTAKLIGGVRFKYQPMLSEKSVEEDLK--IAQQ 159 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 DA+ V+ G T +K ++ +P+ ++ G+ ++V+EQL+I D + + F Sbjct: 160 RCDAIAVTENATGEETSLEKIKLFRKQLPEFPLIVAAGLNDKSVKEQLAICDAAIVGSNF 219 Query: 242 KKDGVFANFVDQARVSQFMEKVHHIR 267 K + V FM+ V +R Sbjct: 220 KDTRKDTGDIYAPYVDSFMKIVKELR 245 >UniRef50_C0C181 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0C181_9CLOT Length = 262 Score = 121 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 49/267 (18%), Positives = 103/267 (38%), Gaps = 15/267 (5%) Query: 8 IGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF--SNEFSL 65 E +I H LP + + + + ++ G+D V N + Sbjct: 3 YAKEPVIIGAVH---LPYYGRNNPSQSVAEIEEYVMANVKVHYENGIDTVYIQDENLNTG 59 Query: 66 PYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIFTGA 124 P L + TA++A+++ + +++ + D VA A A GA F+R ++F G Sbjct: 60 PALPE-TIALTASLAKMVKMEVPGVKLGLIMQAH-DGVAPIAAAAAAGADFVRIKVFAGT 117 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 G+ ++++ I + VK L ++ + + + +A + D Sbjct: 118 MYKAEGIRTGVGETAVQYRTMINS-PVKILADVHDREGIPMPGVPV-DMAIGWASHIGAD 175 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 L ++G + L+ ++ VL V +N+ + L +G V +++ D Sbjct: 176 GLILTGHD--YKETMEYLETAEKMELGKPVLVGGSVSEDNIYDILDHCEGAVVSSSLMLD 233 Query: 245 GVFAN---FVDQARVSQFMEKVHHIRR 268 D ++ +F +KV H R+ Sbjct: 234 DPVPGSPLRWDAEKIRRFADKVRHYRK 260 >UniRef50_C2D712 Putative uncharacterized protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D712_9ACTN Length = 245 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 53/266 (19%), Positives = 91/266 (34%), Gaps = 26/266 (9%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS +K V M HL+ + V++ + GG+D V+ Sbjct: 1 MSNPYAKYFEQKRVFGMLHLK----------GESIPQVLECLKKEFDEYVKGGIDGVVVE 50 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-E 119 N + E AA+ + Q+ S I GVN L +LA A F++ + Sbjct: 51 NYY------NGCDEIIAALDYLHDQIGSQTLI--GVNCLRSESMGLELASAYKTDFVQLD 102 Query: 120 IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 G T+ + + + G L + + L + K Sbjct: 103 SVVGHVIPRDDATLTHFFKIWQ-EKYTG----MILGGVRFKKQPLLSENPLSEDLKIAQS 157 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 H A+CV+ G T + +E + D +L + G NV++ LS +G + + Sbjct: 158 RCH--AVCVTQAATGEETHLDKIISFREGLKDFPLLISAGATPTNVKKSLSYINGVIAGS 215 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHH 265 FK + V V + + V Sbjct: 216 YFKDTYEVSGTVCSEHVRELVRAVKE 241 >UniRef50_C5KQ75 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KQ75_9ALVE Length = 379 Score = 66.4 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 64/211 (30%), Gaps = 20/211 (9%) Query: 40 DKAWDDLMALQNGGVDAVM----FSNEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIP- 93 D+A G ++ N+ V P+ RII + I P Sbjct: 27 DQAVQQARIAVASGAHGILLINQVENDDGSVTTLPVNPD----FTRIISAVRRAIGDKPF 82 Query: 94 FGVNVLWDPVASFDLAMATGAKF-IREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVK 152 GVN L A L + T I + D G + E R + + Sbjct: 83 LGVNCLA-MTADVALPLVTNDDCRIDAYWADDARIDEGRGVADQVEAERI-SSVRSAHSS 140 Query: 153 TLFNIVPEAAVYLGNRDICSIAKSTVFNNHP----DALCVSGLTAGTRTDSALLKRVKET 208 F V + + + + D + SG G D + + ++ Sbjct: 141 IKFYF---GGVAFKKQRVVAEEDWSKAVALATPFMDVVVTSGTATGVPADINKIIQFRQA 197 Query: 209 VPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 + +GV EN+++ L D + AT Sbjct: 198 ADTNALAVGSGVTPENIDKYLPYVDCIIVAT 228 >UniRef50_D2VTE2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VTE2_NAEGR Length = 318 Score = 66.4 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 80/244 (32%), Gaps = 36/244 (14%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 + +V +K + + H+ W + A + L + VD + Sbjct: 72 IDKFYQVFKKKKVFLPVVHV----------------WDVAHALKNAKLLYDHHVDGLFLI 115 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIRE 119 N A I + + G+N+L + L F Sbjct: 116 NNN-----CSADILIDA-----IKSVRREFPDKWLGINILGISIRELFL-KIADLDFDGL 164 Query: 120 IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFN--IVPEAAVYLGNRDICSIAKST 177 A ++ + N+ E I+ ++G+ K L+ I + +D+ + Sbjct: 165 WLDSAMITEESEFQ-NIAEFIQ--DQLGSMNFKGLYFGGIAFK-YQRTVIKDLKKVID-- 218 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 + +++ D + SG G + LK+ E V + +GV +N+ + AD + Sbjct: 219 IASSYVDVILTSGEATGMQIKEEKLKKFTELVKCNPLGIASGVTNKNLITSIKHADVFIV 278 Query: 238 ATTF 241 T Sbjct: 279 GTYI 282 >UniRef50_Q0FCT7 Adenine phosphoribosyltransferase n=3 Tax=Rhodobacterales RepID=Q0FCT7_9RHOB Length = 253 Score = 62.9 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 1/82 (1%) Query: 184 DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 D + SG+ G D + +E+ + + +G+ ENV+E + D + AT Sbjct: 165 DIVVTSGIATGHAADVNKINIFRESCGENTLAVASGITPENVKEYIKNVDLFMVATGINF 224 Query: 244 DGVFANFVDQARVSQFMEKVHH 265 D F N +D ++++ M + + Sbjct: 225 DNDFYN-IDPNKLNRLMNVIKN 245 >UniRef50_A6G6X7 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G6X7_9DELT Length = 243 Score = 59.1 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 95/269 (35%), Gaps = 36/269 (13%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S + +V + ++ + H P A+ A + GV V + Sbjct: 5 SRVHQVFRVPRVLLPVIH-------PIGHAE---------AISAVDVCVAAGVRGVFLID 48 Query: 62 EFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIREI 120 + +R E A+A + + G+N+L DPV + A+ A + + Sbjct: 49 QG-------MRVEEVLALAVEVHA--RHPGLWVGLNLLALDPVEALRGALERCAGRLDGL 99 Query: 121 FTG-AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 ++ A+ + + + + + + + + + + A+ V Sbjct: 100 WSDDAHVHEGSREQPRAQAFVDARRELDWDGL-YFGGVAFKYRRPVPDEQLAEAAR--VA 156 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETV---PDTVVLANTGVCLENVEEQLSIADGCV 236 + D +C SG G L R+++ + + +GV L+NV + L D + Sbjct: 157 AGYMDVVCTSGAGTGIAAHRDKLARMRQGLAGRDGAALALASGVTLDNVADYLDFTDAFL 216 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHH 265 T +++ +D RV++ ++ Sbjct: 217 VGTGIERE---FGVLDPDRVARLQARIDA 242 >UniRef50_Q7M877 Tryptophan synthase alpha chain n=9 Tax=Epsilonproteobacteria RepID=TRPA_WOLSU Length = 255 Score = 49.8 bits (118), Expect = 9e-05, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 29/80 (36%), Gaps = 2/80 (2%) Query: 165 LGNRDICSIAKST-VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLE 223 LG I +IA F ++G L++ ++ P+ + GV Sbjct: 148 LGTSRIATIAPMARKFIYLVAYAGITGSGREEPLSP-LIEEIRAINPEIPLYLGFGVNEH 206 Query: 224 NVEEQLSIADGCVTATTFKK 243 N +E+ DG + + K Sbjct: 207 NAKEKSKEVDGVIVGSALVK 226 >UniRef50_Q4JTH5 L-lactate dehydrogenase n=6 Tax=Actinomycetales RepID=Q4JTH5_CORJK Length = 425 Score = 48.7 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 49/139 (35%), Gaps = 20/139 (14%) Query: 45 DLMALQNGGVDAVMFSNEFSL-----PYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 D L + GVD ++ SN P ++ PE + ++ +D+ + ++ Sbjct: 285 DSKKLADLGVDGIILSNHGGRQLDRAPVPFQLLPE-------VAREVGNDVDVAMDTGIM 337 Query: 100 WDPVASFDLAMATGAKFI---REIFTGAYASDFGVWDTNVGETI--RHQHRIGAGEVKTL 154 +A GAKF R G A + E + + + +V +L Sbjct: 338 NGADIVAAIAK--GAKFTLIGRAYLYGLMAGGEAGVN-RAIEILASEVRRTMRLLQVSSL 394 Query: 155 FNIVPEAAVYLGNRDICSI 173 + PE L ++ + Sbjct: 395 DELTPEHVTQLNTLNLNQV 413 >UniRef50_C0QR82 Thiamine-phosphate pyrophosphorylase n=1 Tax=Persephonella marina EX-H1 RepID=THIE_PERMH Length = 209 Score = 48.3 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 62/184 (33%), Gaps = 32/184 (17%) Query: 80 ARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGET 139 A +I ++ IPF VN D+A+A A G + + Sbjct: 53 AVVIKKVCRKYDIPFIVN------DRIDIAIAVDAD-------GVHLGQDDLDVEVARRI 99 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS 199 + + IG K +++ ++ + S+ ++ DA+ Sbjct: 100 LGFEKIIGLSTKKI-EDVIKANSLPVDYIGFGSVFPTSTKE---DAVYAG---------L 146 Query: 200 ALLKRVKETVPDTVVLANTGVCLENVEEQLSIA--DGCVTATTFKKDGVFANFVDQARVS 257 LK V + VV G+ +N+ + L + V + FK D + N R+ Sbjct: 147 EKLKEVMKISVQPVVAIG-GINEKNLTDLLKTGCRNVAVVSAVFKDDNIKEN---TERLK 202 Query: 258 QFME 261 ME Sbjct: 203 NIME 206 >UniRef50_B9T9A8 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9T9A8_RICCO Length = 225 Score = 47.5 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 72/236 (30%), Gaps = 32/236 (13%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLM-SDIRIPFGVN 97 D+A + + G V +L + E + ++ GVN Sbjct: 14 TDQALRNAEIAFDAGCPGV---------FLISMDGED-ELLGPAAKEVKGRWGGKLVGVN 63 Query: 98 VLW-DPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFN 156 L V + L + G G +++ G V + + + T F Sbjct: 64 YLSLSAVTALRLNLTHGLDLTWTDNAGVHSTGLGTLAHLVAD------ELKSAPEHTFF- 116 Query: 157 IVPEAAVYLGNRDICSIAKSTVFNNHPDALCV----SGLTAGTRTDSALLKRVKETVPDT 212 A + + LC+ SG G D ++ ++ + Sbjct: 117 ----GACGFKGQRAEP--DTAAAAVMAAGLCMLPTTSGSATGVAADLQKIRSIRAALGTG 170 Query: 213 VVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + +G+ ENV + + AT D F NF + +++ + K+ R Sbjct: 171 PLAVASGITPENVLDYAPYVSHFLVATGVSDD--FYNF-NFEKLAVLVGKLRTFSR 223 >UniRef50_A3TLL5 Tryptophan synthase alpha chain n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TLL5_9MICO Length = 270 Score = 47.5 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 26/73 (35%), Gaps = 5/73 (6%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK--- 243 V+G + + L + V D V GV + E ADG + T F + Sbjct: 183 VTGERTSVGSSARELVDRTKDVTDLPVCVGLGVSNGDQAAELAQYADGVIVGTAFVRTLS 242 Query: 244 -DGVFANFVDQAR 255 DG +D R Sbjct: 243 GDGELGARLDALR 255 >UniRef50_A5IKT4 Tryptophan synthase alpha chain n=5 Tax=Thermotogaceae RepID=TRPA_THEP1 Length = 239 Score = 46.4 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 31/78 (39%), Gaps = 7/78 (8%) Query: 193 AGTRTDS---ALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFA 248 G R D +KRVKE + + G+ E VE+ IADG + + + + Sbjct: 162 TGEREDLPFADHIKRVKERI-KLPLFVGFGISRHEQVEKVWEIADGAIVGSALVR--IME 218 Query: 249 NFVDQARVSQFMEKVHHI 266 + +EKV + Sbjct: 219 ESPKDEIPKKVVEKVKEL 236 >UniRef50_A8VXV8 Tryptophan synthase alpha chain n=2 Tax=Bacillus RepID=A8VXV8_9BACI Length = 266 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 73/250 (29%), Gaps = 44/250 (17%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 D + + + LQ+ GV+A+ E+ +PY + A L + + + + Sbjct: 28 DLSVEIALMLQDAGVEAI----EWGVPYSDPLADGPVIQQA-GQRALKNGGSLTVSLQKM 82 Query: 100 WDPVASFDLAMATGAKFIREIFTGAY------ASDFGVWDTNVGETI-----RHQHRIGA 148 + A + ++ + + + D+G + + ++ Sbjct: 83 KEARAKGLTVPSVLFTYVNPVLSYGFTKLIEELKDYGFDGLLIPDLPYEESHEYRALCRE 142 Query: 149 GEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-------VSGLTAGTRTDSAL 201 + + I + I K D V+G Sbjct: 143 KGISLIPLI-----APSSKSRVEKITKE------ADGFVYYVTSLGVTGTRESFSETLKE 191 Query: 202 LKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK-----DGVFANFVDQAR 255 ++ VLA G+ E+V+ ADG + + + + N ++ Sbjct: 192 EINTVKSFSKVPVLAGFGISTPEHVQYFQEHADGAIVGSALVRKIASLEDSLKNPEEKDA 251 Query: 256 V----SQFME 261 F++ Sbjct: 252 ALNEIKAFVQ 261 >UniRef50_A6TM77 Tryptophan synthase alpha chain n=10 Tax=Clostridia RepID=TRPA_ALKMQ Length = 266 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 35/83 (42%), Gaps = 3/83 (3%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF--KKD 244 V+G + + T + ++ G+ E +++ +I DG + + K + Sbjct: 181 VTGKRNSLAGNLEGFMQQLRTYTEIPLVIGFGISNSEMMDKLKNICDGFIIGSAVIEKIE 240 Query: 245 GVFANFVDQARVSQFMEKVHHIR 267 + RVS+F+EK++ + Sbjct: 241 AGLEDRSSVERVSKFIEKLYEFK 263 >UniRef50_Q3ABS4 Tryptophan synthase alpha chain n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=TRPA_CARHZ Length = 267 Score = 45.2 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 45/281 (16%), Positives = 98/281 (34%), Gaps = 31/281 (11%) Query: 1 MSWLKEVI-----GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMA----LQN 51 MS + +V EKA+IA + GDP+ L + + A DL+ + Sbjct: 1 MSRIGQVFAEKRSRGEKALIA----YTMGGDPNLTFSLEIIKTLAAAGADLIEVGLPFSD 56 Query: 52 GGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMA 111 D + PE A+ I ++ +P + +P+ + Sbjct: 57 PLADGPVIQRAGQRALAAGSGPEEVLAL---IAAARQELSLPLVIMSYLNPILQIGV--- 110 Query: 112 TGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC 171 +F+R A A G+ ++ + R+ A +++P A G + + Sbjct: 111 --DEFLRRA---AGAGADGLIIPDLPVEEGEEIRVSAAGYGL--DLIPLVAPTTGQKRLE 163 Query: 172 SIAKSTVFNNHPDALC-VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQL 229 I + ++ V+G + L + + + + V G+ E + Sbjct: 164 KIVGQASGFIYCVSVTGVTGARDSLPAEVISLLQNVKKLTELPVCLGFGIGKPEQIAYIK 223 Query: 230 SIADGCVTATTFKK--DGVFANFVDQARVSQFM-EKVHHIR 267 DG + + + + N +++ +V + + KV ++ Sbjct: 224 DYCDGVIVGSALVEIIENYVQNRMEKDKVLELIATKVQTLK 264 >UniRef50_C7N999 Tryptophan synthase alpha chain n=2 Tax=Leptotrichia RepID=C7N999_LEPBD Length = 257 Score = 44.8 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 75/211 (35%), Gaps = 37/211 (17%) Query: 67 YLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYA 126 +L TT + ++ ++ +DI P + ++ + ++ + +FI++ A Sbjct: 71 FLASEAGVTTDTVFDLLTEIKNDISKPLIFLIYYNLIFAYGI-----DEFIKKC-CEANV 124 Query: 127 SDFGVWDTNVG--ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 G+ ++ E ++ + + L + S + + D Sbjct: 125 --KGIIIPDLPYEEAFEMSEKLRENNIALI---------PLV--SVTSGNRMKKIISQGD 171 Query: 185 ALC-------VSGLTAGTRTDSAL-LKRVKETVPDTVVLANTGV-CLENVEEQLSIADGC 235 V+G + ++E V D V G+ +NV ADG Sbjct: 172 GFIYAIGSLGVTGSKQVDLPRLESFINEIRE-VSDLPVSLGFGIKNNDNVNTMRKYADGV 230 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 + T+ + F+++ V+ ++K++ + Sbjct: 231 IVGTSIVE------FLEKNDVNYLIQKINEL 255 >UniRef50_Q67PJ3 Tryptophan synthase alpha chain n=3 Tax=Clostridia RepID=TRPA_SYMTH Length = 279 Score = 44.8 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 5/82 (6%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKD-- 244 V+G+ + + V DT V G+ E V + ++AD + + F + Sbjct: 182 VTGVRDRLPPQLTAMVEAVKAVTDTPVAVGFGISRPEQVRQVTAVADAAIVGSAFVRHCG 241 Query: 245 -GVFANFVDQARVSQFMEKVHH 265 G+ + + RV E++ Sbjct: 242 EGLPEHEL-VERVRILAEELKA 262 >UniRef50_B5YJ74 Thiamine-phosphate pyrophosphorylase n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YJ74_THEYD Length = 206 Score = 44.8 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 43/129 (33%), Gaps = 21/129 (16%) Query: 106 FDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL 165 D+A+A A G + G V E + + IG + EA+ + Sbjct: 71 IDIALAVEAD-------GVHLPQSGFPPRIVREVWKDRFLIGVSTHSI--DEAKEASEWA 121 Query: 166 GNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENV 225 I + S G LK VKE+V V A G+ LENV Sbjct: 122 DFITFSPIFHTP-----------SKAHYGEPQGVEKLKEVKESVKCK-VFALGGIKLENV 169 Query: 226 EEQLSIADG 234 E + DG Sbjct: 170 HELIPYCDG 178 >UniRef50_B9K6Z6 Tryptophan synthase alpha chain n=1 Tax=Thermotoga neapolitana DSM 4359 RepID=TRPA_THENN Length = 240 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 5/55 (9%) Query: 193 AGTRTDS---ALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 G R D +K+VK+ + + G+ E V + IADG + + + Sbjct: 162 TGEREDLPFAEHIKKVKKKIA-LPLFVGFGISRHEQVRKVWEIADGVIVGSALVR 215 >UniRef50_Q5KXV2 Tryptophan synthase alpha chain n=6 Tax=Bacillaceae RepID=TRPA_GEOKA Length = 268 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 71/247 (28%), Gaps = 49/247 (19%) Query: 21 RALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK--VRPETTAA 78 L + ID A AL+ G D + +PY P A Sbjct: 7 PPLFIPFIVAGDPAPDVTIDLAL----ALEEAGAD--ILE--LGVPYSDPLADGPTIQRA 58 Query: 79 MAR-------------IIGQLMSD-IRIPFGVNVLWDPV------ASFDLAMATGAKFIR 118 AR +IG++ + IP + ++PV + F LA GA Sbjct: 59 AARALDGGMTLPKAIQLIGEMRKKGVNIPIILFTYYNPVLQLGEESFFALARENGAD--- 115 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV 178 G D ++ + R G + + + I IA + Sbjct: 116 ----GVLIPDLPFEESGPLRELG--ERFGLPLISLVA--------PTSKQRIERIASAAQ 161 Query: 179 FNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCV 236 + +L V+G+ R + V G+ E V + DG V Sbjct: 162 GFLYCVSSLGVTGVRETLPETLGDFLREVKRHSRVPVAVGFGISAPEQVAMLKEVCDGVV 221 Query: 237 TATTFKK 243 + + Sbjct: 222 VGSALVQ 228 >UniRef50_B8JAL9 Tryptophan synthase alpha chain n=4 Tax=Anaeromyxobacter RepID=B8JAL9_ANAD2 Length = 285 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 30/87 (34%), Gaps = 8/87 (9%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK--- 243 V+G + A L D V+ GV E +ADG V + K Sbjct: 201 VTGARHAVAEEIAPLVSAVRARTDLPVVIGFGVASPEQARALGPLADGVVVGSAIVKRIA 260 Query: 244 -DGVFANFVDQARVSQFMEKV-HHIRR 268 G RV++F+ + +RR Sbjct: 261 EGGSRRAR--AERVTRFVRSLGRALRR 285 >UniRef50_Q2LUE0 Tryptophan synthase alpha chain n=1 Tax=Syntrophus aciditrophicus SB RepID=TRPA_SYNAS Length = 265 Score = 44.1 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 44/257 (17%), Positives = 87/257 (33%), Gaps = 40/257 (15%) Query: 31 AQLGMNWVIDKAWDDLMALQNGGVD----AVMFSNEF-SLPYLTKVR---PETTAAMARI 82 ++ + L+ GGVD V FS+ P + +T ++RI Sbjct: 26 GDPDLDKTREILV----GLKEGGVDILEIGVPFSDPTADGPVIQAAAQRALKTGTTLSRI 81 Query: 83 ---IGQLMSDIRIPFGVNVLWDPVASFDL------AMATGAKFIREIFTGAYASDFGVWD 133 I L I +P + ++P+ ++ A A G G D + + Sbjct: 82 LDMIQDLRKIIDLPVVLFGYYNPIYAYGTERFAERAKAAGVD-------GLLVVDLPLEE 134 Query: 134 TNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-VSGLT 192 + + + + + I P + +C IA+ + ++ V+G Sbjct: 135 AE-----ELRGKTDSKGLDFITLIAPTTS----EERMCRIARRAQGFIYYISITGVTGTA 185 Query: 193 AGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFV 251 +R + R T D ++ G+ E E S+ADG V + F + + N Sbjct: 186 TPSRDNVEREIRRIRTHSDLPLVVGFGISTPEQARELASLADGIVIGSAFVRL-IAENAD 244 Query: 252 DQARVSQFMEKVHHIRR 268 ++ I++ Sbjct: 245 SPELAARVSSFAREIKK 261 >UniRef50_D1BQN8 Tryptophan synthase alpha chain n=17 Tax=cellular organisms RepID=D1BQN8_VEIPT Length = 263 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 43/262 (16%), Positives = 72/262 (27%), Gaps = 47/262 (17%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS +K+ KA I G+ + + G D V Sbjct: 1 MSKIKDAFTKGKAFI----------PFISAGDHGIENTERY----IRIMVKAGADMVEI- 45 Query: 61 NEFSLPYLTKV--RPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATG----- 113 +P+ P A R + + I V L + + + Sbjct: 46 ---GIPFSDPTAEGPVIQEASTRALSTGVKINDIFDMVRCLRTGEEAVTVPLVFMTYLNP 102 Query: 114 -AKFIREIFTGA--YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAV------- 163 F RE F GV + + E L ++ + V Sbjct: 103 IYVFGREKFFTLCEEVGISGVIVPD----------MPFEEKGELASVAHKHGVEVVSLIA 152 Query: 164 YLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC- 221 I IAK + +L V+G+ + +TD + D V G+ Sbjct: 153 PTSENRIEMIAKDAEGFVYCVSSLGVTGMRSEIKTDIKSIVETIRKYTDIPVAVGFGISK 212 Query: 222 LENVEEQLSIADGCVTATTFKK 243 E E ++DG + + K Sbjct: 213 PEQAETMARVSDGAIVGSAIVK 234 >UniRef50_B0NZY9 Tryptophan synthase alpha chain n=2 Tax=Clostridiales RepID=B0NZY9_9CLOT Length = 262 Score = 43.7 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 33/106 (31%), Gaps = 2/106 (1%) Query: 163 VYLGNRDICSIAKSTVFNNHPDALC-VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC 221 + I IAK + + V+G + TD V + G+ Sbjct: 154 TPTSHDRIAMIAKEAEGFLYCVSSIGVTGTRSEFTTDFDEFFGVIKKNATIPCAVGFGIS 213 Query: 222 -LENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 E ++ + DG + + K +V +F + + + Sbjct: 214 GPEQAKKMSTYCDGVIVGSAIVKLISQYGKESPEKVYEFTKSLRDV 259 >UniRef50_D1B8G5 CutC family protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B8G5_THEAS Length = 225 Score = 43.3 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 68/239 (28%), Gaps = 22/239 (9%) Query: 35 MNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPF 94 VI + + ++ G D V F + S LT PE AA + + IP Sbjct: 2 FVEVIAVSPWEAELVEACGGDRVEFVLDLSCGGLTPSVPEVAAA--------VRGVSIP- 52 Query: 95 GVNVLWDPVASFDLAMATGAKFIREIF-----TGAYASDFGVWDTNVGETIRHQHRIGAG 149 VNV+ P +R GA G + + + Sbjct: 53 -VNVMIRPRPGGFQYSPGEMDQMRRSAQAMAEVGARGLVMGFLKDGAVDLDALKSALTWC 111 Query: 150 EVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETV 209 D A+ D L SG + L+R+ E Sbjct: 112 PGIDFTF----HRAIDEASDPVEAARVACGAGVTD-LLTSGGPGPIEGNLDRLRRMVEAA 166 Query: 210 PDTVVLANTGVCLENVEEQL--SIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 V+A G+ EN + + ++D +D + + +E V + Sbjct: 167 GSVRVMAGGGITGENAPRVILHGGVPAVHLGRSVRRDNSPTEPIDSQLLRRMVELVKGV 225 >UniRef50_A9A2A1 Tryptophan synthase alpha chain n=3 Tax=Thaumarchaeota RepID=A9A2A1_NITMS Length = 268 Score = 43.3 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 25/59 (42%), Gaps = 4/59 (6%) Query: 189 SGLTAG-TRTDSALLKRVKETV-PDTVVLANTGV-CLENVEEQLSI-ADGCVTATTFKK 243 +G+ G +K VK+ V GV ++V++ + ADG + + F K Sbjct: 181 TGVKTGIKNYTIDAIKNVKKQTKGKIPVGVGFGVSTPDDVKKYIKAGADGVIVGSAFLK 239 >UniRef50_B5Y770 Enoyl-(Acyl-carrier-protein) reductase II n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y770_COPPD Length = 297 Score = 42.9 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 51/165 (30%), Gaps = 30/165 (18%) Query: 83 IGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRH 142 I ++ + PFGVN++ ++F ++ V T G Sbjct: 54 IREVRRNTSKPFGVNLM------------LQSEFWQDQIKVVLEEKPPVITTGAGNPSSF 101 Query: 143 QHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV----SGLTAGTRTD 198 + +K L L + + D + SG G T Sbjct: 102 MKTLKEKGIKIL---------PLVG---SANQALLLEKAGADGVIAEGKESGGHIGDVTT 149 Query: 199 SALLKRVKETVPDTVVLANTGVCLENVEEQLSI--ADGCVTATTF 241 L+ V ++V + V+A G+ ++ + A G T F Sbjct: 150 IVLVNAVLKSVTNIPVIAAGGIVDKDSYRAMRAMGAAGVQMGTRF 194 >UniRef50_O27697 Tryptophan synthase alpha chain n=3 Tax=Methanobacteriaceae RepID=TRPA_METTH Length = 270 Score = 42.9 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 75/235 (31%), Gaps = 48/235 (20%) Query: 31 AQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETT----------AAMA 80 M ++ + L + G DA+ P+ + T+ A M Sbjct: 28 GDPDMETSLEI----IRTLVDAGADALEV----GFPFSDPIADGTSVQGADLRALRAGMT 79 Query: 81 R-----IIGQLMSDIRIPFGVNVLWDPVASFDL------AMATGAKFIREIFTGAYASDF 129 +I ++ IP G+ V ++ + + A G + I + Sbjct: 80 TEKCFQLIERVREFTSIPIGLLVYYNLIYRMGVDEFYRRAAEAG---VTGILAADLPPEE 136 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 + +R + ++ + + L + I + S+ F+ + V+ Sbjct: 137 ------ASDALRAAEKYDIDQIFIVA--PTTGSERL--KRISEV--SSGFHYLVSVMGVT 184 Query: 190 GLTAGTR-TDSALLKRVKETVPDTVVLANTGVC-LENVEEQL-SIADGCVTATTF 241 G + L+KRVK V+ GV E+V + ADG + + Sbjct: 185 GARSRVEDATIELIKRVKAE-GSLPVMVGFGVSRPEHVRMLRDAGADGVIVGSAI 238 >UniRef50_C0EA61 Tryptophan synthase alpha chain n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EA61_9CLOT Length = 264 Score = 42.9 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 60/195 (30%), Gaps = 29/195 (14%) Query: 83 IGQLMSDIRIPFGVNVLWDP------VASFDLAMATGAKFIREIFTGAYASDFGVWDTNV 136 + QL + +IP + ++ A F G G D +++ Sbjct: 85 VTQLREETQIPLVFLMYYNSLLHYGQDAFFARCKEAGID-------GVILPDLPFEESD- 136 Query: 137 GETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI-AKSTVFNNHPDALCVSGLTAGT 195 E + R G ++ + + + I AK+ F +L V+G+ + Sbjct: 137 -EISEYTERYGVYQISLVA--------PTSSERLQQITAKAKGFLYCVSSLGVTGMRSEI 187 Query: 196 RTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKD-GVFANFVD- 252 RTD A + G+ E DG + + G D Sbjct: 188 RTDLAQFFAQIDRCCTIPTCIGFGISTPEQAAAVKQYCDGVIIGSAIVNRIGTAQTP-DR 246 Query: 253 -QARVSQFMEKVHHI 266 VS+F+ +V Sbjct: 247 AVESVSEFVRQVSAA 261 >UniRef50_C4V0L8 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V0L8_9FIRM Length = 203 Score = 42.5 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 64/203 (31%), Gaps = 36/203 (17%) Query: 68 LTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIFTGAYA 126 PET +AR + ++ GV V +A A G +++ A Sbjct: 34 RRYAAPETAHEIAREMQRVKK-----VGVFVDAPMAEVNRIADAVGLDYVQLHGHETAEM 88 Query: 127 SDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL 186 + + V + R+ A I + Sbjct: 89 ARMA--ERPVIKAYRYGDDFDAEAANVY--------------PAEIILVDSYVKGAAGG- 131 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKD 244 +GL + + + RV + VL G+ NV E + G + ++D Sbjct: 132 --TGLAFHWQEAAREIARVTK-----PVLIAGGITAANVREAVETFHPFGIDVSGGLEED 184 Query: 245 GVFANFVDQARVSQFMEKVHHIR 267 GV +A+++ FME V +R Sbjct: 185 GVK----SKAKITAFMEAVCALR 203 >UniRef50_Q4FUL2 Tryptophan synthase alpha chain n=5 Tax=Gammaproteobacteria RepID=TRPA_PSYA2 Length = 278 Score = 42.5 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 30/87 (34%), Gaps = 7/87 (8%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK--- 243 V+G D A + + D V G+ + + + ADG + + + Sbjct: 182 VTGSATLDTDDVATQVQAIKAETDLPVCVGFGIRDAASAKAIGAHADGIIVGSALVQNFA 241 Query: 244 --DGVFANFVDQARVSQFMEKVHHIRR 268 DG A V + M K+ +R Sbjct: 242 DIDGNDATAV-AHAQQKIMAKMTELRE 267 >UniRef50_A5IYV3 Triosephosphate isomerase n=1 Tax=Mycoplasma agalactiae PG2 RepID=TPIS_MYCAP Length = 258 Score = 42.5 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 21/61 (34%), Gaps = 8/61 (13%) Query: 192 TAGTRTDSALLKRVKETV-----PDTVVLANTGVCLENVEEQ--LSIADGCVTAT-TFKK 243 G ++ V + P+ VL V N+ + L +G + + + K Sbjct: 182 GTGITPTPEEVENVSALIHKLTSPEVPVLYGGSVNENNINDFTKLPNLNGFLVGSASLKI 241 Query: 244 D 244 D Sbjct: 242 D 242 >UniRef50_UPI00006A2EA9 UPI00006A2EA9 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2EA9 Length = 270 Score = 42.1 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 28/113 (24%), Gaps = 9/113 (7%) Query: 162 AVYLGNRDICSIAKSTVFNNHPDALC-------VSGLTAGTRTDSALLKRVKETVPDTVV 214 V A+ V V+G + + V Sbjct: 146 DVIFLTTPTSDEARLPVILEQASGFVYHVSIAGVTGAASANLDHVRDMVEKIRHHTPLPV 205 Query: 215 LANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 G+ +E V E ADG V + + + D R + V + Sbjct: 206 CVGFGIRTVEQVTELAGFADGVVVGSAI-VNAAMSAPTDAERTIAALTLVRQL 257 >UniRef50_D2B8E9 Tryptophan synthase alpha chain n=2 Tax=Actinomycetales RepID=D2B8E9_STRRD Length = 261 Score = 42.1 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 71/237 (29%), Gaps = 34/237 (14%) Query: 47 MALQNGGVDAVMF-----SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD 101 A + G DAV + + AA + G L + V ++ Sbjct: 37 HAYADAGADAVELGFPFSDPMLDGVTIQEASDRAIAAGTTVKGILEEVATLDVDVPLIAM 96 Query: 102 PVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEA 161 ++ + +T A+F + G G+ + EV L + Sbjct: 97 TYSNLVVQQST-AEFCAALTAGGL---RGLIVPDS----------PLEEVGELADAAAAE 142 Query: 162 AVYLG--NRDICSIAKSTVFNNHPDALC-------VSGLTAGTRTDSALLKRVKETVPDT 212 ++L S A+ +G + +A L R + + D Sbjct: 143 GLHLVLLAAPSSSRARLREIAERSRGFVYALTRMGTTGEHSEVPEQAARLGRELKGLTDR 202 Query: 213 VVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 VL GV E ADG V A+ +D AR + E V IRR Sbjct: 203 PVLFGFGVSNPAQAAELAGHADGVVVASAL-----MRKLLDGARPRELGEYVASIRR 254 >UniRef50_C1N2K9 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1N2K9_9CHLO Length = 307 Score = 42.1 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 33/113 (29%), Gaps = 10/113 (8%) Query: 165 LGNRDICSIAKSTVFNNHPDALC-------VSGLTAGTRTDSALLKRVKETVPDTVVLAN 217 L + A+ T + V+G+ + L + V D V Sbjct: 170 LLSTPTTPEARMTKIAEASNGFIYLVSVTGVTGVRTNVESRVQELVSGLKKVTDKPVAVG 229 Query: 218 TGVC-LENVEEQLSI-ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 G+ E + + ADG + + + + + + + +R Sbjct: 230 FGISKKEQAAQVVGWGADGVIVGSALVR-ALGEAPTPEEGLERLTALAKELRE 281 >UniRef50_A0B8J3 Geranylgeranylglyceryl phosphate synthase n=13 Tax=Euryarchaeota RepID=GGGPS_METTP Length = 255 Score = 42.1 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 82/226 (36%), Gaps = 31/226 (13%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 ++A + + G A+M T +A+ R + + + +P + Sbjct: 38 ERAVQMARSAADAGTTALMV---------GGSVGATGSALDRTVRAIKDSVDLPVILF-- 86 Query: 100 WDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 P ++ L A F + ++ + + + +G I ++ I A + IV Sbjct: 87 --PSSAAGLCDNADAVFF-MSLLNSRSTSYLIENQALGAPIVSRYGIEAIPMG---YIVV 140 Query: 160 E--------AAVYLGNRDICSIAKSTVFNNHPDALCV----SGLTAGTRTDSALLKRVKE 207 E L R IA + + + +G A + ++++ V++ Sbjct: 141 EPGGTVGWVGDAKLVPRRKPDIAAAYALAGRYLGMRLIYLEAGSGAESPVPTSMVSAVRD 200 Query: 208 TVPDTVVLANTGVCLENVEEQL--SIADGCVTATTFKKDGVFANFV 251 + DT+++ G+ +L + AD VT T ++ G FV Sbjct: 201 AIGDTLLVVGGGIRDAEAARKLVSAGADLIVTGTGVEESGDVFRFV 246 >UniRef50_Q12TL0 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12TL0_METBU Length = 222 Score = 42.1 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 46/136 (33%), Gaps = 11/136 (8%) Query: 138 ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC----VSGL-- 191 + + +V + + V D+C+ N D + VSG Sbjct: 92 DIEEMRILRDNTDVSIIRTFHVQGDVSAD--DLCNNINMFTSENLIDGVLLDSYVSGKVG 149 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G D ++ KRV + V D + G+ +NV+ ++ T Sbjct: 150 GTGQLHDLSVSKRVVDLV-DVPAILAGGLNPDNVKACVNEVIPFAVDTA--SGVETDGLK 206 Query: 252 DQARVSQFMEKVHHIR 267 D +V+ F+ V +R Sbjct: 207 DVDKVAAFVNAVRCVR 222 >UniRef50_Q39SS2 Tryptophan synthase alpha chain n=7 Tax=Desulfuromonadales RepID=TRPA_GEOMG Length = 269 Score = 42.1 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 1/57 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 V+G+ +G A + + V G+ E E + ADG V + K Sbjct: 181 VTGVRSGIEASVAGNVNIIKECSKVPVAVGFGIATPEQAGEVAATADGVVVGSAIVK 237 >UniRef50_Q2RIT8 N-(5'-phosphoribosyl)anthranilate isomerase n=7 Tax=Bacteria RepID=TRPF_MOOTA Length = 223 Score = 41.7 bits (97), Expect = 0.023, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 69/228 (30%), Gaps = 36/228 (15%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 W++ + + GVD + + R A II +L GV V Sbjct: 12 WEEARMVLDAGVDTL------GFVFARSPRAIKPEAAREIITKL-PPFTTTVGVFVNEPR 64 Query: 103 VASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAA 162 + ++A F R + D + G + R I + +L ++ Sbjct: 65 YSLMEIA-----SFCRLDVLQLH-GDE-PPEYCHGLSQRLIKAIRVRDAASLASLE---- 113 Query: 163 VYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCL 222 R++ GT + L++ V+ G+ Sbjct: 114 ---AYREVQGFLLDAWVPGKAGG-------TGTTFNWELVR--GAATGGKPVILAGGLTP 161 Query: 223 ENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 ENV + + ++ + DG N AR++ F+E V Sbjct: 162 ENVGAAIQLVHPYAVDVSSGVEVDGR-KNP---ARIAAFLEAVRKAEE 205 >UniRef50_C6J450 Tryptophan synthase alpha chain n=2 Tax=Bacillales RepID=C6J450_9BACL Length = 273 Score = 41.7 bits (97), Expect = 0.023, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 28/103 (27%), Gaps = 6/103 (5%) Query: 147 GAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-----DALCVSGLTAGTRTDSAL 201 A EV L + + L + +L V+G Sbjct: 140 EAEEVLKLADAAGVRLIPLVAPTSSGRIARILERARGFVYCVSSLGVTGERTSFHASVDE 199 Query: 202 LKRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTATTFKK 243 + D V G+ E VE +I DG V + + Sbjct: 200 FIASVKAQTDLPVAVGFGISSREQVERFAAICDGAVVGSAIVR 242 >UniRef50_A6Q538 Triosephosphate isomerase n=24 Tax=Epsilonproteobacteria RepID=TPIS_NITSB Length = 232 Score = 41.7 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 27/81 (33%), Gaps = 14/81 (17%) Query: 192 TAGTRTDSALLKRVKE---TVPDTVVLANTGVCLENVEEQLSI--ADGCVTATTFKKDGV 246 G ++ V ++ D +L V N++E LSI DG + T Sbjct: 161 GTGVAAKPEEIEEVLAYLASLTDAPLLYGGSVKPANIKEVLSIPKCDGALIGTA------ 214 Query: 247 FANFVDQARVSQFMEKVHHIR 267 D + +E +R Sbjct: 215 ---SWDVENFIKMIEIAKEMR 232 >UniRef50_Q1CZH2 Tryptophan synthase alpha chain n=2 Tax=Cystobacterineae RepID=Q1CZH2_MYXXD Length = 263 Score = 41.4 bits (96), Expect = 0.030, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 25/83 (30%), Gaps = 5/83 (6%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 V+G+ A D + + V+A G+ E + ADG V + + Sbjct: 184 VTGMRAELPPDLSQRLDLVRKAATVPVVAGFGISTAEQARMLSAHADGVVVGSALVRAAH 243 Query: 247 FANFVDQARVSQF-MEKVHHIRR 268 + +RR Sbjct: 244 TEG---LEAAKALCADIKRGLRR 263 >UniRef50_C0ZCE6 Tryptophan synthase alpha chain n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZCE6_BREBN Length = 272 Score = 41.4 bits (96), Expect = 0.031, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 1/57 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 V+G R D A + G+ + V ADG + + + Sbjct: 182 VTGARTDLREDLADFLERVKASTSVPTAVGFGISTPDQVRTVAPHADGVIVGSAIVQ 238 >UniRef50_B2WCC2 tRNA (Guanine-N7-)-methyltransferase subunit Trm82 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2WCC2_PYRTR Length = 272 Score = 41.4 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 41/121 (33%), Gaps = 9/121 (7%) Query: 134 TNVGETIRHQHRIGAGEVKTLFNI---VPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 E +R +H++ G V L ++ Y+ D + + P A + G Sbjct: 53 AKSKEPMRFKHQLLLGHVSMLTDVVYAKVNGRSYIITADRDEHIR--ISRGLPQAHIIEG 110 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANF 250 G + L P ++++ G V+E L +G FK AN Sbjct: 111 FCFGHEAFISKLC----LTPSGLLVSGGGDDHLYVDEILVACEGVPALFHFKMGDAHANH 166 Query: 251 V 251 + Sbjct: 167 I 167 >UniRef50_C6CUD6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CUD6_PAESJ Length = 220 Score = 40.6 bits (94), Expect = 0.053, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 38/98 (38%), Gaps = 10/98 (10%) Query: 174 AKSTVFNNHPDALCV--SGLTAGTRTDSALLKRVKETVP--DTVVLANTGVCLENVEEQL 229 A+ + + DA+ + +G G D L+ + D + G+ NV E L Sbjct: 126 ARVSAYEGAVDAILIDTAGGGTGQTFDWQLITDYQNAAAAIDVPLYVAGGLHPGNVGELL 185 Query: 230 SI--ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + DG ++ + DG D ++ F+ KV Sbjct: 186 AGNPVDGIDVSSGVETDG----RKDIEKIRLFVRKVIE 219 >UniRef50_Q8G691 Tryptophan synthase alpha chain n=30 Tax=Actinobacteridae RepID=TRPA_BIFLO Length = 291 Score = 40.6 bits (94), Expect = 0.053, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 64/215 (29%), Gaps = 38/215 (17%) Query: 47 MALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASF 106 + GVDAV +S P + + A++A G+ + + A Sbjct: 58 KTMVEHGVDAVEIGLPYSDPVMDGPVIQAAASIALNNGETIKRVF-----------EAVE 106 Query: 107 DLAMATGAKFIREIFTGAY-------------ASDFGVWDTN-----VGETIRHQHRIGA 148 +A A G I + Y A G+ + GE I R G Sbjct: 107 TVANAGGVPLIMSYWNLVYHYGVERFARDFENAGGAGLITPDLIPDEAGEWIEASDRHGL 166 Query: 149 GEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-VSGLTAGTRTDSALLKRVKE 207 + + + + ++A++ + A V+G A LL Sbjct: 167 DRIFLV-------SPDSSTERLETVARNARGFVYAAARMGVTGERATIDASPELLVERTR 219 Query: 208 TVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 V GV E + S ADG + + Sbjct: 220 QAGAENVCVGIGVSTAEQGAKVGSYADGVIVGSAL 254 >UniRef50_UPI00016C49DE phosphoribosylanthranilate isomerase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C49DE Length = 217 Score = 40.6 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 70/234 (29%), Gaps = 46/234 (19%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 +D G DAV N Y R T A ++ L GV V Sbjct: 13 EDARFAAEAGADAVGL-NF----YPQSPRYITPQQAAPLVRAL-PAFTSAVGVFVGMPMR 66 Query: 104 ASFDLAMATGAKFI---------REIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTL 154 + +A G + + + F A+ R + R G V+ Sbjct: 67 QACAIAFQLGLRGVQSYDDHPPTEDTFPFAHVP-----------AFRVKDREGLEAVRRF 115 Query: 155 FNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVV 214 + A + S V G R LL++ V + Sbjct: 116 VDAAVAAGRP----PSAVLIDSFVVGQMG--------GTGHRAPWQLLQQFDVGV---PL 160 Query: 215 LANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 + G+ ENVEE ++ G A+ ++ D +V++F++ V Sbjct: 161 ILAGGLTPENVEEAVATVRPWGVDVASGVERAPGVK---DPDKVTRFVQNVRRA 211 >UniRef50_C9KJ76 N-(5'-phosphoribosyl)anthranilate isomerase n=3 Tax=Veillonellaceae RepID=C9KJ76_9FIRM Length = 233 Score = 40.6 bits (94), Expect = 0.056, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 57/191 (29%), Gaps = 31/191 (16%) Query: 80 ARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGET 139 ARI+ L GV V DP +A F+ + E Sbjct: 70 ARIVEALQRVKT--VGVFVDEDPALVNAIARQCHLDFV---------------QLHGHED 112 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS 199 + + +I +K +A I +G + Sbjct: 113 VAYAKQIEVPVIKAYRYGDGFSAEAANAFPAAMILVDAYQKGAAGG---TGTCFDWQQAK 169 Query: 200 ALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVS 257 + V++ VL G+ NV E +I + + + + AR++ Sbjct: 170 REVAAVRK-----PVLIAGGISEANVAEVNTIFHPFAVDVSGSLEVN----REKSAARIA 220 Query: 258 QFMEKVHHIRR 268 FME+VH I R Sbjct: 221 AFMEQVHEINR 231 >UniRef50_B1YEC8 CutC family protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YEC8_EXIS2 Length = 221 Score = 40.6 bits (94), Expect = 0.059, Method: Composition-based stats. Identities = 53/252 (21%), Positives = 88/252 (34%), Gaps = 52/252 (20%) Query: 35 MNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR--------IIGQL 86 M +I ++ A + G D R E +A++ +I Q+ Sbjct: 1 MLEIIAATVEEATAAERAGAD----------------RLELVSALSEGGLTPSYGLIRQV 44 Query: 87 MSDIRIPFGVNVLWDPVASFDLAMA------TGAKFIRE-----IFTGAYASDFGVWDTN 135 +S + IP V V SF + A T IRE I G+ +D V + Sbjct: 45 LSSVEIPVHVLV-RPHSKSFVYSKADQETIITDIDLIRELGAAGIVVGSLTADGRVDEGF 103 Query: 136 VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGT 195 +G I+H+ + +RDI A+ D + SG A Sbjct: 104 LGRIIKHKGDLSLT----------FHRAIDSSRDILEAAEVLADFPEVDRILTSGGHATA 153 Query: 196 RTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADG--CVTATTFKKDGVFANFVDQ 253 A++ ++ E PD +VL +G+ +E EE L + DG VD Sbjct: 154 LEGQAVIAQLIEQNPDLIVLPGSGITVERAEELLKATQATELHVGSAVLVDGQ----VDA 209 Query: 254 ARVSQFMEKVHH 265 +V+ + + Sbjct: 210 DKVAALKQLLAR 221 >UniRef50_Q9URN8 Mutant tryptophan synthase (Fragment) n=4 Tax=Neurospora crassa RepID=Q9URN8_NEUCR Length = 193 Score = 40.6 bits (94), Expect = 0.062, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 31/86 (36%), Gaps = 10/86 (11%) Query: 163 VYLGNRDICSIAKSTVFNNHPDALC-------VSGLT-AGTRTDSALLKRVKETVPDTVV 214 V L S A+ V D+ V+G + LL RVK+ + Sbjct: 29 VPLI-APATSDARMRVLCQLADSFIYVVSRQGVTGASGTLNANLPELLARVKKYSGNKPA 87 Query: 215 LANTGV-CLENVEEQLSIADGCVTAT 239 GV ++ + +IADG V + Sbjct: 88 AVGFGVSTHDHFTQVGAIADGVVVGS 113 >UniRef50_A9KL40 Tryptophan synthase alpha chain n=13 Tax=Firmicutes RepID=A9KL40_CLOPH Length = 257 Score = 40.6 bits (94), Expect = 0.063, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 20/55 (36%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 V+G+ + + + + V D G+ E E I+DG + + Sbjct: 175 VTGVRTEIGSQVDYMIKEAKKVTDIPCAVGFGISTPEQAREMAEISDGVIVGSAI 229 >UniRef50_Q5HPG9 Tryptophan synthase alpha chain n=59 Tax=Staphylococcus RepID=TRPA_STAEQ Length = 243 Score = 40.6 bits (94), Expect = 0.064, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 32/80 (40%), Gaps = 7/80 (8%) Query: 189 SGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVF 247 +G + D + V V+A G+ E+V++ S+ADG V + K Sbjct: 164 TGNSGEFHPDLKRKIEYIKKVSKIPVVAGFGIKNPEHVKDIASVADGIVIGSEIVK---- 219 Query: 248 ANFVDQARVSQFMEKVHHIR 267 ++ +F+ + IR Sbjct: 220 --RIEIDSRKEFITYIKSIR 237 >UniRef50_Q0W630 Tryptophan synthase alpha chain n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W630_UNCMA Length = 273 Score = 40.2 bits (93), Expect = 0.070, Method: Composition-based stats. Identities = 15/57 (26%), Positives = 25/57 (43%), Gaps = 3/57 (5%) Query: 188 VSGLTAGTRTDS-ALLKRVKETVPDTVVLANTGVC-LENVEEQLSI-ADGCVTATTF 241 V+G A +L R K +T V G+ ++V E +S+ ADG + + Sbjct: 185 VTGKRASLSAGIKEVLDRAKAAAGNTPVAVGFGISGPDHVTEIISMGADGAIVGSAI 241 >UniRef50_D2RNR7 Tryptophan synthase, alpha subunit n=15 Tax=cellular organisms RepID=D2RNR7_ACIFE Length = 260 Score = 40.2 bits (93), Expect = 0.084, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 29/80 (36%), Gaps = 1/80 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 V+G+ T+ + V D + G+ E E +I+DG + + K Sbjct: 176 VTGVRKEITTNLKEIVEKIREVSDKPIAVGFGISTPEQAREMAAISDGAIVGSAIVKLCA 235 Query: 247 FANFVDQARVSQFMEKVHHI 266 V Q+++K+ Sbjct: 236 QYGKDCVEPVKQYVKKMAEA 255 >UniRef50_A8ZVH6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZVH6_DESOH Length = 260 Score = 39.8 bits (92), Expect = 0.088, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 35/87 (40%), Gaps = 19/87 (21%) Query: 193 AGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCV------TATTFKKD-- 244 G +D A++ ++ + P V+LA G+ ENV DG V + + + Sbjct: 168 TGRPSDWAVVAKLVASTPVPVILAG-GMSPENV------YDGIVQTRPAGVDSCTQTNRR 220 Query: 245 GVFANFV----DQARVSQFMEKVHHIR 267 N V D RV +F+E+ Sbjct: 221 DPAGNPVRFSKDMDRVRRFVEETRRAE 247 >UniRef50_Q8U092 N-(5'-phosphoribosyl)anthranilate isomerase n=5 Tax=Euryarchaeota RepID=TRPF_PYRFU Length = 208 Score = 39.8 bits (92), Expect = 0.093, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 45/117 (38%), Gaps = 13/117 (11%) Query: 156 NIVPEAAVYLGNRDICSIAK---STVFNNHPDALCV-SGLTAGTRTDSALLKRVKETVPD 211 ++ V +++ A S + + D + + +G +G L+ Sbjct: 97 FVMKAFRVPTISKNPEEDANRLLSEISRYNADMVLLDTGAGSGK---LHDLRVSSLVARK 153 Query: 212 TVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 V+ G+ ENVEE + + G ++ +K G+ D V +F+ + ++ Sbjct: 154 IPVIVAGGLNAENVEEVIKVVKPYGVDVSSGVEKYGIK----DPKLVEEFVRRAKNV 206 >UniRef50_Q7UMC3 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Rhodopirellula baltica RepID=Q7UMC3_RHOBA Length = 245 Score = 39.8 bits (92), Expect = 0.097, Method: Composition-based stats. Identities = 44/259 (16%), Positives = 91/259 (35%), Gaps = 31/259 (11%) Query: 17 MCHLRAL-----PGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKV 71 M L + P F + + ++ D+ ++ + G DAV N + P + + Sbjct: 1 MTTLPPVGPSDPPTTQPFRSPFAIKICGMRSVQDVRSVADAGADAVGL-NFYE-PSVRSL 58 Query: 72 RPETTAAMARIIGQLMSDIRI-PFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFG 130 P+ A I ++ ++ + G+ V D +A + +I ++ S Sbjct: 59 NPD--AEETIRINEVAREVGLTRVGLFVNHDLEFIQRVAGSLQLDWI-QLHGDEPVSLAE 115 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 +R R+ G++K L I + D+ + + + Sbjct: 116 DLVRAGQSILRA-IRLPRGKLK-LGQIDDVIGKW-NEVDVSYLLDADAGASFGGG----- 167 Query: 191 LTAGTRTDSALLKRVKETVPDTV---VLANTGVCLENVEEQLSI--ADGCVTATTFKKDG 245 G D ++ + D+ VLA G+ ENV E + + A G A+ ++ Sbjct: 168 ---GQPLDWPSIRAWADRRGDSAAGWVLAG-GLNPENVREAIQVSGATGVDVASGVEQ-- 221 Query: 246 VFANFVDQARVSQFMEKVH 264 + ++ QF+ V Sbjct: 222 -PKGRKNAEKIRQFVAAVQ 239 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39364 Putative sgc region protein sgcQ n=61 Tax=Bacter... 302 8e-81 UniRef50_C0D1F9 Putative uncharacterized protein n=1 Tax=Clostri... 289 6e-77 UniRef50_C5CIF3 Photosystem I assembly BtpA n=1 Tax=Kosmotoga ol... 285 1e-75 UniRef50_A7B0Z5 Putative uncharacterized protein n=1 Tax=Ruminoc... 284 3e-75 UniRef50_A3KNP0 Zgc:162297 protein n=7 Tax=Coelomata RepID=A3KNP... 283 5e-75 UniRef50_B1CBM1 Putative uncharacterized protein n=1 Tax=Anaerof... 282 9e-75 UniRef50_Q5JHL2 Uncharacterized protein TK2179 n=5 Tax=Euryarcha... 278 1e-73 UniRef50_Q8U2H5 Uncharacterized protein PF0860 n=8 Tax=Euryarcha... 277 3e-73 UniRef50_UPI0000D55C2D PREDICTED: similar to conserved hypotheti... 276 5e-73 UniRef50_Q29E81 GA21203 n=3 Tax=Coelomata RepID=Q29E81_DROPS 276 5e-73 UniRef50_A4E7P3 Putative uncharacterized protein n=1 Tax=Collins... 274 2e-72 UniRef50_Q28QI3 Photosystem I assembly BtpA n=5 Tax=Alphaproteob... 274 3e-72 UniRef50_Q9VS44 CG8607 n=22 Tax=Eukaryota RepID=Q9VS44_DROME 272 9e-72 UniRef50_Q2RUX8 Photosystem I assembly BtpA n=1 Tax=Rhodospirill... 270 4e-71 UniRef50_A8S303 Putative uncharacterized protein n=1 Tax=Clostri... 269 7e-71 UniRef50_B9XI59 Photosystem I assembly BtpA n=1 Tax=bacterium El... 269 8e-71 UniRef50_A5KMZ4 Putative uncharacterized protein n=2 Tax=Clostri... 268 1e-70 UniRef50_D2QXT9 Photosystem I assembly BtpA n=2 Tax=Bacteria Rep... 268 2e-70 UniRef50_A9W9X3 Photosystem I assembly BtpA n=4 Tax=Chloroflexac... 264 3e-69 UniRef50_P72966 Photosystem I biogenesis protein btpA n=34 Tax=C... 263 4e-69 UniRef50_C5ELG9 Putative uncharacterized protein n=1 Tax=Clostri... 263 5e-69 UniRef50_UPI000186CA08 conserved hypothetical protein n=1 Tax=Pe... 262 9e-69 UniRef50_D2RDD9 Photosystem I assembly BtpA n=1 Tax=Archaeoglobu... 260 2e-68 UniRef50_C3ZBU0 Putative uncharacterized protein n=2 Tax=Metazoa... 259 1e-67 UniRef50_Q9Y937 BtpA homolog n=1 Tax=Aeropyrum pernix RepID=Q9Y9... 255 2e-66 UniRef50_A3K4E4 Putative uncharacterized protein n=1 Tax=Sagittu... 254 3e-66 UniRef50_A6NZB9 Putative uncharacterized protein n=1 Tax=Bactero... 254 3e-66 UniRef50_B9XEW7 Photosystem I assembly BtpA n=1 Tax=bacterium El... 254 3e-66 UniRef50_Q8TVC9 Predicted TIM-barrel enzyme n=1 Tax=Methanopyrus... 253 4e-66 UniRef50_B5XQK9 BtpA family protein n=18 Tax=Proteobacteria RepI... 249 7e-65 UniRef50_A4YFU5 Photosystem I assembly BtpA n=1 Tax=Metallosphae... 249 9e-65 UniRef50_Q1NZ26 Uncharacterized protein F13E9.13, mitochondrial ... 249 1e-64 UniRef50_A8AAQ2 Photosystem I assembly BtpA n=1 Tax=Ignicoccus h... 248 2e-64 UniRef50_B9CKX7 Putative uncharacterized protein n=1 Tax=Atopobi... 248 2e-64 UniRef50_A3DLN8 Photosystem I assembly BtpA n=1 Tax=Staphylother... 239 7e-62 UniRef50_O29828 Uncharacterized protein AF_0419 n=1 Tax=Archaeog... 237 4e-61 UniRef50_B8HZU1 Photosystem I assembly BtpA n=5 Tax=Clostridiale... 234 4e-60 UniRef50_C0ZRZ3 Putative uncharacterized protein n=2 Tax=Rhodoco... 233 5e-60 UniRef50_Q96YL8 Putative uncharacterized protein ST2156 n=1 Tax=... 231 2e-59 UniRef50_D2RQS6 Photosystem I assembly BtpA n=4 Tax=Halobacteria... 231 2e-59 UniRef50_A2BLV0 Conserved archaeal protein n=1 Tax=Hyperthermus ... 231 2e-59 UniRef50_UPI000155D1C1 PREDICTED: hypothetical protein n=1 Tax=O... 230 4e-59 UniRef50_C5EEU2 Photosystem I assembly BtpA n=2 Tax=Clostridiale... 228 2e-58 UniRef50_UPI0001C369E0 btpA family protein n=1 Tax=Clostridium h... 227 3e-58 UniRef50_C8S5Y9 Photosystem I assembly BtpA n=1 Tax=Ferroglobus ... 213 6e-54 UniRef50_B9LR39 Photosystem I assembly BtpA n=7 Tax=cellular org... 212 7e-54 UniRef50_A5GQP5 Photosystem I biogenesis protein BtpA n=4 Tax=Ba... 211 2e-53 UniRef50_C9XNH3 Putative uncharacterized protein n=7 Tax=Firmicu... 204 3e-51 UniRef50_Q2CH81 Putative uncharacterized protein n=1 Tax=Oceanic... 193 4e-48 UniRef50_UPI000180CC4C PREDICTED: similar to F13E9.13 n=1 Tax=Ci... 192 8e-48 UniRef50_C2JNA9 Photosystem I biogenesis protein BtpA n=9 Tax=En... 192 1e-47 UniRef50_C5EPH7 Putative uncharacterized protein n=1 Tax=Clostri... 187 3e-46 UniRef50_A3MXM0 Photosystem I assembly BtpA n=5 Tax=Thermoprotea... 187 4e-46 UniRef50_C2BTC6 Possible photosystem I biogenesis protein BtpA n... 186 8e-46 UniRef50_Q16GL4 Putative uncharacterized protein n=2 Tax=Aedes a... 185 9e-46 UniRef50_C2D712 Putative uncharacterized protein n=1 Tax=Atopobi... 185 2e-45 UniRef50_A9FN23 Putative uncharacterized protein n=1 Tax=Sorangi... 174 2e-42 UniRef50_Q166V4 Photosystem I biogenesis protein, putative n=2 T... 171 2e-41 UniRef50_UPI000069ECFE UPI000069ECFE related cluster n=1 Tax=Xen... 168 1e-40 UniRef50_C0C181 Putative uncharacterized protein n=1 Tax=Clostri... 162 9e-39 UniRef50_Q18HA0 Photosystem I biogenesis protein homolog n=1 Tax... 136 6e-31 UniRef50_D2VTE2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 135 2e-30 UniRef50_A6G6X7 Putative uncharacterized protein n=1 Tax=Plesioc... 121 2e-26 UniRef50_C5KQ75 Putative uncharacterized protein n=1 Tax=Perkins... 114 4e-24 UniRef50_A8VXV8 Tryptophan synthase alpha chain n=2 Tax=Bacillus... 103 7e-21 UniRef50_B9T9A8 Putative uncharacterized protein n=1 Tax=Ricinus... 101 2e-20 UniRef50_Q0FCT7 Adenine phosphoribosyltransferase n=3 Tax=Rhodob... 101 4e-20 UniRef50_C0QR82 Thiamine-phosphate pyrophosphorylase n=1 Tax=Per... 79 1e-13 UniRef50_A6TM77 Tryptophan synthase alpha chain n=10 Tax=Clostri... 74 6e-12 UniRef50_Q4JTH5 L-lactate dehydrogenase n=6 Tax=Actinomycetales ... 67 5e-10 UniRef50_Q7M877 Tryptophan synthase alpha chain n=9 Tax=Epsilonp... 66 9e-10 UniRef50_A3TLL5 Tryptophan synthase alpha chain n=1 Tax=Janibact... 60 1e-07 UniRef50_A5IKT4 Tryptophan synthase alpha chain n=5 Tax=Thermoto... 57 8e-07 Sequences not found previously or not previously below threshold: UniRef50_Q3ABS4 Tryptophan synthase alpha chain n=1 Tax=Carboxyd... 63 1e-08 UniRef50_A9A2A1 Tryptophan synthase alpha chain n=3 Tax=Thaumarc... 61 3e-08 UniRef50_C0ZCE6 Tryptophan synthase alpha chain n=1 Tax=Brevibac... 60 1e-07 UniRef50_Q5KXV2 Tryptophan synthase alpha chain n=6 Tax=Bacillac... 59 2e-07 UniRef50_D1BQN8 Tryptophan synthase alpha chain n=17 Tax=cellula... 59 2e-07 UniRef50_Q2LUE0 Tryptophan synthase alpha chain n=1 Tax=Syntroph... 58 2e-07 UniRef50_B0NZY9 Tryptophan synthase alpha chain n=2 Tax=Clostrid... 58 4e-07 UniRef50_C7N999 Tryptophan synthase alpha chain n=2 Tax=Leptotri... 58 4e-07 UniRef50_Q67PJ3 Tryptophan synthase alpha chain n=3 Tax=Clostrid... 57 7e-07 UniRef50_B0TFQ0 Tryptophan synthase alpha chain n=2 Tax=Heliobac... 57 8e-07 UniRef50_C6J450 Tryptophan synthase alpha chain n=2 Tax=Bacillal... 56 9e-07 UniRef50_Q3Z6G1 Tryptophan synthase alpha chain n=5 Tax=Dehaloco... 56 1e-06 UniRef50_C6PE99 Tryptophan synthase alpha chain n=1 Tax=Thermoan... 56 2e-06 UniRef50_D2RNR7 Tryptophan synthase, alpha subunit n=15 Tax=cell... 55 2e-06 UniRef50_Q8G691 Tryptophan synthase alpha chain n=30 Tax=Actinob... 54 3e-06 UniRef50_C0EA61 Tryptophan synthase alpha chain n=1 Tax=Clostrid... 54 3e-06 UniRef50_A9KL40 Tryptophan synthase alpha chain n=13 Tax=Firmicu... 54 4e-06 UniRef50_Q1ISI7 Tryptophan synthase alpha chain n=2 Tax=Acidobac... 54 4e-06 UniRef50_B0SDM7 Tryptophan synthase alpha chain n=6 Tax=Leptospi... 54 5e-06 UniRef50_A0RMX3 Tryptophan synthase alpha chain n=5 Tax=Campylob... 53 8e-06 UniRef50_C6LHS6 Tryptophan synthase alpha chain n=1 Tax=Bryantel... 53 1e-05 UniRef50_A0AJ79 Tryptophan synthase alpha chain n=11 Tax=Listeri... 53 1e-05 UniRef50_P95143 Putative L-lactate dehydrogenase [cytochrome] 2 ... 53 1e-05 UniRef50_C8WX57 Tryptophan synthase alpha chain n=2 Tax=Alicyclo... 53 1e-05 UniRef50_C6CUD8 Tryptophan synthase alpha chain n=2 Tax=Paenibac... 53 1e-05 UniRef50_Q7TUC8 Tryptophan synthase alpha chain n=31 Tax=Cyanoba... 53 1e-05 UniRef50_C6HY57 Tryptophan synthase alpha chain n=1 Tax=Leptospi... 53 1e-05 UniRef50_C0QR00 Tryptophan synthase alpha chain n=1 Tax=Persepho... 52 2e-05 UniRef50_Q5GMK6 Tryptophan synthase alpha chain n=2 Tax=Bacteria... 52 2e-05 UniRef50_Q5WGS0 Tryptophan synthase alpha chain n=4 Tax=Bacillac... 52 2e-05 UniRef50_Q254T1 Tryptophan synthase alpha chain n=3 Tax=Bacteria... 52 2e-05 UniRef50_B7FQI2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 52 3e-05 UniRef50_O27697 Tryptophan synthase alpha chain n=3 Tax=Methanob... 52 3e-05 UniRef50_Q4FUL2 Tryptophan synthase alpha chain n=5 Tax=Gammapro... 52 3e-05 UniRef50_B9K6Z6 Tryptophan synthase alpha chain n=1 Tax=Thermoto... 51 3e-05 UniRef50_Q39SS2 Tryptophan synthase alpha chain n=7 Tax=Desulfur... 51 3e-05 UniRef50_B5YJI8 Tryptophan synthase alpha chain n=1 Tax=Thermode... 51 4e-05 UniRef50_B7APJ8 Tryptophan synthase alpha chain n=1 Tax=Bacteroi... 51 5e-05 UniRef50_A1HS65 Tryptophan synthase alpha chain n=2 Tax=Veillone... 50 6e-05 UniRef50_B8JAL9 Tryptophan synthase alpha chain n=4 Tax=Anaeromy... 50 6e-05 UniRef50_UPI00016C5413 tryptophan synthase alpha chain n=1 Tax=G... 50 7e-05 UniRef50_P00931 Tryptophan synthase n=45 Tax=Eukaryota RepID=TRP... 50 7e-05 UniRef50_C1N2K9 Predicted protein n=1 Tax=Micromonas pusilla CCM... 50 7e-05 UniRef50_UPI00016C49DE phosphoribosylanthranilate isomerase n=1 ... 50 9e-05 UniRef50_D1B8G5 CutC family protein n=1 Tax=Thermanaerovibrio ac... 50 1e-04 UniRef50_D1AQ86 Tryptophan synthase alpha chain n=6 Tax=Bacteria... 49 1e-04 UniRef50_Q01NI9 Tryptophan synthase alpha chain n=1 Tax=Candidat... 49 2e-04 UniRef50_B2WJB5 L-lactate dehydrogenase n=2 Tax=Pleosporineae Re... 49 2e-04 UniRef50_UPI00006A2EA9 UPI00006A2EA9 related cluster n=1 Tax=Xen... 49 2e-04 UniRef50_Q8TYA2 Tryptophan synthase alpha chain n=1 Tax=Methanop... 49 2e-04 UniRef50_Q5HPG9 Tryptophan synthase alpha chain n=59 Tax=Staphyl... 49 2e-04 UniRef50_C4V0L5 Tryptophan synthase alpha chain n=1 Tax=Selenomo... 49 2e-04 UniRef50_C4KZ63 Tryptophan synthase alpha chain n=1 Tax=Exiguoba... 48 2e-04 UniRef50_Q1CZH2 Tryptophan synthase alpha chain n=2 Tax=Cystobac... 48 2e-04 UniRef50_D2B8E9 Tryptophan synthase alpha chain n=2 Tax=Actinomy... 48 2e-04 UniRef50_B1QUK5 Tryptophan synthase alpha chain n=4 Tax=Clostrid... 48 2e-04 UniRef50_A3ESF4 Tryptophan synthase alpha chain n=2 Tax=Leptospi... 48 3e-04 UniRef50_Q03X02 Tryptophan synthase alpha chain n=2 Tax=Leuconos... 48 3e-04 UniRef50_C7TFE0 Tryptophan synthase alpha chain n=4 Tax=Lactobac... 48 3e-04 UniRef50_Q60180 Tryptophan synthase alpha chain n=5 Tax=Methanoc... 48 4e-04 UniRef50_A3CRL2 Tryptophan synthase alpha chain n=1 Tax=Methanoc... 48 4e-04 UniRef50_A6Q538 Triosephosphate isomerase n=24 Tax=Epsilonproteo... 48 4e-04 UniRef50_P17166 Tryptophan synthase alpha chain n=11 Tax=Lactoba... 48 4e-04 UniRef50_B5YJ74 Thiamine-phosphate pyrophosphorylase n=1 Tax=The... 48 5e-04 UniRef50_P42390 Indole-3-glycerol phosphate lyase, chloroplastic... 47 5e-04 UniRef50_A4RVE9 Predicted protein n=3 Tax=cellular organisms Rep... 47 6e-04 UniRef50_A4J150 Tryptophan synthase alpha chain n=1 Tax=Desulfot... 47 6e-04 UniRef50_Q8ESU5 Tryptophan synthase alpha chain n=4 Tax=Bacillac... 47 7e-04 UniRef50_D2QFY3 Tryptophan synthase, alpha subunit n=1 Tax=Spiro... 47 8e-04 UniRef50_Q0W630 Tryptophan synthase alpha chain n=1 Tax=uncultur... 47 8e-04 UniRef50_A8FKD8 Tryptophan synthase alpha chain n=15 Tax=Campylo... 47 8e-04 UniRef50_Q2S1Z2 Tryptophan synthase alpha chain n=2 Tax=Rhodothe... 46 0.001 UniRef50_A8ZZW7 Tryptophan synthase alpha chain n=3 Tax=Deltapro... 46 0.001 UniRef50_C6N065 Tryptophan synthase alpha chain n=2 Tax=Legionel... 46 0.001 UniRef50_A7I4T3 Tryptophan synthase alpha chain n=2 Tax=Methanom... 46 0.001 UniRef50_B5JDA4 Tryptophan synthase alpha chain n=1 Tax=Verrucom... 46 0.001 UniRef50_A9BHQ2 Tryptophan synthase alpha chain n=1 Tax=Petrotog... 46 0.001 UniRef50_Q5LYI8 Tryptophan synthase alpha chain n=59 Tax=Bacilli... 46 0.001 UniRef50_Q8U092 N-(5'-phosphoribosyl)anthranilate isomerase n=5 ... 46 0.001 UniRef50_Q0BWJ7 Tryptophan synthase alpha chain n=2 Tax=Hyphomon... 46 0.001 UniRef50_B3E0A3 Tryptophan synthase alpha chain n=1 Tax=Methylac... 46 0.002 UniRef50_A5IYV3 Triosephosphate isomerase n=1 Tax=Mycoplasma aga... 46 0.002 UniRef50_C4V0L8 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 46 0.002 UniRef50_C7IFK5 Tryptophan synthase alpha chain n=1 Tax=Clostrid... 46 0.002 UniRef50_P16608 Tryptophan synthase alpha chain n=6 Tax=Deinococ... 46 0.002 UniRef50_Q12TL0 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 45 0.002 UniRef50_A9WTC6 L-lactate dehydrogenase n=27 Tax=Actinobacteria ... 45 0.002 UniRef50_A7BCI9 Tryptophan synthase alpha chain n=1 Tax=Actinomy... 45 0.002 UniRef50_D1CBJ1 Tryptophan synthase alpha chain n=1 Tax=Thermoba... 45 0.002 UniRef50_Q7URN0 Tryptophan synthase alpha chain n=3 Tax=Planctom... 45 0.003 UniRef50_C7LZ69 Tryptophan synthase alpha chain n=1 Tax=Acidimic... 45 0.003 UniRef50_Q4PNH8 Putative L-lactate dehydrogenase n=1 Tax=uncultu... 45 0.003 UniRef50_Q5WX31 Tryptophan synthase alpha chain n=5 Tax=Gammapro... 45 0.003 UniRef50_C2KZB1 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 45 0.003 UniRef50_Q2W414 Dioxygenase related to 2-nitropropane dioxygenas... 45 0.003 UniRef50_A6C191 Tryptophan synthase alpha chain n=1 Tax=Planctom... 45 0.003 UniRef50_A3S0B4 Tryptophan synthase alpha chain n=4 Tax=Bacteria... 45 0.003 UniRef50_B9M7D5 Tryptophan synthase alpha chain n=3 Tax=Geobacte... 45 0.003 UniRef50_A4CE02 FMN-dependent alpha-hydroxy acid dehydrogenase n... 44 0.004 UniRef50_B1I3Z4 Tryptophan synthase alpha chain n=1 Tax=Candidat... 44 0.004 UniRef50_C6VWH1 Tryptophan synthase alpha chain n=1 Tax=Dyadobac... 44 0.004 UniRef50_C6M905 Tryptophan synthase alpha chain n=5 Tax=Neisseri... 44 0.004 UniRef50_C6CUD6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 44 0.004 UniRef50_Q2FKX6 Tryptophan synthase alpha chain n=1 Tax=Methanos... 44 0.004 UniRef50_Q49Z39 Thiamine-phosphate pyrophosphorylase n=6 Tax=Sta... 44 0.005 UniRef50_A7Z615 Tryptophan synthase alpha chain n=15 Tax=Bacilla... 44 0.005 UniRef50_UPI0001850C57 tryptophan synthase subunit alpha n=1 Tax... 44 0.005 UniRef50_A6LU97 Tryptophan synthase alpha chain n=1 Tax=Clostrid... 44 0.005 UniRef50_C0RFZ2 Tryptophan synthase alpha chain n=134 Tax=Bacter... 44 0.006 UniRef50_B5Y770 Enoyl-(Acyl-carrier-protein) reductase II n=1 Ta... 44 0.006 UniRef50_A3IFH8 Tryptophan synthase alpha chain n=2 Tax=Bacillac... 44 0.006 UniRef50_Q9URN8 Mutant tryptophan synthase (Fragment) n=4 Tax=Ne... 44 0.006 UniRef50_Q0EXD5 Tryptophan synthase alpha chain n=1 Tax=Mariprof... 44 0.006 UniRef50_B0C6F8 Tryptophan synthase alpha chain n=32 Tax=cellula... 44 0.007 UniRef50_A4XLM5 N-(5'-phosphoribosyl)anthranilate isomerase n=2 ... 44 0.007 UniRef50_C1TRA2 Tryptophan synthase alpha chain n=1 Tax=Dethiosu... 44 0.007 UniRef50_B7HH02 Tryptophan synthase alpha chain n=76 Tax=Bacillu... 43 0.008 UniRef50_A6VFT8 Tryptophan synthase alpha chain n=9 Tax=Euryarch... 43 0.008 UniRef50_A3HZI7 Tryptophan synthase alpha chain n=3 Tax=Bacteroi... 43 0.009 UniRef50_A1VL09 FMN-dependent alpha-hydroxy acid dehydrogenase n... 43 0.009 UniRef50_D2R0T1 Phosphoribosylanthranilate isomerase n=1 Tax=Pir... 43 0.011 UniRef50_Q4KKP5 Tryptophan synthase alpha chain n=18 Tax=cellula... 43 0.012 UniRef50_B2JRL0 L-lactate dehydrogenase (Cytochrome) n=1 Tax=Bur... 43 0.012 UniRef50_A0QHG8 Tryptophan synthase alpha chain n=15 Tax=Actinob... 43 0.013 UniRef50_P51382 Tryptophan synthase alpha chain n=1 Tax=Porphyra... 43 0.013 UniRef50_C2CM78 L-lactate dehydrogenase n=2 Tax=Actinomycetales ... 43 0.013 UniRef50_C7Q1I4 FMN-dependent alpha-hydroxy acid dehydrogenase n... 43 0.014 UniRef50_D1JG56 Tryptophan synthase alpha chain n=1 Tax=uncultur... 43 0.014 UniRef50_B1YEC8 CutC family protein n=1 Tax=Exiguobacterium sibi... 43 0.014 UniRef50_B6JXZ5 Anthranilate synthase component 2 n=1 Tax=Schizo... 43 0.014 UniRef50_A5D1S2 Tryptophan synthase alpha chain n=2 Tax=Peptococ... 43 0.015 UniRef50_B8F7P2 Thiamine-phosphate pyrophosphorylase n=1 Tax=Hae... 43 0.015 UniRef50_B0K8T5 N-(5'-phosphoribosyl)anthranilate isomerase n=11... 42 0.017 UniRef50_A0YKS6 Tryptophan synthase alpha chain n=3 Tax=cellular... 42 0.019 UniRef50_Q5LBZ2 Tryptophan synthase alpha chain n=30 Tax=cellula... 42 0.019 UniRef50_A3K9S1 L-lactate dehydrogenase n=2 Tax=Rhodobacteraceae... 42 0.019 UniRef50_Q7UMC3 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 42 0.020 UniRef50_D1AFY8 Thiamine-phosphate pyrophosphorylase n=3 Tax=Bac... 42 0.022 UniRef50_C0GDL2 Tryptophan synthase alpha chain n=1 Tax=Dethioba... 42 0.023 UniRef50_Q0RGR2 Tryptophan synthase alpha chain n=1 Tax=Frankia ... 42 0.024 UniRef50_O67502 Tryptophan synthase alpha chain n=5 Tax=Aquifica... 42 0.026 UniRef50_C2E9Z7 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 42 0.026 UniRef50_B8DET0 Thiamine-phosphate pyrophosphorylase n=27 Tax=Ba... 42 0.027 UniRef50_Q1PX46 Tryptophan synthase alpha chain n=2 Tax=Planctom... 42 0.027 UniRef50_C4KZE0 Thiamine-phosphate pyrophosphorylase n=1 Tax=Exi... 41 0.029 UniRef50_C9KJ79 Tryptophan synthase alpha chain n=2 Tax=Veillone... 41 0.029 UniRef50_A9VWE9 Tryptophan synthase alpha chain n=29 Tax=Proteob... 41 0.029 UniRef50_B2V7Q4 N-(5'-phosphoribosyl)anthranilate isomerase n=4 ... 41 0.029 UniRef50_B6K1N0 Lactate 2-monooxygenase n=1 Tax=Schizosaccharomy... 41 0.029 UniRef50_B7JUK2 Tryptophan synthase alpha chain n=17 Tax=cellula... 41 0.030 UniRef50_A7RW57 Predicted protein n=9 Tax=Eukaryota RepID=A7RW57... 41 0.031 UniRef50_A9B6M7 Tryptophan synthase alpha chain n=8 Tax=Chlorofl... 41 0.032 UniRef50_C9SFI6 Copper homeostasis protein cutC n=1 Tax=Verticil... 41 0.033 UniRef50_Q2SS12 Copper homeostasis protein CutC, putative n=3 Ta... 41 0.035 UniRef50_C0Z6B9 Putative oxidoreductase n=1 Tax=Brevibacillus br... 41 0.035 UniRef50_Q7VGK6 Triosephosphate isomerase n=3 Tax=Helicobacter R... 41 0.035 UniRef50_Q0TRL8 Dihydropteroate synthase n=16 Tax=Clostridiales ... 41 0.040 UniRef50_B8MZR1 Trytophan synthase alpha subunit, putative n=2 T... 41 0.040 UniRef50_Q9YGA9 Tryptophan synthase alpha chain n=5 Tax=Euryarch... 41 0.040 UniRef50_C6A2A3 Geranylgeranylglyceryl phosphate synthase n=1 Ta... 41 0.040 UniRef50_C2W9E4 Copper homeostasis protein CutC n=4 Tax=Bacillus... 41 0.040 UniRef50_A8ZVH6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 41 0.041 UniRef50_A6VPG9 Thiamine-phosphate pyrophosphorylase n=5 Tax=Pas... 41 0.044 UniRef50_Q72EU7 Tryptophan synthase alpha chain n=12 Tax=Bacteri... 41 0.046 UniRef50_C9A792 Tryptophan synthase alpha chain n=3 Tax=Enteroco... 41 0.049 UniRef50_C9KJ76 N-(5'-phosphoribosyl)anthranilate isomerase n=3 ... 41 0.050 UniRef50_D1JG54 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 41 0.053 UniRef50_Q6BVL8 DEHA2C01584p n=4 Tax=Dikarya RepID=Q6BVL8_DEBHA 41 0.055 UniRef50_A5FY58 Tryptophan synthase alpha chain n=1 Tax=Acidiphi... 41 0.055 UniRef50_Q2RIT8 N-(5'-phosphoribosyl)anthranilate isomerase n=7 ... 41 0.055 UniRef50_D0MQ90 Tryptophan synthase, putative n=2 Tax=Phytophtho... 41 0.056 UniRef50_D2S4B7 (S)-2-hydroxy-acid oxidase n=1 Tax=Geodermatophi... 41 0.056 UniRef50_A0B8J3 Geranylgeranylglyceryl phosphate synthase n=13 T... 41 0.061 UniRef50_A4FLZ5 L-lactate dehydrogenase n=2 Tax=Actinomycetales ... 40 0.064 UniRef50_Q3ABS2 N-(5'-phosphoribosyl)anthranilate isomerase n=1 ... 40 0.069 UniRef50_B1I3Z6 N-(5'-phosphoribosyl)anthranilate isomerase n=2 ... 40 0.071 UniRef50_Q93Q21 Tryptophan synthase alpha chain n=1 Tax=Nostoc p... 40 0.075 >UniRef50_P39364 Putative sgc region protein sgcQ n=61 Tax=Bacteria RepID=SGCQ_ECOLI Length = 268 Score = 302 bits (774), Expect = 8e-81, Method: Composition-based stats. Identities = 268/268 (100%), Positives = 268/268 (100%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS Sbjct: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI Sbjct: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN Sbjct: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT Sbjct: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDGVFANFVDQARVSQFMEKVHHIRR Sbjct: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 >UniRef50_C0D1F9 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D1F9_9CLOT Length = 276 Score = 289 bits (741), Expect = 6e-77, Method: Composition-based stats. Identities = 120/274 (43%), Positives = 159/274 (58%), Gaps = 6/274 (2%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M W +++ G +K +I M HL LPGDP F M V++ A DL ALQ GGVD +MFS Sbjct: 1 MLWTEKMFGVKKPIITMLHLDPLPGDPRFHYGDTMERVVEHARADLHALQEGGVDGIMFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFSLPY + T AAMAR+IG+L S+IR+P+GV+ + D AS +LA A AKFIR Sbjct: 61 NEFSLPYERHMSFVTPAAMARVIGELKSEIRVPYGVDCISDGQASIELAAAVDAKFIRGT 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F+G Y D G ++ + +R + + ++K L+ I PE+ + R + IAKST+F Sbjct: 121 FSGVYVGDGGFYNNDFSALLRRKAALHLDDLKMLYFINPESDRSMDTRPLVDIAKSTIFK 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 HPD LC+S AG D L+ VK PD VVL NTG + +E +L+ AD V T Sbjct: 181 AHPDGLCISANAAGQDVDDELIASVKSGAPDVVVLCNTGCRPDTIERKLTTADAAVVGTY 240 Query: 241 FKKDGVFAN------FVDQARVSQFMEKVHHIRR 268 FK+ G N VD RV +FME VH R Sbjct: 241 FKEGGKLENDKLENVRVDVNRVKEFMEVVHRFRE 274 >UniRef50_C5CIF3 Photosystem I assembly BtpA n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIF3_KOSOT Length = 260 Score = 285 bits (729), Expect = 1e-75, Method: Composition-based stats. Identities = 117/252 (46%), Positives = 158/252 (62%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ +KE+ G EK +I M H LPG P +D + G+ +++++ DL +LQNGG+DAVMF Sbjct: 1 MATVKEIFGKEKVIIGMVHFPPLPGSPLYDDKKGVEFIVERIKSDLKSLQNGGIDAVMFC 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE PY KV T A M+R IG++M +IR+PFGV+VLWDP A+ +A A GAKFIREI Sbjct: 61 NENDRPYKLKVDSATVATMSRAIGEVMDEIRVPFGVDVLWDPFAAIAIAKAVGAKFIREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 TG Y SD G+W T VGE R++ + A ++ FNI E A L R + IAKS F+ Sbjct: 121 ITGTYVSDMGLWKTEVGEFYRYRKLLDANDIAVFFNISAEFAYNLDRRPLEEIAKSVAFS 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + D + VSG G +K+VK+ V + V ANTGV ENV E L+IADG + T+ Sbjct: 181 SLADVILVSGPMTGESPSLDHIKKVKDKVGEKPVFANTGVTKENVREILNIADGAIIGTS 240 Query: 241 FKKDGVFANFVD 252 KKDG+ F + Sbjct: 241 LKKDGITRRFWN 252 >UniRef50_A7B0Z5 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B0Z5_RUMGN Length = 271 Score = 284 bits (726), Expect = 3e-75, Method: Composition-based stats. Identities = 116/268 (43%), Positives = 160/268 (59%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M W +++ G +K +IAM HL LPGDP + + M+ +I+ A DL ALQ+GGV+ ++FS Sbjct: 1 MLWTEKLFGVKKPIIAMLHLDPLPGDPLYKKENDMDVIIEHARADLHALQDGGVNGIIFS 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NEFS PY + T AAMA +IG L S+I++P+GV+ + D A +LA A A F+R Sbjct: 61 NEFSFPYQRTMDMVTPAAMAYVIGNLRSEIKVPYGVDAISDGRACLELAAAVKANFVRGT 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F G Y D G ++ + +R + + E+K L+ I PE+ L R + IAK+T+ Sbjct: 121 FCGVYVGDGGFYNNDFSALLRRKAALPLDELKMLYFINPESDQSLDTRPLADIAKTTIAK 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 PD LC+S AG D AL+ VKE PD VVL NTG + +E +L+ AD V TT Sbjct: 181 AAPDGLCISADAAGQDVDDALIASVKEANPDIVVLCNTGCRINTIERKLTTADAAVVGTT 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDG F N VD RV +FM+ VH R Sbjct: 241 FKKDGKFENRVDVNRVKEFMQVVHEFRE 268 >UniRef50_A3KNP0 Zgc:162297 protein n=7 Tax=Coelomata RepID=A3KNP0_DANRE Length = 268 Score = 283 bits (724), Expect = 5e-75, Method: Composition-based stats. Identities = 77/271 (28%), Positives = 129/271 (47%), Gaps = 9/271 (3%) Query: 3 WLKEVIGT-EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 + G + +I M H+RALPG P + ++ + ++A + N G+D ++ N Sbjct: 2 KFLNLFGRLQSNIIGMIHVRALPGTPL--NRFTISDIKEEACREAEIYYNAGLDGLIIEN 59 Query: 62 EFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPV-ASFDLAMATGAKFIR- 118 +PY V PE A M + + P GV +L ++ +A+A+G FIR Sbjct: 60 MHDIPYTLDVGPEVCACMTAVCTAVRGLYPSWPLGVQILSAANHSALAVALASGLDFIRA 119 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS-T 177 E F ++ +D G+ + GE +R++ IGA V+ +I + + + D+ + Sbjct: 120 EGFVFSHVADEGLLNACAGELLRYRKCIGAEHVQIFTDIKKKHSAHALTADVSIAETAQA 179 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 D + V+G G + D L+ V ++V VL +GV +NVE L A + Sbjct: 180 AEFFLSDGVVVTGSATGAKADPQELREVSQSV-RIPVLIGSGVTDDNVEHYLQ-ASAMII 237 Query: 238 ATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + FKK G +AN VD RV +FM K+H +R Sbjct: 238 GSHFKKGGYWANGVDAERVKRFMGKMHKLRE 268 >UniRef50_B1CBM1 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBM1_9FIRM Length = 269 Score = 282 bits (722), Expect = 9e-75, Method: Composition-based stats. Identities = 128/265 (48%), Positives = 176/265 (66%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 W+KE+ GT+K ++AM HL ALPGDP +D G+ +VI++A ++ ALQ+GGVD ++ SNE Sbjct: 2 WMKEIFGTDKPIVAMLHLAALPGDPLYDENKGLCYVIERAKREIKALQDGGVDGILISNE 61 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFT 122 +S PY+ V T +MAR+IGQL IP GV ++ DP +FDLA + GAKF+R FT Sbjct: 62 YSFPYMGDVPIITAMSMARVIGQLKEYFTIPMGVQIISDPYKTFDLAASVGAKFVRGTFT 121 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 G++A D G+ + G+ +RH+ +GA +VK ++N+VPEAA YL +R IA STVF+ Sbjct: 122 GSFAGDHGIAVYDTGKIMRHKIAVGAKDVKCMYNLVPEAAKYLVDRSWEEIADSTVFHCK 181 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFK 242 PDAL V+G AG D+ ++ RVK+ VP+T V ANTGV EN+E QL+ DG + TTFK Sbjct: 182 PDALMVAGFLAGREADTQIMTRVKKVVPNTPVFANTGVRYENIEMQLAACDGAIVGTTFK 241 Query: 243 KDGVFANFVDQARVSQFMEKVHHIR 267 +DG F RV FM KV R Sbjct: 242 EDGDFYKEAKYDRVKAFMNKVREFR 266 >UniRef50_Q5JHL2 Uncharacterized protein TK2179 n=5 Tax=Euryarchaeota RepID=Y2179_PYRKO Length = 261 Score = 278 bits (712), Expect = 1e-73, Method: Composition-based stats. Identities = 76/264 (28%), Positives = 122/264 (46%), Gaps = 8/264 (3%) Query: 7 VIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLP 66 + K +I M HL+ LPG ++ + VI+ A D + L+ G DAVM N +P Sbjct: 1 MDFERKPLIGMVHLKPLPGSYLYNGD--FDSVIEAALRDAVTLEEAGFDAVMVENFGDVP 58 Query: 67 YLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGA 124 + T A++A + + ++ +P GVNVL D +A++ +A A A FIR + +G Sbjct: 59 FPKYADKTTVASLAVVAKAIRDEVSLPLGVNVLRNDGIAAYSIAYAVKADFIRVNVLSGV 118 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 +D G+ + E + R+ + +K ++ + AV+ G D TV D Sbjct: 119 AYTDQGIIEGIAHELAMLRKRLPSE-IKVFADVHVKHAVHFG--DFEDAFLDTVERGLAD 175 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 A+ VSG G D L KE P V+ +G +N+ E ADG + T K+D Sbjct: 176 AVVVSGKATGRPVDVDKLALAKEISP-VPVIVGSGTSYDNLPELWKYADGFIVGTWIKRD 234 Query: 245 GVFANFVDQARVSQFMEKVHHIRR 268 G N V R + +E +R+ Sbjct: 235 GRVENEVSLERARKLVELAKELRQ 258 >UniRef50_Q8U2H5 Uncharacterized protein PF0860 n=8 Tax=Euryarchaeota RepID=Y860_PYRFU Length = 262 Score = 277 bits (709), Expect = 3e-73, Method: Composition-based stats. Identities = 67/265 (25%), Positives = 112/265 (42%), Gaps = 8/265 (3%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 +K++ ++K +I + HL+ LPG P + VI+ A D + G D ++ N Sbjct: 1 MKDLDFSKKPLIGVVHLKPLPGSPRYGGD--FEEVIEWAIRDAKTYEEAGFDGIIVENFG 58 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIF 121 P+ + E A + + ++ +P G+N L D + ++ +A A G FIR + Sbjct: 59 DSPFSKTLPREVIPAFTVVAKAVKKEVSLPLGINALRNDCIVAYSIAHAVGGSFIRVNVL 118 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 TG +D G+ + E + IG G++ TL ++ + AV+ N K TV Sbjct: 119 TGVAFTDQGIIEGCARELWNVKRIIG-GDILTLADVHVKHAVHFTN--FEDAVKDTVERG 175 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 D + V+G G L K V VL +GV N ADG + T Sbjct: 176 LADGIIVTGRRTGESISLEDLILAKR-VSSIPVLVGSGVNPRNFRTLFKYADGFIVGTWV 234 Query: 242 KKDGVFANFVDQARVSQFMEKVHHI 266 K++G N V R + + + Sbjct: 235 KENGKINNPVSLERAKILVRMKNSL 259 >UniRef50_UPI0000D55C2D PREDICTED: similar to conserved hypothetical protein n=1 Tax=Tribolium castaneum RepID=UPI0000D55C2D Length = 270 Score = 276 bits (707), Expect = 5e-73, Method: Composition-based stats. Identities = 76/274 (27%), Positives = 132/274 (48%), Gaps = 12/274 (4%) Query: 1 MSWLKEVI-GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M +++ T+ AV+ M H+ ALPG P + ++ ++ KA + G+D+++ Sbjct: 1 MLKFRQLFHTTKCAVVGMVHVGALPGTPLCN--KSVDSLVFKACKEAEMYLKYGLDSILV 58 Query: 60 SNEFSLPYLTKV--RPETTAAMARIIGQLMSDI--RIPFGVNVL-WDPVASFDLAMATGA 114 N +PY+ PET A M R+ ++ +P GV VL + + +A A Sbjct: 59 ENMHDVPYIQSKYFTPETVATMTRVCTEIRKIAPGTVPCGVQVLACGNLEALAVAKACNF 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 FIR E F + +D G D N G +R++ +I A V L +I + + + D+ + Sbjct: 119 DFIRAEGFVFGHVADEGYTDANAGLILRYRRQIQAENVLILADIKKKHSSHAITSDVSLV 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D L ++G+ G+ + + L +VK+ VL +GV +N+ + + A Sbjct: 179 ETAQAAQFFQADGLILTGVATGSPANVSELSQVKKFC-SLPVLVGSGVTGDNLGDYMG-A 236 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 DG + + FKK GV+ VD+ RV FMEK + Sbjct: 237 DGVIVGSYFKKGGVWYEDVDEERVRNFMEKRKML 270 >UniRef50_Q29E81 GA21203 n=3 Tax=Coelomata RepID=Q29E81_DROPS Length = 275 Score = 276 bits (707), Expect = 5e-73, Method: Composition-based stats. Identities = 68/276 (24%), Positives = 121/276 (43%), Gaps = 11/276 (3%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M ++ G +K VI M H+ ALPG P + I+KA + + +DAV+ Sbjct: 1 MRRFLKIFGQQKCKVIGMIHVDALPGTPRYAGHW--KETIEKAIYEANLYKRHQLDAVLI 58 Query: 60 SNEFSLPYLTK--VRPETTAAMARIIGQLMSDI--RIPFGVNVL-WDPVASFDLAMATGA 114 N +PY+ + + E TA M R+ + I IP GV VL + +A A+ Sbjct: 59 ENMHDIPYVPERLLGAEITACMTRLGQAVRDVIPKEIPCGVQVLACGNKQALAIAKASQL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 +FIR E F + +D G D G+ +R++ I A +V ++ + + + D+ + Sbjct: 119 QFIRSEGFVFGHVADEGYTDACAGDLLRYRKLIDAEDVLIFTDLKKKHSSHAITSDVSLL 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G G L+ + V +L +GV +N+ A Sbjct: 179 ETAHAAEFFLTDGIVITGTATGHAASPQDLQELSGRV-KVPLLIGSGVTKDNIGLYYKDA 237 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + + FK+ G + + + V FM KV +R+ Sbjct: 238 NAVIVGSHFKRHGSWLEEISEEAVENFMRKVCELRQ 273 >UniRef50_A4E7P3 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E7P3_9ACTN Length = 274 Score = 274 bits (702), Expect = 2e-72, Method: Composition-based stats. Identities = 97/267 (36%), Positives = 139/267 (52%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS+L + TEK VI M HLR LPGDP + ++ V++ A DL ALQ GGVD ++ + Sbjct: 6 MSFLTSMFKTEKPVIGMLHLRPLPGDPLYYPGGSVSQVVEAAKRDLEALQQGGVDGILIT 65 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE S+PY V P T A++ +IG L D+ P+G ++D A+ +L A A+F R Sbjct: 66 NELSMPYEQHVSPSTLASVGYVIGTLSHDLSTPWGAEAIYDGDATIELCAAVDAQFTRCN 125 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F GA+A D G+ + + T+R + + ++K I E VYL +R IA S +FN Sbjct: 126 FCGAWAGDLGLINRDFAHTMRRKAALRLDDLKLFHFITSEGEVYLNDRTTADIADSLLFN 185 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 PDA+ + G AG L V+E V + V+ TG V + + DG T Sbjct: 186 CLPDAMVIGGSAAGRGASGELADEVRERVGEVPVVCGTGCRENTVADVFAHYDGAFVGTC 245 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIR 267 K+DG VD RV++FM R Sbjct: 246 LKRDGRLDAPVDVERVARFMAAARTAR 272 >UniRef50_Q28QI3 Photosystem I assembly BtpA n=5 Tax=Alphaproteobacteria RepID=Q28QI3_JANSC Length = 267 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. Identities = 126/267 (47%), Positives = 161/267 (60%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M ++V GT K VIAM HL A+PG P DA G+ ++ A DL ALQ GVDAVMF Sbjct: 1 MQKFRDVFGTPKPVIAMVHLGAMPGTPLHDADAGLEGLVAAAAADLSALQAAGVDAVMFG 60 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 NE PY V +TA MA +IGQL I +PFGVNVLWDP ++ LA ATGA+F REI Sbjct: 61 NENDRPYEFAVDTASTATMAYVIGQLRGQITVPFGVNVLWDPDSTIALAAATGAQFCREI 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 FTG YASD GVW + G +R++ R+G ++ L+N+ E A L R + A+S VF+ Sbjct: 121 FTGTYASDMGVWAPDAGRALRYRKRLGRDDLAMLYNVSAEFADSLDKRPLPDRARSAVFS 180 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + PDA+ VSG G L+ VK +P+T VLANTGV + V E L IADGC+ ++ Sbjct: 181 SVPDAVLVSGQITGEAARMEDLEAVKAVLPETPVLANTGVKHDTVAEVLRIADGCIVGSS 240 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIR 267 K DG N VD R FM++ R Sbjct: 241 LKVDGHTWNAVDPDRAKDFMDRARASR 267 >UniRef50_Q9VS44 CG8607 n=22 Tax=Eukaryota RepID=Q9VS44_DROME Length = 275 Score = 272 bits (696), Expect = 9e-72, Method: Composition-based stats. Identities = 62/276 (22%), Positives = 120/276 (43%), Gaps = 11/276 (3%) Query: 1 MSWLKEVIGTE-KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M +V + +I M H+ ALPG P + I+ A + + +DAV+ Sbjct: 1 MQRFLKVFKQQTCKIIGMIHVDALPGTPRYAGNW--KQTIENAIYEANLYKKHQLDAVLI 58 Query: 60 SNEFSLPYLTK--VRPETTAAMARIIGQLMSDI--RIPFGVNVL-WDPVASFDLAMATGA 114 N +PY+ + + E A M R+ + I IP GV VL + +A A+ Sbjct: 59 ENMHDIPYVPERLLGAEIVACMTRLGRAVREVIPQEIPCGVQVLACGNKQALAIAKASQL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 +FIR E F + +D G D G+ +R++ I A +V ++ + + + D+ + Sbjct: 119 QFIRAEGFVFGHVADEGFTDACAGDLLRYRKLIDAEDVLIFTDLKKKHSSHAITADVSLL 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G G L+++ V ++ +GV +N++ A Sbjct: 179 ETAHAAEFFMTDGIIITGTATGHAASPEDLQQLSGRV-KVPLIIGSGVTRDNIDSYYKDA 237 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + + FK++G + + + V +FM+K+ +R Sbjct: 238 HAVIIGSHFKRNGNWLEEISEPAVDEFMQKICQLRH 273 >UniRef50_Q2RUX8 Photosystem I assembly BtpA n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RUX8_RHORT Length = 267 Score = 270 bits (690), Expect = 4e-71, Method: Composition-based stats. Identities = 122/258 (47%), Positives = 159/258 (61%), Gaps = 1/258 (0%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 +KAVIAM H+ ALPG P +DA GM +ID D+ LQ GGV A+MF NE PY Sbjct: 9 RKKAVIAMAHIGALPGTPLYDADGGMMKLIDDVVGDIEKLQKGGVHAIMFGNENDRPYQF 68 Query: 70 KVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDF 129 + + AAM II + + +PFGVN LWDP AS +A+ATGA F REIFTG +ASD Sbjct: 69 EAPIASVAAMTAIISAVRPMLSVPFGVNYLWDPAASVAIAVATGASFAREIFTGVFASDM 128 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 GVW N E +R + + ++K LFNI E A L +R I A+S +F++ DA+ VS Sbjct: 129 GVWSPNAAEALRLRRNLHRPDLKLLFNINAEFASSLDSRSIGLRARSAIFSSLADAILVS 188 Query: 190 GLTAGTRTDSALLKRVKETVP-DTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFA 248 G G ++ L+ V+E + + + ANTGV LENV++ LSIADGCV T FK DG Sbjct: 189 GPLTGQPAQASDLREVREAIGTEVPLFANTGVRLENVDDVLSIADGCVIGTHFKVDGSTW 248 Query: 249 NFVDQARVSQFMEKVHHI 266 N VD RVS+FM+KV + Sbjct: 249 NRVDGGRVSRFMDKVATL 266 >UniRef50_A8S303 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S303_9CLOT Length = 274 Score = 269 bits (688), Expect = 7e-71, Method: Composition-based stats. Identities = 74/275 (26%), Positives = 127/275 (46%), Gaps = 9/275 (3%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDA-QLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M LK+V +K +I M HLR LPG P +D +GM+ +I A ++ L+ GVD V Sbjct: 1 MGKLKDVFKVDKPIIGMVHLRPLPGSPKYDPVNMGMDKIISIALEEAAMLEQAGVDGVQV 60 Query: 60 SNEFSLPYLT--KVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGA-KF 116 N + +PYL + ET AA+A I + + + IP G + + Sbjct: 61 ENMWDIPYLRSEDIGYETAAALAVGIHAVRNKVSIPVGAECHMNGADCAMACAVAAGASW 120 Query: 117 IREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY--LGNRDICSI 173 IR + A+ S G + R + R+ A ++ L ++ + + + +R + Sbjct: 121 IRVFEWCNAFVSQSGFINAMGANVSRMRSRLKADQILALCDVNVKHGSHYIIHDRSVAEQ 180 Query: 174 AKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD 233 A + + DA+ V+G GT + + K++ +L +G+ NV E L+ AD Sbjct: 181 AMD-IESQDGDAVIVTGFDTGTPPSVENISKCKKS-TSLPILIGSGLNSSNVNELLTAAD 238 Query: 234 GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 G + + FK+ + N V R +FM+KV +R+ Sbjct: 239 GAIIGSWFKEGNNWKNPVSYDRTKEFMDKVIALRQ 273 >UniRef50_B9XI59 Photosystem I assembly BtpA n=1 Tax=bacterium Ellin514 RepID=B9XI59_9BACT Length = 262 Score = 269 bits (688), Expect = 8e-71, Method: Composition-based stats. Identities = 85/259 (32%), Positives = 130/259 (50%), Gaps = 6/259 (2%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT 69 K +I + HL LPG P + + V KA D + + GG DAV N +P+ Sbjct: 6 RRKVLIGVVHLGPLPGAPRWQGD--IGAVARKAVADARSYEQGGADAVFIENFGDVPFTK 63 Query: 70 K-VRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYA 126 V PET AAMA + + + +++P G NVL D A+ L A G F+R + TGA Sbjct: 64 SAVGPETVAAMAALGCAVRAAVKLPIGFNVLRNDARAALGLCAACGGSFVRVNVHTGAML 123 Query: 127 SDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL 186 +D G+ + N +T+R++ I + + ++ + AV LG+ I AK T+ DAL Sbjct: 124 TDQGLIEGNAYDTMRYREAI-SPGTQVFADVHVKHAVPLGSWTIEDSAKDTIERGLADAL 182 Query: 187 CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGV 246 VSG G + L+RV+ P+ +L +GV LEN + L +ADG + ++ K+ G Sbjct: 183 IVSGTGTGVAVNLDDLRRVRAACPEAKILLGSGVTLENAGDFLQLADGFIVGSSLKRGGK 242 Query: 247 FANFVDQARVSQFMEKVHH 265 AN VD RV+ + Sbjct: 243 LANPVDAKRVAALARAMRR 261 >UniRef50_A5KMZ4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5KMZ4_9FIRM Length = 269 Score = 268 bits (686), Expect = 1e-70, Method: Composition-based stats. Identities = 127/268 (47%), Positives = 166/268 (61%), Gaps = 4/268 (1%) Query: 3 WLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 W +++ G EK +IA+ HL ALPGDP + M V + A DL+ALQ+GGVD ++F+NE Sbjct: 2 WTQDMFGVEKPIIALLHLDALPGDPGYCGD--MKTVTEHARKDLLALQDGGVDGILFANE 59 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFT 122 FSLPY +AMA IIG+L +I +PFGVNV+ +P+A+ DL ATGAKF R F+ Sbjct: 60 FSLPYQPVADIAVVSAMAYIIGKLKDEISVPFGVNVVKNPIATIDLGAATGAKFGRSCFS 119 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 GAY ++GV+ +N GE IRH+ +G ++K LF + PEA YL RD+ +AKS +F + Sbjct: 120 GAYMGEYGVYVSNSGEAIRHRKALGIEDMKLLFKVNPEADAYLVQRDVQVVAKSIMFGDF 179 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVP--DTVVLANTGVCLENVEEQLSIADGCVTATT 240 D LCVSG AG D +L RV E V NTG NV E+L DG T Sbjct: 180 ADGLCVSGAAAGAEPDDVILSRVHEVAKPRKVPVFCNTGCNHGNVREKLGNCDGVCMGTA 239 Query: 241 FKKDGVFANFVDQARVSQFMEKVHHIRR 268 FKKDGVF VD+ RV +FME V IR+ Sbjct: 240 FKKDGVFNGRVDKERVREFMEIVADIRK 267 >UniRef50_D2QXT9 Photosystem I assembly BtpA n=2 Tax=Bacteria RepID=D2QXT9_9PLAN Length = 266 Score = 268 bits (685), Expect = 2e-70, Method: Composition-based stats. Identities = 73/258 (28%), Positives = 118/258 (45%), Gaps = 6/258 (2%) Query: 13 AVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYL-TKV 71 VIAM HL LPG P + L ++ + + + L G +M N +P T+V Sbjct: 12 PVIAMLHLPPLPGSPR--SALSISAITEHVCREAEMLTALGAAGLMLENFGDMPLPATQV 69 Query: 72 RPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDF 129 P T A M+RI + +P G+NVL D +A+ +A A GA FIR I GA +D Sbjct: 70 SPATVAQMSRIAAAVRMASSLPLGINVLRNDSLAAMAIASAVGASFIRVNILVGARLTDQ 129 Query: 130 GVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 G+ E +R + +GA E++ ++ + + L + ++T+ DAL V+ Sbjct: 130 GIIAGRADELLRLRKSLGAEEIQIWADVNVKHSWPLAPVSLEEETENTIRRGLADALIVT 189 Query: 190 GLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFAN 249 G G TD L+ V T VL +GV +++ A G + + K G + Sbjct: 190 GRGTGYETDPHELQAVISAAAGTPVLVGSGVTADSLANF-QGASGAIVGSWIKHQGDARS 248 Query: 250 FVDQARVSQFMEKVHHIR 267 +D RV + M+ + + Sbjct: 249 PIDPERVRRLMQASRNSK 266 >UniRef50_A9W9X3 Photosystem I assembly BtpA n=4 Tax=Chloroflexaceae RepID=A9W9X3_CHLAA Length = 284 Score = 264 bits (674), Expect = 3e-69, Method: Composition-based stats. Identities = 82/266 (30%), Positives = 131/266 (49%), Gaps = 8/266 (3%) Query: 6 EVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSL 65 E+ T K +I M H LPG P + GM +I+ A D AL GG D ++ N + + Sbjct: 19 EMFRTAKPIIGMVHCWPLPGAPGYTG-YGMQTIIEHAIRDAEALAEGGCDGLIVENMWDI 77 Query: 66 PYL--TKVRPETTAAMARIIGQLMSDI-RIPFGVN-VLWDPVASFDLAMATGAKFIR-EI 120 P+ V PE+ AA A + + + +P G+N V VA +A+A GA FIR + Sbjct: 78 PFRAGPHVPPESIAAQAVVAHAVRQAVPELPLGINLVHNGGVALLGIAIAAGASFIRVCM 137 Query: 121 FTGAYASDFGVW-DTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 FTGA D G W + + +R + + A +K ++ + +V D+ + + T F Sbjct: 138 FTGAGVWDAGSWDEGCAADLMRRRKELHAESIKIFADVDKKHSVRFPGIDLVTHIEWTRF 197 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 DA+ VSG G D A +++ +E DT +L +G +N+ + +ADG + + Sbjct: 198 FG-ADAIIVSGRMTGDAPDIAKVRQARELAGDTPILLGSGTTEQNIAAFMEVADGVIVGS 256 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHH 265 + K+DG AN VD RV +F+ Sbjct: 257 SIKQDGEIANPVDVNRVRRFVAAARG 282 >UniRef50_P72966 Photosystem I biogenesis protein btpA n=34 Tax=Cyanobacteria RepID=BTPA_SYNY3 Length = 287 Score = 263 bits (673), Expect = 4e-69, Method: Composition-based stats. Identities = 74/265 (27%), Positives = 129/265 (48%), Gaps = 6/265 (2%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 L + T VI + HL LP + L VI++A + AL GGVD ++ N F Sbjct: 3 LFQTFQTHNPVIGVVHLLPLPTSARWGGNLT--AVIERAEQEATALAAGGVDGIIVENFF 60 Query: 64 SLPYLTK-VRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EI 120 P+ + V P +AM I+ +L + + P G+NVL D ++ +A GAKFIR + Sbjct: 61 DAPFPKQRVDPAVVSAMTLIVDRLQNLVVAPVGINVLRNDAHSALAIASCVGAKFIRVNV 120 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 TG A+D G+ + N E +R++ + + +V L +++ + A LG ++ + T+ Sbjct: 121 LTGVMATDQGLIEGNAHELLRYRREL-SSDVAILADVLVKHARPLGTPNLTTAVTDTIER 179 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 D + +SG G+ + L+ T V +G +N+ + + A+G + A++ Sbjct: 180 GLADGIILSGWATGSPPNLEDLELATNAAKGTPVFIGSGADEDNIGQLIQAANGVIVASS 239 Query: 241 FKKDGVFANFVDQARVSQFMEKVHH 265 K+ G +D RVS F+E + Sbjct: 240 LKRHGNINEAIDPIRVSAFIEAMAE 264 >UniRef50_C5ELG9 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5ELG9_9FIRM Length = 275 Score = 263 bits (672), Expect = 5e-69, Method: Composition-based stats. Identities = 77/274 (28%), Positives = 126/274 (45%), Gaps = 9/274 (3%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDA-QLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M L+ + +K +I M HLR LPG P +D + M +++ A D+ LQ+ GVD V Sbjct: 1 MRQLQSIFREKKPIIGMVHLRPLPGSPMYDPASMDMTKILEIAVDEAKKLQDAGVDGVQV 60 Query: 60 SNEFSLPY--LTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGA-KF 116 N + +PY + ET AA+A I ++ + IP G + + A ++ Sbjct: 61 ENMWDIPYNRPEDIGYETAAALAVGIYEVGKHVSIPVGAECHMNGAECAMASAAAAGARW 120 Query: 117 IREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY--LGNRDICSI 173 IR + A+ S G + G R + R+ AG + L ++ + + + +R + Sbjct: 121 IRVFEWCNAFISQSGFVNGAGGRVSRMRDRLKAGHILALCDVNVKHGSHYIIHDRSVKEQ 180 Query: 174 AKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD 233 A + DA+ V+G G + K + VL +G+ EN+ E LS AD Sbjct: 181 AMD-IEAQGGDAVIVTGFDTGMPPTVDKVLECKAAIG-IPVLLGSGLAEENITELLSAAD 238 Query: 234 GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 G + +TFK G + N VD R FM++V +R Sbjct: 239 GAIVGSTFKAQGKWQNPVDYYRTKAFMDRVVKLR 272 >UniRef50_UPI000186CA08 conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186CA08 Length = 278 Score = 262 bits (670), Expect = 9e-69, Method: Composition-based stats. Identities = 66/273 (24%), Positives = 139/273 (50%), Gaps = 12/273 (4%) Query: 1 MSWLKEVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 MS L +++ + +I M H++ALPG P + +L +N +I++A +D+ ++ V++++ Sbjct: 1 MSKLPDLLKMTRPYIIGMVHVKALPGTP--NNKLNINSLIEEACNDVEIYKSCNVNSILV 58 Query: 60 SNEFSLPY--LTKVRPETTAAMARIIGQLMSDI--RIPFGVNVLWDP-VASFDLAMATGA 114 N +PY V PE A+M +I ++ + + + GV +L + +A A Sbjct: 59 ENMHDVPYVQSKSVGPEIIASMTKICSEIKNILPRHMTCGVQILAGANKEALAVAQAAEL 118 Query: 115 KFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 ++IR E + ++ +D G+ ++ GE +R++ IGA + +I + + D+ + Sbjct: 119 QYIRAEGYVFSHIADEGLMNSCAGELLRYRKYIGAENISIWTDIKKKHCSHSITSDLTLV 178 Query: 174 AKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIA 232 + D + ++G T G + +++ + ++ +GV ENV + L+ A Sbjct: 179 ETALAAEFFLSDGIVLTGKTTGNAIRKSDFIKIQNSC-SLPIVIGSGVTAENVADFLN-A 236 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + + + FKK+G+++N VD+ RV FM + Sbjct: 237 NAIIVGSYFKKEGLWSNEVDKNRVENFMNVLVE 269 >UniRef50_D2RDD9 Photosystem I assembly BtpA n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RDD9_ARCPR Length = 249 Score = 260 bits (666), Expect = 2e-68, Method: Composition-based stats. Identities = 70/248 (28%), Positives = 111/248 (44%), Gaps = 15/248 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL LP P ++ + A D AL G DA++ N P+L +V Sbjct: 3 IIGVLHLDPLPSSPLYE---SYEKTFENALKDAKALAE-GCDAIIIENYGDKPFLKEVDR 58 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 T A M+ I ++ + +P G+NVL DP ++ +A A A F+R A S G Sbjct: 59 VTVACMSVIAWEVKRETGLPVGINVLRNDPFSALAIAKAVNADFVRVNQLYFASLSPEGF 118 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 + GE +R++ I + K ++ + A + + ++ V DAL V+G Sbjct: 119 LEGKAGEILRYRRFID-CKAKIYADVKVKHAHHFV--SLEDYLEN-VERCLADALIVTGT 174 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G D LK V+ + + V +GV EN+ + + DG + T FKKDG V Sbjct: 175 ATGREVDVEELKAVRN-LTNLPVFVGSGVKPENLHRYVGLCDGVIVGTYFKKDG----RV 229 Query: 252 DQARVSQF 259 D RV + Sbjct: 230 DVERVRRL 237 >UniRef50_C3ZBU0 Putative uncharacterized protein n=2 Tax=Metazoa RepID=C3ZBU0_BRAFL Length = 279 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. Identities = 85/281 (30%), Positives = 135/281 (48%), Gaps = 18/281 (6%) Query: 1 MSWLKEVIGT-EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M ++V G + A + M H+ ALPG P + + +IDKA + + G+DAVM Sbjct: 1 MQRFQKVFGRLQAAAVGMVHVGALPGTPR--SSETVGQLIDKACKEAEIYKRAGLDAVMV 58 Query: 60 SNEFSLPYLT--KVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDP-VASFDLAMATGAK 115 N +PYL V E TAAM + ++ R+P GV VL + +A+ATG Sbjct: 59 ENMHDVPYLLGGDVGHEVTAAMTAVCREVRRVCPRLPCGVQVLSAANKQALAVALATGYV 118 Query: 116 FIREIF------TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD 169 R S G ++ G+ +R++ +IGA + +I + + + D Sbjct: 119 PCRSGLRACGRVCVLPCSRRGAVNSCAGDLLRYRTQIGADSIMVFTDIKKKHSSHAITAD 178 Query: 170 --ICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEE 227 I A++ F D + V+G G DS LK V++ V D VL +GV EN+ Sbjct: 179 VSIADTARAAEFF-LSDGVIVTGTETGRPVDSKELKEVRQAV-DIPVLVGSGVSTENLPT 236 Query: 228 QLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 L A+G + + FKK G++ N VD RV+ FM+++ +R+ Sbjct: 237 YLR-ANGLIVGSYFKKHGLWQNEVDLDRVNMFMDRLSTLRQ 276 >UniRef50_Q9Y937 BtpA homolog n=1 Tax=Aeropyrum pernix RepID=Q9Y937_AERPE Length = 287 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. Identities = 59/280 (21%), Positives = 109/280 (38%), Gaps = 18/280 (6%) Query: 7 VIGTEKAVIAMCHLRALPGD---------PSFDAQLGMNWVIDKAWDDLMALQNGGVDAV 57 V K ++ + HL LPG P + +I+ A + ++ G D V Sbjct: 3 VFQRCKPLLGVVHLPPLPGSTGYKARRYPPRLGKVWSLEEIIEYAVSEASVYEDAGFDGV 62 Query: 58 MFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKF 116 + N PY P +AM RI+ ++ S + IP GVN+L + V + A + G F Sbjct: 63 ILENYGDTPYPKTPGPLQVSAMTRIVREVSSAVGIPVGVNMLRNGSVEALASAYSGGGSF 122 Query: 117 IR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGE---VKTLFNIVPEAAVYLGNRDICS 172 IR S G+ + + + +G E ++ L ++ + + L I Sbjct: 123 IRVNSLCETRLSPEGILEPDAARLAKSLALLGILEERRIEILADVDVKHSQPLVETSIAQ 182 Query: 173 IAKSTVFNNHP--DALCVSGLTAGTRTDSALLKRVKETVPDTVV--LANTGVCLENVEEQ 228 + + + + ++G G D+ + T + V + +GV N+ + Sbjct: 183 TVRDCIERSGVPIAGVVLTGHATGGAPDADEVVAAARTASEYEVKTVVGSGVSQLNLSKY 242 Query: 229 LSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 IADG + ++ K G N +D+ + +RR Sbjct: 243 WHIADGFIIGSSIKLGGKPWNPIDKEKARLIASLAERLRR 282 >UniRef50_A3K4E4 Putative uncharacterized protein n=1 Tax=Sagittula stellata E-37 RepID=A3K4E4_9RHOB Length = 282 Score = 254 bits (648), Expect = 3e-66, Method: Composition-based stats. Identities = 66/272 (24%), Positives = 119/272 (43%), Gaps = 9/272 (3%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S L+ + +K +I + HL ALPG P +D + + A D L GGVD +M N Sbjct: 12 SALETLFEKKKPIIGVIHLAALPGAPFYDGAP-LREIYAAAVRDAKTLAAGGVDGIMIEN 70 Query: 62 EFSLPY--LTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR 118 +P+ + ET A + + + P G+ + + + +A A GA+++R Sbjct: 71 AGDMPFARPEDIGFETVAFLTAACEAVRGAVDTPIGITCVANGAIPGLAVAKAVGARWVR 130 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN--RDICSIAK 175 + AY ++ G + +R++ +I A +V L ++ + + R I A Sbjct: 131 VNQWANAYVANEGFLNGAASAAMRYRAQIAAKDVAVLADVHVKFGAHAITADRTITEQAT 190 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 + D L +G G+ T +++V+ V+ +G+ E V + +ADG Sbjct: 191 DAEWFG-ADVLIATGQRTGSPTQPEEVRQVRAG-THLPVIVGSGLSPEQVPALMEVADGA 248 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 + K D + N VD ARV + M + +R Sbjct: 249 IVGQWLKVDARWWNPVDPARVERLMTAMDQVR 280 >UniRef50_A6NZB9 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NZB9_9BACE Length = 266 Score = 254 bits (648), Expect = 3e-66, Method: Composition-based stats. Identities = 77/262 (29%), Positives = 117/262 (44%), Gaps = 6/262 (2%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 + +K VI M HL+ALPG P + M+ + A +DL AL+ GGVDA + N Sbjct: 7 FHRMFPGQKPVIGMVHLQALPGAPGYGG--SMDEIYRAAVEDLHALEQGGVDAAIVENFG 64 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS-FDLAMATGAKFIR-EIF 121 PY T AAM + QL ++ + G+NV ++ + + +A A G FIR E Sbjct: 65 DTPYALNHELITLAAMTALAVQLRAESSLRLGLNVQFNCTEAEWGIAYAAGYDFIRVEAL 124 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 GV +R + R A L +I + + + + + Sbjct: 125 VENRVGVHGVAFAAAPSLLRLKSRYPAET-MLLADINVKHTYPMVEQPLDASIHEAKE-A 182 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 AL V+G+ G + R KE +T VL +G+ EN IADG + ++F Sbjct: 183 GAGALIVTGVVTGQNPSLEDVCRCKELAGETPVLLGSGIHQENAAAFFQIADGAIVGSSF 242 Query: 242 KKDGVFANFVDQARVSQFMEKV 263 K++G N VD RV +FME + Sbjct: 243 KENGDVRNKVDTGRVRRFMEAL 264 >UniRef50_B9XEW7 Photosystem I assembly BtpA n=1 Tax=bacterium Ellin514 RepID=B9XEW7_9BACT Length = 265 Score = 254 bits (648), Expect = 3e-66, Method: Composition-based stats. Identities = 67/262 (25%), Positives = 117/262 (44%), Gaps = 7/262 (2%) Query: 6 EVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSL 65 + + K +I M H+ ALPG P+ L + + + A + ++ GVD + N + Sbjct: 4 RLFASAKPIIGMIHVGALPGTPA--NHLSLGKITEIAVQEAKIYRDAGVDGIAIENMHDV 61 Query: 66 PYLTK-VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATG-AKFIR-EIFT 122 PYL V PE ++M I + G+ +L A ++R E F Sbjct: 62 PYLRGGVGPEIVSSMTIIGQAVKQAFCGVTGIQILAAANREAMAAAHAAALDWVRVEGFV 121 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS-TVFNN 181 A+ +D G ++ E +R++ +IGA +V+ +I + + + DI + Sbjct: 122 FAHVADEGFINSCAAELLRYRKQIGAEKVQVWADIKKKHSSHAITADISLGETAHAAEFM 181 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 DAL V+G G A + K V+ +G+ N+ + L +ADG + ++F Sbjct: 182 RADALIVTGPVTGRPPVPADAEETKA-HTHLPVILGSGMNEANIGQFLPVADGFIVGSSF 240 Query: 242 KKDGVFANFVDQARVSQFMEKV 263 KK G + N VD +V FM++V Sbjct: 241 KKAGDWNNPVDSRKVKAFMKRV 262 >UniRef50_Q8TVC9 Predicted TIM-barrel enzyme n=1 Tax=Methanopyrus kandleri RepID=Q8TVC9_METKA Length = 271 Score = 253 bits (647), Expect = 4e-66, Method: Composition-based stats. Identities = 82/256 (32%), Positives = 124/256 (48%), Gaps = 10/256 (3%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 V+ + HL LPG P + V+++A D L++GGVDAV+ N PY P Sbjct: 15 VVGVVHLPPLPGSPR---AKSIEEVVERARRDAARLEDGGVDAVLVENFGDTPYYPDDVP 71 Query: 74 E-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFG 130 + T A M R + +++ + +P GVNVL D VA+ D+ ATGA FIR + A A+D G Sbjct: 72 KITVACMTRAVAEVVDTVSVPVGVNVLRNDGVAAVDVCAATGASFIRVNAYVEAVATDQG 131 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V R R+G +V+ +I + L +R + +A+ V DA+ V+G Sbjct: 132 VLQPVAHMVWREIDRLGV-DVEVYADIRVKHGRPLDDRPVEEVARDAVERGLADAVIVTG 190 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTFKKDGVFAN 249 G+ +++V V VL +GV EN L A G + T FKK+G+ N Sbjct: 191 SATGSPPRPEEVRKVARVVDR--VLVGSGVTPENAHVFLRAGAAGFIVGTYFKKNGITEN 248 Query: 250 FVDQARVSQFMEKVHH 265 VD RV + + + Sbjct: 249 PVDVDRVRELVRFIRR 264 >UniRef50_B5XQK9 BtpA family protein n=18 Tax=Proteobacteria RepID=B5XQK9_KLEP3 Length = 281 Score = 249 bits (636), Expect = 7e-65, Method: Composition-based stats. Identities = 65/268 (24%), Positives = 126/268 (47%), Gaps = 9/268 (3%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 ++ + KAVI + H PG P + + ++ ++++A D +GGV ++ N Sbjct: 12 IQAIFSRSKAVIGVIHCDPFPGSPKYRGK-SVSDIVERALRDAENYISGGVHGLIIENHG 70 Query: 64 SLPY--LTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-E 119 +P+ + ET+A MA I ++ +P G+NVL + + + +A+A GA F+R Sbjct: 71 DIPFSKPEDIGHETSALMAVITEKVRERFAVPLGINVLANAAIPAMAIALAGGADFVRVN 130 Query: 120 IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY--LGNRDICSIAKST 177 + AY ++ G + + +R++ + A ++ + + + + +R I + + Sbjct: 131 QWANAYIANEGFIEGAAAKALRYRSMLRAEHIRVFADSHVKHGSHAIVADRSIQELTRDV 190 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 F DA+ +G G A + ++ + +L +GV NV++ L G + Sbjct: 191 DFFE-ADAVIATGQRTGDSATMAEIDEIRAA-TELPLLVGSGVTPANVKQILGRTQGVIV 248 Query: 238 ATTFKKDGVFANFVDQARVSQFMEKVHH 265 A+T K DGV+ N V+ ARV FM Sbjct: 249 ASTMKVDGVWWNDVELARVKHFMSVAQA 276 >UniRef50_A4YFU5 Photosystem I assembly BtpA n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YFU5_METS5 Length = 258 Score = 249 bits (636), Expect = 9e-65, Method: Composition-based stats. Identities = 72/257 (28%), Positives = 121/257 (47%), Gaps = 8/257 (3%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK-VR 72 + M HL LPG P + ++ A + LQ+ GVDAV+ N P+ + Sbjct: 6 IAGMIHLPPLPGSPR--GGQPLEEIVKYAVTEADKLQSAGVDAVIVENLGDYPFFKDNMP 63 Query: 73 PETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDFG 130 P T A+M+ I+ ++ + + GVNVL + + +F LA GA FIR I GAYA+D G Sbjct: 64 PITVASMSVIVREVRRKLGLQVGVNVLRNGCIDAFSLAHVNGADFIRCNILIGAYATDQG 123 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V + E +R + + + V+ L ++ + A L N +A+ DA+ VSG Sbjct: 124 VIEGRAAELLRLKRSLNS-RVRILADVHVKHAYPLYNLPTELVAQDLAERGGADAVIVSG 182 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT-TFKKDGVFAN 249 + +K+VKE+V V+ +G+ L N +E +ADG + FK++G+ Sbjct: 183 PRSSLPPSIETVKKVKESV-QVPVIVGSGISLGNFKEFCGVADGLIVGEVDFKENGMIGG 241 Query: 250 FVDQARVSQFMEKVHHI 266 + ++ + Sbjct: 242 PSKVEAYKKLVKGCKGV 258 >UniRef50_Q1NZ26 Uncharacterized protein F13E9.13, mitochondrial n=3 Tax=Caenorhabditis RepID=YSMU_CAEEL Length = 277 Score = 249 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 72/269 (26%), Positives = 122/269 (45%), Gaps = 15/269 (5%) Query: 10 TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY-L 68 + V M H+ ALPG P L M+ ++ K + GVD V+ N +PY Sbjct: 14 SRPLVFGMIHVPALPGTP--SNTLPMSAILKKVRKEADVYFKNGVDGVIVENMHDVPYVK 71 Query: 69 TKVRPETTAAMARIIGQLM--SDIRIPF---GVNVLWDP-VASFDLAMATGAKFIR-EIF 121 PE ++MA QL+ D P G+ +L + +A TG FIR E F Sbjct: 72 PPASPEIVSSMALASDQLVKSRDAHHPAALTGIQILAAANREALAVAYTTGMDFIRAEGF 131 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC--SIAKSTVF 179 ++ +D G D G +R++ + A + +I + + + D+ +AK F Sbjct: 132 VYSHVADEGWIDGCAGGLLRYRSSLKAENIAIFTDIKKKHSAHSVTSDVSIHEMAKDAKF 191 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 N D + V+G G+ + +V + V + VL +G+ +N E + A G + + Sbjct: 192 NC-ADGVIVTGSATGSAASLEEMIQVMK-VQEFPVLIGSGINGKNAREFVK-AHGFIVGS 248 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHHIRR 268 FK G + N +D R+S+FM+ V+ ++R Sbjct: 249 DFKIGGEWKNDLDSGRISKFMKHVNTLKR 277 >UniRef50_A8AAQ2 Photosystem I assembly BtpA n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8AAQ2_IGNH4 Length = 268 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 77/255 (30%), Positives = 124/255 (48%), Gaps = 7/255 (2%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 VI + HL LPG + + VI++A D AL+ GGVDA++ N P+ +V Sbjct: 3 VIGVVHLLPLPGS--YGWGGDFDAVIERAVKDAKALEKGGVDAIIIENFMDYPFPIRVDY 60 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 T AA R++ +++ + + GV++L + + +A+A+GAKF+R + + G+ Sbjct: 61 VTVAAATRVVTEVVRSLELSAGVSLLRNSAPEAIAVALASGAKFVRSNQWCWTSDAPEGL 120 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 E + R GA V + ++ + A + RD+C A+ DAL VSG Sbjct: 121 LTPVAREGLEVMRRWGAK-VGVVADVRVKHAAPISGRDLCDEARDLGGRCRADALAVSGA 179 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G+ D L+ VK P VL +G+ ENV + ADG + T FK+ GV N V Sbjct: 180 ATGSEADPRQLEVVKTCTPK-PVLVASGITPENVVRF-ASADGVIVGTYFKEGGVTENPV 237 Query: 252 DQARVSQFMEKVHHI 266 D RV + ++ + Sbjct: 238 DVHRVRKLVDAAKRL 252 >UniRef50_B9CKX7 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CKX7_9ACTN Length = 270 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 87/270 (32%), Positives = 138/270 (51%), Gaps = 4/270 (1%) Query: 1 MSWLKE----VIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDA 56 MS L E G+ +I H++ALPG P D+++ + I++ D LQ+ G DA Sbjct: 1 MSTLLEKHYATFGSSCPIIGCLHMQALPGTPFSDSKITLKNQIERLKRDAYTLQDAGFDA 60 Query: 57 VMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF 116 V+F+NE PY+T V +T A RI +++ ++ IP+G VL DP A+ A A AKF Sbjct: 61 VVFANEGDRPYITPVGFDTVANYVRIATEVIEELSIPYGCGVLIDPFATLAAAKALEAKF 120 Query: 117 IREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS 176 +R TG+Y FG N GE R+Q +I A +V+ P A L R + ++ Sbjct: 121 VRTYVTGSYEGLFGSQKFNPGEIFRYQKQIEATDVRVYTYFEPHAGTCLDVRSSEEMLEA 180 Query: 177 TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCV 236 + N G AG +++ + R+KE + ++ +G EN+ + L ADG + Sbjct: 181 GIANLPIAGALFGGAHAGLPPEASHIVRLKEEFTEVPLIIGSGGTAENISKLLPHADGVI 240 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHHI 266 T+ KKDG+ N VD R +F++ ++ Sbjct: 241 VGTSIKKDGILWNNVDPVRAKRFVKAAKNL 270 >UniRef50_A3DLN8 Photosystem I assembly BtpA n=1 Tax=Staphylothermus marinus F1 RepID=A3DLN8_STAMF Length = 260 Score = 239 bits (611), Expect = 7e-62, Method: Composition-based stats. Identities = 52/256 (20%), Positives = 109/256 (42%), Gaps = 8/256 (3%) Query: 16 AMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLT-KVRPE 74 M HL LP P + + ++ +++ A ++ L G D V+ N P+ + P Sbjct: 5 GMIHLPPLPNSPQYSGEK-IDVILEYAINEAEKLVEAGFDGVIIENYMDYPFPVYEKDPV 63 Query: 75 TTAAMARIIGQLMSDI-RIPFGVNVLWD-PVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 + I ++ + I G+N+L + + S D+A FIR ++ + G+ Sbjct: 64 KLGFIEYIARRIREEFPNILIGLNILRNSGLESIDIACRNNLDFIRVNVYMETVLAPEGI 123 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 E ++++ + VK ++ + + L N + + ++T D + VSG Sbjct: 124 IKPLAYEIMKYKMQ-KKCNVKIYADVNVKHSQPLMNYTM--VLRNTCSRGLVDGVIVSGE 180 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G T + + K V+ +GV +N+ + +AD + T+ K +G+ N V Sbjct: 181 HTGYATPVSRVYVAKRICNGKEVIVGSGVNYQNIGLYIGLADAVIVGTSIKNEGITTNPV 240 Query: 252 DQARVSQFMEKVHHIR 267 + + +E+V ++ Sbjct: 241 NLQKAMYLVERVKRVK 256 >UniRef50_O29828 Uncharacterized protein AF_0419 n=1 Tax=Archaeoglobus fulgidus RepID=Y419_ARCFU Length = 246 Score = 237 bits (604), Expect = 4e-61, Method: Composition-based stats. Identities = 78/257 (30%), Positives = 118/257 (45%), Gaps = 14/257 (5%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 EK VI + HL LPG P ++ VIDKA D A++ GG DA++ N P+L + Sbjct: 2 EKTVIGVVHLLPLPGSPEHT---DLSAVIDKAVKDARAIEEGGADALILENYGDKPFLKE 58 Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASD 128 V ET AAM I ++ D+ I G+NVL + VA+ +A A A F+R S Sbjct: 59 VGKETVAAMTVIACEVKRDVSIGLGINVLRNDAVAALAIAKAVNADFVRVNQLFFTSVSP 118 Query: 129 FGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV 188 G+ + GE +R++ + +I + AV+ + + + DA+ + Sbjct: 119 EGILEGKAGEVMRYKKLVD-CRAMIFADIAVKHAVHFA--SLEDYCLNA-ERSLADAVIL 174 Query: 189 SGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFA 248 +G T G LK K+T+ VLA +GV EN L DG + T K+ G+ Sbjct: 175 TGKTTGGEVSLEELKYAKKTL-KMPVLAGSGVNAENAARILKWCDGVIVGTYIKRGGL-- 231 Query: 249 NFVDQARVSQFMEKVHH 265 VD RV + + Sbjct: 232 --VDAERVRRIVRAAKG 246 >UniRef50_B8HZU1 Photosystem I assembly BtpA n=5 Tax=Clostridiales RepID=B8HZU1_CLOCE Length = 262 Score = 234 bits (596), Expect = 4e-60, Method: Composition-based stats. Identities = 68/260 (26%), Positives = 121/260 (46%), Gaps = 7/260 (2%) Query: 8 IGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY 67 + V+ M H ALPG P F M + D+A + + L+ G+DA++ N + Sbjct: 6 FKDKPIVMGMVHCLALPGTPDFCGD--MKKITDQAVKEAITLEKSGMDAIIIENMGDNVF 63 Query: 68 LTKVRPETTAAMARIIGQLMSDIRIPFGVNV-LWDPVASFDLAMATGAKFIR-EIFTGAY 125 + E + A+A I + ++ IP G++ + D + +A A GA F+R +F Sbjct: 64 GVNMDIEQSCALAAISAIVAQNVNIPIGIDAAMNDYKTALSIAKAIGADFVRIPVFVDTV 123 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEA-AVYLGNRDICSIAKSTVFNNHPD 184 G+ E ++ + I A VK +I + + L + I AK D Sbjct: 124 EFFGGIIQPCAREAMKFRKNIEAENVKIFADIQVKHTHMVLPHVSIEDSAK-AAEACGAD 182 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 A+ V+G G T ++KRVK+ + V+A +GV N++EQL IADG + ++ K+ Sbjct: 183 AIIVTGTHIGVETPIDIIKRVKKVI-SIPVIAGSGVKTNNIKEQLGIADGAIVGSSLKEG 241 Query: 245 GVFANFVDQARVSQFMEKVH 264 G N + ++ ++ ++ Sbjct: 242 GNIKNPISLELCTELIKALN 261 >UniRef50_C0ZRZ3 Putative uncharacterized protein n=2 Tax=Rhodococcus erythropolis RepID=C0ZRZ3_RHOE4 Length = 280 Score = 233 bits (595), Expect = 5e-60, Method: Composition-based stats. Identities = 61/268 (22%), Positives = 116/268 (43%), Gaps = 9/268 (3%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S L E+ ++I HL ALPG P + Q ++ + A ++ A + G D V+ N Sbjct: 14 SALAEMFTGTPSLIGAIHLPALPGSPHYTGQP-VSEIARFAVEEAHAYVDNGFDGVIVEN 72 Query: 62 EFSLPY--LTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS-FDLAMATGAKFIR 118 + +P+ + ET A+M I ++ + GV++L + A A GA F+R Sbjct: 73 HWDIPFLKPGEHGYETAASMGVITAAVVGEFGKAVGVSILSNAGECGVAAAWAAGASFVR 132 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY--LGNRDICSIAK 175 + AY ++ G + +T R +HRIGA V+ ++ + + + +R + + Sbjct: 133 VNQWANAYIANEGFIEGQAAKTTRFRHRIGADPVRIFADVHVKHGAHAIVADRTVAEQTE 192 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 F + D L +G G + +++ V+ +G+ NV + DG Sbjct: 193 DAEFFD-ADVLIATGSRTGDAASVDEVSVIRDNTV-LPVIIGSGITAANVAALMKECDGA 250 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKV 263 + A++ K +G + V +V + Sbjct: 251 IVASSVKDNGRWWGRVAGEKVRELSRAA 278 >UniRef50_Q96YL8 Putative uncharacterized protein ST2156 n=1 Tax=Sulfolobus tokodaii RepID=Q96YL8_SULTO Length = 250 Score = 231 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 59/258 (22%), Positives = 104/258 (40%), Gaps = 18/258 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL LPG + + ++D A ++ L+ GG DAV+ N P+ KVR Sbjct: 3 LIGVVHLPPLPGSFFYKGE--FEEIVDFAINESKKLEVGGFDAVILENFNDKPFRKKVRV 60 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWD-PVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 ET AM+ I ++ + G+N+L + + +A TG FIR +S G+ Sbjct: 61 ETAIAMSIIAREVKKSTSLLVGINLLRNSAYEAASIASLTG-DFIRVNALCETISSPEGI 119 Query: 132 WDTNV---GETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV 188 + E + + R ++ L +I + A L ++ S+ D + V Sbjct: 120 IEPASVEVQEVLYYTKR----KISILADINVKHASPLHQMNLESLLLDCKERGFADYIIV 175 Query: 189 SGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFA 248 +G G + ++K +K P V +G+ N+ + D + T K Sbjct: 176 TGERTGKEPNPEVVKMIKNISP-LPVCVGSGMTPNNIRDYK--VDCFIIGTYLKD---TD 229 Query: 249 NFVDQARVSQFMEKVHHI 266 + RV + V I Sbjct: 230 GKIRVERVKEIANAVKSI 247 >UniRef50_D2RQS6 Photosystem I assembly BtpA n=4 Tax=Halobacteriaceae RepID=D2RQS6_9EURY Length = 278 Score = 231 bits (589), Expect = 2e-59, Method: Composition-based stats. Identities = 78/270 (28%), Positives = 125/270 (46%), Gaps = 13/270 (4%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 ++ L+ ++ V+ M HL +PG P ++ + V D+A +D L+ GGVD ++ Sbjct: 4 ITPLRTRFDADRPVVGMVHLPPVPGAPGYEGDR--DAVRDRALEDARRLEAGGVDGIVLE 61 Query: 61 NEFSLPYLTKVRPE-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR 118 N P+ P+ A M + + + +P G+NVL D A+ +A A A+F+R Sbjct: 62 NFGDAPFYPDDVPKHVVAEMTAVATAVTDAVDVPLGINVLRNDADAALSIAAAVDAEFVR 121 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 + G A+D GV + ET+R + RI A +V L ++ + A +G+R I A Sbjct: 122 VNVHVGTAATDQGVLEGRAHETLRLRDRIDA-DVAILADVHVKHATPIGDRSIDRAALEA 180 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKET------VPDTVVLANTGVCLENVEEQLSI 231 V D + VSG G T ++RV T V +GV E V + L+ Sbjct: 181 VERGRADGVIVSGPGTGDETALEDVERVAAALDGAGTAGRTSVFVGSGVTSETVGDCLAA 240 Query: 232 -ADGCVTATTFKKDGVFANFVDQARVSQFM 260 ADG + T K+ G N V + RV + Sbjct: 241 GADGVIVGTALKEGGETTNPVSRERVKALV 270 >UniRef50_A2BLV0 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BLV0_HYPBU Length = 285 Score = 231 bits (589), Expect = 2e-59, Method: Composition-based stats. Identities = 61/260 (23%), Positives = 110/260 (42%), Gaps = 7/260 (2%) Query: 12 KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKV 71 K +I M HL P ++ ++ ++D A + L + G +AV+ N PY Sbjct: 18 KPLIGMIHLPPTPSY--VKDRVDIDRLVDYALWEAGKLADAGFNAVIIENYGDHPYTVTA 75 Query: 72 RPETTAAMARIIGQLMSDIR--IPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYAS 127 + A+ARI ++ + G+N+L + + + A+ +GA FIR + S Sbjct: 76 PSLSVLAIARIAAEVARTYSGKLRVGINILRNAAPQALEAALVSGASFIRVNSYCELRVS 135 Query: 128 DFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC 187 G+ R + + A V ++ + + L + I PDA+ Sbjct: 136 MEGILTPAAYIIERIREELRA-PVLVFADVDVKHSAPLATASLEQILHDCARRGRPDAII 194 Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVF 247 VSG G + +K VP ++ +G+ ++N+ +ADG + T+ K +G Sbjct: 195 VSGSATGEPPSPGYVASIKAMVPYKPIIIGSGISIDNIMAYWRVADGFIVGTSIKLNGKT 254 Query: 248 ANFVDQARVSQFMEKVHHIR 267 N VD+ R Q E V+ +R Sbjct: 255 LNPVDERRARQLAELVNELR 274 >UniRef50_UPI000155D1C1 PREDICTED: hypothetical protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155D1C1 Length = 261 Score = 230 bits (587), Expect = 4e-59, Method: Composition-based stats. Identities = 63/217 (29%), Positives = 104/217 (47%), Gaps = 6/217 (2%) Query: 55 DAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPV-ASFDLAMAT 112 D ++ N LPY PE TA M + + R+P GV VL + +A+A Sbjct: 46 DGLIVENMHDLPYTASAGPEVTATMTAVCAAVRMTCPRLPLGVQVLCSANQEAVAVALAA 105 Query: 113 GAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC 171 G FIR E F ++ +D G + G+ +R++ RIGA V+ +I + + + D+ Sbjct: 106 GCDFIRAEGFVFSHVADEGFVNACAGDLLRYRRRIGAEHVQIFADIKKKHSAHALTADVS 165 Query: 172 SIAKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLS 230 + D + ++G G D L V++ V + +L +GV LENV+ L+ Sbjct: 166 VSETAKAAEFFLADGVILTGPATGVEADPGELHEVEQAV-NIPLLIGSGVTLENVKSYLN 224 Query: 231 IADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 A+ + + FK+ G +AN +D RV FM+ V +R Sbjct: 225 -ANALIIGSYFKEGGYWANQIDPTRVKTFMDHVRKLR 260 >UniRef50_C5EEU2 Photosystem I assembly BtpA n=2 Tax=Clostridiales RepID=C5EEU2_9FIRM Length = 263 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 63/260 (24%), Positives = 110/260 (42%), Gaps = 8/260 (3%) Query: 9 GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYL 68 TEK V++M LPG + + ++ ++D+A + + D ++ N +P Sbjct: 6 RTEKVVLSMIQPEPLPGSYRH-SDMRIDAIVDRALRETEMVARNHFDGIIVQNMNDMPVK 64 Query: 69 TKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIR--EIFTGAY 125 + PE A M RI ++ + G+ + WD VA +A A GA F+R +FTGA Sbjct: 65 QQSSPEAIAYMTRIAYEIRKRFPELVMGILMNWDGVAGLCVADAVGADFVRVEHLFTGAS 124 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDA 185 + G+ + + + R G+ V ++ + LG + + A V D Sbjct: 125 VTSAGILEAQCVDIAGVRKRTGSK-VPVYADVYEVHGIPLGRKPVGDAAWECVHEAFADG 183 Query: 186 LCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDG 245 L +SG ++K + V DT + G +N+ E + DG AT K +G Sbjct: 184 LFMSGK--SVEESIRMIKEARPRVKDTPIFLGGGATGDNIHELMRYFDGVSVATWIK-NG 240 Query: 246 VFANFVDQARVSQFMEKVHH 265 N +D R +F+ + Sbjct: 241 DMKNPIDPERAKRFIAEAKR 260 >UniRef50_UPI0001C369E0 btpA family protein n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369E0 Length = 274 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 65/265 (24%), Positives = 112/265 (42%), Gaps = 10/265 (3%) Query: 9 GTEKAV-IAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPY 67 G + V +AM PG + + +ID + +++ ++ G D + N P Sbjct: 4 GKQFPVALAMIQPEPFPGSFRHEGK-SFEEIIDISLNEIEMIEANGFDGYIIQNRNDAPV 62 Query: 68 LTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIR--EIFTGA 124 PETTA M + + + G+ V WD VAS +A A G+ FIR +TG Sbjct: 63 RQHALPETTAYMTALARECRRRFPDMIQGILVDWDGVASLAVADAAGSDFIRVEHTYTGV 122 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 G+ + + + + RIG+ ++ ++ L + I A TV N D Sbjct: 123 EVGYAGMMEAQCVDICQFKKRIGS-DIPVYADVQEVHYEQLAGKSIVDNAWDTVMNAFAD 181 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPD-TVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 L + G + ++K V++ + + + ++G +N+ + L DG T K Sbjct: 182 GLFLGGKSC--EESIEIIKCVRKRLGERIPIFLSSGATGDNISKILQYYDGVSVGTWVK- 238 Query: 244 DGVFANFVDQARVSQFMEKVHHIRR 268 +G N +D R QFME V R+ Sbjct: 239 NGNMRNPIDPVRARQFMEGVKSARK 263 >UniRef50_C8S5Y9 Photosystem I assembly BtpA n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8S5Y9_FERPL Length = 249 Score = 213 bits (542), Expect = 6e-54, Method: Composition-based stats. Identities = 69/255 (27%), Positives = 119/255 (46%), Gaps = 16/255 (6%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 VI HL+ LPG P+F L ID A + + ++N G DA++ N P+ K P Sbjct: 3 VIVSLHLKPLPGSPNF---LNFEDCIDHAVRNAILIENCGADAIIIENFNDKPFFMKAPP 59 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR-EIFTGAYASDFGV 131 ET A+M+ I+ +++ ++ IP GVNVL D VA+ +A A GAKF+R A G Sbjct: 60 ETIASMSVIVREVIREVSIPVGVNVLRNDGVAALAIAKAAGAKFVRVNQMIFPAAMPEGF 119 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 + R++ + + K +I + +V L + + DA+ V+G Sbjct: 120 AKPIAAKMARYKRLLN-CDAKIFADISVKHSVQLAKI---EDFVDNIDRAYCDAVIVTGK 175 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G +++ L+++KE V D V+ +G EN+ + ADG + T K+ Sbjct: 176 KTGKPPEASTLRKIKELV-DVPVILGSGATPENLRKY--EADGVIVGTYVKEG----EEY 228 Query: 252 DQARVSQFMEKVHHI 266 ++ + + + + Sbjct: 229 SCEKLKRVVSEAKKL 243 >UniRef50_B9LR39 Photosystem I assembly BtpA n=7 Tax=cellular organisms RepID=B9LR39_HALLT Length = 275 Score = 212 bits (541), Expect = 7e-54, Method: Composition-based stats. Identities = 80/274 (29%), Positives = 120/274 (43%), Gaps = 11/274 (4%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSF--DAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 + GT+ VI M HL LPG P D M +D+A D AL GGVD +M N Sbjct: 3 FEATFGTDAPVIGMVHLPPLPGAPKAPADGVAAMRDALDRAAADARALDRGGVDGIMVEN 62 Query: 62 EFSLPYLTKVRPE-TTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIR- 118 P+ P+ A++ R + ++ +P G+NVL D A+ +A A A ++R Sbjct: 63 FGDAPFYPDDAPKHVVASVTRAATAITTETDLPLGINVLRNDAEAALSVAAAVDADYVRV 122 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL-GNRDICSIAKST 177 + TGA +D GV ET+R + R+G +V + + + L T Sbjct: 123 NVHTGARVTDQGVVQGKAHETLRLRDRLGV-DVGVFADTDVKHSAPLSAEGYTAESFADT 181 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVP----DTVVLANTGVCLENVEEQLSIAD 233 DA+ SG G D L+ V DT VL +GV + V + L++AD Sbjct: 182 AERGLADAVIASGRGTGEAMDPEALESVVADRDAHGLDTPVLVGSGVREDTVGDVLAVAD 241 Query: 234 GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 G + T K+ G VD RV+ + + +R Sbjct: 242 GAIVGTALKEGGETTAPVDADRVAALVARADEVR 275 >UniRef50_A5GQP5 Photosystem I biogenesis protein BtpA n=4 Tax=Bacteria RepID=A5GQP5_SYNR3 Length = 275 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 69/265 (26%), Positives = 115/265 (43%), Gaps = 7/265 (2%) Query: 6 EVIGTEKA-VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFS 64 + ++ +I + HL LPG P + V A D A GG D ++ N Sbjct: 9 SLFAHDRPALIGVLHLPPLPGSPRWQGD--FEAVRRFALADAAAYLAGGADGLVVENFGD 66 Query: 65 LPYLTKVRPE-TTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMA-TGAKFIR-EI 120 P+ P T AAMARI +++ +P G+NVL + + A +GA F+R + Sbjct: 67 APFFASAVPSHTVAAMARIAAEVVEAAAGVPVGINVLRNDAHAAMGIAAASGASFVRVNV 126 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 +GA +D G+ + E +R + ++ A EV +++ + A L + I + + Sbjct: 127 LSGAMLTDQGLIEGRAAELLRLRRQLEATEVGIFADVLVKHAYPLAPQPIGEAVEDCLGR 186 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 D + VSG+ G D L + VL +G N + ADG + A++ Sbjct: 187 AGADGVIVSGVATGAAPDPDDLAAARSAAGSAPVLIGSGCHAGNATSLGASADGVIVASS 246 Query: 241 FKKDGVFANFVDQARVSQFMEKVHH 265 K+D + AN VD RV + + Sbjct: 247 LKRDSLLANPVDPLRVQALRQTLQR 271 >UniRef50_C9XNH3 Putative uncharacterized protein n=7 Tax=Firmicutes RepID=C9XNH3_CLODC Length = 247 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 54/267 (20%), Positives = 101/267 (37%), Gaps = 29/267 (10%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 + V ++K +IAM HL+ + ++A ++ + GVD +M N + Sbjct: 6 ILSVFKSKKPIIAMIHLK----------GDTPEDIFERAKKEITIFEENGVDGIMLENYY 55 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTG 123 Y + E + +++ IP+GVN L F+LA A +I Sbjct: 56 GNYYDLERILEYVS---------KANLSIPYGVNCLNVDTMGFELATKYNASYI------ 100 Query: 124 AYASDFGVWDTNVGETIR--HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 S G T+ + + + + + L D+ K + Sbjct: 101 QVDSVVGHVKPRDEATLEEFFKLQRSKCPAYLIGGVRFKYQPVLSENDVEEDLKIGMTRC 160 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 DA+ V+ G T ++ ++ + D ++ GV LEN ++QL + D + + F Sbjct: 161 --DAIAVTENATGQETSMEKIELFRKNLGDFPLVIAAGVTLENAKKQLELGDMAIIGSYF 218 Query: 242 KKDGVFANFVDQARVSQFMEKVHHIRR 268 K + V V FM+++ IR Sbjct: 219 KDNYKDFGNVSVEHVKTFMDEIKKIRE 245 >UniRef50_Q2CH81 Putative uncharacterized protein n=1 Tax=Oceanicola granulosus HTCC2516 RepID=Q2CH81_9RHOB Length = 270 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 64/263 (24%), Positives = 106/263 (40%), Gaps = 6/263 (2%) Query: 1 MSWLKEVIGTE-KAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 MS L +++ + VI M L L G ++ + V++ A ++ L + G+D +M Sbjct: 1 MSRLLDMLARGGRPVIGMVQLPPLAGGANYGGAP-VGEVLEAALEEARVLADNGIDGLMV 59 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS-FDLAMATGAKFIR 118 N +P A M R ++ P G+N+L + V + F +A A GA F+R Sbjct: 60 QNLGDIPVAHAATAAQVAWMTRATVEIGRIAACPVGLNMLENDVDAMFAVASAAGADFVR 119 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST 177 ++F GA + FG+ R + G G++ L ++ L Sbjct: 120 IKVFVGAMVTPFGLEQGRAHAAARARRGCGGGDIAILADVHDRTGTPLATSGFEEDLDFA 179 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 + DA+ V+G + R + P VL GV EN EE + A G + Sbjct: 180 LRLGGADAVVVTGK--SHAATLDMAARARAAHPAAHVLLGGGVTAENFEETMENASGAIV 237 Query: 238 ATTFKKDGVFANFVDQARVSQFM 260 +++ K G RV FM Sbjct: 238 SSSMKDSGSAVGRFVPERVEAFM 260 >UniRef50_UPI000180CC4C PREDICTED: similar to F13E9.13 n=1 Tax=Ciona intestinalis RepID=UPI000180CC4C Length = 228 Score = 192 bits (489), Expect = 8e-48, Method: Composition-based stats. Identities = 61/272 (22%), Positives = 105/272 (38%), Gaps = 53/272 (19%) Query: 2 SWLKEVIG-TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 +V T +I M HL ALPG P Sbjct: 4 RKFVDVFKKTNGVIIGMLHLPALPGTP--------------------------------- 30 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIP--FGVNVLWDPVASFDLAMATGAKFIR 118 ++T ++A+I ++ + I G+ L+ +S + T FIR Sbjct: 31 -------------KSTMSVAKICDVVLKEAEIYTRAGLFKLFISPSSNLVLYLTDLDFIR 77 Query: 119 -EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC-SIAKS 176 E F ++ D G D+ +R++ +I A V +I + + + D S Sbjct: 78 AEGFVFSHIGDEGFIDSCAASLLRYRKQIEADHVLVFTDIKKKHSSHSITSDTSISETSR 137 Query: 177 TVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCV 236 D + V+G G+ TD +K V++ V VL +GV +NV++ + + Sbjct: 138 AAEFFLSDGVIVTGNETGSSTDLNQIKDVQDEVG-IPVLVGSGVTADNVDKYI-HTSALI 195 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + FK GV++N VD V +FM+KV + + Sbjct: 196 VGSHFKVGGVWSNPVDANLVQKFMKKVREMNK 227 >UniRef50_C2JNA9 Photosystem I biogenesis protein BtpA n=9 Tax=Enterococcus faecalis RepID=C2JNA9_ENTFA Length = 247 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 51/266 (19%), Positives = 96/266 (36%), Gaps = 29/266 (10%) Query: 4 LKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEF 63 E+ EK +I + HL+ + ++A ++ GVDA++ N + Sbjct: 7 FLELFAVEKPIIGVIHLK----------GKTDQEIQERAKKEIQIYSEHGVDAILMENYY 56 Query: 64 SLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF--IREIF 121 + + ++ D+ IP GVNVL F LA +F I + Sbjct: 57 GDYVQLEKALQYVTSL---------DLPIPIGVNVLNVDPLGFHLANKYHLQFLQIDSVV 107 Query: 122 TGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNN 181 D + R + K + + + L + + K Sbjct: 108 GHVKPRDEASLQA-FFDLYRAK-----TTAKLIGGVRFKYQPMLSEKSVEEDLKIAQQRC 161 Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 DA+ V+ G T +K ++ +P+ ++ G+ ++V+EQL+I D + + F Sbjct: 162 --DAIAVTENATGEETSLEKIKLFRKQLPEFPLIVAAGLNDKSVKEQLAICDAAIVGSNF 219 Query: 242 KKDGVFANFVDQARVSQFMEKVHHIR 267 K + V FM+ V +R Sbjct: 220 KDTRKDTGDIYAPYVDSFMKIVKELR 245 >UniRef50_C5EPH7 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EPH7_9FIRM Length = 273 Score = 187 bits (475), Expect = 3e-46, Method: Composition-based stats. Identities = 59/267 (22%), Positives = 107/267 (40%), Gaps = 14/267 (5%) Query: 2 SWLKEVI--GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 SW++ G + + HL G D + +W++++ + G+ ++M Sbjct: 18 SWIRRRYRMGKSCRITGVVHLPPF-GADRLDLEGLESWLLEQ----IGIHAECGITSMMI 72 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR- 118 ++ +A+ R + ++ D+ + + +P A+ +A A GA FIR Sbjct: 73 QDQTPGELAGLKNVAILSALGRTVKRMFPDLSLGIILEA-NNPSAAMYIANACGADFIRQ 131 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV 178 ++F GA GV GE + + V+ L +I V LG I A Sbjct: 132 KVFIGAMVKAGGVMTGRAGEVWEARKDMDR-PVRVLTDIYDRTGVPLGPLPI-ETAAGQA 189 Query: 179 FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTA 238 D L ++G D A RV++ P V G+ +NV E + DG + + Sbjct: 190 LKYGSDGLILTGKNFEESLDLAD--RVRKQYPQAPVYLGGGITEKNVGEAVKHCDGMIVS 247 Query: 239 TTFKKDGVFANFVDQARVSQFMEKVHH 265 + +DG N + ++ +FME V Sbjct: 248 SCLLEDGK-DNVWSRQKIRRFMECVCG 273 >UniRef50_A3MXM0 Photosystem I assembly BtpA n=5 Tax=Thermoproteaceae RepID=A3MXM0_PYRCJ Length = 242 Score = 187 bits (474), Expect = 4e-46, Method: Composition-based stats. Identities = 56/255 (21%), Positives = 99/255 (38%), Gaps = 27/255 (10%) Query: 14 VIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRP 73 +I + HL G P ++ A L+ G DAV+ N + +P+ K Sbjct: 2 LIGVVHLLPT-GSP---------QRLEHAVRSAKRLEEAGFDAVIVENYYDMPFKPKADF 51 Query: 74 ETTAAMARIIGQLMSDIRIPFGVNVLWDP-VASFDLAMATGAKFIR-EIFTGAYASDFGV 131 E AMA ++ ++ +P G+N+L + V + +A GA FIR +T S+ G+ Sbjct: 52 EAAVAMAVAAREVAREVSLPVGINLLRNACVKASIIARHVGATFIRCNAYTDIVLSESGI 111 Query: 132 WDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGL 191 VK L ++ + + R + ++ P A+ V+G Sbjct: 112 LTPQAPYI---------KGVKVLADVHVKHGESIYPRTLAEAVEAASTRAAPAAIVVTGR 162 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G D L + D VL +G+C + + L IADG + T K + Sbjct: 163 KTGEAPDPVDLATAR-AYTDLPVLVGSGICFQTLP-LLKIADGAIVGTCVKDGA----EI 216 Query: 252 DQARVSQFMEKVHHI 266 D + + + + + Sbjct: 217 DPEKARRLVREAKAV 231 >UniRef50_C2BTC6 Possible photosystem I biogenesis protein BtpA n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BTC6_9ACTO Length = 252 Score = 186 bits (472), Expect = 8e-46, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 104/285 (36%), Gaps = 56/285 (19%) Query: 1 MSWLKEVIG-----TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVD 55 M + ++G EK ++ M HL+ ++ + +A + GG D Sbjct: 2 MQTKQNLLGNWPGNGEKLLLGMIHLK----------GNEIDDIYSRAVRECDIYARGGFD 51 Query: 56 AVMFSNEFSL--------PYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFD 107 V+ N F P L P+ + GV+V+WD SFD Sbjct: 52 GVIVENYFGTIDDVRYCLPRLQDKFPQ-----------------LYVGVDVIWDNDKSFD 94 Query: 108 LAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVK----TLFNIVPEAA 162 LA+ FI + G E + + RI + L + + Sbjct: 95 LAVEHQLPFIELDSLAGQLPPQ---------EEPQFEERIRWCQENSPAVILGGVRLKNQ 145 Query: 163 VYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCL 222 L + D + V+G+ G T+ + + + +E + D +L GV Sbjct: 146 PVLSGNPLEVDLMLAKKRG--DGVIVTGVDTGVETELSKIIQFREIIGDFPLLVGAGVNE 203 Query: 223 ENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIR 267 +N EQL+IADG + ++ K+ G ++ RV + + V +R Sbjct: 204 KNCTEQLTIADGAIIGSSLKQGGNAKGDLEMDRVERLVTAVRALR 248 >UniRef50_Q16GL4 Putative uncharacterized protein n=2 Tax=Aedes aegypti RepID=Q16GL4_AEDAE Length = 191 Score = 185 bits (471), Expect = 9e-46, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 84/184 (45%), Gaps = 4/184 (2%) Query: 87 MSDIRIPFGVNVL-WDPVASFDLAMATGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQH 144 +I +P +VL + +A A FIR E F ++ +D G D N G+ +R++ Sbjct: 6 KDNISVPKQWHVLACGNEEALAVAKACNFDFIRAEGFVFSHVADEGFTDANAGQLLRYRR 65 Query: 145 RIGAGEVKTLFNIVPEAAVYLGNRDICSIAKS-TVFNNHPDALCVSGLTAGTRTDSALLK 203 I A ++ +I + + + DI + D + ++G + G + ++ Sbjct: 66 NIDAEHIQIFTDIKKKHSAHAITNDISLKETAKAAEFFRSDGIIITGASTGCEANVDDVE 125 Query: 204 RVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKV 263 + + ++ +G+ EN+ + +IAD + + FK++G + + + +V FM KV Sbjct: 126 SLVGE-TELPLIIGSGITAENLNKYWNIADAAIVGSHFKENGNWRGALSEVKVQAFMNKV 184 Query: 264 HHIR 267 + R Sbjct: 185 NGFR 188 >UniRef50_C2D712 Putative uncharacterized protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D712_9ACTN Length = 245 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 52/266 (19%), Positives = 90/266 (33%), Gaps = 26/266 (9%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS +K V M HL+ + V++ + GG+D V+ Sbjct: 1 MSNPYAKYFEQKRVFGMLHLK----------GESIPQVLECLKKEFDEYVKGGIDGVVVE 50 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-E 119 N + E AA+ + Q+ S I GVN L +LA A F++ + Sbjct: 51 NYY------NGCDEIIAALDYLHDQIGSQTLI--GVNCLRSESMGLELASAYKTDFVQLD 102 Query: 120 IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 G T+ + + + G L + + L + K Sbjct: 103 SVVGHVIPRDDATLTHFFKIWQ-EKYTG----MILGGVRFKKQPLLSENPLSEDLKIAQS 157 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTAT 239 H +CV+ G T + +E + D +L + G NV++ LS +G + + Sbjct: 158 RCHA--VCVTQAATGEETHLDKIISFREGLKDFPLLISAGATPTNVKKSLSYINGVIAGS 215 Query: 240 TFKKDGVFANFVDQARVSQFMEKVHH 265 FK + V V + + V Sbjct: 216 YFKDTYEVSGTVCSEHVRELVRAVKE 241 >UniRef50_A9FN23 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FN23_SORC5 Length = 268 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 65/274 (23%), Positives = 112/274 (40%), Gaps = 12/274 (4%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M + G KAV+ M HL LPG P F + +D A +AL GG D + Sbjct: 1 MRTFATL-GRRKAVLGMIHLAPLPGTP-FHEKGSFERTLDVAVQSAIALSEGGADGCLVQ 58 Query: 61 N-EFSLPYLTKVRPETTAAMARIIGQLMSDI--RIPFGVNVLWDPV-ASFDLAMATGAKF 116 E + P T AM I+ + GV ++ + + AS +A F Sbjct: 59 TVERVYGVKDESDPARTTAMGLIVDAIGRATGDDFQIGVQLMRNAIRASLAVAKVARGSF 118 Query: 117 IREI-FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIA 174 +R GA ++ G+ + N E + ++ +I A VK + ++ +LG + + +A Sbjct: 119 VRAGALVGATLTEHGLVEANPLEVMEYRDKIDAWGVKIIADVASTQFTWLGGAKPVAEVA 178 Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADG 234 + + DA VS A++ V+ PD +L N ++ ADG Sbjct: 179 RRAK-HVGADA--VSLGDPDEAKTLAMIASVRAAAPDLPILLAGHTNHANAARLMAAADG 235 Query: 235 CVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 ++ G + +D+ RV+ ++E V + R Sbjct: 236 AFVGACLEQGG-WGGRIDRDRVAAYVEIVRGLER 268 >UniRef50_Q166V4 Photosystem I biogenesis protein, putative n=2 Tax=Roseobacter RepID=Q166V4_ROSDO Length = 262 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 112/266 (42%), Gaps = 17/266 (6%) Query: 5 KEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFS 64 ++ + K VIA HL D + + L + W D A + G+ + ++ Sbjct: 3 LKLFDSNKPVIAALHLP----DFALNRHLSVAWYEDYAVANARVFAEAGIPWIKLQDQTK 58 Query: 65 LPYLTKVRPETT---AAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EI 120 + P+T A++AR+I + +R+ V DP A+ +A A+GA FIR ++ Sbjct: 59 T--AGQAAPDTLTLMASLARLIRSEVPQLRLGIIVEAH-DPGAALCVAHASGADFIRLKV 115 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 F G + G D E + + + ++ L +I A+ L + A + Sbjct: 116 FVGGAMTAQGPRDGLSAEVVAMRSELRRADIAILADIHDRTAMPLSSES-QPFAANWAVK 174 Query: 181 NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATT 240 + D L ++G A + + V+++ +L GV NV E ++ ADG + ++ Sbjct: 175 SGADGLVITG--ASFADTLSRISAVRDSGARRPILIGGGVTESNVHEAMAAADGVIVSSA 232 Query: 241 FKKDGVFANFV---DQARVSQFMEKV 263 + A+ V D +FM+ V Sbjct: 233 LMRRDAAADDVIQWDADLCKRFMDAV 258 >UniRef50_UPI000069ECFE UPI000069ECFE related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI000069ECFE Length = 162 Score = 168 bits (427), Expect = 1e-40, Method: Composition-based stats. Identities = 47/163 (28%), Positives = 82/163 (50%), Gaps = 6/163 (3%) Query: 3 WLKEVIGTEKAV-IAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 ++ GT K + I M H++ALPG P ++L + +I++A + +N G+D +M N Sbjct: 2 KFLQLFGTVKPIVIGMVHVKALPGTP--GSRLPVAQIIEEACHEAEIYKNAGIDGIMVEN 59 Query: 62 EFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVL-WDPVASFDLAMATGAKFIR- 118 +PY PE TA MA I + +P GV +L + +A+A G FIR Sbjct: 60 MHDIPYTFNTGPEITATMATICTAVKQACPHLPLGVQILSCANNQALAVALAAGLDFIRA 119 Query: 119 EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEA 161 E + ++ +D G + G+ +R++ IGA ++ +I + Sbjct: 120 EGYVFSHVADEGFVNACAGDLLRYRKAIGAEHIQIFADIKKKH 162 >UniRef50_C0C181 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0C181_9CLOT Length = 262 Score = 162 bits (411), Expect = 9e-39, Method: Composition-based stats. Identities = 49/267 (18%), Positives = 103/267 (38%), Gaps = 15/267 (5%) Query: 8 IGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF--SNEFSL 65 E +I H LP + + + + ++ G+D V N + Sbjct: 3 YAKEPVIIGAVH---LPYYGRNNPSQSVAEIEEYVMANVKVHYENGIDTVYIQDENLNTG 59 Query: 66 PYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR-EIFTGA 124 P L + TA++A+++ + +++ + D VA A A GA F+R ++F G Sbjct: 60 PALPE-TIALTASLAKMVKMEVPGVKLGLIMQAH-DGVAPIAAAAAAGADFVRIKVFAGT 117 Query: 125 YASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPD 184 G+ ++++ I + VK L ++ + + + +A + D Sbjct: 118 MYKAEGIRTGVGETAVQYRTMINS-PVKILADVHDREGIPMPGVPV-DMAIGWASHIGAD 175 Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKD 244 L ++G + L+ ++ VL V +N+ + L +G V +++ D Sbjct: 176 GLILTGHD--YKETMEYLETAEKMELGKPVLVGGSVSEDNIYDILDHCEGAVVSSSLMLD 233 Query: 245 GVFAN---FVDQARVSQFMEKVHHIRR 268 D ++ +F +KV H R+ Sbjct: 234 DPVPGSPLRWDAEKIRRFADKVRHYRK 260 >UniRef50_Q18HA0 Photosystem I biogenesis protein homolog n=1 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18HA0_HALWD Length = 223 Score = 136 bits (344), Expect = 6e-31, Method: Composition-based stats. Identities = 43/191 (22%), Positives = 81/191 (42%), Gaps = 8/191 (4%) Query: 49 LQNGGVDAVMFSNEFSLPYLTKVRPETTAAM-ARIIGQLMSDIRIPFGVNVLWDPVASFD 107 L+ G +DA++ N + P+ P+ T AM + +I + + +P V++L + + Sbjct: 34 LEAGSIDAILVKNLGNTPFHADDVPKHTVAMISALIKDIQRVVDVPISVDILRNDAEAAL 93 Query: 108 LAMA-TGAKFIR-EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL 165 A T A FIR + G +D G+ ET+R + + +V+ L ++ + + Sbjct: 94 SIAAATTASFIRAGVHVGTLVTDQGIVTRRAAETLRLRDHLR-TDVEILADVSVKHSAPA 152 Query: 166 GNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETV----PDTVVLANTGVC 221 R + + H D + SG+ G + D L V + V ++GV Sbjct: 153 AERPLTETITDIISREHADGIIASGVGTGHKIDCGHLNTVVDVRDSLETGIPVFVDSGVT 212 Query: 222 LENVEEQLSIA 232 LE + + S Sbjct: 213 LETIADIYSNV 223 >UniRef50_D2VTE2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VTE2_NAEGR Length = 318 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 38/261 (14%), Positives = 84/261 (32%), Gaps = 38/261 (14%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 + +V +K + + H+ W + A + L + VD + Sbjct: 72 IDKFYQVFKKKKVFLPVVHV----------------WDVAHALKNAKLLYDHHVDGLFLI 115 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIRE 119 N A I + + G+N+L + L F Sbjct: 116 NNN-----CSADILIDA-----IKSVRREFPDKWLGINILGISIRELFL-KIADLDFDGL 164 Query: 120 IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVK--TLFNIVPEAAVYLGNRDICSIAKST 177 A ++ + N+ E I+ ++G+ K I + +D+ + Sbjct: 165 WLDSAMITEESEFQ-NIAEFIQ--DQLGSMNFKGLYFGGIAFKYQ-RTVIKDLKKVID-- 218 Query: 178 VFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVT 237 + +++ D + SG G + LK+ E V + +GV +N+ + AD + Sbjct: 219 IASSYVDVILTSGEATGMQIKEEKLKKFTELVKCNPLGIASGVTNKNLITSIKHADVFIV 278 Query: 238 ATTFKKDGVFANFVDQARVSQ 258 T + + + ++ + Sbjct: 279 GTYIEH--YVTGDIVEEKLEE 297 >UniRef50_A6G6X7 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G6X7_9DELT Length = 243 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 95/269 (35%), Gaps = 36/269 (13%) Query: 2 SWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSN 61 S + +V + ++ + H P A+ A + GV V + Sbjct: 5 SRVHQVFRVPRVLLPVIH-------PIGHAE---------AISAVDVCVAAGVRGVFLID 48 Query: 62 EFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW-DPVASFDLAMATGAKFIREI 120 + +R E A+A + + G+N+L DPV + A+ A + + Sbjct: 49 QG-------MRVEEVLALAVEVHA--RHPGLWVGLNLLALDPVEALRGALERCAGRLDGL 99 Query: 121 FTG-AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVF 179 ++ A+ + + + + + + + + + + A+ V Sbjct: 100 WSDDAHVHEGSREQPRAQAFVDARRELDWDGL-YFGGVAFKYRRPVPDEQLAEAAR--VA 156 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETV---PDTVVLANTGVCLENVEEQLSIADGCV 236 + D +C SG G L R+++ + + +GV L+NV + L D + Sbjct: 157 AGYMDVVCTSGAGTGIAAHRDKLARMRQGLAGRDGAALALASGVTLDNVADYLDFTDAFL 216 Query: 237 TATTFKKDGVFANFVDQARVSQFMEKVHH 265 T +++ +D RV++ ++ Sbjct: 217 VGTGIERE---FGVLDPDRVARLQARIDA 242 >UniRef50_C5KQ75 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KQ75_9ALVE Length = 379 Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 65/214 (30%), Gaps = 18/214 (8%) Query: 40 DKAWDDLMALQNGGVDAVM----FSNEFSLPYLTKVRPETTAAMARIIGQLMSDI-RIPF 94 D+A G ++ N+ V P+ RII + I PF Sbjct: 27 DQAVQQARIAVASGAHGILLINQVENDDGSVTTLPVNPD----FTRIISAVRRAIGDKPF 82 Query: 95 -GVNVLWDPVASFDLAMATGAKF-IREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVK 152 GVN L A L + T I + D G + E R + + Sbjct: 83 LGVNCLA-MTADVALPLVTNDDCRIDAYWADDARIDEGRGVADQVEAERI-SSVRSAHSS 140 Query: 153 T---LFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETV 209 + + + D + D + SG G D + + ++ Sbjct: 141 IKFYFGGVAFKKQRVVAEEDWSKAV--ALATPFMDVVVTSGTATGVPADINKIIQFRQAA 198 Query: 210 PDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 + +GV EN+++ L D + AT K Sbjct: 199 DTNALAVGSGVTPENIDKYLPYVDCIIVATGVSK 232 >UniRef50_A8VXV8 Tryptophan synthase alpha chain n=2 Tax=Bacillus RepID=A8VXV8_9BACI Length = 266 Score = 103 bits (257), Expect = 7e-21, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 77/244 (31%), Gaps = 32/244 (13%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 D + + + LQ+ GV+A+ E+ +PY + A L + + + + Sbjct: 28 DLSVEIALMLQDAGVEAI----EWGVPYSDPLADGPVIQQA-GQRALKNGGSLTVSLQKM 82 Query: 100 WDPVASFDLAMATGAKFIREIFTGAY------ASDFGVWDTNVGETI-----RHQHRIGA 148 + A + ++ + + + D+G + + ++ Sbjct: 83 KEARAKGLTVPSVLFTYVNPVLSYGFTKLIEELKDYGFDGLLIPDLPYEESHEYRALCRE 142 Query: 149 GEVKTLFNIVPEAAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDSALLKRVKE 207 + + I + I K F + +L V+G + Sbjct: 143 KGISLIPLI-----APSSKSRVEKITKEADGFVYYVTSLGVTGTRESFSETLKEEINTVK 197 Query: 208 TVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK-----DGVFANFVDQAR----VS 257 + VLA G+ E+V+ ADG + + + + N ++ + Sbjct: 198 SFSKVPVLAGFGISTPEHVQYFQEHADGAIVGSALVRKIASLEDSLKNPEEKDAALNEIK 257 Query: 258 QFME 261 F++ Sbjct: 258 AFVQ 261 >UniRef50_B9T9A8 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9T9A8_RICCO Length = 225 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 67/236 (28%), Gaps = 32/236 (13%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLM-SDIRIPFGVN 97 D+A + + G V + + ++ GVN Sbjct: 14 TDQALRNAEIAFDAGCPGVFLISMDG----------EDELLGPAAKEVKGRWGGKLVGVN 63 Query: 98 VLW-DPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFN 156 L V + L + G G +++ G V + + + T F Sbjct: 64 YLSLSAVTALRLNLTHGLDLTWTDNAGVHSTGLGTLAHLVAD------ELKSAPEHTFF- 116 Query: 157 IVPEAAVYLGNRDICSIAKSTVFNNHPDALCV----SGLTAGTRTDSALLKRVKETVPDT 212 A + + LC+ SG G D ++ ++ + Sbjct: 117 ----GACGFKGQRAEPDTAAAAVMAA--GLCMLPTTSGSATGVAADLQKIRSIRAALGTG 170 Query: 213 VVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + +G+ ENV + + AT D NF +++ + K+ R Sbjct: 171 PLAVASGITPENVLDYAPYVSHFLVATGVSDDFYNFNF---EKLAVLVGKLRTFSR 223 >UniRef50_Q0FCT7 Adenine phosphoribosyltransferase n=3 Tax=Rhodobacterales RepID=Q0FCT7_9RHOB Length = 253 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 88/270 (32%), Gaps = 38/270 (14%) Query: 4 LKEVIGTEKAV-IAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNE 62 G++K + + + H+ ++ +++ G V N Sbjct: 6 FHLKFGSKKPIVLPVIHVL----------------DHEQTHNNISTAIVCGCPGVFLINH 49 Query: 63 FSLPYLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVA-SFDLAMATGAK--FIR 118 + + + II + ++ GVN L +F + ++ F+ Sbjct: 50 D---FEKEK-------LIPIIKSIRAEFPDYWIGVNFLAVTGEFAFPILGKMQSEGIFVD 99 Query: 119 EIFTGAYASDFGVWDTN---VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAK 175 + D + N + + G + + + ++ Sbjct: 100 GYWADDACIDERCAENNQLIAKKINIIRKESGWQGLYV-GGTAFKKQREVDPSMYAYSSR 158 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGC 235 NH D + SG+ G D + +E+ + + +G+ ENV+E + D Sbjct: 159 LAT--NHMDIVVTSGIATGHAADVNKINIFRESCGENTLAVASGITPENVKEYIKNVDLF 216 Query: 236 VTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + AT D F N +D ++++ M + + Sbjct: 217 MVATGINFDNDFYN-IDPNKLNRLMNVIKN 245 >UniRef50_C0QR82 Thiamine-phosphate pyrophosphorylase n=1 Tax=Persephonella marina EX-H1 RepID=THIE_PERMH Length = 209 Score = 79.2 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 62/184 (33%), Gaps = 32/184 (17%) Query: 80 ARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGET 139 A +I ++ IPF VN D+A+A A G + + Sbjct: 53 AVVIKKVCRKYDIPFIVN------DRIDIAIAVDAD-------GVHLGQDDLDVEVARRI 99 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS 199 + + IG K +++ ++ + S+ ++ DA+ Sbjct: 100 LGFEKIIGLSTKKI-EDVIKANSLPVDYIGFGSVFPTSTKE---DAVYAG---------L 146 Query: 200 ALLKRVKETVPDTVVLANTGVCLENVEEQLSIA--DGCVTATTFKKDGVFANFVDQARVS 257 LK V + VV G+ +N+ + L + V + FK D + N R+ Sbjct: 147 EKLKEVMKISVQPVVAIG-GINEKNLTDLLKTGCRNVAVVSAVFKDDNIKEN---TERLK 202 Query: 258 QFME 261 ME Sbjct: 203 NIME 206 >UniRef50_A6TM77 Tryptophan synthase alpha chain n=10 Tax=Clostridia RepID=TRPA_ALKMQ Length = 266 Score = 73.8 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 40/109 (36%), Gaps = 4/109 (3%) Query: 163 VYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV- 220 + I + + + V+G + + T + ++ G+ Sbjct: 155 APTSEDRMKEIVQDAEGFIYCVSSTGVTGKRNSLAGNLEGFMQQLRTYTEIPLVIGFGIS 214 Query: 221 CLENVEEQLSIADGCVTATTF--KKDGVFANFVDQARVSQFMEKVHHIR 267 E +++ +I DG + + K + + RVS+F+EK++ + Sbjct: 215 NSEMMDKLKNICDGFIIGSAVIEKIEAGLEDRSSVERVSKFIEKLYEFK 263 >UniRef50_Q4JTH5 L-lactate dehydrogenase n=6 Tax=Actinomycetales RepID=Q4JTH5_CORJK Length = 425 Score = 67.2 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 46/134 (34%), Gaps = 10/134 (7%) Query: 45 DLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVA 104 D L + GVD ++ SN P + + ++ +D+ + ++ Sbjct: 285 DSKKLADLGVDGIILSNHGGR--QLDRAPVPFQLLPEVAREVGNDVDVAMDTGIMNGADI 342 Query: 105 SFDLAMATGAKFI---REIFTGAYASDFGVWDTNVGETI--RHQHRIGAGEVKTLFNIVP 159 +A GAKF R G A + E + + + +V +L + P Sbjct: 343 VAAIAK--GAKFTLIGRAYLYGLMAGGEAGVN-RAIEILASEVRRTMRLLQVSSLDELTP 399 Query: 160 EAAVYLGNRDICSI 173 E L ++ + Sbjct: 400 EHVTQLNTLNLNQV 413 >UniRef50_Q7M877 Tryptophan synthase alpha chain n=9 Tax=Epsilonproteobacteria RepID=TRPA_WOLSU Length = 255 Score = 66.4 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 126 ASDFGVWDTN--VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKST-VFNNH 182 G+ + E+ + + + + I P LG I +IA F Sbjct: 111 LGVSGLIIPDVPFEESAPFEEQCLQNNLALIRFIAPT----LGTSRIATIAPMARKFIYL 166 Query: 183 PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFK 242 ++G L++ ++ P+ + GV N +E+ DG + + Sbjct: 167 VAYAGITGSGREEPLSP-LIEEIRAINPEIPLYLGFGVNEHNAKEKSKEVDGVIVGSALV 225 Query: 243 K 243 K Sbjct: 226 K 226 >UniRef50_Q3ABS4 Tryptophan synthase alpha chain n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=TRPA_CARHZ Length = 267 Score = 63.0 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 45/281 (16%), Positives = 98/281 (34%), Gaps = 31/281 (11%) Query: 1 MSWLKEVI-----GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMA----LQN 51 MS + +V EKA+IA + GDP+ L + + A DL+ + Sbjct: 1 MSRIGQVFAEKRSRGEKALIA----YTMGGDPNLTFSLEIIKTLAAAGADLIEVGLPFSD 56 Query: 52 GGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMA 111 D + PE A+ I ++ +P + +P+ + Sbjct: 57 PLADGPVIQRAGQRALAAGSGPEEVLAL---IAAARQELSLPLVIMSYLNPILQIGV--- 110 Query: 112 TGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC 171 +F+R A A G+ ++ + R+ A +++P A G + + Sbjct: 111 --DEFLRRA---AGAGADGLIIPDLPVEEGEEIRVSAAGYGL--DLIPLVAPTTGQKRLE 163 Query: 172 SIAKSTVFNNHPDAL-CVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQL 229 I + ++ V+G + L + + + + V G+ E + Sbjct: 164 KIVGQASGFIYCVSVTGVTGARDSLPAEVISLLQNVKKLTELPVCLGFGIGKPEQIAYIK 223 Query: 230 SIADGCVTATTFKK--DGVFANFVDQARVSQFME-KVHHIR 267 DG + + + + N +++ +V + + KV ++ Sbjct: 224 DYCDGVIVGSALVEIIENYVQNRMEKDKVLELIATKVQTLK 264 >UniRef50_A9A2A1 Tryptophan synthase alpha chain n=3 Tax=Thaumarchaeota RepID=A9A2A1_NITMS Length = 268 Score = 61.4 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 42/272 (15%), Positives = 84/272 (30%), Gaps = 66/272 (24%) Query: 1 MSWLKEVI-----GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVD 55 MS +KE EKA+I+ + G P+ + + + + L GGVD Sbjct: 1 MSKIKEKFAELQTRKEKALISYI----MAGFPNEKSTMSV----------VRGLVKGGVD 46 Query: 56 AVMFSNEFSLPYLTKVRPETTAAMA---------------RIIGQLMSDIRIPF------ 94 + E P+ + A ++ ++ + IP Sbjct: 47 II----ELGFPFSDPLADGPVIQNASTISLEKGTKIDKFFALVKKIRKETDIPLVLMTYT 102 Query: 95 GVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTL 154 + A +G G ++ + + A Sbjct: 103 NILYHKGYSKFIAEAKKSGID--------------GFILPDMS-IEESKDYLKAARKN-- 145 Query: 155 FNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-VSGLTAGTRT-DSALLKRVKETV-PD 211 + + + I IAK++ + A+ +G+ G + +K VK+ Sbjct: 146 ADTIFLISPNTNKTRIQKIAKASSGFLYLVAVYGTTGVKTGIKNYTIDAIKNVKKQTKGK 205 Query: 212 TVVLANTGV-CLENVEEQLSI-ADGCVTATTF 241 V GV ++V++ + ADG + + F Sbjct: 206 IPVGVGFGVSTPDDVKKYIKAGADGVIVGSAF 237 >UniRef50_A3TLL5 Tryptophan synthase alpha chain n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TLL5_9MICO Length = 270 Score = 59.5 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 19/100 (19%), Positives = 33/100 (33%), Gaps = 6/100 (6%) Query: 162 AVYLGNRDICSIAKSTVFNNHPDA-LCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV 220 A + + + + + + V+G + + L + V D V GV Sbjct: 156 APSSTPERLSRVTHACRGFVYAASTMGVTGERTSVGSSARELVDRTKDVTDLPVCVGLGV 215 Query: 221 -CLENVEEQLSIADGCVTATTFKK----DGVFANFVDQAR 255 + E ADG + T F + DG +D R Sbjct: 216 SNGDQAAELAQYADGVIVGTAFVRTLSGDGELGARLDALR 255 >UniRef50_C0ZCE6 Tryptophan synthase alpha chain n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZCE6_BREBN Length = 272 Score = 59.5 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 32/246 (13%), Positives = 72/246 (29%), Gaps = 30/246 (12%) Query: 36 NWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFG 95 + I+ + + A+ G D + E +PY + T A L + + I Sbjct: 28 DPTIEATFHLVKAMVEAGADLI----ELGVPYSDPLADGPTIQRASE-RALKNGVTIGDA 82 Query: 96 VNVLWDPVASFDLAMATGAKFIREIF------TGAYASDFGVWDTNVGETIRHQH----- 144 + ++ S A + + A + +G + + + Sbjct: 83 LQLVKRLRESGMEACIVLFTYFNPVLQYGIERFFADLAAYGADGVVIPDLPIEESGPAVT 142 Query: 145 RIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKR 204 + + + P ++ + A++T F +L V+G R D A Sbjct: 143 AAKQNGIHVISLVAPTSSSRINTIG----AQATGFLYCVSSLGVTGARTDLREDLADFLE 198 Query: 205 VKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKD---------GVFANFVDQA 254 + G+ + V ADG + + + + V Sbjct: 199 RVKASTSVPTAVGFGISTPDQVRTVAPHADGVIVGSAIVQQIEEHAEQLKDLKQMPVAVE 258 Query: 255 RVSQFM 260 ++ F+ Sbjct: 259 KIKTFV 264 >UniRef50_Q5KXV2 Tryptophan synthase alpha chain n=6 Tax=Bacillaceae RepID=TRPA_GEOKA Length = 268 Score = 59.1 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 69/243 (28%), Gaps = 41/243 (16%) Query: 21 RALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMA 80 L + ID A AL+ G D + +S P + AA A Sbjct: 7 PPLFIPFIVAGDPAPDVTIDLAL----ALEEAGADILELGVPYSDPLADGPTIQRAAARA 62 Query: 81 R-----------IIGQLMSD-IRIPFGVNVLWDPVASF------DLAMATGAKFIREIFT 122 +IG++ + IP + ++PV LA GA Sbjct: 63 LDGGMTLPKAIQLIGEMRKKGVNIPIILFTYYNPVLQLGEESFFALARENGAD------- 115 Query: 123 GAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH 182 G D ++ G R G + + + I IA + + Sbjct: 116 GVLIPDLPFEES--GPLRELGERFGLPLISLVA--------PTSKQRIERIASAAQGFLY 165 Query: 183 P-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATT 240 +L V+G+ R + V G+ E V + DG V + Sbjct: 166 CVSSLGVTGVRETLPETLGDFLREVKRHSRVPVAVGFGISAPEQVAMLKEVCDGVVVGSA 225 Query: 241 FKK 243 + Sbjct: 226 LVQ 228 >UniRef50_D1BQN8 Tryptophan synthase alpha chain n=17 Tax=cellular organisms RepID=D1BQN8_VEIPT Length = 263 Score = 58.7 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 72/263 (27%), Gaps = 49/263 (18%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS +K+ KA I G+ + + G D V Sbjct: 1 MSKIKDAFTKGKAFIPFI----------SAGDHGIENTERY----IRIMVKAGADMVEI- 45 Query: 61 NEFSLPYLTKV--RPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIR 118 +P+ P A R + + I V L + + + ++ Sbjct: 46 ---GIPFSDPTAEGPVIQEASTRALSTGVKINDIFDMVRCLRTGEEAVTVPLVF-MTYLN 101 Query: 119 EIFTGAY---------ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAV------ 163 I+ GV + + E L ++ + V Sbjct: 102 PIYVFGREKFFTLCEEVGISGVIVPD----------MPFEEKGELASVAHKHGVEVVSLI 151 Query: 164 -YLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC 221 I IAK + +L V+G+ + +TD + D V G+ Sbjct: 152 APTSENRIEMIAKDAEGFVYCVSSLGVTGMRSEIKTDIKSIVETIRKYTDIPVAVGFGIS 211 Query: 222 -LENVEEQLSIADGCVTATTFKK 243 E E ++DG + + K Sbjct: 212 KPEQAETMARVSDGAIVGSAIVK 234 >UniRef50_Q2LUE0 Tryptophan synthase alpha chain n=1 Tax=Syntrophus aciditrophicus SB RepID=TRPA_SYNAS Length = 265 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 82/243 (33%), Gaps = 24/243 (9%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 +DK + L+ L+ GGVD + +P+ A L + + +++ Sbjct: 30 LDKTREILVGLKEGGVDILEI----GVPFSDPTADGPVIQ-AAAQRALKTGTTLSRILDM 84 Query: 99 LWDPVASFDLAMATGAKFIREIFTGA---------YASDFGVWDTNVG--ETIRHQHRIG 147 + D DL + I+ A G+ ++ E + + Sbjct: 85 IQDLRKIIDL-PVVLFGYYNPIYAYGTERFAERAKAAGVDGLLVVDLPLEEAEELRGKTD 143 Query: 148 AGEVKTLFNIVPEAAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDSALLKRVK 206 + + + I P + +C IA+ F + V+G +R + R Sbjct: 144 SKGLDFITLIAPTTS----EERMCRIARRAQGFIYYISITGVTGTATPSRDNVEREIRRI 199 Query: 207 ETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 T D ++ G+ E E S+ADG V + F + + N ++ Sbjct: 200 RTHSDLPLVVGFGISTPEQARELASLADGIVIGSAFVR-LIAENADSPELAARVSSFARE 258 Query: 266 IRR 268 I++ Sbjct: 259 IKK 261 >UniRef50_B0NZY9 Tryptophan synthase alpha chain n=2 Tax=Clostridiales RepID=B0NZY9_9CLOT Length = 262 Score = 57.6 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 34/106 (32%), Gaps = 2/106 (1%) Query: 163 VYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC 221 + I IAK + ++ V+G + TD V + G+ Sbjct: 154 TPTSHDRIAMIAKEAEGFLYCVSSIGVTGTRSEFTTDFDEFFGVIKKNATIPCAVGFGIS 213 Query: 222 -LENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 E ++ + DG + + K +V +F + + + Sbjct: 214 GPEQAKKMSTYCDGVIVGSAIVKLISQYGKESPEKVYEFTKSLRDV 259 >UniRef50_C7N999 Tryptophan synthase alpha chain n=2 Tax=Leptotrichia RepID=C7N999_LEPBD Length = 257 Score = 57.6 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 92/274 (33%), Gaps = 35/274 (12%) Query: 2 SWLKEVIG-TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDK-AWDDLMA---LQNGGVDA 56 + ++ EK I + G PS D +D A D L + D Sbjct: 8 KKIIDIFREKEKVNIGYI----VAGYPSVDFTKQFLQNLDNTALDMLEVGIPYSDPIADG 63 Query: 57 VMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAK- 115 + S +L TT + ++ ++ +DI P + ++ L A G Sbjct: 64 KLIS---QASFLASEAGVTTDTVFDLLTEIKNDISKPLIFLIYYN------LIFAYGIDE 114 Query: 116 FIREIFTGAYASDFGVWDTNVG--ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 FI++ A G+ ++ E ++ + + + + + I Sbjct: 115 FIKKC-CEANVK--GIIIPDLPYEEAFEMSEKLRENNIALIPLVSVTSGNRMKKI----I 167 Query: 174 AKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIA 232 ++ F +L V+G V D V G+ +NV A Sbjct: 168 SQGDGFIYAIGSLGVTGSKQVDLPRLESFINEIREVSDLPVSLGFGIKNNDNVNTMRKYA 227 Query: 233 DGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 DG + T+ + F+++ V+ ++K++ + Sbjct: 228 DGVIVGTSIVE------FLEKNDVNYLIQKINEL 255 >UniRef50_Q67PJ3 Tryptophan synthase alpha chain n=3 Tax=Clostridia RepID=TRPA_SYMTH Length = 279 Score = 56.8 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 81/234 (34%), Gaps = 22/234 (9%) Query: 46 LMALQNGGVDAVMFSNEFSLPYLTKV--RPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 + AL GVD V E LP+ + P AA R + + + V L Sbjct: 37 VHALAEAGVDMV----ELGLPFSDPLADGPVIQAASQRALAAGANTDNVLELVAALRADG 92 Query: 104 ASFDLAMATGAKFI-REIFTG-----AYASDFGVWDTNVG--ETIRHQHRIGAGEVKTLF 155 L + T + R A A G+ +V E+ + GA + + Sbjct: 93 LQIPLLIMTYYNLVLRPGVENFCHRAAAAGVDGLILPDVPVEESDEIRAAAGAVGLDLIQ 152 Query: 156 NIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVL 215 + P + R +A+ ++ V+G+ + + V DT V Sbjct: 153 FVAP-TSPPERIRRAAELARGFIYAVSSTG--VTGVRDRLPPQLTAMVEAVKAVTDTPVA 209 Query: 216 ANTGVC-LENVEEQLSIADGCVTATTFKK---DGVFANFVDQARVSQFMEKVHH 265 G+ E V + ++AD + + F + +G+ + + RV E++ Sbjct: 210 VGFGISRPEQVRQVTAVADAAIVGSAFVRHCGEGLPEHEL-VERVRILAEELKA 262 >UniRef50_A5IKT4 Tryptophan synthase alpha chain n=5 Tax=Thermotogaceae RepID=TRPA_THEP1 Length = 239 Score = 56.8 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 31/78 (39%), Gaps = 7/78 (8%) Query: 193 AGTRTDS---ALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFA 248 G R D +KRVKE + + G+ E VE+ IADG + + + + Sbjct: 162 TGEREDLPFADHIKRVKERI-KLPLFVGFGISRHEQVEKVWEIADGAIVGSALVR--IME 218 Query: 249 NFVDQARVSQFMEKVHHI 266 + +EKV + Sbjct: 219 ESPKDEIPKKVVEKVKEL 236 >UniRef50_B0TFQ0 Tryptophan synthase alpha chain n=2 Tax=Heliobacteriaceae RepID=B0TFQ0_HELMI Length = 271 Score = 56.8 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 59/190 (31%), Gaps = 22/190 (11%) Query: 64 SLPYLTKVRPETTAAMA------RIIGQLMSDIRIPFGVNVLWDPVASFDL---AMATGA 114 P + + A ++ L + IP + ++PV F L A A Sbjct: 66 DGPVIQEAAVRALAGGTTLTKVLAMVRTLRRETSIPIVLLTYYNPVLRFGLLRFAREAAA 125 Query: 115 KFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIA 174 + + ++ G + + + + + P + + I A Sbjct: 126 SGVDGVIVADLPAEEGGV---------LREPLDELGLALIPLVAP-TSTPERIQRIAEKA 175 Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIAD 233 + F L V+G+ + D+A L + D + G+ E+V D Sbjct: 176 RG--FIYCVSLLGVTGMRSDLPPDAAALLERVRAMTDVPLALGFGISRAEHVAIVAPNCD 233 Query: 234 GCVTATTFKK 243 G + + K Sbjct: 234 GVIVGSAIVK 243 >UniRef50_C6J450 Tryptophan synthase alpha chain n=2 Tax=Bacillales RepID=C6J450_9BACL Length = 273 Score = 56.4 bits (135), Expect = 9e-07, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 31/108 (28%), Gaps = 7/108 (6%) Query: 138 ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTR 196 E A V+ + + + I I + + +L V+G Sbjct: 140 EAEEVLKLADAAGVRLIPLV-----APTSSGRIARILERARGFVYCVSSLGVTGERTSFH 194 Query: 197 TDSALLKRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTATTFKK 243 + D V G+ E VE +I DG V + + Sbjct: 195 ASVDEFIASVKAQTDLPVAVGFGISSREQVERFAAICDGAVVGSAIVR 242 >UniRef50_Q3Z6G1 Tryptophan synthase alpha chain n=5 Tax=Dehalococcoides RepID=TRPA_DEHE1 Length = 255 Score = 55.7 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 90/271 (33%), Gaps = 25/271 (9%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMAL----QNGGVDA 56 MS + + K++IA + G P + L + ++++ D++ L + D Sbjct: 1 MSRISDAFQKRKSLIAYITV----GYPDIETTLRLVPLLEENGVDIIELGIPFSDPLADG 56 Query: 57 VMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF 116 V N V PE ++A + L I IP ++P+ ++ L K Sbjct: 57 VTIQNASYQALQNGVTPEVCLSVAAL---LKEKISIPMVFMGYYNPIYNYGLTKFCQ-KC 112 Query: 117 IREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI-AK 175 +G D + + + + +I+ A + I + AK Sbjct: 113 ATAGVSGFIIPDLPPGEAQDIDFAATEAGL---------DIIFLLAPTSTDERIKLVAAK 163 Query: 176 STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADG 234 S F V+G TA D + + G+ E + +DG Sbjct: 164 SRGFIYLVSHSGVTGATANLPADLSSFVNRVRKTARQPLAVGFGISTPEQAQNIAKFSDG 223 Query: 235 CVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + + + + +V+ F+ ++ Sbjct: 224 IIVGSRILQ--LVQTDPSLEKVATFIRQLRQ 252 >UniRef50_C6PE99 Tryptophan synthase alpha chain n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PE99_CLOTS Length = 275 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 36/294 (12%), Positives = 84/294 (28%), Gaps = 63/294 (21%) Query: 1 MSWLKEVIG--TEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVM 58 M+ + + +K + AM ++ ++ + ++ GG D + Sbjct: 14 MNRIDKKFYELKQKGLKAMI-------PFITAGDPSLDVTVELVFK----MEEGGADIIE 62 Query: 59 FSNEFSLPYLTKV--RPETTAAMARI-------------IGQLMSDIRIPFGVNVLWDPV 103 +PY + P A+ R + ++ IP V ++ + Sbjct: 63 I----GIPYSDPLADGPIIQASSTRALKNGTKINNIMNAVKKIRQKSEIPLVYLVYYNSI 118 Query: 104 ASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGET-IRHQHRIGAGEVKTLFNIVPEAA 162 + + +F+ E A + G+ + + + + I I + Sbjct: 119 FKYGI-----ERFVNE------AKESGIDGLIIPDLPLEERKDIK--------EISEKYG 159 Query: 163 V---YLGNRDICSIAKSTVFNNHPDALCV-----SGLTAGTRTDSALLKRVKETVPDTVV 214 + L KS + CV +G+ TD R + Sbjct: 160 IYLIPLVAPTSKERIKSICESGKGFVYCVSTKGVTGIRNSIETDIKEYMRTVSEYTNMPK 219 Query: 215 LANTGVC-LENVEEQLSIADGCVTATTFKK--DGVFANFVDQARVSQFMEKVHH 265 G+ + + DG + + K + + V +F+ + Sbjct: 220 AIGFGISGPDMAKRFAPYCDGIIVGSAIVKMINDSRSKEEIYDNVKKFVFSIKE 273 >UniRef50_D2RNR7 Tryptophan synthase, alpha subunit n=15 Tax=cellular organisms RepID=D2RNR7_ACIFE Length = 260 Score = 55.3 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 37/105 (35%), Gaps = 2/105 (1%) Query: 163 VYLGNRDICSIAKSTVFN-NHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV- 220 ++ I IAK +L V+G+ T+ + V D + G+ Sbjct: 150 APTSHQRIQMIAKEAQGYIYLVSSLGVTGVRKEITTNLKEIVEKIREVSDKPIAVGFGIS 209 Query: 221 CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 E E +I+DG + + K V Q+++K+ Sbjct: 210 TPEQAREMAAISDGAIVGSAIVKLCAQYGKDCVEPVKQYVKKMAE 254 >UniRef50_Q8G691 Tryptophan synthase alpha chain n=30 Tax=Actinobacteridae RepID=TRPA_BIFLO Length = 291 Score = 54.5 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 71/222 (31%), Gaps = 38/222 (17%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 D + D + GVDAV +S P + + A++A G+ + + Sbjct: 51 DVSLDAFKTMVEHGVDAVEIGLPYSDPVMDGPVIQAAASIALNNGETIKRV--------- 101 Query: 100 WDPVASFDLAMATGAKFIREIFTGAY-------------ASDFGVWDTN-----VGETIR 141 ++ V + +A A G I + Y A G+ + GE I Sbjct: 102 FEAVET--VANAGGVPLIMSYWNLVYHYGVERFARDFENAGGAGLITPDLIPDEAGEWIE 159 Query: 142 HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDA-LCVSGLTAGTRTDSA 200 R G + + + + ++A++ + A + V+G A Sbjct: 160 ASDRHGLDRIFLV-------SPDSSTERLETVARNARGFVYAAARMGVTGERATIDASPE 212 Query: 201 LLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 LL V GV E + S ADG + + Sbjct: 213 LLVERTRQAGAENVCVGIGVSTAEQGAKVGSYADGVIVGSAL 254 >UniRef50_C0EA61 Tryptophan synthase alpha chain n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EA61_9CLOT Length = 264 Score = 54.5 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 57/191 (29%), Gaps = 27/191 (14%) Query: 83 IGQLMSDIRIPFGVNVLWDP------VASFDLAMATGAKFIREIFTGAYASDFGVWDTNV 136 + QL + +IP + ++ A F G G D +++ Sbjct: 85 VTQLREETQIPLVFLMYYNSLLHYGQDAFFARCKEAGID-------GVILPDLPFEESD- 136 Query: 137 GETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGT 195 E + R G ++ + + + I + +L V+G+ + Sbjct: 137 -EISEYTERYGVYQISLVA--------PTSSERLQQITAKAKGFLYCVSSLGVTGMRSEI 187 Query: 196 RTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVD-- 252 RTD A + G+ E DG + + A D Sbjct: 188 RTDLAQFFAQIDRCCTIPTCIGFGISTPEQAAAVKQYCDGVIIGSAIVNRIGTAQTPDRA 247 Query: 253 QARVSQFMEKV 263 VS+F+ +V Sbjct: 248 VESVSEFVRQV 258 >UniRef50_A9KL40 Tryptophan synthase alpha chain n=13 Tax=Firmicutes RepID=A9KL40_CLOPH Length = 257 Score = 54.5 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 87/276 (31%), Gaps = 34/276 (12%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQ------NGGV 54 M+ +++ KA+I GDP ++ ++ A + ++ + Sbjct: 1 MTRIEQAFTKGKALITFI----TGGDP--GIEVTEELIVSMAMEGADLIEIGIPFSDPIA 54 Query: 55 DAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGA 114 + + TT + +IG++ S + +P +P+ GA Sbjct: 55 EGPIIQEANER---ALKAGATTDLLFDMIGRVRSKVLVPLVFMTYINPI------YTYGA 105 Query: 115 K-FIREIFTGAYASDFGVWDTNVGETIRHQHR--IGAGEVKTLFNIVPEAAVYLGNRDIC 171 F+R + G+ V + + + ++ + I I Sbjct: 106 DRFLR------RCKEVGIDGVIVPDLPYEEKEELLPLCKLHDITLISM--IAPTSKERIQ 157 Query: 172 SIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQL 229 +I K + +L V+G+ + + + + V D G+ E E Sbjct: 158 TILKEAEGFLYCVSSLGVTGVRTEIGSQVDYMIKEAKKVTDIPCAVGFGISTPEQAREMA 217 Query: 230 SIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 I+DG + + V+ F++ + Sbjct: 218 EISDGVIVGSAIVDIIAKHQEQCVQPVTDFIKVLKK 253 >UniRef50_Q1ISI7 Tryptophan synthase alpha chain n=2 Tax=Acidobacteria RepID=Q1ISI7_ACIBL Length = 261 Score = 54.5 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 68/243 (27%), Gaps = 33/243 (13%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR---------------II 83 +D A ++A G D + E +P+ V A + Sbjct: 22 VDTAIRIILAAVEAGADVI----ELGVPFSDPVADGPVIQAASERALHGGTSVKDVLHVA 77 Query: 84 GQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQ 143 ++ V +P+ + L A G +D V + G + Sbjct: 78 REVRRHTDAGLIVFSYMNPLLRYGLEKFC-ADAANAGVDGVLVTDLPVEE--AGPYLAAM 134 Query: 144 HRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALL 202 + + + + + IA+ + + V+G +D+ L Sbjct: 135 SKHNLDPIFLVA-------PTSPDARLKLIAEHSRGFVYAVSRTGVTGTRQEVASDARDL 187 Query: 203 KRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANF-VDQARVSQFM 260 + + GV E E AD V + + VF N + V F+ Sbjct: 188 VKRLRQFTKLPIAVGFGVSNAEQFAEVGRFADAAVIGSAIMQR-VFDNPGSEPEAVKAFL 246 Query: 261 EKV 263 + Sbjct: 247 RGL 249 >UniRef50_B0SDM7 Tryptophan synthase alpha chain n=6 Tax=Leptospira RepID=TRPA_LEPBA Length = 266 Score = 54.1 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 42/290 (14%), Positives = 84/290 (28%), Gaps = 51/290 (17%) Query: 1 MSWLKEVIGTEK---AVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAV 57 MS +KE+ + K A I L GDP+++ + I +GG D Sbjct: 1 MSKIKELFESGKFKSAFIPYFTL----GDPNYNDSIEFGKTI----------LDGGAD-- 44 Query: 58 MFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIR------IPFGVNVLWDPVASFDLAM- 110 + +P+ V A + L + + +++ L Sbjct: 45 ILE--LGIPFSDPVADGPVIQRA-VARSLKNKFSFDEIFRVTKQIHLHKQETPLVYLTYF 101 Query: 111 ----ATG-AKFIRE----IFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEA 161 G KF+ G D + + ++ + + Sbjct: 102 NPIYHCGITKFLDNAKDSGVVGLVIPDLPFDTIESETLFQ---ELRLRDMDLIHLV---- 154 Query: 162 AVYLGNRDICSIAK--STVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTG 219 + + ST F + + V+G D R + + A G Sbjct: 155 -TPASTKKRIEALRKTSTGFIYYVTSFGVTGERREFSVDLKERIRFLKDTIQLPICAGFG 213 Query: 220 V-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQ-FMEKVHHIR 267 + E + ADG + + ++ + N D ++ + + IR Sbjct: 214 ISTPEQASQIAGYADGIIIGSAIQR-VIEENGQDASKAKNVLADYITKIR 262 >UniRef50_A0RMX3 Tryptophan synthase alpha chain n=5 Tax=Campylobacter RepID=A0RMX3_CAMFF Length = 249 Score = 53.3 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 36/273 (13%), Positives = 84/273 (30%), Gaps = 37/273 (13%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M +++ +KA I + G P+ + ++D++ D++ + Sbjct: 1 MDKIQKAFEGKKANIGYI----VAGYPNLEYTKEFLNLLDESCLDIIEI----------- 45 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFI--- 117 +PY + +MA S + ++L D L I Sbjct: 46 ---GIPYSDPLADGKLISMASF-SACQSGVTTDTVFDMLKDVKTDKALVFLVYYNLILAY 101 Query: 118 --REIFTGAY-ASDFGVWDTNVG--ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICS 172 + A G+ ++ E+ + + ++ + I ++ Sbjct: 102 GEDKFLAKAKEVGISGLIVPDMPHDESSEFRVKTSKFKLCLIPLI-----SPTSSKRTKD 156 Query: 173 IAKSTVFNNHPDA-LCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLS 230 I + L V+G + + ++ V G+ E+V++ Sbjct: 157 ILSDANGFIYAVGSLGVTGGEQSPVDRLKDMIKDIKSSTSLPVAVGFGIKTNEDVKKTKL 216 Query: 231 IADGCVTATTFKKDGVFANFVDQARVSQFMEKV 263 ADG + T K N ++ +E++ Sbjct: 217 YADGAIVGTEIVK---LTNKYSPNEINYHIEEI 246 >UniRef50_C6LHS6 Tryptophan synthase alpha chain n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LHS6_9FIRM Length = 301 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 50/157 (31%), Gaps = 18/157 (11%) Query: 116 FIREIFTGAYASDFGVWDTNVG-----ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDI 170 FIR T A G+ +V E + G + + + I Sbjct: 154 FIR---TAAEIGMDGLILPDVPYEEKEEFDMVCKKYGLDFISLIA--------PTSHERI 202 Query: 171 CSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQ 228 IA + ++ V+G+ + +D + R+ + D G+ E + Sbjct: 203 RRIAADASGFVYCVSSMGVTGMRSEITSDVGSMVRLVKETKDIPCAVGFGISTPEQAAKM 262 Query: 229 LSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 +++DG + + + V +++ + Sbjct: 263 AALSDGAIVGSAIVRLCGQYGKECVPYVREYIRTMKE 299 >UniRef50_A0AJ79 Tryptophan synthase alpha chain n=11 Tax=Listeria RepID=TRPA_LISW6 Length = 257 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 69/223 (30%), Gaps = 21/223 (9%) Query: 26 DPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQ 85 G++ + ++ L+ GV A+ +P+ V +A + Sbjct: 19 TYIMGGDGGLDNLEEQLL----FLEKSGVSAIEI----GIPFSDPVADGPIIQLAGL-RA 69 Query: 86 LMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIF-----TGAYASDFGVWDTNVGETI 140 L + + +N L L + +I IF + + Sbjct: 70 LKKQVSLEAILNKLATSQVQIPLII---MSYINPIFHLGIPKFVEMVQKTPVKGLIIPDL 126 Query: 141 RHQHRIGAGEVKTLFNIVPEAAVYL--GNRDICSIAKSTVFNNHPDAL-CVSGLTAGTRT 197 ++H+ +I V L + IAK + + +G+ T Sbjct: 127 PYEHQTLITPELEGTDIALIPLVSLTSPKERLKEIAKQAEGFIYAVTVNGTTGVRNKFDT 186 Query: 198 DSALLKRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTAT 239 +++ VLA GV E+VE+ + DG + + Sbjct: 187 HIDTHLAYLKSISPVPVLAGFGVSSIEHVEKFAHVCDGVIIGS 229 >UniRef50_P95143 Putative L-lactate dehydrogenase [cytochrome] 2 n=24 Tax=Actinobacteria (class) RepID=LLDD2_MYCTU Length = 414 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 37/126 (29%), Gaps = 6/126 (4%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 DD A+ + GVD ++ SN P + + +L I ++ Sbjct: 282 LDDARAVVDRGVDGIVLSNHGGR--QLDRAPVPFHLLPHVARELGKHTEILVDTGIMSGA 339 Query: 103 VASFDLAMATGAKFI-REIFTGAYASDFGVWDTNVGETIR--HQHRIGAGEVKTLFNIVP 159 +A+ I R G A + E ++ + V L + P Sbjct: 340 DIVAAIALGARCTLIGRAYLYGLMAGGEAGVN-RAIEILQTGVIRTMRLLGVTCLEELSP 398 Query: 160 EAAVYL 165 L Sbjct: 399 RHVTQL 404 >UniRef50_C8WX57 Tryptophan synthase alpha chain n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WX57_ALIAD Length = 267 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 43/284 (15%), Positives = 84/284 (29%), Gaps = 45/284 (15%) Query: 1 MSWLKEVI-GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF 59 M ++E K +I + GDP + +++ D A+ + G D + Sbjct: 1 MDRIREAFQAGHKLLIPFV----VAGDPDY----------ERSLDIACAILDAGADMLEI 46 Query: 60 SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW--DPVASFDLAMATGAKFI 117 PY + A + L R+ V+ L + + ++ Sbjct: 47 ----GFPYSDPLADGPVIQ-AAAVRSLKQGTRL---VDCLRLVRDIRARSPKPLVAFTYV 98 Query: 118 REIFTG---------AYASDFGVWDTNVG--ETIRHQHRIGAGEVKTLFNIVPEAAVYLG 166 + A A GV +V E A ++ + + P + Sbjct: 99 NPLIQYGAERFFAELAAAGGDGVIVPDVPLEEAGEVASAAEAHGIQFVPLVAPTSG---- 154 Query: 167 NRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LEN 224 + +I ++ + +L V+G L + D GV E+ Sbjct: 155 EERVRAIVRAARGFVYCVSSLGVTGERQQVSRQVRELVDLVRKHTDLPACVGFGVSRPEH 214 Query: 225 VEEQLSIADGCVTATTFKK---DGVFANFVDQARVSQFMEKVHH 265 V E ADG + + + + D V F ++ Sbjct: 215 VREIAEFADGVIVGSAYVRRIGDAADEGRDPVEAVRSFTRELKQ 258 >UniRef50_C6CUD8 Tryptophan synthase alpha chain n=2 Tax=Paenibacillus RepID=C6CUD8_PAESJ Length = 268 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 62/222 (27%), Gaps = 37/222 (16%) Query: 42 AWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD 101 + + L+ G D V E +PY + A L + I I D Sbjct: 33 SIAIIKELEQAGADLV----ELGVPYSDPLADGPVIQRASE-RALKNRISIL-------D 80 Query: 102 PVASFDLAMATG--------------AKFIREIFTGAYA--SDFGVWDTN--VGETIRHQ 143 + + A G +F E F G+ + + E + Sbjct: 81 CIETAAQAREAGVKLPFILFTYFNPVLQFGLEDFMNLVVEKGISGLIIPDLPIEEDEEVR 140 Query: 144 HRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALL 202 A V + + + I+K + +L V+G+ A + Sbjct: 141 TLAEAKGVHLIPLV-----APTSKDRVVRISKKARGFVYCVSSLGVTGVRAEFHSGIDEF 195 Query: 203 KRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTATTFKK 243 D + G+ E VE I DG V + + Sbjct: 196 LATVREATDLPIAVGFGISSREQVERFEKICDGVVVGSAIVR 237 >UniRef50_Q7TUC8 Tryptophan synthase alpha chain n=31 Tax=Cyanobacteria RepID=TRPA_PROMP Length = 279 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 76/251 (30%), Gaps = 28/251 (11%) Query: 31 AQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDI 90 + + LQ G D + E +PY + ++ L S Sbjct: 40 GDPNIETTSEILLK----LQEKGADLI----ELGIPYSDPLADGPIIQLSA-SRALKSGT 90 Query: 91 RIPFGVNVLWDPVASFDLAMAT------GAKFIREIFTG--AYASDFGVWDTNVG--ETI 140 + + +L + + F E F + G+ ++ E Sbjct: 91 TLKNVIQLLESLKDKLHIPIILFTYFNPVLNFGLENFCELASKVGVSGLIIPDLPLEEAY 150 Query: 141 RHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDS 199 + I + + + + P + + I+ +T F V+G Sbjct: 151 KFSEIISSYSIDLILLVAPTT----PSERMKIISNNTKGFTYLVSVTGVTGERNKMENRV 206 Query: 200 ALLKRVKETVPDTVVLANTGV-CLENVEEQLSI-ADGCVTATTFKKDGVFANFVDQARVS 257 L + + V G+ E+V + ADG + + F K +N ++ V+ Sbjct: 207 ENLITKLQEISINPVAVGFGISSPEHVNKVRKWGADGVIIGSAFVK--RISNSNEKEVVN 264 Query: 258 QFMEKVHHIRR 268 Q + +R+ Sbjct: 265 QIGKFCEEMRK 275 >UniRef50_C6HY57 Tryptophan synthase alpha chain n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY57_9BACT Length = 270 Score = 52.6 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 43/150 (28%), Gaps = 15/150 (10%) Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGN-RDICSIAKSTVF 179 G D D+ ++ + + + + P + R I A+ F Sbjct: 129 VAGVVIPDLSFEDSR-----DYRPLFRSFGINLVGFVSP--TTPVDRARRIVREARG--F 179 Query: 180 NNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTA 238 + + +G + + + V GV E L ADG + Sbjct: 180 VYYVGLMGTTGASLSITPAVRDMVGQLKQWTALPVCLGFGVNEPRMAREILGFADGVIVG 239 Query: 239 TTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 + ++ + F+E V R+ Sbjct: 240 SRLVREEEDPGLWE----KTFLEFVAECRK 265 >UniRef50_C0QR00 Tryptophan synthase alpha chain n=1 Tax=Persephonella marina EX-H1 RepID=C0QR00_PERMH Length = 256 Score = 52.2 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 98/296 (33%), Gaps = 74/296 (25%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS + E+ +K +I + G PS ++K+++ AL G D + Sbjct: 1 MSLIGEIFSRKKPLI----CYFMAGYPS----------LEKSYETAKALIQSGADILEV- 45 Query: 61 NEFSLPYLTKVRPETTAAMA---------------RIIGQLMSDI-RIPFGVNVLWDPVA 104 +P+ V T +A ++ +L + IP + ++P+ Sbjct: 46 ---GVPFSDPVADGPTIQVAHEKAVKDGITPVNVFQLTEKLKKEFPDIPLILMTYYNPIY 102 Query: 105 SFD------LAMATGAKFIREIFTGAYASDFGVWDTN--VGETIRHQHRIGAGEVKTLFN 156 LA GA G + E + + ++T+F Sbjct: 103 VMGEQDFCKLAKEKGAD--------------GFIVPDLPPEEAENFKRIANSSGLETIFL 148 Query: 157 IVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS------ALLKRVKETVP 210 + P R I I + + + + ++G+ G R ++++K+ Sbjct: 149 LAPT----SHERRIKLIGEMSDSFIY--YVSLTGI-TGERDTLPWEELENKVRQIKKITG 201 Query: 211 DTVVLANTGVCL-ENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 V GV E+ ++ I+DG + + K D + ++++ Sbjct: 202 KK-VAVGFGVSKKEHTQKLSQISDGVIVGSAVVK---LQGKADIEGIKSLVKQLKE 253 >UniRef50_Q5GMK6 Tryptophan synthase alpha chain n=2 Tax=Bacteria RepID=Q5GMK6_9BACT Length = 266 Score = 52.2 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 76/225 (33%), Gaps = 31/225 (13%) Query: 36 NWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARI------------- 82 + +++ + ++AL+ G D V E +P+ + A + Sbjct: 27 DPTLERTAEIVLALEQAGADVV----ELGIPFSDPLADGAVNQEAALRALRHGASLRDVL 82 Query: 83 --IGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETI 140 + L +P + ++PV S+ L G G D E Sbjct: 83 GMVKTLRQRSHVPIVLFTYFNPVHSYGL-DRFGPDCRDAGVDGVLCVDL-----PPEEAG 136 Query: 141 RHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDS 199 ++ + A ++ T+F + P ++ I ++++ F + V+G A D Sbjct: 137 EYKASLDALDIATIFLLAPT----STDKRIETVSRHCTGFVYYVSRTGVTGEQARVGEDV 192 Query: 200 ALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 + + D V G+ E+ E ADG + + + Sbjct: 193 RAMVAKIKQHTDKPVAVGFGISKPEHAAEIARYADGVIVGSAIVR 237 >UniRef50_Q5WGS0 Tryptophan synthase alpha chain n=4 Tax=Bacillaceae RepID=TRPA_BACSK Length = 269 Score = 52.2 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 69/224 (30%), Gaps = 29/224 (12%) Query: 36 NWVIDKAWDDLMALQNGGVD----AVMFSN-EFSLPYLTKVRPETTA-------AMARII 83 + V + ++LQ G D + +S+ P + A AMA + Sbjct: 23 DPVREATIALALSLQRSGADVLELGIPYSDPLADGPVIQNASKRALAGGMTLAKAMALVP 82 Query: 84 GQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETI--R 141 + IP V +P+ +F E F D+G+ V + Sbjct: 83 EMRKEGLTIPVIVFTYANPL----------LQFGFERFCETAC-DYGIDGLLVPDLPFEE 131 Query: 142 HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSA 200 + E + L I + I +IA + +L V+G Sbjct: 132 SETLANECEKQGLALISLV--APTSQQRIKAIASRAQGFLYCVSSLGVTGARKTLHPQVE 189 Query: 201 LLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 ++ + + G+ E VEE ADG V + + Sbjct: 190 AFLKLVKEASPVPFVVGFGISSYEQVEEMGRHADGVVIGSAIVE 233 >UniRef50_Q254T1 Tryptophan synthase alpha chain n=3 Tax=Bacteria RepID=TRPA_CHLFF Length = 257 Score = 51.8 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 42/286 (14%), Positives = 74/286 (25%), Gaps = 54/286 (18%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ ++ K I G D + AL GGVD + Sbjct: 1 MNRIETAFKNTKPFIG----------YLTGGDGGF----DYSVACAHALLRGGVD--ILE 44 Query: 61 NEFSLPYLTKV--RPETTAAMAR-------------IIGQLMSDIRIPFGVN-----VLW 100 P+ V P A R I L IP + +L Sbjct: 45 --IGFPFSDPVADGPIIQKAHTRALKEKTDSTTILEIAKALRQTSDIPLVLFSYYNPLLQ 102 Query: 101 DPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPE 160 A G A + N E+ + E K ++ Sbjct: 103 KGPQYLHQLKAAGFD--------AVLTVDLPIPRNANESESFFQAL--MEAKLFPILLVT 152 Query: 161 AAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTG 219 + + I+K F + +G+ + D + ++A G Sbjct: 153 PSTQ--EERLLQISKLAKGFLYYVSHKGTTGIRSKLSDDFSTQIARLRRYFQIPIVAGFG 210 Query: 220 V-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVH 264 + + L ADG V + F + + ++ F + + Sbjct: 211 IANRASAIAALEHADGFVVGSAFVE--KLEKKISPEELTTFAQSID 254 >UniRef50_B7FQI2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FQI2_PHATR Length = 671 Score = 51.8 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 18/55 (32%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 V+G D + + + G+ E V ++ADG V + Sbjct: 188 VTGARESLPPDLEEFITRVRSKTELPLAVGFGISNPEMVNGVANMADGVVVGSAI 242 >UniRef50_O27697 Tryptophan synthase alpha chain n=3 Tax=Methanobacteriaceae RepID=TRPA_METTH Length = 270 Score = 51.8 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 77/225 (34%), Gaps = 42/225 (18%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETT----------AAMAR-----IIG 84 + + + + L + G DA+ P+ + T+ A M +I Sbjct: 33 ETSLEIIRTLVDAGADALEV----GFPFSDPIADGTSVQGADLRALRAGMTTEKCFQLIE 88 Query: 85 QLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTN-----VGET 139 ++ IP G+ V ++ + + +F R A A G+ + + Sbjct: 89 RVREFTSIPIGLLVYYNLIYRMGV-----DEFYRRA---AEAGVTGILAADLPPEEASDA 140 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTR-TD 198 +R + ++ + + L + I + S+ F+ + V+G + Sbjct: 141 LRAAEKYDIDQIFIVA--PTTGSERL--KRISEV--SSGFHYLVSVMGVTGARSRVEDAT 194 Query: 199 SALLKRVKETVPDTVVLANTGVC-LENVEEQLSI-ADGCVTATTF 241 L+KRVK V+ GV E+V ADG + + Sbjct: 195 IELIKRVKAE-GSLPVMVGFGVSRPEHVRMLRDAGADGVIVGSAI 238 >UniRef50_Q4FUL2 Tryptophan synthase alpha chain n=5 Tax=Gammaproteobacteria RepID=TRPA_PSYA2 Length = 278 Score = 51.8 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 30/86 (34%), Gaps = 5/86 (5%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGV 246 V+G D A + + D V G+ + + + ADG + + ++ Sbjct: 182 VTGSATLDTDDVATQVQAIKAETDLPVCVGFGIRDAASAKAIGAHADGIIVGSALVQNFA 241 Query: 247 FANFVDQARV----SQFMEKVHHIRR 268 + D V + M K+ +R Sbjct: 242 DIDGNDATAVAHAQQKIMAKMTELRE 267 >UniRef50_B9K6Z6 Tryptophan synthase alpha chain n=1 Tax=Thermotoga neapolitana DSM 4359 RepID=TRPA_THENN Length = 240 Score = 51.4 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 30/78 (38%), Gaps = 7/78 (8%) Query: 193 AGTRTDS---ALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFA 248 G R D +K+VK+ + + G+ E V + IADG + + + + Sbjct: 162 TGEREDLPFAEHIKKVKKKIA-LPLFVGFGISRHEQVRKVWEIADGVIVGSALVR--IME 218 Query: 249 NFVDQARVSQFMEKVHHI 266 Q + + +V + Sbjct: 219 ESKRQEIPQKVVARVKEL 236 >UniRef50_Q39SS2 Tryptophan synthase alpha chain n=7 Tax=Desulfuromonadales RepID=TRPA_GEOMG Length = 269 Score = 51.4 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 1/57 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 V+G+ +G A + + V G+ E E + ADG V + K Sbjct: 181 VTGVRSGIEASVAGNVNIIKECSKVPVAVGFGIATPEQAGEVAATADGVVVGSAIVK 237 >UniRef50_B5YJI8 Tryptophan synthase alpha chain n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YJI8_THEYD Length = 259 Score = 51.0 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 80/240 (33%), Gaps = 28/240 (11%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 +++ L L+ G D + E +P+ + T A L S + +N Sbjct: 33 LEETAKRLKILEQAGADLI----ELGVPFSDPLADGPTIQKAAE-RALNSGTTLRKILNF 87 Query: 99 LWDPVASFDLAMATGAKFIREIFTGAY---------ASDFGVWDTN--VGETIRHQHRIG 147 L D S + + ++ +F + GV + V E+ ++ Sbjct: 88 LEDFKKSINTPIIL-MTYLNPVFCYGIERFFHDAKGVNVGGVIFPDLTVEESKYYRTFAK 146 Query: 148 AGEVKTLFNIVPEAAVYLGNRDICSIAKS-TVFNNHPDALCVSGLTAGTRTDSALLKRVK 206 + T+F + P + + I K+ T F + ++G T Sbjct: 147 KYGIDTIFLVAPTSTP----ERVKKIVKASTGFVYYVSITGITGTTLSLDKSFNDHINFV 202 Query: 207 ETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 ++ V GV E + IADG + + K +F + S+F++ + Sbjct: 203 KSFGK-PVCVGFGVSKPEEAKYISRIADGVIVGSAIVK--IFHE--KPEKASEFIKSLRE 257 >UniRef50_B7APJ8 Tryptophan synthase alpha chain n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7APJ8_9BACE Length = 271 Score = 50.6 bits (120), Expect = 5e-05, Method: Composition-based stats. Identities = 14/105 (13%), Positives = 31/105 (29%), Gaps = 2/105 (1%) Query: 163 VYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC 221 + I IA + + + V+G +G TD + G+ Sbjct: 161 SPASHDRIKKIAADSSGFLYCVSSNGVTGTRSGFSTDFDEFFGLIRDNASCPYCVGFGIS 220 Query: 222 -LENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + + S DG + + +V F++++ Sbjct: 221 GAQQARDMASYCDGVIVGSAIVNIIAENGKDCVNKVGDFVQELKE 265 >UniRef50_A1HS65 Tryptophan synthase alpha chain n=2 Tax=Veillonellaceae RepID=A1HS65_9FIRM Length = 271 Score = 50.3 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 53/279 (18%), Positives = 90/279 (32%), Gaps = 39/279 (13%) Query: 1 MSWLKEVI-----GTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVD 55 MS L EV K +I +L A G P F A L ++ A +++ + Sbjct: 1 MSRLNEVFCQLKAQGRKGLI--VYLTA--GCPDFAATLEAVQAVEAAGANIIEI------ 50 Query: 56 AVMFSN-EFSLPYLTKV------RPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDL 108 + FS+ P + K TT + ++ Q+ IP V + V F + Sbjct: 51 GIPFSDPMADGPVIQKAASLALQGGATTGKVLELVRQIRQKSAIPLVVMTYINTVLQFGV 110 Query: 109 AMATGAKFIREIFTGAYASDFGVWDTN--VGETIRHQHRIGAGEVKTLFNIVPEAAVYLG 166 KF+R + A A G+ + V E+ ++ + + I P A Sbjct: 111 -----EKFVR---SFAQAGLDGLIVPDLPVEESALLENYCREAGLALIQFIAPTTA---P 159 Query: 167 NRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENV 225 R + K+ F A V+G+ + L V G+ + Sbjct: 160 ERAVTICHKAAGFLYCISATGVTGVRQVDYSQIGSLINNVRQYTSLPVAIGFGIGSPQAA 219 Query: 226 EEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVH 264 E AD + + + + F E V Sbjct: 220 REAADYADAVIIGSAIMQQLIDKG---VDAARAFTESVR 255 >UniRef50_B8JAL9 Tryptophan synthase alpha chain n=4 Tax=Anaeromyxobacter RepID=B8JAL9_ANAD2 Length = 285 Score = 50.3 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 72/254 (28%), Gaps = 48/254 (18%) Query: 42 AWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD 101 A D +A GG D V E +P+ + T A L + + Sbjct: 53 ARDMALACVEGGADLV----ELGVPFSDPIADGPTIQ-AAAQRALGAGTTL--------- 98 Query: 102 PVASFDLAMATGAKFIREIFTGAYASDFGVWDT-NVGETIRHQHRIGAGEV--KTLFNIV 158 +A A A+ + + G + G R V + +++ Sbjct: 99 -EDVLGIAAAVRAR------SQVPIALMGYLNPMLAGGVERLVRGCADAGVDALIIPDLL 151 Query: 159 PEAAVYLGNRDICSIAK--------------------STVFNNHPDALCVSGLTAGTRTD 198 PE A L + K +T F V+G + Sbjct: 152 PEEAEVLAPASAEAGVKLVYLLAPTSNPARVEAAARAATGFLYFVSVTGVTGARHAVAEE 211 Query: 199 SALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFAN--FVDQAR 255 A L D V+ GV E +ADG V + K R Sbjct: 212 IAPLVSAVRARTDLPVVIGFGVASPEQARALGPLADGVVVGSAIVKRIAEGGSRRARAER 271 Query: 256 VSQFMEKV-HHIRR 268 V++F+ + +RR Sbjct: 272 VTRFVRSLGRALRR 285 >UniRef50_UPI00016C5413 tryptophan synthase alpha chain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5413 Length = 265 Score = 50.3 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 28/85 (32%), Gaps = 4/85 (4%) Query: 185 ALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 + ++G + T+ D + GV E V + IADG + + K Sbjct: 180 VVGITGAREALPSALREQLARLRTMTDLPLCVGFGVSRPEQVRDLKEIADGVIVGSAVVK 239 Query: 244 DGVFANFVDQ---ARVSQFMEKVHH 265 A V +F+ ++ Sbjct: 240 KLEAAGADRAKGLEDVKRFVAELRA 264 >UniRef50_P00931 Tryptophan synthase n=45 Tax=Eukaryota RepID=TRP_YEAST Length = 707 Score = 50.3 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 73/210 (34%), Gaps = 23/210 (10%) Query: 46 LMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS 105 L Q+GGVD + E +P+ + T ++ + L + + +P + ++ Sbjct: 38 LKGFQDGGVDII----ELGMPFSDPIADGPTIQLSNTV-ALQNGVTLPQTLEMVSQARNE 92 Query: 106 FDLAMATGAKFIREIFTG---------AYASDFGVWDTN--VGETIRHQHRIGAGEVKTL 154 + I A A G + E ++ ++ I + + Sbjct: 93 GVTVPIILMGYYNPILNYGEERFIQDAAKAGANGFIIVDLPPEEALKVRNYINDNGLSLI 152 Query: 155 FNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLK-RVKETVPDT 212 + A + + ++ + + +G+ + +D L RV++ DT Sbjct: 153 PLV----APSTTDERLELLSHIADSFVYVVSRMGTTGVQSSVASDLDELISRVRKYTKDT 208 Query: 213 VVLANTGV-CLENVEEQLSIADGCVTATTF 241 + GV E+ + S+ADG V + Sbjct: 209 PLAVGFGVSTREHFQSVGSVADGVVIGSKI 238 >UniRef50_C1N2K9 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1N2K9_9CHLO Length = 307 Score = 50.3 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 33/113 (29%), Gaps = 10/113 (8%) Query: 165 LGNRDICSIAKSTVFNNHPDALC-------VSGLTAGTRTDSALLKRVKETVPDTVVLAN 217 L + A+ T + V+G+ + L + V D V Sbjct: 170 LLSTPTTPEARMTKIAEASNGFIYLVSVTGVTGVRTNVESRVQELVSGLKKVTDKPVAVG 229 Query: 218 TGVC-LENVEEQLSI-ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 G+ E + + ADG + + + + + + + +R Sbjct: 230 FGISKKEQAAQVVGWGADGVIVGSALVR-ALGEAPTPEEGLERLTALAKELRE 281 >UniRef50_UPI00016C49DE phosphoribosylanthranilate isomerase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C49DE Length = 217 Score = 49.9 bits (118), Expect = 9e-05, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 68/225 (30%), Gaps = 30/225 (13%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 +D G DAV N Y R T A ++ L GV V Sbjct: 13 EDARFAAEAGADAVGL-NF----YPQSPRYITPQQAAPLVRAL-PAFTSAVGVFVGMPMR 66 Query: 104 ASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAV 163 + +A G + ++ + + R + R G V+ + A Sbjct: 67 QACAIAFQLGLRGVQSY--DDHPPTEDTFPFAHVPAFRVKDREGLEAVRRFVDAAVAAGR 124 Query: 164 YLGNRDICSIAKSTVFNNHPDALCVSG-LTAGTRTDSALLKRVKETVPDTVVLANTGVCL 222 D+ V G R LL++ ++ G+ Sbjct: 125 P-------------PSAVLIDSFVVGQMGGTGHRAPWQLLQQF---DVGVPLILAGGLTP 168 Query: 223 ENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 ENVEE ++ G A+ ++ D +V++F++ V Sbjct: 169 ENVEEAVATVRPWGVDVASGVER---APGVKDPDKVTRFVQNVRR 210 >UniRef50_D1B8G5 CutC family protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B8G5_THEAS Length = 225 Score = 49.9 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 70/240 (29%), Gaps = 24/240 (10%) Query: 35 MNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPF 94 VI + + ++ G D V F + S LT PE AA+ + IP Sbjct: 2 FVEVIAVSPWEAELVEACGGDRVEFVLDLSCGGLTPSVPEVAAAV--------RGVSIP- 52 Query: 95 GVNVLWDPVASFDLAMATGAKFIREIF-----TGAYASDFGVWDTNVGETIRHQHRIGAG 149 VNV+ P +R GA G + + + Sbjct: 53 -VNVMIRPRPGGFQYSPGEMDQMRRSAQAMAEVGARGLVMGFLKDGAVDLDALKSALTWC 111 Query: 150 EVKTLFNIVPEAAVYLGN-RDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKET 208 I + D A+ D L SG + L+R+ E Sbjct: 112 P-----GIDFTFHRAIDEASDPVEAARVACGAGVTD-LLTSGGPGPIEGNLDRLRRMVEA 165 Query: 209 VPDTVVLANTGVCLENVEEQLSI--ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 V+A G+ EN + + ++D +D + + +E V + Sbjct: 166 AGSVRVMAGGGITGENAPRVILHGGVPAVHLGRSVRRDNSPTEPIDSQLLRRMVELVKGV 225 >UniRef50_D1AQ86 Tryptophan synthase alpha chain n=6 Tax=Bacteria RepID=D1AQ86_SEBTE Length = 252 Score = 49.1 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 79/250 (31%), Gaps = 25/250 (10%) Query: 1 MSWLKEVIGTEKAV-IAMCHLRALPG-----DPSFDAQLGMNWVIDKAWDDLMALQNGGV 54 M+ L+ + +K V I + P + + +++ + Sbjct: 1 MNKLQNIFKDKKNVNIGYI-VGGFPNIDYTRKFLLNLKKSPIDILEIGIP----YSDPIA 55 Query: 55 DAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGA 114 D + S + P+T +I + +I P + ++ + ++ + Sbjct: 56 DGKIISEASFKASQNNITPDTV--FDLLISE-KENIHTPLVFLIYYNLIFAYGIDHFIKK 112 Query: 115 KFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIA 174 + TG D + E I ++ + + I + + ++ Sbjct: 113 S-VEAGITGLVIPDLPYEE--AQELIE---KLNKNNIAFIPLISVTSQNRIPK----LVS 162 Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIAD 233 + + F +L V+G + + D V G+ E++ + + AD Sbjct: 163 QGSGFIYAIASLGVTGTKQVSPDRLTEFIHEIKKYTDLPVAIGFGIKNREDIIKLRNSAD 222 Query: 234 GCVTATTFKK 243 G + T+ + Sbjct: 223 GLIVGTSIVR 232 >UniRef50_Q01NI9 Tryptophan synthase alpha chain n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=TRPA_SOLUE Length = 267 Score = 49.1 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 69/215 (32%), Gaps = 31/215 (14%) Query: 46 LMALQNGGVDAVMFSNEFSLPYLTKVRPE---------------TTAAMARIIGQLMSDI 90 + AL GG D + E +P+ + T A++ I ++ Sbjct: 37 VEALVRGGADLI----ELGVPFTDPIADGPVIQRAGERALKAGTTLASVLEIARKIRETS 92 Query: 91 RIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGE 150 +P + +PV + G + A + D +V E + + Sbjct: 93 EVPLLLFTYLNPV------LRYGLDRLGADAAAAGIDGCLLTDASVEEAESYVAAMHKHG 146 Query: 151 VKTLFNIVPEAAVYLGNRDICSIAK-STVFNNHPDALCVSGLTAGTRTDSALLKRVKETV 209 + + V AA R I +A+ ST F V+G A L + Sbjct: 147 L----DTVFLAAPTSTARRIELVARYSTGFVYLVSRTGVTGERESLSASVAPLIQAVRAA 202 Query: 210 PDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 D + G+ E+V E S + V + F + Sbjct: 203 TDLPLAVGFGISKPEHVAELGSQVEAVVVGSAFVR 237 >UniRef50_B2WJB5 L-lactate dehydrogenase n=2 Tax=Pleosporineae RepID=B2WJB5_PYRTR Length = 401 Score = 49.1 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 43/124 (34%), Gaps = 8/124 (6%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 +D + + GVD ++ SN T A+ ++ + S V+V Sbjct: 267 EDALLACHHGVDGIVVSNHGGR--QLNGALATIDALPEVVAAVRSHTGKKVPVHVDGGIR 324 Query: 104 ASFDL--AMATGAKFIREI----FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNI 157 D+ A+A GA F+ + AY GV + +G V + +I Sbjct: 325 HGTDIFKALALGADFVWVGRPVLWGLAYKGQEGVELALRLLADEFRLCMGLAGVTRVEDI 384 Query: 158 VPEA 161 E Sbjct: 385 GKEY 388 >UniRef50_UPI00006A2EA9 UPI00006A2EA9 related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A2EA9 Length = 270 Score = 48.7 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 24/80 (30%), Gaps = 2/80 (2%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 V+G + + V G+ +E V E ADG V + + Sbjct: 179 VTGAASANLDHVRDMVEKIRHHTPLPVCVGFGIRTVEQVTELAGFADGVVVGSAIV-NAA 237 Query: 247 FANFVDQARVSQFMEKVHHI 266 + D R + V + Sbjct: 238 MSAPTDAERTIAALTLVRQL 257 >UniRef50_Q8TYA2 Tryptophan synthase alpha chain n=1 Tax=Methanopyrus kandleri RepID=TRPA_METKA Length = 275 Score = 48.7 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 84/244 (34%), Gaps = 30/244 (12%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 +++ AL++GGVD + +P+ + T + + + + P+ + Sbjct: 32 LEETVSLARALRDGGVD--ILE--LGVPFSEPIADGPTIQ--KAVDEALRAGTTPW--DC 83 Query: 99 LWDPVASFDLAMATGA---------KFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAG 149 L + F R + A A G+ ++ + A Sbjct: 84 LEVAEEVSEFVPVVLLCYYNTLHANGFERYLSAAAEAGVSGIIVADMPVEESDEVHSVAR 143 Query: 150 EVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC----VSGLTAGTRTDSALLKRV 205 +++ +++ A + + I + + + V+G D+ L R Sbjct: 144 DLEI--DVIYLVAPSTTDERLKKIGERASGFVY---VISRYGVTGARRDLSEDTLELVRW 198 Query: 206 KETVPDTVVLANTGVCLE-NVEEQLSI-ADGCVTATTFKKDGVFANFV--DQARVSQFME 261 D V G+ +VEE ++ ADG + + F K+ + + + RV + + Sbjct: 199 VRDHVDVPVAVGFGISERWHVEEVIAAGADGAIVGSAFIKEIHRSEDIAEAEERVRELAK 258 Query: 262 KVHH 265 ++ Sbjct: 259 ELVE 262 >UniRef50_Q5HPG9 Tryptophan synthase alpha chain n=59 Tax=Staphylococcus RepID=TRPA_STAEQ Length = 243 Score = 48.7 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 7/81 (8%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 +G + D + V V+A G+ E+V++ S+ADG V + K Sbjct: 163 TTGNSGEFHPDLKRKIEYIKKVSKIPVVAGFGIKNPEHVKDIASVADGIVIGSEIVK--- 219 Query: 247 FANFVDQARVSQFMEKVHHIR 267 ++ +F+ + IR Sbjct: 220 ---RIEIDSRKEFITYIKSIR 237 >UniRef50_C4V0L5 Tryptophan synthase alpha chain n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V0L5_9FIRM Length = 261 Score = 48.7 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 62/233 (26%), Gaps = 38/233 (16%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR---------------IIG 84 + D + + G D + E LP+ + A I+ Sbjct: 32 EATVDIVRRAEEAGADLI----ELGLPFSDPMADGPVIQAASVAALKNGMTMKKALEIVK 87 Query: 85 QLMSDIRIP-FGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVG--ETIR 141 ++ IP G+ + M F R + A GV +V E+ Sbjct: 88 EIRRHSEIPIVGMGYIN---------MVNHYGFERFVTDFKAAGMDGVILPDVPHEESEE 138 Query: 142 HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSA 200 + GA + I P + + + V+G+ + Sbjct: 139 MRRICGAHDFTLTEFITP----GTTEERMAETCRHAAGFVYCVSNYGVTGVKEIDYSIIG 194 Query: 201 LLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVD 252 + D + G+ E ADG + + K + +D Sbjct: 195 KVCSAARRYTDIPLAIGFGIGTPEAAARAAKQADGVIVGSAVVKH-ILDGDID 246 >UniRef50_C4KZ63 Tryptophan synthase alpha chain n=1 Tax=Exiguobacterium sp. AT1b RepID=C4KZ63_EXISA Length = 255 Score = 48.3 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 77/249 (30%), Gaps = 30/249 (12%) Query: 31 AQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDI 90 GM+ +I + L+ G A+ E +P+ V A + L + + Sbjct: 22 GDGGMDRLIPTIVE----LERMGATAI----ELGIPFSDPVADGPIIE-AAGMRALEAGV 72 Query: 91 RIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGE 150 + +N L + F++ + ++ +F F + + Sbjct: 73 TLEMVLNHLIEGNDQFNVPI-VLMTYLNPVFRFGVQPFFDRANECGVSGVIIPDLPFEES 131 Query: 151 VKTLFNI--VPEAAVYLGNRDICSIAKSTVFNNHPDALC--VSGLTAGTRTD--SALLKR 204 ++ ++ A + L A++T + V+ + D L Sbjct: 132 LQLFADVNRHDVAVIPLVTLTSSE-ARTTSILEKAEGFVYAVTLKGTTGKADVFPDELLA 190 Query: 205 VKETVPD---TVVLANTGV-CLENVEEQLSIADGCVTATTFKK---DGVFANFVDQARVS 257 + + VLA GV E V+ DG V + + +G + V Sbjct: 191 YLTRLTEQSRVPVLAGFGVHRPEQVQTLGQACDGVVVGSFIVEALHEGRTED------VE 244 Query: 258 QFMEKVHHI 266 ++ H+ Sbjct: 245 TLIQSAKHL 253 >UniRef50_Q1CZH2 Tryptophan synthase alpha chain n=2 Tax=Cystobacterineae RepID=Q1CZH2_MYXXD Length = 263 Score = 48.3 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 25/83 (30%), Gaps = 5/83 (6%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 V+G+ A D + + V+A G+ E + ADG V + + Sbjct: 184 VTGMRAELPPDLSQRLDLVRKAATVPVVAGFGISTAEQARMLSAHADGVVVGSALVRAAH 243 Query: 247 FANFVDQARVSQF-MEKVHHIRR 268 + +RR Sbjct: 244 TEG---LEAAKALCADIKRGLRR 263 >UniRef50_D2B8E9 Tryptophan synthase alpha chain n=2 Tax=Actinomycetales RepID=D2B8E9_STRRD Length = 261 Score = 48.3 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 44/238 (18%), Positives = 77/238 (32%), Gaps = 34/238 (14%) Query: 46 LMALQNGGVDAVMF-----SNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW 100 + A + G DAV + + AA + G L + V ++ Sbjct: 36 VHAYADAGADAVELGFPFSDPMLDGVTIQEASDRAIAAGTTVKGILEEVATLDVDVPLIA 95 Query: 101 DPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPE 160 ++ + +T A+F + G G+ + EV L + Sbjct: 96 MTYSNLVVQQST-AEFCAALTAG---GLRGLIVPDSP----------LEEVGELADAAAA 141 Query: 161 AAVYLG--------NRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPD 211 ++L + IA+ + + + +G + +A L R + + D Sbjct: 142 EGLHLVLLAAPSSSRARLREIAERSRGFVYALTRMGTTGEHSEVPEQAARLGRELKGLTD 201 Query: 212 TVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 VL GV E ADG V A+ + +D AR + E V IRR Sbjct: 202 RPVLFGFGVSNPAQAAELAGHADGVVVASALMR-----KLLDGARPRELGEYVASIRR 254 >UniRef50_B1QUK5 Tryptophan synthase alpha chain n=4 Tax=Clostridiales RepID=B1QUK5_CLOBU Length = 263 Score = 48.3 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 64/213 (30%), Gaps = 30/213 (14%) Query: 46 LMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR---------------IIGQLMSDI 90 + A GVD + E +P+ + A ++ ++ + Sbjct: 37 IKAQSEAGVDII----ELGIPFSDPIADGPVIQDASYRSIQKGTNLKKIFELVKEVREEC 92 Query: 91 RIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGE 150 IP + ++ V + + K I G D ET + + E Sbjct: 93 EIPIIFMLYYNTVLYYGVENFVK-KCIECSVDGIIIPDLPF-----EETFEIREFLNKDE 146 Query: 151 VKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETV 209 + + I + + + ++ V+G D V + Sbjct: 147 DAPFLIPLV---SPVSKDRIPMLVEDQKGFVYCVSSMGVTGQNGEFHKDVKNYLNVVKNS 203 Query: 210 PDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 T V+ G+ E++E + DGC+ + F Sbjct: 204 SKTPVMMGFGIKNPEDIEPYKDVIDGCIVGSHF 236 >UniRef50_A3ESF4 Tryptophan synthase alpha chain n=2 Tax=Leptospirillum sp. Group II RepID=A3ESF4_9BACT Length = 264 Score = 48.3 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 44/140 (31%), Gaps = 15/140 (10%) Query: 106 FDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL 165 LA TG IR GA D D + + ++ + + + Sbjct: 110 VSLAQKTGG--IR----GAVIPDLSYEDAAS-----VRKTFHSAKLSLIPFVSLTTSEKR 158 Query: 166 GNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLEN 224 R +A+S F L +G +L+ + ++ V G+ Sbjct: 159 MKR---IVARSEDFIYLVSLLGTTGKELSETGTLSLMVDRIRSQTESPVCVGFGIQSPAV 215 Query: 225 VEEQLSIADGCVTATTFKKD 244 + L ADG + + ++ Sbjct: 216 AAKVLQFADGVIVGSRLVRE 235 >UniRef50_Q03X02 Tryptophan synthase alpha chain n=2 Tax=Leuconostoc mesenteroides RepID=Q03X02_LEUMM Length = 258 Score = 48.0 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 35/264 (13%), Positives = 85/264 (32%), Gaps = 55/264 (20%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ + + + + A I A+ G P F + +++D A ++ + FS Sbjct: 1 MTRISQALENKNAFIGF----AVAGYPDF--EKSAKYIVDMAEAGADLIEI----GIAFS 50 Query: 61 N-EFSLPYLTKVRPE------TTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATG 113 + P + E TTA + ++ ++ +P Sbjct: 51 DPSADGPTIMNADQEVLENGSTTADVFNLVAEVRKHTDVPL-----------------VF 93 Query: 114 AKFIREIFTGAYASD---------FGVWDTNVG-----ETIRHQHRIGAGEVKTLFNIVP 159 + +F Y GV ++ E + H+ G ++ + Sbjct: 94 LTYTNPVFKYGYEKFLSKMQSLDIQGVIMPDMPMEEQDEFLELVHQYGRDFIQL---VTL 150 Query: 160 EAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTG 219 ++ L + +++ F +L ++G + ++ + + + D V G Sbjct: 151 QSGNRLPKI----VERASGFIYVVSSLGITGSRSKLSNEADEIVTKIKRLTDLPVAVGFG 206 Query: 220 VCLENVEEQLSIADGCVTATTFKK 243 + + +Q AD + + K Sbjct: 207 ITTYDQAKQFESADAIIVGSALVK 230 >UniRef50_C7TFE0 Tryptophan synthase alpha chain n=4 Tax=Lactobacillus rhamnosus RepID=C7TFE0_LACRL Length = 269 Score = 48.0 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 48/277 (17%), Positives = 84/277 (30%), Gaps = 36/277 (12%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS L + KA I + + +D +++AL +GG D V Sbjct: 1 MSNLAAIFKNHKAFIPFV--------------VADDPDLDTTVKNIVALAHGGADIV--- 43 Query: 61 NEFSLPYLTKV--RPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS-------FDLAMA 111 E +P+ V P AA R + + V A ++ Sbjct: 44 -ELGIPFSDPVADGPVIQAADLRAFAANVRTKTVFEIVEAARKKTAVPIVFLTYLNIVFK 102 Query: 112 TGAK-FIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDI 170 G F++ A G+ ++ + I K +I+P I Sbjct: 103 YGYDTFLKRC---AELKVSGLVIPDLP--YESRAEIVPFAEKYGIDIIPL-ITPTSGHRI 156 Query: 171 CSIAKSTVFNNHP-DALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQ 228 IAKS + ++ ++G T L + D G+ + + Sbjct: 157 EKIAKSASGFIYVVSSVGITGERDEFFTGLKTLVTEIKRYTDVPTAIGFGIHTPQQAQTM 216 Query: 229 LSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 SIADG + + A + QF + + Sbjct: 217 ASIADGVIIGSAIVDIVAKEAQNAPAAIEQFTKAIRA 253 >UniRef50_Q60180 Tryptophan synthase alpha chain n=5 Tax=Methanocaldococcus RepID=TRPA_METJA Length = 281 Score = 47.6 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 1/57 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTATTFKK 243 ++G ++ L + + G+ E+VEE IADG + + K Sbjct: 182 ITGAREKVAEETKELIKRVKKFSKIPACVGFGISKREHVEEITEIADGAIVGSAIVK 238 >UniRef50_A3CRL2 Tryptophan synthase alpha chain n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CRL2_METMJ Length = 270 Score = 47.6 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 67/253 (26%), Gaps = 29/253 (11%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKA----WDDLMALQNGGVDA 56 MS + + A I + GDP +A L + + +A + + + D Sbjct: 1 MSRIDALFARSPAFIGF----TVAGDPGIEASLRVAKAMIEAGVDLLEIAIPYSDPVADG 56 Query: 57 VMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGV--NVL--WDPVASFDLAMAT 112 + + A+ R I + + + N++ P F A A Sbjct: 57 PVIERAHVRALRAGTTTDDVFALVRRIREHAPALPLVLFTYHNIIHRRGPDRFFSEAAAA 116 Query: 113 GAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICS 172 GA + + + G R G + + + Sbjct: 117 GADGV--LIVDLPVEESGEVAPCAA-------RYGIDRIALIAPTTSPERQHAILSGASG 167 Query: 173 IAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSI 231 V+G A L + GV E+V ++ Sbjct: 168 FVYLISLEG------VTGERDRLPPGIAGLVGAVREKTGLPLAVGFGVSRPEHVRTVVAA 221 Query: 232 -ADGCVTATTFKK 243 A+G + + + Sbjct: 222 GANGVIVGSALVR 234 >UniRef50_A6Q538 Triosephosphate isomerase n=24 Tax=Epsilonproteobacteria RepID=TPIS_NITSB Length = 232 Score = 47.6 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 27/81 (33%), Gaps = 14/81 (17%) Query: 192 TAGTRTDSALLKRVKE---TVPDTVVLANTGVCLENVEEQLSI--ADGCVTATTFKKDGV 246 G ++ V ++ D +L V N++E LSI DG + T Sbjct: 161 GTGVAAKPEEIEEVLAYLASLTDAPLLYGGSVKPANIKEVLSIPKCDGALIGTA------ 214 Query: 247 FANFVDQARVSQFMEKVHHIR 267 D + +E +R Sbjct: 215 ---SWDVENFIKMIEIAKEMR 232 >UniRef50_P17166 Tryptophan synthase alpha chain n=11 Tax=Lactobacillus RepID=TRPA_LACCA Length = 266 Score = 47.6 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 83/273 (30%), Gaps = 30/273 (10%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS L +V K I + + + +++AL GG D V Sbjct: 3 MSKLADVFKNHKVFIPFI--------------VADDPDFETTVKNVVALAKGGADIV--- 45 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 E +P+ V A + +++R +++ + + ++ + Sbjct: 46 -ELGIPFSDPVADGPVIQ-AADLRAFAANVRTKTVFDIVEAARKETAVPIVF-LTYLNIV 102 Query: 121 FTGAY------ASDFGVWDTNVGETI-RHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSI 173 F Y +D V + + + I K +I+P I I Sbjct: 103 FKYGYDAFLKRCADLNVAGLVIPDLPYESRDEIVPIAEKYGIDIIPL-ITPTSGHRIEKI 161 Query: 174 AKSTVFNNH-PDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSI 231 AKS + ++ ++G L + + G+ E + I Sbjct: 162 AKSASGFIYVVSSMGITGERDEFFAGLKALVAEIKQYTNVPTAIGFGIHTPEQAQTMAGI 221 Query: 232 ADGCVTATTFKKDGVFANFVDQARVSQFMEKVH 264 ADG + + A + +F +++ Sbjct: 222 ADGVIIGSAIVDLVAKEKQQAPAAIEKFTKQIR 254 >UniRef50_B5YJ74 Thiamine-phosphate pyrophosphorylase n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YJ74_THEYD Length = 206 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 45/148 (30%), Gaps = 21/148 (14%) Query: 106 FDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL 165 D+A+A A G + G V E + + IG + EA+ + Sbjct: 71 IDIALAVEAD-------GVHLPQSGFPPRIVREVWKDRFLIGVSTHSI--DEAKEASEWA 121 Query: 166 GNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENV 225 I + S G LK VKE+V V G+ LENV Sbjct: 122 DFITFSPIFHTP-----------SKAHYGEPQGVEKLKEVKESVKCKVFALG-GIKLENV 169 Query: 226 EEQLSIADGCVTATTFKKDGVFANFVDQ 253 E + DG + V + Sbjct: 170 HELIPYCDGIALISGILAQRNIEGTVKK 197 >UniRef50_P42390 Indole-3-glycerol phosphate lyase, chloroplastic n=55 Tax=Viridiplantae RepID=TRPA_MAIZE Length = 347 Score = 47.2 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 39/107 (36%), Gaps = 6/107 (5%) Query: 165 LGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-L 222 + + I K++ F V+G A L + + V + V G+ Sbjct: 239 IPEDRMKEITKASEGFVYLVSVNGVTGPRANVNPRVESLIQEVKKVTNKPVAVGFGISKP 298 Query: 223 ENVEEQLSI-ADGCVTATTFKKD-GVFANFVDQARVSQFMEKVHHIR 267 E+V++ ADG + + + G A+ + + + E ++ Sbjct: 299 EHVKQIAQWGADGVIIGSAMVRQLGEAASP--KQGLRRLEEYARGMK 343 >UniRef50_A4RVE9 Predicted protein n=3 Tax=cellular organisms RepID=A4RVE9_OSTLU Length = 302 Score = 47.2 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 63/221 (28%), Gaps = 28/221 (12%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 ++ L L + G D + E +PY + A L + + +++ Sbjct: 66 LESTKKALKILDDAGADVI----ELGVPYSDPLADGPVIQ-AAATRALENGATLNKVIDL 120 Query: 99 LWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKT----- 153 + + A + I+ GV + ++ Sbjct: 121 VREMTPQIK-APIVMFTYYNPIY------QRGVDKFCADIAAAGAKGLLVPDIPLEETYD 173 Query: 154 LFNIVPEAAVYL-----GNRDICSIAKSTV----FNNHPDALCVSGLTAGTRTDSALLKR 204 + I + + L + K F V+G+ + T L Sbjct: 174 VSEIASKHGIELVLLSTPTTPVERAKKIAQATKGFVYLVSVTGVTGVQSNVATRVEQLVE 233 Query: 205 VKETVPDTVVLANTGVCL-ENVEEQLSI-ADGCVTATTFKK 243 +V D + GV ++ ++ + ADG + + + Sbjct: 234 ELRSVTDKPIAVGFGVSEAKHAKQIVDWGADGVIVGSALVR 274 >UniRef50_A4J150 Tryptophan synthase alpha chain n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J150_DESRM Length = 267 Score = 47.2 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 61/188 (32%), Gaps = 16/188 (8%) Query: 82 IIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIR 141 ++ ++ + P + ++P+ + L +F R+ A A G+ ++ Sbjct: 87 LVKEVKEQVLAPLILMSYYNPILQYGL-----LEFCRDA---ANAGAAGLIVPDLPLEES 138 Query: 142 HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG-LTAGTRTD-- 198 + + AG+V L I P A R + I + + + V+G Sbjct: 139 TELLLAAGQVG-LALI-PLVAPTTNRRRLARITAAAQAFVYC--VTVTGITGTSQNVTGE 194 Query: 199 SALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVS 257 L + + ++A G+ E + DG V + K V+ Sbjct: 195 IEELSKEVREATELPMVAGFGIASPEQAVKVAKYCDGVVVGSALVKLVETHQEDSLEPVA 254 Query: 258 QFMEKVHH 265 ++ Sbjct: 255 NLTRQLKQ 262 >UniRef50_Q8ESU5 Tryptophan synthase alpha chain n=4 Tax=Bacillaceae RepID=TRPA_OCEIH Length = 262 Score = 46.8 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 66/215 (30%), Gaps = 17/215 (7%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 ID + L+ G AV E +P+ V L + I + Sbjct: 32 IDNLNKRIQFLEEAGASAV----ELGIPFSDPVADGPVIQ-DAGQRALANKTTIGSVLEA 86 Query: 99 LWDPVASFDLAMATGAKFIREIFTGAY------ASDFGVWDTNVGETIRHQHRIGAGEVK 152 L + ++ +I ++ + S GV + + + Sbjct: 87 LESEKSQRNI-PVVLMTYINPVWKYGFEQFARDCSQAGVDGIIIPDIP-MEEEDDVASSL 144 Query: 153 TLFNIVPE--AAVYLGNRDICSIAKSTVFNNHPDAL-CVSGLTAGTRTDSALLKRVKETV 209 T +I AA+ + IAK + + ++ +G A D+ + + Sbjct: 145 TQHDIAFIRLAAMTSTEDRLERIAKRSEGFLYAVSVTGTTGERAQHENDAFHFLEKLKQI 204 Query: 210 PDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 VLA G+ E E + DG V + + Sbjct: 205 SHVPVLAGFGISTAERARELSAACDGVVVGSKIVQ 239 >UniRef50_D2QFY3 Tryptophan synthase, alpha subunit n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QFY3_9SPHI Length = 278 Score = 46.8 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 13/92 (14%), Positives = 28/92 (30%), Gaps = 10/92 (10%) Query: 183 PDALC-------VSGLTAGTRTDS-ALLKRVKETVPDTVVLANTGVC-LENVEEQLSIAD 233 D ++G G A +R++ L G+ E + + A+ Sbjct: 188 SDGFIYMVSSASITGSVKGVSDTMKAYFERIQGMNLRNPRLIGFGINNHETFDTACAFAN 247 Query: 234 GCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 G + + F + + F+E + Sbjct: 248 GAIVGSAFIRHLSEKGT-SSESIRSFVETIRS 278 >UniRef50_Q0W630 Tryptophan synthase alpha chain n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W630_UNCMA Length = 273 Score = 46.8 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 15/59 (25%), Positives = 24/59 (40%), Gaps = 3/59 (5%) Query: 188 VSGLTAGTRTDSAL-LKRVKETVPDTVVLANTGVC-LENVEEQLSI-ADGCVTATTFKK 243 V+G A L R K +T V G+ ++V E +S+ ADG + + Sbjct: 185 VTGKRASLSAGIKEVLDRAKAAAGNTPVAVGFGISGPDHVTEIISMGADGAIVGSAIVD 243 >UniRef50_A8FKD8 Tryptophan synthase alpha chain n=15 Tax=Campylobacter RepID=TRPA_CAMJ8 Length = 249 Score = 46.8 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 10/102 (9%), Positives = 31/102 (30%), Gaps = 6/102 (5%) Query: 144 HRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTDSALL 202 + + + + + K F ++ ++G + Sbjct: 132 KECERYNIALITLVSVTT----PKERVKKLVKHAKGFIYLLASIGITGTKSVEEAILQDK 187 Query: 203 KRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 + + + + G+ ++V+ +ADG + T+ K Sbjct: 188 VKEIRSFTNLPIFVGFGIQNNQDVKRMRKVADGVIVGTSIVK 229 >UniRef50_Q2S1Z2 Tryptophan synthase alpha chain n=2 Tax=Rhodothermaceae RepID=TRPA_SALRD Length = 278 Score = 46.4 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 12/87 (13%), Positives = 28/87 (32%), Gaps = 7/87 (8%) Query: 173 IAKSTVFNNHPDALCVSGLTAGTRTDSAL-LKRVKETVPDTVVLANTGV-CLENVEEQLS 230 ++T F ++G L ++ V +L G+ ++ E Sbjct: 165 DERATGFVYAVSVTGLTGSDLAETPSVDEYLMHARDFVAQNPLLVGFGIKTHDDAMELSR 224 Query: 231 IADGCVTATTF--KKDGVFANFVDQAR 255 DG + + + + ++ D R Sbjct: 225 HTDGFIVGSALINRVEALWE---DPER 248 >UniRef50_A8ZZW7 Tryptophan synthase alpha chain n=3 Tax=Deltaproteobacteria RepID=TRPA_DESOH Length = 265 Score = 46.4 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 23/57 (40%), Gaps = 1/57 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 V+G + T A L + + GV ++V + ADG V + F++ Sbjct: 181 VTGSSGLDTTHIADLCAKVQRHTALPICVGFGVSTADDVAKIAQHADGVVIGSAFER 237 >UniRef50_C6N065 Tryptophan synthase alpha chain n=2 Tax=Legionella RepID=C6N065_9GAMM Length = 275 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 22/55 (40%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 V+G +A + + ++T + ++ G+ E E S ADG + Sbjct: 182 VTGSSALDMSSLKSEYQHRKTQTELPLMVGFGIKTPEMAAEVASFADGVIVGAAL 236 >UniRef50_A7I4T3 Tryptophan synthase alpha chain n=2 Tax=Methanomicrobiales RepID=TRPA_METB6 Length = 265 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 73/249 (29%), Gaps = 39/249 (15%) Query: 40 DKAWDDLMALQNGGVD----AVMFSN-EFSLPYLTKVRPETTAAMA------RIIGQLMS 88 + + AL +GG D V FS+ P + + A+ I+ ++ + Sbjct: 29 ETSIRIAKALIDGGTDILEFGVPFSDPVADGPTIQRADDRALASCTTPDTIFAIVREVRA 88 Query: 89 DIRIPFGV-----NVLWDPVASFDL-AMATGAKFIREIFTGAYASDFGVWDTNVGETIRH 142 +P + + F L A G + I + E Sbjct: 89 YSEVPIVFLTYYNTIYRRGIDRFYLEAHEAG---VDGILVADMPVEESDEVAATAE---- 141 Query: 143 QHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN-NHPDALCVSGLTAGTRTDSAL 201 + G + + N + +I + L V+G ++ Sbjct: 142 --KYGIDPIFLVT-------QTTSNERMDTIVRHARGYLYLVSVLGVTGARKTVAPEALA 192 Query: 202 LKRVKETVPDTVVLANTGV-CLENVEEQ-LSIADGCVTATTFKK--DGVFANFVDQAR-V 256 L + D + G+ E+V L+ ADG + + + N + + Sbjct: 193 LLNRVRSHTDLPLAIGFGISTPEHVTTCNLAGADGVIVGSAIVDIVEKNLGNPDAMEQDL 252 Query: 257 SQFMEKVHH 265 +++ + Sbjct: 253 RRYVSVMKK 261 >UniRef50_B5JDA4 Tryptophan synthase alpha chain n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JDA4_9BACT Length = 272 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 12/55 (21%), Positives = 21/55 (38%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTATTF 241 V+G+ D + + D ++ GV E V+ +ADG V + Sbjct: 186 VTGVREDLADDLNEMVGMIREATDKPLVVGFGVSKREQVKSICELADGVVVGSAI 240 >UniRef50_A9BHQ2 Tryptophan synthase alpha chain n=1 Tax=Petrotoga mobilis SJ95 RepID=TRPA_PETMO Length = 258 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 73/253 (28%), Gaps = 35/253 (13%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 MS + EV KA+I GDP+ ++ + ++ L GVD + Sbjct: 1 MSKISEVFKNSKALITYV----TAGDPN----------LEVTKEIILELNKDGVDIIEV- 45 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKF---- 116 +P+ + A L + + + L + + + Sbjct: 46 ---GIPFSDPLADGPIIQKAS-QKALKNGVTLKKIFETLNEIKEEVTCPLVLMGYYNSIL 101 Query: 117 ---IREIFTGAYASD-FGVWDTN--VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDI 170 I T A + GV + E +I + + + P + + Sbjct: 102 NYGIDNFITEAVNTGISGVIIPDLPFDEEEEFYAKIKENGIDPILLVAPNTS----EERL 157 Query: 171 CSIAKSTVFNNHPDALC-VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQ 228 I+K + ++ V+G + + + + G+ + Sbjct: 158 KEISKVCSGFLYCVSIMGVTGDSQAPMEHLKEYSQRVRKYVNIPLAIGFGIDSPTKAKNI 217 Query: 229 LSIADGCVTATTF 241 + DG + + Sbjct: 218 IEYFDGIIVGSAL 230 >UniRef50_Q5LYI8 Tryptophan synthase alpha chain n=59 Tax=Bacilli RepID=TRPA_STRT1 Length = 260 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 32/82 (39%), Gaps = 4/82 (4%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 V+G T R D D VL GV E+++ +++DG + + +D Sbjct: 182 VTGKTGNYRDDLDKHLSNLTAYADIPVLTGFGVSTEEDIKRFNAVSDGVIVGSKIVRDLH 241 Query: 247 FANFVDQARVSQFMEKVHHIRR 268 + V++F+ H + Sbjct: 242 DG---KEEEVAEFVTFGSHFEK 260 >UniRef50_Q8U092 N-(5'-phosphoribosyl)anthranilate isomerase n=5 Tax=Euryarchaeota RepID=TRPF_PYRFU Length = 208 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 45/117 (38%), Gaps = 13/117 (11%) Query: 156 NIVPEAAVYLGNRDICSIAK---STVFNNHPDALCV-SGLTAGTRTDSALLKRVKETVPD 211 ++ V +++ A S + + D + + +G +G L+ Sbjct: 97 FVMKAFRVPTISKNPEEDANRLLSEISRYNADMVLLDTGAGSGK---LHDLRVSSLVARK 153 Query: 212 TVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 V+ G+ ENVEE + + G ++ +K G+ D V +F+ + ++ Sbjct: 154 IPVIVAGGLNAENVEEVIKVVKPYGVDVSSGVEKYGIK----DPKLVEEFVRRAKNV 206 >UniRef50_Q0BWJ7 Tryptophan synthase alpha chain n=2 Tax=Hyphomonadaceae RepID=Q0BWJ7_HYPNA Length = 275 Score = 46.0 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 23/83 (27%), Gaps = 8/83 (9%) Query: 169 DICSIAKSTVFNNHPDALC-------VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV- 220 A++ + V+G + A + V V GV Sbjct: 158 PTTDEARAAKVADGASGFIYFVAVTGVTGSGSADPAAIAGKVAMVRKVSGLPVCVGFGVK 217 Query: 221 CLENVEEQLSIADGCVTATTFKK 243 E IADG V + F + Sbjct: 218 NGAQAVEMAKIADGVVVGSAFVE 240 >UniRef50_B3E0A3 Tryptophan synthase alpha chain n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3E0A3_METI4 Length = 268 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 22/76 (28%), Gaps = 3/76 (3%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKD-- 244 ++G ++ L + E + GV E S ADG V + + Sbjct: 185 ITGPQKEISKEAYELVKSVEPYKRVPLCVGFGVSTPEQARAISSYADGVVVGSALVGEVA 244 Query: 245 GVFANFVDQARVSQFM 260 + R F Sbjct: 245 RIAEGKSTVERFKDFA 260 >UniRef50_A5IYV3 Triosephosphate isomerase n=1 Tax=Mycoplasma agalactiae PG2 RepID=TPIS_MYCAP Length = 258 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 21/61 (34%), Gaps = 8/61 (13%) Query: 192 TAGTRTDSALLKRVKETV-----PDTVVLANTGVCLENVEEQ--LSIADGCVTAT-TFKK 243 G ++ V + P+ VL V N+ + L +G + + + K Sbjct: 182 GTGITPTPEEVENVSALIHKLTSPEVPVLYGGSVNENNINDFTKLPNLNGFLVGSASLKI 241 Query: 244 D 244 D Sbjct: 242 D 242 >UniRef50_C4V0L8 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V0L8_9FIRM Length = 203 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 62/200 (31%), Gaps = 36/200 (18%) Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFG 130 PET +AR + ++ GV V +A A G ++ G ++ Sbjct: 37 AAPETAHEIAREMQRVKK-----VGVFVDAPMAEVNRIADAVGLDYV--QLHGHETAEMA 89 Query: 131 VWDTNVG-ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVS 189 + R+ A I + + Sbjct: 90 RMAERPVIKAYRYGDDFDAEAANVY--------------PAEIILVDSYVKGAAGG---T 132 Query: 190 GLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVF 247 GL + + + RV + VL G+ NV E + G + ++DGV Sbjct: 133 GLAFHWQEAAREIARVTK-----PVLIAGGITAANVREAVETFHPFGIDVSGGLEEDGVK 187 Query: 248 ANFVDQARVSQFMEKVHHIR 267 +A+++ FME V +R Sbjct: 188 ----SKAKITAFMEAVCALR 203 >UniRef50_C7IFK5 Tryptophan synthase alpha chain n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IFK5_9CLOT Length = 242 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 12/75 (16%), Positives = 30/75 (40%), Gaps = 5/75 (6%) Query: 170 ICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKR-VKETVPDTVVLANTGV-CLENVEE 227 + AK +F D V+G + + R K+ + + + G+ ++ ++ Sbjct: 146 VKDSAKGYIFLQATDG--VTGARNELESGLKDIIRETKKKLDNIPICPGFGISNADHCKQ 203 Query: 228 QLSI-ADGCVTATTF 241 + ADG + ++ Sbjct: 204 IKEMGADGVIIGSSL 218 >UniRef50_P16608 Tryptophan synthase alpha chain n=6 Tax=Deinococci RepID=TRPA_THET2 Length = 271 Score = 45.6 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 54/163 (33%), Gaps = 11/163 (6%) Query: 82 IIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIR 141 ++ ++ + P + +PV ++ G F + TG D + +R Sbjct: 82 LVREVRALTEKPLFLMTYLNPVLAWGPERFFGL-FKQAGATGVILPDLPPDE--DPGLVR 138 Query: 142 HQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL-CVSGLTAGTRTDSA 200 IG V L + I ++ + + ++ V+G+ + Sbjct: 139 LAQEIGLETVFLLA-------PTSTDARIATVVRHATGFVYAVSVTGVTGMRERLPEEVK 191 Query: 201 LLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKK 243 L R + V GV + Q ++ADG V + + Sbjct: 192 DLVRRIKARTALPVAVGFGVSGKATAAQAAVADGVVVGSALVR 234 >UniRef50_Q12TL0 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12TL0_METBU Length = 222 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 46/136 (33%), Gaps = 11/136 (8%) Query: 138 ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC----VSGL-- 191 + + +V + + V D+C+ N D + VSG Sbjct: 92 DIEEMRILRDNTDVSIIRTFHVQGDVSAD--DLCNNINMFTSENLIDGVLLDSYVSGKVG 149 Query: 192 TAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTFKKDGVFANFV 251 G D ++ KRV + V D + G+ +NV+ ++ T Sbjct: 150 GTGQLHDLSVSKRVVDLV-DVPAILAGGLNPDNVKACVNEVIPFAVDTA--SGVETDGLK 206 Query: 252 DQARVSQFMEKVHHIR 267 D +V+ F+ V +R Sbjct: 207 DVDKVAAFVNAVRCVR 222 >UniRef50_A9WTC6 L-lactate dehydrogenase n=27 Tax=Actinobacteria (class) RepID=A9WTC6_RENSM Length = 426 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 41/126 (32%), Gaps = 10/126 (7%) Query: 41 KAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLW 100 ++ D + G D V+ SN P + ++ + ++ I ++ Sbjct: 298 QSVTDAQKAIDHGADGVVLSNHGGR--QLDRAPLPFHLIPKVRATVGTEATIMMDTGIMC 355 Query: 101 DPVASFDLAMATGAKFI---REIFTGAYASDFGVWDTNVGETIRHQ--HRIGAGEVKTLF 155 A+A+GA F R G A E +R + + V + Sbjct: 356 GGD--IIAAIASGADFTLIGRAYLYGLMAGGQ-RGVARSLEILRTEMVRTMTLLGVTKIS 412 Query: 156 NIVPEA 161 ++ P+ Sbjct: 413 DLNPDH 418 >UniRef50_A7BCI9 Tryptophan synthase alpha chain n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BCI9_9ACTO Length = 269 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 37/249 (14%), Positives = 70/249 (28%), Gaps = 40/249 (16%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK------VRPETTAAMARIIG------QL 86 ++ + L + GVD + E PY ++ T AA+ R + Sbjct: 33 VEDSIRAGKVLADCGVDVI----ELGFPYSDPGMDGPTIQRATVAALDRGTHLEDLFHAV 88 Query: 87 MSDIRIPFGVNVL--WDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTN-----VGET 139 + W+PV + + R A G+ + + Sbjct: 89 DELTSYGISTCSMTYWNPVEWWGVE--------RFAKDFAAVGGSGLITPDLPPEEGAQW 140 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS 199 + + PE + L I ++ V+ V+G A Sbjct: 141 EAASDKYDLERIYLSAPSSPEHRLKL----IAEHSRGWVYAASSMG--VTGARAAVGAHV 194 Query: 200 ALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQ 258 A + V GV E + ADG + + K +F +D+ + Sbjct: 195 ADVVERTRVAGAERVCVGLGVSNGAQAREIGAYADGVIVGSALVKT-LFDENIDRG-LKA 252 Query: 259 FMEKVHHIR 267 E ++ Sbjct: 253 LGELATELK 261 >UniRef50_D1CBJ1 Tryptophan synthase alpha chain n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CBJ1_THET1 Length = 279 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 19/55 (34%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTF 241 V+G R D ++ G+ +++ + ADG V A+ Sbjct: 181 VTGARDSLPDYLEPFLRTVRGQTDLPLVVGFGISKPDHIRQLHQFADGVVVASAI 235 >UniRef50_Q7URN0 Tryptophan synthase alpha chain n=3 Tax=Planctomycetaceae RepID=TRPA_RHOBA Length = 278 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 10/79 (12%), Positives = 24/79 (30%), Gaps = 12/79 (15%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGV 246 ++G T+ + + G+ E + ++DG + + + Sbjct: 184 ITGERTALPTNLVDNVGWLREQTELPICIGFGISGPETAAQLAPVSDGLIVGSAIVR--- 240 Query: 247 FANFVDQARVSQFMEKVHH 265 RV+ +EK Sbjct: 241 --------RVADAVEKAKS 251 >UniRef50_C7LZ69 Tryptophan synthase alpha chain n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LZ69_ACIFD Length = 269 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 76/233 (32%), Gaps = 24/233 (10%) Query: 45 DLMALQNGGVDAVMFSNEFSLP--------YLTKVRPETTAAMARIIGQLMS-DIRIPFG 95 D++AL GG DA+ FS P ++V + +A I L + + +P Sbjct: 34 DVVALAEGGADAIEIGIPFSDPSMDGPTIQRASEVALQRGTTVAGTIAALRNLVVGVPLV 93 Query: 96 VNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLF 155 V ++ + L A G + G D + + R +G V+ + Sbjct: 94 VMTYYNLLYHQGLERAAG-RLAEAGVAGVIIPDLSIEE--AEPWWRAAREVGLETVQLVA 150 Query: 156 NIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-VSGLTAGTRTDSALLKRVKETVPDTVV 214 + I ++ + L V+G + + + V Sbjct: 151 -------PSTPPARMERIVRAAEGFVYAVGLMGVTGERDQLAETATEIAGRLAGRAELAV 203 Query: 215 LANTGV-CLENVEEQLSI-ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 L GV E+ + ADG V + + + A + +F+ +H Sbjct: 204 LVGIGVSSPEHAAAVVEAGADGAVVGSAIVRRVLEGAP--PAELGEFVRALHE 254 >UniRef50_Q4PNH8 Putative L-lactate dehydrogenase n=1 Tax=uncultured marine bacterium 66A03 RepID=Q4PNH8_9BACT Length = 383 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 27/122 (22%), Positives = 46/122 (37%), Gaps = 23/122 (18%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPF------GV 96 D M + GVD ++ SN + P + +A+ I + + F G+ Sbjct: 261 KSDAMKAVDIGVDGIVVSNHGGRQF--DGNPASISALPSIRQAVGPKYPVVFDSGIRSGL 318 Query: 97 NVLWDPVASFDLAMATGAKFI---REIFTGAYA--SDFGVWDTNV--GETIRHQHRIGAG 149 ++L A+A GA F+ R G A + G + E I +IGA Sbjct: 319 DILR--------ALALGADFVLVGRPFLYGLAAIGTRGGEHVARILEEEIINALLQIGAK 370 Query: 150 EV 151 ++ Sbjct: 371 KI 372 >UniRef50_Q5WX31 Tryptophan synthase alpha chain n=5 Tax=Gammaproteobacteria RepID=TRPA_LEGPL Length = 272 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 8/55 (14%), Positives = 17/55 (30%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 V+G A + ++ ++ G+ E + ADG + Sbjct: 182 VTGSDALKLPELKAQYLQRKAQSKLPLMVGFGIKTPEMAAQVAEFADGVIVGAAL 236 >UniRef50_C2KZB1 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Oribacterium sinus F0268 RepID=C2KZB1_9FIRM Length = 212 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 40/127 (31%), Gaps = 17/127 (13%) Query: 144 HRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLK 203 ++ I+ E +DI S D G G ++ LL+ Sbjct: 98 KQLRGDSFLWQAFILKE------EKDIERANASPADLILLDG----GKGEGKEAEAELLQ 147 Query: 204 RVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFME 261 ++ + G+ ENV E++ G ++ ++ G ++ F+ Sbjct: 148 KIHR-----PYILAGGLSPENVVEKIKAFSPYGLDVSSGIEEQGEDGVRKSPEKMDSFIH 202 Query: 262 KVHHIRR 268 V + Sbjct: 203 LVREYEK 209 >UniRef50_Q2W414 Dioxygenase related to 2-nitropropane dioxygenase n=4 Tax=Proteobacteria RepID=Q2W414_MAGSA Length = 351 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 57/195 (29%), Gaps = 27/195 (13%) Query: 67 YLTKVRPETTAAMARIIGQLMSDI-RIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAY 125 ++T E A + I + PFGVNV L + + +F Sbjct: 67 FITAASYEDIADLRTAIRRCRDLSEGKPFGVNV-------SMLPKLVQGERTQAVFDLIV 119 Query: 126 ASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDA 185 + +T+ + + A I V + + A+ DA Sbjct: 120 EENVRFVETSGRNPAAYLPALKA------AGITVVHKVPAVKYALKAEAE------GVDA 167 Query: 186 LCVSGLTAGTRTDSALL-----KRVKETVPDTVVLANTGV-CLENVEEQLSI-ADGCVTA 238 + V G G ++ V +L G+ ++ L++ ADG V Sbjct: 168 VAVVGAECGGHPGMDMVGTMVQANVAAAKLSIPLLVGGGIGTGAHLVAALALGADGVVVG 227 Query: 239 TTFKKDGVFANFVDQ 253 T F D Sbjct: 228 TRFLVAEEIWAHADY 242 >UniRef50_A6C191 Tryptophan synthase alpha chain n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C191_9PLAN Length = 273 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 11/69 (15%), Positives = 21/69 (30%), Gaps = 1/69 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 +G+ + + D + G+ E+V ADG + + K Sbjct: 185 TTGVRDELPPELTAHLEALRELTDLPLAVGFGISNPEHVNTLRGKADGFIIGSAIVKQFA 244 Query: 247 FANFVDQAR 255 + Q R Sbjct: 245 AFSDESQKR 253 >UniRef50_A3S0B4 Tryptophan synthase alpha chain n=4 Tax=Bacteria RepID=A3S0B4_RALSO Length = 264 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 48/253 (18%), Positives = 75/253 (29%), Gaps = 55/253 (21%) Query: 11 EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTK 70 +K ++ M HL + G PS +A M +D+A GVD V LP+ Sbjct: 19 QKPLLLMTHL--VVGYPSLEANWAMLEAMDRA----------GVDLVELQ----LPFS-- 60 Query: 71 VRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFG 130 E A R + + I G W F A F + +F G Y S FG Sbjct: 61 ---EPIADGPRFVKA--NHDAIQAGTT--WHTYFDFAAKAAQRFSF-KIVFMGYYNSVFG 112 Query: 131 VWDTNVGETIRHQHRIGAGEVK--TLFNIVPEAAVYL-------------------GNRD 169 + R + + L ++ PE A L Sbjct: 113 MG------AERFCASLSESGMSGFILPDLPPEEATQLNAIARGRDLDPILIMTPTSSPTR 166 Query: 170 ICSIAKSTVFNNHPDA-LCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEE 227 + I + + A + V+G D + G+ +V Sbjct: 167 LAEIGRQASGFLYCAARVGVTGRRTDLSQDVVAFMDKCRAATSLPLGLGFGIRTPSDVRG 226 Query: 228 QLSIADGCVTATT 240 +AD + T Sbjct: 227 LRGLADMAIVGTA 239 >UniRef50_B9M7D5 Tryptophan synthase alpha chain n=3 Tax=Geobacter RepID=TRPA_GEOSF Length = 264 Score = 44.9 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 68/213 (31%), Gaps = 33/213 (15%) Query: 49 LQNGGVDAVMFSNEFSLPYLTKVRPETTAAMA---------------RIIGQLMSDIRIP 93 L+ G D + E +P+ + T ++ + + +IP Sbjct: 40 LEKAGADLI----ELGVPFSDPMADGPTIQLSSERALAAGTTLPKILATVKSVRRKTQIP 95 Query: 94 FGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKT 153 + ++P + + +F+ + + D E + + Sbjct: 96 IILMGYYNP-----IFLHGVERFVSDAVAAGVDGVL-LVDLPPEEAGEFKAIADRSGLAV 149 Query: 154 LFNIVPEAAVYLGNRDICSIAK-STVFNNHPDALCVSGLTAGTRTDS-ALLKRVKETVPD 211 +F + P I +A + F + V+G + + + ++++++ V Sbjct: 150 IFLLTPT----SDEERIRKVAHLGSGFIYYVSVTGVTGARSSVAENVFSDVQKIRKRVT- 204 Query: 212 TVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 V+ G+ S+ADG V + + Sbjct: 205 LPVVVGFGISDPAQAGSIASVADGVVVGSALVR 237 >UniRef50_A4CE02 FMN-dependent alpha-hydroxy acid dehydrogenase n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4CE02_9GAMM Length = 357 Score = 44.5 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 43/122 (35%), Gaps = 4/122 (3%) Query: 42 AWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD 101 A +D M + GV ++ SN + P + ++ I + +D I + Sbjct: 232 AVEDAMLAKELGVAGIVVSNHGGR--VLDTMPASVMMLSLIRQAVGNDFLILCDSGIRRG 289 Query: 102 PVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRI--GAGEVKTLFNIVP 159 LA+ A I A A+ + ++ ++ + ++ ++ +I Sbjct: 290 SDIFKALALGADAVLIGRPIMYALATAGPLGVAHMLRILKDELQLTMALCGCASIADIST 349 Query: 160 EA 161 + Sbjct: 350 KH 351 >UniRef50_B1I3Z4 Tryptophan synthase alpha chain n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I3Z4_DESAP Length = 267 Score = 44.5 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 13/81 (16%), Positives = 23/81 (28%), Gaps = 4/81 (4%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLE-NVEEQLSIADGCVTATTFKK--D 244 V+G AG + R + G+ + V D + + D Sbjct: 183 VTGTQAGYSEKLVGVCRTVREHTGLPIAVGFGIARDAQVRALAPHVDALIVGSAIVDLID 242 Query: 245 GVFANFVDQARVSQFMEKVHH 265 G N RV + ++ Sbjct: 243 GADENG-SVKRVCSLVLELKQ 262 >UniRef50_C6VWH1 Tryptophan synthase alpha chain n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VWH1_DYAFD Length = 267 Score = 44.5 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 75/241 (31%), Gaps = 26/241 (10%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 L ALQ+GG D V +PY V T + L + + + + L Sbjct: 34 RRVLQALQDGGADLVEI----GMPYSDPVADGETIQQSND-RALENGMSVRVLFDQLQGM 88 Query: 103 VASFDLAMATGAKFIREIFTG---------AYASDFGVWDTNVG---ETIRHQHRIGAGE 150 + + +I + A G+ ++ ++ A Sbjct: 89 RETITV-PVLLMGYINPVLQYGIEAFCAKCAEVGVDGLILPDMPMDVYLDEYKSIFDAHG 147 Query: 151 VKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSA-LLKRVKETV 209 + +F + P+ + R I +++ F + V+G +G D +R+ Sbjct: 148 LLNIFLVTPQTSEKRI-RQIDEVSEG--FIYTVSSASVTGSKSGVSGDMQSYFERLDAMN 204 Query: 210 PDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFANFVDQAR-VSQFMEKVHHIR 267 L G+ + + S A G + + F + V D + F+ V Sbjct: 205 LRNPRLIGFGIKDHDTFVKASSHAAGAIIGSAFIR--VLQESTDLENDIKTFVRDVKRSS 262 Query: 268 R 268 Sbjct: 263 E 263 >UniRef50_C6M905 Tryptophan synthase alpha chain n=5 Tax=Neisseria RepID=C6M905_NEISI Length = 299 Score = 44.5 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 9/78 (11%), Positives = 28/78 (35%), Gaps = 1/78 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 V+G + + + + D + G+ E+ + ++AD + + K+ Sbjct: 218 VTGAASLDTEEVSRKIELLRKYIDIPIGVGFGISNAESARKIGAVADAVIVGSRIVKEIE 277 Query: 247 FANFVDQARVSQFMEKVH 264 + V ++++ Sbjct: 278 NNAGREAEAVGALVKELK 295 >UniRef50_C6CUD6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CUD6_PAESJ Length = 220 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 38/98 (38%), Gaps = 10/98 (10%) Query: 174 AKSTVFNNHPDALCV--SGLTAGTRTDSALLKRVKETVP--DTVVLANTGVCLENVEEQL 229 A+ + + DA+ + +G G D L+ + D + G+ NV E L Sbjct: 126 ARVSAYEGAVDAILIDTAGGGTGQTFDWQLITDYQNAAAAIDVPLYVAGGLHPGNVGELL 185 Query: 230 SI--ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + DG ++ + DG D ++ F+ KV Sbjct: 186 AGNPVDGIDVSSGVETDG----RKDIEKIRLFVRKVIE 219 >UniRef50_Q2FKX6 Tryptophan synthase alpha chain n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FKX6_METHJ Length = 259 Score = 44.1 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 82/250 (32%), Gaps = 41/250 (16%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR---------------II 83 + + + + AL+NGG D + E LP+ V A ++ Sbjct: 27 FETSLEIIKALENGGADII----ELGLPFSDPVADGPVIQQADQRALASGMNTDRFFDLV 82 Query: 84 GQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTG-AYASDFGVWDTNVGETIRH 142 ++ IP + T R+I T A+D G+ V + Sbjct: 83 REVRKSSDIP------------LVVLTYTNLILQRDINTFYQDAADAGIDAVVVADLPYE 130 Query: 143 QH--RIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSA 200 + I A E + ++ + R + + F AL V+G+ T + Sbjct: 131 EAGPYITAAETAGVAPVMMVSTTTSPERLSKILTVKSGFIYLVAALGVTGMRQKTDPVAQ 190 Query: 201 LLKRVKETVPDTVVLANTGVCL-ENVEEQLSI-ADGCVTATTFKKDGVFANFVDQA---- 254 L + D + G+ E V E AD + + ++ + + V Sbjct: 191 KLLADLKNRTDIPIAPGFGISDREQVREWTDAGADAVIVGSALVRE-IEDSLVKPDTLIP 249 Query: 255 RVSQFMEKVH 264 R++ F++ + Sbjct: 250 RITAFVQSLK 259 >UniRef50_Q49Z39 Thiamine-phosphate pyrophosphorylase n=6 Tax=Staphylococcaceae RepID=THIE_STAS1 Length = 212 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 57/175 (32%), Gaps = 27/175 (15%) Query: 79 MARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGE 138 +AR + L D +PF VN LA GA G + + E Sbjct: 57 LARKLLALCHDYAVPFIVN------DDVALANKIGAD-------GIHVGQDDMDVKVFAE 103 Query: 139 TIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTD 198 + + +I + + ++ + + +T S A Sbjct: 104 --QFKGKIIGLSISNIDEYKTSNLAHVDYIGVGPMYATT-----------SKDDANLPVG 150 Query: 199 SALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTFKKDGVFANFVD 252 ++ +++ V ++A G+ +EN E + ADG + K +N + Sbjct: 151 PEMITKLRAHVNHFPIVAIGGINVENTREVMQAGADGISIISAITKSENISNTIS 205 >UniRef50_A7Z615 Tryptophan synthase alpha chain n=15 Tax=Bacillaceae RepID=TRPA_BACA2 Length = 265 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 67/221 (30%), Gaps = 31/221 (14%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR---------IIGQL---- 86 D + + ++LQN G A+ +PY + A I+ + Sbjct: 23 DISVELAISLQNAGASALEI----GVPYTDPLADGPVIQRASKRALENGMNIVKAIELGG 78 Query: 87 ---MSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQ 143 + + IP + ++PV L +RE + + ++ Sbjct: 79 KMKKNGVHIPIILFTYYNPV--LQLEKEYFFALLRENDIDGLLVPDLPLEESA--LLQM- 133 Query: 144 HRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLK 203 + + + P + L ++ F +L V+G+ + Sbjct: 134 -TCKKENIAYISLVAPTSEDRLKIIT----EQACGFVYCVSSLGVTGVRSEFDPSVYSFI 188 Query: 204 RVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 R + V G+ + V+ I+DG V + K Sbjct: 189 RKVKEFSSVPVAVGFGISNRKQVDGMNEISDGVVVGSALVK 229 >UniRef50_UPI0001850C57 tryptophan synthase subunit alpha n=1 Tax=Bacillus coahuilensis m4-4 RepID=UPI0001850C57 Length = 259 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 30/128 (23%), Gaps = 23/128 (17%) Query: 139 TIRHQHRIGAGEVK--TLFNIVPEAAVYLG------NRDICSIAKSTVFNNHP------- 183 + VK + ++ E + + + T +N Sbjct: 110 ILEFAKTARKANVKGVIIPDLSFEESGPFKQILNQQEIGLIQLLSLTSSSNRAKSIIQSS 169 Query: 184 DALC-------VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGC 235 D ++G L + V A G+ + V + DG Sbjct: 170 DGFIYVVTVKGITGARKEILQSVESLLSEVNEISPVPVYAGFGISTRQQVTRFQDLCDGA 229 Query: 236 VTATTFKK 243 + + Sbjct: 230 IVGSKIVD 237 >UniRef50_A6LU97 Tryptophan synthase alpha chain n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=TRPA_CLOB8 Length = 258 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 31/108 (28%), Gaps = 10/108 (9%) Query: 163 VYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS-----ALLKRVKETVPDTVVLAN 217 I +I K + + T G R L+ V++ V D + Sbjct: 155 ARTSKDRIAAITKGAKGFVYC---VSTNGTTGERKTLDSGTHEYLEEVRKNV-DIPMCIG 210 Query: 218 TGVCL-ENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVH 264 G+ E V+E DG + + K V + + Sbjct: 211 FGISSKEVVKEVKDYCDGVIVGSAIVKRMAEGKEAVIDFVKDLSDGLK 258 >UniRef50_C0RFZ2 Tryptophan synthase alpha chain n=134 Tax=Bacteria RepID=TRPA_BRUMB Length = 279 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 16/55 (29%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 ++G R + D + GV E + ADG V T Sbjct: 183 ITGAAIADTAKVGEAVRHIKKSTDLPICVGFGVKTPEQAAAIATHADGVVVGTAI 237 >UniRef50_B5Y770 Enoyl-(Acyl-carrier-protein) reductase II n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y770_COPPD Length = 297 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 51/165 (30%), Gaps = 30/165 (18%) Query: 83 IGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRH 142 I ++ + PFGVN++ ++F ++ V T G Sbjct: 54 IREVRRNTSKPFGVNLM------------LQSEFWQDQIKVVLEEKPPVITTGAGNPSSF 101 Query: 143 QHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV----SGLTAGTRTD 198 + +K L + L + D + SG G T Sbjct: 102 MKTLKEKGIKILPLVGSANQALL------------LEKAGADGVIAEGKESGGHIGDVTT 149 Query: 199 SALLKRVKETVPDTVVLANTGVCLENVEEQLSI--ADGCVTATTF 241 L+ V ++V + V+A G+ ++ + A G T F Sbjct: 150 IVLVNAVLKSVTNIPVIAAGGIVDKDSYRAMRAMGAAGVQMGTRF 194 >UniRef50_A3IFH8 Tryptophan synthase alpha chain n=2 Tax=Bacillaceae RepID=A3IFH8_9BACI Length = 264 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 74/217 (34%), Gaps = 25/217 (11%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIG-----------QLM 87 ++ ++ LQ GV A+ F+ P E A + G Sbjct: 30 LETVKPTILKLQQLGVTAIEVGIPFTDPVADGPTIERAGERALVAGVTLRKVLQALASFK 89 Query: 88 SDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIG 147 +I +P + ++PV ++ L A + G D + ++ + + + Sbjct: 90 EEITVPLVIMTYFNPVLAYGLEAFAQA-CVAGGVKGIIVPDVPLEESGI-----LREALN 143 Query: 148 AGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL-CVSGLTAGTRTDSA-LLKRV 205 + + + ++ I IA ++ + + ++G + T+ + Sbjct: 144 PHSIDVIQLV----SLTSPPERITRIAAASQGFVYAVTVNGITGERSNFATNLEQHFASL 199 Query: 206 KETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 ++ P VLA G+ E V+ + DG + + Sbjct: 200 RQASP-IPVLAGFGISTPEQVKSMGMLGDGVIVGSAV 235 >UniRef50_Q9URN8 Mutant tryptophan synthase (Fragment) n=4 Tax=Neurospora crassa RepID=Q9URN8_NEUCR Length = 193 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 28/82 (34%), Gaps = 9/82 (10%) Query: 169 DICSIAKSTVFNNHPDALC-------VSG-LTAGTRTDSALLKRVKETVPDTVVLANTGV 220 S A+ V D+ V+G LL RVK+ + GV Sbjct: 34 PATSDARMRVLCQLADSFIYVVSRQGVTGASGTLNANLPELLARVKKYSGNKPAAVGFGV 93 Query: 221 -CLENVEEQLSIADGCVTATTF 241 ++ + +IADG V + Sbjct: 94 STHDHFTQVGAIADGVVVGSMI 115 >UniRef50_Q0EXD5 Tryptophan synthase alpha chain n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EXD5_9PROT Length = 269 Score = 43.7 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 21/65 (32%), Gaps = 1/65 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKDGV 246 ++G G D +T+ D + G+ E S ADG V + F Sbjct: 182 ITGADMGEVADIRSSVAHIKTMTDLPICVGFGIKTPEQASAVASFADGVVVGSHFVNQIA 241 Query: 247 FANFV 251 V Sbjct: 242 GDGDV 246 >UniRef50_B0C6F8 Tryptophan synthase alpha chain n=32 Tax=cellular organisms RepID=TRPA_ACAM1 Length = 267 Score = 43.7 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 73/238 (30%), Gaps = 45/238 (18%) Query: 27 PSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKV--RPETTAAMARI-- 82 P A A L A G D + E +PY + P AA R Sbjct: 21 PFLTAGDPDLATTASALRQLDA---SGADLI----ELGVPYSDPLADGPVIQAAATRALQ 73 Query: 83 -----------IGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGV 131 + +L +IR P + ++P+ +A +F+++I A A G+ Sbjct: 74 RGTRLDQVLEMVTELSPEIRAPIILFTYYNPIYHRGVA-----EFLQQI---AKAGVRGL 125 Query: 132 WDTN-----VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIA-KSTVFNNHPDA 185 + ++ +G + + I IA +S F Sbjct: 126 VVPDLPLEESENLLQQAADLGIEVTLLVAPTSSK-------ERIEKIALRSQGFIYLVST 178 Query: 186 LCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSI-ADGCVTATTF 241 V+G+ L V D + G+ E+ + + AD + + F Sbjct: 179 TGVTGMRTKVENRVQDLIADLRQVTDKPIGVGFGISRTEHARQVMDWGADAAIVGSAF 236 >UniRef50_A4XLM5 N-(5'-phosphoribosyl)anthranilate isomerase n=2 Tax=Clostridia RepID=A4XLM5_CALS8 Length = 204 Score = 43.7 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 58/171 (33%), Gaps = 33/171 (19%) Query: 94 FGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKT 153 GV + ++ FI + E+I + + K Sbjct: 55 VGVFKNQNYSEVLSISKDLQLDFI---------------QLHSSESIEFITYLKSQGFKI 99 Query: 154 LFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTV 213 + + E ++ I +S ++ + D + + D ++K K D Sbjct: 100 IKAVEVE--------NMDDIERSKIYKDIADFILL---DRPKGKDVDIIKLAKNADFDF- 147 Query: 214 VLANTGVCLENVEEQLSIAD-GCVTATTFKKDGVFANFVDQARVSQFMEKV 263 + G+ EN+E+ L++ G ++ + +G D ++ MEK+ Sbjct: 148 -IIAGGITPENIEKYLALNPLGVDVSSGIETNGFK----DFNKMKTIMEKL 193 >UniRef50_C1TRA2 Tryptophan synthase alpha chain n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TRA2_9BACT Length = 258 Score = 43.7 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 35/256 (13%), Positives = 80/256 (31%), Gaps = 38/256 (14%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMF- 59 M LK + A+I + GDP + + + + A+ G DAV Sbjct: 1 MRDLKVIFDGGTALIPFV----MAGDPDLETTVSL----------VEAMVEAGADAVELG 46 Query: 60 ----SNEFSLPYLTKVR------PETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLA 109 P + + T ++ + G++ + +P + ++P+ S+ + Sbjct: 47 MPFSDPLADGPVIQEAGQRALACGTTLESVVKTAGEIAGKVAVPIILMGYFNPIMSYGV- 105 Query: 110 MATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD 169 + D + + E + G + + + + + Sbjct: 106 DRFASDLADAGVAAVIVPDLPFDEGD--ELHQALKSRGVTPILMV-------SPNIADDR 156 Query: 170 ICSIAKSTVFNNHPDALC-VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEE 227 + I + +L V+G + ++RV+ V + G+ E + Sbjct: 157 LAEIGNRAEGFVYCVSLLGVTGQGSLHEGMEGYIERVRRNV-SVPLALGFGIDGPERAAQ 215 Query: 228 QLSIADGCVTATTFKK 243 + DG V + K Sbjct: 216 VAPLVDGVVVGSAVVK 231 >UniRef50_B7HH02 Tryptophan synthase alpha chain n=76 Tax=Bacillus RepID=TRPA_BACC4 Length = 258 Score = 43.3 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 58/159 (36%), Gaps = 16/159 (10%) Query: 85 QLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIR--H 142 ++ +++IPF + +PV +F +FI A G+ ++ + Sbjct: 85 EVRKEVQIPFVLMTYLNPVLAFG-----KERFIENCI---EAGVDGIIVPDLPYEEQNII 136 Query: 143 QHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-VSGLTAGTRTDSAL 201 + + + + I I + + + V+G+ + + Sbjct: 137 ASLLREVNIALIPLVTVT----SPIERIEKITSESEGFVYAVTVAGVTGVRQNFKEEIHS 192 Query: 202 LKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTAT 239 ++ + V+A G+ E+VEE ++I DG V + Sbjct: 193 YLEKVKSHVNLPVVAGFGISTKEHVEEMVTICDGVVVGS 231 >UniRef50_A6VFT8 Tryptophan synthase alpha chain n=9 Tax=Euryarchaeota RepID=TRPA_METM7 Length = 259 Score = 43.3 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 64/221 (28%), Gaps = 21/221 (9%) Query: 62 EFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD----PVASFDLAMATGAKFI 117 E +P+ V T A + L + +I VL + L F Sbjct: 35 ELGIPFSDPVADGPTIQ-AADVRALSNGFKIAKSFEVLKEFRKESDTPVILMTYYNPVFK 93 Query: 118 REIFTGAYASDFGVWDTNV------GETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDIC 171 R I T + + + E ++ E+ T+F AA + Sbjct: 94 RGIETFVMQAKEAGANGLIIVDLPLQEATEYREICKKHEMGTVFL----AAPNTPEERLK 149 Query: 172 -SIAKSTVFNNHPDALCVSGLTAGTR-TDSALLKRVKETVPDTVVLANTGVCLENVEEQL 229 S ST F ++G +KR + T + G+ + E L Sbjct: 150 ISDEASTEFLYLISTFGITGARESFEQMTFDFIKRARTTCKGK-ICVGFGISKGSHAESL 208 Query: 230 --SIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 ADG + + F ++A V E + Sbjct: 209 IEQGADGVIVGSAFVDIIKNYGDSEEALVK-LEELAKELSE 248 >UniRef50_A3HZI7 Tryptophan synthase alpha chain n=3 Tax=Bacteroidetes RepID=A3HZI7_9SPHI Length = 257 Score = 43.3 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 2/56 (3%) Query: 188 VSGLTAG-TRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTF 241 ++G AG + A +RVK L G+ E + ++G + + F Sbjct: 179 ITGAKAGISEEQIAYFERVKAMNLKNPRLIGFGISDAETFSKASEYSNGAIIGSAF 234 >UniRef50_A1VL09 FMN-dependent alpha-hydroxy acid dehydrogenase n=2 Tax=Betaproteobacteria RepID=A1VL09_POLNA Length = 372 Score = 43.3 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 37/114 (32%), Gaps = 7/114 (6%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 +D G+D ++ SN P A+ ++ + I + V Sbjct: 256 EDARLAVEHGIDGLIVSNHGGR--QLDAAPSAIHALPAVVASVQGRIPVFMDSGVRRGSD 313 Query: 104 ASFDLAMATGAKFI-REIFTG----AYASDFGVWDTNVGETIRHQHRIGAGEVK 152 + +A+ A F+ R + G A V E +R +GA + Sbjct: 314 IAKAIALGAKAVFLGRPLLYGLAAQGAAGIDAVMKQFSDELVRTMILLGASRIA 367 >UniRef50_D2R0T1 Phosphoribosylanthranilate isomerase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R0T1_9PLAN Length = 238 Score = 42.9 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 56/193 (29%), Gaps = 28/193 (14%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPE---TTAAMARIIGQLMSDIRIPFGVNVL 99 DD +A GVDA+ + + T +MAR ++ + GV V Sbjct: 21 VDDAVAAIEAGVDAIGLNFYGKSKRYIPLDAARQLVTESMARSARRV-----LWTGVFVN 75 Query: 100 WDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 + + D E +R V+ L + Sbjct: 76 ASVDEIAEHVQRVPLDIV------QLHGDE------PPEMVRLVRERVGSTVRVLRAVRV 123 Query: 160 EAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTA------GTRTDSALLKRVKETVPDTV 213 D +A+ N+ PDA+ V G + D + R + Sbjct: 124 SQKSLAEVADYLEVARRA--NSEPDAVLVDAHATEEYGGTGKQLDWTSVHRDLAILGSMP 181 Query: 214 VLANTGVCLENVE 226 ++ G+ ENV Sbjct: 182 LILAGGLTPENVG 194 >UniRef50_Q4KKP5 Tryptophan synthase alpha chain n=18 Tax=cellular organisms RepID=TRPA_PSEF5 Length = 270 Score = 42.9 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 16/55 (29%), Gaps = 1/55 (1%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 V+G A T D + G+ E +ADG V + Sbjct: 182 VTGAGAATLEHVEEAVARLRRHTDLPISIGFGIRTPEQAAAIARLADGVVVGSAL 236 >UniRef50_B2JRL0 L-lactate dehydrogenase (Cytochrome) n=1 Tax=Burkholderia phymatum STM815 RepID=B2JRL0_BURP8 Length = 399 Score = 42.9 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 25/73 (34%), Gaps = 2/73 (2%) Query: 45 DLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVA 104 D + Q GVDAV+ SN + A+ I ++ D + V Sbjct: 269 DALNAQRHGVDAVIISNHGGR--QLDRAIASINALPLIRREVGDDFPLMIDGGVRRGSDI 326 Query: 105 SFDLAMATGAKFI 117 + L + F+ Sbjct: 327 AIALCLGANFVFV 339 >UniRef50_A0QHG8 Tryptophan synthase alpha chain n=15 Tax=Actinobacteria (class) RepID=TRPA_MYCA1 Length = 271 Score = 42.9 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 62/199 (31%), Gaps = 24/199 (12%) Query: 49 LQNGGVDAVMFSNEFSLPYLTKVRP-ETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFD 107 + G+D + VR +T AA+ I + + + VL V +F Sbjct: 61 YSDPGMDGPTIARATEAALHGGVRVRDTLAAVEAISAAGGRAVVMTYWNPVLRYGVDAFA 120 Query: 108 --LAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYL 165 LA A G I D G+ + R G + + Sbjct: 121 RDLAAAGGYGLITPDL----IPDE------AGQWLAASERHGLDRIFLVA-------PSS 163 Query: 166 GNRDICSIAKSTVFNNHPDA-LCVSGLTAGTR-TDSALLKRVKETVPDTVVLANTGVCL- 222 + + +++ + + + V+G L+ RV+E D V GV Sbjct: 164 TPQRLALTVEASRGFVYAASTMGVTGARDAVSHAAPELVSRVREI-SDIPVGVGLGVRSR 222 Query: 223 ENVEEQLSIADGCVTATTF 241 E + ADG + + Sbjct: 223 EQAAQIGGYADGVIVGSAL 241 >UniRef50_P51382 Tryptophan synthase alpha chain n=1 Tax=Porphyra purpurea RepID=TRPA_PORPU Length = 273 Score = 42.6 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 82/261 (31%), Gaps = 43/261 (16%) Query: 1 MSWLKEVIGT---EKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAV 57 M+ + V + A+I GDP + L L G D + Sbjct: 11 MTTISSVFKKLDKQCALIPFI----TAGDPDLVSTS----------KALKILDQHGADII 56 Query: 58 MFSNEFSLPYLTKV--RPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAK 115 E LPY + P AA +R + Q ++ I VN+ + +A Sbjct: 57 ----ELGLPYSDPLADGPIIQAASSRALKQSINLNNILDMVNITSSNI----VAPIVLFT 108 Query: 116 FIREIF------TGAYASDFGVWDTNVGETIRHQHR-----IGAGEVKTLFNIVPEAAVY 164 + + + S G+ + + + ++ +F + P +++ Sbjct: 109 YYNPVLNLGINNFISAISRAGIKGLLIPDLPIEESDYIISVCKLFNIELIFLLSPTSSIE 168 Query: 165 LGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLE 223 N+ I A ++ V+G L + + ++ G+ E Sbjct: 169 RINK-IVEQAPGCIYLVSTTG--VTGQKPELTGKLKRLTETIKKMTQKPIILGFGISTAE 225 Query: 224 NVEEQLS-IADGCVTATTFKK 243 ++E +G V + F K Sbjct: 226 QIKEIKGWNINGIVIGSAFVK 246 >UniRef50_C2CM78 L-lactate dehydrogenase n=2 Tax=Actinomycetales RepID=C2CM78_CORST Length = 419 Score = 42.6 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 10/137 (7%) Query: 29 FDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMS 88 +D ++ + +++ A D + G D V+ S+ T A+ I +L Sbjct: 270 WDGKMLVKGIVNPA--DARTVIELGADGVVVSSHGGR--QLDRVVNTLRALEAIRAELGP 325 Query: 89 DIRIPFGVNVLWDPVASFDLAMATGAKFI---REIFTGAYA-SDFGVWDTNVGETIRHQH 144 D I + ++ +A+A GA F+ R G A GV T + Sbjct: 326 DAEIVYDSGIMSGTD--IAIALALGANFVLIGRAYLYGLMAGGREGVDRIIELLTSELET 383 Query: 145 RIGAGEVKTLFNIVPEA 161 V ++ ++ E Sbjct: 384 ACTLLGVSSVRDLKREH 400 >UniRef50_C7Q1I4 FMN-dependent alpha-hydroxy acid dehydrogenase n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q1I4_CATAD Length = 440 Score = 42.6 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 29/100 (29%), Gaps = 3/100 (3%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 + + G+DAV+ SN P T + I + + + Sbjct: 275 VEQAREAVDHGLDAVIVSNHGGR--QLDRLPATIDVLPEIADAVGDRVEVLVDSGFRSGG 332 Query: 103 VASFDLAMATGAKFI-REIFTGAYASDFGVWDTNVGETIR 141 + LA+ A + R G A+ V R Sbjct: 333 DIATALALGAKAVLVGRAHLYGLAAAGEAGVRHCVDILAR 372 >UniRef50_D1JG56 Tryptophan synthase alpha chain n=1 Tax=uncultured archaeon RepID=D1JG56_9ARCH Length = 274 Score = 42.6 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 19/44 (43%), Gaps = 2/44 (4%) Query: 199 SALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTF 241 L+ ++ +P + G+ E+V +ADG + + F Sbjct: 195 IKDLELARD-LPQIPLAVGFGISKPEHVRAVCEVADGAIVGSAF 237 >UniRef50_B1YEC8 CutC family protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YEC8_EXIS2 Length = 221 Score = 42.6 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 45/191 (23%), Positives = 71/191 (37%), Gaps = 28/191 (14%) Query: 82 IIGQLMSDIRIPFGVNVLWDPVASFDLAMA------TGAKFIRE-----IFTGAYASDFG 130 +I Q++S + IP V V SF + A T IRE I G+ +D Sbjct: 40 LIRQVLSSVEIPVHVLV-RPHSKSFVYSKADQETIITDIDLIRELGAAGIVVGSLTADGR 98 Query: 131 VWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSG 190 V + +G I+H+ + +RDI A+ D + SG Sbjct: 99 VDEGFLGRIIKHKGDLSLT----------FHRAIDSSRDILEAAEVLADFPEVDRILTSG 148 Query: 191 LTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADG--CVTATTFKKDGVFA 248 A A++ ++ E PD +VL +G+ +E EE L + DG Sbjct: 149 GHATALEGQAVIAQLIEQNPDLIVLPGSGITVERAEELLKATQATELHVGSAVLVDGQ-- 206 Query: 249 NFVDQARVSQF 259 VD +V+ Sbjct: 207 --VDADKVAAL 215 >UniRef50_B6JXZ5 Anthranilate synthase component 2 n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JXZ5_SCHJY Length = 747 Score = 42.6 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 28/85 (32%), Gaps = 6/85 (7%) Query: 182 HPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIADGCVTATTF 241 DA V G + G T K + + G+ ENV+E + + V Sbjct: 669 LIDAF-VGGQSGGLGTTVDWEAAAK---LNVPFILAGGLNPENVKEAIQKTNAAVV--DV 722 Query: 242 KKDGVFANFVDQARVSQFMEKVHHI 266 D A++ F++ + Sbjct: 723 SSGVETNGEQDLAKIKAFVKNAKGL 747 >UniRef50_A5D1S2 Tryptophan synthase alpha chain n=2 Tax=Peptococcaceae RepID=A5D1S2_PELTS Length = 273 Score = 42.6 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 63/220 (28%), Gaps = 35/220 (15%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMAR---------------IIGQLM 87 + + + G D + E +P+ + MA + ++ Sbjct: 37 VEIVRHVAEAGADLI----ELGIPFSDPIADGPVIQMASARALAAGATLPGILEAVREIK 92 Query: 88 SDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVG--ETIRHQHR 145 P + ++P+ F + A + A G+ ++ ET + Sbjct: 93 KVCTKPLLLMGYYNPIYRFGIRQFAAA--------ASVAGVDGLIVPDLPYEETRPLREA 144 Query: 146 IGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL-CVSGLTAGTRTDSALLKR 204 + ++ + A +R + IA + ++ V+G TD A Sbjct: 145 ATEKGMDLIYLV----APVTPDRRLMKIAAEASGFIYCISVTGVTGARREIDTDLAAFTG 200 Query: 205 VKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKK 243 + G+ E + + D V + K Sbjct: 201 RVRRYTALPLALGFGISGPEQALKASAYCDAVVVGSALVK 240 >UniRef50_B8F7P2 Thiamine-phosphate pyrophosphorylase n=1 Tax=Haemophilus parasuis SH0165 RepID=B8F7P2_HAEPS Length = 220 Score = 42.6 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 59/165 (35%), Gaps = 25/165 (15%) Query: 78 AMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVG 137 +AR L +PF VL + V +A+ GA G + + Sbjct: 63 QLARDCQALCRQYGVPF---VLNNDVE---MAVKLGAD-------GVHIGQKDMAAIQAI 109 Query: 138 ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRT 197 + + + +G +L ++ + L D ++ + DA+ Sbjct: 110 QLTQGKLFLGISN-SSLNDLQ--NSFRLPEIDYWAVGAIFNTQSKLDAM--------QNV 158 Query: 198 DSALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTF 241 L+++ K+ P+ ++A G+ ++NV + ADG + Sbjct: 159 GIDLIRQAKQLNPNQPLVAIGGISVDNVASIWAAGADGVAVISAI 203 >UniRef50_B0K8T5 N-(5'-phosphoribosyl)anthranilate isomerase n=11 Tax=Clostridia RepID=TRPF_THEP3 Length = 203 Score = 42.2 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 6/67 (8%) Query: 203 KRVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFM 260 K +K + ++ G+ NVEE + I D ++ + +G D ++ F+ Sbjct: 141 KVLKGMELNVPIILAGGLNENNVEEAIKIVDPYAVDVSSGVETEGYK----DFKKLKSFI 196 Query: 261 EKVHHIR 267 EKV IR Sbjct: 197 EKVRGIR 203 >UniRef50_A0YKS6 Tryptophan synthase alpha chain n=3 Tax=cellular organisms RepID=A0YKS6_9CYAN Length = 269 Score = 42.2 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 72/225 (32%), Gaps = 36/225 (16%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 ++ L L G D + E +PY + A L + + V Sbjct: 34 LETTAKALQILDRSGADLI----ELGIPYSDPLADGPVIQ-AAATRALAKGVTFDQVLEV 88 Query: 99 LWDPVASFDLAMATGAKF--------IREIFT-GAYASDFGVWDTNVGETIRHQHRIGAG 149 + D V + A + IR + T A A G+ + + Sbjct: 89 VSDVVPTLK-APLILFSYYNPILYRGIRAVLTRFADAGVKGLVVPD----------LPLE 137 Query: 150 EVKTLFNIVPEAAVYLG--------NRDICSIA-KSTVFNNHPDALCVSGLTAGTRTDSA 200 E +TL +I E + + + I +IA KS F V+G+ + ++ Sbjct: 138 ESQTLLDIAAEVGIEVILLVAPTSRSERIQAIAQKSQGFIYLVSVTGVTGMRSQVQSRVQ 197 Query: 201 LLKRVKETVPDTVVLANTGVC-LENVEEQLSI-ADGCVTATTFKK 243 L + D + G+ + ++ AD + + F + Sbjct: 198 TLIGDLKASTDKPIGVGFGISEPKQAQQIAQWGADAVIVGSAFVR 242 >UniRef50_Q5LBZ2 Tryptophan synthase alpha chain n=30 Tax=cellular organisms RepID=TRPA_BACFN Length = 263 Score = 42.2 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 76/266 (28%), Gaps = 57/266 (21%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ + ++ + + + A G P+ + + + L+ GV+ + Sbjct: 1 MNRINQLFDSNPRDLLSIYFCA--GYPTLEGTTEV----------IRTLEKHGVNMIEIG 48 Query: 61 NEFSLPYLTKVRPETTAAMAR-----------IIGQLMSDIRIPFGVNVLWDPVASFDLA 109 FS P + + A A + + D++IP + +P+ F Sbjct: 49 IPFSDPMADGMVIQNAATQALRNGMSLRLLFEQLHDIRRDVKIPLILMGYLNPIMQF--- 105 Query: 110 MATGAK-FIR----EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVY 164 G F R G D D + +VK + I PE + Sbjct: 106 ---GFDNFCRQCAECGIDGVIIPDLPFKDYQEH----FRTIAERYDVKVIMLITPETS-- 156 Query: 165 LGNRDICSIAKSTVFNNHPDALCV---SGLTAGTRTDSA-----LLKRVKETVPDTVVLA 216 + H D S T G + D K++++ + Sbjct: 157 ------EERVREIDE--HTDGFIYMVSSAATTGAQQDFDGQKRAYFKKIEKMNLRNPRMV 208 Query: 217 NTGV-CLENVEEQLSIADGCVTATTF 241 G+ A G + + F Sbjct: 209 GFGISNEATFRAACENASGAIIGSRF 234 >UniRef50_A3K9S1 L-lactate dehydrogenase n=2 Tax=Rhodobacteraceae RepID=A3K9S1_9RHOB Length = 391 Score = 42.2 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 32/121 (26%), Gaps = 8/121 (6%) Query: 45 DLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVA 104 D G D ++ SN P T + I + +++ + Sbjct: 262 DAARCTEAGCDGIVVSNHGGRQVAF--GPATADVLPAIAEAVAGRMKVIVDSGIRRGAD- 318 Query: 105 SFDLAMATGAKFIREI--FTGAYASDFGVWDTNVGETIRHQ--HRIGAGEVKTLFNIVPE 160 A A GA F + E + + +G V ++ PE Sbjct: 319 -MMRAKALGADFTLTGRALAFGVGAGGAPGAARAVEILELELVRALGQLGVPRFADVGPE 377 Query: 161 A 161 Sbjct: 378 H 378 >UniRef50_Q7UMC3 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Rhodopirellula baltica RepID=Q7UMC3_RHOBA Length = 245 Score = 42.2 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 78/229 (34%), Gaps = 24/229 (10%) Query: 41 KAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIP-FGVNVL 99 ++ D+ ++ + G DAV N + P + + P+ A I ++ ++ + G+ V Sbjct: 30 RSVQDVRSVADAGADAVGL-NFYE-PSVRSLNPD--AEETIRINEVAREVGLTRVGLFVN 85 Query: 100 WDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 D +A + +I+ + + + G ++ + +++ Sbjct: 86 HDLEFIQRVAGSLQLDWIQLHGDEPVSLAEDLVRAGQSILRAIRLPRGKLKLGQIDDVIG 145 Query: 160 EAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVV--LAN 217 + + A + G D ++ + D+ + Sbjct: 146 KWNE--VDVSYLLDADAGASFGGG----------GQPLDWPSIRAWADRRGDSAAGWVLA 193 Query: 218 TGVCLENVEEQLSI--ADGCVTATTFKKDGVFANFVDQARVSQFMEKVH 264 G+ ENV E + + A G A+ ++ + ++ QF+ V Sbjct: 194 GGLNPENVREAIQVSGATGVDVASGVEQ---PKGRKNAEKIRQFVAAVQ 239 >UniRef50_D1AFY8 Thiamine-phosphate pyrophosphorylase n=3 Tax=Bacteria RepID=D1AFY8_SEBTE Length = 197 Score = 41.8 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 58/206 (28%), Gaps = 37/206 (17%) Query: 38 VIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVN 97 ++ A + GGV V + MA I + +P +N Sbjct: 11 TLETAEKKVTEAIEGGVTIVQL-------RAKDISAAEFYNMALKIKMVTGYYNVPLIIN 63 Query: 98 VLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNI 157 D+A+A A G + + + IG ++ + Sbjct: 64 ------DRVDIAIAADAD-------GVHTGQEDLPAG------EVRKLIGMNKIMGI--- 101 Query: 158 VPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTR-TDSALLKRVKETVPDTVVLA 216 + + + A+ + + + L +K+ V V+A Sbjct: 102 -----SVSNTSEAEKAERESADYLGAGAMFPTDTKLDAKYVNLEELGEIKKKV-QIPVVA 155 Query: 217 NTGVCLENVEE-QLSIADGCVTATTF 241 G+ EN + + ADG + Sbjct: 156 IGGINAENAGQLFCAGADGIAVVSAI 181 >UniRef50_C0GDL2 Tryptophan synthase alpha chain n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GDL2_9FIRM Length = 280 Score = 41.8 bits (97), Expect = 0.023, Method: Composition-based stats. Identities = 6/80 (7%), Positives = 16/80 (20%), Gaps = 2/80 (2%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKD-G 245 V+G L + + G+ + D + + + Sbjct: 181 VTGKRTAMDPAVERLLQRAAKFTAVPLALGFGISDPGQAKIAAQSGDCVIVGSALVQKIA 240 Query: 246 VFANFVDQARVSQFMEKVHH 265 F+ + Sbjct: 241 EANGEAKYDVAGSFIRSLKS 260 >UniRef50_Q0RGR2 Tryptophan synthase alpha chain n=1 Tax=Frankia alni ACN14a RepID=Q0RGR2_FRAAA Length = 264 Score = 41.8 bits (97), Expect = 0.024, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 29/88 (32%), Gaps = 10/88 (11%) Query: 188 VSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKKD-- 244 V+G+ + VL GV E S+ADG V A+ + Sbjct: 164 VTGVRDTVSDSIVGTVQGVRDATMMPVLVGFGVSTPEQAATVASVADGVVVASALMRSIL 223 Query: 245 -----GVFANFVDQARVSQFMEKVHHIR 267 G VD RV+ ++ +R Sbjct: 224 DGAPAGGIRREVDALRVA--VDSAAAVR 249 >UniRef50_O67502 Tryptophan synthase alpha chain n=5 Tax=Aquificales RepID=TRPA_AQUAE Length = 262 Score = 41.8 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 10/88 (11%) Query: 186 LCVSGLTAGTRTDS------ALLKRVKETVPDTVVLANTGVCL-ENVEEQLSIADGCVTA 238 + V+G G R ++ +E D V+ GV E+ E S ADG V Sbjct: 177 VSVTGT-TGAREKLPYERIKKKVEEYRELC-DKPVVVGFGVSKKEHAREIGSFADGVVVG 234 Query: 239 TTF-KKDGVFANFVDQARVSQFMEKVHH 265 + K G V + E + Sbjct: 235 SALVKLAGQKKIEDLGNLVKELKEGLRE 262 >UniRef50_C2E9Z7 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2E9Z7_9LACO Length = 210 Score = 41.8 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 41/124 (33%), Gaps = 12/124 (9%) Query: 147 GAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNH--PDALCV-SGLTAGTRTDSALLK 203 G + + + + + I A + D L +G G D +LK Sbjct: 83 GKESESYIEELKRKTSKPVIKAVIPKNATDVEYFCKTKADYLLFDAGKGEGRAFDWEILK 142 Query: 204 RVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFME 261 V+ + G+ EN+ L D ++ + DG D A++ + E Sbjct: 143 NVRSSRKY---FLAGGLNEENIGSALDALDPYAIDISSGVESDGKK----DYAKIKRITE 195 Query: 262 KVHH 265 + + Sbjct: 196 FLKN 199 >UniRef50_B8DET0 Thiamine-phosphate pyrophosphorylase n=27 Tax=Bacillales RepID=THIE_LISMH Length = 214 Score = 41.8 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 57/176 (32%), Gaps = 29/176 (16%) Query: 79 MARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYAS--DFGVWDTNV 136 MA QL + ++PF +N LA+ GA G + D G+ Sbjct: 55 MALECQQLCAKYQVPFIIN------DDVALALEIGAD-------GIHVGQNDEGIRQVIA 101 Query: 137 GETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTR 196 + + + V A LG D + + DA VSG Sbjct: 102 SCAGKMKIGLSVHSVS-----EAAEAERLGAVDYIGVGPIFPTISKADAEPVSGTA---- 152 Query: 197 TDSALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTFKKDGVFANFV 251 +L+ ++ ++ G+ +N E L+ ADG + + + + Sbjct: 153 ----ILEEIRRAGIKLPIVGIGGINEKNSAEVLTAGADGVSVISAITRSDDCYSVI 204 >UniRef50_Q1PX46 Tryptophan synthase alpha chain n=2 Tax=Planctomycetales RepID=Q1PX46_9BACT Length = 280 Score = 41.8 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 8/71 (11%), Positives = 24/71 (33%), Gaps = 1/71 (1%) Query: 175 KSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIAD 233 ++ F + + ++G D + + ++ GV + IA+ Sbjct: 171 RTQGFLYYISVVGITGARDMLPDDIVQNINKLKQLTSIPIVVGFGVSTAKQASMVGKIAE 230 Query: 234 GCVTATTFKKD 244 G + + ++ Sbjct: 231 GVIVGSAIVRE 241 >UniRef50_C4KZE0 Thiamine-phosphate pyrophosphorylase n=1 Tax=Exiguobacterium sp. AT1b RepID=C4KZE0_EXISA Length = 213 Score = 41.4 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 57/165 (34%), Gaps = 27/165 (16%) Query: 78 AMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVG 137 AMAR + +L + +PF VN DLA+A A G + + V Sbjct: 59 AMARTLHRLCREAGVPFIVN------DDVDLALAIDAD-------GIHVGQDDLPAKQVR 105 Query: 138 ETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRT 197 E + +G V A+ D I + DA V G Sbjct: 106 ERLGPDKWLG---VSVHTMEEVRTALPFA--DYVGIGPIHETQSKLDAGQVRGT------ 154 Query: 198 DSALLKRVKETVPDTVVLANTGVCLENVEEQLSI-ADGCVTATTF 241 L+++V+ P ++ G+ E+V ++ ADG + Sbjct: 155 --ELIQQVRRHHPLLPIVGIGGIRPEHVPAIMADGADGVAVISAI 197 >UniRef50_C9KJ79 Tryptophan synthase alpha chain n=2 Tax=Veillonellaceae RepID=C9KJ79_9FIRM Length = 267 Score = 41.4 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 28/225 (12%), Positives = 59/225 (26%), Gaps = 44/225 (19%) Query: 41 KAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARII---------------GQ 85 KA + + G D + E LP+ + A + + Sbjct: 36 KAVREA---EKAGADVI----ELGLPFSDPMADGPVIQSASVCALKNGMTLKKELEIVRE 88 Query: 86 LMSDIRIP-FGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVG--ETIRH 142 + IP G+ + + F + + A G+ +V E+ Sbjct: 89 IRKFSDIPLIGMGYINNM---------YHYGFEKFVTDFKAAGMDGIIVPDVPHEESGEM 139 Query: 143 QHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHP---DALCVSGLTAGTRTDS 199 + A + I P + K + + V+G+ + Sbjct: 140 RKICAAHDFHLAEFITP----GTTEARMTETCKDATGFIYCVSNNG--VTGVKKIDYSTI 193 Query: 200 ALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 + DT + G+ E + +D + + K Sbjct: 194 GKVCEKARKFTDTPLAVGFGIGSPEAAVAAAAKSDAVIVGSAVVK 238 >UniRef50_A9VWE9 Tryptophan synthase alpha chain n=29 Tax=Proteobacteria RepID=TRPA_METEP Length = 280 Score = 41.4 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 12/71 (16%), Positives = 20/71 (28%), Gaps = 1/71 (1%) Query: 174 AKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIA 232 A + F + V+G + + V+ GV + E A Sbjct: 169 ANTAGFVYYVSITGVTGTATPDFGRVSQAVSRITAHTNLPVVVGFGVKTGAHAAEIARGA 228 Query: 233 DGCVTATTFKK 243 DG V + Sbjct: 229 DGVVVGSALVD 239 >UniRef50_B2V7Q4 N-(5'-phosphoribosyl)anthranilate isomerase n=4 Tax=Hydrogenothermaceae RepID=TRPF_SULSY Length = 203 Score = 41.4 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 67/222 (30%), Gaps = 31/222 (13%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 + G D + P V I ++ ++ V V+ +P Sbjct: 12 SQAREISEYGADYIGVITYPKSPRYVDVER---------IKEIKEKLKNSKLVAVVVNPS 62 Query: 104 ASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAV 163 L + A FI + D G+ R I L I Sbjct: 63 LEQVLELLNIADFI------QFHGDEGLDFVKNFPKDRVIKAIRVKNESDLEKIKT---- 112 Query: 164 YLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLE 223 N DI + + + G D LLK++ + V+ + G+ Sbjct: 113 -FKNEDITVLVDAFKEGVYG--------GTGEMIDLNLLKKITDMYDK--VIISGGLSES 161 Query: 224 NVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 N++E L+ + K + V D +V +F++ V + Sbjct: 162 NIKEILNHVKPYGVDASSKLE-VSPGVKDLDKVKKFIDIVKN 202 >UniRef50_B6K1N0 Lactate 2-monooxygenase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1N0_SCHJY Length = 405 Score = 41.4 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 9/83 (10%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPF------GV 96 DD G+D ++ SN + + A+ I+ + + + F GV Sbjct: 282 VDDAKKAVEYGLDGIIVSNHGGRQF--DGGIGSIEALEPIVDAVGDKLTVLFDSGVRSGV 339 Query: 97 NVLWDPVASFDLAMATGAKFIRE 119 +V+ +A A+ G F+ Sbjct: 340 DVMR-ALALGAKAVLIGRPFLWG 361 >UniRef50_B7JUK2 Tryptophan synthase alpha chain n=17 Tax=cellular organisms RepID=TRPA_CYAP8 Length = 267 Score = 41.4 bits (96), Expect = 0.030, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 70/227 (30%), Gaps = 40/227 (17%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKV--RPETTAAMAR-------------II 83 ++ L L G D + E +PY + P AA R ++ Sbjct: 30 LETTAKALRLLDASGADLI----ELGVPYSDPLADGPVIQAAATRALGRGVKLEDVLGVV 85 Query: 84 GQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGA----YASDFGVWDTNVGET 139 ++ +I+ P + ++P+ F R + A G+ ++ Sbjct: 86 KEVSPEIKAPIILFTYYNPI------------FYRGVEAFLQQVKAAGVQGLVVPDLP-L 132 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV-FNNHPDALCVSGLTAGTRTD 198 + + + + A I +IA+ + F V+G+ + + Sbjct: 133 EEAESLLKPAHEVGIA-VTLLVAPTSPIERIEAIARQSQGFIYLVSVTGVTGMRSQVTSR 191 Query: 199 SALLKRVKETVPDTVVLANTGVC-LENVEEQLSI-ADGCVTATTFKK 243 L + D + G+ E+ + + AD + + K Sbjct: 192 VKELLTSLRSATDKPIGVGFGISKPEHALQVKNWGADAVIVGSAMVK 238 >UniRef50_A7RW57 Predicted protein n=9 Tax=Eukaryota RepID=A7RW57_NEMVE Length = 379 Score = 41.4 bits (96), Expect = 0.031, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 41/129 (31%), Gaps = 8/129 (6%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPF--GVNVLWD 101 +D G+D ++ SN T A+ I+ + + + GV + D Sbjct: 252 EDARLAVEHGIDGIIVSNHGGR--QLDGVQATIDALPDIVKAVQGKLEVYMDGGVRLGTD 309 Query: 102 PVASFDLAMATGAKFI-REIFTG-AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 + L A F+ R + G AY + GV + + +L ++ Sbjct: 310 VFKALALG--ARAVFVGRPVIWGLAYKGEEGVRQVLELLREELRLAMILSGCGSLDDVTS 367 Query: 160 EAAVYLGNR 168 + Sbjct: 368 SYVIPANQS 376 >UniRef50_A9B6M7 Tryptophan synthase alpha chain n=8 Tax=Chloroflexi RepID=TRPA_HERA2 Length = 271 Score = 41.4 bits (96), Expect = 0.032, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 70/241 (29%), Gaps = 24/241 (9%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 D A AL GG D M +P+ + T I L + + I F + + Sbjct: 31 DSALSLAQALVAGGAD--MLE--LGMPFSDPLADGATIQRTTDI-ALTNGVDIGFCLETV 85 Query: 100 WDPVASFDLAMATGAKFIREIFTGAY---------ASDFGVWDTN--VGETIRHQHRIGA 148 A+ + +F A G + E A Sbjct: 86 RQLRATGMSIPLLLMGYFNPMFQYGVERFVAEAKAAGADGFIVPDLPPEEADEFHAAAKA 145 Query: 149 GEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDAL-CVSGLTAGTRTDSALLKRVKE 207 E+ ++V A + I IA + + AL V+G A D Sbjct: 146 HEL----DLVFLLAPTSTDARIAKIASLSSGFIYCVALRGVTGARAALADDLGDFLARVR 201 Query: 208 TVPDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 G+ E+V +A+G + A+ N + R + + V + Sbjct: 202 QYSQLPRAVGFGISKPEHVAAVAKMAEGAICASALLD--YIGNLPAEERATGAQQFVQSL 259 Query: 267 R 267 R Sbjct: 260 R 260 >UniRef50_C9SFI6 Copper homeostasis protein cutC n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SFI6_VERA1 Length = 309 Score = 41.4 bits (96), Expect = 0.033, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 23/64 (35%), Gaps = 1/64 (1%) Query: 171 CSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRV-KETVPDTVVLANTGVCLENVEEQL 229 A +V DA+ SG + +L+ V ++ ++ GV N+ L Sbjct: 212 VEQAIRSVAACGFDAILTSGGPGRAPANIEILRVVTRKAQGRLTIIIGGGVRSGNIGSVL 271 Query: 230 SIAD 233 D Sbjct: 272 GHLD 275 >UniRef50_Q2SS12 Copper homeostasis protein CutC, putative n=3 Tax=Mycoplasma mycoides group RepID=Q2SS12_MYCCT Length = 227 Score = 41.4 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 37/86 (43%), Gaps = 1/86 (1%) Query: 174 AKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD 233 A + + + DA+ SG T ++K++ + D +L GV N+++ L++ + Sbjct: 133 ALNVLAKHKIDAVLTSG-GTNINTGLEVIKQLVDLNLDIEILIGGGVDKNNIKQCLTVNN 191 Query: 234 GCVTATTFKKDGVFANFVDQARVSQF 259 + + + + + ++ F Sbjct: 192 HIHLGRAIRNNSSWNSDILVDEINIF 217 >UniRef50_C0Z6B9 Putative oxidoreductase n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z6B9_BREBN Length = 381 Score = 41.4 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 33/116 (28%), Gaps = 4/116 (3%) Query: 45 DLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVA 104 D GVD ++ SN T A+ I + I + V Sbjct: 261 DARLALEHGVDGIIVSNHGGR--QMDGAISTLDALPAIAEVIAGKIPLLLDSGVRTGADV 318 Query: 105 SFDLAMATGAKFI-REIFTG-AYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIV 158 +A+ A I R G A A + GV + ++ ++ Sbjct: 319 VKAIALGANAILIGRPFLYGLAVAGEQGVTSVLDTLIHEFDVAMALSGSNSIADLN 374 >UniRef50_Q7VGK6 Triosephosphate isomerase n=3 Tax=Helicobacter RepID=TPIS_HELHP Length = 235 Score = 41.4 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 8/54 (14%), Positives = 14/54 (25%), Gaps = 5/54 (9%) Query: 192 TAGTRTDSALLKRVK---ETVPDTVVLANTGVCLENVEEQL--SIADGCVTATT 240 G ++ T +L V N E L +G + + Sbjct: 167 GTGESATLEQIESTHSMLATFTSAPLLYGGSVNPTNAREILCTPYVNGVLVGSA 220 >UniRef50_Q0TRL8 Dihydropteroate synthase n=16 Tax=Clostridiales RepID=Q0TRL8_CLOP1 Length = 269 Score = 41.0 bits (95), Expect = 0.040, Method: Composition-based stats. Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 3/99 (3%) Query: 26 DPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVR-PETTAAMARIIG 84 P + G I+ A + N GVD + E + P T V E + +I Sbjct: 23 TPDSFSDGGKYNDIELALKRAEKMINDGVDIIDIGGESTRPTHTPVGEEEELNRVVPVIK 82 Query: 85 QLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTG 123 L IP V+ VA + A+ GA I +++ Sbjct: 83 ALREKFDIPISVDTYKGKVA--EEAIKAGADLINDVWGF 119 >UniRef50_B8MZR1 Trytophan synthase alpha subunit, putative n=2 Tax=Leotiomyceta RepID=B8MZR1_ASPFN Length = 641 Score = 41.0 bits (95), Expect = 0.040, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 2/54 (3%) Query: 188 VSG-LTAGTRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTAT 239 V+G AL++RVK + V G+ E+ SIADG V + Sbjct: 182 VTGATGTLNTELPALIQRVKSLSGNVPVAVGFGISTREHFLSVTSIADGAVIGS 235 >UniRef50_Q9YGA9 Tryptophan synthase alpha chain n=5 Tax=Euryarchaeota RepID=TRPA_PYRKO Length = 251 Score = 41.0 bits (95), Expect = 0.040, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 65/188 (34%), Gaps = 28/188 (14%) Query: 81 RIIGQLMSDIRIPFGVNVLWDPVASFDL------AMATGAKFIREIFTGAYASDFGVWDT 134 RI+ + P + ++PV + A A+GA G D + + Sbjct: 70 RILREFRRHSSTPVILMTYYNPVFRTGVKKFLGEAKASGAD-------GILVVD--LPVS 120 Query: 135 NVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-VSGLTA 193 + GE + G V AA + + I K++ + +L +G Sbjct: 121 HAGEFLDAAKEEGLKTV-------FLAAPNTPDERLREIDKASTGFVYLISLYGTTGARD 173 Query: 194 GTRTDS-ALLKRVKETVPDTVVLANTGVCL-ENVEEQLSI-ADGCVTATT-FKKDGVFAN 249 + ++R ++ + + GV E VEE L ADG V + + N Sbjct: 174 RLPETAFEFVRRARKICNNK-LAVGFGVSRREQVEELLKAGADGVVVGSALIELISRSEN 232 Query: 250 FVDQARVS 257 V++ R Sbjct: 233 PVEELRRK 240 >UniRef50_C6A2A3 Geranylgeranylglyceryl phosphate synthase n=1 Tax=Thermococcus sibiricus MM 739 RepID=GGGPS_THESM Length = 255 Score = 41.0 bits (95), Expect = 0.040, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 77/238 (32%), Gaps = 32/238 (13%) Query: 42 AWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWD 101 A + GVDA+M S +V + ++ + + +P + Sbjct: 35 ASKLAKISEEVGVDAIMVG--GSTGAEGEV-------LDGVVKAIKENSSLPVILF---- 81 Query: 102 PVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQ-HRIGAGEVKTLFNIVP- 159 P + L+ A F F S + + G + + +V Sbjct: 82 PGSHGGLSKYADAVF----FMSLLNSRNPFFIAGAQALGAFRVKHYGIEPIPMAYLVVEP 137 Query: 160 -EAAVYLGNRDICSIAKSTVFNNHPDA-------LCVSGLTAGTRTD-SALLKRVKETVP 210 E A ++ + ++ K + + A L +G + +V ++ Sbjct: 138 GETAGWVSDANLIPRHKPKIAAAYALAGQYMGMRLVYLEAGSGAPEHIPNEMIKVVKSAI 197 Query: 211 DTVVLANTGV-CLENVEEQLSI-ADGCVTATTFKKDGVFANFVDQARVSQFMEKVHHI 266 D ++ G+ E+ +E + AD VT T +K G + R+ + V + Sbjct: 198 DVPLIVGGGIRTYEDAKEVVQSGADIIVTGTAIEKAGSLEE--SKKRLESIINGVKEV 253 >UniRef50_C2W9E4 Copper homeostasis protein CutC n=4 Tax=Bacillus RepID=C2W9E4_BACCE Length = 282 Score = 41.0 bits (95), Expect = 0.040, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 4/64 (6%) Query: 206 KETVPDTVVLANTGVCLENVEEQLSIADGCV---TATTFKKDGVFANFVDQARVSQFMEK 262 KE+ + ++ +GV EN+ L G V T ++ +DQ V ++ Sbjct: 220 KESQGEIQLVVGSGVTKENITRLLHET-GIVEAHVGTAVREGKSCFAEIDQKAVQDLVKL 278 Query: 263 VHHI 266 + Sbjct: 279 TKAL 282 >UniRef50_A8ZVH6 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZVH6_DESOH Length = 260 Score = 41.0 bits (95), Expect = 0.041, Method: Composition-based stats. Identities = 17/110 (15%), Positives = 37/110 (33%), Gaps = 10/110 (9%) Query: 164 YLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLE 223 +L + + + + G +D A++ ++ + P V+ G+ E Sbjct: 142 HLCDWFLTDTSPTAREAAPVPGFV---GITGRPSDWAVVAKLVASTP-VPVILAGGMSPE 197 Query: 224 NVEEQLSIADGCVTATT--FKKDGVFANFV----DQARVSQFMEKVHHIR 267 NV + + + + N V D RV +F+E+ Sbjct: 198 NVYDGIVQTRPAGVDSCTQTNRRDPAGNPVRFSKDMDRVRRFVEETRRAE 247 >UniRef50_A6VPG9 Thiamine-phosphate pyrophosphorylase n=5 Tax=Pasteurellaceae RepID=THIE_ACTSZ Length = 221 Score = 41.0 bits (95), Expect = 0.044, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 73/259 (28%), Gaps = 53/259 (20%) Query: 1 MSWLKEVIGTEKAVIAMCHLRALPGDPSFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFS 60 M+ +K ++ + R LPG P ++ + L G+ F Sbjct: 1 MNKIKSMLSVYF-IAGSQDCRHLPGSP-----------VENLLNILQQALEAGITCFQFR 48 Query: 61 NEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREI 120 + P+ +A QL +PF +N LA+A A Sbjct: 49 EKGERSLAQN--PQLKHRLALQCQQLCRQFNVPFIIN------DDIGLALAIRAD----- 95 Query: 121 FTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFN 180 G + + ++ IG + + A++ Sbjct: 96 --GIHVGQKDTAVERILSRADYRPIIGL------------------SINTLEQAQANKER 135 Query: 181 NHPDA-----LCVSGLTAGTRTD--SALLKRVKETVPDTVVLANTGVCLENVEEQLSI-A 232 D + + A ++++ E D +A G+ +N E + A Sbjct: 136 LGIDYFGIGPIFATQSKADHAPAVGMEFIRQIHELGIDKPCVAIGGIHEDNTAEIRRLGA 195 Query: 233 DGCVTATTFKKDGVFANFV 251 +G + + A V Sbjct: 196 NGVAVISAITRSNDIARTV 214 >UniRef50_Q72EU7 Tryptophan synthase alpha chain n=12 Tax=Bacteria RepID=TRPA_DESVH Length = 257 Score = 41.0 bits (95), Expect = 0.046, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 68/234 (29%), Gaps = 24/234 (10%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 ++ WD+L AL G D + +P+ V A A L S + + + ++ L Sbjct: 33 ERFWDELEALDAAGADIIEV----GVPFSDPVADGPVVA-AASQRALESGVTLRWIMDGL 87 Query: 100 WDPVASFDLAMAT--------GAKFIREIFTGAYASDFGVWDTNVG--ETIRHQHRIGAG 149 + F R + A A G ++ E + + A Sbjct: 88 AARKGRLRAGLVLMGYLNPFMQYGFERFVSDAADAGVAGCIIPDLPLDEDADLRALLAAR 147 Query: 150 EVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETV 209 ++ + + P R A ++ + + +G+ G + A Sbjct: 148 DMDLIALVGPNTGE---GRMREYAAVASGYVYVVSVMGTTGVRDGLPVEVADTLARARQC 204 Query: 210 PDTVVLANTGVC-LENVEEQLSIADGCVTATTFKKDGVFANFVDQARVSQFMEK 262 V G+ +E D + + + + FM+ Sbjct: 205 FSIPVALGFGISRPAQLEGLSHPPDAVIFGSALLRHLDAGGD-----AASFMKA 253 >UniRef50_C9A792 Tryptophan synthase alpha chain n=3 Tax=Enterococcus casseliflavus RepID=C9A792_ENTCA Length = 258 Score = 40.6 bits (94), Expect = 0.049, Method: Composition-based stats. Identities = 25/227 (11%), Positives = 60/227 (26%), Gaps = 47/227 (20%) Query: 39 IDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNV 98 +++ ++ L G A+ +P+ V A G+ Sbjct: 32 LERLPAEIELLTTHGAAAIEI----GVPFSDPVADGAVIQAA--------------GMQA 73 Query: 99 LWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGE-TIRHQHRIGAGEVK--TLF 155 L + + A ++ I + G ++ + + + VK + Sbjct: 74 LANGTTLKKIIAA-----LQTIHSEVPLVLMGYANSFFHYGIEQLANELKTTNVKGLIIP 128 Query: 156 NIVPEAAVYL-------------------GNRDICSIAKSTVFNNHPDAL-CVSGLTAGT 195 ++ E + I ++ + + + V+G Sbjct: 129 DLPFEHRSLVTPTFDAADLALLTLVSLTSPPDRIQTLIEEAEGFVYAVTVNGVTGEDRQY 188 Query: 196 RTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 + VLA GV +V+ + +G V + Sbjct: 189 NEQLDQHLQAISEKSPIPVLAGFGVSTHADVQRFARVCEGVVIGSKI 235 >UniRef50_C9KJ76 N-(5'-phosphoribosyl)anthranilate isomerase n=3 Tax=Veillonellaceae RepID=C9KJ76_9FIRM Length = 233 Score = 40.6 bits (94), Expect = 0.050, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 57/191 (29%), Gaps = 31/191 (16%) Query: 80 ARIIGQLMSDIRIPFGVNVLWDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGET 139 ARI+ L GV V DP +A F+ + E Sbjct: 70 ARIVEALQRV--KTVGVFVDEDPALVNAIARQCHLDFV---------------QLHGHED 112 Query: 140 IRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDS 199 + + +I +K +A I +G + Sbjct: 113 VAYAKQIEVPVIKAYRYGDGFSAEAANAFPAAMILVDAYQKGAAGG---TGTCFDWQQAK 169 Query: 200 ALLKRVKETVPDTVVLANTGVCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVS 257 + V++ VL G+ NV E +I + + + + AR++ Sbjct: 170 REVAAVRK-----PVLIAGGISEANVAEVNTIFHPFAVDVSGSLEVNREK----SAARIA 220 Query: 258 QFMEKVHHIRR 268 FME+VH I R Sbjct: 221 AFMEQVHEINR 231 >UniRef50_D1JG54 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=uncultured archaeon RepID=D1JG54_9ARCH Length = 230 Score = 40.6 bits (94), Expect = 0.053, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 70/228 (30%), Gaps = 25/228 (10%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 +D + G DA+ + R A RI + + + + DP Sbjct: 19 IEDAILAAEAGADAI-----GVIYVANTKRYLDLGAATRIFDAVPIFVSKVVVLTLDNDP 73 Query: 103 VASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP--- 159 + S K R + TGA + E I +VK + + Sbjct: 74 IES------VQDKISRIVDTGADCIQLHGDEPV--ELIADLREFLNAQVKLIKKVGVGGT 125 Query: 160 EAAVYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTG 219 + S+ + + + D G D + K + E V V+ G Sbjct: 126 KKKCLENALAYESVVDALLLDTVTDGAI---GGTGKEHDWNISKEIVERVKK-PVILAGG 181 Query: 220 VCLENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + +NV ++ ++ +++ D +V++F+E Sbjct: 182 LNPDNVANAIAFVKPYAVDVSSGVEREVRIK---DAVKVNRFIEAAKS 226 >UniRef50_Q6BVL8 DEHA2C01584p n=4 Tax=Dikarya RepID=Q6BVL8_DEBHA Length = 378 Score = 40.6 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 8/128 (6%) Query: 44 DDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPV 103 +D N G D ++ SN T A+ ++ + + RIP ++ Sbjct: 240 EDAEMAVNAGADGIIVSNHGGR--QLDGALSTLDALPDVVAAV--NGRIPVHIDGGIRRG 295 Query: 104 ASFDLAMATGAKFIR----EIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVP 159 + A+A GA ++ AY + GV + + ++ +I P Sbjct: 296 SDIFKALALGADHCWVGRVAVWGLAYKGEEGVSIALNILHDEFRLVMALMGCTSVKDIKP 355 Query: 160 EAAVYLGN 167 E + + Sbjct: 356 EHLARMSS 363 >UniRef50_A5FY58 Tryptophan synthase alpha chain n=1 Tax=Acidiphilium cryptum JF-5 RepID=TRPA_ACICJ Length = 276 Score = 40.6 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 20/56 (35%), Gaps = 3/56 (5%) Query: 188 VSGLTAGTRTDSA-LLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTF 241 ++G + D A + RV++ D + GV AD V A+ Sbjct: 182 ITGTRTASAEDLARDIPRVRKA-TDMPIAVGFGVRTPAQAATVARFADAAVVASAL 236 >UniRef50_Q2RIT8 N-(5'-phosphoribosyl)anthranilate isomerase n=7 Tax=Bacteria RepID=TRPF_MOOTA Length = 223 Score = 40.6 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 41/228 (17%), Positives = 67/228 (29%), Gaps = 36/228 (15%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 W++ + + GVD + + R A II +L GV V Sbjct: 12 WEEARMVLDAGVDTL------GFVFARSPRAIKPEAAREIITKL-PPFTTTVGVFVNEPR 64 Query: 103 VASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAA 162 + ++A F R + D + G + R I + +L Sbjct: 65 YSLMEIA-----SFCRLDVLQLH-GDE-PPEYCHGLSQRLIKAIRVRDAASLA------- 110 Query: 163 VYLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCL 222 A V DA V G GT T V+ G+ Sbjct: 111 --------SLEAYREVQGFLLDA-WVPGKAGGTGTTFNWELVRGAATGGKPVILAGGLTP 161 Query: 223 ENVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHHIRR 268 ENV + + ++ + DG + AR++ F+E V Sbjct: 162 ENVGAAIQLVHPYAVDVSSGVEVDG----RKNPARIAAFLEAVRKAEE 205 >UniRef50_D0MQ90 Tryptophan synthase, putative n=2 Tax=Phytophthora infestans RepID=D0MQ90_PHYIN Length = 270 Score = 40.6 bits (94), Expect = 0.056, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 32/110 (29%), Gaps = 14/110 (12%) Query: 146 IGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTVFNNHPDALC-----------VSGLTAG 194 + E KTL + + L + S + D++ V+G Sbjct: 138 LPPEEAKTLSDDAAKHG--LAYIPLVSPTTTEERMKLIDSVAHGFVYCVSLTGVTGARNE 195 Query: 195 TRTDSALLKRVKETVPDTVVLANTGV-CLENVEEQLSIADGCVTATTFKK 243 + + G+ ++ + ++ADG V + K Sbjct: 196 LPPNLDAFMAKIRANVKHPLALGFGLSTRQHFVQASALADGVVIGSKIVK 245 >UniRef50_D2S4B7 (S)-2-hydroxy-acid oxidase n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S4B7_9ACTO Length = 427 Score = 40.6 bits (94), Expect = 0.056, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 35/137 (25%), Gaps = 8/137 (5%) Query: 28 SFDAQLGMNWVIDKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLM 87 + L + + A + G D V+ SN T + I + Sbjct: 280 RWRGDLLVKGITTPA--SAREVMEHGADGVVVSNHGGR--QLDRSAATLDVLPAIRSAVG 335 Query: 88 SDIRIPFGVNVLWDPVASFDLAMATGAKFI-REIFTGAYASDFGVWDTNVGETI--RHQH 144 + VL A+ A I R G A E + +Q Sbjct: 336 QQAPVLIDGGVLHGQDVVAARALGADAVMIGRAYLYGLMAGGQDGV-LRAYEILAEEYQR 394 Query: 145 RIGAGEVKTLFNIVPEA 161 I V+ ++ Sbjct: 395 SIQLLGVRRSEDLSDRH 411 >UniRef50_A0B8J3 Geranylgeranylglyceryl phosphate synthase n=13 Tax=Euryarchaeota RepID=GGGPS_METTP Length = 255 Score = 40.6 bits (94), Expect = 0.061, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 82/223 (36%), Gaps = 25/223 (11%) Query: 40 DKAWDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVL 99 ++A + + G A+M T +A+ R + + + +P + Sbjct: 38 ERAVQMARSAADAGTTALMV---------GGSVGATGSALDRTVRAIKDSVDLPVILF-- 86 Query: 100 WDPVASFDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTL----- 154 P ++ L A F + + ++ + + + +G I ++ I A + + Sbjct: 87 --PSSAAGLCDNADAVFFMSLL-NSRSTSYLIENQALGAPIVSRYGIEAIPMGYIVVEPG 143 Query: 155 FNIVPEAAVYLGNRDICSIAKSTVFNNHPDALCV----SGLTAGTRTDSALLKRVKETVP 210 + L R IA + + + +G A + ++++ V++ + Sbjct: 144 GTVGWVGDAKLVPRRKPDIAAAYALAGRYLGMRLIYLEAGSGAESPVPTSMVSAVRDAIG 203 Query: 211 DTVVLANTGVCLENVEEQL--SIADGCVTATTFKKDGVFANFV 251 DT+++ G+ +L + AD VT T ++ G FV Sbjct: 204 DTLLVVGGGIRDAEAARKLVSAGADLIVTGTGVEESGDVFRFV 246 >UniRef50_A4FLZ5 L-lactate dehydrogenase n=2 Tax=Actinomycetales RepID=A4FLZ5_SACEN Length = 404 Score = 40.2 bits (93), Expect = 0.064, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 28/100 (28%), Gaps = 4/100 (4%) Query: 43 WDDLMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDP 102 D + G DAV+ SN P + ++ + I +L Sbjct: 281 VPDARRVVELGADAVILSNHGGR--QLDRAPTMLELLPQVREAIGDRAEIMLDTGILSGA 338 Query: 103 VASFDLAMATGAKFI-REIFTGAYASDFGVWDTNVGETIR 141 LA+ + + R G A + +R Sbjct: 339 DIVAALALGADSCLVGRAYLYGLMAGGEQGVQ-RAVDILR 377 >UniRef50_Q3ABS2 N-(5'-phosphoribosyl)anthranilate isomerase n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3ABS2_CARHZ Length = 220 Score = 40.2 bits (93), Expect = 0.069, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 71/221 (32%), Gaps = 37/221 (16%) Query: 52 GGVDAV--MFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVASFDLA 109 G DA+ +F+N +V+PE R I +++ GV D +A Sbjct: 22 AGADAIGFVFANS-----PRQVKPEVV----REITEILPPFVATVGVVANMDVEDVAQIA 72 Query: 110 MATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRD 169 ++ V + GE+ + G + K I+ V + Sbjct: 73 VSCNLD---------------VVQLHGGESPEY---CGKLKEKIRAKIIKSIPVPIETDT 114 Query: 170 ICSIAKSTVFNNHPDALCV---SGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVE 226 + ++ + A SG T G + + ++ + G+ ENV Sbjct: 115 EELKRQIAIYEKYVHAFLFDTSSGNTFGGSGKTFNWQILQGLKIEKPWFLAGGLNPENVG 174 Query: 227 EQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 + L G ++ +K D R+ F++ V Sbjct: 175 KALQQVKPYGVDVSSGVEK---APGIKDYIRIEAFIQAVRR 212 >UniRef50_B1I3Z6 N-(5'-phosphoribosyl)anthranilate isomerase n=2 Tax=Clostridiales RepID=TRPF_DESAP Length = 204 Score = 40.2 bits (93), Expect = 0.071, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 68/224 (30%), Gaps = 40/224 (17%) Query: 46 LMALQNGGVDAVMFSNEFSLPYLTKVRPETTAAMARIIGQLMSDIRIPFGVNVLWDPVAS 105 + G DA+ P ++ P+T A II +L ++ + GV V DP Sbjct: 15 ARTAVDAGADAL---GFVFAPGRRRIAPDTARA---IIRRLPPEV-LTVGVFVDEDPETV 67 Query: 106 FDLAMATGAKFIREIFTGAYASDFGVWDTNVGETIRHQHRIGAGEVKTLF--NIVPEAAV 163 +A G G + E+ + +K N Sbjct: 68 QGIAAHCGL---------------GALQFHGRESPEYCRGFREKVIKAFGVGNASVRELE 112 Query: 164 YLGNRDICSIAKSTVFNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLE 223 + + T P G D L++ +K V+ G+ Sbjct: 113 RAEEYPVWCLLLDTFSPGRPGG-------TGRVFDWRLIESLK---FSRPVILAGGLNPG 162 Query: 224 NVEEQLSIAD--GCVTATTFKKDGVFANFVDQARVSQFMEKVHH 265 NV+ ++ G +T + G+ D A++ F++ Sbjct: 163 NVQAAIAAVRPYGVDVSTGVETGGLK----DPAKIRSFIKLARE 202 >UniRef50_Q93Q21 Tryptophan synthase alpha chain n=1 Tax=Nostoc punctiforme PCC 73102 RepID=Q93Q21_NOSP7 Length = 189 Score = 40.2 bits (93), Expect = 0.075, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 39/128 (30%), Gaps = 15/128 (11%) Query: 124 AYASDFGVWDTN-----VGETIRHQHRIGAGEVKTLFNIVPEAAVYLGNRDICSIAKSTV 178 A A G+ + + +G + + + I +IA S+ Sbjct: 35 AAAGVAGLVVPDLPLEEAAGLLEPAKEMGIDVILLVA-------PTSDAKRIEAIAHSSQ 87 Query: 179 -FNNHPDALCVSGLTAGTRTDSALLKRVKETVPDTVVLANTGVCLENVEEQLSI--ADGC 235 F V+G+ + + + L + V + + G+ Q+ AD Sbjct: 88 GFIYLVSVTGVTGVRSQLESRVSDLLKQIRGVTEKPIGVGFGISDAAQARQVKEWGADAA 147 Query: 236 VTATTFKK 243 + + K Sbjct: 148 IVGSAVVK 155 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.315 0.152 0.417 Lambda K H 0.267 0.0464 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,745,778,771 Number of Sequences: 3077464 Number of extensions: 76226381 Number of successful extensions: 243952 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 157 Number of HSP's successfully gapped in prelim test: 516 Number of HSP's that attempted gapping in prelim test: 242719 Number of HSP's gapped (non-prelim): 766 length of query: 268 length of database: 1,040,396,356 effective HSP length: 126 effective length of query: 142 effective length of database: 652,635,892 effective search space: 92674296664 effective search space used: 92674296664 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 92 (39.9 bits)