BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (280 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_C9Y2J0 Uncharacterized protein yhiR n=11 Tax=Enterobact... 509 e-143 UniRef50_C6C5L8 Putative uncharacterized protein n=3 Tax=Gammapr... 467 e-130 UniRef50_P31777 Uncharacterized protein HI0441 n=170 Tax=Gammapr... 388 e-107 UniRef50_B8F539 Protein involved in catabolism of external DNA n... 378 e-103 UniRef50_A3UP41 Protein involved in catabolism of external DNA n... 365 1e-99 UniRef50_Q0VM19 Putative uncharacterized protein n=2 Tax=Alcaniv... 286 4e-76 UniRef50_Q12I42 Putative uncharacterized protein n=20 Tax=Shewan... 281 2e-74 UniRef50_A1T010 DNA (Exogenous) processing protein n=8 Tax=Gamma... 276 4e-73 UniRef50_A3YH43 Protein involved in external DNA uptake n=2 Tax=... 274 2e-72 UniRef50_Q47Z69 Putative uncharacterized protein n=1 Tax=Colwell... 274 2e-72 UniRef50_A6SXL6 Uncharacterized conserved protein n=70 Tax=cellu... 272 1e-71 UniRef50_Q5NZ63 Predicted protein involved in catabolism of exte... 270 4e-71 UniRef50_A0KP43 Protein involved in external DNA uptake n=3 Tax=... 269 8e-71 UniRef50_A4VR91 Protein involved in catabolism of external DNA n... 254 3e-66 UniRef50_B1XZU6 Putative uncharacterized protein n=1 Tax=Leptoth... 252 1e-65 UniRef50_Q2W9T7 Protein involved in catabolism of external DNA n... 252 1e-65 UniRef50_B6QZ93 Florfenicol resistance protein n=3 Tax=Rhodobact... 241 2e-62 UniRef50_B8GN89 Putative uncharacterized protein n=1 Tax=Thioalk... 240 4e-62 UniRef50_Q984Q7 Mlr7888 protein n=7 Tax=Rhizobiales RepID=Q984Q7... 239 8e-62 UniRef50_B9JCU5 DNA methylase protein n=4 Tax=Rhizobiales RepID=... 236 8e-61 UniRef50_C3X722 External-DNA catabolic protein n=2 Tax=Oxalobact... 233 7e-60 UniRef50_B2S4W3 N-6 Adenine-specific DNA methylase n=51 Tax=Rhiz... 231 2e-59 UniRef50_C6XPG2 Putative uncharacterized protein n=5 Tax=Proteob... 227 4e-58 UniRef50_Q15U81 Putative uncharacterized protein n=1 Tax=Pseudoa... 222 1e-56 UniRef50_Q1N5H6 Putative uncharacterized protein n=1 Tax=Bermane... 221 2e-56 UniRef50_Q89DH2 Blr7467 protein n=16 Tax=Rhizobiales RepID=Q89DH... 219 8e-56 UniRef50_C6M2C4 YhiR family protein n=2 Tax=Neisseriaceae RepID=... 215 1e-54 UniRef50_Q0F148 Putative uncharacterized protein n=1 Tax=Maripro... 214 2e-54 UniRef50_Q2SPJ4 Protein involved in catabolism of external DNA n... 213 9e-54 UniRef50_Q5ZVZ2 Protein involved in catabolism of external DNA n... 212 1e-53 UniRef50_A0YHR6 Putative uncharacterized protein n=1 Tax=marine ... 211 2e-53 UniRef50_B2HZ48 Protein involved in catabolism of external DNA n... 211 2e-53 UniRef50_A4SYR3 Putative uncharacterized protein n=1 Tax=Polynuc... 208 1e-52 UniRef50_D0IYP9 ComJ n=10 Tax=Bacteria RepID=D0IYP9_COMTE 208 2e-52 UniRef50_D0KVW6 Putative uncharacterized protein n=1 Tax=Halothi... 208 2e-52 UniRef50_A1WIT5 Putative uncharacterized protein n=4 Tax=Burkhol... 208 2e-52 UniRef50_A5EWC5 Putative uncharacterized protein n=1 Tax=Dichelo... 206 6e-52 UniRef50_C6QCA2 Putative uncharacterized protein n=1 Tax=Hyphomi... 206 9e-52 UniRef50_B1ZS65 Putative uncharacterized protein n=2 Tax=Opituta... 204 3e-51 UniRef50_C5BQN4 Putative uncharacterized protein n=1 Tax=Teredin... 202 2e-50 UniRef50_Q0G6E9 Putative uncharacterized protein n=1 Tax=Fulvima... 199 9e-50 UniRef50_B4RYZ5 Putative uncharacterized protein n=2 Tax=Alterom... 196 6e-49 UniRef50_B1LXQ5 Putative uncharacterized protein n=9 Tax=Alphapr... 195 1e-48 UniRef50_Q5QVX6 Transformation competence-related protein ComJ n... 195 1e-48 UniRef50_Q0ARP7 Putative uncharacterized protein n=2 Tax=Hyphomo... 195 1e-48 UniRef50_B5EL93 Putative uncharacterized protein n=2 Tax=Acidith... 194 2e-48 UniRef50_Q0BPC8 Putative uncharacterized protein n=3 Tax=Acetoba... 192 1e-47 UniRef50_A1VJI9 Putative uncharacterized protein n=4 Tax=Comamon... 189 7e-47 UniRef50_C8NAD4 Cytoplasmic protein n=34 Tax=Proteobacteria RepI... 189 7e-47 UniRef50_Q1YIC4 Putative uncharacterized protein n=1 Tax=Auranti... 189 1e-46 UniRef50_Q21LZ8 Putative uncharacterized protein n=1 Tax=Sacchar... 187 3e-46 UniRef50_C7JEW3 Putative uncharacterized protein n=8 Tax=Acetoba... 187 4e-46 UniRef50_B7QYF1 Protein involved in catabolism of external DNA n... 186 6e-46 UniRef50_D2LG58 Putative uncharacterized protein n=1 Tax=Rhodomi... 186 7e-46 UniRef50_A3JEY3 Protein involved in catabolism of external DNA n... 183 5e-45 UniRef50_Q87F97 Transformation competence-related protein n=20 T... 176 6e-43 UniRef50_C6NTA4 Putative uncharacterized protein n=1 Tax=Acidith... 175 1e-42 UniRef50_Q1RK44 ComJ n=12 Tax=Rickettsia RepID=Q1RK44_RICBR 173 5e-42 UniRef50_Q2G473 Putative uncharacterized protein n=1 Tax=Novosph... 170 5e-41 UniRef50_UPI0000E1171F protein involved in external DNA uptake n... 166 6e-40 UniRef50_C8PZ79 Protein involved in catabolism of external DNA n... 165 1e-39 UniRef50_Q73R01 Putative uncharacterized protein n=1 Tax=Trepone... 164 4e-39 UniRef50_A5WD58 Putative uncharacterized protein n=3 Tax=Psychro... 163 6e-39 UniRef50_Q1QSA4 Putative uncharacterized protein n=1 Tax=Chromoh... 161 3e-38 UniRef50_B8H3J8 External DNA uptake/catabolism protein n=6 Tax=C... 145 2e-33 UniRef50_A3VP01 Putative uncharacterized protein n=1 Tax=Parvula... 136 7e-31 UniRef50_B7VU08 Putative uncharacterized protein n=4 Tax=Vibrion... 136 1e-30 UniRef50_UPI0001909543 putative DNA methylase protein n=1 Tax=Rh... 128 2e-28 UniRef50_Q0C0F5 Putative uncharacterized protein n=1 Tax=Hyphomo... 126 7e-28 UniRef50_B7G053 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 112 1e-23 UniRef50_C5SM30 Putative uncharacterized protein n=2 Tax=Cauloba... 110 5e-23 UniRef50_Q2BH49 Putative uncharacterized protein n=1 Tax=Neptuni... 77 5e-13 UniRef50_A0B718 Protein involved in catabolism of external DNA-l... 51 4e-05 >UniRef50_C9Y2J0 Uncharacterized protein yhiR n=11 Tax=Enterobacteriaceae RepID=C9Y2J0_CROTZ Length = 280 Score = 509 bits (1311), Expect = e-143, Method: Compositional matrix adjust. Identities = 242/280 (86%), Positives = 259/280 (92%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRY L EHAERTGE Sbjct: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYLLSGEHAERTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 YLEGIARIWQ+DDLPAELE YI+ V HFNRSGQLRYYPGSPLIAR LLR QDSLQLTELH Sbjct: 61 YLEGIARIWQRDDLPAELEPYISAVSHFNRSGQLRYYPGSPLIARQLLRPQDSLQLTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 PSD+PLLR EFQKD RARVE+ADG+QQLK+KLPP SRRGLILIDPPYE+KTDYQAVV GI Sbjct: 121 PSDFPLLRGEFQKDERARVERADGYQQLKSKLPPASRRGLILIDPPYEIKTDYQAVVQGI 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGYKRFATG+YALWYPVVLR QIKRM++DLE+TGIR+ILQIELAV PDSD+RGMTASGM Sbjct: 181 NEGYKRFATGVYALWYPVVLRNQIKRMMNDLESTGIRRILQIELAVRPDSDQRGMTASGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +VINPPWKLEQQM +LPWLH LVPAGTGH T+ W+VPE Sbjct: 241 VVINPPWKLEQQMGTLLPWLHKALVPAGTGHTTLKWVVPE 280 >UniRef50_C6C5L8 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=C6C5L8_DICDC Length = 280 Score = 467 bits (1202), Expect = e-130, Method: Compositional matrix adjust. Identities = 219/280 (78%), Positives = 247/280 (88%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADVLKHTVQSLII +LKEK+KPFLYLDTH+GAGRYQL EHAERTGE Sbjct: 1 MLSYRHSFHAGNHADVLKHTVQSLIITALKEKEKPFLYLDTHSGAGRYQLHGEHAERTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y EGI RIWQ+DD+PAE+EAY+ VV+ +N GQLRYYPGSPLIAR LLREQD+L LTELH Sbjct: 61 YREGIGRIWQRDDIPAEMEAYLQVVRSYNSGGQLRYYPGSPLIARQLLREQDTLNLTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+D+ LLR EF +D RARV + DG+ QLK++LPP +RRG+ILIDPPYE+KTDYQAVV GI Sbjct: 121 PTDFSLLRQEFARDDRARVVREDGYLQLKSRLPPAARRGVILIDPPYELKTDYQAVVDGI 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGY+RFATG+YALWYPVVLRQQIKR++ LE TGIR+ILQIELAVLPDSDR GMTASGM Sbjct: 181 QEGYRRFATGVYALWYPVVLRQQIKRLLKALEETGIRRILQIELAVLPDSDRHGMTASGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPWKLE QM ++LPWLH LVP GTGH V W+VPE Sbjct: 241 IVINPPWKLEAQMKSLLPWLHQVLVPEGTGHTRVEWVVPE 280 >UniRef50_P31777 Uncharacterized protein HI0441 n=170 Tax=Gammaproteobacteria RepID=Y441_HAEIN Length = 281 Score = 388 bits (997), Expect = e-107, Method: Compositional matrix adjust. Identities = 186/281 (66%), Positives = 214/281 (76%), Gaps = 1/281 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY HSFHAGNHADVLKH V LI+E+LK K+K F YLDTH+G GRY+L S +E+TGE Sbjct: 1 MLSYHHSFHAGNHADVLKHIVLMLILENLKLKEKGFFYLDTHSGVGRYRLSSNESEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSG-QLRYYPGSPLIARLLLREQDSLQLTEL 119 Y EGI R+W Q DLP ++ Y+ ++K N G +LRYY GSPLIA LLR QD LTEL Sbjct: 61 YKEGIGRLWDQTDLPEDIARYVKMIKKLNYGGKELRYYAGSPLIAAELLRSQDRALLTEL 120 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HPSDYP+LR+ F D V+ +GFQQ+KA LPP RRGL+LIDPPYE+K DY VV Sbjct: 121 HPSDYPILRNNFSDDKNVTVKCDNGFQQVKATLPPKERRGLVLIDPPYELKDDYDLVVKA 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I EGYKRFATG YA+WYPVVLRQQ KR+ LEATGIRKIL+IELAV PDSD+RGMTASG Sbjct: 181 IEEGYKRFATGTYAIWYPVVLRQQTKRIFKGLEATGIRKILKIELAVRPDSDQRGMTASG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 M+VINPPW LE QM +LP+L LVP GTG TV WI PE Sbjct: 241 MVVINPPWTLETQMKEILPYLTKTLVPEGTGSWTVEWITPE 281 >UniRef50_B8F539 Protein involved in catabolism of external DNA n=67 Tax=Gammaproteobacteria RepID=B8F539_HAEPS Length = 279 Score = 378 bits (971), Expect = e-103, Method: Compositional matrix adjust. Identities = 180/280 (64%), Positives = 218/280 (77%), Gaps = 1/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY HSFHAGNHADVLKH V +LI+ +LK+K+K F YLDTH+G GRY L S AE+TGE Sbjct: 1 MLSYHHSFHAGNHADVLKHIVLTLILHALKQKEKGFFYLDTHSGVGRYSLQSSEAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y+EGIAR+W++ DLP ++ Y+N +K N+ +LR+Y GSPL+A LR QD LTELH Sbjct: 61 YIEGIARLWERTDLPEKVVLYLNEIKKINKD-KLRFYAGSPLLAVQQLRPQDRALLTELH 119 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+D+PLLR+EF K ++ +GFQQLK+ LPP +RGL+LIDPPYE+K DY+ VV I Sbjct: 120 PNDFPLLRNEFAKTPNVVTKRENGFQQLKSALPPKEKRGLVLIDPPYELKEDYELVVKAI 179 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGYKRFATG+YA+WYPVVLRQ KR++ L TGIRKILQIELAV PDSD+RGMTASGM Sbjct: 180 EEGYKRFATGVYAIWYPVVLRQHTKRIVRGLVETGIRKILQIELAVRPDSDQRGMTASGM 239 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPW+LE QM +LP+L LVP GTG TV WI PE Sbjct: 240 IVINPPWQLESQMKKILPYLTDVLVPEGTGSWTVEWIKPE 279 >UniRef50_A3UP41 Protein involved in catabolism of external DNA n=17 Tax=Gammaproteobacteria RepID=A3UP41_VIBSP Length = 284 Score = 365 bits (936), Expect = 1e-99, Method: Compositional matrix adjust. Identities = 174/285 (61%), Positives = 215/285 (75%), Gaps = 6/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADV+KH VQSLI+ LK+KDKPF+Y DTH+G GRY L E +E+TGE Sbjct: 1 MLSYRHSFHAGNHADVVKHIVQSLILNYLKQKDKPFVYHDTHSGVGRYDLTHEWSEKTGE 60 Query: 61 YLEGIARIWQ-----QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQ 115 Y +GIAR+W Q DLP ++++Y+ + N +LR+YPGSP +AR LR+QD + Sbjct: 61 YKQGIARLWSASEAGQQDLPEDIQSYLESISALNNGEKLRFYPGSPRVARAHLRDQDRMV 120 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 LTELHP+D+PLL EF +D + + K DGFQ+LK LPP RRGL+LIDPPYE+ +Y+ Sbjct: 121 LTELHPADHPLLEQEFHRDRQVSIYKEDGFQRLKGSLPPKERRGLVLIDPPYELAKEYRD 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 VV+ IA+ +KR+ATGIYA+WYPVV R I+ MI LE GI KILQIEL V PD++ RGM Sbjct: 181 VVTAIAQSHKRWATGIYAIWYPVVNRCDIEDMIEGLEGLGINKILQIELGVSPDTNERGM 240 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 TASGMIVINPPWKLE QMN +LP+L + PA TGH V WIVPE Sbjct: 241 TASGMIVINPPWKLESQMNEILPFLKEAIAPA-TGHFKVEWIVPE 284 >UniRef50_Q0VM19 Putative uncharacterized protein n=2 Tax=Alcanivorax RepID=Q0VM19_ALCBS Length = 282 Score = 286 bits (733), Expect = 4e-76, Method: Compositional matrix adjust. Identities = 141/280 (50%), Positives = 183/280 (65%), Gaps = 1/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHS+HAGN ADVLKH VQ IIE LK+KDKPF DTHAGAG Y + SEH ++TGE Sbjct: 1 MLSYRHSYHAGNFADVLKHIVQVAIIEYLKKKDKPFTVHDTHAGAGSYAIASEHMQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GIA+++ + ++ Y+++V+ N G+L YPGSP I+ LLREQD LQ TELH Sbjct: 61 YQDGIAKLFGKRTGVGVIDQYVSLVEKLNPVGRLMDYPGSPQISASLLREQDVLQCTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 +D+ LL+ EF D R +V K D + LKA LPP RRGL+LIDP YEM+ DY V+ + Sbjct: 121 STDFTLLKREFADDKRVQVLKDDAWHGLKALLPPRHRRGLVLIDPSYEMEADYNGVLPAV 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 +RFAT YA+WYPV+ R + + I GI +L++E V PD+ RGMT +GM Sbjct: 181 QMAMERFATATYAIWYPVLDRNRTESFIRRFVKAGIPNLLRVECCVRPDASGRGMTGTGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++INPP+ L Q M +P L L A GH TV + E Sbjct: 241 LIINPPYTLAQHMAQAMPLLKEALCDA-NGHTTVKMLTGE 279 >UniRef50_Q12I42 Putative uncharacterized protein n=20 Tax=Shewanella RepID=Q12I42_SHEDO Length = 292 Score = 281 bits (719), Expect = 2e-74, Method: Compositional matrix adjust. Identities = 135/281 (48%), Positives = 191/281 (67%), Gaps = 2/281 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH +HAGN+ADVLKH + +++++ +KDK F+Y+DTHAGAG Y L E A++TGE Sbjct: 13 MLSYRHGYHAGNYADVLKHAILLQVLKAMHKKDKAFVYVDTHAGAGAYSLEDEFAQKTGE 72 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFN-RSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 YL+G+A++W + DLP L+ Y+ VK FN +L YPGSP LR QD + L EL Sbjct: 73 YLDGVAKLWDKTDLPLALKDYVAAVKTFNAEQDELSLYPGSPAFVDSELRPQDRMVLHEL 132 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 H +D+ LL F KD + +V K DG L A +PP+ RRG++LIDP +E+KTDYQ V Sbjct: 133 HGTDHELLSDYFAKDRQVKVIKGDGLAGLIAAVPPLERRGVVLIDPSFEIKTDYQDVADA 192 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I + +KRF+TG++ LWYPVV R+Q + M+ L+ +GI K L++E + DS+ GMTA+G Sbjct: 193 IIKAHKRFSTGVFMLWYPVVDREQTEAMLSKLKNSGITKQLRLEQGIKTDSNEFGMTAAG 252 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + +INPPW+L++ + L +L +K + GH TV W V E Sbjct: 253 LWIINPPWQLDELAKDSLDYL-AKTLGGIDGHVTVKWEVGE 292 >UniRef50_A1T010 DNA (Exogenous) processing protein n=8 Tax=Gammaproteobacteria RepID=A1T010_PSYIN Length = 284 Score = 276 bits (707), Expect = 4e-73, Method: Compositional matrix adjust. Identities = 137/280 (48%), Positives = 183/280 (65%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 +LSYRHSFHAGN ADVLKH V + II+ + +K+K F YLDTHAG G Y S A +T E Sbjct: 5 LLSYRHSFHAGNFADVLKHIVSTSIIDYMLKKEKAFCYLDTHAGCGAYSFQSPEALKTKE 64 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI +W + DLP + Y+ V FN QL++YPGSP IA +LR+ D L L ELH Sbjct: 65 FNNGIFPLWGRSDLPVPVARYMEQVVEFNAQSQLKHYPGSPSIAVQMLRDIDRLFLFELH 124 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+++ + + F + + ++ K+DG Q L A +PP +RRG ILIDP YE+KT+Y VV + Sbjct: 125 PNEFINMCANFSGNRQIKMAKSDGLQGLIANMPPKARRGFILIDPSYEIKTEYHQVVETL 184 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + +KRFATG YALWYPVV R I +M L+A+GI+ I EL + DSD+ GMT+SGM Sbjct: 185 IQAHKRFATGTYALWYPVVNRMTIDKMEKALKASGIKNIQLFELGLQEDSDQMGMTSSGM 244 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPW L+++M LP+L L G + +V E Sbjct: 245 IVINPPWTLKKEMQASLPFLAKLLGFDNQGFYRIETLVAE 284 >UniRef50_A3YH43 Protein involved in external DNA uptake n=2 Tax=Marinomonas RepID=A3YH43_9GAMM Length = 280 Score = 274 bits (701), Expect = 2e-72, Method: Compositional matrix adjust. Identities = 126/280 (45%), Positives = 177/280 (63%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH +HAGNHAD+LKH V S I L K+ PF YLDTHAG G+Y L S+ A+ E Sbjct: 1 MLSYRHIYHAGNHADILKHLVVSQICHHLTAKEAPFFYLDTHAGIGQYALDSQQAQMNKE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI+++ + P ++ ++ +VK N + L+ YPGSP + R++D + L ELH Sbjct: 61 FKTGISQLLELKSAPDSIKRFLKIVKEMNPTSNLKVYPGSPKVVEAYTRQKDKMHLCELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P D+P L + F RA VEK +GF +KA LPP +RGL+L+DPPYE+K DY+ VV + Sbjct: 121 PKDHPTLAALFPNKRRANVEKGNGFAAVKAMLPPPQKRGLVLMDPPYEVKEDYKTVVKAL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EG++RF+ GIYA+WYPV+ R+Q +I+ ++ T IR +L +EL + +GM SGM Sbjct: 181 VEGHQRFSHGIYAIWYPVLSRKQADNLINSVQRTKIRNVLLLELNIRDIDADKGMNGSGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 I++NPPWK+E + LP L L + WI PE Sbjct: 241 IIVNPPWKMESEAQEFLPILKELLQEDNRSSFQLRWITPE 280 >UniRef50_Q47Z69 Putative uncharacterized protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47Z69_COLP3 Length = 295 Score = 274 bits (701), Expect = 2e-72, Method: Compositional matrix adjust. Identities = 127/296 (42%), Positives = 193/296 (65%), Gaps = 17/296 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGN ADVLKH+V SL+++ + K+K F Y+D+H+GAG YQL E+A++TGE Sbjct: 1 MLSYRHAFHAGNFADVLKHSVLSLVLDYMTRKEKGFCYIDSHSGAGMYQLADEYAQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFN----------------RSGQLRYYPGSPLIA 104 Y +GIA+I +D P LE Y++++K N S L YPGSP IA Sbjct: 61 YKDGIAKIINDEDAPESLEPYLSLIKSLNLASDRNTDPSADISTDTSNDLDVYPGSPGIA 120 Query: 105 RLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILID 164 + +R QDS L ELHP+D L + Q+ + V+++DG+Q + +PP SRRG++LID Sbjct: 121 KAFVRRQDSSHLFELHPTDIQHLENFCQRWRKVFVKQSDGYQGVLGLIPPPSRRGVVLID 180 Query: 165 PPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIEL 224 PPYE+K DY V I + Y++F+TG Y LWYPVV R+ +++M + + ++ +LQ+E Sbjct: 181 PPYELKEDYHKAVKTIIKAYEKFSTGTYILWYPVVKRELVEQMSYTFTKSSVKNVLQVEF 240 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + D+D GMT +G+ ++NPPW+L Q+ +LP++ +KL T + T++ ++ E Sbjct: 241 CLESDTDEYGMTGTGLFIVNPPWQLTSQLEEILPYMKTKLGSDDTSY-TLNQLIAE 295 >UniRef50_A6SXL6 Uncharacterized conserved protein n=70 Tax=cellular organisms RepID=A6SXL6_JANMA Length = 305 Score = 272 bits (695), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 134/276 (48%), Positives = 180/276 (65%), Gaps = 16/276 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH VQ +++ L +KD P++Y+DTH+GAG Y L +A + E Sbjct: 1 MLSYRHAFHAGNHADVLKHLVQIQLLKYLNQKDTPYMYIDTHSGAGVYALDGNYAAKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI+++W + DLPA L Y+ V+K N SG+LRYYPGSP A ++REQD L+L ELH Sbjct: 61 FETGISKLWDRKDLPAPLAEYVQVIKALNPSGKLRYYPGSPYCADAVMREQDRLRLFELH 120 Query: 121 PSDYPLLRSEFQK---------------DSRARVEKADGFQQLKAKLPPVSRRGLILIDP 165 P+D LL F+K R +E+ +GFQ LKA LPP SRRGL+LIDP Sbjct: 121 PADSKLLADNFRKLEAHAAEQGKRPTVRGKRIMIERGNGFQGLKALLPPPSRRGLVLIDP 180 Query: 166 PYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELA 225 PYE KTDY+ VV +++ RFATG YA+WYPV+ R + ++M L+ L + L+ Sbjct: 181 PYEDKTDYRTVVQTVSDALTRFATGTYAVWYPVLNRLESRQMPDKLKRLSANGWLNVTLS 240 Query: 226 V-LPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWL 260 V P D G+ +SGM V NPPW LE + ++P+L Sbjct: 241 VTTPSPDGFGLHSSGMFVHNPPWTLEPMLRELMPYL 276 >UniRef50_Q5NZ63 Predicted protein involved in catabolism of external DNA n=13 Tax=Betaproteobacteria RepID=Q5NZ63_AZOSE Length = 281 Score = 270 bits (690), Expect = 4e-71, Method: Compositional matrix adjust. Identities = 134/272 (49%), Positives = 175/272 (64%), Gaps = 2/272 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH V ++ K+KP+ Y+DTHAGAG Y L SE A + E Sbjct: 1 MLSYRHAFHAGNHADVLKHFVLIELLRYFNRKEKPWWYVDTHAGAGCYALDSEQAGKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI R+WQ+DDLP + Y++ + FN G+L +YPGSP +A LREQD ++L ELH Sbjct: 61 FASGIGRLWQRDDLPDAMRPYLDALAQFNPHGRLTFYPGSPALAMTQLREQDRMRLFELH 120 Query: 121 PSDYPLLRSEFQKD-SRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 P+D LL F +D R +V KADGF L+ LPP SRR ++LIDPPYE+K DY+ VV Sbjct: 121 PADVALLGQTFARDVQRVQVRKADGFSALRGLLPPPSRRVVVLIDPPYEVKEDYRRVVDT 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMTAS 238 +A+ KRF G YA+WYP++ R + +++ L G L + LAV P D GM S Sbjct: 181 LADAIKRFPAGTYAVWYPLLARTEARQLPARLAGLGAENWLDVRLAVKKPPRDGFGMFGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTG 270 G+ V+NPPW L Q + V+PWL L G G Sbjct: 241 GLYVVNPPWVLPQTLEAVMPWLADVLGEDGEG 272 >UniRef50_A0KP43 Protein involved in external DNA uptake n=3 Tax=Aeromonadaceae RepID=A0KP43_AERHH Length = 284 Score = 269 bits (687), Expect = 8e-71, Method: Compositional matrix adjust. Identities = 135/279 (48%), Positives = 178/279 (63%), Gaps = 2/279 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH VQ+LIIESLK+K+KPF+ LDTHAG G Y L + ++ E Sbjct: 1 MLSYRHAFHAGNHADVLKHAVQALIIESLKKKEKPFIVLDTHAGGGLYDLCGDWPQKKAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI R+W + + + Y+ V++ N GQLRYYPGSP ++R L REQD L L ELH Sbjct: 61 YADGIGRLWDERTQWSAMAPYLGVIEEMNSDGQLRYYPGSPELSRRLAREQDKLALMELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 ++ LR+ D R V DGF+ L A LPP RRGL+LIDPPYE+K DY AVV + Sbjct: 121 NNEVDDLRANMGYDPRVAVHHRDGFEGLVALLPPTPRRGLVLIDPPYELKEDYFAVVDTL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKR--MIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 + KR+ATGIYALWYP++ + K M+ ++ +L EL V + GM S Sbjct: 181 KKAQKRWATGIYALWYPILGEEADKSRDMLRAIKRENFGNVLVAELEVAGQTKDWGMNGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM++I+PPW L++Q+ L L +KL V W+ Sbjct: 241 GMLIISPPWMLDEQIEAFLKPLCAKLAQGAGAQYKVEWL 279 >UniRef50_A4VR91 Protein involved in catabolism of external DNA n=21 Tax=Pseudomonadaceae RepID=A4VR91_PSEU5 Length = 279 Score = 254 bits (648), Expect = 3e-66, Method: Compositional matrix adjust. Identities = 124/279 (44%), Positives = 170/279 (60%), Gaps = 1/279 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGNHADVLKH V S I L K+ PF YLD+HAG G Y L + A RTGE+ Sbjct: 1 MNYRHAFHAGNHADVLKHLVLSRIFALLSRKEAPFAYLDSHAGVGLYDLAGDQASRTGEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L+GIARIWQ + PA L+ Y+ V++ N G LRYYPGSP +AR L REQD LQL E HP Sbjct: 61 LQGIARIWQAETRPALLDDYLGVIRSLNPDGALRYYPGSPELARQLTREQDRLQLNEKHP 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LL+ D R V + +G+ +A +P +R ++LIDPP+E + V+ + Sbjct: 121 EDGALLKDNMSGDRRVAVHRGEGWHVPRALMPTREKRVVLLIDPPFEQADELSRCVTALK 180 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 E R I +WYP+ +Q+KR DL +G K+L+ EL V P D + SG+ Sbjct: 181 EALGRMRQTIGVIWYPIKDERQLKRFYQDLARSGAPKLLRAELFVHPADDASRLAGSGLA 240 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++NPPW LE ++ +LPWL +L + G + W++ E Sbjct: 241 IVNPPWGLEDELRELLPWLAEQLAQS-QGGWRLDWLIEE 278 >UniRef50_B1XZU6 Putative uncharacterized protein n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1XZU6_LEPCP Length = 280 Score = 252 bits (643), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 117/265 (44%), Positives = 173/265 (65%), Gaps = 1/265 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 ML+YRH+FHAGNHADVLKH V +++ + KDKPF +DTHAG G Y L S +++ GE Sbjct: 1 MLAYRHAFHAGNHADVLKHLVLVQVLQYMASKDKPFRLIDTHAGGGGYALHSSQSQKKGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 YL+GI+RIW D P + Y+ +V+ FN GQL YPGSP ++++LLR D L+L ELH Sbjct: 61 YLQGISRIWGAGDAPPAVADYLRLVRRFNPDGQLNLYPGSPALSQMLLRRGDQLRLFELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+++ +L + + ++ + DGF L+ ++PP RRG++L+DP YE+ +DY V+ + Sbjct: 121 PTEFKILTENTRPGRQVQLAQVDGFAALRGQVPPSMRRGVVLMDPSYELVSDYAKVIDSL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMTASG 239 + +RFA G+Y +WYP V R + ++ L+AT + L + L V PD+ G+T SG Sbjct: 181 RDALQRFAEGVYVVWYPQVSRVESIQIARRLQATAPKGWLHVRLNVQQPDAQGFGLTGSG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKL 264 + VINPP+ L Q+ +PWL KL Sbjct: 241 VFVINPPYTLHAQLAACMPWLTQKL 265 >UniRef50_Q2W9T7 Protein involved in catabolism of external DNA n=7 Tax=Alphaproteobacteria RepID=Q2W9T7_MAGSA Length = 283 Score = 252 bits (643), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 120/279 (43%), Positives = 177/279 (63%), Gaps = 1/279 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + ++++ SLK KD PF LDTHAG G Y L + A++TGEY Sbjct: 1 MNYRHAYHAGNFADVMKHAILAMVVASLKRKDTPFFALDTHAGIGAYDLEAPQADKTGEY 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L GIAR+ D PAELE Y+ +V+ +N G LR YPGSP + R L+R QD + L ELHP Sbjct: 61 LSGIARVLDAADPPAELETYLALVRTWNSEGVLRRYPGSPELMRGLMRPQDRMALVELHP 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LR+ F D R V DG+ K LPP RRGL+L+DPP+E+K +++ +++ + Sbjct: 121 EDVETLRARFHGDRRVGVHHLDGYTAAKGLLPPPERRGLVLMDPPFEVKNEFERLLAALR 180 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 K + TGIY WYP+ R+ + + + + G + L EL + P D + +G++ Sbjct: 181 RARKLWPTGIYLAWYPIKGREPVDQFLQAIADDGGPEALAAELLLRPAKDPFKLNGNGLL 240 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 VINPPW+L + ++ VLPWL + + P +G A + ++ E Sbjct: 241 VINPPWQLRESLDRVLPWLAAVMAP-DSGSAAIRQLIGE 278 >UniRef50_B6QZ93 Florfenicol resistance protein n=3 Tax=Rhodobacteraceae RepID=B6QZ93_9RHOB Length = 284 Score = 241 bits (615), Expect = 2e-62, Method: Compositional matrix adjust. Identities = 115/284 (40%), Positives = 180/284 (63%), Gaps = 5/284 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN DVLKH V + I++ L++KD + LDTHAG G Y L SE A++TGE+ Sbjct: 1 MNYRHIYHAGNIGDVLKHVVLANILKYLQKKDGAYRVLDTHAGIGLYDLTSEKAQKTGEW 60 Query: 62 LEGIARIWQQDDLPAE-----LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 +G+ ++ + D ++ L ++ V++ N G +++YPGSP IA +L R+QD L L Sbjct: 61 QQGVGKVLENIDAASDQVKEVLAPWLETVENLNPGGGVQFYPGSPEIACMLARKQDRLTL 120 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 TELHP D+ L++ + D + +V D + L + LPP RRGL+LIDP +E++ +++ + Sbjct: 121 TELHPEDFEELKNNYGGDKKVKVIALDAWLALGSFLPPKERRGLVLIDPAFEVEDEFKRL 180 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT 236 G+ G+KR+ TG +A+WYPV ++ + ++I LE GIR +++EL+ SD R M Sbjct: 181 AEGVIRGWKRWQTGTFAIWYPVKNQRIVNQLIVTLEEAGIRNAVKLELSAGQISDDRPMK 240 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +SGM+V+NPPW L + MN LPWL L +V ++ E Sbjct: 241 SSGMLVVNPPWTLTRDMNTALPWLCQTLSQGKGAEWSVKQVIAE 284 >UniRef50_B8GN89 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GN89_THISH Length = 281 Score = 240 bits (612), Expect = 4e-62, Method: Compositional matrix adjust. Identities = 120/280 (42%), Positives = 169/280 (60%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH FHAGN ADV KH V + I+++L K KPF LDTHAG Y L S+ AE+TGE Sbjct: 1 MLSYRHGFHAGNFADVHKHAVLAWIVQALTAKAKPFCVLDTHAGDAGYDLASQWAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + EG+ R+ P + ++ +++ F S R YPGSP IAR LLR D L L ELH Sbjct: 61 WREGVGRLMGCPGAPEAIAPFLQLLEAFRASHGERAYPGSPAIARGLLRPGDRLVLGELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+ + LR F +D + V + DG++ L A LPP RRGL+L+DPPYE +YQA + Sbjct: 121 PAAWESLRGFFARDDQVAVHRRDGWELLGALLPPAERRGLVLVDPPYERDEEYQAAARAL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 +R+ +G+Y LWYP++ + + M+ +LEA +L EL P G+ SG+ Sbjct: 181 TAAARRWPSGVYLLWYPLLAAGRHQAMLRELEAARPGPMLVAELWTAPLDTPAGLNGSGL 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++NPPW+L + + N+ PWL L P G G + + W +P+ Sbjct: 241 CILNPPWRLHEALANLQPWLVDCLAPGGAGGSRLHWAIPD 280 >UniRef50_Q984Q7 Mlr7888 protein n=7 Tax=Rhizobiales RepID=Q984Q7_RHILO Length = 282 Score = 239 bits (610), Expect = 8e-62, Method: Compositional matrix adjust. Identities = 122/282 (43%), Positives = 174/282 (61%), Gaps = 3/282 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH V + +++ LK+KDK F +DTHAG GRY L S A++TGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHVVLTRLLDYLKQKDKAFRVVDTHAGIGRYDLSSLEAQKTGEW 60 Query: 62 LEGIARIWQQD---DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 GI R+ A L Y+ V+ N ++ YPGSPL+AR LLR+QD L E Sbjct: 61 QGGIGRLIDASLDARAGALLAPYLEAVRSLNPGDGVKKYPGSPLLARHLLRKQDRLSAIE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP D L++EF D + RV + DG+ L A LPP +RGL+LIDPP+E + ++ +V Sbjct: 121 LHPKDAARLKAEFAGDFQVRVMELDGWLALGAHLPPKEKRGLVLIDPPFEEEGEFGRLVE 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+ ++R+ GIYALWYP+ R+ + L+ +GI KIL IE + P S + S Sbjct: 181 GLIRAHRRWPGGIYALWYPIKDRKAVIAFRKALKQSGIPKILDIEFEIRPASSEPSLDGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 GM+V+NPP+ LE ++ VLP LH L H ++ W+ E Sbjct: 241 GMVVVNPPFTLEGELRTVLPALHKLLAVEKPAHWSLEWLAGE 282 >UniRef50_B9JCU5 DNA methylase protein n=4 Tax=Rhizobiales RepID=B9JCU5_AGRRK Length = 283 Score = 236 bits (601), Expect = 8e-61, Method: Compositional matrix adjust. Identities = 119/282 (42%), Positives = 172/282 (60%), Gaps = 3/282 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN ADVLKH V + ++ ++ KDK F LDTHAG G Y L SE A++TGE+ Sbjct: 1 MNYRHIYHAGNFADVLKHAVLARLVRYMQNKDKAFRVLDTHAGIGLYDLSSEEAQKTGEW 60 Query: 62 LEGIARIWQQDDLP--AEL-EAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +GI R+ + +P AEL E Y+ V+ N G L++YPGSP +AR+L R QD L E Sbjct: 61 QDGIGRLLDAELVPQLAELLEPYLTAVRELNPDGGLQFYPGSPKLARMLFRSQDRLSAME 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP D+ L F+ D AR+ + DG+ L A LPP +RG++L+DPP+E + +Y+ + Sbjct: 121 LHPEDFQRLHRLFEGDHHARITELDGWLALGAHLPPKEKRGIVLVDPPFEEEDEYERLAD 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+A+ ++RF G Y LWYP+ IK L+A I K+L EL V D G+T S Sbjct: 181 GLAKAWRRFPGGTYCLWYPIKKDAPIKAFHETLQALEIPKVLCAELTVKSDRGFTGLTGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G+I++NPP+ L+ +++ +LP L L W+ E Sbjct: 241 GLIIVNPPFTLKDELHALLPALKDMLAQDRFASQRAFWLRGE 282 >UniRef50_C3X722 External-DNA catabolic protein n=2 Tax=Oxalobacter formigenes RepID=C3X722_OXAFO Length = 298 Score = 233 bits (593), Expect = 7e-60, Method: Compositional matrix adjust. Identities = 119/280 (42%), Positives = 167/280 (59%), Gaps = 16/280 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKH V ++ K+ Y+DTH+GAG Y L A + E Sbjct: 1 MFSYRHAFHAGNHADVLKHVVLMQVLLYAIRKEASLFYIDTHSGAGVYSLEGNEARKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GIAR+W + +P + Y+ +V N G+LR+YPGSP IA +LR D L+L E H Sbjct: 61 FQSGIARLWGKKTVPPAVRDYLKLVYDMNPDGKLRFYPGSPYIAERILRSHDRLRLFEWH 120 Query: 121 PSDYPLLRSEF-------QKDSRAR--------VEKADGFQQLKAKLPPVSRRGLILIDP 165 P++ +L F + ++R+R VE+ DGF LKA LPP SRR +ILIDP Sbjct: 121 PAECRVLDENFRGLLKSGESNTRSRPERGKRVLVERKDGFSSLKALLPPPSRRAVILIDP 180 Query: 166 PYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELA 225 PYE K+DY+ VV +++ KRF+TG +WYP++ R + +R L+ T ++ L + L+ Sbjct: 181 PYEDKSDYRKVVDVVSDALKRFSTGTCLIWYPLLQRPESRRFASRLKQTVSQEWLDVTLS 240 Query: 226 V-LPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKL 264 P D G +SGM V+NPPWKL + + +P L S L Sbjct: 241 TGSPVPDGFGFVSSGMFVVNPPWKLAESLQETMPCLVSAL 280 >UniRef50_B2S4W3 N-6 Adenine-specific DNA methylase n=51 Tax=Rhizobiales RepID=B2S4W3_BRUA1 Length = 290 Score = 231 bits (589), Expect = 2e-59, Method: Compositional matrix adjust. Identities = 124/286 (43%), Positives = 172/286 (60%), Gaps = 7/286 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + + I+E LK K++ F +DTHAG G Y L A +TGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHVILTRIVEYLKRKEQAFRVIDTHAGIGLYDLKGTEAGKTGEW 60 Query: 62 LEGIARIWQ-----QDDLPAE--LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSL 114 GI RI Q + P L+ Y++ V N +LR+YPGSPL+ R LLR+QD L Sbjct: 61 AGGIERIMTAVEKGQVEQPVLELLKPYLDAVYAVNTGVRLRHYPGSPLLVRHLLRKQDRL 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 ELHP D L F D + RV + DG+ L A LPP +RGL+L+DPP+E ++ Sbjct: 121 SALELHPQDAAKLAKLFDGDYQVRVTELDGWLALGAHLPPKEKRGLVLVDPPFEKDGEFD 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRG 234 + G+A+ +KRF G YALWYPV R++ +R L TGI KI+QIELA+ S Sbjct: 181 RLADGLAKAHKRFGGGTYALWYPVKDRRETERFARRLRETGIPKIMQIELAIRAPSPEPR 240 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + +GMIV+NPP+ LE +M +LP L L + ++ WI E Sbjct: 241 LDGTGMIVVNPPYTLESEMQILLPCLTRLLEEEKGSNFSLRWIRGE 286 >UniRef50_C6XPG2 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=C6XPG2_HIRBI Length = 334 Score = 227 bits (578), Expect = 4e-58, Method: Compositional matrix adjust. Identities = 115/274 (41%), Positives = 168/274 (61%), Gaps = 16/274 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FH GN AD+LKH V L IESLK+KDKPF Y+DTHAG GRY L + A R+ E+ Sbjct: 1 MNYRHAFHVGNFADILKHLVLVLCIESLKKKDKPFRYIDTHAGIGRYDLTGDEARRSPEW 60 Query: 62 LEGIARIWQQ-------DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSL 114 EGI RIW +D+ A L+ Y++ V N G L YPGSP +A L+REQDSL Sbjct: 61 QEGIGRIWAAHKAGDIPEDVAAILKPYLDAVSEINYDGDLESYPGSPDLAATLMREQDSL 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 +LTELHP+D L F +D R ++E +G++ LKA LPP RRG++L+DPP+E + + Sbjct: 121 RLTELHPADKETLTDHFFRDKRVKIENRNGYEALKAYLPPPERRGVVLVDPPFEHRDELA 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKR--------MIHDLEATGIRKILQIELAV 226 + G G R+ TG Y W P+ + ++ ++ D+E + KIL +L V Sbjct: 181 HMAKGAMGGISRWPTGTYIFWRPLKDMENTQKFDDGLAEWLLDDMEFSH-EKILLADLWV 239 Query: 227 LPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWL 260 + + +G++V+NPP+ +++ + VLPW+ Sbjct: 240 KEIVEPGPLCGAGVVVVNPPYGMQEALLTVLPWV 273 >UniRef50_Q15U81 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15U81_PSEA6 Length = 293 Score = 222 bits (565), Expect = 1e-56, Method: Compositional matrix adjust. Identities = 120/296 (40%), Positives = 170/296 (57%), Gaps = 20/296 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH +HAGNHADVLKH Q LII+ LK+KDK F Y+DTH+GAG Y L SE + +T E Sbjct: 1 MFSYRHGYHAGNHADVLKHICQMLIIDKLKQKDKGFTYIDTHSGAGLYDLSSEQSLKTNE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + +GI+R+ + AY + + + Q YPGSP IAR+L+R+QD L L E + Sbjct: 61 FQQGISRLADYSGAEPTVLAYQALTSSYLKHQQ---YPGSPEIARVLMRDQDQLHLMEWN 117 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 + L+ + K + V DG++ L A PP +RGL+L DP YE DYQ VV I Sbjct: 118 NQEVINLKRQI-KGTHISVHHRDGYEGLIALTPPKLKRGLVLTDPSYETSEDYQLVVDAI 176 Query: 181 AEGYKRFATGIYALWYPVVLR----------------QQIKRMIHDLEATGIRKILQIEL 224 ++ YKR+ T IYA+WYP++ + ++ ++M+ DL G + +LQ+EL Sbjct: 177 SKAYKRWPTAIYAIWYPLLSKRDEDQNDGFERATTKHKKSQKMLDDLTQHGFKNVLQVEL 236 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 AV GM SGM +IN PW+L+ Q+ + L L + V+W+V E Sbjct: 237 AVQNPDTFAGMYGSGMAIINAPWQLDAQIRDCLGELTPVMAQHKHASFVVNWLVEE 292 >UniRef50_Q1N5H6 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1N5H6_9GAMM Length = 284 Score = 221 bits (563), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 118/285 (41%), Positives = 166/285 (58%), Gaps = 11/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH+V I + +K + Y+DTH+GAG Y+L + A +T E Sbjct: 1 MLSYRHAFHAGNHADVLKHSVLVAIAKYFHKKQSAYTYIDTHSGAGVYKLSDDLANKTQE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRS----GQLRYYPGSPLIARLLLREQDSLQL 116 Y GIAR++ DL A + Y+ V+ N + L++YPGSP LLREQD L Sbjct: 61 YKTGIARLYPNSDL-ALISPYLEQVRVLNAAQGEEKNLQFYPGSPWFMTELLREQDQAHL 119 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP D+ LL + +V DGF +KA LPP ++R I+IDPPYE +Y+ V Sbjct: 120 FELHPQDHALLEQNMNTGKQLKVHMEDGFSGIKAVLPPQTKRAFIVIDPPYEQANEYKKV 179 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQ----IKRMIHDLEATGIRKILQIELAVLPDSDR 232 V+ I +G KRFA G++A+WYP++ R + M+ +L T I K L + L Sbjct: 180 VNAIEQGIKRFAVGVFAVWYPLLNRNDKQGMSETMVDELAKTDITKYLDVRLWT--SKQT 237 Query: 233 RGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 +GM SG+ ++NPP+ L+ +N LP L L T +V ++ Sbjct: 238 QGMYGSGLFIVNPPYILQDLLNQELPKLLEVLGLDETAGFSVDYV 282 >UniRef50_Q89DH2 Blr7467 protein n=16 Tax=Rhizobiales RepID=Q89DH2_BRAJA Length = 286 Score = 219 bits (558), Expect = 8e-56, Method: Compositional matrix adjust. Identities = 115/282 (40%), Positives = 170/282 (60%), Gaps = 15/282 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN ADV+KH V + I+ L++K F +DTHAGAG Y L S+ A R GE+ Sbjct: 1 MNYRHAFHAGNFADVIKHIVLARILTYLQDKPGAFRVIDTHAGAGLYDLESDEARRGGEW 60 Query: 62 LEGIARIWQ---QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 L GIAR+ Q ++ A + Y+++V+ FN G+L+ YPGSPLIAR LLR QD L E Sbjct: 61 LTGIARLMQARLSNETAALTKPYLDIVRAFNPKGELKAYPGSPLIARGLLRPQDRLVACE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 L P L ++D +ARV DG+ L A +PP RRGL+LIDPP+E K +++ + Sbjct: 121 LEPKARKALIDVLRRDEQARVVDLDGWVALPAFVPPKERRGLVLIDPPFEAKNEFERLGE 180 Query: 179 GIAEGYKRFATGIYALWYPV-------VLRQQIKRMIHDLEATGIRKILQIELAVLPDSD 231 +E + ++ TGIY +WYP L Q + R+ + G K L++E + P D Sbjct: 181 AFSEAFAKWPTGIYVIWYPAKSRRATDALAQLVARLAAAAKPPG--KCLRLEFSAAPQLD 238 Query: 232 RRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHAT 273 +T++G++++NPP+ L ++ +LP L +P G G A Sbjct: 239 GAALTSTGLLIVNPPYTLHGELKTILPELE---MPLGQGGAA 277 >UniRef50_C6M2C4 YhiR family protein n=2 Tax=Neisseriaceae RepID=C6M2C4_NEISI Length = 281 Score = 215 bits (548), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 113/264 (42%), Positives = 163/264 (61%), Gaps = 6/264 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHAD++KH + L ++ +KDKP+ Y+DTH+GAG Y L A++ GE Sbjct: 1 MLSYRHAFHAGNHADMIKHFILFLTLDYFNQKDKPYWYIDTHSGAGLYDLSGSEAQKVGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI + + + LP EL A+I + QL Y GSP +A+ L R+ D L+L ELH Sbjct: 61 YKQGIRLLQEAEHLPPELSAFIARLNAILPQEQL--YCGSPWLAQALTRDSDKLRLFELH 118 Query: 121 PSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P+D+ L++ ++ R ++ +ADGF+ L + LPP RR ++LIDPPYE K DYQ VV Sbjct: 119 PADFQHLKNNMEEARLGRRGQIMQADGFRGLISLLPPPLRRAVVLIDPPYEEKQDYQRVV 178 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMT 236 + + KRF G Y +WYP + R++ +++ L+ L EL V P D GM Sbjct: 179 QTLKDALKRFEQGCYMVWYPCLSREESRKLPEQLQKLMPDSYLHAELHVHTPRPDGFGMH 238 Query: 237 ASGMIVINPPWKLEQQMNNVLPWL 260 SGM +INPP+ L + + + LP L Sbjct: 239 GSGMFIINPPYLLPELLKSNLPAL 262 >UniRef50_Q0F148 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F148_9PROT Length = 303 Score = 214 bits (545), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 107/286 (37%), Positives = 157/286 (54%), Gaps = 10/286 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+HS+HAGNHADVLKH + + + KD P LD A G Y L S A + E Sbjct: 22 MLSYQHSYHAGNHADVLKHIILGDVAAGMFNKDAPIFMLDAFASRGIYDLNSPEALKNRE 81 Query: 61 YLEGIARIWQQDDLPAE---LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 G+ ++W D P + + ++ N +PGS + + RE D + Sbjct: 82 SDSGVGKLWPLRDEPTNPPGVRRWFKLIASLNMDDSYTRFPGSTAMLHAMAREGDRIAAC 141 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 +LHP ++ LR FQ R + K D F+ +K LPP +RGL+ +DP YE+K +Y+A+ Sbjct: 142 DLHPQEFDTLRVSFQASRRFSLLKRDAFEAIKGMLPPKEKRGLVFLDPSYEVKEEYRAIA 201 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV---LPDSDRRG 234 +A +++FA G+Y +WYP++ ++ + +L+ +GIRKIL+IEL PD Sbjct: 202 KAVAGAHRKFAGGVYVIWYPLLPAERHNELFRELKHSGIRKILRIELDCGDLFPDMQ--- 258 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 M SGM+++NPPW EQ M L W+ KL G G SW+VPE Sbjct: 259 MHGSGMLIVNPPWHAEQAMQQSLNWVCDKLTD-GKGRKQFSWLVPE 303 >UniRef50_Q2SPJ4 Protein involved in catabolism of external DNA n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPJ4_HAHCH Length = 280 Score = 213 bits (541), Expect = 9e-54, Method: Compositional matrix adjust. Identities = 108/282 (38%), Positives = 171/282 (60%), Gaps = 4/282 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN AD KH SL++++L +K P+ YL+THAG G Y L SE A++T E Sbjct: 1 MLSYQHVYHAGNFADAHKHWALSLLLQALCKKSTPWRYLETHAGRGDYDLTSEEAQKTSE 60 Query: 61 YLEGIARIWQ-QDDLPAELEAYINVVKHFN-RSGQLRYYPGSPLIARLLLREQDSLQLTE 118 + GI + Q + P E +AY+ V+ N + +L YPGSP IA LRE D L L E Sbjct: 61 WTAGILPLMQAKGPCPPEFDAYLAAVRALNPNTERLTRYPGSPAIAAGFLRETDQLALCE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP +Y L+ +F ++ + + + DGF+ + A PP +RGL++IDP YE+K DYQ + + Sbjct: 121 LHPGEYAELKRQFGRNRQIHIHQRDGFEGVMAMSPPPEKRGLVMIDPSYELKEDYQRIPA 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 + + K+++ I A+WYP++ ++ ++M+ + + K L+ EL + P + RGM S Sbjct: 181 YVNKLTKKWSNAIIAIWYPILAEKRHEKMLELMRQLPLNKTLRSELILTPVA--RGMYGS 238 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 GM+V+N PW+L++Q+ +L L + W++ E Sbjct: 239 GMLVVNSPWRLDEQLQAGWAYLSEALRGDPKASCSADWLIAE 280 >UniRef50_Q5ZVZ2 Protein involved in catabolism of external DNA n=7 Tax=Legionella RepID=Q5ZVZ2_LEGPH Length = 287 Score = 212 bits (539), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 111/279 (39%), Positives = 160/279 (57%), Gaps = 9/279 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN ADV+KH + ++ L KDKP YL+TH+G G Y L + + +T E Sbjct: 6 MLSYQHGYHAGNFADVIKHITLTRLLAYLTHKDKPLFYLETHSGRGIYDLKDKQSLKTEE 65 Query: 61 YLEGIARIW-QQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 Y EGI +W +++LP+ YI+V+K N + L YYPGSP A LR QD L L EL Sbjct: 66 YKEGINPVWLDRENLPSLFLEYISVIKQINLNSTLSYYPGSPYFAINQLRSQDRLYLCEL 125 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP++Y L + + V DG +L A LPP +RGLI IDP YE K +Y+ + Sbjct: 126 HPTEYNFLLKLPHFNKKVYVNHTDGVSKLNALLPPPEKRGLIFIDPSYERKEEYKEIPYA 185 Query: 180 IAEGYKRFATGIYALWYPVVLR---QQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT 236 I Y +F+TG+Y +WYPVV + +Q R + ++ + +R IEL + P + GMT Sbjct: 186 IKNAYSKFSTGLYCVWYPVVNKAWTEQFLRKMREISSKSVR----IELHLNPLIN-EGMT 240 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 G+ +INPP+ ++ VL L + P + + S Sbjct: 241 GCGLWIINPPYTFPSEIKLVLETLTTYFNPGSSSYMIES 279 >UniRef50_A0YHR6 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YHR6_9GAMM Length = 256 Score = 211 bits (538), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 106/250 (42%), Positives = 152/250 (60%), Gaps = 2/250 (0%) Query: 31 EKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLEGIARIWQQDDLPAELEAYINVVKHFNR 90 +K+ F Y+D+HAGAG + L S+ A++ E+ GI+++ D P EL + ++ +N+ Sbjct: 9 KKESAFEYVDSHAGAGLFNLASKDAKKLEEHNYGISKL-VASDFP-ELLDFFTAIRAYNK 66 Query: 91 SGQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKA 150 S ++ +YPGSP IA+ LR+QD L ELHP DY L + + RV DG + L++ Sbjct: 67 SAKINFYPGSPAIAKHFLRKQDRAWLYELHPQDYKSLCKNVESSKKMRVFCQDGLKALES 126 Query: 151 KLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHD 210 LPP SRRGLILIDP YE+K++Y+ V YK+F+TG Y +WYPVV R+Q+ M Sbjct: 127 VLPPTSRRGLILIDPSYEIKSEYEHVFRACVNAYKKFSTGTYIVWYPVVERRQVDVMEKK 186 Query: 211 LEATGIRKILQIELAVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTG 270 +GI+ I + EL D+ RGMT+SG+ VINPPW L Q+M+ VLP L + L G Sbjct: 187 FILSGIKNIQRFELGRSADTRERGMTSSGVFVINPPWTLFQKMSAVLPRLATILGDKNDG 246 Query: 271 HATVSWIVPE 280 +V E Sbjct: 247 FFKCDILVAE 256 >UniRef50_B2HZ48 Protein involved in catabolism of external DNA n=17 Tax=Acinetobacter RepID=B2HZ48_ACIBC Length = 285 Score = 211 bits (537), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 111/287 (38%), Positives = 167/287 (58%), Gaps = 10/287 (3%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV+KH + ++ L KDKP+ Y+DTH GAG+Y L A+++GE+ Sbjct: 1 MNYRHHFHAGNFADVMKHVLLLQLLNRLNAKDKPYRYIDTHGGAGKYDLSQAPAQKSGEF 60 Query: 62 LEGIARIWQQDDL-----PAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 L GI R+ Q D+ P ++ Y+ +V+ YPGSP A +RE D + Sbjct: 61 LTGIHRLVQLSDMEKRQAPEAIQQYLKLVEELRAQEGKGSYPGSPWFALQGMREIDKATI 120 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEM-KTDYQA 175 E+ + LR D RA + + D ++ L A +PP +RGL++IDPPYE+ + D+ Sbjct: 121 FEMQRDVFQQLRHNIH-DKRAGLHERDAYEGLLAVIPPKEKRGLVMIDPPYELERKDFPQ 179 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 +V + YK++ TG++A+WYP+ R I+R + TGIR+ L E+ V PD G+ Sbjct: 180 LVELLQSAYKKWPTGVFAVWYPIKDRAMIERFEKKMFKTGIRRQLICEICVWPDDTPVGL 239 Query: 236 TASGMIVINPPWKLEQQMNNVLPWL--HSKLVPAGTGHATVSWIVPE 280 G++VINPPW+ +Q + L WL H ++ G GHA V W+V E Sbjct: 240 NGCGLLVINPPWQFSEQADQALQWLFPHLRMQETG-GHAAVRWLVGE 285 >UniRef50_A4SYR3 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SYR3_POLSQ Length = 289 Score = 208 bits (530), Expect = 1e-52, Method: Compositional matrix adjust. Identities = 120/287 (41%), Positives = 164/287 (57%), Gaps = 11/287 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAG+HAD+LKH ++E L+EK +DTHAGAG Y L A + E Sbjct: 1 MFSYRHAFHAGSHADILKHLTLIHLVEYLQEKPGALTIVDTHAGAGIYSLVDGFATVSKE 60 Query: 61 YLEGIARIWQ----QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 GI R+ Q + P + Y+ +++ N +L YPGSP I LLR QD L+L Sbjct: 61 AEGGIFRLSQFFGKNSETPESIRKYLEMIQAENTGEELNTYPGSPFIIARLLRPQDRLKL 120 Query: 117 TELHPSDYPLLR---SEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDY 173 ELHP + +LR E ++ + V AD F +LK LPP SRRGL+LIDP YE K DY Sbjct: 121 FELHPKEIDILRHNIGELKEAKQIDVYAADSFSRLKGLLPPPSRRGLVLIDPSYEDKQDY 180 Query: 174 QAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRM---IHDLEATGIRKILQIELAVLPDS 230 + + + + E +RFATG YA+WYP++ R++ + + + AT R L EL V Sbjct: 181 RYLENAMEEALQRFATGCYAIWYPILSRRESASLPDHLKKIAATHKRSWLHTELRVENAP 240 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKL-VPAGTGHATVSW 276 R + ASGM +INPPW LE+ ++ LP L L V AG + S+ Sbjct: 241 GERRLQASGMFIINPPWTLEKHLDEALPVLVKALGVDAGAKYVLKSF 287 >UniRef50_D0IYP9 ComJ n=10 Tax=Bacteria RepID=D0IYP9_COMTE Length = 288 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 114/270 (42%), Positives = 161/270 (59%), Gaps = 10/270 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKHTV ++ L +K+ LDTH GAG Y+L ++A ++GE Sbjct: 1 MFSYRHAFHAGNHADVLKHTVLIATVQYLTQKEAALTVLDTHGGAGLYRLDGDYASKSGE 60 Query: 61 YLEGIARIWQQDDLPAE--LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 EG+ R+ + L+ Y+ +V+ FN+ +R YPGSP I + LLR D L+ E Sbjct: 61 AEEGVLRLAAAKEAELAPVLQDYLQMVRRFNQGNAIRNYPGSPFITQALLRGHDRLKAFE 120 Query: 119 LHPSDYPLLRSEF-QKDSRARVE--KADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 LHP+D L Q + R +V DGF+ +K LPP SRR L+L DP YE+KTDY Sbjct: 121 LHPTDMRSLTGNMAQLEVRRQVAILHEDGFEGVKKFLPPPSRRALLLCDPSYELKTDYGR 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQ---IKRMIHDLEATGIRKILQIELAVLPD--S 230 V+ A+G KRF TG YA+WYP++ R + + + + + + L L V + S Sbjct: 181 VLDMAADGLKRFPTGTYAVWYPIIPRPEAHDLPKRLKTMATKAGKSWLHATLTVKSNKTS 240 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWL 260 +R G+ ASGM +INPP+ L+ Q+ +P L Sbjct: 241 ERGGLPASGMFLINPPFNLKDQLKPAMPQL 270 >UniRef50_D0KVW6 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVW6_HALNC Length = 312 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 112/292 (38%), Positives = 160/292 (54%), Gaps = 15/292 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+H FHAGNHADVLKH V +IE +++K FL L+THAGAG Y L + A R+ E Sbjct: 20 MNYQHHFHAGNHADVLKHLVLLQLIELMQQKPTGFLLLETHAGAGLYDLQATEARRSDEA 79 Query: 62 LEGIARIWQQ----DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 GIAR+ Q D +P ++ Y+ ++ F L YYPGSPL+A LR QD Sbjct: 80 SGGIARLLQATQAADTVPVLIQTYLKQIEQFGSVPNLGYYPGSPLLAVCALRPQDRYIGV 139 Query: 118 ELHP----------SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPY 167 EL P + P+L D R +G LKA LPP+ RRGL LIDPPY Sbjct: 140 ELVPKVARELSRNLAQRPMLEPCI-PDRRVIARDGEGLAALKADLPPLERRGLFLIDPPY 198 Query: 168 EMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVL 227 E + + + + G +RF TG+YALWYP+ R + R ++ + + R +L IE ++ Sbjct: 199 EQPQERDDIAAALQAGLQRFETGVYALWYPIKQRPYLDRWLNRIAKSTPRPVLTIENSIF 258 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 PD +T SG+++INPPW+ + M VL +++ L + W+ P Sbjct: 259 PDESGNRLTGSGLLIINPPWQFDTLMQPVLDFVNDALKQDTAAPRAIRWLNP 310 >UniRef50_A1WIT5 Putative uncharacterized protein n=4 Tax=Burkholderiales RepID=A1WIT5_VEREI Length = 293 Score = 208 bits (529), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 112/275 (40%), Positives = 157/275 (57%), Gaps = 15/275 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADV KHTV ++ L +KD LD+HAGAG Y+L ++A +GE Sbjct: 1 MFSYRHAFHAGNHADVFKHTVLIATLQYLTDKDAALTVLDSHAGAGLYRLDGDYARTSGE 60 Query: 61 YLEGIARIWQQ--DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +G+ R++ L L+AY+++V FN+ +LR YPGSP I + LLRE D L+L E Sbjct: 61 AADGVVRLFAAPGSALAPALQAYVDMVGAFNQGRRLRVYPGSPCITQRLLRESDKLKLFE 120 Query: 119 LHPSDYPLLR---SEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 HP+D L ++ Q + V DGFQ ++ LPP RR L+L DP YE+K+DY Sbjct: 121 WHPTDLRALAGHVAQLQAGRQVAVFHEDGFQGIRKFLPPPQRRALLLCDPSYEIKSDYGK 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLR---QQIKRMIHDLEATGIRKILQIELAVLPDS-- 230 V+ + KRFATG Y WYP++ R ++ R + L + + L L V Sbjct: 181 VLDLATDSLKRFATGCYMFWYPIIGRPEAHELPRRLKTLASKAGKSWLHATLTVKSGQRT 240 Query: 231 -----DRRGMTASGMIVINPPWKLEQQMNNVLPWL 260 R G+ ASGM +INPP+ L+ + LP + Sbjct: 241 AAGSLKRPGLPASGMFLINPPFTLKAALTPALPQM 275 >UniRef50_A5EWC5 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EWC5_DICNV Length = 288 Score = 206 bits (524), Expect = 6e-52, Method: Compositional matrix adjust. Identities = 118/263 (44%), Positives = 158/263 (60%), Gaps = 9/263 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGN+ADV KH + +K K+KP LY D+HAGAG Y L S HAE+TGE Sbjct: 1 MLSYRHSFHAGNYADVFKHFCLYQTLTFMKRKEKPLLYFDSHAGAGFYDLHSAHAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI R++ LP L + N ++ + S + Y GSP +A LL E D+LQ ELH Sbjct: 61 YCDGIMRLYAAQQLPPALIEFRNDLRLWLESENV--YCGSPWLAAHLLGEHDTLQACELH 118 Query: 121 PSDYPLLRSEFQ--KDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 P+D P L+ + + R+ V + DGF QL A +PP RR LI+IDP YE K+DY AV S Sbjct: 119 PNDAPALQHIIRSIRPRRSFVFQKDGFVQLLASVPPPQRRALIVIDPSYEQKSDYDAVCS 178 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDL-EATGIRKILQIELAVLPDSDRRGMTA 237 +++ K+FA G Y +W P +LR + + L E G R L+ +L V +S GM Sbjct: 179 VLSKALKKFAQGCYLIWSPCLLRTEAQDFPQQLAEVIGGRGYLRAQLKVRTES-ALGMYG 237 Query: 238 SGMIVINPPWKLE---QQMNNVL 257 + +INPP+ L Q+ NVL Sbjct: 238 CEIHIINPPYLLAPVLQEAGNVL 260 >UniRef50_C6QCA2 Putative uncharacterized protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QCA2_9RHIZ Length = 281 Score = 206 bits (523), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 103/266 (38%), Positives = 159/266 (59%), Gaps = 3/266 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN ADVLKH V + ++ +K+K +PF +DTHAGAGRY L A +TGE+ Sbjct: 1 MNYRHGYHAGNFADVLKHVVLARVLTYMKQKPRPFRVIDTHAGAGRYDLAGVEAGKTGEW 60 Query: 62 LEGIARIWQQDDLP--AEL-EAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +GI R++ + P AEL + Y++ V+ N SG L YPGS LIAR ++R +D L E Sbjct: 61 QDGIGRVFNAEFAPPVAELLQPYLDAVRADNASGDLEVYPGSSLIARRIMRPEDVLVANE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 L+ S++ L+ E + D + +K+ LPP RR ++LIDPP+E K+++ + Sbjct: 121 LNASEFERLKRELGRPRNTTFLNIDAWHAVKSLLPPKERRAVVLIDPPFEAKSEFADLAV 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+ E RF G+Y +WYP+ + R + + + + L + LAV G+TA+ Sbjct: 181 GVREAMSRFQDGVYVIWYPLKDVEAADRFVAEATSRPGLEFLDVRLAVCAPFPGLGLTAT 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKL 264 G++VINPP+ L ++ VLP L + Sbjct: 241 GVLVINPPYLLRGELETVLPALRDCM 266 >UniRef50_B1ZS65 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZS65_OPITP Length = 297 Score = 204 bits (519), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 110/296 (37%), Positives = 163/296 (55%), Gaps = 17/296 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLG----SEHAER 57 ++YRH FHAGN ADV+KH + +I +L++K+K YLDTHAG G Y LG + ER Sbjct: 1 MNYRHLFHAGNFADVMKHALLIELIGALQKKEKGIFYLDTHAGRGSYDLGLAARGDTLER 60 Query: 58 TGEYLEGIARIWQQ--------DDLPAELEAYINVVKHF-----NRSGQLRYYPGSPLIA 104 E+ +GI RI + L AY ++V+ F N +G R+YPGSP IA Sbjct: 61 QPEWPDGIGRILAARSTAAADANATGDPLRAYADLVRRFDAERGNTNGSPRFYPGSPAIA 120 Query: 105 RLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILID 164 ++L+R QD L L E P ++ LL +EF + R V DG+ ++A LPP RR L+LID Sbjct: 121 QVLVRRQDRLALCEQVPEEHALLAAEFARAPRTSVHAIDGYVAVRAMLPPPERRALVLID 180 Query: 165 PPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIEL 224 P+E + ++ + + +AEG R G++A+WYP+ R ++ L + L +EL Sbjct: 181 APFEAQDEFARIETALAEGLARLPAGVFAVWYPLTERARVDAFFAGLAERRLPPTLVLEL 240 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 AV ++ M G++V+NPPW E+ +L L +L A W+VPE Sbjct: 241 AVAGENSALKMRGCGLVVVNPPWHFERTAAPILEALARELAQAPGAAGRQQWLVPE 296 >UniRef50_C5BQN4 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BQN4_TERTT Length = 279 Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 105/272 (38%), Positives = 159/272 (58%), Gaps = 5/272 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH S+++E L EKDKP YL+THA AG Y L + ++ E Sbjct: 1 MLSYRHAFHAGNHADVLKHLCLSMVLEKLIEKDKPLTYLETHAAAGAYDLNTAMPQKNRE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y+ GI+ + + + Y +V + + YPGSP +A +LREQD L L ELH Sbjct: 61 YMSGISPLLASEVSSEAMSRYKALVARYFADYK---YPGSPAVAASVLREQDKLVLMELH 117 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 +++ +LR+ ++D R + DG + + A PP RRG++LIDPPYE +Y+ + + I Sbjct: 118 NTEFEILRNNMRRDKRVTLHHRDGIEGVLALSPPTPRRGIVLIDPPYEQPLEYERIATLI 177 Query: 181 AEGYKRFATGIYALWYPVVL--RQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 A+ ++++ G+ ALWYP++ R + M+ + + + EL V + GM S Sbjct: 178 AQLHRKWPVGVIALWYPLLAQERNRAPAMLDVIARSQPASLFTAELWVEAQASDYGMYGS 237 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTG 270 GM IN PW +++++ VLP + L P G Sbjct: 238 GMAFINLPWTVDEKIALVLPEIQQILAPDQGG 269 >UniRef50_Q0G6E9 Putative uncharacterized protein n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G6E9_9RHIZ Length = 301 Score = 199 bits (506), Expect = 9e-50, Method: Compositional matrix adjust. Identities = 108/276 (39%), Positives = 154/276 (55%), Gaps = 6/276 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 L+YRH+FHAGN ADV+KH + + +I LK K+KPF DTHAG GRY L + A RTGE Sbjct: 25 LNYRHAFHAGNFADVVKHALLTRLIAYLKRKEKPFRVFDTHAGRGRYDLNASEASRTGEA 84 Query: 62 LEGIARIWQQDDLPAE--LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 G+ +I Q L AE L Y + R G +YPGSPLIAR LRE D L EL Sbjct: 85 QAGVLKIAQSTTLRAEPLLADYFAAIDPDLREG---FYPGSPLIARRCLRETDRLSAYEL 141 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP D LR F D + + DG+ L A LPP +RGL+LID P+E ++ ++SG Sbjct: 142 HPEDGGALRDLFAGDVQVKAISLDGWLALGAHLPPKEKRGLVLIDSPFEKPSEVDDILSG 201 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM-TAS 238 + + R+ G+YA+WYP+ R +++++ + + L +E+ + D G+ + Sbjct: 202 LEKALSRWRGGVYAIWYPIKRRALVEKLLTAIAGMAAGEALAVEVRIAADESAEGLFLGT 261 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 G VINPP+ ++ ++ L L G+ A V Sbjct: 262 GFAVINPPFVFAEEAKAIVDLLLPALKRDGSATARV 297 >UniRef50_B4RYZ5 Putative uncharacterized protein n=2 Tax=Alteromonas macleodii RepID=B4RYZ5_ALTMD Length = 292 Score = 196 bits (499), Expect = 6e-49, Method: Compositional matrix adjust. Identities = 107/269 (39%), Positives = 153/269 (56%), Gaps = 16/269 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H+FHAGNHADV+KH +I SLK+KDKPF DTHAGAG Y L + + E Sbjct: 1 MLSYQHAFHAGNHADVIKHLCWIGVINSLKKKDKPFTLFDTHAGAGTYDLNDAMSSKNKE 60 Query: 61 YLEGIARI----WQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 Y GI+RI + D LP L+ Y+ + + F Q YPGSP I+ R D+L L Sbjct: 61 YETGISRIINTGAEHDSLPELLKNYLTLCEPFLAKHQ---YPGSPAISATAKRATDNLHL 117 Query: 117 TELHPSDYPLLRSEFQKDS--RARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 ELHP+++ L + K + V K DG++ L+A PP RG ILIDPPYE ++Y Sbjct: 118 MELHPAEFDKLEANMGKLHLRKMHVHKRDGYEGLRALTPPKPNRGAILIDPPYERASEYG 177 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKR------MIHDLEATGIRKILQIELAVLP 228 V+ G+ + +KR+ +WYP++ + + M L A G + + E+ V Sbjct: 178 EVIKGVEQVFKRWQQAQIVVWYPLLSERAGAKHGASELMCDKLAALG-KPCFKAEICVEK 236 Query: 229 DSDRRGMTASGMIVINPPWKLEQQMNNVL 257 ++ GM SG+ V+NPPW+L+ Q+ + L Sbjct: 237 NTPEAGMYGSGVFVLNPPWQLDSQLESAL 265 >UniRef50_B1LXQ5 Putative uncharacterized protein n=9 Tax=Alphaproteobacteria RepID=B1LXQ5_METRJ Length = 282 Score = 195 bits (496), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 103/266 (38%), Positives = 154/266 (57%), Gaps = 2/266 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGNHADVLKH V + +++ L+ KDKPF LD AG G Y L ++ A RTGE+ Sbjct: 1 MNYRHAFHAGNHADVLKHLVLARVLDHLRLKDKPFRALDAFAGLGVYDLEADEAARTGEW 60 Query: 62 LEGIARIWQ--QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 +G R+ ++ A L Y V YPGSP + R LR D EL Sbjct: 61 RDGWGRMAAPFAPEVEALLAPYRAAVAAVRARHGDTAYPGSPAVIREALRPGDKGVFVEL 120 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP+D L+ + +D+R +V DG+ + A++PP RRGL+LIDPPYE+ + + + + Sbjct: 121 HPADAATLQGRYARDARTKVMNLDGWTAINAQIPPPERRGLVLIDPPYEVPGEIERLGAH 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 +A ++ TG++ WYP+ + RM+ DL A R L+++L + D +T SG Sbjct: 181 LARAVAKWPTGLFLAWYPIKDTAVLDRMVRDLGAALPRPALRLDLLIDRPGDPTRLTGSG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLV 265 +IV+NPPW+L ++ LP L +L Sbjct: 241 LIVVNPPWRLAEEAMLFLPALAERLA 266 >UniRef50_Q5QVX6 Transformation competence-related protein ComJ n=2 Tax=Idiomarina RepID=Q5QVX6_IDILO Length = 283 Score = 195 bits (496), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 103/283 (36%), Positives = 161/283 (56%), Gaps = 4/283 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV KH + + +E ++K+KP+ LDTH G G Y L + A RT E Sbjct: 1 MNYRHIFHAGNFADVFKHLLLARALEYFQQKNKPYFVLDTHGGIGYYDLQGDQAIRTAEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRS-GQLRYYPGSPLIARLLLREQDSLQLTELH 120 +GI R + AY++ V+ N +LRYYPGSP+I LRE D L + ELH Sbjct: 61 EQGIVRFAEHSAEEPLAAAYLSTVRQLNEEQDKLRYYPGSPVITSEFLRENDRLVVCELH 120 Query: 121 PSDYPLLRSE-FQKDSRARV-EKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 D L++ + + ++ DG+Q ++A+LPP +RGL+LIDPP+E T++ VVS Sbjct: 121 KEDAETLKNTPLGRHKQVQILAPMDGYQAVRAQLPPAEKRGLVLIDPPFENTTEFDDVVS 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEA-TGIRKILQIELAVLPDSDRRGMTA 237 + +G KR+ +G +A+WYP+ + D+ A + + K L +EL + + +R+G+ Sbjct: 181 ALEQGLKRWKSGSFAVWYPIKDELKTAAFHRDVGALSDLPKTLIMELNIRTNDERKGLHG 240 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G + +NPP+ + Q ++LP L L + W+V E Sbjct: 241 CGFLWVNPPYGVVQDSEHLLPVLCKTLAQDKGANFHSRWLVGE 283 >UniRef50_Q0ARP7 Putative uncharacterized protein n=2 Tax=Hyphomonadaceae RepID=Q0ARP7_MARMM Length = 311 Score = 195 bits (496), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 104/282 (36%), Positives = 166/282 (58%), Gaps = 14/282 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN ADVLKH+V +L +E L K KP+ +DTHAG G Y L S AER+ E+ Sbjct: 16 MNYRHAFHAGNFADVLKHSVLALCLEHLNAKPKPYRVIDTHAGIGGYDLASSEAERSPEW 75 Query: 62 LEGIARIWQQDDLPAELEA----YINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 +GI R+ D LP ++A ++++V+ N G ++ YPGSP IA L+RE+D + L Sbjct: 76 KDGIGRLIDAD-LPEPVQAMLGPWLDIVREMNPDG-IKAYPGSPEIAARLIREEDRVHLC 133 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELH +D L + +++D+R +VE+ DG++ LK+ +PP +RGL+LIDPP+E + + + Sbjct: 134 ELHEADSVTLDNRYRRDARIKVERRDGYKALKSLVPPKEKRGLVLIDPPFEDRDELAHMA 193 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGI-------RKILQIELAVLPDS 230 + ++ TG + W + R + L I KIL+ +L + + Sbjct: 194 EAVMGALAKWPTGTFIFWRSLKNLWAADRFDNGLAEWLISEKDFEPEKILRADLWIRDLA 253 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHA 272 + +G+++INPP+ LE+ + N +PWL L G G+ Sbjct: 254 SEGKLAGAGVVIINPPFTLEETLVNAMPWLAETLA-QGNGYG 294 >UniRef50_B5EL93 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EL93_ACIF5 Length = 285 Score = 194 bits (494), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 101/270 (37%), Positives = 151/270 (55%), Gaps = 8/270 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y H +HAGN AD +KH SL +++L KD P Y++THAGAGRY LG++ GE+ Sbjct: 1 MNYDHQYHAGNTADCVKHLALSLTLQTLVRKDSPLAYIETHAGAGRYALGTQ-----GEH 55 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L+G++R+W A++ +V + N G LR+YPGSP +A LLR D + L E P Sbjct: 56 LQGVSRLWADRRSLPHAGAWLKIVSNENADGTLRHYPGSPALAAALLRPTDRMVLCEEQP 115 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 LR K + V DG++ L ++PP +RGL+LIDPP+E + +++ + + Sbjct: 116 EVATRLRKAIGKRAHTSVVGEDGYRTLFGQIPPPEKRGLVLIDPPFERRDEWERLTDTLI 175 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 Y+R+ G+Y +WYPV +R I R+ L EL +P+ R + SG+I Sbjct: 176 RAYQRWPQGVYLVWYPVKIRGTITRLWQALRER--LPAFACELLQMPEEGREQLFGSGLI 233 Query: 242 VINPPWKLEQQMNNVLPWLHSKL-VPAGTG 270 V+NPPW L + + L L L P G G Sbjct: 234 VVNPPWGLREALAAALTELGPLLSAPQGGG 263 >UniRef50_Q0BPC8 Putative uncharacterized protein n=3 Tax=Acetobacteraceae RepID=Q0BPC8_GRABC Length = 294 Score = 192 bits (488), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 100/271 (36%), Positives = 150/271 (55%), Gaps = 12/271 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN AD KH + ++ +L++K+ PF LDTHAG G L A RTGE+ Sbjct: 1 MNYRHAFHAGNFADCHKHALMVALLTALRQKEAPFFVLDTHAGTGETLLTDGPAARTGEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 EGI + DD L +Y+ +V L YPGSPLIAR +LR QD + + ELHP Sbjct: 61 REGIGLLL--DDPAPVLASYLALVTSLGMERSL--YPGSPLIARAMLRPQDRMAVCELHP 116 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPP--------VSRRGLILIDPPYEMKTDY 173 D L F+ D + + DG++ L+ LPP + RRGL LIDPP+E ++ Sbjct: 117 EDCASLAERFRGDPYCAIHRRDGWKALETMLPPKTASSGGVLPRRGLTLIDPPFEQPDEH 176 Query: 174 QAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRR 233 + + + +RF TG+ A WYP+ + + H L+ G++++L EL + P +D Sbjct: 177 RRLADAMLRAQQRFPTGMVAGWYPIKGGAPARLLRHQLQDAGLKRVLIAELFLHPPTDTT 236 Query: 234 GMTASGMIVINPPWKLEQQMNNVLPWLHSKL 264 + SGM ++NPPW+ ++ L + L Sbjct: 237 RLNGSGMAILNPPWQFGDDARAIMQALKTGL 267 >UniRef50_A1VJI9 Putative uncharacterized protein n=4 Tax=Comamonadaceae RepID=A1VJI9_POLNA Length = 340 Score = 189 bits (481), Expect = 7e-47, Method: Compositional matrix adjust. Identities = 116/323 (35%), Positives = 165/323 (51%), Gaps = 64/323 (19%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKHT +++ L +KD +DTHAGAG Y+L ++ E +GE Sbjct: 1 MFSYRHAFHAGNHADVLKHTCLIALMKYLTQKDTALTVIDTHAGAGLYRLDGDYTETSGE 60 Query: 61 YLEGIARIWQQDDL-----------------------------------------PAELE 79 EGI ++ + PA L+ Sbjct: 61 AQEGIFKLLLASKMASAQTGKAGAAIKKVAPAATAKAAPAAPEPAAKPASDYAWAPALLD 120 Query: 80 AYINVVK----HFNRSG---QLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQ 132 Y+ +++ HF ++G L+ YPGSP I + L +D L+L ELHP+D+ L + Sbjct: 121 -YLELLRSLNPHFAQTGDPAHLKIYPGSPFIEQKFLSGRDKLKLFELHPTDFKSLSGNIE 179 Query: 133 KDSRAR---VEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFAT 189 + R V + DGF+ LK LPP +RR ++L DP YEMKTDY V S +A+ KRFAT Sbjct: 180 QLGVGRQVVVAREDGFEALKTFLPPPARRAMVLCDPSYEMKTDYLRVSSCMADAVKRFAT 239 Query: 190 GIYALWYPVVLRQQ---IKRMIHDLEATGIRKILQIELAV-----LPDSD----RRGMTA 237 G Y +WYP++ R + + R + + R L L V D++ R G+ A Sbjct: 240 GTYVVWYPIIPRPEAHDLPRKLKTIAVKAGRSWLNATLTVKSSKLTTDTEGEVVRPGLPA 299 Query: 238 SGMIVINPPWKLEQQMNNVLPWL 260 SGM VINPP L+ ++ LP + Sbjct: 300 SGMFVINPPHTLKAELQAALPQM 322 >UniRef50_C8NAD4 Cytoplasmic protein n=34 Tax=Proteobacteria RepID=C8NAD4_9GAMM Length = 280 Score = 189 bits (481), Expect = 7e-47, Method: Compositional matrix adjust. Identities = 112/282 (39%), Positives = 153/282 (54%), Gaps = 5/282 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH++HAGNHAD+LKH + + + +K KP+ Y+DTHAGAG Y L + +A++ E Sbjct: 1 MLSYRHAYHAGNHADLLKHYLLTRTLAYYNQKPKPYDYIDTHAGAGYYDLTAAYAQKNRE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y GIAR+ LPA L A+ + + + YPGS IA LL L L ELH Sbjct: 61 YQSGIARLNAAAHLPAALAAWRDHMHAHQPAPDT--YPGSAWIAARLLPAPGKLHLHELH 118 Query: 121 PSDYPLLRSEFQKDSRA---RVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P+D+ L + +ADGF L A LPP SRR +ILIDPPYE K+DYQ + Sbjct: 119 PADHAALTENLRPLRLGRRLHTHRADGFAGLIALLPPASRRAVILIDPPYEQKSDYQTTL 178 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 +A YKRF +G Y +WYP + R + + L L+ EL V ++ GM Sbjct: 179 DTLAAAYKRFPSGTYLIWYPCLPRDESRHFPAQLNQHFGDNYLRAELHVRAENGAHGMYG 238 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 SGM +INPP+ L ++ LP L + T+ +P Sbjct: 239 SGMYLINPPYTLPAELKTTLPALRDLCAESADSRITLDARIP 280 >UniRef50_Q1YIC4 Putative uncharacterized protein n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YIC4_MOBAS Length = 281 Score = 189 bits (480), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 107/279 (38%), Positives = 154/279 (55%), Gaps = 15/279 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + + +I LK K+KPF DTHAG G Y L S+ A RTGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHALLTRLIAYLKRKEKPFRVFDTHAGRGSYSLTSDEARRTGEH 60 Query: 62 LEGIARIWQ------QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQ 115 +G+ R+ Q D L AE + +R YPGSPLIAR LLR QD L Sbjct: 61 ADGVGRLVQAAADVMDDPLLAEYRGALASDLSEDR------YPGSPLIARRLLRPQDRLS 114 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 ELHP+D L++ F D + + DG+ L + +PP +RGL+LIDPP+E + A Sbjct: 115 AYELHPADAAALKTLFAGDVQTKAIALDGWLALGSHVPPKEKRGLVLIDPPFERTDEVDA 174 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 + G+A+ +R+A GIYA+WYP+ + + L+A + + + E P + Sbjct: 175 IAEGLAKALQRWAGGIYAVWYPLKRPALVAALHERLDALPVSERVTAEFFREPYTADERF 234 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 +G+ VINPP+ + VL L L G+G A Sbjct: 235 VGTGLTVINPPFVFAAEAEAVLTTLAPLL---GSGEAAT 270 >UniRef50_Q21LZ8 Putative uncharacterized protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21LZ8_SACD2 Length = 289 Score = 187 bits (475), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 100/261 (38%), Positives = 151/261 (57%), Gaps = 14/261 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY H +HAGN ADV KH L+++ L K+ P Y+DTHAGAG Y L E AE+T E Sbjct: 1 MLSYLHGYHAGNFADVHKHCTLMLLLKKLHAKNTPITYIDTHAGAGLYALDDEKAEKTRE 60 Query: 61 YLEGIARIWQQDD--LPAELEAYINVVKHFNRSGQL----RYYPGSPLIARLLLREQDSL 114 +G+ + + ++ Y++++ S Q + YPGSP IA+ LLREQD Sbjct: 61 SQQGVDALLASKTGITHSAIKEYLHLLASVRLSKQHTLGEQAYPGSPAIAQALLREQDFG 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 L ELH ++ L+ F++D+R + DGF+ L A PP + RGL LIDP YE+ +DY Sbjct: 121 ILMELHNNEVGKLKQHFKRDTRLSIHHRDGFEGLAALTPPSTARGLALIDPSYELTSDYH 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQ-----IKRMIHDLEATGIRKILQIELAVLPD 229 +++ + R+ TG++A+WYP++ ++ IKR + L+ + +L EL + Sbjct: 181 QLITSLQTATARWRTGVFAVWYPILAGEKNHADFIKRKLAQLD---VASVLNSELHIYTK 237 Query: 230 SDRRGMTASGMIVINPPWKLE 250 + GM SGM +IN PW+L+ Sbjct: 238 EENDGMIGSGMAIINAPWQLD 258 >UniRef50_C7JEW3 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JEW3_ACEP3 Length = 273 Score = 187 bits (474), Expect = 4e-46, Method: Compositional matrix adjust. Identities = 103/280 (36%), Positives = 155/280 (55%), Gaps = 8/280 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN AD +KH + +++S K PF+ LDTHAG GRY L S AE+T E+ Sbjct: 1 MNYRHAYHAGNFADCMKHALLVTLLQSFLRKPAPFMVLDTHAGIGRYDLHSPEAEKTQEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 +GI ++W +D + L ++ VK ++G +YPGSPLI +LR QD+L E HP Sbjct: 61 RDGIGKLWNEDAA-SPLADWLEQVK---KTGGPEFYPGSPLIIAQMLRAQDALICCEKHP 116 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPP-VSRRGLILIDPPYEMKTDYQAVVSGI 180 D L F V + D ++ L+A LPP ++RGLILIDPP+E ++ + + Sbjct: 117 EDKRSLYRLFTNTPNVTVHERDAYEALRALLPPQTAKRGLILIDPPFEEPGEFDRLAQAV 176 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 RFA I A+WYP+ R ++ L TGIR I EL + P + + +G+ Sbjct: 177 QTIQARFANAIIAIWYPIKHRTPVRIFHETLMGTGIRNICVAELLMRPPYNPDQLNGAGL 236 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +VI PP+ ++ + L L L G + V+ +V E Sbjct: 237 LVIRPPFGFAEKASAQLERLQHVL---GAHESCVTQLVEE 273 >UniRef50_B7QYF1 Protein involved in catabolism of external DNA n=35 Tax=Alphaproteobacteria RepID=B7QYF1_9RHOB Length = 266 Score = 186 bits (473), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 103/255 (40%), Positives = 147/255 (57%), Gaps = 4/255 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN ADV KH + + ++ L +KDKP YL+THAG G YQL + A +TGE Sbjct: 1 MLSYQHIYHAGNLADVQKHALLARMLAYLTQKDKPLSYLETHAGRGLYQLDAAEAVKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 GI+R+ D L A+ + + YPGSP+IA LLRE DSL ELH Sbjct: 61 AEAGISRLLN-DALLAQDHPLAEAIARTRAAHGAAAYPGSPMIAAHLLREGDSLNFAELH 119 Query: 121 PSDYPLLRSEFQ---KDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P + LR + K R RV + DGF+ + PP RRG++LIDP YE+K DY + Sbjct: 120 PQENAALRQAMRPHAKGGRVRVHQQDGFELALSLAPPTPRRGMLLIDPSYEIKRDYAQIP 179 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 IA+ ++++ G+ ALWYP++ K M++ LEA + +L+ E+ P + M Sbjct: 180 GHIAKLHRKWNVGVIALWYPILTDGAHKPMLNALEAQDLPGVLRHEVRFPPAREGHRMVG 239 Query: 238 SGMIVINPPWKLEQQ 252 SGM ++N P+ E + Sbjct: 240 SGMFIVNAPYGTEDE 254 >UniRef50_D2LG58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LG58_RHOVA Length = 278 Score = 186 bits (472), Expect = 7e-46, Method: Compositional matrix adjust. Identities = 101/284 (35%), Positives = 152/284 (53%), Gaps = 12/284 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV+KH V + ++ L K+ P LD H GAG Y L SE AE+TGE+ Sbjct: 1 MNYRHVFHAGNFADVIKHAVLAFCVDYLLRKESPLCLLDAHGGAGLYDLRSEEAEKTGEW 60 Query: 62 LEGIARIWQQDDLPAE----LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 G+ + Q A LE Y+ +V+ G +YPGSPL+ LR QD L Sbjct: 61 ARGVGAVMQAAGGTASAAEALEPYLRLVREDVADG---FYPGSPLLLARRLRPQDRLIAN 117 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELH S LR + RV AD ++ ++A +PP RRGL+LIDPP+E K +++ ++ Sbjct: 118 ELHESTRGALRGTLAEFPSVRVTGADAYECIRATIPPKERRGLVLIDPPFEEKDEFETLI 177 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 + E KR+ATG++ LWYP+ + + + A G+ + +E + P + Sbjct: 178 RQMREWKKRWATGVFLLWYPIKAVSPLGALKAEAAALGLPRTWCVETLIYPRGRALSLNG 237 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHAT-VSWIVPE 280 G+I+ N P+ + + + LP A H T +W+VP+ Sbjct: 238 CGLILFNAPYSVPEAVEATLP----AFADAMRLHETHTAWLVPD 277 >UniRef50_A3JEY3 Protein involved in catabolism of external DNA n=3 Tax=Marinobacter RepID=A3JEY3_9ALTE Length = 287 Score = 183 bits (465), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 110/285 (38%), Positives = 156/285 (54%), Gaps = 12/285 (4%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY H+FHAGN ADV KH L + ++ K DTHAG+ Y L E A +T E Sbjct: 10 MLSYLHAFHAGNFADVHKHAALVLALNMMQAKASGIACTDTHAGSALYDLDDERARKTAE 69 Query: 61 YLEGIARIWQQDD--LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 GI ++W Q D A+ + ++ N LR YPGSP LR QDSL + E Sbjct: 70 ADAGIRKLWPQLDSLAAADWQLLRPYLQQLNSGANLRQYPGSPAWFGHYLRAQDSLGVFE 129 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHPS+ L +++ R RV + DG L LPP R L+LIDP YE+KTDY AV Sbjct: 130 LHPSETSSL-NQWASGKRLRVTQQDGLAGLLKVLPPRQPRLLVLIDPSYEVKTDYTAVAE 188 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 ++ +++ G++ +WYP++ + ++ L A IRKIL+ E+ L RGM S Sbjct: 189 TLSRAWQKCRHGVFLVWYPILTSGLEQTLLEGLRAGPIRKILRSEVR-LHTPPERGMVGS 247 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPA---GTGHATVSWIVPE 280 GM+VINPPW ++++++ ++ L PA G G + W+ PE Sbjct: 248 GMLVINPPWGMDERLSAMM----RDLEPAARLGLGQ-QMDWLAPE 287 >UniRef50_Q87F97 Transformation competence-related protein n=20 Tax=Xanthomonadaceae RepID=Q87F97_XYLFT Length = 293 Score = 176 bits (447), Expect = 6e-43, Method: Compositional matrix adjust. Identities = 103/294 (35%), Positives = 157/294 (53%), Gaps = 16/294 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y H+FHAGNHADVLKH V +++ L K+ PF LD+HAG GRY L + + T E Sbjct: 1 MNYSHAFHAGNHADVLKHIVLLALLDGLVRKETPFFVLDSHAGRGRYLLSAGESRNTREA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSG--------QLRYYPGSPLIARLLLREQDS 113 G+ R+ + ++ Y++VV+ N S + YPGS L+A + R QD Sbjct: 61 ESGVMRLIARPQRLEVIKRYVDVVQADNVSQTRAASTPMHISRYPGSSLLAAQVCRAQDR 120 Query: 114 LQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPP---VSR--RGLILIDPPYE 168 + ELHP + L + F D R RV DG+ ++A LPP R RGL+ IDPPYE Sbjct: 121 MVFCELHPKEAAALNALFVHDPRVRVHAGDGYAAVRAFLPPKVGTQRIGRGLVFIDPPYE 180 Query: 169 MK-TDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVL 227 + +Y V+ + E R+ I A+WYP+ R++++ +R +L EL V Sbjct: 181 AQDAEYPLVLGALRETLTRWPQAICAVWYPIKQRRRLQPFFRKAVGLPVRSVLIAELLVR 240 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI-VPE 280 D + SGM+++N PW+ +Q + LP L ++L +G + W+ VP+ Sbjct: 241 LDDSPLRLNGSGMLLLNVPWQFDQLLAPALPVLKTQLGESG-ARTRLEWLKVPQ 293 >UniRef50_C6NTA4 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NTA4_9GAMM Length = 290 Score = 175 bits (444), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 95/267 (35%), Positives = 142/267 (53%), Gaps = 7/267 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH HAGN AD LKH SL +E L KD P YL+THAGAGRY L GE+ Sbjct: 1 MNYRHDHHAGNAADCLKHLALSLALERLLHKDAPLFYLETHAGAGRYSLAD-----AGEH 55 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 G+ R+W L ++++++ G LR+YPGSP++A LLR D + L E Sbjct: 56 SAGVDRVWAARRQLKGLSPWLDLLEEGAEDGVLRHYPGSPVVAARLLRPGDRMVLAEKVA 115 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 LR R + DG+ L+ LPP RRGLIL+DPP+E + +++A+ I Sbjct: 116 VVRERLRHNLAGRGRTSILGDDGYAILRGHLPPPERRGLILMDPPFERRDEWEALAKAII 175 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 + R+ G +WYP+ +R I R++ L+ ++ +EL + ++ M SG+I Sbjct: 176 GAHARWPQGCQIVWYPIKVRGMISRLLQSLQRALDMEV--VELRLESETGGTSMVGSGLI 233 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAG 268 ++ PPW L +++ L L L G Sbjct: 234 LVRPPWGLRERLLAALAVLGPVLAQGG 260 >UniRef50_Q1RK44 ComJ n=12 Tax=Rickettsia RepID=Q1RK44_RICBR Length = 262 Score = 173 bits (439), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 96/264 (36%), Positives = 146/264 (55%), Gaps = 12/264 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN AD++KH V I+E LK K+KPF LD AG G Y L SE A +T EY Sbjct: 1 MNYRHIYHAGNFADIVKHLVLIAILEQLKNKEKPFAVLDAFAGLGLYDLASEAASKTLEY 60 Query: 62 LEGIARIWQQ-DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 GI ++ Q D P L+ +++V+ N GQ +YPGSP I + LLR QD L ELH Sbjct: 61 NNGIGKLLQALDHTPNSLKIFLSVI---NSVGQ-NFYPGSPFIIQQLLRPQDRLIACELH 116 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+DY L+ ++ D + +KA LP RGLI +DPP+E+K ++Q +++ + Sbjct: 117 PADYLDLKKLLPNNTHC----IDAYNAIKAFLPFKENRGLIFLDPPFEVKNEFQKLITAL 172 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + +WYP+ + H+ + G ++ L IE +L S + M G+ Sbjct: 173 KKIKVSALNNSTLIWYPIKDLLLVSDFYHNYKTIGFKETLIIEYELL--SSDKNMVKCGL 230 Query: 241 IVINPPWKLEQQMNNVLPWLHSKL 264 ++INPP + Q++ + +L L Sbjct: 231 MLINPP-NIRQELEELTKYLSYTL 253 >UniRef50_Q2G473 Putative uncharacterized protein n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G473_NOVAD Length = 276 Score = 170 bits (431), Expect = 5e-41, Method: Compositional matrix adjust. Identities = 98/278 (35%), Positives = 153/278 (55%), Gaps = 6/278 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRHSFHAGN ADV+KH++ ++ +L+ KD +DTHAG G Y L + A+RTGE Sbjct: 1 MNYRHSFHAGNSADVVKHSLLIALVRALQLKDSALTLIDTHAGCGLYDLHGDAAQRTGES 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 +G+ R D L+ Y V+ N + YPGSP I LLR QD+L + E HP Sbjct: 61 AQGVLRALA--DPNPLLDDYRAAVQAVNVGAEPHLYPGSPRILVQLLRPQDALIVNEKHP 118 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LR + + A V + D ++ A LPP + RG++++DPPYE + + + +A Sbjct: 119 EDAYALRGAM-RGTGAAVHERDAYEFWLAMLPPRTPRGVVVVDPPYEQTDERARITATLA 177 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM-TASGM 240 +++++ G+ +WYP+ R R L GI K L +E L D+D+ G+ +G+ Sbjct: 178 AAHRKWSHGVTVIWYPLKDRATHVRWKEQLRRLGIPKFLNVE-HWLYDADQPGIYNGAGL 236 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAG-TGHATVSWI 277 ++NPP+ Q + ++L L + L P G G W+ Sbjct: 237 FIVNPPYAFTQALPSMLEALRAALAPEGHQGEIAAEWL 274 >UniRef50_UPI0000E1171F protein involved in external DNA uptake n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E1171F Length = 288 Score = 166 bits (421), Expect = 6e-40, Method: Compositional matrix adjust. Identities = 98/271 (36%), Positives = 146/271 (53%), Gaps = 19/271 (7%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGNHAD++KH ++ L +K+KP +DTHAGAG Y L + A+ E Sbjct: 1 MLSYQHIYHAGNHADLIKHLTLLSVLLKLGQKNKPCTLIDTHAGAGEYDLSATKAQHNNE 60 Query: 61 YLEGIARI----WQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 L GI + + Q D A L AY + S + Y GS + LREQD Sbjct: 61 SLTGIGMLDEAFFSQTD-SALLHAYGEGLYTGVVSDK---YCGSAGWMQRYLREQDQAHF 116 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP+ YP L + K A + DGF+QL A +PP+++RG++L+DPPYE ++Y V Sbjct: 117 CELHPNVYPELLNYVYK-PNAHCYQEDGFKQLIALVPPLAKRGIVLVDPPYEQASEYSMV 175 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIK------RMIHDLEATG----IRKILQIELAV 226 + I + KR+ATG Y +WYP++ Q RM+ + + I+ Sbjct: 176 LDVIEKSLKRWATGCYLIWYPMINTQNTNKAQAAIRMLKGFNTLADEHSVSNMANIQWRY 235 Query: 227 LPDSDRRGMTASGMIVINPPWKLEQQMNNVL 257 +D +GM SG+I IN PW + ++++ + Sbjct: 236 DTTNDAQGMYGSGIIAINLPWGCDNEISDAM 266 >UniRef50_C8PZ79 Protein involved in catabolism of external DNA n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PZ79_9GAMM Length = 295 Score = 165 bits (418), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 89/270 (32%), Positives = 155/270 (57%), Gaps = 12/270 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+HS+HAGN ADV+KH + ++E + K KP+ LD + G G Y L S+ A++TGE Sbjct: 1 MNYQHSYHAGNFADVVKHVLLLQLLEMMSAKPKPYYILDAYGGRGLYSLASDEAKKTGEA 60 Query: 62 LEGIARIWQQDD--LPAELEAYINVVKHFNRSGQLRYYPGSP-LIARLLLREQDS----- 113 + GI ++ QD+ P ++ Y+ + + + YPGSP IA + ++QD+ Sbjct: 61 IHGITKLLAQDNSQAPQAVQTYLQDIGYAKKFYDKHVYPGSPWFIAHHIEKQQDAHPEIN 120 Query: 114 --LQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMK- 170 + E S++ L + + V+ + ++ + A LPP +RGLILIDPP+E + Sbjct: 121 NRAEAFEWKASEFDALNYQLHQLPIG-VQHRNAYEGILAVLPPQEKRGLILIDPPFEQEH 179 Query: 171 TDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDS 230 D+ A+V + + +K+++TG+ ALWYP+ ++ ++ T IR+ L +EL + P Sbjct: 180 RDFSALVDLLVKAHKKWSTGVLALWYPIKNNDAVELFYKKMKRTEIRRQLVLELNIFPPD 239 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWL 260 G+ +GM+VINPPW+ + + +L +L Sbjct: 240 LPMGLNGTGMLVINPPWQFDAKAEEILQYL 269 >UniRef50_Q73R01 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73R01_TREDE Length = 279 Score = 164 bits (414), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 89/282 (31%), Positives = 157/282 (55%), Gaps = 19/282 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH FHAGN ADV KH+ ++ +K KPF D +AG+ Y L SE + +TGE Sbjct: 1 MLSYRHGFHAGNQADVFKHSALFSFLKVYTQKQKPFTAFDLNAGSASYNLLSEWSLKTGE 60 Query: 61 YLEGIAR---IWQQDDLPAEL----EAYIN-VVKHFNRSGQLRYYPGSPLIARLLLREQD 112 EGI R +++++ LP + +AY++ +K+++ + Y GSP I R L+++ Sbjct: 61 AEEGIIRFLDLYKKEKLPLPIPEGFKAYLDFCLKNYDENSS---YAGSPEIIRSFLQKES 117 Query: 113 SLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTD 172 +L L +LH ++ L+ +++ V K D ++ ++A PP+ RG L DP YE+ +D Sbjct: 118 NLILCDLHSAEAEKLKELYKRVENVHVHKRDCYEAVRALTPPLPIRGFALFDPSYEVDSD 177 Query: 173 YQAVVSGIAEGYKRFATGIYALWYPVV--LRQQIKRMIHDLEATGIRKILQIELAVLPD- 229 Y A+ + + K++ GI+ +WYP++ ++ + + + K+L IE+ + Sbjct: 178 YTAIAESVEKVCKKWPIGIFIIWYPILNHKTEECRNLKDRISKAMNNKVLNIEVKHFSNK 237 Query: 230 ---SDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAG 268 + G+ SG+++ NPPW LE+++ + ++ V AG Sbjct: 238 IDSENEYGLQGSGLLITNPPWGLEEKLKEICEYVEK--VSAG 277 >UniRef50_A5WD58 Putative uncharacterized protein n=3 Tax=Psychrobacter RepID=A5WD58_PSYWF Length = 291 Score = 163 bits (413), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 91/292 (31%), Positives = 156/292 (53%), Gaps = 14/292 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+H++HAGN ADV KH + ++ + +K KP+ LD + G G Y L SE A +TGE Sbjct: 1 MNYKHAYHAGNFADVAKHILLVQLLNQMSKKGKPYYALDAYGGRGLYSLSSEEARKTGEA 60 Query: 62 LEGIARIWQQD--DLPAELEAYINVVKHFNRSGQLRYYPGSP-LIARLLLREQD---SLQ 115 G+ +I + D + P + Y++ +K ++ YPGSP IA + + + + Sbjct: 61 KAGVQKILEADVSEAPEAVRQYVDDIKQARQTYDKYVYPGSPWWIANHVEKHPEVKVRAE 120 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMK-TDYQ 174 E ++Y L + + + D F+ ++A +PPV RRG+ILIDPPYE + D+ Sbjct: 121 AFEFKNTEYDALNYQLYQLPIG-IHNRDAFEGIRAVIPPVERRGVILIDPPYEQEHKDFT 179 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRG 234 +V + ++ G+YALW+P+ + ++ ++ TGIRK L EL + P+ G Sbjct: 180 RLVELLVASMTKWPQGVYALWFPIKNIEAVELFYKKMKRTGIRKQLLCELNIYPNDVAVG 239 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAG------TGHATVSWIVPE 280 + +GM++INPPW+ +Q +L ++ + P + V W+V E Sbjct: 240 LNGTGMLIINPPWQFDQHARQILNFIQPLMRPEDAPDLPQSQAVNVRWLVGE 291 >UniRef50_Q1QSA4 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QSA4_CHRSD Length = 292 Score = 161 bits (407), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 99/285 (34%), Positives = 149/285 (52%), Gaps = 8/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 ML+Y+H++HAGN ADV KH +++ L K Y+DTHAG G Y L +E +R E Sbjct: 11 MLAYQHAYHAGNFADVHKHLTLFAVLQYLLRKSSAITYVDTHAGRGLYPLEAEETQRLRE 70 Query: 61 YLEGIARIWQQDDLPAE---LEAYINVVKHFNR-SGQLRYYPGSPLIARLLLREQDSLQL 116 Y +G A +W ++ A+ L A+ + + L +YPGSP REQD L L Sbjct: 71 YRQGAAAVWAAREVLADDSLLAAWCERLGDAQSGASTLSHYPGSPWWLANDCREQDRLAL 130 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP + L ++ +AR + ADG ++A LPP + R LIDP YE K +Y V Sbjct: 131 FELHPGEATHLEAQVLP-PQARRQHADGLAGIRALLPPATPRFCALIDPSYERKQEYTDV 189 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSD-RRGM 235 + + + I +WYP++ + ++ +G+RK+ + EL + P + GM Sbjct: 190 AATLQAVAAKVRHAIVMIWYPLLPSGRHHDLLTAARRSGLRKLWRSELTLHPPGEATHGM 249 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+++NPPW ++ Q+N L + S L T W VPE Sbjct: 250 YGSGMLLLNPPWGIDTQLNASLTRVASCL--GDTASHVSQWWVPE 292 >UniRef50_B8H3J8 External DNA uptake/catabolism protein n=6 Tax=Caulobacteraceae RepID=B8H3J8_CAUCN Length = 273 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 86/263 (32%), Positives = 122/263 (46%), Gaps = 2/263 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN AD+ KH + ++ +L+EK +DTHAGAG Y L E A R+GE Sbjct: 1 MNYRHAFHAGNFADLHKHAILLAMLSALQEKSPALAVIDTHAGAGGYDLAGEMARRSGEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 GI R+ D PA + +N + N YPGSP + LR D EL Sbjct: 61 QAGIFRLKAAADAPAVFQPLLNAITQMNGGKDGDLYPGSPRLMARALRGADRYVGCELRD 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LLR + AR +ADGF K R I+IDPP+E DY +V+ Sbjct: 121 DDADLLRKTLAPCANARALQADGFDT-AVKDAGKGGRAFIVIDPPFERPDDYDRIVATTR 179 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 R A+W P+ + + +E T +L EL + P +D M M+ Sbjct: 180 AVLARAPDAALAIWLPIKDLETFDAFLRAME-TVTSDLLVAELRLRPLTDPMKMNGCAMV 238 Query: 242 VINPPWKLEQQMNNVLPWLHSKL 264 +I P +E W+ ++L Sbjct: 239 MIGAPPSVEDAAVAAGDWIATRL 261 >UniRef50_A3VP01 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VP01_9PROT Length = 262 Score = 136 bits (343), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 86/257 (33%), Positives = 138/257 (53%), Gaps = 10/257 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H++HAGN AD+ KH V ++ L +K + LDTHAG G Y L A++TGE Sbjct: 1 MLSYQHAYHAGNRADLHKHAVWCALLAHLTQKSRGLTILDTHAGRGLYDLAGAEAQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +G A + D L + + + + G++ YPGSPL++ R QD + L E H Sbjct: 61 ASDGAAAV--SLDGSHALGSAVAACR--AQYGEMA-YPGSPLLSLHFARPQDQVILMEKH 115 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P + L++ + +A V DG++ A PP R+GL++IDP YE+KT+YQ V + Sbjct: 116 PQEGAALKT-VMRGKKAAVHLRDGYEGALALAPPTPRKGLVMIDPSYEVKTEYQNVALFL 174 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 ++ LWYP++ ++ + M+ L + + E+ DS R M SG+ Sbjct: 175 PTLIDKWPEASVLLWYPILAAKRHEAMLDTLSPM---QPWRHEVLFTEDSLLR-MKGSGL 230 Query: 241 IVINPPWKLEQQMNNVL 257 ++I+PP+ E ++ L Sbjct: 231 VLISPPYGGEGAIDAAL 247 >UniRef50_B7VU08 Putative uncharacterized protein n=4 Tax=Vibrionales RepID=B7VU08_VIBSL Length = 288 Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 80/286 (27%), Positives = 143/286 (50%), Gaps = 20/286 (6%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 + YRH H G+H D LKH V S +++SL ++ +DTH+G G Y L + + GE+ Sbjct: 1 MEYRHQCHVGDHGDALKHPVLSALVQSLMQQHSRLNVIDTHSGTGCYDLTTAPSNHAGEF 60 Query: 62 LEGIARIWQQDD-LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 EG+ +W+ LP ++++V++++N + + YPGS I R QDS +++ Sbjct: 61 AEGVGYLWRNKAYLPPAFASFMSVLEYYNPNQLISLYPGSAAITYQQGRSQDSFYFSDIQ 120 Query: 121 PSDYPLLRSE---FQKD----SRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDY 173 + LL++ Q+D S+ + DG + L + LI+IDPPYE ++Y Sbjct: 121 QDEADLLQTNIETLQRDLDVSSKLTITAGDGLKALPDDVAKHDNHHLIVIDPPYETDSEY 180 Query: 174 QAVVSGIAEGYKRFATGIYALWYP--------VVLRQQIKRMIHDLEATGIRKILQIELA 225 AV+ + + Y++ +WYP ++L + + L + I+ L++ Sbjct: 181 LAVIDALVKAYQQSEKVSALIWYPLYTDDKSSLILNHCVTAVKDGLLPSPIKSELRLR-- 238 Query: 226 VLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGH 271 P D R + SG+++ NPP + + + L +LH +L G G+ Sbjct: 239 -DPKGDDR-LIGSGLLLFNPPQGISGIVADTLDYLHCQLSTNGEGY 282 >UniRef50_UPI0001909543 putative DNA methylase protein n=1 Tax=Rhizobium etli IE4771 RepID=UPI0001909543 Length = 171 Score = 128 bits (322), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 60/148 (40%), Positives = 88/148 (59%) Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELHP DY L F+ D AR+ + DG+ L A LPP +RG++L+DPP+E + +YQ + Sbjct: 2 ELHPEDYARLHRLFEGDHHARITELDGWLALGAHLPPKEKRGIVLVDPPFEEEDEYQRLA 61 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 G+ Y+RF G Y LWYP+ IK L+A I K+L EL V D G+T Sbjct: 62 KGLERAYRRFPGGTYCLWYPLKKGAPIKEFHETLQALDIPKMLCAELTVRSDRGTTGLTG 121 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLV 265 SG++++NPP+ L+ +++ +LP L L Sbjct: 122 SGLVIVNPPFTLKDELHQMLPALKDHLA 149 >UniRef50_Q0C0F5 Putative uncharacterized protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C0F5_HYPNA Length = 262 Score = 126 bits (317), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 69/198 (34%), Positives = 107/198 (54%), Gaps = 5/198 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H FHAGN ADVLKH V ++ + +P Y++TH+G GRY L + A + GE Sbjct: 1 MLSYQHGFHAGNRADVLKHAVLDTLLRAAATGPRPLFYVETHSGHGRYDLTNAQARKRGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +G+ + + P L ++ +V N G+ + YPGSP +A+ LL + + L ELH Sbjct: 61 SDDGVLAL-MKGKPPKPLSGWMELV---NARGE-KDYPGSPALAQTLLPKHARMMLFELH 115 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P + L + D R R++KADG+ P + ++L+DP YE D +A+ Sbjct: 116 PQENAALTEAMKGDDRIRIQKADGYAGALKLAPRAGEQMVVLVDPSYETHRDIEALALWT 175 Query: 181 AEGYKRFATGIYALWYPV 198 + KR+ + LW P+ Sbjct: 176 PKALKRWPGALLILWLPL 193 >UniRef50_B7G053 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G053_PHATR Length = 267 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 85/275 (30%), Positives = 137/275 (49%), Gaps = 29/275 (10%) Query: 4 YRHSFHAGNHADVLKHTV-QSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYL 62 Y+H HAGNH DVLKH V ++ + E L + + +D HAG G Y L + +G++ Sbjct: 7 YQHLKHAGNHCDVLKHVVFRACVQEQLNVHENGIILVDCHAGEGLYDLSKQ---TSGDFE 63 Query: 63 EGIARIWQQDD--LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 G+AR+ Q D P + Y+N ++ + +++YPGSP++ LLREQD +L +L+ Sbjct: 64 RGVARVVQNLDQTAPPAVHDYVNAIQEADEY--MQFYPGSPMLGAKLLREQDEHRLVDLY 121 Query: 121 PSDYPLLRSEFQKDS----RARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 D L KD +A V +AD + L R +ILIDPPY + D+ Sbjct: 122 VEDVEGL-----KDGALFWQADVFEADAVEFLVPN--DDDRHKVILIDPPYLDQEDFYRA 174 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIK----RMIHDLEATGIR-KILQIELAVLPDSD 231 R LWYP++ + + + + I D+ + I Q L V D Sbjct: 175 KVLTERILDRDPYCTILLWYPMIQKSRWRYGYAKSIKDMAKKKAKLGIYQAWLTV----D 230 Query: 232 RRGMTASGMIVINPPWKLEQQMN-NVLPWLHSKLV 265 + G+ SGMIV+NP + ++ ++ + + WL + L+ Sbjct: 231 KEGLQGSGMIVVNPTQRFDEIVDEDTIDWLSATLL 265 >UniRef50_C5SM30 Putative uncharacterized protein n=2 Tax=Caulobacteraceae RepID=C5SM30_9CAUL Length = 284 Score = 110 bits (275), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 82/285 (28%), Positives = 127/285 (44%), Gaps = 17/285 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN AD+ KH V + +L+E +P +D+HAGAG+Y L R+ E Sbjct: 1 MNYRHGFHAGNFADLFKHAVLLNFLRALRESAQPLQVVDSHAGAGQYDLSDPTFSRSKEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLR----YYPGSPLIARLLLREQDSLQLT 117 GI + D+P L + V NR+ + YPGSPL+ L + S Sbjct: 61 EAGIGYLLGG-DVPQSLIPLSDYVWAKNRAAGFKTRIGLYPGSPLLVLDHLTAEGSYMGC 119 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 EL DY LR+ +AR DG++ P + +LIDPP+E DY+ + Sbjct: 120 ELRKDDYERLRATVMPRGKAR--HTDGYEAAVEMAEP-DKDFFLLIDPPFEQFEDYERIN 176 Query: 178 SGIAEGYKRFATGIYALWYPV--------VLRQQIKRMIHDLEATGIRKILQIELAVLPD 229 + + K+ T +W P+ LR ++ D G I EL + P Sbjct: 177 LCLRDVLKKQPTAKALVWLPLKDLETFDRFLRHMECELLEDQTGEGGPDIAVAELRLRPL 236 Query: 230 SDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 ++ M ++ +N P + + M ++ L G G A+V Sbjct: 237 TNPLKMNGCALVTVNAPASVVEAMRDIADDLAQVFAEPG-GKASV 280 >UniRef50_Q2BH49 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BH49_9GAMM Length = 251 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 68/200 (34%), Positives = 98/200 (49%), Gaps = 19/200 (9%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M Y HS +AG ADV+KH + ++ L D Y++THAGAG Y L + GE Sbjct: 1 MAKYLHSKYAGGDADVMKHACLASVLSKL---DISVEYVETHAGAGLYDLDPDR----GE 53 Query: 61 YLEGIARIWQQ-DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 +L+GI R DLPA L+ Y V++ + + + YP SP+IA SL L EL Sbjct: 54 HLKGIGRCRSNLTDLPA-LKPYNGVLEE-SWTLDKKIYPASPIIANSA-SAVKSLCLYEL 110 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 + S L+ + A V + DGF +S +LIDPPY+ DYQ VV Sbjct: 111 NRSVACQLKKNLPE---AVVWEEDGFLSRHH----LSHGSFVLIDPPYKSSDDYQQVVEY 163 Query: 180 IAEGYKRFATGIYALWYPVV 199 + K+ +W+P++ Sbjct: 164 VGAA-KQSQVRAVMVWFPMI 182 >UniRef50_A0B718 Protein involved in catabolism of external DNA-like n=1 Tax=Methanosaeta thermophila PT RepID=A0B718_METTP Length = 246 Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 53/166 (31%), Positives = 74/166 (44%), Gaps = 17/166 (10%) Query: 4 YRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLE 63 Y H HAGN DV KH + S L + +Y ++HAG Y L GE+ Sbjct: 2 YDHREHAGNAGDVWKHFLLSEAAAYLLCR-SDLVYAESHAGYTAYTLAP-----NGEWRW 55 Query: 64 GIARIWQQDDLPAELEA-YINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPS 122 GI R W L +E+E+ Y V++ N L+ YPGS I L R + EL Sbjct: 56 GIGRCWH---LRSEIESPYFAVLEEMNDE-HLQIYPGSAKIILRLGRFFRRRVVAELWDI 111 Query: 123 DYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRR--GLILIDPP 166 + +S + DGF + + ++RR GL+LIDPP Sbjct: 112 SEDVGKS-WSACPDIHFHLGDGFSGV---MDLLNRRDPGLLLIDPP 153 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C9Y2J0 Uncharacterized protein yhiR n=11 Tax=Enterobact... 444 e-123 UniRef50_C6C5L8 Putative uncharacterized protein n=3 Tax=Gammapr... 438 e-121 UniRef50_P31777 Uncharacterized protein HI0441 n=170 Tax=Gammapr... 437 e-121 UniRef50_A3UP41 Protein involved in catabolism of external DNA n... 426 e-118 UniRef50_B8F539 Protein involved in catabolism of external DNA n... 421 e-117 UniRef50_A1T010 DNA (Exogenous) processing protein n=8 Tax=Gamma... 411 e-113 UniRef50_Q984Q7 Mlr7888 protein n=7 Tax=Rhizobiales RepID=Q984Q7... 403 e-111 UniRef50_B9JCU5 DNA methylase protein n=4 Tax=Rhizobiales RepID=... 398 e-110 UniRef50_A0KP43 Protein involved in external DNA uptake n=3 Tax=... 398 e-109 UniRef50_Q47Z69 Putative uncharacterized protein n=1 Tax=Colwell... 397 e-109 UniRef50_Q5NZ63 Predicted protein involved in catabolism of exte... 397 e-109 UniRef50_Q12I42 Putative uncharacterized protein n=20 Tax=Shewan... 396 e-109 UniRef50_B6QZ93 Florfenicol resistance protein n=3 Tax=Rhodobact... 395 e-109 UniRef50_Q0VM19 Putative uncharacterized protein n=2 Tax=Alcaniv... 392 e-107 UniRef50_A3YH43 Protein involved in external DNA uptake n=2 Tax=... 389 e-107 UniRef50_B2S4W3 N-6 Adenine-specific DNA methylase n=51 Tax=Rhiz... 387 e-106 UniRef50_A6SXL6 Uncharacterized conserved protein n=70 Tax=cellu... 387 e-106 UniRef50_A4VR91 Protein involved in catabolism of external DNA n... 386 e-106 UniRef50_C3X722 External-DNA catabolic protein n=2 Tax=Oxalobact... 383 e-105 UniRef50_Q2W9T7 Protein involved in catabolism of external DNA n... 381 e-104 UniRef50_B1XZU6 Putative uncharacterized protein n=1 Tax=Leptoth... 372 e-102 UniRef50_Q2SPJ4 Protein involved in catabolism of external DNA n... 370 e-101 UniRef50_Q15U81 Putative uncharacterized protein n=1 Tax=Pseudoa... 368 e-100 UniRef50_B8GN89 Putative uncharacterized protein n=1 Tax=Thioalk... 366 e-100 UniRef50_C6M2C4 YhiR family protein n=2 Tax=Neisseriaceae RepID=... 365 e-100 UniRef50_Q1N5H6 Putative uncharacterized protein n=1 Tax=Bermane... 362 9e-99 UniRef50_Q89DH2 Blr7467 protein n=16 Tax=Rhizobiales RepID=Q89DH... 361 1e-98 UniRef50_B2HZ48 Protein involved in catabolism of external DNA n... 361 2e-98 UniRef50_Q5QVX6 Transformation competence-related protein ComJ n... 360 3e-98 UniRef50_C6XPG2 Putative uncharacterized protein n=5 Tax=Proteob... 359 6e-98 UniRef50_C6QCA2 Putative uncharacterized protein n=1 Tax=Hyphomi... 358 1e-97 UniRef50_Q5ZVZ2 Protein involved in catabolism of external DNA n... 357 3e-97 UniRef50_Q0ARP7 Putative uncharacterized protein n=2 Tax=Hyphomo... 355 9e-97 UniRef50_D0IYP9 ComJ n=10 Tax=Bacteria RepID=D0IYP9_COMTE 354 3e-96 UniRef50_Q0F148 Putative uncharacterized protein n=1 Tax=Maripro... 353 4e-96 UniRef50_C7JEW3 Putative uncharacterized protein n=8 Tax=Acetoba... 352 7e-96 UniRef50_D0KVW6 Putative uncharacterized protein n=1 Tax=Halothi... 352 9e-96 UniRef50_C5BQN4 Putative uncharacterized protein n=1 Tax=Teredin... 351 1e-95 UniRef50_A4SYR3 Putative uncharacterized protein n=1 Tax=Polynuc... 350 3e-95 UniRef50_Q87F97 Transformation competence-related protein n=20 T... 341 1e-92 UniRef50_A1WIT5 Putative uncharacterized protein n=4 Tax=Burkhol... 340 3e-92 UniRef50_Q0G6E9 Putative uncharacterized protein n=1 Tax=Fulvima... 340 4e-92 UniRef50_B1ZS65 Putative uncharacterized protein n=2 Tax=Opituta... 339 6e-92 UniRef50_Q1YIC4 Putative uncharacterized protein n=1 Tax=Auranti... 339 7e-92 UniRef50_Q0BPC8 Putative uncharacterized protein n=3 Tax=Acetoba... 337 2e-91 UniRef50_B4RYZ5 Putative uncharacterized protein n=2 Tax=Alterom... 336 6e-91 UniRef50_D2LG58 Putative uncharacterized protein n=1 Tax=Rhodomi... 330 3e-89 UniRef50_A5EWC5 Putative uncharacterized protein n=1 Tax=Dichelo... 330 3e-89 UniRef50_B1LXQ5 Putative uncharacterized protein n=9 Tax=Alphapr... 330 3e-89 UniRef50_A5WD58 Putative uncharacterized protein n=3 Tax=Psychro... 329 7e-89 UniRef50_C8PZ79 Protein involved in catabolism of external DNA n... 328 2e-88 UniRef50_A1VJI9 Putative uncharacterized protein n=4 Tax=Comamon... 326 5e-88 UniRef50_C8NAD4 Cytoplasmic protein n=34 Tax=Proteobacteria RepI... 326 7e-88 UniRef50_Q21LZ8 Putative uncharacterized protein n=1 Tax=Sacchar... 325 1e-87 UniRef50_Q2G473 Putative uncharacterized protein n=1 Tax=Novosph... 325 1e-87 UniRef50_B5EL93 Putative uncharacterized protein n=2 Tax=Acidith... 324 2e-87 UniRef50_A3JEY3 Protein involved in catabolism of external DNA n... 323 6e-87 UniRef50_A0YHR6 Putative uncharacterized protein n=1 Tax=marine ... 318 2e-85 UniRef50_Q1QSA4 Putative uncharacterized protein n=1 Tax=Chromoh... 316 6e-85 UniRef50_B7QYF1 Protein involved in catabolism of external DNA n... 312 1e-83 UniRef50_B8H3J8 External DNA uptake/catabolism protein n=6 Tax=C... 309 7e-83 UniRef50_C6NTA4 Putative uncharacterized protein n=1 Tax=Acidith... 307 3e-82 UniRef50_Q1RK44 ComJ n=12 Tax=Rickettsia RepID=Q1RK44_RICBR 307 4e-82 UniRef50_Q73R01 Putative uncharacterized protein n=1 Tax=Trepone... 302 1e-80 UniRef50_UPI0000E1171F protein involved in external DNA uptake n... 286 5e-76 UniRef50_A3VP01 Putative uncharacterized protein n=1 Tax=Parvula... 280 4e-74 UniRef50_C5SM30 Putative uncharacterized protein n=2 Tax=Cauloba... 273 5e-72 UniRef50_B7VU08 Putative uncharacterized protein n=4 Tax=Vibrion... 265 8e-70 UniRef50_B7G053 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 238 2e-61 UniRef50_Q0C0F5 Putative uncharacterized protein n=1 Tax=Hyphomo... 235 1e-60 UniRef50_UPI0001909543 putative DNA methylase protein n=1 Tax=Rh... 210 4e-53 UniRef50_Q2BH49 Putative uncharacterized protein n=1 Tax=Neptuni... 183 4e-45 UniRef50_A0B718 Protein involved in catabolism of external DNA-l... 139 1e-31 Sequences not found previously or not previously below threshold: UniRef50_A3I4J5 Methyltransferase n=3 Tax=Bacillaceae RepID=A3I4... 49 3e-04 UniRef50_D0THA4 Predicted protein n=14 Tax=Bacteroides RepID=D0T... 44 0.008 UniRef50_C1ACG3 Putative uncharacterized protein n=1 Tax=Gemmati... 42 0.032 UniRef50_A6Q3P8 DNA methylase n=3 Tax=Epsilonproteobacteria RepI... 41 0.037 >UniRef50_C9Y2J0 Uncharacterized protein yhiR n=11 Tax=Enterobacteriaceae RepID=C9Y2J0_CROTZ Length = 280 Score = 444 bits (1143), Expect = e-123, Method: Composition-based stats. Identities = 242/280 (86%), Positives = 259/280 (92%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRY L EHAERTGE Sbjct: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYLLSGEHAERTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 YLEGIARIWQ+DDLPAELE YI+ V HFNRSGQLRYYPGSPLIAR LLR QDSLQLTELH Sbjct: 61 YLEGIARIWQRDDLPAELEPYISAVSHFNRSGQLRYYPGSPLIARQLLRPQDSLQLTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 PSD+PLLR EFQKD RARVE+ADG+QQLK+KLPP SRRGLILIDPPYE+KTDYQAVV GI Sbjct: 121 PSDFPLLRGEFQKDERARVERADGYQQLKSKLPPASRRGLILIDPPYEIKTDYQAVVQGI 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGYKRFATG+YALWYPVVLR QIKRM++DLE+TGIR+ILQIELAV PDSD+RGMTASGM Sbjct: 181 NEGYKRFATGVYALWYPVVLRNQIKRMMNDLESTGIRRILQIELAVRPDSDQRGMTASGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +VINPPWKLEQQM +LPWLH LVPAGTGH T+ W+VPE Sbjct: 241 VVINPPWKLEQQMGTLLPWLHKALVPAGTGHTTLKWVVPE 280 >UniRef50_C6C5L8 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=C6C5L8_DICDC Length = 280 Score = 438 bits (1126), Expect = e-121, Method: Composition-based stats. Identities = 219/280 (78%), Positives = 247/280 (88%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADVLKHTVQSLII +LKEK+KPFLYLDTH+GAGRYQL EHAERTGE Sbjct: 1 MLSYRHSFHAGNHADVLKHTVQSLIITALKEKEKPFLYLDTHSGAGRYQLHGEHAERTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y EGI RIWQ+DD+PAE+EAY+ VV+ +N GQLRYYPGSPLIAR LLREQD+L LTELH Sbjct: 61 YREGIGRIWQRDDIPAEMEAYLQVVRSYNSGGQLRYYPGSPLIARQLLREQDTLNLTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+D+ LLR EF +D RARV + DG+ QLK++LPP +RRG+ILIDPPYE+KTDYQAVV GI Sbjct: 121 PTDFSLLRQEFARDDRARVVREDGYLQLKSRLPPAARRGVILIDPPYELKTDYQAVVDGI 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGY+RFATG+YALWYPVVLRQQIKR++ LE TGIR+ILQIELAVLPDSDR GMTASGM Sbjct: 181 QEGYRRFATGVYALWYPVVLRQQIKRLLKALEETGIRRILQIELAVLPDSDRHGMTASGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPWKLE QM ++LPWLH LVP GTGH V W+VPE Sbjct: 241 IVINPPWKLEAQMKSLLPWLHQVLVPEGTGHTRVEWVVPE 280 >UniRef50_P31777 Uncharacterized protein HI0441 n=170 Tax=Gammaproteobacteria RepID=Y441_HAEIN Length = 281 Score = 437 bits (1124), Expect = e-121, Method: Composition-based stats. Identities = 186/281 (66%), Positives = 214/281 (76%), Gaps = 1/281 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY HSFHAGNHADVLKH V LI+E+LK K+K F YLDTH+G GRY+L S +E+TGE Sbjct: 1 MLSYHHSFHAGNHADVLKHIVLMLILENLKLKEKGFFYLDTHSGVGRYRLSSNESEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSG-QLRYYPGSPLIARLLLREQDSLQLTEL 119 Y EGI R+W Q DLP ++ Y+ ++K N G +LRYY GSPLIA LLR QD LTEL Sbjct: 61 YKEGIGRLWDQTDLPEDIARYVKMIKKLNYGGKELRYYAGSPLIAAELLRSQDRALLTEL 120 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HPSDYP+LR+ F D V+ +GFQQ+KA LPP RRGL+LIDPPYE+K DY VV Sbjct: 121 HPSDYPILRNNFSDDKNVTVKCDNGFQQVKATLPPKERRGLVLIDPPYELKDDYDLVVKA 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I EGYKRFATG YA+WYPVVLRQQ KR+ LEATGIRKIL+IELAV PDSD+RGMTASG Sbjct: 181 IEEGYKRFATGTYAIWYPVVLRQQTKRIFKGLEATGIRKILKIELAVRPDSDQRGMTASG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 M+VINPPW LE QM +LP+L LVP GTG TV WI PE Sbjct: 241 MVVINPPWTLETQMKEILPYLTKTLVPEGTGSWTVEWITPE 281 >UniRef50_A3UP41 Protein involved in catabolism of external DNA n=17 Tax=Gammaproteobacteria RepID=A3UP41_VIBSP Length = 284 Score = 426 bits (1095), Expect = e-118, Method: Composition-based stats. Identities = 173/285 (60%), Positives = 214/285 (75%), Gaps = 6/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADV+KH VQSLI+ LK+KDKPF+Y DTH+G GRY L E +E+TGE Sbjct: 1 MLSYRHSFHAGNHADVVKHIVQSLILNYLKQKDKPFVYHDTHSGVGRYDLTHEWSEKTGE 60 Query: 61 YLEGIARIWQQD-----DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQ 115 Y +GIAR+W DLP ++++Y+ + N +LR+YPGSP +AR LR+QD + Sbjct: 61 YKQGIARLWSASEAGQQDLPEDIQSYLESISALNNGEKLRFYPGSPRVARAHLRDQDRMV 120 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 LTELHP+D+PLL EF +D + + K DGFQ+LK LPP RRGL+LIDPPYE+ +Y+ Sbjct: 121 LTELHPADHPLLEQEFHRDRQVSIYKEDGFQRLKGSLPPKERRGLVLIDPPYELAKEYRD 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 VV+ IA+ +KR+ATGIYA+WYPVV R I+ MI LE GI KILQIEL V PD++ RGM Sbjct: 181 VVTAIAQSHKRWATGIYAIWYPVVNRCDIEDMIEGLEGLGINKILQIELGVSPDTNERGM 240 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 TASGMIVINPPWKLE QMN +LP+L + PA TGH V WIVPE Sbjct: 241 TASGMIVINPPWKLESQMNEILPFLKEAIAPA-TGHFKVEWIVPE 284 >UniRef50_B8F539 Protein involved in catabolism of external DNA n=67 Tax=Gammaproteobacteria RepID=B8F539_HAEPS Length = 279 Score = 421 bits (1084), Expect = e-117, Method: Composition-based stats. Identities = 180/280 (64%), Positives = 218/280 (77%), Gaps = 1/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY HSFHAGNHADVLKH V +LI+ +LK+K+K F YLDTH+G GRY L S AE+TGE Sbjct: 1 MLSYHHSFHAGNHADVLKHIVLTLILHALKQKEKGFFYLDTHSGVGRYSLQSSEAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y+EGIAR+W++ DLP ++ Y+N +K N+ +LR+Y GSPL+A LR QD LTELH Sbjct: 61 YIEGIARLWERTDLPEKVVLYLNEIKKINKD-KLRFYAGSPLLAVQQLRPQDRALLTELH 119 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+D+PLLR+EF K ++ +GFQQLK+ LPP +RGL+LIDPPYE+K DY+ VV I Sbjct: 120 PNDFPLLRNEFAKTPNVVTKRENGFQQLKSALPPKEKRGLVLIDPPYELKEDYELVVKAI 179 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGYKRFATG+YA+WYPVVLRQ KR++ L TGIRKILQIELAV PDSD+RGMTASGM Sbjct: 180 EEGYKRFATGVYAIWYPVVLRQHTKRIVRGLVETGIRKILQIELAVRPDSDQRGMTASGM 239 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPW+LE QM +LP+L LVP GTG TV WI PE Sbjct: 240 IVINPPWQLESQMKKILPYLTDVLVPEGTGSWTVEWIKPE 279 >UniRef50_A1T010 DNA (Exogenous) processing protein n=8 Tax=Gammaproteobacteria RepID=A1T010_PSYIN Length = 284 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 137/280 (48%), Positives = 183/280 (65%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 +LSYRHSFHAGN ADVLKH V + II+ + +K+K F YLDTHAG G Y S A +T E Sbjct: 5 LLSYRHSFHAGNFADVLKHIVSTSIIDYMLKKEKAFCYLDTHAGCGAYSFQSPEALKTKE 64 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI +W + DLP + Y+ V FN QL++YPGSP IA +LR+ D L L ELH Sbjct: 65 FNNGIFPLWGRSDLPVPVARYMEQVVEFNAQSQLKHYPGSPSIAVQMLRDIDRLFLFELH 124 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+++ + + F + + ++ K+DG Q L A +PP +RRG ILIDP YE+KT+Y VV + Sbjct: 125 PNEFINMCANFSGNRQIKMAKSDGLQGLIANMPPKARRGFILIDPSYEIKTEYHQVVETL 184 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + +KRFATG YALWYPVV R I +M L+A+GI+ I EL + DSD+ GMT+SGM Sbjct: 185 IQAHKRFATGTYALWYPVVNRMTIDKMEKALKASGIKNIQLFELGLQEDSDQMGMTSSGM 244 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPW L+++M LP+L L G + +V E Sbjct: 245 IVINPPWTLKKEMQASLPFLAKLLGFDNQGFYRIETLVAE 284 >UniRef50_Q984Q7 Mlr7888 protein n=7 Tax=Rhizobiales RepID=Q984Q7_RHILO Length = 282 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 122/282 (43%), Positives = 174/282 (61%), Gaps = 3/282 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH V + +++ LK+KDK F +DTHAG GRY L S A++TGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHVVLTRLLDYLKQKDKAFRVVDTHAGIGRYDLSSLEAQKTGEW 60 Query: 62 LEGIARIWQQD---DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 GI R+ A L Y+ V+ N ++ YPGSPL+AR LLR+QD L E Sbjct: 61 QGGIGRLIDASLDARAGALLAPYLEAVRSLNPGDGVKKYPGSPLLARHLLRKQDRLSAIE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP D L++EF D + RV + DG+ L A LPP +RGL+LIDPP+E + ++ +V Sbjct: 121 LHPKDAARLKAEFAGDFQVRVMELDGWLALGAHLPPKEKRGLVLIDPPFEEEGEFGRLVE 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+ ++R+ GIYALWYP+ R+ + L+ +GI KIL IE + P S + S Sbjct: 181 GLIRAHRRWPGGIYALWYPIKDRKAVIAFRKALKQSGIPKILDIEFEIRPASSEPSLDGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 GM+V+NPP+ LE ++ VLP LH L H ++ W+ E Sbjct: 241 GMVVVNPPFTLEGELRTVLPALHKLLAVEKPAHWSLEWLAGE 282 >UniRef50_B9JCU5 DNA methylase protein n=4 Tax=Rhizobiales RepID=B9JCU5_AGRRK Length = 283 Score = 398 bits (1024), Expect = e-110, Method: Composition-based stats. Identities = 117/282 (41%), Positives = 170/282 (60%), Gaps = 3/282 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN ADVLKH V + ++ ++ KDK F LDTHAG G Y L SE A++TGE+ Sbjct: 1 MNYRHIYHAGNFADVLKHAVLARLVRYMQNKDKAFRVLDTHAGIGLYDLSSEEAQKTGEW 60 Query: 62 LEGIARIWQQDDLP---AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +GI R+ + +P LE Y+ V+ N G L++YPGSP +AR+L R QD L E Sbjct: 61 QDGIGRLLDAELVPQLAELLEPYLTAVRELNPDGGLQFYPGSPKLARMLFRSQDRLSAME 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP D+ L F+ D AR+ + DG+ L A LPP +RG++L+DPP+E + +Y+ + Sbjct: 121 LHPEDFQRLHRLFEGDHHARITELDGWLALGAHLPPKEKRGIVLVDPPFEEEDEYERLAD 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+A+ ++RF G Y LWYP+ IK L+A I K+L EL V D G+T S Sbjct: 181 GLAKAWRRFPGGTYCLWYPIKKDAPIKAFHETLQALEIPKVLCAELTVKSDRGFTGLTGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G+I++NPP+ L+ +++ +LP L L W+ E Sbjct: 241 GLIIVNPPFTLKDELHALLPALKDMLAQDRFASQRAFWLRGE 282 >UniRef50_A0KP43 Protein involved in external DNA uptake n=3 Tax=Aeromonadaceae RepID=A0KP43_AERHH Length = 284 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. Identities = 134/279 (48%), Positives = 178/279 (63%), Gaps = 2/279 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH VQ+LIIESLK+K+KPF+ LDTHAG G Y L + ++ E Sbjct: 1 MLSYRHAFHAGNHADVLKHAVQALIIESLKKKEKPFIVLDTHAGGGLYDLCGDWPQKKAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI R+W + + + Y+ V++ N GQLRYYPGSP ++R L REQD L L ELH Sbjct: 61 YADGIGRLWDERTQWSAMAPYLGVIEEMNSDGQLRYYPGSPELSRRLAREQDKLALMELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 ++ LR+ D R V DGF+ L A LPP RRGL+LIDPPYE+K DY AVV + Sbjct: 121 NNEVDDLRANMGYDPRVAVHHRDGFEGLVALLPPTPRRGLVLIDPPYELKEDYFAVVDTL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQ--IKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 + KR+ATGIYALWYP++ + + M+ ++ +L EL V + GM S Sbjct: 181 KKAQKRWATGIYALWYPILGEEADKSRDMLRAIKRENFGNVLVAELEVAGQTKDWGMNGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM++I+PPW L++Q+ L L +KL V W+ Sbjct: 241 GMLIISPPWMLDEQIEAFLKPLCAKLAQGAGAQYKVEWL 279 >UniRef50_Q47Z69 Putative uncharacterized protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47Z69_COLP3 Length = 295 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. Identities = 127/296 (42%), Positives = 193/296 (65%), Gaps = 17/296 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGN ADVLKH+V SL+++ + K+K F Y+D+H+GAG YQL E+A++TGE Sbjct: 1 MLSYRHAFHAGNFADVLKHSVLSLVLDYMTRKEKGFCYIDSHSGAGMYQLADEYAQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFN----------------RSGQLRYYPGSPLIA 104 Y +GIA+I +D P LE Y++++K N S L YPGSP IA Sbjct: 61 YKDGIAKIINDEDAPESLEPYLSLIKSLNLASDRNTDPSADISTDTSNDLDVYPGSPGIA 120 Query: 105 RLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILID 164 + +R QDS L ELHP+D L + Q+ + V+++DG+Q + +PP SRRG++LID Sbjct: 121 KAFVRRQDSSHLFELHPTDIQHLENFCQRWRKVFVKQSDGYQGVLGLIPPPSRRGVVLID 180 Query: 165 PPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIEL 224 PPYE+K DY V I + Y++F+TG Y LWYPVV R+ +++M + + ++ +LQ+E Sbjct: 181 PPYELKEDYHKAVKTIIKAYEKFSTGTYILWYPVVKRELVEQMSYTFTKSSVKNVLQVEF 240 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + D+D GMT +G+ ++NPPW+L Q+ +LP++ +KL T + T++ ++ E Sbjct: 241 CLESDTDEYGMTGTGLFIVNPPWQLTSQLEEILPYMKTKLGSDDTSY-TLNQLIAE 295 >UniRef50_Q5NZ63 Predicted protein involved in catabolism of external DNA n=13 Tax=Betaproteobacteria RepID=Q5NZ63_AZOSE Length = 281 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 134/277 (48%), Positives = 176/277 (63%), Gaps = 2/277 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH V ++ K+KP+ Y+DTHAGAG Y L SE A + E Sbjct: 1 MLSYRHAFHAGNHADVLKHFVLIELLRYFNRKEKPWWYVDTHAGAGCYALDSEQAGKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI R+WQ+DDLP + Y++ + FN G+L +YPGSP +A LREQD ++L ELH Sbjct: 61 FASGIGRLWQRDDLPDAMRPYLDALAQFNPHGRLTFYPGSPALAMTQLREQDRMRLFELH 120 Query: 121 PSDYPLLRSEFQKD-SRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 P+D LL F +D R +V KADGF L+ LPP SRR ++LIDPPYE+K DY+ VV Sbjct: 121 PADVALLGQTFARDVQRVQVRKADGFSALRGLLPPPSRRVVVLIDPPYEVKEDYRRVVDT 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMTAS 238 +A+ KRF G YA+WYP++ R + +++ L G L + LAV P D GM S Sbjct: 181 LADAIKRFPAGTYAVWYPLLARTEARQLPARLAGLGAENWLDVRLAVKKPPRDGFGMFGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 G+ V+NPPW L Q + V+PWL L G G + Sbjct: 241 GLYVVNPPWVLPQTLEAVMPWLADVLGEDGEGGFDLE 277 >UniRef50_Q12I42 Putative uncharacterized protein n=20 Tax=Shewanella RepID=Q12I42_SHEDO Length = 292 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 135/281 (48%), Positives = 188/281 (66%), Gaps = 2/281 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH +HAGN+ADVLKH + +++++ +KDK F+Y+DTHAGAG Y L E A++TGE Sbjct: 13 MLSYRHGYHAGNYADVLKHAILLQVLKAMHKKDKAFVYVDTHAGAGAYSLEDEFAQKTGE 72 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQLTEL 119 YL+G+A++W + DLP L+ Y+ VK FN L YPGSP LR QD + L EL Sbjct: 73 YLDGVAKLWDKTDLPLALKDYVAAVKTFNAEQDELSLYPGSPAFVDSELRPQDRMVLHEL 132 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 H +D+ LL F KD + +V K DG L A +PP+ RRG++LIDP +E+KTDYQ V Sbjct: 133 HGTDHELLSDYFAKDRQVKVIKGDGLAGLIAAVPPLERRGVVLIDPSFEIKTDYQDVADA 192 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I + +KRF+TG++ LWYPVV R+Q + M+ L+ +GI K L++E + DS+ GMTA+G Sbjct: 193 IIKAHKRFSTGVFMLWYPVVDREQTEAMLSKLKNSGITKQLRLEQGIKTDSNEFGMTAAG 252 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + +INPPW+L++ + L +L L GH TV W V E Sbjct: 253 LWIINPPWQLDELAKDSLDYLAKTLG-GIDGHVTVKWEVGE 292 >UniRef50_B6QZ93 Florfenicol resistance protein n=3 Tax=Rhodobacteraceae RepID=B6QZ93_9RHOB Length = 284 Score = 395 bits (1016), Expect = e-109, Method: Composition-based stats. Identities = 115/284 (40%), Positives = 178/284 (62%), Gaps = 5/284 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN DVLKH V + I++ L++KD + LDTHAG G Y L SE A++TGE+ Sbjct: 1 MNYRHIYHAGNIGDVLKHVVLANILKYLQKKDGAYRVLDTHAGIGLYDLTSEKAQKTGEW 60 Query: 62 LEGIARIWQQDDLP-----AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 +G+ ++ + D L ++ V++ N G +++YPGSP IA +L R+QD L L Sbjct: 61 QQGVGKVLENIDAASDQVKEVLAPWLETVENLNPGGGVQFYPGSPEIACMLARKQDRLTL 120 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 TELHP D+ L++ + D + +V D + L + LPP RRGL+LIDP +E++ +++ + Sbjct: 121 TELHPEDFEELKNNYGGDKKVKVIALDAWLALGSFLPPKERRGLVLIDPAFEVEDEFKRL 180 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT 236 G+ G+KR+ TG +A+WYPV ++ + ++I LE GIR +++EL+ SD R M Sbjct: 181 AEGVIRGWKRWQTGTFAIWYPVKNQRIVNQLIVTLEEAGIRNAVKLELSAGQISDDRPMK 240 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +SGM+V+NPPW L + MN LPWL L +V ++ E Sbjct: 241 SSGMLVVNPPWTLTRDMNTALPWLCQTLSQGKGAEWSVKQVIAE 284 >UniRef50_Q0VM19 Putative uncharacterized protein n=2 Tax=Alcanivorax RepID=Q0VM19_ALCBS Length = 282 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 141/280 (50%), Positives = 183/280 (65%), Gaps = 1/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHS+HAGN ADVLKH VQ IIE LK+KDKPF DTHAGAG Y + SEH ++TGE Sbjct: 1 MLSYRHSYHAGNFADVLKHIVQVAIIEYLKKKDKPFTVHDTHAGAGSYAIASEHMQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GIA+++ + ++ Y+++V+ N G+L YPGSP I+ LLREQD LQ TELH Sbjct: 61 YQDGIAKLFGKRTGVGVIDQYVSLVEKLNPVGRLMDYPGSPQISASLLREQDVLQCTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 +D+ LL+ EF D R +V K D + LKA LPP RRGL+LIDP YEM+ DY V+ + Sbjct: 121 STDFTLLKREFADDKRVQVLKDDAWHGLKALLPPRHRRGLVLIDPSYEMEADYNGVLPAV 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 +RFAT YA+WYPV+ R + + I GI +L++E V PD+ RGMT +GM Sbjct: 181 QMAMERFATATYAIWYPVLDRNRTESFIRRFVKAGIPNLLRVECCVRPDASGRGMTGTGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++INPP+ L Q M +P L L A GH TV + E Sbjct: 241 LIINPPYTLAQHMAQAMPLLKEALCDAN-GHTTVKMLTGE 279 >UniRef50_A3YH43 Protein involved in external DNA uptake n=2 Tax=Marinomonas RepID=A3YH43_9GAMM Length = 280 Score = 389 bits (1001), Expect = e-107, Method: Composition-based stats. Identities = 126/280 (45%), Positives = 177/280 (63%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH +HAGNHAD+LKH V S I L K+ PF YLDTHAG G+Y L S+ A+ E Sbjct: 1 MLSYRHIYHAGNHADILKHLVVSQICHHLTAKEAPFFYLDTHAGIGQYALDSQQAQMNKE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI+++ + P ++ ++ +VK N + L+ YPGSP + R++D + L ELH Sbjct: 61 FKTGISQLLELKSAPDSIKRFLKIVKEMNPTSNLKVYPGSPKVVEAYTRQKDKMHLCELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P D+P L + F RA VEK +GF +KA LPP +RGL+L+DPPYE+K DY+ VV + Sbjct: 121 PKDHPTLAALFPNKRRANVEKGNGFAAVKAMLPPPQKRGLVLMDPPYEVKEDYKTVVKAL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EG++RF+ GIYA+WYPV+ R+Q +I+ ++ T IR +L +EL + +GM SGM Sbjct: 181 VEGHQRFSHGIYAIWYPVLSRKQADNLINSVQRTKIRNVLLLELNIRDIDADKGMNGSGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 I++NPPWK+E + LP L L + WI PE Sbjct: 241 IIVNPPWKMESEAQEFLPILKELLQEDNRSSFQLRWITPE 280 >UniRef50_B2S4W3 N-6 Adenine-specific DNA methylase n=51 Tax=Rhizobiales RepID=B2S4W3_BRUA1 Length = 290 Score = 387 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 122/286 (42%), Positives = 171/286 (59%), Gaps = 7/286 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + + I+E LK K++ F +DTHAG G Y L A +TGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHVILTRIVEYLKRKEQAFRVIDTHAGIGLYDLKGTEAGKTGEW 60 Query: 62 LEGIARIWQQDD-------LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSL 114 GI RI + + L+ Y++ V N +LR+YPGSPL+ R LLR+QD L Sbjct: 61 AGGIERIMTAVEKGQVEQPVLELLKPYLDAVYAVNTGVRLRHYPGSPLLVRHLLRKQDRL 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 ELHP D L F D + RV + DG+ L A LPP +RGL+L+DPP+E ++ Sbjct: 121 SALELHPQDAAKLAKLFDGDYQVRVTELDGWLALGAHLPPKEKRGLVLVDPPFEKDGEFD 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRG 234 + G+A+ +KRF G YALWYPV R++ +R L TGI KI+QIELA+ S Sbjct: 181 RLADGLAKAHKRFGGGTYALWYPVKDRRETERFARRLRETGIPKIMQIELAIRAPSPEPR 240 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + +GMIV+NPP+ LE +M +LP L L + ++ WI E Sbjct: 241 LDGTGMIVVNPPYTLESEMQILLPCLTRLLEEEKGSNFSLRWIRGE 286 >UniRef50_A6SXL6 Uncharacterized conserved protein n=70 Tax=cellular organisms RepID=A6SXL6_JANMA Length = 305 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 135/291 (46%), Positives = 183/291 (62%), Gaps = 16/291 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH VQ +++ L +KD P++Y+DTH+GAG Y L +A + E Sbjct: 1 MLSYRHAFHAGNHADVLKHLVQIQLLKYLNQKDTPYMYIDTHSGAGVYALDGNYAAKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI+++W + DLPA L Y+ V+K N SG+LRYYPGSP A ++REQD L+L ELH Sbjct: 61 FETGISKLWDRKDLPAPLAEYVQVIKALNPSGKLRYYPGSPYCADAVMREQDRLRLFELH 120 Query: 121 PSDYPLLRSEFQ---------------KDSRARVEKADGFQQLKAKLPPVSRRGLILIDP 165 P+D LL F+ + R +E+ +GFQ LKA LPP SRRGL+LIDP Sbjct: 121 PADSKLLADNFRKLEAHAAEQGKRPTVRGKRIMIERGNGFQGLKALLPPPSRRGLVLIDP 180 Query: 166 PYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELA 225 PYE KTDY+ VV +++ RFATG YA+WYPV+ R + ++M L+ L + L+ Sbjct: 181 PYEDKTDYRTVVQTVSDALTRFATGTYAVWYPVLNRLESRQMPDKLKRLSANGWLNVTLS 240 Query: 226 V-LPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 V P D G+ +SGM V NPPW LE + ++P+L L T+ Sbjct: 241 VTTPSPDGFGLHSSGMFVHNPPWTLEPMLRELMPYLVKTLGGDEGAGFTLE 291 >UniRef50_A4VR91 Protein involved in catabolism of external DNA n=21 Tax=Pseudomonadaceae RepID=A4VR91_PSEU5 Length = 279 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 124/279 (44%), Positives = 170/279 (60%), Gaps = 1/279 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGNHADVLKH V S I L K+ PF YLD+HAG G Y L + A RTGE+ Sbjct: 1 MNYRHAFHAGNHADVLKHLVLSRIFALLSRKEAPFAYLDSHAGVGLYDLAGDQASRTGEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L+GIARIWQ + PA L+ Y+ V++ N G LRYYPGSP +AR L REQD LQL E HP Sbjct: 61 LQGIARIWQAETRPALLDDYLGVIRSLNPDGALRYYPGSPELARQLTREQDRLQLNEKHP 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LL+ D R V + +G+ +A +P +R ++LIDPP+E + V+ + Sbjct: 121 EDGALLKDNMSGDRRVAVHRGEGWHVPRALMPTREKRVVLLIDPPFEQADELSRCVTALK 180 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 E R I +WYP+ +Q+KR DL +G K+L+ EL V P D + SG+ Sbjct: 181 EALGRMRQTIGVIWYPIKDERQLKRFYQDLARSGAPKLLRAELFVHPADDASRLAGSGLA 240 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++NPPW LE ++ +LPWL +L + G + W++ E Sbjct: 241 IVNPPWGLEDELRELLPWLAEQLAQS-QGGWRLDWLIEE 278 >UniRef50_C3X722 External-DNA catabolic protein n=2 Tax=Oxalobacter formigenes RepID=C3X722_OXAFO Length = 298 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 119/291 (40%), Positives = 166/291 (57%), Gaps = 16/291 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKH V ++ K+ Y+DTH+GAG Y L A + E Sbjct: 1 MFSYRHAFHAGNHADVLKHVVLMQVLLYAIRKEASLFYIDTHSGAGVYSLEGNEARKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GIAR+W + +P + Y+ +V N G+LR+YPGSP IA +LR D L+L E H Sbjct: 61 FQSGIARLWGKKTVPPAVRDYLKLVYDMNPDGKLRFYPGSPYIAERILRSHDRLRLFEWH 120 Query: 121 PSDYPLLRSEFQ---------------KDSRARVEKADGFQQLKAKLPPVSRRGLILIDP 165 P++ +L F+ + R VE+ DGF LKA LPP SRR +ILIDP Sbjct: 121 PAECRVLDENFRGLLKSGESNTRSRPERGKRVLVERKDGFSSLKALLPPPSRRAVILIDP 180 Query: 166 PYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELA 225 PYE K+DY+ VV +++ KRF+TG +WYP++ R + +R L+ T ++ L + L+ Sbjct: 181 PYEDKSDYRKVVDVVSDALKRFSTGTCLIWYPLLQRPESRRFASRLKQTVSQEWLDVTLS 240 Query: 226 V-LPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 P D G +SGM V+NPPWKL + + +P L S L T+ Sbjct: 241 TGSPVPDGFGFVSSGMFVVNPPWKLAESLQETMPCLVSALKQDSGAGFTLE 291 >UniRef50_Q2W9T7 Protein involved in catabolism of external DNA n=7 Tax=Alphaproteobacteria RepID=Q2W9T7_MAGSA Length = 283 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 120/279 (43%), Positives = 177/279 (63%), Gaps = 1/279 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + ++++ SLK KD PF LDTHAG G Y L + A++TGEY Sbjct: 1 MNYRHAYHAGNFADVMKHAILAMVVASLKRKDTPFFALDTHAGIGAYDLEAPQADKTGEY 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L GIAR+ D PAELE Y+ +V+ +N G LR YPGSP + R L+R QD + L ELHP Sbjct: 61 LSGIARVLDAADPPAELETYLALVRTWNSEGVLRRYPGSPELMRGLMRPQDRMALVELHP 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LR+ F D R V DG+ K LPP RRGL+L+DPP+E+K +++ +++ + Sbjct: 121 EDVETLRARFHGDRRVGVHHLDGYTAAKGLLPPPERRGLVLMDPPFEVKNEFERLLAALR 180 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 K + TGIY WYP+ R+ + + + + G + L EL + P D + +G++ Sbjct: 181 RARKLWPTGIYLAWYPIKGREPVDQFLQAIADDGGPEALAAELLLRPAKDPFKLNGNGLL 240 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 VINPPW+L + ++ VLPWL + + P +G A + ++ E Sbjct: 241 VINPPWQLRESLDRVLPWLAAVMAPD-SGSAAIRQLIGE 278 >UniRef50_B1XZU6 Putative uncharacterized protein n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1XZU6_LEPCP Length = 280 Score = 372 bits (956), Expect = e-102, Method: Composition-based stats. Identities = 117/276 (42%), Positives = 175/276 (63%), Gaps = 1/276 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 ML+YRH+FHAGNHADVLKH V +++ + KDKPF +DTHAG G Y L S +++ GE Sbjct: 1 MLAYRHAFHAGNHADVLKHLVLVQVLQYMASKDKPFRLIDTHAGGGGYALHSSQSQKKGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 YL+GI+RIW D P + Y+ +V+ FN GQL YPGSP ++++LLR D L+L ELH Sbjct: 61 YLQGISRIWGAGDAPPAVADYLRLVRRFNPDGQLNLYPGSPALSQMLLRRGDQLRLFELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+++ +L + + ++ + DGF L+ ++PP RRG++L+DP YE+ +DY V+ + Sbjct: 121 PTEFKILTENTRPGRQVQLAQVDGFAALRGQVPPSMRRGVVLMDPSYELVSDYAKVIDSL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMTASG 239 + +RFA G+Y +WYP V R + ++ L+AT + L + L V PD+ G+T SG Sbjct: 181 RDALQRFAEGVYVVWYPQVSRVESIQIARRLQATAPKGWLHVRLNVQQPDAQGFGLTGSG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 + VINPP+ L Q+ +PWL KL + + Sbjct: 241 VFVINPPYTLHAQLAACMPWLTQKLGQFEGANHLLE 276 >UniRef50_Q2SPJ4 Protein involved in catabolism of external DNA n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPJ4_HAHCH Length = 280 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 108/282 (38%), Positives = 169/282 (59%), Gaps = 4/282 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN AD KH SL++++L +K P+ YL+THAG G Y L SE A++T E Sbjct: 1 MLSYQHVYHAGNFADAHKHWALSLLLQALCKKSTPWRYLETHAGRGDYDLTSEEAQKTSE 60 Query: 61 YLEGIARIWQQDDL-PAELEAYINVVKHFNRS-GQLRYYPGSPLIARLLLREQDSLQLTE 118 + GI + Q P E +AY+ V+ N + +L YPGSP IA LRE D L L E Sbjct: 61 WTAGILPLMQAKGPCPPEFDAYLAAVRALNPNTERLTRYPGSPAIAAGFLRETDQLALCE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP +Y L+ +F ++ + + + DGF+ + A PP +RGL++IDP YE+K DYQ + + Sbjct: 121 LHPGEYAELKRQFGRNRQIHIHQRDGFEGVMAMSPPPEKRGLVMIDPSYELKEDYQRIPA 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 + + K+++ I A+WYP++ ++ ++M+ + + K L+ EL + P RGM S Sbjct: 181 YVNKLTKKWSNAIIAIWYPILAEKRHEKMLELMRQLPLNKTLRSELILTPV--ARGMYGS 238 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 GM+V+N PW+L++Q+ +L L + W++ E Sbjct: 239 GMLVVNSPWRLDEQLQAGWAYLSEALRGDPKASCSADWLIAE 280 >UniRef50_Q15U81 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15U81_PSEA6 Length = 293 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 120/296 (40%), Positives = 170/296 (57%), Gaps = 20/296 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH +HAGNHADVLKH Q LII+ LK+KDK F Y+DTH+GAG Y L SE + +T E Sbjct: 1 MFSYRHGYHAGNHADVLKHICQMLIIDKLKQKDKGFTYIDTHSGAGLYDLSSEQSLKTNE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + +GI+R+ + AY + + + Q YPGSP IAR+L+R+QD L L E + Sbjct: 61 FQQGISRLADYSGAEPTVLAYQALTSSYLKHQQ---YPGSPEIARVLMRDQDQLHLMEWN 117 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 + L+ + K + V DG++ L A PP +RGL+L DP YE DYQ VV I Sbjct: 118 NQEVINLKRQI-KGTHISVHHRDGYEGLIALTPPKLKRGLVLTDPSYETSEDYQLVVDAI 176 Query: 181 AEGYKRFATGIYALWYPVVLRQ----------------QIKRMIHDLEATGIRKILQIEL 224 ++ YKR+ T IYA+WYP++ ++ + ++M+ DL G + +LQ+EL Sbjct: 177 SKAYKRWPTAIYAIWYPLLSKRDEDQNDGFERATTKHKKSQKMLDDLTQHGFKNVLQVEL 236 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 AV GM SGM +IN PW+L+ Q+ + L L + V+W+V E Sbjct: 237 AVQNPDTFAGMYGSGMAIINAPWQLDAQIRDCLGELTPVMAQHKHASFVVNWLVEE 292 >UniRef50_B8GN89 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GN89_THISH Length = 281 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 120/280 (42%), Positives = 169/280 (60%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH FHAGN ADV KH V + I+++L K KPF LDTHAG Y L S+ AE+TGE Sbjct: 1 MLSYRHGFHAGNFADVHKHAVLAWIVQALTAKAKPFCVLDTHAGDAGYDLASQWAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + EG+ R+ P + ++ +++ F S R YPGSP IAR LLR D L L ELH Sbjct: 61 WREGVGRLMGCPGAPEAIAPFLQLLEAFRASHGERAYPGSPAIARGLLRPGDRLVLGELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+ + LR F +D + V + DG++ L A LPP RRGL+L+DPPYE +YQA + Sbjct: 121 PAAWESLRGFFARDDQVAVHRRDGWELLGALLPPAERRGLVLVDPPYERDEEYQAAARAL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 +R+ +G+Y LWYP++ + + M+ +LEA +L EL P G+ SG+ Sbjct: 181 TAAARRWPSGVYLLWYPLLAAGRHQAMLRELEAARPGPMLVAELWTAPLDTPAGLNGSGL 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++NPPW+L + + N+ PWL L P G G + + W +P+ Sbjct: 241 CILNPPWRLHEALANLQPWLVDCLAPGGAGGSRLHWAIPD 280 >UniRef50_C6M2C4 YhiR family protein n=2 Tax=Neisseriaceae RepID=C6M2C4_NEISI Length = 281 Score = 365 bits (939), Expect = e-100, Method: Composition-based stats. Identities = 114/279 (40%), Positives = 166/279 (59%), Gaps = 6/279 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHAD++KH + L ++ +KDKP+ Y+DTH+GAG Y L A++ GE Sbjct: 1 MLSYRHAFHAGNHADMIKHFILFLTLDYFNQKDKPYWYIDTHSGAGLYDLSGSEAQKVGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI + + + LP EL A+I + QL Y GSP +A+ L R+ D L+L ELH Sbjct: 61 YKQGIRLLQEAEHLPPELSAFIARLNAILPQEQL--YCGSPWLAQALTRDSDKLRLFELH 118 Query: 121 PSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P+D+ L++ ++ R ++ +ADGF+ L + LPP RR ++LIDPPYE K DYQ VV Sbjct: 119 PADFQHLKNNMEEARLGRRGQIMQADGFRGLISLLPPPLRRAVVLIDPPYEEKQDYQRVV 178 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVL-PDSDRRGMT 236 + + KRF G Y +WYP + R++ +++ L+ L EL V P D GM Sbjct: 179 QTLKDALKRFEQGCYMVWYPCLSREESRKLPEQLQKLMPDSYLHAELHVHTPRPDGFGMH 238 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 SGM +INPP+ L + + + LP L L ++ Sbjct: 239 GSGMFIINPPYLLPELLKSNLPALTDILAQDNGARFVLN 277 >UniRef50_Q1N5H6 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1N5H6_9GAMM Length = 284 Score = 362 bits (929), Expect = 9e-99, Method: Composition-based stats. Identities = 118/285 (41%), Positives = 165/285 (57%), Gaps = 11/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH+V I + +K + Y+DTH+GAG Y+L + A +T E Sbjct: 1 MLSYRHAFHAGNHADVLKHSVLVAIAKYFHKKQSAYTYIDTHSGAGVYKLSDDLANKTQE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNR----SGQLRYYPGSPLIARLLLREQDSLQL 116 Y GIAR++ DL A + Y+ V+ N L++YPGSP LLREQD L Sbjct: 61 YKTGIARLYPNSDL-ALISPYLEQVRVLNAAQGEEKNLQFYPGSPWFMTELLREQDQAHL 119 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP D+ LL + +V DGF +KA LPP ++R I+IDPPYE +Y+ V Sbjct: 120 FELHPQDHALLEQNMNTGKQLKVHMEDGFSGIKAVLPPQTKRAFIVIDPPYEQANEYKKV 179 Query: 177 VSGIAEGYKRFATGIYALWYPVVLR----QQIKRMIHDLEATGIRKILQIELAVLPDSDR 232 V+ I +G KRFA G++A+WYP++ R + M+ +L T I K L + L + Sbjct: 180 VNAIEQGIKRFAVGVFAVWYPLLNRNDKQGMSETMVDELAKTDITKYLDVRLWTSKQTQ- 238 Query: 233 RGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM SG+ ++NPP+ L+ +N LP L L T +V ++ Sbjct: 239 -GMYGSGLFIVNPPYILQDLLNQELPKLLEVLGLDETAGFSVDYV 282 >UniRef50_Q89DH2 Blr7467 protein n=16 Tax=Rhizobiales RepID=Q89DH2_BRAJA Length = 286 Score = 361 bits (928), Expect = 1e-98, Method: Composition-based stats. Identities = 111/282 (39%), Positives = 164/282 (58%), Gaps = 8/282 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN ADV+KH V + I+ L++K F +DTHAGAG Y L S+ A R GE+ Sbjct: 1 MNYRHAFHAGNFADVIKHIVLARILTYLQDKPGAFRVIDTHAGAGLYDLESDEARRGGEW 60 Query: 62 LEGIARIWQQD---DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 L GIAR+ Q + A + Y+++V+ FN G+L+ YPGSPLIAR LLR QD L E Sbjct: 61 LTGIARLMQARLSNETAALTKPYLDIVRAFNPKGELKAYPGSPLIARGLLRPQDRLVACE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 L P L ++D +ARV DG+ L A +PP RRGL+LIDPP+E K +++ + Sbjct: 121 LEPKARKALIDVLRRDEQARVVDLDGWVALPAFVPPKERRGLVLIDPPFEAKNEFERLGE 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIH-----DLEATGIRKILQIELAVLPDSDRR 233 +E + ++ TGIY +WYP R+ + A K L++E + P D Sbjct: 181 AFSEAFAKWPTGIYVIWYPAKSRRATDALAQLVARLAAAAKPPGKCLRLEFSAAPQLDGA 240 Query: 234 GMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 +T++G++++NPP+ L ++ +LP L L G + Sbjct: 241 ALTSTGLLIVNPPYTLHGELKTILPELEMPLGQGGAARFRLE 282 >UniRef50_B2HZ48 Protein involved in catabolism of external DNA n=17 Tax=Acinetobacter RepID=B2HZ48_ACIBC Length = 285 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 110/286 (38%), Positives = 163/286 (56%), Gaps = 8/286 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV+KH + ++ L KDKP+ Y+DTH GAG+Y L A+++GE+ Sbjct: 1 MNYRHHFHAGNFADVMKHVLLLQLLNRLNAKDKPYRYIDTHGGAGKYDLSQAPAQKSGEF 60 Query: 62 LEGIARIWQQDD-----LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 L GI R+ Q D P ++ Y+ +V+ YPGSP A +RE D + Sbjct: 61 LTGIHRLVQLSDMEKRQAPEAIQQYLKLVEELRAQEGKGSYPGSPWFALQGMREIDKATI 120 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYE-MKTDYQA 175 E+ + LR D RA + + D ++ L A +PP +RGL++IDPPYE + D+ Sbjct: 121 FEMQRDVFQQLRHNIH-DKRAGLHERDAYEGLLAVIPPKEKRGLVMIDPPYELERKDFPQ 179 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 +V + YK++ TG++A+WYP+ R I+R + TGIR+ L E+ V PD G+ Sbjct: 180 LVELLQSAYKKWPTGVFAVWYPIKDRAMIERFEKKMFKTGIRRQLICEICVWPDDTPVGL 239 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKL-VPAGTGHATVSWIVPE 280 G++VINPPW+ +Q + L WL L + GHA V W+V E Sbjct: 240 NGCGLLVINPPWQFSEQADQALQWLFPHLRMQETGGHAAVRWLVGE 285 >UniRef50_Q5QVX6 Transformation competence-related protein ComJ n=2 Tax=Idiomarina RepID=Q5QVX6_IDILO Length = 283 Score = 360 bits (925), Expect = 3e-98, Method: Composition-based stats. Identities = 103/283 (36%), Positives = 160/283 (56%), Gaps = 4/283 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV KH + + +E ++K+KP+ LDTH G G Y L + A RT E Sbjct: 1 MNYRHIFHAGNFADVFKHLLLARALEYFQQKNKPYFVLDTHGGIGYYDLQGDQAIRTAEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQLTELH 120 +GI R + AY++ V+ N LRYYPGSP+I LRE D L + ELH Sbjct: 61 EQGIVRFAEHSAEEPLAAAYLSTVRQLNEEQDKLRYYPGSPVITSEFLRENDRLVVCELH 120 Query: 121 PSDYPLLRSE-FQKDSRARVEK-ADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 D L++ + + ++ DG+Q ++A+LPP +RGL+LIDPP+E T++ VVS Sbjct: 121 KEDAETLKNTPLGRHKQVQILAPMDGYQAVRAQLPPAEKRGLVLIDPPFENTTEFDDVVS 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEA-TGIRKILQIELAVLPDSDRRGMTA 237 + +G KR+ +G +A+WYP+ + D+ A + + K L +EL + + +R+G+ Sbjct: 181 ALEQGLKRWKSGSFAVWYPIKDELKTAAFHRDVGALSDLPKTLIMELNIRTNDERKGLHG 240 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G + +NPP+ + Q ++LP L L + W+V E Sbjct: 241 CGFLWVNPPYGVVQDSEHLLPVLCKTLAQDKGANFHSRWLVGE 283 >UniRef50_C6XPG2 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=C6XPG2_HIRBI Length = 334 Score = 359 bits (922), Expect = 6e-98, Method: Composition-based stats. Identities = 115/288 (39%), Positives = 167/288 (57%), Gaps = 14/288 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FH GN AD+LKH V L IESLK+KDKPF Y+DTHAG GRY L + A R+ E+ Sbjct: 1 MNYRHAFHVGNFADILKHLVLVLCIESLKKKDKPFRYIDTHAGIGRYDLTGDEARRSPEW 60 Query: 62 LEGIARIWQQ-------DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSL 114 EGI RIW +D+ A L+ Y++ V N G L YPGSP +A L+REQDSL Sbjct: 61 QEGIGRIWAAHKAGDIPEDVAAILKPYLDAVSEINYDGDLESYPGSPDLAATLMREQDSL 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 +LTELHP+D L F +D R ++E +G++ LKA LPP RRG++L+DPP+E + + Sbjct: 121 RLTELHPADKETLTDHFFRDKRVKIENRNGYEALKAYLPPPERRGVVLVDPPFEHRDELA 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGI-------RKILQIELAVL 227 + G G R+ TG Y W P+ + ++ L + KIL +L V Sbjct: 181 HMAKGAMGGISRWPTGTYIFWRPLKDMENTQKFDDGLAEWLLDDMEFSHEKILLADLWVK 240 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 + + +G++V+NPP+ +++ + VLPW+ L ++ Sbjct: 241 EIVEPGPLCGAGVVVVNPPYGMQEALLTVLPWVTELLQQDEGAGWRIN 288 >UniRef50_C6QCA2 Putative uncharacterized protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QCA2_9RHIZ Length = 281 Score = 358 bits (920), Expect = 1e-97, Method: Composition-based stats. Identities = 102/280 (36%), Positives = 160/280 (57%), Gaps = 3/280 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN ADVLKH V + ++ +K+K +PF +DTHAGAGRY L A +TGE+ Sbjct: 1 MNYRHGYHAGNFADVLKHVVLARVLTYMKQKPRPFRVIDTHAGAGRYDLAGVEAGKTGEW 60 Query: 62 LEGIARIWQQDDLPA---ELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +GI R++ + P L+ Y++ V+ N SG L YPGS LIAR ++R +D L E Sbjct: 61 QDGIGRVFNAEFAPPVAELLQPYLDAVRADNASGDLEVYPGSSLIARRIMRPEDVLVANE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 L+ S++ L+ E + D + +K+ LPP RR ++LIDPP+E K+++ + Sbjct: 121 LNASEFERLKRELGRPRNTTFLNIDAWHAVKSLLPPKERRAVVLIDPPFEAKSEFADLAV 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+ E RF G+Y +WYP+ + R + + + + L + LAV G+TA+ Sbjct: 181 GVREAMSRFQDGVYVIWYPLKDVEAADRFVAEATSRPGLEFLDVRLAVCAPFPGLGLTAT 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIV 278 G++VINPP+ L ++ VLP L + + +V Sbjct: 241 GVLVINPPYLLRGELETVLPALRDCMAEGEGCGFVLKGVV 280 >UniRef50_Q5ZVZ2 Protein involved in catabolism of external DNA n=7 Tax=Legionella RepID=Q5ZVZ2_LEGPH Length = 287 Score = 357 bits (916), Expect = 3e-97, Method: Composition-based stats. Identities = 107/273 (39%), Positives = 157/273 (57%), Gaps = 3/273 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN ADV+KH + ++ L KDKP YL+TH+G G Y L + + +T E Sbjct: 6 MLSYQHGYHAGNFADVIKHITLTRLLAYLTHKDKPLFYLETHSGRGIYDLKDKQSLKTEE 65 Query: 61 YLEGIARIW-QQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 Y EGI +W +++LP+ YI+V+K N + L YYPGSP A LR QD L L EL Sbjct: 66 YKEGINPVWLDRENLPSLFLEYISVIKQINLNSTLSYYPGSPYFAINQLRSQDRLYLCEL 125 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP++Y L + + V DG +L A LPP +RGLI IDP YE K +Y+ + Sbjct: 126 HPTEYNFLLKLPHFNKKVYVNHTDGVSKLNALLPPPEKRGLIFIDPSYERKEEYKEIPYA 185 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I Y +F+TG+Y +WYPVV + ++ + + + + +IEL + P + GMT G Sbjct: 186 IKNAYSKFSTGLYCVWYPVVNKAWTEQFLRKMREISSKSV-RIELHLNPLINE-GMTGCG 243 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHA 272 + +INPP+ ++ VL L + P + + Sbjct: 244 LWIINPPYTFPSEIKLVLETLTTYFNPGSSSYM 276 >UniRef50_Q0ARP7 Putative uncharacterized protein n=2 Tax=Hyphomonadaceae RepID=Q0ARP7_MARMM Length = 311 Score = 355 bits (912), Expect = 9e-97, Method: Composition-based stats. Identities = 102/285 (35%), Positives = 163/285 (57%), Gaps = 13/285 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN ADVLKH+V +L +E L K KP+ +DTHAG G Y L S AER+ E+ Sbjct: 16 MNYRHAFHAGNFADVLKHSVLALCLEHLNAKPKPYRVIDTHAGIGGYDLASSEAERSPEW 75 Query: 62 LEGIARIWQQDDLPAELE----AYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 +GI R+ DLP ++ ++++V+ N G ++ YPGSP IA L+RE+D + L Sbjct: 76 KDGIGRLIDA-DLPEPVQAMLGPWLDIVREMNPDG-IKAYPGSPEIAARLIREEDRVHLC 133 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELH +D L + +++D+R +VE+ DG++ LK+ +PP +RGL+LIDPP+E + + + Sbjct: 134 ELHEADSVTLDNRYRRDARIKVERRDGYKALKSLVPPKEKRGLVLIDPPFEDRDELAHMA 193 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGI-------RKILQIELAVLPDS 230 + ++ TG + W + R + L I KIL+ +L + + Sbjct: 194 EAVMGALAKWPTGTFIFWRSLKNLWAADRFDNGLAEWLISEKDFEPEKILRADLWIRDLA 253 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 + +G+++INPP+ LE+ + N +PWL L V Sbjct: 254 SEGKLAGAGVVIINPPFTLEETLVNAMPWLAETLAQGNGYGWRVD 298 >UniRef50_D0IYP9 ComJ n=10 Tax=Bacteria RepID=D0IYP9_COMTE Length = 288 Score = 354 bits (908), Expect = 3e-96, Method: Composition-based stats. Identities = 114/285 (40%), Positives = 163/285 (57%), Gaps = 10/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKHTV ++ L +K+ LDTH GAG Y+L ++A ++GE Sbjct: 1 MFSYRHAFHAGNHADVLKHTVLIATVQYLTQKEAALTVLDTHGGAGLYRLDGDYASKSGE 60 Query: 61 YLEGIAR--IWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 EG+ R ++ +L L+ Y+ +V+ FN+ +R YPGSP I + LLR D L+ E Sbjct: 61 AEEGVLRLAAAKEAELAPVLQDYLQMVRRFNQGNAIRNYPGSPFITQALLRGHDRLKAFE 120 Query: 119 LHPSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 LHP+D L + + + DGF+ +K LPP SRR L+L DP YE+KTDY Sbjct: 121 LHPTDMRSLTGNMAQLEVRRQVAILHEDGFEGVKKFLPPPSRRALLLCDPSYELKTDYGR 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRM---IHDLEATGIRKILQIELAVLPD--S 230 V+ A+G KRF TG YA+WYP++ R + + + + + L L V + S Sbjct: 181 VLDMAADGLKRFPTGTYAVWYPIIPRPEAHDLPKRLKTMATKAGKSWLHATLTVKSNKTS 240 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 +R G+ ASGM +INPP+ L+ Q+ +P L L T+ Sbjct: 241 ERGGLPASGMFLINPPFNLKDQLKPAMPQLVKLLGQDSNAGFTLE 285 >UniRef50_Q0F148 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F148_9PROT Length = 303 Score = 353 bits (906), Expect = 4e-96, Method: Composition-based stats. Identities = 105/283 (37%), Positives = 155/283 (54%), Gaps = 4/283 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+HS+HAGNHADVLKH + + + KD P LD A G Y L S A + E Sbjct: 22 MLSYQHSYHAGNHADVLKHIILGDVAAGMFNKDAPIFMLDAFASRGIYDLNSPEALKNRE 81 Query: 61 YLEGIARIWQQDDLP---AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 G+ ++W D P + + ++ N +PGS + + RE D + Sbjct: 82 SDSGVGKLWPLRDEPTNPPGVRRWFKLIASLNMDDSYTRFPGSTAMLHAMAREGDRIAAC 141 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 +LHP ++ LR FQ R + K D F+ +K LPP +RGL+ +DP YE+K +Y+A+ Sbjct: 142 DLHPQEFDTLRVSFQASRRFSLLKRDAFEAIKGMLPPKEKRGLVFLDPSYEVKEEYRAIA 201 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 +A +++FA G+Y +WYP++ ++ + +L+ +GIRKIL+IEL M Sbjct: 202 KAVAGAHRKFAGGVYVIWYPLLPAERHNELFRELKHSGIRKILRIELDCGDLFPDMQMHG 261 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+++NPPW EQ M L W+ KL G G SW+VPE Sbjct: 262 SGMLIVNPPWHAEQAMQQSLNWVCDKL-TDGKGRKQFSWLVPE 303 >UniRef50_C7JEW3 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JEW3_ACEP3 Length = 273 Score = 352 bits (904), Expect = 7e-96, Method: Composition-based stats. Identities = 103/280 (36%), Positives = 155/280 (55%), Gaps = 8/280 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN AD +KH + +++S K PF+ LDTHAG GRY L S AE+T E+ Sbjct: 1 MNYRHAYHAGNFADCMKHALLVTLLQSFLRKPAPFMVLDTHAGIGRYDLHSPEAEKTQEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 +GI ++W +D + L ++ VK ++G +YPGSPLI +LR QD+L E HP Sbjct: 61 RDGIGKLW-NEDAASPLADWLEQVK---KTGGPEFYPGSPLIIAQMLRAQDALICCEKHP 116 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPP-VSRRGLILIDPPYEMKTDYQAVVSGI 180 D L F V + D ++ L+A LPP ++RGLILIDPP+E ++ + + Sbjct: 117 EDKRSLYRLFTNTPNVTVHERDAYEALRALLPPQTAKRGLILIDPPFEEPGEFDRLAQAV 176 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 RFA I A+WYP+ R ++ L TGIR I EL + P + + +G+ Sbjct: 177 QTIQARFANAIIAIWYPIKHRTPVRIFHETLMGTGIRNICVAELLMRPPYNPDQLNGAGL 236 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +VI PP+ ++ + L L L G + V+ +V E Sbjct: 237 LVIRPPFGFAEKASAQLERLQHVL---GAHESCVTQLVEE 273 >UniRef50_D0KVW6 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVW6_HALNC Length = 312 Score = 352 bits (904), Expect = 9e-96, Method: Composition-based stats. Identities = 111/291 (38%), Positives = 157/291 (53%), Gaps = 13/291 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+H FHAGNHADVLKH V +IE +++K FL L+THAGAG Y L + A R+ E Sbjct: 20 MNYQHHFHAGNHADVLKHLVLLQLIELMQQKPTGFLLLETHAGAGLYDLQATEARRSDEA 79 Query: 62 LEGIARIWQQ----DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 GIAR+ Q D +P ++ Y+ ++ F L YYPGSPL+A LR QD Sbjct: 80 SGGIARLLQATQAADTVPVLIQTYLKQIEQFGSVPNLGYYPGSPLLAVCALRPQDRYIGV 139 Query: 118 ELHPSDYPLLRSEFQ---------KDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYE 168 EL P L D R +G LKA LPP+ RRGL LIDPPYE Sbjct: 140 ELVPKVARELSRNLAQRPMLEPCIPDRRVIARDGEGLAALKADLPPLERRGLFLIDPPYE 199 Query: 169 MKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLP 228 + + + + G +RF TG+YALWYP+ R + R ++ + + R +L IE ++ P Sbjct: 200 QPQERDDIAAALQAGLQRFETGVYALWYPIKQRPYLDRWLNRIAKSTPRPVLTIENSIFP 259 Query: 229 DSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 D +T SG+++INPPW+ + M VL +++ L + W+ P Sbjct: 260 DESGNRLTGSGLLIINPPWQFDTLMQPVLDFVNDALKQDTAAPRAIRWLNP 310 >UniRef50_C5BQN4 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BQN4_TERTT Length = 279 Score = 351 bits (902), Expect = 1e-95, Method: Composition-based stats. Identities = 106/279 (37%), Positives = 162/279 (58%), Gaps = 6/279 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH S+++E L EKDKP YL+THA AG Y L + ++ E Sbjct: 1 MLSYRHAFHAGNHADVLKHLCLSMVLEKLIEKDKPLTYLETHAAAGAYDLNTAMPQKNRE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y+ GI+ + + + Y +V + + YPGSP +A +LREQD L L ELH Sbjct: 61 YMSGISPLLASEVSSEAMSRYKALVARYFADYK---YPGSPAVAASVLREQDKLVLMELH 117 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 +++ +LR+ ++D R + DG + + A PP RRG++LIDPPYE +Y+ + + I Sbjct: 118 NTEFEILRNNMRRDKRVTLHHRDGIEGVLALSPPTPRRGIVLIDPPYEQPLEYERIATLI 177 Query: 181 AEGYKRFATGIYALWYPVVL--RQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 A+ ++++ G+ ALWYP++ R + M+ + + + EL V + GM S Sbjct: 178 AQLHRKWPVGVIALWYPLLAQERNRAPAMLDVIARSQPASLFTAELWVEAQASDYGMYGS 237 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM IN PW +++++ VLP + L P G + W+ Sbjct: 238 GMAFINLPWTVDEKIALVLPEIQQILAPD-QGGFSHRWV 275 >UniRef50_A4SYR3 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SYR3_POLSQ Length = 289 Score = 350 bits (899), Expect = 3e-95, Method: Composition-based stats. Identities = 115/285 (40%), Positives = 158/285 (55%), Gaps = 10/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAG+HAD+LKH ++E L+EK +DTHAGAG Y L A + E Sbjct: 1 MFSYRHAFHAGSHADILKHLTLIHLVEYLQEKPGALTIVDTHAGAGIYSLVDGFATVSKE 60 Query: 61 YLEGIARIWQ----QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 GI R+ Q + P + Y+ +++ N +L YPGSP I LLR QD L+L Sbjct: 61 AEGGIFRLSQFFGKNSETPESIRKYLEMIQAENTGEELNTYPGSPFIIARLLRPQDRLKL 120 Query: 117 TELHPSDYPLLRSE---FQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDY 173 ELHP + +LR ++ + V AD F +LK LPP SRRGL+LIDP YE K DY Sbjct: 121 FELHPKEIDILRHNIGELKEAKQIDVYAADSFSRLKGLLPPPSRRGLVLIDPSYEDKQDY 180 Query: 174 QAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRM---IHDLEATGIRKILQIELAVLPDS 230 + + + + E +RFATG YA+WYP++ R++ + + + AT R L EL V Sbjct: 181 RYLENAMEEALQRFATGCYAIWYPILSRRESASLPDHLKKIAATHKRSWLHTELRVENAP 240 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 R + ASGM +INPPW LE+ ++ LP L L + Sbjct: 241 GERRLQASGMFIINPPWTLEKHLDEALPVLVKALGVDAGAKYVLK 285 >UniRef50_Q87F97 Transformation competence-related protein n=20 Tax=Xanthomonadaceae RepID=Q87F97_XYLFT Length = 293 Score = 341 bits (876), Expect = 1e-92, Method: Composition-based stats. Identities = 101/290 (34%), Positives = 154/290 (53%), Gaps = 15/290 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y H+FHAGNHADVLKH V +++ L K+ PF LD+HAG GRY L + + T E Sbjct: 1 MNYSHAFHAGNHADVLKHIVLLALLDGLVRKETPFFVLDSHAGRGRYLLSAGESRNTREA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSG--------QLRYYPGSPLIARLLLREQDS 113 G+ R+ + ++ Y++VV+ N S + YPGS L+A + R QD Sbjct: 61 ESGVMRLIARPQRLEVIKRYVDVVQADNVSQTRAASTPMHISRYPGSSLLAAQVCRAQDR 120 Query: 114 LQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPV---SR--RGLILIDPPYE 168 + ELHP + L + F D R RV DG+ ++A LPP R RGL+ IDPPYE Sbjct: 121 MVFCELHPKEAAALNALFVHDPRVRVHAGDGYAAVRAFLPPKVGTQRIGRGLVFIDPPYE 180 Query: 169 MKT-DYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVL 227 + +Y V+ + E R+ I A+WYP+ R++++ +R +L EL V Sbjct: 181 AQDAEYPLVLGALRETLTRWPQAICAVWYPIKQRRRLQPFFRKAVGLPVRSVLIAELLVR 240 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 D + SGM+++N PW+ +Q + LP L ++L +G + W+ Sbjct: 241 LDDSPLRLNGSGMLLLNVPWQFDQLLAPALPVLKTQLGESG-ARTRLEWL 289 >UniRef50_A1WIT5 Putative uncharacterized protein n=4 Tax=Burkholderiales RepID=A1WIT5_VEREI Length = 293 Score = 340 bits (874), Expect = 3e-92, Method: Composition-based stats. Identities = 114/290 (39%), Positives = 160/290 (55%), Gaps = 15/290 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADV KHTV ++ L +KD LD+HAGAG Y+L ++A +GE Sbjct: 1 MFSYRHAFHAGNHADVFKHTVLIATLQYLTDKDAALTVLDSHAGAGLYRLDGDYARTSGE 60 Query: 61 YLEGIARIWQQ--DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +G+ R++ L L+AY+++V FN+ +LR YPGSP I + LLRE D L+L E Sbjct: 61 AADGVVRLFAAPGSALAPALQAYVDMVGAFNQGRRLRVYPGSPCITQRLLRESDKLKLFE 120 Query: 119 LHPSDYPLLR---SEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 HP+D L ++ Q + V DGFQ ++ LPP RR L+L DP YE+K+DY Sbjct: 121 WHPTDLRALAGHVAQLQAGRQVAVFHEDGFQGIRKFLPPPQRRALLLCDPSYEIKSDYGK 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLR---QQIKRMIHDLEATGIRKILQIELAVLPDSD- 231 V+ + KRFATG Y WYP++ R ++ R + L + + L L V Sbjct: 181 VLDLATDSLKRFATGCYMFWYPIIGRPEAHELPRRLKTLASKAGKSWLHATLTVKSGQRT 240 Query: 232 ------RRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 R G+ ASGM +INPP+ L+ + LP + L T+ Sbjct: 241 AAGSLKRPGLPASGMFLINPPFTLKAALTPALPQMVQLLAQDRHATHTLE 290 >UniRef50_Q0G6E9 Putative uncharacterized protein n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G6E9_9RHIZ Length = 301 Score = 340 bits (872), Expect = 4e-92, Method: Composition-based stats. Identities = 106/280 (37%), Positives = 155/280 (55%), Gaps = 6/280 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 L+YRH+FHAGN ADV+KH + + +I LK K+KPF DTHAG GRY L + A RTGE Sbjct: 25 LNYRHAFHAGNFADVVKHALLTRLIAYLKRKEKPFRVFDTHAGRGRYDLNASEASRTGEA 84 Query: 62 LEGIARIWQQDDLPAE--LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 G+ +I Q L AE L Y + + + +YPGSPLIAR LRE D L EL Sbjct: 85 QAGVLKIAQSTTLRAEPLLADYFAAI---DPDLREGFYPGSPLIARRCLRETDRLSAYEL 141 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP D LR F D + + DG+ L A LPP +RGL+LID P+E ++ ++SG Sbjct: 142 HPEDGGALRDLFAGDVQVKAISLDGWLALGAHLPPKEKRGLVLIDSPFEKPSEVDDILSG 201 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT-AS 238 + + R+ G+YA+WYP+ R +++++ + + L +E+ + D G+ + Sbjct: 202 LEKALSRWRGGVYAIWYPIKRRALVEKLLTAIAGMAAGEALAVEVRIAADESAEGLFLGT 261 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIV 278 G VINPP+ ++ ++ L L G+ A V + Sbjct: 262 GFAVINPPFVFAEEAKAIVDLLLPALKRDGSATARVFTLT 301 >UniRef50_B1ZS65 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZS65_OPITP Length = 297 Score = 339 bits (870), Expect = 6e-92, Method: Composition-based stats. Identities = 110/296 (37%), Positives = 163/296 (55%), Gaps = 17/296 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLG----SEHAER 57 ++YRH FHAGN ADV+KH + +I +L++K+K YLDTHAG G Y LG + ER Sbjct: 1 MNYRHLFHAGNFADVMKHALLIELIGALQKKEKGIFYLDTHAGRGSYDLGLAARGDTLER 60 Query: 58 TGEYLEGIARIWQQ--------DDLPAELEAYINVVKHF-----NRSGQLRYYPGSPLIA 104 E+ +GI RI + L AY ++V+ F N +G R+YPGSP IA Sbjct: 61 QPEWPDGIGRILAARSTAAADANATGDPLRAYADLVRRFDAERGNTNGSPRFYPGSPAIA 120 Query: 105 RLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILID 164 ++L+R QD L L E P ++ LL +EF + R V DG+ ++A LPP RR L+LID Sbjct: 121 QVLVRRQDRLALCEQVPEEHALLAAEFARAPRTSVHAIDGYVAVRAMLPPPERRALVLID 180 Query: 165 PPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIEL 224 P+E + ++ + + +AEG R G++A+WYP+ R ++ L + L +EL Sbjct: 181 APFEAQDEFARIETALAEGLARLPAGVFAVWYPLTERARVDAFFAGLAERRLPPTLVLEL 240 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 AV ++ M G++V+NPPW E+ +L L +L A W+VPE Sbjct: 241 AVAGENSALKMRGCGLVVVNPPWHFERTAAPILEALARELAQAPGAAGRQQWLVPE 296 >UniRef50_Q1YIC4 Putative uncharacterized protein n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YIC4_MOBAS Length = 281 Score = 339 bits (870), Expect = 7e-92, Method: Composition-based stats. Identities = 102/276 (36%), Positives = 149/276 (53%), Gaps = 6/276 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + + +I LK K+KPF DTHAG G Y L S+ A RTGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHALLTRLIAYLKRKEKPFRVFDTHAGRGSYSLTSDEARRTGEH 60 Query: 62 LEGIARIWQQDDLP---AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +G+ R+ Q L Y + + YPGSPLIAR LLR QD L E Sbjct: 61 ADGVGRLVQAAADVMDDPLLAEYRGALASDLSEDR---YPGSPLIARRLLRPQDRLSAYE 117 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP+D L++ F D + + DG+ L + +PP +RGL+LIDPP+E + A+ Sbjct: 118 LHPADAAALKTLFAGDVQTKAIALDGWLALGSHVPPKEKRGLVLIDPPFERTDEVDAIAE 177 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+A+ +R+A GIYA+WYP+ + + L+A + + + E P + + Sbjct: 178 GLAKALQRWAGGIYAVWYPLKRPALVAALHERLDALPVSERVTAEFFREPYTADERFVGT 237 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 G+ VINPP+ + VL L L +V Sbjct: 238 GLTVINPPFVFAAEAEAVLTTLAPLLGSGEAATFSV 273 >UniRef50_Q0BPC8 Putative uncharacterized protein n=3 Tax=Acetobacteraceae RepID=Q0BPC8_GRABC Length = 294 Score = 337 bits (865), Expect = 2e-91, Method: Composition-based stats. Identities = 100/271 (36%), Positives = 149/271 (54%), Gaps = 12/271 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN AD KH + ++ +L++K+ PF LDTHAG G L A RTGE+ Sbjct: 1 MNYRHAFHAGNFADCHKHALMVALLTALRQKEAPFFVLDTHAGTGETLLTDGPAARTGEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 EGI + DD L +Y+ +V L YPGSPLIAR +LR QD + + ELHP Sbjct: 61 REGIGLLL--DDPAPVLASYLALVTSLGMERSL--YPGSPLIARAMLRPQDRMAVCELHP 116 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPV--------SRRGLILIDPPYEMKTDY 173 D L F+ D + + DG++ L+ LPP RRGL LIDPP+E ++ Sbjct: 117 EDCASLAERFRGDPYCAIHRRDGWKALETMLPPKTASSGGVLPRRGLTLIDPPFEQPDEH 176 Query: 174 QAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRR 233 + + + +RF TG+ A WYP+ + + H L+ G++++L EL + P +D Sbjct: 177 RRLADAMLRAQQRFPTGMVAGWYPIKGGAPARLLRHQLQDAGLKRVLIAELFLHPPTDTT 236 Query: 234 GMTASGMIVINPPWKLEQQMNNVLPWLHSKL 264 + SGM ++NPPW+ ++ L + L Sbjct: 237 RLNGSGMAILNPPWQFGDDARAIMQALKTGL 267 >UniRef50_B4RYZ5 Putative uncharacterized protein n=2 Tax=Alteromonas macleodii RepID=B4RYZ5_ALTMD Length = 292 Score = 336 bits (862), Expect = 6e-91, Method: Composition-based stats. Identities = 112/295 (37%), Positives = 162/295 (54%), Gaps = 19/295 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H+FHAGNHADV+KH +I SLK+KDKPF DTHAGAG Y L + + E Sbjct: 1 MLSYQHAFHAGNHADVIKHLCWIGVINSLKKKDKPFTLFDTHAGAGTYDLNDAMSSKNKE 60 Query: 61 YLEGIARIW----QQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 Y GI+RI + D LP L+ Y+ + + F Q YPGSP I+ R D+L L Sbjct: 61 YETGISRIINTGAEHDSLPELLKNYLTLCEPFLAKHQ---YPGSPAISATAKRATDNLHL 117 Query: 117 TELHPSDYPLLRSEFQK--DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 ELHP+++ L + K + V K DG++ L+A PP RG ILIDPPYE ++Y Sbjct: 118 MELHPAEFDKLEANMGKLHLRKMHVHKRDGYEGLRALTPPKPNRGAILIDPPYERASEYG 177 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQ------QIKRMIHDLEATGIRKILQIELAVLP 228 V+ G+ + +KR+ +WYP++ + + M L A G + + E+ V Sbjct: 178 EVIKGVEQVFKRWQQAQIVVWYPLLSERAGAKHGASELMCDKLAALG-KPCFKAEICVEK 236 Query: 229 DSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLV---PAGTGHATVSWIVPE 280 ++ GM SG+ V+NPPW+L+ Q+ + L + +L + VSWI + Sbjct: 237 NTPEAGMYGSGVFVLNPPWQLDSQLESALQNVVLQLGAKSSDSSASTHVSWINED 291 >UniRef50_D2LG58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LG58_RHOVA Length = 278 Score = 330 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 97/283 (34%), Positives = 149/283 (52%), Gaps = 10/283 (3%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV+KH V + ++ L K+ P LD H GAG Y L SE AE+TGE+ Sbjct: 1 MNYRHVFHAGNFADVIKHAVLAFCVDYLLRKESPLCLLDAHGGAGLYDLRSEEAEKTGEW 60 Query: 62 LEGIARIWQQD----DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 G+ + Q LE Y+ +V+ G +YPGSPL+ LR QD L Sbjct: 61 ARGVGAVMQAAGGTASAAEALEPYLRLVREDVADG---FYPGSPLLLARRLRPQDRLIAN 117 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELH S LR + RV AD ++ ++A +PP RRGL+LIDPP+E K +++ ++ Sbjct: 118 ELHESTRGALRGTLAEFPSVRVTGADAYECIRATIPPKERRGLVLIDPPFEEKDEFETLI 177 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 + E KR+ATG++ LWYP+ + + + A G+ + +E + P + Sbjct: 178 RQMREWKKRWATGVFLLWYPIKAVSPLGALKAEAAALGLPRTWCVETLIYPRGRALSLNG 237 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G+I+ N P+ + + + LP + +W+VP+ Sbjct: 238 CGLILFNAPYSVPEAVEATLPAFADAMR---LHETHTAWLVPD 277 >UniRef50_A5EWC5 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EWC5_DICNV Length = 288 Score = 330 bits (847), Expect = 3e-89, Method: Composition-based stats. Identities = 115/277 (41%), Positives = 156/277 (56%), Gaps = 6/277 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGN+ADV KH + +K K+KP LY D+HAGAG Y L S HAE+TGE Sbjct: 1 MLSYRHSFHAGNYADVFKHFCLYQTLTFMKRKEKPLLYFDSHAGAGFYDLHSAHAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI R++ LP L + N ++ + S Y GSP +A LL E D+LQ ELH Sbjct: 61 YCDGIMRLYAAQQLPPALIEFRNDLRLWLESEN--VYCGSPWLAAHLLGEHDTLQACELH 118 Query: 121 PSDYPLLRSEFQ--KDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 P+D P L+ + + R+ V + DGF QL A +PP RR LI+IDP YE K+DY AV S Sbjct: 119 PNDAPALQHIIRSIRPRRSFVFQKDGFVQLLASVPPPQRRALIVIDPSYEQKSDYDAVCS 178 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEA-TGIRKILQIELAVLPDSDRRGMTA 237 +++ K+FA G Y +W P +LR + + L G R L+ +L V +S GM Sbjct: 179 VLSKALKKFAQGCYLIWSPCLLRTEAQDFPQQLAEVIGGRGYLRAQLKVRTES-ALGMYG 237 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 + +INPP+ L + L L + + Sbjct: 238 CEIHIINPPYLLAPVLQEAGNVLVQILAADKSAQFQL 274 >UniRef50_B1LXQ5 Putative uncharacterized protein n=9 Tax=Alphaproteobacteria RepID=B1LXQ5_METRJ Length = 282 Score = 330 bits (847), Expect = 3e-89, Method: Composition-based stats. Identities = 104/276 (37%), Positives = 155/276 (56%), Gaps = 2/276 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGNHADVLKH V + +++ L+ KDKPF LD AG G Y L ++ A RTGE+ Sbjct: 1 MNYRHAFHAGNHADVLKHLVLARVLDHLRLKDKPFRALDAFAGLGVYDLEADEAARTGEW 60 Query: 62 LEGIARIWQ--QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 +G R+ ++ A L Y V YPGSP + R LR D EL Sbjct: 61 RDGWGRMAAPFAPEVEALLAPYRAAVAAVRARHGDTAYPGSPAVIREALRPGDKGVFVEL 120 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP+D L+ + +D+R +V DG+ + A++PP RRGL+LIDPPYE+ + + + + Sbjct: 121 HPADAATLQGRYARDARTKVMNLDGWTAINAQIPPPERRGLVLIDPPYEVPGEIERLGAH 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 +A ++ TG++ WYP+ + RM+ DL A R L+++L + D +T SG Sbjct: 181 LARAVAKWPTGLFLAWYPIKDTAVLDRMVRDLGAALPRPALRLDLLIDRPGDPTRLTGSG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 +IV+NPPW+L ++ LP L +L G Sbjct: 241 LIVVNPPWRLAEEAMLFLPALAERLARQDFGGFRCD 276 >UniRef50_A5WD58 Putative uncharacterized protein n=3 Tax=Psychrobacter RepID=A5WD58_PSYWF Length = 291 Score = 329 bits (844), Expect = 7e-89, Method: Composition-based stats. Identities = 89/292 (30%), Positives = 153/292 (52%), Gaps = 14/292 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+H++HAGN ADV KH + ++ + +K KP+ LD + G G Y L SE A +TGE Sbjct: 1 MNYKHAYHAGNFADVAKHILLVQLLNQMSKKGKPYYALDAYGGRGLYSLSSEEARKTGEA 60 Query: 62 LEGIARIWQQD--DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQD----SLQ 115 G+ +I + D + P + Y++ +K ++ YPGSP + + + Sbjct: 61 KAGVQKILEADVSEAPEAVRQYVDDIKQARQTYDKYVYPGSPWWIANHVEKHPEVKVRAE 120 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMK-TDYQ 174 E ++Y L + + + D F+ ++A +PPV RRG+ILIDPPYE + D+ Sbjct: 121 AFEFKNTEYDALNYQLYQLP-IGIHNRDAFEGIRAVIPPVERRGVILIDPPYEQEHKDFT 179 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRG 234 +V + ++ G+YALW+P+ + ++ ++ TGIRK L EL + P+ G Sbjct: 180 RLVELLVASMTKWPQGVYALWFPIKNIEAVELFYKKMKRTGIRKQLLCELNIYPNDVAVG 239 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAG------TGHATVSWIVPE 280 + +GM++INPPW+ +Q +L ++ + P + V W+V E Sbjct: 240 LNGTGMLIINPPWQFDQHARQILNFIQPLMRPEDAPDLPQSQAVNVRWLVGE 291 >UniRef50_C8PZ79 Protein involved in catabolism of external DNA n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PZ79_9GAMM Length = 295 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 94/296 (31%), Positives = 161/296 (54%), Gaps = 18/296 (6%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+HS+HAGN ADV+KH + ++E + K KP+ LD + G G Y L S+ A++TGE Sbjct: 1 MNYQHSYHAGNFADVVKHVLLLQLLEMMSAKPKPYYILDAYGGRGLYSLASDEAKKTGEA 60 Query: 62 LEGIARIWQQDD--LPAELEAYINVVKHFNRSGQLRYYPGSPL-IARLLLREQD------ 112 + GI ++ QD+ P ++ Y+ + + + YPGSP IA + ++QD Sbjct: 61 IHGITKLLAQDNSQAPQAVQTYLQDIGYAKKFYDKHVYPGSPWFIAHHIEKQQDAHPEIN 120 Query: 113 -SLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMK- 170 + E S++ L + + V+ + ++ + A LPP +RGLILIDPP+E + Sbjct: 121 NRAEAFEWKASEFDALNYQLHQLP-IGVQHRNAYEGILAVLPPQEKRGLILIDPPFEQEH 179 Query: 171 TDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDS 230 D+ A+V + + +K+++TG+ ALWYP+ ++ ++ T IR+ L +EL + P Sbjct: 180 RDFSALVDLLVKAHKKWSTGVLALWYPIKNNDAVELFYKKMKRTEIRRQLVLELNIFPPD 239 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGT------GHATVSWIVPE 280 G+ +GM+VINPPW+ + + +L +L L + V W+V E Sbjct: 240 LPMGLNGTGMLVINPPWQFDAKAEEILQYLQPILQHPESPQMSVEQRTKVQWLVGE 295 >UniRef50_A1VJI9 Putative uncharacterized protein n=4 Tax=Comamonadaceae RepID=A1VJI9_POLNA Length = 340 Score = 326 bits (837), Expect = 5e-88, Method: Composition-based stats. Identities = 112/337 (33%), Positives = 160/337 (47%), Gaps = 62/337 (18%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKHT +++ L +KD +DTHAGAG Y+L ++ E +GE Sbjct: 1 MFSYRHAFHAGNHADVLKHTCLIALMKYLTQKDTALTVIDTHAGAGLYRLDGDYTETSGE 60 Query: 61 YLEGIARIWQQDDLP----------------------------------------AELEA 80 EGI ++ + L Sbjct: 61 AQEGIFKLLLASKMASAQTGKAGAAIKKVAPAATAKAAPAAPEPAAKPASDYAWAPALLD 120 Query: 81 YINVVKHFNRS-------GQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQK 133 Y+ +++ N L+ YPGSP I + L +D L+L ELHP+D+ L ++ Sbjct: 121 YLELLRSLNPHFAQTGDPAHLKIYPGSPFIEQKFLSGRDKLKLFELHPTDFKSLSGNIEQ 180 Query: 134 ---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATG 190 + V + DGF+ LK LPP +RR ++L DP YEMKTDY V S +A+ KRFATG Sbjct: 181 LGVGRQVVVAREDGFEALKTFLPPPARRAMVLCDPSYEMKTDYLRVSSCMADAVKRFATG 240 Query: 191 IYALWYPVVLRQQIKRMIHDLEA---TGIRKILQIELAVL-----PDSDRR----GMTAS 238 Y +WYP++ R + + L+ R L L V D++ G+ AS Sbjct: 241 TYVVWYPIIPRPEAHDLPRKLKTIAVKAGRSWLNATLTVKSSKLTTDTEGEVVRPGLPAS 300 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 GM VINPP L+ ++ LP + + L T+ Sbjct: 301 GMFVINPPHTLKAELQAALPQMVALLGQDRNAGFTLE 337 >UniRef50_C8NAD4 Cytoplasmic protein n=34 Tax=Proteobacteria RepID=C8NAD4_9GAMM Length = 280 Score = 326 bits (836), Expect = 7e-88, Method: Composition-based stats. Identities = 112/282 (39%), Positives = 153/282 (54%), Gaps = 5/282 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH++HAGNHAD+LKH + + + +K KP+ Y+DTHAGAG Y L + +A++ E Sbjct: 1 MLSYRHAYHAGNHADLLKHYLLTRTLAYYNQKPKPYDYIDTHAGAGYYDLTAAYAQKNRE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y GIAR+ LPA L A+ + + + YPGS IA LL L L ELH Sbjct: 61 YQSGIARLNAAAHLPAALAAWRDHMHAHQPAPD--TYPGSAWIAARLLPAPGKLHLHELH 118 Query: 121 PSDYPLLRSEFQKDSRA---RVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P+D+ L + +ADGF L A LPP SRR +ILIDPPYE K+DYQ + Sbjct: 119 PADHAALTENLRPLRLGRRLHTHRADGFAGLIALLPPASRRAVILIDPPYEQKSDYQTTL 178 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 +A YKRF +G Y +WYP + R + + L L+ EL V ++ GM Sbjct: 179 DTLAAAYKRFPSGTYLIWYPCLPRDESRHFPAQLNQHFGDNYLRAELHVRAENGAHGMYG 238 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 SGM +INPP+ L ++ LP L + T+ +P Sbjct: 239 SGMYLINPPYTLPAELKTTLPALRDLCAESADSRITLDARIP 280 >UniRef50_Q21LZ8 Putative uncharacterized protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21LZ8_SACD2 Length = 289 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 103/285 (36%), Positives = 160/285 (56%), Gaps = 8/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY H +HAGN ADV KH L+++ L K+ P Y+DTHAGAG Y L E AE+T E Sbjct: 1 MLSYLHGYHAGNFADVHKHCTLMLLLKKLHAKNTPITYIDTHAGAGLYALDDEKAEKTRE 60 Query: 61 YLEGIARIWQQDD--LPAELEAYINVVKHFNRSGQL----RYYPGSPLIARLLLREQDSL 114 +G+ + + ++ Y++++ S Q + YPGSP IA+ LLREQD Sbjct: 61 SQQGVDALLASKTGITHSAIKEYLHLLASVRLSKQHTLGEQAYPGSPAIAQALLREQDFG 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 L ELH ++ L+ F++D+R + DGF+ L A PP + RGL LIDP YE+ +DY Sbjct: 121 ILMELHNNEVGKLKQHFKRDTRLSIHHRDGFEGLAALTPPSTARGLALIDPSYELTSDYH 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKR--MIHDLEATGIRKILQIELAVLPDSDR 232 +++ + R+ TG++A+WYP++ ++ + L + +L EL + + Sbjct: 181 QLITSLQTATARWRTGVFAVWYPILAGEKNHADFIKRKLAQLDVASVLNSELHIYTKEEN 240 Query: 233 RGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM SGM +IN PW+L+ ++ ++LP L + L + V W+ Sbjct: 241 DGMIGSGMAIINAPWQLDAELESLLPELETLLAQSSKVKYKVEWL 285 >UniRef50_Q2G473 Putative uncharacterized protein n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G473_NOVAD Length = 276 Score = 325 bits (833), Expect = 1e-87, Method: Composition-based stats. Identities = 94/277 (33%), Positives = 147/277 (53%), Gaps = 4/277 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRHSFHAGN ADV+KH++ ++ +L+ KD +DTHAG G Y L + A+RTGE Sbjct: 1 MNYRHSFHAGNSADVVKHSLLIALVRALQLKDSALTLIDTHAGCGLYDLHGDAAQRTGES 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 +G+ R D L+ Y V+ N + YPGSP I LLR QD+L + E HP Sbjct: 61 AQGVLRALA--DPNPLLDDYRAAVQAVNVGAEPHLYPGSPRILVQLLRPQDALIVNEKHP 118 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LR + + A V + D ++ A LPP + RG++++DPPYE + + + +A Sbjct: 119 EDAYALRGAM-RGTGAAVHERDAYEFWLAMLPPRTPRGVVVVDPPYEQTDERARITATLA 177 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 +++++ G+ +WYP+ R R L GI K L +E + +G+ Sbjct: 178 AAHRKWSHGVTVIWYPLKDRATHVRWKEQLRRLGIPKFLNVEHWLYDADQPGIYNGAGLF 237 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAG-TGHATVSWI 277 ++NPP+ Q + ++L L + L P G G W+ Sbjct: 238 IVNPPYAFTQALPSMLEALRAALAPEGHQGEIAAEWL 274 >UniRef50_B5EL93 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EL93_ACIF5 Length = 285 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 101/275 (36%), Positives = 153/275 (55%), Gaps = 8/275 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y H +HAGN AD +KH SL +++L KD P Y++THAGAGRY LG++ GE+ Sbjct: 1 MNYDHQYHAGNTADCVKHLALSLTLQTLVRKDSPLAYIETHAGAGRYALGTQ-----GEH 55 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L+G++R+W A++ +V + N G LR+YPGSP +A LLR D + L E P Sbjct: 56 LQGVSRLWADRRSLPHAGAWLKIVSNENADGTLRHYPGSPALAAALLRPTDRMVLCEEQP 115 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 LR K + V DG++ L ++PP +RGL+LIDPP+E + +++ + + Sbjct: 116 EVATRLRKAIGKRAHTSVVGEDGYRTLFGQIPPPEKRGLVLIDPPFERRDEWERLTDTLI 175 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 Y+R+ G+Y +WYPV +R I R+ L EL +P+ R + SG+I Sbjct: 176 RAYQRWPQGVYLVWYPVKIRGTITRLWQALRERL--PAFACELLQMPEEGREQLFGSGLI 233 Query: 242 VINPPWKLEQQMNNVLPWLHSKL-VPAGTGHATVS 275 V+NPPW L + + L L L P G G ++ Sbjct: 234 VVNPPWGLREALAAALTELGPLLSAPQGGGLWSLR 268 >UniRef50_A3JEY3 Protein involved in catabolism of external DNA n=3 Tax=Marinobacter RepID=A3JEY3_9ALTE Length = 287 Score = 323 bits (828), Expect = 6e-87, Method: Composition-based stats. Identities = 109/285 (38%), Positives = 154/285 (54%), Gaps = 12/285 (4%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY H+FHAGN ADV KH L + ++ K DTHAG+ Y L E A +T E Sbjct: 10 MLSYLHAFHAGNFADVHKHAALVLALNMMQAKASGIACTDTHAGSALYDLDDERARKTAE 69 Query: 61 YLEGIARIWQQDDLPAE-----LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQ 115 GI ++W Q D A L Y+ + N LR YPGSP LR QDSL Sbjct: 70 ADAGIRKLWPQLDSLAAADWQLLRPYL---QQLNSGANLRQYPGSPAWFGHYLRAQDSLG 126 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 + ELHPS+ L +++ R RV + DG L LPP R L+LIDP YE+KTDY A Sbjct: 127 VFELHPSETSSL-NQWASGKRLRVTQQDGLAGLLKVLPPRQPRLLVLIDPSYEVKTDYTA 185 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 V ++ +++ G++ +WYP++ + ++ L A IRKIL+ E+ + RGM Sbjct: 186 VAETLSRAWQKCRHGVFLVWYPILTSGLEQTLLEGLRAGPIRKILRSEVRLHTPP-ERGM 244 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+VINPPW ++++++ ++ L G G + W+ PE Sbjct: 245 VGSGMLVINPPWGMDERLSAMMRDLEPA-ARLGLGQ-QMDWLAPE 287 >UniRef50_A0YHR6 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YHR6_9GAMM Length = 256 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 105/258 (40%), Positives = 152/258 (58%), Gaps = 2/258 (0%) Query: 23 SLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLEGIARIWQQDDLPAELEAYI 82 + +K+ F Y+D+HAGAG + L S+ A++ E+ GI+++ D P L + Sbjct: 1 MEALGHFVKKESAFEYVDSHAGAGLFNLASKDAKKLEEHNYGISKLV-ASDFPELL-DFF 58 Query: 83 NVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKA 142 ++ +N+S ++ +YPGSP IA+ LR+QD L ELHP DY L + + RV Sbjct: 59 TAIRAYNKSAKINFYPGSPAIAKHFLRKQDRAWLYELHPQDYKSLCKNVESSKKMRVFCQ 118 Query: 143 DGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQ 202 DG + L++ LPP SRRGLILIDP YE+K++Y+ V YK+F+TG Y +WYPVV R+ Sbjct: 119 DGLKALESVLPPTSRRGLILIDPSYEIKSEYEHVFRACVNAYKKFSTGTYIVWYPVVERR 178 Query: 203 QIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHS 262 Q+ M +GI+ I + EL D+ RGMT+SG+ VINPPW L Q+M+ VLP L + Sbjct: 179 QVDVMEKKFILSGIKNIQRFELGRSADTRERGMTSSGVFVINPPWTLFQKMSAVLPRLAT 238 Query: 263 KLVPAGTGHATVSWIVPE 280 L G +V E Sbjct: 239 ILGDKNDGFFKCDILVAE 256 >UniRef50_Q1QSA4 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QSA4_CHRSD Length = 292 Score = 316 bits (810), Expect = 6e-85, Method: Composition-based stats. Identities = 99/285 (34%), Positives = 147/285 (51%), Gaps = 8/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 ML+Y+H++HAGN ADV KH +++ L K Y+DTHAG G Y L +E +R E Sbjct: 11 MLAYQHAYHAGNFADVHKHLTLFAVLQYLLRKSSAITYVDTHAGRGLYPLEAEETQRLRE 70 Query: 61 YLEGIARIWQQDDLPA---ELEAYINVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQL 116 Y +G A +W ++ A L A+ + L +YPGSP REQD L L Sbjct: 71 YRQGAAAVWAAREVLADDSLLAAWCERLGDAQSGASTLSHYPGSPWWLANDCREQDRLAL 130 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP + L ++ +AR + ADG ++A LPP + R LIDP YE K +Y V Sbjct: 131 FELHPGEATHLEAQVLP-PQARRQHADGLAGIRALLPPATPRFCALIDPSYERKQEYTDV 189 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDR-RGM 235 + + + I +WYP++ + ++ +G+RK+ + EL + P + GM Sbjct: 190 AATLQAVAAKVRHAIVMIWYPLLPSGRHHDLLTAARRSGLRKLWRSELTLHPPGEATHGM 249 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+++NPPW ++ Q+N L + S L T W VPE Sbjct: 250 YGSGMLLLNPPWGIDTQLNASLTRVASCLGD--TASHVSQWWVPE 292 >UniRef50_B7QYF1 Protein involved in catabolism of external DNA n=35 Tax=Alphaproteobacteria RepID=B7QYF1_9RHOB Length = 266 Score = 312 bits (799), Expect = 1e-83, Method: Composition-based stats. Identities = 102/257 (39%), Positives = 146/257 (56%), Gaps = 4/257 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN ADV KH + + ++ L +KDKP YL+THAG G YQL + A +TGE Sbjct: 1 MLSYQHIYHAGNLADVQKHALLARMLAYLTQKDKPLSYLETHAGRGLYQLDAAEAVKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 GI+R+ D L A+ + + YPGSP+IA LLRE DSL ELH Sbjct: 61 AEAGISRLL-NDALLAQDHPLAEAIARTRAAHGAAAYPGSPMIAAHLLREGDSLNFAELH 119 Query: 121 PSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P + LR + R RV + DGF+ + PP RRG++LIDP YE+K DY + Sbjct: 120 PQENAALRQAMRPHAKGGRVRVHQQDGFELALSLAPPTPRRGMLLIDPSYEIKRDYAQIP 179 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 IA+ ++++ G+ ALWYP++ K M++ LEA + +L+ E+ P + M Sbjct: 180 GHIAKLHRKWNVGVIALWYPILTDGAHKPMLNALEAQDLPGVLRHEVRFPPAREGHRMVG 239 Query: 238 SGMIVINPPWKLEQQMN 254 SGM ++N P+ E + Sbjct: 240 SGMFIVNAPYGTEDEAK 256 >UniRef50_B8H3J8 External DNA uptake/catabolism protein n=6 Tax=Caulobacteraceae RepID=B8H3J8_CAUCN Length = 273 Score = 309 bits (793), Expect = 7e-83, Method: Composition-based stats. Identities = 86/268 (32%), Positives = 122/268 (45%), Gaps = 2/268 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN AD+ KH + ++ +L+EK +DTHAGAG Y L E A R+GE Sbjct: 1 MNYRHAFHAGNFADLHKHAILLAMLSALQEKSPALAVIDTHAGAGGYDLAGEMARRSGEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 GI R+ D PA + +N + N YPGSP + LR D EL Sbjct: 61 QAGIFRLKAAADAPAVFQPLLNAITQMNGGKDGDLYPGSPRLMARALRGADRYVGCELRD 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LLR + AR +ADGF R I+IDPP+E DY +V+ Sbjct: 121 DDADLLRKTLAPCANARALQADGFDTAVKDA-GKGGRAFIVIDPPFERPDDYDRIVATTR 179 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 R A+W P+ + + +E T +L EL + P +D M M+ Sbjct: 180 AVLARAPDAALAIWLPIKDLETFDAFLRAME-TVTSDLLVAELRLRPLTDPMKMNGCAMV 238 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGT 269 +I P +E W+ ++L G Sbjct: 239 MIGAPPSVEDAAVAAGDWIATRLGEPGG 266 >UniRef50_C6NTA4 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NTA4_9GAMM Length = 290 Score = 307 bits (786), Expect = 3e-82, Method: Composition-based stats. Identities = 95/267 (35%), Positives = 142/267 (53%), Gaps = 7/267 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH HAGN AD LKH SL +E L KD P YL+THAGAGRY L GE+ Sbjct: 1 MNYRHDHHAGNAADCLKHLALSLALERLLHKDAPLFYLETHAGAGRYSLADA-----GEH 55 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 G+ R+W L ++++++ G LR+YPGSP++A LLR D + L E Sbjct: 56 SAGVDRVWAARRQLKGLSPWLDLLEEGAEDGVLRHYPGSPVVAARLLRPGDRMVLAEKVA 115 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 LR R + DG+ L+ LPP RRGLIL+DPP+E + +++A+ I Sbjct: 116 VVRERLRHNLAGRGRTSILGDDGYAILRGHLPPPERRGLILMDPPFERRDEWEALAKAII 175 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 + R+ G +WYP+ +R I R++ L+ ++ +EL + ++ M SG+I Sbjct: 176 GAHARWPQGCQIVWYPIKVRGMISRLLQSLQRALDMEV--VELRLESETGGTSMVGSGLI 233 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAG 268 ++ PPW L +++ L L L G Sbjct: 234 LVRPPWGLRERLLAALAVLGPVLAQGG 260 >UniRef50_Q1RK44 ComJ n=12 Tax=Rickettsia RepID=Q1RK44_RICBR Length = 262 Score = 307 bits (786), Expect = 4e-82, Method: Composition-based stats. Identities = 95/264 (35%), Positives = 145/264 (54%), Gaps = 12/264 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN AD++KH V I+E LK K+KPF LD AG G Y L SE A +T EY Sbjct: 1 MNYRHIYHAGNFADIVKHLVLIAILEQLKNKEKPFAVLDAFAGLGLYDLASEAASKTLEY 60 Query: 62 LEGIARIWQQDD-LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 GI ++ Q D P L+ +++V+ N GQ +YPGSP I + LLR QD L ELH Sbjct: 61 NNGIGKLLQALDHTPNSLKIFLSVI---NSVGQ-NFYPGSPFIIQQLLRPQDRLIACELH 116 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+DY L+ ++ D + +KA LP RGLI +DPP+E+K ++Q +++ + Sbjct: 117 PADYLDLKKLLPNNT----HCIDAYNAIKAFLPFKENRGLIFLDPPFEVKNEFQKLITAL 172 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + +WYP+ + H+ + G ++ L IE + S + M G+ Sbjct: 173 KKIKVSALNNSTLIWYPIKDLLLVSDFYHNYKTIGFKETLIIEYEL--LSSDKNMVKCGL 230 Query: 241 IVINPPWKLEQQMNNVLPWLHSKL 264 ++INPP + Q++ + +L L Sbjct: 231 MLINPP-NIRQELEELTKYLSYTL 253 >UniRef50_Q73R01 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73R01_TREDE Length = 279 Score = 302 bits (773), Expect = 1e-80, Method: Composition-based stats. Identities = 86/277 (31%), Positives = 154/277 (55%), Gaps = 17/277 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH FHAGN ADV KH+ ++ +K KPF D +AG+ Y L SE + +TGE Sbjct: 1 MLSYRHGFHAGNQADVFKHSALFSFLKVYTQKQKPFTAFDLNAGSASYNLLSEWSLKTGE 60 Query: 61 YLEGIAR---IWQQDDLPAEL----EAYIN-VVKHFNRSGQLRYYPGSPLIARLLLREQD 112 EGI R +++++ LP + +AY++ +K+++ + Y GSP I R L+++ Sbjct: 61 AEEGIIRFLDLYKKEKLPLPIPEGFKAYLDFCLKNYDENSS---YAGSPEIIRSFLQKES 117 Query: 113 SLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTD 172 +L L +LH ++ L+ +++ V K D ++ ++A PP+ RG L DP YE+ +D Sbjct: 118 NLILCDLHSAEAEKLKELYKRVENVHVHKRDCYEAVRALTPPLPIRGFALFDPSYEVDSD 177 Query: 173 YQAVVSGIAEGYKRFATGIYALWYPVVL--RQQIKRMIHDLEATGIRKILQIELAVLPD- 229 Y A+ + + K++ GI+ +WYP++ ++ + + + K+L IE+ + Sbjct: 178 YTAIAESVEKVCKKWPIGIFIIWYPILNHKTEECRNLKDRISKAMNNKVLNIEVKHFSNK 237 Query: 230 ---SDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSK 263 + G+ SG+++ NPPW LE+++ + ++ Sbjct: 238 IDSENEYGLQGSGLLITNPPWGLEEKLKEICEYVEKV 274 >UniRef50_UPI0000E1171F protein involved in external DNA uptake n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E1171F Length = 288 Score = 286 bits (733), Expect = 5e-76, Method: Composition-based stats. Identities = 99/290 (34%), Positives = 150/290 (51%), Gaps = 19/290 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGNHAD++KH ++ L +K+KP +DTHAGAG Y L + A+ E Sbjct: 1 MLSYQHIYHAGNHADLIKHLTLLSVLLKLGQKNKPCTLIDTHAGAGEYDLSATKAQHNNE 60 Query: 61 YLEGIARIWQQ---DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 L GI + + A L AY + S + Y GS + LREQD Sbjct: 61 SLTGIGMLDEAFFSQTDSALLHAYGEGLYTGVVSDK---YCGSAGWMQRYLREQDQAHFC 117 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELHP+ YP L + K A + DGF+QL A +PP+++RG++L+DPPYE ++Y V+ Sbjct: 118 ELHPNVYPELLNYVYK-PNAHCYQEDGFKQLIALVPPLAKRGIVLVDPPYEQASEYSMVL 176 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIK------RMIHDLEATG----IRKILQIELAVL 227 I + KR+ATG Y +WYP++ Q RM+ + + I+ Sbjct: 177 DVIEKSLKRWATGCYLIWYPMINTQNTNKAQAAIRMLKGFNTLADEHSVSNMANIQWRYD 236 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 +D +GM SG+I IN PW + ++++ + L + ++ WI Sbjct: 237 TTNDAQGMYGSGIIAINLPWGCDNEISDAM--LSIQQSKMTQAAFSLEWI 284 >UniRef50_A3VP01 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VP01_9PROT Length = 262 Score = 280 bits (717), Expect = 4e-74, Method: Composition-based stats. Identities = 83/257 (32%), Positives = 131/257 (50%), Gaps = 10/257 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H++HAGN AD+ KH V ++ L +K + LDTHAG G Y L A++TGE Sbjct: 1 MLSYQHAYHAGNRADLHKHAVWCALLAHLTQKSRGLTILDTHAGRGLYDLAGAEAQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +G A + D L + V YPGSPL++ R QD + L E H Sbjct: 61 ASDGAAAV--SLDGSHALG---SAVAACRAQYGEMAYPGSPLLSLHFARPQDQVILMEKH 115 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P + L++ + +A V DG++ A PP R+GL++IDP YE+KT+YQ V + Sbjct: 116 PQEGAALKTVM-RGKKAAVHLRDGYEGALALAPPTPRKGLVMIDPSYEVKTEYQNVALFL 174 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 ++ LWYP++ ++ + M+ L + E+ + + M SG+ Sbjct: 175 PTLIDKWPEASVLLWYPILAAKRHEAMLDTLSPMQP---WRHEV-LFTEDSLLRMKGSGL 230 Query: 241 IVINPPWKLEQQMNNVL 257 ++I+PP+ E ++ L Sbjct: 231 VLISPPYGGEGAIDAAL 247 >UniRef50_C5SM30 Putative uncharacterized protein n=2 Tax=Caulobacteraceae RepID=C5SM30_9CAUL Length = 284 Score = 273 bits (698), Expect = 5e-72, Method: Composition-based stats. Identities = 78/280 (27%), Positives = 123/280 (43%), Gaps = 16/280 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN AD+ KH V + +L+E +P +D+HAGAG+Y L R+ E Sbjct: 1 MNYRHGFHAGNFADLFKHAVLLNFLRALRESAQPLQVVDSHAGAGQYDLSDPTFSRSKEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLR----YYPGSPLIARLLLREQDSLQLT 117 GI + D+P L + V NR+ + YPGSPL+ L + S Sbjct: 61 EAGIGYLLG-GDVPQSLIPLSDYVWAKNRAAGFKTRIGLYPGSPLLVLDHLTAEGSYMGC 119 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 EL DY LR+ +AR DG++ P + +LIDPP+E DY+ + Sbjct: 120 ELRKDDYERLRATVMPRGKAR--HTDGYEAAVEMAEP-DKDFFLLIDPPFEQFEDYERIN 176 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLE--------ATGIRKILQIELAVLPD 229 + + K+ T +W P+ + R + +E G I EL + P Sbjct: 177 LCLRDVLKKQPTAKALVWLPLKDLETFDRFLRHMECELLEDQTGEGGPDIAVAELRLRPL 236 Query: 230 SDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGT 269 ++ M ++ +N P + + M ++ L G Sbjct: 237 TNPLKMNGCALVTVNAPASVVEAMRDIADDLAQVFAEPGG 276 >UniRef50_B7VU08 Putative uncharacterized protein n=4 Tax=Vibrionales RepID=B7VU08_VIBSL Length = 288 Score = 265 bits (679), Expect = 8e-70, Method: Composition-based stats. Identities = 78/292 (26%), Positives = 142/292 (48%), Gaps = 20/292 (6%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 + YRH H G+H D LKH V S +++SL ++ +DTH+G G Y L + + GE+ Sbjct: 1 MEYRHQCHVGDHGDALKHPVLSALVQSLMQQHSRLNVIDTHSGTGCYDLTTAPSNHAGEF 60 Query: 62 LEGIARIWQQDD-LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 EG+ +W+ LP ++++V++++N + + YPGS I R QDS +++ Sbjct: 61 AEGVGYLWRNKAYLPPAFASFMSVLEYYNPNQLISLYPGSAAITYQQGRSQDSFYFSDIQ 120 Query: 121 PSDYPLLRSE---FQKD----SRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDY 173 + LL++ Q+D S+ + DG + L + LI+IDPPYE ++Y Sbjct: 121 QDEADLLQTNIETLQRDLDVSSKLTITAGDGLKALPDDVAKHDNHHLIVIDPPYETDSEY 180 Query: 174 QAVVSGIAEGYKRFATGIYALWYP--------VVLRQQIKRMIHDLEATGIRKILQIELA 225 AV+ + + Y++ +WYP ++L + + L + I+ EL Sbjct: 181 LAVIDALVKAYQQSEKVSALIWYPLYTDDKSSLILNHCVTAVKDGLLPSPIKS----ELR 236 Query: 226 VLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 + + SG+++ NPP + + + L +LH +L G G+ + + Sbjct: 237 LRDPKGDDRLIGSGLLLFNPPQGISGIVADTLDYLHCQLSTNGEGYWQMRSL 288 >UniRef50_B7G053 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G053_PHATR Length = 267 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 83/270 (30%), Positives = 134/270 (49%), Gaps = 21/270 (7%) Query: 4 YRHSFHAGNHADVLKHTV-QSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYL 62 Y+H HAGNH DVLKH V ++ + E L + + +D HAG G Y L + +G++ Sbjct: 7 YQHLKHAGNHCDVLKHVVFRACVQEQLNVHENGIILVDCHAGEGLYDLSK---QTSGDFE 63 Query: 63 EGIARIWQQDD--LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 G+AR+ Q D P + Y+N ++ + +++YPGSP++ LLREQD +L +L+ Sbjct: 64 RGVARVVQNLDQTAPPAVHDYVNAIQEADEY--MQFYPGSPMLGAKLLREQDEHRLVDLY 121 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 D L+ +A V +AD + L R +ILIDPPY + D+ Sbjct: 122 VEDVEGLKDGALFW-QADVFEADAVEFLVPN--DDDRHKVILIDPPYLDQEDFYRAKVLT 178 Query: 181 AEGYKRFATGIYALWYPVVLRQQIK----RMIHDLEATGIR-KILQIELAVLPDSDRRGM 235 R LWYP++ + + + + I D+ + I Q L V D+ G+ Sbjct: 179 ERILDRDPYCTILLWYPMIQKSRWRYGYAKSIKDMAKKKAKLGIYQAWLTV----DKEGL 234 Query: 236 TASGMIVINPPWKLEQQM-NNVLPWLHSKL 264 SGMIV+NP + ++ + + + WL + L Sbjct: 235 QGSGMIVVNPTQRFDEIVDEDTIDWLSATL 264 >UniRef50_Q0C0F5 Putative uncharacterized protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C0F5_HYPNA Length = 262 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 78/267 (29%), Positives = 123/267 (46%), Gaps = 6/267 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H FHAGN ADVLKH V ++ + +P Y++TH+G GRY L + A + GE Sbjct: 1 MLSYQHGFHAGNRADVLKHAVLDTLLRAAATGPRPLFYVETHSGHGRYDLTNAQARKRGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +G+ + + P L ++ +V N G+ + YPGSP +A+ LL + + L ELH Sbjct: 61 SDDGVLALM-KGKPPKPLSGWMELV---NARGE-KDYPGSPALAQTLLPKHARMMLFELH 115 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P + L + D R R++KADG+ P + ++L+DP YE D +A+ Sbjct: 116 PQENAALTEAMKGDDRIRIQKADGYAGALKLAPRAGEQMVVLVDPSYETHRDIEALALWT 175 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + KR+ + LW P+ + L G I+ V + S M Sbjct: 176 PKALKRWPGALLILWLPLFRDGREAEFGEYLATLGDAMIVGARWPV-ALGTESSIEGSAM 234 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPA 267 + P + + + L S Sbjct: 235 VAFGAPAEARAKCEAIASSLESYWAQQ 261 >UniRef50_UPI0001909543 putative DNA methylase protein n=1 Tax=Rhizobium etli IE4771 RepID=UPI0001909543 Length = 171 Score = 210 bits (536), Expect = 4e-53, Method: Composition-based stats. Identities = 62/164 (37%), Positives = 91/164 (55%) Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP DY L F+ D AR+ + DG+ L A LPP +RG++L+DPP+E + +YQ + Sbjct: 1 MELHPEDYARLHRLFEGDHHARITELDGWLALGAHLPPKEKRGIVLVDPPFEEEDEYQRL 60 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT 236 G+ Y+RF G Y LWYP+ IK L+A I K+L EL V D G+T Sbjct: 61 AKGLERAYRRFPGGTYCLWYPLKKGAPIKEFHETLQALDIPKMLCAELTVRSDRGTTGLT 120 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SG++++NPP+ L+ +++ +LP L L W+ E Sbjct: 121 GSGLVIVNPPFTLKDELHQMLPALKDHLAQDRFASQRAFWLRGE 164 >UniRef50_Q2BH49 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BH49_9GAMM Length = 251 Score = 183 bits (466), Expect = 4e-45, Method: Composition-based stats. Identities = 68/241 (28%), Positives = 102/241 (42%), Gaps = 19/241 (7%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M Y HS +AG ADV+KH + ++ L D Y++THAGAG Y L + GE Sbjct: 1 MAKYLHSKYAGGDADVMKHACLASVLSKL---DISVEYVETHAGAGLYDLDPDR----GE 53 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +L+GI R L+ Y V++ + + + YP SP+IA SL L EL+ Sbjct: 54 HLKGIGRCRSNLTDLPALKPYNGVLEE-SWTLDKKIYPASPIIANSA-SAVKSLCLYELN 111 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 S L+ A V + DGF S +LIDPPY+ DYQ VV + Sbjct: 112 RSVACQLKKNL---PEAVVWEEDGFLSRHHL----SHGSFVLIDPPYKSSDDYQQVVEYV 164 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 K+ +W+P++ + L ++L+ M+ G+ Sbjct: 165 GAA-KQSQVRAVMVWFPMIYSDLTADLYDGLLGLYPDGQW-LQLS-RGLHSEGAMSGFGV 221 Query: 241 I 241 Sbjct: 222 F 222 >UniRef50_A0B718 Protein involved in catabolism of external DNA-like n=1 Tax=Methanosaeta thermophila PT RepID=A0B718_METTP Length = 246 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 72/185 (38%), Gaps = 17/185 (9%) Query: 4 YRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLE 63 Y H HAGN DV KH + S L + +Y ++HAG Y L GE+ Sbjct: 2 YDHREHAGNAGDVWKHFLLSEAAAYLLCR-SDLVYAESHAGYTAYTLAP-----NGEWRW 55 Query: 64 GIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPSD 123 GI R W Y V++ N L+ YPGS I L R + EL Sbjct: 56 GIGRCWHLRSEIES--PYFAVLEEMN-DEHLQIYPGSAKIILRLGRFFRRRVVAELWDIS 112 Query: 124 YPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRR--GLILIDPPYEMKTDYQAVVSGIA 181 + +S + DGF + L +RR GL+LIDPP D + + Sbjct: 113 EDVGKS-WSACPDIHFHLGDGFSGVMDLL---NRRDPGLLLIDPP--SPDDQDKAIELLK 166 Query: 182 EGYKR 186 + R Sbjct: 167 DASDR 171 >UniRef50_A3I4J5 Methyltransferase n=3 Tax=Bacillaceae RepID=A3I4J5_9BACI Length = 197 Score = 48.6 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 44/114 (38%), Gaps = 13/114 (11%) Query: 88 FNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQKDS---RARVEKAD 143 F+ L + GS + L R + E + +L+ +K + + D Sbjct: 49 FDGGTALDLFAGSGGLGIESLSRGAERAIFIEKDAKAFQVLQENIKKCRYEEHTELFRID 108 Query: 144 GFQQLKAKLPPVSRR----GLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYA 193 + +KA L +R L+ +DPPY K +Y +V + + K GI Sbjct: 109 AKRAVKALL----KRDITFSLVFLDPPYHQK-EYYDLVQLLVDNEKIQQNGIIL 157 >UniRef50_D0THA4 Predicted protein n=14 Tax=Bacteroides RepID=D0THA4_9BACE Length = 69 Score = 43.6 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 12/65 (18%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M++Y H G DVLKH V +++ +KP +Y++T++ Y + T E Sbjct: 1 MVTYTHF---GKQPDVLKHLVLCEVLQI----EKPQIYVETNSACAIYTMT-----HTPE 48 Query: 61 YLEGI 65 GI Sbjct: 49 QEYGI 53 >UniRef50_C1ACG3 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ACG3_GEMAT Length = 192 Score = 41.6 bits (97), Expect = 0.032, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 43/123 (34%), Gaps = 18/123 (14%) Query: 82 INVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQ-LTELHPSDYPLLRSEFQKDS---R 136 + +V+ + + + G+ I L E PS L++ + Sbjct: 33 MKLVRADLEGARVIDLFAGTGAIGLEALSRGAKYVDFVEFRPSSLHALKANIAALRVTTK 92 Query: 137 ARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYALWY 196 ARV K D A + R L +DPPYE + + +R+ A W Sbjct: 93 ARVYKKDALPFANALI--AGRYDLAFVDPPYESR--------MLDRLIERWLE---APWS 139 Query: 197 PVV 199 P++ Sbjct: 140 PIL 142 >UniRef50_A6Q3P8 DNA methylase n=3 Tax=Epsilonproteobacteria RepID=A6Q3P8_NITSB Length = 193 Score = 41.2 bits (96), Expect = 0.037, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 35/96 (36%), Gaps = 6/96 (6%) Query: 99 GSPLIARLLL-REQDSLQLTELHPSDYPLLRSEFQKDSR--ARVEKADGFQQLKAKLPPV 155 GS + L R + E + Y +L+ V D F+ L + + Sbjct: 57 GSGSVGLEALSRGAKRVYFIEKNRDSYKVLKKNVHNCDESSCSVRYGDAFELLWDVIEEL 116 Query: 156 SR---RGLILIDPPYEMKTDYQAVVSGIAEGYKRFA 188 R + DPP+ ++ Y+ + +A+ K+ Sbjct: 117 KRNKEKAYFYFDPPFSIREGYEDIYDEVAQTIKKIP 152 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C9Y2J0 Uncharacterized protein yhiR n=11 Tax=Enterobact... 435 e-121 UniRef50_P31777 Uncharacterized protein HI0441 n=170 Tax=Gammapr... 435 e-121 UniRef50_C6C5L8 Putative uncharacterized protein n=3 Tax=Gammapr... 431 e-119 UniRef50_A3UP41 Protein involved in catabolism of external DNA n... 424 e-117 UniRef50_B8F539 Protein involved in catabolism of external DNA n... 420 e-116 UniRef50_A1T010 DNA (Exogenous) processing protein n=8 Tax=Gamma... 411 e-113 UniRef50_Q984Q7 Mlr7888 protein n=7 Tax=Rhizobiales RepID=Q984Q7... 408 e-112 UniRef50_B9JCU5 DNA methylase protein n=4 Tax=Rhizobiales RepID=... 405 e-111 UniRef50_A0KP43 Protein involved in external DNA uptake n=3 Tax=... 403 e-111 UniRef50_B6QZ93 Florfenicol resistance protein n=3 Tax=Rhodobact... 401 e-110 UniRef50_Q5NZ63 Predicted protein involved in catabolism of exte... 401 e-110 UniRef50_Q47Z69 Putative uncharacterized protein n=1 Tax=Colwell... 401 e-110 UniRef50_A6SXL6 Uncharacterized conserved protein n=70 Tax=cellu... 395 e-109 UniRef50_B2S4W3 N-6 Adenine-specific DNA methylase n=51 Tax=Rhiz... 394 e-108 UniRef50_Q12I42 Putative uncharacterized protein n=20 Tax=Shewan... 394 e-108 UniRef50_A3YH43 Protein involved in external DNA uptake n=2 Tax=... 394 e-108 UniRef50_Q0VM19 Putative uncharacterized protein n=2 Tax=Alcaniv... 394 e-108 UniRef50_A4VR91 Protein involved in catabolism of external DNA n... 393 e-108 UniRef50_C3X722 External-DNA catabolic protein n=2 Tax=Oxalobact... 393 e-108 UniRef50_B1XZU6 Putative uncharacterized protein n=1 Tax=Leptoth... 380 e-104 UniRef50_Q2W9T7 Protein involved in catabolism of external DNA n... 380 e-104 UniRef50_C6M2C4 YhiR family protein n=2 Tax=Neisseriaceae RepID=... 373 e-102 UniRef50_Q15U81 Putative uncharacterized protein n=1 Tax=Pseudoa... 372 e-102 UniRef50_Q2SPJ4 Protein involved in catabolism of external DNA n... 370 e-101 UniRef50_Q89DH2 Blr7467 protein n=16 Tax=Rhizobiales RepID=Q89DH... 369 e-101 UniRef50_B8GN89 Putative uncharacterized protein n=1 Tax=Thioalk... 369 e-101 UniRef50_C6XPG2 Putative uncharacterized protein n=5 Tax=Proteob... 369 e-101 UniRef50_C6QCA2 Putative uncharacterized protein n=1 Tax=Hyphomi... 368 e-100 UniRef50_Q1N5H6 Putative uncharacterized protein n=1 Tax=Bermane... 367 e-100 UniRef50_Q5QVX6 Transformation competence-related protein ComJ n... 366 e-100 UniRef50_Q0ARP7 Putative uncharacterized protein n=2 Tax=Hyphomo... 365 1e-99 UniRef50_B2HZ48 Protein involved in catabolism of external DNA n... 364 2e-99 UniRef50_D0IYP9 ComJ n=10 Tax=Bacteria RepID=D0IYP9_COMTE 362 7e-99 UniRef50_Q5ZVZ2 Protein involved in catabolism of external DNA n... 359 5e-98 UniRef50_C5BQN4 Putative uncharacterized protein n=1 Tax=Teredin... 357 3e-97 UniRef50_A4SYR3 Putative uncharacterized protein n=1 Tax=Polynuc... 356 5e-97 UniRef50_C7JEW3 Putative uncharacterized protein n=8 Tax=Acetoba... 355 1e-96 UniRef50_Q0F148 Putative uncharacterized protein n=1 Tax=Maripro... 354 2e-96 UniRef50_D0KVW6 Putative uncharacterized protein n=1 Tax=Halothi... 352 8e-96 UniRef50_Q87F97 Transformation competence-related protein n=20 T... 347 2e-94 UniRef50_Q1YIC4 Putative uncharacterized protein n=1 Tax=Auranti... 347 3e-94 UniRef50_A1WIT5 Putative uncharacterized protein n=4 Tax=Burkhol... 346 5e-94 UniRef50_Q0G6E9 Putative uncharacterized protein n=1 Tax=Fulvima... 344 1e-93 UniRef50_B4RYZ5 Putative uncharacterized protein n=2 Tax=Alterom... 344 2e-93 UniRef50_B1ZS65 Putative uncharacterized protein n=2 Tax=Opituta... 341 2e-92 UniRef50_A5EWC5 Putative uncharacterized protein n=1 Tax=Dichelo... 339 6e-92 UniRef50_Q0BPC8 Putative uncharacterized protein n=3 Tax=Acetoba... 338 8e-92 UniRef50_A1VJI9 Putative uncharacterized protein n=4 Tax=Comamon... 336 5e-91 UniRef50_D2LG58 Putative uncharacterized protein n=1 Tax=Rhodomi... 336 6e-91 UniRef50_B1LXQ5 Putative uncharacterized protein n=9 Tax=Alphapr... 332 6e-90 UniRef50_A5WD58 Putative uncharacterized protein n=3 Tax=Psychro... 332 6e-90 UniRef50_C8PZ79 Protein involved in catabolism of external DNA n... 332 9e-90 UniRef50_Q21LZ8 Putative uncharacterized protein n=1 Tax=Sacchar... 331 2e-89 UniRef50_Q2G473 Putative uncharacterized protein n=1 Tax=Novosph... 329 5e-89 UniRef50_B5EL93 Putative uncharacterized protein n=2 Tax=Acidith... 328 1e-88 UniRef50_C8NAD4 Cytoplasmic protein n=34 Tax=Proteobacteria RepI... 328 2e-88 UniRef50_A3JEY3 Protein involved in catabolism of external DNA n... 325 1e-87 UniRef50_A0YHR6 Putative uncharacterized protein n=1 Tax=marine ... 320 3e-86 UniRef50_Q1QSA4 Putative uncharacterized protein n=1 Tax=Chromoh... 317 2e-85 UniRef50_B7QYF1 Protein involved in catabolism of external DNA n... 312 6e-84 UniRef50_Q1RK44 ComJ n=12 Tax=Rickettsia RepID=Q1RK44_RICBR 312 1e-83 UniRef50_C6NTA4 Putative uncharacterized protein n=1 Tax=Acidith... 309 8e-83 UniRef50_B8H3J8 External DNA uptake/catabolism protein n=6 Tax=C... 308 1e-82 UniRef50_Q73R01 Putative uncharacterized protein n=1 Tax=Trepone... 300 2e-80 UniRef50_UPI0000E1171F protein involved in external DNA uptake n... 294 2e-78 UniRef50_A3VP01 Putative uncharacterized protein n=1 Tax=Parvula... 286 6e-76 UniRef50_C5SM30 Putative uncharacterized protein n=2 Tax=Cauloba... 274 2e-72 UniRef50_B7VU08 Putative uncharacterized protein n=4 Tax=Vibrion... 269 7e-71 UniRef50_Q0C0F5 Putative uncharacterized protein n=1 Tax=Hyphomo... 256 7e-67 UniRef50_B7G053 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 243 6e-63 UniRef50_UPI0001909543 putative DNA methylase protein n=1 Tax=Rh... 214 2e-54 UniRef50_Q2BH49 Putative uncharacterized protein n=1 Tax=Neptuni... 199 6e-50 UniRef50_A0B718 Protein involved in catabolism of external DNA-l... 149 1e-34 UniRef50_A3I4J5 Methyltransferase n=3 Tax=Bacillaceae RepID=A3I4... 73 8e-12 Sequences not found previously or not previously below threshold: UniRef50_C5D8K2 Methyltransferase n=85 Tax=Bacillales RepID=C5D8... 48 4e-04 UniRef50_C9RZH2 Methyltransferase n=5 Tax=Bacillaceae RepID=C9RZ... 48 5e-04 UniRef50_Q1AW93 Putative uncharacterized protein n=1 Tax=Rubroba... 47 0.001 UniRef50_C4L5T0 Methyltransferase n=2 Tax=Bacillales RepID=C4L5T... 46 0.001 UniRef50_C7I1R0 Methyltransferase n=1 Tax=Thiomonas intermedia K... 45 0.002 UniRef50_C1ACG3 Putative uncharacterized protein n=1 Tax=Gemmati... 45 0.003 UniRef50_C0GHJ6 Methyltransferase n=1 Tax=Dethiobacter alkaliphi... 44 0.006 UniRef50_D0THA4 Predicted protein n=14 Tax=Bacteroides RepID=D0T... 44 0.007 UniRef50_A6Q3P8 DNA methylase n=3 Tax=Epsilonproteobacteria RepI... 43 0.009 UniRef50_C8NLD5 N6-adenine-specific methylase n=20 Tax=Corynebac... 42 0.031 UniRef50_Q0VLD7 Putative uncharacterized protein n=2 Tax=Alcaniv... 41 0.035 UniRef50_D1C502 Putative uncharacterized protein n=1 Tax=Sphaero... 41 0.037 UniRef50_C9RAL4 Methyltransferase n=1 Tax=Ammonifex degensii KC4... 41 0.037 UniRef50_Q98DH2 Mlr4706 protein n=1 Tax=Mesorhizobium loti RepID... 40 0.069 >UniRef50_C9Y2J0 Uncharacterized protein yhiR n=11 Tax=Enterobacteriaceae RepID=C9Y2J0_CROTZ Length = 280 Score = 435 bits (1120), Expect = e-121, Method: Composition-based stats. Identities = 242/280 (86%), Positives = 259/280 (92%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRY L EHAERTGE Sbjct: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYLLSGEHAERTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 YLEGIARIWQ+DDLPAELE YI+ V HFNRSGQLRYYPGSPLIAR LLR QDSLQLTELH Sbjct: 61 YLEGIARIWQRDDLPAELEPYISAVSHFNRSGQLRYYPGSPLIARQLLRPQDSLQLTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 PSD+PLLR EFQKD RARVE+ADG+QQLK+KLPP SRRGLILIDPPYE+KTDYQAVV GI Sbjct: 121 PSDFPLLRGEFQKDERARVERADGYQQLKSKLPPASRRGLILIDPPYEIKTDYQAVVQGI 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGYKRFATG+YALWYPVVLR QIKRM++DLE+TGIR+ILQIELAV PDSD+RGMTASGM Sbjct: 181 NEGYKRFATGVYALWYPVVLRNQIKRMMNDLESTGIRRILQIELAVRPDSDQRGMTASGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +VINPPWKLEQQM +LPWLH LVPAGTGH T+ W+VPE Sbjct: 241 VVINPPWKLEQQMGTLLPWLHKALVPAGTGHTTLKWVVPE 280 >UniRef50_P31777 Uncharacterized protein HI0441 n=170 Tax=Gammaproteobacteria RepID=Y441_HAEIN Length = 281 Score = 435 bits (1120), Expect = e-121, Method: Composition-based stats. Identities = 186/281 (66%), Positives = 214/281 (76%), Gaps = 1/281 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY HSFHAGNHADVLKH V LI+E+LK K+K F YLDTH+G GRY+L S +E+TGE Sbjct: 1 MLSYHHSFHAGNHADVLKHIVLMLILENLKLKEKGFFYLDTHSGVGRYRLSSNESEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSG-QLRYYPGSPLIARLLLREQDSLQLTEL 119 Y EGI R+W Q DLP ++ Y+ ++K N G +LRYY GSPLIA LLR QD LTEL Sbjct: 61 YKEGIGRLWDQTDLPEDIARYVKMIKKLNYGGKELRYYAGSPLIAAELLRSQDRALLTEL 120 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HPSDYP+LR+ F D V+ +GFQQ+KA LPP RRGL+LIDPPYE+K DY VV Sbjct: 121 HPSDYPILRNNFSDDKNVTVKCDNGFQQVKATLPPKERRGLVLIDPPYELKDDYDLVVKA 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I EGYKRFATG YA+WYPVVLRQQ KR+ LEATGIRKIL+IELAV PDSD+RGMTASG Sbjct: 181 IEEGYKRFATGTYAIWYPVVLRQQTKRIFKGLEATGIRKILKIELAVRPDSDQRGMTASG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 M+VINPPW LE QM +LP+L LVP GTG TV WI PE Sbjct: 241 MVVINPPWTLETQMKEILPYLTKTLVPEGTGSWTVEWITPE 281 >UniRef50_C6C5L8 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=C6C5L8_DICDC Length = 280 Score = 431 bits (1110), Expect = e-119, Method: Composition-based stats. Identities = 219/280 (78%), Positives = 247/280 (88%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADVLKHTVQSLII +LKEK+KPFLYLDTH+GAGRYQL EHAERTGE Sbjct: 1 MLSYRHSFHAGNHADVLKHTVQSLIITALKEKEKPFLYLDTHSGAGRYQLHGEHAERTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y EGI RIWQ+DD+PAE+EAY+ VV+ +N GQLRYYPGSPLIAR LLREQD+L LTELH Sbjct: 61 YREGIGRIWQRDDIPAEMEAYLQVVRSYNSGGQLRYYPGSPLIARQLLREQDTLNLTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+D+ LLR EF +D RARV + DG+ QLK++LPP +RRG+ILIDPPYE+KTDYQAVV GI Sbjct: 121 PTDFSLLRQEFARDDRARVVREDGYLQLKSRLPPAARRGVILIDPPYELKTDYQAVVDGI 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGY+RFATG+YALWYPVVLRQQIKR++ LE TGIR+ILQIELAVLPDSDR GMTASGM Sbjct: 181 QEGYRRFATGVYALWYPVVLRQQIKRLLKALEETGIRRILQIELAVLPDSDRHGMTASGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPWKLE QM ++LPWLH LVP GTGH V W+VPE Sbjct: 241 IVINPPWKLEAQMKSLLPWLHQVLVPEGTGHTRVEWVVPE 280 >UniRef50_A3UP41 Protein involved in catabolism of external DNA n=17 Tax=Gammaproteobacteria RepID=A3UP41_VIBSP Length = 284 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. Identities = 172/285 (60%), Positives = 213/285 (74%), Gaps = 6/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGNHADV+KH VQSLI+ LK+KDKPF+Y DTH+G GRY L E +E+TGE Sbjct: 1 MLSYRHSFHAGNHADVVKHIVQSLILNYLKQKDKPFVYHDTHSGVGRYDLTHEWSEKTGE 60 Query: 61 YLEGIARIWQQD-----DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQ 115 Y +GIAR+W DLP ++++Y+ + N +LR+YPGSP +AR LR+QD + Sbjct: 61 YKQGIARLWSASEAGQQDLPEDIQSYLESISALNNGEKLRFYPGSPRVARAHLRDQDRMV 120 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 LTELHP+D+PLL EF +D + + K DGFQ+LK LPP RRGL+LIDPPYE+ +Y+ Sbjct: 121 LTELHPADHPLLEQEFHRDRQVSIYKEDGFQRLKGSLPPKERRGLVLIDPPYELAKEYRD 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 VV+ IA+ +KR+ATGIYA+WYPVV R I+ MI LE GI KILQIEL V PD++ RGM Sbjct: 181 VVTAIAQSHKRWATGIYAIWYPVVNRCDIEDMIEGLEGLGINKILQIELGVSPDTNERGM 240 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 TASGMIVINPPWKLE QMN +LP+L + P TGH V WIVPE Sbjct: 241 TASGMIVINPPWKLESQMNEILPFLKEAIAP-ATGHFKVEWIVPE 284 >UniRef50_B8F539 Protein involved in catabolism of external DNA n=67 Tax=Gammaproteobacteria RepID=B8F539_HAEPS Length = 279 Score = 420 bits (1081), Expect = e-116, Method: Composition-based stats. Identities = 180/280 (64%), Positives = 218/280 (77%), Gaps = 1/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY HSFHAGNHADVLKH V +LI+ +LK+K+K F YLDTH+G GRY L S AE+TGE Sbjct: 1 MLSYHHSFHAGNHADVLKHIVLTLILHALKQKEKGFFYLDTHSGVGRYSLQSSEAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y+EGIAR+W++ DLP ++ Y+N +K N+ +LR+Y GSPL+A LR QD LTELH Sbjct: 61 YIEGIARLWERTDLPEKVVLYLNEIKKINKD-KLRFYAGSPLLAVQQLRPQDRALLTELH 119 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+D+PLLR+EF K ++ +GFQQLK+ LPP +RGL+LIDPPYE+K DY+ VV I Sbjct: 120 PNDFPLLRNEFAKTPNVVTKRENGFQQLKSALPPKEKRGLVLIDPPYELKEDYELVVKAI 179 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EGYKRFATG+YA+WYPVVLRQ KR++ L TGIRKILQIELAV PDSD+RGMTASGM Sbjct: 180 EEGYKRFATGVYAIWYPVVLRQHTKRIVRGLVETGIRKILQIELAVRPDSDQRGMTASGM 239 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPW+LE QM +LP+L LVP GTG TV WI PE Sbjct: 240 IVINPPWQLESQMKKILPYLTDVLVPEGTGSWTVEWIKPE 279 >UniRef50_A1T010 DNA (Exogenous) processing protein n=8 Tax=Gammaproteobacteria RepID=A1T010_PSYIN Length = 284 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 137/280 (48%), Positives = 183/280 (65%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 +LSYRHSFHAGN ADVLKH V + II+ + +K+K F YLDTHAG G Y S A +T E Sbjct: 5 LLSYRHSFHAGNFADVLKHIVSTSIIDYMLKKEKAFCYLDTHAGCGAYSFQSPEALKTKE 64 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI +W + DLP + Y+ V FN QL++YPGSP IA +LR+ D L L ELH Sbjct: 65 FNNGIFPLWGRSDLPVPVARYMEQVVEFNAQSQLKHYPGSPSIAVQMLRDIDRLFLFELH 124 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+++ + + F + + ++ K+DG Q L A +PP +RRG ILIDP YE+KT+Y VV + Sbjct: 125 PNEFINMCANFSGNRQIKMAKSDGLQGLIANMPPKARRGFILIDPSYEIKTEYHQVVETL 184 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + +KRFATG YALWYPVV R I +M L+A+GI+ I EL + DSD+ GMT+SGM Sbjct: 185 IQAHKRFATGTYALWYPVVNRMTIDKMEKALKASGIKNIQLFELGLQEDSDQMGMTSSGM 244 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 IVINPPW L+++M LP+L L G + +V E Sbjct: 245 IVINPPWTLKKEMQASLPFLAKLLGFDNQGFYRIETLVAE 284 >UniRef50_Q984Q7 Mlr7888 protein n=7 Tax=Rhizobiales RepID=Q984Q7_RHILO Length = 282 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 122/282 (43%), Positives = 174/282 (61%), Gaps = 3/282 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH V + +++ LK+KDK F +DTHAG GRY L S A++TGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHVVLTRLLDYLKQKDKAFRVVDTHAGIGRYDLSSLEAQKTGEW 60 Query: 62 LEGIARIWQQD---DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 GI R+ A L Y+ V+ N ++ YPGSPL+AR LLR+QD L E Sbjct: 61 QGGIGRLIDASLDARAGALLAPYLEAVRSLNPGDGVKKYPGSPLLARHLLRKQDRLSAIE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP D L++EF D + RV + DG+ L A LPP +RGL+LIDPP+E + ++ +V Sbjct: 121 LHPKDAARLKAEFAGDFQVRVMELDGWLALGAHLPPKEKRGLVLIDPPFEEEGEFGRLVE 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+ ++R+ GIYALWYP+ R+ + L+ +GI KIL IE + P S + S Sbjct: 181 GLIRAHRRWPGGIYALWYPIKDRKAVIAFRKALKQSGIPKILDIEFEIRPASSEPSLDGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 GM+V+NPP+ LE ++ VLP LH L H ++ W+ E Sbjct: 241 GMVVVNPPFTLEGELRTVLPALHKLLAVEKPAHWSLEWLAGE 282 >UniRef50_B9JCU5 DNA methylase protein n=4 Tax=Rhizobiales RepID=B9JCU5_AGRRK Length = 283 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. Identities = 117/282 (41%), Positives = 170/282 (60%), Gaps = 3/282 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN ADVLKH V + ++ ++ KDK F LDTHAG G Y L SE A++TGE+ Sbjct: 1 MNYRHIYHAGNFADVLKHAVLARLVRYMQNKDKAFRVLDTHAGIGLYDLSSEEAQKTGEW 60 Query: 62 LEGIARIWQQDDLP---AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +GI R+ + +P LE Y+ V+ N G L++YPGSP +AR+L R QD L E Sbjct: 61 QDGIGRLLDAELVPQLAELLEPYLTAVRELNPDGGLQFYPGSPKLARMLFRSQDRLSAME 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP D+ L F+ D AR+ + DG+ L A LPP +RG++L+DPP+E + +Y+ + Sbjct: 121 LHPEDFQRLHRLFEGDHHARITELDGWLALGAHLPPKEKRGIVLVDPPFEEEDEYERLAD 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+A+ ++RF G Y LWYP+ IK L+A I K+L EL V D G+T S Sbjct: 181 GLAKAWRRFPGGTYCLWYPIKKDAPIKAFHETLQALEIPKVLCAELTVKSDRGFTGLTGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G+I++NPP+ L+ +++ +LP L L W+ E Sbjct: 241 GLIIVNPPFTLKDELHALLPALKDMLAQDRFASQRAFWLRGE 282 >UniRef50_A0KP43 Protein involved in external DNA uptake n=3 Tax=Aeromonadaceae RepID=A0KP43_AERHH Length = 284 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 134/280 (47%), Positives = 178/280 (63%), Gaps = 2/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH VQ+LIIESLK+K+KPF+ LDTHAG G Y L + ++ E Sbjct: 1 MLSYRHAFHAGNHADVLKHAVQALIIESLKKKEKPFIVLDTHAGGGLYDLCGDWPQKKAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI R+W + + + Y+ V++ N GQLRYYPGSP ++R L REQD L L ELH Sbjct: 61 YADGIGRLWDERTQWSAMAPYLGVIEEMNSDGQLRYYPGSPELSRRLAREQDKLALMELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 ++ LR+ D R V DGF+ L A LPP RRGL+LIDPPYE+K DY AVV + Sbjct: 121 NNEVDDLRANMGYDPRVAVHHRDGFEGLVALLPPTPRRGLVLIDPPYELKEDYFAVVDTL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQ--IKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 + KR+ATGIYALWYP++ + + M+ ++ +L EL V + GM S Sbjct: 181 KKAQKRWATGIYALWYPILGEEADKSRDMLRAIKRENFGNVLVAELEVAGQTKDWGMNGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIV 278 GM++I+PPW L++Q+ L L +KL V W+ Sbjct: 241 GMLIISPPWMLDEQIEAFLKPLCAKLAQGAGAQYKVEWLN 280 >UniRef50_B6QZ93 Florfenicol resistance protein n=3 Tax=Rhodobacteraceae RepID=B6QZ93_9RHOB Length = 284 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. Identities = 115/284 (40%), Positives = 178/284 (62%), Gaps = 5/284 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN DVLKH V + I++ L++KD + LDTHAG G Y L SE A++TGE+ Sbjct: 1 MNYRHIYHAGNIGDVLKHVVLANILKYLQKKDGAYRVLDTHAGIGLYDLTSEKAQKTGEW 60 Query: 62 LEGIARIWQQDDLP-----AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 +G+ ++ + D L ++ V++ N G +++YPGSP IA +L R+QD L L Sbjct: 61 QQGVGKVLENIDAASDQVKEVLAPWLETVENLNPGGGVQFYPGSPEIACMLARKQDRLTL 120 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 TELHP D+ L++ + D + +V D + L + LPP RRGL+LIDP +E++ +++ + Sbjct: 121 TELHPEDFEELKNNYGGDKKVKVIALDAWLALGSFLPPKERRGLVLIDPAFEVEDEFKRL 180 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT 236 G+ G+KR+ TG +A+WYPV ++ + ++I LE GIR +++EL+ SD R M Sbjct: 181 AEGVIRGWKRWQTGTFAIWYPVKNQRIVNQLIVTLEEAGIRNAVKLELSAGQISDDRPMK 240 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +SGM+V+NPPW L + MN LPWL L +V ++ E Sbjct: 241 SSGMLVVNPPWTLTRDMNTALPWLCQTLSQGKGAEWSVKQVIAE 284 >UniRef50_Q5NZ63 Predicted protein involved in catabolism of external DNA n=13 Tax=Betaproteobacteria RepID=Q5NZ63_AZOSE Length = 281 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. Identities = 134/277 (48%), Positives = 176/277 (63%), Gaps = 2/277 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH V ++ K+KP+ Y+DTHAGAG Y L SE A + E Sbjct: 1 MLSYRHAFHAGNHADVLKHFVLIELLRYFNRKEKPWWYVDTHAGAGCYALDSEQAGKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI R+WQ+DDLP + Y++ + FN G+L +YPGSP +A LREQD ++L ELH Sbjct: 61 FASGIGRLWQRDDLPDAMRPYLDALAQFNPHGRLTFYPGSPALAMTQLREQDRMRLFELH 120 Query: 121 PSDYPLLRSEFQKD-SRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 P+D LL F +D R +V KADGF L+ LPP SRR ++LIDPPYE+K DY+ VV Sbjct: 121 PADVALLGQTFARDVQRVQVRKADGFSALRGLLPPPSRRVVVLIDPPYEVKEDYRRVVDT 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMTAS 238 +A+ KRF G YA+WYP++ R + +++ L G L + LAV P D GM S Sbjct: 181 LADAIKRFPAGTYAVWYPLLARTEARQLPARLAGLGAENWLDVRLAVKKPPRDGFGMFGS 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 G+ V+NPPW L Q + V+PWL L G G + Sbjct: 241 GLYVVNPPWVLPQTLEAVMPWLADVLGEDGEGGFDLE 277 >UniRef50_Q47Z69 Putative uncharacterized protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47Z69_COLP3 Length = 295 Score = 401 bits (1030), Expect = e-110, Method: Composition-based stats. Identities = 127/296 (42%), Positives = 192/296 (64%), Gaps = 17/296 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGN ADVLKH+V SL+++ + K+K F Y+D+H+GAG YQL E+A++TGE Sbjct: 1 MLSYRHAFHAGNFADVLKHSVLSLVLDYMTRKEKGFCYIDSHSGAGMYQLADEYAQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFN----------------RSGQLRYYPGSPLIA 104 Y +GIA+I +D P LE Y++++K N S L YPGSP IA Sbjct: 61 YKDGIAKIINDEDAPESLEPYLSLIKSLNLASDRNTDPSADISTDTSNDLDVYPGSPGIA 120 Query: 105 RLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILID 164 + +R QDS L ELHP+D L + Q+ + V+++DG+Q + +PP SRRG++LID Sbjct: 121 KAFVRRQDSSHLFELHPTDIQHLENFCQRWRKVFVKQSDGYQGVLGLIPPPSRRGVVLID 180 Query: 165 PPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIEL 224 PPYE+K DY V I + Y++F+TG Y LWYPVV R+ +++M + + ++ +LQ+E Sbjct: 181 PPYELKEDYHKAVKTIIKAYEKFSTGTYILWYPVVKRELVEQMSYTFTKSSVKNVLQVEF 240 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + D+D GMT +G+ ++NPPW+L Q+ +LP++ +KL T T++ ++ E Sbjct: 241 CLESDTDEYGMTGTGLFIVNPPWQLTSQLEEILPYMKTKLGSDDT-SYTLNQLIAE 295 >UniRef50_A6SXL6 Uncharacterized conserved protein n=70 Tax=cellular organisms RepID=A6SXL6_JANMA Length = 305 Score = 395 bits (1016), Expect = e-109, Method: Composition-based stats. Identities = 135/291 (46%), Positives = 183/291 (62%), Gaps = 16/291 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH VQ +++ L +KD P++Y+DTH+GAG Y L +A + E Sbjct: 1 MLSYRHAFHAGNHADVLKHLVQIQLLKYLNQKDTPYMYIDTHSGAGVYALDGNYAAKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI+++W + DLPA L Y+ V+K N SG+LRYYPGSP A ++REQD L+L ELH Sbjct: 61 FETGISKLWDRKDLPAPLAEYVQVIKALNPSGKLRYYPGSPYCADAVMREQDRLRLFELH 120 Query: 121 PSDYPLLRSEFQ---------------KDSRARVEKADGFQQLKAKLPPVSRRGLILIDP 165 P+D LL F+ + R +E+ +GFQ LKA LPP SRRGL+LIDP Sbjct: 121 PADSKLLADNFRKLEAHAAEQGKRPTVRGKRIMIERGNGFQGLKALLPPPSRRGLVLIDP 180 Query: 166 PYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELA 225 PYE KTDY+ VV +++ RFATG YA+WYPV+ R + ++M L+ L + L+ Sbjct: 181 PYEDKTDYRTVVQTVSDALTRFATGTYAVWYPVLNRLESRQMPDKLKRLSANGWLNVTLS 240 Query: 226 VLPDS-DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 V S D G+ +SGM V NPPW LE + ++P+L L T+ Sbjct: 241 VTTPSPDGFGLHSSGMFVHNPPWTLEPMLRELMPYLVKTLGGDEGAGFTLE 291 >UniRef50_B2S4W3 N-6 Adenine-specific DNA methylase n=51 Tax=Rhizobiales RepID=B2S4W3_BRUA1 Length = 290 Score = 394 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 122/286 (42%), Positives = 171/286 (59%), Gaps = 7/286 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + + I+E LK K++ F +DTHAG G Y L A +TGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHVILTRIVEYLKRKEQAFRVIDTHAGIGLYDLKGTEAGKTGEW 60 Query: 62 LEGIARIWQQDD-------LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSL 114 GI RI + + L+ Y++ V N +LR+YPGSPL+ R LLR+QD L Sbjct: 61 AGGIERIMTAVEKGQVEQPVLELLKPYLDAVYAVNTGVRLRHYPGSPLLVRHLLRKQDRL 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 ELHP D L F D + RV + DG+ L A LPP +RGL+L+DPP+E ++ Sbjct: 121 SALELHPQDAAKLAKLFDGDYQVRVTELDGWLALGAHLPPKEKRGLVLVDPPFEKDGEFD 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRG 234 + G+A+ +KRF G YALWYPV R++ +R L TGI KI+QIELA+ S Sbjct: 181 RLADGLAKAHKRFGGGTYALWYPVKDRRETERFARRLRETGIPKIMQIELAIRAPSPEPR 240 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + +GMIV+NPP+ LE +M +LP L L + ++ WI E Sbjct: 241 LDGTGMIVVNPPYTLESEMQILLPCLTRLLEEEKGSNFSLRWIRGE 286 >UniRef50_Q12I42 Putative uncharacterized protein n=20 Tax=Shewanella RepID=Q12I42_SHEDO Length = 292 Score = 394 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 135/281 (48%), Positives = 188/281 (66%), Gaps = 2/281 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH +HAGN+ADVLKH + +++++ +KDK F+Y+DTHAGAG Y L E A++TGE Sbjct: 13 MLSYRHGYHAGNYADVLKHAILLQVLKAMHKKDKAFVYVDTHAGAGAYSLEDEFAQKTGE 72 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQLTEL 119 YL+G+A++W + DLP L+ Y+ VK FN L YPGSP LR QD + L EL Sbjct: 73 YLDGVAKLWDKTDLPLALKDYVAAVKTFNAEQDELSLYPGSPAFVDSELRPQDRMVLHEL 132 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 H +D+ LL F KD + +V K DG L A +PP+ RRG++LIDP +E+KTDYQ V Sbjct: 133 HGTDHELLSDYFAKDRQVKVIKGDGLAGLIAAVPPLERRGVVLIDPSFEIKTDYQDVADA 192 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I + +KRF+TG++ LWYPVV R+Q + M+ L+ +GI K L++E + DS+ GMTA+G Sbjct: 193 IIKAHKRFSTGVFMLWYPVVDREQTEAMLSKLKNSGITKQLRLEQGIKTDSNEFGMTAAG 252 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 + +INPPW+L++ + L +L L GH TV W V E Sbjct: 253 LWIINPPWQLDELAKDSLDYLAKTLG-GIDGHVTVKWEVGE 292 >UniRef50_A3YH43 Protein involved in external DNA uptake n=2 Tax=Marinomonas RepID=A3YH43_9GAMM Length = 280 Score = 394 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 126/280 (45%), Positives = 177/280 (63%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH +HAGNHAD+LKH V S I L K+ PF YLDTHAG G+Y L S+ A+ E Sbjct: 1 MLSYRHIYHAGNHADILKHLVVSQICHHLTAKEAPFFYLDTHAGIGQYALDSQQAQMNKE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GI+++ + P ++ ++ +VK N + L+ YPGSP + R++D + L ELH Sbjct: 61 FKTGISQLLELKSAPDSIKRFLKIVKEMNPTSNLKVYPGSPKVVEAYTRQKDKMHLCELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P D+P L + F RA VEK +GF +KA LPP +RGL+L+DPPYE+K DY+ VV + Sbjct: 121 PKDHPTLAALFPNKRRANVEKGNGFAAVKAMLPPPQKRGLVLMDPPYEVKEDYKTVVKAL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 EG++RF+ GIYA+WYPV+ R+Q +I+ ++ T IR +L +EL + +GM SGM Sbjct: 181 VEGHQRFSHGIYAIWYPVLSRKQADNLINSVQRTKIRNVLLLELNIRDIDADKGMNGSGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 I++NPPWK+E + LP L L + WI PE Sbjct: 241 IIVNPPWKMESEAQEFLPILKELLQEDNRSSFQLRWITPE 280 >UniRef50_Q0VM19 Putative uncharacterized protein n=2 Tax=Alcanivorax RepID=Q0VM19_ALCBS Length = 282 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 140/280 (50%), Positives = 182/280 (65%), Gaps = 1/280 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHS+HAGN ADVLKH VQ IIE LK+KDKPF DTHAGAG Y + SEH ++TGE Sbjct: 1 MLSYRHSYHAGNFADVLKHIVQVAIIEYLKKKDKPFTVHDTHAGAGSYAIASEHMQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GIA+++ + ++ Y+++V+ N G+L YPGSP I+ LLREQD LQ TELH Sbjct: 61 YQDGIAKLFGKRTGVGVIDQYVSLVEKLNPVGRLMDYPGSPQISASLLREQDVLQCTELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 +D+ LL+ EF D R +V K D + LKA LPP RRGL+LIDP YEM+ DY V+ + Sbjct: 121 STDFTLLKREFADDKRVQVLKDDAWHGLKALLPPRHRRGLVLIDPSYEMEADYNGVLPAV 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 +RFAT YA+WYPV+ R + + I GI +L++E V PD+ RGMT +GM Sbjct: 181 QMAMERFATATYAIWYPVLDRNRTESFIRRFVKAGIPNLLRVECCVRPDASGRGMTGTGM 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++INPP+ L Q M +P L L GH TV + E Sbjct: 241 LIINPPYTLAQHMAQAMPLLKEALC-DANGHTTVKMLTGE 279 >UniRef50_A4VR91 Protein involved in catabolism of external DNA n=21 Tax=Pseudomonadaceae RepID=A4VR91_PSEU5 Length = 279 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. Identities = 124/279 (44%), Positives = 169/279 (60%), Gaps = 1/279 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGNHADVLKH V S I L K+ PF YLD+HAG G Y L + A RTGE+ Sbjct: 1 MNYRHAFHAGNHADVLKHLVLSRIFALLSRKEAPFAYLDSHAGVGLYDLAGDQASRTGEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L+GIARIWQ + PA L+ Y+ V++ N G LRYYPGSP +AR L REQD LQL E HP Sbjct: 61 LQGIARIWQAETRPALLDDYLGVIRSLNPDGALRYYPGSPELARQLTREQDRLQLNEKHP 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LL+ D R V + +G+ +A +P +R ++LIDPP+E + V+ + Sbjct: 121 EDGALLKDNMSGDRRVAVHRGEGWHVPRALMPTREKRVVLLIDPPFEQADELSRCVTALK 180 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 E R I +WYP+ +Q+KR DL +G K+L+ EL V P D + SG+ Sbjct: 181 EALGRMRQTIGVIWYPIKDERQLKRFYQDLARSGAPKLLRAELFVHPADDASRLAGSGLA 240 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++NPPW LE ++ +LPWL +L G + W++ E Sbjct: 241 IVNPPWGLEDELRELLPWLAEQLAQ-SQGGWRLDWLIEE 278 >UniRef50_C3X722 External-DNA catabolic protein n=2 Tax=Oxalobacter formigenes RepID=C3X722_OXAFO Length = 298 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. Identities = 119/295 (40%), Positives = 167/295 (56%), Gaps = 16/295 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKH V ++ K+ Y+DTH+GAG Y L A + E Sbjct: 1 MFSYRHAFHAGNHADVLKHVVLMQVLLYAIRKEASLFYIDTHSGAGVYSLEGNEARKNAE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + GIAR+W + +P + Y+ +V N G+LR+YPGSP IA +LR D L+L E H Sbjct: 61 FQSGIARLWGKKTVPPAVRDYLKLVYDMNPDGKLRFYPGSPYIAERILRSHDRLRLFEWH 120 Query: 121 PSDYPLLRSEFQ---------------KDSRARVEKADGFQQLKAKLPPVSRRGLILIDP 165 P++ +L F+ + R VE+ DGF LKA LPP SRR +ILIDP Sbjct: 121 PAECRVLDENFRGLLKSGESNTRSRPERGKRVLVERKDGFSSLKALLPPPSRRAVILIDP 180 Query: 166 PYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELA 225 PYE K+DY+ VV +++ KRF+TG +WYP++ R + +R L+ T ++ L + L+ Sbjct: 181 PYEDKSDYRKVVDVVSDALKRFSTGTCLIWYPLLQRPESRRFASRLKQTVSQEWLDVTLS 240 Query: 226 V-LPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 P D G +SGM V+NPPWKL + + +P L S L T+ + Sbjct: 241 TGSPVPDGFGFVSSGMFVVNPPWKLAESLQETMPCLVSALKQDSGAGFTLETGIG 295 >UniRef50_B1XZU6 Putative uncharacterized protein n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1XZU6_LEPCP Length = 280 Score = 380 bits (978), Expect = e-104, Method: Composition-based stats. Identities = 117/276 (42%), Positives = 175/276 (63%), Gaps = 1/276 (0%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 ML+YRH+FHAGNHADVLKH V +++ + KDKPF +DTHAG G Y L S +++ GE Sbjct: 1 MLAYRHAFHAGNHADVLKHLVLVQVLQYMASKDKPFRLIDTHAGGGGYALHSSQSQKKGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 YL+GI+RIW D P + Y+ +V+ FN GQL YPGSP ++++LLR D L+L ELH Sbjct: 61 YLQGISRIWGAGDAPPAVADYLRLVRRFNPDGQLNLYPGSPALSQMLLRRGDQLRLFELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+++ +L + + ++ + DGF L+ ++PP RRG++L+DP YE+ +DY V+ + Sbjct: 121 PTEFKILTENTRPGRQVQLAQVDGFAALRGQVPPSMRRGVVLMDPSYELVSDYAKVIDSL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAV-LPDSDRRGMTASG 239 + +RFA G+Y +WYP V R + ++ L+AT + L + L V PD+ G+T SG Sbjct: 181 RDALQRFAEGVYVVWYPQVSRVESIQIARRLQATAPKGWLHVRLNVQQPDAQGFGLTGSG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 + VINPP+ L Q+ +PWL KL + + Sbjct: 241 VFVINPPYTLHAQLAACMPWLTQKLGQFEGANHLLE 276 >UniRef50_Q2W9T7 Protein involved in catabolism of external DNA n=7 Tax=Alphaproteobacteria RepID=Q2W9T7_MAGSA Length = 283 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 120/279 (43%), Positives = 177/279 (63%), Gaps = 1/279 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + ++++ SLK KD PF LDTHAG G Y L + A++TGEY Sbjct: 1 MNYRHAYHAGNFADVMKHAILAMVVASLKRKDTPFFALDTHAGIGAYDLEAPQADKTGEY 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L GIAR+ D PAELE Y+ +V+ +N G LR YPGSP + R L+R QD + L ELHP Sbjct: 61 LSGIARVLDAADPPAELETYLALVRTWNSEGVLRRYPGSPELMRGLMRPQDRMALVELHP 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LR+ F D R V DG+ K LPP RRGL+L+DPP+E+K +++ +++ + Sbjct: 121 EDVETLRARFHGDRRVGVHHLDGYTAAKGLLPPPERRGLVLMDPPFEVKNEFERLLAALR 180 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 K + TGIY WYP+ R+ + + + + G + L EL + P D + +G++ Sbjct: 181 RARKLWPTGIYLAWYPIKGREPVDQFLQAIADDGGPEALAAELLLRPAKDPFKLNGNGLL 240 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 VINPPW+L + ++ VLPWL + + P +G A + ++ E Sbjct: 241 VINPPWQLRESLDRVLPWLAAVMAPD-SGSAAIRQLIGE 278 >UniRef50_C6M2C4 YhiR family protein n=2 Tax=Neisseriaceae RepID=C6M2C4_NEISI Length = 281 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 114/279 (40%), Positives = 166/279 (59%), Gaps = 6/279 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHAD++KH + L ++ +KDKP+ Y+DTH+GAG Y L A++ GE Sbjct: 1 MLSYRHAFHAGNHADMIKHFILFLTLDYFNQKDKPYWYIDTHSGAGLYDLSGSEAQKVGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI + + + LP EL A+I + QL Y GSP +A+ L R+ D L+L ELH Sbjct: 61 YKQGIRLLQEAEHLPPELSAFIARLNAILPQEQL--YCGSPWLAQALTRDSDKLRLFELH 118 Query: 121 PSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P+D+ L++ ++ R ++ +ADGF+ L + LPP RR ++LIDPPYE K DYQ VV Sbjct: 119 PADFQHLKNNMEEARLGRRGQIMQADGFRGLISLLPPPLRRAVVLIDPPYEEKQDYQRVV 178 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVL-PDSDRRGMT 236 + + KRF G Y +WYP + R++ +++ L+ L EL V P D GM Sbjct: 179 QTLKDALKRFEQGCYMVWYPCLSREESRKLPEQLQKLMPDSYLHAELHVHTPRPDGFGMH 238 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 SGM +INPP+ L + + + LP L L ++ Sbjct: 239 GSGMFIINPPYLLPELLKSNLPALTDILAQDNGARFVLN 277 >UniRef50_Q15U81 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15U81_PSEA6 Length = 293 Score = 372 bits (955), Expect = e-102, Method: Composition-based stats. Identities = 120/296 (40%), Positives = 170/296 (57%), Gaps = 20/296 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH +HAGNHADVLKH Q LII+ LK+KDK F Y+DTH+GAG Y L SE + +T E Sbjct: 1 MFSYRHGYHAGNHADVLKHICQMLIIDKLKQKDKGFTYIDTHSGAGLYDLSSEQSLKTNE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + +GI+R+ + AY + + + Q YPGSP IAR+L+R+QD L L E + Sbjct: 61 FQQGISRLADYSGAEPTVLAYQALTSSYLKHQQ---YPGSPEIARVLMRDQDQLHLMEWN 117 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 + L+ + K + V DG++ L A PP +RGL+L DP YE DYQ VV I Sbjct: 118 NQEVINLKRQI-KGTHISVHHRDGYEGLIALTPPKLKRGLVLTDPSYETSEDYQLVVDAI 176 Query: 181 AEGYKRFATGIYALWYPVVLRQ----------------QIKRMIHDLEATGIRKILQIEL 224 ++ YKR+ T IYA+WYP++ ++ + ++M+ DL G + +LQ+EL Sbjct: 177 SKAYKRWPTAIYAIWYPLLSKRDEDQNDGFERATTKHKKSQKMLDDLTQHGFKNVLQVEL 236 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 AV GM SGM +IN PW+L+ Q+ + L L + V+W+V E Sbjct: 237 AVQNPDTFAGMYGSGMAIINAPWQLDAQIRDCLGELTPVMAQHKHASFVVNWLVEE 292 >UniRef50_Q2SPJ4 Protein involved in catabolism of external DNA n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPJ4_HAHCH Length = 280 Score = 370 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 108/282 (38%), Positives = 169/282 (59%), Gaps = 4/282 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN AD KH SL++++L +K P+ YL+THAG G Y L SE A++T E Sbjct: 1 MLSYQHVYHAGNFADAHKHWALSLLLQALCKKSTPWRYLETHAGRGDYDLTSEEAQKTSE 60 Query: 61 YLEGIARIWQQDDL-PAELEAYINVVKHFNRS-GQLRYYPGSPLIARLLLREQDSLQLTE 118 + GI + Q P E +AY+ V+ N + +L YPGSP IA LRE D L L E Sbjct: 61 WTAGILPLMQAKGPCPPEFDAYLAAVRALNPNTERLTRYPGSPAIAAGFLRETDQLALCE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP +Y L+ +F ++ + + + DGF+ + A PP +RGL++IDP YE+K DYQ + + Sbjct: 121 LHPGEYAELKRQFGRNRQIHIHQRDGFEGVMAMSPPPEKRGLVMIDPSYELKEDYQRIPA 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 + + K+++ I A+WYP++ ++ ++M+ + + K L+ EL + P RGM S Sbjct: 181 YVNKLTKKWSNAIIAIWYPILAEKRHEKMLELMRQLPLNKTLRSELILTPV--ARGMYGS 238 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 GM+V+N PW+L++Q+ +L L + W++ E Sbjct: 239 GMLVVNSPWRLDEQLQAGWAYLSEALRGDPKASCSADWLIAE 280 >UniRef50_Q89DH2 Blr7467 protein n=16 Tax=Rhizobiales RepID=Q89DH2_BRAJA Length = 286 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 111/282 (39%), Positives = 164/282 (58%), Gaps = 8/282 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN ADV+KH V + I+ L++K F +DTHAGAG Y L S+ A R GE+ Sbjct: 1 MNYRHAFHAGNFADVIKHIVLARILTYLQDKPGAFRVIDTHAGAGLYDLESDEARRGGEW 60 Query: 62 LEGIARIWQQD---DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 L GIAR+ Q + A + Y+++V+ FN G+L+ YPGSPLIAR LLR QD L E Sbjct: 61 LTGIARLMQARLSNETAALTKPYLDIVRAFNPKGELKAYPGSPLIARGLLRPQDRLVACE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 L P L ++D +ARV DG+ L A +PP RRGL+LIDPP+E K +++ + Sbjct: 121 LEPKARKALIDVLRRDEQARVVDLDGWVALPAFVPPKERRGLVLIDPPFEAKNEFERLGE 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIH-----DLEATGIRKILQIELAVLPDSDRR 233 +E + ++ TGIY +WYP R+ + A K L++E + P D Sbjct: 181 AFSEAFAKWPTGIYVIWYPAKSRRATDALAQLVARLAAAAKPPGKCLRLEFSAAPQLDGA 240 Query: 234 GMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 +T++G++++NPP+ L ++ +LP L L G + Sbjct: 241 ALTSTGLLIVNPPYTLHGELKTILPELEMPLGQGGAARFRLE 282 >UniRef50_B8GN89 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GN89_THISH Length = 281 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 120/280 (42%), Positives = 169/280 (60%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH FHAGN ADV KH V + I+++L K KPF LDTHAG Y L S+ AE+TGE Sbjct: 1 MLSYRHGFHAGNFADVHKHAVLAWIVQALTAKAKPFCVLDTHAGDAGYDLASQWAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 + EG+ R+ P + ++ +++ F S R YPGSP IAR LLR D L L ELH Sbjct: 61 WREGVGRLMGCPGAPEAIAPFLQLLEAFRASHGERAYPGSPAIARGLLRPGDRLVLGELH 120 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+ + LR F +D + V + DG++ L A LPP RRGL+L+DPPYE +YQA + Sbjct: 121 PAAWESLRGFFARDDQVAVHRRDGWELLGALLPPAERRGLVLVDPPYERDEEYQAAARAL 180 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 +R+ +G+Y LWYP++ + + M+ +LEA +L EL P G+ SG+ Sbjct: 181 TAAARRWPSGVYLLWYPLLAAGRHQAMLRELEAARPGPMLVAELWTAPLDTPAGLNGSGL 240 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 ++NPPW+L + + N+ PWL L P G G + + W +P+ Sbjct: 241 CILNPPWRLHEALANLQPWLVDCLAPGGAGGSRLHWAIPD 280 >UniRef50_C6XPG2 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=C6XPG2_HIRBI Length = 334 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. Identities = 115/288 (39%), Positives = 167/288 (57%), Gaps = 14/288 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FH GN AD+LKH V L IESLK+KDKPF Y+DTHAG GRY L + A R+ E+ Sbjct: 1 MNYRHAFHVGNFADILKHLVLVLCIESLKKKDKPFRYIDTHAGIGRYDLTGDEARRSPEW 60 Query: 62 LEGIARIWQQ-------DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSL 114 EGI RIW +D+ A L+ Y++ V N G L YPGSP +A L+REQDSL Sbjct: 61 QEGIGRIWAAHKAGDIPEDVAAILKPYLDAVSEINYDGDLESYPGSPDLAATLMREQDSL 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 +LTELHP+D L F +D R ++E +G++ LKA LPP RRG++L+DPP+E + + Sbjct: 121 RLTELHPADKETLTDHFFRDKRVKIENRNGYEALKAYLPPPERRGVVLVDPPFEHRDELA 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIR-------KILQIELAVL 227 + G G R+ TG Y W P+ + ++ L + KIL +L V Sbjct: 181 HMAKGAMGGISRWPTGTYIFWRPLKDMENTQKFDDGLAEWLLDDMEFSHEKILLADLWVK 240 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 + + +G++V+NPP+ +++ + VLPW+ L ++ Sbjct: 241 EIVEPGPLCGAGVVVVNPPYGMQEALLTVLPWVTELLQQDEGAGWRIN 288 >UniRef50_C6QCA2 Putative uncharacterized protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QCA2_9RHIZ Length = 281 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 102/280 (36%), Positives = 160/280 (57%), Gaps = 3/280 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN ADVLKH V + ++ +K+K +PF +DTHAGAGRY L A +TGE+ Sbjct: 1 MNYRHGYHAGNFADVLKHVVLARVLTYMKQKPRPFRVIDTHAGAGRYDLAGVEAGKTGEW 60 Query: 62 LEGIARIWQQDDLPA---ELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +GI R++ + P L+ Y++ V+ N SG L YPGS LIAR ++R +D L E Sbjct: 61 QDGIGRVFNAEFAPPVAELLQPYLDAVRADNASGDLEVYPGSSLIARRIMRPEDVLVANE 120 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 L+ S++ L+ E + D + +K+ LPP RR ++LIDPP+E K+++ + Sbjct: 121 LNASEFERLKRELGRPRNTTFLNIDAWHAVKSLLPPKERRAVVLIDPPFEAKSEFADLAV 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+ E RF G+Y +WYP+ + R + + + + L + LAV G+TA+ Sbjct: 181 GVREAMSRFQDGVYVIWYPLKDVEAADRFVAEATSRPGLEFLDVRLAVCAPFPGLGLTAT 240 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIV 278 G++VINPP+ L ++ VLP L + + +V Sbjct: 241 GVLVINPPYLLRGELETVLPALRDCMAEGEGCGFVLKGVV 280 >UniRef50_Q1N5H6 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1N5H6_9GAMM Length = 284 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 118/285 (41%), Positives = 166/285 (58%), Gaps = 11/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH+V I + +K + Y+DTH+GAG Y+L + A +T E Sbjct: 1 MLSYRHAFHAGNHADVLKHSVLVAIAKYFHKKQSAYTYIDTHSGAGVYKLSDDLANKTQE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQ----LRYYPGSPLIARLLLREQDSLQL 116 Y GIAR++ DL A + Y+ V+ N + L++YPGSP LLREQD L Sbjct: 61 YKTGIARLYPNSDL-ALISPYLEQVRVLNAAQGEEKNLQFYPGSPWFMTELLREQDQAHL 119 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP D+ LL + +V DGF +KA LPP ++R I+IDPPYE +Y+ V Sbjct: 120 FELHPQDHALLEQNMNTGKQLKVHMEDGFSGIKAVLPPQTKRAFIVIDPPYEQANEYKKV 179 Query: 177 VSGIAEGYKRFATGIYALWYPVVLR----QQIKRMIHDLEATGIRKILQIELAVLPDSDR 232 V+ I +G KRFA G++A+WYP++ R + M+ +L T I K L + L + Sbjct: 180 VNAIEQGIKRFAVGVFAVWYPLLNRNDKQGMSETMVDELAKTDITKYLDVRLWTSKQTQ- 238 Query: 233 RGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM SG+ ++NPP+ L+ +N LP L L T +V ++ Sbjct: 239 -GMYGSGLFIVNPPYILQDLLNQELPKLLEVLGLDETAGFSVDYV 282 >UniRef50_Q5QVX6 Transformation competence-related protein ComJ n=2 Tax=Idiomarina RepID=Q5QVX6_IDILO Length = 283 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 103/283 (36%), Positives = 160/283 (56%), Gaps = 4/283 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV KH + + +E ++K+KP+ LDTH G G Y L + A RT E Sbjct: 1 MNYRHIFHAGNFADVFKHLLLARALEYFQQKNKPYFVLDTHGGIGYYDLQGDQAIRTAEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQLTELH 120 +GI R + AY++ V+ N LRYYPGSP+I LRE D L + ELH Sbjct: 61 EQGIVRFAEHSAEEPLAAAYLSTVRQLNEEQDKLRYYPGSPVITSEFLRENDRLVVCELH 120 Query: 121 PSDYPLLRSE-FQKDSRARVEK-ADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 D L++ + + ++ DG+Q ++A+LPP +RGL+LIDPP+E T++ VVS Sbjct: 121 KEDAETLKNTPLGRHKQVQILAPMDGYQAVRAQLPPAEKRGLVLIDPPFENTTEFDDVVS 180 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEA-TGIRKILQIELAVLPDSDRRGMTA 237 + +G KR+ +G +A+WYP+ + D+ A + + K L +EL + + +R+G+ Sbjct: 181 ALEQGLKRWKSGSFAVWYPIKDELKTAAFHRDVGALSDLPKTLIMELNIRTNDERKGLHG 240 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G + +NPP+ + Q ++LP L L + W+V E Sbjct: 241 CGFLWVNPPYGVVQDSEHLLPVLCKTLAQDKGANFHSRWLVGE 283 >UniRef50_Q0ARP7 Putative uncharacterized protein n=2 Tax=Hyphomonadaceae RepID=Q0ARP7_MARMM Length = 311 Score = 365 bits (937), Expect = 1e-99, Method: Composition-based stats. Identities = 102/285 (35%), Positives = 163/285 (57%), Gaps = 13/285 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN ADVLKH+V +L +E L K KP+ +DTHAG G Y L S AER+ E+ Sbjct: 16 MNYRHAFHAGNFADVLKHSVLALCLEHLNAKPKPYRVIDTHAGIGGYDLASSEAERSPEW 75 Query: 62 LEGIARIWQQDDLPAELE----AYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 +GI R+ DLP ++ ++++V+ N G ++ YPGSP IA L+RE+D + L Sbjct: 76 KDGIGRLIDA-DLPEPVQAMLGPWLDIVREMNPDG-IKAYPGSPEIAARLIREEDRVHLC 133 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELH +D L + +++D+R +VE+ DG++ LK+ +PP +RGL+LIDPP+E + + + Sbjct: 134 ELHEADSVTLDNRYRRDARIKVERRDGYKALKSLVPPKEKRGLVLIDPPFEDRDELAHMA 193 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGI-------RKILQIELAVLPDS 230 + ++ TG + W + R + L I KIL+ +L + + Sbjct: 194 EAVMGALAKWPTGTFIFWRSLKNLWAADRFDNGLAEWLISEKDFEPEKILRADLWIRDLA 253 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 + +G+++INPP+ LE+ + N +PWL L V Sbjct: 254 SEGKLAGAGVVIINPPFTLEETLVNAMPWLAETLAQGNGYGWRVD 298 >UniRef50_B2HZ48 Protein involved in catabolism of external DNA n=17 Tax=Acinetobacter RepID=B2HZ48_ACIBC Length = 285 Score = 364 bits (935), Expect = 2e-99, Method: Composition-based stats. Identities = 110/286 (38%), Positives = 162/286 (56%), Gaps = 8/286 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV+KH + ++ L KDKP+ Y+DTH GAG+Y L A+++GE+ Sbjct: 1 MNYRHHFHAGNFADVMKHVLLLQLLNRLNAKDKPYRYIDTHGGAGKYDLSQAPAQKSGEF 60 Query: 62 LEGIARIWQQDD-----LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 L GI R+ Q D P ++ Y+ +V+ YPGSP A +RE D + Sbjct: 61 LTGIHRLVQLSDMEKRQAPEAIQQYLKLVEELRAQEGKGSYPGSPWFALQGMREIDKATI 120 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYE-MKTDYQA 175 E+ + LR D RA + + D ++ L A +PP +RGL++IDPPYE + D+ Sbjct: 121 FEMQRDVFQQLRHNIH-DKRAGLHERDAYEGLLAVIPPKEKRGLVMIDPPYELERKDFPQ 179 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 +V + YK++ TG++A+WYP+ R I+R + TGIR+ L E+ V PD G+ Sbjct: 180 LVELLQSAYKKWPTGVFAVWYPIKDRAMIERFEKKMFKTGIRRQLICEICVWPDDTPVGL 239 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLV-PAGTGHATVSWIVPE 280 G++VINPPW+ +Q + L WL L GHA V W+V E Sbjct: 240 NGCGLLVINPPWQFSEQADQALQWLFPHLRMQETGGHAAVRWLVGE 285 >UniRef50_D0IYP9 ComJ n=10 Tax=Bacteria RepID=D0IYP9_COMTE Length = 288 Score = 362 bits (930), Expect = 7e-99, Method: Composition-based stats. Identities = 114/285 (40%), Positives = 163/285 (57%), Gaps = 10/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKHTV ++ L +K+ LDTH GAG Y+L ++A ++GE Sbjct: 1 MFSYRHAFHAGNHADVLKHTVLIATVQYLTQKEAALTVLDTHGGAGLYRLDGDYASKSGE 60 Query: 61 YLEGIAR--IWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 EG+ R ++ +L L+ Y+ +V+ FN+ +R YPGSP I + LLR D L+ E Sbjct: 61 AEEGVLRLAAAKEAELAPVLQDYLQMVRRFNQGNAIRNYPGSPFITQALLRGHDRLKAFE 120 Query: 119 LHPSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 LHP+D L + + + DGF+ +K LPP SRR L+L DP YE+KTDY Sbjct: 121 LHPTDMRSLTGNMAQLEVRRQVAILHEDGFEGVKKFLPPPSRRALLLCDPSYELKTDYGR 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRM---IHDLEATGIRKILQIELAVLPD--S 230 V+ A+G KRF TG YA+WYP++ R + + + + + L L V + S Sbjct: 181 VLDMAADGLKRFPTGTYAVWYPIIPRPEAHDLPKRLKTMATKAGKSWLHATLTVKSNKTS 240 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 +R G+ ASGM +INPP+ L+ Q+ +P L L T+ Sbjct: 241 ERGGLPASGMFLINPPFNLKDQLKPAMPQLVKLLGQDSNAGFTLE 285 >UniRef50_Q5ZVZ2 Protein involved in catabolism of external DNA n=7 Tax=Legionella RepID=Q5ZVZ2_LEGPH Length = 287 Score = 359 bits (923), Expect = 5e-98, Method: Composition-based stats. Identities = 107/272 (39%), Positives = 157/272 (57%), Gaps = 3/272 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN ADV+KH + ++ L KDKP YL+TH+G G Y L + + +T E Sbjct: 6 MLSYQHGYHAGNFADVIKHITLTRLLAYLTHKDKPLFYLETHSGRGIYDLKDKQSLKTEE 65 Query: 61 YLEGIARIW-QQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 Y EGI +W +++LP+ YI+V+K N + L YYPGSP A LR QD L L EL Sbjct: 66 YKEGINPVWLDRENLPSLFLEYISVIKQINLNSTLSYYPGSPYFAINQLRSQDRLYLCEL 125 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP++Y L + + V DG +L A LPP +RGLI IDP YE K +Y+ + Sbjct: 126 HPTEYNFLLKLPHFNKKVYVNHTDGVSKLNALLPPPEKRGLIFIDPSYERKEEYKEIPYA 185 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 I Y +F+TG+Y +WYPVV + ++ + + + + +IEL + P + GMT G Sbjct: 186 IKNAYSKFSTGLYCVWYPVVNKAWTEQFLRKMREISSKSV-RIELHLNPLINE-GMTGCG 243 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGH 271 + +INPP+ ++ VL L + P + + Sbjct: 244 LWIINPPYTFPSEIKLVLETLTTYFNPGSSSY 275 >UniRef50_C5BQN4 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BQN4_TERTT Length = 279 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 106/279 (37%), Positives = 162/279 (58%), Gaps = 6/279 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH+FHAGNHADVLKH S+++E L EKDKP YL+THA AG Y L + ++ E Sbjct: 1 MLSYRHAFHAGNHADVLKHLCLSMVLEKLIEKDKPLTYLETHAAAGAYDLNTAMPQKNRE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y+ GI+ + + + Y +V + + YPGSP +A +LREQD L L ELH Sbjct: 61 YMSGISPLLASEVSSEAMSRYKALVARYFADYK---YPGSPAVAASVLREQDKLVLMELH 117 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 +++ +LR+ ++D R + DG + + A PP RRG++LIDPPYE +Y+ + + I Sbjct: 118 NTEFEILRNNMRRDKRVTLHHRDGIEGVLALSPPTPRRGIVLIDPPYEQPLEYERIATLI 177 Query: 181 AEGYKRFATGIYALWYPVVL--RQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 A+ ++++ G+ ALWYP++ R + M+ + + + EL V + GM S Sbjct: 178 AQLHRKWPVGVIALWYPLLAQERNRAPAMLDVIARSQPASLFTAELWVEAQASDYGMYGS 237 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM IN PW +++++ VLP + L P G + W+ Sbjct: 238 GMAFINLPWTVDEKIALVLPEIQQILAPD-QGGFSHRWV 275 >UniRef50_A4SYR3 Putative uncharacterized protein n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SYR3_POLSQ Length = 289 Score = 356 bits (915), Expect = 5e-97, Method: Composition-based stats. Identities = 114/285 (40%), Positives = 156/285 (54%), Gaps = 10/285 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAG+HAD+LKH ++E L+EK +DTHAGAG Y L A + E Sbjct: 1 MFSYRHAFHAGSHADILKHLTLIHLVEYLQEKPGALTIVDTHAGAGIYSLVDGFATVSKE 60 Query: 61 YLEGIARIWQ----QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 GI R+ Q + P + Y+ +++ N +L YPGSP I LLR QD L+L Sbjct: 61 AEGGIFRLSQFFGKNSETPESIRKYLEMIQAENTGEELNTYPGSPFIIARLLRPQDRLKL 120 Query: 117 TELHPSDYPLLRSE---FQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDY 173 ELHP + +LR ++ + V AD F +LK LPP SRRGL+LIDP YE K DY Sbjct: 121 FELHPKEIDILRHNIGELKEAKQIDVYAADSFSRLKGLLPPPSRRGLVLIDPSYEDKQDY 180 Query: 174 QAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGI---RKILQIELAVLPDS 230 + + + + E +RFATG YA+WYP++ R++ + L+ R L EL V Sbjct: 181 RYLENAMEEALQRFATGCYAIWYPILSRRESASLPDHLKKIAATHKRSWLHTELRVENAP 240 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 R + ASGM +INPPW LE+ ++ LP L L + Sbjct: 241 GERRLQASGMFIINPPWTLEKHLDEALPVLVKALGVDAGAKYVLK 285 >UniRef50_C7JEW3 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JEW3_ACEP3 Length = 273 Score = 355 bits (911), Expect = 1e-96, Method: Composition-based stats. Identities = 103/280 (36%), Positives = 155/280 (55%), Gaps = 8/280 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN AD +KH + +++S K PF+ LDTHAG GRY L S AE+T E+ Sbjct: 1 MNYRHAYHAGNFADCMKHALLVTLLQSFLRKPAPFMVLDTHAGIGRYDLHSPEAEKTQEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 +GI ++W +D + L ++ VK ++G +YPGSPLI +LR QD+L E HP Sbjct: 61 RDGIGKLW-NEDAASPLADWLEQVK---KTGGPEFYPGSPLIIAQMLRAQDALICCEKHP 116 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPP-VSRRGLILIDPPYEMKTDYQAVVSGI 180 D L F V + D ++ L+A LPP ++RGLILIDPP+E ++ + + Sbjct: 117 EDKRSLYRLFTNTPNVTVHERDAYEALRALLPPQTAKRGLILIDPPFEEPGEFDRLAQAV 176 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 RFA I A+WYP+ R ++ L TGIR I EL + P + + +G+ Sbjct: 177 QTIQARFANAIIAIWYPIKHRTPVRIFHETLMGTGIRNICVAELLMRPPYNPDQLNGAGL 236 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 +VI PP+ ++ + L L L G + V+ +V E Sbjct: 237 LVIRPPFGFAEKASAQLERLQHVL---GAHESCVTQLVEE 273 >UniRef50_Q0F148 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F148_9PROT Length = 303 Score = 354 bits (909), Expect = 2e-96, Method: Composition-based stats. Identities = 105/283 (37%), Positives = 155/283 (54%), Gaps = 4/283 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+HS+HAGNHADVLKH + + + KD P LD A G Y L S A + E Sbjct: 22 MLSYQHSYHAGNHADVLKHIILGDVAAGMFNKDAPIFMLDAFASRGIYDLNSPEALKNRE 81 Query: 61 YLEGIARIWQQDDLP---AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 G+ ++W D P + + ++ N +PGS + + RE D + Sbjct: 82 SDSGVGKLWPLRDEPTNPPGVRRWFKLIASLNMDDSYTRFPGSTAMLHAMAREGDRIAAC 141 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 +LHP ++ LR FQ R + K D F+ +K LPP +RGL+ +DP YE+K +Y+A+ Sbjct: 142 DLHPQEFDTLRVSFQASRRFSLLKRDAFEAIKGMLPPKEKRGLVFLDPSYEVKEEYRAIA 201 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 +A +++FA G+Y +WYP++ ++ + +L+ +GIRKIL+IEL M Sbjct: 202 KAVAGAHRKFAGGVYVIWYPLLPAERHNELFRELKHSGIRKILRIELDCGDLFPDMQMHG 261 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+++NPPW EQ M L W+ KL G G SW+VPE Sbjct: 262 SGMLIVNPPWHAEQAMQQSLNWVCDKL-TDGKGRKQFSWLVPE 303 >UniRef50_D0KVW6 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVW6_HALNC Length = 312 Score = 352 bits (904), Expect = 8e-96, Method: Composition-based stats. Identities = 111/291 (38%), Positives = 157/291 (53%), Gaps = 13/291 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+H FHAGNHADVLKH V +IE +++K FL L+THAGAG Y L + A R+ E Sbjct: 20 MNYQHHFHAGNHADVLKHLVLLQLIELMQQKPTGFLLLETHAGAGLYDLQATEARRSDEA 79 Query: 62 LEGIARIWQQ----DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 GIAR+ Q D +P ++ Y+ ++ F L YYPGSPL+A LR QD Sbjct: 80 SGGIARLLQATQAADTVPVLIQTYLKQIEQFGSVPNLGYYPGSPLLAVCALRPQDRYIGV 139 Query: 118 ELHPSDYPLLRSEFQ---------KDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYE 168 EL P L D R +G LKA LPP+ RRGL LIDPPYE Sbjct: 140 ELVPKVARELSRNLAQRPMLEPCIPDRRVIARDGEGLAALKADLPPLERRGLFLIDPPYE 199 Query: 169 MKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLP 228 + + + + G +RF TG+YALWYP+ R + R ++ + + R +L IE ++ P Sbjct: 200 QPQERDDIAAALQAGLQRFETGVYALWYPIKQRPYLDRWLNRIAKSTPRPVLTIENSIFP 259 Query: 229 DSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 D +T SG+++INPPW+ + M VL +++ L + W+ P Sbjct: 260 DESGNRLTGSGLLIINPPWQFDTLMQPVLDFVNDALKQDTAAPRAIRWLNP 310 >UniRef50_Q87F97 Transformation competence-related protein n=20 Tax=Xanthomonadaceae RepID=Q87F97_XYLFT Length = 293 Score = 347 bits (892), Expect = 2e-94, Method: Composition-based stats. Identities = 100/290 (34%), Positives = 152/290 (52%), Gaps = 15/290 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y H+FHAGNHADVLKH V +++ L K+ PF LD+HAG GRY L + + T E Sbjct: 1 MNYSHAFHAGNHADVLKHIVLLALLDGLVRKETPFFVLDSHAGRGRYLLSAGESRNTREA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSG--------QLRYYPGSPLIARLLLREQDS 113 G+ R+ + ++ Y++VV+ N S + YPGS L+A + R QD Sbjct: 61 ESGVMRLIARPQRLEVIKRYVDVVQADNVSQTRAASTPMHISRYPGSSLLAAQVCRAQDR 120 Query: 114 LQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPV---SR--RGLILIDPPYE 168 + ELHP + L + F D R RV DG+ ++A LPP R RGL+ IDPPYE Sbjct: 121 MVFCELHPKEAAALNALFVHDPRVRVHAGDGYAAVRAFLPPKVGTQRIGRGLVFIDPPYE 180 Query: 169 MKT-DYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVL 227 + +Y V+ + E R+ I A+WYP+ R++++ +R +L EL V Sbjct: 181 AQDAEYPLVLGALRETLTRWPQAICAVWYPIKQRRRLQPFFRKAVGLPVRSVLIAELLVR 240 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 D + SGM+++N PW+ +Q + LP L ++L + W+ Sbjct: 241 LDDSPLRLNGSGMLLLNVPWQFDQLLAPALPVLKTQLGE-SGARTRLEWL 289 >UniRef50_Q1YIC4 Putative uncharacterized protein n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YIC4_MOBAS Length = 281 Score = 347 bits (890), Expect = 3e-94, Method: Composition-based stats. Identities = 102/276 (36%), Positives = 148/276 (53%), Gaps = 6/276 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH++HAGN ADV+KH + + +I LK K+KPF DTHAG G Y L S+ A RTGE+ Sbjct: 1 MNYRHAYHAGNFADVVKHALLTRLIAYLKRKEKPFRVFDTHAGRGSYSLTSDEARRTGEH 60 Query: 62 LEGIARIWQQDDLP---AELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +G+ R+ Q L Y + YPGSPLIAR LLR QD L E Sbjct: 61 ADGVGRLVQAAADVMDDPLLAEYRGALAS---DLSEDRYPGSPLIARRLLRPQDRLSAYE 117 Query: 119 LHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 LHP+D L++ F D + + DG+ L + +PP +RGL+LIDPP+E + A+ Sbjct: 118 LHPADAAALKTLFAGDVQTKAIALDGWLALGSHVPPKEKRGLVLIDPPFERTDEVDAIAE 177 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTAS 238 G+A+ +R+A GIYA+WYP+ + + L+A + + + E P + + Sbjct: 178 GLAKALQRWAGGIYAVWYPLKRPALVAALHERLDALPVSERVTAEFFREPYTADERFVGT 237 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 G+ VINPP+ + VL L L +V Sbjct: 238 GLTVINPPFVFAAEAEAVLTTLAPLLGSGEAATFSV 273 >UniRef50_A1WIT5 Putative uncharacterized protein n=4 Tax=Burkholderiales RepID=A1WIT5_VEREI Length = 293 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 113/290 (38%), Positives = 158/290 (54%), Gaps = 15/290 (5%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADV KHTV ++ L +KD LD+HAGAG Y+L ++A +GE Sbjct: 1 MFSYRHAFHAGNHADVFKHTVLIATLQYLTDKDAALTVLDSHAGAGLYRLDGDYARTSGE 60 Query: 61 YLEGIARIWQQ--DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTE 118 +G+ R++ L L+AY+++V FN+ +LR YPGSP I + LLRE D L+L E Sbjct: 61 AADGVVRLFAAPGSALAPALQAYVDMVGAFNQGRRLRVYPGSPCITQRLLRESDKLKLFE 120 Query: 119 LHPSDYPLLR---SEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 HP+D L ++ Q + V DGFQ ++ LPP RR L+L DP YE+K+DY Sbjct: 121 WHPTDLRALAGHVAQLQAGRQVAVFHEDGFQGIRKFLPPPQRRALLLCDPSYEIKSDYGK 180 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEAT---GIRKILQIELAVLPDSD- 231 V+ + KRFATG Y WYP++ R + + L+ + L L V Sbjct: 181 VLDLATDSLKRFATGCYMFWYPIIGRPEAHELPRRLKTLASKAGKSWLHATLTVKSGQRT 240 Query: 232 ------RRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 R G+ ASGM +INPP+ L+ + LP + L T+ Sbjct: 241 AAGSLKRPGLPASGMFLINPPFTLKAALTPALPQMVQLLAQDRHATHTLE 290 >UniRef50_Q0G6E9 Putative uncharacterized protein n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G6E9_9RHIZ Length = 301 Score = 344 bits (884), Expect = 1e-93, Method: Composition-based stats. Identities = 104/280 (37%), Positives = 153/280 (54%), Gaps = 6/280 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 L+YRH+FHAGN ADV+KH + + +I LK K+KPF DTHAG GRY L + A RTGE Sbjct: 25 LNYRHAFHAGNFADVVKHALLTRLIAYLKRKEKPFRVFDTHAGRGRYDLNASEASRTGEA 84 Query: 62 LEGIARIWQQDDL--PAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 G+ +I Q L L Y + + + +YPGSPLIAR LRE D L EL Sbjct: 85 QAGVLKIAQSTTLRAEPLLADYFAAI---DPDLREGFYPGSPLIARRCLRETDRLSAYEL 141 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP D LR F D + + DG+ L A LPP +RGL+LID P+E ++ ++SG Sbjct: 142 HPEDGGALRDLFAGDVQVKAISLDGWLALGAHLPPKEKRGLVLIDSPFEKPSEVDDILSG 201 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT-AS 238 + + R+ G+YA+WYP+ R +++++ + + L +E+ + D G+ + Sbjct: 202 LEKALSRWRGGVYAIWYPIKRRALVEKLLTAIAGMAAGEALAVEVRIAADESAEGLFLGT 261 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIV 278 G VINPP+ ++ ++ L L G+ A V + Sbjct: 262 GFAVINPPFVFAEEAKAIVDLLLPALKRDGSATARVFTLT 301 >UniRef50_B4RYZ5 Putative uncharacterized protein n=2 Tax=Alteromonas macleodii RepID=B4RYZ5_ALTMD Length = 292 Score = 344 bits (883), Expect = 2e-93, Method: Composition-based stats. Identities = 112/295 (37%), Positives = 162/295 (54%), Gaps = 19/295 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H+FHAGNHADV+KH +I SLK+KDKPF DTHAGAG Y L + + E Sbjct: 1 MLSYQHAFHAGNHADVIKHLCWIGVINSLKKKDKPFTLFDTHAGAGTYDLNDAMSSKNKE 60 Query: 61 YLEGIARIW----QQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQL 116 Y GI+RI + D LP L+ Y+ + + F Q YPGSP I+ R D+L L Sbjct: 61 YETGISRIINTGAEHDSLPELLKNYLTLCEPFLAKHQ---YPGSPAISATAKRATDNLHL 117 Query: 117 TELHPSDYPLLRSEFQK--DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 ELHP+++ L + K + V K DG++ L+A PP RG ILIDPPYE ++Y Sbjct: 118 MELHPAEFDKLEANMGKLHLRKMHVHKRDGYEGLRALTPPKPNRGAILIDPPYERASEYG 177 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQ------QIKRMIHDLEATGIRKILQIELAVLP 228 V+ G+ + +KR+ +WYP++ + + M L A G + + E+ V Sbjct: 178 EVIKGVEQVFKRWQQAQIVVWYPLLSERAGAKHGASELMCDKLAALG-KPCFKAEICVEK 236 Query: 229 DSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLV---PAGTGHATVSWIVPE 280 ++ GM SG+ V+NPPW+L+ Q+ + L + +L + VSWI + Sbjct: 237 NTPEAGMYGSGVFVLNPPWQLDSQLESALQNVVLQLGAKSSDSSASTHVSWINED 291 >UniRef50_B1ZS65 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZS65_OPITP Length = 297 Score = 341 bits (875), Expect = 2e-92, Method: Composition-based stats. Identities = 110/296 (37%), Positives = 162/296 (54%), Gaps = 17/296 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLG----SEHAER 57 ++YRH FHAGN ADV+KH + +I +L++K+K YLDTHAG G Y LG + ER Sbjct: 1 MNYRHLFHAGNFADVMKHALLIELIGALQKKEKGIFYLDTHAGRGSYDLGLAARGDTLER 60 Query: 58 TGEYLEGIARIWQQD--------DLPAELEAYINVVKHF-----NRSGQLRYYPGSPLIA 104 E+ +GI RI L AY ++V+ F N +G R+YPGSP IA Sbjct: 61 QPEWPDGIGRILAARSTAAADANATGDPLRAYADLVRRFDAERGNTNGSPRFYPGSPAIA 120 Query: 105 RLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILID 164 ++L+R QD L L E P ++ LL +EF + R V DG+ ++A LPP RR L+LID Sbjct: 121 QVLVRRQDRLALCEQVPEEHALLAAEFARAPRTSVHAIDGYVAVRAMLPPPERRALVLID 180 Query: 165 PPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIEL 224 P+E + ++ + + +AEG R G++A+WYP+ R ++ L + L +EL Sbjct: 181 APFEAQDEFARIETALAEGLARLPAGVFAVWYPLTERARVDAFFAGLAERRLPPTLVLEL 240 Query: 225 AVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 AV ++ M G++V+NPPW E+ +L L +L A W+VPE Sbjct: 241 AVAGENSALKMRGCGLVVVNPPWHFERTAAPILEALARELAQAPGAAGRQQWLVPE 296 >UniRef50_A5EWC5 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EWC5_DICNV Length = 288 Score = 339 bits (870), Expect = 6e-92, Method: Composition-based stats. Identities = 114/277 (41%), Positives = 155/277 (55%), Gaps = 6/277 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRHSFHAGN+ADV KH + +K K+KP LY D+HAGAG Y L S HAE+TGE Sbjct: 1 MLSYRHSFHAGNYADVFKHFCLYQTLTFMKRKEKPLLYFDSHAGAGFYDLHSAHAEKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y +GI R++ LP L + N ++ + S Y GSP +A LL E D+LQ ELH Sbjct: 61 YCDGIMRLYAAQQLPPALIEFRNDLRLWLESEN--VYCGSPWLAAHLLGEHDTLQACELH 118 Query: 121 PSDYPLLRSEFQ--KDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVS 178 P+D P L+ + + R+ V + DGF QL A +PP RR LI+IDP YE K+DY AV S Sbjct: 119 PNDAPALQHIIRSIRPRRSFVFQKDGFVQLLASVPPPQRRALIVIDPSYEQKSDYDAVCS 178 Query: 179 GIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEA-TGIRKILQIELAVLPDSDRRGMTA 237 +++ K+FA G Y +W P +LR + + L G R L+ +L V + GM Sbjct: 179 VLSKALKKFAQGCYLIWSPCLLRTEAQDFPQQLAEVIGGRGYLRAQLKVRTE-SALGMYG 237 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATV 274 + +INPP+ L + L L + + Sbjct: 238 CEIHIINPPYLLAPVLQEAGNVLVQILAADKSAQFQL 274 >UniRef50_Q0BPC8 Putative uncharacterized protein n=3 Tax=Acetobacteraceae RepID=Q0BPC8_GRABC Length = 294 Score = 338 bits (869), Expect = 8e-92, Method: Composition-based stats. Identities = 100/271 (36%), Positives = 149/271 (54%), Gaps = 12/271 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN AD KH + ++ +L++K+ PF LDTHAG G L A RTGE+ Sbjct: 1 MNYRHAFHAGNFADCHKHALMVALLTALRQKEAPFFVLDTHAGTGETLLTDGPAARTGEW 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 EGI + DD L +Y+ +V L YPGSPLIAR +LR QD + + ELHP Sbjct: 61 REGIGLLL--DDPAPVLASYLALVTSLGMERSL--YPGSPLIARAMLRPQDRMAVCELHP 116 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPV--------SRRGLILIDPPYEMKTDY 173 D L F+ D + + DG++ L+ LPP RRGL LIDPP+E ++ Sbjct: 117 EDCASLAERFRGDPYCAIHRRDGWKALETMLPPKTASSGGVLPRRGLTLIDPPFEQPDEH 176 Query: 174 QAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRR 233 + + + +RF TG+ A WYP+ + + H L+ G++++L EL + P +D Sbjct: 177 RRLADAMLRAQQRFPTGMVAGWYPIKGGAPARLLRHQLQDAGLKRVLIAELFLHPPTDTT 236 Query: 234 GMTASGMIVINPPWKLEQQMNNVLPWLHSKL 264 + SGM ++NPPW+ ++ L + L Sbjct: 237 RLNGSGMAILNPPWQFGDDARAIMQALKTGL 267 >UniRef50_A1VJI9 Putative uncharacterized protein n=4 Tax=Comamonadaceae RepID=A1VJI9_POLNA Length = 340 Score = 336 bits (862), Expect = 5e-91, Method: Composition-based stats. Identities = 112/337 (33%), Positives = 160/337 (47%), Gaps = 62/337 (18%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M SYRH+FHAGNHADVLKHT +++ L +KD +DTHAGAG Y+L ++ E +GE Sbjct: 1 MFSYRHAFHAGNHADVLKHTCLIALMKYLTQKDTALTVIDTHAGAGLYRLDGDYTETSGE 60 Query: 61 YLEGIARIWQQDDLP----------------------------------------AELEA 80 EGI ++ + L Sbjct: 61 AQEGIFKLLLASKMASAQTGKAGAAIKKVAPAATAKAAPAAPEPAAKPASDYAWAPALLD 120 Query: 81 YINVVKHFNRS-------GQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQK 133 Y+ +++ N L+ YPGSP I + L +D L+L ELHP+D+ L ++ Sbjct: 121 YLELLRSLNPHFAQTGDPAHLKIYPGSPFIEQKFLSGRDKLKLFELHPTDFKSLSGNIEQ 180 Query: 134 ---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATG 190 + V + DGF+ LK LPP +RR ++L DP YEMKTDY V S +A+ KRFATG Sbjct: 181 LGVGRQVVVAREDGFEALKTFLPPPARRAMVLCDPSYEMKTDYLRVSSCMADAVKRFATG 240 Query: 191 IYALWYPVVLRQQIKRMIHDLEA---TGIRKILQIELAVL-----PDSDRR----GMTAS 238 Y +WYP++ R + + L+ R L L V D++ G+ AS Sbjct: 241 TYVVWYPIIPRPEAHDLPRKLKTIAVKAGRSWLNATLTVKSSKLTTDTEGEVVRPGLPAS 300 Query: 239 GMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 GM VINPP L+ ++ LP + + L T+ Sbjct: 301 GMFVINPPHTLKAELQAALPQMVALLGQDRNAGFTLE 337 >UniRef50_D2LG58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LG58_RHOVA Length = 278 Score = 336 bits (862), Expect = 6e-91, Method: Composition-based stats. Identities = 97/283 (34%), Positives = 149/283 (52%), Gaps = 10/283 (3%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN ADV+KH V + ++ L K+ P LD H GAG Y L SE AE+TGE+ Sbjct: 1 MNYRHVFHAGNFADVIKHAVLAFCVDYLLRKESPLCLLDAHGGAGLYDLRSEEAEKTGEW 60 Query: 62 LEGIARIWQQD----DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 G+ + Q LE Y+ +V+ G +YPGSPL+ LR QD L Sbjct: 61 ARGVGAVMQAAGGTASAAEALEPYLRLVREDVADG---FYPGSPLLLARRLRPQDRLIAN 117 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELH S LR + RV AD ++ ++A +PP RRGL+LIDPP+E K +++ ++ Sbjct: 118 ELHESTRGALRGTLAEFPSVRVTGADAYECIRATIPPKERRGLVLIDPPFEEKDEFETLI 177 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 + E KR+ATG++ LWYP+ + + + A G+ + +E + P + Sbjct: 178 RQMREWKKRWATGVFLLWYPIKAVSPLGALKAEAAALGLPRTWCVETLIYPRGRALSLNG 237 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 G+I+ N P+ + + + LP + +W+VP+ Sbjct: 238 CGLILFNAPYSVPEAVEATLPAFADAM---RLHETHTAWLVPD 277 >UniRef50_B1LXQ5 Putative uncharacterized protein n=9 Tax=Alphaproteobacteria RepID=B1LXQ5_METRJ Length = 282 Score = 332 bits (853), Expect = 6e-90, Method: Composition-based stats. Identities = 104/276 (37%), Positives = 155/276 (56%), Gaps = 2/276 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGNHADVLKH V + +++ L+ KDKPF LD AG G Y L ++ A RTGE+ Sbjct: 1 MNYRHAFHAGNHADVLKHLVLARVLDHLRLKDKPFRALDAFAGLGVYDLEADEAARTGEW 60 Query: 62 LEGIARIWQ--QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTEL 119 +G R+ ++ A L Y V YPGSP + R LR D EL Sbjct: 61 RDGWGRMAAPFAPEVEALLAPYRAAVAAVRARHGDTAYPGSPAVIREALRPGDKGVFVEL 120 Query: 120 HPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSG 179 HP+D L+ + +D+R +V DG+ + A++PP RRGL+LIDPPYE+ + + + + Sbjct: 121 HPADAATLQGRYARDARTKVMNLDGWTAINAQIPPPERRGLVLIDPPYEVPGEIERLGAH 180 Query: 180 IAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASG 239 +A ++ TG++ WYP+ + RM+ DL A R L+++L + D +T SG Sbjct: 181 LARAVAKWPTGLFLAWYPIKDTAVLDRMVRDLGAALPRPALRLDLLIDRPGDPTRLTGSG 240 Query: 240 MIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVS 275 +IV+NPPW+L ++ LP L +L G Sbjct: 241 LIVVNPPWRLAEEAMLFLPALAERLARQDFGGFRCD 276 >UniRef50_A5WD58 Putative uncharacterized protein n=3 Tax=Psychrobacter RepID=A5WD58_PSYWF Length = 291 Score = 332 bits (853), Expect = 6e-90, Method: Composition-based stats. Identities = 89/292 (30%), Positives = 153/292 (52%), Gaps = 14/292 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+H++HAGN ADV KH + ++ + +K KP+ LD + G G Y L SE A +TGE Sbjct: 1 MNYKHAYHAGNFADVAKHILLVQLLNQMSKKGKPYYALDAYGGRGLYSLSSEEARKTGEA 60 Query: 62 LEGIARIWQQD--DLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQD----SLQ 115 G+ +I + D + P + Y++ +K ++ YPGSP + + + Sbjct: 61 KAGVQKILEADVSEAPEAVRQYVDDIKQARQTYDKYVYPGSPWWIANHVEKHPEVKVRAE 120 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMK-TDYQ 174 E ++Y L + + + D F+ ++A +PPV RRG+ILIDPPYE + D+ Sbjct: 121 AFEFKNTEYDALNYQLYQLP-IGIHNRDAFEGIRAVIPPVERRGVILIDPPYEQEHKDFT 179 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRG 234 +V + ++ G+YALW+P+ + ++ ++ TGIRK L EL + P+ G Sbjct: 180 RLVELLVASMTKWPQGVYALWFPIKNIEAVELFYKKMKRTGIRKQLLCELNIYPNDVAVG 239 Query: 235 MTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAG------TGHATVSWIVPE 280 + +GM++INPPW+ +Q +L ++ + P + V W+V E Sbjct: 240 LNGTGMLIINPPWQFDQHARQILNFIQPLMRPEDAPDLPQSQAVNVRWLVGE 291 >UniRef50_C8PZ79 Protein involved in catabolism of external DNA n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PZ79_9GAMM Length = 295 Score = 332 bits (852), Expect = 9e-90, Method: Composition-based stats. Identities = 89/296 (30%), Positives = 155/296 (52%), Gaps = 18/296 (6%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y+HS+HAGN ADV+KH + ++E + K KP+ LD + G G Y L S+ A++TGE Sbjct: 1 MNYQHSYHAGNFADVVKHVLLLQLLEMMSAKPKPYYILDAYGGRGLYSLASDEAKKTGEA 60 Query: 62 LEGIARIWQ--QDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQ-------- 111 + GI ++ P ++ Y+ + + + YPGSP + +Q Sbjct: 61 IHGITKLLAQDNSQAPQAVQTYLQDIGYAKKFYDKHVYPGSPWFIAHHIEKQQDAHPEIN 120 Query: 112 DSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMK- 170 + + E S++ L + + V+ + ++ + A LPP +RGLILIDPP+E + Sbjct: 121 NRAEAFEWKASEFDALNYQLHQLP-IGVQHRNAYEGILAVLPPQEKRGLILIDPPFEQEH 179 Query: 171 TDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDS 230 D+ A+V + + +K+++TG+ ALWYP+ ++ ++ T IR+ L +EL + P Sbjct: 180 RDFSALVDLLVKAHKKWSTGVLALWYPIKNNDAVELFYKKMKRTEIRRQLVLELNIFPPD 239 Query: 231 DRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGT------GHATVSWIVPE 280 G+ +GM+VINPPW+ + + +L +L L + V W+V E Sbjct: 240 LPMGLNGTGMLVINPPWQFDAKAEEILQYLQPILQHPESPQMSVEQRTKVQWLVGE 295 >UniRef50_Q21LZ8 Putative uncharacterized protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21LZ8_SACD2 Length = 289 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 103/285 (36%), Positives = 160/285 (56%), Gaps = 8/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY H +HAGN ADV KH L+++ L K+ P Y+DTHAGAG Y L E AE+T E Sbjct: 1 MLSYLHGYHAGNFADVHKHCTLMLLLKKLHAKNTPITYIDTHAGAGLYALDDEKAEKTRE 60 Query: 61 YLEGIARIWQQDD--LPAELEAYINVVKHFNRSGQ----LRYYPGSPLIARLLLREQDSL 114 +G+ + + ++ Y++++ S Q + YPGSP IA+ LLREQD Sbjct: 61 SQQGVDALLASKTGITHSAIKEYLHLLASVRLSKQHTLGEQAYPGSPAIAQALLREQDFG 120 Query: 115 QLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQ 174 L ELH ++ L+ F++D+R + DGF+ L A PP + RGL LIDP YE+ +DY Sbjct: 121 ILMELHNNEVGKLKQHFKRDTRLSIHHRDGFEGLAALTPPSTARGLALIDPSYELTSDYH 180 Query: 175 AVVSGIAEGYKRFATGIYALWYPVVLRQQIKR--MIHDLEATGIRKILQIELAVLPDSDR 232 +++ + R+ TG++A+WYP++ ++ + L + +L EL + + Sbjct: 181 QLITSLQTATARWRTGVFAVWYPILAGEKNHADFIKRKLAQLDVASVLNSELHIYTKEEN 240 Query: 233 RGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 GM SGM +IN PW+L+ ++ ++LP L + L + V W+ Sbjct: 241 DGMIGSGMAIINAPWQLDAELESLLPELETLLAQSSKVKYKVEWL 285 >UniRef50_Q2G473 Putative uncharacterized protein n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G473_NOVAD Length = 276 Score = 329 bits (845), Expect = 5e-89, Method: Composition-based stats. Identities = 94/277 (33%), Positives = 147/277 (53%), Gaps = 4/277 (1%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRHSFHAGN ADV+KH++ ++ +L+ KD +DTHAG G Y L + A+RTGE Sbjct: 1 MNYRHSFHAGNSADVVKHSLLIALVRALQLKDSALTLIDTHAGCGLYDLHGDAAQRTGES 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 +G+ R D L+ Y V+ N + YPGSP I LLR QD+L + E HP Sbjct: 61 AQGVLRALA--DPNPLLDDYRAAVQAVNVGAEPHLYPGSPRILVQLLRPQDALIVNEKHP 118 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LR + + A V + D ++ A LPP + RG++++DPPYE + + + +A Sbjct: 119 EDAYALRGAM-RGTGAAVHERDAYEFWLAMLPPRTPRGVVVVDPPYEQTDERARITATLA 177 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 +++++ G+ +WYP+ R R L GI K L +E + +G+ Sbjct: 178 AAHRKWSHGVTVIWYPLKDRATHVRWKEQLRRLGIPKFLNVEHWLYDADQPGIYNGAGLF 237 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAG-TGHATVSWI 277 ++NPP+ Q + ++L L + L P G G W+ Sbjct: 238 IVNPPYAFTQALPSMLEALRAALAPEGHQGEIAAEWL 274 >UniRef50_B5EL93 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EL93_ACIF5 Length = 285 Score = 328 bits (841), Expect = 1e-88, Method: Composition-based stats. Identities = 101/275 (36%), Positives = 153/275 (55%), Gaps = 8/275 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++Y H +HAGN AD +KH SL +++L KD P Y++THAGAGRY LG++ GE+ Sbjct: 1 MNYDHQYHAGNTADCVKHLALSLTLQTLVRKDSPLAYIETHAGAGRYALGTQ-----GEH 55 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 L+G++R+W A++ +V + N G LR+YPGSP +A LLR D + L E P Sbjct: 56 LQGVSRLWADRRSLPHAGAWLKIVSNENADGTLRHYPGSPALAAALLRPTDRMVLCEEQP 115 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 LR K + V DG++ L ++PP +RGL+LIDPP+E + +++ + + Sbjct: 116 EVATRLRKAIGKRAHTSVVGEDGYRTLFGQIPPPEKRGLVLIDPPFERRDEWERLTDTLI 175 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 Y+R+ G+Y +WYPV +R I R+ L EL +P+ R + SG+I Sbjct: 176 RAYQRWPQGVYLVWYPVKIRGTITRLWQALRERLP--AFACELLQMPEEGREQLFGSGLI 233 Query: 242 VINPPWKLEQQMNNVLPWLHSKL-VPAGTGHATVS 275 V+NPPW L + + L L L P G G ++ Sbjct: 234 VVNPPWGLREALAAALTELGPLLSAPQGGGLWSLR 268 >UniRef50_C8NAD4 Cytoplasmic protein n=34 Tax=Proteobacteria RepID=C8NAD4_9GAMM Length = 280 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 112/282 (39%), Positives = 153/282 (54%), Gaps = 5/282 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH++HAGNHAD+LKH + + + +K KP+ Y+DTHAGAG Y L + +A++ E Sbjct: 1 MLSYRHAYHAGNHADLLKHYLLTRTLAYYNQKPKPYDYIDTHAGAGYYDLTAAYAQKNRE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 Y GIAR+ LPA L A+ + + + YPGS IA LL L L ELH Sbjct: 61 YQSGIARLNAAAHLPAALAAWRDHMHAHQPAPD--TYPGSAWIAARLLPAPGKLHLHELH 118 Query: 121 PSDYPLLRSEFQKDSRA---RVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P+D+ L + +ADGF L A LPP SRR +ILIDPPYE K+DYQ + Sbjct: 119 PADHAALTENLRPLRLGRRLHTHRADGFAGLIALLPPASRRAVILIDPPYEQKSDYQTTL 178 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 +A YKRF +G Y +WYP + R + + L L+ EL V ++ GM Sbjct: 179 DTLAAAYKRFPSGTYLIWYPCLPRDESRHFPAQLNQHFGDNYLRAELHVRAENGAHGMYG 238 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVP 279 SGM +INPP+ L ++ LP L + T+ +P Sbjct: 239 SGMYLINPPYTLPAELKTTLPALRDLCAESADSRITLDARIP 280 >UniRef50_A3JEY3 Protein involved in catabolism of external DNA n=3 Tax=Marinobacter RepID=A3JEY3_9ALTE Length = 287 Score = 325 bits (833), Expect = 1e-87, Method: Composition-based stats. Identities = 109/285 (38%), Positives = 154/285 (54%), Gaps = 12/285 (4%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY H+FHAGN ADV KH L + ++ K DTHAG+ Y L E A +T E Sbjct: 10 MLSYLHAFHAGNFADVHKHAALVLALNMMQAKASGIACTDTHAGSALYDLDDERARKTAE 69 Query: 61 YLEGIARIWQQDDLPAE-----LEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQ 115 GI ++W Q D A L Y+ + N LR YPGSP LR QDSL Sbjct: 70 ADAGIRKLWPQLDSLAAADWQLLRPYL---QQLNSGANLRQYPGSPAWFGHYLRAQDSLG 126 Query: 116 LTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQA 175 + ELHPS+ L +++ R RV + DG L LPP R L+LIDP YE+KTDY A Sbjct: 127 VFELHPSETSSL-NQWASGKRLRVTQQDGLAGLLKVLPPRQPRLLVLIDPSYEVKTDYTA 185 Query: 176 VVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGM 235 V ++ +++ G++ +WYP++ + ++ L A IRKIL+ E+ + RGM Sbjct: 186 VAETLSRAWQKCRHGVFLVWYPILTSGLEQTLLEGLRAGPIRKILRSEVRLHTPP-ERGM 244 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+VINPPW ++++++ ++ L G G + W+ PE Sbjct: 245 VGSGMLVINPPWGMDERLSAMMRDLEPA-ARLGLGQ-QMDWLAPE 287 >UniRef50_A0YHR6 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YHR6_9GAMM Length = 256 Score = 320 bits (822), Expect = 3e-86, Method: Composition-based stats. Identities = 105/258 (40%), Positives = 152/258 (58%), Gaps = 2/258 (0%) Query: 23 SLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLEGIARIWQQDDLPAELEAYI 82 + +K+ F Y+D+HAGAG + L S+ A++ E+ GI+++ D P L + Sbjct: 1 MEALGHFVKKESAFEYVDSHAGAGLFNLASKDAKKLEEHNYGISKLV-ASDFPELL-DFF 58 Query: 83 NVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKA 142 ++ +N+S ++ +YPGSP IA+ LR+QD L ELHP DY L + + RV Sbjct: 59 TAIRAYNKSAKINFYPGSPAIAKHFLRKQDRAWLYELHPQDYKSLCKNVESSKKMRVFCQ 118 Query: 143 DGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQ 202 DG + L++ LPP SRRGLILIDP YE+K++Y+ V YK+F+TG Y +WYPVV R+ Sbjct: 119 DGLKALESVLPPTSRRGLILIDPSYEIKSEYEHVFRACVNAYKKFSTGTYIVWYPVVERR 178 Query: 203 QIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHS 262 Q+ M +GI+ I + EL D+ RGMT+SG+ VINPPW L Q+M+ VLP L + Sbjct: 179 QVDVMEKKFILSGIKNIQRFELGRSADTRERGMTSSGVFVINPPWTLFQKMSAVLPRLAT 238 Query: 263 KLVPAGTGHATVSWIVPE 280 L G +V E Sbjct: 239 ILGDKNDGFFKCDILVAE 256 >UniRef50_Q1QSA4 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QSA4_CHRSD Length = 292 Score = 317 bits (814), Expect = 2e-85, Method: Composition-based stats. Identities = 99/285 (34%), Positives = 147/285 (51%), Gaps = 8/285 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 ML+Y+H++HAGN ADV KH +++ L K Y+DTHAG G Y L +E +R E Sbjct: 11 MLAYQHAYHAGNFADVHKHLTLFAVLQYLLRKSSAITYVDTHAGRGLYPLEAEETQRLRE 70 Query: 61 YLEGIARIWQQDDLPA---ELEAYINVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQL 116 Y +G A +W ++ A L A+ + L +YPGSP REQD L L Sbjct: 71 YRQGAAAVWAAREVLADDSLLAAWCERLGDAQSGASTLSHYPGSPWWLANDCREQDRLAL 130 Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP + L ++ +AR + ADG ++A LPP + R LIDP YE K +Y V Sbjct: 131 FELHPGEATHLEAQVLP-PQARRQHADGLAGIRALLPPATPRFCALIDPSYERKQEYTDV 189 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDR-RGM 235 + + + I +WYP++ + ++ +G+RK+ + EL + P + GM Sbjct: 190 AATLQAVAAKVRHAIVMIWYPLLPSGRHHDLLTAARRSGLRKLWRSELTLHPPGEATHGM 249 Query: 236 TASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SGM+++NPPW ++ Q+N L + S L T W VPE Sbjct: 250 YGSGMLLLNPPWGIDTQLNASLTRVASCLG--DTASHVSQWWVPE 292 >UniRef50_B7QYF1 Protein involved in catabolism of external DNA n=35 Tax=Alphaproteobacteria RepID=B7QYF1_9RHOB Length = 266 Score = 312 bits (801), Expect = 6e-84, Method: Composition-based stats. Identities = 106/268 (39%), Positives = 151/268 (56%), Gaps = 5/268 (1%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGN ADV KH + + ++ L +KDKP YL+THAG G YQL + A +TGE Sbjct: 1 MLSYQHIYHAGNLADVQKHALLARMLAYLTQKDKPLSYLETHAGRGLYQLDAAEAVKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 GI+R+ D L A+ + + YPGSP+IA LLRE DSL ELH Sbjct: 61 AEAGISRLL-NDALLAQDHPLAEAIARTRAAHGAAAYPGSPMIAAHLLREGDSLNFAELH 119 Query: 121 PSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 P + LR + R RV + DGF+ + PP RRG++LIDP YE+K DY + Sbjct: 120 PQENAALRQAMRPHAKGGRVRVHQQDGFELALSLAPPTPRRGMLLIDPSYEIKRDYAQIP 179 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTA 237 IA+ ++++ G+ ALWYP++ K M++ LEA + +L+ E+ P + M Sbjct: 180 GHIAKLHRKWNVGVIALWYPILTDGAHKPMLNALEAQDLPGVLRHEVRFPPAREGHRMVG 239 Query: 238 SGMIVINPPWKLEQQMNNVLPWLHSKLV 265 SGM ++N P+ E + L L S+L Sbjct: 240 SGMFIVNAPYGTEDEAKR-LTKLFSQLG 266 >UniRef50_Q1RK44 ComJ n=12 Tax=Rickettsia RepID=Q1RK44_RICBR Length = 262 Score = 312 bits (800), Expect = 1e-83, Method: Composition-based stats. Identities = 95/264 (35%), Positives = 145/264 (54%), Gaps = 12/264 (4%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH +HAGN AD++KH V I+E LK K+KPF LD AG G Y L SE A +T EY Sbjct: 1 MNYRHIYHAGNFADIVKHLVLIAILEQLKNKEKPFAVLDAFAGLGLYDLASEAASKTLEY 60 Query: 62 LEGIARIWQQDD-LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 GI ++ Q D P L+ +++V+ N GQ +YPGSP I + LLR QD L ELH Sbjct: 61 NNGIGKLLQALDHTPNSLKIFLSVI---NSVGQ-NFYPGSPFIIQQLLRPQDRLIACELH 116 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P+DY L+ ++ D + +KA LP RGLI +DPP+E+K ++Q +++ + Sbjct: 117 PADYLDLKKLLPNNT----HCIDAYNAIKAFLPFKENRGLIFLDPPFEVKNEFQKLITAL 172 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + +WYP+ + H+ + G ++ L IE + S + M G+ Sbjct: 173 KKIKVSALNNSTLIWYPIKDLLLVSDFYHNYKTIGFKETLIIEYEL--LSSDKNMVKCGL 230 Query: 241 IVINPPWKLEQQMNNVLPWLHSKL 264 ++INPP + Q++ + +L L Sbjct: 231 MLINPP-NIRQELEELTKYLSYTL 253 >UniRef50_C6NTA4 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NTA4_9GAMM Length = 290 Score = 309 bits (792), Expect = 8e-83, Method: Composition-based stats. Identities = 95/267 (35%), Positives = 142/267 (53%), Gaps = 7/267 (2%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH HAGN AD LKH SL +E L KD P YL+THAGAGRY L GE+ Sbjct: 1 MNYRHDHHAGNAADCLKHLALSLALERLLHKDAPLFYLETHAGAGRYSLADA-----GEH 55 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 G+ R+W L ++++++ G LR+YPGSP++A LLR D + L E Sbjct: 56 SAGVDRVWAARRQLKGLSPWLDLLEEGAEDGVLRHYPGSPVVAARLLRPGDRMVLAEKVA 115 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 LR R + DG+ L+ LPP RRGLIL+DPP+E + +++A+ I Sbjct: 116 VVRERLRHNLAGRGRTSILGDDGYAILRGHLPPPERRGLILMDPPFERRDEWEALAKAII 175 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 + R+ G +WYP+ +R I R++ L+ ++ +EL + ++ M SG+I Sbjct: 176 GAHARWPQGCQIVWYPIKVRGMISRLLQSLQRALDMEV--VELRLESETGGTSMVGSGLI 233 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAG 268 ++ PPW L +++ L L L G Sbjct: 234 LVRPPWGLRERLLAALAVLGPVLAQGG 260 >UniRef50_B8H3J8 External DNA uptake/catabolism protein n=6 Tax=Caulobacteraceae RepID=B8H3J8_CAUCN Length = 273 Score = 308 bits (790), Expect = 1e-82, Method: Composition-based stats. Identities = 85/268 (31%), Positives = 121/268 (45%), Gaps = 2/268 (0%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH+FHAGN AD+ KH + ++ +L+EK +DTHAGAG Y L E A R+GE Sbjct: 1 MNYRHAFHAGNFADLHKHAILLAMLSALQEKSPALAVIDTHAGAGGYDLAGEMARRSGEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHP 121 GI R+ D PA + +N + N YPGSP + LR D EL Sbjct: 61 QAGIFRLKAAADAPAVFQPLLNAITQMNGGKDGDLYPGSPRLMARALRGADRYVGCELRD 120 Query: 122 SDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIA 181 D LLR + AR +ADGF R I+IDPP+E DY +V+ Sbjct: 121 DDADLLRKTLAPCANARALQADGFDTAVKDA-GKGGRAFIVIDPPFERPDDYDRIVATTR 179 Query: 182 EGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGMI 241 R A+W P+ + + + T +L EL + P +D M M+ Sbjct: 180 AVLARAPDAALAIWLPIKDLETFDAFLRAM-ETVTSDLLVAELRLRPLTDPMKMNGCAMV 238 Query: 242 VINPPWKLEQQMNNVLPWLHSKLVPAGT 269 +I P +E W+ ++L G Sbjct: 239 MIGAPPSVEDAAVAAGDWIATRLGEPGG 266 >UniRef50_Q73R01 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73R01_TREDE Length = 279 Score = 300 bits (770), Expect = 2e-80, Method: Composition-based stats. Identities = 86/277 (31%), Positives = 153/277 (55%), Gaps = 17/277 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSYRH FHAGN ADV KH+ ++ +K KPF D +AG+ Y L SE + +TGE Sbjct: 1 MLSYRHGFHAGNQADVFKHSALFSFLKVYTQKQKPFTAFDLNAGSASYNLLSEWSLKTGE 60 Query: 61 YLEGIAR---IWQQDDLP----AELEAYIN-VVKHFNRSGQLRYYPGSPLIARLLLREQD 112 EGI R +++++ LP +AY++ +K+++ + Y GSP I R L+++ Sbjct: 61 AEEGIIRFLDLYKKEKLPLPIPEGFKAYLDFCLKNYDENSS---YAGSPEIIRSFLQKES 117 Query: 113 SLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTD 172 +L L +LH ++ L+ +++ V K D ++ ++A PP+ RG L DP YE+ +D Sbjct: 118 NLILCDLHSAEAEKLKELYKRVENVHVHKRDCYEAVRALTPPLPIRGFALFDPSYEVDSD 177 Query: 173 YQAVVSGIAEGYKRFATGIYALWYPVVL--RQQIKRMIHDLEATGIRKILQIELAVLPD- 229 Y A+ + + K++ GI+ +WYP++ ++ + + + K+L IE+ + Sbjct: 178 YTAIAESVEKVCKKWPIGIFIIWYPILNHKTEECRNLKDRISKAMNNKVLNIEVKHFSNK 237 Query: 230 ---SDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSK 263 + G+ SG+++ NPPW LE+++ + ++ Sbjct: 238 IDSENEYGLQGSGLLITNPPWGLEEKLKEICEYVEKV 274 >UniRef50_UPI0000E1171F protein involved in external DNA uptake n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E1171F Length = 288 Score = 294 bits (753), Expect = 2e-78, Method: Composition-based stats. Identities = 99/290 (34%), Positives = 150/290 (51%), Gaps = 19/290 (6%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H +HAGNHAD++KH ++ L +K+KP +DTHAGAG Y L + A+ E Sbjct: 1 MLSYQHIYHAGNHADLIKHLTLLSVLLKLGQKNKPCTLIDTHAGAGEYDLSATKAQHNNE 60 Query: 61 YLEGIARIWQQ---DDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLT 117 L GI + + A L AY + S + Y GS + LREQD Sbjct: 61 SLTGIGMLDEAFFSQTDSALLHAYGEGLYTGVVSDK---YCGSAGWMQRYLREQDQAHFC 117 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 ELHP+ YP L + K A + DGF+QL A +PP+++RG++L+DPPYE ++Y V+ Sbjct: 118 ELHPNVYPELLNYVYK-PNAHCYQEDGFKQLIALVPPLAKRGIVLVDPPYEQASEYSMVL 176 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIK------RMIHDLEATG----IRKILQIELAVL 227 I + KR+ATG Y +WYP++ Q RM+ + + I+ Sbjct: 177 DVIEKSLKRWATGCYLIWYPMINTQNTNKAQAAIRMLKGFNTLADEHSVSNMANIQWRYD 236 Query: 228 PDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 +D +GM SG+I IN PW + ++++ + L + ++ WI Sbjct: 237 TTNDAQGMYGSGIIAINLPWGCDNEISDAM--LSIQQSKMTQAAFSLEWI 284 >UniRef50_A3VP01 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VP01_9PROT Length = 262 Score = 286 bits (733), Expect = 6e-76, Method: Composition-based stats. Identities = 83/257 (32%), Positives = 131/257 (50%), Gaps = 10/257 (3%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H++HAGN AD+ KH V ++ L +K + LDTHAG G Y L A++TGE Sbjct: 1 MLSYQHAYHAGNRADLHKHAVWCALLAHLTQKSRGLTILDTHAGRGLYDLAGAEAQKTGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +G A + D L + V YPGSPL++ R QD + L E H Sbjct: 61 ASDGAAAV--SLDGSHALG---SAVAACRAQYGEMAYPGSPLLSLHFARPQDQVILMEKH 115 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P + L++ + +A V DG++ A PP R+GL++IDP YE+KT+YQ V + Sbjct: 116 PQEGAALKTVM-RGKKAAVHLRDGYEGALALAPPTPRKGLVMIDPSYEVKTEYQNVALFL 174 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 ++ LWYP++ ++ + M+ L + E+ + + M SG+ Sbjct: 175 PTLIDKWPEASVLLWYPILAAKRHEAMLDTLSPMQP---WRHEV-LFTEDSLLRMKGSGL 230 Query: 241 IVINPPWKLEQQMNNVL 257 ++I+PP+ E ++ L Sbjct: 231 VLISPPYGGEGAIDAAL 247 >UniRef50_C5SM30 Putative uncharacterized protein n=2 Tax=Caulobacteraceae RepID=C5SM30_9CAUL Length = 284 Score = 274 bits (701), Expect = 2e-72, Method: Composition-based stats. Identities = 78/280 (27%), Positives = 123/280 (43%), Gaps = 16/280 (5%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 ++YRH FHAGN AD+ KH V + +L+E +P +D+HAGAG+Y L R+ E Sbjct: 1 MNYRHGFHAGNFADLFKHAVLLNFLRALRESAQPLQVVDSHAGAGQYDLSDPTFSRSKEA 60 Query: 62 LEGIARIWQQDDLPAELEAYINVVKHFNRSGQLR----YYPGSPLIARLLLREQDSLQLT 117 GI + D+P L + V NR+ + YPGSPL+ L + S Sbjct: 61 EAGIGYLLG-GDVPQSLIPLSDYVWAKNRAAGFKTRIGLYPGSPLLVLDHLTAEGSYMGC 119 Query: 118 ELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVV 177 EL DY LR+ +AR DG++ P + +LIDPP+E DY+ + Sbjct: 120 ELRKDDYERLRATVMPRGKAR--HTDGYEAAVEMAEP-DKDFFLLIDPPFEQFEDYERIN 176 Query: 178 SGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLE--------ATGIRKILQIELAVLPD 229 + + K+ T +W P+ + R + +E G I EL + P Sbjct: 177 LCLRDVLKKQPTAKALVWLPLKDLETFDRFLRHMECELLEDQTGEGGPDIAVAELRLRPL 236 Query: 230 SDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGT 269 ++ M ++ +N P + + M ++ L G Sbjct: 237 TNPLKMNGCALVTVNAPASVVEAMRDIADDLAQVFAEPGG 276 >UniRef50_B7VU08 Putative uncharacterized protein n=4 Tax=Vibrionales RepID=B7VU08_VIBSL Length = 288 Score = 269 bits (689), Expect = 7e-71, Method: Composition-based stats. Identities = 78/292 (26%), Positives = 142/292 (48%), Gaps = 20/292 (6%) Query: 2 LSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEY 61 + YRH H G+H D LKH V S +++SL ++ +DTH+G G Y L + + GE+ Sbjct: 1 MEYRHQCHVGDHGDALKHPVLSALVQSLMQQHSRLNVIDTHSGTGCYDLTTAPSNHAGEF 60 Query: 62 LEGIARIWQQDD-LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 EG+ +W+ LP ++++V++++N + + YPGS I R QDS +++ Sbjct: 61 AEGVGYLWRNKAYLPPAFASFMSVLEYYNPNQLISLYPGSAAITYQQGRSQDSFYFSDIQ 120 Query: 121 PSDYPLLRSE---FQKD----SRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDY 173 + LL++ Q+D S+ + DG + L + LI+IDPPYE ++Y Sbjct: 121 QDEADLLQTNIETLQRDLDVSSKLTITAGDGLKALPDDVAKHDNHHLIVIDPPYETDSEY 180 Query: 174 QAVVSGIAEGYKRFATGIYALWYP--------VVLRQQIKRMIHDLEATGIRKILQIELA 225 AV+ + + Y++ +WYP ++L + + L + I+ EL Sbjct: 181 LAVIDALVKAYQQSEKVSALIWYPLYTDDKSSLILNHCVTAVKDGLLPSPIKS----ELR 236 Query: 226 VLPDSDRRGMTASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWI 277 + + SG+++ NPP + + + L +LH +L G G+ + + Sbjct: 237 LRDPKGDDRLIGSGLLLFNPPQGISGIVADTLDYLHCQLSTNGEGYWQMRSL 288 >UniRef50_Q0C0F5 Putative uncharacterized protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0C0F5_HYPNA Length = 262 Score = 256 bits (654), Expect = 7e-67, Method: Composition-based stats. Identities = 76/267 (28%), Positives = 120/267 (44%), Gaps = 6/267 (2%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 MLSY+H FHAGN ADVLKH V ++ + +P Y++TH+G GRY L + A + GE Sbjct: 1 MLSYQHGFHAGNRADVLKHAVLDTLLRAAATGPRPLFYVETHSGHGRYDLTNAQARKRGE 60 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +G+ + + P L ++ +V + YPGSP +A+ LL + + L ELH Sbjct: 61 SDDGVLALM-KGKPPKPLSGWMELVNA----RGEKDYPGSPALAQTLLPKHARMMLFELH 115 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 P + L + D R R++KADG+ P + ++L+DP YE D +A+ Sbjct: 116 PQENAALTEAMKGDDRIRIQKADGYAGALKLAPRAGEQMVVLVDPSYETHRDIEALALWT 175 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 + KR+ + LW P+ + L G I+ V + S M Sbjct: 176 PKALKRWPGALLILWLPLFRDGREAEFGEYLATLGDAMIVGARWPV-ALGTESSIEGSAM 234 Query: 241 IVINPPWKLEQQMNNVLPWLHSKLVPA 267 + P + + + L S Sbjct: 235 VAFGAPAEARAKCEAIASSLESYWAQQ 261 >UniRef50_B7G053 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G053_PHATR Length = 267 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. Identities = 83/270 (30%), Positives = 133/270 (49%), Gaps = 21/270 (7%) Query: 4 YRHSFHAGNHADVLKHTV-QSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYL 62 Y+H HAGNH DVLKH V ++ + E L + + +D HAG G Y L + +G++ Sbjct: 7 YQHLKHAGNHCDVLKHVVFRACVQEQLNVHENGIILVDCHAGEGLYDLSK---QTSGDFE 63 Query: 63 EGIARIWQQDD--LPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 G+AR+ Q D P + Y+N ++ +++YPGSP++ LLREQD +L +L+ Sbjct: 64 RGVARVVQNLDQTAPPAVHDYVNAIQE--ADEYMQFYPGSPMLGAKLLREQDEHRLVDLY 121 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 D L+ +A V +AD + L R +ILIDPPY + D+ Sbjct: 122 VEDVEGLKDGALFW-QADVFEADAVEFLVPN--DDDRHKVILIDPPYLDQEDFYRAKVLT 178 Query: 181 AEGYKRFATGIYALWYPVVLRQQIK----RMIHDLEATGIR-KILQIELAVLPDSDRRGM 235 R LWYP++ + + + + I D+ + I Q L V D+ G+ Sbjct: 179 ERILDRDPYCTILLWYPMIQKSRWRYGYAKSIKDMAKKKAKLGIYQAWLTV----DKEGL 234 Query: 236 TASGMIVINPPWKLEQQM-NNVLPWLHSKL 264 SGMIV+NP + ++ + + + WL + L Sbjct: 235 QGSGMIVVNPTQRFDEIVDEDTIDWLSATL 264 >UniRef50_UPI0001909543 putative DNA methylase protein n=1 Tax=Rhizobium etli IE4771 RepID=UPI0001909543 Length = 171 Score = 214 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 62/164 (37%), Positives = 91/164 (55%) Query: 117 TELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAV 176 ELHP DY L F+ D AR+ + DG+ L A LPP +RG++L+DPP+E + +YQ + Sbjct: 1 MELHPEDYARLHRLFEGDHHARITELDGWLALGAHLPPKEKRGIVLVDPPFEEEDEYQRL 60 Query: 177 VSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMT 236 G+ Y+RF G Y LWYP+ IK L+A I K+L EL V D G+T Sbjct: 61 AKGLERAYRRFPGGTYCLWYPLKKGAPIKEFHETLQALDIPKMLCAELTVRSDRGTTGLT 120 Query: 237 ASGMIVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE 280 SG++++NPP+ L+ +++ +LP L L W+ E Sbjct: 121 GSGLVIVNPPFTLKDELHQMLPALKDHLAQDRFASQRAFWLRGE 164 >UniRef50_Q2BH49 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BH49_9GAMM Length = 251 Score = 199 bits (508), Expect = 6e-50, Method: Composition-based stats. Identities = 68/241 (28%), Positives = 102/241 (42%), Gaps = 19/241 (7%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M Y HS +AG ADV+KH + ++ L D Y++THAGAG Y L + GE Sbjct: 1 MAKYLHSKYAGGDADVMKHACLASVLSKL---DISVEYVETHAGAGLYDLDPDR----GE 53 Query: 61 YLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELH 120 +L+GI R L+ Y V++ + + + YP SP+IA SL L EL+ Sbjct: 54 HLKGIGRCRSNLTDLPALKPYNGVLEE-SWTLDKKIYPASPIIANSA-SAVKSLCLYELN 111 Query: 121 PSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGI 180 S L+ A V + DGF S +LIDPPY+ DYQ VV + Sbjct: 112 RSVACQLKKNL---PEAVVWEEDGFLSRHHL----SHGSFVLIDPPYKSSDDYQQVVEYV 164 Query: 181 AEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM 240 K+ +W+P++ + L ++L+ M+ G+ Sbjct: 165 GAA-KQSQVRAVMVWFPMIYSDLTADLYDGLLGLYPDGQW-LQLS-RGLHSEGAMSGFGV 221 Query: 241 I 241 Sbjct: 222 F 222 >UniRef50_A0B718 Protein involved in catabolism of external DNA-like n=1 Tax=Methanosaeta thermophila PT RepID=A0B718_METTP Length = 246 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 72/185 (38%), Gaps = 17/185 (9%) Query: 4 YRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLE 63 Y H HAGN DV KH + S L + +Y ++HAG Y L GE+ Sbjct: 2 YDHREHAGNAGDVWKHFLLSEAAAYLLCR-SDLVYAESHAGYTAYTLAP-----NGEWRW 55 Query: 64 GIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPSD 123 GI R W Y V++ N L+ YPGS I L R + EL Sbjct: 56 GIGRCWHLRSEIES--PYFAVLEEMN-DEHLQIYPGSAKIILRLGRFFRRRVVAELWDIS 112 Query: 124 YPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRR--GLILIDPPYEMKTDYQAVVSGIA 181 + +S + DGF + L +RR GL+LIDPP D + + Sbjct: 113 EDVGKS-WSACPDIHFHLGDGFSGVMDLL---NRRDPGLLLIDPP--SPDDQDKAIELLK 166 Query: 182 EGYKR 186 + R Sbjct: 167 DASDR 171 >UniRef50_A3I4J5 Methyltransferase n=3 Tax=Bacillaceae RepID=A3I4J5_9BACI Length = 197 Score = 73.2 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 44/114 (38%), Gaps = 13/114 (11%) Query: 88 FNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQKDS---RARVEKAD 143 F+ L + GS + L R + E + +L+ +K + + D Sbjct: 49 FDGGTALDLFAGSGGLGIESLSRGAERAIFIEKDAKAFQVLQENIKKCRYEEHTELFRID 108 Query: 144 GFQQLKAKLPPVSRR----GLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYA 193 + +KA L +R L+ +DPPY K +Y +V + + K GI Sbjct: 109 AKRAVKALL----KRDITFSLVFLDPPYHQK-EYYDLVQLLVDNEKIQQNGIIL 157 >UniRef50_C5D8K2 Methyltransferase n=85 Tax=Bacillales RepID=C5D8K2_GEOSW Length = 189 Score = 47.8 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 36/91 (39%), Gaps = 5/91 (5%) Query: 87 HFNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQKDS---RARVEKA 142 +F+ L + GS + L R D + + ++ +A + + Sbjct: 39 YFSGGMGLDLFSGSGGLGIEALSRGLDRVIFVDHDAKAVQTVKKNVATCRLLEQAEIYRN 98 Query: 143 DGFQQLKAKLPPVSRRGLILIDPPY-EMKTD 172 D + L+A + R LI +DPPY E K + Sbjct: 99 DAERALRAIIKRGLRFHLIFLDPPYKEQKLE 129 >UniRef50_C9RZH2 Methyltransferase n=5 Tax=Bacillaceae RepID=C9RZH2_GEOSY Length = 198 Score = 47.8 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 4/89 (4%) Query: 87 HFNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQKD---SRARVEKA 142 +F+ L + GS + L R + + + +R RA + Sbjct: 39 YFSGGNGLDLFAGSGGLGIEALSRGIERVIFVDHDRKAVQTVRKNVAACGLEKRAEIYCN 98 Query: 143 DGFQQLKAKLPPVSRRGLILIDPPYEMKT 171 D + LKA R +I +DPPY+ K Sbjct: 99 DAERALKAVAKRGLRFAVIFLDPPYKEKQ 127 >UniRef50_Q1AW93 Putative uncharacterized protein n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AW93_RUBXD Length = 177 Score = 46.6 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 39/153 (25%), Positives = 58/153 (37%), Gaps = 33/153 (21%) Query: 58 TGEYLEGIARIWQQDDLPAELEAYINVVKH--FNRSGQ-------LRYYPGSPLI-ARLL 107 +GE G+ R+ +P + + V+ FN GQ L Y G+ + L Sbjct: 3 SGEAR-GV-RLAP---VPPGVRPTSDRVRESLFNALGQFFEGGEVLDLYAGTGALGIEAL 57 Query: 108 LREQDSLQLTELHPSDYPLLRSEFQK---DSRARVEKADGFQQLKAKLPPVSRRG----L 160 R D E P +R ++ + RARV D ++++ L R G L Sbjct: 58 SRGCDRAVFVEKSPRAAAAIRENLRRTGLEGRARVVVGDAVREMERLL----RDGKVFNL 113 Query: 161 ILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYA 193 I DPPY + AEG R A + A Sbjct: 114 IFADPPY-------RIAPAGAEGLLRRAEALLA 139 >UniRef50_C4L5T0 Methyltransferase n=2 Tax=Bacillales RepID=C4L5T0_EXISA Length = 187 Score = 45.9 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 32/89 (35%), Gaps = 4/89 (4%) Query: 87 HFNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQKDS---RARVEKA 142 +F+ L Y GS + L R D + P ++ + + +V + Sbjct: 39 YFSGGKALDLYAGSGGLGIEALSRGCDEAIFVDRQPKAVQTIQENLRATHYEVKGKVYRQ 98 Query: 143 DGFQQLKAKLPPVSRRGLILIDPPYEMKT 171 D L+ + LI +DPPY + Sbjct: 99 DAKAVLEQLKVQQEQFKLIFMDPPYHAEE 127 >UniRef50_C7I1R0 Methyltransferase n=1 Tax=Thiomonas intermedia K12 RepID=C7I1R0_THIIN Length = 215 Score = 45.1 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 29/119 (24%), Positives = 44/119 (36%), Gaps = 16/119 (13%) Query: 65 IARIWQQDDLP----AELEAYINVVKH--FNRSGQ-------LRYYPGSPLI-ARLLLRE 110 I +W++ L L + V+ FN GQ L Y GS + R Sbjct: 28 IGGLWKRSKLAVPDLPGLRPTPDRVRETVFNWLGQTLAGLRVLDLYAGSGALGLEAASRG 87 Query: 111 QDSLQLTELHPSDYPLLRSEFQK--DSRARVEKADGFQQLKAKLPPVSRRGLILIDPPY 167 S+ L E HP + + Q+ ++ +V AD R ++ IDPPY Sbjct: 88 AASVLLIEQHPRCVAAIAAAAQRLGATQVQVRGADALSCAHGLARAGERFDIVFIDPPY 146 >UniRef50_C1ACG3 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ACG3_GEMAT Length = 192 Score = 44.7 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 43/123 (34%), Gaps = 18/123 (14%) Query: 82 INVVKHFNRSGQ-LRYYPGSPLIARLLLREQDSLQ-LTELHPSDYPLLRSEFQKDS---R 136 + +V+ + + + G+ I L E PS L++ + Sbjct: 33 MKLVRADLEGARVIDLFAGTGAIGLEALSRGAKYVDFVEFRPSSLHALKANIAALRVTTK 92 Query: 137 ARVEKADGFQQLKAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYALWY 196 ARV K D A + R L +DPPYE + + +R+ A W Sbjct: 93 ARVYKKDALPFANALI--AGRYDLAFVDPPYESR--------MLDRLIERWLE---APWS 139 Query: 197 PVV 199 P++ Sbjct: 140 PIL 142 >UniRef50_C0GHJ6 Methyltransferase n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GHJ6_9FIRM Length = 198 Score = 43.9 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 39/119 (32%), Gaps = 20/119 (16%) Query: 78 LEAYINVVKHFNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEF---QK 133 L Y++ + L + G+ + L R D E + +++ Sbjct: 36 LTPYLS------GAEMLDVFAGNGGVGIEALSRGADRCVFVEKNAQCAKIIKDNLILTGL 89 Query: 134 DSRARVEKADGFQQLKAKLPPVSRRG-LILIDPPYEMKTDYQAVVSGIAEGYKRFATGI 191 R + D L + L R +I +DPPY +A+ ++ A G Sbjct: 90 ADRGEILPRDALGAL-SLLQKRENRFNIIFLDPPYHSPE--------LADVLRKIAQGC 139 >UniRef50_D0THA4 Predicted protein n=14 Tax=Bacteroides RepID=D0THA4_9BACE Length = 69 Score = 43.5 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 12/65 (18%) Query: 1 MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGE 60 M++Y H G DVLKH V +++ +KP +Y++T++ Y + T E Sbjct: 1 MVTYTHF---GKQPDVLKHLVLCEVLQI----EKPQIYVETNSACAIYTMT-----HTPE 48 Query: 61 YLEGI 65 GI Sbjct: 49 QEYGI 53 >UniRef50_A6Q3P8 DNA methylase n=3 Tax=Epsilonproteobacteria RepID=A6Q3P8_NITSB Length = 193 Score = 43.2 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 35/96 (36%), Gaps = 6/96 (6%) Query: 99 GSPLIARLLL-REQDSLQLTELHPSDYPLLRSEFQKDSR--ARVEKADGFQQLKAKLPPV 155 GS + L R + E + Y +L+ V D F+ L + + Sbjct: 57 GSGSVGLEALSRGAKRVYFIEKNRDSYKVLKKNVHNCDESSCSVRYGDAFELLWDVIEEL 116 Query: 156 SR---RGLILIDPPYEMKTDYQAVVSGIAEGYKRFA 188 R + DPP+ ++ Y+ + +A+ K+ Sbjct: 117 KRNKEKAYFYFDPPFSIREGYEDIYDEVAQTIKKIP 152 >UniRef50_C8NLD5 N6-adenine-specific methylase n=20 Tax=Corynebacterium RepID=C8NLD5_COREF Length = 202 Score = 41.6 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 38/106 (35%), Gaps = 11/106 (10%) Query: 94 LRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEF----QKDSRARVEKADGFQQL 148 L + GS + R D + L E H ++R+ D KA + Sbjct: 50 LDLFAGSGALGLEAASRGADEVVLVESHSKAAQIIRANAGVVNHPDVHVVEMKASTYLA- 108 Query: 149 KAKLPPVSRRGLILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYAL 194 P + ++L DPPYE+ + +VV + + G + Sbjct: 109 ---TAPDAHFTMVLADPPYELADE--SVVEMLHALTPKLLDGAVVV 149 >UniRef50_Q0VLD7 Putative uncharacterized protein n=2 Tax=Alcanivorax RepID=Q0VLD7_ALCBS Length = 210 Score = 41.2 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 29/77 (37%), Gaps = 6/77 (7%) Query: 94 LRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQK--DSRARVEKADGFQQLKA 150 L + GS + A L R L E LR + + R V+ AD L Sbjct: 73 LDLFAGSGALGAEALSRGACEAVLVEKQRERSADLRRQLTPLFEGRLTVQCADALSWLPT 132 Query: 151 KLPPVSRRGLILIDPPY 167 + P L+ IDPPY Sbjct: 133 QRQPFD---LVFIDPPY 146 >UniRef50_D1C502 Putative uncharacterized protein n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C502_SPHTD Length = 194 Score = 41.2 bits (96), Expect = 0.037, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 38/115 (33%), Gaps = 18/115 (15%) Query: 88 FNRSGQLRYYPGSPLI-ARLLLREQDSLQLTELHPSDYPLLRSEFQKDS---RARVEKAD 143 L Y GS I L R + E +P+ ++R RA V + Sbjct: 41 VRPRRVLDLYAGSGGIGIEALSRGAEWCDFVEQNPAACAVIRDNLASTRFTDRAAVHQT- 99 Query: 144 GFQQLKAKLP-PVSRRGLILIDPPYEMK---------TDYQAVVSGIAEGYKRFA 188 +++ L P+ L+++DPPY + AV G + Sbjct: 100 ---TVQSFLSRPLEPYDLVVMDPPYADPHILQTMQRVAESGAVAEGTILALGHWP 151 >UniRef50_C9RAL4 Methyltransferase n=1 Tax=Ammonifex degensii KC4 RepID=C9RAL4_AMMDK Length = 189 Score = 41.2 bits (96), Expect = 0.037, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 35/122 (28%), Gaps = 17/122 (13%) Query: 59 GEYLEGIARIWQQDDLPAELEAYINVVKHFNRSGQLRYYPGSPLI----------ARLLL 108 GE R +L VK + PGS + L Sbjct: 6 GEAK----RCRLATLKGKDLRPTSERVKEALFNILASQVPGSRFLDLFAGTGGVGIEALS 61 Query: 109 REQDSLQLTELHPSDYPLLRSEFQKDS---RARVEKADGFQQLKAKLPPVSRRGLILIDP 165 R E P L+R ++ RARV D L R L+ IDP Sbjct: 62 RGAKFAVFVERDPRAVKLIRENLERTGLSNRARVYGRDVLSLLPYLARKKERFDLVYIDP 121 Query: 166 PY 167 PY Sbjct: 122 PY 123 >UniRef50_Q98DH2 Mlr4706 protein n=1 Tax=Mesorhizobium loti RepID=Q98DH2_RHILO Length = 302 Score = 40.5 bits (94), Expect = 0.069, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 54/175 (30%), Gaps = 41/175 (23%) Query: 28 SLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLEGIARIWQQDDLPAELEAYINVVKH 87 +L K Y+D AG G ERT I DL LE ++ Sbjct: 29 ALHSKFPELWYIDAFAGTG---------ERTVRVAGAI-----ATDLLPALE---KRIER 71 Query: 88 FNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLR--SEFQKDSRARVEKADGF 145 GS IA + + LR + D + + D Sbjct: 72 RR---------GSARIALDITPHFSRYIFMDKMRRHCAALRCLANEYPDRSIDIVRGDAN 122 Query: 146 QQLKAKLPPV---SRRGLILIDPPYEMKTDYQ-AVVSGIAEGYKRFATGIYALWY 196 + +KA+L +R ++ +DP Y V E ++ T +WY Sbjct: 123 EAIKAELASQRWVGKRAVMFLDP-------YGMDVAWSTLEAIRK--TEAIDVWY 168 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.171 0.606 Lambda K H 0.267 0.0520 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,133,309,109 Number of Sequences: 3077464 Number of extensions: 103523033 Number of successful extensions: 248287 Number of sequences better than 1.0e-01: 94 Number of HSP's better than 0.1 without gapping: 217 Number of HSP's successfully gapped in prelim test: 31 Number of HSP's that attempted gapping in prelim test: 247446 Number of HSP's gapped (non-prelim): 271 length of query: 280 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 153 effective length of database: 649,558,428 effective search space: 99382439484 effective search space used: 99382439484 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 93 (40.1 bits)